• Rémi Denis-Courmont's avatar
    lavc/vp7dsp: add R-V V vp7_idct_dc_add4uv · 121fb846
    Rémi Denis-Courmont authored
    This is almost the same story as vp7_idct_add4y. We just have to use
    strided loads of 2 64-bit elements to account for the different data
    layout in memory.
    
    T-Head C908:
    vp7_idct_dc_add4uv_c:       7.5
    vp7_idct_dc_add4uv_rvv_i64: 2.0
    vp8_idct_dc_add4uv_c:       6.2
    vp8_idct_dc_add4uv_rvv_i32: 2.2 (before)
    vp8_idct_dc_add4uv_rvv_i64: 2.0
    
    SpacemiT X60:
    vp7_idct_dc_add4uv_c:       6.7
    vp7_idct_dc_add4uv_rvv_i64: 2.2
    vp8_idct_dc_add4uv_c:       5.7
    vp8_idct_dc_add4uv_rvv_i32: 2.5 (before)
    vp8_idct_dc_add4uv_rvv_i64: 2.0
    121fb846
vp7dsp_rvv.S 4.59 KB