1. 08 Nov, 2023 5 commits
  2. 07 Nov, 2023 1 commit
  3. 06 Nov, 2023 3 commits
    • Rémi Denis-Courmont's avatar
      lavc/opusdsp: rewrite R-V V postfilter · adc87a5f
      Rémi Denis-Courmont authored
      This uses a more traditional approach allowing up processing of up to
      period minus two elements per iteration. This also allows the algorithm
      to work for all and any vector length.
      
      As the T-Head C908 device under test can load 16 elements loop, there is
      unsurprisingly a little performance drop when the period is minimal and
      the parallelism is capped at 13 elements:
      
      Before:
      postfilter_15_c:         21222.2
      postfilter_15_rvv_f32:   22007.7
      postfilter_512_c:        20189.7
      postfilter_512_rvv_f32:  22004.2
      postfilter_1022_c:       20189.7
      postfilter_1022_rvv_f32: 22004.2
      
      After:
      postfilter_15_c:         20189.5
      postfilter_15_rvv_f32:    7057.2
      postfilter_512_c:        20189.5
      postfilter_512_rvv_f32:   5667.2
      postfilter_1022_c:       20192.7
      postfilter_1022_rvv_f32:  5667.2
      adc87a5f
    • Rémi Denis-Courmont's avatar
      lavc/pixblockdsp: rework R-V V get_pixels_unaligned · 02594c8c
      Rémi Denis-Courmont authored
      As in the aligned case, we can use VLSE64.V, though the way of doing so
      gets more convoluted, so the performance gains are more modest:
      
      get_pixels_unaligned_c:       126.7
      get_pixels_unaligned_rvv_i32: 145.5 (before)
      get_pixels_unaligned_rvv_i64:  62.2 (after)
      
      For the reference, those are the aligned benchmarks (unchanged) on the
      same T-Head C908 hardware:
      
      get_pixels_c:                 126.7
      get_pixels_rvi:                85.7
      get_pixels_rvv_i64:            33.2
      02594c8c
    • Rémi Denis-Courmont's avatar
      lavc/sbrdsp: R-V V sbr_hf_g_filt · f68ad5d2
      Rémi Denis-Courmont authored
      hf_g_filt_c:      1552.5
      hf_g_filt_rvv_f32: 679.5
      f68ad5d2
  4. 05 Nov, 2023 5 commits
  5. 04 Nov, 2023 18 commits
  6. 03 Nov, 2023 6 commits
  7. 02 Nov, 2023 2 commits