1. 19 Jun, 2024 9 commits
  2. 18 Jun, 2024 25 commits
  3. 17 Jun, 2024 1 commit
    • aarch64: Use cntvct_el0 as timer register on Android and macOS · ab8f7030
      Martin Storsjö authored
      The default timer register, pmccntr_el0, usually requires access to
      be enabled first, e.g. with a kernel module (on Windows it is
      accessible by default). On Linux, however, checkasm benchmarks
      default to perf (if suitable headers are available).
      
      On macOS, using cntvct_el0 gives measurements with the same
      magnitude as mach_absolute_time (which is used currently), but
      possibly with a little less overhead/noise.
      Signed-off-by: Martin Storsjö <martin@martin.st>
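To illustrate the register choice described in this commit message, here is a minimal sketch of reading cntvct_el0 from user space. It is not the code from the commit: `read_timer` and `timer_delta` are hypothetical helper names, the `isb` barrier before the read is an assumption about how one would typically serialize the measurement, and the `clock_gettime` fallback exists only so the sketch also builds on non-aarch64 hosts.

```c
#define _POSIX_C_SOURCE 199309L

#include <assert.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical helper (not FFmpeg's actual macro): return a
 * monotonically increasing tick count. */
static inline uint64_t read_timer(void)
{
#if defined(__aarch64__) && (defined(__GNUC__) || defined(__clang__))
    uint64_t ticks;
    /* cntvct_el0 is the virtual counter register. Unlike pmccntr_el0,
     * it normally needs no kernel module to be readable from user
     * space. The isb is an assumption: it keeps earlier instructions
     * from drifting past the read. */
    __asm__ volatile("isb\n\tmrs %0, cntvct_el0"
                     : "=r"(ticks) :: "memory");
    return ticks;
#else
    /* Fallback for other architectures, in nanoseconds. */
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return (uint64_t)ts.tv_sec * 1000000000ULL + (uint64_t)ts.tv_nsec;
#endif
}

/* Hypothetical helper: elapsed ticks between two reads. */
static inline uint64_t timer_delta(uint64_t start, uint64_t stop)
{
    assert(stop >= start); /* the counter is expected not to go backwards */
    return stop - start;
}
```

A benchmark loop would call `read_timer()` before and after the function under test and accumulate `timer_delta()` over many iterations. The counter runs at a fixed frequency rather than the CPU clock, so the result is in timer ticks, not CPU cycles.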
  4. 16 Jun, 2024 4 commits
  5. 15 Jun, 2024 1 commit
    • swscale/x86: add sse2 and avx2 {lum,chr}ConvertRange · f6859cad
      Ramiro Polla authored
      chrRangeFromJpeg_8_c: 22.3
      chrRangeFromJpeg_8_sse2: 13.3
      chrRangeFromJpeg_8_avx2: 13.3
      chrRangeFromJpeg_24_c: 72.8
      chrRangeFromJpeg_24_sse2: 22.3
      chrRangeFromJpeg_24_avx2: 17.5
      chrRangeFromJpeg_128_c: 345.5
      chrRangeFromJpeg_128_sse2: 106.0
      chrRangeFromJpeg_128_avx2: 57.8
      chrRangeFromJpeg_144_c: 380.5
      chrRangeFromJpeg_144_sse2: 118.5
      chrRangeFromJpeg_144_avx2: 62.3
      chrRangeFromJpeg_256_c: 646.3
      chrRangeFromJpeg_256_sse2: 218.8
      chrRangeFromJpeg_256_avx2: 109.0
      chrRangeFromJpeg_512_c: 1461.5
      chrRangeFromJpeg_512_sse2: 426.5
      chrRangeFromJpeg_512_avx2: 211.5
      chrRangeToJpeg_8_c: 37.8
      chrRangeToJpeg_8_sse2: 10.5
      chrRangeToJpeg_8_avx2: 14.0
      chrRangeToJpeg_24_c: 114.3
      chrRangeToJpeg_24_sse2: 23.5
      chrRangeToJpeg_24_avx2: 16.3
      chrRangeToJpeg_128_c: 633.5
      chrRangeToJpeg_128_sse2: 107.5
      chrRangeToJpeg_128_avx2: 55.0
      chrRangeToJpeg_144_c: 758.3
      chrRangeToJpeg_144_sse2: 132.0
      chrRangeToJpeg_144_avx2: 64.5
      chrRangeToJpeg_256_c: 1345.0
      chrRangeToJpeg_256_sse2: 218.0
      chrRangeToJpeg_256_avx2: 105.3
      chrRangeToJpeg_512_c: 2524.0
      chrRangeToJpeg_512_sse2: 417.0
      chrRangeToJpeg_512_avx2: 218.8
      lumRangeFromJpeg_8_c: 11.8
      lumRangeFromJpeg_8_sse2: 11.0
      lumRangeFromJpeg_8_avx2: 10.3
      lumRangeFromJpeg_24_c: 38.5
      lumRangeFromJpeg_24_sse2: 15.5
      lumRangeFromJpeg_24_avx2: 12.5
      lumRangeFromJpeg_128_c: 232.3
      lumRangeFromJpeg_128_sse2: 60.0
      lumRangeFromJpeg_128_avx2: 26.8
      lumRangeFromJpeg_144_c: 259.5
      lumRangeFromJpeg_144_sse2: 65.3
      lumRangeFromJpeg_144_avx2: 29.0
      lumRangeFromJpeg_256_c: 464.5
      lumRangeFromJpeg_256_sse2: 107.5
      lumRangeFromJpeg_256_avx2: 54.0
      lumRangeFromJpeg_512_c: 897.5
      lumRangeFromJpeg_512_sse2: 224.5
      lumRangeFromJpeg_512_avx2: 109.8
      lumRangeToJpeg_8_c: 17.8
      lumRangeToJpeg_8_sse2: 11.0
      lumRangeToJpeg_8_avx2: 11.8
      lumRangeToJpeg_24_c: 56.3
      lumRangeToJpeg_24_sse2: 11.0
      lumRangeToJpeg_24_avx2: 12.5
      lumRangeToJpeg_128_c: 333.8
      lumRangeToJpeg_128_sse2: 53.3
      lumRangeToJpeg_128_avx2: 26.5
      lumRangeToJpeg_144_c: 375.5
      lumRangeToJpeg_144_sse2: 60.8
      lumRangeToJpeg_144_avx2: 29.0
      lumRangeToJpeg_256_c: 652.0
      lumRangeToJpeg_256_sse2: 109.5
      lumRangeToJpeg_256_avx2: 53.5
      lumRangeToJpeg_512_c: 1284.3
      lumRangeToJpeg_512_sse2: 218.0
      lumRangeToJpeg_512_avx2: 108.3
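The benchmark figures in this commit message are easier to compare as speedup ratios over the C version. The sketch below does that arithmetic for the width-512 rows only, with the values copied from the list above. The `bench_row` struct and the `speedup` and `print_speedups` helpers are hypothetical names introduced for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* One benchmark row: timings for the C, SSE2 and AVX2 versions,
 * in the units reported in the commit message. */
struct bench_row {
    const char *name;
    double c, sse2, avx2;
};

/* Width-512 results copied from the commit message. */
static const struct bench_row rows[] = {
    { "chrRangeFromJpeg_512", 1461.5, 426.5, 211.5 },
    { "chrRangeToJpeg_512",   2524.0, 417.0, 218.8 },
    { "lumRangeFromJpeg_512",  897.5, 224.5, 109.8 },
    { "lumRangeToJpeg_512",   1284.3, 218.0, 108.3 },
};

/* Speedup of an optimized version relative to a baseline timing. */
static double speedup(double baseline, double optimized)
{
    assert(optimized > 0.0);
    return baseline / optimized;
}

/* Print one line per function with the SSE2 and AVX2 speedups. */
static void print_speedups(void)
{
    for (size_t i = 0; i < sizeof(rows) / sizeof(rows[0]); i++)
        printf("%-22s sse2: %.2fx  avx2: %.2fx\n", rows[i].name,
               speedup(rows[i].c, rows[i].sse2),
               speedup(rows[i].c, rows[i].avx2));
}
```

At small widths such as 8 the listed timings for the SIMD versions are close to each other, and in some cases to the C version, so ratios computed there say more about fixed call overhead than about throughput.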