• James Almer's avatar
    swscale/x86/input: add AVX2 optimized RGB32 to YUV functions · e9cfd532
    James Almer authored
    abgr_to_uv_8_c: 43.3
    abgr_to_uv_8_sse2: 14.3
    abgr_to_uv_8_avx: 15.3
    abgr_to_uv_8_avx2: 18.8
    abgr_to_uv_128_c: 650.3
    abgr_to_uv_128_sse2: 110.8
    abgr_to_uv_128_avx: 112.3
    abgr_to_uv_128_avx2: 64.8
    abgr_to_uv_1080_c: 5456.3
    abgr_to_uv_1080_sse2: 888.8
    abgr_to_uv_1080_avx: 900.8
    abgr_to_uv_1080_avx2: 518.3
    abgr_to_uv_1920_c: 9692.3
    abgr_to_uv_1920_sse2: 1593.8
    abgr_to_uv_1920_avx: 1613.3
    abgr_to_uv_1920_avx2: 864.8
    abgr_to_y_8_c: 23.3
    abgr_to_y_8_sse2: 12.8
    abgr_to_y_8_avx: 13.3
    abgr_to_y_8_avx2: 17.3
    abgr_to_y_128_c: 308.3
    abgr_to_y_128_sse2: 67.3
    abgr_to_y_128_avx: 66.8
    abgr_to_y_128_avx2: 44.8
    abgr_to_y_1080_c: 2371.3
    abgr_to_y_1080_sse2: 512.8
    abgr_to_y_1080_avx: 505.8
    abgr_to_y_1080_avx2: 314.3
    abgr_to_y_1920_c: 4177.3
    abgr_to_y_1920_sse2: 915.8
    abgr_to_y_1920_avx: 926.8
    abgr_to_y_1920_avx2: 519.3
    bgra_to_uv_8_c: 37.3
    bgra_to_uv_8_sse2: 13.3
    bgra_to_uv_8_avx: 14.8
    bgra_to_uv_8_avx2: 19.8
    bgra_to_uv_128_c: 563.8
    bgra_to_uv_128_sse2: 111.3
    bgra_to_uv_128_avx: 112.3
    bgra_to_uv_128_avx2: 64.8
    bgra_to_uv_1080_c: 4691.8
    bgra_to_uv_1080_sse2: 893.8
    bgra_to_uv_1080_avx: 899.8
    bgra_to_uv_1080_avx2: 517.8
    bgra_to_uv_1920_c: 8332.8
    bgra_to_uv_1920_sse2: 1590.8
    bgra_to_uv_1920_avx: 1605.8
    bgra_to_uv_1920_avx2: 867.3
    bgra_to_y_8_c: 22.3
    bgra_to_y_8_sse2: 12.8
    bgra_to_y_8_avx: 12.8
    bgra_to_y_8_avx2: 17.3
    bgra_to_y_128_c: 291.3
    bgra_to_y_128_sse2: 67.8
    bgra_to_y_128_avx: 69.3
    bgra_to_y_128_avx2: 45.3
    bgra_to_y_1080_c: 2357.3
    bgra_to_y_1080_sse2: 508.3
    bgra_to_y_1080_avx: 518.3
    bgra_to_y_1080_avx2: 399.8
    bgra_to_y_1920_c: 4202.8
    bgra_to_y_1920_sse2: 906.8
    bgra_to_y_1920_avx: 907.3
    bgra_to_y_1920_avx2: 526.3
    Signed-off-by: 's avatarJames Almer <jamrial@gmail.com>
    e9cfd532
input.asm 40.6 KB