• Reimar Döffinger's avatar
    lavc/aarch64: port HEVC SIMD idct NEON · 30f80d85
    Reimar Döffinger authored
    Makes SIMD-optimized 8x8 and 16x16 idcts for 8 and 10 bit depth
    available on aarch64.
    For a UHD HDR (10 bit) sample video these were consuming the most time
    and this optimization reduced overall decode time from 19.4s to 16.4s,
    approximately 15% speedup.
    Test sample was the first 300 frames of "LG 4K HDR Demo - New York.ts",
    running on Apple M1.
    Signed-off-by: 's avatarJosh Dekker <josh@itanimul.li>
    30f80d85
hevcdsp.h 7.01 KB