libavcodec/riscv/bswapdsp_rvv.S · ae5453503d1e63ef2cf6e6658e1d0b12470a39c7 · Stefan Westerfeld / ffmpeg

lavc/bswapdsp: purge RISC-V V bswap32 · 61e5ca4d

Rémi Denis-Courmont authored Jul 16, 2023

This cannot beat the Zbb implementation, and it is unlikely that a real
meaningful CPU design would support V and not Zbb. The best loop rewrite
that I could come up with (4 shifts, 2 ands, 3 ors) is still ~40% slower
than Zbb.

A proper faster vector implementation should be feasible with the
cryptographic vector extensions, but that is a story for another time.

61e5ca4d

bswapdsp_rvv.S 1.17 KB

Replace bswapdsp_rvv.S