Commit 7bbad32d authored by Wu Jianhua, committed by Paul B Mahol

libavfilter/x86/vf_gblur: correct the order of loop step

The problem occurred when the width of the processed block
minus 1 was a multiple of the alignment size: the instruction
jle .bscale_scalar then jumped past the Optimized Loop Step,
which led to incorrect sampling when more than one step was
specified. Move the Optimized Loop Step after .bscale_scalar
so that the loop step runs on both paths.

Signed-off-by: Wu Jianhua <jianhua.wu@intel.com>
parent fcf10c92
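
The control-flow issue described in the commit message can be illustrated with a toy model. This is only a sketch in Python, not the assembly itself: the function name `executed`, the `macro_before_label` flag, and the alignment value of 4 are hypothetical, and the condition `(width - 1) % align == 0` is taken from the commit message's wording rather than from the full source of `horiz_slice`.

```python
def executed(width, align, macro_before_label):
    """Return the ordered list of code regions a block passes through.

    Toy model (assumption): when (width - 1) is a multiple of `align`,
    nothing is left for the scalar loop and control jumps straight to
    the .bscale_scalar label, as `jle .bscale_scalar` does.
    """
    trace = []
    remaining = (width - 1) % align
    if remaining > 0:
        # Fall-through path: the scalar loop runs, then execution
        # continues into whatever sits just before the label.
        trace.append(".loop_x_scalar")
        if macro_before_label:
            trace.append("OPTIMIZED_LOOP_STEP")
    # Both paths arrive at the label; the jump path arrives here directly.
    trace.append(".bscale_scalar")
    if not macro_before_label:
        # After the fix, the macro sits below the label, so both paths run it.
        trace.append("OPTIMIZED_LOOP_STEP")
    return trace


if __name__ == "__main__":
    for width in (9, 10):
        print(width, "before fix:", executed(width, 4, True))
        print(width, "after fix: ", executed(width, 4, False))
```

In this model, a width of 9 with an alignment of 4 takes the jump path, so the old placement never reaches `OPTIMIZED_LOOP_STEP`, while the new placement reaches it for every width.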
@@ -524,9 +524,8 @@ cglobal horiz_slice, 4, 9, 9, ptr, width, height, steps, nu, bscale, x, y, step,
     cmp xq, 0
     jg .loop_x_scalar
-    OPTIMIZED_LOOP_STEP
 .bscale_scalar:
+    OPTIMIZED_LOOP_STEP
     sub ptrq, 4
     sub localbufq, mmsize
     mulps m3, m1
...