Since wp$$==wq$$, it doesn't need to load the same data twice, use move instruction to replace one of the loads to let the program run faster. Reviewed-by: Alexandre Ghiti <alexghiti@rivosinc.com> Signed-off-by: Chunyan Zhang <zhangchunyan@iscas.ac.cn> Link: https://lore.kernel.org/r/20250718072711.3865118-3-zhangchunyan@iscas.ac.cn Signed-off-by: Paul Walmsley <pjw@kernel.org> |
||
|---|---|---|
| .. | ||
| test | ||
| .gitignore | ||
| Makefile | ||
| algos.c | ||
| altivec.uc | ||
| avx2.c | ||
| avx512.c | ||
| int.uc | ||
| loongarch.h | ||
| loongarch_simd.c | ||
| mktables.c | ||
| mmx.c | ||
| neon.c | ||
| neon.h | ||
| neon.uc | ||
| recov.c | ||
| recov_avx2.c | ||
| recov_avx512.c | ||
| recov_loongarch_simd.c | ||
| recov_neon.c | ||
| recov_neon_inner.c | ||
| recov_rvv.c | ||
| recov_s390xc.c | ||
| recov_ssse3.c | ||
| rvv.c | ||
| rvv.h | ||
| s390vx.uc | ||
| sse1.c | ||
| sse2.c | ||
| unroll.awk | ||
| vpermxor.uc | ||
| x86.h | ||