granicus.if.org Git - libjpeg-turbo/commit
author DRC <dcommander@users.sourceforge.net>
Fri, 22 Aug 2014 18:30:44 +0000 (18:30 +0000)
committer DRC <dcommander@users.sourceforge.net>
Fri, 22 Aug 2014 18:30:44 +0000 (18:30 +0000)
commit 55e328ecb83a767d88203930e84e44bf6cc883f5
tree 1e85146aa9c1eb2ddc7aac10f594d57fc5019c9a
parent bdb36e1a140cd0c3be8158263afe1d02793117ce
Revert r1335 and r1336.  It was a valiant effort, but on Windows, xmm8-xmm15 are non-volatile, and the overhead of pushing them onto the stack at the beginning of each function and popping them at the end was causing worse performance (in the neighborhood of 3-5%) than just using the work areas and limiting the register usage to xmm0-xmm7.  Best to leave the SSE2 code alone.  We can optimize the register usage for AVX2, once that port takes place.

git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1382 632fc199-4ca6-4c93-a231-07263d6284db
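
As a rough illustration of the trade-off described in the message above, the sketch below counts how many XMM registers a function would have to save and restore under the Microsoft x64 calling convention, where xmm0-xmm5 are volatile and xmm6-xmm15 are non-volatile. The helper names (`win64_xmm_saves`, `win64_xmm_save_bytes`) are hypothetical and do not come from libjpeg-turbo; the sketch only tallies registers and stack bytes and makes no attempt to model the measured 3-5% slowdown.

```c
/* Microsoft x64 calling convention (assumed here):
 * xmm0-xmm5 are volatile (caller-saved),
 * xmm6-xmm15 are non-volatile (callee-saved). */
enum {
    WIN64_FIRST_NONVOLATILE_XMM = 6,
    XMM_REG_BYTES = 16   /* one 128-bit XMM register */
};

/* Hypothetical helper: number of XMM registers a function must save in its
 * prologue and restore in its epilogue if it uses xmm0..xmm<highest_used>. */
static int win64_xmm_saves(int highest_used)
{
    if (highest_used < WIN64_FIRST_NONVOLATILE_XMM)
        return 0;
    return highest_used - WIN64_FIRST_NONVOLATILE_XMM + 1;
}

/* Hypothetical helper: stack bytes needed to hold those saved registers. */
static int win64_xmm_save_bytes(int highest_used)
{
    return win64_xmm_saves(highest_used) * XMM_REG_BYTES;
}
```

Under these assumptions, `win64_xmm_saves(7)` is 2 and `win64_xmm_saves(15)` is 10, so widening a function from xmm0-xmm7 to xmm0-xmm15 adds eight saves and eight restores (128 more bytes of stack traffic in each direction) to every call on Windows. That added cost is what the reverted change was paying.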
simd/jccolext-sse2-64.asm
simd/jcgryext-sse2-64.asm
simd/jfdctfst-sse2-64.asm
simd/jfdctint-sse2-64.asm