ffmpeg / libavcodec / x86 @ 3ae079a3

Name Size Revision Age Author Comment
Makefile 2.89 KB c7b1d976 about 11 years Loren Merritt relicense h264 deblock sse2 to lgpl Originally...
cavsdsp_mmx.c 18.9 KB 7af8fbd3 about 11 years David Conrad Make ff_pw_4 128 bits Originally committed as ...
cpuid.c 4.52 KB 6526976f about 11 years Ronald S. Bultje Remove FF_MM_SSE2/3 flags for CPUs where this i...
dct32_sse.c 11.4 KB 881fd7a6 about 11 years Vitor Sessak Move SSE optimized 32-point DCT to its own file...
dnxhd_mmx.c 2.13 KB a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
dsputil_h264_template_mmx.c 10.7 KB 84dc2d8a over 11 years Måns Rullgård Remove DECLARE_ALIGNED_{8,16} macros These mac...
dsputil_h264_template_ssse3.c 7.04 KB 7af8fbd3 about 11 years David Conrad Make ff_pw_4 128 bits Originally committed as ...
dsputil_mmx.c 124 KB c7b1d976 about 11 years Loren Merritt relicense h264 deblock sse2 to lgpl Originally...
dsputil_mmx.h 6.49 KB af521abc about 11 years David Conrad Add header declarations for mmx/sse constants m...
dsputil_mmx_avg_template.c 40.7 KB 413abbe1 about 11 years David Conrad Add bitexact versions of put_no_rnd_pixels8 _x2...
dsputil_mmx_qns_template.c 3.71 KB a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
dsputil_mmx_rnd_template.c 22.7 KB 00312109 almost 12 years Reimar Döffinger Replace several #ifdef PIC with the more obviou...
dsputil_yasm.asm 9.77 KB 2966cc18 about 11 years Jason Garrett-Glaser Update x264asm header files to latest versions....
dsputilenc_mmx.c 43 KB 6526976f about 11 years Ronald S. Bultje Remove FF_MM_SSE2/3 flags for CPUs where this i...
fdct_mmx.c 17.7 KB d343d598 over 11 years Måns Rullgård Replace remaining uses of ATTR_ALIGNED with DEC...
fft.c 1.77 KB 881fd7a6 about 11 years Vitor Sessak Move SSE optimized 32-point DCT to its own file...
fft.h 1.58 KB 4dcc4f8e about 11 years Vitor Sessak SSE optimized 32-point DCT Originally committe...
fft_3dn.c 898 Bytes a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
fft_3dn2.c 5.11 KB 84dc2d8a over 11 years Måns Rullgård Remove DECLARE_ALIGNED_{8,16} macros These mac...
fft_mmx.asm 11 KB 1ee076b1 about 11 years Loren Merritt more credits to D. J. Bernstein for fft Origin...
fft_sse.c 6.75 KB 881fd7a6 about 11 years Vitor Sessak Move SSE optimized 32-point DCT to its own file...
h264_deblock_sse2.asm 19.3 KB c7b1d976 about 11 years Loren Merritt relicense h264 deblock sse2 to lgpl Originally...
h264_i386.h 6.08 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
h264_idct_sse2.asm 1.79 KB 2966cc18 about 11 years Jason Garrett-Glaser Update x264asm header files to latest versions....
h264_intrapred.asm 14.3 KB 17dc7c7a about 11 years Jason Garrett-Glaser Fix h264/vp8 intra pred on Athlon XP Whose idea...
h264dsp_mmx.c 97.3 KB 17dc7c7a about 11 years Jason Garrett-Glaser Fix h264/vp8 intra pred on Athlon XP Whose idea...
idct_mmx.c 22.4 KB d343d598 over 11 years Måns Rullgård Replace remaining uses of ATTR_ALIGNED with DEC...
idct_mmx_xvid.c 23.4 KB c26e58e3 over 11 years Måns Rullgård Add some missing #includes Originally committe...
idct_sse2_xvid.c 15.1 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
idct_xvid.h 1.2 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
lpc_mmx.c 5.61 KB 4a128945 over 11 years Reimar Döffinger Reduce number of ASM constraints for ff_lpc_com...
mathops.h 2.68 KB 22cb6fb6 about 11 years Michael Niedermayer Adding missing () to mathops.h. Originally com...
mlpdsp.c 6.21 KB 989b7181 about 12 years Ramiro Polla Use fewer macros in x86-optimized mlpdsp. Fixes...
mmx.h 12.8 KB a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
motion_est_mmx.c 15.7 KB c6727809 over 11 years Måns Rullgård Move array specifiers outside DECLARE_ALIGNED()...
mpegaudiodec_mmx.c 4.95 KB 89c7d805 about 11 years Vitor Sessak Fix compilation on x64. Originally committed a...
mpegvideo_mmx.c 27.6 KB 6b434361 over 12 years Stefano Sabatini Rename FF_MM_MMXEXT to FF_MM_MMX2, for both cla...
mpegvideo_mmx_template.c 17.1 KB 84dc2d8a over 11 years Måns Rullgård Remove DECLARE_ALIGNED_{8,16} macros These mac...
rv40dsp_mmx.c 3.14 KB 84dc2d8a over 11 years Måns Rullgård Remove DECLARE_ALIGNED_{8,16} macros These mac...
simple_idct_mmx.c 71 KB c6727809 over 11 years Måns Rullgård Move array specifiers outside DECLARE_ALIGNED()...
snowdsp_mmx.c 39.4 KB 05aec7bb over 11 years Måns Rullgård Separate DWT from snow and dsputil This moves ...
vc1dsp_mmx.c 34.4 KB c7eec581 about 11 years David Conrad Move ff_pw_* from vc1dsp_mmx.c to dsputil_mmx.c...
vc1dsp_yasm.asm 7.8 KB faa26db2 about 11 years David Conrad MMX/SSE VC1 loop filter Originally committed a...
vp3dsp_mmx.c 16.8 KB 179655b6 about 11 years David Conrad vp3: The DC-only IDCT is surprisingly not suppo...
vp3dsp_mmx.h 1.41 KB eb6a6cd7 over 11 years David Conrad vp3: DC-only IDCT 2-4% faster overall decode ...
vp3dsp_sse2.c 8.91 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
vp3dsp_sse2.h 1.17 KB a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
vp6dsp_mmx.c 5.14 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
vp6dsp_mmx.h 1.12 KB 6af3c226 over 12 years Sebastien Lucas add MMX version of vp6_filter_diag original pat...
vp6dsp_sse2.c 4.4 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
vp6dsp_sse2.h 1.11 KB ecb24904 over 12 years Zuxy Meng add SSE2 version of vp6_filter_diag original pa...
vp8dsp-init.c 18.7 KB 3ae079a3 about 11 years Jason Garrett-Glaser VP8: optimize DC-only chroma case in the same w...
vp8dsp.asm 76.5 KB 3ae079a3 about 11 years Jason Garrett-Glaser VP8: optimize DC-only chroma case in the same w...
x86inc.asm 16.4 KB 532e7697 about 11 years Loren Merritt sync yasm macros from x264 Originally committe...
x86util.asm 8.77 KB faa26db2 about 11 years David Conrad MMX/SSE VC1 loop filter Originally committed a...

Latest revisions

# Date Author Comment
3ae079a3 07/23/2010 06:02 AM Jason Garrett-Glaser

VP8: optimize DC-only chroma case in the same way as luma.
Add MMX idct_dc_add4uv function for this case.
~40% faster chroma idct.

Originally committed as revision 24455 to svn://svn.ffmpeg.org/ffmpeg/trunk

51c91564 07/23/2010 03:02 AM Jason Garrett-Glaser

VP8 asm: cosmetics (spacing)

Originally committed as revision 24453 to svn://svn.ffmpeg.org/ffmpeg/trunk

8a467b2d 07/23/2010 02:58 AM Jason Garrett-Glaser

VP8: 30% faster idct_mb
Take shortcuts based on statistically common situations.
Add 4-at-a-time idct_dc function (mmx and sse2) since rows of 4 DC-only DCT
blocks are common.
TODO: tie this more directly into the MB mode, since the DC-level transform is
only used for non-splitmv blocks?...
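
For readers unfamiliar with the DC-only shortcut mentioned in this commit, the following is a minimal scalar sketch in C, not the MMX/SSE2 code the commit adds. It assumes the usual VP8 DC-only rounding, (dc + 4) >> 3, with the result added to every pixel of a 4x4 block and clamped to 8 bits. The names clip_u8, idct_dc_add and idct_dc_add4 are hypothetical and chosen for illustration only.

```c
#include <stddef.h>
#include <stdint.h>

/* Clamp an int to the 0..255 range of an 8-bit pixel. */
static uint8_t clip_u8(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/*
 * DC-only inverse transform for one 4x4 block (hypothetical helper).
 * When only block[0] is non-zero, the full IDCT reduces to adding one
 * rounded constant to all 16 pixels. Assumes >> on a negative int is an
 * arithmetic shift, which holds on common compilers such as GCC and Clang.
 */
static void idct_dc_add(uint8_t *dst, int16_t block[16], ptrdiff_t stride)
{
    int dc = (block[0] + 4) >> 3;

    block[0] = 0; /* leave the coefficient block cleared for reuse */
    for (int y = 0; y < 4; y++) {
        for (int x = 0; x < 4; x++)
            dst[x] = clip_u8(dst[x] + dc);
        dst += stride;
    }
}

/*
 * "4-at-a-time" variant: four horizontally adjacent DC-only blocks,
 * i.e. one 16-pixel-wide row of 4x4 blocks, handled in a single call.
 */
static void idct_dc_add4(uint8_t *dst, int16_t block[4][16], ptrdiff_t stride)
{
    for (int i = 0; i < 4; i++)
        idct_dc_add(dst + 4 * i, block[i], stride);
}
```

A SIMD version can load the four DC values once and process a whole 16-pixel row per instruction, which is presumably where a grouped function gains over four separate calls.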

c25c7767 07/23/2010 12:07 AM Jason Garrett-Glaser

VP8: clear DCT blocks in iDCT instead of using clear_blocks.
~0.3% faster overall.

Originally committed as revision 24448 to svn://svn.ffmpeg.org/ffmpeg/trunk

dc5eec80 07/22/2010 07:59 PM Ronald S. Bultje

Use pextrw for SSE4 mbedge filter result writing, speedup 5-10 cycles on
CPUs supporting it.

Originally committed as revision 24437 to svn://svn.ffmpeg.org/ffmpeg/trunk

003243c3 07/22/2010 01:35 AM Ronald S. Bultje

Fix and enable horizontal >=SSE2 mbedge loopfilter.

Originally committed as revision 24409 to svn://svn.ffmpeg.org/ffmpeg/trunk

c7b1d976 07/22/2010 12:39 AM Loren Merritt

relicense h264 deblock sse2 to lgpl

Originally committed as revision 24408 to svn://svn.ffmpeg.org/ffmpeg/trunk

532e7697 07/21/2010 10:45 PM Loren Merritt

sync yasm macros from x264

Originally committed as revision 24406 to svn://svn.ffmpeg.org/ffmpeg/trunk

8731dbd8 07/21/2010 10:41 PM Jason Garrett-Glaser

Eliminate one instruction in VP8 dc_add_sse4

Originally committed as revision 24405 to svn://svn.ffmpeg.org/ffmpeg/trunk

7dd224a4 07/21/2010 10:11 PM Jason Garrett-Glaser

Various VP8 x86 deblocking speedups
SSSE3 versions, improve SSE2 versions a bit.
SSE2/SSSE3 mbedge h functions are currently broken, so explicitly disable them.

Originally committed as revision 24403 to svn://svn.ffmpeg.org/ffmpeg/trunk
