Statistics
| Branch: | Revision:

ffmpeg / libavcodec / x86 @ c6a908be

Name Size Revision Age Author Comment
Makefile 3.03 KB d0acc2d2 almost 11 years Ronald S. Bultje Move sse16_sse2() from inline asm to yasm. It i...
cavsdsp_mmx.c 19.2 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
dct32_sse.c 11.5 KB c6a908be almost 11 years Ramiro Polla dct32: mark xmm registers in clobber list in ff...
deinterlace.asm 2.47 KB de4bc44a almost 11 years Vitor Sessak Convert deinterlacing MMX code to YASM Origina...
dnxhd_mmx.c 2.15 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
dsputil_mmx.c 123 KB dd68d4db almost 11 years Ronald S. Bultje MMX, MMX2, SSE2 and SSSE3 optimizations for pre...
dsputil_mmx.h 8.2 KB 2c166c3a almost 11 years Ronald S. Bultje Port latest x264 deblock asm (before they moved...
dsputil_mmx_avg_template.c 40.7 KB 413abbe1 about 11 years David Conrad Add bitexact versions of put_no_rnd_pixels8 _x2...
dsputil_mmx_qns_template.c 3.71 KB a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
dsputil_mmx_rnd_template.c 22.7 KB 00312109 almost 12 years Reimar Döffinger Replace several #ifdef PIC with the more obviou...
dsputil_yasm.asm 9.77 KB 2966cc18 about 11 years Jason Garrett-Glaser Update x264asm header files to latest versions....
dsputilenc_mmx.c 35.1 KB c0bc8b9a almost 11 years Måns Rullgård x86: disable SSE functions using stack when sta...
dsputilenc_yasm.asm 9.52 KB ada65af9 almost 11 years Ronald S. Bultje Don't access upper 32 bits of a 32-bit int on 6...
fdct_mmx.c 17.7 KB d343d598 over 11 years Måns Rullgård Replace remaining uses of ATTR_ALIGNED with DEC...
fft.c 1.83 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
fft.h 1.58 KB 4dcc4f8e about 11 years Vitor Sessak SSE optimized 32-point DCT Originally committe...
fft_3dn.c 898 Bytes a6493a8f over 12 years Diego Biurrun Rename libavcodec/i386/ --> libavcodec/x86/. It...
fft_3dn2.c 5.1 KB cb4f1246 almost 11 years Alex Converse imdct/x86: Use "s->mdct_size" instead of "1 << ...
fft_mmx.asm 14.9 KB dc77e985 almost 11 years Reimar Döffinger Split and then simplify address generation macr...
fft_sse.c 2.95 KB 326bf69a almost 11 years Ramiro Polla fft: mark xmm registers as clobbered in ff_imdc...
h264_chromamc.asm 17.4 KB d0eb5a11 almost 11 years Ronald S. Bultje Move H264 chroma MC from inline asm to yasm. Th...
h264_deblock.asm 22.4 KB 2c166c3a almost 11 years Ronald S. Bultje Port latest x264 deblock asm (before they moved...
h264_i386.h 6.08 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
h264_idct.asm 21.4 KB 02b424d9 almost 11 years Reimar Döffinger Add d suffix to movd target register to make it...
h264_intrapred.asm 26.6 KB 6c214280 almost 11 years Reimar Döffinger Add d modifier to asm argument to fix nasm comp...
h264_intrapred_init.c 7.37 KB dd68d4db almost 11 years Ronald S. Bultje MMX, MMX2, SSE2 and SSSE3 optimizations for pre...
h264_qpel_mmx.c 52.7 KB b32c9ca9 almost 11 years Ramiro Polla h264dsp: merge some asm blocks Some code was i...
h264_weight.asm 8.33 KB b1c32fb5 almost 11 years Reimar Döffinger Use "d" suffix for general-purpose registers us...
h264dsp_mmx.c 16.9 KB a52ffc3f almost 11 years Ronald S. Bultje Move static inline function to a macro, so that...
idct_mmx.c 23.7 KB 740dfe70 about 11 years Vitor Sessak Fix compilation in x86_64. I broke it with r245...
idct_mmx_xvid.c 23.4 KB c26e58e3 over 11 years Måns Rullgård Add some missing #includes Originally committe...
idct_sse2_xvid.c 15.1 KB 7e7c4b60 almost 11 years Ronald S. Bultje Put ff_ prefix on non-static {put_signed,put,ad...
idct_xvid.h 1.2 KB ba87f080 over 11 years Diego Biurrun Remove explicit filename from Doxygen @file com...
lpc_mmx.c 5.61 KB 4a128945 over 11 years Reimar Döffinger Reduce number of ASM constraints for ff_lpc_com...
mathops.h 2.68 KB 22cb6fb6 about 11 years Michael Niedermayer Adding missing () to mathops.h. Originally com...
mlpdsp.c 6.21 KB 989b7181 about 12 years Ramiro Polla Use fewer macros in x86-optimized mlpdsp. Fixes...
motion_est_mmx.c 15.8 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
mpegaudiodec_mmx.c 5 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
mpegvideo_mmx.c 27.7 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
mpegvideo_mmx_template.c 17.1 KB 84dc2d8a over 11 years Måns Rullgård Remove DECLARE_ALIGNED_{8,16} macros These mac...
simple_idct_mmx.c 71 KB 7e7c4b60 almost 11 years Ronald S. Bultje Put ff_ prefix on non-static {put_signed,put,ad...
snowdsp_mmx.c 39.4 KB 9276bddd almost 11 years İsmail Dönmez snowdsp: Explicitly state the operand sizes Fi...
vc1dsp_mmx.c 34.5 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
vc1dsp_yasm.asm 7.8 KB b1c32fb5 almost 11 years Reimar Döffinger Use "d" suffix for general-purpose registers us...
vp3dsp.asm 20.7 KB b1c32fb5 almost 11 years Reimar Döffinger Use "d" suffix for general-purpose registers us...
vp56_arith.h 1.71 KB 05c04cdf almost 11 years Jason Garrett-Glaser VP5/6/8: ~7% faster arithmetic decoding Grab fr...
vp56dsp.asm 4.86 KB 4eca52ed almost 11 years Ronald S. Bultje Fix typos when converting inline asm to yasm, f...
vp56dsp_init.c 1.73 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
vp8dsp-init.c 19 KB c6c98d08 almost 11 years Stefano Sabatini Move mm_support() from libavcodec to libavutil,...
vp8dsp.asm 78.4 KB b1c32fb5 almost 11 years Reimar Döffinger Use "d" suffix for general-purpose registers us...
x86inc.asm 16.4 KB 532e7697 about 11 years Loren Merritt sync yasm macros from x264 Originally committe...
x86util.asm 9.12 KB e2e34104 almost 11 years Ronald S. Bultje Move hadamard_diff{,16}_{mmx,mmx2,sse2,ssse3}()...

Latest revisions

# Date Author Comment
c6a908be 10/25/2010 08:29 PM Ramiro Polla

dct32: mark xmm registers in clobber list in ff_dct32_float_sse()

Originally committed as revision 25569 to svn://svn.ffmpeg.org/ffmpeg/trunk

b32c9ca9 10/25/2010 06:02 PM Ramiro Polla

h264dsp: merge some asm blocks

Some code was initializing some xmm registers in one asm block and using them
in the following block, assuming they wouldn't be changed in between blocks.

Originally committed as revision 25568 to svn://svn.ffmpeg.org/ffmpeg/trunk

6c214280 10/07/2010 07:18 PM Reimar Döffinger

Add d modifier to asm argument to fix nasm compilation.

Originally committed as revision 25397 to svn://svn.ffmpeg.org/ffmpeg/trunk

326bf69a 10/06/2010 01:27 AM Ramiro Polla

fft: mark xmm registers as clobbered in ff_imdct_calc_sse

Originally committed as revision 25363 to svn://svn.ffmpeg.org/ffmpeg/trunk

dd68d4db 10/05/2010 10:06 PM Ronald S. Bultje

MMX, MMX2, SSE2 and SSSE3 optimizations for pred16x16/8x8_plane H264 intra
prediction (plus some with different rounding for svq3/rv40). Speedup (for
SSSE3) about ~6-fold, 3.6% faster overall with cathedral sample.

Originally committed as revision 25361 to svn://svn.ffmpeg.org/ffmpeg/trunk

9276bddd 10/04/2010 01:08 PM İsmail Dönmez

snowdsp: Explicitly state the operand sizes

Fixes compilation with clang's builtin assembler

Patch by İsmail Dönmez, ismail at namtrac dot org

Originally committed as revision 25331 to svn://svn.ffmpeg.org/ffmpeg/trunk

a52ffc3f 09/29/2010 05:42 PM Ronald S. Bultje

Move static inline function to a macro, so that constant propagation in
inline asm works for gcc-3.x also (hopefully). Should fix gcc-3.x FATE
breakage after r25254.

Originally committed as revision 25262 to svn://svn.ffmpeg.org/ffmpeg/trunk

329d689f 09/29/2010 03:34 PM Eli Friedman

Use sse2 variant of put_pixels16() for no_rnd also. Provides a minor speed
increase to e.g. vc1, snow and mpeg decoding.

Patch by Eli Friedman <eli dot friedman gmail com>.

Originally committed as revision 25259 to svn://svn.ffmpeg.org/ffmpeg/trunk

cd17285e 09/29/2010 02:04 PM Ronald S. Bultje

Merge b_idx and edge variables, and optimize the ASM to directly load variables
from memory locations/offsets depending on b_idx plus constants, rather than
having gcc do this. This saves several lea calls and together saves about
10 cycles in h264_loop_filter_strength_mmx2()....

0cc8a5d0 09/29/2010 02:03 PM Ronald S. Bultje

Remove mv_mask variable. Replace the related pand -1/0 instructions by either
a pxor, or remove the instruction alltogether. Altogether, this saves 1
instruction.

Originally committed as revision 25255 to svn://svn.ffmpeg.org/ffmpeg/trunk

View revisions

Also available in: Atom