Statistics
| Branch: | Revision:

ffmpeg / libavcodec / i386 @ ce611a27

# Date Author Comment
ce611a27 08/21/2007 04:29 PM Michael Niedermayer

Change rounding of the horizontal DWT to match the vertical one.
This allows some simplifications and optimizations and should
not have any effect on quality.

Originally committed as revision 10172 to svn://svn.ffmpeg.org/ffmpeg/trunk

30cd3e66 08/21/2007 12:05 AM Michael Niedermayer

remove code which become unused by the previous changes

Originally committed as revision 10166 to svn://svn.ffmpeg.org/ffmpeg/trunk

72dee89b 08/21/2007 12:03 AM Michael Niedermayer

Simplify and optimize the 4th vertical lifting step of the SSE2 code (untested)
This also reduces the needed headroom in that step by 1 bit

Originally committed as revision 10165 to svn://svn.ffmpeg.org/ffmpeg/trunk

d0dae46a 08/21/2007 12:02 AM Michael Niedermayer

Simplify and optimize the 4th vertical lifting step of the MMX code
This also reduces the needed headroom in that step by 1 bit

Originally committed as revision 10164 to svn://svn.ffmpeg.org/ffmpeg/trunk

1ffbbef2 08/20/2007 11:59 PM Michael Niedermayer

Simplify and speedup code, reduce needed headroom by 2 bits in the 3rd
vertical lifting step of the SSE2 code (untested)

Originally committed as revision 10163 to svn://svn.ffmpeg.org/ffmpeg/trunk

4bf17904 08/20/2007 11:54 PM Michael Niedermayer

simplify, speedup and reduce needed headroom by 2 bits in the 3rd
vertical lifting step

Originally committed as revision 10162 to svn://svn.ffmpeg.org/ffmpeg/trunk

dd30437b 08/20/2007 11:11 PM Michael Niedermayer

replace <<1 by add for SSE2 (untested)

Originally committed as revision 10161 to svn://svn.ffmpeg.org/ffmpeg/trunk

7e665a39 08/20/2007 11:09 PM Michael Niedermayer

replace <<1 by add

Originally committed as revision 10160 to svn://svn.ffmpeg.org/ffmpeg/trunk

eee649d3 08/20/2007 11:02 PM Michael Niedermayer

slightly change horizontal lift3 so it needs 1 bit less headroom

Originally committed as revision 10159 to svn://svn.ffmpeg.org/ffmpeg/trunk

be3b22f9 08/20/2007 10:41 PM Michael Niedermayer

remove idiotc double subtraction from the sse2 code (untested, no sse2 here)

Originally committed as revision 10158 to svn://svn.ffmpeg.org/ffmpeg/trunk

8510c519 08/20/2007 10:34 PM Michael Niedermayer

fixme note

Originally committed as revision 10157 to svn://svn.ffmpeg.org/ffmpeg/trunk

3a9f44d5 08/20/2007 10:29 PM Michael Niedermayer

and of course the unneeded double subtractions were blindly put in the
mmx code
this also makes the affected code 4% faster

Originally committed as revision 10156 to svn://svn.ffmpeg.org/ffmpeg/trunk

73f51a4d 07/24/2007 08:54 AM Aurelien Jacobs

help some gcc version to optimize out those functions

Originally committed as revision 9785 to svn://svn.ffmpeg.org/ffmpeg/trunk

674eeb5f 07/10/2007 08:27 PM Aurelien Jacobs

cosmetics: indentation

Originally committed as revision 9582 to svn://svn.ffmpeg.org/ffmpeg/trunk

eb75a698 07/10/2007 08:23 PM Aurelien Jacobs

Avoid linking with h263.c functions when the relevant codecs
are not compiled in.

Originally committed as revision 9581 to svn://svn.ffmpeg.org/ffmpeg/trunk

a00177a9 07/08/2007 11:15 PM Måns Rullgård

make arguments to ssd_int8_vs_int16() const

Originally committed as revision 9548 to svn://svn.ffmpeg.org/ffmpeg/trunk

09a66826 07/05/2007 12:06 PM Diego Biurrun

typos

Originally committed as revision 9486 to svn://svn.ffmpeg.org/ffmpeg/trunk

e5a389a1 07/05/2007 10:40 AM Diego Biurrun

license header consistency cosmetics

Originally committed as revision 9484 to svn://svn.ffmpeg.org/ffmpeg/trunk

7b94177e 07/05/2007 10:37 AM Diego Biurrun

Group all copyright and author notices together.

Originally committed as revision 9483 to svn://svn.ffmpeg.org/ffmpeg/trunk

efb77577 06/17/2007 06:59 PM Guillaume Poirier

add a comment to indicate which #endif belong to which #define

Originally committed as revision 9356 to svn://svn.ffmpeg.org/ffmpeg/trunk

699b3f99 06/17/2007 12:01 AM Måns Rullgård

add multiple inclusion guards to headers

Originally committed as revision 9345 to svn://svn.ffmpeg.org/ffmpeg/trunk

8657d5e5 06/16/2007 02:51 PM Måns Rullgård

correct type for {put,avg}_h264_chroma_mc2_mmx2

Originally committed as revision 9333 to svn://svn.ffmpeg.org/ffmpeg/trunk

90b5b51e 06/12/2007 06:50 PM Diego Biurrun

misc typo fixes

Originally committed as revision 9291 to svn://svn.ffmpeg.org/ffmpeg/trunk

755bfeab 06/12/2007 09:29 AM Diego Biurrun

misc spelling fixes

Originally committed as revision 9289 to svn://svn.ffmpeg.org/ffmpeg/trunk

ee387b57 06/08/2007 08:35 PM Graham Booker

work around issues with the old version of Gnu Assembler shipped on
Intel / OSX 10.4.9
patch by Graham Booker % gbooker A cod3r P com %
Original thread:
Date: May 25, 2007 11:51 PM
Subject: [FFmpeg-devel] r9017 breaks WMA decoding on Intel Macs
The patch:...

34933442 06/07/2007 05:19 PM Diego Biurrun

Preprocessor #defines starting with __ are reserved for the system.

Originally committed as revision 9251 to svn://svn.ffmpeg.org/ffmpeg/trunk

3d4e3e56 06/07/2007 05:18 PM Ronald S. Bultje

Use FFmpeg-specific CPU feature definitions.
taken from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net
Date: Thu, 07 Jun 2007 08:57:46 -0400
Subject: Re: [FFmpeg-devel] [PATCH] Coremake support - ffmpeg_nommx.patch (1/1)
- ffmpeg-nommx.patch (1/1) - ff-cputest-noavconfig.patch (1/1)...

154e30f6 05/30/2007 09:32 AM Carl Eugen Hoyos

rename attribute_unused to av_unused and moves its declaration to common.h
patch by Carl Eugen Hoyos cehoyos chez ag or at
original thread: [FFmpeg-devel] [PATCH] attribute_unused -> av_unused
date: 05/29/2007 01:23 PM

Originally committed as revision 9155 to svn://svn.ffmpeg.org/ffmpeg/trunk

663deb54 05/20/2007 05:07 AM Zuxy Meng

Remove incorrect comment; MMX2 is preferred over 3DNow! on Athlon

Originally committed as revision 9079 to svn://svn.ffmpeg.org/ffmpeg/trunk

038bfcf9 05/18/2007 08:18 AM Zuxy Meng

3DNow! and SSSE3 optimization to QNS DSP functions; use pmulhrw/pmulhrsw instead of pmulhw

Originally committed as revision 9053 to svn://svn.ffmpeg.org/ffmpeg/trunk

5b0b7054 05/16/2007 11:23 PM Aurelien Jacobs

better separation of vp3dsp functions from dsputil_mmx.c

Originally committed as revision 9039 to svn://svn.ffmpeg.org/ffmpeg/trunk

b550bfaa 05/16/2007 09:51 AM Ronald S. Bultje

Add libavcodec to compiler include flags in order to simplify header
include paths in the source files.
mostly from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net

Originally committed as revision 9034 to svn://svn.ffmpeg.org/ffmpeg/trunk

9b5dc867 05/14/2007 02:28 PM Panagiotis Issaris

Make vp3dsp*.c compilation optional.

Originally committed as revision 9025 to svn://svn.ffmpeg.org/ffmpeg/trunk

e36d79c8 05/14/2007 02:07 PM Reimar Döffinger

Change some leftover attribute((unused)) and attribute((used)) to

attribute_unused and attribute_used respectively to ease compiling on non-gcc.

Originally committed as revision 9024 to svn://svn.ffmpeg.org/ffmpeg/trunk

25e4f8aa 05/13/2007 04:32 PM Zuxy Meng

Faster SSE FFT/MDCT, patch by Zuxy Meng zuxy P meng A gmail P com
unrolls some loops, utilizing all 8 xmm registers. fft-test
shows ~10% speed up in (I)FFT and ~8% speed up in (I)MDCT on Dothan

Originally committed as revision 9017 to svn://svn.ffmpeg.org/ffmpeg/trunk

ff506a90 05/12/2007 05:55 AM Loren Merritt

sse2 & ssse3 versions of dct_quantize.
core2: mmx2=154 sse2=73 ssse3=66 (cycles)
k8: mmx2=179 sse2=149
p4: mmx2=284 sse2=194

Originally committed as revision 9003 to svn://svn.ffmpeg.org/ffmpeg/trunk

1edbfe19 05/12/2007 02:41 AM Loren Merritt

factor sum_abs_dctelem out of dct_sad, and simd it.
sum_abs_dctelem_* alone:
core2: c=186 mmx2=39 sse2=21 ssse3=13 (cycles)
k8: c=163 mmx2=33 sse2=31
p4: c=370 mmx2=60 sse2=60
dct_sad including sum_abs_dctelem_*:
core2: c=405 mmx2=258 sse2=240 ssse3=232...

561f940c 05/12/2007 01:16 AM Loren Merritt

sse2 & ssse3 versions of hadamard. unroll and inline diff_pixels.
core2: before mmx2=193 cycles. after mmx2=174 sse2=122 ssse3=115 (cycles).
k8: before mmx2=205. after mmx2=184 sse2=180.
p4: before mmx2=342. after mmx2=314 sse2=309.

Originally committed as revision 9000 to svn://svn.ffmpeg.org/ffmpeg/trunk

ba53071a 05/11/2007 03:29 AM Loren Merritt

10l, r8991 broke mmx1 sad

Originally committed as revision 8993 to svn://svn.ffmpeg.org/ffmpeg/trunk

72946825 05/11/2007 01:11 AM Loren Merritt

sse2 version of fullpel sad.
16% faster on core2, 5% faster on p4. 10% slower (and thus disabled) on k8.

Originally committed as revision 8992 to svn://svn.ffmpeg.org/ffmpeg/trunk

164d75eb 05/11/2007 12:45 AM Loren Merritt

tweak mmx2 sad.
40% faster on core2, 18% faster on k8, 5% faster on p4.

Originally committed as revision 8991 to svn://svn.ffmpeg.org/ffmpeg/trunk

eca3810e 05/10/2007 10:24 PM Loren Merritt

tweak mmx2 sad.
6% faster on core2 and k8, no change on p4.

Originally committed as revision 8984 to svn://svn.ffmpeg.org/ffmpeg/trunk

7c3a9fe2 05/10/2007 03:13 AM Loren Merritt

sse2 version of fdct_col.
k8: 72->61 cycles, core2: 51->26 cycles.

Originally committed as revision 8966 to svn://svn.ffmpeg.org/ffmpeg/trunk

5adf43e4 05/09/2007 01:46 AM Loren Merritt

cosmetics: remove code duplication in hadamard8_diff_mmx

Originally committed as revision 8946 to svn://svn.ffmpeg.org/ffmpeg/trunk

bba5293b 05/08/2007 05:55 PM Loren Merritt

cosmetics: remove duplicate transpose macro

Originally committed as revision 8939 to svn://svn.ffmpeg.org/ffmpeg/trunk

a1ce6110 04/19/2007 04:12 PM Reimar Döffinger

Fix parts missed in clip -> av_clip rename

Originally committed as revision 8760 to svn://svn.ffmpeg.org/ffmpeg/trunk

fe037229 04/07/2007 02:10 PM Diego Biurrun

typos

Originally committed as revision 8642 to svn://svn.ffmpeg.org/ffmpeg/trunk

59006372 03/30/2007 07:15 PM Loren Merritt

mmx 16-bit ssd. 2.3x faster svq1 encoding.

Originally committed as revision 8559 to svn://svn.ffmpeg.org/ffmpeg/trunk

d42f8802 02/24/2007 11:58 AM Diego Biurrun

Fix wrong conditional, Snow decoding, not encoding, was SIMD-accelerated.

Originally committed as revision 8116 to svn://svn.ffmpeg.org/ffmpeg/trunk

58e31fb1 02/23/2007 03:44 PM Michael Niedermayer

reorder a few more paddws to reduce dependancy chains
chroma mc4 put 2480 -> 2460 dezicyles on duron

Originally committed as revision 8098 to svn://svn.ffmpeg.org/ffmpeg/trunk

b4fe9769 02/23/2007 03:28 PM Michael Niedermayer

reorder paddws to reduce dependancy chain
put_h264_chroma_mc2_mmx2() 927 -> 902 dezicyles on duron

Originally committed as revision 8097 to svn://svn.ffmpeg.org/ffmpeg/trunk

0c67082e 02/23/2007 03:03 PM Michael Niedermayer

shortening dependancy chain in chroma mc2

Originally committed as revision 8095 to svn://svn.ffmpeg.org/ffmpeg/trunk

af265162 02/23/2007 02:29 PM Michael Niedermayer

remove now wrong comment

Originally committed as revision 8094 to svn://svn.ffmpeg.org/ffmpeg/trunk

61240ae5 02/23/2007 02:29 PM Michael Niedermayer

fix chroma mc2 bug, this is based on a patch by (Oleg Metelitsa oleg hitron co kr)
and does slow the mc2 chroma put down, avg interrestingly seems unaffected speedwise on duron
this of course should be rather done in a way which doesnt slow it down but its better a few %...

470d2d03 02/22/2007 12:04 AM Michael Niedermayer

gcc 2.95 fix

Originally committed as revision 8059 to svn://svn.ffmpeg.org/ffmpeg/trunk

459022f5 02/18/2007 08:00 PM Måns Rullgård

fix for x86-64

Originally committed as revision 8022 to svn://svn.ffmpeg.org/ffmpeg/trunk

b21e0b6d 02/17/2007 11:43 PM Michael Niedermayer

rewrite H264_CHROMA_MC4_TMPL (20% faster)

Originally committed as revision 8012 to svn://svn.ffmpeg.org/ffmpeg/trunk

2a115873 02/16/2007 09:22 PM Michael Niedermayer

add a few asserts to ensure alignment

Originally committed as revision 7994 to svn://svn.ffmpeg.org/ffmpeg/trunk

00e210dd 02/16/2007 09:21 PM Michael Niedermayer

prevent h.264 MC related functions from being inlined (yes this is much faster the code just doesnt fit in the code cache otherwise)

Originally committed as revision 7993 to svn://svn.ffmpeg.org/ffmpeg/trunk

392b76ca 02/10/2007 01:33 PM Reimar Döffinger

Minor AMD64 compilation fix

Originally committed as revision 7907 to svn://svn.ffmpeg.org/ffmpeg/trunk

9bc0d3ef 02/10/2007 03:56 AM Michael Niedermayer

maybe fix x86_64 (untested)

Originally committed as revision 7906 to svn://svn.ffmpeg.org/ffmpeg/trunk

7c4fd7eb 02/09/2007 12:37 PM Michael Niedermayer

factor out common subexprssion (gcc of course is too stupid to do this ...)
5% faster avg_h264_chroma_mc2_mmx2()
10% faster put_h264_chroma_mc2_mmx2()

Originally committed as revision 7898 to svn://svn.ffmpeg.org/ffmpeg/trunk

9301a0b4 02/09/2007 12:24 PM Michael Niedermayer

merge asm fragments in H264_CHROMA_MC2_TMPL()
10% faster avg_h264_chroma_mc2_mmx2()
5% faster put_h264_chroma_mc2_mmx2()

Originally committed as revision 7897 to svn://svn.ffmpeg.org/ffmpeg/trunk

9dd6c804 01/30/2007 10:31 AM Panagiotis Issaris

Add the const specifier as needed to reduce the number of warnings.

Originally committed as revision 7764 to svn://svn.ffmpeg.org/ffmpeg/trunk

9688979c 01/22/2007 01:16 AM Diego Biurrun

Fix some more license headers.

Originally committed as revision 7637 to svn://svn.ffmpeg.org/ffmpeg/trunk

5a5c770d 12/18/2006 10:43 PM Guillaume Poirier

Add SSSE3 (Core2 aka Conroe/Merom/Woodcrester new instructions) detection

Originally committed as revision 7332 to svn://svn.ffmpeg.org/ffmpeg/trunk

849f1035 12/08/2006 12:35 AM Måns Rullgård

rename always_inline to av_always_inline and move to common.h

Originally committed as revision 7256 to svn://svn.ffmpeg.org/ffmpeg/trunk

486497e0 11/14/2006 03:18 AM Måns Rullgård

revert bad checkin

Originally committed as revision 7044 to svn://svn.ffmpeg.org/ffmpeg/trunk

be6ed6ff 11/14/2006 03:12 AM Måns Rullgård

move some CFLAGS settings away from config.* writing section

Originally committed as revision 7043 to svn://svn.ffmpeg.org/ffmpeg/trunk

7466ed2f 11/12/2006 11:35 PM Måns Rullgård

zigzag_direct_noperm doesn't exist, remove declaration

Originally committed as revision 6998 to svn://svn.ffmpeg.org/ffmpeg/trunk

36cd3069 11/12/2006 06:49 PM Måns Rullgård

rename inverse -> ff_inverse

Originally committed as revision 6990 to svn://svn.ffmpeg.org/ffmpeg/trunk

bb54f6ab 11/12/2006 03:34 AM Måns Rullgård

adding more static keywords

Originally committed as revision 6976 to svn://svn.ffmpeg.org/ffmpeg/trunk

079e61db 11/03/2006 04:54 PM Michael Niedermayer

ensure alignment (no speed change)

Originally committed as revision 6891 to svn://svn.ffmpeg.org/ffmpeg/trunk

f5a9e8f3 11/03/2006 04:02 PM Michael Niedermayer

merging mov & and (no speedchange)

Originally committed as revision 6889 to svn://svn.ffmpeg.org/ffmpeg/trunk

e80cf125 11/03/2006 03:40 PM Michael Niedermayer

2 instructions less (same speed)

Originally committed as revision 6888 to svn://svn.ffmpeg.org/ffmpeg/trunk

93471182 11/03/2006 03:17 PM Michael Niedermayer

comment about failed optimization

Originally committed as revision 6887 to svn://svn.ffmpeg.org/ffmpeg/trunk

38cfdc83 11/03/2006 02:28 PM Michael Niedermayer

move luma tc0 related init into asm
5% faster filter_mb_fast() on P3

Originally committed as revision 6884 to svn://svn.ffmpeg.org/ffmpeg/trunk

25225c37 11/03/2006 12:07 PM Michael Niedermayer

2 instructions less in h264_loop_filter_luma_mmx2()

Originally committed as revision 6882 to svn://svn.ffmpeg.org/ffmpeg/trunk

bda2203d 11/03/2006 11:07 AM Michael Niedermayer

preempt possible overflow

Originally committed as revision 6881 to svn://svn.ffmpeg.org/ffmpeg/trunk

5a1553de 11/03/2006 09:59 AM Michael Niedermayer

1 instruction less

Originally committed as revision 6880 to svn://svn.ffmpeg.org/ffmpeg/trunk

e9f1885c 11/03/2006 02:03 AM Michael Niedermayer

optimize H264_DEBLOCK_P0_Q0
2.5% faster filter_mb_fast() on P3

Originally committed as revision 6877 to svn://svn.ffmpeg.org/ffmpeg/trunk

7c428ea6 10/14/2006 05:04 PM Diego Biurrun

Put libmpeg2 IDCT functions under CONFIG_GPL, fixes link failure
with --disable-opts.

Originally committed as revision 6691 to svn://svn.ffmpeg.org/ffmpeg/trunk

c26abfa5 10/11/2006 11:17 PM Diego Biurrun

Rename ABS macro to FFABS.

Originally committed as revision 6666 to svn://svn.ffmpeg.org/ffmpeg/trunk

b78e7197 10/07/2006 03:30 PM Diego Biurrun

Change license headers to say 'FFmpeg' instead of 'this program/this library'
and fix GPL/LGPL version mismatches.

Originally committed as revision 6577 to svn://svn.ffmpeg.org/ffmpeg/trunk

ade6e7f3 10/07/2006 11:30 AM Diego Biurrun

Compilation fix, printf gets redefined to please_use_av_log.

Originally committed as revision 6574 to svn://svn.ffmpeg.org/ffmpeg/trunk

0eb59ddb 10/05/2006 12:23 AM Diego Biurrun

Switch idct_mmx_xvid.c from GPL to LGPL as permitted by the
author, Peter Ross (pross xvid org).

Originally committed as revision 6557 to svn://svn.ffmpeg.org/ffmpeg/trunk

2833fc46 10/01/2006 09:25 PM Loren Merritt

approximate qpel functions: sacrifice some quality for some decoding speed. enabled on B-frames with -lavdopts fast.

Originally committed as revision 6412 to svn://svn.ffmpeg.org/ffmpeg/trunk

62bb489b 09/27/2006 07:54 PM Måns Rullgård

add some #ifdef CONFIG_ENCODERS/DECODERS

Originally committed as revision 6356 to svn://svn.ffmpeg.org/ffmpeg/trunk

a4eb118a 09/21/2006 05:43 PM Loren Merritt

cosmetics (indentation)

Originally committed as revision 6313 to svn://svn.ffmpeg.org/ffmpeg/trunk

f469094c 09/21/2006 05:42 PM Loren Merritt

tweak ff_imdct_calc_3dn2

Originally committed as revision 6312 to svn://svn.ffmpeg.org/ffmpeg/trunk

ebbafcb4 09/21/2006 04:37 PM Loren Merritt

sse implementation of imdct.
patch mostly by Zuxy Meng (zuxy dot meng at gmail dot com)

Originally committed as revision 6311 to svn://svn.ffmpeg.org/ffmpeg/trunk

99aed7c8 09/19/2006 10:22 PM Luca Barbato

New single instruction math operation header

Originally committed as revision 6291 to svn://svn.ffmpeg.org/ffmpeg/trunk

2a2311be 09/14/2006 10:13 PM Aurelien Jacobs

disable vp3 mmx idct for theora files to avoid artifacts
(see theora-a4_v6-k250-s0_2.ogg)

Originally committed as revision 6253 to svn://svn.ffmpeg.org/ffmpeg/trunk

7f889a76 09/14/2006 12:38 AM Diego Biurrun

Remove the LGPL exception clause as discussed on ffmpeg-devel
and move the dependent code under CONFIG_GPL.

Originally committed as revision 6248 to svn://svn.ffmpeg.org/ffmpeg/trunk

1dac8fea 09/12/2006 08:58 PM Aurelien Jacobs

Enables back the mmx/sse optimized version of the vp3 idct.
It generates different md5sum than the reference C implementation,
but no visual difference, so enabled only when bitexact is not set.

Originally committed as revision 6241 to svn://svn.ffmpeg.org/ffmpeg/trunk

04d7f601 09/10/2006 02:02 PM Diego Biurrun

Add official LGPL license headers to the files that were missing them.

Originally committed as revision 6219 to svn://svn.ffmpeg.org/ffmpeg/trunk

0e176c3e 09/02/2006 11:10 PM Måns Rullgård

remove redundant declarations

Originally committed as revision 6153 to svn://svn.ffmpeg.org/ffmpeg/trunk

3e20143e 08/28/2006 09:33 AM Loren Merritt

mmx implementation of deblocking strength decision.
2-3% faster h264.

Originally committed as revision 6113 to svn://svn.ffmpeg.org/ffmpeg/trunk

1e4ecf26 08/18/2006 11:53 PM Loren Merritt

ff_fft_calc_3dn/3dn2/sse: convert intrinsics to inline asm.
2.5% faster fft, 0.5% faster vorbis.

Originally committed as revision 6023 to svn://svn.ffmpeg.org/ffmpeg/trunk

cf5aed5b 08/18/2006 10:43 AM Michael Niedermayer

simplify

Originally committed as revision 6020 to svn://svn.ffmpeg.org/ffmpeg/trunk