Statistics
| Branch: | Revision:

ffmpeg / libavcodec / i386 / dsputil_mmx.c @ 5509bffa

History | View | Annotate | Download (115 KB)

# Date Author Comment
5509bffa 01/12/2006 10:43 PM Diego Biurrun

Update licensing information: The FSF changed postal address.

Originally committed as revision 4842 to svn://svn.ffmpeg.org/ffmpeg/trunk

bb270c08 12/22/2005 01:10 AM Diego Biurrun

COSMETICS: tabs --> spaces, some prettyprinting

Originally committed as revision 4764 to svn://svn.ffmpeg.org/ffmpeg/trunk

115329f1 12/17/2005 06:14 PM Diego Biurrun

COSMETICS: Remove all trailing whitespace.

Originally committed as revision 4749 to svn://svn.ffmpeg.org/ffmpeg/trunk

ea15df80 11/12/2005 05:23 AM Loren Merritt

use sse16_sse2() in nsse

Originally committed as revision 4688 to svn://svn.ffmpeg.org/ffmpeg/trunk

a6624e21 10/27/2005 06:45 AM Loren Merritt

faster h264_chroma_mc8_mmx, added h264_chroma_mc4_mmx.
2-4% overall speedup.

Originally committed as revision 4666 to svn://svn.ffmpeg.org/ffmpeg/trunk

b926572a 10/09/2005 11:38 PM Loren Merritt

h264 mmx weighted prediction. up to 3% overall speedup.

Originally committed as revision 4630 to svn://svn.ffmpeg.org/ffmpeg/trunk

5693c083 09/30/2005 02:31 AM Loren Merritt

sse2 16x16 sum squared diff (306=>268 cycles on a K8)
faster 8x8 mmx ssd (77=>70 cycles)

Originally committed as revision 4623 to svn://svn.ffmpeg.org/ffmpeg/trunk

12e96681 09/21/2005 09:17 PM Michael Niedermayer

replace a few mov + psrlq with pshufw, there are more cases which could benefit from this but they would require us to duplicate some functions ...
the trick is from various places (my own code in libpostproc, a patch on the x264 list, ...)

Originally committed as revision 4608 to svn://svn.ffmpeg.org/ffmpeg/trunk

cd7af76d 09/10/2005 07:30 PM Reimar Döffinger

Fix compile without CONFIG_GPL, misplaced #endif caused a missing }.

Originally committed as revision 4575 to svn://svn.ffmpeg.org/ffmpeg/trunk

84740d59 09/10/2005 05:01 PM Michael Niedermayer

xvids mmx&mmx2 idcts
needed to decode xvid without some minor artefacts
under #ifdef CONFIG_GPL of course

Originally committed as revision 4572 to svn://svn.ffmpeg.org/ffmpeg/trunk

79396ac6 09/06/2005 09:25 PM Måns Rullgård

Kill some compiler warnings. Compiled code verified identical after changes.

Originally committed as revision 4567 to svn://svn.ffmpeg.org/ffmpeg/trunk

d2bb7db1 06/02/2005 08:45 PM Loren Merritt

sort H.264 mmx dsp functions into their own file

Originally committed as revision 4338 to svn://svn.ffmpeg.org/ffmpeg/trunk

c26ae41d 06/01/2005 09:19 PM Michael Niedermayer

adding a few const

Originally committed as revision 4337 to svn://svn.ffmpeg.org/ffmpeg/trunk

1d62fc85 05/31/2005 10:48 PM Loren Merritt

MMX for H.264 iDCT (adapted from x264)

Originally committed as revision 4329 to svn://svn.ffmpeg.org/ffmpeg/trunk

3072f0cb 05/25/2005 08:05 AM Zoltán Hidvégi

MMX code for (put|avg)_h264_chroma_mc8

Originally committed as revision 4305 to svn://svn.ffmpeg.org/ffmpeg/trunk

5cf08f23 05/18/2005 09:17 AM Loren Merritt

H.264 deblocking optimizations (mmx for chroma_bS4 case, convert existing cases to 8-bit math)

Originally committed as revision 4271 to svn://svn.ffmpeg.org/ffmpeg/trunk

5773a746 05/17/2005 06:28 PM Michael Niedermayer

porting the mmx&sse2 (sse2 untested) vp3 idcts to the lavc idct API

Originally committed as revision 4260 to svn://svn.ffmpeg.org/ffmpeg/trunk

b178f758 05/17/2005 09:11 AM Michael Niedermayer

disabling vp3 mmx&mmx2 idcts, they must be ported over to the lavc idct API, ill port the vp3 c idct

Originally committed as revision 4255 to svn://svn.ffmpeg.org/ffmpeg/trunk

c998bdd9 05/08/2005 06:48 PM Michael Niedermayer

fix PIC

Originally committed as revision 4204 to svn://svn.ffmpeg.org/ffmpeg/trunk

42251a2a 04/25/2005 01:01 AM Loren Merritt

MMX for H.264 deblocking filter

Originally committed as revision 4158 to svn://svn.ffmpeg.org/ffmpeg/trunk

4d9ae03b 02/07/2005 05:09 PM Martin Drab

optimization and gcc 4.0 bug workaround patch by (Martin Drab >drab kepler.fjfi.cvut cz<)

Originally committed as revision 3945 to svn://svn.ffmpeg.org/ffmpeg/trunk

053dea12 10/11/2004 02:19 AM Aurelien Jacobs

adapting existing mmx/mmx2/sse/3dnow optimizations so they work on x86_64 patch by (Aurelien Jacobs <aurel at gnuage dot org>)

Originally committed as revision 3578 to svn://svn.ffmpeg.org/ffmpeg/trunk

178fcca8 09/25/2004 11:18 PM Michael Niedermayer

1/2 resolution decoding

Originally committed as revision 3507 to svn://svn.ffmpeg.org/ffmpeg/trunk

e69538fa 09/08/2004 06:49 PM Michael Niedermayer

h264_qpel8_hv_lowpass_mmx2/3dnow

Originally committed as revision 3446 to svn://svn.ffmpeg.org/ffmpeg/trunk

e772bb8a 09/08/2004 06:25 PM Michael Niedermayer

h264_qpel4_hv_lowpass_mmx2/3dnow
alignment fixes

Originally committed as revision 3445 to svn://svn.ffmpeg.org/ffmpeg/trunk

56d8bd56 09/08/2004 12:52 AM Michael Niedermayer

optimization

Originally committed as revision 3440 to svn://svn.ffmpeg.org/ffmpeg/trunk

a6e39f45 09/07/2004 10:56 PM Michael Niedermayer

optimization

Originally committed as revision 3439 to svn://svn.ffmpeg.org/ffmpeg/trunk

ed8ffdf4 09/07/2004 05:22 PM Michael Niedermayer

optimization

Originally committed as revision 3438 to svn://svn.ffmpeg.org/ffmpeg/trunk

437525c4 09/07/2004 01:48 AM Michael Niedermayer

h264 luma motion compensation in mmx2/3dnow

Originally committed as revision 3437 to svn://svn.ffmpeg.org/ffmpeg/trunk

d6af6b03 09/06/2004 03:17 AM Michael Niedermayer

10000l fix and use more mmx2/3dnow code for mpeg4 qpel which has been written and commited long time ago but appearently never used, qpel motion compensation is 5% faster
now

Originally committed as revision 3435 to svn://svn.ffmpeg.org/ffmpeg/trunk

1ec4df0f 06/08/2004 02:13 AM Michael Niedermayer

sse8 and nsse in mmx

Originally committed as revision 3206 to svn://svn.ffmpeg.org/ffmpeg/trunk

e96682e6 05/18/2004 05:09 PM Michael Niedermayer

some of the warning fixes by (Michael Roitzsch <mroi at users dot sourceforge dot net>)

Originally committed as revision 3140 to svn://svn.ffmpeg.org/ffmpeg/trunk

7daabccb 04/27/2004 04:06 AM Mike Melanson

move the 0x80 vector outside of the function, thus saving the compiler
the trouble of having to initialize each byte on the stack individually

Originally committed as revision 3083 to svn://svn.ffmpeg.org/ffmpeg/trunk

f9ed9d85 04/27/2004 03:58 AM Mike Melanson

separate out put_signed_pixels_clamped() into its own function and
implement an optimized MMX version of the function

Originally committed as revision 3082 to svn://svn.ffmpeg.org/ffmpeg/trunk

116824d0 04/26/2004 12:20 AM Mike Melanson

reorganize and simplify the VP3 IDCT stuff

Originally committed as revision 3071 to svn://svn.ffmpeg.org/ffmpeg/trunk

38acbc3c 04/25/2004 03:33 AM Mike Melanson

hook up support for SSE2-optimized VP3 IDCT

Originally committed as revision 3064 to svn://svn.ffmpeg.org/ffmpeg/trunk

01456e8e 03/08/2004 02:18 AM Mike Melanson

use optimized VP3 functions where appropriate

Originally committed as revision 2857 to svn://svn.ffmpeg.org/ffmpeg/trunk

5c0513bd 02/29/2004 10:10 PM Dmitry Baryshkov

attribute used patch by (mitya at school dot ioffe dot ru (Dmitry Baryshkov))

Originally committed as revision 2830 to svn://svn.ffmpeg.org/ffmpeg/trunk

364a1797 02/06/2004 02:12 AM Michael Niedermayer

quantizer noise shaping optimization

Originally committed as revision 2750 to svn://svn.ffmpeg.org/ffmpeg/trunk

fb85b027 01/31/2004 08:41 PM Mike Melanson

vsad16_mmx2 only applies if encoders are turned on

Originally committed as revision 2736 to svn://svn.ffmpeg.org/ffmpeg/trunk

8fd19ab2 01/29/2004 01:15 AM Michael Niedermayer

SSE2 fdct by (Balatoni Denes <pnis at coder dot hu>)

Originally committed as revision 2729 to svn://svn.ffmpeg.org/ffmpeg/trunk

8f2ab833 01/10/2004 04:04 PM Michael Niedermayer

copyright year update of the files i touched and remembered, things look annoyingly unmaintained otherwise

Originally committed as revision 2686 to svn://svn.ffmpeg.org/ffmpeg/trunk

622348f9 01/05/2004 10:57 PM Michael Niedermayer

interlaced dct decision cleanup
function moved to dspcontext
mmx&mmx2 optimized
change SSE -> SAD as default (better quality)
vbv buffer size command line option in kbyte

Originally committed as revision 2669 to svn://svn.ffmpeg.org/ffmpeg/trunk

bb198e19 12/30/2003 04:07 PM Michael Niedermayer

interlaced motion estimation
interlaced mpeg2 encoding
P & B frames
rate distored interlaced mb decission
alternate scantable support
4mv encoding fixes (thats also why the regression tests change)
passing height to most dsp functions
interlaced mpeg4 encoding (no direct mode MBs yet)...

0c11692a 12/13/2003 02:14 AM Aurelien Jacobs

Here are juste two added #ifdef CONFIG_ENCODERS to allow
libavcodec to compile without CONFIG_ENCODERS defined
patch by (Aurelien Jacobs <aurel at gnuage dot org>)

Originally committed as revision 2604 to svn://svn.ffmpeg.org/ffmpeg/trunk

3615e2be 12/02/2003 10:02 PM Michael Niedermayer

h263_h_loop_filter_mmx

Originally committed as revision 2553 to svn://svn.ffmpeg.org/ffmpeg/trunk

359f98de 12/02/2003 08:28 PM Michael Niedermayer

h263_v_loop_filter_mmx

Originally committed as revision 2552 to svn://svn.ffmpeg.org/ffmpeg/trunk

01a2ddaf 10/23/2003 11:24 PM Alex Beregszaszi

1l

Originally committed as revision 2424 to svn://svn.ffmpeg.org/ffmpeg/trunk

cf3bf5bb 10/23/2003 01:22 PM Michael Niedermayer

minor mmx2 optimization if the dct

Originally committed as revision 2423 to svn://svn.ffmpeg.org/ffmpeg/trunk

764ef400 10/14/2003 04:15 AM Mike Melanson

disable encoders where appropriate (patch courtesy of BERO
<bero at geocities.co.jp>)

Originally committed as revision 2375 to svn://svn.ffmpeg.org/ffmpeg/trunk

84705403 10/13/2003 05:27 PM Michael Niedermayer

mmx2 optimization of huffyuv median encoding

Originally committed as revision 2372 to svn://svn.ffmpeg.org/ffmpeg/trunk

4fb518c3 06/21/2003 01:11 AM Michael Niedermayer

the return of the idct with 16bit output by ("Ivan Kalvachev" <ivan at cacad dot com>)

Originally committed as revision 1983 to svn://svn.ffmpeg.org/ffmpeg/trunk

5fd74135 05/07/2003 11:32 PM Michael Niedermayer

more #ifdef CONFIG_ENCODERS by (Wolfgang Hesseler <qv at multimediaware dot com>)

Originally committed as revision 1843 to svn://svn.ffmpeg.org/ffmpeg/trunk

7db52b63 04/14/2003 09:58 AM Michael Niedermayer

10l (hadamard fix)

Originally committed as revision 1771 to svn://svn.ffmpeg.org/ffmpeg/trunk

8e0a3db7 03/26/2003 11:29 AM Michael Niedermayer

faster hadamard transform

Originally committed as revision 1707 to svn://svn.ffmpeg.org/ffmpeg/trunk

e5247b5f 03/10/2003 11:22 AM Zdenek Kabelac
  • oooooops - sorry for this one - wrong logic
    empty dsp_mask - no change
    FORCE - add, no FORCE - enabled bits are masked out

Originally committed as revision 1659 to svn://svn.ffmpeg.org/ffmpeg/trunk

63f60826 03/07/2003 01:49 PM Zdenek Kabelac
  • support FF_MM_FORCE

Originally committed as revision 1643 to svn://svn.ffmpeg.org/ffmpeg/trunk

b0368839 03/03/2003 02:54 PM Michael Niedermayer

MpegEncContext.(i)dct_* -> DspContext.(i)dct_*
bitexact cleanup

Originally committed as revision 1617 to svn://svn.ffmpeg.org/ffmpeg/trunk

2720569a 02/11/2003 04:57 PM Zdenek Kabelac
  • prototypes at one place

Originally committed as revision 1579 to svn://svn.ffmpeg.org/ffmpeg/trunk

0c1a9eda 02/11/2003 04:35 PM Zdenek Kabelac
  • UINTX -> uintx_t INTX -> intx_t

Originally committed as revision 1578 to svn://svn.ffmpeg.org/ffmpeg/trunk

5c91a675 02/10/2003 09:35 AM Zdenek Kabelac
  • static,const,compiler warning cleanup

Originally committed as revision 1567 to svn://svn.ffmpeg.org/ffmpeg/trunk

2a006cd3 01/11/2003 10:32 PM Felix von Leitner

sse16 & pix_norm1 optimization patch by (Felix von Leitner <felix-ffmpeg at fefe dot de>) (with some modifications)

Originally committed as revision 1450 to svn://svn.ffmpeg.org/ffmpeg/trunk

db794953 01/09/2003 08:42 PM Michael Niedermayer

qpel fix

Originally committed as revision 1426 to svn://svn.ffmpeg.org/ffmpeg/trunk

5a508a98 01/07/2003 01:00 PM Michael Niedermayer

PIC / ebx fix

Originally committed as revision 1408 to svn://svn.ffmpeg.org/ffmpeg/trunk

0b093b6f 01/07/2003 11:59 AM Michael Niedermayer

more gcc bug workarounds

Originally committed as revision 1407 to svn://svn.ffmpeg.org/ffmpeg/trunk

c296f66b 01/06/2003 09:55 AM Michael Niedermayer

1000l (push & esp) using mangle now ...

Originally committed as revision 1401 to svn://svn.ffmpeg.org/ffmpeg/trunk

3178ee4c 01/05/2003 08:59 PM Michael Niedermayer

qpel in b frames bugfixes

Originally committed as revision 1398 to svn://svn.ffmpeg.org/ffmpeg/trunk

3643bd9c 01/05/2003 07:53 PM Michael Niedermayer

slowdown / gcc 2.95.* bug workaround (this should be reversed as soon as gcc 2.95.* support is droped)

Originally committed as revision 1397 to svn://svn.ffmpeg.org/ffmpeg/trunk

826f429a 01/05/2003 03:57 PM Michael Niedermayer

qpel in mmx2/3dnow
qpel refinement quality parameter

Originally committed as revision 1393 to svn://svn.ffmpeg.org/ffmpeg/trunk

1457ab52 12/27/2002 11:51 PM Michael Niedermayer

qpel encoding
4mv+b frames encoding finally fixed
chroma ME
5 comparission functions for ME
b frame encoding speedup
wmv2 codec (unfinished)
user specified diamond size for EPZS

Originally committed as revision 1365 to svn://svn.ffmpeg.org/ffmpeg/trunk

11f18faf 11/14/2002 07:20 PM Michael Niedermayer

huffyuv

Originally committed as revision 1211 to svn://svn.ffmpeg.org/ffmpeg/trunk

eb4b3dd3 11/11/2002 09:40 AM Zdenek Kabelac
  • using DSPContext - so each codec could use its local (sub)set of CPU extension

Originally committed as revision 1194 to svn://svn.ffmpeg.org/ffmpeg/trunk

2ad1516a 09/29/2002 10:44 PM Michael Niedermayer

idct permutation cleanup, idct can be selected per context now
fixing some threadunsafe code

Originally committed as revision 980 to svn://svn.ffmpeg.org/ffmpeg/trunk

084c726b 09/20/2002 10:03 AM Michael Niedermayer

pix_sum16_mmx()

Originally committed as revision 961 to svn://svn.ffmpeg.org/ffmpeg/trunk

b3184779 09/11/2002 12:39 PM Michael Niedermayer

put/avg_pixels16
fixing 2 small qpel bugs

Originally committed as revision 915 to svn://svn.ffmpeg.org/ffmpeg/trunk

1565dabc 09/01/2002 07:26 AM Lennert Buytenhek

be less verbose patch by (Lennert Buytenhek <buytenh at math dot leidenuniv dot nl>)

Originally committed as revision 889 to svn://svn.ffmpeg.org/ffmpeg/trunk

e7fce5e9 08/31/2002 02:17 PM Michael Niedermayer

bitexact hack for the simple mmx idct

Originally committed as revision 886 to svn://svn.ffmpeg.org/ffmpeg/trunk

28db7fce 08/29/2002 11:55 PM Michael Niedermayer

slow but accurate integer dct from IJG (should be ok with the LGPL as the old DCT is the fast integer DCT from IJG)
per context DCT selection

Originally committed as revision 878 to svn://svn.ffmpeg.org/ffmpeg/trunk

4afeaec9 07/21/2002 12:48 AM Michael Niedermayer

pix_abs mmx2 isnt bit-exact ...

Originally committed as revision 781 to svn://svn.ffmpeg.org/ffmpeg/trunk

39825f31 06/03/2002 05:35 PM Zdenek Kabelac
  • reimplemented remaing avg_ pixel functions
  • modified PAVGB mmx macro to select mmx register with FE constant

Originally committed as revision 657 to svn://svn.ffmpeg.org/ffmpeg/trunk

fca0f0e5 05/30/2002 03:14 PM Zdenek Kabelac
  • removed MANGLE from macros for setting constants
  • using MOVQ_WONE/MOVQ_BFE as two instruction instead of static memory value access
    as its always faster
  • PAVGB_MMX macro is using now mm6 -> mm7 is unmodified
  • replaced original pixels_xy2_mmx with new faster and equal implementation...
6aa6ea8e 05/29/2002 07:57 PM Zdenek Kabelac
  • reenabled original xy2 put routine - rounding error is really bad with
    the new code
  • added PAVGP macros for parallel processing to safe few more cycles on
    celerons

Originally committed as revision 625 to svn://svn.ffmpeg.org/ffmpeg/trunk

91abb473 05/29/2002 05:16 PM Zdenek Kabelac
  • code with new PAVGB for MMX only CPU splited into separate file
    and being compiled in the same way as _avg.h
  • PAVG_MMX macros accept also output parameter
  • implemented faster put_pixels_xy2, but it has slightly smaller precission.
    But there is not visible difference in the image quality - might be eventualy...
def60345 05/29/2002 02:29 PM Zdenek Kabelac

Originally committed as revision 623 to svn://svn.ffmpeg.org/ffmpeg/trunk

5f623001 05/27/2002 11:32 AM Zdenek Kabelac
  • removed TESTCPU_MAIN - not needed for testing

Originally committed as revision 615 to svn://svn.ffmpeg.org/ffmpeg/trunk

ff4ec49e 05/25/2002 10:45 PM Fabrice Bellard

license/copyright change

Originally committed as revision 599 to svn://svn.ffmpeg.org/ffmpeg/trunk

52af45ad 05/23/2002 04:19 PM Zdenek Kabelac
  • baling 8 seems to have the same speed

Originally committed as revision 575 to svn://svn.ffmpeg.org/ffmpeg/trunk

fbdedc17 05/23/2002 03:57 PM Zdenek Kabelac
  • always baling even though it seems to produce sometime worse
    results then for -fPIC compilation - don't ask me why...

Originally committed as revision 573 to svn://svn.ffmpeg.org/ffmpeg/trunk

31ddcf98 05/23/2002 03:48 PM Zdenek Kabelac
  • cleanup for put_pixels_mmx

Originally committed as revision 572 to svn://svn.ffmpeg.org/ffmpeg/trunk

ec9a41f5 05/23/2002 01:34 PM Zdenek Kabelac
  • somewhat more advanced test suit for dsp code - for now i386 only
    cd test ; make dsptest ; ./dsptest

Originally committed as revision 568 to svn://svn.ffmpeg.org/ffmpeg/trunk

6efad6fd 05/23/2002 12:17 PM Zdenek Kabelac
  • align the source initialy address

Originally committed as revision 567 to svn://svn.ffmpeg.org/ffmpeg/trunk

dcb9cd4b 05/23/2002 10:09 AM Zdenek Kabelac
  • added simple test main - see comments about how to
    compile - should be probably made as a regression test

Originally committed as revision 565 to svn://svn.ffmpeg.org/ffmpeg/trunk

4f12a497 05/20/2002 04:25 PM Fabrice Bellard

removed unused code

Originally committed as revision 543 to svn://svn.ffmpeg.org/ffmpeg/trunk

61a4e8ae 05/18/2002 10:59 PM Fabrice Bellard

removed debug function

Originally committed as revision 519 to svn://svn.ffmpeg.org/ffmpeg/trunk

a7bd8797 05/17/2002 01:01 PM Michael Niedermayer

shared lib support (req by kabi) ...

Originally committed as revision 510 to svn://svn.ffmpeg.org/ffmpeg/trunk

607dce96 05/17/2002 01:04 AM Michael Niedermayer

hopefully faster mmx2&3dnow MC

Originally committed as revision 506 to svn://svn.ffmpeg.org/ffmpeg/trunk

cd8e5f96 04/24/2002 12:18 PM Zdenek Kabelac
  • fixes problem with -funroll-loops and buggy gcc compiler

Originally committed as revision 421 to svn://svn.ffmpeg.org/ffmpeg/trunk

9dbcbd92 04/17/2002 04:32 AM Michael Niedermayer

fixed mpeg4 time stuff on encoding
mpeg4 b-frame enoding support
removed old, out-commented ratecontrol
reuse motion compensation code between encoding & decoding
prefix newly added global functions with ff_ to reduce namespace polution
b-frame ME (unfinished, but working)...

649c00c9 03/28/2002 01:41 PM Michael Niedermayer

sizeof(s->block) isnt 64*6*2 anymore bugfix
mpeg12 decoding optimization

Originally committed as revision 364 to svn://svn.ffmpeg.org/ffmpeg/trunk

ba6802de 03/27/2002 09:25 PM Michael Niedermayer

4MV motion estimation (not finished yet)
SAD functions rewritten (8x8 support & MMX2 optimizations)
HQ inter/intra decission
msmpeg4 encoding bugfix (MV where too long)

Originally committed as revision 362 to svn://svn.ffmpeg.org/ffmpeg/trunk