Statistics
| Branch: | Revision:

ffmpeg / libavcodec / x86 @ b10fa1bb

# Date Author Comment
b10fa1bb 12/03/2009 06:53 PM Loren Merritt

port ape dsp functions from sse2 to mmx
now requires yasm

Originally committed as revision 20722 to svn://svn.ffmpeg.org/ffmpeg/trunk

45213083 10/25/2009 03:09 AM Loren Merritt

s/movdqa/movaps/ in sse1 fft. (regression in r20293)

Originally committed as revision 20371 to svn://svn.ffmpeg.org/ffmpeg/trunk

b07781b6 10/18/2009 09:44 PM Loren Merritt

fix linking on systems with a function name prefix (10l in r20287)

Originally committed as revision 20294 to svn://svn.ffmpeg.org/ffmpeg/trunk

29e4edbb 10/18/2009 09:42 PM Loren Merritt

sync yasm macros to x264

Originally committed as revision 20293 to svn://svn.ffmpeg.org/ffmpeg/trunk

e17ccf60 10/18/2009 08:47 PM Loren Merritt

huffyuv: add some const qualifiers

Originally committed as revision 20290 to svn://svn.ffmpeg.org/ffmpeg/trunk

2f77923d 10/18/2009 08:10 PM Loren Merritt

simd add_hfyu_left_prediction
2.2x faster than C on conroe, 3.6x on penryn.
4-6% faster huffyuv decoding if using left or plane mode and yuv

Originally committed as revision 20287 to svn://svn.ffmpeg.org/ffmpeg/trunk

f4d608e3 10/18/2009 07:51 PM Justin Ruggles

add CONFIG_LPC to the build system for lpc dsputil functions. fixes build
problems when lpc.c is not compiled.

Originally committed as revision 20285 to svn://svn.ffmpeg.org/ffmpeg/trunk

fde82ca7 10/17/2009 09:00 PM Justin Ruggles

Move autocorrelation function from flacenc.c to lpc.c. Also rename the
corresponding dsputil functions and remove their dependency on the FLAC
encoder.
Fixes Issue1486.

Originally committed as revision 20266 to svn://svn.ffmpeg.org/ffmpeg/trunk

ec656755 10/01/2009 03:30 PM Reimar Döffinger

Use MANGLE in cavsdsp, the current version using "m" constraints will not
compile on e.g. OpenBSD due to running out of registers.

Originally committed as revision 20123 to svn://svn.ffmpeg.org/ffmpeg/trunk

00312109 09/30/2009 09:49 AM Reimar Döffinger

Replace several #ifdef PIC with the more obvious and correct
#if !HAVE_EBX_AVAILABLE, since all it does is avoid using ebx.

Originally committed as revision 20094 to svn://svn.ffmpeg.org/ffmpeg/trunk

35de5d24 09/27/2009 04:52 PM Måns Rullgård

cosmetics: fix indentation after previous commit

Originally committed as revision 20062 to svn://svn.ffmpeg.org/ffmpeg/trunk

952e8721 09/27/2009 04:51 PM Måns Rullgård

Drop unused args from vector_fmul_add_add, simpify code, and rename

The src3 and step arguments to vector_fmul_add_add() are always zero
and one, respectively. This removes these arguments from the function,
simplifies the code accordingly, and renames the function to better...

01b22147 09/20/2009 05:30 PM Måns Rullgård

Merge FFTContext and MDCTContext

Originally committed as revision 19931 to svn://svn.ffmpeg.org/ffmpeg/trunk

f4863213 09/15/2009 09:14 PM Måns Rullgård

Move per-arch fft init bits into the corresponding subdirs

Originally committed as revision 19864 to svn://svn.ffmpeg.org/ffmpeg/trunk

4e36a5b4 08/29/2009 04:55 PM Måns Rullgård

Move declarations of some mmx functions to dsputil_mmx.h

Originally committed as revision 19739 to svn://svn.ffmpeg.org/ffmpeg/trunk

9263a05a 08/27/2009 03:52 PM Vitor Sessak

Mark "i" parameter of vector_clipf_sse() as early-clobber

Originally committed as revision 19731 to svn://svn.ffmpeg.org/ffmpeg/trunk

50e23ae9 08/27/2009 03:38 PM Vitor Sessak

Mark parameter src of vector_clipf() as const

Originally committed as revision 19729 to svn://svn.ffmpeg.org/ffmpeg/trunk

0a68cd87 08/27/2009 02:49 PM Vitor Sessak

SSE optimized vector_clipf(). 10% faster TwinVQ decoding.

Originally committed as revision 19728 to svn://svn.ffmpeg.org/ffmpeg/trunk

3f87f39c 08/04/2009 07:42 AM John Adcock

Update x264 asm code to latest to add support for 64-bit Windows.
Use the new x86inc features to support 64-bit Windows on all non-x264 nasm
assembly code as well.
Patch by John Adcock, dscaler.johnad AT googlemail DOT com.
Win64 changes originally by Anton Mitrofanov....

9be6f0d2 07/29/2009 09:54 AM Diego Biurrun

Do not check for both CONFIG_VC1_DECODER and CONFIG_WMV3_DECODER,
the former depends upon the latter.

Originally committed as revision 19533 to svn://svn.ffmpeg.org/ffmpeg/trunk

99e5a9d1 07/22/2009 10:27 PM Diego Biurrun

Do not redundantly check for both CONFIG_THEORA_DECODER and CONFIG_VP3_DECODER.
The Theora decoder depends on the VP3 decoder.

Originally committed as revision 19492 to svn://svn.ffmpeg.org/ffmpeg/trunk

36904c4c 07/17/2009 09:07 AM Carl Eugen Hoyos

Icc 11.1 still does not align the stack pointer, disable some x264 functions.

Originally committed as revision 19454 to svn://svn.ffmpeg.org/ffmpeg/trunk

73b02e24 06/16/2009 05:33 PM Jason Garrett-Glaser

SSE version of clear_blocks

Originally committed as revision 19206 to svn://svn.ffmpeg.org/ffmpeg/trunk

4f717c69 06/16/2009 09:00 AM Jason Garrett-Glaser

idct_dc for VC-1/WMV3 decoder; ~11% faster decoding overall.
Includes mmx2 asm for the various functions.
Note that the actual idct still does not have an x86 SIMD implemtation.
For wmv3 files using regular idct, the decoder just falls back to simple_idct,...

74a841af 06/04/2009 11:25 PM Ramiro Polla

Replace more uses of attribute((aligned)) by DECLARE_ALIGNED.

Originally committed as revision 19089 to svn://svn.ffmpeg.org/ffmpeg/trunk

989b7181 06/03/2009 11:48 PM Ramiro Polla

Use fewer macros in x86-optimized mlpdsp.
Fixes compilation on 32-bit llvm which didn't allow a cast in an m operand.

Originally committed as revision 19086 to svn://svn.ffmpeg.org/ffmpeg/trunk

2b9969a9 05/30/2009 10:19 PM Alexander Strange

H264: Fix out of bounds reads in SSSE3 MC

Reading above src[-2] isn't safe, so move loads and palignr ahead
3 pixels to load starting at the first pixel actually used.

Fixes issue941.

Originally committed as revision 18999 to svn://svn.ffmpeg.org/ffmpeg/trunk

7c4c60e5 05/25/2009 01:52 AM Ramiro Polla

mlp: Use LABEL_MANGLE() to export label symbols from inside asm block.

Originally committed as revision 18935 to svn://svn.ffmpeg.org/ffmpeg/trunk

5624766d 05/23/2009 12:23 AM Ramiro Polla

MLP DSP functions x86-optimized.
12.59% overall speedup in x86_32
9.98% overall speedup in x86_64
compared to gcc 4.3.3

Originally committed as revision 18903 to svn://svn.ffmpeg.org/ffmpeg/trunk

c21c835b 04/15/2009 07:10 PM David Conrad

avg_ pixel functions need to use (dst+pix+1)>>1 to average with existing
pixels, not (dst+pix)>>1.
This makes the mmx functions bitexact with the C functions.

Originally committed as revision 18527 to svn://svn.ffmpeg.org/ffmpeg/trunk

9bf0fdf3 04/15/2009 02:25 AM David Conrad

VC1: extend MMX qpel MC to include MMX2 avg qpel

Originally committed as revision 18519 to svn://svn.ffmpeg.org/ffmpeg/trunk

8013da73 04/14/2009 11:56 PM David Conrad

VC1: add and use avg_no_rnd chroma MC functions

Originally committed as revision 18518 to svn://svn.ffmpeg.org/ffmpeg/trunk

c374691b 04/14/2009 11:55 PM David Conrad

Rename put_no_rnd_h264_chroma* to reflect its usage in VC1 only

Originally committed as revision 18517 to svn://svn.ffmpeg.org/ffmpeg/trunk

cfe67526 04/09/2009 09:53 PM Michael Niedermayer

Do not use SSE2 SAD for snow as it requires more alignment than can be
easily provided.
Fixes issue315.

Originally committed as revision 18404 to svn://svn.ffmpeg.org/ffmpeg/trunk

6b434361 04/04/2009 01:20 PM Stefano Sabatini

Rename FF_MM_MMXEXT to FF_MM_MMX2, for both clarity and consistency
with libswscale.

Originally committed as revision 18330 to svn://svn.ffmpeg.org/ffmpeg/trunk

0be9e73e 04/03/2009 02:03 PM Reimar Döffinger

Mark line_skip3 asm argument as output-only instead of using av_uninit.

Originally committed as revision 18327 to svn://svn.ffmpeg.org/ffmpeg/trunk

d7460a9c 04/03/2009 02:02 PM Reimar Döffinger

Mark put_signed_pixels_clamped_mmx output operands as early-clobber because
they are. Hopefully fixes some FATE errors, too.

Originally committed as revision 18326 to svn://svn.ffmpeg.org/ffmpeg/trunk

531a3d27 04/03/2009 02:01 PM Reimar Döffinger

Use DECLARE_ASM_CONST for non-global ff_vector128 constant used via MANGLE

Originally committed as revision 18325 to svn://svn.ffmpeg.org/ffmpeg/trunk

3dd65312 04/02/2009 09:02 PM Alex Converse

Rewrite put_signed_pixels_clamped_mmx() to eliminate mmx.h from dsputil_mmx.c.

Originally committed as revision 18319 to svn://svn.ffmpeg.org/ffmpeg/trunk

710441c2 04/01/2009 09:11 AM David Conrad

Add SSE4 detection support

Originally committed as revision 18302 to svn://svn.ffmpeg.org/ffmpeg/trunk

ecf05a59 03/19/2009 11:29 PM Matthieu Castet

Remove useless casting in asm "m" operand.

Patch by Matthieu Castet, castet D matthieu A free D fr

Originally committed as revision 18054 to svn://svn.ffmpeg.org/ffmpeg/trunk

d05f808d 02/23/2009 03:53 PM Zuxy Meng

Remove CPUID availability check on AMD64 as it's architectural.

Originally committed as revision 17543 to svn://svn.ffmpeg.org/ffmpeg/trunk

e27ad118 02/22/2009 11:35 AM Jason Garrett-Glaser

Convert x264 asm files to proper unix line breaks

Originally committed as revision 17524 to svn://svn.ffmpeg.org/ffmpeg/trunk

9bd5f59b 02/22/2009 11:33 AM Jason Garrett-Glaser

Remove (incorrect) filenames from x264 asm files, add descriptions.

Originally committed as revision 17523 to svn://svn.ffmpeg.org/ffmpeg/trunk

b6188c5a 02/13/2009 12:57 AM Alexander Strange

Put dispatch_tab in the rodata section for macho64.
This fixes linking shared libavcodec, since the linker doesn't allow
text relocations in shared libraries under Darwin/x86_64.

Based on a patch by Art Clarke (aclarke xuggle com)

Originally committed as revision 17197 to svn://svn.ffmpeg.org/ffmpeg/trunk

ecb24904 02/13/2009 12:02 AM Zuxy Meng

add SSE2 version of vp6_filter_diag
original patch by Zuxy Meng zuxy.meng at gmail dot com

Originally committed as revision 17195 to svn://svn.ffmpeg.org/ffmpeg/trunk

6af3c226 02/12/2009 11:52 PM Sebastien Lucas

add MMX version of vp6_filter_diag
original patch by Sebastien Lucas sebastien.lucas at gmail dot com

Originally committed as revision 17194 to svn://svn.ffmpeg.org/ffmpeg/trunk

5110b25e 02/12/2009 11:48 PM Aurelien Jacobs

convert ff_pw_64 into an xmm_reg for future use in vp6 sse code

Originally committed as revision 17192 to svn://svn.ffmpeg.org/ffmpeg/trunk

15c13dde 02/11/2009 04:12 PM Diego Biurrun

Fix wrong file name in header, noticed by David DeHaven, dave sagetv com.

Originally committed as revision 17158 to svn://svn.ffmpeg.org/ffmpeg/trunk

d3a4b4e0 02/11/2009 11:16 AM Diego Biurrun

Add check whether the compiler/assembler supports 10 or more operands.
thanks to Loren for some help with the asm statements

Originally committed as revision 17151 to svn://svn.ffmpeg.org/ffmpeg/trunk

e090c70f 02/10/2009 04:45 PM Stefan Gehrer

avoid duplicating dsputil's clear_block

Originally committed as revision 17135 to svn://svn.ffmpeg.org/ffmpeg/trunk

ea399a87 02/09/2009 11:32 AM Diego Biurrun

Remove svn:executable property from source file.

Originally committed as revision 17098 to svn://svn.ffmpeg.org/ffmpeg/trunk

3daa434a 02/08/2009 05:45 PM Loren Merritt

ff_add_hfyu_median_prediction_mmx2
overall ffvhuff decoding speedup: 28% on core2, 25% on k8.

Originally committed as revision 17059 to svn://svn.ffmpeg.org/ffmpeg/trunk

6166516d 02/08/2009 05:44 PM Loren Merritt

re-enable mid_pred asm on x86_64. (broke in r16681)

Originally committed as revision 17058 to svn://svn.ffmpeg.org/ffmpeg/trunk

353f87b8 02/08/2009 06:35 AM Baptiste Coudurier

fix typo in h264dsp_mmx (no effect currently as the function is not used), approved by Dark Shikari on IRC

Originally committed as revision 17046 to svn://svn.ffmpeg.org/ffmpeg/trunk

bad5537e 02/01/2009 02:00 AM Diego Biurrun

Use full internal pathname in doxygen @file directives.
Otherwise doxygen complains about ambiguous filenames when files exist
under the same name in different subdirectories.

Originally committed as revision 16912 to svn://svn.ffmpeg.org/ffmpeg/trunk

137ae327 01/26/2009 03:40 AM David Conrad

Workaround for gcc 3.4 to align sh properly

Originally committed as revision 16797 to svn://svn.ffmpeg.org/ffmpeg/trunk

406792e7 01/19/2009 03:46 PM Diego Biurrun

cosmetics: Remove pointless period after copyright statement non-sentences.

Originally committed as revision 16684 to svn://svn.ffmpeg.org/ffmpeg/trunk

199436b9 01/18/2009 10:57 PM Aurelien Jacobs

moves mid_pred() into mathops.h (with arch specific code split by directory)

Originally committed as revision 16681 to svn://svn.ffmpeg.org/ffmpeg/trunk

49fb20cb 01/14/2009 05:19 PM Aurelien Jacobs

replace all occurrence of ENABLE_ by the corresponding CONFIG_, HAVE_ or ARCH_
and remove all ENABLE_ definitions.

Originally committed as revision 16600 to svn://svn.ffmpeg.org/ffmpeg/trunk

b250f9c6 01/13/2009 11:44 PM Aurelien Jacobs

Change semantic of CONFIG_*, HAVE_* and ARCH_*.
They are now always defined to either 0 or 1.

Originally committed as revision 16590 to svn://svn.ffmpeg.org/ffmpeg/trunk

1bb04d5a 01/07/2009 11:38 PM Ramiro Polla

configure: allow to disable sse code.
Based on patch by Michael Kostylev <mik at it-1 dot ru>

Originally committed as revision 16490 to svn://svn.ffmpeg.org/ffmpeg/trunk

c47d146b 01/05/2009 01:57 PM Diego Biurrun

Add missing 'void' keyword to parameterless function declarations.

Originally committed as revision 16436 to svn://svn.ffmpeg.org/ffmpeg/trunk

21ff7689 01/04/2009 01:36 AM Mathieu Velten

Use H264 MMX chroma functions to accelerate RV40 decoding.

Patch by Mathieu Velten (matmaul A gmail)

Originally committed as revision 16419 to svn://svn.ffmpeg.org/ffmpeg/trunk

37fed100 01/03/2009 12:46 AM Jason Garrett-Glaser

Add x264 SSE2 iDCT functions to H.264 decoder.

Originally committed as revision 16409 to svn://svn.ffmpeg.org/ffmpeg/trunk

2c67c659 12/28/2008 07:40 PM Carl Eugen Hoyos

Fix h264 decoding on SSE2 cores with icc compilation.

Originally committed as revision 16373 to svn://svn.ffmpeg.org/ffmpeg/trunk

c1fc7036 12/26/2008 12:19 AM Jason Garrett-Glaser

Fix compilation without optimization under 64-bit with x264 deblock asm enabled.

Originally committed as revision 16313 to svn://svn.ffmpeg.org/ffmpeg/trunk

a6493a8f 12/22/2008 09:12 AM Diego Biurrun

Rename libavcodec/i386/ --> libavcodec/x86/.
It contains optimizations that are not specific to i386 and
libavutil uses this naming scheme already.

Originally committed as revision 16270 to svn://svn.ffmpeg.org/ffmpeg/trunk