ffmpeg / libavcodec / x86 @ 888fa31e

# Date Author Comment
888fa31e 05/14/2011 07:32 PM Diego Biurrun

Fix FSF address copy paste error in some license headers.

5705b020 05/11/2011 06:09 PM Jason Garrett-Glaser

10-bit H.264 x86 chroma v loopfilter asm

Also delete some unused deblock asm macros.

9f3d6ca4 05/11/2011 03:02 AM Jason Garrett-Glaser

Port x86 10-bit H.264 deblock asm from x264

8ad77b65 05/11/2011 03:01 AM Jason Garrett-Glaser

Update x86 H.264 deblock asm

Includes AVX versions from x264.

86b29553 05/10/2011 12:39 PM Ronald S. Bultje

h264dsp_mmx: place bracket outside #if/#endif block.

Should fix compile on systems missing yasm/nasm.

19a0729b 05/10/2011 11:24 AM Oskar Arvidsson

Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 decoder.

This patch lets e.g. dsputil_init choose dsp functions with respect to
the bit depth to decode. The naming scheme of bit depth dependent
functions is <base name>_<bit depth>[_] (i.e. the old...
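The naming scheme in the entry above can be sketched with a template-style macro. This is a minimal illustration only, not code from the patch: `fill_pixels`, `DEFINE_FILL_PIXELS`, `fill_pixels_fn`, and `select_fill_pixels` are hypothetical names, and the optional suffix that is cut off in the commit message is not modelled.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical template macro: expands to one function per bit depth,
 * named <base name>_<bit depth>_c (here fill_pixels_8_c, fill_pixels_10_c). */
#define DEFINE_FILL_PIXELS(depth, pixel_t)                              \
static void fill_pixels_ ## depth ## _c(void *dst, int value, size_t n) \
{                                                                       \
    pixel_t *p = dst;                                                   \
    for (size_t i = 0; i < n; i++)                                      \
        p[i] = (pixel_t)value;                                          \
}

DEFINE_FILL_PIXELS(8,  uint8_t)   /* 8-bit samples fit in one byte */
DEFINE_FILL_PIXELS(10, uint16_t)  /* 10-bit samples need two bytes */

typedef void (*fill_pixels_fn)(void *dst, int value, size_t n);

/* Hypothetical init-time selection keyed on the bit depth to decode. */
static fill_pixels_fn select_fill_pixels(int bit_depth)
{
    return bit_depth > 8 ? fill_pixels_10_c : fill_pixels_8_c;
}
```

A caller would pick the function pointer once at init time and then use it for every block, so the per-pixel code never has to branch on the bit depth.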

a734fa57 04/29/2011 06:01 PM Diego Biurrun

Remove disabled non-optimized code variants.

9d35fa52 04/26/2011 04:25 PM Vitor Sessak

Add AVX FFT implementation.

Signed-off-by: Reinhard Tartler <>

33cbfa6f 04/26/2011 04:18 PM Vitor Sessak

Update x86inc.asm from x264 to allow AVX emulation using SSE and MMX.

Signed-off-by: Reinhard Tartler <>

1500be13 03/26/2011 09:45 PM Alexander Strange

dsputil: allow skipping the drawing of top/bottom edges.

e6e98234 03/23/2011 01:08 AM Justin Ruggles

Add apply_window_int16() to DSPContext with x86-optimized versions and use it
in the ac3_fixed encoder.

0aded948 03/20/2011 05:15 PM Mans Rullgard

Move dct and rdft definitions to separate files

This leaves fft.h with only the core FFT and MDCT definitions
thus making it more manageable.

Signed-off-by: Mans Rullgard <>

2912e87a 03/19/2011 01:33 PM Mans Rullgard

Replace FFmpeg with Libav in licence headers

Signed-off-by: Mans Rullgard <>

0f999cfd 03/17/2011 08:46 PM Justin Ruggles

ac3enc: add float_to_fixed24() with x86-optimized versions to AC3DSPContext
and use in scale_coefficients() for the floating-point AC-3 encoder.

79414257 03/16/2011 12:49 AM Justin Ruggles

mathops: fix MULL when the compiler does not inline the function.

If the function is not inlined, an immediate cannot be used for the
shift parameter, so the %cl register must be used instead in that case.

This fixes compilation for x86-32 using gcc with --disable-optimizations.

aaff3b31 03/15/2011 05:43 PM Justin Ruggles

mathops: change "g" constraint to "rm" in x86-32 version of MUL64.

The 1-arg imul instruction cannot take an immediate argument, only a register
or memory argument.

b181b8fb 03/15/2011 05:43 PM Justin Ruggles

mathops: convert MULL/MULH/MUL64 to inline functions rather than macros.

This fixes unexpected name collisions that were occurring with variables
declared within the macros.
It also fixes the fate-acodec-ac3_fixed regression test on x86-32.
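The kind of name collision described above can be sketched in portable C. This is not the x86 inline-asm code the commit touches: `MULH_MACRO`, `mull`, `mulh`, and `mul64` are hypothetical lower-case stand-ins written for illustration, and the right shift of a negative value is assumed to be arithmetic, as it is on common compilers.

```c
#include <stdint.h>

/* Macro version (hypothetical): the temporary `t` lives in the caller's
 * scope. If the caller passes an expression that mentions its own variable
 * `t`, that name is captured by the inner declaration and the macro reads
 * an uninitialized value. */
#define MULH_MACRO(result, a, b)                 \
    do {                                         \
        int64_t t = (int64_t)(a) * (b);          \
        (result) = (int)(t >> 32);               \
    } while (0)

/* Inline-function versions: parameters and locals have their own scope,
 * so no caller variable name can collide with them. */
static inline int mull(int a, int b, unsigned shift)
{
    return (int)(((int64_t)a * b) >> shift);
}

static inline int mulh(int a, int b)
{
    return (int)(((int64_t)a * b) >> 32);  /* high 32 bits of the product */
}

static inline int64_t mul64(int a, int b)
{
    return (int64_t)a * b;                 /* full 64-bit product */
}
```

With the inline functions, a call such as `mulh(t, x)` behaves the same whatever the caller's variables are called, which is the property the macro form lacks.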

f1efbca5 03/14/2011 12:45 PM Justin Ruggles

ac3enc: add SIMD-optimized shifting functions for use with the fixed-point AC3 encoder.

a5444fee 03/12/2011 11:35 AM Mans Rullgard

Add CONFIG_AC3DSP symbol to simplify makefiles

Signed-off-by: Mans Rullgard <>

bf6fa732 02/19/2011 03:51 PM Ronald S. Bultje

dsputil_mmx.c: remove ff_vector128.

Remove ff_vector128, it is identical to ff_pb_80.

12802ec0 02/17/2011 10:35 PM Ronald S. Bultje

dsputil: move VC1-specific stuff into VC1DSPContext.

1f004fc5 02/16/2011 07:08 PM Justin Ruggles

ac3dsp: Change punpckhqdq to movhlps in ac3_max_msb_abs_int16().

Signed-off-by: Ronald S. Bultje <>

fbb6b49d 02/13/2011 09:49 PM Justin Ruggles

ac3enc: Add x86-optimized function to speed up log2_tab().

AC3DSPContext.ac3_max_msb_abs_int16() finds the maximum MSB of the absolute
value of each element in an array of int16_t.

Signed-off-by: Ronald S. Bultje <>
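A plain C reference for the operation described in the entry above might look like the following sketch. It relies on the fact that OR-ing the absolute values together yields a number whose highest set bit is the highest set bit of any element. Returning that OR rather than a bit index is an assumption of this sketch, and `max_msb_abs_int16_ref` and `highest_set_bit` are hypothetical names.

```c
#include <stdint.h>
#include <stdlib.h>

/* OR together the absolute values of all elements. The most significant
 * set bit of the result equals the largest MSB among the inputs. */
static int max_msb_abs_int16_ref(const int16_t *src, int len)
{
    int v = 0;
    for (int i = 0; i < len; i++)
        v |= abs(src[i]);  /* int16_t promotes to int, so -32768 is safe */
    return v;
}

/* Helper for illustration: index of the highest set bit, or -1 for zero. */
static int highest_set_bit(int v)
{
    int bit = -1;
    while (v) {
        v >>= 1;
        bit++;
    }
    return bit;
}
```

For the input `{1, -5, 3}` the OR is 7, whose highest set bit (bit 2) matches that of the largest magnitude, 5.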

e6b1ed69 02/13/2011 02:36 PM Loren Merritt

FFT: factor a shuffle out of the inner loop and merge it into fft_permute.

6% faster SSE FFT on Conroe, 2.5% on Penryn.

Signed-off-by: Janne Grunau <>

dda3f0ef 02/10/2011 08:32 PM Justin Ruggles

Add x86-optimized versions of exponent_min().

Signed-off-by: Ronald S. Bultje <>

17cf7c68 02/08/2011 11:25 PM Ronald S. Bultje

Fix ff_emu_edge_core_sse() on Win64.

Fix emu_edge_v_extend_15 to be <128 bytes on Win64, by being more strict
on the size of registers and which registers are being used for operations
where multiple are available. This fixes segfaults in emulated_edge()...

c73d99e6 02/02/2011 02:44 AM Justin Ruggles

Separate format conversion DSP functions from DSPContext.

This will be beneficial for use with the audio conversion API without
requiring it to depend on all of dsputil.

Signed-off-by: Mans Rullgard <>

770c410f 02/02/2011 02:40 AM Alex Converse

Fix ff_imdct_calc_sse() on gcc-4.6

Gcc 4.6 only preserves the first value when using an array with an "m"
constraint.

Signed-off-by: Mans Rullgard <>

81f2a3f4 02/01/2011 01:55 AM Ronald S. Bultje

Implement a SIMD version of emulated_edge_mc() for x86.

From ~550 cycles (C version) to 170 (SSE/x86-64), 206 (MMX/x86-32)
and 196 (SSE2/x86-32) cycles.

d19b744a 01/31/2011 08:30 PM Justin Ruggles

cosmetics: indentation

Signed-off-by: Mans Rullgard <>

80ba1ddb 01/31/2011 08:28 PM Justin Ruggles

Remove unneeded add bias from 3 functions.

DSPContext.vector_fmul_window()
DCADSPContext.lfe_fir()
SynthFilterContext.synth_filter_float()

Signed-off-by: Mans Rullgard <>

80944df7 01/24/2011 11:24 PM Mans Rullgard

x86: fix overflow in h264 8x8 planar prediction

Signed-off-by: Mans Rullgard <>

6eabb0d3 01/22/2011 05:53 PM Justin Ruggles

Change DSPContext.vector_fmul() from dst=dst*src to dest=src0*src1.

Signed-off-by: Mans Rullgard <>
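The signature change in the entry above can be illustrated with two scalar reference functions. This is a sketch under assumptions: the parameter names and order are hypothetical, and the real DSPContext functions are not reproduced here.

```c
/* Old form (hypothetical signature): multiplies in place, dst = dst * src. */
static void vector_fmul_old(float *dst, const float *src, int len)
{
    for (int i = 0; i < len; i++)
        dst[i] *= src[i];
}

/* New form (hypothetical signature): separate inputs, dst = src0 * src1.
 * Passing dst as src0 reproduces the old in-place behaviour. */
static void vector_fmul_new(float *dst, const float *src0,
                            const float *src1, int len)
{
    for (int i = 0; i < len; i++)
        dst[i] = src0[i] * src1[i];
}
```

The three-operand form lets a caller write the product to a fresh buffer without first copying one of the inputs into it.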

1c189fc5 01/21/2011 07:59 PM Justin Ruggles

cosmetics related to LPC changes.

Signed-off-by: Mans Rullgard <>

77a78e9b 01/21/2011 07:59 PM Justin Ruggles

Separate window function from autocorrelation.

Signed-off-by: Mans Rullgard <>

56f8952b 01/21/2011 07:58 PM Justin Ruggles

Move lpc_compute_autocorr() from DSPContext to a new struct LPCContext.

Signed-off-by: Mans Rullgard <>

b9c7f66e 01/20/2011 01:34 AM Ronald S. Bultje

Fix horizontal/horizontal_up 8x8l intra prediction x86/simd functions.
The original functions did not work correctly for edge pixels, e.g.
when CODEC_FLAG_EMU_EDGE is set, leading to corrupt output in e.g. VLC.
Based on a patch by Daniel Kang <daniel d kang gmail com>....

ef4a6514 01/18/2011 08:48 PM Mans Rullgard

Replace ASMALIGN with .p2align

This macro has unconditionally used .p2align for a long time and
serves no useful purpose.

ac3c9d01 01/18/2011 08:48 PM Mans Rullgard

x86: remove VLA in ac3_downmix_sse

2c3589bf 01/18/2011 08:32 PM Janne Grunau

consolidate .gitignore patterns into a single file

Signed-off-by: Janne Grunau <>

348b8218 01/17/2011 02:50 PM Janne Grunau

convert svn:ignore properties to .gitignore files

Signed-off-by: Janne Grunau <>

1b3e43e4 01/15/2011 10:00 PM Ronald S. Bultje

Fix overflow in pred16x16_plane x86 simd code. Fixes issue 2547.

Originally committed as revision 26381 to svn://svn.ffmpeg.org/ffmpeg/trunk

ec3233a8 01/14/2011 11:26 PM Ronald S. Bultje

Fix ff_pw_3 alignment.

Originally committed as revision 26344 to svn://svn.ffmpeg.org/ffmpeg/trunk

19fb234e 01/14/2011 09:34 PM Jason Garrett-Glaser

H.264: split luma dc idct out and implement MMX/SSE2 versions
About 2.5x the speed.

NOTE: the way that the asm code handles large qmuls is a bit suboptimal.
If x264-style dequant was used (separate shift and qmul values), it might
be possible to get some extra speed....

004357a1 01/03/2011 11:30 AM Daniel Kang

Fix compilation on x86-32 with --disable-optimizations,
fixes issue 2127.

Patch by Daniel Kang, daniel.d.kang at gmail

Originally committed as revision 26204 to svn://svn.ffmpeg.org/ffmpeg/trunk

0790caba 12/31/2010 01:29 AM Daniel Kang

Fix invalid reads in valgrind fate, patch by Daniel Kang <daniel dot d dot
kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26177 to svn://svn.ffmpeg.org/ffmpeg/trunk

536e9b2f 12/29/2010 11:48 PM Daniel Kang

Port pred8x8l_down_left_mmxext (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang
at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26162 to svn://svn.ffmpeg.org/ffmpeg/trunk

720ea2d5 12/29/2010 09:55 PM Daniel Kang

Port pred4x4_down_right_mmxext (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang
at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26159 to svn://svn.ffmpeg.org/ffmpeg/trunk

d0aebe23 12/29/2010 09:52 PM Daniel Kang

Port pred4x4_vertical_right_mmxext (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang
at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26158 to svn://svn.ffmpeg.org/ffmpeg/trunk

76497232 12/29/2010 09:49 PM Daniel Kang

Port pred4x4_horizontal_down_mmxext (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26157 to svn://svn.ffmpeg.org/ffmpeg/trunk

e9c576a4 12/29/2010 09:42 PM Daniel Kang

Port pred4x4_horizontal_up_mmxext (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang
at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26156 to svn://svn.ffmpeg.org/ffmpeg/trunk

92f441ae 12/29/2010 09:35 PM Daniel Kang

Port pred4x4_vertical_left_mmxext (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang
at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26155 to svn://svn.ffmpeg.org/ffmpeg/trunk

e8d98764 12/29/2010 09:30 PM Ronald S. Bultje

Merge a few superfluous CONFIG_GPL checks.

Originally committed as revision 26154 to svn://svn.ffmpeg.org/ffmpeg/trunk

42a59278 12/29/2010 08:43 PM Ronald S. Bultje

Whitespace cosmetics.

Originally committed as revision 26152 to svn://svn.ffmpeg.org/ffmpeg/trunk

57b1f334 12/29/2010 08:42 PM Daniel Kang

Port pred8x8l_horizontal_down_sse2/ssse3 (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26151 to svn://svn.ffmpeg.org/ffmpeg/trunk

04cbdf3d 12/29/2010 08:38 PM Daniel Kang

Port pred8x8l_horizontal_down_mmxext (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26150 to svn://svn.ffmpeg.org/ffmpeg/trunk

98c6053c 12/29/2010 08:35 PM Daniel Kang

Port pred8x8l_horizontal_up_mmxext/ssse3 (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26149 to svn://svn.ffmpeg.org/ffmpeg/trunk

ecc7efbb 12/29/2010 08:06 PM Daniel Kang

Port pred8x8l_vertical_left_sse2/ssse3 (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26148 to svn://svn.ffmpeg.org/ffmpeg/trunk

bdd93f1b 12/29/2010 07:54 PM Daniel Kang

Port pred8x8l_vertical_right_sse2/ssse3 (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26147 to svn://svn.ffmpeg.org/ffmpeg/trunk

f25112fc 12/29/2010 07:46 PM Daniel Kang

Port pred8x8l_vertical_right_mmxext (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26146 to svn://svn.ffmpeg.org/ffmpeg/trunk

602a4cb2 12/29/2010 07:19 PM Daniel Kang

Port pred8x8l_down_right_sse2/ssse3 (H.264 intra prediction) from x264
(authors: Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot
d dot kang at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26145 to svn://svn.ffmpeg.org/ffmpeg/trunk

e916acbc 12/29/2010 07:12 PM Daniel Kang

Port pred8x8l_down_right_mmxext (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang
at gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26143 to svn://svn.ffmpeg.org/ffmpeg/trunk

c249e665 12/29/2010 07:02 PM Daniel Kang

Port pred8x8l_down_left_sse2/ssse3 (H.264 intra prediction) from x264 (authors:
Jason, Loren, Holger) to FFmpeg. Patch by Daniel Kang <daniel dot d dot kang at
gmail com>, as part of Google's GCI 2010.

Originally committed as revision 26142 to svn://svn.ffmpeg.org/ffmpeg/trunk

ee1ba9c3 12/29/2010 06:46 PM Daniel Kang

Port pred8x8l_vertical_mmxext/ssse3 (H.264 intra prediction) from x264 to
FFmpeg. Original authors: Holger Lubitz <holger lubitz org>, Jason Garrett-
Glaser <darkshikari gmail com> (approves LGPL relicensing for this code) and
Loren Merritt <lorenm at u dot washington dot edu> (approves LGPL relicensing...

04207ef3 12/29/2010 06:40 PM Daniel Kang

Port pred8x8l_horizontal_mmxext/ssse3 (H.264 intra prediction) from x264 to
FFmpeg. Original authors: Holger Lubitz <holger lubitz org>, Jason Garrett-
Glaser <darkshikari gmail com> (approves LGPL relicensing for this code) and
Loren Merritt <lorenm at u dot washington dot edu> (approves LGPL relicensing...

abab14ea 12/29/2010 06:33 PM Daniel Kang

Port pred8x8l_dc_mmx/ssse3 (H.264 intra prediction) from x264 to FFmpeg.
Original authors: Holger Lubitz <holger lubitz org>, Jason Garrett-Glaser
<darkshikari gmail com> (approves LGPL relicensing for this code) and Loren
Merritt <lorenm at u dot washington dot edu> (approves LGPL relicensing for...

2e93fd4b 12/29/2010 06:11 PM Daniel Kang

Port pred8x8l_top_dc_mmxext/ssse3 (H.264 intra prediction) from x264 to FFmpeg.
Original authors: Holger Lubitz <holger lubitz org>, Jason Garrett-Glaser
<darkshikari gmail com> (approves LGPL relicensing for this code) and Loren
Merritt <lorenm at u dot washington dot edu> (approves LGPL relicensing for...

54a959e4 12/29/2010 06:04 PM Ronald S. Bultje

Move PRED4x4_LOWPASS up so it can be used in 8x8l predict functions while
keeping the functions ordered in the source file (i.e. cosmetics).

Originally committed as revision 26136 to svn://svn.ffmpeg.org/ffmpeg/trunk

a2dfe8d1 12/29/2010 06:00 PM Ronald S. Bultje

Port pred8x8_dc_mmxext (H.264 intra prediction) from x264 to FFmpeg. Original
authors: Holger Lubitz <holger lubitz org>, Jason Garrett-Glaser <darkshikari
gmail com> (approves LGPL relicensing for this code) and Loren Merritt <lorenm
at u dot washington dot edu> (approves LGPL relicensing for this code). Patch...

83ff3f72 12/29/2010 05:45 PM Ronald S. Bultje

Add missing authors to copyright headers.

Originally committed as revision 26133 to svn://svn.ffmpeg.org/ffmpeg/trunk

725a3f9d 12/29/2010 05:42 PM Daniel Kang

Port pred8x8_top_dc_mmxext (H.264 intra prediction) from x264 to FFmpeg.
Original authors: Holger Lubitz <holger lubitz org>, Jason Garrett-Glaser
<darkshikari gmail com> (approves LGPL relicensing for this code) and Loren
Merritt <lorenm at u dot washington dot edu> (approves LGPL relicensing for...

98928c83 12/29/2010 05:34 PM Ronald S. Bultje

Mark recently added pred4x4_down_left_mmxext as CONFIG_GPL. Although Holger
initially said he'd be OK with relicensing, he also said he wanted to have
another look at the patch, and then he went on vacation, so let's play it
safe for now. We can consider removing this again later....

911b32f4 12/24/2010 10:43 PM Daniel Kang

Port pred4x4_down_left_mmxext (H.264 intra prediction) from x264 to FFmpeg.
LGPL relicensing approved by original authors: Holger Lubitz <holger lubitz
org>, Jason Garrett-Glaser <darkshikari gmail com> and Loren Merritt <lorenm
at u dot washington dot edu>. Patch by Daniel Kang <daniel dot d dot kang at...

8d147f1f 12/24/2010 05:23 PM Ronald S. Bultje

For rounding in chroma MC SSSE3, use 16-byte pw_3/4 instead of reading 8 bytes
and then using movlhps to dup it into the higher half of the register.

Originally committed as revision 26086 to svn://svn.ffmpeg.org/ffmpeg/trunk

90f1f3bf 12/06/2010 12:14 AM Baptiste Coudurier

In yadif filter, declare asm constants directly to avoid dependency on libavcodec

Originally committed as revision 25895 to svn://svn.ffmpeg.org/ffmpeg/trunk

9e95999e 12/04/2010 01:06 PM Baptiste Coudurier

10l, add ff_pw_1 to dsputil_mmx for yadif sse2

Originally committed as revision 25881 to svn://svn.ffmpeg.org/ffmpeg/trunk

1761272b 12/01/2010 01:12 PM avcoder

Use SECTION .text for yasm code.

Patch by avcoder, ffmpeg gmail

Originally committed as revision 25859 to svn://svn.ffmpeg.org/ffmpeg/trunk

4f9d25dd 11/02/2010 03:09 AM Ramiro Polla

dnxhd_mmx: prefer xmm registers below xmm6 when they are available

Originally committed as revision 25634 to svn://svn.ffmpeg.org/ffmpeg/trunk

80e33d24 11/01/2010 07:35 PM İsmail Dönmez

dsputil: Use explicit movzbl instead of movzx

This fixes compilation with the latest clang trunk version.

Patch by İsmail Dönmez, ismail at namtrac dot org

Originally committed as revision 25628 to svn://svn.ffmpeg.org/ffmpeg/trunk

a4ece893 10/31/2010 11:37 PM Ramiro Polla

lpc_mmx: add xmm registers to clobber list

Originally committed as revision 25620 to svn://svn.ffmpeg.org/ffmpeg/trunk

e5d5407e 10/31/2010 11:36 PM Ramiro Polla

lpc_mmx: merge some asm blocks

These blocks depended on the compiler keeping xmm registers untouched between
them.

Originally committed as revision 25619 to svn://svn.ffmpeg.org/ffmpeg/trunk

eed299b8 10/31/2010 09:20 PM Ramiro Polla

sad16_sse2: merge 2 asm blocks

Originally committed as revision 25617 to svn://svn.ffmpeg.org/ffmpeg/trunk

153ca56b 10/31/2010 06:14 PM Ramiro Polla

xmm_clobbers: list xmm registers first in clobber list

suncc does not like the leading commas inside the macro, but it has no problem
with trailing commas.

Originally committed as revision 25615 to svn://svn.ffmpeg.org/ffmpeg/trunk

ba404520 10/31/2010 04:28 PM Ramiro Polla

idct_sse2_xvid: only mark xmm>=8 as clobbered on x86_64

Originally committed as revision 25614 to svn://svn.ffmpeg.org/ffmpeg/trunk

05c01807 10/31/2010 03:07 PM Ramiro Polla

motion_est_mmx: prefer xmm registers below xmm6 when they are available

Originally committed as revision 25612 to svn://svn.ffmpeg.org/ffmpeg/trunk

5d543a3d 10/31/2010 01:57 PM Ramiro Polla

dsputil_mmx: add xmm registers to clobber list

Originally committed as revision 25611 to svn://svn.ffmpeg.org/ffmpeg/trunk

e2d13c58 10/31/2010 01:46 PM Ramiro Polla

cosmetics: split long line

Originally committed as revision 25610 to svn://svn.ffmpeg.org/ffmpeg/trunk

0d729e0d 10/31/2010 01:45 PM Ramiro Polla

fdct_mmx: add xmm registers to clobber list

Originally committed as revision 25609 to svn://svn.ffmpeg.org/ffmpeg/trunk

616735eb 10/31/2010 01:17 PM Ramiro Polla

idct_sse2_xvid: add xmm registers to clobber list

Originally committed as revision 25608 to svn://svn.ffmpeg.org/ffmpeg/trunk

9943f3b9 10/31/2010 01:15 PM Ramiro Polla

mpegvideo_mmx: add xmm registers to clobber list

Originally committed as revision 25607 to svn://svn.ffmpeg.org/ffmpeg/trunk

559738ef 10/31/2010 01:13 PM Ramiro Polla

dsputil_mmx: prefer xmm registers below xmm6 when they are available

Originally committed as revision 25606 to svn://svn.ffmpeg.org/ffmpeg/trunk

51d592db 10/30/2010 05:14 PM Ramiro Polla

h264dsp: add xmm registers to clobber list

Originally committed as revision 25604 to svn://svn.ffmpeg.org/ffmpeg/trunk

ac19f4a3 10/28/2010 06:31 PM Ramiro Polla

indent

Originally committed as revision 25598 to svn://svn.ffmpeg.org/ffmpeg/trunk

cae05859 10/28/2010 06:22 PM Ramiro Polla

h264dsp: merge some more asm blocks

Originally committed as revision 25597 to svn://svn.ffmpeg.org/ffmpeg/trunk

c6a908be 10/25/2010 08:29 PM Ramiro Polla

dct32: mark xmm registers in clobber list in ff_dct32_float_sse()

Originally committed as revision 25569 to svn://svn.ffmpeg.org/ffmpeg/trunk

b32c9ca9 10/25/2010 06:02 PM Ramiro Polla

h264dsp: merge some asm blocks

Some code was initializing some xmm registers in one asm block and using them
in the following block, assuming they wouldn't be changed in between blocks.

Originally committed as revision 25568 to svn://svn.ffmpeg.org/ffmpeg/trunk

6c214280 10/07/2010 07:18 PM Reimar Döffinger

Add d modifier to asm argument to fix nasm compilation.

Originally committed as revision 25397 to svn://svn.ffmpeg.org/ffmpeg/trunk

326bf69a 10/06/2010 01:27 AM Ramiro Polla

fft: mark xmm registers as clobbered in ff_imdct_calc_sse

Originally committed as revision 25363 to svn://svn.ffmpeg.org/ffmpeg/trunk

dd68d4db 10/05/2010 10:06 PM Ronald S. Bultje

MMX, MMX2, SSE2 and SSSE3 optimizations for pred16x16/8x8_plane H264 intra
prediction (plus some with different rounding for svq3/rv40). Speedup (for
SSSE3) about 6-fold, 3.6% faster overall with cathedral sample.

Originally committed as revision 25361 to svn://svn.ffmpeg.org/ffmpeg/trunk

9276bddd 10/04/2010 01:08 PM İsmail Dönmez

snowdsp: Explicitly state the operand sizes

Fixes compilation with clang's builtin assembler

Patch by İsmail Dönmez, ismail at namtrac dot org

Originally committed as revision 25331 to svn://svn.ffmpeg.org/ffmpeg/trunk