ffmpeg / libavcodec / i386 @ c4ff7c53

# Date Author Comment
c4ff7c53 12/21/2008 11:56 PM Diego Biurrun

Do not use full include path for headers in the same directory.

Originally committed as revision 16260 to svn://svn.ffmpeg.org/ffmpeg/trunk

a22eff36 12/19/2008 01:45 PM Jason Garrett-Glaser

Port x264 deblocking code to libavcodec.
This includes SSE2 luma deblocking code and both MMXEXT and SSE2 luma
intra deblocking code for H.264 decoding. This assembly is available
under --enable-gpl and speeds decoding of Cathedral by 7%.

Originally committed as revision 16239 to svn://svn.ffmpeg.org/ffmpeg/trunk

40c7d0ae 12/19/2008 03:01 AM Jason Garrett-Glaser

Add automatic prefix handling to yasm functions. Does nothing now, but will
be useful for porting x264 asm in the future.

Originally committed as revision 16234 to svn://svn.ffmpeg.org/ffmpeg/trunk

ac223859 12/18/2008 02:36 AM Michael Niedermayer

H.264 idct functions that include the chroma, inter luma and intra16 luma loops
thus avoiding the calling overhead.
New functions are not yet used.

Originally committed as revision 16206 to svn://svn.ffmpeg.org/ffmpeg/trunk

2600f8c8 12/17/2008 12:39 AM Diego Biurrun

Remove obsolete and misleading comments.

Originally committed as revision 16178 to svn://svn.ffmpeg.org/ffmpeg/trunk

2169b79c 12/16/2008 12:47 AM David Conrad

Workaround for shared libs on OS X with Apple gcc 4.0.1

Originally committed as revision 16158 to svn://svn.ffmpeg.org/ffmpeg/trunk

5e660449 12/14/2008 12:48 AM Aurelien Jacobs

avoid POSIX reserved _t suffix

Originally committed as revision 16117 to svn://svn.ffmpeg.org/ffmpeg/trunk

d08ea32c 12/14/2008 12:46 AM Aurelien Jacobs

Get rid of mmx_t.

Originally committed as revision 16116 to svn://svn.ffmpeg.org/ffmpeg/trunk

193ce3ab 12/12/2008 12:19 AM Baptiste Coudurier

dnxhd get_pixels_8x4_sym sse2

Originally committed as revision 16072 to svn://svn.ffmpeg.org/ffmpeg/trunk

5fecfb7d 12/10/2008 09:35 PM Loren Merritt

clear_block mmx

Originally committed as revision 16045 to svn://svn.ffmpeg.org/ffmpeg/trunk

9686df2b 12/03/2008 03:23 PM Diego Biurrun

Delete unnecessary 'extern' keywords.

Originally committed as revision 15990 to svn://svn.ffmpeg.org/ffmpeg/trunk

35d0c217 11/25/2008 07:34 AM Reimar Döffinger

Replace pushf/popf by explicit pushfl/popfl (32 bit) or pushfq/popfq (x86_64),
to fix generated code on ICC 11.0.

Originally committed as revision 15935 to svn://svn.ffmpeg.org/ffmpeg/trunk

4deaa946 11/24/2008 09:28 AM Måns Rullgård

Add shift argument to MULL macro

This replaces use of FRAC_BITS in the MULL definition with a third
argument specifying the shift amount. All uses of this macro are
updated to pass FRAC_BITS as third argument.

Originally committed as revision 15921 to svn://svn.ffmpeg.org/ffmpeg/trunk
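
The change described above can be sketched in portable C. This is only an illustration and not the i386 inline-asm version the commit touches: the value of FRAC_BITS and the helper name fixmul are assumptions made for the example.

```c
#include <stdint.h>

/* Hypothetical fixed-point precision, chosen only for this example. */
#define FRAC_BITS 15

/* Three-argument form: widen to 64 bits, multiply, then shift right by s.
 * Right-shifting a negative value is implementation-defined in C; the
 * common compilers perform an arithmetic shift. */
#define MULL(a, b, s) ((int)(((int64_t)(a) * (int64_t)(b)) >> (s)))

/* A caller that previously relied on the implicit FRAC_BITS now passes it. */
static int fixmul(int a, int b)
{
    return MULL(a, b, FRAC_BITS);
}
```

With the shift passed explicitly, the same macro can serve code that uses a different fixed-point precision.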

d45be178 11/14/2008 02:44 AM Baptiste Coudurier

convert fdct_mmx to plain asm

Originally committed as revision 15819 to svn://svn.ffmpeg.org/ffmpeg/trunk

82d1605f 11/03/2008 06:08 PM Dominik Mierzejewski

Remove duplicated MM_* macros for CPU capabilities from dsputil.h.
Add missing one for FF_MM_ALTIVEC to avcodec.h.
Rename all the occurrences of MM_* to the corresponding FF_MM_*.

Originally committed as revision 15770 to svn://svn.ffmpeg.org/ffmpeg/trunk

1fcb9be3 10/29/2008 11:44 PM Diego Biurrun

asm() --> __asm__() in #warning comment

Originally committed as revision 15746 to svn://svn.ffmpeg.org/ffmpeg/trunk

01f54021 10/29/2008 11:43 PM Diego Biurrun

Add missing headers to pass 'make checkheaders'.

Originally committed as revision 15745 to svn://svn.ffmpeg.org/ffmpeg/trunk

782fc0c3 10/27/2008 02:35 PM Diego Pettenò

Rename template included sources from .h to _template.c.

There are multiple source files that are #include'd rather than
compiled, as they are used as template for generation of similar code,
like asm-optimised code. Some of these files are right now named with...

31c4f070 10/20/2008 04:05 PM Reimar Döffinger

Use x86_reg type instead of long in float_to_int16 MMX/SSE functions.
Fixes compilation on MinGW64.

Originally committed as revision 15655 to svn://svn.ffmpeg.org/ffmpeg/trunk

0dba1995 10/19/2008 04:44 AM David Conrad

Cosmetics: reindent

Originally committed as revision 15644 to svn://svn.ffmpeg.org/ffmpeg/trunk

ca4a4ac1 10/19/2008 04:43 AM David Conrad

Combine non-bitexact sections

Originally committed as revision 15643 to svn://svn.ffmpeg.org/ffmpeg/trunk

daa1ea04 10/19/2008 04:40 AM David Conrad

VP3 loop filter is mmx2 not mmx

Originally committed as revision 15642 to svn://svn.ffmpeg.org/ffmpeg/trunk

357f45d9 10/17/2008 03:18 AM David Conrad

MMX VP3 Loop Filter

Originally committed as revision 15630 to svn://svn.ffmpeg.org/ffmpeg/trunk

be449fca 10/16/2008 01:34 PM Diego Pettenò

Convert asm keyword into __asm__.

Neither the asm() nor the __asm__() keyword is part of the C99
standard, but while GCC accepts the former in C89 syntax, it is not
accepted in C99 unless GNU extensions are turned on (with -fasm). The
latter form is accepted in any syntax as an extension (without...
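
A minimal sketch of the spelling issue discussed here, assuming GCC or Clang: the double-underscore form `__asm__` is accepted even in strict ISO modes such as -std=c99, where the plain `asm` keyword is not recognized. The helper name barrier_then is hypothetical, and the empty asm statement is only a compiler barrier, not code from the commit.

```c
/* Returns its argument after an empty inline-asm statement.
 * The "memory" clobber only stops the compiler from reordering
 * memory accesses across this point; it emits no instructions. */
static inline int barrier_then(int value)
{
#if defined(__GNUC__)
    __asm__ volatile("" ::: "memory");
#endif
    return value;
}
```

Compiling this file with -std=c99 should succeed, whereas replacing `__asm__` with `asm` would not without GNU extensions enabled.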

f76543c9 10/09/2008 06:36 PM Baptiste Coudurier

get_pixels_sse2, ~+12% performance compared to mmx

Originally committed as revision 15591 to svn://svn.ffmpeg.org/ffmpeg/trunk

098b4169 10/07/2008 09:41 PM Diego Biurrun

Add a bunch of const qualifiers to function arguments.
Taken from the upstream libmpeg2 version of the file.

Originally committed as revision 15585 to svn://svn.ffmpeg.org/ffmpeg/trunk

8f271177 10/07/2008 08:37 AM Diego Biurrun

Sync cosmetics from upstream.

Originally committed as revision 15576 to svn://svn.ffmpeg.org/ffmpeg/trunk

8539d8b5 10/04/2008 04:48 PM Diego Pettenò

Declare ff_pw_53 and ff_pw_18 as assembler constants.
Patch by Diego 'Flameeyes' Pettenò

Originally committed as revision 15553 to svn://svn.ffmpeg.org/ffmpeg/trunk

f7e5b0cc 09/18/2008 03:08 AM David Conrad

Remove duplication of TRANSPOSE8 macro, also removing some redundant load/stores

Originally committed as revision 15355 to svn://svn.ffmpeg.org/ffmpeg/trunk

7b16a979 09/18/2008 03:08 AM David Conrad

Make a separate macro for writing an entire block from registers to memory

Originally committed as revision 15354 to svn://svn.ffmpeg.org/ffmpeg/trunk

3f429d11 09/18/2008 03:07 AM David Conrad

Factorize SSE2_(Row|Column)_IDCT into one macro

Originally committed as revision 15353 to svn://svn.ffmpeg.org/ffmpeg/trunk

3c1ebfd5 09/17/2008 08:52 PM David Conrad

Cosmetics: remove excessive blank lines

Originally committed as revision 15351 to svn://svn.ffmpeg.org/ffmpeg/trunk

8cfd78ce 09/17/2008 07:49 PM David Conrad

Ensure MMX/SSE2 VP3 IDCT selection isn't disabled when only Theora is enabled

Originally committed as revision 15350 to svn://svn.ffmpeg.org/ffmpeg/trunk

c3c5bba1 09/17/2008 07:38 PM David Conrad

Cosmetics: reindent

Originally committed as revision 15349 to svn://svn.ffmpeg.org/ffmpeg/trunk

f7fc9e49 09/17/2008 07:30 PM David Conrad

Rewrite SSE2 VP3 IDCT in inline asm

Originally committed as revision 15347 to svn://svn.ffmpeg.org/ffmpeg/trunk

437e3f4d 09/17/2008 07:30 PM David Conrad

Rewrite MMX VP3 IDCT in inline asm

Originally committed as revision 15346 to svn://svn.ffmpeg.org/ffmpeg/trunk

ccd3ec82 09/17/2008 07:30 PM David Conrad

MMX/SSE2 VP3 IDCT are bitexact now that the dequantization matrices are permutated correctly

Originally committed as revision 15345 to svn://svn.ffmpeg.org/ffmpeg/trunk

98790382 08/31/2008 07:39 AM Stefano Sabatini

Globally rename the header inclusion guard names.

Consistently apply this rule: the guard name is obtained from the
filename by stripping the leading "lib", converting '/' and '.' to
'_' and uppercasing the resulting name. Guard names in the root
directory have to be prefixed by "FFMPEG_"....
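
The naming rule quoted above can be sketched as a small C helper. The function name guard_name is hypothetical, and treating a path without any '/' as a root-directory header is an assumption of this example, since the message is truncated at that point.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Derive an inclusion-guard name from a header path.
 * Returns 0 on success, -1 if the output buffer is too small. */
static int guard_name(const char *path, char *out, size_t out_size)
{
    size_t pos = 0;

    assert(path && out);

    if (!strchr(path, '/')) {
        /* Assumption: no directory component means "root directory". */
        static const char prefix[] = "FFMPEG_";
        if (sizeof(prefix) > out_size)
            return -1;
        memcpy(out, prefix, sizeof(prefix) - 1);
        pos = sizeof(prefix) - 1;
    } else if (strncmp(path, "lib", 3) == 0) {
        path += 3;                       /* strip the leading "lib" */
    }

    for (; *path; path++) {
        unsigned char c = (unsigned char)*path;
        if (pos + 1 >= out_size)
            return -1;
        /* '/' and '.' become '_', everything else is uppercased */
        out[pos++] = (c == '/' || c == '.') ? '_' : (char)toupper(c);
    }
    if (pos >= out_size)
        return -1;
    out[pos] = '\0';
    return 0;
}
```

Under these assumptions, "libavcodec/i386/dsputil_mmx.h" maps to AVCODEC_I386_DSPUTIL_MMX_H.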

b4c3d835 08/31/2008 07:05 AM David Conrad

Use ff_vp3_idct_data in vp3dsp_mmx.c rather than duplicating it

Originally committed as revision 15118 to svn://svn.ffmpeg.org/ffmpeg/trunk

43410eaf 08/31/2008 07:05 AM David Conrad

Declare ff_vp3_idct_data to be uint16_t

Originally committed as revision 15117 to svn://svn.ffmpeg.org/ffmpeg/trunk

587ae47a 08/31/2008 07:05 AM David Conrad

Don't declare SSE vp3 idct data static, so it can be used in the mmx version

Originally committed as revision 15116 to svn://svn.ffmpeg.org/ffmpeg/trunk

c979fa03 08/30/2008 07:47 PM David Conrad

Remove unused dequantization code from SSE VP3 IDCT

Originally committed as revision 15054 to svn://svn.ffmpeg.org/ffmpeg/trunk

167029a7 08/30/2008 07:41 PM David Conrad

Use ff_pw_8 in MMX/SSE VP3 IDCT

Originally committed as revision 15053 to svn://svn.ffmpeg.org/ffmpeg/trunk

21383da8 08/30/2008 07:40 PM David Conrad

Let ff_pw_8 be used as an SSE constant

Originally committed as revision 15052 to svn://svn.ffmpeg.org/ffmpeg/trunk

2ccddc02 08/26/2008 07:38 PM Vladimir Voroshilov

Add explicit (int) cast to i386 optimized MUL* macros.
Wrong result is returned when 16-bit value is passed as value.
Also fixes "Warning: using `%edx' instead of `%dx' due to `l' suffix".

Originally committed as revision 14981 to svn://svn.ffmpeg.org/ffmpeg/trunk

dad6afb4 08/24/2008 08:41 AM Alexis Ballier

stricter constraints of asm() blocks
All these variables are used as left operands of a movd instruction,
which does accept only memory or register operands while the "g"
constraint also allows immediates. Use "rm" instead.
Patch by Alexis Ballier alexis P ballier A gmail P com...
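
A minimal sketch of the constraint choice described above, assuming GCC or Clang on x86 with SSE2 available. The function name roundtrip_movd is hypothetical, and the plain C fallback exists only so the example builds on other targets.

```c
/* Moves an int through an XMM register and back.
 * The input uses "rm": movd accepts a register or memory source,
 * but not an immediate, which the looser "g" constraint would allow. */
static inline int roundtrip_movd(int v)
{
#if defined(__GNUC__) && (defined(__i386__) || defined(__x86_64__)) && defined(__SSE2__)
    int out;
    __asm__ volatile(
        "movd %1, %%xmm0 \n\t"
        "movd %%xmm0, %0 \n\t"
        : "=r"(out)
        : "rm"(v)
        : "xmm0");
    return out;
#else
    return v; /* fallback for targets without SSE2 */
#endif
}
```

With "g", a call such as roundtrip_movd(42) could be inlined with the constant as an immediate operand, which the assembler would reject for movd.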

7ca7d5fa 08/14/2008 05:00 AM Loren Merritt

file which should have been added in r14749

Originally committed as revision 14751 to svn://svn.ffmpeg.org/ffmpeg/trunk

75ac2875 08/14/2008 04:41 AM Loren Merritt

missing prototype

Originally committed as revision 14750 to svn://svn.ffmpeg.org/ffmpeg/trunk

ebceaa1c 08/14/2008 04:40 AM Loren Merritt

gcc chokes on the 7 registers needed for float_to_int16_interleave6 (even inside HAVE_7REGS), so write it in yasm

Originally committed as revision 14749 to svn://svn.ffmpeg.org/ffmpeg/trunk

ee467537 08/14/2008 04:39 AM Loren Merritt

gcc chokes on xmm constraints, so pessimize int32_to_float_fmul_scalar_sse a little

Originally committed as revision 14748 to svn://svn.ffmpeg.org/ffmpeg/trunk

67587238 08/13/2008 11:36 PM Loren Merritt

special case 6 channel version of float_to_int16_interleave
5% faster ac3

Originally committed as revision 14744 to svn://svn.ffmpeg.org/ffmpeg/trunk

911e21a3 08/13/2008 11:35 PM Loren Merritt

simd int->float
20% faster ac3 if downmixing, 15% if not

Originally committed as revision 14743 to svn://svn.ffmpeg.org/ffmpeg/trunk

ac2e5564 08/13/2008 11:33 PM Loren Merritt

simd downmix
13% faster ac3 if downmixing

Originally committed as revision 14742 to svn://svn.ffmpeg.org/ffmpeg/trunk

862b98d4 08/12/2008 12:51 AM Loren Merritt

cosmetics in dsp init

Originally committed as revision 14704 to svn://svn.ffmpeg.org/ffmpeg/trunk

0a570e82 08/12/2008 12:36 AM Loren Merritt

remove mdct tmp buffer

Originally committed as revision 14702 to svn://svn.ffmpeg.org/ffmpeg/trunk

46803f4f 08/12/2008 12:33 AM Loren Merritt

optimize imdct_half:
remove tmp buffer.
skip fft reinterleave pass, leaving data in a format more convenient for simd.
merge post-rotate with post-reorder.

Originally committed as revision 14700 to svn://svn.ffmpeg.org/ffmpeg/trunk

5d0ddd1a 08/12/2008 12:26 AM Loren Merritt

split-radix FFT
c is 1.9x faster than previous c (on various x86 cpus), sse is 1.6x faster than previous sse.

Originally committed as revision 14698 to svn://svn.ffmpeg.org/ffmpeg/trunk

bafad220 08/11/2008 11:54 PM Loren Merritt

import yasm macros from x264

Originally committed as revision 14697 to svn://svn.ffmpeg.org/ffmpeg/trunk

f769b746 08/02/2008 05:32 PM Uoti Urpala

Mark add_png_paeth_prediction_* functions which are only used within this file
as static. patch by Uoti Urpala, uoti.urpala pp1.inet fi

Originally committed as revision 14509 to svn://svn.ffmpeg.org/ffmpeg/trunk

4f20b45f 07/19/2008 09:53 PM Michael Niedermayer

Fix h264_loop_filter_strength_mmx2() so it works with PAFF.
fixed at least:
CVFI1_Sony_D.jsv
CVFI1_SVA_C.264
MR6_BT_B.h264

Originally committed as revision 14310 to svn://svn.ffmpeg.org/ffmpeg/trunk

5eb0f2a4 07/16/2008 12:50 AM Loren Merritt

float_to_int16_interleave: change src to an array of pointers instead of assuming it's contiguous.
this has no immediate effect, but will allow it to be used in more codecs.

Originally committed as revision 14252 to svn://svn.ffmpeg.org/ffmpeg/trunk
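
The interface change described above, where src becomes an array of per-channel pointers, can be sketched in plain C. The names interleave_to_int16 and clip_round_int16 are hypothetical, and this example assumes the input floats are already scaled to the int16 range; it is not the SIMD code from the commit.

```c
#include <stdint.h>

/* Round to nearest and clip to the int16 range (NaN maps to -32768). */
static int16_t clip_round_int16(float v)
{
    if (!(v > -32768.0f))
        return -32768;
    if (v >= 32767.0f)
        return 32767;
    return (int16_t)(v < 0 ? v - 0.5f : v + 0.5f);
}

/* src[c] points to the samples of channel c; the channel buffers
 * do not need to be adjacent in memory. */
static void interleave_to_int16(int16_t *dst, const float *const *src,
                                long len, int channels)
{
    for (long i = 0; i < len; i++)
        for (int c = 0; c < channels; c++)
            dst[i * channels + c] = clip_round_int16(src[c][i]);
}
```

Because each channel is reached through its own pointer, a decoder that allocates channel buffers separately can call the same routine without copying them into one block first.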

4342a7f3 07/15/2008 04:11 AM Loren Merritt

10l, float_to_int16_interleave_sse/3dnow wrote the wrong samples

Originally committed as revision 14236 to svn://svn.ffmpeg.org/ffmpeg/trunk

b9fa3208 07/13/2008 03:03 PM Loren Merritt

exploit mdct symmetry
2% faster vorbis on conroe, k8. 7% on celeron.

Originally committed as revision 14207 to svn://svn.ffmpeg.org/ffmpeg/trunk

f27e1d64 07/13/2008 02:56 PM Loren Merritt

simplify vorbis windowing

Originally committed as revision 14205 to svn://svn.ffmpeg.org/ffmpeg/trunk

d7e1fc42 07/11/2008 04:48 AM Kostya Shishkov

SSE2 optimizations for Monkey's Audio decoder vector functions

Originally committed as revision 14161 to svn://svn.ffmpeg.org/ffmpeg/trunk

bc314472 07/09/2008 05:51 PM Alexander Strange

Make the function prototype visible to comply with C99 inline.
Fixes building with gcc -std=gnu99.

Originally committed as revision 14140 to svn://svn.ffmpeg.org/ffmpeg/trunk

e98750c3 07/09/2008 07:21 AM Michael Niedermayer

float_to_int16_sse2()
20% faster than sse

Originally committed as revision 14138 to svn://svn.ffmpeg.org/ffmpeg/trunk

1835cda6 07/08/2008 09:24 AM Victor Pollex

Make LOAD4/STORE4 macros more generic.
Patch by Victor Pollex victor pollex web de
Original thread: [PATCH] mmx implementation of vc-1 inverse transformations
Date: 06/21/2008 03:37 PM

Originally committed as revision 14108 to svn://svn.ffmpeg.org/ffmpeg/trunk

35ee72b1 07/07/2008 09:25 PM Michael Niedermayer

1 c-asm loop less and 1x unroll of float_to_int16_sse()
25% faster

Originally committed as revision 14104 to svn://svn.ffmpeg.org/ffmpeg/trunk

560fa9bf 07/07/2008 09:04 PM Michael Niedermayer

Fix x86-64

Originally committed as revision 14103 to svn://svn.ffmpeg.org/ffmpeg/trunk

63b737d4 07/07/2008 08:46 PM Michael Niedermayer

don't use C-asm loops, and unroll float_to_int16_3dnow() once
30% faster

Originally committed as revision 14102 to svn://svn.ffmpeg.org/ffmpeg/trunk

74fd9022 06/28/2008 06:30 PM Alexander Strange

Realign newlines.

Originally committed as revision 14023 to svn://svn.ffmpeg.org/ffmpeg/trunk

00969e1c 06/28/2008 06:27 PM Alexander Strange

Use MANGLE instead of memory operands to read globals.
(fixes out of registers with apple gcc 4.2)

Originally committed as revision 14022 to svn://svn.ffmpeg.org/ffmpeg/trunk

00eebe3d 06/22/2008 07:05 AM Reimar Döffinger

Fix add_bytes_mmx and add_bytes_l2_mmx for w < 16

Originally committed as revision 13877 to svn://svn.ffmpeg.org/ffmpeg/trunk

0bd134ab 05/17/2008 02:36 PM Michael Niedermayer

Simplify vsad16_mmx2().

Originally committed as revision 13193 to svn://svn.ffmpeg.org/ffmpeg/trunk

6bf6a930 05/17/2008 02:35 PM Michael Niedermayer

Simplify vsad16_mmx().

Originally committed as revision 13191 to svn://svn.ffmpeg.org/ffmpeg/trunk

e1381022 05/17/2008 02:33 PM Michael Niedermayer

Simplify vsad_intra16_mmx2()

Originally committed as revision 13189 to svn://svn.ffmpeg.org/ffmpeg/trunk

06bb35f9 05/17/2008 02:31 PM Michael Niedermayer

Simplify vsad_intra16_mmx()

Originally committed as revision 13188 to svn://svn.ffmpeg.org/ffmpeg/trunk

a12b44d7 05/09/2008 02:34 PM Diego Biurrun

Add missing required header directly.

Originally committed as revision 13103 to svn://svn.ffmpeg.org/ffmpeg/trunk

20cd685a 05/09/2008 02:33 PM Diego Biurrun

Add missing path to #include.

Originally committed as revision 13102 to svn://svn.ffmpeg.org/ffmpeg/trunk

245976da 05/09/2008 11:56 AM Diego Biurrun

Use full path for #includes from another directory.

Originally committed as revision 13098 to svn://svn.ffmpeg.org/ffmpeg/trunk

40d0e665 05/08/2008 09:11 PM Ramiro Polla

Do not misuse long as the size of a register in x86.
typedef x86_reg as the appropriate size and use it instead.

Originally committed as revision 13081 to svn://svn.ffmpeg.org/ffmpeg/trunk
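
A minimal sketch of a register-sized typedef of the kind described above. The preprocessor tests and the non-x86 fallback are assumptions of this example, not the project's actual header; x86_reg_bits is a hypothetical helper.

```c
#include <stdint.h>

#if defined(__x86_64__) || defined(_M_X64)
typedef int64_t x86_reg;   /* 64-bit general-purpose registers */
#elif defined(__i386__) || defined(_M_IX86)
typedef int32_t x86_reg;   /* 32-bit general-purpose registers */
#else
typedef intptr_t x86_reg;  /* assumption: fallback so the sketch builds elsewhere */
#endif

/* Width of the register-sized type in bits. */
static int x86_reg_bits(void)
{
    return (int)(sizeof(x86_reg) * 8);
}
```

The distinction matters on 64-bit Windows, where long is 32 bits wide while the general-purpose registers are 64 bits wide.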

57105ddd 04/26/2008 04:02 PM Diego Biurrun

Rename i386/cputest.c --> i386/cpuid.c.

Originally committed as revision 13002 to svn://svn.ffmpeg.org/ffmpeg/trunk

c88c253d 04/17/2008 09:57 PM Diego Biurrun

cosmetics: __asm __volatile --> asm volatile

Originally committed as revision 12885 to svn://svn.ffmpeg.org/ffmpeg/trunk

80465c7e 04/16/2008 08:51 PM Diego Biurrun

cosmetics: Fix nonstandard indentation.

Originally committed as revision 12863 to svn://svn.ffmpeg.org/ffmpeg/trunk

591d87ba 04/16/2008 08:43 PM Jeff Downs

Cosmetics:
Break long lines.
Correct spelling in comment (duplicatin -> duplicating)

Originally committed as revision 12862 to svn://svn.ffmpeg.org/ffmpeg/trunk

52cb7981 04/16/2008 04:40 AM Jeff Downs

Redo r12838, this time using svn copy to create h264_i386.h from cabac.h.

Move decode_significance_x86() and decode_significance_8x8_x86() to
i386-specific file from cabac.h.
New file is h264-oriented and only included from h264.c
Resolves compilation when configured with --disable-optimizations due to...

3aa9ede4 04/16/2008 04:26 AM Jeff Downs

Revert 12838 to redo it the right way (use svn copy to create new
file based on old).

Originally committed as revision 12845 to svn://svn.ffmpeg.org/ffmpeg/trunk

f73a6393 04/16/2008 01:36 AM Alexander Strange

Add a new xvid-style IDCT using SSE2.

Originally committed as revision 12843 to svn://svn.ffmpeg.org/ffmpeg/trunk

e6cfd8ff 04/15/2008 01:51 PM Jeff Downs

Move decode_significance_x86() and decode_significance_8x8_x86() to
i386-specific file from cabac.h.
New file is h264-oriented and only included from h264.c
Resolves compilation when configured with --disable-optimizations due to
decode_significance_8x8_x86 using last_coeff_flag_offset_8x8, which is...

3fbe7118 04/14/2008 08:54 PM Luca Barbato

Eliminate movdqu in vp3dsp_sse2, patch from Alexander Strange astrangeAtithinkswDoTcom

Originally committed as revision 12824 to svn://svn.ffmpeg.org/ffmpeg/trunk

54a0b6e5 04/12/2008 04:54 PM Alexander Strange

Add a header file to declare Xvid IDCT functions.
patch by Alexander Strange, astrange ithinksw com

Originally committed as revision 12794 to svn://svn.ffmpeg.org/ffmpeg/trunk

96275520 04/08/2008 11:49 PM Loren Merritt

Fix H.264 interframe decoding when compiling with icc. Patch by Loren
Merritt:

"It seems that icc copies the constants from their global var onto the
stack, at which point they're not aligned, hence the crash.
[This change] really shouldn't mean anything different, but maybe it'll...

ce53144b 04/01/2008 04:51 AM Loren Merritt

h264 chroma mc ssse3
width8: 180->92, width4: 78->63 cycles (core2)

Originally committed as revision 12661 to svn://svn.ffmpeg.org/ffmpeg/trunk

04932b0d 03/22/2008 04:46 PM Diego Biurrun

cosmetics: typo fixes

Originally committed as revision 12554 to svn://svn.ffmpeg.org/ffmpeg/trunk

9e8e6d31 03/21/2008 12:36 PM Zuxy Meng

Add missed call to ff_cavsdsp_init_3dnow() in dsputil_init_mmx()

Originally committed as revision 12540 to svn://svn.ffmpeg.org/ffmpeg/trunk

943032b1 03/20/2008 02:24 PM Michael Niedermayer

Hardcode register to prevent apparent miscompilation.
Fixes regression tests with gcc 2.95.

Originally committed as revision 12512 to svn://svn.ffmpeg.org/ffmpeg/trunk

dea00a46 03/20/2008 02:09 PM Michael Niedermayer

remove unused temp

Originally committed as revision 12511 to svn://svn.ffmpeg.org/ffmpeg/trunk

b55aa9a9 03/17/2008 11:08 PM Måns Rullgård

get register names from x86_cpu.h

Originally committed as revision 12482 to svn://svn.ffmpeg.org/ffmpeg/trunk

5a6a9e78 03/04/2008 12:07 AM Aurelien Jacobs

move draw_edges() into dsputil

Originally committed as revision 12309 to svn://svn.ffmpeg.org/ffmpeg/trunk