FFT: factor a shuffle out of the inner loop and merge it into fft_permute.
6% faster SSE FFT on Conroe, 2.5% on Penryn.
Signed-off-by: Janne Grunau <email@example.com>
Move mm_support() from libavcodec to libavutil, make it a public function and rename it to av_get_cpu_flags().
Originally committed as revision 25076 to svn://svn.ffmpeg.org/ffmpeg/trunk
Rename FF_MM_ symbols related to CPU feature flags as AV_CPU_FLAG_ symbols, and move them from libavcodec/avcodec.h to libavutil/cpu.h.
Originally committed as revision 25040 to svn://svn.ffmpeg.org/ffmpeg/trunk
Move SSE optimized 32-point DCT to its own file. Should fix breakage with YASM disabled.
Originally committed as revision 24078 to svn://svn.ffmpeg.org/ffmpeg/trunk
SSE optimized 32-point DCT
Originally committed as revision 24077 to svn://svn.ffmpeg.org/ffmpeg/trunk
Move per-arch fft init bits into the corresponding subdirs
Originally committed as revision 19864 to svn://svn.ffmpeg.org/ffmpeg/trunk