History | View | Annotate | Download (28.6 KB)
VP8: ARM optimised decode_block_coeffs_internal
Approximately 5% faster on Cortex-A8.
Signed-off-by: Mans Rullgard <mans@mansr.com>(cherry picked from commit a7878c9f73c12cfa685bd8af8f3afcca85f56a8b)
VP8: simplify lf_delta mb mode logic(cherry picked from commit dd18c9a050ac1f1437151ceb2d2afbc96c5602d8)
VP8: slightly faster DCT coefficient probability update
Originally committed as revision 24687 to svn://svn.ffmpeg.org/ffmpeg/trunk
VP8: unroll partition type decoding tree~34% faster partition type decoding.
Originally committed as revision 24681 to svn://svn.ffmpeg.org/ffmpeg/trunk
VP8: unroll splitmv decoding treeMuch faster splitmv mode decoding.
Originally committed as revision 24680 to svn://svn.ffmpeg.org/ffmpeg/trunk
VP8: unroll MB mode decoding tree~50% faster MB mode decoding, plus eliminate a costly switch.
Originally committed as revision 24679 to svn://svn.ffmpeg.org/ffmpeg/trunk
VP8: eliminate a dereference in coefficient decoding
Originally committed as revision 24671 to svn://svn.ffmpeg.org/ffmpeg/trunk
VP8: simplify token_prob handling~1.5% faster decode_block_coeffs
Originally committed as revision 24659 to svn://svn.ffmpeg.org/ffmpeg/trunk
VP8: Inline traversing vp8_small_mvtree
Much faster read_mv_component, slightly faster overall
Originally committed as revision 24470 to svn://svn.ffmpeg.org/ffmpeg/trunk
Decode DCT tokens by branching to a different code path for each branchon the huffman tree, instead of traversing the tree in a while loop.
Based on the similar optimization in libvpx's detokenize.c
10% faster at normal bitrates, and 30% faster for high-bitrate intra-only...
Eliminate a LUT in escape decoding in VP8 decode_block_coeffs
Originally committed as revision 24441 to svn://svn.ffmpeg.org/ffmpeg/trunk
Simplify MV parsing, removes laying out 2 or 4 (16x8/8x8/8x16) MVs over all16 subblocks (since we no longer need that), which should also lead to aminor speedup.
Originally committed as revision 23854 to svn://svn.ffmpeg.org/ffmpeg/trunk
Optimize split MC, so we don't always do 4x4 blocks of 4x4pixels each, butwe apply them as 16x8/8x16/8x8 subblocks where possible. Since this allowsus to use width=8/16 instead of width=4 MC functions, we can now take moreadvantage of SSE2/SSSE3 optimizations, leading to a total speedup for splitMV...
Fix a typo, spotted by Diego.
Originally committed as revision 23738 to svn://svn.ffmpeg.org/ffmpeg/trunk
Native VP8 decoder.
Patch by David Conrad <lessen42 gmail com> and myself.
Originally committed as revision 23719 to svn://svn.ffmpeg.org/ffmpeg/trunk