For rounding in chroma MC SSSE3, use 16-byte pw_3/4 instead of reading 8 bytesand then using movlhps to dup it into the higher half of the register.
Originally committed as revision 26086 to svn://svn.ffmpeg.org/ffmpeg/trunk
Move H264 chroma MC from inline asm to yasm. This fixes VP3/5/6 and VC-1fate failures on Win64.
Originally committed as revision 24989 to svn://svn.ffmpeg.org/ffmpeg/trunk