An Efficient SIMD Architecture with Parallel Memory for 2D Cosine Transforms of Video Coding | IEEE Conference Publication | IEEE Xplore