Abstract
In the previous chapter we have analyzed various parallelization approaches for H.264 decoding and concluded that in order to scale to a large number of cores, macroblock-level parallelism needs to be exploited. The next question is how to efficiently exploit this parallelism. In other words, how to map this parallelism onto a multi-/many-core architecture. To answer this question, in this chapter we present two implementations of the 2D-Wave approach. The first implementation maintains a centralized pool of macroblocks that are ready to be decoded and cores retrieve tasks from this Task Pool. In the second approach, called Ring-Line, full lines of macroblocks are statically assigned to cores and the cores synchronize and communicate point-to-point. Both approaches have been implemented and are evaluated on a dual-chip Cell BE system with 18 cores in total.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alvarez, M., Salami, E., Ramirez, A., Valero, M.: HD-VideoBench: A Benchmark for Evaluating High Definition Digital Video Applications. In: Proceedings IEEE International Symposium on Workload Characterization (2007). http://personals.ac.upc.edu/alvarez/hdvideobench/index.html
Alvarez-Mesa, M., Ramirez, A., Azevedo, A., Meenderinck, C., Juurlink, B., Valero, M.: Scalability of Macroblock-level Parallelism for H.264 Decoding. In: Proceedings International Conference on Parallel and Distributed Systems (2009)
Chen, T., Raghavan, R., Dale, J., Iwata, E.: Cell Broadband Engine Architecture and its First Implementation: a Performance View. IBM Journal of Research and Development 51(5) (2007)
Chi, C.C., Juurlink, B., Meenderinck, C.: Evaluation of Parallel H.264 Decoding Strategies for the Cell Broadband Engine. In: Proceedings of the 24th ACM International Conference on Supercomputing (2010)
The FFmpeg Libavcodec. http://ffmpeg.org
Hoogerbrugge J, Terechko A (2011) A Multithreaded Multicore System for Embedded Media Processing. Transactions on High-Performance Embedded Architectures and Compilers 6590:154–173
Pham D, Asano S, Bolliger M, Day M, Hofstee H, Johns C, Kahle J, Kameyama A, Keaty J, Masubuchi Y, Riley M, Shippy D, Stasiak D, Suzuoki M, Wang M, Warnock J, Weitzel S, Wendel D, Yamazaki T, Yazawa K (2005) The Design and Implementation of a First-Generation CELL Processor. Proceedings IEEE International Solid-State Circuits Conference, In
x264. A Free H.264/AVC Encoder. http://www.videolan.org/developers/x264.html
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2012 The Author(s)
About this chapter
Cite this chapter
Juurlink, B., Alvarez-Mesa, M., Chi, C.C., Azevedo, A., Meenderinck, C., Ramirez, A. (2012). Exploiting Parallelism: the 2D-Wave. In: Scalable Parallel Programming Applied to H.264/AVC Decoding. SpringerBriefs in Computer Science. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-2230-3_4
Download citation
DOI: https://doi.org/10.1007/978-1-4614-2230-3_4
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-2229-7
Online ISBN: 978-1-4614-2230-3
eBook Packages: Computer ScienceComputer Science (R0)