Skip to main content

Low-Latency Software Polar Decoders

  • Chapter
  • First Online:
High-Speed Decoders for Polar Codes

Abstract

The low-complexity encoding and decoding algorithms render polar codes attractive for use in SDR applications where computational resources are limited. In this chapter, we present low-latency software polar decoders that exploit modern processor capabilities. We show how adapting the algorithm at various levels can lead to significant improvements in latency and throughput , yielding polar decoders that are suitable for high-performance SDR applications on modern desktop processors and embedded-platform processors . These proposed decoders have an order of magnitude lower latency and memory footprint compared to state-of-the-art decoders, while maintaining comparable throughput . In addition, we present strategies and results for implementing polar decoders on graphical processing units . Finally, we show that the energy efficiency of the proposed decoders is comparable to state-of-the-art software polar decoders.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    As stated above, compiler auto-vectorization is always kept enabled.

References

  1. PCI express base specification revision 2.0. PCI-SIG (2006)

    Google Scholar 

  2. IEEE standard for floating-point arithmetic. IEEE Std 754-2008 pp. 1–70 (2008). DOI 10.1109/IEEESTD.2008.4610935

    Google Scholar 

  3. IEEE standard for information technology–telecommunications and information exchange between systems local and metropolitan area networks–specific requirements part 11: Wireless LAN medium access control (MAC) and physical layer (PHY) specifications. IEEE Std 802.11-2012 (Revision of IEEE Std 802.11-2007) pp. 1–2793 (2012). DOI 10.1109/IEEESTD.2012.6178212

    Google Scholar 

  4. Arıkan, E.: Channel polarization: A method for constructing capacity-achieving codes for symmetric binary-input memoryless channels. IEEE Trans. Inf. Theory 55(7), 3051–3073 (2009). DOI 10. 1109/TIT.2009.2021379

    Google Scholar 

  5. Bang, S., Ahn, C., Jin, Y., Choi, S., Glossner, J., Ahn, S.: Implementation of LTE system on an SDR platform using CUDA and UHD. Analog Integr. Circuits and Signal Process. 78(3), 599–610 (2014). DOI 10.1007/s10470-013-0229-1

    Article  Google Scholar 

  6. Demel, J., Koslowski, S., Jondral, F.: A LTE receiver framework using GNU Radio. J. Signal Process. Syst. 78(3), 313–320 (2015). DOI 10.1007/s11265-014-0959-z

    Article  Google Scholar 

  7. Feng, W.C., Xiao, S.: To GPU synchronize or not GPU synchronize? In: IEEE Int. Symp. on Circuits and Syst. (ISCAS), pp. 3801–3804 (2010). DOI 10.1109/ISCAS.2010.5537722

    Google Scholar 

  8. Giard, P., Sarkis, G., Thibeault, C., Gross, W.J.: Fast software polar decoders. In: IEEE Int. Conf. on Acoustics, Speech, and Signal Process. (ICASSP), pp. 7555–7559 (2014). DOI 10.1109/ICASSP.2014.6855069

    Google Scholar 

  9. Han, X., Niu, K., He, Z.: Implementation of IEEE 802.11n LDPC codes based on general purpose processors. In: IEEE Int. Conf. on Commun. Technol. (ICCT), pp. 218–222 (2013). DOI 10.1109/ICCT.2013.6820375

    Google Scholar 

  10. Jouguet, P., Kunz-Jacques, S.: High performance error correction for quantum key distribution using polar codes. Quantum Inf. & Computation 14(3-4), 329–338 (2014)

    MathSciNet  Google Scholar 

  11. Le Gal, B., Jego, C., Crenne, J.: A high throughput efficient approach for decoding LDPC codes onto GPU devices. IEEE Embedded Syst. Lett. 6(2), 29–32 (2014). DOI 10.1109/LES.2014.2311317

    Article  Google Scholar 

  12. Le Gal, B., Leroux, C., Jego, C.: Software polar decoder on an embedded processor. In: IEEE Int. Workshop on Signal Process. Syst. (SiPS) (2014). DOI 10.1109/SiPS.2014.6986083

    Google Scholar 

  13. Le Gal, B., Leroux, C., Jego, C.: Multi-Gb/s software decoding of polar codes. IEEE Trans. Signal Process. 63(2), 349–359 (2015). DOI 10. 1109/TSP.2014.2371781

    Google Scholar 

  14. Leroux, C., Raymond, A., Sarkis, G., Gross, W.: A semi-parallel successive-cancellation decoder for polar codes. IEEE Trans. Signal Process. 61(2), 289–299 (2013). DOI 10.1109/TSP.2012.2223693

    Article  MathSciNet  Google Scholar 

  15. NVIDIA: Kepler GK110 - the fastest, most efficient HPC architecture ever built. NVIDIA’s Next Generation CUDA Computer Architecture: Kepler GK110 (2012)

    Google Scholar 

  16. NVIDIA: NVIDIA management library (NVML). NVML API Reference Guide (2014)

    Google Scholar 

  17. NVIDIA: Performance guidelines. CUDA C Programming Guide (2014)

    Google Scholar 

  18. Sarkis, G., Giard, P., Thibeault, C., Gross, W.J.: Autogenerating software polar decoders. In: IEEE Global Conf. on Signal and Inf. Process. (GlobalSIP), pp. 6–10 (2014). DOI 10.1109/GlobalSIP.2014. 7032067

    Google Scholar 

  19. Sarkis, G., Giard, P., Vardy, A., Thibeault, C., Gross, W.J.: Fast polar decoders: Algorithm and implementation. IEEE J. Sel. Areas Commun. 32(5), 946–957 (2014). DOI 10.1109/JSAC.2014.140514

    Article  Google Scholar 

  20. Sarkis, G., Gross, W.J.: Increasing the throughput of polar decoders. IEEE Commun. Lett. 17(4), 725–728 (2013). DOI 10.1109/LCOMM. 2013.021213.121633

    Article  Google Scholar 

  21. Tan, K., Liu, H., Zhang, J., Zhang, Y., Fang, J., Voelker, G.M.: Sora: High-performance software radio using general-purpose multi-core processors. Commun. ACM 54(1), 99–107 (2011). DOI 10.1145/1866739. 1866760

    Article  Google Scholar 

  22. Treibig, J., Hager, G., Wellein, G.: LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments. In: Int. Conf. on Parallel Process. Workshops (ICPPW), pp. 207–216 (2010). DOI 10.1109/ICPPW.2010.38

    Google Scholar 

  23. Wang, G., Wu, M., Yin, B., Cavallaro, J.R.: High throughput low latency LDPC decoding on GPU for SDR systems. In: IEEE Glob. Conf. on Sign. and Inf. Process. (GlobalSIP), pp. 1258–1261 (2013). DOI 10.1109/GlobalSIP.2013.6737137

    Google Scholar 

  24. Xianjun, J., Canfeng, C., Jaaskelainen, P., Guzma, V., Berg, H.: A 122Mb/s turbo decoder using a mid-range GPU. In: Int. Wireless Commun. and Mobile Comput. Conf. (IWCMC), pp. 1090–1094 (2013). DOI 10.1109/IWCMC.2013.6583709

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Giard, P., Thibeault, C., Gross, W.J. (2017). Low-Latency Software Polar Decoders. In: High-Speed Decoders for Polar Codes. Springer, Cham. https://doi.org/10.1007/978-3-319-59782-9_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-59782-9_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-59781-2

  • Online ISBN: 978-3-319-59782-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics