Operations for Text Processing

Padmanabhan, T. R.

doi:10.1007/978-981-10-3277-6_7

T. R. Padmanabhan²

14k Accesses

Abstract

Information is printed/displayed in natural language character combinations. But computer communication and storage systems use only bit streams. Unicode which defines universally accepted conversion between character and bit streams is the basis to bridge the gap between the two. Different coding schemes of conversion are in vogue—UTF8 being the most widely used one. UTF8 is explained and coding/decoding related constructs in Python dealt with in detail. Character streams as strings and binary string related operations are treated comprehensively. ‘Bytes’ and ‘bytearray’ as sequence representations and number representations in different forms (binary, octal, hex, decimal, and radix-specified) and their conversions come in handy here. Exercises provided are in classical cryptography, cryptanalysis, and selected coding schemes; these are useful in relating Python operations with characters effectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Hardcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Forouzan B (2013) Data communications and networking, 5th edn. McGraw Hill, New York
Google Scholar
Original UTF-8 paper. (http://doc.cat-v.org/plan_9/4th_edition/papers/utf)
Padmanabhan TR (2007) Introduction to microcontrollers and their applications. Alpha Science International Ltd, Oxford
Google Scholar
Shyamala CK, Harini N, Padmanabhan TR (2011) Cryptography and security. Wiley India, New Delhi
Google Scholar
The Unicode Standard: A Technical—Introduction. (http://www.unicode.org/standard/principles.html)
van Rossum G, Drake FL Jr (2014) The Python library reference. Python Software Foundation
Google Scholar

Download references

Author information

Authors and Affiliations

Amrita University, Coimbatore, Tamil Nadu, India
T. R. Padmanabhan

Authors

T. R. Padmanabhan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to T. R. Padmanabhan .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Padmanabhan, T.R. (2016). Operations for Text Processing. In: Programming with Python. Springer, Singapore. https://doi.org/10.1007/978-981-10-3277-6_7

Download citation

DOI: https://doi.org/10.1007/978-981-10-3277-6_7
Published: 14 January 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3276-9
Online ISBN: 978-981-10-3277-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics