Advertisement

Operations for Text Processing

  • T. R. PadmanabhanEmail author
Chapter

Abstract

Information is printed/displayed in natural language character combinations. But computer communication and storage systems use only bit streams. Unicode which defines universally accepted conversion between character and bit streams is the basis to bridge the gap between the two. Different coding schemes of conversion are in vogue—UTF8 being the most widely used one. UTF8 is explained and coding/decoding related constructs in Python dealt with in detail. Character streams as strings and binary string related operations are treated comprehensively. ‘Bytes’ and ‘bytearray’ as sequence representations and number representations in different forms (binary, octal, hex, decimal, and radix-specified) and their conversions come in handy here. Exercises provided are in classical cryptography, cryptanalysis, and selected coding schemes; these are useful in relating Python operations with characters effectively.

Keywords

Plain Text White Space Decimal Number Floating Point Number Probability Range 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. Forouzan B (2013) Data communications and networking, 5th edn. McGraw Hill, New YorkGoogle Scholar
  2. Padmanabhan TR (2007) Introduction to microcontrollers and their applications. Alpha Science International Ltd, OxfordGoogle Scholar
  3. Shyamala CK, Harini N, Padmanabhan TR (2011) Cryptography and security. Wiley India, New DelhiGoogle Scholar
  4. The Unicode Standard: A Technical—Introduction. (http://www.unicode.org/standard/principles.html)
  5. van Rossum G, Drake FL Jr (2014) The Python library reference. Python Software FoundationGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2016

Authors and Affiliations

  1. 1.Amrita UniversityCoimbatoreIndia

Personalised recommendations