Molecular Mechanism of Coding and Autonomous Decision-Making in Biological Systems
Biological systems are distinguishable from inanimate materials by their capacity for cognition and computation. Cells are the main subunits of a biological system and function as highly advanced computers, executing thousands of operations per second for different biological purposes in order to adapt dynamically to the environment. Unlike current electronic computers, biological systems use a molecular coding system in which information is stored in molecules. Information storage in molecules gives cells a massive operation capacity. A deep understanding of the mechanisms of coding and data processing in cells could have several technological applications and trigger an industrial revolution. However, this level of progress requires a different scientific viewpoint for the life sciences: a paradigm that places them much closer to the other experimental branches of the natural sciences, including chemistry, physics, and mathematics.
In this chapter, we first provide a detailed description of different aspects of molecular coding and data operation in biological systems, applying the new concepts of cognitive chemistry and the relativity of code, energy, and mass. We will discuss how information is stored in the patterns of molecular interactions and how real-time interactions between molecules and atoms generate a dynamic coding and operation capacity in biological systems. In the second part, we will discuss how cognitive chemistry knowledge can be leveraged to design synthetic systems with autonomous properties similar to those of biological systems. In the third part, we will discuss how the basic principles of cognitive chemistry can be applied to mimic the extensive computation capacity of biomolecules in solving complex decision-making problems.
Keywords: Cognitive chemistry coding system; Nondeterministic polynomial time problems; Molecular computing; Artificial Intelligence (AI); DNA computing; Multilayer coding; Conserved and dynamic coding; Stem cells and decision-making; Eternal cognition
2.1 Molecular Mechanism of Autonomous Coding and Data Processing Based on Physical and Chemical Foundation of Life
Considering the physical and chemical foundations of life, it can be concluded that living organisms are physicochemical systems and are not exempt from nature’s laws in physics and chemistry. Scientifically, we know that biological systems are composed of the ordinary elements of nature. However, the regulatory mechanisms that first converted elements from randomness into a highly organized system (defined as life) have remained a mystery.
“Science cannot solve the ultimate mystery of nature. And that is because, in the last analysis, we ourselves are part of nature and therefore part of the mystery that we are trying to solve.” (Max Planck)
The precise regulatory mechanisms behind “the origin of life”, when the first self-generating reaction of life occurred roughly four billion years ago, remain one of nature’s mysteries (owing to the limits of our experience of time). However, the silver lining is that we can learn from the currently existing rules of life, which are already accessible to human experience.
In the first half of the twentieth century, after the establishment of modern physics and the revolutionary effect of quantum physics on different technologies, Erwin Schrodinger called for a revision of the definition of life. Furthermore, he attempted to describe a revised classification of the natural sciences by filling the gaps between the life sciences and the other branches of science. In his well-known book “What Is Life,” he pointed to the missing parts in the classification of the natural sciences that separate living systems from human-made physicochemical systems. He attributed this gap to our limited understanding of the structure and construction of living systems. In other words, the construction of living systems is different from anything we have yet tested in human-made physical laboratories.
“What I wish to make clear in this last chapter is, in short, that from all we have learned about the structure of living matter, we must be prepared to find it working in a manner that cannot be reduced to the ordinary laws of physics. That not on the ground that there is any new ‘force’ or what not, directing the behavior of the single atoms within a living organism, but because the construction is different from anything we have yet tested in the physical laboratory.”
In the early twentieth century, the basic principles of modern physics were established by Max Planck. Modern physics later became highly practical and was applied in different technologies through the work of Albert Einstein and Erwin Schrodinger on quantum mechanics.
In 1900, Max Planck tried to explain blackbody (BB) radiation from warm objects. The BB spectrum was explained by assuming that the energy in an electromagnetic wave is quantized, according to Planck’s equation $E_{\text{EM wave}} = nhf$, where $n$ is an integer, $h$ is Planck’s constant, and $f$ is the frequency of the wave.
In 1905, Albert Einstein explained the concept of a photon as a packet of electromagnetic radiation that behaves as energy (wave) and particle (mass) at the same time. He also described the equation for the relativity of energy and mass as $E = mc^2$.
Later, in 1926, Erwin Schrodinger formulated a wave equation that could precisely describe the energy of an atom. His equation accurately predicted the energy levels of the hydrogen atom, which led to the establishment of the basic principles of quantum physics. Schrodinger’s equation was probably the most important equation of the twentieth century because of its huge effect on technological progress. The major significance of Schrodinger’s work in quantum physics comes from its broad coverage of both physics and chemistry.
The physicist Paul Dirac famously asserted: “The Schrodinger equation accounts for much of physics and all of chemistry.”
In addition, in the mid-twentieth century, Schrodinger attempted to revise the classical definition of the life sciences by applying the basic principles of physics and chemistry to biological systems. The full manuscript of his lecture series on the physicochemical foundation of life was later published as a book titled “What Is Life.”
Schrodinger attempted to provide several pieces of evidence for the validity of the principles of quantum physics in biological systems. He believed that the differences between biological systems and other physical and chemical systems are simply related to their structures. Biological systems are still made of natural elements and obey all of nature’s laws in physics and chemistry.
He tried to explain that differences in the construction of two systems may cause a big difference in their function, even though the basic elements used in both systems might be the same. A difference in construction must not be mistaken for a difference in the basic elements or in the nature of the system. The difference in construction is enough to produce an entirely different way of functioning. Schrodinger made this concept clear by comparing the structural differences between a heat engine and an electric motor, both of which may use the same basic elements (e.g., copper and iron). He pointed to the consistency of the natural properties of copper even though it is used differently in the construction of a heat engine than in an electric motor.
“To put it crudely, an engineer, familiar with heat engines only, will, after inspecting the construction of an electric motor, be prepared to find it working along principles which he does not yet understand. He finds the copper familiar to him in kettles used here in the form of long, long wires wound in coils; the iron familiar to him in levers and bars and steam cylinders here filling the interior of those coils of copper wire. He will be convinced that it is the same copper and the same iron, subject to the same laws of nature, and he is right in that. The difference in construction is enough to prepare him for an entirely different way of functioning.”
The consistency of nature’s laws had been mentioned earlier by Max Planck and Einstein. However, Schrodinger attempted to provide a systematic description of the consistency of nature’s laws in living systems, aligned with quantum mechanics.
Schrodinger referred to “orderliness” as the major difference between a living system and anything that exists in inanimate matter.
“The unfolding of events in the life cycle of an organism exhibits an admirable regularity and orderliness, unrivaled by anything we meet with in inanimate matter. We find it controlled by a supremely well-ordered group of atoms, which represents only a very small fraction of the sum total in every cell.”
In addition, based on the behavior of atoms, Schrodinger classified all physical systems in nature into two main categories: statistical physics and order from order (ordered) physics (Fig. 2.1d).
“I remember an interesting little paper by Max Planck on the topic ‘The Dynamical and the Statistical Type of Law’ (Dynamische und Statistische Gesetzmässigkeit). The distinction is precisely the one we have here labeled as ‘order from order’ and ‘order from disorder’. The object of that paper was to show how the interesting statistical type of law, controlling large-scale events, is constituted from the dynamical laws supposed to govern the small-scale events, the interaction of the single atoms and molecules. The latter type is illustrated by large-scale mechanical phenomena, as the motion of the planets or of a clock, etc. Thus it would appear that the new principle, the order-from-order principle, to which we have pointed with great solemnity as being the real clue to the understanding of life, is not at all new to physics.”
“For the new principle is a genuinely physical one: it is, in my opinion, nothing else than the principle of quantum theory over again. But we cannot expect the ‘laws of physics’ derived from it to suffice straightaway to explain the behavior of living matter, whose striking features are visibly based to a large extent on the ‘order-from-order’ principle. You would not expect two entirely different mechanisms to bring about the same type of law; you would not expect your latch-key to open your neighbor’s door as well. We must therefore not be discouraged by the difficulty of interpreting life by the ordinary laws of physics. For that is just what is to be expected from the knowledge we have gained of the structure of living matter. We must be prepared to find a new type of physical law prevailing in it.”
Now the big question is: what would be the new type of physical laws that govern the highly ordered structure, function, and behavior of living systems?
Here in this book, we attempt to explain the mechanisms that cause the ordered behavior of molecules and atoms in biological systems. We will discuss how order from order physics can be explained by the unique capacity of biological systems to generate molecular codes. As an additional factor in nature’s laws, we describe the coding capacity of molecules in biological systems, which should be considered alongside the properties of molecules for energy and mass production. We therefore define code as the third dimension of nature’s laws, illustrated in the autonomous properties of biological systems.
By considering code as the third dimension of nature’s laws in physics and chemistry, we define a new branch of chemistry called cognitive chemistry that mimics the ordered physics in biological systems. Cognitive chemistry integrates nature’s laws in chemistry, physics, and mathematics for coding, data storage, and operation by molecules (Fig. 2.1g). Cognitive chemistry can be interpreted in physics terminology as the relativity of code, energy, and mass.
2.2 Basic Principles of Coding and Data Processing in Biological Systems
2.2.1 Information Storage in Molecules and Materials
Unlike human-made computers, which operate according to physical and electrical coding, biological systems apply a unique chemical coding system. In this system, biological information is embedded in materials or in chemical interactions between molecules and atoms. For simplicity, in this book, code-embedded materials are called “coded materials”. As an example, deoxyribonucleic acid (DNA) is the most well-known coded material that nature uses for the storage of biological information. DNA, also known as genetic material, contains the entire information of an organism to be copied into the next generation of the species. DNA provides a huge storage capacity because it encodes data using four subunits (A, G, C, and T), so that each position can carry two bits of information, whereas current computers apply a binary (0,1) coding system. Neurotransmitters are other examples of coded materials, which play a major role in information storage and transduction in the neural system.
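The comparison between the four-letter DNA alphabet and binary coding can be made concrete with a small sketch. The mapping below (two bits per nucleotide, with A=00, C=01, G=10, T=11) is an arbitrary assumption chosen for illustration and is not a scheme described in this chapter; the function names are likewise hypothetical.

```python
# Illustrative only: the bit-to-base assignment is an arbitrary assumption.
BASES = "ACGT"  # index 0..3 corresponds to the bit pairs 00, 01, 10, 11


def bytes_to_dna(data: bytes) -> str:
    """Encode each byte as four nucleotides (2 bits per base, high bits first)."""
    return "".join(
        BASES[(byte >> shift) & 0b11]
        for byte in data
        for shift in (6, 4, 2, 0)
    )


def dna_to_bytes(strand: str) -> bytes:
    """Decode a strand produced by bytes_to_dna back into bytes."""
    out = bytearray()
    for i in range(0, len(strand), 4):
        byte = 0
        for base in strand[i:i + 4]:
            byte = (byte << 2) | BASES.index(base)
        out.append(byte)
    return bytes(out)


message = b"Hi"
strand = bytes_to_dna(message)
print(strand)                # four bases per byte
print(dna_to_bytes(strand))  # round trip recovers the message
```

Under this assumed mapping, one byte occupies four nucleotide positions instead of eight binary positions, which is the sense in which a four-symbol alphabet is denser per position than a two-symbol one.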
Figure 2.5a illustrates the physical mass of the brain, which is the origin of memory formation. The brain is the only computing system that has the capacity to generate an expandable memory volume within its limited physical mass. The main difference between the brain and current electronic computers lies in the brain’s molecular coding system. This molecular coding system is the origin of the electrical activity of neural cells. Furthermore, memory formation originates from the molecular coding system of neural cells. While electrical coding in silicon-based computers is based on the transport of electrons through uniform solid media (e.g., metal wires and transistors), the brain is made of soft material.
Figure 2.5b illustrates immunohistochemistry (IHC) imaging of brain microsections. The tissue microsections are stained against specific molecular markers of neural cells at different layers of the brain. Figure 2.6a–e illustrates IHC staining of the brain, in which microsections were stained against molecular markers of various types of neural cells at different layers of the brain.
Figure 2.7 illustrates the direct role of neurotransmitter molecules in both data transport and memory formation through their role in the formation of neural synapses and, consequently, neural networks. Neurotransmitters are a class of coded chemical molecules that play a major role in the formation of memory through neural networks. Neurotransmitters function as chemical (molecular) logic gates and switches. Initiation of the action potential and the electrical activity of neurons are regulated by the logic gate function of neurotransmitters at the neural synapses. Consequently, neural networks form through the coupling of neural synapses.
Figure 2.8a illustrates microscopic imaging of IHC staining of brain sections against different types of neurotransmitters in neural cells. Microscopic imaging of neural cells at high magnification illustrates the release of neurotransmitters from neural cells into synaptic spaces. Unlike binary coding in silicon-based computers, which is an entirely electronic coding system, data operation in neural cells involves both molecular and electrical coding systems. In fact, even the electrical activity of neural cells originates from molecular coding by neurotransmitters, which possess a specific molecular mass.
Figure 2.8b illustrates electron microscopy imaging of a neural network. Figure 2.8c illustrates electron microscopy imaging of neurotransmitter vesicles in a neuron. Unlike the electronic coding system in silicon-based computers, which lacks physical mass, the major mass of neurons as coding subunits of a neural network is generated from coded materials (e.g., vesicles of neurotransmitters and signaling proteins).
Considering that the basic difference between the brain’s operating system and electronic computers lies in its molecular coding system (through different types of coded materials), it can be concluded that the unique capacities of the brain (e.g., expandable memory and learning) can be explained, and even simulated in synthetic form, through innovative molecular coding systems for next-generation cognitive systems.
2.2.2 Multilayer Coding
2.2.3 Molecular Coding and Algorithmic Chemistry
Unlike silicon-based computers, which are operated by an electronic coding system, all biological functions in living cells are operated by chemical coding systems. Figure 2.10 illustrates the alphabetic symbols of the universal nucleotide/amino acid coding system in living systems.
Electrochemical attractive forces among coding molecules can be ordered in the form of various molecular algorithms for different computational purposes. For example, electrochemical forces among coding molecules can be translated into integer values according to the strength of attraction between the molecules.
- The massive information storage and data compaction capacity through multiple layers of coding.
- The capacity of massive parallel operation applying numerous molecular operators.
- The autonomous data operation and environmental recognition capacity through the electrochemical interactions of coding molecules with each other, as well as with physicochemical environmental factors. The autonomous data operation and environmental recognition capacity of coding molecules can be explained by the quantum mechanical effects of atoms on each other (discussed in Chap. 3).
The spontaneous and combinatorial nature of molecular interactions, at each coding and operation layer, leads to the real-time and massive operation capacity of a cognitive chemistry coding system (e.g., for solving complex mathematical problems).
Electrochemical attractive forces among different types of coding molecules can be quantified by specific affinity indexes. Affinity indexes among coding molecules can be translated into integer values and applied in the design of novel chemical operating systems for solving complex mathematical problems such as nondeterministic polynomial time (NP) problems. Biological systems provide highly efficient models for solving complex problems. In the following part of this chapter, we will discuss the application of biomimetic approaches for solving NP problems.
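As a minimal sketch of what translating affinity indexes into integers could look like in practice, the example below rounds pairwise affinity values to integer weights that an algorithm could then operate on. The molecule labels, the affinity values, the assumed range of 0 to 1, and the scaling factor are all hypothetical placeholders; the chapter does not specify how affinity indexes are measured or scaled.

```python
def affinity_to_integer_weights(affinity, scale=100):
    """Round each pairwise affinity index (assumed to lie in [0, 1]) to an integer."""
    return {pair: round(value * scale) for pair, value in affinity.items()}


# Hypothetical affinity indexes between three coding molecules A, B, and C.
affinity = {
    ("A", "B"): 0.82,
    ("A", "C"): 0.35,
    ("B", "C"): 0.60,
}

weights = affinity_to_integer_weights(affinity)
strongest_pair = max(weights, key=weights.get)

print(weights)
print(strongest_pair)
```

Once interactions are expressed as integer weights, they can serve as edge weights in a graph, which is the usual input form for combinatorial problems of the kind discussed later in the chapter.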
The optimal solution of a protein folding problem is highly accurate and repeatable because it corresponds to the most stable spatial conformation of the molecule, that is, the conformation with the lowest Gibbs free energy. The protein folding mechanism can be simulated for solving nondeterministic polynomial time (NP) problems.
The electrochemical properties of amino acids can be applied as the coding and operational criteria for several computable functions, for example, the design of soft logic circuit networks that apply the logic gate function and signal-transducing activities of proteins.
Furthermore, it needs to be considered that electrochemical attractive forces among amino acids’ side chains are strongly affected by other physicochemical environmental factors. This causes spatiotemporal flexibility in the conformation of proteins and, subsequently, in their interactive affinity with other molecules. Therefore, unlike the static structure of crystals (in nonliving systems), the 3-D structures of proteins are highly flexible and dynamic. The highly dynamic conformation of proteins makes these molecules highly efficient logic gates for various biological purposes (please see part 4, conditional coding).
For future research, we suggest the careful design of DNA and protein-coding sequences for the generation of autonomous systems. In addition, hybrid DNA-protein algorithms can be applied to generate algorithmic self-assembly through the guided design of electrochemical attractive forces among coding molecules. A detailed description of algorithmic self-assembly and autonomy is given in Chap. 3.
2.2.4 Inherent and Conserved Coding
2.2.5 Conditional Coding
A special property of coded molecules (e.g., DNA or proteins) is their capacity for direct connection with, and reactivity to, environmental factors. Even though biological data are stored in a highly conserved and stable coded material like DNA, the coding system is still highly dynamic and able to adapt to environmental conditions. The DNA molecule is closely connected to environmental factors, and its expression is always affected by environmental conditions.
For example, expression of the enzyme amylase is critical for the initiation of growth-related pathways in a seed, such as metabolic and energy-producing pathways. The enzymatic activity of amylase on starch (the main energy storage resource of the seed) releases glucose. Consequently, glucose molecules become accessible to metabolic pathways and provide the energy required for the early stages of growth, before the photosynthesis machinery is generated. Expression of the amylase gene is conditional on light, temperature, and humidity.
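The conditional character of amylase expression can be read as a simple predicate over environmental inputs. The sketch below is only an illustration of that reading: the `Environment` fields, the threshold values, and the assumption that all three conditions must hold together are hypothetical and are not taken from the chapter or from experimental data.

```python
from dataclasses import dataclass


@dataclass
class Environment:
    light: bool            # whether light is present
    temperature_c: float   # temperature in degrees Celsius
    humidity: float        # relative humidity, 0.0 to 1.0


def amylase_expressed(env: Environment,
                      min_temperature_c: float = 10.0,  # hypothetical threshold
                      min_humidity: float = 0.5) -> bool:  # hypothetical threshold
    """Return True only when all assumed environmental conditions are met."""
    return bool(
        env.light
        and env.temperature_c >= min_temperature_c
        and env.humidity >= min_humidity
    )


print(amylase_expressed(Environment(light=True, temperature_c=20.0, humidity=0.8)))
print(amylase_expressed(Environment(light=True, temperature_c=20.0, humidity=0.2)))
```

In this reading the gene behaves like a three-input AND condition: the stored code is constant, and the environment decides whether it is executed.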
Conditional coding in cells is a great model for generating synthetic systems capable of quick adaptation to environmental factors. Unlike the code in current computers, DNA includes special sequences that do not carry any information for the synthesis of biomolecules. These sequences are called noncoding sequences. Some DNA noncoding sequences are specialized for direct connection with environmental factors and are also called gene regulatory sequences. Gene regulatory sequences are able to recognize environmental factors and provide an appropriate response to them through a set of intermediate molecules that function as logic gate switches.
The high level of structural flexibility of proteins and their capacity to interact with each other and with other biochemical components make them highly efficient switching elements for data processing networks in biological systems. For example, in a protein-protein signal transduction and data processing network, proteins can function as switching elements or signal transducers [15, 17].
The switching capability of proteins is due to a conformational change induced by an input signal. Signal transduction occurs through a switching protein (e.g., through phosphorylation or interaction with cAMP). In addition, the intermolecular allosteric interaction between a regulatory domain (receiving the input signal) and a functional domain (transmitting the output signal) underlies the logic gate activity of proteins. Signal-transducing proteins are components of logic gates in biological data processing systems. Switches can be used to carry out logical operations of the type NOT, AND, OR, and NOR according to the rules of Boolean algebra. These operations are sufficient to process any kind of logical information.
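The claim that these operations suffice for any logical information can be checked in a few lines. The sketch below models a switch as a Boolean function and shows that NOT, OR, and AND can each be built from NOR alone, so a single switch type is already functionally complete. The functions are abstract Boolean models, not models of any particular protein.

```python
from itertools import product


def NOR(a: bool, b: bool) -> bool:
    """Output is on only when both inputs are off."""
    return not (a or b)


def NOT(a: bool) -> bool:
    return NOR(a, a)


def OR(a: bool, b: bool) -> bool:
    return NOT(NOR(a, b))


def AND(a: bool, b: bool) -> bool:
    return NOR(NOT(a), NOT(b))


# Verify the derived gates against Python's built-in Boolean operators.
for a, b in product([False, True], repeat=2):
    assert NOT(a) == (not a)
    assert OR(a, b) == (a or b)
    assert AND(a, b) == (a and b)

print("NOT, OR, and AND reproduced from NOR for all input combinations")
```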
2.2.6 Dynamic and De Novo Coding
2.2.7 Integration of Software and Hardware
2.3 Cognitive Chemistry Algorithm: A Biomimetic Multilayer Molecular Coding Algorithm for Solving NP Problems
In computational complexity theory, nondeterministic polynomial time (NP) problems are a class of decision problems that are solvable in polynomial time by a theoretical nondeterministic Turing machine; equivalently, a proposed solution can be verified in a time that is a polynomial function of the size of the input data. Many combinatorial optimization problems belong to this family. For the hardest NP problems, however, no polynomial-time algorithm is known for a deterministic machine. Therefore, as the size of the input data increases, the operation time of the known exact methods increases exponentially. Whether NP problems can be solved in polynomial time is one of the most important open questions in mathematics. NP problems are also categorized as a class of decision-making problems in computational complexity theory. The main importance of NP problems is related to their vast application in the design of decision-making algorithms in artificial intelligence (AI).
As an example of NP problems, we refer to the traveling salesman problem (TSP). The TSP is a combinatorial optimization problem that is important in both operations research and theoretical computer science. Briefly, the TSP asks the following question: given a list of cities and the distances between each pair of cities, what is the shortest possible route that visits each city exactly once and returns to the origin city? [2, 8].
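The TSP question can be stated directly as code. The following is a minimal brute-force sketch that tries every ordering of the cities sequentially; the four-city distance matrix is made up for illustration. It is a conventional exhaustive search, not the cognitive chemistry algorithm discussed in this chapter.

```python
from itertools import permutations

# Hypothetical symmetric distance matrix for four cities (indices 0..3).
DIST = [
    [0, 2, 9, 10],
    [2, 0, 6, 4],
    [9, 6, 0, 3],
    [10, 4, 3, 0],
]


def tour_length(tour, dist):
    """Sum the distances along consecutive cities in a closed tour."""
    return sum(dist[a][b] for a, b in zip(tour, tour[1:]))


def brute_force_tsp(dist, start=0):
    """Check every ordering of the remaining cities and keep the shortest tour."""
    others = [c for c in range(len(dist)) if c != start]
    best_tour, best_length = None, None
    for order in permutations(others):
        tour = (start, *order, start)
        length = tour_length(tour, dist)
        if best_length is None or length < best_length:
            best_tour, best_length = tour, length
    return best_tour, best_length


best_tour, best_length = brute_force_tsp(DIST)
print(best_tour, best_length)
```

The loop visits one candidate tour at a time, which is exactly the sequential style of operation whose cost grows factorially with the number of cities.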
The main limitation of current computers in solving NP problems is related to their sequential mode of operation. The theory of computer science is based on universal Turing machines (UTMs), which were initially described by Alan Turing [4, 33]. According to Turing’s theory, a task is computable if it can be specified by a sequence of instructions whose execution by such a machine results in the completion of the task. Modern digital computers are physical embodiments of classical UTMs. Modern computers operate at enormous speeds and are capable of executing more than $10^{15}$ instructions per second (the fastest computer at the time of writing has a Linpack performance of 93 petaFLOPS). However, their sequential approach to data processing is the main limitation for solving combinatorial problems such as the TSP.
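A rough calculation shows why raw speed does not rescue sequential exhaustive search. The sketch below counts the distinct round trips of a symmetric TSP, $(n-1)!/2$, and divides by the $10^{15}$ instructions per second quoted above. Treating the evaluation of one tour as a single instruction is a deliberately generous assumption made only for this illustration.

```python
import math

INSTRUCTIONS_PER_SECOND = 10**15      # figure quoted in the text
SECONDS_PER_YEAR = 365 * 24 * 3600


def tour_count(n_cities: int) -> int:
    """Number of distinct round trips in a symmetric TSP: (n - 1)! / 2."""
    if n_cities < 3:
        return 1
    return math.factorial(n_cities - 1) // 2


def exhaustive_search_years(n_cities: int) -> float:
    # Assumption: one instruction per tour, which strongly favors the computer.
    seconds = tour_count(n_cities) / INSTRUCTIONS_PER_SECOND
    return seconds / SECONDS_PER_YEAR


for n in (10, 20, 30):
    print(n, tour_count(n), f"{exhaustive_search_years(n):.3g} years")
```

Under these assumptions, 20 cities would take on the order of a minute, while 30 cities would already take on the order of a hundred million years, even though the input grew by only ten cities.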
Comparative analysis of genetics in different species reveals that a near-optimal solution to NP problems is not sufficient for biological purposes. Biological systems apply special data operation algorithms to find the exact optimal solutions to NP problems in real time.
There is still no model that provides a clear explanation for this high level of accuracy and efficiency in computational methods of biological systems.
On the other hand, recent studies on complexity problems across different industries indicated that real-life NP problems are geometrical problems with multidimensional complexity in their networks [4, 23].
In fact, current computing methods, which are based on linear operation, are not sufficient for solving NP problems with geometrical complexity in their networks. Solving NP problems requires innovative approaches to the design of algorithms with nonlinear and parallel operation capacities.
Biologically inspired algorithms such as neural networks and ant colony optimization [21, 38, 39] have been applied extensively for various computational purposes. At the molecular level, DNA computing was first studied by Leonard Adleman. Since then, DNA has been the major focus of several research studies on the design of biomimetic computing algorithms [1, 5, 13, 14, 18, 24, 27, 32, 39].
Other biomolecules, including RNA and proteins, have also been applied for computing purposes [8, 11, 18, 20, 34]. Faulhammer and coworkers reported an RNA-based computational solution to a chess problem. Unger and Moult designed a molecular computing system in which proteins were used as NAND logic gates. Nicolau et al. (2016) reported a parallel computational method with molecular-motor-propelled agents in nanofabricated protein-based networks.
However, there is still very limited information about biologically inspired algorithms that could simulate the multilayer coding and operation capacities of living cells. For example, gene regulatory networks and protein-protein signal transduction networks have yet to be applied as models for the generation of biomimetic computing algorithms.
Simulation of mechanisms of multilayer coding and data processing in cell signaling pathways can provide powerful tools in designing innovative algorithms for solving NP problems.
- The massive information storage capacity through multiple levels of coding.
- The capacity of massive parallel operation applying numerous molecular operators (e.g., DNA, RNA, and proteins).
- The capacity of autonomous data operation through the electrochemical interactions of molecules and atoms with each other, which can be described through the quantum mechanical effects of atoms on each other (e.g., the autonomous formation of the three-dimensional conformation of proteins through amino acid-amino acid interactions during protein folding).
- The capacity of multilayer operation applying different types of coding molecules and materials at different levels of complexity.
Figure 2.27 illustrates schematic images representing the massive operation capacity of the cognitive chemistry coding system for solving NP problems at different layers of complexity, including gene regulatory networks, protein-protein interaction networks, and amino acid interaction networks during protein folding.
The autonomous and combinatorial nature of molecular interactions, at each coding and operation layer, leads to the real-time and massive operation capacity of the system in solving NP problems. Though not discussed here, the exquisite accuracy and efficiency of data operation in living cells through the highly complex and interconnected DNA-protein NP networks motivated us to develop a biomimetic computing algorithm. This algorithm leverages the wide computation potential of both nucleic acids and proteins for solving non-biological NP problems.
- 1. Adleman L (1994) Molecular computation of solutions to combinatorial problems. Science 266(5187):1021–1024
- 2. Afaq H, Saini S (2011) On the solutions to the traveling salesman problem using nature-inspired computing techniques. IJCSI 8:326–334
- 3. Allard A, Serrano MA, Garcia-Perez G, Boguna M (2017) The geometric nature of weights in real complex networks. Nat Commun 216:1–8
- 7. Faulhammer D, et al. (2000) Molecular computation: RNA solutions to chess problems. PNAS, USA 97(4):1385–1389
- 8. Garey MR, Johnson DS (1979) Computers and intractability: a guide to the theory of NP-completeness. W. H. Freeman & Company, New York, USA
- 15. Marks F, Klingmuller U, Muller-Decker K (2009) Cellular signal processing: an introduction to the molecular mechanisms of signal transduction. Garland Science, Taylor and Francis Group, USA
- 17. Nelson DL, Cox M (2017) Lehninger principles of biochemistry, 7th edn. W.H. Freeman & Company, New York
- 25. Schrodinger E (1944) What is life. Cambridge University Press, UK
- 26. Schrodinger E (1935) Die gegenwartige Situation in der Quantenmechanik. Die Naturwissenschaften 23(48):807–812
- 28. Singh S, Lodhi EA (2013) Study of variation in TSP using genetic algorithm and its operator comparison. IJSCE 3:2231–2307
- 30. Tarkov MS (2015) Solving the traveling salesman problem using a recurrent neural network. Am Anal Appl 8:275–283
- 32. Turing A (1936) On computable numbers, with an application to the Entscheidungsproblem. Proc Lond Math Soc II Ser 42:230–265
- 33. Unger R, Moult J (2006) Towards computing with proteins. Proteins 63(1):53–64
- 36. Weissman JA et al (2011) Generating and imaging multicolor Brainbow mice. Cold Spring Harbor Laboratory Protoc. https://doi.org/10.1101/pdb.top114