Origin and Basic Mechanism of Life

Abstract

This paper presents in a concise way the main characteristics of life from the physical point of view and the most successful theories of biogenesis, together with a mathematical formulation and simulation of proto-biogenesis. We present here a calculation method for biochemical reactions based on the available reaction data base, and using this method, we calculate precise scenarios for the first life cycle, and for the first stages of terrestrial biological evolution.

Share and Cite:

Helm, J. (2024) Origin and Basic Mechanism of Life. Open Journal of Biophysics, 14, 265-329. doi: 10.4236/ojbiphy.2024.143011.

1. Introduction

The evolution of life is known roughly down to LUCA (Last Universal Common Ancestor). Our knowledge about LUCA and the later life evolution is summarized in Chap. 2.11.

We have little evidence about the two preceding stages of life evolution: the prebiotic chemo-evolution and the proto-life cycle.

Important steps in the research concerning these two stages of life evolution can be summarized as follows.

1) Oparin-Haldane hypothesis

The earliest was the Oparin-Haldane hypothesis, which suggests that life arose from inorganic molecules, with basic amino acids and nucleobases forming first and building polymers within self-replicating lipid membranes [1] .

2) Miller-Urey experiment

The Miller-Urey experiment provided the first evidence that some amino acids and nucleobases could be formed in water from a reducing atmosphere, under heating and electric discharge [1] .

3) RNA-world

The RNA world hypothesis suggests that the first life was self-replicating primitive RNA [2] [3] .

4) PNA-world

PNA (peptide nucleic acid, i.e. peptides with nucleosides) has been proposed as a precursor of RNA and as a catalyzer and information carrier for peptide synthesis [2] [3] . This ansatz is shown in Chap. 2.6 and Chap. 3.2 as the most probable precursor of RNA-world and LUCA (PNA proto-cell), supported by theoretical and experimental evidence.

5) Wächtershäuser’s iron-sulfur world

Wächtershäuser traces the proto-biotic synthesis of amino acids and nucleobases back to simple, surface-mediated autotrophic reactions, driven by metal catalysts, mainly iron ions ( [2] ). Wächtershäuser’s pyrite cycle is shown in Chap. 2.6 to be the central energy cycle of the PNA proto-cell.

6) Life from HCN

Patel demonstrated that ribonucleotides, lipid-precursors and amino acids, can be formed from CH4, NH3, H2S in water under UV irradiation ( [2] ). This ansatz is proved to be successful as the basis for the numerically calculated model of PNA proto-cell in Chap. 2.6 and Chap. 3.2.

The tasks required to gain knowledge about these preceding stages can be summarized in the following scheme.

1) Prebiotic chemo-evolution

Here the first goal is to deduce the reactions and components of the chemo-evolution in space of amino acids and nucleobases.

The second goal is to find out the precursors, energy cycles, the environment, and the basic amino acids and nucleobases, which creates the conditions for the proto-life-cycle.

2) The evolution and the structure of the proto-life-cycle

Here it is required to find the minimal genetic code, which is consistent with the modern genetic code, and the reactions with the corresponding enzymes (peptides) and transferases.

The reaction network should be verified by calculation or by experiment or both.

In order to achieve this, the following methods are available

1) Prebiotic soup experiments

2) Reconstruction of primeval genetic code

This can be done by tracking down the oldest common genes, and by finding minimal sets of genes for life functionality.

3) Analysis of RNA code

Here the goal is to find the structure of early stages of genetic code.

4) Calculation of biological reaction networks

Here the goal is to find a reliable reaction model for peptide-catalyzed synthesis of poly-nucleobases, amino acids, and peptides.

For a given reaction

A + B ( k + ) C + D + Δ E resp. A + B ( k ) C + D + Δ E

with the mass action law

d C d t + d D d t = k + [ A ] n A [ B ] n B

we have the Arrhenius law for the reaction constant

k r = ( 1 / t 0 ) exp ( E a / k T )

The goal is here to calculate the fundamental constants char. Time t0 and activation energy Ea in dependence of the structure parameters (bonds, geometry) of the involved molecules A, B, C, D

t 0 = t 0 ( A , B , C , D ) , E a = E a ( A , B , C , D )

In Chap. 2, we develop a mathematical model with method 4, based on the current biochemical data base.

We use this model for precise calculation of the late prebiotic chemo-evolution and the proto-life cycle under plausible conditions, based on HCN and H2S as precursors, PNA-controlled peptide synthesis, with genetic proto-code with 5 genes, and pyrite reaction as energy cycle.

We develop plausible scenarios for the later evolution up to the LUCA stage, using method 2 and method 3.

Based on this, we present in Chap. 3 a complete scenario of terrestrial life evolution, and set up principles for biological life in general.

2. Principles and Evolution of Life Chemistry

2.1. Chemical Base of Life in General

Based on current experimental and theoretical evidence, we can conclude that the proto-biotic life chemistry in general (not only on Earth) develops in a liquid from one or several exothermic molecules (precursors) by selection-evolution via two parallel prebiotic random polymerization processes from basic compounds to catalyzers (enzymes) and synthesizers (polymer molecules which carry out template-based enzyme synthesis based on their sequence, i.e. genetic code) fed by an energy cycle. In terrestrial life chemistry, vesicle-based chemistry is a precondition for a full life-cycle with bio-matter production and proliferation (see Chap. 2.8.4). Therefore it is plausible to assume that in general there is a third class of polymers (layer-builders) produced by the life chemistry, which form a membrane and a self-dividing vesicle.

In extrapolation of these basics, we can say that life in general has 6 principal features (Figure 1):

- synthesis of biological matter (including basic compounds) in a liquid (terrestrial: water H2O) catalyzed by enzymes (terrestrial: proteins), where basic compounds (terrestrial: 20 amino acids, 4 nucleobases A, U, G, C) are synthesized from anorganic precursors (terrestrial primary: HCN, H2S)

Figure 1. Schematic life in general.

- use of an energy cycle with a continuous energy flow against entropic decay (terrestrial primary cycle: Wächtershäuser’s iron-sulfur cycle, terrestrial secondary cycle: photo-synthesis)

- life chemistry based on catalyzer enzymes (terrestrial: peptides) and synthesizers (genes, terrestrial: poly-nucleobases) built from basic compounds, and running within a cell-membrane, where synthesizers are more stable than enzymes; synthesizers serve as a pattern for enzyme structure, catalyze the enzyme synthesis, and preserve and copy the pattern

- spontaneous emergence of a proto-lifecycle, which starts reproduction and evolution through selection

- replication with copying of genes, and cell division within a fixed time interval (terrestrial: ca. 1000 s)

- evolution by mutation and adaptation to the environment with the goal of survival

We can formulate some boundary conditions for life:

• The only common element, which supports a large variety of molecules, and polymers, is carbon.

Terrestrial life is based on carbon chemistry, and presumably the same is true for most forms of life in the universe.

• The fluid carrier medium in terrestrial life is water, and water is the only common molecule with a fluid phase around T = 300 K.

• Biological carbon chemistry has activation energies in the range (25 kJ/mol, 130 kJ/mol), corresponding to per-molecule energy range (0.25 eV, 1.3 eV).

• Star light as an energy source has an approximate energy range

( E ( λ = 0.5 μ m ) = 2.5 eV , E ( λ = 1 μ m ) = 1.2 eV ), with E ( λ ) = h c λ , center wavelength λ = 0.5 μ m for sun-like stars, center wavelength λ 1 μ m for Red Dwarves, which is feasible as energy source for carbon chemistry.

• The thermal energy E t h ( T ) = k B T needed to overcome the activation energy in carbon chemistry is approximately (2σ above mean) E t h , c e 2 = E a , m i n , where E a , m i n = 0.25 eV for carbon chemistry, i.e. E t h , c = 0.034 eV , with corresponding minimum temperature T c = 390 K , which lies within the water fluidity temperature range.

We can extrapolate general features of the four life-functional molecular families.

• Precursors, solvents, energy cycle molecules

These molecules are synthesized in space, or belong to the primeval molecules on planets. Among them, there is at least one molecule, which is exothermic (supplies energy), and one which is the (fluid) solvent.

In terrestrial biology, precursors are: HCN, H2S, H2, CO, H3PO4, and the solvent is of course water H2O. Among these, HCN is the energy supplier and HCN + H2O are the atom suppliers, and determine the characteristic atom content ratio ( H : C : N : O ) = ( 3 : 1 : 1 : 1 ) in basic amino acids and nucleobases.

The energy cycle molecules participate in the basic energy cycle reaction, in the terrestrial biology they are: H2S, FeS which form the fundamental pyrite reaction F e S + H 2 S F e S 2 + 2 H + + 2 e .

• Enzymes

The enzymes are the catalyzers, which control the life synthesis reactions. They are chains consisting of basic molecules bound by one kind of chemical bond.

In terrestrial biology, they are peptides, which are linear chains of amino acids bound by the amino-bond with energy 8 - 16 kJ/mol.

• Synthesizers

The synthesizers are the coding molecules for the enzymes, they are more stable than the enzymes, and therefore are ring molecules (rings are normally more stable than their linear counterparts). Their diameter is smaller than the length of the linear enzymes by a factor 1/2 to 1/3. Therefore they form codons from 1, 2 or 3, which translate into one enzyme (1-bit, 2-bit, 3-bit code).

In terrestrial biology, they are poly-nucleobases (PNB’s), which are linear chains of nucleobases

complementary adenine Ade, guanine Gua: 5-ring + 6-ring

complementary thymine Thy, uracil Ura: 6-ring.

PNB’s are linked by hydrogen-bonds, in RNA by phosphodiester-bonds.

• Layer-bilders

These molecules form the protective semi-permeable membrane of the life vesicle (biological cell).

The membrane controls the influx and outflux of molecules, and the vesicle is capable of self-recreation.

In terrestrial biology, they are phospho-lipids, namely in the proto-cell glycerol-phosphate.

• Reaction network

Enzymes, synthesizers, and phospho-lipids are all synthesized in the life self-supporting reaction network catalyzed by enzymes.

Precursors, solvent, and energy cycle molecules are supplied outside the vesicle and enter resp. leave it through the membrane.

Alternative life-models

Carbon-nitrogen-hydrogen chemistry in water (terrestrial model)

T = 0˚C...150˚C (P = 1 - 100 bar)

activation energy range: (25 kJ/mol, 130 kJ/mol), per molecule (0.25 eV, 1.3 eV)

basic materials: H C N + F e S + H 2 S + H 3 P O 4

enzyme polymers: peptides,

synthesizers (genes): poly-nucleins

solvent: water

energy cycle: F e S + H 2 S , photosynthesis

location: water-carrying planets in habitable zone, in contact with mineral rock

This is the terrestrial life model, mainly dealt with in this paper. The spontaneous emergence of a proto-lifecycle is proved numerically based on chemical reaction data, and partly supported by direct observation (see Chap. 2.8.3, Chap. 3.2).

The proto-lifecycle runs with

- 3 amino acids Gly, Pro. Cys

- 2 nucleobases Gua, Cyt

- 5 peptides (enzymes) Gly2, Gly3, Pro, Pro2, GlyPro

- 1 lipid Gly1ph

- 5 PNB’s (genes) Gua2, Gua3, Cyt2, Cyt, GuaCyt

- 5 PNA’s GlyGua2, GlyGua3, ProCyt2, ProCyt, GlyGuaProCyt2

Carbon-hydrogen-oxygen chemistry in water

T = 0˚C...150˚C (P = 1 - 100 bar)

activation energy range: (25 kJ/mol, 130 kJ/mol), per molecule (0.25 eV, 1.3 eV)

basic materials: C O 2 + C H 4 + N H 3 , no iron or sulfur

enzyme polymers: poly-vinyl-alcohols, poly-ethylene-glycols, poly-acryl-amide

synthesizers (genes): aliphatic amines

solvent: water

energy cycle: photosynthesis, C O + H 2 O

location: water-carrying planets in habitable zone, no contact with minerals

This is a tentative life model for water environments without minerals, iron, and sulfur.

Aliphatic amines like tetra-methyl-ethylene-diamine function as catalyzers for poly-acryl-amides.

Carbon-hydrogen-nitrogen chemistry fluid hydrocarbons

T = 0˚C...300˚C (P = 1 - 100 bar)

basic materials: C H 4 + H 2 S + N H 3 , no oxygen

enzyme polymers: hydrocarbon linear-branched polymers with substitutions N, S

synthesizers (genes): polycyclic aromatic hydrocarbons (PAH) with substitutions N, S

solvent: fluid hydrocarbons

energy cycle: photosynthesis

location: waterless planets in habitable zone with dense atmosphere with hydrocarbons and nitrogen

This is a hypothetic life model based on hydrocarbon solvent, where the spontaneous emergence of proto-lifecycle with proto-genes from PAH’s and enzymes from poly-hydro-carbons with N and S substitutions is still to be proved.

This model is arguably the only viable alternative to water-based life models, it supports also temperatures up to 300˚C.

PAH’s are known to be common in the cosmic matter, and they are also rather stable and can function as catalyzers for hydrocarbons. PAH’s can polymerize under UV radiation [4] .

Silicon chemistry in molten rock

T = 1000˚C...1500˚C

basic materials: S i O 2 + A l 2 O 3 + M g O

polymers: polysilicates

solvent: molten silicates

energy cycle: radiation

location: hot Earth planets

The temperatures in this model are far outside the life-favorable range, therefore it is very probably no viable model for life.

Carbon + salts compounds in liquid ammonia

reactions probably too slow for life evolution

T = −78...−33˚C (P = 1 bar)

T = −77 C...98˚C (P = 60 bar)

solvent: liquid ammonia

basic materials: hydrocarbons, salts, organic metal complexes, organic carbon compounds C-H-N

polymers: complex hydrocarbons with oxyl-, amino- and metal-radicals

energy cycle: carbon-hydrogen chemistry?

location: Titan-like planets

The temperatures in this model are far below the life-favorable range, therefore it is very probably no viable model for life.

Carbon compounds in liquid ethane-methane

reactions probably too slow for life evolution

T = −183... −89˚C (P = 1 bar)

solvent: liquid ethane-methane

energy cycle: photosynthesis? tidal heating?

basic materials: organic carbon compounds C-H-N

polymers: complex hydrocarbons with oxyl-, amino-radicals

membrane (computer-modeled in February 2015): acrylonitrile similar to a phospholipid bilayer

location: Titan-like planets

The temperatures in this model are far below the life-favorable range, therefore it is very probably no viable model for life.

2.2. Principles of Terrestrial Life Chemistry

In the case of the terrestrial life, the process took place probably from the precursor molecules {HCN, H2} in water via prebiotic synthesis of amino acids and nucleobases under UV-radiation, and then parallel polymerization to catalyzers (poly-peptide enzymes) and synthesizers (peptide-nucleotides PNA) fed by the pyrite energy cycle (see Figure 2 “random prebiotic chemistry”).

The polymers poly-peptides and PNA are not very stable, especially in acidic water and temperatures around boiling point 100˚C, so they must be continually reproduced in order to survive [5] [6] : that is where the Darwinian evolution mechanism sets in. The amino acids and nucleobases also degrade, although at higher temperatures (T > 185˚C), so they, too, must be replenished [7] . On the other hand, the enzymes are continuously in contact with precursors molecules and with the energy cycle, so they degrade relatively quickly, so they must be reproduced as quickly. Simple copying is not as efficient as replication based on a stable pattern molecule: this is the role of the synthesizers (PNA).

The evolution selects a coupling between peptides and PNA-segments (genes), where the PNA-segments build-up the corresponding peptide from amino acids, and in turn each of the peptides catalyzes one of the required metabolic processes: synthesis of required amino acids and nucleobases from precursors, the energy cycle, synthesis of phospholipids and the build-up of the protective membrane.

Figure 2. Schematic random prebiotic chemistry.

The result for the proto-life cycle (Chap. 3.2) is a model1 3-bit binary (G/C) 5 × 2 genetic code (Gua-Cyt nucleobase pair) stored in the PNA (Leu-Gua and His-Cyt), coding for Gly and Pro amino acid sequences, which in turn catalyze the synthesis of 5 needed amino acids (Gly, Pro, Leu, His, Cys), nucleobases (Cyt, Gua) and phospholipid glycerol-phosphate, and the catalysis of the energy cycle via Cys with SH-radical (see Figure 3 “proto-life chemistry”).

This model is of course at first only a plausible scenario, but it is supported by the known prebiotic HCN-chemistry able to produce all amino acids, all nucleobases, phospholipids and sugars [8] [9] . Furthermore, the binary genetic code

Figure 3. Schematic terrestrial proto-life chemistry.

has been shown to be the probable original mechanism of PNA-RNA-coding and protein synthesis [10] [11] . Then, one can show by enzyme calculations, that Gly-Pro-peptides indeed act as enzymes in the synthesis of amino acids and nucleobases from (HCN, H2O, H2) (see Chap. 2.2, Chap. 2.4, Chap. 2.5 and [12] ). Finally, digital simulation based on a simplified model of the HCN chemistry shows that molecular evolution selects indeed a binary genetic code like the one proposed in [10] [11] .

A general consequence of the validity of this scenario is that the elementary enzymes (amino acids) and elementary synthesizers (nucleobases) are selected by evolution based on the precursor chemistry, i.e. here of the HCN-water chemistry, and not vice versa.

The present stage of terrestrial life evolution (chap. 3.5) represents the DNA-cell (bacteria, archaea, eukaryotes), which uses the full DNA genetic code with 20 amino acids, and corresponding 20 aaRNA’s for peptide synthesis, 4 nucleotides: cytosine, thymine, adenine, guanine (thymine replaces uracil in RNA-coding). The pairs A-G and C-T are complementary in the DNA-double-helix. The self-sustainable energy cycle is mostly the photosynthesis (plants, cyanobacteria). The self-sustainable energy cycle is mostly photosynthesis (plants, cyanobacteria).

2.3. Empirical Models of Molecular Energy and Reaction Rate

In principle, it is possible with methods of quantum chemistry (e.g. Hartree-Fock calculation) to calculate the chemical evolution in time of a given mixture of molecules in liquid solution, given the initial concentration, and physical conditions (temperature, pressure, external flow of matter, energy supply by radiation and spark discharge), and based solely on the structure of initially given and of emerging molecules. In reality, in order to calculate a realistic scenario in this way, the needed computational power exceeds by far the performance of today’s supercomputers.

There are two kinds of data, which are needed for a structure-based numerical simulation of chemical evolution.

First, we need the molecular formation energy Hf (molecular energy for short) for a given molecule, as a function of its bond structure. Hf is required for the calculation of the reaction energy ΔE of a given reaction A + B ( k + ) C + Δ E .

Second, we need the reaction rate d C ( t ) d t for such a reaction as a function of the bond structure of the reacting molecules A and B (here C(t) denotes the concentration of the compound C).

2.3.1. Empirical Model of Molecular Energy

The molecular forming free enthalpy ΔHf has been measured for hundreds of molecules, values can be found in [13] [14] [15] [16] . It is defined as the forming energy from a natural state at normal conditions (T = 300 K, P = 1 bar). The molecular forming energy from atoms Hf is calculated as Hf = ΔHfE(gas), where E(gas) is the forming energy of gaseous components (H2, O2, N2) in the molecule, e.g. for glycine we get

formula C2H5NO2

ΔHf = −390.5 kJ/mol [13]

E ( gas ) = E ( 2 . 5 HH , O = O , 0.5 N N ) = 2.5 432 + 494 + 0.5 942 = 2045

H f = Δ H f E ( gas ) = 2045 390.5 = 2435.5 kJ / mol

where the bond energies of HH = 432 kJ/mol in H2, (O=O) = 494 kJ/mol in O2, (N≡N) = 942 kJ/mol in N2 [16] .

The values of Hf are calculated in [16] .

A molecule can be characterized by a list of bonds, e.g. for glycine we get {OH, CO, C2O, CC, 2CH, CN, 2NH}, where C2O is the double bond C=O [16] . The values of the bond energies are given in data tables like [13] [14] . The sum of bond energies is in most cases considerably higher than the molecular energy Hf, the actual bond energy Eb in a molecule is mostly lower than the measured breaking energy of a bond. The bond factor fb depends predominantly on the bond, and, much more weakly, on the neighboring bonds in the molecules.

The empirical formula with the bond factors becomes H f = i f b , i E b , i , where the bond factors fb,i are fitted on a large set of 67 molecules, among them amino acids, nucleobases and biochemical building block molecules [12] [16] .

The original bond energies Eb,i

{H1HO = 21., H3N1H = 13., HO1H = 138., CH = 411., C3N = 887., H1CN = 120., CN1CN = 535., HH = 432., CO = 358., C1OH = 14.2, NH = 314., OH = 459., C2O = 799., CC = 346., CN = 305., C2C = 602., C3C = 835., C2N = 615., CS = 272., SH = 363., NN = 167., C2S = 573, NO = 201., N2O = 607., O2O = 494., N3N = 942., OO = 142., FeS = 315., Fe2S = 330., P2O = 544., PO = 335.}

are factorized with the fb,i and modified

{CH = 126.369, C3N = 596.779, CO = 345.13, NH = 381.107, OH = 341.591, C2O = 551.761, CC = 34.60, CN = 167.601, C2C = 60.20, C3C = 612.333, C2N = 313.895, CS = 214.021, SH = 36.30, NN = 317.912, C2S = 420.200, NO = 402.000, N2O = 637.217, P2O = 533.905, PO = 457.687}

with mean relative error merr = 0.056

example: for glycine measured (absolute) Hfm = 2435.5

calculated H f = i f b , i E b , i = 2499.9 , with the relative error rerr(Hf) = 64.4/2435.5 = 0.026.

2.3.2. Empirical Model of Reaction Rate

In physical chemistry, the reaction rate of a reaction is described by the mass action law [17] .

A + B ( k + ) C + D + Δ E resp. A + B ( k ) C + D + Δ E (1)

k + [ A ] n A [ B ] n B = k [ C ] n C [ D ] n D at equilibrium, where [A] is concentration and nA is the multiplicity of A, and ΔE is the reaction energy, the reaction is exothermic, when ΔE > 0.

The time-dependent form of the mass action law is

d C d t + d D d t = k + [ A ] n A [ B ] n B (2)

The Arrhenius law for the reaction constant states that [17]

k r = A 0 exp ( E a / k T ) (3)

where Ea is the activation energy, A0 is the pre-exponential factor.

The reaction energy is the difference of the sum of the bond energies of input and output:

Δ E = k E o b , k k E i b , k (4)

When concentration are measured in (dimensionless) relative mole (and not 1/mole as usual), the constant A0 has the dimension 1/s, i.e. A 0 = 1 t 0 , where t0 is the interaction reaction time, in biochemical reactions under normal conditions in water, t0 ~ 10−9 s. The constant A0 is only weakly temperature-dependent.

The rate constant A0 depends on the diffusion constants and the critical length (mean free path) λ of the liquid ( A 0 + ) 1 = t 0 = λ 2 / 6 ( D A + D B ) [17] , where D is the diffusion constant.

In liquids, D is described by the Einstein formula D = k T b π r 0 η , where η is the liquid viscosity, r0 is the molecule radius, b = 6 for large molecules.

For linear molecules, we can approximate r0 by r 0 = L m d m [18] , where the molecule is described by a cross section area of length Lm and diameter dm.

So we can describe D by the formula D = D 0 k R b , k [18] , where Rb,k are the bond lengths in A and D0 (solvent) is a diffusivity constant of the solvent, here water.

Fitting measurement data with these models shows that the geometric mean of the Di rather than the arithmetic mean in the formula for t0 is a good model:

t 0 = λ 2 3 D 1 D 2 (5)

Fitting the measured t0 with these models with the bond-factors fb,k, the diffusivity D0 and the mean free path λ yields the values

λ ( water ) = 1.62 nm , D 0 ( water ) = 2.52 × 10 9 m 2 / s

The resulting bond length in A are

H1HO = 0.0268, H3N1H = 10.10, CH = 0.19027, C3N = 0.0116, HH = 0.0102, CO = 0.0143, C1OH = 0.020, NH = 0.0101, OH = 0.3861, C2O = 0.0120, CC = 0.01540, CN = 0.0147, C2C = 0.0134, C2N = 0.0129, CS = 6.18268, NO = 0.0140, N2O = 0.01210, O2O = 12.10, N3N = 0.010, OO = 0.2257, D0 = 0.7681, λ = 2.04029

with mean relative error merr = 0.073.

In the same way, fitting the activation energy Ea with the bond-factors fb,k of the bond energies Eb,k in the model,

E a = | k f b , k E i b , k N i k f b , k E o b , k N o | ,

where Eib,k resp. Eob,k are the input resp. output bond energies yields the following values for corrected bond energies E ˜ b , k = f b , k E b , k in kJ/mol

{H1HO = 112.775, H3N1H = 130., HO1H = 292.07, CH = 58.5568, C3N = 939.947, HH = 2406.09, CO = 211.088, C1OH = 0.142, NH = 322.903, OH = 87.8274, C2O = 207.482, CC = 307.252, CN = 327.516, C2C = 622.483, C2N = 495.348, CS = 2221.8, SH = 3159.27, NO = 917.996, N2O = 204.21, O2O = 302.165, N3N = 1210.54, OO = 737.965}

with mean relative error merr = 0.066.

2.4. Theory of Enzymes: Lock-and-Key Theory

Enzymes (in terrestrial life peptides) are the key element of life in general. They accelerate specific reactions 104 - 106 fold and, coupled with synthesizers (PNA/RNA), they enable the reproduction and survival of all life molecules, including themselves.

The specific action of an enzyme with a single substrate (precursor molecules) can be explained using a lock-and-key analogy. In this analogy, the lock is the enzyme and the key is the substrate. Only the correctly sized key (substrate) fits into the key hole (active site) of the lock (enzyme).

Below, we present the mathematical formulation of the action of an enzyme: the substrate (with site bonds En) allocates along the enzyme (with site bonds E n ), the substrate sites are bound to the substrate sites with bond energy Fn. The substrate bonds are then weighted with local density pn, and the resulting mean bond energy E b c = n p n E n is considerably lower than the original mean bond energy E b m = n E n / N : this is threshold-reducing action of an enzyme, illustrated in the graphics below Figure 4 [19] .

Initially assuming a well-mixed distribution of enzymes and substrate of equal concentration, we view the “lock” as constantly-spaced enzyme molecules of density profile r n = r ( x n + Δ x / 2 ) , xn = nΔx, Δx small, with density values rn.

The substrate (or “key”) molecules are particle pairs having a local density profile pn = p(xn) at positions xn = nΔx. Each enzyme-substrate “complex” locally lowers the activation energy of the reaction so that overall activation energy is maximally lowered when all key particles are closest (Kullback-Leibler-distance) to the corresponding lock particles. The KL-distance is a rn-weighted Boltzmann entropy of the profile pn, which makes it a plausible ansatz from the viewpoint of thermodynamics.

The enzyme KL-model is formulated as follows.

Minimization problem:

Figure 4. Information in living systems manifests through “temporal gradients”. Here the system contains initially two substrates and one enzyme. In the absence of the enzyme, reaction CG + H will proceed more rapidly because it has both lower final free energy and lower activation energy. However, the enzyme lowers the Ea for reaction BE + F. The information in the enzyme produces an observable gradient over time as the concentrations of E and F are increased and B is decreased when compared to an uncatalyzed system. In contrast, because of its specificity, the enzyme has no effect on the temporal evolution of the substrate and product concentrations of reaction CG + H.

H K L ( p r ) = n = 1 N p n ln [ p n r n ] = min (6)

n p n = 1 , constraint: normalization substrate, with bond energies En, n E n = E b , Eb is the molecular energy of the substrate.

Enzyme density values rn are normalized n r n = 1 , with bond energies E n and the molecular energy n E n = E b , and r n = E n E b , E b is the molecular energy of the enzyme.

With Lagrange-multipliers we get the minimization problem

n = 1 N p n ln ( p n r n ) Λ 1 ( n = 1 N p n 1 ) = min

Differentiation p n and Λ 1 gives N + 1 equations for N + 1 variables {pn, Λ1}

1 + ln ( p n r n ) Λ 1 = 0 and n = 1 N p n = 1

we get 1 n = 1 N 1 p n = p N and Λ 1 = 1 + ln ( p N r N ) as 2 equations for pN and Λ1 and p n = r n exp ( Λ 1 1 )

In this case, differentiation yields the (approximate) local minimum pi = ri with min = 0, numerical minimization yields global minimum with negative values, the solution has one large value pi0 close to 1, at the index of the smallest ri ( r i 0 = min ( r i ) ), and the rest of the pi is small. The enzyme coupling “selects” the weakest bond and preserves it, and attenuates the energy of the other bonds: this is the catalyzer mechanism.

Example lock-and-key theory

We illustrate the Kullback-Leidler enzyme model by calculation of a concrete example: the prebiotic synthesis of glycine from HCN, water and hydrogen catalyzed by diglycine [12]

2 H C N + 2 H 2 O + 2 H d i g l y c i n e g l y c i n e + N H 3 , where g l y c i n e = O H C O C H 2 N H 2 and d i g l y c i n e = O H C O C H 2 N H C O C H 2 N H 2

We formulate the substrate-enzyme interaction pattern in the input as follows

H O H H C N H H H 2 O H C N substrate 2 H C N + 2 H 2 O + 2 H

O H C = O H C H N H C = O H C H H N H enzyme diglycine

with substrate bond energies in kJ/mol (H1HO is HH bond in water)

Ein = {H1HO, H1HO, CH, C3N, HH, H1HO, H1HO, CH, C3N} {143.694, 143.694, 143.694, 165.472, 658.049, 143.694, 931.529, 143.694, 143.694, 143.694, 143.694, 165.472, 658.049}

enzyme bond energies in kJ/mol

E i n = {C2O, C2O, CH, CH, NH, C2O, C2O, CH, CH, NH, NH} {110.436, 110.436, 165.472, 165.472, 381.609, 110.436, 110.436, 165.472, 165.472, 381.609, 381.609}

The KL-weights are

pin = {0.000751014, 0.990651, 0.00112518, 0.00112514, 0.0025965, 0.000750763, 0.000750619, 0.00112519, 0.00112512}

enzyme density values r i n = E i n / n E i n , {0.0743559, 0.0743559, 0.111411, 0.111411, 0.256934, 0.0743559, 0.0743559, 0.111411, 0.111411}

We formulate the substrate-enzyme interaction pattern in the output correspondingly

O H C = O H C H N H C = O C H 2 N H 2 enzyme diglycine

O H C = O H C H N H 2 N H 3 g l y c i n e = O H C O C H 2 N H 2 + N H 3

Eon = {OH, CO, C2O, CC, CH, CH, CN, NH, NH, NH, NH, NH} {257.109, 290.849, 110.436, 157.643, 165.472, 165.472, 35.7925, 381.609, 381.609, 381.609, 381.609, 381.609}

E o n = {OH, CO, C2O, CC, CH, CH, CN, NH, C2O, CH, CH, NH} {257.109, 290.849, 110.436, 157.643, 165.472, 165.472, 35.7925, 381.609, 110.436, 165.472, 165.472, 381.609}

enzyme density values r o n = E o n / n E o n , {0.107695, 0.121828, 0.0462586, 0.066032, 0.0693113, 0.0693113, 0.0149924, 0.159845, 0.0462586, 0.0693113, 0.0693113, 0.159845}

The KL-weights are

pon = {0.000213839, 0.000242668, 0.000090888, 0.000129961, 0.00013647, 0.00013647, 0.998051, 0.000317807, 0.0000907091, 0.00013647, 0.00013647, 0.000317807}

The mean bond energies original and original activation energy are

E i b m = ( n E i n ) / n i = 350.37

Figure 5. Schematic calculation of biochemical reactions (N = n (elements)).

E o b m = ( n E o n ) / n o = 257.57

E a = E o b m E i b m = 92.80

The mean bond energies catalyzed, and catalyzed activation energy are

E i b m = ( n p i n E i n ) / n i = 11.10

E o b m = ( n p o n E o n ) / n o = 2.79

E a = E o b m E i b m = 8.31

so the original activation energy is E a = 92.80 , and the catalyzed activation energy is E a = 8.31 , and the attenuation ratio is f a = E a E a = 11.17 .

The calculation of biochemical reaction rates from 2.3.2 is described below in Figure 5.

The calculation of lock-and-key enzyme model from 2.4 is described below in Figure 6.

Figure 6. Schematic catalytic action of enzyme.

2.5. Synthesis Reactions in Prebiotic HCN-Chemistry

We present here three prebiotic reactions of the HCN-chemistry, which generate important amino acids and nucleobases of the proto-life scenario described in the following section [20] . These reactions have been carried out in the lab by Patel et al. [8] and Das et al. [9] .

Cytosine reaction [8]

Glycine reaction [9] , including energy output in kJ/mol (negative = released)

Proline reaction [8]

2.6. Models of Proto-Code

We present here a plausible scenario of terrestrial proto-life based on the HCN-chemistry according to the Patel-Das model [8] [9] , the binary genetic proto-code proposed by Carter-Wills [10] and Rodriguez et al. [11] , and the pyrite (iron-sulfur) energy cycle proposed by Wächtershäuser [21] .

There are several basic features of the terrestrial life chemistry, which support this scenario

- the basic components, amino acids and nucleobases, have roughly the composition scheme CxNxH2xO2, which corresponds to the precursor input xHCN + 2H2O + xH of the HCN chemistry with hydrogen provided by the energy cycle.

- the maximum activation energy Ea of the spontaneous HCN-chemistry is 40 kJ/mol [8] , which fits very well with the reaction energy ΔE = 41 kJ/mol of the iron-sulfur-cycle, the iron-sulfur-cycle and the reverse acetogenesis, both involving sulfur, are the two main non-photogenetic energy cycles (apart from the Wood-Ljungdahl cycle) used in the terrestrial life chemistry.

- both basic component families can be traced back to fundamental components.

For the amino acids, it is the special group C of amino acids, set apart from the remaining groups A (electrically charged side chain), B (polar uncharged side chain) and D (hydrophobic side chain). The special group C consists of the simplest linear amino acid glycine, the simplest ring-amino-acid proline with its penta-ring and the simplest sulfur-amino acid cysteine: these are the 3 components of the model1-proto-code.

For the nucleobases it is cytosine, the hexa-ring nucleobase, and the coupled nucleobase guanine, which is a double hexa-penta-ring molecule, these are the 2 nucleobases of the model1 proto-code.

The model2 proto-code consists of max-4-bit-codons (guanine Gua or cytosine Cyt) coding for 2 amino acids (glycine Gly or proline Pro). It is a 1-1 code, i.e. a gene has only one codon, so there is no need for several steps of “sequencer” tRNA’s as in the full DNA-code, the peptide synthesis in the proto-life-cycle is a 1-step process. The codon = gene e.g. GG for Gly is represented by the corresponding poly-nucleobase PNB GuaGua, which “grabs” the corresponding amino acids (in this case Gly) via H-bonds (consuming energy) and forms the corresponding “stacked” poly-nucleic-acid (PNA), which after completion spontaneously splits-off the complete peptide (releasing energy).

The enzymes are Gly-Pro-sequences, which catalyze the synthesis of the 8 needed compounds (5 amino acids Gly, Pro, Leu, His, Cys, 2 nucleobases Gua, Cyt, 1 phospholipid glycerol-1-phosphate) from the precursors (hydrogen cyanide HCN, H2O, H, hydrogen sulfide H2S, phosphoric acid H3PO4).

The model1 proto-code is a simpler version with max-3-bit codons, where the 2 additional amino acids Leu and His in the PNA are replaced by the enzyme-building amino acids Gly and Pro.

This is a minimalistic version of a proto-code: at least 2 amino acids are needed as elements of enzymes (here Gly, Pro), a third (here Cys) containing sulfur is needed for catalyzing the energy cycle and stabilizing the lipid membrane.

Glycine is the simplest amino acid and is a linear molecule with 2 carbon-nodes (COOH-head and C-node with a NH2-radical), the corresponding nucleobase guanine is a double penta-hexa-ring with 5 C- and 4 N-nodes.

Proline consists of the COOH-head and a penta-ring with 4 C-nodes and one N-node, the corresponding nucleobase cytosine is a hexa-ring with 4 C-nodes and 2 N-nodes.

Therefore, it is obvious that enzymes, which catalyze linear molecules like glycine must contain glycine, and those which catalyze ring-molecules like proline or the nucleobases must contain proline.

During catalysis, the precursors align along the enzyme, so the enzyme must have at least as many nodes (C, N) as the resulting compound. The precursors for the ring-parts of the compound align along the ring-part of the enzyme. The precursor sequence is contiguous.

The alignment rules are:

● HCN binds to HCH or HCN

● water OH-H binds to CO

● HH resp. 2HH binds to NH or OH or (if none is available) to CO

2.6.1. Model2 Proto-Code with 5 Amino Acids

According to the above rules, we set up the model 2 proto-code with 7 proto-genes for 8 compounds (enthalpy values from [22] ) and 2 PNA’s for 5 amino acids, 1 lipid and 2 nucleobases.

glycine gene GlyGly, coded GG, produces glycine Gly

synthesis reaction

( H C N + 2 H 2 O ) + ( H C N + 2 H ) G l y + N H 3

molecule enthalpy Hf(Gly) = −2435.5 kJ/mol

enzyme-substrate alignment (NH-CO is the peptide bond between amino acids)

OH-H HCN HH OH-H HCN precursors

OHCO HCH NH CO HCH NH2 GlyGly

cysteine gene GlyGlyGly, coded GGG, produces cysteine Cys (catalyzes pyrite reaction)

synthesis reaction

( H C N + 2 H 2 O ) + ( H C N + 5 H ) + ( H C N + H 2 S ) C y s + 2 N H 3

molecule enthalpy Hf(Cys) = −2780.1 kJ/mol

enzyme-substrate alignment

OH-H HCN HH OH-H HCN HH SH-H HCN H precursors

OHCO HCH NH CO HCH NH CO HCH NH2 GlyGlyGly

cysteine gene GlyGlyGly, coded GGG, produces glycerol-1-phosphate with phosphorous acid (Glyc1Ph forms membranes)

synthesis reaction

3 H C N + 2 H 2 O + 8 H + H 3 P O 4 G l y c 1 P h + 3 N H 3

molecule enthalpy Hf(Glyc1Ph) = −5033.8 kJ/mol

enzyme-substrate alignment

HH OH-H HCN HH OH-H HCN HH HPO4 HCN 2HH precursors

OH CO HCH NH CO HCH NH CO HCH NH2 GlyGlyGly

proline gene ProPro, coded CC, produces proline Pro

synthesis reaction

( H C N + 2 H 2 O ) + 4 H C N + 12 H P r o + 4 N H 3

molecule enthalpy Hf(Pro) = −3275.2 kJ/mol

enzyme-substrate alignment (N-CO is the peptide bond between amino acids, the penta-ring is split at CH-CH2)

HH OH-H HH HH HCN HCN HCN precursors

OH CO CH N HCH HCH HCH Pro

CO CH NH HCH HCH HCH Pro

OH-H HH HH HCN HCN HH precursors

cytosine gene Pro, coded C, produces the nucleobase cytosine Cyt

synthesis reaction

4 H C N + H 2 O + 2 H C y t + N H 3

molecule enthalpy Hf(Cyt) = −2799. kJ/mol

enzyme-substrate alignment (the penta-ring of Pro is split at CH-CH2, marked ---)

OH-H HCN HH HCN HCN HCN precursors

OH CO CH NH HCH HCH HCH Pro

guanine gene GlyPro, coded GC, produces the nucleobase guanine Gua

synthesis reaction

5 H C N + H 2 O G u a + H 2

molecule enthalpy Hf(Gua) = −3753.3 kJ/mol

enzyme-substrate alignment (the penta-ring of Pro is split at CH-CH2, marked ---)

OH-H HCN OH-H HCN HCN HCN HCN precursors

OH CO HCH NH CO CH NH HCH HCH HCH GlyPro

leucine gene GlyGlyGlyGly, coded GGGG, produces leucine Leu

synthesis reaction

( H C N + 2 H 2 O ) + 5 H C N + 18 H L e u + 5 N H 3

molecule enthalpy Hf(Leu) = −4181.7 kJ/mol

enzyme-substrate alignment

HH OH-H HCN 2HH OH-H HCN 2HH HCN HCN 2HH HCN HCN 2HH

precursors

OH CO HCH NH CO HCH NH CO HCH NH CO HCH NH2 GlyGlyGlyGly

histidine gene ProGlyGly, coded CGG, produces histidine His

synthesis reaction

( H C N + 2 H 2 O ) + 5 H C N + 8 H H i s + 3 N H 3

molecule enthalpy Hf(His) = −4259.7 kJ/mol

enzyme-substrate alignment (the penta-ring of Pro is split at CH-CH2)

2HH OH-H HCN HCN HCN HCN precursors

OH CO CH N HCH HCH HCH Pro

CO HCH NH CO HCH NH2 GlyGly

2HH HCN 2HH OH-H HCN 2HH precursors

The ligase and PNA for Gly is Leu-Gua bound by the peptide bond –NH-C= O-

L e u + G u a L e u G u a + H 2 O

The ligase and PNA for Pro is His-Cyt bound by the peptide bond –NH-C= O-

H i s + C y t H i s C y t + H 2 O

The peptide bond, which forms poly-peptides, has a bond energy of 8 - 16 kJ/mol.

The hexa-rings of the PNA’s are stacked on each other by H-bonds, and the amino acids do not form peptide bonds, because they are not aligned properly, they form H-bonds: the resulting structure is a poly-PNA built from Leu-Gua and His-Cyt. During the synthesis or the replication process however, the adjoining acid forms a peptide-bond with its twin, and at the end the peptide bond breaks, because it is weaker than the several H-bonds between stacked amino acids of the PNA-chain: the complete enzyme (resp. PNA-copy) breaks off.

The model2-version poly-PNA from Leu-Gua and His-Cyt elements is more stable than the model1-version from Gly-Gua and Pro-Cyt, its H-bonds are stronger, because the active linear part of the amino acid is twice as long: this is an advantage of the 5-gene-code. In the simple PNA formation reactions of model 1 (see reaction table below),

G l y + G u a G l y G u a + H 2 O , ΔE = 339 kJ/mol, Ea = 14 kJ/mol

P r o + C y t P r o C y t + H 2 O , ΔE = −44.3 kJ/mol, Ea = 12.6 kJ/mol

the Pro-reaction is endothermic (needs energy).

In the corresponding model2-reactions

Leu + Gua LeuGua + H 2 O , ΔE = 339 kJ/mol, Ea = 10 kJ/mol

His + Cyt HisCyt + H 2 O , ΔE = 109.4 kJ/mol, Ea = 11.9 kJ/mol

the His-reaction is exothermic, and both activation energies are lower than with their model1-counterparts.

2.6.2. Simplified Model1 Proto-Code with 3 Amino Acids

One gets a simpler version of the proto-code if the 2 additional amino acids Leu and His in the PNA are replaced by the enzyme-building amino acids Gly and Pro, as described above.

The proto-code consists now of binary 3-bit-codons (guanine Gua or cytosine Cyt) coding for 2 amino acids (glycine Gly or proline Pro). The enzymes are Gly-Pro-sequences, which catalyze the synthesis of the 6 needed compounds (3 amino acids Gly, Pro, Cys, 2 nucleobases Gua, Cyt, 1 phospholipid glycerol-1-phosphate) from the precursors (hydrogen cyanide HCN, H2O, H, hydrogen sulfide H2S, phosphoric acid H3PO4).

The simplified model1 proto-code with 5 proto-genes for 6 compounds (enthalpy values from [22] ) and 2 PNA’s for 3 amino acids, 1 lipid and 2 nucleobases is now as follows.

glycine gene enzyme GlyGly, coded GG, produces glycine Gly

synthesis reaction

( H C N + 2 H 2 O ) + ( H C N + 2 H ) G l y + N H 3

molecule enthalpy ΔH(Gly) = −390.5 kJ/mol (negative = energy released in synthesis)

enzyme-substrate alignment (NH-CO is the peptide bond between amino acids)

OH-H HCN HH OH-H HCN precursors

OHCO HCH NH CO HCH NH2 GlyGly

cysteine gene enzyme GlyGlyGly, coded GGG, produces cysteine Cys (catalyzes pyrite reaction)

synthesis reaction

( H C N + 2 H 2 O ) + ( H C N + 4 H ) + ( H C N + H 2 S ) C y s + 2 N H 3

molecule enthalpy ΔH(Cys) = −534.1 kJ/mol

enzyme-substrate alignment

OH-H HCN HH OH-H HCN HH SH-H HCN H precursors

OHCO HCH NH CO HCH NH CO HCH NH2 GlyGlyGly

cysteine gene enzyme GlyGlyGly, coded GGG, produces glycerol-1-phosphate with phosphorous acid (Glyc1Ph forms membranes)

synthesis reaction

3 H C N + 2 H 2 O + 8 H + H 3 P O 4 G l y c 1 P h + 3 N H 3

molecule enthalpy ΔH(Glycerol) = −577.9 kJ/mol

enzyme-substrate alignment

HH OH-H HCN HH OH-H HCN HH HPO4 HCN 2HH precursors

OH CO HCH NH CO HCH NH CO HCH NH2 GlyGlyGly

proline gene enzyme ProPro, coded CC, produces proline Pro

synthesis reaction

( H C N + 2 H 2 O ) + 4 H C N + 12 H P r o + 4 N H 3

molecule enthalpy ΔH(Pro) = −366.2 kJ/mol

enzyme-substrate alignment (N-CO is the peptide bond between amino acids, the penta-ring is split at CH-CH2)

HH OH-H HH HH HCN HCN HCN precursors

OH CO CH N HCH HCH HCH Pro

CO CH NH HCH HCH HCH Pro

OH-H HH HH HCN HCN HH precursors

cytosine gene enzyme Pro, coded C, produces the nucleobase cytosine Cyt

synthesis reaction

4 H C N + H 2 O + 2 H C y t + N H 3

molecule enthalpy ΔH(Cyt) = −221.3 kJ/mol

enzyme-substrate alignment (the penta-ring of Pro is split at CH-CH2, marked ---)

OH-H HCN HH HCN HCN HCN precursors

OH CO CH NH HCH HCH HCH Pro

guanine gene enzyme GlyPro, coded GC, produces the nucleobase guanine Gua (alignment and catalyzing not so good as ProGly)

synthesis reaction

5 H C N + H 2 O G u a

molecule enthalpy ΔH(Gua) = −183.9 kJ/mol

enzyme-substrate alignment (the penta-ring of Pro is split at CH-CH2, marked ---)

HCN OH-H HCN HCN HCN HCN precursors

OH CO HCH NH CO CH NH HCH HCH HCH GlyPro

guanine gene alternative enzyme ProGly, coded CG, produces the nucleobase guanine Gua

synthesis reaction

5 H C N + H 2 O G u a

molecule enthalpy ΔH(Gua) = −183.9 kJ/mol

enzyme-substrate alignment (the penta-ring of Pro is split at CH-CH2, marked ---)

HCN HCN HCN HCN precursors

CH N HCH HCH HCH ProGly

CO HCH HNH precursors

OH-H HCN ProGly

The ligase and PNA for Gly is GlyGua bound by the peptide bond –NH-C= O-

G l y + G u a G l y G u a + H 2 O

The ligase and PNA for Pro is ProCyt bound by the peptide bond –NH-C= O-

Pro + Cyt ProCyt + H 2 O

The peptide bond, which forms poly-peptides, has a bond energy of 8 - 16 kJ/mol.

The hexa-rings and the penta-rings of the PNA’s are stacked on each other by H-bonds, and the amino acids do not form peptide bonds, because they are not aligned properly, they form H-bonds: the resulting structure is a poly-PNA built from Gly-Gua and Pro-Cyt.

During the synthesis or the replication process however, the adjoining acid forms a peptide-bond with its twin, and at the end, the PNA peptide bond and the vertical H-bonds break, with energy input: the complete enzyme (resp. PNA-copy) breaks off.

During polymerization of PNA H-bonds form in those sites, which have H-atoms.

GlyGuaGlyGua has 6 vertical H-bonds (in red)

2 G l y G u a G l y G u a G l y G u a + 6 ( H − )

GlyGuaGlyGuaGlyGua has 12 vertical H-bonds (in red)

3 G l y G u a G l y G u a G l y G u a G l y G l u a + 12 ( H − )

ProCytProCyt has 9 vertical H-bonds (in red)

2 P r o C y t P r o C y t P r o C y t + 9 ( H − )

GlyGuaProCyt has 13 vertical H-bonds (in red), the arrow marks the alignment of the penta-rings

G l y G u a + P r o C y t G l y G u a P r o C y t + 13 ( H − )

The poly-peptide (enzyme) formation works by breaking-off of the amino-acid-stack from the PNA-stack under energy input ΔE.

During enzyme formation the peptide bond pb (PNA) breaks, as well as some of the H-bonds, and pb (peptide) is formed instead.

G l y G u a G l y G u a + Δ E G l y G l y + G u a G u a + p b ( G l y G l y ) 2 p b ( P N A ) 3 ( H − )

P r o C y t P r o C y t + Δ E P r o P r o + C y t C y t + p b ( P r o P r o ) 2 p b ( P N A ) 6 ( H − )

G l y G u a P r o C y t + Δ E G l y P r o + G u a C y t + p b ( G l y P r o ) 2 p b ( P N A ) 9 ( H − )

G l y G u a G l y G u a G l y G u a + Δ E G l y G l y G l y + G u a G u a G u a + 2 p b ( G l y G l y ) 3 p b ( P N A ) 6 ( H − )

2.7. Reactions of Model1 Proto-Code

The reactions of the model1-proto-code fall into 9 categories (Table 1, Table 2)

1) Catalyzed synthesis of basic compounds and energy reaction (pyrite reaction)

The basic compounds (amino acids and nucleobases) are synthesized from precursors HCN, H2O, H2, H2S, catalyzed by enzymes controlled by the corresponding genes.

Example: (ΔE reaction energy, Ea original activation energy, Ec catalyzed activation energy, nB number of bonds)

Direct glycine synthesis catalyzed by GlyGly:

Table 1. Reactions of model1 proto-code [12] (ΔE reaction energy (kJ/mol), Ea original activation energy (kJ/mol), Ec catalyzed activation energy (kJ/mol), reaction time t0 (ns), nB number of bonds).

Table 2. Model1 genetic code.

2 H C N + 2 H 2 O + H 2 G l y ( C O O H C H 2 N H 2 ) + N H 3 , enzyme GlyGly, ΔE = 223 kJ/mol, Ea = 94.8 kJ/mol, Ec = 7.6 kJ/mol, nB = 25

A key role plays the pyrite reaction, as energy provider, but foremost as the source of hydrogen for the synthesis of basic compounds:

F e S + H 2 S F e S 2 + H 2 , enzyme Cys, ΔE = 41 kJ/mol, Ea = 160.6 kJ/mol, Ec = 0.02 kJ/mol, nB = 16

The pyrite reaction has a low energy output, but a high activation energy, so it runs practically only with catalysis (uncatalyzed reaction time is t r = t 0 exp ( E a k T ) = 3.45 × 10 9 × 1.37 × 10 21 = 5.32 × 10 12 s = 1.69 × 10 5 a ).

The synthesis reactions have high reaction energy (ΔE = 100...1200 kJ/mol), high activation energy (Ea = 100...200 kJ/mol), and do not run in water without enzymes.

The “self-catalyzing” synthesis of proline via cytosine (ring) has a very low activation energy

C y t ( C 4 H 5 N 3 O 3 ) + H C N + 6 H 2 P r o ( C 5 H 9 N O 2 ) + H 2 O + 2 N H 3 , ΔE = 582.9 kJ/mol, Ea = 21.7 kJ/mol, nB = 16

The basic compounds synthesis reactions are all exothermic, and have high activation energy Ea = 92...145 kJ/mol (using cytosine-proline synthesis). when running without catalysis, the maximum reaction time is (Gua)

t r = t 0 exp ( E a k T ) = 0.99 × 10 9 × 1.59 × 10 19 = 1.57 × 10 10 s = 0.50 × 10 3 a

Apart from amino acid and nucleobase synthesis, the third important basic compounds are the phospholipids, which form the proto-membrane, here the synthesis of the lipid precursor glycerol-1-phosphate Glyc1ph from the basic precursor phosphoric acid H3PO4, catalyzed by triglycine GlyGlyGly 3 H C N + 2 H 2 O + 4 H 2 + H 3 P O 4 G l y c 1 p h + 3 N H 3 , ΔE = 395.5 kJ/mol, Ea = 139.5 kJ/mol, Ec = 6.6 kJ/mol, nB = 27.

2) Nucleobase polymerization

Nucleobases form stacks bound by H-bonds (poly-nucleobase = PNB), these reactions are spontaneous, strongly exothermic (ΔE ~600 kJ/mol) and have low activation energy (Ea = 5…10 kJ/mol).

C y t + C y t C y t C y t ( 3 H ) CytCyt is bound by 3 H-bonds, ΔE = 491 kJ/mol, Ea = 5 kJ/mol

3) Simple PNA formation

Simple peptide-nucleic-acid (PNA) form from an amino acid and a nucleobase

Gly + Gua GlyGua + H 2 O , ΔE = 339 kJ/mol, Ea = 14 kJ/mol

Pro + Cyt ProCyt + H 2 O , ΔE = −44.3 kJ/mol, Ea = 12.6 kJ/mol

These reactions are low exo- or low endothermic and have low activation energy.

4) Direct PNA polymerization

Simple PNA’s form chains

GlyGua + GlyGua GlyGuaGlyGua ( 6H ) , ΔE = 126 kJ/mol, Ea = 3 kJ/mol

5) PNA formation from amino acids and nucleobases

Two amino acids and a simple pNB form a PNA

Gly + Gly + GuaGua GlyGuaGlyGua ( + 2H ) + 2H 2 O , ΔE = 468 kJ/mol, Ea = 10 kJ/mol

These reactions are mostly endothermic and have a low activation energy.

6) Peptide forming

GlyGuaGlyGua + 2H 2 O GuaGua ( + 4H ) + GlyGly + H 2 O , ΔE = 682 kJ/mol, Ea = 11.3 kJ/mol

A peptide “breaks-off” from a PNA-stack, the reactions are exothermic and have low activation energy.

The sequence type2 → type5 → type6 is the PNA-controlled peptide synthesis, in contrast to type9, the spontaneous peptide synthesis (see below). This sequence is exothermic (the endothermic reactions type5 use the energy from the preceding reaction type2), whereas the reactions of type9 are endothermic, and their activation energy Ea is mostly lower than the corresponding Ea in type 9.

7) Diverse synthesis and energy cycles

The HCN chemistry as the basis for the proto-code reactions depends on a sufficient supply of HCN. As discussed in ( [2] Chap. 3.4), there is evidence of the presence of HCN, along with methane CH4 and ammonia NH3 in the early Earth atmosphere. But there are also cyanide synthesis reactions, which could produce HCN under volcanic pools or in hydrothermal vents.

CO 2 + NH 3 + H 2 HCN + 2H 2 O , ΔE = −90.9 kJ/mol, Ea = 74.2 kJ/mol

CO 2 + NH 3 + SH 2 HCN + 2H 2 O + S , ΔE = −111.5 kJ/mol, Ea = 6.5 kJ/mol

They have as reducing component H2 or SH2, which are supplied by the pyrite reaction.

They are both endothermic, so they depend upon energy supply, e.g. from the pyrite reaction.

Their activation energy Ea is relatively low, especially for the SH2-reaction: this one can run purely thermally, without enzymes.

There is the synthesis of the proto-sugar glyceraldehyde Gle, a precursor of ribose needed for RNA:

Gly + H 2 O Gle ( HCO HCOH H 2 COH ) + NH 3 , ΔE = 346.2 kJ/mol, Ea = 14 kJ/mol

As for energy cycles, there is the mentioned pyrite reaction

FeS + H 2 S FeS 2 + H 2 , ΔE = 41 kJ/mol, Ea = 160.6 kJ/mol

but also the CO2-reduction by hydrogen, which was present in the prebiotic Earth atmosphere

CO 2 + H 2 CO + H 2 O , ΔE = 90.1 kJ/mol, Ea = 115.8 kJ/mol

and oxidation of carbon monoxide CO in water, which is still used by methanogen bacteria [23]

CO + H 2 O HCOOH , ΔE = 548.8 kJ/mol, Ea = 33.3 kJ/mol

8) Decay of basic compounds

Under present biotic conditions on Earth, the basic compounds are long-lived, amino acids decompose in water thermally at temperature T > 185˚C at neutral pH, nucleobases have in water a half-life of 20 - 200 days at 100˚C, and they degrade at 250C at neutral pH [24] .

The decay via hydrolysis of the basic compounds functions via break-off of the C = O radical, which all of them contain, with the formation of formic acid. All decays are endothermic (run only with energy input), and have a low activation energy.

A typical decay reaction is the hydrolysis of glycine:

Gly ( COOH CH 2 NH 2 ) + H 2 O OH CH 2 NH 2 + HCOOH , ΔE = −335.9 kJ/mol, Ea = 9.0 kJ/mol

9) Direct peptide polymerization

These are the direct merging reactions of amino acids into peptides, without the interaction of PNA’s (see peptide formation):

Gly + Gly GlyGly + H 2 O , ΔE = −117.7 kJ/mol, Ea = 64.8 kJ/mol

Pro + Pro ProPro + H 2 O , ΔE = −1097.7 kJ/mol, Ea = 27.9 kJ/mol

These reactions are endothermic, with moderate activation energy, they run spontaneously only at high temperature and with a multiple energy input from the energy reaction ( Δ E ( Gly + Gly GlyGly + H 2 O ) 3 E ( FeS + H 2 S FeS 2 + H 2 ) = 123 kJ / mol ), i.e. very slowly even with a catalyzed energy reaction.

The opposite reactions like

GlyGly + H 2 O Gly + Gly

are exothermic decay reactions of peptides with a decomposition temperature Td, where t 0 exp ( E a / k T d ) = 1 s , so for t 0 10 9 s we get approximately for Gly: k T d = E a / log ( 10 9 ) and T d = 350 K = 77 ˚ C , which is in good agreement with the known decomposition temperature of poly-peptides (75˚C - 85˚C).

Therefore the PNA-controlled peptide formation sequence type2 → type5 → type6 is really necessary for the molecular evolution of life from the HCN-HCN-chemistry.

On the other hand, in the prebiotic chemistry on Earth there was a seed concentration of amino acids from meteorites (rough estimation for the concentration c = 10−6), but no polymers, i.e. no enzymes, and therefore no organic catalysis. In the initial prebiotic period before the onset of the model1 proto-code life cycle, peptide polymerization was spontaneous, with the support of an energy cycle: Cys-catalyzed pyrite reaction of H2S, CO2-reduction or CO-oxidation.

With seed-concentration for the basic compounds of c = 10−6, and remaining concentration from [23] given below, we get the energy production of the 3 energy cycles [12]

E p ( H 2 S ) = 0.0045 kJ / s mol , E p ( CO 2 ) = 5.3 × 10 7 kJ / s mol , E p ( CO ) = 0.00062 kJ / s mol ,

and using this, for the effective build-up time of G l y + G l y G l y G l y + H 2 O with the H2S-pyrite reaction

t c ( 2 Gly ) = t r ( 2 Gly ) c ( Gly ) Δ E ( 2 Gly ) t r ( 2 Gly ) 1 E p = 1.0 × 10 9 s = 31.7 a with the effective reaction time t r = t 0 exp ( E a k T ) .

2.8. Numerical Simulation of Terrestrial Proto-Life Evolution

2.8.1. Diffusion and Convection

Molecular flow plays an important part in biochemical reactions and must be taken into account in the simulation model.

In prebiotic chemistry, there are strong thermal and concentration gradients.

The purely diffusion-driven flow is very slow in water: the diffusivity constants are in the range of 10−9 m2/s, so in the length scale of L = δ θ ~ 10 cm ( δ θ is the width of the thermal boundary layer, see below), we get a time scale of t ~107 s for the thermal vent scenario, which is too slow. For the lipid-bubble scenario the length scale is the bubble radius Rm = 20 μm, and the time scale is t ~1 s [25] , here diffusion is a realistic mechanism.

For diffusion, we have Fick’s law [25]

c ( x , t ) t = D 2 c ( x , t ) x 2 (7)

For the gradient-driven osmosis through a membrane, the following relation holds [26] :

d c d t = A d V K D Δ c (8)

where V is the volume, D is the diffusion coefficient of the molecule, K is the (dimensionless) partition coefficient, A is the membrane area, d is the membrane thickness, c is the relative concentration, Δc is the difference in relative concentrations at the membrane. K depends on the molecule and the material of the membrane, and has values (for lipids and octanol) in the range log Kow = −4...6.5 [24] .

The thermal convection flow, driven by a thermal difference of ΔT ~80 K is much faster [25] [27] .

In thermal convection, there are two opposite forces: the buoyancy force Fb, driven by the thermal density gradient, and the viscosity force Fv driven by viscous friction.

We get the buoyancy force density F b = β Δ T ρ 0 g

G r = F b F v = g β * Δ c δ θ 3 ν 2 ,

where L = δ θ is the width of the thermal boundary layer, here δ θ = L N u , here Nu(L = 10 m, ΔT = 80 K) = 160 δθ = 6.3 cm [25] [27] and β * = 1 ρ ρ c and β * c Δ ρ ρ then follows F v = F b G r Δ ρ Δ ρ / ρ ν 2 δ θ 3 = ρ ν 2 δ θ 3 with the denominations: ρdensity, ΔT temperature difference, Δc concentration difference, L edge length of the reaction region, ν kinematic viscosity of water (m2/s), g gravitational acceleration constant, β water volume expansion coefficient (1/K).

Resulting viscosity force density is F b F v = Δ ρ g ρ d ( ν 2 x 3 ) d x Δ x = Δ ρ g ρ 3 ν 2 x 4 Δ x

so we get the acceleration Δ a ( x ) = Δ ρ ρ g 3 ν 2 x 4 Δ x = Δ x ( ρ ( x ) ρ ( x ) g 3 ν 2 x 4 )

and the convection time t c = 1 / ρ ( x ) ρ ( x ) g 3 ν 2 x 4 or for thermal convection

t c = 1 / T ( x ) T ( x ) g 3 ν 2 x 4 ,

With real values: ν ( water , T = 20 ˚ C ) = 1.0 × 10 6 m 2 / s β ( water , T = 20 ˚ C ) = 2.1 × 10 4 K 1

x = δθ = 6.3 cm, ΔT = 80 K, t c = 1 / 80 K 300 K 9.81 1 3 × 1.0 × 10 12 ( 6.3 × 10 2 ) 4 = 0.61 s and

F v F b , the viscosity force is negligible against the buoyancy force component,

and the convection velocity vc becomes v c = ( d t c d x ) 1 = 2 T ( x ) T ( x ) g 3 g ( ( T ( x ) T ( x ) ) 2 T ( x ) T ( x ) ) , with the given data, vc = 1.07 m/s.

If we have a concentration profile c(x), then the thermal convection-driven concentration flow will be

c ( t , x ) t = c ( t , x ) x / d t c d x = c ( t , x ) x v c (9)

2.8.2. Scenario1: Hydrothermal Vent with Spontaneous Synthesis of Basic Components

As was outlined in ( [2] Chap. 3.4), the most plausible scenario for the origin of the first life-cycle is hydrothermal vents in submarine volcanic rocks or volcanic pools. We carried out calculation based on this scenario, and on the reaction table in Chap. 2.7.

In this calculation, the reaction is described by a differential equation for the corresponding law of mass action according to the scheme

c ( t , x ) t = k c 1 ( t , x ) k 1 c 2 ( t , x ) k 2 c n ( t , x ) k n ,

where ci are the concentrations of the reaction participants with multiplicities ki, and with the reaction constant k = exp ( E a / k T ) t 0 , in time t and location x.

Furthermore, we have terms for three possible transport mechanisms (see Chap. 2.8.1)

diffusion c ( x , t ) t = D 2 c ( x , t ) x 2 (10a)

membrane osmosis c ( t , x 1 ) t = A d V K D ( c ( t , x 1 ) c 0 ) the boundary x1 (10b)

convection c ( t , x ) t = c ( t , x ) x v c (10c)

So in general, we have coupled algebraic (non-linear) partial differential equations of degree 1 in t, and of degree 1 or 2 in x.

We impose boundary conditions c ( t , x 1 ) = c 0 at the external boundary x1, and initial conditions in the form c ( 0 , x ) = c b ( x )

The parameters of scenario1 and its reactions can be described by the following scheme (Figure 7).

Scenario1 represents basically the primordial amino acid and peptide synthesis cycle (Table 3(a), Table 3(b)).

Scenario1 starts with realistic primordial concentrations (init1) of basic compounds (amino acids and nucleobases) c = 10−6 = 1 ppm, low concentration of peptides c = 10−8 = 0.01 ppm, and c = 0.001 for precursors.

A second simulation starts with higher “enriched” concentrations (init2) of c = 10−4 = 100 ppm for basic compounds, c = 10−6 = 1 ppm for peptides, and c = 0.01 for precursors.

The reaction network consists of spontaneous and peptide-catalyzed basic-compound-synthesis (t1), which uses the H2S energy cycle as a source of energy and hydrogen, and spontaneous peptide polymerization from amino acids (t9), which uses the CO energy cycle with its high energy yield, because it is highly exothermic.

The peptides decay thermally (t8) above decomposition temperature Tc.

The result [12] is an enrichment of amino acids, but no significant enrichment of nucleobases and peptides, as shown in Table 3 below.

2.8.3. Scenario2: Hydrothermal Vent with Proto-Lifecycle

Scenario2 is the full proto-life cycle from the reaction table in Chap. 2.7 (Figure 8), in the physical environment of a hydrothermal vent. It serves as an amplifier process for all involved molecule classes (amino acids, nucleobases, peptides,

Figure 7. (a) Schematic scenario1/2 model; (b) Schematic scenario1 reactions.

poly-nuclein-acids PNA, lipids), which are distributed in surroundings by thermal convection, keeping the concentration of bio-molecules in the thermal boundary layer stable and relatively low.

Table 3. (a) Scenario1 reactions simul1; (b) Scenario1 reactions simul2.

Scenario2 starts with enriched concentrations (init2) of c = 10−4 = 100 ppm for basic compounds, c = 10−6 = 1 ppm for peptides, and c = 0.01 for precursors.

The reaction network consists of spontaneous and peptide-catalyzed basic-compound-synthesis (t1, t9. t8), with the H2S energy cycle and the CO energy cycle, then the PNA-controlled peptide synthesis (t2, t5, t6, t4), and direct PNA-polymerization t3.

The PNA-controlled peptide synthesis works also without initial presence of peptides (i.e. initial peptide concentration c = 0), whereas the direct peptide polymerization t9 requires high rates of CO energy cycle to counteract the reverse reaction (direct peptide decay), which is exothermal and has the same activation energy Ea, although it has a little larger reaction times t0 and therefore is a little slower.

The results for convection velocity fvc = 0.5m/s are given in the following Table 4(a).

This can be formulated concisely in the following scheme (Table 4(b)):

time constant: t r 500 s

Figure 8. Schematic scenario2 reactions.

Table 4. (a) Scenario2 reactions [12] ; (b) Scenario2 amplification factor.

Figure 9. Scenario2 concentration buildup.

A typical concentration build-up for the amino acid Gly, the peptide Gly2 and the peptide Gua2 is shown below [12] (Figure 9).

2.8.4. Scenario3: Lipid Vesicles with Proto-Lifecycle and Proliferation

Scenario3 is the full proto-life cycle, confined in the interior of a lipid vesicle. As was reported in ( [2] Chap. 3.4), amino acids within a lipid membrane stabilize the lipid layer, and the lipids serve under certain conditions as catalyzer for peptide polymerization. Precursor molecules enter the vesicle via osmosis, but bio-molecules are enclosed in it, and are not carried away by convection, there is diffusion only within the vesicle. Therefore the concentration of bio-molecules can increase until it reaches a critical level, and the system becomes unstable, the vesicle divides and proliferation takes place. The reaction cycle now has all attributes of life.

Synthesis of basic compounds from precursors (“food”) using an energy cycle, self-regulation by catalysis through peptides-enzymes, PNA-gene-controlled enzyme production from amino acids, and proliferation through bio-matter production and vesicle division (Figure 10, Figure 11).

Scenario3 reaches significantly higher concentrations than scenario2, because the bio-molecules cannot leave the vesicle. Only precursor molecules can pass the membrane, for them the boundary condition c = c0 is valid at the membrane.

The PNB’s become now the proto-genes, which are stable in the cycle.

The basic components synthesis runs under catalytic peptide control, the pyrite energy cycle is catalyzed by Cys, PNA building from amino acids and PNB’s takes energy from the energy cycle, the PNA’s split into peptides and PNB’s releasing energy, and the cycle runs anew.

Figure 10. Scenario3 model.

Figure 11. Schematic scenario3 reactions.

The pyrite energy reaction delivers energy and H2 at the membrane, catalyzed by Cys, the insoluble FeS2 falls out in colloidal form.

The current of precursors through the membrane is governed by osmosis:

c ( t , x 1 ) t = f v m ( c ( t , x 1 ) c 0 )

and within the vesicle, there is diffusion for all molecules

c ( x , t ) t = f v d 2 c ( x , t ) x 2

If fvd = 0, i.e. without diffusion, the system reaches an equilibrium, and there is a solution for all times.

The diffusion introduces a definite critical time tc, where the system becomes unstable, i.e. where the solution of the differential equations ceases to exist: the vesicle divides, separates into two, and the reaction cycle starts again. This critical time depends on the system parameters and the initial concentration, and has values around 1000 s, which agrees well with the observed division periods of self-reproducing lipid-amino-acid vesicles and also of bacteria.

The results for osmosis constant fvm = 0.3 s−1 and diffusion constant fvd = 0.01 m2·s−1 are given in the following Table 5.

Table 5. Scenario3 reactions [12] .

The concentration c(t, x) for the amino acids Gly, Pro, the peptides Gly2, Gly3 and the PNA’s Gua2, GuaCyt is shown below [12] (Figure 12).

2.9. The Complete Scenario of the Terrestrial Life Origin with Proto-PNA Genetic Code

The result of the simulation is a completely realistic scenario of the origin of terrestrial life based on HCN and H2S as precursors, and the H2S energy cycle. It consists of two phases:

Phase1 (scenario2) is the enrichment of amino acids, nucleobases and peptides in a hydrothermal pool with amplification factors of 25 to 8000 and a time constant of 500s, the process is guided by thermal convection and proceeds via spontaneous peptide polymerization, which needs considerable energy taken from the more powerful CO-energy cycle.

Phase2 (scenario3) is the build-up of a genuine self-catalytic life-proto-cycle in a self-assembling lipid membrane.

The underlying genetic code is a proto-code with 3 amino acids (Gly, Pro, Cys) and 2 nucleobases (Gua, Cyt) with 5 proto-genes (GG, GGG, CC, C, GC) coding for 5 peptides (Gly2, Gly3, Pro2, Pro, GlyPro), which catalyze the synthesis of 5 basic components (Gly, Cys, Pro, Cyt, Gua). The peptide synthesis is carried out by peptide-nucleid-acids (PNA’s) generated in the proto-cycle, which correspond to the modern RNA-transferases. The process is guided by diffusion, and the calculation shows, that there is a periodic instability, i.e. a collapse-division time of the proto-cell of roughly 1000 s, which is about the same as the cell-division cycle time in modern bacteria. The PNB-proto-genes remain stable in the cycle and play the role of information carriers.

Figure 12. Scenario3 concentration buildup.

The process described above uses precise reactions and proto-code.

On the other hand, it is unrealistic that the proto-code described above is the only alternative. The PNA building is a spontaneous process, which is not very accurate. There will be other PNA’s, which will produce other similar peptides. Those peptides will survive, if they can at least partly catalyze a basic reaction,

That means, there will be not one gene PNB = GuaGua with the corresponding PNA = (GlyGua)2 for glycine synthesis, but a family of genes (quasi-gene in the terminology of Manfred Eigen). The set of quasi-genes forms a quasi-species, where the variable genes adapt to the changing environment.

The evolution leads to an increasing number of more and more complex genes and peptides, where the catalysis is more and more specific: the quasi-species evolves into a genuine species.

Considering all this, we get the following life-origin quasi-species scenario (Figure 13(a), Figure 13(b)).

Complete genesis of proto-life cycle reaction scenario

phase1: red, phase2: black

In phase1, there are additional spontaneous polymerization reactions fed by the additional energy cycle, which are replaced by peptide-catalyzed reactions in phase2.

Complete genesis of proto-life cycle model

phase1: red, phase2: black

Here is included in red the phase1 open hydrothermal pool, whose reactions yield the enrichment in basic compounds and peptides necessary for the start of phase2.

We have the following Table 6 of genetic proto-code (model1) of the proto-lifecycle, discussed in full in Chap. 3.2.

2.10. Advanced PNA Genetic Code with Non-Cyan Precursors

The model1 binary proto-life-cycle uses a binary 1-1-code, i.e. a 2-letter-code (G, C) with 1 codon per gene. It has 2 coding amino acids (Gly for G, Pro for C) and 3 precursors (HCN, H2S, H2).

HCN has a relatively low molecular energy (=free enthalpy Hf) of |Hf| = 551.9, so the life-cycle needs energy only for PNA-building, which is supplied by the pyrite energy cycle (ΔE = 41 kJ/mole), whereas the basic-compound-synthesis runs exothermally without energy input.

HCN forms by electric discharge or UV-light from CO2 and NH3, and is also destroyed by UV-light, so it was only available early in the Earth history. Later (after ~100 My) the life-cycle had to adapt to other, more stable precursors. The

Table 6. Genetic proto-code model1.

Figure 13. (a) Schematic complete genesis of proto-life cycle reactions; (b) Complete genesis of proto-life cycle model.

most plausible and least energy-consuming alternative is (CO, NH3, H2S, H2), where the energy needed for the basic-compound-synthesis is in the range 140 - 400 kJ/mol, and can be supplied by multiple pyrite energy cycle (or single CO energy cycle).

The synthesis of the 2 basic amino acids (Gly, Pro) and the 2 nucleobases (Gua, Cyt) runs according to the scheme ( [16] p. 94)

2CO + NH 3 + H 2 Gly ( C 2 H 5 NO 2 ) , ΔE = −139 kJ/mole (endothermal)

5CO + NH 3 + 6H 2 Pro ( C 5 H 9 NO 2 ) + 3H 2 O , ΔE = −163.3 kJ/mol

4CO + 3NH 3 + H 2 Cyt ( C 4 H 5 N 3 O ) + 3H 2 O , ΔE = −340.5 kJ/mol

5CO + 5NH 3 Gua ( C 5 H 5 N 5 O ) + 4H 2 O + H 2 , ΔE = −400 kJ/mol

Model3 proto-PNA code

In order to produce enzymes for these reactions in addition to the existing ones, more coding amino acids and a larger code with 4 nucleobases is needed. A minimalistic version is model3: the extended model1 proto-PNA ternary code with 4 letters (G, C, A, U), 2-bit-codons, 12 coding amino acids and 4 nucleobases, and 1 codon per gene (1-1-code).

We start with the PNA 2-bit code (Table 7)

We take model 2 with (Gly, Pro, Cys, His, Leu) as a basic configuration, and the code of model1 includes the enzymes.

The coding is the same as in the RNA-code except, when modified because of a nucleobase instead of an amino acid: Gua (GC) replaces Ala, Cyt (CG) replaces

Table 7. PNA genetic 2-bit code.

Arg. Furthermore, we replace Tyr/Stop (UA) by Uracil Ura, and we replace Asn/Lys (AA) by Adenine Ade (the two U-A-codons with A at the end).

For the enzymes, we accept the extended correspondence from model1

G↔Gly, C↔Pro, A↔His, U↔Leu

In case of double-coding like Asp/Glu (GA) or Asn/Lys (AG), we take the simpler (=older) amino acid.

The nucleobase-pairing rule in PNA is G↔G, C↔C, A↔U.

Now we have the following code table Table 8.

Now, we have 4 basic-enzyme-coding amino acids (Gly, Pro, His, Leu), plus 8 enzyme-coding amino acids (Cys, Phe, Ile, Val, Ser, Thr, Asp, Asn), plus 4 nucleobases (Gua, Cyt, Ura, Ade).

This code is a plausible configuration, not a calculated solution, as in model1. Still, it can be regarded as a “basic” code within the quasi-species of the 2-bit ternary 1-1 PNA-proto-code.

The compounds in the reaction network are:

- 12 amino acids (Gly, Pro, His, Leu, Cys, Phe, Ile, Val, Ser, Thr, Asp, Asn)

- 4 nucleobases (Gua, Cyt, Ura, Ade)

- 16 peptides (enzymes) (Gly2, LeuGly, Pro2, ProGly2, ..., GlyHis, HisGly)

- 1 lipid Gly1ph

- 16 poly-nucleobases PNB’s (genes) (GG, UG, CC, CG, ..., AA, GA, AG)

- 16 poly-nucleobase-aminoacids PNA’s

Table 8. Model3 genetic code.

GlyGua2, LeuAdeGlyGua, ProCyt2, ProCytGlyGua2,

GlyGuaProCyt, ProCytHisUra, ProCytLeuAde

LeuAde2, HisUraLeuAde, GlyGuaLeuAde, LeuAdeProCyt, HisUraProCyt,

LeuAdeHisUra, HisUra2, GlyGuaHisUra, HisUraGlyGua

The precursors are CO, NH3, H2S, H3PO4

The energy cycle: pyrite reaction F e S + H 2 S F e S 2 + 2 H + + 2 e

We have the following model3 scenario Figure 14.

2.11. LUCA with RNA Genetic Code

The model3: the extended model1 proto-PNA ternary code with 4 letters (G, C, A, U), 2-bit-codons, 8 coding amino acids and 4 nucleobases, and 1 codon per gene (1-1-code).

A plausible transition from model3 proto-PNA code to the proto-RNA code of LUCA proceeds by full use of the 3-bit-code, by increasing the number of amino acids to 20, and by extending the 1-codon genes to multi-codon genes with the introduction of start- and stop-codons.

Furthermore, the PNB-PNA synthesis is replaced by tRNA-synthesis with one tRNA for every amino acid, with the addition of 20 codons coding for the 20 tRNA’s.

Figure 14. Schematic model3 (advanced PNA) reactions.

The PNB’s are replaced by a single RNA-strand with the supporting skeleton of ribose and phosphate radicals, which makes the RNA much more stable than its fore-runner PNB’s. The RNA is exactly copied by the enzyme RNA-polymerase (which is coded in a dedicated gene). Now, the RNA carries fixed genes, and is not a collection of quasi-genes like the PNB’s.

LUCA was living in hydrothermal vents and/or hot volcanic pools.

LUCA’s environmental conditions were: T ~80˚C, pH = 9, intermediate pressure.

LUCA used as energy cycle the acetogenesis (Wood-Ljungdahl)

H 2 + C O 2 f o r m a t e C H 3 p r o t e i n C O a c e t y l C o A a c e t a t e + A T P

The precursors were: H2, CO2, NH3, H2S, PO 4 3 ion

Now, we have the following reaction scenario Figure 15.

3. New Picture of the Terrestrial Life Evolution

We present now a completely modified model of terrestrial life evolution based on current knowledge ( [2] Chap. 3) and on numerical simulation results from Chap. 2 (Figure 16).

Figure 15. Schematic LUCA reactions.

Figure 16. Terrestrial life evolution.

3.1. Prebiotic Enrichment Cycle

Scenario2 Chap. 2.8 is the full proto-life cycle from the reaction table in Chap. 2.7, in the physical environment of a hydrothermal vent. It serves as an amplifier process for all involved molecule classes amino acids, nucleobases, peptides, poly-nuclein-acids PNA, lipids), which are distributed in surroundings by thermal convection, keeping the concentration of bio-molecules in the thermal boundary layer stable and relatively low.

Scenario2 starts with enriched concentrations (init2) of c = 10−4 = 100 ppm for basic compounds, c = 10−6 = 1 ppm for peptides, and c = 0.01 for precursors.

The reaction network consists of spontaneous and peptide-catalyzed basic-compound-synthesis, with the H2S (pyrite) energy cycle and the CO energy cycle, then the PNA-controlled peptide synthesis, and direct PNA-polymerization.

The PNA-controlled peptide synthesis works also without the initial presence of peptides (i.e. initial peptide concentration c = 0), whereas the direct peptide polymerization requires high rates of CO energy cycle to counteract the reverse reaction (direct peptide decay).

The results of the enrichment reactions are given in the following Table 9.

Table 9. (a) Pre-biotic enrichment cycle; (b) Pre-biotic enrichment cycle amplification.

This can be formulated concisely in the following scheme:

time constant: about 500 s

Yaman & Harvey [28] calculate minimal reaction time for amino acid synthesis catalyzed by dinucleotide t r ( E a = 20 kcal / mol ) = exp ( E a / k T ) 1 ns 200 s .

3.2. PNA Proto-Cell with Proto-Code for 3 Amino Acids and Cyan Precursors

The proto-code consists of binary 3-bit-codons (guanine Gua or cytosine Cyt) coding for 2 amino acids (glycine Gly or proline Pro). The enzymes are Gly-Pro-sequences, which catalyze the synthesis of the 6 needed compounds (3 amino acids Gly, Pro, Cys, 2 nucleobases Gua, Cyt, 1 phospholipid glycerol-1-phosphate) from the precursors (hydrogen cyanide HCN, H2O, H, hydrogen sulfide H2S, phosphoric acid H3PO4).

The simplified model1 proto-code with 5 proto-genes for 6 compounds (enthalpy values from [22] ) and 2 PNA’s for 3 amino acids, 1 lipid and 2 nucleobases is now as follows (Table 10, Figure 17).

The scenario (=scenario3 Chap. 2.8) of the PNA proto-cell is the full proto-life cycle, confined in the interior of a lipid vesicle. As was reported in ( [2] Chap. 3.4), amino acids within a lipid membrane stabilize the lipid layer, and the lipids

Table 10. Genetic proto-code.

Figure 17. PNA proto-cell model.

serve under certain conditions as catalyzer for peptide polymerization. Precursor molecules enter the vesicle via osmosis, but bio-molecules are enclosed in it, and are not carried away by convection, there is diffusion only within the vesicle. Therefore the concentration of bio-molecules can increase until it reaches a critical level, and the system becomes unstable, the vesicle divides and proliferation takes place. The reaction cycle now has all attributes of life (Figure 18(a)):

- synthesis of basic compounds from precursors (“food”) using an energy cycle

- self-regulation by catalysis through peptides-enzymes

- PNA-gene-controlled enzyme production from amino acids

- proliferation through bio-matter production and vesicle division.

The compounds in the reaction network are:

- 3 amino acids Gly, Pro. Cys

- 2 nucleobases Gua, Cyt

- 5 peptides (enzymes) Gly2, Gly3, Pro, Pro2, GlyPro

- 1 lipid Gly1ph

- 5 PNB’s (genes) Gua2, Gua3, Cyt2, Cyt, GuaCyt

- 5 PNA’s GlyGua2, GlyGua3, ProCyt2, ProCyt, GlyGuaProCyt2

The precursors are HCN, H2S, H3PO4

The energy cycle: pyrite reaction F e S + H 2 S F e S 2 + 2 H + + 2 e

This scenario3 reaches significantly higher concentrations than scenario2, because the bio-molecules cannot leave the vesicle. Only precursor molecules can pass the membrane, for them the boundary condition c = c0 is valid at the membrane.

The PNB’s become now the proto-genes, which are stable in the cycle.

The basic components synthesis runs under catalytic peptide control, the pyrite energy cycle is catalyzed by Cys, PNA building from amino acids and PNB’s takes energy from the energy cycle, the PNA’s split into peptides and PNB’s releasing energy, and the cycle runs anew.

The pyrite energy reaction delivers energy and H2 at the membrane, catalyzed by Cys, the insoluble FeS2 falls out in colloidal form.

The current of precursors through the membrane is governed by osmosis:

c ( t , x 1 ) t = f v m ( c ( t , x 1 ) c 0 )

and within the vesicle there is diffusion for all molecules

c ( x , t ) t = f v d 2 c ( x , t ) x 2

If fvd = 0, i.e. without diffusion, the system reaches an equilibrium, and there is a solution for all times.

The diffusion introduces a definite critical time tc, where the system becomes unstable, i.e. where the solution of the differential equations ceases to exist: the vesicle divides, separates into two, and the reaction cycle starts again. This critical time depends on the system parameters and the initial concentration, and has values around 1000s, which agrees well with the observed division periods of self-reproducing lipid-amino-acid vesicles and also of bacteria.

Figure 18. (a) Schematic PNA proto-cell reactions; (b) Schematic advanced PNA-cell reactions.

Yaman & Harvey [28] presented a simplified proto-genetic code with 9 two-letter codons from (A, G, C) for 5 amino acids and 3 precursors, which runs purely abiotically by catalysis with the corresponding dinucleotide (Table 11).

With only two letters (G, C) and 4 codons, it reduces to the table that matches the model1 table for the entries CC and GG, whereas the entries CG and GC catalyze indirectly arginine resp. alanine, which does not participate directly in the proto-lifecycle of model1.

3.3. Advanced PNA Cell with Non-Cyan Precursors

After ~100 My, the life-cycle had to adapt to other, more stable precursors than HCN.

The most plausible and least energy-consuming alternative precursors (CO, NH3, H2S, H2).

We have the following model3 code in Table 12.

We have here a full 2-bit code for aminoacids + nucleobases, specifically, we have 4 basic-enzyme-coding amino acids (Gly, Pro, His, Leu), plus 8 enzyme-coding amino acids (Cys, Phe, Ile, Val, Ser, Thr, Asp, Asn), plus 4 nucleobases (Gua, Cyt, Ura, Ade) (Figure 18(b)).

This code is a plausible configuration, not a calculated solution, as in model1. Still, it can be regarded as a “basic” code within the quasi-species of the 2-bit ternary 1-1 RNA-proto-code.

3.4. LUCA with Proto-RNA Genetic Code

The model3 is the extended model1 proto-PNA ternary code with 4 letters (G, C, A, U), 2-bit-codons, 8 coding amino acids and 4 nucleobases, and 1 codon per gene (1-1-code).

A plausible transition from model3 proto-PNA code to the proto-RNA code of LUCA proceeds by full use of the 2-bit-code, by increasing the number of amino acids to 20, and by extending the 1-codon genes to multi-codon genes with the introduction of start- and stop-codons.

Table 11. (a) Yaman & Harvey genetic proto-code; (b) Yaman & Harvey simplified genetic proto-code.

Table 12. Advanced PNA cell genetic code.

Furthermore, the PNB-PNA synthesis is replaced by tRNA-synthesis with one tRNA for every amino acid, with the addition of 20 genes coding for the 20 tRNA’s.

The PNB’s are replaced by a single RNA-strand with the supporting skeleton of ribose and phosphate radicals, which makes the RNA much more stable than its fore-runner PNB’s. The RNA is exactly copied by the enzyme RNA-polymerase (which is coded in a dedicated gene). Now, the RNA carries fixed genes, and is not a collection of quasi-genes like the PNB’s.

The full RNA code is shown in 1.1.

The precursors are CO2, NH3, H2, H2S, H3PO4

The energy cycle is the aceto-genesis from H2 and CO2

H 2 + C O 2 f o r m a t e C H 3 p r o t e i n C O a c e t y l C o A a c e t a t e + A T P

The biosynthetic reaction network consists of ≈150 basic reactions, from which 95% were exergonic [29] . The four key reactions are acetogenesis (Wood-Ljungdal), gluconeogenesis, reverse citric acid cycle, pentose phosphate pathway. From the 26 key intermediate cofactors the most important 7 are: ATP, GTP, THF, NAD, NADP, CoA, SAM.

Now, we have the following reaction scenario (Figure 19).

3.5. DNA cell with Full DNA Genetic Code

The DNA-cell (bacteria, archaea, eukaryotes) uses the full DNA genetic code (Table 13) with 20 amino acids, and corresponding 20 aaRNA’s for peptide synthesis,

Figure 19. LUCA reactions.

Table 13. DNA-cell genetic code.

4 nucleotides: cytosine, thymine, adenine, guanine (thymine replaces uracil in RNA-coding). The pairs A-G and C-T are complementary in the DNA-double-helix.

The cell membrane consists of ester and ether lipids (bacteria, eukaryotes) or diether lipids (archaea).

The (self-sustainable) energy cycle is widely diverse:

- reverse acetogenesis with sulfate reduction (sulfur bacteria)

- Wood-Ljungdahl aceto-genesis (archaea and bacteria in hydrothermal vents and volcanic pools)

- methanogenesis from carbon dioxide and hydrogen (archaea)

- photosynthesis (cyanobacteria)

4. Conclusions

Chap. 2 presents the mathematical formulation and numeric simulation of the genetic proto-code model based on an empirical reaction model developed for life chemistry.

The result of the simulation is a complete realistic scenario of the origin of terrestrial life based on HCN and H2S as precursors, and the H2S energy cycle. It consists of two phases:

Phase1 is the enrichment of amino acids, nucleobases and peptides in a hydrothermal pool with amplification factors of 25 to 8000 and a time constant of 500 s, the process is guided by thermal convection.

Phase2 is the build-up of a genuine self-catalytic life-proto-cycle in a self-assembling lipid membrane based on a genetic proto-code with 3 amino acids (Gly, Pro, Cys) and 2 nucleobases (Gua, Cyt) with 5 proto-genes (GG, GGG, CC, C, GC) coding for 5 peptides (Gly2, Gly3, Pro2, Pro, GlyPro), which catalyze the synthesis of 5 basic components (Gly, Cys, Pro, Cyt, Gua). The peptide synthesis is carried out by peptide-nucleid-acids (PNA’s) generated in the proto-cycle, which correspond to the modern RNA-transferases. The process is guided by diffusion, and the calculation shows, that there is a fundamental collapse-division time of the proto-cell of roughly 1000 s, which is about the same as the cell-division cycle time in modern bacteria.

Chap. 3 presents a detailed modified picture of terrestrial life evolution based on the models derived in the preceding chapter.

In this picture, there are 4 basic stages of genetic complexity.

- Stage1 enrichment cycle

This is the enrichment cycle without membrane, for 3 amino acids, 2 nucleobases, peptides, poly-nucleobases (PNB)

- Stage2 PNA proto-cell

This is the proto-lifecycle in the proto-cell within lipid vesicle with cyan precursors.

The compounds in the reaction network are:

- 3 amino acids Gly, Pro. Cys

- 2 nucleobases Gua, Cyt

- 5 peptides (enzymes) Gly2, Gly3, Pro, Pro2, GlyPro

- 1 lipid Gly1ph

- 5 poly-nucleobases PNB’s (genes) Gua2, Gua3, Cyt2, Cyt, GuaCyt

- 5 poly-nucleobase-aminacids PNA’s GlyGua2, GlyGua3, ProCyt2, ProCyt, GlyGuaProCyt2

The precursors are HCN, H2S, H3PO4

The energy cycle: pyrite reaction F e S + H 2 S F e S 2 + 2 H + + 2 e

The genetic code table (model2) is as follows.

- Stage3 advanced PNA-cell

After ~100 My there is a more advanced cell with more stable precursors.

We have here a full 2-bit code for aminoacids +nucleobases, specifically, we have 4 basic-enzyme-coding amino acids (Gly, Pro, His, Leu), plus 8 enzyme-coding amino acids (Cys, Phe, Ile, Val, Ser, Thr, Asp, Asn), plus 4 nucleobases (Gua, Cyt, Ura, Ade).

The genetic code table (model3) is as follows.

The compounds in the reaction network are:

- 12 amino acids (Gly, Pro, His, Leu, Cys, Phe, Ile, Val, Ser, Thr, Asp, Asn)

- 4 nucleobases (Gua, Cyt, Ura, Ade)

- 16 peptides (enzymes) (Gly2, LeuGly, Pro2, ProGly2, ..., GlyHis, HisGly)

- 1 lipid Gly1ph

- 16 poly-nucleobases PNB’s (genes) (GG, UG, CC, CG, ..., AA, GA, AG)

- 16 poly-nucleobase-aminoacids PNA’s

GlyGua2, LeuAdeGlyGua, ProCyt2, ProCytGlyGua2,

GlyGuaProCyt, ProCytHisUra, ProCytLeuAde,

LeuAde2, HisUraLeuAde, GlyGuaLeuAde, LeuAdeProCyt, HisUraProCyt,

LeuAdeHisUra, HisUra2, GlyGuaHisUra, HisUraGlyGua

The precursors are CO, NH3, H2S, H3PO4

The energy cycle: pyrite reaction F e S + H 2 S F e S 2 + 2 H + + 2 e

- Stage4: LUCA

LUCA lived about 4.3 Gy ago and had

• RNA genome ~150 genes

• ribosome

• virus-like protein capsid + lipid envelope

• RNA 4 nucleotides

• proteins with 20 aminoacids

• RNA replication via DNA intermediates

• DNA polymerase

• Class I: 10 aaRS (D, Cys, Gly)

• Class II: 10 aaRS (A, B, Pro)

• 15 families of small subunit proteins, 18 families of large subunit proteins

• synthesis of RNA using DNA templates

• acetogenesis energy cycle from H2

• temperature ~80˚C, pH = 9, pressure intermediate ~1 - 10 bar

• LUCA used as energy cycle the acetogenesis (Wood-Ljungdahl)

H 2 + C O 2 f o r m a t e C H 3 p r o t e i n C O a c e t y l C o A a c e t a t e + A T P

• The precursors were: H2, CO2, NH3, H2S, PO 4 3 ion

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1] Ruse, M., et al. (2009) Evolution: The First Four Billion Years. Harvard University Press, Cambridge.
[2] Helm, J. (2021) Origin and Basic Mechanism of Life.
https://www.researchgate.net/
[3] Lehman, N. (2015) The RNA World: 4,000,000,050 Years Old. Life, 5, 1583-1586.
https://doi.org/10.3390/life5041583
[4] Allamandola, L. et al. (2016) Cosmic Distribution of Chemical Complexity. NASA
[5] Lindahl, T. and Karlstrom, O. (1973) Heat-Induced Depyrimidination of Deoxyribonucleic Acid in Neutral Solution. Biochemistry, 12, 5151-5154.
https://doi.org/10.1021/bi00749a020
[6] Locke, W. (1997) Introduction to Molecular Orbital Theory. ICSTM Department of Chemistry.
https://www.ch.ic.ac.uk/vchemlib/course/mo_theory/main.html
[7] Weiss, I.M., Muth, C., Drumm, R. and Kirchner, H.O.K. (2018) Thermal Decomposition of the Amino Acids Glycine, Cysteine, Aspartic Acid, Asparagine, Glutamic Acid, Glutamine, Arginine and Histidine. BMC Biophysics, 11, Article No. 2.
https://doi.org/10.1186/s13628-018-0042-4
[8] Patel, B. et al. (2015) Common Origins of RNA, Protein and Lipid Precursors in a Cyanosulfidic Protometabolism. Nature Chemistry, 7, 301-307.
https://doi.org/10.1038/nchem.2202
[9] Das, T. et al. (2019) Insights into the Origin of Life: Did It Begin from HCN and H2O? ACS Central Science, 5, 1532-1540.
https://doi.org/10.1021/acscentsci.9b00520
[10] Carter Jr., C.W. and Wills, P. (2017) Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding. bioRxiv.
https://doi.org/10.1101/139139
[11] Martinez-Rodriguez, L., Erdogan, O., Jimenez-Rodriguez, M., et al. (2015) Functional Class I and II Amino Acid-Activating Enzymes Can Be Coded by Opposite Strands of the Same Gene. Journal of Biological Chemistry, 190, 19710-19725.
https://doi.org/10.1074/jbc.M115.642876
[12] Helm, J. (2021) Code EnzymeCalc.nb.
https://www.researchgate.net/publication/358271046_Reaction_constants_in_organic-chemical_reactions
[13] NIST Chemistry WebBook.
https://webbook.nist.gov/chemistry/
[14] Chemeo Molecular Database.
https://www.chemeo.com/
[15] NIST Chemical Kinetics Data Base.
https://kinetics.nist.gov/kinetics/
[16] Helm, J. (2021) Chemical Data Base.
https://www.researchgate.net/publication/343182078_ChemicalData_JH0420doc
[17] Chasnov, J. (2009) Mathematical Biology. Lecture in Hong Kong University, Hong Kong.
[18] de With, G. (2013) Liquid State Physical Chemistry: Fundamentals, Modeling, and Applications. Wiley-VCH, Weinheim.
https://doi.org/10.1002/9783527676750
[19] Gatenby, R. and Frieden, B. (2016) Investigating Information Dynamics in Living Systems through the Structure and Function of Enzymes. PLOS ONE, 5, e0154867.
https://doi.org/10.1371/journal.pone.0154867
[20] Biro, J. (2006) Amino Acid Size, Charge, Hydropathy Indices and Matrices for Protein Structure Analysis. Theoretical Biology and Medical Modeling, 3, Article No. 15.
https://doi.org/10.1186/1742-4682-3-15
[21] Wächtershäuser, G. (2000) Life as We Don’t Know It. Science, 289, 1307-1308.
https://doi.org/10.1126/science.289.5483.1307
[22] NIST Chemistry Webbook.
https://www.chemeo.com/
[23] Zahnle, K. (2010) Earth’s Earliest Atmospheres. Cold Spring Harbor Laboratory Press, Long Island.
https://doi.org/10.1101/cshperspect.a004895
[24] Helm, J. (2021) Biochemistry Basics: A Compendium of Models and Data.
https://www.researchgate.net/publication/343182129_BiochemistryBasics_JH0420doc
[25] Helm, J. (2021) Chemical Thermodynamics: A Compendium of Models and Data.
https://www.researchgate.net/publication/343181993_ThermodynChemical_JH0420doc
[26] Lodish, H., Berk, H., et al. (2000) Molecular Cell Biology. W. H. Freeman, New York.
[27] Landau, L.D. and Lifshitz, E.M. (1987) Fluid Mechanics. 2nd Edition, Pergamon Press, Oxford.
[28] Yaman, T. and Harvey, J.N. (2021) Computational Analysis of a Prebiotic Amino Acid Synthesis with Reference to Extant Codon-Amino Acid Relationships. Life, 11, Article 1343.
https://doi.org/10.3390/life11121343
[29] Wimmer, J.L.E., Xavier, J.C., Vieira, A.D.N., et al. (2021) Energy at Origins: Favorable Thermodynamics of Biosynthetic Reactions in the Last Universal Common Ancestor (LUCA). Frontiers in Microbiology, 12, Article 793664.
https://doi.org/10.3389/fmicb.2021.793664

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.