HGM2001 Poster Abstracts: 9. Genome Informatics


    

POSTER NO: 306

'The Great Genetics Worlds Unification': Genomics and Proteomics Equivalence law in whole Genomes and Chromosomes

J.C. Perez
GeNum lab (NUMerical GENetics), 7 Av de terre-rouge F33127 Martignas France

1 Introduction.

We are researching hypothetical Numerical background Foundations of Genetics from the middle of eighties (1 2 3). Recently, we have discovered an integer numbers numerical code unifying all kinds of biological worlds: bio-atoms, nucleotides, amino acids, DNA and RNA strands, proteins, whole chromosomes and genomes. This novel discovery unifies and globalize the well known universal genetic code generalizing and explaining this empirical structure from the atomic weights C O N H S P bio-atoms level to the whole genomes long-range global level. Particularly, a subset of this discovery, named GUGC: the « Global Unified Genetic Code » proposes an “information equivalence law” between Genomics data and Proteomics data. In fact, this law unifies mathematically the three languages of genetics: DNA, RNA and proteins.

2 “GUGC”: the Global Unified Genetic Code.

Then, we code any DNA sequence like its potential translations in amino acids according to the above associated numerical signatures. Now, we obtain two separate codons and amino acids classification patterned signatures. One discovers then a global Genomics/Proteomics unification in the form of a coupling with very strong linear correlations (linear correlations ratios are >97% in both cases of the Prion PrP protein and of the whole 9000bp length HIV1 complete genome). We thus show the Unification, equivalence, and the global conservation between the two kinds of Genetics information: Genomics and Proteomics.

3 “genChrom” theory and technology: Unifying Genomics and Proteomics in whole chromosomes and Genomes.

The MALARIA complete Genome sequencing is in progress (Plasmodium Falciparum). This whole genome includes 14 chromosomes. Presently, whole chromosome 2 (TIGR) and chromosome 3 (SANGER centre) are available (one Mbp each). We have completely analysed both chromosomes. Then, these global novel analyses reveals: - strong Genomics/Proteomics long-range correlation coupling providing an equivalence of 98% between Genomics and Proteomics patterned signatures. - Consensus tuning linking exactly the same optimal codon site position from Genomic data (exp. codon 325238) and the same amino acid site position from Proteomic data (exp. AA 325238). This strange tuning high level reveals an underground shaded theoretical structure providing a long-range level EQUIVALENCE between Genomic information code and Proteomic information code. - In other hand, replacing the same chromosomal sequence by randomly TCAG nucleotides reorganization destroys totally this global unification between Genomics and Proteomics patterned signatures. - Then, we introduce the formal proof of the existence of a global information INVARIANCE and equivalence between Genomics and Proteomics information.

4 Perspectives.

Presently, we are running simulations generalizing this basic research discovery to all kinds of whole chromosomes and genomes already sequenced and available. We are particularly generalizing theory and proofs to: - Eucharyota whole chromosomes: particularly Drosophila melanogaster chromosomes or Leishmania chromosome: 268984bp. - Human genome: analyzing whole chromosomes 21 and 22. - Arabidopsis thaliana flowering plant whole chromosomes. - Bacteria and Archaea typical whole genomes: particularly Thermoplasma volcanium: 1584799bp. This Archaeon has optimal growth temperature of 60 degrees C with strong genome specific motifs at local level (separate dinucleotides motifs AA and TT). Then what about the global level ? - We are also generalizing this exploration large screening proof extending it to ALL VIRUSES GENOMES available to-day: 615 viruses with lengths varying between some kilo-bases to 236000 bases- pairs.

References summary.

[1] J.C Perez, general theory and applications overview in http://www.genum.com

[2] J.C Perez, book “L’ADN décrypté” – Ed Marco Pietteur, 39 Av. Du centenaire – 4053 Embourg Belgium ISBN: 2-87211-017-8. (1997).

[3] J.C Perez, the global language of DNA, HGM93 HUGO Kobe Japan 1993.

    


Other abstracts in same session



Generated by SubmitEd V1.50 - Copyright © 1999-2001 Alastair Brown