We looked at the distribution of strong and weak operon genes according to COG category and compared this to the overall distribution of COG categories in E. coli (Figure 8). Here r-protein genes were included. The strong operon genes are overrepresented in several of the COG categories compared to the weak operon genes; Translation, ribosomal structure and biogenesis (J), Transcription (K), Cell wall/membrane/envelope biogenesis (M), Energy production and conversion (C), Lipid transport and metabolism (I) and Secondary metabolites biosynthesis, transport and catabolism (Q). On the other hand, the weak operon genes are mainly overrepresented in Replication, recombination and repair (L), Posttranslational modification, protein turnover, chaperones (O) and Nucleotide transport and metabolism (F). This difference between strong and weak operon genes was confirmed with DAVID (excluding r-proteins), showing that whereas gene ontology terms like cell wall biogenesis and ATP metabolic process are overrepresented in strong operon genes, terms like DNA replication, response to stress and nucleotide binding are overrepresented in weak operon genes (p-values < 0.05 after Benjamini and Hochberg correction).
Good and weak operon family genes considering COG classes. The new chart is sold with ribosomal genetics (Interpretation, ribosomal construction and you may biogenesis (J)).
Variation inside evolutionary price
About phylogenetic investigation we checked-out the evolutionary length considering all genes defined as chronic. Yet not, there may naturally end up being inter-gene version on the evolutionary price. It was analysed that with pair-wise Blast piece results normalised up against alignment length; come across Suggestions for subsequent info.
Singleton as opposed to backup genetics
Prior to analyses are finding a distinction in the evolutionary rates from singletons and you may duplicates, but which visualize is firmly dependent on the fresh new forty five roentgen-proteins inside our investigation put. Analyses conducted having r-healthy protein within the singletons class show that there’s indeed a significant difference about your evolutionary speed. The average of your own mediocre portion scores (normalised over positioning size) are 0.81 into the singletons and you can 0.73 to the duplicates (data maybe not found), implying one to genetics for the clusters dominated of the down dating singletons is way more like one another and you may develop more sluggish than just duplicates. Yet not, it’s traditional to depart aside roentgen-protein when looking at evolutionary price because they’re extremely expressed and you will progress alot more slower than other proteins. With no r-proteins discover zero factor involving the singletons and you can duplicates (median off average bit scores 0.71 and you may 0.72 correspondingly). As expected brand new r-proteins evolve much slower that have a median out-of average portion many 0.97. I including tested whether there is certainly people change out of necessary protein length to possess singletons and copies. Whenever r-healthy protein was basically left out, that it study did not promote one factor.
Strong instead of weak operon genetics
We after that performed a comparable analyses because the revealed above, however, contrasting solid and you can poor operon healthy protein. This new ribosomal and the fused/blended protein was omitted of investigation. As a result, found from inside the Shape nine. The fresh median off average piece ratings having solid and you may poor operon necessary protein are 0.65 and you will 0.79 correspondingly, hence proving the strong operon genetics develop faster as compared to poor operon genes (p-well worth 3.527 ? 10 -5 ). Due to the fact mentioned previously the brand new roentgen-healthy protein has actually a median away from average piece scores of 0.97. There is also a distinction out of necessary protein duration for strong and you will weakened operon proteins. The protein away from weakened operon genes (Figure 10) enjoys an average period of proteins compared to the proteins for necessary protein of good operon family genes (p-well worth step one.361 ? 10 -5 ).
Average protein section get getting good and you can poor operon gene clusters. A box patch indicating different gene groups rated predicated on mediocre partners-smart piece get of your own necessary protein sequences (BitScore) normalised against positioning size (AliLen). New legend text reveals the average get of each group (poor operon 0.79 bits, good operon 0.65 bits). Ribosomal genetics commonly provided. When they are provided brand new amounts are 0.81 and you will 0.75, respectively.