This is the parental strain (Genbank #CP009273) of the widely used Escherichia coli Keio gene knockout collection [1].
An iPtgxDB was created by hierarchically integrating protein coding sequences from the following annotation resources:
| Hierarchy | Resource | Link |
|---|---|---|
| 1 | NCBI RefSeq | CP009273.1; from 30/10/2014 |
| 2 | IMG [2] | Integrated Microbial Genomes (IMG) initiative of the Joint Genome Institute (JGI); Ga0058822, from 12/08/2014 |
| 3 | Prodigal [3] | Ab initio gene predictions from Prodigal (v2.6) |
| 4 | ChemGenome [4] | Ab initio gene predictions from ChemGenome (v2.0, http://www.scfbio-iitd.res.in/chemgenome/chemgenomenew.jsp; with parameters: method, Swissprot space; length threshold, 70 nt; initiation codons, ATG, CTG, TTG, GTG) |
| 5 | in silico ORFs | The in silico ORF annotations were generated as described by Omasits and Varadarajan et al., 2017 |
Only ORFs above a selectable length threshold (here 18 aa) were considered. The iPtgxDB was created using the hierarchy RefSeq > JGI > Prodigal > ChemGenome > in silico. Files were parsed to extract the identifier, coordinates and sequences of bona fide protein-coding sequences (CDS) and pseudogene entries.
| iPtgxDB Release Info | |
|---|---|
|
Version
|
1 |
|
Date
|
26.09.2016 |