 B. thetaiotaomicron DSM2079_tryptic
B. thetaiotaomicron DSM2079_trypticThe extended simplified human intestinal microbiota (SIHUMIx) consists of eight bacterial members (Anaerostipes caccae (DSMZ 14662); Bacteroides thetaiotaomicron (DSMZ 2079); Bifidobacterium longum (NCC 2705); Blautia producta (DSMZ 2950); Clostridium butyricum (DSMZ 10702); Clostridium ramosum (DSMZ 1402); Escherichia coli K-12 (MG1655); Lactobacillus plantarum (DSMZ 20174)) of the human intestine and thus represents a model community to analyze such microbial interactions [1].
A tryptic iPtgxDB of Bacteroides thetaiotaomicron (DSM 2079) was created by hierarchically integrating protein coding sequences from the following annotation resources:
| Hierarchy | Resource | Link | 
|---|---|---|
| 1 | NCBI RefSeq | CP040529.1 / CP040530.1 from 20-MAY-2019 (NCBI Prokaryotic Genome Annotation Pipeline (PGAP v.4.7) | 
| 2 | Prodigal [2] | ab initio gene predictions from Prodigal (v2.6) | 
| 3 | ChemGenome [3] | ab initio gene predictions from ChemGenome (v2.0, http://www.scfbio-iitd.res.in/chemgenome/chemgenomenew.jsp; with parameters: method, Swissprot space; length threshold, 70 nt; initiation codons, ATG, CTG, TTG, GTG) | 
| 4 | in silico ORFs | in silico ORF annotations were generated as described by Omasits and Varadarajan et al., 2017 (v2.0, Only ORFs above a selectable length threshold (here 18 aa) were considered.) | 
The iPtgxDB was created using the hierarchy RefSeq > Prodigal > ChemGenome > in silico. Files were parsed to extract the identifier, coordinates and sequences of bona fide protein-coding sequences (CDS) and pseudogene entries.
| iPtgxDB Release Info | |
|---|---|
|   Version | 1 | 
|   Date | 10.08.2020 |