The extended simplified human intestinal microbiota (SIHUMIx) consists of eight bacterial members (Anaerostipes caccae (DSMZ 14662); Bacteroides thetaiotaomicron (DSMZ 2079); Bifidobacterium longum (NCC 2705); Blautia producta (DSMZ 2950); Clostridium butyricum (DSMZ 10702); Clostridium ramosum (DSMZ 1402); Escherichia coli K-12 (MG1655); Lactobacillus plantarum (DSMZ 20174)) of the human intestine and thus represents a model community to analyze such microbial interactions [1].
A tryptic iPtgxDB of Anaerostipes caccae (DSM 14662) was created by hierarchically integrating protein coding sequences from the following annotation resources:
Hierarchy | Resource | Link |
---|---|---|
1 | NCBI RefSeq | CP036345.1; from 22-FEB-2019 (NCBI Prokaryotic Genome Annotation Pipeline (PGAP v.4.7) |
2 | Prodigal [2] | ab initio gene predictions from Prodigal (v2.6) |
3 | ChemGenome [3] | ab initio gene predictions from ChemGenome (v2.0, http://www.scfbio-iitd.res.in/chemgenome/chemgenomenew.jsp; with parameters: method, Swissprot space; length threshold, 70 nt; initiation codons, ATG, CTG, TTG, GTG) |
4 | in silico ORFs | in silico ORF annotations were generated as described by Omasits and Varadarajan et al., 2017 (v2.0, Only ORFs above a selectable length threshold (here 18 aa) were considered.) |
The iPtgxDB was created using the hierarchy RefSeq > Prodigal > ChemGenome > in silico. Files were parsed to extract the identifier, coordinates and sequences of bona fide protein-coding sequences (CDS) and pseudogene entries.
iPtgxDB Release Info | |
---|---|
Version
|
1 |
Date
|
10.08.2020 |