Bradyrhizobium diazoefficiens USDA 110 (GenBank accession NC_004463) is a widely used model organism for studying rhizobial symbiosis [1].

An iPtgxDB was created by hierarchically integrating protein coding sequences from the following annotation resources:

Hierarchy | Resource | Link
1 | NCBI RefSeq | NC_004463.1; from 22/07/2013
2 | Ensembl | Ensembl's Genomes project (GCA_000011365.1, Feb/2011)
3 | Genoscope [2] | NC_004463, accessed 09/09/2013
4 | CMR [3] | J. Craig Venter Institute's Comprehensive Microbial Resource (CMR)
5 | Prodigal [4] | Ab initio gene predictions from Prodigal (v2.5)
6 | ChemGenome [5] | Ab initio gene predictions from ChemGenome (v2.0; with parameters: method, Swissprot space; length threshold, 70 nt; initiation codons, ATG, CTG, TTG, GTG)
7 | in silico ORFs | The in silico ORF annotations were generated as described by Omasits and Varadarajan et al., 2017

Only ORFs above a selectable length threshold (here 18 aa) were considered. The iPtgxDB was created using the hierarchy RefSeq > Ensembl > Genoscope > CMR > Prodigal > ChemGenome > in silico. Files were parsed to extract the identifiers, coordinates and sequences of bona fide protein-coding sequences (CDS) and pseudogene entries.
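
The hierarchical integration can be illustrated with a minimal Python sketch. It is not the actual iPtgxDB pipeline: the `Annotation` record, the `integrate` function, and the rule of grouping annotations that share a strand and stop position are assumptions made for illustration, and whether the 18 aa bound is inclusive is also assumed. The resource order follows the hierarchy table above.

```python
from collections import defaultdict
from typing import NamedTuple

# Resource order from the hierarchy table, highest priority first.
HIERARCHY = ["RefSeq", "Ensembl", "Genoscope", "CMR",
             "Prodigal", "ChemGenome", "in silico"]
RANK = {name: i for i, name in enumerate(HIERARCHY)}

# Length threshold from the text; treating it as inclusive is an assumption.
MIN_LENGTH_AA = 18


class Annotation(NamedTuple):
    """Hypothetical record for one parsed CDS or pseudogene entry."""
    resource: str    # one of HIERARCHY
    identifier: str
    strand: str      # "+" or "-"
    start: int       # leftmost genomic coordinate
    end: int         # rightmost genomic coordinate
    protein: str     # translated sequence


def stop_position(a: Annotation) -> int:
    # The stop codon lies at the right end on "+" and at the left end on "-".
    return a.end if a.strand == "+" else a.start


def integrate(annotations):
    """Group annotations by (strand, stop position) and pick, per group,
    the entry from the highest-ranked resource as the anchor."""
    clusters = defaultdict(list)
    for a in annotations:
        if len(a.protein) >= MIN_LENGTH_AA:  # drop ORFs below the threshold
            clusters[(a.strand, stop_position(a))].append(a)

    integrated = {}
    for key, members in clusters.items():
        ranked = sorted(members, key=lambda a: RANK[a.resource])
        integrated[key] = (ranked[0], ranked[1:])  # (anchor, alternatives)
    return integrated


# Made-up example entries; identifiers and coordinates are not real data.
example = [
    Annotation("Prodigal", "prod_1", "+", 100, 400, "M" * 100),
    Annotation("RefSeq", "ref_1", "+", 160, 400, "M" * 80),
    Annotation("in silico", "orf_1", "-", 500, 530, "M" * 10),
]
for (strand, stop), (anchor, alternatives) in integrate(example).items():
    print(strand, stop, anchor.identifier, [a.identifier for a in alternatives])
```

In this sketch the RefSeq entry becomes the anchor for the shared stop position, the Prodigal entry with a different start is kept as an alternative, and the 10 aa in silico ORF is discarded by the length filter.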


