As a part of bacterial genome analysis suite of programs, and to enforce operon and gene prediction by FGENESB program, we introduce BPROM, bacterial promoter prediction program
Algorithm predicts potential transcription start positions of bacterial genes regulated by sigma70 promoters (major E.coli promoter class). Linear discriminant function (LDF) combines characteristics describing functional motifs and oligonucleotide composition of these sites. BPROM has accuracy of E.coli promoter recognition about 80%. Its specificity is also about 80% when tested on sets containing promoter and non-promoter sequences in equal numbers. It is not advisable to run BPROM on whole genomes: To increase specificity, run BPROM on a region between two neighboring ORFs located on the same strand, or on a sequence upstream from an ORF, keeping in mind that most promoters are located within 150 bp region from protein coding sequence.
First line - name of your sequence;
Second and Third lines - LDF threshold and the length of presented sequence
4th line - The number of predicted promoters
Next lines - positions of predicted promoters, and their scores with 'weights' of two conserved promoter boxes. Promoter position assign to the first nucleotide of the transcript (Transcription Start Site position).
After that we present elements of Transcriptional factor binding sites for each predicted promoter (if they found).
bprom Sat Jan 18 21:11:25 EST 2003 >Region of E.coli genome between protein_id="AAC76687.1" and protein_id="AAC7668 Length of sequence- 420 Threshold for promoters - 0.20 Number of predicted promoters - 1 Promoter Pos: 145 LDF- 6.02 -10 box at pos. 130 ctttatgat Score 66 -35 box at pos. 109 tttaat Score 36 Oligonucleotides from known TF binding sites: For promoter at 145: fis: TCTTTAAT at position 107 Score - 6 rpoD17: TTATGATA at position 132 Score - 7 lexA: ATAAATAA at position 137 Score - 14 rpoD17: ATAATAAT at position 141 Score - 8