Prediction of potential genes in microbial genomes
Time: Tue May 17 21:47:11 2011
Seq name: gi|226332055|gb|ACIB01000001.1| Bacteroides sp. 3_2_5 cont1.1, whole genome shotgun sequence
Length of sequence - 277237 bp
Number of predicted genes - 238, with homology - 235
Number of transcription units - 110, operones - 56 average op.length - 3.3

N Tu/Op Conserved S Start End Score
        pairs(N/Pv)

- Term 318 - 364 12.4
1 1 Op 1 . - CDS 398 - 844 405 ## COG2731 Beta-galactosidase, beta subunit
2 1 Op 2 1/0.190 - CDS 859 - 2097 998 ## COG0477 Permeases of the major facilitator superfamily
3 1 Op 3 . - CDS 2113 - 3297 1032 ## COG2942 N-acyl-D-glucosamine 2-epimerase
4 1 Op 4 . - CDS 3321 - 4238 953 ## COG0329 Dihydrodipicolinate synthase/N-acetylneuraminate lyase
- Prom 4486 - 4545 7.8
+ Prom 4227 - 4286 10.3
5 2 Tu 1 . + CDS 4536 - 5744 341 ## PROTEIN SUPPORTED gi|163762640|ref|ZP_02169704.1| ribosomal protein L33
- Term 5508 - 5559 2.6
6 3 Op 1 . - CDS 5741 - 6205 240 ## BF1704 hypothetical protein
7 3 Op 2 . - CDS 6205 - 6735 302 ## BF1703 putative transcriptional regulator
8 3 Op 3 . - CDS 6711 - 7823 906 ## BF1702 putative protein involved in capsular polysaccharide biosynthesis
9 3 Op 4 . - CDS 7829 - 10177 1919 ## COG1596 Periplasmic protein involved in polysaccharide export
10 3 Op 5 . - CDS 9978 - 10379 297 ## BF1701 putative capsule polysaccharide export protein
- Prom 10410 - 10469 6.4
11 3 Op 6 . - CDS 10494 - 12398 1268 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases
- Prom 12614 - 12673 75.4
+ TRNA 12597 - 12669 79.3 # Thr CGT 0 0
+ Prom 12927 - 12986 2.8
12 4 Tu 1 . + CDS 13093 - 14655 666 ## COG3291 FOG: PKD repeat
+ Term 14711 - 14747 0.1
- Term 14847 - 14887 -0.2
13 5 Tu 1 . - CDS 14958 - 15104 175 ##
- Prom 15324 - 15383 5.0
+ Prom 15055 - 15114 6.5
14 6 Tu 1 . + CDS 15250 - 16128 479 ## COG1864 DNA/RNA endonuclease G, NUC1
+ Prom 16167 - 16226 4.4
15 7 Op 1 .
+ CDS 16286 - 17323 564 ## COG1559 Predicted periplasmic solute-binding protein
16 7 Op 2 11/0.000 + CDS 17407 - 18999 1352 ## COG4231 Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits
17 7 Op 3 1/0.190 + CDS 19003 - 19587 525 ## COG1014 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, gamma subunit
+ Term 19613 - 19662 9.7
+ Prom 19608 - 19667 6.3
18 8 Op 1 . + CDS 19692 - 20999 1226 ## COG1541 Coenzyme F390 synthetase
+ Prom 21004 - 21063 4.2
19 8 Op 2 . + CDS 21085 - 21654 404 ## COG0503 Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins
+ Term 21668 - 21715 1.1
- Term 21643 - 21689 3.2
20 9 Tu 1 . - CDS 21729 - 22520 339 ## COG1145 Ferredoxin
- Prom 22571 - 22630 10.5
- Term 22883 - 22920 7.1
21 10 Tu 1 . - CDS 22948 - 23298 594 ## PROTEIN SUPPORTED gi|53712979|ref|YP_098971.1| 50S ribosomal protein L20
22 11 Op 1 . - CDS 23399 - 23596 332 ## PROTEIN SUPPORTED gi|53712978|ref|YP_098970.1| 50S ribosomal protein L35
23 11 Op 2 16/0.000 - CDS 23662 - 24258 364 ## PROTEIN SUPPORTED gi|163801060|ref|ZP_02194960.1| 50S ribosomal protein L35
- Prom 24314 - 24373 5.9
24 11 Op 3 . - CDS 24392 - 26332 1997 ## COG0441 Threonyl-tRNA synthetase
25 11 Op 4 . - CDS 26413 - 28461 2075 ## COG0457 FOG: TPR repeat
+ Prom 28490 - 28549 11.4
26 12 Tu 1 . + CDS 28763 - 29980 850 ## BF1685 hypothetical protein
+ Term 30016 - 30082 22.5
- Term 30012 - 30064 17.6
27 13 Op 1 . - CDS 30099 - 30653 649 ## COG0242 N-formylmethionyl-tRNA deformylase
28 13 Op 2 . - CDS 30689 - 31105 426 ## COG0816 Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis)
- Prom 31141 - 31200 3.0
- Term 31146 - 31192 11.2
29 14 Op 1 . - CDS 31227 - 32345 1354 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins
30 14 Op 2 . - CDS 32407 - 33153 463 ## BF1681 hypothetical protein
31 14 Op 3 .
- CDS 33224 - 34300 1038 ## BF1680 hypothetical protein
32 14 Op 4 . - CDS 34311 - 35558 541 ## BF1679 hypothetical protein
- Prom 35579 - 35638 2.5
+ Prom 35337 - 35396 3.7
33 15 Tu 1 . + CDS 35591 - 35731 57 ##
- Term 36151 - 36190 1.1
34 16 Tu 1 . - CDS 36278 - 37420 704 ## COG0582 Integrase
- Prom 37568 - 37627 8.7
+ Prom 37847 - 37906 3.2
35 17 Op 1 . + CDS 37951 - 38745 426 ## BF1676 hypothetical protein
36 17 Op 2 . + CDS 38756 - 40975 2004 ## BF1683 hypothetical protein
+ Prom 40983 - 41042 3.6
37 18 Op 1 . + CDS 41087 - 41842 565 ## BF1682 hypothetical protein
38 18 Op 2 . + CDS 41844 - 42965 1135 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins
+ Term 43018 - 43052 5.2
+ Prom 43105 - 43164 10.1
39 19 Tu 1 . + CDS 43189 - 43587 278 ## BF1672 hypothetical protein
40 20 Op 1 . - CDS 43856 - 44818 954 ## BF1679 hypothetical protein
41 20 Op 2 . - CDS 44833 - 45939 943 ## BF1678 sulfotransferase
42 20 Op 3 18/0.000 - CDS 46010 - 47440 1473 ## COG2895 GTPases - Sulfate adenylate transferase subunit 1
43 20 Op 4 8/0.000 - CDS 47452 - 48363 996 ## COG0175 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes
44 20 Op 5 1/0.190 - CDS 48391 - 48999 635 ## COG0529 Adenylylsulfate kinase and related kinases
45 20 Op 6 . - CDS 49028 - 50584 1288 ## COG0471 Di- and tricarboxylate transporters
46 20 Op 7 . - CDS 50605 - 51423 761 ## COG1218 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase
- Prom 51523 - 51582 6.8
+ Prom 51965 - 52024 13.2
47 21 Tu 1 . + CDS 52058 - 54733 1938 ## COG3250 Beta-galactosidase/beta-glucuronidase
48 22 Tu 1 . - CDS 54986 - 55765 527 ## COG0500 SAM-dependent methyltransferases
- Prom 55964 - 56023 6.4
- Term 56019 - 56067 8.3
49 23 Tu 1 . - CDS 56115 - 56612 660 ## BF1662 hypothetical protein
- Prom 56660 - 56719 3.9
50 24 Tu 1 .
- CDS 56727 - 57497 923 ## COG4221 Short-chain alcohol dehydrogenase of unknown specificity
- Prom 57552 - 57611 4.3
+ Prom 57479 - 57538 5.4
51 25 Tu 1 . + CDS 57688 - 58971 899 ## COG0513 Superfamily II DNA and RNA helicases
+ Prom 59044 - 59103 8.7
52 26 Op 1 . + CDS 59270 - 59479 388 ## BF1659 hypothetical protein
53 26 Op 2 . + CDS 59506 - 59871 310 ## BF1658 hypothetical protein
54 26 Op 3 . + CDS 59876 - 60139 248 ## BF1657 hypothetical protein
- Term 60131 - 60178 11.6
55 27 Tu 1 . - CDS 60205 - 61176 1009 ## COG1482 Phosphomannose isomerase
- Prom 61249 - 61308 7.7
+ Prom 61148 - 61207 7.5
56 28 Op 1 . + CDS 61440 - 62537 360 ## PROTEIN SUPPORTED gi|15900011|ref|NP_344615.1| aldose 1-epimerase
57 28 Op 2 . + CDS 62588 - 63913 1722 ## COG0738 Fucose permease
58 28 Op 3 . + CDS 63949 - 65103 1344 ## COG0153 Galactokinase
+ Term 65125 - 65174 8.9
+ Prom 65133 - 65192 4.0
59 29 Op 1 . + CDS 65258 - 67267 2030 ## COG0021 Transketolase
+ Term 67280 - 67333 14.2
60 29 Op 2 . + CDS 67350 - 67784 478 ## COG0698 Ribose 5-phosphate isomerase RpiB
+ Term 67845 - 67905 15.3
+ Prom 67839 - 67898 6.5
61 30 Op 1 . + CDS 68094 - 68369 302 ## BF1650 hypothetical protein
62 30 Op 2 . + CDS 68388 - 68615 344 ## BF1649 hypothetical protein
63 30 Op 3 . + CDS 68627 - 69709 1241 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit
64 30 Op 4 . + CDS 69720 - 69893 202 ## BF1647 hypothetical protein
65 30 Op 5 22/0.000 + CDS 69990 - 70751 861 ## COG1013 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit
66 30 Op 6 . + CDS 70770 - 71312 712 ## COG1014 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, gamma subunit
+ Term 71339 - 71399 3.3
+ Prom 71338 - 71397 2.3
67 31 Op 1 . + CDS 71435 - 73477 1817 ## BF1652 hypothetical protein
68 31 Op 2 .
+ CDS 73486 - 74310 755 ## COG0834 ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
+ Prom 74447 - 74506 7.4
69 32 Tu 1 . + CDS 74526 - 75512 890 ## COG0673 Predicted dehydrogenases and related proteins
+ Prom 75570 - 75629 6.5
70 33 Op 1 . + CDS 75666 - 76187 332 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
71 33 Op 2 . + CDS 76184 - 76609 403 ## BF1638 hypothetical protein
72 33 Op 3 . + CDS 76638 - 77087 511 ## BF1637 hypothetical protein
73 33 Op 4 . + CDS 77101 - 77994 962 ## BF1636 hypothetical protein
+ Prom 78081 - 78140 5.9
74 34 Tu 1 . + CDS 78162 - 78629 528 ## COG0590 Cytosine/adenosine deaminases
+ Term 78640 - 78683 -0.2
- Term 78622 - 78677 5.7
75 35 Op 1 . - CDS 78717 - 80387 1504 ## BF1644 hypothetical protein
76 35 Op 2 . - CDS 80422 - 81756 1071 ## BF1643 hypothetical protein
- Prom 81902 - 81961 4.7
+ Prom 81734 - 81793 5.8
77 36 Tu 1 . + CDS 81964 - 82806 1480 ## PROTEIN SUPPORTED gi|53712914|ref|YP_098906.1| ribosomal protein L11 methyltransferase
+ Term 82991 - 83028 -0.7
- Term 82822 - 82861 -0.2
78 37 Op 1 . - CDS 82883 - 83389 684 ## COG0716 Flavodoxins
79 37 Op 2 24/0.000 - CDS 83408 - 85444 2081 ## COG0022 Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
- Prom 85492 - 85551 2.4
80 37 Op 3 . - CDS 85588 - 86955 1387 ## COG0508 Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
- Prom 86980 - 87039 6.7
- Term 87275 - 87315 5.1
81 38 Op 1 3/0.000 - CDS 87464 - 88165 496 ## COG0095 Lipoate-protein ligase A
82 38 Op 2 . - CDS 88165 - 89514 671 ## PROTEIN SUPPORTED gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1
- Prom 89639 - 89698 6.2
+ Prom 89848 - 89907 4.3
83 39 Tu 1 . + CDS 90079 - 90285 214 ## BF1617 hypothetical protein
84 40 Op 1 . - CDS 90466 - 92055 1528 ## BF1616 hypothetical protein
85 40 Op 2 .
- CDS 92081 - 93736 1374 ## BF1615 hypothetical protein
- Prom 93952 - 94011 6.4
+ Prom 93646 - 93705 2.6
86 41 Tu 1 . + CDS 93745 - 93930 79 ## BF1627 hypothetical protein
+ Prom 94084 - 94143 3.2
87 42 Tu 1 . + CDS 94189 - 95451 1256 ## COG1228 Imidazolonepropionase and related amidohydrolases
+ Term 95506 - 95555 3.6
- Term 95497 - 95538 8.2
88 43 Op 1 . - CDS 95557 - 96081 684 ## COG1038 Pyruvate carboxylase
89 43 Op 2 2/0.048 - CDS 96095 - 97606 1587 ## COG0439 Biotin carboxylase
90 43 Op 3 . - CDS 97625 - 99169 1603 ## COG4799 Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta)
- Prom 99273 - 99332 6.0
+ Prom 99254 - 99313 7.2
91 44 Op 1 . + CDS 99431 - 100024 651 ## BF1622 hypothetical protein
92 44 Op 2 . + CDS 100076 - 101524 1699 ## BF1609 hypothetical protein
+ Term 101583 - 101625 10.2
+ Prom 101596 - 101655 9.1
93 45 Tu 1 . + CDS 101729 - 103228 1466 ## COG1620 L-lactate permease
+ Term 103252 - 103305 8.4
- Term 103236 - 103297 12.7
94 46 Op 1 . - CDS 103364 - 104956 1692 ## BF1606 hypothetical protein
- Prom 105002 - 105061 5.3
95 46 Op 2 . - CDS 105064 - 108192 3001 ## BF1605 hypothetical protein
- Prom 108266 - 108325 7.7
+ Prom 108262 - 108321 6.1
96 47 Op 1 . + CDS 108519 - 108920 199 ## BF1604 hypothetical protein
97 47 Op 2 6/0.000 + CDS 108917 - 111163 1716 ## COG0161 Adenosylmethionine-8-amino-7-oxononanoate aminotransferase
+ Term 111220 - 111269 -0.8
+ Prom 111178 - 111237 2.8
98 47 Op 3 6/0.000 + CDS 111302 - 112498 671 ## COG0156 7-keto-8-aminopelargonate synthetase and related enzymes
99 47 Op 4 9/0.000 + CDS 112503 - 113978 864 ## COG0500 SAM-dependent methyltransferases
100 47 Op 5 . + CDS 114013 - 114654 647 ## COG0132 Dethiobiotin synthetase
+ Term 114900 - 114941 -0.3
+ Prom 114878 - 114937 6.3
101 48 Tu 1 .
+ CDS 115066 - 116421 1227 ## COG1252 NADH dehydrogenase, FAD-containing subunit
102 49 Op 1 36/0.000 - CDS 116593 - 119013 2347 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
103 49 Op 2 36/0.000 - CDS 119010 - 119726 359 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc)
104 49 Op 3 13/0.000 - CDS 119748 - 122153 1760 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
105 49 Op 4 . - CDS 122166 - 123413 1472 ## COG0845 Membrane-fusion protein
- Prom 123441 - 123500 4.7
106 50 Tu 1 . - CDS 123554 - 124723 1143 ## BF1594 putative outer membrane efflux protein
- Prom 124844 - 124903 5.6
+ Prom 124871 - 124930 3.6
107 51 Op 1 8/0.000 + CDS 125154 - 126491 1264 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
108 51 Op 2 . + CDS 126497 - 127747 966 ## COG5000 Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
109 51 Op 3 . + CDS 127816 - 128814 1018 ## COG0628 Predicted permease
110 51 Op 4 . + CDS 128885 - 130057 990 ## COG0463 Glycosyltransferases involved in cell wall biogenesis
111 52 Op 1 . - CDS 130063 - 131370 1184 ## COG4591 ABC-type transport system, involved in lipoprotein release, permease component
112 52 Op 2 . - CDS 131380 - 132579 1272 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase
- Prom 132622 - 132681 5.2
+ Prom 132546 - 132605 8.9
113 53 Op 1 . + CDS 132750 - 133985 1092 ## COG2407 L-fucose isomerase and related proteins
+ Term 133997 - 134045 12.4
+ Prom 133990 - 134049 6.6
114 53 Op 2 . + CDS 134069 - 136093 1078 ## BF1599 putative chondroitinase AC precursor
- Term 136002 - 136060 8.2
115 54 Op 1 . - CDS 136108 - 136932 708 ## COG2273 Beta-glucanase/Beta-glucan synthetase
116 54 Op 2 . - CDS 136944 - 137777 918 ## COG1387 Histidinol phosphatase and related hydrolases of the PHP family
117 54 Op 3 .
- CDS 137774 - 139639 1859 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains
118 54 Op 4 . - CDS 139665 - 140495 811 ## BF1581 hypothetical protein
- Prom 140518 - 140577 8.2
- Term 140555 - 140598 9.2
119 55 Op 1 . - CDS 140637 - 141020 391 ## BF1580 hypothetical protein
120 55 Op 2 . - CDS 141017 - 142456 1148 ## BF1593 hypothetical protein
- Prom 142484 - 142543 5.3
121 56 Tu 1 . - CDS 142657 - 143850 1036 ## BF1578 hypothetical protein
122 57 Op 1 40/0.000 + CDS 144158 - 144847 723 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
+ Prom 144902 - 144961 3.1
123 57 Op 2 . + CDS 144987 - 146768 1721 ## COG0642 Signal transduction histidine kinase
124 58 Tu 1 . - CDS 146774 - 148612 1420 ## BF1574 hypothetical protein
- Prom 148657 - 148716 6.5
+ Prom 148556 - 148615 7.2
125 59 Tu 1 . + CDS 148707 - 149642 1063 ## BF1587 hypothetical protein
+ Term 149685 - 149741 18.2
+ Prom 150625 - 150684 3.6
126 60 Op 1 . + CDS 150747 - 151949 968 ## BF1569 outer membrane vitamin B12 receptor protein
127 60 Op 2 . + CDS 151950 - 152783 663 ## BF1569 outer membrane vitamin B12 receptor protein
128 60 Op 3 . + CDS 152830 - 154872 1570 ## COG3391 Uncharacterized conserved protein
129 60 Op 4 . + CDS 154908 - 156272 810 ## BF1567 hypothetical protein
+ Term 156288 - 156325 5.3
130 61 Tu 1 . + CDS 156339 - 157799 488 ## BF1566 hypothetical protein
+ Term 157903 - 157959 -0.0
131 62 Op 1 . - CDS 157952 - 159466 1327 ## BF1580 hypothetical protein
132 62 Op 2 . - CDS 159516 - 160544 818 ## BF1564 hypothetical protein
133 62 Op 3 . - CDS 160571 - 161656 1179 ## BF1563 hypothetical protein
134 62 Op 4 . - CDS 161700 - 162563 733 ## BF1562 hypothetical protein
- Prom 162692 - 162751 6.1
+ Prom 162690 - 162749 5.6
135 63 Op 1 . + CDS 162853 - 163155 345 ## COG2388 Predicted acetyltransferase
136 63 Op 2 .
+ CDS 163169 - 163384 290 ## BF1560 hypothetical protein
+ Term 163391 - 163438 2.4
- Term 163379 - 163426 12.1
137 64 Op 1 . - CDS 163469 - 165712 1473 ## BF1574 hypothetical protein
138 64 Op 2 . - CDS 165793 - 166467 348 ## BF1573 hypothetical protein
139 64 Op 3 . - CDS 166482 - 167828 1058 ## COG1253 Hemolysins and related proteins containing CBS domains
- Prom 167899 - 167958 2.1
- Term 167897 - 167947 6.8
140 65 Op 1 . - CDS 167964 - 168422 380 ## COG0629 Single-stranded DNA-binding protein
- Prom 168446 - 168505 3.8
141 65 Op 2 . - CDS 168510 - 170078 1604 ## COG3119 Arylsulfatase A and related enzymes
142 65 Op 3 . - CDS 170124 - 171170 686 ## COG1194 A/G-specific DNA glycosylase
- Prom 171296 - 171355 5.8
+ Prom 171185 - 171244 5.4
143 66 Op 1 . + CDS 171376 - 171651 301 ## COG0776 Bacterial nucleoid DNA-binding protein
+ Term 171805 - 171861 11.8
+ Prom 171839 - 171898 2.3
144 66 Op 2 . + CDS 171930 - 173504 1727 ## COG1530 Ribonucleases G and E
+ Term 173508 - 173549 10.4
- Term 173490 - 173543 14.4
145 67 Tu 1 1/0.190 - CDS 173563 - 174510 748 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase
- Prom 174531 - 174590 4.3
146 68 Op 1 1/0.190 - CDS 174629 - 175525 727 ## COG0451 Nucleoside-diphosphate-sugar epimerases
147 68 Op 2 6/0.000 - CDS 175537 - 176358 293 ## COG1216 Predicted glycosyltransferases
148 68 Op 3 . - CDS 176371 - 177414 185 ## COG0438 Glycosyltransferase
149 68 Op 4 . - CDS 177426 - 178481 268 ## BT_1343 putative capsule biosynthesis protein
150 68 Op 5 1/0.190 - CDS 178486 - 179205 234 ## COG3774 Mannosyltransferase OCH1 and related enzymes
151 68 Op 6 . - CDS 179222 - 180289 375 ## COG0463 Glycosyltransferases involved in cell wall biogenesis
152 68 Op 7 . - CDS 180286 - 180984 364 ## Geob_1473 putative transferase
- Prom 181056 - 181115 2.7
- Term 181191 - 181239 2.3
153 69 Tu 1 .
- CDS 181299 - 181604 140 ## Rmar_1143 hypothetical protein
- Prom 181730 - 181789 6.2
154 70 Op 1 . - CDS 181805 - 182233 196 ## gi|265762935|ref|ZP_06091503.1| predicted protein
155 70 Op 2 . - CDS 182247 - 183107 434 ## BF1837 putative alpha-1,2-fucosyltransferase
156 70 Op 3 . - CDS 183095 - 184027 381 ## COG1442 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
157 70 Op 4 3/0.000 - CDS 184032 - 185465 460 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid
- Prom 185627 - 185686 7.7
- Term 186386 - 186441 -0.0
158 70 Op 5 8/0.000 - CDS 186481 - 187416 932 ## COG0451 Nucleoside-diphosphate-sugar epimerases
159 70 Op 6 . - CDS 187424 - 188740 1137 ## COG1004 Predicted UDP-glucose 6-dehydrogenase
160 70 Op 7 . - CDS 188760 - 189149 281 ## BF1553 hypothetical protein
161 70 Op 8 13/0.000 - CDS 189170 - 189619 370 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes
- Term 189664 - 189697 -1.0
162 70 Op 9 . - CDS 189745 - 190635 828 ## COG1209 dTDP-glucose pyrophosphorylase
163 70 Op 10 . - CDS 190672 - 191154 613 ## BF1529 hypothetical protein
164 70 Op 11 . - CDS 191166 - 191804 518 ## BF1528 putative transcriptional regulator UpxY-like protein
- Prom 191906 - 191965 8.8
165 71 Op 1 . + CDS 192528 - 192719 67 ## BF3700 hypothetical protein
+ Prom 192732 - 192791 2.9
166 71 Op 2 . + CDS 192815 - 193162 356 ## BF1526 hypothetical protein
+ Prom 193202 - 193261 1.9
167 72 Tu 1 . + CDS 193305 - 194219 749 ## BF1525 hypothetical protein
+ Term 194419 - 194456 3.3
168 73 Op 1 . + CDS 194686 - 195684 514 ## BF1523 hypothetical protein
169 73 Op 2 . + CDS 195653 - 196102 84 ## BF1522 hypothetical protein
- Term 196225 - 196262 1.1
170 74 Tu 1 . - CDS 196301 - 196702 252 ## BDI_0882 hypothetical protein
- Prom 196804 - 196863 7.2
171 75 Tu 1 . - CDS 197033 - 198022 650 ## BF1454 putative transmembrane protein
- Prom 198064 - 198123 5.9
+ Prom 197854 - 197913 5.5
172 76 Op 1 .
+ CDS 198154 - 199851 1975 ## COG2985 Predicted permease
173 76 Op 2 . + CDS 199893 - 201539 1664 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase
+ Term 201663 - 201701 4.5
174 77 Op 1 13/0.000 - CDS 201528 - 202820 1016 ## COG0642 Signal transduction histidine kinase
175 77 Op 2 . - CDS 202864 - 204213 1405 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
- Prom 204244 - 204303 1.8
- Term 204278 - 204332 5.5
176 78 Tu 1 . - CDS 204363 - 205835 1627 ## BF1514 putative outer membrane protein OprM precursor
- Prom 205945 - 206004 4.3
- Term 206057 - 206085 -1.0
177 79 Tu 1 . - CDS 206290 - 208902 1800 ## COG0642 Signal transduction histidine kinase
- Prom 208932 - 208991 2.8
178 80 Op 1 . - CDS 208998 - 210242 1246 ## BF1446 putative ABC transporter permease component
179 80 Op 2 . - CDS 210257 - 211555 835 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
180 80 Op 3 . - CDS 211569 - 212213 515 ## BF1509 hypothetical protein
181 80 Op 4 . - CDS 212229 - 212894 337 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc)
- Prom 212931 - 212990 4.7
- Term 213015 - 213044 0.0
182 81 Op 1 . - CDS 213054 - 214304 998 ## BF1507 ABC transporter permease
183 81 Op 2 36/0.000 - CDS 214313 - 215587 1214 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
184 81 Op 3 . - CDS 215629 - 216294 338 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc)
- Prom 216390 - 216449 3.0
+ Prom 216515 - 216574 3.8
185 82 Tu 1 . + CDS 216611 - 217510 665 ## COG2207 AraC-type DNA-binding domain-containing proteins
- Term 217455 - 217485 -0.6
186 83 Op 1 . - CDS 217658 - 218929 973 ## BF1436 putative ABC transporter permease component
187 83 Op 2 . - CDS 218936 - 220243 1235 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
188 83 Op 3 .
- CDS 220268 - 221554 1068 ## BF1434 putative ABC transporter permease component
189 83 Op 4 . - CDS 221558 - 222865 1080 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
190 83 Op 5 . - CDS 222884 - 224152 1143 ## BF1499 ABC transporter permease
191 83 Op 6 13/0.000 - CDS 224172 - 225437 777 ## COG0577 ABC-type antimicrobial peptide transport system, permease component
- Prom 225462 - 225521 2.1
- Term 225471 - 225506 4.7
192 83 Op 7 . - CDS 225558 - 226805 1440 ## COG0845 Membrane-fusion protein
- Prom 226908 - 226967 4.9
- Term 227037 - 227081 7.5
193 84 Tu 1 . - CDS 227130 - 228029 894 ## COG1705 Muramidase (flagellum-specific)
- Prom 228059 - 228118 5.1
+ Prom 228062 - 228121 7.4
194 85 Op 1 . + CDS 228145 - 228627 610 ## COG0295 Cytidine deaminase
195 85 Op 2 . + CDS 228630 - 229469 697 ## COG2207 AraC-type DNA-binding domain-containing proteins
196 85 Op 3 . + CDS 229557 - 230159 738 ## COG3059 Predicted membrane protein
197 85 Op 4 . + CDS 230216 - 231592 469 ## PROTEIN SUPPORTED gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1
+ Term 231637 - 231683 11.2
+ Prom 231650 - 231709 2.5
198 86 Op 1 . + CDS 231909 - 233498 1054 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains
199 86 Op 2 . + CDS 233530 - 234312 690 ## COG2816 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding
- Term 234345 - 234389 0.1
200 87 Tu 1 . - CDS 234440 - 235336 993 ## BF1487 histidine decarboxylase
- Prom 235547 - 235606 5.3
+ Prom 235218 - 235277 1.8
201 88 Tu 1 . + CDS 235384 - 235545 94 ## BF1486 hypothetical protein
202 89 Tu 1 . - CDS 235866 - 237368 805 ## PROTEIN SUPPORTED gi|90021240|ref|YP_527067.1| ribosomal protein S32
- Prom 237393 - 237452 7.7
- Term 237397 - 237444 12.2
203 90 Op 1 . - CDS 237483 - 237941 455 ## BF1484 hypothetical protein
204 90 Op 2 . - CDS 237983 - 239680 1562 ## BF1416 putative outer membrane protein
205 90 Op 3 .
- CDS 239696 - 243022 2663 ## BF1482 hypothetical protein
- Prom 243097 - 243156 8.7
206 91 Op 1 6/0.000 - CDS 243240 - 244130 596 ## COG3712 Fe2+-dicitrate sensor, membrane component
- Prom 244150 - 244209 4.7
207 91 Op 2 . - CDS 244228 - 244806 352 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
- Prom 244939 - 244998 7.4
- Term 245585 - 245630 7.9
208 92 Tu 1 . - CDS 245697 - 247442 1888 ## COG1109 Phosphomannomutase
- Prom 247476 - 247535 5.7
209 93 Op 1 . - CDS 247550 - 249187 1742 ## COG4690 Dipeptidase
210 93 Op 2 . - CDS 249236 - 250888 1485 ## COG0739 Membrane proteins related to metalloendopeptidases
- Prom 250911 - 250970 5.8
+ Prom 250870 - 250929 5.0
211 94 Tu 1 . + CDS 251076 - 252185 1273 ## COG0686 Alanine dehydrogenase
+ Term 252260 - 252291 0.1
212 95 Op 1 . - CDS 252281 - 253225 789 ## COG2070 Dioxygenases related to 2-nitropropane dioxygenase
213 95 Op 2 . - CDS 253283 - 253987 632 ## BF1404 hypothetical protein
214 95 Op 3 . - CDS 254001 - 254531 406 ## BF1472 hypothetical protein
215 95 Op 4 . - CDS 254550 - 255590 766 ## BF1471 hypothetical protein
216 95 Op 5 . - CDS 255574 - 256125 348 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
- Prom 256218 - 256277 4.1
- Term 256300 - 256360 13.2
217 96 Op 1 . - CDS 256396 - 257235 858 ## PROTEIN SUPPORTED gi|163755345|ref|ZP_02162465.1| 30S ribosomal protein S6
218 96 Op 2 . - CDS 257228 - 257620 334 ## BF1468 hypothetical protein
- Prom 257676 - 257735 2.3
+ Prom 257608 - 257667 5.5
219 97 Tu 1 . + CDS 257717 - 258190 531 ## COG1576 Uncharacterized conserved protein
220 98 Op 1 . - CDS 258200 - 258655 437 ## COG0780 Enzyme related to GTP cyclohydrolase I
221 98 Op 2 . - CDS 258669 - 259328 604 ## COG0603 Predicted PP-loop superfamily ATPase
222 98 Op 3 . - CDS 259367 - 260047 751 ## COG1738 Uncharacterized conserved protein
- Prom 260139 - 260198 3.4
- Term 260148 - 260217 6.2
223 99 Tu 1 .
- CDS 260267 - 260770 656 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog
- Prom 260802 - 260861 5.3
+ Prom 260705 - 260764 4.1
224 100 Tu 1 . + CDS 260934 - 262223 1291 ## BF1393 hypothetical protein
+ Term 262373 - 262407 3.0
- Term 262065 - 262098 -0.9
225 101 Tu 1 . - CDS 262312 - 263463 790 ## BF1461 hypothetical protein
- Prom 263498 - 263557 8.5
- Term 263508 - 263549 7.1
226 102 Tu 1 . - CDS 263594 - 265099 1453 ## BF1460 putative outer membrane protein precursor
- Prom 265288 - 265347 6.5
+ Prom 265087 - 265146 4.7
227 103 Tu 1 . + CDS 265320 - 266330 1096 ## COG1052 Lactate dehydrogenase and related dehydrogenases
+ Term 266386 - 266424 3.4
228 104 Tu 1 . - CDS 266549 - 267091 559 ## BF1388 hypothetical protein
- Prom 267162 - 267221 1.9
- Term 267162 - 267201 8.2
229 105 Tu 1 . - CDS 267328 - 269145 1455 ## BF1455 transcriptional regulator
- Prom 269289 - 269348 7.3
+ Prom 269318 - 269377 5.7
230 106 Op 1 . + CDS 269398 - 270102 745 ## COG1741 Pirin-related protein
231 106 Op 2 . + CDS 270120 - 270827 422 ## COG0259 Pyridoxamine-phosphate oxidase
232 106 Op 3 . + CDS 270899 - 271624 658 ## COG2220 Predicted Zn-dependent hydrolases of the beta-lactamase fold
233 106 Op 4 . + CDS 271644 - 272720 962 ## BF1451 hypothetical protein
+ Prom 272737 - 272796 3.0
234 107 Tu 1 . + CDS 272842 - 273222 171 ## PROTEIN SUPPORTED gi|148984704|ref|ZP_01817972.1| 50S ribosomal protein L20
+ Term 273235 - 273297 8.6
- Term 273221 - 273283 1.0
235 108 Tu 1 . - CDS 273298 - 273486 217 ##
- Prom 273657 - 273716 2.0
+ Prom 273364 - 273423 3.9
236 109 Tu 1 . + CDS 273584 - 274318 860 ## COG0500 SAM-dependent methyltransferases
- Term 274310 - 274352 1.0
237 110 Op 1 1/0.190 - CDS 274361 - 276226 1738 ## COG0471 Di- and tricarboxylate transporters
238 110 Op 2 .
- CDS 276286 - 276882 739 ## COG0526 Thiol-disulfide isomerase and thioredoxins
- Prom 277046 - 277105 80.3
+ TRNA 277023 - 277099 73.6 # Thr TGT 0 0

Predicted protein(s)
>gi|226332055|gb|ACIB01000001.1| GENE 1 398 - 844 405 148 aa, chain - ## HITS:1 COG:SP1327 KEGG:ns NR:ns ## COG: SP1327 COG2731 # Protein_GI_number: 15901181 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase, beta subunit # Organism: Streptococcus pneumoniae TIGR4 # 1 146 1 147 152 73 29.0 1e-13
MIVSNLQNSQRVEGLHPLFKTLFDYVKTHDLFHAELGRIEIDGDNLFINNVNPECVARDK
QVLELHRDYIDVHILLEGTETIGWKAIEDLKDEVKPYEANGDCALYSDAPTTFVDLLPGQ
FMIVYPEDPHAPLIGQGKIRKLIAKVKL
>gi|226332055|gb|ACIB01000001.1| GENE 2 859 - 2097 998 412 aa, chain - ## HITS:1 COG:CC2486 KEGG:ns NR:ns ## COG: CC2486 COG0477 # Protein_GI_number: 16126725 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Caulobacter vibrioides # 7 358 40 405 519 114 27.0 2e-25
MKNSKIYPWIVVALLWGVALLNYMDRQMLSTMKDAMQVDIVELQSATNFGRLMAVFLWIY
GLMSPISGMIADRLNRKWLIVGSLFVWSFVTYLMGIAETFNQVFWLRALMGVSEALYIPA
GLSLIADYHTEKSRSLAVGIHMTGLYTGQAIGGFGATVAAAFSWHTTFHWFGIIGIAYAL
VLVLFLKDKKEHVKTERLKPSSKNGEKAGLFKGLSLLFSNIAFWVILLYFAAPSLPGWAT
KNWLPTLFAENLDIPMSQAGPMSTITIALSSFIGVILGGTLSDKWVQKNIRGRVYTGAIG
LGLTIPSLLLLGFGHSFVAVVGAGLLFGIGYGIFDANNMPILCQFVSSKYRATAYGIMNM
TGVFAGAFITDLLGKWTDGGNLGLGFAMLAIIVFIALAVQLYFLRPKTDNME
>gi|226332055|gb|ACIB01000001.1| GENE 3 2113 - 3297 1032 394 aa, chain - ## HITS:1 COG:slr1975 KEGG:ns NR:ns ## COG: slr1975 COG2942 # Protein_GI_number: 16330802 # Func_class: G Carbohydrate transport and metabolism # Function: N-acyl-D-glucosamine 2-epimerase # Organism: Synechocystis # 8 391 7 386 391 261 35.0 1e-69
MNTTEYLQTWSDSYKNDMISNIMPFWMKYGWDRKNGGVYTCVDRDGQLMDTTKSVWFQGR
FAFTCSYAYNHIERNTEWLAAAKSTLDFIEAHCFDTDGRMFFEVTETGLPIRKRRYVFSE
TFAAIAMSEYAIASGDHSYAVKALKLFNDIRHFLSTPGILEPKYCERVQMKGHSIIMILI
NVASRIRAAINDPVLDRQIEESIAILRKDFMHPEFKALLETVGPNGEFIDTNATRTINPG
HCIETSWFILEEAKNRNWDKEMVDTALTILDWSWEWGWDKEYGGIINFRDCRNLPSQDYA
HDMKFWWPQTEAIIATLYAYQATKNEKYLAMHKQISDWTYAHFPDAEFGEWYGYLHRDGT
ISQPAKGNLFKGPFHIPRMMTKGYALCQELLSEK
>gi|226332055|gb|ACIB01000001.1| GENE 4 3321 - 4238 953 305 aa, chain - ## HITS:1 COG:YPO3024 KEGG:ns NR:ns ## COG: YPO3024 COG0329 # Protein_GI_number: 16123201 # Func_class: E Amino acid transport and metabolism; M Cell wall/membrane/envelope biogenesis # Function: Dihydrodipicolinate synthase/N-acetylneuraminate lyase # Organism: Yersinia pestis # 1 286 1 283 297 195 37.0 1e-49
MEKIIGLINAPFTPFYENGEVNYEPIEAYAKMLVKNGLQGVFINGSSGEGYMLTDEERMK
LAERWVEVSPKGFKVIVHVGSCCVKSSRKLAEHAQKIGAWGIGAMAPPFPKVGRVEELVK
YCEEIACGAPDLPFYYYHIPAFNGAFLSMVAFLEAVDGRIPNFAGIKYTFESMYEYNQCR
LYKGGKFDMLHGQDETILPCLAMGGAQGGIGGTTNYNGVNLVGIIEAWKAGDLEKARELQ
NFSQEVINVICHFRGNIVGGKRIMKLIGLDLGKNRTPFQNMTDDEEVRMKAELEAIHFFD
RCNKF
>gi|226332055|gb|ACIB01000001.1| GENE 5 4536 - 5744 341 402 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163762640|ref|ZP_02169704.1| ribosomal protein L33 [Bacillus selenitireducens MLS10] # 87 399 5 317 323 135 27 1e-30
MKQHLLKEIELGTKSALLKKKIITHYIYNGSSTITDLSKELDLSVPTVTKFISEMCEEGY
INDYGKLETSGGRHPNLYGLNPESGYFIGVDIKRFAINIGLINFKGDMMELKMNIPYKFE
NSIEGLNELCKLISNFIKKLTIAKDKILNINVNVSGRVNPESGYSFSQFNFEERPLSEVL
AEKLGYKVTIDNDTRAMTYGEYLKGCVNGEKDIIFVNISWGLGVGIIIDGKIYTGKSGFS
GEFGHTSTFDNEIICHCGKKGCLETEASGSALHRILLERIQNGENSILSNRIGDINNPIT
LDEIIASVNKEDLLCIEIVEEIGQKLGKQIAGLINLFNPELVIIGGTISLTGDYITQPIK
TAVRKYSLNLVNKDSAIVTSKLKDRAGIVGACMLARSRMFEC
>gi|226332055|gb|ACIB01000001.1| GENE 6 5741 - 6205 240 154 aa, chain - ## HITS:1 COG:no KEGG:BF1704 NR:ns ## KEGG: BF1704 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 154 18 171 171 312 99.0 2e-84
MKLITEGLLDKVTDQAKENSRLRMNYNFHDSMDAPIHRMLNALEPGTYLPPHRHKNPDKE
EVYLVLRGSLLAILFDDEGNVTEKVHLNPAEGHYGIEIPPCVWHTIVVLESGTVIYEIKQ
GPFAPLIPENLASWAPPATDEEAARVFMQRMLEL
>gi|226332055|gb|ACIB01000001.1| GENE 7 6205 - 6735 302 176 aa, chain - ## HITS:1 COG:no KEGG:BF1703 NR:ns ## KEGG: BF1703 # Name: not_defined # Def: putative transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 176 1 176 176 340 98.0 1e-92
MEETARKIKENASCWYAVYTAPRAEKKVKEQLDKIGVENYLPLQPVVRLWNNRKKKIFIP
VVPGCLFVHISSEEIAHVAGIHGVAFLLKEKGQYVSIPEVQMETFKTMIEHSCELVEFAP
NEFVPGTVVRVISGQLQGLEAELVECQGNNKLLLRVEGLGCALVTVSTDCVASKEE
>gi|226332055|gb|ACIB01000001.1| GENE 8 6711 - 7823 906 370 aa, chain - ## HITS:1 COG:no KEGG:BF1702 NR:ns ## KEGG: BF1702 # Name: not_defined # Def: putative protein involved in capsular polysaccharide biosynthesis # Organism: B.fragilis # Pathway: not_defined # 1 370 1 370 370 707 100.0 0
MTEDKNINKTTPQSEEQEIDLIELAQKVWAGRKLVLKVCGVAVLVGLVVAFSIPKEYSTS
VTLAPETGSKSSTGGMGALAAMAGINLGSSTGEDALSPELYPDIVSSTPFLLEMFDVKVA
DQKGKINTTLYEYLDKYQRAPWWGAVASAPFKALGWVVSLFKDAPEEQGDAKIDPFYLTA
DQAGIADALSHRISVSVDKKTGVTTLTVTMQDPLISAALTDTVMHCLQNYITDYRTNKAR
HDLAFTEKLFNEAQENYYEAQQKYARFMDGNQNIIMQSFRTEQERLQNEMNLAYGVFTQV
SQQLQLAKAKVQEITPVYTVVQPATVPLRPAKPNKIMILIGFVFLAGVGSIGWILFVKDL
LNGWKKQPEK
>gi|226332055|gb|ACIB01000001.1| GENE 9 7829 - 10177 1919 782 aa, chain - ## HITS:1 COG:aq_505 KEGG:ns NR:ns ## COG: aq_505 COG1596 # Protein_GI_number: 15605977 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protein involved in polysaccharide export # Organism: Aquifex aeolicus # 117 386 83 351 725 142 34.0 3e-33
MEVAVPVHRTTRTQQGRVRVLSKMMKVITLIALKKNLRDQKNQKNQKNIKGLRQSNNQKN
KRGIGDENLEMTDEDMMNEEDWSDEYTVKPEEDPTQQIFGHNIFTNENLTFEPNLNIATP
VSYRLGPGDEVIIDVWGASQTTIRQTISPEGSILVDNLGPIYLSGMTVREANNAVRREFA
KIYAGISGPNPNTSVDLTLGNIRTIQISIMGEVAVPGTYALSAFSSVFHALYRAGGVNKI
GSLRSIKVVRNGKKIADLDVYDFIMKGKLNDDVRLQDGDVVIVDPYESLVQITGKVKRPM
FYEMKPSETMATILKYSGGFTGDAYKKAIRLIRKTGREHQVYNVDEMDYSVFKLDDGDVL
AVDSVLERFENRVEVRGAVYRAGMYQIDGTVNTVKQLIKKAEGVRGDAFLNRAIIDREND
DLTHEMIQIDLKGLLNGTVADIPLQKNDILYIPSIEDLKEEATLTIHGEVANPGTYLYSS
NMSVEDLVLQAGGLLEAASTARVDVSRRIKNSKSTELSNVVGKTFSFELKDGFLVGGDQD FHLEPFDEVYIRRSPAYHQQQNVTVGGEVLFGGRYALSKKNERLSDLISKAGGITQDAYV KGARLIRKMTEEELRRKEDALRMANKGGADSISVKTLDVSDTYSVGIELEKALANPGSDF DMVLREGDILFVPEYVSTVKINGAVMYPNTVLYKKGESLKYYINQAGGFASLAKKKRAFV VYMNGTVSRLRTGNSKAIEPGCEIIVPSKDPKKRMSAAEIIGMGTSAASLATMIATMVNL FK >gi|226332055|gb|ACIB01000001.1| GENE 10 9978 - 10379 297 133 aa, chain - ## HITS:1 COG:no KEGG:BF1701 NR:ns ## KEGG: BF1701 # Name: not_defined # Def: putative capsule polysaccharide export protein # Organism: B.fragilis # Pathway: not_defined # 1 102 1 102 852 188 100.0 5e-47 MRRFITLFFLIFTLSGVAVAQQMSDDQVVQYVKDAQKMGKTQKQITTELMRRGVTKEQVE RIQEKYENGSGSTGTQNNQNSTRSRTRTQQNDESDYSNRSQKKSERSEKSKEPEEYKRAS SVEQPEKQAWNRR >gi|226332055|gb|ACIB01000001.1| GENE 11 10494 - 12398 1268 634 aa, chain - ## HITS:1 COG:BH3718 KEGG:ns NR:ns ## COG: BH3718 COG1086 # Protein_GI_number: 15616280 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Bacillus halodurans # 19 580 9 559 608 361 36.0 2e-99 MNIRFYYKYLSSRVASKWLILAVDVLLVIFSMFLASLLQIGLSALVFEFSLWVWTTLFCV IFNVCFFHLNRTYVGVIRYSSFIDISRIFISLTLGYLVTCVGNLLWMGWSGREVLPISVI LTAYIVNFSLMVCLRILVKMIHELMTFDRRHSIRVFVYGSKGSGINIAKSLRVSRSNHFR LKGFISDDTGFIGKQTMGCRVYANNESLFDILEEERIEAIIVSSEKVHRLETSGMIDRLI AEDIRILTVPPFNDLGKEGMQIKDIQIEDLLQRDPIHVDIRKISSHIEGKRIMITGAAGS IGREMVRQIAGLNPYKLILVDQAESPLHNVQLELLDNWRDIDAKMLVADVTNQTRMESIF KDYRPQYVFHAAAYKHVPMMEDNVSEAIQVNVLGTRIMADLAVKYGVEKFVMVSTDKAVN PTNVMGCSKRLAEIYVQSLAHQLSKYANDGALVKFITTRFGNVLGSNGSVIPRFKQQIEK GGPVTVTHPQVIRYFMTIPEACQLVLEAGSMGNGGEIYIFDMGNPVKIVDLARRMIYLSG QKNIKIEFTGLRHGEKLYEELLNVKEFTCPTYHEKIMIAKVREYDYEEVKQEIQKLIDLS YTSDTMGIVASMKKIVPEFVSKNSEFEILDKASF >gi|226332055|gb|ACIB01000001.1| GENE 12 13093 - 14655 666 520 aa, chain + ## HITS:1 COG:MA4289 KEGG:ns NR:ns ## COG: MA4289 COG3291 # Protein_GI_number: 20093078 # Func_class: R General function prediction only # Function: FOG: PKD repeat # 
Organism: Methanosarcina acetivorans str.C2A # 191 484 494 785 1734 91 25.0 3e-18 MKQKQFYFIYVFLLSMTFLGACSKDSPNELIPNTIVKIEIDKLPGKRIYFIGEELDVSDM TLKVFYSNETSEIVPVKKDEITGFNNTVPENDQILEVHKGSFTVTFKIQVLINDIQAISI KTLPSKTVYTLGEPLSLSNMVLEINYADGTIKVNSAPSADWVQGFNSSVPAQLQIVTLEL DGKQVSFDVQILPVKVDGDKVVSVIDSDFTSITFPDGIRTIGSKAFENKNIKASELLFPA SLSTIEQAAFAYCRNLKIVDLSHTSIKELPEEAFLFSGIKKIALPASLEVVGKEAFYGCT DLNVIDISHTSVKELQNGAFGKSGISSISLPSTFRIVGASAFIETKNLKELTLPEGSEVI DLEAFSGSSIQKVTLPNTIYHIDRSFYNCPELTTIETYGTRITPSPVDRTAAIVSECFNH SPKLTVLKIPASIAKIGISALNKCQVKTLILPASVKALDFNAFGNAVSLDEISLMSPTMV TADYYPVPPGIQKIRVPQNLVETYKQNKAWKPFAEKIVAL >gi|226332055|gb|ACIB01000001.1| GENE 13 14958 - 15104 175 48 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKAAIDYKKANNELTGVDTFEGILGYIVATVISNKNIQAHERSGKIYE >gi|226332055|gb|ACIB01000001.1| GENE 14 15250 - 16128 479 292 aa, chain + ## HITS:1 COG:BB0411 KEGG:ns NR:ns ## COG: BB0411 COG1864 # Protein_GI_number: 15594756 # Func_class: F Nucleotide transport and metabolism # Function: DNA/RNA endonuclease G, NUC1 # Organism: Borrelia burgdorferi # 101 273 4 175 195 127 39.0 3e-29 MKNKRKRPSKKQHHNSFKSFWIIALFAILPLIYGVYLCTPEIQAVFFQATKVSRPNVARP NYSHDENLKVPVSQFPLTEQIIHHKGYTVSYNKDKKIPNWVAYELTKQKTQGNIKRNERF IADPVVKGGMANNSDYSRSGFDKGHMAPAADMKWSNEAMKESFYFSNVCPQHPELNRRKW KTLEDKVREWAVADSAILIICGPVTNKKSPVIGKNRVTVPSKFFKVILSLHGSTPKAIGF IFKNERAIAPLRNYAVSIDSIEQLTGLDFFSSLPDSLENEIESRIDTTLWSI >gi|226332055|gb|ACIB01000001.1| GENE 15 16286 - 17323 564 345 aa, chain + ## HITS:1 COG:RSc1783 KEGG:ns NR:ns ## COG: RSc1783 COG1559 # Protein_GI_number: 17546502 # Func_class: R General function prediction only # Function: Predicted periplasmic solute-binding protein # Organism: Ralstonia solanacearum # 81 337 122 375 377 123 30.0 6e-28 MNKKRKKIFLSILATFFFICIAGAGTVYYYLFYPQFHPSKTTYIYIDRDDTTDSIFNKIK KQGNPHSFNGFKWMSHFREYSKNIHTGRYAIKPGDNTYQLYSRLSRGYQTPVNLTIGSVR TLDRLVRSVGKQLMIDSAEIAMALYDSIFLEKMGYTEATIPCLFIPETYQVYWDVSAADF LARMKKEHDKFWNKDRLSKAQAIGMTPEEVCTLASIVEEETNNNAEKPMVAGLYINRLHA 
GMPLQADPTIKFALQDFGLRRITNQHLNVQSPYNTYLNAGLPPGPIRIPSPKGLDSVLNY VKHNYIYMCAKEDFSGTHNFASNYADHMVNARKYWKALNERKIFK >gi|226332055|gb|ACIB01000001.1| GENE 16 17407 - 18999 1352 530 aa, chain + ## HITS:1 COG:CAC2001 KEGG:ns NR:ns ## COG: CAC2001 COG4231 # Protein_GI_number: 15895271 # Func_class: C Energy production and conversion # Function: Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits # Organism: Clostridium acetobutylicum # 3 529 2 521 584 358 38.0 2e-98 MSKQLLLGDEAIAQAALDAGLSGVYAYPGTPSTEITEYIQMAPITSERNIHNRWCANEKT AMEAALGMSFVGKRALVCMKHVGMNVAADCFINSAITGVKGGLIVVAADDPSMHSSQNEQ DSRFYGDFSLIPMYEPSNQQEAYDMVYNGFEFSEKIGEPILMRMVTRLAHSRSGVENKAQ KPQNEISFSEDPRQFILLPGNARKRYKVLLTRQEEFIKASEESPYNRYIDGPNKKTGIVA CGIGYNYLMENYPEGCEYPVLKVGQYPLPKKQLMQLIDACDEILVLEDGQPFVEKQLKGY LGIGLKVKGRLDGTLSQDGELNPDTVARALGKENSSEFNVPNIVEMRPPALCEGCGHRDM YITLTQVLKEEYPTHKVFSDIGCYTLGANAPFNAINSCVDMGASITMAKGASDGGLHPAV AVIGDSTFTHSGMTGLLDCVNENANVTIVISDNETTAMTGGQDSAGTGRLEAICAGLGVD PAHIRVVVPLKKNYEEMKQIIREEINYKGVSVIIPRRECIQTLARKKRSK >gi|226332055|gb|ACIB01000001.1| GENE 17 19003 - 19587 525 194 aa, chain + ## HITS:1 COG:MA1023 KEGG:ns NR:ns ## COG: MA1023 COG1014 # Protein_GI_number: 20089898 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, gamma subunit # Organism: Methanosarcina acetivorans str.C2A # 2 192 9 200 200 106 30.0 3e-23 MKKDIILSGVGGQGILSIATVIGKAALKDGLYMKQAEVHGMSQRGGDVQSNLRISDQPIA SDLIPSGKCDLIISLEPMEGLRYLPYLGHEGWLVTNETPFVNIPNYPAESDVMAEINKLP HKVVLNVDKVAKELGSTRVANIVLLGATIPFLGIDYEKIQDSIREIFQRKGDAIVELNLK ALAAGKEIAEKTMK >gi|226332055|gb|ACIB01000001.1| GENE 18 19692 - 20999 1226 435 aa, chain + ## HITS:1 COG:AF2013 KEGG:ns NR:ns ## COG: AF2013 COG1541 # Protein_GI_number: 11499595 # Func_class: H Coenzyme transport and metabolism # Function: Coenzyme F390 synthetase # Organism: Archaeoglobus fulgidus # 5 432 11 437 440 446 48.0 1e-125 MNTKYWEEEIETMSRKKLQELQLQRLKKTINIAANAPYYKKVFQEHGITPESIQSLDDIR 
KLPFTTKADMRANYPFGLVAGNMKEDGVRIHSSSGTTGTPTVIVHSQHDLDSWANLVARC LYCVGIRNTDVFQNSSGYGMFTGGLGFQYGAERLGALTVPAAAGNSKRQIKFITDFKTTA LHAIPSYAIRLAEVFQEEGIDPRSTTLKTLVIGAEPHTDEQRKKIERMLGVKAYNSFGMT EMNGPGVAFECTEQNGMHFWEDCYYVEIINPETAEPVPEGEIGELVLTTLDREMMPLIRY RTRDLTRILPGDCPCGRTHIRIDRIKGRSDDMFIIKGVNIFPMQVEKILVQFPELGSNYL ITLETVNNQDEMIVEVELSDLSTDNYIELEKIRKDITRQLKDEILVTPKLKLVKKGSLPQ SEGKAVRVKDLRNNK >gi|226332055|gb|ACIB01000001.1| GENE 19 21085 - 21654 404 189 aa, chain + ## HITS:1 COG:CAC0873 KEGG:ns NR:ns ## COG: CAC0873 COG0503 # Protein_GI_number: 15894160 # Func_class: F Nucleotide transport and metabolism # Function: Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins # Organism: Clostridium acetobutylicum # 1 180 1 181 189 179 49.0 3e-45 MQLLKKRILQDGKCYEGGILKVDSFINHQMDPVLMKSIGVEFVRLFAGTNVNKIMTIEAS GIAPAIMTGYLMDLPVVFAKKKSPRTIQNALSTTVHSFTKDRDYEVVISSDFLTPKDNVL FVDDFLAYGNAALGVIDLIKQSGANLVGMGFIIEKAFQNGRKTLEERGVRVESLAIIEDL SNCRITIKD >gi|226332055|gb|ACIB01000001.1| GENE 20 21729 - 22520 339 263 aa, chain - ## HITS:1 COG:MA4170 KEGG:ns NR:ns ## COG: MA4170 COG1145 # Protein_GI_number: 20092963 # Func_class: C Energy production and conversion # Function: Ferredoxin # Organism: Methanosarcina acetivorans str.C2A # 2 240 12 247 294 74 27.0 2e-13 MIFYFSGTGNSKWIAEQIAKAQNEVLVFMPNAIRDGIEEFVLADDEKVGFVFPVYSWGPP LSVLRFLDWVTLSNYHSQYVFFVCSCGDDTGLTEELFSRALSRKGMECNAGFSVAMPNNY VLLPGFDVDKKELEKKKLDEAVGRVEEINDSITGKKIGFHCNEGSFPWFKTKVLNPLFNR FMTSAKPFYATDDCIGCKRCERICPVGNVVMIGWRPVWGMDCTSCLACYHVCPKHAVQYG RRTKRKGQYLNPNVSISHEAAAQ >gi|226332055|gb|ACIB01000001.1| GENE 21 22948 - 23298 594 116 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53712979|ref|YP_098971.1| 50S ribosomal protein L20 [Bacteroides fragilis YCH46] # 1 116 1 116 116 233 100 6e-60 MPRSVNHVASKARRKKILKLTRGYFGARKNVWTVAKNTWEKGLTYAFRDRRNKKRNFRAL WIQRINAAARLEGMSYSKLMGGLHKAGIEINRKVLADLAVNHPEAFKAVVAKAKVA >gi|226332055|gb|ACIB01000001.1| GENE 22 23399 - 23596 332 65 aa, chain - ## PROTEIN SUPPORTED ## NR: 
gi|53712978|ref|YP_098970.1| 50S ribosomal protein L35 [Bacteroides fragilis YCH46] # 1 65 1 65 65 132 100 1e-29 MPKMKTNSGSKKRFALTGTGKIKRKHAFHSHILTKKSKKRKRNLCYSTTVDTTNVSQVKE LLAMK >gi|226332055|gb|ACIB01000001.1| GENE 23 23662 - 24258 364 198 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163801060|ref|ZP_02194960.1| 50S ribosomal protein L35 [Vibrio campbellii AND4] # 7 169 1 166 166 144 46 3e-33 MKSQYRINEQIRAKEVRIVGDDVEPKVYPIFQALKLAEEKELDLVEISPNAQPPVCRIID YSKFLYQLKKRQKEQKAKQVKVNVKEIRFGPQTDDHDYNFKLKHAKGFLEDGDKVKAYVF FKGRSILFKEQGEVLLLRFANDLEDYAKVDQLPVLEGKRMTIQLSPKKKESATKKPATPK PATPAAVKAEKPAGDNEE >gi|226332055|gb|ACIB01000001.1| GENE 24 24392 - 26332 1997 646 aa, chain - ## HITS:1 COG:DR2081 KEGG:ns NR:ns ## COG: DR2081 COG0441 # Protein_GI_number: 15807075 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Threonyl-tRNA synthetase # Organism: Deinococcus radiodurans # 2 641 1 647 649 652 52.0 0 MIKITFPDGSVREYNEGVNGLQIAESISSRLAQDVLACGVNGEIYDLGRPINEDASVVLY KWEDEQGKHAFWHTSAHLLAEALQELYPGIQFGIGPAIENGFYYDVDPGEAVIKEADLPA IEAKMAELVAKKEAVVRRDIAKGDALKMFGDRGETYKCELISELEDGHITTYTQGDFTDL CRGPHLMTTAPIKAIKLTSVAGAYWRGHEDRKMLTRIYGITFPKKKMLDEYLALMEEAKK RDHRKIGKEMQLFMFSDTVGKGLPMWLPKGTALRLRLQDFLRRIQTRYDYQEVITPPIGN KLLYVTSGHYAKYGKDAFQPIHTPEEGEEYFLKPMNCPHHCEIYKNFPRSYKDLPLRIAE FGTVCRYEQSGELHGLTRVRSFTQDDAHIFCRPDQVKGEFLRVMDIISIVFRSMDFDNFE AQISLRDKVNREKYIGSDENWEKAEQAIIEACEEKGLKAKIEYGEAAFYGPKLDFMVKDA IGRRWQLGTIQVDYNLPERFELEYMGSDNQKHRPVMIHRAPFGSMERFVAVLIEHTAGKF PLWLTPEQVVILPISEKFNEYAEKVKTYLKMKEIRAIVDDRNEKIGRKIRDNEMKRIPYM LIVGEKEAENGEVSVRRQGEGDKGTMKFEEFGEILNEEVQNMINKW >gi|226332055|gb|ACIB01000001.1| GENE 25 26413 - 28461 2075 682 aa, chain - ## HITS:1 COG:all0889 KEGG:ns NR:ns ## COG: all0889 COG0457 # Protein_GI_number: 17228384 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Nostoc sp. 
PCC 7120 # 35 631 50 584 605 166 25.0 2e-40 MIKKILAALLLFPTFAYAQINTDRVMMIARNALYFEDYVLSIQYFNQVINAKPYLYEPYF FRGLAKINLDDYQGAEADCDAAIDRNPFVVGAYQIRGLARIKQNKYDGAIEDYKKALHFD PENITLWHNLTLCHIQKEDYKAAEDDLGKLLAVAPKYTRAYLMRGEVALKQQDTLRALND FNTAIEMDKYDPDAWASRAIVRLQQGKYAEAESDLNHATHLNAKNAGNYINRALARFHQN NLRGAMSDYDLALDIDPNNFIGHYNRGLLRAQVGDDNRAIEDFDFVLQIEPDNMMATFNR GLLRAQTGDYRGAIKDYTKVIDVYPNFLAGYYQRAEARRKIGDRKGAEMDEFKVMKAQLD KQNGVSNADKSVADNKNGNNKDENKTRKKSDKDMNNYRKIVIADNSEVEQKYKSDIRGRV QDRNVNIKMEPMYALTYYEKMSDVKRAVHYYKYIDDLNRAGVLPKRLYITNMESPLTEEQ VKFHFALIDTHTSAIVENPKDARTRFSRALDFYLVQDFASSIEDLTQAILLDDSFFPAYF MRSLVRCKQLEYQKAEEANAASATPSTLPGISAPQKSEVSALDYDIVKSDLDHVITLAPD FVYAYYNRANVLAMLKDYRAAIADYDKAIELNKEFAEAYFNRGLTHIFLGNNKNGIADLS KAGELGIVSAYNILKRFTEVPE >gi|226332055|gb|ACIB01000001.1| GENE 26 28763 - 29980 850 405 aa, chain + ## HITS:1 COG:no KEGG:BF1685 NR:ns ## KEGG: BF1685 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 405 1 405 405 707 84.0 0 MRQKSLLLLLPFLLLSCGGKKNSGQLSDTAVQIVKPEFPQIVPFETGIETEQEILLSEIA DSIRYIPLETNNKCLMKRLRTTNILQTKDYFFLPWIEKLFQYTRDGKFVRTIGRKGNGPG EFNWIMQIDVDEEKGLVYMLTTSAKINIYSIETGKFVRAMKTPSMEAGDFAMLRTQDTIA ATFIRNNNGRRKERIYLSDLKGDTLKIFNRWDLFELNSSYSWMMSSDEDRYMFHYKNNTC YKEYYNDTLFTITPDSLEPRYIFQMGKYALPMECRFEYLNGDGKRFQELAAPYLQYNTIE TDSYVFMPYSNWTGEKARENQLAIYDKKGRSCFKVANGYIKNDLTPGLPFRPVTALDEHT LLCMWDAAEILEKAEKTPSILQIEPLKGLKEDDNPVMMIVYLKQP >gi|226332055|gb|ACIB01000001.1| GENE 27 30099 - 30653 649 184 aa, chain - ## HITS:1 COG:XF0926 KEGG:ns NR:ns ## COG: XF0926 COG0242 # Protein_GI_number: 15837528 # Func_class: J Translation, ribosomal structure and biogenesis # Function: N-formylmethionyl-tRNA deformylase # Organism: Xylella fastidiosa 9a5c # 2 177 3 170 170 120 41.0 1e-27 MILPIYVYGQPVLRQVAEDITVDYPNLKELIENMFETMDHADGVGLAAPQIGLPIRVVVI NLDVLSEDYPEYKDFRKAYINAHIDVVEGEEVSMEEGCLSLPGIHESVKRGSKIHVRYMD ENFVEHNEVVEGFLARVMQHEFDHLDGKMFIDHISPLRKQMIKGKLNTMLKGKARSSYKM KQVK >gi|226332055|gb|ACIB01000001.1| GENE 28 30689 - 31105 426 138 aa, 
chain - ## HITS:1 COG:CAC1680 KEGG:ns NR:ns ## COG: CAC1680 COG0816 # Protein_GI_number: 15894957 # Func_class: L Replication, recombination and repair # Function: Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) # Organism: Clostridium acetobutylicum # 3 135 2 134 135 83 37.0 1e-16 MSRIVAIDYGRKRTGIAVSDTMQIIANGLTTVPTHELLDFITNYVKQESVERIIIGLPKQ MNNEVSENMKNIEPFVRSLKKRLPDMPVEYVDERFTSVLAHRTMLEAGLKKKDRQNKALV DEISATIILQSYLETKRL >gi|226332055|gb|ACIB01000001.1| GENE 29 31227 - 32345 1354 372 aa, chain - ## HITS:1 COG:PA3692 KEGG:ns NR:ns ## COG: PA3692 COG2885 # Protein_GI_number: 15598888 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: Pseudomonas aeruginosa # 275 354 144 222 261 62 41.0 1e-09 MKSKLVILSLLLAGATAATAQTKETFYSESFKDNIFVSVGVGAQGCVNPDNFDYGFGKAI TPLINVSVGKLFNPVWGIRGQVAGAWTTLYSNYGQPADTYIKSKNKHYFTMRADGMFNLS NAIGGYNPDRLFTVSVFAGPGLTFAKAYGDQDKVNALINGSVGLAGQFNINKYLDINIEA RGEVSPSPFGNISSAHTDGAVSLTAGVTYTFGGKRFVSCGSQVDQNAINEELNRYRSELA KAQSDLADAKNALANVKPVTKEVVKEIEVAGPRAIFFQIGKSKIDDYGMVNIQLAAKILK ANPDKKYKVAGYADKATGSAKWNQKLSEARAQAVYDALIKEGVSKDQLELVGFGGTANMF GKNFLNRVVILE >gi|226332055|gb|ACIB01000001.1| GENE 30 32407 - 33153 463 248 aa, chain - ## HITS:1 COG:no KEGG:BF1681 NR:ns ## KEGG: BF1681 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 248 2 249 249 508 100.0 1e-143 MKKILLIGLFLCCHLALSAQLTYGTTGLLHAPSAEMQRDKTVMIGGNFLNKEITPPTWDY HTYNYFLNVTIFPWLEIAYTCTLFQSQTIGIDWKVGKKKFTNQDRYFSARLRVLKEGQLW KYMPAVVVGTSDPYTESGDGQVGSADGNGYFCRFYVAATKHIPIGKEKIGVHLSYLYNRR VDYHLNGLAGGLTYAPSFAPDLTVIAEYDAKDFAFGATYLLFNHLHAQVELQRMKYFTGG LTYKIYLK >gi|226332055|gb|ACIB01000001.1| GENE 31 33224 - 34300 1038 358 aa, chain - ## HITS:1 COG:no KEGG:BF1680 NR:ns ## KEGG: BF1680 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 358 1 358 358 
465 75.0 1e-129 MKMKSKFFGNGAKLALAVLAVCGTLFTSCYEKAEVDEATKPAPAKYYIAGTITDATTGQV LTTATVTLGGESVGATFNKEVSYKAEGYSLEVSATGYYTVSKQVYLNQVSDGQTSVATVD VALVSVEATTIPPVVPPTDPKTDVAESEAKALEAKAAEEAKPSTETIKNMWDGATATADE KASLETTKELTGNVTTGTTTAEAQADGSILVTTPFEFANSVKDAAILVPYFYNEGCELIG DVKEVAAPVTRAEGTVSADIQAAFIANAAKALNKNAGFVQKIGYAKISVLSGYGIIGYRV QGQLVSKKLTFLISGKYYEGIVSYQKGVMIYPNYYSHDTHDSHDSHGFNPNAGGGSND >gi|226332055|gb|ACIB01000001.1| GENE 32 34311 - 35558 541 415 aa, chain - ## HITS:1 COG:no KEGG:BF1679 NR:ns ## KEGG: BF1679 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 415 1 415 415 847 99.0 0 MNHKGPYMRKNITIGTTLRRISLILFCLLPLKGICQTGEATVDALVKMGFENVGWTEDTE ERVFVIQNSAYRLEGVGIGKAVDLIQKMGLPENKPCRLIVLDNNIPQISLYYQPMKGDSV AEVSRADWSVSYDLGEGWKQARRIKKQNSSLFKVDIVVYPELLFRNYILSKVYEIVVNVS PAIEVSLWKGMKLTGQVIFPIYNDYGQRYKQIRPGFVTLSQTVRLPQRTFLTASVGFFNK FRWGGDLKAKHFFKDERFSVDARIGYTGRGYFEDWAFYHGTKWTLTGSIGANFYWPKYNT QFSLKGERYLEGEYGARFDMIRHFRYASIGFYGMKVQHAGNKGLNGGFLFQIALPPYKYK RKGYIPRVIPNNFGFQYNAGNERIYGKGYSPQASDNVMENNSFNPYFIKSELLNF >gi|226332055|gb|ACIB01000001.1| GENE 33 35591 - 35731 57 46 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MDAKYGAKIDTILIIPKSFTCFFAYLYQFFFIQIPVCLKMDQKTHL >gi|226332055|gb|ACIB01000001.1| GENE 34 36278 - 37420 704 380 aa, chain - ## HITS:1 COG:TM0967 KEGG:ns NR:ns ## COG: TM0967 COG0582 # Protein_GI_number: 15643727 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Thermotoga maritima # 187 370 76 248 253 64 29.0 3e-10 MKKEVRIKEPVRIRTKRLSNGCESIYLDIYMDGRRRYEFLKLYIIPEHTRTDKDLNQSTM KLASAVKAQRIIELQNGVYGFNHQQEKKDIMLIDYIKYLADKDIEKTSRKVSMYTLIYHL QHYDKMGIRLRQVDKKYILGFVEYLKTATQKHCKSVKNISANTQVHYYKVFHHCLNSAVI DEIITSNPMDKIKNEEKPKRRRTERAFLTIDELKILSQTDFHNATLKKAFLFSCFCGLRH SDIVALTWGNLKKGKVGKMELHMTQQKTQEILSLPLSKEALKQLPVRGKVPDTEKVFKGL ISLGRTNEILPKWAAKAGIQKHITFHSARHTHATMMITLGADLYTVSKLLGHTNIQTTQI YAKIVDESKEKAIDLIPDIT >gi|226332055|gb|ACIB01000001.1| GENE 35 37951 - 38745 426 264 aa, chain + ## HITS:1 COG:no KEGG:BF1676 
NR:ns ## KEGG: BF1676 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 264 51 314 314 509 99.0 1e-143 MYKVDITIYPELSLKNLVITQIYQVLFNLSPAIEVSFWKGMKFTAQMVIPVYNDGYASRY DKLHPGFLELSQTVRLPYNFWATLAIGSFNNSRYGIDFNLIHHFKDERFSIEGRIGYTGT GYWEGFTMHYGTKMRATWSLGGSFYWPRYNVELNARVEQYLLKEKAVRVEAIRHFRYASI GFYAMKAKDVKANGGFRFQIALPPYRYKRKGYIPRITPSNNMGMSYNAGNEQYYYKTYRS APDDNIMKNNSFNPYFIKSELLNF >gi|226332055|gb|ACIB01000001.1| GENE 36 38756 - 40975 2004 739 aa, chain + ## HITS:1 COG:no KEGG:BF1683 NR:ns ## KEGG: BF1683 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 739 1 739 739 1230 99.0 0 MKMKSKLFGNGAKLALTLLAMCGMFASCYEKDEIDVPKVTDPTATTYQVAGIVTDAESNA VMDGVTVTNVTNSKSMTTAADGKYVLEAKVGETNVVTFAKSGYKTVTSSVFVTKQENGSI VAYTIDAKMEKGKETKPTKDVEYQIEGSVFDAESGEPVAIKTATMPGILTATIQNGNEFT MSNVGLGQHTIYITADGYKPATADVVISEVAGPEGEGIFIAKTSVTVAMQKKEVEVAPKY YLAGNIYNQDGGMVTKAEVTLNMSGYEVTTTASNGFYKFEIPADKVKGLTKATVTIRHNS YKIYVYTAFIAPVDNGQVSNTTVNVTLILKPDTEKPNDDDNVFIGGNVEIEVPTENAVEA APEKAEGEKVPSKSAVEESINKVLEAAGSDIKITPAQAEQVVNTMEKYVANGTLTEIDKV AVLPIEKETTFKLESKVIDTNNDKEETVIDEIVLPANTIIIFTSGVASTLNISRTDEGKE GASVREYDGTPTGTIFVTPMEVKFTPAVVVAQGETPEVALATLYFNEKTNTWEAEENYAT YQNGVFVGNVHHFSKFKFGFEEADSKATAEAALDSMKFDKACYTEGETAKVKMEINWKGG IKCEGGASVEEIIKKAHSTLTTTTIKMVSAALEEAIKDDNANVTPGAAFTDKKFTYELEV PAYTQLTGFDVTKNVIKTTYVLPFAVYNKATKAIEKKTAEVTISKISSVVVRTIEAIGHG HGHGHGDDLNAGGGIIISE >gi|226332055|gb|ACIB01000001.1| GENE 37 41087 - 41842 565 251 aa, chain + ## HITS:1 COG:no KEGG:BF1682 NR:ns ## KEGG: BF1682 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 251 21 271 271 494 99.0 1e-138 MLSAQLTYGTTGLLHAPSAEMQKDKTVMLGANFMNKEITPPTWYYHTYNYYLNVTILPWM EVAYTCTLFKAEALGLKPYGYSGFTNQDRYFSLRLRALEEGQFWKYMPAVVVGTSDPFTS SGNGVVAPTEGNGYFSRFYIAATRHVQLGRETVGVHLSYLYNKRIEYKLNGIAAGISYNP SFHPQLRLIAEYDSKDFALGATYLLFNHLHAQVELQRMKYFTGGLTFQFRLSGKDGMKKQ KRNKELKQKMK >gi|226332055|gb|ACIB01000001.1| GENE 38 41844 
- 42965 1135 373 aa, chain + ## HITS:1 COG:PA3692 KEGG:ns NR:ns ## COG: PA3692 COG2885 # Protein_GI_number: 15598888 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: Pseudomonas aeruginosa # 276 355 144 222 261 63 41.0 7e-10 MKSKIAILSLLLTGAAVSASAQTKEHYYSEKAKDNIFISVGVGAQGCVNPDNFDYGFGHA ITPLIHASVGKLFDPIWGIRGQVAGCWSTLYSEYGMPEGEYKKMKNKKYFTLRADGLFNL SNAIGGYNPDRLFTVSVFAGPGLTFAKAYGNQDKLNALINGSVGLMGQFNINKYLDINVE ARGEVSPSVFGHYSSARTDGAVSLTAGVTYTFGGKRFVSCGAQVDQNAINEELNRYRSEL SKAQSDLADAKNALANVKPVTKEVVKEIEVAGPRAVFFQIGKSKIDDYGMVNIQLAAKIL KANPDKKYKIAGYADKATGSAKWNQKLSEARAQAVYDALIKEGVSKDQLELVGFGGTANM FGKNFLNRVVILE >gi|226332055|gb|ACIB01000001.1| GENE 39 43189 - 43587 278 132 aa, chain + ## HITS:1 COG:no KEGG:BF1672 NR:ns ## KEGG: BF1672 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 132 1 132 132 255 100.0 4e-67 MDNRTEQYSLRELLESITENIKTKRLADVFRFISFHPGEMYGSHQHLRIEINYVKKGSCI LHPDHESITFREGEIMITTSDISHLFEAGADGTTLMQLEFLPEIFSHFSLNATVDSNGSA LLWTNRFKIKIE >gi|226332055|gb|ACIB01000001.1| GENE 40 43856 - 44818 954 320 aa, chain - ## HITS:1 COG:no KEGG:BF1679 NR:ns ## KEGG: BF1679 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 320 1 320 320 651 99.0 0 MIQKTYRLWVASLLTLCSASAVAQQKVEILPFGNMDQWVTREIKESGIIGGNTKKVYAIG PTETIVGAKPYANKGGSPWGTSNVMARVSGITKTNTSVFPETRGDGFCARLDTRMESVKV LGIVDITVLAAGSMFLGTVHEPIKSTKNPNKMLQMGIPFTERPSAIQFDYKVKMSDRENR IRATGFSKITDVPGKDFPAVILLLQKRWEDAKGNVYAKRIGTMVNYYYHSTDWKNGSKYD IMYGDITKDPAYKAHMMRLQASEYFTVNSKGESVPIHEVAWGEADDVPTHMILQFTSSHG GAYIGSPGNSLYIDNVKLIY >gi|226332055|gb|ACIB01000001.1| GENE 41 44833 - 45939 943 368 aa, chain - ## HITS:1 COG:no KEGG:BF1678 NR:ns ## KEGG: BF1678 # Name: not_defined # Def: sulfotransferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 368 1 368 368 706 99.0 0 MGLLEFNKLPINTLVGADWRTFKAITGGREIDAAYTGKYRLTKAVCRLLSTLAPLQDKHY 
EKLLANKPLEHDPVFILGHWRSGTTFVHNVFSCDSHFGYNTTYQTVFPHLMMWGQPFFKK NMSWLMPDKRPTDNMELAVDLPQEEEFALANMMPYTYYNFWFLPRYQQEYADKYLLFDTI SEAELKVFEETFTKLIKISLWNTQGTQFLSKNPPHTGRVKELVKMFPNAKFIYLMRNPYT VFESTRSFFTNTIQPLKLQDITPAELENNILSIYAKLYHKYEADKQFIPEGNLMEVKFED FEADAMGMTENIYHALDIPGFAEARGSIEKYVGGKKGYKKNKYKYDDRTVQLVQDHWDFA LKQWDYSL >gi|226332055|gb|ACIB01000001.1| GENE 42 46010 - 47440 1473 476 aa, chain - ## HITS:1 COG:PA4442_1 KEGG:ns NR:ns ## COG: PA4442_1 COG2895 # Protein_GI_number: 15599638 # Func_class: P Inorganic ion transport and metabolism # Function: GTPases - Sulfate adenylate transferase subunit 1 # Organism: Pseudomonas aeruginosa # 6 426 11 433 451 526 62.0 1e-149 MADKLDIKAFLDKDEQKDLLRLLTAGSVDDGKSTLIGRLLFDSKKLYEDQLDALERDSKR LGNAGEHIDYALLLDGLKAEREQGITIDVAYRYFSTNNRKFIIADTPGHEQYTRNMITGG STANLAIILVDARMGVITQTRRHTFLVSLLGIKHVVLAVNKMDLVDFSEERFNEIVAEYK KFVAPLGIPDVTCIPLSALDGDNVVDKSERTPWYEGLSLLDFLETVHIDSDNNFSDFRFP VQYVLRPNLDFRGFCGKVASGIIRKGDKVMALPSGKVSHVKSIVTFDGELDYAFPPQSVT LTLEDEIDVSRGEMLVHPDNLPIVDRNFEAMLVWMDEEPMDINKSFFIKQTTNVSRTRID SIKYKVDVNTMEHSSVPFLSLNEIARVVFTTAKELFFDPYRKNKSCGSFILIDPITNNTS AVGMIIDRVEKKDMNIADDFPVLNLPELGIAPEHYEAIEKAVKSLSEQGFEVRIEK >gi|226332055|gb|ACIB01000001.1| GENE 43 47452 - 48363 996 303 aa, chain - ## HITS:1 COG:VC2560 KEGG:ns NR:ns ## COG: VC2560 COG0175 # Protein_GI_number: 15642555 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes # Organism: Vibrio cholerae # 3 303 15 315 315 441 70.0 1e-124 MKEEYKLSHLKELEAESIHIIREVAAEFENPVMLYSIGKDSSVMVRLAEKAFYPGKVPFP LMHIDSKWKFKEMIQFRDEYAKKYGWNLIVESNMEAFHAGVGPFTHGSKVHTDLMKTQAL LRALDKYKFDAAFGGARRDEEKSRAKERIFSFRDKFHQWDPKNQRPELWDIYNARVHKGE SIRVFPLSNWTELDIWQYIRLENIPIVPLYYAKERPVVQMDGNLIMADDERLPEKYRDQI EMKMVRFRTLGCWPLTGAVESEADTIEKIVEEMMTTTKSERTTRVIDFDQEGSMEQKKRE GYF >gi|226332055|gb|ACIB01000001.1| GENE 44 48391 - 48999 635 202 aa, chain - ## HITS:1 COG:CAC0103 
KEGG:ns NR:ns ## COG: CAC0103 COG0529 # Protein_GI_number: 15893399 # Func_class: P Inorganic ion transport and metabolism # Function: Adenylylsulfate kinase and related kinases # Organism: Clostridium acetobutylicum # 15 192 16 192 200 202 53.0 2e-52 MEENNNIYPIFDRMLTRQDKEELLKQRGVMIWFTGLSGSGKSTIAIALERELHKRGLLCR ILDGDNIRTGINNNLGFSETDRVENIRRIAEVSKLFIDTGIITIAAFISPNNDIREMAAR IVGPDDFLEIFVSTPLAECEKRDVKGLYAKARRGEIKNFTGISAPFEAPEHPALSLDTSV LSLEESVNRLLEIVLPRVSRHE >gi|226332055|gb|ACIB01000001.1| GENE 45 49028 - 50584 1288 518 aa, chain - ## HITS:1 COG:BH3384 KEGG:ns NR:ns ## COG: BH3384 COG0471 # Protein_GI_number: 15615946 # Func_class: P Inorganic ion transport and metabolism # Function: Di- and tricarboxylate transporters # Organism: Bacillus halodurans # 1 241 2 237 589 154 36.0 3e-37 MTFEIAFVLLSLLGMVIALILDKMRPGMILFSVVVLFLCAGILTPKEMLEGFSNKGMITV ALLFLVSEGIRQSGALGQVVKKLLPQKRTTVFRAQLRLLPAVAFISAFLNNTPVVVIFAP IIKRWARTVRLPATKFLIPLSYVTILGGICTLIGTSTNLVVHGMILESGHEGFTMFELGK VGIFIAIAGIIYLFAFSKKLLPDARPDTAVPDEEVEEGDKLHRVEAVLGARFPGINKTLG EFNFKRHYGAEVKEIKKRNGQRFINNLEEVILREGDTLVVMADDTFIPTWGESSVFVLLA NGNDNEPIPGKGKRWFALILLILMIAGATIGELPVVKEMFPDMKLDMFFFVSVTTIIMAW TKIFPARKYTKYISWDILITIACAFAISKAMVNSGVADKVAGFIIGMSHDYGPHVLLAVL FIITNLFTELITNNAAAALAFPLALSLSVQLGVDPTPFFVVICMAASASFSTPIGYQTNL IVQGIGNYKFMDFVRIGLPLNLITFLISIFLIPLIWPF >gi|226332055|gb|ACIB01000001.1| GENE 46 50605 - 51423 761 272 aa, chain - ## HITS:1 COG:aq_337 KEGG:ns NR:ns ## COG: aq_337 COG1218 # Protein_GI_number: 15605852 # Func_class: P Inorganic ion transport and metabolism # Function: 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase # Organism: Aquifex aeolicus # 10 267 6 249 268 249 49.0 4e-66 MEHKYLFVAVDAALKAGEEILSIYTDPASDFEIERKADHSPLTIADRKAHVTIATILNET PFPVLSEEGKHLEYNTRRNWDVMWIVDPLDGTKEFIKRNGEFTVNIALVKAGVPIIGVIY LPVKKELYFAGQEIGAYKLSGITTLEDDATLDKLVAASVRLPQDLQRDRFVVVASRSHLT PETEAYIDAVKQKHKHVELISSGSSIKICLVAEGKADVYPRFAPTMEWDTAAGHAIARAT GMEIYQADKKDVPLQYNKEDLLNPWFIVEKRR >gi|226332055|gb|ACIB01000001.1| GENE 47 52058 - 
54733 1938 891 aa, chain + ## HITS:1 COG:SMb21655 KEGG:ns NR:ns ## COG: SMb21655 COG3250 # Protein_GI_number: 16263752 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Sinorhizobium meliloti # 39 808 3 734 755 229 28.0 3e-59 MKKSIILLFISLLLSPLCIKAHQPEFSTAGFFRLADSGRDVYSMNPAWRFYKGNCPGAET GNFDDRSWTVVSLPHGIEYLPTEASGCINYQGEVWYRKHFTPDAALKGKKLFLHFEAIMG KSKIYVNGKLLAEHFGGYLPVVVDVTDALEFGKENLIAVWADNSNDPNYPPGKQQEVLDF TYFGGIYRDCWLIAHNQVFITDPNYENEEAGGGLFVAYNNVSDHSAEVLLKIQIRNSGRK AFKGVVEYELQQPDGQQVASLNSVIRIKPGKSTYSSDKLTVKSPMLWSPENPALYNLIVR IRDEKGNPIDGYRRRIGIRSIEFKGEDGFWLNGKPYEAPLIGANRHQDFAIVGNAVPNSI HWRDAKKLRDAGMKVIRNAHCPQDPAFMDACDELGLFVIVNTPGWQFWNDAPVFAQRVYS DIRNLVRRDRNHPCVWMWEPILNETWYPADFARNAHDIVEAEYPYPYCYSGCDSEAKGKE HFQILFTHPLNGDGGAYSTNDIKKQLTYFTREWGDNVDDWNSHNSPSRVARNWGEQAMLI QAQHYARPSYKYTSYDALYRTPRQHVGGCLWHSFDHQRGYHPDPFYGGVMDVFRQPKYSY YMFMAQRSPIKEERLFQTGPMVYIAHEMTPFSGKDVTVYSNCDEVRLTYLKGGQTQTYVH KQEKEGMPHPVITFENVYDFMKDKALSRQGKQADVYLLAEGLIDGKVVATHKVAPARRPE KLLLWVDNEGTDLKADGSDFVTVIAAVADKDGNIKRLNNYTIKFQIEGEGRILGGAGNLA NPAPVRWGTAPILVQSTLKPGKIKITASVLFEGSQMPSSAVLELESKPGDFPQIYDREES ALIPNFTGQEAVNALAPKSEAALEKERLQKAENAAKLKEVEQQQEEFGEKK >gi|226332055|gb|ACIB01000001.1| GENE 48 54986 - 55765 527 259 aa, chain - ## HITS:1 COG:slr1117 KEGG:ns NR:ns ## COG: slr1117 COG0500 # Protein_GI_number: 16329224 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Synechocystis # 13 256 7 252 253 198 41.0 1e-50 MSNENKTIHDFELNLICDFFSNMERQGPGSPEVTLKALSFIDNLTEKSLIADIGCGTGGQ TMVLAGHVTGQVTGLDFLSGFIDIFNRNARQSGLQNRVTGIVGSMDDLPFRNEELDLIWS EGAIYNIGFERGLNEWRKYLKKGGYLAVSECSWFTDERPAEINDFWMDAYPEIDTIPNQV AKIHKAGYLPVATFILPENCWTDHYFTPKVAAQKIFLTKYAGNKIAEEFSMLQSIEEELY HKYKEYYGYTFFIAKKIRL >gi|226332055|gb|ACIB01000001.1| GENE 49 56115 - 56612 660 165 aa, chain - ## HITS:1 COG:no KEGG:BF1662 NR:ns ## KEGG: BF1662 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 165 1 165 165 286 100.0 1e-76 MKKLVVLGMGVCVALAFASCKSSESAYKKAYEKAKQQELAEPQVEAPVEVTPVVAAPVET KKAVDNAAGVRQEKVTVVSGADGLKDYSVVCGSFGLKANAEGLKDFLDKEGYNATIAFNA ETAMYRVIVNTFADRASAAQARDAFKAKYPSRKDFQGAWLLYRIY >gi|226332055|gb|ACIB01000001.1| GENE 50 56727 - 57497 923 256 aa, chain - ## HITS:1 COG:all0475 KEGG:ns NR:ns ## COG: all0475 COG4221 # Protein_GI_number: 17227971 # Func_class: R General function prediction only # Function: Short-chain alcohol dehydrogenase of unknown specificity # Organism: Nostoc sp. PCC 7120 # 1 253 4 256 257 293 54.0 3e-79 MKGKIVFITGASSGIGEGCARKFASQGSDLILNARNVAKLEELKVELEAKYGVRICLLPF DVRDRNAATMALASLPEEWKRIDVLVNNAGLVIGVDKEFEGNLDEWDIVIDTNIKALLAM TRIVVPGMVERGHGHIINIGSIAGDAAYPGGSVYCATKAAVKALSDGLRIDLVDTPLRVT NIKPGMVETNFTVVRYRGDKDAADAFYKGIRPLTGDDIAETVYYAASAPEHIQIAEVLVM PTYQATGTISYKKKDC >gi|226332055|gb|ACIB01000001.1| GENE 51 57688 - 58971 899 427 aa, chain + ## HITS:1 COG:CC0835 KEGG:ns NR:ns ## COG: CC0835 COG0513 # Protein_GI_number: 16125088 # Func_class: L Replication, recombination and repair; K Transcription; J Translation, ribosomal structure and biogenesis # Function: Superfamily II DNA and RNA helicases # Organism: Caulobacter vibrioides # 3 365 4 369 476 398 56.0 1e-111 MTFENLNLIEPILKALRQEGYTSPTPIQEQSIPILLQGKDLLGCAQTGTGKTAAFSIPIL QKLYKTDHRKGIKALVLTPTRELAIQIGESFEAYGRYTGLKHAVIFGGVGQKPQTDALRS GIQILVATPGRLLDLISQGFISLSSLDFFVLDEADRMLDMGFIHDIKRILKLLPARRQTL FFSATMPPEIETLANSMLTKPEKVEVTPASSTVDIISQQVYFVEKKEKKDLLIHLLKDTS IESVLIFTRTKYGADKLARVLTKAGIGAEAIHGNKTQNARQRALTNFKNHTLRALIATDI AARGIDVDQLSHVINYELPNVPETYVHRIGRTGRAGHEGVAISFCESEELPYLKDIQKLI GKNIPVVKDHPFVTTEGIKAQEEKQEEIKVKAKANKTYRGSRANGDFWRRKKQKTNQPSS TKQEKRK >gi|226332055|gb|ACIB01000001.1| GENE 52 59270 - 59479 388 69 aa, chain + ## HITS:1 COG:no KEGG:BF1659 NR:ns ## KEGG: BF1659 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 69 1 69 69 87 100.0 2e-16 
MKGLNVLAAFLGGAAVGAALGILFAPEKGEDTRHKIAEILRKKGIKLNRNEMENLVDEIA AEIKGEVID >gi|226332055|gb|ACIB01000001.1| GENE 53 59506 - 59871 310 121 aa, chain + ## HITS:1 COG:no KEGG:BF1658 NR:ns ## KEGG: BF1658 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 121 1 121 121 162 100.0 5e-39 MFTDDKSIENIQQLFAEFKKFLVLQKEYTKLELTEKLTILLSTLIMILVLTILGMVALFY LLFALAYILEPLVGGLMVSFGIIAGINVLLIAIIYFFRRQLIISPMVNFLANLFLNDSNK K >gi|226332055|gb|ACIB01000001.1| GENE 54 59876 - 60139 248 87 aa, chain + ## HITS:1 COG:no KEGG:BF1657 NR:ns ## KEGG: BF1657 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 87 1 87 87 157 98.0 1e-37 MNNSTPTPKKITLEDIAQRKKEVLQEICDQKQAMADTTRRIFAPLAPAASGGNALMRSFS TGMAIFDGVMLGMKMIRKVRGLFRKRY >gi|226332055|gb|ACIB01000001.1| GENE 55 60205 - 61176 1009 323 aa, chain - ## HITS:1 COG:CAC2918 KEGG:ns NR:ns ## COG: CAC2918 COG1482 # Protein_GI_number: 15896171 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphomannose isomerase # Organism: Clostridium acetobutylicum # 1 323 1 310 326 203 38.0 3e-52 MYPLKFEPILKQTLWGGDKIIPFKHLNDDLKGVGESWEISGVENNESVVANGPDKGLTLT DMVKKYREELVGEANYARFGNEFPLLIKFIDAKQDLSIQVHPTDELAKKRHNSKGKTEMW YVVGADEGAKLRSGFSEQITPKEYKDRVHNNTITDVLQEYEIHPGDVFFLPAGRIHSIGA GAFIAEIQQTSDITYRIYDFNRKDANGKTRELHTSQALDAINYEVLDDYRTKYEPLKDEP VELVACPYFTTSVYDMSEQISCDYSELDSFVIFICIEGSCLMTDNEGNEVRLGAGETVLL PATTQELTIVPQEGNVKLLETYV >gi|226332055|gb|ACIB01000001.1| GENE 56 61440 - 62537 360 365 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15900011|ref|NP_344615.1| aldose 1-epimerase [Streptococcus pneumoniae TIGR4] # 38 364 27 345 345 143 30 8e-33 MINTFPTEGNLSGLSRKDFQKDINDKKTDLFILKNKKGMEVAVTNYGCAILSIMVPDKDG KYANVVLSYGTLDALMHGPEPFLSTTIGRYGNRIAKGKFTLYGEEHSLTINNGPNSLHGG PTGFHARVWDAEQLEEGVIRFNYTSADGEEGFPGNLEVEMTYRLEEEENAIVIEYRATTD KATVVNLTNHGFFNLAGTANPTPTVENNIVTINADFYTPIDEVSIPTGEIAKVEGTPMDF RTPHTVGERINDKFQQLIYGAGYDHCYVLNKAETGSLDLAATCKEPNSGRTMEVYTTEAG 
VQLYTGNWLGGFEGTNGATFPARSAICFEAQCFPDTPNKAHFPSATLLPGDEYQQVTIYK FGVEK >gi|226332055|gb|ACIB01000001.1| GENE 57 62588 - 63913 1722 441 aa, chain + ## HITS:1 COG:BMEII1053 KEGG:ns NR:ns ## COG: BMEII1053 COG0738 # Protein_GI_number: 17989398 # Func_class: G Carbohydrate transport and metabolism # Function: Fucose permease # Organism: Brucella melitensis # 2 433 15 412 412 138 29.0 3e-32 MTQQKKNYVLPIAMMFALFAMISFVTGLTNPLGLIVKEQFQAANWMTQLGNAANFIAYAF MGLPAGMMLKRIGYKKTALTAVAVGFIGVGIQVLSGQMDYQPGELTVFWIYLTGAFVSGF SMCMLNAVVNPLLNTLAGGGKKGNQLIQFGGSLNSISATIVPVLGGYLIGTISQDTRISD ANPALFIAMGIFAVVFIVLAIMDIPEPHKESASDHKVKDTHSPLSFRHFVLGTVAIFVYV GVEVGIPNFINLFLTTSPDAAGAKGFGMDTAMAGSIVGTYWFLMMIGRLCGGALGAKFSS KTQLTVVSSLALIFLLIGMFAPSATTVAMPVFKGGASIGFGMETVPVGIMFFALCGLCTS IMWGGIFNLAVEGLGKYTAMASGIFMVMVCGGGILPLIQGAVADVTSSYIASYWVIFAAV AYMLYYALVGCKNVNKDIPVE >gi|226332055|gb|ACIB01000001.1| GENE 58 63949 - 65103 1344 384 aa, chain + ## HITS:1 COG:CAC2959 KEGG:ns NR:ns ## COG: CAC2959 COG0153 # Protein_GI_number: 15896212 # Func_class: G Carbohydrate transport and metabolism # Function: Galactokinase # Organism: Clostridium acetobutylicum # 4 383 9 388 389 251 39.0 2e-66 MDIEHVRSRFIKHFDGTTGFVYASPGRINLIGEHTDYNGGFVFPGAIDKGMIAEIKPNGT DKVNAYSIDLKDYVTFGLNEEDAPRASWARYIFGVCREMIKRGVDVKGFNTAFSGDVPLG AGMSSSAALESTYAFALNDLFGENKIDKFELAKIGQATEHNYCGVNCGIMDQFASVFGKE GSLIRLDCRSLEYQYFPFKPEGYRLVLLDSVVKHELASSAYNKRRQSCEAAVAAIQKKHP HVEFLRDCTMDMLAEAKADISEEDYMRAEYVIEEIQRVLDVCDALERGDYETVGQKMYET HHGMSKLYEVSCEELDFLNDCAKECGVTGSRVMGGGFGGCTINLVKDELYDNFIEKAKES FKAKFGRSPKVYDVVISDGSRRLV >gi|226332055|gb|ACIB01000001.1| GENE 59 65258 - 67267 2030 669 aa, chain + ## HITS:1 COG:BH2352 KEGG:ns NR:ns ## COG: BH2352 COG0021 # Protein_GI_number: 15614915 # Func_class: G Carbohydrate transport and metabolism # Function: Transketolase # Organism: Bacillus halodurans # 10 669 9 663 666 464 42.0 1e-130 MNDSKLMNRAADNIRILAASMVEKANSGHPGGAMGGADFVNVLFSEFLVYDPQNPRWEGR DRFFLDPGHMSPMLYSVLAFTGKYTLDELKQFRQWGSPTPGHPEVNVDRGVENTSGPLGQ 
GHTYAVGAAIAAKFLKARFGEVMNQTIYAYISDGGIQEEISQGAGRIAGTLGLDNLIMFY DSNDVQLSTNTEDVTTENVAMKYEAWDWKVITINGNDPDEIRKALTEAKAEKNRPTLIIG KTTMGKGARRADGSSYEADCATHGAPLGGDAYVNTIKNLGGNPENPFTIFPEVAELYAKR AEELKKIVADKYAAKAEWAKANPEKAAKLAEFFSGKAPKVNWDAIEQKAGGATRAGSATV LGALATQVENMIVSSADLSNSDKTDGFLKKTHAFKKGDFSGAFLQAGVSELSMACICIGM SLHGGIIAACGTFFVFSDYMKPALRMAALMEQPVKFIWTHDAFRVGEDGPTHEPVEQEAQ VRLLEKLKNHKGHNSMLVLRPADVEETTIAWKLAMENMSTPTALILSRQNIVNLPAGTDY SQAAKGAYIIADADENPDVILVASGSEVSTLVAGAELLRKEGVKVRIVSAPSEGLFRNQS KEYQESILPAGARIFGLTAGLPVNLEGLVGSNGKVFGLESFGFSAPYKVLDEKLGFTAEN VYNQVKAML >gi|226332055|gb|ACIB01000001.1| GENE 60 67350 - 67784 478 144 aa, chain + ## HITS:1 COG:TM1080 KEGG:ns NR:ns ## COG: TM1080 COG0698 # Protein_GI_number: 15643838 # Func_class: G Carbohydrate transport and metabolism # Function: Ribose 5-phosphate isomerase RpiB # Organism: Thermotoga maritima # 3 141 2 140 143 150 51.0 7e-37 MKKIGICCDHAGFELKEYVRGWLEAKGWEYKDFGTYSTDSCDYPDFAHPLALAVEAGECY PGIAICGSGNGISMTLNKHQGIRAALCWTAEIAHMARLHNDANVLVMPGRYISTEEADMI MTEFFSTEFEGGRHQKRIDKIPVK >gi|226332055|gb|ACIB01000001.1| GENE 61 68094 - 68369 302 91 aa, chain + ## HITS:1 COG:no KEGG:BF1650 NR:ns ## KEGG: BF1650 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 91 1 91 91 157 100.0 1e-37 MDQLKTIKELINQGDIENALQALEEFLQTEPVGKDEAYYLMGNAYRKLGDWQKALNNYQS AIELNPDSPALQARKMVMDILNFYNKDMYNQ >gi|226332055|gb|ACIB01000001.1| GENE 62 68388 - 68615 344 75 aa, chain + ## HITS:1 COG:no KEGG:BF1649 NR:ns ## KEGG: BF1649 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: Citrate cycle (TCA cycle) [PATH:bfr00020]; Metabolic pathways [PATH:bfr01100] # 1 75 1 75 75 114 100.0 1e-24 MAKIKGAIVVDTERCKGCNLCVVACPLNVISLAKEVNVKGYNYAQQILEDTCNGCSSCAT VCPDGCISVYKVKVE >gi|226332055|gb|ACIB01000001.1| GENE 63 68627 - 69709 1241 360 aa, chain + ## HITS:1 COG:TM1759 KEGG:ns NR:ns ## COG: TM1759 COG0674 # Protein_GI_number: 15644505 # Func_class: C Energy production and conversion # Function: 
Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Thermotoga maritima # 6 356 7 351 356 384 55.0 1e-106 MAEEVVLMKGNEAIAHAAIRCGADGYFGYPITPQSEVLETLAELRPWETTGMVVLQAESE VAAINMVYGGAGSGKMVMTSSSSPGVSLKQEGISYLAGAELPCLIVNVMRGGPGLGTIQP SQADYFQTVKGGGHGDYKLIALAPASVQEMADFVGLAFELAFKYRNPAIILADGVIGQMM EKVVLPPAKARRTDAEVIAQCPWASTGKTKDRKPNIITSLELRPEEMEKNNLRFQAKYRV IEENEVRFEEIDCEDAEYLIVAFGSMARIGQKAMELAREEGIKVGMLRPITLWPFPTKAI AEYANKVKGMLVTELNAGQMVEDVRLAVNGRVKVEHFGRLGGIVPDPDEIVTALKEQLIK >gi|226332055|gb|ACIB01000001.1| GENE 64 69720 - 69893 202 57 aa, chain + ## HITS:1 COG:no KEGG:BF1647 NR:ns ## KEGG: BF1647 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 57 1 57 57 82 100.0 6e-15 MNPDKIRNVLNILFMILALAAIITYFVAKDDFKVFIYVCGAAIFVKLMEFFIRFTNR >gi|226332055|gb|ACIB01000001.1| GENE 65 69990 - 70751 861 253 aa, chain + ## HITS:1 COG:MA2909_1 KEGG:ns NR:ns ## COG: MA2909_1 COG1013 # Protein_GI_number: 20091730 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit # Organism: Methanosarcina acetivorans str.C2A # 5 252 6 262 296 253 47.0 2e-67 MTKEEIIKPENLVYKKPTLMNDNGMHYCPGCSHGVVHKLIAEVIEEMELEDKAVGISPVG CAVFIYNYLDIDWQEAAHGRAPALATAIKRLWPDRLVFTYQGDGDLACIGTAETIHALNR GENITIIFINNAIYGMTGGQMAPTTLMGMKTATCPYGRDPELHGYPLKITEIAAQLEGTA YVTRQSVQSVPAIRKAKKAIRKAFENSMNGKGSNLVEIVSTCSSGWKMTPEKANKWMEEH MFPFYPLGDLKDK >gi|226332055|gb|ACIB01000001.1| GENE 66 70770 - 71312 712 180 aa, chain + ## HITS:1 COG:MA2909_2 KEGG:ns NR:ns ## COG: MA2909_2 COG1014 # Protein_GI_number: 20091730 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, gamma subunit # Organism: Methanosarcina acetivorans str.C2A # 7 179 12 183 186 115 39.0 5e-26 MKEEIIIAGFGGQGVLSMGKILAYSGLMEGKEVTWMPAYGPEQRGGTANVTVIVSDDKIS 
SPILSKYDTAIILNQPSLEKFESRVKPGGILIYDGYGIINPPTRKDIKVYRIDAMDAANE MNNAKAFNMIVLGGLLKLRPIVTLENVVKGLKKTLPERHHHLIPMNEEAIKKGMELIREA >gi|226332055|gb|ACIB01000001.1| GENE 67 71435 - 73477 1817 680 aa, chain + ## HITS:1 COG:no KEGG:BF1652 NR:ns ## KEGG: BF1652 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 680 5 684 684 1339 100.0 0 MRRILLTYILLVVGLLTAQAQFNPTQQVDPRTGRDANGNQIDPAMRVQEDSTDVEIQGLP PTLYMWHVSENLGTIQRIPADTATHMFQNTNLVEGLTGHYNYLGNLGSPRLSRLFFERRD AEPTIFMEPFSSFFIRPDEFNFTNSNVPFTNLTYYKAGNKINGEERFKSYFSVNVNKRLA FGFNFDYLYGRGYYNNQNTSYFNAALFGSYIGDRYEATLLYSNNYLKMNENGGITDDRYI TRPEEMAEGKKEYESQNIPTLLSKSANRNKDFYIFLTQRYKLGFTREVEKAKDDTTNTQK TEFVPVTSFIHTMKVERSRHQFTSEDELYKIYPEAYIQPGNKLVNDSTSYIGVKNTLGIA LLEGFNKYAKAGLTAFISHKLSNYRLMDRDSVSVDKYSEHEVFVGGELAKRQGKTLHYRA MGEVGILDKAIGQFRVNADLDLNFRLWKDTVSFIARGSISNTLPAFYMRHYHSKYFYWDN DNMEKEFRTRLEGELNIEHWQTNLKAGVENIKNYTYFNQKALPQQNGGNIQVLSATLSQN FRLGILHLDNEVTWQKSSNNTVLPLPELSLYHNLYIQTTLAKKVLHVQLGADVRYFTKYY APAYTPAIQQFHLQPEDDQVKIGGYPIINVYANLQLKRTRLFAMMYHVNQGMGNSNYFLS PHYPINPRLFKIGVSWNFYD >gi|226332055|gb|ACIB01000001.1| GENE 68 73486 - 74310 755 274 aa, chain + ## HITS:1 COG:AF0231 KEGG:ns NR:ns ## COG: AF0231 COG0834 # Protein_GI_number: 11497847 # Func_class: E Amino acid transport and metabolism; T Signal transduction mechanisms # Function: ABC-type amino acid transport/signal transduction systems, periplasmic component/domain # Organism: Archaeoglobus fulgidus # 66 273 62 264 264 80 27.0 4e-15 MNLLRPKYLKYVVLGLISALVVTCWPRKEKPKGHPRDYAEIKESGILHAATEYNSISFYV DGDTVSGFHYELIEAFARDKGLQVQVSPVMSFNQRLEGLANGTYDVVAYGIPATSELKDS LLLTSPIILSKQVLVQRKVGENDSLAIRSQLDLAGKTLNVVKGSPSILRIRNLSNEIGDT IYVNEIEKYGSEQLIAMVAHGDIDYAVCDEGIARMAVDSLPQLDINTAISFTQFYSWGVS KQSPALLDSLNTWLSDFRKKGEYQSVYRKYYGKQ >gi|226332055|gb|ACIB01000001.1| GENE 69 74526 - 75512 890 328 aa, chain + ## HITS:1 COG:BH3843 KEGG:ns NR:ns ## COG: BH3843 COG0673 # Protein_GI_number: 15616405 # Func_class: R General function prediction only # Function: 
Predicted dehydrogenases and related proteins # Organism: Bacillus halodurans # 6 325 5 323 334 112 28.0 7e-25 MSNKIIKWGFIGCGEVTKYKSGPAFQKVEGSKVVAVMSRDGKKAKAYAKERNIPKWYDDA QELIDDPEVNAVYIATPPSSHATYAIMSMKAGKPVYIEKPMAQTYEECARINRISQETGV PCFVAYYRRYLPYFMKVKELVDKGTIGNVINVQIRFAQPPRDLDYNRENLPWRVQADIAG GGYFYDLAPHQIDLLQEMFGCILEASGYKSNRGGLYPAEDTLSACFQFDNGLVGSGSWCF VAHDSAREDRIEIIGDKGMICFSVFTYDPIALHTERGREEIVVENPEHVQQPLIQAVVDH LLGKSTCSCDGESATTTNWVMDKILGKI >gi|226332055|gb|ACIB01000001.1| GENE 70 75666 - 76187 332 173 aa, chain + ## HITS:1 COG:mll1867 KEGG:ns NR:ns ## COG: mll1867 COG1595 # Protein_GI_number: 13471781 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 1 164 16 177 298 60 25.0 1e-09 MELKQFKIAVLPLRDKLLSYARKLTDDHSDAEDAVQEVMLKLWNLRPKLDEYHSIEALAM TMTHHTCMDILRGKHPDNLSLDSVQAASPVATPERLLEEKDEFSLMRHIISTLPPLQQTI LRMKDVEEYETEEIAEITGCSSEAIRSNLSRARKKVRDIYLQTIQQRKRRNEA >gi|226332055|gb|ACIB01000001.1| GENE 71 76184 - 76609 403 141 aa, chain + ## HITS:1 COG:no KEGG:BF1638 NR:ns ## KEGG: BF1638 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 141 35 175 175 268 99.0 3e-71 MNIEELLNKYFEGETTCEEERELRRFFTRGIIPEHLQMYRPMFAFLNEENRQSKTAVSEV PKTSVPLRRRLLYIFSGMAAGILLILGIAGLNRHFNTSTANYVIIDGKCYTDAKLVREQA MIAFRDVSISEEEVFATLFSE >gi|226332055|gb|ACIB01000001.1| GENE 72 76638 - 77087 511 149 aa, chain + ## HITS:1 COG:no KEGG:BF1637 NR:ns ## KEGG: BF1637 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 149 1 149 149 240 100.0 1e-62 MNKVIRTTWIILLLTVASGMARAQQGLQIASVFQKYGKQKGVTMVELSNEMLETYQMTLY KSLVFKDVEEALPTILNCLDADKKKAKKVKEVVAGGQIQSGYYQLPQLKEDVNRFILFKT GKKGSATLIYIEGELDADDLVTMLFMKKN >gi|226332055|gb|ACIB01000001.1| GENE 73 77101 - 77994 962 297 aa, chain + ## HITS:1 COG:no KEGG:BF1636 NR:ns ## KEGG: BF1636 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 297 1 
297 297 607 99.0 1e-172 MKHIYLLLFMLIPMAGMAQEGISQDTTLYVNGRKILIKENEGKIKVKLYEQSSHGDTIEN DQIFEGIYTDGQTTERRTAFTVPFVKRKNHYRFDPHIAGFYMGYTRLSDGINFNTPDGLN INANKSWEIGFNLFQGSLTLSRDRQWGITTGLGWGYRSFRLGNNYAFRQIDGVTGIVPGV PDEEVYTKSRLRYFYFRIPVALEWQKRFSHSNAHGPLFFSAGLEAEIRHGAKSKAKVNGH KKNLDSGLNVHPVGINLLAQAGYGDIGVYLRYSTYSLFEHKKGPELYPYSFGLCWYW >gi|226332055|gb|ACIB01000001.1| GENE 74 78162 - 78629 528 155 aa, chain + ## HITS:1 COG:MA3407 KEGG:ns NR:ns ## COG: MA3407 COG0590 # Protein_GI_number: 20092219 # Func_class: F Nucleotide transport and metabolism; J Translation, ribosomal structure and biogenesis # Function: Cytosine/adenosine deaminases # Organism: Methanosarcina acetivorans str.C2A # 6 155 13 162 162 186 59.0 1e-47 MTKEELMRKAIELSRENVANGGGPFGAVIAKDGEIVATGVNRVTASCDPTAHAEVSAIRA AASKLGTFNLSGYEIYTSCEPCPMCLGAIYWARLDKMYYGNNKTDAKNIGFDDSFIYDEL ELKPENRKLPSEVLLHDEAIKAFEEWMEKEDKIEY >gi|226332055|gb|ACIB01000001.1| GENE 75 78717 - 80387 1504 556 aa, chain - ## HITS:1 COG:no KEGG:BF1644 NR:ns ## KEGG: BF1644 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 556 1 556 556 1022 100.0 0 MKKIITLATLGLLVAAAPISAQTVYDAAKITGKDLNGTARFVGMGGAMGALGGDISTIGT NPAGIGIYRSNDFMTSFGFSSYGTESKYLGDKFKSDKIQGSFDNLGFVFASKIGNETSLR YVNFGFNYHKAKSFYKNMNMKGGLGNSSQTYQMAQQASGITKWGDYPYDDPEIGWLSILG YDGWLISDITTDKMNAAGKPNSPYVDKDGNQRYDANGKPLYTTPGNYYGMYDDGVANFHS QERGGIDQYDFNVAFNFNDRFYLGLTLGAYAVDYSKYTYYSEAYTGANAPQNYNLRSWNK VRGSGFDLKLGTIIRPFENSPFRIGLAIHTPTFYNLDYKTSARVESDVLNVETGKIDQWS VDTRDKLPGNGDMVREFRLQTPWTYNVSLGYTIGTSLALGAEYEYQDYSTMKFRGPTGSS SEFTFENSTRPMMKGVNTLRLGLEYKVIPQFALRAGYNYTSAIFHGDAFKDLPYYSIQTD TDWANTKALSNYTLGIGYRGSVFYADLAYKFSTYNEDFYPFVNKYEENNATTVLTPETTK VTNTRSQVLFTLGLRF >gi|226332055|gb|ACIB01000001.1| GENE 76 80422 - 81756 1071 444 aa, chain - ## HITS:1 COG:no KEGG:BF1643 NR:ns ## KEGG: BF1643 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 396 1 396 444 473 100.0 1e-132 
MKKIVLLSLFALCLPLLVMAQSNNDDLYFVPSKEKKQEAKKTPVKKEPEKKVVTTNIYTS PGTTVVVQDRKGNKRDMRDVDEYNRRYDAKDNEFAMEDDTLYVKEKAVSDPDGEWVNGFN GSQDDYEYAERIIRFRNPRFAVSISSPLYWDIVYGTNSWDWNVYTDGFYAYAFPTFTNRL WWDWRYNSYGSGWGWGWGWSSPYYAWGGYYPGYWGGYWGGYWGGWYGGGYWGHHHHYHPG WGGGGSWAGRYNTYTRRGSSAVRSSYGNSSTVRRYSSGAVRSNSGSSVRSSATSSYNRGE SSARRVIGTRVVGERPGSTTRTDASSSRRSTYTRPSSTRSSSSYEGTRGGSTTTGSPSYS EGGRRASTRSSSTYTRGSSTAPSRSYNESTTRRSYNSTPSRSSSSSTRSYSSPSGSSSRS YSTGGGGGSSRSSGGGGGSSRGRR >gi|226332055|gb|ACIB01000001.1| GENE 77 81964 - 82806 1480 280 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53712914|ref|YP_098906.1| ribosomal protein L11 methyltransferase [Bacteroides fragilis YCH46] # 1 280 1 280 280 574 100 1e-162 MKYFEFTFDTHPCTETVNDVLAAVLGEAGFESFVEREGGLTAYIQQSLYNEETLKTELAN FPVPDTEISYTFAEAEDKDWNEEWEKNFFQPIVIGDRCVIHSTFHQDVPKAEYDILINPQ MAFGTGHHETTSLIIGELLDSELTGKSLLDMGCGTSILAILARMRGAKPCTAIDIDEWCV RNSIENIELNGVTDIAVSQGDASALQGKGPFDVVIANINRNILLNDMKQYVACMHPGSEL FMSGFYIDDIPAIRREAEKHGLTFVHHQEKNRWAAVKFVL >gi|226332055|gb|ACIB01000001.1| GENE 78 82883 - 83389 684 168 aa, chain - ## HITS:1 COG:alr2405 KEGG:ns NR:ns ## COG: alr2405 COG0716 # Protein_GI_number: 17229897 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Nostoc sp. 
PCC 7120 # 2 166 3 168 170 132 43.0 3e-31 MKKIGIFYAAKADKTSWVAEKIQKEFGTSAESVAIENAWQNDFEAYDNFIVGASTWFDGE LPTYWDELLPELRTLQLKGKKVAIFGLGDQVKYPENFADGVGLLAEVFENDGATLVGFTS IDGYSFERSRALKGDKWCGLVIDIENQSEMTDKRIKDWCRQVKKEFEV >gi|226332055|gb|ACIB01000001.1| GENE 79 83408 - 85444 2081 678 aa, chain - ## HITS:1 COG:CT340_2 KEGG:ns NR:ns ## COG: CT340_2 COG0022 # Protein_GI_number: 15605063 # Func_class: C Energy production and conversion # Function: Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit # Organism: Chlamydia trachomatis # 352 678 3 328 328 263 44.0 1e-69 MKKYDIKTTDEQTLRKWYHLMTLGRALDEKAPAYLLQSLGWSYHAPYAGHDGIQLAIGQV FTLGEDYLFPYYRDMLTVLSAGMTPEELILNGISKATDPGSAGRHMSNHFAKPEWHIENI SSATGTHDLHAAGVGRAMVYYGHKGVAITSHGESATSEGFVYEAINGASNERLPVIFVFQ DNGYGISVPKKDQTANRKVADNFSGFKNLRIIHCNGKDVFDSMNAMTEAREFAIANRTPV IVHANCVRIGSHSNSDKHTLYRDENELAYVKEADPLMKFRRMLLRYKRLTEEDLQQIEAA AKKELAAANRKALAAPDPIPESIYDFVLPEPYIPQKYKDGLPGPVEGEKSFMVNAINETL KEEFRRNPDTFIWGQDVANKDKGGVFNVTKGMQQEFGDARVFSAPIAEDYIVGTANGMCR FDPKIHVVIEGAEFADYFWPAVEQYVECTHEYWRSNGKFTPNITLRLASGGYIGGGLYHS QNLEGALTTLPGARIVCPSFADDAAGLLRTSMRSKGFTLYLEPKALYNSVEAAAVVPEEF EVPFGKARIRREGTDLTIITYGNTTHFCLDVAERLAREGVGSVEVIDLRSLIPLDKEAIF ASVRKTGKVMVVHEDKVFSGFGAEIAAQIAGEMFRYLDAPVQRVGSTFTPVGFNPILERA ILPNDEKIYKAAKELLEF >gi|226332055|gb|ACIB01000001.1| GENE 80 85588 - 86955 1387 455 aa, chain - ## HITS:1 COG:BH2761 KEGG:ns NR:ns ## COG: BH2761 COG0508 # Protein_GI_number: 15615324 # Func_class: C Energy production and conversion # Function: Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes # Organism: Bacillus halodurans # 5 452 4 417 426 277 41.0 3e-74 MARFEIKMPKLGESITEGTILSWSVQVGDVVNEDDVLFEVNTAKVSAEIPSPVAGKVVEI LFKEGDTVPVGTVVAIVDMDGEGSGEASETAGSVETASAPKAAEVSGIASVPKVQAEVTA PKVERWYSPAVLQLAREAKISQEELDSIPGTGYEGRLSKKDIRTYIEMKKGAPAADVSTT VVSTVSANNSGFSPVPSAEVQKKAAAAAPQAQYGQSASAVSSDASVEVKEMDRVRRIIAD 
HMVMSKKVSPHVTNVVEVDVTRLVRWREKTKDAFFRREGVKLTYMPAIAEATAQALAAYP QVNVSVDGYNILYKKHINVGIAVSQDDGNLIVPVVHDADRLNLNGLAVAIDSLAKKARVN KLMPDDIDGGTFTITNFGTFKMLFGTPIINQPQVAILGVGVIEKKPAVVETPEGDVIAIR HKMYLSLSYDHRVVDGSLGGNFLHFIADYLENWKE >gi|226332055|gb|ACIB01000001.1| GENE 81 87464 - 88165 496 233 aa, chain - ## HITS:1 COG:SP1160 KEGG:ns NR:ns ## COG: SP1160 COG0095 # Protein_GI_number: 15901025 # Func_class: H Coenzyme transport and metabolism # Function: Lipoate-protein ligase A # Organism: Streptococcus pneumoniae TIGR4 # 1 230 1 235 329 143 35.0 3e-34 MRFINSSFTDAGFNLAAEEYLLKQGTEDVFMLWQSAPSVIIGKHQRVETEVNRTMAEQNK IPVFRRFSGGGAVYHDLGNINLTFIETTCLARFETYLERTVEMLTAAGVAVRGDERLGIY VDGRKVSGSAQCVHRNRAMYHCTLLYDTNLALLNKLLEVEGLEEKVAVHPAVRSVRSEVT NLKEYMHPALSTDKFREWVFRYFAGPSVAEAFSKEELAIIEGLRENKYNTICN >gi|226332055|gb|ACIB01000001.1| GENE 82 88165 - 89514 671 449 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1 [Flavobacteriales bacterium ALC-1] # 2 449 3 450 458 263 35 7e-69 MRYDIAIIGGGPAGYTAAERAGANGLRAVLFEKKAMGGVCLNEGCIPTKALLYSAKVLDG IKSAPKYGVSVEGAPAFDMEKIIGRKNKTVQKLTGGVRMTVNSYGVTIVDKEAVIEGEGE EGFHIRCDGEVYEATYLLVCTGSDTVIPPIKGLSDVDYWTSREALDSTVLPSSLAIIGGG VIGMEFASFFNSMGVRVKVIEMMPEILGAMDKETSAMLRADYTKKGVNFYLNTKVTEVSD KGVTVEKDGKSSFIDADRILVSVGRKANITQVGLDKLNIELHRNGVVVDEHMLTSHPRVY ACGDITGFSLLAHTAIREAEVAINHILGIDDRMDYDCVPGVVYTNPELAGVGKTEEELIA KGIYYRIQKLPMVYSGRFVAENELGNGLCKLIIDHNDRIVGCHMLGNPASEIIVVAGIAI QRGYTVDEFRKSVFPHPTVGEIYHETLFA >gi|226332055|gb|ACIB01000001.1| GENE 83 90079 - 90285 214 68 aa, chain + ## HITS:1 COG:no KEGG:BF1617 NR:ns ## KEGG: BF1617 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 68 1 68 68 142 100.0 4e-33 MKKLLCPQCKIAGMYVKNDQGERLLVYIAQDGTVVPKYPEIPIEGFDFTEVYCLGCSWHG SPKQLTRF >gi|226332055|gb|ACIB01000001.1| GENE 84 90466 - 92055 1528 529 aa, chain - ## HITS:1 COG:no KEGG:BF1616 NR:ns ## KEGG: BF1616 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 529 2 530 530 1065 100.0 0 MKTKMIYMILLFSSILFGGCVRDVIDDVPGTDEDYAADKKDVVLIRMGTNDAEVKDITSR LFNNIPSSWEIGKKTIVDYDKMKDPETELNADIKNPKIYMTVILNIDDVISGKYPISLFR MLKFYKRDFYVIATQSTPEQKEEMLSLIGVYMEAGYYAINYDNVQHYRIFPSADPYAKDN MIGKGVASFNQKLLTRSADTLPEGNDPDPGYNAQMADKEALKRIEIYNRIYGYAQGDMNY SMQAVPYKMKPGAAPDVIIDNSWAIDAYNFQIYSKNNNCILSITNNAGNGFTANIKDYTT PKGANVYAYIWNLMREASSEITVHSDGGIFREISYQPQTVNHGASYTENSFWKVSVSVSP TKLEDKPWEAFKVSFGKESSTEVSYKTKSMDYSCKGQFGNGLYSKLWQFIPGEFYDRAEA FVYETSQGLRWIDAAQAMTPNYVTWGGWTLRQNYLNHINNSMKLHQQQCIYSLESDAQPG VVSVAITDAIDLQKTYVHYNCGIRYGHDSARTDLKMSKMVWIDFRQWDN >gi|226332055|gb|ACIB01000001.1| GENE 85 92081 - 93736 1374 551 aa, chain - ## HITS:1 COG:no KEGG:BF1615 NR:ns ## KEGG: BF1615 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 551 3 553 553 1114 99.0 0 MKTLFIFLFVLLAFYSCDTDEPISAEDLSAATGTNTDEVDVASRSGSYLIWLNGMSAETK KVADQLQTVNHYDKMVAFNDLPGTDKVETLETMAKDSLNDLTIVGSVADFIAGTYSPEMI QDIRHNRQGVILIAKDGKGEDLKAQMLKLFGLYVDKGYYKVSFPREQRYAYYDANDPDAL KKLCGTGGLNPHTRSMLGSPGPTDDPSAPEVTERALHAMKKIYISNRYYVYTSDYKYHMT GYETLLPYTADIKEYLPRDDKKYDGMVDREWTIDAYNLRIYAPTNGDNMLAVYSSGGNGF SNRLDHSLVSLNNAPMYELWALIWGLRNNAYTKINIRDDQSSQLNLKLIDFAPGLPETES TIEHAYEKSVGFDLGATPVLKGEYRWGKSITYQLAEMTREVTRTITDAEMSYCWKWYPET LFRGSKAMNAEGMIDAVTMISPAWYDILYDHVAGTPGTDNPFCDYDNDLQFNQNLLNYEQ QCAITARTTGASAGVVAIEITDGMVLQRGGAWFTSWGAAVIHSLPSSAGTAYNTTVDVHK TTTVWIDYNNW >gi|226332055|gb|ACIB01000001.1| GENE 86 93745 - 93930 79 61 aa, chain + ## HITS:1 COG:no KEGG:BF1627 NR:ns ## KEGG: BF1627 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 61 1 61 61 111 100.0 6e-24 MFKSETICHIYRLKCDGFSYFVYKKRTYSKKNRFNPISFELKIQSIKWKQSMPGTPINTA M >gi|226332055|gb|ACIB01000001.1| GENE 87 94189 - 95451 1256 420 aa, chain + ## HITS:1 COG:SMa1677 KEGG:ns NR:ns ## COG: SMa1677 COG1228 # Protein_GI_number: 16263373 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Imidazolonepropionase and 
related amidohydrolases # Organism: Sinorhizobium meliloti # 3 418 63 476 480 379 49.0 1e-105 MDNSILINNVQIFDGSSEQTGRGNVLVVNSRIKTVSESPIPTDGIPNVTILDGKGKFLMP GLIDAHWHAYMSCNTMMDLLTADTAYTQFKAGQEARATLLRGFTTIRDAGGPVFGLKRAI DEGILSGPRIYPSGSLISQTGGHGDFRAVYDEPRPFDCCGLTHTEKMGAAIIADGIDAVT VAARNNLRLGASQIKLMTGGGVASLYDRLEDTQFFEEEIHAAVKAAEDAGTYVMVHVYVP RAIQRAIHAGVKSIEHGHLIDEPTMQLIAEKEIWLSMQPFTLGDNQFPTKEQQEKHALVV QGTDQTYQLAKKYNVKLAWGTDLLFNPANTKNQNQGILKLRQWFSNFEILKMVTHDNAEL LALSGARNPYPGKLGVIEEDAWADLILVDGDVLKDITLLGDPEKNFIMIMKGGEIYKNRV >gi|226332055|gb|ACIB01000001.1| GENE 88 95557 - 96081 684 174 aa, chain - ## HITS:1 COG:YBR218c KEGG:ns NR:ns ## COG: YBR218c COG1038 # Protein_GI_number: 6319695 # Func_class: C Energy production and conversion # Function: Pyruvate carboxylase # Organism: Saccharomyces cerevisiae # 79 174 1074 1170 1180 66 39.0 2e-11 MTTILATYYAKIQDVPDSEYKVEILEDGPVKKVAVNGKVYDVDYNVGGDSIYSIIINHHS HGVQISPTSHSSYTIMNKGELYQIELKGELEKIHNARSGADAVGRQVVVAPMPGVILKTY VRKGDEVKKGDPLCVLVAMKMENEIRSVADGVVKEIFVEENTKVGLNERIMVVE >gi|226332055|gb|ACIB01000001.1| GENE 89 96095 - 97606 1587 503 aa, chain - ## HITS:1 COG:MA0675 KEGG:ns NR:ns ## COG: MA0675 COG0439 # Protein_GI_number: 20089560 # Func_class: I Lipid transport and metabolism # Function: Biotin carboxylase # Organism: Methanosarcina acetivorans str.C2A # 1 479 1 479 493 497 51.0 1e-140 MIKKILVANRGEIAMRIFRTCRVMNIATVAIYTRVDRGALHVRYAEEAYCISDSPEDTSY LKPEKILQIAKKTGAAIHPGYGFLSENADFARRCEEEGVIFIGPSADIIARMGIKTEARR IMREAGLPIVPGTEDPVKGIAEAKKVAAEVGYPIMLKALAGGGGKGMRLVRSEEEMETAL RLSQSEAGTSFGNDAVYIEKYIENPHHIEVQILGDKYGNVIHLYERECSIQRRNQKVIEE SPSPFVKPETRAKMLKVAVEACKRINYYSAGTLEFMMDKDQNFYFLEMNTRLQVEHPVTE ECTGVDLVRDMILVAAGNRLPYRQEDVEFRGAAIECRIYAEDPENNFMPSPGVITVREAP EGRNVRLDSAAYAGFEVSLHYDPMIAKLCCWGRNRDSAISNMARALREYKILGIKTTIPF HQRVLKNAAFLKGEYDTTFIDTRFDKEDLKRRQNTDPTVAVIAAALKHYEEEKEAASRAT TLPVVGDSLWKYYGKLQMTANNY >gi|226332055|gb|ACIB01000001.1| GENE 90 97625 - 99169 1603 514 aa, chain - ## HITS:1 COG:BMEI0801 KEGG:ns NR:ns ## COG: BMEI0801 COG4799 # 
Protein_GI_number: 17987084 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) # Organism: Brucella melitensis # 1 514 1 510 510 676 64.0 0 MKELINRLEELNRQAEKGGGDARIEKQHSVGKLTARERIDLLLEKGSFIELDKLVTHRCT DFGMEKQKISGDGVVTGYGMIGKRLVYVFAQDFTVFGGALSETYARKICKVMDMAMQMGA PIIGLNDSGGARIQEGVRSLAGYAEIFLRNSLASGVIPQISAIMGPCAGGAVYSPALTDF ILMVKNSGYMFITGPDVVKSVTQEEVSKEDLGGADVHTMKSGVAHLSAENDIECINYIRE LISYLPGNNMEEPPFVATTDSPTRLTPELSDLVPTNPNQPYDMKEMIRAVADDNSFFELQ EEFARNIVIGYIRLNGKTIGVVANQPLALAGTLDINASVKAARFVRFCDAFNIPLLTLVD VPGFLPGVDQEYGGIIRNGAKLLYAYCEATVPKVTVIARKAYGGAYDVMSSKHIRGDVNL AYPTAEIAVMGPDGAVNILFRKEIDKAADTEARRKELQDDYRSKFANPYRAAELGYVDEV IDPAITRMRLIRSFEMLANKRQSNPPKKHSNLPL >gi|226332055|gb|ACIB01000001.1| GENE 91 99431 - 100024 651 197 aa, chain + ## HITS:1 COG:no KEGG:BF1622 NR:ns ## KEGG: BF1622 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 197 1 197 197 397 99.0 1e-109 MKMRTIKHFLAACFLLVFATSIHAQEQGTKTYLPKFAIKTNALYWATSTPNLGFEVALAK KLTLDVSGNYNPWKFSKDRQIKHWLVQPELRYWLCERFNGSFFGLHGHYADINMSNLDIF GLGNYRYDGKIYGAGISYGYHWILKNRWSMEATIGAGYARLDYDKYACGKCGEKLGHNNK NYFGPTKVGLSIIYTIK >gi|226332055|gb|ACIB01000001.1| GENE 92 100076 - 101524 1699 482 aa, chain + ## HITS:1 COG:no KEGG:BF1609 NR:ns ## KEGG: BF1609 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 482 1 482 482 924 99.0 0 MKRKIIYFLLALAVVMPASAQKFFKDAISLSDVSLWQQGNSLYVDMKIDMKNLTVSPERM LTLTPLLTDGQHNVALEEIIINGKRRQKAYLRGLAISRELPAGIVIPYNKREVLNYAQVI PYEPWMANASLNLVENLCGCGNNEEMLAQELITNDVSTEAKRLSTMIPVVAYIQPTVEVV KNRSEQYEAHLDFPVSKAVIQPEFMNNHKELMNIHAMFDKIQNDKNLTVTGISIEGFASP EGPLKFNEQLSQKRAEALKNYLTTNEKVPGKLYKVTFGGENWDGLVQALEKSSMKDKDKF IGIIKNTTDDARRKQEIMRVDGGAPYRTMLKEIYPGLRKVNCKIDYTVANFDVEQGRVVI KVNPKYLSLNEMYQVANSYPKGSNDFVNVFDIAVRMYPNDEVANLNAAAVSLTKKDLENA IKYMDKANHQTAEFINNVGVYNFLNGDVQRAIAAFNQAAQMGNEAAKANLQQLQQILNMK KK >gi|226332055|gb|ACIB01000001.1| GENE 93 
101729 - 103228 1466 499 aa, chain + ## HITS:1 COG:BB0604 KEGG:ns NR:ns ## COG: BB0604 COG1620 # Protein_GI_number: 15594949 # Func_class: C Energy production and conversion # Function: L-lactate permease # Organism: Borrelia burgdorferi # 3 497 6 499 500 297 39.0 3e-80 MTLILAIVPVLLLIVLMAFFKMPGDKSSVISLIVTILIALFGFHYAVDNLVFSFVYGALK AVSPILIIILMAIFSYNVLLKTEKMEIIKQQFSSISTDKSIQVLLLTWGFGGLLEAMAGF GTAVAIPAAILISLGFKPIFSATVSLIANSVATAFGAIGTPVLVLAKETNLDVQVLSTNV VLQLSVLMFLIPLVLLFLTDPKIKSLPKNLFLALLVGAVSLGSQYVAARYMGAESPAIIG SILSIVVIVIYGKLTAPKKEKAQQNALKRKDILNAWSIYLLILLLIILTSPLFPGLRSTL ENNWVTRISLPINDSTVNYTIAWLTHAGVLLFVGTFVGGLIQGAKVKELFIVLWNTVKQL KKTFVTVICLVGLSTIMDTAGMISVIATALATATGSLYPLFAPVIGCLGTFITGSDTSSN ILFGKLQASVAGHINVSPDWLSAANTVGATGGKIISPQSIAIATSAGNQQGKEGEILKAA IPYALAYVVITGIIVYIFS >gi|226332055|gb|ACIB01000001.1| GENE 94 103364 - 104956 1692 530 aa, chain - ## HITS:1 COG:no KEGG:BF1606 NR:ns ## KEGG: BF1606 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 530 25 554 554 1048 99.0 0 MRTIRFNNNRHISSSPLLRTEGGREASVSDTSGDKIPKGKFSFLLLFFLLFLGSCSSDFL DNKPTDAVDSGLVPVPSNAERIFNGAWYNLFEYGTTYANIGYRALMCLDDMMADDVVSRP MYGFNSSYQFNDVVMPSDGRTTFAWYLMYKTIDNCNTAISIQATGDDDTPEFRHAQGQAL AVRAFCYLHLAQHYQFTYLKDKNALCVPLYTEPTNPNTKPKGKATLEEIYTQIINDLNRA KGLLDGYVRPNDKSKYKPNTDVVNGLLARTYLLTGQWDEAAKAALEAAKGYTLMTDAKNY MGFNDISNTEWIWGHPQSVSQSDASYNFYYLDVVEPDSYNSFMADPHFKDAFTEGDIRLE LFQWMREGYLGYRKFRIRSDQTGDIVIMRSAEMYLIAAEALARKGQLGEAVKPLNTLRNA RGLADYDLTGKNQEQVIDEILMERRRELWGEGFGITDVLRTQRPVVRVALTEDEAAKEYD CWQQNGTYKKYHPEGHWFTSFPDGTKFVPNSIYYLYSIPEKETNANPNLK >gi|226332055|gb|ACIB01000001.1| GENE 95 105064 - 108192 3001 1042 aa, chain - ## HITS:1 COG:no KEGG:BF1605 NR:ns ## KEGG: BF1605 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1042 1 1042 1042 2012 99.0 0 MKTNLRLLFTLLFVVSVVALRAGTMNDIRVSGTVVSEGDPLPGVSVLVKGTGVGTITGID GKYSINVPSDGTLVFSFIGLKSVEHKVGGRSVINVELVPDSKQLEEVMVVAYATAKKYSF 
TGAASTVKGDEIAKLQTSSVSRALEGTVAGLQASAASGQPGTDATIRIRGIGSINASSAP LYVVDGVPYDGSVNSINPEDIASMTVLKDAASAALYGSRGANGVIIITTKQGQSDSKTTV NVKASFGGSNRAVRDYDRIGTDQYFELYWEALRNQYALDTKNYTPQTAAIKASKDLVGKL MGAGPNPYGSKYPQPVGTDGKLADGAVPLWDFDWQDAMEQQALRTELGLNVSGGGKTNQY YFSAGYLNDKGIALESGYERFNLRSNVTSQMTKWLRGGVNMSFAHSLQNYPVSSDTKTSN VINAGRLMNGFYPIYQMNEDGTYKLDSEGQWIYDFGSYRPSGSMANWNLPATLPNDKSER MKDEFSGRTFLEVTFIEGLKFKTSFNFDLINYNSLDYTNPKIGPAVNTGGSSSRENDRTF SWTWNNILTYDKTLGEHHFNLLAGQEAYSYRYDVLRASRSNMALPDFPELAVGSLVTGGT GYRVDYSLVGYFLNAQYDYQSKYFFSGSYRRDGSSRFAPETRWGNFWSVGASWRIDREDF MVATSDWLSALTLKVSYGAQGNDNLGTYYASSGLYSVVSNNGENALVSDRLATPKLKWET NLNFNAGIDFSLFNNRFSGSFDFFQRRSKDLLYSRPLAPSLGYNSVDENVGELKNTGVEI DLKGTLIHTRDFMWRLGLNLTHYKNVVTDLPLKDMPVSGVHKLAVGRSVYDFYMKQWAGV DPENGDPLWYKNVKDANDKITGRTTTNDYAQADYYYTGKSSLPKVYGGFNTAFSYKGFEL STIFAYSIGNYIVDRDVTMLWHNGSSTGRAWSTEILNRWTPENRYTDVPALKTVSNSWNA NSTRNLFNNSFLRMKNITLSYNFPQPMIKKISLNSLQLFVQADNLLTVSKNQGLDPEQDI SGLTYYRYPAMRSISGGINVSF >gi|226332055|gb|ACIB01000001.1| GENE 96 108519 - 108920 199 133 aa, chain + ## HITS:1 COG:no KEGG:BF1604 NR:ns ## KEGG: BF1604 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 133 1 133 133 227 99.0 1e-58 MKKERYISHSFIRFRQWSRKAYAIFATLGLCVTIGQLRKNITECALCKQQTPHTTGLDPQ RETEDSVPEEDWENLTLSPEPLLLLLTQLQSSKYSYAGAAQSTVVTSASYITPKEVASHP DGTGNLFFNIPTL >gi|226332055|gb|ACIB01000001.1| GENE 97 108917 - 111163 1716 748 aa, chain + ## HITS:1 COG:HI1554 KEGG:ns NR:ns ## COG: HI1554 COG0161 # Protein_GI_number: 16273454 # Func_class: H Coenzyme transport and metabolism # Function: Adenosylmethionine-8-amino-7-oxononanoate aminotransferase # Organism: Haemophilus influenzae # 324 735 10 421 430 587 63.0 1e-167 MTIEEIKNQVLQGTAISREQAEWLALYPRKEELYDAAHDITTACASQEFDMCSIINARSG RCPENCKWCAQSSHYKTKADVYDLVSAEECLRQAKYNEAQGVNRFSLVTSGRKPSPKNMK ELCVAARRMRRHSSIRLCASLGLLDEEELQALYDAGVTRYHCNLETAPSHFDSLCTTHTQ EQKLKTLHAARRVGMDLCCGGIIGMGETVEQRIEFAFTLRDLNIQSIPINLLQPIPGTPL EHQSPLSEEEILTTVALFRFINPAAYLRFAGGRSQLTPEAVRKSLYIGINSAIVGDLLTT 
LGSKVSDDKEMILSEGYHFADSQFDREHLWHPYTSTSNPLPVYKVKRADGATITLESGQT LIEGMSSWWCAVHGYNHPILNQAVQDQLSRMSHVMFGGLTHDPAIELGKLLLPLVPPSMQ KIFYADSGSVAVEVALKMAVQYWYAAGKPEKNNFVTIRNGYHGDTWNAMSVCDPVTGMHS IFGSALPIRHFLPAPSSRFGDEWNPEDIRPLEHLLEKHADELAAFILEPIVQGAGGMRFY HPEYLREAARLCHRYGVLLIFDEIATGFGRTGKLFAWEHAGVEPDIMCIGKALTGGYMTL SAVLTTNEVADCISNHTPGAFMHGPTFMGNPLACAVACASVRLLLTSGWQENVKRIEAQL NRELAPARELPQVADVRVLGAIGVIEMKEPVNMAYLQRRFVEEGIWLRPFGKLIYVMPPF IITPEQLTKLTEGMIRIISNGLPGSQTK >gi|226332055|gb|ACIB01000001.1| GENE 98 111302 - 112498 671 398 aa, chain + ## HITS:1 COG:PM1901 KEGG:ns NR:ns ## COG: PM1901 COG0156 # Protein_GI_number: 15603766 # Func_class: H Coenzyme transport and metabolism # Function: 7-keto-8-aminopelargonate synthetase and related enzymes # Organism: Pasteurella multocida # 15 395 5 386 387 435 54.0 1e-122 MNKNQETERGELTRFLDELQLLKKKDNFRTLPTLVHQGKEVIIGGQRMLNLSSNDYLGLA NDSGLLKAFWQTVKPEEIKFSSSSSRLLTGNFAAYDELEATLSSLFGTEAALVFNCGYHA NTGILPAVCDTKTLILADKLIHASLIDGIRLSDAKCIRYRHNEYSQLERLVETYHKEYEQ VIIVTESIFSMDGDEADLPRLVKLKRKYPNVLLYLDEAHAVGVRGTGGLGCAEAYGCISD IDFLVGTFGKALASSGAYIVCRQVIRDYLINKMRPFIFTTALPPVTLQWTSFVLRHLAEY QEKREHLAAISSNLRTGLQEKGYFSASASQIVPMIAGESSSAVRMAEELQRKGFYALPVR PPTVPEGTSRIRFSLTADVTEEEVKRVIATIRPVSSKS >gi|226332055|gb|ACIB01000001.1| GENE 99 112503 - 113978 864 491 aa, chain + ## HITS:1 COG:PM1903 KEGG:ns NR:ns ## COG: PM1903 COG0500 # Protein_GI_number: 15603768 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Pasteurella multocida # 247 489 9 249 251 202 41.0 1e-51 MKLKRLYPPQKSVNSLHSETGTGCEAAGYPPLQKEALLFFTGWGMDETPFMHNLPPNKDL IICYDYRSLDFDSTLLSTYEGIYVVAWSMGVWAASQVLPDSNLPLKQSIAINGTLFPIDD MRGIPPAIFEGTLNNLNEATLQKFRRRMCGSGSAFQAFLEIAPQRPVEELKEELVAIGRQ YSELPPSTFVWNKAIIGESDHIFPPKNQEQAWQKYCNEIQRYDGAHYDEKILKENLAPAA ASSKLLIAQRFTKAIGTYPHEARVQQQIARKMCSLLQQHLPAHSLRRVVEFGCGTGTYSR LLLRSFRPEHLLLNDLCEEMRHSCRDILNERVSFLPGDAEALDFPHGTELITSCSVLQWF 
EHPDAFFRKCENILNAQGYIAFSTFGKENMKEIRQLTGQGLAYRSREELTASLSALYDIV HTEEEVISLNFNNPMEVLYHLKQTGVTGTCNQSWTRSKLNLFCQEYERLFSPGKGSVSLT YHPIYIIAKKR >gi|226332055|gb|ACIB01000001.1| GENE 100 114013 - 114654 647 213 aa, chain + ## HITS:1 COG:NMA0943 KEGG:ns NR:ns ## COG: NMA0943 COG0132 # Protein_GI_number: 15793901 # Func_class: H Coenzyme transport and metabolism # Function: Dethiobiotin synthetase # Organism: Neisseria meningitidis Z2491 # 3 207 2 206 215 239 54.0 4e-63 MKKNVYFISGIDTDAGKSYATGFLARELNRKGQRTITQKFIQTGNTGHSEDIDLHRRIMG IAPTDDDREGLTMPEIFSYPCSPHLASRIDGRPIDFDKIERATEELSRRYDVVLLEGAGG LMVPLTDELLTIDYIAQKEYPLLFVTSGKLGSINHTLLSLEAIKNRGIKLDTVLYNLYPT VEDRTIQEDTQLFIQRYLKKDFPETKFCMVPEL >gi|226332055|gb|ACIB01000001.1| GENE 101 115066 - 116421 1227 451 aa, chain + ## HITS:1 COG:all2964 KEGG:ns NR:ns ## COG: all2964 COG1252 # Protein_GI_number: 17230456 # Func_class: C Energy production and conversion # Function: NADH dehydrogenase, FAD-containing subunit # Organism: Nostoc sp. PCC 7120 # 11 423 5 424 442 274 36.0 3e-73 MSFNIVKNDKKRVIIVGGGFGGLKLANKLKKSGFQVVLVDKNNYHQFPPLIYQVASAGLE PSSISFPFRKIFQKRKDFYFRMAEVRAIFPEKKMIQTSIGKAEYDYLVLAAGTTSNFFGN EHIEEEAMPMKTVSEAMGLRNALLANFERSITCSTERERQELLNVVVVGGGATGVEIAGV LSEMKKFVLPNDYPDMPSSLMHIYLIEAGDRLLAGMSEDSSRHAEQFLREMGVNILLNKR VTDYKDHKVMLEDGTEIATRTFIWVSGVAAITFGNIDGELLGRGRRIKVDEFNRVQGTDT IFAIGDQCIQTTDKNYPNGHPQLAQVAIQQGELLAKNLQRLQKGKPMQPFHYRNLGSMAT VGRNRAVAEFSSFKTAGWLAWVMWLVVHLRSILGVRNKANVLLNWVWNYFTYDQSLRMIV YAKKAKEVRDREELEAKTHWGEEIQTQQPKG >gi|226332055|gb|ACIB01000001.1| GENE 102 116593 - 119013 2347 806 aa, chain - ## HITS:1 COG:Cj0607_2 KEGG:ns NR:ns ## COG: Cj0607_2 COG0577 # Protein_GI_number: 15791967 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Campylobacter jejuni # 277 413 258 391 394 66 31.0 3e-10 MIGNYWNSAYRNLMKRKGFSFINVFGLAVGMASALLILTYVTFEFSFDKMHEKYERIFRV ESTFYEGEVQTDYWASSSFGYGSAMKENLAGIEDYTRVVSLYQPEQIVKYGELTLRENQI 
AYADPGFFRLFDFELVKGDKATCLSMPRQVVITERIARKYFQDEDPIGKILIFTGPYDKV VCEVTGVMKEMPSNSHIHYNFLISYKSLGQYLHDYWYKHEVYTYVLLDSPERKEEIEKAF PAMSEKYKTDEALKNKIWGVSLTPLADIHLKPQVGYEAEIKGNRTAMIALIFAAIAILAI AWINYINLTVARSMERAKEVGVRRVIGAFRKQLVSQFLFEALVMNLVALVLAVGLIELIL PYFNQLVSRTVTFSVWLTGYWWLLLLIVFVGGIFLSGYYPALALLNRKPITLLKGKFLNS KSGEGTRKVLVVVQYTASMILLCGTLIVFAQLNFMRNQSLGVKTDQTLVVKFPGRTEGMN TKLEAMKKAIARLPLVDKVTFSGAVPGEEVATFLSNRRKSDALKQNRLYEMLVCDPDYID AYGLQLVAGRGFSEDYGDDVNKLVVNESAVRNLGIASNEEALGEEIEVECTDAPMQIIGV VKDYHQQALNKNYTPIMLIHKDKIGWLPQRYISIVMKSGDPKELVSQVEEIWHRYFEDSS YDFFFLDQFFDHQYRQDEVFGVMIGCFTGLAIFISCLGLWVLVMFSCTTRTKEMGIRKVL GATRWNLFYQLGKGFFQLILIAVIIALPVAWFSMNAWLSHYAFRTDLKIWFFAVPVVLML LISFVTVACQTVKIIVGKPARSLRYE >gi|226332055|gb|ACIB01000001.1| GENE 103 119010 - 119726 359 238 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 211 4 214 223 142 39 1e-32 MIKTEKLSMLFTTEEVQTKALNEVTMQVERGEFVAIMGPSGCGKSTLLNILGTLDSPTSG SYFFEGKQVDKMNENQLTALRKGNLGFIFQSFNLIDELTVYENVELPLVYLGMKPAIRKE KVKQVLEKVNLLHRANHFPQQLSGGQQQRVAIARAVVTDCKLLLADEPTGNLDSVNGVEV MELLRELNRQGTTIIIVTHSQRDATYAHRIIRLLDGKIVAENINRPLDETSGTNTESV >gi|226332055|gb|ACIB01000001.1| GENE 104 119748 - 122153 1760 801 aa, chain - ## HITS:1 COG:TM0351 KEGG:ns NR:ns ## COG: TM0351 COG0577 # Protein_GI_number: 15643119 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Thermotoga maritima # 121 409 123 402 404 70 23.0 1e-11 MNYISQVIRSQRVKKSLTLINITGLSVCIAAALLIMLYVWSELSYDSFHDTDRVYRVESR LYEGEMLTDNWATTAYGHAPAMNREIAGIEKYVRVTAQDREQVVNYFDRRFAEEHYCYTE PAFFEIFNFPIVKGEKTGQLVRPNTVVLTESAASRYFGEEDPIGKILTFSTSSSQQNFEV TGIIADMPVRSHLHYDFLLSYNTIPKERQDIWYIHGVYTYVRLMPGKTPGEIEQAFRDIS DKYKTDALKHKTWAVELISLKDIHLTPQKAYEKEVKGSRTAVLILFVMSAILLLIGWANA LNLTVARFLERGREFGLRKAFGASRRQIIIQGLLESGFMNLLATLIAFGWLELLLPLVYR WAGQNFGTDILMQPAFWGIVAGVVIIGTLVVGLYPSWLMVTIRPSEIMRGKLLHGKRGNR 
IRKALIVVQFLASFVLIAGTFTVFQQVRYMQREAESDLNTRILVIKYPSFTEGLSLRMES FTKRLKQRADVSHVTVSGAVPGVEVANYFTNRPYGSDPSQVKLIQMFSVDYDYLSAYMPR MICGRSFSEDYGGDLNRVVLNEEAVRLLGYESAEAALGQQLKMEVVSDPLEIIGVVENYH QQSLAVAYKPIIFFLKERVPFIATPYISVCLKGKGDAGVLTEIEQMYREYFPTSLFSYFF LNDFNEFLYKSDRNFGWIFASASLLAVFVACLGLWIVTLFSTLSRLKEVGIRKVLGANKT SLFFVLTKELLLLTVLASAIGIPVSAVLMNAWLETYAFHISLSWWIYAATFVLLMLIAFL TVLQQVWRTIRQKPMRILKYE >gi|226332055|gb|ACIB01000001.1| GENE 105 122166 - 123413 1472 415 aa, chain - ## HITS:1 COG:YPO1498 KEGG:ns NR:ns ## COG: YPO1498 COG0845 # Protein_GI_number: 16121771 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Yersinia pestis # 31 399 34 404 420 116 23.0 9e-26 MDTMIERKPGMKRKHYYAVAGAVVLVGLVFYFIFRDTSSSMNVEKDRLTIATVTRGEFSD YIRVIGQVMPNRIIYMDAIEGGRVEERLKEEGAIVKAGDVILRLSNPLLNIGIMQSEADL AYQENELRNTRISMEQEHLRLKQERIGLNKELAVKQRRYEQYSRLIKEQLIAEEEFRLAA EEYEAARAQLEVIDERIRQDNFFRESQVHSLDENIRNMKRSLALVRERVENLKVKAPIDG QVGNLDAQIGQSIAAGEHIGQIITPDLKVQAQIDEHYVERVVPGLPADFTRDGGNYKLEV TKPYPEVKEGQFRTDLQFTAGRPENIRAGQTYHINLQLGDPAQAILVPRGGFFQITGGRW MYVVDESGKFATRRPIRIGRQNPQYYEVTEGLKPGDRVIVSGYELFGDNEKLILK >gi|226332055|gb|ACIB01000001.1| GENE 106 123554 - 124723 1143 389 aa, chain - ## HITS:1 COG:no KEGG:BF1594 NR:ns ## KEGG: BF1594 # Name: not_defined # Def: putative outer membrane efflux protein # Organism: B.fragilis # Pathway: not_defined # 1 389 51 439 439 685 99.0 0 MDYISSVASFLPRVSVSAEAGRNFGRSIDPNTNGYTNDTFDEGTVGLDMTLSLFEGFTRI NRVRFEKMNRNRSEWALKERRNELACQVTDAYYKLLLEERMLDLALEQSRLSERYLKQTE AFVELGLKSVSDLQEVKARREGDIYRYQARQNGCRLALLRLKQLLNLHDEDTLAVQDTIN YELLSAYPLPQTEELYTQSLVAMPSMRMMELRQRAARKEYAMAGGKFSPTVFARFSMASR YLDGFSTKQLNDNLGKYIGIGISIPLLSGLERLTTLRKHKLNIFRLRNEEELQKQQLYTE VEQTVLSLRSGYDEFRQVLQQFRAEELVLKESERKWEEGLISVFQLMEARNRFISSKAEL ARVRLQVDMTLKMETYYRTGSFCTLPGEE >gi|226332055|gb|ACIB01000001.1| GENE 107 125154 - 126491 1264 445 aa, chain + ## HITS:1 COG:STM4174 KEGG:ns NR:ns ## COG: STM4174 COG2204 # Protein_GI_number: 16767428 # Func_class: T Signal 
transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Salmonella typhimurium LT2 # 6 442 8 441 441 309 38.0 8e-84 MNTGTILIVDDNKGVLASLELLLENYFSKILTASNPNQITALLTTRRIDVVILDMNFSAG INNGNEGLYWLGHIRQMAPTLPVVMLTAYGDVELAVKALKNGAADFLLKPWNNQTLIDKV TEAYRSHKAPERTAKVDRGEGFEMLVGRSPAMLQLMKVVSKVARTDANILITGENGTGKE ILAREIHRLSPRGARPMLNVDMGAISESLFESELFGHERGAFTDAHESRPGKFEAANGSS LFMDELGNLPLPLQTKLLTVLQSRNVTRLGSNKVIPVDIRLISATNKDIPEMVKQGMFRE DLFYRINTIHLELPPLRERGDDILLFIDCFLRKFTSKYQRQEIRIHEQTVEKLRSYHWPG NIRELQHTIEKAVILCEGSVIRPKDILVKQTWQPQATATVPNLEEVERRAIETAILQNNG NLTAAAEQLGVSRQTLYNKLKRFKL >gi|226332055|gb|ACIB01000001.1| GENE 108 126497 - 127747 966 416 aa, chain + ## HITS:1 COG:SMc01044 KEGG:ns NR:ns ## COG: SMc01044 COG5000 # Protein_GI_number: 15965213 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation # Organism: Sinorhizobium meliloti # 40 406 306 718 753 108 24.0 1e-23 MIFSQRIYFIILFHVLLILVTAGTGLWLVVSHTGYIIGGILLLCSLFQIGALTRQLNTFN RKIRLFFDAVQDKDNMLYFPEANVGKEQAQLNRSLNRLNDLLARTKGESHKQEHFYQSLL EEVPSGVLAWDDSGRIIIANSAAFSLLKCNRLYTETQLAQLLNVTGIRKSLSISEKKMIL DGETITILSIKDIGDELSDKENESWSKLTHVLTHEIMNTIAPILSLSQTLSERPDINEKS ARGLRIIQAQSERLMEFTESFRHLSYLPHPEKRRFALTEMLRNLEELLQSDFRERGIRFM LQCVPDPIETDGDPNQLSQVFLNLLKNAMQALEGQADGRIILRVRRADRLQVEIEDNGPG IPEEIREQIFIPFFTTKTEGSGIGLSLCKQIIRQHDGHLSIRESRPGQTIFSLDLP >gi|226332055|gb|ACIB01000001.1| GENE 109 127816 - 128814 1018 332 aa, chain + ## HITS:1 COG:VC0624 KEGG:ns NR:ns ## COG: VC0624 COG0628 # Protein_GI_number: 15640644 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Vibrio cholerae # 125 327 150 351 361 99 34.0 8e-21 MKEKYWKYSLIIIIIGLGIILFRQITPFLGGLLGALTIYILVRKQMIYLGARMKRSFAAL LITGEAILCFLVPISLIVWMLVNKLQDINLDPQAIITPVEELAGIIKAKTGYDVLGSDTL SFIVSLLPKIGQAVMGGISSFAVNLFVLVFVLYFMLIGGTKMEAYIDDILPFNEKNTREV 
THEINMIVRSNAIGIPLLAVIQGGVALIGYFIFGAPNAWLIGVLTCFATIIPMVGTALVW FPVAAYLALTGEWANAIGLAAYGGIVVSQCDNLIRFILQKKMADTHPLITIFGVVIGLSL FGFMGVIFGPLVLSLFLLFVDMFKKEYLDNKK >gi|226332055|gb|ACIB01000001.1| GENE 110 128885 - 130057 990 390 aa, chain + ## HITS:1 COG:Ta1048 KEGG:ns NR:ns ## COG: Ta1048 COG0463 # Protein_GI_number: 16082079 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Thermoplasma acidophilum # 57 299 7 215 256 76 24.0 7e-14 MEAFTFNTTELILLSATGVLLIIQLIYYLGLYNRIHTHNLSVGKDEVHFGRELPPLSVVI CARNESENLRRNLPTILKQDYPDFEVIVINDGSTDESEDLLSALEEEYPNLYHSFTPDSA RYISRKKLALTLGIKASKHDWLVFTEADCAPVSNQWLRRMARNFTSSTDIVLGYSGYERG KGWLHKRASFDSLFTSLRYLGFALAGKPYMGIGRNLAYRKELFFKVKGFSTHLNMQRGED DLFINQVANENNTRVETSPDAVIRMQPVERYKDWKEEKVSYMATSRFYKGSQRWLLGLET GTRLLFYAACLAGIVFGILSFHWLVVGLSLLLWFVRYSVQAYVINKTAGEMGDNRSYYFT LPVFDIIRPLQTLKLKLYRLYRGKGDFMRR >gi|226332055|gb|ACIB01000001.1| GENE 111 130063 - 131370 1184 435 aa, chain - ## HITS:1 COG:RSc1117 KEGG:ns NR:ns ## COG: RSc1117 COG4591 # Protein_GI_number: 17545836 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ABC-type transport system, involved in lipoprotein release, permease component # Organism: Ralstonia solanacearum # 33 435 11 416 416 119 24.0 1e-26 MSFTHPHNQSNSVSSSGAKGKLRLSYFLARRIYRDTDGGKQVSRPAVVIAMIGIAIGLAV MIIAVSVVIGFKSEVRNKVIGFGSHIQISNFDAVGSYETHPVVVNDSMMVALSALPEVKH VQRYSTKPGMIKTDDAFQGMVLKGVGPEFDPTFFRDHLIEGEIPAFSDTASTNQVVISKA IADRLKLKLGDKIYTYFIQKDVRARRLTVKGIYQTNFSEYDNLFLLTDLYLVNRLNNWAP GQVSGAELQVRDYDKLEDITYRIATDTDNKQDIYGGTYYVQSIEQMNPQIFAWLDLLDLN VWVILILMVGVAGFTMISGLLIIILERTQMIGILKALGANDFIIRKVFLWFSVFLIGKGM LWGNAIGIVFCILQSQFGLFKLDPETYYVSMVPVSMNIWLFLLINAGTLLTSVLMLVGPS YLITKINPADSMRYE >gi|226332055|gb|ACIB01000001.1| GENE 112 131380 - 132579 1272 399 aa, chain - ## HITS:1 COG:CAC1001 KEGG:ns NR:ns ## COG: CAC1001 COG0436 # Protein_GI_number: 15894288 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic 
aminotransferase # Organism: Clostridium acetobutylicum # 5 394 4 393 395 361 46.0 1e-99 MPTISIRGTEMPASPIRKLAPLADAAKQRGIHVFHLNIGQPDLPTPQAAIDAIRNIDRKV LEYSPSAGYRSYREKLVGYYEKFNINLTADDIIITTGGSEAVLFSFMSCLNPGDEIIVPE PAYANYMAFAISAGAKIRTIATTIEEGFSLPKVEKFEELINERTKGILICNPNNPTGYLY TRREMNQIRDLVKKYDLFLFSDEVYREFIYTGSPYISACHLEGIENNVVLIDSVSKRYSE CGIRIGALITKNKEVRDAVMKFCQARLSPPLIGQIAAEASLDAPEEYSRETYDEYVERRK CLIDGLNRIPGVYSPIPMGAFYTVAKLPVDDSDKFCAWCLSEFEYEGQTVFMAPASGFYT TPGSGKNEVRIAYVLKKEDLTRALFVLQKALEAYLGRTE >gi|226332055|gb|ACIB01000001.1| GENE 113 132750 - 133985 1092 411 aa, chain + ## HITS:1 COG:APE1887 KEGG:ns NR:ns ## COG: APE1887 COG2407 # Protein_GI_number: 14601699 # Func_class: G Carbohydrate transport and metabolism # Function: L-fucose isomerase and related proteins # Organism: Aeropyrum pernix # 53 398 72 417 433 141 28.0 2e-33 MTINLITFASILHKQVTIRSSHEAVLSELEKYFTVKFVDYRDIHQLTKDDFSIIFIATGG VERLVMQCFESLPRPAIILADGMQNSLAAALEISSWLRGRGMKSEILHGDFMSIVQRIQV LYTNFKAQRSLVGLRIGVIGTPSSWLIASNVDYLLAKRRWGIEYLDIPLERIYDVYDRIK DNEVGASCAAVASQALACREGTPEDMIKAMRLYRAIKRICKEEKLSAVTVSCFKLIERTG TTGCLALAMLNDEGIIAGCEGDLQSVFTLLVAKALTGKVGFMANPSMINARNNEIILAHC TIGMKQTERYIIRNHFETESGIGIQGLLPEGDVTIVKCGGECLDEYYLSTGTLTENTNYI NMCRTQVRVKMNTPADYFLKNPLGNHHILLQGNYENSLNEFLMANSCKRTE >gi|226332055|gb|ACIB01000001.1| GENE 114 134069 - 136093 1078 674 aa, chain + ## HITS:1 COG:no KEGG:BF1599 NR:ns ## KEGG: BF1599 # Name: not_defined # Def: putative chondroitinase AC precursor # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 674 1 674 674 1405 99.0 0 MKRISLFVLFICTIFSCLSAQTDKENERIKQNYIRSIIGMDEEKQPYLQLLSKIPPEKEV SDQNVIELQQLYPIRQEEIHRLISTIRGDGSWPDINYADTKRSGWEPKQHTERILKLTKY YAREQENVSAQEISAVVSIIHQAMKYWFVQKPVCKNWWYNQIGIPRTLGPAFLLFETEMT EKEKQEAIRVMEQSQFGMTGQNKVWLAGNVLIRGLLLNDAELIKEARENICSEIVLGQKE GIQPDWSFHQHGPQQQFGNYGLSFLCNMSFYSELFAGTPLAFDRRQQDILVSLLLKGYQW IVWRGYWDVNGLNRQLFHSADIHKSFNLLFAAYSLMKGSNDQQTREIKELIARNFLHPDT NNEFTGNKFFGDSDLTIHRTPHWMASVRMASDRVIGTELVNEDNLKGYYMADGAIYTYIR 
GDEYHNIFPFWDWRRIPGITTYESDAPIPTESGADSRNQTNLVGGTTDGKHGITAMHLNR NGLSANKVWIFTDEFILCLGSNIHTDSTATLITSIDQRFKKGEVWSEGNRRYFHDNTGYI LLQDELCPVQTEKKKGQWHDFMGMYAPKMLESNIFSIYIKHSPGAPASYRYLLLPGSTQE KTATFDTSRIQILRNDEEAQVAFTGGMYYIAAWQTATIRLSGNKEICIKTPGTYLYRADG VPVSQAVFPKKGIQ >gi|226332055|gb|ACIB01000001.1| GENE 115 136108 - 136932 708 274 aa, chain - ## HITS:1 COG:TM0024 KEGG:ns NR:ns ## COG: TM0024 COG2273 # Protein_GI_number: 15642799 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucanase/Beta-glucan synthetase # Organism: Thermotoga maritima # 39 268 211 457 642 122 35.0 7e-28 MKHKNLLLALPFAFFVFAGISLSSCKDTPVRKLYKSDIDWKLTWQDEFDKDGVPDPEKWV FSPWHPFCRDNNFVTFVKDGKLVLRALPNNDPNDTIRYMAGCVETLGKKDFLYGRFEVCA KLGSAKGSWPAIWLKPTDSTTYGAWPKCGEIDIMEQLNKDTFVYHTMHTEYIKVLGHKTD PDYYTTASYNPHEFNVFGMEWTPDKIDMFINDSLTFSYPRIQEEGSVQWPFDVPFFLLLN QILGGWAGEIDDTELPVQMEVDWVRYYELPENKK >gi|226332055|gb|ACIB01000001.1| GENE 116 136944 - 137777 918 277 aa, chain - ## HITS:1 COG:SPCC1672.01 KEGG:ns NR:ns ## COG: SPCC1672.01 COG1387 # Protein_GI_number: 19075372 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Histidinol phosphatase and related hydrolases of the PHP family # Organism: Schizosaccharomyces pombe # 7 264 6 271 306 82 25.0 1e-15 MNLTNYHSHTLYCDGRAGMEDFIRFAISRGFTSYGISSHAPLPFPTHWTMEWDRMDDYLD EFTRMKEKYASEIELAVGLEIDYLNEDSNPSVRRFQELPLDYRIGSVHLLYNDRDEVVDI DVCAEVFRDIVDQHFGGDLDRVIHLYYDRLLRMVELGGFDILGHADKMHHNAACYRPGLL DESWYDALIHDYFSAVAAKGYIVEINTKALHHLNTFFPNERYFPLLKELGVRVQVNSDSH YPERINSGRPEALAALQKAGFETVTEWHGGKWIEMPL >gi|226332055|gb|ACIB01000001.1| GENE 117 137774 - 139639 1859 621 aa, chain - ## HITS:1 COG:sll0912 KEGG:ns NR:ns ## COG: sll0912 COG0488 # Protein_GI_number: 16331003 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Synechocystis # 5 618 4 634 636 480 42.0 1e-135 MIPYLQVDNLTKSFGDLVLFENISFGIAEGQRIGLIAKNGTGKTTLLNILSGKEGYDSGN 
IVFRRDLRVDYLEQDPQYPEELTVLEACFHHGNSTVELIKEYERCMETEGHPGLDDLLVR MDHEKAWEYEQKAKQILSQLKIRNFDQQVKHLSGGQLKRVALANALITEPDLLILDEPTN HLDLDMTEWLEEYLRRTNLSLLMVTHDRYFLDRVCSEIIEIDNRQVYQYKGNYSYYLEKR QERIEAKSVEIERANNLYRTELDWMRRMPQARGHKARYREDAFYELEKVAKQRFNNDNVK LEVKASYIGSKIFEADHLYKSFGDLKILEDFSYIFARYEKMGIVGNNGTGKSTFIKILMG QVQPDSGTLDIGETVRFGYYSQDGLQFDEQMKVIDVVQDIAEVIELGNGKKLTASQFLQH FLFTPETQHSYVYKLSGGERRRLYLCTVLMRNPNFLVLDEPTNDLDIITLNVLEEYLQNF KGCVIVVSHDRYFMDKVVDHLLVFNGQGDIRDFPGNYSDYRDWKEAKSQKEKEAEKPQEE KTARVRLNEKRKMSFKEKREFEQLEKEIAELETEKLQIEELLCSGTLSVDELTEKSKRLP EVNDLIDEKTMRWLELSEIES >gi|226332055|gb|ACIB01000001.1| GENE 118 139665 - 140495 811 276 aa, chain - ## HITS:1 COG:no KEGG:BF1581 NR:ns ## KEGG: BF1581 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 276 1 276 276 552 100.0 1e-156 MIQGKKFISPGTWFSMIYPSDWSEFEDGEGSFLFYNPEHWTGNFRISAYKEDAAAPGGMN YGKDSVRQELKENPSASLVKVGHLECAYSKEMFEEEGTYYTSHLWVTGIDNVAFECSFTV PKGEHVDEAEKIISSLEIRKDGQKYPAEIIPIRLSEIYQVNEAYEWVTDTVKTQLKKDFQ GLEEDLQKIQDVIDSGVLSPKKKEPWLAFGIAICTILANEVEGMEWMTLVDGNREVPVLR YEGSEQLIDPMKLLWSRVKAGEACEVAEAYKQALIH >gi|226332055|gb|ACIB01000001.1| GENE 119 140637 - 141020 391 127 aa, chain - ## HITS:1 COG:no KEGG:BF1580 NR:ns ## KEGG: BF1580 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 127 1 127 127 244 100.0 9e-64 MKKGILLLFVVLMGFTSCGKKFGHRYLDGMWQMQRIEYKDGNIDTPLDTYFSFQMDIIHL RKLGNSEFYGKYVYENDSMHIQVLDATAEQMKVFGMDGRVQDFAVEKLNSNKLVLQSDYA RLEFRKY >gi|226332055|gb|ACIB01000001.1| GENE 120 141017 - 142456 1148 479 aa, chain - ## HITS:1 COG:no KEGG:BF1593 NR:ns ## KEGG: BF1593 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 479 22 500 500 983 100.0 0 MLMAPAAVQAQFDKGLNYKIETSATVSGGHNTPFWLVANKHGLSSIRKNNAYLDAGIFRD LEKDKKFSYAFGLEMVGASRFTSKFFIQQAYVDLRYRGMEISVGSKERGNEMKNDQLSSG SMTFSTNARPIPQVRVSIPEYIPFPWTKNWLHIKGHVAYGMFTDEKFQERFTAGRSKYTK 
ETLFHSKAAFLKVENLQKSPVSVELGIEMAAQFGGDCYYPDGTVLRTPDSWKDFFRIFFP SNGDSGASESDQINILGNHVGSYSAAVGYHFPTWKIKAYWEHFFEDRSGMTLTYGMWRDC LAGLEVTLPENPFVKTIVGEFLYTKHQSGAFHYFATPAIDHSFTGADNYYNNSQYAGWEH WGQGIGNPLVTSPIYNKDGNLAFESNRVKGFHIGLNGSPTSEIDYRILVSVAKHWGTYGS PYRNIRRNQNGLLEVTYKPEQIRGWSFTLAGAVDGGNMLGESWGGMLTIRKTGLIGKKK >gi|226332055|gb|ACIB01000001.1| GENE 121 142657 - 143850 1036 397 aa, chain - ## HITS:1 COG:no KEGG:BF1578 NR:ns ## KEGG: BF1578 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 397 1 397 397 773 99.0 0 MIKIDFKQRKREVLWTVVALFFIAIGVSVFKFSSATSGFEITDELGGNIFPSAILSVATT DAQVIQPADSIYLGNPKSCIAIKVKSRSAYSRVRIDVAETPFFSRSVSEFVLNKPRTEYT IYPDIIWNYEALKNNNQAEPISVAVTVEMNGEDLGQRVRTFSVRSVNECLLGYVTHGTKF HDTGIFFAAYVNEENPMIDQLLREALNTRIVNRFLGYQNPAPGAVDKQVYALWNVLQKRK FRYSSVSNTSLSSNVVYSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRTPG HMFVGYYTDNSHKDMNFLETTMIGDVDLDDFFPDEQLDSTMVGKSQNQMSLLTFEKSKQY ANKKYKENEKGIHSGKLNYMFLEISKDVRRKIQPIGK >gi|226332055|gb|ACIB01000001.1| GENE 122 144158 - 144847 723 229 aa, chain + ## HITS:1 COG:TM1655 KEGG:ns NR:ns ## COG: TM1655 COG0745 # Protein_GI_number: 15644403 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Thermotoga maritima # 1 226 9 240 247 173 43.0 3e-43 MNDYRILVVDDEEDLCEILKFNLENEGYEVDTANSAEEALKMNISSYHLLLLDVMMGEIS GFKMANLLKKDKKTAQVPIIFITAKDTENDTVTGFNLGADDYISKPFSLREVIARVKAVL RRTATSDTEKAPEQLCYQSLVIDITKKKVSIDGEEVPLTKKEFEILFLLLQNKGRVFSRE DILSRIWSDEVYVLDRTIDVNITRLRKKIGTYGKRIVTRLGYGYCFETE >gi|226332055|gb|ACIB01000001.1| GENE 123 144987 - 146768 1721 593 aa, chain + ## HITS:1 COG:CAC1701 KEGG:ns NR:ns ## COG: CAC1701 COG0642 # Protein_GI_number: 15894978 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Clostridium acetobutylicum # 85 589 71 563 566 163 28.0 1e-39 
MNLPVTQKHFLSFSRKLFLSVISLFLVFAACFMAYQYQREKEYKVELLNIQLQDCNNLLY EKLYDHPQDMTEVISDYLQHNVLKDLRITVVNLDGKVLFDSQEQNVEQLGNHLDREEVQK ALYNGVGFDVRRTSETTGVPYFYSATLYKDYIIRSALPYSVSLINNLKADPHYIWFTVIV TLLLMIIFYKFTNKLGTSISQLREFAMRADRNEPIEMAMQSAFPHNELGEISQHIIQIYK RLHETKEALYIEREKLITHLQISHEGLGVFNKDKKEILVNNLFTQYSNLISDSNLETAEE VFNISELKEITDFINKAPKRPVGKEEKRMSITINKNGKTFIVECIIFQDMSFEISINDVT QEEEQIRLKRQLTQNIAHELKTPVSSIQGYLETIVNNENLPKEKMNVFLERCYAQSNRLS RLLRDISVLTRMDEAANMIDMEKMDISVLVTNIVNEVSLELEEKHITVVNSLKKEIQIRG NYSLLYSIFRNLMDNAIAYAGNNIQIHINCFREDEKFYYFSFADTGVGVSPEHLNRLFER FYRVDKGRSRKLGGTGLGLAIVKNAVILHGGNISAKNSQGGGLEFVFTLAKER >gi|226332055|gb|ACIB01000001.1| GENE 124 146774 - 148612 1420 612 aa, chain - ## HITS:1 COG:no KEGG:BF1574 NR:ns ## KEGG: BF1574 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 612 1 612 612 1226 99.0 0 MENKKGQPTTEAIFRGIQSGKVLELFDKLQYQIAIHGDLTYSDPWGEVHRFRDQFESAKH DSDSPTAIGRYPFADVWIQFYETEVKDYSLLLEMCLMASHSRTSVWRKGFGTLLDKLYGK IPLVEYEQALEHLEHPYALSEILWALEWDYRDQEVYLKFSHYILLHLLPLLTPRNITFLY SVREWFGSTSDHRVVLVHCYWIDCWLKHPKRLLTDDEFTADFKIRYELYRLCNFLSYKEE PYPLEFPIRAVDFGRACQMGLLSEDTLMVELMDRPLSPVLIEEAVDFFYKKDQKEKRLYT DCRDYDFSRFKKVLEKVTERILDIELERGEACTDVTSLARKLDGVTGAELMIRLLSLMGK EKFIRLDKWYYDTGESRTGMFCHLMLHCAPSPTDTPDWLKMLVERAGITPKRLVEMAVYS PRWLEMVEEAIGWKGLTCAANLFYAYTRECYDDVDEARITPYTLLSPLEISVGVVDTAWF WKAYNALGRERYEKVFAASKAVTESSGVYSRFRKYTDALVGKYTIAQLESLVMDNRNKDW VRAYPLAPFAGKARKKEVDARLRFLKAFWLSSDTLSGRHTAEKEAVQVALDNLTGNSGLG NLDTRWFKKKVW >gi|226332055|gb|ACIB01000001.1| GENE 125 148707 - 149642 1063 311 aa, chain + ## HITS:1 COG:no KEGG:BF1587 NR:ns ## KEGG: BF1587 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 311 1 311 311 616 99.0 1e-175 MMNNVKYLSFALILAFSACKSGSASSQEPSEKQDTVKIFNLPQIPVVLNTVEQRTDYMVK HYWDRFDFSDTTYISQAEVPEQAWVDYCDLLEHVPLPVAQTAMKETFNRAEKNRKMLHFF EELADKYLYDPNSPMRNEEFYIPVLEALIASPALDETAKIRPQARLELAQKNRLGTKALN FTYTLDSGVKGTLYQFPAEYTLLFINNPGCHACAEMIEGLKASPVINGFTAAKKLKVLSI 
YPDEELDEWKKHRNDFAKEWTNGYDKELVIKNKNLYDLRAIPTLYLLDKNKTVLLKDATL QKVEQYLAERG >gi|226332055|gb|ACIB01000001.1| GENE 126 150747 - 151949 968 400 aa, chain + ## HITS:1 COG:no KEGG:BF1569 NR:ns ## KEGG: BF1569 # Name: not_defined # Def: outer membrane vitamin B12 receptor protein # Organism: B.fragilis # Pathway: not_defined # 1 400 1 400 678 789 100.0 0 MNILRLPKLLYSCVAFCCCQALTVSLSAQQQKADTARTYSIPEVTVAEAYHTREVRAMAP TQVFSKEELKSLNVLQVSDAVKHFAGVTVKDYGGIGGLKTVSLRSLGAEHTAVGYDGITI SDCQTGQIDIGRFSLDNVDRLSLSNGQSDNIFQPARFFASAGILNIQTLTPQFKDNRRTN LSASFKTGSWGLVNPSLLLEQKLSRKWVLSANGEWMSADGHYPFTLHYGEDNDLTSREKR KNTEVKNLRAEAGLFGNFSDTEQWRLKAYYYQSSRGLPNATTYYYDYSSQHLWDKNVFVQ SQYKKEFSRQWVFQTSAKWNWSYQRYLDPDYKGSEGKTENSYYQQEYYLSASALYRVLSN LSFSLSTDASINRLNANLKDFAYPTRYSWLTAFAGKYVND >gi|226332055|gb|ACIB01000001.1| GENE 127 151950 - 152783 663 277 aa, chain + ## HITS:1 COG:no KEGG:BF1569 NR:ns ## KEGG: BF1569 # Name: not_defined # Def: outer membrane vitamin B12 receptor protein # Organism: B.fragilis # Pathway: not_defined # 1 277 402 678 678 540 99.0 1e-152 MTASASVLATVINEEVRQGSAAANRRKLSPYVSASFKPFAGEEFRIRLFYKDIFRLPSFN DLYYGQVGNTNLKPESTTQYNLGLTYSRSINELIPYVSVTADAYYNKVKDKIIAIPTKNL FIWSMVNLGKVDIKGIDIAGNISLQPWEKLRVNLSGNYTYQRALDMTEPGGKTYKQQIAY TPRVSGSGQAGIETPWVNLSYSFLFSGKRYMLGQNLRENRLDSYSDHSVSVSRDLRIRNV NTSLTVEVLNLLDKNYEIVKNFPMPGRSVRVTMKVRY >gi|226332055|gb|ACIB01000001.1| GENE 128 152830 - 154872 1570 680 aa, chain + ## HITS:1 COG:MA1904_1 KEGG:ns NR:ns ## COG: MA1904_1 COG3391 # Protein_GI_number: 20090753 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Methanosarcina acetivorans str.C2A # 112 347 109 326 361 69 28.0 2e-11 MKMNPFNNSRSNSSPRFRRGVGGGSFLFLLFLLLFASCDDLEDTDTPSGGGDGMPVETGT AELYILSEGLFNLNNSSLALYSFKNNQLNTDYFRSINRRGLGDTANDMAIYGSKLYIVVN VSSQIEVVDLQSGKSVKQIPMLSENGSSRQPRNIAFEGGKAYVCSFDGTVARIDTASLSI DALTRAGRNPDGICVQNGKLYVSNSGGLDWEGIGVDRTVSVIDIPSFTEIKKIEVGPNPG KIQAGPDGNIYVATHGENIEAGDYHFVQIDGHTDQVVRTFDEKVLSFTIHDNMAYLYNYD 
YRTQDSQIKVFNLKTGKTERENFITDGTAIRTPYSISVNPYSGNIYITDAYDYKVKGDVL CFSPQGQLIFKLPNVGINSNTVLFRNKASQGNPDENPADPEAGAFANKVLEYNPAPSQYM NTSYTAYEEGFTGIQVLARATELLQDRTTCLFTLGGFGGNITVGFDHTILNVPGEYDFKI YGNAYYDMYGTLLDKPGGNSEPGIVLVSKDTNGNGLPDDEWYELAGSEYNSPATIRNYEI TYYRPTPADGDVKWKDNQGKEGYIYRNTYHTQGSYYPAWMPAEITFRGSRLADNSINEPR PGMPEHWVGYCYAWGYADNHPNNTEYSQFKIDWAVDKDGKPVHLDGIDFVKIYTAVNQNC GWLGEASTEIQAVEDLHYKK >gi|226332055|gb|ACIB01000001.1| GENE 129 154908 - 156272 810 454 aa, chain + ## HITS:1 COG:no KEGG:BF1567 NR:ns ## KEGG: BF1567 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 454 1 455 455 852 99.0 0 MKKINALITKMCFIALCALPIIISSCNDDDDKYYYPTNFENLSLPNDTIIAKGEDLTLKP TLNLINPKIYSWKIDGKEVSNEVNYTFSTSVGGKHEIIFEAQDSKGNTDKAQITVDVFAY YGGFYVINEGSMGHSASVNYYKDGKWNFNIVESLGQTGTVGVIQDSYMYIVAKDAPYLTQ IELANFNITKQLSTEIEEQLDYGQANSFCTINETTGILTASRGAYKINLNPLSIGNMLPQ LNSESANGNGCKDILKSGDYVFVNNMDTIKIYKSSDLSFVKKLTALMKTGFAQTKDGNIW AANGQSLIKIDPKTLDETTVELPDNINVFYNQWAYTPTGLCASTTENALYIVNTTVVPGA YGDSYYGKNIYKYNTESKIAQEFFKAPSKIQSIYGAGIQVNPQNGDVYLIYTEDGYGAHY LNTNIYVADGQSGTQKQIIDYTGKYWFPSSIVFK >gi|226332055|gb|ACIB01000001.1| GENE 130 156339 - 157799 488 486 aa, chain + ## HITS:1 COG:no KEGG:BF1566 NR:ns ## KEGG: BF1566 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 486 1 486 486 934 100.0 0 MKKITYLLSTLLIVSCTDNGLIDTEQLENIQPSTRAVGDGKYDALGYGYNCFYADFSDPL YVQGKVIDIERLEEGRGRDQITKDELSFTPAKINEAILHGRAQSRIAYGTSIDKLTKKLN VNVKTKIGTKILKVFSLDLEATINQNSTSNNLNSFYKVDALKTTRRLTLPYTSPSRLKYF LSDEFLADLKELSGQEIIQKYGTHVMTDILLGGNFSAFYTGKYESTDQFNEQEFKAKSNF LLSSVKAGTKYDRTLFKSFKQVNVYIKTQGGTVNSSAIISQSPDGVLDNVSIDYTGWINS VSQNSESLIGIGNPDSQMYLLSDFIEDPIARIDIEAALLSLPAQEIVLSSRKNNVINSTA ILTVMDVSNKEVKRPELYPIQGGSISRGLIKFIKSKGYYRIQSMVDRNNEYYLDSSGNYS VFKNDNSQLWQVCVPENNASDFMLKNIGTGLYLSSYDLKFYEKPQVEKEPNFCWHIGYSN VYGNDF >gi|226332055|gb|ACIB01000001.1| GENE 131 157952 - 159466 1327 504 aa, chain - ## HITS:1 COG:no KEGG:BF1580 NR:ns ## KEGG: BF1580 # Name: not_defined 
# Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 504 1 504 504 956 99.0 0 MSLFISLFLTGCEKELVEVPGEVPGQLPPEGKPSDLPDGYFEVTFSAGYGGGDTRTPVTG TDSRVRHLRYVIYKSTGEFVKEKVVLNTTDAAPSWPMAALKDTLPKGQYTAVFLANVEKT QFPIPVSGGGTNYADVLTNYQTTRANARIVLPGAEFTDTSEFYLANVSFSDTSAQPYVLL QRIISMLNLRRVFVDAQTALNSLTNNIVTQVGYKNIIRTTVQGILPGLLKTAMNLGPVGN LVYDVVGGLDAAVNLVVAALVEPVVDALYAQLLQGLVNQIGLTLSGNATQNGALGALGDL LNPWRGSDAAYALVTINNFPKTMDLDLNVKDFYTGNHRFRYGFTPTPGTTNSEKDILIRG FHGLYDVREIHVAKPGLISGLLVDDVIDGPLLLNGVFVNITDPIQATVNTNYRYRSNYSF VSLGLKSYAQQSDGNHSLTLSVQLKNIANLDGILGGIPVLGPVLNGTVRLLIGNITVTVP VNLPLLGTDNLTLSGSWSTPPVQY >gi|226332055|gb|ACIB01000001.1| GENE 132 159516 - 160544 818 342 aa, chain - ## HITS:1 COG:no KEGG:BF1564 NR:ns ## KEGG: BF1564 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 342 1 342 342 645 98.0 0 MAMRKTGYITILLTGFLATLLLGSCSTDDVAELPLEEGVPVRFEIKRDGIMSRAPGDAAL SVNRILILPFRKTDEASSNDAANFIPEYSAARQLDVNSFPAVVTMLTLSAASTYQLLVIG YNRSDYDFTGGGGATKRFNIGSTDSPATLANLYLQPVNPTVVPEFFSCFGNGYRGATLVG PIFKPSQINYVTGTLKRLVSGFTLEVTNVPAYVNSMTLIAEQLVTATRATDGTALTWQTA GDGGTKTLATQAPVSGKVSFNQFLLAIPDSRKTLFYLDVSYGIFTERYTVKLPDTPGVVS GNRIIFTPNHWVKVTGNYANINIGFTLAGNINLDDNAWDGLQ >gi|226332055|gb|ACIB01000001.1| GENE 133 160571 - 161656 1179 361 aa, chain - ## HITS:1 COG:no KEGG:BF1563 NR:ns ## KEGG: BF1563 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 361 29 389 389 674 99.0 0 MKKLMFLVAASLFVFAACSSEDDSSPEVNPENAAITFELSAVNGLTDGIGTRMPVYSQEA TQHVTRVSVYAFVQNGSTYLHQKTYDITGWTDGTTFKRFAVPDADKLPVGVYKFLAVGRD ASDRFSVTTPTSGNTNYTDMLASIVNSGDESEIFAGSADAEVMAQGGTRVSIEMTRKVAG VLGYFKNVPQVLNGSTVKYLRLKVSNSNQQVNLTNGVGINTAPTPYNIIDMDLSGQAVSN GVYVGNDLSGQGVVKVPNSQLGGSFYIPVSGVSMTLGLYDASGVAIKEWTVSDTNSSGAT QFNLMANHFYSLGVKGATGSVDGGTPGNPGDDDAPVDLLTDQNIVITISPAWELIHNLVI Q >gi|226332055|gb|ACIB01000001.1| GENE 134 161700 - 162563 733 287 aa, chain - ## HITS:1 COG:no KEGG:BF1562 NR:ns ## KEGG: BF1562 # Name: not_defined # 
Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 287 1 287 287 583 99.0 1e-165 MNRRVIQIFCMVVGMVLASSCGDECPVEQPYSVRVSVKDKNYLNISQFPQLSQIDENLPF RTYAGTLYYALYDASTGALIRESAVVSTEGEEKEYTLTFPGVPDGDYKLAVWGNLTTDYP AGILHQDGKEHTDIYVTSGDLHFSPDYQTEELTLERTKGKLLLLCSNFPSEITRIEQNVS HVYQSTDAGLNYSGSTQVDKNVPFTSVLETLLAPTSAEGHSKLTLTFYTGGTRASETPYL KLPVMEMDMRRNEITAVSIDYNTSEGIWEIKMFIRGEWITIHRLDIF >gi|226332055|gb|ACIB01000001.1| GENE 135 162853 - 163155 345 100 aa, chain + ## HITS:1 COG:DR1844 KEGG:ns NR:ns ## COG: DR1844 COG2388 # Protein_GI_number: 15806844 # Func_class: R General function prediction only # Function: Predicted acetyltransferase # Organism: Deinococcus radiodurans # 10 96 7 93 93 67 39.0 7e-12 MTENYELIDNEEKHQYEFHVEGYVPRIEYIKSLNGEIYLTHTEVPAALGGHGIGSQLAEK VLTDIERQGLRLVPLCPFVAGYIHKHPEWKRIVLRGIHIQ >gi|226332055|gb|ACIB01000001.1| GENE 136 163169 - 163384 290 71 aa, chain + ## HITS:1 COG:no KEGG:BF1560 NR:ns ## KEGG: BF1560 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 71 1 71 71 137 98.0 1e-31 MGKQIEYSNGELTIVWQPELCRHAGICVKTLPNVYHPQERPWIKMENATTEELIAQIKMC PSGALSYKLKK >gi|226332055|gb|ACIB01000001.1| GENE 137 163469 - 165712 1473 747 aa, chain - ## HITS:1 COG:no KEGG:BF1574 NR:ns ## KEGG: BF1574 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 747 22 768 768 1548 99.0 0 MSVSQRLSGHPSELIYLQTGKDIYETGEDLWFKAYQLDAKSFALSEQSETLYLQMLNDKD SVVWAEKYPIEKGIAEGHVYIDTKLSEGNYRLGTYTRHSYYNDTTGISPERKIRIVRNIA LDSLPEHREKPGEFRFNLFPEGGNLVSGLPFRLAFKATGSGGCPVDVEGTLYQDEIPVLS FKSSHDGMGVIPFTPSSGKEYRIELANGYSYALPEIYRQGMGLRLSGRDGKQLEFLISQT EGLSDQEVYLVGQMRGTVCCVAKGLLKDRLKMKIPLSEFPYQGIAEFTLFNAAMQPVAER LVYVHPEKKLHIDIVTEKESYVLREKATLKVKVTDDNGQPVKADLGISVFDKAYSNPDDR VNMLAYCYLSSQIRGAVCRPAYYFDEKNADRMQAMDLLLLTQGWRRYVWELNGTVRHGEM FLRDDVTGIQTLGSKKKSKGTGGAKQLIQVSGAEGNSTYLMTDSLGRFTVDTDLMKTLRG GYVYLKPMLSKEFKPELEIQDYFPAIDSIRRKKSFDCSLINCTQRPKEEIYDAPVVSGDS 
TILLDEVVIARKARKPFRDKLMGRLDSLAQMNLNSVYVCTSCGLLLNYNPDYQGHHALVG EGGCPAKGRKQPVDGETYRIAKYKYYGDAKGGGVYFSVVDAHSVVYHATEFTEEELLRMN NMWRVKGYYGKREFYQPDEIDMQLPVPDARNTLLWAPSVVTDEKGEATVSFYCSDINTGF IGVAEGVDGTGLLGTDQCEFRVIRRAD >gi|226332055|gb|ACIB01000001.1| GENE 138 165793 - 166467 348 224 aa, chain - ## HITS:1 COG:no KEGG:BF1573 NR:ns ## KEGG: BF1573 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 224 1 224 224 428 99.0 1e-119 MALLREYKEIAYQWGIWKTEESPEELLALLPDPERYEQQLTLFSSPHRKLEWLSVRVLLY QLLGEEKTIEYAPSGKPHLADSSYFISISHTRGYVAVILSAVSEVGIDIEQYGQRVHKVA HKYMRPDELISEYQGEDTWSLLLHWSAKEVMFKCMDTSEVDFREHLRIMPFQVCEHGEFP AEEYRTEHKKKFMIRYLLHPDFVMTWQVTSAYSVNECDIMKKMQ >gi|226332055|gb|ACIB01000001.1| GENE 139 166482 - 167828 1058 448 aa, chain - ## HITS:1 COG:FN1486 KEGG:ns NR:ns ## COG: FN1486 COG1253 # Protein_GI_number: 19704818 # Func_class: R General function prediction only # Function: Hemolysins and related proteins containing CBS domains # Organism: Fusobacterium nucleatum # 40 446 17 426 426 207 33.0 4e-53 MDPDAYLCQLAEIFNGISVNTPSLSAIIAIILAGLLLLASGFASASEIAFFSLSPSDLND IEEGNHPSDGKISNLLADSERLLATILITNNFVNVTIIMLCNFFFMNVFVFHSPLAEFLI LTVILTFLLLLFGEIMPKIYSAQKTLAFCRFSAPVIYMLRKVFAPISAVLVHSTAFLNKH FAKKNHNISVDELSHALELTDKAELTEENNILEGIIRFGGETAKEVMTSRLDVVDLDIRT PFKDVIQCIIDNAYSRIPIYSGTRDNIKGVLYIKDLLPHLNKGDNFRWQSLIRPAYFVPE TKMIDDLLRDFQANKIHIAIVVDEFGGTSGIVTMEDIIEEIVGEIHDEYDDEERTYTVIN DHTWVFEAKTQLTDFYKITKVDEDDFDKVDGDADTLAGLLLEIKGEFPALHEKVLYHRYE FEVLAMDSRRILKVKFTVNEPSTEEEAS >gi|226332055|gb|ACIB01000001.1| GENE 140 167964 - 168422 380 152 aa, chain - ## HITS:1 COG:XF1644 KEGG:ns NR:ns ## COG: XF1644 COG0629 # Protein_GI_number: 15838245 # Func_class: L Replication, recombination and repair # Function: Single-stranded DNA-binding protein # Organism: Xylella fastidiosa 9a5c # 3 151 5 149 151 107 38.0 5e-24 MSVNKVILIGNVGKDPEVRYLDTGIAVASFPLATTDRAYTLSNGTQVPERTEWHNLVLWR GLAETAEKYVHKGDKLYVEGKIRTRSYDDQNGAKRYVTEIFVDNMEMLTPKGTGSGSYAP 
AQQQTAAPVRPQSQQPQQPVSSQDNSADDLPF >gi|226332055|gb|ACIB01000001.1| GENE 141 168510 - 170078 1604 522 aa, chain - ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 38 415 31 382 497 170 31.0 5e-42 MKRPYLLLACLSPVACLMAASGQKGGKNKQKVNDRQLPNVVFIYADDLGYGDLECYGAKN VQTPNVNRLASEGIRFTNAHATAATSTPSRYSMLTGEYAWRRPGTDVAAGNAGMIIRPED YTMADMFKNSGYVTAALGKWHLGLGDKSGEQDWNAPLPAALGDLGFDYSYIMAATADRVP CVFIENGKVANYDPSAPIEVSYRKPFEGEPLGKDHPELLYNQKHSHGHDMAIVNGIGRIG YMKGGGKALWKDENIADSITTHAINFIREHKDEPFFMYFATNDVHVPRFPHERFRGKNPM GLRGDAIVQFDWSVGQILETLDKLGLSENTLIILSSDNGPVVDDGYQDRAEELLNGHSPA GPLRGNKYSAFEGGTRIPAIVRWPKKITQPQVSDVLVSQIDWLASLASLVDARVPKGAAP DSFDRLGNWLGTDSTDRPWVIEQASNHTLSVRTKDWKYIEPNDGPHMITWGPKIETGNLS IPQLYDMTKDYEQENLAEKNPAKLFELQTILRKVRNKTYRAL >gi|226332055|gb|ACIB01000001.1| GENE 142 170124 - 171170 686 348 aa, chain - ## HITS:1 COG:L0296 KEGG:ns NR:ns ## COG: L0296 COG1194 # Protein_GI_number: 15672823 # Func_class: L Replication, recombination and repair # Function: A/G-specific DNA glycosylase # Organism: Lactococcus lactis # 3 320 9 332 387 233 37.0 6e-61 MNRNFSNAIENWYKEYKRELPWRDSADPYVIWISEIILQQTRVVQGYDYFVRFMKRFPDV ATLAEADEDEVMKYWQGLGYYSRARNLHAAAKSMNGVFPKTYPEVRALKGVGEYTAAAIC SFAYNMPYAVVDGNVYRVLSRYLGIDTPIDSTEGKKLFAAVADELLDRKNPALYNQAIMD FGAIQCSPQTPNCMFCPLADSCAALAKGTVAELPVKQHKIKTTNRYFNYIYVRMGVHTFI NKRTGNDIWRNLFELPLIETPVAVSEEEFLALPELKALFASKELPVVRSVCRDVKHVLSH RVIYANFYIVDLPEDSHSFAAYQKIKAEELEQYAVSKLVHAFIEKYID >gi|226332055|gb|ACIB01000001.1| GENE 143 171376 - 171651 301 91 aa, chain + ## HITS:1 COG:lin2048 KEGG:ns NR:ns ## COG: lin2048 COG0776 # Protein_GI_number: 16801114 # Func_class: L Replication, recombination and repair # Function: Bacterial nucleoid DNA-binding protein # Organism: Listeria innocua # 3 90 4 91 91 59 40.0 1e-09 MTKADIVNEIAKNTGVDKVTVLTTVEAFMDAVKDSLSKDENVYLRGFGSFVVKKRAQKTA 
RNISKNTTIIIPEHNIPAFKPAKTFTLSVKK >gi|226332055|gb|ACIB01000001.1| GENE 144 171930 - 173504 1727 524 aa, chain + ## HITS:1 COG:XF1125 KEGG:ns NR:ns ## COG: XF1125 COG1530 # Protein_GI_number: 15837727 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribonucleases G and E # Organism: Xylella fastidiosa 9a5c # 1 438 1 425 497 278 37.0 2e-74 MTSELVVDVQPKEVSIALLEDKSLVELQSEGRNISFSVGNMYLGRVKKLMPGLNACFVDV GYEKDAFLHYLDLGPQFNSLEKYVKQTLSDKKKLNPISKATLLPDLDKDGTVANTLKVGQ EVVVQIVKEPISTKGPRLTSEISFAGRYLVLIPFNDKVSVSQKIKSSEERARLKQLLMSI KPKNFGVIVRTVAEGKRVAELDGELKVLLKHWEESITKVQKATKFPTLIYEETSRAVALL RDLFNPSFENIYVNNEAVFNEIRDYVTLIAPERAGIVKLYKGQLPIYDNFGITKQIKSSF GKTVSYKSGAYLIIEHTEALHVVDVNSGNRTKNANGQEGNALEVNLGAADELARQLRLRD MGGIIVVDFIDMNEAENRQKLYERMCANMQKDRARHNILPLSKFGLMQITRQRVRPAMDV NTTETCPTCFGKGTIKSSILFTDTLESKIDYLVNKLKIKKFSLHIHPYVAAYLNQGLMSL KRKWQMKYGFGIKIIPSQKLAFLEYVFYDSHGEEIDMKEEFEIK >gi|226332055|gb|ACIB01000001.1| GENE 145 173563 - 174510 748 315 aa, chain - ## HITS:1 COG:PA3145 KEGG:ns NR:ns ## COG: PA3145 COG0472 # Protein_GI_number: 15598341 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Pseudomonas aeruginosa # 4 272 9 294 339 83 29.0 4e-16 MYYLIILVLLFLAELFYFHIADKYNIIDKPNKRSSHTRITLRGGGIIFYLGALAYFLTNQ FEYPWFMLALTLVTVISFVDDIRSISQGLRLVFHFTAMGLMFYQWELFTLPWWTVVVALI TCTGIINAYNFMDGINGITGGYSLVVLGSLAFINHWVVSFVEPGLIYTMLCAVLVFNFFN FRKRAKCFAGDVGSVSIAFIILFLIGKLIIDTEDFSWIVLLAVYGADSVLTIVHRLMLHE NIALPHRKHLYQIMANELKIPHVAVSLTYMVVQGVVVAGYLVLREYGYVYLAGSILLLSI LYLLFMKRFFHLHQF >gi|226332055|gb|ACIB01000001.1| GENE 146 174629 - 175525 727 298 aa, chain - ## HITS:1 COG:ECs2847 KEGG:ns NR:ns ## COG: ECs2847 COG0451 # Protein_GI_number: 15832101 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Escherichia coli O157:H7 # 2 279 4 294 331 106 28.0 
6e-23 MNLLFTGASGFLGYNVLPLLKKKYSVETIGLTSQDDYIVNLAAEIPKLAGKYDVILHVAG KAHSLPKTEAEKRLFFDVNLQGTKNLCTALEQSGIPKSFIFISTVAVYGCDSGENITEEY PLNGTTPYALSKIKAEKYLEEWCAMHNVKLSILRPSLIAGPNPPGNLGAMIHGIENGKYL SIAGGKARKSVLMVQDIANLLPMLIEKGGIYNVCDSYQPSFRELEMVICKQLNKKLPLSI PYWFAKSMAVLGDCFGENTPINSLKLRKITNSLTFSNEKAVRELGWKPMNVLKNFRIE >gi|226332055|gb|ACIB01000001.1| GENE 147 175537 - 176358 293 273 aa, chain - ## HITS:1 COG:RSc0688 KEGG:ns NR:ns ## COG: RSc0688 COG1216 # Protein_GI_number: 17545407 # Func_class: R General function prediction only # Function: Predicted glycosyltransferases # Organism: Ralstonia solanacearum # 2 263 1 261 275 172 36.0 7e-43 MLDKITATIVTYKNPDSVLLKAINSFLNTKIEVRLYIIDNSPTNYLKDISHDPRVEYIFM NSNNGFGAGHNVILRDPEKMGKYHLILNPDISFEEGTLEKLYDYMEGKPDVGNVMPKVIY PNGELQYLCKLLPTPKDWIVRMFLPIKSIKNRIDYNFEMKFADYDREMNIPYLSGCFMFL RKSVIEGIGVFDEGIFMYGEDTDLNRRIYRKYKTMYYPQVTITHHFEKGSHKSLRLLWIH VKAAIYYLNKWGWFFDKERSIINITVKQQYIRK >gi|226332055|gb|ACIB01000001.1| GENE 148 176371 - 177414 185 347 aa, chain - ## HITS:1 COG:SMc01220 KEGG:ns NR:ns ## COG: SMc01220 COG0438 # Protein_GI_number: 15965324 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Sinorhizobium meliloti # 179 340 170 335 340 71 27.0 2e-12 MFLPNLSNTGPNIVAKDLCTGLVRKGHVCQVFYFDDIVELDMPCSIERITNMKRFDFSNW DIIHSHMFRPDLYIRLHTLFSRKKHRTRFISTLHNPISYRALKIDYPLVHSLIGSFLWKF ALRAFNELVVLNEDTYQQLSKVSKEHLHIIHNGRDIIPSAVSNEKDLEEIRKLKQKYIIV GTVSRIIKRKGIEQMIRALVLLPNYAFVVVGDGSELENIKILAKELSVSERCYWVGYRED ATSYQSLFDLFVMCSRSEGFPLALIEAAGYGVPSILSDISIFKSIMTEREVLFYKLDNID SLVSAIHVATRNREKFANRIYDYYIKNLTVNSMTDKYLELYIHSLNS >gi|226332055|gb|ACIB01000001.1| GENE 149 177426 - 178481 268 351 aa, chain - ## HITS:1 COG:no KEGG:BT_1343 NR:ns ## KEGG: BT_1343 # Name: not_defined # Def: putative capsule biosynthesis protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 350 1 349 355 181 36.0 3e-44 MLPYYAIFTFLAFLSLAESQSVKRKQRSLLLSGVWLLLTLFAGLRYNNADWNSYFDFYKN 
IANGSGEGSADIGFNLICLFLSIFSNSPILMFVVVAGTSVALNLNSFKKYSPFFLICVLY YFVHLYVLKEMIQIRGGLASAICLYSIRFLFNRKYKSFWLFWLLALSIHFSVIVWALVGL VYKYQPSLKTLKWTLFICFAVGLICPLGQFIKLLAVGVDARLGAYIAYGDSEYAAALGIF TNINAIKSLIVGIILLYFHDKMKNISPYFSPLLYAYILGVCWLMLFNDFAIIGGRMSGVL LCVEPVLVSYLTILLSKRTKWFFVSILIVVTYTMLSLNVSPDKISPYQFYF >gi|226332055|gb|ACIB01000001.1| GENE 150 178486 - 179205 234 239 aa, chain - ## HITS:1 COG:FN1241 KEGG:ns NR:ns ## COG: FN1241 COG3774 # Protein_GI_number: 19704576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Mannosyltransferase OCH1 and related enzymes # Organism: Fusobacterium nucleatum # 1 210 1 205 243 121 39.0 1e-27 MIPKTIHYCWFGGKPLPPLAIKCLESWNKYLPDYEIIEWNESNFPVNSIPYTKEAAAKGK WAFVSDYARFYILYHNGGIYLDTDVQILKSLNPFLKHHSFSGFESKDRVAPGLILGAEKG CSLMKKLMNSYHERHFINQNGLLNEKTVVSYMTEELVSEGLILNGELQNIHDFIVYPIDF FSPKSLETGKLKITSNTYSIHHYAGSWMSNTSRLKRYVYLLIAKVPFVYKMYNKIYRKY >gi|226332055|gb|ACIB01000001.1| GENE 151 179222 - 180289 375 355 aa, chain - ## HITS:1 COG:BS_yveR KEGG:ns NR:ns ## COG: BS_yveR COG0463 # Protein_GI_number: 16080483 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Bacillus subtilis # 7 221 6 218 344 86 32.0 8e-17 MNLPCKVTLAMPVYNVEKYIERALLSALNQTFESIEYLVIDDKGQDESMNIVRRIVTKHP RGKYVRIIDHGENRGTGATKNTAIKEAKGKYLYFMDSDDEIIPNCIALLYDKIHNTDLDW VMGAYSEITFSTNKIINSIGENRQLRGKYCLVEHILKGHFVSIATWNKLYELDFLRKNKI KCVSNHLNEDAWFFFQLMLMTRKCILIPEITYYYYKTESSITDFRSYTEEKKQRITSQYI EIDLLKKNLLNQYTFDKSYPLLLCVTMRESFRYAYEVSCLNVSGENMNMFSKFYMRYKNN SSRIAVRQFMSYPVSIGQIFKMPRKRIEIFLYWLISFIPYFFQYVFIVIVRFIRK >gi|226332055|gb|ACIB01000001.1| GENE 152 180286 - 180984 364 232 aa, chain - ## HITS:1 COG:no KEGG:Geob_1473 NR:ns ## KEGG: Geob_1473 # Name: not_defined # Def: putative transferase # Organism: Geobacter_FRC-32 # Pathway: not_defined # 3 209 104 313 329 139 37.0 7e-32 MPASPWFVGAQMLSRNPWVWDENNYLDYNYFCLETGNNSINKLAVVTSNKVTTKGHQRRL DFIYKLKELLSDLVDIYGTGFIPIKDKYEIYSKYKYALIIENSNYLDYWTEKIADCYLSN 
CFPFYIGCPNIKTYFPDSSLEELSFDNLEYAVKAIKEAIMNDRYNRVKPILKEAKRMVLD KYNIFFVILQYIKQCDRQNYTLKEDIEINPISCFTSDLGYRIKNRIIRLIGE >gi|226332055|gb|ACIB01000001.1| GENE 153 181299 - 181604 140 101 aa, chain - ## HITS:1 COG:no KEGG:Rmar_1143 NR:ns ## KEGG: Rmar_1143 # Name: not_defined # Def: hypothetical protein # Organism: R.marinus # Pathway: not_defined # 1 101 253 352 368 87 37.0 2e-16 MLSQSKIVLCPKGFHSTECFRHYEALKQGCIVISEKLSDSYLYNNSPIIQLDNWNGIRKI VNDLLQDESLLMKKSEESLRWYENVMSEKATAMYVLSKIEQ >gi|226332055|gb|ACIB01000001.1| GENE 154 181805 - 182233 196 142 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|265762935|ref|ZP_06091503.1| ## NR: gi|265762935|ref|ZP_06091503.1| predicted protein [Bacteroides sp. 2_1_16] # 1 139 1 139 215 254 100.0 1e-66 MRYFLDIREVDNIYYERNSDSNFEDLNECQFLINFFKELRLKCINFNSYNFYIYSTKNPT LPPSSFDLPNTGKDILLFLSDETGELPLHLKQRYKCIFKPYIRKDYDNIYPFPLGYVNND VSLEYIPIKDRCYNVFFSGILI >gi|226332055|gb|ACIB01000001.1| GENE 155 182247 - 183107 434 286 aa, chain - ## HITS:1 COG:no KEGG:BF1837 NR:ns ## KEGG: BF1837 # Name: not_defined # Def: putative alpha-1,2-fucosyltransferase # Organism: B.fragilis # Pathway: not_defined # 3 286 2 286 289 187 37.0 4e-46 MQVVARIIGGLGNQMFIYATARALALRIDADLILDTQSGYKNDLFKRNFLLDSFCISYRK ANCFQKYDYYLGEKVKSLGKKIHFSVIPFMKYISENTSCDFVDGLLKKHILSVYLDGYWQ NEAYFKDYASIIKKDFQFCQVNDLRTLSEAEIIKKSITPVAIGVRRYQELNSHQNTKVTD LDFYQKAINYIESKVDNPTFFIFSEDQEWVKNNLEQKSNFIMISPKEGNYSALNDMYLIS LCKHHIVSNSSFYWWGAWLANNKNKIVVASDCFLNPQSIPDSWIKF >gi|226332055|gb|ACIB01000001.1| GENE 156 183095 - 184027 381 310 aa, chain - ## HITS:1 COG:BS_gspA KEGG:ns NR:ns ## COG: BS_gspA COG1442 # Protein_GI_number: 16080894 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases # Organism: Bacillus subtilis # 1 266 7 270 286 130 31.0 4e-30 MNILCCLDDNYVQHTSVMLTSFFINNDFEHHNIYVITMQLNDGNVAYLREVVNKYHSNFY LYQVNEAMLSGFVRKETDYVSLAAYLRLFSTQVLPFNCSKVLYIDGDIVVRKSLEELWKM DIENYAVAAVDETIKANCIRHNYDVTLGYFNSGFMLINLSFWRENSVAEKAIDYMKRFPE 
RIKSWDQDALNGILYGGLWKRLDLKYNLTTIFLCKQYVEGQDFPKIYTEEYNSAISDPAV VHYTGPDKPWKYTVVDHPFKKDYLQYARMLGINHDFNISIFFKRIVRKLLCATGVLKNSY VSINNSICKL >gi|226332055|gb|ACIB01000001.1| GENE 157 184032 - 185465 460 477 aa, chain - ## HITS:1 COG:BS_tuaB KEGG:ns NR:ns ## COG: BS_tuaB COG2244 # Protein_GI_number: 16080613 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Bacillus subtilis # 4 460 3 466 483 166 27.0 1e-40 MVESLKHQAIKGVVWSAVERFSVQGIQFVLSIIIARLVAPSEYGLIAMLGIFLAIAQIFI DSGFSNALIQKNDRTDIDFNTVFYFSSVISIIVYGLLYLLAPFIALFYHEPLLVKLLRFI GLGLIISNISIIQRTKISISLNFKLLARVSLTSVVISGAIGIFMAIKGYGVWALAVQSFM NATFNTVFLFFFVRWHPSWSFSFSSFKILFSFGSKLLIGGLLHVIYTNLYTMVIGRKFTS AEVGLFNRAQTFSTFPAINVTDISSRPIYPLMCEIQDDGDKLRIAFLQYLRMMSYIIFPL MIGLSVLSTPFINLILTETWKGAAPLLSILCLSYMWYPVMNINWQILNVKGRSDLSLKSE IIKKTVAFIILFSTMPWGLEIMCWGLVLYSIIDIIIVIYFVRRVISVGYTAQATSILPTF VAALLMGGGVYLVVSLFSSSLLQLVIGILSGIVFYVLISILFNIPELRSITNIISKK >gi|226332055|gb|ACIB01000001.1| GENE 158 186481 - 187416 932 311 aa, chain - ## HITS:1 COG:XF0611 KEGG:ns NR:ns ## COG: XF0611 COG0451 # Protein_GI_number: 15837213 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Xylella fastidiosa 9a5c # 3 307 22 325 329 471 69.0 1e-133 MKRILVTGGAGFIGSHLCERLLNEGNDVICLDNYFTGSKDNIRHLLDNHNFELVRHDVTT PYYAEVDEIYNLACPASPPHYQYNPIKTMKTSIYGAMNMLGLAKRTRAKILQASTSEVYG DPSIHPQVEAYWGNVNPIGIRSCYDEGKRASETLFMDYHRQNGVRIKIIRIFNTYGPRMN PNDGRVVSNFIAQALRNQDITIYGNGSQTRSFQYVDDLIEAMTRMMATDDSFIGPVNTGN PGEFTMLELAQKVIDLTNSKSKIVFCPLPSDDPKQRRPDISLAKEKLAGWEPRIKLEEGL KKTIEYFASIV >gi|226332055|gb|ACIB01000001.1| GENE 159 187424 - 188740 1137 438 aa, chain - ## HITS:1 COG:XF1606 KEGG:ns NR:ns ## COG: XF1606 COG1004 # Protein_GI_number: 15838207 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted UDP-glucose 6-dehydrogenase # Organism: Xylella fastidiosa 9a5c # 1 437 
1 444 450 488 54.0 1e-138 MNIAIVGTGYVGLVSGTCFSEMGINVTCVDVDEKKIQKLQDGVMPIYEPGLDELVERNVK AGRLHFTTDLTTCLDEVEIIFSAVGTPPDEDGSADLKYVLEVARTVGRNITKHVVLVTKS TVPVGTAKKVRAVIQEELDRRGTDLEFDVASNPEFLKEGAAIKDFMAPDRVVVGVESEKA KKIMERLYRPFTLNGYPILMMDVASAEMTKYAANAMLATRISFMNDIANLCERVGANVDN VRKGMGADSRIGSRFLYAGCGYGGSCFPKDVKALVHTGIQNGYHMQVIEAVEAVNEKQKS IVFDKLLKAFGGNLQDKIVAMWGLSFKPETDDMREAPALVVIEKLLQAGAIVKVFDPVAM EETERRIGKKVIYCKDMYEAVIDADAIALMTEWKQFRMPSWAIIRKAMKNFVVVDGRNIY DGEELKELGFTYSRIGQK >gi|226332055|gb|ACIB01000001.1| GENE 160 188760 - 189149 281 129 aa, chain - ## HITS:1 COG:no KEGG:BF1553 NR:ns ## KEGG: BF1553 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 129 5 133 133 254 100.0 7e-67 MFLNELKDIAWQSAQIVEEYNPARIRKDACGAWIAYADFNNRNSLFGWELDHIYPVSRLK LQNVPEELWDNPLNIRAFHWQNNQSKGNSYPMYTAVVSDEGATNMKCEAVYLVNEALQYS LRKLFKITE >gi|226332055|gb|ACIB01000001.1| GENE 161 189170 - 189619 370 149 aa, chain - ## HITS:1 COG:MA3780 KEGG:ns NR:ns ## COG: MA3780 COG1898 # Protein_GI_number: 20092576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Methanosarcina acetivorans str.C2A # 2 127 45 170 183 177 66.0 4e-45 MFVQDNESKSSYGVIRGLHFQKPPFAQSKLVRVVKGAVLDVAVDIRKGSPTFGKHISVEL TEDNHRQFFIPRGFAHGFSVLSEEVVFQYKCDNFYAPQCEGAIAWDDPDLGIDWKIPMEE VILSEKDSCHSALKDAAWLFDYYDKQDML >gi|226332055|gb|ACIB01000001.1| GENE 162 189745 - 190635 828 296 aa, chain - ## HITS:1 COG:YPO3861 KEGG:ns NR:ns ## COG: YPO3861 COG1209 # Protein_GI_number: 16123996 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-glucose pyrophosphorylase # Organism: Yersinia pestis # 1 289 1 286 293 408 67.0 1e-114 MKGIVLAGGSGTRLYPITKGVSKQLLPIFDKPMIYYPISVLMLAGIREILIISTPYDLPG FQRLLGDGSDFGVQFEYAEQPSPDGLAQAFIIGEKFIGGDSVCLILGDNIFHGNGFSAML KEAVRIADEKQEATVFGYWVNDPERYGVAEFDKGGKCLSIEEKPKVPKSNYAVVGLYFYP NKVVEVAKNIKPSARGELEITTVNQYFLKKEQLKVQTLGRGFAWLDTGTHDSLSEASTFI 
EVIEKRQGLKIACLEGIALRQGWINSDKMKKLAQPMLKNQYGQYLLKVIDELAADQ >gi|226332055|gb|ACIB01000001.1| GENE 163 190672 - 191154 613 160 aa, chain - ## HITS:1 COG:no KEGG:BF1529 NR:ns ## KEGG: BF1529 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 160 1 160 160 294 98.0 6e-79 MNYLESEISALYASAHELCYLGMDGRPIYSDQFTRLNRDVFSQANALYDKHGDSDEEEAR LCLSLLMGYNATLYNNGDKEERIQHILDRCWDVLEHLPASLLKVQLLVYCYGEVFDEELA REAQAIIDTWQDRELSEEEHEVMERLKDVQENPYPWSEVE >gi|226332055|gb|ACIB01000001.1| GENE 164 191166 - 191804 518 212 aa, chain - ## HITS:1 COG:no KEGG:BF1528 NR:ns ## KEGG: BF1528 # Name: not_defined # Def: putative transcriptional regulator UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 27 212 1 186 186 329 99.0 4e-89 MLRYSGVPKEHPDVNDMTTSASIEASMERSQSILSSSALNWYALRITYGRELALQGYLNS EGIENFIPMHYEYTIKNERRVRKLVPAVHNLVFVRSSRSCIDAIKESRSATLPIRYIMDR EYHRPIIVPDSQMRNFMAVSANYDESLLYFEPSELNIRKGTRVRITGGLFEGVEGEFVRV RNDRRVVVTIEGVMAVATTFVHPSLVEPVTEK >gi|226332055|gb|ACIB01000001.1| GENE 165 192528 - 192719 67 63 aa, chain + ## HITS:1 COG:no KEGG:BF3700 NR:ns ## KEGG: BF3700 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 63 1 63 63 94 95.0 8e-19 MDAKEQNIKTCKDSLARYIEEKELFGKMRNGVFKPLVFSTIRNYVNEIWNKMERKKKNQE GKR >gi|226332055|gb|ACIB01000001.1| GENE 166 192815 - 193162 356 115 aa, chain + ## HITS:1 COG:no KEGG:BF1526 NR:ns ## KEGG: BF1526 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 115 1 115 115 203 98.0 2e-51 MKQKKRPASQTEAMKLRWKKRIVFEKGYTEMCAEWMAERLEALTDHLQYGHAAIAYQKQN GDFRLVKATLIYYEAEFHKKYEPTKIEGAVVYWNVDEQRWTTFQVENFMEWRPIV >gi|226332055|gb|ACIB01000001.1| GENE 167 193305 - 194219 749 304 aa, chain + ## HITS:1 COG:no KEGG:BF1525 NR:ns ## KEGG: BF1525 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 304 1 304 304 579 98.0 1e-164 MATIYDGINYFPVGVNFMEENAMEVIEAKYGIKGSAIVLKLLCKIYKEGYFIRWDEEQCL 
IFANKAGREVQAAEVQGIIEILFIKGIMDKNSYLENGILTSENIQKVWMEATKRRKRELS ELPYLMVKTEKENDKPEKESGKPDNASTQQEVEQPKPLKEGKAAVGTGDVAVSPGNVVHD VAVDAKNACNSGQSKVKKSRAKENKELPPSAPPEGKEEERMEDSASLPIPGYAFNTMTHN YPGLTDTLQRLGINEVGEVNAILRLSDYGRKGTTVWRLIANTCWSDIGAKGRYLIAALNR AKRK >gi|226332055|gb|ACIB01000001.1| GENE 168 194686 - 195684 514 332 aa, chain + ## HITS:1 COG:no KEGG:BF1523 NR:ns ## KEGG: BF1523 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 165 332 1 168 168 305 98.0 1e-81 MKYFVSLFLSLFFLVFMSCGNEDNAWDNNIPIITPNEQSDSSGLLKNQITDIINYSKINF DENFNNTELYKNLILSPKWENVSMVLQKQDTLHLCVPLLAQDNPEHNSYYLFISNIKSAN IIRFTIIGLPENFWDIINAPITRAGIIDAGWIPEVTILGDLCRQMSHICSDPYDEAFLEF LHSKWLKEHGNESSSDSSSSGGDYSRLTEAEKRFLMRHPQVIKKFHDNARKDSEAAKKFP GQHNGEGDAVRHVYWSALNTLSENANLAKEFGDAHEQNPGQDIAEKNMDLFNNSIGYQLG DLAKQNKWSEERLFKEIIKYKNDGKLQTKLHP >gi|226332055|gb|ACIB01000001.1| GENE 169 195653 - 196102 84 149 aa, chain + ## HITS:1 COG:no KEGG:BF1522 NR:ns ## KEGG: BF1522 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 149 2 150 150 272 99.0 3e-72 MENYKQNYIHKPYLFLAILFSLLSCQKEVVSKVTFERKLSGIKPETEFRLDSLRNDKWQK CYIIPPYQQYNSALNRIKLRKHDLNKIKENAISDGINTFVFINNDGSISIETVSRSIIDI QDTLLDSIFLFYPTTIMKMDSKRKIIDIK >gi|226332055|gb|ACIB01000001.1| GENE 170 196301 - 196702 252 133 aa, chain - ## HITS:1 COG:no KEGG:BDI_0882 NR:ns ## KEGG: BDI_0882 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 21 133 8 120 120 189 88.0 2e-47 MILGAKRLVVTIYIQYHLCLKYEFALVRVKELLPLVDDNIPANDKNAVELSVMSDIVIAY GKEHYSIEKPTVAELIELYLEEKGMSQKQLAIEIGISLSRVNDYIAGRSEPTLKIARLLC RVLNIPPVAMLGF >gi|226332055|gb|ACIB01000001.1| GENE 171 197033 - 198022 650 329 aa, chain - ## HITS:1 COG:no KEGG:BF1454 NR:ns ## KEGG: BF1454 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 8 329 1 322 322 550 98.0 1e-155 MKNKFRNLFLAFGILAVIIMLFTFDVSYDELLDNLRRAGFYLPLVLVLWLFIYLINTLSW 
YIILRSSGPVNSLSFARLYKFTVSGFALNYVTPVGLMGGEPYRIMELTPYVGVERATSSV ILYVMMHIFSHFCFWLSSVLIYVFFYPVGWGMGIVLGLITLFCLLLVTLFIKGYRNGMAV ACIRLGSHIPFLKKHAVRFAELHKEKLETIDSQIALLHQQRKSTFYSALGLEYTARIVGC LEVWLILNVLTTDVSFVGCILIVAFSSLLANLLFFLPMQLGGREGGFALAVAGLSLSGAY GVFAALITRVREMVWIVIGLVLMKIGNRR >gi|226332055|gb|ACIB01000001.1| GENE 172 198154 - 199851 1975 565 aa, chain + ## HITS:1 COG:STM0870 KEGG:ns NR:ns ## COG: STM0870 COG2985 # Protein_GI_number: 16764232 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Salmonella typhimurium LT2 # 14 564 15 556 561 241 29.0 2e-63 MDWIVHQLRVHPELAIFLTLFMGFWIGKIKIGKFSLGVVTSVLLVGVLVGQLDITVDGPI KSVFFLLFLFAIGYKVGPQFFRGLKKDGLPQMGFAAIMCVFCLIIPWILAKIMGYNVGEA AGLLAGSQTISAVIGVAGDTINELNISPETKEAYNNIIPVSYAVTYIFGTAGSAWVLGSL GPRLLGGLDKVKAACKELEAKMGNNEADQPGFMAAARPVTFRAYKIANDWFGDGKRVSDL ESYFQENDKRLFVERVRQAGIIVKEVSPTFVLKKGDEVVLSGRREYVIGEEDWIGPEVLD PQLLDFPAEVLPVMVTRKTVAGEKVSTIRALKFMHGVSIRRIKRAGIDIPVLAQTVVDAG DMVELVGTKHEVDAAAKQLGYADRPTNQTDMIFVGLGILIGGLIGALSIHMGGVPISLST SGGALIGGLFFGWLRSKHPTFGRIPEPALWILDNVGLNMFIAVVGIAAGPSFVQGFKEVG LSLFIVGALATSIPLIAGILMAKYIFKFHPALVLGCTAGARTTTAALGAIQEAVESETPA LGYTVTYAVGNTLLIIWGVVIVLLM >gi|226332055|gb|ACIB01000001.1| GENE 173 199893 - 201539 1664 548 aa, chain + ## HITS:1 COG:mlr5693 KEGG:ns NR:ns ## COG: mlr5693 COG0436 # Protein_GI_number: 13474739 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Mesorhizobium loti # 248 358 150 265 388 63 33.0 1e-09 MMKSNENNGAVTKSFAKKMESISPFELKNKLIEMADESIKKIAHTMLNAGRGNPNWIATT PREAFFLLGKFGLEECRRVMYLPEGIAGIPQKDGIAARFETFLKTNHSQPGAELLKGTYQ YMLLEHAADPDTLVHEWAEGVVGDQYPVPDRILQFTEMIVQDYLAQEMCDRRPPKGKYDL FATEGGTAAMCYVFDSLQENFLLNKGDGIALMVPVFTPYIEIPQLRRYEFNVTEISADQM TTDGLHTWQYKDEDIDRLRNPQIKALFITNPSNPPSYTLSPETAARIVDIVKKDNPNLMI ITDDVYGTFSPHFRSLMAELPQNTLCVYSFSKYFGATGWRDAVIALHEENIFDRMIAHLP EEQKTILNKRYSSLTLTPEKLKFIDRMVADSRQVALNHTAGLSLPQQTQMSLFASFAILD 
KENRYKNKMQEIIRRRLKALWDNTGFSLVDDPLRVGYYSEIDMLVWAKIFYGEEFVSYLK KTYSPLDVVFRLANETSLVLLNGGGFAGPEWSVRVSLANLNEKDYVKIGQGIKRILDEYA VKWQESRK >gi|226332055|gb|ACIB01000001.1| GENE 174 201528 - 202820 1016 430 aa, chain - ## HITS:1 COG:BH1920 KEGG:ns NR:ns ## COG: BH1920 COG0642 # Protein_GI_number: 15614483 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Bacillus halodurans # 104 430 188 537 548 97 25.0 7e-20 MSVKGFFFILVFFLVAIMGFLIYISETVVVKYLYIAEALMLLLMLYLILFYRKIVKPMNT IGSGMELLREQDFSSRLSHVGQQEADRVVNVFNRMMEQLKNERLRLREQNHFLDLMINAS PMGVIIMTLDEEVSQLNPMAMKMMGVRPEEAEGRKLSEIDSPLALELAAIPNGETSTVRL NDSSIYKCTHSSFVDRGFQHPFYLMEGLTDEVMKAEKKAYEKVIRMIAHEVNNTTAGITS TLDTVEQALSESEGMEDICDVMRVCTERCFSMSHFITRFADVVKIPEPRFTPTNLNDLAF TCKRFMEGMCNDRNIRLQLICDESLDDVKLDASLFEQVLVNIIKNAAESIGQDGQIIIRT SLPTAIEVVDNGPGISKETEAKLFSPFFSTKPNGQGIGLIFIREVLSRHGCTFSLRTYAD GLTRFRILFP >gi|226332055|gb|ACIB01000001.1| GENE 175 202864 - 204213 1405 449 aa, chain - ## HITS:1 COG:atoC KEGG:ns NR:ns ## COG: atoC COG2204 # Protein_GI_number: 16130157 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Escherichia coli K12 # 2 446 7 456 461 322 40.0 1e-87 MLLIIDDDSGVRSSLSFMLKRAGYQVIAVTGPREAMEVVRSEAPSLILMDMNFTLSTSGE EGLTLLKQVKVFRPDVPVILMTAWGSIQLAVQGMQAGAFDFITKPWNNAALLQRIETALE LTATPKDTPQEQSGTLNRSHIIGKSRGLMEVLNTVARIAPTNAPVLITGESGTGKELIAE AIHINSQRVRQPFVKVNLGGISQSLFESEMFGHKKGAFTDATADRMGRFEMANKGTIFLD EIGDLDPSCQVKLLRVLQDQTFEVLGDSRPRKTDIRVVSATNADLSKMVSEHTFREDLFY RINLITVKLPALRERREDIPLLARHFADRQAEINNLPRTEFSSDALNFLSRLPFPGNIRE LKNLVERTILVSGKEVLDAIDFENQYQRHDESVATSSSFAGMTLDEIEKQTILQALERYK GNLSQVATALGISRAALYRRLEKYDIGDK >gi|226332055|gb|ACIB01000001.1| GENE 176 204363 - 205835 1627 490 aa, chain - ## HITS:1 COG:no KEGG:BF1514 NR:ns ## KEGG: BF1514 # Name: not_defined # Def: putative outer membrane protein OprM precursor # Organism: B.fragilis # Pathway: not_defined # 1 490 1 490 
490 880 100.0 0 MKRYFLLSAFAFCSLALSAQETQEITLNEAIALARTQSVDAAVALNELKTAYWEYRTFRA DLLPEVNFSGTLPSYSKQYNSYQNEDGSYSFVRSNKLGLNGALSIDQNIWFTGGKVSLSS SLDFMKQLGSGGSRQFMSVPIALQLTQPIFGVNNLKWNRRIEPVRYEEAKAAFITATETV TMNAITYFFNLLSAKETLGTARQNQVNADRLYEVAGAKRKMGQISENELLQLKLAALKAR AAVTDAESNLNAHMFRLRSFLAIGNDLILEPVVPESAPNLKMEYNQVLNKALERNSFAHN IRRRQLEAEYEVATARGNLRSVDLFANVGYTGLNKDLSPAYHNLLDNQVVEVGVKIPILD WGKRRGKVRVAKSNRDVTLSKIKKEQMDFDQDIFLLVEHFNNQAQQLSIANEADKIAQQR YKTSVETFLIGKINTLDLNDAQNSKDDARQKHINELYWYWYYYYQLRSLTLWDFQNNTPL EADFEDIVKK >gi|226332055|gb|ACIB01000001.1| GENE 177 206290 - 208902 1800 870 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 608 866 45 311 328 162 39.0 2e-39 MNKRLYTIFLISVFLLLPGFSTAAERIYNVLFVQSYAPETPWHNDLVRGLKDGFGESGLK VNITTEFLDANFWTYQSEKLIMRRFCERARERGTDLIVTVSDEAFHTLLTCGDSLALQLP VVFFNIKYPEGSLIDSLPNVCGYTANPDFGELLRQASRLFPTRTEVVCISDNSLLSSKGK DDFMNEWEGFVEEHPEYTVTFYNSQTDTTNKIIASTCYPRNTHKTLIIAPKWSSFMSFIG RNSKAPFFSCENLALTNGAFGAYDADSYASAHEVGRTAADVLRGKSPSEVGIIESPLKFM YDFKQLVFFKVDPKQASAIGGTIINEPYMEKYRMLYILLYSSILALLVFLIVWLYRINRR ESRRRIHAQTRLLIQNRLVAQRDEFDNVFHSIRDGVITYDTDFRIHFTNRSLLKMLHLPK DEAARPYEGLPAGSIFKIYNNGKEILRPMLKQVVTEESSVVIPENSFMQEVHSGSYFPVS GEVVPIRAHGKITGMALSARNISDEEMQKRFFSMAVDESSIYPWQYNIRTGLFTFPAGFL TRFGFAENKTTISRGEMDRMIHPDDQESAYEIFNRALAGLSQSTRMSFRQLSGDGNYEWW EYRTSVLAGLTTDTPYSILGVCQSIQRYKTTEEELTAARDKALQADKLKSAFLANMSHEI RTPLNAIVGFSDLLSDTSGFTEEEVKLFIETINKNCGLLLALINDILDLSRIESGTMDFQ FAGHNLPLLMKNVYDSQRLNMPPGVQLVLKLPENSKKYLVTDNVRLQQVVNNLINNAVKF TTQGSITFGYTEEEPGYTSLFVEDTGKGISEDGLRHIFERFYKVDSFTQGAGLGLSICQT IVGRLNGTITVTSEEGHGTRFTVRLPDICE >gi|226332055|gb|ACIB01000001.1| GENE 178 208998 - 210242 1246 414 aa, chain - ## HITS:1 COG:no KEGG:BF1446 NR:ns ## KEGG: BF1446 # Name: not_defined # Def: putative ABC transporter permease component # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 414 
1 414 414 762 99.0 0 MNKKLLKQIVNERRSNSWLFIELLLVSIVLWYVVDYMFVTLYTYFEPRGFDIENTYRVEF DYLTEKSPDYIANRTDEEAHADMRELLDRLRRRPGVEAVSMSQNSFPYNGSNSGMDVRLD TMESKYNIRRWVTPDFFRVFRYQGANGETPEQLAALLKEGTFMVSRNVFESRYKIDLKDY VGKEFCLDQDTAHLSKLVAALQVVRYDDFSSGAYSRSAVILLPENRLASGNEICLRTNKN ESAAFAEQLMKDAPSQYRVGNLFLTKVSSFRDIRHTFQLDDMNTLRNYLVGMGFLLLNIF LGLLGTFWFRTQQRKGEMALMMAVGGSKQSVFFRLLSEGWLMLLLVTPLAIGVDFYIAKS ELTPSWYFSTFSVGRFMLCEGITLLLMALMILAGIWFPARQSMKIQPAEALHEE >gi|226332055|gb|ACIB01000001.1| GENE 179 210257 - 211555 835 432 aa, chain - ## HITS:1 COG:ZybjZ_2 KEGG:ns NR:ns ## COG: ZybjZ_2 COG0577 # Protein_GI_number: 15800637 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Escherichia coli O157:H7 EDL933 # 126 428 121 393 395 74 23.0 3e-13 MIKQYFKQALAQLRQQPLLTTISVLGTALTICLIMVVVMQQQIKTTPFAPESNRNRLLHV KQMSTSNKNWSDDGSSNGPMGLQTAKGCFEGLTTAEEVSIYTIPETMQVALPRGVRTGID ALETDGAFWRIFDFSFIDGKPYSDAEVKSGLPVAVITESVARLLFGTSHQVSGKEILVND AVYRISGVVKDVSSMASTAYAQIWVPYSSTHITGGDNTWCDGIMGVMRVVILARSSSDFE AIRAECERRRLAYNAGLGDYFVFYRGQPDDQLTMSQHKWANVQPDMAAYFRQQVIIFLIL LLVPAINLSSMTHSRLRQRVAEIGVRRSFGATRGGVMGQIVAENLVLTLMAGVVGLLFCL IISYCWGGTLFADSRLMYLNTAPVIEWKMLFKFSTFIYALLFCLALNLLSSGWPAWRASR MSIINALSGKLN >gi|226332055|gb|ACIB01000001.1| GENE 180 211569 - 212213 515 214 aa, chain - ## HITS:1 COG:no KEGG:BF1509 NR:ns ## KEGG: BF1509 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 214 1 214 214 379 99.0 1e-104 MKAKIILVALALFLGAIGYTVQAQCSADCQCGKTTEDKTLAVGNRKVADFSMIRLEAVGD IFFTQSDRCSVRIEGPQEYVSKTTTVVKNGVLVIGYQKNNNNSKHIKLYITAPNLDNVKL QGVGSFNCREPLRSRRFDLILSGVGDVNIDNLKCKDFTVKLDGVGCVNVKVDCDALEAQA NGVGSMTLNGKAGTAKISRNGVGGVNTDGLKIGK >gi|226332055|gb|ACIB01000001.1| GENE 181 212229 - 212894 337 221 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 201 4 200 223 134 37 4e-30 
MITLTSLSKIYRTNEIETVALENVNLTVDRGEFLSIMGPSGCGKSTLLNIMGLLDAPTTG TIEINGTHTEGMKDKELAAFRNKTLGFVFQSFHLINSLNVLDNVELPLLYRRVSSSERRK LAQEVLEKVGLSHRMRHFPTQLSGGQCQRVAIARAIIGNPEIILADEPTGNLDSKMGAEV MELLHRLNKEDGRTIVMVTHNEEQAKQTSRTIRFFDGRQVQ >gi|226332055|gb|ACIB01000001.1| GENE 182 213054 - 214304 998 416 aa, chain - ## HITS:1 COG:no KEGG:BF1507 NR:ns ## KEGG: BF1507 # Name: not_defined # Def: ABC transporter permease # Organism: B.fragilis # Pathway: not_defined # 1 416 1 416 416 864 99.0 0 MIKHLLKQIWAQRSVNAWLWFELMIVSVCLFYVMDYLYVTGRLYATPLGFDTEHVYRVKL ASIPPGGKEYKPGDTDSLKIEQWFSILSRLRAYPGVEAVSLSIGSHPYNQSSSSGSRGID TTWVHGYVYNVSPDYFRVFRITDKQGKTESLVQAATQENTWIISAETEREFSAKGTDALG KGVKNWGETEPTHTIRGICNTIRFDDFYPLYPTYIECHSEAALLGWRGNNAEFCVRVRPD ADGVDFPSRFRKDMKVQLRVGNFYLLDITSFDDLRENYYRSNGKINDVKTRIAALGFFLL NILMGVIGTFWIRTQQRRSEMGLRLALGSTRANLRSLLIGEGVLLLILATVPAAVISLNL AFMDLLTDTMPVVTVTRFLIVQAMTFVSIVVMIVIGICIPARQVMRIQPAEALHEE >gi|226332055|gb|ACIB01000001.1| GENE 183 214313 - 215587 1214 424 aa, chain - ## HITS:1 COG:YPO1365_2 KEGG:ns NR:ns ## COG: YPO1365_2 COG0577 # Protein_GI_number: 16121645 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Yersinia pestis # 6 423 1 392 395 89 23.0 1e-17 MIKQYFKQSLVLLQQNKLLSVIAIIGTALAIAMIMCIVLIYQARTANYEPEINRDRTLSI EMTIAQKIDDKGWNMGNQLSLRTIKECFYPMTTAEAVSAVHYSMTSLAATPDGTREVKCA VSYTDDNFWRVFGFRFLHGKPYGQEFVSGEKKLVVTRSLARRLFGIDNAVGRIISLGFVD YTVCGVVTDVSVLAEAAYAEAWAPYTALPDYERSVSEGLQGGYSCYILVPKGGDPDVVRA EAQQNVDRMNANQKELKLLLGGAPDTRLMSLARDNPFDDPDTSRLVLIYIVVITILLLVP AINLSGITLSRMRRRMEEIGVRRAFGATRGELLRQVLAENLVVTLMGGVLGLILSYIAVL CMRDWLLNTSMSGYYGVDTQVSAGMVIQPFVFVCALLFCLLMNLLSAGIPAIRVSRTNIV NAIK >gi|226332055|gb|ACIB01000001.1| GENE 184 215629 - 216294 338 221 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 201 4 200 223 134 37 3e-30 
MITLTSLSKIYRTNEIETVALENVNLKVDRGEFLSIMGPSGCGKSTLLNIMGLLDAPTAG TIEINGTHTEGMKDKELAAFRNKTLGFVFQSFHLINSLNVMDNVELPLLYRHIASSERRK LAQEVLEKVGLSHRMRHFPTQLSGGQCQRVAIARAIIGNPEIILADEPTGNLDSKMGAEV MELLHRLNKEDGRTIVMVTHNEEQAGQTSRTVRFFDGRQVQ >gi|226332055|gb|ACIB01000001.1| GENE 185 216611 - 217510 665 299 aa, chain + ## HITS:1 COG:BMEII0641 KEGG:ns NR:ns ## COG: BMEII0641 COG2207 # Protein_GI_number: 17988986 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Brucella melitensis # 196 296 199 292 307 63 31.0 5e-10 MEEVIKLNSVDQYNKMYGLETLHPLVTVVDLSKATVFPTHFTLNYGLYALFLKQTKCGDL RYGRQMYDYQEGTVTSFAPGQVVEVKLNDGVRPMSHGILFHPDLIRGTSLGQEIKHYSFF SYASNEALHLSDDEKKIFQDCLDKVQQELSRPIDKHSKRLIARNIELLLDYCMRFYERQF VTRSKVNKDVLMKFEDLLDVYFQSEQSPNEKLPTVKYFADKVNLSSNYFGDLIKKETGKT AQEYIQGKIINIAKERILASEKTVSEIAYELGFQYPQHFTRIFKKVVGCTPTEYRVIQV >gi|226332055|gb|ACIB01000001.1| GENE 186 217658 - 218929 973 423 aa, chain - ## HITS:1 COG:no KEGG:BF1436 NR:ns ## KEGG: BF1436 # Name: not_defined # Def: putative ABC transporter permease component # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 423 1 423 423 867 98.0 0 MNLMMILRQLWNQRAANGWILGELVVVTYFLWGVVDPVYVLLSDKALPDNYDLTDTYLLS IGAYSANHTRYNPELDSDSLKQVDFMRIVDQVRRYPGVSDVTVSFFNSYPQSGTWNGGQL FNDTIFANIQQMTFLSGTDYFGVFRIHDARTGEIPPVLAEGEQGIYLTPDVAEKLFGEKY PQNKWIHWGDSTRKSPLTAVIDPLQIRSISQPGPLVFKATSELIHLPGAARICFRVRDGL ASPAFTETFKREMRPRMQIGNYYLVSLTDFKTVSKHFEYYMGTTGTIRLQIILAAFFLLC VFLGMGGTFWLRCNSRREEMGIYMTMGSTHHRLIRQFLLEAWWMVTIAFVIGALAQFQIV YLNGFAFAPDDPNPDYIQNRPVLHFLIVSAISYILILAVSFVATYIPVSKAARMNPADAL RDE >gi|226332055|gb|ACIB01000001.1| GENE 187 218936 - 220243 1235 435 aa, chain - ## HITS:1 COG:BS_yknZ KEGG:ns NR:ns ## COG: BS_yknZ COG0577 # Protein_GI_number: 16078501 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Bacillus subtilis # 119 348 118 330 397 60 23.0 5e-09 MIKLYFKQAWQLLKQNPLFSSVYVLGTGLGIAMTMSLVIIYYIKMAPVYPEENRNRILVS 
KGMTAIEQGNENNWFSSNVSFQTVKRFYYPLKSAEAVGCSLDGHTTSLLELPESKDLKEV QVKLVDAGFWKVFSFAFVDGKPFTRADFDSGLRKAVISVTLARRLYGDNAPVGRTFVLDS DEYQVCGVVKDVSFITPATYADIWLPLTVDAEVVEEQEGSYELIGNLSVYMLAPSVGSKD KVAGEVRDAFRKYNFSQKKYKVDLYGQPVSYWKSTFYEYCNSAPDWGKLIRTYGTILLAL LFVPALNLAGMIASRMKRQLSEMGIRKAFGASKASLLMQVFWENLFLTGLGGLLGLLLSY LIVYCGRNWLPDLLSAYSDVIPEGVDSFLTPGMLLNPVVIGITFLVSLILNVLSALIPAL HALKKDIVYSLNDKR >gi|226332055|gb|ACIB01000001.1| GENE 188 220268 - 221554 1068 428 aa, chain - ## HITS:1 COG:no KEGG:BF1434 NR:ns ## KEGG: BF1434 # Name: not_defined # Def: putative ABC transporter permease component # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 6 428 1 423 423 844 100.0 0 MIRLIMKNLWARRRKNGWLLAELILVSIVTWVIVDPLVVLTHDRNLPEGYRPDHLFLLQL ASYPAGSEQFRAEENDTLARKANFERLLNKLKHYEGVKYTTFILNEQYPSALSISNGNTM FDTIPVRTLRLAFIPHTDYFRTMGMEGAEGMTAEQLDNRDFLWTESVLTADISQRLKDGK PLYGRRLGNGDEEDYRVGGVIAPVRYRSYMQPMPIELYVYEEFPDYMYYYIPLIALRIDD HLSEKVFLHHFREWMNKELTVGNYYVKSVQSFSDIQEQHEFSEGITNQYRLNLALGIFFL VNLCLGVAGTFWMQTRSRREEVGIMLSFGGTPSHITRLLLYEGWILTTLGTLTGCLLYLQ YALRDGLYTTCNSAEEAMPAYWINHFGLHFTAVTLIVYLLLLIVVSIGIWMPAHKLSRIS PVDALRDE >gi|226332055|gb|ACIB01000001.1| GENE 189 221558 - 222865 1080 435 aa, chain - ## HITS:1 COG:AGpA247_2 KEGG:ns NR:ns ## COG: AGpA247_2 COG0577 # Protein_GI_number: 16119402 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 129 431 120 390 393 73 24.0 6e-13 MIKLYLKQSWMLIRQNKLFSSLYVLGTGLAIAMTMIIAIVYYIKIAPIYPEVNRSLTMRM KGVSAMHVKGGGNSYSCSYEMLKDWFYPLQSAELVTAVNEHFLTRKGSYIQPAGGGEQIP ALVKYTDPNFFRLFEFEFLDGKPFSEADLASGIRNVVLSDRMARRIFGRTDVVGQTFKQD FKESKVVGVVREGSYLLPASYGQIYMPYSCLPGYDKNNDGSHKVGTYVVYFKVRQKEDMP KLYAEVNELVRKYNTSQKEYTVDIFHQPDPYWQTWFREGNTNEIDWASVIKLYGGALLAL LLVPAINLSGMISSRMDDRLAEMGIRKAFGANRKQLLNQVLWENLLLTCIGGLMGLIVSW GLLVLGRNWVFSLFDKYPTVISDGVDVAINPQMLFSPLMFCVTFAFCLILNLLSAWWPTW RSLHKDIIDSLNEKK >gi|226332055|gb|ACIB01000001.1| GENE 190 222884 - 224152 1143 422 aa, 
chain - ## HITS:1 COG:no KEGG:BF1499 NR:ns ## KEGG: BF1499 # Name: not_defined # Def: ABC transporter permease # Organism: B.fragilis # Pathway: not_defined # 1 422 1 422 422 816 99.0 0 MMIKQIFKMIWNQRRLNGWIWMELLVVFVALWYLVDMFVVQLYSYTRPMGYDITNCWKLS FDVYPEDADEYVNDTTRTQTEGEALAKILERLRRAPEVDNACVAFYSSPYSGGNSWTQIM PCTADSSKFKEQSYHQYIVSAEFYDVFRIKSREGKPLSELLTQKQLSYFITPALEKDFFG SQSAVGQKVRYPGSTREIHIAAVTAPVRITEFVKPEPELFFTMWPKELERQVNATGASNM EVTVRMKEELTSEQMGHFLNRMKNQLTENNLYITGMEDMKQQRSDRLQYEWRKISINLLL SVFILLNVLFGITGTFWLRIEQRRCETGLRMALGSTRRRVGWFFTAEGWLLLTTVVPLVL VVIFNMVHMEIPDLYNLSFTWWRFAVSFGGVLLLMGLIIALGTWLPARRAMKLQPAEALH YE >gi|226332055|gb|ACIB01000001.1| GENE 191 224172 - 225437 777 421 aa, chain - ## HITS:1 COG:PA2390_2 KEGG:ns NR:ns ## COG: PA2390_2 COG0577 # Protein_GI_number: 15597586 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Pseudomonas aeruginosa # 6 417 6 389 392 74 23.0 4e-13 MITIYLKQSYNLLKENRFVNGISIAGTALSVAAVMLIYLVYQVNFSAYAPESNRYRILFV SSLQACGSDGHPINNGGMSHKVVRECLYPLQTPEAVTAFTSGDLPVNVHGQQFYDKYAIK FTDDGFWKVFDFTFLAGGPFTHTDWESGIRKAVISDKLARRLFGTVEAVGQTLRMNYADY RICGVVKEVSQAAESSYGDVWIPYTANASLLKDNISYCEGTTGEFQACILSRSRSDFEAI RREMLKLQSTFNASLTGTKLDYMHSPFTQWQAVLGTNGFSEGTVGEWLKSTGAVILFLLL LPALNIIGITLTQFRKRRSEIGVRKAFGACSFSLVEQVVIENLLTSCMGGLIGLLLSFGL LSLCKSLFFSGDVSLTHDMLIQPLTFVAAFFFTLILNLLSAAIPAWRASRMPITEALHDM E >gi|226332055|gb|ACIB01000001.1| GENE 192 225558 - 226805 1440 415 aa, chain - ## HITS:1 COG:YPO1498 KEGG:ns NR:ns ## COG: YPO1498 COG0845 # Protein_GI_number: 16121771 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Yersinia pestis # 69 415 71 420 420 60 18.0 4e-09 MDREIPKEVRNKERNKKIIRFSAIGIVGIIGISVLISLLRTGVQKKDLVFSTVDKGTIEV SVSASGKVVPAFEEIINSPINTRIVEVYRKGGDSVDVGTPILKLDLQSTETDYKKLLDEE EMKRYKLDQAKVNSQTKLSDMAMQIKVSEMKLARMKVELRNEQYLDSLGAGTTDKVRQAE LSYNTSRLELEQLKQQYANEKQIAAADLKVLELDLNMFRKGLAEMKRTLDDAQIRSPRKA 
ILTYINNQIGAQIPQGGQVAIISDLSHFKVDGEIADTYGDRVAAGGKAIVKIGSEKLEGI VSSVTPLSKNGVISFSVQLKEDNNKRLRSGLKTDVYVMNAVKEDVLRIANASYYVGRGEY DLFVMTSDDEIVKRKIQLGDSNFEFVEVVSGLNPGDKVVVSDMTNYKNKNKLKVK >gi|226332055|gb|ACIB01000001.1| GENE 193 227130 - 228029 894 299 aa, chain - ## HITS:1 COG:lin1178 KEGG:ns NR:ns ## COG: lin1178 COG1705 # Protein_GI_number: 16800247 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Muramidase (flagellum-specific) # Organism: Listeria innocua # 27 172 52 206 289 88 40.0 1e-17 MNSKLLRLIAIVTVLAFAAGAQAQRRNSRYVDYINKYSALAVQQMKEHKIPASITLAQGL LESGAGMSTLARKSNNHFGIKCGSNWNGRTVRHDDDARNECFRAYRNPRDSYEDHSAFLK RGARYAFLFKLKITDYKGWARGLKKAGYATDPSYANRLITIIEDYDLYKYDRKGGWSSSK SEPTVLNPHQVYIANGIAYIVARDGDTFKSLAKEFDIRWKKLVKYNDLQRDYTLMSGDII YLKEKKKRASKPYTVYIVKDGDSMHTISQKFAIRLKNLYKMNRKDGDYIPEVGDRLRLR >gi|226332055|gb|ACIB01000001.1| GENE 194 228145 - 228627 610 160 aa, chain + ## HITS:1 COG:SP0844 KEGG:ns NR:ns ## COG: SP0844 COG0295 # Protein_GI_number: 15900731 # Func_class: F Nucleotide transport and metabolism # Function: Cytidine deaminase # Organism: Streptococcus pneumoniae TIGR4 # 25 153 6 125 129 92 41.0 4e-19 MKDLTITSVIKVYEYDELNDTDRALLDDAIEATRRSYAPYSHFSVGAAALLANGVVVTGT NQENAAYPSGLCAERTTLFYANSQYPDQAVMTLAIAARTEKDFIDTPIPPCGACRQVILE TEKRYKQPIRILLYGKKCIYEVQSIGHLLPLSFDASAMED >gi|226332055|gb|ACIB01000001.1| GENE 195 228630 - 229469 697 279 aa, chain + ## HITS:1 COG:PA0248 KEGG:ns NR:ns ## COG: PA0248 COG2207 # Protein_GI_number: 15595445 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 116 272 130 285 288 77 29.0 2e-14 MIRTYQTKLKGMLALTTSYHTEKSLQKEKSLYKFIWVRKGSITLEIDHQEVILAENEVIS LSNLQHLEFLSIDGEYLTVLFNSNFYCIYGNDHEVSCSGFLFNGSSHVVRFMLNESERRS MEDVVGLLDREFTVSDNLQEEMLRILLKRFIIQSTRIARQRLNITQEKEYSFEIIRQYYN LVDEHFRTKKQVQDYADLLHKSPKTLSNIFSSCKLPSPLRVIHERVEAEAKRLLLYSNKS SKEIADILGFEDQSSFSRFFKNMTGESPVQYRNSVEGKN >gi|226332055|gb|ACIB01000001.1| GENE 196 
229557 - 230159 738 200 aa, chain + ## HITS:1 COG:HI0219 KEGG:ns NR:ns ## COG: HI0219 COG3059 # Protein_GI_number: 16272180 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Haemophilus influenzae # 20 187 27 209 213 173 50.0 2e-43 MKEKFITLLTVASGLKGFGLKFIRVAILVVFVWIGGLKYFHYEADGIVPFVANSPFMSFF YAKDAPEYKEHKNPEGAYVPANREWHEANRTYTFSYGLGALIMSIGILVFLGIFFPKVAL VGDTLAIIMTLGTLSFLVTTPEVWVPNLGSGEYGFPLLSGAGRLVIKDIVILASAVTLLS DSSQRVLNSLKKADWEKRNK >gi|226332055|gb|ACIB01000001.1| GENE 197 230216 - 231592 469 458 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1 [Flavobacteriales bacterium ALC-1] # 1 451 1 444 458 185 28 2e-45 MKQYDAIIIGFGKGGKTLTAELAKRGWKVALIERSAMMYGGTCPNIACVPTKRLIHEAEK VSWLYPTDYEKQAEAYKAAIARKNEMTAASRANMFNKLSSLPNVTIYTGMASFVSKDVVK VTLPDEVIELQGKKIFINTGSTSIIPAIDGLKDSKRVYTSTSLMELDVLPRHLIIVGGGY IGLEFASMYASFGSKVTVLEGGNKFIAREDRDIADAVKETLEKKGIEIRLNARAQSIQDT ADGVTLTYTDTADGNPVTIEGDAILVATGRKPMTEGLNLQAAGVEVDSHGAIVVNGYLHT TAPNIWAMGDVKGGLQFTYISLDDYRIIRDDLFGNKERTADDRNPVAYSVFIDPPLSHVG LTEEEAIKRGYSFKVSRLPASALPRARTLQQTDGILKAIVDSHSGRIMGCTLFCAESSEV INVVNMAMKTGQHYTFLRDFIFTHPSMGEGLNDLFSID >gi|226332055|gb|ACIB01000001.1| GENE 198 231909 - 233498 1054 529 aa, chain + ## HITS:1 COG:CC2587 KEGG:ns NR:ns ## COG: CC2587 COG0488 # Protein_GI_number: 16126825 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Caulobacter vibrioides # 3 529 4 525 535 278 32.0 1e-74 MCIQVQQITYIHPDKEVLFRNLSFTVGKGRQLALIGNNGCGKSTLLQIMAGKLQPSSGNV LRPDDLYYVPQHFGQYDEMSIAQALGIDRKQKALHEILNGNASIDHFNILDDDWNIEEKA LAALNGWGLGNRSLSESMHTLSGGEKTRVFLAGLELQEPSAVLMDEPTNHLDNQGRNRLY EWVKKCRSTLIVVSHDRTLLNRFPETCELSRNALVCYGGNYEFYKQQKELQQNALQQQLD EKEKELRQIRKLAREVAERKEKQNIRSEKAVPKKGISRMAVHTLKDRAEKSTTKLADIHQ NKSEKLTSERSLIRNRLAENTLLKTDFHASALHTGKTLLTAHEINFSYGNSPLWEVPLCF QLKSGDRIRIEGDNGSGKTTLLKLVTGDLLPTSGTLERAGFSYVYLNQEYSIICNQLTVL 
QQAESFNVRALPEHEIKIRLNRFLFPATSWDQPCIQLSGGEKMRLAFCCLMISNNTPDLF ILDEPTNNLDIQSIEIITATVRNYTGSVLLVSHDEYFVKETGIRKSIFL >gi|226332055|gb|ACIB01000001.1| GENE 199 233530 - 234312 690 260 aa, chain + ## HITS:1 COG:MA1439 KEGG:ns NR:ns ## COG: MA1439 COG2816 # Protein_GI_number: 20090298 # Func_class: L Replication, recombination and repair # Function: NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding # Organism: Methanosarcina acetivorans str.C2A # 10 260 30 281 285 181 38.0 1e-45 MINTETNYRWFIFYQDQLLLEKNDNAFSIPTGKNAPVTTSEGITVHTITTATGMVCKAFY TDSPIAETPEYVQIGLRASYDYLSPEHYQAAGKVHEILYWDRSNRFCPTCGTPLVQKEPI MKKCPNCGREIYPVISTAILVLVRKEDSLLLVHARNFKGTFNSLVAGFLETGETLEECVA REVKEETGLDVKNIRYFGSQPWPYPSGLMVGFIADYAGGDIHLQDDELSSGNFYTRDHLP ELPRKLSLARKMIDWWIAQQ >gi|226332055|gb|ACIB01000001.1| GENE 200 234440 - 235336 993 298 aa, chain - ## HITS:1 COG:no KEGG:BF1487 NR:ns ## KEGG: BF1487 # Name: not_defined # Def: histidine decarboxylase # Organism: B.fragilis # Pathway: not_defined # 1 298 1 298 298 615 99.0 1e-175 MKDLVSGIRYDASNVISQAVGPSKEFCMGYLNPGVVGGEGYISTMKLSVGTVDVKDLDAI TERIVAKDRCEKNDAYLGQVNLMKASSFCGQNGAIWGFDLAMHDDIAKRKEMPIYMQAQP EGADIPVYNIRPLLEATERLFGRAKERRFPVLPGAYVPGGSRKVVACGPVWVWSVIGLAI LKDRSKGACLFVKDAGTYGDDSTTEGEAIGFLEGILRKATNSIALCGEDQDVIYDRIYIG YKYTFVEPGQVGCALSCTPAVYMAQNAIPADMKPADLCQMTISDWEEKLGLEELTIFE >gi|226332055|gb|ACIB01000001.1| GENE 201 235384 - 235545 94 53 aa, chain + ## HITS:1 COG:no KEGG:BF1486 NR:ns ## KEGG: BF1486 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 53 1 53 53 88 100.0 9e-17 MIDPYSLKKAKEDEEDKMLKEVRLPIGQVQTDIPLTCLLFKEYVPRNDQLKNR >gi|226332055|gb|ACIB01000001.1| GENE 202 235866 - 237368 805 500 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90021240|ref|YP_527067.1| ribosomal protein S32 [Saccharophagus degradans 2-40] # 117 443 28 339 408 314 47 2e-84 MNKPYFIAFLLSLALWTVIPSVFAGDVVLKVFEGKPRINSPHIIGNYPSTPFIFYIPTSG QRPMQWSAEKLPEGLELDSKTGIISGVMTSKGDYTVTLKAENALGVSVKQLVIRIGDELL 
LTPPMGWNSWNTFGQHLTEELVLQTADAMITNGMRDLGYSYINIDDFWQLPERGADGHLQ IDKTKFPRGIKYVADYLHERGFKLGIYSDAAEKTCGGVCGSYGYEETDAKDFASWGVDLL KYDYCNAPVDRVEAMERYAKMGRALRATNRSIVYSVCEWGQREPWKWAKQVGGHLWRVSG DIGDIWYRDGNRVGGLHGILNILEINAPLSEYAGPSGWNDPDMLVVGIDGKSMSIGYESE GCTQEQYKSHFSLWCMMASPLLSGNDVRNMNDSTLKILLDPDLIAINQDVLGRQAERSIR SDHYDIWVKPLADGRKAVACFNRTSSPQTVILNENTIADLSFEQIYCLDSHLTKSGSDSK ELIVKLAPYQCKVYIFGKTD >gi|226332055|gb|ACIB01000001.1| GENE 203 237483 - 237941 455 152 aa, chain - ## HITS:1 COG:no KEGG:BF1484 NR:ns ## KEGG: BF1484 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 152 1 152 152 278 94.0 4e-74 MKKLINTVLLLFLIGTVSSCLKSGLDDLEAYNEAEITNLNFEYRWWDEAKDQMAVKTLNI EKQISKDDNLITCKLTVPTASGSFTDAVRQNVSLSNLIAYIDLSTAARIMPLNGAPKLGS PGDFSAKEFKYQVTAADGTKREWTIKITDFVK >gi|226332055|gb|ACIB01000001.1| GENE 204 237983 - 239680 1562 565 aa, chain - ## HITS:1 COG:no KEGG:BF1416 NR:ns ## KEGG: BF1416 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 565 1 565 565 1140 99.0 0 MNKIKNLFVIILFALVAFSCSDILDKGPLDKYSENDVWKSTDLTQAFIYTALANATNMMV WKDNWTDNEAIMEDGRDVNAELIDRYYDAGWNKYEDIRRCNMVLARVPEAPFTNAEKANF IAQAKTIRAMIYFTRARLFGKLMLVKELIDPEADMKFPRTATVKETYDFILNDLREAAPD LPVDAPSGALSRGVAYALLGEAALHGAAYIENGQEEYYRIAAKACEDLFALDKYSLDGNY AGMFNDYDHSLASSEIILAQWRSAENTNFSDTWMQRLVPNIDPSKLIANVQAKYPLVEEM AGWPQRFPSVDLVNDYLVVDEDGKAKEWDQTSYYKKFLVDGGTVEDAIYKNRDKRFKASI VYDGCSYFANRVWLREGGNLYYTSKTTEFWGMPVSGYVYRKCVYEAKRLLNSEKTDYHYT LLRLGRSYLNYAEIKLRQGDKETAIDYINRTRVTHGGLPELPKSLSLEDTWKEYKRERRI ELVSEGDRYWSVLRWGKADGLEVVPELTVEQKFMKIAPDGKSFEIIPIPIYQSDNERTFT KKHYLFPVPQGQRDLNPNLDQNEGW >gi|226332055|gb|ACIB01000001.1| GENE 205 239696 - 243022 2663 1108 aa, chain - ## HITS:1 COG:no KEGG:BF1482 NR:ns ## KEGG: BF1482 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1108 24 1131 1131 2177 99.0 0 MRITLFLLMACVFSLYAGNSYSQNTRVSFAVDNVGLNKVLEEIESQTDYLFIYNSQINVN 
KLVTIKANKQTVSKVLDQILQNTGIEYKLEGSHIILEKKVEEVHNSSSAVQQQQTKKITG KVVDKTGEAIIGANVKIQGTDKGTITDLDGNFILEVAPKDVLAISYIGYLDTKVPMTGQK QIHVVLSEDNKMLDEVVVIGYGTTSTRKMASAVTAVKGEKLQDLPFNSVAASLAGRATGV IVQSSGGEPGSAPSISIRGGGAPVYVIDGVISDAWDFNTLNPNDIESLSILKDAASLAVY GSRAANGIVMVKTKQGGKGKTAVNYTFNAEFSQPTKLLKKTRGYDYAYNQMLAGINDGLD EADLPFNQEVLDIIKNQSDPYTSGHADTDWLGEGLKTVAPQYKHTVSLSGSGNKVNYYIS LGMLNQGSIYTSNALNYDRYTVRSNVNTTFDKIGLKVSLNLNGAYEKKEYPSFSAAKIWE DLYNQSPLNPAYNKDGTYAAVTDHPLAEMDKRSGYNRNYGKFINTQVAADWTLPWLKELT LGAMFNYRLNDSHVKKFSTKAPQYYADGAVYPIGKPTLNEEGYWGESYNFEVSAAYVKTF AEKHTIDAKFVYNVAENTGWNFNAYRGEYLSTVVDQLFAGAADTQQNGGNSDEGGRMGLV GRLKYDFMNRYIVEGSFRYDGSDNFTPGHRWGFFPSGAVAWAISEEPFFKEWDQHVFDLL KLRASYGQTGTENGVNRFGYLSTYSLDEKKIVIGGKLQSGFSEGALVSPELLSWYQVNSF NLGLDMAFFNNRLKGTFDYFYYVTKGGLMSPGDRYTTPLGKPLPQIKSNSEQRREGVELT MRWSDTTPRKFTYEVGFNMTYFNSLWKVKADEALSDLMNPYKRQTHQTDYYGLGYIDTGL YQNKEDILNSPRRLGSTQTKLGDIGYTDVNGDGKIDGEDQVRIGKPTMPHFTYAFDFSLG YEGFTLSGLLYGTGERYMTFGNRYQSGEGKYLYYENQLNYWRPDNTGADFPRISISSGVN GNNNKAGSTFWMRNASYLRLKDLQLSYDFKYKYLKKCDWLQTCRVNLSGSNLFTISGVSK FFDPETSSTSGDGYPVQRVYSIGVTIGF >gi|226332055|gb|ACIB01000001.1| GENE 206 243240 - 244130 596 296 aa, chain - ## HITS:1 COG:AGl2871 KEGG:ns NR:ns ## COG: AGl2871 COG3712 # Protein_GI_number: 15891547 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 6 276 34 310 331 72 26.0 1e-12 MEEGQRVKAWVEASDENERAFYRERKIFDALMLNNPLPVKKTSFFNFTHYKKIEWLKIAM AVILTFLLSYFYQEYKAGLDSVAMSTISVPEGQRTNVTLPDGSNVWLNACTTIQYPTSFN SRERFVILKGEAYFDVKKNKSRPFIVHTDAYSIEVLGTKFNVDAYPETEKFETTLMHGSV KVTLKADSSQTVILKPDHKLSLEKGRFVMTKVEDYNPYRWKEGLICFSDESFPNIMKDFE KYYGVKIVIENKNVLQINFTGKFRQTDGIDYALRILQKNIDFQYEKDNEKQIIYIK >gi|226332055|gb|ACIB01000001.1| GENE 207 244228 - 244806 352 192 aa, chain - ## HITS:1 COG:TP0092 KEGG:ns NR:ns ## COG: TP0092 COG1595 # Protein_GI_number: 15639086 # Func_class: K Transcription # Function: DNA-directed RNA 
polymerase specialized sigma subunit, sigma24 homolog # Organism: Treponema pallidum # 28 173 13 150 162 60 28.0 2e-09 MHIKTNPKDKISFNQLYNDYQTRFLNFANTYVRDWDVAEDITTEALIYYWENRNTLSEVS NIPAYILTIIKNKSLNYLRHLQIREEHSENIRKYIEWELNARIVSLDACEPYELLVKEMQ ELIQQTLDKLPERTRKIFILSRYENKSYKEIAALMNMTTKGVDFHICKALKALQINLKDY FPLFLYFLMKFH >gi|226332055|gb|ACIB01000001.1| GENE 208 245697 - 247442 1888 581 aa, chain - ## HITS:1 COG:CAC2337 KEGG:ns NR:ns ## COG: CAC2337 COG1109 # Protein_GI_number: 15895604 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphomannomutase # Organism: Clostridium acetobutylicum # 12 553 5 549 575 449 44.0 1e-126 MGNEELIKQVTEKAEKWLTPAYDAETQAEVKRMLENEDKTELIEAFYKDLEFGTGGLRGI MGVGTNRMNIYTVGAATQGLSNYLNANFKDMKQISVVVGYDCRNNSSLFAKISADIFSAN GIKVYLFEEMRPTPEMSFAIRHLGCQSGIILTASHNPKEYNGYKAYWDDGAQVLAPHDKG IIDEVNKIASAADIKFQGNPDLIQIIGEDVDKIYLDMVKTVSIDPEAIARHKDMKIVYTP IHGTGMMLIPRALKMWGFENVYTVPEQMIKDGNFPTVVSPNPENAEALTMALNLAKEIDA DLVMASDPDADRVGIACKNDKGEWVLINGNQTCLMYLYYIITQYNKLGKMTGNEFCVKTI VTTELIKKIADKNHIEMLDCYTGFKWIAREIRLREGKKKYIGGGEESYGFLAEDFVRDKD AVSACCLIAEVAAWAKDNGKTLYQLLMDIYVEYGFSKEFTVNVVKPGKSGAEEIKAMMEN FRANPPKELGGSKVVLSKDYKTLKQTDAAGHVTDIDMPEPSNVLQYFTEDGGKVSVRPSG TEPKIKFYIEVKGEMGCRNCFATADAEATEKVEAVKKSLGI >gi|226332055|gb|ACIB01000001.1| GENE 209 247550 - 249187 1742 545 aa, chain - ## HITS:1 COG:MA3377 KEGG:ns NR:ns ## COG: MA3377 COG4690 # Protein_GI_number: 20092191 # Func_class: E Amino acid transport and metabolism # Function: Dipeptidase # Organism: Methanosarcina acetivorans str.C2A # 21 499 2 538 574 177 27.0 4e-44 MKRIFLFAALLTAAVAETFACTNLIVGKNASTDGSTIVSYSADSYGLFGELYHYPAATYP KGTMMDIHEWDTGKYLGQIEQARQTYNVIGNMNEFQVTIGETTFGGRPELVDTLGIIDYG SLIYVGLQRSRTAREAIKVMTELVQEYGYYSSGESFTIADPNEIWIMEMIGKGPGIRGAV WVAVRVPDDCISAHANQSRIHQFDMADKANCMYSNDVISFAREKGYFSGVNKDFSFADAY APLDFGARRFCEARVWSYFNMFTDQGEAYLPYIQGKTNDPMPLFVKPKRKLSVQDVKNAM RDHYEGTALDISNDFGAGPYKTPYRLSPLTFKVGDQEYFNERPISTQQSGFVFVAQMRAN LPDAVGGVLWFGTDDANMTVFTPVYCCTTKAPVCYTRVDGADYITFSWNSAFWIFNWVAN 
MVYPRYDLMIGDVRATQNELETTFNEAQEGIEAVAVKLYEKNPETAVKFLTNYTDMTAQS TFDTWKRLGEFIIVKYNDGVIRKMKDGKFERNAIGQPAGVVRPGYSKEFLEEYVKQTGER YKVTE >gi|226332055|gb|ACIB01000001.1| GENE 210 249236 - 250888 1485 550 aa, chain - ## HITS:1 COG:TM1660 KEGG:ns NR:ns ## COG: TM1660 COG0739 # Protein_GI_number: 15644408 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Thermotoga maritima # 25 272 19 256 323 72 26.0 2e-12 MKQYIVALMLACSIQGHSQDVKQATFVPPFDFTLTLSGNFGEIRANHFHGGLDFKTQGVI GKPVRALADGYISRIRVTNGSGHVLDVVYNNGYTTINRHLSGFMPDIARRVEKLQYEKED WEVEIVPEPGEYPVKAGQQIAWSGNTGYSFGPHLHLDVMETATGESIDPMPFFKSKIKDN TAPRAEGIMLFPQPGSGVVGGSPERQSFPINTARPIEAWGVIGSGIKAYDYMDGVHNRYG VHTVVLRVDGTEVFRSVVDRFSQDENRMINSWTCGQYMKSFIDPGNTLRMLRASNDNRGL ITIDEERDYKFEYELKDAFGNTSRYRFTVRGKRQPIQPLGHREKYFFAWDKTNFLQEPGL TLTVPRGMLYDNVPLNYEVHSDSGAIAYTYQLNDEKVPLHGECELRIGLRRQPVADSTKY YVARVTPKGSMYSVGGTYEDGFMKTRIRELGTYTVAVDTVPPEITPVGKNTWGRNGKVVY RLKDKETGIRAYRGTIDGKFALFGRPNLTKSHWECKLDPKHVKKGVRHTVVMTTTDDCGN ETTVRDTFVW >gi|226332055|gb|ACIB01000001.1| GENE 211 251076 - 252185 1273 369 aa, chain + ## HITS:1 COG:VC1905 KEGG:ns NR:ns ## COG: VC1905 COG0686 # Protein_GI_number: 15641907 # Func_class: E Amino acid transport and metabolism # Function: Alanine dehydrogenase # Organism: Vibrio cholerae # 1 363 1 363 374 384 58.0 1e-106 MIIGVPKEIKNNENRVGMTPSGVAELVKRGHTVYIQHTAGENSGFQDEAYTAVGAQILPT MEETYAMAQMIVKVKEPIAPEYRLIRKGQLLFTYFHFASDRELTLAMIENGSVCLAYETV EKADHSLPLLIPMSEVAGRMAIQEGARFLEKPQGGKGILPGGVPGVKPAKVLILGGGIVG SNAAQMAAGLGADVTIADINLSRLRYLSETLPKNVKTLYASEVRLKKELPDVDLVVGSVL IPGDKAPHLITRDMLKMMQPGTVLVDVAIDQGGCFETSHPTTHSEPTYVVDGIVHYAVAN IPGAVPYTSTLALTNATLPYVIALANKGWRKACKEDPALALGLNVVEGKVVYRAIADVFG LKYEQLNLE >gi|226332055|gb|ACIB01000001.1| GENE 212 252281 - 253225 789 314 aa, chain - ## HITS:1 COG:CAC3576 KEGG:ns NR:ns ## COG: CAC3576 COG2070 # Protein_GI_number: 15896810 # Func_class: R General function prediction only # Function: Dioxygenases related to 2-nitropropane 
dioxygenase # Organism: Clostridium acetobutylicum # 7 310 9 310 310 233 42.0 4e-61 MNRISSLFGIQYPIIQGGMVWCSGWKLASAVSNAGGLGLIGSGSMYPDVLREHIRKCRAA TDKPFGVNIPLMYPQIEEIMNIVVEEGVKIVFTSAGNPKTWTGWLKERGITVAHVVSSSK FAMKCEEAGVDAIVAEGFEAGGHNGREETTTLCLIPAVREATTLPLIAAGGIGTGEAIFA LMALGAEGVQMGTRFALTDESSASDIFKEYCLRLNEGDTKLLLKKLAPTRLVTNSFRDRV ESAEGRGASAEELRELLGRGRAKKGIFEGDLEEGELEIGQVAALFRRRQSVDEVMKELLE GYRRASDKIRSAGL >gi|226332055|gb|ACIB01000001.1| GENE 213 253283 - 253987 632 234 aa, chain - ## HITS:1 COG:no KEGG:BF1404 NR:ns ## KEGG: BF1404 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 234 1 234 234 465 100.0 1e-130 MKRIVFLMLSLGMAVTLHAQWTAKDSVNLQRILNGKEDIKLNMDAVKQIDFNSSPTVPKM SKERPGLRLDETLPQVLEKKKVVLTLRPYTANTKYNWDPIYQKKIRVDADTWRGDPFVEL YQTVPSNWAKNVYDKGIRSSYEEIRSSGLRHNLFGERANGMMVPTQSMVHTSAMKLGKSG VTVNGGTIGGLDLMTIFTKDFWDKKGRNRRSRTLEVLKAYGDSTTVLLPEPIVH >gi|226332055|gb|ACIB01000001.1| GENE 214 254001 - 254531 406 176 aa, chain - ## HITS:1 COG:no KEGG:BF1472 NR:ns ## KEGG: BF1472 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 176 1 176 176 341 100.0 7e-93 MKKTFLMLLTLVLSLLTCVSCSEETLDYNNPDVDLFVRQLKAGNYNTKSPKGFVEVPKFT EKDIPTLLNYAEDLTLITSFPLPPVSAYYSGKVRLGECMLWVVETIRLGHYASFGCKMVR ANAENYEGIYFLTDEELLDAAARYRRWWENRQYPRTAWTIDACFDEPLCGSGYRWW >gi|226332055|gb|ACIB01000001.1| GENE 215 254550 - 255590 766 346 aa, chain - ## HITS:1 COG:no KEGG:BF1471 NR:ns ## KEGG: BF1471 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 346 1 346 346 652 98.0 0 MKKENDEITDLFRSRLGNAEMTVRDGFWEELNSEMMVRSHHRKVVFFRVAAAASVLLVLA ASSAAFWFFSPKAEIEEAFTQVAVVSGNTTHLDGDVVKQDFTPMRSEPVLGKPAPKRSGV LAQSSGEEDDSVSVTVSMSFSFSSTTTRRRQNNYPDKSYWQAGGEGGALASSADGPRPDD HTVVADKSRTWAVKAAVGTALPAANGKYKMPVTAGLTVEKKINKHLAVETGLLYSNLRAE QNLHYLGIPVKLNVMLAETPKFDLYASVGGVADKCIAGAPDNSFKNEPVQLALTAGVGMN YKINDKLALFAEPGVTHHFKTDSKLETVRSARPTNFNLICGLRMTY >gi|226332055|gb|ACIB01000001.1| GENE 216 255574 - 
256125 348 183 aa, chain - ## HITS:1 COG:mll8140 KEGG:ns NR:ns ## COG: mll8140 COG1595 # Protein_GI_number: 13476734 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 44 175 40 177 208 81 34.0 8e-16 MENEIELIKGCRAGKDSARKELYTLYSRQMLAVCFRYTGDMEAAHDVLHDGFIKIFTNFS FRGEASLGTWVTRVMVTQALDYLRRQKRVSQLEVHEEQLPDIPDLPEGGEAGRISEEQLM KFVADLPDGCRTVFNLYVFEEKSHKEIADMLGIKEHSSTSQLHRAKFLLAKRIKEYRNHE ERK >gi|226332055|gb|ACIB01000001.1| GENE 217 256396 - 257235 858 279 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163755345|ref|ZP_02162465.1| 30S ribosomal protein S6 [Kordia algicida OT-1] # 5 278 11 284 286 335 60 1e-90 MNELIDRLIDLAFAEDIGDGDHTTLSCIPATAMGKSKLLIKEAGVLAGIEIAKEIFHRFD PTMKVEVFINDGAEVKPGDVAMIVEGKIQSLLQTERLMLNVMQRMSGIATMTRKYVKQLE GTKTRVLDTRKTTPGLRMLEKAAVKIGGGVNHRIGLFDMILLKDNHVDFAGGIDKAINRA KEYCKEKGKDLKIEIEVRNFDELRQVLSIGGVDRIMLDNFTPENTKKAVEMIGGKYETES SGGITFDTLRDYAECGVDFISVGALTHSVKGLDMSFKAC >gi|226332055|gb|ACIB01000001.1| GENE 218 257228 - 257620 334 130 aa, chain - ## HITS:1 COG:no KEGG:BF1468 NR:ns ## KEGG: BF1468 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 130 1 130 130 238 99.0 5e-62 MKKRVLLWMAGLVFAVTSLFAQDIPVGVVVAFKKGNSQELNRYLGEKVNLVIQNHSESVD RQAAEGTLAAFFSSNKVSGFNVNHEGKRDESSFIIGTLTTANGNFRINCFFRRVQNKYLI NQIRIDKTNE >gi|226332055|gb|ACIB01000001.1| GENE 219 257717 - 258190 531 157 aa, chain + ## HITS:1 COG:SA0023 KEGG:ns NR:ns ## COG: SA0023 COG1576 # Protein_GI_number: 15925729 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Staphylococcus aureus N315 # 1 155 1 158 159 98 34.0 5e-21 MKTTLIVVGRTVEQHYITAINDYIERTKHFISFDMEVIPELKNTKSLTPEQQKEKEGELI AKALQPGDVVVLLDEHGKEMRSVEFARWMEKKLVNVNKRLVFIIGGPYGFSQKVYDAAHE KISMSKMTFSHQMIRLIFVEQIYRAMTILNGGPYHHE >gi|226332055|gb|ACIB01000001.1| GENE 220 258200 - 258655 437 151 aa, chain - ## HITS:1 COG:NMA2170 KEGG:ns NR:ns ## COG: NMA2170 COG0780 # 
Protein_GI_number: 15795041 # Func_class: R General function prediction only # Function: Enzyme related to GTP cyclohydrolase I # Organism: Neisseria meningitidis Z2491 # 2 151 5 155 157 229 72.0 1e-60 MTELKEQLSLLGRKTEYKQDYAPEVLEAFDNKHPENDYWVRFNCPEFTSLCPITGQPDFA EIRISYLPDVKMVESKSLKLYLFSFRNHGAFHEDCVNIIMKDLIRLMDPKYIEVTGIFTP RGGISIYPYANYGRPGTKYEEMATHRLMNHE >gi|226332055|gb|ACIB01000001.1| GENE 221 258669 - 259328 604 219 aa, chain - ## HITS:1 COG:CAC3627 KEGG:ns NR:ns ## COG: CAC3627 COG0603 # Protein_GI_number: 15896861 # Func_class: R General function prediction only # Function: Predicted PP-loop superfamily ATPase # Organism: Clostridium acetobutylicum # 1 213 5 217 222 309 65.0 3e-84 MKNDSAVVLFSGGQDSTTCLFWAKKHFKKVYALSFLYGQKHAHEVELARGIAERAGVEFH VMDTSFIGSLGSNSLTDTSISMDEDKPKDSFPNTFVPGRNLFFLSIAAVFAREQGAFHLV TGVSQTDYSGYPDCRDSFIKSLNVTLNLAMDEQFVIHTPLMWIDKAETWALADELGVFDL VRNETLTCYNGIPADGCGHCPACKLRKQGLEEYLSKRNR >gi|226332055|gb|ACIB01000001.1| GENE 222 259367 - 260047 751 226 aa, chain - ## HITS:1 COG:Cgl0234 KEGG:ns NR:ns ## COG: Cgl0234 COG1738 # Protein_GI_number: 19551484 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Corynebacterium glutamicum # 9 213 43 249 250 136 37.0 3e-32 MKEKVSVPFMLLGILFNVCLIAANLLETKVIQVGSITVTAGLLVFPISYIINDCIAEVWG FKKARLIIWSGFAMNFFVVALGLIAVALPAAPFWEGEQHFDFVFGMAPRIVVASLLAFLV GSFLNAYVMSKMKVASGGRNFSARAIWSTVVGETADSLIFFPIAFGGLIAWPELLVMMGT QIVLKSLYEVIILPITIRVVKAVKRIDGSDVYDTDISYNVLKVKDI >gi|226332055|gb|ACIB01000001.1| GENE 223 260267 - 260770 656 167 aa, chain - ## HITS:1 COG:mll3697 KEGG:ns NR:ns ## COG: mll3697 COG1595 # Protein_GI_number: 13473184 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 15 164 17 161 183 97 38.0 7e-21 MEKMNITHKIVAMQPELEHFAYKLTADRESANDLVQDCLLKALDNKEKFVHTQNFKGWMY TIMRNIFINNYRKSLREVDMTDSTYNLYAQTMTEGEEGNRFETIYDLKELYKVINAVPED LKKPFMMFVAGFKYREIAEKMDLPVGTIKSRLFLIRKRLQQDLKDFS 
>gi|226332055|gb|ACIB01000001.1| GENE 224 260934 - 262223 1291 429 aa, chain + ## HITS:1 COG:no KEGG:BF1393 NR:ns ## KEGG: BF1393 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 429 1 429 429 866 100.0 0 MKTFFRSVAMLLGLGLLPVAAHAQKKVIIEDEEPNSVMFVSTNKAGDEIIRIMNETRLPR FHEPKAPRFVLTDRQGKFALGIGGYVRATAEYDFGGIVKDVDFYPALIPNKGSADRARNQ FQMDISTSTLFLKLVGHTKRLGDFVVYTAANFRGDGKTFELQNAYATFLGFTLGYSYGNF MDLAALPPTIDFAGPNGSAFYRTTQLSYMCDKLKNWKFGVGVEMPSVDGTTNQYLTINTQ RMPDFTASAQYNWNANSHLKLAAIVRSMTYSSSVDNKAHSKAGYGLQASTSFNVTPKWQV YGQVNYGKGIGQYLNDLSNLNVDIVPNPEKEGRMQTLPMLGWFAGLQYNISKNVFVSGTY SLSRLYSEHTYPSDEPDMYRKGQYLVANLFWNVTSNLQVGAEYLRGWRTDFSSNTRHANR MNALVQYSF >gi|226332055|gb|ACIB01000001.1| GENE 225 262312 - 263463 790 383 aa, chain - ## HITS:1 COG:no KEGG:BF1461 NR:ns ## KEGG: BF1461 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 383 1 383 383 775 100.0 0 MKNTFILFLVALSFVCSSCNRTPSVEEPDVLKVELKETPVSVSDFFSKVEVIPLETSDSC LLARILRVRVSGDTTYILTQDYPTFRHITLMAFDKKGNYLRSIGRVGQGPGEYSQVYDAV ISEQRNRVYMLSPFGSMYTYRLDGSFIDRKILPQKMNFQEIGLTPDGDLLTWSAQNKDDG ACIMRLDADSAKLINEFWSDDFWLNWACRDMFYSYQGKAYFAPAFYEEVYELTSDSFRVA YRWDMGEQNINIAQYHFNSNPDTRREEDRRLNEYRETGKIAFNFTNQYQNGKYYYAQLLS LASKPKWKYTNLFYRKKDGAVFYFEKTREGIRIDPEVLTEDFMLCIVPTEELENYKSILP EEEYRVLSKRVEDDNPCVVKFYF >gi|226332055|gb|ACIB01000001.1| GENE 226 263594 - 265099 1453 501 aa, chain - ## HITS:1 COG:no KEGG:BF1460 NR:ns ## KEGG: BF1460 # Name: not_defined # Def: putative outer membrane protein precursor # Organism: B.fragilis # Pathway: not_defined # 1 501 1 501 501 966 100.0 0 MRKLFLISIGLLIVSTSTFAGGLLTNTNQHVLFLRMLARDASTQIDAVYSNPAGVAFMEN GFHLSLNGQSAFQTRTITSTFAPFAGFGGNATKVYKGEASAPFIPSVFAVYKKDKWAFSG NFAVTGGGGKATFNEGLGSFESLVSVVPGMLVAAGNEMVDKGVLPINIFNGTNKYSVDSY MRGKQMIFGLQLGATYRITDYLSAFAGVRMNYVSNGYEGHIRNIEANIGGGEMVNVNKYF SDYAKQARTAADAYLAANDLANYAKYDAIAKEATKVAGLTADKELDCDQTGWGVTPILGL DFKMNKWNIGVKYEFNTKLNVENKTRIDNTGLFANGVNTPHDIPALLTVGVSYEILPVLR 
ASVGYHHFFDTNARMADAKDPVTGQTMGKQHFIKQGTNEYLGGIEWDVCKWAQVSAGMQR TKYGVGDNYQSDMSFAVSSYSYGFGAGFDIAKNLKLNVAYFWTDYDKYTKETPNYGGTGI AGKDVFTRTNKVFGIGLDYKF >gi|226332055|gb|ACIB01000001.1| GENE 227 265320 - 266330 1096 336 aa, chain + ## HITS:1 COG:slr1556 KEGG:ns NR:ns ## COG: slr1556 COG1052 # Protein_GI_number: 16332154 # Func_class: C Energy production and conversion; H Coenzyme transport and metabolism; R General function prediction only # Function: Lactate dehydrogenase and related dehydrogenases # Organism: Synechocystis # 5 331 3 329 333 360 55.0 2e-99 MAYTIAFFGTKPYDEASFNEKNKEFGFELRFFKGHLNRHNVLLTQGVDAVCIFVNDTADA EVIRTMAANGVKLLALRCAGYNNVDLKAAADNGVTVVRVPAYSPYAVAEYTVALMLSLNR KIPRASWRTRDGNFSLHGLLGFDMHGKTAGIIGTGKIAKILIHILKGFGMNILAYDLYPD YNFARENQIVYTTLDELYHSSDIISLHCPLTEQTKYLINDYSISKMKDGVMIINTGRGQL IHTNALIEGLKNKKIGSAGLDVYEEESEYFYEDKSDRIIDDDVLARLLSFNNVIVTSHQA FFTREALANIAATTLENIRDFKNQKPLVNEVKLSNA >gi|226332055|gb|ACIB01000001.1| GENE 228 266549 - 267091 559 180 aa, chain - ## HITS:1 COG:no KEGG:BF1388 NR:ns ## KEGG: BF1388 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 180 1 180 180 341 97.0 8e-93 MIEKVFKVGDTITIKRTSQAGTGYRYALVRLTGGVALVEELSEDADTPGGTSVQSFIFRF LQPGQVEIQFAYYRDSEEVLYEDIFSYGVVTSEKANPIIGGFGEFRPLTDQEKEIFRTCM TLKGVDYTPLLVAEQLVSGYNYRFICMTESLIREPKYGFAKVTIYAPLRGEPILESIIEC >gi|226332055|gb|ACIB01000001.1| GENE 229 267328 - 269145 1455 605 aa, chain - ## HITS:1 COG:no KEGG:BF1455 NR:ns ## KEGG: BF1455 # Name: not_defined # Def: transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 605 1 605 605 1102 99.0 0 MRKLLYVILLLLFRLLMFPGAVCAEILSLPDSLITDDNVYKYTFSDFDKAQQVMEQLRKR KSLSMFRMDVVEGDLYFNVGQYYKALKFYKRALDSDSVRNNDKNYMEQVHRMISCYDCLH NENKKSLYVYLLLKRAEQCGDKAMQSVALFNMGKMLYYQGNKEKGYEYLEQAIEMMSKTD YRYKYDNLRYDYNTLLIFQKSDRRNEEALRTLAALEKVVTEETGSETPMEGLSAKEKKAM YAHYAVVLFRLGQAEEAERYYRKFLAASEEYDRDDYLIMPYLFDRKMYDQVIRMNAAREK MYIMRGDTVNYYMTTIKRSLGWAYRDKGDYRTAARYFEQLAVLRDSIKNREQKSAALELA 
AVYETNEKDLFIQQQAADMQKRNLLLAFILCIVFLLGVLLWRTIRHNRIIRRKNKAMVGT IEDLLIHKEELYRKKEENLILKEQLEREQNLRMSAGGGSLSTDKDGKAETTVVPAPEAGD GNDIHDRILFDKLEHEIISRQLYLQPDFSREELIKTIYIPKNKFAPLFKQYAGMSFSKYI NNLRLEYAAKMLKNHPDYTVDTIAQECGMSTQSLYRLFSGKYGVTPTDFQVGVQHINNKN ITEDK >gi|226332055|gb|ACIB01000001.1| GENE 230 269398 - 270102 745 234 aa, chain + ## HITS:1 COG:sll1773 KEGG:ns NR:ns ## COG: sll1773 COG1741 # Protein_GI_number: 16330260 # Func_class: R General function prediction only # Function: Pirin-related protein # Organism: Synechocystis # 8 213 7 211 232 167 42.0 2e-41 MKKVVHRSDTRGRSVYDWLDSHHSFSFDEYYNPERVHFGALRVLNDDRVAPGEGFQTHPH KNMEIVSIPLKGLLAHGDSKKNSRTITVGDIQVMSAGTGIYHSEMNGSKSEPVEFLQIWI IPKERNTHPLYQDYDIRGLLKKDELAFILSPDGSTPAKLLQDTWFSMGEIGAGKTVEYTL HGTDMGVYVFLIEGEVKIDDVILTRRDGLGISEIKNFEIETLKDSKILLIEVPM >gi|226332055|gb|ACIB01000001.1| GENE 231 270120 - 270827 422 235 aa, chain + ## HITS:1 COG:sll1440 KEGG:ns NR:ns ## COG: sll1440 COG0259 # Protein_GI_number: 16330895 # Func_class: H Coenzyme transport and metabolism # Function: Pyridoxamine-phosphate oxidase # Organism: Synechocystis # 22 235 18 230 230 204 47.0 8e-53 MSTDHISTHPDSPLHGNGIGSEAINLAAIRQEYTKGGLKEGDLPDNPLSLFNRWLHEAID AQVDEPTAMLVGTVSPEGQPSTRTVLLKDLHDGKFIFYTNYESRKGTHLAKNPYISLSFV WHALERQVHIEGIASKVPAGESDTYFRQRPYKSRIGARISPQSRPLKSRMQLIRNFVAEA ARWVGREVERPAHWGGYTVTPHRIEFWQGRANRLHDRFLYSLQPDGSWQKERLAP >gi|226332055|gb|ACIB01000001.1| GENE 232 270899 - 271624 658 241 aa, chain + ## HITS:1 COG:FN1387 KEGG:ns NR:ns ## COG: FN1387 COG2220 # Protein_GI_number: 19704722 # Func_class: R General function prediction only # Function: Predicted Zn-dependent hydrolases of the beta-lactamase fold # Organism: Fusobacterium nucleatum # 5 212 4 207 237 152 40.0 5e-37 MILDYIYHSGFAIEAEGVTLIIDYYKDSSETELNKGIVHDRLLQRPGKLYVLASHFHPDH FNREVLTWKEQRPDIIYIFSKDILKHHRAQKEDAIYINKGEEYEDDMLRIQAFGSTDVGI SFLIHLQGKSIFHAGDLNNWHWSEESTEQEIRKAEGDFLAEIKYLQAAVPAIDLVMFPVD RRMGKDYMKGAQQFIERIKTTIFVPMHFSEDYEGGNAFRSIAEAGGCRFITITHRGESFD I >gi|226332055|gb|ACIB01000001.1| GENE 
233 271644 - 272720 962 358 aa, chain + ## HITS:1 COG:no KEGG:BF1451 NR:ns ## KEGG: BF1451 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 358 1 358 358 667 99.0 0 MNKLTQIFLVLFMICLPVTLWADNDKDDDDSRYLAGAVPEVDGRVVFSREFSIPGMSQDE IYERMLKWLDGRMAQNKNNSRVVYKEKGVIAAAGEEWLVFSSTALSLDRTWLTYQVTVNC QPQKCTMEVEKIRYTYREKEKYAAEEWITDKYALNKAKTKMVRGLAKWRRKTVNFADNLF EEAAKALSQKPMEAKAEATPKKPAVVTAPKVVVIGDNQETGKVEKAAELTPAIPVTSPST MPGYKEVAPDQVPANAIQMGAGRLVIAIGSDPFNMTMMTANAGGSIGKISGSPVVFSILS PDQPYEQLEKAETYSIRFYPTGQKEPSVILECQKLPAPTPMEGQPRTYAGKITKAFMK >gi|226332055|gb|ACIB01000001.1| GENE 234 272842 - 273222 171 126 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148984704|ref|ZP_01817972.1| 50S ribosomal protein L20 [Streptococcus pneumoniae SP3-BS71] # 5 125 3 126 126 70 34 7e-11 MEIKSRFDHFNINVTDLERSIAFYEKALGLKEHHRKEAADGSFILVYLTDNTTGFLLELT WLRDHTEAYELGENESHLCFRVAGDYDAVRQYHKEMDCVCFENTSMGLYFINDPDDYWIE ILPERL >gi|226332055|gb|ACIB01000001.1| GENE 235 273298 - 273486 217 62 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSLEDAIMGGIVFKGKKEGPKKDENKVKTKAKKTTYIKGTHGSGAAKMKAEIRRKRANRH KR >gi|226332055|gb|ACIB01000001.1| GENE 236 273584 - 274318 860 244 aa, chain + ## HITS:1 COG:Ta0580 KEGG:ns NR:ns ## COG: Ta0580 COG0500 # Protein_GI_number: 16081683 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Thermoplasma acidophilum # 10 226 5 210 227 104 33.0 2e-22 MNTSLLPAEKDPMGAAIADFYHRQKADRLRVFSSQFDEDEIPIKQLFRKAGQMPLLERTA LAMATGTILDVGAGSGCHALALQESGKEVSAIDISPLSVEVMKLRGVKDARQVNLFDERF AETFDTILMLMNGSGIIGRLENMPLFFRKMKQLLRPDGCILMDSSDLRYLFEDEDGSFLI DLAGDYYGEIDFRMQYKDIQGDPFDWLYIDFQTLSAYAADNGFKAEMIKEGKHYDYLARL TVAL >gi|226332055|gb|ACIB01000001.1| GENE 237 274361 - 276226 1738 621 aa, chain - ## HITS:1 COG:STM2333 KEGG:ns NR:ns ## COG: STM2333 COG0471 # Protein_GI_number: 16765660 # Func_class: P Inorganic ion transport and metabolism # Function: Di- and 
tricarboxylate transporters # Organism: Salmonella typhimurium LT2 # 7 621 11 608 608 372 37.0 1e-102 MAITIIILVLSAVFFMSGKVRSDLVALCALVLLILFNILTPEEALSGFSNSVVIMMVGLF VVGGAIFQTGLAKMISSRILKFAGTSELKLFILIILVTAAIGAFVSNTGTVALMLPIVVS MAMSANINVSRLLMPLAFASSMGGMMTLIGTPPNLVIQNALIEAGYERLSFFTFTPVGLV CVTVGLIVLIPLSKIFLTKKSDKEARGKRKNKSLQELAGEYQLSQNLYRVEVEESSGFVG KTIQELGIPQRYHVTILEVRRRSVSSRHFFKTKKQEMAEAGTLIEAEDILYVLGEFENVK RFAGENRLSLLDTHKTEGSLHDTDEGFDFQEIGIAEILLLPTANLINKPVKASGFRENYS INILGIQRKRDYLLKNLKDEKMHSGDILLVQGSWANIARLGEDQSQWVVLGQPLAEAAKV TLNHKAPVAALIMLGMIVTMMFDFIPVAPVTAVIIAALLMVLTGCFRNVEEAYKTINWES IVLIAGMLPMSLALEKTGASEVVSQSLVNGLGAYGPFALLAGIYFTTSLMTMFISNTATA VLLAPIALHSAVQLGLSPYPFLFAVTVAASMCFASPFSTPPNALVMPAGRYTFMDYVKVG LPLQIIMGVVMVFVLPLLFPF >gi|226332055|gb|ACIB01000001.1| GENE 238 276286 - 276882 739 198 aa, chain - ## HITS:1 COG:DR0189 KEGG:ns NR:ns ## COG: DR0189 COG0526 # Protein_GI_number: 15805225 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Deinococcus radiodurans # 55 173 47 160 185 77 36.0 2e-14 MKKNFFYAGGLSLLLWGMAACSGQGKADKAAVVADSVVVKTDSVAADSTGYIVKVGESAP DFTITLTDGKQMKLSELRGKVVMLQFTASWCGVCRKEMPFIEKDIWLKHKNNPEFALIGI DRDEPLDKVIAFGKSVGVTYPLGLDPGADIFAKYALRESGITRNVLIDREGKIVKLTRLY NEEEFASLVDQIDEMLKK Prediction of potential genes in microbial genomes Time: Tue May 17 21:54:22 2011 Seq name: gi|226332054|gb|ACIB01000002.1| Bacteroides sp. 
3_2_5 cont1.2, whole genome shotgun sequence Length of sequence - 15892 bp Number of predicted genes - 18, with homology - 18 Number of transcription units - 2, operones - 2 average op.length - 9.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 5/0.000 - CDS 1 - 235 225 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 2 1 Op 2 11/0.000 - CDS 264 - 1070 559 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 3 1 Op 3 1/0.000 - CDS 1080 - 2072 533 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 4 1 Op 4 . - CDS 2059 - 3315 591 ## COG0677 UDP-N-acetyl-D-mannosaminuronate dehydrogenase 5 1 Op 5 . - CDS 3326 - 4363 201 ## BF1440 putative capsular polysaccharide polymerase 6 1 Op 6 . - CDS 4370 - 5626 414 ## BF1439 oligosaccharide repeat-containing polymerase 7 1 Op 7 . - CDS 5613 - 6797 446 ## BF1438 hypothetical protein 8 1 Op 8 . - CDS 6794 - 7318 270 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 9 1 Op 9 . - CDS 7305 - 8075 152 ## BF1437 hypothetical protein - Prom 8303 - 8362 7.0 10 2 Op 1 . - CDS 8490 - 9539 464 ## COG1208 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 11 2 Op 2 1/0.000 - CDS 9554 - 10195 164 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 12 2 Op 3 3/0.000 - CDS 10198 - 10920 187 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 13 2 Op 4 . - CDS 10917 - 12134 477 ## COG0318 Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 14 2 Op 5 . - CDS 12136 - 12357 276 ## BF1432 putative acyl carrier protein 15 2 Op 6 7/0.000 - CDS 12363 - 13505 751 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 16 2 Op 7 . 
- CDS 13517 - 14725 929 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases 17 2 Op 8 . - CDS 14756 - 15229 420 ## BF1368 putative LPS biosynthesis related transcriptional regulatory protein 18 2 Op 9 . - CDS 15249 - 15767 391 ## BF1428 putative transcriptional regulatory protein UpxY-like protein Predicted protein(s) >gi|226332054|gb|ACIB01000002.1| GENE 1 1 - 235 225 78 aa, chain - ## HITS:1 COG:SP1837 KEGG:ns NR:ns ## COG: SP1837 COG0399 # Protein_GI_number: 15901666 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 3 78 6 81 408 88 56.0 2e-18 MKIPFSPPYIDEAVINEVVDSLRSGWITSGPKVKALEEEIKSFSGAKEVLCVNSWTSGAI MMLRWLGVKEGDEVIVPA >gi|226332054|gb|ACIB01000002.1| GENE 2 264 - 1070 559 268 aa, chain - ## HITS:1 COG:CAC2175 KEGG:ns NR:ns ## COG: CAC2175 COG0463 # Protein_GI_number: 15895444 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Clostridium acetobutylicum # 1 229 1 232 333 107 31.0 3e-23 MLYVIMSLYAKDTLLKVQEAIESLLNQIYTPFYLYAICDGPIKSDVASYVANLSTNHVHI STRERNLGLAYSLNELLNTVLKKEDCTYIARMDADDISMPERFVKQIAFMNSHPDIDCLG TWAIEIDDDGKEYFRKKMPITHEECLELFKKRDCMIHPTVMFRRSYFEKAGLYPEDTYFG EDTMMWAKGFKSGCKFANVPEYLFKFRLDSNFFERRRGWKHAKSIYTLRHRVNRMLGFGW KEDCYALLYAMAKLMPKSILDIIYKTVR >gi|226332054|gb|ACIB01000002.1| GENE 3 1080 - 2072 533 330 aa, chain - ## HITS:1 COG:BH3713 KEGG:ns NR:ns ## COG: BH3713 COG0463 # Protein_GI_number: 15616275 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Bacillus halodurans # 13 200 3 186 303 110 37.0 3e-24 MDDCKNNMVSGLVSVIIPTYKRPNMLGRAIDSVLEQSYSNIEVIVVDDNSDGDKYRLETI QYMERYADDPRVKYIKHKINQSGSAARNTGIQNSVGEYIAFLDDDDYFFKDRVKEAILFL RNSNVDCGGACCNYVKKYKKKIYKVSKNVGCFDSCYELLSAQIDYAAGSTLMIKREVLNK 
VGLFDVSFKRHQDWEFLIRLFRFYHIEILPYVGVVICADGIRNTPRTDVLLEVKKKLLQT FVTDIEMLDGQKQADIYRVQWKEIIYNYLKEKRYGTAISFAQKNISWTSFEKKDILNIIL SCVVGVIPSFMIVVYSIYDLKFNRFKKDII >gi|226332054|gb|ACIB01000002.1| GENE 4 2059 - 3315 591 418 aa, chain - ## HITS:1 COG:PM1003 KEGG:ns NR:ns ## COG: PM1003 COG0677 # Protein_GI_number: 15602868 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetyl-D-mannosaminuronate dehydrogenase # Organism: Pasteurella multocida # 1 418 6 424 424 496 57.0 1e-140 MKIAVIGLGYVGLPLARLFSTKYQTIGFDMNQKRVESLMGGHDTTLEVTDVLLQAALDSG FKCTSNLEDIRDCNFYVIAVPTPVDRNNRPDLTPLISASETVGKVISKGDIVVYESTVYP GVTEEECLPVVERVSGLKFNLDFFAGYSPERINPGDKEHTVEKIKKVTSGSTPEIADIVD KVYNSVLINGTHKAPSIRVAEASKIIENSQRDVNIAFMNELAKIFNAMDIDTNDVIEAAS SKWNFIKLRPGLVGGHCISVDPYYLIQKAQVYGVLPRIMSSARRLNDGMGDYVANQVIKL MNKKGVLVKDSNILLLGITFKENCPDIRNTKVVDIYSTLLEYTKNISVYDPWANAEKVYE EYGIRMIDNVNEKKYDAIILAVGHNEFKTIDIKALGNDISVVFDVKCFLDRAIVDGRL >gi|226332054|gb|ACIB01000002.1| GENE 5 3326 - 4363 201 345 aa, chain - ## HITS:1 COG:no KEGG:BF1440 NR:ns ## KEGG: BF1440 # Name: not_defined # Def: putative capsular polysaccharide polymerase # Organism: B.fragilis # Pathway: not_defined # 1 345 1 345 345 453 99.0 1e-126 MFPYFLIYFFLLIFCSLGSVTKNRFFLICVFILFSIFSGFRYYVGVDYVNYVKIYNLEEG YGSRELGFNLILDFLRYIGASYQFMFFIMAVVMQILVYNIIKRYNYSVWISVFIYYCISP FYIATFNGMRQYLAIAVFIVALKYIEQKKIFKYIVSLLLGGFFFHESILVFIPLYYILNK TISIKGKLLAFLLTIAGSLAIDKLISYTPYIVYLTRDRETHISSFTYIFAGISILFIIFW NKLNSFKSKLIMENMNLFCFLSLLVVLLQSNGVLIQMTLRMNSYFFFVYIILVPAVISSI KNVHMRIGMYFSLHLVLLLYLVRTICFNGHLYDLVPYSMNFNLFK >gi|226332054|gb|ACIB01000002.1| GENE 6 4370 - 5626 414 418 aa, chain - ## HITS:1 COG:no KEGG:BF1439 NR:ns ## KEGG: BF1439 # Name: not_defined # Def: oligosaccharide repeat-containing polymerase # Organism: B.fragilis # Pathway: not_defined # 1 418 1 418 418 621 95.0 1e-176 MKRINSDFISGFIKRGGFSVFFSTALVKISAALLSIIVVQLLTKEDYGILSYVLSIYAIA IVIAGFGGNYSLLRFGSIVNSFLKKKQYYYYTQRIGIKYTSVVVAIIVVYSCFLPEGMKN 
AQPLVIYMCLGLYSFYMLETMRSYFRIVNLNRTYSKLNVYNSVILLLLTVCLTLFFKNYG YITALILAPLLTFFLFRKKISYIQFSNQIEISKKEYWGYGIHTSVSAIANQIIFSIAPLL LGILNEPEHSIASFRVATIIPFNLLTLPGILMISDFSYLSRNYMDSNCLKRYYYNYLKVV IPISFVVFLLLIIYGDFVVENLFGAQYNDCVYMYKMFMVATFVTYIFRNPLGNILLAVGK AKWNGYNTYAFCFLYIIFSLIFYQYWGAIAIVYGLCATFILSGIVSLFLFYYYIKGLR >gi|226332054|gb|ACIB01000002.1| GENE 7 5613 - 6797 446 394 aa, chain - ## HITS:1 COG:no KEGG:BF1438 NR:ns ## KEGG: BF1438 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 393 82 474 475 620 77.0 1e-176 MKRMCLSLIGDICFPEEFKQIFTLDGKFLITRSLGSDWCFEMSGESMFYALCEIEKSQKE SLFDYKLSIIEAIDRRIIWGNGKLLHRSFTGDYDIQFRSTNSALRTLLYACEDGFDTDKN IRIIANEQFKYFFEWKEGIWFCHDSSEYGGKTPFSHIRSKVAGKSWRNTLTLNTHIDSLN TLLLLKFYRKEYLLNFDLDYWINKGLISINQLLGLTNKGKIMNKLQEIDNYCLNLFLKEQ HNSFWIDCVYEKIVHPIIFKFVFPTVFFNNGFISRDLSVLNRHIDYQLVNIVDFARLLSL YKKNRETEILTASILNYNEIEDRLERAIEFVESNKYLLNYIQSSDLQRAWYAEMYYAMSN INSKYIKIAQNLYSDQLYCSESPFYKLDFLNEKN >gi|226332054|gb|ACIB01000002.1| GENE 8 6794 - 7318 270 174 aa, chain - ## HITS:1 COG:L1734467 KEGG:ns NR:ns ## COG: L1734467 COG0110 # Protein_GI_number: 15673662 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Lactococcus lactis # 24 129 79 187 203 78 41.0 5e-15 MIINKFFKKVYHLIYPPRVRFKHGQRFYSNSLIDTLFPELVEIGDDFISAPGSIILAHDA STLWHTGKYRVQKTKIGNRVFLGANSIILPGVVVGDDVIIGSGSVVTKDIASNSVVAGNP ARVVSTISEYIDKCEQRKILYTPDEKFIDLITRGEIITEERQNELRDTIYTQLK >gi|226332054|gb|ACIB01000002.1| GENE 9 7305 - 8075 152 256 aa, chain - ## HITS:1 COG:no KEGG:BF1437 NR:ns ## KEGG: BF1437 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 256 134 389 392 498 98.0 1e-140 MTVLFNRADHITIASDKVIEEFKTCYQKSSFLSKIVLCRFGLEPLESLISILRSGANARI SKSKIGLCADKIVITIGYNASRLQHHIDIIENIERSPLLSPFHDKVEFLLPVTYPEDAEY IGIIKKTVLNSKFHYNVIEQFLSDEDIAHLRVASDIFIQLQPTDMLSGSMLEHLSAQNIV ITGSWLPYDCLDQWGVFYRKIDCSELISNELYDVLNKFVSYKTLTCKNSEIIIDKFRWNN VIQDWLDLYINRYDNK 
>gi|226332054|gb|ACIB01000002.1| GENE 10 8490 - 9539 464 349 aa, chain - ## HITS:1 COG:Cj1329_2 KEGG:ns NR:ns ## COG: Cj1329_2 COG1208 # Protein_GI_number: 15792652 # Func_class: M Cell wall/membrane/envelope biogenesis; J Translation, ribosomal structure and biogenesis # Function: Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) # Organism: Campylobacter jejuni # 126 344 6 221 228 139 35.0 1e-32 MDYQQLIVDINVSILHTLEQMDKCNRKLLLVFSEGEYFGLVSIGDIQRAIIGNKSLDTPI KNILRDRIITADDTLSLDEIRTLMMSYRMEFIPILNTERKLVKVLMWSELFPNEDNAPID QFDLPIVIMAGGQGTRLKPLTNIIPKPLIPIGEKTFMEDIMDRFVKCGSNNFYVSVNYKA DVIKHYFSTLRDSSYRINYFQENVPLGTAGSLTLMRDKIHTTFFVSNCDIIINEDYSQIL KYHKENKNELTVVAALKNYPIAYGVLYTKENGLLDSIVEKPDLTFKINTGLYILEPNLLD EIPEGQFYHITSLIDKLRKENRRIGVFPVSEKSWIDVGNWNEYFSIINK >gi|226332054|gb|ACIB01000002.1| GENE 11 9554 - 10195 164 213 aa, chain - ## HITS:1 COG:Cgl0360 KEGG:ns NR:ns ## COG: Cgl0360 COG0110 # Protein_GI_number: 19551610 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Corynebacterium glutamicum # 65 198 75 208 215 73 34.0 4e-13 MKPTLFIVGGSSTALEIRETVDQFFKDKYFAVYNVISDEEPPILQTYIRDSSLLDRLSTI EVSHYIIGFTNRTLRLRFVDLFGGFNSVLDNVIHPSSYISPSAILGKGNYLAANAVISSN ALIGNSNLINYNVTIGHDVVVGSDCFFNPGARISGNVKIGNGCLFGANSFVFQGLEIKDD CQIDALCYIDRVIEANSMCTSNGGSLRVYRKRY >gi|226332054|gb|ACIB01000002.1| GENE 12 10198 - 10920 187 240 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 3 236 6 237 242 76 30 1e-13 MNILVTGCSRGVGLEICRVLLEEGHVVYGVSRSYSNEFRILEEKYESKLFFKSVDLSDTT NIHKIIFKEFLTNSVLLDGYVNNAAVAYDDIITNLQIDKLRAMYNVNVFSPMLLTKYAIR NMLLHHTQGSIIHISSISVHTGYKGLSMYASSKGALEAFSKDTAREWGQLGIRSNVVVPG FMETAMSSSLTEEQKDRIYKRTSLKQATDVRSVAETVAFLLSKKACSITGQNIFVDNGTI >gi|226332054|gb|ACIB01000002.1| GENE 13 10917 - 12134 477 405 aa, chain - ## HITS:1 
COG:BH2006 KEGG:ns NR:ns ## COG: BH2006 COG0318 # Protein_GI_number: 15614569 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II # Organism: Bacillus halodurans # 285 405 381 501 513 95 37.0 2e-19 MGLFLVDQHKSYTYEDLLQRINATKSYIPLMQTNNLYDFFLNMILSLISDQPLILLDSDT KMNELEGIENTDINEPKNILFDDKADIDDLINSVLTSKSEITIFTSGTTGQPKKVVHTIG TLTRSTRISFKYHSQIWAFAFNPTHMAGLQVFFQAFANKNALINVFNKNRVEVYELLSEY HVTHISATPTFYRLLLPVEKSYPEVCRVTLGGEKSNTKLYESMLLIFPNARINNIYASTE AGSLFAAKGDLFQIPVELKDKIKVEDEELLIHNSLLGKSDSFSCVGDYYHSGDLIEWMDS DKGLFRFKSRKNELINVGGYKINPGEVESVIMQFDGVQQAFVYGKPNSVLGNILCAEIKI EEGCSLSELDIRHWLADKLQEFKIPRKIKFVENIALTRTGKLKRT >gi|226332054|gb|ACIB01000002.1| GENE 14 12136 - 12357 276 73 aa, chain - ## HITS:1 COG:no KEGG:BF1432 NR:ns ## KEGG: BF1432 # Name: not_defined # Def: putative acyl carrier protein # Organism: B.fragilis # Pathway: not_defined # 1 73 1 73 73 101 100.0 6e-21 MEEKILIIINEIRKAKNLSELSVINLSDTLRDDIGFTSFDLAELTVRIEDEFDIDIFEDG LVNTVGEIIEKLR >gi|226332054|gb|ACIB01000002.1| GENE 15 12363 - 13505 751 380 aa, chain - ## HITS:1 COG:Cj1320 KEGG:ns NR:ns ## COG: Cj1320 COG0399 # Protein_GI_number: 15792643 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Campylobacter jejuni # 1 380 1 379 384 378 47.0 1e-105 MYNNITTFIHNLFGTDEFVPLHAPLFIGNEKKYLAECIDTTFVSSVGKFVDRFEELVACY TGSKRAVVCVSGTNALHMGMLLVGVERDDEVLTQALTFIATCNAISYIGAHPVFLDVDRD TLGLSPLAVKRWLSGHAEVRNGQCYNKKTGRRIKACVPMHTFGHPMKIDELSVVCNEYHI ELVEDAAESIGSFYKGRHTGTFGRVGAISFNGNKTITTGGGGMLLFQDEELGKFAKHLTT QAKVPHRWAFVHDHIGYNYRMPNINAALGCAQMENLDRYVSNKRETAERYREFFSHIPDV EFVVEPANSRSNYWLNAVLLKDRRAQQSFLEYTNDHGVMTRPVWELMNRLEMFRGCETDG LENTVWLEERIVNIPSSVRL >gi|226332054|gb|ACIB01000002.1| GENE 16 13517 - 14725 929 402 aa, chain - ## HITS:1 COG:FN1696 KEGG:ns NR:ns ## COG: FN1696 COG1086 # 
Protein_GI_number: 19705017 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Fusobacterium nucleatum # 3 266 238 507 607 117 31.0 4e-26 MLNVDNFIGDHITFRRSSMFAPDIAANSDRLRQEVEGKSLLVIGGAGSIGSSYIKAILPF KPSKLVVIDLNENGLAELTRDLRSTYGLYIPDEYRTYTLNFADPIFERMFRKEQGFDIVA NFSAHKHVRSEKDEYSVQALIENNVIKAKKLLDLLSEFPPHHFFCVSTDKAANPVNIMGA SKRIMEDMIMAYSSKFKVTTARFANVAFSNGSLLAGFIDRIMKKQPLAAPNDVKRYFVSP EESGQICMLACILGNNGEIFFPKLGEEKMITFSSICDRFLRTLGYEKKECATDEEARRYA AEMADDSKIYPVVYFKSDTTGEKGYEEFYVPGERLNLERFSSLGVIEDVSKRPLSELDSF FDELESLFAFPDCHKSDIVTALKRFLPNFEHVEKGKNLDQKM >gi|226332054|gb|ACIB01000002.1| GENE 17 14756 - 15229 420 157 aa, chain - ## HITS:1 COG:no KEGG:BF1368 NR:ns ## KEGG: BF1368 # Name: upaZ # Def: putative LPS biosynthesis related transcriptional regulatory protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 157 1 157 157 293 100.0 1e-78 MNQVQNLQHIARELLYLGMDGSPIYTDHFRQLNTEVFRLSEALFSMKGATSEEEAAICLS LLMGYNATIYNDGDKESKIQSILDRSFAVLDHLPASLLKCQLLTYCYGEVFEEDLAQEAH QIMDSWKNRALSEEELEVMETLQTMEDNRYPCSEVED >gi|226332054|gb|ACIB01000002.1| GENE 18 15249 - 15767 391 172 aa, chain - ## HITS:1 COG:no KEGG:BF1428 NR:ns ## KEGG: BF1428 # Name: not_defined # Def: putative transcriptional regulatory protein UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 1 172 1 172 172 337 100.0 1e-91 MSEQQEYWFAARTKKDQEFSVRNALEKLGIEYFLPTQFVIRQLKYRRRRVEVPVIKNLIF VRTTKDRAWSITKDDHVPLYYMKDLYTHTLLIVPNKQMEDFKFVMDLAPENVTFDDLPLT VGTKVQVVKGEFCGIEGELSSLANRTYVVIRIHGVLSASVKVPKSYLRILSA Prediction of potential genes in microbial genomes Time: Tue May 17 21:55:30 2011 Seq name: gi|226332053|gb|ACIB01000003.1| Bacteroides sp. 
3_2_5 cont1.3, whole genome shotgun sequence Length of sequence - 171756 bp Number of predicted genes - 130, with homology - 127 Number of transcription units - 72, operones - 36 average op.length - 2.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 333 - 392 5.2 1 1 Tu 1 . + CDS 458 - 628 59 ## BF1426 hypothetical protein 2 2 Tu 1 . - CDS 963 - 1967 992 ## BF1363 hypothetical protein - Prom 2026 - 2085 2.1 3 3 Op 1 . - CDS 2087 - 3100 776 ## BF1423 hypothetical protein 4 3 Op 2 . - CDS 3097 - 4191 801 ## BF1422 hypothetical protein - Prom 4327 - 4386 7.6 + Prom 4332 - 4391 4.7 5 4 Tu 1 . + CDS 4411 - 7440 2729 ## BF1360 hypothetical protein 6 5 Tu 1 . - CDS 7829 - 8422 588 ## BF1359 hypothetical protein - Prom 8456 - 8515 5.5 - Term 9131 - 9172 -0.8 7 6 Tu 1 . - CDS 9190 - 9933 505 ## COG3177 Uncharacterized conserved protein - Prom 9954 - 10013 5.7 + Prom 10715 - 10774 3.8 8 7 Op 1 . + CDS 10803 - 11708 462 ## gi|253563305|ref|ZP_04840762.1| predicted protein 9 7 Op 2 1/0.077 + CDS 11702 - 13204 312 ## COG0827 Adenine-specific DNA methylase 10 7 Op 3 . + CDS 13286 - 14470 313 ## COG0582 Integrase 11 7 Op 4 . + CDS 14507 - 15781 513 ## BDI_0498 integrase + Term 15939 - 15980 6.3 + Prom 15851 - 15910 4.8 12 8 Tu 1 . + CDS 16081 - 16452 107 ## gi|253563309|ref|ZP_04840766.1| predicted protein + Prom 16505 - 16564 3.5 13 9 Op 1 . + CDS 16647 - 17123 409 ## gi|253563310|ref|ZP_04840767.1| predicted protein 14 9 Op 2 . + CDS 17188 - 18222 421 ## gi|253563311|ref|ZP_04840768.1| predicted protein + Prom 18919 - 18978 6.3 15 10 Tu 1 . + CDS 18998 - 20200 390 ## Ppha_2135 RES domain protein + Term 20237 - 20281 -1.0 - TRNA 20276 - 20350 52.4 # Cys GCA 0 0 - Term 20224 - 20270 5.2 16 11 Op 1 . - CDS 20402 - 20614 154 ## BF1354 hypothetical protein 17 11 Op 2 . - CDS 20611 - 23094 2383 ## COG0370 Fe2+ transport system protein B - Prom 23115 - 23174 5.9 + Prom 23110 - 23169 4.5 18 12 Tu 1 . 
+ CDS 23219 - 24526 776 ## COG0037 Predicted ATPase of the PP-loop superfamily implicated in cell cycle control + Prom 24570 - 24629 6.3 19 13 Op 1 7/0.000 + CDS 24785 - 30379 3948 ## COG2373 Large extracellular alpha-helical protein 20 13 Op 2 . + CDS 30388 - 32727 1026 ## COG4953 Membrane carboxypeptidase/penicillin-binding protein PbpC 21 13 Op 3 . + CDS 32752 - 33957 705 ## COG0477 Permeases of the major facilitator superfamily 22 13 Op 4 . + CDS 33961 - 34389 246 ## BF1412 hypothetical protein + Prom 34474 - 34533 2.5 23 14 Op 1 . + CDS 34718 - 35596 950 ## COG1814 Uncharacterized membrane protein 24 14 Op 2 . + CDS 35621 - 37051 968 ## COG1757 Na+/H+ antiporter 25 14 Op 3 . + CDS 37120 - 37710 695 ## BF1409 hypothetical protein + Prom 37735 - 37794 6.4 26 15 Tu 1 . + CDS 37816 - 38493 480 ## COG2197 Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain 27 16 Op 1 . - CDS 38618 - 39679 864 ## COG0482 Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain 28 16 Op 2 . - CDS 39684 - 40970 1175 ## COG2873 O-acetylhomoserine sulfhydrylase - Prom 41116 - 41175 6.8 + Prom 40932 - 40991 5.2 29 17 Tu 1 . + CDS 41140 - 41964 661 ## BF1404 hypothetical protein + Term 42002 - 42075 25.3 - Term 41990 - 42060 24.0 30 18 Op 1 . - CDS 42181 - 42999 726 ## COG0627 Predicted esterase 31 18 Op 2 . - CDS 43031 - 43468 255 ## BF1402 hypothetical protein - Prom 43549 - 43608 5.6 + Prom 43368 - 43427 5.0 32 19 Op 1 . + CDS 43575 - 45947 1747 ## BF1336 putative TonB dependent outer membrane protein 33 19 Op 2 . + CDS 45989 - 46789 934 ## COG0501 Zn-dependent protease with chaperone function + Term 46817 - 46865 12.2 + Prom 46866 - 46925 2.8 34 20 Op 1 . + CDS 46946 - 48235 1421 ## COG1253 Hemolysins and related proteins containing CBS domains 35 20 Op 2 . + CDS 48272 - 48745 577 ## BF1333 hypothetical protein + Term 48778 - 48817 6.5 - Term 48766 - 48805 1.2 36 21 Tu 1 . 
- CDS 48831 - 50420 1521 ## COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases - Prom 50457 - 50516 4.8 + Prom 50699 - 50758 6.0 37 22 Op 1 . + CDS 50805 - 51203 233 ## BF1396 hypothetical protein 38 22 Op 2 . + CDS 51255 - 53828 1758 ## BF1327 hypothetical protein - Term 53824 - 53848 -1.0 39 23 Tu 1 . - CDS 54010 - 54477 526 ## BF1326 putative transmembrane protein - Prom 54545 - 54604 9.2 - TRNA 55262 - 55346 48.3 # Ser TGA 0 0 + Prom 55506 - 55565 6.1 40 24 Op 1 11/0.000 + CDS 55601 - 57829 2471 ## COG1882 Pyruvate-formate lyase 41 24 Op 2 . + CDS 57834 - 58559 400 ## COG1180 Pyruvate-formate lyase-activating enzyme - Term 58695 - 58758 18.0 42 25 Op 1 . - CDS 58792 - 60051 815 ## BF1321 hypothetical protein 43 25 Op 2 . - CDS 60106 - 60654 470 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 60812 - 60871 5.7 + Prom 60710 - 60769 4.3 44 26 Tu 1 . + CDS 60842 - 61594 591 ## BF1335 hypothetical protein + Term 61660 - 61702 9.0 - Term 61648 - 61687 3.0 45 27 Tu 1 . - CDS 61699 - 62625 888 ## COG0583 Transcriptional regulator - Prom 62757 - 62816 10.1 + Prom 62660 - 62719 5.4 46 28 Tu 1 . + CDS 62767 - 63240 656 ## COG0783 DNA-binding ferritin-like protein (oxidative damage protectant) + Term 63294 - 63340 11.2 - Term 63277 - 63332 20.1 47 29 Op 1 . - CDS 63344 - 64342 950 ## COG2152 Predicted glycosylase 48 29 Op 2 . - CDS 64397 - 65665 1330 ## COG0477 Permeases of the major facilitator superfamily - Prom 65717 - 65776 4.7 - Term 65842 - 65885 7.4 49 30 Op 1 . - CDS 65915 - 66928 771 ## BF1314 hypothetical protein 50 30 Op 2 . - CDS 66951 - 68087 866 ## BF1329 hypothetical protein 51 30 Op 3 . - CDS 68105 - 69157 1148 ## BF1328 putative secreted endoglycosidase 52 30 Op 4 . - CDS 69185 - 70729 1380 ## BF1327 hypothetical protein 53 30 Op 5 . - CDS 70742 - 74047 2795 ## BF1326 hypothetical protein - Prom 74263 - 74322 10.9 - Term 74279 - 74330 11.9 54 31 Tu 1 . 
- CDS 74357 - 75367 704 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 75388 - 75447 5.8 + Prom 75364 - 75423 7.1 55 32 Tu 1 . + CDS 75533 - 76132 457 ## BF1324 RNA polymerase ECF-type sigma factor + Term 76201 - 76235 1.4 - Term 75987 - 76027 8.2 56 33 Op 1 . - CDS 76109 - 78124 1775 ## BF1307 putative alpha-glucosidase protein 57 33 Op 2 . - CDS 78147 - 79004 864 ## COG3568 Metal-dependent hydrolase 58 33 Op 3 4/0.000 - CDS 79075 - 80139 674 ## COG0318 Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 59 33 Op 4 . - CDS 80183 - 81202 646 ## COG4948 L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily - Term 81205 - 81247 5.1 60 33 Op 5 . - CDS 81251 - 82075 1026 ## COG0447 Dihydroxynaphthoic acid synthase 61 33 Op 6 . - CDS 82097 - 83098 914 ## BF1302 hypothetical protein 62 33 Op 7 10/0.000 - CDS 83103 - 84770 1305 ## COG1165 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase 63 33 Op 8 . - CDS 84791 - 85897 655 ## COG1169 Isochorismate synthase 64 33 Op 9 . - CDS 85894 - 87126 1208 ## COG0561 Predicted hydrolases of the HAD superfamily - Prom 87190 - 87249 4.2 - Term 87614 - 87663 9.0 65 34 Op 1 . - CDS 87693 - 89267 1454 ## COG4108 Peptide chain release factor RF-3 66 34 Op 2 . - CDS 89292 - 90155 892 ## COG1091 dTDP-4-dehydrorhamnose reductase - Prom 90223 - 90282 3.7 67 35 Op 1 . - CDS 90290 - 90835 454 ## BF1309 hypothetical protein 68 35 Op 2 . - CDS 90854 - 91507 445 ## COG1280 Putative threonine efflux protein 69 35 Op 3 . - CDS 91549 - 93813 1665 ## COG0475 Kef-type K+ transport systems, membrane components - Term 93830 - 93876 6.1 70 36 Op 1 . - CDS 93889 - 94632 697 ## BF1306 putative lipoprotein 71 36 Op 2 . - CDS 94614 - 96224 554 ## BF1305 hypothetical protein - Prom 96303 - 96362 5.7 + Prom 96318 - 96377 7.2 72 37 Tu 1 . + CDS 96398 - 96610 112 ## - Term 97287 - 97339 14.2 73 38 Op 1 . - CDS 97360 - 98445 1195 ## COG0404 Glycine cleavage system T protein (aminomethyltransferase) 74 38 Op 2 . 
- CDS 98479 - 99702 1223 ## COG2195 Di- and tripeptidases - Prom 99728 - 99787 3.9 - Term 99751 - 99793 12.6 75 39 Op 1 . - CDS 99835 - 101241 1055 ## COG0034 Glutamine phosphoribosylpyrophosphate amidotransferase 76 39 Op 2 . - CDS 101325 - 101687 361 ## COG3189 Uncharacterized conserved protein - Prom 101728 - 101787 4.8 + Prom 101998 - 102057 6.0 77 40 Tu 1 . + CDS 102201 - 103337 1141 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins + Term 103363 - 103401 5.4 - Term 103351 - 103389 5.4 78 41 Op 1 . - CDS 103397 - 103828 217 ## COG0735 Fe2+/Zn2+ uptake regulation proteins 79 41 Op 2 . - CDS 103831 - 105777 1616 ## COG2217 Cation transport ATPase - Prom 105833 - 105892 4.1 80 42 Tu 1 . - CDS 105896 - 106930 796 ## COG2855 Predicted membrane protein - Prom 106956 - 107015 3.8 81 43 Tu 1 . - CDS 107095 - 109473 1582 ## COG0642 Signal transduction histidine kinase - Prom 109513 - 109572 7.9 + Prom 109452 - 109511 6.8 82 44 Tu 1 . + CDS 109712 - 111241 1008 ## COG3119 Arylsulfatase A and related enzymes + Term 111284 - 111356 12.6 - Term 111170 - 111210 -0.9 83 45 Tu 1 . - CDS 111359 - 111682 336 ## COG0526 Thiol-disulfide isomerase and thioredoxins - Prom 111749 - 111808 4.0 84 46 Op 1 . - CDS 111858 - 112670 764 ## COG0566 rRNA methylases 85 46 Op 2 . - CDS 112750 - 113790 592 ## COG1063 Threonine dehydrogenase and related Zn-dependent dehydrogenases - Prom 113982 - 114041 8.3 + Prom 113819 - 113878 5.7 86 47 Op 1 . + CDS 114073 - 114642 306 ## BF1288 hypothetical protein 87 47 Op 2 . + CDS 114712 - 115626 461 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily - Term 115860 - 115896 -0.7 88 48 Tu 1 . - CDS 115968 - 117026 768 ## BF1286 hypothetical protein + Prom 117687 - 117746 5.8 89 49 Op 1 . + CDS 117769 - 119061 961 ## BVU_2469 tyrosine type site-specific recombinase 90 49 Op 2 . + CDS 119075 - 120091 562 ## BVU_2468 hypothetical protein + Prom 120113 - 120172 1.9 91 50 Op 1 . 
+ CDS 120314 - 120622 244 ## BVU_2467 hypothetical protein 92 50 Op 2 . + CDS 120635 - 121708 610 ## BF1280 hypothetical protein + Prom 121739 - 121798 3.0 93 51 Tu 1 . + CDS 122037 - 123005 647 ## BF1279 DNA primase + Prom 123034 - 123093 2.0 94 52 Tu 1 . + CDS 123189 - 124529 826 ## BVU_2464 mobilization protein - Term 124565 - 124613 10.1 95 53 Op 1 . - CDS 124663 - 125160 262 ## BF1270 hypothetical protein 96 53 Op 2 . - CDS 125184 - 126152 438 ## BVU_2456 hypothetical protein 97 53 Op 3 . - CDS 126182 - 126304 88 ## 98 53 Op 4 . - CDS 126317 - 127096 352 ## BVU_2455 hypothetical protein - Prom 127165 - 127224 8.0 99 54 Op 1 . - CDS 127422 - 127631 88 ## BF1275 putative hemolysin 100 54 Op 2 . - CDS 127678 - 128517 328 ## BF1274 hypothetical protein - Prom 128636 - 128695 9.0 - TRNA 128936 - 129023 48.9 # Ser TGA 0 0 + Prom 129073 - 129132 4.4 101 55 Op 1 . + CDS 129178 - 131676 2421 ## COG1193 Mismatch repair ATPase (MutS family) 102 55 Op 2 . + CDS 131689 - 132750 654 ## COG0598 Mg2+ and Co2+ transporters 103 55 Op 3 . + CDS 132807 - 134015 856 ## COG1760 L-serine deaminase + Term 134030 - 134098 11.8 - Term 134023 - 134079 8.1 104 56 Op 1 . - CDS 134119 - 134619 453 ## BF1264 hypothetical protein 105 56 Op 2 . - CDS 134657 - 135100 507 ## BF1263 hypothetical protein - Prom 135134 - 135193 2.9 106 57 Tu 1 . - CDS 135218 - 135772 537 ## BF1262 hypothetical protein - Prom 135819 - 135878 3.5 - Term 135842 - 135911 16.1 107 58 Tu 1 . - CDS 135921 - 136388 503 ## BF1212 hypothetical protein - Prom 136463 - 136522 5.1 + Prom 137233 - 137292 3.5 108 59 Op 1 11/0.000 + CDS 137329 - 137895 649 ## COG0450 Peroxiredoxin + Term 137909 - 137949 5.9 109 59 Op 2 . + CDS 137996 - 139546 409 ## PROTEIN SUPPORTED gi|148988049|ref|ZP_01819512.1| 30S ribosomal protein S9 110 60 Op 1 . - CDS 139696 - 140703 976 ## BF1208 putative endonuclease/exonuclease/phosphatase family protein 111 60 Op 2 . 
- CDS 140776 - 141675 837 ## COG1524 Uncharacterized proteins of the AP superfamily 112 60 Op 3 . - CDS 141717 - 142703 923 ## COG1409 Predicted phosphohydrolases - Term 142718 - 142769 1.2 113 61 Op 1 . - CDS 142791 - 144314 1476 ## BF1254 hypothetical protein 114 61 Op 2 . - CDS 144326 - 147712 3022 ## BF1253 hypothetical protein - Prom 147817 - 147876 6.0 115 62 Tu 1 . - CDS 147885 - 148850 708 ## BF1252 putative anti-sigma factor + Prom 149569 - 149628 3.3 116 63 Tu 1 . + CDS 149662 - 150564 621 ## COG2367 Beta-lactamase class A - Term 150606 - 150642 -0.3 117 64 Op 1 . - CDS 150748 - 151749 830 ## COG3507 Beta-xylosidase 118 64 Op 2 . - CDS 151806 - 152351 509 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 119 64 Op 3 . - CDS 152348 - 152494 65 ## - Prom 152720 - 152779 5.7 + Prom 152372 - 152431 8.5 120 65 Tu 1 . + CDS 152486 - 155023 2314 ## COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases + Term 155208 - 155244 -1.0 + Prom 155270 - 155329 6.2 121 66 Tu 1 . + CDS 155495 - 156955 1201 ## COG0753 Catalase + Term 156972 - 157020 14.9 - Term 156808 - 156842 -0.8 122 67 Op 1 17/0.000 - CDS 157052 - 157738 759 ## COG0569 K+ transport systems, NAD-binding component 123 67 Op 2 . - CDS 157754 - 159589 1485 ## COG0168 Trk-type K+ transport systems, membrane components - Prom 159672 - 159731 5.5 + Prom 159527 - 159586 7.3 124 68 Tu 1 . + CDS 159725 - 161092 1381 ## COG1350 Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) + Term 161110 - 161182 26.6 + Prom 161102 - 161161 1.9 125 69 Tu 1 . + CDS 161221 - 162057 909 ## COG0648 Endonuclease IV + Term 162131 - 162173 2.3 126 70 Tu 1 . - CDS 162051 - 163622 1192 ## COG0038 Chloride channel protein EriC + Prom 163886 - 163945 7.9 127 71 Tu 1 . + CDS 164034 - 165575 1013 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen + Prom 165590 - 165649 2.3 128 72 Op 1 . 
+ CDS 165694 - 166296 280 ## BT_4629 hypothetical protein 129 72 Op 2 6/0.000 + CDS 166329 - 169802 2053 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family 130 72 Op 3 . + CDS 169815 - 171756 1243 ## COG1002 Type II restriction enzyme, methylase subunits Predicted protein(s) >gi|226332053|gb|ACIB01000003.1| GENE 1 458 - 628 59 56 aa, chain + ## HITS:1 COG:no KEGG:BF1426 NR:ns ## KEGG: BF1426 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 56 1 56 56 99 92.0 3e-20 MLIEERYKDEDTGSDGVNSLPKLELSYSAGVCFFLSKQAKRTIINLEIKNLPPNLI >gi|226332053|gb|ACIB01000003.1| GENE 2 963 - 1967 992 334 aa, chain - ## HITS:1 COG:no KEGG:BF1363 NR:ns ## KEGG: BF1363 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 334 1 334 334 673 100.0 0 MNLKFLYLLLLISALCISCSKDEEPSDKGSTSPQEPVYTTFTDAGEVVVPGVLPANFTPR SVRVKGDTLFVANTNAADRSVLLLNLTTGELIGRIDSWVRKGVKETFNAEIGDMAVSDRY IFVGMYNSRINIFDRRTLQFVNAIGRSDGKWGDDIYSMTHCYGLRECGERLMVRDKNTIR GYWIYEAVTEPAFRVPWIGKVKVPEGVGYDYQARIHGMAEYDGRMYLTDWYNKSVQVFTP SKMEIVFGEETHITSDAVFKYDDIQPLGLLAYGGELLMSVQKSGKIQRYDPETGDLLGVL AEFPGKEIGRMEIARGMLYYIDLKAGKLMKAKGE >gi|226332053|gb|ACIB01000003.1| GENE 3 2087 - 3100 776 337 aa, chain - ## HITS:1 COG:no KEGG:BF1423 NR:ns ## KEGG: BF1423 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 337 1 337 337 653 99.0 0 MMKRHNILTVLAVLLLTFAACDKNSDPGFSFTTDSILTLTFDKELDAAFVGVGTQNYNPV GMRRSGDTLFIANRAEGSDGVWVVCASTGELLYSLTGWTYNGKNEKFDNQVMDVAVSSDY IFVVNRSSRIDLFRRNDYSYVTTIGRTGWQSSSLLQCEAAEVAGDKLFIRDKQKIKVVQI SDCTPENRFKVPVFAQNTDSTSSNNGFNLESVARHEGLIYVSDYETSRILVIDPATVAVK GEPVRFLRSYRMPNKPLSMGFFQNEMYVVCANNRIVRVDLRTGKELGSYSSFAGGVGLGT PGRLFFHNDTFYLAGRNANAPRLIQGKVMFVEISELD >gi|226332053|gb|ACIB01000003.1| GENE 4 3097 - 4191 801 364 aa, chain - ## HITS:1 COG:no KEGG:BF1422 NR:ns ## KEGG: BF1422 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 364 1 364 364 
728 99.0 0 MNRKQILCISLLAFSLCACQDDDTADILDAFGTPPALTKMTAQEFLDQGEQMFEDGESVD EIVSADRAFYVNNPFNVSMADGKLRLFNGMAHDFKEVSLWLTMPHLRDTIQLMQFEEVPG FYNIKLDSPLKMGEVVYASKSGKPVRVNNLALLSPDQYELLLSCNDSVFDMLKTIKMKTR IQMGKYGSGNWGVMTANAARYYATSAVNMAVMFSSEIFRDSLMNYKGSIHNDAGKEIDRE GLLNSILNKQSLVFGVVTGSGIAGLGGGNALGLREEYLAGFFYQNRVAIDCNWVLHVWIH ELGHCLGYGHSSSLCYGSVPDEIVPKVYRYMMKHRMLPYIINPFKSYNDYNPGINDADNP DIEL >gi|226332053|gb|ACIB01000003.1| GENE 5 4411 - 7440 2729 1009 aa, chain + ## HITS:1 COG:no KEGG:BF1360 NR:ns ## KEGG: BF1360 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1009 1 1009 1009 2083 99.0 0 MKKLFLWALSALLTLPAATQDFVPEASFYGENYWTPDTLGNHRAIVSVNTPASVAEAYIP WRRRDANPEQKGIIVINASTGKAVDNVLPVEINREYGRIRFDASGNAGDYYVYYLPYHTS GGPYPKVNYPQQPDKADPQWKATCKATPAGKAVQAKLVRFESLGSFNSFYPMEIIATAQE KQALIDANSNKPFLLLPEDRKYPIRMFDDLSYRQVTQGATGEFFGEADLNEYYVLQLGLW AFKNPVNGVKVTFTDLKGKNGSIPASALTCFNTEGTDWLGRPIHPEVNVGKGRVQPLWIG IQMPEHAGRGIYRGTVTVSDLSGASQEVNIAINLSDNVLVDKGDGDLWRLSRLRWLNSQY AVNNRPVKPFIPIKVADRTISVLGRSVTAGELGLPASIQSYFTEEMTSIGKEPKDILSQP MEFVIRQNGKPLPVTITSPLKFGKKEDGTVSWTATGKAGALDITVLASMEFDGFMNYQIK VKAAENTSVDDIALLTSMPATTAKYRLGMGYEGSLRPKSDQWKWNVERNQEGFWFGDVNA GMQCLFRAENYRRPLNTNFYKMQPLNMPPSWFNDGKGGISYKEKGNQVDIKTYSGSRTLQ KGEELNFDFLVLVTPFKPIDTMKQWTDHYYHGYQPAEKLKEGDAHAYQAVDVVGETGANV INLHHGNAVNPHINYPFFRPAFMKQYVDESHAKGYKVKIYYTVRELTNHAPELFALKSLG HEIFSPGKGGGYAWLQEHLDGDYIGAWFVDAYKDAAIVNTGISRWHNFYVEGLNWLTKNV GIDGVYIDDLAFDRNTMKRIRRVLESNRPDPRIDVHSANQFNPADGYINSIFLYMEHMPY LDRLWFGEYFKYEKSPEYWLTDVSGIPFGMMSEMLQDGGNPYRGMLYGMTAREPMESVPS QLWKVWDAFGIKDSRMMGYWVSYNPVKTGRNDILATSFVKDGKVMIAIASWAKKDSDIKL QIDWEKLGIDADKARLTAPDIKGFQEGFSLSPKDKIKVPKDKGFIFILE >gi|226332053|gb|ACIB01000003.1| GENE 6 7829 - 8422 588 197 aa, chain - ## HITS:1 COG:no KEGG:BF1359 NR:ns ## KEGG: BF1359 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 197 49 245 245 314 99.0 9e-85 MKKILFILFTVQFILIPRITGSYGANSIGNNHGHSVTNQGKKTIKKDIFGDTVIEDNHGN 
RKTIKKDIFGDTVIEDNRGNRKTIKKDIFGDTVIEDNRGNRKTIKKDIFGDTVIENNCGN RKTIKKDIFGDTVIEDNRGNRKSIKKDIFGNTVIENNKGYKKTIKTDIFGNKIIEDNHGK KQIIKKDIFGNVIIENY >gi|226332053|gb|ACIB01000003.1| GENE 7 9190 - 9933 505 247 aa, chain - ## HITS:1 COG:NMA2029 KEGG:ns NR:ns ## COG: NMA2029 COG3177 # Protein_GI_number: 15794909 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Neisseria meningitidis Z2491 # 20 236 54 267 290 114 34.0 1e-25 MKHIFNKEQCQKATFILESPLAKLSELYSSEIKDLAVVWCYYSGRIEGNTYTYVETEALL KDGITSEKKYEDAKMLKNLYNTFISELEYIHQEKNKEIIDERTLFRLHQSIFTGLVSNEE SGFLRTRAVRISGTDYAPPKDLQEIKSKLGEILYEQDVYTNPLEKAVFLHCNIARLQPFI DGNKRTSRMIESVVMMNADIIPVYSAKDADILNYRKGLIRFYETGDYTKYSDYFLNRQLE RIKEIDI >gi|226332053|gb|ACIB01000003.1| GENE 8 10803 - 11708 462 301 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253563305|ref|ZP_04840762.1| ## NR: gi|253563305|ref|ZP_04840762.1| predicted protein [Bacteroides sp. 3_2_5] # 1 301 1 301 301 557 100.0 1e-157 MDITRNTRNAKNELMTYFRSESENLKSILNQKLDHKGKAKILNSKLVSCKEELILATKKK AAEENWTKIELLECILMITYCNYVVMLETRNSVWAYEYMAFSRRIGELWEPFCKLAFEYP INDLELFTPPSFSDIKNRVTSEFTNKINELDILDNKKESLINSYLAVWEMVTSGEIQMNL DLHFKIGIEKYVVDFKSGFGSNEKGNTNRLLLVAQIYHDLNDNYNPLLFVRSMENNNYFN TLKNSGIWNAYSGADTYVEIYKYSGYNIKEWIDNNIDWENDLSQDFVAHLRINNLLQYLT W >gi|226332053|gb|ACIB01000003.1| GENE 9 11702 - 13204 312 500 aa, chain + ## HITS:1 COG:HI0513_1 KEGG:ns NR:ns ## COG: HI0513_1 COG0827 # Protein_GI_number: 16272457 # Func_class: L Replication, recombination and repair # Function: Adenine-specific DNA methylase # Organism: Haemophilus influenzae # 1 190 1 178 276 72 31.0 2e-12 MVENQRKYQAYYTKSNSIVTYMTKLLNLNGSERILEPCAGDGVFIDSIINLFPNISIDAL ELSIDSVKMLLQKYRKSKSIQIRQTDYLKDTTLDDYIVKGGVYDAIIANPPYGAWREIEE RKQLKDKFNGLYAKESYTLFLIHSINLLKEGGKLSFIIPDTWLNVHMHKQVRKYILTNTM ITEISLFPSSFFPNVNFGYANLMIISLKKNSIIEENLNNKFNIYSNFKSVDELLKNNLSN ISKVSLSQNQIFTNRDYSFVINPNEHVLNCINNNNVCIGDICDCATGFYSGDDKSFLKVL NRDIKNSKHYELVDCNKVQYNCSYRDLMGLANGKYYVPIVKGGNTKYWKSDMWFMSWTKE 
NVNHYITNKKSRYQNSSFYFKQGIGIPMVSSSSITAALINNRLFDQSIVGVFPKNENFLY YLLAFFNSPTCNILIRTINSSTNNSSNYIKKIPFIAPSIEDLHEVTKNTKEIITRITTEN LDYHNLEDKNNNIFYKIYGF >gi|226332053|gb|ACIB01000003.1| GENE 10 13286 - 14470 313 394 aa, chain + ## HITS:1 COG:mll9329 KEGG:ns NR:ns ## COG: mll9329 COG0582 # Protein_GI_number: 13488150 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Mesorhizobium loti # 110 392 16 316 338 78 23.0 2e-14 MAEYSICKCIRTDKPVKRNGKYPIYLRVRVGFKDTKFPTNLDVWKEHWDIKKNEPKNKAL LIQLNKKVLELDLYINRLLADGQELTLDLVRDYYSGKKKIKPENSSFYDYYLEFVERKRK EGLNPETIRVYMTTFNVLKEFKAEFRISDISLSFIEQFDDHMREVNGNSPGGRNPKHKNF RTVILDIQKHNIPVDNPYRWFKIPSSEVKEVYLDKSELHTIMEYTERFDKSSKEYKILKM YQFSCFCGLRFSDTMDLRWKDIDFANCLIRRMMIKTKTEVITPLFPMARDILMERSNNGK LIGSDEKVFYNFAEPTVNQALRKEAKLAGIDKHITYHSSRHTFATLLVIDNVDIYKISKY LGHKSVNMTQRYLKYDLSIAKESAKDISTFSGEN >gi|226332053|gb|ACIB01000003.1| GENE 11 14507 - 15781 513 424 aa, chain + ## HITS:1 COG:no KEGG:BDI_0498 NR:ns ## KEGG: BDI_0498 # Name: not_defined # Def: integrase # Organism: P.distasonis # Pathway: not_defined # 32 389 30 371 392 99 26.0 3e-19 MAIKIQEPTLYLRVDKKKKDGKMPICIRFQRIDKKEPKFSLGINCSPEQWDEVGQRILNS DGLDIILQEEVNRIKKEVRNAEVNGVEITKELLKEIVSGKQKKYNQSENQSFYYYFEEYI SKKREIGKIQESTWNTYQTTLRSLKEFRQEIRIIDISKKLLDDFDKYLIKRGTENHKGTV EGSRRNRIKHIRAVIRYIEDLRIPIKNPYKTLDLAIPNDKENNIFLEIDELRRMFCLINK VKEKSIEYRVLLMYLFSCVTGIRIGDALAVKWGDLDVERSPIILSFVAIKTKKCVSVPIF PLAEEVLMYAPEGNIDNVERNKKIFHTYPQELINKTLRELAKKANIDKHITFHSSRRTFA TLAIIQGIPLNDLKNYMGHSSTKTTERYTKWSSSLAEYSAQKVDLFQVKELLTSKKRTPR KNKP >gi|226332053|gb|ACIB01000003.1| GENE 12 16081 - 16452 107 123 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253563309|ref|ZP_04840766.1| ## NR: gi|253563309|ref|ZP_04840766.1| predicted protein [Bacteroides sp. 
3_2_5] # 29 123 1 95 95 191 100.0 2e-47 MDVISFVAIILLIALLPHFIIGWAASSKMRSFGGWTFLSFIICNLSGFLEYVFGTWGIFT LIIFIALLIMALQPSDAYRRKEIFEEEKLRANMREEQERLKEKDNAPLIHNSTGKTINDL YRK >gi|226332053|gb|ACIB01000003.1| GENE 13 16647 - 17123 409 158 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253563310|ref|ZP_04840767.1| ## NR: gi|253563310|ref|ZP_04840767.1| predicted protein [Bacteroides sp. 3_2_5] # 1 158 67 224 224 222 100.0 7e-57 MIVKFNNVVDKITEGANKNDKKKNANNVTIYLQYVDDFNSYLSANITNISNSLFATIKTN TKGLASEIDKYLEKSNSNNIEKTDSEIVVDLNDYFNSSTGWDDETDWNCTGMEDEESQKA FDKVLKKNGIYYSSKKGTWVKKKSKAKSIEEQTVKVEF >gi|226332053|gb|ACIB01000003.1| GENE 14 17188 - 18222 421 344 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253563311|ref|ZP_04840768.1| ## NR: gi|253563311|ref|ZP_04840768.1| predicted protein [Bacteroides sp. 3_2_5] # 1 344 9 352 352 629 99.0 1e-179 MKDFSGNFDNCRETSEEVFEHPDKKEYGNYRLANKTKFDKHSLKVAHAKGTNTVNIVGSL RKRAYHKATFNDLTKDKFEQTIRNLAKDFNISYEEIRKAEFTQCEIGANINMSYPAKEIL HMVVAYSSLKRDDKCIDQGTLYFKTKGKQKNENKRVKLYAKDIEIAKNSHWKKRERRDEA FARMRKCGNNMLRIEFTLNFHSSFQSHQMGHIETIGDLIDNYYDLYEFWTREVGKFVFYN QLDYSNAKPTTKDKYIIAGLEQLGFECFVEEYRDLRSQDKKEPKDIKSARSKAYTSILKV LDKYYDRHSYNINDFKKDIEKFLIRKSRSESFNLPLLRGNLWNN >gi|226332053|gb|ACIB01000003.1| GENE 15 18998 - 20200 390 400 aa, chain + ## HITS:1 COG:no KEGG:Ppha_2135 NR:ns ## KEGG: Ppha_2135 # Name: not_defined # Def: RES domain protein # Organism: P.phaeoclathratiforme # Pathway: not_defined # 1 386 1 374 413 205 34.0 3e-51 MGRIKQDWIESQERGYSLPELKEKYVCANHFDDSYLKQYINKNSISGTCSYCGKKKKVID LNHLMGYIVDKIMSYYGNPSDEGLYLASSFFDDDKERIPGFQRVGCYVTPSFAHNYESTK ELFYDINLTTDSEVLDNDMESCFINDEWIQHTPYMMSESQELSFMWKTFQRMVKHEQRFT FFKRPEFTGEEVSNDNGLMDILTELGAKITAHNLYSNISAGTELYRCRFINEGETVNKFD EITSSPDNKAKQSRMSPAGISMFYGAFDKKTAIVESSPDGEGTGKYVIGKFNLKKNLIVL DLTKLPKPSFWIGKDWEGIEFLHSFNREITKRIERDDRIHIDYIPSQVFTEYLRYIHRLP NGKKIDGIIYKSSLKSTDNNIVLFYNQRSSSDVLELVEIT >gi|226332053|gb|ACIB01000003.1| GENE 16 20402 - 20614 154 70 aa, chain - ## HITS:1 COG:no KEGG:BF1354 NR:ns ## KEGG: BF1354 # Name: not_defined # 
Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 56 1 56 70 106 100.0 3e-22 MNWQEWVVGLLIVLCVARILYGIYLFFRRVKENDNPCASCASGCELKDMMEKKQKECSSK KKNTKKNCCR >gi|226332053|gb|ACIB01000003.1| GENE 17 20611 - 23094 2383 827 aa, chain - ## HITS:1 COG:MA3477 KEGG:ns NR:ns ## COG: MA3477 COG0370 # Protein_GI_number: 20092288 # Func_class: P Inorganic ion transport and metabolism # Function: Fe2+ transport system protein B # Organism: Methanosarcina acetivorans str.C2A # 112 827 11 669 670 550 40.0 1e-156 MRLSELKTGEKGVIVKVLGHGGFRKRIVEMGFIKGKTVEVLLNAPLKDPIKYKIMGYEIS LRRQEADMIEIISEQEAKSQTQEAAYHQGLTEDIDVKEEDLKRVALGKRRTINVALVGNP NCGKTSLFNLASGAHEHVGNYSGVTVDAKEGYFDFQGYHFKIVDLPGTYSLSAYTPEEIY VRRHIIDETPDVIINVVDSSNLERNLYLTTQLIDMNVRMVVALNMYDELEASGNTLDYLL LGKLFGVPMVPTVCKRNIGVDRLFHVVINLYEGADFIDKKGHIHPEVAKEIMDWHQSLPN FKDHGEHPADYTHGKEPVGKVFRHIHINHGPDLEKAIDAVKEEISKNEFIRHKYSTRYLA IKLLENDPEIERFIHTLPNAVEVEKKRDKAARRIQETMNEDSESALTDAKYGFISGALKE TFTDNHLEQEQTTKVLDSIVTHRIWGYPIFFLFMYLMFEGTFVIGEYPMMGIEWLVEALG NLIRDNMSEGPLKDLMIDGIIGGVGGVIVFLPNILILYFCISLMEDSGYMARAAFIMDKI MHKMGLHGKSFIPLIMGFGCNVPAIMASRTIENRKSRLITMLVNPLMSCSARLPIYLLLV GAFFPKNGSLVLLAIYAIGIALAVIMARLFSRFLVKGDDTPFVMELPPYRMPTMKSIFRH TWEKGAQYLKKMGGIIMIASIIIWFLGYYPDHDAYPTQAEQQENSYIGQIGQAVEPVLKP LGFDWKLSIGLLSGVGAKELVVSTLGVLYTNDADADVVSLAERIPITPLAAFSYMLFVLI YFPCIATLVAIKQESGSWKWAIFTAGYTTALAWLVSFAVYQIGGMFL >gi|226332053|gb|ACIB01000003.1| GENE 18 23219 - 24526 776 435 aa, chain + ## HITS:1 COG:CAC3204 KEGG:ns NR:ns ## COG: CAC3204 COG0037 # Protein_GI_number: 15896451 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Predicted ATPase of the PP-loop superfamily implicated in cell cycle control # Organism: Clostridium acetobutylicum # 4 428 3 460 461 204 31.0 3e-52 MIQNRVAQYIEKEKLFCLNDKVLVTLSGGADSVALLRLLLSMGYTCEAAHCNFHLRDKES DRDEAFVRRLCHESGVLLHIEHFDTTQYAAKKHISIEMAARELRYEWFETLRRQREASVI ATAHHKDDSVETVLLNLIRGTGINGLLGIRPRNGNIVRPLLCLSREEIIAYLQYIDQDYV 
TDSTNLLDEYTRNKIRLNLLPLMKEINPSVKESIIRTTNYLNDAATLYNQSIGEARKRIL TPEGIRIEALLQEPVPEAILFEVLHPLGFNTTQIDNIRQTLDGQPGKTFLGKGWRVIKDR DLLLIEEDTTAEKSQPPFRLVMEEYDYTSEFIIPKDKNTACFDADKINKTWEIRKWKPGD VFIPFGMTGKKHVSDYLTDKKFSLSEKEKQWILCFGEQIAWLIGERTDNRFKVNENTKRV IIVRIVSEHSDFIEE >gi|226332053|gb|ACIB01000003.1| GENE 19 24785 - 30379 3948 1864 aa, chain + ## HITS:1 COG:FN0579 KEGG:ns NR:ns ## COG: FN0579 COG2373 # Protein_GI_number: 19703914 # Func_class: R General function prediction only # Function: Large extracellular alpha-helical protein # Organism: Fusobacterium nucleatum # 256 1858 54 1609 1611 362 24.0 3e-99 MKGKKNLLILFSCVLTVIIAGLFACTRSAKEIIPSSEYAPYVNAYTGGVISQSSPIRIEL TQDQPMVDLNNELKDCPFSFTPSLKGKAYWVSNNTIEFLPEEGELKPGKLYQGSFQLGDF VEVDSKLKVFDFSFRVQESNFTLHTAPLEIASSSPDKVTVKGEIRFSDKITKEQVEKILS TNGTSTITIGSTSNPLEYSFIISNIQRKEQDYDLKITVDGEPVGMDRKQSESITIPAKDS FRFLSAERISQPENGIQIVFSDPVSDTQDLKGLIEIPEIPSYIFQITDNKVNVYFEAGHL SKLTLKIHEGVKNNQGKALGGSHSISFSELNLKPQVEISSAGAIIPDSKNLVIPFRAVSL YAVDLRVIRIFENNVLMFMQNNSLSSANELRRSGRLVYKKTLFLGKDPSKDLHKWENYSI DLAGLIHQEPGAIYRVILSFKQEYSAYPCGSGENPKMQFSEETESLTKVKSDILSEEDEA VWDKPETYYYFSGNENADWSQYRWDERDNPCHPSYYMTSDRIAACNVLASNIGMIVKRNS MNKLWIAVSNILDTKPVEKAKVIVYNFQLQPIGEALTDAEGFAIIETKGTPFIVVAESEK QKAYVKVADGEEQSTSRFDVGGKDIQKGLKGFVYGERGVWRPGDTLHISFVLEDRNKRIP DKHPVTLELYNPRGQFYSKQISTNGLNGFYTFKVPTQPEDPTGLWNAYVKVGGTAFHKSL RVETIKPNRLKINLKLPGEILKAADKEVQAHLSSAWLTGATASRLKAKVEMSLSKVNTQF KNYETYIFNNPATEFTSIRTDIFNGTLDENGNTGFMLKLPTSENAPGMLRANITTQVFEP GGDASIQTLSVPYSPFPSYVGINLNQPQGRYIETDKEHIFDIVTVNSDGKPVNRSGLEYK IYRISWSWWWENDNESFETYINSSSITPVATGRLNTTGGKGQFCFKVNYPDWGRYLVYVK DRESGHATGGTIYVDWPDWRGRSNKSDPSGIKMLSFSIDKDSYEPGETVTAILPASAGGR ALVTLENGSSVIQREWIEVSGKEDTKYQFKVTPEMAPNVYLHISLLQPHAQTVNDLPIRM YGIMPVFVTDKQTVLEPQIKMPEVLRPEQAFNLTINEKHGRPMTYTLAIVDDGLLDLTNF KTPDPWNNFYAREALGIRTWDMYDDVLGATAGSYGSMFSTGGDETLKPSDQKANRFKPVV KFIGPFSIGKGKSRTHQITLPMYVGSVRAMIVAGQDGAYGNAEKTVPVRTPLMILSTLPR VLSTNEEIEVPVNVFALENNVKNVNVSIQASGAGTQVTGNKQQTLTFTQTGDRLIFFKLK 
TGTKTGKATIHLTANGNGQNTKETIEIEVRNPNPVVTLRESQWVETGKSEELNYQLSSGS EGNSIQLEVSRIPSVDISRRFDFLYNYQHNCTEQLTSKALPLLFISQFKTVDNEESEKIK VNVQEAIRQLYGRQLPNGGFVYWPGNASADEWITSYAGMFLVLAQEKGYAVNSNVLNKWK RFQRAAAQNWQPVAQEQSWWYWQADFQQAFRLYTLAMAGAPEHGAMNRMKELPTLSQQAK WRLAAAYALNGKEKAAGELVFSAKTTVEPYSSNNYVYGSSDRDEAMILETLLLMNRQREA MEQAKIVSHNLTRETWFSTQSTAFSLMAMGRLAEKLSGTLDFSWTLNGKQQPAVKSAKAV YETLISTSSREGKVILKNNGKGALNADLITRTQLLNDTLPPIANNLRISVKYVDNNGSPI DTHSLHQGTNFMAVVTVANTSGTTDYTNLALTHIIPAGWEIFNERMAGQITTSAPYSYQD IRDDRILTYFNLQQGQAKTFTVRLQATYSGDFVMPAIQCEAMYDANVQARTQAGRTKVVR QSIE >gi|226332053|gb|ACIB01000003.1| GENE 20 30388 - 32727 1026 779 aa, chain + ## HITS:1 COG:FN0580 KEGG:ns NR:ns ## COG: FN0580 COG4953 # Protein_GI_number: 19703915 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane carboxypeptidase/penicillin-binding protein PbpC # Organism: Fusobacterium nucleatum # 34 769 1 712 724 384 32.0 1e-106 MNQILKPHNLPPSFQGRGWKISIKSWWKPALFLLLVLYIFCLPSQLFTSPYSTVVTDRNG ELLGARIATDGQWRFPPRDNIPEKVATCLIEFEDRQFYHHWGVNPLAIGRAVVQNLKHKR IVSGGSTLTMQTIRLARNKPRTFKEKLIEMVWATRLEFRKSKKEILSLYISHAPFGGNVV GLDAAAWRYFGHSAEELSWAESAMLAVLPNSPAMIHLSKSRQALLDKRNRLLTHLHKKGI LDTSTYELAISEPLPQEPLPLPHIAPHLTDYFYQTRNGKYSVSTIDRGIQTQIESLVERW NSEFKRSDIRNLAILVIDIRTNQAIAYCGNVHFDKEQSGNQVDVIRSPRSTGSILKPFLY YAMLQEGEILPNTLLPDIPVNINGFAPQNFNLQFEGAVPASEAIARSLNIPSVTMLQRYG VPKFHSFLKQIGLTTLNRPSSHYGLSLILGGAEATLWDITSAYANMGRSLNRLPQFPCTL LLSDSISVHRPSFQSGAVWQTFDAIKEVNRPEEIDWRTIPSMQTIAWKTGTSYGFRDAWA VGVTPKYAVGVWVGNATGEGKPGLVGARTAGPVLFDVFNLLPSSPWFERPQGELVEAEIC RQSGHLKGRFCEETDTLLILPAGLKTEACPYHHPVTLSANERFRIYENCANSEPVVRRNW FTLPPVWEWYYKQHHPEYRPLPPFKSGCGEDRFQPMQFIYPPMGARIHLPKQMDGSKGQL TVELVHSHPNTTIYWHLDETYLTQTQDFHKLSLRPSPGKHSLTAVDDEGNTISTTFFVE >gi|226332053|gb|ACIB01000003.1| GENE 21 32752 - 33957 705 401 aa, chain + ## HITS:1 COG:CAC3482 KEGG:ns NR:ns ## COG: CAC3482 COG0477 # Protein_GI_number: 15896719 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and 
metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Clostridium acetobutylicum # 15 386 19 390 394 249 37.0 9e-66 MKQILKENGGLPASILWTLAIVAGISVANLYYNQPLLNMIRHELGVSEFETNLIAMVTQI GYALGLLFIVPLGDLYQRKRIILTSFSILIVSLLVIAMAPNIHVILCASLLTGICSVMPQ IFIPIASQFSRPENKGRNVGIVVSGLLTGILASRVVSGFIGELFGWREMYYIAAGMMLIC GMVVMRVLPDIRPNFQGKYSDLMKSLLSLLKQYPELRIFSVRAALAFGSFLAMWSCLAFK MGSAPFYADSHIIGMLGLCGIAGALSASLVGKYVRKVGVRRFNFIGCGLILSAWLLLFIG ENSYWGIVAGIIIIDIGMQCIQLSNQTRIFELCPSASNRINTIFMTTYFIGASTGTFLAG TFWQAFGWHGVIGTGVALTTGSLLIYFFFKTIIRIVTKVVI >gi|226332053|gb|ACIB01000003.1| GENE 22 33961 - 34389 246 142 aa, chain + ## HITS:1 COG:no KEGG:BF1412 NR:ns ## KEGG: BF1412 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 243 99.0 1e-63 MKYGVNRQVLLITAGIVWIVAGANILRIGIVTWLNDSQYWLFKVGEATVVFLLFFVLVFR KLYYKHTRRIEQKKKEKNCPFSFFDAKGWIVMCFMIAMGITIRTLHLLPDTFISVFYTGL SLALMFTGTLFIRYWWRKRPHT >gi|226332053|gb|ACIB01000003.1| GENE 23 34718 - 35596 950 292 aa, chain + ## HITS:1 COG:TM0497 KEGG:ns NR:ns ## COG: TM0497 COG1814 # Protein_GI_number: 15643263 # Func_class: S Function unknown # Function: Uncharacterized membrane protein # Organism: Thermotoga maritima # 11 291 2 283 284 234 47.0 1e-61 MKMDLKKEVKEEFIRFQRNEKTESIVYERLASIEKDESNRKVLRLISAEEKAHYATLKKY TETDVAPDKLRIAKYYWLARILGITFAIKLMESSEENAHHDYAKYTDYPDLRQLANEEEV HEQKLIGLINEERLEYMGSVVLGLNDALVEFTGALAGFTLALSDSRLIALTGSITGIAAA LSMASSEYLSTKSEGGETKHPIKAAIYTGIAYIITVVALVAPFILIENVLIALGVMLAMA LVIIALFNYYYSVARGESFRKRFTEMAVLSFSVAGISFLIGYALKTFTGIDA >gi|226332053|gb|ACIB01000003.1| GENE 24 35621 - 37051 968 476 aa, chain + ## HITS:1 COG:FN1422 KEGG:ns NR:ns ## COG: FN1422 COG1757 # Protein_GI_number: 19704754 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 49 476 31 446 473 281 39.0 3e-75 MKKAPSPLVSLIPLVVLVIMLFATIRTFGSDALSGGSQVSLLTTTAVCILIGMGFYKIGW 
KDFELAITNNITGVSTALIILLIIGALSGAWMISGVVPTLIYYGVQIIHPSFFLTSTCII CALVSVMTGSSWTTIATIGIALMGIGKAQGFEEGWIAGAIISGAYFGDKISPLSDTTILA ASVTDTPLFRHIRYMLITTVPSLIITLVIFTVAGLSHNAGSTEHIAEFSAALAGKFHITP WLLIVPVVTGVLIARRVPSVITLFLSAALAGAFAVFFQPDLLQEISGLQNSEGTQSIFKG LMMTLYGGTSLQTSNEALTELMATRGMAGMMNTVWLIICAMCFGGAMTAGGMLGSITSVF VRFMKNTVSMVASTVCSGIFLNLATADQYISIILTGNMFRDIYEKKGYESCLLSRTTEDS VTVTSVLIPWNTCGMTQATILSVPTLVYLPYCFFNIISPLMSITIAAIGYKIVRRK >gi|226332053|gb|ACIB01000003.1| GENE 25 37120 - 37710 695 196 aa, chain + ## HITS:1 COG:no KEGG:BF1409 NR:ns ## KEGG: BF1409 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 196 1 196 196 346 100.0 2e-94 MKKFIALVALVLVSASTLMYAQESQRDIRRADRKAQRDAERARLKAEEQAADQVAYQQAV QAIKDKQFVLEADQVIFKRGQTAFVSSNTNFVMLNGQRATVQVAFNTPYPGPNGIGGVTV DGTTSDVKVTTDKRGNVNCNFSVQGIGISAQVFITLTNGGNNATVTINPNFNSNTLTLSG NLVPLNQSDVFKGRSW >gi|226332053|gb|ACIB01000003.1| GENE 26 37816 - 38493 480 225 aa, chain + ## HITS:1 COG:mll4697 KEGG:ns NR:ns ## COG: mll4697 COG2197 # Protein_GI_number: 13473938 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain # Organism: Mesorhizobium loti # 5 214 9 209 215 88 28.0 8e-18 MREYIIADNQDITKAGMMFLLSKQKEVSLLLEADNKMELVQLLRIHPQAVVILDYTLFDF SGADELIILQERFKESDWLLFSDELSIGFLRQVLFSSNAFGVVLKDNSKEEIMSALQCAS RKERFICNHVSNLLLSGSGNVNTASAVHTFVPQEDRLLTPTEKIILKEIASGKTTKEIAA EKHLSFHTINSHRKNIFRKLGVNNVHEATKYAMRAGIVDLAEYYI >gi|226332053|gb|ACIB01000003.1| GENE 27 38618 - 39679 864 353 aa, chain - ## HITS:1 COG:BB0682 KEGG:ns NR:ns ## COG: BB0682 COG0482 # Protein_GI_number: 15595027 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain # Organism: Borrelia burgdorferi # 1 352 1 350 355 303 44.0 4e-82 MDIAALLSGGVDSSVVVHLLCEQGYKPTLFYIKIGMDGAEYMDCSAEEDIELSTAIARRY 
GLALEVVDLHREYWDNVAAYAIEKIRKGQTPNPDVMCNKLIKFGCFEQQVGKDFDLTATG HYATTLQLGGKTWLGTAKDPIKDQTDFLAQIDYLQVSKLLFPIGGLMKHEVREIALQAGL PSARRKDSQGICFLGKINYNDFVRRFLGEKEGAVIEFETGKKIGTHRGYWFHTIGQRKGL GLGGGPWFVVKKDIQDNIIYVSHGYDAEQQYGYEFRMKDFNFITDNPWEGSTGEEEVTFK IRHTPEFIKGRLLHDEEGYRIISSEKLQGIAPGQFGVIYDAESRVCFGSGEIG >gi|226332053|gb|ACIB01000003.1| GENE 28 39684 - 40970 1175 428 aa, chain - ## HITS:1 COG:L75975 KEGG:ns NR:ns ## COG: L75975 COG2873 # Protein_GI_number: 15672055 # Func_class: E Amino acid transport and metabolism # Function: O-acetylhomoserine sulfhydrylase # Organism: Lactococcus lactis # 1 428 1 426 426 523 61.0 1e-148 METKKLHFETLQLHVGQETPDPATDARAVPIYQTTSYVFRDSAHAAARFGLQDPGNIYGR LTNSTQGVLEERIAALEGGVGGLAVASGAAAVTYAIENITRSGDHIVAAKTIYGGTYNLL AHTLPAYGVTTTFVDPSDLFNFERAIRENTKAIFIETLGNPNSNIIDMDAVAAIAHKYRI PLIVDNTFGTPYLIRPIEHGADIVVHSATKFIGGHGSSLGGVIVDSGKFDWVASGKFPQL TEPDASYHGVRFVDAAGAAAYIVRIRAVLLRDTGAAISPFNAFILLQGLETLSLRVERHV ANALKVIDFLVNHPKVAAVNHPSLPGHPDHAIYQRYFPGGAGSIFTFEVKGGTEEAQKFI DSLQIFSLLANVADVKSLVIHPATTTHSQLNAQELEEQGIKPGTVRLSIGTEHIEDIIDD LRQALEKI >gi|226332053|gb|ACIB01000003.1| GENE 29 41140 - 41964 661 274 aa, chain + ## HITS:1 COG:no KEGG:BF1404 NR:ns ## KEGG: BF1404 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 274 1 274 274 494 100.0 1e-138 MNKLLYCFFLSFSVIATSACTDDKEEREFPPAPDFSLMILKENLHSNGGDVLRTDTYVYD NNKLTTHTTLQEFYGQSLTHETTLSYSGNEVTLADENGNTAIYILGSAGYATECTYKLSD QVRKYTFTYSGEYLTRIDEEINSTPYSSVELAYDDNGNLSHIIANGLQTNYQAGNTENLY QLPCLQVCETYPLSFHNDAIYAGLLGRQSKHLIIGNTPKENKEEYTKYTYELDENEKPTG IIAKTTSTGTVIDINGNAYDETKTDTRTIGITIE >gi|226332053|gb|ACIB01000003.1| GENE 30 42181 - 42999 726 272 aa, chain - ## HITS:1 COG:PM1451 KEGG:ns NR:ns ## COG: PM1451 COG0627 # Protein_GI_number: 15603316 # Func_class: R General function prediction only # Function: Predicted esterase # Organism: Pasteurella multocida # 28 270 30 264 269 179 40.0 4e-45 MKRRNLFLACLLLFVLPLSAARVDTLMVKSPSMNKEVQVLVVTPDVALGKNAAACPVLYL 
LHGYGGHAKTWIQIKPNLPEIADEKGIIFVCPDGKDSWYWDSPKNPAYRYETFVSSELVN YIDRNYKTIADRKGRAITGLSMGGHGAMWLGIRHKDVFGAAGSTSGGVDIRPFPKNWSMN KQLGELASNKRIWDEHTVVNQLDKIQNGDLALIIDCGEDDFFLNVNKDFHDRLLGRKIDH DFITRPGEHNGKYWNNSIDYQILFFSKFFAGE >gi|226332053|gb|ACIB01000003.1| GENE 31 43031 - 43468 255 145 aa, chain - ## HITS:1 COG:no KEGG:BF1402 NR:ns ## KEGG: BF1402 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 145 1 145 145 303 99.0 1e-81 MKTIKLITCDDAFQAHIIQGALANEGIDSLLHNENMSTLLRGFVHDISGVDVLVADCDYE AAMQLLRQNQMIPEEQKFCPFCGSDRIKFVLKKEHRVRAVSAAIVSMLATVPPGGNHWEY ICDHCGKAFEKPVTEFNPSALEEKD >gi|226332053|gb|ACIB01000003.1| GENE 32 43575 - 45947 1747 790 aa, chain + ## HITS:1 COG:no KEGG:BF1336 NR:ns ## KEGG: BF1336 # Name: not_defined # Def: putative TonB dependent outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 790 13 802 802 1543 99.0 0 MKKNVLFLLLLGLLTTVSAQPTHRIKGTVIDKASRQPLEFINVLVLGLGRGGVTDAEGHF NIGEVPPGIYRLQASAVGYKTILTPEYIVSTKDLTIQIETEENLTELEGVTVTASPFRRD PESPVGLRIIGLQEIEKSPGANRDISRIVQSYPGVAFSPAGYRNDLIVRGGSPSENRFYL DGVEIPNINHFSTQGASGGPVGIINADLIREVNFYTGAFPTDRGNAMSSVLDFKLRDGDM ERNSLKATLGASEVSLASNGHIGKKTSYLVSVRQSYLQFLFDMLGLPFLPTFTDAQFKLK TRFNANNELTILGLGGIDNMKLNTKLDGEKAEYILSYLPKIQQETFTLGAVYRHYAGIHV QSVVVSHSYLNNRNTKYLNNDESSTDNLSLKLRSVEQETKFRIENTSTFGNWKINFGANL DYSQYTNTTFQRVYIDEGRTFDYHTYLGMWRWGIFGTINYATTNERFTASLGVRTDANNF SSGMKGMGDQLSPRLSLSYRLTDGLYLSGNAGLYYQLPPYTGLGFKDNNGAWVNKYLRYM SVSQESLGLSWHPGNTFELSAEGFYKQYDKIPFSIADGIPLACKGNDYGVIGNEALSSTA QGRAYGIEILMKWLIAKKLNLASSFTLFKSEYRNNKQSEYIASAWDNRYIFNMSGTYNFP HNWSLGMKISCIGGAPYTPYDVEKSSLVTAWNAQGRPYYDYTKYNTGRLPAFGQLDVRVD KTFYLKRCMLGFYIDLQNVTNSKFKQPDILMSTGVIENPSAPMAEQRYKMKYITQKSGTL MPTLGITFEY >gi|226332053|gb|ACIB01000003.1| GENE 33 45989 - 46789 934 266 aa, chain + ## HITS:1 COG:yggG KEGG:ns NR:ns ## COG: yggG COG0501 # Protein_GI_number: 16130837 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Zn-dependent protease with chaperone 
function # Organism: Escherichia coli K12 # 2 258 27 287 294 160 34.0 3e-39 MKKRIVFIAALLCGMCFTTASAQFKIGGKKINVGKVVQAGSDAAKAITLSDSDIAAMSKE YMQWMDTHNPLTAADSEYGKRLEKLTGHIKEVDGLKVNFGVYEVVDVNAFACGDGSVRIC AGLMDIMTDDEVMAVVGHEIGHVIHTDSKDAMKSAYLRSAVKNAAGAASSTVSKLTDSEL GAMAEALAGAQYSQKQESEADDYGFEFSIKHNIDPYAMYNALNKLLELSAEAPKESKFQK MFSSHPDTAKRVARAKEKADNYTKNK >gi|226332053|gb|ACIB01000003.1| GENE 34 46946 - 48235 1421 429 aa, chain + ## HITS:1 COG:sll0260 KEGG:ns NR:ns ## COG: sll0260 COG1253 # Protein_GI_number: 16331101 # Func_class: R General function prediction only # Function: Hemolysins and related proteins containing CBS domains # Organism: Synechocystis # 1 422 8 430 448 302 41.0 8e-82 MEFLVIILLLVLNGIFAMYEIALVSSSKARLETLVSKGNKSARGVLKQLEEPEKFLSTIQ IGITLIGIVSGAFGGVAIADDVTPLFAMIPGAEVYAKDLAMITTVIVITYLSLIIGELVP KSIALSNPERYATLLSPVMILLTKISFPFVWLLSISTRLLNKLIGLKSEERLMTQEELKM ILHQSSEQGVIDKEETEMLRDVFRFSDKRANELMTHRRDLVVLHTTDSKEKVLQIIDNEH FSKYLLIDDDTDEIAGVVSVKDIILMIGSEQEFNLREIARPALFIPESLYAKKVLELFKK NKNKFGVVVNEYGSTEGIITLHDLTESIFGDILEEDDTEEEEIVRRQDGSLLVEASMNIG DFMEEMGILSYDDIESEDFTTLGGLAMFLIGRIPKAGDIFTYKNLQFEVVDMDRGRVDKL LVIKREEEE >gi|226332053|gb|ACIB01000003.1| GENE 35 48272 - 48745 577 157 aa, chain + ## HITS:1 COG:no KEGG:BF1333 NR:ns ## KEGG: BF1333 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 157 1 157 157 256 100.0 1e-67 MKKVIMLAAVVAALVSCQSKGTKAEEAVADSLAVAMEPTMEETQVYEGVLPAADGPGIRY VLTLNTLANATDTTYTLDVTYLDAEGKGKDKTFTSKGKPVKVEKTVKDKKKTAIKLNPSD GSEPVYFVIANDTTLTLADDSLEVSESDLNYNIIRVK >gi|226332053|gb|ACIB01000003.1| GENE 36 48831 - 50420 1521 529 aa, chain - ## HITS:1 COG:STM3749 KEGG:ns NR:ns ## COG: STM3749 COG1501 # Protein_GI_number: 16767034 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-glucosidases, family 31 of glycosyl hydrolases # Organism: Salmonella typhimurium LT2 # 69 498 197 635 772 120 26.0 5e-27 MKRKATLLFLVTFVLLLGAVAQEKLKTVIEPLQNERWWGGFVALGNQMPFNDHLRMQDMS 
RNNMNNQVVPFMLSSEGRYIWAENPFCFEVKDGQLIIYSDSEKIEPVKAGTTLKEALLAA SAKHFPPSGKIPEPTFFSLPQYNTWIELMYDQNQEDIMNYAHKAVENGFPQGVFMVDDNW QRYYGNFDFKTERFPDPKGMTDELHRMGFKVMLWVAPYVSPDSPEFRELEAKGYLLKDKN GRTAIIHWWNGYSACYDTTNPEAMNYLKEQLKANQEKYGIDGFKFDGGDVAYMTGEYTFH DKNANVNTFMEKWAEIGLSFPYNELRASWKLGGQALVQRLGDKDYSWRATQLLIPDMTAA GLLGHYYTCPDMVGGGQFGAFLNVKKFDEELIVRSCQVHALMPMMQFSVAPWRILSKENV AICAKYAHLHQQMGDYILELAKHASKTGEPIVRHMEYQYPHQGFIDCKDQFMLGDKYLVA PMLTSGTSRTVMLPKGRWKDDRGKVFKGPRTMTIDVPLERLPYFEKLTK >gi|226332053|gb|ACIB01000003.1| GENE 37 50805 - 51203 233 132 aa, chain + ## HITS:1 COG:no KEGG:BF1396 NR:ns ## KEGG: BF1396 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 132 1 132 132 236 98.0 2e-61 MEQTFKARNPQLGVWLFAIGIIACIAISFITIQAIWIGTACIGPSVTIYFSQTFCKYIVK KNGDLQIVNDFFRQKRTFSHITDVTYTRHALGMQKIKIRHATGFVMIDPQSPRELIKALQ KTNPDVRVKNFI >gi|226332053|gb|ACIB01000003.1| GENE 38 51255 - 53828 1758 857 aa, chain + ## HITS:1 COG:no KEGG:BF1327 NR:ns ## KEGG: BF1327 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 857 1 857 857 1704 98.0 0 MKTPKILKVHLLLYLFCLSVSAQTTDSIVQKLSQCAYLIDNFSKYIPQEKVYMQFDNTSY YQGDNIWFKCFLVTSDLHPATDLSKTLYVELLNPGGEVIDKRTLKIENGQCHGDFTLNQI PFYSGFYEVRAYTKYMLNFGGDIIFSRLLPVFDKPKAEGDFTEKEMRKYATGNYPLERPK PTKGKKVNLKFFPEGGNLVRGIESEVAFEATDTYGNPITLTGIVINEEKQETARFNVTHD GRGIFNYTPTGEKQKAIVEFNGKKHQFNMPEALPEGYILHADNLSYTDSIEIAVQKNSNT PSDVLGLAVISGGKLYKFCLIDVEGNETIRFKIDKSKLVPGVSRIALFNSNGEVLSDRLI FTYPQEQLSVQVQADKETYAPYELVNLEFTLTGKEKIPVQTPFALSVRDGMNEVESGHNM LTDLLLMSEIKGYVNNPQYYFESRDDTHRKAIDLLLRVQGWRRYSWKQMTGIESLDLKYG PEQGIETRGQVVSFVRQLPKPNVEVSCFLKKRGENEENSSFIEAITTDSLGHFSLISDIY GKWDLILAVTVNRKKKDYRILLDRLFSPAPRKYHYADMQVSIANAEKEKELMPEVETTPL PEEDIEAFFAAYADSLEKAGNHEKNYRLKEVTIKAKKRTKEKEIYESRSKSIAYYDVHSE LDDIKDSGKFIGDDIHELMMNMNKNFSPVSGRNYLYYKSKMPLFVINYERTRHTEMDYNK YKYIRLEAIKSIYITENLSTICQYADPRLTPFDVSDLYSCAVLIETYPEGKIPTEAGKGV RKTWLDGYSQVKEFYHPNYSVLPPVPDYRRTLYWNPSVTPDKEGKAHIRFYNNSRCRKLK ISAETITTNGLIGTYGN 
>gi|226332053|gb|ACIB01000003.1| GENE 39 54010 - 54477 526 155 aa, chain - ## HITS:1 COG:no KEGG:BF1326 NR:ns ## KEGG: BF1326 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 155 15 169 169 287 99.0 8e-77 MAKELNFTLEGVQGDLKLKYGPFNQRLYQDGREIKKQGRFNPKYYVINTNGEKEEIKVVY GFDFVHVAVFRGQKIDLEERLSIREYIVGGLPVLLVFLGGLIGALFGIMGATFNYNHMRQ EKSFIKQLLVSLGVSILCYVAYFIFAIGVQLIVAR >gi|226332053|gb|ACIB01000003.1| GENE 40 55601 - 57829 2471 742 aa, chain + ## HITS:1 COG:CAC0980 KEGG:ns NR:ns ## COG: CAC0980 COG1882 # Protein_GI_number: 15894267 # Func_class: C Energy production and conversion # Function: Pyruvate-formate lyase # Organism: Clostridium acetobutylicum # 7 742 8 743 743 934 62.0 0 MELNKTFKDGLWSKEINVRDFVSNNITPYEGDASFLQGPTERTKAVWNHCLKALEEERNN NGIRALDYITVSTITSHPAGYIDKENELIVGLQTDQVLKRAIKPFGGINVVMKACRENGV EVDDRVKDIFTHYRKTHNDGVFDVYTEEIRSFRSLGFLTGLPDNYARGRIIGDYRRLALY GLDRLIEAKKEDLHNLTGPMTEARIRLREEVAEQIKALKEIKVLGEYYGLDLSRPAYTAQ EAVQWVYMAYLAAVKEQDGAAMSLGNVSSFLDIYLEYELSQGTITETFAQELIDQFVIKL RMVRHLRMQSYNDIFAGDPTWVTESIGGRFNDGRTKVTKTSFRFLQTLYNLGPSPEPNMT VLWSPELPEGFKAFCAQVSIDTSSVQYENDDLMREVRQSDDYGIACCVSYQEIGKQIQFF GARANLAKALLLAINGGRCENTGTVMVKDIPVLTGETLKFEEVMANYKKVLIQIARVYNE AMNIIHYMHDKYYYEKAQMAFVDTDPRINLAYGVAGLSIAIDSLSAIKYAHVKARRNDIG LTEGFDIEGAFPCFGNDDDRVDHLGVDLVYFFSEELKKLPVYKNARPTLSLLTITSNVMY GKKTGATPDGRAKGIAFAPGANPMHGRDKNGAIASLSSVAKLRYRDSQDGISNTFSIVPK SLGATQEDRIDNLVTMMDGYFTKGAHHLNVNVLNREMLRDAMEHPEKYPQLTIRVSGYAV NFVKLSREHQLEVISRSFHERM >gi|226332053|gb|ACIB01000003.1| GENE 41 57834 - 58559 400 241 aa, chain + ## HITS:1 COG:VC1869 KEGG:ns NR:ns ## COG: VC1869 COG1180 # Protein_GI_number: 15641871 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Pyruvate-formate lyase-activating enzyme # Organism: Vibrio cholerae # 4 239 7 245 246 219 46.0 3e-57 MINVHSYESMGTFDGPGLRLVVFLQGCNFRCLYCANPDTIEAKGGTATDPEEIVRMAVSQ KAFFGKKGGITFSGGEPTFQAKSLIPLFKRLKEAGIHICLDTNGGLWNNDVEELLELTDL 
VLLDIKEFNPEHHQSLTGRSNEQTLKTAAWLETNHKPFWLRYVLVPGYSDFEDDIRQLGE HLGTYQMIQRVEILPYHTLGVHKYEAMNKEYMLKGVKENTPEQIEKAEKLFRQYFRTVQV N >gi|226332053|gb|ACIB01000003.1| GENE 42 58792 - 60051 815 419 aa, chain - ## HITS:1 COG:no KEGG:BF1321 NR:ns ## KEGG: BF1321 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 419 2 420 420 755 98.0 0 MKEDEKWIKAFKDKLEDYSEPMPASGWERLERELMPVTEKRIYPYRRWAVAAAAVVLVVT TAVSLYFLNSPVADEIRYATTPSLAVNPDVLPEPALPDVQVAVSEPVKPVGTTSINPVSG YLAKNTDPVIVPEVSSLAEKRPEAVTEEKRAEPQQEATAAIEKKESATAQPPKRKEARRP SGKDKYQLPIGDSSAKRGGKWSMGVGIGNGGGLPTNGSENFAPRPMTNRVDLMTIMNGAV SIPADQEVIFEEGVPYLKSNTTAVVDYEHHQPVSFGLSVRKSLPKGFSVETGLTYTLLSS DIKRQGDTKMQSQKLHYIGIPVRGNWNFLEKKYFTLYVSAGGMVEKCVYGKLADDKVNVK PLQFSVAGAVGAQFNATDHVGLYVEPGVSYFFDDGSKVQTIRKERPCNFNLQAGLRFTY >gi|226332053|gb|ACIB01000003.1| GENE 43 60106 - 60654 470 182 aa, chain - ## HITS:1 COG:PM1789 KEGG:ns NR:ns ## COG: PM1789 COG1595 # Protein_GI_number: 15603654 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pasteurella multocida # 1 178 5 189 191 75 28.0 4e-14 MNELELSERCRQGDNRARKELYEQYAGRMLGVCLRYAGDRDMAQDLVHDGFLKIFDSFDK FTWRGEGSLRAWMERVMVNTALQFLRKNDVMNQTTALDEVPETYEEPDASAVEAIPQKVL MQFINELPAGYRTVFNLYTFEDKSHKEIAQMLGINEKSSASQLFRAKSVLAKKVKEWLVT NG >gi|226332053|gb|ACIB01000003.1| GENE 44 60842 - 61594 591 250 aa, chain + ## HITS:1 COG:no KEGG:BF1335 NR:ns ## KEGG: BF1335 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 250 1 250 250 493 100.0 1e-138 MKKFNLITFAVILMLLPILQSCLDDANNDEWVTCPPGGILAIGTMKIPNADTPRDFFIAL DNGDNVLPADTADIRNRKYTVAEGQRVFVGYLQMGKEKPGYENGKIFTIEDILTKEIIPL TEATADSIGDDRINVTAHALTKDYLTIEYQYLGSMNENKKHMLNLVQNEITGPIKDDGYI YLEFRHNAFNDSPNQLGSSLVSFKLDSIAEQLATAKGIKLRVNTIYDNIQYVTIDINEDK NLKIKSFHSQ >gi|226332053|gb|ACIB01000003.1| GENE 45 61699 - 62625 888 308 aa, chain - ## HITS:1 COG:STM4125 KEGG:ns NR:ns ## COG: STM4125 COG0583 # 
Protein_GI_number: 16767389 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Salmonella typhimurium LT2 # 1 294 1 293 305 195 38.0 8e-50 MTIQQLEYILAVDQFRHFAKAAEYCRVTQPTLSAMIQKLEDELGVKLFDRSMTPVCPTTI GKKVLEQARKILSEVICVKEIISEEQHSLSGTFRLAVLPTIAPYLLPRFFPQLMEKYPDL DIRVMEMKTPDIRKALLTGEADAAIIASMLDDAALTEETLFYEQFLGYVSKKEPLFKHDV IRTSDVTGERLWLLDEGHCFRDQLVRFCQMEAVKISQMAYRLGSMETFMHMVESGKGITF IPELAVMQLSEEQKELVRPFAIPRPTRQIVLVTRKDFIRTSLLQVLKEEIQAAVPKEMLT LQAVQCLV >gi|226332053|gb|ACIB01000003.1| GENE 46 62767 - 63240 656 157 aa, chain + ## HITS:1 COG:PM0817 KEGG:ns NR:ns ## COG: PM0817 COG0783 # Protein_GI_number: 15602682 # Func_class: P Inorganic ion transport and metabolism # Function: DNA-binding ferritin-like protein (oxidative damage protectant) # Organism: Pasteurella multocida # 21 154 19 152 159 149 55.0 1e-36 MKTLNYTHLEEKGANTIVLSLQQLLADFQIHYANLRGFHWNIKGHGFFVLHSKFEDLYNG AAEKVDEIAERILMLGGTPANKYSDYLKMAQIKEVDGVNKADDALNHILETYGHLIAEER KILSLASSHNDEVTVAMMSDYLKEQEKMVWMLTAYNG >gi|226332053|gb|ACIB01000003.1| GENE 47 63344 - 64342 950 332 aa, chain - ## HITS:1 COG:TM1225 KEGG:ns NR:ns ## COG: TM1225 COG2152 # Protein_GI_number: 15643981 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted glycosylase # Organism: Thermotoga maritima # 4 331 1 326 326 395 58.0 1e-110 MEEIKIAGAALPAMPWEERPAGCKDVVWRCSANPIIPRDLLPTSNSIFNSAVVPFKDGYA GVFRCDDTNRRMRLHVGFSKDAVHWDINEEPLKFQCDDAEVGTWVYGYDPRVCFIEDRYY VTWCNGYHGPTIGVAYTYDFVTFHQLENAFIPFNRNGVLFPRKINGRFAMLSRPSDNGHT PFGDIFYSESPDMEFWGRHRHVMSPAPFEDSAWQCTKIGAGPIPIETSEGWLLIYHGVLA SCNGFVYSFGSALLDIDQPWKVKFRSGPYLISPQKDYECMGDVPNVCFPCAALHDSETGR IAIYYGCADTVTGLAFGYIPEIIEFTKRTSII >gi|226332053|gb|ACIB01000003.1| GENE 48 64397 - 65665 1330 422 aa, chain - ## HITS:1 COG:RSc0154 KEGG:ns NR:ns ## COG: RSc0154 COG0477 # Protein_GI_number: 17544873 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # 
Function: Permeases of the major facilitator superfamily # Organism: Ralstonia solanacearum # 17 410 23 401 426 82 24.0 2e-15 MNKKPISPVCWVPTVYFAMGLPFVALSMVSVLMFSDMEVSNAQIAFWTSLIMLPWTLKPL WSPFLEMFKTKKYFVVATEIITGLAFALVALSLPLPDFFRYAIALMGIIALSGATHDIAG DGVYLTELTATQQAKYIGWQGAFYNLAKILANGGLVWLAGMLKDEFGVVHAWMIVMLMCA GIMILIGLYHIRILPSGGGASGEVNTMSDALNMLWEVIRSFFQKKHIAFYIVFIILYRFA EGYAIKIVPLFLKASVADGGLGLSTQDIGLVYGTFGAGAFILGSLLAGYYISAFGLRKTL FSLCCAFNIPFLVYFLLALYQPSDLWIIGMAIVSEYLGYGFGFVGLMLFMMQQVAPGKHQ MAHYAFATGIMNLGVMLPGMMSGYLSDWLGYRDFFIWVLIATIPAFIVTWLVPFTYPDGK KK >gi|226332053|gb|ACIB01000003.1| GENE 49 65915 - 66928 771 337 aa, chain - ## HITS:1 COG:no KEGG:BF1314 NR:ns ## KEGG: BF1314 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 337 1 337 337 664 99.0 0 MNRHLNFVTLICLITGFVSLASCSNDDDEMDKYQKIYIRTGVAPGAEMGAAVVVDCVSNS STADRSDFEIQLCAVQPVTRDVTAALGVDTLKVDTYNSANKTKCKLLPQDNYTIETSEVV LKTGQTVADSSFQVVLKNIEKLTNPDGYVLPVALKGVTGMDEQAVSTSMKTVYIRIYTSV LYTSYTKPGNWSAVDRSAWSVSCSNVYADDDAKYGAHLAIDGEINTTWFTWGVANAGECW WNTVLDRPVTLTGFSVTKQSAYGSGYNLRSAEIKVRKEVETEWVTYPRVLTFRNFKGADP QYAAIEPPIPNVKEFRINCLTPDNYTGFAEINLYVKQ >gi|226332053|gb|ACIB01000003.1| GENE 50 66951 - 68087 866 378 aa, chain - ## HITS:1 COG:no KEGG:BF1329 NR:ns ## KEGG: BF1329 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 378 1 378 378 748 99.0 0 MKRITIKQCLLACFVLPMILAGCNDADYQVIDNAIYLNEASENGSAKVTVDPENVTTTTL TVRVGQPVTENVTAVLTVDPSILKEYNEANETSYEVLPEQYFTYDKEIKIAAGDVSAIPS EFRVKPYSNENGELYAIPVSLTEIQGPVSTIGLSSKFIILLDKPLIQSVPFMNTTNAVKP AKDELWGVTTNEWSLEAWVQMDGFDINNQAIFNSGSSDHEVYIRFGDAMIPYNSLQVKTL GSQVNTVTLFEKNKWYHLAFVYNSSGLLSIYINGVLDVTLQTKGGPVRFDKMNMVSSGSY FRNNCQMAQVRLWKSAISQTQIQSNMYFAVKPADPNLIGYWKMDEGKGNAFVDCTGHGYD LVAGGTLVWKEHVRFDKQ >gi|226332053|gb|ACIB01000003.1| GENE 51 68105 - 69157 1148 350 aa, chain - ## HITS:1 COG:no KEGG:BF1328 NR:ns ## KEGG: BF1328 # Name: not_defined # Def: putative secreted endoglycosidase # Organism: 
B.fragilis # Pathway: not_defined # 1 350 1 350 350 712 99.0 0 MKKRYLNIVVFALFVSMLWGCSDWTKPEAEDFFEMPGNDYYENLRAYKRSEHSVAFGWFG GWTGVGASMVGSLMGLPDSVDFVSIWGNWKNLDEARMLDKKKVKEQKGTRALMCFIVANV GDQLTPEEHKENYKEYWGWKDGDQEAIDGAIRKYANAICDSIDKYGYDGFDIDYEPNYGS PGNLASYPENMLTFVKALGERIGPKSGTGRLLVIDGEPQSIHPETGPYFDYFIVQAYSNL AGNSDANLDRRLAGTIANFKGILPPEKVANMYIVTENFESYAPAGGGDYVDRYGNKMRAL AGMARWTPTIDGKQVRKGGVGTYHMEYDYPGDIEYKYLREAIRIMNPAVK >gi|226332053|gb|ACIB01000003.1| GENE 52 69185 - 70729 1380 514 aa, chain - ## HITS:1 COG:no KEGG:BF1327 NR:ns ## KEGG: BF1327 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 514 1 514 514 1047 99.0 0 MKKKILNIVLAAAVTPFLCCCTDNFEEYNTNPYEPHSLNPPMLFATMITTGINVQQNDNQ MIDQMVAGPFSGYLTMANSWGGSNFNTFNQTESWNQIPFNTPFEKFYSNYFKLETATGGK GHYWAMAKLLRVNTMLRVTDCYGPIPYSQVANGKTAVAYDSQEDVYKHMFEDLDYVIQML GEFVDEVGGLKPLEGYDPVYNGDYNKWMRFANSLKLRLAVRISNVSPELARTKAEEAVKS TRGLIDTNDNNAYVGVGAEPNPLWLVASSWGEIRINATIASYMKGYSDPRSAVYFTTSKL GGDSPYMGMRSGLEGVKPATYSGYSMPNYEQKDDMLMFCAAETAFLRAEGALRGWDMGGS ARDFYEQGVKLSFDQRKVSGADEYLANAVAVPEPFVDPVNPAKCNYTPKTKITIAWNEGA STEEKLERIITQKWIANFPLGFEGWADYRRTGYPEVFPSVSNLSNGVIDTNRQLRRLPFP LSEKQGNSANVSAAVSMLGGPDTGATDLWWAKKN >gi|226332053|gb|ACIB01000003.1| GENE 53 70742 - 74047 2795 1101 aa, chain - ## HITS:1 COG:no KEGG:BF1326 NR:ns ## KEGG: BF1326 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1101 1 1101 1101 2090 100.0 0 MKKSVKHKFQYLLFFLLSMPAFIFASAQDIRVNIDFTKASLGSVLNEIGRQTSLSIVYNT GDVNPTQPVSIKASNENITTVMNRLLRGTGLSYSIMNKHLILSTTDKNHSIQQEKVTVTG NISDAKGEPLIGVSILVKGTSNGTITDMNGHFSLPVAKGDVIEISYIGYAPQAITVTDSK PLKIVMKEDAEVLDEVVVTALGIKRAQKALSYNVQQVGGDAINTVKDANFINSLQGKVAG VTINNSAGGVGSASRVVMRGTKSITKDNNALYVIDGIPMFNVSFGKSEGSFATQSGSDGV ADLNPDDIESINMLTGPSAAALYGNAAANGVVLINTKKGSAEKTTLTVSNNTMFSDAYMM PEMQNRYGNNPGEFASWGNKTKQSYDPSRFFNTGVNVINAISFSTGTKKNQTYASASTTN ATGILPNNSYSRYNFSIRNTATFLKDRLTLDVGASYIIQNDKNITAQGQYFNPLPALYLF PRNDNFEEIRMFERYSESRGVNVQFWPYGHQGLSLQNPYWIMKRMNRKTEKKRYMINASL 
TYKLTDWLNVAGRVKVDNSDIRMTQERYASTLTTFAGANGFYSDQNRTDRNTYADMMVNI DKRIGDFSLNANIGASIKDLVYEQMGNEGDLAGIPNFFTVRNINYESNYKPKQFGYHDQS QGVFANIELGWRSMAYLTLTGRNDWESQLAFTKHSSFFYPSIGGSVVLSEMFRLPEFISY AKLRGSYSSVASSFERYLSNPGFEFNEQSHQWGSSTTLPATNLKPEDTRSWEIGLNARLW NHFSIDATYYHSNTYNQTFNITLASSSGYSSAIVQTGNIQNYGLELALGYNNTWGDFSWN SSLTYTMNRNKVKRLASGATNPITGEIIDMPELRMAVLGADGYGPRVILREGGTMGDLYV DKGLRTDGNGNIWVDSQTGKVGVQDYAEPKKIGTMNPDFNMGFSNTFSYKGINLGVVLTA RVGGLCVSNTQGILDYYGVSKATADARDAGGVWINNGFVDAKSYYQTIGGSTGGLGQYYT YSATNIRLSELNLSYTLPRKWFNNKVGITAGIVGKNLWMIYCKAPFDPEMTPSTTSNFYQ GVDYFMQPSTRNIGFNVKFQF >gi|226332053|gb|ACIB01000003.1| GENE 54 74357 - 75367 704 336 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 105 313 105 306 331 91 32.0 2e-18 MDETKLLNYLKGESDAEECLEVEAWYYASAEHKKQLDQLYYMLFVGERKVAMDGVDTENS LSVLKDRIKHKESEKKSVHRIRVSKWKRYAMPLAAFLCGLLVSVGALYWISSGKSAGYIF ATESGQRAQAVLPDGTKVWLNASTQIVYKPSFWKRERQVDLSGEAYFEVSRNKTKPFVVN SNDVRTCVLGTKFNVRARPSEEKVVTTLLKGSVKVQLPGQSEEEGILLKPGQMLSVDTRT MQPMLTEASRPGDVLLWINGKLKFEQATLQEIVQCLEKHFDVHFIISDAQLQKDRFTCTF STDDDIRQILSILALTKRFDYKCEDNNIILTPKTNR >gi|226332053|gb|ACIB01000003.1| GENE 55 75533 - 76132 457 199 aa, chain + ## HITS:1 COG:no KEGG:BF1324 NR:ns ## KEGG: BF1324 # Name: not_defined # Def: RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 199 1 199 199 367 100.0 1e-100 MAQTLTNLNINDNNAVISALREGDEEVFDHIYRYYFRGLCAFCSQYVTLSESEEIVQDTM MWLWENRKTLIPELTLKTLLFTIVKNKALNKISHFEIKRKVHQEIAEKYETEFSSPDFYL ENELFRLYEEALRKLPAEFRQAYEMNRSLQMTHKEIAEKLNVSPQTINYRIGQALKILRS ELKDYLPLIMLFLFLQGHK >gi|226332053|gb|ACIB01000003.1| GENE 56 76109 - 78124 1775 671 aa, chain - ## HITS:1 COG:no KEGG:BF1307 NR:ns ## KEGG: BF1307 # Name: not_defined # Def: putative alpha-glucosidase protein # Organism: B.fragilis_NCTC9343 # Pathway: 
not_defined # 1 671 1 671 671 1390 99.0 0 MKQILHILLIALMLMGFYSCVPKTELASPDGHIKVAFTADTDGKMMYRVTVNDTLLLDNS PLGFEAKDGIDLNRGFRVVNTVFTDKDEIWTQPWGENKTNRNHYNEMAVLLKNAANVELT LRFRAFDDGVAFRYEYEVPGVDSLLITDELTAFRFHEDGTSWSIPASFETYELLYSKQKI SEVENANTPFTFKTSGGVYGSIHEAALYDFPEMTLKKDGENTLKSELASWPDGIKARKEN RFTTAWRTIQIAPQAVGLINSSLILNLNEPCKLDTTDWIKPMKYVGVWWGMHLGVETWKM DDRHGATTANAKKYIDFAHANNIEGVLFEGWNEGWESWGGMQSFDFTKPYADFDMDEITR YAREKNVQIIGHHETGGNIPNYERQMEKAIKWYTDKGIHILKTGYAGAFPDGHSHHGQYG VNHYQKVVETAARYRMTLDAHEPIKDTGIRRTWPNMMTREGARGMEWNAWSEGNPPSHHE MLPFTRLLGGPMDYTPGTFDILFTQTKDSPKRQKWNDQDKGNSRVNTTLAKQLANWVILY SPLQMASDMIEHYEGHPAFRFFRDFDPDCDESRALAGEPGEFVAVVRKAKQNYFLGASTN EEPRVLPVSLDFLEKGKIYKAIIYADGEKADWKTNPTEYQITEQEVTADDTLNIRMAAGG GQAISFMPLQK >gi|226332053|gb|ACIB01000003.1| GENE 57 78147 - 79004 864 285 aa, chain - ## HITS:1 COG:lin0348 KEGG:ns NR:ns ## COG: lin0348 COG3568 # Protein_GI_number: 16799425 # Func_class: R General function prediction only # Function: Metal-dependent hydrolase # Organism: Listeria innocua # 29 285 4 256 257 158 37.0 1e-38 MNKICFFIGVFITLLLAGCSSNPISHVRVATFNIRYDNLGDSLNSWKYRKEKVCEFIREK HPDVLGMQEVLNHQLKDLLSGLPDYAYVGVGREDGKTQGEYAPVFYRKDKYDLLDSNTFW LSEHPDSIGKLGWDAACTRVATWAKLKDKTTGKEFLMLNTHFDHVGTEARRNSALLIIDK IKEIAGTHPAMMTGDFNVSEEWEAYKTITSNEFVLKDAWKIAGKQSGENYTFHDFGRVPV AEREKIDFIFVTPQIKVADAEIISSAITDSTYLSDHNAHLADLEF >gi|226332053|gb|ACIB01000003.1| GENE 58 79075 - 80139 674 354 aa, chain - ## HITS:1 COG:Cgl0445 KEGG:ns NR:ns ## COG: Cgl0445 COG0318 # Protein_GI_number: 19551695 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II # Organism: Corynebacterium glutamicum # 44 346 63 375 376 102 27.0 1e-21 MLEGSVLSKEDVKRLVACYTEEASGFLYELYRFLEEWFSDSPYLTVKTSGSTGTPKLLKV RKEQMMQSARLTCEFLGLRQGDSVLLCMPLQYIAGKMVVVRALVAGLNLVIRTPSGHPMA DVDIPLRFAAMVPLQVYNTLQVSAEKERLCRTDILIIGGGAIDAGLEAEIRQLPVKVYST YGMTETLSHIALRQLNGPDASMLYRPFPSVRLSLSSEHTLVIDAPLVCDTTLVTNDVAEI 
YSDGSFSILGRKDNTINTGGIKVQAEQIEEILRPWMRVPFAITSVPDARLGEAVVLLVEK GADAELPETKMKELLSKYQLPKMILSVGAIPLTETGKINRAACRQLALTYRTDR >gi|226332053|gb|ACIB01000003.1| GENE 59 80183 - 81202 646 339 aa, chain - ## HITS:1 COG:AGpA707 KEGG:ns NR:ns ## COG: AGpA707 COG4948 # Protein_GI_number: 16119707 # Func_class: M Cell wall/membrane/envelope biogenesis; R General function prediction only # Function: L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 123 236 67 179 299 82 35.0 1e-15 MYTINIIPRVLHFKQPAGTSRGSYTTRNVWYIHLSSIECPGRVGVGECAPLPKLSCDDLP DYEQVLRSACQRLEQTGELDIESLRGYPSILFGLETALRHYETQSWALWDTPFSRGEAGI PINGLIWMGSFDRMLQQIEVKMQAGYRCIKLKIGAINFEEELALLRHIRAHYSAREIELR VDANGAFSPADAMDKLNRLAELDLHSIEQPIRAGQWEEMARLAAESPLPIALDEELIGCN AIERKRELLAAIHPRYIILKPSLHGGISGGNEWIAEAEKQHIGWWITSALESNIGLNAIA QWCATFRNPLPQGLGTGLLFTDNVEMPLEIRKDCLWFCK >gi|226332053|gb|ACIB01000003.1| GENE 60 81251 - 82075 1026 274 aa, chain - ## HITS:1 COG:BS_menB KEGG:ns NR:ns ## COG: BS_menB COG0447 # Protein_GI_number: 16080132 # Func_class: H Coenzyme transport and metabolism # Function: Dihydroxynaphthoic acid synthase # Organism: Bacillus subtilis # 6 274 3 271 271 403 67.0 1e-112 METKREWTSIKEYEDILFDYYNGIARITINRERYRNAFTPTTTGEMSDALRICREEPDIN VVVLTGAGDKAFCSGGDQNVKGRGGYIGKDGVPRLSVLDVQKQIRSIPKPVIAAVNGFAI GGGHVLHVVCDLSIASENAIFGQTGPRVGSFDAGFGSSYLARVVGQKKAREIWFLCRKYN AQEALDMGLVNKVVPLDKLEDEYVQWAEEMMQLSPLALRMIKAGLNAELDGQAGIQELAG DATLLYYLTDEAQEGKNAFLEKRKPDFKQYPKFP >gi|226332053|gb|ACIB01000003.1| GENE 61 82097 - 83098 914 333 aa, chain - ## HITS:1 COG:no KEGG:BF1302 NR:ns ## KEGG: BF1302 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 333 1 333 333 672 99.0 0 MTKHKMNLFCGMLLLSASLFVEKAQAMFPERYVSEESAARADTINFRLASAGEAKRLIAA KDSYTANWSPFDIAARLENPEGKREDLVSLAVREVREWTAEEAQQIEQIRKNLNDTIRKY GYRIPFPKEIVLVKTTMKDEGGAGGYTRSNWIALTDATFQRGTEASHTRLLVHETFHILT 
RLNPGFKEKLYRAIDFNILPKEIEFPEDIRKSRISNPDVSRCDSYATFTIDGKPQNCTMI IYTNRPYTTGKFYQYINVGLIPLDESFKPLRESGKTVIYPLQKATDFFDKVGRNTGYVID PEEVLADNFAIALLNTPNVHTPELQKKVQELLK >gi|226332053|gb|ACIB01000003.1| GENE 62 83103 - 84770 1305 555 aa, chain - ## HITS:1 COG:BS_menD KEGG:ns NR:ns ## COG: BS_menD COG1165 # Protein_GI_number: 16080134 # Func_class: H Coenzyme transport and metabolism # Function: 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase # Organism: Bacillus subtilis # 19 474 21 494 580 186 31.0 9e-47 MYSDKKNILQLVALLRAHGVTKVVLCPGSRNAPIVHTLAGHPDFTCYSVTDERSAGFFAI GLALQGGTPAAVCCTSGTALLNLHPAIAEAYYQKVSLVVISADRPAAWINQMDGQTLPQP GVFRSLVKKSVDLPEIHTDEDEWYCNRLLNEALLELNHHGKGPVHINVPVSEPLFQFTAE SLPEVRVITRYQGLNVYDRDYDGLIDRLNKYNRRMMIVGQMNLIYLFEKKYSKMLYKQFA WFTEHLGNQTVPGIPIRNFDAALYAMSPEMQEKMIPELVITYGGHIVSKRMKKYLRQHPP KEHWHVSPDGEVIDLFQGALTTIIEMDPFEFMEKIAFLLDNRTPEYPRQWENFCKELPRP ELPYSEMSAIGSLIQALPASCALHLANSSAVRYAQLYSLPDTVEVCCNRGTSGIEGSLST AIGYAAASKKLNFVVIGDLSFFYDMNALWNNHFGSNLRILLLNNGGGEIFHTLPGLEMSG TSHRFVTAVHKTSAKGWAEERGFLYQEVQDEKQLDEAMKTFTQPELLTQPVIMEVFTNKN KDARILKDYYHQLKN >gi|226332053|gb|ACIB01000003.1| GENE 63 84791 - 85897 655 368 aa, chain - ## HITS:1 COG:VNG1083G KEGG:ns NR:ns ## COG: VNG1083G COG1169 # Protein_GI_number: 15790177 # Func_class: H Coenzyme transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: Isochorismate synthase # Organism: Halobacterium sp. 
NRC-1 # 90 359 175 432 441 112 30.0 1e-24 MTLEETNNCEIIDELIRQGRSFAMYRNPGEEEPHFLMQTCGEVHLIRKMEDLNGRTGFVI APFRVTPQCPIILIRPDCHEIPSCAGNLHTPAQGDDAASYGQQLRSDRSLGAKKAYEACF DVFIRALRERTFDKLVLSRRMTVRREPGFSPAAAFYRACRRYIYSYVYLCYTPQTGVWMG STPEIILSGEKGEWNTVALAGTQSLQNGELPQQWDEKNREEQEYVAAYIRKQLRSLGISP TEKGPYPAFAGALSHLKTDFHFSLNDSQRLGDLLKLLHPTPAVCGLPKEEAYRFILDNEG YDRSYYSGFIGWLRPEGRTDLYVNLRCMNVKEDSLTLYAGGGLLASSELDDEFQETEKKM QTMQNLNS >gi|226332053|gb|ACIB01000003.1| GENE 64 85894 - 87126 1208 410 aa, chain - ## HITS:1 COG:VC1364 KEGG:ns NR:ns ## COG: VC1364 COG0561 # Protein_GI_number: 15641376 # Func_class: R General function prediction only # Function: Predicted hydrolases of the HAD superfamily # Organism: Vibrio cholerae # 3 269 2 267 273 162 35.0 8e-40 MKYKLLVLDVDGTLLNDEKEITPRTLATLLKVQQMGVHIVLASGRPTYGILPLAKKLELG NYGGYILSYNGAQVINAKNGEVLLERRINPEMLPYLEKKARKNGFAIFTYTEDRMIADQA DNEHILQEAFLNRMELIEEPEFSVAVDFAPSKCMLVSDDEEALIGLEEHWKKRLNGALDV FRSEPYFLEVLPCGIDKSTSLGALLSHLDITPEEIIVIGDGVCDVSMIQFAGLGIAMGNA QDSVKVCADVVTASNEEDGVALAVEKAILSEIRPAEIPLDQLNERARHALMGNLGIQYTY ASEDRVEATMPVDERTRQPFGILHGGATLALAETVAGLGSMILCQPDEIVVGMQVSGNHM SSAHEGDTVRAVGTIIHKGRSSHVWNVDVFTSTDKLVSSIRVVNSILKKR >gi|226332053|gb|ACIB01000003.1| GENE 65 87693 - 89267 1454 524 aa, chain - ## HITS:1 COG:XF0174 KEGG:ns NR:ns ## COG: XF0174 COG4108 # Protein_GI_number: 15836779 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Peptide chain release factor RF-3 # Organism: Xylella fastidiosa 9a5c # 6 523 21 540 548 530 50.0 1e-150 MADNTEILRRRTFAIIAHPDAGKTSLTEKLLLFGGQIQVAGAVKSNKIKKTATSDWMEIE KQRGISVTTSVMEFDYRDYKINILDTPGHQDFAEDTYRTLTAVDSVIIVVDGAKGVETQT RKLMEVCRMRNTPVIIFVNKMDREGKDPFDLLDELEEELMIQVRPLSWPIEQGPRFKGVY NIYEQKLDLYQPSKQVVTEKVEIDIHSDELDRQIGDTLADKLRGDLELIEGVYPEFDVET YLQGECAPVFFGSALNNFGVQELLNCFVEIAPAPRPVHAEEREVIPEEPKFTGFIFKITA NIDPNHRSCVAFCKICSGKFVRNAPYLHVRHGKTMRFSSPTQFMAQRKTTIDEAWAGDII GLPDNGTFKIGDTLTEGEQLHFRGLPSFSPEMFKYIENADPMKQKQLAKGIDQLMDEGVA QLFVNQFNGRKIIGTVGQLQFEVIQYRLLNEYNASCRWEPLSLYKACWIESDDLEELEAF 
KKRKYQYMAKDREGRDVFLADSNYVLQMAQMDFKNIRFHFTSEF >gi|226332053|gb|ACIB01000003.1| GENE 66 89292 - 90155 892 287 aa, chain - ## HITS:1 COG:CAC2315 KEGG:ns NR:ns ## COG: CAC2315 COG1091 # Protein_GI_number: 15895582 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose reductase # Organism: Clostridium acetobutylicum # 1 284 1 280 280 243 48.0 3e-64 MNILVTGANGQLGNEMQVLARENLQHTYFFTDVQELDICDEQAVYAYVSEHKIDIIVNCA AYTAVDKAEDNVELCDKLNNIAPGYLARAAQANGAAMIQVSTDYVFDGTAHIPYTEEEPT CPASVYGSTKLAGEQNVMDHCEKAMVIRTAWLYSIYGNNFVKTMIRLGQERDSLGVIFDQ IGTPTYANDLAQAIFAAINKGVVRGIYHFSDEGVCSWYDFTVAIHRLAGIASCKVKPLHT ADYPAKAPRPHYSVLDKTKIKDTFGIEIPHWEESLKRCINQLRMETL >gi|226332053|gb|ACIB01000003.1| GENE 67 90290 - 90835 454 181 aa, chain - ## HITS:1 COG:no KEGG:BF1309 NR:ns ## KEGG: BF1309 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 181 1 181 181 332 100.0 3e-90 MKIAQQLKEKNIAEYLIYMWQIEDLIRANDCDVDRIEENIVSRYQVSDEERRELTEWYAN LASMMREEGVREKGHLQINRNVIINLTELHAALLASPKFPFYSSAYFKALPFIVELRNKN GQKEEPELETCFEALYGMMLLRLQKKPVSPETTKAMEAISSFLSMLANYYDKDRKGELKL E >gi|226332053|gb|ACIB01000003.1| GENE 68 90854 - 91507 445 217 aa, chain - ## HITS:1 COG:BS_ycgF KEGG:ns NR:ns ## COG: BS_ycgF COG1280 # Protein_GI_number: 16077378 # Func_class: E Amino acid transport and metabolism # Function: Putative threonine efflux protein # Organism: Bacillus subtilis # 8 213 1 203 209 59 28.0 4e-09 MIQIETILDILVKGFVIGIVVSAPLGPVGVLCIQRTLNKGRWYGFVTGLGASLSDIAYAL LTGYGMSFVFDYINKNIFYLQLLGSIMLLAFGIYTFRSNPVQSIRPVSANKGSYFHNFIT AFAVTLSNPLIIFLFIGLFARFAFVQPGVLVFEEITGYLAIALGALAWWFGITFFVNKVR TRFNLRGIWILNRVIGSIVMAVSVFGLIFTLLGESIY >gi|226332053|gb|ACIB01000003.1| GENE 69 91549 - 93813 1665 754 aa, chain - ## HITS:1 COG:PA5529 KEGG:ns NR:ns ## COG: PA5529 COG0475 # Protein_GI_number: 15600722 # Func_class: P Inorganic ion transport and metabolism # Function: Kef-type K+ transport systems, membrane components # Organism: Pseudomonas aeruginosa # 7 442 6 440 585 363 
45.0 1e-100 MSQLPTLIADLALILICAGIMTLLFKKLKQPLVLGYVVAGFLASPHMPYTPSVMDVANIK TWADIGVIFLLFALGLEFSFKKIVKVGGTAVIAACTIIFCMILLGIAVGMGFGWQRMDSL FLGGMIAMSSTTIIYKAFDDLGLRKKQFTGLVLSILILEDILAIVLMVMLSTMAVSNNFE GTEMLGSIGKLLFFLILWFVVGIYAIPEFLKRCRKLMSEETLLIVSLALCFGMVAIAANT GFSAAFGAFIMGSILAETIEAESIDRLVKPVKDLFGAIFFVSVGMMVDPAMIIEYAVPII VITLAVILGQAFFGTMGVMLAGQPLKTAMQCGFSLTQIGEFAFIIASLGLSLHVTSDFLY PIVVAVSVITTFLTPYMIRVAEPASNFVDRKLPESWRRFLMRYTTGTRTVNHESLWRKLL FALARILVVYSIVSISVITLSFRFVVPLLREHLPGIWGPLAGAVFTILCISPFLRAIMVK KNHSVEFITLWNDNRVNRGPLVSTVVFRVTVSVLFVMFVITRLFKASVGLMFGVALLLVI LMILSRQLKKQSILIERKFFQNLRSRDIRAEYLGEKKPAYAGRLLSRDLHLTDFEIPGES AWAGKTLLELNLGKKYGVHVVSILRGKRRINIPGGSIRLFPMDKIQVIGTDDQLNTFAEE MSHVAVIDSGVFEKSEMTLKQLLIDTDSAFLGKTLRESGIRDKYHCLIAGVERGGEALMT PDVNVPFEEGDVVWVVGENEDVYRLIGQNCDRKR >gi|226332053|gb|ACIB01000003.1| GENE 70 93889 - 94632 697 247 aa, chain - ## HITS:1 COG:no KEGG:BF1306 NR:ns ## KEGG: BF1306 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis # Pathway: not_defined # 1 247 1 247 247 386 99.0 1e-106 MDNFKVIYSIPFLFFIIVSCSNSSTEMVAKSKYDAKIAEYKELNEQQAAVIEDNLEKSKI INNVVTELNQIAGNTHSLRVNVEHGVGELSQAEEINQKLQTLKKRLSAVEGKRSDGSKNL LATMDKLKSIIEQKEIEINNLKQEIANQQQTIANQKNTIASQQVTIDAQSQELMNKQQEM WYKLGTELHSVVEELPKVKGRKDKRNIKNTRYYILNKAKECFEHAAQLGHSLAGSKARQV EGEMSRL >gi|226332053|gb|ACIB01000003.1| GENE 71 94614 - 96224 554 536 aa, chain - ## HITS:1 COG:no KEGG:BF1305 NR:ns ## KEGG: BF1305 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 536 44 579 579 1024 99.0 0 MKEEFLNFDYPAGIVPILFAVDSIEAIQMGAYADECFNDLVDSISESRDFKHRGMLVLVS KNPKLIQIRLGNRYRVYCNMTGATSGLDYLDLQKQIQERGVEETLPLFLQNTSVRIQELN ELPSYKKYRINSAISVISTCLEYIGTPSENFYGKCVLTPILKITSFGYYIFKSWLLTFMF VCLIMLLCRWMIFLLVKRLLGENVIALMWTQKIINWGLGLLFSISAAASAIILSSGRMED AIALQAIGIPFMENFQIAAADYVLKTSFVAAFFFVLMYALKRNIISDIFLMSLLEPAKQQ EVYRSLSDAQKTALVIGHEADLNEVETSSEPYSELFTSRVSKQETVTIVSLAIAALFLIP RPLIIMGIALTIYPLVGQCVKIYHVVSNHTLPAQIKGDRRRALITNLLIIFAIAFITVLI 
GLFFNPMPDKKEIDRNEIKMELIAPDRLEGNYTVSKSIVGQIQVSSGIIKKVKDGTFQLL ITGKSSPKVYKLDFNSDKMIFVSGELGNGAIHYDKDLDKIKIVFNINEQTTWTISK >gi|226332053|gb|ACIB01000003.1| GENE 72 96398 - 96610 112 70 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MQFKEQKLFFNTHTNSVNLLTKEKYENLLIFRISANRTLRLPIRFVIVGSFRYHQNYTTF IKKATTKSIF >gi|226332053|gb|ACIB01000003.1| GENE 73 97360 - 98445 1195 361 aa, chain - ## HITS:1 COG:BH2816 KEGG:ns NR:ns ## COG: BH2816 COG0404 # Protein_GI_number: 15615379 # Func_class: E Amino acid transport and metabolism # Function: Glycine cleavage system T protein (aminomethyltransferase) # Organism: Bacillus halodurans # 1 361 4 362 365 325 46.0 8e-89 MKTTPFTEKHIALGAKMHEFAGYNMPIEYSGIIDEHLTVCNGVGVFDVSHMGEFWVKGPH ALDFLQKVTSNNVAALVPGKIQYTCFPNEDGGIVDDLLVYQYEPEKYLLVVNASNIEKDW NWCISHNTEGAELENSSDNMAQLAVQGPKAIQALQKLTDINLADIPYYTFKVGEFAGEKN VIISNTGYTGAGGFELYFYPDAAMKIWDAVFEAGAEFGIKPIGLGARDTLRLEMGFCLYG NDLDDTTSPIEAGLGWITKFVDGKNFTNRSMLEKQKAEGTVRKLVGFEMIDRGIPRHGYE LTTAEGDKIGVVTSGTMSPIRKIGIGMGYVKPEYSKIGTEICIDMRGRKLKAVVVKPPFR K >gi|226332053|gb|ACIB01000003.1| GENE 74 98479 - 99702 1223 407 aa, chain - ## HITS:1 COG:CAC0476 KEGG:ns NR:ns ## COG: CAC0476 COG2195 # Protein_GI_number: 15893767 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Clostridium acetobutylicum # 3 407 4 408 408 510 60.0 1e-144 MNLVERFLKYVSFDTQSDELTRLTPSTPGQMVFAEYLKSELESLGLEDITLDENGYLFAT LPANTEKELPVIGFIAHMDTSPDMSGKNVTPRIVEKYDGSDIVLCAEENIVLSPSQFPEL LDHKGEDLIVTNGKTLLGADDKAGIAEIVSAVVYLQEHPEIKHGKIRIGFNPDEEIGEGA HKFDVQKFGCEWAYTMDGGEVGELEFENFNAAAAKITFKGRNVHPGYAKHKMINSIRIAN QFITMLPRHETPEHTSGYEGFYHLIGIQGDVEQSTVSYIIRDHDRNKFEDRKKEIEHLVN KINAEFGEGTATLELRDQYYNMREKIEPVMHIIDTAFAAMEAVGVKPNVKPIRGGTDGAQ LSFKGLPCPNIFAGGLNFHGRYEFVPIQNMEKAMKVIVKIAELVASK >gi|226332053|gb|ACIB01000003.1| GENE 75 99835 - 101241 1055 468 aa, chain - ## HITS:1 COG:lin1880 KEGG:ns NR:ns ## COG: lin1880 COG0034 # Protein_GI_number: 16800946 # Func_class: F Nucleotide transport and metabolism # Function: Glutamine 
phosphoribosylpyrophosphate amidotransferase # Organism: Listeria innocua # 3 453 13 438 475 192 31.0 1e-48 MGGFFGTVSKTSCVTDLFYGTDYNSHLGTKRGGLATYSEEQGFIRSIHNLQSSYFRTKFE EELDKFKGNAGIGIISDTDAQPIIINSHLGRFAIVTVAKVTNLKELEEELLSQNMHFAEL SSGSTNQTELIALLIIQGKNFVEGIENVYNHIKGSCSMLLLTEDGVIAARDKWGRTPIVI GKKEGAYAATSESNSFPNLDFEIERYLGPGEIVRMHADRLEQLRKPDDKMQICSFLWVYY GFPNSCYEGRNVEEVRFTSGLKMGEQDDCDADCVCGIPDSGIGQALGYAEGKGIPYHRAI TKYTPTWPRSFTPSKQELRSLVAKMKLIPNRAMLQDKRIIFCDDSIVRGTQLHDNVKILF DYGAKEVHMRIGCPPLIYGCPFIGFTASKSDMELITRQIIKELEGDENKNLDKYATTGSP EYEKMVGIIAKRFGLSSLKFNTIETLIEAIGLPKCKVCTHCFDGSSCF >gi|226332053|gb|ACIB01000003.1| GENE 76 101325 - 101687 361 120 aa, chain - ## HITS:1 COG:BMEII0787 KEGG:ns NR:ns ## COG: BMEII0787 COG3189 # Protein_GI_number: 17989132 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Brucella melitensis # 1 116 1 115 116 106 46.0 1e-23 MTKIKIKRVYEDPSDTDGYRVLVDRLWPRGMKKEHLKYDYWAKELTPSSDLRKWFHDDVP GHWKEFAEMYRKELETSDKTSEFLSRIRSCESVTLLYASKEPVYNHARILQAFLEERLKK >gi|226332053|gb|ACIB01000003.1| GENE 77 102201 - 103337 1141 378 aa, chain + ## HITS:1 COG:PA1777 KEGG:ns NR:ns ## COG: PA1777 COG2885 # Protein_GI_number: 15596974 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: Pseudomonas aeruginosa # 270 360 233 323 350 60 34.0 4e-09 MKKSIFMFALATLMSASVFAQDNDAKRLPGYKTTFEGNGFWNNWFMSANFGAQSLFAENS KDAKFRNTITFMPTLSVGKWFNPYWGVRLQGTGGSLHGFTSGANSMLHYQYGAVHADFMF GLINFFAPYKENRRFDIVPFAGIGGAFIRKGDQSFTINAGIQARYRISKRFDINVEYQGA ILDDDMVVRGGFPNDGISGLTAGVTFRFGKTGFKKGYSSRQYNAIKSNYSDLEANYATLK KENAAQQAEIAELKARKPEIVKEETIIDNSEIKALPSTITFPFNSSKIELSQEVSIFNIA EFLKANPEIRVRLTGYADKRGSEQANRIVSERRANAVTEVFVNKYGIAKDRITTEFKGTS TKFENDDWNRAVVVELIK >gi|226332053|gb|ACIB01000003.1| GENE 78 103397 - 103828 217 143 aa, chain - ## HITS:1 COG:RSc0048 KEGG:ns NR:ns ## COG: RSc0048 COG0735 # Protein_GI_number: 17544767 # Func_class: P Inorganic ion transport and 
metabolism # Function: Fe2+/Zn2+ uptake regulation proteins # Organism: Ralstonia solanacearum # 13 137 30 161 172 60 28.0 7e-10 MEDLCLKILEQRGIKPTAIRMLVLKAMMETEQAVSLLDLENKLDTVDKSTIFRTITLFVS HHLAHSVDDGTGSLKYAVCDSECTCAVKDLHTHFYCEYCHKTFCLENIHVPVVDLPEGFA VRSINYVLKGCCAECAAELKKDH >gi|226332053|gb|ACIB01000003.1| GENE 79 103831 - 105777 1616 648 aa, chain - ## HITS:1 COG:PAB0626 KEGG:ns NR:ns ## COG: PAB0626 COG2217 # Protein_GI_number: 14521140 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Pyrococcus abyssi # 32 643 87 687 689 545 49.0 1e-154 MGHCKCEMNANVRVESCQEANGFIKEYWKVILSLLMLIAGAIMNQLDVAFFRDNTISLVW YILAYLPVGLPVIKKAWESILQKDFFSEFTLMSVATLGAFYIGEYPEGVAVMLFYSVGEL FQDKAIDKAKRNISALLDVRPEKAVVVRSNEIVTVDPRSVLINEIIEIKAGERVPLDGVM LDEVAVFNTAALTGESVPRDISRGEEVLAGMIVTDKVIRMKVTKPFDKSALARILELVED ASERKAPAELFIRKFARVYTPIVIGFAFLIVLIPYIYSFINPLFGFVFNDWLYKALVFLV ISCPCALVISIPLGYFGGIGAASRLGILFKGGNYLDAITRVNTVVFDKTGTLTKGVFEVE SCRVVPGTSEEDLLRVVASIEKNSNHPIARAIVAYAQDKGIDLIVTKNIEEIAGYGLMTE IDGKRVLVGNTRLLSKYSIEFPKAVFSITETTVVCAVGDKYIGCIILSDVLKDDASDTVK ALKELNIKNIQILSGDKQSIVTIFANKLGITQAYGDLLPEGKVEHIERLKEEKGNQIAFV GDGMNDAPVLALSDVGIAMGGLGSDAAIETADVVIQTDQPSKVATAIKVGRCTRRIIWQN VLLAFGVKLLVLILGASGIATLWEAVFADVGVTLIAIMNAVRIQKMIK >gi|226332053|gb|ACIB01000003.1| GENE 80 105896 - 106930 796 344 aa, chain - ## HITS:1 COG:NMA0465 KEGG:ns NR:ns ## COG: NMA0465 COG2855 # Protein_GI_number: 15793467 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Neisseria meningitidis Z2491 # 25 335 22 330 338 229 42.0 6e-60 MLSEKRNSMLHGVLLIALFSCAAFYIGDMEFIKKLSFSPMIVGIILGMLYANSLRNNLPE TWVPGIQFCSKRILRIGIILYGFKLTFQDVLAVGLPAILIDTIVVTITILGGILIGRMLK MDRGVALLTSIGSGICGAAAILGAESTIQTKPYKTAVAVSTVVIFGTLSMFIYPILYRNG TFVLSPNEMGIFTGATLHEVAHTVGAGNAMGKEVSDVAIIVKMIRVMMLVPVLLITSFMV SRAAVKAGGQGGSMKDISIPWFAIGFLAVIGFNSFDLLPQSLIAFINNLDTFLLTMAMTA LGAETSIDKFKKAGAKPFVLASLLYLWLIVGGYFLVKLLAPVLM >gi|226332053|gb|ACIB01000003.1| GENE 81 107095 - 109473 1582 792 aa, chain 
- ## HITS:1 COG:rcsC_1 KEGG:ns NR:ns ## COG: rcsC_1 COG0642 # Protein_GI_number: 16130155 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Escherichia coli K12 # 534 785 426 677 700 144 35.0 6e-34 MEFINPKFKYIFASILLLGGFLCGYNFKQWYVTNHRNEKHISLIYSCSKKEADRGQFEEL LKKEFRSKGVEPIFDKYYLDCNRFGQKVEIEHISNYLEALKSKPIDLVMAMGDQATHSLL STRHPLLYSVPVVACNVHFPDEDLLKKYESRKVYALRDTPDFKHNMEFIRSLQPHVGLEI VYNIDLTPLGHKSFDLLNQVVDRKDVQILGHKSAFSMEYEYKEMREMIEFYNLMPAVANN RIKKNELTISLCPFRYIKGASLLVMMENSKSEQGKKAFLLDKFDMVSFPIVNALSIPSFS CVHAGFGEGNKIVGGYMATKEISVQAAADLSTRLMNKEKIGMPKIRDLRKEYVLDWSVFS TYNGYDIKNVGKNVRIINYPIYDHYREEFYLLGVLFVFAFIFISVSLLHTRRRSLIERKN LKILEEAHKRLTLSADGGQISLWNMQGSDIEFDDNYARLTGVEKRKFKRTDFLEYVHPDD SQLLSSFYEALCKSPGVEIQRVRFRFDEKKGYQWYELRCRSLKDIKGEMMLAGIMQNIQE SVEREHQLIVAKQMAEKAELKQSFLNNMSHEIRTPLNAIVGFTNVLLGEGSEEIDPDEKA SMLEIINHNNELLLKLINDVLEISRLDSGSLDFDMKEWNMTDIVKEIYKTYQPLIRSSLQ FRLELDDTVPVPVHTDRLRFAQVISNFLNNANKFTQNGYIALGCKVDKKHREVRIYVEDS GKGIDEKELMMIFERFYKTDEFEQGSGLGLAISKVIIERLSGRIKVHSEKGKGSCFTVIL SLADAPENHMLI >gi|226332053|gb|ACIB01000003.1| GENE 82 109712 - 111241 1008 509 aa, chain + ## HITS:1 COG:CC1172 KEGG:ns NR:ns ## COG: CC1172 COG3119 # Protein_GI_number: 16125424 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Caulobacter vibrioides # 22 484 27 482 521 157 27.0 6e-38 MSNRIVSRSLMGALCSACFTTASAQERPNIIVFLVDDMGLMDTSVPFVTDENGNAQRQPL NDWYRTPNMERLANQGIRFSTFYAQSVSSPSRASIMTGQNAARHRTTNWINAESNNRTPY GPFHWNWKGLTHQDMIYPYLLQQAGYKTIHVGKAHFGCLKSEGENPTNLGFDVNIAGSAI GHPGSYHGENGYGWIKGQRARAVPDLEQYHKTHTFLSDALTLEAGKEIEKAVAEKKPFYL NMAHYAVHSPFETDERFISHYTDPNKSQQARAFATLIEGMDKSLGDILDKLEDMGIAENT LIIFLGDNGGDAPLGDAADYGSSAPFKGKKGSEYEGGVRVPFIVSWAHPNPNNKFQKAYP IARNAIQTQMGTVMDIYPTVLSVAGVKPAPNHILDGADLRKLLKGKRDKKHRDDFLMHFP HEHRGSYFTSYRKGDWKFIYYYNPQTPEAPTYKLFNLSEDPYEKNDLSKTNQQKAKELFR LMVQRLEKEQALYPVDADKNVLSPIFVAE >gi|226332053|gb|ACIB01000003.1| GENE 83 111359 - 111682 336 107 aa, chain - ## 
HITS:1 COG:slr0233 KEGG:ns NR:ns ## COG: slr0233 COG0526 # Protein_GI_number: 16331440 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Synechocystis # 22 91 21 90 105 66 38.0 1e-11 MEEKKEARQKRNREKLATADWVMAEFYATWCPHCERMQPVVEEFKKLMEGTLEVVQIDID QEDALANFYTIESVPTFILMRKGEQLWRQSGELDLERLKKAVKDFKS >gi|226332053|gb|ACIB01000003.1| GENE 84 111858 - 112670 764 270 aa, chain - ## HITS:1 COG:Cgl0802 KEGG:ns NR:ns ## COG: Cgl0802 COG0566 # Protein_GI_number: 19552052 # Func_class: J Translation, ribosomal structure and biogenesis # Function: rRNA methylases # Organism: Corynebacterium glutamicum # 6 262 7 270 276 141 33.0 1e-33 MPIIEISSLSHPGVEIFSTLTEAQLRNRIDSDRGIFIAESPKVIQVALDAGYEPLAILCE QKHIIGDAAVIIERCGDIPVYTGKRDILALLTGYTLTRGVLCAMRRPELRSVEEVCREAK RIVVIDGVVDTTNIGAIFRSAAALGIDAVLLTRNSCDPLNRRAVRVSMGSIFLVSWTWID GSLSRLGDLGFRTVAMALTDDSISIDDPVLKTEPKLAIIMGTEGDGLPYETISEADYVVR IPMSHSVDSLNVAAAASVAFWELRAPTFKE >gi|226332053|gb|ACIB01000003.1| GENE 85 112750 - 113790 592 346 aa, chain - ## HITS:1 COG:RSc0194 KEGG:ns NR:ns ## COG: RSc0194 COG1063 # Protein_GI_number: 17544913 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Threonine dehydrogenase and related Zn-dependent dehydrogenases # Organism: Ralstonia solanacearum # 1 335 1 335 345 214 37.0 2e-55 MLAYTYVEHGKFELLEKLRPQIKDARDAIVRVTLGSICTSDLHIKHGSVPRAVPGITVGH EMVGIVEEVGSGVISVRPGDRVTVNVETFCGECFFCRHGYVNNCTDPDGGWALGCRIDGG QAEYVRVPYAEQGLNRIPDNVSDEQALFVGDVLATGFWATRISEISEDDTVLIIGAGPTG ICTLLCVMLKKPRRIIVCERSAERIRFVREHYPDVLVTEPENCRDFVLCNSDHGGADVVL EVAGSEETFRLAWECARPNAIVTIVALYDKPQFLPLPDMYGKNLIFKTGGVDGCDCTEIL SLIEQGRIDTTPLITHRFPLNEIEEAYRIFENKLEGVIKIAITGGR >gi|226332053|gb|ACIB01000003.1| GENE 86 114073 - 114642 306 189 aa, chain + ## HITS:1 COG:no KEGG:BF1288 NR:ns ## KEGG: BF1288 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 189 1 189 189 356 100.0 3e-97 MENGIKKILRDKYGLFEKDIEILLEKFTKFSVPKREIILQEGQTDHYIYFVEKGIVKSTI LREGREFIIFFALENDAPLSSPNLTESRQSLYTLEAVDTCILWRISRKDLATSFKDSLNL SNWGRMILQEWLTSSAFYFSSIHWMSKKEQYQYLLKEMPKLIQRLSMRDLSAWLDITPQS LSRIRADMH >gi|226332053|gb|ACIB01000003.1| GENE 87 114712 - 115626 461 304 aa, chain + ## HITS:1 COG:PAB0040 KEGG:ns NR:ns ## COG: PAB0040 COG0697 # Protein_GI_number: 14520295 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Pyrococcus abyssi # 6 282 23 291 295 63 26.0 5e-10 MKSLKGIIYAMISSGTFGLIPLFTIPLMGNMEMNESNILFYRFLFSTLMMGAVCLLHKST LKINMKHLISIIGLGALYALTALFLIYSYHYITSGVATTIHFLYPICVSFLMVVFFKEQK SKSLFLAASLSLIGVALMCWSGGGSIRLMGIGLAALTILTYGIYIVALNQLDIGKLPAEV LTFYVLLGGCVIFFIYSLFTAGISSIPSTKAGFYILALAFLSTVISDLTLILAVKYAGST TTAILGSMEPLVAVVVGVLVFSEHFTLQSLIGLLLILMSVIIVILADQQKKSKRITEQKI IKPH >gi|226332053|gb|ACIB01000003.1| GENE 88 115968 - 117026 768 352 aa, chain - ## HITS:1 COG:no KEGG:BF1286 NR:ns ## KEGG: BF1286 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 352 1 352 352 589 99.0 1e-167 MKVKLLFLVFVLYACGTKTVSEEKSYDRITMTTYENKYDENNRLSEVQLTRTSHHRYEED SETIDLIDDKSTYYYTYINNEEFTVRRKSKRSGNIKIMRYAPQREEVLTLNAQGDTIDYL LQKYYDKSKLKLVYVRNINNDYVLHEDNDYEEKNEYDGNGNLTKRVQYYFDIGKKRTTYF FRGLSYEEAKKRIPRTDEDYDIVCDIEKMAGDTLIRKCIKNGIVSSINKTIVDEKGKKEF IFDADMKFTGSFTEFKSDGFDIHVDRIVLDDCTDVDSTYYKNGKEVRCVYLSDTSKRIVL SKYDKWGNMVERVEKTKYFYSQDGEELINEMLQVVRENEKKKESRKRLKISK >gi|226332053|gb|ACIB01000003.1| GENE 89 117769 - 119061 961 430 aa, chain + ## HITS:1 COG:no KEGG:BVU_2469 NR:ns ## KEGG: BVU_2469 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.vulgatus # Pathway: not_defined # 1 430 1 430 430 849 100.0 0 MNIKRNIIFALESRKKNGVPITENVPIRMRVIFASQRIEFTTGYRIDATKWDTDKQRVKN GCTNKLKLSASEINADLLRQYTEIQNIFKEFEVQNVLPTPAQIKDSFNARMKDAPVTEQE 
TAEGPKISFWEAFEEFVRECGKQNDWTDATYEKFAAVKNHLKEFRDELSFDTFTENGLND YVDFLRNKKDMRNSTIGKQIAFLKWFLRWSFKKGYNQNMAYDSFKPKLKNTPKKVIFLTW EELNRLKDYKIPQTKQYLERVRDVFLFCCFSGLRYSDVYNLKRSDIKPDHIEVTTVKTAD SLIIELNNHSKAILEKYKSVHFEDHKALPVISNQKMNDYLKELGELAEINDPVRETYYKG NERIDTVTPKYALLGTHAGRRTFICNALALGIPAQVVMKWTGHSDYKAMKPYIDVADDIK ANAMNKFNQL >gi|226332053|gb|ACIB01000003.1| GENE 90 119075 - 120091 562 338 aa, chain + ## HITS:1 COG:no KEGG:BVU_2468 NR:ns ## KEGG: BVU_2468 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 338 1 338 338 603 99.0 1e-171 MEERITSMIPRYGKLNKTYTEITSGDGLSFEKQKFIHDFYKEYEDTQTFEKALISLMLET EGTHFSILLNSLKREIENNISMYNTCKELFDRLDIEHICRQHERCHDRDIERQMQITNEY YRELMEANGSLEAVGFREHDRQEEERLEKRYGQCKREYDREKAKLDELYAQKEQARREAL QYLKNRCGDIYRLDGSLLAILEKYMTGQKKKEGEEKEAATPTPSPTYFPMKLLSAVYEKC NGEQFEAISELDFYASMNLQPCEGKLIIRPREKARVCYLIFLMGETLHKPDREKWRKDIM NLLGIDDTYYKSKYKEPVSDFPSDSNQIFAKEMQSIFR >gi|226332053|gb|ACIB01000003.1| GENE 91 120314 - 120622 244 102 aa, chain + ## HITS:1 COG:no KEGG:BVU_2467 NR:ns ## KEGG: BVU_2467 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 102 1 102 102 190 100.0 1e-47 MEIRELLSKPVWQMTGEEFILLNRHALQEREARAAQPAADTEKKYVYGIGGIARLFGCSM PTANRIKKSGKIDRAITQIGRKIIVDADMALELAGHKSGGRR >gi|226332053|gb|ACIB01000003.1| GENE 92 120635 - 121708 610 357 aa, chain + ## HITS:1 COG:no KEGG:BF1280 NR:ns ## KEGG: BF1280 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 357 1 357 357 692 99.0 0 MEYRERKEVTPEAAVILWQASRLDLSEDYERAPEILKVHGSVIGTLGNFSASIGKAKSKK TFNVSAIVAAALKNGTVLSYTAELPENRRKILYVDTEQSSYHCAKVAKRILRMAGLPTGR NHKDLEFLVLRKYTPEERIAIVREAIYRTENVGLVVIDGIRDMVYDINSPSESTKVISLL MTWTGERHIHIHTILHQNKGDENARGHIGTELSNKAETVLQVEKDEKDPDISTVKAAHIR AMNFEPFAFRINGEALPELLDEYLFKHKDPGKGKKEKFDPYKDITEKQHRIALEAAFTLK DEYGYKELAEELRKTYASVGVMLGGNRLTDLITVLKNKRMIVQENGRKYTFKPDFHY >gi|226332053|gb|ACIB01000003.1| GENE 93 122037 - 123005 647 322 aa, chain + ## HITS:1 COG:no 
KEGG:BF1279 NR:ns ## KEGG: BF1279 # Name: not_defined # Def: DNA primase # Organism: B.fragilis # Pathway: not_defined # 1 322 1 322 322 626 99.0 1e-178 MNMNEAKQIRIEEYLHSLGYSPVRQQGGSLWYNSPFRDEQEPSFKVNTERNLWYDFGAGK GGNIIALAQELYASDSLPYLLERIREQAQNVRPVSFSFGKQPLSKPSFRQLEVVPLSSPA LYTYLRQRGINTELAKRECREVRYLTGETPYYAIGFPNRSGGYEIRNKHFKGCIAPKDIT HIRQSESKEACYIFEGFMDYLSFLTLRLERCPDRPELDGQDYIVLNSTSDLSKAIRPLGG YESIHCFLDNDKAGMEAVQELQKEYGLRIRDASHIYEGYNDLNDFLRDKRSGQAQRQQEK PEAEKRQSQTEQPKKKGKGIRM >gi|226332053|gb|ACIB01000003.1| GENE 94 123189 - 124529 826 446 aa, chain + ## HITS:1 COG:no KEGG:BVU_2464 NR:ns ## KEGG: BVU_2464 # Name: not_defined # Def: mobilization protein # Organism: B.vulgatus # Pathway: not_defined # 1 446 1 446 446 782 96.0 0 MGYAVLHLEKAKGTDSRMSAHIERTVHPKNADRTRTHLNRELVQFPEGVRNRTQAIAHRV ETAGIRRKVGTNQVRAIRVLLTGSNRDMKQIEADGRLDDWCNDSLQWLRETYGEQNLVSA VLHMDEKTPHIHATVIPIVTGERRKAGQEEQNGKKKYRKKNPQDVRLCADDVMARHRLKH YQDTYAQAMNKYGLQRGVDGSLARHISTMQYYKQLVEQQDSLQENIENLLGLEEEAMKKL KQVKGEINVQKMKGAAVNATTAIADGVSSLFGGSKVKKLEAENENLKRNIVNLQKQVQAE QREQTKMENRHSSEINRVDRSYRQKIAEYDNRLELIDTYFPIVKELIPIAEQCREVGFTE ELTRRIVSLQPVEFKGRLYSKEYKEKFRTEHSTATVERDPQEKGKFRLCIDGVPILDWFR KKFQEIKEKLGLNTPVENKQRKGLKI >gi|226332053|gb|ACIB01000003.1| GENE 95 124663 - 125160 262 165 aa, chain - ## HITS:1 COG:no KEGG:BF1270 NR:ns ## KEGG: BF1270 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 165 1 165 165 298 98.0 5e-80 MSVYKSKFQPIMERWSLVLKIVLLTTGLMFLLRTFMLQSLQDQIASLLLGIILVSIYIYM LIVIPSFICLEKDQLVIKRVFGKKCIPYSCIKNAFVYDGVQNDVRYFGSNGFLGYIGVMG STHYGKYYSYVKNPNQQIFILTEKKNYLLSCENRDMFIKDLLIHL >gi|226332053|gb|ACIB01000003.1| GENE 96 125184 - 126152 438 322 aa, chain - ## HITS:1 COG:no KEGG:BVU_2456 NR:ns ## KEGG: BVU_2456 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 322 1 322 322 637 99.0 0 MKIEKIFGCILSACMIVSLLSCNSENEFIGSAQETIPDVEVSTMDMWVKETRSVSEYLNM PVLHFKDEQVYSETLRQLKNMTENERFTYFQQLGFEGAYILWEQADRELDKIFDMESDDS 
HLIQEMINTYKDKYSDIFSFNTVDLFDVTPYFTFTDNDLSLLGNIKGYVVIGNSLRGPKY DYPTYDLDEVVSATRAAEPTPIEPGFKGFKDASLTIKNGKYKSTMTIGRIVNGNSFAVEF KTKKKQLFWKKSVKAGYSAMLTMKSSKFNYKNTVFCPYGKEVLILNLPIERVGNVFDAVV ENFKSSRGDAKGNQSFHNIRVI >gi|226332053|gb|ACIB01000003.1| GENE 97 126182 - 126304 88 40 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKIVFVLFGCLIIFQSCSNVFYNHANKTEKVTVKVKPFK >gi|226332053|gb|ACIB01000003.1| GENE 98 126317 - 127096 352 259 aa, chain - ## HITS:1 COG:no KEGG:BVU_2455 NR:ns ## KEGG: BVU_2455 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 258 6 263 269 509 98.0 1e-143 MYRKVTTLISFAAFILFWIIMCPLGMYGQESKWVDMNPVGKRYLRFHKDFAVDSVKYPIW EYQNSVIFANSGVTLQFKKGGPFFYTDQSPLHKGEYNVSGVIAPLWNGHVYGMGRQMNLI GIGVFNYAAIGYNKSLSDNLFADVALHAMKLSTPRHVDETFGISGKLSYSFNSRMSLHVF GGFLFTPVTSFSRNDYGGSVAFDITDSFGTELGIRGYREYPFERGGVTPILMPYYKFKKH KVEMDFGGLLQEIIYGLFK >gi|226332053|gb|ACIB01000003.1| GENE 99 127422 - 127631 88 69 aa, chain - ## HITS:1 COG:no KEGG:BF1275 NR:ns ## KEGG: BF1275 # Name: not_defined # Def: putative hemolysin # Organism: B.fragilis # Pathway: not_defined # 2 45 7 50 368 81 88.0 9e-15 MLLILPIMFLILCSDDTVEEIGGSNHVLFERYLQEVINISMDNLERVRLVVQELVLSHYL LSVQYREPL >gi|226332053|gb|ACIB01000003.1| GENE 100 127678 - 128517 328 279 aa, chain - ## HITS:1 COG:no KEGG:BF1274 NR:ns ## KEGG: BF1274 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 279 1 279 279 494 89.0 1e-138 MKRIFILLFVSLALFSCEKEENNKILEYQVLPSENVSKPFNCLGEKRTITFTIIQKTLID DVLDSEVPIIPKDVSIEFDKTLFSDIETKVKGDQVVLNITSNINKEDKILNGDLQISYST INGIKVEKIPLIIDKGELTFMYKIHSEQNPFILPAEGGKFELPFTCKKQTYLNGQFIEER YSALKGLRFKTISTGNIWFLTVRKDGEKIGFYKFTFVGEGPYNQKREPECYFNIYIHDAD VVADNPPEIFRQDFMQPQTPGEDYYIPSRASYKHGTFDF >gi|226332053|gb|ACIB01000003.1| GENE 101 129178 - 131676 2421 832 aa, chain + ## HITS:1 COG:BH3106 KEGG:ns NR:ns ## COG: BH3106 COG1193 # Protein_GI_number: 15615668 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS 
family) # Organism: Bacillus halodurans # 13 831 10 784 785 352 30.0 2e-96 MIYPQNFEQKIGFDQIRQLLKDKCLSTLGEERVNEMNFSDHFEEVDELLNQVAEFVRIIQ EEDNFPDQFFFDVRPSLKRIRIEGMYMDEQELFDLRRSLETIRDIIRFLQRNDEEESDCP YPSLKKLAGDITVFPQLITKIDGILNKYGKIKDNASTELSRIRRELANTMGSISRSLNSI LRNAQSEGYVDKDVAPTMRDGRLVIPVAPGLKRKIKGIVHDESASGKTVFIEPAEVVEAN NRIRELEGDERREIIRILTEFSNTLRPSIPEILQSYEFLAEIDFIRAKSHFAIQTNSIKP SLENEQLLDWTMAVHPLLQLSLAKHGKKVVPLDIELNLKQRILIISGPNAGGKSVCLKTV GLLQYMLQCGMLVPMHERSHVGLFGSIFIDIGDEQSIEDDLSTYSSHLTNMKIMMKNCNE RSLILIDEFGGGTEPQIGGAIAEAVLKRFNIKGTFGVITTHYQNLKHFAEDHEGVVNGAM LYDRHLMQALFQLQIGNPGSSFAVEIARKIGLPEDVIADASEIVGSEYINADKYLQDIVR DKRYWEGKRQTIRQREKHMEETIARYQAEMEELQKSRKEIIRQAKEEAERLLQESNARIE NTIRTIKEAQAEKEKTRLVRQELADFRESIDNLTSKEQEDKIARKMEKLKEKQNRKKEKK QNGTKEQPAVQQTPKATPITEGCPVRIKGQSSVGEVLEINGKNAVVAFGSIKTTVKTERL ERSNAIPQKQESAKSSFVSNQTQDSMYEKKLNFKQDIDVRGMRGDEALQAVTYFVDDAIL VGMSRVRILHGTGTGILRTLIRQYLQTIPGVRHFADEHIQLGGAGITVVDLA >gi|226332053|gb|ACIB01000003.1| GENE 102 131689 - 132750 654 353 aa, chain + ## HITS:1 COG:MA1721 KEGG:ns NR:ns ## COG: MA1721 COG0598 # Protein_GI_number: 20090573 # Func_class: P Inorganic ion transport and metabolism # Function: Mg2+ and Co2+ transporters # Organism: Methanosarcina acetivorans str.C2A # 54 352 66 355 356 216 39.0 6e-56 MKNNLLSERLVYNGESQTPTHLHLCTYNALVMQEVSGVNFQTVANSLNREQINWLQVHGL QNTEVIREICSHFEIDFLILQDILNAEHPTKIEEHDKYTVLIMKLFRFNNKEEKSPEDEL DELEQQQVCIIQGSNFVLTFLENETDFFDDVTSALHNDVLKIRGRQSDYLLSVLLNSIMG NYIATVSTIDDSLEDLEEELLTISDGNDIGIQIQALRRQYMLMKKAILPLKEQYVKLLRA ENSLMHKVNRAFFNDVNDHLQFVLQTIEICRETLSSLVDLYISNNDLRMNDIMKRLTIVS TIFIPLTFLVGVWGMNFKIMPELDWRYGYVYAWILMLLVGGAVYWFFRKKKWY >gi|226332053|gb|ACIB01000003.1| GENE 103 132807 - 134015 856 402 aa, chain + ## HITS:1 COG:FN1106 KEGG:ns NR:ns ## COG: FN1106 COG1760 # Protein_GI_number: 19704441 # Func_class: E Amino acid transport and metabolism # Function: L-serine deaminase # Organism: Fusobacterium nucleatum # 1 399 1 399 408 436 54.0 1e-122 
MKSIKELYRIGTGPSSSHTMGPRKAAEMFLTRHPEAASFKVTLYGSLAATGKGHMTDVAI IDTLKPTAPVDIIWQPKIFLPFHPNGMNFVALDAGGNELENWTVYSVGGGALAENNKQPS IESPEVYSMNSMTEILDWCEHTGKSYWEYVKECEDPDIWDYLKEVWDTMKESVQRGLEQE GVLPGPLNLRRKASTYYIRATGYKASLQSRGLVFAYALAVSEENASGGKIVTAPTCGSCG VMPAVLYHLAKSREFSEMRILRALATAGLIGNIVKQNASISGAEVGCQGEVGVACAMASA AANQLFGGSPAQIEYAAEMGLEHHLGMTCDPVCGLVQIPCIERNAYAAARALDANLYSSF TDGMHRVSFDKVIQVMKQTGHDLPSLYKETSEGGLAKDYKPM >gi|226332053|gb|ACIB01000003.1| GENE 104 134119 - 134619 453 166 aa, chain - ## HITS:1 COG:no KEGG:BF1264 NR:ns ## KEGG: BF1264 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 166 1 166 166 334 100.0 7e-91 MKFLKFSLLTAVLLSVVFAFSSCGDDDDTGYLPPSQAIQDALKKLYPNATAIKWEQKGVY YVADCQADGREKEVWFDANANWLMTETELNSINNLPPAVLTAFMESSYNNWVVDDVVILE YPNEPSTEFVVTVEQGKKVDLYFSEGGGLLHEKDVTNGDDTHWPRI >gi|226332053|gb|ACIB01000003.1| GENE 105 134657 - 135100 507 147 aa, chain - ## HITS:1 COG:no KEGG:BF1263 NR:ns ## KEGG: BF1263 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 147 1 147 147 287 100.0 1e-76 MKKILSILVLAIAAVQFAFAGDIITKDAMKLPLPARNFINRHFSNPQISHIKIENEILQT KKYDVLLTNATEIDFDNRGNWIEVDCKKAAVPATIVPDFVKEYMKANGYHSEFVTQIERD RKGYEVELNTDLSLKFTKDGKFRKAEH >gi|226332053|gb|ACIB01000003.1| GENE 106 135218 - 135772 537 184 aa, chain - ## HITS:1 COG:no KEGG:BF1262 NR:ns ## KEGG: BF1262 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 184 1 184 184 350 100.0 1e-95 MKKLIYLLGLAMLMAVAACSGSAEKKKSDIRVLMQDSTDAHGVQRMTARKSEVDIKYKGK EYHSFISRTPNDSLPRVVSQMGNTYVDNQIVLRLTRGNERVFSRTFTKKQFESLIGDDFM AKSILEGIVYDKTTPEGIVYAASICYPQTDLYVPISITISPDGKISMKKEELLEEVYDED TSAR >gi|226332053|gb|ACIB01000003.1| GENE 107 135921 - 136388 503 155 aa, chain - ## HITS:1 COG:no KEGG:BF1212 NR:ns ## KEGG: BF1212 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 155 1 155 155 188 100.0 7e-47 
MKRISFLLAALLMMGGMVMAQGPRRGGQDMDPKTRAERMTERMAKEYSLNETQKKELLEV NMAFVQKMGERPGRMKPEMRQGKKGQSQATDSCTCKQDRRKAPRMSKEDREKMRQEMKAS RESYEAGLKKIMTKDQYAAYTKKQAEREQRRGGGR >gi|226332053|gb|ACIB01000003.1| GENE 108 137329 - 137895 649 188 aa, chain + ## HITS:1 COG:STM0608 KEGG:ns NR:ns ## COG: STM0608 COG0450 # Protein_GI_number: 16763985 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peroxiredoxin # Organism: Salmonella typhimurium LT2 # 4 188 3 187 187 253 62.0 1e-67 MEPIINSQMPEFKVQAFQNGSFKTVSSEDVKGKWAIFFFYPADFTFVCPTELVDVAEKYE QFQAMGVEVYSVSTDSHFVHKAWHDASESIRKIKYPMLADPTGVLSRGFGVMIEEDGMAY RGTFLVNPEGKIKIAEIQDNNIGRNADELLRKVEAAQFVATHDGEVCPAKWKKGEATLKP SIDLVGKI >gi|226332053|gb|ACIB01000003.1| GENE 109 137996 - 139546 409 516 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148988049|ref|ZP_01819512.1| 30S ribosomal protein S9 [Streptococcus pneumoniae SP6-BS73] # 212 505 2 297 306 162 34 1e-38 MLDSAIKEQLRTIFAALEANYTFDITVSPQHESRTELLELLNDVASCSEKLSCRINEGKG LEFTLLKGEHRTGITFRAVPNGHEFSSLLLAILNTDGKGKNFPDEGICNRVKSLKGPVRL TTYVSLTCTNCPDVVQALNAMTTLNGQIQHQTVDGAINQAEVEALKIQGVPSVFADGKLI HVGRGDFGELLAKLEEQYGTETNSAETSIKKYDVIVAGGGPAGSAAAIYSARKGLNVAVI AERIGGQVKETVGIENLISVPSTTGSQLADNLKTHMSQYPIDLLEHRQIEKIELDGKDKV LTAKGGERFSAPAVIVATGASWRKLNVPGEAEYIGRGVAFCPHCDGPFYKGKQVAVVGGG NSGIEAAIDLAGICSKVTVLEFMDELKADQVLQEKLKSLPNVEVLVHSQTTEVVGNGDKV TGIRVKDRKTEEERVISLDGIFVQIGLVANSGIFRDVVEVNRPGEIVIDAHCRTNVPGIY AAGDVSTVPYKQIIIAMGEGAKAALAAFEDRMRGEI >gi|226332053|gb|ACIB01000003.1| GENE 110 139696 - 140703 976 335 aa, chain - ## HITS:1 COG:no KEGG:BF1208 NR:ns ## KEGG: BF1208 # Name: not_defined # Def: putative endonuclease/exonuclease/phosphatase family protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 335 7 341 341 693 100.0 0 MTLFLLVAMTGQSKGKGEFTVLQWNVWQEGTMVPGGYDAIVNEIVRLQPDFVTFSEVRNY HNTRFNERIVASLKEKGLDYYSFYTYDTGLLSKHPITDSLTVFPENGDHGSIYRLTSSVN GHKVAVYTSHLDYLDCAYYNVRGYDGSSWKEIPIPTTVEEVLKVNVASQRDDAIKMFIAQ AQKDIANGYSVIIGGDFNEPSHLDWIEANKNLYDHNGLVIPWTVTTLLEQNGFVDTYRHI 
YPNPLTHPGFTYPADNPLKEPGKLTWAPKADERDRIDFIFYKGKNLEAKKAIVFGPKGSI VRNKRVQETSKDNFLLPLDVWPTDHKGLLVTFKIK >gi|226332053|gb|ACIB01000003.1| GENE 111 140776 - 141675 837 299 aa, chain - ## HITS:1 COG:CAC0477 KEGG:ns NR:ns ## COG: CAC0477 COG1524 # Protein_GI_number: 15893768 # Func_class: R General function prediction only # Function: Uncharacterized proteins of the AP superfamily # Organism: Clostridium acetobutylicum # 23 233 2 245 434 76 26.0 6e-14 MKKLFVLVCLLAIGFCNVAWAAKRQAKHVVLIALDGWGAYSVPKADIPNIKSLMDEGCYT LHKRSVFPSSSAINWASMFMGVGTELHGYTEWGSRTPEIPSRVVNEHGISPTIFSVMRQQ YPEAETGCLYEWEGIKYLVDTLALSYHAQAPDYDKYPTALCEMAEKYIKDKKPAMLAVCF DQLDHTGHAVGHDTPGYYEKLKELDGYVGRIIAAIKEAGIYDDTIIMMTADHGGIKKGHG GITLQEVEIPFIIAGKNVRKGGEFQESMMQFDTAATMGYVLGVKQPQVWIGRPMIQVFK >gi|226332053|gb|ACIB01000003.1| GENE 112 141717 - 142703 923 328 aa, chain - ## HITS:1 COG:CAC1961 KEGG:ns NR:ns ## COG: CAC1961 COG1409 # Protein_GI_number: 15895233 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # Organism: Clostridium acetobutylicum # 21 313 32 321 324 174 37.0 3e-43 MKRIYLLLAILLGSLQLANAQQQTLQFNKDGKFKIVQFTDVHYIYNDPRSDVSIERINQV LDMEKPDLVLFTGDVIYGKPAEEGMRTVLNLVSKRKIPFAVTFGNHDNEQGLSREELLKI IQSVPFNLTQTTPGISGVTNFILPVKASDGKRNATVLYCIDSHSYSQIKGVNGYDYIKFD QIQWYRENSKKFTEENNGVPVSSYAFFHIALPEYNQAASSESAILYGIRKEKACAPQLNS GLFAAMKEMGDVRGVFVGHDHDDDYAVSWKGILLAYGRYTGGNTVYNHLTNGARVIELDE NANSFRTWIRLKEGVVQQVTYPADFIKE >gi|226332053|gb|ACIB01000003.1| GENE 113 142791 - 144314 1476 507 aa, chain - ## HITS:1 COG:no KEGG:BF1254 NR:ns ## KEGG: BF1254 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 507 1 507 507 986 100.0 0 MKLINKTLLLLTASATILASCNKFDDLNTNPDAATKVTAPMLATKLILNITQQGSAKNFV YHSMLSKQIAWSENASDEQYNLFGRTDFDGYTILTNCQKMVDAANEGDKDSYAALAKFMK AYKLFYKSLEVGDIPYSDALKGEEGVQKPKYDTQKEVMRQVLEDLEEAYTLFSSGSDFDG DPIFGGDTENWKKTVVAFELKVLTHLCKKDTDADLKVKERFARLVSSGSLMTSNADNFQL VYSNKAGQLYPFYYTQNKHSDYLMMSSVIIDNLKKFNDYRMFYYASPAKAQTDKGIKDSE 
WDAYLSIDPSDSYSNISVLYGKGEFCLANLRYTRVPEGEPLMRLSYAEQNFILAEGAVRG WISEDASTYYKKGIEASMNFIADNTPDKEEYTHGRTITQEYIQEYLAQPVIQLSGDKEKD LEMILLQRYLASFLQHPYDAYYDYRRTGYPVLPINPNTNLNTEKNKIPTRWMYPEAEFSY NPDNANEAVQRQYNGSDDVNKLMWILQ >gi|226332053|gb|ACIB01000003.1| GENE 114 144326 - 147712 3022 1128 aa, chain - ## HITS:1 COG:no KEGG:BF1253 NR:ns ## KEGG: BF1253 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1128 1 1128 1128 2150 99.0 0 MKKNLLLLSYAIKQREIIQILRIMKCTVLLLFLLILQAHASVSSQNARVNMSRNQLPLKE FMAEIEKQTDYLFIYSDAEINASRQVTVKKGTHRVADLLREVLSKNNISYNFADNYISLH VRKEADVQSLAVSPQQKKNTVLVSGTIVDTAGEPIIGANIKLKGAQGTGTITDVEGNFKL EVPTNGVLIVSFIGYQQQEVAVNGKTMLKIKMAEDAEMIDEVVVTALGIKREEKALGYAV QKVGGSKLSTVKPVNIATSLTGKVAGLNVTNSTEFNTAPTLKLRGETPLLVVDGVPYSNI SLNDIAADDIESVDVLKGATASALYGARGGSGAIMVTTKKGSQEGLNISVNSSTMFNAGY LVLPEVQHGYSTGQGGKYTAADFVWGDKLDIGRTAVQYDPYTYEWKEMPLISKGKDNFKN FLETGFITNNNISISQTGKYGSFRSSLSHVYNKGQYPNQRLNKITYSVGGDMKFGKLSFE GGAIYNKRFYPNGEGAGYGGGGYIYNLLVWTGTDYDVRDYKNYWRKKDEEQNWMNDVWYD NPYYLAHEMTSSNDYDKVNTYLSGKYDIMPWLNFSMRAGADAYASRTEKKNAMSARGGWD KNGYFYTSKSTGFSFNGDALLSANHSFGDFAIDGFVGGTIYYYYDDAISSNTRNGLSIPG YYSLKASVDPIASSSSYKQKQVNSIYGKFSASWKSTVFVDVTARNDWSSTLPSETRSYFY PAVSGSIIMSQLLKMPEWLNFWKLRGAWTVTKSDLGIYDTNQAYSVSTNVWDGMNTAVYP EMIRSTTLEPTAARSYEIGTAFNVWDNRLRFDISYYNKLKYNLTREATISGSSGFTKTLV NYDEEQVRRGVEVSLTASLIQTKDWNWEVNANWARDRYFYAKVDPVYSTQKPWVAAGKRW DWYGIYDWERDPQGNIIHENGYPVQSKYQSVMGNEYPDWIWGLSTTLRYKDWTLGISLDG RVGGMAYSRTEQTMWNTGVHPDSDNKWRYDEVVNGKKNYVGQGVKVVSGKVEYDTTGKIV SDTRVFAPNDTQVSYESYIKNYNPWSGGKVYQNVHDCTFLKLRELSLLYTMPKSVCEKIH MKGVTLGLIGQNLLIWMKEFKYADPDVDSDDLNSPSMRYVGFNVKFDL >gi|226332053|gb|ACIB01000003.1| GENE 115 147885 - 148850 708 321 aa, chain - ## HITS:1 COG:no KEGG:BF1252 NR:ns ## KEGG: BF1252 # Name: not_defined # Def: putative anti-sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 321 1 321 321 620 99.0 1e-176 MSELIDKYFAGEMTCEEKKDLFDRIESDEALKKEFLRMQNVVALTQILSCQDDSETSRKG 
KQHFMQLLFRKRLKRAITVSLKYAAVFAVLVVGTFYTAKLYLSEEFGKSYTIVTAPKGQR VKIELPDGTIAWLSPCSRLRFAASFNETDRKIELDGATYFDVAKNPEKPFVVSAKGYRIR VLGTKFNISAYKNSKEFETDLVEGCVHIYDPADIRNEVFLQPKEKAVLWGDRLMKRESDF DNEEYLKNGVVSFLSEPFGRVLNSVALWNDVNIKIERSVNATQRISGKFRQSDSLESILK ALQGAMPFKYKIVSEEEIIIY >gi|226332053|gb|ACIB01000003.1| GENE 116 149662 - 150564 621 300 aa, chain + ## HITS:1 COG:SMa1953 KEGG:ns NR:ns ## COG: SMa1953 COG2367 # Protein_GI_number: 16263522 # Func_class: V Defense mechanisms # Function: Beta-lactamase class A # Organism: Sinorhizobium meliloti # 11 279 13 317 334 72 24.0 1e-12 MQKRLIHLSIIFFLLCPALVVAQNSPLETQLKKAIEGKKAEIGIAVIIDGQDTITVNNDI HYPMMSVFKFHQALALADYMHHQKQPLETRLLIKKSDLKPDTYSPLRETYPQGGIEMSIA DLLKYTLQQSDNNACDILFNYQGGPDAVNKYLHSLGIRECAVIHTENDMHKNLEFCYQNW TTPLAAAKLLEIFRNENLFDKEYKNFIYQTMVECQTGQDRLIAPLLDKKVTMGHKTGTGD RNAKGQQIGCNDIGFILLPDGHVYSIAVFVKDSEADNRENSEIIAEISHIVYEYVTQQID >gi|226332053|gb|ACIB01000003.1| GENE 117 150748 - 151749 830 333 aa, chain - ## HITS:1 COG:yagH KEGG:ns NR:ns ## COG: yagH COG3507 # Protein_GI_number: 16128256 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-xylosidase # Organism: Escherichia coli K12 # 33 289 5 269 536 61 26.0 3e-09 MKHLVICIALLTVGACCFAQQKKVASHKATSGNPVFQGWYADPEGIIYDDTYWIFPTWSD LYENQTFFDCFSSKDLVNWTKHASVLDTTAVKWAKKAMWAPSVIRKNGKYYIFFGANDVH EGEIGGIGVAVSDRPEGPYKDLLGKPLINEIVNGAQPIDQFVYNDNGHYYMYYGGWGHCN VVQLNDDFTGLVPFEDGTVYKEVTPENYVEGPFMFKKDGKYYFMWSEGGWGGPDYSVAYA ISDSPFGPFKRVAKILQQDPSVATSAGHHSLLHAPGTDDYYIVYHRRPLNDNARDHRVTC IDKMTFDKDGFINPVKMTFEGVPAQKIKKSKKK >gi|226332053|gb|ACIB01000003.1| GENE 118 151806 - 152351 509 181 aa, chain - ## HITS:1 COG:PA0149 KEGG:ns NR:ns ## COG: PA0149 COG1595 # Protein_GI_number: 15595347 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 31 170 38 168 181 62 31.0 5e-10 MKKDGFSEIYDIYFPKLLRFTRTYLISEDESENIVQEIFIYLWEHRDIIETLQNLNAYLF TLAKNRCIDYFRKEMVREVRKGSLSEIENRELQLKLYSLEAFDNDRLSDADIEEILNNAI 
NRLPERCREIFIMSRLQNLRYKEIAEKLNVSPNTVENQIVIALRKLKEDLKDYFPLFVFI I >gi|226332053|gb|ACIB01000003.1| GENE 119 152348 - 152494 65 48 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFHVVFMLVFHKDKSNIHKLVIMNFKSFYIFATCVINIASLAYNRIIV >gi|226332053|gb|ACIB01000003.1| GENE 120 152486 - 155023 2314 845 aa, chain + ## HITS:1 COG:SP0312 KEGG:ns NR:ns ## COG: SP0312 COG1501 # Protein_GI_number: 15900245 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-glucosidases, family 31 of glycosyl hydrolases # Organism: Streptococcus pneumoniae TIGR4 # 99 766 11 646 679 422 37.0 1e-117 MKHLTVYFLLVLQGVLFSFSLSAATPQNAIAYHDGNVRFSVLTDGVIRMEWNPSGSFTDG PSFVAIHRNLPVPKYTVQNKNATVVISTARMKLKYKKGQGPFTKDNLTITSAKGMFPFQW TPGTIQKGNLKGTDRTLDGFEGDHNIYSHQDMKLEDGLLSTDGWTLIDDSKNFLFDNSEW AWVKEREHKDGQDWYFMAYGHDYKAALKDFTVFAGKVPLPPRYAFGYWWSRYWSYSDKEM RQLVDNFHTYNIPLDVLVVDMDWHYTDPGFGGWTGWTWNRRLFPNPAKFLGYLKSNDLKI TLNLHPADGVAPYEEKYPEMAQWMGVDTAKQERIPWVVSDKRFINGMFNKVLRPMEKQGV DFWWLDWQQWMYDKKVDSLSNTWWINYVFFSDMERNRDTRPMLYHRWGGLGNHRYQIGFS GDAVISWKSLEFQPYFTNCASNVLYGYWSHDIGGHMFKGGDKEILDPELFTRWMQYGALT PVMRTHSTKNSVLNKELWNFKGDYFEALRNSILFRYQLAPYIYTMARETYDNGISICRPM YYDYPEAKEAYDFKSEYMFGDQILVAPITTPMQNGLSTVKVWLPEGNDWFEWTTGTLLKG GQIIERSFTLTEYPVYVKAGSVLPLYNRVKNLNSNSEEIIVNIFPGENGSFTFYEDNGND KYYANEYARTRMYTERKENHLTVTVGPRQGKYRNMPTDRQFKIKVLGSAIPETITINGNR AEYEYIGDELALLITIPQTACDQEKTIEIQYPTSIPELNDGIVSQFKRFSKAITALKYRD AGIVLTPAMGATEATSIALTYSPERFNELIETFKRNYSQMPEMLKEQKLNEANSQWFMKA IGWKK >gi|226332053|gb|ACIB01000003.1| GENE 121 155495 - 156955 1201 486 aa, chain + ## HITS:1 COG:NMA0050 KEGG:ns NR:ns ## COG: NMA0050 COG0753 # Protein_GI_number: 15793081 # Func_class: P Inorganic ion transport and metabolism # Function: Catalase # Organism: Neisseria meningitidis Z2491 # 6 483 11 488 504 726 73.0 0 MENKKLTAANGRPIADNQNSQTAGPRGPIMLQDPWLIEKLAHFDREVIPERRMHAKGSGA YGTFTVTHDITKYTRAAIFSQVGKQTECFVRFSTVAGERGAADAERDIRGFAMKFYTEEG NWDLVGNNTPVFFLRDPLKFPDLNHAVKRDPRNNMRSANNNWDFWTLLPEALHQVTITMS 
PRGIPASYRHMHGFGSHTYSFLNAENKRIWVKFHLKTMQGIKNLTDQEAEAIIAKDRESH QRDLYESIERGDFPKWKFQIQLMTEEEADNYRINPFDLTKVWPHKDFPLQDVGILELNRN PENYFAEVEQSAFNPMNIVEGIGFSPDKMLQGRLFSYGDAQRYRLGVNSEQIPVNKPRCP FHAFHRDGAMRVDGNYGSAKGYEPNSYGEWQDSPEKKEPPLKVHGDVFNYNEREYDDDYY SQPGDLFRLMPADEQQLLFENTARAMGDAELFIKQRHVRNCYKADPAYGTGVAQALGIDL EEALKE >gi|226332053|gb|ACIB01000003.1| GENE 122 157052 - 157738 759 228 aa, chain - ## HITS:1 COG:aq_1503 KEGG:ns NR:ns ## COG: aq_1503 COG0569 # Protein_GI_number: 15606658 # Func_class: P Inorganic ion transport and metabolism # Function: K+ transport systems, NAD-binding component # Organism: Aquifex aeolicus # 3 181 5 183 218 111 35.0 9e-25 MKYIIIGLGNYGHVLAEELSTLGHEVIGADLDEGRVDSIKDKIATAFVIDATDEQSLSVL PLNSVDMVIVAIGENFGASIRVVAMLKQKQVKHIYARAIDGVHKAVLEAFGLEKILTPEE DAARSLVQLLDFGTKMETFRVDSEYYVVKFNVPEKFVGYFVNELNLDEEFNLKLIGLKRS NTIKNCLGISLVEHKVVNELPEDAKIRPDDVLVCYGKYSDFQKLWKAL >gi|226332053|gb|ACIB01000003.1| GENE 123 157754 - 159589 1485 611 aa, chain - ## HITS:1 COG:BH0598 KEGG:ns NR:ns ## COG: BH0598 COG0168 # Protein_GI_number: 15613161 # Func_class: P Inorganic ion transport and metabolism # Function: Trk-type K+ transport systems, membrane components # Organism: Bacillus halodurans # 154 608 18 445 448 214 35.0 5e-55 MKIYHKFFLYQNKLLQPYVRLLLRLMAIITYLASIMLIVGVIYEHGFTLSAHEVTKIHLL YKTVWIIFLIDVTLHILLEYRDTKKNFRKLAWILSWLLYLTLIPVIFHRPEEGGAILHLW EFLHGYFYHIVLLLLFSLLNLSNGLVRLLGRRTNPSLILAASFLVIILIGAGLLMLPRCT VDGVTLSWVDALFTSTSAVCVTGLVPVDVSATFTPAGLTVIILLIQIGGLGVMTLTSFFA MFFMGNTSLYNQLVVRDMVSSNSLGSLLSTLLYILGFTLVIEGVGMLSIWFSIHGTLGMD VQDELAFSAFHSISAFCNAGFSTLPGNLGNPMVMTNHNWLYITVSLLIIFGGIGFPILVN FKDIIVYHARRFWRLIRTRQWDNHRVHHLFNLNTKIVLIMTVLLLVFGTVAIAIFEWNHS FAGMSIADKWTQAFFNATCPRTAGFSSVDLTSLSIQTIMLYIVLMWIGGAAQSTAGGVKV NAFAVASLNLIAVLRGTERVEVFGRELSHDSIRRSNAAVVVSLGILFVFIFILSILEPKM SIMTLTFECVSALSTVGSSLNATPLLCDESKLLVSLLMFIGRVGFITLVLGIVKQKKNTK YRYPSDNIIIN >gi|226332053|gb|ACIB01000003.1| GENE 124 159725 - 161092 1381 455 aa, chain + ## HITS:1 COG:TM0539 KEGG:ns NR:ns ## COG: TM0539 COG1350 # 
Protein_GI_number: 15643305 # Func_class: R General function prediction only # Function: Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) # Organism: Thermotoga maritima # 10 426 7 421 422 483 57.0 1e-136 MSDKKKRYMLPEEEIPHYWYNIQADMVNKPMPPLHPGTKQPLKAEDLYPIFAEELCRQEL NQTDQWIEIPEEVREMYKYYRSTPLVRAYGLEKALGTPAHIYFKNESVSPMGSHKLNSAI PQAYYCKKEGVQNVTTETGAGQWGASLAYAAKLFGLEAAVYQVKISYEQKPYRRSIMQTY GAQVTPSPSMSTRAGKDILTAHPNHQGSLGTAISEAIELAQTTPNCKYTLGSVLSHVTLH QTVIGLEAEKQMAMAGEYPDMVIACFGGGSNFGGIAFPFMRHNILEGKKTRFIAAEPASC PKLTRGKFQYDFGDEAGYTPLLPMFTLGHNFAPANIHAGGLRYHGAGVIVSQLLKDKLME AVDISQLESFEAGCLFAQVEGIIPAPESCHAIAATIREANKCKESGEEKVILFNLSGHGL IDMASYDKYLSGDLVNYSLTDDDIQKNLDEIGNLA >gi|226332053|gb|ACIB01000003.1| GENE 125 161221 - 162057 909 278 aa, chain + ## HITS:1 COG:STM2203 KEGG:ns NR:ns ## COG: STM2203 COG0648 # Protein_GI_number: 16765533 # Func_class: L Replication, recombination and repair # Function: Endonuclease IV # Organism: Salmonella typhimurium LT2 # 1 276 1 278 285 392 68.0 1e-109 MKYIGAHVSASGGVEFAPVNAHEIGANAFALFTKNQRQWVSKPLTEDSIRLFKENCEKFG FAPEYILPHDSYLINLGHPEEEGLTKSRAAFLDEMQRCEQLGLKLLNFHPGSHLNKISVE ECLDRIAESINLALEKTKGVTAVIENTAGQGSNLGNEFWQLKYIIDRVEDKSRVGVCLDT CHTFTAGYDFLNDYDDVFGEFGEVVGFEYLRGMHLNDSKKELGSRVDRHDSIGKGLIGFA FFEKLMKDPRFDNMPLILETIDETLWPEEIAWLREQTQ >gi|226332053|gb|ACIB01000003.1| GENE 126 162051 - 163622 1192 523 aa, chain - ## HITS:1 COG:FN1727 KEGG:ns NR:ns ## COG: FN1727 COG0038 # Protein_GI_number: 19705048 # Func_class: P Inorganic ion transport and metabolism # Function: Chloride channel protein EriC # Organism: Fusobacterium nucleatum # 16 522 8 520 521 252 32.0 1e-66 MFGKSDDLKDMRKWRVWKLKLIDARLYFVSIFVGLLTGLVAVPYHYLLQLFFNTRRDFFD SHPHWYWHIPIFLSLWGILLFVMWLVKKMPLITGGGIPQTRGVINGRISYKHPFIELVSK FVGGVLALSAGLSLGREGPSVQIGSYVGCLVSKWGRVLAGERKQLLAAGAGAGLAAAFAA PLASSLLVIESIERFDAPKTAITTLLAGVVAGGVASMIFPINPYFQIDAISPGLTFFSQV KLFLLLAAVISVSGKIYSVITVWFKRLYPAIKHPAYVKMLYLLFIAYLISLTEVNLTGGG EQFLLGQAMHADTHIMWIVGMMILHFVFSVLSFSSGLPGGSFIPTLVTGGLIGQIVALIL 
VRQGIIGYENISYVMLICMSAFLVAVIRTPLTAIVLITEITGHLEVFYPSIVVGGLTYYF TEMLQIQPFNVTLYDDMINSPEFQEEKRYTLSVEVMSGSYFDGKTVNEIQLPERCEIINI HRDRKDIPPAKQKLVPGDQVQIEMDAQDIEKLYEPLVSMANIY >gi|226332053|gb|ACIB01000003.1| GENE 127 164034 - 165575 1013 513 aa, chain + ## HITS:1 COG:UU038 KEGG:ns NR:ns ## COG: UU038 COG2865 # Protein_GI_number: 13357594 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Ureaplasma urealyticum # 9 509 8 459 463 307 36.0 3e-83 MQIHNNTLIAECSAYDFKEMLERKKVKSWLKSVSAFANTDGGSLFYGVNDDGIIVGLENP QADADFISEMIKARLDPVPEIQLIPIEHEGHTLLEVKVKAGTLTPYYYYQDGTRTAYVRV GNESVECNSQQLLSLVLKGTHMTWDSLPTQVDAGKHSFIILANTFREQTHQEWNDKYLES FGLVTPDGKLTNAGLLFVDNCTVFQSRIFCTRWTGLYKDDAISSIEHRANLVLLLKYGID FIKNYTMSGWVKMPNYRLNLPDYSDRAIFEGLVNHLIHRDYTVMGGEVHIDIYDDRVELV SPGAMLDGTQIQDRDIYKVPSMRRNPVIADVFTQLDYMEKRGSGLRKMRELTEKLPNFLQ GKEPQYQTEATSFYTTFYNLNWGENGRIPVEEVANRVNSTLEKYPVNEESSVEKFGVNTK EFGVNEESSVEKFGVNADKFGDTSETQKKVSKTAQKIIDLVISDPSITADNMANKIGVTK RAIEKNIKSLRGMGILVHEGSDKAGYWRIIVKP >gi|226332053|gb|ACIB01000003.1| GENE 128 165694 - 166296 280 200 aa, chain + ## HITS:1 COG:no KEGG:BT_4629 NR:ns ## KEGG: BT_4629 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 200 35 234 234 366 99.0 1e-100 MLYAIAIIMLLAVIPISEYMAGGIQNSSNNYLLVLIFDIAVGYFCMYIAALLKFNVLKRK NQALENALTEKQQENVAILLEHQNEKQQALRQRELEWLTGKIKMFTEEEQEAILASALSF AEHDLIVAPSISIQPKETCSQQELMYFVCSAFYNMDKSRSEVVGFLYKVFPLYFPAGESA LAKKMPGLEKVKERREKECN >gi|226332053|gb|ACIB01000003.1| GENE 129 166329 - 169802 2053 1157 aa, chain + ## HITS:1 COG:FN0414 KEGG:ns NR:ns ## COG: FN0414 COG0553 # Protein_GI_number: 19703756 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 94 909 28 792 1014 135 25.0 7e-31 MNNTLIDNSIEALSMTTTLRKCLAIDGIKTVSIATGYWDIPGLAILQENLRSFIEKEGTT 
LRLLIGKDPYIYVNQLKSPKYKDANYPLDFIKTDIHELNVTEEYKDAIRLLLDYCTENND SKIQIRVFRKNENDETQFLHSKCYIFDGKSNGIGIGIVGSTNFTQKGLQGNAELNYIETA PNQVLSLDEIPGNKSHLIWFNEKWEISEPWNKQFLEQILKSAPITDDVEKERKSEQQSFT PYELYIKLLQIKFGDIVDKSLGQQIETYLPANIHKLEYQIEAVKRCIGIMHEHGGFMLAD VVGLGKTIIGTLIIKRFLSVPEDDGRERKVLVITPPAIQSGWKKTIAMFDKNSDEKIAPY IDFITTGRIGNVAEDESCEDDDDDSRDSGDFGGTLQEKNYGFIVIDESHKFRNSATLMYQ SLDELIQKICSNTGVYPYVGLLSATPQNNRPNDLKNQIYLFERNHNDSTLKKAESGNIER FFADVNREYESLIDRSNDIPADERHQRLDAVSKRLRDCVLSDILERRTRTDVEKYYKDDM ESQGLIFPKIVGPNNLEYIMDDELAQLFSDTMTIIAPTEAEKLQTDEWLKYFRYRAIEYF TDPANERKHTGRGNRGVNDVAKQLAIIMQILLVKRLESSFTAFTQSLLNLRRYTENMIKM WENDTIFVCPQIDVNKELDYETKTLKRGRRVSFNDCVEEIRAKITKLTEQGRNEKGQNAE YNRKDFKEEYYIQLKEDFRLISNLYDRWAKNPQDPKFDAFKENIKPELFNPQKNTSGKLV IFSEAIDTVQSLARAVKAKGYKALVITAANRDEMEHTIEENFDANYEGVWKDDYNVIITT EVLAEGVNLHRANVILNYDTPWNSTRLMQRIGRVNRIGSKEPFVYVYNFMPSAEGDAQIQ LVRKAHTKLQSFHVLFGEDSKIFSEEESVVHYDIAKAVEGEESPLQKYVYELKQYKDTHP ERYLQIEQADKDWQIAQAASGTAYFIVKAPHSARLAIRIRTEAEGLYNAKIISLLELLED MRVKENAKRVPLPDNWRQLSAEAIKTYNQYFVRINKSRAGDKATAAKEMLVKINNTPSLS LQSKILLKNARKFIDRGSFDIIKKVLAIGQELEERGLRLFAIEQQDIDEILEREIGKLVA HVESKQGEASIVLGTIK >gi|226332053|gb|ACIB01000003.1| GENE 130 169815 - 171756 1243 647 aa, chain + ## HITS:1 COG:TVN0681 KEGG:ns NR:ns ## COG: TVN0681 COG1002 # Protein_GI_number: 13541512 # Func_class: V Defense mechanisms # Function: Type II restriction enzyme, methylase subunits # Organism: Thermoplasma volcanium # 205 542 152 467 1007 166 33.0 2e-40 MNKESLKEYLSSRYQGWSSFLNNVIFPIFGEDDFEDGFETELLESQPERRQLAEATGIRS IKQVGMMYVGVEPLQIFDVTVSDRVMMEHNRVNIQRLIRAVMDQFSCAFMLFHYEDDTRW DWRFTYCRKSGNKEESTNSKRYTFLLGPGQSCRTATDNFIALYDKRNSLEIKDIENAFNV EALSKEFFGKYKAQYEAFVNYMVDPTNGMRQHFIDTGFDHTGMAADKIRDREEKPIRDYV KKLLGRIVFLHFLQKKGWLGVPASKEWGEGDRDFMLNIFKNANERQKENFLDDILEDLFT EGLDRNRSDQGDLYDTKVEGFRNCRIPYLNGGLFERDILDKKPSHFPASYFNGLLTMLSQ YNFTIDENDPNDAEVGVDPEMLGRIFENLLEDNKDKGAFYTPKEIVQYMCRESLIAYLQT DMREEDKECIRQFVTTHDASQLGELKEYIDQKLYDVKICDPAIGSGAFPMGLLRELFFCR 
SAIEPNIVENAANIKRHIIQNNIYGVDIERGAVDIARLRFWLSLIVDEKSPEALPNLDFK IMQGNSLLEQYKGVDLSTMTEKKIGAGESLTFFDSMLDVYRKNLRDKLTEYYACPEHDKK MQLRKDIADIVKQELVEQGIHIDFEDMDLSANSQFFLWHTWFHDVFS Prediction of potential genes in microbial genomes Time: Tue May 17 22:00:22 2011 Seq name: gi|226332052|gb|ACIB01000004.1| Bacteroides sp. 3_2_5 cont1.4, whole genome shotgun sequence Length of sequence - 2064 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 2, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 67 - 126 4.2 1 1 Tu 1 . + CDS 356 - 1015 181 ## BT_4626 hypothetical protein + Term 1017 - 1062 10.3 - Term 1005 - 1048 10.3 2 2 Tu 1 . - CDS 1074 - 2063 263 ## BT_4627 DNA modification methylase Predicted protein(s) >gi|226332052|gb|ACIB01000004.1| GENE 1 356 - 1015 181 219 aa, chain + ## HITS:1 COG:no KEGG:BT_4626 NR:ns ## KEGG: BT_4626 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 219 47 265 265 420 99.0 1e-116 MHRRGNEFELQRLIDNLQFVDVKGQTLYGRIPKIGSEIEKTILNKLFNYTKLGALIKTSG SPIIYRFAGGRYFKVVTNYSTGSSAERTIYFANSKIADAVGCILSSSLSFWFYQIFSDNL NWKTYEIENFTVPQLSAEDIDYLDKLYSRYLSDIEAKANIRITSGESTYNVDSFKEYKIV RSKAIIDEIDDYICPLYGLTQEETDFIKNYELEFRLAGE >gi|226332052|gb|ACIB01000004.1| GENE 2 1074 - 2063 263 329 aa, chain - ## HITS:1 COG:no KEGG:BT_4627 NR:ns ## KEGG: BT_4627 # Name: not_defined # Def: DNA modification methylase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 329 663 991 991 672 99.0 0 YGAKISSIDKACFKHIFTSAQTIPNIQKGSLDTFSLFIDLGYQILHTKGNAIFIVPLSVT ASDAMSGLHRLLINHCDEIYVSSYGDRPRRIFESAEQQVSIISFKKSSNKATRIMTTYIN KRYSDESLWLLLDDLKFVNALHHIRNGRIPKIGNEIELGILCKLERCVTTIKDVYKREGL PIYYRKAGGRYYKIITKIPTHSSAEGELKVREKYQSLVGAALSSNLFYWFWLIHSDWHNL RSSELEMFPIPFESFSDEELDKINTLYDTYLNDLYSKSQTTKTGLKCFFARQSKMHIDAI DKFIGEKYGLSEIEINFLINYDYQYRNAE Prediction of potential genes in microbial genomes Time: Tue May 17 22:00:39 2011 Seq name: gi|226332051|gb|ACIB01000005.1| Bacteroides sp. 
3_2_5 cont1.5, whole genome shotgun sequence Length of sequence - 35236 bp Number of predicted genes - 40, with homology - 38 Number of transcription units - 17, operones - 11 average op.length - 3.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 21 - 674 356 ## COG1961 Site-specific recombinases, DNA invertase Pin homologs - Prom 696 - 755 10.0 - Term 702 - 745 4.0 2 2 Op 1 . - CDS 790 - 1428 358 ## BT_4623 hypothetical protein 3 2 Op 2 . - CDS 1428 - 2327 381 ## BT_4622 mobilization protein BmgA 4 2 Op 3 . - CDS 2311 - 2691 256 ## BT_4621 mobilization protein BmgB 5 2 Op 4 . - CDS 2781 - 4583 1093 ## BT_4620 hypothetical protein 6 2 Op 5 . - CDS 4534 - 5130 346 ## BT_4619 hypothetical protein 7 2 Op 6 . - CDS 5130 - 5483 278 ## BT_4618 hypothetical protein - Prom 5683 - 5742 4.2 - Term 5682 - 5727 8.3 8 3 Op 1 . - CDS 5798 - 7060 573 ## COG4974 Site-specific recombinase XerD 9 3 Op 2 . - CDS 7060 - 7602 273 ## BT_4616 hypothetical protein 10 3 Op 3 . - CDS 7629 - 7841 101 ## 11 3 Op 4 . - CDS 7807 - 9726 2293 ## COG0443 Molecular chaperone - Prom 9807 - 9866 7.7 - Term 9938 - 9985 9.6 12 4 Op 1 . - CDS 10147 - 10671 254 ## BF1188 putative transcriptional regulator 13 4 Op 2 . - CDS 10679 - 12841 1472 ## COG1596 Periplasmic protein involved in polysaccharide export 14 4 Op 3 . - CDS 12851 - 12964 92 ## - Prom 13106 - 13165 6.9 + Prom 13117 - 13176 5.0 15 5 Op 1 33/0.000 + CDS 13374 - 14513 767 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component 16 5 Op 2 . + CDS 14510 - 15514 859 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component + Term 15623 - 15679 20.5 - Term 15660 - 15705 -0.9 17 6 Tu 1 . - CDS 15711 - 16502 774 ## COG3187 Heat shock protein - Prom 16542 - 16601 6.8 + Prom 16515 - 16574 8.0 18 7 Op 1 . + CDS 16627 - 17877 674 ## BF1183 hypothetical protein + Prom 17893 - 17952 5.6 19 7 Op 2 . 
+ CDS 17972 - 19165 1244 ## COG1748 Saccharopine dehydrogenase and related proteins + Prom 19169 - 19228 3.0 20 8 Op 1 . + CDS 19248 - 19694 374 ## COG1225 Peroxiredoxin 21 8 Op 2 . + CDS 19705 - 20748 1179 ## COG0468 RecA/RadA recombinase + Term 20766 - 20812 4.1 - Term 20747 - 20809 14.2 22 9 Tu 1 . - CDS 20814 - 21095 384 ## BF1212 hypothetical protein - Prom 21115 - 21174 2.4 23 10 Op 1 . + CDS 21180 - 21695 303 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 24 10 Op 2 . + CDS 21676 - 22203 458 ## BF1210 hypothetical protein 25 10 Op 3 . + CDS 22210 - 22893 566 ## BF1209 hypothetical protein + Term 22909 - 22980 24.0 - Term 22897 - 22966 21.3 26 11 Op 1 . - CDS 22982 - 23974 939 ## COG2855 Predicted membrane protein 27 11 Op 2 . - CDS 24054 - 24950 671 ## COG0583 Transcriptional regulator 28 11 Op 3 . - CDS 24975 - 25556 659 ## BF1206 hypothetical protein - Prom 25654 - 25713 6.7 + Prom 25561 - 25620 6.1 29 12 Tu 1 . + CDS 25779 - 28367 1916 ## PROTEIN SUPPORTED gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 + Term 28399 - 28438 8.2 30 13 Tu 1 . - CDS 28443 - 28652 128 ## BF1204 hypothetical protein - Prom 28756 - 28815 3.0 - Term 29097 - 29152 15.0 31 14 Op 1 . - CDS 29169 - 29585 534 ## BF1203 hypothetical protein 32 14 Op 2 . - CDS 29603 - 30220 234 ## COG0237 Dephospho-CoA kinase 33 14 Op 3 . - CDS 30210 - 31223 743 ## BF1168 hypothetical protein 34 14 Op 4 . - CDS 31247 - 31573 446 ## COG1862 Preprotein translocase subunit YajC 35 14 Op 5 . - CDS 31620 - 32546 998 ## COG0781 Transcription termination factor - Prom 32685 - 32744 10.9 + Prom 32464 - 32523 6.5 36 15 Op 1 . + CDS 32715 - 33101 498 ## BF1198 hypothetical protein + Prom 33167 - 33226 3.1 37 15 Op 2 22/0.000 + CDS 33246 - 33836 978 ## PROTEIN SUPPORTED gi|53712489|ref|YP_098481.1| 50S ribosomal protein L25/general stress protein Ctc + Term 33872 - 33923 14.4 + Prom 33856 - 33915 3.2 38 16 Op 1 . 
+ CDS 33943 - 34506 518 ## COG0193 Peptidyl-tRNA hydrolase 39 16 Op 2 . + CDS 34533 - 34958 365 ## COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) - Term 35014 - 35059 11.5 40 17 Tu 1 . - CDS 35068 - 35235 121 ## BF1194 hypothetical protein Predicted protein(s) >gi|226332051|gb|ACIB01000005.1| GENE 1 21 - 674 356 217 aa, chain - ## HITS:1 COG:ECs5249 KEGG:ns NR:ns ## COG: ECs5249 COG1961 # Protein_GI_number: 15834503 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinases, DNA invertase Pin homologs # Organism: Escherichia coli O157:H7 # 2 190 5 183 191 72 28.0 4e-13 MIYGYIRVSSDKQTVENQRFEISNFCKINELTIDDWIEETISGTKNYTKRQLGRLLRKVC KDDIIICSELSRLGRNLFMIMEILNICMTKECKVWTIKDNYRLGEDIQSKVLAFAFGLSA EIERNLISQRTKEALARKKSEGAMLGHCRGFRCRLNPKCANKHDYIVKELAKGTEKTVIS KRLKVSKTTLYRYLVYTGLHLPINCQQEGWEEYGIYH >gi|226332051|gb|ACIB01000005.1| GENE 2 790 - 1428 358 212 aa, chain - ## HITS:1 COG:no KEGG:BT_4623 NR:ns ## KEGG: BT_4623 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 206 1 206 215 350 86.0 1e-95 MKEDVVAANLANLRQSINELKVCIEKQNTTVPKPEQSVKADFNEKTIYTGIAKSFCTCWN EALSVVKKHIWQQQPNTLPFSLWFPKLIDLFKQKSKLLEYLYRHVCDYNQNRMTIETNTR CILKRQDEILVKINELKSPVTVIPPNISGLFICGYHIKLQYIVPIIAIIIIWAVAASASS MKFKEESFAHYSMYRAVKEQHQYLIETIKRGN >gi|226332051|gb|ACIB01000005.1| GENE 3 1428 - 2327 381 299 aa, chain - ## HITS:1 COG:no KEGG:BT_4622 NR:ns ## KEGG: BT_4622 # Name: not_defined # Def: mobilization protein BmgA # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 299 1 299 299 492 87.0 1e-138 MIAKIKTRVDFGGIVNYANDQKNKKKCATLLAHEGVCAISNKLIADSFCLQASMRPKVKS PVKHVSLAFSSQDISRFPDNEEGDALMVEIAKKWMEQMGIRNTQYIIARHHDTKHPHCHL VFNRIDNKGNLISDSNERIRNAKVCRTLTKEYGLYFAPKNSKARNKSRLRPHQLRKYNLR SSVLDARANSHSWNDFFSILKGLGIDMRFYHAENSDKIRGISFSQDEYSMAGSKLDRDLS FNSLCATLGNMAAELIIQPHQAITPSGGGGTNNEQGWRNDKNRDNERNEPFHKTTKRRR >gi|226332051|gb|ACIB01000005.1| GENE 4 2311 - 2691 256 126 aa, 
chain - ## HITS:1 COG:no KEGG:BT_4621 NR:ns ## KEGG: BT_4621 # Name: not_defined # Def: mobilization protein BmgB # Organism: B.thetaiotaomicron # Pathway: not_defined # 5 126 1 122 122 216 92.0 2e-55 MNMSIKEKKLGGRPKLASYQKRTKCFRVMFTENDYIYIQSKAEQAGLSVNEFCHQAAMDC QVCQRISPEMVSAIRDLSGIANNVNQIAHQMHIYGLETVKQQCFLIVSEVSRIITQVKNT CHDSED >gi|226332051|gb|ACIB01000005.1| GENE 5 2781 - 4583 1093 600 aa, chain - ## HITS:1 COG:no KEGG:BT_4620 NR:ns ## KEGG: BT_4620 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 221 600 1 380 380 747 94.0 0 MSKKIFSAQDWENVPSEIQQIHTPSIAPIYNKVEEDVESVVREIERRAIDIAPNYKDWVE LGFALVDGLGENGREYYHRISRFYPTYQREETDKQYTHCLHSKGQGITIRSFFHLANQAV ISLAPFSKEHLSILPNIQNGKTGKWIKSEEELPVFPECVFEHLPPFLNEVVNNSISVDDR DTILIGAIVCLSVCFHNVCGVYDERIVYPNLYLFVVADAGMGKGALTLCRELVAPINRHL HELSKRLEQEYKEAMNAYIKGKKDSGMTMLVEPPMRMLVIPANSSASSFLKILGDNDGIG LLFESEGDTLSQTLKSDYGNYSDVLRKAFHHELVSLSRRKDREYCEVANPRVSVALAGTP EQVRRLIPDAENGLMSRFCFYIIRFKRGIRNVFATSDISQSKNGMFKLLGDKFCHLHEEF VRQGNYSFSLPSDLQEHFIEYLSRVNEECCDEVDNKMQGVVRRMGLIAYRIMMVLTAIRH LDNVIHKSSSDETVQLVCHEFDYSIAMSICDTLLYHAVFIYQNLSGNQSKRLQPASQEIG VYARRNTLYNMLPETFTKKDYDATVLALGENGSTANKWIEAFIKDSKLCRIEQGKYRKIF >gi|226332051|gb|ACIB01000005.1| GENE 6 4534 - 5130 346 198 aa, chain - ## HITS:1 COG:no KEGG:BT_4619 NR:ns ## KEGG: BT_4619 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 198 2 199 199 397 92.0 1e-109 METTDYCFSFFRKPIQNIEPIRAVGIVDVYRYIIGHYAQPQTESLRSMLSSPESKRYKAT HFDYCTFSGLFRKRNEKELIMHSGLMCLDFDHVENIVELKQQLLNHEYFDTELLFVSPSG NGLKWIIPVDLKGWEHSRYFKAVANCIKATGLPSADISGSDVARSCFLPYDPQAYINHKY KDDVEENIFRPRLGECPF >gi|226332051|gb|ACIB01000005.1| GENE 7 5130 - 5483 278 117 aa, chain - ## HITS:1 COG:no KEGG:BT_4618 NR:ns ## KEGG: BT_4618 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 117 1 117 117 210 99.0 1e-53 MAYKVNSLEEMPNALSYLIESVEVLQSKVNALQHKQASSSPKWMDIDELCAYLPSHPAKQ 
TVYGWVSTKQIPVHKINKALAFLQSEIDDWLKNKSHKTQDDLMEEARRFVESKKIIR >gi|226332051|gb|ACIB01000005.1| GENE 8 5798 - 7060 573 420 aa, chain - ## HITS:1 COG:lin2069 KEGG:ns NR:ns ## COG: lin2069 COG4974 # Protein_GI_number: 16801135 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinase XerD # Organism: Listeria innocua # 214 414 76 294 297 68 30.0 3e-11 MKKALPNTKVTVKLRRSNYKEEWYLIIESYPVYKRGSTRASRVVESINRTISTPIWDKSS IARILPDGTFNYKPKRDLNGIIQCRSTIDQEACIYADNVRKLRQHEYDSAILYTDKENEI AAQNERSEQDFIKYFNRIISTRHPNSSDSIIVNWRRVGELLKMYSQGQPIPFKAISVKLL EDIKMFLLRAPMGGNKKGTISQNTASTYFSILKAGLKQAFIDEYLTVDISAKVKGITNIE KPRVALTMNEVQMLVGTPCKDDVLKRAFLFSILTGLRHSDIQTLKWKQIQQTSKGTWQAV VIQQKTKRPDYKPVIQQALQLCGIRPDDDEALVFEGLTDASWISRPLKVWIEASGIKKHI TFHCGRHSYASLLLENGVDIYTIKSLMGHTNVKTTQIYTHIVNEQKEKAANTLHIDNLDL >gi|226332051|gb|ACIB01000005.1| GENE 9 7060 - 7602 273 180 aa, chain - ## HITS:1 COG:no KEGG:BT_4616 NR:ns ## KEGG: BT_4616 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 180 1 180 181 324 97.0 7e-88 MPRGNYTIQRSCEECGKIFTPSTLVSKYCCPACSKRAYKKRQIAKEKEAIRQALIRRIPS CKGYLTVKEAMLIYGISKDVLYRMIRQGLIPSYNFGQRLTRLSRQYMDEHFKTKAGSRKR KKEALSFEPKDCYTIGEIAKKFHINDSSVFKHIRRHSIPIRQIGNYVYVPKSEIDKLYKS >gi|226332051|gb|ACIB01000005.1| GENE 10 7629 - 7841 101 70 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFRMLTLRKSNDSDKDTDNKQQAKACSSMSCTLFAFMGVHGCYVFSMGFFISYLRHVCDT SYLAIRYNLD >gi|226332051|gb|ACIB01000005.1| GENE 11 7807 - 9726 2293 639 aa, chain - ## HITS:1 COG:ECs0014 KEGG:ns NR:ns ## COG: ECs0014 COG0443 # Protein_GI_number: 15829268 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone # Organism: Escherichia coli O157:H7 # 1 639 1 635 638 684 60.0 0 MGKIIGIDLGTTNSCVSVFEGNEPVVIANSEGKRTTPSIVAFVDGGERKVGDPAKRQAIT NPTRTIFSIKRFMGENWDQVQKEIARVPYKVVKGDNNTPRVDIDGRLYTPQEISAMILQK MKKTAEDYLGQEVTEAVITVPAYFSDSQRQATKEAGQIAGLEVKRIVNEPTAAALAYGLD 
KAHKDMKIAVFDLGGGTFDISILEFGGGVFEVLSTNGDTHLGGDDFDQVIIDWLVQEFKN DEGADLTQDPMAMQRLKEAAEKAKIELSSSTSTEINLPYIMPVGGVPKHLVKMLTRAKFE SLAHNLIQACLEPCKKAMQDAGLSNSDIDEVILVGGSSRIPAVQKLVEDFFGKTPSKGVN PDEVVAVGAAVQGAVLTDEIKGVVLLDVTPLSMGIETLGGVMTKLIDANTTIPARKSETF STAADNQTEVTIHVLQGERPMAAQNKSIGQFNLTGIAPARRGVPQIEVTFDIDANGILKV SAKDKATGKEQAIRIEASSGLSKEEIEKMKAEAEANAEADKKEREKIDKLNQADSLIFQT ETQLKELGDKLPADKKAPIEAALQKLKDAHKAQDMTTIDSAMAELNTAFQAASAEMYAQS GAQGGAQAGPDMNAGQSNAGQNNGKQDDNVQDADFEEVK >gi|226332051|gb|ACIB01000005.1| GENE 12 10147 - 10671 254 174 aa, chain - ## HITS:1 COG:no KEGG:BF1188 NR:ns ## KEGG: BF1188 # Name: not_defined # Def: putative transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 174 1 174 174 347 100.0 8e-95 MNTNERPEMTEKQSCHWYLAFTASRAEQRVKQELDQRKVRNYLPLRKITYQWQGRSREAL CPQIARCVLIWTSLSDIRQLSGISGLIIPQNIWDYRVPEWQVESYQLLFSQMDTAVEWIP DCLESATMVRVTGGPLTGLVGELDTSDTGFRIRIRFHSMGCFRVAVPEEWIEKF >gi|226332051|gb|ACIB01000005.1| GENE 13 10679 - 12841 1472 720 aa, chain - ## HITS:1 COG:aq_505 KEGG:ns NR:ns ## COG: aq_505 COG1596 # Protein_GI_number: 15605977 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protein involved in polysaccharide export # Organism: Aquifex aeolicus # 24 506 55 528 725 157 29.0 9e-38 MGKKVLVFLLLLSFIGWTKAQQEELQNGTEQSKQLQVFGRNIFASRNLSFEPNLNIPTPE NYRLGPGDEVIIDVWGTSENTVRETISPEGSIMVENIGPIYLSGMNMEEAERYLRHEFSK IYAAISGESAHIKVTLGKIRSIMVNVMGEVEVPGTYRLSAFASVFHALYRAGGVNRIGSL RTIQVVRSGMKVADVDVYEYIMKGKLTDDIRLSEGDVILVSPYENLVGISGKVKRPMIYE MKHGESLATLIGYTGGFTGDAYRNTVRLVRRSGREKQIYNVDQQDYDNFILTDNDEVSVE AVLGRFSNKVEIHGAVYRAGMYQLDSVTGTIKRLIQQAEGLRGDAFLNRALLRREREDLS HEMIPVDLKKLMAGTAPDLPLQKNDVLYISSIKELEKEGVLFIYGDVAKPGYFPFARNMS VQDLILKAGGLLESASTVRIDVSRRIKDPKSVSSSTVIGKSFTVELKNGLLIGESNTLKL EPYDMVFVRRSPGYQKQANVTVNGEVTFTGNYALTKKNERLSDLIAKAGGLSKSAYAKGA RLMRRMTADEIRQKQDAVRFATKGTGKDSVSLSSLEVDQTYSVGIELEKALAKPKSDEDL VLREGDVLFVPKYVSTVTVNGAVMYPNTVLYQKGSGIDYYIGQAGGFGNRALKRRAYVVY 
MNGTVSRLRRNTANAIEPGCEIIVPSKGERKKMTTAGAVGMSSSIASIAAMVASMVSLTK >gi|226332051|gb|ACIB01000005.1| GENE 14 12851 - 12964 92 37 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVLLGCILGSVILGLLACLIWILYDFRRFKKKNNFIE >gi|226332051|gb|ACIB01000005.1| GENE 15 13374 - 14513 767 379 aa, chain + ## HITS:1 COG:alr4031 KEGG:ns NR:ns ## COG: alr4031 COG0614 # Protein_GI_number: 17231523 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Nostoc sp. PCC 7120 # 6 376 39 424 426 249 34.0 5e-66 MTRFSFVLFALISLLLTACQDKQKVVSNTENGTSVPLSYAQGFGITHSSGYTTITVYNPW KPGEIYDTYYLVKNEGQAVPTDGLKVVIPLKSIMTNSATHLGFLELLGELDKVTGVCNSN YIYNPTILQGVKDGTIKDLGDAFNLDIENLLLLHPQAVMTSAYNAEDENSRRMKQTGLPI LYNIEWQEKSILGRAEWIKFIGAFFDKEKLADSIFSQIAEQYNEIKKKAEKLSYAPSILS GQDYRGTWSMPSGRSYNAQLFRDAGANYYYANDTTVSGSISSSIEEALIHFNQADIWVGV QANTLEDLGKMDSKYKLFKSYKNGNVYHINKRTNITGGNDYWESGVARPDLLLNDMIKII HPSLLPEYELTYMDKLKSK >gi|226332051|gb|ACIB01000005.1| GENE 16 14510 - 15514 859 334 aa, chain + ## HITS:1 COG:alr4032 KEGG:ns NR:ns ## COG: alr4032 COG0609 # Protein_GI_number: 17231524 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Nostoc sp. 
PCC 7120 # 8 330 27 355 362 236 48.0 4e-62 MKHYRLFLLLIGLLVLAFLADIAIGSVSLSIRDVWNTFIGSNDNLIYREIILNHRLPKAL TAILAGASLSVAGVLMQTLFHNPLAGPDVLGVTSGASLGVALLTLGTSSLPLWLITGWGQ VTAAIIGAIGVLLLVIIVSIKIPQTISLLIIGMMFGNFAGAIVSILQSMSNPDTLKLFIT WTFGSLSSVGWEQMSVMAPVIACGILTALLLQKQLNILLLGKNYANGLGVSVPRLRLWII LATALLAGTSTAFTGPIAFIGITMPHVARGLFGSPNHRIILPTSMLCGAITLLVCDLISQ LPGMQGTLPINAVTALFGSPIIVWIILRNSYINK >gi|226332051|gb|ACIB01000005.1| GENE 17 15711 - 16502 774 263 aa, chain - ## HITS:1 COG:DR1940 KEGG:ns NR:ns ## COG: DR1940 COG3187 # Protein_GI_number: 15806938 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Heat shock protein # Organism: Deinococcus radiodurans # 16 222 171 365 403 82 29.0 1e-15 MKKVLLSICMVSAVFAMSSCGSTKEAASLSSLNGEWNIIEVNGSAIVPAENQELPFIGFD TATGKVYGNSGCNRMMGSIDLNSKPGTIDMSRLGSTRMACPDMTTEQNVLNALGQVKSYK KLGKHNMALCNASNRPVVVLQKKASDVKLSALNGEWKIEEVNGEAIPSGMEKQPFINFDV KKKSIHGNAGCNLINGGFETDKENPRSISFPNVISTMMACPDMEVEGKVMKAINEVKSFD VLSGGGIGFYSADGTLVMVLVKK >gi|226332051|gb|ACIB01000005.1| GENE 18 16627 - 17877 674 416 aa, chain + ## HITS:1 COG:no KEGG:BF1183 NR:ns ## KEGG: BF1183 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 21 416 21 416 416 756 98.0 0 MKRRLRTLLLLLLIATRTLLAQNSHFASSSSPDSLSPDEETDFITTHFSLKQLCKWTPGM KFMFIPDSSDEFVPILCKYEDGKEVDNDLLKSKTLEYTGSEETVHETYIGKIYTSRFIFQ CEDHKYYYEMKDVKLNDLCDQNPYASIPALVYLQDVNKAKELLIGKTLYTRTTIAKTDDA NSYSGYREVNIAKGEPVKITTIDVGNKSFPVKITFIDRKGVSYYIDVAMSRINSGMEPAD FQAEKRINYFPNAFSFTNPDVKTRESIQSKYIGQSVYPQKTIRVKQTELLRYTPLHIKDV QPEKAGTSATLLLTDIQGNTYQVKVDLKYDPILKNEDFIEDLFGFSDIRKKYPNISESNW LMLAKGEVKPGMTTEECKLAIGEPIEIRVRTDTRFETWLYRGKILEFENGILLRAK >gi|226332051|gb|ACIB01000005.1| GENE 19 17972 - 19165 1244 397 aa, chain + ## HITS:1 COG:slr0049 KEGG:ns NR:ns ## COG: slr0049 COG1748 # Protein_GI_number: 16331467 # Func_class: E Amino acid transport and metabolism # Function: Saccharopine dehydrogenase and related proteins # Organism: Synechocystis # 1 393 1 392 398 
563 66.0 1e-160 MGRVLIIGAGGVGTVVAHKVAQNADVFTDIMIASRTKSKCDDIVKAIGNPNIKTAQVDAD NVDELVALFNDFKPEMVINVALPYQDLTIMEACLKAGVNYLDTANYEPKDEAHFEYSWQW AYHERFKEAGLTAILGCGFDPGVSGIYTAYAAKHYFDEIQYLDIVDCNAGNHHKAFATNF NPEINIREITQNGRYYENGQWVTTGPLEIHKDLTYPNIGPRDSYLLYHEELESLVKNFPT IKRARFWMTFGQEYLTHLRVIQNIGMARIDEIDYNGQKIVPLQFLKAVLPNPQDLGENYE GETSIGCRIRGLKDGKERTYYVYNNCSHEEAYKETGMQGVSYTTGVPAMIGAMMFFKGEW KRPGVNNVEEFNPDPFMEQLNKQGLPWHEVFDGNLEL >gi|226332051|gb|ACIB01000005.1| GENE 20 19248 - 19694 374 148 aa, chain + ## HITS:1 COG:HI0254 KEGG:ns NR:ns ## COG: HI0254 COG1225 # Protein_GI_number: 16272212 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peroxiredoxin # Organism: Haemophilus influenzae # 1 148 4 149 155 143 45.0 1e-34 MNVGDKAPELLGINEKGEEVRLNNYKGRKIVLYFYPKDNTSGCTAQACSLRDNYAELRKA GYEVIGVSVDNEKSHQKFIEKNNLPFTLIADTDKKLVEQFGVWGEKKLYGRAYMGTLRTT FLINEEGVIERIIGPKEVKTKEHASQIL >gi|226332051|gb|ACIB01000005.1| GENE 21 19705 - 20748 1179 347 aa, chain + ## HITS:1 COG:mlr0030 KEGG:ns NR:ns ## COG: mlr0030 COG0468 # Protein_GI_number: 13470353 # Func_class: L Replication, recombination and repair # Function: RecA/RadA recombinase # Organism: Mesorhizobium loti # 21 346 15 348 365 429 66.0 1e-120 MAKKEDELNFETDNNKMASSEKLKALQAAMDKIEKSFGKGSIMKMGEEVVEQVEVIPTGS IALNAALGVGGYPRGRIIEIYGPESSGKTTLAIHAIAEAQKAGGIAAFIDAEHAFDRFYA AKLGVDVDNLFISQPDNGEQALEIAEQLIRSSAIDIIVVDSVAALTPKAEIEGDMGDNKV GLQARLMSQALRKLTSAVSKTRTTCIFINQLREKIGVMFGNPETTTGGNALKFYASVRLD IRGSQQIKDGEEVIGKQTKVKVVKNKVAPPFRKAEFDIMFGEGISHSGEIIDLGADLGII KKSGSWYSYNDTKLGQGRDAAKQCIADNPELAEELEGLIFEKLREHK >gi|226332051|gb|ACIB01000005.1| GENE 22 20814 - 21095 384 93 aa, chain - ## HITS:1 COG:no KEGG:BF1212 NR:ns ## KEGG: BF1212 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 93 1 93 93 158 100.0 5e-38 MKIFGEKDVLLMTELKIDTMSQNETTKLDIIVEVLGEREPEIRRLVILDDRLRMFAESND ENGPGIPIELVAEWAMLLNKYYPLALEKRNMMN >gi|226332051|gb|ACIB01000005.1| GENE 23 21180 - 21695 303 171 aa, chain + 
## HITS:1 COG:mll3697 KEGG:ns NR:ns ## COG: mll3697 COG1595 # Protein_GI_number: 13473184 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 3 158 5 158 183 66 30.0 3e-11 MDTESFKREFLPYHRKLYCVAYRLLENAADAEDLVQEAYLKLWDKREGLSVISNPEAFSV TLVKNMCFDLLRSGKYVLSRQSVELSAAQDVFQPDNLEAREGVRQIKDIIAHLPEQQQRI INMRDIKGCSYEEIEQVTGLNSINVRVLLSRARKKIREEFNKWNNYESRRN >gi|226332051|gb|ACIB01000005.1| GENE 24 21676 - 22203 458 175 aa, chain + ## HITS:1 COG:no KEGG:BF1210 NR:ns ## KEGG: BF1210 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 175 1 175 175 341 100.0 7e-93 MKVEEIERLLAEFYEGTTTESQEEVLRNYFRTTEVPGHLLKDKEIFLNLCPDADQDIEVP AHLEDKLNLLIDEMAEKEQHFFRPNNSKNSWHWIGGVAATILLLIGIGYGIDNLSKNVCP PTPQDTFSDPEEAYRMLQATLLEISANLNYGLNEVKESQIDMRKIHQEVRNEIKK >gi|226332051|gb|ACIB01000005.1| GENE 25 22210 - 22893 566 227 aa, chain + ## HITS:1 COG:no KEGG:BF1209 NR:ns ## KEGG: BF1209 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 227 1 227 227 379 100.0 1e-104 MKTMKFILACVLLLSPLLCQAQKNLFNKYNDMKGVSSVYISKAMMELNPNLFMKDLYIGK VAEHLNSVQVLSTHDNKVREEMAKDIRSLVQSSKYELLMKQKSTVSGSEVYVNRKGSKVK ELIMVMNGASSLKFVYMEGDMTTDDIKKLMLYQSTSQNFIISGDLFYANNKPVTYFKKEN SDNQKDMAELSGTYNLNSIDMNYKEELSTLNDKLKRIEQGLKNINIK >gi|226332051|gb|ACIB01000005.1| GENE 26 22982 - 23974 939 330 aa, chain - ## HITS:1 COG:SPy1056 KEGG:ns NR:ns ## COG: SPy1056 COG2855 # Protein_GI_number: 15675048 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Streptococcus pyogenes M1 GAS # 45 325 34 331 339 140 35.0 3e-33 MVSAITKTLRTNNKTVYVSLLSILTFFLMLDYIPGLQAFSTWVTPPLALFLGLAFALTCG QAHPKFNKKTSKYLLQYSVVGLGFGMNLHSALASGKEGMEFTIVSVIGTLILGWFIGRKF LKVDRNTSYLISSGTAICGGSAIAAVGPVVKANDSEMSVALATIFILNALALFIFPVIGH ALNMSQHEFGTWAAIAIHDTSSVVGAGAAYGEEALKVATTIKLTRALWIIPMAFATSFIF KSKGQKISIPWFIFFFVLAMIVNTYLLGSVPELGAAINGLARKTLTITMFFIGASLSLDV 
VKSVGIKPLIQGVLLWVVISLSTLAYIYWF >gi|226332051|gb|ACIB01000005.1| GENE 27 24054 - 24950 671 298 aa, chain - ## HITS:1 COG:aq_638 KEGG:ns NR:ns ## COG: aq_638 COG0583 # Protein_GI_number: 15606065 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Aquifex aeolicus # 5 298 6 299 303 137 29.0 2e-32 MSDFRLKVFLSVAKNLSFTKASQELFVSQPAITKHIQELETCYQVRLFDRQGNKISLTEA GKLLQEHSEKILEDYKRLEYEMHLLHNEYIGDLKLGASTTISQYVLPPLLANFIAKFPQV NLSLLNGNSREIEAALQEHRIDLGLVEGICRLPNLRYTTFLQDELVAVVHTGSKLSLPDE ITPEDLSRIPLVLRERGSGTLDVFERALSEHNMKLSSLNVLLYLGSTESIKLFLEHTDCI GIVSIRSISRELLSGTFRVIEIKGMPMLREFCFAQPQGQESGLSQVLMQFAMHHNKKL >gi|226332051|gb|ACIB01000005.1| GENE 28 24975 - 25556 659 193 aa, chain - ## HITS:1 COG:no KEGG:BF1206 NR:ns ## KEGG: BF1206 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 193 1 193 193 351 100.0 7e-96 MDEKIKFPSNVVLIDAAFLNLVVTDLKKYFEKTLMRELQEIDLSELVTYIVLDAGMAVGD NQIQILMVYDKDSAQLSNCRPSDLSAELNGVAFKSQFGEFSFASVPCEEMVSREELYLDL LSIVLDSADVERLILVSFNEEYGDKVMERLKGVKNKETIQFRMNEPEESIEGYQWEMLAY PVMQALGIRGEEL >gi|226332051|gb|ACIB01000005.1| GENE 29 25779 - 28367 1916 862 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 [Bacillus selenitireducens MLS10] # 1 861 1 810 815 742 47 0.0 MNFNNFTIKSQEAVQEAINLAQSRGQQAIETAHILYGVMKVGENVTNFIFQKLGLNGQQI SLVLDKQIDSFPKVSGGEPYLSREANEVFQKATQYSKEMGDEFVSLEHLLLALLTVKSTV STILKDAGMTEKELRGAISELRKGEKVTSQSSEDNYQSLEKYAINLNEAARSGKLDPVIG RDEEIRRVLQILSRRTKNNPILIGEPGTGKTAIVEGLAHRILRGDVPENLKNKQVYSLDM GALVAGAKYKGEFEERLKSVVNEVKKSEGNIILFIDEIHTLVGAGKGEGAMDAANILKPA LARGELRSIGATTLDEYQKYFEKDKALERRFQIVQVDEPDNLSTISILRGLKERYENHHH VRIKDDAIIAAVELSSRYITDRFLPDKAIDLMDEAAAKLRMEVDSVPEGLDEISRKIKQL EIEREAIKRENDEPKLQTIGKELAELKEQEKSYKAKWQSEKSLMDIIQQNKVEIENLKFE ADKAEREGNYGKVAEIRYGKLQELHKEIEDTQKKLHEMQGDTAMIKEEVDAEDIADVVSR WTGIPVSKMMQSEKDKLLHLEEELHQRVIGQDEAIAAVSDAVRRSRAGLQDPKRPIGSFI FLGTTGVGKTELAKALAEFLFDDETMMTRIDMSEYQEKHSVSRLVGAPPGYVGYDEGGQL 
TEAIRRKPYSVVLFDEIEKAHPDVFNILLQVLDDGRLTDNKGRVVNFKNTIIIMTSNMGS SYIQSQMEKLNGANNEEVVEETKKEVMNMLKKTIRPEFLNRIDETIMFLPLTEKDIKQIV LLQIKSVQKMLAGNGVELELTDAALDFLSQVGYDPEFGARPVKRAIQRYLLNDLSKKLLA QEVDRSKAIIVDAQGDGLVFRN >gi|226332051|gb|ACIB01000005.1| GENE 30 28443 - 28652 128 69 aa, chain - ## HITS:1 COG:no KEGG:BF1204 NR:ns ## KEGG: BF1204 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 69 11 79 79 133 100.0 2e-30 MKLDDFTGVLSLEHLDVNTMVYLYSEQGELIGKIHSTKSSATFTLPQKGMYVLVIHCLSY PVEVRRVIY >gi|226332051|gb|ACIB01000005.1| GENE 31 29169 - 29585 534 138 aa, chain - ## HITS:1 COG:no KEGG:BF1203 NR:ns ## KEGG: BF1203 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 138 1 138 138 249 100.0 2e-65 MLKTILSISGKPGLYKLISQGKNMLIVETIDAAKKRFPAYGNEKIISLADIAMYTNDSEV PLRDVLRSIKEKENAAIASIDVKKATSEQLREYLAEVLPDFDRDRVYTNDIKKLILWYNI LVSNGITDFGEETAVEAE >gi|226332051|gb|ACIB01000005.1| GENE 32 29603 - 30220 234 205 aa, chain - ## HITS:1 COG:DR1892 KEGG:ns NR:ns ## COG: DR1892 COG0237 # Protein_GI_number: 15806892 # Func_class: H Coenzyme transport and metabolism # Function: Dephospho-CoA kinase # Organism: Deinococcus radiodurans # 4 177 15 188 207 96 34.0 2e-20 MAIKIGITGGIGSGKSVVSHLLEVMGVPVYISDEESKKVVATDPVIRKELCDLVGEEVFS GGKLNKTLLATYLFASSTHASQVNGIIHPRVKEHFRQWSSHKECLDIIGMESAILIESGF ADEVDCIVMVYAPLELRVERAVRRDNASCEQIMQRIRSQMSDEEKCERASFVIINDGEKP LIPQILELIAFLYQKIHYLCSAKNN >gi|226332051|gb|ACIB01000005.1| GENE 33 30210 - 31223 743 337 aa, chain - ## HITS:1 COG:no KEGG:BF1168 NR:ns ## KEGG: BF1168 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 337 1 337 337 607 100.0 1e-172 MFERRNIKYIYLKLSRKIKDFLLSDKSREFLIFLFFFFIAGGFWLLQTLNNDYEAEFSIP VRLKGVPNHVVLTSEPPSELRIKVKDKGTVLLNYMLGKSFFPVNIDFSESKVPDNHVKIY ASELEKKIAGQLNVSTRLLSVKPDTLEYIYSTGKSKLVPVKLEGKVVAGRQYYISDTIYS PDSVLVYAPVAILDTITAAYTQKVNFENVMDTLKQRIALAGVKGAKFVPGAIDLTLPVDI 
YTEKTVEVLLRGINFPADKVLRAFPSKVQVTFQVGLSRFREVNASDFVVNVSYEELLKLG TDKYTVKLKSLPRGVSHVRIHPEQVDFLIEQLSSDGN >gi|226332051|gb|ACIB01000005.1| GENE 34 31247 - 31573 446 108 aa, chain - ## HITS:1 COG:XF0224 KEGG:ns NR:ns ## COG: XF0224 COG1862 # Protein_GI_number: 15836829 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit YajC # Organism: Xylella fastidiosa 9a5c # 1 94 5 107 120 72 38.0 2e-13 MNLLTVFFQAPAAGPDGSLMWIMLIAMFVIMYFFMIRPQNKKQKEIANFRKSLQVNQNVI TAGGIHGVIKEINDDYIVLEIASNVKIKIDKNSIFADASAANSQSATK >gi|226332051|gb|ACIB01000005.1| GENE 35 31620 - 32546 998 308 aa, chain - ## HITS:1 COG:TM1765 KEGG:ns NR:ns ## COG: TM1765 COG0781 # Protein_GI_number: 15644510 # Func_class: K Transcription # Function: Transcription termination factor # Organism: Thermotoga maritima # 187 298 22 133 142 62 34.0 8e-10 MINRVLIRLKIIQIVYAYYQNGSKNLDSAEKELFFSLSKAYDLYNYLLMLMIALTEYAQK RIDTAKAKLAPTKEELYPNMKFVENKFVAQLEVNKQLSEFIANQKRTWANDQDFIKELYE KIIASDIYKEYMASSDKSYEADRELWRKLYKTFVFNNDSLDQVLEDQSLYWNDDKEIVDT FVLKTIKRFEEKQGANQPLLPEFKDDEDQEFARRLFRRAILNADYYRHLISENTKNWDLD RVAFMDVIIMQCALAEILSFPNIPVSVSLNEYVEIAKLYSTVKSGSFINGTLDGIVNQLK KEGKLTKN >gi|226332051|gb|ACIB01000005.1| GENE 36 32715 - 33101 498 128 aa, chain + ## HITS:1 COG:no KEGG:BF1198 NR:ns ## KEGG: BF1198 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 128 1 128 128 187 100.0 1e-46 MEDFKKKIGTDMNDKEIVFSKSIKAGKRIYYLDVKKNRKDEMFLAITESKKVVMGEGDDS QVSFEKHKIFLYKEDFGKFMAGLEQAINFINQNQEYTEDSESEEKVEPESEPETTVLDSE IKIDIDFE >gi|226332051|gb|ACIB01000005.1| GENE 37 33246 - 33836 978 196 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53712489|ref|YP_098481.1| 50S ribosomal protein L25/general stress protein Ctc [Bacteroides fragilis YCH46] # 1 196 1 196 196 381 100 1e-105 MRSIEVKGTARTIAERSSEQARALKEIRNNGGVPCVLYGGEEVVHFTVTNEGLRNLVYTP HIYVVDLVIDGKKVNAILKDIQFHPVKDTILHVDFYQIDEAKPIVMEVPVQLEGLAEGVK AGGKLALQMRKLKVKALYNIIPEKLTINVSHLGLGKTVKVGELSYEGLELLNAKEAVVCA 
VKLTRAARGAAAAAGK >gi|226332051|gb|ACIB01000005.1| GENE 38 33943 - 34506 518 187 aa, chain + ## HITS:1 COG:BH0068 KEGG:ns NR:ns ## COG: BH0068 COG0193 # Protein_GI_number: 15612631 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Peptidyl-tRNA hydrolase # Organism: Bacillus halodurans # 4 185 3 185 185 140 40.0 2e-33 MKYLIVGLGNIGPEYHETRHNIGFMVLDALARANNLSFTDGRYGFTTTLSVKGRQMILLK PSTFMNLSGNAVRYWMQKENIPLENVLIIVDDLALPFGTLRLKSKGSDAGHNGLKHIATI LGTQNYARLRFGIGNDFPRGGQIDFVLGHFTDEDWKTMDERLETAGEIAKSFCLAGIDIT MNQFNKK >gi|226332051|gb|ACIB01000005.1| GENE 39 34533 - 34958 365 141 aa, chain + ## HITS:1 COG:Cgl2072 KEGG:ns NR:ns ## COG: Cgl2072 COG1188 # Protein_GI_number: 19553322 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) # Organism: Corynebacterium glutamicum # 5 121 10 122 126 91 41.0 5e-19 MPEARIDKWMWAVRIFKTRTIAAEACKKGRISINGSFVKAARMIKPGDVIQVKKPPITYS FKVLQAIEKRVGAKLVSEMMENVTTPDQYELLEMSKISGFIDRARGTGRPTKKDRRSIEE FTTPEFMDDFDFDFDFEEDNE >gi|226332051|gb|ACIB01000005.1| GENE 40 35068 - 35235 121 55 aa, chain - ## HITS:1 COG:no KEGG:BF1194 NR:ns ## KEGG: BF1194 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 55 203 257 257 110 100.0 1e-23 YFVLPALYIGAQIDPFAYTYNKTTYNPQAGLGDLSADSHNYSVLAAPTFKIGFKF Prediction of potential genes in microbial genomes Time: Tue May 17 22:01:50 2011 Seq name: gi|226332050|gb|ACIB01000006.1| Bacteroides sp. 3_2_5 cont1.6, whole genome shotgun sequence Length of sequence - 32805 bp Number of predicted genes - 34, with homology - 34 Number of transcription units - 16, operones - 10 average op.length - 2.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 6 - 1520 1592 ## BF3181 putative lipoprotein - Prom 1544 - 1603 2.6 + Prom 1492 - 1551 6.2 2 2 Tu 1 . 
+ CDS 1742 - 2107 259 ## BF1192 hypothetical protein + Term 2275 - 2312 1.4 - Term 2424 - 2470 9.1 3 3 Tu 1 . - CDS 2519 - 4678 2167 ## BF1158 putative alpha-glucosidase - Prom 4834 - 4893 7.6 + Prom 4629 - 4688 3.6 4 4 Op 1 . + CDS 4832 - 5215 358 ## COG0239 Integral membrane protein possibly involved in chromosome condensation 5 4 Op 2 . + CDS 5230 - 5895 498 ## COG1357 Uncharacterized low-complexity proteins + Prom 5963 - 6022 2.0 6 5 Tu 1 . + CDS 6043 - 7332 1532 ## COG0148 Enolase + Term 7396 - 7444 2.3 - TRNA 7503 - 7579 73.6 # Thr TGT 0 0 + Prom 7725 - 7784 5.2 7 6 Op 1 . + CDS 7923 - 8435 521 ## BF1187 hypothetical protein 8 6 Op 2 . + CDS 8462 - 9415 660 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 9 6 Op 3 . + CDS 9440 - 9967 425 ## BF1185 hypothetical protein 10 6 Op 4 . + CDS 9924 - 10676 612 ## BF1184 hypothetical protein 11 6 Op 5 . + CDS 10681 - 11265 380 ## COG1971 Predicted membrane protein 12 6 Op 6 . + CDS 11337 - 12353 907 ## COG1477 Membrane-associated lipoprotein involved in thiamine biosynthesis 13 7 Op 1 . - CDS 12690 - 13058 299 ## COG3169 Uncharacterized protein conserved in bacteria 14 7 Op 2 . - CDS 13072 - 13821 562 ## COG0037 Predicted ATPase of the PP-loop superfamily implicated in cell cycle control - Prom 13935 - 13994 6.3 + Prom 13699 - 13758 2.5 15 8 Op 1 . + CDS 13962 - 15389 1056 ## BF1145 hypothetical protein 16 8 Op 2 . + CDS 15396 - 15866 491 ## BF1177 hypothetical protein 17 8 Op 3 . + CDS 15892 - 16893 968 ## COG4864 Uncharacterized protein conserved in bacteria + Prom 16939 - 16998 2.1 18 9 Op 1 . + CDS 17020 - 17898 835 ## COG2820 Uridine phosphorylase 19 9 Op 2 . + CDS 17898 - 18425 468 ## COG0847 DNA polymerase III, epsilon subunit and related 3'-5' exonucleases 20 10 Op 1 . - CDS 18409 - 19059 591 ## COG2910 Putative NADH-flavin reductase - Prom 19084 - 19143 3.6 21 10 Op 2 . 
- CDS 19146 - 21053 1274 ## COG0642 Signal transduction histidine kinase - Prom 21094 - 21153 4.7 + Prom 21043 - 21102 4.8 22 11 Tu 1 . + CDS 21175 - 22572 951 ## COG0486 Predicted GTPase + Term 22617 - 22649 4.0 + Prom 22597 - 22656 6.3 23 12 Op 1 . + CDS 22820 - 23722 581 ## BDI_2123 transposase in mobilizable transposon, TnpA protein 24 12 Op 2 . + CDS 23807 - 24913 716 ## COG0582 Integrase 25 12 Op 3 . + CDS 24923 - 25288 263 ## BVU_3718 hypothetical protein + Term 25313 - 25364 10.6 + Prom 25314 - 25373 3.9 26 13 Tu 1 . + CDS 25554 - 26435 417 ## BDI_2125 mobilizable transposon, TnpC protein - Term 26373 - 26407 4.4 27 14 Op 1 . - CDS 26466 - 27323 215 ## COG3344 Retron-type reverse transcriptase 28 14 Op 2 . - CDS 27349 - 27564 173 ## gi|167761895|ref|ZP_02434022.1| hypothetical protein BACSTE_00238 - Prom 27795 - 27854 3.2 + Prom 27909 - 27968 2.4 29 15 Op 1 . + CDS 28030 - 28395 454 ## BDI_2127 excisionase in mobilizable transposon, Xis protein 30 15 Op 2 . + CDS 28407 - 29783 733 ## PG0871 hypothetical protein 31 15 Op 3 . + CDS 29861 - 30940 487 ## PG0870 hypothetical protein + Prom 30943 - 31002 1.7 32 16 Op 1 . + CDS 31078 - 31386 204 ## BVU_3674 mobilization protein MocB 33 16 Op 2 . + CDS 31376 - 32317 906 ## BVU_3675 mobilization protein MocA/BmgA 34 16 Op 3 . 
+ CDS 32353 - 32751 415 ## BVU_3676 hypothetical protein + Term 32770 - 32801 2.5 Predicted protein(s) >gi|226332050|gb|ACIB01000006.1| GENE 1 6 - 1520 1592 504 aa, chain - ## HITS:1 COG:no KEGG:BF3181 NR:ns ## KEGG: BF3181 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 504 1 511 512 174 37.0 6e-42 MKKKLMMVAVLLGALSLGACVDNNESASVEAVRNAKAEQLKGLAALANAQAEATKITAEA EAALKNAQAEYQKEMTEEAKQEFAVEIERIKAEAERAIAEAKKAASEAELAILKNADERV QWLYGQYTTAADELATLNEKLLTKTAGLAQLEAGITTAEANAKVNTIAWNRTIAAETAKL EVLKDPANTNIDKDALNAKKEAAYQKYTLAYSTLMNNEGAALDADAKGIQEAIDALDRDA IEAVNNLYSNVVAFTGYEYLSWETTSGSTSRSLPSGAYVSEAQKLNAENYFATNLEDAAN ALGTSADTKDKNTAYGHLAAANAQLEDANKMGETTDAEKEAKKQAIKDAKTAIALAKDEI VRAQASYDEEKAASDEFTAALAAVDVKAYNDAVSAIVALVKANETVAKAFSDASETSTKL WNEYKVLNALYDNSQNLEELIAQCEYNIAYAKEQIKFYEANITNAEAQLAKGKEELDNLE KEIAAKKIIVDNAKAALDAELNAE >gi|226332050|gb|ACIB01000006.1| GENE 2 1742 - 2107 259 121 aa, chain + ## HITS:1 COG:no KEGG:BF1192 NR:ns ## KEGG: BF1192 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 121 1 121 121 192 99.0 2e-48 MQMKKSNIYIGEIIKNVMSERQVTKAELARRLDVKPQSVDYLLTRKSIDTDTLYNISIAL DYDFSLLYSIKREQRNSDNEGIRYKLGNAKIMVEIELQQDEIIKLNLKKKIAELLDGGAN K >gi|226332050|gb|ACIB01000006.1| GENE 3 2519 - 4678 2167 719 aa, chain - ## HITS:1 COG:no KEGG:BF1158 NR:ns ## KEGG: BF1158 # Name: susB # Def: putative alpha-glucosidase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 719 5 723 723 1478 100.0 0 MKNMKKLTFKLACFFLSLLVSSVAMAESITSPNGLLKLNVSVNEKGEPVYELSYKGKEVI KPSKLGLELKDDPGLMDGFTLSDAKTSSFDETWEPVWGEVKQIRNNYNELAITLDQKAQD RKMVIRFRLYNDGLGFRYEFPQQKNLNYFVIKEEHSQFAMAGDHTAFWIPGDYDTQEYDY TRSKLSEIRGLMKTAVTPNASQTPFSPTGVQTALQMKTDDGLYINLHEAALVDYSCMHLN LDDKNFVFESWLTPDAQGTKGYMQTPCHSPWRTVMVSDDARDILASKLTLNLNEPCKLED TSWIKPVKYVGVWWEMITGKSTWAYTDDVYSVKLGETDYTKTKPNGRHGANNENVKRYID FAAEHGFDQVLVEGWNEGWEDWFGKSKDYVFDFVTPYPDFDVKMLNEYARSKGVKLMMHH ETSASVRNYERHMDEAYQFMEDNGYNAVKSGYVGNIIPRGEHHYGQWLNNHYLYAVKKAA 
DHKIMVNAHEAVRPTGLCRTYPNLIGNESARGTEYEAFGGNKPFHTTLLPFTRLIGGPMD YTPGIFDTQLSFLSGEHSFVHTTLAKQLALYVTLYSPLQMAADLPESYERYMDAFQFIKD VAVDWDESKYIEAEPGEYITVARKAKNTNNWFVGGITGENARTSTFVLDFLEPGKQYVAT LYADGKDADYEKNPTSYQIKKGLVTYKTKISTDLARSGGFAISLIEATPADKKALKKWK >gi|226332050|gb|ACIB01000006.1| GENE 4 4832 - 5215 358 127 aa, chain + ## HITS:1 COG:AGc2712 KEGG:ns NR:ns ## COG: AGc2712 COG0239 # Protein_GI_number: 15888794 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Integral membrane protein possibly involved in chromosome condensation # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 1 126 1 124 125 79 43.0 2e-15 MKEIIYIFIGGGMGSVTRYLTQIAVNERLSPALFPFPWGTFAVNIIGSLLIGFFYSFSER FNLSFELRLFLTVGFCGGFTTFSTLTNDSLSLLKGGFYGIFTFYVFISILLGLLAVLAGG YLGEQFK >gi|226332050|gb|ACIB01000006.1| GENE 5 5230 - 5895 498 221 aa, chain + ## HITS:1 COG:CAC1657 KEGG:ns NR:ns ## COG: CAC1657 COG1357 # Protein_GI_number: 15894934 # Func_class: S Function unknown # Function: Uncharacterized low-complexity proteins # Organism: Clostridium acetobutylicum # 55 216 52 213 216 124 42.0 1e-28 MKPTIKKVQPVKVVAPFLNSQSESPVPLDALTDQEKVSDLYFLKGTVHQIAKPYLSINNC TFKQQIFSECQFKSAQLTDVRFENCDLSNVSFAGTTFYRVEFISCKLLGTGFPEATLNHV LMDHCYGQYINLSMVKMRTVRFSHCNFRNGSLNDSKLMPAAFDTCELLEADFSHTSLKGI DLRNSRIAGIQLNIADLKGAIVSSLQAIDLLPLLGVKIEDD >gi|226332050|gb|ACIB01000006.1| GENE 6 6043 - 7332 1532 429 aa, chain + ## HITS:1 COG:SA0731 KEGG:ns NR:ns ## COG: SA0731 COG0148 # Protein_GI_number: 15926453 # Func_class: G Carbohydrate transport and metabolism # Function: Enolase # Organism: Staphylococcus aureus N315 # 3 420 4 423 434 605 72.0 1e-173 MKIEKITGREILDSRGNPTVEVDVVLESGIMGRASVPSGASTGEHEALELRDGDKHRYGG KGVQKAVENVNKVIAPHLIGMSALDQIGIDHAMLALDGTKTKAKLGANAILGVSLAVAKA AANYLDIPLYRYIGGTNTYVLPVPMMNIINGGSHSDAPIAFQEFMIRPVGASSFKEGLRM GAEVFHALKKVLKDRGLSTAVGDEGGFAPNLEGTEDALNSILAAIKAAGYEPGKDVMIGM DCASSEFYHDGIYDYTKFEGEKGKKRTADEQIDYLEKLINEYPIDSIEDGMSENDWEGWK 
KLTQRIGDRCQLVGDDLFVTNVDFLAKGIEKGCANSILIKVNQIGSLTETLNAIEMAHRH GYTTVTSHRSGETEDATIADIAVATNSGQIKTGSLSRSDRMAKYNQLLRIEEELGDRAVY GYKRIVVKG >gi|226332050|gb|ACIB01000006.1| GENE 7 7923 - 8435 521 170 aa, chain + ## HITS:1 COG:no KEGG:BF1187 NR:ns ## KEGG: BF1187 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 170 13 182 182 281 99.0 8e-75 MHFGTYMGVYWILKFILFPLGLSIPFLLFLFFGLTLGVPFMGYYYARTYRDKVCGGSIRF LQAWVFIVFMYMFAALLTAVAHYIYFRFIDHGFIVNTYMGMFDELSNKEVPGIEGYISQL KEVMEMISTLTPIDITMQLMSQNVFYGSILAVPTALFVMRKPKSPEVQPL >gi|226332050|gb|ACIB01000006.1| GENE 8 8462 - 9415 660 317 aa, chain + ## HITS:1 COG:mlr7556 KEGG:ns NR:ns ## COG: mlr7556 COG0463 # Protein_GI_number: 13476277 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Mesorhizobium loti # 3 309 4 301 326 238 41.0 1e-62 MDISVVVPLFNEEESIPELFAWIERVMKANGFSYEVIFVNDGSTDRSWEIIEEFQKQSST VKGIKFRRNYGKSPALYCGFERAEGNVVITMDADLQDSPDEIPELYRMITEDGYDLVSGY KQKRYDPLSKTLPTKLFNATARKVSGIHNLHDFNCGLKAYRKAVVKNIEVYGEMHRYIPY LAKNAGFQKIGEKVVHHQARKFGKTKFGGWNRFFNGYLDLISLWFLSKFGIKPMHFFGLL GSLMFILGFISVVIVGASKLYSMNHGMPYRLVTDSPYFYLSLTAMIIGTQLFLAGFLGEL ISRNAPERNNYQIEKII >gi|226332050|gb|ACIB01000006.1| GENE 9 9440 - 9967 425 175 aa, chain + ## HITS:1 COG:no KEGG:BF1185 NR:ns ## KEGG: BF1185 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 175 1 175 175 324 100.0 9e-88 MKKLIKLVLFLMVAYPLTGAILSACSEESDCSMTGRPMVYAKMYTINPETKAVLNDTLDS LSVTAFGTDSIIINNQKKVHDIALPLRYTSDSTILVFHYTRLLRDTMVILQTNTPYFQSM DCGYSMKQNIISIHPIDYTETNKKKYHSIDSLYIKSNAANINGTENLKIFYRYNR >gi|226332050|gb|ACIB01000006.1| GENE 10 9924 - 10676 612 250 aa, chain + ## HITS:1 COG:no KEGG:BF1184 NR:ns ## KEGG: BF1184 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 250 1 250 250 464 99.0 1e-129 MEQKISKYSTATIVSLLCLIFSLPLQAQQQRPGARPAVKQKAKEEIKADTIPFYNGTYVG 
VDLFGLGSKLLGGDFLSSEVNVRVNLKKKFIPTVEIGFGQTDTWSDTGIHYKSAAPYFRV GADYNVVKEYLYVGLRYGFSSFKYDISSTPFSDPIYGGSMANPGLIDGIWGGSVPYHYNG LKSNMQWLELVAGVNVQIYKSFYMGWTLRFKFKTAGSISEHGNPWYVPGFGEYDSSNIGI TYTLIYKLPF >gi|226332050|gb|ACIB01000006.1| GENE 11 10681 - 11265 380 194 aa, chain + ## HITS:1 COG:Cj0167c KEGG:ns NR:ns ## COG: Cj0167c COG1971 # Protein_GI_number: 15791554 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Campylobacter jejuni # 8 193 8 187 187 123 45.0 2e-28 MTGLEIWLLAIGLAMDCLAVSIASGIILRRIQWRPMLIMAFFFGLFQAIMPLLGWLGAST FSHLIESVDHWIAFAILAFLGGRMIKESFKEEDCCQRFNPASLKVVITMAVATSIDALAV GVSFAFLGIKSCSSILYPAGIIGFVSFFMSLIGLIFGIRFGCGIARKLRAELWGGIILIL IGTKILIEHLFFNN >gi|226332050|gb|ACIB01000006.1| GENE 12 11337 - 12353 907 338 aa, chain + ## HITS:1 COG:VC2289 KEGG:ns NR:ns ## COG: VC2289 COG1477 # Protein_GI_number: 15642287 # Func_class: H Coenzyme transport and metabolism # Function: Membrane-associated lipoprotein involved in thiamine biosynthesis # Organism: Vibrio cholerae # 35 338 59 367 367 202 38.0 1e-51 MEKKTRKSFIWLAILLLGTIWILAQRNKQIPYNSINGLVFGTVYNITYQYDGNLKAEIDA ELKKFDGSLSPFNDTSVITRVNRNEEIVTDTFFQTCFNRSMEISAETRGAFDITVAPLAN AWGFGFKKGAFPDSIMIDSLLQITGYQKVKLENGKVIKEDPRVMLSCSAVAKGYSVDVVA RYLDSKGIKNYMVDIGGELVVKGVNPKEEAWRIGINKPVDDSLSLNQEIQTTLKLTNVGI ATSGNYRNFYYKDGKKYAHTIDPRTGYPVQHNILSATVVADDCMTADALATAFMVMGLDE AEAFTKSHPNIGAYFIYSDEKGEVKSYFTKNMKQYLDK >gi|226332050|gb|ACIB01000006.1| GENE 13 12690 - 13058 299 122 aa, chain - ## HITS:1 COG:VC1574 KEGG:ns NR:ns ## COG: VC1574 COG3169 # Protein_GI_number: 15641582 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Vibrio cholerae # 4 120 16 122 127 87 46.0 6e-18 MKGIYAISLLVVSNIFMTFAWYGHLKLQETKIISNWPLYGVVLFSWVIALAEYSCQVPAN RLGFSGNGGPFSLMQLKIIQEVITLIIFTVFSTLLFKGESLHWNHVAAFVCLIAAVYFVF MR >gi|226332050|gb|ACIB01000006.1| GENE 14 13072 - 13821 562 249 aa, chain - ## HITS:1 COG:NMA1465 KEGG:ns NR:ns ## COG: NMA1465 COG0037 # Protein_GI_number: 15794367 # 
Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Predicted ATPase of the PP-loop superfamily implicated in cell cycle control # Organism: Neisseria meningitidis Z2491 # 1 238 5 245 319 133 34.0 3e-31 MAQFTEEEKTIRRIEKRFNKGMVQYGLIEEGDKVLVGLSGGKDSLALVELLGKRSHIFKP RFSVVAVHVVMKNIPYQSDWDYLREHAEKNGVPLVVYETSFDPSTDTRKSPCFLCSWNRR KALFTVAKEQGCNKIALGHHMDDILETLLMNITYQGAFSTMPPRLVMNKFDMTIIRPMCL VHEADLLELAQIRGYRKQVKNCPYESQSSRSDMKGILRQLEKMNPEARYSLWGSMTNVQE ELLPKEVEF >gi|226332050|gb|ACIB01000006.1| GENE 15 13962 - 15389 1056 475 aa, chain + ## HITS:1 COG:no KEGG:BF1145 NR:ns ## KEGG: BF1145 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 475 1 475 475 893 100.0 0 MKKKNILFFLLCFLLTSLSAQTLEQARGMYGRGQYAEAKPVFQKYVKSQPANGNYNLWYG VCCLKTGNAAEALKYLETAVKKRIPSGQLYLAQTYNDLYRFQDAVDCYEEYIADLSKRKK PTEEAEQLLEKAKGNLRMLKGVEDVCVIDSFVIDKANFLKAYKISEESGKLFTYNDYFKT KGYHPGTVYETEIGNRIYYSEQGEESLNILSKTKMLDEWSQGKPLPGSINASGNANYPYV LSDGVTIYYASDGDGSMGGYDIFVTRYNTNTDTYLVPENVGMPFNSPYNDYMYVIDEYNN LGWFASDRYQPEDKVCIYVFVPNDSKRTYNYEAMEPEKMIELAQLHSLESTWKDSKIVDD ARQRLEAVINHKPAVEQNFDFEFIIDDHSTYHHLTDFKSPKAKQLYLKYEQMEKDYRQQT GKLKSQREGFARSNKDEQSKMAPAIRDLEKRVLQMSEELDKQAIEVRNAEKQNLK >gi|226332050|gb|ACIB01000006.1| GENE 16 15396 - 15866 491 156 aa, chain + ## HITS:1 COG:no KEGG:BF1177 NR:ns ## KEGG: BF1177 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 156 1 156 156 239 100.0 3e-62 MDVLIIIALIAAAVILFLVELFVIPGISLAGISALVCIIYANYYAFANLGTGAGFITLII SGIACIGSLVWFMRSKTLDKLALKKDITSKIDRSAAEKVKVGDTGITITRLAQIGNAEIN GNIIEVKSMDGLLNEKTPIVVNRITDGIIFVEKLKS >gi|226332050|gb|ACIB01000006.1| GENE 17 15892 - 16893 968 333 aa, chain + ## HITS:1 COG:BS_yqfA KEGG:ns NR:ns ## COG: BS_yqfA COG4864 # Protein_GI_number: 16079592 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 3 321 1 319 331 385 66.0 1e-107 
MNVEPMYLTIFLIAGGIIFLVLFFHYVPFFLWLSAKVSGVNISLVQLFLMRIRNVPPYII VPGMIEAHKAGLSNITRDELEAHYLAGGHVERVVHALVSASKANIELPFQMATAIDLAGR DVFEAVQMSVNPKVIDTPPVTAVAKDGIQLIAKARVTVRANIRQLVGGAGEDTILARVGE GIVSSIGSSENHKSVLENPDSISKLVLRKGLDAGTAFEILSIDIADIDIGKNIGAALQID QANADKNIAQAKAEERRAMAVATEQEMKAKAEEARANVIQAEAEVPKAMAEAFRSGNLGI MDYYKMKNIQADTSMRENIAKPIGGATSKPLSD >gi|226332050|gb|ACIB01000006.1| GENE 18 17020 - 17898 835 292 aa, chain + ## HITS:1 COG:VNG0893G KEGG:ns NR:ns ## COG: VNG0893G COG2820 # Protein_GI_number: 15790029 # Func_class: F Nucleotide transport and metabolism # Function: Uridine phosphorylase # Organism: Halobacterium sp. NRC-1 # 19 268 14 227 273 99 30.0 8e-21 MKKYFPSSELIINEDGSVFHLHVKPEWLADKVILVGDPGRVALVASHFENKECEVESREF KTVTGTYKGKRITVVSTGIGCDNIDIVVNELDALANIDFQTREEKEHLRSLELVRIGTCG GLQPNTPVGTFVCSEKSIGFDGLLNFYAGRNAVCDLPFERAFLNHMGWSGNMCAPAPYVI DANAELIDRIAQEDMVRGVTIAAGGFFGPQGRELRVPLADPKQNDKIEKFEYKGYKITNF EMESSALAGLSKLMGHKAMTVCMVIANRLIKEANTGYKNTIDTLIKTVLDRI >gi|226332050|gb|ACIB01000006.1| GENE 19 17898 - 18425 468 175 aa, chain + ## HITS:1 COG:CAC0738 KEGG:ns NR:ns ## COG: CAC0738 COG0847 # Protein_GI_number: 15894025 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, epsilon subunit and related 3'-5' exonucleases # Organism: Clostridium acetobutylicum # 3 163 1 159 306 113 35.0 2e-25 MNLSFAAIDFETATGYMESACAVGIVTVTDGEITDEYYSLIQPPENEYWRANMLVHGITP GMTESLPGFHAIYPEVKKRLQGNVVVAHNEQFDRNVLKNSMRMYGLDYDELSLPERWECT CRIYRSLGYKPVNLSACCEREGIELKHHEALSDARGCAKLYLNFLEKYRPLSTLW >gi|226332050|gb|ACIB01000006.1| GENE 20 18409 - 19059 591 216 aa, chain - ## HITS:1 COG:PA0741 KEGG:ns NR:ns ## COG: PA0741 COG2910 # Protein_GI_number: 15595938 # Func_class: R General function prediction only # Function: Putative NADH-flavin reductase # Organism: Pseudomonas aeruginosa # 3 216 2 213 213 201 50.0 7e-52 MKKVVLIGASGFVGSAILNEALNRGFHVTAVVRHPEKIRIENENLEVKRADVSSLDEVCK VCKGADAVISAFNPGWNNPDIYKETIEVYLTIIDGVKKAGVNRFLMVGGAGSLFIAPGIR 
LVDSGEVPEKILPGVRALSDFYLDFLKKEKEVDWVFFSPAADMAPGVRTGRYRLGKDEMI VDMVGNSHISVEDYAAAMIDELEKPEHHQERFTIGY >gi|226332050|gb|ACIB01000006.1| GENE 21 19146 - 21053 1274 635 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 205 490 4 304 328 166 37.0 1e-40 MEHTLDSFPDTGDLENLKDNYQKITSVLAGHQIAFWEYDIPTGECNFTDEYFHILGLKEA GIIFRDINDFYRFAHPEDVISYQTTFARMLESETKISQIVVRCVGRQGETIWLEDNFIAY KKNKENGSDKIIAYTANITSRCEKEVQIRQLEERNRKIIEALPEFIFIFDDNFFITDVLM APDTELLHPVEVLTGADGRSIYSSEVSDLFISSIHECLKSGKLKEIEYPVDVEAGRHFFQ ARIAPFEGNKVLALIHDIGDRMRRSQELLEAKQRAEEADRMKSVFLANMSHEIRTPLNAI VGFSEIIALTEDEKEKEEYLGIIQQNSNLLLQLINDILDLSRIESGKSEMHCQLTEMSGL VDEVDKVHRLKMKKGVKLNVIRPSEEIWISTDRNRVTQVLFNFLSNAIKNTIEGSITFGL VKEEEWVKLYVTDTGCGISKEKLPLIFTRFEKLNDFVQGTGLGLPICKSIVERLGGRIEV ESELGQGSTFILYLPNRQVQEVVVGERENAAGNMGVENRQKKILIAEDVESSYLQINAFL KKEYTILWVPNGEEAVKSFIREKPDLILMDIRMPVMNGIQATAKIRAISQEIPIIAITAY AFCPEGERALEAGCNEVIAKPYPLEKLKETIETYL >gi|226332050|gb|ACIB01000006.1| GENE 22 21175 - 22572 951 465 aa, chain + ## HITS:1 COG:CAC3734 KEGG:ns NR:ns ## COG: CAC3734 COG0486 # Protein_GI_number: 15896965 # Func_class: R General function prediction only # Function: Predicted GTPase # Organism: Clostridium acetobutylicum # 4 465 5 459 459 303 39.0 3e-82 MNQDTICAIATAQGGAIGSIRVSGPEAITITSRIFTPAKSGKLLSEQKPYTLTFGRIYNG EEMIDEVLVSLFRAPHSYTGEDSTEITCHGSSYILQQVMQLLIKNGCRMAQPGEYTQRAF LNGKMDLSQAEAVADLIASSSAATHRLALSQMRGGFSKELTTLREKLLNFTSMIELELDF SEEDVEFADRSALRRLADEIEEVIARLANSFSVGNVIKNGVPVAIIGETNAGKSTLLNVL LNEDKAIVSDIHGTTRDVIEDTVNIGGITFRFIDTAGIRETSDTIESLGIERTFQKLDQA EIVLWMIDSADAISQLTLLSDKILPRCEHKQLILVFNKVELINETQKNELASQFSEHIGS EIESIFISAKQRLHTDELQQRLVAAAHLPTVTQNDVIVTNIRHYEALTRALDAIHRVQEG LDVNISGDFLSQDIRECIFHLSDIAGEVTNDMVLQNIFAHFCIGK >gi|226332050|gb|ACIB01000006.1| GENE 23 22820 - 23722 581 300 aa, chain + ## HITS:1 COG:no KEGG:BDI_2123 NR:ns ## KEGG: BDI_2123 # 
Name: not_defined # Def: transposase in mobilizable transposon, TnpA protein # Organism: P.distasonis # Pathway: not_defined # 3 300 4 303 303 330 59.0 5e-89 MSSKIDIQKRCKWCNAVFTAHKSTTEYCSHRCVNLAYKDRVRKQRIESLQHELGKIIKTP PNLNKEYLTPSEVAVLLNIGRTSIYRYIRNGVIKVIRFERKTLVRRADIEDMTDFIVPET ENKQLKEKAPITDFYTTAEVKEKYHVNESWIFVVAKKNNIPRTFNRGKTYWSKKHMDAYF AKKAPNPDITEWYSTQEMQEKFSMTLSAIYCFASKNAIPKKKEGIIVYYSKKHVDIAKGI AAPEEPQYYTVAEAMEKFNLTRDQLYHYAKYHNIHKVKKGKYTLISKLELDKLLAAPKIE >gi|226332050|gb|ACIB01000006.1| GENE 24 23807 - 24913 716 368 aa, chain + ## HITS:1 COG:TM0967 KEGG:ns NR:ns ## COG: TM0967 COG0582 # Protein_GI_number: 15643727 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Thermotoga maritima # 188 350 81 239 253 69 32.0 1e-11 MSHTCTKVTVRQRAIRNNRISLYLDYYPAVRNPETMQMSRREYLGIYIYAHPKNEMEREF NNDMLNKAEAIRCIRVQSLINEEFGFLDKTKQKADFLAYFKKMCRNKDQKWQFVYQHFYN FVKGQCTFGDVNVDLCKKFREYLLNAKQLKHSNRPMSLNSASGYYSTFRGLLKIAYRDKW FRENINDYLDKIEPQDVKKEYLTLNEVKQLAATPCDIPVLKAASLFACLTGLRISDILNL QWEDFTIAPDQGYCLRIRTQKTQTEATLPISYEAYELCGTPGTGKVFKDLKRSMINYPLK SWLKKAGITKPITFHGFRHSYAVIQISLGTDIYTVSKMLTHKNVSTTQIYADLVNSKKRE TANKISLK >gi|226332050|gb|ACIB01000006.1| GENE 25 24923 - 25288 263 121 aa, chain + ## HITS:1 COG:no KEGG:BVU_3718 NR:ns ## KEGG: BVU_3718 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 114 1 117 119 93 39.0 3e-18 MKRGIITNKGLGIHISDGEVWMTTWELADLFYTTAGAIHAAIKRILKTNILKSHEVCKYI KLENGNNADVYNLDMVVALSYQIDTGHSIVFREWLINKVAHNQDYNILLYLNKGTNHTLY C >gi|226332050|gb|ACIB01000006.1| GENE 26 25554 - 26435 417 293 aa, chain + ## HITS:1 COG:no KEGG:BDI_2125 NR:ns ## KEGG: BDI_2125 # Name: not_defined # Def: mobilizable transposon, TnpC protein # Organism: P.distasonis # Pathway: not_defined # 1 292 1 254 255 212 45.0 1e-53 MEQRKIEHITLHLIVFGTIAIIGVLARQTVLHYGWDEFSSYLILVVCSIVIGAIYLNLQI AFRQLLSPTIERCFMRFESYRNKTVVAEIPIEHHEVIESAADIESIVSESEIENETPSDT TEEVSTLSLSESNEDSTELPKNEATSSIINNTPIEESSQPTEYEIYHATAMAEKERASQK 
KLDKVLTYIKQTLVLYLNETDLNRLCGYVTEYYLSDSLPKVEPIKVDSQLKTIDIMHFGW NIGKAFGKPRLQTATFIKRVFAHTLSDSEISTIERKMSHTESVCKIKLDRRIA >gi|226332050|gb|ACIB01000006.1| GENE 27 26466 - 27323 215 285 aa, chain - ## HITS:1 COG:MA2102 KEGG:ns NR:ns ## COG: MA2102 COG3344 # Protein_GI_number: 20090946 # Func_class: L Replication, recombination and repair # Function: Retron-type reverse transcriptase # Organism: Methanosarcina acetivorans str.C2A # 45 262 87 301 563 99 28.0 9e-21 MITSEKYLLYILGVNKEQLDYLLTHIEDYYYSFERVKFNKFTDKPKKNSSGEIATRQINS SKGKLKEVQTRLYDFMSKQVEIPQYVYGGVLRKNNVRNARLHQGNKYIFTTDLKSFFPSI SHKQVFQMFLREGCTPAIARILTKLTTHKYQVPQGIPTSTLIANLVFKPIGMEIDQLAKE HHIKFSMFVDDITLSSKVDFKNLVPQFLAIIKKSGFRISHKKTHYQTKNPIITGVICQNN RLLAPLGYKKKIAILSKQLSTHESIKNKLQGIKAYLSIIEKSSSL >gi|226332050|gb|ACIB01000006.1| GENE 28 27349 - 27564 173 71 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|167761895|ref|ZP_02434022.1| ## NR: gi|167761895|ref|ZP_02434022.1| hypothetical protein BACSTE_00238 [Bacteroides stercoris ATCC 43183] # 1 71 3 73 73 93 100.0 5e-18 MTKKKKKPSKRGKKANKNSLHCSPMIVTQEKKIKNLAKWIKAIAEFIKALIQLSNAMKPL IEWCKSFFELL >gi|226332050|gb|ACIB01000006.1| GENE 29 28030 - 28395 454 121 aa, chain + ## HITS:1 COG:no KEGG:BDI_2127 NR:ns ## KEGG: BDI_2127 # Name: not_defined # Def: excisionase in mobilizable transposon, Xis protein # Organism: P.distasonis # Pathway: not_defined # 1 121 1 122 122 143 61.0 2e-33 MQKETVTFDKLPEAVGYLTEQIIELKRMVSELQPPASDKHVLVEIEDACRIIRKAKPTIY TLVRKGLLPAYKKGKKLYFYEDELLAWIENGRRKTSEQTYEEMLANMQGGVRHKPKSSVK F >gi|226332050|gb|ACIB01000006.1| GENE 30 28407 - 29783 733 458 aa, chain + ## HITS:1 COG:no KEGG:PG0871 NR:ns ## KEGG: PG0871 # Name: not_defined # Def: hypothetical protein # Organism: P.gingivalis # Pathway: not_defined # 1 457 1 457 458 620 64.0 1e-176 MDTSSINPASIIDETVDLSMRLAGTDFPVSIFPTKIQRIISEVHKCHNYPTDYIAAAILT AIAVGIGNTHLAQIKQGWIESPILYMALIGRPGANKSHPLSFAMKPFLDYDYQQNQVFEK ALTKYDELMSMSRKERTESGEEQFPQEPVRKRFLISDVTPEGLSLIHAQNKRGLCLWADE 
LSAWFKNFNRYNNGSEEQFWLSVFSAKTTISDRKNAKSSIFIKRPYISVIGTIQKKILSE LAKGERSSNGFIDRILFVMPNLQQKARWNDKELPENIEQEWNAIIDKLIQQEYALNEFGE IEPQILLFTEDAKRRLYEWQHHFSELCDQETNDTIVSIYCKLEIYIIRFCLIIQLARWTC GECGKTYIDLLTVERAIKLTEYFKESALSVQNILNENMLNSQQQAIVNLLPPSFTTAQAI QIAEQNGMKERTFQRFLNDNIGTLFRKEKHGEYSKINP >gi|226332050|gb|ACIB01000006.1| GENE 31 29861 - 30940 487 359 aa, chain + ## HITS:1 COG:no KEGG:PG0870 NR:ns ## KEGG: PG0870 # Name: not_defined # Def: hypothetical protein # Organism: P.gingivalis # Pathway: not_defined # 1 358 1 351 352 447 56.0 1e-124 MSTHRFILEPYKGVSTRHTCPNCHRQRCFSKYIDTEKQIKFPEYVGRCDHEQKCGYHFTP RDYFEQNPSEKEKFAENSFRSYAPIKEAKPIATSYIDLDIVNQSLRGYPANKLFQFLSAQ FGETETLKLMKRYKVGTSKYWDGATVFWQTDNQNKVRTGKIMLYNSETGKRIKEPYNHVT WVHSVLHKGDYNLKQCFFGEHLLPEDKSRPVALVESEKTTLIASYYLPQFLWIASGGKNG CFNANSLSVLAGRSVVLFPDLGATDYWQSKIGLMKSYGIKVQMFDYLEANATENERKEGY DIADYLLKVKPDEAILQQMIMMNPALKILIKTFDLKLISIQQGTPQPKVSPPKKRGFRL >gi|226332050|gb|ACIB01000006.1| GENE 32 31078 - 31386 204 102 aa, chain + ## HITS:1 COG:no KEGG:BVU_3674 NR:ns ## KEGG: BVU_3674 # Name: not_defined # Def: mobilization protein MocB # Organism: B.vulgatus # Pathway: not_defined # 1 102 1 102 102 162 89.0 3e-39 MEKKYTITIRLSYQQHAWLGALCRRSKQTQSEVIRSLIENGSVRERITQEHIHIIRQLIG ESTNLNQLARQANTYGFFAVADRCEEMAQHINQLIKQLKNDR >gi|226332050|gb|ACIB01000006.1| GENE 33 31376 - 32317 906 313 aa, chain + ## HITS:1 COG:no KEGG:BVU_3675 NR:ns ## KEGG: BVU_3675 # Name: not_defined # Def: mobilization protein MocA/BmgA # Organism: B.vulgatus # Pathway: not_defined # 1 313 1 313 313 531 92.0 1e-149 MIGKQTKGTSFGGCVRYVLKEEKSKLLEAAGVEGTPEQIAEQFELQALLNDKVKNIVGHT SLNFSPEDSARLKSDDVLMLNIAHDYMKLMGIENTQYIIARHIDREHPHCHIVFNRVDND GKTISDKNDFRKNEKVCKMLTAKYRLHFANGKDHIKEERLRPYDKAKYEIYKALKKELPT AHSWDDLKDALADREIDMKFKVSRTTREIQGVKFEHNGFSFSGSKVGREFSYLNIDNRLE ENACASLLESAKQESKQQKEEVQQSVSHSDDSFGISLGLLNGSSSYDATAAEEAEFNRLM KKKKAKRKRGFHL >gi|226332050|gb|ACIB01000006.1| GENE 34 32353 - 32751 415 132 aa, chain + ## HITS:1 COG:no KEGG:BVU_3676 NR:ns ## KEGG: BVU_3676 # Name: 
not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 132 1 132 132 249 97.0 3e-65 MVQKDLILDFNLYLCEKFGYRESCSVMSHANGFCVDIRERDLDCYIRFWEYSCGRGNFPD WSIIIVRSNFKKNQEESLKDLARFFKEYMPRYGYKYLCTEDDDYKYYQTLGLKCIMDGFY PNYALALKDLNV Prediction of potential genes in microbial genomes Time: Tue May 17 22:03:02 2011 Seq name: gi|226332049|gb|ACIB01000007.1| Bacteroides sp. 3_2_5 cont1.7, whole genome shotgun sequence Length of sequence - 2526 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 1, operones - 1 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 43 - 1035 680 ## COG0582 Integrase 2 1 Op 2 . + CDS 1105 - 1986 616 ## Coch_0959 DNA-damage-inducible protein D 3 1 Op 3 . + CDS 1983 - 2526 143 ## FMG_1373 type I restriction-modification system specificity subunit Predicted protein(s) >gi|226332049|gb|ACIB01000007.1| GENE 1 43 - 1035 680 330 aa, chain + ## HITS:1 COG:SP0890 KEGG:ns NR:ns ## COG: SP0890 COG0582 # Protein_GI_number: 15900773 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Streptococcus pneumoniae TIGR4 # 3 329 2 320 321 354 57.0 2e-97 MKENIIQAIVAEMQRDLDCRQMARLKAVLTSELHNVEIIEKSDCATLQTQENEHLLNSFI SAKKIEGCSEKTLTYYRNTIERLLVSLSLAICHITTTDIRTYLSNYQEEHQSSKVTIDNM RRIFSSFFAWLEDEDYIAKSPVRRIHKVKTDSLVKEVLSDEQLEQLRDSCTTKRDLAIID FLSSTGIRVGELVKLNREDIDFHERQCVVFGKGNKERVVYFNARTKLHLQQYLNGRTDDN PALFVSLHSPHTRLTISGVEVRIRKLGHTLSMPKVHPHKFRRTLATMAIDKGMPIEQVQR LLGHVRIDTTLHYAIVNQNNVKLAHKKYLG >gi|226332049|gb|ACIB01000007.1| GENE 2 1105 - 1986 616 293 aa, chain + ## HITS:1 COG:no KEGG:Coch_0959 NR:ns ## KEGG: Coch_0959 # Name: dinD # Def: DNA-damage-inducible protein D # Organism: C.ochracea # Pathway: not_defined # 1 286 1 286 287 419 75.0 1e-116 MDLQRINKHKQSFDDICHYIEDDNGADKVEVWFARELQIILGYARWENFQVALTRAVESC KTQNINIDDHFREVTKMVTLGSGAKREIQDFMLTRYACYLVAQNGDPKKEEIAFAQGYFA VQTRRAELIAEHIEQLSRLETRDRLRSSEKQLSRNIYERGVDDKGFGRIRSKGDGVLFGG 
HTTEDMKNRLGIKSTRPLADFLPTLTIAAKNLATEMTNYNVEQKDLYGEHSITTEHMDNN RSIRQMLGQRGIRPEELPAAEDIKKVERRVASNEKKIEKSSSKLPKLKPEDNK >gi|226332049|gb|ACIB01000007.1| GENE 3 1983 - 2526 143 181 aa, chain + ## HITS:1 COG:no KEGG:FMG_1373 NR:ns ## KEGG: FMG_1373 # Name: not_defined # Def: type I restriction-modification system specificity subunit # Organism: F.magna # Pathway: not_defined # 5 145 50 190 254 134 45.0 1e-30 MKLIDIIEVFIAGDWGEETYSKETPCAVTCVRGADIIPISEYDFSAIPVRYINQQAYARK CLQVGDIIIEKSGGSPTQSTGRVSLVSQELLDHAGAVICSNFCTAFRVKKGWIPLYVYYY LQFIYNLGAFFNFEGKTSGIKNLQLDAAFAAIPIEDISESIQNNIVAILQGLERKIAINR Q Prediction of potential genes in microbial genomes Time: Tue May 17 22:04:14 2011 Seq name: gi|226332048|gb|ACIB01000008.1| Bacteroides sp. 3_2_5 cont1.8, whole genome shotgun sequence Length of sequence - 237626 bp Number of predicted genes - 187, with homology - 182 Number of transcription units - 89, operones - 45 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 1 - 454 262 ## COG0732 Restriction endonuclease S subunits 2 1 Op 2 . - CDS 451 - 714 229 ## gi|253563524|ref|ZP_04840981.1| predicted protein 3 1 Op 3 2/0.000 - CDS 784 - 1785 884 ## COG3943 Virulence protein 4 1 Op 4 4/0.000 - CDS 1778 - 3439 1136 ## COG0286 Type I restriction-modification system methyltransferase subunit 5 1 Op 5 . - CDS 3445 - 6675 2340 ## COG0610 Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 6 1 Op 6 . - CDS 6677 - 6880 161 ## BT_4515 hypothetical protein - Prom 7036 - 7095 4.8 + Prom 6860 - 6919 4.9 7 2 Tu 1 . + CDS 6978 - 9431 1033 ## RPB_3532 hypothetical protein + Term 9498 - 9540 8.5 - Term 9356 - 9398 0.1 8 3 Tu 1 . - CDS 9412 - 9567 76 ## - Prom 9604 - 9663 2.3 9 4 Tu 1 . - CDS 9683 - 9955 102 ## gi|301162168|emb|CBW21713.1| putative transmembrane protein - Prom 10168 - 10227 5.1 + Prom 10081 - 10140 4.3 10 5 Tu 1 . 
+ CDS 10166 - 11050 605 ## COG1028 Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) + Term 11182 - 11230 17.4 - Term 11160 - 11224 12.9 11 6 Tu 1 . - CDS 11260 - 15126 3552 ## COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases - Prom 15211 - 15270 5.8 + Prom 15444 - 15503 4.7 12 7 Op 1 4/0.000 + CDS 15583 - 17577 1603 ## COG2189 Adenine specific DNA methylase Mod 13 7 Op 2 . + CDS 17599 - 20631 2532 ## COG3587 Restriction endonuclease 14 7 Op 3 . + CDS 20647 - 22266 264 ## PROTEIN SUPPORTED gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit 15 7 Op 4 . + CDS 22277 - 23176 466 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 16 7 Op 5 . + CDS 23235 - 26093 1611 ## BF1139 hypothetical protein + Term 26100 - 26132 0.3 17 8 Op 1 . - CDS 26528 - 27742 1314 ## COG1883 Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit 18 8 Op 2 . - CDS 27754 - 28191 344 ## BF1049 hypothetical protein - Prom 28222 - 28281 4.3 19 9 Tu 1 . - CDS 28335 - 28718 127 ## BF1135 hypothetical protein - Prom 28774 - 28833 5.9 - Term 28813 - 28854 8.8 20 10 Tu 1 . - CDS 28883 - 29017 98 ## - Prom 29130 - 29189 6.7 + Prom 29140 - 29199 6.0 21 11 Op 1 . + CDS 29237 - 30832 1159 ## COG1288 Predicted membrane protein 22 11 Op 2 . + CDS 30745 - 30912 73 ## + Term 31017 - 31064 0.1 23 11 Op 3 . + CDS 31072 - 33750 2557 ## BF1133 hypothetical protein + Term 33786 - 33835 14.4 - Term 33827 - 33871 -1.0 24 12 Tu 1 . - CDS 33873 - 34301 323 ## BF1132 hypothetical protein - Prom 34349 - 34408 4.7 25 13 Tu 1 . - CDS 34413 - 35477 1051 ## COG0389 Nucleotidyltransferase/DNA polymerase involved in DNA repair - Prom 35555 - 35614 3.3 - Term 35587 - 35633 13.1 26 14 Op 1 . - CDS 35641 - 36624 555 ## COG2207 AraC-type DNA-binding domain-containing proteins 27 14 Op 2 . - CDS 36696 - 39380 1042 ## BF1129 two-component system sensor histidine kinase + Prom 39812 - 39871 2.5 28 15 Tu 1 . 
+ CDS 39895 - 40242 97 ## BF1040 hypothetical protein + Prom 40496 - 40555 5.2 29 16 Tu 1 . + CDS 40613 - 41767 919 ## COG2706 3-carboxymuconate cyclase + Prom 41794 - 41853 7.2 30 17 Tu 1 . + CDS 41886 - 43070 797 ## BF1125 hypothetical protein + Term 43096 - 43141 3.3 31 18 Op 1 . - CDS 43009 - 43740 404 ## COG0739 Membrane proteins related to metalloendopeptidases 32 18 Op 2 . - CDS 43752 - 44417 579 ## COG3382 Uncharacterized conserved protein 33 18 Op 3 . - CDS 44439 - 45224 198 ## COG1496 Uncharacterized conserved protein 34 18 Op 4 . - CDS 45248 - 46408 727 ## PROTEIN SUPPORTED gi|149915191|ref|ZP_01903719.1| 50S ribosomal protein L27 35 18 Op 5 . - CDS 46498 - 47067 677 ## COG0563 Adenylate kinase and related kinases 36 18 Op 6 . - CDS 47123 - 47659 743 ## COG0634 Hypoxanthine-guanine phosphoribosyltransferase - Prom 47796 - 47855 6.1 + Prom 47681 - 47740 4.2 37 19 Op 1 . + CDS 47910 - 49970 1583 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 38 19 Op 2 . + CDS 49987 - 50631 599 ## BF1116 hypothetical protein 39 19 Op 3 . + CDS 50645 - 52159 557 ## BF1115 hypothetical protein - Term 52175 - 52216 1.1 40 20 Op 1 1/0.231 - CDS 52260 - 53216 457 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 41 20 Op 2 12/0.000 - CDS 53220 - 54239 637 ## COG0451 Nucleoside-diphosphate-sugar epimerases 42 20 Op 3 26/0.000 - CDS 54236 - 55000 259 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 43 20 Op 4 6/0.000 - CDS 55017 - 56240 333 ## COG0438 Glycosyltransferase 44 20 Op 5 . - CDS 56295 - 57074 333 ## COG0726 Predicted xylanase/chitin deacetylase 45 20 Op 6 . - CDS 57067 - 58200 551 ## BF1021 putative glycosyltransferase 46 20 Op 7 . - CDS 58187 - 59464 729 ## COG0677 UDP-N-acetyl-D-mannosaminuronate dehydrogenase 47 20 Op 8 . - CDS 59473 - 60279 291 ## COG1216 Predicted glycosyltransferases 48 20 Op 9 . 
- CDS 60345 - 60581 137 ## BF1018 putative polysaccharide polymerase - Prom 60612 - 60671 9.2 - Term 60649 - 60693 1.2 49 21 Tu 1 . - CDS 60727 - 60918 89 ## - Prom 61024 - 61083 9.8 + Prom 61273 - 61332 5.2 50 22 Tu 1 . + CDS 61459 - 61689 117 ## - Term 61908 - 61952 2.4 51 23 Op 1 . - CDS 62126 - 63286 634 ## COG0438 Glycosyltransferase 52 23 Op 2 . - CDS 63262 - 63966 371 ## BF1015 putative fucosyl transferase - Prom 64024 - 64083 5.8 53 24 Op 1 . - CDS 64129 - 65664 434 ## BF1014 putative O-antigen flippase 54 24 Op 2 . - CDS 65657 - 66178 78 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 55 24 Op 3 13/0.000 - CDS 66197 - 66745 384 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes 56 24 Op 4 . - CDS 66761 - 67648 534 ## COG1209 dTDP-glucose pyrophosphorylase 57 24 Op 5 . - CDS 67651 - 68043 300 ## BF1093 hypothetical protein - Prom 68099 - 68158 4.8 58 25 Tu 1 . - CDS 68226 - 68744 332 ## BF1009 putative transcriptional regulator - Prom 68916 - 68975 10.3 59 26 Tu 1 . - CDS 69074 - 69310 103 ## BF1091 hypothetical protein - Prom 69339 - 69398 6.3 60 27 Tu 1 . - CDS 69458 - 70093 433 ## COG0500 SAM-dependent methyltransferases - Prom 70280 - 70339 3.7 + Prom 70163 - 70222 4.7 61 28 Op 1 . + CDS 70399 - 71739 1026 ## BF1089 hypothetical protein 62 28 Op 2 . + CDS 71783 - 73294 488 ## PROTEIN SUPPORTED gi|153836659|ref|ZP_01989326.1| ribosomal protein S15 63 28 Op 3 . + CDS 73367 - 74422 1144 ## BF1004 hypothetical protein + Term 74454 - 74514 16.2 - Term 74442 - 74500 18.7 64 29 Op 1 . - CDS 74556 - 74909 496 ## BF1086 hypothetical protein 65 29 Op 2 . - CDS 74916 - 76304 1533 ## BF1085 putative oxalate:formate antiporter - Prom 76384 - 76443 5.6 - Term 76393 - 76452 18.5 66 30 Op 1 . - CDS 76475 - 76855 469 ## BF1084 preprotein translocase subunit SecG 67 30 Op 2 . - CDS 76860 - 77615 602 ## BF1000 hypothetical protein 68 30 Op 3 . - CDS 77621 - 78142 625 ## BF1082 hypothetical protein 69 30 Op 4 . 
- CDS 78129 - 79355 941 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 70 30 Op 5 . - CDS 79380 - 80477 876 ## PROTEIN SUPPORTED gi|163786851|ref|ZP_02181299.1| 50S ribosomal protein L32 71 30 Op 6 . - CDS 80482 - 81534 852 ## BF1079 hypothetical protein 72 30 Op 7 . - CDS 81603 - 82637 819 ## COG0820 Predicted Fe-S-cluster redox enzyme - Prom 82661 - 82720 8.7 - Term 82733 - 82774 11.3 73 31 Tu 1 . - CDS 82814 - 84952 2185 ## BF1077 peptidyl-prolyl cis-trans isomerase - Prom 85005 - 85064 3.8 - Term 85006 - 85044 -0.7 74 32 Op 1 . - CDS 85073 - 86329 998 ## COG1253 Hemolysins and related proteins containing CBS domains 75 32 Op 2 . - CDS 86332 - 86949 364 ## BF0992 hypothetical protein 76 32 Op 3 . - CDS 86954 - 88279 1498 ## BF1074 hypothetical protein 77 32 Op 4 . - CDS 88325 - 89599 976 ## BF1073 putative outer membrane protein 78 32 Op 5 . - CDS 89586 - 90317 381 ## COG1521 Putative transcriptional regulator, homolog of Bvg accessory factor - Prom 90363 - 90422 9.3 79 33 Op 1 . + CDS 90891 - 92117 590 ## BF0985 putative transmembrane protein 80 33 Op 2 . + CDS 92114 - 93688 1051 ## COG1524 Uncharacterized proteins of the AP superfamily + Prom 93691 - 93750 3.6 81 34 Tu 1 . + CDS 93796 - 97125 3588 ## COG0653 Preprotein translocase subunit SecA (ATPase, RNA helicase) + Term 97157 - 97197 10.6 + Prom 97144 - 97203 6.7 82 35 Op 1 . + CDS 97245 - 98345 949 ## BF1067 hypothetical protein 83 35 Op 2 . + CDS 98410 - 98856 520 ## BF1066 hypothetical protein + Term 98884 - 98925 7.0 - Term 98866 - 98919 13.0 84 36 Tu 1 . - CDS 98936 - 100492 1487 ## COG3119 Arylsulfatase A and related enzymes - Prom 100576 - 100635 4.8 - Term 100603 - 100642 7.5 85 37 Op 1 . - CDS 100668 - 102233 1562 ## COG3119 Arylsulfatase A and related enzymes 86 37 Op 2 . - CDS 102250 - 103890 1603 ## BF1063 hypothetical protein 87 37 Op 3 . - CDS 103897 - 107277 3024 ## BF1062 hypothetical protein - Prom 107300 - 107359 3.9 88 38 Tu 1 . 
- CDS 107446 - 108453 684 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 108517 - 108576 5.0 + Prom 108510 - 108569 10.6 89 39 Tu 1 . + CDS 108589 - 109164 424 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 90 40 Op 1 . - CDS 109366 - 110439 917 ## COG1409 Predicted phosphohydrolases - Prom 110463 - 110522 3.8 91 40 Op 2 . - CDS 110524 - 111552 898 ## BF1058 hypothetical protein - Term 111564 - 111610 0.4 92 41 Op 1 . - CDS 111631 - 113151 1487 ## BF1057 hypothetical protein 93 41 Op 2 . - CDS 113165 - 116467 2980 ## BF0971 putative outer membrane protein - Prom 116653 - 116712 5.2 94 42 Tu 1 . - CDS 116725 - 117750 607 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 117815 - 117874 4.5 + Prom 117701 - 117760 7.3 95 43 Tu 1 . + CDS 117855 - 118436 379 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog + Term 118529 - 118596 8.6 + Prom 118438 - 118497 2.9 96 44 Tu 1 . + CDS 118646 - 119074 379 ## BF1052 hypothetical protein 97 45 Tu 1 . - CDS 119040 - 120941 1238 ## COG0642 Signal transduction histidine kinase - Prom 120983 - 121042 2.2 - Term 120988 - 121032 6.0 98 46 Tu 1 . - CDS 121060 - 123690 2763 ## COG0525 Valyl-tRNA synthetase - Prom 123738 - 123797 3.7 + Prom 123707 - 123766 7.3 99 47 Op 1 . + CDS 123807 - 124868 869 ## BF1049 hypothetical protein 100 47 Op 2 . + CDS 124874 - 125659 952 ## COG3956 Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain + Term 125686 - 125745 7.0 - Term 125673 - 125733 7.2 101 48 Op 1 . - CDS 125761 - 126234 455 ## BF1047 hypothetical protein 102 48 Op 2 . - CDS 126263 - 126676 305 ## BF0962 hypothetical protein 103 48 Op 3 . - CDS 126694 - 127245 399 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 104 48 Op 4 . 
- CDS 127318 - 128232 549 ## COG1234 Metal-dependent hydrolases of the beta-lactamase superfamily III - Prom 128291 - 128350 9.3 - Term 128318 - 128359 8.1 105 49 Tu 1 . - CDS 128382 - 130175 3053 ## PROTEIN SUPPORTED gi|53712335|ref|YP_098327.1| 30S ribosomal protein S1 - Prom 130218 - 130277 6.4 106 50 Tu 1 . - CDS 130317 - 131429 399 ## COG3344 Retron-type reverse transcriptase - Prom 131493 - 131552 1.7 107 51 Tu 1 . - CDS 131636 - 133855 1228 ## BF1041 hypothetical protein - Prom 133891 - 133950 8.4 + Prom 133896 - 133955 9.3 108 52 Tu 1 . + CDS 134095 - 136284 1819 ## COG3968 Uncharacterized protein related to glutamine synthetase + Term 136299 - 136352 12.6 + Prom 136501 - 136560 2.7 109 53 Op 1 . + CDS 136626 - 137327 629 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases 110 53 Op 2 . + CDS 137372 - 139816 1972 ## COG3525 N-acetyl-beta-hexosaminidase - Term 139830 - 139897 15.4 111 54 Op 1 . - CDS 139970 - 140917 708 ## PROTEIN SUPPORTED gi|148988049|ref|ZP_01819512.1| 30S ribosomal protein S9 112 54 Op 2 . - CDS 140973 - 141614 584 ## BF1034 hypothetical protein 113 54 Op 3 . - CDS 141639 - 144128 2239 ## COG1674 DNA segregation ATPase FtsK/SpoIIIE and related proteins - Prom 144152 - 144211 5.8 + Prom 144103 - 144162 5.2 114 55 Op 1 . + CDS 144241 - 144876 469 ## BF1032 hypothetical protein 115 55 Op 2 . + CDS 144883 - 145533 431 ## BF1031 hypothetical protein 116 55 Op 3 . + CDS 145558 - 146736 597 ## PROTEIN SUPPORTED gi|223476703|ref|YP_002580685.1| ribosomal protein L11 methyltransferase, putative + Term 146886 - 146917 2.5 117 56 Tu 1 . - CDS 147119 - 148276 742 ## BF0946 hypothetical protein - Prom 148498 - 148557 7.4 + Prom 149193 - 149252 7.0 118 57 Op 1 . + CDS 149402 - 150634 885 ## COG0477 Permeases of the major facilitator superfamily 119 57 Op 2 . + CDS 150637 - 151206 599 ## COG1259 Uncharacterized conserved protein 120 57 Op 3 . 
+ CDS 151212 - 151901 617 ## COG1385 Uncharacterized protein conserved in bacteria 121 57 Op 4 . + CDS 151929 - 153482 1500 ## BF1024 hypothetical protein 122 57 Op 5 . + CDS 153495 - 154139 181 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 123 57 Op 6 . + CDS 154136 - 155341 795 ## BF1022 hypothetical protein + Term 155364 - 155395 1.8 + Prom 155465 - 155524 2.7 124 58 Tu 1 . + CDS 155555 - 156481 858 ## COG0324 tRNA delta(2)-isopentenylpyrophosphate transferase + Prom 156635 - 156694 5.7 125 59 Op 1 . + CDS 156773 - 157699 888 ## COG1597 Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 126 59 Op 2 . + CDS 157715 - 158518 834 ## COG2877 3-deoxy-D-manno-octulosonic acid (KDO) 8-phosphate synthase + Prom 158563 - 158622 1.9 127 60 Tu 1 . + CDS 158667 - 161486 2538 ## COG0612 Predicted Zn-dependent peptidases + Prom 161489 - 161548 3.5 128 61 Op 1 . + CDS 161583 - 163064 1286 ## BF1017 hypothetical protein 129 61 Op 2 . + CDS 163096 - 163803 219 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 130 61 Op 3 . + CDS 163808 - 164323 232 ## PROTEIN SUPPORTED gi|167856514|ref|ZP_02479226.1| 50S ribosomal protein L1 131 61 Op 4 . + CDS 164353 - 165039 451 ## COG0546 Predicted phosphatases + Term 165183 - 165219 6.3 + Prom 165673 - 165732 4.3 132 62 Op 1 32/0.000 + CDS 165819 - 166136 534 ## PROTEIN SUPPORTED gi|53712302|ref|YP_098294.1| 50S ribosomal protein L21 133 62 Op 2 . + CDS 166159 - 166428 460 ## PROTEIN SUPPORTED gi|53712301|ref|YP_098293.1| 50S ribosomal protein L27 134 62 Op 3 . + CDS 166391 - 166549 72 ## BF1008 hypothetical protein 135 62 Op 4 . + CDS 166585 - 167859 1269 ## COG0172 Seryl-tRNA synthetase + Term 167964 - 167997 1.3 + Prom 167975 - 168034 4.4 136 63 Tu 1 . 
+ CDS 168079 - 170367 2089 ## COG0493 NADPH-dependent glutamate synthase beta chain and related oxidoreductases + Term 170493 - 170533 6.4 - Term 170476 - 170526 11.4 137 64 Op 1 12/0.000 - CDS 170550 - 170903 479 ## COG0853 Aspartate 1-decarboxylase 138 64 Op 2 . - CDS 170926 - 171774 835 ## COG0414 Panthothenate synthetase - Prom 172021 - 172080 8.0 139 65 Op 1 . + CDS 171862 - 172845 662 ## COG0297 Glycogen synthase 140 65 Op 2 . + CDS 172869 - 174359 1173 ## BF0922 hypothetical protein + Term 174386 - 174438 13.2 - Term 174656 - 174705 10.4 141 66 Op 1 2/0.000 - CDS 174746 - 176134 1149 ## COG1449 Alpha-amylase/alpha-mannosidase 142 66 Op 2 4/0.000 - CDS 176147 - 177421 1059 ## COG0438 Glycosyltransferase 143 66 Op 3 . - CDS 177436 - 179376 1614 ## COG3408 Glycogen debranching enzyme - Prom 179435 - 179494 10.1 + Prom 179438 - 179497 10.8 144 67 Tu 1 . + CDS 179547 - 180227 289 ## COG0705 Uncharacterized membrane protein (homolog of Drosophila rhomboid) - Term 180217 - 180259 1.0 145 68 Tu 1 . - CDS 180265 - 180861 659 ## COG2095 Multiple antibiotic transporter - Prom 180941 - 181000 8.4 + Prom 180838 - 180897 4.0 146 69 Tu 1 . + CDS 180977 - 181720 643 ## BF0995 CRP family transcriptional regulator + Term 181725 - 181753 -0.9 147 70 Tu 1 . - CDS 181733 - 181999 226 ## BF0994 hypothetical protein - Prom 182094 - 182153 4.2 - Term 182074 - 182123 3.1 148 71 Tu 1 . - CDS 182359 - 183465 593 ## COG3712 Fe2+-dicitrate sensor, membrane component 149 72 Op 1 . - CDS 183575 - 184156 569 ## BF0992 RNA polymerase ECF-type sigma factor 150 72 Op 2 . - CDS 184182 - 185417 1057 ## COG2873 O-acetylhomoserine sulfhydrylase 151 72 Op 3 . - CDS 185449 - 186633 1066 ## COG1168 Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities - Prom 186657 - 186716 4.5 + Prom 186621 - 186680 6.9 152 73 Op 1 . + CDS 186805 - 187590 734 ## COG0561 Predicted hydrolases of the HAD superfamily 153 73 Op 2 . 
+ CDS 187592 - 189592 1667 ## COG0507 ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member + Prom 189603 - 189662 5.6 154 74 Op 1 . + CDS 189830 - 192343 2377 ## BF0987 outer membrane assembly protein 155 74 Op 2 . + CDS 192391 - 193005 766 ## COG0009 Putative translation factor (SUA5) 156 74 Op 3 . + CDS 193069 - 193482 493 ## BF0985 hypothetical protein + Term 193518 - 193571 4.1 - Term 193506 - 193557 15.1 157 75 Op 1 . - CDS 193576 - 194388 912 ## COG0363 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase 158 75 Op 2 . - CDS 194428 - 195627 1229 ## COG0426 Uncharacterized flavoproteins - Prom 195661 - 195720 2.5 159 76 Tu 1 . - CDS 195745 - 196680 825 ## COG1242 Predicted Fe-S oxidoreductase - Prom 196847 - 196906 5.1 - Term 196943 - 196987 5.4 160 77 Op 1 . - CDS 197014 - 197856 848 ## BF0902 putative transmembrane and transcriptional regulatory protein 161 77 Op 2 . - CDS 197866 - 198594 546 ## BF0979 hypothetical protein - Prom 198627 - 198686 3.1 + Prom 198899 - 198958 1.8 162 78 Tu 1 . + CDS 199032 - 201242 2017 ## COG1506 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases + Prom 201259 - 201318 4.2 163 79 Op 1 . + CDS 201369 - 202235 727 ## COG0320 Lipoate synthase 164 79 Op 2 . + CDS 202300 - 203418 774 ## BF0975 hypothetical protein + Term 203546 - 203580 3.0 - Term 203198 - 203233 2.8 165 80 Op 1 . - CDS 203384 - 204088 608 ## COG0313 Predicted methyltransferases 166 80 Op 2 . - CDS 204115 - 205128 782 ## BF0895 hypothetical protein 167 80 Op 3 . - CDS 205133 - 205990 907 ## COG0623 Enoyl-[acyl-carrier-protein] reductase (NADH) + Prom 206711 - 206770 3.6 168 81 Op 1 . + CDS 206908 - 210174 3405 ## BF0893 putative outer membrane receptor protein 169 81 Op 2 . + CDS 210188 - 212230 1788 ## BF0892 hypothetical protein + Term 212326 - 212375 5.2 + Prom 212532 - 212591 5.9 170 82 Op 1 . + CDS 212778 - 216047 2952 ## BF0890 putative outer membrane receptor protein 171 82 Op 2 . 
+ CDS 216063 - 218087 1970 ## BF0889 hypothetical protein + Term 218209 - 218252 4.5 - Term 218592 - 218649 9.1 172 83 Op 1 . - CDS 218692 - 220215 1469 ## COG0519 GMP synthase, PP-ATPase domain/subunit 173 83 Op 2 . - CDS 220246 - 220686 451 ## COG1970 Large-conductance mechanosensitive channel - Prom 220790 - 220849 7.0 + Prom 220654 - 220713 7.7 174 84 Tu 1 . + CDS 220826 - 221827 1146 ## COG0057 Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase + Term 221849 - 221895 12.8 + Prom 221858 - 221917 8.6 175 85 Op 1 . + CDS 222051 - 224108 2102 ## COG0339 Zn-dependent oligopeptidases 176 85 Op 2 . + CDS 224115 - 224624 553 ## BF0965 hypothetical protein 177 85 Op 3 . + CDS 224636 - 225073 514 ## COG2131 Deoxycytidylate deaminase 178 85 Op 4 . + CDS 225151 - 226878 1664 ## COG0793 Periplasmic protease 179 85 Op 5 . + CDS 226808 - 227338 356 ## COG0212 5-formyltetrahydrofolate cyclo-ligase 180 85 Op 6 . + CDS 227371 - 228123 392 ## COG1387 Histidinol phosphatase and related hydrolases of the PHP family + Term 228191 - 228236 4.2 - Term 228179 - 228223 11.6 181 86 Op 1 . - CDS 228246 - 228536 296 ## BF0959 hypothetical protein 182 86 Op 2 . - CDS 228533 - 229645 1041 ## COG1195 Recombinational DNA repair ATPase (RecF pathway) - Prom 229744 - 229803 9.0 + Prom 229716 - 229775 7.8 183 87 Op 1 . + CDS 229806 - 230489 851 ## BF0957 hypothetical protein + Term 230520 - 230557 3.0 184 87 Op 2 . + CDS 230575 - 231069 474 ## COG0054 Riboflavin synthase beta-chain + Term 231172 - 231242 30.1 + TRNA 231142 - 231227 63.9 # Tyr GTA 0 0 + TRNA 231233 - 231308 69.5 # Gly TCC 0 0 + Prom 231152 - 231211 80.4 185 88 Tu 1 . + CDS 231370 - 232620 1224 ## COG0673 Predicted dehydrogenases and related proteins + Term 232646 - 232686 11.3 - Term 232634 - 232674 10.5 186 89 Op 1 . - CDS 232712 - 234496 1425 ## BF0953 hypothetical protein 187 89 Op 2 . 
- CDS 234515 - 237625 2463 ## BF0952 hypothetical protein Predicted protein(s) >gi|226332048|gb|ACIB01000008.1| GENE 1 1 - 454 262 151 aa, chain - ## HITS:1 COG:MT2825 KEGG:ns NR:ns ## COG: MT2825 COG0732 # Protein_GI_number: 15842293 # Func_class: V Defense mechanisms # Function: Restriction endonuclease S subunits # Organism: Mycobacterium tuberculosis CDC1551 # 7 88 9 91 91 84 45.0 5e-17 MKLTRYKLGEILELQRGYDLPSSQMKKGDILVAGSNGVIGYHNEARSNHPCITVGRSGSV GKVHYYEQATWAHNTALFVKDFKGNDPKYLYYFLKNLHLDKMFDKGSSVVPSLDRKVVHS LNVPCHKDIDCQKRIAAILSKIDRKIELNCA >gi|226332048|gb|ACIB01000008.1| GENE 2 451 - 714 229 87 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253563524|ref|ZP_04840981.1| ## NR: gi|253563524|ref|ZP_04840981.1| predicted protein [Bacteroides sp. 3_2_5] # 1 87 14 100 100 154 100.0 1e-36 MISKSGGCAQLTGVAFNDLFGAGNLVISEKDLHDGFEYIIARHKKEIEENRYATTEQKEV EKNIRELILKSIEEQINVYLKSQNRLL >gi|226332048|gb|ACIB01000008.1| GENE 3 784 - 1785 884 333 aa, chain - ## HITS:1 COG:STM3755 KEGG:ns NR:ns ## COG: STM3755 COG3943 # Protein_GI_number: 16767039 # Func_class: R General function prediction only # Function: Virulence protein # Organism: Salmonella typhimurium LT2 # 6 331 13 337 345 224 38.0 2e-58 MSNEIQFLLYSLPDEQGKVQVVIKDETIWCTQKAMAELFGIDKSGISRHISNIFKEDELQ QDTTVAKIATVVNRGIRGEVEELVDFYNLDMIIAVGYRVSSPKATKFRQWATKILNEYIK KGFVLDDERLKQGTAVFGKDYFRELLERVRSIRASERRIWQQITDIYAECSVDYDKNSPT TRDFYAMIQNRFHYAITGQTAAEIIYTKADHRKEHMGLTTWKNAPDGRILKSDVSIAKNY LQEKEIRQLERAVTGFFDYIEDLIERENTFNMSQFSESVNEFLTFRRYQILPDKGRISAS QAKAKAESEYDIFNKTQRIDSDFDKQVRGILDK >gi|226332048|gb|ACIB01000008.1| GENE 4 1778 - 3439 1136 553 aa, chain - ## HITS:1 COG:jhp0415 KEGG:ns NR:ns ## COG: jhp0415 COG0286 # Protein_GI_number: 15611483 # Func_class: V Defense mechanisms # Function: Type I restriction-modification system methyltransferase subunit # Organism: Helicobacter pylori J99 # 4 545 9 542 543 471 47.0 1e-132 MSDIKEKTLRLIDGLKATCQSYGMGNDGNEYKIITQVFLYKFLNDKFGYELKNAKSEIAK KLTGDVKWETAYENLSDDERMLIQSAISPDVPMLEPYHLIANLWNQQGKGDFDTIFDSTM 
TDIAEQNADIFSTQTTANTKIPLFEALTPFVTDSAQRAPFARALVDKLVNFSFEEAFAQN YDFFSSIFEYLIKDYNTAGGGKYAEYYTPHAIATIMARLLVGDNVDLHSMECYDPSAGTG TLLMALSHQIGEERCTIFSQDISQRSNKMLKLNLLLNGLVSSLDNAIQGDTLVSPYHKSD DGQQLRQFDFVVSNPPFKMDFSDTREKIAAMPARFWAGVPNVPAKKKESMAIYTCFIQHV INSLKKTGKGAIVIPTGFITAKSGIENKILHKIVDDKVVFGCVSMPSNVFANTGTNVSVL FFDRSATADKVILIDASKLGEEYKDANGLKKVRLNDEEIEKIVGTFQRKEAVEDFSVAVS YDEIKEKGYSLSAGQYFDIKIDYVDITEEEFNFRMADYKQILSEQFAESHRLEEEIMKQL DALQFNANVGNNE >gi|226332048|gb|ACIB01000008.1| GENE 5 3445 - 6675 2340 1076 aa, chain - ## HITS:1 COG:HP0464 KEGG:ns NR:ns ## COG: HP0464 COG0610 # Protein_GI_number: 15645092 # Func_class: V Defense mechanisms # Function: Type I site-specific restriction-modification system, R (restriction) subunit and related helicases # Organism: Helicobacter pylori 26695 # 4 1071 3 1051 1055 548 35.0 1e-155 MKQFSEATRVQMPAMVHLTRIGYTYFGKLSEDKNGTVYDSDTNILLQVFEQQFKNLNPGH EGEFLQILKDIRKELNDDDLGRGFYNRLKAVSPVKLIDFDNIGNNTFHFTAEFTCKNGQD EFRPDITLFVNGLPLCFVEVKKPNNQGGMLAESARMNKERFPNKKFRRFINITQLMIFSN NMEYDALGGIVPIQGAFYCTGARSYSPFNCFREENLSAQKIAPFHCDYPYKDIDKTAEKQ ILSDYNCQVIHTSPEYQTNLDFNTPTNRILTSMCSPERLLYIIKYGIAYVRMEREVDGKI ESTDQKHIMRYQQLFASLAIRKKLAEGVKSGVVWHTQGSGKTALSYYLTYILNDFYSKQN KVAKFYFIVDRLDLLEQATQEFEARGLVVSTANSRAELMAQFRSNQAQQGVSGQAEITVV NIQRFAEDKEKVRINDYATNFQRIFILDEAHRGYKPGGCFLANLFDADTDAIKIALTGTP LLKEERASCKVFGNYLHTYYYDKSIADGYTLKIIREDIETSYKERLSDVYDKLETLVQKK DIRKSEIIEHPSYVSELARYIMTDLKEFRKIQGDDTLGGMVICETSEQARRLYDVFQEEW QKYQPKPIKIKLSDGSYVVGEPEVDYKSKYRPLKAGIILHDTDDKETRKQIVKDFKKNMT VDILIVFNMLLTGFDAPRLKRLYFGRKLKDHNLLQAITRVNRPYPGMRYGFVIDFADIKR NFKETNEAYLQELNRFNDVDETGESAAADTFTQVIEDKEEILNQMKKVRQTLFNYTYDNA EEFSSEISTEEDKAVLLDLKQALESAKNMANIVRTFGDDEMKEQFAKLEITKLPQLLSEV QRRISIINQKEAFNTNEETKTLINEAMMDIEFTFSKIGQEEMRLISGGVELKEKWQRTIS SFTQNFDQDDPEFISLREAFMERFKEHGFVIDTIAKFNEETQALDEIIGRLQDLQKRNNV LLKKYKGDEKFARVHKRIREVNKQREDKGQKPMFSFLDEEIAAILNIIKEDVDAKVYDRN DILKKDAYFGRTVMALINGCLFHFPQIKPEMEDYKFIQTRISQQYINQYNATYGIA >gi|226332048|gb|ACIB01000008.1| GENE 6 6677 - 6880 161 67 aa, chain - ## HITS:1 
COG:no KEGG:BT_4515 NR:ns ## KEGG: BT_4515 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 67 79 145 145 120 91.0 2e-26 MKLNRIKAVLLEKGISQTWLAKKLDKSFSMVNAYACNRIQPNLETLQQIAEILQVDLKDL ITDKKDR >gi|226332048|gb|ACIB01000008.1| GENE 7 6978 - 9431 1033 817 aa, chain + ## HITS:1 COG:no KEGG:RPB_3532 NR:ns ## KEGG: RPB_3532 # Name: not_defined # Def: hypothetical protein # Organism: R.palustris_HaA2 # Pathway: not_defined # 15 804 12 789 792 175 22.0 6e-42 MEIREAIIHALDGDAILFIGSGFSLGAINEGNKKIETATPLAHKLLAECDFEEKDFTNDL GIASRIYQSAKSEIDLIEFLRKEYTAIDVTPEQEIIAQINWQRIYTTNYDNVFELACEKN KKKIQSVTLSDRPNDFKNKSNLCIHINGDIKRLTQEKLNSEFKLTNVSYLTEDFNKSEWL TLFRSDLQTAKSIIFIGYSMQYDLDLQRIVYLTPKLIDKTFFIISEQASKTEQALIKTFG MPFPIGIKKLTEHINEIKKNHVHITKLPDSYLCFSKPKIKDFPSSILDIDEFNLLVRGEY NIDNLYYSTINPTDFVYSIHRSKHEEVINLIKTGEHNILIHSDLGNGKSIFMTTLTAFLS KEGYNVLRFNKYYTTYNREIEQICQKEGQHILVFDDYMSYIDCLKELKIHRTNQILILSE RSAMNDIYYNSLCDLFGDFHNIDLNRLDSNEIKQFVNILDHYGFWNKFSAERIDRKEDYI KTVCKGQIKNIILKLLNSKTILESFQKLITSIRKRNGYYEAILFILIARVSKLDLDLEDL AYSLNMSQLNSPSFQKDPHVREFVDFNTYSIKSKSSIISQVLLQQIFDSTIVVDVMLSIF RNLNAHRHDEKIKRILKNMMMFTNIQQTINKDDANYKHNILRYYENIKPLSSCNKNPHFW LQYAIVKLSEYDYEQAQIYFDTAYSFAKKIENFDTYQIDNHYARFIIENEIKFGTKATCM QAFSYAHSILMDPKHKTEVRYYPYRVAQNYYPFYERFYKELSHKEQEIFIQSCFEILKRL KSYLETTTTASDRTDVKKSEKNLLRIFKELNITYETK >gi|226332048|gb|ACIB01000008.1| GENE 8 9412 - 9567 76 51 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSYEGWQNPFLKKYPCHSYLYRGKSYYQDIMTSFKSTNRKYLIILFIWFHK >gi|226332048|gb|ACIB01000008.1| GENE 9 9683 - 9955 102 90 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301162168|emb|CBW21713.1| ## NR: gi|301162168|emb|CBW21713.1| putative transmembrane protein [Bacteroides fragilis 638R] # 1 90 10 99 99 167 100.0 2e-40 MVLHIRVVPPLLSSFVECPLSDSMYWMIGHFRRYVPERLLNAKVVVFLGFLWFTVDFYSL KGVGWWKIGHKTGYLLWNIYDSLEKMWGNR >gi|226332048|gb|ACIB01000008.1| GENE 10 10166 - 11050 605 294 aa, chain + ## HITS:1 COG:BH1510 KEGG:ns NR:ns ## COG: BH1510 COG1028 # Protein_GI_number: 
15614073 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) # Organism: Bacillus halodurans # 55 292 6 241 243 192 46.0 8e-49 MADNYIERQYEQYETRKAAWEKARKYGKKKTGITHPARTEQPGQTTTEPHHYKRVFVTGG ANGIGKAIVEIFCKSGYRVAFCDKDEIAGKRTAEETGAIFHQVDISDKDMLEHCMQSIIE EWDDIDILINNAGISDFSPITETSIEDFDRILSINLRPVFITSRFIAIHRQSQTTSNPYG RIINICSTRYLMSESGSEGYAASKGGIYSLTHALALSLAQFHITVNSIAPGWIQTHDYDR LRPEDHAQHPSRRVGKPEDIARMCRFLCEEGNDFINGENITIDGGMTKKMIYTE >gi|226332048|gb|ACIB01000008.1| GENE 11 11260 - 15126 3552 1288 aa, chain - ## HITS:1 COG:alr4773 KEGG:ns NR:ns ## COG: alr4773 COG1501 # Protein_GI_number: 17232265 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-glucosidases, family 31 of glycosyl hydrolases # Organism: Nostoc sp. PCC 7120 # 457 761 466 759 779 134 30.0 1e-30 MKNAIGNCRQRKAVLFALALPLMFSGSPAQAMHRSEIVREVTQQNLKIVSAKKINPTTIE VLFSNNQRMTFDFYGENIFRVFQDNAGGIIRDPEAKPEAQILVNNPRNTVSTLNLNDGSN LISITTGKIKVEIDKNTSLMKVIDLEKNTVAFEEVEPVLFDKGKVTVTLKENPNEYFYGG GVQNGRFSHKGKAINIVNENSWTDGGVASPAPFYWSTNGYGMMWYTFKPGKYDFGADEKG KVKLTHDSPYLDLFYMVSDGAVGVLNDFYQLTGNPVLLPKFGFYQGHLNAYNRDYWKEDE KGILFEDGKRYKESQKDNGGIKESLNGEKNNYQFSARAVIDRYKNHDMPLGWLLPNDGYG AGYGQTETLDGNIQNLKSLGDYARKNGVEIGLWTQSDLHPKEGVSALLQRDIVKEVRDAG VRVLKTDVAWVGWGYSFGLNGVADVGHIMPYYGNDARPFIISLDGWAGTQRYAGIWSGDQ TGGVWEYIRFHIPTYIGSGLSGQPNISSDMDGIFGGKNMIVNTRDFQWKTFTPMQLNMDG WGSNEKYPHALGEPATSINRWYLKLKSELLPYTYSFAKEAVTGMPLIRAMFLEYPNAYTL GTATQYQFMYGTDFLVAPIYKATKADAEGNDIRDGIYLPEGEWIDYFTGEKYQGNCVLNN FAAPLWKLPVFVKNGAIIPMTNPNNNVAEINKGLRIYEIYPYKHMMTVEYDDDGISEAYK EGKGTTTFIESNVDSKNNVKISIRPTQGDFDGFVKEKATEFRVNVTAKPKKVSAQIGKGK VKLTEVSSMDDFRKGENVYFYDAAPNLNKFATKGSEFEKKVITKNPQVLVKLAATDITKN QVVMDIEGFQYAPADNYRVTSGSLTAPAARIAAEDIEAYTLKPTWNKVPNADFYEIEFNG MLYTTIKDTELLFDGLAAETDYTFKIRAVNKDGYSDWAEFGAKTKANPLEFALHGIKGET 
TAKNQEGFDIDRLFDFAELGDMWHTKYGAKALPYDMIIDLRTVNQLDKFEYLPRTDGGNG TILKGTVYYSMDKENWTEAGAIDWKRNGDVKVFTFTERPTARYIKLAVTEGVNNYGSGRE LYVFKVPGTESRLQGDINNDGKIDNNDLTSYTNYTGLRKGDSDYEGYISVGDIDQNGLID AYDISVVATQLEDGVSEEPIEKLDGTIEISTAKRNYSKGDVVEVLVKGVNLRSVNALSFA LPYNQQDYEFVGVEPLNLKAMENLTYDRLHTNGTKALYPTFVNLGAKEALEGTNDLFILK LKAKRAVKFDLKAIDGVLVDKNLNTRKF >gi|226332048|gb|ACIB01000008.1| GENE 12 15583 - 17577 1603 664 aa, chain + ## HITS:1 COG:PM0698 KEGG:ns NR:ns ## COG: PM0698 COG2189 # Protein_GI_number: 15602563 # Func_class: L Replication, recombination and repair # Function: Adenine specific DNA methylase Mod # Organism: Pasteurella multocida # 2 664 3 636 636 396 38.0 1e-110 MEKLKMESVSIAEDSLNKIAELFPNVVTESMGKDGQLHKAIDFDKLKFLLTANQAEMGVV YDDDERYELTWVGKKQAIREVAHPIRKTLRPCPEESRNWEQTQNLYIEGDNLDAMKLLKK SYAGKVDVIYIDPPYNTGKDFIFNDTFALSQEESDEKQGRYNEEGQRLFQNTEANGKFHS DWCSMMYARLMLARTLLNDNGIIFISIDDHELANLIKIGNEVFNASNFIDVFNWAKTETP ENLSKKSKQIIEYIVCYQKKKNDMKFQGLKKESVSSNGLLNQPNSVGILTFPANKVVTSI PDGVIKAGMYGTDAYDVELLEDTTVRGGLFTAPVKLKAKFKWSQANLDKEIQKGTTIKIP TLKLSPSYEKLEYDPEVPPNLINYKVGVETNEQAGNHQLQFFDKKVFNFPKPVSLIQYLC EFIDTKNKDCIVMDFFSGSGTTAEAVMRMNMKPRKNKVKYILVQLPEDVTETIKKAKTPS EKEIMQNAIDFLTENHKALNICELSKERIRRAGDTIEAECNQRKSKDLPDIGFRVFRIAD SNMKDVYYSAKEYSQSDLFYFTDNIKEDRTGLDLLYGCLTNLGLSLSLPHDEEDIHGYTV YSVDKTELMACFAEQIPEKVFREIAGRQPRRVVFRDASFRDSSDRINIDEIFKTLSPGTT IEIL >gi|226332048|gb|ACIB01000008.1| GENE 13 17599 - 20631 2532 1010 aa, chain + ## HITS:1 COG:PM0699 KEGG:ns NR:ns ## COG: PM0699 COG3587 # Protein_GI_number: 15602564 # Func_class: V Defense mechanisms # Function: Restriction endonuclease # Organism: Pasteurella multocida # 1 1005 1 1041 1043 706 41.0 0 MKLKFKHQKFQEDAAKAVCDVFGGQPYKIFDYQVETRKKDGQTSFEKFTGFRNHPIVPQL TDEIVLKHIRDIQRAQQIKPSEALEGKYNLTIEMETGVGKTYTYIKTIFELNKRYGWCKF IIVVPSVAIREGVHKSLEIMKEHFASDYSTPLSYFIYDSKQLGELNAFVTDSKIHVMIIN SQKFNATNKDARRIYMKLDDFGGNCPIDVIAQMNPILIIDEPQSVEGAKTKEGLKRFNPL FTLRYSATHRELYNLVYRLDAMEAYNLQLVKKIAVKGISISGTTATEGFVYLEGLNLYPD 
KNPTANIGFEVKRTKAVNQVVRALKINDDLYAKSNHLEEYRNDYVITDINGVEDSVTFRN GIKLYAGDVAGSVNETQLRRIQIRETILSHIEKEQELFEKDIKVLSLFFIDEVAKYRRYN PDGKGEYAEIFEQEYTDIIKHLDPSLFNQPEYIDYLKSTVASKAHKGYFSKDKKGKLIDS KTERGTKESADEDAYDLIMKNKERLLDRKEPIRFIFSHSALREGWDNPNVFQICTLKQSS AEVRKRQEVGRGLRLCVNGQGDRMDANVLGVEVHRVNLLTVIASESYESFAKGLQTEMAE AIADRPQKVTIQLFKDQSLRLPNGETIIATEDIAQSIYDSLLENKYIKKGELTDKFYEDR KQGEVIFDDELTDYKASIMTILASIYNPREMQPNDARKSKINLRLSKDKLENSKLQELLK LLCSKSTYTVKFDEKELVERAIESLNEKLRVSQLYLSVITGQMEKIKSKAALISGEAFKV DANQAHYEKIDAMANDQVKYDLLGKLTDATNLTRQAVAQILSRIKPNVFGQFKNNPEDFI IKASELINEEKACLIVKHIEYTPIDQYYDVSVFTRATIQGRLGVNTIKADKHLYDHVRFD SQNEKTFMERLEENDEIEAYVKLPANFYIPTPMGKYHPDWAIVFKQKLSKYPYFIAETKA SDSSLQDRRIEEAKIECAKKHFAKTNGGKLKYNKVSSFEELLKIVTQESV >gi|226332048|gb|ACIB01000008.1| GENE 14 20647 - 22266 264 539 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit [Lactobacillus helveticus DPC 4571] # 327 538 72 281 285 106 32 1e-21 MFHSFQTSIAGIELPRLFTYPFHYTPHPLCVMAAGEVQAYINKQTRWKEELDKGKMFGVL IVRTSNGQTGYLAAFSGNLCGSNSHSFFVPPVYDLLKPDGFFKIEEEQISAINHQIGQLQ NCDRYLELQQKMERETASSQQALSEARKVLKAAKEKREQRRLHRPNENEQAAMIRESQYQ KAEFKRLERYWKEQISEIKTELESFSSQIEALKAERRNRSAALQQKLFQQFNFLNAKGET KNLCAIFEETVQKTPPAGAGECAAPKLLQYAYLSGLSPIAMAEFWWGESPKTEIRHHGYY YPSCRGKCEPILRHMLQGLNVEPAPSERYSLSQNMPEILFEDQWLLVLHKPEGVLSVPGK SEEQSIYSLLRARYPEATGPLVVHRLDMATSGLLLAAKTQEVHRHLQAQFENRSIKKRYI ALLDGILPEEEGVIDLPICPDYLDRPRQMVNEELGKTAITRYRVMDRKNGQTRIAFFPLT GRTHQLRVHAAHPLGLNCPIVGDELYGRKAERLYLHAEYLEFIHPVSGQRMVIEKKAEF >gi|226332048|gb|ACIB01000008.1| GENE 15 22277 - 23176 466 299 aa, chain + ## HITS:1 COG:lin1092 KEGG:ns NR:ns ## COG: lin1092 COG0454 # Protein_GI_number: 16800161 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Listeria innocua # 10 142 13 143 150 109 47.0 6e-24 MIEQPSPKVYDELLSIWEEAVRSTHHFLTEADIQFYKPLIRHEYLAAVRLYIIREDSGTI AAFMGLSNDCIEMLFVRPNAHGHGYGSRLVEFAIRKKRIYKVDVNEQNAAALGFYLHMGF 
ETASRDALDATGKPFPILHLQIPPIRLRKATLEDIDLLRALFTQSVQNTCSADYNRLQIQ AWTGRGTLQRWHELFQSDLYFLLAEDSRKSQVAGFTSVNSKGYLHSMFVHPDYQRQGIAS RLLLKAEEYVRIRQGVSVYSEVSITARPFFEKHGYSIEKEQTVSVGDIEMTNFLMYKRI >gi|226332048|gb|ACIB01000008.1| GENE 16 23235 - 26093 1611 952 aa, chain + ## HITS:1 COG:no KEGG:BF1139 NR:ns ## KEGG: BF1139 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 952 1 952 952 1915 99.0 0 MLKSKYLFLSLICLLTSFRLHAQFMDYGSDPAKFKWNIARLPHYNLVYPQGNDSMAYRYA LFLENVYPHMSKTIGKPIKAKFPVILHPGNMQSNGMVSWAPRRMELITTPSSDLNNQSWD KHLVLHESRHVFQTGKVMHGIFKPLYYIIGEQAAGVASFFLPVWFLEGDAVSTETAMSNG GRGRLPEFNMVYRAQMLGGKKNYSFDKWLMGSYKNYTGTYYALGFDMTSYARQRYGADIW DKSTSRYIRNLLFEGSFKHYTGSSFKRLQHDTFDFLRAEWEKQDTCTQSPQYLSPAKETY TSYRYPQPINDSIVIAVKSELKDINSLVIINNGREKHLDYIGSINSRLSYRNGRVYWSEL VPGLRWTHQNYSIIKYYDLDKKNIKTLTPRQRYLSPAIDEQGQHIAVSRPTVEGKNQLVL IQAEKGNELAAFDVPDNAFIKELTFAGGDTIISIAVADSGIRLLQFNFGNGIWKELLKTA SANITSPVWKDGKIFFESGANGTNNIYSLNPADGQVRRMTAARFGAFDPSFGSSDGRLFF SDYQADGYRIASLPTDSMLFEKADLNRPASMPFVETLAAQEQFNLDSARLTSVDFNPKRY RKAEHTFKIHSWAPFYYDVAEAMNSGASDLSTIVKPGATLMSQNTLNTAIMQAGWYIDKG YHHGKLSFIYQGWFPVINLSVDYGDKAFNVDWTQNDKGQDITQGHYTQRNLVEAEARVYL PFNLTHNQRIRGIQPALTYYFTNNKYQEYHSRKFHNFQYILPEILFYDYRRKAQRDILPR TGYQLRLQYLKTPFNSENYGSLYAARLTTYWPGIIRNHGLMIRVGYQYQDLDNKALYLPK HLLEKPRGYHFQYQTRQQWAFKADYALPLLSPDWSIGSLIYIRRLRANLFYDLSRNQASS KSRWSNQSSYGGDLIFDWNILRMSYPLTTGIRLIQPIDYGKFQVEALFSISF >gi|226332048|gb|ACIB01000008.1| GENE 17 26528 - 27742 1314 404 aa, chain - ## HITS:1 COG:PAB1772 KEGG:ns NR:ns ## COG: PAB1772 COG1883 # Protein_GI_number: 14521092 # Func_class: C Energy production and conversion # Function: Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit # Organism: Pyrococcus abyssi # 23 401 19 399 400 166 30.0 6e-41 MESIDFGTLFQGFGTMIASGWFLASARMFLIALGFLLIYLGWKGVLEPMVMIPMGLGMVA INCGTLIMPDGTLGNLFLDPMLSDTDALMNTMQIDFLQPVYTLTFSNGLIACFVFMGIGT LLDVGFLLQKPFASIFLALCAELGTFLTVPIASGLGLSLKESASVAMVGGADGPMVLFTS 
LALAKHLFVPITVVAYLYLGLTYGGYPYLVKLLIPKCLRAIKMVEKKAPKNYDAKVKLAF SAILCAVLCFLFPVASPLFFSLFLGVAVRESGMKHIYDFVSGPLLYGSTFMLGLLLGVLC DAHLLLDPKILKLLVLGMLALLLSGIGGIMGGYIMYFIKKGNYNPVIGIAAVSCVPTTAK VAQKLVSKDNPNSFILGDALGANISGVITSAIITGIYITIIPYL >gi|226332048|gb|ACIB01000008.1| GENE 18 27754 - 28191 344 145 aa, chain - ## HITS:1 COG:no KEGG:BF1049 NR:ns ## KEGG: BF1049 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 145 28 172 172 286 100.0 2e-76 MFSLLLSTFAMMGLTFVIGFFVAGVIKLIASAADSLAFYSSHQEELARLKRIRKLHQKVA TLITESALSDEEYGSDGREDFSRGVTKHPGDNRGFYHGVSPGESERGLMDYFYPEDTRTM FLRKEEQMLQHDKKNNKTSSTNKKQ >gi|226332048|gb|ACIB01000008.1| GENE 19 28335 - 28718 127 127 aa, chain - ## HITS:1 COG:no KEGG:BF1135 NR:ns ## KEGG: BF1135 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 127 1 127 127 214 98.0 7e-55 MIKYIFCILIGIFFVYGAGYTASIEEAAELPAEVTATFVSQYAGDHSLFNDETAESKVCD AILPHSSFSRELSSSKILKLKLQTAIRLLNASLFHQSERGDTYPDFNHNFIKYSSGYYVY SLEHILI >gi|226332048|gb|ACIB01000008.1| GENE 20 28883 - 29017 98 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKNANEIMMLQYRIKRYQAMGNGTMCQLLNGKLQKLLAKQVTM >gi|226332048|gb|ACIB01000008.1| GENE 21 29237 - 30832 1159 531 aa, chain + ## HITS:1 COG:FN0023 KEGG:ns NR:ns ## COG: FN0023 COG1288 # Protein_GI_number: 19703375 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 4 525 5 495 499 251 33.0 3e-66 MLKRIPHTYTIISSVILLCAVLSWIIPAGEYMRETIDVNGISRTVIVDHSFHRVEQTPQT WQVFSSLLEGFERQAGIIAFLLIMGGAFQIMNSSRAIDTGIFSFLNFTKGLEKHRLIKIL GVNNVVISLVIILFSLFGSVFGMSEETLAFVIIIVPLAISMGYDSITGLCMVYVAAHIGF SGAVLNPFTIGIAQGLSDLPLFSGFEYRMFCWLVLTTALIVCVLRYAAVVKKHPEKSPMY HADAYWRKREKESCGEISHVTTRQAWIVYLLLLVSLGLFSIIYPISTFSVGEASVTCYAV PTLSILFAVFGWLGLRKSNQFFILTLLAFTILFLIIGVMGHGWYLPEISAIFLAMGILSG FANSEHADAIIKQFMDGAKDMLSAAIVVGLAGGIIQILQDGHIIDPILHSLASLMGEAGK IVSLGVMYLIQTLINLIIPSGSAKAALTMPIMAPFSDVIGLSRQATVMAYQFGDGFTNMI 
TPTSAVLMGALGIARIPYEIWVKWFWKILLLFIILGMVLLIPTVLFPLNGF >gi|226332048|gb|ACIB01000008.1| GENE 22 30745 - 30912 73 55 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MVLEDTSFIHYPRNGTTDSHGTFPIEWILEFRSLSFTRKMEKSRTPKNVNQNFHK >gi|226332048|gb|ACIB01000008.1| GENE 23 31072 - 33750 2557 892 aa, chain + ## HITS:1 COG:no KEGG:BF1133 NR:ns ## KEGG: BF1133 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 892 1 892 892 1843 99.0 0 MKTLTRFLIIPMLSVAFFSCSESHFLKDVAYRNQVTQDFEMKKQQLPNGELFAVFNEKLT IPEQEALMFLYAYMPTGDVTDYTGDYYLENVRLSDQARQEMPWGKEIPDDVFRHFVLPIR VNNENLDDSRRVFYNELKDRVKNLSLHDAVLEVNHWCHEKVIYTPSDARTSSPLASVKTA YGRCGEESTFTVAALRSVGIPARQVYTPRWAHTDDNHAWVEAWVDGKWYFFGACEPEPVL NLGWFNAPASRGMLMHTKVFGRYTGQEEIMYETPNYTEINVIDNYAPTAKGSVLVTDAEG QPVADATVEFKVYNYAEFYTVATKHTDQSGHASLTAGKGDMLVWASKDGRFGYSKLSFGK DNELKITLDKNAGETYSLPLDIVPPAEGANLPEVTPEQRTENDRRMAQEDSIRNAYVATF ITEEQARTFAKENKLDETETVRLLIASRGNHQTLTDFLSDAVKADKAGQAISLLKVISAK DLRDVSPEVLNDHLNNSGLPASEDFCSNVLNPRVANEMITPYKAFFRKEIPASEAEAFRK NPQALVEWCKKEITINNELNSQRIPMSPMGVWKARVADEKSRNIFFVSMARSLGIPAWID EVTGKIQYRTFNDNNLKNGKVYDVDFEAAQQTQAPTGTLVARYRPIPSLSDPKYYSHFTL SKFRNGTFQLLNYDEGDVDMGGGATWSNLLKNGTRLDTGYYMMVTGTRMASGAVLANVTF FTIEEGKTTTVDLVMRESKDQVQVIGNFNSESTYLPIGTSEPQSILQTCGRGYYVVAVLG AGQEPTNHALRDIAALSGEFEKWGRKMVLLFPSEEQYKKFRPSEFPGLPSTITYGIDVDG AIQKQIAESMKLPNSTILPMFIIGDTFNRVVFVSQGYTIGLGEQLMKVIHGL >gi|226332048|gb|ACIB01000008.1| GENE 24 33873 - 34301 323 142 aa, chain - ## HITS:1 COG:no KEGG:BF1132 NR:ns ## KEGG: BF1132 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 250 100.0 1e-65 MSLHKYSIVLLALLALLCSCHDEDKGDIPQSDERTADFIVKYKDDFGIHTDYKAKVYIYY GIYSMDIVGFHYLPDGVLDHEGKEITPDIRLSADGKEDITLLLDNAEKVTVIVESSYYEG RVGITSYSPGDTPIKGIFTFGE >gi|226332048|gb|ACIB01000008.1| GENE 25 34413 - 35477 1051 354 aa, chain - ## HITS:1 COG:SMa2355 KEGG:ns NR:ns ## COG: SMa2355 COG0389 # Protein_GI_number: 16263727 # Func_class: L Replication, recombination and repair # 
Function: Nucleotidyltransferase/DNA polymerase involved in DNA repair # Organism: Sinorhizobium meliloti # 1 330 34 363 379 335 53.0 7e-92 MDAFYASVEQRDHPELRGKPLAVGHAEERGVVAAASYEARRYGVRSAMSSQKAKRLCPQL IFVPGRMEVYKSVSRQVHEIFHEYTDLIEPLSLDEAFLDVTENKQGILLAVDIAKAIKQR IREELSLVASAGVSYNKFLAKIASDFRKPDGLCTIHPDQAIDFIARLPIESFWGVGPVTA RKMHLLGIHNGLQLRECSSEMLVRQFGKVGLLYYDFARGVDLRPVEAVRIRKSIGCEHTL EKDIHVRSSVIIELYHVATELVERLQQKEFRGNTLTLKIKFHDFSQITRSMTQAQELTNL ERILPLAKQLLKEVEYEQHPIRLIGLSVSNPREEADEHRGVWEQLSFEFSDWGK >gi|226332048|gb|ACIB01000008.1| GENE 26 35641 - 36624 555 327 aa, chain - ## HITS:1 COG:CAC1451 KEGG:ns NR:ns ## COG: CAC1451 COG2207 # Protein_GI_number: 15894730 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Clostridium acetobutylicum # 229 323 190 283 295 69 36.0 8e-12 MHKGKIKFAYKEEQKLVLILIFPIVIRQSERVSEFSGILSVPEVSTGNSELLSVTGMLPA VKTSEQKRPDKKKEKHSLVLVERNKDLCNYLVQILMKEYKIVSVCDAEAAFETVCEQCPD AVLASSVLTRISGEELAVRIKSDDRVAHIPVILLVKPGEDDRYIQRNADLYVWMPFAISS LKTEIAALIANREMIRKRYIRLALGGEASDPIDKEVESSEGDREFIRQVRSLIEERMTDS GFKIGELSDCMNMSRSSFYNKIKEITGHAPADYVRNVRLNRALVLLMSRKYTVAEVADMT GFSDPKYFGIVFKKYYGGSPTKYINNL >gi|226332048|gb|ACIB01000008.1| GENE 27 36696 - 39380 1042 894 aa, chain - ## HITS:1 COG:no KEGG:BF1129 NR:ns ## KEGG: BF1129 # Name: not_defined # Def: two-component system sensor histidine kinase # Organism: B.fragilis # Pathway: not_defined # 1 894 1 894 894 1881 98.0 0 MKYIIYFMMMLYGSLCHAIVCKHIVERSETNTRKVYQIQRDALGYMWFMNHAGISRFDGT KLKHYKLPAEGRTMDYYMGNCRLLTDNRNGLWVVTRNGYLWMYNPSLDKFECRNHLVIPN DVSLHFLCVDNSSHIWFSVGNRLIAYQILSNTFHRVDHSLATISCMVEVAPGEYFVGSDE GLFGITIKNYAVDRQTGELSGKRCSRIHEILFHPYTQRLVVFDYSEGLGVWDMKSEQLVG TWNRLLNSRVSGLRIWDDRTVLVATDGEGIFRMDIVNPDITSFIQTDFENDNSIRTNRIA DVFVDDQKLIWVADYPEGVSMIDVESPDDYKWYRARSGDSHSLTNNRVNAVLHDSDGDVW FATDHGISCFHPSTGLWNRIVTPLPCQMYTALCEVKPGEICAGNYVHGLFFIRKQSNYSV TPYVRISGVNALCRKDKDGFWIGTDEGVFFYCPENDSIVEVKRLSGLHIHALHQSDDCLY IGTEGNGLMVYHPEHEQMDTVAALGTGNVYAVWSDDSRRLMGSSDGFAFSLDLVQHSYYR 
FLSKGIRITSGTFLGNGRYILGTYQGAIEYDKQEARPLRKACLGFYLDELRVLDKEVTVE TENSPLKKALNCTATLQLEHNENTFSFTATAIRYTEKQDIAYSWKLDHTDWSTPSVDNRI RFSNLPPGEYIFSVRALSIDNGRPFAQRNMHIIIRQPLWKTGGAFFCYGLLALILGSLAV RSWFVWQDRNLSREQVRLFVNTTRNLCLPLTLIKVPLEYLYEKSSSELVSNVLQQIKGVN NLLAELENISRVSAAPGRLSLADYELSIFLKETVARIRDYISEKDIMLRWTEEPAFATVC LDKDKMSAILRNLLMAFTDSMDRGGEILLSTSCNNQKWELRLESEDNGFLKKKF >gi|226332048|gb|ACIB01000008.1| GENE 28 39895 - 40242 97 115 aa, chain + ## HITS:1 COG:no KEGG:BF1040 NR:ns ## KEGG: BF1040 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 115 1 115 115 218 100.0 4e-56 MKLIQYISTTYVREELYNPDNWDLINPKSPPEFISSGGPFCHKVELEGFEPSSKRGNHKL STCLSLPKFFVQEQNQSHQFLPYPLKFHQKRKATSDYPRCNCTTEPECFGATASE >gi|226332048|gb|ACIB01000008.1| GENE 29 40613 - 41767 919 384 aa, chain + ## HITS:1 COG:BS_ykgB KEGG:ns NR:ns ## COG: BS_ykgB COG2706 # Protein_GI_number: 16078366 # Func_class: G Carbohydrate transport and metabolism # Function: 3-carboxymuconate cyclase # Organism: Bacillus subtilis # 41 383 8 346 349 212 36.0 1e-54 MVKKIISICAAGMIVASCSPKKTTAQPTDPSTTDSELTMLVGTYTSGNSKGIYTFRFNEE TGESLPLSDAEVANPSYLIPSADGKFVYSVNEFSKDQAAVSAFAFDKEKGTLHLLNTQKT MGADPCYLTTNGKNIVTANYSGGSITVFPIGQDGALLPASDVIEFKGSGPDKERQTMPHL HCVRITPDGKYLLADDLGTDQIHKFNINPNANADNKEKFLTKGTPEAFKVAPGSGPRHLI FNSDGKFAYLINEIGGTVIAFRYADGMLDEIQTVAADTVNAQGSGDIHLSPDGKYLYASN RLKADGVAIFKVDETNGTLTKVGYQLTGIHPRNFIITPNGKYLLVACRDTNVIQIFERDQ ATGLLTDIKKDIKVDKPVCLKFVD >gi|226332048|gb|ACIB01000008.1| GENE 30 41886 - 43070 797 394 aa, chain + ## HITS:1 COG:no KEGG:BF1125 NR:ns ## KEGG: BF1125 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 394 1 394 394 796 100.0 0 MKRHVFLLVTLFTMSTVAAQQQPIISPKDSIPSVIERVTGKENKGFSAHMNLQLYTSCAA SFTENELDEVAFKLNRFKLEIIGNINRKFSYHFRQSFNKYSNPFALDNLSSSVEYAYLTY HLSDRFSITAGKQFLMLGGYEYYVNPIKVREFSEFNNYVNCFLAGVSATWNVTPTQELNF QIVNNRNGGDADTYLHGLPTDVEATKVPLISTINWNSYYLDKAIQLRYAASWGQQAKGRN 
IMYLTAGNVYEKGPWIAYMDFMYSRQGIDNKGIISALPRIDLENPQTAQHTEYFTTIANV DYRFHPNWNAYLKGIYESGKIYKANGIFEKGTYRRTWCGQVCVEYYPMRNSELLIFLHYQ YKRNKLLKPARNLDAIDPNTQRISLGLVYSIPVF >gi|226332048|gb|ACIB01000008.1| GENE 31 43009 - 43740 404 243 aa, chain - ## HITS:1 COG:SMc00539 KEGG:ns NR:ns ## COG: SMc00539 COG0739 # Protein_GI_number: 15965497 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Sinorhizobium meliloti # 63 188 281 412 413 93 39.0 4e-19 MKNIFTLLILSVCFLCANISGRAQNKFSDMEVNHVRVATPGLFSKENCVMLDLKSLSRNY SFPLPGGKVISGYGTRGGHSGDDIKTCARDTIRAAFDGVVRMAKPYGAYGNVIVIRHPNG LETVYSHNVKNLVKSGDVVKAGMAIGLTGRTGRATTEHLHFETRINGQHFNPGLIFDMKK GTLRTDYLQCTKKGKGIVVKALKSEKVLPKYKTLSPFLYELPGIKKPVWNIPALARSAAY SGL >gi|226332048|gb|ACIB01000008.1| GENE 32 43752 - 44417 579 221 aa, chain - ## HITS:1 COG:SSO0658 KEGG:ns NR:ns ## COG: SSO0658 COG3382 # Protein_GI_number: 15897568 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Sulfolobus solfataricus # 48 208 46 210 224 87 37.0 2e-17 MYTIIVSKELKEACPVFAGVAIYAEVKNTSYCEGLWEEIQSFTEMLTATTRLEDIKKQPV IAATREAYKRCGKDPGRYRPSAEALRRRLMRGIALYQIDTLVDLINLVSLRTGHSIGGFD ADKIAGTGLELGIGKMNEPFEGIGRGVLNIEGLPVYRDAVGGIGTPTSDNERTKMGLETT HILAIVNGYNGKEGLQEAAEMIQTLLKKYADSDGGTITYFE >gi|226332048|gb|ACIB01000008.1| GENE 33 44439 - 45224 198 261 aa, chain - ## HITS:1 COG:BMEI0486 KEGG:ns NR:ns ## COG: BMEI0486 COG1496 # Protein_GI_number: 17986769 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Brucella melitensis # 13 259 24 262 265 125 33.0 1e-28 MLGYGLLGAYPNISHFVTTRHGGYSEGAYASFNCSPFSGDELERVEKNQTLLFQSLSQAP RHLIISFQTHGTKILPVDEKFLGASGQQQQEMLNGIDALITTEPGCCICISTADCIPVLL YDRVHHAVAAVHAGWRGTVEYIVGHTLEKMRAVFGTEGQDVIACIGPGISLQSFEVGDEV YETFRLNGFDMSRISFRHSVTHKYHIDLWEANRQQLLDFGVPGVQIEIADICTYIRHEDF FSARRLGIKSGRILSGIMINS >gi|226332048|gb|ACIB01000008.1| GENE 34 45248 - 46408 727 386 aa, chain - ## PROTEIN SUPPORTED ## NR: 
gi|149915191|ref|ZP_01903719.1| 50S ribosomal protein L27 [Roseobacter sp. AzwK-3b] # 6 331 3 329 345 284 46 2e-75 MAESNFVDYVKIYCRSGKGGRGSTHMRREKYTPNGGPDGGDGGRGGHVILRGNRNYWTLL HLRYDRHAMAGHGESGSKNRSFGKDGADKIIEVPCGTVVYNAETGEYVCDVTEHGQEVIL LKGGRGGLGNWHFKTATRQAPRFAQPGEPMQEMTVILELKLLADVGLVGFPNAGKSTLLS AISAAKPKIADYPFTTLEPNLGIVSYRDGQSFVMADIPGIIEGASEGKGLGLRFLRHIER NSLLLFMIPADSDDIRKDYEVLLNELKTFNPEMLDKQRVLAITKSDMLDQELMDEIEPTL PEGIPHVFISSVSGLGISVLKDILWTELNKESNKIEAIVHRPKDVSRLQQELKDMGEDEE LDYEYEDDGDEDDLDYEYEEEDWEDK >gi|226332048|gb|ACIB01000008.1| GENE 35 46498 - 47067 677 189 aa, chain - ## HITS:1 COG:CC1269 KEGG:ns NR:ns ## COG: CC1269 COG0563 # Protein_GI_number: 16125518 # Func_class: F Nucleotide transport and metabolism # Function: Adenylate kinase and related kinases # Organism: Caulobacter vibrioides # 2 187 1 186 191 174 46.0 9e-44 MLNIVIFGAPGSGKGTQSERIVEKYGINHISTGDVLRAEIKNGTELGKTAKGYIDQGQLI PDELMVDILASVFDSFKDSKGVIFDGFPRTIPQAEALKVMLKERGQDISVMLDLDVPEEE LMTRLIKRGKESGRADDNEETIKKRLVVYNTQTSPLKEYYKGEGKYQHINGLGTMEGIFE DICKAVDTL >gi|226332048|gb|ACIB01000008.1| GENE 36 47123 - 47659 743 178 aa, chain - ## HITS:1 COG:DR1376 KEGG:ns NR:ns ## COG: DR1376 COG0634 # Protein_GI_number: 15806393 # Func_class: F Nucleotide transport and metabolism # Function: Hypoxanthine-guanine phosphoribosyltransferase # Organism: Deinococcus radiodurans # 13 173 10 171 176 126 38.0 2e-29 MDTIQIKDKLFTVSIREQEIQKEVIRVANEINRDLAGKNPLFLSVLNGSFMFTADLLKHI TIPCEISFVKLASYQGVSSTGSIKEVIGINEDIAGRTIVIVEDIVDTGLTMQRLLETLGT RGPKEIHIASLLVKPDKLKVDLNIEYVAMNIPNDFIVGYGLDYDGFGRNYPDIYTVVD >gi|226332048|gb|ACIB01000008.1| GENE 37 47910 - 49970 1583 686 aa, chain + ## HITS:1 COG:ECs3047 KEGG:ns NR:ns ## COG: ECs3047 COG4771 # Protein_GI_number: 15832301 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Escherichia coli O157:H7 # 40 684 31 657 659 145 24.0 3e-34 MKVKLALLLTLIGTLPLAAQNVRQEQDTVSYMNDDPFNLEQIVVTATRTEKKIKNTPVIT 
QIITSKQIEERGTGNIQDLLTQEVPGLNFQEVGYGTSIDIQGLGSKHILFLIDGERIAGE NGGNIDYSRINLYNIDHIEIVKGASSALYGSQAMGGVINIITRKAKKKFEASAGIRYAGR NQQNYKDTPKDHSQYKYRIHLDKPNLNTNLSLGLNLGKFTMNTDVLYKSFDGYQLFDKKP LVKYFPAYNTTITEELSKTPTSISGYEDVQVAHKMDYRFSKRLKVQLKGSYYMLNKYDFQ ADNIFEKSEDYTYGGSIDYTISDKSSLVASVHTDHYNRYDKYELKSGRRLEYKNNIIQPR IVYSTTALDKQTITGGLEYYRESLFSDKFETGVKENKSQWYATAFLQDDWSINKQFSVIA GLRCDYHEKYGTNLTPKASVMYKIFPFTVRFNYARGYRSPSIKELYMNWDHLGMFWIYGN SKLKPETNNYISLSGEYVNSWININANVYSNWFRNKIEGMWSNDQTELHYINIGKSRLAG VETMCKIQINRHINVHGAYNYLYTSKDADGVRLSSSSPHSGNIRAEYNTRIPRYATVVNL SGNIMGKKKFDVLDELEIDGKKVEAYYQAKVNPYCLWDLTVSQYIMQNLRITAGITNLFD YTSDRVTFNTSTSPGRNYFIACNYTL >gi|226332048|gb|ACIB01000008.1| GENE 38 49987 - 50631 599 214 aa, chain + ## HITS:1 COG:no KEGG:BF1116 NR:ns ## KEGG: BF1116 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 214 1 214 214 384 100.0 1e-105 MNNKNKFRFAILLFGVLSAFIITACSDNNSPDDPSQGENTLPVKQVSLSRKTAYGNDWIY YSLEKGKEVSVSEESHAENTDWDIAFNRYNVRTNSGASGKGKGGALLTNIKDLAACTTVP QGTFTVDAAYTITAPGTGFPPPTMESTANEVLCKAITFAGPPPTYTPSDYVFIVRTASGK YAKLKAKSFYDDEGKSGIYSFEYAIQPDGSTNLN >gi|226332048|gb|ACIB01000008.1| GENE 39 50645 - 52159 557 504 aa, chain + ## HITS:1 COG:no KEGG:BF1115 NR:ns ## KEGG: BF1115 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 504 1 504 504 1039 100.0 0 MKNFWKKYHKWVGLFFSFFILMFCFSGIVLNHRTLFSKAEVSRNWMPESYHYKNWNNGII KGTLRLPDGKILAYGNAGVWKTDSCFATFADFNRGLAEGIDNRKISNIVRVANNDIWCAG LYSIYLLNHDSWKEYPIAGNDERISDITQRGDTLVILTRSYLYTSVSPYDEFRKTELKTP ENYSPKTSLFRTIWLLHSGELFGTPGKLAVDFLGVVLIVLSATGIIYTLLPPFIRRRHRK RLPVKTQAKALKTSLNWHNKLGTWLIGLTLLLSVTGMCLRPPLMIPFVLVNTRPVPGSTL DSDNPWHDKLRSIRWDASRNVWLLSSSMGFYRINDLQLPPVKLKQTPPVSPMGVNVFHPQ SPDEWLIGSFSGLFVWNPSTGTVLDYYTGQPPAAVHGRPLGGSLVNGFTDDLVTREVIFE YDNGARNKENNLVLPAMPDLIKQQPMSLWNFCLELHVGRCYSPFLGVFSDLFVFISGLLL TLILISGYIVYKRHHKRSKKIRMH >gi|226332048|gb|ACIB01000008.1| GENE 40 52260 - 53216 457 318 aa, chain - ## HITS:1 COG:PA3145 KEGG:ns NR:ns ## COG: PA3145 
COG0472 # Protein_GI_number: 15598341 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Pseudomonas aeruginosa # 4 267 9 289 339 88 30.0 2e-17 MYYLIILVLLFLAELFYFRIADKCNIIDKPNERSSHTRITLRGGGIIFYFGALAYFLTNH FEYPWFMLALSLITFISFIDDIRSTSQGLRLVFHFTAMALMFYQWGLFSLPWWTILVALI ICTGIINAYNFMDGINGITGGYSLIILIALAYINRIYVPFVEPDLIYTMLCAVLVFNFFN FRKQARCFAGDVGSVSIAFVILFLIGSLIIKTENFGWLILLAVYGVDSVLTIVHRLMLHE NIGLPHRKHLYQIMANELRIPHVVVSLVYMIAQIIIIIGYLYCRNYGYWYLLGCILLLSG IYIVFMHKYFHLHLLSKR >gi|226332048|gb|ACIB01000008.1| GENE 41 53220 - 54239 637 339 aa, chain - ## HITS:1 COG:FN1694 KEGG:ns NR:ns ## COG: FN1694 COG0451 # Protein_GI_number: 19705015 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Fusobacterium nucleatum # 1 275 4 250 290 106 30.0 5e-23 MNILISGIHGFVGSNFIRALKDKHTLYGLDIVSPAKEGVVTTFSWQDFEPTSFPFQTLPQ FDAIIHLAGKAHDTKNQSAAQSYFDINTGLTQKIFDFFLESSAKKFIFFSSVKAAADSVV GDVLTEDVIPTPVGPYGESKIKAEEYIKNHFMFPTSSISEDRSLRLEKEKGRIPKNKQVY ILRPCMIHGPGNKGNLNLLYNVVKKGIPWPLGDFDNRRSFTSIDNLCYVIEGLLNQDVPT GIYHMGDDEALSTNELIGIMCEAMGKKPHIWKMNKRVMEGCAGLGTLMHLPLNTERLRKL TENYVVSNAKIKAALGIDKLPVTAKEGLMKTIRSFEETK >gi|226332048|gb|ACIB01000008.1| GENE 42 54236 - 55000 259 254 aa, chain - ## HITS:1 COG:jhp0094 KEGG:ns NR:ns ## COG: jhp0094 COG0463 # Protein_GI_number: 15611164 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Helicobacter pylori J99 # 1 249 2 247 260 232 47.0 5e-61 MKISLVTVTFNSAKTLCDTIHSVLSQSYTNIEYIIVDGLSNDETVTLIKEYEPLFQGRLK WISEKDKGLYDAMNKGIRMSTGDIVGIINSDDFYHRGDVLEKVAESFEAGETEAIYGDVR FVNPDNLDRTVRYYSSKRFVPSLFRFGFMPAHPTFFTYRKYFDQFGYYKTNYKIAADYEL LVRFLYVHRLKSKYLPLDFMKMRTGGASTASIKSNILLNKEIVRACKENGIWTCYPLLLL KYLVKVFELIFIKK >gi|226332048|gb|ACIB01000008.1| GENE 43 55017 - 56240 333 407 
aa, chain - ## HITS:1 COG:RP336 KEGG:ns NR:ns ## COG: RP336 COG0438 # Protein_GI_number: 15604204 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Rickettsia prowazekii # 83 405 78 407 407 113 27.0 6e-25 MKTLFQINVVANSGSTGHIAEELGRLVIASGWKSYIAYGRWACPSQSMLIRVGTRFDLFV HGLKSMFFDRHGFGSRRATLHLISKIEKIKPDIIHLHNLHGYYLNCEVLFDYLSTAKIPV VWTLHDCWSFTGHCVHFQNIGCEKWKTGCFACPNIRDYPKALGCDNSRMNFIEKKRLFTS VERMMIVPVCNWLSDMLSKSYLSQIKRQTIVNGIDLEMFTPKGNRDVIKSNLGVGTRYMI LAVATVWGITKGFDDLIYLNSLLPDKSVIVVVGVTTKQIKKLPGNMIGIERTENTSQLVE IYSAADIFINPTYQDTLPTVNIEALACGTPVITYDTGGSADIVDSDTGMVLKRGDIHTLL EKILEIRERGKESYIIKCRQRALQFFDKSKQLSYYLSLYDQLLNKER >gi|226332048|gb|ACIB01000008.1| GENE 44 56295 - 57074 333 259 aa, chain - ## HITS:1 COG:BMEI1603 KEGG:ns NR:ns ## COG: BMEI1603 COG0726 # Protein_GI_number: 17987886 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted xylanase/chitin deacetylase # Organism: Brucella melitensis # 80 199 67 181 237 68 33.0 1e-11 MNDFRSLIRNSVLDILGIFSRPQNGIHILNGHMICRGVANDQAKYYFSYQLKELSRHVRF IRVEEATSLILNNESVDEPLVAFTFDDGFMECHSMIAPVLEQFGVNAAFFINPNFANGDD VYIQNFTNNIVLTPGKTPMRWKEIRDLHERGHIIGAHTMDHYMINDSNWVELDKQIGCCK SVIEQELSTSCEYFAFPYGRLEHANQSSIDIACKYYKYVFSQSDYKHYFSFGGRVINRRH FEPFWPVKHVSYFLSCHKK >gi|226332048|gb|ACIB01000008.1| GENE 45 57067 - 58200 551 377 aa, chain - ## HITS:1 COG:no KEGG:BF1021 NR:ns ## KEGG: BF1021 # Name: wcfG # Def: putative glycosyltransferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 377 1 377 377 744 99.0 0 MADYNSSICCIFNIGTHYRNPIYSKMSSELPCDFYFGDRLLTPIKKMDYIQLNHFRSELH NKYLFSQFYWQSKSVRLVFKPYTYYVLDGEPYCLSSWVILFWAKLLNKRTVAWTHGWYGR ESIVKKVIKKLFYSLFSELMVYGEYAISLMSKEGFDKSKMVCIANSLDYDNQLKIRLNLS PSSIYSTHFSNSYPVLFYIGRVQKSKKLEYIIQAMDILKQKGFPVNLVVVGKDVDGVHLD CEIAKYNLGSHVWLYGPCYDEMRIGEMFYNADVCVSPGNVGLTAIHSLTYGCPVITHNNF PFQGPEFESIIQGKTGDFFQENDVNSLADTIQKWLSQNLHSREAIRQFAYQTIDTKWNLY YQMNILKQVFLKQAKDE >gi|226332048|gb|ACIB01000008.1| GENE 46 58187 - 59464 729 425 aa, chain - ## HITS:1 
COG:PM1003 KEGG:ns NR:ns ## COG: PM1003 COG0677 # Protein_GI_number: 15602868 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetyl-D-mannosaminuronate dehydrogenase # Organism: Pasteurella multocida # 1 425 3 424 424 476 55.0 1e-134 MEELKIAVIGLGYVGLPLARLFSTKYECIGFDLNQSRVDALNSGHDMTLEVSDDLLRKAL ENGFKCTSNIDEISNCNFYVVAVPTPVDLNNRPDLTPLIGASTTVGKVISKGDIVVYEST VYPGVTEEECLPVVEEVSGLTYNVDFYAGYSPERINPGDKEHTVEKIKKVTSGSTPEIAD IVDSVYNSVLVNGTHKAPSIKVAEASKIIENSQRDVNIAFMNELSKIFNAMGIDTNDVIE AASSKWNFIKLKPGLVGGHCISVDPYYLIQKAQVYGVLPRIMSAARRLNDGMGDYVANQV IRLMNKKGILVKDSKILLLGFTFKEDCPDIRNTKVIDIYSTLHEYTSDITVYDAWANSAK VAHEYGISILTAGLDNLVGQFDAVVLCVGHKEFRHMNIRGFLRSDMGVVYDVKAVLSKDI IDGRL >gi|226332048|gb|ACIB01000008.1| GENE 47 59473 - 60279 291 268 aa, chain - ## HITS:1 COG:alr4493 KEGG:ns NR:ns ## COG: alr4493 COG1216 # Protein_GI_number: 17231985 # Func_class: R General function prediction only # Function: Predicted glycosyltransferases # Organism: Nostoc sp. PCC 7120 # 2 197 24 227 295 63 26.0 4e-10 MDCLDSIFKYNDISDDLEVVLVDNCSKNYLSMFGSIEEKYGNKVVLINNKVNGGYGQGNN LGVEVAKAPIILIMNPDVRLVKPIFKRILSLFERRNIAIAGMQQYESLLKRSQSFLMLRE DLCSLFLYAVYTKINKFNEKYFCISGACFAVRKSVFTKVGMFDEQMFLYGEERMLHYKIL RLGNYHIVYDSTIGYLHPKENREFSSKNFLLGYHSFIYTCDKLGLDLHRSRNKMLNMYRF LEFYSWARGDKLKVKYYKEIIKLIKNNI >gi|226332048|gb|ACIB01000008.1| GENE 48 60345 - 60581 137 78 aa, chain - ## HITS:1 COG:no KEGG:BF1018 NR:ns ## KEGG: BF1018 # Name: wzy2 # Def: putative polysaccharide polymerase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 78 291 368 368 100 100.0 1e-20 MPLYYYKYLNVYIVGLFLGALFSAIEAFSRIMDFFVIYSFILISMRLSTINSFKIRFSIM FSIVLIQLLFVYRIIIGL >gi|226332048|gb|ACIB01000008.1| GENE 49 60727 - 60918 89 63 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVSDIFISNYILFFPYYCPFRNYCSYYFLYIKENDNEIVFVTFLVDLYNFIWKGKLLDCR YVR >gi|226332048|gb|ACIB01000008.1| GENE 50 61459 - 61689 117 76 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MVRIILAGHPTAIELAGIFFVTTACAATIELSPIVTFGKIEAPSPIHTLLPITTGPLEYS SLFLTGICKSLYVVLP 
>gi|226332048|gb|ACIB01000008.1| GENE 51 62126 - 63286 634 386 aa, chain - ## HITS:1 COG:CAC2313 KEGG:ns NR:ns ## COG: CAC2313 COG0438 # Protein_GI_number: 15895580 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Clostridium acetobutylicum # 27 358 17 336 377 125 31.0 1e-28 MEKKEEYVKKKMTLIVNHPWNDVFEGKDVFLVPCYIGKIYHYDVKIVFPSNYLYEAKTIR DVELVPVSYIKKLRPFSFVFGFKLMRYLFCNARKIDLFMRFHVTITTAIMIIIYKMLNRN GIAYIKLDSNGVMNFEYKNNLKSLFRKLLYRRMFELVDFVSYETKMGLENITQQTIGIDI SSKLFYMPNGFDEDMIKHLDINVKEYSDKENVMITVGRLGTNQKNTELFLKAVENIDLKD WVIYLIGDIDPQNITFLEFVSNFFSNHPEKKKSVIFTGAISDKRHLWEYYNKSKVFVLTS RCESYGLVLNEAKRFRNFIVSTNVGAFEDLVESGKYGCEIPQDNTDYLACILEKIILGQL DIDVYNDFSPESLSYYYQVKRMKLNN >gi|226332048|gb|ACIB01000008.1| GENE 52 63262 - 63966 371 234 aa, chain - ## HITS:1 COG:no KEGG:BF1015 NR:ns ## KEGG: BF1015 # Name: wcfB # Def: putative fucosyl transferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 234 54 287 287 482 100.0 1e-135 MKGVPDGIPYYKEPFHEFSRIPYEEGKDLIIDGYFQSEKYFKRSVVLDLYRITDELRKKI WNICGNILEKGETVSIHVRRGDYLKLPHALPFCGKSYYKNAIQYIGEDKIFIICSDDIDW CKKNFIGKRYYFIENTTPLLDLYIQSLCTHNIISNSSFSWWGAWLNENSNKIVIAPQMWF GISVKLGVSDLLPVSWVRLPNNYTLGRYCFALYKVVEDYLLNILRLIWKRKKNM >gi|226332048|gb|ACIB01000008.1| GENE 53 64129 - 65664 434 511 aa, chain - ## HITS:1 COG:no KEGG:BF1014 NR:ns ## KEGG: BF1014 # Name: wzx2 # Def: putative O-antigen flippase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 511 1 511 511 818 100.0 0 MNNSRTAKGLKNAQVALLFYFINLILSFLSRKAFIEHLGAEVLGLNTTLTNILVYLNLAE LGIGYAISYALYKPLYEGDKQAVNEIISIQGWLYKRIAIIVIIMSGVILLFFPLFFAKIE IPIWYVYATFLVLLSSSIFSYFFNYKQFVLTADQKEYKLITNVQGFKVVKTILQIVVIVY FCNGYVWWLLLELLMGVITVFVLNSIVRKEYPWLQTSPKIGKDVKDKYPYIIKKTKQLFF HKIANVVLNQTSPIIIYSYTNLTMVAVYGNYMLIISGISLLINSVFSSIGAGIGNLVAEG NQAKIIQVFNELLSSRIWIVSILCFGVYQMSRPFIVLWVGDRFVLDDFYLLLMLLIAFIS LTRLVDLFIAAYGLYQDVWAAILEAFLNLGLSILFGYYWGLSGILGGVIVSLVIIALLWK PFFLFKYGFNKNCLNFYFVYMKCVAFALITFYFSIRVIDYIGIGMCTDYSSWILISMLNI 
FVYTIISFPIFFFFSDGTNRFIKRVINIVFN >gi|226332048|gb|ACIB01000008.1| GENE 54 65657 - 66178 78 173 aa, chain - ## HITS:1 COG:PM1056 KEGG:ns NR:ns ## COG: PM1056 COG0110 # Protein_GI_number: 15602921 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Pasteurella multocida # 47 167 59 185 203 96 46.0 2e-20 MKKLVYDVRKDIIPWNSRITRILFFLNKIPYLRLFSKPLLRKQFNLSNTVQFNSGFFCHA PKLKCGNYVGLSDTFILAYADVVIGNNVSFSFRNMLITSTHDVNDFNKIIASSIVIGNNV WITSNVIILAGVKIGDNTIIGAGSVVTHDIPANVFAAGNPCRVIKSIRFNKNE >gi|226332048|gb|ACIB01000008.1| GENE 55 66197 - 66745 384 182 aa, chain - ## HITS:1 COG:MA3780 KEGG:ns NR:ns ## COG: MA3780 COG1898 # Protein_GI_number: 20092576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Methanosarcina acetivorans str.C2A # 1 173 1 174 183 219 61.0 3e-57 MNIIKTAIEGLVILEPQLFKDDRGYFFESFNQREFEEKVCKTTFVQDNESKSSYGVIRGL HFQKPPFAQSKLVRVIKGAVLDVAVDIRKGSPTFGKHVSVELTEDNHRQFFIPRGFAHGF SVLSEEVIFQYKCDNFYHPEAEGAIAWNDPDLGIEWRVPCKSIILSKKDRVHPLLKYIIL FK >gi|226332048|gb|ACIB01000008.1| GENE 56 66761 - 67648 534 295 aa, chain - ## HITS:1 COG:NMB0062 KEGG:ns NR:ns ## COG: NMB0062 COG1209 # Protein_GI_number: 15675999 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-glucose pyrophosphorylase # Organism: Neisseria meningitidis MC58 # 1 291 1 288 288 417 66.0 1e-116 MKGIVLAGGSGTRLYPITKGVSKQLLPIFDKPMIYYPISVLMLAGIREILIISTPDDLPG FQRLLGDGSDFGVRFEYAEQPSPDGLAQAFIIGEKFIGDDSVCLVLGDNIFYGQGFTRML NEAVRIAESESKATVFGYWVSDPERYGVAEFDENGNVFSIEEKPQKPKSNYAVVGLYFYP NKVVEVAKSIRPSSRGELEITTVNQNFLSDKELRVQLLGRGFAWLDTGTHDSLSEASTFI EVIEKRQGLKVACLEGIALRKGWISPEKMKALAQPMLKNQYGQYLLKVIDELYVK >gi|226332048|gb|ACIB01000008.1| GENE 57 67651 - 68043 300 130 aa, chain - ## HITS:1 COG:no KEGG:BF1093 NR:ns ## KEGG: BF1093 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 130 1 130 130 216 100.0 2e-55 
MKNSLCNSVSSVVKKLLEYGEDGTPVYVNELTALNQELRNLCADLLLQKGESPEEEAEIL VTLFKGYDTMLFNFSSENEQVIQELLDRSMTVLEKLPASVLKCQLLLECFEQTGDEELIR EAKKNIERVI >gi|226332048|gb|ACIB01000008.1| GENE 58 68226 - 68744 332 172 aa, chain - ## HITS:1 COG:no KEGG:BF1009 NR:ns ## KEGG: BF1009 # Name: upcY # Def: putative transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 172 1 172 172 329 100.0 3e-89 METTDSDKHWYVVLTRTNSERKVRDYFQLQEVDTFLPVQNRVIEREGKRIERERLLLPRM VFVHISRQEMAAVRSTLNVYDFLRDRSTGAPTCIPDAQMADFRYMLDYSQDQVILTGESI PKGTRVVVAKGDLQGLRGELVRYNNKYHILVRIDMFGSAMVTIPASYVRKEK >gi|226332048|gb|ACIB01000008.1| GENE 59 69074 - 69310 103 78 aa, chain - ## HITS:1 COG:no KEGG:BF1091 NR:ns ## KEGG: BF1091 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 61 1 61 61 77 98.0 2e-13 MLTSYSSLIKILLFDNQLLIFWMHSVVKFFSYVQDVRSCLFIKLPLRLSIACLSTYFHIL LSATLVLCILLLSLFAKR >gi|226332048|gb|ACIB01000008.1| GENE 60 69458 - 70093 433 211 aa, chain - ## HITS:1 COG:CAC0567 KEGG:ns NR:ns ## COG: CAC0567 COG0500 # Protein_GI_number: 15893857 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Clostridium acetobutylicum # 7 206 4 209 209 130 34.0 2e-30 MNKRGFVSRILQNFRKPEGFFGRMILWGMNTGHASLAQWGMSCLQWQPEWSVLDIGCGGG ANLLQILQRCPQGKAYGIDISSESVTFARKKNKKYLGTRCFIEQGGVHRLPYPDYAFDAV TAFETVYFWGNLQHAFTEVARVLKPGGSFLICCEISDPANKAWTGLVEGMEIHSCDELKA ILSKSGFTDTAIFRTKKEELCLVSHRQTVRL >gi|226332048|gb|ACIB01000008.1| GENE 61 70399 - 71739 1026 446 aa, chain + ## HITS:1 COG:no KEGG:BF1089 NR:ns ## KEGG: BF1089 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 446 1 446 446 806 99.0 0 MNNVQQVKTYSQRKISDFLFILWAGGAALLSYSLVYALRKPYTAAGFDGLEAFGMDYKVV VTIAQILGYVLSKFIGIKLISELKRENRMKFILISIILAEASLILFGLLPAPYNIVAMFL NGLSLGCMWGIIFSFIEGRRMTDILASLLGVSMVISSGTAKSAGLYVMDTLNISEFWMPA 
LIGGVALPLLALLGYALNRLPQPTAEDIAMKSKRETLNGKQRWELFKNFMPFLTLLFIAN VVLTILRDIKEDFLVKIIDVSQYSSWMFAQVDSVVTLIILIIFGLMVFVRSNLKALSILL GLIIASMVVMAVVSFGYEQLQLNAIAWLFIQSLCLYLAFLTFQTIFFDRFIACFKIRGNV GFFIAMNDFLGYTGTVIVLAVKEFFSPDINWTAFYNLMAGYVGIICFVAFVCSFIYLHQR YRRENYGKTGVFRKKEEEKEVPDFVY >gi|226332048|gb|ACIB01000008.1| GENE 62 71783 - 73294 488 503 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|153836659|ref|ZP_01989326.1| ribosomal protein S15 [Vibrio parahaemolyticus AQ3810] # 1 497 7 505 513 192 30 1e-47 MKIFPSSSIKKLDAYTIEHEPIASIDLMERAAQALTKAITERWDITTPVTVFAGPGNNGG DALAVVRMLAEKEYKVEAYLFNPKGELSADCQTNKELVEMMDNVKFSEVSTQFVPPALTM DHLVVDGLFGSGLNKPLSGGFAAVVKYINASPATVVAIDIPSGLMGEENTFNVKANIIRA QLTLSLQLPKLAFLFAENSEFVGEWKLLDINLSREAIEETESNYALLEAEEIHALIKPRN TFSHKGNFGHALLIAGSYGMAGASILAARACMRSGVGLLTVHAPIRNNDILQISVPEAII ESDASDTYFACPTDTDDYQAVGIGPGIGRSEETEAALLEQLSGCQTPLVLDADALNILAN HRHALTTLPKGSILTPHPKELERMVGKCQNSYERLMKACELARTAKVHIILKGAYSAIIT PSGKCYFNSTGNPGMATAGSGDVLTGVVLALLAQGYPAEEAAKIGTYVHGLAGDFARKKQ GVISMTAGDIISNLPLAWRLVSE >gi|226332048|gb|ACIB01000008.1| GENE 63 73367 - 74422 1144 351 aa, chain + ## HITS:1 COG:no KEGG:BF1004 NR:ns ## KEGG: BF1004 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 351 1 351 351 647 100.0 0 MKKLIILTGLLLSTSAYAQTEVTAGVTRGKDYGVTYALPKTAINIEVKVNKVTYTPGEFS KYADRYLRLTDVSGEPQEYWELVSVKAKSVGIPDSEHTYFVKLKDKTVAPLIELTEDGIV KSINVPLSPKKSAPMQPATTQKKKINPRDFLTEEILMAGSTAKMAELVAKEIYNIRESKN ALVRGQADNMPKDGEQLKIMLANLEEQEAAMTEMFSGTLNKDEKIFNIRLTPDKEMDNEV AFRFSKKLGIVANNDLAGEPVYITLKNLKTVNVPEDDGKKKVDGIAYNVPGKAQVTLTEG KKQWFNGELPVTQFGTIEYLAPALFNKKSTVQVTFNPDTGGLIKVDREEGE >gi|226332048|gb|ACIB01000008.1| GENE 64 74556 - 74909 496 117 aa, chain - ## HITS:1 COG:no KEGG:BF1086 NR:ns ## KEGG: BF1086 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 117 1 117 117 227 99.0 1e-58 MADVKEKINLLDVIPFRSENITAEKGSDGTVTIAFPRFKYEWMRRFLLPKGMSTDIHVRL EDHGTAVWELIDGKRTVRRIIEELAEHFNYEENYESRITAYITQLQKDGFVKLVIEN 
>gi|226332048|gb|ACIB01000008.1| GENE 65 74916 - 76304 1533 462 aa, chain - ## HITS:1 COG:no KEGG:BF1085 NR:ns ## KEGG: BF1085 # Name: not_defined # Def: putative oxalate:formate antiporter # Organism: B.fragilis # Pathway: not_defined # 1 462 1 462 462 853 100.0 0 MTEQLKNKLSDSKTLRWSVLALVAFTMLCGYFLTDVMSPLKPMLEKELLWDSLDYGFFTS AYGWFNVFLLMLIFGGIILDKMGVRFTGMGACILMVLGCGLKYYAISTTFPEGALIMGFK TQVFLAALGYAIFGVGVEIAGITVSKIIVKWFKGKEMALAMGLEMATARIGTTLAMVLTV PIADYFGYTDESGSFHTNIPMPILLCLIMLCIGTIAFFIYTFYDKKLDASLDAQGEEPEE PFRMKDVMLIVTNKGFWLIALLCVLFYSAVFPFIKYATDLMVQKYNVDPKLAGNIPGLLP IGTIFLTPLFGTLYDRIGKGATLMIIGAVMLIGVHTLFALPILNVWWFATVIMIVLGIAF SLVPSAMWPSVPKIIPEKQLGTAYALIFWVQNWGLMGVPLLIGWVLNTYCKGPVVDGAQT YDYTLPMAIFACFGVLALIVALMLKAEDKKKGYGLQEANIKK >gi|226332048|gb|ACIB01000008.1| GENE 66 76475 - 76855 469 126 aa, chain - ## HITS:1 COG:no KEGG:BF1084 NR:ns ## KEGG: BF1084 # Name: secG # Def: preprotein translocase subunit SecG # Organism: B.fragilis # Pathway: Protein export [PATH:bfr03060]; Bacterial secretion system [PATH:bfr03070] # 1 126 1 126 126 208 100.0 4e-53 MYLLLVILMVIAAILMCFIVLIQNSKGGGLASGFSSSNQIMGVRKTTDFLEKATWGLAAF MVVMSIATAYVVPTSSSKTQDVIMEQAQQEEQTNPYNLPVGTTAPKTDAAAPVEAPATET PATPAN >gi|226332048|gb|ACIB01000008.1| GENE 67 76860 - 77615 602 251 aa, chain - ## HITS:1 COG:no KEGG:BF1000 NR:ns ## KEGG: BF1000 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 251 1 251 251 391 99.0 1e-107 MISANLQQWIQHPETLNKDTLYELRTLVTRYPYFQSLRLLYLKNLYLLHDISFGAELRKA ILHVADRRKLFYLIEGERYILKPRKKNALPETEVLEEEPSLDRTLSLIDAFLATVPEEVS AQTSLDYATDYTTYLLQEDDTPELEETPKLRGHELIDGFIERSEEETSIRLQPADENKAI SEEEQSETHHEEDEDDSCFTETLAKIYVKQHRYSKALEIIKKLSLKYPKKNAYFADQIRF LEKLIINAKSK >gi|226332048|gb|ACIB01000008.1| GENE 68 77621 - 78142 625 173 aa, chain - ## HITS:1 COG:no KEGG:BF1082 NR:ns ## KEGG: BF1082 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 173 1 173 173 321 100.0 6e-87 
MDWNKKIMRISLLVFTLVVGISCTVSYKFNGGNINYDKVKTISIADFPIKSDYVYAPLGT KFNEDLKDIFLRQTRLKLVNNNADLEIDGEITGYNQYNQAVSADGYSSETKLTITVNVRF VNNTNHEQDFEQQFSAFRVYDSRELLTAVQDGLIAEMTKEITDQIFNATVANW >gi|226332048|gb|ACIB01000008.1| GENE 69 78129 - 79355 941 408 aa, chain - ## HITS:1 COG:BMEI0866 KEGG:ns NR:ns ## COG: BMEI0866 COG2204 # Protein_GI_number: 17987149 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Brucella melitensis # 15 263 181 428 528 240 48.0 3e-63 MTKAEIQQVKLRFGIIGNTEALTRAIDVAIQVAPTDLSVLITGESGVGKESFPQIIHQYS RRKHGQYIAVNCGAIPEGTIDSELFGHEKGAFTGAIGERKGYFGEADGGTIFLDEVGELP LPTQARLLRVLESGEFIKVGSSKVQKTDVRIVAATNVNLTQAIAEGRFREDLYYRLNTVP IQIPPLRERGEDVLLLFRKFASDFAEKYRMPAIQLTEDAKRVLLSYSWPGNVRQLKNITE QISIIETNREINAPILQSYLPAQSTQRLPALFGVKTGKSFESEREILYQVLFDMRQDVTE LKKLVHEIMSERGAVTSNVGTFYTPAPVVTPTPSVPAIIHPVKPNCPDDDDIQDTEEYVE ESLSLDEVEKEMIRKALEKHHGKRKSAAKDLNISERTLYRKIKEYGLE >gi|226332048|gb|ACIB01000008.1| GENE 70 79380 - 80477 876 365 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163786851|ref|ZP_02181299.1| 50S ribosomal protein L32 [Flavobacteriales bacterium ALC-1] # 2 344 3 343 346 342 48 1e-92 MEDNKIKIGITQGDINGVGYEVILKTFADPVMLELCTPVIYGSPKVAAYHRKSLDLPTNF SIVNTAAEAAHNRLSVVNCTDDEVKVEFSKPDPEAGKAALGALEKAIEEFREGLIDVIVT APINKHTIQSEGFAFPGHTEYIEQRLGNGSKSLMILMKEDFRVALVTGHIPVREIASSIT KELIQEKLAIFNRSLKQDFGIGAPRIAVLALNPHAGDDGLLGTEEQEIISPAIQEMAAKG ILCYGPYPADGFMGSGNFTHFDGVLAMYHDQGLAPFKALAMDEGVNYTAGLPVIRTSPAH GTAYDIAGKGVACEDSFRQAIYVAIDVFRNRQREKEAHANPLRKQYYEKRDDSDKLKLDT VDDDI >gi|226332048|gb|ACIB01000008.1| GENE 71 80482 - 81534 852 350 aa, chain - ## HITS:1 COG:no KEGG:BF1079 NR:ns ## KEGG: BF1079 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 3 350 1 348 348 692 100.0 0 MHMKKYLFYLSMALVAVVLFSCKSGKKSVFTPTSSGRAYEVLVVVEKPVWERPAGRALYN VLDTDVPGLPQSERSFRIMSTSPKDFDAILKLVRNIIIVDIQDIYTQPKFKYAKDVYASP 
QMILTIQAPDEASFEKFVEENKQPIIDFFTRAEMNRQISFLEKNHNDYISTKVGSMFNSD IWLPAELSSSKTGEDFFWAGTNAATGDQNFVIYSYPYTDKDTFTKEFFIHKRDSVMKANI PGAKEGMYMATDSSTVEVRPIDIHGDYTMEARGLWRIKGDFMGGPFVSHTRLDKASHRII TTEVFIYSPDKMKRDLMRRLEASLYTLQLPTEKAQEQIPMGIEQEEKTNK >gi|226332048|gb|ACIB01000008.1| GENE 72 81603 - 82637 819 344 aa, chain - ## HITS:1 COG:NMB1308 KEGG:ns NR:ns ## COG: NMB1308 COG0820 # Protein_GI_number: 15677174 # Func_class: R General function prediction only # Function: Predicted Fe-S-cluster redox enzyme # Organism: Neisseria meningitidis MC58 # 13 343 9 351 364 247 36.0 3e-65 MPKYPLLGMTLTELQSVTKDLGMPAFAAKQIASWLYDKKVTSIDEMTNLSLKHRELLKGE YDLGISAPVDEMRSVDGTVKYLYQVSDNHFVEAVYIPDEDRATLCVSSQVGCKMNCKFCM TGKQGFTASLTANQILNQIAALPERDKLTNVVMMGMGEPLDNLDEVLKALHILTASYGYG WSPKRITLSSVGLRKGLQRFIEESECHLAISLHSPFPSQRSELMPAERAFSIKEMVDLLK NYDFSKQRRLSFEYIVFKGVNDSLIYAKELLKLLRGLDCRVNLIRFHAIPGVDLEGAGME TMTSFRDYLTSHGLFTTIRASRGEDIFAACGMLSTAKQEESNKN >gi|226332048|gb|ACIB01000008.1| GENE 73 82814 - 84952 2185 712 aa, chain - ## HITS:1 COG:no KEGG:BF1077 NR:ns ## KEGG: BF1077 # Name: not_defined # Def: peptidyl-prolyl cis-trans isomerase # Organism: B.fragilis # Pathway: not_defined # 1 712 1 712 712 1313 100.0 0 MATLQNIRSKGPLLVIVIGLALFAFIAGDAWKVLQPHQSHDVGEVNGETLSAQDYQNMVE EYTEVIKFSSGMSSLNDEQTNQVKDEVWRSYVNNKLIEKEAKKLGITVSKAEIQSIINEG VNPLLQQTPFRNPQTGAFDKDMLKKFLADYSKMDKTKMPSQYVEYYEGMHKLWSFVEKTL IQSRLAEKYQALVTKALFSNPVEAQDAFDARVNQSDVLLAAVPYSSIVDSTITVKESELK DLYNKKKEQFKQYVETRNIKYIDVQVTASAEDRAAIQQEVTDYTNQLATANGDYTTFIRS TGSEYPYVDLYYTKKAFPSDVVARMDSASIGQVYGPYYNAGDNTINSFKVLSKVAAADSV QFRQIQVYTEDAAKTKALADSIYTAIKGGADFTALAKKYGQTGESNWISSANYENAQVDG DNLKFISTINNLGVNELSNVALGQGNIILQVTDKKAVKDKYKVAVIKRAVEFSKETYNKA YNEFSQFIAANPTVDKVAANAEESGYKLLERNDLYSSEHGIGGIRGTKEALKWAFAAKPG EVSGLYECGESDRMLVVGLVSVIEEGYRPLAQVQDQLRAEIIRDKKAEKIMADMKAANAT TIAQYTSMANAVSDSVKHVTFAAPAYVAALRSSEPLVGAYASVSDINKLSAPIKGNGGVF VLQVYAKDKLNETFDAQSEEATLENMHARLASRFMNDLYLKGDVKDKRYLFF >gi|226332048|gb|ACIB01000008.1| GENE 74 85073 - 86329 998 418 aa, chain - ## HITS:1 
COG:SP1963 KEGG:ns NR:ns ## COG: SP1963 COG1253 # Protein_GI_number: 15901786 # Func_class: R General function prediction only # Function: Hemolysins and related proteins containing CBS domains # Organism: Streptococcus pneumoniae TIGR4 # 5 416 14 431 443 160 29.0 6e-39 MSIIIYLLITMAFSAFFSGMEIAFVSVDKLRFEMDRKGGVSSRILSLFFRNPNDFISTML VGNNIALVIYGILMAQIIGDNLLAGWITNHFVMVLVQTVISTLIILVTGEFLPKTLFKIN PNLALNVCAVPLFICYVVLYPISKFSSGVSYLFLRLFGMKVNKEASAKAFGKVDLDYFVQ SSIDNAESEETLDTEVKIFQNALDFSAVKIRDCIVPRTEVVAVALDTSLEELKGRFVESG ISKIIVYDGNIDNVVGYIHSSEMFRSPKDWRDHVKEVPIVPETMAAHKLMKLFMQQKKTI AVVVDEFGGTSGIVSLEDLVEEIFGDIEDEHDNTSYICKQIGEHEYVLSARLEIEKVNET FNLELPESDDYLTVGGLILNQYQSFPKLHELVSVGKYQFKIIKVTATKIELVRLKVME >gi|226332048|gb|ACIB01000008.1| GENE 75 86332 - 86949 364 205 aa, chain - ## HITS:1 COG:no KEGG:BF0992 NR:ns ## KEGG: BF0992 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 2 205 1 204 204 398 99.0 1e-110 MLRQQSNSLLNKSLSITIVFGAIVMLLLFSSCGGRNKAMADAITERDSLPVMDTRGVTTL ISDSGVTRYRVNTEEWLIFDKKKPSYWAFEKGIYLEQFDSLFHIDASIKADTAYYYDRDR LWKLIGNVDIKSLKGDHVTTELLYWNEATKKVYTDKFVRMEKPDQIMTGYGFESDDQFMK PVVHNISGIVYIDEDAEKAKTDSVN >gi|226332048|gb|ACIB01000008.1| GENE 76 86954 - 88279 1498 441 aa, chain - ## HITS:1 COG:no KEGG:BF1074 NR:ns ## KEGG: BF1074 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 441 1 441 441 774 100.0 0 MKIKTLVAVLFLSAGATTVVAQDDANCNSNSSISHEAVKAGNFKDAYTPWKAVLENCPTL RFYTFTDGYKILKGLLGQIKDRNSAEYKKYFDELMNTHDLRMKYTQEFLGKGVKVSSEDE ALGIKAVDYIAFAPKVDVNQAYDWLKKSVDAAKAESAAATLFYFLQMSHDKLKEDPAHKE QFIQDYLAASEYADDAIAAADKESVKKAFGGIKDNLVALFINSGTADCESLQGIYGPKVE TNQTDLNYLKKVISIMKMMKCTDSDAYQQASFYVYKIEPSAEAATGCAYQAYKKGDIDGS VKFFDEAINLETDNAKKAEKAYAAASVLTTAKKLSQARSYAQKAISFNENYGAPYILIAN LYAMSPNWSDESALNKCTYFAVIDKLQKAKSVDPSVTEEVNKMISRYSAYTPQAKDLFML GYKAGDRITIGGWIGESTTIR >gi|226332048|gb|ACIB01000008.1| GENE 77 88325 - 89599 976 424 aa, chain - ## HITS:1 COG:no KEGG:BF1073 NR:ns ## KEGG: BF1073 # 
Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis # Pathway: not_defined # 1 424 1 424 424 804 100.0 0 MVGYKQTLCALLLTILLPGVAIAQNNTNSPYTRYGYGQLADQSFANSKAMGGIAYGLRDG SHINPLNPASYTAIDSLTFLFDGGFSMQNTNFSSEGTKLNAKNSSFDYIAMQFRLHQRVA MSIGLLPYSSVGYNMAKANNDVASEEARSVTSFAGDGGLHQLYVGLGVKVLKNLSVGANV SYFWGEITRQARITFPYNDNAFAFQHVDYLSVRDYKLDFGAQYTQQLGRKHAVTLGVVFS PKKDLHNEAYVQRSTLTNSNSTQAVTTNTVDTVATFGMPNSFGVGLTYEYDKRLIVGADF NLQKWGDVTYMNQPNAFCDAMKISVGAEYMPSRFSRSYLAHIKYRVGGYYSEPYYKIGGE RASREYGVTAGLGLPLPGSRSLINVSAQYIKVHGLKAGMVDENTLRLSIGITFNEGWFFK RKVK >gi|226332048|gb|ACIB01000008.1| GENE 78 89586 - 90317 381 243 aa, chain - ## HITS:1 COG:TM0883 KEGG:ns NR:ns ## COG: TM0883 COG1521 # Protein_GI_number: 15643645 # Func_class: K Transcription # Function: Putative transcriptional regulator, homolog of Bvg accessory factor # Organism: Thermotoga maritima # 1 232 1 239 246 90 31.0 3e-18 MNLIIDIGNTVAKVALFDRTSMVEVVYDSNQSLDSLEAVCNKYDVRKAIVATVIDLNECV LAQLNKLPVPVLWLDSHTPLPVINLYETPETLGYDRMAAVVAAHDQFPGKDILVIDAGTC ITYEFVDSLGQYHGGNISPGLWMRLKALHQFTGRLPLVHAEGRMPDMGKDTETAIRAGVK KGIEYEIAGYITAMKHKYPELLVFLTGGDDFSFDTKLKSVIFADRFLVLKGLNRILNYNN GRI >gi|226332048|gb|ACIB01000008.1| GENE 79 90891 - 92117 590 408 aa, chain + ## HITS:1 COG:no KEGG:BF0985 NR:ns ## KEGG: BF0985 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 408 1 408 408 803 93.0 0 MNSKLRHLLLIVFSIFPILTWGTESPSTADSIRISLLTCAPGEEIYSLFGHTAIRYEEPA RGIDRVYNYGLFSFNTPNFTLRFALGNADYRLGVEDYRHFAVEYEYFGRSVWQQTLNLTA EEQQQLITLLEENYRPENRIYRYNFFYDNCATRPRDKVEESLQKSGSQLLFSNTYTENNE TKSYRDIVHQYTKGHPWAQFGIDFCIGSQADRPISERQMMFSPFYLMDAFDKARIVNASE NRPLVTTTKKIIDCEPDVSGSAENDIWNMLTPIRLSLLVFIAIGMATAYGLRKKKSLWGL DIAVFAAAGIAGCIVAFLALFSEHPTVGSNYLLFVFHPGHLLCLPFFINDERKRRKSRYH LLNCIVLTLFIVLFPVIPQNFDLAVLPLALCLLIRSASNLILTYKKAK >gi|226332048|gb|ACIB01000008.1| GENE 80 92114 - 93688 1051 524 aa, chain + ## HITS:1 COG:CC2461 KEGG:ns NR:ns ## COG: CC2461 COG1524 # Protein_GI_number: 16126700 # 
Func_class: R General function prediction only # Function: Uncharacterized proteins of the AP superfamily # Organism: Caulobacter vibrioides # 1 322 13 358 577 77 25.0 6e-14 MKGLLTSILTVLTFTGLQAQPLPSTPKLVVGLTIDQLRTDYLEAFSTLYGDRGFRRLWKE GRVFRNAEYTFSGTDRASAIAAIYTGTTPSVNGIIGKRWMDVSTLRTVSCVDDPAFMGNY TNESSSPSHLLTSTIADELKIATRNEGLVYAIAPFRDAAILAAGHAGNGAFWLNNTTGKW CGTTYYSEFPWWVSQYNDRNAIDFRIADMTWTPVHPVQSYSFLPEWRDAAFKYKFDDDRV NKYKRLITSPFINDEINTLTEELLDKSIMGKDHVPDMLALTYYAGNYAHKSVQECAMEMQ DTYVRLDRSIASLLDIIDKKVGLQNVVFFITSTGYTDTESPDLGLYRVPTGEFHLNRCAA LLNMYLMATYGQGQYVEAYYDQQIYLNHKLIEEKQLNLADIQEKAAEFLIQFSGVNEVYS GKRLLLGSWTPDISMIRNSFHRKRSGDLLIDVLPGWSIVNENTSDHKVVRKAHIPSPLIF MGSGVKPAVINTPVTIDHIAPTVAHILRIRSPNACSATPITDIR >gi|226332048|gb|ACIB01000008.1| GENE 81 93796 - 97125 3588 1109 aa, chain + ## HITS:1 COG:CT701 KEGG:ns NR:ns ## COG: CT701 COG0653 # Protein_GI_number: 15605434 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecA (ATPase, RNA helicase) # Organism: Chlamydia trachomatis # 5 1007 3 917 969 616 38.0 1e-176 MGFNEFLSSIFGNKSTRDMKEIQPWVDKIKAAYPEVAKLDNDGLRAKTEELKEYIRNSAS KERAKADELRAGIENVELEDREEVFAQIDKIEKEILEIYEKALDEVLPVAFSIVKESAKR FSENEEIVVTATDFDRTLAATKDFVRIEGDKAIWQNHWNAGGNDTVWNMVHYDVQLFGGV VLHKGKIAEMATGEGKTLVATLPVFLNALTGNGVHVVTVNDYLAKRDSEWMGPLYMFHGL SVDCIDRHQPNSDARRQAYLADITFGTNNEFGFDYLRDNMAISPKDLVQRQHNYAIVDEV DSVLIDDARTPLIISGPVPKGEDQLFDQLRPLVERLVEAQKVLATKYLSEAKKLINSDDK KEVEEGFLALFRSHKALPKNKALIKFLSEQGIKAGMLKTEEVYMEQNNKRMHEATDPLYF VIDEKLNSVDLTDKGVDLITGNSEDPTLFVLPDIAAQLSELENEHGLSDEQKLEKKDALL TNYAIKSERVHTINQLLKAYTMFEKDDEYVVIDGQVKIVDEQTGRIMEGRRYSDGLHQAI EAKEGVKVEAATQTFATITLQNYFRMYHKLSGMTGTAETEAGELWDIYKLDVVVIPTNRP IARKDMNDRVYKTKREKYKAVIEEIEQLVQAGRPVLVGTTSVEISEMLSKMLTMRKIEHN VLNAKLHQKEADIVAKAGLSGTVTIATNMAGRGTDIKLSPEVKAAGGLAIIGTERHESRR VDRQLRGRAGRQGDPGSSVFFVSLEDDLMRLFSSDRIASVMDKLGFQEGEMIEHKMISNS IERAQKKVEENNFGIRKRLLEYDDVMNKQRTVVYTKRRHALMGERIGMDIVNMIWDRCAA 
AIENNADYEECKLDLLQTLAMEAPFTEEEFRNEKKDKLADKTFDVAMANFKRKTERLAQI ANPVIKQVYENQGHMYENILIPITDGKRMYNISCNLKAAYESESKEVVKSFEKSILLHVI DESWKENLRELDELKHSVQNASYEQKDPLLIYKLESVTLFDNMVNKINNQTVSILMRGQI PVAEPTEEQQEAARRVEVRQAAPEQRQDMSKYREQKQDLNDPNQQAAAQQDTREAVKREP IRAEKTVGRNDPCPCGSGKKYKNCHGRNS >gi|226332048|gb|ACIB01000008.1| GENE 82 97245 - 98345 949 366 aa, chain + ## HITS:1 COG:no KEGG:BF1067 NR:ns ## KEGG: BF1067 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 366 1 366 366 713 100.0 0 MTKYPYILFVLLLASFSSCQTVEQLSIDYMLPAEISFPNELKRVAVVNNVSDTPDNTLPP KDNTIKNKNELSRAVAYHEGQPALTTEALAKAIAEQNYFNEVVICDSALRARDFTPREST LSQEEVQTLAQFLDVDCIISLENLQMKSTRVLSYIPEWNTYYGTLDTKVYPTLKIYLPGR KSPMVTINTHDSIFWEEYGNTEGFVRSRLPDERQMIREASEFAGSVPVNRILPYWKTANR YYFINGSVAMRDAAVYVKENEWEKASKLWEQAFKAAKNDKKKMRAAFNLALYYEMKDSVE EAHKWAVTAQELARKIDKIDTLKRNDIDLSEIPNYYLTSLYVNELKERSNGLGKLKGQMS RFNEDF >gi|226332048|gb|ACIB01000008.1| GENE 83 98410 - 98856 520 148 aa, chain + ## HITS:1 COG:no KEGG:BF1066 NR:ns ## KEGG: BF1066 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 148 1 148 148 229 100.0 2e-59 MKLSQQSQAIIESAIQKAINKYTCGCEQTIVTDIHIQPNQNSGELFIYDDEDEELSSVTI DEWTAYEGDDFYEDAERIFRTVLCRMKENGSFDKLTILKPYSFVLVDEDKETISELLLVD DDTLLVNDELLKGLDKELDDFLKDLLEK >gi|226332048|gb|ACIB01000008.1| GENE 84 98936 - 100492 1487 518 aa, chain - ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 19 516 10 492 497 163 27.0 6e-40 MTNERMKLLNCTLGLVAGVSLPVSALAVPQPAQEQTEKQPNIILIVADDLGYGDLSCYGA HRIQTPGMDRIANEGIRFTQGFCTAATSTPSRYSVMTGKYPWSNVDAKILPGNAALIIDT QKITLPKLMKQAGYTTGSVGKWHIGLGDGHVDWNKEVHPGAAEIGYDYSFIQAATNDRVP CVFLENGRVVGLDPNDPLYVDYRKNFPGEPTGKENPELLRMHPSVGHAGSIVNGVPRIGF QKGGKAAQWKDEEMAGLFLDKARQFVDDNKDKPFFLYYGLHQPHVPRVPNERFVGKSGMG 
PRGDVILEADWCVDQFLKELDKLGLAENTIVILTSDNGPVLDDGYQDDAVELVGDHKIAG PLRGGKTSMFDGGTRIPFMLRWPAKVKPQVSDVFVCQMDLLASFAFLLGQTYPDKVDSEN TLDAFLGKSKKGRKKLVIEGMFNYAYRQGDWALIPPYYNPYSKEDGDFIGLGYGYKLYNL KSDIGQQKNLAEKYPKKLGELINRFEYLKAHSDKVTRF >gi|226332048|gb|ACIB01000008.1| GENE 85 100668 - 102233 1562 521 aa, chain - ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 37 414 31 382 497 153 28.0 7e-37 MRQYVLLACLSPVACLMAATGQKGGKAKQKINDRQLPNVVFIYADDLGYGDLECYGAKNV QTPNVNRLAAEGIRFNNAHATAATSTPSRYSMLTGEYAWRRPGTDIAAGNAGMIIRPERY TMADMFKNAGYATAAIGKWHLGLGDKDGEQDWNAPLPTALGDIGFDYSYIMAATADRVPC VFIENGKVANYDPSAPIEVSYRKPIEGEPLGKDHPELLFNLKSSHGHDMAIVNGIGRIGY MKGGGKALWKDENIADSITSHAIGFIREHKDEPFFMYFATNDVHVPRFPHDRFRGKNPMG LRGDAIVQFDWSVGQIMETLDKLGLSENTLIILSSDNGPVVDDGYQDRAEELLNGHSPAG PLRGNKYSAFEGGTRIPAIVRWPKGAASSQVSNALVSQIDWFASLASLVGAGLPKGAAPD SFNYLDTWLGKNQSDRSWVIEQASNHTLSVRTKDWKYIEPNDGPAMITWGPKIETGNLST PQLYHVVDDVAEQKNVASLHPDLVFELQNILRHVRMKNLKP >gi|226332048|gb|ACIB01000008.1| GENE 86 102250 - 103890 1603 546 aa, chain - ## HITS:1 COG:no KEGG:BF1063 NR:ns ## KEGG: BF1063 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 546 1 546 546 1114 100.0 0 MKSLKYIVALALAAGLFQACDLERYPLTDLSEETFWNSESNAELALTSLYRGSLTDGVEY NPSDWWSYHGMIMMEHLSDNAFDRRGENNPFFKISSGNLTADNAFIKRYWETSYKRIGYC NRFLVGIQNSSESEKKTRMIAEARFLRATQYFYLASYFKNVPLVENVLTGEEANNVTKTS QADILKWCVTEFTAAAADLPRFSAIPAGEAGRACKQAALAFLGRTCMLQKDWKSGAKAFH DIMELGDNAINANYQELFYPSTGTSNKENIFYIQYLENYLGTGLPQHALSAKDGGWSLVN PAADLYESYEFKDGTPFSYDDPRYDPSNLGKDRDPRLDYTIYYNGAIFMGTEYKMSPDYS AAKKEKLDYTSEASRTGFMMRKYFEESTPINDVQSANGLTPVIRYAEVLLGYLECLVEDN QTITQGILDETINAVRGRASVNMPPVTEVTPAKLREIVRHERRIELAMEGIRYWDIMRWG IAHEVLSQKIWGAPYPGSTQYATTTKEVDPTGNYRWYVGKRAFRNPTDYTWPIPQSEQNI NPNLRD >gi|226332048|gb|ACIB01000008.1| GENE 87 103897 - 107277 3024 1126 aa, chain - ## HITS:1 
COG:no KEGG:BF1062 NR:ns ## KEGG: BF1062 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1126 1 1126 1126 2189 100.0 0 MQNYFSIQLLRVVKSSLWLTSKKIPRTMRLFILFLICSMSFVHATDSFAQKVEISIDAQN QTVEKVLKEIEKQSGFGFFFNNKHVNLKRVVSVSVDKSNIFKVLDKIFEGTDVKYSVLDK KIILSTEMTSKQQQAVKISGKVVDVNGEPVIGASIVEKGTTNGTVTNLQGDFSLSVSSDK AVIEISYIGYQPQELKVIAGKPLNVTMKEDAQALEEVVVVGYGSQKKVNVIGSIAAVDSK KLESRTAPSVSNMLTGQLSGVTITQSSGNPGQDQGTIRVRGVGSFGATPDPLVLVDGLPG SLNDLNPADIESISILKDASSAAIYGSRAANGVVLVKTKGGQKGKVTVSYNGYVGFNQAT ELPEMCDSWEYAELYNKAMGKEVYSAEEIQKYKDGSDPYNYPNEHYLDKLLGNKGLQTGH ELTVNGGNDKTQYMVSFGYVKQNGLMEHNHYDRYNGRVNLTTELAKNLTLTTRLGGVVSK RSEPSTPGGMDSAGFKAFSSNALRFPGLWATKLEDGSYGLGPKVLGTPLAWLDSGSFYDE NFDKFRSNVELAFTPVKGLTLKAIGGYNYTGQQIRHYRSAMEITGGKKLGPSSLSDTMYK TVYKTFQALADYNVKFSKNDLSVLVGYTWEDESQRTVGGSRLNFPSDEVPYLNAGGADGQ TNSGGGYDWAIMSVFGRLTYNYDQRYLFETTMRYDGSSRFPTDNKFGFFPSVAVGWRLSE EQFFKEAESLSFIDNLKLKASYGILGNNNIGNYPYQSTYALGKAMNYVFGGVYTQGAAVT TYVDPTLKWEKTRTTDVGIETAFWNNKLTFNAAYFYRKTTDILYKPSASYSSIFGLGLSQ VNTGSLENKGWEFEIGHQNKIGEFSYHVNGNFSIIKNKVISLGVGDVEQKSGMIGNGSDL FLGYPMNMFYGYKTDGVFLTDDEVKEWHDQSKIAPNSKAGDLRYVDISGDGKVDESDKTY LGSKIPQYTFGLGLGAEYKGFDFNILLQGVAKVKGQLTNYAGYAFFQEGNIQKWQAEETW TNNQSNRYPKYPRLEVMSNAGSNNTLGSDFWILDASYLKVRNIQLGYTLPKRITQKFGSS NLRFYISLDNPFSISGYRKGWDPEINTDGSYYPILSTYTFGLTLKF >gi|226332048|gb|ACIB01000008.1| GENE 88 107446 - 108453 684 335 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 121 326 125 323 331 66 26.0 8e-11 MNYEDIDHLLPRYCEGLATEEECRQVESWMEESEDNRKIVDQINTLYIAVDTVNVMRKVD TEKALKKVSSRMIVRKTTWWEWMQRVAAILFIPLSVAFLVQYMHNGKSAVCQMMEIKTNP GMTTSVVLPDSTVVYLNSESSLCYPSVFEGDIRNVELKGEAYFAVAKDLKKKFVVSAPHS SQIEVLGTHFNVEAYEDEPDVSTTLVEGQVCFHFSDKDYLAKKVVMKPGQRLVYSSTNGD VQLYATSCLSETAWKDGKIIFNNTPLDVALRMLEKRFNVTFKLKNARLKTNAFTGTFTEQ 
RLERILEYFKISSKIQWRYLESPDIRDERSIIEVY >gi|226332048|gb|ACIB01000008.1| GENE 89 108589 - 109164 424 191 aa, chain + ## HITS:1 COG:VC2467 KEGG:ns NR:ns ## COG: VC2467 COG1595 # Protein_GI_number: 15642463 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Vibrio cholerae # 4 182 2 190 190 62 26.0 4e-10 MKPNRTKEDILLLSQLQQGDKKAFNTLFRRYYPILCAYAHRFVDLEDAEEIVQDVMLWLW ENREILLIESSLSQYLLKMIYHRSLNRIAQKEVKYRADTLFYEKSQAMIYDVDFYQIEEL TKRIHTAIAELPESYREAFIMHRFRDMSYKEIAQTLNTSTKTVDYRIQQALKLLRKELKE FLSFALIFLAA >gi|226332048|gb|ACIB01000008.1| GENE 90 109366 - 110439 917 357 aa, chain - ## HITS:1 COG:AGl909_1 KEGG:ns NR:ns ## COG: AGl909_1 COG1409 # Protein_GI_number: 15890570 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 27 333 350 660 1299 108 27.0 2e-23 MNRKNYLLAFILCVQTLFVSAQVYPVRAKLTDEKSFSMILLPDPQSYTKFDANQPLFELQ TAWVANSIESLNIKGVLCTGDLVEQNEIRIPDGVNGNQTSEEQWRAASRAFERLDGKLPY VICTGNHDYGYQKAENRLCHFPDYFPAERNSCWRKSLVAVGNNYQGIPTLENAAYEFITD TWGKILVVSLEFAPRDEALAWAKKVVDAPRYKDHKVILLTHSYLAWTGKVIESENYKVTP ANYGKAIWDKLVYPAKNICMVICGHECEIADYKDNVSFRIDKNASGKNVPQMMFNAQTAD KQWFGNGGDGWLRIMEFMPDGKTIKIKTFSPLFALSPLTCDKSWRTDSYDQFDITIE >gi|226332048|gb|ACIB01000008.1| GENE 91 110524 - 111552 898 342 aa, chain - ## HITS:1 COG:no KEGG:BF1058 NR:ns ## KEGG: BF1058 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 342 1 342 342 714 100.0 0 MLRYSLLNITLWLFAVMAYGKGSELKVLQMNIWQEGTMVKGGFEAIADEVARLEPDIVLF SEVRNYHGKQFISRILQALEERGKKYYGENSKLDVGILSKYKIEEQAPNCPLEDDAGSVL KARIRINGRDVVVYSAHLDYTHYACYLPRGYSGVTWKKLDAPVLDAVAIEKANNESMRDE AICHVIEDARKEKGNIILLGGDFNEPSHLDWKENTKNLWDHNGTVVRWDCSVLLENAGFK DAYRTKYPNPVTHPGFTFPSDNEGVPVQKLSWAPDADERDRIDFIYFMPDRKLKLKDVSV VGPSKSIVRSERVEESGKDSFITPLGVWPTDHKAVMATFSLK >gi|226332048|gb|ACIB01000008.1| GENE 92 111631 - 113151 1487 506 aa, chain - ## HITS:1 COG:no KEGG:BF1057 NR:ns 
## KEGG: BF1057 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 506 1 506 506 978 100.0 0 MKRINKYISLLLATALLASCDKFDEINTDPDATTKVTSSLLATGLLLDITSSSASKSFIY DELLAKQMAWGESMEDYQYNVFGRSGFGGYTTLINAQKMVESVSDDNVNAYDGLAHFIKA YKIFYMSMEMGDLPYEEALQGELGLVRPKYNTQKEVMNFILSDLETAYELFSTAKDFDGD PILGGSISKWKKATTAFQLKVLMHLSKKESDADLKVKERFARIVASGSLMESNEDNLQMK YADKANTVYPFHNTNTKHAGYAMLSTMLIDKFKATGDIRMFYYAKPAKAKLNEGVTADSW DAYIGTDPSLPFEQIEKAYATEQYSGFNARYTDYPSGEPVVRLGYAEQNFILAEAAVRGW ISGDASAYYKKAIRAHMEFIASNTPDEEVYHHGHPITEEAIAAFLETPAIQLSGEKEADI EKILTQRYLASFMQHPYDVYYDYRRTGYPVLPINPATNRNTMNDRLPMRWMYPKSESDYN LEHQNEALERQFGGVDDVNKLMWILQ >gi|226332048|gb|ACIB01000008.1| GENE 93 113165 - 116467 2980 1100 aa, chain - ## HITS:1 COG:no KEGG:BF0971 NR:ns ## KEGG: BF0971 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1100 8 1107 1107 2094 99.0 0 MKLSAIMLFVFASMAFAAKGYSQSVRVTLNVNNSSLQKVLDEIEKQSEFHFFYNNKQVDI SRNVTIKSNQKEISQVLNQLFAGTNIGYKILENSIILSPKQILETTAAQQQTTKKIKGLV TDETGSPVIGANIKEKETGNGTITDINGNFSLSVGTKSTIIISYIGYITKEVPVGNNTSL TVQLAENTKQLDEVVVTALGIKREEKALGYAVQKVDGDKLAAVKTVNVATSLTGKIAGLN VKNSTEFNTSPSLSLRASAPLLVIDGVPYGNVGLNDIAADDIESVDVLKGATASALYGAR GGAGAVMITTKKGKEEGLNVTVNSSTMFAAGYLRKPEVQTSYSSGSQGTYSTGGYVWGDK LDIGRTALQYDPYTHEWVDMPLVSKGKNNLKNFQELSMVTNNNVSVSQKGKYGSVRTSLT HVYNKGQYPNQKLNKITYSVSGDMKWKKFSFDGGLTYNKRFYPNDMGAGYGGSGFLYNLL VWSGAEYDIRDYKNYWIKQDEQQNWMDTKWYDNPYFIANEIVRSSDYDLINGYLSANYDF TPWLNLSLRSGLDSYSQKKEWRNAVSAVGGWHKQGYYGLQRLGGYSLNNDLILSADHKFG DFNVDGFIGGNVYYWKSDNILGETQNGLKIPGYYSLKSSIDPVKTTSGITKKLVTSVYAK ASVSWKSTLFLDVTGRNDWSSSLPSETRSYFYPSVAGSVVLSQFIPMPEVIDFWKVRGAW TQTKSDLGVYDTNNTYSVSTDLWNGESAAYYPTSIRGVAVKPSATRSYEIGTAIHMFKNR LKLDFTYYNKLYYNLTRSAGISNSSGFTSTLINIDEEYVGRGVELTLSGDIIRTRDLKWE SSFNWSRDRWYYTKIDPVYSTQKPWVAVGKRWDWYGIYDWERDSQGNLVNYNGYPKQSDY QSVIGYEYPDWIWGWTNTVTYKNFTLSFTLDGRVGGMAHSKTNQAMWNSGAHIDSDNQWR YDEVVNKKTNFVGSGVKVVSGSVDYDSNGKIIHDNRVFAPNDVQVSYESYMKSTNPYIGT 
VTRQNVFDETFFKLRDLSLSYQMPKSVCDKLRMKGLTLAFVGQNLFVWTKEFRFTDPDSD SDNLSSPSTRYLGFNVKLDF >gi|226332048|gb|ACIB01000008.1| GENE 94 116725 - 117750 607 341 aa, chain - ## HITS:1 COG:AGl2289 KEGG:ns NR:ns ## COG: AGl2289 COG3712 # Protein_GI_number: 15891252 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 100 317 100 311 323 67 26.0 3e-11 MDESCYINESVLLNYFAGELPADRRKEVEEWIAASEDNEKMARDIFRLYRAADTLDYMNR VDASAALQQVKGRIHKHRHRISWMVWGQRIAACMALPLLATTLYLILKNPPQEYVEIRTN PGMVAEANLPDGTKVWLNSGSSLKHPVKFTGDTRTVELDGEAYFSVRKDRSKRFVVNTPF NIQTEVLGTEFNMEAYRTDSVVRTTLVSGSVRLSFLGKGDTKETFVMKPDEEFVYNTATH EARAEKSYVEIYTAWKNGQVVLKNTSLAETLKILSKRFNVEFIVKDSTLYANSFTGVFSR QYLPLILEHFRYASGIQYKYLDLEYDATHKAIQEKTKIELY >gi|226332048|gb|ACIB01000008.1| GENE 95 117855 - 118436 379 193 aa, chain + ## HITS:1 COG:SMc04203 KEGG:ns NR:ns ## COG: SMc04203 COG1595 # Protein_GI_number: 15965784 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Sinorhizobium meliloti # 29 180 1 155 159 58 28.0 6e-09 MYNDTSRTVSDETLFFQVQQGNEGAFEALFLRHYPALCAYARLFVEPDDGQEIVSDVMVW LWENKEMQAFESSLKSYLFKAVKNRCLTLINRNEVKQRIEKMIFDNLQSQYDDPDFYAIQ ELTEKIEEALARLPENVREAFELNRFQNLTYNEIAERLGVSPKTIDYRIQQALKQLRIDL KEYLPLLLPFLLH >gi|226332048|gb|ACIB01000008.1| GENE 96 118646 - 119074 379 142 aa, chain + ## HITS:1 COG:no KEGG:BF1052 NR:ns ## KEGG: BF1052 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 278 100.0 5e-74 MPINYVVRKKKDQSGNEVKELYYAVPSAIQNKGVSEKQLAEDLHDNSSLSAGDVLSVLEQ LPKAIARHMKEGRTVTIRGLGTFYPALSSEGCETPEECTPNKVRLTRICFRADTAFTYDV KHCEFESMQLRFTKRPKPGKEE >gi|226332048|gb|ACIB01000008.1| GENE 97 119040 - 120941 1238 633 aa, chain - ## HITS:1 COG:mlr3786_1 KEGG:ns NR:ns ## COG: mlr3786_1 COG0642 # Protein_GI_number: 13473249 # Func_class: T Signal transduction mechanisms 
# Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 375 626 198 462 478 167 37.0 7e-41 MKQTPFRCLCLLLILLGAVHSRLSAHADKTREDVLFLNSINFNLPWAKDVFWYTHQALQK KNISVKAESLSVPALCNRKEAAAVVEQLRRKYDVPPRLIVFIGDPGWIVCRELFDDVWKD VPVIITNTRDRLPATLDILLSHEELTESNTVPAYEWRKGYNVTTLGQVYYVKETIGLMRQ LMPDMKRLAFISDDRYISEAVRGDVEQAMTGSFPELAFEQLSTRNISTEMLLDTLKSYDK TTGLIYYSWFETHNQDDNNYLFDHIQEIITRFVHSPLFLLAPEDLSNNTFAGGYYVSVES FGDSLLQLIHRVLEGEFPRDIPPALGGKPAAYLCYPALQSYDIPVSLYPKEAVYINLPVS FFEQYKKEILMTVVLLLVVVSAVGYYIHILKRAHQRMKEAQLKAEEANQLKSAFLANMSH EIRTPLNAIVGFSNLLSMVEDKEEMLEYAGIIETNTELLLQLINDILDMSKIESGMYDFH VTQVDANQLMSEVEQVARLRIRTDEVSLSFAERLPQCVFHTDKNRLIQVLTNLVVNAIKF TSQGEIQIGYRLQDAHTLYFYVSDTGCGMSVEQCEHVFERFVKYNTFIQGTGLGLSICKM IIEKLGGEIGVQSESGKGSVFWFTLPYRASASL >gi|226332048|gb|ACIB01000008.1| GENE 98 121060 - 123690 2763 876 aa, chain - ## HITS:1 COG:FN2011 KEGG:ns NR:ns ## COG: FN2011 COG0525 # Protein_GI_number: 19705307 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Valyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 2 874 3 886 887 767 45.0 0 MELASKYNPADVEGKWYQYWLDHKLFSSKPDGREPYTIVIPPPNVTGVLHMGHMLNNTIQ DILVRRARMEGKNACWVPGTDHASIATEAKVVNKLAAQGIKKTDLSRDEFLKHAWAWTDE HGGIILKQLRKLGASCDWDRTAFTMDEKRSESVLKVFVDLYNKGLIYRGVRMVNWDPKAL TALSDEEVIYKEEHGKLFYLRYKIEGEDGYAVVATTRPETIMGDTAMCINPNDPKNQHLK GKKVIVPLVGRVIPVIEDDYVDIEFGTGCLKVTPAHDVNDYMLGEKYNLPSIDIFNDNGT ISKAAGMYIGMDRFDVRKQIEKDLEAAGLLEKTEAYTNKVGYSERTNVVIEPKLSMQWFL KMEHLAQIALEPVMKDDIKFYPAKYKNTYRHWMENIKDWCISRQLWWGHRIPAYFLPEGG YVVAVTDEEALKLAREKTGNPNLKMTDLRQDEDCLDTWFSSWLWPISLFDGINNPGNEEI NYYYPTSDLVTGPDIIFFWVARMIMAGYEYEGKMPFKNVYFTGIVRDKLGRKMSKSLGNS PDPLELIEKYGADGVRMGMMLSAPAGNDILFDDALCEQGRNFCNKIWNAFRLVKGWENGM GTIDIPADAHLAVQWFDQRLDAAAVEVADLFSKYRLSEALMLIYKLFWDEFSSWLLEIVK PAYGQPVNGFIYSMTLSAFERLLAMLHPFMPFITEELWQQLREREPGASLMVQPLGEPGE VNEEFLQQFETAKEIISSVRTIRLQKNIALKEPLELQVVGANPVEKMNPVIRKMCNLSAI EVVDAKADGASSFMIGTTEFAVPLGNMIDVDAEIARMEAELKHKEGFLQGVLKKLSNEKF VNNAPAAVIEMERKKQADAESIIQSLKESIASLKNV 
>gi|226332048|gb|ACIB01000008.1| GENE 99 123807 - 124868 869 353 aa, chain + ## HITS:1 COG:no KEGG:BF1049 NR:ns ## KEGG: BF1049 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 353 1 353 353 702 99.0 0 MKNRKRIIRYSVLGTLIALVWLTQMLPALGEGYARTAYPVISYILSGFSNIIPFALGDLF IALSIGGMIAYPFYARIRLKLSWKKILRRDVEYLLWIYVWFYLAWGLNYSQKNFYERTGI PYTAYTPEIFNEFVDNYISKLNKSYVPVNRIDKEVIRKEAVRQYNLISDTLGIHRPPHSR PRVKTMMFTPLISMVGVTGSMGPFFCEFTLNGDLLPPQYPATYTHELAHLLGITSEAEAN FYAYQVCTRSVNKEIRFSGYFSILGHVLANARQLMTEEEYKKLFGSIRPEIIELARKDQE YWMAKYNPLIGDIQDWIYDLYLKGNKIESGRKNYSEVIGLLISYNEWKKESNK >gi|226332048|gb|ACIB01000008.1| GENE 100 124874 - 125659 952 261 aa, chain + ## HITS:1 COG:BS_yabN KEGG:ns NR:ns ## COG: BS_yabN COG3956 # Protein_GI_number: 16077126 # Func_class: R General function prediction only # Function: Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain # Organism: Bacillus subtilis # 4 258 224 484 489 213 47.0 3e-55 MHTREEQMEAFGRFLDILDELRVKCPWDRKQTNESLRPNTIEETYELCDALMRNDKKDIC KELGDVLLHVAFYAKIGSETGDFDMKDVCDKLCEKLIFRHPHVFGEVKAETAGQVSENWE QLKLKEKDGNKSVLSGVPAALPSLIKAYRIQDKARNVGFDWEEREQVWDKVKEEIAEFQV EVANMDKDKAEAEFGDVMFSLINAARLYKINPDNALERTNQKFIRRFNYLEDHTIKEGKN LKDMSLDEMDAIWNEAKKKGL >gi|226332048|gb|ACIB01000008.1| GENE 101 125761 - 126234 455 157 aa, chain - ## HITS:1 COG:no KEGG:BF1047 NR:ns ## KEGG: BF1047 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 157 1 157 157 287 99.0 6e-77 MKKLIILLIIVCGFTPALRAVGSPNQHLSPKEFRAKQQAFITEKAGLTQEEAAKFFPVYF ELQDRKKQLNDEAWKLLRSGKDEKTTDTQYGEILEGVYDARIASDRLDKTYFEKFKKILS CKKIYLVQRAEMRFHRELLKGVRDNKGGNERPQGKRK >gi|226332048|gb|ACIB01000008.1| GENE 102 126263 - 126676 305 137 aa, chain - ## HITS:1 COG:no KEGG:BF0962 NR:ns ## KEGG: BF0962 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 137 1 137 137 254 100.0 7e-67 
MSKKKRGEERMKEEDNILKKVGKKNSFKVPEGYFENLTSEVMGKLPEKEGPAFEEVKQPT MWIRMKPLLYMAAMFIGAALIIRVASSNHQPTTAGDHLTANEAATEVVSDEYIDVALDRS MLDDYSLYVYLSDATAE >gi|226332048|gb|ACIB01000008.1| GENE 103 126694 - 127245 399 183 aa, chain - ## HITS:1 COG:mll8140 KEGG:ns NR:ns ## COG: mll8140 COG1595 # Protein_GI_number: 13476734 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 22 181 14 177 208 77 29.0 9e-15 MNPYNEREVLALLQDERTQKQGFERIVSQYSEQLYWQIRRMVLSHDDANDLLQNTFIKAW INIDYFRAEAKLSTWLYRIALNECITFLNKQRAMNTVAIDDPEADVTQKLESDPYFSGDR AELLLQKALLTLPEKQRMVFNLKYYQEMKYEEMSEIFGTSVGALKASYHHAVKKIEKFLE EAN >gi|226332048|gb|ACIB01000008.1| GENE 104 127318 - 128232 549 304 aa, chain - ## HITS:1 COG:slr0050 KEGG:ns NR:ns ## COG: slr0050 COG1234 # Protein_GI_number: 16331469 # Func_class: R General function prediction only # Function: Metal-dependent hydrolases of the beta-lactamase superfamily III # Organism: Synechocystis # 5 304 2 307 326 202 37.0 9e-52 MEKFELHILGCGSALPTTRHFATSQVVNLRDKLFMIDCGEGAQMQLRKSRLKFSRLNHIF ISHLHGDHCFGLMGLISTFGLLGRTAELHIHSPKGLEELLTPMLNFFCHTLAYKVIFHEF DTRQTSVVYEDRSMTVTTIPLQHRIPCCGFLFAEKARPNHIIRDMVDFYKVPVYELNRIK NGSDYVTPEGEVIANTRLTRPSDPPRKYAYCSDTIFRPEIVEQLSGVDLLFHEATFAESE LARAKETYHTTAAQAARIALEAGVRQLVIGHFSARYEDESILLKEASAVFPNTILAKENL CISL >gi|226332048|gb|ACIB01000008.1| GENE 105 128382 - 130175 3053 597 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53712335|ref|YP_098327.1| 30S ribosomal protein S1 [Bacteroides fragilis YCH46] # 1 597 1 597 597 1180 100 0.0 MENLKNVAPIEDFNWDAYENGESFAGASHEELEKAYDGTLNKVNDREVVDGTVIAMNKRE VVVNIGYKSDGIIPLNEFRYNPDLKVGDTVEVYIENQEDKKGQLVLSHRKARATRSWDRV NAALENEEIIKGYIKCRTKGGMIVDVFGIEAFLPGSQIDVKPIRDYDVFVGKTMEFKVVK INQEFKNVVVSHKALIEAELEQQKKEIIGKLEKGQVLEGTVKNITSYGVFIDLGGVDGLI HITDLSWGRVSDPKEVVELDQKLNVVILDFDDEKKRIALGLKQLTPHPWDALDPNLQVGD KVKGKVVVMADYGAFIEIAPGVEGLIHVSEMSWSQHLRSAQDFMKVGDEVEAVVLTLDRE ERKMSLGIKQLKQDPWETIEEKYPVGSKHTAKVRNFTNFGVFVEIEEGVDGLIHISDLSW 
TKKVKHPSEFTQIGADIEVQVLEIDKENRRLSLGHKQLEENPWDVFETVFTVGSVHEGTI IEMLDKGAVVALPYGVEGFATPKHLVKEDGSQAQMDEKLEFKVIEFNKDAKRIILSHSRI FEDVAKAEERAEKKAASNAKKSSKREETPAIQNQAASTTLGDIDALAALKEQLEGKK >gi|226332048|gb|ACIB01000008.1| GENE 106 130317 - 131429 399 370 aa, chain - ## HITS:1 COG:MA2102 KEGG:ns NR:ns ## COG: MA2102 COG3344 # Protein_GI_number: 20090946 # Func_class: L Replication, recombination and repair # Function: Retron-type reverse transcriptase # Organism: Methanosarcina acetivorans str.C2A # 97 369 79 343 563 177 36.0 2e-44 MELLLGIVIVVTCWMVVRIIRSSKNQEGYKRWRAGNYASENPYAKEKASGPLSQGLFSKR VRTTGVRRFDDGAIRWCANLLATEESRLREVLDYIPRQYTCFHVRKRSGGFRYISAPAGD FRSMQQTIYHRILLLANIHPAVTGFCPGKSVSDNARVHLGRKNVLKVDLHDFFPSIRSPR VRAAFREMGYSRSIAKVLAELCCLRCFLPQGAPTSPALSNIIAYPMDKKMMALAGEYGLV YTRYADDLTFSGDYLPKDEVLVRIHRIIREEGFTMNVKKTRFLSEHKRKIITGVSVSSGK KMTLPKVKKREIRKNVHYVLTKGLVGHQEHIGSTDPVYLKRLLGSLCYWRSIEPDNRYVS DSITALKRLM >gi|226332048|gb|ACIB01000008.1| GENE 107 131636 - 133855 1228 739 aa, chain - ## HITS:1 COG:no KEGG:BF1041 NR:ns ## KEGG: BF1041 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 739 1 739 739 1494 100.0 0 MRCNILIWCFWLFSLCFVQKGVANVVLDPQTNRPISADSILNNVMTFAPFYEKLVTDYRA DLYIKGTMDIKRKNFILRYVPSMFRLQKGVRQYMVETYSDLHFTAPNIYDQKVKASMGTV HNSRGVPGMLEYFNINLYSSTLLYDRLLSPLAKNGRKYYKYLIDSIMGGTDNRQYKIRFI PRSKSDQLVGGYMIVSSDVWSVREIRFSGRSELLLFSCLIKMGRVGKDDEFLPVSYNVEG QFNFLGNRINGVYVASLNYHDIVLEENQNKWKEKVRARIKGKSKYDLSDSYNLQCETTSF HTDSAYFETLRPIPLSEAERRLYKEDALRKDTIQRNIKPKSKSQVFWGQVGDVLISDYKL NLSNLGSVKCSPLINPFLLSYSGSNGLSYRQSFKYNRLFKHDRLLRVVPKLGYNFTRKEF YWSVNTEFNYLPEKMGAVHIDFGNGNRIYSSDVLDDLKAIPDSVFDFNQIHLDYFYDLYF NFRHSIEIINGLELSVGLSTHRRKAVKSSKLVPLTKSRETLNEDIQNKIRNTYLSFAPRV RLEWTPCLYYYMNGHRKINLRSKYPTFSIDWERGIKGVFGSTGQYERLEFDLQHHIPLGL MRNIYYRFGFGMFTNQKEMYFVDFNNFTRSNLPEGWNDEIGGVFQLLDRRWYNASRKYIR GHFTYEAPFLLLKHLIKYTRYVQNERLYASILSVPHLQPYVELGYGIGTHIFDFGVFVGS ENWKYTEVGCKFTFELFNR >gi|226332048|gb|ACIB01000008.1| GENE 108 134095 - 136284 1819 729 aa, chain + ## 
HITS:1 COG:slr0288 KEGG:ns NR:ns ## COG: slr0288 COG3968 # Protein_GI_number: 16331104 # Func_class: R General function prediction only # Function: Uncharacterized protein related to glutamine synthetase # Organism: Synechocystis # 5 729 7 724 724 620 44.0 1e-177 MSKMRFFALQELSNRKPLEITTPSNKLSDYYASHVFDRKKMQEYLPKEAYKAVVDATEKG TPISREMADLIANGMKSWAKSLNVTHYTHWFQPLTDGTAEKHDGFIEFGEDGEVIERFSG KLLIQQEPDASSFPNGGIRNTFEARGYTAWDVSSPAFVVDTTLCIPTIFISYTGEALDYK TPLLKALAAVDKAATEVCQLFDKNITRVFTNLGWEQEYFLVDTSLYNARPDLRLTGRTLM GHSSAKDQQLEDHYFGSIPPRVTAFMKELEIECHKLGIPVKTRHNEVAPNQFELAPIFEN CNLANDHNQLVMDLMKRIARKHHFAVLFHEKPYNGVNGSGKHNNWSLCTDTGINLFAPGK NPKGNMLFLTFLVNVLMMVHKNQDLLRASIMSAGNSHRLGANEAPPAILSIFLGSQLSAT LDEIVRQVTNSKMTPEEKTTLKLGIGRIPEILLDTTDRNRTSPFAFTGNRFEFRAAGSSA NCAAAMIAINAAMANQLNEFKASVDKLMEEGIGKDEAIFRILKENIIASEPIRFEGDGYS EEWKQEAARRGLTNICHVPEALMHYMDNQSRAVLIGERIFNETELACRLEVELEKYTMKV QIESRVLGDLAINHIVPIAVSYQNRLLENLCRMKEIFSEEEYEVMSADRKELIKEISHRV SAIKVLVRDMTEARKVANHKENFKEKAFAYEETVRPYLESIRDHIDHLEMEIDDEIWPLP KYRELLFTK >gi|226332048|gb|ACIB01000008.1| GENE 109 136626 - 137327 629 233 aa, chain + ## HITS:1 COG:BMEII0986 KEGG:ns NR:ns ## COG: BMEII0986 COG0664 # Protein_GI_number: 17989331 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Brucella melitensis # 14 231 10 228 231 84 25.0 1e-16 MVKMNLSEINVPERIAEMWAPLNAEQREFLANNFTLQNYKKNETIYCEGESPTYLMCLLS GKVKIYKDGVGGRSQIIRMIKPTEYFGYRAYFAKEDYVTAASAFEPSTICLIPMSAIMTL VSQNNDLGMFFIRQLSIDLGIADERTVNLTQKHIRGRLAESLLFLKESYGLEEDGSTLSI YLSREDLANLSNMTTSNAIRTLSNFATERLITIDGRKIKIIDEEKLKKISKIG >gi|226332048|gb|ACIB01000008.1| GENE 110 137372 - 139816 1972 814 aa, chain + ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 32 642 31 609 757 459 38.0 1e-129 
MKKGITLFLLSLLFITPGCKQSKETIVNEYNIVPLPNQMIPQQGRFEISKKVRVITTACT PDVQIIADSLINRLKLTSGITIKQTFENVTDEPVIRFVPQDGMPEEGYKLSVTPQNITLT ASTPKGFFYAVQTLYQLLPPVVYGNQKVKNAEWSVPAVEIEDAPRFAYRGLMLDVCRHFS PVEYIYKFIDMLAMHKMNTFHWHLTDDQGWRIEIKKYPKLTEIGSKRKETLVDYYYVNYP QVFDGKEHGGYYTQEQIRAIVDYAASKFITVIPEIEMPGHAIAAIASYPELSCTPDSTCD VTGTWGVFEQVFCPSDTTFQFLEGVMDEVMDLFPSKYIHIGGDECPKTAWINSEYCQSLI KQLGLKDDVTPNVIDGKKHTKEEKLQSYFITRMEKYLNSKGRNIIGWDEILEGGLAPNAT VMSWRGVEGGLNAAKAGHNAIMTPNPYAYLDQYQEEPEIAPVTIGGYNTLKKTYSYNPVP DDANELVKKHIIGVQGNIWTEYMPGNDNRDYQAFPRAVAIAETGWTLNANKNWNNFCQRM VEDFRRMDVKNVKACRNFFDVNINTHVDETNTLKVVLESFYPNAEIHYTTNGSVPTVESA IYNQPFALSGEMDVKAAAFKDGKMLGKVSGKKLYGNLISGKSFTVTPPIGGAKGDIFGEN DVLGTDISTFGLTNGKRGNIASMTPWSGFRMNDACNKLVFIVEFEQPTTVSKVVFGSLYN PASVILPPSVATVETSSDGRKYDKMAEASFKRNYPERGRKAFTDTLGFAPKEVKYIKITL QNGGTLRNGIDFVKDPNEKDVVQANIYLDEIEVY >gi|226332048|gb|ACIB01000008.1| GENE 111 139970 - 140917 708 315 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148988049|ref|ZP_01819512.1| 30S ribosomal protein S9 [Streptococcus pneumoniae SP6-BS73] # 9 309 5 303 306 277 48 3e-73 METERIKCLIIGSGPAGYTAAIYAGRANLSPVLYEGIQPGGQLTTTTDVENFPGYPQGIS GPQLMEDLRTQAERFGADIRFGIATASDLGQAPYKITIDGEKVIEADSLIIATGATAKYL GLDDEKKYAGMGVSACATCDGFFYRKKVVAVVGGGDTACEEAIYLAGLASKVYLVVRKPY LRASKIMQERVRKHDKIEVLFEHNVVGLFGENGVEGMNLVKRWEEPDEERYSLPIDGFFL AIGHKPNSDIFKPYLDTDEVGYITTDGDSPRTKVPGVFAAGDVADPHYRQAITAAGSGCK AAIEAERYLSEKGLI >gi|226332048|gb|ACIB01000008.1| GENE 112 140973 - 141614 584 213 aa, chain - ## HITS:1 COG:no KEGG:BF1034 NR:ns ## KEGG: BF1034 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 213 1 213 213 418 99.0 1e-116 MKQIIISIWIALLTLPVFAQQQMQAKVVLDKTAATFEKAGGICAEFNVTVFNKSRMAGQS AGVIELKGEKFVLKTDDGITWFDGKTQWSYLRSSDEVNISNPTGTELQGLNPYALLQIYR HGFDYKIGSLKNFGGKPVYEVVLTATDKKRDLSRIVLYVSKDTYQPLFIMMEQRDKSRSE ITVTGYQTGLKYADGMFVFDKKQYPHAEVIDLR >gi|226332048|gb|ACIB01000008.1| GENE 113 141639 - 144128 2239 829 aa, chain - ## HITS:1 COG:BS_spoIIIE KEGG:ns NR:ns ## COG: BS_spoIIIE COG1674 # 
Protein_GI_number: 16078743 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: DNA segregation ATPase FtsK/SpoIIIE and related proteins # Organism: Bacillus subtilis # 246 823 225 785 787 407 41.0 1e-113 MAKKKSDKEAEQKPASTKKYVAFFRNETIHFVIGLVLVIFSVYLLLAFTSFFFTGAADQS IIDSGNAQDLAAVNNHVKNYAGSRGAQLASYLINDCFGISSFFILIYLAVAGLKLMRVRV VRLWKWFIGCSLLLIWFSVFLGFVFMDHYQDSFIYLGGLHGYNISNWLISQVGIPGVWLI LLATGICFLIYMSARTIIWLRKLFSLSFLKRKQKEELAEVTQAPQPHEYDNPKPQEVEFD VNRTFRQEVPVKKVETTVVPETPVESSTEMPVTPEDRDVTSDGDVTMTFEQTAPDPVPPF RAASADKEPEFEIEPAADDENYLGAETEPYNPKLDLENYHFPTIDLMKHYENSEPTINME EQNANKDRIINTLRSFGIEISTIKATVGPTVTLYEITPEQGVRISKIRGLEDDIALSLSA LGIRIIAPIPGKGTIGIEVPNSNPKIVSGQSIIGSKKFQESTYDLPIALGKTITNEVFMV DLCKMPHVLVAGATGQGKSVGLNAIITSLLYKKHPAELKFVLVDPKKVEFSIYSVIEHHF LAKLPDGEDAIITDVTKVVQTLNSVCVEMDSRYDLLKMAHVRNIKEYNEKFINRRLNPEK GHKFMPYIVVVIDEFGDLIMTAGKEIELPIARIAQLARAVGIHMIIATQRPTTNIITGTI KANFPARIAFRVSAMMDSRTILDRPGANQLIGRGDMLFLQGADPVRVQCAFIDTPEVEEI TKYISRQQGYPTAFFLPEYVSEDSGSDLGEVDMGRLDPLFEEAARLIVIHQQGSTSLIQR KFSIGYNRAGRLMDQLEKAGIVGPSQGSKARDVLCVDENDLEMRLNNIQ >gi|226332048|gb|ACIB01000008.1| GENE 114 144241 - 144876 469 211 aa, chain + ## HITS:1 COG:no KEGG:BF1032 NR:ns ## KEGG: BF1032 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 175 1 175 211 336 100.0 4e-91 MKKESQVIFDKNVIEFVTVAAEFCAFLERAESMKRSTFVDTTLKILPLLYLKASMLPKCE MIGDESPETYVTEEIYEVLRINLASILAEKDDYLEVFLPDMAYSDEPIKKNISEDLADIY QDIKDFIFVFQLGLNETMNDSLAICQENFGLLWGQKLVNTMRALHDVKYSPKARGEDEEE EEYEPENNEDCHCEDDDCHCHDHGCHCHDDE >gi|226332048|gb|ACIB01000008.1| GENE 115 144883 - 145533 431 216 aa, chain + ## HITS:1 COG:no KEGG:BF1031 NR:ns ## KEGG: BF1031 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 216 1 216 216 415 100.0 1e-115 MILKRTISKEEVKEMPKAAFPGRIHVIQTESEAQKAVAYLQSQAILGIDSETRPSFTKGH SHKVALLQISSDECCFLFRLNMTGLTQPIIELLEDPKVIKVGLSLKDDFMMLHKRAPFNQ QACIELQEYVRPFGIQDKSLQKIYGILFSEKISKSQRLSNWEADVLTDAQKQYAATDAWA 
CLNIYHLLEELKRTGNYELAPEEEATEKVKVGSDQQ >gi|226332048|gb|ACIB01000008.1| GENE 116 145558 - 146736 597 392 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|223476703|ref|YP_002580685.1| ribosomal protein L11 methyltransferase, putative [Thermococcus barophilus MP] # 1 392 1 393 396 234 37 2e-60 MHKVYLKPGKEDSLKRFHPWIFSGAIAHFDGEPEEGEVVEVYTSKKEFIAKGHFQIGSIA VRVLSFKQEEINHDFWKHKLEVAYDMRRSIGIATNPTNNTYRLVHGEGDNLPGLVIDVYA RTAVMQAHSAGMHVDRMAIAEALSEVMGDQIENIYYKSETTLPFKADLFPENGFLKGGSS DNIAQEYGLKFHVDWLKGQKTGFFVDQRENRALLERYAKGRSVLNMFCYTGGFSFYAMRG GAKQVHSVDSSAKAIDLTNKNVELNFPGDTRHAAFAEDAFKYLDRMGDQYDLIILDPPAF AKHKDALRNALQGYRKLNAKAFEKIKPGGILFTFSCSQVVTKDNFRTAVFTAAAMSGRSV RILHQLTQPADHPVNIYHPEGEYLKGLVLYVE >gi|226332048|gb|ACIB01000008.1| GENE 117 147119 - 148276 742 385 aa, chain - ## HITS:1 COG:no KEGG:BF0946 NR:ns ## KEGG: BF0946 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 385 1 385 385 704 99.0 0 MKQTKTILAVILLVVLAGCGENKQSNNDLIIVDVSKSYPKKELILQDFMDVEYVALETTD EFLTQGLVQDVGKEYILATNRNNDGDIFIFDRKTGKGVRKINRRGQGAEEYARINEIILD ENNGEIFVKSPGNKILVYDLYGKFKRCLSLDREVSSIFDYDKDNLICYDMSDYHSKGEDR TKSYHIILSKQDGSITRDIFIPFKTIDTPIVNDGDRFIANYSYQIRLSNGKCTLMDTSAD TLYNYASDGTLSPFVVRTPSAHTMEPEVFLYMGIHTDRYYFMEAVKNVFNFEKGNGFYAD ELVYDKEEKAVFQVTIYNDDYVDKRTVAMTAKPINREIEDVTSLNAARLVEIYKKDQLKD GKLKEIASRLNEEDNPVIMLVKQKK >gi|226332048|gb|ACIB01000008.1| GENE 118 149402 - 150634 885 410 aa, chain + ## HITS:1 COG:STM3113 KEGG:ns NR:ns ## COG: STM3113 COG0477 # Protein_GI_number: 16766414 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Salmonella typhimurium LT2 # 1 400 10 406 418 410 54.0 1e-114 MNFLQFFVWGSWLISLGGYMERGLHFEGGQIGAIFATMGIASIIMPGITGIIADKWFNAE RLYGICHLLGAGCLFYASTATDYNQMYWAMLLNLMVYMPTLSLANTVSYNALEQYKCDLV KDFPPIRVWGTIGFICAMWAVDLTGFKNSSAQLYVGASSALLLGLYSFTLPPCPPAKNQS 
KTLLSSFGLDALSLFKRKKMAIFFFFSMLLGAALQITNTYGDAFLGSFAKIPEFADSFGV KHSVILLSISQMSETLFILAIPFFLKHFGIKRVMLISMFAWVFRFGLFGFGDPGSGIWML ILSMIVYGMAFDFFNISGSLFVELETKPETRASAQGLFFIMTNGLGAVIGGYASGAVVDA FSVYENGMLASRNWPAIWFIFAAYALAIGILFAIVFRYKHQPGELKKVNN >gi|226332048|gb|ACIB01000008.1| GENE 119 150637 - 151206 599 189 aa, chain + ## HITS:1 COG:MT1877 KEGG:ns NR:ns ## COG: MT1877 COG1259 # Protein_GI_number: 15841299 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Mycobacterium tuberculosis CDC1551 # 6 150 3 145 164 87 38.0 1e-17 MDKKVELQVLNISNSQAQVGAYAMVLGEVDGERQLPIIIGPAEAQATAICLKGIKAPRPL THDLFYSCLNVLGATLLRVLIYKAKEGVFYSYIYFKKDEEIIRIDARTSDAVALAVRADC PIFIYESILERECIRLTDGDERPDTPEEDENSRTEPVSIISLEEALNKAIQEENYELAAR LRDEINRHK >gi|226332048|gb|ACIB01000008.1| GENE 120 151212 - 151901 617 229 aa, chain + ## HITS:1 COG:BH1350 KEGG:ns NR:ns ## COG: BH1350 COG1385 # Protein_GI_number: 15613913 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus halodurans # 1 223 1 238 250 86 30.0 4e-17 MHVFYTPDIQTSTELPEEEAQHCVRVLRLTAGDEISLTDGKGNFYRAEISVATHKRCLVN IKETIYQEPLWDGHLHIAMAPTKNMDRNEWFAEKATEIGFDELTFLNCRFSERKVIKTER IEKILVSAIKQSLKARLPRLNEMTDFCTFIEKDFKGQKFIAHCYEGEKPLLKDVLTKGED ALVLIGPEGDFSEEEVKKAIEKGFVPISLGKSRLRTETAALVACHTMNM >gi|226332048|gb|ACIB01000008.1| GENE 121 151929 - 153482 1500 517 aa, chain + ## HITS:1 COG:no KEGG:BF1024 NR:ns ## KEGG: BF1024 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 517 1 517 517 914 99.0 0 MVKRMISRLSVLAVLIVFMAACSKKAEYIHVIPADASAVASINLNSLADKAGLNDKQNEG MKQKMMEALKSGMNAAAFQQLEKIMKNPSQSGIDVKAPVFVFTSKTFISPTIVAKVSNIE DLRASLDLMAKEGICQPIAEEEGYSFTSLQKNNLLVFNENAAVLTEAYGTSQMDVAKQTI STLLKQTEENSIASNGSFRKMQDQKGDINFFASMDAVPKMYTQQISLGLSSQIDLSEVKA VGNLNFEKGKIALQIETYSDNAETDALLKKQAQAVKKLNTTFLQNFPESTLAFLNIGVNG AAFYDLLFNNEEFRRNVSLAKADEVKSLFASFDGDISIGLINVTLNSVPTFAAYADAKNG NALKALYDNKKQLKLGKNEDIIQLGENEYVYKSRATNVFFGIRNKQMYATNDELLYKSIS 
KPVEKSIKDAGYVSDMKGKNVFFVINMDAILDLPVVKMMAGFGGEEYQTYYKLASKISYI EAFSDSEGKTETAILLKNKDDNALKQIVDFAKQFAGM >gi|226332048|gb|ACIB01000008.1| GENE 122 153495 - 154139 181 214 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 18 199 22 198 223 74 27 4e-12 MDSIHLQQTLPQVFADRNSITSDVWHRNLVFHKGKSYLIEAASGTGKSSLCSYIYGYRND YQGIINFDETNIKAYPVKQWVEIRKHSLSMLFQDLRIFTELTAIENIRLKNNLTGYKTRK EVLSLFEALGLSDKLNVKAGKLSFGQQQRVAFIRSLCQPFDFIFLDEPISHLDDNNARIM GELVMEEASKQGAGIIVTSIGKHIELTYDRILKL >gi|226332048|gb|ACIB01000008.1| GENE 123 154136 - 155341 795 401 aa, chain + ## HITS:1 COG:no KEGG:BF1022 NR:ns ## KEGG: BF1022 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 401 1 401 401 733 100.0 0 MNNLVWKLLRQHISIGQLTGFFFANLFGMVIVLLSAQFYKDVVPIFTEGDSFMKKDYMTA TKKISTLGSFAGKSNTFSSEEIEELKKQPFTRSVGAFTPSQFKVSAGLGMQEAGIHLSTE MFFEAVPDKFVDVSLDKWHFDENTHTIPIIIPRNYLNLYNFGFAQSRSLPKLSEGLMSLI QMDILMRGNGRVEQYKGNIVGFSNRLNTILVPQSFMNWANQNFAPDSQPDPSRLIIEVDN PADASIAKYFQQKGYETEDGKLDAGKTTYFLRLIVGIVLAVGLFISILSFYILMLSIFLL LQKNTVKLESLLLIGYSPSRVALPYQILTLGLNVVVLLLSVGIVSWARTSYLTTLNLLFP QMSVGSLWPTFAIGIFLFLLVSSINVIILKKKMLSIWIHKA >gi|226332048|gb|ACIB01000008.1| GENE 124 155555 - 156481 858 308 aa, chain + ## HITS:1 COG:TP0637 KEGG:ns NR:ns ## COG: TP0637 COG0324 # Protein_GI_number: 15639624 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA delta(2)-isopentenylpyrophosphate transferase # Organism: Treponema pallidum # 6 288 20 306 316 224 43.0 2e-58 MTMPDYDLIAILGPTASGKTPFAAALAAELNTEIISADSRQIYRGMDLGTGKDLEDYTIN GRQIPYHLIDIADPGYKYNVFEYQRDFLTAYETIKQKGCLPVLCGGTGLYLESVLKGYRL IPVPENQELRVRLAEKSLEELTAILSSYKTLHNSTDVDTVKRAIRAIEIEEYYAKTPIEE REFPQLNSLIIGVDIDRELRREKITRRLKQRLDDGMVEEVRRLLAEGIQPDDLIYYGLEY KYLTLYAIGKMTYDEMFTGLETAIHQFAKRQMTWFRGMERRGFTIHWVDASLPMEEKINF VKQKLKEF >gi|226332048|gb|ACIB01000008.1| GENE 125 156773 - 157699 888 308 aa, chain + ## HITS:1 
COG:TM0358 KEGG:ns NR:ns ## COG: TM0358 COG1597 # Protein_GI_number: 15643126 # Func_class: I Lipid transport and metabolism; R General function prediction only # Function: Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase # Organism: Thermotoga maritima # 10 284 6 281 304 108 27.0 1e-23 MSVEPGKWGVIYNPKAGTRKVQKRWKEIKEYMDSKGVSYDYVQSEGFGSVERLAGILANN GYRTIVVVGGDGALNDAINGIMSSNAEKKEEIAIGIIPNGIGNDFARYWELNLEYKQAVD WIINNRRKKIDVGYCNFYDGEKHQRRYFLNAVNIGLGARIVKITDQTKRFWGVKFLSYLA ALFLLIFERKLYRSHLKINDEHIRGRIMTVCVGSATGYGQTPSAVPYNGWLDVSVIYRPE FLQILSGLWMLIQGRILNHKVVKSYRTRKVKVLRAQNAAVDLDGRLLPRHFPIEIGIIPE ATTLIIPN >gi|226332048|gb|ACIB01000008.1| GENE 126 157715 - 158518 834 267 aa, chain + ## HITS:1 COG:FN1224 KEGG:ns NR:ns ## COG: FN1224 COG2877 # Protein_GI_number: 19704559 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 3-deoxy-D-manno-octulosonic acid (KDO) 8-phosphate synthase # Organism: Fusobacterium nucleatum # 1 266 10 283 286 265 48.0 5e-71 MIEQLKNNKTGNFFLLAGPCVIEGEEMAMRIAEKIVNITDKLGIPYVFKGSYRKANRSRL DSFMGIGDEKALKVLEKVHNTFGVPTVTDIHAADEAAMAADYVDILQIPAFLCRQTDLLV AAAQTGKTINIKKGQFLSPLAMQFAADKVIEAGNKNVMLTERGTTFGYQDLVIDYRGIPE MQSFGYPVILDVTHSLQQPNQTNGVTGGMPQLIETVAKAGIAVGADGLFIETHENPAVAK SDGANMLKLDRLEGLLTKLVKIREAIM >gi|226332048|gb|ACIB01000008.1| GENE 127 158667 - 161486 2538 939 aa, chain + ## HITS:1 COG:ZpqqL KEGG:ns NR:ns ## COG: ZpqqL COG0612 # Protein_GI_number: 15801644 # Func_class: R General function prediction only # Function: Predicted Zn-dependent peptidases # Organism: Escherichia coli O157:H7 EDL933 # 30 869 29 861 931 287 27.0 6e-77 MKHLFRGLLLVAVILCCNFQQAFAQQMPPIPIDKNVRIGKLDNGLTYYIRKNNLPANRAD FYIAQKVGSIQEEENQRGLAHFLEHMCFNGTTHFPGDALKQYLERIGVKFGENLNAYTAI DETVYNISNVPVKTPGAVDSCLLILHDWSNDLTLDPKEIDKERGVINEEWRTRMSAMMRM QEKLLPMMYPGDKYAHSFPIGTMDVVMNFKPQTLRDYYEKWYRPDLQGIVIVGDIDVDAV EAKIKTMFADIPAQPNAAERIYYPVADNKEPIICILKDKEQPHVQVLLFNKHEAVPDNQK GNVDYLIQQYAKNLISIMLNARLNELVQTANPPYIYARADDSNFFVAKTKDAFLGIVVCK 
EDSIENGIAAMLRELERARQFGFTETEYNRARAEYLRQLESSYNERDKQKNEKYVNEYVR HFLDNEPIPGIENEYTIINQIAPNIPVAAINQLMKGLITDDNQALALFAPEKEDLKLPSE AAIAKLLKDAKTEKLTAYVDKVSDEPLMAEAPKGGKIVSESKDDIFGATTLTLSNGVKVI IKKTDFKADEIRMKGVSLGGSSVFPDSEIININGLDAVGVGGLGNFSAVNLEKVLAGKKA SVSYDIANKTESVSGSCSPKDFETMMQLTYLTFTAPRRDDDAFASYKNRNKAALKNQELN PNVAFSDSIQAGIYMKHPRIIRIKADMVDQMDYDKILSMYQDRFKDASDFTFIFVGNVDV EKMKPVIAEYLGALPAVNRKETFKDNKIEMRQGIYKNEFTKQQETPKASVFAFYNGDCKY DLRNNLLLSMTSQILDLVYTEKVREDEGGTYGVYVGGTLQKYPKEKAILQIIFDTAPEKK EKLMKIIFGEIDNITKTGPSEANLNKVKEYMLKKHTEDLKENSYWLGSIDEYLYTGMNRM NDYEKIVNSITVNDIRKFADDLFKQKNEVEVTMVSPEKK >gi|226332048|gb|ACIB01000008.1| GENE 128 161583 - 163064 1286 493 aa, chain + ## HITS:1 COG:no KEGG:BF1017 NR:ns ## KEGG: BF1017 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 493 1 493 493 900 100.0 0 MLFNELRLHGKLAAKRHPMYEKNKIGKYIMYASFIFWGAYFIFIGIGLAKAISTEVPNME AYHILNSGLIFALALDFVIRFPFQKTPTQEVKPYLLLPVKRSRILDFLLLRHGLSSFNLI WLFLFVPFAALTVFPFYGISGVLTYSIGIWLLMVFNGYWYLLCRTLINEHIWWVVLPIVV YSGIAIAIFIPKTGFISNFFMNLGEGYIEGNLLAYLGTLAATVLVWCINRKVMTGLIYNE INKVEDTKVKTVSEYRFLERFGEVGEYMRLELKMLLRNKRCKASLRSVAMLVIIFSIILS FSSTYDHMKSFVQVYSFLGFGMVILIQLMGFEGNYIDGLMTRKESIKSLLTAKYYIYSLA EIIPLILLIPAFVMGKASLLGAVALIFLTTGPVYCILFQLAVYNHKTVPLNESITGKQSM NTGMQMLISFGLFIVPTTLYGTLPMLLGQTWAYIVILIIGLGFTLTSPMWINNVYVRFMQ RRYENMEGFRDSK >gi|226332048|gb|ACIB01000008.1| GENE 129 163096 - 163803 219 235 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 212 1 218 245 89 28 2e-16 MITIHNLQKNFGTQTAVDIENYTINQGEMVGLVGNNGAGKTTLFRLMLDLLKADTGEIII NDIHVSQSEDWKSFTGAFIDDGFLISYLTPEEYFYFIGKMYGLKKEEVDERLIPFERFMN GEVLGQKKFIRNFSAGNKQKIGIVSAMLHYPKLIILDEPFNFLDPSSQSVIKHLLKKYNE EHNATVIISSHNLNHTVDVCPRIAVLEHGVIIRDLVNENNSAEKELEDYFNVEEE >gi|226332048|gb|ACIB01000008.1| GENE 130 163808 - 164323 232 171 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|167856514|ref|ZP_02479226.1| 50S ribosomal protein L1 [Haemophilus parasuis 29755] # 58 
169 69 174 175 94 43 5e-18 MKKHLFNLLLFTVLVIGLSGCRTSAPKLDYKKLARASVRLGVDIGMEDNHKLYLEAAEWI GTPYRGGGETKRGTDCSGMTCQIYKKVYHTKLQRSTDGQKKESSKVARRNLREGDLVFFS SRKSRRKVAHVGIYLKDGKFVHASTSQGVIVSSLNEPYYRTHWISGGRVRK >gi|226332048|gb|ACIB01000008.1| GENE 131 164353 - 165039 451 228 aa, chain + ## HITS:1 COG:MA2967 KEGG:ns NR:ns ## COG: MA2967 COG0546 # Protein_GI_number: 20091785 # Func_class: R General function prediction only # Function: Predicted phosphatases # Organism: Methanosarcina acetivorans str.C2A # 2 216 58 272 279 202 48.0 4e-52 MPKIPQMKYTVYLFDFDYTLADSSRGIVTCFRSVLERHGYTGITDDMIKRTIGKTLEESF SILTGITDADQLESFRQEYSKEADIYMNANTILFPDTLPTLTHLKKQGIRIGIISTKYRF RILSFLRNHMPDDWFDIIIGGEDVTHHKPDPEGLLLAIDRLKACPEEVLYIGDSTVDAGT AAAAGVSFTGVTSGMTTAQEFQAYPYDRIISTLGQLISVPEDKSGCPL >gi|226332048|gb|ACIB01000008.1| GENE 132 165819 - 166136 534 105 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53712302|ref|YP_098294.1| 50S ribosomal protein L21 [Bacteroides fragilis YCH46] # 1 105 1 105 105 210 100 5e-53 MYAIVEINGQQFKAEAGQKLFVHHIQNAENGATVEFDKVLLVDKDGNVTVGAPTVDGAKV VCQIVSSLVKGDKVLVFHKKRRKGHRKLNGHRQQFTELTITEVVA >gi|226332048|gb|ACIB01000008.1| GENE 133 166159 - 166428 460 89 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53712301|ref|YP_098293.1| 50S ribosomal protein L27 [Bacteroides fragilis YCH46] # 1 89 1 89 89 181 100 2e-44 MAHKKGVGSSKNGRESQSKRLGVKIFGGEACKAGNIIVRQRGTEFHPGENIGMGKDHTLF ALVDGTVNFKVGREDRRYVSIIPAEATEA >gi|226332048|gb|ACIB01000008.1| GENE 134 166391 - 166549 72 52 aa, chain + ## HITS:1 COG:no KEGG:BF1008 NR:ns ## KEGG: BF1008 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 52 1 52 52 79 100.0 5e-14 MFLSSLLKQQKHKFPEFKGNCKKLKKEMQSVDCVSFLFYTRRTLLSYFYISL >gi|226332048|gb|ACIB01000008.1| GENE 135 166585 - 167859 1269 424 aa, chain + ## HITS:1 COG:aq_298 KEGG:ns NR:ns ## COG: aq_298 COG0172 # Protein_GI_number: 15605830 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Seryl-tRNA synthetase # Organism: Aquifex aeolicus 
# 1 423 1 422 425 326 43.0 7e-89 MLTIKQITENTDAVIRGLEKKHFKGAKETIAQVIEVNDKRRNTQNQLDKNLAEVNSLSKT IGQLMKEGKKEEAEVAKARVAEIKESNKTLQADMDQAANDMLNLLYTIPNIPYDSVPEGV GAEDNVVEKMGGMETQLPTDALPHWELAKKYDLIDFDLGVKITGAGFPVYKGKGAQLQRA LINFFLDEARKSGYTEIMPPTVVNAASGYGTGQLPDKEGQMYHCEVDDLYLIPTAEVPVT NIYRDVILDEKQLPIKNCAYTQCFRREAGSYGKDVRGLNRLHEFSKVELVRIDKPEHSRQ SHQEMLDHVEGLLQKLELPYRILRLCGGDMSFTAALCFDFEVYSEAQKRWLEVSSVSNFD TYQANRLKCRYRSGEKKTELCHTLNGSALALPRIVAALLENHQTPEGIRIPKALVPYCGF DMID >gi|226332048|gb|ACIB01000008.1| GENE 136 168079 - 170367 2089 762 aa, chain + ## HITS:1 COG:TM1640 KEGG:ns NR:ns ## COG: TM1640 COG0493 # Protein_GI_number: 15644388 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: NADPH-dependent glutamate synthase beta chain and related oxidoreductases # Organism: Thermotoga maritima # 309 760 8 460 468 434 51.0 1e-121 MNKIISKEHFSEKVFKLVIEAPLIAKSRKAGHFVIVRVGEKGERMPLTIAAADPKAGTIT LVVQEVGLSSTRLCELNEGDYITDVVGPLGQATHIENFGTVVCAGGGVGVAPMLPIVQAL KAAGNRVITVLAGRSKELIILEKEMRESSDEVIIMTDDGSYGRKGLVTEGVEEVIKREKV NKCFAIGPAIMMKFVCLLTKKYEIPTEVSLNTIMVDGTGMCGACRITIGGKTKFVCVDGP EFDGHQVDFDEMLKRMGAFKTIEREELHKLDECEATKVIDENGRTAPWREALRKAIKAKD RANIERCQMNELDPEYRSHSRKEEVNQGLTKEQAVTEAQRCLDCANPGCMTGCPVGIDIP RFIKNIERGEFLEAAKTLKETSALPAVCGRVCPQEKQCESKCIHLKMGKEAVAIGHLERF AADYERESGQISVPEVGEKNGIKVAVIGSGPAGLSFAGDMAKYGYDVTVFEALHEIGGVL KYGIPEFRLPNKIVDVEIENLAKMGVTFIKDCIVGKTISVEQLEEEGFKGIFVASGAGLP NFMNIPGENSINIMSSNEYLTRVNLMDAASPDSDTPVAFGKNVAVIGGGNTAMDSVRTAK RLGAERAMIIYRRSEEEMPARLEEVKHAKEEGIEFLTLHNPIEYLADEQGRVKQVILQKM ELGEPDASGRRSPVPIPGATETVDIDLAIVSVGVSPNPIVPSSIKGLELGRKGTITVNDQ MQSSIPTIYAGGDIVRGGATVILAMGDGRRAAAAMNEQLSTK >gi|226332048|gb|ACIB01000008.1| GENE 137 170550 - 170903 479 117 aa, chain - ## HITS:1 COG:BH1689 KEGG:ns NR:ns ## COG: BH1689 COG0853 # Protein_GI_number: 15614252 # Func_class: H Coenzyme transport and metabolism # Function: Aspartate 1-decarboxylase # Organism: Bacillus halodurans # 1 105 1 105 127 125 59.0 2e-29 
MMIEVLKSKIHCARVTEANLNYMGSITIDENLLDAANMIAGEKVYIADNNNGERFETYII KGERGSGKICLNGAAARKVQPDDIVIIMSYALMDFEEAKSFKPTVIFPDPATNSVVK >gi|226332048|gb|ACIB01000008.1| GENE 138 170926 - 171774 835 282 aa, chain - ## HITS:1 COG:CAC2915 KEGG:ns NR:ns ## COG: CAC2915 COG0414 # Protein_GI_number: 15896168 # Func_class: H Coenzyme transport and metabolism # Function: Panthothenate synthetase # Organism: Clostridium acetobutylicum # 1 279 1 279 281 234 43.0 1e-61 MKVIHTIKDLQAELSVLKAQGKKVGLVPTMGALHAGHASLVKRSVNENEVTVVSVFVNPT QFNDKNDLVKYPRTLDADCKLLEACGATYAFAPSVEEMYPEPDTRQFSYAPLDTVMEGAF RPGHFNGVCQIVSKLFEAVKPHRAYFGEKDFQQLAIIREMVRQMQFDLEIVGCPIVREED GLALSSRNARLSAEERENALKISQTLFKSRTFAATHTVSETLKFVEDAIAAVPGLRLEYF EIVDGNTLQKVDNWNQTSYVVGCITVFCGDVRLIDNIKYKES >gi|226332048|gb|ACIB01000008.1| GENE 139 171862 - 172845 662 327 aa, chain + ## HITS:1 COG:TM0895 KEGG:ns NR:ns ## COG: TM0895 COG0297 # Protein_GI_number: 15643657 # Func_class: G Carbohydrate transport and metabolism # Function: Glycogen synthase # Organism: Thermotoga maritima # 59 244 2 183 486 90 26.0 4e-18 MQKKTKESPKQQDDLSLKNKHLTIHKELYRQRTCFNARFFLYLCRIFTRTIVIMTKANKV LFITQEITPYVSESEMANIGRHLPQAIQEKGREIRTFMPKWGNINERRNQLHEVIRLSGM NLIIDDTDHPLIIKVASIQSARMQVYFIDNDDYFQNRLQTADENGVEYDDNDSRAIFYAR GVLETVKKLRWCPDVIHCHGWMTALAPLYIKKAYKDEPSFRDAKVVFSVYEDDFKGTFNN DFASKLMLKGINKKDVATLKDPVDYATLCKLAVDYSDGIVQNSEHVNEDVMNYARQSGKL VLDYQAPDALADACNSFYDQVWEAEQK >gi|226332048|gb|ACIB01000008.1| GENE 140 172869 - 174359 1173 496 aa, chain + ## HITS:1 COG:no KEGG:BF0922 NR:ns ## KEGG: BF0922 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 496 1 496 496 954 100.0 0 MKAKYVWVALLALTFFGCDDNTGTIGWDMLPDSDQNINGRYTTYELTTNSDLSGPVFAKT SVGYVGKFTDKEFGEYEASFLAQLNSPDGISFPSVYDPETNPKGVMAGDSIHTAELILYY KSYFGDSINPCRMTVYELDENLTQNYYTDIDPLKYYNPNNLLARKAYTAVDQSLSDSIRN SDDFYPNVRLTSEEITKLGKRIYRLNRDHPEYFKTSEAFINNVFKGIYAKNDYGNGTILY VDQINLNVVIRCHEKDSLGNNLKKKNGADSLYYTTRTFATTKEVIQANKFVNSEKLNEIA 
KKTDCTYLKSPAGIFTQATLPINKIYEELSHDTINAVKLTFNSYNQPDNGKFSMKAPTYV LLLREKERQSFFEENKLTDNITSYLAVHNAIISNKPTTNQYVFTNLTRLINACVNEKQEA KKKAGDSWNEAAWEAANPDWNKVVLIPVLVQYDSSSNKNMISIQHDLQPGYVKLEGGPDG TKLKLEVTYTNFNGKQ >gi|226332048|gb|ACIB01000008.1| GENE 141 174746 - 176134 1149 462 aa, chain - ## HITS:1 COG:MA4052 KEGG:ns NR:ns ## COG: MA4052 COG1449 # Protein_GI_number: 20092845 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-amylase/alpha-mannosidase # Organism: Methanosarcina acetivorans str.C2A # 1 387 1 390 396 257 36.0 3e-68 MRTICLYFEIHQIIHLKRYRFFDIGADHYYYDDYANETGINEVAERSYIPALNTLIEMVK NSGGAFKVALSISGVALEQLEIHAPAVIDLLHILNDTGCCEFLAEPYSHGLSSLANEDCF REEVMRQSEKMKQMFGKAPKVFRNSSLIYSDEIGATVASMGFKGMLTEGAKHVLGWKSPH YVYHCNQAPSLKLLLRDFKLSDDISLRFSNSDWSEYPLFADKFIGWIDALPQEEQVINIF MELKALGMAQPLSSNILEFLKALPYCAKEKGITFSTPSEIISKLKSVSQLDVPYPMSWVD EERDTSSWLGNVLQREAFSKLYSVAERVHLCDDRRIKQDWDYLQASNNFRFMTTKNTGVW LNRGIYDSPYDAFTNYMNILGDFIKRVNSLYPEDIDNEELNSLLTTIKNQGEEIAELHKE VDKLQAKAEKAAKTVKAEPKAAPKKAAAKKPAAKKATAKKED >gi|226332048|gb|ACIB01000008.1| GENE 142 176147 - 177421 1059 424 aa, chain - ## HITS:1 COG:Ta0340 KEGG:ns NR:ns ## COG: Ta0340 COG0438 # Protein_GI_number: 16081471 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Thermoplasma acidophilum # 1 417 20 388 388 211 30.0 2e-54 MKVLMFGWEFPPHILGGLGTASYGLTKGMSQQEDMEITFCIPKPWGDEDQSFLRIIGMNS TPIVWRDVDWEYVKGRVGSYMDPQLYFDLRDHIYADFNYLNANDLGCIEFSGRYPDNLHE EINNYSIVAGVIARQQEFEIIHSHDWLTYPAGIHAKQVSGKPLVIHVHATDFDRSRGNVN PTVYAIEKNGMDHADHIMCVSELTRQTVIHKYFQDPKKVSTVHNAVSPLSQEIQDIVPNK NPKEKVVTFLGRITMQKGPEYFVEAAAMVLQRTRNVRFVMAGSGDMMDQMIRLAAERGIA DRFHFPGFMKGKQVYEVLKASDVYIMPSVSEPFGISPLEAMQCSVPSIISKQSGCAEILE KCIKTDYWDIHAMADAIYSICTYPAMYEYLRDEGKKEVDEIKWENVGYKVRGIYDEVIKN YGKQ >gi|226332048|gb|ACIB01000008.1| GENE 143 177436 - 179376 1614 646 aa, chain - ## HITS:1 COG:MA0905 KEGG:ns NR:ns ## COG: MA0905 COG3408 # Protein_GI_number: 20089784 # Func_class: G Carbohydrate transport and metabolism # 
Function: Glycogen debranching enzyme # Organism: Methanosarcina acetivorans str.C2A # 15 636 36 669 680 279 32.0 1e-74 MSYLHFDKTLMINLEESLPREILRTNKSGAYHCTTIVDCNTRKYHGLLVIPVPNLDDENH VLLSSLDETVIQHGAEFNLGLHKYQGNHFSPNGHKYIREFDCEHIPATTYRVGGVILRKE KIFVHHENRILIRYTLVDAHSATTLRFRPFLAFRSVREYTHENSQASRDYQLVENGIKTC MYPGYPELFMQLNKKNEFHYEPNWYRGIEYPKEQERGYDFNEDLYVPGYFEVDIKKGESI IFSAGISEISPRRLKQTFEAEVADRTPRDSFYHCLKNSAHQFHNKQEEDHYILAGYPWFK CRARDMFVSLPGLTLAVDEIGEFEDVMETARKAINNYIKGEPRGCKIYEMDDPDVLLWAV WALQQYAKETSREQCRAKYGSLLEEIIDFIRQRKHDNLFLHENGLLYANGADRAITWMNS TVNGRPVIPRTGYIVEINALWYNALRFIADLVREGGNGLLADSLDAQAEVTGKSFVEVFR NEYGYLLDYVDGNMMDWSVRPNMIFTVAFDYSPLDRVQKKQVLDIVTKELLTPKGLRTLS PKSGGYNPNYVGPQIQRDYAYHQGTAWPWLMGFYMEAYLRIYKMSGISFVERQLIGLEDE MTSHCVGSLPELFDGNPPFKGRGAVSFAMNVAEILRILKLLSKYNL >gi|226332048|gb|ACIB01000008.1| GENE 144 179547 - 180227 289 226 aa, chain + ## HITS:1 COG:Rv1337 KEGG:ns NR:ns ## COG: Rv1337 COG0705 # Protein_GI_number: 15608477 # Func_class: R General function prediction only # Function: Uncharacterized membrane protein (homolog of Drosophila rhomboid) # Organism: Mycobacterium tuberculosis H37Rv # 16 181 44 216 240 99 36.0 7e-21 MRPEIRQILLTMVLPLFLIFILYMIKVLEIGMDWDFTSLGVYPLSKKGMFGIFTHPLIHS GFKHLLTNTLPLFFLSWCLFYFYRSIAPSIFLIIWIGCGAITFLIGKPAWHIGASGIIYG LAFFLFFSGLLRKYIPLIAISLLVTFLYGGLIWNMLPYFTPSGISWEGHLSGAIIGTICA FSFMGYGPQKPDPFANEEEEESVSATDETDNIETDEEEEEHEIDAE >gi|226332048|gb|ACIB01000008.1| GENE 145 180265 - 180861 659 198 aa, chain - ## HITS:1 COG:BMEI0883 KEGG:ns NR:ns ## COG: BMEI0883 COG2095 # Protein_GI_number: 17987166 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Multiple antibiotic transporter # Organism: Brucella melitensis # 7 193 4 207 209 76 29.0 3e-14 MFANFNFQQMVSAFIVLFAVIDIIGSIPIIINLKEKGKDVNATKATVISFALMIGFFYAG DFMLKLFHVDIESFAVAGAFVIFLMSLEMILDVEIFKNQGPIKEATLVPLVFPLLAGAGA FTTLLSLRAEYASINIVIALILNMLWVYFVVSMTGRVERFLGKGGIYIIRKFFGIILLAI SVRLFTANITLLIAALQK >gi|226332048|gb|ACIB01000008.1| GENE 146 180977 - 181720 643 
247 aa, chain + ## HITS:1 COG:no KEGG:BF0995 NR:ns ## KEGG: BF0995 # Name: not_defined # Def: CRP family transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 247 1 247 247 484 99.0 1e-136 MVSANPYNRYICFQIRRLRYMETMFDTLLQLPLFQGLCHEDFTNILEKVKLHFTRHKPGE PLIKSGEVCDQLLFLLKGRLSSVTVSEDDTLTVIEYFEAPAVLEPYSMFGMNTRYISSYI PYNEEAQMVSISKSFVMGELFKYDIFRLNYMNIVSNRAQNLYTRLWDKAPKDIEDKIIRF ILGHIERMTGEKLFKVKMDDLARMLDDTRLNVSKALNGLQELNLLELHRKEIRIPDLSLL TEWNEKR >gi|226332048|gb|ACIB01000008.1| GENE 147 181733 - 181999 226 88 aa, chain - ## HITS:1 COG:no KEGG:BF0994 NR:ns ## KEGG: BF0994 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 88 57 144 144 170 100.0 1e-41 MIRIFDEIEQKSDFVFAWSCDIDNEIHEEISICVTEEPIQKVMEKVLKGSGLVYQRLDRQ IVVYRLLGHNACRVDSVRVMTNMEQNDR >gi|226332048|gb|ACIB01000008.1| GENE 148 182359 - 183465 593 368 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 171 365 132 325 331 68 30.0 2e-11 MEKTKRDSRHIDPLKDEEKFFQWLKFKSVRMNQEHLSKDDYERLRQRIHVSLRMLRRKRT VRRVAYYGVSVCVIIALGIVAYLNHYERVEPVPSVVEKKAEIVWQPLKSEDIRLVSGDSI TSFRQNVQLLLSKDGSAMVYHPNSGQKRIRMEQDEVNELVVPYGKRSKVKLEDGTEIWLN SGSVLKFPTHFSGEKREVSLKGEMYAEVTADSKKPFIVHTAHFDIQVYGTRFNISAYEDE PTPSCVLVDGIVGFRPESGPEIRMKTNEKVLYDGKRFEKRKVSASRYTCWKEGYLELDDA NIMDVLNRIGRYYNLSFSFGDKKRLTGRKCSGKIYLSDNIDNVLTTISLLYSTDYRKEER TIFISENP >gi|226332048|gb|ACIB01000008.1| GENE 149 183575 - 184156 569 193 aa, chain - ## HITS:1 COG:no KEGG:BF0992 NR:ns ## KEGG: BF0992 # Name: not_defined # Def: RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 193 1 193 193 345 100.0 4e-94 MLPKKKNLEEERAKHTVDSLYKDYVDDLFSYALGFGFDKQTAMDAIHDVFCRVCIREREV QEIQNPKFYLLRALRNQLIDTYKLKRNYSEVLTGEITDELPYKIKITVEDEIIAAEEQAE 
VSQKVDEILSILTERQREIIYLRYMQECSYEEIAEIMQISVPACRKLLYRTLLKLKHNNT LVLFYLLLSINVG >gi|226332048|gb|ACIB01000008.1| GENE 150 184182 - 185417 1057 411 aa, chain - ## HITS:1 COG:MA2715 KEGG:ns NR:ns ## COG: MA2715 COG2873 # Protein_GI_number: 20091539 # Func_class: E Amino acid transport and metabolism # Function: O-acetylhomoserine sulfhydrylase # Organism: Methanosarcina acetivorans str.C2A # 25 409 43 438 441 276 37.0 7e-74 MKKNSFETQILHTPFEKEDAYHSLSMPVYHTAAYEFETAEEMEAAFCGQKAGHAYSRITN PTVQYFEQRVQRVTGALSVTALNSGMAAISNALITLSSAGANVVTSTHLFGNTYSFLKST LEAFGVEVRFCDLTCPEEVKQQIDGDTCALFLEVITNPQLEVADLKALADIAHKAGVPLL ADTTAIPFHVFHATDFGVDIEIVSSTKYISGGATCIGGLIIDYGTFDWEHSAKLAALSAD TGKEAFTVKLRKEVHRNLGAYMTPQVAYMQTLGLETMEVRFARQAETCLKLAQCLQELPE IESVNYTGLESNPFYELSTRQFGSLPGAMLTFDLPSREICFRFINRLRIIRRATNLFDNK TLAIHPASTIYGSFTEDQRRGMDVSQKTIRLSVGLERADDLLDDIIQALKS >gi|226332048|gb|ACIB01000008.1| GENE 151 185449 - 186633 1066 394 aa, chain - ## HITS:1 COG:YPO3006 KEGG:ns NR:ns ## COG: YPO3006 COG1168 # Protein_GI_number: 16123185 # Func_class: E Amino acid transport and metabolism # Function: Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities # Organism: Yersinia pestis # 1 387 1 390 393 373 44.0 1e-103 MKYNFDEVVERRGTDSVKYDAVSERWGRSDLLPMWVADMDFRTPPFVIEAIRRRLDHEVL GYTFACEAWYTSIINWQKERHGWNVTREMLTFTPGIVRGLAFALQCFTAPGDKVMVMPPV YHPFFLVTEHNHREVVYSPLLLKDGQYQIDFERFRADVKGCKMLILSNPHNPGGRVWTRE ELAEIAEICFDNQVLVISDEIHADLTLPGYTHPTFALVSEKARRNSLVFMSPSKAFNMPG LASSYCIIEDEAIRHRFQTYMEASEFSEGHLFAYLGVAAAYSNGTEWLDQALEYIQENID FTDEYLKTHIPAIRMIRPQASYLIFLDCRGMGVSQKELVDFFVDGAHLALNDGAMFGKEG EGFMRLNVACPRSVLRQALDQIKEAYELKHDTIA >gi|226332048|gb|ACIB01000008.1| GENE 152 186805 - 187590 734 261 aa, chain + ## HITS:1 COG:lin1028 KEGG:ns NR:ns ## COG: lin1028 COG0561 # Protein_GI_number: 16800097 # Func_class: R General function prediction only # Function: Predicted hydrolases of the HAD superfamily # Organism: Listeria innocua # 1 261 1 256 256 142 34.0 9e-34 
MIKALFFDIDGTLVSFNTHEIPSSTLAAIAEAKAKGIKIFIATGRPKAIINNLTALQERE LIDGYITMNGGYCFVGDEVIYKHSIPVQDVKALAALSDERNFPCIFVAEHTVAVCNTNKL VNEIFHDFLHVDILPIQTTDEATQAEIFQMTPFITTEEEKTILPLLPNCESGRWFPAFTD IVAKGIRKQKGIDEIIRHFGIGQEETMAFGDGGNDISMLRHAAIGVAMGNANDDVKETAD YITTSVDEDGIQKALKHFGII >gi|226332048|gb|ACIB01000008.1| GENE 153 187592 - 189592 1667 666 aa, chain + ## HITS:1 COG:YHR031c KEGG:ns NR:ns ## COG: YHR031c COG0507 # Protein_GI_number: 6321820 # Func_class: L Replication, recombination and repair # Function: ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member # Organism: Saccharomyces cerevisiae # 11 418 235 698 723 166 30.0 1e-40 MEFTIDTGNKEFQDALNLIQYTRQSVFLTGKAGTGKSTFLKYICKNTKKKHIVLAPTGIA AINAGGSTLHSFFKLPFHPLLPDDPNLSLQRGRIHEFFKYTKPHRKLLEQVELVIIDEIS MVRADMIDAVDRILRVYSRNLRDPFGGKQVLLVGDVFQLEPVIKGDEREIINRFYPTPYF FSARVFNEIELVSIELQKVYRQSDAVFVSVLDHIRSGAAGAADLQLLNTRYGAQIDASEE DLYITLATRRDTVETINERKLTELPGDPVVFEGEINGDFPESSLPTSKELTLKPGAQIIF IKNDFERRWVNGTIGVVSGIDNDGIIYVITDDGKECDVHRESWRNIRYKYNEEKKEIEEE ELGTFTQYPIRLAWAITVHKSQGLTFSRVVIDFTGGVFAGGQAYVALSRCTSLEGIQLKK PISRADIFVRPEIVSFSGRFNNRQAIDKALKQAQADVQYAAAARAFDKGDFETCLEQFFL AIHSRYDIEKPAARRLIRRKLGVVNLLREQKRKLQAQMEAQKKSLQKYAREYLLMGNECI TQAHDVRAALANYDKAIELYPEYIDAWIRKGITLFNEKEFFDAENCLNRAVSLRPSEFKA LYNRGKLRLQTENIEGALSDLDKATSLKPEHPGAHELFGDALLKVGKETEAAIQWRIAEE LRKKKK >gi|226332048|gb|ACIB01000008.1| GENE 154 189830 - 192343 2377 837 aa, chain + ## HITS:1 COG:no KEGG:BF0987 NR:ns ## KEGG: BF0987 # Name: not_defined # Def: outer membrane assembly protein # Organism: B.fragilis # Pathway: not_defined # 1 837 1 837 837 1563 99.0 0 MKKGFKITAIVIGVILILMFLLPFAFRGKIEGIVKSEGNKMLNGHFDFSSLDISLFRNFP KASVTLNDFWLKGTGEFENDTLVKAGEVTAAINLFSLFGDDGYDVSKVSVENTRLHAIVL PDGKTNWDIMKPDSSTASETQESGESSTFRIKLQRFVIKNMNVVYDDRQSAMYADIHNFN ALCSGDLGSDQTLLSLEAETEALTYKMNGIPFLSQANVYAKMDVDADLAHNKFTLKKNEF RLNAIKAGIDGWIELKDPAIDMDLKLNTSEIGFKEILSLIPAIYSKEFKNLKTDGTATLE ATAKGILQGDMVPQFDVRLAVKNAMFRYPSLPAGVDQINIDAQVRNPGGNIDLTEISIHP 
FSFRLAENPFSLTADIKTPVSDPDFTAEAKGVLNLGMIKQVYPLDDMELNGTVRADMTMA GHLSYIEKEQYDRFSASGTIALSDMKLKMKEMPDVEIKKSLFTFTPKYLQLSETTVAIGK NDLTADCRFENYMGYILKGSTLKGTLNVRSNHLNLNDFMTATTDSTAQTSQASSTEETAS MIEVPQNIDFQMDAGLKEVLFDKMTFTNMNGKLIVKNGKVDMTNLSMNTMGGSVVMNGYY STADPKKPEMNAGFRMENIGFAQAYKELDMVQQMAPIFENLKGNFSGNMHIRTLLDNQMS PVMDTMQGNGSLSTQDLSLSGVKVIDQIAEAVKKPELKEMKVKDMALDFTIKDGRVSTKP FDIKLGDYVMNLSGSTGLDQTIDYSGKIKLPASAGDIAKLTTLDLKIGGTFSSPKVSLDT KSMTNQAVEAVTDKAISEIGKKLGLDSATTANKDSVKEKVKEKAVEKALDFLKKKIK >gi|226332048|gb|ACIB01000008.1| GENE 155 192391 - 193005 766 204 aa, chain + ## HITS:1 COG:YPO2212 KEGG:ns NR:ns ## COG: YPO2212 COG0009 # Protein_GI_number: 16122440 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation factor (SUA5) # Organism: Yersinia pestis # 5 201 7 202 206 158 41.0 5e-39 MILKLYEKNNNPQDLQRIVDLLNDGGLIIYPTDTMYAIGCHGLKERAIERICRIKEIDPK KNNLSIICYDLSSISEYAKVDNNIFKLMKRNLPGPFTFILNGTNRLPKIFRNRKEVGIRM PDNSIIREIARLLDAPIMTTTLPHDEHEDIEYVTDPELIDEKLGDVVDLVIDGGIGGIEP STVVNCTEGEAAIVRQGKGELEEA >gi|226332048|gb|ACIB01000008.1| GENE 156 193069 - 193482 493 137 aa, chain + ## HITS:1 COG:no KEGG:BF0985 NR:ns ## KEGG: BF0985 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 137 1 141 141 176 96.0 3e-43 MKKVLFFALVLTIATACSQTKDSYLEGFKLFVESVQKNAQDYTKADWEKADEQFTKLKDS YNNFSKQMTSDEKGEVIKLESTYAALKLKKIGDDLKESTKDAFEKVKDTAKDVKEGAQKA AKKGEKAMEGIKDGLKD >gi|226332048|gb|ACIB01000008.1| GENE 157 193576 - 194388 912 270 aa, chain - ## HITS:1 COG:BB0152 KEGG:ns NR:ns ## COG: BB0152 COG0363 # Protein_GI_number: 15594497 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase # Organism: Borrelia burgdorferi # 1 264 1 264 268 373 69.0 1e-103 MRLIIQPDYQSVSQWAAHYVAAKIKAANPTPEKPFVLGCPTGSSPLGMYKALIDLNKKGI VSFQNVVTFNMDEYVGLPKEHPESYYSFMWNNFFSHIDIKPENTNILNGNAADLDAECAR YEEKIKSYGGIDLFMGGIGPDGHIAFNEPGSSLSSRTRQKTLTTDTIIANSRFFDNDINK 
VPKTSLTVGVGTVLSAREVMIIVNGHNKARALYHAVEGAITQMWTISALQMHEKGIIVCD DAATAELKVGTYRYFKDIEADHLDPQSLLK >gi|226332048|gb|ACIB01000008.1| GENE 158 194428 - 195627 1229 399 aa, chain - ## HITS:1 COG:FN0512 KEGG:ns NR:ns ## COG: FN0512 COG0426 # Protein_GI_number: 19703847 # Func_class: C Energy production and conversion # Function: Uncharacterized flavoproteins # Organism: Fusobacterium nucleatum # 5 398 5 402 403 359 45.0 5e-99 MEQKTRIKGNVHYVGVNDRNKHLFEGMWPLPYGVSYNSYLIDDEMVALIDTVDICYFEVY LRKIRNIIGDRPINYLIINHMEPDHSGSIRLIKQHYPDIVIVGNKQTFGMIEGFYGVTGE QYLIKDGDFLALGRHKLRFYLTPMVHWPETMMTFDETDGILFSGDGFGCFGTLDGGFVDT RMNIDHYWGEMVRYYSNIVGKYGSPVQKALQKLGGLPISAICSTHGPVWTENITKVVGIY DKLSRYDADEGVVIAYGSMYGNTEQMAEAIAAELSAQGIKNIVMHNVSKSNPSYILADIF RYKGLIIGSPTYSNQIFPEVESLLSKILVRELKGRYLGYFGSFTWAGAAVKRMAEFAEKS KFELVGDPVEMKQAMKEITYQQCENLARAMAGRLKKDRV >gi|226332048|gb|ACIB01000008.1| GENE 159 195745 - 196680 825 311 aa, chain - ## HITS:1 COG:BS_ytqA KEGG:ns NR:ns ## COG: BS_ytqA COG1242 # Protein_GI_number: 16080100 # Func_class: R General function prediction only # Function: Predicted Fe-S oxidoreductase # Organism: Bacillus subtilis # 10 309 16 313 322 237 39.0 2e-62 MTKPAPTPLYNEFTFFLKKYFPYKVQKISLNAGFTCPNRDGTKGLGGCTYCNNQTFNPEY CKTEKSVTRQLEEGKQFFAHKYPDMKYLAYFQAYTNTYAEFEGLKRKYEEALSVDGVVGL VIGTRPDCMPDPLLRYLEELNKHTFLLVEYGIETTRDVTLKRINRGHTYADTVETVNRTA ACGILTGGHVILGLPGETHDEIIAQAAELSRLPLTTLKMHQLQLIRGTKMAREFECRPED FHLFSVDEYIDLVIDYVEHLRPDLILERFVSQSPKELLIAPDWGLKNYEFTARVQKRMKE RGAYQGKAYLV >gi|226332048|gb|ACIB01000008.1| GENE 160 197014 - 197856 848 280 aa, chain - ## HITS:1 COG:no KEGG:BF0902 NR:ns ## KEGG: BF0902 # Name: not_defined # Def: putative transmembrane and transcriptional regulatory protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 280 1 280 280 521 99.0 1e-147 MDIHPIQESSRRWMTALILAVVAAGIQTTLLWGYAGADMLPAAIDGILSVGLLCLLAYLA WYVIGLVSILQTDLLIAALALLFWLAGGFAVQYVLEQNMGQVYAPFGETLPFRILFGALA WGVMMLWYRLQSLNTVQEEILEEAVSREEALREELRQIECREDKALPEEAECIDRITVKD 
GTHIHLIRTDELLYIQACGDYVTLVTPSGQYVKEQTMKYFDAHLPSAGFVRVHRSTIVNV TQISRVELFGKENYQLSLKNGVRLKVSNSGYKLLKERLEL >gi|226332048|gb|ACIB01000008.1| GENE 161 197866 - 198594 546 242 aa, chain - ## HITS:1 COG:no KEGG:BF0979 NR:ns ## KEGG: BF0979 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 242 1 242 242 475 99.0 1e-133 MEKKEFSSPARRYGKFFIAFIFITAGVLLLARNLGWISYTLFGILVSWQMLLILLGIYLM LRRQILRGGILLAIGTYLISPYLGWMPAGVHVTLFPIVLIVIGLAFLFRPKRARHERSHR GNFASSQYNSTDGVLHSENTFSGIRQVVLDEVFKGGTIQNSFGGTVIDLRRTTLPEGETF LDIDCTFGGIEIYVPSDWKVVFRCTTCLGGCQDKRFGGGMIDQNRILVIRGDLTFGGIDI KS >gi|226332048|gb|ACIB01000008.1| GENE 162 199032 - 201242 2017 736 aa, chain + ## HITS:1 COG:CC2154 KEGG:ns NR:ns ## COG: CC2154 COG1506 # Protein_GI_number: 16126393 # Func_class: E Amino acid transport and metabolism # Function: Dipeptidyl aminopeptidases/acylaminoacyl-peptidases # Organism: Caulobacter vibrioides # 153 721 157 719 738 303 33.0 9e-82 MKRKFIFLFFCLCCLAGFAQGGKALDLKEINSGKFNPENIYGVVPMPDGEHYTQRNAEGT QIVKYSFRTGEPVEVVFDVTKARECPFKKFDSYQFSPDGSKILIATETKPIYRHSYTAVH YLYPVKRNDKGVTTNNIVEKLSDGGPQQAPVFSPDGNLVAFVRDNNIFLVKLLYGNSESQ VTEDGKLNSVLNGIPDWVYEEEFGFNRALEFNADNTMLAYVRFDESEVPSYTFPLFAGEA PRYDALQDYPGEYTYKYPKAGYPNSKVSVHTFDIKSKVTRQVKLPIDADGYIPRIRFTQD PNKLAIMTLNRHQNRFDMYFADPRSTVCKLALRDESPYYINENVFDNIQFYPEYFSFVSD KSGYPHLYWYSMNGNLIKQVTSGNYEVKNFIGWNPDTNEFYYTSNEESPMRQAVYKIDRK GKKMKLSNQPGTNSPIFSSSMKYFMNKFTSLDTPILITLNDNTGKVLKTLVTNDKLKQKL AEYAIPQKEFFTFKTTEGVDLNGWMMKPVNFDPAKRYPVLMFQYSGPGSQQVLDKWGISW ETYMASLGYVVACVDGRGTGGRGSEFQKCTYLNLGVKEAKDQVEAAKYLGGLPYVDKGRI GIWGWSFGGYMTIMSMSEGTPVFKAGVAVAAPTDWKYYDTVYTERFMRTPKENAEGYKAA SAFSRADNLHGNLLLVHGMADDNVHFQNCTEYAEHLVQLGKQFDMQVYTNRNHSIYGGNT RNHLYTKLTNFFRNNL >gi|226332048|gb|ACIB01000008.1| GENE 163 201369 - 202235 727 288 aa, chain + ## HITS:1 COG:BH3435 KEGG:ns NR:ns ## COG: BH3435 COG0320 # Protein_GI_number: 15615997 # Func_class: H Coenzyme transport and metabolism # Function: Lipoate synthase # Organism: Bacillus halodurans # 4 283 5 
287 303 288 50.0 1e-77 MGNDKRVRKPEWLKISIGANERYTETKRIVESHCLHTICSSGRCPNMGECWGKGTATFMI AGDICTRSCKFCNTQTGRPLPLDPDEPTHVAESIALMKLSHAVITSVDRDDLPDLGAAHW AQTIREIKRLNPETTTEVLIPDFQGRKELVDQVIKACPEIISHNMETVKRISPQVRSAAN YHTSLEVIRQIAESGITAKSGIMVGLGETPAEVEELMDDLISVGCKILTIGQYLQPTHKH FPVAAYITPEQFAVYKETGLKKGFEQVESAPLVRSSYHAEKHIRFNNK >gi|226332048|gb|ACIB01000008.1| GENE 164 202300 - 203418 774 372 aa, chain + ## HITS:1 COG:no KEGG:BF0975 NR:ns ## KEGG: BF0975 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 372 1 372 372 732 100.0 0 MGNKGVLSSAFNMSLGFIPVIVSILLCEFITQDISIYIGTGIGLIYSYRSLSRKGARIPN FILYISTGILTLLTLASFIPGDFVPEGALPLTLEVSILIPMVILFLHRRKFISHYLRQNA QCNRRLFAQGAESAIVSARVVLILGILHFAVISLTVLVAHPLTRTSILVLYHVLPPTIFI LSILLNQIGIRYFNHVMAHTEYVPIVNTRGDVIGKSLAVEAINYKNAYINPVIRIAVSTH GMLFLCNRPQSCILDKGKVDIPMECYLRYGETLTAGANRLLSNAFPKASDLKPTFTISYH FENEQTNRLVYLFIVDMEDDSILCDPRFKGGKLWTFQQIEHNLGTHFFSECFELEYEHLK QVIGIREKYKVS >gi|226332048|gb|ACIB01000008.1| GENE 165 203384 - 204088 608 234 aa, chain - ## HITS:1 COG:NMA0547 KEGG:ns NR:ns ## COG: NMA0547 COG0313 # Protein_GI_number: 15793541 # Func_class: R General function prediction only # Function: Predicted methyltransferases # Organism: Neisseria meningitidis Z2491 # 1 233 6 239 241 215 46.0 6e-56 MDTALYLLPVTLGDTSIESVLPSYNKEIIQGIKHFIVEDVRSARRFLKKVDREIDIDSLT FYPLNKHTSPEDISGYLKPLAGGLSMGVISEAGCPAVADPGADVVAIAQRKNLKVVPLVG PSSIILSVMGSGFNGQSFAFHGYLPIEPGERAKKIKALEQRVYAEHQTQLFIETPYRNNK MVEDILHNCRPQTRLCIAANITCEGEYIRTKTIKEWQGKVPDLTKIPCIFLLYQ >gi|226332048|gb|ACIB01000008.1| GENE 166 204115 - 205128 782 337 aa, chain - ## HITS:1 COG:no KEGG:BF0895 NR:ns ## KEGG: BF0895 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 337 1 337 337 645 100.0 0 MKIRKDDILLILLSLLFASCRVGKAEPVADPMEEEGISVVRYDKLLDEYVRFNSFSALQK MNLEYALPTKLLIEDVLAIGQVSDDHIFQRLKTFYSDTTLVRLIEDVEAKYPELESVEKN LTKGFGKLQKEIPDIMIPMIYTQISAFNESIVLSDSVLGISLDKYMGEDYPLYKRFYYNY 
QRRTMRPDRIVPDCLVFYLMSQYPFPMDYSRTLLDVMMHYGKINYVVQHLLDYSSSEEAL GYSDLEREWCKENQQQMWRYILEQDHLHATDPMVVRQYTRPAPFTNTLGENAPSMVGTWI GTKIITSYMKHHKKTTLRQLLEMSDYERMFTESRFNP >gi|226332048|gb|ACIB01000008.1| GENE 167 205133 - 205990 907 285 aa, chain - ## HITS:1 COG:BMEI1958 KEGG:ns NR:ns ## COG: BMEI1958 COG0623 # Protein_GI_number: 17988241 # Func_class: I Lipid transport and metabolism # Function: Enoyl-[acyl-carrier-protein] reductase (NADH) # Organism: Brucella melitensis # 5 260 4 257 272 119 32.0 5e-27 MSYNLLKGKRGIIFGALNEQSIAWKVAERAVEEGAVITLSNTPVAVRMGQVSALSEKLNC EVIAADATNVEDLENVFKRSMEVLGGQIDFVLHSIGMSPNVRKKRTYDDLDYNMLNTTLD VSAVSFHKMIQAAKKQNAIAEYGSIVALSYVAAQRTFYGYNDMADAKALLESIARSFGYI YGREHNVRVNTISQSPTFTTAGSGVKGMDKLYDFANRMSPLGNASADECADYCIVMFSDL TRKVTMQNLFHDGGFSSVGMSLRAMATYEKGLDEYKDENGNIIYG >gi|226332048|gb|ACIB01000008.1| GENE 168 206908 - 210174 3405 1088 aa, chain + ## HITS:1 COG:no KEGG:BF0893 NR:ns ## KEGG: BF0893 # Name: not_defined # Def: putative outer membrane receptor protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1088 1 1088 1088 2156 100.0 0 MTKKINLFPSLIRFRETNRLKMAIAASIMLWCMAPQQAVADTYEKHEVASIQQQKVKANG TVVDQTGEPLIGVSVKVKDAPNGTITNLDGKFSIDVAKGATLEISYVGYKTVIVKAESTP MHIVLKEDSEMIDEVVVVGYGSQKKVNVTGAVGMVNSEVLEARPVQNVSQALQGVVPGLN LSVNNGGGSLDSEMSINIRGTGTIGDGSGSSPLVLIDGIEGSLNTVNPNDIESVSVLKDA ASASIYGARAAFGVVLVKTKSGQSGKPRVTYSGNVRFSDATNIPEMLDSYTFAQYFNRAA ANDNGGTVFSKEQLERIKAYQDGTLKSSATFNEQSRRWNYYTGSNANTDWFKEVYEDWVP SMDHNLSISGGTDKTQYIVSGSFLDQKGLIRHGKDTFQRYTLNGRITSNITDWFTLGYST KWTREDYDRPSYLTGLFFHNVARRWPTVPVYDDNGYLTEPSELIQLEDGGRQINQKDLFT QQLQLTFEPIKNWKIYVEGSLRVTANNQHWEVLPVYQHDVDGNPVGMTWDAGVGSYPVGG SKVSEYAYKENYYSTNIYSDYFKQLDNGHYFKAMVGFNAELYKDRSVSADKSTLITPSVP TINTAVGEPSVAGGYRHTSVAGFFARLNWNYKDRYMLEANGRYDGSSRFIGDKRWGFFPS FSGGWNIAREAFFEETANKLKIGTLKLRASWGQLGNTNTNEAWYPFYQTLPQGQNYGWLV NGVRQNYASNPGIVSSEKTWETIETWDAGLDWGLFNNRLTGSFDYFVRYTYDMIATAPEL PSILGTGVPKINNADMKSYGFELEIGWRDRIKNFSYGVKFVLSDAQQKILKYNNPDKSLS NPYYEGQKLGEIWGYKTIGIAQSDEEMNQHLANAKQPMGQKWAAGDIMYADLDNSGSVDQ 
GNYKVGDSGDWQIIGNNTPRFNYGITIDAAWKGLDFRAFVQGIGKRDYWLNGPYFWGFSG GGEWASAGFKEHWDFWRPEGDPLGANTNSYFARVIRGTSKNQQRQTRYLQDASYWRLKNI QIGYTLPKVWTKKAGMESVRVYVSGDNLLTVSDITGVFDPENLGTQWTDPGKVYPLQKVI AIGLTVNF >gi|226332048|gb|ACIB01000008.1| GENE 169 210188 - 212230 1788 680 aa, chain + ## HITS:1 COG:no KEGG:BF0892 NR:ns ## KEGG: BF0892 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 680 1 680 680 1371 99.0 0 MKLKNRIIQLCVLTGGVLLFPSCNDFLDREPLDQVTPESYFQNADHLAAYSISKYQNLFS THSGFSAGTVNNDGATDNMVSGGSSGSGLQNYYTKDVWKTEAANDNWDFSFFRYCNYFFE KVLPKYEAGEISGNADDVKHYIGEMYFIRAWKYFQKLRMYGDYPIITEVLPDNAEILIEK GVRQPRNKVARFILEDLDKAAEYMHDHGFAGNNRLNKQCALLIKSRVALYEATFEKYHQG TGRVPGDANWPGKRVHPDYKYDANTEINFFLDQAMSAAEQVADVIKLTPNSGVFNPANDN DISGWNDYFDMFSAEDMSGFEEVLFWRDYYSGDFTIAHGATAYVASGGNNGMLHNYVQSF LMKDGMPWYAATAAYPFKGDERVMDEKANRDERLQLFLFGEEDMIPAVSNAATNAMKTYQ DANYPNIIVAESEIKDLTGYRIRKCLSYDQKQYVSGQAQSTTGCVIFRAVEAYLNYMEAA CMKNNGNVTGKAAEYWRAVRIRAGVDPDFTKTIAATDLSKENDLAKYKGENEMIDVTLFN IRRERRCEFIGEGMRMDDLIRWRSLDRLLVERFIPEGFNFWGSDAYKRYEGEDAKFEYIE GPDNAMANVSSRKLSNYLRPYSVVQKNNEIYDGYTWAKAHYLYPVPIRQIELLSPSGEIA SSVIYQNPYWPEERNTAAIE >gi|226332048|gb|ACIB01000008.1| GENE 170 212778 - 216047 2952 1089 aa, chain + ## HITS:1 COG:no KEGG:BF0890 NR:ns ## KEGG: BF0890 # Name: not_defined # Def: putative outer membrane receptor protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1089 1 1089 1089 2167 99.0 0 MKEKTNLFPSLIRLRETSRLKMAIAASIMLWCATPQQATADTNEEHAIEAVQQAKVKVKG TVVDETGEPMIGVAIKVLANNTGTITDLEGKFSVEAPLGGAIQISFIGYKTVTVKASSEP ISVTLKEDSQQLDEVVVVGYGSQKKVNVTGSVSMVDSKVIESRPVQNVSQALQGVVPGLN MSVGNSGGALDSSLSINIRGAGTIGEGSSGSPLVLIDGIEGDMNTVNPNDIENISVLKDA ASSSIYGARASFGVIMITTKSGKSGKTRVNYSGNVRFSDAIQIPEMVDSYTFAQYFNRAN TNDGGGLVFDEAALERIKNYQTGKYTDPNTPEYYGAKAGNDGKWQNYTGSFANTDWFKEF YKNWVPSTEHNLNISGGTDKLTYMISGSFLDQKGLLRHGEDQFNRYTMNAKISAKLTDWV TLNYTSKWTREDYDRPTYMTGLFFHNIARRWPTCPVRDPNGHYQQKMEIIEMEDGGKQTS QKNWYTQQLQAIFEPIKDWRIVAEGSMRTYTRKQSWAVLPIYAYDADNQPYLLGWGDNAA 
GYSEVQDSRESEDYFSTNIYTDFAKTFGDHNFKIMVGFNGELYRPSGLTGFGTDLISPEV PSLGLTQDNKKASSWASEKAIAGFFGRLNYNYKERYMLEANLRYDGSSRFIGDKRWGLFP SFSAGWNISREAFFEPLTQVVGTLKLRGSWGQLGNNNTSETNAWYPFYQNMPTGSASSGW LINGKKQNVAGLPGIVSSLMTWETIESWNVGIDWGLFDNRLTGSFDYYNRYTYDMIGPAP TLPSVLGASAPQINNCDMKSYGWELELSWRDRIQQFNYGVRLVMSDNMQKILEYPNKTLS LGEKYYTGKTIGEIWGYKTIGIAQTQEEMDKHLANGGKPNWGSAWGAGDIMYANIDGKDG VNSGANTVNDHGDLKIIGNSTPRYNFGLTLDGSWKGLDFSLFIQGVMKRDYMLDGPYFWG ANGGMWQSCVFKEHLDYWRPEGDPLGANTNAYYPKPYFSSNKNQKTQSGYLQNAAYCRLK NAQIGYTLPKAWTKKAAMESVRVYVSGDNLLTISGISDIFDPETLGGDWGPGKLYPLQRT ISIGLNVNF >gi|226332048|gb|ACIB01000008.1| GENE 171 216063 - 218087 1970 674 aa, chain + ## HITS:1 COG:no KEGG:BF0889 NR:ns ## KEGG: BF0889 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 674 1 674 674 1321 100.0 0 MRLKLKHIYFCSLIAMGGLAITSCEDFLDRSPISQVTPEKYFSTVDQVANYLNNYYNDYL DDSRNYKLYHQQAWNSGMQRNDANTDNLLADDSSLDYFAGNWQVGSGKSIQAPLNRIRTW NYLLEQVLPKEKEGSIQGSVEDLKHYIGEAYFFRAMAYYKALVKYGDYPIVDKVLPDQEE ILLEYSTRAPRNEVARQILKDLDEAINRMHDQGFQNNQRINKQVAQLYKSRVALFEATFE KYHRGTGRVPGDESWPGAKMSYNSGKTFNIDGEIDFFLTEAMNAAAAVADHCTLTENSHV LNPEYGQIYNWNPYYEMFSTPDASGYSEVLLWKQYDKSLNVSHCAPARLQNGDRTGLTRG FITTFLMKSGLPIYAAGNEYHGDVSISDEKENRDERLQLFVWGEKDVLHSDTKNPAVAAA GTTLLFGVPNIISEQKQTQDLTGYRPRKAHTYDYAQTKGDELLGTNACVVFRSAEANLNY MEACYEKTGSLDAKAQKYWKALRTRAGVDDDYAKTIAATDLSKENDLAVYSGSKMVDVTL YNIRRERRCEFIGEGMRWDDLKRWRSWDQLLTKPYIIEGINFWDAAYKDHKDIKDDGTLD ANVSPKSDSKYLRPLRRTSINNELYDGLTWRKAFYLDPIGIEDMSLTATNPEDINTTQLY QNPYWPMTAGKALE >gi|226332048|gb|ACIB01000008.1| GENE 172 218692 - 220215 1469 507 aa, chain - ## HITS:1 COG:FN1444_2 KEGG:ns NR:ns ## COG: FN1444_2 COG0519 # Protein_GI_number: 19704776 # Func_class: F Nucleotide transport and metabolism # Function: GMP synthase, PP-ATPase domain/subunit # Organism: Fusobacterium nucleatum # 193 507 1 318 318 420 62.0 1e-117 MQEKIIILDFGSQTTQLIGRRVRELDTYCEIVPYNKFPKGDETVKGVILSGSPFSVYDES AFKVDLSEIRGKYPILGICYGAQFMAYTNGGKVEPAGTREYGRAHLTSFCKDNVLFKGVR 
EGTQVWMSHGDTITAIPENFKTIASTDKVAIAAYQVEGEQVWGVQFHPEVFHSEDGTQML RNFVVDVCGCKQDWSPASFIESTVAELKAQLGDDKVVLGLSGGVDSSVAAVLLNRAIGKN LTCIFVDHGMLRKNEFKNVMHDYECLGLNVIGVDASEKFFSELEGVTEPERKRKIIGKGF IDVFDEEAHKLKDVKWLAQGTIYPDCIESLSITGTVIKSHHNVGGLPEKMNLKLCEPLRL LFKDEVRRVGRELGMPEHLITRHPFPGPGLAVRILGDITPEKVRILQDADDIFIQGLRDW GLYDQVWQAGVILLPVQSVGVMGDERTYERAVALRAVTSTDAMTADWAHLPYEFLGKVSN DIINKVKGVNRVTYDISSKPPATIEWE >gi|226332048|gb|ACIB01000008.1| GENE 173 220246 - 220686 451 146 aa, chain - ## HITS:1 COG:YPO0238 KEGG:ns NR:ns ## COG: YPO0238 COG1970 # Protein_GI_number: 16120576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Large-conductance mechanosensitive channel # Organism: Yersinia pestis # 5 146 2 135 137 155 59.0 3e-38 MGKSTFLQDFKAFAMKGNVVDMAVGVIIGGAFGKIVSSVVADIIMPPLGLLIGGVNFTDL KWVMKAAEYGADGKETAAAVTLNYGNFLQATFDFLIIAFSIFLFIKLITKLTQKKAEAPA APPAPPAPTKEEILLTEIRDLLKEKQ >gi|226332048|gb|ACIB01000008.1| GENE 174 220826 - 221827 1146 333 aa, chain + ## HITS:1 COG:RSc2749 KEGG:ns NR:ns ## COG: RSc2749 COG0057 # Protein_GI_number: 17547468 # Func_class: G Carbohydrate transport and metabolism # Function: Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase # Organism: Ralstonia solanacearum # 2 331 3 331 332 503 78.0 1e-142 MIKVGINGFGRIGRMVFRAAVKNFGNDIQIVGINDLLDAEYLAYMLKYDSVHGRFEGEVA VEDGALIVNGNKIRLTAEMDPANLKWNEVDADVVVESTGFFLTDETARKHIQAGAKKVIM SAPSKDSTPMFVYGVNHTSYAGQDIISNASCTTNCLAPIAKVLNDKFGIVKGLMTAVHAA TATQKTVDGPSKKDWRGGRGILENIIPSSTGAAKAVGKVLPVLNGKLTGMAFRVPTSDVS VVDLTVVLEKAATMAEINAAMKEASEGELKGILGYTEDAVVSTDFRGCANTSIYDSKAGI SLDSNFAKVVSWYDNEWGYSNKVCEMARVIAAK >gi|226332048|gb|ACIB01000008.1| GENE 175 222051 - 224108 2102 685 aa, chain + ## HITS:1 COG:XF1944 KEGG:ns NR:ns ## COG: XF1944 COG0339 # Protein_GI_number: 15838538 # Func_class: E Amino acid transport and metabolism # Function: Zn-dependent oligopeptidases # Organism: Xylella fastidiosa 9a5c # 9 684 36 716 716 485 40.0 1e-136 MTNIANAQNPFFEKYTTPYGTVPFDKIKNEHYEPAIREGISRQAAEIDAIVNNPEAPTFA 
NTILAYEKSGELLDRVTTVFGNLRSAETNDDLQKIAQEMIPLLSEHSNNISLNQELFERI KVVYGQKDSIELTPEQTKLLENAYNGFIRRGANLQGEAKEKYRELTKNLSKLTLDFSENN LKETNNYQLTLTDEAQLAGLPESAIEAAAETAREKGVNGWVFTLHAPSYIPFMTYADNRD LRRELYMAYNTKCTHDNEYNNLEIVKKIANIHMEIAQLLGYDNYAEYTLKERMAETGDAV YKLLNQLLDAYTPTAHKEYEAVQELARTEQGDAFEVMPWDWSYYSNKLKDRQFNINEEML RPYFELSKVKAGVFGLATKLYGITFHKNPDIPVYHKDVDAYEVLDKDGSFLAVLYTDFHP REGKRSGAWMTEFKGQWREDTGENSRPHVSVVMNFTKPTESKPALLTYDEVETFLHEFGH ALHGMFANSTYQSLSGTNVYWDFVELPSQIMENFGIEKEFLHTFANHYQTGEPLPDELIS RLVDASNFNVAYACLRQVSFGLLDMAWYTRNTPFEGDVKAYERQAWAQAQILPTVSETCM STQFSHIFAGGYSAGYYSYKWAEVLDADAFSLFKQKGIFNEEVANSFRNNILSKGGTEHP MILYKRFRGQEPTIDALLIRNGIKK >gi|226332048|gb|ACIB01000008.1| GENE 176 224115 - 224624 553 169 aa, chain + ## HITS:1 COG:no KEGG:BF0965 NR:ns ## KEGG: BF0965 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 14 169 14 169 169 288 100.0 5e-77 MKYKYLLLLLLMLPFVSGCNDSDDVNGIFTGKVWKLTYITKKNEHKPYDFWGDKDKYEQS IKNYINKEGAYTIKFEGETTDNVISGKFSGTLLSHSYTGTWSANGESNAFSASVKGSEND PLGFSNKFVEGLNRATSYKGNYDNLFIYYKDEGGRELCLVFHVDKDNNK >gi|226332048|gb|ACIB01000008.1| GENE 177 224636 - 225073 514 145 aa, chain + ## HITS:1 COG:AF1764 KEGG:ns NR:ns ## COG: AF1764 COG2131 # Protein_GI_number: 11499353 # Func_class: F Nucleotide transport and metabolism # Function: Deoxycytidylate deaminase # Organism: Archaeoglobus fulgidus # 7 139 2 145 157 117 44.0 8e-27 MDTANSKQSDLDKRYIRMASIWSENSYCQRRKVGALIVKDKMIISDGYNGTPSGFENVCE DDNNVTKPYVLHAEANAITKIARSNNSSDGATMYVTASPCIECAKLIIQAGIKRVVYSEH YRLEDGIELLKRAGIEVVFVDTSEK >gi|226332048|gb|ACIB01000008.1| GENE 178 225151 - 226878 1664 575 aa, chain + ## HITS:1 COG:aq_797 KEGG:ns NR:ns ## COG: aq_797 COG0793 # Protein_GI_number: 15606169 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Aquifex aeolicus # 51 358 43 346 408 210 40.0 7e-54 MSTKNSSRFTPVIIAISVVIGILIGTFYAKHFAGNRLGIINGSSNKLNALLRIVDDQYVD TVNMADLVEKAMPQILAELDPHSTYIPAQNLEEVTSELEGSFSGIGIQFTIQDDTIHVNS 
VIQGGPSEKVGLMAGDRIVMVDDSLFVGKKVTNERAMRTLKGPKGTQVKLGVKRATEKDL LNFTITRGDIPQNTIDAAYMLTDDFGYIQVSKFGRTTHVELLNAIALLNHKNCKGLIIDL RGNTGGYMEAAVRMVNEFLPEGKLIVYTEGRKYPRADEFANGTGSCQKMPVIVLIDEGSA SASEIFTGAIQDNDRGMVVGRRSFGKGLVQQPIDFSDGSAIRLTIARYYTPSGRCIQRPY QNGKDRNYEMDWLTRYEHGEYFSKDSIKLDENLRYSTALGRPVYGGGGIMPDVFVPQDTT GVTSYLTEVLSKGLTIQFTFHYTDNNRDKLKKYEDEESLLNYLRRQGLVEQFIRYADSKG VKRRNILIQKSYKLLEKSIYGNIIYNMLGKEAYIQYLNQSDQTVKKAVELLESGEAFPKA PVSVEPQKEEKKDGKKKTTAQVDSTGEEEILRLYA >gi|226332048|gb|ACIB01000008.1| GENE 179 226808 - 227338 356 176 aa, chain + ## HITS:1 COG:aq_1731 KEGG:ns NR:ns ## COG: aq_1731 COG0212 # Protein_GI_number: 15606807 # Func_class: H Coenzyme transport and metabolism # Function: 5-formyltetrahydrofolate cyclo-ligase # Organism: Aquifex aeolicus # 4 176 3 175 186 130 38.0 1e-30 MERKKQLRKWIAQEKKKYSDSTLKSLSEKVLITLEACPEFQKGHTILLYHSMKDEVQTHA FIEKWSRSKRIILPVVTGDELELRVYTGPQDLAIGSYGIAEPTGAPFTDYETIDLAVIPG VAFDRYGHRLGRGKGYYDRLLPQIPAPKVGICFPFQLIEEVPAEAFDFRMDTIIAQ >gi|226332048|gb|ACIB01000008.1| GENE 180 227371 - 228123 392 250 aa, chain + ## HITS:1 COG:DR0470 KEGG:ns NR:ns ## COG: DR0470 COG1387 # Protein_GI_number: 15805497 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Histidinol phosphatase and related hydrolases of the PHP family # Organism: Deinococcus radiodurans # 2 216 16 227 260 77 28.0 3e-14 MHAVGDDEDYVRSAIKGGFQELGFSDHGPWKYHTDFVSDIRMLPEDLPEYIESIRALKEK YRNQISIKIGLEYEYFPEYIHWLKEIIKEYRLDYILFGNHHYHTDEKFPYFGHHTTNRDM LDLYEESTIEGMESGLFAYLAHPDLFMRSYPEFDKHCISVSSHICRAAARFHIPLEYNIG YVAINEARGITTYPCPQFWHIAANERCTAIIGLDAHNNLDLENPTYYDRACQELNALKMP VIDTIPFLKY >gi|226332048|gb|ACIB01000008.1| GENE 181 228246 - 228536 296 96 aa, chain - ## HITS:1 COG:no KEGG:BF0959 NR:ns ## KEGG: BF0959 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 96 1 96 96 171 100.0 1e-41 MKRNDAEPIGKLIQKYLRQESLESPLNEQRLLDSWETVLGPTIMSYTRDLYIRNQVLYVH LTSAALRQELMMGRELLVRNLNQKVGATVITNIIFR 
>gi|226332048|gb|ACIB01000008.1| GENE 182 228533 - 229645 1041 370 aa, chain - ## HITS:1 COG:BH0004 KEGG:ns NR:ns ## COG: BH0004 COG1195 # Protein_GI_number: 15612567 # Func_class: L Replication, recombination and repair # Function: Recombinational DNA repair ATPase (RecF pathway) # Organism: Bacillus halodurans # 1 364 1 369 371 171 30.0 3e-42 MILKRISILNYKNLEQVELNFSAKLNCFFGQNGMGKTNLLDAVYFLSFCKSAGNPIDSQN IRHEQDFFVIQGFYEAMDGTPEEIYCGMKRRSKKQFKRNKKEYSRLSDHIGFIPLVMVSP ADSELIAGGSDERRRFMDVVISQYDKEYLDALIRYNKALVQRNTLLKSEQPIEEELFLVW EEMMAQAGEVVFRKREAFISEFIPIFQSFYSYISQDKEQVGLTYESHARNASLLEVLKES RVRDKIMGYSLRGIHKDELNMLLGDFPIKREGSQGQNKTYLVALKLAQFDFLKRTGSTVP LLLLDDIFDKLDASRVEQIVKLVAGDNFGQIFITDTNREHLDRILYKVGSDYKMFRVESG AINEMEEKER >gi|226332048|gb|ACIB01000008.1| GENE 183 229806 - 230489 851 227 aa, chain + ## HITS:1 COG:no KEGG:BF0957 NR:ns ## KEGG: BF0957 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 227 1 227 227 348 100.0 6e-95 MAEQKNHNNELNVEDALTQSEAFLVKNKKAIIGAVVAVIVIVAGAILYKNFYAEPREEKA QAALFKSEQYFEQSAYEQALNGDSIGSIGFLKVADQFSGTKAANLAKAYAGICYQNLGKY EEAIKALDGFSGDDQMVAPAIQGAIGNCYAQLGQLDKATSALLKAADHADNSTLSPIFLL QAGEILMKQGKNEEAVKAFTKIKDKYFQSYQAMDIDKYIEQAKLLKK >gi|226332048|gb|ACIB01000008.1| GENE 184 230575 - 231069 474 164 aa, chain + ## HITS:1 COG:BH1557 KEGG:ns NR:ns ## COG: BH1557 COG0054 # Protein_GI_number: 15614120 # Func_class: H Coenzyme transport and metabolism # Function: Riboflavin synthase beta-chain # Organism: Bacillus halodurans # 19 164 11 156 156 140 52.0 9e-34 MATAYHNLSDYDFNSVPNAEEMKFGIVVSEWNANITGALLDGAVKTLKKHGAKEENILVK TVPGSFELTFGANQMMENSDIDAIIIIGCVIKGDTPHFDYVCMGVTQGVAQLNATGDIPV IYGLITTNTMEQAEDRAGGKLGNKGDECAITAIKMIDFVWSLNK >gi|226332048|gb|ACIB01000008.1| GENE 185 231370 - 232620 1224 416 aa, chain + ## HITS:1 COG:SMc01163 KEGG:ns NR:ns ## COG: SMc01163 COG0673 # Protein_GI_number: 15964109 # Func_class: R General function prediction only # Function: Predicted dehydrogenases and related proteins # Organism: 
Sinorhizobium meliloti # 81 190 67 172 376 60 32.0 5e-09 MKTPSQTHVLGLAHPPLPMVRLAFIGLGNRGVLTLQRYLQIEGVEIKALCEIREGNLVKA QKILREAGYPQPDGYTGPDGWKRMCERDDIDLVFICTDWLTHTPMAVYSMEHGKHVAIEV PAAMTVEECWKLVDTAEKTRQHCMMLENCCYDPFALTTLNMAQQGVFGEITHVEGAYIHD LRSIYFADESKGGFHNHWGKKYSIEHTGNPYPTHGLGPVCQILNIHRGDRMNYLVSLSSL QAGMTEYARKNFGADSPEARQKYLLGDMNTTLIQTVKGKSIMIQYNVVTPRPYSRLHTVC GTKGFAQKYPVPSIALEPDAGSPLEGKALEEIMERYKHPFTATFGTEAHRRNLPNEMNYV MDCRLIYCLRNGLPLDMDVYDAAEWSCITELSEQSVLNGSIPVEIPDFTRGAWKEK >gi|226332048|gb|ACIB01000008.1| GENE 186 232712 - 234496 1425 594 aa, chain - ## HITS:1 COG:no KEGG:BF0953 NR:ns ## KEGG: BF0953 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 594 1 594 594 1211 100.0 0 MKTINKLFKGIFFCTLVSVSVSSCDLEVEPPANIAAETYWTSEKDAWYNLNSIYSAAIPG IGIYGDAYSDDVYCQYAHESNAKIFQQDGFSPLYDEGWNFETIRKENLFLQKVGNCEMDE SLRERFKAEVRAMRAWTYLGMTMTFGKVPLITEVLDYNSPNIPRDEVSVIRDFIMKELTE AAAILPEKYAGGYPNEKGRITKYACLSLKARAALYFGDYALAESTAKEVMDKGGFSLFKI SSLSDAQKKEAEEMSLYIDFAEKGIDKDEFVKGMFNYEALWHTENANPDNPEYIMTRQYA ASSWDYQDMTRYTSMRPNQLGGWSSVTPTQNLVDAYWGVDGHSVPQLPTPEERAKAYNQI KADLDAYQKPEGEAKFIAFCQEKIKNGTLKDYKYIQEFRNRDSRMYVSILMPFKSWYESN YGDKFVYEWIKNGNNESKTGFNFRKMLSLENDANGDGQATGDYPCIRYAEILLIYAEAHT QTTGYDAATEAALNQLRDRCGMPDVPSGLSKEEGLKLIQNERRIELAGEGFRGDDMTRYS DDYWKEHMNNVPIMTPDGDTELTMKWSSRMRLKPIPQTAIDLNPLLAGDQNPGY >gi|226332048|gb|ACIB01000008.1| GENE 187 234515 - 237625 2463 1036 aa, chain - ## HITS:1 COG:no KEGG:BF0952 NR:ns ## KEGG: BF0952 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1036 92 1127 1127 2005 100.0 0 DEVFKNSGYTYKIVDNQIVVSTAAAAAKEVQATQQQKQRKISGVVKDAMGEAIIGASVIE KGNPTNGTITNIDGEFTLNTAGKELQVTYIGYIPQAIVLKPGVNSYTVTMKEDTKTLDEV VVVGYGTQKKVNLTGAVSSVGADELKERVNTDVLASVQGQVPGVTIISRPGSTPSINMRG RGNLGTSSPLFVIDGAIADASFFSSLDPNSIESISFLKDAASSAIYGSRAAYGVVLVKTK GGKEGDLKISYDGSVAVKMATYTPDVLGSEWYARLSNEAALNENPNTSTLPYTDKEIQMF RDGSNPDMYPNTNWYDLVLKDEAVMTKHSVSFSGGNKVKYFTSLGYMYDDKFTPGAKSER 
YNLTTNISSDIKSWLTMRSNINYIQSTSDNDKGGVVYTHLLTIPSTYVARQSNGEWGSYE GGKPAATVNMERNPLRRLEEGGWSNSKTQNTLINLALDIKPVKGLVLTGEMIYKAWDYKS KTYTANKSKIKDFQTGTELNGTDVTNSKMEYSWEENSRLTYNALANYVWSNEKHNVNVLA GVSYEHYKYQKQKSYRLKFPTNGMTDMNGGSSAPDDTYAEGGSNEDKLMSYFGRVNYSFM DRYLLEANIRADASSRFHKDNRWGVFPSFSAGWRISQEGFMQDINWINNLKLRASWGQLG NINNVGQYDYFSSYQQGGNYNFEDAIVSGIVESKPANPTLGWETVTITDIGVDFDIFNGL LNFTADYYNKKTDDILLAYPSPKEIGIGSDFKVSQNIGTVSNKGLELSITHNKTLGDFAY TVGFNMSKNWNKVTNLGANDPIIESPWIKKVGYAIGTFYGYRSDGLLTQEDIDTGNYITD GLVPQAGDIKYVDLDGDGKLTDKDRTYIGCDVPDITYGVNLNLRYKGFELSMFGQGVTGT KVNFSMENAWAFSDYASPRKYHLKRWTVDNPNPNAAYPRIYPRTSKHSTYNQYFSDYWLF NADYFRIKNITFGYSFQKPVLQKLSLEALKLYVAAENPFTIRADHRMEDFDPETASGRGV NTRGTSSIAFGVNLTF Prediction of potential genes in microbial genomes Time: Tue May 17 22:10:43 2011 Seq name: gi|226332047|gb|ACIB01000009.1| Bacteroides sp. 3_2_5 cont1.9, whole genome shotgun sequence Length of sequence - 7834 bp Number of predicted genes - 4, with homology - 4 Number of transcription units - 2, operones - 2 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 6/0.000 - CDS 41 - 1027 445 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 1087 - 1146 5.7 - Term 1071 - 1111 -1.0 2 1 Op 2 . - CDS 1153 - 1701 441 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 1774 - 1833 7.6 + Prom 2344 - 2403 7.0 3 2 Op 1 . + CDS 2608 - 5928 2434 ## BF0868 putative TonB-linked outer membrane protein 4 2 Op 2 . 
+ CDS 5942 - 7708 1149 ## BF0867 outer membrane protein + Term 7749 - 7793 9.2 Predicted protein(s) >gi|226332047|gb|ACIB01000009.1| GENE 1 41 - 1027 445 328 aa, chain - ## HITS:1 COG:AGl2289 KEGG:ns NR:ns ## COG: AGl2289 COG3712 # Protein_GI_number: 15891252 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 115 278 119 276 323 73 34.0 4e-13 MKQATTGNKDIIKKLLTDSLSPEEREKLNNYKFVNQAIYSQWEQVSDMYTDVDKEERMLT NVMHQIKKGKTGRFRQSLHRYGWVASIALLLICGTLSLMLLSRKAEPEVWYVLNSGRQSM DSVRLADGTLVMLNAGSRLTYPKEFSGNKREVTLSGQAFFSVHPDKVHPFVVKTKNMDVT ALGTAFEVFSFDGDESVETVLLNGKVKVEPKDHKEQIKGEYILQPNEKLTCQVNGDIRID RVDANSYSAWRIGGRLSFKNETLAMILPRLEKWYGQKIDCPQKTADHYRFTFTLRNEPLD LILNIMSHSAPLNYKLISNDYYVLEELK >gi|226332047|gb|ACIB01000009.1| GENE 2 1153 - 1701 441 182 aa, chain - ## HITS:1 COG:all2193 KEGG:ns NR:ns ## COG: all2193 COG1595 # Protein_GI_number: 17229685 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Nostoc sp. 
PCC 7120 # 1 171 16 190 201 61 25.0 7e-10 MTEKELIVSLKQGDEAAFTALYRMYWPKVHNFSRLYLSSIAEVEEVVQEVFVKLWEARIF LKENESFKGFLFIITRNIIFNQFRKSFNENAYKTTVLSSAEVEYDIENEMDAADLQGYIK KLISELTPRQQEVFHLSREEHLSYKEIAIRLSISEKTVERHINEALKFLRKNIYLFFIFL SL >gi|226332047|gb|ACIB01000009.1| GENE 3 2608 - 5928 2434 1106 aa, chain + ## HITS:1 COG:no KEGG:BF0868 NR:ns ## KEGG: BF0868 # Name: not_defined # Def: putative TonB-linked outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 1106 1 1103 1103 2032 99.0 0 MVFMKKCNQKWFNPQKVKKQIAIAVAISLVCAIPISTFAQVLKFSIKKSNTSIQSVLQEL EKESGYTLFYNDNQVKLDKKISINIEDASIEVALNQIFENSGYSYRIVENQIVIYTTPTT TVQQTVQQKKQQKVTGVVKDIAGDPIIGASIIEKGSSSNGTITNVNGDFSLIVTGNELQV SYIGYIPQTINLKPGVSSYNVIMKEDTKTLDEVVVVGYSTQKKESLTGALQTVKSDKLKD ITTPSVENMLNGKVPGVYVAPGSGQPGSGGAVVIRGQATLSGTTAPLWVIDGVIVGSNAG ALNPSDIETMTILKDAASTAIYGSQGANGVILVTTKNGKAEKMTVNVSAKVGISKLGRGN MEMMDGAELYDYYKSFSNQEAITFSRYNDKLRNCNFDWFDLAAQTGVTQDYNVSLSGGNE KIRSFLSIGVYDEEGAVKGYDYTRYNFRLKTTYKPFEWLSIKPALAGSRRDIEDKQYDVT SMFQRLPWDSPFDEEGNLVPNRYTGWVNSSNSNYLYDLQWNKSNSTNYEFMGNLDFDIRI TDWLNFSSVNNYKYIGYNYSEYTDPRSSSGEGVDGRMREYQTTTVRRYSNHIIRFNKMFG KHSINALAAYEFNDYWAKATDMYGIGFIPGFEVLDVVAKPEKVGGSISEWAVQSLLFNAN YAFDNKYLAQLSFRRDGASNFGDNAKYGNFFSISAGWNINREKWFHASWVDILKLRISYG SVGNRPSSLYPQYDLYSVSSKYNEESGALISQLGNKDLTWEKTYTTGTGIDVAFFDNRLR ASFDWYNKYTSNILYAVPISGLVGVTSMWKNIGEMQNQGFELSIGGDIIRTKDWDWNIEI NLGHNKNKLKKLYKTKNAEGQFVEKPIIISDGTSIAGTAKRVLQPGYPCDTYYLKEWAGV NPENGAPQWYKTVENEDGTLSRQKTSNYSEADQVKCGSSSPDIFGGFSTVLRWKDIDLNA VFGYSVGGQIYNYSRQEYDSDGAYNDRNQMKLQKGWNRWEKPGDIATHPVASYSNTSKSN SSSSRYLEKNDYLKLRSLSIGYNLKLPQYYINNMRIFFTGENLFCVTNYSGVDPEIPASD GSVIGTALPSVYPTVRKFMFGLNLTF >gi|226332047|gb|ACIB01000009.1| GENE 4 5942 - 7708 1149 588 aa, chain + ## HITS:1 COG:no KEGG:BF0867 NR:ns ## KEGG: BF0867 # Name: not_defined # Def: outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 588 1 588 588 1163 100.0 0 MKKIITLLCAVATLTACDIDRLPYGSMSAEQITQDPTSSLESLVNGCYAQLKSWSDPMHR 
LGEYAGDNMAKDKSSTDAFFDFISYSRDADNYRLQSFWDSGYKAIAQASNIIKMIDEGKS KTIDYQLGECYYIRGMMYFYLGRAFGRPYWDKPEGHMGVPIVNGTPDDVNNLNLPDRSTV QDTYEQAIDDLKVAARLMENGETKREGPAYASKEAAWAMLSRIYLFMSGTYEAPNSENAQ LAIDYATRVIESTTSEGGLKYELLSRENFMRYNTFMPENNKESIFVVKIMASEKPDYWNS IGGMYSYAGQQGWGEMYASAKYMDLLNEQGRNDWRPDKKKIVDARANFISPSYITDSDGK YVEVFRFIKNVYNKNNIHTGYTYVQLPISKRGNTVTCKEGETNYTLSLINSSEEKYSINY SDGQTYSGVIDYEIELSSGQPKFYILKCSNEGTASGEAESQLHSPVISRLGEVYLNRAEA YAKKGDYSHAQADLNIIRERSLPGRGYNDLNASNAKVRIEKERQLELAYQAERSYDVFRN CETLTRKYPGVHDAMLEIPATDYRVIYFIPQSAINSYPGTLTQNPTSN Prediction of potential genes in microbial genomes Time: Tue May 17 22:11:01 2011 Seq name: gi|226332046|gb|ACIB01000010.1| Bacteroides sp. 3_2_5 cont1.10, whole genome shotgun sequence Length of sequence - 5485 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 5 - 64 4.4 1 1 Op 1 . + CDS 143 - 3553 2273 ## BF0946 hypothetical protein 2 1 Op 2 . 
+ CDS 3582 - 5342 1550 ## BF0945 hypothetical protein + Term 5371 - 5410 8.5 Predicted protein(s) >gi|226332046|gb|ACIB01000010.1| GENE 1 143 - 3553 2273 1136 aa, chain + ## HITS:1 COG:no KEGG:BF0946 NR:ns ## KEGG: BF0946 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1136 1 1136 1136 2175 100.0 0 MKKCNLRWFSPQKIKRQLAFVLAICLVCIVPVTTYAQILKISMKRTNVSIQNVIRELEQK SGYTFFYNDNQVKLTKKVSVDVTDAPIENVLDQIFNNSGYTYKIIDNQIVVSAIKAEIPK TSSLQQQKSVRITGQVKDTTGEPIIGASVVEKGSASNGTITDVGGNFKLTVSGNEIQITY IGYIPQTVKIHPGVTDYSITMKEDTKTLDEVVVVGYGTQKKVNLTGSVASVSTDEIKDRV QTNVLSAVQGTVPGVTVISRPGQTPSINFRGRGNLGTSSPLYVIDGAIADATFFSNLDPN SIESISFLKDAASSAIYGSRAAYGVVLVTTKQGKSDKMNVSYSGYVGLSNPTYKPEYVNS TQYAELYNEALYNYNPKGGKYQGYTEEEIGYFRDGSKPDLYPNTDWNDLVLDKNVLTTQH SLDFSGGTDKIRYFIGLGYVYKDNMIPGQDSQRYNLNTNLSSDITKWLTVKAGVKYIRND SDRDCGAPSLASFSMVPVTFVAKQSNGDWGTVNGGQTATSNFITGNPLRALSKKDWSKSK SENTMYDLGFDIKPVKGLIISGQGVFKGYEYKSKSYTALQPNAINYFSGEEIAGTGVTKN KMSMDWQSTNTMLYTATARYDWSNDKHAVGALVGTSYEHYKYERLAGSREEFPSDALTDM EAGSTSGAGYTNGAGSSEYKMLSYFARVNYTLMDRYLFEVNMRADASSRFHKDHRWGYFP SFSAGWRMSEESFMKDIEWINNLKIRASYGTLGNINNVGNYDYFQNYSSGNHYNFSDSPV IGIGESKPANETLGWEKVALTDIGLDFDIFNGLLGVTADYYIKNTSDILLGYNVPTETGI TAAPSQNIGKVKNTGFELALNHRNKIGAVNYSIGANIATNKNKITNLGGSDNIIQTSSYI VKYILKKGESIGSFYGFKTDGLYTQADIDAGHYYTLSGVVPNAGDIKFVPQRDIEYKQEI TDEDRTILGKDVPDFTYGVNLSLQYKGFEFSMFGQGISGTKVAFDVYGVHPFYHGQDSPR KYHLKRWTEENPNPHAAYPRIYSASSVHTTYNRNFSDYHLFDSDYFRFKTLSLGYTVPSA TVKNWGLQSLKVYVTGENLFTVRADKKMEDFDPETAGGVIYTLGTKSVAFGVNISF >gi|226332046|gb|ACIB01000010.1| GENE 2 3582 - 5342 1550 586 aa, chain + ## HITS:1 COG:no KEGG:BF0945 NR:ns ## KEGG: BF0945 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 586 1 586 586 1166 100.0 0 MKNNIRKIALGLCLTGALTACDLDVVPPADIAAENFWQTEKDAWYALNTCYATLDGVDIW DELCTDNAHSHKPWEGNFEMVQQNGISTANGYGSYYFGTVRIVNNFIANIDKCAVSEELK TRMKAEARFFRALSYLDLTTKFGKVPVITEVLAYDAPNVKRDEVETVRKFILDELAEIAE ILPDSYNGSYLYETGRITRAGALALRARAALYFGNYAEAEASAGKIISEGHHSLFRVSSL 
TTAQQKEADEMDAYIDYAAKGIDKDKFVKGMFSYESLWHKGNASPANPEYIVTREYMADA NNYDWTRYTYFIPKSFSQYDGYCSYEPMQDLIDAYWDVDGKTMRNDITMEQRKERYAEIW KDFKDMSQSQFIEKVPQTDIMKYDYMKEFRNRDSRLYVSMMFPFKGWHETIKGTFYFRWD PDLINKDGNESWTGYFYRKMVTLDPYDTWTAEEDYPVIRYAEVLLTYAEARIQNSGWDTE VQKALNDLRDRCGMPDVPTTMPSKEEALAFVRNERRIELAAEGHRFDDIRRYGNDYCSKA MNGPSYAPNGYVVINKVWDNRLMLMPIPQGAIDLNPLLKDDQNPGY Prediction of potential genes in microbial genomes Time: Tue May 17 22:12:08 2011 Seq name: gi|226332045|gb|ACIB01000011.1| Bacteroides sp. 3_2_5 cont1.11, whole genome shotgun sequence Length of sequence - 182672 bp Number of predicted genes - 133, with homology - 132 Number of transcription units - 56, operones - 32 average op.length - 3.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 2 - 3028 2339 ## BF0944 hypothetical protein 2 1 Op 2 . + CDS 3050 - 4681 1138 ## BF0863 putative outer membrane protein + Term 4697 - 4749 5.7 + Prom 5211 - 5270 4.7 3 2 Op 1 . + CDS 5341 - 6243 559 ## BF0862 putative transmembrane protein 4 2 Op 2 . + CDS 6255 - 7691 665 ## BF0861 putative cytochrome c binding protein + Term 7714 - 7758 2.4 + Prom 7703 - 7762 2.2 5 3 Tu 1 . + CDS 7798 - 10449 2195 ## COG1472 Beta-glucosidase-related glycosidases 6 4 Tu 1 . - CDS 10604 - 11500 511 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 11532 - 11591 5.3 + Prom 11542 - 11601 3.4 7 5 Op 1 . + CDS 11638 - 13017 1244 ## COG0477 Permeases of the major facilitator superfamily 8 5 Op 2 . + CDS 13031 - 16333 2566 ## BF0857 TPR domain-containing protein 9 5 Op 3 . + CDS 16330 - 19101 2105 ## COG3250 Beta-galactosidase/beta-glucuronidase + Term 19129 - 19174 5.8 + Prom 19124 - 19183 4.6 10 6 Tu 1 . + CDS 19225 - 21492 1824 ## BF0855 hypothetical protein + Term 21544 - 21588 10.5 + Prom 22391 - 22450 7.3 11 7 Tu 1 . + CDS 22598 - 23992 1401 ## COG0673 Predicted dehydrogenases and related proteins 12 8 Op 1 . + CDS 24104 - 24994 699 ## COG1284 Uncharacterized conserved protein 13 8 Op 2 . 
+ CDS 25020 - 28355 3089 ## COG3250 Beta-galactosidase/beta-glucuronidase 14 8 Op 3 . + CDS 28377 - 29462 1317 ## BF0928 hypothetical protein + Term 29492 - 29540 11.5 - Term 29406 - 29440 1.1 15 9 Op 1 . - CDS 29443 - 30096 465 ## BF0849 hypothetical protein 16 9 Op 2 . - CDS 30124 - 31023 764 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily 17 9 Op 3 . - CDS 31055 - 31813 674 ## COG0101 Pseudouridylate synthase 18 9 Op 4 . - CDS 31894 - 33048 1052 ## COG1853 Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 19 9 Op 5 . - CDS 33080 - 34867 1582 ## BF0923 hypothetical protein - Prom 34935 - 34994 5.5 + Prom 34926 - 34985 9.1 20 10 Op 1 . + CDS 35022 - 35519 433 ## BF0922 hypothetical protein 21 10 Op 2 . + CDS 35544 - 37169 2048 ## COG2268 Uncharacterized protein conserved in bacteria + Term 37185 - 37236 7.5 - Term 37176 - 37220 7.2 22 11 Tu 1 . - CDS 37253 - 37933 652 ## BF0920 hypothetical protein - Prom 37958 - 38017 3.6 + Prom 37827 - 37886 6.3 23 12 Op 1 . + CDS 38098 - 39111 975 ## COG1702 Phosphate starvation-inducible protein PhoH, predicted ATPase 24 12 Op 2 . + CDS 39131 - 40075 1187 ## COG0152 Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase + Prom 40118 - 40177 2.8 25 13 Op 1 . + CDS 40197 - 40934 569 ## PROTEIN SUPPORTED gi|163754278|ref|ZP_02161401.1| 30S ribosomal protein S15 26 13 Op 2 . + CDS 40961 - 41704 716 ## COG0169 Shikimate 5-dehydrogenase 27 13 Op 3 . + CDS 41705 - 42655 593 ## COG1073 Hydrolases of the alpha/beta superfamily 28 13 Op 4 8/0.000 + CDS 42657 - 43577 666 ## COG1512 Beta-propeller domains of methanol dehydrogenase type 29 13 Op 5 . + CDS 43672 - 44253 731 ## COG1704 Uncharacterized conserved protein + Prom 44537 - 44596 5.5 30 14 Op 1 . + CDS 44839 - 46005 1107 ## COG0150 Phosphoribosylaminoimidazole (AIR) synthetase 31 14 Op 2 . + CDS 46069 - 47181 1291 ## COG0216 Protein chain release factor A 32 14 Op 3 . 
+ CDS 47230 - 48054 995 ## COG0284 Orotidine-5'-phosphate decarboxylase 33 14 Op 4 . + CDS 48054 - 49283 1132 ## COG1078 HD superfamily phosphohydrolases + Prom 49285 - 49344 2.4 34 15 Op 1 . + CDS 49366 - 50406 1046 ## COG1044 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 35 15 Op 2 . + CDS 50410 - 51795 1429 ## COG0774 UDP-3-O-acyl-N-acetylglucosamine deacetylase 36 15 Op 3 . + CDS 51806 - 52573 703 ## COG1043 Acyl-[acyl carrier protein]--UDP-N-acetylglucosamine O-acyltransferase + Term 52595 - 52635 8.3 + Prom 52628 - 52687 3.7 37 16 Op 1 . + CDS 52707 - 53270 794 ## BF0904 hypothetical protein + Term 53283 - 53320 0.8 + Prom 53280 - 53339 2.6 38 16 Op 2 . + CDS 53367 - 54272 843 ## COG0324 tRNA delta(2)-isopentenylpyrophosphate transferase 39 17 Tu 1 . - CDS 54412 - 54567 56 ## BF0902 hypothetical protein - Prom 54606 - 54665 7.0 40 18 Tu 1 . + CDS 54670 - 55749 621 ## BF0824 hypothetical protein 41 19 Op 1 . - CDS 55845 - 57887 1740 ## COG4585 Signal transduction histidine kinase 42 19 Op 2 . - CDS 57938 - 58111 77 ## BF0822 hypothetical protein 43 19 Op 3 . - CDS 58129 - 59688 1036 ## BF0821 hypothetical protein 44 19 Op 4 . - CDS 59707 - 60039 403 ## BF0820 putative regulatory protein - Prom 60212 - 60271 4.8 45 20 Tu 1 . + CDS 60507 - 60896 113 ## BF0818 hypothetical protein + Term 60971 - 61025 9.3 + Prom 61348 - 61407 4.4 46 21 Tu 1 . + CDS 61641 - 61865 116 ## BF1426 hypothetical protein + Prom 61902 - 61961 3.6 47 22 Op 1 . + CDS 61991 - 62977 648 ## BF0893 hypothetical protein 48 22 Op 2 . + CDS 63007 - 65502 1917 ## COG0787 Alanine racemase + Prom 65549 - 65608 6.9 49 23 Op 1 . + CDS 65664 - 65891 333 ## BF0891 putative sec-independent protein translocase 50 23 Op 2 . + CDS 65968 - 66762 661 ## COG0805 Sec-independent protein secretion pathway component TatC + Prom 66881 - 66940 6.3 51 24 Tu 1 . + CDS 66970 - 68010 398 ## COG1835 Predicted acyltransferases 52 25 Tu 1 . 
+ CDS 68121 - 71567 2282 ## COG1112 Superfamily I DNA and RNA helicases and helicase subunits + Term 71692 - 71721 1.4 - Term 71756 - 71803 5.1 53 26 Op 1 . - CDS 71887 - 73134 1219 ## BF0887 hypothetical protein 54 26 Op 2 . - CDS 73153 - 74502 1494 ## COG3669 Alpha-L-fucosidase - Prom 74526 - 74585 4.4 + Prom 74827 - 74886 7.7 55 27 Tu 1 . + CDS 74946 - 76391 1148 ## COG0477 Permeases of the major facilitator superfamily - Term 76446 - 76496 5.4 56 28 Tu 1 . - CDS 76558 - 80643 2174 ## COG0642 Signal transduction histidine kinase - Prom 80671 - 80730 6.8 + Prom 80732 - 80791 1.6 57 29 Op 1 . + CDS 80829 - 84014 2180 ## BF0883 hypothetical protein 58 29 Op 2 . + CDS 84027 - 86009 1116 ## BF0806 hypothetical protein + Term 86043 - 86085 6.1 + Prom 86047 - 86106 6.3 59 30 Op 1 . + CDS 86128 - 87579 1112 ## COG3119 Arylsulfatase A and related enzymes 60 30 Op 2 . + CDS 87582 - 88964 1168 ## COG3669 Alpha-L-fucosidase 61 30 Op 3 . + CDS 88974 - 91049 1257 ## BF0803 alpha-galactosidase 62 30 Op 4 . + CDS 91070 - 92626 1579 ## COG3119 Arylsulfatase A and related enzymes + Term 92651 - 92710 2.3 63 31 Tu 1 . - CDS 92706 - 93353 574 ## COG0546 Predicted phosphatases - Prom 93484 - 93543 6.0 + Prom 93384 - 93443 7.3 64 32 Op 1 . + CDS 93559 - 93855 279 ## BF0876 hypothetical protein 65 32 Op 2 . + CDS 93852 - 95195 1175 ## COG1073 Hydrolases of the alpha/beta superfamily + Term 95425 - 95467 5.1 66 33 Op 1 . + CDS 95687 - 97633 1836 ## COG1154 Deoxyxylulose-5-phosphate synthase 67 33 Op 2 17/0.000 + CDS 97672 - 99012 1314 ## COG0569 K+ transport systems, NAD-binding component 68 33 Op 3 . + CDS 99017 - 100468 1168 ## COG0168 Trk-type K+ transport systems, membrane components + Prom 100567 - 100626 6.3 69 34 Tu 1 . 
+ CDS 100688 - 101032 174 ## BF0870 hypothetical protein + Term 101101 - 101149 1.2 70 35 Op 1 30/0.000 + CDS 101176 - 101526 218 ## PROTEIN SUPPORTED gi|154175415|ref|YP_001407462.1| NADH dehydrogenase subunit A 71 35 Op 2 9/0.000 + CDS 101517 - 102107 428 ## PROTEIN SUPPORTED gi|154175216|ref|YP_001407461.1| NADH dehydrogenase subunit B 72 35 Op 3 8/0.000 + CDS 102129 - 103721 1568 ## COG0649 NADH:ubiquinone oxidoreductase 49 kD subunit 7 73 35 Op 4 31/0.000 + CDS 103801 - 104877 1167 ## COG1005 NADH:ubiquinone oxidoreductase subunit 1 (chain H) 74 35 Op 5 28/0.000 + CDS 104894 - 105373 429 ## COG1143 Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I) 75 35 Op 6 30/0.000 + CDS 105380 - 105892 662 ## COG0839 NADH:ubiquinone oxidoreductase subunit 6 (chain J) 76 35 Op 7 26/0.000 + CDS 105899 - 106210 419 ## COG0713 NADH:ubiquinone oxidoreductase subunit 11 or 4L (chain K) 77 35 Op 8 30/0.000 + CDS 106247 - 108157 1882 ## COG1009 NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 78 35 Op 9 22/0.000 + CDS 108170 - 109654 1312 ## COG1008 NADH:ubiquinone oxidoreductase subunit 4 (chain M) 79 35 Op 10 . + CDS 109661 - 111112 1494 ## COG1007 NADH:ubiquinone oxidoreductase subunit 2 (chain N) + Term 111137 - 111201 20.7 - Term 111125 - 111189 4.6 80 36 Tu 1 . - CDS 111209 - 113998 2599 ## COG0642 Signal transduction histidine kinase - Prom 114130 - 114189 2.0 + Prom 113945 - 114004 1.8 81 37 Op 1 . + CDS 114236 - 116935 2697 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 82 37 Op 2 . + CDS 117004 - 118278 1491 ## BF0856 hypothetical protein + Term 118300 - 118348 4.7 83 38 Tu 1 . + CDS 118351 - 119499 768 ## BF0855 hypothetical protein + Term 119525 - 119552 -0.8 - Term 120027 - 120069 3.3 84 39 Op 1 . - CDS 120104 - 121234 1047 ## COG0251 Putative translation initiation inhibitor, yjgF family 85 39 Op 2 . 
- CDS 121238 - 122671 949 ## BF0851 hypothetical protein 86 39 Op 3 . - CDS 122700 - 124463 1071 ## BF0850 hypothetical protein 87 39 Op 4 . - CDS 124533 - 125711 1210 ## COG2942 N-acyl-D-glucosamine 2-epimerase 88 39 Op 5 . - CDS 125724 - 127112 614 ## PROTEIN SUPPORTED gi|90020673|ref|YP_526500.1| ribosomal protein L9 89 39 Op 6 . - CDS 127129 - 128301 1091 ## COG2152 Predicted glycosylase 90 39 Op 7 . - CDS 128338 - 129462 1005 ## COG4124 Beta-mannanase - Prom 129487 - 129546 1.9 - Term 129510 - 129546 -0.8 91 40 Op 1 . - CDS 129554 - 130558 791 ## COG0407 Uroporphyrinogen-III decarboxylase 92 40 Op 2 . - CDS 130565 - 131269 319 ## BF0769 hypothetical protein 93 40 Op 3 . - CDS 131272 - 131910 756 ## COG5012 Predicted cobalamin binding protein - Prom 131972 - 132031 4.7 - Term 131930 - 131968 1.1 94 41 Op 1 . - CDS 132103 - 133356 1112 ## BF0842 hypothetical protein 95 41 Op 2 . - CDS 133362 - 135806 2001 ## COG1472 Beta-glucosidase-related glycosidases 96 41 Op 3 . - CDS 135803 - 137110 897 ## COG3934 Endo-beta-mannanase 97 42 Op 1 . - CDS 137228 - 138253 743 ## COG4124 Beta-mannanase - Prom 138279 - 138338 2.8 98 42 Op 2 . - CDS 138343 - 141558 1834 ## BF0763 putative secreted glucosidase 99 42 Op 3 . - CDS 141555 - 143342 1266 ## BF0762 hypothetical protein - Prom 143404 - 143463 3.0 - Term 143372 - 143411 4.2 100 43 Op 1 . - CDS 143470 - 144576 947 ## BF0761 putative lipoprotein 101 43 Op 2 . - CDS 144612 - 146279 1591 ## BF0834 hypothetical protein 102 43 Op 3 . - CDS 146305 - 149442 2552 ## BF0759 putative outer membrane protein - Prom 149462 - 149521 7.0 - Term 149600 - 149643 1.0 103 44 Tu 1 . - CDS 149822 - 151990 1549 ## COG1472 Beta-glucosidase-related glycosidases - Prom 152063 - 152122 4.8 + Prom 151943 - 152002 4.5 104 45 Tu 1 . + CDS 152170 - 153171 692 ## COG0667 Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 105 46 Tu 1 . 
- CDS 153242 - 154789 1183 ## BF0756 hypothetical protein - Prom 154813 - 154872 3.3 - Term 154854 - 154907 14.0 106 47 Tu 1 . - CDS 154937 - 155029 121 ## - Term 155441 - 155491 15.0 107 48 Op 1 . - CDS 155604 - 156191 617 ## BF3910 putative phage-related protein 108 48 Op 2 9/0.000 - CDS 156213 - 157343 872 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 109 48 Op 3 3/0.250 - CDS 157357 - 157941 530 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 110 48 Op 4 12/0.000 - CDS 157954 - 158562 371 ## COG2148 Sugar transferases involved in lipopolysaccharide synthesis 111 48 Op 5 3/0.250 - CDS 158565 - 159773 533 ## COG0438 Glycosyltransferase 112 48 Op 6 3/0.250 - CDS 159784 - 160968 673 ## COG0381 UDP-N-acetylglucosamine 2-epimerase 113 48 Op 7 3/0.250 - CDS 160976 - 162127 778 ## COG0451 Nucleoside-diphosphate-sugar epimerases 114 48 Op 8 4/0.250 - CDS 162172 - 163191 767 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases 115 48 Op 9 25/0.000 - CDS 163223 - 164077 473 ## COG0438 Glycosyltransferase - Prom 164121 - 164180 3.5 - Term 164197 - 164233 -0.1 116 48 Op 10 1/0.500 - CDS 164320 - 165393 248 ## COG0438 Glycosyltransferase 117 48 Op 11 8/0.000 - CDS 165405 - 166721 852 ## COG1004 Predicted UDP-glucose 6-dehydrogenase 118 48 Op 12 . - CDS 166728 - 166934 147 ## COG0451 Nucleoside-diphosphate-sugar epimerases - Prom 167135 - 167194 6.0 - Term 167080 - 167115 -0.5 119 49 Tu 1 . - CDS 167196 - 168050 306 ## CPF_0915 putative polysaccharide polymerase protein - Prom 168117 - 168176 14.4 120 50 Op 1 . - CDS 168250 - 168798 238 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 121 50 Op 2 . - CDS 168795 - 169703 362 ## BF3916 hypothetical protein - Prom 169762 - 169821 3.9 - Term 169807 - 169852 -0.1 122 51 Op 1 . - CDS 169858 - 170229 98 ## CA2559_13018 hypothetical protein 123 51 Op 2 . 
- CDS 170232 - 171242 381 ## Avi_3137 hypothetical protein 124 51 Op 3 1/0.500 - CDS 171326 - 172456 894 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis - Prom 172503 - 172562 3.4 - Term 172715 - 172742 -0.8 125 52 Tu 1 . - CDS 172922 - 173482 393 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis - Prom 173660 - 173719 7.8 126 53 Op 1 2/0.250 - CDS 174253 - 175680 296 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid 127 53 Op 2 . - CDS 175687 - 176274 367 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes 128 53 Op 3 . - CDS 176274 - 176630 375 ## gi|253563861|ref|ZP_04841318.1| predicted protein - Prom 176790 - 176849 4.3 129 54 Op 1 8/0.000 - CDS 177262 - 178320 837 ## COG0451 Nucleoside-diphosphate-sugar epimerases 130 54 Op 2 . - CDS 178325 - 179662 1120 ## COG1004 Predicted UDP-glucose 6-dehydrogenase - Prom 179786 - 179845 5.3 131 55 Tu 1 . - CDS 180356 - 181246 714 ## COG1209 dTDP-glucose pyrophosphorylase - Prom 181268 - 181327 3.0 132 56 Op 1 . - CDS 181411 - 181899 454 ## BF0804 hypothetical protein 133 56 Op 2 . 
- CDS 181919 - 182455 528 ## BF0803 putative transcriptional regulator UpxY-like protein - Prom 182607 - 182666 4.6 Predicted protein(s) >gi|226332045|gb|ACIB01000011.1| GENE 1 2 - 3028 2339 1008 aa, chain + ## HITS:1 COG:no KEGG:BF0944 NR:ns ## KEGG: BF0944 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1008 92 1105 1105 1412 71.0 0 NEVFKNSGYTYKIVDNQIVVSTAAAAAKEVQATQQQKQRNISGVVKDAMGEAIIGASVIE KGNPTNGTITDINGKFSLNVGGNELQITYIGYMPETVSLKTGVASYNIIMKEDTKTLDEV VVVGYGVQKKANLTGSVASISADALESRSVASVSAALAGQIPGVTSIQTSGAPGSQTGSI TIRGKNSINAASPLVIVDGVPGTMDTIDPSDIETLTVLKDAASSAIYGVQAANGVIVITT KQGKKGEKTHINYSGIVSWASPVVKRQYVNAYEHAILYNEAVHNENPNAVLPFTDEDIEN YRNGTYPSTNWYDEAFKKSAFEQMHNLSISGGSEKTTYNASIGYTNQGGLTDEISYKRYN ARMSLNSDINKYVSVGVNASGYRGIKEDGWMGYVTVAQGVSRSYPTDPVYAEDGSFNYSG KDNPVAIQDQSGFTRSTAQQLNATAYAQINILPELSVKGVFSLRHDYTNQEGFKKHFTYG NFDSGLREGYDEYYNHNRYTSQVLANYNKTFGKHSIAVLGGFESFEHIYKFTKASRKGGG NNELTESLNTLDASSQKNEDGGYEMSRLSYFGRVQYDFMNKYLFEANLRADASSRFPKDN RWGVFPAISAGWRISEETFIKDNVSWISNLKLRLGWGKTGNEELDPDDIYPAIPTYAYEK YMFGNSLYSTAYESRYVNNNLQWATVTNYELGLEAGFLNNMFGFELSVYKKKTNDMLLYM PIQGVIGMGAPAQNAGSVENTGFDLNLLHNNRINKDWSYAVNLNIAYVKNEIIDMNGTEG ANSKNDKLWYIEGNPIGSYYGYVANGYFNTDDELANYPKRTGKEQLGDIKYLDLNGDGKI TADGDRQIIGKNFPSWTTGLNLTLYYKDFDFSAMFQGAFDVDGYYMAEAAYAFYNGANAL KRHLDRWTPEHHNASYPRITKDSQTNFTTSSFWLQNASYVRLKTISLGYNLPNSFLSKLG VQKAKLYVAGENLLTFSDLEGIDPEEGNERGWSYGNVKKVSIGLKVSF >gi|226332045|gb|ACIB01000011.1| GENE 2 3050 - 4681 1138 543 aa, chain + ## HITS:1 COG:no KEGG:BF0863 NR:ns ## KEGG: BF0863 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 543 1 545 545 811 74.0 0 MKNKILYIIASVMLTGFTSCSDFLDRYPTEELSDGSFWKTKTDAEMAVSDIYRCLPNWDI DEDINSDNAVHGIKWANGNVSKGVYDPADQGWSEDYGYIRRCNLVLQKLEEMELSQSDKE PIAGQAYFFRGYIYFELIRKFGDVPYIDQPLKLTDVEDITRTSKDEIYTKIMADFDKAFS YLPEEWPATQWGRITKGAAMAMKARAALYFGNWETAATAAKSVMDLNKYDLYDKENTGKY QELFWEKTDGCEEIILAVQYNAPDKTNYLIGWECFPTKGWGGLNPTQSLVDAFEDSEGAP 
ISKSKIYSEKNPFANRDPRLEVNVLHDGEEMYGVTIKVAPLKSSGSTGIAQHGDATATGY YQQKWLDPSIDPQSAGWEMGKDWVTIRYAEVLLTYAEAKNEVSPLDDSAFEAVNQVRRRV GMPELQKTDATKPTYCATQDDLRQRIRNEWRVEFALEGGKRQWDIRRWGIAKDVLNAPFL GIKYKMVDDAVNADPKDGGKVCVLYEGDNVKLAGSRYEDHNYIYPIPQSEIDLNPKLTQN PGY >gi|226332045|gb|ACIB01000011.1| GENE 3 5341 - 6243 559 300 aa, chain + ## HITS:1 COG:no KEGG:BF0862 NR:ns ## KEGG: BF0862 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 300 210 509 509 598 99.0 1e-170 MIMAEKSVKEKNWENVLTQTEKYINSGRTNQLISYFHNLALYHTGKLPYQLFDYPQKLGV KALYFPWNSDSRESEYGHFIYEDLGYINEAQRWEFEAMVVWGETAPHLLNLARYNIVNKR PEVARRFINLLKQSLFYRKDAEELEKQLYAGSVPGLRMALENNKEHPARFANVINIGPEL QYLCEQDTTNRMAFEYLMSDLLLSNNVVRFVDNLKFIRHFKYPEMPPAYQEALYIYKLGV DGETFSKSGFNVSENTEKRFQRYYSLYKNRQMQRLKAEFGNTYWYYLNFISPYGDKIIRN >gi|226332045|gb|ACIB01000011.1| GENE 4 6255 - 7691 665 478 aa, chain + ## HITS:1 COG:no KEGG:BF0861 NR:ns ## KEGG: BF0861 # Name: not_defined # Def: putative cytochrome c binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 478 1 478 478 999 98.0 0 MKTIYILLITVLSWSLQACTAQCGKPDTCTDSIPCIYPDYAGVTFPLNIAPPNFRIGENA DAFQTEIGTGETADILCTSKSPEVIIPTKKWKKLLQKAAGKEIFIRITLLRGGKWTRYAD IKDTISNEPIDEYLVYRLLYPGYELWNEMGIYQRDLTGYEETPIAENRNFGKQCINCHTF NQNSPETMMVHVRGKSGGTLVCKNGKVEKVNTKPEGFKNGGTYAAWHPSGRYIAFSMNEI QQFFHSSGQKPIEVSDLAADLMVYDTEKKTFLTDSLICGERYMETFPNWTPDGKTLYFCR GNAYKEKMPLDSIRYDLCRIGFDPESGKFGTPECVYRASEEGKSVSFPRVSPDGKYLMFT LSDYGNFSIWHPESELCLLTMDTGEIRLLNEVNSNDVESFHTWSSSGRWFVFSSKRLDGL WARPFFASFDPETGKAGKPFLMPQKDPDFYDTFTKTYNLPELIKQPVRNGNEMIETIH >gi|226332045|gb|ACIB01000011.1| GENE 5 7798 - 10449 2195 883 aa, chain + ## HITS:1 COG:SPBC1683.04 KEGG:ns NR:ns ## COG: SPBC1683.04 COG1472 # Protein_GI_number: 19111852 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Schizosaccharomyces pombe # 36 883 3 823 832 379 31.0 1e-104 MKNIFLTMSLGIGLLFPCKLHAQSQYPFQNTTLSTEERVDDLIKRMTLEEKIDLLSGYND 
FYLHPCERLGIPAFKLADGPLGVASWGLFGRATAFPSALSLAASWNKNLAEKTGAMYAQE WRARGIHFLLAPGVNNYRASKGARNFEYFGEDPYLASEMVVPFIKAVQDGGVIATIKHFA ANDQEFDRYTVSTEVSERALQEIYLPPFKAAVQKAGVKAVMTGYNLVNGVYCTENKHLID ILKKDWGFKGMLMSDWACTYSAENAANYGLDLEMGSNDWFTRKELLPLVKEGKVTEEVIN DKVRRIYGACISMGFFDRPQQDTDIPTFNPQANQMALNTACEGIILLKNEQNTLPIHRPK VIAVIGPTANPAIVSDRIYNVNSIVYGGGGSSKVHPWYVVSALEGIRQEFPEATVLYTEG ISNQFKPRLFRNSKFRTKEGKPGLEASYYALSSDTSATLSDKMIQQQAVAAGRTVSVNQS ADRTIETDKEESGLILRRTDRTVNYEWWGYPFNESKLGNDYRVCWEGYVDVEKTDSIRFF VDAQGAYRLWIDGTLALDASQSQSFDVRNTAISAKKGDAKHIRLEFCNQRSTPAEIRMGY AYQSDIDFSEAKRLAAKADLVVFCAGLDGSIELEGRDRPFDLPYGQDMLIQELVKVNPKL IVAIHAGGGINMTRWIDQVPAVVHALYPGQEGGHALAHILSGKVNPSAKLPFTIEKRWED SPACGHYDETRKEKKVYYTEGIFTGYRGYDQKGIEPLFPFGFGLSYTTFDYSGLNIRMTD KKQKQLVVSFTVTNTGQRDGYEVAQLYVRDMQSKEPRPLKELKGFDKVYLKAGESKQIEI GLSEDAFQYFNAKQNRWVFEKGEFEILVGASSKDIRLAEKIKM >gi|226332045|gb|ACIB01000011.1| GENE 6 10604 - 11500 511 298 aa, chain - ## HITS:1 COG:PA3571 KEGG:ns NR:ns ## COG: PA3571 COG2207 # Protein_GI_number: 15598767 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 14 298 26 303 307 150 33.0 2e-36 MAKQKDGFLGEQALVLPPAIVQRMKTDPATSILYITDIGYYPKAYNHFRERETPIDQYVF IYCTEGRGWFSLDGQKHPVVPNQYFILPAGLPHAYGADEKEPWTIYWIHFGGTLAPLYCT HRTCRLTDIKPGMHSRISYRTELFEEIFRVLKMGYSLENLSYASSVFHHYLGSLRYLREY REAVSEHRPAGEEDPVNAAIHYMKENLGKKLTLAELADYTGYSSSYFSNLFLKRTGYAPL SYFNQIKIQKACQFLDFTDMKVNQVCYRVGIEDAYYFSRLFSQIMGMSPREYKKVKKG >gi|226332045|gb|ACIB01000011.1| GENE 7 11638 - 13017 1244 459 aa, chain + ## HITS:1 COG:ECs5014 KEGG:ns NR:ns ## COG: ECs5014 COG0477 # Protein_GI_number: 15834268 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli O157:H7 # 7 454 9 475 491 299 38.0 8e-81 MKQTRGYLLLICIVSAMGGLLFGYDWVVIGGAKIFYEPFFGIENSAALRGWAMSSALIGC 
LAGALLSGIWSDKYGRKKMLVIASFLFALSAWGTGAVDHFSYFIFYRIVGGLGIGIASNI SPVYIAEVSPAHVRGKFVSLNQLTIVLGILFAQLANWQIGEYYTQGSDILSETSVQWAWR WMFWAELIPAGIFFLLSFIIPESPRWLATVHQQEKAQKTLTRIGGETYARQTLEELNQLT QSQGNRQNNEWKSVFRPEMRKVLIIGIVLAIFQQWCGINVIFNYAHEIFSSAGYAVSDVL MNIVVTGITNVIFTFVAIYTVDKWGRRTLMLIGSAGLALIYLILGTCYFLDVNGLPMLLL VVLAIACYAMSLAPVVWVVLSEIFPVKIRGMAIAISTFFLWVACFILTYTFPVLNESIGA EGTFWLYGGICLAGFLFIRQNLPETKGKTLEEIEKELIK >gi|226332045|gb|ACIB01000011.1| GENE 8 13031 - 16333 2566 1100 aa, chain + ## HITS:1 COG:no KEGG:BF0857 NR:ns ## KEGG: BF0857 # Name: not_defined # Def: TPR domain-containing protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1100 1 1100 1100 2278 99.0 0 MKESVNVWEEDILLPTYGIGRPEKNPMFLEKRVYQGSSGVVYPYPVIEKIEDTCEEKSYH AVWLENEYIKVMILPELGGRIQMAFDKVKQRHFIYYNHVIKPALVGLTGPWISGGIEFNW PQHHRPSTFLPVDYSIERCKDGSVVVWVSERERMFGQKGTAGFTLRPGRAVLEIQGKVSN PTPLPQTFLWWANPAVAVNDCYQSVFPSDVNAVFDHGKRDVSRYPIATGTYYKMDYSAGV DISRYKNIPVPTSYMAIRSNYNFVGGYENDTQAGVLHVANHHISPGKKQWTWGNGDFGQA WDRNLTDADGPYIELMTGVYTDNQPDFSWLQPYEEKTFTQYFMPYRELGVVKNASSELLM NLETEGEECRLKLFATSAQQGLRIVVRQAGVIRFEEIGSLSPEQVFDRLIPINDLHEAEV IIYDTNGRKKLSWKAEPEIIKAVPEAAKPALAPEEIKTNEELYLTGLHLEQYRHATYCPT DYYREALRRDNGDARCNNAMGLWLIRKGEFAQAEPYLRNAIARLTEKNPNPYDGEAFYNL GLALKFQGKDDEAYDSFYKSCWNAAWQDAGYYSLAQISVSRSNWEEALEEIEKSLLHNWH NLRGRHLKAIILRHLEQKEEALAWIEDSLKTDAFNFGCLFEKYLITRDETVLIQLRNLMK RGAPDYEALVLDYTSAGRFEEALAVAELAIAQPVGEQTLLHYYKSWCLIRLGKTAEAQIA IATAEKEPADYCFPNALEAIEALQCVVDFAGKAPKALYYLGNLWYDKRRYPTAIAAWERS SREDETFPTVWRNLSLAYFNKMNRPDEAVALLEKAFRLDHTDARVLMELDQLYKRLNRPH IERLCFLEQHIDIVMTRDDLYLEYVTLLNQTGQYREAIRRIDQRKFHPWEGGEGKVPAQY QLARLELAKELINRKKYDDALALIDECYIYPTHLGEGKLPGAQENDFNYYKAYILQQQGR PEEAHSLFVKACSGNSQPAAAMYYNDQKPDKIFYQGLAYRKLGEEEKARSRFNQLITYGE EHLFDRFKMDYFAVSLPDLLIWEDDMDKKNRIHCNYLMALGHLGLGNRTKAEQFFDIAAS MDNNHQGVQIHRKLMNTILS >gi|226332045|gb|ACIB01000011.1| GENE 9 16330 - 19101 2105 923 aa, chain + ## HITS:1 COG:TM1193 KEGG:ns NR:ns ## COG: TM1193 COG3250 # Protein_GI_number: 15643949 # Func_class: G Carbohydrate transport 
and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Thermotoga maritima # 35 471 43 481 1087 93 25.0 2e-18 MKQSFQNTRYRAILLLTGLMLAMTAMAQTLTNDALDLSGMWRFQLDPMGFGKTPGSEFYL DKLSETIMLPGSTDQGGKGIKNTARYVDRLSRKFEYQGAAWYQREVVIPEDWADREIYLK LERCHWETTVYVDDKEAGMKEHLSTPNTFVLTPLLTPGIHTLTICVNNTLKYPMDQWNHG TTEYTQTNWNGIAGDISLYAKEKAHIRQINVYPDVSSKAVEVSVQPAPLKTGQTGKLELC IREQGGKIIVRQTLNADSLQVHAGIRQTLPMGNRVKLWDEFTPYLYEIEASWNVDGKTDT QTRTFGMRNVEQGKHHIRLNGRDIHLRGVLDCAVFPLTGYPSTNVDDWKRIFTTIKEYGM NHVRFHSWCPPEAAFEAGDEVGMYLQAELPMWIKDVGKYPDRRDFFEKEMYAILDAYGNH PSFILMCNGNENEGDFAVLEDLVKKAQKYDNRRLYSASTARTHTPSDQYYVSHVTSKGWI TVYEGKPSTDWDRCKESDIDVPVIAHETGQRCMYPNFEEIKKYTGVVEARNFEVFRERLA KNGMLHQANDFFRATGAHTVLQYKEVNESLLRTRNSGGFQLLGLADFPGQGSAFVGILDA FWESKGLVTPEKFRESCAPTVLLARLPKRTFRNGEKLKAKMEIYHFGKDALNSRKLNWTL TGEDGTVYHKGSLKTKSIQPATVDSLGIIELPLNGMDSARKLTLKAELGGIHNEWDVWVY PEQPKTEQHNFVYTRTWNDETKQWLNEGKNVLLIPEKCKGRKAHFASHFWNPIMFNWNPM IVGTLIDNEHPAFRDFPTRNYADWQWWDILNYSTALELDELKEITPLIQSIDSYETNQKL GISFEAKVGKGKLFVLCADPEKKIDERPAMQQLLTSVRNYVSSGYFNPTKSLPVYLLDAL FAPASEEKSGDKGSKAIELLLNK >gi|226332045|gb|ACIB01000011.1| GENE 10 19225 - 21492 1824 755 aa, chain + ## HITS:1 COG:no KEGG:BF0855 NR:ns ## KEGG: BF0855 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 755 1 755 755 1529 99.0 0 MKIKLLLLLCCGLWSSCNSYDYCPVTPSESDLVFTGLARSWDEAMPLGNATVGALVWQRD STLRLSLDRTDLWDLRPVDSLSGDNFRFSWVKEHIRQKNYLPVQKKLDWPYDMNPAPSKI PGAAIEFPLEQIGTPTQVRLYLNNALCEADWADGTQMQTFVHATEPIGWFVFRNLKTPIE PSIITPVYNKTKPDGSLDPVSGQDLHRLGYQQGKVVREGNQITYHQKGYGDFSYDVTVCW KQEGETLYGTWSVTSSLSGEQASEKAEAALQRGLKHDYQAHLEYWDKYWAQSSITLPDSV LQKQYQNEMYKFGSTTREHSYPISLQAVWTADNGKLPPWKGDYHHDLNTQLSYWPTYTGN HLTEGMGYLNTLWNQRDAYKRYTRRYFGTEGMNIPGVCTLTGEPMGGWIQYSMSQTVAAW LAQHFYLQWKYSADRTFLKERAYPFIKDVAIYLEQISEVTPEGVRKLEFSSSPEIFDNSL QAWFSDMTNYDLAMMHFLFKAASELAHELNLADEAGHWASLEAQLPDYDVDEEGCLTFAK GYPYKESHRHFSHAMAIHPLGLIDWSDGEKSQHIIRATLKRLDEVGPDYWTGYSYSWLAN 
MKARAFDGEGAAQALKTFAECFCLKNTFHANGDQTQSGKSRFTYRPFTLEGNFAFAAGIQ EMLLQSHTGVIRIFPAIPKEWKDVSFESLRAMGAFLVSARMEGGEINRVRIYSEKGGMLK IARPGTLKPNKNYTLSGTDILNIDTQAGEWIELNP >gi|226332045|gb|ACIB01000011.1| GENE 11 22598 - 23992 1401 464 aa, chain + ## HITS:1 COG:lin2262 KEGG:ns NR:ns ## COG: lin2262 COG0673 # Protein_GI_number: 16801326 # Func_class: R General function prediction only # Function: Predicted dehydrogenases and related proteins # Organism: Listeria innocua # 105 352 42 269 349 68 26.0 3e-11 MFKHLNALFIGLALFACTSGAVAQTIKPIETSVPVRPAGQKDVVGLTTPKLDVVRVGFIG LGMRGPGAVERFTHIPGTQIVALCDLIPERVAGAQKILTKANLPEAASYSGSEDAWKKLC ERKDIDLVYIATDWKHHAQMAIYAMEHGKHVAIEVPSAMTLDEIWALINTSEKTRKHCMQ LENCVYDFFELTTLNMAQQGVFGEVLHTEGAYIHNLEDFWPYYWNNWRMDYNQNHRGDVY ATHGMGPACQLLDIHRGDKMNYLVSMDTKAVNGPAYIKKTTGKEVKDFQNGDQTSTLIRT EKGKTILIQHNVMTPRPYSRMYQVVGADGYASKYPIEEYCMRPTQIASNDVPNHEKLNAH GSVPADVKKALMDKYKHPIHKELEETAKKVGGHGGMDYIMDYRLVYCLRNGLPLDMDVYD LAEWCCMADLTKLSIENSSAPVAIPDFTRGAWNKVKGYRHAFAK >gi|226332045|gb|ACIB01000011.1| GENE 12 24104 - 24994 699 296 aa, chain + ## HITS:1 COG:TM0177 KEGG:ns NR:ns ## COG: TM0177 COG1284 # Protein_GI_number: 15642951 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Thermotoga maritima # 13 294 1 281 283 160 33.0 3e-39 MKMKISKPSKQGVIRETKDYLMIALGMILYGIGWTLFLLPNDITTGGVPGIASIVYFATG FPVQYTYFAINAVLLMVSLKVLGFRFSLKTIFGVFTLTFFLSVIQKLTANVTLLHDQPFM ACVLGASFCGSGIGIAFSANGSTGGTDIIAAVINKYRDITLGRVMLICDLIIISSSYFVL KDWEKVVYGYVTLYVCSFVLDQVVNSARQSVQFFIISNKYEEIGKRINEYPHRGVTIINA TGFYTGKEQKMMFVLAKKRESTIIFRLIKDCDPTAFVSQSAVIGVYGEGFDHIKVK >gi|226332045|gb|ACIB01000011.1| GENE 13 25020 - 28355 3089 1111 aa, chain + ## HITS:1 COG:TM1193 KEGG:ns NR:ns ## COG: TM1193 COG3250 # Protein_GI_number: 15643949 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Thermotoga maritima # 33 1106 5 981 1087 687 37.0 0 MKLRLSDTLNSLMLGGLLMASTFASAGNKPTKPYWQDVQVVAVNKEYPRSSFMTYENRAN 
ALTGKFEKSKYYQLLNGTWKFYFVDSYKNLPANITDPAISTADWADIKVPGNWEVQGHGV AIYTNHGYEFQPRNPQPPTLPEANPVGVYRRDIDIPADWDGRDIYLHLAGAKSGVYVYIN GQEVGYSEDSKNSAEFLINKFVKPGKNVLTLKIYRWSTGSYLECQDFWRISGIERDVFLY SQPKAALKDFRVTSTLDDTYKDGIFKLGVDLRNNGSTAGNMTLVYELLDANGKVVATGEK ATNVAAGETRTVSFDQTLPDVKTWTSEAPNLYKLVMTVKENGKVNEIIPFNVGFRRIEIK PTEQLARNGKPYVCLFINGQPLKLKGVNIHEHNPATGHYMTEELMRKDFELMKQHNLNTV RLCHYPQDRRFYELCDEYGLYVYDEANIESHGMYYDLAKGGTLGNNPEWLKAHMDRTINM FERNKNYPSLTFWSLGNEAGNGYNFYQTYLWVKNADKDIMNRPVNYERAQWEWNSDMYVP QYPGADWLEAMGKRGSDRPIVPSEYSHAMGNSNGNLWDQWKAIYKYPNLQGGYIWDWVDQ GIDAVDENGRHFWTYGGDYGVNTPNDGNFNCNGIVSPDRTPHPAMAEVKYVHQNVAFEAV DPANGKFLVKNRFYFTNLQKYMISYTIKANGKTVKGGKMSVNVEPQGSKEITIATSGLKS KPGTEYFIYFNVTTTEPEPLIPVGHEIAYEQFRLPIEPGERTFATGGPALKVSAEGNELT ASSSKVNFVFDKKTGLVSSYKVGGTEYFKDGFGLQPNFWRAPNDNDYGNGNPKRLQVWKQ SSKNFNVVDANIVMDGKDAVLTANYLLAAGNLYIVTYRISPSGVVKADFTFTSTDMEAAK TEASEATLMATFTPGSDAARKAASKLEVPRIGVRFRLPAEMNQVEYFGRGPEENYIDRNA GTLIDLYKTTADQMYFPYVRPQENGHHTDTRWLTLNKKGGKGLTIYADKTIGFNALRNSV EDFDGEETVSRPYQWLNRDAGELVHDESKAKDQLPRKTHINDITPRNFVEVCVDMKQQGV AGYNSWGARPEPGYNIPANQEYKWGFTIVPR >gi|226332045|gb|ACIB01000011.1| GENE 14 28377 - 29462 1317 361 aa, chain + ## HITS:1 COG:no KEGG:BF0928 NR:ns ## KEGG: BF0928 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 361 1 361 361 743 100.0 0 MKDLLSIVSKFKVQGTVGEIKPLGAGLINDTYKVNTTEADAPDYVLQRINHAIFQNVEML QDNIAAVTGHIRKKLTEAGETDVDRKVLTFLPTEEGKTYWFDGDSYWRVMVFIPRAKTYE TVNPEYSYYAGAAFGNFQAMLADIPATLGETIPDFHNMEFRLKQLREAVAANAAGRVAEV QYYLDEIEKRADEMCKAERLYREGKLPKRVCHCDTKVNNMMFDEDGKVLCVIDLDTVMPS FIFSDYGDFLRTGANTGDEDDKNLDNVNFNMEIFKAFTKGYLEGAGSFLTPIEIENLPYA AALFPYMQCVRFLADYINGDTYYKIKYPEHNLVRTKAQFKLLQSVEEHTPEMEAYIKECL G >gi|226332045|gb|ACIB01000011.1| GENE 15 29443 - 30096 465 217 aa, chain - ## HITS:1 COG:no KEGG:BF0849 NR:ns ## KEGG: BF0849 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 217 1 217 217 409 100.0 1e-113 
MKKSTLTLLLFILGISSSFSVGAQEAKTVFVNIPDSLCPLLSSVNRADCIDFIESKMKAQ VTNRFGGKSEMTELSPDYVSLQMSDASNWQMKLLPLNDTTKVVCAVSTVCAPACDSHIRF YTTDWKELPATDFLPSVPQMNDFFTSSDSTDYDFIDARLQADMTLMQAELSKENGTLTFT LTTPEYMEKETAEKLKPFLRRSIVYTWKDGKFIPDTL >gi|226332045|gb|ACIB01000011.1| GENE 16 30124 - 31023 764 299 aa, chain - ## HITS:1 COG:CAC1984 KEGG:ns NR:ns ## COG: CAC1984 COG0697 # Protein_GI_number: 15895255 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Clostridium acetobutylicum # 2 296 4 283 285 130 34.0 2e-30 MWLLLAFLSATLLGFYDVFKKKALKDNAVLPVLFFNTLFSSLIFLPFILLSAFAPGVLEG TMLDVPVVGWEVHKFIIIKSFIVLSSWILGYFGMKHLPITIVGPINATRPVMVLVGAMLV FGERLNLYQWIGVMLAIISFFMLSRSGKKEGIDFKHNKWILFIILAAVAGAVSGLYDKYL MKQLPPMVVQSWYNVYQMFIMCPILALLWWPKRKSSTPFRWDWAIIFISIFLCAADFVYF YALSYEDSMISIVSMVRRGSVIVSFLFGAMVFREKNLKSKAIDLILVLIGMIFLYLGTK >gi|226332045|gb|ACIB01000011.1| GENE 17 31055 - 31813 674 252 aa, chain - ## HITS:1 COG:BH0167 KEGG:ns NR:ns ## COG: BH0167 COG0101 # Protein_GI_number: 15612730 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Pseudouridylate synthase # Organism: Bacillus halodurans # 3 239 1 238 263 155 39.0 9e-38 MSVQRYFIYLAYDGTHYHGWQIQPNGISIQECLMKALATFLRKDTEVIGAGRTDAGVHAS LMVAHFDYEGEPLDVDKVAEKLNRLLPQDISVYKVCRVKPDAHARFDATARTYKYYITTV KFPFNRQYRYRIHNPLDFQKMNEAALTLFHYSDFTSFSKLHTDVKTNICKIMHAEWTQED EYTWVFTIQADRFLRNMVRAIVGTLLEVGRGKLSVVDFRKIIEQQNRCKAGTSAPGNALF LVNVEYPQEIFE >gi|226332045|gb|ACIB01000011.1| GENE 18 31894 - 33048 1052 384 aa, chain - ## HITS:1 COG:TM0564 KEGG:ns NR:ns ## COG: TM0564 COG1853 # Protein_GI_number: 15643330 # Func_class: R General function prediction only # Function: Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family # Organism: Thermotoga maritima # 232 381 7 156 159 76 28.0 7e-14 MKRILLFLLACCPMLLCAQEDNSKYLAGAVPVVNGKVIFAEVIQASDMSKRQIYDALLKW 
AEKRFTPSKGQKGRVAYFDEKKGQIACLGEEYLQLSATNSFFLDRATIKYRLVINCLDGS CKMEMYNISYFHGDDTEMEAEDWITDETGLNKAKTKVVAKYGKLRIKTIDLFDDLTEQVT KTLGGAKSEVPLLAKEPKVTPEVFDRELPKAVEQGAMAGYKHISADKIPGNIIKMLSEDW MLITAGTEDKYNMMTASWGGLGYLYNKPVSFCFIYPTRYTYQLMEKNDTYTISFYTETYR DALKYCGSHSGGDVDKVKGAGLTPLTTPSGSKAFSEAWMIIECKKMLSQPITPGAFDTPE LKEAWKDKSLHTMYIGEIMNVWVK >gi|226332045|gb|ACIB01000011.1| GENE 19 33080 - 34867 1582 595 aa, chain - ## HITS:1 COG:no KEGG:BF0923 NR:ns ## KEGG: BF0923 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 595 1 595 595 1231 99.0 0 MKTILLFALSLLLSLSVSDVCAQERVYDISQFGLKANSKKNASPVVRKAIAKIKAECRDG EKVILRFPAGRYNFHEAGSTVREYYISNHDQDNPKKVGIALEDMKNLTIDGQGSEFVFYG RMIPVSLLRSENCVLKSFSIDFEQPHIAQVQVVENDPEKGITFEPAPWVDYRISKDSVFE GLGEGWVMRYSWGIAFDGKTKHVVYNTSDIGCPTKGAFEVAPRRICSPKWKDARLVPGTV VAMRGWGRPTPGIFMSHDVNTSLLDVKVHYAEGMGLLAQLCEDITLDGFGVCLKGDNDPR YFTTQADATHFSGCKGKIVSKNGLYEGMMDDAINVHGTYLKVIKRVDDHTLIGRYMHDQS WGFEWGRPGDDVQFVRSETMELIGKQNQITAIRPYDKGEIQGAREFSITFKEAIDPAINE KSGFGIENLTWTPEVLFAGNTIRNNRARGTLFSTPKKTVVEDNLFDHTSGTAILLCGDCN GWFETGACRDVTIRRNRFINALTNMFQFTNAVISIYPEIPNLKDQQKYFHGGKDGGIVIE DNEFDTFDAPILYAKSVDGLIFRNNVIKTNTEFKPFHWNKDRFLLERVTNVKISE >gi|226332045|gb|ACIB01000011.1| GENE 20 35022 - 35519 433 165 aa, chain + ## HITS:1 COG:no KEGG:BF0922 NR:ns ## KEGG: BF0922 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 165 1 165 165 269 100.0 2e-71 MSSNIFLTIAIVTTGIFAVQFILSIFFGDIDADADIDTDISSVVSFKGLTHFGIGFGWYM YLQHNTEIQTYLTGVAIGLIFVFAVWFLYKKAYQLQQTTHSERTEQLVGRECTIYFKQNE KKYTVQISRDGAMREIDVVTESGKSYQTGDKATITAYKDGTLYIQ >gi|226332045|gb|ACIB01000011.1| GENE 21 35544 - 37169 2048 541 aa, chain + ## HITS:1 COG:BS_yuaG KEGG:ns NR:ns ## COG: BS_yuaG COG2268 # Protein_GI_number: 16080153 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 1 401 1 387 509 114 25.0 4e-25 MTQEMMIMAAILVAVILITFIGILSRYRKCKSDEVLVVYGKTGGDKKSAKLYHGGAAFVW 
PIVQGYEFLSMKPMQIDCKLTGALSAQNIRVDVPTTITVAISTDPEVMQNAAERMLGLTM DDKQNLITDVVYGQMRLVIADMTIEELNSDRDKFLSKVKDNIDTELRKFGLYLMNINISD IRDAANYIVNLGKEAESKAQNEAQANIEEQEKLGAIKIANQIKERETKVAETRKDQDIAI AETKKLQEISVANADKDRISQVAIANAEKESQVAKAEAEKNIRIEQANTEKESRIAELNS DMEIKQAEAQKKAAIGRNEAQKEIALSNSELAVTQANADKQAGEASAKSEAAVQTAKEIA QKEVEEAKARKVESSLKAEKIVPAEVARQEAILQAEAVAEKITREAEARAKATLAQAEAE AKAIQLKLEAEAEGKKRSLLAEAEGFEAMVKAAESNPAIAIQYKMVDQWKEIAGEQVKAF EHINLGNITVFDGGNGGTSNFLNTLVKTVAPSLGVLDKLPIGETVKNMIHPEEKKEETEK K >gi|226332045|gb|ACIB01000011.1| GENE 22 37253 - 37933 652 226 aa, chain - ## HITS:1 COG:no KEGG:BF0920 NR:ns ## KEGG: BF0920 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 226 1 226 226 462 99.0 1e-129 MNQSKCRSHFVISIFLCFILPCLAGCKKKDMSLKLNEPRNIKGVVSYKRSFGDLNDVQLK AAHAWGIAPLASREEAEEMDGKLVHIVDNDFYVVDSLTHSIPYLVPRASALLDTIGANFL DSLTAKGLNPNKIIVTSVLRTENDVKRLRRRNGNASKNSCHFYGTTFDVSWKRFKKVEDE EGRPLQDVSADTLKLVLAEVLRDVRKADKCYVKYELKQGCFHITTR >gi|226332045|gb|ACIB01000011.1| GENE 23 38098 - 39111 975 337 aa, chain + ## HITS:1 COG:DR1988 KEGG:ns NR:ns ## COG: DR1988 COG1702 # Protein_GI_number: 15806986 # Func_class: T Signal transduction mechanisms # Function: Phosphate starvation-inducible protein PhoH, predicted ATPase # Organism: Deinococcus radiodurans # 18 333 65 380 380 270 44.0 3e-72 MIEKLIVLEDIDPVIFYGVNNANIQLIKALYPKLRIVARGNVIKVLGDEEEMCAFEENIT KLEKYCAEYNSLKEEVIIDIIKGNAPQAEKAGNVIVFSVTGKPIIPRSENQLKLVEGFAK NDMVFAIGPAGSGKTYTAIALAVRALKNKEIKKIILSRPAVEAGEKLGFLPGDMKDKIDP YLQPLYDALQDMIPAAKLKEYMELNIIQIAPLAFMRGRTLNDAVVILDEAQNTTTQQIKM FLTRMGMNTKMIITGDMTQIDLPASQTSGLVQALRILKGVKGISFVELNKKDIVRHKLVE RIVDAYEKFDKEKKAEREKLNGERLTISKERQNVGNL >gi|226332045|gb|ACIB01000011.1| GENE 24 39131 - 40075 1187 314 aa, chain + ## HITS:1 COG:CC3242 KEGG:ns NR:ns ## COG: CC3242 COG0152 # Protein_GI_number: 16127472 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase # Organism: Caulobacter vibrioides 
# 10 313 13 319 320 272 46.0 7e-73 MKALTKTDFNFPGQKSVYHGKVRDVYNINGEQLVMVATDRISAFDVVLPEGIPYKGQMLN QIAAKFLDATTDICPNWKLATPDPMVTVGVLCEGFPVEMIVRGYLCGSAWRAYKNGVREI CGVKLPEGMKENQKFPEPIVTPTTKAEMGLHDEDISKEEILAQGLATPEEYAILEKYTLA LFKRGTEIAAERGLILVDTKYEFGKHNGTIYLMDEIHTPDSSRYFYAEGYQERFEKGEAQ KQLSKEFVREWLMENGFQGKEGQKVPEMTPAIVESISERYIELFENITGEKFVKEDTSNI AERIEKNVMAFLAK >gi|226332045|gb|ACIB01000011.1| GENE 25 40197 - 40934 569 245 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163754278|ref|ZP_02161401.1| 30S ribosomal protein S15 [Kordia algicida OT-1] # 24 245 1 221 221 223 48 4e-57 MNYPQEKIKPYSNDGKKSEQVEQMFDNIAPAYDQLNHTLSLGIDRSWRRKAINWLKPFRP QQIMDVATGTGDFAILACHELQPEQLIGTDISKGMMNVGREKVKKEGLSEKISFAREDCT SLSFADNRFDAITVAFGIRNFEDLDKGLSEMYRVLKTGGHLVILELTTPDRFPMKQMFTI YSKIVIPTLGKLLSKDNSAYSYLPQTIKAFPQGEVMKNVISRVGFSQVQFRRLTFGICTL YTATK >gi|226332045|gb|ACIB01000011.1| GENE 26 40961 - 41704 716 247 aa, chain + ## HITS:1 COG:MK0117 KEGG:ns NR:ns ## COG: MK0117 COG0169 # Protein_GI_number: 20093557 # Func_class: E Amino acid transport and metabolism # Function: Shikimate 5-dehydrogenase # Organism: Methanopyrus kandleri AV19 # 5 245 15 271 290 139 32.0 5e-33 MQKYGLIGYPLKHSFSIGFFNEKFKSEGIDAEYVNFEIPEINDFMEVIEENPNLCGLNVT IPYKEQVIPFLDELDKDTAKIGAVNVIKIIRTPKRKIKLVGYNSDIIGFSQSIQPLLQPY HKKALILGTGGSSKAIYHGLKNLGIDSVFVSRTKKEGMLTYEELTPEVMAEHTVIVNCTP VGMYPKVDFCPAIPYELLTPNHLLYDLLYNPNITLFMKKGEEHGAVTKNGLEMLLLQAFA AWEIWNK >gi|226332045|gb|ACIB01000011.1| GENE 27 41705 - 42655 593 316 aa, chain + ## HITS:1 COG:SPy1892 KEGG:ns NR:ns ## COG: SPy1892 COG1073 # Protein_GI_number: 15675706 # Func_class: R General function prediction only # Function: Hydrolases of the alpha/beta superfamily # Organism: Streptococcus pyogenes M1 GAS # 12 313 13 305 308 217 39.0 3e-56 MKKGIKIGVITLLLLLTGCTIGGSFFMLNYSLRPEAKIRAKNADSYPFMYKNYPFLRPWV DSLNQAHALRDTFVLNPEGIRLHAYYIAAPQPTKKTAVIVHGYTDNAIRMFMIGYLYNHD LQYNVLLPDLQHQGESGGPAIQMGWKDRLDVMQWMHIANQIYGDSTQMVVHGISMGGATT MMVSGEAQPYFVKCFVEDCGYTSVWDEFSHELKSSFHLPSFPLMNTTSWLCQKKYGWNFE 
EASSLNQVKKSHLPMFFIHGDKDTYVPTWMVYPLYEAKSAPKQLWIVPGAAHAVSYKENK EEYTRKVKEFTDRYIH >gi|226332045|gb|ACIB01000011.1| GENE 28 42657 - 43577 666 306 aa, chain + ## HITS:1 COG:PA1450 KEGG:ns NR:ns ## COG: PA1450 COG1512 # Protein_GI_number: 15596647 # Func_class: R General function prediction only # Function: Beta-propeller domains of methanol dehydrogenase type # Organism: Pseudomonas aeruginosa # 12 208 36 216 419 89 27.0 6e-18 MKQILTYLLFIWILLPLKAEEKIYTVDNIPKVHLQNKMQYVCNPAGILSQQACDEIDAML YALEQQTGIETVVAIVPSIGDKDCFEFSHQLLNQWGVGKKGKDNGLVILLVTDQRCIQFY TGYGLEGILPDAICKRIQMQEMIPYLKKGEWNQGMLAGVKAVCQRLDGSMVNDDEGRGEE GISVSMLLVVILGFITIAGVVGILAVRASTRCPKCGKHQLQRSSTKLISNRNGVKTEDII YTCRNCGHTVVRRQQSYDDNYRGRGGGGPFIGGFGGGSFGSGGGGGFSGGSFGGGSGGGG GAGSRF >gi|226332045|gb|ACIB01000011.1| GENE 29 43672 - 44253 731 193 aa, chain + ## HITS:1 COG:PM0785 KEGG:ns NR:ns ## COG: PM0785 COG1704 # Protein_GI_number: 15602650 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Pasteurella multocida # 3 193 2 192 193 199 57.0 2e-51 MKKSVIILIAVVAVIIIWAISAYNGLVSMDENVSSQWANVETQYQRRADLIPNLVNTVKG YASHEKETLEGVVEARSKATQIKVDANDLTPEKLAEYQKAQGAVTSALGKLLAITENYPD LKANQNFLELQAQLEGTENRINVARKNFNDAAQAYNTSIRRFPKSIFASVFGFEKRTYFE AAEGTEKAPEVKF >gi|226332045|gb|ACIB01000011.1| GENE 30 44839 - 46005 1107 388 aa, chain + ## HITS:1 COG:MJ0203 KEGG:ns NR:ns ## COG: MJ0203 COG0150 # Protein_GI_number: 15668375 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylaminoimidazole (AIR) synthetase # Organism: Methanococcus jannaschii # 47 372 49 326 350 112 29.0 2e-24 MSNQRYMMRGVSASKEDVHNAIKNIDKGIFPQAFCKIIPDILGGDPEYCNIMHADGAGTK SSLAYMYWKETGDLSVWKGIAQDALIMNIDDLLCVGAVDNILVSSTIGRNKLLIPGEVIS AIINGTDELLAELREMGVGVYATGGETADVGDLVRTIIVDSTVTCRMKRADVINNANIRP GDVIVGLASHGQATYEKEYNGGMGSNGLTSARHDVFAKYLAEKYPESYDAAVPEELVYSG GLKLTDTVEGSPIDAGKLVLSPTRTYAPVVKKLLDALRPEIHGMVHCSGGAQTKVLHFVG DVRVVKDNLFPVPPLFRTIQEQSGTDWAEMYKVFNMGHRLEVYLSPEHAAEVIAISESFG IPAQIVGRIEESDKKELIIKSEFGEFRY 
>gi|226332045|gb|ACIB01000011.1| GENE 31 46069 - 47181 1291 370 aa, chain + ## HITS:1 COG:VC2179 KEGG:ns NR:ns ## COG: VC2179 COG0216 # Protein_GI_number: 15642178 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Protein chain release factor A # Organism: Vibrio cholerae # 6 365 4 355 362 333 48.0 3e-91 MADNNSILEKLDGLVARFEEVSTLITDPAVIADQKRYVKLTKEYKELDDLMKARKEYMQL LANIEEAKDILSNESDADMREMAKEEMDNSQERLPVLEEEIKLLLVPADPQDGKNAILEI RGGTGGDEAAIFAGDLFRMYAKFCETKGWKMEVSSANEGAAGGYKEIICSVTGDNVYGTL KYESGVHRVQRVPATETQGRVHTSAASVAVLPEAEEFDVVINEGEIKWDTFRSGGAGGQN VNKVESGVRLRYIWKNPNTGVAEEILIECTETRDQPKNKERALARLRTFIYDKEHQKYID DIASKRKTMVSTGDRSAKIRTYNYPQGRITDHRINYTIYNLAAFMDGDIQECIDKLTVAE NAERLKESEL >gi|226332045|gb|ACIB01000011.1| GENE 32 47230 - 48054 995 274 aa, chain + ## HITS:1 COG:RSc2773 KEGG:ns NR:ns ## COG: RSc2773 COG0284 # Protein_GI_number: 17547492 # Func_class: F Nucleotide transport and metabolism # Function: Orotidine-5'-phosphate decarboxylase # Organism: Ralstonia solanacearum # 11 268 29 285 288 202 41.0 7e-52 MNKQSLFENIKRKKSFLCVGLDTDIKKIPDHLLDDPDPIFAFNKAIVDATADYCIAYKPN LAFYESMGVKGWTAFEKTVNYIKENYPDQFIIADAKRGDIGNTSAMYARTFFEELDIDSV TVAPYMGEDSVTPFLSYEGKWVILLALTSNKGSHDFQLTEDANGERLFEKVLKKSQEWAN DEQMMYVVGATQGRAFEDIRKIVPNHFLLVPGIGAQGGSLEEVCKYGMNSTCGLIVNSSR GIIYVDKTENFAAAARAAAKEVQEQMAEQLKAIL >gi|226332045|gb|ACIB01000011.1| GENE 33 48054 - 49283 1132 409 aa, chain + ## HITS:1 COG:lin2710 KEGG:ns NR:ns ## COG: lin2710 COG1078 # Protein_GI_number: 16801771 # Func_class: R General function prediction only # Function: HD superfamily phosphohydrolases # Organism: Listeria innocua # 4 317 10 321 440 186 34.0 9e-47 MPHERKIINDPVFGFINIPKGLLYDIVRHPLLQRLNRIKQVGLSSVVYPGAQHTRFQHSL GAFYLMSEAITQLASKGNFIFDSEAEAVQAAILLHDIGHGPFSHVLEDTIVKGVSHEEIS LMLMERMNREMNGQLSLAIQIFKDEYPKRFLHQLVSGQLDMDRLDYLRRDSFYTGVSEGN IGSARIIKMLDVADDHLVVESKGIYSIENFLTARRLMYWQVYLHKTSVAYERMLISALLR AKELASKGVELFASPALKFFLYNHIDPEVFYNNPDCLENFIQLDDNDIWTALKVWSTHTD 
KVLSTLSLGMINRNIFKVEICSEPISEERKKELTLLISRQLGITLSEADYFVSTPSIEKN MYDSADDSIDIIYKDGTIKNIAEASDMLNISLLSKKVKKYYICYLRWDR >gi|226332045|gb|ACIB01000011.1| GENE 34 49366 - 50406 1046 346 aa, chain + ## HITS:1 COG:FN1909 KEGG:ns NR:ns ## COG: FN1909 COG1044 # Protein_GI_number: 19705214 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase # Organism: Fusobacterium nucleatum # 1 336 1 331 332 221 37.0 2e-57 MEFSAKQIAAFIQGEIIGDENATVHTFAKIEEGIPGAISFLSNPKYTPYIYETKASIVLV NKDFTPEQEVKATLIKVDNAYESLAKLLNLYEMSKPKRTGIDERAYVAETAKIGKDVYIA PFACIGDHAEVGDNTVIHPHATVGGGAKIGSNCILYANSTVYHDCRVGNNCILHAGCVIG ADGFGFAPTPQGYEKIPQIGIVILEDNVEVGANTCIDRATMGATVIHSGVKLDNLVQIAH NDEIGSHTVMAAQVGIAGSTKVGEWCMFGGQVGIAGHLKIGNQVNLGAQSGVPGNIKSGS QLIGTPPMELKQFFKASIVQKSLPEMQIELRNLRKEIEELKQQLNK >gi|226332045|gb|ACIB01000011.1| GENE 35 50410 - 51795 1429 461 aa, chain + ## HITS:1 COG:XF0803 KEGG:ns NR:ns ## COG: XF0803 COG0774 # Protein_GI_number: 15837405 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-3-O-acyl-N-acetylglucosamine deacetylase # Organism: Xylella fastidiosa 9a5c # 1 319 1 297 304 177 35.0 4e-44 MLKQKTLKESFSLRGKGLHTGLDLTVTFNPAPDNHGYKIQRTDLEGQPIIDAVADNVGET TRGTVLSKNGVKVSTVEHGMAALYALGIDNCLIQVNGPEFPILDGSAQYYVQEIEKVGIE EQNAVKDFYIIKSKIEFRDEETGSSIIVLPDENFSLNVLVSYDSTIIPNQFATLEDMKKF KDEIAPSRTFVFVREIEPLLSAGLIKGGDLDNAIVIYEREMSQENYDKLADVMRVPHMDA KLLGYINHKPLVWPNECARHKLLDVIGDLALIGKPIKGRIIATRPGHTINNKFARQMRKE IRLHEIQAPTYDCNRAPIMDVNRIRELLPHRYPFQLVDKVIEIGANYIVGVKNVTSNEPF FQGHFPQEPVMPGVLQIEAMAQIGGLLVLNSVDEPERYSTYFMKIDGVKFRQKVVPGDTL IFRVELLAPIRRGISTMKGYAFVGEKVVCECEFMAQIVKNK >gi|226332045|gb|ACIB01000011.1| GENE 36 51806 - 52573 703 255 aa, chain + ## HITS:1 COG:FN0595 KEGG:ns NR:ns ## COG: FN0595 COG1043 # Protein_GI_number: 19703930 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Acyl-[acyl carrier protein]--UDP-N-acetylglucosamine O-acyltransferase # Organism: Fusobacterium nucleatum # 2 255 4 257 257 205 42.0 
8e-53 MISPLAYIHPEAKIGENVEIAPFVYIDRNVVIGDNNKIMANANILYGSRIGNGNTIFPGA VIGAIPQDLKFKGEESTAEIGDNNLIRENVTINRGTAAKGRTIVGNNNLLMEGVHVAHDA LIGNGCIVGNSTKMAGEIIIDDNAIISANVLMHQFCRVGGYVMIQGGCRFSKDIPPYIIA GREPIAYSGINIIGLRRRGFSNEIIENIHNAYRIIYQSGLNTSDALTKVEAEVPASPEIE YIVDFIRNSERGIIR >gi|226332045|gb|ACIB01000011.1| GENE 37 52707 - 53270 794 187 aa, chain + ## HITS:1 COG:no KEGG:BF0904 NR:ns ## KEGG: BF0904 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 187 36 222 222 335 100.0 4e-91 MIYRFTIISDEVDDFVREIQIDPEATFFDLHEAILKAANYTNDQMTSFFICDDDWEKEKE ITLEEMDNNPEMDSWIMKETRLNELIEDEKQKLLYVFDYMTERCFFIELSEIITGKEIKG AKCTKKSGEAPKQTVDFEEMAAGGGSLDLDENFYGDQDFDMEDFDAEGFDVNDGAAGGGS SYDEDKF >gi|226332045|gb|ACIB01000011.1| GENE 38 53367 - 54272 843 301 aa, chain + ## HITS:1 COG:BH2366 KEGG:ns NR:ns ## COG: BH2366 COG0324 # Protein_GI_number: 15614929 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA delta(2)-isopentenylpyrophosphate transferase # Organism: Bacillus halodurans # 4 284 3 287 314 194 36.0 2e-49 MTAKTLIVLIGPTGVGKTELSLRIAEYFKTSIISSDSRQLYAELKIGTAAPTPEQLKRVP HYFVGTLQLTDYYSAAQYETEVMSVLEQLFQQHHVVLLTGGSMMYVDAICKGIDDIPTVD AETRELLLHKYDTEGLDNLCAELKLLDPEYYKIVDLKNPKRVIHALEICYMTGKTYTSFR TQQKKERPFHILKIGLTRDRAELYDRINRRVDQMMDEGLLEEARSVYAHRELNSLNTVGY KEIFKYLDGEWDLDFAIEKIKQNSRIYSRKQMTWFKRDEEIRWFHPEQEKEILSYLQASI K >gi|226332045|gb|ACIB01000011.1| GENE 39 54412 - 54567 56 51 aa, chain - ## HITS:1 COG:no KEGG:BF0902 NR:ns ## KEGG: BF0902 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 51 1 51 51 102 100.0 3e-21 MRLYPLLYSYPIGVNAPILDKKLKIEFNCSDYAQDNGWVLQCRVRKVLVFA >gi|226332045|gb|ACIB01000011.1| GENE 40 54670 - 55749 621 359 aa, chain + ## HITS:1 COG:no KEGG:BF0824 NR:ns ## KEGG: BF0824 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 359 33 391 391 688 98.0 0 MTQNEITGSVAKLETVEKVVDDTLFLKNPQVFCLETKDNALIKRINRVIEWKNTYYILDK 
SMKQVLAFNDKGRHLFTIHRVGIGKGEYGSILDIAIDRQNENLVFLADPTSLIYYDLQGN FIKTTKLPGYYHSIAIDNGMIYLENETYINNQLSTSSITVIAPDNQKTELLKPLREIAPY CFIGESRLNGTTPIVFTRKFDNTIYQLEDGKITPYYSFDFMNENFPEAAKDKEYTCRELN KFTWDRYVYLMANVANAPQYLMFSTNLFGVYVFDKTQNKLLKYNKIRNTGYQTDLHQYIP VEGANNRVFFTVYPTTLFSLKAIVDNHPSFKDKMSDKLYKLTESLDSDSNPIIFSYQIK >gi|226332045|gb|ACIB01000011.1| GENE 41 55845 - 57887 1740 680 aa, chain - ## HITS:1 COG:PA0600_3 KEGG:ns NR:ns ## COG: PA0600_3 COG4585 # Protein_GI_number: 15595797 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Pseudomonas aeruginosa # 474 675 27 239 244 103 32.0 2e-21 MKRLLSVFLFLFCCVIAADAQDDAAQYDSIMNLMKNKKIPLMERYYMTGDIEYLSREHQI AVLKQLIPEAKEVEDKAVITRLYSIVAMFENQLGHMTEAKDYLDSAFMNKGKFENNNISG MMHYIAGIYYSDKNLMEQAHENYYQAAEYFNRNEMKPAILTEIYYDLSIIYSMWQDDEGL HELSEAMKDLPVDFPFQQILKWTIKAKYFYALYQNEHRVDLLDSVTKYNQEAFKVYTSTE NPYDVGYVISDNYLHQAIVYSEAGKIKEAEQCFETGKKLMNPKKIDANVSVSYVSGVIAY YQADYELAEQHLQDGLRELKRMDEEQEVDYYHALIEFYTLLAKVYEKQELYNKALEAARN SLKYETRLFDKNSNKTIQKLRTQYNLNEKERVVEQLSAINEKNRRINILSAILIVLALVT IFLLLKRYRSRQRIHEGMLQIAKLKQQEAELLVKLQKTKLEEREREFQSLVHEAQQRKVQ YYLEGLEVERKRLAKELHDNVSNELLAIKMKITDGTSSCEEIMDTLQTLQAEVRGISHDL MPPIFKYASLSEILQDYVYQHNQPGQTELELLLEPEDNFDNLSQKVSLEIYRIVQEAVGN SLKHAQATLVKIILVREDNKVKLTVSDNGRGFEQQAGKTGIGLTIIKERVENLRGTLTLN SAPGKGTELIVEIDLENLEK >gi|226332045|gb|ACIB01000011.1| GENE 42 57938 - 58111 77 57 aa, chain - ## HITS:1 COG:no KEGG:BF0822 NR:ns ## KEGG: BF0822 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 57 1 57 57 93 94.0 2e-18 MLLPQTNVSKLTMEDKSSIPSRKDGKKWENIDLKELYQLYNEIDSYISQRYNELFGL >gi|226332045|gb|ACIB01000011.1| GENE 43 58129 - 59688 1036 519 aa, chain - ## HITS:1 COG:no KEGG:BF0821 NR:ns ## KEGG: BF0821 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 519 2 520 520 953 99.0 0 MGLFFVYIVKSSVCLAGFYLFYRLLLSKETFHRFNRIALLTLLLLSLLLPLVQLTTIEQT 
EVHQTMLTIEQLLMMADMPVSEVTPAEAVTLSGIQILLMVYLSGVLFFACRHIYSLGRLL MLLRSGEKEKMENGMTLVIHQQKISPFSWMKYIVISKVDLEEDGREILIHEAAHVRNRHS VDLLIADICIFFQWFNPASWLLKQELQNIHEYEADESVIREGVNARQYQLLLIKKAVGTR LYSMANSFNHSKLKKRITMMLKEKSNPWARLKYLYVLPLAAIAVTAFARPEISGKMDEIS AVKVNDFVQIVGTKVPEKKVEVLKDTVKKDAPKEEAFEVSRLETVPTHSKVTITDGMKRS GMDLFSVRNAGSQPQPLILVDGKEITGEQMQRDINPDMIESISVLKDEASTAIYGDKAKH GVILITLKGKDVQNVLSLSASDPEDGVKVVGVVKDHLDKPLAGASVFISGTVSGTMSDAY GHFVLLAPKNAMLRISYTGMTTVEKAVAPEVNVTLNPAD >gi|226332045|gb|ACIB01000011.1| GENE 44 59707 - 60039 403 110 aa, chain - ## HITS:1 COG:no KEGG:BF0820 NR:ns ## KEGG: BF0820 # Name: not_defined # Def: putative regulatory protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 110 12 121 121 199 99.0 2e-50 MGFFWEKGPLFVKEILAFYDEPKPHFNTLSTIVRGLEEKGFLAHHTYGNTYQYYAVVSES DFSKRTLKSVISKYFNNSYLSAVSSLVKEEDISLDDLKKLIQEVEQKNEE >gi|226332045|gb|ACIB01000011.1| GENE 45 60507 - 60896 113 129 aa, chain + ## HITS:1 COG:no KEGG:BF0818 NR:ns ## KEGG: BF0818 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 129 1 129 129 231 100.0 6e-60 MGKPIIRILYFQRRGIDANGYYRLNYIIQRGRKILYYATETKDFEYSDCFTSSCLAERKE TILDTAQYIFKVHKVRYSRLILSPSYSYITKYKSRIRQICTATRFCKSDIIWTMCPRSFI PISKIPFSF >gi|226332045|gb|ACIB01000011.1| GENE 46 61641 - 61865 116 74 aa, chain + ## HITS:1 COG:no KEGG:BF1426 NR:ns ## KEGG: BF1426 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 56 1 55 56 69 69.0 4e-11 MLIEKRYKDEDTGSDGVNSLPKLELSYSSGVYFFIKTKKDNYQLGNKEIKNLPPNLKKNH LLQEMVFQIRSVNA >gi|226332045|gb|ACIB01000011.1| GENE 47 61991 - 62977 648 328 aa, chain + ## HITS:1 COG:no KEGG:BF0893 NR:ns ## KEGG: BF0893 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 328 1 328 328 658 99.0 0 MNLNTPVEIPSGFTPISHAQQLLIMGSCFAENIGTLLAENKFRIDINPFGILYNPRSISM ALREIISQKQYKASDLFLHRECWHSPMHHGSFSAATLANTLRNIQSRVEQAHKELKQLDR LMLTFGTAYVYEQKETGKVVANCHKLPEKNFIRRRLEIDEIVEDYTLLLDELISLNPQLK 
ILFTVSPIRHIRDGMHANQLSKSVLLLAIDRLMQRYPQVTCYFPSYEIVLDELRDYRFYA DDMVHPSTLTVNYLWERFSETFFTPETQSLIKECETIRKAIAHKPFHPESEEHKRFLGQI VLKIERLNGKYPYLDFEKETNMCRLALQ >gi|226332045|gb|ACIB01000011.1| GENE 48 63007 - 65502 1917 831 aa, chain + ## HITS:1 COG:CAC0492 KEGG:ns NR:ns ## COG: CAC0492 COG0787 # Protein_GI_number: 15893783 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Alanine racemase # Organism: Clostridium acetobutylicum # 474 831 11 376 386 206 36.0 2e-52 MSYTIESIVEKIGARRLGNKPAAIDWLLTDSRSLCFPEETLFFAIPTKRNNGARYIPDLY ERGVRNFVVTSEDFKGLTMDSEQLKVTMAQECNFLIVPNTLKALQKLAEQHRGSFQIPVV GITGSNGKTIVKEWLHQLLSPDRVIVRSPRSYNSQIGVPLSVWQMNEESELGIFEAGISE MGEMRPLQNMIKPTIGILTNIGGAHQENFFSLQEKCMEKLSLFKDCDVVIYNGDNEMISN CVGKSMLTAREIAWSMRDIERPLYISRVEKKEDHTVISYRYLEMDNTFCIPFIDDASIEN SLNCLAACLYLMVPADQITERMARLEPVAMRLEVKDGKNNCILINDSYNSDLASLDIALD FLYRRSQSKGLKRTLILSDILETGQSTTTLYRKVAQLVHSRGIEKIIGVGAEISSCASKF DIEKYFFPDTKALLASDVIKKLRNEIILIKGSRNFGFDLVSEELELKVHETILEVNLGAM VANLNHYRSMLKPETKMVCMVKASAYGAGSYEIAKTLQEHHADYLAVAVADEGSDLRKAG ITASIIIMDPELTAFKTMFDYKLEPEVYNFHLLDALIKAAEKEGITNFPIHVKLDTGMHR LGFEEKDIPQLIRRLKNQNALIPRSVFSHFVGSDSAQFDAFTRQQIERYEKMSKELQDAF PHKILRHICNTAGIERFPGAQFDMVRLGIGLYGISPIDNSIINNVSTLKTTILQIRDVAE EDTVGYSRKGHLIRPSRIAAIPIGYADGLNRHLGCGHGYCLVNGKKAPYVGNICMDVCMI DVTDIDCREGDQAIIFGDELPITVLSDALETIPYEVLTGISTRVKRVYYQD >gi|226332045|gb|ACIB01000011.1| GENE 49 65664 - 65891 333 75 aa, chain + ## HITS:1 COG:no KEGG:BF0891 NR:ns ## KEGG: BF0891 # Name: not_defined # Def: putative sec-independent protein translocase # Organism: B.fragilis # Pathway: Protein export [PATH:bfr03060]; Bacterial secretion system [PATH:bfr03070] # 1 75 1 75 75 102 100.0 5e-21 MTNLFLLGFMPSGSEWIIILLVILLLFGGKKIPELMRGLGKGVKSFKEGVNEAKEEINKA KEEIDEPENKEKKDN >gi|226332045|gb|ACIB01000011.1| GENE 50 65968 - 66762 661 264 aa, chain + ## HITS:1 COG:DR0806 KEGG:ns NR:ns ## COG: DR0806 COG0805 # Protein_GI_number: 15805832 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: 
Sec-independent protein secretion pathway component TatC # Organism: Deinococcus radiodurans # 3 257 18 251 270 127 35.0 3e-29 MAEIKELTFWDHLDELRRVLFRIIGVWFVLAVGYFIAMPYLFDHVILAPCHNDFIFYHLL RDIGQAFDLTDDFFTREFKVKLVNINLAAPFFIHMSTAFWMSVVTATPYLFFEIWRFIRP ALYPNERKGVRKALTIGTVMFFIGVLLGYFMVYPLTLRFLSTYQLSVEIENQISLNSYID NFMMLVLCMGLAFELPLVTWLLSLLGLVNKSFLRKYRRHAIVLIVIAAAVITPTGDPFTL SIVAIPLYLLYEMSILMIKDKNRS >gi|226332045|gb|ACIB01000011.1| GENE 51 66970 - 68010 398 346 aa, chain + ## HITS:1 COG:SA0834 KEGG:ns NR:ns ## COG: SA0834 COG1835 # Protein_GI_number: 15926563 # Func_class: I Lipid transport and metabolism # Function: Predicted acyltransferases # Organism: Staphylococcus aureus N315 # 28 345 35 368 604 82 26.0 2e-15 MINTLTSLRFIFAIMVFGAHCYVIDNVFNTHFFKEGFVGVSFFFVLSGFIIAYNYQEKLK DNKIDKRTFWVARIARIYPLHWLTLFIAAILGSYVIASGTLDWLKHFLASLTLTNAYIPR ADYFFSFNSPSWSLCCEQLFYICFPFLIPLAKNYKYLLSVFGIVAILMVVGMYFTPEDEI KGFWYVNPITRFPDFIVGMLLFQLYERLKNKNITALQGSIIEISSIILFLIFYLYAADIP KVYRYSCYYWLPVAVILISFSLQKGIFSRILSNRFLVIGGEISYSFYLIHLFVLLTYSEW QKENNLHTEWYISVPILFSIIILLSLLSYYYFEKPMNKRVKTLLNR >gi|226332045|gb|ACIB01000011.1| GENE 52 68121 - 71567 2282 1148 aa, chain + ## HITS:1 COG:sll1582 KEGG:ns NR:ns ## COG: sll1582 COG1112 # Protein_GI_number: 16329815 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases and helicase subunits # Organism: Synechocystis # 186 1147 198 1101 1118 276 27.0 2e-73 MENEVNESFEILLAACRTADSNLAVAYKQLRDLLERLCRAQMQDESLQMTDLSARISFVS ARAGLTIVEQNRLHTFRLTSNAILNRKEEPVREKLLRDAKTLAFFIRKLYEVEIPGELYH LLPRADATYLVAPPAKKRIERMRVCYQYADEQYLYVTPVDTIADEYLRIRYNVPQINEEF AQTCEILWCHAQLNLLDVAIDKTGVLTPSFIVLEPDYLIDISSLAECFRDYGHHPANYVL ARLQPIDNARPLLLGNIANLFLDEWIHAENAPDYRECMQKAFRRYPIELAACTDLRDREK ERQFFDDCKLHFEHIREVVTDTFRAPGYELDKTDAVLEPSYICEALGLQGRLDYMQRDMS SFIEMKSGKADEFSIRNKVEPKENNKVQMLLYQAVLQYSMGMDHHRVKAYLLYTRYPLLY PARPSWAMVRRIINLRNRIVSDEYGIQLRNSVEYTASKLQAIRSDILNERGLSGRFWEQY LRPSIDNLSQKLASLTPLEQSYFYALYNFITKELYTSKSGDVDYEGRTGSAALWLSTLTE 
KCEAGEILYDLRIKENHAADEHKAYILLEQRKEGYGENKLSPEPNEISSEVEKGAQALPN FRQGDAIVLYERNRNEDNVTNKMVFKGNIEFITEEEIGIRLRATQQNSSVLPPDSLYAIE HDTMDTTFRSMYQALSAFASATKERRDLLLAQRMPEFEYGLDKQILTAPDDFTRVTLKAL AAKDFFLLVGPPGTGKTSCALKKMVETFHCEAQTQILLLSYTNRAVDEICKAISSIRPEV DFIRVGSELSCDEAYRHHLIENELSLCTRRSEVAERIARCRIFVGTVASISGKPELFRLK RFDVAIIDEATQILEPQLLGILCARSENGENAVGKFILIGDHKQLPAVVLQNTEQSEIYD EGLRSAGLKNLKDSLFERLYRTLQTSSEDLFPDSASVSAPNHRSFDMLCKQGRMHPEVAH FANQAFYEGRLLPVGLPHQMEDNQDVQRMVFLPSEPEPQGTSAKVNHSEARIVARIAADV YQQYGGTFDGMRTLGIITPYRSQIALIRKEIVKMGIPELNSILVDTVERFQGSERDVIIY SFCVNYPYQLRFLSNLTEENGVFIDRKLNVALTRARKQMFITGVPRLLEQNPIYDSLIKL IKQQEPLS >gi|226332045|gb|ACIB01000011.1| GENE 53 71887 - 73134 1219 415 aa, chain - ## HITS:1 COG:no KEGG:BF0887 NR:ns ## KEGG: BF0887 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 415 1 415 415 859 99.0 0 MNIRLTSLFVSLFLSVPVWAGGHKNLPAKGDLHIPVFENVNVRFSPDTYPDNYNEADGTG VYHLVNGRIILKKITLPEYKRNVSVSLKVTLASNGDRWDKSGSCFVLPKSSAINLLTIAR DGMKFPSVDSLKLEKMVGIVPGKDYLPTVELMRFMTPFGVGHYSNNNDSLSSKRRPVYIP KWESNVTWQQDITDLYPLLEGEAYVGIYIDTWTSEGYLANADIDVKESRLACDVLPKRHV EPLMNTVYYMGQSYPDIFARRDVSTDFTVPKGAKNIRLKYIVTGHGGHSGGDEFVQKRNI ISVDGKEVLNFIPWRDDCASFRRFNPATGVWLIKRLASYIGEKGYTEKEVEEPLASSDLS RSNWCPGSDVVPEEAVIGTLAPGKHTFTVSIPEAQAVDGNKLNHWLVSAYLVWEE >gi|226332045|gb|ACIB01000011.1| GENE 54 73153 - 74502 1494 449 aa, chain - ## HITS:1 COG:TM0306 KEGG:ns NR:ns ## COG: TM0306 COG3669 # Protein_GI_number: 15643075 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Thermotoga maritima # 39 340 22 358 449 166 35.0 9e-41 MKNNRLIITLIALFLLGFGLKAQTASTEETAAQKEKRMEWFAQAKLGIFIHWGIYAVNGV SESWSFFNNYLPYEEYMAQEKGFTASAYNPQEWVKLIKESGARYTVITTKHHDGVALWDT KAGDLSTVKSTPAGRDLIAPFVKEVRKQGLKLGFYYSLLDWSHPDYPNKTRTEVRYKNDP DRWAKFVKFNFGQLSELNKTWKPDLYWFDGDWEQTAEAWDSKGIINLLRSTNPNVIVNSR IQGYGDYATPEQGVPVVRPADKYWELCMTMNDSWGYQHADTNYKTPFMLLRTFVDCLSMG GNLLLDIGPKEDGTIPAEQIAVLKEFGRWTKKHKEAIYETRAGIPCEHFQGYTTLNKAGD 
ILYLYLPYKPNGPIEVKGLVNKVNRVWVVGNGAMLPYKVYNKNYWSEVPGNLYIDIPERV QDEQITVIAVLLDGPIKLYRGVGQVIESN >gi|226332045|gb|ACIB01000011.1| GENE 55 74946 - 76391 1148 481 aa, chain + ## HITS:1 COG:CC1508 KEGG:ns NR:ns ## COG: CC1508 COG0477 # Protein_GI_number: 16125755 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Caulobacter vibrioides # 10 478 19 429 431 247 33.0 4e-65 MSTLQNAMGKMTNYRWTICAMLFFATTINYLDRQVLSLTWDEFIKPEFHWNESHYGIITA VFSIVYAICMLFAGRFIDWMGTKKGYLWSIGIWSAGACLHAFCGIITEEYVGMHSAAELI AATGDVVVVLATISMYCFLVARCILALGEAGNFPAAIKVTAEYFPKKDRAYATSIFNAGA SIGALIAPLSIPLLAKAWGWEMAFVIIGALGFVWMGFWVFMYTAPSKNKFVNSAELEYIE QDKHETYTATVKENEEKKSMTFRQCFTYRQTWAFAFGKFMTDGVWWFFLFWAPSYLNTQF DIKTSEGLGRALIFTLYAITMLSIYGGKLPTIIIHKTGLNPYAARMRAMLIFAFFPLLVL LAQPLGTISPWFPVIMIGIGGAAHQSWSANIFSTVGDMFPKSAIASITGIGGMAGGVGSM ILQYSAGELFVHADKTQMVFMGFIGKPAGYFVIFCICSVAYLIGWIVMKALVPKYKPIIL N >gi|226332045|gb|ACIB01000011.1| GENE 56 76558 - 80643 2174 1361 aa, chain - ## HITS:1 COG:all4963_3 KEGG:ns NR:ns ## COG: all4963_3 COG0642 # Protein_GI_number: 17232455 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. 
PCC 7120 # 836 1076 2 236 294 119 34.0 5e-26 MKKLNLFLLVLFICNCPVVSGYAFFDRDIRLLTMQDGLADNTITSIYKDRDGFMWFGTNN GLSRYDGKLIKNFSSSPAYMYVSEIVEMSDRYLGVIAGNTLYCFARSLEKFIPIVHATDY SSVHVSHLLPIDNNSFWGLSGNKLYLYTQEEVKNEKGEVVQIKLKCEKQYKDLIDSGDNF CAMCYTDNHEMLCLVTQQGNLLLFQPESSEKSKKISLWKNKTWDATSVLYDKGVVWVSTI GHGILRYYVSSGYIDRITYKENNKENSLSHTDVFQVIPINNNRYLAVTWSGYTLLFQDKN DPKRMMTEIYYNTASQLHRNLETRMISAYYDPSGIVWIGTNGGGVIYSDLRSQFYNQFHQ ERHNEICGIVMDNRKYVWMATFHQGIMKSEQPFEPGRRMNFTRVGTPDIQSKNTVLCAIN DNRGSLWFGNRDGTLTSYNEATKQFRLHFLQDRGKVNTVSIWALYWDTNRNLWVGTNDGV WKLNIDSGFCKKIPIEILFKDPTPICIRAIAGTKDGTIWLGTSNAGVCKLKIDSRGEMSL ETGYEKKANIKNNSVRSLLVSSDGNVYVGYMDGFAILSPKKDAIREYYTTRNGLCSNFIG CLVEDNRGHIWLGSNSGVSRYSRHQHLFYNYYISGSNRSALLADNTLFFGNNKSLTYFDP DDVGGHLDEDQVLITGLEVDGRPVGIGDKINGQTVLAEGISYTSSITLNNENRDFVLSFN NLSYAEEQQKYNYRLLPYQTHWLVSNDGEKATYMNLPEGDYTFEVKNIYPDGKDGKVTSL QIHILPHWSRTLPFRLFILLLLAGGVAYLIRLVKHRQMRMEREMRMEHELLSVNLEREKE RQIRMERENFFTSAAHELRTPLTLILAPLQELLEHIKASDPLYSKLYTMYKNSSSLHTLV DQLLYVQKIEAGMVKLRLSEADIVELVREVAESFRQMAGIKGCTFQVRLPEDPVFLWIDT EKITSSVGNLLSNAFKYTSPNGEVLLTLTRMEQDGKPFCQITVSDTGEGIPDEFQKRIFD SFITGDNSPAFSTKVGIGLRIVKNTMDLHHGQVILDSEPGKGSTFVLLIPEGKSHFTGDL YEIVDYRGHETEPQFQPLSVQEKSEEGVPVTKKTLLIVEDNVDVRQYIRSLFVTKYTVLE AADGEEGVRIATNEIPDLIISDVMMPVKDGFACCREIRERQETAHIPILMLTAKAEDADV LQGSYSGADDYMMKPFNPEVLKAKVENLILQRERLKRIYTKALMLKRESVEDEEADDEFI QKLIHVVEKNLSNENFNVKMLAEQLHMSQPTLYRKVKQRSELSVVDMIRSVRVSKAASLI MENRYSIQEISEKVGFSDARTLRKHFTEQFGVPPSKYMENK >gi|226332045|gb|ACIB01000011.1| GENE 57 80829 - 84014 2180 1061 aa, chain + ## HITS:1 COG:no KEGG:BF0883 NR:ns ## KEGG: BF0883 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1061 1 1061 1061 2055 100.0 0 MKKISILFMLLLGITTLYAQQLNITGTVIDKKLNEPIIGATVQVKGTNNGSITDMEGKFS LKNVSKGGILTVSYIGYTTQSIPLNGTQTSFRIELSEDSKTLDEVVVVGFGTQKKVNLTG AVTSVDTKALASRPVSQVGQALQGVVPGLNLSTPDLGGQLGQTMNVNIRGTGTIGKGSSA SPLILIDGMEGNMNNLNPEDIENISVLKDAASSSIYGSRAAFGVILITTKKGKAGKMQVN YNNSFRYSGPTSLPNQLDSYRFANYFNDAAINQGGSVIFDEETIDRIQKYMAGEITTTTI 
ANGTNWHFHEKANDNVNWWKKHFQWAWSNEHNISLNGGTEKLQYYVSGSYLNQDGNLRYG NDNYKRYNATAKVNTQINKYVDFNINTKFVRFDLDNPVYLEEGGLLYHDIARMWPMMPFK DPNGYYMRNGKLNQLTDGGRAKTHNDNIYLQGQLVIHPLKGWNIYAEAGMRVINQNKQTN LNPIYEHDVNGNPLALAFSGSYSPGSSFARSAYHNSNFYTTSVYTDYTLQIKDHYFKALV GMNTEEYVYRELAAQRPDVISSLIPEISAATGEDKINSSKYNDWSTAGFFGRLNYSYKDR YMAEVNVRYDGSSRFLKDQRWNVFPSFSLGWNLARESFFEPINNIINTLKPRVSWGMLGN QNTDSYYPFYLTQSVTANGGNWLMDGSRPTTAGVPGMVSSTLTWEKIYNTNLGIDLGMFN NRLNMTFEYFIRRTKDMVGPAAEVGAILGTALPNTNNAELKNKGWELQANWRDNIGKVNY NIGFNLSDNRAKVISYPNASKALWDSNGNTLYYNGMTIGEIWGYETEGIAQTDAQMTEWL ASNDQSKIGSVWGAGDIMYRDLNGDGIVDKGNSTATDHGDLKKIGNSTPRLRFGLSLGAD WKGFDIQMFFQGVMKRDLWLSGPMFWGADGGEWQSVGFDEHLDYFRPENTTSIFGANLNS YYPKAYLGDKGNKNKQTQTRYLQNGAYMRMKNLQIGYTFPKAWMNKAKIEKLRIYVSGEN LFTISGIADMFDPEATAGNGFSNGKTYPLSKTISFGLNITL >gi|226332045|gb|ACIB01000011.1| GENE 58 84027 - 86009 1116 660 aa, chain + ## HITS:1 COG:no KEGG:BF0806 NR:ns ## KEGG: BF0806 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 660 1 660 660 1312 100.0 0 MKKNLLYIFSLASVLCSCNDFLDKEPLDAVPTDKYLLAESDLAAYSANLYDQLPSHTPGQ YSMGVFATDNNSDNQAASNPNGSFVKGETRVAQSGGAWDFGKIRNVNYFINKVRPRLEAG ELSGVEANNMHYLGEMYFFRAYIYFTKLVALGDFPILKHWISEDYETVREASKRRPRNEV ARFIIQDLDSAYYYMKATPPMSNRLTKDCAALMKSRVALFEGTWEKYHKGTARVPGGPGW PGANKDYLKDFTINIDSEIKYFLTEAKTAAQIVADKYTLFNDYPSLFNSQSLANASEVLL WRAYDASLTPAVNHFVVGYIQRNGGGNTGWTRSMMQSYLMENGLPIYANNSGYQGDKTYE AVATNRDPRLIYNTLLPGDLLSEGGSNIEYLVKGYGYYYRAPIVLGQDENKCPTGYSVKK GLATDAAQGPTLPSTTACVIFRAAEAYLNYMEADYELNNSLDANSSKYWKALRNRAGMDT DFQKTIDATDLSKEIDFARYSGSEFVSTTLYNIRRERRIEFAAEGLRLNDLKRWRALDMM QGYHVEGFDLWSENYQRYKTPSPIPVADVTLSVINLIESGNNNANVSAKSESQYLRPYRI NTNNIAYNGYNWNQNKYLNPIAFDHFRLTTAEEGSTDYTTSTIYQNPGWKIETSSLPEGD >gi|226332045|gb|ACIB01000011.1| GENE 59 86128 - 87579 1112 483 aa, chain + ## HITS:1 COG:YPO0829 KEGG:ns NR:ns ## COG: YPO0829 COG3119 # Protein_GI_number: 16121138 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Yersinia 
pestis # 14 474 27 501 517 400 43.0 1e-111 MKNIIPQALLTMPILSTGLQAQEKQPTPNLVFIMADQYRGDAIGCIGKEPVKTPHLDKLA SEGINFTNAISSYPVSSPARGMLMTGMYPIGSKVTGNCNSETAPYGVELSQNARCWSDVL KDQGYNMGYIGKWHLDAPYKPYVDTYNNRGKVAWNEWCPPERRHGFDHWIAYGTYDYHLK PMYWNTTAPRDSFYYVNQWGPEYEASKAIEYINGQKDQKQPFALVVSMNPPHTGYELVPD RYKEIYKDLDVEALCKGRPDIPAKGTEMGDYFRNNIRNYYACITGVDENVGRIIEALKQN NLFDNTIVVFTSDHGICMGAHENAGKDIFYEESMRIPMILSWPDQIKPRKSDPLMIAFAD LYPTLLSMMGFSKEIPETVQTFDLSNEVLTGKNKKDLVQPYYFVKFDNHATGYRGLRTDR YTYAVHATDGKIDNVILFDRTNDPHEMNNIASQQLKLTHTFNRQLKTWLEKTNDPFAQYI KLK >gi|226332045|gb|ACIB01000011.1| GENE 60 87582 - 88964 1168 460 aa, chain + ## HITS:1 COG:XF0106 KEGG:ns NR:ns ## COG: XF0106 COG3669 # Protein_GI_number: 15836711 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Xylella fastidiosa 9a5c # 4 459 14 453 460 187 29.0 3e-47 MNRINTILLLLFCSVYCLAQQATIPVPKPFQLKWHQAEMGAVFHYDLHVFDGVRYGQGNN RINPIEDYNIFNPTELNTDQWVLAAKAAGCKFAVLTATHETGFGLWQSDVNPYCLKAVKW RDGKGDIVRDFVNSCRKYGLQPGIYIGIRWNSLLGIHNFKAEGEGEFAHNRQAWYKRLCE KMVTELCTRYGDLYMIWFDGGADDPRGDGPDVEPIVNKYQPNCLFYHNIDRADFRWGGSE TGTVGYPCWSTFPAPCSHHKRIESNVDQIELLKHGDKDGKYWVPAMADTPLRGANGRHEW FWEPDDENNIYPLNELMDKYEKSVGRNATLILGLTPDPNGLIPTGDEQRLKEFGTEINRR FSSPLAQISGQKKSLTLKLDKKQPVNYCIIQENIQNGERIRQYKVEAKVNGKWQTVCSGE SVGHKRIEKFDPVEATALRLTVLQSTALPDIINFSAFSVN >gi|226332045|gb|ACIB01000011.1| GENE 61 88974 - 91049 1257 691 aa, chain + ## HITS:1 COG:no KEGG:BF0803 NR:ns ## KEGG: BF0803 # Name: not_defined # Def: alpha-galactosidase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 691 1 691 691 1427 99.0 0 MKRYFIISLLTLASTVAPLTAVFAQSSFIYEKGKSFKDVNASPMPQTIRLDRTAEPVIYE NAVPENATTICYRIQLPSYVRGTFFSRDSRPGDYEWPNNTNRLLPWMFNHLTDLTRDDYP GIPSNARPSTLGDALLLQLTDGSYLFTKAVAGDNSLSWFQVNTDGSLNLYVSTLGTDRLE HKVPVALVQSAGNIYQVFRQAYETLISDRNVSALQKRTEKNYFEALNYLGWCTWEHYHFD IDETKILNDLDAIETSGVPVRYVLIDDGHLANKNRQLTSFTPDPQRFPNGWAPIMAHKNK DKIRWIGLWYALSGYWMGISPDNDFPTHVKNSLYSFNGSLLPGKSTPNIDTFYQYYVHSL 
KTHGFDFLKVDNQAFTLPLYMGSTEVVRQAKECNLALEKQTHAQQVGLMNCMAQNVLNTD HTLHSGVARVSIDYKKYNENMAKSHLFQSYTNTLLQGQTVWPDHDMFHSSDTICGSLMAR SKAISGGPVYLSDSPKEFVKENIFPLIDKEGKIFRPEAPAIPTPESVLTNPLQDGKAYRV FAPTGDEAVSVICYNLNTSPKHQKVTAEIDPKDYLLRETLTGKPTPQQKRVILFDWNNQT ATELTGKQTVELDGFTDRLFHLCPIHDGWAVIGIQEKYLSPAAVRILSSTPDKLVLNVLS PGTLKIWTENSGKQELRNIQVKETGKMTIRK >gi|226332045|gb|ACIB01000011.1| GENE 62 91070 - 92626 1579 518 aa, chain + ## HITS:1 COG:SPBPB10D8.02c KEGG:ns NR:ns ## COG: SPBPB10D8.02c COG3119 # Protein_GI_number: 19111838 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Schizosaccharomyces pombe # 33 486 11 519 554 162 28.0 2e-39 MNKKIIIPLALAPLAAPALQAQHQQPNGRTDTRPNIILFMVDDMGWQDTSLPFWTQKTHY NEVYETPNMERLAKQGMMFTQAYASSISSPTRCSLITGTNAARHRVTNWTYPKGQQTDRP SDVFNVADWNVNGVCQVPNIDHTFQATSLAEILKDNGYHTIHCGKAHFGAVNTPGESPYH MGFEVNIAGHAGGGLASYLGENNYGNRTDGKPNPWFAVPGLEKYWGTDTFVSEALTLEAI KALDHAKEYNQPFFLYMAHYAIHVPIDKDKRFYQKYINKGLTPKEAAYAALIEGMDKSLG DLMDWLDKNGEADNTIVIFMSDNGGLSSEPEWRDGKLHTQNSPLNSGKGSAYEGGVREPM IVRWPGVVKPDTKCDKYLIIEDFYPTILEMAQIKHYKTVQPIDGISFMPLLTHTGDPSKG RSLHWNFPNHWGNDGPGIGPTCTVRKGDWKLIYYYDSGKKELFNIPEDIGEKNDLAALHP DIVKSLSKELGDYLRKVGGQRPSFKATGKPCPWPDEIK >gi|226332045|gb|ACIB01000011.1| GENE 63 92706 - 93353 574 215 aa, chain - ## HITS:1 COG:BB0676 KEGG:ns NR:ns ## COG: BB0676 COG0546 # Protein_GI_number: 15595021 # Func_class: R General function prediction only # Function: Predicted phosphatases # Organism: Borrelia burgdorferi # 3 215 4 219 220 134 34.0 2e-31 MKKLIIFDLDGTLLNTIADLAHSTNHALQTLGYPTHEVASYNFMVGNGINKLFERALPEG EKTEENVLRVRKEFLLHYDRHNADESRPYPGIPELLETLQHKGYKLAVASNKYQAATEKL IAHYFPGIRFVAVFGQREGVKVKPDPAVVHDILQIAGVSKDEVLYVGDSGVDMQTAINSG VTSCGVTWGFRPRTELESFCPDYIVDKAETILSIV >gi|226332045|gb|ACIB01000011.1| GENE 64 93559 - 93855 279 98 aa, chain + ## HITS:1 COG:no KEGG:BF0876 NR:ns ## KEGG: BF0876 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 98 1 98 98 162 100.0 4e-39 
MGSKKTDLMRISYFVAIVILVCLIGNLKDLWLQSWTDLIIYLIVLFAAAECLFSTFARIR AVGEQKQPRWLTSISLLIYGIFFLGTLFFIGDFLINKL >gi|226332045|gb|ACIB01000011.1| GENE 65 93852 - 95195 1175 447 aa, chain + ## HITS:1 COG:TM0336 KEGG:ns NR:ns ## COG: TM0336 COG1073 # Protein_GI_number: 15643104 # Func_class: R General function prediction only # Function: Hydrolases of the alpha/beta superfamily # Organism: Thermotoga maritima # 30 440 21 410 412 219 33.0 8e-57 MIKKNLLKGICLLWLLLAVTPVLQAQDRAQQAFELLDRLIAGQGDSVYVHLDDNIRKMLS VEMLNGLFKQLEQQAGKYQSHGEWKTEPINGMTIYYCDVKFERLPLRFLTAFNPDGKVNT IRFVPVPAEKTTPPTTSVQDKIKETDIQVCTGNFKLPGTLTLPKNGKDLPVVILVHGSGA SDRDETVGANKPFRDLAYGLAERGIAVIRYDKRTKVYGADSAPAGKEITFDEESVDDALS AIKLARSIPTINPERIYILGHSLGGTLAPRIAQRSDKVPAGIILLAGAARPLEDLFISQV KFLASALPSTKDIEKEIAELQKQVDNVKRLGTDTFDITTPLPMNLSQAYWMLANQYKPLE VVRKLTLPILVLQGERDYQVTMQDFELWKSALAKHPNAIFKSYPRLNHLFQEGEGKSTPL EYSRPSSIPSYVTDDIAAFINRSKPGN >gi|226332045|gb|ACIB01000011.1| GENE 66 95687 - 97633 1836 648 aa, chain + ## HITS:1 COG:HI1439 KEGG:ns NR:ns ## COG: HI1439 COG1154 # Protein_GI_number: 16273346 # Func_class: H Coenzyme transport and metabolism; I Lipid transport and metabolism # Function: Deoxyxylulose-5-phosphate synthase # Organism: Haemophilus influenzae # 3 629 4 615 625 536 43.0 1e-152 MKNEPTYSLLNAINYPKDLRQLSVDQLPEVCEELRQDIIKELSCNPGHFAASLGVVELTV ALHYVYNTPYDRIVWDVGHQAYGHKILTGRREAFSTNRKLGGIRPFPSPEESEYDTFTCG HASNSISAALGMAVAAERKGEKDRHVVAVIGDGSMSGGLAFEGLNNASSTANNLLIILND NDMAIDRSVGGMKQYLFNLTTSNRYNQLRFKTSRLLFKMGLLNEERRKALIRLGNSLKSL AAQQQNIFEGMNIRYFGPIDGHDVKNIARILHDIKDMQGPKILHLHTIKGKGFGPAEKQA TIWHAPGKFDPVTGKRIVANTDGMPPLFQDVFGHTLVELAEKNKRIMGVTPAMPSGCSMN MLMDRMPDRAFDVGIAEGHAVTFSGGMAKDGLLPFCNIYSSFMQRAYDNIIHDVAIQKLN VVFCLDRAGLVGEDGPTHHGVFDMAYLRPIPNLTISSPMDEHELRRLMYTAQLPDKGPFA IRYPRGRGSLVDWECPLEEILVGKGRKLKDGNDLAVITIGPIGKLAARAIERAEADTGIS VAHYDLRFLKPLDEELLHEVGKKFRHIVTIEDGIIKGGMGCAILEFMADNGYYPEIRRIG VPDQFIEHGSVQQLYHLCGMDEEGIYKVITKNELRMDAPVESCMATHS >gi|226332045|gb|ACIB01000011.1| GENE 67 97672 - 99012 1314 446 aa, chain + ## HITS:1 
COG:PA0016 KEGG:ns NR:ns ## COG: PA0016 COG0569 # Protein_GI_number: 15595214 # Func_class: P Inorganic ion transport and metabolism # Function: K+ transport systems, NAD-binding component # Organism: Pseudomonas aeruginosa # 1 445 1 450 457 221 32.0 2e-57 MKIIIAGAGAVGTHLAKLLSREKQDIILMDDDEEKLSTLSSNFDLMTVTASPSSISGLKE VGIKEADLFIAVTPDESRNMTACMLATNLGAEKTVARIDNYEYLLPKNKEFFQKLGVDSL IYPEMLAAKEIVSSMRMSWVRQWWEFCGGSLILIGTKMREKAEILNVTLAELGAPDIPYH VVAIKRGTETIIPRGDDTIKLHDIVYFTTTRKYIPYIRKIAGKEEYADVRNVMIMGGSRI AVRTAQYVPDYMQVKIVDNDINRCNRLTELLDDKTMIINGDGRDMDLLIEEGLKNTEAFV ALTGNSETNILACLAAKRMGVSKTVAEVENIDYIGMAESLDIGTVINKKMIAASHIYQMM LDADVSNVKCLTFANADVAEFTVPENAKITKNKVKDLGLPKGTTIGGLIRNGEGILVTGD TLIQAGDHVVVFCLSMMIKKIEKYFN >gi|226332045|gb|ACIB01000011.1| GENE 68 99017 - 100468 1168 483 aa, chain + ## HITS:1 COG:MA1483 KEGG:ns NR:ns ## COG: MA1483 COG0168 # Protein_GI_number: 20090342 # Func_class: P Inorganic ion transport and metabolism # Function: Trk-type K+ transport systems, membrane components # Organism: Methanosarcina acetivorans str.C2A # 2 482 1 476 476 305 39.0 1e-82 MINSKMIYRITGFLLLIETGLLLCCAGVSLIYREDDLSSFLLSAGLTTLVAILLLALGKG AEKQLNRRDGYVIVSVAWVVFSLFGMLPFYLSHYIPSITNAFFETMSGFSSTGATILDDI EALPHGLLFWRSMTQWIGGLGIVFFTIAVLPIFGVSGVQLFAAEASGPTYDKVHPRIGVT AKWIWTIYAGLTAIEVILLLFGGMGLFDSICHSFATTGTGGYSTKQDSIAYYNSPYIEYV IGVFMFLSGINFTLLLLLFTGKLKKVSQNAELKWYVMSVILFTAFIAAVLYRTTPMGAEE SFRKAFFQVASLHTSTGFVTADYMQWVPVLWGTLTVIMLIGACAGSTTGGMKCIRMVILA KVSRNEFKHIVHPNAVLPVRVNKQVISPAILSTVLAFSFIYAVIIIVSVLLMLAMGVGFT ESIGTVISSIGNMGPGLGSCGPAYSWDGLPDLAKWLLSFLMLLGRLELFTVLLLFSSDFW KRN >gi|226332045|gb|ACIB01000011.1| GENE 69 100688 - 101032 174 114 aa, chain + ## HITS:1 COG:no KEGG:BF0870 NR:ns ## KEGG: BF0870 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 114 12 125 125 217 99.0 1e-55 MLAVLLHSVTMKAANTSYIIEDPDQEECFISQATPASRNILERFHFYCTIMPCEMGHADI SHVPTDKSFIRPEMIFHKYRMRNNPFSVHSNHSHTYNPSDPLTYYVYGLRKIII >gi|226332045|gb|ACIB01000011.1| GENE 70 101176 - 101526 218 
116 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|154175415|ref|YP_001407462.1| NADH dehydrogenase subunit A [Campylobacter curvus 525.92] # 3 116 14 126 129 88 36 2e-16 MNFTLLVVVLLTAIAFVGVVIALSNAISPRSYNAQKFEAYECGIPTRGKSWMQFRVGYYL FAILFLMFDVETVFLFPWAVIARDLGPQGLISILFFLVVLVLGLAYAWKKGALEWK >gi|226332045|gb|ACIB01000011.1| GENE 71 101517 - 102107 428 196 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|154175216|ref|YP_001407461.1| NADH dehydrogenase subunit B [Campylobacter curvus 525.92] # 33 190 12 169 170 169 50 8e-41 MEIMKKPKIKSIPYEDFIDNESLEKMVKELNEGGANVFVGVLDDLINWGRSNSLWPLTFA TSCCGIEFMALGAARYDMARFGFEVARASPRQADMIMVCGTITNKMAPVLKRLYDQMADP KYVIAVGGCAVSGGPFRKSYHVVNGVDKILPVDVYIPGCPPRPEAFYYGMMQLQRKVKIE KFFGGVNRKEKKPEGK >gi|226332045|gb|ACIB01000011.1| GENE 72 102129 - 103721 1568 530 aa, chain + ## HITS:1 COG:SMa1529 KEGG:ns NR:ns ## COG: SMa1529 COG0649 # Protein_GI_number: 16263284 # Func_class: C Energy production and conversion # Function: NADH:ubiquinone oxidoreductase 49 kD subunit 7 # Organism: Sinorhizobium meliloti # 163 530 11 404 404 303 38.0 7e-82 MEEIKYIEPAALHDEMLRLRNEKQMDFLESLTGMDWGVADEGDAPNVTRGLGVVYHLEST VTGERIAIKTSTNNRETPEIPSVSDIWKAADFNEREVFDYYGIVFIGHPDMRRLYLRNDW VGHPMRKDNNPEKDNPLRMDNEETYDTTREIELNPDGTYQTQENVIFDDREYVVNIGPQH PATHGVMRFRVSLEGETIKKLDANCGYIHRGIEKMNESLTYPQTLALTDRLDYLGAHQNR HALCMCIEKAMGIEVSERVKYIRTIMDELQRIDSHLLFYSCLAMDLGALTAFFYGFRDRE MILDMFEETCGGRLIMNYNTIGGVQADLHPNFIPRVKKFIPYLRGIIHEYHDVFTGNVIA RQRLKGVGVLSREDAISFGCTGGTGRASGWACDVRKRMPYGVYDKVDFKEIVYTEGDSFA RYMVRMDEIMESLNIIEQLIDNIPEGPIQEKMKPIIRVPEGSYYTAVEGSRGEFGVFLES HGDKTPYRLHYRSTGLPLVSAVDTICRGAKIADLIAIGGTLDYVVPDIDR >gi|226332045|gb|ACIB01000011.1| GENE 73 103801 - 104877 1167 358 aa, chain + ## HITS:1 COG:MT3240 KEGG:ns NR:ns ## COG: MT3240 COG1005 # Protein_GI_number: 15842728 # Func_class: C Energy production and conversion # Function: NADH:ubiquinone oxidoreductase subunit 1 (chain H) # Organism: Mycobacterium tuberculosis CDC1551 # 7 351 1 337 410 239 41.0 7e-63 
MFDFSIITSWIHQTLTSVMPEGLAVFIECVVIGVCIVALYAILAILLIYMERKVCGFFQC RLGPNRVGKWGSIQVLCDVLKMLTKEIIELKHSDKFLYNLAPFMVIIASFLTFSCLPISK GLEVLDFNVGVFFLLAASSIGVVGILLAGWGSNNKFSLIGAMRSGAQIISYELSVGLSIL TMVVLMGTMQFSEIVESQANGWFIFKGHIPALIAFVIYLIAGNAECNRGPFDLPEAESEL TAGYHTEYSGMHFGFFYLAEYLNMFIVAAVAATIFLGGWMPLHIVGLDGFNAVMDYIPGF IWFFGKAFFVVFLLMWIKWTFPRLRIDQILNLEWKYLVPISMVNLVIMVLIVVFGLHF >gi|226332045|gb|ACIB01000011.1| GENE 74 104894 - 105373 429 159 aa, chain + ## HITS:1 COG:SMa1519 KEGG:ns NR:ns ## COG: SMa1519 COG1143 # Protein_GI_number: 16263279 # Func_class: C Energy production and conversion # Function: Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I) # Organism: Sinorhizobium meliloti # 19 144 20 140 188 90 38.0 1e-18 MKNEEYTYLGGLMQGIGSLLTGMKTTIKVYFRKKVTEQYPENRAELKMFDRFRGTLNMPH NENNEHRCVACGLCQMACPNDTIKVTSETIETEEGKKKKILAKYEYDLGSCIFCQLCVNA CPHDAITFDQVFEHAVFDRTKLVLQLNREGSKVIEKKKE >gi|226332045|gb|ACIB01000011.1| GENE 75 105380 - 105892 662 170 aa, chain + ## HITS:1 COG:jhp1190 KEGG:ns NR:ns ## COG: jhp1190 COG0839 # Protein_GI_number: 15612255 # Func_class: C Energy production and conversion # Function: NADH:ubiquinone oxidoreductase subunit 6 (chain J) # Organism: Helicobacter pylori J99 # 5 164 2 162 182 63 30.0 1e-10 MGLTLETVVFYFLAVFIIAMSILTVTTQRIVRSATYLLFVLFGTAGIYFLLGYTFLGSVQ IMVYAGGIVVLYVFSILLTSGEGDRAAHLKRSKFLAGLVTTIIGAILVLFITLTHKFVPT SDPEPVEISIKTIGHALLSSGKYGYVLPFEAVSILLLACIVGGLLIARKR >gi|226332045|gb|ACIB01000011.1| GENE 76 105899 - 106210 419 103 aa, chain + ## HITS:1 COG:VNG0643G KEGG:ns NR:ns ## COG: VNG0643G COG0713 # Protein_GI_number: 15789840 # Func_class: C Energy production and conversion # Function: NADH:ubiquinone oxidoreductase subunit 11 or 4L (chain K) # Organism: Halobacterium sp. 
NRC-1 # 2 103 1 100 100 76 44.0 1e-14 MMIHMEYYLVVSTIMMFAGIYGFFTRRNTLAILISVELMLNATDINFAVFNRFLFPGELE GYFFALFSIAISAAETAIAIAIMINIYRNIRSIQVKNLDELKW >gi|226332045|gb|ACIB01000011.1| GENE 77 106247 - 108157 1882 636 aa, chain + ## HITS:1 COG:slr0844 KEGG:ns NR:ns ## COG: slr0844 COG1009 # Protein_GI_number: 16331732 # Func_class: C Energy production and conversion; P Inorganic ion transport and metabolism # Function: NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit # Organism: Synechocystis # 6 636 10 680 681 385 39.0 1e-106 MEYTILILLLPFLSFLALGIGGKWMSHRTAGTIGTLVLAAVTVLSYVTAVHYFSAPRLAD GTFATLIPYNFEWLPFTETLTFNLGILLDPISVMMLIVISTVSLMVHIYSFGYMKGERGF QRYYAFLSLFTMSMLGLVVATNIFQMYLFWELVGVSSYLLIGFYYTRPAAIAASKKAFIV TRFADLGFLIGILIYGYYGGTFGFTPDTVSMLSGGAGMLPLALGLMFVGGAGKSAMFPLH IWLPDAMEGPTPVSALIHAATMVVAGVYLVARMFPLFIEYAPDVLHLIGWVGAFTAFYAA SVACVQSDIKRVLAFSTISQIGFMIVALGVCTSSDPHHGGLGYMAGMFHLFTHAMFKALL FLGAGSIIHAVHSNEMSAMGGLRKYMPITHITFLIACLAIAGIPPFSGFFSKDEILAACF QYSPTMGWVMTVIAAMTAFYMFRLYYGIFWGGTAPGQKSTSDGTSHVHTPHESPLTMTVP LIFLAAVTCVAGFIPFGHFISSNGESYTIHLETSVAVTSVVIAVASIVLATCMYLRQQQP LADKLAKRFAGLHRAAYHRFYIDEVYQFITHRIIFRCISTPIAWFDRHVVDGFFNFIAWG THATSDEIRGLQSGRVQQYAYVFLLGALILILILIL >gi|226332045|gb|ACIB01000011.1| GENE 78 108170 - 109654 1312 494 aa, chain + ## HITS:1 COG:slr1291 KEGG:ns NR:ns ## COG: slr1291 COG1008 # Protein_GI_number: 16329430 # Func_class: C Energy production and conversion # Function: NADH:ubiquinone oxidoreductase subunit 4 (chain M) # Organism: Synechocystis # 69 439 71 443 559 249 37.0 1e-65 MNFLSLFVLIPLLMLGGLYLAKSIKAIRGVMVAGSTALLILSVVLTFLYLGERQAGATAE MLFRADTVWYAPLHIAYSVGVDGISVAMLLLSAVIVFTGTFASWKLQPLTKEYFLWFTLL SMGVFGFFISIDLFTMFMFYEIALIPMYLLIGVWGSGRKEYAAMKLTLMLMGGSAFLLIG ILGIFFGAGGTTMNILEIAQLHNIPFAQQCIWFPLTFLGFGVLGALFPFHTWSPDGHASA PTAVSMLHAGVLMKLGGYGCFRIAMYLMPEAANELGWIFLILTGISVVYGAFSACVQTDL KYINAYSSVSHCGLVLFAILMMNQTAATGAVLQMLSHGLMTALFFALIGMIYGRTHTRDV RELNGLMKVMPFLSVCYVIAGLANLGLPGLSGFVAEMTIFVGSFQNFDVFHRTLTIIACS 
SIVITAVYILRLVGKILYGTCTNKHHLALTDATWDERFAVICLIICVAGLGMAPFWVSHM IGESVLPVVSHLIP >gi|226332045|gb|ACIB01000011.1| GENE 79 109661 - 111112 1494 483 aa, chain + ## HITS:1 COG:SMc01927 KEGG:ns NR:ns ## COG: SMc01927 COG1007 # Protein_GI_number: 15965032 # Func_class: C Energy production and conversion # Function: NADH:ubiquinone oxidoreductase subunit 2 (chain N) # Organism: Sinorhizobium meliloti # 12 444 11 440 480 249 36.0 1e-65 MDYSQFLYMKEELSLIAVILILFVVDLFTCPDQKGATPKVNVRSLTLPAVILMTLHTVIN LFPGTPAEAFGGMYQYTPMQTIIKAVLNVGTIIVLLMAHEWLKREDTRIKQGEFYVLTLS TLLGMYFMISAGHFLMFFIGLEMASIPMAALVAFDKYRHHSAEAGAKYILTALFSSALLL FGLSMIYGTSGTLYFNDLPGHITGNMLQIMAFVFFFAGMGFKISLVPFHLWTADVYEGAP TAVTSYLSVISKGSAAFVLMTILMKVFAPMVAQWQEVLFWVTIASITIANLFAIRQQNLK RFMAFSAISQAGYIMLGVIGGSEMGMTALVYYVLVYLAANLGVFAVISIVEQRSNKVEID DYNGLYKTNPKLAFIMTLALFSLAGIPPFAGFFSKFFIFMAAFNSGFHLLVFIALINTVV SLYYYLLIVKAMYINPNEEPIPTFRSDNYTKVSLALCTLGIIALGIASCIYQGIDKFSFG MGM >gi|226332045|gb|ACIB01000011.1| GENE 80 111209 - 113998 2599 929 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 670 917 51 308 328 176 39.0 2e-43 MDNDIERFRQMASMAQLGWWEADFTAGHYVCSEYLCDLLGLEGNTISFTDFRKRVREDYQ EQIVREFNASIHREFYEQTFPIHSKEGIVWLHTRLGEREEILGRGVVSFGIIQRVEAPND TSERVLERVNDLLYRQNSISHSLLRFLKDDSVDLCIMEILKDILDLFHGGRVYIFEYDEY YRYQDCTYEVVAEGVLPEIDSLQRIPTDSLPWWRQQTLSGKPVILDSLDQLPKHAKAEYA ILSRQNIKSLMITPLIAGEHVWGYMGIDLVKNYRNWNNEDFQWLSSLANIISICIELRKT KDEAVRERSFLRNLFRFMPMGYIRMTMVRDAAGLPCDYRIADANDLSSELIGMSLSDYVG CLASELHADFKAKVDYLLDVMEGSVHKETDVYFHRTQRSSHCIVYSPEKDEVVALFLDST ETIRAHRALDRSEKLFKNIFANIPAGVEIYDKDGNLLDLNNWDMETFGVKDKADVMGVNF FENPNVPLEIRERVRNEDLVDFRLNYSFNKASDYYHSDKSNIIELYTKVSKLFDSQGNFN GYVLINIDNTERIDAINRIRDFENFFLLISDYAKVGYAKLNLLSKRGYAIKQWFKNMGET EDIPLSSVVGVYDKMHPEDRQKVFDFYEKVLAGEEKDFRSEMRILKPGATNEWNWVRMNV VVTKFESEHGEVEIIGINYDITELKETEAMLIEAKEKAETMDRLKSAFLANMSHEIRTPL 
NAIVGFSGLLVDTEDMEERCEYIKIVQENNDLLLQLISDILDLSKIEAGTFEFTYGETDV NMLCEDIVRSSQIKVPQGVELVFDPHPSDCTVISDRNRLHQVISNFVNNALKFTSSGSIH VGYEKKEEGVEFYVSDTGIGISKEQLTHIFERFVKLNSFIHGTGLGLSICKSIVEQLGGV IGVDSEEGKGSRFWFTIPYINSEQSIVND >gi|226332045|gb|ACIB01000011.1| GENE 81 114236 - 116935 2697 899 aa, chain + ## HITS:1 COG:CC0171 KEGG:ns NR:ns ## COG: CC0171 COG1629 # Protein_GI_number: 16124426 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Caulobacter vibrioides # 137 899 82 888 888 163 22.0 2e-39 MKQQIGRLLSTLLLATFSLGITAGVIQGTIIDKQTKEPLTGATVQIAGTTTGTVADVEGN YTLTLSNGTYTIEVKYIGYKTLRMNEVKVKANATLNFELEVDAQTLDAVTVVARKNLEGE KALLQERQKATLAIENMGAKEMTLKGISNVQDGVKKITGISIASAGQLIVRGLGDRYSTT TLNGLPIASPNPDNKLIPLDLFPASTVKNITVSKVYAAGAFADYSGAHIDISTKENTGSD FFSIGFNVGGRFNTVGKDFYYSDRKGGLFSTGNLRNKDRILAMGKSEFRDYARNNDPFGT NFAISKHRSLPEFGGNLGGGKSWTLPNGNRLSVLASVGVSNENQILKDAYVTTMTAQGTH LDKFNYDSYSSALKIAGLGNIGYSFRQADHINFTVFYARNAINDYMSREGIDAEKNNITS SNSVFHAYSLLNNQLLGHHELTSQWDVNWSASYGLTNSDEPDRRQVVFFRNEGSDKLNLF KLNQTTNRYFGELQEKEIVGDLRTSYKWGDANLIRVGGTYKSKKRDFESVNFYYDINALN ADVTNIYDTNGYLNQENIANGTIKANIDAQPRYNYYAGMDVWAGFAEIEYYPMESLLVNV GLRYEQAKQWVRYWTDGGQEKKTNLDKGDFFPALNLKYSLNETNSLRLSVSRTVTRPSFI EMAPFLYQESYGSAYIRGNNELKNAYNYNIDLRYDFFPKRNNGDMFSVTGYFKKLKSPIE QTQESSGGTVIRSFRNAEDGIATGVEIEFRKELFKNFRIGANGSYMYTNVVLPEGGVYTD SERALQGASPFLINADLSYTPQLRRESDLTLALVYNVQGPRIETVGIYGTGNIKQQTLHT MDFIASYAINKHLSLRLQMKDLLNSTIRFKQELPATGQKVEVESFRPGTSAEIGVSYRF >gi|226332045|gb|ACIB01000011.1| GENE 82 117004 - 118278 1491 424 aa, chain + ## HITS:1 COG:no KEGG:BF0856 NR:ns ## KEGG: BF0856 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 424 1 424 424 766 99.0 0 MNMKTNYLKLNSWAVAALMGMCSLAACSDDNSGEGGGNGDSEEVIANNGTLKGSVDGSKT VILTKGYNFSLDGEYIVKAGSTLKIGEGVTISAKSDDATIDYILVEQGAKIEAVGTASAP IVMTADTKEPGAWGGIHICGKAPINIGSTGKSEVGDAAYGGSDPADNSGILKYIRLEYAG YKFTTEKECNGFTFYGVGNGTTLEYLEAYKGTDDGFEWFGGTVNAKYLVSVSNSDDSFDW 
TEGWSGKGQFFVAYQEDPATLGYTCDCLIEADNYDKNMDAAPISCPTLANLTLIGANNDE GKRGIRLRAGTQAKIYNALVTGKANNLTTETEQTEKFLIDGPSVLNYIAIAGDIKASGDG GYSSALFTAEGNHNAINQTLSFSNIFIGTQDGGADLSADSFFEKAAYKGAVKADNEWTKG WTKL >gi|226332045|gb|ACIB01000011.1| GENE 83 118351 - 119499 768 382 aa, chain + ## HITS:1 COG:no KEGG:BF0855 NR:ns ## KEGG: BF0855 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 382 1 382 382 755 100.0 0 MDRYRLKIIALTALVCLTGSSCTDDENNGQGNNIIYGENIIGNGEQTFEIKDHQYLKRGT YLMKGWCYVTYGSTLTIEAGTVIKGDKETRAALIVEPGGKLIARGTVDAPIVFTSEMPAG KRKPGDWGGLILCGYARNNEDIMQIEGGPRTMHGGPNNADNSGVLSYVRVEFAGYPFKKN QEINGITFGSVGNGTQIDHLQVSYANDDAFEWFGGTVHAEYLVAYHCWDDDFDIDNGYSG TCRHLLGIRHPRIADITGSHAFECSNNGTNTPATPTTAATFEDVTIYGPASGDASFVNHP DFINGGGLRPENESMLGLFGAALYMGNNTSVTFRNCRISGYPSDMEGTPASADNVVFSER EETGYPEWTQGWCNFNPQETEY >gi|226332045|gb|ACIB01000011.1| GENE 84 120104 - 121234 1047 376 aa, chain - ## HITS:1 COG:PAB0825 KEGG:ns NR:ns ## COG: PAB0825 COG0251 # Protein_GI_number: 14521450 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation initiation inhibitor, yjgF family # Organism: Pyrococcus abyssi # 270 364 28 124 127 65 37.0 1e-10 MSKNMEYRKHRIEYLRTTVEYSLFGGEGGTREAHLMFHVDPEAGSYEEQLTAIRKAYHRI LSRKVKIRGMVPVFCRYFLSDAANQWEALQAVLQKEPSCAVSVVQQPPLDGSKIALWVYL TSEPNAAYKHYWTAGAGVSCGKSERQMKTLLKSYETDLVGKGCTLASDCIRTWIFVQNVD VNYAGIVKARRENFLGQGLTESTHYIASTGIEGRHADPKIHVLFDAYAVKGLQPGQVTYL HALSHLSPTALYGVTFERGTSVEYGDRRHLFISGTASIDHRGEVVHVGDVREQTRRMWEN VEKLLEEGKAGFEDVAQMIVYLRDASDYPVVRALFAKRFPDTPIQFVVAAVCRPAWLIEM ECIAIVANSNSSYESF >gi|226332045|gb|ACIB01000011.1| GENE 85 121238 - 122671 949 477 aa, chain - ## HITS:1 COG:no KEGG:BF0851 NR:ns ## KEGG: BF0851 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 477 29 505 505 987 99.0 0 MIGLLFLFAACQEKVTSPARVDTLPSIFPDYVGVTIPSTIAPLNFRVTDDGVEAVDVVIA GTKGKPVRLNGRSVDIPAKQWHELLESNKGDSIEVKVSVRQGKKWKEYRPFPIYVSPFPI 
DYGLVYRLLAPGYEVYSKMGIYERELSTFRQTPLFENTQVTAACINCHAFNRTEPTPSSV HVRGGHGATVIDTGDRLEFLDTKADGQLSACVYPYWHPSGEYIAYSVNKTNQAFHLGGKK PIEVFDQASDVVVYHPRSHRILTTPLLSTASFETFPAFSPDGRTLYFCSAGQKEMPVRYK DVKYSLCSIAFHPEDGTFGDRIDTLISARTLDKSISFPKPSFDGKYLMFTLSDYGNFSIW HKEADLWLLDLKTGTYRNLEEVNSDDTESYHNWSSNSHWFVFSSRRGDGLYTRLYISSVD GQGRIGKPFLLPQQDPYTFYDQLIYSYNVPEFVSAPVQWDKREMAKGLMSKERVKVK >gi|226332045|gb|ACIB01000011.1| GENE 86 122700 - 124463 1071 587 aa, chain - ## HITS:1 COG:no KEGG:BF0850 NR:ns ## KEGG: BF0850 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 587 1 587 587 1167 99.0 0 MKNKHVKRINLLLSILLGAGVFIFFGVYYSYHLHYQEQFQMFLFTSDYFVEQVSHPGGMA DYLGGFLTQFYYYSWAGAAILTGAIGGIHRLMVWIANRLGGHPAWYPLTLLPSLCFFILF CDENFLLSGAISVGMVLGALIGYTFIESRRIRLIYWGVGIPLLYLLAGGCAWLFIPLIWI TEFCRFAGRRLPWWILVGGTLGIAGVTYWISLAVFPYPADRLLWAIGSYRFPLVFPQMQV IAWLAVILVPLLVACLPEKMTWRYYSGAWVLQFILMLFVLNLYGKYGIGLNKEEVMGYDY HVRMQEWDEVIAMAEKKAPDTPMSVSCLNLALAMKGQLPERMFSFYQRGKEGLLMSFVND FTIPLVAGEPYYYLGLVNVAQQFVFEAMEAVPDYRKSVRCFKRLAETNLINGRYEVARKY LRILQHTLFYKDWATETLACLNDEDRVNAHPEYGRLRRLTPRTDFFFNPDLPEMTLEFLL HANPRNRMAYEYLMACTLLKKDVGRFVHYYPLGADLGYSSVPKGYQEALLFYWLMSKHTA TDTIPWKIDPQTENRLREYAQIFTSARSADALSARFGDTYWFYADFR >gi|226332045|gb|ACIB01000011.1| GENE 87 124533 - 125711 1210 392 aa, chain - ## HITS:1 COG:all3695 KEGG:ns NR:ns ## COG: all3695 COG2942 # Protein_GI_number: 17231187 # Func_class: G Carbohydrate transport and metabolism # Function: N-acyl-D-glucosamine 2-epimerase # Organism: Nostoc sp. 
PCC 7120 # 3 384 6 375 388 79 26.0 1e-14 MDEILKQEMQKELTTRILPYWMERMVDQENGGFYGRITGQEELMPRADKGAILNARILWT YSAAYRLLGREEYKEMANRAKRYLIDHFYDSEFGGVYWSLNYRGEPLDTKKQIYAIGFAI YGLSEFHRATGDPEALMYAVRLFNDIESHSFDGLKNGYCEALTREWNEIADMRLSEKDAN ERKTMNTHLHILEPYTNLYRVWKDARLERQLYNLIGLFTEKILDKDTSHLQLFFDNDWQS KYPVVSYGHDIEASWLLHEAARVLGDAGLIAEIEPVVKKIAAAASEGLTSDGGMIYEKNL TTGHIDGDYHWWVQAETVVGYYNLFRYFGDRGALQHSIDCWEFIKRHLTDDVHGEWFWSL RADGSLNRDDDKAGFWKCPYHNGRMCIELLGE >gi|226332045|gb|ACIB01000011.1| GENE 88 125724 - 127112 614 462 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020673|ref|YP_526500.1| ribosomal protein L9 [Saccharophagus degradans 2-40] # 5 457 9 521 522 241 29 2e-62 MTAMITLKEKIGYGLGDMASSMFWKLFGSYLMIFYTDVFGLPAAVVGTMFLITRVWDSAF DPIVGVIADRTQTRWGKFRPYLLYLAVPFALIGIFTFTTPELNDTGKLVYAYITYSLMMM VYSAINVPYASLLGVMSPDPKERNTLSTYRMTFAYIGSFIALLLFMPMVNLFGGAEDEQR GWMLSVVVIAVMCAALFYLCFALTRERVKPIREVQNSLKDDLKDLLHNRPWWILLGAGVA ALVFNSIRDGATVYYFKYFVVEEDYSTVSFFGVSFVLSGLYLAVGQAANIVGVILAAPVS NRIGKKNTYMGAMSLATLLSVIFYWFGKGDITLIFVFQVLISICAGSIFPLLWSMYADCA DYSELKTGNRATGLIFSSSSMSQKFGWAIGSALTGWLLAYFGFRANEVQSVEAIHGIKMF LSWLPAVGTVLSVVFISMYPLSEKKMREVTSELEKRRKAIQS >gi|226332045|gb|ACIB01000011.1| GENE 89 127129 - 128301 1091 390 aa, chain - ## HITS:1 COG:TM1225 KEGG:ns NR:ns ## COG: TM1225 COG2152 # Protein_GI_number: 15643981 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted glycosylase # Organism: Thermotoga maritima # 44 360 19 320 326 117 29.0 3e-26 MSLFNDKVAKLLAGHEALLMRKNEPVEEGNGVITRYRYPVLTAAHTPVFWRYDLNEETNP FLMERIGMNATLNAGAIKWDGKYLMLVRVEGADRKSFFAVAESPNGIDNFRFWEYPVTLP EDVVPATNVYDMRLTAHEDGWIYGIFCAERHDDNAPIGDLSSATATAGIARTKDLKNWER LPDLKTKSQQRNVVLHPEFVDGKYALYTRPQDGFIDTGSGGGIGWALIDDITHAEVGEEK IIDKRYYHTIKEVKNGEGPHPIKTPQGWLHLAHGVRNCAAGLRYVLYMYMTSLDDPTRLI ASPAGYFMAPVGEERIGDVSNVLFSNGWIADDDGKVFIYYASSDTRMHVATSTIERLVDY CLHTPQDGFSSSASVEILKNLIERNLRLMK >gi|226332045|gb|ACIB01000011.1| GENE 90 128338 - 129462 1005 374 aa, chain - ## HITS:1 COG:BS_ydhT KEGG:ns NR:ns ## COG: BS_ydhT COG4124 # 
Protein_GI_number: 16077655 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-mannanase # Organism: Bacillus subtilis # 80 333 79 326 362 86 27.0 8e-17 MIKRIKILATGALLLAGLGACSPSGKKTGADSTVDTLRTAETVNLLNNLRKVPTQGIMFG HHDDPLYGVGWEGDEDRSDVKSVCGDYPAVMSFDLGHIELEREKSLDNVPFRKIRQETIN QYKRGGVVSFSWHLDNPLTGKDAWDVSDTTVVASILPGGVHHAKFISWLDAVAAFMNTLE TEEGTKIPVIFRPWHEHTGSWFWWGQNLCTADQYKALWRMTHDRMHARGVKNLLYAYSPG SEPKDSTAYLERYPGDDIIDLVGFDTYQFDRTQYMEQLDKSLAILTEVGKAHDKPIAITE TGFEAIPDSVWWTQTLYPVISKYPISYVLVWRNARERVNHYYAPYPGQVSADDFVKFYRE PKTLFVSDVKNLYK >gi|226332045|gb|ACIB01000011.1| GENE 91 129554 - 130558 791 334 aa, chain - ## HITS:1 COG:MA0146 KEGG:ns NR:ns ## COG: MA0146 COG0407 # Protein_GI_number: 20089044 # Func_class: H Coenzyme transport and metabolism # Function: Uroporphyrinogen-III decarboxylase # Organism: Methanosarcina acetivorans str.C2A # 66 330 74 337 339 110 29.0 4e-24 MELSLYIKDKLVSGKRVAIPIMTHPGIELLEKRVLDAVTNGEIHYHAIRALNECFPQSAA CTTIMDLTVEAEAFGARLSMSPNEVPSVCGRLLTGYADVEALQIPSVESGRMPQYLLADR LAAEGIDKPVLAGCIGPYSLAGRLYDMTEIMMAIYTEPDTVLLLLEKCTEFILRYCLAIK ETGVAGVIMAEPAAGLLSNEDCQRYSSVYVKRIIDAVQDDSFAVILHNCGNTGHCTAAML ATGAKGYHFGNKADMITALRECPSDVWVMGNLDPVGVFRVLTPEDVFARTEELLTCTGEY ANFIISTGCDTPPEVPFDNIQAFYLAVEKYNKGR >gi|226332045|gb|ACIB01000011.1| GENE 92 130565 - 131269 319 234 aa, chain - ## HITS:1 COG:no KEGG:BF0769 NR:ns ## KEGG: BF0769 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 234 1 234 234 481 99.0 1e-135 MPLVNSHIVEECLPFTSLRLDPEDINLSMGAGYVPDAEIQAISDALETEIAGICTPRFLY ALFDAEPAGTCEVGVNGISLKTGSVITPYLKDAAGYVLFVATAGYEFEAFQHKIGSQGDI LREFLLDAYGSAIAEAVVREVCRKVESRMFPLGYGVSHPYSPGYCGWHVTQQQLLFSCLP EFPCGVRLSDSSLMSPIKSVSGIIAYGPCIVKRKYGCELCGKADCYKNRNKLNR >gi|226332045|gb|ACIB01000011.1| GENE 93 131272 - 131910 756 212 aa, chain - ## HITS:1 COG:mlr1231 KEGG:ns NR:ns ## COG: mlr1231 COG5012 # Protein_GI_number: 13471298 # Func_class: R General function prediction only # Function: Predicted cobalamin binding 
protein # Organism: Mesorhizobium loti # 4 209 23 228 238 172 43.0 3e-43 MNNLNELYEAILAGKLEQAVSVTREAVAGGAAPQEIINEYMIKAMEAIGARFESGQVFVP NLLMSARAMRGALDILKPLMQGQVNSYIGRIVIGTVKGDLHDIGKNLVASMFEGCGFEVI NLGVDVSSDKFISAALENKADIICMSALLTTTMNYMKEVIDALETSGLRGKVKVMVGGAP VSDAFAKSIGADAYTSNANAAVIMAKKLINAC >gi|226332045|gb|ACIB01000011.1| GENE 94 132103 - 133356 1112 417 aa, chain - ## HITS:1 COG:no KEGG:BF0842 NR:ns ## KEGG: BF0842 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 417 1 417 417 892 100.0 0 MNRSRDKVRCALNHQNAGSIPVDFGSTAVTGIHCRIVEALRNYYGLAPRPVKIVDAFQML GEIDAELAEKIGVDCIGIGGPKDIFDLDTTRMHEQTTPWGQRVLVPEAMDLTPDMRGDVY VYAGGDQNYPPSAVMPKGCYFINAIERQQPIEEDRLDPEDNVEEFGLLTENDLAYYCAEA DKAYQTGRAVVASFGGTALGDVAFVPGMGLKQPKGIRSVVEWYMSTAMRQDYLHQVFEKE IDIAIANYEKLWAALGDKIDVVLTCGTDFGSQESQFCSIDTFRELWLPHYRRMNDWIHQH TTWKIFKHSCGAIIPILPGLIEAGFDIINPVQINAKDMDSRRLKEEFGSQLTFWGGGVDT QKILPFGTPDEIRRHVMGQCEILGRDGGFVFNAVHNVQANVPVDNVVAMFDALKDIS >gi|226332045|gb|ACIB01000011.1| GENE 95 133362 - 135806 2001 814 aa, chain - ## HITS:1 COG:SSO3032 KEGG:ns NR:ns ## COG: SSO3032 COG1472 # Protein_GI_number: 15899739 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Sulfolobus solfataricus # 60 806 4 735 754 594 42.0 1e-169 MKMKLICFLMLSVFFIFPVRAKNTFGKKKDKVTRLHFYDLNKNGRMDTYENPSAPVEYRV EHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWT QRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQAS TWNPELIRQMGRVIAIEASAQGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMGTAL VRGFQGETLNDGKSVIATLKHFASYGWTEGGHNGGTAHIGERELEEAIFPPFREAVGAGA LSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGGLREHGVAGNDYEAAI KAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILSLKFQMGLFDDPFVDEKQ AVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKKDIRTLAVIGPNADNVYNMLGDYTA PQADGTVVTVLDGIRQKVSKETRVLYAKGCAVRDSSRTGFKDAIETARNADTVVMVMGGS SARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGRQLELLEEISRLGKPVVLVL IKGRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFGDYNPAGRLTLSVPRSVGQLPVY 
YNTRRKGNRSRYVEEPGTPRYPFGYGLSYTTFSYTDMKVQVTEGSDDCWVDVTVTIQNQG TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDKKSLALYMQEGEW VVEPGRFTIMVGGSSEDITCRQAFEINRKYTFKM >gi|226332045|gb|ACIB01000011.1| GENE 96 135803 - 137110 897 435 aa, chain - ## HITS:1 COG:CC0801 KEGG:ns NR:ns ## COG: CC0801 COG3934 # Protein_GI_number: 16125054 # Func_class: G Carbohydrate transport and metabolism # Function: Endo-beta-mannanase # Organism: Caulobacter vibrioides # 27 422 27 432 442 342 44.0 7e-94 MIMKILSTILLTLLIVLGACTSPQVSPDPFVRVSNGRLTVNGKPYYYIGTNFWYGAILGS QGQGGNRERLLRELDYLKALGINNLRVLVGADGKDGIPTKAEPALQVEAGVYNDTIFDGL DFFLSELDKRDMYAVLFLNNSWEWSGGYSQYLYWAGHGEVPMPNVAGWDAFSNYVAQYAK SEKAHHLFRDHITHVVNRVNRYTGKKYSEDPAIMSWQIGNEPRPFGEDNKKSFAAWIADC AALIKSMDSNHLVSIGSEGMAGCEGDLSLWTSIHADANVDYTTIHIWPNNWGWIDKKDIP GTIGQAIENTCSYIDMHVQEAFKINKPLVLEEFGLPRDSVKFTSNTSTVQRDRYYRAVFD IVEKHAAEKGVFQGCNFWAWGGFAEPQHLFWQRGDDYMGDPGQEEQGLNSVYATDSTINM IKEAVSDINQIIQKQ >gi|226332045|gb|ACIB01000011.1| GENE 97 137228 - 138253 743 341 aa, chain - ## HITS:1 COG:BS_ydhT KEGG:ns NR:ns ## COG: BS_ydhT COG4124 # Protein_GI_number: 16077655 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-mannanase # Organism: Bacillus subtilis # 48 305 75 325 362 73 26.0 5e-13 MLVDTRATEHTAALFYNLRQLTGKRVVYGQHNYEMDGFDSDSTRWRDEANRCDAYDVTGA YPALASFDFLHFTNPRSWETKELNYIQEKFHVAYNRGNVITFCWHYYNPVTGGNFYDTTQ VVRHILPGGSYHATFKADLKIIADFAHNAKGDDGELIPIIFRPWHEFDGNWFWWGKNHCS VEEFKKLYRFTVTYLRDSLEVHNFLYAFSPDCGFTTEAEYLERYPGDKYVDVVGMDNYWD FRPDGGDTSLVVLKARILTQYAQKHGKLSAITETGTQTRDSLWYTQLLSILRSEGVALNY VCTWSGFSPYKGHPAAADFCRFKRDTLVLFADEIPNFYTWH >gi|226332045|gb|ACIB01000011.1| GENE 98 138343 - 141558 1834 1071 aa, chain - ## HITS:1 COG:no KEGG:BF0763 NR:ns ## KEGG: BF0763 # Name: not_defined # Def: putative secreted glucosidase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1071 1 1071 1071 2255 99.0 0 MKRMAYHLLAILILGMAASAVKAQILLTASATPIEEYAAVELQRYYYQLSGRLLSIDHEE VPDRKTEFVLTRLDHPLVKSWRDKGVLPLKSMPGEQGYVIRTVKEKGRELVVIAGVDANG 
LLYGVYGLLEDHLGMRFYMNGDVYPDKKEVQTRIPLIQDERTPTVAIRGFLPWTNFPQSA TIYSWDDWRYIIDQAARMRMNFIMIHNYNGFCGHNELFHNFEYKGHLSRGWMPTIKTGHG WGCPGWNINEYLFGASEVYDDYDFGADYGLHNETLTNGQIKEKGATIFRKVIAYAHLRGV KIGLGLDIDVVLPEYQSEPDNKDLIKVQVAEIAREYPELDYLLCFQSEGQKNEAFYARWR RVFDGFYEEMKRKSPFTRIAVSGWGLTAESVNSLPEDVICAPISYYSAAFEPGSVYGNRE YWGCPWLERDFNSSEYYYPYNVDLSETIRAFGDASANMNGFYALTWRLADAISPKMWYIS KAPWYNHEVLDSSEKVYRDFALANYGENAVDAITDIIDQNEPFATDFGECQETPGFNQMV HTYPLMNLYSMTFGGKNGKDVEIKATGYAEKKGTKNAPCDEGGECVGYIMADDWLQYPAV DFSNSPERMSIRIASASSGGVATVYLDRLGGPVIARFEVKNTEGWQSWKSLTVPVKGLKG VHTLYVRFQPFNVIAKAGKLADKQLKTIDSCMAVTSDVLQQLRLSRLRARIHGAACHIAL NTDFENYQWNDLPGKMDEWARSFLYRIEDISSYGNIMSTQNRFVKQNYVEKINQLRKQQR VQAPSHIIAKGTLEGAQISWRNEEPAVSSFVVCRNGEEIDTLASDVNCYQDKFHGAASYT VYAVDIEGHKSPLGIPADCLAGSADREAPVIVINSPLASIMEGTPLHIRFSVVENRLPEF VSGIFHYRRTGEKVWKKIPFKHRTRGVFTLTLPASEITCQGIEYYISVSDSDNVFCYPGS APARNHTVVVTEVPGDDKPEVPMIKPICGKRMFWSRVPNVEMYRIYRSRTPDFKIGADTF VTFVAGNTQSFADNGFDFDGTSLKGTYYYCVTSVSFWDHESEASEIIQIDY >gi|226332045|gb|ACIB01000011.1| GENE 99 141555 - 143342 1266 595 aa, chain - ## HITS:1 COG:no KEGG:BF0762 NR:ns ## KEGG: BF0762 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 595 3 597 597 1230 99.0 0 MRKIQYLFIALFICLEIQAQDKFNIRGVLPWHNFLSGPTSWNLSDYRIYLDECRKNGINF IGFHNYTGGGERYATYVEPMIKIEYKNILPQACFDNSMTARWGYLPMAVKDFAFDTGKIF QLPVGAEAFGNNGSITSHSSREHYEKAQSLMRDVLKMAHERGIRMAMGFEFGVIPPEYFS LNVAGDCFYWAGESNMIPNPKSQIAAEIHYAAIDDILNTYPDIDYIWMWLNEHSFMGVDV QKALRDKPFARAYQENQALFKEAADSSARFVGVWALEYMKLTYKHLKSKGSRAKLILGGW GGGHQLPSLLKGLDRALPQDIIFSCLNPDLGKSPQPDFLEEIARNRSVWAVPWLEGDHQL WHFQPRVNMMREQVKLAAEQNLDGVIAIHWRTEEPRFNFRTFARFASDKGADESVDQLYD RYLTEEFGEEVAKEMTPLLARMDREQIQWNVPSPEFYAYTPEWGLLDENNVRIRQELVSS GESLLKKLRGEKRENLKRFIAMFRFELLLGEVDRAMMPAFILKKKEVQGEKINDSQEYMD AYRLLVSAPVKEMFDTYMERVHSRGELGVLSSLNQRVWREYNDLKIYLENKIKEK >gi|226332045|gb|ACIB01000011.1| GENE 100 143470 - 144576 947 368 aa, chain - ## HITS:1 COG:no KEGG:BF0761 NR:ns ## KEGG: BF0761 # Name: not_defined # Def: putative 
lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 368 1 368 368 751 100.0 0 MKTFRYILFVWVMLGCGLFASCEDDEVEYAPLAVTRVSTVLDREQGIDQANLAQYIIVQG TGLNAVNSILVNDVQVDLKDAYITSGEITFPIPRVIPGEINNLITLGSGNSTVTAPISVF IPELEVNGMFNEFTPAGDTMKVVGDYFDLYEITTESGQLFFGGKEVKITKSTGNSLSFVL PEDAVMGSKIKLVSPVCGEVTVPGKYMEKGNMLCDFDPFTGWGGSKYVIDGPVPAPYSGY FSRFKINKGDANDWDWNEVTTIAQCAVEYSPEVIADQNKYLLKFEVNTIKPLTKRQIRFY FSQINYDWEPFASGLALNTNGEWKTVSIDLGEMWKGDIPNDGVLQIMGNSWAEDTDICFD NFRIVPKD >gi|226332045|gb|ACIB01000011.1| GENE 101 144612 - 146279 1591 555 aa, chain - ## HITS:1 COG:no KEGG:BF0834 NR:ns ## KEGG: BF0834 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 555 1 555 555 1143 99.0 0 MKKIIFKSFLLLWTVLLACSCNDLLDEKAYDFVSPEQLGDSESAASQLVTGAYNTVITSF IAPGSYLYLTNMDCDYASGASWAFGNVGAGNPQGFWGIDHMWQGSYTLIHRANLGISKIS AMSNLSQESKQDALAQLCFLKAWAYFNLVRNYGPVPIFRKSISEGEAMSQPRASVSDVYA HIIELLEQAEGMYSKDDAGFVVGHASNGAAKALLAKVYVTMASGAMSGVPIVVKGGDPNI FEPQSITHIAKTVAGYESFDPAKYYALARDKAWEVINEYTLFDNYMDVWAIGNRNKGEHI WMAQAISGDKDFGNTICQDYVGIFKEDGTMEGNWYGMRDHWYLLFEEQDKRIVDGVIHRY ASDGISNGKVIYNYYPRWYADKVENKEVYDSQGNAFDGTEVYHEGQGWTLAKLTKFTFVT DRKQKNSDFHFPLLRLPDIMLIYAEAVNELNGGPDAEAYNQVNRIRTRAHATPFSGMNQD EFRSAVLEERARELAYEADRRYDLFRWGIYLDVMNAIDMDEHNVTKRRLERNLLYPIPTS EVNSNDKIDSNNPGW >gi|226332045|gb|ACIB01000011.1| GENE 102 146305 - 149442 2552 1045 aa, chain - ## HITS:1 COG:no KEGG:BF0759 NR:ns ## KEGG: BF0759 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1045 1 1045 1045 1979 99.0 0 MKNMMLFLMALLMSGHMMAQQTIVTGVITDANDGSPLIGANVLVKGAGTGSIANVDGKYS VNVPNGKNVLVFSCVGYKEQEITLKPGQKVLNVTMKEDTELLDEVVVIGYGSMKKSDLTG SVTSIKSEDLMKTNPISINQGLQGRIAGVQVNQNDGAPGAGVSIQIRGANSFSTSTEPLY IVDGIPFTSSGMPGTGKDGMMQTANPLSTINPSDIESIEILKDASATAIYGSRGANGVVL ITTKRGAKGKDNISFSANFGISKVVKKLDMLDGYAYAMYRNEAAQMFNEYENANEAIPYP GTSKVDPSTGESVYSPGPEDYRNGTYPSVNWQDEVFETAFSQEYNLSVNGSNDKGYYAIS GNILDQSGIIHNSGYKRYSFRANLARKVHEWIEIGTNMSFTNSLNKLAKTNSVSDGIIRG 
ALFYPATAPLDDETNNAQLNWFSSNPYVYTRAAKDELTTNSFFSSSFVEITPYKDLKVCQ NVGFSYNINERDVYYNRETVEGKDPTNGYASKADNWSKNLVLETMATYNKTFNRNHSLNV VAAFSYERGDYGNKAMVATGFPQDLTEDFDMSAAVNPQKPTSGRGMTSLVSFLGRANYNL MNKYLFTASFRRDGSSKFAPGNKWSNFASGAIAWRASEEQFIKDLNVFSNLKFRASYGQT GNQAIGAYATRDYLTVANYPINGALASGFANLTWRGPANPDLKWETTSQYNVGVDMGFFQ NRINLTIDLYYKKTSDLLQNIQIPQSTGFSNMTTNFGNVTNKGLEITGKFYAITGKNLNW DFDANISFNRNKISGLPGDQFAQGWSKADNVFLQRNGMPIGTIYGFVEDGFYDNIAEVRA DPFYAKESEAVCKAMVGEVKYKDFDGVAGITNADRQVIGDTNPDFTFGMTHNFTYKNFSL SFFLQGCVGGDIFNANLLEVTMSGIGNIPQNIYESRWTPENRENAKWPKAYAGYGRTMKL SDRYVEDGSYLRMKNINLGYKFISPFKGIESINLFASVSNVFTISGYSWYDPDVNSFGSD ASRRGVDLFSYPSSRTFSFGLQCTF >gi|226332045|gb|ACIB01000011.1| GENE 103 149822 - 151990 1549 722 aa, chain - ## HITS:1 COG:SSO3032 KEGG:ns NR:ns ## COG: SSO3032 COG1472 # Protein_GI_number: 15899739 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Sulfolobus solfataricus # 71 704 68 722 754 374 36.0 1e-103 MKDMQTKIVMIGILLSTGIMLRAQDNGGCEDCPAFNAGGKVEVRGVKERIIGDLSQPVAV RVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEVTVFPQAINLAS TWDTVLVKRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGRNEETYGEDPHLTSR LGVAFVKGLQGDHPTYLKTVATIKHFVANNEENNRFSSSSQIPTKQLYEYYFPAYEACVK EANAQSVMTAYNAFNGVPPSGSHWLLDDVLRKEWGFDGFVVSDCGAIGVMNWQHRVVNSL EEAAALGVNSGCDLECGTTYKEKLVQAVEQGLISEVAIDRALTRVLTARFKLGEFDPMEL VPYNHYDKKLLAGKKFAELAYEAAVKSVVLLKNDALLPLNKEKIKSVAVVGPFADYNYLG GYSGQPPYSVSLLKGVKELIGKKGKVTYLNGMGTSADSIAQVVKGADIVLVALGSDEKMA RENHDMPSIYLPEEQEKLLKKIYQVNPRIVLVFHTGNPLTSEWADTHIPAIMQAWYPGQE AGRALANLLFGNENPSGKLPMTIYKTEEQLPDILDFDMWKGRTYRYMKGEPLYGFGHGLS YTSFEFDNIQGNDTLQPDAILQCSVELSNSGQLAGEEVVQVYVSRENTPVYTYPLKKLVA FKKVKLASGEKKKVDFTIAPRELSVWEDGKWRMLSGKYTLFIGSGQPGLAKGITKGFEVK IR >gi|226332045|gb|ACIB01000011.1| GENE 104 152170 - 153171 692 333 aa, chain + ## HITS:1 COG:STM2406 KEGG:ns NR:ns ## COG: STM2406 COG0667 # Protein_GI_number: 16765732 # Func_class: C Energy production and conversion # Function: Predicted oxidoreductases (related to aryl-alcohol 
dehydrogenases) # Organism: Salmonella typhimurium LT2 # 5 333 2 329 332 393 59.0 1e-109 MINPIYSADKARYSGGMKYRRCGKSGILLPEVSLGLWHNFGDVDTLANSLRMAHFAFDKG ITHFDLANNYGPSYGSAEETFGLIMKKSFMPYRDELFISTKAGHDMWEGPYGNWGSRKYL MSSLNQSLKRMNLEYVDIFYTHRYDPETPLEETLQALVDIVRQGKALYIGISKYPKEKAE FAYRYLEERDVHCLLYQGRYNLFNREPEEQGILQQAKENGTGFIAFSPLAQGLLTNRYLN GIPEDSRIARGGFLKKEQLTEQVFNKIKALNEVAGNRGQTLAEMALAWVLKDDLVTSVIV GASSVKQLEDNLKVTENCKFSTDEIQRIGNILQ >gi|226332045|gb|ACIB01000011.1| GENE 105 153242 - 154789 1183 515 aa, chain - ## HITS:1 COG:no KEGG:BF0756 NR:ns ## KEGG: BF0756 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 515 1 515 515 1033 99.0 0 MAKLYPIGIQNFEKIRREGYLYIDKTALVCRLVKTGSYYFLSRPRRFGKSLLISTLEAYF QGKKDLFRGLAMEELEKDWIKYSILHLDLNTEKYDTPESLDRILNDTLAKWEMVYGTAPS ETSIPLRFKGIVQRACEQSGQRVVILIDEYDKPMLQAIGNEELGEKYRDTLKGFYSVLKT MDGYIRFALLTGVTKFGKVSVFSDLNNLNDISMDEPYVELCGITEKEIHHYLEPEIRQLA KYQKMSYEDACRELKERYDGYHFTENSIGLYNPFSILNTFYKMKFGSYWFETGTPSYLVK LLLRDKYDLQQLAHDEATSDMLNCIDSTSKNPLPVIYQSGYLTIKGYDERFDIYRLGFPN REVEEGFIRYLLPFYANIDKGQTGFHITRFVSEVEQGDCDAFFRRLQSFFADTPYELVRD LELHYQNVLFIVYKLLGFYVQAEYHTSNGRIDMVLKTDRYIYVMEFKFDGSAEEALKQIE EKGYAAPFANDPRQLLKAGVNFSSKTRNIDCWVVD >gi|226332045|gb|ACIB01000011.1| GENE 106 154937 - 155029 121 30 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKNSWNILLKVIIAVASAIAGVIGGQGLL >gi|226332045|gb|ACIB01000011.1| GENE 107 155604 - 156191 617 195 aa, chain - ## HITS:1 COG:no KEGG:BF3910 NR:ns ## KEGG: BF3910 # Name: not_defined # Def: putative phage-related protein # Organism: B.fragilis # Pathway: not_defined # 1 194 1 194 194 343 87.0 2e-93 MENLTENDFQRVADLLGIEVAVVKAVQAVETGGHGGFVAPGRPMILFEGHIFWRELKKRG LDPDRYVAGNENILYPKWEKGHYYGGMKEYERLEKAREIHKEAADASTSWGMFQVMGFNY AMCGYGSVEEMVKDMCVGEDKQLEAFARFVKLAKLQSYLEQKDWVGFARRYNGPGYAQNQ YDKKLEEAYRKFTKE >gi|226332045|gb|ACIB01000011.1| GENE 108 156213 - 157343 872 376 aa, chain - ## HITS:1 COG:Cgl0347 KEGG:ns NR:ns ## COG: Cgl0347 COG0399 # Protein_GI_number: 19551597 # 
Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Corynebacterium glutamicum # 2 373 3 375 385 313 48.0 3e-85 MNKRIWLSLAHMGGREQDFIKEAFDTNWVVPLGPNVDAFEQSLVEYLHEDRRVVALSAGT AALHLGLILLDVESGDEVICQSFTFAASANPISYLEAKPVFVDSEKDTWNMDPVLLEEAI KDRLRKTGKLPKAIIPVHLYGMPAKMDEIMDIAGRYGIPVLEDAAEALGSELNGRKCGTF GELAALSFNGNKMITTSGGGALICRTEEEAKQTKFYATQARDAAPHYQHTHIGYNYRMSN ICAGIGRGQMFVLDEHIARRRAIHSLYVDLLKDVAGITVMENPDSRFASNFWLTCILVDP KLAGKSREDIRLRLDSENIETRPLWKPMHLQPVFTDAPFYGNGTSERLFDIGLCLPSGPT LTDEDIRRVVDTIRAI >gi|226332045|gb|ACIB01000011.1| GENE 109 157357 - 157941 530 194 aa, chain - ## HITS:1 COG:BS_yvfD KEGG:ns NR:ns ## COG: BS_yvfD COG0110 # Protein_GI_number: 16080477 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Bacillus subtilis # 3 194 6 210 216 143 42.0 2e-34 MFLYGASGHAKVIIDILRAGHESIEALFDDNVEVTSLLGHPVLRPSEVRGPLIVSIGNNR IRKRIVDTLSVEFGCAIHPLSIVSEFADIGEGSVVMQGSIIQVCAQVGRHCIINTGASVD HECVIEDYVHISPHSTLCGNVLVGEGTWIGAGTTVIPGVKIGKWSVVGAGSVVTKDIPDH VLAVGNKCKIIKSI >gi|226332045|gb|ACIB01000011.1| GENE 110 157954 - 158562 371 202 aa, chain - ## HITS:1 COG:BS_yvfC KEGG:ns NR:ns ## COG: BS_yvfC COG2148 # Protein_GI_number: 16080478 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sugar transferases involved in lipopolysaccharide synthesis # Organism: Bacillus subtilis # 5 201 2 197 202 250 61.0 1e-66 MYQYVIKRLIDFVVVFFVLIIIWPVLLLVTLWLHFANKGAGAFFLQERPGRHGKIFKVIK FKTMTDERDAEGNLLPDDKRLTKVGKFVRSTSIDELPQLINILKGDMSFIGPRPLLPQYL PLYNKEQARRHEVRPGITGWAQVNGRNAISWVRKFELDVWYVDHCSFFLDLKIFFLTIKK VFVREGISSDTSVTMEPFTGNN >gi|226332045|gb|ACIB01000011.1| GENE 111 158565 - 159773 533 402 aa, chain - ## HITS:1 COG:TM0631 KEGG:ns NR:ns ## COG: TM0631 COG0438 # Protein_GI_number: 15643396 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: 
Thermotoga maritima # 19 401 32 426 434 119 25.0 1e-26 MNILFLTLNRVSDLSERGIYTDLMREFICHGHRVYMVVPAERRFHESTSIKESCGAQILR VKTLNIQKSNVVEKGIGTLLLEMQYQCAIKRYWKDIRFDLILYSTPPITFNRVISSQKKR CKAKSYLLLKDIFPQNAVDLGMFSKRSLIYRLFRKKEKVLYQISDFIGCMSPANVDYVLT HNPEIKADRVEICPNSIKLLEKPLMASTARKNILQKLHIPINKTLFIYGGNLGRPQGLIF LLDVIAANEERNDSYFIIVGSGTEYGKIKSWFEANHPDNSMLLSSLPKKEYDDLVKACDV GLIFLDKRFTIPNYPSRLLSYLENRMPVLLATDLNTDIGRIAERNGYGFWTENGNLDTFM EMVDSLSADREKIKVMGEKGYEYLKSNYTVERGYRMIMKHFE >gi|226332045|gb|ACIB01000011.1| GENE 112 159784 - 160968 673 394 aa, chain - ## HITS:1 COG:SP0360 KEGG:ns NR:ns ## COG: SP0360 COG0381 # Protein_GI_number: 15900289 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine 2-epimerase # Organism: Streptococcus pneumoniae TIGR4 # 1 394 1 394 394 668 79.0 0 MEIKLDYSDIKFRYNGKLRLLIIVGTRPEIIRLAAVINKCRRYFDCILAHTGQNYDYNLN GVFFHDLGLQAPDVYMDAVGDDLGSTMGNILNASYKLMSHLRPDAVLVLGDTNSCLSVIS AKRLHIPIFHMEAGNRCFDECLPEETNRRIVDIISDMNLCYSEHARRYLNASGVAKERTY VTGSPMAEVLSENLSAIESSDIHARLGLRKGQYILLSAHREENIDTDKNFASLFEGINAM AEKYDMPVLYSCHPRSRNRLESSGFKLDSRVIRHAPLGFHDYNCLQMHAYAVVSDSGTLP EESSFFTSVGHSFPAVCIRTSTERPEALDKGCFILAGIDKASLLQAVDTAVEMNRNGDNG VPVPDYMDRNVSTKVVKLIQSYTGIVNKIVWRKS >gi|226332045|gb|ACIB01000011.1| GENE 113 160976 - 162127 778 383 aa, chain - ## HITS:1 COG:SP0359_1 KEGG:ns NR:ns ## COG: SP0359_1 COG0451 # Protein_GI_number: 15900288 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Streptococcus pneumoniae TIGR4 # 3 262 5 281 281 309 53.0 7e-84 MKVLVTGAKGFVGRNLVSQLRNIQSGKAKNYALSGNELIIFEYDVDSAPSELDDYCRQAD FIFNLAGVNRPLDQSEFMKGNFGFASTLLASLKRHGNTCPIMISSSTQAALDNPYGASKR AGEQLLFEYSRETGSKVLVYRFPNVFGKWCRPNYNSAIATFCYNIAHDLPIQVNDPNVEM NLVYIDDVVDELISALTGNEHREGAYCKVSAVYTVTLGAIVELLYSFRENRNNLGVPHVG DAFTKKLYSTYLSYLPKDGFGYPLKMNVDARGSFTEIIRSTDRGQFSVNISKPHITKGNH WHHTKNEKFVVVSGQGVIRFRNVYDSSSEILEYFVSGDKLEIIDIPTGYTHNIENLGDTD MITFMWCNECFDPGRPDTYFEEV 
>gi|226332045|gb|ACIB01000011.1| GENE 114 162172 - 163191 767 339 aa, chain - ## HITS:1 COG:RC0457 KEGG:ns NR:ns ## COG: RC0457 COG1086 # Protein_GI_number: 15892380 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Rickettsia conorii # 1 335 1 338 341 462 65.0 1e-130 MFKDKILLITGGTGSFGNAVLRRFLDSDIREIRIFSRDEKKQDDMRHHLQNPKVKFYIGD VRDKRSVDGVMNGVDYIFHAAALKQVPSCEFFPTQAVRTNVLGTENVLDSAIAHGVKNVV VLSTDKAAYPINAMGISKAMMEKVAIAKGRQLGNCGGTTICCTRYGNVMASRGSVIPLWV EQIKKCNPITITDPNMTRFMMTLDDAVDLVIYAFQHGKNGDLFVQKAPAATLNVLADALK SLYHSNADVKVIGTRHGEKLYETLVTREEMSKAEDMGDYYRIPCDTRDLNYDKFFVEGSE EVSKIEDYHSHNTRRLDVEGMKELLLKLDFIREDLGLEK >gi|226332045|gb|ACIB01000011.1| GENE 115 163223 - 164077 473 284 aa, chain - ## HITS:1 COG:RP414 KEGG:ns NR:ns ## COG: RP414 COG0438 # Protein_GI_number: 15604279 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Rickettsia prowazekii # 72 271 131 326 338 68 30.0 1e-11 MHITTSGSLGTVRDYCLAKICIKHGIKTIMHCRYGCISQDLKAKFIWGYFFRKTLMLYNQ IWVLDSYSEKSLKNIEILKDRVFLTPNSIDVPDTFCLSTKKINRVAFVGNLEPTKGLFEL IQAVLKIDYDIKLFIVGPGASNVIDKIKILAGVDLGHKIILMGGMANEDVINFMKTIDIL VLPTYYKAEAFPISILEAMSLGKIVISTERAAIKDMLTGKNGNNCGLFVKEKSVDDIVDK IKYCIENHNEAELLMKNAYNKVWESYRKEVVYDIYRLHYSSLIQ >gi|226332045|gb|ACIB01000011.1| GENE 116 164320 - 165393 248 357 aa, chain - ## HITS:1 COG:MA2173 KEGG:ns NR:ns ## COG: MA2173 COG0438 # Protein_GI_number: 20091015 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Methanosarcina acetivorans str.C2A # 156 318 181 341 387 82 32.0 1e-15 MKNIVCFIDSLTSGGAQKQIVQLAILLSEHKYNVKLVCYWDKAFFLPVIEQHNIEYICIG GASNKYKRLIKVYKYFHAEKPDCVISYLSTPNIIACICKILGCCYNLIVSERNTSQKYDL RTRFRFFLYHKSANSVISNSRAQWNFIKRNKPHLIEKSYIITNYIDTSYFVPDVVKSGDV INAIVVGRITEQKNVLRFIEAIHRACISGVNIRVKWYGRCDSERYYKTCIEAIVNFELGD IFEFLPNTQEILRAYQEADLFILPSLFEGYPNVICEAMSCGLPVLCSDVCDNLDLIKDGI 
NGFLFNPRSIESIVNAIFKYSQLSIEEIEYIRKSNRKYAIETFNKNDFILKYSILFQ >gi|226332045|gb|ACIB01000011.1| GENE 117 165405 - 166721 852 438 aa, chain - ## HITS:1 COG:XF1606 KEGG:ns NR:ns ## COG: XF1606 COG1004 # Protein_GI_number: 15838207 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted UDP-glucose 6-dehydrogenase # Organism: Xylella fastidiosa 9a5c # 1 437 1 444 450 494 54.0 1e-139 MKIAIVGTGYVGLVTGTCFSEMGIDVTCVDVMESKIENLKKGIIPIYEPGLEDMVYRNYN AGRLRFTTSLISCLNEVEVVFCAVGTPPDEDGSADLKYVLEVASTIGRNMNKYILIVTKS TVPVGTAQKVRITIQSELDKRGVKLDFDVASNPEFLKEGDAIADFMSPDRVVVGVGSDKA RGIMERLYKPFMMNNYRLIFTDISSAEMIKYAANSMLATRISFMNEIANLCELVGADVNM VRKGIGSDSRIGHKFLYAGCGYGGSCFPKDVKALIKTAEKEGYNMRVLKSVEEVNEAQKI ILFNKLLSFYDGNIENKLIALWGLSFKPETDDVREAPALVLIDKLIAVGAKVKVFDPIAL DIVRRQYGDKIEYAQDMYDAVLDSDALLLVTEWKEFRIPSWGVVKKTMKYPLIIDGRNIY DKAEMETQGFVYTCIGRN >gi|226332045|gb|ACIB01000011.1| GENE 118 166728 - 166934 147 68 aa, chain - ## HITS:1 COG:BH3709 KEGG:ns NR:ns ## COG: BH3709 COG0451 # Protein_GI_number: 15616271 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Bacillus halodurans # 1 64 270 333 343 69 51.0 1e-12 MDFIHAIEEACGKLAKKIYLLMQSGDVYQTNADVCSLEKATGFKLNTSIKDGVKQTVDWY KSFYEFYK >gi|226332045|gb|ACIB01000011.1| GENE 119 167196 - 168050 306 284 aa, chain - ## HITS:1 COG:no KEGG:CPF_0915 NR:ns ## KEGG: CPF_0915 # Name: not_defined # Def: putative polysaccharide polymerase protein # Organism: C.perfringens_ATCC13124 # Pathway: not_defined # 13 282 78 357 358 89 29.0 2e-16 MDFSDYLLVGAIKAEIGYSFFMKLVSYVSNHEQAIFIATSLVIIGCYIKTIKNYSMMPWM SLFLFLIGGFNQSMYVLRQHLAMAILLVSLPYIIKRKFGKFLLLNGLACSIHLTAIVFFP IYFIYERTLSKKNIFLLILLGAILGCILNQLILLVSSNYQLYTAYIDSDAPGTNLKMFLF LFVIFCFCLMYMKKEYSVGGINKLLIMILIIGMIIQAIGTGNPITGRLNMYYSNFIFLLI PNMLSSISSKTTRYILSFILLLFLTCIYLVNLQSMETYKFFWEL >gi|226332045|gb|ACIB01000011.1| GENE 120 168250 - 168798 238 182 aa, chain - ## HITS:1 COG:ylaD KEGG:ns NR:ns ## COG: 
ylaD COG0110 # Protein_GI_number: 16128443 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Escherichia coli K12 # 47 136 94 183 183 93 55.0 2e-19 MNLYKVITLLKDRLFYSPIKCYRRRGMKIGTNCSITTWNLYSEAFLISIGNNVQITSGVK IFTHGAGWVLRNKYANYDAFGKVIIGNNVYIGNNAMIMPGITIGSNVVIAAGAVVTKNIP DGVVVGGNPAKIIETIENFEKKYIKYNMGCKFLSHDQKLDFLKKQPEELFIKVVGVMSFK DK >gi|226332045|gb|ACIB01000011.1| GENE 121 168795 - 169703 362 302 aa, chain - ## HITS:1 COG:no KEGG:BF3916 NR:ns ## KEGG: BF3916 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 41 300 47 308 316 125 33.0 2e-27 MIRQCVRNIIYYIYSYQLFRDFVPFDNTVFFIIDESRKHPGFADRLKAIVCVYYIAKING YKFKLIFSYPFKLHKYLEPNEVDWLSVDGTMSYSLMNTRLLSYRGYGHIPILNKKVKQYH VYNYVGKNILENNGIENWELIWGNCFKELFEPTNLMKKIIEDQEWKENAYIAVHIRFVNA LGHFESGENVTLNSAEKKKLINRCLCKLLEIKKINSLPILIFSDSNLFLNIAKENGYSIL DGEVIHISNTIGEEDVQKTLLDFFMISRAKKVYSIIGDNLYTSAFSKYAALAGVKIFKSV KL >gi|226332045|gb|ACIB01000011.1| GENE 122 169858 - 170229 98 123 aa, chain - ## HITS:1 COG:no KEGG:CA2559_13018 NR:ns ## KEGG: CA2559_13018 # Name: not_defined # Def: hypothetical protein # Organism: C.atlanticus # Pathway: not_defined # 6 123 2 119 170 103 45.0 2e-21 MLNSFVRLVNLYWRFLVSPEKYALHIGIKIGKNCFIATREWSSEPYLISIGYNCQIIKNV YIHTHGGGQAVRNICPEFDTYGKVTINDWTYVGANSHIMPGVTIGEYCLIAAGSVVTKFL SPN >gi|226332045|gb|ACIB01000011.1| GENE 123 170232 - 171242 381 336 aa, chain - ## HITS:1 COG:no KEGG:Avi_3137 NR:ns ## KEGG: Avi_3137 # Name: not_defined # Def: hypothetical protein # Organism: A.vitis # Pathway: not_defined # 93 297 80 282 330 89 29.0 1e-16 MEYKYNYVIFNSPDNKLRVDNDGYYTICTKDLENLEQTRVVSYPLDKHPYWIRLLFALHT SEKISKHIKLPFQNLWYPLYFKNNFSVQLPMCFIIISRSLPLGYLHYLKKKYPNCKIVHM HRDFLSVGKRMRPDLHFNPIFDLEMTYDEAESKEYNIPHFDEFESAIEITREREFESDVF FAGKAKDRLPLLSKAYDQLTKAGLKVFFYLTQVPKKERTELPGIVYSDTFMSYREMLYHS VNTRCMLDITQNNQQGYTSRFLEAVIYGKKLITSCNYVQKSKFYDRNKIQVLEDMNNINI DFITEGTGFVDYGYHGEFSPLNMVERVEEELNELFG >gi|226332045|gb|ACIB01000011.1| 
GENE 124 171326 - 172456 894 376 aa, chain - ## HITS:1 COG:MJ1066 KEGG:ns NR:ns ## COG: MJ1066 COG0399 # Protein_GI_number: 15669255 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Methanococcus jannaschii # 13 375 16 382 386 189 33.0 8e-48 MSIQVLKPKFEIEECLEAVRECLEKGWTGMGFKTIQFEEAWKEYTGHKHAYYLNSNTVGL HLAVKILKMQNGWADGDEIITTPITFISTNHAIMYENLHPTFADVDDYLCLDPVSVESRI NDKTRAVIFVGYGGRVGQLDKIVEICKKYNLKLILDAAHMSGTRVNGITPGTWEGIDVAV YSFQAVKNLPTGDSGMICFLNAEYDRLARQLAWLGINKDTYARSNKGTYAWKYDVDYVGY KYNGNAIMAAIALVQLKYLDRDNARRRDIVEMYDNAFANNPKIQIVGAPYHEECSYHIYE IVVPDREALLGKLAENDIYGGVHYRDNIEYSMYRYAEGTCPKARELSEHIITLPMHMWLT DEDVQKIATIVNVFVK >gi|226332045|gb|ACIB01000011.1| GENE 125 172922 - 173482 393 186 aa, chain - ## HITS:1 COG:YPO3859 KEGG:ns NR:ns ## COG: YPO3859 COG0399 # Protein_GI_number: 16123994 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Yersinia pestis # 3 182 65 241 376 147 44.0 9e-36 MHLCDIKPGDEVILPTISFVGAGNAVCANGSKMVLCDVDPRTLNARAEDIEKRITSKTKA ILLLHFGGIACGMDEIMALADAHNLKVIEDCVAGVCSSYKGKVLGTFGDIGMWSFDAMKI LVCGDGAALYFRDPELRERAEKWLYFGLEAKSGYENSVAQKWWELAISSFGHRAIMNDVT AAMVLE >gi|226332045|gb|ACIB01000011.1| GENE 126 174253 - 175680 296 475 aa, chain - ## HITS:1 COG:mll5270 KEGG:ns NR:ns ## COG: mll5270 COG2244 # Protein_GI_number: 13474395 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Mesorhizobium loti # 20 389 89 462 561 113 23.0 8e-25 MAKANGIIGSLLWKSGERIMVQGIGLLVQIILARLLMPEDFASLAIITAIVNYLGIFVQC GLSAAVVQKKDLSEIDVSTLTTLSLLVALILYVGLFLMAPVINSWYNMEELVWPIRVMGI ALFLYSFNSIQSGLLQRKMMFQTMFVRSLLATPISAVIGITMAYLGCGVWALICYVLSNI LAIVIFMNMLPEIRLRLGFSKQSAKALYSFSLKILGTNLVSAGGDTIRTMTIGKVYKPAT 
LAYYDRAYTYSGLVTQVVNTSISSVLLPVFSRSQDDKSHIKEMARRSVSMSAFVMIPVLL LVALVSKPLILIVLTDKWLPCAPFLSLFCLLRIPGIITSVDKQVYLSLWKSQIGLYYEMI LLAINLLSLVLMIPYGVFAIAIGYVVVEFAGNFMLCVISDKVYNYSLIERTKDLIKPVFS SIIMLACGYLLSLIELPLWMTLICQVLVCSVIYLFMQYILRDSSLAFIINKFRKK >gi|226332045|gb|ACIB01000011.1| GENE 127 175687 - 176274 367 195 aa, chain - ## HITS:1 COG:MA3780 KEGG:ns NR:ns ## COG: MA3780 COG1898 # Protein_GI_number: 20092576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Methanosarcina acetivorans str.C2A # 1 175 1 170 183 196 54.0 2e-50 MNVIKTTIEGVVVIEPKVFKDSRGYFVESFSQREFDEKVAIPQYGKPILFVQDNESMSSY GVMRGLHFQLPPFTQSKLVRCVRGKVLDVAVDIRKGSPTYGHHVAVELTEDNHRQLFVPR GFAHGFAVLSETAVFQYKCDNFYAPQADGGISIKDESLGIDWCIPMDKVILSEKDTQHEC LKDFTPPFDIEVSLY >gi|226332045|gb|ACIB01000011.1| GENE 128 176274 - 176630 375 118 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253563861|ref|ZP_04841318.1| ## NR: gi|253563861|ref|ZP_04841318.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 118 57 174 174 230 100.0 2e-59 MVERLNEDFPIVAYPIATVYLKDMDYANKRCELCIFTSTDTEWTPDSQSMAIRMLVDKAF TEYGMHKVYSYVFYKYPDEVELLKNAGFSAEAILKDEALNAEGKYEDIVRLSIINTEK >gi|226332045|gb|ACIB01000011.1| GENE 129 177262 - 178320 837 352 aa, chain - ## HITS:1 COG:BH3709 KEGG:ns NR:ns ## COG: BH3709 COG0451 # Protein_GI_number: 15616271 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Bacillus halodurans # 13 351 3 334 343 363 53.0 1e-100 MLEYNIDLKGKTILVTGAAGFIGSNLVKRLLTDFDNIKVIGIDSITDYYDVNLKYERLTE IDSLSKDWTFLKGSIADKALIENVFTENKIDVVVNLAAQAGVRYSITNPGSYIESNLIGF YNILEVCRHHEVAHLVYASSSSVYGSNKKIPYSTDDKVDNPVSLYAATKKSNELMAHAYS KLYNIPSTGLRFFTVYGPAGRPDMAYFGFTNKLVKGETIKIFNYGNCKRDFTYVDDIVEG VVRVMRHAPERRTGEDGLPVPPYIIYNIGNNQPENLLDFVTILQEELVRAGVLPAEYDFE AHKELVPMQPGDVPVTYADTTPLVQDFDFKPSTPLREGLCKFAEWYNGYYNK >gi|226332045|gb|ACIB01000011.1| GENE 130 178325 - 179662 1120 445 aa, chain - ## HITS:1 COG:STM2080 KEGG:ns NR:ns ## COG: STM2080 COG1004 # Protein_GI_number: 16765410 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted UDP-glucose 6-dehydrogenase # Organism: Salmonella typhimurium LT2 # 7 445 1 388 388 463 55.0 1e-130 MQEYKDLKIAVAGTGYVGLSMAVLLSQHHNVVAVDVIPEKVEKINRRVSPIQDEYIEKYF AEKDLKLTATLDGKAAYKDVDFVIIAAPTNYDPVKNFFDTHHIEDVIDLVLEVNPDAIMV IKSTIPVGYTRSLYKKYAHLFQLEPELKGKHFNLLFSPEFLRESKALYDNLYPSRIIVGY PKIIDGKEFDEENTAIKSIGNSDAEEYAKIFAKLLQEGAIKEEIDTLFMGMKEAEAVKLF ANTYLALRVSYFNELDTYAEMKGLDSQAIISGVGLDPRIGTHYNNPSFGYGGYCLPKDTK QLLANYADVPQNMMSAIVESNRTRKDFIANQVLNKAGYYHESANWEAEKEKVVVIGVYRL TMKSNSDNFRQSAIQGIMKRIKAKGATTIIYEPTLVDGESFFGSKVVNDLDAFKRQSQAI IANRYDACLDDVREKVYTRDIFKRD >gi|226332045|gb|ACIB01000011.1| GENE 131 180356 - 181246 714 296 aa, chain - ## HITS:1 COG:YPO3861 KEGG:ns NR:ns ## COG: YPO3861 COG1209 # Protein_GI_number: 16123996 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-glucose pyrophosphorylase # Organism: Yersinia pestis # 1 
289 1 286 293 410 67.0 1e-114 MKGIVLAGGSGTRLYPITKGVSKQLLPIFDKPMIYYPISVLMLAGIREILIISTPYDLPG FQRLLGDGSDYGVRFEYAEQPSPDGLAQAFIIGEKFIGDDSVCLVLGDNIFHGNGFSAML KEAVRVADEKQKATVFGYWVNDPERYGVAEFDKRGNCLSIEEKPKVPKSNYAVVGLYFYP NKVVEVAKNIKPSARGELEITTVNQHFLNDKELKVQTLGRGFAWLDTGTHDSLSEASTFI EVIEKRQGLKIACLEGIALRQGWINSDKMKKLAQPMSKNQYGQYLLKVIDELAADQ >gi|226332045|gb|ACIB01000011.1| GENE 132 181411 - 181899 454 162 aa, chain - ## HITS:1 COG:no KEGG:BF0804 NR:ns ## KEGG: BF0804 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 162 1 162 162 317 98.0 9e-86 MTLSEEVASLQRAAHDLMYLGMDGSPIYSDDLSRRNNEVYRLTTTLYNSGVQGSTVEEQA SVCLALLMGYNASFIDHGEKREHVQKILDRCWDILDTLPASLLKLRLLTACYGEVFDEPL ADEARAIIASWDSVSLTSEQQEAINEFQTVVDNPYPWEYVEE >gi|226332045|gb|ACIB01000011.1| GENE 133 181919 - 182455 528 178 aa, chain - ## HITS:1 COG:no KEGG:BF0803 NR:ns ## KEGG: BF0803 # Name: not_defined # Def: putative transcriptional regulator UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 1 178 1 178 178 348 100.0 5e-95 MEVEKETEIWFAMRATYRRETDAMRLLAKENLGCFVPMQYKISIKKGKKVRVLVPIIHNL IFIHACPSEVKRVKSMVAYLQYITDTRSGKKIIIPDNEMQRFIAVAGTYSDHLLYFQPDE LNLSKGTKVRITGGDFEGQEGVFLKVKGARDRRVVIAIQGVIAVAMATIHPDLIEVIK Prediction of potential genes in microbial genomes Time: Tue May 17 22:17:04 2011 Seq name: gi|226332044|gb|ACIB01000012.1| Bacteroides sp. 3_2_5 cont1.12, whole genome shotgun sequence Length of sequence - 182756 bp Number of predicted genes - 133, with homology - 130 Number of transcription units - 63, operones - 38 average op.length - 2.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 535 - 594 3.0 1 1 Tu 1 . + CDS 657 - 1004 332 ## BF0728 hypothetical protein + Prom 1006 - 1065 3.2 2 2 Op 1 . + CDS 1147 - 2019 647 ## BF0799 hypothetical protein 3 2 Op 2 . + CDS 1929 - 2207 89 ## BF0798 hypothetical protein + Prom 2428 - 2487 5.3 4 3 Op 1 . + CDS 2509 - 2721 164 ## BF0726 putative DNA-binding protein 5 3 Op 2 . 
+ CDS 2718 - 3047 294 ## BF0796 hypothetical protein 6 3 Op 3 . + CDS 3040 - 3252 144 ## gi|255007618|ref|ZP_05279744.1| hypothetical protein Bfra3_00667 7 3 Op 4 . + CDS 3263 - 3946 388 ## COG3550 Uncharacterized protein related to capsule biosynthesis enzymes 8 4 Op 1 2/0.154 - CDS 3949 - 4842 772 ## COG2207 AraC-type DNA-binding domain-containing proteins 9 4 Op 2 . - CDS 4884 - 6131 1197 ## COG0668 Small-conductance mechanosensitive channel 10 4 Op 3 . - CDS 6103 - 7119 890 ## BF0792 putative ABC transporter ATP-binding protein 11 4 Op 4 . - CDS 7141 - 7968 847 ## COG3950 Predicted ATP-binding protein involved in virulence - Prom 7995 - 8054 4.0 + Prom 7935 - 7994 3.2 12 5 Tu 1 . + CDS 8062 - 8970 847 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily 13 6 Op 1 . - CDS 9002 - 10093 1070 ## COG1703 Putative periplasmic protein kinase ArgK and related GTPases of G3E family 14 6 Op 2 . - CDS 10102 - 11190 969 ## BF0717 hypothetical protein - Prom 11214 - 11273 5.6 15 7 Tu 1 . - CDS 11457 - 17237 3683 ## COG2373 Large extracellular alpha-helical protein - Prom 17280 - 17339 6.6 + Prom 17360 - 17419 4.0 16 8 Op 1 . + CDS 17460 - 18200 546 ## BF0785 hypothetical protein 17 8 Op 2 . + CDS 18197 - 20947 1703 ## BF0711 putative TonB-dependent outer membrane receptor protein 18 8 Op 3 . + CDS 20952 - 22103 853 ## BF0710 hypothetical protein 19 8 Op 4 . + CDS 22125 - 23807 942 ## BF0782 hypothetical protein + Term 23860 - 23903 6.6 - Term 24391 - 24442 10.1 20 9 Tu 1 . - CDS 24461 - 25918 1346 ## COG2195 Di- and tripeptidases - Prom 25973 - 26032 7.4 21 10 Tu 1 . - CDS 26085 - 27065 496 ## BF0779 putative dolichol-P-glucose synthetase - Prom 27135 - 27194 1.6 + Prom 26980 - 27039 7.9 22 11 Tu 1 . + CDS 27136 - 27954 650 ## COG0030 Dimethyladenosine transferase (rRNA methylation) + Prom 27974 - 28033 2.5 23 12 Op 1 . + CDS 28055 - 29395 1200 ## COG2239 Mg/Co/Ni transporter MgtE (contains CBS domain) + Prom 29409 - 29468 6.2 24 12 Op 2 . 
+ CDS 29514 - 31364 1884 ## BF0776 hypothetical protein + Term 31393 - 31443 16.2 - TRNA 31703 - 31776 47.1 # Gln CTG 0 0 + Prom 31988 - 32047 2.4 25 13 Op 1 . + CDS 32079 - 32438 347 ## COG0799 Uncharacterized homolog of plant Iojap protein 26 13 Op 2 . + CDS 32448 - 34481 1263 ## PROTEIN SUPPORTED gi|157803230|ref|YP_001491779.1| 50S ribosomal protein L9 27 13 Op 3 . + CDS 34497 - 35336 776 ## COG0575 CDP-diglyceride synthetase - Term 35370 - 35413 0.5 28 14 Tu 1 . - CDS 35451 - 36194 628 ## BF0772 hypothetical protein - Prom 36220 - 36279 7.2 - Term 36220 - 36259 3.2 29 15 Op 1 . - CDS 36282 - 37415 992 ## COG0763 Lipid A disaccharide synthetase 30 15 Op 2 . - CDS 37412 - 38179 630 ## COG0496 Predicted acid phosphatase - Prom 38286 - 38345 6.9 + Prom 38481 - 38540 3.7 31 16 Op 1 25/0.000 + CDS 38684 - 39451 774 ## COG1192 ATPases involved in chromosome partitioning 32 16 Op 2 . + CDS 39460 - 40350 1126 ## COG1475 Predicted transcriptional regulators 33 16 Op 3 . + CDS 40351 - 41217 538 ## BF0766 hypothetical protein 34 16 Op 4 . + CDS 41242 - 42540 934 ## COG0741 Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 35 16 Op 5 . + CDS 42609 - 44852 2074 ## COG0317 Guanosine polyphosphate pyrophosphohydrolases/synthetases + Term 44876 - 44935 18.4 36 17 Op 1 . - CDS 44907 - 45272 379 ## COG0789 Predicted transcriptional regulators 37 17 Op 2 . - CDS 45287 - 46255 750 ## COG0739 Membrane proteins related to metalloendopeptidases - Prom 46282 - 46341 4.1 + Prom 46256 - 46315 4.2 38 18 Tu 1 . + CDS 46359 - 48977 2803 ## COG0013 Alanyl-tRNA synthetase + Term 49066 - 49109 8.3 - Term 49123 - 49165 -1.0 39 19 Tu 1 . - CDS 49224 - 50078 457 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 50212 - 50271 2.7 + Prom 50063 - 50122 3.8 40 20 Tu 1 . + CDS 50228 - 51421 609 ## COG0477 Permeases of the major facilitator superfamily + Prom 51726 - 51785 4.5 41 21 Op 1 . 
+ CDS 51920 - 54136 2120 ## COG3537 Putative alpha-1,2-mannosidase 42 21 Op 2 6/0.077 + CDS 54175 - 54729 467 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 43 21 Op 3 . + CDS 54782 - 55780 740 ## COG3712 Fe2+-dicitrate sensor, membrane component 44 21 Op 4 . + CDS 55810 - 58077 1745 ## COG3537 Putative alpha-1,2-mannosidase + Prom 58098 - 58157 3.1 45 22 Tu 1 . + CDS 58180 - 60450 1840 ## COG3537 Putative alpha-1,2-mannosidase + Term 60569 - 60605 2.1 + Prom 60617 - 60676 9.1 46 23 Op 1 . + CDS 60696 - 64112 3225 ## BF0752 hypothetical protein 47 23 Op 2 . + CDS 64135 - 65868 1601 ## BF0751 hypothetical protein + Term 65922 - 65958 7.1 - Term 65908 - 65948 4.2 48 24 Tu 1 . - CDS 65964 - 67382 1035 ## COG0507 ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member - Prom 67425 - 67484 10.8 + Prom 67330 - 67389 7.8 49 25 Op 1 . + CDS 67479 - 68162 897 ## BF0749 hypothetical protein 50 25 Op 2 . + CDS 68170 - 68964 483 ## BF0677 hypothetical protein 51 25 Op 3 . + CDS 68955 - 69488 208 ## PROTEIN SUPPORTED gi|163764797|ref|ZP_02171850.1| ribosomal protein L29 52 26 Tu 1 . - CDS 69506 - 70945 1073 ## COG1502 Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes - Prom 71041 - 71100 5.2 - Term 71018 - 71080 23.4 53 27 Op 1 . - CDS 71108 - 74020 2352 ## BF0745 hypothetical protein 54 27 Op 2 . - CDS 74093 - 74497 363 ## BF0744 hypothetical protein - Prom 74624 - 74683 5.2 + Prom 74352 - 74411 5.8 55 28 Op 1 . + CDS 74634 - 75695 994 ## COG0337 3-dehydroquinate synthetase 56 28 Op 2 . + CDS 75708 - 76148 288 ## BF0742 hypothetical protein + TRNA 76389 - 76466 65.9 # Pro GGG 0 0 + Prom 76694 - 76753 5.3 57 29 Tu 1 . + CDS 76812 - 77900 330 ## BF0670 putative transmembrane acyltransferase protein + Term 78030 - 78079 11.3 - Term 78018 - 78067 11.3 58 30 Op 1 . - CDS 78100 - 79680 972 ## PRU_0502 hypothetical protein 59 30 Op 2 .
- CDS 79684 - 80718 883 ## COG1566 Multidrug resistance efflux pump 60 30 Op 3 . - CDS 80730 - 82049 1359 ## PRU_0500 outer membrane efflux protein - Prom 82110 - 82169 6.5 - Term 82776 - 82817 2.0 61 31 Op 1 . - CDS 82868 - 84922 1585 ## BF2269 putative lipoprotein 62 31 Op 2 . - CDS 84933 - 88148 2261 ## BF0971 hypothetical protein + Prom 88114 - 88173 5.2 63 32 Tu 1 . + CDS 88361 - 89236 212 ## COG2207 AraC-type DNA-binding domain-containing proteins + Term 89422 - 89466 6.1 - Term 89410 - 89452 9.5 64 33 Tu 1 . - CDS 89473 - 91434 1853 ## COG3525 N-acetyl-beta-hexosaminidase - Prom 91491 - 91550 4.5 - Term 91627 - 91663 4.2 65 34 Op 1 . - CDS 91712 - 92881 897 ## COG0642 Signal transduction histidine kinase 66 34 Op 2 . - CDS 92894 - 93769 383 ## BF0738 hypothetical protein - Prom 93837 - 93896 5.5 + Prom 93860 - 93919 5.2 67 35 Tu 1 . + CDS 94054 - 95292 1091 ## COG0128 5-enolpyruvylshikimate-3-phosphate synthase - Term 95314 - 95372 22.4 68 36 Op 1 . - CDS 95433 - 98987 2843 ## COG3250 Beta-galactosidase/beta-glucuronidase - Prom 99008 - 99067 2.4 69 36 Op 2 . - CDS 99105 - 100556 1357 ## COG3669 Alpha-L-fucosidase 70 36 Op 3 . - CDS 100585 - 101538 1222 ## BF0734 exo-alpha-sialidase 71 36 Op 4 . - CDS 101560 - 103242 1710 ## BF0733 hypothetical protein 72 36 Op 5 . - CDS 103256 - 106441 3151 ## BF0732 hypothetical protein - Prom 106628 - 106687 3.7 - Term 106977 - 107013 4.0 73 37 Op 1 . - CDS 107223 - 109889 1641 ## BF0660 putative transmembrane protein 74 37 Op 2 . - CDS 109930 - 112215 1897 ## COG3537 Putative alpha-1,2-mannosidase - Prom 112270 - 112329 2.7 75 38 Op 1 . - CDS 112366 - 113538 639 ## BF0728 hypothetical protein 76 38 Op 2 . - CDS 113546 - 114547 926 ## COG2234 Predicted aminopeptidases - Prom 114597 - 114656 4.8 + Prom 114556 - 114615 5.4 77 39 Op 1 . + CDS 114650 - 115072 447 ## BF0726 hypothetical protein 78 39 Op 2 . + CDS 115111 - 115920 781 ## COG1108 ABC-type Mn2+/Zn2+ transport systems, permease components 79 39 Op 3 . 
+ CDS 115935 - 116354 611 ## COG0802 Predicted ATPase or kinase 80 39 Op 4 . + CDS 116366 - 116587 197 ## BF0653 putative transmembrane protein - Term 116385 - 116428 7.5 81 40 Op 1 23/0.000 - CDS 116579 - 117889 1256 ## COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 82 40 Op 2 . - CDS 117892 - 118887 1114 ## COG0714 MoxR-like ATPases 83 40 Op 3 . - CDS 118951 - 120198 514 ## BF0650 hypothetical protein 84 40 Op 4 . - CDS 120195 - 120818 456 ## BF0719 hypothetical protein 85 40 Op 5 . - CDS 120829 - 121761 914 ## BF0718 hypothetical protein 86 40 Op 6 . - CDS 121742 - 122704 594 ## COG1300 Uncharacterized membrane protein - Prom 122743 - 122802 3.9 87 41 Tu 1 . + CDS 123626 - 124357 576 ## COG1714 Predicted membrane protein/domain + Term 124527 - 124562 1.3 - Term 124515 - 124550 5.1 88 42 Op 1 . - CDS 124580 - 124915 521 ## BF0714 hypothetical protein 89 42 Op 2 . - CDS 124942 - 125256 176 ## PROTEIN SUPPORTED gi|124485582|ref|YP_001030198.1| ribosomal protein L12E/L44/L45/RPP1/RPP2-like protein 90 42 Op 3 . - CDS 125317 - 129111 3488 ## COG0587 DNA polymerase III, alpha subunit - Prom 129230 - 129289 7.1 + Prom 129192 - 129251 6.5 91 43 Op 1 14/0.000 + CDS 129275 - 129961 616 ## COG0688 Phosphatidylserine decarboxylase 92 43 Op 2 . + CDS 129971 - 130681 601 ## COG1183 Phosphatidylserine synthase 93 43 Op 3 . + CDS 130744 - 131031 258 ## BF0709 hypothetical protein - Term 130899 - 130937 -1.0 94 44 Op 1 . - CDS 131048 - 131485 478 ## COG0590 Cytosine/adenosine deaminases 95 44 Op 2 . - CDS 131490 - 131723 251 ## BF0707 hypothetical protein - Prom 131836 - 131895 4.5 + Prom 131703 - 131762 8.1 96 45 Op 1 . + CDS 131861 - 132226 367 ## COG0792 Predicted endonuclease distantly related to archaeal Holliday junction resolvase 97 45 Op 2 . + CDS 132233 - 132589 437 ## COG2315 Uncharacterized protein conserved in bacteria 98 45 Op 3 . 
+ CDS 132573 - 133328 566 ## COG0340 Biotin-(acetyl-CoA carboxylase) ligase 99 45 Op 4 . + CDS 133367 - 134704 817 ## BF0631 hypothetical protein + Term 134710 - 134768 20.6 - Term 134628 - 134665 1.4 100 46 Tu 1 . - CDS 134731 - 135414 535 ## BF0630 hypothetical protein - Prom 135440 - 135499 5.4 101 47 Op 1 . - CDS 135529 - 136851 787 ## COG2273 Beta-glucanase/Beta-glucan synthetase 102 47 Op 2 . - CDS 136876 - 137034 91 ## BF0699 hypothetical protein - Prom 137154 - 137213 3.2 + Prom 136787 - 136846 5.5 103 48 Op 1 . + CDS 137063 - 138364 975 ## COG0534 Na+-driven multidrug efflux pump 104 48 Op 2 . + CDS 138413 - 139123 930 ## COG0528 Uridylate kinase + Term 139146 - 139204 14.1 - Term 139137 - 139189 13.3 105 49 Tu 1 . - CDS 139370 - 140530 598 ## BF0625 hypothetical protein - Prom 140582 - 140641 6.0 + Prom 141481 - 141540 6.9 106 50 Op 1 . + CDS 141587 - 142435 616 ## BF0694 hypothetical protein 107 50 Op 2 . + CDS 142475 - 144064 1567 ## COG0443 Molecular chaperone 108 50 Op 3 . + CDS 144092 - 146887 969 ## BF0692 hypothetical protein 109 50 Op 4 . + CDS 146898 - 147449 302 ## BF0691 hypothetical protein 110 50 Op 5 . + CDS 147461 - 148519 466 ## BF0690 hypothetical protein + Prom 148558 - 148617 4.8 111 51 Op 1 . + CDS 148646 - 149206 840 ## COG0233 Ribosome recycling factor 112 51 Op 2 . + CDS 149302 - 150234 930 ## COG1162 Predicted GTPases - Term 150089 - 150128 1.6 113 52 Tu 1 . - CDS 150268 - 150456 82 ## gi|255007500|ref|ZP_05279626.1| hypothetical protein Bfra3_00075 - Prom 150672 - 150731 4.3 + Prom 151085 - 151144 7.9 114 53 Op 1 . + CDS 151205 - 152458 552 ## BF0615 hypothetical protein 115 53 Op 2 . + CDS 152455 - 154017 905 ## BF0614 glycosyl transferase + Prom 154019 - 154078 5.1 116 54 Op 1 . + CDS 154105 - 154233 78 ## 117 54 Op 2 . + CDS 154246 - 154590 245 ## BF0612 hypothetical protein - Term 154498 - 154533 -0.3 118 55 Tu 1 . - CDS 154641 - 154793 71 ## - Prom 154892 - 154951 10.3 + Prom 154852 - 154911 8.4 119 56 Tu 1 . 
+ CDS 155085 - 155252 145 ## 120 57 Op 1 13/0.000 + CDS 155633 - 157804 173 ## PROTEIN SUPPORTED gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein 121 57 Op 2 7/0.077 + CDS 157816 - 159129 313 ## COG0845 Membrane-fusion protein + Prom 159140 - 159199 3.7 122 58 Op 1 27/0.000 + CDS 159233 - 160315 1080 ## COG0845 Membrane-fusion protein 123 58 Op 2 . + CDS 160330 - 163365 2972 ## COG0841 Cation/multidrug efflux pump 124 58 Op 3 . + CDS 163362 - 164687 1267 ## BF0682 putative outer membrane protein TolC + Term 164778 - 164815 2.1 125 59 Tu 1 . + CDS 165215 - 166882 1311 ## COG0531 Amino acid transporters 126 60 Op 1 . - CDS 166975 - 169131 2172 ## BF0603 putative alpha-N-acetylglucosaminidase 127 60 Op 2 . - CDS 169156 - 170301 963 ## COG3274 Uncharacterized protein conserved in bacteria - Prom 170358 - 170417 5.7 - Term 170383 - 170427 6.1 128 61 Tu 1 . - CDS 170434 - 172080 497 ## PROTEIN SUPPORTED gi|169634422|ref|YP_001708158.1| fumarate hydratase - Prom 172134 - 172193 5.8 - Term 172376 - 172416 8.3 129 62 Op 1 . - CDS 172504 - 175017 1859 ## COG0308 Aminopeptidase N 130 62 Op 2 . - CDS 175047 - 177038 1805 ## BF0673 hypothetical protein 131 62 Op 3 . - CDS 177063 - 178316 1502 ## COG2262 GTPases - Prom 178342 - 178401 4.4 - Term 178370 - 178413 6.8 132 63 Op 1 . - CDS 178433 - 179827 1095 ## BF0597 hypothetical protein 133 63 Op 2 . 
- CDS 179862 - 182756 2530 ## BF0670 hypothetical protein Predicted protein(s) >gi|226332044|gb|ACIB01000012.1| GENE 1 657 - 1004 332 115 aa, chain + ## HITS:1 COG:no KEGG:BF0728 NR:ns ## KEGG: BF0728 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 115 1 115 115 205 99.0 4e-52 MKKLKSPASQSEAMKLRWKKRIVFEKGYTESCAEWMAERLEALLDHMQYGHATVAYRKQN GSFQLVKATLIYYEAEFRKKYDPTKIEGAVVYWNVDEQRWMTFQVENFMEWRPIV >gi|226332044|gb|ACIB01000012.1| GENE 2 1147 - 2019 647 290 aa, chain + ## HITS:1 COG:no KEGG:BF0799 NR:ns ## KEGG: BF0799 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 290 1 290 290 557 97.0 1e-157 MAIAYDGINYFPVGVNFMEENAMEVIEAKYGIKGSAIVLKLMCKIYKEGYYIRWDEEQCL IFANKAGREVQAEEVQGIIEILFTKGILDRNSYQENGILTSESIQKVWMEATKRRKRELS ELPYLMVKPEKENGKADTPPALQEIQQPELFKKEKTPVNPKNVVHHVVVDAKNACNSGQS KVKEKKAEENKEFPPSAPPKGEEEERKGDSAYLPIPGYAFNTMTHNYSGLMDTLKRLSIT DTGEVNSILRLSDYGRKGTTVWKLIANTCWSDIGAKGRYLIAALNKTKRR >gi|226332044|gb|ACIB01000012.1| GENE 3 1929 - 2207 89 92 aa, chain + ## HITS:1 COG:no KEGG:BF0798 NR:ns ## KEGG: BF0798 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 55 1 55 55 87 96.0 1e-16 METDCQHLLERHRSQRKIPDSGAKQDEKKVAESVSPLFVVDKKANIQAFDQKGIQRNKEV KSVLNELKHSVFKAQDFSRPKLCFNATLKLVL >gi|226332044|gb|ACIB01000012.1| GENE 4 2509 - 2721 164 70 aa, chain + ## HITS:1 COG:no KEGG:BF0726 NR:ns ## KEGG: BF0726 # Name: not_defined # Def: putative DNA-binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 70 20 89 89 101 100.0 9e-21 MKQIGIQIRQRRKMLGINQQTLADLAQISINTITKIENGEININFQKLYAILEVLGLELS LKIKNKEGHL >gi|226332044|gb|ACIB01000012.1| GENE 5 2718 - 3047 294 109 aa, chain + ## HITS:1 COG:no KEGG:BF0796 NR:ns ## KEGG: BF0796 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 211 100.0 7e-54 MRQGVVYLNKERVGIITELSSNEYKFRYDDEYFNDPSKPSISLTLTKQQQEYTSHYLFPF 
FANMLSEGHNRIVQARLLQIDEKDDFGILLATAHTDTAGAVTIKPLDYD >gi|226332044|gb|ACIB01000012.1| GENE 6 3040 - 3252 144 70 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|255007618|ref|ZP_05279744.1| ## NR: gi|255007618|ref|ZP_05279744.1| hypothetical protein Bfra3_00667 [Bacteroides fragilis 3_1_12] # 1 68 1 68 336 127 95.0 2e-28 MIELTCCPSTLQKGFSTYSPVALKELFNSQKVNHILPYNGMDNNETEQKEFQDNNKHMSI SGAQQNKSSQ >gi|226332044|gb|ACIB01000012.1| GENE 7 3263 - 3946 388 227 aa, chain + ## HITS:1 COG:SMa0592 KEGG:ns NR:ns ## COG: SMa0592 COG3550 # Protein_GI_number: 16262763 # Func_class: R General function prediction only # Function: Uncharacterized protein related to capsule biosynthesis enzymes # Organism: Sinorhizobium meliloti # 23 196 196 361 390 106 40.0 5e-23 MQIASQVYNIPTAANGLCFFQNDEPAYITRRFDIAPNGRKFRKEDFASLAGISKGNKGPN YKYDVLSYEEMADIIKQYVSASSVEVLKFFRLVIFNFLFSNGDAHAKNFSLLETPSGDFI LAPAYDLLNTRLHIFDDHVFALQRGLFKENTLNGNDGAVTGKEFIEFGIRIGIPPKRVHK ELISFCQKAEQVQDLVEKSFLPNQLKKQYLLHYQMRKDSYLSVGIPT >gi|226332044|gb|ACIB01000012.1| GENE 8 3949 - 4842 772 297 aa, chain - ## HITS:1 COG:AGl1135 KEGG:ns NR:ns ## COG: AGl1135 COG2207 # Protein_GI_number: 15890685 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 25 295 36 304 313 128 30.0 1e-29 MNALQSNIIREITPLSDKDCFYIAERYKTEFTYPIHNHAEFELNFTEKAAGVRRIVGDSA EVISDYDLVLITGKDLEHVWEQHDCHSKEIREITIQFSSDLFFKSFINKNQFDSIRDMLE KAQKGLCFPMSAILKIYPLLDTLASEKQGFYAVIKFLTILYELSLFNEEARTLSSSSFAK IGIHSDSRRVQKVQEYINAHYQEEIRLNQLADMVGMTPVSFSRFFKLRTGKNLSDYIIDI RLGFAARLLVDSTMSIAEICYECGFNNLSNFNRIFKKKKECSPKEFRENYRKKKKLV >gi|226332044|gb|ACIB01000012.1| GENE 9 4884 - 6131 1197 415 aa, chain - ## HITS:1 COG:VC0265 KEGG:ns NR:ns ## COG: VC0265 COG0668 # Protein_GI_number: 15640294 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Small-conductance mechanosensitive channel # Organism: Vibrio cholerae # 28 409 26 408 412 350 45.0 2e-96 
MEQITEKINDLFVSWGFDSSEVGPIMTLVLIIGIAFLADLICRNILLRVVAKLVKKTKAT WDDIVFDRKVLIYLSHLVPPIIIYVLIPLAIPNVSALDFIRRICMIYIIAVFLRFISAFL SAVYHVYSEREQFRDRPLKGLLQTAQVILFFIGGIVVISVLIDKSPMVLLTGLGASAAIL MLVFKDSIMGFVSGIQLSANNMLKVGDWIAMPKYGADGTVIEVTLNTVKVRNWDNTITTI PPYLLVSDSFQNWRGMQESGGRRVKRSINIDMNSVRFCTPEMLAKYKKIQLLTDYVEQTE QVVKEYNKEHHIDNSILVNGRRQTNLGVFRAYLTNYLKSLPDVNKNLTCMVRYLQPTEQG IPVELYFFSAVKEWVPYEGIQADVFDHLLAIVPEFGLRVFQNPTGEDFREWNRRN >gi|226332044|gb|ACIB01000012.1| GENE 10 6103 - 7119 890 338 aa, chain - ## HITS:1 COG:no KEGG:BF0792 NR:ns ## KEGG: BF0792 # Name: not_defined # Def: putative ABC transporter ATP-binding protein # Organism: B.fragilis # Pathway: not_defined # 1 338 1 338 338 655 99.0 0 MAISLKDNLTSSYFNAAHKLYSKKARRRIVAYVESYDDVAFWRTLLEEFEDEEHYFQVML PSATSLAKGKKMVLMNTLNTAELGKSLIACVDSDYDFLLQGATATSRKINRNRYIFQTYA YAIENYHCYADSLHEVCVQATLNDRHLIDFNEFMKRYSQIAYPLFLWSVWFYRRHDTYTF TMSEFNACVRLHDVSLRHPERSLEAVRRSVTSKLSELSTRFPQGIEEVDKLSVELKELGV LPDTTYLFIQGHHIMDNVVMKVLTPVCTALRREREQEIKKLAEHDEQFHNELTCYQNSQV NVEVMLRKNSAYKDLYLYQWLKEDIKEFLYGTDNRKNK >gi|226332044|gb|ACIB01000012.1| GENE 11 7141 - 7968 847 275 aa, chain - ## HITS:1 COG:STM2746 KEGG:ns NR:ns ## COG: STM2746 COG3950 # Protein_GI_number: 16766058 # Func_class: R General function prediction only # Function: Predicted ATP-binding protein involved in virulence # Organism: Salmonella typhimurium LT2 # 176 260 320 410 427 60 36.0 3e-09 MELSANYIKRIEIDGLWDRFNIVWDLRPDVNILSGINGVGKTTILNRSVGYLEQLSGEVK SDEKNGVHIFFDNPEATYIPYDVIRSYDRPLIMGDFTARMADKNVKSELDWQLYLLQRRY LDYQVNIGNKMIELLSSNNEEERSKAATLSIAKRRFQDMVDELFSYTRKKIDRRRNDIAF YQDGELLFPYKLSSGEKQMLVILLTVLVQDNAHCVLFMDEPEASLHIEWQQKLISMIREL NPNVQIILTTHSPAVIMEGWLDAVTEVSDIATSYK >gi|226332044|gb|ACIB01000012.1| GENE 12 8062 - 8970 847 302 aa, chain + ## HITS:1 COG:AF0266 KEGG:ns NR:ns ## COG: AF0266 COG0697 # Protein_GI_number: 11497882 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the 
drug/metabolite transporter (DMT) superfamily # Organism: Archaeoglobus fulgidus # 20 293 14 276 276 69 24.0 9e-12 MEIDKNLKGHALAFTANVMWGLMSPIGKSALAEFSALSVTTFRMVGAAAAFWILSAFCKQ EQVGHRDMVKIFFASLFALVFNQGIFIFGLSLTSPIDASIVTTTSPIITMIVAAIYLKEP VTNKKVLGIFIGAMGALILILSSQAVSAGGGSIWGDLLCMIAQLSFSIYLTVFKGLSQRY SAITINKWMFIYASICYIPFSYQDIASIKWDSISTAAIYQVLYVVLCGSFIAYICIMTAQ KLMRPTVVSMYNYVQPIVASIAAILMGIGSFGWEKGVAIALVFLGVYFVTQSKSKADLEG VS >gi|226332044|gb|ACIB01000012.1| GENE 13 9002 - 10093 1070 363 aa, chain - ## HITS:1 COG:BH2954 KEGG:ns NR:ns ## COG: BH2954 COG1703 # Protein_GI_number: 15615516 # Func_class: E Amino acid transport and metabolism # Function: Putative periplasmic protein kinase ArgK and related GTPases of G3E family # Organism: Bacillus halodurans # 34 363 5 334 340 333 48.0 4e-91 MEHPENNEAYKGLVVNAGIEQPSSVNPYLKRKVKKRQLSVSEFVEGIVKGDVTILSQAVT LVESVRPEHQATAQEVIEKCLPYSGNSIRVGISGVPGAGKSTSIDVFGLHVLEKGGKLAV LAIDPSSERSKGSILGDKTRMEQLSVHPKSFIRPSPSAGSLGGVARKTRETIILCEAAGF DKIFVETVGVGQSETAVHSMVDFFLLIQLAGTGDELQGIKRGIMEMADGIVINKADGSNI DKAKLAAAQFRNALHLFPAPDSGWTPRVLTYSGFYNLGVKEIWDMVYEYIDFVKGNGYFE YRRNEQSKYWMYESINEQLRDSFYHNAKIESMLQEKEQQVLRGNLTSFVAAKSLLDTYFE DLK >gi|226332044|gb|ACIB01000012.1| GENE 14 10102 - 11190 969 362 aa, chain - ## HITS:1 COG:no KEGG:BF0717 NR:ns ## KEGG: BF0717 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 362 14 375 375 692 100.0 0 MKRSLLFIFTLLTITLSAVAQPRISSNKETHHFGQIEWKRPVSVEYTITNTGDKPLVLTN VTTSCACSVANWTKTPIAPGEKGTVSATFDAKALGHFNKSIGIYSNAQPSLVYLNFDGEV VQEIKDFTKTHPYAIGQIRIDRTDIDFPDAHSGEKPVITLGVVNLSDRPYEPVLMHLPPY LKMETNPTVLLKGKKGTITLTLDTKQLMDLGLTQSSVYLARFAGDKVGEENEIPVSAVLL PDFSGMTEQDKAVAPVIRLSESKIDLSQVLAKKNKARRDIVITNTGKSPLQISKLQVFNP AVGVALKKTVLQPGESTRLRVTVLKKNLGKKKRHLRILMITNDPVQPKVEIDVKATNNES HN >gi|226332044|gb|ACIB01000012.1| GENE 15 11457 - 17237 3683 1926 aa, chain - ## HITS:1 COG:TM0984 KEGG:ns NR:ns ## COG: TM0984 COG2373 # Protein_GI_number: 15643744 # Func_class: R General function prediction 
only # Function: Large extracellular alpha-helical protein # Organism: Thermotoga maritima # 1078 1315 633 863 1536 67 26.0 3e-10 MRIKLICIIVLLSMGMMSWTHAQSYDRLWKQVEQAQQKSLPQTVVRLTGEIYQKAKAEKN SPQMLKAYIWQMKFREEITPDSFYVSLNGLEQWAVTTDKPLDRAILHSLIGSMYADYASQ NRWKLNQRTDLEEEAPSVDIREWSKNQFVTKVMTEIAVTFQDSLLLLDTSSRSYIPFVEL GVTSDYYHHDMYHLLASRAITSLENLSGFGRDSLINVRIEEIYQHMMNSYRRTDNHDALL LTTLDYLQWKRRTDIDFRPYRAPEGKLGLTQDPYLAALDKLIAENKSHDVCAEVYLLKAQ AAMDAGVPASALQLCEEAISRYPDYRRINALKELKQEILRPDLTVQSPSTVYPGEEFDLK VSFKNLKDFTVELYAINLPARPNTVEAPNDVFLKKHGRLLSSEHYVLFPSDDYKVKDSIY HIKAPETGLYALRVIPGVKVRSNVSKFLYSTCFKVLTRSLPSNLSEVAILDAMSGKPLQG VVLSFFDRQNKQLLTATTNTEGKVQFASSEKYRYLTAAKGNDTAMPQMYLWGGDYNFADH SKPVSVVTLLTDRSVYRPGQTVYVKGIAYEQYPDSAHVIAGQEYTLTLSDANGQEISAKK LRTNDFGSFTAEFVLPSVCLNGTFSLNTQNGFRSIRVEDYKRPTFDITFEPVTESYRLGD RVELKGSVKTFSGVPLQDIPVTYTITRSLYTWRMWGMNPVILASDTVRLGVDGNFEIPVD LKPDTSNPDLGDGDNTSLYYDYKVQLSVTNVAGETQTSETSLRAGKTSLLLFADISGLIC KDDSVKATFRVNNLDRKPVSVEGSYRLFLISDYQKSKPLKEQDVSDQPALSGSFRSNEEI LLSDWKKLPSGAYKLVASVKDDQGRKVDAEKVVILFASDDKRPPVSMPLWCYEVNTRFDA AHPALFYFGTSEKDTYVLMDVFCGNKHLESKLLHLSDSLVRFEYPYREAYGNGLGITFVF VRKGVVYEQEVSLIKRLPDHNLNMRWDVFRDKLRPGQEEEWKLTIRNPQKSPVLAEMLAT MYDASLDKIWKTNQSLQLHYQLSVPIARWRRDYVGSNYFYFGFRRTDFKVPPFSYDHFDL PPVLYAVAEMLSVTNDAAPTTRYARLRGMGAAKPQMKSAAVADVVFESEMVPVTEESGMA MSMDNADMGRTTDIELRTDFAETAFFYPQLHTNVQGEVSFSFRMPQSLTTWNFRGYAHTQ DMMTGQMDATAVTSKEFMLTPNLPRFVRVGDHTSMAASVSNLTGKNLSGTVKLVLFDPMT DQVISTQQKKFNAGAGQSVGVSFLFTVTDKYELLGCRMIAEGGNFSDGEQHLLPVLSDKE NLTETLPMPVRGEQTRTFSLADLFNHHSKTATNRRLTVEFTSNPAWYAVQALPALSQPRN DDAISWATSWYANTMASYIMNAQPRIQAIFDSWKLQGGTKESFLSNLQKNQEVKNILLSE SPWVMEATSESEQKERIATLFDLNNIRNSNTAALLKLKELQLPDGSWSWYKGMDGSLFVT DFIVEQNARIALLTGKPLEGGALDMQQAAFGYLHKEALQEYRSIREAEKVGNKSEGISRS ALKYLYLIAVSGEKVPASVKEGYDYFLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQE FMASLKEYLTQTDEQGMFFDRTDSPYAWNNLKVPAHVDVMEAFEMVGSNATIVEEMKMWL LKQKQTQQWDSPVATANAVYALLYRGTNLLDNQGDVRIVLGNEVLETISPAKTTVPGLGY IKKTFTDKKTVNTDEIIVEKRDPGIAWGAVYAQFEENLDKVVRQGSGLNVDKKLYVETIV 
NNNRLLQPVIGKTQLKVGDKVVVRLTVRLDRTMDFVQLKDQRAACLEPVEVLSGYRNVGD VGCYVAVKDASTDFFFDTLNKGTYVLEYSYRVDRAGSYEAGIATIQSAYAPEYAAHSASA RYEVSQ >gi|226332044|gb|ACIB01000012.1| GENE 16 17460 - 18200 546 246 aa, chain + ## HITS:1 COG:no KEGG:BF0785 NR:ns ## KEGG: BF0785 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 246 1 246 246 461 99.0 1e-129 MKNRHYLRHILAITALLFNGEAIYSQTYPIENYLKAAGDYVTIYNGEIELTYSLAQYDNL PYFQGDEFTTGEIIFKGNRYPGLDLHLDLHKDQLCALTPDSHYSMIINNEGIEQVNLHNT TFIYFRPTKKTDLNKGFYELLQDGKRLRLLARKTYSVAQINVEKIAKTRKHQTEYFIYGV KYYLEYNGIYYPVSNNKSFAKIFPEQHKLIKRYARKHKLNFRHDADASLIALTNFCEELI DQKQTR >gi|226332044|gb|ACIB01000012.1| GENE 17 18197 - 20947 1703 916 aa, chain + ## HITS:1 COG:no KEGG:BF0711 NR:ns ## KEGG: BF0711 # Name: not_defined # Def: putative TonB-dependent outer membrane receptor protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 916 1 916 916 1820 99.0 0 MKKSTYLWMLPLLFAWPQQMTAQHPLSLPADSITIDSLLETVEKNTPYRVFSTISAPFKV LVKGKASPLQQLKEALEPTPYKLSVSGNNLFVLKEQELITLLPAKLTGEPEKGESYYGDV YTYLSGEPEKASSENKVYNVGDVRIKQPPRKAVLKGQVTNFKTGEPMIGINLILKDPWIA TTTDVKGNFTLELPTGHKQIDIKGLNIKDTRRQIMLYSDGTLDIELEETTHMLDEVTITS GRIQNVKSTQLGAETLRPTQLKNIPMALGEVDILKMVQALPGVKTVGEASSGFNVRGGAT DQNLILLNDGTIYNPNHLFGFFAAFNSDMVKEAEIYKSSIPAQYGGRISSILDITGKEAN KEKFTGSAGIGLVTSKLNLEIPIIKDRTSVLLSGRTTYSDWIMKQLPEKSGYKNGTAGFY DLAAIVAHKFNDKHSLNVYGYYSHDRFAFNSNEKYGYNNLNVSARWRAVFNEKLIGYFSA GYDHYDYNNRETVNASAAYKLSFDINQYFVKADFTNILADKHTLNFGFKSMLYHINSGTY EPEGSESFVKKDVLQKDKALETAFYLGDEWEITPKLSVNAGIRYSLFSALGPRSYYQYAS GMLPHESTITDTITAGAGKFMKTYHGPEFRLSARYAFTDNFSVKAGFNSMRQYIHKLSNT VIMSPTDTWKLSDVNIKPQRGWQAAAGLYLNSPSGIWEYSVEGYYKRMSDYLDYRGGAKL LMNHHIETDVINTQGHAYGVELQVKKQVGKLNGWMSYTYSRTFLRQNDKRIEKPVNNGDW YPTEYDKPHDFKFVGNYKFTHRYSMSINVDYSTGRPTTIPAGQYYDESTQSMRVYYTERN SYRIPDYFRTDISFNIEPSHHLTLLTHSSISIGVYNVTGRKNVYSIYYMPEEGQIKGYQI SIFGVPIPFITYNIKF >gi|226332044|gb|ACIB01000012.1| GENE 18 20952 - 22103 853 383 aa, chain + ## HITS:1 COG:no KEGG:BF0710 NR:ns ## KEGG: BF0710 # Name: 
not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 383 1 383 383 761 99.0 0 MLKSKYKIYLLLLCLTGCVSEYNAQLPSSDEELLVVTGDIIANTEAIFSLSKSIPLSEDM PEDYRNIYARIAVVGSDGYRSDFGTALGDGKYQVSIGELQDDVSYGIEIEYDGEIYTSSP STPMVSSEIDSVSWIQPEPEQALSIRVSTHGDPGKTQYYMWNYREDWEIRASYITTCYFD PDMNRIYEDSNYPTFYCWKKEISRNILIGSTEKLKEHLIINNKLLDVPVNEDRFTVLYSI QVQQRALSKEGYEYYLNVQQQNEEMGGIFTPQPSEIQGNISCISQPGRRTIGYVGVYKNI SEKRIYIHPNEIKRPPLYSGCEEVSDSEMDEQGYSTYLIRYLVGYRPVGTGTHIDYWALR RCTECEANGGSKNKPSFWPNDHQ >gi|226332044|gb|ACIB01000012.1| GENE 19 22125 - 23807 942 560 aa, chain + ## HITS:1 COG:no KEGG:BF0782 NR:ns ## KEGG: BF0782 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 560 1 560 560 1140 99.0 0 MKRIFTLYLFILFCLILQAQEELYERVYVHTDKTCYLAGEEVWLKFYTIDTHFRPSSFSK VGYIEISNTERPKAQLKLALDNGSGSGKVKIPTDAPSGIYELTGYTRYMRNEGEKVFFRK SIAVINTFRVSDSDPIELADSAEIYPKGKPATTENIHIKTSRSNYNTRQLVELTINRLPD EVSDLTVSVSRNDSLVTLPPLEESTWRKQVTATPGTFSGKWIPEYEGHIICGQIESPTGE TLKQVQNEPISADIAFVGKDIRYVQGQVESGGNTLFYTSHVYGTNDVVAAAWNINGEPFR MNILSPFSEKLPQNLPSLKLYRNKKRLLERSIGIQLQQVTVLDSLDHAIPLQSCYGLQPY LNYNLDEYTRFNTMTETFVEFVRSVIIRKVNGKRRLRVLKEGEKRFNIGNTLVLLDGVPI HDHEDILKYNPRLVKKIEIYNGRYGFGGEVFECMISLTTQRGDLPSIQLSDDSRLTVYEC PQLPVTFKMPEYKDATDKKSRRPDFRHTLYWNPSVETEAGIDTTLSFYTSDLEGEFKVVV EGFTLKGELIRGEVNFHVKK >gi|226332044|gb|ACIB01000012.1| GENE 20 24461 - 25918 1346 485 aa, chain - ## HITS:1 COG:VC2279 KEGG:ns NR:ns ## COG: VC2279 COG2195 # Protein_GI_number: 15642277 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Vibrio cholerae # 2 485 50 533 534 452 45.0 1e-127 MSTILDLAPQNVWKHFYSLTQIPRPSGHMEKITEFLVNFGNSLGLKTFVDDAGNVIIRKP ATPGMENRKGVILQAHMDMVPQKNNDTVHDFEKDPIETYIDGEWVKAKGTTLGADNGLGV AAIMAVLEDQNLKHGPLEALITKDEETGMYGAFGLKPGTVNGEILLNLDSEDEGELYIGC AGGMDVTASLEYKEVAPEEGDIAIRVNLKGLRGGHSGLEINQGRANANKLLVRFIREAVA TYEARLASWEGGNMRNAIPREAHAVVTIPAENEEELLALVKYCEDLFNEEFKAIETPISF 
TAERVELPAGEVPEEIQDNLIDAIFACQNGVMRMIPTIPDTVETSSNLAIINIGEGKASF KILARSSSDSMKECLTTSLECCFSMAGMKVEMTGGYSGWQPDINSPILHAMKESYKKQFG TEPAVKVIHAGLECGIIGAIIPGLDMISFGPTLRSPHSPDERALIPTVQKFYDFLIATLE QTPMK >gi|226332044|gb|ACIB01000012.1| GENE 21 26085 - 27065 496 326 aa, chain - ## HITS:1 COG:no KEGG:BF0779 NR:ns ## KEGG: BF0779 # Name: not_defined # Def: putative dolichol-P-glucose synthetase # Organism: B.fragilis # Pathway: not_defined # 14 326 21 333 333 548 99.0 1e-154 MKKLIKKALKLILPLVLGGFILYWVYRDFDFVKAMEVLQHGTNWWWMAFSLLFGIFAQVF RGWRWRQTLEPLGAFPRRRDCVDAIFISYAASLVVPRVGEVSRCGVLAKYDNVSFAKSLG TVVTERLVDTVTILLITGVTVLLQMPVFVTFLEQTGTKIPSFMHLLTSVWFYIILFCTIG VIVLLYYLIRTLSFFEKVKGVVLNVCEGIMSLRNVKNLPLFLLYSFLIWLSYFLHFYFTF YCFAFTAHLGLLAALVMFVGGTFAVIVPTPNGAGPWHFAVITMMMLYGVNATDAGIFALI VHGIQTLLVILLGVYGLVTISFLHRK >gi|226332044|gb|ACIB01000012.1| GENE 22 27136 - 27954 650 272 aa, chain + ## HITS:1 COG:PA0592 KEGG:ns NR:ns ## COG: PA0592 COG0030 # Protein_GI_number: 15595789 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Dimethyladenosine transferase (rRNA methylation) # Organism: Pseudomonas aeruginosa # 5 257 8 263 268 166 38.0 5e-41 MKLVKPKKFLGQHFLKDLKVAQDIADTVDTFPDLPILEVGPGMGVLTQFLVKKERLVKVV EVDYESVAYLREAYPSLEDNIIEDDFLKMNLQRLFDGHPFVLTGNYPYNISSQIFFKMLD NKDLIPCCTGMIQKEVAERIAAGPGSKTYGILSVLIQAWYRVEYLFTVNEQVFNPPPKVK SAVIRMTRNETQELGCDPKLFKQIVKTTFNQRRKTLRNSIKPILGKDCPLTEDALFNKRP EQLSVQEFIHLTNQVEQALKVPIEPVSQIENP >gi|226332044|gb|ACIB01000012.1| GENE 23 28055 - 29395 1200 446 aa, chain + ## HITS:1 COG:BH0511 KEGG:ns NR:ns ## COG: BH0511 COG2239 # Protein_GI_number: 15613074 # Func_class: P Inorganic ion transport and metabolism # Function: Mg/Co/Ni transporter MgtE (contains CBS domain) # Organism: Bacillus halodurans # 21 441 25 441 452 231 32.0 3e-60 MNEEYIDNVKELIEEKDADKVKELLIDLHPADIAELCNELNPEEARFVYRLLDNETAADV LVEMDEDVRKEFLDILPSETIAKRFVDYMDTDDAVDLMRELDEDKQEEILSHIEDIEQAG DIVDLLKYDENTAGGLMGTEMVTVNENWSMPECLKEMRQQAEELDDIYYVYVIDDDERLR 
GIFPLKKMITSPSVSKVKHVMQKDPISVHVDTPIDEVAQIIEKYDLVAIPVLDSIGRLVG QITVDDVMDEVREQSERDYQLASGLSQDVETDDNVLRQTTARLPWLLIGMIGGIGNSMIL GNFDSTFAAHPEMALYIPLIGGTGGNVGTQSSALVVQGLANSSLDAKNTFKQVSKEAVVA LINATIISLLVYTYNFIRFGATATVTYSVSISLFSVVMFASIFGTLVPMTLEKMKIDPAI ATGPFIAITNDIIGMMMYMGITVLLS >gi|226332044|gb|ACIB01000012.1| GENE 24 29514 - 31364 1884 616 aa, chain + ## HITS:1 COG:no KEGG:BF0776 NR:ns ## KEGG: BF0776 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 616 1 616 616 941 99.0 0 MMDSHDTNQPLKQGELEEEKKAVEVSEEITETPAEETIVEKPTENASKLSTKEEVLLRLK EVAQDAENANKQELDGLKQTFYKIHNAEIEAAKKTFVENGGAEEEFIAQPSGVEEEFKSL MAAIKEKRSALAAEIEKQKEENLQVKLSIIEELKELVESPDDANKSYNEFKKLQQQWNEV KLVPQAKVNELWKNYQLHVEKFYDILKLNNEFREYDFRKNLEIKTHLCEAAEKLADEQDV VSAFHQLQKLHQEFRDTGPVAKELRDEIWNRFKAASTAVNRRHQQHFEALKETEQHNLDQ KTVICEIVEAIEFDQLKTFAAWETKTQEVIALQNKWKTIGFAPQKMNVKIFERFRKACDE FFKKKGEFFKLLKEGMNANLEKKKALCEKAESLKDSTEWKETAEILTKLQKEWKTIGPVS KKYSDAVWKRFITACDYFFEQKGKATSSQRSVEQENLEKKKAIIARLTAIDETTDADEAS KEVRELMKEWNGIGHVPFKEKDRLYKQYHGLIDQLFDRFNISASNKKLSNFKSSIGNIQS GGSQSLYREREKLVRTYENMKNELQTYENNLGFLTTSSKKGNSLLTEINRKVEKLKSDLE LVLQKIKVIDESIKEE >gi|226332044|gb|ACIB01000012.1| GENE 25 32079 - 32438 347 119 aa, chain + ## HITS:1 COG:slr1886 KEGG:ns NR:ns ## COG: slr1886 COG0799 # Protein_GI_number: 16330295 # Func_class: S Function unknown # Function: Uncharacterized homolog of plant Iojap protein # Organism: Synechocystis # 4 118 27 140 154 89 40.0 2e-18 MNETKVLIEKITEGIQEKKGKNIVIADLTNIDDTICKYFVICQGNSPSQVIAIVDSIKEF TRKGAGTKPSAIDGQRNAEWVAMDFSDVLVHVFLPEARNFYNLEHLWADAKLTTIPDID >gi|226332044|gb|ACIB01000012.1| GENE 26 32448 - 34481 1263 677 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157803230|ref|YP_001491779.1| 50S ribosomal protein L9 [Rickettsia canadensis str. 
McKiel] # 13 663 3 635 636 491 44 1e-137 MDNNSKKPNNKVNMPKFNLNWMYMIIALMLLGLYFANGSSSISKNISYDEFQQYVRDGYV SKVIGYDDNSVEIYIKPQYVGTVFKQDSTRVGRNPMITTEAPSRENLDNFLQKEKEETHF DGSVSYDKKKDYFSAILWNVLPIVFLIALWIFFMRRMGSGASGGAGGVFNVGKSKAQLFE KGGSIKVTFKDVAGLAEAKQEVEEIVEFLKEPQKYTDLGGKIPKGALLVGPPGTGKTLLA KAVAGEANVPFFSLAGSDFVEMFVGVGASRVRDLFKQAKEKAPCIVFIDEIDAVGRARGK NPAMGGNDERENTLNQLLTEMDGFGSNSGVIILAATNRVDVLDKALLRAGRFDRQIHVDL PDLNERKEVFGVHLRPIKIDDTVDVDLLARQTPGFSGADIANVCNEAALIAARHGKKFVG KQDFLDAVDRIIGGLEKKTKITTEAERRSIALHEAGHASISWLLEYANPLIKVTIVPRGR ALGAAWYLPEERQITTKEQMLDEMCATLGGRAAEDLFIGRVSSGAANDLERVTKQAYGMI AYLGMSEKLPNLCYYNNDEYSFQRPYSEKTAELIDEEVKRMVNEQYERAKQILSEHKEQH NELAQLLIDKEVIFAEDVERIFGKRPWASRSEEIMAANNKQENAVHPADGEDVDTTTPQA TESQEGNTQQESAASQN >gi|226332044|gb|ACIB01000012.1| GENE 27 34497 - 35336 776 279 aa, chain + ## HITS:1 COG:BH2422 KEGG:ns NR:ns ## COG: BH2422 COG0575 # Protein_GI_number: 15614985 # Func_class: I Lipid transport and metabolism # Function: CDP-diglyceride synthetase # Organism: Bacillus halodurans # 113 267 108 256 264 103 44.0 3e-22 MKNNFLQRAITGILFVAIIVGCILYDPLAFGTLFVIVSALTIREFGHLVNQSGEVSINRT ITMLGGAYLFLAIMGFCIDAAGSKIFIPYLILIIYLMVSELYLKKKNPVLNWAYSMLSQM YIALPFAMLNVLAFQNDPEASSVSYNPILPLSIFVFLWLNDTGAYCFGSLFGKHRLFERI SPKKSWEGSIGGGIVAIASSFVFACYFPIMTWAEWAGLALVVVIFGTWGDLTESLLKRQL QIKDSGSILPGHGGMLDRFDSSLMAIPAGVIYLYALTLV >gi|226332044|gb|ACIB01000012.1| GENE 28 35451 - 36194 628 247 aa, chain - ## HITS:1 COG:no KEGG:BF0772 NR:ns ## KEGG: BF0772 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 247 1 247 247 495 99.0 1e-139 MKKIKLLWMAMLTLMLPALQSCDDNDGYSLGDIAVDWATVRVVGGDTYSLNADRWGTLWP AATAIPFYKPIDGQRVITYFNPLYDNYEGYDHAVKVEHNYNVLTKQVEDLTAENESEFGN DPVWVNKDMMWIGGGYLNVIFRQNLPVKEKHLVSLVRDMRATVAEGEDDGYIHLELRYKT YDDVTARQANGAVSFNLNSLDLTGKKGIKVKLNSVKDGETEVVFNLKGQSMPEEAKQVTL SDEVQIK >gi|226332044|gb|ACIB01000012.1| GENE 29 36282 - 37415 992 377 aa, chain - ## HITS:1 COG:alr2274 KEGG:ns NR:ns ## COG: alr2274 COG0763 # Protein_GI_number: 
17229766 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipid A disaccharide synthetase # Organism: Nostoc sp. PCC 7120 # 1 375 1 384 384 161 31.0 2e-39 MKYYLIVGEASGDLHASHLMAALKEEDPEAEFRFFGGDLMAAVGGTMVKHYKELAYMGFI PVLLHLPTIFANMKRCKEDIVAWSPDVVILVDYPGFNLDIAKFVHAKTKIPVYYYISPKI WAWKEYRIKNIKRDVDELFSILPFEVGFFKGHRYPIHYVGNPTVDEVTAFKASHQESFAD FIADSELADKPIIALLAGSRKQEIKDNLPDMIRAASAFPGYQLVLAAAPGISPEYYAKFV KGTELAVIFDRTYRLLQQADVALVTSGTATLETALFRVPQVVCYHTPVGKLVSFLRRHIL KVKFISLVNLIAGREVVRELVADTMTVENMRAELECLLFREDYRRKMLDGYEEMARLLGP AGAPRHAAREMVKLLKK >gi|226332044|gb|ACIB01000012.1| GENE 30 37412 - 38179 630 255 aa, chain - ## HITS:1 COG:aq_832 KEGG:ns NR:ns ## COG: aq_832 COG0496 # Protein_GI_number: 15606188 # Func_class: R General function prediction only # Function: Predicted acid phosphatase # Organism: Aquifex aeolicus # 6 254 2 248 251 157 38.0 2e-38 MENKRPLILVSNDDGIMAKGISELIKFLRPLGEIVVMAPDAPRSGSGCALTVTQPVHYQL LKKDVGLTVYKCSGTPTDCIKLARNQILDRKPDLVVGGINHGDNSATNVHYSGTMGIVIE GCLNGIPSIGFSICDHAPGADFDAAGPYVRRIAAMVLEKGLPPLTCLNVNFPNTQEIKGV RICEQAKGHWSGEWQACPRRDDANFYWLTGEFIDHEPENEKNDHWALANGYVAITPTVVD MTAYHFMDELKSWEL >gi|226332044|gb|ACIB01000012.1| GENE 31 38684 - 39451 774 255 aa, chain + ## HITS:1 COG:lin2923 KEGG:ns NR:ns ## COG: lin2923 COG1192 # Protein_GI_number: 16801982 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: ATPases involved in chromosome partitioning # Organism: Listeria innocua # 1 251 1 251 253 276 56.0 2e-74 MGKIIALANQKGGVGKTTTTINLAASLATLEKKVLVVDADPQANASSGLGVDIKQSECTI YECIIDRANVQDAIHDTEIDSLKVISSHINLVGAEIEMLNLKNREKILKEVLTPLKEEYD YILIDCSPSLGLITINALTAADSVIIPVQAEYFALEGISKLLNTIKIIKSKLNPALEIEG FLLTMYDSRLRQANQIYDEVKRHFQELVFKTVIQRNVKLSEAPSYGLPTILYDAESTGAK NHLALAKELISRNSK >gi|226332044|gb|ACIB01000012.1| GENE 32 39460 - 40350 1126 296 aa, chain + ## HITS:1 COG:ML2706 KEGG:ns NR:ns ## COG: ML2706 COG1475 # Protein_GI_number: 15828464 # Func_class: K Transcription # Function: Predicted transcriptional regulators # 
Organism: Mycobacterium leprae # 32 295 64 334 335 193 40.0 3e-49 MATQRRNALGRGLDALLSMEEVKTEGSSSINEIELSKISVNPNQPRREFDETALEELADS IREIGIIQPITLRKVSDDEYQIIAGERRYRASQKAGLDTIPAYIRTADDENVMEMALIEN IQREDLNSVEIALAYQHLIEQYDLTQERLSERVGKKRTTIANYLRLLKLPAPIQMALQNK QIDMGHARALITLGDPKLQVKIFEEILEHGYSVRKVEEIVKSLSEGEAVKSGTKKITPKR AKLPEEFNMLKQHLSGFFNTKVQLTCSEKGKGKISIPFSNEEELERIMEIFDSLKK >gi|226332044|gb|ACIB01000012.1| GENE 33 40351 - 41217 538 288 aa, chain + ## HITS:1 COG:no KEGG:BF0766 NR:ns ## KEGG: BF0766 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 288 1 288 288 560 100.0 1e-158 MAKKTKTYPLYIALLLCFFQVAGIDVYAQEPVKVSQDSISPVREAPKARARRHREPVVST PATDSVKVEKAVVLPPIDSLENLKPAIVTADSLEEVNRQNLERIETPVMPSVVKADSLPP VMPKKLFVPNPTKATWYAIVFPGGGQIYNRKYWKLPIIYGGFAGCAYALSWNGKMYKDYA QAYMDIMDNNPNTNSFQDLLPPNHNYTDTQLKDLLRKRKDTYRRYRDLSIFAVIGVYLIS IIDAYVDAELSNFDISPDLSMRVEPTIINNNPLQPGSKSVGVQCSLRF >gi|226332044|gb|ACIB01000012.1| GENE 34 41242 - 42540 934 432 aa, chain + ## HITS:1 COG:PA1812 KEGG:ns NR:ns ## COG: PA1812 COG0741 # Protein_GI_number: 15597009 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) # Organism: Pseudomonas aeruginosa # 126 430 118 462 534 170 32.0 6e-42 MKKLANYCPLILLFLFATPKVNAQSVDVVIRDNGKERQESIELPKSMTYPLDSLLNDWKA KNYIDLGKDCSTAEINPLFSDSVYIDRLSRMPTVMEMPYNEIVRKFIDMYAGRLRNQVSF MLSACNFYMPIFEEALDAYNLPLELKYLPIIESALNPSAVSRAGAGGLWQFMIGTGKMYG LESNSLVDDRRDPIKATWAAARYLKDLYDIYHDWNLVIAAYNCGPGTINKAIRRSGGETD YWSIYNYLPKETRGYVPAFIAANYVMTYYCDHNICPMETNIPESTDTIQVNKNLHFQQIA DLCNVPMDQIRSLNPQYKKEIIPGESKSYTLRLPQNAVSSFIDRQDTIYAHRAGELFKNR RTVAIRDDSSASKRRGSSAKAGSGTPTYYKIKNGDTLGAIAAKYGVRVKDLQNWNGLRGT NISAGKRLKIYK >gi|226332044|gb|ACIB01000012.1| GENE 35 42609 - 44852 2074 747 aa, chain + ## HITS:1 COG:VC2710 KEGG:ns NR:ns ## COG: VC2710 COG0317 # Protein_GI_number: 15642704 # Func_class: T Signal transduction mechanisms; K Transcription # 
Function: Guanosine polyphosphate pyrophosphohydrolases/synthetases # Organism: Vibrio cholerae # 18 747 6 705 705 459 36.0 1e-129 MDTENQKEIAEEQMIEQAFQELLNDYLATKHRKRIEIITKAFNFANQAHKGIKRRSGEPY IMHPIAVAKIVCNEIGLGSTSICAALLHDVVEDTDYTVEDIENIFGAKIAQIVDGLTKIS GGIFGDRASAQAENFKKLLLTMSDDIRVILIKIADRLHNMRTLGSMLPNKQYKIAGETLY IYAPLANRLGLYKIKTELENLSFKYEHPEEYQEIEEKLNATAAERDKVFNEFTAPIREQL DKMGLKYRILARVKSIYSIWNKMQTKHVPFEEIYDLLAVRIIFEPRNMDEELNDCFDIYV SISKIYKPHPDRLRDWVSHPKANGYQALHVTLMGNNGQWIEVQIRSERMNDVAEQGFAAH WKYKEGGGSEDEGELEKWLRTIKEILDDPQPDAIDFLDTIKLNLFASEIFVFTPKGELKT MPQNSTALDFAFSLHTDIGSHCIGAKVNHKLVPLSHKLQSGDQVEILTSKSQRVQPQWEV FATTARARAKIAAILRKERKTFQKEGEELLNEFFKKEEIRPEAAVIEKLCKLHNMKNEEE FLVAIGNKTIVLGDADKNELKEKQSSNWMKYLTFSFGNNKDKQQEEKEPQEKEKINTKQI LKLTEDALQKKYIMAECCHPIPGDDVLGYMDENDRIIIHKRQCPVAAKLKSSYGNRIIAT EWDTHKKLSFLVYIYIKGIDNVGLLNEITQVISRQLNVNIRKLDMETDDGIFEGKVQLYV HDVEDVKAICNNLRKIPNIKSVTRVEN >gi|226332044|gb|ACIB01000012.1| GENE 36 44907 - 45272 379 121 aa, chain - ## HITS:1 COG:AGc2183 KEGG:ns NR:ns ## COG: AGc2183 COG0789 # Protein_GI_number: 15888519 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 3 82 26 105 203 64 38.0 4e-11 MQNTERDLKLYYSISEVAQMFDVNESLLRFWEKEFPQISPKKGSRGVRQYRKEDVETIRL IYHLVKERGMTLPGARQKLKDNREATIRNFEIIDRLKQIRQELIGMRDALDGFSTRREEE Q >gi|226332044|gb|ACIB01000012.1| GENE 37 45287 - 46255 750 322 aa, chain - ## HITS:1 COG:CC1872 KEGG:ns NR:ns ## COG: CC1872 COG0739 # Protein_GI_number: 16126115 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Caulobacter vibrioides # 167 294 245 369 383 124 47.0 2e-28 MRKVYYIYNPQTQTYDRIYPTVRQRALSILRRLFIGMGLGAGSFIILLLIFGSPSEKELR KENSQLLAQYNVLSRRLDEAMGVLQDIQQRDDNLYRVIFMADPVSPAIRQAGYGGTNRYE QLMYMANSKLVINTTQKMDVLSKQLYIQSKSFDDVVAMCKNHDQMLKCIPAIQPISNKDL RKTASGYGTRIDPIYGTTKFHAGMDFSAHPGTDVYATGDGTVVKMGWETGYGNTVEIDHG 
FGYRTRYAHLQEFRTKLGKKVVRGEVIAGVGSTGKSTGPHLHYEVHVKGQVVNPVNYYFM DLSAEDYERMIQIAANHGKVFD >gi|226332044|gb|ACIB01000012.1| GENE 38 46359 - 48977 2803 872 aa, chain + ## HITS:1 COG:ZalaS KEGG:ns NR:ns ## COG: ZalaS COG0013 # Protein_GI_number: 15803211 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Alanyl-tRNA synthetase # Organism: Escherichia coli O157:H7 EDL933 # 3 870 4 871 878 653 43.0 0 MLTAKETRDSFKNFFESKGHQIVPSAPMVIKDDPTLMFTNAGMNQFKDIILGNHPAKYHR VADSQKCLRVSGKHNDLEEVGHDTYHHTMFEMLGNWSFGDYFKKEAISWAWEYLVDVLKL NPEHLYATVFEGSPEEGLERDNEAASYWEQYLPKDHIINGNKHDNFWEMGDTGPCGPCSE IHIDLRPAEERAKISGRDLVNHDHPQVIEIWNLVFMQYNRKADSTLEPLPAKVIDTGMGF ERLCMALQGKTSNYDTDVFQPLIKAIAQMAGTEYGKNEQNDIAMRVIADHIRTIAFSITD GQLPSNAKAGYVIRRILRRAVRYGYTFLGQKQAFMYKLLPVLIDSMGDAYPELIAQKELI EKVIKEEEESFLRTLETGIRLLDKTMADTKANGKTEISGKDAFTLYDTFGFPLDLTELIL RENGMTVNVEEFDAEMQQQKQRARNAAAIETGDWIILKEGTTEFVGYDYTEYETSILRYR QVKQKNQTLYQIVLDYTPFYAESGGQVGDTGVLVNEFETIEVIDTKKENNLPIHITKKLP EHPEAPMMACVDTDKRAACAANHSATHLLDEALREVLGEHVEQKGSLVTPDSLRFDFSHF QKVTDEELRKVEHLVNAKIRANVPLQEHRNIPIEEAKELGAIALFGEKYGDHVRVIQFGS SIEFCGGTHVAATGNIGMVKIISESSVAAGVRRIEAYTGARVEEMLDTIQDTLSDLKALF NNAPDLGVAIRKYIDENAGLKKQVEDFMKEKEAAVKERLLKNVQEINGIKVIKFCLPMPA EVVKNIAFQLRGEITENLFFVAGTVDANKPMLTVMISDNLVAGGLKAGNLVKEAAKLIQG GGGGQPHFATAGGKNPDGLNAAVEKVLELAGI >gi|226332044|gb|ACIB01000012.1| GENE 39 49224 - 50078 457 284 aa, chain - ## HITS:1 COG:PA0248 KEGG:ns NR:ns ## COG: PA0248 COG2207 # Protein_GI_number: 15595445 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 151 278 156 285 288 59 31.0 9e-09 MKTDRLLADTKTIRYEETDLASLISAPYHFRCGIYLICTQGEAVVSTGVQKYIFNEQTEL IFLTGGLLQVLETSGDLQVKMLMFPKEAFLNAMLPIDTPYFNYTHEHPCYHHTADERSQK TWRQINLWMDMAQMLFTEPMPQFRGQQEHNFLQSLLMWLFNTIPEKLAVSKQYSRTQLLC HRFMQLIREYSMHEHQVAFYAEKLCISSRYLHKITVRHLDGKKPKQLIDEQLVAEIKVLL NEPRLSVTEIAEQLHFPDQSYLTHFFKKNTGISPKEFRAIKNLG >gi|226332044|gb|ACIB01000012.1| GENE 40 50228 - 51421 609 397 aa, chain + ## 
HITS:1 COG:ECs1866 KEGG:ns NR:ns ## COG: ECs1866 COG0477 # Protein_GI_number: 15831120 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli O157:H7 # 13 360 10 357 387 192 37.0 7e-49 MTDRQNSRRFLLLFLGVLSAFGPFIMDMYLPTLPAMADFFHTSSSMVQLGLTTSMIGLAA GQLIFGPLSDKYGRRPPLLLAMILFLLATVGCIFSHTISQFVTSRFLQGIAGAGGVVISR SIATDEYSGQQLAGMLAVIGGINGIATVIAPIGGGILAQITSWQGIFICLFFMGVVLLSG SLHLNESLPAKHRQTVSWQDLYHSFGEVLRNRRYVGYVLQYGFTMGILFVNISSAPFIMQ QHYGLSPLSFSLCFGINAIAMVIFSAISIKLPTMERALYIGSRGMLSVSALLMVFLSLGC DFWIYELLIFALLSMIGMTFTASNTLAMECERRNAGIASALLGATGFAFGGIVSPLVSLG DMMTSTGILFLAGSACAYACTRYVLSQSVQPSGVLYH >gi|226332044|gb|ACIB01000012.1| GENE 41 51920 - 54136 2120 738 aa, chain + ## HITS:1 COG:XF0842 KEGG:ns NR:ns ## COG: XF0842 COG3537 # Protein_GI_number: 15837444 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Xylella fastidiosa 9a5c # 24 732 41 764 790 521 40.0 1e-147 MKVLFHSLFILLFVFTACTSTPKQATIDYTQYVNPFIGTDFTGNTYPGAQVPFGMVQLSP DNGLPGWDRISGYFYPDSTIAGFSHTHLSGTGAGDLYDISFMPVTLPYKEAEAPLGIYSK FSHQDESATAGYYQVLLKDYGINVELTATERCGIQRYTFPEAKAAIFLNLKKAMNWDFTN DSHIEVIDSVTIQGYRFSDGWARDQHVYFRTRFSKPFTAVQMDTTAILKDGKRMGTATIA RFDFDTQKGEQILVNTALSGVSMEGAAQNLAAEVPEDNFDKYREAARDNWNRQLSKIAVK GDHKDDWVNFYTALYHTMLAPTIYSDVDGSYYGPDKKVHRTDGWVNYSTFSLWDTYRAAH PLFTYTEPERTNDMVQSFLAFYEQNGRLPVWNFYGSETDMMIGYHAVPVIVDAYLKGIGN FDPEKALEACVATANLDNYRGIGAYKELGYVPFNEKDSYNAENWSLSKTLEYAYDDYCIA RMAEKLGKKEIADEFYKRSQNYRNVYNPATSFMQPRDDKGEFQKDFKADAYTPHICESNG WQYFWSVQHDIDGLIGLTGGKERFAQKLDSMFTFHPSADDELPLFSTGMIGQYAHGNEPS HHVIYLYNAVDQPWKTQEYVAKVMNELYLNSPAGLCGNEDCGQMSAWYVFSAMGFYPVNP ISGQYEIGTPLFPEVQLHLDNGKTFTVKAPAVSKENIYIRSTKLNGKPYDKSYITHEQIM SGATLEFEMGKEKVTSDQ >gi|226332044|gb|ACIB01000012.1| GENE 42 54175 - 54729 467 184 aa, chain + ## HITS:1 COG:PA1363 KEGG:ns NR:ns ## COG: 
PA1363 COG1595 # Protein_GI_number: 15596560 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 15 172 88 240 246 64 28.0 1e-10 MDDTFDITYKALFRRYYPNLMFYATRLVGDEEAEDVVQDVFVELWKRRDSMVIGDQIQAF LYRAVYTRALNVLKHRSIEDGYCAAVEEINQKRAEFYQPDNNEVIRRIEDRELRNEIYQA INELPDKCKEVFKLSYLHEMKNKEIADVMGVSLRTVEAHMYKALKFLRNRLGHLWFLFLL FFIK >gi|226332044|gb|ACIB01000012.1| GENE 43 54782 - 55780 740 332 aa, chain + ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 10 330 21 327 331 73 26.0 4e-13 MNNLSEDILMRYLTGECSDEDFARVNAWIKESDDNARRLFRMEEVYHLGRHDSFPDQKKV ARTEARLYKKLAQEDAHSRKVIRMHQWIRYAAIIVLALMIGTGGGYLYYQADPTRNMITA SSTDGKVKEVMLPDGTKVWLNQSATLKYPKEFSESERDVYLDGEAYFEVTKNRRCPFVVE SEAMRIKVLGTTFNFKCDKSHKLAEATLIEGEIEVRGNHDEGMIILSPGQKAELNKTTRR LVVKQVDAKLDAVWHNDLIPFEQADIFAITRTLERFYDVKIILSPDIKSDKTYSGVLKKK DNIESVLQSLDNSIPINYKIVGDNIFISSRNK >gi|226332044|gb|ACIB01000012.1| GENE 44 55810 - 58077 1745 755 aa, chain + ## HITS:1 COG:XF0842 KEGG:ns NR:ns ## COG: XF0842 COG3537 # Protein_GI_number: 15837444 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Xylella fastidiosa 9a5c # 28 748 45 761 790 547 39.0 1e-155 MKFKTFLAGCLGGLLSLNSCTNSPDMTDYAAYVNPFIGTGGHGHTFPGAIVPHGMISPSP DTRIDGWDACSGYYYADSTINGFSHTHLSGTGCCDYGDVLLMPTVGEQKYLPTGSQSQQM AYASAFSHQNETAEPGYYSVFLDTYQVKAELTASKRAAIHRYTFPESKEAGFILDLDYSL QRQTNKEMEIEVISPTEICGRKKTMYWAFDQYINFYAKFSKPFTYTLVTDSMALDDGGRL LPTCKALLHFDTTKDEQVFVKVGISAVDIEGARKNVETEIPDWDFDGIRKDARKAWNETL SKIDITTNDKNDKTIFYTALYHTAISPNLFTDVDGRYLGMDLQVHQGDTLNPMYTIFSLW DTFRALHPLMTIIDPDLNNAFIRSLIKKHQEGGIFPMWDMASNYTGTMIGYHAVPVIVDA YMKGDRNFDIQEAYRACVRAAEYDTTGIKCPDLVLPHLMPKAKYYKNSIGYVPCDRENES 
VAKALEYAYDDWCISVFADALNDYDTRDKYARFAKAYEFYFDPGTRFMRGLDSKGEWRTP FNPRSSTHRNDDYCEGTAWQWTWFVPHDIEGLVKLMGGEDAFVGKLDSLFTADSSLEGET TSSDISGLIGQYAHGNEPSHHVIHMYNYVNRPWRTQELVDSVYRSQYANAVDGLSGNEDC GQMSAWYVLNSMGFYQVCPGKPVYSIGRPAFDKAVVNLPDGKKFTVIAKNNSKKNKYIKS MTLNGKPLDKPFFTHDDIIAGSTLEIEMTDRRTQP >gi|226332044|gb|ACIB01000012.1| GENE 45 58180 - 60450 1840 756 aa, chain + ## HITS:1 COG:L135972 KEGG:ns NR:ns ## COG: L135972 COG3537 # Protein_GI_number: 15673483 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Lactococcus lactis # 31 754 11 716 717 436 33.0 1e-122 MRKILLTACLIACSLMAEAKDWTQYVNPLMGTQSSFELSTGNTYPAIARPWGMNFWTPQT GKMGDGWQYTYTANKIRGFKQTHQPSPWINDYGQFSIMPVTGKLEFDEEKRASWFSHKGE IATPSYYKVYLAEHDVVTEMTPTERAVLFRFTFPENEHSYIVVDAFDKGSYVKVIPEENK IIGYTTRNSGGVPENFKNYFVIEFDKPFTYKGTFADKKLEEGNLEQKADHTGAIIGFSTR KGEIVHARIASSFISFEQAAQNLKELGNDSFEQLAQKGNDAWNNVLGKIEVEGGNLDQYR TFYSCLYRSLLFPRKFYEFTADGQPIHYSPYNGQVLPGYMYTDTGFWDTFRCLFPFLNLM YPSVNKEIQEGLINTYKESGFFPEWASPGHRGCMIGNNSASVLVDAYMKGVKVDDVKTLY EGLIHGIENVHTEVSSTGRLGYQYYNKLGYVPYDVKINENTARTLEYAYDDWCIYQLAKA LNRPKKEIELFAKRAMNYRNVFDKESKLMRGRNENGQFQSPFSPLKWGDAFTEGNSWHYS WSVFHDPQGLIDLMGGKKMFITMLDSVFAVPPVFDDSYYGQVIHEIREMTVMNMGNYAHG NQPIQHMIYLYNYAGQPWKAQYWLRQVMDRMYTPGPDGYCGDEDNGQTSAWYVFSALGFY PVCPGTDEYVIGAPLFKKATLHFENGNNLVIDAQNNSKENLYIESLRVNGQESTRNYLKH ADLLQGGTIEFKMGSHPNLNRGINDDDAPYSFSKMK >gi|226332044|gb|ACIB01000012.1| GENE 46 60696 - 64112 3225 1138 aa, chain + ## HITS:1 COG:no KEGG:BF0752 NR:ns ## KEGG: BF0752 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1138 1 1138 1138 2172 99.0 0 MKNKILFQRPRLLKASLVLLLAAAPIQWSFAQFTFSTSRTTLGQVIKTVQSQSKYQFFYD DQSSNMPIENLNVKNVSLEQLLNTALKGKNLTYKIEDNIVYLSQASKQQDPVQASAKRQI SGTVVDANGEALIGVNVSVKGTSTGAITDLEGKYTLTVNDPKAEIVFSYIGYKQQVLPAK ESILNVTMSEDTQMISEVVVTALGIKREKKMLGYAVQEIKGDQLNQTGDPSVTSALQGKV AGLQMNTAGTGLGGSTKITIRGNSSLADNNQPLWIVDGVPFSDNNNSDATYYGGVDRGGS SVDINPEDIESISVLKGPNAAALYGSRAGNGVILVTTKKGSKKDGFGVRYSGNFTWSQVA 
ETLEMQDRYGQGHIVTQDENKNPLSQYYAKYDPTDSSSWGPVLDGSMQKAWNGDTYAFSK YGNKLKDYFDTGFAQNHNVSVSNVTDKSHFRASFGSSNNKGVFPNEKLNRINLDLNAGME MNKYLSMDGKISLSRTKAEDRPYFGTYGAIAQLMGIPNNIRLNDLKQYSTDGNAHVNWTG PTAGIRNPYYVLNQRHNSDERWRAFGYYGMKINFTDWLHLSAKYAFDYYRIRIEETNAGD GINGESSIKDITDDEMNREEQNFFESNAEIVLMGDKQLTDNFRLGFTVGGNFMYQNYESL NAGVRNMLDKGQWIFNAANMLNTAGETGHERATNSVFGSLQLSWKEYLSLDLTARNDWSS TLPKKNNSFFYPSANLSFVVSDFVRSLDKTLPNWLTFAKVRLSAAQVGKDTDPYQLYNTY GFKFEKGELIPSKSNVKMNDQLKPEISSSYEAGLDMKFLNNRLGFDFTYYYSRTKNQIMK VPAAAPWSGGKWVNAGLITNKGFEMMIYSTPVQLKDFSFDLNVNLAKNVSNVEKLADGVD YIYFNGDSNFPINVGARPGHKLGEIYAKTLYKRDEQGNIIINKENGLPMTTTDADERLAK PIGNIQPNLLMSVSPSFTYKGFTLSAMFDMKFGGDIISISEMNATGSGMAKRTMNRGESD NFMMIFPGVYEDGTPNTQKISASNYYGAQNAEDFIYDASFIKLKELAIGYTFPKSMLKKT PINSLNVSFVARNLAYLLKHTPGTSPEGGYDTTMFSQAIDFMAVPYTRTFGFSVNLGF >gi|226332044|gb|ACIB01000012.1| GENE 47 64135 - 65868 1601 577 aa, chain + ## HITS:1 COG:no KEGG:BF0751 NR:ns ## KEGG: BF0751 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 577 1 577 577 1161 99.0 0 MKIKIYKSLAMCTLLLGLAACNDFEEMNTDPYAPVYDPEIIGATPDGIDIDYELTPSALK SLQGTESAIGSVFANLTYEGLYNDYQVTTNLTHDIYAAYFANNVSGFVTNSPTYGYNDGW SKRRWEHFYDNRTVEEYSQLIKTFWFCGKERYHNAFYMTRIYYAFLISMQTDTYGAIPLE YYVKGAMPTDEKVKYMDQKDVYDVIFKMLDQAITALHNTPSTAQYSLGENDKCFGGDKDK WLRFANTLRLRLALRVSNVDPQLAKEQGEKAMADPAGLMQSDDDNMKQTPKYSYITGGNE NIYTLLYNWSANVVLSKEMERAYKEQSTILDPRCEILWWRPTALEDLNQTEPKEDMTKDF NGCENGETSLGGSYTTTYSPSRVFIKQDQKKLDRKHWWCYAREIVWLGYSESLFLRAEAA LRGWAGAKGTAEDFYKEGIEASFNYYQIGADEEGQEKISKYMEGLKGLQAFKSGDREAQL EQIITQKWIAVYPNGNEGWAEFRRTDYPRYMKTPKGGNNSGGEVANGKFIKRLRYPDSEV SNPNRPKDVDTQGTRLWWDVADTNDDGGNYVTPNNFR >gi|226332044|gb|ACIB01000012.1| GENE 48 65964 - 67382 1035 472 aa, chain - ## HITS:1 COG:SMc01414 KEGG:ns NR:ns ## COG: SMc01414 COG0507 # Protein_GI_number: 15965850 # Func_class: L Replication, recombination and repair # Function: ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member # Organism: Sinorhizobium meliloti # 21 464 41 409 
410 91 26.0 4e-18 MINNYLERQIKENFSYQPTFEQEIAVKSLSEFLLSTANDTVFVLRGYAGTGKTSLVGALV KAMDKLQQKSVLLAPTGRAAKVFSAYAGHPAFTIHKKIYRQQSFSNEVSNFSINDNLTTH TLYIVDEASMISNEGLSGSMFGTGRLLDDLVEFVYSGVGCRLLLMGDTAQLPPVGEEQSP ALATEALKGYGLNVIEVDLTQVVRQVQSSGILWNATQIRQLIAEDECFSLPKIKVSGFPD IQVVRGDELIDTLTGCYEKDGMDETIVVCRSNKRANIYNKGIRAQILYREDELNTGDLLM VAKNNYFWTEKYKEMDFIANGEIAVVRRVRRTRELYGFRFAEVLLAFPDQNDFELEANLL LDTLHSDAPALPKTENDRLFYSVLEDYVDITVKRERMKKMKADPHYNALQVKYAYAVTCH KAQGGQWQNVFLDQGYMSDEYLTPDYFRWLYTAFTRASKTLYLVNYPEEQIE >gi|226332044|gb|ACIB01000012.1| GENE 49 67479 - 68162 897 227 aa, chain + ## HITS:1 COG:no KEGG:BF0749 NR:ns ## KEGG: BF0749 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 227 1 227 227 447 100.0 1e-124 MKTVFNIVLGVCAIALVYICYASIMGPINFEKAKKHRDKAVVARLIDIRKAQAEYRNIYK QYTASFDTLIDFVKTQKIPFVSKEGVLSDKQLEDGMTEKKAMALINKAKKTNNWKEVEAA GLMGFKRDTIWVAVTDTIYDKSFNADSLRYVPFGNGAQFEMYTKNDTTKSGAPIFLFQAN TPYDVYLNGLDKQEIANLKDLQVKLGKYAGLMVGSIDTPNNGAGNWE >gi|226332044|gb|ACIB01000012.1| GENE 50 68170 - 68964 483 264 aa, chain + ## HITS:1 COG:no KEGG:BF0677 NR:ns ## KEGG: BF0677 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 264 1 264 264 505 99.0 1e-142 MPDRAIAETIDFGKSEQYTLSIRLSTDGFSFSIYNPIHDSSLSFFEKEVEASLSLTANLK QAFRELDFLNHTYKRVNILMADKHFTLIPLELFEDDQSEMIFYHNHTPKENETVKYNILK KNNAVVIFGMDKSTCQFLSDQYPEARFYSQAAPLAEYFSAKSRLGNSKKIYASIRRDAID FFCYERGHLLLANSFECRKTGDRIYYLLYLWKQLNFDQERDELHLTGAFYDKEKLMQELR KYIQQVFIMNPASNIDMQALLTCE >gi|226332044|gb|ACIB01000012.1| GENE 51 68955 - 69488 208 177 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163764797|ref|ZP_02171850.1| ribosomal protein L29 [Bacillus selenitireducens MLS10] # 1 177 13 193 199 84 31 3e-15 MRVISGIYKRRRFDVPRTFKARPTTDFAKENLFNVLSNYMDFEEGIVALDLFAGTGSISI ELVSRGCDRVISVEKDGAHHAFISKIMKEVQTDKCLPIRGDVFKFINGSHERFDFIFADP PYELKELETIPDLIFKNNLLKEDGLFVLEHGKKNNFEDHPHFIERRVYGSVNFSFFR >gi|226332044|gb|ACIB01000012.1| GENE 52 69506 - 70945 1073 479 aa, chain - ## 
HITS:1 COG:BS_ywnE KEGG:ns NR:ns ## COG: BS_ywnE COG1502 # Protein_GI_number: 16080712 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes # Organism: Bacillus subtilis # 11 479 3 482 482 352 38.0 7e-97 MIDWNYIASVIATVAFDIIYFGAIIGTIVIVILDNRNPVKTMAWILILMFLPVVGLVFYF FFGRSQRREKIIGKKSYDRLLKKPMAEYLAQNCCETPKEYARLIQLFQNTNQAFPFEGNR VDIYTGGYSKLQALLRELQKARLHIHMEYYIFEDDPVGRLVRDVLIEKAREGVEVRVIYD DVGCWHVPHRFFEEMRDAGIEVRSFLKVRFPLFTSKVNYRNHRKIVVIDGRIGFIGGMNL AERYMRGFSWGIWRDTHILLEGKAVHGLQTAFLLDWYFVDRTLITASRYFPKIEAYGNSL VQIVTSEPIGPWKEIMQGLTVAISGAKKYFYMQTPYFLPTEQILGAMQTAALAGVDIRLM LPEHADNRVTHLGSCSYLADVLRAGVKVYFYKKGFLHSKLMVSDDMLSTVGSTNLDFRSF EHNFEVNAFMYDMETALQMREIFLQDQRESTQIFLKSWEKRSSRQKAMESVVRLLAPLL >gi|226332044|gb|ACIB01000012.1| GENE 53 71108 - 74020 2352 970 aa, chain - ## HITS:1 COG:no KEGG:BF0745 NR:ns ## KEGG: BF0745 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 947 1 947 970 1734 100.0 0 MKKLLIWTVLLLTATLSTYAQNKIVTVSGRVIEAGTKEPVELAAVQLLSLPDSAQVAGMT TSTQGYFSLSKQKPGKYLLKVSFIGYVTKIIPVQLTANVPAKKMGNIELATDAVMLQEAV VVAEAPQVTVVEDTLMYNSSAYRTPEGAMLEELVKKLPGAEIDDDGNVKINGKDLKKIMV DGKEFFGGDVKTGLKNLPVDMVDKLKTYDKKSDLARVTGIDDGEEETVLDLTVKKGMNQG WFGNADLGAGTKDRYTGRMMLNRFVDKTQFSIIGSANNVNDQGFSGGGGGPRWRSNNGLN ATKMLGANFATQTNKLELGGSVRYNFQDADISSINSSERFLQNGNSYSNSNNKNRNKGTN LNADFRMEWKPDTLTNIIFRPNFSYGRTNNASRSESGTFNEDPFNLIVNPNDYLNFDNLS DDPLKDIRVNATNSASLSKGKSLSGNATLQVNRKLNNRGRNLTFRGVFGYGDNDNDQYTQ SETRYYQLLNHLGGDSILYRNQYITTPTRNYNYTAQVTYSEPIAKATFLQFSYQFQYKYS KSDKTTFDLLDYPDWAIGGALPSGYESHAVDSLSKNAEYRYYNHDASVGLRFIRPKYQLN VGMSFQPQNSTLSYKKGDYMIDTTRTVFNFAPNMDLRFRFSKVSQLRFTYRGRSNQPTME NLLPITDNSNPLNIRMGNPGLKPSFAHTMRLFYNTYNAEKQRGIMTHFSFTATQNSISNS TRYNEETGGLITRPENINGNWNAFGMFGFNTALKNKKYTINTFTNVNYQNNVAFLYNQDT KNNDRNTSTGLTLGERVTGSYRNDWFEFSLNGSINYTAERNKLRPENNQEPYTYSYGAST NITMPWKMTLATNIANQSRRGYRDSSMNRDELIWNAQLAQSLLKGAATVSFEVYDILRQQ 
SNISRSLSADMRSVSEYNGINSYCMVHFIYRLNIFGSKAAREKMMNSGRRGFGGPGRGPG GGFGGGHPRF >gi|226332044|gb|ACIB01000012.1| GENE 54 74093 - 74497 363 134 aa, chain - ## HITS:1 COG:no KEGG:BF0744 NR:ns ## KEGG: BF0744 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 213 100.0 2e-54 MKALLPVFCRPLGYVILVISMFIPFLLLMQGMVTDSNLLFYKECIKLLMILGAMMIIFAL SRNEGRETEIIRNKATRNAIFLTVLFLFGGMLYRVATGDIMTVDTSSFLIFLIINVLCLE FGMQKARIDKIFKR >gi|226332044|gb|ACIB01000012.1| GENE 55 74634 - 75695 994 353 aa, chain + ## HITS:1 COG:FN0871_1 KEGG:ns NR:ns ## COG: FN0871_1 COG0337 # Protein_GI_number: 19704206 # Func_class: E Amino acid transport and metabolism # Function: 3-dehydroquinate synthetase # Organism: Fusobacterium nucleatum # 26 348 26 349 350 182 35.0 7e-46 MSKQEVILCEDLESSLKRAIDNCPHDRLFILTDDHTHRLCLSQLAGLSILKDAVEITIGA EDVHKTLETLASVWQVLSEKGATRHSLLINLGGGMVTDLGGFAAATFKRGIAYINIPTTL LAMVDASVGGKTGINFNGLKNEIGAFAPAASVLIETEFLRTLDAHNFFSGYAEMLKHGLI SNTSHWAELLAFDTEKMDYGYLKKLVGHSVQVKEDIVEQDPFEHGIRKALNLGHTVGHAF ESLALAENRPVLHGYAVAWGVVCELYLSHLKAGFPKEKMRQTIQFIKDNYGAFHFDCKQY DRLYEFMQHDKKNSAGVINFTLLKEIGDISINRTADKDTIFEMFDFYRECMGM >gi|226332044|gb|ACIB01000012.1| GENE 56 75708 - 76148 288 146 aa, chain + ## HITS:1 COG:no KEGG:BF0742 NR:ns ## KEGG: BF0742 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 131 1 131 131 259 97.0 1e-68 MKQRVTSNHIERLQPNEIFVFGSNLSGHHGGGAALLAMNKWGAIWGQGVGLQGQTYGIPT MQGGVETIRPYVDEFIQFANKHPEMTFLVTEIGCGIAGFPPQEIAPLFAKATTTENIHLP QRFWDLLGKIPEHFIYQKIILLIIYF >gi|226332044|gb|ACIB01000012.1| GENE 57 76812 - 77900 330 362 aa, chain + ## HITS:1 COG:no KEGG:BF0670 NR:ns ## KEGG: BF0670 # Name: not_defined # Def: putative transmembrane acyltransferase protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 362 1 362 362 667 100.0 0 MKRIVFLDYVRVFACFLVMIVHSSENFYGAAGSTDMAGPQSYLASEADRLWVAVYDGFSR MAVPLFMIVSAFLLAPMKEEQTTWQFYRQRFLRIIPPFFIFMILYSTLPMLWGQINGETS 
LTDLSRIFLNFPTLAGHLWFMYPLISLYLFIPIISPWLRKASAKEERFFIGLFLLSTCIP YLNRWCGEVWGQCFWNEYHMLWYFSGYLGYLVLAHYIHTHLTWSRSKRFIIGTILMVIGA VLTIYSFYSQAIPGETLSTPVIEIGWAFCTINCVLLTAGTFLMFTCINSSKAPRIITEMS KLSYGMYLMHIFWLGLWATVFKYTLALPTVAAIPCIAISTFICCFITSKLISFIPGSKWI IG >gi|226332044|gb|ACIB01000012.1| GENE 58 78100 - 79680 972 526 aa, chain - ## HITS:1 COG:no KEGG:PRU_0502 NR:ns ## KEGG: PRU_0502 # Name: not_defined # Def: hypothetical protein # Organism: P.ruminicola # Pathway: not_defined # 7 488 3 488 526 250 31.0 1e-64 MHEITPFTIPAMRDFVPEKWRPWIVILFVIVFQFSGGVYLAAVSEMVGSTALMQEDIMMA GYASLVGMSLTFAIMFRLKFRFPSKTAFLTCSIAIILCNLICMYTRSVPLLVTACFFGGI FRMWATFECNSTIQLWLTPQRDLSVFFCYIYLLVQGCLQLSGLITVYTAFWIKWEYMHWL IIALLLLVMIATMILFRNYRSMPKLPLFGIDWLGALMWGIVILCIIFVCVYGEHYDWYES VYIRMVTVGGVVVLLLNLWRASFIRHPFIALVTWRFKAVYLTFLLYMVVDILLAPSHLLE HIYMEAVLGYDSLNVISLNWVLVLGTIVGSVFTFYTFSRRKWCYRTMTVIAFISITGYLM MFYFTIDYNLCKEVLAFPLFLRSFGYVIIAICFITALSRVPFQNFFEAVSVQAFVSASFG GALGSAVLGRVLNVVMQKNVMLLGSTLDALHPQASHIPLGELYAALQQQALMVSMKELYG WLTLISLFCLILLGVGHNSVRPLYALHPRYRVIRRYIRHELRMIKQ >gi|226332044|gb|ACIB01000012.1| GENE 59 79684 - 80718 883 344 aa, chain - ## HITS:1 COG:mll0995 KEGG:ns NR:ns ## COG: mll0995 COG1566 # Protein_GI_number: 13471111 # Func_class: V Defense mechanisms # Function: Multidrug resistance efflux pump # Organism: Mesorhizobium loti # 24 343 74 385 417 156 34.0 6e-38 MNKKTKKMVFNLSVIFVLLLAFGWVCSRFVHLGNVEFTDNAQVRQHIVPINCRIQGFIKK IYFTEYQAVHKGDTLALIEDAEFRFRLVQAEADYQNALSGKSVVTSSIHTIQNNISVSDA GIQEAKVRMDNAEREYKRYQNLLERDAVTRQQYDAVKTNFDASKARYELLFRQKKSTALV KQEQTERLEQTEAGIKLAEATLEIARLNLSYTVIIAPCDGTTGRKEIREGVLVQPGQTLV NLVDDHDKWVVANYKETQTAHIGEGYPVGIEVDAIPGIMFKGIVKSVSQATGASFSLLPQ DNSAGNFVKVEQRIPVRIEFTGENRPEDMKRLRAGMNVECVVNY >gi|226332044|gb|ACIB01000012.1| GENE 60 80730 - 82049 1359 439 aa, chain - ## HITS:1 COG:no KEGG:PRU_0500 NR:ns ## KEGG: PRU_0500 # Name: not_defined # Def: outer membrane efflux protein # Organism: P.ruminicola # Pathway: not_defined # 5 439 9 460 460 337 42.0 5e-91 
MNKFSLFIMMLCASLCPLGVYAQEVAWHIMSIEDMFALADRNSKSLRPYATGICEAREGV RIAGNARLPEIEAALSFSYLGDALLIDRNFSNGTNAPMPHLGNNFSVEASQVVYAGGSIT NSIAIARLQEKMARLGLDAGRDKVGFLLIGYYLDLFKQQNMLQVYEKNIEQTKQVIEELR AKECEGIVLKNDITRYELLLANLELTRTQIENTISILNSNLTTTLGLPEDARIQPDTTIL SKALPIENKENWTNTAYANSPVLKQMLLTVEMSKHQQKITRSEQIPQIALFAGNKLDGPI TIEVPPINKNLNYWYVGVGVKYNFSSLFKASKSIARDKFTVRRNTEQYNDAKEQTGLAVN AAQVKYMETYVELNTQQKSVELANQNYAVIHDRYKNDMALITDMLDASNSKLEAELQLVN ARINIVFNFYKLLYISGTI >gi|226332044|gb|ACIB01000012.1| GENE 61 82868 - 84922 1585 684 aa, chain - ## HITS:1 COG:no KEGG:BF2269 NR:ns ## KEGG: BF2269 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 684 1 694 694 686 52.0 0 MKSMKINIKHLGFYALLGMMPFSVTSCNDLLDLEPVSQITSEVYYETADQLASYLNNYYN SFLQNPYNGYMYHEAAYNDGMARSDRNTDIFVQGISGNTTLFADDYWEVPTDKNMQSSYF SYIRICNYFLEQVLPKYEAGEISGEEELIKNYIGEAYFMRALTYYRALVVFGDFPIVTTV LEDNNDEVVEASKRSPRNEVARFILEDLDKAIDLLASRSKFSGQRLNREAALLFKSRVAL YEGTFEKYHRNSGRVPGDSEWPGANADYNSGKIFGIDGEIEFFLTEAIAAAEEVAGTAQL TSNNHVIEPTVGVTTGWNPYFEMYSQPSLANVTEVLLWKEYNSELSVKHNAPYRCKIGCA DGYTRTFVEAFLMKNGLPIYAENSGYQGDTSIDNAKKDRDERLQLFVWGESTILDTDESA PTVGTLFSKADITASEDEKRCITGYQPRKYYTYDYEQTTNDEIRGTNACPIFRVAEAMLN YMEASYELTGTLNSTALGYWKALRERAGVSTDVEATIAATDLSKEGDFGVYSGTSMVDKT LYNIRRERMAETFSEGLRYTDLLRWRSFDNMLSDKWIPEGVNFWDGMYRNYMVDSDGNEV ELVADGSDNAIVSSSELSKYLRPYSRTMSSTNALKDGYNWHKAYYLYPISIADMQYASPD GSKENTYLYQNIYWPNTGGGHAEE >gi|226332044|gb|ACIB01000012.1| GENE 62 84933 - 88148 2261 1071 aa, chain - ## HITS:1 COG:no KEGG:BF0971 NR:ns ## KEGG: BF0971 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 35 1071 51 1089 1089 1187 58.0 0 MFKKSSSICFLLFFLSNSPGMAYAWTVGIGDGTYVVQQNGKCTGVVKDVTGETVIGASVV VKGTSNGTITDINGSFSLPGVKKGDIIQISFIGYQKVEVVWNNKPLNVTLKEDAQALEEV VVVGYGTQKKVNVTGAVSMVGSEVIESRPVANVTQALQGAIPGLNLTTSVNGGDLNSGMS INIRGTGSISSNDDPLVLIDGIEGDLTLVNPNDIESVSVLKDAASASIYGSRAAFGVILI TTKSGKAGKVKVTYSGDVRFSTATQLPDMVDSYSFATYFNAANSNAGAGNYFSDEIMQKI 
LNFQAGMYTDPSQPEYYGTTANTSTNKWNQYGSSFANTNWFDEFYKKNVPSTQHNLSLSG GTEQFNWSVSGSFLNQNGLIRHGHDELNRYTMNAKIGAILTDWARMDLTTKWTRKDYTKP QYLDGLFFHNIARRWPTCFAVDPNGQWSEGMEIAELEDGGTYDEHNDLFTQQIRFTFEPI KDWRIYMEGSMRVANNKTTENTIPIYHYYVDGTAFLRDSGYGTETYVYDNRYRQNYYTVN VYSDYSKSFGKHNGKIMVGMNYERYDQDNLWGSGTNLTTEEKPYLSQAQSNKKNGDSYWN RATAGYFGRFNYDYDERYLFEFNLRYDGSSRFVGDKRWAWFPSMSLGWNIARESFFEKLS DKISMLKLRASWGQLGNTSSDYNSFTDWYPFYQQQSISSSSGSWLIDGEKQNTASIPDIV SSLLTWETIETWDIGVDWAAFNNRLTGSFDWFCRKTKDMIGPAPTLGSALGTSAPQINNC DMRSIGWELEIGWRDRIGQVKYGARLNLSDATQKILNYPNETGSLSTYYNGKSLNEIWGY VTEGIASSQEEMDAWLVNNKPNWGSGWGAGDVMFKDLDGDGIVSTGDNTIGDSGDRVVIG NSTPRYRIGLNLDVAWKGIDFSVFFQGVLKRDYAFSEGDPYFWGATGNVWQSACFKEHLD YWTENNVDAYYPKPYFGGITKNQVVQTRYLQDASYIRCKNIQLGYTLPQSLIRKIGVDNC RVYVSCDNLFTLTGLSSIYDPEALGSYNSYGTSGKTYPLQRTVALGVTLNF >gi|226332044|gb|ACIB01000012.1| GENE 63 88361 - 89236 212 291 aa, chain + ## HITS:1 COG:PM1524 KEGG:ns NR:ns ## COG: PM1524 COG2207 # Protein_GI_number: 15603389 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pasteurella multocida # 176 291 214 327 334 59 31.0 8e-09 MHLKEFIQYYNVSFTSNEEIAFWYINPKLINENKNIPAIQDLHSILFVIDGELKIEVNGK IFLMNRNCFADINTIHKPTYRLLSASENLRAYHIIMTKEYVMRLVGNRRLFSTRYICERM NNPILQITPSNTGIFIKYLLDIKQIFKNEGHLFRTAILTHCLWNFLAEVANFSLLEKEKT PEEVGNKRKLYIQFLEMISTEINRKYSVNFYASRLCVTPQYLQRVVKMFSNRTVHQWIKE AIMGEIMKLLNETDMTIQQIAEDLGFPDQAALSKFFKQNKKVPPSIYRNKE >gi|226332044|gb|ACIB01000012.1| GENE 64 89473 - 91434 1853 653 aa, chain - ## HITS:1 COG:SMb21160 KEGG:ns NR:ns ## COG: SMb21160 COG3525 # Protein_GI_number: 16264574 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Sinorhizobium meliloti # 95 477 207 622 639 108 24.0 3e-23 MKFVNRFFMILVALCLVCPGVSAVVNPKPFVIPELKEWKGAEGAFVPTETTKIVCPANQP ELLRIARMLADDCETMFGHKPEVVQGKGGAGDVILAIRADKKLGKEGYTVKVTDRILLTA PESIGVYWGTRTLLQIAEQSENHQFPKGTLRDFPDYAMRGFMIDCGRKFIPLSFLQDYVK IMAYYKMNTLQIHLNDNGFKQFFGHDWSKTYAAFRLESDTYPGLAAEDGYYTKREFIDLQ 
KLAENLYVEIIPEIDAPAHTLAFTHYKPEIGSKEYGMDHLDLFNPETYKFMDGLFKEYLE GDEPVFRGKKVHIGTDEYSNKKKDVVEKFRAFTDHYIRYVESFGKQACVWGALTHAKGDT PVKSENVIMSAWYNGYANPKDMIEQGYKLISIPDGLVYIVPAAGYYYDYLNTKYLYENWT PVQIGKAVFPEKDPSILGGMFAVWNDHVGNGISTKDIHHRVYPALQTLAVKMWTGKDVSV PYADFDKQRNAISEAPGVNQLGRIGTTPGLVYEQASVAPNSETPHREIGYDYLVTFDIDG ANEAKGTELFRSPDAVFYLSDPIRGMLGFARDGYLNTFSHRIHKGEKATIGISGDNKSTR LLINGQVVEEMNTQKLYYNAGKDSMNYVRTLVFPLEKAGKFDSKITNLKVYKK >gi|226332044|gb|ACIB01000012.1| GENE 65 91712 - 92881 897 389 aa, chain - ## HITS:1 COG:MA4377_3 KEGG:ns NR:ns ## COG: MA4377_3 COG0642 # Protein_GI_number: 20093164 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Methanosarcina acetivorans str.C2A # 141 382 6 254 311 176 41.0 6e-44 MKQEKQTELLRTQQMLNEVNRKLLMTLGVAEVFPWKWELAEHMVWFDARPSDDPAGWEIF HNDSFPVSIARVYQGIYKEDRLRVIEACQGILKGKMKKVVVELRFWAKKREGFVLEWLEM HAIPGKVDENGRLLTVEGSLMSITRRKVMEEELTAAKEKAEEANRLKSALIANMNHEIRT PLNAIVGFASLLSIIDDEKEQQEYIGLIQSNTEHLLRLMNDVIDLSNIESGVMDIVGSDV VLDSLMKEMELTYVPKAETDNLVLAWDREQSNAHIYTDRDRLIQILEHLLDNAIKFTTKG TVHWGYQEKENGQIRFYVTDTGCGIPNDKKEVIFERFVKLDSFTQGLGLGLSLCKIIIER MHGTIGVESEVGEGSTFWFTIPNLSDNHI >gi|226332044|gb|ACIB01000012.1| GENE 66 92894 - 93769 383 291 aa, chain - ## HITS:1 COG:no KEGG:BF0738 NR:ns ## KEGG: BF0738 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 291 1 291 291 577 99.0 1e-163 MPNRDSEKLITDGYTTTLLGDELMVCENMKHIRVTSVPSYTEVGMIILCLRGTAKICIFD NIHILAKNELAVILPGQLVSITELSHDFLCNIVVLSRALFDDTLSGFSRFSPLFFVYMRS HYWYELTGEEIERFKLFYGSLSQRAKDPTHFFRRESVLLLLRIFYLDIYNEYYSKSHLIK GGSDMGKSKLAHDFFCLIMQHYREHKDVAFYADKLCITSKYLSMVIKEASGKTAKDWIVE YAILEIKAMLKNSTMNIQEISIKTNFANQSSLGRFFRKHTGMTLTEYRMHR >gi|226332044|gb|ACIB01000012.1| GENE 67 94054 - 95292 1091 412 aa, chain + ## HITS:1 COG:PM0839 KEGG:ns NR:ns ## COG: PM0839 COG0128 # Protein_GI_number: 15602704 # Func_class: E Amino acid transport and metabolism # Function: 5-enolpyruvylshikimate-3-phosphate synthase # Organism: 
Pasteurella multocida # 5 396 10 430 440 234 35.0 2e-61 MRYLLSAPSQIKATIQLPASKSISNRALIIHALSKGDDVLSNLSDCDDTQVMIKALTEGN EVIDILAAGTAMRFLTAYLSSTPGIHTITGTERMQQRPIQILVNALRELGAHIEYVRNEG FPPLRIEGTELTGSEITLKGNVSSQYISALLMIGPVLKNGLLLRLTGEIVSRPYINLTLQ LMKDFGASASWTSDQNIQVDPQPYHCLPFTVESDWSAASYWYQIAALSPQADIELTGLFR HSYQGDSRGAEVFARLGVATEYTETGIRLKKNGTCVERLDEDFVDIPDLAQTFVVTCALL NVPFRFTGLQSLKIKETDRIEALKTEMKKLGYILHDKNDSILSWDGERVEQQTCPVIKTY EDHRMAMAFAPAAIHYPTIQIDEPQVVSKSYPGYWDDLRKAGFGIKVGEELR >gi|226332044|gb|ACIB01000012.1| GENE 68 95433 - 98987 2843 1184 aa, chain - ## HITS:1 COG:TM1193 KEGG:ns NR:ns ## COG: TM1193 COG3250 # Protein_GI_number: 15643949 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Thermotoga maritima # 183 1180 7 983 1087 697 39.0 0 MLNFKHLFLVSVALWSAVGMVRAQEFDPKQSYEIHTQNGLVLDNQESLDLGSKIFISKKE PHKESQVWNLIPCGDGCYSIVSPLTELGIDNSGNGSKECPVIQWDPNKENPNQQWRITAL PNGNYLFTSVASGYNLGFPDAGLVAEPVYQLKPDAQKISQQWKIVKSNLKVVAEAFKTRS DNDWENERIFAVNKEEGRSTFVPFADTEEMKGDPSYTRPWIRNQSSRYLLLNGDWKFHWV KQPSERPVDFYKPGYDVSAWKEIPVPSNWEMLGYGTPIYTNITYPIRNNPPFIQGQRGYT VEKEPNAVGSYRREFSLPADWKNKEVFIHFDGVYSAMYLWINGKKVGYSQGANNDAEFNI TQYVKPGKNILAVEVYRWSDGSYLEDQDMFRLSGIHRDVYLVATPKLRLRDVHLTSVISD RFDKADLCVKADVKNYGKSAVNASVRIALLDAGGKTLRTFTTAAGNLAAGKEICLPAKAS IRDPQLWSAETPYLYTVNLELLDASGKVLEATTQQYGFRKIEIRDNKVYINDALVLFKGA NRHDIHPRFGKAVPVESMIEDILLFKRFNLNTIRTSHYPNDPKMYALFDYYGLYVMDEAD LECHGNMMLTKRESWKAAFVDRVVRMVERDKNHPSVIFWSMGNESGGGPNFEAAYQAARE IDARFIHYEGMNDVADMDSRMYPSIESMIAQDNEPRNKPFFLCEYAHAMGNAVGNLEEYW DYIANQSKRMIGGCIWDWVDQGINKPGEAPDRYYFGGSFGDRPNDNDFCCNGLVTPDRRV TPKLWEVKKVYQYMTFEEVGENNVGLRNHYSFLNMRHFNLRYVILKNGVPVAEEEFGLPD GKPGEHRTIHIPYLRHLTEDADYHINLEVKLKHDCVWAKAGHVVATEQFLLRERKQKTEV PELSASLQVVEERQYIRFRAPGTEISFDSKTGMMIGLRYDGQNMIHGQQGPALNWYRSIS NDPREWIQPVIALRGFDWKLAEDGKSASVQSQIEVKVGQVNIPYTVRYRVYSNGRVDVAA DFTTENNFDLPRLGLQMFLNPSLEQVEWYGRGPMENYRDRKNAAYMGRYKTTVTDMAEHY ARAQTMGGRTDTHWLALTDGQGKGLRITATDTLDFSAQHYTDKDLWQVKYGHDLSDIRRA 
EVVLNLDCIQRGLGNASCGPGPRPHYEIRKDTVYSYSFRMEPLR >gi|226332044|gb|ACIB01000012.1| GENE 69 99105 - 100556 1357 483 aa, chain - ## HITS:1 COG:SP2146 KEGG:ns NR:ns ## COG: SP2146 COG3669 # Protein_GI_number: 15901959 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Streptococcus pneumoniae TIGR4 # 23 481 2 453 559 256 35.0 9e-68 MNLRNFFLAAGIAAVAVPVWGQKAPEPYGLTPSARQVEWYNREMIAFFHFGINTFEEYVN EGDGRAPAAIFNPTALDCGQWMRTLQSAGIPSAIITAKHADGFCLWPSKYTDYGVKNSAW KNGKGDVVREFVDACEEYGIKAGIYLGPHDRHEHLSPLYTTEKYKQYYGHQLEELMGDYG KVWETWWDGAGADELTTPIYTHWYKIVREKQPDCVIFGTKNSYPFADVRWMGNESGKAGD PCWSTTDSVCVRDEWKNYEGLNEGVKGGDAYIPAETDVSIRPSWFYHAEEDSRVKSVKEL WDIYCTSVGHNSVLLLNFPPDRRGLIHPTDSLHAALLKQGLDETFGNNLLAKAKVKATNG RGGKFRPEFLTDNNKETYFAGRDGAKTSDIVFTLPRQTEFDCLMIQEVIELGHRTTKWSV EYSNDGRNWTPIPEATDKQTVGYKWIVRFAPVKAKQVRLRILDGFACPAIHTFGVYKQSA LFQ >gi|226332044|gb|ACIB01000012.1| GENE 70 100585 - 101538 1222 317 aa, chain - ## HITS:1 COG:no KEGG:BF0734 NR:ns ## KEGG: BF0734 # Name: not_defined # Def: exo-alpha-sialidase # Organism: B.fragilis # Pathway: not_defined # 1 317 5 321 321 646 99.0 0 MKQNILKAAILATGFCCTVGCSDNDSDLTIYIPDAGTAVKANFYIEDEPYVQTFNVAVTP ETYPQLTEYGLPNDAVVHLQADPGKVDEYNAKNGTEYELLPEAAYDLTEEVLVKAGTPKS EEAVITVHAKGNIEAFKEYLLPVSITSVEGAVADHCCQTIYFIFRGSMDASNMELLDRTG WEALSASSEEPKEGDWGHSGLKEACLDGDLNTFWGTDWSTQHPQPPHWIVIDLKKTTHIQ GFACQAREEGYDGPKEVTVEVSDDNATWTVAAKFTDIPAQGEFRSFLPAAVDGRYLKVTI TAVNGGPHVTISELNLF >gi|226332044|gb|ACIB01000012.1| GENE 71 101560 - 103242 1710 560 aa, chain - ## HITS:1 COG:no KEGG:BF0733 NR:ns ## KEGG: BF0733 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 560 1 560 560 1124 100.0 0 MKKNKILLIAAVACTSMFSSCSDLLDLYPEDSLSDPKFWTSVQDLELYANGFYGILPGAQ GPGADDESDCYVNEKPKTWIFNLETIPTSGGGWGNGDWGNIRSCNYFMARYKQVKGDESQ INRSVAVVRFFRAWEYVYKVKRFGDVPWYETELQMDSEELYKGRDSRKVVFTHILEDLDF AIEHLPKPNETKTGNMHKYAALAFKSRACLYEASFRKYHGLGDYEELYREAADAAAQVIE EGGYSIYKTDKPLQDYYNLFVQEDLASNSECIMPRVYITKLLMHNNTRQMEESYTGLSRA 
MFEQYLCADGLPTAVSPGYEEADMPMDELMQRDPRLRQTIDNPELPFKVLSDGTKQFNAL PIIDTKYCTTGYYVMKYHSTDPEQWNIGQSTLDVFIFRYAEVLLNYAEAKAELGECTQEV LDQTVNELRDRVEMPHLKVNVGFTDPNWPDYGYELSPLLYEIRRERAVELVGEGFRWDDI VRWKAGKLLEAPKSMLGMKVSDKLKEQYDSFSRELTQDNLLIVYPDRTTRKWDDKLYLHP LPIDETTMNPNLLPNNPGWE >gi|226332044|gb|ACIB01000012.1| GENE 72 103256 - 106441 3151 1061 aa, chain - ## HITS:1 COG:no KEGG:BF0732 NR:ns ## KEGG: BF0732 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1061 1 1061 1061 2117 99.0 0 MRERKLVERAVRISGMFVLLSGMGLQPCLASHSYNAASAGVVQVEQQKQKITGCVLDVTG SPLPGASVLVKGAPGRGAITDMDGNFSVEAEQGEQLEISFIGMETQVVTVSAKTRLQITL KEDSKTLDEVVVVGYGTQKKANLTGAVASVSAETLESRPITNLSQGLQGMIGNLNISADN GAPGKGYGFNVRGTTSINSSGPLVLVDGVQMDPNMINPADVESVSVLKDAASASIYGTQA AYGVVLITTKKGKDEKAKISFSSNWSINSPTRKPEYMDSWTFANFHNLTNRNSGGGDYYD KNYMDHIYAYYTDPKHNLPVFIDPSNPNKYLYCGNTDWIDETIKDNTLMQQYTLSLNGGS GKTAYYGSIGFLDQGGTLKHYDDKYRRFNVNLNLTSDVTNWLQVRLKTVYNNTYRDAPYG SNNSDETAQINSAFYGADLRPMMPVYHPDGNFSGQGSWTNMVATQSISGERKNKENDLWL TGGLKITPLKDWTINVDYTYNMYILSKKQHGKEILEHTANPDIVTVFPHTTPSRVKFTTD DNYYQTFNAYTDYSKSLGKHNIKLLLGYNYETKSYRWFNAERENLISNDLAGLGQAYGEK YNGSGQHSWATMGYFARINYNYDERYLLELNGRYDGSSKFPKDKRFVFFPSVSAAWRISS EAFFEPAKKIVDELKIRASYGSLGNQSVGGDYPYIATMGTNGEMGYLVDGKKIASVSPGG IVSPYLTWETVRQIDLGLDWALLNSRLYGTFDWYMRKTLDMVTNGTPLPAVLGTGAPQAN TADLKTTGWELSMGWRDRVNKDFSYDVSFVLSDYQAEITKFNNPQNLLSTHYVGKKWGEI WGLVTEGLFQSAEEVSAHADQSEIYGGGWYPGDVKYKDLNGDGKINKGKNTLDDPGDQKI IGNSEPRYSYGIKGGLQWRDFDFDVFFQGIGKKDVVLGGNQFWGFGNEWHVPFKHALDSW TEDNRNAYFPRSTYDNVTGNRETQTRYLQNAAYLRLKSITLGYSLPKALLAKWKIDHVRF YVSGQNLLTFDHLFDIYDPETVSLSTYPLTKSVSFGLNITL >gi|226332044|gb|ACIB01000012.1| GENE 73 107223 - 109889 1641 888 aa, chain - ## HITS:1 COG:no KEGG:BF0660 NR:ns ## KEGG: BF0660 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 888 1 888 888 1822 99.0 0 MLHYETDDAFIEEQYMRRLGCLFAIGMMWLCIVDCFSQGLLFYGNEKRISERATYSVLRE GHERTFTNAFRISFDYLVRNVESPGYILYLEDRDAGKIYSFTYLHKPGDRCSFSFNEDGK 
RIFCTFELDKEDYDHRWLPVSIALDIPADCARITIGKRSKEISNLGLEPTGFSPHIYFGK CEYILDVASYAIRNLSLTDGEEIMIFPLNESSGEDVHDSRGHVVGQVTNPVWLINQSYYW KELFTDYSSTPSGLNFDKTHQNILFFNQDSLSAFDLYTHELSKTPYKTPLPVQLRLGMNF LDEASRELYTYEVADSHGDAMVAALDLTTKEWRPASSDYLCQQLHHHCGFFHPGERKFFL FGGYGSRRYSNTFLAYDLNTCRWDTLAFSGDRVIPRYFSGMAVSNDYKRVYLYGGMGNEA GDQNVGRNYLYDLYRIDMQTRSVQKLWETKAPAVNRVVPRNMILSADEKHLFLLGYPEYL PNSTLQLYRLSVRDGECEAVGDSIPIVSEEIATNANLYFNEELNEFYCSVQEFEKHGQVT TRLYSLSAPPVTLDDVEYYSRRRNALNAGLLGAVVGGVLLLAACVWGIMHHKRKRDRMSA ETASAGVRPEEVSDISEVPAVESPEVKEEEVEKEPSEWEELPAVLPPDKNAIFLFGMFTI LDNAGRDITYMLSPKLRTIFLYILLNSVGKRGVLSSDMNQIFWPDKSGSNVKNLKGVSMN HVRKILQDVDGIELVYRNGYFCMVFGEEFYCDYIRLMNLTSPEKKKKETPGAIRAEWLEI LLRGKFIPAAESDLFDYYKQKVETLIHSLLPGQLELAYRDARFSIVIRLCNILFIADPLS ELALAYTVCALRKQNNQEEAIRRYAAFTKDYRRVMNEEYGIAFADIKV >gi|226332044|gb|ACIB01000012.1| GENE 74 109930 - 112215 1897 761 aa, chain - ## HITS:1 COG:L135972 KEGG:ns NR:ns ## COG: L135972 COG3537 # Protein_GI_number: 15673483 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Lactococcus lactis # 30 758 3 715 717 436 33.0 1e-122 MRKNFSTILIVGAALLLASCVQQKESFSPVDYVNPLMGTESTYAFSHGNTYPAVAVPWGM NFWSPQTGENGSGWMYTYTDSLIRGFRQTHQPSPWINDYGTFSIMPLSGVLKMDHKERGV PFSHIQEEAAPYSYSVTFANGLRTELSATSRGAVFEVTFPQDSAQYIVVDAYNGGSALTI DRENRCVTGVARNHNGGVPDNFANYFRIEFSHPIAEEGVYDGDTLMRHRPTLESDYTCAY LRFNVPAGEKLTVRTASSFISPAQALVNFSREVGGKSLAQVREEARKQWNSYLGRIEAEG GSEEQLRTFYSCLYRTLLFPREFYEFDAQGKPVYYSPYNGKIQDGYMYTDNGFWDTFRAV HPLFTLLYPEVSERVTQSILNAYDESGFMPEWASPGHRECMIGNNSISLLTDAWMKGIRT ICPEKALEAMIHQTEARHPGISSVGRDGFGYYDRLGYVPYPEVHEATAKTLEYAYADWCV ARFADSIGRKEIADTYYRKALNYRNLYYPDYGFMWAKDANGKWRDAFDATEWGGPFTEGS SWHWTWSVLHDPEGLSRLMGGHTAMEARLDSMFTAPNTYNYGTYGFVIHEIAEMVALDMG QYAHGNQPVQHAIYLYDYIGRPWKTQKHVREVMDKLYHSGSKGYCGDEDNGQTSAWYVFS AMGFYPVCPGVPEYAMGSPLFPKLTLHLPDGKNFTVKAEGNSPANRYIGKALLNESEFTR NYLTHRELTSGGELVLWMDSVPDSRRGTQKEDLPYSYSNGH >gi|226332044|gb|ACIB01000012.1| GENE 75 112366 - 113538 639 390 aa, chain - ## HITS:1 COG:no KEGG:BF0728 NR:ns 
## KEGG: BF0728 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 390 1 390 390 805 99.0 0 MKTKLTNCVKTVLLSFMLGVLFACCQGAASSGNAGKNASRVRIASNDSAKLVPDKALNDA SCVLAGLPVDKASGKLYALTRTKEWKNHARYMDQIWNVFRQTAPRLVAFSQTELEDINTR CHTLFYPFGGPDFLFANAFFPEMDTYVLIGLEPAGTAPKVKHPSAETYRLYQNAVSNVLN LSFFNDMDKELANDTIDGVVPIYSLLMARGNRKIVSIQEVWLSETGDLFERKEGDTIRNT CSAGMEVRFFRPGASRLQTLYYFCTDISNEGLQANRPLQAFMDRFDAETTATFVKSASYL MHEPAFSRIRETILQKSSAIVQDDSSIPLSCFDPEVWSVTLYGTFYKPISTFSQYLQPEL RDAYQLGDPKPLNFRIGYARQSNLQVARRK >gi|226332044|gb|ACIB01000012.1| GENE 76 113546 - 114547 926 333 aa, chain - ## HITS:1 COG:CC2502 KEGG:ns NR:ns ## COG: CC2502 COG2234 # Protein_GI_number: 16126741 # Func_class: R General function prediction only # Function: Predicted aminopeptidases # Organism: Caulobacter vibrioides # 40 303 34 272 309 117 28.0 3e-26 MKRKHLITVLMWCVATTLCAQTPVEKGLKSINRQAAEAYIGFLADDELQGREAGFHGSRV AARYIASLLKEMGIRPLGESYYQPFDAYRKERQQKGRLEVHPDSIAKLKQVVHQKLSMNN VLGMIPGKKTNEYVIVGAHFDHLGIDPALDGDQIYNGADDNASGVSAVLQIAKAFVVSGQ QPERNVIFAFWDGEEKGLLGSKYFVQECPFINQVKGYLNFDMIGRNNQPQNPKHVVYFYT EANPAFGRWLKEDIKKYGLQLEPNYRPWDKPVGGSDNGSFAKAGIPIIWYHTDGHPDYHQ PSDHADRLNWDKVVEISKASFLNVWNLANEKDY >gi|226332044|gb|ACIB01000012.1| GENE 77 114650 - 115072 447 140 aa, chain + ## HITS:1 COG:no KEGG:BF0726 NR:ns ## KEGG: BF0726 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 140 1 140 140 274 100.0 7e-73 MWILIVGLVLLAIVAMIAGYIRNKRLQKKIDNGELDSFPEVKEVDVECCGQHEVCERDSL LAAVSKQIEYYDDEELDTFIGRAPEDYTPEEADKFRDVFYTMQDTDVAGWVRSLQLRGIS LPDEIKDEVFLVVGERRIHP >gi|226332044|gb|ACIB01000012.1| GENE 78 115111 - 115920 781 269 aa, chain + ## HITS:1 COG:MA0025 KEGG:ns NR:ns ## COG: MA0025 COG1108 # Protein_GI_number: 20088924 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Mn2+/Zn2+ transport systems, permease components # Organism: Methanosarcina acetivorans str.C2A # 3 269 4 270 274 204 46.0 2e-52 
MNLLQYTFFQHALLGSLLASIACGIIGTYIVTRRLVFISGGITHASFGGIGLGLYAGISP ILSAAVFSVLSAFGVEWLSKRKDMREDSAIAMFWTLGMALGIMFSFLSPGFAPDLSAYLF GNILTITQTDLLMLGILAILLILFFTLFIHPIIYVAFDREFARSQGIPVVVLEYILMMFI ALTIVSCLRMVGIVLAISLLTIPQMTANLFTNKFKRIIWLSIAIGYLSCLGGLLISYRLN VPSGASIIFFSILIYIACKIGKSLFRKKQ >gi|226332044|gb|ACIB01000012.1| GENE 79 115935 - 116354 611 139 aa, chain + ## HITS:1 COG:BS_ydiB KEGG:ns NR:ns ## COG: BS_ydiB COG0802 # Protein_GI_number: 16077658 # Func_class: R General function prediction only # Function: Predicted ATPase or kinase # Organism: Bacillus subtilis # 27 135 30 134 158 91 41.0 3e-19 MEIKIQSLEQIHEAAREFISAMGDNTVFALYGKMGAGKTTFVKALCEELGVSDVITSPTF AIVNEYRSDENGELIYHFDFYRIKKLSEVYDMGYEDYFYSGALCFIEWPELVEELLPGDA VKVTIEELEDGTRKIVIND >gi|226332044|gb|ACIB01000012.1| GENE 80 116366 - 116587 197 73 aa, chain + ## HITS:1 COG:no KEGG:BF0653 NR:ns ## KEGG: BF0653 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 73 1 73 73 119 97.0 2e-26 MAPQYFVQGIFATAGLIALLASILNWNWFFTAQNAQLIVRNVGRGRARLFYGLLGVIMIG MAVFFLNTQPVSE >gi|226332044|gb|ACIB01000012.1| GENE 81 116579 - 117889 1256 436 aa, chain - ## HITS:1 COG:slr2013 KEGG:ns NR:ns ## COG: slr2013 COG1721 # Protein_GI_number: 16329852 # Func_class: R General function prediction only # Function: Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) # Organism: Synechocystis # 1 436 1 435 435 176 27.0 6e-44 MFLTKRFYILVLVVILLLGGGYLFGSLFIIGQLGLLALLLALAFDGYLLYRTKGIQAFRQ CAGRFSNGDDNEVSLRIESRYSYPVRLIVIDEVPVIFQQRNVHFELSLLPNEGKTLTYRL RPTRRGEYGFGFIRVFTTTRIGLISRRATCGRPEAVKVYPSYLMLHRYELLAMSDNLTEL GIKRIRRAGHQTEFEQIKEYVKGDDYRTINWKASARRHQLMVNVYQDERSQQIYSVIDKG RVMQQAFRGMTLLDYAINASLVLSYVAMRKDDKAGLVTFNEYFDTFVPASKQVGQMQTLL ENLYKQETTFGETDFSALCGHLGKHVNKRSFLVLYTNFSNMTSLNRQLVYLQQLARQHRV LVVFFEDADLKEYIAGKSVTTEDYYRHVIAEKFAFEKRLIVSTLKQHGIYSLLTTPDKLS IDVINKYLEMKSRQLL >gi|226332044|gb|ACIB01000012.1| GENE 82 117892 - 118887 1114 331 aa, chain - ## 
HITS:1 COG:PH0776 KEGG:ns NR:ns ## COG: PH0776 COG0714 # Protein_GI_number: 14590644 # Func_class: R General function prediction only # Function: MoxR-like ATPases # Organism: Pyrococcus horikoshii # 23 331 5 312 314 281 48.0 2e-75 MEENMNDMHGEETPRTDLTLFSEKMQELKSRIASVIVGQERTVDLVLTAILANGHVLIEG VPGVAKTLLARLTARLIDADFSRVQFTPDLMPSDVLGTTVFNMKTNEFDFHRGPVFANII LVDEINRAPAKTQSALFEVMEERQASIDGTTYRMGELYTILATQNPVEQEGTYKLPEAQL DRFLMKITMDYPSLDEEINILERHHTNAALVKLEEIQPVITREELLSLRRLTEKVFVDRT LLQYIALIAQQTRTSKAVYLGASPRASVAMLQASKAYALLQGRDFVTPEDIKFVAPYVLQ HRLILTAEAEMEGYSPVKVTQRLIDKVEVPK >gi|226332044|gb|ACIB01000012.1| GENE 83 118951 - 120198 514 415 aa, chain - ## HITS:1 COG:no KEGG:BF0650 NR:ns ## KEGG: BF0650 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 415 1 415 415 835 99.0 0 MRSSRWFIIGIVLFLLIMFVVESHLPKKFVWNPTFAQHDHQPLGCAVFDDVLKSSLPDGY SLSRKTFYQFAADSDSCRSILVITQHVNLVEADLNALLDLAQRGNKILIAASSFSTSLSD TLGFDNSYAYFNPRRMKEYAGNLLERDSVCWIGDSAVYDKRTFRFYPHLCGIYFTKYDLL SLPLATSRIDSMQMFNDSLPDCFPPVALSRSVGSGEIVLVTTPLLFTNYGMLDGDNAAYL FRLLSHLKGLPVVRTEAYGAGAQVEVSPFRYFLSQRPLRWALYLTMLVLVLFMVFTARRR QRAIPVIREPANRNLEFAELIGTLYYQKKNHADLVRKKFIYFAESLRRYIQVDVEDDSDD NALSRRISRKTGTDEEKVRNLFRKLRPVIRGQQEVGETLMKDLIDGMNEIEKPQP >gi|226332044|gb|ACIB01000012.1| GENE 84 120195 - 120818 456 207 aa, chain - ## HITS:1 COG:no KEGG:BF0719 NR:ns ## KEGG: BF0719 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 207 1 207 207 358 100.0 6e-98 MTLQADTLVCDTARVAFWQSNPDYDYNRELMTPEIDIYGWLSMQLSKLLRAIFGSRFAEE YSGIILIIIAILILLLILWFLYKKRPELFMRSRRGPVNYSVHEDTIYGVDFDAEIRRAID RKDYREAIRLLYLQTLKLLSDDGRIDWQLYKTPTEYIYEVKQEMLRTPFRNLTHGFLRVR YGNFPASESLFEELAALQTQIRKGGDV >gi|226332044|gb|ACIB01000012.1| GENE 85 120829 - 121761 914 310 aa, chain - ## HITS:1 COG:no KEGG:BF0718 NR:ns ## KEGG: BF0718 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 310 1 310 310 532 99.0 1e-150 
MESQKPKIALYVKRPFGDKLNATMDFIKENWKPMLKFCTYLILPLCLIQAISMNGIMGGA MGIAAAKEAGTNSLAAIGMQFWVNYGLMFLCYLVGSILLTSIIYGLMQIYNQREERLAGV TFADLKPFLFKNIRRLLVMVLFCIGLTIVVGIVMGILVVASPFTLLLTVPLLIACAVPLA LFTPIYLFEEIGILVAFWKTFRLGFATWGGVFLVSLVMGLISSVLQGVTTTPWYIATIVK YFFMLSDTQNELTISAGYSFMVYLLAIVQTFGAYLSMIFSLIGMVYQYGHASEVVDSISV ESEIDKFEQL >gi|226332044|gb|ACIB01000012.1| GENE 86 121742 - 122704 594 320 aa, chain - ## HITS:1 COG:BH0733 KEGG:ns NR:ns ## COG: BH0733 COG1300 # Protein_GI_number: 15613296 # Func_class: S Function unknown # Function: Uncharacterized membrane protein # Organism: Bacillus halodurans # 6 284 6 287 355 120 28.0 5e-27 MKEVTFIRRNIEKWKETEKVVEQADKLTPDRLADAYTELTADLAFAQTHYPSSRITIYLN NLASALHNVIYRNKKEKWTRIFTFWTQEVPQTMYHARKELLVSVLIFWASVLVGIVSAAN DDNFVRLILGNGYVDMTLDNIARGEPMAVYNGSEEVPMFLGITLNNIMVSFNVFAMGLLT SFGTGWLLFNNGVMLGAFQTFFFKHGLLGESMLAIWLHGTLEIWAIIVAGAAGLALGNGW LFPGTYSRKESFMRGAKKGLKIIVGTVPIFIMAGFIEGFITRHTELPDVLRLGIILLSLS FIIYYYIYLPNRKTHGITKT >gi|226332044|gb|ACIB01000012.1| GENE 87 123626 - 124357 576 243 aa, chain + ## HITS:1 COG:Rv3695 KEGG:ns NR:ns ## COG: Rv3695 COG1714 # Protein_GI_number: 15610831 # Func_class: S Function unknown # Function: Predicted membrane protein/domain # Organism: Mycobacterium tuberculosis H37Rv # 4 155 2 154 310 67 31.0 2e-11 MAESTIITGQFVRISQVPASLGERILARIIDYFLLFIYILATSYILGKLNIHAFSGSTFF LLFLFIYLPVLCYSLLCEVFNQGQSAGKKLMNIRVVKADGTTPSLSAYLLRWLLYGIDVT ITGGLGVLVILLTKNSQRLGDLAAGTMVIKEKNYRKIQVSLDEFDYLTKGYHPSFPSAAD LSLEQINVISKALELHHKDRTRHIAQLAPKVRALLSVDQTNINDEKFLQTVVRDYQYYAL EEI >gi|226332044|gb|ACIB01000012.1| GENE 88 124580 - 124915 521 111 aa, chain - ## HITS:1 COG:no KEGG:BF0714 NR:ns ## KEGG: BF0714 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 111 1 111 111 160 100.0 9e-39 MGLEDDFLLNDADDEKTIEFIRNYLPQELKEKFSEDELYYFLDLIDEYYSESGILDVQPD ADGYVDIDLEQVVEFIVKEAKKDEVGEYDPEDILFVVQGEMEYGNSLGQVE >gi|226332044|gb|ACIB01000012.1| GENE 89 124942 - 125256 176 104 aa, chain - ## PROTEIN SUPPORTED ## NR: 
gi|124485582|ref|YP_001030198.1| ribosomal protein L12E/L44/L45/RPP1/RPP2-like protein [Methanocorpusculum labreanum Z] # 3 103 18 117 120 72 36 1e-11 MALEITDNNFKEILAEGSPVVIDFWAPWCGPCKMVGPIIDELAKEYEGKVIMGKCDVDEN SDLPAEFGIRNIPTVLFFKNGELVDKQVGAVGKPAFVEKVEKLL >gi|226332044|gb|ACIB01000012.1| GENE 90 125317 - 129111 3488 1264 aa, chain - ## HITS:1 COG:CAC0516 KEGG:ns NR:ns ## COG: CAC0516 COG0587 # Protein_GI_number: 15893807 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, alpha subunit # Organism: Clostridium acetobutylicum # 2 1235 9 1140 1167 705 36.0 0 MQDFVHLHVHTQYSLLDGQASVSALVDKAMKDGMKGIAVTDHGNMFAIKEFTNYVNKKNG GPKGEIKDLKKRIAAIEAGEVECADKDAEIADCKAKIADAEGRLFKPIIGCEMYVARRTM DKKEGKPDQSGYHLIVLAKNEKGYHNLIKLVSRAWTKGYYMRPRTDRNELEKYHEGLIIC SACLGGEVPKKITQGLLAEAEEAIQWYKNLFGDDYYLEMQRHKATVPKANHEAYPLQVNV NKHLIEYSKKYNVKLICTNDVHFVNEEHAEAHDRLICLSTGKDLDDPNRMYYTKQEWMKT KAEMNELFADVPEALSNTLEILDKVEYYSIDHAPIMPTFAIPEDFGTEEGYRQKYTEKDL FDEFTQDENGNVVLSEEAAKDKIKRLGGYDKLYRIKLEADYLKKLTFDGAKKFYGDPLSP EVKERLVFELHIMKTMGFPGYFLIVQDFIAAGRNMGVSIGPGRGSAAGSAVAYCLQITKI DPIKYDLLFERFLNPDRISLPDIDIDFDDDGRGEVLRWVTEKYGQEKVAHIITYGTMATK LAIKDVARVQKLPLAESDRLAKLVPDKIPDKKLNLKNAIEYVPELQAAEASPDPLVRDTM KYAKMLEGNVRGTGVHACGTIICRDDITDWVPVSTADDKETGEKMLVTQYEGSVIEDTGL IKMDFLGLKTLSIIKEAVENIRLSKGMELDIDSISIEDPATYKLYSDGRTIGTFQFESAG MQKYLRELQPSTFEDLIAMNALYRPGPMDYIPDFIDRKHGRKPIEYDIPVMEKYLKDTYG ITVYQEQVMLLSRLLADFTRGESDALRKAMGKKLRDKLDHMKPKFVEGGRKNGHDPKVLE KIWADWEKFASYAFNKSHATCYSWVAYQTAYLKANYPAEYMAAVMSRSLSNITDITKLMD ECKMMGVQTLGPDVNESNLKFTVNRNGNIRFGLGAVKGVGEAAVQSIMEERKENGPFKGI FDFVQRVNLNACNKKNMECLALAGGFDSFPELKREQYFAVNSKGETFLETLMRYGNRYQA DKAAAVNSLFGGDNVIDIATPEILPAERWNDLERLNKERELVGIYLSAHPLDEYAIVLEH VCNTHMSELDDKSALAGREITMGGIVTSVRRGISKNGNPYGIAKIEDYSGSAEIPFFGND WVTFQGYLGEGTFLFIKARCQPKQWRPDELDVKITSMELLPDVKEQLIEKITILIPLAEL NSALVTELASLTKEHPGNTELYFKVTDTEGKMYVDLISRPVKLSVGRELISYLKERPELA FHIN >gi|226332044|gb|ACIB01000012.1| GENE 91 129275 - 129961 616 228 aa, chain + ## HITS:1 COG:NMA1160 KEGG:ns NR:ns ## 
COG: NMA1160 COG0688 # Protein_GI_number: 15794106 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine decarboxylase # Organism: Neisseria meningitidis Z2491 # 12 227 10 213 265 148 38.0 6e-36 MGRLKKLKKIRIHREGTHILWGSFFLLLIINLALYWGIDCKIPFYLVALVSIVVYLLMVN FFRCPIRLFGQDTEKIVVAPADGKIVVIEEVDEHEYFHDRRIMVSIFMSILNVHANWYPV DGVVKKVTHDNGKFMKAWLPKASTENERSMIVIETPEGVEVMARQIAGAMARRIVTYAEP GEECYIDEHLGFIKFGSRVDVYLPLGTEICVSMGQLTTGNQTVIAKLK >gi|226332044|gb|ACIB01000012.1| GENE 92 129971 - 130681 601 236 aa, chain + ## HITS:1 COG:SMc00552 KEGG:ns NR:ns ## COG: SMc00552 COG1183 # Protein_GI_number: 15964875 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine synthase # Organism: Sinorhizobium meliloti # 9 195 42 226 289 93 34.0 4e-19 MANAITRHIPNTVTCLNLFSGCIAGVMAFEAKYELAFIFIILSAVFDFFDGMLARLLHAY SPIGKELDSLADDVSFGVAPSLLVFSFLKEPGLIYPDFLAGLRDYIPYLAFLISIFSALR LANFNVDERQTSSFIGLPVPANALYWGALIVGGKDFLLAHCNAIYLIIMVMLFSWLLVAE IPMFSLKFKNLSWKDNKVSFIFLIVCIPLLLFLGISGFSAVIVWYIILSLLTRKNK >gi|226332044|gb|ACIB01000012.1| GENE 93 130744 - 131031 258 95 aa, chain + ## HITS:1 COG:no KEGG:BF0709 NR:ns ## KEGG: BF0709 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 95 1 95 95 164 100.0 7e-40 MFHILGFLFIIVIAVIIIGLALVGSVLRAVFGLGKRSPSSGSDRNGPNNNSGSRRYYHQT QANDKEEIITGTGAKHKKLFDDNEGEYVDYEEIKE >gi|226332044|gb|ACIB01000012.1| GENE 94 131048 - 131485 478 145 aa, chain - ## HITS:1 COG:SA0516 KEGG:ns NR:ns ## COG: SA0516 COG0590 # Protein_GI_number: 15926236 # Func_class: F Nucleotide transport and metabolism; J Translation, ribosomal structure and biogenesis # Function: Cytosine/adenosine deaminases # Organism: Staphylococcus aureus N315 # 1 144 1 149 156 127 46.0 6e-30 MLDDSYFMKQALIEAVKAGERGEVPVGAVVVCKERIIARAHNLTETLNDVTAHAEMQAIT AAANVLGGKYLNECTLYVTVEPCVMCAGAIAWAQTGKLVFGAEDDKRGYQRYAPQALHPK TVVVKGILADECAGLMKDFFAAKRR >gi|226332044|gb|ACIB01000012.1| GENE 95 131490 - 131723 251 77 aa, chain - ## HITS:1 COG:no KEGG:BF0707 
NR:ns ## KEGG: BF0707 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 77 1 77 77 119 100.0 3e-26 MRKRDKKCNKATPDEPKRERRMSILLSEDEQQIVDRYLDKYKITNKSRWLRETILMFIHK NMEEDYPTLFGEHDMRR >gi|226332044|gb|ACIB01000012.1| GENE 96 131861 - 132226 367 121 aa, chain + ## HITS:1 COG:FN1370 KEGG:ns NR:ns ## COG: FN1370 COG0792 # Protein_GI_number: 19704705 # Func_class: L Replication, recombination and repair # Function: Predicted endonuclease distantly related to archaeal Holliday junction resolvase # Organism: Fusobacterium nucleatum # 7 102 6 102 119 60 37.0 7e-10 MAEHNLLGKAGEDAAVDYLERHDYVIRHRNWRKGHFELDIVAAKNGELIIVEVKTRSDTD FALPQDAVTPQKIRRTVIAADTYIKLFQIDEPVRFDIITVIGKTGNFRIEHIKEAFYPPL F >gi|226332044|gb|ACIB01000012.1| GENE 97 132233 - 132589 437 118 aa, chain + ## HITS:1 COG:YPO0323 KEGG:ns NR:ns ## COG: YPO0323 COG2315 # Protein_GI_number: 16120660 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Yersinia pestis # 1 113 1 114 119 69 34.0 2e-12 MNIESAREYCLRKKAVTECFPFDEYSLVMKVMDKMFALIDLEGANTISLKCDPDYAIELR EHYSAIEGAYHFHKKYWNQVYFDRDADDKLIKQLIDHSYDEVMKKFTKKLRTEYDALP >gi|226332044|gb|ACIB01000012.1| GENE 98 132573 - 133328 566 251 aa, chain + ## HITS:1 COG:lin2018_2 KEGG:ns NR:ns ## COG: lin2018_2 COG0340 # Protein_GI_number: 16801084 # Func_class: H Coenzyme transport and metabolism # Function: Biotin-(acetyl-CoA carboxylase) ligase # Organism: Listeria innocua # 36 239 35 239 253 92 32.0 1e-18 MMPCPESFPVPLIHIAKADSTNGYLNALCEKEKVSELTTVVADFQTAGRGQRGNSWESED GKNLMFSFVLYPTFLEARKQFLLSQIASLAVKETLDLYIGDVSIKWPNDIYWKDKKICGM LIENDLMGIHISQSIAGVGININQKEFHSSAPNPISIIQITHRESDRMEILAQVLQRIKE YYKILQEGDIEFITDRYQAALFRKEGIHFYKDSEGTFNAGIVEVEADGHLVLQDETGKIR RYLFKEVQYIL >gi|226332044|gb|ACIB01000012.1| GENE 99 133367 - 134704 817 445 aa, chain + ## HITS:1 COG:no KEGG:BF0631 NR:ns ## KEGG: BF0631 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 11 445 
2 436 436 806 99.0 0 MFFIKKHSFRKTEYLFVILLVLIGFSSCEDSTSDATLELSQSTFENISSDGATLTVNITS SDSWTAASSSTACNPVPNQGTSNQSLSIVVEANLDEAERNMTVVVTSGGIKKTISISQQG RSTTAGEYHYNLPVIFHVLYKDKNNPLQYVKQDRLAKILDTVNKLYKDKTKSVDMNLTFT LATTDEDGKPLSTPGMEYVLWEESYPIDCDVFMNDETGKYVKYIWEPNNYINVMVYNFKD DESTNSTTLGIAHIPFSTVGSNYLEGLSKTQKSYLEKQNLKFPYSVSINSLFINDQSTST QYSTADITVTLAHELGHYLGLHHAFSENDEGIYDGCFDSDYCDDTPTYNKVEYDADYAYT AKNDPANFTFDYLVKRKNCKTNQTFTSTNIMDYSVSYSDRFTNDQRSRIRHVLTYSPLIP GPKQGQTQTRSVVEGPIDLPIRTAR >gi|226332044|gb|ACIB01000012.1| GENE 100 134731 - 135414 535 227 aa, chain - ## HITS:1 COG:no KEGG:BF0630 NR:ns ## KEGG: BF0630 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 15 227 18 230 230 385 100.0 1e-106 MKTLTLFLSLLFSFPFVLSAQMVGETLQKVSAALDNRQWDQAVTLFRQAVNTNVEKAEMF YWTGVDKSLEVSSRIGRELAAYYKKSRSYDKAYLFYKELLQKSPNDVNCLVSCAEMEVCR GRESEALETYRKVLSLDADNLAANIFIGNYLYLKAEREKKQLEADYKKISAPTRMQYARY RDGLSRVMSTGYGKAREYLQKVISQFPSTEAQKTLERIKLIEKEVNR >gi|226332044|gb|ACIB01000012.1| GENE 101 135529 - 136851 787 440 aa, chain - ## HITS:1 COG:CC0380 KEGG:ns NR:ns ## COG: CC0380 COG2273 # Protein_GI_number: 16124635 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucanase/Beta-glucan synthetase # Organism: Caulobacter vibrioides # 154 419 19 301 301 67 24.0 4e-11 MKQFILIMICILLQVIHVHAGSKDITWKWDYSEADDIFIDLLGKAATGFMETNTPNCYGG KLVVKLPEGLHFKQTEAPAFLLIGNAEEIAFNVDVNDSMAILSLKTSSVQTVPSEAINIT LNPKAIKGKLRKGMNTTGRLSFHPSEIPTVFKSPITGRTYHLVFNDEFDDGVIDTLKWDT RSRRSPFTRRGMYQEKPYYVLCHEDWTKELHGELRLEVSKYPTQNNVVMTGGILSLGRFM ARYGYYETKASFRDCIGEGYWPAFWIHFDEADKYGKGTEIDVFEYIPKDKQIFQTLHWYK KQAMEEQQSEVQHAALDYDKSGQFKEHRSSTKYFVLDEAQSKEHTFAVEWTPEELIFYTD GKVTRRVNRKDDPKQVPSAYQMVYFSCSAGEWGGNVMENQVPAYVYFDYCRCYQESDQDA IYTVKGNGMKVPASRRVGKL >gi|226332044|gb|ACIB01000012.1| GENE 102 136876 - 137034 91 52 aa, chain - ## HITS:1 COG:no KEGG:BF0699 NR:ns ## KEGG: BF0699 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 52 13 64 64 
79 100.0 3e-14 MKNREYENDVNNHVIILVRNVLDTGINIIFVCILKDILQTINKHRSEVALSI >gi|226332044|gb|ACIB01000012.1| GENE 103 137063 - 138364 975 433 aa, chain + ## HITS:1 COG:VC0090 KEGG:ns NR:ns ## COG: VC0090 COG0534 # Protein_GI_number: 15640122 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Vibrio cholerae # 5 422 15 432 454 275 39.0 2e-73 MHISNGNKRILQIAVPSIISNITVPLLGLVDVTIVGHLGSAAYIGAIAVGGMLFNIIYWI FGFLRMGTSGMTSQAFGQRNLEEVTKLLLRSVGVGLFIALCLMTLQYPIQKAAFAFIQTS DEVERLATLYFRICIWGAPAMLGLYGFAGWFIGMQNSRFPMYIAITQNIVNILASLCFVF LFGMKVEGVALGTLIAQYAGFLMALLLWLRYYKQLRKRVHWRGIWQKQAMYRFFQVNRDI FLRTLCLVAVTMFFTSAGAAQGEVVLAVNTLLMQLFTLFSYIMDGFAYAGEALAGRYIGA GNRMELHRTVRQLFGWGVGLSAGFTLLYGIGGQSFLGLLTNESSVIQEADTYFYWVLAIP LAGFSAFLWDGIFIGATATRQMLFSMFIASASFFLTYYIFQEVMGNHALWMAFIIYLSLR GLVQAFLAKKIVH >gi|226332044|gb|ACIB01000012.1| GENE 104 138413 - 139123 930 236 aa, chain + ## HITS:1 COG:FN1622 KEGG:ns NR:ns ## COG: FN1622 COG0528 # Protein_GI_number: 19704943 # Func_class: F Nucleotide transport and metabolism # Function: Uridylate kinase # Organism: Fusobacterium nucleatum # 4 234 6 236 239 263 58.0 2e-70 MAKYKRVLLKLSGESLMGEKQYGIDEKRLAEYAAQIKEIHEQGVQIGIVIGGGNIFRGLS GANKGFDRVKGDQMGMLATVINSLALSSALVAAGVKARVLTAVRMEPIGEFYSKWKAIEC MENGEIVIMSAGTGNPFFTTDTGSSLRGIEIEADVMLKGTRVDGIYTADPEKDPTATKFH DITYDEVLKRGLKVMDLTATCMCKENNLPIVVFDMDTVGNLKKVITGEEIGTLVHN >gi|226332044|gb|ACIB01000012.1| GENE 105 139370 - 140530 598 386 aa, chain - ## HITS:1 COG:no KEGG:BF0625 NR:ns ## KEGG: BF0625 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 386 1 386 386 767 99.0 0 MKKINAALVISLFVMTGCGGNKQLTDDCITVDVSADYPKKELILQDFMDVEYVPLETTDD FITQGIVKATGKKILLVANRIMDGNIFVFDRATGKGVRKINRLGQSGEEYSHIASIVLDE DNNEMFVVDYPARKILVYDLYGEFNRSLPFPDTCYYEFLSDYDRDHLIGYKSYLPLIETD ESCHVLISKKDGSVTRKIQIPFKELETPVVTKDEAIVTPGFFLITPHDGNCLLTKTSSDT VYNYLPDGTLSPFIVRTPSIHSMDPKVFLFPTIITDRYYFMQTLDKKFNFEKGRGFPTND LVYDKQEKAIFQYTVYNDDFSNKHRVALGQQPEKSVDEEIVTCRALNASDLVEANEKGEL 
KGKLKEIAAGLNEESNSVIMLIKRKK >gi|226332044|gb|ACIB01000012.1| GENE 106 141587 - 142435 616 282 aa, chain + ## HITS:1 COG:no KEGG:BF0694 NR:ns ## KEGG: BF0694 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 282 1 282 282 437 100.0 1e-121 MEWPKSKNNKMEELLERMSQFEANLAQLISTGIPNHTPSPATDEATSSPNEQEQLSPEQE EEMKLKIQELQQKEEELNLRAEKLDKLAKELEERQQNLENRNPNDERSIEPATHPDHSFP SQIGDQINALKKLLEDSSYKDKIIKDLHEELQSHNRDLHAEIVKPLLKNMIKMHERLTKT YKFYENTEAKSSPETYTRLLREVENCKLHIQDILEDEYDLEYFEPTIGSAYSPKEQTAIR TVITDTPEQAGTIKEFHYGGFRNTTTNKIFQPSTVTVYKKSE >gi|226332044|gb|ACIB01000012.1| GENE 107 142475 - 144064 1567 529 aa, chain + ## HITS:1 COG:CAC0472 KEGG:ns NR:ns ## COG: CAC0472 COG0443 # Protein_GI_number: 15893763 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone # Organism: Clostridium acetobutylicum # 7 520 4 542 551 327 39.0 3e-89 MDNTRSVYGIDLGTTYSCIAQVDKFDQAIVLRNFEGDATTPSAVYFEDMDHVVVGKEAKG MLATEPTKTAVFIKRHIGVDDSFDKNTNEFPYHYDPTEISAFILKKLVKDANDLGDNPEP IKDVVITCPAYFGTKERMQTKQAGEIAGLNVLSIINEPTAAAISYGVKTDQKKTVLVYDL GGGTFDVTLINVNGGAIKVIATGGDHHLGGVDWDTALAEYMLAAFNEQNNTSYSFEDRLD LKYELLLLAEDKKKVLTAKQTAKATYQYEGNSARIEISRELFNSLTERKLDETIDATKKV IAIAKEKGYNNIDEILLVGGSSRMPQIKERVDKEFNCDAKLTDPDECVAKGAAIYAMNAA YSQAVRDYEEGESDDKPAPLRGDRTTVVNVTSKTYGTDVIIEGQSMVQNLIFANSSLPTK RIETFTTSIPNQRGVSVKVFESDFTNMETESIVEERFCTLIDDHTLKLSKDWPQGTQISV TYQIDQEGILHGLAYVENDKLEFDLKITGVKCEEELRKSKAIIDKASVE >gi|226332044|gb|ACIB01000012.1| GENE 108 144092 - 146887 969 931 aa, chain + ## HITS:1 COG:no KEGG:BF0692 NR:ns ## KEGG: BF0692 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 931 1 931 931 1761 99.0 0 MDKTLLKLEIGKWNEACRFLKSKQTELSTAEWTAIETLADYWRKKFDPDATSECMDLIST LEQSDKLHSNNKEIIVWSKEFRTIARKYYTKDETTFLVFQNALKELYQSSAPRSSAPLQT PVNSESQNIILKQNAKKWVSTYSQLYENQNELDRDEWNAVRFLANFWKNGTFNETSRKEC EQYISILIASDKLHSNNKNIITWSKDFRGIAKTYYTKDSKTFEDFKKFMTKYTRENPIQE 
RAPQERIPQTPPRQTSPSIEITGLVIANTDEKGDPIPSNQAELDTRSCYLQPRIDYRVLR GGSSVDIWYKLYAPDRTLMTASNSKSGYTWYGNVPLTGSRSAYPLNGFGSMSGNVFSAGQ WIIEFFENDLQIATYTFAIKQHRTSTPPPQSRVTPPPPPRTPSNRRTSTVSPKKGHGGLW SFLIIAAIIGFCGYQYWYKPMTIDRDAERTYVYVSSLLQRSDKNANVEYNRIQSLPYGSE LITYQKEGDGWSYIKANQKKGYVSTNYTLSKTDFELLDNLWGSKEAMEGAPTAKCRLALV DFIKKNNYKTGTGQWQLFAQPIEVKPNAVLYPRLANGYTKFTEFAFVLSNSSTHEGVLAI YSFADDETPVFIYQEQTTENAKIKDVRYYPWKTDKYKVIYATPNAMVSRSPQLPTNQQPS KAKSENGLKITKALFGNTDKSYNILTQFGTQLPTTTQYLSPRIFYENPAGKSSVVIKYKI ITPQGNLMTGTGSPSGYTNEQKVTLNRSGYINLAGWGNTTGDAYTSGTYRIEFWSEGQLL YSTGVQIQGNSEKAVTVSSKPVECPIKIRTMLFANSNDKGTILEDYGKPLYGNKLQYLKP KIIYSSLNGARNITLYAKIYRPNGVLIANKNSPNGYTYKHGMYTPAVGADNNVSYLTSWG SPECDVYSPGTYQYEIWYEGSKIFSCQVIVH >gi|226332044|gb|ACIB01000012.1| GENE 109 146898 - 147449 302 183 aa, chain + ## HITS:1 COG:no KEGG:BF0691 NR:ns ## KEGG: BF0691 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 183 1 183 183 361 99.0 5e-99 MKKKIIRYLIISHLTFILSIGGMAVAFYYYHTAQMAQIKNANILLISKDEMKLRLIDYKG QELFTADIACGKNYGNKEKQGDLKTPEGTFKIIDIQDASKWKHDFGDGKGEIEGAYGNHF IRLETPGHKGIGIHGTHDPLSIGTRATEGCIRIKNSELEQLVSLIRVPTTVIITPSVKDI KTQ >gi|226332044|gb|ACIB01000012.1| GENE 110 147461 - 148519 466 352 aa, chain + ## HITS:1 COG:no KEGG:BF0690 NR:ns ## KEGG: BF0690 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 352 40 391 391 610 99.0 1e-173 MKTFKRFLFICIGLFIIVLSTQALEKYEVTANTFLNIRSHGSTNAPIIGTINHGGIVNVE SIDGEWAKVSFNGGYGYVSTTYIRPVTPTPPAKAPTNTLSDWFRQTNYDCRPLVYIILGL SIVLFILRMRRGESTPLEDSEHTINLSLFITVCLLELFYVFSMGANSIWFCTPDKIGWLW TIINFFIFGAVVFNQWMCFFNTLNDVQYNSYATFNWTWGIYTWGICIIGAIICGFFFVGF LPVVGIGFLVGQSVQTGIIFNKVLPKGGWKHACICTLTYLIGSTATVLIVAHFLILLLIV LIALFLLSLLGKSSSSSGGKRCSNCSHLSGSSCNLSGRYISSPSTTYCDNYQ >gi|226332044|gb|ACIB01000012.1| GENE 111 148646 - 149206 840 186 aa, chain + ## HITS:1 COG:RSc1407 KEGG:ns NR:ns ## COG: RSc1407 COG0233 # Protein_GI_number: 17546126 # Func_class: J Translation, ribosomal structure and biogenesis # 
Function: Ribosome recycling factor # Organism: Ralstonia solanacearum # 2 186 1 186 186 159 47.0 4e-39 MVDVKTIIEESQEKMDMAVMYLEEALAHIRAGKASTRLLDGIRVDSYGSMVPISNVAAVT TPDARSITIKPWDKSMFRVIEKAIIDSDLGIMPENNGEIIRIGIPPLTEERRKQLAKQCK AEGETAKVSIRNARRDGIDALKKAVKDGLAEDEQKNAEAKLQKVHDKYIAKIEEMLAEKD KEIMTV >gi|226332044|gb|ACIB01000012.1| GENE 112 149302 - 150234 930 310 aa, chain + ## HITS:1 COG:TM1717 KEGG:ns NR:ns ## COG: TM1717 COG1162 # Protein_GI_number: 15644464 # Func_class: R General function prediction only # Function: Predicted GTPases # Organism: Thermotoga maritima # 2 298 6 286 295 209 38.0 5e-54 MKGLVIKNTGSWYQVKTDDGQLVECKIKGNFRLKGIRSTNPVAVGDRVQIILNQEGTAFI SEIEDRKNYIIRRSSNLSKQSHILAANLDQCMLVVTVNYPETSTTFIDRFLASAEAYRVP VKILFNKVDAYDEDELHYLDSLITLYTQIGYPCFKISALTGEGVDAIREELKGRVTLFSG HSGVGKSTLINALVPGLEVKTAEISAYHNKGMHTTTFSEMFPVPGDGYIIDTPGIKGFGT FDMEEEEIGHYFPEIFKTSANCKYGNCTHRQEPGCAVRKAVEEHYISESRYTSYLSMLED KEEGKYRAAY >gi|226332044|gb|ACIB01000012.1| GENE 113 150268 - 150456 82 62 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|255007500|ref|ZP_05279626.1| ## NR: gi|255007500|ref|ZP_05279626.1| hypothetical protein Bfra3_00075 [Bacteroides fragilis 3_1_12] # 1 62 1 62 62 113 96.0 4e-24 MALKLTGGRWKMRVEDKQAENAIFHRVIALIINGVSGKRWKTEDRNVFFVGEAVALPSEA LK >gi|226332044|gb|ACIB01000012.1| GENE 114 151205 - 152458 552 417 aa, chain + ## HITS:1 COG:no KEGG:BF0615 NR:ns ## KEGG: BF0615 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 417 1 417 417 733 97.0 0 MKILFSLDVSNQRQVDFCNRILKILDNKDYNNDITLICKSQNHLNDYLYIPPIKSYKTEE AEIILTYGKTGLSHISQFARKSNIPIIHFINTEYLKDEYLSEDQQVEKIILCDCFNQLLE SFFQKDKMFVLPYFSKPVVTKNVEIRNKNSPKLLIAIAHPNLKNSPVYYISNLLNILSDY RITILYNGDPLIPIFNSNITLINVKESNIEKVILSNDIIIGDGISIYTGIMLGKPCIVIG EQGYGGLITPQNLSQQFANKFQGRIGGSLNEYIPLNLIMNDIQYVQNTEKSKNIDCIIIK NKELLDNEYRQTQQLLNDLILEVAANHKQLYTFPMEIHLRLSDAFHLIKFSDTKFVLAYT ANNKVHSSFGKEEAEIIALFKRSCLIKDAINMSPYKKEPKIFVEFIQMLFNEKILIA >gi|226332044|gb|ACIB01000012.1| GENE 115 152455 - 154017 905 
520 aa, chain + ## HITS:1 COG:no KEGG:BF0614 NR:ns ## KEGG: BF0614 # Name: not_defined # Def: glycosyl transferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 520 1 520 520 1012 99.0 0 MNVNKPTTEKKLIDLNNDIIHNFDVSIVMSFYKRYTEFKKVLPHNAPYLQRNGIEVIIVL DDPDEKSELLMLLQNYPFINWKLIINERKHAPRNHASVLNVGLKHATKKYILQIDPEVEF LTDIIWQMRDAIEKYPMHYILAMMAYVPYEQELTENNIKELDFIPWGNLMVERNHLYKLH GYDETFITWGGEDNNMRARLDMSGIKKFILPEAKTIHREKNYDPNERSKRINKHSISDWR KMNYPSEAIANKDIWGSEFNKVIYDWQDNQYAKDLCYTYLQQFIGFEIRHPAAFRKRHKK IVLCQAYNEEKLIEGFLTNMANYFDGIILLDDESTDRTWDLAIHDKIILKVKKKRSGFND LENRNILLDLSAFFQSEWFCFMDIDERFDERFTNFSEFENNKEIHVVSFRGVYLWNDEQS YKGDIPNSNKGILTVYRMFRPIGHTHINTHKKLHFIATPYFTNTWQSNILFKDYGSMKEN DRIRKYERYIQEDQQKDMSSGYDYLLNSENLYQLDKIEEY >gi|226332044|gb|ACIB01000012.1| GENE 116 154105 - 154233 78 42 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MQIKTKIMFISAMVSLTYHAAQQVEKQKILLHEKVKKQLTSY >gi|226332044|gb|ACIB01000012.1| GENE 117 154246 - 154590 245 114 aa, chain + ## HITS:1 COG:no KEGG:BF0612 NR:ns ## KEGG: BF0612 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 114 1 112 112 188 83.0 6e-47 MKKLNINKLSEYPVVNIEEQTSLKGGVSQDEFFRMLENNTWQGGYVDGYGYAAPAVTIYG TSWNETGRWDIDGCPACGNGSRQGWNTNQTAAEDNLVTRLTHFFFHKHAYYGDK >gi|226332044|gb|ACIB01000012.1| GENE 118 154641 - 154793 71 50 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MGCDQRVGFCTKFPIINTIKRLIKGYVRITTIKDKKRCTFFMGFKNFACR >gi|226332044|gb|ACIB01000012.1| GENE 119 155085 - 155252 145 55 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MDYNYNDRIVFWYATCRQYIYGYWFIMLWICISIFIYQIRKWASSYTMSFLLQYY >gi|226332044|gb|ACIB01000012.1| GENE 120 155633 - 157804 173 723 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein [Acinetobacter baumannii AYE] # 483 707 7 225 311 71 25 3e-11 MRINFPSYIQHDQMDCGPACLKIIAQNYGKRFSLKYLRDRCYATREGVSLFDIGRAAEEI GFRTLAIKVTFEDLIEKMPLPLIVHWKQSHFVVVHKITNRKVYISDPAQGLTHYNHKEFR 
EAWEMCNGLGTIMILETTPEFHNMNEMETRASFSHFMKYLKPHHRYLGQVIVGMVAGILI GLLSPFISQSIVDFGIGSGNIQFVNTMLIAGMILAFSSMASDFIQSRLMLYVSERINMGM VSDFLRKTMSLPITFFERKMVSDLLTRIDDHGRIQSFIMSTFLGIFINILLFVIYSLLML YYESNMFLVFMIGNTVYTGWIFLFLKQRKKLDNQLFGCRATNQNDLLELLENVNEIKINN IANKRRWKWELSRFKIYGLRVKNMNLDQIEATGASFIIHLQGLFITYIAALNVIEGTMTL GMMMAAQYILGQLNAPIKSMIGYVHSLQFARISLKRVNEVIWEEEPEISESKVSIPIEKG IKVKDLDFYYNPNLNKVLDNINLEIPEGKITAIVGESGSGKTTLLKLLLRFYKPTNGEIE VGGVPLDNIDLYRWRNSCGAVLQDGKLFNDTILYNITLEDEEMNVNQKQLVKAIQLANAE NFINARPLKLYTPLGTNGSGLSQGQKQRILIARAIYKNPDFIFLDEATNSLDTNNEKQIS KNLETILEGKTAIVIAHRLSTVKNAHNIVVMEKGKIVEQGTHQELINLKGIYYDLISSQL EIG >gi|226332044|gb|ACIB01000012.1| GENE 121 157816 - 159129 313 437 aa, chain + ## HITS:1 COG:XF1216 KEGG:ns NR:ns ## COG: XF1216 COG0845 # Protein_GI_number: 15837818 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Xylella fastidiosa 9a5c # 21 427 30 413 420 83 23.0 7e-16 MSKIQLRQVYRDQLYNYRPTWILRWGITIFFVFLLLVISVSGFIRYPDIVPATVEITTIN PPANLISKVNGKIEIIFTEEGESITKGQVLAILESPAQWKDMKILDHYITVLENTIGKDS LSVIPEPDFLRNDLELGEVQGRYADLKLNYTELYNFLHSGLFEEEVLSLQEKKQAQKQLL VQENRKRELLKTQIRLADKEYQRDSILFVKEVISESEIEQRHQNRLQFQSSLVDMEVNIL NIKSSLKQLRSDLKKIELKHNTDRQELTNKLLQSTHLLKAQTETWKQNYLITTPIDGKVS FTTYWSKNQNVKSGELIFSVVPIDSMTTKARLQFPIQNSGKIKEGQQVNIKLQNYPYQEF GMLVGHLSKISEVPNELLYSADVVLDKGLITSYGKRLPKVQQLKGDAEILTDDLSLLMRF FNPLRAIFDHRLRKHNR >gi|226332044|gb|ACIB01000012.1| GENE 122 159233 - 160315 1080 360 aa, chain + ## HITS:1 COG:VC0165 KEGG:ns NR:ns ## COG: VC0165 COG0845 # Protein_GI_number: 15640195 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 66 356 58 353 368 132 28.0 1e-30 MDKKVKWGIVILVGAGLIGWGIYSRLPKANEELAAADKVMTGKQINKRVLNVNAKIIKPQ LLTDQIQISGSLMPDEEVDLSFETSGKIVEINFDEGTTVKKGQLLAKVNDRQLQAQLQRL VAQLKLAEDRVFRQNALLERDAVSKEAYEQVKTELATLNADIDLVKANIAMTELRAPFDG VIGLRQVSVGSYASPTTIVAKLTKIIPLKIEFSVPERYASQVKKGTNLNFELEGKLSSFP 
AKVYATESRIDQSTRTLTVRALYANSNGAILPGRYASIQLKKEEIPNAIAIPSESIVPEM GKDKVFLYKSGKAEPVEITAGIRTEAEVQVIKGLQMGDTIITSGTLQLRTGLPVTLDNIN >gi|226332044|gb|ACIB01000012.1| GENE 123 160330 - 163365 2972 1011 aa, chain + ## HITS:1 COG:VC0914 KEGG:ns NR:ns ## COG: VC0914 COG0841 # Protein_GI_number: 15640930 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Vibrio cholerae # 1 1001 1 1009 1036 645 37.0 0 MNISELSIRRPVLSTVLTIIILLFGLIGYNYLGVREYPSVDNPIISVSCSYPGANADVIE NQITEPLEQNINGIPGIRSLSSVSQQGQSRITVEFELSVDLETAANDVRDKVSRAQRYLP RDCDPPTVSKADADATPILMVALQSDKRSLLELSEIADLTVKEQLQTISDVSSVSIWGEK RYSMRLWLDPIKMSGYGITPIDVKNAVDKENVELPSGSIEGNTTELTIRTLGLMHTAEEF NNLIVKEENDRIIRFSDIGRAELGPADIKSYMKMNGVPMVGIVVIPQPGANHIKIADAVY ERMEKMQKDLPEDVKYSYGFDNTKFIRASISEVKETVYVAFILVIIIIFLFLRDWRVTLV PCIVIPVSLIGAFFVMYLADFSINVLSMLAVVLAVGLVVDDAIVMTENIYVRIEKGMPPK EAGIEGAKEIFFAVISTTITLVAVFFPIVFMEGMTGRLFREFSIVISGSVIISSFAALTF TPMLATKLLVKREKQNWFYLKTEPFFEGMNRLYSRSLAVFLHKRWIALPFVAITIGIIAF LWNYIPAEMAPLEDRSQISINTRGAEGVTYEYIRDYTEDINDLVDSIIPDAESVTARVSS GSGNVRITLKDMKDRDYTQMDVAEKLSAAVQKKTMARSFVQQSSSFGGRRGGMPVQYVLQ ATNIEKLQEVLPKFMAKVYENPVFQMADVDLKFSKPEARININRDKASIMGVSTRNIAQT LQYGLSGQRMGYFYMNGKQYEILGEINRQQRNTPANLKSIYIRSDKGDMVQLDNLIELTG GIAPPKLYRYNRFVSATVSAGLAEGKTIGQGLDEMDKIAKETLDDTFRTALTGDSKEYRE SSSSLMFAFILAIVLIYLILAAQFESFKDPLIIMLTVPLAIAGALVFMYFGGITMNIFSQ IGIIMLIGLVAKNGILIVEFANLKQETGEDKMTAIKDAALQRLRPILMTSASTILGLIPL AFASGEGCNQRIAMGTAVVGGMLVSTLLTMYIVPAIYSYISTNRSNKLKQE >gi|226332044|gb|ACIB01000012.1| GENE 124 163362 - 164687 1267 441 aa, chain + ## HITS:1 COG:no KEGG:BF0682 NR:ns ## KEGG: BF0682 # Name: not_defined # Def: putative outer membrane protein TolC # Organism: B.fragilis # Pathway: not_defined # 1 441 1 441 441 799 100.0 0 MKQIVLSIIALCCAPFAMQAQQPQIYTLKSCLEYGLQNNYSLQIVRNEEQVSRNNATPGN AGYLPTLDFTAGYKGTVDNTNTKVRATGESVKENGVFDQTLNVGLNLNWTIFDGFNITAN YQKLKELQLQGETNTRIAIEDLIANLAAEYYNYVQQKIRLQNFRYAVSLSKERLRIVEER YHIGNFSRLDYQQAKVDFNADSAKYMKQQELLHTSRIQLNELMANEDVDQPLVIEDSIIK 
VNAGLRFEELWNATLLTNASLLKAEQNNTLAMLDYKKVNSRNYPYLKMNTGYGYTFNKYD IAANSQRGNLGANFGVTVGFNIFDGNRRREKNNARIAIKNARLQREQLEQGLKADLSNLW QAYQNNLQMLKLERQNLVAAKENHEIAMERYMLGNLSGIEMREAQKSLLDAEERILSAEY DTKLCEISLLQISGKITKYLE >gi|226332044|gb|ACIB01000012.1| GENE 125 165215 - 166882 1311 555 aa, chain + ## HITS:1 COG:XF2207 KEGG:ns NR:ns ## COG: XF2207 COG0531 # Protein_GI_number: 15838798 # Func_class: E Amino acid transport and metabolism # Function: Amino acid transporters # Organism: Xylella fastidiosa 9a5c # 3 484 6 482 483 387 43.0 1e-107 MGLFIKKPFEALLAEANASGSKSLKRVLGPWSLVALGVGVIIGAGLFSITGTVAAGYTGP AITLSFAIAALGCCFAGLCYAEFASMIPVAGSAYTYSYATMGELIAWIIGWDLVLEYTVA ATTVSISWSRYLVVFLEGLNIHLPQALTACPWDGGIVNIPAFMIVVLMSIFLIRGTEGSS IFNGIVVFLKVSVIAIFVVLGWKYINADNYTPYIPANTGTLGEYGLSGVLRGAAIVFFAF LGFDAVSTAAQETKNPKRNMPIGILVSLLVCTVLYMLFAHVMTGVAHYTEFSGQQGIAPV AIAIEHMGHADATGIIHPDYPWLNRAIVLAILFGYCSVIMVTLLGQSRVFLSMSHDGLLP PFFSHINEKFRTPARSNLLFMLIVGLLAAFVPARLAGEMTSIGTLMAFTLVCAAVLVVRK TMPNIPRSFKTPFVPLVPILGILTCLCMMLFLPADTWIRLVLWMLIGLDIYVGYGMKHSK LEHGVKNRRGQSALNMIGIALSLLCVITGLWHQQTVGWNESKVLLIISFVFAFTHCAYYM MRIWKGTTKQTNDNG >gi|226332044|gb|ACIB01000012.1| GENE 126 166975 - 169131 2172 718 aa, chain - ## HITS:1 COG:no KEGG:BF0603 NR:ns ## KEGG: BF0603 # Name: not_defined # Def: putative alpha-N-acetylglucosaminidase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 718 1 718 718 1468 100.0 0 MNRKSILLTILLCVASWAMASPVTGLLERIDKGASKKFIIEQVKSDADFFELDQKGDKVV IRGNNYVSIATGLNWYLKYYAGIHLSWNGMQAKLPAVLPPVTKKERRETTLPYRYDLNYC TFSYSMAFWDWNRWEKEIDWMALHGINIPLAVTGAEAVWHNVLDKLGYTKTEINEFISGP GFFAWWLMNNLEGWGGPNPDSWYTRQIALQKKILKRMREYGIEPVLPGYCGMVPHNAKEK LGLNVSDPGTWCGYRRPAFLQPSDPRFEEISSLYYKELEKLYGKANFYSMDPFHEGGNTA GVDLDAAGKAVMKAMKKANPKAVWVAQAWQANPRPKMIENLKAGDLLILDLTSECRPQWG DSTSEWYRKNGYGQHDWIYCMLLNYGGNVGLHGKMDNVIDNFYLAKADPHASATLKGVGM TPEGIENNPVMYELVMELPWRPDRFTKEEWLKEYVKARYGVDDPVVQAAWTNLANSIYNS PKNLTQQGTHESVFCARPAEDVYQVSSWSEMKDYYRPQEVIEAARLMVSVADRFKGNNNF EYDLVDIVRQALAEKGRLMQKAVTAAYRAGDKQLFALASGKFLDLILLQDKLLGTRPEFR 
VGKWIEEARALGDTPEEKELYEWNARVQITTWGNRNAADYGGLRDYAHKEWNGLLKDFYY MRWKLYFDFLSQRIEGKTPAEIDFYAIEEPWTKAANPYSAEAEGDCIEVAKQVMQAVE >gi|226332044|gb|ACIB01000012.1| GENE 127 169156 - 170301 963 381 aa, chain - ## HITS:1 COG:RSc3292 KEGG:ns NR:ns ## COG: RSc3292 COG3274 # Protein_GI_number: 17548009 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Ralstonia solanacearum # 10 371 1 329 336 92 26.0 1e-18 MELQKNPQHIVWLDVVRFVAMFTVVCCHCTDPFNFYPGTAPNIGEIKLWGAIYGALLRPC VPLFVMITGALLLPVRGEISVFYKKRIPRVLWPFLIWSVIYNLFPWITGVLGIKPEVILD FFPYSGEEVMRQSLPVSLDYIAQIPFNFSIVDVHMWYIYLLIGLYLYLPVFSAWVEKASD KAKLWFLGAWAVTLLLPYYNQFVAQYLWGTCSWNAFGMLYYFAGFNGYLLLGHYLRNLDW SLGKILAIGLPMFVIGYAVTFFGFRYVTALPEFSDEMLELFFTYCSLNVVMMTIPVFMLC KKVNFRSEGIRKALANLTLCGFGIYMIHYFFTGPSVLLVRALGVPLGIQIPVASVFAFGA SWLIVWTVYRVLGKKAKWIMG >gi|226332044|gb|ACIB01000012.1| GENE 128 170434 - 172080 497 548 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|169634422|ref|YP_001708158.1| fumarate hydratase [Acinetobacter baumannii SDF] # 53 531 12 482 508 196 31 8e-49 MATPPFKYQPMFEHGEDKTEYYLLTKDYVSVSEFEGTPVLKVQKEGLTAMANTAFRDVSF MLRRAHNEQVAKILSDPEASENDKYVALTFLRNAEVAAKGILPFCQDTGTAIIHGEKGQQ VWTGYCDEEALSLGVYKTYTEENLRYSQNAPLNMYDEVNTKCNLPAQIDIEATEGMEYKF LCVTKGGGSANKTYLYQETKAILNPGTLVPFLVEKMKTLGTAACPPYHIAFVIGGTSAEK NLLTVKLASTHYYDELPTTGNEYGRAFRDVELEKEVLAEAQKIGLGAQFGGKYLAHDVRI IRLPRHGASCPVGLGVSCSADRNIKCKINKDGIWIEKLDSNPGSLIPAELRGAGEGDVVK IDLNQPMADILKELTKYPVATRLSLNGTIIVGRDIAHAKLKERLDRGEDLPQYIKDHPIY YAGPAKTPAGMACGSMGPTTAGRMDPYVDLFQSHGGSMIMLAKGNRSQQVTDACKKYGGF YLGSIGGPAAILAQNNIKSIECVEYPELGMEAIWKIEVEDFPAFILVDDKGNDFFKQIKP RCGSCSNK >gi|226332044|gb|ACIB01000012.1| GENE 129 172504 - 175017 1859 837 aa, chain - ## HITS:1 COG:TVN1299 KEGG:ns NR:ns ## COG: TVN1299 COG0308 # Protein_GI_number: 13542130 # Func_class: E Amino acid transport and metabolism # Function: Aminopeptidase N # Organism: Thermoplasma volcanium # 65 496 7 453 783 181 28.0 6e-45 MKIHILFGMIGFILLSGCNRAAEKNPRFYDAGVSRELAGHRKAQIKNLKYELSFNIPRQK 
EVAIEGDITLRFDLASRQEVLIDFREEREKIKEVIANGVPVDKVRFENEHIILPASSTVE GANGIRIRFTAGNQSLNRNDEYLYTLLVPDRARTVFPCFEQPNLKAEFTLQLELPADWKA VSNTYIRSETVTDDRKTVCFAPTEPLSTYLFSFVAGKLERREYTRDGRTIAAYYRETDPK KVAQLDIVFGQVMASLHWLEEYTGIAYPFAKYDFIVLPGFQFGGMEHTGATLYNDNGIFL SEHPTPDEELNRAELIAHETSHMWFGDLVTMDWFDDVWTKEVFANYFAARIVEPLFPEIN HTQNKLKTFTAASLSEDRTMGTNAIRQPLDNLRNAGLIYGQIIYNKAPVMMEKLVDKMGE ANFRSGIQEYLKTYSYGNATWDDLIRILDSKTTEDLAAFSDVWVNRKGMPTLTFRTDGQE LEIRQHDPYNRGLLWPQRFAVTLCGERDSVIRVNMTDTLFRMQLPFVPSRVLPNTDGRGY GVFVPDEPALHWLAAHWWEIEDDTARQSLLMVLYENYLAKHISADDWVNSLITGLPAEKN ALVASTASGYLANVMREIAPANRAEVEARIYTMTQNHPLPSCRIQLIRLFMQNAISEPMV KKLYILWQQQSDKHLNRQDYTTLAYELAIRMPLESEQILRTQRARIDDPDRLRQFDFISR AAVSDTARLDTLFNSLLAAENRRIEPWTTAVIRYLNHPLREDQSVKYIRPGLEVLEEVQR TGDIFFPKNWAAALLGNHLSSSAYEEVVRFLNERPDYSPLLKNKILQAAYPLYRANN >gi|226332044|gb|ACIB01000012.1| GENE 130 175047 - 177038 1805 663 aa, chain - ## HITS:1 COG:no KEGG:BF0673 NR:ns ## KEGG: BF0673 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 663 1 663 663 1378 99.0 0 MGKTYRRLTEDEVLQLKSQSCLADDWNKVAVAEEFTTEFVHHTRFSGEVKLGVFHSDFIL PGGIKKHSGLRHVTLHNVTVGDNCCIENIQNYIANYEIGNNTFIENVDIILVDGLTQFGN GVETAVLNETGGREVLINDKLSAHQAYILALYRHRPELISRMKEITDYYSNKHASAVGTI GNHVMILNTGSIKNVRIGDFCRICGTCRLYNGSINSNESAPVHIGHGVICDDFIISSGSH VDDGAMLTRCFVGQACQLGHNYSASDSLFFSNCQGENGEACAIFAGPYTVTHHKSTLLIA GMFSFMNAGSGSNQSNHMYKLGPIHQGTLERGAKTTSDSYILWPARVGAFSLVMGRHVNH ADTSNLPFSYLIEQQNTTYLVPGVNLRSVGTIRDAQKWPKRDKRTDPNRLDYINYNLLSP YTIQKMFKGRSILKELKRVSGETSEIYSYQSAKIKNSSLNSGIRYYEIAIHKFLGNSIIK RFEGINFKDNEEIRRRLKPDTEIGVGEWVDIAGLIAPKSEVEKLIDGIESGEINRLKSMN ACFAAMHDNYYTYEWTWAYHKIQEFYGLNPETITAKDIIAIVRAWREAVVGLDRMVYDDA RKEFSLSSMTGFGADGSRDEMKLDFGQVRGDFESNPFVTAVLKHIDDKTALGEELINRIG QLA >gi|226332044|gb|ACIB01000012.1| GENE 131 177063 - 178316 1502 417 aa, chain - ## HITS:1 COG:XF0088 KEGG:ns NR:ns ## COG: XF0088 COG2262 # Protein_GI_number: 15836693 # Func_class: R General function prediction only # Function: GTPases # Organism: Xylella fastidiosa 9a5c # 19 401 
13 371 450 260 40.0 3e-69 MKEFVISEAKVETAVLVGLITQTQDERKTNEYLDELAFLAETAGAEVVKRFTQKLPTANS VTYVGKGKLEEIKEYIRIEAEEDREVGMVIFDDELSAKQIRNIEAELKVKILDRTSLILD IFAMRAQTANAKTQVELAQYKYMLPRLQRLWTHLERQGGGSGSGKGGSVGLRGPGETQLE MDRRIILNRMSLLKERLAEIDKQKSTQRKNRGRMIRVALVGYTNVGKSTMMNLLAKSEVF AENKLFATLDTTVRKVIIDNLPFLLSDTVGFIRKLPTDLVDSFKSTLDEVREADLLVHVV DISHPGFEEQIEVVNKTLADIGGGGKPMILIFNKIDAYTYVEKAPDDLTPKTKENLTLEE LMKTWMAKMEDNCLFISAREKINIDELKSVVYQRVKELHVQKYPYNDFLYQTYEEEE >gi|226332044|gb|ACIB01000012.1| GENE 132 178433 - 179827 1095 464 aa, chain - ## HITS:1 COG:no KEGG:BF0597 NR:ns ## KEGG: BF0597 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 464 1 464 464 908 99.0 0 MKKIILFSTLCASLFLGSCSSSFLDQEPPLYVNENDIFTSPTRIEATLNGLYAAIKNTGT KSLMGGKSYLVFDNRGDDVINISNNLVTLFNTYNMNVGITDAENADTWTYAYLAINKVNT FLQSLEGAREVAGENYDRYVQEAKFVRALAYYYLNNLYPTPYSVNPDAKSVPLRLTAEAG TENNNMPRSTVKQIYEHILSDLENISALDTEVNTYTGVTHATQAAANMLKMRVYMAMNEW DKAIAAGELVTGYSLPEDVTLIYKAPYFSQESIFSLPMADTNIPNTQQSLAEYYYDGKIM LIDTKSGIMSKPDYSLATDKRIIAFKGEKDLLMKFTDAKTKLQWVPIFRYAETLLDLAEC YANKAGGEATAKSLLKQVRGRSVDAATDPLNIDNLSGDALKEAIYNEKRLEFIGEGIRGI DIMRRGEHFIKVGENETINVGPSDEKYTWPIPQVELLLNKDINK >gi|226332044|gb|ACIB01000012.1| GENE 133 179862 - 182756 2530 964 aa, chain - ## HITS:1 COG:no KEGG:BF0670 NR:ns ## KEGG: BF0670 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 964 4 967 967 1818 100.0 0 AEVAIAPNIRVILKTDSKALDEVVVVAYGTQSARTVTASVSTVRADALKDVPSVSFDQML QGRASGVSITTPSAGVGQAPIVRVRGVNSITSGTSPLYVVDGVPIESGNLSYLANANALA DINPADIVSMDVLKDAAAAALYGSRAANGVILITTKQGQSGKVKVSYDGFVGFSNATDFY EMMNAQEYVDFKNLAVKNRYGTDELSLTTGYVSPYGNKAFNMMKDANGNYVDTDWKDAAF QNGLSQSHSVAVSGGSDKVRYYLSGNYTTQEGIVKGDKYDRLGVKANINVQATDWLKVGM NTNVTTGTTSYVDAARRGSNFAVGGFPRLALINAPNLPMYNEDGTPYYLAQGLGYGGNTV FSTFSNPAAILSLGNGLSSDVTRFIGVFYAEATPLKGLSLKTQYGVDYARIEEQRFWSPL HGDGVNSKGLANAYNTKNNRWTWTNTATYNFSLGQNNFNLLAGTEASERNNSRWTAQRKD LQDDKFVVFQGPFGSATAGGSLSNNTMVSYFGRINYDYASKYIVSLNYRRDGYSALSEKN 
RWGNFGGVSAAWRVSEEGFFKPLRNVVDDLKIKGSYGVVGNTDIYDYASKSFYSSYNYGI NGTYGLAQIADPNLKWESSEKYSIGFNARLLDRISVDFDYYYTKSSDLILDVPQSPSKGI PGNIITTNAGKMKNSGIELTVSADVIRNSQFTWETSFNITTNKNKVISLADGVENILKGD NGGLEITNITVPGKSIGRLYLYPTAGVDPKSGRRVFITPEGDRTLLMFEKGGWFYEDGTE YAGEFEPVDCGNTLPTWYGGWTNNFKYKGFDLSLFFQFSGGNKIYNGTKASVSDMRYWNN SKDVYKKYWTPERTHAEYPMPIYGDNYSNGSALPISDLVERGDYLRLKNVSLGYTFNTKN WSKAVGISALRLYVQAQNLFVITGYSGMDPETLTNVESATLSGGTDKNTLPQARTYTIGV NLTF Prediction of potential genes in microbial genomes Time: Tue May 17 22:22:28 2011 Seq name: gi|226332043|gb|ACIB01000013.1| Bacteroides sp. 3_2_5 cont1.13, whole genome shotgun sequence Length of sequence - 7290 bp Number of predicted genes - 4, with homology - 4 Number of transcription units - 3, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 438 - 471 -0.9 1 1 Op 1 . - CDS 477 - 1979 1079 ## BF0591 putative outer membrane receptor protein 2 1 Op 2 . - CDS 2001 - 4823 1771 ## BF0592 putative surface membrane protein - Prom 5011 - 5070 8.4 - Term 5313 - 5366 -0.8 3 2 Tu 1 . - CDS 5456 - 6637 746 ## BF0667 tyrosine type site-specific recombinase - Prom 6728 - 6787 8.4 + Prom 6968 - 7027 12.8 4 3 Tu 1 . 
+ CDS 7091 - 7289 149 ## BF0594 hypothetical protein Predicted protein(s) >gi|226332043|gb|ACIB01000013.1| GENE 1 477 - 1979 1079 500 aa, chain - ## HITS:1 COG:no KEGG:BF0591 NR:ns ## KEGG: BF0591 # Name: not_defined # Def: putative outer membrane receptor protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 500 1 500 500 964 99.0 0 MKSTIYKALIGATLIIGLSSCVNNWLDVEPGDQVEADEAVTTSADLSSVRAGMYQIIKGT SDFKDYYAARMFYYGEVRGEDMQTEKSGSRSQLCYDMTYSTADNAPSIWQTPYIVIGRAN RIIEAAKSGKLTDKEEAADIIAQYAAEAKVARAMAHFDLVRVYGKTYTAPDAPNSLGIPL VTTVLGSDSKLIRNTVSEVYTQVIKDLNEAINSKALSESSTPGYINLWAAKALLSRVYLT QGDNQKSLDVAEDIIKNSPYKLWKNEEYVGAWSKISGVHSNEMLFEIAITGSTDWTDREG IAYLYNEDGYADIIATKKFLDLLNEDPDDVRLGVFLAPTTKDFKKLYGTNTVFLNKYPAD GLSDLRYNNVPLVRLSEVYLNAAEAAAKLGNQNDKAVEYLDAIVKRANPNKTVKGTTVTV DRVLLERRKELIGEGHRFFDAMRNNETVIRYTSKEDQGWQQALKEEARSFNRDFYKTLLP IPQDEIDANPSMKDQQNTGY >gi|226332043|gb|ACIB01000013.1| GENE 2 2001 - 4823 1771 940 aa, chain - ## HITS:1 COG:no KEGG:BF0592 NR:ns ## KEGG: BF0592 # Name: not_defined # Def: putative surface membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 940 13 952 952 1798 99.0 0 MQTAEVAIRPVVKIILKPDTEVLDEVVVTGYGTFKKSTFTGAASTMATEKLQDIPTMAIE DKLAGSIPGVQISSFSGAPGATTSVRIRGMGSMNAGNDPLYVIDGTPVQSGNISAFNTPN TGEGYNSTGTNVLATLNSNDIESITVIKDAAAASLYGSRAANGVIVITTKRGATGKTQFN FRSDWGFSNIAVNYRPTLNGDDRRELIKFGLKNYYMDEEGMTAAQAEIALEDDIDAFAAK PVNGWTNWKEILFKNGSHQNYEISAQGGTEKTKFYTSLAYAKQEGITARSGLERMTGNAN LSHETGRVKVEASTLFSRILQNMTNEGTSFASPIMNAFWTASPSTVPYNEDGTFSSNFPL TNGANPVQTRTYNYDRNAITRSFNTLAATVTLWDELKLREKIAFDYTSSIESVWWDPRSN DGRSSNGVFQRYNRTLETLTTQTQLTYIKTFAQKHNLDALLGFETEDFTDSWVYTHGSQY PGYKNEIANAGETSSNSNRDKSRLTSFLGRVNYNFDNTYYAGVSYRRDGSSRLSRDNRWG NFWSISGSWRFMQESFLESIKNVITDGKLRLSYGVNGTQPSTFYSYMINMYRAGQIYNGQ SGMGVIGIGSPDLKWEKNKAFNLGLDLTFWDRLSLTLDYYTRKTNDLLMNKRISYVPGYY DPISFAPTTLQNVGSLENKGVEISLSSTNMQTQDLIWTTTFNIGHNKNKLVKLDGIQTDE IDGALIHRVGEPYYSYYMFEYAGVDPKTGNEMYYKNDGTNETTTLTFEAQKVIVGHHDPK VEGGLTNFVKWKFIDLNLTLTYSLGGKAVDYATWLHDNGGTYTSYGAVPSYYKLEDMWKQ 
EGDNAKLPKFKYGNSRVLSSRWLMPTDYLRLKNLSLGFSAPKGLLSKMGISKARVYFSAN NLLTWKSKDLLVDPEMPVDGLCTFEMPALRTYTFGLEIGF >gi|226332043|gb|ACIB01000013.1| GENE 3 5456 - 6637 746 393 aa, chain - ## HITS:1 COG:no KEGG:BF0667 NR:ns ## KEGG: BF0667 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 393 1 393 393 703 99.0 0 MDKIIYSLVYNRKKSLNKKGMALVQVEAYLNRKKKYFSTKVYLSPDQWDFKKRMVKNHPN ADAINHMLYEFMAEIEKKELGLWQQGKQISLDSLKNSMENQDDSTSFIAFFRNEIAKSSL KESTKRNHLSTLELLRSYKKDVSFSELTFEFISSFDHYLQQKGYHTNTIAKHMKHLKRHI NVAINKEYMEIQKYAFRKYKIKSVENNHTHLSPEELGKIESLELGGRFTKLEKTKDAFLF CCYAGLRYSDFTNLSPENIVKMHQETWLIYKSVKTNTEVRLPLYLLFEGKGIEVLNKYQD DLADFFKLRDNSNVNKELLIIAKLSGLNKRISFHTARHTNATLLIYSGVNITTVQKLLGH KSVKTTQVYTNIMDITIVRDLEKSKNNHKVSYM >gi|226332043|gb|ACIB01000013.1| GENE 4 7091 - 7289 149 66 aa, chain + ## HITS:1 COG:no KEGG:BF0594 NR:ns ## KEGG: BF0594 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 66 1 66 1011 133 100.0 2e-30 MKRKLMLLLTCLFMGIGLVTAQTQKVTGVVISEEDGQPVVGASVLAKGTTVGVITDVDGK FTLSGI Prediction of potential genes in microbial genomes Time: Tue May 17 22:23:17 2011 Seq name: gi|226332042|gb|ACIB01000014.1| Bacteroides sp. 3_2_5 cont1.14, whole genome shotgun sequence Length of sequence - 105197 bp Number of predicted genes - 72, with homology - 71 Number of transcription units - 35, operones - 19 average op.length - 2.9 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 11 - 2956 2474 ## BF0666 hypothetical protein 2 1 Op 2 . + CDS 2968 - 4563 1335 ## BF0665 hypothetical protein + Term 4603 - 4649 13.6 - Term 4593 - 4634 7.0 3 2 Op 1 . - CDS 4667 - 5311 650 ## COG2095 Multiple antibiotic transporter 4 2 Op 2 . - CDS 5368 - 6060 477 ## COG1011 Predicted hydrolase (HAD superfamily) - Prom 6089 - 6148 4.0 - Term 6077 - 6121 8.0 5 3 Op 1 . - CDS 6150 - 7007 977 ## BF0662 hypothetical protein 6 3 Op 2 . 
- CDS 7043 - 7717 595 ## COG0313 Predicted methyltransferases 7 3 Op 3 . - CDS 7737 - 8465 513 ## BF0584 hypothetical protein - Prom 8556 - 8615 6.1 + Prom 8466 - 8525 3.9 8 4 Op 1 . + CDS 8581 - 9180 420 ## COG1435 Thymidine kinase 9 4 Op 2 . + CDS 9243 - 10382 804 ## COG0628 Predicted permease + Term 10577 - 10615 6.4 + TRNA 10477 - 10549 70.0 # Lys TTT 0 0 - Term 10804 - 10838 -1.0 10 5 Tu 1 . - CDS 10980 - 11984 521 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 12128 - 12187 6.2 11 6 Op 1 . + CDS 12170 - 12328 106 ## BF0629 hypothetical protein 12 6 Op 2 . + CDS 12318 - 14657 1838 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 13 6 Op 3 . + CDS 14691 - 15914 1187 ## BF0627 hypothetical protein + Prom 15930 - 15989 2.2 14 6 Op 4 . + CDS 16010 - 16186 326 ## BF0576 hypothetical protein - Term 16164 - 16204 -0.9 15 7 Op 1 . - CDS 16382 - 17962 1296 ## COG3507 Beta-xylosidase 16 7 Op 2 . - CDS 17974 - 20757 2581 ## COG1874 Beta-galactosidase 17 7 Op 3 . - CDS 20770 - 23697 1569 ## COG3250 Beta-galactosidase/beta-glucuronidase 18 7 Op 4 . - CDS 23716 - 25284 1397 ## BF0622 hypothetical protein 19 7 Op 5 . - CDS 25297 - 28578 2635 ## BF0621 hypothetical protein - Prom 28602 - 28661 11.1 20 8 Tu 1 . - CDS 28674 - 31076 1321 ## BF0620 hypothetical protein - Prom 31215 - 31274 5.1 - Term 31212 - 31259 4.2 21 9 Op 1 6/0.000 - CDS 31283 - 32281 741 ## COG3712 Fe2+-dicitrate sensor, membrane component 22 9 Op 2 . - CDS 32343 - 32930 376 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 23 10 Tu 1 . + CDS 33058 - 33297 284 ## BF0617 hypothetical protein + Term 33317 - 33371 12.8 - Term 33303 - 33358 9.2 24 11 Tu 1 . - CDS 33367 - 34272 958 ## COG0668 Small-conductance mechanosensitive channel - Prom 34420 - 34479 2.9 + Prom 34223 - 34282 5.5 25 12 Op 1 . 
+ CDS 34485 - 36704 1978 ## BF0565 putative TonB-dependent transmembrane receptor protein 26 12 Op 2 2/0.000 + CDS 36708 - 37280 531 ## COG3201 Nicotinamide mononucleotide transporter 27 12 Op 3 . + CDS 37267 - 37890 510 ## COG1564 Thiamine pyrophosphokinase 28 13 Tu 1 . - CDS 37844 - 38794 816 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily - Prom 38876 - 38935 4.4 - Term 38978 - 39029 10.4 29 14 Tu 1 . - CDS 39207 - 40508 1322 ## COG0498 Threonine synthase - Prom 40566 - 40625 4.1 30 15 Op 1 . - CDS 40651 - 41862 1356 ## COG3635 Predicted phosphoglycerate mutase, AP superfamily 31 15 Op 2 . - CDS 41874 - 44309 2765 ## COG0527 Aspartokinases - Prom 44379 - 44438 4.6 + Prom 44635 - 44694 8.6 32 16 Tu 1 . + CDS 44803 - 45852 949 ## COG0252 L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 33 17 Op 1 . + CDS 46511 - 47878 1175 ## COG1066 Predicted ATP-dependent serine protease 34 17 Op 2 . + CDS 47934 - 48305 316 ## BF0604 hypothetical protein 35 17 Op 3 . + CDS 48350 - 49939 1311 ## COG2509 Uncharacterized FAD-dependent dehydrogenases 36 17 Op 4 . + CDS 49944 - 50543 525 ## COG2197 Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain + Prom 50571 - 50630 1.7 37 17 Op 5 . + CDS 50654 - 53041 2249 ## COG1629 Outer membrane receptor proteins, mostly Fe transport + Term 53086 - 53134 17.2 - Term 53074 - 53122 13.4 38 18 Op 1 . - CDS 53194 - 54168 693 ## BF0600 hypothetical protein 39 18 Op 2 . - CDS 54165 - 55991 1745 ## COG0826 Collagenase and related proteases 40 18 Op 3 . - CDS 55988 - 56905 833 ## COG1897 Homoserine trans-succinylase - Prom 56928 - 56987 2.5 + Prom 56887 - 56946 4.2 41 19 Tu 1 . + CDS 57065 - 57235 161 ## COG2768 Uncharacterized Fe-S center protein + Term 57255 - 57304 10.9 - Term 57244 - 57290 13.2 42 20 Tu 1 . - CDS 57318 - 60053 1978 ## BF0546 putative exported rhamnosidase A - Prom 60112 - 60171 3.9 - Term 60115 - 60159 3.1 43 21 Op 1 . 
- CDS 60183 - 61376 1336 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase - Prom 61397 - 61456 1.8 44 21 Op 2 . - CDS 61459 - 62673 1331 ## COG0807 GTP cyclohydrolase II 45 21 Op 3 . - CDS 62678 - 64588 1169 ## COG0795 Predicted permeases 46 21 Op 4 . - CDS 64665 - 65051 204 ## BF0592 hypothetical protein - Prom 65295 - 65354 80.3 + TRNA 65276 - 65352 85.3 # Met CAT 0 0 + Prom 65714 - 65773 7.0 47 22 Tu 1 . + CDS 65926 - 66582 498 ## BF0591 hypothetical protein + Term 66753 - 66797 7.6 - Term 67353 - 67391 4.2 48 23 Tu 1 . - CDS 67538 - 68293 446 ## BF0590 hypothetical protein - Prom 68332 - 68391 5.6 49 24 Op 1 . - CDS 68399 - 71347 1982 ## BF0589 hypothetical protein 50 24 Op 2 . - CDS 71399 - 71584 67 ## 51 24 Op 3 . - CDS 71520 - 73913 2011 ## BF0588 hypothetical protein - Prom 73959 - 74018 1.8 52 25 Op 1 . - CDS 74037 - 75524 1276 ## BF0587 hypothetical protein 53 25 Op 2 . - CDS 75548 - 78961 2631 ## BF0536 putative outer membrane protein - Prom 79019 - 79078 3.9 - Term 78978 - 79020 1.1 54 26 Op 1 6/0.000 - CDS 79080 - 80090 609 ## COG3712 Fe2+-dicitrate sensor, membrane component 55 26 Op 2 2/0.000 - CDS 80144 - 80707 458 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 80739 - 80798 6.9 56 26 Op 3 . - CDS 80866 - 82305 1062 ## COG0642 Signal transduction histidine kinase 57 26 Op 4 . - CDS 82334 - 83461 873 ## COG2205 Osmosensitive K+ channel histidine kinase - Term 83477 - 83516 7.5 58 27 Op 1 . - CDS 83557 - 84267 566 ## BF0581 hypothetical protein 59 27 Op 2 18/0.000 - CDS 84316 - 84891 608 ## COG2156 K+-transporting ATPase, c chain 60 27 Op 3 20/0.000 - CDS 84903 - 86951 1904 ## COG2216 High-affinity K+ transport system, ATPase chain B 61 27 Op 4 . - CDS 86973 - 88679 1548 ## COG2060 K+-transporting ATPase, A chain - Prom 88737 - 88796 3.8 - Term 88999 - 89047 13.4 62 28 Tu 1 . 
- CDS 89120 - 90463 1235 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 63 29 Tu 1 . + CDS 90768 - 92132 1033 ## COG0527 Aspartokinases + Term 92182 - 92226 14.1 - Term 92170 - 92214 14.1 64 30 Tu 1 . - CDS 92241 - 92357 122 ## gi|253564079|ref|ZP_04841536.1| predicted protein - Prom 92420 - 92479 7.3 + Prom 92346 - 92405 5.6 65 31 Op 1 . + CDS 92451 - 94931 2084 ## COG0446 Uncharacterized NAD(FAD)-dependent dehydrogenases 66 31 Op 2 . + CDS 94941 - 95291 339 ## BF0574 putative transcriptional regulator - Term 95326 - 95385 -0.0 67 32 Tu 1 . - CDS 95450 - 96748 1008 ## BF0573 putative secreted tripeptidyl aminopeptidase - Prom 96908 - 96967 6.0 + Prom 96830 - 96889 5.3 68 33 Op 1 . + CDS 96926 - 97558 616 ## BF0572 hypothetical protein 69 33 Op 2 . + CDS 97576 - 98202 706 ## BF0571 hypothetical protein + Term 98236 - 98274 6.2 - Term 98221 - 98265 6.2 70 34 Tu 1 . - CDS 98326 - 99786 1230 ## COG3119 Arylsulfatase A and related enzymes - Prom 99852 - 99911 4.4 - Term 99854 - 99889 1.9 71 35 Op 1 . - CDS 99981 - 101951 1943 ## BF0569 hypothetical protein 72 35 Op 2 . 
- CDS 101967 - 105197 2951 ## BF0568 hypothetical protein Predicted protein(s) >gi|226332042|gb|ACIB01000014.1| GENE 1 11 - 2956 2474 981 aa, chain + ## HITS:1 COG:no KEGG:BF0666 NR:ns ## KEGG: BF0666 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 981 80 1060 1060 1833 99.0 0 MQTAEVAIAPVIKVILKTDAKALDEVVVTAMGISKEKKALGYAVQDVKGDKLTQASSSNL SSALQGKVSGVEISPSSGMPGASSKMTIRGSRSFTGDNTPLYVVDGMPISSSTDLDTFNS VTGSDYANRAVDIDPNDIESINILKGQAASALYGMRASNGVVVITTKSGKGARKGKPEVT INTGLSFDKVSTMPDFQKEFAQGFNGAFSPSDSRSWGPLISELANDPKYGGNTDNSYTQK FGKHQGQYYVDQRAKAGLDPWATPRAYDNAKDFFNTGVTWNSSANVAQSLDKGSYSLSLG STTANGIVPSTGMDRYNAKLTAQAQLSKNWSTGFNGNFVYSKIKKQTGANNGIMATVYGA PSSYDLGGIPSHIDGDPYTQNTYRSTGGFDGAYWAVENNSFKERTQRFFGNAFAKYSTDF GTENHKLDVKYQLGTDAYTTNYTDTWGYGHSNGTGSIEEQSVTNNEINSLLTVTYDWKIT PELDLNVLYGNELVDNSSKYKYNLGSNFNFPGWNHIANASIYSSTGTYHRERTVGNFGNI SLAYRNMLYLNATVRNDIVSSMPRNNRSFTYPSVSLGFIFTELEALKNDVLTYGKIRASY AEVGQAGTYYPSYYRTPVYGGGFSSGTPIQYPIGSISSYIPYYKIYDPNLKPQNTKSYEI GADFSFWNGLISLNYTYSRQNVKDQIFEVPLATSTGYSELITNGGSVHTNSHEITLSVNP IQTKNFNWDFAFNFSKIDNYVDKLAPGVSSIMLGGFVDPQVRLSAGDKFPVIYGTSYQRN EEGQIVVDENGMPTLGENRVLGNVSPDFRMGFNTTFEFYKFRLSAVLDWKQGGCMYAGSV STLDYYGVTQKSADYRKADHFYFEKPAVKQLADGSYAPNDIKISGENAYNYFDRLSTISE AGVYGSSFLKLREIALSYPVLNKSYLGVTVNVFARNLLLWSEMDNGIDPESSQGNNNMAG AFERFSLPGTSSYGFGITVKF >gi|226332042|gb|ACIB01000014.1| GENE 2 2968 - 4563 1335 531 aa, chain + ## HITS:1 COG:no KEGG:BF0665 NR:ns ## KEGG: BF0665 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 531 1 531 531 1024 99.0 0 MKSIFNLSKGILSVALISVAFASCSEDTMDNINKDKDHTTSVPAKFILADVITATAFSNI GGDFNTYYSTYVEHMVGVDNQLANAEKRNGEPSASSTFNNVWGNLYSTLKNARIAINISS NEVTGNYTTKGIGEVLAAINAGLIADSFGDTPFSQAALPELANGQPQFLTPELDKQEAIY TAIMEYLDTAITDLPKGDKSDEIGEYDFIYKGDGEAWLKLAYGLKARYTMRLLARSSSKD ADLQKILEYVDKSYTSIEEQAAFSIYSATNLNPLFDFQWSRDGLAASKSYADKLIERNDP RLRRIFCIGQGKLTENENAVSIQVTGADDPRFLMADNGTAESVKYEYNTPIFVYSQTCPT 
LLMSYHELLFLKAEALARLNRKDEAANALKDAVIAAIANAETGVSAAFNAPTVKSYGGVK ETTKAITATEAEEYFTNNVKPLFDANPVKEVMVQKYIAFLGAFGETTECYNDVRRLKAMG EEYIKLDNPYKFPLRAPYGADDVSANPNVEVAFGNGQYVYTDPVWWAGGSR >gi|226332042|gb|ACIB01000014.1| GENE 3 4667 - 5311 650 214 aa, chain - ## HITS:1 COG:BS_yvbG KEGG:ns NR:ns ## COG: BS_yvbG COG2095 # Protein_GI_number: 16080438 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Multiple antibiotic transporter # Organism: Bacillus subtilis # 4 201 2 202 211 125 38.0 6e-29 MDTLLPFALLCFTSFFTLTNPLGTMPVFLTMTHGMTDKERQAVVRRATFVSFITLMVFVF AGQFLFKFFGISTNGFRIAGGVIIFKIGFDMLQARYTPMKLKDEEIKTYADDISITPLAI PMLCGPGAIANAIVLMEDAHTIEMKGTLIGIIALIYFITFLILRASTRLVKVLGETGNNV MMRLMGLILMVIAVECFVSGLHPILVGILKEGLL >gi|226332042|gb|ACIB01000014.1| GENE 4 5368 - 6060 477 230 aa, chain - ## HITS:1 COG:YPO2295 KEGG:ns NR:ns ## COG: YPO2295 COG1011 # Protein_GI_number: 16122519 # Func_class: R General function prediction only # Function: Predicted hydrolase (HAD superfamily) # Organism: Yersinia pestis # 1 230 1 222 224 117 32.0 1e-26 MKYKNLFFDLDDTLWAFSQNAYDTFEEVYDKYRLGQYFDSFSHFYSLYQRRNTELWVEYG NGQVTKEELNRQRFLYPLQAVGIDNEMLAKRYSDDFFSIIPTKSRLMPYAEEVLSYLAPK YNLYILSNGFRELQSRKMRSSGIDTYFNKIILSEDLGVMKPWPEIFYFALSATQSELRES LMIGDSWEADITGANGIGMHQAYYNVSGRADFPFRPTYLVTDLKELMELL >gi|226332042|gb|ACIB01000014.1| GENE 5 6150 - 7007 977 285 aa, chain - ## HITS:1 COG:no KEGG:BF0662 NR:ns ## KEGG: BF0662 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 285 1 285 285 442 100.0 1e-123 MKKLAILLVCAAMLASCNGLGGGSKDLKAENDSLLMELTQRNAELDEMMGTFNEVQEGFR KINAAESRVDLTRGTISENSATAKQQIASDIEFITKQMEENKAQIAKLQAMLKSSKNNSA QLKKAVESLTQELVAKTQRIEELQAELASKNIRIQELDAAVTGLTADKESLAAENEAKAK TVAEQDKAINSAWFVFGTKSELKTQKILEKGDVLKSADFNKDYFTQIDIRTTKEIKLYSK RAELLTTHPAKSYELVKDDKGQLTLKITNPKEFWSVSKYLVIQVK >gi|226332042|gb|ACIB01000014.1| GENE 6 7043 - 7717 595 224 aa, chain - ## HITS:1 COG:all4680 KEGG:ns NR:ns ## COG: all4680 COG0313 # Protein_GI_number: 17232172 
# Func_class: R General function prediction only # Function: Predicted methyltransferases # Organism: Nostoc sp. PCC 7120 # 2 221 8 228 285 225 48.0 4e-59 MGKLYVVPTPVGNLEDMTFRAIKVLKEADLILAEDTRTSGILLKHFEIKNVMQSHHKFNE HKTVESVVNRIKAGETIALISDAGTPGISDPGFLVVRECVRNGIEVQCLPGATAFVPALV ASGLPNEKFCFEGFLPQKKGRMTKLKSLVDEHRTMVFYESPHRLLKTLTQFAEYFGPERQ VSVSREISKIHEETVRGTLSELIEHFTATDPRGEIVIVLAGIDD >gi|226332042|gb|ACIB01000014.1| GENE 7 7737 - 8465 513 242 aa, chain - ## HITS:1 COG:no KEGG:BF0584 NR:ns ## KEGG: BF0584 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 242 13 254 254 362 99.0 5e-99 MQALLNDIELDIQELKYLMEIISREPDSVLRGVARRNIVQMRGRLDALLELLDAKPVIAA DASESPENRPVTETIEVVEIPALKETVADIPEENEEAAVAEDSQEAVEPGPLSLSEAVIE QVPTVELDPVADEKEEVFESVSVPQSEEPTVKVETRVASSPILAERIKTAGDLRRSISLN DSFRFSRELFGGSMEQMNNVLHQIGEMSSLDAALVFLSSKIKVDEENEAMNDFVELLRKH FI >gi|226332042|gb|ACIB01000014.1| GENE 8 8581 - 9180 420 199 aa, chain + ## HITS:1 COG:BH3779 KEGG:ns NR:ns ## COG: BH3779 COG1435 # Protein_GI_number: 15616341 # Func_class: F Nucleotide transport and metabolism # Function: Thymidine kinase # Organism: Bacillus halodurans # 9 189 1 188 204 187 50.0 8e-48 MVLFSEDHIQETRRRGRIEVICGSMFSGKTEELIRRMKRAKFARQRVEIFKPAIDTRYSE GDVVSHDSNSISSTPIDSSASILLFTSEIDVVGIDEAQFFDSGLIDVCNQLANNGVRVII AGLDMDFKGVPFGPMPALCAIADEVSKVHAICVKCGQLASFSHRTVKNDKQVLLGETAQY EPLCRECYQRALQEDREKS >gi|226332042|gb|ACIB01000014.1| GENE 9 9243 - 10382 804 379 aa, chain + ## HITS:1 COG:RSc2624 KEGG:ns NR:ns ## COG: RSc2624 COG0628 # Protein_GI_number: 17547343 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Ralstonia solanacearum # 37 347 36 338 356 94 26.0 4e-19 MERKKITFDSFIRGSICCALIVGLLILFKRLSGVLLPFFVAWLIAYMIYPLVKFFQYKLR FKSRIISIFCALFSITIVGISLFYLLVPPMLAEMGRMNDLLVTYLTNGTYSSGTVPPTLS EFIHKHIDLQALNRILSEENIMNTIKETVPKLWALVAESINILFSVFASFIILLYVVFIL LDYESIAEGWLHLLPGKYRTFASNLVNDIQDGMNRYFRGQAFVAFCVGILFSIGFLIIDF 
PMAIALGLFIGALNMVPYLQIIGFLPTIVLAILKAADTGENFWVILAGALIVFIVVQAIQ DGFLVPRIMGKITGLNPAIILLSLSIWGSLLGMLGMIIALPLTTLMLSYYQRFIINKEKI KYDRHEVTDNQSAEENTKK >gi|226332042|gb|ACIB01000014.1| GENE 10 10980 - 11984 521 334 aa, chain - ## HITS:1 COG:alr2587 KEGG:ns NR:ns ## COG: alr2587 COG2207 # Protein_GI_number: 17230079 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Nostoc sp. PCC 7120 # 189 326 183 321 327 115 43.0 2e-25 MVTRLMKHDLCEMVLEWPMPNIVKGKRTEKNNYRIDNTYLKATFEEISTPLFSIMDQHIS SEEPIKIYTRADDYHAVWFCATFAGHATCCYNSVIRSEEWNKGDANLLKCDGVDSCVHFP KNTPFHMMEIMLSHDYLRELAIHYPDLLGGKDFETVLCNGLYRAYRKNRPFGPGVYKALH DIRLSQLNGNMASMYADAKIREILSLFLAGQEEENYLHCSCTIGITCDKIHHARAIIEQE YLNPPSLHQLALRVGTNECTLKRGFKTVFGTTVFGHIFEYRMKMACRYLLDSSKTIQEIG ACVGYEYHAHFSTAFKRKFGLTPLEYRCSRLSGS >gi|226332042|gb|ACIB01000014.1| GENE 11 12170 - 12328 106 52 aa, chain + ## HITS:1 COG:no KEGG:BF0629 NR:ns ## KEGG: BF0629 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 52 1 52 52 82 100.0 4e-15 MKVNLSGVPETLFITLRVQAAETAKPNSSTRAPYTIEVRPTFIEIRRESHEI >gi|226332042|gb|ACIB01000014.1| GENE 12 12318 - 14657 1838 779 aa, chain + ## HITS:1 COG:STM2199 KEGG:ns NR:ns ## COG: STM2199 COG4771 # Protein_GI_number: 16765529 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Salmonella typhimurium LT2 # 112 239 33 164 663 76 36.0 2e-13 MKYRILVLTAILFTACNLLHAQVQTGKLSGVIIDANNEETLVGANIYVESLKKGTSTDKN GEFSIEVPAGHYRIVISYMGYRTRQEEVRISELKTKKLVIRLEPETQSLGEVVVTAKSEA RQLREQAMPMSVISMQQLQGTVSNVQDVLSKTVGVTIRNTGGVGSSSRVSVRGLEGKRIG FFIDGSPMNDNSDFIDINDIPVDMIDRIEIYKGVVPARFGGSSVGGAVNIVIREYPPKYL DASYSIESFNTHKLSLVTKRNIATKGLEFGGGGFYTYSDNNYKMESPFEEGLIIKRNHDK FKKLAVAGSLKARKWWFDLVEFEPVFIHTFKEIQGIEYNIEKAHTYSDAFIFANKLEKEN FLTEGLDMESNLAYAYTVFHMVDTAAYRYNWDGTTYPAVSEYGGEIGKWASNARNEKHTI THKLHLNYVINNNHSINLNSLFSFASGHPKDDLKNKVVGYKTNFRSTMASWIAGLGYDFR 
TDNDIFLNSLNVKYYMYGMNTHMSSIMSSEAEKVDMLKRDFGISNALRYRFTPDFMGKLS VGYDVRLPAESELLGDGYTVAPSGNLLPERNTSVNLGFLLDRTGKDASNLQVEVNTFYGY LENMIRFTGGYLQSQYQNFGKMRTLGVEVEVKADLTHWLYGYCNMTYQDLRDVRKFEPNT HITNPTKGSRMPNIPYLLANAGLEYHKENLFGGKGQNTRIFTDGSFVEEYFYDFEQSRFQ ERRIPRTFSANIGIEHSFMNGRFIISARMNNLTDARMMSEFNRPLPGRNWEVRLRYVLK >gi|226332042|gb|ACIB01000014.1| GENE 13 14691 - 15914 1187 407 aa, chain + ## HITS:1 COG:no KEGG:BF0627 NR:ns ## KEGG: BF0627 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 407 1 407 407 790 99.0 0 MKIRNLMLALAGTALVFSACEKDPNPKVPTTDANIALATILPNPDGMTGAAYLQLIRDEF PQNTNNNNGIPIPYGGSYPIIEGNDIFVFPGYMGDSKNELVKYTRNNGQLSRTGTMKLPP NSSATNIVFASTGKAYLSMAGLGKIAIFDPTTMTQQGEIDLTSLGVSDSNPDPSAMLLRD GLLFVGLSQMVGGWIPPQDRPYSDIAIIDTQTDKLLKMITDKTSGISMPTRPIDRYSIFM DEKKDIYISCMGGFGMVKGHNAGVMRIKAGETEFDPTYQWTITGAAIGGEEKTAGFISAI RYVGNGKAYAYINMPGYYKPGEQGHTAIADLAVEIDLYNKTMKKVQGLDLSNGFGVMLSL YKGSMLIGNSSAKAKGIYSLDIQTGEVSKEPILTTVGNPILCYYFEK >gi|226332042|gb|ACIB01000014.1| GENE 14 16010 - 16186 326 58 aa, chain + ## HITS:1 COG:no KEGG:BF0576 NR:ns ## KEGG: BF0576 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 58 1 58 58 106 100.0 2e-22 MELPEDPMMLFSVINMKLRDCYASLDELCEDMNISKDILIGKLESIGFEYNAEQNKFW >gi|226332042|gb|ACIB01000014.1| GENE 15 16382 - 17962 1296 526 aa, chain - ## HITS:1 COG:CAP0114 KEGG:ns NR:ns ## COG: CAP0114 COG3507 # Protein_GI_number: 15004817 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-xylosidase # Organism: Clostridium acetobutylicum # 22 525 30 530 531 316 37.0 7e-86 MKKYLYVLIAWMLPLCSIHAQAPVWNPDNGNGTFTNPIMWGDWPDPDVIRVGDDFYFVST SMHYVPGCPIATSKDLVNWKMAGYAVDRYDEDARYDLKGGDRYLRGSWAATIRHHNGKFY VGFCTPSWEGEGPGHFSICIADDVKGPWERTIFPEYLYDPGLLFDDDGKVYVFHGQGTLY VTELASDAKSVKGEKVKIWDKRFKNAHEFGGGYGMEGAHAYKINGKYYLTCPTGGTEGWQ ICLRSDNIYGPYEHKLIMNDSGSYPPNGLHQGGMVQLKNGDWWFIIMQDRGPIGRVPHLV PVKWVDGWPMLGSGGKDIITYPKPEVGKTYPFASPATTDEFNTSSLGLQWQWNHNPDNSR 
WSLKERKGHMRLKASYAESLKTARNTLTQRVQGPSSEATVELDVSGLKDGNVAGFGVFEF PYAYVAVEQTNGEKKIVMCNDGQTIETIDRFEGNKIWIRARVMDVGFRAVFYYSTDGKYF LPIGNELSMGLGLVWTANRFALFNFSKEKVGEDGYADFNWFRFTNK >gi|226332042|gb|ACIB01000014.1| GENE 16 17974 - 20757 2581 927 aa, chain - ## HITS:1 COG:CAC2514 KEGG:ns NR:ns ## COG: CAC2514 COG1874 # Protein_GI_number: 15895779 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase # Organism: Clostridium acetobutylicum # 26 370 46 371 982 134 29.0 6e-31 MNMNYRSLTLSILLGMIVHFAVEAQTLPEKQFFHPDRVKYDKDCFTIEGKDIFILSAAFH YFRCPQELWRDRFRKIKEAGFNTVETYVPWNWHERNMPKSVDDYSQCNFDDLKAWLHMAH EEFGLYTIVRPGPFICAEWAGGAYPRWLAKFCPDSYDTSFWLRSNHPEHMKWSEHWYNAV CRVFSEEQLTRKQPGEKGIIMVQLENEYIYFDMESEKKEEFLRVLSDACIRNGIDVPLFT CVTPEVRGSHDPVISQLFDMDNQYVWWNMHEAKSRIEKLKAEQSNAPAFVCELQGGWFST VGGRLSEDSYLDGRHARGMALMAMAGGSTGLNYYMFFGGTHFAGWGARRMTTTYDYGAPL KENGGVGEKYAAVKGIGEVVDKFGGLLVRSRPVRFDVQGADNLTIGIRRAADGTLFVFLL NRDKKQAFRQLVNLTVEGKPMRIDCQLAALDSKLLVVHAGTDTVEWYPREQTLPERPVAL PLPITITDVWRKDEDFRGDWIPLRKGKSLPELGVNDCRYSMYRSQVNLTQKEVDRYGSLV FELFTGDPVYVRVNGEFAERVSKDELDNTFIVSGLLHKGVNEIIAIYENRGHAHGYRPME ELSGVKQAGFGRRQTGIQPIEEWFVKSVGTDEPRLMPAVFAEDAGWEKILLDQQTIDNLA TLQIAGLEKPKWPAAWILQEKAGCAVYRTCIRWTPEMIREGMTVLEFGCIDDAGTLWVNG VEVGTHEEWDKPYVINVAPFIHEGDNEIAMVVSNRSGAGGLLKGVRLKQEFEVIKKLNWE VSTDLGGICQDWNLGTGSTEGWSVVKLSADYPLVRKGELTGTVDGGRDALLTWYRLEFNM PRKNPSVWIPWKLIVNATGTGYMWLNGHNIGRYWEEGPQREFYLPECWLRAGEKNVIILG LRQSETKGACLFGAEVAPYTEDVECME >gi|226332042|gb|ACIB01000014.1| GENE 17 20770 - 23697 1569 975 aa, chain - ## HITS:1 COG:SMb21655 KEGG:ns NR:ns ## COG: SMb21655 COG3250 # Protein_GI_number: 16263752 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Sinorhizobium meliloti # 90 811 57 746 755 216 26.0 2e-55 MVKVTEMFFRIFFFCSMLVFCSFPALAGEAEARVVLNMNTGWAFHRGEVESGGQPGLDDS GWIAAIIPHIMQLEKKHCGGDIIYDGVGWYRRTFRVPSQYKDKQIKISFEGVMNACEVYL NGQKISAHRGGYVGFVTDITTRINWDRDNLLAVRVSAEYDPLTPPGKPQAGMDFYYYSGI 
YRDVEMVISDPLHITHALEEEEVAGGGIFVTYPVVGKEKAVTHVKAHVRNEGKRKRKAQL RTQLIDKSGKIVACQLTPFRLSAGEAIHLEQNLEIVHPSLWHPYDPNLYTLQNEIVENGK VVDCHTESIGIRTIAYTRDGGFYINGESLYLRGANRHQAFAHIGDAAANSMQERDVIDLK RGGCNAVRAAHYPQDPAFLAACDKYGLLVVECIPGWQYFKNDSTFISRLYEVGKQMIRRD RNHPSVILWETALNESRYPAEIARNLYAIAHTEYPGDQMYTAGDYFGHADRVDCFDVFYK QVSRFPKDGDVMSNYPEDQIAVKPLFCREWGDGVGEKPRVSLMENEEEQLKQCRGRFLQL NGHGYFDWCMLDANPRMGGHFLWSYNDYARGAEQETMFCGIVDINRYPKFSYYMMQSMRD KEISQPGLYDGPMVFIASQNTASRYVSSVNEITVFSNCDEVRLFRNHHLIGKQMRKERTP LYRSIVEKGGSPCYVFNAGTYEAGELVAEGIVDGKVVATHSVRTPEQPRQVKIWLKEENI QPVADGSDMIPVYFKVCDSNGTLVNTSDVRIHISVSGEGSLIGDGIERIGINPQLVEGGV GYALIRTTCRPGKIHISVTADGLRGDTREIVTRRYDGVFVPEGYHVPYSGDEEEGVVVKA TAWENVICTKTPLKVVRVEATSEQKRYEASHITDGDDFSWWIADDESQPQIVLLELERSV NVFASRIRFQKDSSTYTHKVEISIDGKSWETLYERECTGWDFKPVQIGKELKYMRLTIEK SSEGAAGLAEVTLYQ >gi|226332042|gb|ACIB01000014.1| GENE 18 23716 - 25284 1397 522 aa, chain - ## HITS:1 COG:no KEGG:BF0622 NR:ns ## KEGG: BF0622 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 522 1 522 522 1061 100.0 0 MKKIIYSLLFAALFSACSTDFLDRNPLDKPSNEAFWRTEKDAMAAATGCYNGWFSMDEVI YADCASDNAYNPFTWEGWAVQAAGTATPTDPGYSYMGYGNMVRYNNFLENIHRPEMNEDL RKRLTAEVRFLRAWDYFQKVTHYGDVPLVTSVLEIKNANLPRTEKAKVVEFILNELKEIA PQLPESYSGSDVGRITRGAALTLKARLELFEHQYDDCMATCSEVMGLGYELFPDYKGLFK IANENNSEVILDVQYVESLYGNWVLGVLPPASVGGWCSINPTQSLVDAFECEDGKTIEES EVYDPKEPYLHRDPRLAVTVLAPGNLYEGKRYDPIDVKDPNGDYYAPYGRSKTGYLVRKY VDDLSDYADMWDCGMNAIVMRYAEVLLMYAECKIELNQIDNSVYDILDDIRTRAKMPVVD RTRYTGQEKLRELLRRERRVELAMEGLRWFDICRWKIGEQVLNGKVYGCLLGTVDPVTGA LSLTNERIFVENRKFDPAKHYLWPIPQTVIDATPAIVQNPHY >gi|226332042|gb|ACIB01000014.1| GENE 19 25297 - 28578 2635 1093 aa, chain - ## HITS:1 COG:no KEGG:BF0621 NR:ns ## KEGG: BF0621 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1093 1 1093 1093 2110 100.0 0 MKTRTRTKMLLLLGVLLFGFTVSAWSQKVSLNFNNEKVEKILSSIKAQTGMGLVFSDQLI DVNRKISIQVKDASLDEALSKLLAGTKVTFEIKNNKIYFIEKKADQQSGSRKKKVSGVVK 
DATGEPIIGANVVEKGVGTNGVITNLDGEFTLEVPENASLIISYIGYLQQDVSTKGKDAF NIIMKEDTKTLDEVVVVGYGVQKKVNLTGAVAAVDSKSLQNRPVTNVSNAIQGLLPGVTV ISGTGQPGSDNTTIRVRGVGTLNNSNPMYVVDGLPVSSINEVDPSDIENISVLKDASSAA IYGSRAANGVILITTKKGGDKVPTLRYDGYVGWQKPTALPEYLHSWEYAKLYNKAMVNEG KNPIYTDEEIEKFRNGSDLDNYPDTDWQGLFYKTGLQHSHRAEISGGTDKMTYMFSAGYL GQDGIIDIAKYDRYSVRGNMNAKMGKFTAGMNLSFTYGEAQEPVSGFTGEMSNIFSQINQ IAPFIPYKYSNGYYGYANDGNPLAFIEEGNLRTTKQHITRAIGNVSYEPIKGLKIQEIVG YEYKSISDEKFIKDIQYYNWKTGEPTKYQGPNNQTDERKNGLKLNLQTLVSYNNTFGKHT VGALAGYEQEYYREDWTKGYRKNFLNNDLWELNAGSPDGQTADGSANEYALRSFFGRLTY DYDNRYLVEANIRRDGTSRIFKDSRWGVFPSFSGAWRIINEPFMEGTRNVLSDLKLRGGW GVLGNQAISYYSYQSVLDQANYSFGGTVVQGVAPVDGANRDLIWETTETLNFGLDMGFLG NQYTLSIEGYRKLTYDILMKLPVSTLYGLNAPYQNAGKVKNTGLEITAGYKLNTHGWNFQ VSANAAYNKNEVMDLKNGGARIWSGKYFNQEGYAINSIGGYIAEGLFKTEEEVANSATIP GTDTAPGDIKYRDINNDGKIDGEDRVYIGNTMPKWTFGLNLFAEWKGLDATFLFQGAADV QGYLAGPGVVGEMIGAKGKPSYMYRDCWDAETNPNGKFPRAFSSYRQNNSIYNPSSFWIV NSSYLRLKNFQLGYTLPKEWCNMMGISRIRVYYSGQNLLTFTKFDKGFDPESPEGGSSYP QVKTNTFGLNITF >gi|226332042|gb|ACIB01000014.1| GENE 20 28674 - 31076 1321 800 aa, chain - ## HITS:1 COG:no KEGG:BF0620 NR:ns ## KEGG: BF0620 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 800 1 800 800 1668 99.0 0 MKYLFTLLMACLSLSTTYGTVHHPSSSVSSGRWVVWAEKPATSWQDAFVTGNGRHGTMVM GQPGSERIICVHEELFIRGWDRHKVAVPSTASLLPEVRRLIESGRSDAADELMTGEADRQ LVAMGAVQRWPLIPHPAFDLCIRYTDTTLQAGNGYRRQLDLETGETSAFWGGRGGVTESV FSSREHNVNVLRLKATGQRKINLVLGLKETPGREGVHFEHNLDSAFSSVSTEAFSGWLSY RAAYKYDAGGYEGLARVSLKGGSMITEGDSLRIADADEVLVLVRITPLENANLSVRPSVQ GELSRLPLDYNTLLLPHSRKHAEMFRRMQLDLGCSSDWKTTSTEKMLSDIHKHGVTPLFL EQIHAMGRYLLISSSGKYPPPLQGIWGGGWKPGWIGGFVWDSNINLAVSAASMGNLHECA ESYIGYVESLLPGWRLNARNYLGCRGFIVAHYNDPESGYLTHFGRSFPWMCWPGGAGWNI RPFYEYAMLTGNEAFLKKHVFPLYREMADFYEDYLTMDGDSLYHICPSVSPENAPPGTDT WLSKDATMDVAIAKEVFRLLLEMGRTFRADKKELAKWNNYLQRLPSYQINDEGALAEWID EAYQDVYNHRHLSHLYPVFPGSQLGKSEGDPRLIHAARIALNKRFAFDTGSAHGLIHVAL QAVRLGDIDKVKTNLDRFSRRHYLYDGLVTSHDPEHQVYNLDAVLSIPRLLMEMLVYTEK 
GKIELLPAWPCDYADGSIKGIKIYGGHTLDITWKAGKLIEAVLYARQNERYEVVCGDVRR HVQLHKGKTYHFSASFFAEK >gi|226332042|gb|ACIB01000014.1| GENE 21 31283 - 32281 741 332 aa, chain - ## HITS:1 COG:PA1364 KEGG:ns NR:ns ## COG: PA1364 COG3712 # Protein_GI_number: 15596561 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 126 331 74 276 280 71 27.0 3e-12 MKIDDAIIDKVLNNEASVEEAGLVAEWFATEEGSEYLSGRLESESARLTEERAREWLDHP VPEERMRERFIGQIKPEKKIVSYRRGLIAAAVLIPFLFLSISLWFLADRTGVFSATEYAE LKVPCGEQMQVVLQDGTVIQLNSDTRLRYPKKFGLFSRSVELWGEGFFVVAKDKKRPFIV DLKGVEVKVTGTKFNVKAYPSEPNVWVTLEEGGVLLKDTKNKEYPLVPGESAEYNRTSGI CQITKPDDMSQISSWRSNSLNFYLTPLRDIIKVMERQYDVHFVVRDSTLLNNRFTLSTSK VNVDDVLRDLEAVSWIRFLQTEDGVFEIQKEK >gi|226332042|gb|ACIB01000014.1| GENE 22 32343 - 32930 376 195 aa, chain - ## HITS:1 COG:all2193 KEGG:ns NR:ns ## COG: all2193 COG1595 # Protein_GI_number: 17229685 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Nostoc sp. 
PCC 7120 # 8 176 18 189 201 74 28.0 1e-13 MNQFLDFDQKLYAELRKGSEHAFVTVFERYNRLLYALAYRYFKSGEEAEDAVQYTFMKLW EQRSSFEFQSGIRSLLFTILKNYIMNELRHRQIVFEKHYEMAQRNEEADDSFLKNFEDKD FREHLRTAIGKLPPQKQKICRLKIEKGLSNQEIADEMHITVPTVKSHYTQAIKILRAEIE SLIVLLHVLWIHFLE >gi|226332042|gb|ACIB01000014.1| GENE 23 33058 - 33297 284 79 aa, chain + ## HITS:1 COG:no KEGG:BF0617 NR:ns ## KEGG: BF0617 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 79 1 79 79 147 100.0 2e-34 MEKYLIHSNELHLIDQERIHQAVEQMVESLDMAAGSTFSFDLYKVVETYFKDLDKRREIN HLLGITDNTYDPTEDFGVC >gi|226332042|gb|ACIB01000014.1| GENE 24 33367 - 34272 958 301 aa, chain - ## HITS:1 COG:VC0480 KEGG:ns NR:ns ## COG: VC0480 COG0668 # Protein_GI_number: 15640507 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Small-conductance mechanosensitive channel # Organism: Vibrio cholerae # 46 297 31 282 287 214 40.0 2e-55 MLLFQTTQQVADSLQVAAEKLDQAIAQADGLDKLGLITQQLIDSGIQAGGHILKAVIVFL VGRFLIRMLNRLVGRVMDKRNVDISIKTFVKSLVNILLTVLLIVSVVGALGVETTSFAAL LASAGVAVGMALSGNLQNFAGGLVILLFKPYKVGDWIEAQSVSGTVKEIQIFHTILTTAD NKLIYVPNGALSSGVVTNYSNQKTRRVEWIFGVDYGEDYNKVEKVVREVLTADKRILNDP APFIALHALDASSVNVVVRVWVESGDYWGVYFDINKTIYAMFNEKGINFPFPQLTVHQAP N >gi|226332042|gb|ACIB01000014.1| GENE 25 34485 - 36704 1978 739 aa, chain + ## HITS:1 COG:no KEGG:BF0565 NR:ns ## KEGG: BF0565 # Name: not_defined # Def: putative TonB-dependent transmembrane receptor protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 739 1 739 739 1429 100.0 0 MKKMIFAGCLMAGASVFAQAGETDSLKVVNLQEVQIVSTRATSKTPVAFTNVSKEELKKQ NFGQDIPFLLSMTPSALTTSDAGAGIGYTTLRVRGTDGTRINITANGIPMNDAESHTLFW VNMPDFASSVKDIQVQRGAGTSTNGAGAFGASVNMQTEGISMQPYAEINASYGSFNAHKE TVKFGTGLLKDHWAFDARLSNIGTDGYIDRASVDLYSFYAQGGYFADNTSVKFITFGGKE KTYHAWNYATKEEMKKYGRRFNSCGMYTDDDGHIRFYKDQTDNYLQMNYQLLLNHTFSAA WNLNAALHYTKGDGYYQEYKEDRSLKEYRLHPFMYDGKEVEKSDLIRQKKMDNHFGGGVF SVNYRNEKLDASLGGALNYYDGWHFGRVIWVKNYIGELLPDHEYYRNKAKKTDGNLYLKA NYNLVAGLNAYADLQYRYINYKIHGDNDKYDYNTDGLQKLAVNDHFNFFNPKAGLNWDID 
SNNRVYASFSVAQKEPTRNNYTDGNADEYPKAEKLYDYELGYTYRNTWLSAGVNFYYMDY KDQLVLTGELNEIGEAMARNVPDSYRTGVELMLGVKPCRWFQWDINGTLSKNRVKNFTEK LYEDEWKNPIEVEHGNTPIAFSPDFILNNRFSFSHKGFEAALQSQYVSKQYMSNAKQAEQ TLDAYFVSNLNLAYTFQLRHVKSVTVGFTIYNLFNEKYENNGYAGSGYTLKDGKPERYNY AGYAAQAGTNVMGNISIRF >gi|226332042|gb|ACIB01000014.1| GENE 26 36708 - 37280 531 190 aa, chain + ## HITS:1 COG:PA1958 KEGG:ns NR:ns ## COG: PA1958 COG3201 # Protein_GI_number: 15597154 # Func_class: H Coenzyme transport and metabolism # Function: Nicotinamide mononucleotide transporter # Organism: Pseudomonas aeruginosa # 1 183 1 181 191 88 30.0 8e-18 MNYLEITGALVGLIYLWLEYRASIYLWIAGIIMPAIYIFVYYQAGLYADFGINIYYLLAA AYGWMVWMRGSEKETRAELPITHTPLKRYLPLLLVFLAAFFGIAWILIWFTDSNVPWLDS FTTALSIIGMWMLARKYVEQWWAWIVVDVVCCGLYIYKELYFTSALYGLYSIIAIFGYFK WKQMMHYESK >gi|226332042|gb|ACIB01000014.1| GENE 27 37267 - 37890 510 207 aa, chain + ## HITS:1 COG:jhp1211 KEGG:ns NR:ns ## COG: jhp1211 COG1564 # Protein_GI_number: 15612276 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine pyrophosphokinase # Organism: Helicobacter pylori J99 # 8 203 1 197 204 120 34.0 2e-27 MKVNDIDMDAVILGNGEYPTHSMPETMLIMAPYVVCCDGSADEHIRRGFTPDAIIGDGDS LSPENKERFRTIFHQIDDQETNDQTKAVHFLLDQGKKTIILVGATGKREDHTLGNISLLI DYMKAGAQVTMLTDHGMFIPASGRNCFKSYPGQQISIFNFNATGLRADGLVYPLSDFSNW WQGTLNEATGTEFTIHAEGDYLVYLNY >gi|226332042|gb|ACIB01000014.1| GENE 28 37844 - 38794 816 316 aa, chain - ## HITS:1 COG:BS_yyaM KEGG:ns NR:ns ## COG: BS_yyaM COG0697 # Protein_GI_number: 16081133 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Bacillus subtilis # 19 270 16 267 305 77 25.0 4e-14 MMKNKKVEANLSMVVSKTFSGLNMNALKYLLPVWVSPLSGVTLRCVFAAIAFWIIGMFVK PEISTRKEKIFLFLLGALGIYGFMFLYLMGLSKTTPVSSSIFTSLQPIWVFVIAVVFFKE KISAMKIAGISLGLGGAILCILAQKSDDLASDALTGNMLCLLSSIAYAVYLVASNRILKS 
VGMFTVLKYTFAGAAFSSIVVSAVTGFHAPVFSGPLHWFPLSVLLFVLIFPTVVSYLLVP IGLKYLKTTVVAIYGYLILIVATIVSLLVGQDRFSWSQTIAIGMICVSVYLVEVAETKEK PVSNSDKPSSLPPHGS >gi|226332042|gb|ACIB01000014.1| GENE 29 39207 - 40508 1322 433 aa, chain - ## HITS:1 COG:PM0115 KEGG:ns NR:ns ## COG: PM0115 COG0498 # Protein_GI_number: 15601980 # Func_class: E Amino acid transport and metabolism # Function: Threonine synthase # Organism: Pasteurella multocida # 1 432 1 424 424 373 45.0 1e-103 MKYYSTNKQAPLASLEEAVVKGLASDKGLFMPMTIKPLPQEFYDEIENLSFREIAYRVAD VFFGEDVPAETLKEIVYDTLNFDVPLVPVKENIYSLELFHGPTLAFKDVGGRFMARLLGY FIRKEGRKQVNVLVATSGDTGSAVANGFLGVEGIHVYVLYPKGKVSEIQEKQFTTLGRNI TALEVDGTFDDCQALVKAAFMDQELNEQLLLTSANSINVARFLPQAFYYFYAYAQLKKAG RAENVVICVPSGNFGNITAGLFGKKMGLPVRRFIAANNKNDIFYQYLQTGQYNPRPSVAT IANAMDVGDPSNFARVLDLYGGSHAAIAAEISGTTYTDEQIRESVKACWQQTGYLLDPHG ACGYRALEEGLQPGETGVFLETAHPAKFLQTVESIIGTEVEIPAKLRAFMKGEKKSLPMT KEFADFKSYLLGK >gi|226332042|gb|ACIB01000014.1| GENE 30 40651 - 41862 1356 403 aa, chain - ## HITS:1 COG:MA0132 KEGG:ns NR:ns ## COG: MA0132 COG3635 # Protein_GI_number: 20089031 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted phosphoglycerate mutase, AP superfamily # Organism: Methanosarcina acetivorans str.C2A # 1 397 1 391 397 368 47.0 1e-101 MKHIIILGDGMADWAVKSLGDKTLLQYAKTPYMDKLARMGRNGRLITVADGFHPGSEVAN MSVLGYNLPKVYEGRGPLEAASIGVDLQPGEMAMRCNLICTKGDILKNHSAGHITTEEAD VLIQYLQEKLGDDRVRFHTGVQYRHLLVVKGGNKQLDCTPPHDVPLKPFRPLMVKPLVPE AEETASLLNELILKSQELLKDHPLNLKRMAEGKDPANSIWPWSPGYRPQMERLSDTFPQV KRGAVISAVDLINGIGYYAGLRRIAVEGATGLYDTNYENKVAAALEALKTDDFVYLHIEA SDEAGHEGDVALKLKTIENLDSRAVGPIYEAVKEWEEPVAIAVLPDHPTPCELRTHTNEP VPFFIWYPGIEPDSVQTFDEVAAVEGSYGLLKEDEFIKEFMNR >gi|226332042|gb|ACIB01000014.1| GENE 31 41874 - 44309 2765 811 aa, chain - ## HITS:1 COG:MJ0571 KEGG:ns NR:ns ## COG: MJ0571 COG0527 # Protein_GI_number: 15668751 # Func_class: E Amino acid transport and metabolism # Function: Aspartokinases # Organism: Methanococcus jannaschii # 3 454 4 467 473 277 39.0 7e-74 
MKVMKFGGTSVGSVNSILSVKKIVESAGEPVIVVVSALGGITDQLISTSRMAAMGDAAYE GAYREIVRRHEEMVQGVIPAGETQTLLHYQVNELLDELKDIFQGIYLIKDLSPKTSDTIV SYGERLSSLIASRLIQGAVWFDSRTFIKTEKKHNKHTLDTELTNRLVREAFKEIPRVSLV PGFISSDKVSGDVTNLGRGGSDYTAAVIAAALDADSLEIWTDVDGFMTADPRVISTAYTI SELTYVEATELCNFGAKVVYPPTIYPVCHKNIPILIKNTFNPDARGTVIKQHVDHTKSKA IKGISSINDTSLITVQGLGMVGVIGVNYRIFKALAKNGISVFLVSQASSENSTSIGVRNA DADLACEVLNEEFAKEIEMGEISPIQAEKNLATVAIVGENMKHTPGIAGKLFGTLGRNGI NVIACAQGASETNISFVVDSKSLRKSLNVIHDSFFLSEYQVLNLFICGVGTVGGSLVEQI RQQQKKLMVENGLKLHVVGIIDATKAMFSRAGFDLANYREELKEKGVDSSLDTIRDEIIG MNIFNSVFVDCTASPDIASLYKDFLQHNISVVAANKIAASSAYENYRELKLIARQRGVKY LFETNVGAGLPIINTINDLIHSGDKILKIEAVLSGTLNYIFNKISADVPFSRTIKMAQEE RYSEPDPRIDLSGKDVIRKLVILAREAGYKLEQEDVEKNLFVPNDFFEGSLEDFWKKVPS LDADFEARRKVLESENKHWRFVAKLENGKASVGLQEVDRNHPFYGLEGSNNIILLTTERY KEYPMMIQGYGAGAGVTAAGVFADIMSIANV >gi|226332042|gb|ACIB01000014.1| GENE 32 44803 - 45852 949 349 aa, chain + ## HITS:1 COG:ECs2474 KEGG:ns NR:ns ## COG: ECs2474 COG0252 # Protein_GI_number: 15831728 # Func_class: E Amino acid transport and metabolism; J Translation, ribosomal structure and biogenesis # Function: L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D # Organism: Escherichia coli O157:H7 # 7 346 5 337 338 280 45.0 3e-75 MNPSDTSILLIYTGGTIGMIENPETGALENFNFEQLQKHVPELQKFTFPIDSYQFDPPMD SSDMEPEAWRKLVHVISEHYHQYTGFVILHGTDTMAFTASALSFMLEGLDKPVILTGSQL PIGVLRTDGKENLMTSIEIASARDRSGNPMVPEVCIFFENRLMRGNRTTKMSAENFNAFR SFNYPVLAEAGIHIKYNNVQIHIEGEKRELHPHYLLDTNIAILKLFPGIQENVVAATLAI EGLKAVVLETYGSGNASRKEWFLRRLRDASERGVVIVNVTQCSAGTVEMERYETGYHLLK AGIVSGHDSTTESAVTKLMFLLGHGYSPDEVRRRMNESMAGEISIDLSK >gi|226332042|gb|ACIB01000014.1| GENE 33 46511 - 47878 1175 455 aa, chain + ## HITS:1 COG:BS_sms KEGG:ns NR:ns ## COG: BS_sms COG1066 # Protein_GI_number: 16077155 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Predicted ATP-dependent serine protease # Organism: Bacillus subtilis # 1 455 1 457 458 474 51.0 1e-133 
MAKEKTVYVCSNCRQESPKWVGKCPSCGEWNTYVEEIVRKEVPNKRPVSGIESPKAKPVT LSEIEADEEPRIDMHDEELNRVLGGGLVPGSLVLIGGEPGIGKSTLVLQTVLHMPERRIL YISGEESARQLKLRADRLTRTSSDCLIVCETSLEQIYVHIKNTRPDLVIIDSIQTISTES IESSPGSIAQVRECSASILRFAKETHTPVLLIGHINKEGSIAGPKVLEHIVDTVLQFEGD QHYMYRILRSIKNRFGSTAELGIYEMRQDGLRQVSNPSELLLSQDHEGMSGVAIASAIEG VRPFLIETQALVSSAVYGNPQRSATGFDIRRMNMLLAVLEKRVGFKLAQKDVFLNIAGGL KVNDPAIDLAVISAILSSNMDAAIEPEVCMAGEIGLSGEIRPVNRIEQRIGEAEKLGFKR FILPKYNMQGIDTKKLRIELVPVRKVEEAFRTLFG >gi|226332042|gb|ACIB01000014.1| GENE 34 47934 - 48305 316 123 aa, chain + ## HITS:1 COG:no KEGG:BF0604 NR:ns ## KEGG: BF0604 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 123 1 123 123 219 100.0 3e-56 MEINELTGLILKKAYEVHTVLGPGLLESAYEECLCYELIQCGLNVEKQKALPVVYKNIKV DAGYRLDIVVNNAVIIEIKAVEELHPVHTAQLITYLKLSGIKYGLLINFNVRSLKEGIRR YIV >gi|226332042|gb|ACIB01000014.1| GENE 35 48350 - 49939 1311 529 aa, chain + ## HITS:1 COG:L195271 KEGG:ns NR:ns ## COG: L195271 COG2509 # Protein_GI_number: 15673161 # Func_class: R General function prediction only # Function: Uncharacterized FAD-dependent dehydrogenases # Organism: Lactococcus lactis # 1 526 1 528 535 362 40.0 1e-99 MIQEYQLRILPEIAASEQQLKAYLSQEKGLNQRDITALRTLKRSIDARQRTIYVNLKVRV YLNEMPKDDEYEHTIYNNVEGKPQVIVVGAGPGGLFAALRLIELGLRPVVVERGKDVRER KKDLAQISREHRVDPESNYSFGEGGAGAYSDGKLYTRSKKRGNVDKILNVFCQHGASTAI LVDAHPHIGTDKLPRVIENMRNTIIECGGEVHFKTRMDALIIEQGEVKGIETNTGETFLG PVILATGHSARDVYRWLAANNVTIEAKGIAVGVRLEHHAGLIDQIQYHNRSGRGKYLPAA EYSFVTQVDGRGVYSFCMCPGGFIVPAASGPEQVVVNGMSPSNRGSRWSNSGMVVEIQPE DIINDKRLTVNNEAEETFPELAVLHFQEELERQCWLQGGRRQTAPAQRMVDFTRKKLSYD LPESSYSPGLISSPLHFWMPEFIAGRLSQGFQQFGRSSHGFLTNEAVMIGVETRTSSPVR IVRDKDTLQHITLRGLFPCGEGAGYAGGIVSAGIDGERCAEAVAQFMNP >gi|226332042|gb|ACIB01000014.1| GENE 36 49944 - 50543 525 199 aa, chain + ## HITS:1 COG:BMEI1582 KEGG:ns NR:ns ## COG: BMEI1582 COG2197 # Protein_GI_number: 17987865 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulator containing a 
CheY-like receiver domain and an HTH DNA-binding domain # Organism: Brucella melitensis # 134 191 149 206 213 65 60.0 5e-11 MMNHQPEIAIVEANTLTSLGLKTILERMIPMAVIRTFHSFGELTDDTPDMYAHYFISAQI YVEHNAFFLPRKRKTIVLAGDSHQFQLSGVPILNIYQAEEQLVKDILKLHQHAHHDGYPI KDMPPTPPTTGHELSAREIEVLVLITKGLINKEIADKLNIGLTTVITHRKNITEKLGIKS VSGLTIYAVMNGYVEADRI >gi|226332042|gb|ACIB01000014.1| GENE 37 50654 - 53041 2249 795 aa, chain + ## HITS:1 COG:YPO1011 KEGG:ns NR:ns ## COG: YPO1011 COG1629 # Protein_GI_number: 16121312 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Yersinia pestis # 32 795 31 690 690 181 25.0 6e-45 MKKNSILIALALFSAGGTWAEELPKDTLKVIDVEEIVVIAAPKENRKLREQPTAVTMLSQ QDMQANQVNSIKSLTALVPNIFIPDYGSKLTSAIYIRGIGSRINTPSVGLYVDNVPYIDK SAFDFNYADIERIDVLRGPQGTLYGRNTMGGLIKVHTKSPFSYQGTDLRLSAGTYDNYNA SVTHYHRMSNQFAFSTGAFYEYGGGFFKNTYLNKKIDKSQAAGGRFRGIYLPSENMKLDL NVSYEYSDQGGYAYGAYDKHTGTYLKPAYNDPSSYYRNLLNAGLNLEYQGDHFTLSSVTG LQHLRDRMFIDQDFSPANIYVLEQKQKLTTISEEIVLKSKSGRRWEWTTGAFGFYQWLTT NGPVTFKEDGVTEMLEKGINSHFPDLSAMGMKMNLNITNPTLLVDGRFHTPILSGAVYHQ STFRDLLIEGLSATVGLRLDYEKNWMKYNSGSTIDYQFNMTSPFMPVRLEQTSSPRLNGK FSNDYLQLLPKFALQYEWKKGNNVYATVSRGYRSGGYNIQMFSDLIQGNMQNDMQGQIKA GTKQIFDRLVTQGMPQAIADRILSNIPDAGENADPKASTIYKPEYSWNYEVGSHLTLWEG KLWMDLAAFLMDTRDQQIAQFAKSGLGRITVNAGKSRSYGAEAALRANLTDALSMNASYG YTYATFTDYKTIDRGQNEISYDGNYVPFVPKHTLNVGGQYIFRIAPRHWLDHVQVNVNYN GAGRIYWTEQNDVSQSFYGTLNSRVSLAKGNGQIDFWVRNALDKKYATFYFESMGNGFMQ KGRPVHFGVDIRCRF >gi|226332042|gb|ACIB01000014.1| GENE 38 53194 - 54168 693 324 aa, chain - ## HITS:1 COG:no KEGG:BF0600 NR:ns ## KEGG: BF0600 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 324 1 324 324 636 100.0 0 MREKVRFCVALCCSVVMVLLASCRYSLPDLPAEGMSRKTKDSLTYLSKYHYTWNTNLEVL DDSVRLEYLPLKDAYVNLYKGDRVVVAEFSVHPQDSVDSIWVKVAHSQEVQGWVRNKELV RSFVPTDSISQFIHLFSDTHASYFVFIFALFVGVYLLRAFMKKRLQMVYFNDIDSVYPLF LCLLMAFSATVYETMQVFVPDTWEHFYFNPTLSPFKVPFILSVFLTGIWLFIIVTLAVLD 
DLFRQLTPAAAVFYLLGLMSCCIFCYFFFILTTHIYIGYFFLACFIWLFVKKVRRGSGYK YRCGNCGQKLREKGQCPHCGAVNE >gi|226332042|gb|ACIB01000014.1| GENE 39 54165 - 55991 1745 608 aa, chain - ## HITS:1 COG:ZydcP KEGG:ns NR:ns ## COG: ZydcP COG0826 # Protein_GI_number: 15801708 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Collagenase and related proteases # Organism: Escherichia coli O157:H7 EDL933 # 2 605 17 632 667 529 46.0 1e-150 MIKQRKIELLAPAKNLECGIAAIDHGADAVYIGAPKFGARAAAVNSLEDIAALVEYAHLY NARIYVTVNTILKDEELQETEKMIWALFRAGVDALIVQDMGITGLNLPPIPLHASTQMDN RTVEKVRFLADAGFRQVVLARELSLREISKIHEACPDVPLEIFVHGALCVSYSGQCYVSQ ACFGRSANRGECAQFCRLPFSLVDAEGRVIVKDKHLLSLKDLNQSDELEALLDAGASSFK IEGRLKDVSYVKNVTAAYRRKLDAIFARRKEYARASSGSCRYAFNPQLDKSFSRGFTHYY LHGRTKDVFSFDTPKSLGEEMGTMKEARGNYLTVAGLKSFNNGDGVCYIDEQGRLQGFRI NRVEGNKLYPQEMPRIKPRTVLYRNFDQEFEKILARKSSERRIAVSVRLTDTPFGFALTL TDEDDNSVTLSLAREKEPARTPQEENLKTQLAKFGNTPFEAVRIDIDFAGNWFLPASVLA DFRRQAVEKLISARRINYRRELFVLKPTAHAFPQSTLTYLGNVMNGQAVSFYAGHGVASI APAFERAPAEKAVLMFCKHCLRYSMGWCPVHQRERSPYREPYYLVSTDGKRFRLEFDCKN CQMKVNAV >gi|226332042|gb|ACIB01000014.1| GENE 40 55988 - 56905 833 305 aa, chain - ## HITS:1 COG:CAC1825 KEGG:ns NR:ns ## COG: CAC1825 COG1897 # Protein_GI_number: 15895101 # Func_class: E Amino acid transport and metabolism # Function: Homoserine trans-succinylase # Organism: Clostridium acetobutylicum # 1 301 1 301 301 386 57.0 1e-107 MPLNLPDKLPAIELLKEENIFVIDNSRATQQDIRPLRIVILNLMPLKITTETDLVRLLSN TPLQVEISFMKIKSHTSKNTPIEHMKTFYTDFDKMREDRYDGMIITGAPVEQMDFEEVNY WDEITEIFDWARTHVTSTLYICWAAQAGLYHHYGIPKYALDKKMFGIFKHRTLLPLHPIF RGFDDEFYVPHSRHTEVRKEDILKVPELTLLSESDDSGVYMVVARGGREFFVTGHSEYSP LTLDTEYRRDVSKGLPIEIPRNYYVNDDPDKGPLVRWRGHANLLFSNWLNYFVYQETPYN IEDIR >gi|226332042|gb|ACIB01000014.1| GENE 41 57065 - 57235 161 56 aa, chain + ## HITS:1 COG:MA3446 KEGG:ns NR:ns ## COG: MA3446 COG2768 # Protein_GI_number: 20092258 # Func_class: R General function prediction only # Function: Uncharacterized Fe-S center protein # Organism: Methanosarcina 
acetivorans str.C2A # 3 52 181 231 360 58 52.0 2e-09 MAYVISEDCIACGTCIDECPVGAISEGDIYSIDPEQCTDCGTCADVCPSEAIHPAE >gi|226332042|gb|ACIB01000014.1| GENE 42 57318 - 60053 1978 911 aa, chain - ## HITS:1 COG:no KEGG:BF0546 NR:ns ## KEGG: BF0546 # Name: not_defined # Def: putative exported rhamnosidase A # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 911 1 911 911 1894 99.0 0 MKNIFFICLFLSMSLAVSGKKAQVSISCLKTEMLVNPQNIDCPSPRLSWEILSDVRDVRQ VSYHILVSTSLEKLNREEGDLWDTGDVASDQSAYISYGGDMLDSRTQCYWKVRVKTNKGI SAWSEPASWEMGLLHPSDWQAVWIGRAFPQDKLEGNTRVPARYLRKPFNLNSKKVRKATL YICGLGFYEAYINGKKIGDQVLAPTPTDYSKSVKYNRFDVTGQLLKGANAIGVILGNGRY TSMRMPGVRHFDVPKMIAQLEVYYEDGEKRVIASDASWKITAEGPIGTNNEFDGEEYDAR KEMPGWNTYPFDDTKWLQAEVVSLPGGKLEAQLNRNMKVMDTVKPIGITESAPGVYILDM GQNMVGWLRMKVKGQSGDTLKLRFAELLQKDGSIYTANLRTAHSADTYILKGNSMEEWQP IFTYHGFRFVELTGFREKPSLSDFEGQVIYDEMETTGNLETSDPMINRIYKNAYWGIRGN YRGMPTDCPQRDERMGWLGDRAVGSQGESYIFNNHLLYAKWLDDIEQAQKENGAVPDVAP NYWDVCTDNMTWPGAYLIIANMLYDQFGDKQPIIKHYPSMKKWMRYMKDKYMVDHIMTKD NFGDWCMPPESPELIHSKDPSRITEAAVLGTTFYYYLSNLMVRFAALAGYPQDAEDFRKE SELVKEAFNSKYLHTELGYYSNNTVTANILSLRFGMVPKAYKEVVFRNIVEKTMKDFNGH VSTGLVGIQQLMRGLSDYGRIDLAYRIATNRTYPSWGYMVDNGATTIWELWNGNTADPAM NSANHVMLLGDLIVWAYGYLGGISNAPGSVGFKQIQLKPYPVDGLDFVNTSFHSIYGEVR SHWKKEDGSFCWDISVPCNTSALVYVPVADKSIDPKEKKRIVGEGGTFLRMEGSYAVFSF PSGSYRLKTRL >gi|226332042|gb|ACIB01000014.1| GENE 43 60183 - 61376 1336 397 aa, chain - ## HITS:1 COG:BMEI0516 KEGG:ns NR:ns ## COG: BMEI0516 COG0436 # Protein_GI_number: 17986799 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Brucella melitensis # 1 397 22 421 421 376 49.0 1e-104 MNQLSDRLNSLSPSATLAMSQKSAELKAQGVDVINLSVGEPDFNTPDHIKEAAKKAVDDN FSRYSPVPGYPALRNAIVEKLKKENGLEYTAAQISCANGAKQSVCNTIMVLVNPGDEVIV PAPYWVSYPEMVKLAEGTPVIVTAGIEQDFKITPAQLEAAITPKTKALILCSPSNPTGSV YSKEELAGLAAVLAKYPQVIVVADEIYEHINYIGKHQSIAQFPEMKDRTVIVNGVSKAYA MTGWRIGFIAGPEWIVKACNKLQGQYTSGPCSVSQKAAEAAYTGSQAPVEEMRQAFERRR 
DLIVKLAKEVPGFEVNVPEGAFYLFPKCSSFFGKSAGDRKIENSDDLAMYLLEDAHVACV GGTSFGAPECIRMSYATSDENIVEAIRRIKEALAKLK >gi|226332042|gb|ACIB01000014.1| GENE 44 61459 - 62673 1331 404 aa, chain - ## HITS:1 COG:BH1556_2 KEGG:ns NR:ns ## COG: BH1556_2 COG0807 # Protein_GI_number: 15614119 # Func_class: H Coenzyme transport and metabolism # Function: GTP cyclohydrolase II # Organism: Bacillus halodurans # 209 402 1 194 197 244 59.0 2e-64 MEPIRLNTIEEAIADFKEGNFVIVVDDEDRENEGDFIIAAEKITPEKVNFMLTHGRGVLC APITEERCAELELDMQVSSNTSIYETPFTVTVDLLEGCTTGVSMHDRAMTIRALADPKTK PADLGRPGHINPLRARSRGVLRRAGHTEASVDLAKLAGLYPAAALIEIINEDGTMARLPQ LVEVARRFGLKIISIKDLIAYRLQMESIVDRGVEVDMPTQFGHFRLIPFRQKSNGMEHIA LIKGTWDTDEPILVRVHSSCMTGDIFGSCRCECGEQLHKAMEMIEAAGKGVIVYMNQEGR GIGLMNKIAAYKLQEEGYDTVDANLHLGFDADERDYGVGAQILREIGVKKMKLMTNNPVK RIGLEAYGLEITENVGIEIKPNPYNERYLKTKKDRMGHTLHFNK >gi|226332042|gb|ACIB01000014.1| GENE 45 62678 - 64588 1169 636 aa, chain - ## HITS:1 COG:TM1735 KEGG:ns NR:ns ## COG: TM1735 COG0795 # Protein_GI_number: 15644481 # Func_class: R General function prediction only # Function: Predicted permeases # Organism: Thermotoga maritima # 11 184 17 194 1074 68 27.0 4e-11 MLRIKRLDIFIIKSFLLLFVGTFFICLFIFMMQFLWKYVDELVGKGLEMSVLAQFFFYSA LSLVPMSLPLAVLLASLITFGNFGERFELLAMKAAGISLLKIMRPLIVLVFAICCVSFYF QNVIGPQAQAKLGTLLISMKQKSPEVDIPEGVFYDEIDGYNLKVQRKDRKTGMLYDVIIY DFSNNFDNARIIVADSGRLEMTADKQHLYLHLYSGEMFENLKAQSMSSKNVPYRRESFRE KHSIIQFDSDFNMADASIMSNQSTTKDMIKIQASIDSMTVLADSIGRQYFVEASKGPYRT AVGLTKEDTLKMQEAQIRDYNVDSLFEAATLMNKQKIIASAVGRTENLSSDWGFKSFTMT QNDFSIRKHKIEWHRKITISLSCLLFFFIGAPLGGIIRKGGLGMPVIVSVLTFIIYYIID NTGYKMARDGKWIVWMGMWMSSAILAPLGYFLTYKSNKDSVVLNTDVYISWFKRVFGVRS VRHLSKKEVIIHDPDYQRLPFDLNGLSEECRAYMQKNRLAKAPNYFSLWMSGGQDQEIIA INNRMEALVDEMSNTRSLILLQKLEKYPIIPVNAHVRPFHNYWLNMAIGIVIPIGLFFYF RIWAFRIRLNKDMERIIALNRDVELTIKDINNENKI >gi|226332042|gb|ACIB01000014.1| GENE 46 64665 - 65051 204 128 aa, chain - ## HITS:1 COG:no KEGG:BF0592 NR:ns ## KEGG: BF0592 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 128 1 128 128 238 99.0 7e-62 MKKEKIHLEYLLNATSKNILWSAISTPTGLEDWFADKVVSDDKTVTFCWGKTEQRQAGIV AIRAYSFIRFHWLDDENERDYFEIKMSYNELTGDYVLEITDFSEADEEDDLKELWDSQVS KLRRTCGF >gi|226332042|gb|ACIB01000014.1| GENE 47 65926 - 66582 498 218 aa, chain + ## HITS:1 COG:no KEGG:BF0591 NR:ns ## KEGG: BF0591 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 218 1 218 218 408 99.0 1e-113 MEIFRNLTVVLAITTLVACNDVTIEGSWVESVPGISNLKQGFTLEANGSASSINMATLKY EKWKKEGNLLLLSGISIGNHQSISFTDTLTVEQLTQDSLILKKGELVLRYSKTNEIPDEE AIPTPDTPTQKLLSVKGKLIIGHETRSFTSEGDSTDYWIVDKTGELLQKYDGETKGIKNG TPVYVELEVIDMGKSDEGFAADYAGVYHVMKINKITVK >gi|226332042|gb|ACIB01000014.1| GENE 48 67538 - 68293 446 251 aa, chain - ## HITS:1 COG:no KEGG:BF0590 NR:ns ## KEGG: BF0590 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 251 1 251 251 501 99.0 1e-141 MKICSCLNKGVMIAVAVVCSAGLQSCLDDNDKYYYVEGFPNALVTVKNATDGSFFLQLND STTLLPTNMSSSPFGDKEVRALVNYDEVSESGGRYDKAVQVNWIDSILTKSIAPDLGSEN DVVYGTDPVEIVNDWVTIAEDGYLTLRFRTKWGDYNKAHFVNLLLGQNPSDPYEVEFRHN AYGDTYGTVGDGLVAFKLVGLPDTNGKTVPLTLKWKSFSGDKSAVFDYCTQKTVTPAKPA IVSVRSNLNLK >gi|226332042|gb|ACIB01000014.1| GENE 49 68399 - 71347 1982 982 aa, chain - ## HITS:1 COG:no KEGG:BF0589 NR:ns ## KEGG: BF0589 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 982 1 982 982 2041 99.0 0 MRHLFLLAALFCACISYAQIRVQERLPKSKTGFALTTAKTRAKIYYDVNDALVIKRSAEL FARDIQMVTGQKPELIPKRERAKALVIVGTIEKNQWICELAQKGKIDIRPLQGAWERYLI QTVDNPSPGVAKALVIAGSDRRGAAYGLFSISEMMGVSPWYWWGDVPVKTHKALYVDAPP TYSKTPSVKYRGIFLNDEDWGLKPWAAKTFEKERGNIGPRTYAKICELLLRLKANHLAPA MHPVSTAFYQIPENKLVADTFAIVMGSSHCEPLLLNTASEWHSKTMGPWDYNANKDKINE VLGNRVKENCAYENVYTLALRGLHDAAMGGGDVPMKEKVKMLENALKDQRSLLTRHIDKP AETIPQAFTPYKEVLEIYSNGLELPDDVTIIWADDNFGYMKRLSGPQEQKRTGRAGVYYH ISYLGVPHSYLWFSTTPPALMYEELRKAYDTTADRIWLANCGDLKGAEAQVSFFLDMAYD IDQFNENNVHTYPARWLAKIFGEQYYDTLKDITCSHINLAFSRKPEYMGWGYWNNYWGGG 
EKRTDTEFSFINYNEAGRRLAEYRRIGKKAEEMLATVDKKAKPALYQLLYYPVKGAELMN RMNMTGQLYRQYVRQKRAAADDLKRETTTCHDSLEIITDGYNSLLDGKWKYMMSLRQNYD GSSSYFMLPLMEESYVATGDPKLAVQVESEQLNRGGFSFRALPVFNTYSRKSHWIDVYNQ GGGMLDWKSELSDEWIVLSRQSGSTRTEDRIQVSIDWNKVPSGEKVTGFIEFSSGMQKER VLVSVFNPKTPVRDELQGLFIEENGYVSLPATAFHRKFESEDVKMSVLPGLGFEGAALQI GSPIAPLQMYRSSEVPRVEYDFYTFNAGMVDVYTYVLPTFPLHADRDYKLPEHTNSDTKY SVRIDDGSISTPSTSAIEYSQIWYDSVLKNCRVNKSTLYVKAPGKHTLQIRCGDPGTVIQ KIVIDMGGLKRSYLGPETTLCR >gi|226332042|gb|ACIB01000014.1| GENE 50 71399 - 71584 67 61 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVKPVLCLKDIAIGLLGSPVIDFTITPVNPIHSQTETYDMFQFVSVWDIFMRNSFCSKLE K >gi|226332042|gb|ACIB01000014.1| GENE 51 71520 - 73913 2011 797 aa, chain - ## HITS:1 COG:no KEGG:BF0588 NR:ns ## KEGG: BF0588 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 797 1 797 797 1658 99.0 0 MLFKRIRVTILLLGLLSFSLRVSAQIVLKHSDWEWKISETGCAEQLIFKGGKRNDTIPFF REGEHAGPSFYAKREGKEVRGSWIPDGYASYRSEIDGVLCRISYIKDHGQPALRVKLTNN SPVPYQPQKAGLKLGIDTYMDKFPDWFGKYFPTLMRNEKTHFYGYLQTPSGHTLGLVSQQ PVASWSVDYNLGYQDPAPFWFMGHRIESLNLDLLNELPLPARHPQNLYELKQGESKEWIF TFVNVGNLDNLEHAIARVSDIPLIDIRQTSHAAREEASFTLTADNPNVKVTNDAGKELPV VLTKTKGNRWIGKVRLEDAGLYTLSVRSGNKVAEAIWTVHHPWQWVMEKARENAARYHQK PTSHAESWYGFYSAFLAARYFPNESLDKQLSNYFDRLYNKLHDSVKVEPLYFKTRIQNTS TTIGMLVDKYEAQGDLEDLKKASKLADWMIATSQRENGAYYNHGTVYTSVIYIAKSVLEL AVLERKLGEQDLFWRTCADRHFLSAKKAVDQLVASQGDFQTEGELTFEDGMISCSALQIG MMGVIEQDAVARKYYTDAMLKILNSHDCLTQLRVPDGRRRQGTMRYWEAQYDVQMLPNMF NSPHGWSGWRAYATYYAYLLTGDEKWLEQTFNAMGAFANLIDYKTGQLRWAFVVDPHLEV EQACSADTKLDFSDLSFGNPHPKLYDTRKFVIGEQYVNMISDWQTVNTQDNDVHELFKCI GEAVLTNAFVIERPNGEVVGYNCRVTRKGNTLTVKADEKQIVNLHCNLKHSFSVSFDGKT CSLPEGYCNWAFGQSGY >gi|226332042|gb|ACIB01000014.1| GENE 52 74037 - 75524 1276 495 aa, chain - ## HITS:1 COG:no KEGG:BF0587 NR:ns ## KEGG: BF0587 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 495 1 495 495 1033 100.0 0 MKKIIYIFSLFITVIGLLGSCINLESESYDSINTTIFPTNADDADALVIAAAYGPFRADG 
YSGLFQCAKGGLASNTDMSTDLLDCKWGDSWWPAVLQLNFTATSDIPTSFYGTWANHIGK MTLTLDRISGIDMKEEEKTRLIAETRCGRGWLAYILYDLYGPIQIPSLEVLQNPTQKVIV PRSSKEETVKLIEDDLKAAAEVLPAKYSKSDENFGRFTKGLAYTVLMKLYMHEKEWGKAV ECGREVMKCGYSLVTNYKDIFTLDNEGNDEMIFSCIETRGVNEQMWHAHVLPSNYPTTNP NIQKWNGYRMPWQFYHSFDPKDKRLEVICSEYVGTDGVTYNETNPGEYLDKGALPIKYGE DPTQTSENSEVDVVVYRYADVLTLMAEALARSNNAVTQEAVDRLNDVHTRAGLAAYQLSD FTSLDDFLGAVLKERGHELWGEGCRRSDLIRYGLYIDYAIKYKGSTTAKEYMNLMPLPQS VITESSGQVIQNEGY >gi|226332042|gb|ACIB01000014.1| GENE 53 75548 - 78961 2631 1137 aa, chain - ## HITS:1 COG:no KEGG:BF0536 NR:ns ## KEGG: BF0536 # Name: frrG # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1137 1 1137 1137 2182 99.0 0 MNKNRSIRVAAFCLGLMLAGSLQVHAQKVTFRDGTVSLKQAFEKIEASSKYKIAYNDSHL NVSRKVSLNQKETEVLEILATILKDTGYTYKKNGNYIVIVPVEQKPASKVKKVKGVVKDA SGEAIIGANVLVKGTATGVITDMNGSFELEVPGNAVLQITYIGYVQQDVAVKNRDQLAVL LKEDTKTLDEVVVVGYGTMKKKDLTGAVASVKMDDTPLSTISTVSHALAGKAAGLQVNTI SAQPGGGTTFRIRGAASTGAGNDPLIIVDGFPVSNAGNVSVGYNSDNGTTDNILASINPN DIESIEVLKDASSTAIYGARAGSGVIIITTKRGKEGKPKVTYSGSATVQTMATKYEMLDA QDFMIQSNRWFKEKWMYDNKVGIYGGKNESEASSAYHPKYSDADIANPVNDTDWYDRITR TGFQTQHNISINGGTEYTKYLISGNFFNQKGIVKNNGMSRYTGRVNLDQKLSKYAKVGIN LTVSRNTLDNVPLGAGQNEYASILVSAAQFSPLLSVKDENGDYSLNQQAAYIPNPVSLLE ISDQTTKERFLATPFVEIKPINELTLKASFGIDRNYQRREVYMPKTTLYGEKADGRADIG QYDRSDYLLELTANYAKRLGDHNLNALVGYSFQRFTSKYLNAGNQGFLTDAFLFNNLGAG TYEKPWVGSSASKSEMASFFGRVNYTYKDRYLVTATLRADGASNFAKNNRWGYFPSVALG WRFTEENFARSLNLDKVLSNGKLRLSYGQTGNSNIGDKSVTYYGTGYNKVFGNKEYTGVY VSQIGNPDLKWETTTEWNVGLDLGFLNNRFNVTAEYFHKVVSDLLSSRSVLSYNEVGSIA ANIGKTQSQGFELTINTKNFDTKDFSWNTDFTFSFYRDKWKERDENWSPNPYDMYNAPLR GYYMYLSDGLIQPGEEVSWMPGALPGQVKLKDIDGYAYNEDGTYKTDKHGIPLRTGKPDG KIDYADAVFKGCSDPGYLLGLNNTFRYKNFDLNVYFYGQLDLLKSGSYKDYWIVGSGTMT GVNNLYRGYNMPTSAKEVWSHDNTSGKMPGYFQYMSNYGYGDYFLEKSWFIRCRNITLGY TLPVKSAKKLVSNVRIYADVNNLFTITPYDGLDVETDDSYWAYPNVRSFSIGLDITF >gi|226332042|gb|ACIB01000014.1| GENE 54 79080 - 80090 609 336 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 
# Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 126 282 119 272 331 70 32.0 3e-12 MGYLKNIITYFFHHPASDGVVERVHQRLADTNSGQEKEEVLSGIWEQIGFPQADEHQTLR AFEKLEQQIGGDSLKSESSFSRFRIPRWSWIAASIIVPLLLLFGSAYLYKETLIIKNELS NVTFIQYYVSNGKREQVTLPDRSKVWLNSGSLLIYPSAFIGNEREVYLAGEGYFSVTKDK ECPFIVKTNSVSVSVLGTEFNINAYPNIDKVVTTLEEGSIRMSLNRFDSSYLLEPDDQIV YIPSTGHIERKRVKASDYSDWRGGGLYFSNSPFKEVIQTIERTYSVQVHLQTSIYQSNNL TIHFYPNESIENIMMLIKEMIPGLEYQIEGKDIYID >gi|226332042|gb|ACIB01000014.1| GENE 55 80144 - 80707 458 187 aa, chain - ## HITS:1 COG:PA0149 KEGG:ns NR:ns ## COG: PA0149 COG1595 # Protein_GI_number: 15595347 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 27 178 17 168 181 77 32.0 2e-14 MHSEIAEEKLLVSQFSQGSHIAFKALFMRYYPKVRSFIMGLVKSESEAEDLTQEVFLKLW THREKFCEVEVFGTYLYVLTKNTTFNYLRSRQNRQDSQGTEWVEESTDTTPYEELVAKDL QLLIDMVVENMPPQRQMIFRLNREAGLTNAEIAEKLQISKKTVENHLNLALKELKNALLF FIFLYLC >gi|226332042|gb|ACIB01000014.1| GENE 56 80866 - 82305 1062 479 aa, chain - ## HITS:1 COG:SA1322 KEGG:ns NR:ns ## COG: SA1322 COG0642 # Protein_GI_number: 15927072 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Staphylococcus aureus N315 # 66 477 195 584 588 158 27.0 3e-38 MNIKTKLLFGIGILAGMIILLVTLSVVNLQILTATEPDSPVAMPALERALLWISVTGGIC ILTGLVLLIWLPRSINRPVKELTCGILEIANHNYEKRLDMRGYEEFREVSDSFNRMAEKL TEYRDSTLADILSAKKFLEAVVNSIHEPIIGLNTEREILFVNNEALNVLNMKRENVIRKS AEELSLKNDLLRRLIRELVTPGEKNEPLKIYADNKESYFQASYIPIENAAAEEGEARNLG DVILLKNITEFKELDSAKTTFISTISHELKTPISAIMMSLQLLEDKRVGSLNGEQEQLSK NIRDNSQRLLDITGELLNMTQVEAGKLQMMPKITKPIELIEYAIKANQVQADKFNIQIEV DYPQEKIPKLFVDSEKIAWVLTNLLSNAIRYSKENGRVVIGVRHKKEYIELYVQDFGKGI DPRYHQSIFDRYFRVPGTKVQGSGLGLSISKDFVEAHGGTLTVQSEPGKGSCFVIRLKA >gi|226332042|gb|ACIB01000014.1| GENE 57 82334 - 83461 
873 375 aa, chain - ## HITS:1 COG:AGl2094 KEGG:ns NR:ns ## COG: AGl2094 COG2205 # Protein_GI_number: 15891164 # Func_class: T Signal transduction mechanisms # Function: Osmosensitive K+ channel histidine kinase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 1 343 8 349 900 239 36.0 5e-63 MDREESVQHFLDLIKKSRRGKFKVYIGMIAGVGKSYRMLQEAHDLIDNGVDVRIGYIETH GRAGTEALLKGLPVIRRRKLFYKGKELEEMDLEAIIRIHPEIVIVDELAHTNVEGSLNEK RWQDVMTLLDEGINVISAVNIQHIESVNEEVQDIAGIEVKERIPDSVLQEADEVVNIDLT AEELITRLKAGKIYKPEKIQTALNNFFKTENILQLRELALKEVALRVEKKVENEVVMGIG LGVRHEKFMACISSQEKTPRRIIRKVARLATRYNTSFVALYVQVPKESMERIDLASQRHL LNHFKLVTELGGEVVQVQSKDILGSIVQVCKEKQISTVCMGSPGLRLPGALCSVLKYRRF LNNLAQANIDLIILA >gi|226332042|gb|ACIB01000014.1| GENE 58 83557 - 84267 566 236 aa, chain - ## HITS:1 COG:no KEGG:BF0581 NR:ns ## KEGG: BF0581 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 236 14 249 249 446 99.0 1e-124 MVIGMMCFWGSMNANAQQFNVQGDLVSSYVWRGMYQTGASIQPTLGFSVGNFSLTAWGST DFDGTSASAGAAAKEIDLTAAYTLGRSGLTVSVADLWWAGQGAHKYFNFKSHETAHHFEA GVAYTVQSETFPLSVAWYTMFAGQDKNAEGDQNYSSYVEFNYPFRVRMVDLNVTCGMVPY AAPQYNCDGFAVTNVALKGTTQIRFTDKFALPVFAQAVWNPRMEDAHLVFGITLKP >gi|226332042|gb|ACIB01000014.1| GENE 59 84316 - 84891 608 191 aa, chain - ## HITS:1 COG:AGl2092 KEGG:ns NR:ns ## COG: AGl2092 COG2156 # Protein_GI_number: 15891163 # Func_class: P Inorganic ion transport and metabolism # Function: K+-transporting ATPase, c chain # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 22 186 23 184 188 148 47.0 5e-36 MKTLLKSIKITLVFCVFFSVFYILVLWLFAQVAGPNRGNAEVVTLNGKVVGAANVGQTFT EEKYFWGRPSCAGDGYDATSSAGSNKGPTNPEYLAEVEARIDTFLIHHPYLARKDVPAEM VTASASGLDPDITPQSAYVQVKRVAQARGMDVEEVRRVVDKAVEKPLLGIFGTEKVNVLK LNIALEELKNR >gi|226332042|gb|ACIB01000014.1| GENE 60 84903 - 86951 1904 682 aa, chain - ## HITS:1 COG:DRB0083 KEGG:ns NR:ns ## COG: DRB0083 COG2216 # Protein_GI_number: 10957402 # Func_class: P Inorganic ion transport and metabolism # Function: High-affinity K+ transport system, 
ATPase chain B # Organism: Deinococcus radiodurans # 8 678 10 670 675 773 63.0 0 MKNKMSASMFQKEQVIESIRQSFVKLNPRMMIKNPIMFTVEVVTVIMLVVTLLSLFTPEY GTFGYNLCVFLILFVTLLFANFAEAIAEARGKAQADSLRKTREETPAKLVIGDKVQTVSS SKLKKGDVFVCEAGDTIPADGEIIEGLASIDESAITGESAPVIREAGGDKSSVTGGTKVL SDHIKVLVTQQPGESFLDKMIALVEGASRQKTPNEIALTILLAGFTLVFVIVCVTLIPFA DYTSLEHPGTAISIAAILSLFVCLIPTTIGGLLSAIGIAGMDRALRANVITKSGKAVETA GDIDTLLLDKTGTITIGNRRATKFHSAPGVGPREFVTTCLLASLSDETPEGKSIVELGRE SGVRMRSLQTAGARMIQFTAETKCSGVDLADGTQIRKGAFDAIRKITEAAGNKFPKEIEE VISEISGNGGTPLVVCVDRKVTGVIELQDIIKPGIQERFERLRKMGVKTVMVTGDNPLTA KYIAEKAGVDDFIAEAKPEDKMEYIRKEQQSGKLVAMMGDGTNDAPALAQANVGVAMNSG TQAAKEAGNMVDLDNDPTKLIEIVEIGKQLLMTRGTLTTFSIANDVAKYFAIVPALFMVA IPELGALNIMNLHSPESAILSAVIFNAIIIPILIPLALKGVQYKPIGASALLRRNLLIYG LGGVIVPFVGIKLIDLVVSLFF >gi|226332042|gb|ACIB01000014.1| GENE 61 86973 - 88679 1548 568 aa, chain - ## HITS:1 COG:CAC3682 KEGG:ns NR:ns ## COG: CAC3682 COG2060 # Protein_GI_number: 15896914 # Func_class: P Inorganic ion transport and metabolism # Function: K+-transporting ATPase, A chain # Organism: Clostridium acetobutylicum # 4 568 2 557 557 447 43.0 1e-125 MNTEILGVAVQIVLMVVLAYPLGRYIARVYKGEKTWSDFMAPIERVIYKICGINPQEEMN WKQFLKALLILNAFWFVWGMVLLVSQGWLPLNPDGNGAQTPDQAFNTCISFMVNCNLQHY SGESGLTYFTQLFVIMLFQFITAATGMAAMAGVMKSMAAKSTQTIGNFWHFLVISCTRIL LPLSLVVGFILILQGTPMGFDGRMQLTTLEGQEQMVSQGPTAAIVPIKQLGTNGGGYFGV NSSHPLENPTYLTNMVECWSILIIPMAMVLALGFYTNRRKLGYSIFGVMLFAYLAGVFIN VGQEMGGNPRISEMGIAQDHGAMEGKEVRLGAGATALWSVTTTVTSNGSVNGMHDSTMPL SGMVEMLNMQINTWFGGVGVGWLNYYTFIIMAVFISGLMVGRTPEFLGKKVEAREMKIAT FVALLHPFVILVFTAISSYVYTHHPDFVESEGGWLNNLGFHGLSEQLYEYTSSAANNGSG FEGLGDNTYFWNWTCGIVLILSRFIPIVGQVAIAGLLAQKKFIPESAGTLKTDTVTFAVM TFAVIFIVAALSFFPVHALSTIAEHLSL >gi|226332042|gb|ACIB01000014.1| GENE 62 89120 - 90463 1235 447 aa, chain - ## HITS:1 COG:atoC KEGG:ns NR:ns ## COG: atoC COG2204 # Protein_GI_number: 16130157 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding 
domains # Organism: Escherichia coli K12 # 1 445 4 454 461 330 39.0 3e-90 MSKILIIDDEVQIRGLLARMMELEGYEVCQAGDCKTALRQLELQMPDVALCDVFLPDGNG VDLVTEMKKKSPGVEIILLTAHGNIPDGVQAIKNGAFDYITKGDDNNKIIPLISRAVEKA RMNIRLDKLEKKMGLMYSFDSILGESKALKEAVSLARKVSVTDVPVLLTGETGTGKEVFA QSIHYSSKRARQNFVAVNCSSFSKELLESEMFGHKAGSFTGALKDKKGLFEEANNGTIFL DEIGEMAFELQAKLLRILETGEYIKIGDTKPTHIDVRIIAATNRNLPAEIAAGRFREDLF YRLSVFQVHLPPLRERTGDIKILAESFVRSFSEKLSHPVNRIAPAYLEALCQQPWKGNIR ELRNVIERSLIVCEGDRLDVDDLPLEIQNSHYEKSDEGMPGSFELSAMERRHIARVLEYT KGNKTEAARLLKIGLTTLYRKIEEYKL >gi|226332042|gb|ACIB01000014.1| GENE 63 90768 - 92132 1033 454 aa, chain + ## HITS:1 COG:VC0391 KEGG:ns NR:ns ## COG: VC0391 COG0527 # Protein_GI_number: 15640418 # Func_class: E Amino acid transport and metabolism # Function: Aspartokinases # Organism: Vibrio cholerae # 3 436 34 476 479 154 25.0 3e-37 MKIYKFGKIPTGSVQGMKGMLRLIDNSIPQIIVLSATTETTERLAGIAAHLFNRDTEQAH DEISRLEFRFIDFANELFNDESIKQQAVDSIIDRFRTLWNFTRQRFTSVDEKDILAQGEF ISSILVSLYLKEQGINNRLLNSLDFMRLAPEEEPDMEYIGTKLHLLLAEHKSTNVFLTQG HLCRNAYNETCYLKQGGDDVSATLIGAALQAQEVCLWTDSKELHSCDPRFVKHPAMVKQL SFDEAEQLAYCGWTGFNPHCILPARENNIPIRLLCSMEPAEGGTLISNSQSGENIKAITA RDNIYYIKFQSNRTLRPYLFISKIFDTFAKYHTSLCLFASSGSDVSVAINDKERLSHILH ELSRYAATVVKDHMCILSAIGNIQWQCAGFEARIINALATIPIRMISYGSNNNNVSLVIR AEDKREALQRLNDTLFAPCHANPSQIHVPTLKHS >gi|226332042|gb|ACIB01000014.1| GENE 64 92241 - 92357 122 38 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564079|ref|ZP_04841536.1| ## NR: gi|253564079|ref|ZP_04841536.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 38 10 47 47 72 100.0 6e-12 MGYLTYGSAVTPKDILLSPYRLWSDTGMLDIKTWGCCV >gi|226332042|gb|ACIB01000014.1| GENE 65 92451 - 94931 2084 826 aa, chain + ## HITS:1 COG:VCA0644_1 KEGG:ns NR:ns ## COG: VCA0644_1 COG0446 # Protein_GI_number: 15601402 # Func_class: R General function prediction only # Function: Uncharacterized NAD(FAD)-dependent dehydrogenases # Organism: Vibrio cholerae # 2 462 3 483 484 427 49.0 1e-119 MKIIIIGGVAGGATTAARIRRSDETAEIILLEKGKYISYANCGLPYYIGDVIEEREKLFV QTPEAFGVRFRVDVRTENEVIFIDRKKKTVTVRLKSEDTYEESYDKLLISTGASPVRPPL PGIDSTGIFTLRNVADTDRIKAYVNNRPPRRAVVIGAGFIGLEMAENLHALGAQVSIVEM GNQVMAPIDFSMAALVHQHLMEKGVNLYLEQAVASFEQAGKEVKVVFKNGQSILADIVIL SIGVRPETTLARAAELTIGEAGGIAVNDYLQTSDESIYAIGDAIEFRHPITGKPWLNYLA GPANRQGRIVADNLLGAQIPYEGAIGTSIAKVFDMTVASTGLPGKRLKQAGIVYASSTTH PASHAGYYPDAMPMSIKITFDPQTGKLYGGQIVGYDGVDKRIDELSLVIKHEGTIYDLMK VEQAYAPPFSSAKDPVAIAGYVAENIILGRVKPVYWRDLRDIELKDVFLLDVRTPDEFAL GSLPGAVNIPLDEIRDRIAELPSNKPIYTFCAVGLRGYLAYRILIQHGFKEVYNLSGGLK TYRAATAPIILHENEETDDTPSAQDSPAKPSVTAEAPQTTTAANPKTIRVDACGLQCPGP VLKMKKTMDTLVPGERVEIVATDPGFSRDAAAWCNSTGNKFISKDSTGGKSVVVIEKGEP QACNLTTTCDSKGKTLIMFSDDLDKALATFVLANGAAATGQKVTIFFTFWGLNVIKKLHK PKVEKDIFGKMFGTMLPSSSLKLKLSKMSMGGMGGKMMRYIMHRKGIDSLESLRQQALEN GVEFIACQMSMDVMGVKREELLDKVTVGGVATYMERADNANINLFI >gi|226332042|gb|ACIB01000014.1| GENE 66 94941 - 95291 339 116 aa, chain + ## HITS:1 COG:no KEGG:BF0574 NR:ns ## KEGG: BF0574 # Name: not_defined # Def: putative transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 116 1 116 116 206 100.0 3e-52 MNTICKMRDIYKALSIFETAFEEVYGISLNEAMVLCALREAGKEITSTAIAERTEMAPSH TSKVIRAVEDKGLIRRALGEVDKRQMYFSLTEAGKKRLNELDLDKVEIPEMLKPLI >gi|226332042|gb|ACIB01000014.1| GENE 67 95450 - 96748 1008 432 aa, chain - ## HITS:1 COG:no KEGG:BF0573 NR:ns ## KEGG: BF0573 # Name: not_defined # Def: putative secreted tripeptidyl aminopeptidase # Organism: B.fragilis # Pathway: not_defined # 1 432 24 455 455 852 100.0 0 MKNLRIWQTLSLFILLLSITLPLSAKSDLLTKLNTITLIIRTQSLETSLFAEKYLLRFKQ 
PLDHSHPEKGSFSQRVIVAHVGYDRPTLMVTEGYGAARSLNPGYYEELSKLFNTNIIAVE HRYFLESTPKPKDWKYLTAWNSARDLHAIREAFRSIYPGKWIATGISKGGQTAMLYRTYF PDDIDITVPYVAPLCRSVEDGRHEPFLRTVAMPDDRQKVEDFQMEVLKRKAALLPHFKKY CSVRKLQFRAPVEDIYDYTVLEYSFSLWQWGIPVSRIPSVSASDKELFDHLVAISAPSYF VKEGSNTSFFVQAARELGYYGYDIRPFREYLSIGTSKDYLRRLMIPEELADMEFDETLSY KITRFLKENDPKMIFIYGQYDPWTAAGVTWLKGKKNIHVFVQPKGSHMARIHTLPQKEKE EAIGLIKKWLEE >gi|226332042|gb|ACIB01000014.1| GENE 68 96926 - 97558 616 210 aa, chain + ## HITS:1 COG:no KEGG:BF0572 NR:ns ## KEGG: BF0572 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 210 1 210 210 387 100.0 1e-106 MKRLIPILLAVFAFAACEKDPDMDKLDNDYLVYTNYDKKADFKQFSTYYIPDSVLVIGDK KDPEYWKGEAAEAIINAYKENLNSKGFTYTDNKDAADLGIQVSYVQSTYYFTDYGQPEWW WNYPGYWDAPYWGNWGGWYYPYVVNYSITTNSFLTEIMNLKAPEGEKQKLPVLWSSFLSG PASYSGKVNQTLVVRAINQSFAQSPYLTNK >gi|226332042|gb|ACIB01000014.1| GENE 69 97576 - 98202 706 208 aa, chain + ## HITS:1 COG:no KEGG:BF0571 NR:ns ## KEGG: BF0571 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 208 1 208 208 355 100.0 6e-97 MKTMKYISLRVLAIAALALAFAMPAKAQLSDNGYANVDWQFNAPLKNHFADKASGWGMNF EGGYFVTPNLGLGLFMNYSSNHEYFPRATFPVGEGTVNTDQQHTIFQLPFGAAARYQMNR GGAWQPYFGVKLGANYAKVRSNYNMFETRDNTWGFYVSPEIGLNVYPWAYGPGLHIAVYY SYATNKANVLTYNVDKLNNFGFRVGLAF >gi|226332042|gb|ACIB01000014.1| GENE 70 98326 - 99786 1230 486 aa, chain - ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 10 468 7 467 497 189 30.0 1e-47 MNQKLLLGSALLVGMASTQQALARQKKAKEQTRPNVVFILADDLGYGDLSCYGQEKFETP NIDRLAQNGMRFTQCYSGTTVSAPSRSCLITGTHSGHTAIRGNKELAPEGQFPLPENSQT IFNDFRNAGYRTGAFGKWGLGYIGSAGDPYKQGIDQFYGYNCQLLAHSYYPDHLWDNDKR VDLPDNNLNVQYGKGTYSQDLIHSKALAFLDEAAKEKDQPFFMWYPTIIPHAELIVPEDS IIKKFRGKYPEKPYRGVEPGSPAFRKGGYCTQFYPHATFAAMVYRLDVYVGQIVQKLKDM 
GVYDNTIIIFSSDNGPHMEGGADPDFFNSNGIWRGYKRDVYEGGIRVPMIISWPGHVQPS TETDFMCSFWDLMPTFREVLNPKADTRNMDGVSILPLLQNRKGQKEHEYLYFEFLEMNGR QAVRKGDWKLVHMNIRGNKPYYELYNLASDPSEKYNVLNQYPEKADELKAIMKEAHIEDS NWPLFR >gi|226332042|gb|ACIB01000014.1| GENE 71 99981 - 101951 1943 656 aa, chain - ## HITS:1 COG:no KEGG:BF0569 NR:ns ## KEGG: BF0569 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 656 1 656 656 1347 100.0 0 MNMKKILLFLTLFTGMILAGCNDSFLEKYPVTSLTEENAFKSYDNFKAFMWPCYEMFSNT NIATSTTAIGRNSHYMGDVYAGYLNQRGASSQNKYAFQTVTNATSGNGWNFSTFIRRINL MLSHVDDSDMTEAEKNHWKAVGYFFHSFWYMELIDRFGDVPWIDKPLDETSEEAYGTRMP RLEVADKVLERLQWAEQNIGDASVYEKKDGSNTINRDCVRAALSRFTLREATWRKYHELG SYDKYFDECIRVSKLLMADYPTLYYGTDGQPAAGYGEMWTTEDLSKVQGVILYQQWLETI KPGHSCYYEHTSSHDIEMHQGTVDLYLCKDGKTISHSDQYQGDKDIYATFRNRDPRMYHT IMPPYKVKDGKGDYSTWSYTSDPADREYIDIMGANESCSNPGIGMKRLPGQNWSASLVRR VPNLQGGAQYTIEGKKYGPHAYVASRSGYYVWKNWDNWEENYNNAQVNTADKPVFKIEEV LLNYAEAMFETNQFTQTIADETINKLRKRAGVADMVVAQIDGNFDPKRGKYYPKGNDNGI LVDPVLWEIRRERIIELMGEGFGFYDVRRWRMAPWFVNMQQKGMWISKTELSSLTLLNEM TGTSDGANGSMTEGYIYLFNDPLKEGKGWLEKYYLYQVPLEEIALNPNLTQNPGWE >gi|226332042|gb|ACIB01000014.1| GENE 72 101967 - 105197 2951 1076 aa, chain - ## HITS:1 COG:no KEGG:BF0568 NR:ns ## KEGG: BF0568 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 14 1076 1 1063 1063 2024 99.0 0 GLGISAGLLSPNYVFATSLETYENQSVAAVQQARKITGTLTDAVGEPIIGATVLEKGNPS NGTITDINGKFSLSVHPNAVISISYIGYITQNIKITNQTSLKVVMMDDTQALEEVVVVGY GSQKKANLTGAVSSVKMDEVLGDRPILNASDALQGAVPGLFVSNGGNAPGTSKSFQIRGA YSVGVKNSDGSYGNTIKPLVLIDNVEGDLDMVNPEDIESISVLKDAASAAIYGARAAGGV IVVTTKRPKGAAKFSLNYNNNFAFGTAVNLPKQAPLMDYLQAYLDCGYSDAYWSLGSPSV SKWMEYLSEYQKNPSAFNTVGDGIYMDESGVPYYLNEKDLYKNFMETSFQMTHNISASGG TDKLRYRISGGYTSNDGVLVSDRDKFERMNINTFISADVTNWFTQEVTMSYAHSLQTSPG GMGGVYNTRLVSYYPEGDLPASVNTLANEDLPLFTPRNQILLSNPVNNNNDNPRIFLKSI LKPLKGLEAVFEYTFDKNIYDYHWYTGQYDYTTIQGGSSKSFVDDYLRKYKQHTNYNAIN VYATYSKKFGDHNFKVMAGFNQESSYQETLDAYSYNQAVIDVPAMGSGTGTIKATDSYSE 
YAVRGGFFRVNYNYQDKYLLEVNGRYDGSSKFPKSSRFGFFPSVSAGWQIAQEKFMESTR NWLDGLKIRASYGVIGNQNVNPYTFTPTMSVSNKSTSWIIDNTYVTSISSLPALVSQNFT WEKVGTVNVGLDINLFNNRLNGVFEWYQRNTNGMLAPGVQLPAVVGASAPYQNTADMRTR GWELSLNWRDQIGKVGYRLGFNLSDYKSKITKYDDNATTKLLSSFYPGQVMGEIWGYIAD GYYSVDDFEDTSSWKLKEGITSINGYNVRPGDVKFKNLRDDESSTNVITSGDNTFDNPGD RKVIGNTTPRYQYGINLGANYAGFDLNVILQGTGKRDYWISNVLTFPMNGDNFIPLFDGL SDYWMPKDPDNGDWTAVNPNAKYPRLYGNRGNSGSNLRQSDKYLSDASYLRIKNITLSYN LPKKWLSQIFLSQMKAFVSVENVATFTSLPSGIDPERIEWNYPAFRTVSFGINITL Prediction of potential genes in microbial genomes Time: Tue May 17 22:26:42 2011 Seq name: gi|226332041|gb|ACIB01000015.1| Bacteroides sp. 3_2_5 cont1.15, whole genome shotgun sequence Length of sequence - 99770 bp Number of predicted genes - 88, with homology - 87 Number of transcription units - 41, operones - 23 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 182 - 222 5.2 1 1 Op 1 1/0.000 - CDS 254 - 3289 2642 ## COG1472 Beta-glucosidase-related glycosidases 2 1 Op 2 . - CDS 3337 - 6327 2642 ## COG1472 Beta-glucosidase-related glycosidases 3 1 Op 3 1/0.000 - CDS 6395 - 7267 266 ## PROTEIN SUPPORTED gi|145635642|ref|ZP_01791339.1| 30S ribosomal protein S16 4 1 Op 4 . - CDS 7307 - 8095 678 ## COG0737 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases - Prom 8232 - 8291 9.0 + Prom 8063 - 8122 4.4 5 2 Tu 1 . + CDS 8252 - 8605 584 ## PROTEIN SUPPORTED gi|53711854|ref|YP_097846.1| 50S ribosomal protein L19 + Term 8637 - 8675 7.1 - Term 9292 - 9333 9.7 6 3 Tu 1 . 
- CDS 9360 - 10340 481 ## PROTEIN SUPPORTED gi|116517028|ref|YP_816079.1| glucokinase - Prom 10360 - 10419 7.3 7 4 Op 1 36/0.000 + CDS 10519 - 11235 364 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 8 4 Op 2 10/0.000 + CDS 11250 - 12509 1228 ## COG0577 ABC-type antimicrobial peptide transport system, permease component 9 4 Op 3 13/0.000 + CDS 12512 - 13756 1225 ## COG0577 ABC-type antimicrobial peptide transport system, permease component + Prom 13783 - 13842 4.0 10 4 Op 4 13/0.000 + CDS 13865 - 14965 1244 ## COG0845 Membrane-fusion protein 11 4 Op 5 . + CDS 15014 - 16336 1261 ## COG1538 Outer membrane protein + Term 16360 - 16403 7.6 + Prom 16348 - 16407 3.5 12 5 Op 1 . + CDS 16576 - 17337 468 ## BF0555 RNA polymerase ECF-type sigma factor 13 5 Op 2 . + CDS 17432 - 18361 747 ## COG3712 Fe2+-dicitrate sensor, membrane component + Prom 18489 - 18548 6.3 14 6 Op 1 . + CDS 18570 - 21740 2824 ## BF0553 hypothetical protein 15 6 Op 2 . + CDS 21757 - 23472 1683 ## BF0552 hypothetical protein + Prom 23500 - 23559 4.3 16 7 Tu 1 . + CDS 23580 - 25817 1628 ## COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases 17 8 Tu 1 . + CDS 25931 - 28099 1560 ## COG3345 Alpha-galactosidase 18 9 Op 1 . - CDS 28273 - 28833 517 ## BF0549 hypothetical protein 19 9 Op 2 . - CDS 28912 - 30210 593 ## COG0249 Mismatch repair ATPase (MutS family) - Prom 30251 - 30310 3.8 + Prom 30194 - 30253 3.4 20 10 Tu 1 . + CDS 30303 - 32225 1365 ## COG0744 Membrane carboxypeptidase (penicillin-binding protein) + Term 32253 - 32284 0.1 - Term 32225 - 32290 10.6 21 11 Op 1 . - CDS 32345 - 33208 468 ## BF0494 hypothetical protein 22 11 Op 2 . - CDS 33218 - 33886 495 ## BF0544 hypothetical protein 23 11 Op 3 . - CDS 33901 - 34554 442 ## BF0543 hypothetical protein - Term 34579 - 34615 7.3 24 12 Tu 1 . 
- CDS 34657 - 35496 221 ## PROTEIN SUPPORTED gi|212640476|ref|YP_002316996.1| Uncharacterized protein conserved in bacteria containing two ribosomal protein S1-like RNA-binding domains - Prom 35603 - 35662 5.7 + Prom 35456 - 35515 5.4 25 13 Tu 1 . + CDS 35717 - 36718 1150 ## COG0039 Malate/lactate dehydrogenases + Term 36747 - 36795 13.2 26 14 Tu 1 . - CDS 36810 - 37547 809 ## BF0540 putative potassium channel subunit - Prom 37568 - 37627 3.3 27 15 Tu 1 . + CDS 37764 - 39362 1404 ## COG0531 Amino acid transporters + Term 39385 - 39452 6.6 - Term 39377 - 39435 7.2 28 16 Tu 1 . - CDS 39455 - 40690 878 ## COG0642 Signal transduction histidine kinase - Prom 40778 - 40837 5.5 + TRNA 41131 - 41203 82.1 # Phe GAA 0 0 + TRNA 41216 - 41288 74.7 # Pro CGG 0 0 + Prom 41535 - 41594 6.5 29 17 Op 1 . + CDS 41614 - 42087 506 ## COG1438 Arginine repressor 30 17 Op 2 . + CDS 42121 - 42699 475 ## BF0534 hypothetical protein 31 17 Op 3 . + CDS 42713 - 43918 1457 ## COG0137 Argininosuccinate synthase 32 17 Op 4 1/0.000 + CDS 43915 - 44883 817 ## COG0002 Acetylglutamate semialdehyde dehydrogenase 33 17 Op 5 . + CDS 44893 - 46017 944 ## COG4992 Ornithine/acetylornithine aminotransferase + Term 46041 - 46102 2.2 + Prom 46028 - 46087 5.1 34 18 Op 1 . + CDS 46130 - 46903 787 ## COG0345 Pyrroline-5-carboxylate reductase 35 18 Op 2 1/0.000 + CDS 46963 - 47517 770 ## COG1396 Predicted transcriptional regulators 36 18 Op 3 . + CDS 47535 - 49190 1434 ## COG0365 Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases + Prom 49210 - 49269 5.3 37 19 Tu 1 . + CDS 49339 - 49539 133 ## BF0527 hypothetical protein + Term 49571 - 49619 8.2 - Term 49564 - 49603 4.2 38 20 Tu 1 . - CDS 49679 - 51994 1508 ## BF0526 hypothetical protein - Prom 52100 - 52159 7.5 39 21 Tu 1 . - CDS 52238 - 52348 74 ## - TRNA 52915 - 53001 61.5 # Leu TAA 0 0 - TRNA 53016 - 53088 84.5 # Gly GCC 0 0 - Term 53180 - 53224 6.5 40 22 Op 1 . - CDS 53233 - 54576 1119 ## COG0165 Argininosuccinate lyase 41 22 Op 2 . 
- CDS 54582 - 54992 468 ## BF0511 hypothetical protein - Prom 55012 - 55071 2.5 - Term 55005 - 55063 12.5 42 23 Op 1 . - CDS 55092 - 55730 661 ## COG0461 Orotate phosphoribosyltransferase - Prom 55756 - 55815 4.7 43 23 Op 2 . - CDS 55826 - 56290 491 ## BF0509 putative regulatory protein 44 23 Op 3 . - CDS 56287 - 57123 383 ## PROTEIN SUPPORTED gi|225874212|ref|YP_002755671.1| ribosomal protein L11 methyltransferase - Prom 57181 - 57240 5.0 45 24 Tu 1 . + CDS 57177 - 58217 364 ## COG0117 Pyrimidine deaminase + Prom 58220 - 58279 6.9 46 25 Op 1 . + CDS 58356 - 59714 861 ## BF0451 hypothetical protein 47 25 Op 2 1/0.000 + CDS 59719 - 60453 612 ## COG0020 Undecaprenyl pyrophosphate synthase 48 25 Op 3 . + CDS 60477 - 63113 2751 ## COG4775 Outer membrane protein/protective antigen OMA87 49 25 Op 4 . + CDS 63138 - 63653 680 ## BF0503 cationic outer membrane protein precursor 50 25 Op 5 . + CDS 63711 - 64220 656 ## BF0502 putative outer membrane protein OmpH 51 26 Op 1 . + CDS 64323 - 65165 609 ## COG0796 Glutamate racemase 52 26 Op 2 . + CDS 65242 - 65478 313 ## BF0500 hypothetical protein + Prom 65495 - 65554 4.0 53 27 Op 1 22/0.000 + CDS 65592 - 66674 1070 ## COG0263 Glutamate 5-kinase 54 27 Op 2 . + CDS 66687 - 67934 1175 ## COG0014 Gamma-glutamyl phosphate reductase + Term 68047 - 68082 1.1 + Prom 68341 - 68400 7.5 55 28 Op 1 . + CDS 68422 - 69444 656 ## BF0441 hypothetical protein 56 28 Op 2 . + CDS 69483 - 70211 631 ## BF0495 hypothetical protein 57 28 Op 3 . + CDS 70265 - 71221 1077 ## COG0078 Ornithine carbamoyltransferase - Term 71232 - 71297 15.1 58 29 Op 1 . - CDS 71320 - 71715 378 ## COG0607 Rhodanese-related sulfurtransferase 59 29 Op 2 . - CDS 71720 - 72415 329 ## BF0492 hypothetical protein 60 29 Op 3 . - CDS 72412 - 73560 1177 ## BF0491 hypothetical protein - Term 73620 - 73663 -0.8 61 30 Op 1 . - CDS 73701 - 74675 907 ## COG1181 D-alanine-D-alanine ligase and related ATP-grasp enzymes 62 30 Op 2 . 
- CDS 74672 - 75745 1036 ## COG0564 Pseudouridylate synthases, 23S RNA-specific 63 30 Op 3 . - CDS 75775 - 76416 593 ## BF0488 hypothetical protein - Prom 76543 - 76602 3.8 - Term 76557 - 76606 10.8 64 31 Tu 1 . - CDS 76628 - 76789 270 ## PROTEIN SUPPORTED gi|53711778|ref|YP_097770.1| 50S ribosomal protein L34 - Prom 76945 - 77004 6.5 + Prom 76789 - 76848 5.9 65 32 Tu 1 . + CDS 76983 - 77549 802 ## COG0231 Translation elongation factor P (EF-P)/translation initiation factor 5A (eIF-5A) + Term 77575 - 77629 17.7 + Prom 77867 - 77926 2.8 66 33 Op 1 . + CDS 77985 - 78998 615 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 67 33 Op 2 . + CDS 79041 - 79733 546 ## COG2003 DNA repair proteins 68 34 Op 1 . - CDS 79762 - 80526 799 ## COG2908 Uncharacterized protein conserved in bacteria 69 34 Op 2 . - CDS 80572 - 80883 407 ## COG2151 Predicted metal-sulfur cluster biosynthetic enzyme 70 34 Op 3 . - CDS 80966 - 81490 565 ## BF0480 alkaline phosphatase III precursor 71 34 Op 4 . - CDS 81438 - 82361 882 ## COG1785 Alkaline phosphatase - Prom 82404 - 82463 6.2 - Term 82511 - 82556 8.5 72 35 Op 1 21/0.000 - CDS 82584 - 83780 1370 ## COG0282 Acetate kinase 73 35 Op 2 . - CDS 83799 - 84818 1252 ## COG0280 Phosphotransacetylase - Prom 84866 - 84925 3.2 - Term 84880 - 84939 13.0 74 36 Op 1 . - CDS 84952 - 86505 1321 ## BF0477 hypothetical protein 75 36 Op 2 . - CDS 86550 - 87347 725 ## BF0421 hypothetical protein 76 36 Op 3 . - CDS 87373 - 88056 839 ## BF0475 hypothetical protein - Prom 88212 - 88271 4.6 + Prom 88113 - 88172 7.5 77 37 Op 1 23/0.000 + CDS 88265 - 88603 369 ## COG1380 Putative effector of murein hydrolase LrgA 78 37 Op 2 . + CDS 88600 - 89295 649 ## COG1346 Putative effector of murein hydrolase 79 37 Op 3 . + CDS 89350 - 90267 905 ## COG4866 Uncharacterized conserved protein 80 37 Op 4 . + CDS 90277 - 91296 506 ## COG4552 Predicted acetyltransferase involved in intracellular survival and related acetyltransferases 81 38 Op 1 . 
- CDS 91389 - 93302 1078 ## BF0412 hypothetical protein 82 38 Op 2 . - CDS 93344 - 94444 396 ## BF0469 hypothetical protein 83 38 Op 3 . - CDS 94506 - 94676 75 ## BF0468 hypothetical protein 84 38 Op 4 . - CDS 94646 - 95722 618 ## BF0467 hypothetical protein - Prom 95800 - 95859 6.4 85 39 Op 1 . - CDS 95921 - 97087 234 ## BF0466 hypothetical protein 86 39 Op 2 . - CDS 97157 - 98326 604 ## BF0413 hypothetical protein - Prom 98367 - 98426 2.1 - Term 98367 - 98408 8.1 87 40 Tu 1 . - CDS 98447 - 98734 262 ## BF0404 hypothetical protein - Prom 98754 - 98813 9.9 88 41 Tu 1 . - CDS 98862 - 99749 462 ## BF0463 hypothetical protein Predicted protein(s) >gi|226332041|gb|ACIB01000015.1| GENE 1 254 - 3289 2642 1011 aa, chain - ## HITS:1 COG:BH0675 KEGG:ns NR:ns ## COG: BH0675 COG1472 # Protein_GI_number: 15613238 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Bacillus halodurans # 56 428 131 528 686 239 34.0 2e-62 MAGLVVSALLGVGAKVPASMDAPVREVFHTPPGMSAPIEPLLLYQASQDEKCRHWVDSVY NRMNLREKVGQLFIYTIAPVQTKRNMQLLRDAVHTYKVGGLLFSGGKIQNQATLTNEAQR MARCPLLITFDGEWGLSMRLRGTPVFPRNMVLGCIQDNRLIYEYGREMARQCREMGVQVN FAPVADVNINPDNPVINIRSFGEDPVKVADKVIAYASGLESGKVLSVCKHFPGHGDTDVD SHKALPVLPFTRERLDSVELYPFKEAIRAGVSGMMVGHLQVPVIEPIGDLPSSLSRNVVY GLLTEELAFKGLIFTDALAMKGVAGNKSVCLQALQAGNDMVLAPRRLKEEMDAVLEAVEK GELPEEEINAKCRKVLTYKYILGLERKPFVKLSGLGTRINTPQTRDLISRLNLAAITVLN NKNDVLPLHPDLKEAAILNVGKPEEIEPFDRKMKKYTSFARFQLRKDLPEAEQQKLRDSL AAYRRVIVTVTEQRLAPYQSFFAKFAPESPVIYVFYTPAKSMLQIQRAVSAAEAVVLAHA SRDDVQERVADLLFGKATADGRLSASIGGLFPTGSGVTITPHTPFHFVPEEYGMKSEVLR RIDTIALEGIKEGAYPGCQVLVMKDGKALYDRCFGYHTDANSEKVKPTDIYDLASLSKTT GTLLAIMKLYDKGRFNLTDKVSDYLPFLRKTNKENLTIRELLMHQSGLPSGLLFYQEAID GKSYKGSLFKQSKDALHTVRLGVRTWGNPRFRFNKGMASKEKNGDYTLQVCDSLWLNRSF REEIRKKIAEAPLKDKSYRYSDVGFILLQMLAEELSGKPMDEYLWQEFYQPMGLEHTAYL PLRYFDKKEVVPSAVDRFLRKTTLQGFVHDESAAFQGGISGNAGLFSNAREVGRIYQMLL NGGELDGRRYLSKETCALFTTEKSKISRRGLGFDKPDVVNESKSPCAASVPVTVFGHTGF 
TGTCAWVDPDNGLIYVFLSNRTYPDAWVNKLSKLEIREKIQETIYEAMKEK >gi|226332041|gb|ACIB01000015.1| GENE 2 3337 - 6327 2642 996 aa, chain - ## HITS:1 COG:BH0675 KEGG:ns NR:ns ## COG: BH0675 COG1472 # Protein_GI_number: 15613238 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Bacillus halodurans # 24 411 117 529 686 244 33.0 7e-64 MLKQLLTVVFLLSAGTLWAQQAAGLLPVQEDTHCKEWVEQTLSRMKLKDKVGQLFVYTLA PRADKDTEKLVGKLTRKFKVGAFLYSEGTVEDQANLTNYAQRQSKIPLMITFDGEWGLAM RLENTPVFPRNAALGCISDNTLIEAYGQEVARELREIGAHVNFAPDADVNTNPENPVIHV RSFGENPKTVAEKVIAYGRGLETGGILSVSKHFPGHGDTDVDSHQALPAVYYNRARLDSV ELYPFKEAIQAGLGGVMVGHLQVPALEPDRITPSSLSHSIVTDLLRGELGFNGLVFTDAL AMKGVAAESDVTVKALKAGNDMVLVQQNVEKAQESVVQAIKDGRLTMEEIDAKCRRILAY KYRLGLSRRPMIPVDGLSDRIHTPEAQALVTKLRTSAVTVLGNYFQILPLTATKGEIAVL TVGDEGSDASFIEGLRSELPLKTFRMDKNTGEEERRKIVKELGNYRRVVVCITVQDKEAG EYRSFFAGFRPQAPVVYAFFTSYRALASLEEAAARSAAVVLAHSGEEDLQRYVADVILGK ASATGRLSMRIGNTFAAGSGVDVISGSPAGIAPEDYGLKSYRLHRIDSVVAAGLAAKAFP GCQVLVLRHGQPVYDKCFGTHSVTDTTPVRATDLFDLASLTKTSATLLAVMKLYDQGRIE LTDAVSKYVPALRATNKKNITIRELLLHESGLVPYIRFYRDAIDEYSVTGPFTQGFVDEW HHTRMGEYTYACSDFKFKKGLVSATKTSGHTLQIADGLWLDKKFKAAMMKSIAQSELDRK RFVYSDIGFILLQQVVEAVTGKTLDAYLVSEFYRPMGLEHTLFQPLNRYKKADIMPTAAN DYLRRQDLCGYVHDEAAAFMGGVSGNAGLFSTAQELGKIYQMILNEGELDGKRYLRPETC RIFTTEKSAVSHRGLGYDKPNLKDPKANACASSAPASVYGHTGFTGTCAWVDPENDLVYI FLSNRLCPDAWNGKLNSMKIRQAIQEVIYQSLYTPE >gi|226332041|gb|ACIB01000015.1| GENE 3 6395 - 7267 266 290 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|145635642|ref|ZP_01791339.1| 30S ribosomal protein S16 [Haemophilus influenzae PittAA] # 5 250 3 252 603 107 28 3e-22 MIFPINKKHTFFSKEKTLLTFFFILFISFSAFGQQDKKLILLQTSDVHSRLEPINQEGDR NYDKGGFVRRATFVKEFRKEHPDMLLFDCGDISQGTPYYNMFQGEVEVKMMNEMKYDAMT IGNHEFDFDLDNMARLFRMADFPVVCANYDVSATVLKDLVKPYVVFERDGVKIGVLGLGC QLEGMVQANKCVGVVYNDPVTVANEVAAVLKEKEGCDVVVCLSHLGVQYDENQLIPKTRN IDVVLGGHSHTFMKGPKTLLNMDGKNVSLMHTGKSGIYVGQMDLTLEKKK >gi|226332041|gb|ACIB01000015.1| GENE 4 7307 - 8095 678 262 aa, chain - ## HITS:1 
COG:BH1015_1 KEGG:ns NR:ns ## COG: BH1015_1 COG0737 # Protein_GI_number: 15613578 # Func_class: F Nucleotide transport and metabolism # Function: 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases # Organism: Bacillus halodurans # 28 261 270 504 707 93 29.0 3e-19 MKQNYAKIISGFILAGLLTFSSCQSTHEMAKTDYQIAKVEGRMIDIDAKWDTHPDADAVA ILKPYKEKIDNMMYEVIGSSEQKMDKGHPESLLSNLVAEVLRQAATKVQDKPADMGLVNM GGLRNILPAGDITVGTVYEILPFENSLCVMKMKGTHLKALLTSIASLKGEGVSGIRMEIT KDGKLLNATVGGQPIDDNKLYTVATIDYLADGNGSMEAFLQADDRVCPEGATLRGLFLDY VRQQTAAGKKITSALDGRITVK >gi|226332041|gb|ACIB01000015.1| GENE 5 8252 - 8605 584 117 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53711854|ref|YP_097846.1| 50S ribosomal protein L19 [Bacteroides fragilis YCH46] # 1 117 1 117 117 229 100 4e-59 MDLIKIAEEAFATGKQHPSFKAGDTVTVAYRIIEGNKERVQLYRGVVIKIAGHGEKKRFT VRKMSGTVGVERIFPIESPAIDSIEVNKVGKVRRAKLYYLRALTGKKARIKEKRVNG >gi|226332041|gb|ACIB01000015.1| GENE 6 9360 - 10340 481 326 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|116517028|ref|YP_816079.1| glucokinase [Streptococcus pneumoniae D39] # 10 317 5 316 319 189 36 4e-47 MNSSMEKPYVVGIDIGGTNTVFGIVDARGTIIASGAVKTQVYPTVEEYADEVCKNLLPLI IANGGVDKIKGIGIGAPNGNYYTGTIEFAPNLPWKGVLPLASMFEERLGIPTALTNDANA AAVGEMTYGAARGMKDFIMITLGTGVGSGIVINGQVVYGHDGFAGELGHVIVRRDGRICG CGRKGCLETYCSATGVARTAREFLAARTDASLLRNIPAESIVSKDVYDAAVQGDKLAQEI FEFTGNILGEALADAIAFSSPEAIILFGGLAKSGDYIMKPIMKAMENNLLNIYKGKAKLL VSELKDSDAAVLGASALAWELKDLRD >gi|226332041|gb|ACIB01000015.1| GENE 7 10519 - 11235 364 238 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 219 1 220 245 144 38 1e-33 MIHLKDINKTYNNGAPLHVLKGINLDIERGEFVSIMGASGSGKSTLLNILGILDNYDDGE YYLNNVLIKDLSETKSAEYRNRMIGFIFQSFNLISFKNAVENVALPLFYQGVSRKKRNAM AMEYLDKLGLKDWAHHMPNEMSGGQKQRVAIARALITQPQIILADEPTGALDSKTSVEVM QILKDLHKTGMTIVVVTHESGVANQTDKIIHIKDGIIERIEENLNHDASPFGKDGYMK >gi|226332041|gb|ACIB01000015.1| GENE 8 11250 - 12509 1228 419 aa, chain + ## HITS:1 COG:SMc04351_2 KEGG:ns NR:ns ## 
COG: SMc04351_2 COG0577 # Protein_GI_number: 15965824 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Sinorhizobium meliloti # 12 419 10 393 393 115 26.0 1e-25 MIDIWQEIYGTIKRNKLRTLLTGFAVAWGIFMLIVLLGAGNGLIHAFEKSSSARALNSIK IYPGWTGKPYDGLKEGRRIQLDNKDLDATMEHFSDNIISVGASQWQSNVNLSYGQEYVNL SLEGVYPNFTEVESVKSTDGRFINDIDLKERRKVIVLHTKTAEILFGKSKTEPIGKFVNA GGVSYQVVGLYTDPGDQGSSEAYIPFSTLQVIYNKGDKLNNLTFTTKGLTTIETNEAFEA AYRKVMGAKHRFDPSDNSALWIWNRFTNYLQSQNAMGILRTAIWVIGIFTLLSGIVGVSN IMLITVKERTREFGIRKALGAKPFSILWLIIVESVTITTLFGYIGMVAGIAATEWMNKVA GEQTVDVGMFSETVFLNPTVDISIAIQATLTLVVAGTLAGFFPAKKAVSIRPIEALRAD >gi|226332041|gb|ACIB01000015.1| GENE 9 12512 - 13756 1225 414 aa, chain + ## HITS:1 COG:AGc3336 KEGG:ns NR:ns ## COG: AGc3336 COG0577 # Protein_GI_number: 15889120 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 6 414 22 420 420 164 28.0 4e-40 MRVDMDTCEEILVTITRNKTRSLLTAFGVFWGIFMLVALMGGGQGMQEMMQAEFEGFATN SGFMASQKTGEAYKGFRKGRYWDIENADIERIRKKVKDIDVITPSIARWGSTAIYGEKKY DCSVKGLYPDYAKIENQDMAYGRFINDVDVREGRKVCVIGKRVYESLFNPGEDPCGKYVR VDGIYYQVIGMCVSEGNMNIQGRASEAVVLPFSTMQQAYNMGKRIDVICYTVKPGKKVSD LEPEIEAILKEAHYISPDDKQAVMKLNAEAMFSMMDNLFTGIHVLIWMVGLGTLLAGAIG VSNIMMVTVKERTTEIGIRRAIGARPKDILQQILSESMVLTTIAGMAGISFGVLILQLME IGVNSGKDHYSHFQVSFGMAIGTCLLLVTLGLLAGLAPAYRAMAIRPIEAIRDE >gi|226332041|gb|ACIB01000015.1| GENE 10 13865 - 14965 1244 366 aa, chain + ## HITS:1 COG:VC1563 KEGG:ns NR:ns ## COG: VC1563 COG0845 # Protein_GI_number: 15641571 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 8 316 15 335 338 131 25.0 2e-30 MKKYLKITLLVVVAAIFIGTFIFLYQKSKPKTTVYETVTPEIADLEKTTVATGKVEPRDE VLIKPQISGIISEVYKEAGQTIKQGEVIAKVKVIPELGQLNSAESRVRVAEISTAQAETD HERIKKLYNDKLISREDYEKSEVEIKKAREELQTAKDALEIIKEGITKNSASFSSTLIRS 
TIDGLILDVPIKVGNSVIMSNTFNDGTTIATVANMNDLIFKGKIDETEVGRIHEGMPVKL TIGALQNLTFDAELEYISPKGVEENGANQFEIKAAVHAPDSVQIRSGYSANAEIVLQRAQ KVLAVPEGIIEFSGDSTFVWVMTDSIPEQKFERRQIKTGMSDGIKLEIKEGLTGKEKVRA SEKKDK >gi|226332041|gb|ACIB01000015.1| GENE 11 15014 - 16336 1261 440 aa, chain + ## HITS:1 COG:CC1318 KEGG:ns NR:ns ## COG: CC1318 COG1538 # Protein_GI_number: 16125567 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Caulobacter vibrioides # 14 437 19 430 483 89 23.0 1e-17 MRNKILINLLILTGLSAYTAQAQEGWTLRRCIDYAIEHNINVQQTANSAEQSKVEVNTAK WARLPNLSGSASQNWSWGRTASPVDNTYNDINSGSSSFSLGTNIPLFTGLELPNQYALTK LNLKAAIEDLNKAKEDLAINVTSAYLQVLFNQELSKVAQSQVGLSKEQLNRITRLHEVGK ASPAEVAEAKARVAQDEMSAVQADNNYRLALLDLSQLLELPTPENFSLATPDTELEFSPL TSPDEIYNQAMLYKPGIKAAEYRLEGSEKNVRIAKSSYYPQLSFSAGLGTNFYTVNGNAG SNFGNQMKNNLNKYAGFSLNIPLFNRLATRNRVRTARLQQTNLALQLDNTKKVLYKEIQQ AWYNAIAAESKFKSSESAVEASQESFRLMSEKFDNGKATSVEYNESKLNLTKALSDRIQA KYDYLFRTKILDFYKGQPIE >gi|226332041|gb|ACIB01000015.1| GENE 12 16576 - 17337 468 253 aa, chain + ## HITS:1 COG:no KEGG:BF0555 NR:ns ## KEGG: BF0555 # Name: not_defined # Def: RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 245 1 245 253 444 97.0 1e-123 MFDNEHLTYICGHYKRRASKTYDSRQHPRERSKTEILTTINPLLLFFSYLAALLDHTEKY PGTQYTRMDDLNINSFNALYTLYYRKSFLFAKSYVHDEQVAEDIAAEALIKLWEKLKTDI INSPQAMLLTILKNKSLDYLRLEQNKLNAMSELSELYVRELDIRVSSLEACDPSEIFSEE VNQIIQATLRTLPEQTRRVFKMSRFENKMNKEIAENLGITVKGVEYHISRALKEFRISLK DYLPLFYFFFYFH >gi|226332041|gb|ACIB01000015.1| GENE 13 17432 - 18361 747 309 aa, chain + ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 13 280 27 298 331 84 28.0 2e-16 MNQEILNRYLTGDASAEEKQMVVRWLDTDPQHMREYLALRKLHDITLWQEKPAATVDKKK 
RLTLPYIREFIKIAAIFLIAVTSVYFLAPERGRDTPDLKAIHVPSGQRAELTLGDGTRVC LNSNTTLTFPDHFDRKERRVTLDGEGYFQVAKNEKKPFIVQAKEYEVRVLGTEFNVMMYK DQDFFETVLLKGSVEVNETTTGRKVKLQPNERLFGKAGQLRKESISEWNRALWMQGILYF DNTRMDEIIRQLGLYYDVKFILEKESLANVRFTGKFRIRDGVEHVLKVLQLKCKFTYQRD EDSNTITIN >gi|226332041|gb|ACIB01000015.1| GENE 14 18570 - 21740 2824 1056 aa, chain + ## HITS:1 COG:no KEGG:BF0553 NR:ns ## KEGG: BF0553 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1056 1 1056 1056 2031 99.0 0 MKMIVLFLFVFISGVFAGNANSQETKVSISKNNKPIREILGEIERQTDYLFVYSEKEVDV NQRKTVNVSQQRVADVLSSLFRSTNVGYAMEGHNIMLMAKTTQTDAAQQKRHITGVVKDI KGETIIGANIMIKGTGTGVSTNIDGEFSIEAAAGDELIVSFIGYLTQTIKIDSQKTLNIK LLEDTKTLEEVVVVGYTVQTKSAVTGSVAVVKADKLKDVNTLEVGSMLQGKVSGVYVSGS SGEPGQASKIRIRGKGTLNSSVSPLWVVDGVIVGEDPGLNPNEIDNISVLKDGSATALYG SRAANGVIVVTTKRGEYDANKYSVSVNAGVSLLSTGRLEMMNSQELYDYQKSWNNQSWFT EELLKHNTDWFKEASKPGLYTNANITYTGSSGRMRSFVMADYYREEGAIKDFTLDRFTFR SNNDVKFTDRFTMSTKISGSLSRTDSQQRSVYNTYLYLPWDFPYNEDGSIRSGQEQDWRG RDGINDMYDLQWNWSRSKKLTVDGTINFNYQITDWLRFESNNYIRYISNRSESYTDKRSR SGQSDKGSLSNSNSLLTKQFTNQMIRFEKSFGKHKVNALGAYEYTRHFYESTSAEGRGIQ PGREILDVTTGIKSIGGYKDAIATQSALFNANYDYDNRYMGQVSYRMDESSCFGKNNRMG HFFTVSGGWNIQNETFFESLRESVNQLKVRVSYGSLGNTPGAYYGHYPLYSSMMYNDEVA YFPSQMGNADLSWEKCYTTNIGIDARFFDRFGVTIDLYNKNTSDLLYYAPLPNISGYTGQ YKNVGAINNKGLEISLNADVIRTSKFQWTSDFNIGFNRNRVTELYGGKPELKGLKRLEEG RDMDEWYLREWAGVDPANGSPLWYTTDENGKRTTTDSYNKADRVYCGSAAPKFTGGWMNS FSYKGFTLTANFDFVYGNLLYNQSRELLDSDGAYADYNSMKLKSGWKRWEKEGDIATHPK AINGGNKNSNKSSSRYLEKGNYFSLRNLSLGYSIPEKLCGKLGLQQVNVSCSADNLFTLT PFSGVSPQLSDSSTDGYAGTIYPLSRRIVFGLNVSF >gi|226332041|gb|ACIB01000015.1| GENE 15 21757 - 23472 1683 571 aa, chain + ## HITS:1 COG:no KEGG:BF0552 NR:ns ## KEGG: BF0552 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 571 11 581 581 1130 99.0 0 MKIKHIVSCFFVSLIGLSACSIEELPYNQLTEDELDGSYESLLSATRGNYAVFKQTAFHQ GWHYAGELASDNVSLSGVSSDALMYIYNYQRITDNYHMSNMWGWAYRSIINSNKILEKAQ 
EGESKEMDQLIGENYFLRGWLEFVLVNVFGRPYNQSPETNLGIPLKLTADINDYPMRSTV KESYEQILKDLKKAETLLNSESNIYAGPSAAKALLSRVYLYMGNNKLAAEYATEVIESSG RTLLEGEAYATANVLVPEDNPEIIFAIRCTKDKDDYGWNSIGGFYANIDGVGWGELYASE PLRDAYAEYPEDLRSRYIVPQYLKDDETGEYRKEFIYIESSEEDGVPRKYYRWNEIIEEN GNYRIKDAYLSKYEYKDTLTMKQDAGGYYVESRLKSGKDNPTPGTYEKHYVTIQNLMAKR NDYPKYYVYKCSKQENQPQLWSPTVLRLGEMYLNRAEAYAKEPALGDALADLNVIRTRAH IPALSAGDMKPGKTMLEYVLEERRKELAFEGHRRFDIFRNGLTMNRTYPGTHDRGAATSV RLTISADDPAAIEFIPQREIDSYPGVLEQNP >gi|226332041|gb|ACIB01000015.1| GENE 16 23580 - 25817 1628 745 aa, chain + ## HITS:1 COG:SP0312 KEGG:ns NR:ns ## COG: SP0312 COG1501 # Protein_GI_number: 15900245 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-glucosidases, family 31 of glycosyl hydrolases # Organism: Streptococcus pneumoniae TIGR4 # 90 705 8 597 679 348 33.0 3e-95 MRNIQRTILWIAGLLFCLPSSSSNPVVIGNSRFTFITDHLVRMEYAQQGKFLNDSTLFAV DRTPRCTEVKVERKEGNRYIMTTPAMRVEYYNDGFPFGQTNLFVYFRNGDSPKEKRWYIA SRQSRNLLGAVTTLDDVEGPIDRQEGLLSRDGWYLINDTGKEVLKNGWVATRDRNHVQDL YLFVYGNDYKAALKSLQAVSGPSPMTRKYVHGSWYCRWWNYTDEDYRQLVREYREHDFPL DIMVFDMGWHTQNAKVGTGHAGTRGWTGYSWNRKLIPEPEKLIKDFKDDHIYVVLNEHPH DGIRPHEDSYQAFVRDLGVDTQQTGVPLFDAGDRDYMNAFMKHAHQESDSMGVAFWWLDW QQDYLYPLVRGTNMKHLPWMNHIYYNYSSGNHLRGAGFSRWAGWGDHRHPIQFSGDAVGN WDLLRFEVDLTTTSGNAGCFFWAHDLGGFYDGTDPELYTRWTQFGLLNSSLRIHSVYDEK LDRRPWLWGVEAEKAMHRIYHLRSQLMPYIYSSVRQCHTDMLPLNRGMYIEYPDEEKAYQ YPGQFLFGDLLLGAPITAKGEGEKKIATQEVWLPGGTDWYNFFTGERQEGGQVIKTKSPL EQFPLFIKGGCPLPMQPYTERMCSTPLTELIVRCYPGKEGANNTYILYEDDGLTQDYLQG KYATTRLNYQKSGGQTIITVSPVEGTYEGQPRKRAYRIELPGIPVQARVSVNGKKARTTP NQELNGVIVPIKVMDIHKPIVIKIQ >gi|226332041|gb|ACIB01000015.1| GENE 17 25931 - 28099 1560 722 aa, chain + ## HITS:1 COG:BH2223 KEGG:ns NR:ns ## COG: BH2223 COG3345 # Protein_GI_number: 15614786 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-galactosidase # Organism: Bacillus halodurans # 25 721 13 729 748 361 32.0 3e-99 MLKRTFILIGLVLSFCSLPAQELIQITTRNTALVFRVANQSLRQVYYGPRLADTDVLQKQ 
GNNFPAYSTYGMGEQNEVALHAVHADGNTSTLLNFENVKQESPEPGITLTTISLKDPLYP FQVKLFYKAYEESDLIEQWTIYQHTEKKPVTLYQFASAQLSFKSSSYRLTHFAGDWAGEC NMSEVELTEGIKVIDSKLGTRATFFAHPMCLLSLNGRMTEDNGEVIGMALAWPANFKLEF EKNNNQELRVLAGMNPYASHYKLKKGDVFQTPSFLYTYSTKGNGQVSRNFHRWARKYGLR HGENSRYTLMNNWEATYFNFNEPKLKSIIEDAAGMGFELFLLDDGWFGQKHPRNNDDAGL GDWVVNKEKLPNGLGWLVKQCTDNDIKFGIWVEPEMVNPQSELFDKHPDWVIQQLGREHI LFRRQLVLDLSNPEVQEFVYKSVHDILKDNPQIAFVKWDCNRAVTNPGSTYLPADEQSHI WIEYGRGLLNVFKKVRDSHPDVHFMLCSGGGGRLDYGSLRYFEEYWPSDNTDALQRILIQ WGNSQFFPSIAMCCHVSASPNHQTGRTTPLKFRFDVAMQGALGMDLQPSTMNEKEVIFAK EAIKTYESIRNIVFTGDLYRILSPYEGNRTSMMYVLPDKSRAVFYAYQLKSHIGEVSAPM RFKGLIPDKKYNVKELNIYPGSRAATGSANGQSFSGDFLMNQGLPIGLSGDYSSAVIELE QQ >gi|226332041|gb|ACIB01000015.1| GENE 18 28273 - 28833 517 186 aa, chain - ## HITS:1 COG:no KEGG:BF0549 NR:ns ## KEGG: BF0549 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 186 1 186 186 352 99.0 5e-96 MRIKLLFLISILFCTGSYAQETVTEPDFIGEVLVLNPDNSTTPLEKATVKIKTKANASIY LVGMGKVKTKINVDGPSAQVRLHQGDDFKLIVRAVDNNTDPMSIINIFQFETGKKVRKAE LSSLSTFGGASSNNLELLPYTAKKYGESSYLITLKEKPVGEYGITVRNPNSLDEKNIIVA SFGIDQ >gi|226332041|gb|ACIB01000015.1| GENE 19 28912 - 30210 593 432 aa, chain - ## HITS:1 COG:CAC3034 KEGG:ns NR:ns ## COG: CAC3034 COG0249 # Protein_GI_number: 15896285 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS family) # Organism: Clostridium acetobutylicum # 236 424 403 591 598 109 35.0 1e-23 MKKLRTDRQTLNDLGIVESTYGEKTLFSLFDMTESDGGKRCLEEWLVHPLSDWKALHERQ EAIRYPDFPEIRICREELDFIEFYLSQGDRPTRVSYLESAFSYIFRHFRATPERYVIRRG TKLLENVLLELKTFADHVTELSPRLIRNIAATISEIYQATELGKMVDSCGQEDSFYRTDR LDYIFRYRRNHTIAALLSIVYQLDAIRTVHRTAVMKGWCFPSFTNDSKFMLCNFYHPQVK DAVANDWEMENGNICIFTGSNMAGKSTTLKAIASAVWLAHAGFPVPASSMVCPMFDGIFT SINLPDSLRDGRSHFYAEVLRVKEVLEQINKGHHCFVLFDELFRGTNARDAFEASVAVAE VLKAKAYSRFLISTHIIELARKLDGDDACCFYYLESAIVDDELICNHKVKPGISESRVGY WIVKKELAGFEK >gi|226332041|gb|ACIB01000015.1| GENE 20 30303 - 32225 1365 640 aa, chain + ## HITS:1 COG:alr5324 
KEGG:ns NR:ns ## COG: alr5324 COG0744 # Protein_GI_number: 17232816 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane carboxypeptidase (penicillin-binding protein) # Organism: Nostoc sp. PCC 7120 # 440 602 97 257 643 126 39.0 1e-28 MGKYRTKGSIALIITGSVLILVLAGLYLGRNGILCRTADKRILYAEQKYGLSICYEDLRM KGLNEIELKNLSIVPRNRDTLLTLHTLNMHLNFWKLIRGKIEVRNVTVDQLKASFIKADS MANYDFLFLKRKRETSSGQVQTDYAHRINRILNLFYGFLPENGTLCQIDIAERKDRNFVN IHIPKLTVRQNHFRSDIEVHEDSTTQHWTACGKVNRSSHTLKAELYAQQNNKIILPYLKR RFDADIRLDTLTYSLTKSEKNGGQVQLTGQAAVSGLEVYHKALSPETVNLDRGQVSYRIL VGKESAELDSTTVIRFNQMQFHPYLKAEKKKQQWHFTAAVDKPWFPADQLFGSLPKGLFS NLEGIKTSGELAYHFFLDANFALLDSLKLESELKERNFCIVNYGVTDLGKMSEEFIYTAY ENGQPVRTFPIGPSWEHFTPLDSISPLLQMSVMQSEDGAFYFHRGFLPEAMREALIQDLK VKRFARGGSTITMQLVKNVFLNRNKNIARKLEEALIVWLIETERLTSKERMYEVYLNIVE WGPLVYGVQEAATYYFKKRPSQLTAEESIFLASIIPKPKHFRNSFNNDMQLKESLEGYYR LITERLVKKGIISEVAADSIRPEINVTGEAKKDLQRDSIQ >gi|226332041|gb|ACIB01000015.1| GENE 21 32345 - 33208 468 287 aa, chain - ## HITS:1 COG:no KEGG:BF0494 NR:ns ## KEGG: BF0494 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 287 1 287 287 441 99.0 1e-122 MKRILIFFLVIGITAISSVSMAAMSNSRIRKETRFLTDKMAYELNLSTGQYNDVYEINYD FIYSIRYLMDDVIRGEEWALDKYYRTLDIRNDDLRWVLTASQYRRFIGVDYFYRPVYASG GSWSFRIYIRYTNHNHFYFGKPYHYNSYCGGHYRTHYHNSYYRGRYRHDFYSGSHSIRDH RNYNTHRRSDFGSVTIRSNSGRRDEVRRGVSQRESSASRDNNRVTPGNATRTGRETRSTE NNRRTNTVRKSENISTPHRVTPERTDSKRSNSRSTSSRSAERGNRER >gi|226332041|gb|ACIB01000015.1| GENE 22 33218 - 33886 495 222 aa, chain - ## HITS:1 COG:no KEGG:BF0544 NR:ns ## KEGG: BF0544 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 222 1 222 222 448 100.0 1e-125 MKKIHVSAILILLVVMSSCAGLILNFKNSQLMSIQKGMTQQEVKAILGKPNYRRFDGAME EWEYRGYLSKAGHSVICVNFIDNRVVGLDSFRDGAPTAPPAPSFSLGIGGTVTASDIAPA CDYRAMRNDEFARFLNDVKSKTFDSDRTDFIEKATRSTGFTSEQCCRLIKLYSFDDDRTK VLKILYPSVVDKDNFSAAIDGLDFLSNQDTVKNFVRNYNRIK >gi|226332041|gb|ACIB01000015.1| GENE 23 33901 - 34554 
442 217 aa, chain - ## HITS:1 COG:no KEGG:BF0543 NR:ns ## KEGG: BF0543 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 217 1 217 217 444 100.0 1e-123 MRKVIITLCFLFVAFVAQAGRISGINIQSSGEAILVFVDGEQICTPTETCFIANYSGRHR IEVYAVRYIPRTGQSVKGDLLFQEWVSNPGMNIRDIRVGYNDRPDFCPDRPVRPGYDVVM NRTEFDRFLRTVKDKHFDSDRNKLIETTLVSTGFTSDQCLQLVNLFSFDSEKIKLMQAMY PRIVDKPNFYLVIESLTFQSDKNKMNEFVRKYHNQRN >gi|226332041|gb|ACIB01000015.1| GENE 24 34657 - 35496 221 279 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|212640476|ref|YP_002316996.1| Uncharacterized protein conserved in bacteria containing two ribosomal protein S1-like RNA-binding domains [Anoxybacillus flavithermus WK1] # 5 273 2 275 285 89 26 5e-17 MSIELGKFNQLEVVKQVDFGMYLDGGEEGEILLPTRYVPEDCKLGDWLNVFLYLDNEERL IATTLTPLVQVGEFACLEVSWVNQFGAFLNWGLMKDLFVPFSEQKMKMQVGNKYVIHAHI DDESFRIVASAKVDRYLSKEKASYQPGEEVNILIWQKTDLGFKAIIENMYSGLLYDSEIF QTLHTGDVLKAYVKQVREDGKIDLILQKPGFEKIDDFSKTLHRYITEHGGWIGLTDKSPA EEIYDTFGVSKKTFKKAVGDLYKKRLILLHEDGIELVRP >gi|226332041|gb|ACIB01000015.1| GENE 25 35717 - 36718 1150 333 aa, chain + ## HITS:1 COG:TVN1097 KEGG:ns NR:ns ## COG: TVN1097 COG0039 # Protein_GI_number: 13541928 # Func_class: C Energy production and conversion # Function: Malate/lactate dehydrogenases # Organism: Thermoplasma volcanium # 4 304 1 307 325 110 29.0 4e-24 MEFLTNEKLTIVGAAGMIGSNMAQTALMMKLTPNICLYDPYAPALEGVAEELYHCAFEGV NLTYTSDIKEALSGAKYIVSSGGAARKAGMTREDLLKGNAEIAAQFGKDIRQYCPDVKHV VVVFNPADITGLIVLLYAGLKPSQVSTLAALDSTRLQNELVKYLHIPASEIVNCRTYGGH GEQMAVFASTTKVQGEALTKIIDTPRMPMQDWEDLKVRVIQGGKHIIDLRGRSSFQSPAY LSIEMIAAAMGGQPFRWPAGTYVSDKKFDHILMAMETSITKEGVSYKEIQGTPEEQKEME ESYAHLCKLRDEVIAMGILPEINKWHELNKHIN >gi|226332041|gb|ACIB01000015.1| GENE 26 36810 - 37547 809 245 aa, chain - ## HITS:1 COG:no KEGG:BF0540 NR:ns ## KEGG: BF0540 # Name: not_defined # Def: putative potassium channel subunit # Organism: B.fragilis # Pathway: not_defined # 1 245 1 245 245 422 100.0 1e-117 
MKMALSDFALRKKGIYGILHVIILLLSLFLVISISIDTFKGIPFYTQSVYMKVQLWICVL FLFDFILELFLSKNKWHYLSTHFIFLLVAIPYQNIISYMGWTFSPEVTYMIRFVPLVRGG YAMAIVVGWLTYNKASGLFVSYLTMLLATVYFSSLAFFVLEHKVNPLVTGYGDALWWAFM DVTTVGSNIIAVTVTGRVLSVLLAALGMMMFPIFTVYVTSLIQKKNKEKEEYYKQLEAAD ESKPK >gi|226332041|gb|ACIB01000015.1| GENE 27 37764 - 39362 1404 532 aa, chain + ## HITS:1 COG:BMEII0909 KEGG:ns NR:ns ## COG: BMEII0909 COG0531 # Protein_GI_number: 17989254 # Func_class: E Amino acid transport and metabolism # Function: Amino acid transporters # Organism: Brucella melitensis # 9 498 23 510 510 501 56.0 1e-141 MANIKQAVKLGVFTLAIMNVTAVVSLRGLPAEAVYGMSSAFYYLFAAIVFLIPTSLVAAE LAAMFQDKQGGVFRWVGEAYGKKLGFLAIWVQWIESTIWYPTVLTFGAVSIAFIGMNDTH DMTLASNKYYTLAVVLIIYWLATFISLKGMGWVGKVAKIGGMVGTIIPAALLIILGIVYL ASGGHSNLDFHSSFFPDLTNFDNVVLAASIFLFYAGMEMGGIHVKDMQNPSKNYPKAVFI GALITVIIFVLGTFSLGIIIPAKDISLTQSLLVGFDNYFRYIHASWLSPIIAIALAFGVL AGVLTWVAGPSKGIFAVGKAGYMPPFFQKTNKLGVQKNILFVQGGAVTVLSLLFVVMPSV QSFYQILSQLTVILYLVMYLLMFSGAIYLRYNMKKANRPFRIGKKGNGLMWIVGGLGFLG SLLAFILSFIPPSQISTGSNTVWFSVLIIGALVVVIAPFIIYAAKKPSWADPNSTFEPFH WETQAKPQVAPATATTAGPATSSTTTVGSTTSAPSTGSGSVSSDKDTPQKQS >gi|226332041|gb|ACIB01000015.1| GENE 28 39455 - 40690 878 411 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 150 409 44 310 328 176 37.0 7e-44 MMIYIIFSVFIIIILFICARYWYLWRKISVQKNEWVAQTKESDTILRSMNACFILINSDL VVIRTNYYDLSGISEEPASSGRVGDLLNCKNAVRSGGGCGAHKNCENCMIRHTIENAFCH KKGFHKLEASMRLLSSDHQQIIPCDVSVSGTYLNNEGHEQMLLTVYDITELKNMQRLLNI EKENAVSAEKLKSAFIANMSHEIRTPLNAIVGFSGLLASADDDTEKKMYLDIVAENNDRL LQIVTDVLDLSKIESGSLDFHYSEFDVNDLLCGLHGILNIRLKDKPEIKLNCEAGTDEWI IYSEQHRIVQIITNLVHNAMKFTHSGEICFGCRPQGEDEIYFYVSDTGIGIPAGEQDKIF DRFTKLDHEVPGTGLGLTLSQTIVQNLGGEMGVESEVTKGSTFWFTLPLKS >gi|226332041|gb|ACIB01000015.1| GENE 29 41614 - 42087 506 157 aa, chain + ## HITS:1 COG:BS_ahrC KEGG:ns NR:ns ## COG: BS_ahrC 
COG1438 # Protein_GI_number: 16079481 # Func_class: K Transcription # Function: Arginine repressor # Organism: Bacillus subtilis # 4 149 3 145 149 96 36.0 2e-20 MKKKANRLDAIKMIISSKEVGSQEELLQELGQEGFELTQATLSRDLKQLKVAKAASMNGK YVYVLPNDIMYKRVGDQSASEMLMNNGFISLQFSGNIAVIKTRPGYASSMAYDIDNRESD TILGTIAGDDTIMLVLREGATPTAVRHFLSLIIPNIN >gi|226332041|gb|ACIB01000015.1| GENE 30 42121 - 42699 475 192 aa, chain + ## HITS:1 COG:no KEGG:BF0534 NR:ns ## KEGG: BF0534 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 192 1 192 192 406 100.0 1e-112 MDTQQIDVMVADASHEVYVDTILETIRNAAKVRGTGIAERTHEYVATKMKEGKAIIALCG DVFAGFTYIESWGNKQYVATSGLIVHPDFRGLGLAKRIKQASFQLARLRWPRAKIFSLTS GAAVMKMNTELGYVPVTFNELTDDEAFWKGCEGCINHEILMAKDRKFCICTAMLYDPTDP HNIKKEQERNNI >gi|226332041|gb|ACIB01000015.1| GENE 31 42713 - 43918 1457 401 aa, chain + ## HITS:1 COG:L126739 KEGG:ns NR:ns ## COG: L126739 COG0137 # Protein_GI_number: 15672106 # Func_class: E Amino acid transport and metabolism # Function: Argininosuccinate synthase # Organism: Lactococcus lactis # 1 391 1 391 398 196 33.0 5e-50 MEQKKKVVVAFSGGLDTSFTVMYLAKEKGYEVYAACANTGGFSEEQLKQNEENAYKLGAV KYVTLDVTQEYYEKSLKYMIFGNVLRNGTYPISVSSERIFQALAIARYAKEIGAEAIAHG STGAGNDQIRFDMTFLVMTPGVEIITLTRDMALSRQEEIDYLNKNGFEADFTKLKYSYNV GLWGTSICGGEILDSAQGLPETAYLKQVTKEGSELLRLEFKNGELHAVNGEVFEDKIAAI QKVEEIGAAYGIGRDMHIGDTIIGIKGRVGFEAAAPMLIIGAHRFLEKYTLSKWQQYWKD QVANWYGMFLHESQYLEPVMRDIEAMLQESQRNVNGTAILELRPLSFSTVGVESQDDLVK TKFGEYGEMQKGWTAEDAKGFIKVTSTPLRVYYANHKDEEV >gi|226332041|gb|ACIB01000015.1| GENE 32 43915 - 44883 817 322 aa, chain + ## HITS:1 COG:AF2071 KEGG:ns NR:ns ## COG: AF2071 COG0002 # Protein_GI_number: 11499653 # Func_class: E Amino acid transport and metabolism # Function: Acetylglutamate semialdehyde dehydrogenase # Organism: Archaeoglobus fulgidus # 2 319 1 329 332 215 39.0 8e-56 MIKAGIIGGAGYTAGELIRLLINHPETEIVFINSTSNAGNKITDVHEGLYGECDLTFTDE LPLEDIDVLFFCTAHGDTKKFMESHNIPEELKIIDLSMDYRIASPDHDFIYGLPELNRRA 
TCTAKHVANPGCFATCIQLGLLPLAKHLMLNEDVMVNAITGSTGAGVKPGATSHFSWRNN NMSVYKAFEHQHIPEIKQSLKQLQNSFDAEIDFIPYRGDFPRGIFATLVVKTKVALEEIV RMYEEYYAKDSFVHIVDKNIDLKQVVNTNKCLIHLEKHGDKLLIISCIDNLLKGASGQAV HNMNLMFNLEETVGLRLKPSAF >gi|226332041|gb|ACIB01000015.1| GENE 33 44893 - 46017 944 374 aa, chain + ## HITS:1 COG:BS_argD KEGG:ns NR:ns ## COG: BS_argD COG4992 # Protein_GI_number: 16078187 # Func_class: E Amino acid transport and metabolism # Function: Ornithine/acetylornithine aminotransferase # Organism: Bacillus subtilis # 2 374 3 378 385 256 40.0 6e-68 MNLFDVYPLFDINIIKGKGCHVWDENGTEYLDLYGGHAVISIGHAHPHYVDMISKQVATL GFYSNSVINKLQQQVAERLGKISGYEDYSLFLINSGAEANENALKLASFHNGRTKVISFG KAFHGRTSLAVEATDNPKIIAPINANGHITYLPLNDIEAAKAELAKEDICAVIIEGIQGV GGIKIPTPEFLQELRKACTEHGTILILDEIQSGYGRSGKFFAHQYAGIKPDIITVAKGIG NGFPMAGVLISPMFTPVYGMLGTTFGGNHLACSAALAVMDVIEQENLVENAANIGSYLLE ELKKFKEIKEVRGCGLMIGMEFDQPVKEIRSRLIHEQKVFTGASGTNVIRLLPPLCLSKE EADEFLARLRKVLG >gi|226332041|gb|ACIB01000015.1| GENE 34 46130 - 46903 787 257 aa, chain + ## HITS:1 COG:lin0414 KEGG:ns NR:ns ## COG: lin0414 COG0345 # Protein_GI_number: 16799491 # Func_class: E Amino acid transport and metabolism # Function: Pyrroline-5-carboxylate reductase # Organism: Listeria innocua # 2 256 3 259 266 138 35.0 1e-32 MKVAIIGAGNMGGSIACGLAKGKLIPASDIIVSNPSIGKLEALKKEFPSIAITRNNAEAA TGADIVILAVKPWLIRGVLREMKLRSKQILVSVAAGISFEQLAHDVVEPEMPMFRIVPNT AISELQSMTLIASRNAGQELEALMVNLFSEMGMAMILPEDKLEAATALTSCGIAYVLKYI QAAMQAGIEMGIRPSDAMDMIAQSVKGAAELILNNDTHPSVEIDKVTTPGGITIKGINEL EHNGFTSAIIKAMKASR >gi|226332041|gb|ACIB01000015.1| GENE 35 46963 - 47517 770 184 aa, chain + ## HITS:1 COG:MTH700 KEGG:ns NR:ns ## COG: MTH700 COG1396 # Protein_GI_number: 15678727 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Methanothermobacter thermautotrophicus # 1 184 1 182 182 168 48.0 5e-42 MDDQIKQIAERLRGLRDVLELTAEDIARDCEISAEEYRLAETGDYDISVSMLQKIARKYG IALDALMFGEEPKMSSYFLTRAGKGTSIERTKAYKYQSLAAGFMNRNADPFIVTVEPKPD 
IEPIHYNSHSGQEFNLVLEGRMMISIDGKDLILNEGDSLYFNSKLPHGMKALDGKTVRFL AVIM >gi|226332041|gb|ACIB01000015.1| GENE 36 47535 - 49190 1434 551 aa, chain + ## HITS:1 COG:MA2912 KEGG:ns NR:ns ## COG: MA2912 COG0365 # Protein_GI_number: 20091733 # Func_class: I Lipid transport and metabolism # Function: Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases # Organism: Methanosarcina acetivorans str.C2A # 1 550 7 558 560 696 58.0 0 MIDRFLSQTSFSSQEDFVKNLKIHVPDNFNFGYDIVDAWAAEQPDKPALLWTNDKGEHHQ FSFADMKQYTDRTASYFQSLGIGHGDMVMLILKRRYEFWFSIIALHKLGAVVIPATHLLT KKDIVYRCNAADIKMIVAAGEEVVTKHIIDAMPDSPTVKHLVSVGPEIPEGFDDFHQGIE HAAPFVKPEHPNTNDDISLMYFTSGTTGEPKMVAHDFTYPLGHIVTGSFWHNLKENSLHL TIADTGWGKAVWGKLYGQWIAGANVFVYDHEKFTPADILEKIQNYHVTSLCAPPTIFRFL IHEDLTKYDLSSLEYCTIAGEALNPAVFDTFKKLTGIKLMEGFGQTETTLTVATFPWMEP KPGSMGVPNPQYNVDLIDYEGRSVEAGEQGQIVIRTDKGKPLGLFKEYYRDASRTHEAWH DGIYYTGDVAWKDEDGYLWFVGRADDVIKSSGYRIGPFEVESALMTHPAVVECAITGVPD EIRGQVVKATIVLAKEYRERKGEDLVKELQNHVKKVTAPYKYPRVIEFVDELPKTISGKI RRVEIRKNDEK >gi|226332041|gb|ACIB01000015.1| GENE 37 49339 - 49539 133 66 aa, chain + ## HITS:1 COG:no KEGG:BF0527 NR:ns ## KEGG: BF0527 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 66 1 66 66 105 98.0 3e-22 MIVSRKISQIEVFAGSPWEVASVKSLLKAASIEVAMKDKGIGSILLSVPCEYYTAAMRVI GGRTTS >gi|226332041|gb|ACIB01000015.1| GENE 38 49679 - 51994 1508 771 aa, chain - ## HITS:1 COG:no KEGG:BF0526 NR:ns ## KEGG: BF0526 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 771 1 771 771 1589 99.0 0 MKLKFLFLFLSYSLSIHSQNNLLYADSSKDSLFFEQKKALLTATWHKPFSYSSIQSNHAP LGPYMGNGDVGVVAFTSDNSQTLKISKVDFVTDGWTDWAGSGPAALPIGGVNITVNSPVY SGFVTVNRADPSGFSYQMDQLNSELRMTTATAQQVKMVSWMGVNENMIITELTTSSKTPV PISVDTYADNQSASYTTTAQVNGQIAQVTRQTKTDAVRWISCAGISTKIVGVMSKPECLS ESMVRSNFQLTASDTVLVVVYVSGGGKGNDPQLPTAYNKLLTLNKADVTQLKMAKKAWWK DMWTRSYVETNDELLNRHYLSSIYLLASAYNEHSPVSGGMYGVWNMDDKMMYHGDIHLNY NSQAGFYSAFSSNRPEIALPFYKTIELLIPEGRRRAKEEMGIMHPSWEGKSCRGILFPVG 
ALGIGVFYNYYWQQTMNAPFNVPLFSWYYEYTGDLNFLRYRAYPYIRLCGDFYEDYMQKE TYGKSYRYTITTGGHEDSWDLNPPSDLAFVKQTFGLLVRYSKLLGVDQKRRKKWNDILSH LPEYKVIMPTKTPNQGLPVYAKNEAGWDLPSHAIQLHAAYPCEILNLHSDSTALQIARNT LYYYEVSQKGFTNTMNELGLSAFVMGARIRFDPDLLLENMKTLIKTAGTNFLIIDGHHCT EKTAVIETVNSMMLQTVEGVIYLFPCWTQTPAAFTRLRAKGAFLVSADYDGTSVGGLKIF SEKGGICRLSNPWRGRKLRVTENGKPVSVKEQNNVCSFITRKGSTYTIVGL >gi|226332041|gb|ACIB01000015.1| GENE 39 52238 - 52348 74 36 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLYRDGNWDKVILIKIIFFLIPIHKLRLGLRITLLM >gi|226332041|gb|ACIB01000015.1| GENE 40 53233 - 54576 1119 447 aa, chain - ## HITS:1 COG:XF1003 KEGG:ns NR:ns ## COG: XF1003 COG0165 # Protein_GI_number: 15837605 # Func_class: E Amino acid transport and metabolism # Function: Argininosuccinate lyase # Organism: Xylella fastidiosa 9a5c # 1 335 6 340 445 250 38.0 4e-66 MAQKLWEKSVEVNKDIERFTVGRDREMDLYLAKHDVLGSMAHITMLESIGLLTKEELAQL LTELKDIYASTERGEFVIEEGVEDVHSQVELMLTRRLGDVGKKIHSGRSRNDQVLLDLKL FTRTQIREVAEAVEQLFHVLIRQSERYKNVLMPGYTHLQIAMPSSFGLWFGAYAESLVDD MLFLQAAFKMCNKNPLGSAAGYGSSFPLNRTMTTELLGFDSLNYNVVYAQMGRGKMERNV AFALATLAGTISKLAFDACIFNSQNFGFVKLPDECTTGSSIMPHKKNPDVFELTRAKCNK LQSLPQQIMMIANNLPSGYFRDLQIIKEVFLPAFQELKDCLQMTTYIMNEIKVNEHILDD DKYLFIFSVEEVNRLAREGMPFRDAYKKVGLDIEAGHFSHDKQVHHTHEGSIGNLCNDEI SALMQRIIEGFNFQGMEQAEKTLLGRK >gi|226332041|gb|ACIB01000015.1| GENE 41 54582 - 54992 468 136 aa, chain - ## HITS:1 COG:no KEGG:BF0511 NR:ns ## KEGG: BF0511 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 136 1 136 136 246 100.0 2e-64 MTKFESSVKVIPYSQERVYEKLADLSNLEAIKDRLPEDKVKNMSFDTDTLSFNVDPVGQL TLRIIEREPSKCIKFETTNSPLPFNMWIQLVAVSEEECKLKVTIGLEINPFMKAMVQKPL NEGLEKMADMLSMIQY >gi|226332041|gb|ACIB01000015.1| GENE 42 55092 - 55730 661 212 aa, chain - ## HITS:1 COG:lin1945 KEGG:ns NR:ns ## COG: lin1945 COG0461 # Protein_GI_number: 16801011 # Func_class: F Nucleotide transport and metabolism # Function: Orotate phosphoribosyltransferase # Organism: Listeria innocua # 3 208 2 207 209 227 53.0 2e-59 
MKNLERLFAEKLLKIKAIKLQPANPFTWASGWKSPFYCDNRKTLSYPSLRSFVKFEITRL VLERFGQVDAIAGVATGAIPQGALVADALNLPFVYVRSTPKDHGLENLIEGELRPGMKVV VVEDLISTGGSSLKAVEAIRRDGCEVIGMVAAYTYGFPVAEQAFKDAKVPLVTLTNYEAV LDVALRTGYIEEEDIATLNEWRKDPAHWETGK >gi|226332041|gb|ACIB01000015.1| GENE 43 55826 - 56290 491 154 aa, chain - ## HITS:1 COG:no KEGG:BF0509 NR:ns ## KEGG: BF0509 # Name: not_defined # Def: putative regulatory protein # Organism: B.fragilis # Pathway: not_defined # 1 154 1 154 154 284 100.0 7e-76 MNTITEEEALNRMAAYCSAAEHCKAEVNEKLQKWGLPYEVINRIIDRLVVEKFIDEERYC RAFVNDKFRFAKWGKMKITQALYMKKIPREVTYRYLNDIDREEYLAILGDLIAAKRKSIH AKDEFELNGKLIRFAMSRGFEMDDIRRCVQVEEE >gi|226332041|gb|ACIB01000015.1| GENE 44 56287 - 57123 383 278 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|225874212|ref|YP_002755671.1| ribosomal protein L11 methyltransferase [Acidobacterium capsulatum ATCC 51196] # 1 278 16 290 294 152 37 8e-36 MNHVTTYIRQALHDIYPPGELRSLTKIICCDLLGQDAIDYYLGKDITLSVNEQCDLESIV ERLKKNEPIQYIQGETCFYGSMFRVAPGVLIPRPETEELVDLIVKEAATGTRLLDIGTGS GCIAISLAKHIPQAVVTAWDVSEEALAIAGENNRELKAGVHFEKMDVLSAEPVGDDQYDM IVSNPPYVTESEKNEMEPNVLDWEPRLALFVPDNDPLRFYRRIASLGRKMLRLHGRLYFE INRAYGEEVLQMLHEQGYEELRLIKDISGNDRIVTAKR >gi|226332041|gb|ACIB01000015.1| GENE 45 57177 - 58217 364 346 aa, chain + ## HITS:1 COG:BH1554_1 KEGG:ns NR:ns ## COG: BH1554_1 COG0117 # Protein_GI_number: 15614117 # Func_class: H Coenzyme transport and metabolism # Function: Pyrimidine deaminase # Organism: Bacillus halodurans # 1 143 1 141 143 159 52.0 9e-39 MKEEKYMRRCIQLAKNGLCNVSPNPMVGAVIVCEGQIIGEGYHIRCGEAHAEVNAIRSVK DPSLLKHSTIYVSLEPCSHHGKTPPCADLIIEKQIPRIVIGCQDPFSKVAGKGIQKLRDA GCEVIVGVLEPECRELIRKFITFHTLHRPYIVLKWAESADGFIDLERTEGQPVILSTPLT SMLVHKKRAESDAIMVGTRTALLDNPALTVRNWYGHNPVRIVMDRNHSLPQTSHLSDNSV STLVFTEHPRSGKENLEYITLNYQTDILPQILSALYQRNLQSLMVEGGRILLESFIRSGI WDEVIIEKSDKLLYSGVKAPEISDKISYSEEKHFCTTFRHYLKRNT >gi|226332041|gb|ACIB01000015.1| GENE 46 58356 - 59714 861 452 aa, chain + ## HITS:1 COG:no KEGG:BF0451 NR:ns ## KEGG: BF0451 # Name: not_defined # Def: hypothetical 
protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 452 1 452 452 855 99.0 0 MRIKFLSVIVSFFLVSFAVTSCLDTEEIEYSPDATIHAFALDTIHGVNYKFTIDQLGPDG VGLIYNQDSLPVGSDTIIDRILIKTLTTTSGIITAKNAEGQDTLFNYSDSIDFRGTMQKP MRIKVWAADMQYTKEYTISVRVHQQDPDSMNWTKMTDNFANYSGYQKSVTLNEDLLIYTS NTTAYKSSGDVISKGRSWTPVSITGLPDNIKLSSIISFGGKLYATNGESAYVSSDGALWN VATDLNKNGKVEMLIAPFPKNEGNLLGISGIAGIINNGDQSTFAITNPEATAWNIGSETV GADFPLENLSATSYLTATGIQTIAVMGNNRNANDTTSIAWTSQDGLLWIPLKTSSSTAYC PKLDNPSFFYYDNAFLAFGGNFETIYTSEAGIAWYKANKKIFLPAEFKDRENNYSIVVDK NNFIWVIWSNGGANEVWRGRINKFGFKRQNNN >gi|226332041|gb|ACIB01000015.1| GENE 47 59719 - 60453 612 244 aa, chain + ## HITS:1 COG:SPy1965 KEGG:ns NR:ns ## COG: SPy1965 COG0020 # Protein_GI_number: 15675763 # Func_class: I Lipid transport and metabolism # Function: Undecaprenyl pyrophosphate synthase # Organism: Streptococcus pyogenes M1 GAS # 9 238 13 248 249 251 51.0 9e-67 MSYKEQIDLNRIPKHVAIIMDGNGRWAKLRGHERSFGHQAGAETVHIITEEAARLGIKFL TLYTFSTENWNRPSDEVAALMSLLFDSIEEETFMKNNISFRIIGDINKLPENVRERLNAC VEHTSKNTGMCLILALSYSARWEITEATRQIATLVQNGEMNPEEITSESIRTHLTTNFMP DPDLLIRTGGEVRLSNYLLWQCAYSELYFCETFWPDFKEEELCKAICDYQKRERRFGKTS EQIS >gi|226332041|gb|ACIB01000015.1| GENE 48 60477 - 63113 2751 878 aa, chain + ## HITS:1 COG:RSc1412 KEGG:ns NR:ns ## COG: RSc1412 COG4775 # Protein_GI_number: 17546131 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein/protective antigen OMA87 # Organism: Ralstonia solanacearum # 45 878 30 765 765 153 23.0 2e-36 MHYRISFIFVTFISLFCFALPGVAQDANTDESKPVIMYSGTPKKYEIADIKVEGVKNYED YVLIGLSGLSVGQTITVPGDEITGAIKRYWRHGLFSNVSITAEKIEGNKIWLKITLAQRP RISDIRYHGVKKSEREDLQTKLGMIKGSQITPNLVDRAKVLSKRYFDDKGFKNAEITITQ RDDPANENQVLVDIDIDKKEKVKVHAITFAGNHAIKTSKLKRVMKKTNEKGKLLNLFRTK KFVNEKYEEDKQLIIDKYNELGYRDARIVKDSVSKFDDKTVDVFIELEEGDKYYLRNVTW VGNTLYPSEQLNYMLRMKKGDVYNQKLLNERLSTDEDAIGNLYYNNGYLFYTLDPVEVNI DNDSIDLEMRIFEGRQASINKIKINGNDRLYENVVRRELRTRPGELFSREDLMRSMREIQ QMGHFDPENIKPDIQPDPVNGTVDIAYDLVSKANDQVEFSAGWGQTGVIGKLSLKFTNFS 
LANLLHPGENYRGILPQGDGQTLTISGQTNAQYYQQYSISFFDPWFGGKRPNSFSLSAFF SVQTDISSRYYNSSYYNNYYNSYYSGLGGYGMYNYGNYNNYENYYDPDKSIKMWGLSAGW GKRLNWPDDYFQLSAELSYQRYILKDWQYFPVTNGKCNDLSIGLTLARASYDNPIYPRSG SDFSLSVQFTPPYSLFDGVDYSKYNEYNQNDMNKMHKWVEYHKWKFKAKTYIPLLNPTVV KKTPVLMTRVEFGILGHYNKYKKSPFGTFDVGGDGMTGYSSYATESVALRGYENSSLTPY RGQEGYAYTRIGFELRYPLMLETSTNIYALTFLEAGNAWHDVKNFNPFDLKRSAGVGVRI FLPMIGMMGIDWAYGFDKVLGDKSAGGSRFHFVLGQEF >gi|226332041|gb|ACIB01000015.1| GENE 49 63138 - 63653 680 171 aa, chain + ## HITS:1 COG:no KEGG:BF0503 NR:ns ## KEGG: BF0503 # Name: not_defined # Def: cationic outer membrane protein precursor # Organism: B.fragilis # Pathway: not_defined # 1 171 1 171 171 272 100.0 3e-72 MKKSVLFIILLFAVGMTAQAQKFALIDMEYILKNIPAYERANEQLSQATKQWQGEVEVLA KEAQTMFKDYQAASAKLTAAQKTQKEDAIVEKEKAASELKRKYFGPEGELFKKREELMKP IQDEIYNAVKAVAEENGYAVVVDRASASSIIFATPRIDVSNEVLAKLGYSN >gi|226332041|gb|ACIB01000015.1| GENE 50 63711 - 64220 656 169 aa, chain + ## HITS:1 COG:no KEGG:BF0502 NR:ns ## KEGG: BF0502 # Name: not_defined # Def: putative outer membrane protein OmpH # Organism: B.fragilis # Pathway: not_defined # 1 169 1 169 169 268 100.0 7e-71 MLKKIALLMMLILPMGVFAQNLKFGHINAMEIVSAMPEYTKAQSELQALNKQLGQDLQRS QEEFSKKYQEFMQQKDSLPAVIAERRQKELEDMMQRQEQFQAKAQQDMEKANNDLMAPVY KKLDDAIKAVGAAEGVIYIFDMARTPIPYVNEAQSINLTPKVKTQLGIK >gi|226332041|gb|ACIB01000015.1| GENE 51 64323 - 65165 609 280 aa, chain + ## HITS:1 COG:lin1200 KEGG:ns NR:ns ## COG: lin1200 COG0796 # Protein_GI_number: 16800269 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glutamate racemase # Organism: Listeria innocua # 12 279 5 264 266 168 38.0 1e-41 MKQSLPYQPGPIGVFDSGYGGLTILSKIREALPEYDYIYLGDNARTPYGTRSFEIVYEFT LQAVNKLFEMGCHLVILACNTASAKALRTIQMNDLPNIDPDRRVLGVIRPTAECIGSMTQ TRHVGILATAGTIKSESYPLEVHKLFEDIKVSGEACPMWVPLVENNEASGEGADFFIRKY IDNLLAKDRQIDTLVLGCTHYPILLPKIQKFIPQGVKVVAQGEYVATSLKDYLHRHPEMD MKCTREGKCRFYTTEAEDKFIESASMFLNENITVQRITLE >gi|226332041|gb|ACIB01000015.1| GENE 52 65242 - 65478 313 78 aa, chain + ## HITS:1 COG:no KEGG:BF0500 
NR:ns ## KEGG: BF0500 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 78 1 78 78 139 100.0 4e-32 MKEDKDTRVVEVFTGSPWEAEFIKGLLESNGIESILKDGGGLAALAPYYIGQEIAVLVNE DDYENAMEIVRNREKANE >gi|226332041|gb|ACIB01000015.1| GENE 53 65592 - 66674 1070 360 aa, chain + ## HITS:1 COG:BS_proJ KEGG:ns NR:ns ## COG: BS_proJ COG0263 # Protein_GI_number: 16078908 # Func_class: E Amino acid transport and metabolism # Function: Glutamate 5-kinase # Organism: Bacillus subtilis # 7 341 9 342 371 199 33.0 6e-51 MKKEFTRIAIKVGSNVLTRQDGTLDVTRMSALTDQIAALHKAGVEVILISSGAVASGRSE IRTLRKLDSVDQRQLFSAVGQAKLINRYYELFRDHGIAVGQVLTTKENFGTRRHYLNQKN CMMVMLENGVIPIVNENDTISVTELMFTDNDELSGLIASMMNAQALIILSNIDGIYNGSP SDPDSLVIREIGQGKDLSNYIQTSKSSFGRGGMLTKTNIARKVADEGITVIIANGKRDNI LINLIEHPEETVCTRFIPASQPVSSIKKWIAHSEGFAKGEIHINEQATEVLNSDKAVSIL PVGITRIEGEFEKDDIVRIIDYQGTPVGVGKVNCDSMQARDSIGKHGKKAVVHYDYLYIE >gi|226332041|gb|ACIB01000015.1| GENE 54 66687 - 67934 1175 415 aa, chain + ## HITS:1 COG:NMB1068 KEGG:ns NR:ns ## COG: NMB1068 COG0014 # Protein_GI_number: 15676952 # Func_class: E Amino acid transport and metabolism # Function: Gamma-glutamyl phosphate reductase # Organism: Neisseria meningitidis MC58 # 2 415 3 420 420 338 45.0 2e-92 MNLNDTFAAVQAAGRHLALLPDDRINQILNAVAEAALEQTSYILSENRKDLERMSPDNPK YDRLRLTEERLRGIASDIRNVATLPSPLGRILKESIRPNGMRLTKISVPFGVIGIIYEAR PNVSFDVFSLCLKSGNACILKGGSDADYSNRAIVEVIHQVLRQFNIDTHMVELLPTDREA TRELLHAAGYVDLIIPRGSSALINFVRQNATIPVIETGAGICHTYFDEYGDTAKGAAIIH NAKTRRVSVCNALDCVIVHESRLSDLPLLCEKLKADKVIIYADPSAYQALEGHYPAGLLK PATPESFGTEFLDYKMAIKTVNSFENALGHIQEYSSRHSESIVTENPERAALFTRMVDAA CVYTNVSTAFTDGAQFGLGAEIGISTQKLHARGPMGLEEITSYKWIIEGDGQTRQ >gi|226332041|gb|ACIB01000015.1| GENE 55 68422 - 69444 656 340 aa, chain + ## HITS:1 COG:no KEGG:BF0441 NR:ns ## KEGG: BF0441 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 340 1 340 340 685 99.0 0 MKQTAYIYLLTLSCLLCACNRENRTNLPQPQVTGVADSLETVPPEEKPKAISAEQIEIKK 
DLLYDKYTLEDTYPYKDTTRSFQWDKIKERLALLENIQQTPSQWAILQNYKNRNGEAPLV RHYKRNAYKRIADTLGIERYQSVPLYLLTDTLVPERYGEDGSLVRFLADGENFVKVSPIY IGEEWYVPKRYVKVLPDTTHFIKTIMIDRRDQNIMTLEQTSEAQWTVRSMNPATTGRHRP PYAQETPLGIFVLQEKKTRMIFLKDGSTATGGFAPYASRFSDGGYIHGVPVNEPRKALIE YSPSLGTTPRSHMCVRNATSHSKFIFDWAPVNETIIFVLE >gi|226332041|gb|ACIB01000015.1| GENE 56 69483 - 70211 631 242 aa, chain + ## HITS:1 COG:no KEGG:BF0495 NR:ns ## KEGG: BF0495 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 242 1 242 242 436 99.0 1e-121 MQKVFLFLLLFLLPLAEVPNHAPTSEPVSVASTPETDEIDQLFDDMQLDGIVSYTAFRQA VTGYRKIEQKSKSIMTLIDFSKPSTEKRLYVLDMKNKKLLYTSVVSHGKNSGGNYATSFS NKNGSYKSSLGFYLTENTYQGRNGYSLVLNGLEKGINDQAKQRAIVMHGAAYANPNITAS AGRLGRSLGCPALPQALAKPIIDTIKKGSVLFIYANNKDYLANSTFLSPRQTEYLSWAQP AN >gi|226332041|gb|ACIB01000015.1| GENE 57 70265 - 71221 1077 318 aa, chain + ## HITS:1 COG:XF0998 KEGG:ns NR:ns ## COG: XF0998 COG0078 # Protein_GI_number: 15837600 # Func_class: E Amino acid transport and metabolism # Function: Ornithine carbamoyltransferase # Organism: Xylella fastidiosa 9a5c # 28 302 27 322 336 190 37.0 3e-48 MKKFTCVQDIGDLKSALAESFEIKKDRFKYVELGRNKTLLMIFFNSSLRTRLSTQKAALN LGMNVIVLDINQGAWKLETERGVIMDGDKPEHLLEAIPVMGCYCDIIGVRSFARFENREY DYNEVIINQFIQHSGRPVFSMEAATRHPLQSFADLITIEEYKKTARPKVVMTWAPHPRPL PQAVPNSFAEWMNATDYEFVITHPEGYELDPKFVGNARVEYDQMKAFEGADFIYAKNWAA YTGDNYGQILSTDRNWTVGDRQMAVTNNAYFMHCLPVRRNMIVTDDVIESPQSIVIPEAA NREISATVVLKRLLENLP >gi|226332041|gb|ACIB01000015.1| GENE 58 71320 - 71715 378 131 aa, chain - ## HITS:1 COG:MA0746 KEGG:ns NR:ns ## COG: MA0746 COG0607 # Protein_GI_number: 20089631 # Func_class: P Inorganic ion transport and metabolism # Function: Rhodanese-related sulfurtransferase # Organism: Methanosarcina acetivorans str.C2A # 20 129 31 149 151 70 36.0 8e-13 MSKMNSMLMGICFLLSSLFSCQQSKGNFKTVPVKEFASLIEDASVQRLDVRTMAEYSEGH IPGTININVLDDSFAVMADSTLQKDKPVALYCRSGKRSKKAAAILSEKGYKVYELDKGFN AWQEAGEKVEK >gi|226332041|gb|ACIB01000015.1| GENE 59 71720 - 72415 329 231 aa, chain - ## 
HITS:1 COG:no KEGG:BF0492 NR:ns ## KEGG: BF0492 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 231 1 231 231 467 100.0 1e-130 MRIWTLHIVLSVLFSVFLFSCNEDDGAPYPSVRLEFLTAESGADGRLQTLVTDKGERLVV AEDRTGSELLPNSSSRVVSNYEVLSSVGGRKEIRIYALANTVSPVPLPAGEFRNGLKFDP VDMLSIWMGRDYLNMTLSIRAQNARHTFHFIQESVERDAVTGKLKVHLMLYHDDGGDVEA YAKRAYVSVPLRQYTSDIGESVIIYFSFHTYDGEVQTYQFEYVPSHLTINY >gi|226332041|gb|ACIB01000015.1| GENE 60 72412 - 73560 1177 382 aa, chain - ## HITS:1 COG:no KEGG:BF0491 NR:ns ## KEGG: BF0491 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 382 1 382 382 788 100.0 0 MNTTEFDEIRPYNDEELSGVFEELIADPAFQKVVAGVIPDVPFEMLAQKMRACRTKLEFQ KTFCYGLLWKLAGDCTDGISLDHTAIPDKSKAYTYISNHRDIILDSGFLSVLLVDQGMDT VEIAIGDNLLIYPWIKKFVRVNKSFIVQRALTMRQMLESSARMSRYMHYTIGEKNQSIWI AQREGRAKDSNDRTQDSVLKMLAMGGEGDVVSRLMEMNIAPLAISYEYDPCDYLKAQEFQ LKRDIEGYKKTMADDLKNMQTGLFGYKGRVHFQTGACLNDLLSTVDRSLPKPELFARIST WIDQRIHSNYRLYPGNYVAHDLLTGKNDFESHYTLAEKQRFEAYVEKQLEKIEIPNKDIS FLREKLLLMYANPLTNYLAARQ >gi|226332041|gb|ACIB01000015.1| GENE 61 73701 - 74675 907 324 aa, chain - ## HITS:1 COG:HI1140 KEGG:ns NR:ns ## COG: HI1140 COG1181 # Protein_GI_number: 16273066 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: D-alanine-D-alanine ligase and related ATP-grasp enzymes # Organism: Haemophilus influenzae # 2 321 5 303 306 177 32.0 3e-44 MKRNIAIVAGGDTSEIVVSLRSAQGIYSFIDKEKYNLYIVEMEGRRWEVQLPDGSKTPVD RNDFSFMNGAEKVVFDFAYITIHGTPGEDGRLQGYFDMMRIPYSCCGVLAAAITYDKFVC NQYLKAFGVRISESLLLRQGQAVSDEDVVEKIGLPCFIKPNLGGSSFGVTKVKTREQIQP AIAKAFSEAEEVMIEAFMGGTELTCGCYKTKEKSVVFPLTEVVTHNEFFDYDAKYNGQVD EITPARISEELTRRVQTLTSAIYDILGCSGIIRVDYIITEGEKINLLEVNTTPGMTATSF IPQQVRAAGLDIKDVMTDIIENKF >gi|226332041|gb|ACIB01000015.1| GENE 62 74672 - 75745 1036 357 aa, chain - ## HITS:1 COG:BH2542 KEGG:ns NR:ns ## COG: BH2542 COG0564 # Protein_GI_number: 15615105 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Pseudouridylate 
synthases, 23S RNA-specific # Organism: Bacillus halodurans # 26 346 1 300 305 236 43.0 7e-62 MIEELPDDIEQDELDDIEPVGDENQLYEHFRVVVDKGQAMVRVDKYLFERIVNASRNRIQ KAAEDGFVMANGKPVKSSYKVKPLDVITVMMDRPRYDNEIIPEDIPLHIVYEDKYLMVVN KPAGLVVHPGHGNYHGTLVNAIAWHLKDDPVYDANDPHVGLVHRIDKDTSGLLVIAKTPD AKTNLGVQFFNKTTKRRYRALVWGIVDQDEGTIVGSIARNPKDRMQMAVMADPTQGKHAV THYRVLERLGYVTLVECILETGRTHQIRVHMKHIGHVLFNDERYGGHEILKGTHFSKYKQ FVNNCFDTCPRQALHAMTLGFVHPVTGEEMHFTSELPDDMTRLIEKWRGYISNRDLE >gi|226332041|gb|ACIB01000015.1| GENE 63 75775 - 76416 593 213 aa, chain - ## HITS:1 COG:no KEGG:BF0488 NR:ns ## KEGG: BF0488 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 213 1 213 213 395 100.0 1e-109 MTIKEFFSFKANRFFWINIIAMVVVAVLIVVGTLKGLDIYTRHGEAVIVPDVKGMSVSEA EKMFRNHGLTCVVSDSSYVKNKPSGIILDLNPSVGQKVKEGRTIYLTINTLSTPLSVVPD VADNSSVRQAQAKLIAAGFKLTENRMVSGEKDWVYGVIYQGRQLQIGDKAPIGATLTLMV GDGVQSTATDSVDMVENAAMSVEDSGTDDDSWF >gi|226332041|gb|ACIB01000015.1| GENE 64 76628 - 76789 270 53 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53711778|ref|YP_097770.1| 50S ribosomal protein L34 [Bacteroides fragilis YCH46] # 1 53 1 53 53 108 100 1e-22 MKRTFQPSNRKRKNKHGFRERMATANGRRVLAARRAKGRKKLTVSDEYNGVKA >gi|226332041|gb|ACIB01000015.1| GENE 65 76983 - 77549 802 188 aa, chain + ## HITS:1 COG:MT2609 KEGG:ns NR:ns ## COG: MT2609 COG0231 # Protein_GI_number: 15842068 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factor P (EF-P)/translation initiation factor 5A (eIF-5A) # Organism: Mycobacterium tuberculosis CDC1551 # 1 188 1 187 187 183 46.0 2e-46 MINAQDIKNGTCIRMDGKLYFCIEFLHVKPGKGNTFMRTKLKDVVSGYVLERRFNIGEKL EDVRVERRPYQYLYKEGEDYIFMNQETFDQHPIAHDLINGVDFLLEGAVVEVVSDASTET VLYADMPIKVQMKVTYTEPGLKGDTATNTLKPATVESGATVRVPLFISEGETIEIDTRDG SYVGRVKA >gi|226332041|gb|ACIB01000015.1| GENE 66 77985 - 78998 615 337 aa, chain + ## HITS:1 COG:TVN0547 KEGG:ns NR:ns ## COG: TVN0547 COG0463 # Protein_GI_number: 13541378 # Func_class: M Cell wall/membrane/envelope 
biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Thermoplasma volcanium # 12 244 7 221 250 84 29.0 4e-16 MKINYTYMMRYSVIIPVYNRPDEVDELLQSLTAQHFKDFEVVVVEDGSSVPCEKIVNQYQ GKLDIHYYNKPNSGPGQTRNYGAERSNGEYLIILDSDCILPEGYLDAVEKELQTDPADAF GGPDRAHSSFTDIQKAINYSMTSFFTTGGIRGGKKKMDKFYPRSFNMGVRREVYQALGGF SNMRFGEDIDFSIRIFKGGYQCRLFPDAWVYHKRRTDFKKFFKQVHNSGIARINLYKKYP ESLKVVHLLPAVFTLGVALLLLCTPFCLFSLVPILLYALLVCLDSALQNKSLRIGIYSIA ASFIQLIGYGTGFWRAWWERCILGRNEFEAFRKNFYK >gi|226332041|gb|ACIB01000015.1| GENE 67 79041 - 79733 546 230 aa, chain + ## HITS:1 COG:MA1979 KEGG:ns NR:ns ## COG: MA1979 COG2003 # Protein_GI_number: 20090827 # Func_class: L Replication, recombination and repair # Function: DNA repair proteins # Organism: Methanosarcina acetivorans str.C2A # 1 230 1 228 229 144 32.0 1e-34 MNKQKLNIKQWSKADRPREKMMTKGSEALSDAELLGILIGSGNTEESAVELMRRILATCD NNLNELGKWEVRNFSSFKGMGPAKSLTIMAALELGKRRKLQESKEREQIRCSEDIYKLFH PLMCDLPQEEFWILLLNQACKVINRLRISTGGIDGTYADVRTILREALIGRATQIALIHN HPSGHAKPSQEDKRLTGAIQKASQTMNITLVDHVIVCDGCFYSFADEGLI >gi|226332041|gb|ACIB01000015.1| GENE 68 79762 - 80526 799 254 aa, chain - ## HITS:1 COG:NMA0723 KEGG:ns NR:ns ## COG: NMA0723 COG2908 # Protein_GI_number: 15793700 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Neisseria meningitidis Z2491 # 1 233 1 222 240 94 29.0 2e-19 MKNVYFLSDAHLGSRAIEHGRTQERRLVNFLDSIKHKAAAVYLLGDMFDFWYEFRLVVPK GYTRFLGKLSELTDMGVEVHFFTGNHDIWCGDYLTKECGVTIHREPVTTEIYGKEFYLAH GDGLGDPDKKFKLLRTMFHSRTLQTLFSAIHPRWSIDLGLNWAKHSRLKREGGKEPDYMG ENKEFLVLYTKEYLKSHPNINFFIYGHRHIELDLMLSATARILILGDWINFFSYAVFDGE NLFLENYIEGETQL >gi|226332041|gb|ACIB01000015.1| GENE 69 80572 - 80883 407 103 aa, chain - ## HITS:1 COG:CC1859 KEGG:ns NR:ns ## COG: CC1859 COG2151 # Protein_GI_number: 16126102 # Func_class: R General function prediction only # Function: Predicted metal-sulfur cluster biosynthetic enzyme # Organism: Caulobacter vibrioides # 6 100 21 115 118 98 52.0 2e-21 
MTKFEIEEKIVDMLKTVFDPEIPVNVYDLGLIYKIDVSEDGEVSIDMTLTAPNCPAADFI MEDVRQKVESIDGVNSATINLVFEPEWDKDMMSEEAKLELGFL >gi|226332041|gb|ACIB01000015.1| GENE 70 80966 - 81490 565 174 aa, chain - ## HITS:1 COG:no KEGG:BF0480 NR:ns ## KEGG: BF0480 # Name: not_defined # Def: alkaline phosphatase III precursor # Organism: B.fragilis # Pathway: gamma-Hexachlorocyclohexane degradation [PATH:bfr00361]; Folate biosynthesis [PATH:bfr00790]; Metabolic pathways [PATH:bfr01100]; Two-component system [PATH:bfr02020] # 13 174 305 466 466 316 99.0 2e-85 MSFTRNTRKKLWVTADHETGGIALGTGKYALNLKALENQKASAEVLSKKISDLRKAKNNH VAWEDIKNLLSEEMGFWSVLPITWEQEKKLHDEYEKSFVRNKVEFAESMYAKTEPMAAKA KEVMDQIAMVGWTSGGHSAGYVPVFAIGAGSDLFIGKMDNTEIPKRIAKAGGYK >gi|226332041|gb|ACIB01000015.1| GENE 71 81438 - 82361 882 307 aa, chain - ## HITS:1 COG:TM0156 KEGG:ns NR:ns ## COG: TM0156 COG1785 # Protein_GI_number: 15642930 # Func_class: P Inorganic ion transport and metabolism # Function: Alkaline phosphatase # Organism: Thermotoga maritima # 1 296 1 285 434 155 36.0 8e-38 MKRLFYFFLFVCVAVIANAQAKYVFYFIGDGMGVNQVNGTEMYRAEIQKGRIGVEPLLFT QFPVGTMATTFSATNSVTDSSAAGTALSTGEKTYNGSIGMDGQKNPLQTVAEKAKKAGKR VGVTTSVSVDHATPAAFYAHQPDRNMYYEIATDLPKAGFDFYAGAGFLKPTTTYDKKEAP SIFPMFEEAGYTIARGYNDYKAKAAAAGKMILIQEEGADTGSLPYAIDSKEGDLTLAQIT ESAIDFLTKGKNKGFFLMVEGGKIDWACHGNDAATVFHEVADMDNAIKVAYEFYKKHPKE TLGNCRP >gi|226332041|gb|ACIB01000015.1| GENE 72 82584 - 83780 1370 398 aa, chain - ## HITS:1 COG:TM0274 KEGG:ns NR:ns ## COG: TM0274 COG0282 # Protein_GI_number: 15643044 # Func_class: C Energy production and conversion # Function: Acetate kinase # Organism: Thermotoga maritima # 1 398 1 400 403 474 59.0 1e-133 MKVLVLNCGSSSIKYKLFDMDSKEVIAQGGIEKIGLKDSFLKLTLPNGEKKILEKDIPEH TVGVEFILNTLVSPEYGAIQSLEEINAVGHRMVHGGERFSKSVLLTKEVLEAFAACNDLA PLHNPANLKGVDAITAILPNVPQIGVFDTAFHQTMPEHAYLYAIPYELYKKYGVRRYGFH GTSHRYVSQRVCEYLGIKPEGLKLITCHIGNGGSIAAIKDGKCIDTSMGLTPLEGLMMGT RSGDIDAGAVTFIMDKEGLTTTGISNLLNKKSGVAGMMNGSSDMRDLEAAVAKGDPQAIL 
TEQMYFYRIKKYIGAYAAALGGVDVILFTGGVGENQATCRAGVCEGLEFLGVKLDPEKNK VRGEEAIISTDDSRVKVVVIPTDEELLIASDTMAILDK >gi|226332041|gb|ACIB01000015.1| GENE 73 83799 - 84818 1252 339 aa, chain - ## HITS:1 COG:CAC1742 KEGG:ns NR:ns ## COG: CAC1742 COG0280 # Protein_GI_number: 15895019 # Func_class: C Energy production and conversion # Function: Phosphotransacetylase # Organism: Clostridium acetobutylicum # 2 332 1 329 333 314 51.0 2e-85 MLNLINSIVARAQANRQRIVLPEGTEERTLKAANQILTDEVADLILLGNPEEINAAAAKW GLGNINRATIIDPENHPKKEEYAQLLCELRKKKGMTIEEARKLVLDPLYLGCLIIKSGDA DGQLAGARNTTGDVLRPALQIIKTSPGITCVSGAMLLLTHAPECGQNGLLVMGDVAVTPV PDASQLAQIAVCTARTAQAVAGIAEPKVAMLSFSTKGSAKHENVDKVVEALKLAKEMAPD LNIDGEMQADAALVPSVGASKAPGSPVAGEANVLIVPSLEVGNISYKLVQRLGHADAVGP ILQGIARPVNDLSRGCSIEDVYRMIAITANQAIAAKNGK >gi|226332041|gb|ACIB01000015.1| GENE 74 84952 - 86505 1321 517 aa, chain - ## HITS:1 COG:no KEGG:BF0477 NR:ns ## KEGG: BF0477 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 517 2 518 518 1025 99.0 0 MKTKHYFSFVFASLFLIPSVITAQDVNYQKEFDTFQEKQQKEYKEFKNKADEEFATFLKE AWQKYNASMEDSMPTRPEPVKPTLFDKKKPVPAPVEIKPEVPKIPIADKPGVGGEVNVEV KKQDLPVVADKPAPGVYVPGKPYTPVKVDIPAPLPGSSAHRNAIEFYGTRFEVATDVIDG FELGGTSESKVAGAWSRLCKADHEQLINDCIRLKKEHQMNDWAFLMFIKQLGVQVCGVAK KDDVAFLQMFILNKCGYKVRLSKINDKLKLLVAPAGTIFGIPYITFKGVKYYVFEADKGG SMAVYTYSQDFANAKNLVCMDLSAVPQFGMQEFSKTVSPSEKSLLKVNTAVNKNLMDFYK DYPQCEVAVYYKTPMSKELKSALYPPLQAAIKGKSEKDAANILIDFVQNSFQYQTDGEQF GYEKPFFMDENFYYPACDCEDRAILFSNLVKDLLGLDAVLLDYPNHIASAVRFNEDISGD YILLDGKKYLICDPTYIGAPIGMCMDRFKSVPPEIIR >gi|226332041|gb|ACIB01000015.1| GENE 75 86550 - 87347 725 265 aa, chain - ## HITS:1 COG:no KEGG:BF0421 NR:ns ## KEGG: BF0421 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 265 1 265 265 500 100.0 1e-140 MNVNRLLGILAILLLTAANALYAQKPVKVKGVQGRWQVSDDITLKQAEERAFMEAKKAAL QKAGVMENVWSVFGQITQEDGQELHEAYSQMNVLAIGGMVNVTNKKVEEVWDTDTRSLYK VVTIDAEVRKEDKSDSSYALEVKGVETLYREGDVFHCKLTIHGTDSYLKFFWFDSNGGAL 
LYPNSYEPNTLLKAGKEYAIPFSNAVDYRMEKQHGKESEKINMMMVATKEDIPFTKEVTY QNVLEWVYSIPAVQRCAFYDMVLIK >gi|226332041|gb|ACIB01000015.1| GENE 76 87373 - 88056 839 227 aa, chain - ## HITS:1 COG:no KEGG:BF0475 NR:ns ## KEGG: BF0475 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 227 1 227 227 343 99.0 2e-93 MKKSLLISVFATFAMMISLNALAQEKATGKAYKAIQKDEKVINKDLQKKAIKEARKQAKE LTKEGFKTPVGKLPLDKQLENSWEKQMEIDMNGNPYWYIATSRVIGGNQSAAAMQATNTA KIDIAGQVQTKVTQLIESKVANDDMGQEEAASLSSAVAAGKSIISGTLGRTIPLVEVYRT LPNKNVEVMVTIGYSLEAANKVAVKALSEELAKKSPELAKELDKLAQ >gi|226332041|gb|ACIB01000015.1| GENE 77 88265 - 88603 369 112 aa, chain + ## HITS:1 COG:NMA0437 KEGG:ns NR:ns ## COG: NMA0437 COG1380 # Protein_GI_number: 15793442 # Func_class: R General function prediction only # Function: Putative effector of murein hydrolase LrgA # Organism: Neisseria meningitidis Z2491 # 1 111 3 113 114 87 46.0 5e-18 MIRQCAILFGCLALGELIVYLTGIKLPSSIIGMLLLTLFLKLGWIKLHWVQGMSDFLVAN LGFFFIPPGVALMLYFDIIAAQFWPIVIATLVSTLLVLVITGWVHQLTRKLK >gi|226332041|gb|ACIB01000015.1| GENE 78 88600 - 89295 649 231 aa, chain + ## HITS:1 COG:NMB2004 KEGG:ns NR:ns ## COG: NMB2004 COG1346 # Protein_GI_number: 15677832 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Putative effector of murein hydrolase # Organism: Neisseria meningitidis MC58 # 10 229 11 229 230 177 45.0 2e-44 MNYLENEFFLLAITFGIYFFAKLLQKKTGILLLNPILLTIAVIIIFLKLTNISFETYNQG GHLIEFWLKPAVVALGVPLYLQLETIKKQLLPIILSQLAGCIVGVISVVLIAKLMGASQE VILSLAPKSVTTPIAMEVTKTLGGIPSLTAAVVVCVGLLGAVLGFKTMKIMHVGSPIAQG LSMGTAAHAVGTSTAMDISSKYGAYASLGLTLNGIFTALLTPTILRLLGIL >gi|226332041|gb|ACIB01000015.1| GENE 79 89350 - 90267 905 305 aa, chain + ## HITS:1 COG:FN0277 KEGG:ns NR:ns ## COG: FN0277 COG4866 # Protein_GI_number: 19703622 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 4 291 2 284 290 152 32.0 8e-37 MIAFRDITIQDKDTITAYTMNSCRRNCDLSFSNLCSWRFLYHTKFAIINNFLVFKFWAGD 
ELAYMMPVGEGNLEEVLNELIEDARQEGEPFCMLGVCSCMREDLEAIMPGQFGFTVDRDY ADYIYLRSDLATLKGKKFQSKRNHINKFRNTYPDYEYSPITKDRIQECLELEAKWCKAND CDQQEGTGNERRALIYALNHFEELGLTGGILHVNGQIVAFTFGMPINKETFGVHVEKADT SIDGAYAMINYEFANHIPEQYIYINREEDLGIEGLRKAKLSYHPETILEKYMACLKEQPV EMIKW >gi|226332041|gb|ACIB01000015.1| GENE 80 90277 - 91296 506 339 aa, chain + ## HITS:1 COG:FN1041 KEGG:ns NR:ns ## COG: FN1041 COG4552 # Protein_GI_number: 19704376 # Func_class: R General function prediction only # Function: Predicted acetyltransferase involved in intracellular survival and related acetyltransferases # Organism: Fusobacterium nucleatum # 3 298 11 321 391 73 24.0 7e-13 MIKEQVKSLWKLCFDDSEAFIELYFRLRYNNEVNLAIQSGEEVIAALQMLPYPMTFCNKI VPTSYISGACTHPDYRAKGVMRELLSQSFARMLRNGVLFSTLIPAEPWLFGYYAKTGYIP AFRISHKVFSLSELTIDPEPDTMIEETTEYQEDYYQYLTGKLSERACCLQHTPTDFKVVL ADLALTQNTVLIARKDNRVTGIAVVYKHDDSSYINELFADNEAVRAQLLYRAGLRNGTER IILQLPPVESLPSVPLGMARIIDARAVLQLYAAGCPEVEMNIELTDEQLSVNNGYYYLCK GKCMTSEVRLPGVHTRMTIAELSEKILGEMQPYMSLMMN >gi|226332041|gb|ACIB01000015.1| GENE 81 91389 - 93302 1078 637 aa, chain - ## HITS:1 COG:no KEGG:BF0412 NR:ns ## KEGG: BF0412 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 637 1 637 637 1309 98.0 0 MKYVLFILLALILAACRSEKDRRLEYALEFAGDNRVELEKVLEHYRTDPEKLEAARFLIR NMSGWYSYEGNELDSIHHLLVGVCEGRSISKREKNKWNRISFDSLSKIYDAQVITAEYLI DNIDLAFEVWRKYPWNRNLPFDDFCELILPYRIADEPLSDWRKLYYEDYGTLLDSLYKGS DVIEASKIIDGKLRKLYYIYNTDFRVPHLNAVFLYHNRIGYCREACDLTIYAMRACGIPV ATDYFVYSPDYQHYHCWTMLRDTTGTFLQFGFNEFEASRDTLRHDGRKKGKVYRYCFGMQ ADKNSGTSGNRQLSPVLKNRFVKDVTSEYFGSNDTTIPIQMSGEQYIYLGIFSPGGWIPI DMALGNAGKVTFRDIEPDVIYQTLYQGDGGKLYPAGYPFISKTGGGFVLLKPNIDLMEEA ILKRKMPQQKTIAEWAYRAIIGAKVEAADDLSFMQADLLWQFEDTLTTNYCVLTPLLRKK YRYVRYVAPIGKRMELAELALFKDSLCKEKVRLGRINSIEPIAKLEYVTDGNILTYFQAR DTSCYLAYDLGESTLIERIVFSPRNDDNYIWPGDNYELFYQDGINGWKSLGSKVATEREI DFLVPQNALLWLRNRTKGREEQVFIYKNGRQYFAFDL >gi|226332041|gb|ACIB01000015.1| GENE 82 93344 - 94444 396 366 aa, chain - ## HITS:1 COG:no KEGG:BF0469 NR:ns 
## KEGG: BF0469 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 366 1 366 366 761 99.0 0 MKNFSLIVVGILFIYCVSSCRKQVDKGFEMTVDLSISNPYLPMSVLVDTIESVRLQLPSP YFWGMVDNVISKDSCYYISDRKQEMAFRFSKNGTFLNAIGQRGEGPGEYREMDSFFVGKD CVYVCDMGKRTIYSYSFDGKFLHSLSFPYSLVFNDVVELPDGRFLCHRPSQSENCKGLWI LDQKGRRVKNLLEYEKGTPCKNSYWNTLCAQEDGTIKIYNPVDGSYYQYDAVNDTVVRTM RQKSNLPMLADLHCSDRELYETKEECTYSLFTVDGKNLVFSLWSFNSANKGMWSVYFKKD GRIEQGNLTKMDIPGYSEMGRPVSSNIPNTFVTVYTDEFPDDAFPSAYQQQEINEQTAIL SLLRLK >gi|226332041|gb|ACIB01000015.1| GENE 83 94506 - 94676 75 56 aa, chain - ## HITS:1 COG:no KEGG:BF0468 NR:ns ## KEGG: BF0468 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 56 1 56 56 106 100.0 2e-22 MENNMLCARLMIHTIYGFSLPADIFPFVFTDSEKSTSFVVLCVVKDSLYRYRLKKE >gi|226332041|gb|ACIB01000015.1| GENE 84 94646 - 95722 618 358 aa, chain - ## HITS:1 COG:no KEGG:BF0467 NR:ns ## KEGG: BF0467 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 358 1 358 358 687 99.0 0 MIKRPYLSFTLLLLLSMCILGGCEKEKSHTVLELFSESQSLSPKKDFYVNEDSIAIIEGL SCDGKNLIVNDYHSGCCYTLFDKKSGEYIAGFGTIGQGPAELPSPCYGYLTESGFTVFDD QTRIVMKYSLDSLRNSRKKDGSPVRLAQYKIPEAQISKLIAIDDTTFLCAGTYKSRYQYL LFNKNDSVLDYGVDVYNAADSAFQTYTRYLSNQGNLVMNPEKHTFAGSINFSSNIDFFEI VNNKIELIKSLRLGDPINKPVNEEGIYYVDLTENTQTGYIDLSATSKYVYALYSDKKMYE NNRKSDTVLVFDWDGNPIKKYSLDTDAYYIAVDSTQQSLFAAVKNSSSGWKIICYALD >gi|226332041|gb|ACIB01000015.1| GENE 85 95921 - 97087 234 388 aa, chain - ## HITS:1 COG:no KEGG:BF0466 NR:ns ## KEGG: BF0466 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 282 388 2 108 108 213 97.0 7e-54 MKTVLFLIISFCLFSCISKGGMAGGKAICLDSIVIQKELRFSSLFEKPEIIILDDDTIVG NIDKLLAKDSLLLILDKELNALHVFDRSGKFKYNIGSQGSGPGEYTLLCDFTLDKEGNVY ALDFTRQIVYCYSLSGQFFGSLSLNCDEGQSHYIHYNQGDLYTDIHSKSGEDAMMRKVDL KSGQTERKYFSVNVNNEGWRQIQGHSPFCLPLEKELCFSPLYSKDILCLQGDREDIWLKF VTDAWMDKKTLDKIDVNKGETLRYLVESGKVHSLIFLAHVKNDYLFKYQKGYQTYYGLLS 
ASRDSCSIFNGCVDDWVYRTKGDDVHIIPNLITYDEKGVYGCLDMYTISLFLSLVRSGQV NESLDRILELKNLIENSNPLIFYYPVKK >gi|226332041|gb|ACIB01000015.1| GENE 86 97157 - 98326 604 389 aa, chain - ## HITS:1 COG:no KEGG:BF0413 NR:ns ## KEGG: BF0413 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 294 389 2 97 97 156 75.0 1e-36 MRSFIFIFFVLLFFSSCVEQKKVDPYANESTRIINVPEDYTVSVCSENLFSSVKLLPLET KEECLIGRMDRLVICDSLFIINDTRQRILVFDREGAFLRQIGKRGGGPGEYLEVRDFFIN NKNELEVLDFKKILRYSLTGEFIGDMRFDYLSDKNLYCNPSYFIASPIAGYYIWGGTTGV RKVEDESCLMYKTDGAMKIEKGYFPIEHGAGANYYKFSKYDNHILIDPTFGDYNIYQIDS LDNLSTRYFFDFGNKSCKQTINFPDKMSVDAKESLDESVVALYNFQETKKWLHLDFVYKE NVYSLFYSKANDEVSIIDIKNCQLENKSSFGFWGAIGVEGEVLLNAIEASWIKAELDRLG PAMVEKLNLKKWDSIHESDNPVLVFYELK >gi|226332041|gb|ACIB01000015.1| GENE 87 98447 - 98734 262 95 aa, chain - ## HITS:1 COG:no KEGG:BF0404 NR:ns ## KEGG: BF0404 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 49 1 50 88 72 86.0 4e-12 MRTKVKVMIAFIAVIAVSFIGYNVYKAQSTKLLSDIAMANVEALATPEGDDWVKENLDTT CAYCINMVGGHGVFFYCQYGTGSCYNTICTSGYCW >gi|226332041|gb|ACIB01000015.1| GENE 88 98862 - 99749 462 295 aa, chain - ## HITS:1 COG:no KEGG:BF0463 NR:ns ## KEGG: BF0463 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 295 1 295 295 613 98.0 1e-174 MGRPIKLIGYLCFLMALCSCKETKEQQISRLIHEWEGRTIVYPAGMTFSVLGKDSAGYSF PQNEYTIMTYVDSVGCTSCKLQLPTWKRLISMVDTVAAGKVSFLFVFHPKNKKEISFLLK RDRFLYPVFIDEKGDFDALNHFPSDVNFQTFLLDSQNKVLAIGNPVHNKKVCDLYLQIIA GQQTGVATSSQTKVVLDKKLDEMGDFDWKIPQTATFSLRNLGDHLLIIEDINASCGCTSV TYSKEPVPSGKSADIQVTYRAEHPEHFEKTITVYCNTPTSPIRLKIRGNAIDEEY Prediction of potential genes in microbial genomes Time: Tue May 17 22:29:12 2011 Seq name: gi|226332040|gb|ACIB01000016.1| Bacteroides sp. 
3_2_5 cont1.16, whole genome shotgun sequence Length of sequence - 3208 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 3, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 428 - 487 6.2 1 1 Tu 1 . + CDS 696 - 1133 79 ## COG3293 Transposase and inactivated derivatives 2 2 Tu 1 . - CDS 1188 - 2246 463 ## gi|253564181|ref|ZP_04841638.1| predicted protein - Prom 2269 - 2328 6.9 - Term 2387 - 2439 -0.8 3 3 Tu 1 . - CDS 2476 - 2736 220 ## gi|301161524|emb|CBW21064.1| putative export protein - Prom 2756 - 2815 4.3 Predicted protein(s) >gi|226332040|gb|ACIB01000016.1| GENE 1 696 - 1133 79 145 aa, chain + ## HITS:1 COG:MA0831 KEGG:ns NR:ns ## COG: MA0831 COG3293 # Protein_GI_number: 20089715 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Methanosarcina acetivorans str.C2A # 1 143 48 262 271 99 30.0 2e-21 MLYLTKTACQWQMLLKEFGPWQTIYFYFRKWKLERVFEELMHHLRESVRKAFGKAISPID FRTIRTSHHIDTLGIIPVVVIHTANILRPDAAKKFTVPSKRWIVDRTFSWFESFRRLSKD NEVLPETSQTMIYLTMIQMMLNRTK >gi|226332040|gb|ACIB01000016.1| GENE 2 1188 - 2246 463 352 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564181|ref|ZP_04841638.1| ## NR: gi|253564181|ref|ZP_04841638.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 352 1 352 352 712 100.0 0 MQKQFFFIVIFSCLLSSCNLRDNTNISTFDKVIEVKCEQSDTELLINMGWIGCVDTFLVM THIAQKDFCNVYSIPSGMKKIYAYGSLGNGPGEFLQPMITYAHENTFGLNDINTQTLAVM SLNDSKDGVVVKELSRNRVPYKRKKGELNPADYNFVKLDDKHFVSLLCGKDGSFFSLLDS NLQPLQRFGNSPIEGELSMQSSRMNLKGCLSAYEGNMVFTPNKLPYLAKYHLENDVMIKD WSFYFDKSFYECKNSDLLFSKERSFGQVLDLAMDDKYIYVLYLDQLLSDYDFKDPHKSMA NKILIFDYKGLPIAKLLLDKRIYRIALCTKLHKIIGLGNMPEPTLVSFDNIP >gi|226332040|gb|ACIB01000016.1| GENE 3 2476 - 2736 220 86 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301161524|emb|CBW21064.1| ## NR: gi|301161524|emb|CBW21064.1| putative export protein [Bacteroides fragilis 638R] # 1 86 1 86 86 154 100.0 2e-36 MRIKIFGFIAFVAIAVAAGFNYQQNKQEAELPDLTIANIEALASGESVDREDCETASDLC SMIVIYPDGDYGEDILLGHTKKPGWI Prediction of potential genes in microbial genomes Time: Tue May 17 22:29:37 2011 Seq name: gi|226332039|gb|ACIB01000017.1| Bacteroides sp. 3_2_5 cont1.17, whole genome shotgun sequence Length of sequence - 24354 bp Number of predicted genes - 21, with homology - 21 Number of transcription units - 13, operones - 6 average op.length - 2.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 3 - 1059 339 ## BF0461 hypothetical protein - Prom 1113 - 1172 5.0 + Prom 1579 - 1638 3.9 2 2 Tu 1 . + CDS 1734 - 1886 61 ## BF0459 hypothetical protein + Term 1928 - 1962 -0.9 3 3 Op 1 1/0.250 - CDS 1941 - 2705 648 ## COG1624 Uncharacterized conserved protein 4 3 Op 2 . - CDS 2740 - 3600 768 ## COG0294 Dihydropteroate synthase and related enzymes - Prom 3663 - 3722 4.5 + Prom 3466 - 3525 3.8 5 4 Tu 1 . + CDS 3685 - 5727 1438 ## COG0642 Signal transduction histidine kinase - Term 5778 - 5807 -0.2 6 5 Op 1 . - CDS 5813 - 6778 1253 ## COG2066 Glutaminase 7 5 Op 2 . - CDS 6794 - 8236 1620 ## COG0076 Glutamate decarboxylase and related PLP-dependent proteins - Prom 8342 - 8401 5.2 + Prom 7982 - 8041 4.7 8 6 Op 1 . + CDS 8268 - 8495 82 ## BF0453 hypothetical protein 9 6 Op 2 . 
+ CDS 8498 - 9733 1072 ## COG0477 Permeases of the major facilitator superfamily 10 6 Op 3 . + CDS 9776 - 11074 1204 ## COG0770 UDP-N-acetylmuramyl pentapeptide synthase - Term 11000 - 11042 4.4 11 7 Tu 1 . - CDS 11079 - 11471 270 ## BF0390 putative transmembrane protein - Prom 11545 - 11604 8.2 + Prom 11471 - 11530 6.7 12 8 Op 1 . + CDS 11576 - 12943 1026 ## COG0733 Na+-dependent transporters of the SNF family 13 8 Op 2 . + CDS 12946 - 13848 610 ## COG1555 DNA uptake protein and related DNA-binding proteins + Term 13863 - 13904 -0.9 - Term 13659 - 13693 -1.0 14 9 Op 1 . - CDS 13849 - 14499 308 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 15 9 Op 2 . - CDS 14504 - 15214 622 ## COG1179 Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 16 9 Op 3 . - CDS 15214 - 17349 1648 ## COG0475 Kef-type K+ transport systems, membrane components - Prom 17501 - 17560 6.2 + Prom 17424 - 17483 5.6 17 10 Tu 1 . + CDS 17547 - 18557 1272 ## COG0136 Aspartate-semialdehyde dehydrogenase + Term 18614 - 18656 4.2 18 11 Tu 1 . - CDS 18611 - 18808 102 ## BF0443 hypothetical protein - Prom 19032 - 19091 9.6 + Prom 18607 - 18666 6.1 19 12 Tu 1 . + CDS 18857 - 19705 568 ## BF0442 hypothetical protein + Prom 20004 - 20063 8.3 20 13 Op 1 . + CDS 20167 - 23502 2580 ## BF0440 hypothetical protein 21 13 Op 2 . 
+ CDS 23532 - 24354 838 ## BF0439 hypothetical protein Predicted protein(s) >gi|226332039|gb|ACIB01000017.1| GENE 1 3 - 1059 339 352 aa, chain - ## HITS:1 COG:no KEGG:BF0461 NR:ns ## KEGG: BF0461 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 352 1 352 356 704 98.0 0 MKFIYSILLALLWVGITITLGSYLYRMGLPVVNASAQKLLLRTIDEDLEYRFRKLNPNHA YATGKKQTKHKKVTLTDKSGTCNVCQFAIDTVYADTSFIQKAKQSFLIERNPIHVDSLNQ KWQLKLRMDGIRAKTGIKLINSLKDGERISVSSGLNEPDCFLLAYSTGVGYCIKMDAFIR PFWVDVILKAHWNNIRTWSYVLFSLIFCLFYIPSVRLFLVRILSGSRIEDNHVESSQPLA QQKGEFVWEVDGLTFDYLQRSITYHDQTCILRKQVAEVLLAFLKAPGHLLLNEDLKKLFW KELDNVDSFMERRNRLITDLRTDLRKIGANLSVTLVNGGYQLHFSLENSKKS >gi|226332039|gb|ACIB01000017.1| GENE 2 1734 - 1886 61 50 aa, chain + ## HITS:1 COG:no KEGG:BF0459 NR:ns ## KEGG: BF0459 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 50 27 76 76 89 100.0 3e-17 MLIEERCKNEDTGSDSVNSLPELELSYSAGICFFLLKQAKRTIINLKIKK >gi|226332039|gb|ACIB01000017.1| GENE 3 1941 - 2705 648 254 aa, chain - ## HITS:1 COG:BH0265 KEGG:ns NR:ns ## COG: BH0265 COG1624 # Protein_GI_number: 15612828 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Bacillus halodurans # 7 253 13 253 274 175 40.0 8e-44 MFFEFGIKDFIDILLVAFLLYYTYKLMKASGSINVFTGILVFILIWLVVSQVLEMKLLGS IFDKLVSVGVLALIVLFQDEIRRFLLTLGSHQHASALVRFLTGNKKEKLQHDDIMPIVMA CISMGKQKVGALIVMERNVPLDDVIRTGEIIDANINQRLIENIFFKNSPLHDGAMVISKK RIKAAGCILPVSHNLDIPKELGLRHRAAMGISQVSDALAIIVSEETGAISVAWRGQFYLR QSAEELESLLTKES >gi|226332039|gb|ACIB01000017.1| GENE 4 2740 - 3600 768 286 aa, chain - ## HITS:1 COG:VC0638 KEGG:ns NR:ns ## COG: VC0638 COG0294 # Protein_GI_number: 15640658 # Func_class: H Coenzyme transport and metabolism # Function: Dihydropteroate synthase and related enzymes # Organism: Vibrio cholerae # 16 279 11 274 278 215 41.0 8e-56 MDSTIFKSLNVNGRLLDLSIPQVMGILNVTPDSFYAGSRSRTEADIAARARQILDEGASM IDIGAYSSRSNAEHISPEEEMRRLRTGLEILNRNHPGAIISVDTFRAGVAEECVKEYGVA 
IINDISAGEMDEQMFPTVARLNVPYIMMHMQGTPQNMQKEPHYENLLKEVFIYFARKVQQ LRDLGVKDIILDPGFGFGKTLEHNYELMAHLEEFGIFELPLLVGVSRKSMIYRLFGTTPQ EALNGTTVLDTVALMKGADILRVHDVREAVESVRLIEKLKSVSACS >gi|226332039|gb|ACIB01000017.1| GENE 5 3685 - 5727 1438 680 aa, chain + ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 435 679 64 313 328 147 36.0 8e-35 MQLSRLLKNMSSKPYRHFIALLLVLFCSRFEAQAEDLFAKTDSTLQQYLIRCKAQIKDPD FLQTNDTLTRMAQEKNDKRMQVIAVALKLDYYYYPNNPDSILVMVERVKKISRRNNELKY FYFAWGSRLIIYYIKQHQTNTAIYEARKMLQSAEADNFIPGIVQCYRTLGTVYMTQSNPK LAYENFRKQIALIEENEIEDINLPTQYASLAQCALEMHRPDEALKALEKGSKCTRSAYQI FTVQKAYILYYLETKEYEKARKILVELEQLFEKDKSLALYKSGLFYIQIEYYRNTGQYRK ALDVIEEIKNDSSSINKYLDYTLTQKQGDIYWEMNQKARAAQYYRDYILATDSIRSQEIQ NSTNEFYTIMEVEQLHKEKNELLLHMQEEKLQKINIALVSLVIILVAGTMLLFHISKLNK KLKRSEAKVIQQNKELVENGEELRKAKEQAENASRMKTTFIQSMSHEIRTPLNSIVGFSQ VLSNYFKEEDNDEIKEFASIIEISSSNLLRLINDVLDISYLDQSEILPYDKPEDINNCCL LSIERTRNSIKKEVSLRFEPSCGPLMILTNPERVAQILTHLLHNAIKFTDKGNITLAYTI SPTEKQIVYTVTDTGKGIPVEQQEYVFERFAKLNDFSQGTGLGLPICRIIAEKLGGSLII DKTYTKGCCFILTLPLIKAD >gi|226332039|gb|ACIB01000017.1| GENE 6 5813 - 6778 1253 321 aa, chain - ## HITS:1 COG:ECs0538 KEGG:ns NR:ns ## COG: ECs0538 COG2066 # Protein_GI_number: 15829792 # Func_class: E Amino acid transport and metabolism # Function: Glutaminase # Organism: Escherichia coli O157:H7 # 8 314 5 310 310 273 44.0 4e-73 MDKKISISQIKEVVQQAYEQVKGNTGGKNADYIPYLANIDKNLFGISVCLLNGQTITVGD FDYRFGIESVSKVHTAILILRQYGAQKVLEMIGADATGLPFNSIIAILLENDHPSTPLVN AGAISACSMVTPIGNSDKKWDAIVQNITDLCGSAPQLIEELYKSETATNFNNRSIAWLLK NYNRIYDDPNMSLDLYTRQCSLGVTAQMLSVAAGTVANGGVNPVTKKQVFDAELTPKITS MIATVGFYEHSGDWMYTSGIPAKTGVGGGVMGVLPGVFGVSAFAPPLDGSGNSVKAQLAI KYIMNKLGLNVFNGARVTIVD >gi|226332039|gb|ACIB01000017.1| GENE 7 6794 - 8236 1620 480 aa, chain - ## HITS:1 COG:sll1641 KEGG:ns NR:ns ## COG: sll1641 COG0076 # Protein_GI_number: 16329656 # 
Func_class: E Amino acid transport and metabolism # Function: Glutamate decarboxylase and related PLP-dependent proteins # Organism: Synechocystis # 29 443 35 448 467 443 49.0 1e-124 MEDLNFRKGDAKTEAFGSNRMLQPSPVEKIPDGPTTPEIAYQMVKDETFAQTQPRLNLAT FVTTYMDDYATKLMNEAININYIDETEYPRIAVMNGKCINIVANLWNSPEKDTWKTGALA IGSSEACMLGGVAAWLRWRKKRQAQGKPFDKPNFVISTGFQVVWEKFAQLWQIEMRQVPL TLDKTTLDPEEALKMCDENTICVVPIQGVTWTGLNDDVEALDKALDAYNAKTGYDIPIHV DAASGGFILPFLYPDTKWDFRLKWVLSISVSGHKFGLVYPGLGWVVWKGKEYLPEEMAFS VNYLGANITQVGLNFSRPAAQILGQYYQFIRLGFQGYKEVQYNSLQIAKYIHSQIAKMTP FVNYSEDVVNPLFIWYMKPEYAKNAKWTLYDLQDKLAQHGWMVPAYTLPAKLQDYVVMRV VVRQGFSRDMADMLLGDIKNAIAELEKLEYPTSTRIAQEKNLPVEAKVFNHTGKPQAAKK >gi|226332039|gb|ACIB01000017.1| GENE 8 8268 - 8495 82 75 aa, chain + ## HITS:1 COG:no KEGG:BF0453 NR:ns ## KEGG: BF0453 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 75 1 75 75 145 100.0 5e-34 MGIFPVEFPTTTVRGYESSSVLHYLGFIKKGGDGKVNKLSPALIGRYPNTLSAACVPNIL TDRLFFTHLNFIIPI >gi|226332039|gb|ACIB01000017.1| GENE 9 8498 - 9733 1072 411 aa, chain + ## HITS:1 COG:AGc4286 KEGG:ns NR:ns ## COG: AGc4286 COG0477 # Protein_GI_number: 15889635 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 21 365 15 350 400 91 26.0 3e-18 MRIQTGHGTIPLITLIGIWSISALNALPGLAVSPILGKLSAIFPHSTELDIQMLSSLPSL LIIPFILLAGKLTERVNFIRLLQAGLAIFALSGVLYLLSGQMWQLIAVSALLGVGSGLIV PLSTGLISKYFVGSYRVKQFGLSSAITNITLVVATAVTGYLAEVNWHLPFVVYLLPIISL VLSVYLQRSMASEGSTSLTNDKAPADKEEDVDTGNSKYGIHVRHLAGIMGVYGLATFLVL VVSFNLPFLMEEYHFTSGNSGIMISLFFLAIMTPGFFLNRIVGTLKEKTKFCSFLSIGIG LALIWISPKEWVIAPGCILVGLGYGVIQPVVYNQTTHTAISRKVTLALAFVMAMNYLAIL LCPFIIDFFQSTVFHIKSQQFAFVFNLCISIVMLVISYTKRNSFLFNDNLK >gi|226332039|gb|ACIB01000017.1| GENE 10 9776 - 11074 1204 432 aa, chain + ## HITS:1 COG:BS_murF 
KEGG:ns NR:ns ## COG: BS_murF COG0770 # Protein_GI_number: 16077524 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide synthase # Organism: Bacillus subtilis # 16 432 30 451 457 222 36.0 1e-57 MKLSALYQIFLDCAVVTTDSRNCPAGSLFIALKGESFNGNAFAAQALKDGCAYAIVDEAE YAPENNRHIILVDNCLQTLQQLANYHRRQLGTKVIGITGTNGKTTTKELISAVLSKSHNV LYTEGNLNNHIGVPMTLLRLKAEHELAVIEMGANHPGEIKFLVHIAEPDYGIITNVGKAH LEGFGSFEGVIRTKGELYDYLREKEDSTVFIHHDNAYLMDIAHDLNLIPYGSEDSLYVNG HVTGNSPYLTFEWKAGKDGDLHKVQTQLIGEYNFPNALAAVTIGRFFGVEAGKIDEALAG YTPRNNRSQLKKTADNTLIIDAYNANPTSMMAALQNFRNMTVEKKMLILGDMRELGTESA AEHRKIVDFLQECSFEKVLLVGEQFTATHPPYHTYANAQEVIKELQTEKPKDYTILIKGS NGIKLSTVVEFL >gi|226332039|gb|ACIB01000017.1| GENE 11 11079 - 11471 270 130 aa, chain - ## HITS:1 COG:no KEGG:BF0390 NR:ns ## KEGG: BF0390 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 130 1 130 130 262 100.0 3e-69 MLSYIKKYPVSLFIILAVIYLSFFKPPSTEISKIPNIDKVVHICMYFGMSGMLWLEFLRA HRRDHTPVWHAWVGAFICPVLFSGCVELLQEYCTTYRGGDWMDFAANTTGAVLASLIGYF IVRPRILSKK >gi|226332039|gb|ACIB01000017.1| GENE 12 11576 - 12943 1026 455 aa, chain + ## HITS:1 COG:BH1128 KEGG:ns NR:ns ## COG: BH1128 COG0733 # Protein_GI_number: 15613691 # Func_class: R General function prediction only # Function: Na+-dependent transporters of the SNF family # Organism: Bacillus halodurans # 6 449 9 446 453 328 45.0 1e-89 MTKKERGNFGSKLGVILASAGSAVGLGNIWRFPYETGNHGGAAFILIYLGCILLLGLPIM IAEFLIGRHSQANTARAYQILAPGTQWRWVGRMGVLAGFLILGYYSVVAGWTLEYIFEAV SNSFAGKTPAEFISSFQSFSSNPWRPALWLTLFLLATHFIIVKGVEKGIEKSSKIMMPTL FIIILILVGCSVTLPGAGKGIEFLLKPDFSKVDGNVFLGAMGQAFFSLSLGMGCLCTYAS YFSKNTNLTRTAFSVGIIDTFVAVLAGFIIFPAAFSVGIQPDAGPSLIFITLPNVFQQAF SGIPILAYIFSVMFYVLLALAALTSTISLHEVVTAYLHEEFNFTRGKAARLVTTGCILLG ILCSLSLGVTKEFTIFGLGMFDLFDFVTAKLMLPLGGLLISIFTGWYLDKKLVWSEITNN GTLKVPTYKLIIFILKYVAPIAISVIFINELGLLK >gi|226332039|gb|ACIB01000017.1| GENE 13 12946 - 13848 610 300 aa, chain + ## HITS:1 COG:TM1052 KEGG:ns NR:ns ## 
COG: TM1052 COG1555 # Protein_GI_number: 15643810 # Func_class: L Replication, recombination and repair # Function: DNA uptake protein and related DNA-binding proteins # Organism: Thermotoga maritima # 158 299 31 181 181 70 34.0 4e-12 MWKDFFYFTRAERQGILILAVLCILVFVAGWLIPDKANTATNDTEKFKKEYAGFMSSIRE KEQKIYSHNNRFQPPRTVRLTTFDPNITDSVGFLDLGLPAWMAKNILKYRNKGGKFRRAE DFRKVYGLTQEQYEMLLPYIYIATLAKPQDTLRLYTRKIEEDTLNFFKYAAGTVVELNSA DTTELKKIPGIGSGIARMITGYRNRLGGFYDIAQLKEIHLDVEKLRPWFNVATGNTRRLN INRTGIERLKAHPYINFYQAKIIVEYRKKKGILKSLKQLSLYEEFTPQDLERISHYICFE >gi|226332039|gb|ACIB01000017.1| GENE 14 13849 - 14499 308 216 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 16 196 20 199 223 123 36 1e-27 MIHLEGITKSFGSLQVLKGIDLEITQGEVVSIVGPSGAGKTTLLQIMGTLDSPDAGMINI DGTNVSRMKEKELSAFRNKHIGFVFQFHQLLPEFTALENVMIPAFIAGVPTKEASMRAME ILDFMGLKERASHKPNELSGGEKQRVAVARALINQPAVILADEPSGSLDSHNKEELHQLF FDLRNRFGQTFVIVTHDEALAKITDRTIHMVDGNII >gi|226332039|gb|ACIB01000017.1| GENE 15 14504 - 15214 622 236 aa, chain - ## HITS:1 COG:FN0725 KEGG:ns NR:ns ## COG: FN0725 COG1179 # Protein_GI_number: 19704060 # Func_class: H Coenzyme transport and metabolism # Function: Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 # Organism: Fusobacterium nucleatum # 6 234 4 229 234 168 41.0 7e-42 MENWQQRTELLLGAEKMERLRKSHVLVVGLGGVGAYAAEMICRAGVGRMTIVDADIVQPT NINRQLPATHATLGMEKAKVLEARFRDINPEIELTVLPVYLKDDNIPELLDAARYDFIVD AIDTISPKCYLIYHALQRRIKIISSMGAGAKSDITQVRFADLWDTYHCGLSKAVRKRLQK MGVKRKLPVVFSTEQADPKAVLLTDDERNKKSTCGTVSYMPAVFGCYLAEYVIKRL >gi|226332039|gb|ACIB01000017.1| GENE 16 15214 - 17349 1648 711 aa, chain - ## HITS:1 COG:BH2844_1 KEGG:ns NR:ns ## COG: BH2844_1 COG0475 # Protein_GI_number: 15615407 # Func_class: P Inorganic ion transport and metabolism # Function: Kef-type K+ transport systems, membrane components # Organism: Bacillus halodurans # 11 400 5 388 388 308 44.0 2e-83 
MHWFDLSLQLPITDPTWVFFLVLIIILFAPMILGRLHIPHIIGMILAGVVIGEYGFNVLE RDSSFELFGKVGLYYIMFLAGLEMDMEDFKKNRTKGLVFGWFTFIIPMALGVWSSMELLG YGFTTAVLLASMYASHTLIAYPIISRYGLSRLRSVNITIGGTAVTVTLALIILAVIGGMY KGAVDGMFWVLLVVKVAFLSFLIVFFFPRIGRWFFRKYDDSVMQFVFVLAMVFLGSGLME FVGMEGILGAFLAGLVLNRLIPHVSPLMNRLEFVGNALFIPYFLIGVGMIIDVRTLFTGG EALKVAVVMTVFATLSKWLAAWITQKIYHMQPNERSMMFGLSNAQAAATLAAVLIGHEII MENGERLLNDDVLNGTVVMILFTCVISSLVTERSARRFALHEEMQFEDNKEKTEQEQILI PVANPDTIEDLINLALVIRDTKQKRELIALNVINDNNNSENKELQGKRNLEKAAMIAASA DVSVNMVSRYDLNIASGIIHTIKEYDATDVVIGLHRKANIVDSFFGNLAESLLKGTHREV IIAKFLMPVNTLRRIIIAVPPKAEYETGFSKWVEHFCRMGSLLGCRVHFFANEQTLMRLQ QLVKKKHGSTPTEFSRLDEWDDLLLLTGQVNFDHLLVVISARRGSISYDPSFERLPNQLG KYFSNNSLIILYPDQFGEPQEIVSFSDPRGYNESQHYDKVGKWFYKWLKKN >gi|226332039|gb|ACIB01000017.1| GENE 17 17547 - 18557 1272 336 aa, chain + ## HITS:1 COG:aq_1866 KEGG:ns NR:ns ## COG: aq_1866 COG0136 # Protein_GI_number: 15606903 # Func_class: E Amino acid transport and metabolism # Function: Aspartate-semialdehyde dehydrogenase # Organism: Aquifex aeolicus # 2 335 4 340 340 373 58.0 1e-103 MKVAIVGVSGAVGQEFLRVLDERNFPLDELVLFGSKRSAGTKYTFRGKQIEVKLLQHNDD FKGVDIAFTSAGAGTSKEFEKTITQYGAVMIDNSSAFRMDADVPLVVPEVNAADAKDRPR GVIANPNCTTIQMVVALKAIEELSHIKTVHVSTYQAASGAGAAAMDELYEQYHQVLANEP VTVEKFAYQLAFNLIPQIDVFTENGYTKEEMKMYNETRKIMHSDVKVSATCVRVPALRAH SESIWVETERPISIEEAREAFAKGEGLVLQDNPAEKEYPMPLFLAGKDPVYVGRIRKDLT NDCGLTFWIVGDQIKKGAALNAVQIAEYLIKEGNIG >gi|226332039|gb|ACIB01000017.1| GENE 18 18611 - 18808 102 65 aa, chain - ## HITS:1 COG:no KEGG:BF0443 NR:ns ## KEGG: BF0443 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 65 38 102 102 116 98.0 3e-25 MICFDRKVFSYQINAFGNVVPSLNACAEQSFVFILVSDLIKSFLFNRLYEVYKKKISAGL TTWLR >gi|226332039|gb|ACIB01000017.1| GENE 19 18857 - 19705 568 282 aa, chain + ## HITS:1 COG:no KEGG:BF0442 NR:ns ## KEGG: BF0442 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 282 1 282 282 572 99.0 1e-162 
MKLIAESGSTRTEWALVEDNHLVQRVFTEGLNPFFQTRREISRSVRLGLPESFFKKKLDQ VYYYGAGCSSYEKKNILGASLVAQFKTPIQVESDLLAAARGLFKCEAGIACILGTGSNSC FYDGKIIVKNVKAAGYILGDEGSGAVLGKLFLADLLKGLAPKELANEFHEKFRISVNDVM ESVYNLPFPNRFLGTIAYFLGDYMDNEYVYNLLTNNLRSFFNRNICQYDYINYPIRFVGS LAYAYPDILQEVAQEFGVEIDVIEETPMNGLIEFHSMNIEES >gi|226332039|gb|ACIB01000017.1| GENE 20 20167 - 23502 2580 1111 aa, chain + ## HITS:1 COG:no KEGG:BF0440 NR:ns ## KEGG: BF0440 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1111 1 1111 1111 2164 99.0 0 MNEDRKKRGLANSNLLIAMAITTLIIGSSNAMANQTASGSSYKVTEQMQIQTVTGVVVDA NGEPIIGASVVEKGTTNGIVTDMDGKFSLNVKVGTTLQITFVGYQPQDVKATKSMKVVLK EDNELLDEVVVVGYGTQKKANLTGAVSTVDVSKTLEARPQSDVSKALQGVVPGLTITNTS GKLNSKPTMTIRGTGTLSNSATSNPLIVVDGVPMDDISYLNTQDIDNISVLKDAASTSIY GTRAAFGVILVTTKSAKKTDKVTINYTNNFSWDTPTILPNYPDVATQARALRAANTRANL ENELFGMYMDDNFIAKAEAWKQRHGGKKAGYREMIPGDDFDLGEDGSALYYADWDVVGIM FRDWKPAQSHNISIQGTSGKTSYFLSVGYNHEEGVMTFNPDKLNKYNANMNVTSDITNWL QIGGRFSYSDKAYTTPNTRRNTYTYMWRWGSFFGPYGTYQGIDMKNDIAYLKQAGDDKTN DSYTRIGTFLKATIIKGLTLNADYTFNINNKTTKSVGLPVICWNSWGGKLNTPTTAAGAN GDTWVYQNSVRDNSYALNVFANYELTVAKDHHFNFMIGANAEEGEYQNHWSQRKGLLDDK LPEFNLATGDQTVGGTHNEWGTAGWFGRINYDYNGIWLLELNGRYDGSSKFPSSDRWAFF PSGSVGYRISEEKFFGPIKKVVSNTKIRASYGEIGNQAVGSNMYISTVSKRTDGNTHWLN GSNKVVAYDLPSLVSPTLKWERIQTLDIGGDFGFFNNELNISFDWYQRTTKDMLAPGQTM PDVLGAGAPKINAGTLRTRGWELSIDWRHHFNEVNVYANASIGDFKTVITKWDNDSQLLN ENYSGKVYGDIWGFETDRYFTKDDFNADGSYKEGIASQKKLEQDGFVYGPGDIKFKDLNN DKEINGGEGTVKDHGDLKVIGNTTPRYQYGFRLGGEWKGIDIDMFFQGVGKCDAWTQSAF VMPMMRGADAIYANQANYWTDENPDPNADFPRMWPGNAGKGTVSVLDLGNHNFYPQSKYL VNMAYLRFKNLTIGYTLPKDWTRKVYMDKVRVYFSANNICELINKSNAPVDPEVNTSEAI ANGGSSDYGNGTWGRVDPMYRTVSFGLQVTF >gi|226332039|gb|ACIB01000017.1| GENE 21 23532 - 24354 838 274 aa, chain + ## HITS:1 COG:no KEGG:BF0439 NR:ns ## KEGG: BF0439 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 274 1 274 565 530 99.0 1e-149 MKKIKYIACLLSLAVVSGCDSILDKGPLDSFTNDNFWTGEGNISGYANAFYEQFLGYGNG 
NGYGDFYFKTLSDDQAGMSFAKWTYPDNAPSTSATWKNGWIEVRRANIMLENVPTVASLD EATKNHWLGVARLMRAWQYYHLVRMYGNLPWIDKALNINDEGEIYGNREDRDMVMDKVLE DLDFAVTNIKDISSKTTWSRSLANAMKAEVCLYEGTFRKYRKNEDGQQAPDATGAARYLT ACKEACLAVMSKGYKLNTSYQENYNSTDLSSNPE Prediction of potential genes in microbial genomes Time: Tue May 17 22:30:26 2011 Seq name: gi|226332038|gb|ACIB01000018.1| Bacteroides sp. 3_2_5 cont1.18, whole genome shotgun sequence Length of sequence - 76237 bp Number of predicted genes - 57, with homology - 55 Number of transcription units - 28, operones - 12 average op.length - 3.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 875 462 ## BF0439 hypothetical protein + Term 902 - 958 12.2 + Prom 978 - 1037 9.2 2 2 Op 1 . + CDS 1060 - 2265 492 ## COG3876 Uncharacterized protein conserved in bacteria 3 2 Op 2 . + CDS 2306 - 3757 879 ## COG0591 Na+/proline symporter 4 2 Op 3 . + CDS 3765 - 6836 1838 ## BF0377 exported xanthan lyase/N-acetylmuramoyl-L-alanine amidase + Prom 6847 - 6906 6.9 5 3 Op 1 . + CDS 6932 - 8797 1338 ## COG1520 FOG: WD40-like repeat 6 3 Op 2 . + CDS 8821 - 10287 1200 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 7 3 Op 3 . + CDS 10284 - 11216 515 ## BF0433 putative glycosyltransferase 8 3 Op 4 . + CDS 11213 - 12568 898 ## COG2385 Sporulation protein and related proteins 9 3 Op 5 . + CDS 12584 - 13882 1174 ## COG0477 Permeases of the major facilitator superfamily 10 3 Op 6 . + CDS 13924 - 15198 957 ## COG3876 Uncharacterized protein conserved in bacteria 11 3 Op 7 . + CDS 15240 - 18104 667 ## PROTEIN SUPPORTED gi|167010850|ref|ZP_02275781.1| ribosomal protein L36, putative 12 3 Op 8 . + CDS 18113 - 19276 1062 ## COG4299 Uncharacterized conserved protein 13 3 Op 9 . + CDS 19273 - 20109 775 ## BF0427 hypothetical protein 14 3 Op 10 . + CDS 20133 - 20978 967 ## COG2103 Predicted sugar phosphate isomerase 15 3 Op 11 . 
+ CDS 21011 - 22894 1149 ## COG1680 Beta-lactamase class C and other penicillin binding proteins - Term 22871 - 22903 4.9 16 4 Op 1 . - CDS 22906 - 23592 625 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases 17 4 Op 2 . - CDS 23639 - 24904 1133 ## BF0423 hypothetical protein 18 4 Op 3 . - CDS 24937 - 25728 585 ## COG0755 ABC-type transport system involved in cytochrome c biogenesis, permease component 19 4 Op 4 . - CDS 25725 - 26966 962 ## BF0421 hypothetical protein 20 4 Op 5 1/0.000 - CDS 26970 - 28451 1485 ## COG3303 Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit 21 4 Op 6 . - CDS 28475 - 29062 381 ## COG3005 Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit - Prom 29141 - 29200 5.1 22 5 Tu 1 . + CDS 29372 - 29659 175 ## gi|253564224|ref|ZP_04841681.1| predicted protein - Term 29811 - 29863 5.1 23 6 Tu 1 . - CDS 29925 - 33962 1193 ## COG0642 Signal transduction histidine kinase - Prom 33983 - 34042 9.4 24 7 Op 1 . + CDS 34473 - 37658 2257 ## BF1415 putative outer membrane protein 25 7 Op 2 . + CDS 37665 - 39368 1043 ## BF1416 putative outer membrane protein 26 7 Op 3 . + CDS 39362 - 39859 333 ## BF1484 hypothetical protein 27 8 Op 1 . + CDS 39968 - 41797 1032 ## BT_1035 hypothetical protein 28 8 Op 2 . + CDS 41724 - 42002 70 ## gi|253564230|ref|ZP_04841687.1| predicted protein 29 9 Tu 1 . - CDS 41959 - 42219 102 ## gi|301161476|emb|CBW21016.1| hypothetical protein 30 10 Op 1 . - CDS 42684 - 42845 88 ## BF0421 hypothetical protein 31 10 Op 2 1/0.000 - CDS 42849 - 43046 138 ## COG3303 Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit 32 10 Op 3 . - CDS 43098 - 43493 343 ## COG3005 Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit - Prom 43550 - 43609 5.4 33 11 Tu 1 . - CDS 43634 - 44137 322 ## COG3467 Predicted flavin-nucleotide-binding protein - Prom 44177 - 44236 5.8 + Prom 44081 - 44140 4.2 34 12 Tu 1 . 
+ CDS 44222 - 45919 337 ## BF0358 putative transmembrane protein + Prom 45935 - 45994 3.2 35 13 Op 1 . + CDS 46068 - 46448 264 ## BF0357 hypothetical protein 36 13 Op 2 . + CDS 46508 - 47389 453 ## BF0356 hypothetical protein + Term 47446 - 47498 4.2 - Term 47433 - 47485 4.2 37 14 Op 1 . - CDS 47501 - 48052 498 ## COG0545 FKBP-type peptidyl-prolyl cis-trans isomerases 1 38 14 Op 2 . - CDS 48064 - 49605 1606 ## COG0423 Glycyl-tRNA synthetase (class II) - Prom 49626 - 49685 11.3 + Prom 49568 - 49627 6.1 39 15 Op 1 . + CDS 49790 - 52438 2562 ## COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 40 15 Op 2 . + CDS 52463 - 53332 879 ## BF0352 hypothetical protein 41 15 Op 3 . + CDS 53345 - 54376 1038 ## COG0793 Periplasmic protease - Term 54432 - 54483 2.3 42 16 Op 1 . - CDS 54532 - 56019 1336 ## BF0350 putative outer membrane protein 43 16 Op 2 . - CDS 56048 - 59107 2601 ## BF0349 putative TonB dependent outer membrane exported protein - Prom 59152 - 59211 9.1 + TRNA 59504 - 59588 67.9 # Leu TAG 0 0 44 17 Tu 1 . - CDS 59632 - 59781 84 ## BF0405 hypothetical protein - Prom 60015 - 60074 4.3 45 18 Tu 1 . + CDS 59884 - 60060 80 ## - Term 60457 - 60509 11.2 46 19 Tu 1 . - CDS 60590 - 60940 110 ## BF0402 hypothetical protein - Prom 61096 - 61155 8.0 + Prom 61052 - 61111 5.6 47 20 Tu 1 . + CDS 61236 - 61427 76 ## - Term 61466 - 61527 2.0 48 21 Tu 1 . - CDS 61598 - 62470 440 ## COG3943 Virulence protein + Prom 62832 - 62891 6.9 49 22 Op 1 . + CDS 63031 - 63231 97 ## BF0344 hypothetical protein 50 22 Op 2 . + CDS 63234 - 63524 347 ## BF0343 putative DNA-binding protein + Term 63530 - 63576 10.1 - Term 63576 - 63612 2.7 51 23 Tu 1 . - CDS 63655 - 63873 159 ## BF0396 hypothetical protein - Prom 64072 - 64131 7.9 + Prom 64370 - 64429 2.9 52 24 Op 1 . + CDS 64479 - 67547 3473 ## BF0394 hypothetical protein 53 24 Op 2 . + CDS 67559 - 69079 1423 ## BF0393 hypothetical protein 54 25 Tu 1 . 
+ CDS 69182 - 71482 2393 ## COG1472 Beta-glucosidase-related glycosidases + Prom 71558 - 71617 2.8 55 26 Tu 1 . + CDS 71648 - 73003 1144 ## COG5368 Uncharacterized protein conserved in bacteria 56 27 Tu 1 . + CDS 73190 - 73747 570 ## COG2755 Lysophospholipase L1 and related esterases + Term 73773 - 73810 -0.8 + Prom 73786 - 73845 7.7 57 28 Tu 1 . + CDS 73887 - 76236 1159 ## COG3292 Predicted periplasmic ligand-binding sensor domain Predicted protein(s) >gi|226332038|gb|ACIB01000018.1| GENE 1 3 - 875 462 290 aa, chain + ## HITS:1 COG:no KEGG:BF0439 NR:ns ## KEGG: BF0439 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 290 276 565 565 603 100.0 1e-171 ILYKAYKEGLLMHSTIDYTCSSTQISGMSKNAFESYLFKDGKPMALTSLNKSDEAPFQYG HLSLKDILAVRDKRLAQTIDTVLLYNGRGFTRFNTGMESTSSTGYGVAKYDNEAIPEGFR SQSGKNYTHAPLFWLSVIYLNYAEACAELGNITQDDLDKSINLLKDRAGLPHLNPIVGFS DPANNHGVSDLIWEIRRERRCELMFDNDNRYWDLIRWHQLDKLDTTKYPDIILGANVAND MDGCEANKVGKYIDGSKDGSRIYDKKHYLYPIPTGQIALNPQLAPNNPGW >gi|226332038|gb|ACIB01000018.1| GENE 2 1060 - 2265 492 401 aa, chain + ## HITS:1 COG:BS_ybbC KEGG:ns NR:ns ## COG: BS_ybbC COG3876 # Protein_GI_number: 16077233 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 50 398 52 414 414 240 40.0 5e-63 MLQQELPPNTIFMKKILLIVVLLCSTLFAQAQKSDVIIGAEQTKAYFPILKNKRIAIFSN HTGMVGNKHLLDILLENNFNVVAIFSPEHGFRGNADAGEHVSSTIDSKTGVPILSLYNGK SKKPSEASMKKFDILIVDIQDVGLRFYTYYISMVRLMDACAEYDRKILILDRPNPNGHYV DGPILDMKYKSGVGGLPIPIVHGMTLGELALMVNGERWLPSSRICDVTVIPCKNYTHQTM YRLPIPPSPNLPNMKAIYLYPSICLFEGTPVSLGRGTTLPFQVYGHPNMTGYNYNFTPRS IPGAKNPPQLNKLCHGVNLSNLSDEEIWKQGINLDYLIDAYHNLNMGDRFFRPFFELLVG TDYVRKMIEGGKSADEIKARWKRDVERFKIQRKPYLLYQDN >gi|226332038|gb|ACIB01000018.1| GENE 3 2306 - 3757 879 483 aa, chain + ## HITS:1 COG:sll1087 KEGG:ns NR:ns ## COG: sll1087 COG0591 # Protein_GI_number: 16330938 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: 
Na+/proline symporter # Organism: Synechocystis # 26 424 25 423 512 130 28.0 7e-30 MTPVAVLITIASYFLILFTISYIAGRKADNEGFFVGNRKSAWYVVAFAMIGSSISGVTFV SVPGMVGISNFSYLQMVLGFVTGQIIIAFVLIPLFYRMNLVSIYEYLENRFGTSSYKTGA WFFFISKILGAAVRLYLVCLTLQLLVFEPFHMPFIMNVILTVALVWLYTFRGGVKSLIWT DSLKTFCLIVSVVLCIYYIATDLKLSFTEMFSTVSDSALSHMFFFDNVNDKRYFFKQFLA GVFTMIAMTGLDQDMMQRNLSCKNFKDSQKNMITSGISQFFIILLFLMLGVLLYTFTSHQ GITNPAKSDELFPMIATGGYFPIIVGILFIIGLISSAYSAAGSALTALTTSFTVDILGIK GKTEDTIRKTRKKVHVGMAIVMGIVIFIFNLLNNTSVIDAVYILASYTYGPILGLFAFGI LTKRQVRDRYIPLVSILSPILCFILQKNSETWFHGYQFSYELLIFNALFTFIGLCFLIKK QLT >gi|226332038|gb|ACIB01000018.1| GENE 4 3765 - 6836 1838 1023 aa, chain + ## HITS:1 COG:no KEGG:BF0377 NR:ns ## KEGG: BF0377 # Name: not_defined # Def: exported xanthan lyase/N-acetylmuramoyl-L-alanine amidase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1023 1 1023 1023 2111 99.0 0 MKPILTSLILLLTAGLFPQAAATQELSKELRSQIGDFLNETARKEISVGKIHIDSVNTEG NDLILFANINCSYIPFRTDNITKIYQGIKALLPPELAKRKLQIRTDHHAIEELIPLALRN TRGRKIPTFSYKADTPLITRLSVPYTPTNGLQNRHIALWQSHGFYYESKLARWEWQRARI FQTVEDLYTQSYVLPFLVPMLENAGATVLLPRERDPQTVEIIVDNDRCRDGHSVYSELNG SKMWKNGEEAGFAHLKRTYKDFENPFREGTYRQVETTKKGTVSVAEWIPEIPRAGRYAVY ISYKTVNNSTEDALYTVYHQGGKSQFKVNQQMGGGTWIYLGNFSFGIGKTDCKIVLSNQS AKEGRLVTADAVKIGGGYGNIARSISEEGATVNTKSSDTMITDTYHPKVQVNYPYEISGY PRFCEAARYWMQWAGIPDSVYSDSHGKNDYTDDYKSRGIWVNYLAGGSAANPTEKGLNIP VDMAFAFHSDAGTTYGDTIIGTLGIFHTSAYNGAYANGASRYASRDLCDLVQSNIVKDVR TLYEPEWTRRGMWNQSYYEARVPRVPTMLLELLSHQNFADMRYGLDPRFRFTVSRAIYKG ILQFICSQYKMEYVVQPLPVDHMSLRFEEGNRIKLSWQPVDDPLETTAKADQYIVYTRIG DSDFDNGVIVNSPTYQTVIPSGVVCSFKVTALNKGGESFPSEILSIGKTFNDKGTVLIIN GFDRVCAPADFTADADTLAGFLDELDHGVPYKTDISYIGPMKEFRRQIPWMDDDASGFGD SYGTHETMVIAGNTFDYPAIHGEAILKAGYSFTSCSDESIVHPDSSPKERETQICMNDYK YVDLILGKQCQTKMGRGGIRPLEFKTFSKEMQNAITNYCQAGGNFFVSGAYVASDLWDNR LVKANEEDKKFAMEVLKYKWRVGQAARNGKVKSVASPFPEITGSYTYYQDLNPESYVVES PDALEPAAQGAFTILRYSENNLSAGIAYKGNYKTCVLGFPFEAIRTVTERELLMKAILTF FEH >gi|226332038|gb|ACIB01000018.1| GENE 5 6932 - 8797 1338 621 aa, chain + ## HITS:1 
COG:MTH1485 KEGG:ns NR:ns ## COG: MTH1485 COG1520 # Protein_GI_number: 15679482 # Func_class: S Function unknown # Function: FOG: WD40-like repeat # Organism: Methanothermobacter thermautotrophicus # 292 593 53 342 407 96 24.0 1e-19 MNRRLSIFVIILFFLLPVAARAQVTGSFRFAQLTDIHLNPNNPKPTEDLKRSVEQINATP GVDFVLVTGDLTEEGDRTTMLVVKSILDRLKVKYYVIPGNHETKWSDSGCTAFSEIFGGE RFKFEHKGFLFLGFNSGPLMRMAYGHVVPQDITWMKQEMDKVGKDKPVILVTHYPMQDGD VDNWYDVTDAVRPYNIRTFIGGHYHRNRFLSYDGIPGILTRSNLRDKNGSSGYSIFDITP DSIITYEQRIDEPMKRWTALSLTKSYYNRTGKAVKYPSFSVNKEYPQVKIGWQVQTGVGI YCSPALWKGRVYVGDDLGFLTCYTLKEGRKLWSFQSGKRIVGTPAATDGIVVFGSADHNI YGLDAVTGKERWRVTVAQPVLGAVTIEKGIAYIGGSDSTFRAIRIKNGKVVWTYTGIKGY IETKPLVEGDKVIFGAWDNTLYALNKSNGKELWKWTGGLTRMHFSPAAVWPVAAHGKVFI TDPQRAMTAISLKTGKTVWRTFQSMVRETIGLSANKNQIYSKTMNDSVVCYSTISDTPKE IWASNVGFGYEHAPSMQMEKDGIVFSSTKEGLIFALDASTGQVLWKHKIGNSLINTVLPI SRHQVLFTATSGETGLLEWKE >gi|226332038|gb|ACIB01000018.1| GENE 6 8821 - 10287 1200 488 aa, chain + ## HITS:1 COG:PH0430 KEGG:ns NR:ns ## COG: PH0430 COG0463 # Protein_GI_number: 14590346 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Pyrococcus horikoshii # 247 339 7 97 334 62 41.0 2e-09 MKSTINCFIPYAGAVQAERTVQGLQATGLVKKIYLLVTSPSFDPLPGCELLYVDKLTNSA SMYAIATYSDASYTLLYTKYTSLELGLFALERMIHIAEDSAAGMVYADHYQVTEGKQSNA PVIDYQFGSLRDDFNFGSVLLFKASALKEAAKRMKSDYDFAGFYDLRLKLSQKYPLVHIN EYLYSEVENDTRKSGEKIFDYVDPKNRDRQIEMEEACTEHLQEIGGYLKPEFQKIEFNTG NFPYEASVIIPVRNRIRTIRDAIRSVLSQKADFKFNLIIIDNHSTDGTTEAIDEFKDDER LIHLIPERNDLGIGGCWNLGVHHPLCGKFAVQLDSDDVYAHDGTLQVMVNAFYEQNCAMV VGTYMMTDFDMNMIAPGIIDHKEWTPENGRNNALRINGLGAPRAFYTPILRELKVPNTSY GEDYALGLNFSRQYQIGRVYEVVYLCRRWDDNSDASLDIVKMNAHNLYKDRIRTWELQAR IALNKKQR >gi|226332038|gb|ACIB01000018.1| GENE 7 10284 - 11216 515 310 aa, chain + ## HITS:1 COG:no KEGG:BF0433 NR:ns ## KEGG: BF0433 # Name: not_defined # Def: putative glycosyltransferase # Organism: B.fragilis # Pathway: not_defined # 1 310 1 310 310 647 99.0 0 
MNKEIETLLSEQLSSWETAQNNYAALKRVRIKDVKVNGCLYRIQFNPARIVSSAAKVDSK SIQERKCFLCPANLPPMQKGIPFGEHYQILVNPFPIFPKHLTVPELQHVEQRILHRFSDM LDLATYAEDYIIFYNGPQCGASAPDHLHFQAGNKGFLPIEQEWKEKRGEKVITCKDATLW ALNDYPRATLVIEARSKETAIMLFNTVYQAMPSTSGEDEPMMNVLVWKEQKNWIVCIFPR NKHRPSCYTAEGDTNILISPASVDMGGVFITPQEKDFEKITANDIADILGEVCLRPAAFR ILIERIKQQL >gi|226332038|gb|ACIB01000018.1| GENE 8 11213 - 12568 898 451 aa, chain + ## HITS:1 COG:sll1283 KEGG:ns NR:ns ## COG: sll1283 COG2385 # Protein_GI_number: 16329811 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Sporulation protein and related proteins # Organism: Synechocystis # 70 450 117 389 391 95 25.0 2e-19 MKEPKVQVGILFEPQIKFILLTPYHINGEEVSGKQVVTYDNGHILWQGHSYDELLFEPLH EKSDAFELQDVTIGINFHWERKENQRFIGALKIIVENKKLTGINVIHVEDYLTSVISSEM SATASLELLKAHAVISRSWLLAGLSLPYSKDREKSNTTPEKVPYSTSSFPPLAQEAENKI LIRWYERDAHTHFDVCADDHCQRYQGITRASTDMVRQAISATRGEVLMSEGTICDARFSK CCGGAFEEFQYCWENIRHPYLSKQRDSKKATDLPDLCKEAEAERWIRTSPEAFCNTKDKK VLSQVLNNYDQETTDFYRWKVEYEQEELSKLILKRSGIDYGQILDLVPVERGTSGRLVRL KIIGTKRTMIIGKELEIRRTLSPSHLYSSAFIIDKVDVTNGIPDRFILTGAGWGHGVGLC QIGAAVMGEQGYTYDTILLHYYIGATIDKLY >gi|226332038|gb|ACIB01000018.1| GENE 9 12584 - 13882 1174 432 aa, chain + ## HITS:1 COG:YPO3162 KEGG:ns NR:ns ## COG: YPO3162 COG0477 # Protein_GI_number: 16123324 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Yersinia pestis # 15 399 20 387 492 108 25.0 2e-23 MKNKNISPWAWIPTLYFAQGLPYVAVMTISVIMYKNLGISNTDIALYTSWLYLPWVIKPF WSPFVDLLKTKRWWIVSMQLLVGAGLAGIAFTIPMSNFFQTTLAIFWLVAFSSATHDIAA DGFYMLALNVQDQALYVGIRSTFYRIATIAGQGLLVMLAGGLEIWTGSIKYGWSITFFIL AGLFLAFCFYHKCILPKPDSDKAVVGENSASAIFSGFIETFASFFRKKQAGVAILFMLFY RFPEAQLVKLINPFLLDPIDKGGLGLTTAEVGLVYGTIGIIGLTLGGIIGGICAAKGGLQ KWLWPMAWSLSLTCLTFVYLGYFQPQNFVIINLCVFIEQFGYGFGFTAYMLYLIYYSDGE 
HKTAHYAICTAFMALGMMLPGMAAGWLQELIGYENFFIWVMVCCTATIAVCAFIKIDPNY GKKAEGIPQKTK >gi|226332038|gb|ACIB01000018.1| GENE 10 13924 - 15198 957 424 aa, chain + ## HITS:1 COG:BS_ybbC KEGG:ns NR:ns ## COG: BS_ybbC COG3876 # Protein_GI_number: 16077233 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 4 423 2 414 414 337 43.0 3e-92 MTYRRSILKLLLTFFVFMTSTLLSQGAEPRARPPRIRIKTGIEVLKEQNFKCLEGKRVGL ITNPTGVDNHLISTIDILHEAPNVNLVALYGPEHGVRGDVHAGDKVDNANDSSTGLPVYS LYGKTRKPTPEMLKDIDVLVYDIQDIGCRSFTYISTMGVAMEAAAENNKEFIVLDRPNPI DGLKIEGNVVEDGYISFVSQFKIPYLYGLTCGELALMLNGEQMLSKPCNLHVVKMKGWKR KMDYVQTGLQWIPSSPHIPHPHSAFFYPVSGILGELGYMSIGVGYTIPFQMFAAPWVEAE KLADNLNRLHLPGVIFRPMHLKPFYSVGKEEHLQGVQVHIVDFNKASLSEIQFYVMQEVT ALYPDRAVFDHADKERFHMFDLVSGSKEIRERFSQRNRWEDVRDYWYKDVDDFRRLSQKY YLYK >gi|226332038|gb|ACIB01000018.1| GENE 11 15240 - 18104 667 954 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|167010850|ref|ZP_02275781.1| ribosomal protein L36, putative [Francisella tularensis subsp. 
holarctica FSC200] # 496 940 1 431 436 261 35 8e-69 MERNYSAFLQDIGRFIPRDRLFTDELHRLAWGTDAGFYRLIPQIVIHSINDDEVIEIIYL ADRYNIPVTFRAAGTSLSGQAISDSVLIIAGKGWENYELSPDHEKIYLEPGIVGQRVNEI LAPYGRKFAPDPASIKSAMVGGIVMNNASGMNCGTHANSDKMLLSVHIIFPDGYYLNTDS QESRENFKEDYPEFLQRICELRDQIRSNEKLSARIRHKYSIKNVTGLNILPFLVYDDPFD IIAHLMVGSEGTLAFLAGVTMKTEYDYPYKASAMLYFSDIKEACRAVVAMKRLVNANGEW IVKGAELLDWKSLASVNDPTGEGLTAVLTETKACTQEELNQNIAIIEEALKAFDTYIPVH FTDQPEEYSKYWAIRAGIFPSVGGTRPSGTTCLIEDVAFHIEDLPEATAELQQLIARHGY DDACIYGHALEGNYHFIINQSFSSEAEVKRYEALMNDVIDLVVGKYDGSLKAEHGTGRNM APFVEYEWGEEAYAIMKEVKQLFDPKGLFNPGVIFNDDPQCHIKHFKPLSPLTIGQDTQV TRQIDRCIECGFCEVNCLSCGFTLSSRQRIVIQREISRLKKSGENPQLLETLSELYRYSG NRTCAGDGLCAMSCPMGINTGDLTHILRQAEFPPGSTGYRAGKFAANHFAGIKSTLRPVL SLANAAHSLLGTSTMTSITRKMHSAWGLPQWTPAMPKSYKIRKNDQTPAMNNKVVYFPSC INQTMGLAKDSPVDQPLVKQMLSLLQKAGYEVIFPPKKEKLCCGTIWESKGMLDIADSKS TELEASLWEASEQGRYPVLCDQSPCLHRMRATIQKIKLYEPAEFIYTFLRDKLEFTPTDR PIAIHITCSMRKMGLANILISLAKLCSTQVFIPEEVGCCGFAGDKGFTQPELNTYALRKL RPQLTKAGIGIGYSNSRTCEIGLATNTGIPYVSIAYLVDQCTRPIKQENNLTLK >gi|226332038|gb|ACIB01000018.1| GENE 12 18113 - 19276 1062 387 aa, chain + ## HITS:1 COG:all1887 KEGG:ns NR:ns ## COG: all1887 COG4299 # Protein_GI_number: 17229379 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Nostoc sp. 
PCC 7120 # 8 387 2 375 375 191 33.0 2e-48 MNQTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFI MGISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSGEDISFFS RLYESVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIPYLIAILLIGYFIILINGNG FEYNSSNILSIVDHTVLGEAHMYKDNGIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIH EKIERLFLIGTILTFAGFLLSYGCPISKKIWSPTFAIVTCGLASSFLALLVWIIDVRGYT RWSRFFESFGVNPLFIYVMGAVLSILLGSILIPYDGGSISLHGFVYNAILQPVLGDYPGS LAFAILFVGLNWCIGYILYKKKIYIKI >gi|226332038|gb|ACIB01000018.1| GENE 13 19273 - 20109 775 278 aa, chain + ## HITS:1 COG:no KEGG:BF0427 NR:ns ## KEGG: BF0427 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 278 1 278 278 580 100.0 1e-164 MILIADSGSTKTDWCVVEHGQLIQQIFTKGTNPFFQSEEEISNEIATALIPQLKTNKFEA VHFYGAGCAFPDKIETMRKAIASHLQVSGEIEVSTDMLAAARSLCGHQPGIACIMGTGSN SCYYDGKNIVTNVSPLGFILGDEGSGAVLGKLLVGDILKNQMTPGLKEKFLEQFNLTPAE IIDRVYRKPFPNRFLASFSPFLVQHLDEPVIRELVLNSFKKFLKRNVMQYDYQHAPVHFI GSVAFYYRELLSEACKIMGVHLGTIIQSPMEGLIKFHE >gi|226332038|gb|ACIB01000018.1| GENE 14 20133 - 20978 967 281 aa, chain + ## HITS:1 COG:STM2571 KEGG:ns NR:ns ## COG: STM2571 COG2103 # Protein_GI_number: 16765891 # Func_class: R General function prediction only # Function: Predicted sugar phosphate isomerase # Organism: Salmonella typhimurium LT2 # 25 265 17 257 297 234 51.0 1e-61 MNSNIEKSDKPSFIKISEQPSLYDDLEKKSVREILEDINKEDQKVAIAVQKAIPQIEKLV TQIVPRMKQGGRIFYMGAGTSGRLGVLDASEIPPTFGMPPTLIIGLIAGGDTALRNPVEN AEDNTIRGWEELTEHNINDKDTVIGIAASGTTPYVIGAMHAAREHGILTGCITSNPNSPM AAEADIPIEMIVGPEYVTGSSRMKSGTGQKMILNMITTSVMIQLGRVKGNKMVNMQLSNR KLVDRGTRMIIEELGLEYDKAKALLLMHGSVKKAIDAYKAG >gi|226332038|gb|ACIB01000018.1| GENE 15 21011 - 22894 1149 627 aa, chain + ## HITS:1 COG:CAC0181 KEGG:ns NR:ns ## COG: CAC0181 COG1680 # Protein_GI_number: 15893474 # Func_class: V Defense mechanisms # Function: Beta-lactamase class C and other penicillin binding proteins # Organism: Clostridium acetobutylicum # 49 424 3 349 351 196 35.0 2e-49 
MENKINFSPPSTREGKGVRFLLTTFSILLCSLQAVAQSLPRVAPEQVGMDSHRLLHADEA IHRAIDHKEIPGAVLAVIRHGKMAYLKAYGNKRIYPNVEPMEINTVFDMASCSKSMSTAV SVMILVERGQLRLLDRVSFYLPDFQEWRGENGEKKDIRIIDLMTHTSGLPPYAPVSELQE KYGSPNPKGLMEYISTCKREFKPQTKFQYSCLNYITLQHIIETITGQSLRDFAKENIFDI LGMQYTDYLPTIQQQDGKWINTVACPWMDRIAPTEKQKDGSVLCGQVHDPLARILNGGIS GNAGIFSNANDIGILAAALLNGGEYNGRRILSPLGVKTMCTVPRELTAFGRTPGWDIFSP YASNKGDLFSPNTFGHTGYTGTSIIIDPDNDTAVILLVNAVHPEDRHSIVRLRSLVANAV AASICPPAQVYTDHYYKRFLQFETETPISPKDIVMVGNSLTENGGNWSKRLNKKNIRNRG IIGDEALGICQRLFQILPGTPQKLFLMAGINDVSHDLSTDSVVSLITKVIEKIQTESPRT KLYIQSLLPINESFGRYKTMTGKTDLIPEINRKLEALAKEKKIPFIHLFPLFTEKNSNVM RKELTTDGLHLTEEGYRIWSKALKRYL >gi|226332038|gb|ACIB01000018.1| GENE 16 22906 - 23592 625 228 aa, chain - ## HITS:1 COG:CAC0884 KEGG:ns NR:ns ## COG: CAC0884 COG0664 # Protein_GI_number: 15894171 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Clostridium acetobutylicum # 13 228 8 225 229 81 23.0 1e-15 MVKVEFTDEYQKILWQISLFKDMDNSLQRRLPQELELSVYEVARKEIVLKQDTYCNHLYV LLKGELEVNIVDVAGNLVKVEDIRAPRAFATPHLFGDKNLLPATFTASEDSVLLMATRTS VFKLISSVPDLLHRFLCVTGNCNKCTVTRLRILSYKMLRSRLVYYFMEHKISPDTALLEH NQVQLAEYLGVTRPALSKEINKMMKEGLISINKKVVTLEDMAALKEYI >gi|226332038|gb|ACIB01000018.1| GENE 17 23639 - 24904 1133 421 aa, chain - ## HITS:1 COG:no KEGG:BF0423 NR:ns ## KEGG: BF0423 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 421 1 421 421 840 99.0 0 MRITNLFWTVVLLAPLSVFAQQQKENLFTIDAQIRTRGEYRNGVLNPRPEGEGPTFFVNE RARLSLGYQRDRLQMRLSAQHVGVWGQDPQIDKNGRFILHEAWARLDFSKGLFAQLGRQP LSYDDERLLGGLDWNVAGRFHDALKLGYESKLHKLHLILAFNQNDENRSYGGTYYASGAQ PYKTMQTVWYNGHWAKDFTVSLLFMNTGFETGTEGNGKTANMQTMGTYLVYTPGAWLFNG SAYYQFGKNKADKKVSAYMFSLKAGYKIDPKWSVSLGTDYLSGDPDSKKVSTFDPLYGTH HKFYGGMDYFYASAYNKGLWDKILSVDFKPTKKLSFSLNYHHFSTTYDVMATDGKEGRCL GSELDMQVDYTLMKDVKLTAGYSTMLGTKYMDIVKGGNHKSWQDWGWLTLNINPRILFTK W >gi|226332038|gb|ACIB01000018.1| GENE 18 24937 
- 25728 585 263 aa, chain - ## HITS:1 COG:RSc2985 KEGG:ns NR:ns ## COG: RSc2985 COG0755 # Protein_GI_number: 17547704 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ABC-type transport system involved in cytochrome c biogenesis, permease component # Organism: Ralstonia solanacearum # 114 252 244 388 395 82 35.0 8e-16 MSWDQFVIFAIVALLCWVIGAVAAWRGKRQWMVYTATLAGLAVFFAFILGMWISLERPPM RTMGETRLWYSFFLPLAGIITYSRWRYKWILSFSFILSLVFVCINLFKPEIHNKTLMPAL QSPWFAPHVIVYMFAYAMLGAAAVMAVYLLWIKKKTPEEREMELCDNLVNVGLAFMTLGM LFGALWAKEAWGHYWSWDPKETWAAATWLGYLCYIHFRMNRRQKVRTALVGLLICFVLLQ MCWYGINYLPSAQGTSVHTYNLN >gi|226332038|gb|ACIB01000018.1| GENE 19 25725 - 26966 962 413 aa, chain - ## HITS:1 COG:no KEGG:BF0421 NR:ns ## KEGG: BF0421 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 413 1 413 413 825 99.0 0 MWQKPWGYKEGFAICGGLFLTGTFLQITIGKCELSILSYPMNVCVGVLYLVILLLIYAFS QKSYFIRWMGSCQAAVSSMVSVAMLTVVMGLIRQVKSDVPLLGAESWLGFSQMLSACSFV LLFLWMITLLGLTTIRRIHHFRWCDFPFVLNHLGLFLALTGAILGNADMERLRMTTKTGQ AEWRALDENQKMRELPLAIELQDFTIDEYPPKLMLINNETGEALPSKKPENLLIEDNFHT GRLLDWEIGIEKKIPLAASVVTQDTLNFVEFHSMGATYAVYLKAVNNKTGQQREGWVSCG SFMFPYKAIRLDDQTSLVMPEREPRRFASEVKVYTESGRRDSATIEVNKPFELEGWKIYQ LSYDESKGRWSDISVFELVRDPWLPVVYTGIWMMIAGAVCLFALSQKRKEDNT >gi|226332038|gb|ACIB01000018.1| GENE 20 26970 - 28451 1485 493 aa, chain - ## HITS:1 COG:HI1069 KEGG:ns NR:ns ## COG: HI1069 COG3303 # Protein_GI_number: 16273000 # Func_class: P Inorganic ion transport and metabolism # Function: Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit # Organism: Haemophilus influenzae # 52 490 75 531 538 457 47.0 1e-128 MEKKLKSWQGWLLFCGAMAVVFVLGLVVSSLMERRAETVSVFNNKRVEITGIEARNEVFG ENYPRQYETWKETAKTDFKSEFNGNEAVDVLEQRPEMVVLWAGYAFSKDYSTPRGHMHAI EDITHSLRTGAPMDDKSGPQPSTCWTCKSPDVPRMMEAIGVDSFYNNKWGAFGSEIVNPI GCADCHEPTNMKLHISRPALREAFARQGKDIDKATPQEMRSLVCAQCHVEYYFKGDGKYL TFPWDKGFSVEDMEAYYDEADFADYTHALSKARILKAQHPDYEISQMGIHAQRGVSCADC 
HMPYKSEGGMKFSDHHIQSPLAMIDRTCQVCHRESEETLRNNVYDRQRKANEIRGRLEQE LAKAHIEAKFAWDKGATDVQMAEALKLIRQAQWRWDFGVASHGGAFHAPQEIQRILGHGL DKALQARLAISKVLAQHGYTADVPMPDISTKEKAQEYIGLDMEKERKAKDKFLKTIVPEW LEKARANGRLAKL >gi|226332038|gb|ACIB01000018.1| GENE 21 28475 - 29062 381 195 aa, chain - ## HITS:1 COG:Cj1358c KEGG:ns NR:ns ## COG: Cj1358c COG3005 # Protein_GI_number: 15792681 # Func_class: C Energy production and conversion # Function: Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit # Organism: Campylobacter jejuni # 19 165 20 167 171 82 32.0 4e-16 MKIREYIQWLLPSHKWKVLAIIFLGMVVGGGAFFLYMLRAHTYLTDDPSACVNCHIMGPY YATWFHSSHSRNATCNDCHVPHENPVKKWVFKGMDGMRHVAVFLTRGEKDVLRANKESAE VIMNNCIRCHTQLNTEFVNTGRIDYMMSQVGEGKACWDCHRDVPHGGSNSAASTPDALVP YPDSPTPEWLRKMIE >gi|226332038|gb|ACIB01000018.1| GENE 22 29372 - 29659 175 95 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253564224|ref|ZP_04841681.1| ## NR: gi|253564224|ref|ZP_04841681.1| predicted protein [Bacteroides sp. 3_2_5] # 1 95 1 95 95 191 100.0 1e-47 MNILNSMQQFPDEVSCVAYLKGQKEQSGIAYKRCGYNKAQEVIKSEFDDWHSTKFIIALR VHYRNIEDHLAGIDYQSLFESITQSVLIMFFPRMM >gi|226332038|gb|ACIB01000018.1| GENE 23 29925 - 33962 1193 1345 aa, chain - ## HITS:1 COG:all4963_3 KEGG:ns NR:ns ## COG: all4963_3 COG0642 # Protein_GI_number: 17232455 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. 
PCC 7120 # 827 1104 12 288 294 123 29.0 3e-27 MKVIKTQLPLILIFIIANNLFAINPINTNFHTLTTSDGLADNTIYCIYKDKKGFMWFGTN NGLSKYDNYQFKNYTIGNKFSIPVTKIEKSQAEDLYLLTNDWLTYFNCRNESFVQFHPSI KDQHCQITDFTLSTDSSLWACDKRSLLKLKTILKREKKDIQIQYIHTFPLEDSERFMKLC LSENRKALYLVTNHGRIFQFDTNSKKIVNYAQLPILKAEFGASSMSYQNGKVWISSMVYG IFICDPSLQQIDNLVNQPNAKILSHNDVYNITAISHDLYLAVTWYGYTTIHTSSKNPKTW TTDIYTHMPAWDTQDIETRMISSYFDPNGILWIGTHGGGIIISDRRWDIIKRYQQNCDNE TNSIITDSSDRIYLATYHKGIMRSTKSFCPSDSTLQFVTIPIPQLEPTYFCAVRDKNGLL WFGNKNGRLLCYNTSNDTYTIHPLKTHTPIYSLLFDSQGRFWVGTGEGLLLFNQDTYATE LVPLNQTINQILDIDEDKDGNIWVATSLGVVKIRKKETKWDIQKYEFITATNTAQAILVA TDGQIYVGFKNGLGIIQPNQHASNQFLTTNEGLSSNWINCFVEDQYGNIWIGSNSGITRY DPFQQLYYNYDISKSNKSVSLFKDFIFWGGSKHLVYFSPQQAIATLDAFCKSPVFITNIE VDNQPVKIGKAINGQVILKEAITYTDQITLSHTNRNFSLSFTNLAYSNNRLEYNYRLYPY QTQWIACSDKERISYANLSSGIYTFQVKSINEKSSDKITALQIVILPHWSETWIFRSSIL LFCIAIAYYFIRKFKKQQQRKEHMIHLQHELFVANMMREQEQKSKIEKETFFTQVAHELR TPLTLILAPLTEVIQNIKPAEAIYAKLVLIYKSAQSLHTLVSHLLQVQKIGAQMVKLKLA EVNILELIKRTATPFQELAQTQNIHFSLNLCYEQTTLYIDENKIESAIRNLLSNAFKYTP EQGTIMLSVYHEEKDEKAYCAIQVSDNGAGISLQDQEHIFEPFTTGNNKPNLSTAVGIGL YIVKHTISLHHGMVCLDSKENEGSKFILYIPEGNTHFINDESETDTRPSVPTTYKTIIPP TPKKSVSMTTLSVLVVEDNEDINNYITSLLDDKYRIYQASNGEEGIKIAKEILPSLIISD IMMPIKDGFEFSQEIRSNLSTAHIPIIFLTAKAEDIDIIHATHIGIDDYLTKPFNSEILK AKVDNLISQRKQLKKIYSKLLTLKVPLAEEEQDPHNQFMQQVINIIEANLTNESFNVKYL ASSLNMSQPTLYRRIKENSQQSIIELIRNVRISKAASLLLLKKYSVQEIAEMVGYNDYDT FRKCFIKQFNISPSKYIKESQNNIQ >gi|226332038|gb|ACIB01000018.1| GENE 24 34473 - 37658 2257 1061 aa, chain + ## HITS:1 COG:no KEGG:BF1415 NR:ns ## KEGG: BF1415 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 34 1061 119 1131 1131 722 41.0 0 MLKIYCNLLFKSFSKYIFRGFLCTAVLSLPMADLEAKNEIKNEVADLQTTQQSRKVSGVV LDTNNQPLVGATILQKGTSNGVITNLEGRFTIDLNGNAPVLEISYLGYVTETVKVGSRNT IKVILNEAAEQIDEVVVVGYGKSSVKRLTSAISTVKGDKLTNLPNTNLISSLEGRASGVF IQSAGGEPGALPTISIRGGGDPLYVIDGIPSSKAEFSVLSPSDIESFSILKDAAASAVYG ARAGNGIVMVTTKKGSDGKVKITYNGSYAMSSPTESIDFLENWEVAEAWDRAAIYRGNKP 
SYMKLDNDGSLYWPVGRLDSIKNGIFNTTTGNTDWNELLFRKFAPSQTHNVSINGGNKTT HYYMSARYYTLGGIYNTNISKNDRLNVRMHVDHYFEGIGLRLDGDVSFSQNKVKYPPHGL YTIWTHVARLNALGRCFNVDGNPTGGQENPYVEIDPSAGYRKTDTRYSNYNLAVTWDVPK VKGLSVGALVRYNLYDNYGKDWYANESGVGPVWDWNNDPIDLGKPRLSEKVDRSNEITTE FRIDYNRTFAEAHTIGATAVFNSWQYDSNNLGASRKEYETGVIEQINGGPSSTAENSGTA NEHGRMGLVGRVKYDYKMRYLLGFSFRYDGSDKFPKNKRWGFFPSIEAGWNVDKEPFMTP ILEQGWLDGFKLRFSWGKIGLDNVDDFAYLAVYKKGHDFYEGNLWNSTLYEGGLVSQDLT WYTRNTINIGVDAEFFKRRLTLGFDYFYYRTTGYLASPQDQYTTPLGTGLPKIKTNSAHR RAGYELNMNWSDKIGRDFSYNVGFNLSQYDELWEKKYDEIEADLKNPLRRLTHQKSYFDL LYVSNGLYQNMDEILNNPRPTASSQLRAGDIAYKDMNGDGKIDSNDQVREGSPRFPHTTY GISLGASYKGFSLDVLFQGTGARDMLLESFNRKFNSNQIGLVGSNQFWYPGNNGEILYPR YTDNAQENGGNNNLNSTYWLLDASYFRLKNLKIGYDLKYSLLKKLDFLSQFEIYVSGNNL LTFSPTKKYHIDPEDGRDDDAMGQVGYPVQKIIQFGINLTF >gi|226332038|gb|ACIB01000018.1| GENE 25 37665 - 39368 1043 567 aa, chain + ## HITS:1 COG:no KEGG:BF1416 NR:ns ## KEGG: BF1416 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 3 567 5 565 565 254 32.0 6e-66 MKKYIYFIIAFTSLFASCDVLDTEPLDTYNELVVWKDRGLAENVLKETYWSVLKDLYCSG DNNTECLRTEAWTDNIWTKDNNTVASEGISPTNINDVVGNYNRYSYIRKASLIIENLTDN SNIEEKYCKQYIAEARCLRVMVYSWMARRWGGVMLVDKLMTPNDEMKLPRTSEEETYRFM VEELKKAIPDLPTKANKERFTKGAALTLLTRVALDGGLYDEVIAAGKQLFEGEEAANWTI DPEYRKMFGSYTYPVESKEIQFYFTLGDKKQYCDNLLPMYVAGTMAPGRNTKGSNFNEKI DAWCSNWPSHDLSEAYLAIDVDGDGTAKPYTETARWINAPVKRASIMYSNRDKRMDATIC RDSTLYFTSPIEMNTLGNCFWENTQNHGFMTQSGYMWRKYIYELDNTLPGYQILYNFRYI LLRLGEASLNYAEALGRKGQIKEAVQFMNQTRVQHGGLPALPEDVTADVFWKNYKVERRV ELVLEGDRYFSVIRWAKAENATKVPEFNKRTSAIIIDGEDGTFEVTDDRHGSSTGSDKIF SWPKRMYFPIPESETISNPNINQNPQW >gi|226332038|gb|ACIB01000018.1| GENE 26 39362 - 39859 333 165 aa, chain + ## HITS:1 COG:no KEGG:BF1484 NR:ns ## KEGG: BF1484 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 9 159 1 145 152 108 40.0 6e-23 MVNNMKTYIRQMAQIISVLLMIVLNSSCLTGGLDELESFDDTDLKNFQFEYRWERPLNET NPNNTQLGVVSLATDCKIEGDVIYCTITVPEAGNPSMFTESIRAQVSLDNIVGMATISTA 
ATISPIGNAPILGKFGNFSNECKYVVTAADGKTQKEWTVICKMIK >gi|226332038|gb|ACIB01000018.1| GENE 27 39968 - 41797 1032 609 aa, chain + ## HITS:1 COG:no KEGG:BT_1035 NR:ns ## KEGG: BT_1035 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 2 558 3 522 734 328 35.0 3e-88 MIMRHILLMILMTFNGFYCCFSKVTIQKKNTYIIYTSLNTSRVVDYAAKELEDYLHRTTG AMCVFKRNDVCKANQLIFGTKTSAILPDSFSSGEHPGEDGYIIEIKDNLFSISGGNARGV LYGVYSFLEEYLNCRWYASDAFYIPFKGSVTLQNGKLAYTPPVKWREVYYYDLFDSYLAG VLKLNGNALKQGQVQPNRFAVKGGANAGWGYWCHSLYTMVPPSLYKSHPEYFSEINGKRI PPAGPEGGTQLCLTNPDVLSLSVEQLKKDMVKPMTGLPLWADSLAYYWSVSQMDGNGNCT CSHCMALDKYDGSPSGSILNYVNKVAAQFPDKKIATLAYIYSRKAPKYTKPASNVAIQLC AIETARDGINEPIGTSPLHISFRKDMESWGKICKDIIVWDYVIQFQNLVSPFPNFDVMQS NIQFYTKNHATGIFCQGNREKGGEFAELRGYLLSKLLWNPDCDVEKVMDDFLQGYYGISG KYIKQYISLMESELKKSKLRLSMDGEPEAHRNGYLSEVCINQYNKLFDLAEQSVAHDSLL LARVQKDRMPLMYVQLRLKYGTVEKRRKILDELIHLAEINDIWMFSEVDWRSDQSGNREM FKKKIESQL >gi|226332038|gb|ACIB01000018.1| GENE 28 41724 - 42002 70 92 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253564230|ref|ZP_04841687.1| ## NR: gi|253564230|ref|ZP_04841687.1| predicted protein [Bacteroides sp. 
3_2_5] # 20 92 1 73 73 138 98.0 9e-32 MRLIGDLIKVGIERCLRKRLNPNCKDCIYEILFIFYVLSEGFILFTQKKKNWQKKALAYS KLTFQHIFPRMVSSIRTLIMSDLTDLFWEIVI >gi|226332038|gb|ACIB01000018.1| GENE 29 41959 - 42219 102 86 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301161476|emb|CBW21016.1| ## NR: gi|301161476|emb|CBW21016.1| hypothetical protein [Bacteroides fragilis 638R] # 1 86 6 91 91 168 100.0 1e-40 MTLSSFFCERSCSVQLSGGGIIYDIIVPFSDSGIFATLFYPEDIVTEEGAGHCGAAFFAM GFGDKMHTYAIPPKSLFPKRDLLNPT >gi|226332038|gb|ACIB01000018.1| GENE 30 42684 - 42845 88 53 aa, chain - ## HITS:1 COG:no KEGG:BF0421 NR:ns ## KEGG: BF0421 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 39 1 39 413 87 100.0 1e-16 MWQKPWGYKEGFAICGGLFLTGTFLQITIGKCELSILSYDFIYVFFSLYFVFY >gi|226332038|gb|ACIB01000018.1| GENE 31 42849 - 43046 138 65 aa, chain - ## HITS:1 COG:ECs5052 KEGG:ns NR:ns ## COG: ECs5052 COG3303 # Protein_GI_number: 15834306 # Func_class: P Inorganic ion transport and metabolism # Function: Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit # Organism: Escherichia coli O157:H7 # 2 64 412 478 478 57 41.0 7e-09 MGRLAISKVLAQHGYTADVPMPDISTKEKAQEYIGLDMEKERKAKDKIVPEWLEKAKGNG RLAKL >gi|226332038|gb|ACIB01000018.1| GENE 32 43098 - 43493 343 131 aa, chain - ## HITS:1 COG:Cj1358c KEGG:ns NR:ns ## COG: Cj1358c COG3005 # Protein_GI_number: 15792681 # Func_class: C Energy production and conversion # Function: Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit # Organism: Campylobacter jejuni # 21 128 20 134 171 65 31.0 3e-11 MKVKEYIQWLLPSRKWRVLAIIITGVIVGGGALTLYMLRAHTYLTDDPAACVNCHIMGPY YATWFHSSHSRNATCNDCHVPYENPVKKWVFKGMDGMRHVAVFLTRGEKDVLRANKESAE VIMNNCIQKRV >gi|226332038|gb|ACIB01000018.1| GENE 33 43634 - 44137 322 167 aa, chain - ## HITS:1 COG:CAC2475 KEGG:ns NR:ns ## COG: CAC2475 COG3467 # Protein_GI_number: 15895740 # Func_class: R General function prediction only # Function: Predicted flavin-nucleotide-binding protein # Organism: 
Clostridium acetobutylicum # 8 159 5 154 154 97 42.0 8e-21 MEYINDLIRRKDRLLSEDEAMSLLENGEYGVLSISSTEEGVYGIPINYVWDKQQSIYFHC APEGHKLCILQNCNQASFCVVGRTRVVSNQFSTEYESIVINGTMICQLEETEKRKALELL LDKYSPEDKKVGLKYIEKSFHRTHVLKLVIKVISGKCKKINQHCQDY >gi|226332038|gb|ACIB01000018.1| GENE 34 44222 - 45919 337 565 aa, chain + ## HITS:1 COG:no KEGG:BF0358 NR:ns ## KEGG: BF0358 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 565 1 565 565 956 99.0 0 MRQYLLNIILLFLGFTIIYSCSRHQQINRTIFLADSIMEYQPDSAFKLLKTINQTDLSVS ENAKYALLLAQAQDKAGHQLINDSLILIAINHYDHLSKDNNKAKAYFYLGRFYQNNNDYA KAINSYLIAEKATSDHDTLLTLIYDNLGTCYKNQDFYDKALEVYKDAYYIYKQYNSKNIL YPLRGMASIYAIQEQFEKALKYYQTALTIASSTNDSTWQSILFCDISRIYDNKNLYEAAY SYIVRSIQYAPRSSDLSAMYFWKGEILHNLNQLDSAFYYINLAKKSSDLNTQASAYQALY EIKKEQGELNDAILYNDTSLILYDSIQDLNHSAEISHILKQHATETLQQAEVIKRQKHTA FLIVTTLLLIACITFIFLYKDNKRKKVYIKIQCELRNNQIEKDELKEKIKTLIDSNTYIA ERNKELKKEELKQQQVELWKRTLQICTRLFHTSTSYKKLHAIETAKFKKEREEKQKEINS IQKEINEVFIEAIQELREQYPKLTQEDLFYCILQYLRLSTSTIKFCMRVESNQALTQRKY RIKKQISPQTFSIIFNESSPSEGVL >gi|226332038|gb|ACIB01000018.1| GENE 35 46068 - 46448 264 126 aa, chain + ## HITS:1 COG:no KEGG:BF0357 NR:ns ## KEGG: BF0357 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 126 1 126 126 242 100.0 3e-63 MKKLTCLITLFLFISVGYINAETMASRKKIKMKVETQHHQRSLPPPCPAEAFICGNTVDL IFRETNKTAVVTIMNLDTGEAIHYNVSTNDCSISIDLGNNQSESNYNIELILDGKAYTGE FTTNEI >gi|226332038|gb|ACIB01000018.1| GENE 36 46508 - 47389 453 293 aa, chain + ## HITS:1 COG:no KEGG:BF0356 NR:ns ## KEGG: BF0356 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 293 1 293 293 523 100.0 1e-147 MKTISKLFLLLSIILLSSCDETELDQTVTNQVNPQKEEDSRIVFNSAKEFYNTLDTLSYM TYEEQTEWIRASGIKYPLYKDLEFCEDEIMTEMPRAFQALFNHKMEMQINDTVIAFEKGN MYVKSIKEKILPVPVLYGQVGVNQEESEVTTRTVYETKYGKIGTSYQYEFKIPEGKSKYK YVHELKSVIIKENLPPYKSWSNLFLVLKLEWKGKKKWKVAEKEERNISIDLNVCKRNLAK 
VKYNQEIRIDVQNMDSKSHSINIEGYIIHEVVGIPSTKMFNGWSYPLVEWQPR >gi|226332038|gb|ACIB01000018.1| GENE 37 47501 - 48052 498 183 aa, chain - ## HITS:1 COG:PA4572 KEGG:ns NR:ns ## COG: PA4572 COG0545 # Protein_GI_number: 15599768 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerases 1 # Organism: Pseudomonas aeruginosa # 74 181 109 205 205 79 38.0 3e-15 MNKKIYLLPLLLLALIFVSCEETKEATKFDNWRARNEGYIDSLKTVFDEKTDPELKAFEP EINPKLRIYYKKKISNDTGAIPLYTDSVNVFYRGSFIFGETFDQNFTGADPGPFDSPTKF VIQTFITVGGVSGWAEILQRMRVGERWLVYIPWELAYGASGTDDIPGYSTLIFDMQLEEI LDE >gi|226332038|gb|ACIB01000018.1| GENE 38 48064 - 49605 1606 513 aa, chain - ## HITS:1 COG:SA1394 KEGG:ns NR:ns ## COG: SA1394 COG0423 # Protein_GI_number: 15927145 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Glycyl-tRNA synthetase (class II) # Organism: Staphylococcus aureus N315 # 10 502 8 460 463 473 50.0 1e-133 MAQEDVFKKLVSHCKEYGFVFPSSDIYDGLGAVYDYGQMGVELKNNIKKYWWDSMVLLHE NIVGIDSAIFMHPTIWKASGHVDAFNDPLIDNKDSKKRYRADVLIEDQLAKYDDKINKEV AKAAKRFGEAFDEAQFRSTNGRVLEHQAKRDALHERFAKALNDNNLEELRQIIVDEEIAC PISGTKNWTEVRQFNLMFSTDMGSTADGSMKIYLRPETAQGIFVNYLNVQKTGRMKVPFG IAQIGKAFRNEIVARQFIFRMREFEQMEMQFFVRPGSELEYFKKWKEIRLKWHKALGFGD DHYRFHDHDKLAHYANAATDIEFLMPFGFKEVEGIHSRTNFDLSQHEKFSGKSIKYFDPE LNESYTPYVIETSIGVDRMFLSIMSAAYCEEQLENGESRVVLKLPAALAPVKLAVMPLVK KDGLPEKAREIIDNLKFHFHCQYDEKDSIGKRYRRQDAIGTPYCVTVDHQTLEDNCVTLR NRDTMEQERVAISELNNIIADRVSITSLLKTIQ >gi|226332038|gb|ACIB01000018.1| GENE 39 49790 - 52438 2562 882 aa, chain + ## HITS:1 COG:BB0035 KEGG:ns NR:ns ## COG: BB0035 COG0188 # Protein_GI_number: 15594381 # Func_class: L Replication, recombination and repair # Function: Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit # Organism: Borrelia burgdorferi # 37 667 11 626 626 429 38.0 1e-119 MSEENNEITEGHSDYKPADSHNESIKHQLTGMYQNWFLDYASYVILERAVPHINDGLKPV QRRILHSMKRMDDGRYNKVANIVGHTMQFHPHGDASIGDALVQLGQKDLLVDCQGNWGNI 
LTGDGAAAPRYIEARLSKFALDVVFNPKTTEWKLSYDGRNKEPITLPVKFPLLLAQGVEG IAVGLSSKILPHNFNELCDASISYLRNEEFKLYPDFQTGGSIDVSKYNDGERGGAVKIRS KINKVDNKTLAITEIPYGRTTTSVIDSILKAVDKGKIKIRKVDDNTAANVEILVHLAPGT SSDKTIDALYAFTDCEVSISPNCCVIDDQKPHFLTISHVLRKSADNTLSLLRQELEIKKD ELQENLHFASLEKIFIEERIYKDKEFEQSKDMDAACEHIDRRLTPFYSQFIREVTKDDIL RLMEIKMGRILKFNSDKAEEAIARMNEDIAEINNHLANIVEYTIQWYRMLKEKYGKNFPR RTELRNFDTIEAAKVVEANEKLYINREEGFIGTSLKKDEFVACCSDIDDVIIFYRDGRYM VTPVADKKFVGKNVIYVNVFKKNDKRTIYNVAYRDGAEGTHYIKRFAVTSIVRDREYDVT QGKPDSRISYFSANPNGEAEIIKVTLKPNPRVRRIIFERDFSEVTIRSRQSQGVILTRLP VHKIVLKQRGGSTLGGRKVWFDRDVLRLNYDGRGEYLGEFQSDDNILVVLNTGEFYTSNF DLSNHYEDNVSIVEKFDPNKIWTVALYDADQQNYPYLKRFCFEATTRKQNYLGENKHNRL ILMTDEYYPRLEIIFGGHDSFRDPVVVDAEEFIAVKGFKAKGKRLTTYTVETINELEPTR FPDPPQNNEEDDTGEEPENLDPDSDKTENDILDEMTGQMKLF >gi|226332038|gb|ACIB01000018.1| GENE 40 52463 - 53332 879 289 aa, chain + ## HITS:1 COG:no KEGG:BF0352 NR:ns ## KEGG: BF0352 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 289 1 289 289 545 99.0 1e-154 MKVKTSILLLLMIATALPSFGGNGNGNGTDSLQATRYVTRATMYGVGYTNVFDTYLSPQE YKGIEFRISRETMRMTTLGDGNVSVQNFFQANLAYTHNRVDNNNTFAGLVNWNYGLHYQF RITDNFKLLAGGMGDFNGGFVYNLRNTNNPASARAYINLDASGMAIWHTKIKNYPLALRY QVNLPVIGVMFSPHYGQSYYEIFTLGHASGVVRFTSLHNQPALRQMLSVDFPIRYTKMRL SYLCDLQQSKLNGIKTHTYSQVFMVGFVHDLFRIRNKNGTPLPPAVRAY >gi|226332038|gb|ACIB01000018.1| GENE 41 53345 - 54376 1038 343 aa, chain + ## HITS:1 COG:CC2028 KEGG:ns NR:ns ## COG: CC2028 COG0793 # Protein_GI_number: 16126271 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Caulobacter vibrioides # 57 329 146 439 462 82 25.0 8e-16 MKNPIIYRILQQGFCLIFCLGLLSSCIREEEFVNNPQGNFEQLWKIIDEQYCFLDYKQID WDEIHTRYQKLITPNMGSEGLFEVLSEMLYELQDGHVNLASAHNVSYYDAWYQDYPRNFR ADLLEDSYLGRASTDYRTAAGLKYKILKDNIGYIRYESFADPVGNGNLDEVLSYLSVCNG LIIDVRDNGGGNATNSARIASRFTNEKILTGYISHKTGTGHNDFSKPYAIYLEPANGVRW QKKVVVLTNRRSFSATNDFVNHMRCLPNVTTIGDKTGGGSGMPFTSELPNGWSVRFSASP 
HFDAEMNHIEFGIEPDIKADMLQEDELRGKDTLIEMARRFLSE >gi|226332038|gb|ACIB01000018.1| GENE 42 54532 - 56019 1336 495 aa, chain - ## HITS:1 COG:no KEGG:BF0350 NR:ns ## KEGG: BF0350 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 495 1 495 495 1008 100.0 0 MKRTLIYMFAAMSLVGSSCSGDWLNLNPSTSVTTPQAIRTLEEAQIALNGIYRIAASHSY YGDNYLYYADCRGEDVQARVSKGAGKRVSPYYEFNVTADDALNITRVWNQPYSVIHQANS LLERIESGAVVTDDAVALNCIKAEALALRGLALFDLTRLFAMPYTLNNGTSLGVPIEIKT TLPTHQPARNTVAECYQQVIDDMTGALKLSALSADKKNGYLNVWSVKALLSRVYLYMNDN EKALALAKEVMNNGGLYQLFTHDEYPTVWGKDFSSESLFEFYYTLSEPDGGTGGEGAPMV YADNVKDWNNLVLTKAFLDLLGEDPDDVRHSLNRLPEKPEEDILPEGSKGYPKYLNKYPG KTGDNPQDNDICIIRLSEVYLNAAEAAFKLGGAENLKFSLDCLNAIVSRANPVKSVKETE LSLERILKERRKELVGEGHAFFDAMRNGLSVSRTGGWHLPSVAAAAVISPSDPRVALPIP QAEIDANPNMVQNPH >gi|226332038|gb|ACIB01000018.1| GENE 43 56048 - 59107 2601 1019 aa, chain - ## HITS:1 COG:no KEGG:BF0349 NR:ns ## KEGG: BF0349 # Name: not_defined # Def: putative TonB dependent outer membrane exported protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1019 1 1019 1019 2014 99.0 0 MRNLKAILIVLLCLCGIHVQAQQLEIKGLVTSSEDKQPLIGATVAVKGVPGRGVVTDMDG RYTLKIEPSDKILVISYIGMKSSEVKVPKNGVLNVVLQPESLNLDEVVVTGYGNFSKSSF TGSANTLRADMLKNVPVLSVEQKLQGMTTGVNITSGSGQPGANQSIRIRGMGSFNASNEP LFVIDGIPVTSGSMGAGTGADAAYMNNAKTNVMSTLNPADIENITVIKDAAAASLYGSRA ANGVILITTKKGAVGRTKVTLSASGGFSNAAVNLRPTLNGEQRREMIYEGLYNSAVDKGL QSPEAYANANIDTYAGIPELGYTDWRKELIRTAHNQNYEVTASGGNERTTFYASLGFNRQ EGLVENSNLDRYSARLNMTHKIGSRVEVGGNMMFTQISQEMNEERGSNINPFLCVAVSAT PSMPVRDVSGNYVGSYAGTNVNPLRDIRTDYNRSRMTRMFSTGYASVDIIKGLKLKETLS YDYTVQKDSRYLNPLSGAGPKSGSDAQTSKGFTEYGKLLSSTSLNYTHTFAAKHHLDILA AYELESYQSDKAMGEKAKLPSDVLLEPDNAAVLKSFVSSTQAYRMISYLSRLNYDYDNRY YIAGSYRRDGSSRLAPESRWGDFWSVSGMWHLSSEPFMEAVKPVLNDVKIRASYGVNGNQ PGAFYGYMGLYSYGQNYMGAAGSYESAQANPKLKWEKNYNLNIGLSLTFIDRIFVNLEYY NRDTKDLLYNRPISSTTGFLNYLANIGQLNNKGVEFELRTINFAGPDFNWTSVLNLTHNR NKIVALDGDIKQSVEGSWFIHKIGLPYNSFYVKEFAGVDPSTGKGLYYLNTQDEKGNYNR 
EMTDDASKAQAIPYKSADPKISGGFTNILSYKWFDLGLTFTYSLGGYSFDKTGTLIETDG SKEKSYNLPVYALDRWQKPGDRTDVPRFVLEQGAGPQNSSRYIHSTDHIRLKNLTLGFTL PGRWVQKALIENARVYFSGSNLLTWAKWKQYDPEVPVNGEVFCEAPPMRTFNFGVEITF >gi|226332038|gb|ACIB01000018.1| GENE 44 59632 - 59781 84 49 aa, chain - ## HITS:1 COG:no KEGG:BF0405 NR:ns ## KEGG: BF0405 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 49 27 75 75 100 100.0 2e-20 MLIEERYKDEDTGSGGVNSLPKPKLSYLADVCFFVCIFRITYLLQTETR >gi|226332038|gb|ACIB01000018.1| GENE 45 59884 - 60060 80 58 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLKQSFGGQKSYFLKILIFRWFSLRFASSPHGWTLQAKAGILAFLSTIKEGLHFRKLN >gi|226332038|gb|ACIB01000018.1| GENE 46 60590 - 60940 110 116 aa, chain - ## HITS:1 COG:no KEGG:BF0402 NR:ns ## KEGG: BF0402 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 116 23 138 138 224 99.0 9e-58 MGKPIIKILYFHRRRTNASGWYRLEYIFQRGKKTLRYVLSTRDPEYLVCFCSPCMIERSE MIVFVAELLLEIHKVKYSHLKVTPSYAYATPKYKSRINQMCKKSRYHKTPVIIDGL >gi|226332038|gb|ACIB01000018.1| GENE 47 61236 - 61427 76 63 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKHFYRWDCLLNTVSIGRFNENRTVLLTRLNYINFNAKQRKKTYADPLQYQAREQNRLSK VFK >gi|226332038|gb|ACIB01000018.1| GENE 48 61598 - 62470 440 290 aa, chain - ## HITS:1 COG:STM3755 KEGG:ns NR:ns ## COG: STM3755 COG3943 # Protein_GI_number: 16767039 # Func_class: R General function prediction only # Function: Virulence protein # Organism: Salmonella typhimurium LT2 # 4 136 7 138 345 115 41.0 1e-25 MANTNMEHGEIILYQPDNTIKLEVRIENETVWLTQAQIVNLFQSSKANISEHIRNIYDSD ELSAESTVRKFRTVRMEGNRKVTRILEYYNLDMIISVGYRVNSKRGVQFRQWSTGVLKEY LLKGYAINQRVEQLENKANTHDRQLEELTNKVDFFVRTSLPPIEGVFFNGQIFDAYVFSA QLIKFAKSSLVLIDNFVDESVLLLLSKRLPGVTSIIYTKQITPQLELDLTKHNSQYPPID IRTYQHAHDRFLIIDNLEVYHIGASLKDLGKKLFAFSKMEMPAKIITDLL >gi|226332038|gb|ACIB01000018.1| GENE 49 63031 - 63231 97 66 aa, chain + ## HITS:1 COG:no KEGG:BF0344 NR:ns ## KEGG: BF0344 # Name: not_defined # Def: hypothetical protein # Organism: 
B.fragilis_NCTC9343 # Pathway: not_defined # 1 66 29 94 94 104 98.0 8e-22 MESTDGLYEMRITLGSDIFCVFCFFDKGRLVVLLSGFQKKTQKTPKKEIDKAVRLIAQYY DDKKRR >gi|226332038|gb|ACIB01000018.1| GENE 50 63234 - 63524 347 96 aa, chain + ## HITS:1 COG:no KEGG:BF0343 NR:ns ## KEGG: BF0343 # Name: not_defined # Def: putative DNA-binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 96 2 97 97 152 100.0 4e-36 MDITTLDQFKDEIYGVKGTPRRDNLERELETLRIGVQIRNARQKKEMTQAQLAERINKKR TFISKVENDGGNLTLKTLIDIVERGLGGKLNIEVKI >gi|226332038|gb|ACIB01000018.1| GENE 51 63655 - 63873 159 72 aa, chain - ## HITS:1 COG:no KEGG:BF0396 NR:ns ## KEGG: BF0396 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 72 37 108 108 132 100.0 5e-30 MANIKTDVGQVFPAVEGTTSADARLRLNRLQGTAPSDWTANAFVDYCTPLASALSFYPDV TISTPRYPRMSE >gi|226332038|gb|ACIB01000018.1| GENE 52 64479 - 67547 3473 1022 aa, chain + ## HITS:1 COG:no KEGG:BF0394 NR:ns ## KEGG: BF0394 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1022 1 1022 1022 2001 100.0 0 MVKFNAMKTKPLRLLRQKRASWLKSVFVLTCLLLSTSVAMAQTKTVTGTVTDSFNEPLIG ASILVKGTSTGAVTDMDGKYSISVTPNDVLVFSYVGYEKQEIKVGQQTVINVTLKDDSQM LAETVIIGYGSAKKRDLTGSITNIKGSEIANKPATNPLSSLQGKIAGVQIVNSGQAGADP EIRIRGTNSINGYKPLYVVDGLFNDNINFLNPEDIESMEVLKDPSSLAIFGVRGANGVII VTTKRAKEGQTLVNINTSFGWKSVVDKIKMVNAPQFKELYNEQMANQGNALFDFSNWNAN TDWQDEIFQTGFITNNNVSITGASEKHSFYLGVGYSHEQGNIKHEKYSKVTINASNDYKI TKDIKVGFQFNGARMLPADSKTVLNAIRTTPVAPVYNEEYGLYTSLPEFQKAQMNNPMVD VNLRANTTRAVNYRASGSIYGEVDFLKHFTFKAMFSMDYATNDSRTYKPIIKVYDATVAG NVATLGNGKTEVSQNKQNETKVQSDYLLTYQNSFADGTHNLTATAGFTTYYNSLSGLDAS RGQGIGLVIPNNPDKWFVSIGDLATATNGSTQWERTTVSMLARIIYNYKGKYLFNGSFRR DGSSAFAYTGNQWQNFYSIGAGWLMTEEEFMKDITWLDMLKLKGSWGTLGNQNMDKAYPA EPLLENAFAAVFGKPAIIYPGYQLAYLPNPRLRWEKVEAWETGFETNMFRNRFHFEGVYY KKNTKDLLATVPGLSGTVPGIGNLGEIENKGVELMASWRDQIGDWGYSVSANLTTISNKV KSLVQDGYSIIAGDKSQSYTMAGYPIGYFYGYKVAGVYQNQAEIDASPVNTLATVTPGDL 
KFADVNGDGKITPDDRTKIGDPTPDVTYGISLGLSYKNWELSMEMMGQGGNQIYRTWDNY NWAQFNYMEQRLDRWHGEGTSNTQPLLNTKHAINSENSEYFIEDGSFFRIRNLQLAYSFD KTLLSKIRMQALKVYVNIQNLKTWKHNTGYTPEIGGSAIAFGVDNGTYPVPAVYTFGINL TF >gi|226332038|gb|ACIB01000018.1| GENE 53 67559 - 69079 1423 506 aa, chain + ## HITS:1 COG:no KEGG:BF0393 NR:ns ## KEGG: BF0393 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 506 1 506 506 1035 99.0 0 MKLNKYILPIALAVSALSFCSCNDFLDRHPQGQFTEDDNPGALAEGKVFNIYTMMRNYNI TAGIPAFAIEYFRSEDSEKGSTASDGADQAAMYDDFQYNASNGLIKAYWSQNYAVIYQCN DVIETIEKGNLTEENDLRNKGEALFFRAYCYFNLVRAFGEVPLVTFKVNDASEANVPKTT AEEIYKQIDSDLTQAEGLLPRQWQSAYLGRLTWGSARALHARTYMMRNDWQNMYTAATDV MNSGQYNLNTPYDVIFTDEGENSSESVFELQCASTAALPASDKIGSQFCEVQGVRGSGQW DLGWGWHMGTELMGEAFEPGDPRKDATLLYFRRSDTDPITPENTNKPYGESPVSQADGAY FNKKAYTNPALREEFTRHGFWVNIRIIRYGDVVLMAAESANELGKTGEASNYLEMVRARA RGNNPDILPKVTSLDQTVLRDAIRHERRVELGLESGRFYDLVRWGIASQVLHAAGKTGYQ PKNALLPLPQDEIDKSKGVLVQNPDY >gi|226332038|gb|ACIB01000018.1| GENE 54 69182 - 71482 2393 766 aa, chain + ## HITS:1 COG:STM2166 KEGG:ns NR:ns ## COG: STM2166 COG1472 # Protein_GI_number: 16765496 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Salmonella typhimurium LT2 # 25 765 31 764 765 691 47.0 0 MNKKYSLLILLALLLPLGLQAQKPPQDMDRFLDNLLKRMTLEEKIGQLNLPVTGEITTGQ AKSSDIAAKIKRGEVGGLFNLKGVDKIRDVQRLAVENSRLGIPLLFGMDVIHGYETIFPI PLGLSCTWDIPAIEESARIAAVEASADGISWTFSPMVDISRDPRWGRVSEGSGEDPFLGA LIARAMVRGYQGKDMSRNDEIMACIKHFALYGAAEAGRDYNTVDMSRQRMFNDYMLPYQA GVEAGAGSVMASFNEVEGVPATANKWLMTDVLRGTWGFNGFVVTDFTGISEMIEHGIGDL QTVSARAINAGVDMDMVSEGFIGTLKKSVEEGKVSVETVNTACRRILEAKYKLGLFDNPY KYCDLKRPARDIFTKEHRAAARKIAGESFVLLKNEGLSPTLAPVLPLSPTGTIAVIGPLA NTRSNMPGTWSVAAVLDKSPSLVEGLTEWVGNQGKILYAKGSNLIGDAAYEERATMFGRS LNRDNRTDQQLLDEALKIASQADVIVAALGESSEMSGESSSRTNLNLPDVQHTLLEALLK TGKPVVLVLFTGRPLVLNWEQEHVPAILNVWFGGSEAGPAIGDVLFGAVNPGGKLTMTFP KSVGQIPLYYAHKNTGRPLKEGKWFEKFRSNYLDVDNDALYPFGYGLSYTTFRFSDITLN 
RSSIGMDNELVASVTVTNTGDRAGSEVVQLYIRDLVGSVTRPVKELKGFEKIYLQPNESR TVRFTIAPEMLKFYNADLKFVAEPGDFDVMIGPDSRNVKTARFTLR >gi|226332038|gb|ACIB01000018.1| GENE 55 71648 - 73003 1144 451 aa, chain + ## HITS:1 COG:AGl3503 KEGG:ns NR:ns ## COG: AGl3503 COG5368 # Protein_GI_number: 15891871 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 41 447 9 404 425 313 39.0 4e-85 MNIQLVTHNSKLLFVLFALFLFVACKPKEKPSSATSLTDDALMDTVQRRTFNYFWDAAEP NSGLARERYHMDGEYPAGGPEIVTSGGSGFGIMAILAGIDRGYVSREEGLRRMEKIVGFL EKADRFKGAYPHWWNGETGHVQPFGQKDNGGDLVETAFLMQGLLAVHQYYAEGSAEEKKL AGRIDKLWREVDWNWYRHGGQNVLYWHWSPEYGWEMNFPVHGYNECLIMYILAAASPTHG VPAAVYHEGWPQNGAIVSPHKVEGIELHLRYQGGEAGPLFWAQYSFLGLDPVGLKDEYCP SYFNEMRNLTLVNREYCIRNPKHYKGYGPDCWGLTASYSVDGYAAHGPLERDDRGVISPT AALSSIVYTPDQSLQVMHHLYEMGDKVFGPYGFYDAFSETADWYPKRYLAIDQGPIAVMI ENYRTGLLWKLFMSHPDVQNGLKKLGFNVKK >gi|226332038|gb|ACIB01000018.1| GENE 56 73190 - 73747 570 185 aa, chain + ## HITS:1 COG:AGpA668 KEGG:ns NR:ns ## COG: AGpA668 COG2755 # Protein_GI_number: 16119683 # Func_class: E Amino acid transport and metabolism # Function: Lysophospholipase L1 and related esterases # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 9 177 36 203 217 80 28.0 2e-15 MLKDTSCDVLFLGSSSINLWDNIYRDMAPLKILRRSYGGAALRDMLYNYDVIARRYHPRS IVIYVENDLAGTPEDLTVGETFDFFRLLTNRLQRDYPDIPIFILSYKPSLARKEMIPKHE IINALLQEYASKREGLTYIDVASCLYDNNGKLRKDIFKQDGLHMNQNGYDLWTAILKPKI LESIR >gi|226332038|gb|ACIB01000018.1| GENE 57 73887 - 76236 1159 783 aa, chain + ## HITS:1 COG:XF1330_1 KEGG:ns NR:ns ## COG: XF1330_1 COG3292 # Protein_GI_number: 15837931 # Func_class: T Signal transduction mechanisms # Function: Predicted periplasmic ligand-binding sensor domain # Organism: Xylella fastidiosa 9a5c # 24 736 28 733 740 115 23.0 4e-25 MNRHKLAFSLIGIFIVQLSYAGYFKHIGREEGLSQSSVMAIYQDKLGRMWFGTREGVNIY NSNKMAVYKAWIQNGNRPDQKILIGNEVSAITGSQNGDVFLIVDHALLKYDIRKETFERL RQGSVYALTSHAGEIWCAGHDSIFRYNPQNNQLDFQLKTGISSINYLTINGNRFYIGAKE 
GLYTTENKGRVQCLIPKVDVYRIFQSSCQELWVGCRTQGLYRINRNGRINRIPYDPSSPN GISSEQIREFVEDQQGNIWFGTFDGLQKYDPSTQTYSLIKQEQRLGGLSHSSIFSLYQDV QGTIWIGSYYGGVNYFNPDNNAFNYYTYNPDRSDCLNYPFAGAMTEDKDHHLWICTDGGG LACLDRQAGHFTTYTAGGPNSLPHNNLKSICYDPKRDCLYIGTHMGGLSRFDRKTGRFYN YLNHSTKGLKEPNDVIFQVSFYNDQLIVSARNGVFSMNPDTNEFRLLYDGYYYQTFTIDP KGFLWLSAGTNLYSINLKHPEEVKSFSLPASIGQFGISKILKGNNQYLYIATLGSGLFCY NEQTQTCINYTPEQNQLLSNYCYNLLQTSTDNILITSDRGITLFNPTTESFRSIELDNGL SLSSIINGCGVWMCSDHTIFIGGTGGLSSFLEKDLNKEYPKPKLYFSSLSVNNARISPDD KSRILTEGLPFVREINLNATQNNLTIEFASSNYVDILNNTWYEYQLEGFDKQWSLTSQTS LKYTNLDPGDYVLHVRQKGNSLKMRKAQEILLQIHINTPWYLTWWAWLSYITISISVTYF IWR Prediction of potential genes in microbial genomes Time: Tue May 17 22:32:56 2011 Seq name: gi|226332037|gb|ACIB01000019.1| Bacteroides sp. 3_2_5 cont1.19, whole genome shotgun sequence Length of sequence - 1802 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 155 - 1672 880 ## COG0642 Signal transduction histidine kinase - Prom 1735 - 1794 2.4 Predicted protein(s) >gi|226332037|gb|ACIB01000019.1| GENE 1 155 - 1672 880 505 aa, chain - ## HITS:1 COG:BS_resE_4 KEGG:ns NR:ns ## COG: BS_resE_4 COG0642 # Protein_GI_number: 16079368 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Bacillus subtilis # 18 199 88 269 269 114 34.0 6e-25 MLQKNTIPPSLHNSIFRIRKHAQQMKLLISELLDFRKFDQNYIQLKLSEQSLNTFLEEVY LSFSAYASQKSISYHLKLLEQDISIWIDDWQMRKVLFNLLSNAFKHVPDKGEISILTSTT PDQVVIAVKDSGNGISKEEQERIFDRFYQADNRNKALHVGTGIGLALTKSIIQLHHGTIE VESESNEGSCFIVKLPKTRDCFEKDTEVVFLESPEKEPMVQENTIPDENFMKKDDFTFET PLIDEREEKRKVLLVEDNMELLQVLKEIFSPLYQVVTAANGEEGLKQVFAEVPDLIVSDV MMPVMTGTEMCLKIKNNISLCHIPVVLLTALDTVDQNIEGLRRGADDYITKPFNAKILIT RCNNLIRNRLLMQSRFAKDQILEINLLAANPIDKGFLDRVIKVVDKHIDNEDFDIGMLCQ ELGMGRTLLHTKFKALTGMTPNEFILNHRLKIASLMLKNEPYLQVAEISDRLGFGSPRYF SRCFKNQYNVTPMEYRKGAKQENLK Prediction of potential genes in microbial genomes Time: Tue May 17 22:34:11 2011 Seq name: gi|226332036|gb|ACIB01000020.1| Bacteroides sp. 3_2_5 cont1.20, whole genome shotgun sequence Length of sequence - 279509 bp Number of predicted genes - 211, with homology - 207 Number of transcription units - 106, operones - 50 average op.length - 3.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 210 - 269 4.9 1 1 Op 1 . + CDS 343 - 3696 2924 ## BF0387 hypothetical protein 2 1 Op 2 . + CDS 3714 - 5561 1606 ## BF0386 hypothetical protein + Prom 5767 - 5826 5.3 3 2 Op 1 . + CDS 5913 - 7820 1370 ## BF0332 hypothetical protein 4 2 Op 2 . + CDS 7824 - 9035 897 ## BF0331 putative exported hydrolase 5 2 Op 3 . + CDS 9056 - 9967 717 ## COG1524 Uncharacterized proteins of the AP superfamily 6 2 Op 4 1/0.062 + CDS 9998 - 11533 1130 ## COG3525 N-acetyl-beta-hexosaminidase 7 2 Op 5 1/0.062 + CDS 11590 - 13677 1614 ## COG3250 Beta-galactosidase/beta-glucuronidase 8 2 Op 6 . 
+ CDS 13677 - 14897 925 ## COG4289 Uncharacterized protein conserved in bacteria - Term 15025 - 15076 1.4 9 3 Tu 1 . - CDS 15178 - 17754 2669 ## COG1629 Outer membrane receptor proteins, mostly Fe transport - Prom 17835 - 17894 4.6 - Term 17818 - 17854 -1.0 10 4 Tu 1 . - CDS 17925 - 19220 1190 ## COG3174 Predicted membrane protein 11 5 Op 1 . - CDS 19277 - 19747 561 ## COG2954 Uncharacterized protein conserved in bacteria 12 5 Op 2 . - CDS 19755 - 21074 1172 ## COG3746 Phosphate-selective porin - Term 21114 - 21163 3.8 13 6 Tu 1 . - CDS 21191 - 21325 195 ## - Prom 21386 - 21445 7.7 - Term 21416 - 21460 7.2 14 7 Tu 1 . - CDS 21478 - 22437 1047 ## COG1186 Protein chain release factor B - Prom 22586 - 22645 5.4 15 8 Tu 1 . - CDS 22655 - 24460 1786 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) - Prom 24656 - 24715 4.0 + Prom 24562 - 24621 4.0 16 9 Tu 1 . + CDS 24649 - 25710 1055 ## COG0624 Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases - Term 25794 - 25836 1.0 17 10 Op 1 . - CDS 25890 - 28529 1821 ## BF0371 alpha-rhamnosidase 18 10 Op 2 1/0.062 - CDS 28539 - 30938 1901 ## COG3534 Alpha-L-arabinofuranosidase 19 10 Op 3 . - CDS 30967 - 33054 1597 ## COG3533 Uncharacterized protein conserved in bacteria - Prom 33256 - 33315 6.4 + Prom 33062 - 33121 4.2 20 11 Tu 1 . + CDS 33266 - 34243 721 ## BF0368 putative transcriptional regulator 21 12 Tu 1 . - CDS 34404 - 35786 799 ## BF0314 hypothetical protein - Prom 35810 - 35869 3.7 22 13 Op 1 . + CDS 36089 - 36652 373 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases + Term 36663 - 36703 4.1 + Prom 36670 - 36729 4.1 23 13 Op 2 . + CDS 36749 - 37903 1012 ## COG1835 Predicted acyltransferases - Term 37949 - 38013 24.7 24 14 Op 1 . - CDS 38138 - 39169 682 ## BF0310 lipoprotein - Prom 39241 - 39300 4.2 25 14 Op 2 . - CDS 39356 - 39568 91 ## BF0309 hypothetical protein - Term 39806 - 39852 -0.1 26 15 Tu 1 . 
- CDS 39990 - 40286 166 ## BF0307 hypothetical protein - Prom 40359 - 40418 3.9 - Term 40438 - 40489 1.1 27 16 Tu 1 . - CDS 40498 - 41409 383 ## BF0306 lipoprotein - Prom 41434 - 41493 3.0 - Term 41507 - 41557 2.1 28 17 Tu 1 . - CDS 41568 - 41861 210 ## BF0356 hypothetical protein - Term 42214 - 42259 4.0 29 18 Op 1 . - CDS 42331 - 43299 743 ## COG3940 Predicted beta-xylosidase 30 18 Op 2 . - CDS 43331 - 46324 2268 ## COG5498 Predicted glycosyl hydrolase 31 18 Op 3 . - CDS 46374 - 47594 1007 ## COG2311 Predicted membrane protein - Term 47613 - 47673 5.3 32 19 Tu 1 . - CDS 47698 - 48819 1403 ## COG2017 Galactose mutarotase and related enzymes - Prom 48952 - 49011 1.8 - Term 48891 - 48943 1.3 33 20 Op 1 1/0.062 - CDS 49040 - 50506 1609 ## COG3538 Uncharacterized conserved protein 34 20 Op 2 . - CDS 50550 - 52835 2198 ## COG3537 Putative alpha-1,2-mannosidase - Prom 52855 - 52914 3.9 35 21 Op 1 . - CDS 52955 - 55435 2690 ## BF0349 glutaminase 36 21 Op 2 . - CDS 55462 - 57624 1722 ## BF0348 hypothetical protein 37 21 Op 3 . - CDS 57653 - 58771 1051 ## COG4833 Predicted glycosyl hydrolase - Prom 58825 - 58884 5.1 38 22 Op 1 . - CDS 59808 - 60785 594 ## COG3507 Beta-xylosidase - Prom 60813 - 60872 2.2 - Term 60797 - 60853 5.2 39 22 Op 2 . - CDS 60880 - 61308 303 ## BF0343 hypothetical protein - Prom 61331 - 61390 3.9 - Term 61340 - 61385 11.4 40 23 Op 1 . - CDS 61407 - 62114 626 ## BF0290 hypothetical protein 41 23 Op 2 . - CDS 62134 - 64020 1443 ## BF0289 putative lipoprotein 42 23 Op 3 . - CDS 64028 - 67231 2392 ## BF0288 putative TonB dependent receptor outer membrane protein + Prom 67486 - 67545 8.4 43 24 Op 1 . + CDS 67575 - 71192 3163 ## COG0383 Alpha-mannosidase 44 24 Op 2 . + CDS 71212 - 73680 1300 ## BF0286 hypothetical protein + Term 73757 - 73806 4.4 - Term 73745 - 73794 3.0 45 25 Op 1 . - CDS 73829 - 76339 2826 ## BF0285 putative exported glutaminase - Prom 76359 - 76418 6.2 46 25 Op 2 . 
- CDS 76435 - 78249 1754 ## COG3250 Beta-galactosidase/beta-glucuronidase 47 26 Tu 1 . - CDS 78352 - 79458 404 ## PROTEIN SUPPORTED gi|90020424|ref|YP_526251.1| ribosomal protein L11 methyltransferase - Prom 79700 - 79759 4.7 48 27 Op 1 . + CDS 79736 - 81799 1985 ## COG3533 Uncharacterized protein conserved in bacteria 49 27 Op 2 . + CDS 81838 - 83874 1494 ## COG3533 Uncharacterized protein conserved in bacteria + Prom 83894 - 83953 2.0 50 28 Tu 1 . + CDS 83991 - 85775 1930 ## COG3250 Beta-galactosidase/beta-glucuronidase + Prom 85826 - 85885 4.2 51 29 Tu 1 . + CDS 85991 - 87157 1018 ## COG4833 Predicted glycosyl hydrolase + Prom 87211 - 87270 2.0 52 30 Op 1 . + CDS 87300 - 89798 1645 ## COG1472 Beta-glucosidase-related glycosidases 53 30 Op 2 . + CDS 89880 - 90956 395 ## PROTEIN SUPPORTED gi|90020424|ref|YP_526251.1| ribosomal protein L11 methyltransferase + Term 91009 - 91072 6.6 54 31 Op 1 . - CDS 91137 - 95126 2756 ## COG0642 Signal transduction histidine kinase 55 31 Op 2 . - CDS 95196 - 96509 1283 ## COG4942 Membrane-bound metallopeptidase 56 31 Op 3 . - CDS 96506 - 97093 509 ## BF0326 hypothetical protein 57 31 Op 4 . - CDS 97090 - 98847 2011 ## COG0457 FOG: TPR repeat 58 31 Op 5 . - CDS 98878 - 99312 488 ## COG0756 dUTPase - Prom 99395 - 99454 4.2 + Prom 99278 - 99337 4.2 59 32 Tu 1 . + CDS 99414 - 100742 872 ## COG0232 dGTP triphosphohydrolase + Term 100746 - 100790 4.4 - Term 100614 - 100655 9.1 60 33 Op 1 . - CDS 100714 - 101727 812 ## COG3176 Putative hemolysin 61 33 Op 2 . - CDS 101758 - 102576 770 ## COG3176 Putative hemolysin - Prom 102663 - 102722 3.8 - Term 102751 - 102812 -0.0 62 34 Op 1 . - CDS 102842 - 103345 360 ## BF0319 hypothetical protein 63 34 Op 2 . - CDS 103342 - 105501 1481 ## COG1629 Outer membrane receptor proteins, mostly Fe transport - Term 105524 - 105570 4.1 64 34 Op 3 . - CDS 105576 - 105917 107 ## BF0317 hypothetical protein - Prom 106089 - 106148 6.3 + Prom 105982 - 106041 3.6 65 35 Tu 1 . 
+ CDS 106115 - 106621 570 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog + Term 106683 - 106725 6.5 66 36 Op 1 . - CDS 106720 - 107346 339 ## Phep_4133 hypothetical protein 67 36 Op 2 . - CDS 107343 - 108515 622 ## BF0264 hypothetical protein - Prom 108542 - 108601 5.7 68 37 Op 1 . - CDS 108737 - 109285 311 ## gi|253564340|ref|ZP_04841797.1| predicted protein 69 37 Op 2 . - CDS 109245 - 110285 529 ## gi|253564341|ref|ZP_04841798.1| predicted protein 70 37 Op 3 . - CDS 110288 - 110632 248 ## gi|253564342|ref|ZP_04841799.1| predicted protein - Prom 110680 - 110739 8.1 71 38 Tu 1 . + CDS 111078 - 111578 440 ## BF0315 hypothetical protein 72 39 Op 1 29/0.000 + CDS 111685 - 112161 399 ## COG2001 Uncharacterized protein conserved in bacteria 73 39 Op 2 . + CDS 112158 - 113072 865 ## COG0275 Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis 74 39 Op 3 . + CDS 113101 - 113442 266 ## BF0312 hypothetical protein 75 39 Op 4 26/0.000 + CDS 113446 - 115569 2353 ## COG0768 Cell division protein FtsI/penicillin-binding protein 2 76 39 Op 5 4/0.000 + CDS 115587 - 117044 1596 ## COG0769 UDP-N-acetylmuramyl tripeptide synthase 77 39 Op 6 28/0.000 + CDS 117130 - 118398 1081 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase + Prom 118424 - 118483 2.6 78 39 Op 7 25/0.000 + CDS 118538 - 119872 1205 ## COG0771 UDP-N-acetylmuramoylalanine-D-glutamate ligase 79 39 Op 8 31/0.000 + CDS 119931 - 121226 1165 ## COG0772 Bacterial cell division membrane protein 80 39 Op 9 26/0.000 + CDS 121228 - 122370 1036 ## COG0707 UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 81 39 Op 10 . + CDS 122367 - 123761 1221 ## COG0773 UDP-N-acetylmuramate-alanine ligase 82 39 Op 11 . 
+ CDS 123781 - 124533 269 ## PROTEIN SUPPORTED gi|163752975|ref|ZP_02160099.1| 30S ribosomal protein S12 83 39 Op 12 35/0.000 + CDS 124610 - 126040 1281 ## COG0849 Actin-like ATPase involved in cell division 84 39 Op 13 . + CDS 126067 - 127377 1653 ## COG0206 Cell division GTPase 85 39 Op 14 . + CDS 127452 - 127901 270 ## PROTEIN SUPPORTED gi|42519249|ref|NP_965179.1| 30S ribosomal protein S21 + Term 127927 - 127969 9.5 - Term 127908 - 127959 9.9 86 40 Tu 1 . - CDS 128011 - 128739 471 ## BF0300 DNA repair protein + TRNA 129038 - 129112 50.0 # Glu TTC 0 0 + Prom 129040 - 129099 78.1 87 41 Op 1 . + CDS 129149 - 129403 419 ## PROTEIN SUPPORTED gi|53711589|ref|YP_097581.1| 30S ribosomal protein S20 + Term 129434 - 129481 6.4 + Prom 129501 - 129560 4.9 88 41 Op 2 . + CDS 129584 - 131542 2256 ## COG0187 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit + Term 131591 - 131629 0.8 - Term 131571 - 131619 -0.8 89 42 Op 1 . - CDS 131643 - 134609 2347 ## BF0296 outer membrane assembly protein 90 42 Op 2 . - CDS 134606 - 135187 646 ## BF0295 hypothetical protein 91 42 Op 3 . - CDS 135249 - 135389 100 ## - Prom 135467 - 135526 2.8 92 43 Op 1 10/0.000 - CDS 135542 - 137485 1252 ## COG0642 Signal transduction histidine kinase 93 43 Op 2 . - CDS 137482 - 138576 593 ## COG0642 Signal transduction histidine kinase - Prom 138596 - 138655 6.3 - Term 138635 - 138686 13.4 94 44 Tu 1 . - CDS 138725 - 140239 1739 ## COG0696 Phosphoglyceromutase - Prom 140260 - 140319 6.5 95 45 Tu 1 . + CDS 140525 - 141451 763 ## COG0598 Mg2+ and Co2+ transporters + Term 141526 - 141568 5.1 96 46 Tu 1 . - CDS 141459 - 142064 653 ## COG0164 Ribonuclease HII - Prom 142087 - 142146 4.8 97 47 Tu 1 . - CDS 142280 - 144484 2436 ## COG3808 Inorganic pyrophosphatase - Prom 144522 - 144581 2.7 98 48 Tu 1 . - CDS 144608 - 145162 398 ## BF0288 hypothetical protein - Prom 145182 - 145241 8.3 + Prom 145137 - 145196 7.2 99 49 Tu 1 . 
+ CDS 145406 - 147565 1924 ## COG3345 Alpha-galactosidase + Term 147600 - 147637 7.2 + Prom 147656 - 147715 3.1 100 50 Op 1 6/0.000 + CDS 147775 - 148338 453 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog + Prom 148342 - 148401 3.6 101 50 Op 2 . + CDS 148426 - 149574 764 ## COG3712 Fe2+-dicitrate sensor, membrane component 102 50 Op 3 . + CDS 149593 - 149724 56 ## 103 50 Op 4 . + CDS 149740 - 153138 2903 ## BF0273 hypothetical protein 104 50 Op 5 . + CDS 153153 - 154997 1810 ## BF0272 hypothetical protein + Term 155022 - 155075 8.6 + Prom 155006 - 155065 1.9 105 50 Op 6 . + CDS 155089 - 156681 940 ## BF0271 alpha-galactosidase precursor + Term 156701 - 156750 8.4 - Term 156689 - 156738 15.4 106 51 Tu 1 . - CDS 156756 - 157076 449 ## COG0393 Uncharacterized conserved protein - Prom 157271 - 157330 4.7 - Term 157302 - 157349 10.2 107 52 Op 1 24/0.000 - CDS 157374 - 158585 1034 ## COG0520 Selenocysteine lyase 108 52 Op 2 41/0.000 - CDS 158597 - 159940 1289 ## COG0719 ABC-type transport system involved in Fe-S cluster assembly, permease component 109 52 Op 3 . - CDS 159949 - 160758 210 ## PROTEIN SUPPORTED gi|90020817|ref|YP_526644.1| ribosomal protein S16 110 52 Op 4 . - CDS 160725 - 161471 389 ## BF0266 hypothetical protein 111 52 Op 5 . - CDS 161471 - 162925 1226 ## COG0719 ABC-type transport system involved in Fe-S cluster assembly, permease component 112 52 Op 6 . - CDS 162931 - 163434 449 ## BF0220 hypothetical protein - Term 163440 - 163486 7.1 113 53 Op 1 20/0.000 - CDS 163509 - 166556 3287 ## COG0532 Translation initiation factor 2 (IF-2; GTPase) - Prom 166597 - 166656 2.4 114 53 Op 2 . - CDS 166677 - 167939 601 ## PROTEIN SUPPORTED gi|17988250|ref|NP_540884.1| transcription elongation factor NusA 115 53 Op 3 . - CDS 167942 - 168409 608 ## BF0261 hypothetical protein - Prom 168470 - 168529 4.3 - Term 168421 - 168452 -0.8 116 54 Op 1 . - CDS 168547 - 169761 483 ## BF0260 hypothetical protein 117 54 Op 2 . 
- CDS 169761 - 170912 651 ## BF0259 hypothetical protein - Prom 170938 - 170997 6.0 118 55 Op 1 . - CDS 171083 - 172186 780 ## BF0258 hypothetical protein 119 55 Op 2 . - CDS 172200 - 173531 956 ## COG0738 Fucose permease 120 55 Op 3 3/0.000 - CDS 173543 - 174955 972 ## COG1070 Sugar (pentulose and hexulose) kinases 121 55 Op 4 5/0.000 - CDS 174973 - 175611 544 ## COG0235 Ribulose-5-phosphate 4-epimerase and related epimerases and aldolases 122 55 Op 5 1/0.062 - CDS 175608 - 176762 1332 ## COG1454 Alcohol dehydrogenase, class IV 123 55 Op 6 . - CDS 176781 - 178553 1965 ## COG2407 L-fucose isomerase and related proteins 124 55 Op 7 . - CDS 178626 - 179606 635 ## COG1609 Transcriptional regulators - Prom 179698 - 179757 4.7 + Prom 179551 - 179610 3.3 125 56 Tu 1 . + CDS 179698 - 179877 82 ## - Term 179710 - 179763 -0.2 126 57 Op 1 . - CDS 179811 - 180371 325 ## BF0251 hypothetical protein 127 57 Op 2 . - CDS 180358 - 181542 495 ## BF0250 hypothetical protein - Term 181554 - 181591 7.0 128 57 Op 3 . - CDS 181613 - 181879 193 ## BF0207 hypothetical protein - Prom 181969 - 182028 6.3 - Term 181904 - 181937 0.6 129 58 Tu 1 . - CDS 182060 - 182287 218 ## BF0249 hypothetical protein - Prom 182307 - 182366 5.8 - Term 182377 - 182429 8.1 130 59 Tu 1 . - CDS 182448 - 183503 534 ## BF0248 hypothetical protein - Prom 183576 - 183635 6.9 + Prom 183520 - 183579 5.1 131 60 Tu 1 . + CDS 183608 - 184006 447 ## BF0247 hypothetical protein - Term 184066 - 184117 4.1 132 61 Tu 1 . - CDS 184168 - 184866 400 ## COG1451 Predicted metal-dependent hydrolase - Prom 184895 - 184954 6.7 + Prom 184709 - 184768 5.0 133 62 Op 1 . + CDS 184987 - 185721 550 ## BF0245 hypothetical protein 134 62 Op 2 . + CDS 185741 - 186520 597 ## BF0244 hypothetical protein + Term 186578 - 186624 8.1 - Term 186632 - 186663 -0.5 135 63 Op 1 . - CDS 186708 - 187166 328 ## BF0243 hypothetical protein 136 63 Op 2 . - CDS 187189 - 187704 559 ## BF0242 hypothetical protein 137 63 Op 3 . 
- CDS 187688 - 188197 375 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 138 63 Op 4 . - CDS 188258 - 189031 820 ## COG0548 Acetylglutamate kinase 139 64 Op 1 . - CDS 189171 - 191063 1725 ## COG1166 Arginine decarboxylase (spermidine biosynthesis) 140 64 Op 2 . - CDS 191121 - 192200 811 ## BF0195 hypothetical protein 141 64 Op 3 . - CDS 192209 - 193975 1142 ## COG0326 Molecular chaperone, HSP90 family 142 64 Op 4 . - CDS 193983 - 196481 2570 ## COG0790 FOG: TPR repeat, SEL1 subfamily - Prom 196567 - 196626 2.4 - Term 196520 - 196548 -0.9 143 65 Tu 1 . - CDS 196628 - 197167 400 ## COG0703 Shikimate kinase - Prom 197215 - 197274 2.5 + Prom 197531 - 197590 2.5 144 66 Tu 1 . + CDS 197618 - 197854 92 ## BF4496 hypothetical protein - Term 197733 - 197769 0.9 145 67 Tu 1 . - CDS 198004 - 198606 333 ## COG3560 Predicted oxidoreductase related to nitroreductase - Prom 198638 - 198697 3.9 + Prom 198594 - 198653 8.1 146 68 Tu 1 . + CDS 198726 - 199355 484 ## COG3341 Predicted double-stranded RNA/RNA-DNA hybrid binding protein - Term 199122 - 199169 -1.0 147 69 Op 1 . - CDS 199359 - 200591 632 ## BF0189 hypothetical protein 148 69 Op 2 . - CDS 200594 - 202429 214 ## PROTEIN SUPPORTED gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein - Prom 202646 - 202705 5.8 + Prom 202760 - 202819 5.1 149 70 Op 1 11/0.000 + CDS 202849 - 203919 490 ## COG0463 Glycosyltransferases involved in cell wall biogenesis + Term 203985 - 204027 4.2 + Prom 203924 - 203983 6.3 150 70 Op 2 . + CDS 204040 - 204843 394 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 151 70 Op 3 . + CDS 204846 - 205868 554 ## BF0185 putative lipopolysaccharide core biosynthesis protein 152 70 Op 4 . + CDS 205896 - 206831 567 ## COG1442 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases + Term 206847 - 206887 10.6 - Term 206749 - 206791 -1.0 153 71 Op 1 . - CDS 206800 - 207513 506 ## BF0183 hypothetical protein 154 71 Op 2 . 
- CDS 207518 - 208234 534 ## BF0182 hypothetical protein - Prom 208281 - 208340 3.6 + Prom 208178 - 208237 5.5 155 72 Tu 1 . + CDS 208367 - 209413 735 ## COG0111 Phosphoglycerate dehydrogenase and related dehydrogenases 156 73 Tu 1 . - CDS 209451 - 210074 439 ## COG0299 Folate-dependent phosphoribosylglycinamide formyltransferase PurN - Prom 210119 - 210178 8.8 + Prom 210094 - 210153 5.3 157 74 Op 1 27/0.000 + CDS 210173 - 210409 385 ## COG0236 Acyl carrier protein 158 74 Op 2 1/0.062 + CDS 210425 - 211687 1306 ## COG0304 3-oxoacyl-(acyl-carrier-protein) synthase 159 74 Op 3 . + CDS 211692 - 212564 660 ## COG0571 dsRNA-specific ribonuclease - Term 212468 - 212521 7.1 160 75 Tu 1 . - CDS 212583 - 213593 926 ## COG0205 6-phosphofructokinase - Prom 213720 - 213779 6.2 + Prom 213566 - 213625 5.3 161 76 Tu 1 . + CDS 213763 - 215274 1223 ## BF0216 putative auxin-regulated protein - Term 215097 - 215129 -0.4 162 77 Tu 1 . - CDS 215334 - 216428 663 ## COG0482 Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain - Prom 216671 - 216730 1.7 163 78 Tu 1 . + CDS 216480 - 219530 1605 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family - Term 219430 - 219488 1.2 164 79 Tu 1 . - CDS 219577 - 220374 747 ## BF0213 hypothetical protein - Prom 220400 - 220459 6.9 165 80 Op 1 . - CDS 220595 - 221653 420 ## BF0171 hypothetical protein - Prom 221674 - 221733 2.0 166 80 Op 2 1/0.062 - CDS 221745 - 224396 1584 ## COG0474 Cation transport ATPase - Prom 224489 - 224548 2.5 - Term 224438 - 224478 1.2 167 81 Op 1 40/0.000 - CDS 224556 - 225923 1078 ## COG0642 Signal transduction histidine kinase 168 81 Op 2 . - CDS 225920 - 226606 782 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 169 82 Tu 1 . - CDS 226963 - 228243 500 ## BF0208 hypothetical protein - Prom 228268 - 228327 4.8 - Term 228251 - 228316 11.2 170 83 Op 1 . 
- CDS 228411 - 229274 341 ## BF0403 hypothetical protein 171 83 Op 2 . - CDS 229332 - 230399 427 ## BF0467 hypothetical protein - Prom 230441 - 230500 4.7 - Term 230577 - 230622 -1.0 172 84 Op 1 . - CDS 230649 - 230888 325 ## gi|253564445|ref|ZP_04841902.1| predicted protein 173 84 Op 2 . - CDS 230903 - 231172 289 ## BF0057 hypothetical protein - Prom 231195 - 231254 3.3 174 85 Tu 1 . - CDS 231405 - 231626 195 ## BF0051 hypothetical protein - Prom 231652 - 231711 4.2 175 86 Tu 1 . - CDS 231750 - 233141 912 ## BF0207 hypothetical protein - Prom 233177 - 233236 7.8 176 87 Tu 1 . + CDS 233359 - 234195 815 ## COG0561 Predicted hydrolases of the HAD superfamily + Prom 234563 - 234622 1.9 177 88 Tu 1 . + CDS 234653 - 236128 1791 ## COG0215 Cysteinyl-tRNA synthetase + Term 236166 - 236221 9.7 + Prom 236149 - 236208 2.1 178 89 Tu 1 . + CDS 236239 - 236643 479 ## COG2050 Uncharacterized protein, possibly involved in aromatic compounds catabolism - Term 236547 - 236591 1.2 179 90 Op 1 . - CDS 236633 - 237511 637 ## BF0194 hypothetical protein 180 90 Op 2 . - CDS 237450 - 239369 1573 ## BF0193 hypothetical protein - Prom 239394 - 239453 5.5 + Prom 239333 - 239392 8.1 181 91 Tu 1 . + CDS 239475 - 242579 2982 ## COG3250 Beta-galactosidase/beta-glucuronidase + Term 242601 - 242646 11.2 + Prom 242591 - 242650 5.4 182 92 Op 1 27/0.000 + CDS 242888 - 244042 1077 ## COG0845 Membrane-fusion protein 183 92 Op 2 9/0.000 + CDS 244067 - 247270 2928 ## COG0841 Cation/multidrug efflux pump 184 92 Op 3 . + CDS 247287 - 248678 366 ## PROTEIN SUPPORTED gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 185 92 Op 4 . + CDS 248698 - 249468 701 ## COG1043 Acyl-[acyl carrier protein]--UDP-N-acetylglucosamine O-acyltransferase + Term 249491 - 249539 9.4 - Term 249482 - 249524 7.2 186 93 Tu 1 . - CDS 249570 - 250211 687 ## BF0187 hypothetical protein - Prom 250231 - 250290 6.1 + Prom 250195 - 250254 8.4 187 94 Op 1 . + CDS 250442 - 251662 1248 ## BF0151 outer membrane protein 188 94 Op 2 . 
+ CDS 251669 - 252601 749 ## COG0042 tRNA-dihydrouridine synthase 189 95 Op 1 . - CDS 252795 - 253355 581 ## BF0149 hypothetical protein 190 95 Op 2 . - CDS 253447 - 254715 785 ## BF0183 hypothetical protein + Prom 254713 - 254772 4.1 191 96 Op 1 . + CDS 254811 - 255539 764 ## COG0289 Dihydrodipicolinate reductase 192 96 Op 2 2/0.000 + CDS 255544 - 257028 1199 ## COG0681 Signal peptidase I 193 96 Op 3 . + CDS 257036 - 257971 592 ## COG0681 Signal peptidase I 194 96 Op 4 . + CDS 258035 - 258679 556 ## BF0179 hypothetical protein + Prom 258692 - 258751 4.3 195 97 Op 1 3/0.000 + CDS 258808 - 259620 190 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 196 97 Op 2 . + CDS 259717 - 260925 874 ## COG1312 D-mannonate dehydratase 197 98 Tu 1 . - CDS 261072 - 261602 556 ## BF0176 hypothetical protein - Prom 261816 - 261875 2.9 + Prom 261684 - 261743 3.6 198 99 Tu 1 . + CDS 261850 - 262200 130 ## BF0140 hypothetical protein + Term 262382 - 262417 -0.6 - Term 262135 - 262189 3.1 199 100 Tu 1 . - CDS 262367 - 263614 1276 ## COG0612 Predicted Zn-dependent peptidases - Prom 263642 - 263701 3.4 200 101 Op 1 . + CDS 263716 - 264246 550 ## COG1611 Predicted Rossmann fold nucleotide-binding protein 201 101 Op 2 . + CDS 264246 - 264851 549 ## COG0794 Predicted sugar phosphate isomerase involved in capsule formation 202 101 Op 3 . + CDS 264839 - 265759 670 ## COG0524 Sugar kinases, ribokinase family + Term 265809 - 265850 0.4 + Prom 265840 - 265899 2.6 203 102 Op 1 . + CDS 265944 - 266999 898 ## COG2365 Protein tyrosine/serine phosphatase + Prom 267010 - 267069 4.3 204 102 Op 2 . + CDS 267094 - 269004 1869 ## COG0513 Superfamily II DNA and RNA helicases + Term 269120 - 269181 9.8 + Prom 269640 - 269699 2.3 205 103 Tu 1 . + CDS 269751 - 270614 841 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily - Term 270447 - 270500 6.7 206 104 Op 1 . - CDS 270664 - 272841 2164 ## BF0166 hypothetical protein 207 104 Op 2 . 
- CDS 272893 - 274260 1225 ## COG1808 Predicted membrane protein 208 104 Op 3 . - CDS 274263 - 275759 1471 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid 209 104 Op 4 . - CDS 275788 - 276816 1108 ## COG2255 Holliday junction resolvasome, helicase subunit - Prom 276910 - 276969 2.5 210 105 Tu 1 . - CDS 277145 - 277843 496 ## COG4912 Predicted DNA alkylation repair enzyme - Prom 277925 - 277984 6.4 + Prom 277754 - 277813 3.6 211 106 Tu 1 . + CDS 277950 - 279182 1111 ## COG2715 Uncharacterized membrane protein, required for spore maturation in B.subtilis. Predicted protein(s) >gi|226332036|gb|ACIB01000020.1| GENE 1 343 - 3696 2924 1117 aa, chain + ## HITS:1 COG:no KEGG:BF0387 NR:ns ## KEGG: BF0387 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1117 1 1117 1117 2144 99.0 0 MRKSKLHLLPLSSKRVLVSTSLIMLLSGSAWAVSSQETVENGDAITAVPQQRRTVKGIVK DANGEPIIGANVIVKGNKTIGVITNLNGEFSLEVPSNATLQISYIGYLNKEVKVSGNQVS FNIQLEEDSKTLDEVVVVGYGTQKKANLTGAVSSVDFEEQTKSRPITTVSSALAGLSPGL QASSGSAMPGEDNTTLRVRGNGTMNNASPLIIIDGMEGSLNAINPQDIENISILKDAASC AIYGARAANGVILVTTKSGDRDKIQVNYSGRISFNSPTRMIETMSNYADYMELMNESCEN VGSGTLFDQKYIDLWREKSKDPNGVNENGVPNYIAYPNTNWLKELYSGGMIHEHNLSVSG GSNKIRFLLSARYQDNEGIVDNTANKTYSVRANIEANPTQWLTLGTRTYASQMDREVGDF SNANTFLRQSTAGTYPEWNGSFGYPECPDERATANNPLYKLARNDGFKRYNRFNTTLFSK VKFFKDLSWDFNFNYNRYIYETRQWGVPAYQTRFSDGVIVDGITPPSQLSTSFGYESNYS YTLENLLNYHHTFAQKHDVSALLGYQEFYKNYYTVDAAKKGLIDESLNQFDEATEMTSTK GATQDYATRSVFGRVNYAYNSRYLFEANFRYDGSSRFHKDHRWGFFPSLSGAWRISEESF MENTRTWLDNLKVRASWGKLGNSEIGNYEYMSVYSTTNAVFGNALNSALYMGAIANSLLK WESTTSVNFGIDVNLLKNRLSISADLYQKKTDGILYRPTIPYVFGTMTAPRQNLAKVSNK GVELSLGWRDNIGGVSYSINGNFSYNKSNIDAYNGTYERTWVEDPNNKLTGGKWEDNIGK VSSGGTTPIVEGRMMNEYYLRNVYHGNGSYYNADGSVNPQGGPKTGMIRTEKDMAWVKDM IAAGYEFQPGKTVAKNKIWYGDYIYADSNNNGVYGDDNDYTFQKTSNKPKYNFGFQASAA WKGFDLSMVWAGAAGFSIYWGATTGYNAASTEWGSTIAQRVAENHYFYNPENPDDPRTNI NAKYPRMAYIDGYVQNRHGNTTLWLYKGDYIKLKNLSLGYTLPKNWVSKIAMQNARIYVS 
AENLLTITGFEGQDPESATGMGYSPFRTIAIGANITF >gi|226332036|gb|ACIB01000020.1| GENE 2 3714 - 5561 1606 615 aa, chain + ## HITS:1 COG:no KEGG:BF0386 NR:ns ## KEGG: BF0386 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 615 1 615 615 1240 99.0 0 MKLLNKLFILTATAIAISSCQGDLLDTKPYDKASSGSMWSNENFCTMGVASIYATLREGY VAKEAYLMEAFSVGATCRDNDYPLLAGTASIGSGIFSDYWKQHYAGIYRANDAIVHLPDA PISESVKGKLLSEAKVLRAFYYYKLNAVFRGVPYYNTPMELDQADKPRESEENIWNFCIQ DLTDAINDPNFPDRIAAGKAEWGHVTKSVAYALRGKIYLWTKEWSKAEADFRKVGELGHS LFQDGYKQLFKEANEQSDEAIFSLQCIDNNGSTYGNSMSFRYGGRTTFGSCWNTMLASVD FVETYENIDGSKFNWDEYIPGFSSLDIKDREVFFLRNTDPATVKAQYREIGFDGTEEALD DMVKKIKAKVDGRLEKLSDKAKALYLPAGNEARIKAAYDSRDPRLSQTVITPYATYDGSG NSVDHTFTSRWPYYGADTDYPYDLRTDTQSHLYYLFRKFVAEGSSEMTNREQSPIDLPII RYATVVIGLAEALNEQGKTDEAIEWLNKVRQRAGVALLNSNTATMVQGQEDMRVRIQNEF RWETAGEGVDFYEELRWKTWKESKFNNADGTAGMKDVWGTITYPYTWGGDQYYVWPIPKH ETDMNKSLTQNSGWN >gi|226332036|gb|ACIB01000020.1| GENE 3 5913 - 7820 1370 635 aa, chain + ## HITS:1 COG:no KEGG:BF0332 NR:ns ## KEGG: BF0332 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 635 1 635 635 1305 99.0 0 MKKLLIFTLLLCCTVKALSYTERNYLQKQANVSSLQEVLILNQQWVTYPAYTDRTGWDTF LGSFKNECILRGEKQLNYQWQVVKATDYMEFERSGNRSIMETPFANNNNAIADLLLAELA EGKGRFIDQLINGVYHSCEMTSWALAAHLNAQQSHRSLPDFKENIIDLTAGDLGSLLAWT YYYMHKEFDKLNPAISERLRHTLQQRILDPYMNNDHFWWMAVNYRPGMLVNNWNPWCNSN ALMCFMLLENDKEMLAKAIYRSMVSVDKFINYTHTDGACEEGPSYWGHAAGKMFDYLELL SAVTGGTVSIFDNPMIKNMGEYISRSYVGKGWVVNFADASAKGSGDAPLIFRYGKAVNSN EMKGFAAMINTNKLPSGRDIYRTLAAIKIANELKETQAAHLTPPFSWYPETEFCYITDNK GNFLAAKGGYNDESHNHNDAGTFSFWIDQTPFLIDAGVGTYTRQTFSKDRYTIWTMQSNY HNLPLINGVPQKYGATYKATQVQADKRKNTFTANIATAYPAEAEVESWIRSYSLQKGQLR ISDSFRLTTAKQPNQINFMTWGQVNTEIPGKIQLEVKGKKAVIEYDKQLFDVKTEIIPLT DTRLSNVWGKSICRITLTATNIYKTGNYSFTIKKQ >gi|226332036|gb|ACIB01000020.1| GENE 4 7824 - 9035 897 403 aa, chain + ## HITS:1 COG:no KEGG:BF0331 NR:ns ## KEGG: BF0331 # Name: not_defined # Def: putative exported hydrolase # 
Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 403 1 403 403 810 97.0 0 MNLKPLLGAACAMICVACSNHKPTVPSFITENVEFAKVQLGLAIDTIEASGKCLNPVTLN RDGSVYYCGYADWRSGFFPGSIWYLYELTGDTSYLPLARKYTEAIRPAEHLTWHHDIGFI INCSFGNGLRLAPDTAAYKDVMVQAAKSLCTRFRPNAGVIQSWDVKGNSWQSERGWECPV IIDNMMNLELLFEATKLSGDSTFHKVAVAHADRTLSEHFRPDGSCYHVVDYNISDGSVRH KQTAQGYADESVWSRGQAWAIYGFTICYRETKDRKYLDQALKTFNRMKNDPHMPEDLIPY WDMDAPNIPDEPRDVSSASCIASALYEISTYDVPDAASYREYADRIMHSLASPDYRAALG TNGYFILMHSVGSIPHNSEIDVPLNYADYYFLEALKRRKDLDK >gi|226332036|gb|ACIB01000020.1| GENE 5 9056 - 9967 717 303 aa, chain + ## HITS:1 COG:CAC0477 KEGG:ns NR:ns ## COG: CAC0477 COG1524 # Protein_GI_number: 15893768 # Func_class: R General function prediction only # Function: Uncharacterized proteins of the AP superfamily # Organism: Clostridium acetobutylicum # 164 237 172 245 434 61 40.0 2e-09 MKTQLFKFACLIICLLTIAPNCHAGSKWKAKHVVLIGLDGWGAYSVEKANIPHIKQLMND GSYTLTKRSVLPSSSAVNWASMFMGAGPELHGYTTWNSSTPDLPSKELSKDGIFPTIFQL LREADPKAEIGTFYEWVGIKYLVDTLAVNKYNQGINYEKYPTELCEKAVKYIKEKKPTLT LIAWDNPDHVGHKEGHDTPAYYHKLEEIDGYIGKVMNAVKEAGILDETIFIITSDHGGIN KGHGGKTMQEMETPFIISGKNIKKGHEIQASMMQFDVAATVAAIFKLKQPQVWIGRPIME VFK >gi|226332036|gb|ACIB01000020.1| GENE 6 9998 - 11533 1130 511 aa, chain + ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 78 485 103 519 757 297 40.0 3e-80 MKRTNRTLFIHNTARGRIKFLLLALFAFQGLQAQKLFPAPSAIETHKGTFSYDEVSAKCV RTTISKSLPAIGIEYSDEAYQLEITPDSIFIDATSAKGAFYARQAIKQLARHERGKIRCC RIYSSPRYAWRGFMLDESRHFFGKEKVKQYLDLMALLHLNVFHWHLTDEPGWRIEIKKYP KLTKIGAVGNWHDAQATPQFYTQDDIREIVAYAAERQIMVVPEFDMPGHATAVCRAYPEV SGGGEGRWKHFTFHPCKEETYRFISDVLDEIVALFPAPYIHIGGDEVHYGNQNWFTDPEI QNFIKEKGLINETGLEHYFIRRAADLVAAKGKKMIGWDEIVDAGISPSKALVMWWRHDRK YQLLKALEQGYQVVLTPRRPLYGDFVQDASHKVGRYWDGFNPLQDIYAFPEPISHLFKGY EDQILGMQFTLWTERIADGKRLDFMTFPRLIALAESAWTSSKEKDWSRFCMRLPSFLEYL KEQGIYYFDVIHPQETPEPGGPEKADVLQNG 
>gi|226332036|gb|ACIB01000020.1| GENE 7 11590 - 13677 1614 695 aa, chain + ## HITS:1 COG:SSO3036 KEGG:ns NR:ns ## COG: SSO3036 COG3250 # Protein_GI_number: 15899743 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Sulfolobus solfataricus # 38 591 6 552 570 166 26.0 1e-40 MKQQKCNYFPSLWWRGREKGLSTFLFLLLFSISLHAQRQDILLNNNWNFRFSHQVQGDTR RVDLPHTWNAQDALASKIDYKRGIGNYEKALYIRPEWKGKRLFLRFDGVNSIADVFINRK HIGEHRGGYGAFIFEITDLVKYGEKNSVLVRANNGEQLDIMPLVGDFNFYGGIYRDVHLL ITDETCISPLDYASPGVYLVQEVVSPQEAKVCAKVNLSNRAADGTAELQVLVTDGTKVIC KESRNVSLKQGADIQEQLPLLIQKPRLWNGCEDPFMYQVSISLHKDGKQIDSVTQPLGLR YYHTDPDKGFFLNGKHLPLHGVCRHQDRAEVGNALRPQHHEEDVALMREMGVNAIRLAHY PQATYMYDLMDKHGIVTWAEIPFVGPGGYADKGFVDQASFRENGKQQLIELIRQHYNHPS ICFWGLFNELKEVGDNPVEYVKELNVLAKQEDPTRPTTSASNQDGNLNFITENIAWNRYD GWYGSTPKTLATFLDRTHKKHPELRIGISEYGAGASIYHQQDSLKQPSASGWWHPENWQT YYHMENWKIIAERPFVWGTFVWNMFDFGAAHRTEGDRPGINDKGLVTFDRKVRKDAFYFY KANWNKQEPMIYLAEKRCRLRYQPEQTFMAFTTAPEAELFVNGISCGKQKADTYSTVVWK NVKLTSGENIIRVTTPGKKPLTDEVTVEYKEDRPL >gi|226332036|gb|ACIB01000020.1| GENE 8 13677 - 14897 925 406 aa, chain + ## HITS:1 COG:TM1061 KEGG:ns NR:ns ## COG: TM1061 COG4289 # Protein_GI_number: 15643819 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Thermotoga maritima # 27 406 2 385 387 331 46.0 2e-90 MKRIILSIVWGLLTGWAAVPCLWAQSRTGTADREIWVKTLVRLADPVLSNLANETLKKEM PYESLAPNRQRFSYLEAVGRTVCGIAPWLELGEDDTPEGQLRKKYIELTVKGISNAVNPS SPDYLIFGEPSQPLVDAAFLAEGLLRAPKQLWGNLSPTARKQVVTELKRSRVIKPNESNW LLFASIVEAALQEFTGECDTTRLNYGVRKFRDLWYKGDAQYGDGAEFHLDYYNSFVIHPM LTDVLVVMQKHRMPESEFLNVQQKRLGRYAEQLERFISPEGTYPVIGRSIVYRTGVFHAL GQAALLHLLPQQIVPAQVRCGMTKVIENQFRSAANFDTKGWLKIGFSGNQVQMSESYINT GSTYLCLTGFLPLGLPADDPFWSAPPAEWTNLKAWSGKEMPADHSL >gi|226332036|gb|ACIB01000020.1| GENE 9 15178 - 17754 2669 858 aa, chain - ## HITS:1 COG:CC0815 KEGG:ns NR:ns ## COG: CC0815 COG1629 # Protein_GI_number: 16125068 # Func_class: P Inorganic ion transport and metabolism # 
Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Caulobacter vibrioides # 125 826 45 720 737 155 22.0 3e-37 MRNKLLLLLLLMFMTSGMMYAQQQKTQQSQQRTAAPSYTLKGVLLDSLTQEGEPYATIKI AKKNAPQKALKMAVTDLNGKFQEKLTVAPGEYIITLSSVGKVTIVKDFTVKASEKVIDLG KLNMAEATNELKGIEVVAQKPLVKVDVDKIEYNIEDDPDSKTNTVMEMLRKVPLVTVDGE DNIKVNGSSSFKIHVNGKPNNMMSNNPKDVLKSMPANTIKHIEVITSPGAKYDAEGVGGI LNIVTVGGGFEGYTATFRASGSNRGAGAGGYATIKSGKLTITGNYNYNYDTSPKSYSDSY RENYDSEDQKYLESKSSSDYNGSFQYGNLEASYEIDTLRLLTASFGMYGGANDNKSDGLT TMWNAQRDRLAYQYRSLSDGDGSWYSMRGNVDYQRTSKKNKDRMITLSYKISTQPQNSDY YTDYKDIKDPFEMDIVKKFLLNNSHSDGKTNTTEHTFQVDYTTPIGKLHTIEAGAKYIIR NNLSDNKLFEAEGVSDNYEYNNDRSSKYKHLNDILAAYLGYTLRYKTFSFKPGVRYEYTS QDVKYLAGAIGPEADFSTSYNDFVPSVTMGIKIGKTQNLRGGYNMRIWRPGIWNLNPYFD DRNPMFISQGNSNLESEKSHSFNLSYSMFSMKFNVNISLRHSFGNNGIERVSRLIGKGGE EFPGGHHAPEGALYSTYENIGKNRNTGLSLYGNWNASPNTRIYLNGDGSYVDIKSPAQGL HNYGWNASLYGGIQHTFPLKIRASLNAGGSTPYISLQGKGSGYYYYSLGVNRSFIKDRFT VSAYVSNIFEKYRSYNNTTMGENFLSKSSSRYQSRSFGISLSYRIGELKASVKKAARSIN NDDVKGGGGQGGQGGGAN >gi|226332036|gb|ACIB01000020.1| GENE 10 17925 - 19220 1190 431 aa, chain - ## HITS:1 COG:MTH1451 KEGG:ns NR:ns ## COG: MTH1451 COG3174 # Protein_GI_number: 15679448 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Methanothermobacter thermautotrophicus # 188 403 3 213 236 79 28.0 1e-14 MEQLYNYLPEKLVTFILVTLFSLLIGLSQRKISLKREGETTLFGTDRTFTFIGMLGYLLY ILDPEEMHLFMGGGLILGILLGLNYYVKQSQFHVFGVTTIIIALITYCIAPIVSTQPSWF YVMVIVTVLLLTELKHTFTEIAQRMKNDEMITLAKFLAISGIILPMLPNENIIPDINLTP YTIWLATVVVSGISYLSYLLKRYVFRESGVLVSGIIGGLYSSTATISVLARKSRNTHSQE ASEYVAAMLLAVSMMFLRFMILILIFSSTIFTSIYPYLLIMAAVAAGVAWFIHTRRKRTP DADLVEEEDDSSNPLEFKVALIFAGLFVIFTVLTHYTLIYAGTGGLNLLSFVSGFSDITP FILNLLQGTGSVAATVVMACTMQAIISNIVVNMCYALFFSGKQSKLRSWILGGFGCVIAA NVVVLFFFYLI >gi|226332036|gb|ACIB01000020.1| GENE 11 19277 - 19747 561 156 aa, chain - ## HITS:1 COG:all4694 KEGG:ns NR:ns ## COG: all4694 COG2954 # Protein_GI_number: 17232186 # Func_class: S Function unknown # Function: Uncharacterized 
protein conserved in bacteria # Organism: Nostoc sp. PCC 7120 # 1 155 1 153 153 130 48.0 6e-31 MSQEIERKFLVSGDYKSQAFDQSRIVQGYISSARGRTVRVRIRDGKGYLTIKGASDASGI SRYEWEKELSLAEAEELMKLCEPGVIDKTRYLVRSGKHIFEVDEFYGENEGLVVAEVELG SEDEVFVKPSFIGEEVTGDIRYYNSQLMKKPYTTWS >gi|226332036|gb|ACIB01000020.1| GENE 12 19755 - 21074 1172 439 aa, chain - ## HITS:1 COG:CC0911 KEGG:ns NR:ns ## COG: CC0911 COG3746 # Protein_GI_number: 16125163 # Func_class: P Inorganic ion transport and metabolism # Function: Phosphate-selective porin # Organism: Caulobacter vibrioides # 34 389 118 466 493 98 26.0 2e-20 MNKQLITIFSIFSITCLGMVSAQESKSFLPEVKKESLTFSSEDNKFKLTFNGRIQADGAM FFGEDYQPIGNGVGFRRVRLGATAAFGKSLSGKIEMDLTDGGFSLKDCFIKYAFPNGLYF RVGNFKESFGMAAMTSSGDLWFMEKANVVSAFAPEYHIGVQGTWEHDQFLGVAGVHFKKI EGNKEKDYSESNNKAGEDEGISVTARAVWQPVSADKVKGFHLGIAASYRTPKTTVGSLMP NTVRYSTRSLSYINKIKFLDTSPIASVSHDWLAGAELAGFYRGFRFQGEYIMNNTVRMEG LATEKFNGFYVQAAYLLFGGQQRYSKSRGAFSQPSFGRSWGDIELAARFDRIDLNGTEVM GGSSNGWTFGVNYYATRNLKFQLNYSYVDNDKYANAFGQAAVGYKSNGEIAYKPEEVDES LGKGGNAYGILGLRIQLNF >gi|226332036|gb|ACIB01000020.1| GENE 13 21191 - 21325 195 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSKNKKKVTHSKKEEEQAKKVVKIVFVSLVILALAMIIGFSLLG >gi|226332036|gb|ACIB01000020.1| GENE 14 21478 - 22437 1047 319 aa, chain - ## HITS:1 COG:RSc1030 KEGG:ns NR:ns ## COG: RSc1030 COG1186 # Protein_GI_number: 17545749 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Protein chain release factor B # Organism: Ralstonia solanacearum # 24 313 19 298 300 247 45.0 3e-65 MKLVKGLQKWIEGYNDVKTLTDELELAFDFYKDELVTEQEVDEAYAKALEHVENLELQNM LRDEADQMSCVLKINSGAGGTESQDWASMLMRMYLRYAETNGYKATMANLQEGDEAGIKT CTIQIEGDYAYGYLKGENGVHRLVRVSPYNAQGKRMTSFASVFVTPLVDDSIEVNILPAC ISWDTFRSGGAGGQNVNKVESGVRLRYQYKDPYTGEEEEILIENTETRDQPKNRENAMRQ LRSILYDKELQHRMAEQAKVEAGKKKIEWGSQIRSYVFDDRRVKDHRTNFQTSDVNGVMD GKIEGFIKAYLMEFSSEEA >gi|226332036|gb|ACIB01000020.1| GENE 15 22655 - 24460 1786 601 aa, chain - ## HITS:1 COG:VC2484 KEGG:ns NR:ns ## COG: VC2484 COG1022 # 
Protein_GI_number: 15642480 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Vibrio cholerae # 5 601 7 597 601 485 40.0 1e-136 MTYHHLSVLVHRQAEKYGDKTALKYRDYEKAQWIPISWNEFSQTVRQAANAMVELGVQEE ENIGIFSQNKPECLFTDFAAFANRAVTIPLYATSSPAQAQYIINDAQIRFLFVGEQFQYD AAFSVFGFCPSLVQLIIFDPAVVKDPRDMSSIYYDEFLAKGKDLPHNEVVEERTARASAE DLANILYTSGTTGEPKGVMLHHSCYLEAFRIHDIRLVDMTDKDVSMNFLPLTHVFEKAWT YLCVHKGVQVCINLRPADIQTTIKEIRPTLMCSVPRFWEKVYAGVQEKIAETTGIKKMLM LDAIKVGRIHNLDYLRVGKTPPRMIQLKYKFYEKTIYALLKKTIGIENGNFFPTAGAAVP DEICEFVHSVGIDMLVGYGLTESTATVSCTSKTGYDIGSVGQVMPEVEVKIGEDNEILLR GKTITKGYYKKAEATAAAIDEEGWFHTGDAGYFKNGQLYLTERIKDLFKTSNGKYIAPQA LETKLVIDRYIDQIAIIADQRKFVSALIVPVYGFVKQYAKEKGIEYKDMAELLEHPKITA LFRARIDTLQQQFAHYEQIKRFTLLPEPFSMEKGELTNTLKLKRPVVARNYKEVIDKMYE E >gi|226332036|gb|ACIB01000020.1| GENE 16 24649 - 25710 1055 353 aa, chain + ## HITS:1 COG:MT1240 KEGG:ns NR:ns ## COG: MT1240 COG0624 # Protein_GI_number: 15840646 # Func_class: E Amino acid transport and metabolism # Function: Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases # Organism: Mycobacterium tuberculosis CDC1551 # 1 350 1 354 354 104 27.0 2e-22 MIDESITSEAVGLLKSLISIPSLSREEEKAADYLQNYIEAEGMTTGRKGNNIWCLSPMFD LKKPTILLNSHIDTVKPVNGWRKDPFTPREENGKLYGLGSNDAGASVVTLLQVFLQLCRK QQSYNLIYLASCEEEISGKGGIESVLPGLPPISFAVVGEPTEMQPAIAEKGLMVLDVTAT GKAGHAARNEGDNAIYKVLDDIAWFRDYRFAKESPLLGPVKMSVTVINAGTQHNVIPDRC SFVVDVRSNELYSNEELFTEIQKHIFCKAEARSFRLNSSRIEESHPFVQKAKKLGRVPFG SPTLSDQALMAFPSVKIGPGRSSRSHTSDEYIMIKEIEEALELYLKILDGLEI >gi|226332036|gb|ACIB01000020.1| GENE 17 25890 - 28529 1821 879 aa, chain - ## HITS:1 COG:no KEGG:BF0371 NR:ns ## KEGG: BF0371 # Name: not_defined # Def: alpha-rhamnosidase # Organism: B.fragilis # Pathway: not_defined # 1 879 1 879 879 1835 99.0 0 MKIGYLKPGILLLVLLFSYRQDLFAKKRITSLKCEYIETPLGLDVQRPRLMWKVDTSDVP SRQTAYRILVSSTPELLRQGEADIWDSGKQKSDEQLVSYAGSTLRPHTRYWWRVEVWLNN KKVVSEPVWFETGKFSATDWEASWITDGYDKDYEPSPMFRKVFDVSKEVASARCYISGLG 
YYRLSFNGKAVNDHALDPGFTDYSKRVLYLTYDISGLLRHGKNCIGVQLGNGWFNEQTPA VWYFHEAPWRKRPQMIAEIHLCYTDGSKDIITTDTSWKTSTGPLLFDNLYVGSFYDARLE QKGWDTELFDDVSWQHAKLTAAPAPLIEAQKMPSITTADTLSVVSVNCISDTCYVFDMGI NTAGVPRLEIKGERGTRIRLRHSEMLQKDGNIDQRNIDMHLRPRNKREIIQTDEYILKGE GVETFIPPFTYHGFRYIELTSDRPLTVADVKLQTLRMHSDVAEVGSFKCSDQLLNTIFNI CRNSYLSNLFGIPTDCPTREKNGWMADGFMVQEAGMFNYDSRNVYAKWVKDMIDTQEANG NVAGIAPTSRRWDSNWAGPLWDAAIFIVPSYLYRYTGDIETMRQVYPAAERYLKYIETTE DERGLINHGLGDWLFYKAETPVDFMATGFVYWDNLMMAQMAELTGRVEDRQKYLAKAEEL KKRINDHFFDPQTVSYANKTQLSYALPLYLHIVPEAYRERLAENLHKIIAANDYSLDFGF IGSVMVPDVLAETGYAETAYRMLTKTTLPSWGYWIKETGATSLYETWDVTRRIGDASLNH PSMGAVSAWMYKYPAGIRLSPDASAFKKILIQPCFLSDLDFVEASHESMYGTIRVDWRRE EGKIRLHLVLPSTAMGTVVLPGQKPKAVKGGEHIFVIPE >gi|226332036|gb|ACIB01000020.1| GENE 18 28539 - 30938 1901 799 aa, chain - ## HITS:1 COG:CAC3436 KEGG:ns NR:ns ## COG: CAC3436 COG3534 # Protein_GI_number: 15896677 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-arabinofuranosidase # Organism: Clostridium acetobutylicum # 322 576 160 434 835 108 27.0 4e-23 MKINCLLALALSASSLAAQDAVIRVDAGKVENKITPYLYGACMEDVNHEIYGGLYDQKIF GESFEEPVSMDNFIGFTRCEGVWKLQDGQLSVQAHPGAKLVYDDTQLSDGTVGVDLKFTN GTSSSNNAGLLLRVGEYGGGADNFDGYEVSLFADGKRLLLGKHLHNWQELKTTPVDVDPT QWNRMEVKLDGDELVILLNDCEVLRFEDRDTGLRNGKVAFRNWGADVSYRSLTIGSNTPV KLLTKVTPQVSGMWDAFADLTAAAVYKQIADKPFHGRYAQEIEYLAGTGKVGIANRSLNR WGISVRQGEKKTGSLYLKGKAEVRVALQSVDGEKEYAVQCIRANAGDWKKYTFELTPDKT DENARLAIYLEEKGRIQVDMVTLMNGADRQFCGLPLRNDIGQAMVDQGLRFLRYGGTMVN APEYRFKRMIGDRAERPPYKGHWYTYSTNGFGIEDFLHFCEKAGFMPAYAVNVEESAQDM ADMIEYLNGSVDTKWGKKRAENGHPEPYGLKYLEIGNEEVIWGDIEADYQHYIDRFNDIY EAVHAKDPEVQFIHSAWWRPESPNMEKVFKALDGKAAYWDYHPWTDDLGSHVNIDRELTD MKAKFLKWNPKTSMRCAIFEENGNLHNVQRAIVHATVQNVVRRHGDFILTTCAANALQPY LQNDNGWDQGQIFFTSGQVWGMPTFYAQQMSSAHHHPLRLWSETVGELDMTATTNEARDE VILHVVNTSSEVRETQVSLSGFIPKGNMEIYTLSGELNDENLPDTPTRILPKATWKQAPG SDFTYSFPGYSYTIVVLKK >gi|226332036|gb|ACIB01000020.1| GENE 19 30967 - 33054 1597 695 aa, chain - ## HITS:1 COG:mlr2247 KEGG:ns NR:ns ## COG: mlr2247 COG3533 # 
Protein_GI_number: 13472070 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Mesorhizobium loti # 102 691 99 659 662 322 33.0 1e-87 MRTYRFMFPVLLAMTATGSLTAGPSDKGSSGIRFPEVNQVKIEDTFWKPKLNLWQKVTVN DVFDKFEGKHLEHPGEFCNTFENFDRVAKGERNIGRHAGAPWFDGLVYETIRGVADLMAR QSDVALEKRVDGYIDRIAVAQASEPNGYIDTYTQLMEADHQWGERGGMLRWQHDVYNAGM LVEAGVHYYNATGKTKLLEVATRCANLMAEYMGTFPKKNVVPAHSGPEEALVKLYTLYRD NPGLKKQINVPVVEREYLRLAEFWIEGRGKHCGFPLWGTWGNPAAEQWIRDRKYEAPEFG NHTRPSWGDYAQDSIPLFEQKTIEGHAVRATLFATGATSAALENRSPEYIAAVSRLWDNM IGKRMFITGGVGAVHFDEKFGPDYFLPTDAYLETCAAVGAGFFSQRMNELTGDAKYMDEL ERTLYNNVLTGISLSGTQYTYQNPLNSAKHARWGWHDCPCCPPMFLKMMSAMPGFIYSQK GDDIYVNLFIGSETELSLSDQSRIRLTQKTGYPWDGSVVMTVEPEKEKTFLLKVRIPGWA QGVENPYDLYRSEVKSAVNLKVNGKSIAMKIFKGYAEIQRKWKKGDRVELTLPVQPRLVT ANEAVADLQNKVAIAAGPFVYCLEGCDNEGVADLRLNTRAPLSMTFEKELLNGVNVIKGQ ALDKTGKKVSVSAIPYYALGNRQDKGYVVWMPANK >gi|226332036|gb|ACIB01000020.1| GENE 20 33266 - 34243 721 325 aa, chain + ## HITS:1 COG:no KEGG:BF0368 NR:ns ## KEGG: BF0368 # Name: not_defined # Def: putative transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 325 1 325 325 640 100.0 0 MKRNESKYKQVVNHVIDGINNGSYKKGDWILSINEFRKNYNLSRDTVFAGLSELKSKGII DSTPGVGYYIATTRIAQKLNIFLLFNEFNEFKEDLYNSFISSIRKTANVDLYFHNYNRKV FETLINEANYKYTTYILMPGKFTNIAPLLESLSGRVFLLDHFHPELAGKYSSVAQNFEKD TYEALVYGLPHLKKYDHIIMVQKEEKEPIERYNGLCAFCEEYHFTHEYTDSVRNREIRQG ETFMVVNDRDLVDLLKQAQLQNFAPGKDFGIISYNDTPLKEILAGGITTLSTDFKQMGQT MASLITQKEIKTIENPWKLDIRNSL >gi|226332036|gb|ACIB01000020.1| GENE 21 34404 - 35786 799 460 aa, chain - ## HITS:1 COG:no KEGG:BF0314 NR:ns ## KEGG: BF0314 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 460 1 460 460 956 99.0 0 MKSFYPVLYLCFLFGLLPFLGSCSEENESSGTIEMNQLPGTAQSFLSNYFPGQTPEKIER TNTDQENARLLYRVVFPNEVKVEFSENGGWKRLMIPDQKLPGSLDSLWGKIIEYVQQLFP DDPFIGIENACYGDCVLLSSGKKIAFYYDGTCIGYEMDIKDESGVPQPVRDFVATYFPDG VFQAVVEHIPNENVTAGYSFWLENGFKCVLNDRGQWTEVNGGTELLPVSILEALPAKVTE 
QLYRDYPAAQVTYIRLEGTCYTIQVSKTVYVTIDPENKPIVVPVMQAQALAEEYFGKLRS ISISHPLHTDVLNFKVCLPNGFNMLVNEDASEWLNIDGNGFAFPEKLVASLPEKITEYIS AHSNSEITRVDRSVAASFLVELTNGDGLMFDSQGGFLGKEKIELSISEKTYRYMRHQFPD DLNMYFSSYSIEGWIYKLGDGSQVRFDRDGNFVEMIAAAK >gi|226332036|gb|ACIB01000020.1| GENE 22 36089 - 36652 373 187 aa, chain + ## HITS:1 COG:CAC3336 KEGG:ns NR:ns ## COG: CAC3336 COG0664 # Protein_GI_number: 15896579 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Clostridium acetobutylicum # 14 186 21 194 199 67 30.0 2e-11 MKNIINKIRQLYPVSDEALQALQANMQVKYYPKDTYIVQSGITDRLVYFIEEGVTRSVFH HNGQDTTTWFSQEGDVTFGMDSLYYKQPSIESIETLSDCKIYVLHIDELNALYEKYIDIA NWGRILHQDVNKELSHMFVERLQLSPKERYEQFNRRYPGLINRVKLKYVAAFLGISIYTL SRVRAKK >gi|226332036|gb|ACIB01000020.1| GENE 23 36749 - 37903 1012 384 aa, chain + ## HITS:1 COG:CC1328 KEGG:ns NR:ns ## COG: CC1328 COG1835 # Protein_GI_number: 16125577 # Func_class: I Lipid transport and metabolism # Function: Predicted acyltransferases # Organism: Caulobacter vibrioides # 16 375 12 332 337 119 31.0 8e-27 MSNISSTVFADTKPHYHLLDGLRGVAALMVIWYHVFEGYAFAGGTTIDTFNHGYLAVDFF FILSGFVIGYAYDDRWGKNFTMKDFIKRRLIRLHPMVIMGAVVGAITFYIQGSVQWDGTH IGISMVMLSLLCTIFFIPAMPGVGYEVRGNGEMFPLNGPCWSLFFEYIGNILYALFIRRL SNKALTIVVVLLGVALASFAIFNVSGYGNIGVGWTLDGVNFIGGLLRMLFPFSMGMLLSR NFKPMKLRGAFWICTLVMIALFAVPYLEGTESICTNGIYEAFCIIIAFPILLWIGASGTT TDKKSTQICKFLGDISYPIYVIHYPFMYLFYAWLIKNQLFTLGETWQVALCVYAWNILFA YLCLKLYDEPVRKYLAKRFLNKKQ >gi|226332036|gb|ACIB01000020.1| GENE 24 38138 - 39169 682 343 aa, chain - ## HITS:1 COG:no KEGG:BF0310 NR:ns ## KEGG: BF0310 # Name: not_defined # Def: lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 343 1 343 343 714 100.0 0 MVKLLLICLATIVMISCGSEQEKLTVYIPDTCTDTLQYEIYLPEEEGLNIFYDLCVTDSF CAFLDTRNDTLLKIFTATIPPALVGLGMKGEGPDDFLFPFFEKSIGREGKGKLSFIELNS WNKKIVAIHSAASPAPVAVSVVEAQQLPEMPVVRDYNETDSCVYGIDVDMQHGLFFIYDK 
HTARVKTVDYHRDIRSGYPEGHLSYLYESCLMVNQDAKAACMGLLNLNSLCFYDLKGNLM KEIVIGKELKSPEYDPEFLDFPNAPKYFISLCGTPNYLYALYNGFPGTSGKSKIMVFTWQ GAPVAIYQTDVKLERIAVAPSGRYVLGLNITEEGGSDVLKFEL >gi|226332036|gb|ACIB01000020.1| GENE 25 39356 - 39568 91 70 aa, chain - ## HITS:1 COG:no KEGG:BF0309 NR:ns ## KEGG: BF0309 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 70 9 78 78 123 95.0 2e-27 MLLLFLVDKHRRLLSDFLKALYFCEKQTRMLLLSPLGVHCVVKTSGFLQYDKRKNEIETI FSTNPTNVLK >gi|226332036|gb|ACIB01000020.1| GENE 26 39990 - 40286 166 98 aa, chain - ## HITS:1 COG:no KEGG:BF0307 NR:ns ## KEGG: BF0307 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 98 43 140 140 209 100.0 2e-53 MAMLIANIPVRDLHVKCKEFDNPRLYRVDSLSFDFDYSLTPYVFKHKELYYMAYSGNSFE SPFYGVGCATATDIIGKWTKYDENLILQNPGAFAGCRV >gi|226332036|gb|ACIB01000020.1| GENE 27 40498 - 41409 383 303 aa, chain - ## HITS:1 COG:no KEGG:BF0306 NR:ns ## KEGG: BF0306 # Name: not_defined # Def: lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 303 52 354 354 614 99.0 1e-174 MDSSLLYDICALFKQDDRYFIWSRDNAYVFNDKGDFLFNISCKGQGPGEYLSFGCMFMED GEVCIFDQDKQQILRFDINGKFMGVQKVLLDEGAPSPSMIIPIGTERYLSTNRFGGDYRK MPVLSFWNKNFSSQQIVKGRFMNDGIHFPDAFFVGEEGRRVLYWEPLKDTLFTVTDNFLV PEYKIDFGTYAIPEEEGAKDIYARIMYLNKPENQSCASVARFYQIDGYYIYFTFMWYDRV YLCRYSEKTKKSEIFAILTDEMQLKECSFFKILGDDIVIAFEDKGNLEKNPSLCVFNKKI LDI >gi|226332036|gb|ACIB01000020.1| GENE 28 41568 - 41861 210 97 aa, chain - ## HITS:1 COG:no KEGG:BF0356 NR:ns ## KEGG: BF0356 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 97 1 97 97 169 100.0 2e-41 MKKFVFIAAVLGFTNVFVAMTQQKNSPIPLHNNGIETFTVYEADGSNYCYNGGRGATSCS ISGGIDIKGGGASAACDVSCQTGYYACCGIRCTCEKY >gi|226332036|gb|ACIB01000020.1| GENE 29 42331 - 43299 743 322 aa, chain - ## HITS:1 COG:CAC1529 KEGG:ns NR:ns ## COG: CAC1529 COG3940 # Protein_GI_number: 15894807 # Func_class: R General function prediction only # Function: 
Predicted beta-xylosidase # Organism: Clostridium acetobutylicum # 44 286 15 283 327 60 27.0 3e-09 MGKKKISTGVWLLGLLIVFCSYTGVAGAKNRSKTKTVTKSVPLGDPFILLHDGTYYAYGT HAADGIEVYTSKDLRKWKLHGLALHKDDVWADSRFWAPEIYEIDGKFYMYYTADEHICVA IADSPLGPFRQNEKKPMVAGEKMIDSSLFIDEDGKPYLFFVRFNDGNNVWVAELEDDYMT IKTETMRPCIHVSQAWEEVWPRVNEGSYVLKHNGLYYMTYSGNSFESPFYGIGCATATDI MGEWTKYQENPILQKPGNLQGVGHSAMFRDKKGRLRIVYHAHKDKEHIHPRGMYIGKVYF EKVDGIDRMRINKEYIAAELVE >gi|226332036|gb|ACIB01000020.1| GENE 30 43331 - 46324 2268 997 aa, chain - ## HITS:1 COG:BH0236 KEGG:ns NR:ns ## COG: BH0236 COG5498 # Protein_GI_number: 15612799 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted glycosyl hydrolase # Organism: Bacillus halodurans # 867 991 799 921 1020 62 33.0 3e-09 MFINSLRKSPFSCSPGLILLGAFTLLSVPVYGQQIQQVERQVQQVPFLQFNFDEQGGETA RNSGSGGSKYDARINGGTVEWVPGLQQGAARLSNKGHFKLPDGVLAHVKDFTLSVWVYLN EQSDNQTVCTFACGTDRYLILTTQRGNEENGVSLVMTKTQESGNHTDKEERIAYTRQKGK LSANAWHHLAFTLKGSVGTLYVDGVKAEIKTDFTVNPSLLGNTTDNYIGRPTWPDPYLNG GIDDFRLYDYALTDRQVYELASVADGRLVQEDRDGLSLGDLSAVTTDLVLPSSGKSGTTI SWSSAQGQYISDSGKLYRPDAGTGNKKATLTATVRKGDVALTKDFVLTVKDIGTEPEDVN VFSMQTGNPTVPAYLADASFYYDDRTKTFYAFGTNDGAGGENVYPAQMWYSKDCKEWKNK VIAFPKSWTDYAGTLCVWAPSIEYNPDTKKYYLMYSIASNTFVGMADDLLGPWEDANGAA PGKMLFKGYDGQFFMDDDRTMYIVTDSWHFKIMKLKFDEAGKIYIDNSDPVFAKSDSNPF IGTYHYTQIEEIKNAFEASFIFKRNNLYYLMWSFNGSENYNVRYAVADKITGPYREINRS MTVPILQRDDANRILGPGHHSMFCYGGRTFIAYHRQHYPFVDSKRQTCIDEVFFNEDGSI RPITPTHKGVTVAPDVPGDHRTNLALGKQTLTSSARVYDDSEFAPRYRTHGISFCYAGNF AVDENYGTHWDPGVGAHKPWLIVDLGSECKVDEIETIFEFTSRTYKYKLEYLSQKEADSL DAASGSHLWKVFADRSTDGVGQSPVTDTKPGNSPVKARFIRLTILEGVDIPLRADGLDKK NAENALSIFELKVFGEDRSDDLNRIFEAESFHNLYGIALEKNAAENGFIMGQIDNNDYLL YRNVDLGKGAGTFTAKVASGTEGGKIEVYLGSLKGKPIGVLEVGNTGGDQSWEIKSTSLE RIARGRQEKLYLLFKGKTGTENLLKLDWFQFTKDRMK >gi|226332036|gb|ACIB01000020.1| GENE 31 46374 - 47594 1007 406 aa, chain - ## HITS:1 COG:BS_yxaH KEGG:ns NR:ns ## COG: BS_yxaH COG2311 # Protein_GI_number: 16081049 # Func_class: S Function unknown # Function: Predicted membrane 
protein # Organism: Bacillus subtilis # 1 401 1 399 402 115 26.0 2e-25 MTHTQTITPKKRINSIDALRGFALIGIMLLHCMERFDLTLAPVVESPFWQAIDTAVYDSL YFLFSGKSYAMFSLLFGLSFFMQMESQAAKGVDFRGRFLWRLALLFLFGYINGLVYMGEF FMVYAVLGVFLIPLYKVSTRWLLVLCILLFLQIPAVISFVSLLSDNVANEPTAAAAYMDR LFERAADVFINGSLMDVLSFNTFDGQSAKCLWVFNNFRYLQLLGLFIAGMLIGRQGIHKS EEKMVKYSRLFLPYCLAFWAVFYAVAFLLPVWGVDGFALRVGQTLFKTYGNLGQMMVYFC GFTLLYYRYKGQKVLDRIAPVGRMSVTNYMAQSIVGVSLFYGFGGNFAVEFNYLQSFLLG AAFCVIQIAYSNWWIKRFYYGPMEWLWRSLTWFQVVPLSRRKASLG >gi|226332036|gb|ACIB01000020.1| GENE 32 47698 - 48819 1403 373 aa, chain - ## HITS:1 COG:CC1418 KEGG:ns NR:ns ## COG: CC1418 COG2017 # Protein_GI_number: 16125667 # Func_class: G Carbohydrate transport and metabolism # Function: Galactose mutarotase and related enzymes # Organism: Caulobacter vibrioides # 24 370 24 377 378 283 44.0 5e-76 MWALGALFVAGCAETEKATTDSGLVKSNFQTEVGGKKTDLYVLRNQNNMEVCVTNFGGRI VSVMVPDKEGVMRDVVLGFDSIQDYISKPSDFGASIGRYANRINQGKFTLDGVEYQLPRN NYGHCLHGGPKGFQYQVYDAKQVGPQELELTYLSKDGEEGFPGNITCKVIMKLTDDNAID IKYEAETDKPTIVNMTNHSYFNLDGDAGSNADHLLTIDADAYTPVDSTFMTSGEIVTVEG TPMDFRTPTPVGKRINDFDFVQLKNGNGYDHNWVLNAKGDITRKAATLESPKTGIVLDVY TDEPGIQVYAGNFLDGSLTGKKGITYNQRASVCLETQKYPDTPNKPEWPSAVLRPGETYN SHCIFKFSVDNGK >gi|226332036|gb|ACIB01000020.1| GENE 33 49040 - 50506 1609 488 aa, chain - ## HITS:1 COG:XF0843 KEGG:ns NR:ns ## COG: XF0843 COG3538 # Protein_GI_number: 15837445 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Xylella fastidiosa 9a5c # 47 475 67 497 516 466 49.0 1e-131 MPGKNSKKMIGACVVTAALLCAPSALKAEGMLSHYTCVADAIQKDNRPEPAKRLFRSQAV ENEIIRVQKLLRNSKLAWMFTNCFPNTLDTTVHFRKGKDGKPDTFVYTGDIHAMWLRDSG AQVWPYVQLANSDPELKTMLAGVINRQFKCINIDPYANAFNDGPKGGEWMSDLTDMKPEL HERKWEIDSLCYPLRLAYQYWKTTGDASIFDEEWIQAITNILRTFKEQQRKDGVGPYKFQ RKTERALDTVTNDGLGNPVKPVGLIVSTFRPSDDATTLQYLVPSNFFAVSSLRKAAEILT TVNKKTALANECKALANEVETALKKYAVYNHPKYGKIYAFEVDGFGNHMLMDDANVPSLL AMPYLGDVSIDDPIYQNTRRFVWSLDNPYFFKGKAGEGIGGPHIGYDMVWPMSIMMKAFT 
SKDDAEIKSCIEMLMNTDAGTGFMHESFHKDNPEKFTRAWFAWQNTLFGELILKLVNEGK VDMLNSIQ >gi|226332036|gb|ACIB01000020.1| GENE 34 50550 - 52835 2198 761 aa, chain - ## HITS:1 COG:L135972 KEGG:ns NR:ns ## COG: L135972 COG3537 # Protein_GI_number: 15673483 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Lactococcus lactis # 35 758 11 716 717 420 32.0 1e-117 MKKLALLLVGVLGTAFCTFAKSTTEPVDYVSPLVGTQSKHALSTGNTYPAIAMPWGMNFW VAQTGKMGDGWAYTYDADKIRGFKQTHQPSPWINDYGQFAIMPVTGKVVFDQDQRASWFS HKAEVAKPYYYKVYLADHDVTTEIAPTSRAAMFRFTFPESKDSYVVVDAFDNGSYVKVIP EENKIIGYTTKNSGGVPENFKNYFVLVFDKPFTFTAAVTNGNIRPGELESKDKHAGGIIG FSTRRGETVNVRVASSFISPEQAEQNLKELGKDNLEAVAAKGRQEWNKVLGRIEVEDDNT DHLRTFYSCLYRSVLFPRSFYELDAKGKPVHYSPYNGKVLPGYMFTDTGFWDTFRCLFPF LNLMYPSMNEKMQEGLANTYKESGFLPEWASPGHRGCMVGNNSASVVADAYLKGLKGYDI ETLWEAVKHGANAVHPQVSSTGRLGYEYYNQLGYVPYNVGINENAARTLEYAYDDWCIYQ LGKALNKPEEEIAVYAQRAMNYKNLYDKEHKLMRGKNKDGQFQSPFNPLKWGDAFTEGNS WHYTWSVFHDPQGLIDLMGGQQGFNQMMDSVFILPPVFDDSYYGGVIHEIREMQIMNMGQ YAHGNQPIQHMLYLYNYSGQPWKAQHWIREVMDKLYTPNPDGYCGDEDNGQTSAWYVFSA MGFYPVCPGTDQYVMGTPYFKQMKLHLENGKTVQISAPGNSDENRYIASMTVNGKTLTRN YLTHKELMNGAKITMKMSSTPNKQRGVRESDFPYSFSKEVR >gi|226332036|gb|ACIB01000020.1| GENE 35 52955 - 55435 2690 826 aa, chain - ## HITS:1 COG:no KEGG:BF0349 NR:ns ## KEGG: BF0349 # Name: not_defined # Def: glutaminase # Organism: B.fragilis # Pathway: not_defined # 1 826 12 837 837 1669 99.0 0 MKLKLSTLFLGAAAMLSSCGTPQDVKSEKSEMRAPAYPLVMIDPYTSAWSFTDNLYDGPV KHWTGKDFPFLGVAKVDGQIYRFMGTEELELLPLVKTSEQGRWTAKYTTKKPADGWQNAD FNDAAWKEGEGAFGTMENESTARTQWGEEYIWIRRKADIKDNLQGKNVYLEYSHDDDAII YVNGVKVVDTGNSAKKHMLAKLPEEAVAALKQGENLIAIYCNNRVANGLIDCGLLVEKDN TQNFTQTAIQKSVDVQAMQTNYEFTCGPVDLKLAFTSPLFMDNLDLMTRPVSYLTYEVAS NDGNKHNVELYFEAGPQWALDQPHQEAVAESFTEGNLLYLKTGSRNQEILGKKGDDVRID WGYFYMAADKENSSCATGEGKTLRKSFIDGKLTSSKTDGSDKLALVRSLGETKKAEGHLL LGYDDLYSIQYFGENLRPYWNRNRNETIQSQFAKADKEYDAVMDKCAAFDANLMKEATEV GGRKYAELCALAYRQAIAAHKLVEAPNKDLLFLSKENFSNGSIGTVDITYPSAPLFLVYN 
PELAKGLMNHIFYYSESGKWNKPFAAHDVGTYPLANGQTYGGDMPIEESGNMLILSAAIA IVEGNADYAQKHWDVLTTWTDYLAQYGLDPENQLCTDDFAGHFAHNANLSIKAILGVASY GYLADKLGKKEVAEKYTQKAKEMAAEWVKMADDGDHYRLTFDKPGTWSQKYNLVWDKLMN LQIFPETVAQKEIAYYLGKQNQYGLPLDNRETYTKTDWIMWTATLAPDKATFEKFIDPVY LFMNETTDRVPMSDWVFTDRPNQRGFQARSVVGGYYIKMLEKKLKK >gi|226332036|gb|ACIB01000020.1| GENE 36 55462 - 57624 1722 720 aa, chain - ## HITS:1 COG:no KEGG:BF0348 NR:ns ## KEGG: BF0348 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 720 1 720 720 1492 99.0 0 MSKRVLVLIGLFLACGGVYSQTATGTKTNFQTAESWKPETDVRADAVMVYGTLDKKGVTF EQRIQSWRDKGYRAEFMTGVAWGDYQDYFLGKWDGVKDHLKEGQRDREGREIAHGHLIPY IVPTESFIRYMQEKQIKRVIDAGITSIYLEEPEFWMRGGYSEAFKSEWQKYYGFPWRAQH ESPENTYLSNKLKYYLYYNALNQIFTYAKTYGKSKGLDVKCFVPTHSLVNYTSWQIVSPE ASLASLDCVDGYIAQVWTGTAREPNYYDGVKKERVFENAFLEYGCMKSMTAPLNRKMYFL TDPIEDRAKDWLDYKINYQATFAAQLMYPAVDTYEVMPWPDRIYQGLYQVAGTDRKERIP RDYSTQMQIMVNTLNDIRTSETQVSGTHGIGVLMANSLMFQRFPDHDGYDDPQFSSFYGQ TLPLLKRGIPVELVHMENTPFGDTFKGLKVLVMSYSNMKPMESRYHDFLADWVRKGGALI YCGEDIDPYQSVLEWWNSNGNQYKAPSEHLFEKLGLDRVPAAGTYPCGKGMVTVIREDPK HFVLKSGNDRQYFDAVSAAYRKSAGKEVELKNSFLLERGPYTIAAVLDESVSDAPMELSG VYIDLFDKDLPVLTHKVIRPGEQGYLYNVKRISGRAKAKVLCGASRIYDEKAGKRSYSFV AKSPLHTTNASRILLPKQPIRVCVNGKEEPQPEKLWEERSRTLLLKFENDPAGVQVDIEW >gi|226332036|gb|ACIB01000020.1| GENE 37 57653 - 58771 1051 372 aa, chain - ## HITS:1 COG:lin0763 KEGG:ns NR:ns ## COG: lin0763 COG4833 # Protein_GI_number: 16799837 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted glycosyl hydrolase # Organism: Listeria innocua # 70 364 46 334 341 76 27.0 9e-14 MKNKVLTIVAILLLLPNAMAWAHQPADGNQKHFTKKDATTAMDAFHSTFYNPDMKLYAIS SDMKGRAAIWVQAIYWDMIMNAYKRTKAPKYRRLIEEVYQGGYEQYDKYNWDNKIEWFIY DDMMWWIISLARAYEITNDPKYLAHASSGFYHVWKESYDKERGGLWWNFKHDGKMACINY PTTVGAMTLYNVTKDPDYLEKAKSVYAWSRDVFFDKEKGRIADNMHYHFQRQNGMDIDWT PQLYNQATFIGSAVMLYKATGEKAYLDDAVLAADYVRNEMCDADGLLPFKNGVEQGIYAA IFAQYIIRLIEDGNQPQYMDWLRHNIDVAWNNRDVNRNVTFKDAAKPCPTGVMESYDASG CPALMQVISPFK 
>gi|226332036|gb|ACIB01000020.1| GENE 38 59808 - 60785 594 325 aa, chain - ## HITS:1 COG:BS_abnA KEGG:ns NR:ns ## COG: BS_abnA COG3507 # Protein_GI_number: 16079933 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-xylosidase # Organism: Bacillus subtilis # 29 264 39 284 313 113 35.0 4e-25 MKKLLFSLFTVFSFCVPSIAQQYSNPVINYSLPDPTVIKADDGYYYLYATENIRNLPIHR SKDMVNWSFVGTAFTNETRPTFEPKGNLWAPDINKIGDRYVMYYSMSVWGGEWTCGIGVA TADKPEGPFTDHGKLFRSNEIGIQNCIDPFYIEDGGKKYLFWGSFHGIYGAELSDDGLSL KEGMKPQQVAGTAYEGTYIHKRGGYYYLFASIGRCCEGLKSTYTTVVGRSKYLFGPYVDK KGESMLENHHEVLIDKNEAFVGPGHNSEIVTDDKGADWVFYHAVSVANPEGRVLMLDRVN WKKGWPVVEGDTPSLQAKAPVIRHK >gi|226332036|gb|ACIB01000020.1| GENE 39 60880 - 61308 303 142 aa, chain - ## HITS:1 COG:no KEGG:BF0343 NR:ns ## KEGG: BF0343 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 143 143 169 69.0 4e-41 MNKKFFITAVVLVSLGMTNVMANTSNNEQIEVSSVNKVATTEYYLEIGSQSGSGLFEEIE LLDPTGKVLRIKTKRGGRVEIFCHGDNSDSYRLWYTADSWTGEIITEEIKANRFTISTSY NESASSSSTTKYETRYYYIREK >gi|226332036|gb|ACIB01000020.1| GENE 40 61407 - 62114 626 235 aa, chain - ## HITS:1 COG:no KEGG:BF0290 NR:ns ## KEGG: BF0290 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 235 1 235 235 465 98.0 1e-130 MKTISKIFSVLLLGAMMFTSCMKDNYDAPESMLTGRVMYNGEALQLRGNEAVQLQLYQHG YAKHDPINVYVNQDGIYSASLFNGEYQMITKSGNGPWTSEGRDTINVTVAGNTVQDVEVT PYYLVRDAQMTLEGNKVNASFKVEKVAGGGIDRVFFMLSTTQFVNDAEHNVDRYDETDNL DAYDETGKLYTFATRDYTDNSMFQTALKRGTLFGRICIWPKGSDQGIYSKVIRLK >gi|226332036|gb|ACIB01000020.1| GENE 41 62134 - 64020 1443 628 aa, chain - ## HITS:1 COG:no KEGG:BF0289 NR:ns ## KEGG: BF0289 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 628 1 628 628 1265 99.0 0 MKNRYFLMALAAAGVLSLSSCNGFLDTKPNDIMTQDQVYADPALVKSVLANFYGRITFGQ RNEAVEDYFMLDEAIHYDNNSDENIDRNKWRPYDYTLIRNLNQFLQGIKGSTAVDEETKR LYEGEVRYIRAWTYFCIARGLGGIPIVGDDVFDYTGGMDITTIQVPRSTEAETYDYIIKE 
CQEAAAMMSKQTNKNNSRANYWVAKMLEARVAITAASLATYNTVAEHPQLRTAGGEVGIP ADKAEGYYRTALAAAKEVIEGAADGTASPYRLMLAADKTSEALADNFFKAVCEKSGNTEV IWTRDYATPGYGHEFTKNCLPKSIEQDTGSDRMSVLLNLVEAYESTDATESERGKAAKFD IGTLDDPKFFDDPMDLFADRDPRLAATVLLPGSTFDGKLIELQAGQLNKVNGQWIERTGR RNETDAQGRLITANNGPFGGNEREINRTGFFVRKYLDKTPLAGTQGTKSAMWNVYFRISE AYLIAAEASWELSRNNSDVEALKYINAVRERAGIQPLTSIDHQKIMHEYQVEFAFEGHRW WDLKRWMEADNIWTGNENDRTAQRLGLWPYRVVADGDANNGKWVFVEKNMQTLDLWRKPL KCTDVQYYSEIDNGWINNNPKLVKNPYQ >gi|226332036|gb|ACIB01000020.1| GENE 42 64028 - 67231 2392 1067 aa, chain - ## HITS:1 COG:no KEGG:BF0288 NR:ns ## KEGG: BF0288 # Name: not_defined # Def: putative TonB dependent receptor outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1067 1 1067 1067 2113 99.0 0 MRKKEQMFWLASRSRMWRIPLCMAAFSLLPSAYSFASAENPATETVLAVNSVQQQRTVKG IVIDANGEAVIGANVKEPGSTTGTITDMNGEFSLSVGPKATLEISFIGYTTQKVNVGASN TVKVILKEDTKVLDEVVITGFGMSQKKATLTGAVAAIASTDIERSNATTASGALVGKIAG INTRQNDGRPGASTQLQIRNMGAPLYVIDGVVSDDGQFNHMDFNDIESISILKDASAAIY GVRAANGVVVVTSKKGQRNSKNSVSINAYYGWQHNSRYVQPAKTKDYVNAYVSAETWAKK ADNDRRYNKEEYAKWMAGTEKNYQGFDWSDYIWITAPQSYISANLSGGTDKANYYVAVSH IDQEATVRGYGGFRRTNAQMNIDMNISDRFKIGATMNGRIEDRHHPGVPGGDDTWLPRFA TMKNQPTKRPYANDNPLYPQLVSDQKETNFALLNYDTSGKVSDVWRVIQMQTTAEYEILK GLKAKGMLGYYYAYNELDNQEYTFKLYGYNEKTGEYYETAAMDNPYRERNREKVEDQFAN FQLNFDRRFGNHSINAVASFEATQRKRPKSWVHSVPVANGMDLIRFKEIVEYNDTGNNTE ARMGWLGRINYSFADRYLIELIGRWDGSWKFKPENRWGFFPSASLGWRISEENFWKESKI ANVFSNLKIRGSYGVVGDDNVSDYTAFDYLPGYKYNNGGAVLDGDWVVGTETRGLPNKTL SWMESKILDIGVDMGFFNNRLNAQVDFFQRIRDGIPESRYDVLIPNEAGFSLPKENLRSD KHVGFDAMVNWTDHVSDFNYSVGANMTYSRFWDWEQYDTRHSNSWDVYRNSIWHRVGYVN WGYEAVGRFTSWEQIATYPIDNDRKGNKTVVPGDIMYKDVNGDGVINYMDERPIGYRSDS TPTLNFGINLSASWKGFDLAMDWTGSGMTSWQQQYETARPFQNDGNSPDEVFKDAWHLAD IWDADSQLIPGKYPLIRLNNEETSAYDKSTFWLHNVKYIKLRNLEFGYTLPKRIVAKAGI SNLRLYVSGTNLFSISNIPFMDPECINSNGLDYPTMRVVNLGINLKF >gi|226332036|gb|ACIB01000020.1| GENE 43 67575 - 71192 3163 1205 aa, chain + ## HITS:1 COG:lin2123 KEGG:ns NR:ns ## COG: lin2123 COG0383 # Protein_GI_number: 
16801189 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-mannosidase # Organism: Listeria innocua # 31 847 239 1032 1032 302 29.0 3e-81 MKLHIAMLAATLLLSGGDSYAQGNKQEKKAKAYMVADAHLDTQWNWDVQTTIKEYVWNTI NQNLFLLKKYPNYVFNFEGGVKYAWMKEYYPAQYEEMKKYIGEGRWHISGSSWDATDALV PSTESFIRNIMLGQQYYRQEFGVESTDIFLPDCFGFGWTLPTIASHCGLIGFSSQKLDWR VHPFYGKSKHPFTIGLWKGIDGSSIMLAHGYDYGRRWNDEDLSENEQLKELAGRTPLNTV YRYYGTGDIGGSPTLASVRSVEKGLRGNGPVEIVSATSDQLYKDYLPYKNHPELPVFDGE LLMDVHGTGCYTSQAAMKLYNRQNELLGDAAERAAVTAEWLNQAKYPGSTINEAWKRFIY HQFHDDLTGTSIPRAYEFSWNDELISLKQFSNVLTSSIHGIGRELDTRVSGIPVILYNAL GFTVTDIAEIELDLPKAPKGITVYDEKGKKVSAQLISCTDGKARILVEATVPATGYVVYD VRTSGTGASNISTNVNTLENSLYKITLDKNGDIVSLTDKKNGKELVKAGKAIRLALFTQN KSYNWPAWEVLKETTDRTPVSITDDVKITLVEDGTLRKSLCVEKRHGESVFRQYIRLYEG SRAERIDFYNEIDWQSTNALLKAEFPLNIENEKATYDLGIGSIQRGNNTETAYEVYAQYW ADLTDRDGSYGVSVMNDSKYGWDKPDNHTIRLTLLHTPETRGGYAYQDHQDLGHHTFTYS LIPHQGALDKPATVEKAEKLNQQLKAFRTEKHKGNAGKSFSFVASDNRNVLIKALKKAEE TDEYVVRVYETEGRKAQRATLTFAGEIISASEANGTEKTIGNATFEGNKLQVNITPYSVR TYKVRLKPSGREASPIEYAALPLDYDRKCASYNEFRGEGDFESGYSFAAELLPDSLIAGQ ITFRLGEKEIANGMTCEGDTLQLPAGNKYNRLYILAASTEGDNQADFRIGKQTASFVVPS YTGFIGQWGHKGHTKGYLKDAEIAYVGTHRHASNGDQPYEFTYMFKFGMDIPKGATSVIL PRNEKVVLFAATLVAENEPATTVAGTLFRTNNVGNAATAGNDEEAVRENILKRAKIIACS GYTNDEEKPDFLLDGKTDTKWCDVSQTPNYVDFDLGEAQNISGWKMVNAGQESHSYITNG CFLQGKMNPGDEWTTLDAIDGNHANVVSRPLNYDGKVRYIRLLVTRPTQSTGGRDTRIYE LEVYK >gi|226332036|gb|ACIB01000020.1| GENE 44 71212 - 73680 1300 822 aa, chain + ## HITS:1 COG:no KEGG:BF0286 NR:ns ## KEGG: BF0286 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 822 1 822 822 1654 98.0 0 MTIVADSWNTVSHYYLFSLPLFIMKTHSALTILCLLCTIHCIATPQKAVSDTLLSARFNR YARENPIEKLYLHQDRTNYYAGETIWFKVYQTLSSSATEASRIVYIDLSNSKGDIVKQVK YPLADGAASGSFSIPEHLHAGHYQLRAYTRWMQNFDPEFFFHRELTIYGDSEKDNIPQQT TKFKLRFFPEGGNLINGLTSRIAFEVVDANTGKGVQTEGVILNAHGDSIRKFATSHLGKG SFFFIPQKKEKYIARLAGDNTDFKIPSISEQGFVITVKHLKEALRILLTQKIGPSTKNSV 
YSLILHQEGRLIAILPVDGSQPRTLFDLPLDKLPTGVFTLTLIDEDYHAYCERLVFTHFP ETLNLKLSSTISVQEGHRKMSVNIRSTDKKGIPQPGSFSLAVAQTFLEQPTIRDNFSTYL FLSSNLKGQTEQPLSYWNPEDTESLSKIELLLLTQGWRRYSLEVFNQPDNLPRYPMEQSL ILSGKVENINKQKAKSVELQAILRQDSLKQFITCPLDGQKRFSLSGISFEGTKEVMLSAT DKNGKTYPIKLDDSIPVPSVKYMPSPFAPDSSFHVQWDITKSYIPQKEIDKQLFELGEVK VTARKKDPIEKRRPYSEGFVKTSTQVKASNSFGDVRQLLRTVPGITMVPNPDKTKSNLQY AHINGLPGGTVAVLVLDGYIAKDPEVVYSMEASRIERVEVLQQTSTQFGGFSSYGGTIVL YSRPIQGEVIATNNKICQWIGYNQTKEFYTPTLSDHSFFEHSEQRNTLYWNPTVKTDKEG RAQVSFFLNDQEDGEYVIHCEGYSEEGLIGTDFRVTEVPEHP >gi|226332036|gb|ACIB01000020.1| GENE 45 73829 - 76339 2826 836 aa, chain - ## HITS:1 COG:no KEGG:BF0285 NR:ns ## KEGG: BF0285 # Name: not_defined # Def: putative exported glutaminase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 836 1 836 836 1693 99.0 0 MKKLFTMALAVGMLCNAQAADLYKASRNVALRAPAVPLITSDPYLSIWSPTDALNESSTM HWTGTEHPLLGAIRVDGKTYRFMGKDKLNLETLVPMTDTDTWQGSYTFDEPAAGWNALSF NAAGWKEGQGAFGTPDMPRVHTRWTTPDIWVRRDFQINDDMNGETIYLKYSHDDVFELYL NGEKLVATDYSWNNDVLLELSDAAKKKLQKGKNVLAAHCHNTTGGAYVDFGLYRLNKQTT GFETAAVQKSVSVLPTQTYYTFTCGPVELDLVFTAPLMMDDLDLLSTPVNYISYRVRSLD KKQHDVQMYVETTPQLAINELTQPTRSKVIRRNGINYVQAGTIDQPILARKGDGICIDWG YAYLAGNIGANTAVSLGNYYGMKNEFATKGSLLPTQAECVTRRADQMPAMAYTDDLGEVG TDGKSGFLMLGYDDIYAIEYFYQPRMAYWKHDGKVSIFDAFERAKANYASVMERCRAYDE MILNDAEKAGGKEYSELCALAYRQVIAAHKLFKDADGNLLFFSKENNSNGCINTVDLTYP SAPLFLAYNPELQKGMMTSIFEYSASGRWNKPFPAHDLGTYPIANGQVYGGDMPIEEGGN MVVLAAAIAKVEGNADYAKKYWDLLTIWTDYLAEYGQDPENQLCTDDFAGHWAHNANLSV KAIMGVAAYSEMARMLGMDDVADRYAAKAKAMATKWEQMAREGDHYRLAFDRENTWSQKY NMVWDKMWNLNLFPNNVIEKEISYYQTKLQNPYGLPLDSRKEYTKSDWIMWTAAMSSDKA TFEKFISPVYKYANETVSRVPLSDWHHTDSGKFVGFKARSVIGGYWMKVLMDKMQK >gi|226332036|gb|ACIB01000020.1| GENE 46 76435 - 78249 1754 604 aa, chain - ## HITS:1 COG:BH2723 KEGG:ns NR:ns ## COG: BH2723 COG3250 # Protein_GI_number: 15615286 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Bacillus halodurans # 106 472 124 484 1014 88 23.0 4e-17 
MKKSILIAVLTASIATSAFAQWKPAGDKIKTKWAEQVNPENVLPEYPRPVMERGEWKNLN GLWNYAITEKGAAPSAYEGQILVPFAIESSLSGVGKKVGPDKELWYQRTFTVPASWKGKK VMLNFGAVDWKADIWVNDIKVGQHTGGFTPFSLDITAALATKGDNKLVVKVWDPTDRGPQ PRGKQVNRPEGIWYTAVTGIWQTVWMEPVAERHITNVRTTSDIDRKKLTVDVTTSTSCPS EVVEVKVFDGKQLVATGKGLNGQTIDIQMPADAKLWSPASPTLYSMQIALLSNGKVTDKV DSYTAMRKYSTRRDKDGIVRLQLNNEDVFQFGPLDQGWWPDGLYTAPTDEALVYDIQKTK DFGFNMIRKHVKVEPARWYTHCDKLGIIVWQDMPNGDREPEWQMYNYFTGNELNRSEESE QIYRKEWKEIMDYLYNYPCIGVWVPFNERWGQFKTEDIATWTKKYDPSRLVNPASGGNHF PCGDILDLHHYPNPSLDFYDAGRATVLGEYGGIGLALNEHLWEPNHNWGYVKFNSPEEVT KQYVEYGNELYRLISRGFSAAVYTQTTDVEMEVNGLMTYDRKVIKVNEAQVKAINTKICN SLNK >gi|226332036|gb|ACIB01000020.1| GENE 47 78352 - 79458 404 368 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020424|ref|YP_526251.1| ribosomal protein L11 methyltransferase [Saccharophagus degradans 2-40] # 39 352 4 310 314 160 33 7e-38 MKTKIYLLFITTLFFCAGCGNKSGGQKQESVSAAKDTYVNPLFPEGADPSALFHNGKYYY THGTEDKIMLWETSDITDMAHAVCKIVWKPQDPSNSCHLWAPEIHYINDKWYIYYAADDG NTDNHQLYVLENSSPDPMEGKFEMKGSIITNPEWNWGIQATTFEHKGVRYLAWSGWPKRR TNAETQCIYIARMKDPWTLDSPRVLISKPEYEWERQWVNPDGSRTAYPIYVNEGPQFFHS KDNKTLILYYAASGSWSPYYCVGMLTADAESDLLDPASWTKSSVPVFQQSLENEVYGPGG LSFVPSPDGTEWYMIYHARQVTNGDTGSPETRNPRIQKIGWDAHGMPDLGIPVRAGVTLP KPSGTLLK >gi|226332036|gb|ACIB01000020.1| GENE 48 79736 - 81799 1985 687 aa, chain + ## HITS:1 COG:TM0280 KEGG:ns NR:ns ## COG: TM0280 COG3533 # Protein_GI_number: 15643049 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Thermotoga maritima # 64 530 18 505 620 88 23.0 5e-17 MKKMKSLKGGLAGILFTAGLLAASCTSDQSSSAVTIVNRPDCTQTNVNYVGNRLPLKPMN FIKLPVGSIQPEGWLKKYLELQKDGLTGHLNEISAWLGKENNAWLTKGGDHGWEEVPYWL KGYGNLAYILKDQKMIDEAKVWLEGAFASQQPDGYFGPINERNGKRELWAQMIMLWCLQS YYEYSNDQRVIDLMTNYFKWQLSVPDEQFLEDYWENSRGGDNLLSVYWLYNRTGDQFLLE LAEKIHRNTADWTRPSALPNWHNVNIAQCFREPATYYMMTGDSAMLKASYNVHNLIRRTF GQVPGGMFGADENARMGSIDPRQGVETCGLVEQMASDELMLCMTGDPLWAEHCEEVAFNS YPAAVMPDFKGLRYITCPNQTVSDSKNHHPGIDNRGPFLAMNPFSSRCCQHNHAQGWPYY 
AEHLILATPDNGVAAAMYAACKATVKVGDGNEISLHEQTNYPFEETIRFTVNTPKAVSFP FYLRIPSWTEGATIFVNGKKVAANPEAGQYACINREWKDNDQVEIQLPMQLSMRTWQVNK NSVSVDYGPLTMSLKIDEDYVKKDSRATAIGDSKWQEGADASQWPTYEIYAKTPWNYALV LGKNEPLKDFKVVHKEWPADNFPFTVASTPIEVKAIGRKVPSWVIDQYDLCSELPEMDAP KGEKEEITLIPMGSARLRVSAFPNTRE >gi|226332036|gb|ACIB01000020.1| GENE 49 81838 - 83874 1494 678 aa, chain + ## HITS:1 COG:SMb20631 KEGG:ns NR:ns ## COG: SMb20631 COG3533 # Protein_GI_number: 16265291 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Sinorhizobium meliloti # 316 559 328 559 640 93 31.0 2e-18 MKKKNTFTYLLIGLGLCFSSLGSGLRADTPENYTNNRYPLVRKPLMELPLGSIKAKGWLQ EMLVRQKNGATGQMDKLYPLVMGERNGWLGGDGDQWERGPYWIDGLLPLAYILDDAQLKA KVQPWIEWALKSQREDGFFGPAKDYPGEAGIQRDNSHDWWPRMVMLKILQQYYSATNDQR VIRFMTDYFRYQLKTLPEKPLGNWTFWAEFRACDNLQAVYWLYNITGDSFLLDLGKLIHQ QSFSFVDMVNRGDLKRINTIHCVNLAQGIKEPVIYYQQEPDKMYLDAVKCAFRDIRQFHG QPQGMYGGDEALHGNNPTQGSELCSAVELMYSLEKMVEITGDIDFADHLERIAFNALPTQ ISDDFMTKQYFQQANQVMVSRHRRNFDQDHGGTDNCFGLLTGYPCCASNMHQGWPKFTQS LWYATPDGGLAVTAYAPSEVTAKVADGCTVTFSEETYYPMDDKISFTLQSMDKKRKEVNF ALQLRIPKWCRQAGISVNGQLLQHAEGGRMAIVNRNWKKGDRVELHLPMEVTASTWYENS VTIERGPLVFALKMEEKWEKKEFEEPWYGPYYYSVTPTEPWNYGLVDFNRNKANEHARVT IHTEKQSSVFPWNKENAPIEIRMKARLVPSWKLYNEMAGPQPYSFCSGGEGPETEITLIP YGCTTLRITEFPVVGASR >gi|226332036|gb|ACIB01000020.1| GENE 50 83991 - 85775 1930 594 aa, chain + ## HITS:1 COG:uidA KEGG:ns NR:ns ## COG: uidA COG3250 # Protein_GI_number: 16129575 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Escherichia coli K12 # 41 511 15 506 603 117 24.0 8e-26 MKKLLAAAMLFMLNSWSCFSADTPRAEYPRPQFEREQWVNLNGTWTFDFDFGKSGKDRRL QSAEKFDKNITVPFCPESKLSGVGYTDFIEQMWYQRNITIPSDWNEKKIFLNFGAVDYCA EIYVDGKFVQRHFGGSSSFAVDLTRYVTPGKTHNLVVFVQDDLRSGLQTGGKQCGNYYSG GCSYTRTTGIWQTVWMEAVSADGLKSVFVRPDIDQKQLVIEPEFYNESANTLEITLKDGN KTVAKKSVNCANSSVVVLPVKNMKLWSPEDPFLYDLVYQVKDAKGNVLDEVKSYAGMRKV HTANGRFYLNNQPYFQRLVLDQGFYPEGIWTAPSDEDLKNDIVLGKEAGFNGARLHQKVF 
EERYYYWADKLGYITWGESASWMLDVNKELAARNFLGEWSEVVVRDRNHPSLVTWTPFNE TWGGGPDAYVRLVRDVYNITKAIDPTRPVNDASGDNHVITDIWSVHNYEQDRAKLTEQLK MEEGKEPYRNARDKDFLAVYEGQPYMVDEFGGIPWMAEKDRKNSWGYGGMPENAEAFYKR LEGQIDAFIDSPHVTGFCYTQLTDVEQEKNGIYYYDRTPKLDMKRIKAIFEKIK >gi|226332036|gb|ACIB01000020.1| GENE 51 85991 - 87157 1018 388 aa, chain + ## HITS:1 COG:lin0763 KEGG:ns NR:ns ## COG: lin0763 COG4833 # Protein_GI_number: 16799837 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted glycosyl hydrolase # Organism: Listeria innocua # 113 381 56 334 341 82 29.0 2e-15 MRKSFIALTFLLVPACLSAQLGGKIYLQRADSLLEQVLSLYEVKKYGLLMETYPRNPKQQ ITYTANTGSEVTQQEVSFLWPYSAMVSGCVSLYKTSGNKKYKKLMDKQIKPGLDLYWDTT RQPECYQSYPAFAGQNDRYYDDNDWVAIDFCDYYAVTKNKEYLKKAIALHDYIYSGWSDE LGGGIYWCEQKKESKNTCSNAPATVLCMKLYKLTKDKKYLDQAMATYQWTRDNLRDPSDF VYWDNKNLQGKIGYAKYTYNSGQMIQAGVLLYQATGDEQYLKDAQQTAKGSYEHFLKPQP TVKGEMKFFPSSPWFNVILFRGLKALYEVDKNDTYVKAMIDNADYAWQYTRDENGLLNND WSGNRKDKFKSLLENSCMIELYSEISEL >gi|226332036|gb|ACIB01000020.1| GENE 52 87300 - 89798 1645 832 aa, chain + ## HITS:1 COG:SPBC1683.04 KEGG:ns NR:ns ## COG: SPBC1683.04 COG1472 # Protein_GI_number: 19111852 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Schizosaccharomyces pombe # 37 822 2 815 832 409 33.0 1e-113 MKATTSLATSCRRLLLWGVLTLQCLFSTAQKRFTADVEQQAEKILSQMTLDEKLSYIGGI NWMYTRPLERFGIPRLKMSDGPQGLGTHGPSTAYPCALMLAATWNEQLATEYGSSLGKDC RARGVHVLLGPAVNIYRAPMCGRNFEYMGEDPYLTSRMATGYIKGVQGQGVMATIKHFIA NNSDYDRDHISSDIDERTLNEIYFPSFRAAVQEAEVGAVMSSYNLLNGIYTTEHPWLLKD VLRQQWGFKGILMSDWGSTHHCIPAVKGGLDLEMPAGSKMQPEELKYYLRTGDITIEMID EKVRHILQTLLAFGFRETQQPDTHIPLNNPQCAQTALNVASEGLVLLKNTNQILPIRSGK VKTIAVVGKNAQGYVCGGGSGEVHPFQYVSVLDGIRKEAAERGIRVEYLDVYDYLPTIIF TDTERKQKGFRAQYFDNMNLEGTPKVEQTETKINYSWSGGTGLKEMPKEQFSVRWNGTIC PQETDEYLFTLGGDDGYRLYIDGKLIADEWHEGAFRNSTYRCMLEAGKKYDLKIEYFQKG GGAAVNFIWKQKNASNNLFVEALNRNDLVVACIGFNSDTEGEGRDRTFELPEDEAQLLQN TLQSKRPVVGIVNAGGNVEMQSWEPSLKGLLWAWYGGQEAGTAIARTLFGELNPSGKLPV 
TFEKRWEDNPTFHSYYDPDGDKHVEYTEGIFVGYRGYDKLKREVQYPFGYGLSYTRFKLS APTVGTPKTDGSVTVTCKLTNTGRTAGAEVVQLYVSNKDTTVEHPEKELKGFRKVYLEPG ETKSIEITVPAEAFSHYDTGSRRLVIDRGSHDILLGFSSRDIKAKMSVGISR >gi|226332036|gb|ACIB01000020.1| GENE 53 89880 - 90956 395 358 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|90020424|ref|YP_526251.1| ribosomal protein L11 methyltransferase [Saccharophagus degradans 2-40] # 38 353 5 313 314 156 33 7e-37 MKLKHIPLLLLVLTCCPACQNKKASATKEIPSEATYTNPLLAVGAEPWAVFHEGKYYYTQ GAENKIILWETNDITDLEHAVRKEVWIPKEISNSYHLWGPEIHRIDGKWYVYFAADDGNM DNHHIYVIENSSPNPLEGEFVMKGRIKTDKDDNWAIHASTFEHQGQRYLIWCGWPKRRIE TETQCIYIARMENPWTLSSDRVMIAEPEYEWERQWISPDGSKTAYPIHVNESPQFFESKN KDKVLIYYCASGSWTPYYCIGLLTADAGSDLTNAASWKKQDTPVFEQQPEDSVFGPGSPS FVPTPDEKEWYMLYHARKIPNDAPGATDSRSPRLQKISWDANGMPVLGKPCKEGTQIK >gi|226332036|gb|ACIB01000020.1| GENE 54 91137 - 95126 2756 1329 aa, chain - ## HITS:1 COG:VCA0709_1 KEGG:ns NR:ns ## COG: VCA0709_1 COG0642 # Protein_GI_number: 15601465 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Vibrio cholerae # 803 1045 486 726 738 130 33.0 2e-29 MRIITLLITLFAFGITPLQAFSYFSFKKYQVEDGLSHNTVWCAMQDSYGFIWLGTSDGLN CYNGRGNKVYRNVLNDKFSLENNFVQALFEEENHNIWVGTNWGLYIYDREHDRFTYFDKK TRYGVFISSEIKKIIKSESGLIWIATLGQGLFVYNPQTDVLTQNSLQTSFVWDVCENNSG HIYASSLQEGLLCFDENGKFLQSYPLLSAGNNPDNYRINCLLDIRSDIWFGAGSNLLYCL NERTGNLDCYNASHLNFGAIRCLLNYSETELLVGTDNGLYLFDLHDKNFSRIDNPSDPRS LSDQSINAMMRDAEGGIWVLTNLGGVNYLAKPTKRFDYYPPVYRDGGVAAGKVVGPFCEN AAGDIWVGTRDGLCFFDVSTHQLTAYPIGGGVDKKYDIRSLLLDGERLWIGTYAEGLKVL DLRTGHVKSYNHLQDTPNTICSDDVLAVYKDRSGDIFVGTSWGLCRYNPREDNFSTITTV GSMVSVVDILEDMYDNLWIGTSNSGVFRFNTRNGHWKHFQHERNDSTTITNNSVITLFED LKGTMWVGTNGGGLCSFDPKTETFIDFDPDNTILPNRVIYSIEQDKTGDFWISSNAGLIT INPITKQHFRQFTVNDGLQGNQFTAQSSLKTASGKLYFGGISGFNSFVPDQFMDNQYIPS VYITEIRLPYTTDERLVQDILHLEGPLYRAETITLPYEHNTFSVSFVALSYEDPLKNRYS YRLKGVDKEWVINSEQNTASYTNLPPGKYEFEVRGSNNDHKWNDQTTSLLIVVTPPWWLT TWAYCVYTLLLLGLAYYAGWHWNRHVKKKYKRRMEEYQTTKEKEVYKSKISFFINLVHEI 
RTPLSLIRLPLEKLLEDKREGRDAKYLSVIDRNVNYLLGITNQLLDFQKMENGGVQLSLK KCDINQLVSDVHSQFTSPAELKGISVMLDLPEGEIFASVDREKVCKIIVNLIGNAVKYAQ SRIDIKLVSSDEGFRVSVSDDGPGIPDVEKRKVFEAFYQVKDGKSGAVGTGIGLAFSKSL AEAHHGTLSLEDSVYGGSSFVLTLPWGEEAVSEEPEVVIPDGREAADEEQGTELSGSKFT ILLVEDNVDLLNLTRESLSTWFKVLKAQNGRQALEVLANETVDVIVSDVMMPEMNGLELT AKVKSDIEYSHIPVILLTAKTTLEAKVEGFECGADVYIEKPFSIRQLRKQIENLLKLRQA FHKMMSELSGGNGAAPISPVEYSVSQKDCELMAKVRAAVEAQLSDENFSVDTLAESLNMS RSNFYRKIKALVGMPPNDYLKTIRLNKAAELLKSGVRITEVCEKIGFSSSSYFAKCFKIQ FGVLPKDYH >gi|226332036|gb|ACIB01000020.1| GENE 55 95196 - 96509 1283 437 aa, chain - ## HITS:1 COG:YPO0063 KEGG:ns NR:ns ## COG: YPO0063 COG4942 # Protein_GI_number: 16120414 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Membrane-bound metallopeptidase # Organism: Yersinia pestis # 28 437 62 450 450 81 25.0 3e-15 MNRFLLILISCLCLTTTISAQSNKLIKELESKRGALQQQIAETESLLKNTKKDVGSQLNG LAALTGQIEERKRYIIAINNDVEAVGREIAGLERQLRGLQRDLKDKKKKYESSVQYLYKN KSVEERLMFIFSAKSLGQTYRRLRYVREYATYQRLQGEEILKKQEQVNRKKKELQQVKVA KENLLREREGEKAKLEAQEKEKREIVAGLQKKQKGLQSEISKKRREANQLNAKIDKLIAE EIERARKRAEEEARREAAARRKAAAKESKSSSTGGGTVPAKKKAEPLERFTMSKADRELS GNFVSNRGKLPMPITGPYIITSHYGQYAVEGLRNVKLDNKGIDIQGKPGAQARAIFDGKV AAVFQLNGLFNVLIRHGDYISVYCNLSSASVKSGDTVTTRQAIGPIFSDGSDNGRTVLHF QLRRERDKLNPEPWLNR >gi|226332036|gb|ACIB01000020.1| GENE 56 96506 - 97093 509 195 aa, chain - ## HITS:1 COG:no KEGG:BF0326 NR:ns ## KEGG: BF0326 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 195 1 195 195 305 100.0 8e-82 MKGSKLKQTVIKQSYLLPLLLMVVLLAGCKTSKVVKTTPVEPAYLSSKLQLTVPNKNGSM TVSGSMKMKSGERIQLSVLMPVFRSEVMRMEVTPDEVLLIDRMNKRYVRATRDELKGILP ENADFDRLEKLLFKASLPGEKKELTGRELGIPSLEKAKVRLSDFSTAEFELIPTEVSSRY TQVALEDLLKMLMKL >gi|226332036|gb|ACIB01000020.1| GENE 57 97090 - 98847 2011 585 aa, chain - ## HITS:1 COG:aq_854 KEGG:ns NR:ns ## COG: aq_854 COG0457 # Protein_GI_number: 15606205 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: 
Aquifex aeolicus # 94 571 63 528 545 102 23.0 2e-21 MKINKLLFGMLLLLLASCGSSRKVEKQSEQVAVQEINLTPEQQRKYDYFFLEASRLKVKK EYTAAFDLLQHCLAINPSGSAALYEIAQYYLFLKQVPQGQEALEKAVAYAPDNYWYSQAL AGLYQQQDQKEKAIGILEKMATRFPAKQDPLFNLLDLYNQKEDYGKVISTLNRIEEKTGK NEQITMEKFRIYLQMKDDKKAFEEIESLVNEYPMDYRYQVILGDVYMQNGKKQEAYDTYK KVLAAEPDNPMALFSLASYYEQTGQKELFEQQMDTLLLNRKVPSDTKVNVMRQFIVQSEQ EGKDSTQVIGLFDRMMQMDMDDVQIPMLYAQYLLSKGMEAQSIPVLEQVVQIDPTNKAAR MTLLGSAIRKNDYEQVIKICEPGIEATPDALEFYFYLVIAYNQAEHWDDVLEVSRKALEH VTPESDKQMVSDFYTIIGDVYHTKKLMKEAYAAYDSALVYNPSNIGALNNYAYYLSVERR DLDKAEEMSYKTVKAAPNNATYLDTYAWILFEKGNYAEARIYIDDAIKNTKPEEESSVVF EHCGDIYFMTGDVEGALKYWKKALELGTESKTLKQKIEKKKYIAE >gi|226332036|gb|ACIB01000020.1| GENE 58 98878 - 99312 488 144 aa, chain - ## HITS:1 COG:FN1028 KEGG:ns NR:ns ## COG: FN1028 COG0756 # Protein_GI_number: 19704363 # Func_class: F Nucleotide transport and metabolism # Function: dUTPase # Organism: Fusobacterium nucleatum # 1 143 4 146 146 169 58.0 2e-42 MNIQVINKSKHPLPAYATELSAGMDIRANISEPISLAPMQRCLVPTGLFIALPQGFEAQI RPRSGLALKKGITVLNSPGTIDADYRGEICIILVNLSAETFVIEDGERIAQMVIARHEQA VWKEVEVLDETERGAGGFGHTGRG >gi|226332036|gb|ACIB01000020.1| GENE 59 99414 - 100742 872 442 aa, chain + ## HITS:1 COG:sll0398 KEGG:ns NR:ns ## COG: sll0398 COG0232 # Protein_GI_number: 16331575 # Func_class: F Nucleotide transport and metabolism # Function: dGTP triphosphohydrolase # Organism: Synechocystis # 2 439 1 440 440 283 38.0 5e-76 MMDWKRLISAKRFGMEEFHEERQENRSEFQRDYDRLVFSAPFRRLQNKTQVFPLPGSIFV HNRLTHSLEVSCVGRSLGNDVSKAILARQPELQDSFLPEIGSIVSAACLAHDLGNPPFGH SGEKAISTFFSEGKGAQLQEKLSPMEWNDLTHFEGNANAFRLLTHQFEGRRKGGFVLTYS TLASIVKYPFSSSLAGNKSKFGFFTTEEEGFRRIATELGLIQLSDRPLKYARHPLVYLVE AADDICYQMMDIEDAHKLKILTTEETKELLLAYFADERQTHIRKTFDIVKDTNEQIAYLR SSVIGLLIKECTQVFLNNETEILSGTFEGALIKHISERPGKAYKHCSEVSFSKIYRSRDV LDIELAGFRVINTLLELMIDAVTSPKKAYSQLLINRVSGQYNIKAPALYERVQAVLDYIS GMTDVFALDLYRKINGNSLPAV >gi|226332036|gb|ACIB01000020.1| GENE 60 100714 - 101727 812 337 aa, chain - ## HITS:1 COG:VCA0646 KEGG:ns NR:ns ## COG: VCA0646 COG3176 
# Protein_GI_number: 15601404 # Func_class: R General function prediction only # Function: Putative hemolysin # Organism: Vibrio cholerae # 4 223 304 515 605 124 35.0 2e-28 MEEVIEPVSKELIIAELTEDKRLRMTNKSNNQIYIITYQDSPNIMREIGRLREIAFRAAG GGTGLSMDIDEYDTMENPYKQLIVWNPEAEEILGGYRYILGTDVRFDEHGAPVLATSHMF NFSDRFVKEFLPTTIELGRSFVTLEYQSTRAGSKGLFALDNLWDGLGALTVVMPNVKYFF GKVTMYPSYHRQGRDMILYFLKKHFGDKDGLITPMKPLEMETDEAELARIFCKDSFKDDY RILNGEIRKLGFNIPPLVNAYMSLSPTMRMFGTAINYGFGDVEETGILIAVDEILEEKRM RHIESFVKNDPEDCQITSGVNKVFTPKVVTPQEDCSR >gi|226332036|gb|ACIB01000020.1| GENE 61 101758 - 102576 770 272 aa, chain - ## HITS:1 COG:VCA0646 KEGG:ns NR:ns ## COG: VCA0646 COG3176 # Protein_GI_number: 15601404 # Func_class: R General function prediction only # Function: Putative hemolysin # Organism: Vibrio cholerae # 55 267 68 273 605 85 32.0 1e-16 MADDSLFLIDIDKILQTKAPKHYKYIPKFVVSYLKRIVHQEELNVFLRDSKDKVGVDFLG ACLEFLDAKLEVKGLENIPKDGLYTFVSNHPLGGQDGVSLGYILGRHFDGKVKYLVNDLL MNLHGLAPLCIPINKTGKQAKDFPKMVEAGFKSDDQLIMFPAGLCSRRQNGVIRDLDWKK TFIVKSVQFQRDVIPVHFEGRNSDFFYNLANLCKALGIKFNIAMLYLADEMLKNRHKTFT VTFGKPIPWQTFDKSKTPAEWAQYVKDIVYKL >gi|226332036|gb|ACIB01000020.1| GENE 62 102842 - 103345 360 167 aa, chain - ## HITS:1 COG:no KEGG:BF0319 NR:ns ## KEGG: BF0319 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 151 1 151 167 287 98.0 1e-76 MKTKNYLSIISILFFSFLFVSCSKEDEGDTIKPVIDLLEPEEGAILRIGSSHGVHFEMNL HDNEAIASYKINIHNNFDGHSHTRASEAGVTKPFTFERTYTDKAGQKDAHVHNHDIKIPA DATPGNYHLMVYCLDQSGNETYVVRNIVLSVEGGEEGEHHHDEHHHD >gi|226332036|gb|ACIB01000020.1| GENE 63 103342 - 105501 1481 719 aa, chain - ## HITS:1 COG:CC0214 KEGG:ns NR:ns ## COG: CC0214 COG1629 # Protein_GI_number: 16124469 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Caulobacter vibrioides # 437 714 410 687 687 95 29.0 3e-19 MRIGFFSGMLGGLCLISTLHAQEPDSLKAVSLSEVVVTESYQHLKNKNSTWRMEVVGKEF 
LREHFTGNLIQTLGTLPGVHSMDIGSGFSKPMIRGMGFNRISVVENGIKQEGQQWGADHG LELDAFNAGQVSIRKGPASLLYGSDAMGGAIELVPLPLPAGNRLFGEASLLGKSVNGTLG GSLMLGIKKDAWYTWARYSEQHFGDYRIPTDTIVYLTQRMPVYHRRLKNTAGFERDVSWA AGFRKERYVSSYWVSNVFQKTGFFPGVHGIPDVSRLQDDGDSRNIELPYSQVNHLKVSTR QSLLYDKWALTWDIGFQKNHREEWSRFHTHYDAQPVPDKDPDKELAFTLNTYSSAVKLKL FASAVWQHTAGWDVQYQRNTIAGYSFLLPAYRRFTTGAFWMTTYRPGPTLSFSGGLRYDY GKIDASAYTDPYLAIYLREQGYGDEFIRKYEWRSYPVRRHFGDYSGSLGLVWSPSGGHLL QVNVGHSFRLPGANELASNGVHHGTFRHEQGDAALASERGWQFDASYTYENGPLSVSLSP FVSWFSNYIFLRPTGEWSILPHAGQIYRYTGAEALFAGGEAAVGIDFLRHFNYRVSGEYV YTYNCDEHIPLSFSPPASLRNTLTWQYKEFSIYGEVQHIAAQHRVARNEDPTPGAQLLNA GVSANLRIGGIWAEVTLSARNLSGAKYFNHLSFYRKVEIPEPGRNFQILIKVPFKSLLK >gi|226332036|gb|ACIB01000020.1| GENE 64 105576 - 105917 107 113 aa, chain - ## HITS:1 COG:no KEGG:BF0317 NR:ns ## KEGG: BF0317 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 113 1 113 113 217 100.0 1e-55 MGVKRNTRHRIWLAWMLLMTFMPLSVVKVFHNHSEETSITCTDAHSGKSHHTCETCPICQ FMLSPFIETPSTLLTYTPLYVKWESGTFQDKKLSIAFYPHYLRGPPPVFYHIV >gi|226332036|gb|ACIB01000020.1| GENE 65 106115 - 106621 570 168 aa, chain + ## HITS:1 COG:mll3697 KEGG:ns NR:ns ## COG: mll3697 COG1595 # Protein_GI_number: 13473184 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 3 164 5 161 183 97 36.0 9e-21 MKSLSFRKDLIGVQEELLRFAYKLTTDREEANDLLQETSLKALDNEDKYTPDTNFKGWMY TIMRNIFINNYRKVVRDQTFVDQTDNLYHLSLPQESGLDSTESRYDLKEMHRIVNSLPKE YKVPFSMHVSGFKYREIAEKLDLPLGTVKSRIFFTRQRLQEELKDFRQ >gi|226332036|gb|ACIB01000020.1| GENE 66 106720 - 107346 339 208 aa, chain - ## HITS:1 COG:no KEGG:Phep_4133 NR:ns ## KEGG: Phep_4133 # Name: not_defined # Def: hypothetical protein # Organism: P.heparinus # Pathway: not_defined # 72 208 65 200 202 76 28.0 6e-13 MKKKYLVIVLLFLVANTCYIYHQHVGLKKVHSFLSELRQDTGERLGILEMQKEDRMYEIQ FNGQLIDKELTVIDTDGKQKKIGDLIIDNPKLVFRFSELNCDKCIDAQIRNLNEYVDSIE 
LQNIILLTDFQSLEYMRSFQKSNKVKFAIYNMEAEIDSVLVNIDLPYFFVLTPQEERIQC MYIPHKEIPFLTEVYLSSVKRKFFTDLE >gi|226332036|gb|ACIB01000020.1| GENE 67 107343 - 108515 622 390 aa, chain - ## HITS:1 COG:no KEGG:BF0264 NR:ns ## KEGG: BF0264 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 338 389 7 58 59 72 61.0 2e-11 MKIVNIFVALFCLTACSLSNGLKDRLILKPQSILIDPDKVKDFIDLTPLLRDSVEIIPLE TKDECLLSEIERIEFYKDRIFVLDRTRKGVYMFDQSGRFIGKIGCQGSGPGEFTSVGFFC VTGDSVLISDQHQSKWIVYNLQDKRTTEFSCGEFTYLNGFLMGRNLYLVSNYNKSQSDRF NLYKFDLSTRKIEEVLIPFEEKMDKYSTTAFTIYDSQYQDTAFLIYPFNDTIYEVSSKGT QPFYTIDFTQRNLPDDIEPINNSFRLAVAKGNFVKGLSYMQMSGNYILGRYADKGYFRYL SVDRSTLKSTVGNSFVVRDLGYLPVTSFYTIGDALVSVYSASALMQMLDVILSPDSPIKE KYRTKFESLKQITNCEGNPVLLKFQFESAE >gi|226332036|gb|ACIB01000020.1| GENE 68 108737 - 109285 311 182 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564340|ref|ZP_04841797.1| ## NR: gi|253564340|ref|ZP_04841797.1| predicted protein [Bacteroides sp. 3_2_5] # 17 182 1 166 166 331 100.0 1e-89 MNVLRNWLYLFPISLLMILLFSCNQIKEYDAKERTSRNTRVQHLIEKDITWLVGKKLNSV DSILPNELLHEKVLFLFNYHDCGTCIKTGFAVVNSIDRQKGKEYVKVIGSMISDLTSVQR FNEYHGYIYVDTKDLIRRELKYAPTPMLLRVDTDNRILEALIPTTEMDNQTIKMFIASCL KR >gi|226332036|gb|ACIB01000020.1| GENE 69 109245 - 110285 529 346 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564341|ref|ZP_04841798.1| ## NR: gi|253564341|ref|ZP_04841798.1| predicted protein [Bacteroides sp. 3_2_5] # 1 346 4 349 349 689 100.0 0 MSASFNLLDRKRLKGLLLFCPVWVLLFWGCGRTVESPEKVLKCELVSYIKSYPDSSFFSQ VGTMQYQDGKIYLLDEARRDVAVMDLEFSDFSLIGKPGDGPGELVRPVGFYVEKDTVYIL DGGTVNVKRYFDSEFISSFSVPAANDYRFFMNKDTIFLSAVTDSTFYTKSAESWQRGDLF TLVLAGNVHDFGNARRNMVLNQRHLVKDSTSLYGITSSSSLLGKYDLSSNKQVATFDLSS VSLIKDNLTYEGSQPYDPKSYYTFISDAYAMNGYLYLLCSELKDRDKGGFRVNKILCLKT EPELQLDVIYALPGEIYTSFCVTPDYIFATNYSNERIEKLALPVSD >gi|226332036|gb|ACIB01000020.1| GENE 70 110288 - 110632 248 114 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564342|ref|ZP_04841799.1| ## NR: gi|253564342|ref|ZP_04841799.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 114 1 114 114 204 100.0 1e-51 MKRKIFSLIVVLVVTISFINVWISVGSTNAKVKLRLAAIDAMAMIEAETPGDTELSLSGS CKITFTCYDSWTGAADGSITCWGAEYCKRGIKKEGIVIITETRWVECDGKRTEC >gi|226332036|gb|ACIB01000020.1| GENE 71 111078 - 111578 440 166 aa, chain + ## HITS:1 COG:no KEGG:BF0315 NR:ns ## KEGG: BF0315 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 166 1 166 166 293 100.0 1e-78 MSINYAVTKKVDKSKGIAKERYYATTRALQKKPVNSVQIANQLAERSSLQNGDVLSALTQ LSDIIAAHLKEGRTVSIDGLGNFYPSITSEAVDKPEECTANKVWVSRICFKAAPAFLNNV RKTDFVSLQLKYGRKSAKSQNGSDKETTDVIPHQQSISEDSSLSDE >gi|226332036|gb|ACIB01000020.1| GENE 72 111685 - 112161 399 158 aa, chain + ## HITS:1 COG:CAC2133 KEGG:ns NR:ns ## COG: CAC2133 COG2001 # Protein_GI_number: 15895402 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Clostridium acetobutylicum # 4 127 2 119 142 70 33.0 1e-12 MIRFLGNIEAKADAKGRVFIPAQFRRQLQSGSEDKLIMRKDVFQDCLVLYPEEVWNEELD ELRQRLNKWNANHQLIFRQFVSDVEIITMDGNGRILIPKRYLQITGIQSDVRFIGVDNKI EIWAKERAEKLFMEPKAFGAALEEIMKEERRTTNNELK >gi|226332036|gb|ACIB01000020.1| GENE 73 112158 - 113072 865 304 aa, chain + ## HITS:1 COG:BS_ylxA KEGG:ns NR:ns ## COG: BS_ylxA COG0275 # Protein_GI_number: 16078578 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis # Organism: Bacillus subtilis # 9 304 4 310 311 234 43.0 1e-61 MKEEETTYHVPVLLKESVDAMNISPDGTYVDVTFGGGGHSREILSRLGDGGRLLGFDQDE DAERNIVNDPHFTFVRSNFRYLHNFLRYHDIGEVDAILADLGVSSHHFDDSERGFSFRFD GKLDMRMNKRAGITAADVVNTYEEERLADIFYLYGELKNSRKLASVIVKARTGQKIETIG EFLEIIKPLFGREREKKELAKVFQALRIEVNQEMEALKEMLMAATEALKPGGRLVVITYH SLEDRMVKNIMKTGNVEGKATQDFFGNLQTPFRLVNNKVIVPDEDEITRNPRSRSAKLRI AEKK >gi|226332036|gb|ACIB01000020.1| GENE 74 113101 - 113442 266 113 aa, chain + ## HITS:1 COG:no KEGG:BF0312 NR:ns ## KEGG: BF0312 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 
1 113 1 113 113 164 100.0 6e-40 MEDKEAKKKKSNSLKSILGGDILATDFFRRQTKLLVLIMVLIIFYIHNRYASQQQQIEID KLKKELIDIKYDALTRSSELMEKSRQSRIEDYISTKESDLQTSTHPPYLISTK >gi|226332036|gb|ACIB01000020.1| GENE 75 113446 - 115569 2353 707 aa, chain + ## HITS:1 COG:CAC2130 KEGG:ns NR:ns ## COG: CAC2130 COG0768 # Protein_GI_number: 15895399 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell division protein FtsI/penicillin-binding protein 2 # Organism: Clostridium acetobutylicum # 3 706 7 723 729 166 24.0 2e-40 MAVNKKNIMTRYFFVILLMGLIGVAIVVKAGITMFAERQYWQDVADRFVKENVTVKPNRG NIISSDGKLMASSLPEYRIYMDFKAGGVKKDTMLMNHLDEICEGLHKIFPDKSASEFKTH LKKGRKQGSRNYLIYPKRISYIQYKEAKRLPVFNLNKYKGGFHELAYNQRKKPFGSLAAR TLGDLYADTAQGAKNGIELAFDSILKGHDGITHRQKVMNKYLNIVDIPPVDGCDLLSTID VGMQDICEKALTDKLKELNASVGVAVLMEVATGEVKAIVNMTKAGDGNYYEMRNNAISDM LEPGSTFKTASIMVALEDGKITPEDGIDTGNGIKMMHGRPMKDWNWYKGGYGYLTVTQIL EVSSNIGTSSIIEKYYGSNPQKFVDGLKRMSIDQPLQLQIAGEGKPNIKGPKERYFAKTT LPWMSIGYETQVPPMNILTFYNAIANNGVMVRPKFVKAAIKNGEIVKEYPTEIINPKICS ERTLKQIQEILYKVVHEGLAAPAGSKQFAVSGKTGTAQISQGAAGYKSGRVNYLVNFCGY FPSEAPKYSCIVSIQKPGLPASGGLMAGSVFSKIAERVYAKDLRLDIRNAIDTNTVVIPD VKAGEMIEARQVLEGLNIQTQAEFKAKKNKEVWGHAQAAPKAVILQGKEQLRNFVPSVIG MGAKDAVYLLESKGLKVTLSGVGKVKSQSLPQGTTIKKGQTISIHLN >gi|226332036|gb|ACIB01000020.1| GENE 76 115587 - 117044 1596 485 aa, chain + ## HITS:1 COG:CAC2129 KEGG:ns NR:ns ## COG: CAC2129 COG0769 # Protein_GI_number: 15895398 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl tripeptide synthase # Organism: Clostridium acetobutylicum # 1 479 1 476 482 344 40.0 3e-94 MKLKEILTSIQPVKITGNQDIEITGVDIDSRQVESGHLFMAMHGTQTDGHAYIPAAVEKG ATAILCEELPAELAEGVTYIQVADSEDAVGKAATTFYGNPSSKLELVGVTGTNGKTTIAT LLYNTFRYFGYKVGLISTVCNYIDDEAIPTEHTTPDPITLNRLLGRMADEGCKYVFMEVS SHSIAQKRISGLKFAGGIFTNLTRDHLDYHKTVENYLKAKKKFFDDMPKNSFSLTNLDDK NGLVMTQNTKSKVYTYSLRSLSDFKGRVLESHFEGMLLDFNNHELAVQFIGKFNASNLLA VFGAAVLLGKKEEDVLVALSTLHPVAGRFDAIRSPQGYTAIVDYAHTPDALVNVLNAIHG 
VLEGKGKVITVVGAGGNRDKGKRPIMAKEAARASDRVIITSDNPRFEEPQDIINDMLAGL DTEDKKKTLSIADRKEAIRTACMLAEKGDVILVAGKGHENYQDIKGVKHHFDDKEVLKEI FSLTV >gi|226332036|gb|ACIB01000020.1| GENE 77 117130 - 118398 1081 422 aa, chain + ## HITS:1 COG:NMA2066 KEGG:ns NR:ns ## COG: NMA2066 COG0472 # Protein_GI_number: 15794944 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Neisseria meningitidis Z2491 # 5 422 1 360 360 213 35.0 7e-55 MLYYLFEWLHKLNFPGAGMFGYTSFRALMAIILALLISSIWGDKFINLLKRKQITETQRD AKIDPFGVNKVGVPSMGGVIIIVAILIPCLLLGKLHNIYMILMLITTVWLGSLGFADDYI KIFKKDKEGLHGKFKIIGQVGLGLIVGLTLYLSPDVVIRENIEVQKSENEIEVIHGTHDL KSTQTTIPFFKSNNLDYADLVGFMGEHAQTAGWILFVIITIFVVTAVSNGANLNDGMDGM AAGNSAIIGLTLGILAYVSSHIEFAGYLNIMYIPGSEELVIFICAFIGALIGFLWYNAYP AQVFMGDTGSLTIGGIIAVFAIIIHKELLIPILCGIFLVENLSVLLQRFYYKAGKRKGVK QRLFKRAPIHDHFRTSMSLVEPGCSVKFTKPDQLFHESKITVRFWIVTIVLAAITIITLK IR >gi|226332036|gb|ACIB01000020.1| GENE 78 118538 - 119872 1205 444 aa, chain + ## HITS:1 COG:BH2567 KEGG:ns NR:ns ## COG: BH2567 COG0771 # Protein_GI_number: 15615130 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramoylalanine-D-glutamate ligase # Organism: Bacillus halodurans # 2 437 10 441 450 241 37.0 1e-63 MKRIVVLGAGESGAGAAVLAKVKGFDTFVSDMSAIKDKYKTLLDGHGIAWEEGRHTEEQI LSADEVVKSPGIPNDAPLILKLREQGTPIISEIEFAGRYTDAKMICITGSNGKTTTTSLI YHIFKSAGLNVGLAGNIGKSLALQVAEEKHDYYVIELSSFQLDNMYNFRADIAVLMNITP DHLDRYDHCMQNYINAKFRITQNQTSEDAFIFWNDDPIIKRELDKHGIRAHLYPFSAIKE EGSIAYVEDHEVVITEPIAFNMEQEQLALTGQHNLYNSLAAGISANLAGITKEDIRKALS DFQGVEHRLEKVARVRGIDFINDSKATNVNSCWYALQSMTTKTVLILGGKDKGNDYTEIE ELVREKCSALVYLGLHNEKLHEFFDRLGLPVAEVQTGMKDAVEAAYKLAKKGETVLLSPC CASFDLFKSYEDRGEQFKKYVREL >gi|226332036|gb|ACIB01000020.1| GENE 79 119931 - 121226 1165 431 aa, chain + ## HITS:1 COG:PA4413 KEGG:ns NR:ns ## COG: PA4413 COG0772 # Protein_GI_number: 15599609 # Func_class: D Cell cycle control, cell division, chromosome 
partitioning # Function: Bacterial cell division membrane protein # Organism: Pseudomonas aeruginosa # 27 381 37 375 399 142 30.0 1e-33 MDLLKNIFKGDKVIWIIFLCLCLISIIEVFSAASTLTYKSGDHWGPITQHSIILMVGAVV VVLMHNIPYKWFQVFPVFLYPISVVLLAFVTLMGVITGDRVNGAARWMSFMGLQFQPSEL AKMAVIIAVSFILSKKQDDEGANPKAFKYIMILTGLVCMLIAPENLSTAMLLFGVVVLMM FIGRVAFKKLAMLLGGLALVGCLGAVFLLAIPKDTDIPFLHRFDTWKSRITNFTEKEEVP AAKFDIDKDAQIAHARIAIATSNVIGKAPGNSIQRDFLSQAFSDFIFAIIIEELGLVGGA FVVILYIWLLVRTGRIAQKCERTFPAFLVMGIALMLVSQAILNMMVAVGLFPVTGQPLPL ISKGGTSTLINCAYIGMILSVSRYTAYLEEKKENPAPLLTQSEGNEAIASEAQTAAEPTA EVLNSDAKFEE >gi|226332036|gb|ACIB01000020.1| GENE 80 121228 - 122370 1036 380 aa, chain + ## HITS:1 COG:BH2565 KEGG:ns NR:ns ## COG: BH2565 COG0707 # Protein_GI_number: 15615128 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase # Organism: Bacillus halodurans # 14 375 1 363 363 254 36.0 2e-67 MNKENNKEGQGDALRVIISGGGTGGHIFPAVSIANAIKELRPDAQILFVGAEGRMEMQRV PDAGYQIIGLPVAGFDRKHLWKNVAVLLKLVRSQWKARNIIRQFRPQVAVGVGGYASGPT LKMAGMMGVPTLIQEQNSYAGVTNKLLAQKARRICVAYDGMEKFFPANKIIMTGNPVRQN LLAEKPEREQAIRSFGLNPEKKTILILGGSLGARTINNTLIAGLQLIRRTTDVQFIWQTG KIYHQQVTEAVKAAGEIPNLFVTDFIKDMAAAYAAADLVISRAGAGSISEFCLLNKPVIL VPSPNVAEDHQTKNALALVNKQAAIYVKDAEAENKLLPVALETIANAEKLSELSENIAHL ALPDSAVVIAKEVIKLAQQS >gi|226332036|gb|ACIB01000020.1| GENE 81 122367 - 123761 1221 464 aa, chain + ## HITS:1 COG:CAC3225 KEGG:ns NR:ns ## COG: CAC3225 COG0773 # Protein_GI_number: 15896472 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramate-alanine ligase # Organism: Clostridium acetobutylicum # 7 444 11 448 458 233 34.0 5e-61 MNIETIQSVYFVGAGGIGMSALVRYFLSKGKVVAGYDRTPSELTQHLIEEGAQIHYEENI DLIPEACKDKATTLVVLTPAVPQEHAELTYFRDNGFEIQKRAQVLGTITRSSKGLCVAGT HGKTTTSTMTAHLFHQSHVGCTAFLGGISKNYGTNLLLSSTSPYTVIEADEFDRSFHWLS PYMSVITATDPDHLDIYGTEQAYLESFEHYTTLIQPGGALIIRKGISLQPKVKEGVKMYT YSRDEGDFHAENIRIGNGEIFIDFVGPDIRIDNIQLGVPVSINIENGVAAMALAHLNGVT 
PEEIKQGMASFRGVDRRFDFKIKNNRIVFLSDYAHHPSEIKQSVMSMRELYRDKKITAVF QPHLYTRTRDFYKDFADSLSLLDEVILVDIYPAREQPIPGVSSRLIYDNLRPGIEKSMCK KEEILDVLKAKHIEVLITLGAGDIDNYVPDICDLLSRRMVPSDN >gi|226332036|gb|ACIB01000020.1| GENE 82 123781 - 124533 269 250 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163752975|ref|ZP_02160099.1| 30S ribosomal protein S12 [Kordia algicida OT-1] # 1 248 3 239 239 108 26 3e-22 INFLMIKRILLTIVMLLLIAYLVAAVTVFNDKPAHQVCRDMELVIKDTLNAGFVTKNEVA AILQKKGIYPVGKKMDRVHTKTLEKELDKHPLINEAQCYKTPNGKICVEVTQRVPILHIM SSNGENYYLDNKGKMMPPDAKCVAHRAIVTGNVEKSFAMKDLYKFGVFLQNNPFWEAQIV QINVLPGKEIELVPRVGNHIIYLGKLEHFEDKLKRLKTFYEKGLNQVGWNKYSRISLEFG NQIICTKKKQ >gi|226332036|gb|ACIB01000020.1| GENE 83 124610 - 126040 1281 476 aa, chain + ## HITS:1 COG:RSc2840 KEGG:ns NR:ns ## COG: RSc2840 COG0849 # Protein_GI_number: 17547559 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Actin-like ATPase involved in cell division # Organism: Ralstonia solanacearum # 5 334 7 339 410 148 27.0 3e-35 MATTDFIAAIELGSSKIAGIAGKKNSDGSIQVLAYAREDSSSFIRKGVIYNLDKTAQSLT SIINKLEGALNNSIAKIYVGIGGQSLRTVRNVVSRDLEEETIISQELVDSICDENLEIPL IDMDILDVAPQEYKIGNNLQADPVGVAGSHIEGRFLNIVARASLKKNLERCFEQAKIEIA DLLISPLVTADAVLTESERRSGCALIDFGADTSTISIYKNNILRFLTVLPLGGNSITHDL VSLQMEEEEAERLKIRYGNAFYEEEEGEEPATCQLEDGNRTIELGKLNNIIEARTEEIIA NVWNQIQLSGYDDKLLAGLIITGGAANLKDLDEVLRKRSKIEKVRNARFVRNTIHADEDV VKKDGTQNTLFGLLIAGNENCCLLETPAPQPHIQPQPQPEPVNMFEEDESLKEQEAAARA AKKKKEEEEKKRKEEEKQRKLEEKKRREEERRNKPNWFKSTFDKLSNEIFSDEDMK >gi|226332036|gb|ACIB01000020.1| GENE 84 126067 - 127377 1653 436 aa, chain + ## HITS:1 COG:TM0836 KEGG:ns NR:ns ## COG: TM0836 COG0206 # Protein_GI_number: 15643599 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Cell division GTPase # Organism: Thermotoga maritima # 17 324 24 330 351 221 47.0 3e-57 MDEIVQFDFPTDSPKIIKVIGVGGGGGNAVNHMYREGIHDVTFVLCNTDNQALAESPVPV KLQLGRSITQGLGAGNRPERARDAAEESIEDIKTLLNDGTKMVFITAGMGGGTGTGAAPV 
IARIAKEMDILTVGIVTIPFIFEGEKKIIQALDGVERIAQHVDALLVINNERLREIYSDL TFMNAFGKADDTLSIAAKSIAEIITMRGTVNLDFADVKTILKDGGVAIMSTGFGEGENRV TKAIDDALHSPLLNNNDIFNAKKVMLNVSFCPASELMMEEMNEVHEFMSKFREGVEVIWG VAMDNSLDTKVKITVLATGFGVEDVPGMDDLHEKRSQEEEERQLQLEEEKEKNKERIRKA YGESASGIGTRNLRKRRHIYLFNAEDLDNDDIIAMVEDSPTYLRDKTTLGKIKAKAALEE EIATEEAIDDSGVITF >gi|226332036|gb|ACIB01000020.1| GENE 85 127452 - 127901 270 149 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|42519249|ref|NP_965179.1| 30S ribosomal protein S21 [Lactobacillus johnsonii NCC 533] # 1 148 1 146 147 108 41 2e-22 MDLFERVSEDIKNAMKAKDKVALETLRNVKKFFLEAKTAPGANDTLTDADALKIVQKLVK QGKDAAEIYIGQGRQDLADAELAQVQVMETYLPKQMSAEELEAALKEIIAEVGATSGKDM GKVMGVASKKLAGLAEGRAISAKVKELLG >gi|226332036|gb|ACIB01000020.1| GENE 86 128011 - 128739 471 242 aa, chain - ## HITS:1 COG:no KEGG:BF0300 NR:ns ## KEGG: BF0300 # Name: not_defined # Def: DNA repair protein # Organism: B.fragilis # Pathway: Homologous recombination [PATH:bfr03440] # 1 242 1 242 242 464 100.0 1e-129 MLQKTVGIVLHVLKYNDTSNIVEMYTELSGRASFLVTVPRSKKATVKSVLFQPLALIEFE ADYRPNTSLFRIKEAKSFSPFTSIPYDPFKSAIALFLAEFLYRAIREEAENRPLFAYLQH SILWLDTCKISFANFHLVFLMRLSRFLGLYPNLDDYHAGDYFDMLNATFTSVRPQLHSSY IQPDEAGRLLQLMRMNYETMHLFGMNRTERARCLAIINEYYRLHLPDFPILKSLDVLKEL FD >gi|226332036|gb|ACIB01000020.1| GENE 87 129149 - 129403 419 84 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53711589|ref|YP_097581.1| 30S ribosomal protein S20 [Bacteroides fragilis YCH46] # 1 84 1 84 84 166 100 1e-39 MANHKSSIKRIRQEETRRLRNRYYGKTMRNAVRKLRSTTDKAEATAMYPGIVKMVDKLAK TNVIHKNKANNLKSKLAIYINKLA >gi|226332036|gb|ACIB01000020.1| GENE 88 129584 - 131542 2256 652 aa, chain + ## HITS:1 COG:CAC0006 KEGG:ns NR:ns ## COG: CAC0006 COG0187 # Protein_GI_number: 15893304 # Func_class: L Replication, recombination and repair # Function: Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit # Organism: Clostridium acetobutylicum # 10 646 5 630 637 697 56.0 0 MSEEQNPTNNGSYSADSIQVLEGLEAVRKRPAMYIGDISVKGLHHLVYEIVDNSIDEALA 
GYCDHIEVTINEDNSITVQDNGRGIPVDFHEKEQKSALEVAMTVLHAGGKFDKGSYKVSG GLHGVGMSCVNALSTHMTTQVFRNGKIYQQEYEIGKPLYPVKEVGIADHTGTKQQFWPDD SIFTETIYDYKILASRLRELAYLNAGLRISLTDRRVVNEDGSFKHETFYSEEGLREFVRF IESSREHLINDVIYLNTEKQNIPIEVAIMYNTGFSENIHSYVNNINTIEGGTHLAGFRRA LTRTLKKYAEDSKMLEKVKVEISGDDFREGLTAVISVKVAEPQFEGQTKTKLGNNEVMGA VDQAVGEVLNYYLEEHPKEAKAIVDKVILAATARHAARKAREMVQRKSPMSGGGLPGKLA DCSDKDPQKCELFLVEGDSAGGTAKQGRNRAFQAILPLRGKILNVEKAMYHKALESEEIR NIYTALGVTIGTEEDSKAANIDKLRYHKIIIMTDADVDGSHIDTLIMTFFFRYMPQIIQN GYLYIATPPLYLCKKGKIEEYCWTDAQRQKFIDTYGGGSENAIHTQRYKGLGEMNAQQLW ETTMDPENRMLKQVNIDNAAEADYIFSMLMGEDVGPRREFIEENATYANIDA >gi|226332036|gb|ACIB01000020.1| GENE 89 131643 - 134609 2347 988 aa, chain - ## HITS:1 COG:no KEGG:BF0296 NR:ns ## KEGG: BF0296 # Name: not_defined # Def: outer membrane assembly protein # Organism: B.fragilis # Pathway: not_defined # 1 988 1 988 988 1886 99.0 0 MNRQVKKTLKISGITLGTVLLVLLVAIAFVINFIVTPKKLTPVVLDAANQTLNAHLDMES VELTFFSTFPQFGLKVKNGSLVSKALNDSSWCKTDSLLSFKECVLTVNPIAYLTENRIVV HNLSLEEVAVYAYRNKTGKANWEVTRASVDTIPADTASTDFNSEIDIRNIELKHANLVFD DRNTDIYSRIDDANLKLRLSLTKGISTLGLKFDNKNILFWQQGELLVNKIATSLRTDIMV DRQTAVWKLKDTELDVNGIRLDVNGAFRRDTVAKTIGMDLEYGLHAPSMETVLRMIPKSY VKDSKVSAKGEVTVSGRVRGVYGDKKLPAVSLKIGIKEASAQYKGLPYGIDEVTADFDAY VDLMRHQPSYLNLKIFHFKGAHTEVLADAKVDDLLDDPLITFHTKSTVDLDALAKTFPLQ ESVTITGKLDADMGMKCRLSALKKQDIGRMKLGGKLELKDFELKDTAKDFDFLGNATFRF RDNETLQAQMDVRKLVLRSRFLSSDIERLVANVSSTNPQDTNRIVSLQCDMEVSKLRASM GDSIKLYSARAKAQAALGPQEVDVTKPAIDFSLRADSLFFSAAGTRMAMNVAGIKMKADK LNDSLWMPKGIVGFNRLRFRTPEFGLPIRMSKTAVTVDGPKITLKNASVRIGRSNMTATG DMMGVYRAMTKGEKLTAHLSLTSDLIDCNQLINSLSFPEDTTEVLTDSVPSEMKLFVIPR NIDFELQTDLKKVIFEKMLFENVHGAVDIKNQAIHLEDLSMRALDADMKAVMVYKAGSPR GGYAGFDFKIRNINIAKLVDFVPALDTIVPMLRSFKGRVMFDVAADARLDSAMNIRIPTL RSAIHIKGDSLVLMDGETFAEISKMLMFKNKKENVFDSISVNVTVHDGNVTVYPFLVEID RYKAAVGGEQGLDMNFNYHISILKSPLPFKAGVNISGNLDKMKFRIGKAKYKDAVTPAAV HRVDSTRMNMGNEIVNRFRRVVLGRQPR >gi|226332036|gb|ACIB01000020.1| GENE 90 134606 - 135187 646 193 aa, chain - ## HITS:1 COG:no KEGG:BF0295 NR:ns ## KEGG: BF0295 
# Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 193 1 193 193 360 99.0 1e-98 MIQIDNVVVSLDVLREKFVCNLEACKGECCIEGDAGAPVELEEVEKLEEVLPVVWDELSP EARAVIDKQGVVYTDRDGDLVTSIVNGKDCVFTCYDEKGYCYCAIEKAYRGGKTDFYKPV SCHLYPIRVGNYGPYQAVNYHRWDVCKAAVLLGKKENVPVYRFLKEPLIRKFGKEWYDEL EIAVKELQDRGMI >gi|226332036|gb|ACIB01000020.1| GENE 91 135249 - 135389 100 46 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MICYAISHYKIINFIANYLFVNILIYMIIYQTQVALLSDLNISLRV >gi|226332036|gb|ACIB01000020.1| GENE 92 135542 - 137485 1252 647 aa, chain - ## HITS:1 COG:MA1149_2 KEGG:ns NR:ns ## COG: MA1149_2 COG0642 # Protein_GI_number: 20090015 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Methanosarcina acetivorans str.C2A # 425 568 2 147 279 74 29.0 5e-13 MKRLVVLLFGLVGAALFMSAANSGYRVLRDSIFHVYRSMPADTTRTVFAKEAAMGHVKES WALELLDSALVFAREIKDVEGELELQYGIFRYYTFRMDGENMEKTCATLREACYRYKKYD NYFLALHYVLQLKGSEGDTEYAILESRKMREEAVRLHADRGVFLSYITEGKSYVFARNTE KAIEQYLKALEIEGTTFGDKLMVHGYLASSYYLKDKYKEALGELKAQRQLIDGVIKKKPS MLGVYRSTLLTIELMYCKIYLGMVDADPLWIHLNEAAKFYDDDCFSATAVNYHFSWAGYY YLRQDWERCFPEFERTLAAFKGTQPMYEIEIRRIMGDAYVDAGRYEEAARTYKTAAVMCD SVNKATLRMNEETVEANYRIRKALLDKELGEKRFLQVAVVGLSLFVVLLVWGVIRLIRIR GELVKSQKEMAESYAVVVATDKMKEVFLRNITDEIRIPLDTVVELSDRLCRETNLKQEKQ QEYSATIKKCASKLIGLIFNVLDLARLESGMMKFVVEEYDVVQLCTDARLMVEMQTENRT KVDFHTEPDMLLIDVDTNRFMKMLASVLKYPEESEGLFRVKFILSCPSEDYLQIKVVNSP IFMTAESEKEFDVLHTINRLYLETFQGSYQLLEESGERMIIITYPVS >gi|226332036|gb|ACIB01000020.1| GENE 93 137482 - 138576 593 364 aa, chain - ## HITS:1 COG:CC2501_1 KEGG:ns NR:ns ## COG: CC2501_1 COG0642 # Protein_GI_number: 16126740 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Caulobacter vibrioides # 118 354 244 486 538 133 38.0 7e-31 MKQLILSGMFVVCGLSVSLASEKWAVDSINWHRAEQKMQEKDFQDAALTYKELITKGDSL FVDYAGRHVEDMREQYSIDELDLQNGMQQKKIWKLVFITILCLVVLLFVGLLYLRRAERK 
LLFSREELQKAKRLAEESVRNKSVFLSNMSHEIKTPLNALAGFSEILITPGIDDEVRAQC NDVIRLNSDLLLHLVNDVVDVSCLDVANMRFSVAPHEVVALCRNVVEMLRNIKQTSAEMI FETELSALEMETDPCRLQQVLINLLVNATKFTKEGYITLTLRINEVGVPEFMLTDTGCGI PLENQEAVFGRFEKLNEGIQGTGLGLSICKLIINRMGGDIRVDSTYSKGARFIFTHPLKQ EENR >gi|226332036|gb|ACIB01000020.1| GENE 94 138725 - 140239 1739 504 aa, chain - ## HITS:1 COG:MA4007 KEGG:ns NR:ns ## COG: MA4007 COG0696 # Protein_GI_number: 20092802 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoglyceromutase # Organism: Methanosarcina acetivorans str.C2A # 6 503 15 518 521 498 50.0 1e-141 MSKKALLMILDGWGLGDHGKDDVIFNTATPYWDYLMETYPHSQLQASGENVGLPDGQMGN SEVGHLNIGAGRVVYQDLVKINLSCRDNSILKNPEIVSAFSYAKENGKNVHFMGLTSDGG VHSSLDHLFKLCDIAKEYNIENTFVHCFMDGRDTDPKSGKGFIEQLEAHCAKSAGKVASI IGRYYAMDRDKRWERVKEAYDLLVNGIGKKATDMVQAMQESYDEGVTDEFIKPIVNAGVD GTIREGDVVIFFNYRNDRAKELTVVLTQQDMPEAGMHTIPGLQYYCMTPYDASFKGVHIL FDKENVANTLGEYLAANGKKQLHIAETEKYAHVTFFFNGGRETPYDNEDRILVPSPKVAT YDLKPEMSAYEVKDKLVAAINENKYDFIVVNYANGDMVGHTGIYEAIEKAVVAVDACVKD TIEAAKAQGYEAIIIADHGNADHALNEDGTPNTAHSLNPVPCVYVTENKEAKVADGRLAD VAPTILHILDMVQPAEMTGCNLIK >gi|226332036|gb|ACIB01000020.1| GENE 95 140525 - 141451 763 308 aa, chain + ## HITS:1 COG:CAC0294 KEGG:ns NR:ns ## COG: CAC0294 COG0598 # Protein_GI_number: 15893586 # Func_class: P Inorganic ion transport and metabolism # Function: Mg2+ and Co2+ transporters # Organism: Clostridium acetobutylicum # 19 308 24 315 315 182 36.0 7e-46 MRTYLYCEAGFVEKAQWLPNSWVNVVCPDSSDFKFLTETLKVPESFLNDIADTDERPRTE TEGNWLLTILRIPVQNAQSSIPYTTVPIGIITNNEIIVSVCYHQTDMIPDFIEHTRRKGI EVRNKLDLIFRLIYSSAVWFLKYLKQINIDITAAEKELERSIRNEDLLRLMKLQKTLVYF NTSIRGNEVMIGKLKTIFQDTDYLDEELVEDVIIELKQAFNTVNIYSDILTGTMDAFASI ISNNVNAIMKRMTSLSITLMIPTLIASFYGMNVDIHLEEMPHAFLLIILVSVFLSALSFV IFRKIKWF >gi|226332036|gb|ACIB01000020.1| GENE 96 141459 - 142064 653 201 aa, chain - ## HITS:1 COG:NMA0075 KEGG:ns NR:ns ## COG: NMA0075 COG0164 # Protein_GI_number: 15793104 # Func_class: L Replication, recombination and repair # Function: Ribonuclease HII # 
Organism: Neisseria meningitidis Z2491 # 10 196 3 193 194 180 51.0 1e-45 MLLPWLNEELIEAGCDEAGRGCLAGAVYAAAVILPKDFENELLNDSKQLSEKQRYALREV IERDAVAWAVGIVSPEEIDKINILNASFLAMHRAVDRLKTRPQHLLIDGNRFKKYPDIPH TTVIKGDGKYLSIAAASILAKTYRDDYMNKLHQEFPCYDWEHNKGYPTKKHRAAIAGHGT TPYHRMTFNLLGDGQLELFSK >gi|226332036|gb|ACIB01000020.1| GENE 97 142280 - 144484 2436 734 aa, chain - ## HITS:1 COG:MA3879 KEGG:ns NR:ns ## COG: MA3879 COG3808 # Protein_GI_number: 20092675 # Func_class: C Energy production and conversion # Function: Inorganic pyrophosphatase # Organism: Methanosarcina acetivorans str.C2A # 4 727 12 683 685 535 49.0 1e-151 MDSILFWLVPFASVLALCFALYFHKQMMKESEGTPQMIKIAAAVRRGAMSYLKQQYKIVG WVFLGLVILFSVMAYGFQVQNAWVPIAFLTGGFFSGLSGFLGMKTATYASARTANAARTS LNAGLRIAFRSGAVMGLVVVGLGLLDISFWYLLLNWAIPADVLTPTHKLCIITTTMLTFG MGASTQALFARVGGGIYTKAADVGADLVGKVEAGIPEDDPRNPATIADNVGDNVGDVAGM GADLYESYCGSILATAALGAAAFIHTGDTVMQFKAVIAPMLIAAIGIILSIIGIFSVRTK ENATMKDLLGSLAWGTNLSSALIVAATFFILWLLQLDNWMWISCAVVVGLVVGIVIGRST EYYTSQSYRPTQKLSESGKTGPATVIISGIGLGMLSTAIPVVAVVIGIIASYLLASGFDF NNVGMGLYGIGIAAVGMLSTLGITLATDAYGPIADNAGGNAEMSSLGKEVRKRTDALDSL GNTTAATGKGFAIGSAALTGLALLASYIEEIRIGLTRLGNLELTFPNGDTISTANATFVD FMNYYEVNLMNPKVLSGMFLGSMMAFLFCGLTMNAVGRAAGHMVDEVRRQFREMKGILTG ETEPDYERCVAISTKGAQREMVVPSLIAIIAPILTGLIFGVPGVLGLLIGGLSSGFVLAI FMANAGGAWDNAKKYVEEGNFGGKGSEVHKATVVGDTVGDPFKDTSGPSLNILIKLMSMV AIVMAGLTVAWSLF >gi|226332036|gb|ACIB01000020.1| GENE 98 144608 - 145162 398 184 aa, chain - ## HITS:1 COG:no KEGG:BF0288 NR:ns ## KEGG: BF0288 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 184 1 184 184 357 98.0 1e-97 MKTGMKLIQGALFVLLFLFAACTPHRRPIPDEMIGEHRIHRADGFMTQDYRLSIIPVNDS VYDIKLWERKSSSESPEKVYYDGLRFHYRDTVHIFTKRVTSGTETYRVNRRRVTRDKNPR IWYTLKLNSTGFVMSDSLVERKGASDEQHRKLLYKRYGRLTFREGKIIFLNSRCPAFYVK KEEK >gi|226332036|gb|ACIB01000020.1| GENE 99 145406 - 147565 1924 719 aa, chain + ## HITS:1 COG:BH2223 KEGG:ns NR:ns ## COG: BH2223 COG3345 # Protein_GI_number: 15614786 # Func_class: G Carbohydrate 
transport and metabolism # Function: Alpha-galactosidase # Organism: Bacillus halodurans # 72 701 77 712 748 333 31.0 9e-91 MKKLLATLLILVACIHVNAQESRQIRISTDRTDLILEVAPDGRLYQSYLGDRLLNEQDLK NLSGSSRGWEVYPGSGGEDYFEPAVAITNNDGNLSTILRYVSSEQKAVEGGTETIIRMKD DQYPVDVTLHYVAYPKQNVIKTWSEIKHQQKKPVVLWRYASTMLYFSNQKYYLTEFSSDW AKEVQMSTQQLQPGKKILDTKLGSRAAMHMQPFFELGLEQPAQEHQGQVVLGTIGWTGNY QFTFEVDNEGNLRIIPAINPYASDYQLKANETFTTPEFIFTLSNNGTGEASRNLHNWARN YQLKDGKGDRMTLLNNWENTYFTFDEELLGKLMKEAKHLGVDMFLLDDGWFGNKHPRNDD HAGLGDWEAMKSKLPGGIPALVEKAKEAGVKFGIWIEPEMVNPKSDLFETHPEWAIHYPN RETYYFRNQLVLDLSNPKVQDFVFGVVDKIMTENPDVAFFKWDCNSPITNIYSPYLKDKQ GQLYIDHVRGIYNVLKRVKEKYPNAPMMLCSGGGARCDYEALKYFTEFWCSDNTDPVERL FIQWGFSQFFPAKAMCAHVTSWNSRTSVKFRTDVASMCKLGFDIGLKDMKADELTYCQEA VANYKRLKPVILDGDQYRLVSPYDGNHMAVMYTAPDASKAVLFTYDIHPRFGEKLLPIKL RGLDAQKMYRVKEINLMPGRKSNLSGNEKIFSGDYLMKIGLNAFTTSQTNSRVIELVAE >gi|226332036|gb|ACIB01000020.1| GENE 100 147775 - 148338 453 187 aa, chain + ## HITS:1 COG:RSc1055 KEGG:ns NR:ns ## COG: RSc1055 COG1595 # Protein_GI_number: 17545774 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Ralstonia solanacearum # 15 186 11 188 199 68 23.0 7e-12 MVIITLYMNNNIEYISKIKKGEEASFRHFVNSYSKDLFYYAQCFVRSKETAEEVVSDVFL DVWRHREEIDEIKNIKAWLLTLTHNKAISYLRKAENSSEIASWEEIDDFQIIGNLQTPDE EMISKEEIAQINSLIQTLPPKCKVVFALAKIERLPYKEIADMLNISVKTINVHVAKALEI ISNGLKK >gi|226332036|gb|ACIB01000020.1| GENE 101 148426 - 149574 764 382 aa, chain + ## HITS:1 COG:PA3900 KEGG:ns NR:ns ## COG: PA3900 COG3712 # Protein_GI_number: 15599095 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 182 371 119 302 317 67 24.0 3e-11 MRKNKFKSFASRLNKDGDHPEKISFESPEEQAEYDKLDFLWNRCLPEETGEPDIWAKVQA KINADNTPVRLALKSNKTARLFSILKYSAVAASVALLIGAGCFLLLNDEERHDLNKIAQS LQTEIPQDIKEVTLVVSDQKQIELDNNAQIVYSATGQVQVNSNKLVEDDIKEEYNQIIVP 
KGKRSQIVLADNSKIWINSGSKVIYPRAFEGKYREIYVEGEVYLNVTHDTSKPFIVNTSG FEVRVLGTSFNISAYKNQEKAAVVLVEGSVNVKDQQNHHIKMVPNEKVELNQEGISGKEK VNARDYISWIDGIWTLQGESLKQVLLRLQDYYGQNIRCDAAIENEQMFGKLFLNDDLNQV MKSILSILPAEYTMKNNVIYIE >gi|226332036|gb|ACIB01000020.1| GENE 102 149593 - 149724 56 43 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MFNLKKHTLLCRKKADKKRKSDNTPILSDSNDFLLNHKLLMIN >gi|226332036|gb|ACIB01000020.1| GENE 103 149740 - 153138 2903 1132 aa, chain + ## HITS:1 COG:no KEGG:BF0273 NR:ns ## KEGG: BF0273 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1132 1 1132 1132 2191 99.0 0 MKKKAIPCHKAGRITSFFLLISIFLLIPSITTPVYAVETYTQQTVFTLHATNKTVKEVFE YIEKNSEFVVLYSKDLLPVLQKKVSVSIDKQNVESILNILSKEAGLKYNINDRQITITKV TAEAPQQEKKIKITGQVLDENGEGIPGANIVIKGNSTLGTVTNVEGNFTLMAPENSTLVA SFIGYTPVEIPLKGKKIVVFKLVPDAQSLEEVVVVGFGTQKKASVVGAVQSIKPAELRVP SSNLSTSFAGRIAGVISMQRTGEPGADGANFWIRGAATFSGTTDPLIFIDGVEVSAGDMN AIPSEAIENFSILKDASATALYGARGANGVILITTRTGKDLEKARINVRIDNTFTAPTRT LKLADAVTAMKLRNEAILTRNPDGTPAFSDDKIQGTLEGRNQYVYPNVDWFDYMFKDYSM NQSANLNVMGGTKKVDYFISASINNDNGMLKKDPNNTFDNNIQNLRYSFQSNVGAWLTSS TKVNVRINSQIVNYNGPSTSMDDLYKYVMEAPSMYFAPVYPNINREDHTIFGNKSGGPIG SGGFSIYRNPYASMVQGSSKQSAYTINTAFELEQKLDFLTKGLNFKALVSFKNWSKTTVN RSFSPYFYELQNPQEQEDGSYLYDYNSISKGRTALETSTSTTGDRLMNLQATLNYQRMFG DKHDVGAMLVYLQREYNLNNPDNNYYNTLPERNQGLAGRVTYAYDGRYLAEFNFGYNGSE NFEKGSRYGFFPSLAVGYLISNEKFFEPLTKVISNLKIRASYGLVGNADIGSNRFPYLTK VDLGGAGFVFGDQWQTSSNGATITTYGAEKVTWEIGKKYNVGFDLGLFNKLSLNVDFFRE DRKDIFLRRNTIPAESGITGDLRPYGNLGKVRNQGVDMSLDYNHAVSKDFMISAKGTFTY AKNQYMEIDEPDYEYAYMSQVGRPLNQYKGYIALGLFKDQEEIDNSPKQILTGVVQPGDI KYADLNNDGKIDGNDQTYIGNPELPQISYGLGVSIQYKKWDASIFFQGVGKRSIMLSDIH PFGGESYGVMQFVADNHWTEANPNPEAMYPRLTNGKNNNNNPNSTYWLRDGSYIRLKNVE LGYSYKFLRAYISGQNLLTFSKFKLWDPELYTSNGLKYPTQIMGSIGLQFTF >gi|226332036|gb|ACIB01000020.1| GENE 104 153153 - 154997 1810 614 aa, chain + ## HITS:1 COG:no KEGG:BF0272 NR:ns ## KEGG: BF0272 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 614 1 
614 614 1272 100.0 0 MKLKNIIVALLIGASLHSCDYLDIVPDDTPILADAFKNEQTAENFVFACYSFIPNYLNFR QNFSWCTTPETVGSAHWTTTWFTFMRMQQGLYNSADPIIDVWQSSYNGIRQCYTFLDNID DVKPSQISEADLAAKKVLWKGEVKFLIAYYHYLLLQNYGPIVILDEAIPLNAPKEELFKP RVPYDECVSRIAQMFDNASADLPMTVKASNYGRATKVIAQALKARMYLYAASPQFNGNAD MYKNFKNKDGQLLMNLTYDKNKWKTAMDECKKAIDMAHQAGAELYKYTKKGNLPEFNQAI ANARNLVVDAWNKELIWGYSGWKETWADGNSIQTHVIPKGISTSSGAPYGALGATAFSAD MYLTKNGLPIDEDPEFDYAHRFTVAEGDSVAVLHRNREPRFYGSIGFNRGDYLINGDTIN LKMRFKEQNGTRDAGSDQLYGSYAIAKLAHPETFVSGTSNSLVAFPFPIIRLGELYLDYA EAYFEYNGTLEGDALTYFNLIRQRAGIPNVEVSYKGLPSGDKLREVIHRERTIELMFEGH MSYDYRRWLIALKEWSGMENGMIGLNSYGTTNEEYYKNARLDAQPFIFRDEQYLSPIKQD YLNVNSNLVQNPGW >gi|226332036|gb|ACIB01000020.1| GENE 105 155089 - 156681 940 530 aa, chain + ## HITS:1 COG:no KEGG:BF0271 NR:ns ## KEGG: BF0271 # Name: not_defined # Def: alpha-galactosidase precursor # Organism: B.fragilis # Pathway: not_defined # 1 530 1 530 530 1093 100.0 0 MKKKKVTTYCCLLLLASFFTTVTAQNTNTPMMGWSSWNTFRVHINEELIKETADAMVNRG LKDVGYGYVNIDDGYFGGRNSEGRLFANKKKFPNGMRVLSDYIHSKGLKAGIYSDAGSNT CGSIYDADTLGIGVGLWKHDDIDCQTFLKDWGYDFIKIDWCGGEATGQSEQQRYTDIYKA IRRTGRTDVRYNICRWQFPGTWATQLAGSWRIHTDINPRFTTIDRIIERNLYLAPYASPG HYNDMDMLEVGRGLTEDEEKTHFGIWSILSSPLMIGCDLRTIPEKTLSIITNKEVIALNQ DSLGLQAEAIERGKDYLILSKAIQKREGKLRAVALYNRSNTDQQIRVDFDKLYLSGDVRV RDLWNHQEMGTFTDYYETLVPAHGTALIRLEGSKRHDRTCYEAEYAFMQEFLPDNKQAAH FTPKSGASGEYIMKNLGNSPSNWAEFRNVYISKGGDYQLKLTYYSGDKRDIQIAVNGTEY KQSNLYSGTWDQAATTTIKVKLRKGYNTIRLYNSYGWAPDIDKMEIIKGR >gi|226332036|gb|ACIB01000020.1| GENE 106 156756 - 157076 449 106 aa, chain - ## HITS:1 COG:STM0930 KEGG:ns NR:ns ## COG: STM0930 COG0393 # Protein_GI_number: 16764292 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Salmonella typhimurium LT2 # 1 103 1 103 107 127 63.0 6e-30 MLLATTPIIEGKRITTYYGIVSGETIIGANVFRDFFASIRDIVGGRSGSYEEVLREAKDT ALKEMSEQARQMGANAVIGVDLDYETVGGSGSMLMVTASGTAVFLE >gi|226332036|gb|ACIB01000020.1| GENE 107 157374 - 158585 1034 403 aa, chain - ## HITS:1 COG:mlr0021 KEGG:ns NR:ns ## COG: 
mlr0021 COG0520 # Protein_GI_number: 13470346 # Func_class: E Amino acid transport and metabolism # Function: Selenocysteine lyase # Organism: Mesorhizobium loti # 2 403 11 412 413 457 52.0 1e-128 MNIHKIREDFPILSRTVYGKPLVYLDNGATTQKPRLVIDSIVDEYYSVNANVHRGVHFLS QQATELHEASRETVRQFINARSTREVIFTRGTTESINLIVSSFGEEFMQEGDEVIVSVME HHSNIVPWQLLAARKGIAIKVIPMNDKGELLLEEYENLFSERTKIVSVTQVSNVLGTINP VKEMIATAHAHGVPVMIDGAQSIPHMKVDVQDLDADFFVFSGHKIYGPTGIGVLYGKEDW LERLPPYQGGGEMIQSVSFEKTVFGELPFKFEAGTPDYIATTGLAKALDYVTGIGLDPIA LHEHELTVYAMQRLKEIPNMRIFGEAEHKSSVISFLVGDIHHLDLGTLLDRLGIAVRTGH HCAEPLMRRLGIEGTVRASFAVYNTKEEVDALVAGIERVSKMF >gi|226332036|gb|ACIB01000020.1| GENE 108 158597 - 159940 1289 447 aa, chain - ## HITS:1 COG:alr2494 KEGG:ns NR:ns ## COG: alr2494 COG0719 # Protein_GI_number: 17229986 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ABC-type transport system involved in Fe-S cluster assembly, permease component # Organism: Nostoc sp. PCC 7120 # 26 425 45 440 453 179 30.0 7e-45 MNAEQQYIDLFSQCEAMICRHSAEALNAPRATAFADFERQGFPTRKQEKYKYTDVSKFFE PDYGLNLNRLPIPVNPYEVFKCDVPNMSTSLFFVVNDAFYNQALPKSGLPEGVIFGSLRN MAEQHPELVKKYYGKLADTSKDAVTAFNTAFAQDGVLMYVPKNVIVDRPIQLVNILRADV NFMVNRRVLIILEEGAQARLLICDHAMDNVNFLATQVIEVFAEENSVFDLYELEETHTST VRFSNLYVKQGANSNVLLNGMTLHNGTTRNTTEVTLAGEGAEINLCGMAIADKNQHVDNN TSIDHAVPNCTSNELFKYVLDDQSVGAFAGLVLVRPDAQHTSSQQTNRNLCATRDARMYT QPQLEIYADDVKCSHGATVGQLDENALFYMRARGIAEKEARLLLMFAFVNEVIDTIRLEA LKDRLHLLVEKRFRGELNKCQGCSICK >gi|226332036|gb|ACIB01000020.1| GENE 109 159949 - 160758 210 269 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020817|ref|YP_526644.1| ribosomal protein S16 [Saccharophagus degradans 2-40] # 16 254 7 232 318 85 27 2e-15 MVGRGYFWKRIDNEIIDKIMLEIKDLHASINGKEILKGINLTVKPGEVHAIMGPNGSGKS TLSSVLVGNPAFEVTKGSITFYGKNLLELSPEDRSHEGIFLSFQYPVEIPGVSMVNFMRA AVNEQRKYKGLPALTASEFLKLMREKRAVVELDNKLANRSVNEGFSGGEKKRNEIFQMAM LEPRLSILDETDSGLDIDALRIVAEGVNKLKTPDTSCIVITHYQRLLDYIKPDIVHVLYK GRIVKTAGPELALELEEKGYDWIKKELGE >gi|226332036|gb|ACIB01000020.1| GENE 
110 160725 - 161471 389 248 aa, chain - ## HITS:1 COG:no KEGG:BF0266 NR:ns ## KEGG: BF0266 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 248 1 248 248 493 99.0 1e-138 MKTKRVGWLLIFLSYVGVVLAQNLDDQERRWAISGSWGGNWPIVTKNTLSGKAVSAGHIH TLMLEYYIPYTRFSLKGGYTGEEIGLNPGISASMSNLEIGGRYYFLPQRFAIQPYGGLST GWNLSPRRQEGMGSSSYYDPSRQEFRKDYDYRYRIKEPLFTVSPVVGADIYFLSCLALTL KYNFRMGIAGKISGEIEKTNSRGTGFVRSNGMRQTVSVGVKVNFPFTITQTDGNSILQWL DEVIFGKE >gi|226332036|gb|ACIB01000020.1| GENE 111 161471 - 162925 1226 484 aa, chain - ## HITS:1 COG:SMc00530 KEGG:ns NR:ns ## COG: SMc00530 COG0719 # Protein_GI_number: 15965488 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ABC-type transport system involved in Fe-S cluster assembly, permease component # Organism: Sinorhizobium meliloti # 4 484 5 489 489 714 69.0 0 MQQEEPNKYVKELTQEKYKYGFTTEVHTDIIEKGLNEDVVRLISSKKNEPEWLLEFRLKA YRHWLTLEMPTWAHLRIPEIDYQAISYYADPTKKKEGPKSMDEVDPELIKTFNKLGIPLE EQMALSGMAVDAVMDSVSVKTTFKETLMEKGIIFCSFSEAVREHPDLVKKYLGSVVGYRD NFFAALNSAVFSDGSFVYIPKGVRCPMELSTYFRINAANTGQFERTLIVADDDSYVSYLE GCTAPMRDENQLHAAIVEIMVHDRAEVKYSTVQNWYPGDAEGKGGVYNFVTKRGNCKGVD SKLSWTQVETGSAITWKYPSCILSGDNSTAEFYSVAVTNNYQQADTGTKMIHLGKNTRST IVSKGISAGKSENSYRGLVRVAEKADNARNYSQCDSLLLGDKCGAHTFPYMDIHNETAVV EHEATTSKISEDQIFYCNQRGISTEDAIGLIVNGYAKEVLNKLPMEFAVEAQKLLTISLE GSVG >gi|226332036|gb|ACIB01000020.1| GENE 112 162931 - 163434 449 167 aa, chain - ## HITS:1 COG:no KEGG:BF0220 NR:ns ## KEGG: BF0220 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 167 1 167 167 250 99.0 2e-65 MTTIDIIILIALGAGVIVGFMKGFIRQLASILGLIVGLLAAKALYTSLAVKLCPTVTDSM TVAQILAFVIIWIAVPLIFTLVASVLTKALEAVSLGWLNRMLGAGLGALKYLLLVSLVIC VIQFIDSDSQLISQTKKEQSLLYYPMESFAGIFFPAAKEVTQQYIFK >gi|226332036|gb|ACIB01000020.1| GENE 113 163509 - 166556 3287 1015 aa, chain - ## HITS:1 COG:BH2413 KEGG:ns NR:ns ## COG: BH2413 COG0532 # Protein_GI_number: 15614976 # Func_class: J 
Translation, ribosomal structure and biogenesis # Function: Translation initiation factor 2 (IF-2; GTPase) # Organism: Bacillus halodurans # 442 1014 157 730 730 542 53.0 1e-153 MTIRLNKVTRDLNVGIATVVEFLQKKGYTVEANPNTKITEEQYAMLVKEFSTDKNLRLES ERFIQERQNKDRNKASVSIDGYDKKEPEKTVADDVIKTVIPEDVRPKFKPVGKIDLDKLN RKVEKEPVKEEPKPQPVAAEEKKVAEEVKPVVNEVKKEEVTVTPATSEPKPVKEEPKPVV VEKPVETEKKVVEEVKKEEPKVVVSPEKTEKKEEKPVAEAPVTPVEKEEEGVFKIRPTEF VSKINVIGQIDLAALNQSTRPKKKSKEEKRKEREEKEKLRQDQKKQMKEAIIKEIRKEDS KQAKVVGKENLDPNGKKKRNRINNNKEKVDVNNVASNFAHPTPNSERTNNNRGGNQQGGG GQNRNRNNNNKDRFKKPVVKQEVSEEDVAKQVKETLARLTSKGKNKGAKYRKEKRDMASN RMQELEDQEMAESKVLKLTEFVTANELASMMNVSVNQVIGTCMSIGMMVSINQRLDAETI NLVAEEFGFKTEYVSAEVAQAIVEEEDAPEDLEHRAPIVTVMGHVDHGKTSLLDYIRKAN VIAGEAGGITQHIGAYHVTLEDGRKITFLDTPGHEAFTAMRARGAKVTDIAIIIVAADDD VMPQTKEAINHAAAAGVPIVFAINKIDKPHANPEKIKETLAQMNYLVEEWGGKYQSQDIS AKKGLGVPELMEKVLLEAEMLDLKANPNRNATGSIIESTLDKGRGYVATVLVSNGTLKVG DIVLAGTSYGRVKAMFNERNQRVAQAGPSEPVLILGLNGAPAAGDTFHVIETDQEAREIA NKREQLQREQGLRTQKLLTLDEVGRRIALGNFQELNVIVKGDVDGSIEALSDSLIKLSTE QIQVNVIHKAVGQISESDVTLAAASDAIIIGFQVRPSASARKFAEQEGVDIRLYSVIYAA IEEVKAAMEGMLAPEVKEVVTATIEVREVFHITKVGTVAGAVVKEGKVKRSDKARLIRDG IVIFSGSINALKRFKDDVKEVGTNFECGISLVNYNDLKVGDMIETYEEVEVKQTL >gi|226332036|gb|ACIB01000020.1| GENE 114 166677 - 167939 601 420 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|17988250|ref|NP_540884.1| transcription elongation factor NusA [Brucella melitensis 16M] # 1 403 1 426 537 236 33 1e-60 MAKKEETISLIDTFSEFKELKNIDRTTMVSVLEESFRSVIAKMFGTDENYDVIVNPDKGD FEIWRNREVVADEDLTNPNMQISLTEAQKIDASYEVGEEVTDEVIFAKFGRRAILNLRQT LASKILELEKDSIYNKYIDKVGTIINAEVYQIWKKEMLLLDDEGNELLLPKTEQIPSDFY RKGETARAVVARVDNKNNNPKIILSRTSPVFLQRLFEMEVPEINDGLITIKKIARIPGER AKIAVESYDDRIDPVGACVGVKGSRIHGIVRELRNENIDVINYTSNISLFIQRALSPAKI SSIRLNEEERKAEVFLKPEEVSLAIGKGGLNIKLASMLTEYTIDVFRELDENAQDEDIYL DEFRDEIDGWVIDAIKAIGIDTAKSVLNAPREMLIEKTDLEEETVDEVLRILKSEFEDNE >gi|226332036|gb|ACIB01000020.1| GENE 115 167942 - 168409 608 155 aa, chain - ## HITS:1 COG:no KEGG:BF0261 NR:ns ## KEGG: BF0261 # 
Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 155 1 155 155 268 100.0 5e-71 MIEKRTVCQIVEEWLEDKDYFLVEVTVSPDDKIVVEIDHAEGVWIEDCVELSRFIESKLN REEEDYELEVGSAGIGQPFKVLQQYYNHIGLEVEVLTKGGRKLSGVLKDADEEKFVVTVQ KKVKPEGAKRPQLVEEDETFTYDDIKYTKYLISFK >gi|226332036|gb|ACIB01000020.1| GENE 116 168547 - 169761 483 404 aa, chain - ## HITS:1 COG:no KEGG:BF0260 NR:ns ## KEGG: BF0260 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 404 2 405 405 853 99.0 0 MKLFREILIICLLGKLIACSPLASGEINDVWGHKQVATVEMAGSDSVWVCHLSMLKDTVT VPLSYFVEELEMVKLDNRDAALVSSSKTIIGKQYILVHKMGHVPFKLFTKSGTYLRDIGS FGQGAGEYGLAYDAQMDEENNRFYVLCWQADHILVFDLQGNILQPIRLAHWSPKGVFHVE TERGRVHVCALSFNRDFVGDRHSPMIWTQSLDGKIIKELPAGYLAVNDYGNEIKSLNNGT MMDIGFWFGGQYRNDSLYHYNNQEFRLVPRFTLDYGGHELTPHSFGELPNHFWGEISYPV RLSPHSSTTTPPEYYMVDKHTLRGAFVEIYNDFLEGIPADWFFSSHDGYYVWNVEPVRLK QMVEDRLSSGEIVSDSDRRKLTELLRSTKENDNNYIFYGRLKCR >gi|226332036|gb|ACIB01000020.1| GENE 117 169761 - 170912 651 383 aa, chain - ## HITS:1 COG:no KEGG:BF0259 NR:ns ## KEGG: BF0259 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 383 41 423 423 795 99.0 0 MLNRLNYFIMLAGLLVLVACSSNSGKQVEVANTPFVYDGLKEYPVKELKLSDLAVSDYVL LKDDENSLLGRLPTNPCMQVTEDRIYIQDEEQQAIFIFDRQGNPLLQMRHKGGGPQEWAS LNSFYVDSPNKEIIVLDWAKKFIVYDLNGKFKRSFPTPGCSWKFANLNDEAVLIYCPFTN RNNGEAVCILSKKDGKKLYVCPITIDNFVWDSEGRIGYEPLKPAYGGILFSDLSLKGVYF IDAETYEVKQVIDEVTEYKFENAEFVKLHPAIDAKDYTLYTTLGTKWLTPDMPMNYYYFD KKEQKMYTLKNETGWAVLKDVCNVQRTRTTNTPGLGIGYYWPSTMKGESMQAEKEQFDPR FRAIMESIPEEGNPVLQIMNFNK >gi|226332036|gb|ACIB01000020.1| GENE 118 171083 - 172186 780 367 aa, chain - ## HITS:1 COG:no KEGG:BF0258 NR:ns ## KEGG: BF0258 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 367 1 367 367 685 99.0 0 MRCLTILLGNCFLLLVSLASCGKVSLAEEAVFSIPVDTTFMRLRQWEWYCQKRADSCLTE NNYQGALSWLDSARIQVEHYGRPYYILARGDVYYSIHQYDSARRYFSMAAHSIHPHIAIE 
AWRKLAELELMEGNEKQVFYSTQKADALFRVEIGHVQSDNSEALYQEERLKNELNQLKIA KQNREIAMLTLSLCLIILIALFIFYRQNKIKREKERLLLEEKAKLEQENQILKQTEELSA LREKEAVLRESLFRKVDVLRKIPSLNEEEQESGEHRIALSEREWEEIRQTVDNAYDGFSQ RLLARFPLLTLKDIYFCCLVKINVSIKDLSDIYCISRTSVSKKKFRIKREKLGAEDSDSL DDFLRGF >gi|226332036|gb|ACIB01000020.1| GENE 119 172200 - 173531 956 443 aa, chain - ## HITS:1 COG:HI0610 KEGG:ns NR:ns ## COG: HI0610 COG0738 # Protein_GI_number: 16272552 # Func_class: G Carbohydrate transport and metabolism # Function: Fucose permease # Organism: Haemophilus influenzae # 5 430 2 422 428 398 51.0 1e-110 MKNTNRSILHKDGVSYILPFILVTSCFALWGFANDITNPMVKAFSKIFRMSVTDGALVQV AFYGGYFAMAFPAAMFIRKYSYKAGILLGLGLYALGALLFFPAKMTGDYYPFLLAYFILT CGLSFLETSANPYILSMGTEETATRRLNLAQSFNPMGSLLGMYVAMNFIQARLNPMDTVE RSQLSPAEFEVLKESDLSVLIAPYLIIGLVILAMLFVIRAVKMPKNGDKNHNIDFIPTLK RIFKIPHYREGVIAQFFYVGAQIMCWTFVIQYGTRLFMSQGMEEKAAEVLSQEYNIIAMI IFCISRFVCTFILRYLNPGMLLKILAIAGGAFTLGVIFLQDIWGLYCLVAVSACMSLMFP TIYGIALRGLGDDAKFGAAGLIMAILGGSVLPPLQACIIDQHTLLGMPAVNLSFILPFIC FVVIIIYGHRTCARVKKIKAARK >gi|226332036|gb|ACIB01000020.1| GENE 120 173543 - 174955 972 470 aa, chain - ## HITS:1 COG:SP2167 KEGG:ns NR:ns ## COG: SP2167 COG1070 # Protein_GI_number: 15901977 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar (pentulose and hexulose) kinases # Organism: Streptococcus pneumoniae TIGR4 # 1 470 1 465 467 343 40.0 3e-94 MSTYLAADFGGGSGRIMAGTLTEGKLKLEEVYRFANRQIKLGNCVYWDFLSLFEEMKNGL RVAARKGYEVKSMAIDTWGVDFGLIDKDGKLLGNPVCYRDSRTDGIPERVFKQIDQTVHY AEIGIQVMPINTLFQLYSMKQNDDVQLRVADKLLFMPDLFSYFLTGVANNEYCIASTSEL LDARQRNWSDNLISELGLPRQLFGEIVFPGTVRGKLKQEIADETGLGCINVVAVGSHDTA SAVFAVPSNEPNRAYLSSGTWSLLGAEVDQPILTEEARVAGFTNEGGIQGKIRFLQNITG LWILQRLMAEWKEQGKEISYDCAIAEATASDIRSVIDVDDSAFCNPDHMEESIIKYCHKH HLRTPVSQGEFVRCVIESLAYRYKLGVEQMNRCLPAPVKQLHIIGGGCQNRLLNQLTANA LGIPVYAGPVEATAIGNILVQAKAQGEVDSWEELKEIIINSVEPQVYYPE >gi|226332036|gb|ACIB01000020.1| GENE 121 174973 - 175611 544 212 aa, chain - ## HITS:1 COG:BMEII1095 KEGG:ns NR:ns ## COG: BMEII1095 COG0235 # 
Protein_GI_number: 17989440 # Func_class: G Carbohydrate transport and metabolism # Function: Ribulose-5-phosphate 4-epimerase and related epimerases and aldolases # Organism: Brucella melitensis # 26 202 40 219 224 103 34.0 3e-22 MITNEHIEQYLAQAHRYGDAKLMLCSSGNLSWRIGEEALVSGTGSWVPNLQKEKVSICNI ATGTPQNGVKPSMESTFHLGILRERPDVNVVLHFQSEYATAVSCMKNKPSNFNVTAEIPC HVGKEIPIIPYYRPGSPALAKAVVEAMKEHNSVLLTNHGQVVCGKDFDQVYERATFFEMA CRIIVQSGGDYSVLTPEEIDDLEVYVLGKKTK >gi|226332036|gb|ACIB01000020.1| GENE 122 175608 - 176762 1332 384 aa, chain - ## HITS:1 COG:STM2973 KEGG:ns NR:ns ## COG: STM2973 COG1454 # Protein_GI_number: 16766278 # Func_class: C Energy production and conversion # Function: Alcohol dehydrogenase, class IV # Organism: Salmonella typhimurium LT2 # 1 383 1 381 382 421 56.0 1e-117 MINRFILNEVSYFGPGAREVLPKEISRLGLHKAFVATDKDLIKFGVADKVLKVLEAAKIP YEIFSEIKPNPTVSNVKAGVEAFASSGADFILAIGGGSSMDTAKAIGIITNNPEFSDVVS LEGVADTKKKSVPIIALPTTAGTAAEVTINYVITDEKNQKKMVCVDPNDIPSIAIVDAEL MYTLPKSLTAATGLDALTHAIEGLITKGAWEMSDMFEIKAIEMINRYLVTAVEEPSNAEA RNGMAVAQYIAGMAFSNVGLGVVHGMAHPLGAIFDIPHGVANALLLPIIMEFNAPAALDK YVEIAKAMNVYSTDMTKEKAAEAAVEAVKTLSLRVNIPQHLSDLGIQESDLDRLATAAFA DVCTPGNPREVTKEIILDLYKKAL >gi|226332036|gb|ACIB01000020.1| GENE 123 176781 - 178553 1965 590 aa, chain - ## HITS:1 COG:SP2158 KEGG:ns NR:ns ## COG: SP2158 COG2407 # Protein_GI_number: 15901968 # Func_class: G Carbohydrate transport and metabolism # Function: L-fucose isomerase and related proteins # Organism: Streptococcus pneumoniae TIGR4 # 1 590 1 588 588 845 67.0 0 MKKYPKIGIRPTIDGRQGGVRESLEEKTMNLAKAVAELITSNLKNGDGTPVECVIADGTI GRVAESAACAEKFEREGVGATITVTSCWCYGAETMDMNPYYPKAVWGFNGTERPGAVYLA AVLAGHAQKGLPAFGIYGRDVQDLNDNSIPADVAEKILRFARAAQAVATMRGKSYLSMGS VSMGIAGSIVNPDFFQEYLGMRNESIDLTEIIRRMAEGIYDKEEYAKAMAWTEKYCKKNE GNDFNIPEKTKTRAQKDEDWVFIVKMTIIMRDLMQGNPKLKELGFKEEALGHNAIAAGFQ GQRQWTDFYPNGDFSEALLNTSFDWNGIREAFVVATENDACNGVAMLFGHLLTNRAQIFS DVRTYWSPEAVKRVTGKELTGMAANGIIHLINSGATTLDGTGQQTNANGEPAMKPCWEIT 
EGEVEKCLEATTWYPANRDYFRGGGFSSNFLSKGGMPVTMMRLNLIKGLGPVLQIAEGWT VEIDPEIHKLLDERTDRTWPTTWFVPRLCDKPAFKDVYSVMNNWGANHGAISYGHIGQDV ITLASMLRIPVCMHNVEEDQIFRPAAWNAFGMDKEGADYRACTTYGPIYK >gi|226332036|gb|ACIB01000020.1| GENE 124 178626 - 179606 635 326 aa, chain - ## HITS:1 COG:L0146 KEGG:ns NR:ns ## COG: L0146 COG1609 # Protein_GI_number: 15673482 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Lactococcus lactis # 13 314 7 328 345 70 25.0 5e-12 MKITFGQQTTKVKQLADKISFDISMGVYKSGDSLPSINQLSQAYEVSRDTVFKAFLDLKE RGIIDSTPGKGYYVVGRLKNVLLLLDEYSPFKYALYNSFVKRLSIRYKVDLLFHQYNERL FNTIIRESLGRYNKYIVMNFDNEKLSPNLYKINPSKLLLLDFGKFEKEGFSYVCQDFDQG FYNALFQLADRLRKYQKLVFVLVDDSMHPRSSRDFFERFCADQHLGCEVVSDIEGLQVRR GEVYIAIRQIDVVSIIKKSRVEGLQCGVDFGLIGYNDTPAYEVIDQGITALSVDWEKMGD KAAEFVLQGKTIQDYLPTEVRLRASL >gi|226332036|gb|ACIB01000020.1| GENE 125 179698 - 179877 82 59 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MVCYTTSKAENKAIIYSNHLLYNQQSYSYLNIEKHPLCYKKSKSIDFTNLKYKSKSIFL >gi|226332036|gb|ACIB01000020.1| GENE 126 179811 - 180371 325 186 aa, chain - ## HITS:1 COG:no KEGG:BF0251 NR:ns ## KEGG: BF0251 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 186 1 186 186 350 100.0 1e-95 MRKSNDIIFYSLLALCLFTNCLFIGYYYYQQNREVLLGQELEHQKKQNYELIVNQIESGI IPHVISDKKEFAGYFVLVFPNGICDVCNKWLFKQISELSSTSDLVVVVPDKLKKNMEIYN TVYKLKLSSIFCSEKYAMPQEEFKDMTYIFYCSKTGTVLYPLALHHKNIDLDLYFKLVKS IDLDFL >gi|226332036|gb|ACIB01000020.1| GENE 127 180358 - 181542 495 394 aa, chain - ## HITS:1 COG:no KEGG:BF0250 NR:ns ## KEGG: BF0250 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 394 68 461 461 768 100.0 0 MKYLNLFIFVLLLAGCNRPVKHSDIIQADTMVSIIPQEDTITLSALFSRCEIVKLNDIVL ASINKVFKYDSLWIVQGKSDQGGVHLFNNEGRYLKTVLKWGQGPEEAYDIWSIKLLDGSI YLLINSGTEVVEYSLQKQKMVERFRLPSEILSATDFVVDNGGNYIFLKSISREKKKEEYK LYVYNKKEGTIVNRILNMDKKSSEYISFDQSDCLYRVQDEIYYYEVFRNGICRLSANDMT GYIAFKQNEYTFPEKELYNEDHTFQSFIDVCENSPFIWAHRNLFEGERFVSSTYMYKKEL 
FWNIIDKSDYSVHSYKWVYDDLILNEVVPVEDYLYRANVQENIHYYTLSFYDFDRIMQLK KKCKKSVGEKWMVKLDDMLDENSNDIIVCFYEKK >gi|226332036|gb|ACIB01000020.1| GENE 128 181613 - 181879 193 88 aa, chain - ## HITS:1 COG:no KEGG:BF0207 NR:ns ## KEGG: BF0207 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 88 1 88 88 142 100.0 4e-33 MNYKKKIICLLALFTIVVVNVLNVVVKSDDAETLTLSGIEAVAATYENSPGNYTGAHNQY CTSPKNATGCVSDPDPTRTCSYSIFCKK >gi|226332036|gb|ACIB01000020.1| GENE 129 182060 - 182287 218 75 aa, chain - ## HITS:1 COG:no KEGG:BF0249 NR:ns ## KEGG: BF0249 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 75 1 75 75 141 100.0 8e-33 MRNFFVSAFLLLVGIAVMTVCRMNNKQYLSELALVNVEALATGEGDVPTSCYGSGNVDCP ISDSKVSYVMNGRSF >gi|226332036|gb|ACIB01000020.1| GENE 130 182448 - 183503 534 351 aa, chain - ## HITS:1 COG:no KEGG:BF0248 NR:ns ## KEGG: BF0248 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 351 36 386 386 674 99.0 0 MVMSWGKTILGCLIGGYALLGLLGGNYAYEQEVKALHVYADSVFHEAFHVELQKRGMDQV ESWRYGCEDSFVSSVDTAFKKVTIQDEYGTYSFRVDAMKIRKNIVSSPGEQGLHTVVCLT HPLSVDTLNILWRTMLNERQKFPIRTGLKLTVSDNNGVVRSSFSPDSLSCLSYSSIFTYY VGYRCEIEILGFVSISFFSVFVNIVWTLIGVVVAFVLCVILTIYIYKLSVHPPKIKEVTT YIQTVAVKKGTLPIYDLKDDLKLDVGKGVLICENMEVSLTPQQRVLLVLFIKAENHTLSM SQIMADVWPGKSISPDCFHKAIERLRDLLRQLPMTIQIEYLGEEIYQMQIL >gi|226332036|gb|ACIB01000020.1| GENE 131 183608 - 184006 447 132 aa, chain + ## HITS:1 COG:no KEGG:BF0247 NR:ns ## KEGG: BF0247 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 132 1 132 132 219 100.0 2e-56 MAHRLNTNKQFMVGNGILAFAVIFVVVIFVYMSLRLQREKEANRHFSETYSIQLTKGFVG DSISLFVNDSLIMNKQIKEEPTAIEVERFAEQSALMIVNNQTETVAAFDLSEKGGTYRFE KDIDGIKQLPQK >gi|226332036|gb|ACIB01000020.1| GENE 132 184168 - 184866 400 232 aa, chain - ## HITS:1 COG:AGc25 KEGG:ns NR:ns ## COG: AGc25 COG1451 # Protein_GI_number: 15887375 # Func_class: R General function prediction only # 
Function: Predicted metal-dependent hydrolase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 13 232 33 249 255 82 28.0 1e-15 MDRVIEDKELGRLVVRDNVRAKRLVFRTKADAIYISIPLGVTMREVKEAIEKLRPRLLDS RQKLVRPLIDLNYRIETEYFKLSLVSGKRERFLAHSELGEMRIICPPTADFTDSNLQDWL RKVIEEALRRNAKIILPPRLYMLSEKHRLPYESVQINSSRGRWGSCSSRKKINLSYFLVL LPKHLIDYVLLHELCHTCEMNHGDRFWDLLNGLTDGKALELREELKRYKTEI >gi|226332036|gb|ACIB01000020.1| GENE 133 184987 - 185721 550 244 aa, chain + ## HITS:1 COG:no KEGG:BF0245 NR:ns ## KEGG: BF0245 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 244 1 244 244 400 100.0 1e-110 MKTIFRMLSVLLLTTGLLSSCIQIGEGIQPSKKLITRDYKVKEFNKIDAGTVGNIYYTQS TDGKTDLQIYGPDNIVALIQVAVKDNTLFLSIDKSKKVRNFKKMKITITSPTLNGISFKG VGDVHIENGLTTDNLDIESKGVGNVDIQSLTCQKLNVQSMGVGDVKLEGTAQIAALHSKG VGNIEAGNLRANAVEASSQGVGDITCNATESIDAAVRGVGSIKYKGSPTIKSLSKKGVGT IKNI >gi|226332036|gb|ACIB01000020.1| GENE 134 185741 - 186520 597 259 aa, chain + ## HITS:1 COG:no KEGG:BF0244 NR:ns ## KEGG: BF0244 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 259 1 259 259 422 100.0 1e-117 MKKYILSSLTITFLLLSITACSQGKQISGSSNYITKNIKVGSFDQIKSMSSSDIVYTQKQ GAPTVQIYGPDNIVELMETSVSGRTLTIKFKKNTSIRNSGKLEIRVSSPSLKHLSIYGSG NTTFTNGIKSHDELQMSIYGSGNISGNSFSCAKLAARIYGSGNVNLKRISTSDTQVNISG SGNVLLDGKSTEAEYHIAGSGDINATELKVDNVNARISGSGSIRCYATENLTGGVSGSGN VAYKGNPQINFSKRGLQKL >gi|226332036|gb|ACIB01000020.1| GENE 135 186708 - 187166 328 152 aa, chain - ## HITS:1 COG:no KEGG:BF0243 NR:ns ## KEGG: BF0243 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 152 7 158 158 266 100.0 1e-70 MCWMALLSVAVSAQDFASRFMAEHQADSNLTCVTISPKMMEEIMKSDAEKDKEVLDMISN LKSMQVLTSDVEGKKYFNAALKVVEKNSGRFESFLSFKDKSENCQIMVRKKKSTIVELVM LMHEKNHFAVVNFTGNMSPEFIAQIKRHFHLL >gi|226332036|gb|ACIB01000020.1| GENE 136 187189 - 187704 559 171 aa, chain - ## HITS:1 COG:no KEGG:BF0242 NR:ns ## KEGG: BF0242 # Name: not_defined # Def: hypothetical protein # 
Organism: B.fragilis # Pathway: not_defined # 1 171 1 171 171 296 100.0 2e-79 MNMDYKYIEQLLERYWQCETSLEEESELRSFFSEEEVPAHLLRYKELFVYQTVQQEAGLG KDFDARILAQVEAPVVKAKHLTMVGRFMPLFKAAAVVALILSLGNVAQHTFFADEALDYN YDAYKDTYDDPEVAYKQVSSALMMLSEGINKSQDQVVRDSVKVEPVRVMKE >gi|226332036|gb|ACIB01000020.1| GENE 137 187688 - 188197 375 169 aa, chain - ## HITS:1 COG:MT3320 KEGG:ns NR:ns ## COG: MT3320 COG1595 # Protein_GI_number: 15842811 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mycobacterium tuberculosis CDC1551 # 6 158 97 260 284 66 26.0 3e-11 MQEISFRDDILPLKDKLFRLALRITFDRAEAEDVVQDTMIRVWNKREEWTQFGSIEAYCL TVAKNLAIDRSQKKEAQNVELTPEMEEESEISGPYDQLVNNERMSIIHRLINELPEKQRL IMQLRDIEGESYKEIAKILNLTEEQVKVNLFRARQKVKQRYLEIDEYGL >gi|226332036|gb|ACIB01000020.1| GENE 138 188258 - 189031 820 257 aa, chain - ## HITS:1 COG:MK1631 KEGG:ns NR:ns ## COG: MK1631 COG0548 # Protein_GI_number: 20095067 # Func_class: E Amino acid transport and metabolism # Function: Acetylglutamate kinase # Organism: Methanopyrus kandleri AV19 # 5 256 1 246 246 159 39.0 4e-39 MKEKLTVIKVGGKIVEEEATLNQLLNDFAAIEGHKVLVHGGGRSATKIAAQLGIDSKMVN GRRITDAETLKVVTMVYGGLVNKNIVAGLQARGVNALGLTGADMNVIRSMKRPVKEVDYG FVGDVERVDSTLLSDLIHKGVVPVMAPLTHDGQGNMLNTNADTIAGETAKALSAIFDVTL VYCFEKKGVLRDENDDESVIPQINHAEFQRYIAEGVIQGGMIPKLENSFEAINAGVSEVV ITLASAIHTDGGTRIKK >gi|226332036|gb|ACIB01000020.1| GENE 139 189171 - 191063 1725 630 aa, chain - ## HITS:1 COG:slr0662 KEGG:ns NR:ns ## COG: slr0662 COG1166 # Protein_GI_number: 16332143 # Func_class: E Amino acid transport and metabolism # Function: Arginine decarboxylase (spermidine biosynthesis) # Organism: Synechocystis # 2 630 43 687 695 600 46.0 1e-171 MRKWRIEDSEELYNITGWGTSYFGINDKGHVVVTPRKDGVAVDLKELVDELQLRDVAAPM LVRFPDILDNRIEKTAYCFKQASEEYGYKAQNFIIYPIKVNQMRPVVEEIISHGKKFNLG LEAGSKPELHAVIAVNTDSDSLIICNGYKDESYIELALLAQKMGKRIFLVVEKMNELKLI ARMAKQLNVQPNIGIRIKLASSGSGKWEESGGDASKFGLTSSELLEALDFLESKGMKDCL 
KLIHFHIGSQVTKIRRIKTALREASQFYVQLHAMGFNVEFVDIGGGLGVDYDGTRSSSSE SSVNYSIQEYVNDSISTLVDASDKNGIPHPNIITESGRALTAHHSVLIFEVLETATLPQW DDEEEIAPDAHELVQELYGIWDTLNQNKMLEAWHDAQQIREEALDLFSHGIVDLKTRAQI ERLYWSITREINQIAGGLKHAPDEFRGLSKLLADKYFCNFSLFQSLPDSWAIDQIFPIMP IQRLDEKPDRSATLQDITCDSDGKIANFISTRNVAHYMPVHSLKQKEPYYVAVFLVGAYQ EILGDMHNLFGDTNAVHVSVNEKGYNIEQIIDGETVAEVLDYVQYSPKKLVRTLETWVTK SVKEGKISVEEGKEFLSNYRSGLYGYTYLE >gi|226332036|gb|ACIB01000020.1| GENE 140 191121 - 192200 811 359 aa, chain - ## HITS:1 COG:no KEGG:BF0195 NR:ns ## KEGG: BF0195 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 359 1 359 359 686 100.0 0 MKYTLEIQKLLLQAQNDNLHPREKANLLREAIRIADENEDVQWAVEMRLDLIYELNLLSA DAEEIAVFSKILDSYENHKDQINEDDILWKYKWIWSCTFDLPSIPMEQVEAVGEDYKTRI LRNGYSLRTYYHRLSVEYTKMREYAKAKECIDKMLAEKMDDLTCEACELNFMLDYYLETG QFEEAYNRAQPLITRQVSCYEANLRAYMKLAYYACKAGKPEIAADMCARAEEALVGREKD EYLLLYLGLFIAYYFMTHPDRGWEYAERCIPWSLNTNMQKKYRFSCDMVEALSYESREEV SLSLPEEFPLYRADGIYSVAALRDYFYKQATQLASLYDTRNGNNGYQERLFNVNLIGNL >gi|226332036|gb|ACIB01000020.1| GENE 141 192209 - 193975 1142 588 aa, chain - ## HITS:1 COG:lin0941 KEGG:ns NR:ns ## COG: lin0941 COG0326 # Protein_GI_number: 16800010 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone, HSP90 family # Organism: Listeria innocua # 9 534 8 556 564 303 34.0 6e-82 MEKEGNNLFQVNLKGMIALLSEHIYSNPNTFVRELLQNSVDAITALHNIDENYSGRIDVF LNGDGSMVFQDNGIGLKEEEVYRFLTVIGESSKRDTPDADDFIGRFGIGLLSCFVVTNEI RVESRSAMGGNPVCWCGKVDGTYQTTFPDEEWEIGSRVVLRPKNEWAHLFEYEVFKKILV NYGEVLPYPVYLHRGEEEELINTPSPVWLDPKATRKELLDYGTKVFQSSALDAFPIRTEH GRIEGVLYVLPFRTQFSVRNSHKVYLKRMLLSEDDCNLLPSWAFFIRCLVNADGLLSTAS RESFVSNGSLKDARKEIGVAIKEYLRALVQNNRSVFNKILDVHHFHIKAIASEDNELLRL FMDYLPFETNKGIRSFGSIRSSNNTIYYTRNLEDFRQVRRIAGAQGRLVVNAAYTFDETL LKKYIRLNQELSLEEISPARLLEEFAEVEGNKEHRSFETKASELLKRFGCICRLKHFTPV DTPVIFVAEEKEENSKVANNPLAAVLGSVNAKKRLPPTLTFNADNEMVQTLLRIQGDNKL FQHVVHILYVQSLLQGKYPVNSEEMELFNHSLSELMTAKMNDFINFLN 
>gi|226332036|gb|ACIB01000020.1| GENE 142 193983 - 196481 2570 832 aa, chain - ## HITS:1 COG:STM1760 KEGG:ns NR:ns ## COG: STM1760 COG0790 # Protein_GI_number: 16765101 # Func_class: R General function prediction only # Function: FOG: TPR repeat, SEL1 subfamily # Organism: Salmonella typhimurium LT2 # 454 828 43 427 509 151 30.0 4e-36 METLKEKFEALAHRIQSSGKPAAAWFPQFTPVTLLNAENWWEALAVCEYALDTHEDEALT AGFFELIFSAYDCNVEVDLNEEEYAYWWEKVISVCDRVAVFNGAGWSQKGAQYSEARYGK RDLSLLFPCYEKAAEMGSPEAEATVAYWRYMGFYCEQDRAEGERRFAALSSPEALLWGKY YRAYAEQHTGSKEKALLMRKELLDELPEGHRLRAHVYAAMGDALDIEEGSVAEEAACYEK SLELVPNLYSLKNLATLYFRYPELGKQKELAFELWEKAWHAGVWSAANFLGYNYQEEEWL DMPKAIEWLEKGMLYCESYCAYELALIYLYNDEYKNVERGLMCLQRCVDDNYVEAIETLA NVYFNGELVEENISYACQLLERAIKLGSGSAAYRIGWMYERGLLSEEPDYQKAMEYYEKA VSMDNADGYARAALYLANGYSGVTDAGKSKAYYEKAAELGSCFAMVELAFLYENGEVVEQ SYEKAFDLLQKAAGQEYPYAMYRVGLYLDRGVIGEPRPEEAFAWYAKAAERGDGDAIFAL GRCYKNGIGTEENPDKALEWFTKGAENNEPRCLTEMGLAYEYGSGIEENPHQAVEYMTKA AEQNYGYAQFKMGDYFFFGYGACPEDNKQAVEWYEKAVANDIPLAMLRMGEYYLYDYDKL NESEKAFSYFKKAAEAECYNEGLGICYEMGIGVEDNETEAFKYYTLAAGSGNVMSMYRTG LCYYNGVGVKQNYTEAYRWFNDAAGNDNVASYYYLGKMLMYGEGCVPDAEAGLQWLMKAA EHNSDKAQFELGNAYLMGNGVEENDEIAMEWFEKAAENGNAKALKITGRRQR >gi|226332036|gb|ACIB01000020.1| GENE 143 196628 - 197167 400 179 aa, chain - ## HITS:1 COG:alr1244 KEGG:ns NR:ns ## COG: alr1244 COG0703 # Protein_GI_number: 17228739 # Func_class: E Amino acid transport and metabolism # Function: Shikimate kinase # Organism: Nostoc sp. 
PCC 7120 # 2 168 8 169 181 113 37.0 2e-25 MIRIFLTGYMGAGKTTLGKALARELHIPFIDLDWYIEERFHKTVGELFSERGEASFRELE KNMLHEVGEFEDVVISTGGGAPCFFDNMEYMNRVGTTVFLDVDPKVLFSRLRVAKQQRPI LQGKKDDELLDFIVQALEKRAPFYRQANYIYCADKLEDRSQIETSVQQLRKLLNLHIAS >gi|226332036|gb|ACIB01000020.1| GENE 144 197618 - 197854 92 78 aa, chain + ## HITS:1 COG:no KEGG:BF4496 NR:ns ## KEGG: BF4496 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 43 1 43 91 82 95.0 3e-15 MDSSGESRDTSFFVNYKRGLTLPKTELDSMSKRKLTTKFEEEPQKIISIFRLTPSVFQAI RRPISPHACIHEPEYAGL >gi|226332036|gb|ACIB01000020.1| GENE 145 198004 - 198606 333 200 aa, chain - ## HITS:1 COG:CAC3314 KEGG:ns NR:ns ## COG: CAC3314 COG3560 # Protein_GI_number: 15896557 # Func_class: R General function prediction only # Function: Predicted oxidoreductase related to nitroreductase # Organism: Clostridium acetobutylicum # 1 199 1 198 198 249 61.0 2e-66 MKKSFEEALKHRRTYYSITNQSPVSDEEIERIVNLAVTHVPSAFNSQSTRVVLLLGENHK KLWHIVKETLRKIVPPEVFKTTEAKIDNSFASGYGTVLFFEDQSVVKGLQEAFSSYKDNF PGWSLQTSAMHQLAVWTMLEDVGFGASLQHYNPLIDEEVRHTWHLPEEWHLIAEMPFGLP VQGPGDKDFKDLDTRVKVFK >gi|226332036|gb|ACIB01000020.1| GENE 146 198726 - 199355 484 209 aa, chain + ## HITS:1 COG:BH0863 KEGG:ns NR:ns ## COG: BH0863 COG3341 # Protein_GI_number: 15613426 # Func_class: R General function prediction only # Function: Predicted double-stranded RNA/RNA-DNA hybrid binding protein # Organism: Bacillus halodurans # 1 209 1 196 196 189 49.0 4e-48 MGKQKFYVVWDGVTPGIYTSWTECQLQVKGYDSAKYKSFDNREEAERAFAASPYAYIGKN AKKKTTGPSTDMLPVAVIENSLAVDAACSGNPGPMEYRGVHVASRQEIFHFGPMKGTNNI GEFLALVHGLALLKQKGFDMPIYSDSANAISWVKQKKCKTKLSRTAETEALFVLIERAEK WLKENKYTTPILKWETREWGEIPADFGRK >gi|226332036|gb|ACIB01000020.1| GENE 147 199359 - 200591 632 410 aa, chain - ## HITS:1 COG:no KEGG:BF0189 NR:ns ## KEGG: BF0189 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 410 1 410 410 861 99.0 0 MTKKWQYLCRCLLVLVFIGGVIPAKAQLVERVCRTDYKISPERKGELLLELDNISFFKDN 
EFAGTVIKGYSLPGLWIQPKFVYYPLKNIKLEGGVHMLWFSGAYRYPSVSYQDIALWKGE QYQKGAHLLPFFRAQISMKSVDLILGNIYGGSNHGLIAPLYNPELNLTADPETGFQVLAG APWIDLDAWIDWQSFIFRDDTHQEAFTVGLSTRFKLNAPSSTFHCYIPLQILAQHRGGEI DTIRESSVQTLMNGAVGAGVTWNIDRRILKRVNVELDAAGYYQQKGELWPYRKGIGVYSS AFVDLGNFRVKMGHWICNDFITMFGIPYFGTISTKKEGITYDKPQTLFCSIEYSRMFGKH YALGLKADAYQFFPGTMRSANGELTSPGSTTSFSVGVYFRINPSFLLKKF >gi|226332036|gb|ACIB01000020.1| GENE 148 200594 - 202429 214 611 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein [Acinetobacter baumannii AYE] # 370 607 1 229 311 87 28 7e-16 MKEFFQLMRRFVSPYKKFLGWAVFLNLLSAVFNIFSFTLLIPILQILFKMDNKVYEFIPW DAAGEGLKDIAVNNFYYYVTRMIEINGPSLTLLFLGLFLAFMTLLKTSCYFASSAVMIPL RTGVVRDIRIMVYSKVMSLPLGFFSEERKGDIIARMSGDVGEVENSITSSLDMLIKNPIL IVMYFGTLIITSWQLTLFTLLVVPGMGWIMGKVGKKLKRQSLEAQAKWSDTMSQLEETLG GLRIIKAFIAEQKMINRFTECSNEFRDATNRVAMRQALAHPMSEFLGTLLIVVVLWFGGS LILGNHSSIDAPTFIFYMVILYSVINPLKEFSKAGYNIPKGLASMERVDKILKAENKIVE IPNPKPLNGLEEQVEFKDISFSYDGKKEVLQHINLTVPKGKTVALVGQSGSGKSTLVDLL PRYHDVQEGTITIDGVSIKDVRISDLRSLIGNVNQEAILFNDTFFNNIAFGVENATMEQV IEAAKIANAHDFIMEKEDGYHTNIGDRGSKLSGGQRQRISIARAILKNPPILILDEATSA LDTESERLVQEALERLMKTRTTIAIAHRLSTIKNADEICVLYEGEIVERGKHEELLAKNG YYKRLNDMQSL >gi|226332036|gb|ACIB01000020.1| GENE 149 202849 - 203919 490 356 aa, chain + ## HITS:1 COG:SP1771_1 KEGG:ns NR:ns ## COG: SP1771_1 COG0463 # Protein_GI_number: 15901601 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 10 218 7 223 259 165 40.0 1e-40 MEVNKPLPLISIIVPIYNIAEYASECIQSLINQTYKNIEIILIDDGSTDHSPVICDEFAE QDERIKVIHKRNGGLSDARNAGLDVATGEYIGFVDGDDWVDEDMYETLYHLIYEHQADIS ICTHYTELPNRTKVKYKSKKTKIFSSQKAIATLIEDKIIQNYIWEKLFKRELFTELRFPV GWSFEDIALCYKIFHKARKIVLLQTPKYHYRTRPGSITNSTRNPLKEFQYLQALHEQFQF AAENNIKVRKPKKLVQKTFHFINHIIILPPSSLKKKYINDAFEIAHTYDYLRNWEIGVAA TLRRFFVYNYFNAYASVIITYRKYIPARTVKSTTEFFFVRRVATSLATAMRSILYI >gi|226332036|gb|ACIB01000020.1| GENE 150 204040 - 
204843 394 267 aa, chain + ## HITS:1 COG:Cj1135 KEGG:ns NR:ns ## COG: Cj1135 COG0463 # Protein_GI_number: 15792460 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Campylobacter jejuni # 5 251 260 511 515 171 40.0 2e-42 MKTTLIISTYNRPEALSVCLDSVRFQTVMPDEVIVGDDGSTSETKDLIESFKKDFPVPLI HLWQEDKGFRLAMMRNKSVAAATGDYIIEIDGDIFLHNKFVEDHKRLAKPGHYLRGTRVN LGQKLTEEICKSKVNRRIYPWTIGIQNRAETAIHSTPVSNFFADRYKKNVSSGLGCNMSF WRSDFLAINGYDEFFEGWGKEDDDLTHRLQRKGCKKRSLRFAGIVYHLWHGHESMESDQK NAEYFRKNNEKNIVYCENGVSKYLKQE >gi|226332036|gb|ACIB01000020.1| GENE 151 204846 - 205868 554 340 aa, chain + ## HITS:1 COG:no KEGG:BF0185 NR:ns ## KEGG: BF0185 # Name: not_defined # Def: putative lipopolysaccharide core biosynthesis protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 340 1 340 340 696 100.0 0 MGDSLVYKLRSGKPPKFIYFAYDMLKMAIPSYFYRMQLKRTIAQLSKRPDRAYIEERVSY YNRLSGNNHPIPQGTHVENKIRYLIYRGKLGNYKMSYFHKAYFFDAREYTRWFSPDLRWQ YCPGDVYFTPDSPTIVKSRLLAGDNQNSVILKLDALRHFMFVNDKRSFTTKKDCAIFRGK IRDSRIRTQFIKMYINHPLCDCGVVGHETGIPQEWMVAKKTIREHLEYKFILSLEGNDVA SNLKWVMSSNSIAVMTRPTCETWFMEGKLIPDYHYIEIKNDFSDFEEKLTYYINHPEKAQ QIIDHAHEYIKQFQNKKRERLISLLVLDKYFKATGQSGEM >gi|226332036|gb|ACIB01000020.1| GENE 152 205896 - 206831 567 311 aa, chain + ## HITS:1 COG:BS_gspA KEGG:ns NR:ns ## COG: BS_gspA COG1442 # Protein_GI_number: 16080894 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases # Organism: Bacillus subtilis # 1 270 6 276 286 126 29.0 5e-29 MIHIACNIDSNFTIHCAVTLTSLFANNRNSEFCVHIIASTLPEADQKALSSIAESYGNKI CFYFPEKDLLNNFSIKKSGNRISIATYYRCLLSRILPVNIDKILYIDCDIVVLNDISEFW NTDITQYAIGCIEDIGSDEEEYYSRLQYDKKYSYFNAGVLLINLKYWREHKIDEMCEQYF LAHSDRIRFNDQDLLNALLYKDKLFVPFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAI LHYTNKKPWNYDSMHPLKQEYFKYLDMTPWKGTRPIIDFQTRVITGFKRLLYITGIKKSK YINLKDYELAQ >gi|226332036|gb|ACIB01000020.1| GENE 153 206800 - 207513 506 237 aa, chain - ## HITS:1 COG:no 
KEGG:BF0183 NR:ns ## KEGG: BF0183 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 237 1 237 237 489 100.0 1e-137 MTKRLTKIGVNPDFQELSSFVHELPTVFETGGKVIYKGRNELKEFDVEGKKLIVKSYQLP HLLNRIIYNFFRASKAKRSYSYALMLRKLGIGSPAPVGYYSTGSWLLFGRSYFVCLKSDC PYTYRDFEKTVFPNQEQILRAIARTTAMLHENGLLHKDYSAGNILFRTIDEKVEVEIIDL NRMRFGNVGIEAGCKNFERLPGTHEMFAILAEEYAKARGFDVQTCLELIEQAHSLSD >gi|226332036|gb|ACIB01000020.1| GENE 154 207518 - 208234 534 238 aa, chain - ## HITS:1 COG:no KEGG:BF0182 NR:ns ## KEGG: BF0182 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 238 1 238 238 478 100.0 1e-134 MKCIHIITPVKDSIELTLQTAEAILKSDFTVPFHYTIYNDFSTDENTKQLKEASLKMGFE LVNLSEITSHPSPNYLLVLQMAQEKAIAAEAGLLIVESDVIVKKHTLQSLFDGAQARKDC GIAAAVTVDEHEAINYPYLYAKGKENQVFPEKKHLSFCCSLLTTDFLRAFDFHSLNPEKN WFDVTISHQALEKGFVNYLFTTLTVWHRPHSSRPWKQLKYTNPLKYYWLKFTKGLDKI >gi|226332036|gb|ACIB01000020.1| GENE 155 208367 - 209413 735 348 aa, chain + ## HITS:1 COG:STM2370 KEGG:ns NR:ns ## COG: STM2370 COG0111 # Protein_GI_number: 16765697 # Func_class: H Coenzyme transport and metabolism; E Amino acid transport and metabolism # Function: Phosphoglycerate dehydrogenase and related dehydrogenases # Organism: Salmonella typhimurium LT2 # 1 338 1 348 378 273 41.0 3e-73 MKVIVDNKIPYIREAIEQIADEVIYAPGKDFTPELVQDADALIIRTRTRCDRSLLAGSKV KFIATATIGFDHIDTAYCREAGITWTNAPGCNSASVAQYIQSALFILQQTRGMKLNQMTI GIVGVGNVGSKVADVARKLGIQVMLNDLPREEREESTMFASLKSIAEKCDIITFHVPLYK EGKYKTYHLADKHFFHSLKKGAVIMNTSRGEVIETEALLEALRSGILSDAVIDVWEHEPD IDLELLEKVIIGTPHIAGYSADGKANATRMSLEALCRFFRIETDYRITPPEPKNKLISTA TYEEASLMIYDPRRDSDALKSHPGLFEQLRGDYPLRREEGAYRIVITK >gi|226332036|gb|ACIB01000020.1| GENE 156 209451 - 210074 439 207 aa, chain - ## HITS:1 COG:MA0316 KEGG:ns NR:ns ## COG: MA0316 COG0299 # Protein_GI_number: 20089214 # Func_class: F Nucleotide transport and metabolism # Function: Folate-dependent phosphoribosylglycinamide formyltransferase PurN # Organism: Methanosarcina 
acetivorans str.C2A # 21 204 10 202 204 134 40.0 2e-31 MQSFAHFSLFCALKGLIMGKNIAIFASGSGTNAENIIRYFEKNASVRVRLVLSNRKDAYV LERACRLGVPYRAFPKSDWEAAESILDLLRKYQIDFIVLAGFLLRIPDALLHAYPDKIIN IHPALLPKFGGKGMYGDRVHEAVVMAGESESGITIHYIDEHYDEGSTVFQAKCPVLPGDT PADVAKKVHALEYEWFPKIIERVVNSL >gi|226332036|gb|ACIB01000020.1| GENE 157 210173 - 210409 385 78 aa, chain + ## HITS:1 COG:SMc00573 KEGG:ns NR:ns ## COG: SMc00573 COG0236 # Protein_GI_number: 15964896 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: Acyl carrier protein # Organism: Sinorhizobium meliloti # 1 75 1 75 78 76 64.0 1e-14 MSEIASRVKAIIVDKLGVEESEVTETASFTNDLGADSLDTVELIMEFEKEFGISIPDDQA EKIGTVQDAVAYIEEHAK >gi|226332036|gb|ACIB01000020.1| GENE 158 210425 - 211687 1306 420 aa, chain + ## HITS:1 COG:BS_yjaY KEGG:ns NR:ns ## COG: BS_yjaY COG0304 # Protein_GI_number: 16078199 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: 3-oxoacyl-(acyl-carrier-protein) synthase # Organism: Bacillus subtilis # 1 418 1 411 413 430 54.0 1e-120 MELKRVVVTGLGAITPVGNNVPEFWENLVNGVSGAGPITHFDASQFKTQFACEVKGFDAT QYIDRKEARKMDLYTQYAVAVAKEAVADSGLDIENEDLNRIGVIFGAGIGGIRTFEEETS NYALHKENGPKYNPFFIPKMISDIAAGQISIMYGFHGPNYATCSACATSTNAIADAFNLI RLGKANVIVSGGSEAAIAAAGVGGFNAMHALSTRNDEPQSASRPFSASRDGFVMGEGGGC LILEELEHAKARGAKIYAEVAGVGMSADAHHLTASHPEGLGAKLVMKNALEDAEMSPEEV DYINVHGTSTPVGDISEAKAIKEVFGEHAFELNISSTKSMTGHLLGAAGAVESIASILAI KNGIVPPTINHAEGDNDENIDYNLNFTFNKAQKREINVALSNTFGFGGHNACVIFKKYAE >gi|226332036|gb|ACIB01000020.1| GENE 159 211692 - 212564 660 290 aa, chain + ## HITS:1 COG:SA1076 KEGG:ns NR:ns ## COG: SA1076 COG0571 # Protein_GI_number: 15926816 # Func_class: K Transcription # Function: dsRNA-specific ribonuclease # Organism: Staphylococcus aureus N315 # 27 241 24 240 243 108 36.0 1e-23 MLRNKIDKIRLLFRKDRESYSCFYRILGFYPRNIRLYEQALLHKSTAVRSEKGRPLNNER LEFLGDAILDAIVGDIVYQHFEGKREGFLTNTRSKIVQRETLNKLAVEIGLDKLIKYSTR 
SSSHNSYMYGNAFEAFIGAIYLDRGYECCKQFMERRIIEPYIDLDKLSRKEVNFKSKLIE WSQKNKMEVSFELIEQSLDKENNPVFQTEVRIEGILGGSGTGYSKKESQQNAAQMTLKKI KGDPEFMASVQEAKTQNNVPAEDTTPESETSLTAENQQIDEIISTEEISV >gi|226332036|gb|ACIB01000020.1| GENE 160 212583 - 213593 926 336 aa, chain - ## HITS:1 COG:Cgl1221 KEGG:ns NR:ns ## COG: Cgl1221 COG0205 # Protein_GI_number: 19552471 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphofructokinase # Organism: Corynebacterium glutamicum # 1 328 4 343 346 275 44.0 1e-73 MKIGILTSGGDCPGINATIRGVCKTAINHYGMEVVGIHSGFQGLLTKEVESFTEKSLSGL LNLGGTMLGTSREKPFRKQGIISDVDKPALIQRNIAELGLDCVVCIGGNGTQKTAAKFAA MGINIVSVPKTIDNDIWGTDISFGFDSAVSIATDAIDRLHSTASSHKRVMVIEVMGHKAG WIALYSGMAGGGDVILVPEIPYNIKNIGDTILNRLKKGKPYSIVVVAEGIQTDGRKRAAE YIAQEIEYETGIETRETVLGYIQRGGSPTPFDRNLSTRMGGHATELIANGQFGRMIALKG DEISSVALEEVAGKLKLVTEEHDLVVQGRRMGICFG >gi|226332036|gb|ACIB01000020.1| GENE 161 213763 - 215274 1223 503 aa, chain + ## HITS:1 COG:no KEGG:BF0216 NR:ns ## KEGG: BF0216 # Name: not_defined # Def: putative auxin-regulated protein # Organism: B.fragilis # Pathway: not_defined # 1 503 1 503 503 1036 100.0 0 MNITKIISKVFDSRLKAIDLYDTQAGEIQHRVLTRLVKQAENTEWGKKYDYKSIRNYEDF KNRLPIQTYEEVKPYVERLRAGEQNLLWPSEIRWFAKSSGTTNDKSKFLPVSKEALEDIH YRGGKDAAAIYFRMNPESRFFSGKGLILGGSHSPNLNSNHSLVGDLSAILIQNVSPLINL IRVPSKQIALMDEWEAKIEAIANSTIPVDVTNLSGVPSWMLVLIKRILEKTGKQTLEEVW PNLEVFFHGGVAFTPYREQYRQVIHSSKMHYVETYNASEGYFGTQNDLSDPSMLLMIDYG VFYEFIPLEDVEKENPRTYCLEEVELNKNYAMVISTSCGLWRYMIGDTVKFTRKNPYKFV ITGRTKHFINAFGEELIVDNAEKGLAKACAETGAQVSEYSAAPVFMDANAKCRHQWLIEF AKMPDSIEKFAMILDATLKEVNSDYEAKRWKDIALQPLEVIVARKGLFHDWLAKKGKLGG QHKVPRLSNTRDYIEEMIALNER >gi|226332036|gb|ACIB01000020.1| GENE 162 215334 - 216428 663 364 aa, chain - ## HITS:1 COG:CAC2233 KEGG:ns NR:ns ## COG: CAC2233 COG0482 # Protein_GI_number: 15895501 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain # Organism: Clostridium 
acetobutylicum # 5 358 2 354 355 229 37.0 7e-60 MMEKNKRVLLGMSGGTDSSVAAMLLLEAGYEVTGVTFRFYEFNGSTEYLEDARALAARLG IGHITYDARKVFQEQIIDYFIDEYMSGHTPVPCTLCNNQLKWPLLAKIADEMGIFYLATG HYVRKQWVDGNYYIAPAEDVDKDQSFFLWGLKQEILQRMLLPMGGMTKSEARAYAAGRGF EKVSKKKDSIGVCFCPLDYRSFLKKCLCDESGDKNRNIYRKVERGRFLDESGNFIAWHEG YPFYTIGQRRGLGIQLNRAVFVKEIHPETNEVVLASLKSLEKSEMWLKDWNIVDESRLLG CDDVIVKIRYRKQENHCSVTITPEGLLHIRLHEPLSAIAEGQAAAFYKDGLLLGGGIITM SDQR >gi|226332036|gb|ACIB01000020.1| GENE 163 216480 - 219530 1605 1016 aa, chain + ## HITS:1 COG:MA0189 KEGG:ns NR:ns ## COG: MA0189 COG0553 # Protein_GI_number: 20089087 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Methanosarcina acetivorans str.C2A # 458 1015 555 1070 1078 296 34.0 2e-79 MKEALTTNQVVIVLVEHPVLGLLLVPYTVGRALDNTLEVIEQAFHASPDALKKMNEAEQK AIDIASHYTEKYLMGVYSREKTVPKFLRKLTEDSNKLKQQIRPFIEKKLLEMLELICNGQ LPFYQKPSGSKQLYEHHAYRVHPHNLKTHFSFKVTEEHFSYQLQCYDDDTPVSLMEQKPV VVLTSNPATLLLGMDLYTFSHIEASRLLPFTKKERISADASLTEKYIDNIIIPLARYHDI SIQGLKVVREKRPCNAYLYLEDTIYNDTLLRLDFRYGEQSFSPQPSDETRKFVFREQEEE EIVIHYFQRNSTAERKAVHLLQKAGLQCISDSHFKLSSAAPEKNITEWISHHRQMLLEEF VLSSDTQNKPYYLPEIRIEQSCEDGPDWFDLHITVVIGNQRIPFSRFRKNILEGNREYIL PDGRIVLLPEEWFSKYANLLEAGKESDKTIRLKRPFIGVIESILEKDRQSTSIKTLLSKE IPVPIGLKANLRSYQQKGFSWLANLYLEGFGGCLADDMGLGKTLQTLALLQYVYKPGNTT EAIRETIDLKKAESTSDYLPQKQVFFDEKGQFSLFPMQSKEEENSRIAPQVPQIPEPVQK QNRISPLHGTLIVVPTSLLHNWKREASRFTNLSMMEYNGSSPNEITRLKKYFDRYHLIFT TYGTMRNNIATLSQYTFECIVLDESQNIKNSESLTFRSAIQLRSKHRLILTGTPIENSLK DLWAQFHFLQPELLGNETTFSKHFINAIRQGDERMKDRLRQLITPFILRRSKQEVTPELP SLTEEVVYCDMTERQNELYQHEKNSLRNILLEQTAEKGQQSFTVLNGILRLRQLSCHPQL VLPDFIGDSGKLYQIIETFETLRSEGHKVLIFSSFVKHLELVAGEFRKRKWDYAFLTGSS TNRPEEIARFNRDPKIQAFLISLKAGGVGLNLTQADYVFIIDPWWNPAAESQAIARAHRI GQNNQVIAYRFITQGSIEEKIIQLQEEKRKLAETFITDTEQLPALTNREWARLLGS >gi|226332036|gb|ACIB01000020.1| GENE 164 219577 - 220374 747 265 aa, chain - ## HITS:1 COG:no KEGG:BF0213 NR:ns ## KEGG: BF0213 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 265 1 265 265 497 100.0 1e-139 MKTRALRSYLCFTLLATAFLLGLAGCDKEESESLRIGDDSFNNIENGVWTAYYPNTNQTS ITIYGGVKPYTVSSNSDILKVNMDKLSDAFNYETLGVGDAEVTITDAKGESVGLKVKIDY WSDKMKIVKLDAYVKGDKMTVAAQKELKEKALASIPVKAGGGYQFIYTKDQGGIVYVYPD KYGEKYKEGTFTRSSLAVGNSSYRKYEIKLDGMERTYIVQRYYPSKTRSVAMVPYGFYED LLDQFTDDYPEVESVYTMQVVSAVF >gi|226332036|gb|ACIB01000020.1| GENE 165 220595 - 221653 420 352 aa, chain - ## HITS:1 COG:no KEGG:BF0171 NR:ns ## KEGG: BF0171 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 352 10 361 361 740 99.0 0 MLFGSVLRLCAQTCGATYITEFQYDLKKRTNWCNLLRLDAYVPIGTKGILEFASIHVYKT RPERIINDLQTFSNIEEDNLPCAIAVLGYTRLIGNITLFAGIRNLNEDYFTTPCMSLFTN SSCGIFPTLSANYPIANYPLAALCLDYKMTLGRFGIESSLYNGKGYNGWSKGKHPFTFNP RKDGVFSITEINYQTEYGKCFGGFSLHTNGDMPDVAGEWKTREREKKVSPKMTFAWWGYA ERKLWSRVRQEVNLLVQYTRTSSVFSECRDYMGAGVTWIYVPGGQKRHEAGLFLSAAQFK SCNEVAGEVTYRYSFNRDTYIQPAIHLIKNGGGLHEVFLIRMGYILNGGRVR >gi|226332036|gb|ACIB01000020.1| GENE 166 221745 - 224396 1584 883 aa, chain - ## HITS:1 COG:PA4825 KEGG:ns NR:ns ## COG: PA4825 COG0474 # Protein_GI_number: 15600018 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Pseudomonas aeruginosa # 41 883 45 903 903 949 53.0 0 MIWRKKEKQKAQYQFNSERVFLVATQPAKTAYSYFQTSSVGLSEEEIGQRQSAYGKNEIS REQKKNPLVLFIRTFINPFIGVLTALAVISLVIDVVMARPEDREWTAVLIIISMVVCSAI LRFWQEWKASEATDSLMKMVKNTCFVKRFSGSEEVDITELVPGDVVCLAAGDMIPADIRI IESKDLFVSQASLTGESDPVEKFPEINGRRHSHGSVIELDNICYMGSTVISGSAKGIVFG TGNDTYLGTIARSLVGERATTAFDKGISKVSFLLIRFMLVMVPFVFFVNGFTKGDWFEAF IFALSVAVGLTPEMLPMIVTANLSKGAVSMSKKKTVVKNLNAIQNFGAMNILCTDKTGTL TCDKIVLEKYINADGSDDHSRRILRHAFLNSYFQTGLKNLMDKAILAHVREENLEHLTEG YTKIDEIPFDFSRRRMSVVIEDQQGKRQIITKGAVEEMLNICSHAEFNGQVYELTDKLRS KAKRISDDMNRNGMRVLAIAQKSFISKARDFAVTDEDEMVLIGYLAFLDPPKPSSAEAIR QLREYGIEVKILSGDNDVIVNAIARQIGIDTCHSVTGVELEGKDGEELREIVGQATLFSR LTPLQKSEIIMILQQNGNTVGFLGDGVNDAGALRQSDIGISVDSAVDIAKESADIILLDK 
DLSVLKEGVLEGRKTFGNITKYIKMTASSNFGNMFSVMFASAFLPFLPMLPIHLLIQNLL YDISQTTIPFDRMDAEFLKQPQKWDASDLSRFMIYIGPISSVFDIATYCLMWYVFACNSP EHQTLFQSGWFVEGLLSQTLIVHMIRTRKIPFFQSRATWPVLGLTFLIMAMGIAIPFTSF GLSIGLEPLPLSYFPWLVLILVSYCVLTQFMKSWYIRRFSKWL >gi|226332036|gb|ACIB01000020.1| GENE 167 224556 - 225923 1078 455 aa, chain - ## HITS:1 COG:RSp1043 KEGG:ns NR:ns ## COG: RSp1043 COG0642 # Protein_GI_number: 17549264 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Ralstonia solanacearum # 149 452 158 462 466 132 29.0 1e-30 MKIRSILTIKYAGITATIFLVFMATIYCVNEHLRSDSFYRSLRSEAITKAHLFLNNQVDV ETMQSIYLNNRQFIDEVEVAVYTPDFRILYHDALHNDIIKETPGMIDEIVRKKEIDFRTG DYQGIGILYEFGGKNYVVTAAAYDRYGHINQVIMTRLLLLLSIGGLSVLVIVGYMLAKSS LAPIRSIVRKAEGITVTQIDERLPVKNEHDELGELALAFNALLDRVEKTFNDQRMFVSNV SHELRTPMAALSAELDLALQKERTVTQYQNSIHNALQDSQRVIELIDGLLNLAKADYYPE QIKKEEIRLDELLLDANELVLKAHPDYHIELIFEQEADDDRVLTVIGNPYLLTTAFVNLI ENNCKYSDNRTSFIQISFCDQWTIVSLSDNGAGMSETDKENLFKLFYRGENKNQAQGHGI GMTLTKKILTLHKSEISVYSHQGEGTTFVVRFHHL >gi|226332036|gb|ACIB01000020.1| GENE 168 225920 - 226606 782 228 aa, chain - ## HITS:1 COG:ECs0609 KEGG:ns NR:ns ## COG: ECs0609 COG0745 # Protein_GI_number: 15829863 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Escherichia coli O157:H7 # 4 224 3 221 227 183 42.0 3e-46 MYSILIIEDEQRVADLLRAGLEENGYNCLVAYDGAMGLRMFRANTFDLVISDIVLPKMDG FELCKEIRAANPAIPILMLTALGSTDDKLDGFDAGADDYMVKPFDFRELYARIRVLLKRK LAVVTDVEEELNYADLSVNLLDKSVKRAGRDIKLSPKEYNLLVYMIENAEKVVSRMDIAD KVWNTHFDTGTNFIDVYINYLRKKIDRDFDTKLIHTKTGMGFILTDKL >gi|226332036|gb|ACIB01000020.1| GENE 169 226963 - 228243 500 426 aa, chain - ## HITS:1 COG:no KEGG:BF0208 NR:ns ## KEGG: BF0208 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 426 1 426 426 851 99.0 0 MRSLFLIFFFCYVTGKAQDRACVVSEKDSIPLSNVYICLKDRRVIAISDEKGVFSLEKYD 
SLSLNDTLYFSHINYLHKKLSYGDLIKNRCTVFLIENNRVLEEVSIFSNRHLNRFLHYEI LSPLKRGVYSFASVLVDGQIYIVGGSTSCGPFQSNIRSTLFWEKYSNMMYRYDIKQDKWE TIRHKFRERAYHTAGYYDGKIFILGGKRLSETRIVDYLDNAIEIYDIKRDTVWTDYTNPH QATLLGSVVYKDNMIVLGDVKKVLQNNEGVYSDEMHLWNLKSGYWYELGKMPIRQAPETI LVDHCIYLIGNRNGGGWSIECYNLLTGAWANAGRLLYRLGWPSLAYHNDIIYIFEQGVVQ TFNIKSRQVRSYMIDLKLKSPALYYFDGKLYILGGFYMGDPSRNVYSVDLKEFDKTEVDY YYNNVR >gi|226332036|gb|ACIB01000020.1| GENE 170 228411 - 229274 341 287 aa, chain - ## HITS:1 COG:no KEGG:BF0403 NR:ns ## KEGG: BF0403 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 286 5 290 295 301 48.0 2e-80 MRILYVLFFLLTLNSCQKVKEKEVVRLMKEWNRKEVLFPDDLLFTVLEKDTLKEIPKSDY TIITYVDSIGCTSCKLKLQNWLLLERKLRSITNKRVSCLFVIHPKSKKEVNYILQENHFD CPVCIDDSDIFNKLNKFPANIMYQTFLLDAGNKVVAIGNPIHNDRIKKLYLDIISGNKMT SKKGSMQTEITVSETLFDFGKISFKESQKCIFTLQNTGKALLVIDDVNTSCGCTSVQYSK EPVKPGESLRITVIYKADHPEHFRKTITIYCNVPTSPLQLKITGNAE >gi|226332036|gb|ACIB01000020.1| GENE 171 229332 - 230399 427 355 aa, chain - ## HITS:1 COG:no KEGG:BF0467 NR:ns ## KEGG: BF0467 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 13 355 17 358 358 285 45.0 2e-75 MCEIRFLLFILFMGCLCACGRVDLKSPKTLFSSVSEISEVEWSIEADSLARVEGIQCNDS VLLVFDFYSGKSYSLFDIYSGKMISRLGSIGQGRDEIPLGVFGYIENNRFYIYYDQTGYI GKFNLDSCSVILDCPPVCLAKYKIPGAQISKIAVVNDSLFLGAGTYNAEYQYLLFDNKSN VIDSAVTIYNSNEANLNIYHKFLSNQGRLRKRPGKSQFVYSINNSSNIDFFEIKNNKINL IKSLRFNNPEYTPTQDGDYSRVLFSDNNVIGYIDIGVTDKYVYTLYTNKKICDNNTYNDL SSTTVLVFDWNGNPVKQYELSKEAYYITIDKTLQRMYAVVRKPDMGWTIVCYAID >gi|226332036|gb|ACIB01000020.1| GENE 172 230649 - 230888 325 79 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564445|ref|ZP_04841902.1| ## NR: gi|253564445|ref|ZP_04841902.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 79 1 79 79 153 100.0 4e-36 MKKYIYAFFFASLIIVALGLKAKIKSDTEFSELTLSNIEALGMEETDGAKDYWCCGNEDV CAEGPHYKIKGKLKENPCK >gi|226332036|gb|ACIB01000020.1| GENE 173 230903 - 231172 289 89 aa, chain - ## HITS:1 COG:no KEGG:BF0057 NR:ns ## KEGG: BF0057 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 67 1 78 83 76 61.0 2e-13 MGKKIFATLIVAVVATFAGYNIYQSQRAKNTMSELAMANVEALADINETDSSGQTLYCCG NEDTCAKGEDEDTGEEFIIHGVLSSKKCK >gi|226332036|gb|ACIB01000020.1| GENE 174 231405 - 231626 195 73 aa, chain - ## HITS:1 COG:no KEGG:BF0051 NR:ns ## KEGG: BF0051 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 73 1 73 77 72 50.0 4e-12 MKKFLKAIGCFAIFVLAVFSYFREQPYKLDSLSLQNVEALAEGEEYTHISCIGVGSLDCP VNHSGVKYIFKGY >gi|226332036|gb|ACIB01000020.1| GENE 175 231750 - 233141 912 463 aa, chain - ## HITS:1 COG:no KEGG:BF0207 NR:ns ## KEGG: BF0207 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 62 208 1 147 175 258 86.0 3e-67 MRPIKLIWGIGGTLLVALLIGCYFYQRSNEMMRVESKNLFIQALKGEMLRMERELDLSCV SMNVSGATALWTRLSVKTEFGTKDYVVDVKKDLKNISSDFNERSLHSVVCMEKQLSADTL NRIWADSLRMRHIVAKTSVQVVSADSSKCIRSEGSCDNNCFVSPIWIAYVGNECEVEVAG FLSSTCWSVIRYSSSSFVLIVGVTFILFAFFYYVYKVKKHLSDSEVEEELLKERLEKERE RYQDLEEKRRQYEQQLNLLQQEEKMRIRELSDLREKHKQKEAQIEVLSVEKKRSKEQQSI LEKECQEGMLQITMLSDKLEKEKIDRGELQQKLAACELQIAELRKIKEMLGEELVIYKLC SGLTFDPRNHILDCAGNRIKLDAQGSRLLLLFLEASENKLSYEELLEMMWADRRKDKNRL WTAVSRLRKALAPTRVIIKVKNEGGYQLVLPKESDDFSQRELF >gi|226332036|gb|ACIB01000020.1| GENE 176 233359 - 234195 815 278 aa, chain + ## HITS:1 COG:VC1364 KEGG:ns NR:ns ## COG: VC1364 COG0561 # Protein_GI_number: 15641376 # Func_class: R General function prediction only # Function: Predicted hydrolases of the HAD superfamily # Organism: Vibrio cholerae # 3 266 2 265 273 179 36.0 4e-45 MKYKLLVLDLDGTLTNAKKEITPRNREALIRVQQQGVKLILASGRPTFGIAPLADELRMK EFGGFILSYNGGEIIDWSTGEIVYANVLPDEVIPRLYECATRNQLPILTYDRQYIITEYP 
DDVYVRKEAFLNKMQIYPSKDFLKDIRLPLPKCLIVGEPHRLIPIEAELSVELQGQLSVY RSEPFFLELVPQGIDKAQSLSVLLNKLNMNREEIVAVGDGYNDLSMIQFAGLGVAMGNAQ EPVKKAADYITLSNEEDGVAAVVNKFFTKAPEKGETTA >gi|226332036|gb|ACIB01000020.1| GENE 177 234653 - 236128 1791 491 aa, chain + ## HITS:1 COG:DR1670 KEGG:ns NR:ns ## COG: DR1670 COG0215 # Protein_GI_number: 15806673 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Cysteinyl-tRNA synthetase # Organism: Deinococcus radiodurans # 5 488 52 531 532 451 48.0 1e-126 MEHQLTIYNTLDRKKELFVPLHAPHVGMYVCGPTVYGDAHLGHARPSITFDVLFRYLTRL GYKVRYVRNITDVGHLEHDADEGEDKIAKKARLEQLEPMEVVQYYLNRYHKAMEALNVLP PSIEPHASGHIIEQIELVKKILDAGYAYESQGSVYFDVAKYNKDYHYGKLSGRNLDDVLN TTRELDGQEEKHNPADFALWKRAQPEHIMRWPSPWGDGFPGWHAECTAMGRKYLGEHFDI HGGGMDLIFPHHECEIAQSVASQGDDMVHYWMHNNMITINGTKMGKSLGNFITLDEFFSG SHKLLTQAYSPMTIRFFILQAHYRSPVDFSNEALQAAEKGLSRLMEAVDSLEKITPAATS NVDVKSLRTKCFEAMNDDLNTPIVISHLFDGAKMINNIIAGNNTISADDLKDLKEVFHTF CFDILGLKEEIGSSDGREAAYGKVVDMLLEQRVKAKANKDWATSDLIRNELTALGFEIKD TKDGFEWKLNK >gi|226332036|gb|ACIB01000020.1| GENE 178 236239 - 236643 479 134 aa, chain + ## HITS:1 COG:MA0735 KEGG:ns NR:ns ## COG: MA0735 COG2050 # Protein_GI_number: 20089620 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Uncharacterized protein, possibly involved in aromatic compounds catabolism # Organism: Methanosarcina acetivorans str.C2A # 4 133 16 144 146 116 50.0 1e-26 MTAQEFFKNDLFATNAGVELIEIREGYSKAKLEIKPEHLNAGQRTQGGAIFTLADLALAA AANSHGTLAFSLSSNITFLRASGPGDTLYAEARERYTGRSTGYYQIDVTDQEGRLIATFE SSVFRKKDEVPFTL >gi|226332036|gb|ACIB01000020.1| GENE 179 236633 - 237511 637 292 aa, chain - ## HITS:1 COG:no KEGG:BF0194 NR:ns ## KEGG: BF0194 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 292 1 292 292 570 99.0 1e-161 MKTRRKRPEIVKTQTVAAAIRRKEWICLIIALLFAFPSSGNAQCEAKNDAFKSGEHVMYE LYFNWKFIWKKVGLASLTTNSTTYHSEPAYRVNLLAISSKEADFFFKMRDTLTSVMTEKL EPRYFRKGAEEGKRYTVDEARFSFREGMCYVNQKRVRKDGSITETEQSDNRCIYDMLTIL 
AQARSFDPKEYTIGQRIQFPMATGRRVEEQTLIYRGIKKITAENDTTYRCLIFSLVEYNK KGKEKEVITFYVTDDRNHLPVRLDMHLNFGSAKAFLKSVSGYRHPQTSIVTK >gi|226332036|gb|ACIB01000020.1| GENE 180 237450 - 239369 1573 639 aa, chain - ## HITS:1 COG:no KEGG:BF0193 NR:ns ## KEGG: BF0193 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 639 1 639 639 1231 100.0 0 MDKAYFCSMLKDNIYRKKRLIRSLLGVAALVATLYSCASMGRPDGGPFDETPPRFIGSTP AAGAVNTKKSKIVLDFDEFIKLEKASEKVVVSPPQLQQPEIKPGGKRITVNLLDSLKPNT TYTIDFSDAIVDNNEGNPLGNFAFTFSTGASIDTMEVSGTLLEASDLEPIKGMLVGLHSN LNDSAFTKLPFDRVARTDSRGHFTIRGIAPGKYRIFGLMDADQNFFYNQKGEAVAFNDSL IIPRFEERIRQDTAWVDSLTIDTIVEQKYTYFLPDNIVLRSFKKPSVSQYLVKSERLTPN KFSLYFSAPADSLPVLKGLNFDEKDAFVIEKTFRNDTIHYWIRDSLLYQQDTLTLSLNYL YTDTLNQLVPRTDTLRLAAKKVKKEEPKKKKKKDDEPEPTKFLSVNTHAPSSMDVFDYIT MTFEEPVARFDSAAIHLRQKVDTIWTDVPFEFEHDSLDVRRYNLYYDWEPGGEYEFAVDS TAFHGIYGLFTDKIKQAFKVRQIEEYGNVFLNITGADSIAFVELLDNQDKVLRRRPVIDG RAEFYYLNPGKYGARLVNDTNGNGVWDAGDYEKGIQPEMVYYYPHIIEFKANWDATQDWN VTAVPLDKQKPDELKKQKPDEDKKKKTRDSQNANRSRRN >gi|226332036|gb|ACIB01000020.1| GENE 181 239475 - 242579 2982 1034 aa, chain + ## HITS:1 COG:TM1193 KEGG:ns NR:ns ## COG: TM1193 COG3250 # Protein_GI_number: 15643949 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Thermotoga maritima # 23 1023 7 981 1087 696 39.0 0 MKRQLLTCCLAMCSLATMAQHDEWKNPEINAVNRAPMHTNYFAYSSSEEAAKADKENSSN FMTLNGIWKFNWVKNADARPTDFYRTDYNDKGWGQMKVPGVWEMNGYGDPIYVNVGYAWR SQYKNNPPYVPIENNHVGSYRKEIIIPAEWSGKEIFAHFGSVTSNMYLWVNGKYVGYSED SKLEAEFNLTKYLKPGKNLIAFQVFRWCDGTYLEDQDFFRYSGVGRNCYLYSRNKKYIQD IRVTPDLDSNYTNGTLNVALNLNGSGTVELNLTDPAGKSVATAQVNGNGQKSVVMDVSNP EKWTAETPNLYTLTATLKNGSNTLEVIPVKVGFRKIELKGGQILVNGQPVLFKGADRHEM DPDGGYVVSRERMLQDILRMKQLNINAVRTCHYPDDNLWYDLCDQYGIYVVAEANIESHG MGYGKETLAKNPSYKKAHMERNQRNVQRGYNHPSIIFWSLGNEAGYGPNFEQCYTWIKNE DKTRAVQYEQAGTNEFTDIFCPMYYDYDACKKYSEGNIDKPLIQCEYAHAMGNSQGGFKE YWDLIRKYPKYQGGFIWDFVDQSNHWKNKDGIDIYGYGGDFNKYDASDNNFNDNGLISPD 
RRPNPHAHEVGYFYQSIWTTPGDLSKGEIKVYNENFFRDLSAYYMEWQLLANGEVMQTGV VQDLNVAPQQTATLKLNLNTEKICPCKELLLNVTYKLKAAETLMPAGSTVAYDQLTIRPY TAKALELKNQKASNLDIVVPVIKDNDHNYLIVEGENFIIEFNKHNGYLSRYEADGMQLLN PGAQLTPNFWRAPTDNDYGAGLQHRYAVWKNPGLKLTSLKQSIENEQAIVQAEYEMKAVK GKLFLTYVINNEGAVKVTQKMEAGKEEKVSDMFRFGMQMQMPENFNEVEYYGRGPVENYA DRNHSTLIGKYRQTVAEQFYPYIRPQETGTKTDLRWWRVLNISGNGLQFVGDAPFSASAL NYSIESLDDGVQKDQRHSPEVAKAPFTNLCIDKVQMGLGCVNSWGTLPLEKYRVPYQDYE FSFILTPVRHKVNM >gi|226332036|gb|ACIB01000020.1| GENE 182 242888 - 244042 1077 384 aa, chain + ## HITS:1 COG:ECs4393 KEGG:ns NR:ns ## COG: ECs4393 COG0845 # Protein_GI_number: 15833647 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Escherichia coli O157:H7 # 5 371 8 379 385 171 32.0 2e-42 MKSKIVLFAFCLALLSGCGKKGFNMGGTPECAVETLQPTTVNLKSSYPATIKGKQDIEIR PQVSGFITKLNIDEGSMVKKGQVLFVIDPVQYESAARAAKAAVATAKANVSTQEITVKNK RELNKKNIISDYDLEMAENTLASAKAQLASAEAQLISANQNLAYTRVTSPSDGVAGTIPY RVGSLVSASSPSPLTVISDITQMYVYFSLTEKELLNLIRQDGSQTEFLNSFPAVQLTLAD GTLYADSGKIETVSGVIDQNTGAVSMRATFPNHGHLLRTGGTGNIQIPYSKENVIVIPQK ATYEIQDKKFVYLLQPDNTVKNTEIEILNLNDGQNYVVTAGLKAGDKIVVENVSTLKDGA TIKPLTQQESAERFKAALEERKNQ >gi|226332036|gb|ACIB01000020.1| GENE 183 244067 - 247270 2928 1067 aa, chain + ## HITS:1 COG:BMEI1629 KEGG:ns NR:ns ## COG: BMEI1629 COG0841 # Protein_GI_number: 17987912 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Brucella melitensis # 6 1035 5 1022 1051 790 42.0 0 MKLDRFINRPVLSTVISIVIVILGVLGLLSLPISQYPDIAPPTVRVNTTYQGANAQTVLN SVIAPLEEQINGVENMMYMTSTATNTGEASIEVYFKQGTDPDMAAVNVQNRVAKAQGLLP AEVTKVGVITSKRQTSMLLVFSLYSSDDKYDNEFLENYAKINLVPEVQRVPGVGDAMVLG ADYSMRIWLKPDVMAQYHLMPTDVSAALAEQNIEAAPGSFGEQGKQTFQYTLRYKGRLQS QEEFENIVIRANSDGQVLRLKDIATIELGRLTYGFSNNVNGHPAVTVIVFQTAGSNATAI INDILDLLEKSESTFPPGVKVNISQNANDFLFASIHEVVKTLIEAFILVFIVVYIFLQDL RSTLIPAIAIPVALIGTFFVLYIIGFSINLLTLCAMVLAIAIVVDDAIVVVEGVHAKLDQ GYKSARLASIDAMNELGGAIVSITLVMMSVFIPVSFMTGTSGTFYRQFGLTMAIAIALSA 
VNALTLSPALCAILLKPHDPDAKKKSTLASRFHASFNAAYDTVLKKYKKRVLFFIQKPVL TIGSVVVGFALLIFLMKVTPTGLVPNEDTGTIMAVVDMPPGSSLERTQEVMWQVDSLLAS DPAIESRTMIAGYSFIAGQGPSYGSFICKMKNWDERSIAQRSDFVSGMLYLKAREVIKDA RVLLFAPPMIPGYSVSNGFEMNLQDKTGGSLDKFYEVAQDFITKLQARPEIQSAQTSFNP NFPQYMIDIDAAACKKAGLSPSDILTTLQGYYGGLYSSNFNRFGKMYRVMIQADPNSRTN LESLNSVKVRNGNEMAPITQFMSVKRIYGPDNIKRFNMFTAMTINGSPADGYSSGQAIQA MQEVAEQTLPTGYGYEFSGMTREEQSSSGSTTAMIFVLCFVFVYLLLSAQYESYILPFAV LLSIPFGLAGSFIFAHLMGLANNVLPILGAATNNIYMQIALIMLMGLLAKNAILIVEFAL DRRKMGMSITWAAVLGAGARLRPILMTSLAMVVGLLPLMFAMGVGANGNRALGTAAVGGM FIGMICQIFVVPALFVIFQYLQEKVKPIEWEDIDNTDAETEIEQYAK >gi|226332036|gb|ACIB01000020.1| GENE 184 247287 - 248678 366 463 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 [Campylobacter concisus 13826] # 45 460 33 455 460 145 25 2e-33 MKKQIIGMLCATALLSSCHIYKSYDRPKDIEASGLYRDTVSVADTLVSDTVNFGNLPWRE VFTDPQLQALIEQGLTHNTDLLTAALKVKEAQASLMSARLAYAPSLGLSPQGTISSFDKH AATKTYSLPATASWEIDLFGKLLNAKRGVQVTLLQTKAYRQAVQTQIISGIANTYYTLLM LDRQLDITEQTADIMKRNVETMQAMKDAAMFNTTSAGVEQSKAAYAQVLASIPAIQKSIR EAENAMSMLLAQAPQTIKRGVLEEQQLPEDFSVGVPLQLLSNRPDVKAAEMALAGTYYNA NSARAAFYPQITISGSAGWTNSAGSAIINPGKLLASVLGSLTQPLFYRGANIARLKIAKA QQEEAKLAFQQSLLNAGSEVSNALYQYQSASEKTASRKLQVESSEKASEYTKELFKLGTS TYLEVLSAEQSLLSARLSQVNDTFDRMQAVVSLYQALGGGRED >gi|226332036|gb|ACIB01000020.1| GENE 185 248698 - 249468 701 256 aa, chain + ## HITS:1 COG:PM1996 KEGG:ns NR:ns ## COG: PM1996 COG1043 # Protein_GI_number: 15603861 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Acyl-[acyl carrier protein]--UDP-N-acetylglucosamine O-acyltransferase # Organism: Pasteurella multocida # 1 256 1 262 262 173 38.0 3e-43 MISPLASIAPGAKIGKNVIIQPFAYIEDNVEIGDDCIIMPYASVLNGTRLGKGNKVYQHA VLGAEPQDFHYKGEESSLIIGDNNHIRENVVISRATFGGNATKIGNGNFLMDKVHICHDV QIGDNCVAGIGTTIAGECTLDDCVILSGNVTLHQYCHVGQWTLVQSGCRISKDVPPYSIM AGNPVEYHGVNAVVLQQHKNTSERVLRHIANAYRLIYQGNFSLQDAVQKIIDQVPMSEEI ENIVAFVKESKRGIVK >gi|226332036|gb|ACIB01000020.1| GENE 186 249570 - 250211 687 213 aa, 
chain - ## HITS:1 COG:no KEGG:BF0187 NR:ns ## KEGG: BF0187 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 213 1 213 213 451 100.0 1e-125 MKVKKISIANVEVDALPELLDKEKIGFQPIDNVNWEAYPYRPKVEFRIAHSDDAVLLHFN VKEASVRAKYGEDDGSVWTDSCVEFFSVPTGDGIYYNIECNCIGTILIGAGAERNNRERA SREVTDQVKRWASLGRQPFDERIGECNWEVALVIPYTAFFKHHITSLDGKTITANFYKCG DELQTPHFLSWNPIKIEKPDFHRPDFFGTLEFE >gi|226332036|gb|ACIB01000020.1| GENE 187 250442 - 251662 1248 406 aa, chain + ## HITS:1 COG:no KEGG:BF0151 NR:ns ## KEGG: BF0151 # Name: not_defined # Def: outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 406 1 406 406 805 100.0 0 MKKHLILSACLIMAISSFAQKKDFSYKFYGQIRTDLYYNSRANEETVDGLFYMYPKDEVF DSNGRDLNATANGSFYTLYTRLGLDVKGPKLGRAMTSAKVEADFRGSGTSYSTIRLRHAY LNLDWGRSALLLGQTWHPLFGDVSPQILNLSVGAPFQPFSRAPQIRYRYTHKGFQLTGAA IWQSQYLSQGIVGKSQTYIKNSCVPEFYLGLDYKANGWIAGAGIELLSLKPRTESKVEDQ VYKVNERITTLSYEGHVKYSNKDWFVGAKTVLGSNLTQTSMLGGFGIKSIDNRTGEQKYT PIRVSSSWLNVVYGQKWKPGIFLGYVKNMGTSDALASNQVYGTGTNVDQVVTAGAELTYN VNHWKFGVEYTYTSAAYGSLYLKNGKIIDTHSVGNNRIVGVAMFMF >gi|226332036|gb|ACIB01000020.1| GENE 188 251669 - 252601 749 310 aa, chain + ## HITS:1 COG:CAC3454 KEGG:ns NR:ns ## COG: CAC3454 COG0042 # Protein_GI_number: 15896694 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA-dihydrouridine synthase # Organism: Clostridium acetobutylicum # 7 310 4 304 311 184 34.0 3e-46 MNTLPIHLAPLQGYTEAAYRNAHAAVFGGVDVYHTPFVRIDRGEFRHKDVRDILPENNRV PHLIPQLIASEMDKTERIIALFIEQGYREMDINLGCPFPMLAKRQCGSGMLPHPDKVETL LKQIEQYPDVSFSVKMRLGWEKPDECLTLLPLLNAAPLTEIIVHPRLGIQQYKGEVNMEG FTAFYEACRHPVIYNGDILTIEDIRCITEKFPKLTGVMIGRGLLANPALGWEYKEGRKLT PEEWREKLRALHTAVFQHYETQIQGGEAQLVTKMKTFWEYLAPQIDRKSWKAIHKSTTLA KYNIAVRSAL >gi|226332036|gb|ACIB01000020.1| GENE 189 252795 - 253355 581 186 aa, chain - ## HITS:1 COG:no KEGG:BF0149 NR:ns ## KEGG: BF0149 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 186 1 186 186 359 
100.0 3e-98 MKVKRFVLCLFMLTLIGGICFISCGNTSKAKAESDVAAETAEETFQSFLKKFTSSASFQY TRVKFPLKTPITLMTDDGNSEKTFPFTQEKWPLLDAETLKEERITQEEGGIYVSKFTVNE PTHKEFEAGYEESEVDLRVIFDLIDGKWYVTDCYTGWYGYDLPIDDLNETVKQVKEENDT FKELHP >gi|226332036|gb|ACIB01000020.1| GENE 190 253447 - 254715 785 422 aa, chain - ## HITS:1 COG:no KEGG:BF0183 NR:ns ## KEGG: BF0183 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 422 1 422 422 889 99.0 0 MEQLLHYVWKHKIFPLHELQTTTGLPVEVIDTGLPNSDSGPDFFNAKLKIGGTLWVGNVE IHTASSDWFRHGHDRDIAYDSVILHIVTEIDCEIYRSNGEPVPQLRLPCPEQVKEHYDEL CRADIHPPCYSILETLPKLTIHSWLTALQTERFDQKNRTITRRLQRCNQHWEDAFFITLA RNFGFGLNGDAFETWANLLSFRAIDKHRDDLTQVEAFFFGQAGLLEGESADDYFSWMQKE FRYLQHKFELPPVMNPSLWRFLRLRPGNFPHVRLAQLASLYYRERSLFSRVMEAETLKDL KQIFAGHTSAYWEEHFMFGKSSPRREKSIGAGAKELIIINTVIPFLYAYGLHKADERLCE RAASLLEELKAENNYVTRMWSGAGIPVQTAADSQALLQLQKEYCDKKKCLYCRFGYEYLR HK >gi|226332036|gb|ACIB01000020.1| GENE 191 254811 - 255539 764 242 aa, chain + ## HITS:1 COG:TM1520 KEGG:ns NR:ns ## COG: TM1520 COG0289 # Protein_GI_number: 15644268 # Func_class: E Amino acid transport and metabolism # Function: Dihydrodipicolinate reductase # Organism: Thermotoga maritima # 1 223 1 197 216 115 35.0 8e-26 MKIALIGYGKMGKEIEKVALSRGHEIVSIIDINNQDDFESEAFKSADVAIEFTNPMVAYS NYMKAFKAGVKLVSGSTGWMAEHGDEVKELCNKGGKTLFWSSNFSLGVTIFSAVNKYLAK IMNQFPAYDVTMSETHHVHKLDAPSGTAITLAEGILENMERKSVWVKEEAHATNELPIHS IREGEVFGIHTIRYDSEADSISITHDAKNRGGFALGAVLAAEYTAAHEGYLGMSDLFPFL KD >gi|226332036|gb|ACIB01000020.1| GENE 192 255544 - 257028 1199 494 aa, chain + ## HITS:1 COG:BU259 KEGG:ns NR:ns ## COG: BU259 COG0681 # Protein_GI_number: 15616870 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Signal peptidase I # Organism: Buchnera sp. 
APS # 438 489 259 310 314 74 63.0 6e-13 MRKATRTQWIKCSIAILLYLIFLIWVKSWWGLIVVPFIFDIYITKKIPWSFWKKSKNPTV RSVMSWVDAIVFALVAVYFVNIYVFQNYQIPSSSLEKSLLVGDFLYVSKMSYGPRVPNTP LSMPLAQHTLPILNTKSYIEWPQWKYKRVPGFGKVKLNDIVVFNFPAGDTVALNFQDADF YTLAYNIGKQIYPNPIDMDSLTREQQKTVYDLYYNAGRKEILSNPQRYGKVVTRPVDRRE NYVKRCVGLPGDTLQIINGQVMIDGKAIENPENLQFNYFVQTTGPYITEEMFRELGISKA DQRLTPEGAGYEEGLIELGLDGRNAQGGLNPVYHLPLTKKMYDTLSGNKKLVGKIVIEPE EYSGEVYPLNLNTHWNRSDYGPIWIPAKGATITLTPDNLPIYERCITAYEGNKLEQKEDG IYINGVKTNQYTFQMDYYWMMGDNRHNSADSRYWGFVPEDHVVGKPIVVWLSLDKDRNWF DGKIRWNRIFKWVD >gi|226332036|gb|ACIB01000020.1| GENE 193 257036 - 257971 592 311 aa, chain + ## HITS:1 COG:NMB0765 KEGG:ns NR:ns ## COG: NMB0765 COG0681 # Protein_GI_number: 15676663 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Signal peptidase I # Organism: Neisseria meningitidis MC58 # 10 284 98 321 339 71 26.0 2e-12 MGKGLKWIIAFAGAMVIVLLLRGFAFTSCLIPSAGMENSLFQGERILVNKWSYGLRVPYM SLFSYHRWGESPIHKDDIVVFNNPAGIKEPVIDRREIYISRCIGVPGDTLLIDSLFNVVD RSTQLGPDRKQLYTYPQTKEQQLDSLLSILSIGPNELMGQHEGKNVRSFSRYEYYLLDQA MNGKSWIQPLQQSLQEEAKPLIVPGKGKAVRVYPWNRTLLRNTLVLHEGKQAEIRNDTLY IEGRPSQHCYFTKDYYWMASNNSVNLSDSRLFGFVPQDHVIGKASRIWFSKTDHTGIFSG YRWERFFQPVK >gi|226332036|gb|ACIB01000020.1| GENE 194 258035 - 258679 556 214 aa, chain + ## HITS:1 COG:no KEGG:BF0179 NR:ns ## KEGG: BF0179 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 214 1 214 214 418 98.0 1e-116 MKVAYLSSAYLAPVEYYSKLLNYDKIFIEQHDHYMKQTYRNRCTIAGPEGELALSIPTVK PEGPKCPMKDIRISDHGNWRHLHWNAIESAYNSTPFFEYYKDDFRPFYEKKHEFLTDFNE ELCRLVCELIDIQPAIERTKEYKTDFAPNEIDFREAIHPKKDFHRTDPEFISQPYYQVFE ARHGFLPNLSIIDLLFNMGPESLLILQKTCADSQ >gi|226332036|gb|ACIB01000020.1| GENE 195 258808 - 259620 190 270 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 7 263 1 238 242 77 28 4e-13 MANLFSVKDKVVVITGGTGVLGKAIAAHLAEEGAKVILLGRKTEVGNKIVESIRTQGGEA 
LFLTTDVLDRKILEQNLADILKAYGRIDALLNAAGGNMPGATISPTGDIFDLKIDEFQKV LDLNLTGTILPTQVFLKPMVEQRAGAIVNFSSMAAFRPLTRVAGYAAAKAGITNFTAFMA TEIAKKFGEGIRINAIAPGFFLTEQNRALLTNPDGTYTQRGQDVIRQTPFGRMGRAEELC GTIQYLISDAASFVTGTVAVVDGGFNAFAM >gi|226332036|gb|ACIB01000020.1| GENE 196 259717 - 260925 874 402 aa, chain + ## HITS:1 COG:HI0055 KEGG:ns NR:ns ## COG: HI0055 COG1312 # Protein_GI_number: 16272029 # Func_class: G Carbohydrate transport and metabolism # Function: D-mannonate dehydratase # Organism: Haemophilus influenzae # 18 402 2 392 394 525 63.0 1e-149 MNQPNIMTLPKDKLFLSEQTWRWYGPDDPVSLWDIKQAGATGIVNALHHIPNGEVWTVEE IMKRKELIESVGLKWSVVESVPVHEHIKTQTGNFRKYIENYKESLRNLGQCGIHIVTYNF MPVLDWTRTDLAYTLPDGSKALRFERAAFIAFDLFLLKRPGAEAEYTDEEKTKARIRFEQ MDEKEKQLLVRNMIAGLPGSEESFTLEQFQHELDRYRGIDAEKLRTHLIHFLKEITSTAD EAGVKLVIHPDDPPCSILGLPRIMSCAEDFQALIDVVPNESNGLCLCTGSLGVSCANDLE GMMRRFGDRINFVHFRSTQRDAEGNFYEANHLEGDVDMYHVMKAFLELQQRRKVSIPMRP DHGHQMVDDLKKKTNPGYSCIGRLRGLAELRGLEMGIAKSIF >gi|226332036|gb|ACIB01000020.1| GENE 197 261072 - 261602 556 176 aa, chain - ## HITS:1 COG:no KEGG:BF0176 NR:ns ## KEGG: BF0176 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 176 1 176 176 343 100.0 1e-93 MMTLIRITSADDSRLNRLIPLYEESFPESERRKIGQLKRMIENHAPMYFNAIECDGELSG MFVYWDMGDFYYLEHLAVFPEMRNKKIGQQVLDYVAEHLKGVRLLEVEPTEDEMTTRRVN YYRRNGYEVLDKTYVQPSYHALEDACPLWIMGSEDSPRLAEQVERIKEEVYRQQVG >gi|226332036|gb|ACIB01000020.1| GENE 198 261850 - 262200 130 116 aa, chain + ## HITS:1 COG:no KEGG:BF0140 NR:ns ## KEGG: BF0140 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 8 116 20 128 128 205 99.0 4e-52 MLCLLFLVFGTQKSSALQPVNNGVAVTTAVKSHTHNVPSFDNSGHDLFDFNQGILLKKSV SDSANPYKQSNNFHPAIKTDAQRPITFKEDTSTYTLRNRVKPAQKYDVFALRHILI >gi|226332036|gb|ACIB01000020.1| GENE 199 262367 - 263614 1276 415 aa, chain - ## HITS:1 COG:BH2405 KEGG:ns NR:ns ## COG: BH2405 COG0612 # Protein_GI_number: 15614968 # Func_class: R General function prediction only # Function: 
Predicted Zn-dependent peptidases # Organism: Bacillus halodurans # 13 413 3 401 413 214 33.0 3e-55 MKQLSTRAEMQYNIHTLSNGLRIIHEPSSSKVAYCGFAVDAGTRDEAENEQGMAHFVEHL IFKGTRKRKAWHILNRMENVGGDLNAYTNKEETVIYSAFLTEHFGRALELLADIVFHSTF PQNEIEKETEVIIDEIQSYEDTPSELIFDDFEDMIFRNHPLGRNILGRPDLLKKFRSEDA MAFTSRFYQPSNMVFFVLGDFNFQKIVRQVEKLLVDLPLVTVENQRTIPPLYVPEQLVVH KETHQAHVMIGSRGYNAYDDKRTALYLLNNILGGPGMNSRLNVSLRERRGLVYTVESNLT SYTDTGAFCIYFGTDPEDVDTCLKLTYKELKRMRDVKMTSSQLMAAKKQLIGQIGVASDN NENNALGMAKTFLHYNKYESSESVFRRIEALTAEGLLEVANEMFAEEYLSTLIYR >gi|226332036|gb|ACIB01000020.1| GENE 200 263716 - 264246 550 176 aa, chain + ## HITS:1 COG:BH2746 KEGG:ns NR:ns ## COG: BH2746 COG1611 # Protein_GI_number: 15615309 # Func_class: R General function prediction only # Function: Predicted Rossmann fold nucleotide-binding protein # Organism: Bacillus halodurans # 3 154 2 152 190 124 40.0 6e-29 MEKIGIFCSASGSIDPIYFDAAHQIGEWMGKNGKTLIYGGANLGLMECVAKAVKENGGHV IGVVPSKLEENGKVSTYPDEIIATHDLSDRKDIILQQSDVLVALPGGIGTLDEVFHVMAA ASIGYHQKKVIFYNADGFYNPLLAVLSELQARGFTRHPLSSYYEVANTFNELTIKI >gi|226332036|gb|ACIB01000020.1| GENE 201 264246 - 264851 549 201 aa, chain + ## HITS:1 COG:PA4457_1 KEGG:ns NR:ns ## COG: PA4457_1 COG0794 # Protein_GI_number: 15599653 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted sugar phosphate isomerase involved in capsule formation # Organism: Pseudomonas aeruginosa # 9 194 19 198 209 144 42.0 9e-35 MIESIQELLQKEAQAVLNIPVTDAYEKAVELIVEQIHRKKGKLVTSGMGKAGQIAMNIAT TFCSTGIPSVFLHPSEAQHGDLGILQENDLLLLISNSGKTREIVELTQLAHNLNPGLKFI VITGNPDSPLASESDVCLSTGHPAEVCTLGMTPTTSTTVMTVIGDILVVQTMKRTEFTIE EYSKRHHGGYLGEKSRKLCVK >gi|226332036|gb|ACIB01000020.1| GENE 202 264839 - 265759 670 306 aa, chain + ## HITS:1 COG:VCA0656 KEGG:ns NR:ns ## COG: VCA0656 COG0524 # Protein_GI_number: 15601414 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar kinases, ribokinase family # Organism: Vibrio cholerae # 1 255 18 274 323 92 25.0 1e-18 
MRKVIGIGETILDIIFRNDQPSAAVPGGSVFNGIVSLGRMGVNVCFISETGNDHVGNIIL QFMRDNHIPTDHVNVFPDGKSPVSLAFLNEHSDAEYIFYKDYPKQRLDVLFPSINEDDII VIGSYYALNPVLRDKILELLDIAKEKRAIVYYDPNFRSSHKNEAMKLAPTIIENLEYADI VRGSLEDFYYMYGIKDVEKIYKDKIKFYCPLFLCTAGAERISLRTNAISKEYPVAPLEAV STIGAGDNFNAGLIYGMLKYDVRYRDLQHLSEETWDKVIQCGQDFAAEVCKSFNNSISGE FAANYR >gi|226332036|gb|ACIB01000020.1| GENE 203 265944 - 266999 898 351 aa, chain + ## HITS:1 COG:lin2049 KEGG:ns NR:ns ## COG: lin2049 COG2365 # Protein_GI_number: 16801115 # Func_class: T Signal transduction mechanisms # Function: Protein tyrosine/serine phosphatase # Organism: Listeria innocua # 39 351 12 326 326 143 33.0 4e-34 MYKNLFNLLTILLILPSCTDMSPNISVVCEENNIGNCILKWETTPLIKGQVKVYTSDNPE FIPEDNPVAMANISDARMTIVTNDPSRRSYYMLVFNDKYRVKVAPRNVNMPGIQNFRDLG GYKSATGKHVRWGKLYRSAQIDSLNCFALRKLQNLGIKTILDLRSESELHNTPPLQKGFN VVHIPINTGDMEHILHGIQQEKIKTDTIYHMVEAMNRELVAKYQKEYKEIFDILLDKNSY PVVIHCSSGKGRTGIVSALILASLDVNADIIMEDYRLSNDYFNIPKASKYAYNLPVNSQE AITTLFSAKEDFLNAAKDEIERKYGDVPTYLRKAIGLQSEDIHRLRTILLE >gi|226332036|gb|ACIB01000020.1| GENE 204 267094 - 269004 1869 636 aa, chain + ## HITS:1 COG:BH2384 KEGG:ns NR:ns ## COG: BH2384 COG0513 # Protein_GI_number: 15614947 # Func_class: L Replication, recombination and repair; K Transcription; J Translation, ribosomal structure and biogenesis # Function: Superfamily II DNA and RNA helicases # Organism: Bacillus halodurans # 1 550 5 535 539 377 39.0 1e-104 MKTFEELGVSPEIRKAIEEMGYENPMPVQEEVIPYLLGENNDVVALAQTGTGKTAAFGLP LIQKINVKNRIPQSLVLCPTRELCLQIAGDLNDYSKYIDGLKVLPVYGGSSIDSQIRSLK RGVHIIVATPGRLLDLMERKTVSLATVTNVVMDEADEMLNMGFTDSINAILADVPQERNT LLFSATMSPEIARISKNYLHNAKEITIGRKNESTSNVKHVVYTVHAKDKYAALKRIVDYY PQIYGIIFCRTRKETQEIADKLMQEGYNADSLHGELSQAQRDTVMQKFRIRNIQILVATD VAARGLDVDDLTHVINYGLPDDTESYTHRSGRTGRAGKTGTSIAIINLREKGKMREIERI IGKKFIAGEMPTGKQICEKQLLKVIDDLEKVKVNEEDINDFMPEIYRKLEWLSKEDLIKR MVSHEFNRFVDYYRNREEIEVPTDSRSERAGKSREGKSSRQAEPGYTRLFINLGKMDNFF PHELITLLNNNTRGRVELGRIDLMKNFSFFEVEEKQAQNVVKALNRTNWNGRKVTVEVAG 
EEASTEHKGRGKRNEGNDQGGRGRSSAPSADRKERGKGSKDSKQADSRKGKKPSREERGY SAARGPKGKDEWKQFFKDAEPDFSEEGWARRKPKKS >gi|226332036|gb|ACIB01000020.1| GENE 205 269751 - 270614 841 287 aa, chain + ## HITS:1 COG:PAB0040 KEGG:ns NR:ns ## COG: PAB0040 COG0697 # Protein_GI_number: 14520295 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Pyrococcus abyssi # 6 282 23 291 295 74 26.0 3e-13 MNKLNGFLYGLLSSASFGLIPLFTIPAMREGMNFESILFYRFLFACLALGCILLLDKQSF HIKRKEIPSLMLLAFLYLMSAVFLFWGYKFMASGVATTIHFMYPVLTTLIMMIFFRERKS TWRFFAIALAVVGVSCLSYGDSSGGITALGLFIVLLSALGYALYLVTVSQLKIGQMKGLR LTFYVFLFGTLLLLAGIGATTGIQTIPDWHTGGNLVLLALIPTVVSNLALVRAVKSIGST LTSVLGAMEPVTAVCVGIFIFGEPFTQSIGIGILLIISAVIVIILKR >gi|226332036|gb|ACIB01000020.1| GENE 206 270664 - 272841 2164 725 aa, chain - ## HITS:1 COG:no KEGG:BF0166 NR:ns ## KEGG: BF0166 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 725 1 725 725 1475 99.0 0 MNKMKVITSFRNGKRWGEALFLALLLLYPVSMHADEGMWMLGNLNKETRKTMKELGLQMP ADQLYCTRHPSLKDAVVSFGGFCSGVVVSEDGLVFTNHHCGFSSIQQHSSVDHDYLKDGF TAHSREEELPNPELYVRFLLRTENVTRRVLKATTPGMTESERSLAIDSMMVLLGDEVTKK DSTLVGIVDAYYGGNEFWLSVYRDFNDVRLVFAPPSSIGKFGWDTDNWMWPRHTGDFCVF RIYAGKDNRPADYSPDNVPYRPEYVAPITLDGYKEGSFCMTLGYPGSTERYLSSFGIEEM MNGMNQAMIDVRGVKQAIWKREMDRRDSIRIKYASKYDESSNYWKNSIGTNKAIRKLKVL DKKRQAEEALRQWIQKTPAEREKLLHLMSSLELNYKDRKEVNRAMSYFGESFINGPELVQ FALTILNFDFEAEQKQVVAQLQKLLDKYANYDVSIDKEVFVAMLKEYRTKVDKAYLPDLY QAIDTLYGGNEQMYVDTLYAHSELTSPRGLKRFLERDTTFHMIDDPAVSLGIDLIVKLFD MRSQMAEASDNIEKDEREFNAAMRRMYADRNFYPDANSTMRLSFGTIGSYSPYDGADYGY YTTVKGIFEKVKEHSGDPDFAVQPEVLSLLSSGDFGRYADTKGDMNVCFISNNDITGGNS GSAMFNGKGELLGLAFDGNWEAMSSDIVFEPDVQRCIGVDVRYMLFIIEKFGKASHLIQE LKIEG >gi|226332036|gb|ACIB01000020.1| GENE 207 272893 - 274260 1225 455 aa, chain - ## HITS:1 COG:SP1264 KEGG:ns NR:ns ## COG: SP1264 COG1808 # Protein_GI_number: 15901124 # Func_class: S 
Function unknown # Function: Predicted membrane protein # Organism: Streptococcus pneumoniae TIGR4 # 33 329 14 306 347 236 42.0 7e-62 MKADNRNIFAIKAFLREYLDLRKDKDNELATVDSIRKGVEFKGANLWILIFAIFMASLGL NVNSTAVIIGAMLISPLMGPIMGVGLSVGMNDFELMKRSLKSFLITTAFSVTTATVFFLF TPIAEAQSELLARTSPTIYDVFIALFGGLAGVVALSTKEKGNVIPGVAIATALMPPLCTA GYGLASGNLIYFLGAFYLYFINSVFISLATFIGVRLMHFQRKEFVDKNREKKVRKYIVLI VILTMCPAIYLTMGIIRSTFFEAAANRFVSEQLNFDNTQVLNKKIHYEGDSCEIRVVLIG PEVPEASIAIARSKMKDYKLENTKLIVLQGMNNNEAMDMSSIRAMVMEDFYKNSEQRLQE QQVKIATLQQSLERYKSYDEMSRKIIPELKVLYPSVTALSISHTIETTVDSMKTDTIALA VLKFTRHPKNEEKAKISAWLQARIGAKKLRLIVEE >gi|226332036|gb|ACIB01000020.1| GENE 208 274263 - 275759 1471 498 aa, chain - ## HITS:1 COG:TM0620 KEGG:ns NR:ns ## COG: TM0620 COG2244 # Protein_GI_number: 15643386 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Thermotoga maritima # 5 453 6 437 479 70 20.0 8e-12 MAGLKSLVKDTALYGLSSMVGRFLNYLLVPLYTAVLPAASGGYGVVTNVYAWAGLIMVLL TFGMETGFFRFANKSEEDPVKVYANSLISVGGISLIFAILCLTFLQPVSHLLEYGDHPDF IGMMIIVMALDAFLCIPFAYLRFKKRPIKFVAIKFVSIIANIVLNLFFLLLCPWLHEHFP AWVDWFYNPTYLVGYIFVSNLITTCLQLFCLIPELRGFAYRVDKQLLKRMLIYSFPILIF GLVGILNQTVDKIIYPFLFADRQEGLVQLGIYGAATKIAMVMAMFTQAFRYAYEPFVFGK QKEGDNRRMYAQAMKYFLIFAMFAFLVVMFYLDLLRYMVAPDYWAGLSVVAIVIGAEIFK GIYFNLSFWYKLIDETRWGAYFSIVGCVIIVGMNVMLVPTYGFVASAWASVAGYAVITIL SYWIGQKKYPIHYDLKHLGTYVLFTAVLYVIGEWVPIENIVLRLAFRTVLLLIFMAYVVR KDLPLSQIPVINRIIKKK >gi|226332036|gb|ACIB01000020.1| GENE 209 275788 - 276816 1108 342 aa, chain - ## HITS:1 COG:NMB1243 KEGG:ns NR:ns ## COG: NMB1243 COG2255 # Protein_GI_number: 15677115 # Func_class: L Replication, recombination and repair # Function: Holliday junction resolvasome, helicase subunit # Organism: Neisseria meningitidis MC58 # 13 329 22 338 343 389 62.0 1e-108 MEEDFNIRDHQLTSRERDFENALRPLSFEDFNGQDKVVDNLRIFVKAARLRAEALDHVLL HGPPGLGKTTLSNIIANELGVGFKVTSGPVLDKPGDLAGVLTSLEPNDVLFIDEIHRLSP 
VVEEYLYSAMEDYRIDIMIDKGPSARSIQIDLNPFTLVGATTRSGLLTAPLRARFGINLH LEYYDDDILSNIISRSAGILDVPCSSQAAGEIASRSRGTPRIANALLRRVRDFAQVKGSG SIDTEIANYALEALNIDKYGLDEIDNKILCTIIDKFKGGPVGLTTIATALGEDAGTIEEV YEPFLIKEGFLKRTPRGREVTELAYKHLGRSLYNSQKTLFND >gi|226332036|gb|ACIB01000020.1| GENE 210 277145 - 277843 496 232 aa, chain - ## HITS:1 COG:RC0866 KEGG:ns NR:ns ## COG: RC0866 COG4912 # Protein_GI_number: 15892789 # Func_class: L Replication, recombination and repair # Function: Predicted DNA alkylation repair enzyme # Organism: Rickettsia conorii # 60 194 4 136 139 122 43.0 8e-28 MKAIEIQKELETYIDPVKREYLPGFFKTGKGQYGEGDRFLGIVVPATRLVAKKYKNAPFE VMAELLQSEWHECRLCALLMMVERFKKSGGEEREAIYRFYLSQTERINNWDLVDLSAPYI VGEYLKDKSRDDLYRLAESTLLWDQRIAVVSTVTFIRNNDFIDILRLSELLLQHKHDLMR KAIGWMLREMGKRDKTLLLQFLDKYSKVMPRTMLRYSIEKLTDEERKLYMGR >gi|226332036|gb|ACIB01000020.1| GENE 211 277950 - 279182 1111 410 aa, chain + ## HITS:1 COG:PA5478_1 KEGG:ns NR:ns ## COG: PA5478_1 COG2715 # Protein_GI_number: 15600671 # Func_class: R General function prediction only # Function: Uncharacterized membrane protein, required for spore maturation in B.subtilis. # Organism: Pseudomonas aeruginosa # 2 246 1 245 245 202 45.0 8e-52 MVLNYIWIAFFVIAFVVAVCKLLFFGDTQIFTEIINSTFDSSKTAFEISLGLTGVLSLWL GVMKIGENSGLINALSRWLSPVFCRLFPDIPKGHPVMGSIFMNMSANMLGLDNAATPMGL KAMKELQELNPKKDTASNPMIMFLVINTSGLIIIPISIMVYRAQMGAAQPTDIFIPILLS TFISTLVGVIAVSISQRINLINKAILTLMGCLSLFFGGIIYLTTTLSREEMGVYSTLIAN VILFSVILLFIIAGIRKKINVYDSFVEGAKEGFTTAVRIIPYLVAFLVGIAVFRTSGAMD ILVGGIGAIVEFCGLDTGFVGALPTALMKSLSGSGANGLMIDTMKQFGPDSFVGRVSCVV RGASDTTFYILAVYFGSVGITKTRNAVTCGLIADFAGIIAAILISYLFFF Prediction of potential genes in microbial genomes Time: Tue May 17 22:41:11 2011 Seq name: gi|226332035|gb|ACIB01000021.1| Bacteroides sp. 3_2_5 cont1.21, whole genome shotgun sequence Length of sequence - 36525 bp Number of predicted genes - 34, with homology - 34 Number of transcription units - 17, operones - 7 average op.length - 3.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
+ CDS 53 - 1369 654 ## BF0123 hypothetical protein + Term 1483 - 1546 11.1 2 2 Tu 1 . - CDS 2038 - 2919 484 ## BF0159 hypothetical protein - Prom 2943 - 3002 4.4 3 3 Op 1 . - CDS 3084 - 3509 264 ## BF0158 hypothetical protein 4 3 Op 2 . - CDS 3587 - 3727 61 ## gi|301161179|emb|CBW20717.1| conserved hypothetical protein - Prom 3836 - 3895 5.2 + Prom 3842 - 3901 6.0 5 4 Tu 1 . + CDS 3960 - 4379 447 ## COG0319 Predicted metal-dependent hydrolase + Term 4397 - 4449 1.2 6 5 Op 1 . - CDS 4671 - 5051 348 ## BF0155 hypothetical protein 7 5 Op 2 . - CDS 5069 - 6292 558 ## BF0154 putative pyrogenic exotoxin B 8 5 Op 3 . - CDS 6328 - 6678 215 ## BF0153 hypothetical protein - Prom 6752 - 6811 5.7 - Term 6697 - 6749 6.7 9 6 Tu 1 . - CDS 6848 - 8536 636 ## BF0114 putative lipoprotein - Prom 8677 - 8736 5.0 + Prom 8471 - 8530 2.2 10 7 Tu 1 . + CDS 8551 - 8775 73 ## BF0101 hypothetical protein + Term 8984 - 9030 5.6 + Prom 9180 - 9239 2.0 11 8 Op 1 . + CDS 9273 - 11150 1466 ## COG0445 NAD/FAD-utilizing enzyme apparently involved in cell division 12 8 Op 2 . + CDS 11201 - 11737 510 ## COG0503 Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins 13 8 Op 3 . + CDS 11790 - 13616 1048 ## COG0322 Nuclease subunit of the excinuclease complex 14 8 Op 4 . + CDS 13662 - 14114 369 ## COG1490 D-Tyr-tRNAtyr deacylase 15 8 Op 5 . + CDS 14114 - 14452 347 ## COG1694 Predicted pyrophosphatase 16 8 Op 6 . + CDS 14439 - 15341 785 ## COG0274 Deoxyribose-phosphate aldolase + Term 15369 - 15420 7.2 17 9 Tu 1 . - CDS 15521 - 15694 70 ## BF0107 hypothetical protein - Prom 15733 - 15792 4.2 18 10 Tu 1 . + CDS 15742 - 16449 500 ## BF0093 hypothetical protein + Term 16651 - 16681 0.4 19 11 Tu 1 . - CDS 16459 - 17502 707 ## COG0142 Geranylgeranyl pyrophosphate synthase 20 12 Tu 1 . + CDS 17432 - 20326 2205 ## COG0749 DNA polymerase I - 3'-5' exonuclease and polymerase domains + Term 20397 - 20433 4.1 + Prom 20387 - 20446 4.9 21 13 Tu 1 . 
+ CDS 20596 - 20991 337 ## BF0090 hypothetical protein - Term 21233 - 21264 -1.0 22 14 Op 1 9/0.000 - CDS 21367 - 22122 782 ## COG3279 Response regulator of the LytR/AlgR family 23 14 Op 2 . - CDS 22135 - 24165 1220 ## COG3275 Putative regulator of cell autolysis - Prom 24188 - 24247 5.2 + Prom 24209 - 24268 4.8 24 15 Op 1 . + CDS 24292 - 25191 754 ## COG1045 Serine acetyltransferase 25 15 Op 2 . + CDS 25254 - 26609 1167 ## COG0116 Predicted N6-adenine-specific DNA methylase + Prom 26676 - 26735 2.4 26 16 Op 1 . + CDS 26760 - 28904 1811 ## COG1506 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 27 16 Op 2 . + CDS 28926 - 30200 1679 ## COG0151 Phosphoribosylamine-glycine ligase 28 16 Op 3 . + CDS 30200 - 31192 551 ## BF0083 hypothetical protein 29 16 Op 4 . + CDS 31177 - 31653 412 ## COG1238 Predicted membrane protein 30 16 Op 5 . + CDS 31662 - 32438 430 ## COG4121 Uncharacterized conserved protein 31 16 Op 6 25/0.000 + CDS 32444 - 33376 706 ## COG0803 ABC-type metal ion transport system, periplasmic component/surface adhesin 32 16 Op 7 . + CDS 33397 - 34206 711 ## COG1121 ABC-type Mn/Zn transport systems, ATPase component + Term 34234 - 34294 12.8 - Term 34331 - 34365 2.0 33 17 Op 1 . - CDS 34379 - 35137 807 ## BF0090 hypothetical protein 34 17 Op 2 . 
- CDS 35166 - 36473 520 ## BF0089 hypothetical protein Predicted protein(s) >gi|226332035|gb|ACIB01000021.1| GENE 1 53 - 1369 654 438 aa, chain + ## HITS:1 COG:no KEGG:BF0123 NR:ns ## KEGG: BF0123 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 438 104 541 541 775 96.0 0 MELTEDTLPRVSTRATVPAGVRFRLIVFRKSGNNYVFQSVADYASNGTGTPVLERGKLLT RSGTIRVVGYSFNTTADLGDMPSTYAYNSSTVSIPDMSKDFMTFDSGDITNVNSLSHNLP VSFSQKLCKLTITISPTGFPSNTITNCTGVYVKQGGNSTSWKIGPSTNVVAANTNNTAAF SPNTTLSTTIRMVPFAGARTITVHFNTLTISGRTVPNNTEITSTQSVQLKEGKSYTMKIQ FKKAIGINVPASNINLTQNGCIENDKTILSKLRWAEGNLNSQSNYNLTWASSTTDYGYYY IWKNVYVSSGYTGYGAVDPCTRLDESKYGSGWRTPTRSEFISLSRCSNKQLVKYNGIQGM WFMNSSTGLFLPAAGWRDGNNGSGTTATAYTDGTGSYYWSSDLNGNSAYGLYILKNDVFV DANGNRKSGVSVRCVHNL >gi|226332035|gb|ACIB01000021.1| GENE 2 2038 - 2919 484 293 aa, chain - ## HITS:1 COG:no KEGG:BF0159 NR:ns ## KEGG: BF0159 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 293 1 293 293 544 98.0 1e-153 MKIVIKEKVIPYILISLFSFIGLSAYGYKAERQGGSKAVVWSISKIDTMHKNVQRNDGRN PNIQNIEYLKKMFRQKAVDEISENIVYPLKRTSPIPSVENAEELKERFDSIFDEDLIRII TSSDIDQWSEMGWRGIMLDDGILWMDYDGKITAVNYQSKYEKKLAKKLTSKVKGDLSSDL RHNFKGEVYKFKTKNYFIRIDELKNGMYRYACWKKENPESTKPDLVLENGKIEFSGSGGN HVITFKNNIYEYKVFHNKIAASGIADITLVVEKNGKEILSEDGKLEGDTTQTD >gi|226332035|gb|ACIB01000021.1| GENE 3 3084 - 3509 264 141 aa, chain - ## HITS:1 COG:no KEGG:BF0158 NR:ns ## KEGG: BF0158 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 141 1 141 141 270 100.0 1e-71 MTEFRYSICEPLNPKVIEKGMIAPDSVIGLFNDFQWDYYLKQIEVAETRKMDIYFSPSLE VENKANKNGLTISAVGDPEDPEFYIFYKRPISVVKKQFFRKPQTVVEDYVSEITGQTKED VIECLNALIKNDQEFLRRKIA >gi|226332035|gb|ACIB01000021.1| GENE 4 3587 - 3727 61 46 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301161179|emb|CBW20717.1| ## NR: gi|301161179|emb|CBW20717.1| conserved hypothetical protein [Bacteroides fragilis 638R] # 1 46 12 57 57 90 100.0 3e-17 
MIFRHYLFANITEAPKSRSDIEENREMSNWTACPVRVLLHKKNTIF >gi|226332035|gb|ACIB01000021.1| GENE 5 3960 - 4379 447 139 aa, chain + ## HITS:1 COG:TP0650 KEGG:ns NR:ns ## COG: TP0650 COG0319 # Protein_GI_number: 15639637 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolase # Organism: Treponema pallidum # 37 122 40 135 160 73 40.0 1e-13 MAITYQTEGIKMPDIKKRETTEWIKAVAATYEKRIGEIAYIFCSDEKILEVNRQYLQHDY YTDIITFDYCEGNRLSGDLFISLETVKTNSEQFNTPYEEELHRTIIHGILHLCGINDKGP GEREIMEAAENKALAMRKQ >gi|226332035|gb|ACIB01000021.1| GENE 6 4671 - 5051 348 126 aa, chain - ## HITS:1 COG:no KEGG:BF0155 NR:ns ## KEGG: BF0155 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 126 1 126 126 242 100.0 3e-63 MKKVFFLYCISLLCIYGCNDDSDKYSSNFLKSREVTLNAEGGDITVEGREKFILVQTEEI LNQDTIFSIGEINGVEYQGGWYQLEVNGKEMTLNCDRNLTGNKRKVVLHFQGTGNSFDVF SLTQLE >gi|226332035|gb|ACIB01000021.1| GENE 7 5069 - 6292 558 407 aa, chain - ## HITS:1 COG:no KEGG:BF0154 NR:ns ## KEGG: BF0154 # Name: not_defined # Def: putative pyrogenic exotoxin B # Organism: B.fragilis # Pathway: not_defined # 1 407 1 407 407 829 99.0 0 MKEIFKLILLLVILSGCIENDEEIILNNSDLEDVSEYKRAEDLICQFSERIDKEKGGTRS STSQIILSLAGKKSVVIPKIATRTGEISTDSVNMFIFDTEKDGRFGFAIATGKAEVGRVY AYVENGILSDTIENEGMAYLVSQIPDIIKQDQLNPNLTRSGEQRTTHVSIPLVKTEWNQH YPYNAQMPTNGKCSISYYYAGCIPIAVAQAITYYRKCPVAYDWDAFTVNTGIYDANLIAP VSQFVKKVADGIKVGYKCDGTGAKNLGSTNDFLKGWGYNVERHKTNDVDKNLLYKCLITK NVVIFGGKKKKSTGHVWLVDGGVFEYSGNMMIGCTNIQVKAIHCNFGWNRANNGWYAIKD GAYNRPANSASQDGNNPTNDKNSPNGNFYTENDYIYFHEAMGTEILW >gi|226332035|gb|ACIB01000021.1| GENE 8 6328 - 6678 215 116 aa, chain - ## HITS:1 COG:no KEGG:BF0153 NR:ns ## KEGG: BF0153 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 116 1 116 116 210 100.0 2e-53 MKTKMIVLVGMLFTQIVFTSNNVYGEDVMAIMKDRHKVHLITHKEANLQRSTLLVACGYI EESQLFLNFNSSLENRKIQVVDSETGQTVFDDTITGTSFSIFLERDSDSFDIYIGR >gi|226332035|gb|ACIB01000021.1| GENE 9 6848 - 8536 636 562 aa, chain 
- ## HITS:1 COG:no KEGG:BF0114 NR:ns ## KEGG: BF0114 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 562 1 562 562 1075 99.0 0 MNKTFVGFIFLLFLVSGVVSCQRSSSPYPYSLRYADSLMEISPERTLAYLRKLDVSTYSA GDRAYFSLLFTQATDKNMLSLLPCDSLIDTALDYYIKKDGVNWAKAWLYKGRIQKKMNMT EQALKSCFTALQGVEGNTGEELKLKGMLYEDMGSIYLHQSLYQKAFDAFYRSYQCDSLLN DHRLVMYPLSNMGWVRVIQGKTVEAFYYLNQSIQLALRLNDSAFVSDIYERMSLNCENVD SAFLYAHLSHQYLTKDGDSISLWLTFGDLYLDKQELDSAEYYLKRILDTADFKRKILASY SLAEVEKIRGNYQRAFEYQSYYGDNIDSIFLLNKASDIERLAYKYDSEAKVVKEKEKQRF FIQQLCYGGVLFLLVIIVIFQCIYRRRQIARLLYEQRITYLNEKTALSQLQIERLEVQIS ALKQSGMEREQEIDLKQAELCCVIDEKARLRNCLFMETSIFKHIRELSTQPRLGQNGTKG SPKVLLMKEQEQLKNILFGIYDDYIRYLKGTYPKITDNDCIYCCLKLCEFDDQTIAYCFG NVSKQIVAQRRLRLKKKMAEAN >gi|226332035|gb|ACIB01000021.1| GENE 10 8551 - 8775 73 74 aa, chain + ## HITS:1 COG:no KEGG:BF0101 NR:ns ## KEGG: BF0101 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 74 1 74 74 120 100.0 2e-26 MMTNLRFFSATSKFFKGRQLSICHSVFSNRSKTLYFRPTGRLIKKNACISIFVISIKQQF AIYTNNIFDINVKQ >gi|226332035|gb|ACIB01000021.1| GENE 11 9273 - 11150 1466 625 aa, chain + ## HITS:1 COG:RSc3328 KEGG:ns NR:ns ## COG: RSc3328 COG0445 # Protein_GI_number: 17548045 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: NAD/FAD-utilizing enzyme apparently involved in cell division # Organism: Ralstonia solanacearum # 4 621 6 627 647 583 48.0 1e-166 MDFKYDVIVIGAGHAGCEAAAAAANLGSKTCLITMDMNKVAQMSCNPAVGGIAKGQIVRE IDALGGYMGLVTDQTAIQFRILNRSKGPAMWSPRAQCDRNKFIWAWREILENIPNLHIWQ DTVKEIIVENGEVVGLKTFWDVTFHARCIVLTAGTFLNGLMHVGKNQLPGGRMAEPASYK LTESIAKHGIEYGRMKTGTPVRIDGRSVHYELMDTQDGECDFHKFSFMNTSVRHLKQLQC WTCFTNEEAHNVLRNGLADSPLFNGQIQSIGPRYCPSIETKIVTFPDKEQHQLFLEPEGE TTQELYLNGFSSSLPMEIQIEALKKIPAFKDLVIYRPGYAIEYDYFDPTQLKHTLESKKI KNLFFAGQVNGTTGYEEAGGQGIIAGINAHINCHGGEPFTLARDEAYIGVLIDDLVTKGV DEPYRMFTSRAEYRILLRMDDADMRLTERAYKLGLVKEDRYALLKSKREAVENIVNFTRN 
YSIKAALINDALENLGTTPLRQGCKLIDLINRPQITIENISEYVPAFKRELDKITDERKE EILEAAEILIKYEGYIGRERIIADKLARLESIKIKGKFDYDSLQSLSTEARQKLKKIDPE TIAQASRIPGVSPSDINVLLVLSGR >gi|226332035|gb|ACIB01000021.1| GENE 12 11201 - 11737 510 178 aa, chain + ## HITS:1 COG:VC1053 KEGG:ns NR:ns ## COG: VC1053 COG0503 # Protein_GI_number: 15641066 # Func_class: F Nucleotide transport and metabolism # Function: Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins # Organism: Vibrio cholerae # 1 165 1 166 181 159 49.0 2e-39 MIMSKEKLIKSIREIPDFPIPGILFYDVTTLFKDSERLQELSDIMYEMYKDKGITKVVGI ESRGFIMGPILATRLGAGFIPIRKPGKLPAETMEESYDKEYGKDTVQIHKDALNENDVVL LHDDLLATGGTMKAACNLVKKLHPKKVYVNFIIELKELNGKQVFENDQDVDIQSVLSL >gi|226332035|gb|ACIB01000021.1| GENE 13 11790 - 13616 1048 608 aa, chain + ## HITS:1 COG:lin1197 KEGG:ns NR:ns ## COG: lin1197 COG0322 # Protein_GI_number: 16800266 # Func_class: L Replication, recombination and repair # Function: Nuclease subunit of the excinuclease complex # Organism: Listeria innocua # 9 589 2 573 603 404 41.0 1e-112 MDTNQELKTSEYLKGIVSNLPEKPGIYQYLNAEGTIIYVGKAKNLKRRVYSYFSKEHQPG KTRVLVSKIADIRYIVVNSEEDALLLENNLIKKYKPRYNVLLKDDKTYPSICVQNEYFPR VFKTRRIIRNGSSYYGPYSHSPSMHAVLDLIKHLYPLRTCNLNLSPENIRAGKFNVCLEY HIKNCAGPCIGLQSQEEYLKNIAEIKEILKGNTQEISRLLYQRMQDLAAEMKFEEAQKVK EKYALIENYRSKSEVVSSVLHNIDVFSIEEDGEKSAFINYLHITNGAINQAFTFEYKKKL NETKEELLTLGIIEMRERYKSASREIIVPFDIEIELNDVTFTIPQRGDKKKLLELSLLNV KQYKADRMKQAEKLNPEQRSMRLMKEIQQELHLDRLPMQIECFDNSNIQGTDAVAACVVF KKAKPSKSDYRKYNIKTVVGADDYASMKEVVRRRYQRAIEEESPLPDLIITDGGKGQMEV VRQVMEELQLDIPIAGLAKDRKHRTSEVLFGFPPQTIGIKQHSPLFRLLEQIQDEVHRFA ITFHRDKRSKRQVASALDNIKGIGEKTKTALLKEFKSVKRIKEATIEEVSAIIGESKAKI IKEGLDNH >gi|226332035|gb|ACIB01000021.1| GENE 14 13662 - 14114 369 150 aa, chain + ## HITS:1 COG:L110564 KEGG:ns NR:ns ## COG: L110564 COG1490 # Protein_GI_number: 15672090 # Func_class: J Translation, ribosomal structure and biogenesis # Function: D-Tyr-tRNAtyr deacylase # Organism: Lactococcus lactis # 1 147 1 145 151 152 53.0 1e-37 
MRVVIQRVSHASVTIDGHCKSAIQKGMMILVGIEETDSREDIDWLCKKIVNLRIFDDENG VMNKSILEDEGNILVISQFTLHASTKKGNRPSYIKAAKPEISIPLYEQFCKDLSCALGKE VKTGEFGADMKVELLNDGPVTICIDTKNKE >gi|226332035|gb|ACIB01000021.1| GENE 15 14114 - 14452 347 112 aa, chain + ## HITS:1 COG:SA1292 KEGG:ns NR:ns ## COG: SA1292 COG1694 # Protein_GI_number: 15927040 # Func_class: R General function prediction only # Function: Predicted pyrophosphatase # Organism: Staphylococcus aureus N315 # 2 99 3 101 105 78 45.0 3e-15 MTLEEAQKAVDEWIHKYGVRYFSELTNMAVLTEEVGELARIMARKYGDQSFKEGEKDDIS DEITDVLWVLLCIANQTGVNLTEAFARNLEKKTQRDNKRHINNPKLSEHGNE >gi|226332035|gb|ACIB01000021.1| GENE 16 14439 - 15341 785 300 aa, chain + ## HITS:1 COG:mll4784 KEGG:ns NR:ns ## COG: mll4784 COG0274 # Protein_GI_number: 13474008 # Func_class: F Nucleotide transport and metabolism # Function: Deoxyribose-phosphate aldolase # Organism: Mesorhizobium loti # 22 284 52 323 348 192 38.0 8e-49 MEMNDTPQDKYLTALAKYDTQLNDADVQVQVAALIEKKVPENNTEEVKKFLFNCIDLTTL NTTDSDESVMRFTEKVNRFDDEFPDLKNVAAICVYPNFAQVVKDTLEVEGINIACVSGGF PSSQTFTEVKIAETAMALADGADEIDIVIPVGAFLSGDYETMCEEIMELKETCKEHHLKV ILETGALKTASNIKKASILSMYSGADFIKTSTGKQQPAATPEAAYVMCQAIKEYYEQTGN KVGFKPAGGINTVNDALIYYTIVKEVLGKEWLSNELFRLGTSRLANLLLSEIKGEELKFF >gi|226332035|gb|ACIB01000021.1| GENE 17 15521 - 15694 70 57 aa, chain - ## HITS:1 COG:no KEGG:BF0107 NR:ns ## KEGG: BF0107 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 57 1 57 57 104 100.0 1e-21 MLCIFPGASASRYDDTSNSLTLHSIYNKSIYVKVRETRLLITFHNMNFAEFQEKDNN >gi|226332035|gb|ACIB01000021.1| GENE 18 15742 - 16449 500 235 aa, chain + ## HITS:1 COG:no KEGG:BF0093 NR:ns ## KEGG: BF0093 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 235 1 235 235 471 99.0 1e-131 MAEQALFQPFNRNQQRENLENTKETGLLISLIPSLQEDFYVDSLTTGASKETLSKLLLRN PRIKNETAVIEMIHFLHDEGDRISFSILLPFLVAEYDPKELEEKIRERFFGIELFIRKCN NLHHFITCIKADQTFKIGEEELKRGVLAWDMGRMVCLTRIAYDAGFIDESLAWNYICSAG 
QQCIQAFNDWTEVGKSFLLGQAMEATEKRKQELYIRLYRQATENPNSPWKKRTLK >gi|226332035|gb|ACIB01000021.1| GENE 19 16459 - 17502 707 347 aa, chain - ## HITS:1 COG:PA4569 KEGG:ns NR:ns ## COG: PA4569 COG0142 # Protein_GI_number: 15599765 # Func_class: H Coenzyme transport and metabolism # Function: Geranylgeranyl pyrophosphate synthase # Organism: Pseudomonas aeruginosa # 39 345 12 320 322 177 34.0 3e-44 MVKVQKNTVLTVLLLLLCSNSVCMDSLSLIKSPITTELEDFKNLFDSSLSSSNLLLNNVI AHIRQRNGKMMRPILVLLVAKLYGEIKPETLHAAVSLELLHTASLVHDDVVDESTERRGQ LSVNAIFNNKVAVLVGDFLLATSLVHAERTRNHDIIGVVACLGQDLAEGEILQLSNVSNR EYSETIYFDVIRKKTAALFAACTKAAALSVEVTDEKAEFARLFGENIGICFQIKDDIFDY FESKEIGKPTGNDMLEGKLTLPALYVLNTTTDAWAQEMALRVKAGTATVDEITRLIEFIK QNGGIEYAVKVMYEYKERALDLLRTLPDSAVKTSLITYLDYVVDRDK >gi|226332035|gb|ACIB01000021.1| GENE 20 17432 - 20326 2205 964 aa, chain + ## HITS:1 COG:polA_2 KEGG:ns NR:ns ## COG: polA_2 COG0749 # Protein_GI_number: 16131704 # Func_class: L Replication, recombination and repair # Function: DNA polymerase I - 3'-5' exonuclease and polymerase domains # Organism: Escherichia coli K12 # 351 964 20 640 640 520 48.0 1e-147 MHTLFEHKSNNKTVNTVFFCTFTMKKLKNMNQNSKLFLLDAYALIYRAYYAFIKNPRINS KGFNTSAILGFVNTLEEVLKKENPTHIGVAFDPPGPTFRHEAFEQYKAQREETPEAIRLS VPIIKDIIKAYRIPILEVAGYEADDVIGTLATEAGNQGITTYMMTPDKDYGQLVTDHVFM YRPKYGDKEFEVMGVEQVKAKFDIQSPAQVIDMLGLMGDSSDNIPGCPGVGEKTAQKLIA EFGSIENLLEHTDQLKGALKTKVETNREMIIFSKFLATIKVDVPIRLDMNSLVREQADED TLRKIFEELEFRTLMERIFKKESSPASPIAGTLFNQENGPVQGNLFEEFTPDHTNEEKKS NLESLNSLSYDYQLIDTEEKRNEIIKKLLTSEILALDTETTGTDPMDAELVGMSFSITEN QAFYVPVPAEREEAIKIVREFEPVFKNEKSLKVGQNIKYDMLVLQNYGIEVRGKLFDTMV AHYVLQPELRHNMDYLAEIYLHYQTIHIEELIGPKGKGQKNMRDLSPQEVYLYACEDADV TLKLKNILEQELKKNDAEKLFYEIEMPLVPVLVNIESNGVRLDTEALKQSSEHFTTRLQS IEKEIYTLAEGEFNIASPKQVGEILFDKLKIVEKAKKTKTGQYVTSEEVLESLRNKHDII GKILEYRGLKKLLSTYIDALPQLINPKTGRIHTSFNQTVTATGRLSSSNPNLQNIPIRDE DGKEIRKAFIPDDGCSFFSADYSQIELRIMAHLSEDKNMIDAFLSGYDIHAATAAKIYKV DIKEVTADMRRKAKTANFGIIYGISVFGLAERMNVDRKEAKELIDGYFETYPQVKSYMDK 
SIQVAREHGYVETIFHRKRFLPDINSRNAVVRGYAERNAINAPIQGSAADIIKVAMARIY ERFKAEGLKAKMILQVHDELNFSVPAKEKEIVEQVVIEEMEKAYRMHVPLKADCGWGTNW LEAH >gi|226332035|gb|ACIB01000021.1| GENE 21 20596 - 20991 337 131 aa, chain + ## HITS:1 COG:no KEGG:BF0090 NR:ns ## KEGG: BF0090 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 131 1 131 131 207 100.0 9e-53 MKTTGFLKTVALSAVLLVSSVAVSARNYDNNLIYNSEEENGMLIGQTVYKKEGSSLANYM KYNYKYDDNKRMIESQTMKWNSNKNNWENDLLVRYTYEGKTITTNYYKWNNRKSEFILAP EMTVTMDNPNL >gi|226332035|gb|ACIB01000021.1| GENE 22 21367 - 22122 782 251 aa, chain - ## HITS:1 COG:VC0693 KEGG:ns NR:ns ## COG: VC0693 COG3279 # Protein_GI_number: 15640712 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Vibrio cholerae # 1 116 1 114 237 93 43.0 4e-19 MYKVIIIDDEKAAIETLRRDLEVQTDLEIKGTAGNGAKGKKLIMDIHPDLLFLDIELPDI QGIRLLSEIREQVLWDMKVVFYTAYDKYLLQALRESAFDYLLKPYDIDELNLIIERYRKT MASVQPLPSFASAVGTLMPGRDLFMISTVTGFRFLRLEEIGYFEYLKDKRLWQVELFNQT KLCLKKNTTAGDIIGYSDAFVQISQSAIININYLAMIKSKQCLLYPPFSDKEDLIISRGF LKELQERFCII >gi|226332035|gb|ACIB01000021.1| GENE 23 22135 - 24165 1220 676 aa, chain - ## HITS:1 COG:ECs3260 KEGG:ns NR:ns ## COG: ECs3260 COG3275 # Protein_GI_number: 15832514 # Func_class: T Signal transduction mechanisms # Function: Putative regulator of cell autolysis # Organism: Escherichia coli O157:H7 # 469 669 357 551 565 88 28.0 4e-17 MRHRANLAIVFFCLLLPLTGCKQKINRSDSAFGGMSIDSVERCAQDSLFSNTRYSRSLLR HAMADAPDSLSYYYLLSFYSKSYFVTADFDSVLYYNRLVKRFCNEVELSAEVHDLLSTVY NMEGNVLMQRTQPDSAIISYKKAYEERLQGQQTDYLPDLCINLADANVHKGDYAYAAYYY RRALFICDSLGLPDRNKFPVYYGLGQTYMELRDFELSNHYYELAGNFFPQMSVSEKWTYL NNRGNHFYYKKDYPQAVHYIGRALEVVKSYPQMVFEQNLCKANLGELYVITNKLDSAQLY LDESYRFFSGIGNQSALYYIETQMIELALKKGNVALAGDIIRRSADYGHIDANMINIRNH YLQHYYEQVGNYKKAYEYQKHDLQLNDSIRNERVRTRVAELDMRYRQDTIVMRKELVIEK QKGEMEVLKLTTYIWALIGIVSVIVAGLVYWYMKKKRMFLQERHINQISRFRMENIRNRL SPHFTFNVLNREISRFRDGETLCGDLTELVKLLRKSLELTEKLSISLYDELEFVKTYIHL 
EQGRLGSDFSMDVKIEEDLDIQQVVIPSMVVQIPVENALKHGLAGIDGLKLLGISVCRKG SGILIDICDNGRGYSPQTLSSTRGTGTGLKVLYQTIQLLNDKNREKIRFEIKNLVNNGQT GTQVLIYIPLDYSYNL >gi|226332035|gb|ACIB01000021.1| GENE 24 24292 - 25191 754 299 aa, chain + ## HITS:1 COG:PA3816 KEGG:ns NR:ns ## COG: PA3816 COG1045 # Protein_GI_number: 15599011 # Func_class: E Amino acid transport and metabolism # Function: Serine acetyltransferase # Organism: Pseudomonas aeruginosa # 131 293 8 161 258 141 44.0 1e-33 MSPLNFTHILTQTVDELSESESYKGLFHQHKDGEPLPSPRVLCDVIELARSILFPGYYGN STVNSRTINYHIGVNVEKLFNLLTEQILAGLCFGNGDRCDECTEAKREEAARLAAKFISK LPHLRRVLATDVEAAYNGDPAAQSFGEVIFCYPAIKAISNYRIAHELLELGVPLIPRIIT EMAHSETGIDIHPGARIGTHFTIDHGTGVVIGATSIIGNNVKLYQGVTLGARSFPLDADG KPIKGIPRHPILEDNVIVYSNATILGRITIGRDATVGGNIWVTENIPAGARIVQTKAKK >gi|226332035|gb|ACIB01000021.1| GENE 25 25254 - 26609 1167 451 aa, chain + ## HITS:1 COG:slr0064 KEGG:ns NR:ns ## COG: slr0064 COG0116 # Protein_GI_number: 16331495 # Func_class: L Replication, recombination and repair # Function: Predicted N6-adenine-specific DNA methylase # Organism: Synechocystis # 3 379 5 384 384 265 37.0 2e-70 MLSIFMSEQFEMIAKTFQGLEEILAEELTTLGANDVQIGRRMVSFTGDKEMMYKANFCLR TAIRILKPIKHFTAKDADAVYEQIKAIRWEEILDVDKTFAVDAVVFSDEFRHSKFVSYKV KDAIVDYFRELNGKRPSVRISRPDVLLNIHIAQTTCTLSLDSSGESLHRRGYRQEAVEAP LNEVLAAGMILMTGWKGECDLIDPMCGSGTIPIEAALIARNIAPGVFRKEFAFEKWGDFD QNLFDRIYNDDSQEREFTHKIYGYDNNPKANEIATHNVKAAGVSKDIILKLQPFQQFEQP AEKSIIITNPPYGERISTNDLLGLYNMIGERLKHAFVGNDAWILSYREECFDQIGLKPSV KTPLFNGPLECEFRKYQIFDGKYKEFKSQEGGDENGERAPKERREFKPRREEGGFRGERS PREERNSEYGDRRPREFKGNREPKIKKPQED >gi|226332035|gb|ACIB01000021.1| GENE 26 26760 - 28904 1811 714 aa, chain + ## HITS:1 COG:CC2154 KEGG:ns NR:ns ## COG: CC2154 COG1506 # Protein_GI_number: 16126393 # Func_class: E Amino acid transport and metabolism # Function: Dipeptidyl aminopeptidases/acylaminoacyl-peptidases # Organism: Caulobacter vibrioides # 139 696 148 715 738 304 33.0 5e-82 MKTNSKQKSALLVMLLSLLIPNIMAQEPKMPTLEDLIPGGATYRSAENISGLQWWGDQCI 
KPGIEAVFMINPKNGKETPLTTRNIVNKALEAGNHGKLQHFYNVSFPWPKKSLMLITLPD KYIVYDFDYREVISTRPLPKEGANRDYHPETGHVAYTIGNNLYVDDRAVTNEPEGIVCGQ SVHRNEFGIKKGTFWSPLGNLLAFYRMDQSMVAQYPLVDVTAPIAEVNNIRYPMAGMTSH QVKVGIYNPATGKSIYLNAGDPTDRYFTNISWAPDEKSLYLIELNRDQNHAKLCRYDVET GELTATLFEEKSDKYVEPQDPIIFLPWDNSKFIYQSQKDGFSHLYLYDTNGRQIRQLTEG DWLVKEVLGFDTKKKEIIIASTEFSPLQDNLFRLDTKTGTRTPLGSAEGVHSGQLSPSGR YLIDQYNSPTVPRSINIIDVQSGKSVNLLTAADPFTGYKMPGIETGTIKAADGKTDLYYR LIKPADFDPNKKYPAIVYVYGGPHAQLVTNGWQNGARGWDIYMANKGYIMFTVDGRGSSN RGLDFENVTFRQLGIEEGRDQVKGTEFLKSLPYVDGNRIGVHGWSFGGHMTTALLLRYPE IFKVGVAGGPVIDWGYYEVMYGERYMDTPQSNPKGYKECNLKNLAGNLKGHLMIIHDDHD DTCVPQHTLSFMKACIDARTYPDLFIYPCHKHNVSGRDRVHLHEKITRYFEDYL >gi|226332035|gb|ACIB01000021.1| GENE 27 28926 - 30200 1679 424 aa, chain + ## HITS:1 COG:VC0275 KEGG:ns NR:ns ## COG: VC0275 COG0151 # Protein_GI_number: 15640304 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylamine-glycine ligase # Organism: Vibrio cholerae # 1 422 1 420 429 375 46.0 1e-104 MKILLLGSGGREHALAWKIAQSPKVEKLFIAPGNAGTGEVGENVNIKATDFAALGAFALE ESIDMIVVGPEDPLVEGIYDHFQSNPCLKNIAIIGPSKEGARLEGSKEFAKEFMHRHHIP TARYQSVTADTLNEGLAFLETLEAPYVLKADGLCAGKGVLILPTLEEAKKELKEMLGGMF GNASATVVIEEFLSGIECSVFVLTDGDHYKVLPVAKDYKRIGEGDKGLNTGGMGSVTPVP FADEVFMEKVRTRIIEPTVNGLKAEGITYKGFIFLGLINVKGEPMVIEYNVRMGDPETES VMLRIQSDLVELLEGVAEGNLDARSLVIDPRTATCVMMVSGGYPETYQKGYAINGLEAAR ATDSILFHAGTAMKDGQAVTSGGRVLAICSYGNDKADALAQCYKVADMIDFKDKNYRRDI GFDL >gi|226332035|gb|ACIB01000021.1| GENE 28 30200 - 31192 551 330 aa, chain + ## HITS:1 COG:no KEGG:BF0083 NR:ns ## KEGG: BF0083 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 330 1 330 330 551 100.0 1e-155 MRNQRLQGRITAGRFTLPIVIFTCIACWVLTSVLLPGLEVKESSYPLWNIVNIPAWANRI VSFLLFAGIGYFLIELNNTFAIIRMRASVQTSLYFLLITACPGMHLLYAGDVAAVTFLIS LYFLFKSYQQPRPAGYLFHSFALLSAGSVAFPQLTYFIPIWLMGASGFQSLTFRSFCGAI IGWSIPYWFLLGHAFFHNEIELFYQPFIHLADFRDIDFGRDFQLWEVVTLGYLFILYIVS SIHCIVAGYEDKIRTRAYLHFLIFLNFCIFLFIVLQPALSMNLLSLLLIGISILVGHLFV 
LTNSKSSNLFFIGSMITLIALFCFNIWTLL >gi|226332035|gb|ACIB01000021.1| GENE 29 31177 - 31653 412 158 aa, chain + ## HITS:1 COG:Cj0341c KEGG:ns NR:ns ## COG: Cj0341c COG1238 # Protein_GI_number: 15791709 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Campylobacter jejuni # 14 147 8 141 147 86 34.0 2e-17 MDAFIETVSQILIDWGYPGLFISALLAGSIVPFSSEIVLLGLVKLGLDPTLCLISASLGN TAGGMTCYYMGRLGRIDWIEKYFKVKKEKIDKMQRFLQGKGALMAFFAFLPAIGEVISIA LGYMRSNVWLTTASMFAGKLIRYIILLKAMQEALNLVM >gi|226332035|gb|ACIB01000021.1| GENE 30 31662 - 32438 430 258 aa, chain + ## HITS:1 COG:DR1672 KEGG:ns NR:ns ## COG: DR1672 COG4121 # Protein_GI_number: 15806675 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Deinococcus radiodurans # 8 237 22 232 234 109 34.0 6e-24 MERIIEYTADGSATLFVPELNEHYHSVKGALTESSHIFIDMGLKASASPAPHILEIGFGT GLNALLTLIEAERSGRQIHYTGIELYPLPWETVEKLRYNDRPGGDGEQRLTTGDEQTAQW MKALHTSPWGEDVRITPHFTLRKIQGDFTIMDRSSLITDRTSLFSLLYFDAFAPEKQPEM WTQELFDELYVMMEEEGILTTYCAKGVVRRMLQAAGFIVERLPGPPGGKREILRAKKRSQ DSPAVTQCNAQPTLKQKG >gi|226332035|gb|ACIB01000021.1| GENE 31 32444 - 33376 706 310 aa, chain + ## HITS:1 COG:MTH604 KEGG:ns NR:ns ## COG: MTH604 COG0803 # Protein_GI_number: 15678632 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, periplasmic component/surface adhesin # Organism: Methanothermobacter thermautotrophicus # 30 305 22 293 295 193 36.0 3e-49 MRMNRTHQSIVTHIPGRGMRGQTVLFLFLLLLSACSGRGKGDAGERIITVTMEPQRYFTE AIAGDKFTVRSMVPKGSSPETYDPTPQQLVSLGESEAYLRIGYIGFERSWMDRLMNNTPH IQVFDTSTGVDLIFESAFDHGDHRHEGGVEPHIWNSTANALIIAGNTFKALTVLDKENEA YYKTRYDSLCQRIEQTDSLIRQTLSVPGADRAFIIYHPALSYFARDYGLHQISIEEGGKE PSPAHLKGLMDLCKKEGVRVIFVQPEFDRRNAEIIAKQTGTRVISINPLSYDWEEEMLNV ARSLRGESGY >gi|226332035|gb|ACIB01000021.1| GENE 32 33397 - 34206 711 269 aa, chain + ## HITS:1 COG:slr2044 KEGG:ns NR:ns ## COG: slr2044 COG1121 # Protein_GI_number: 16329703 # Func_class: P Inorganic ion transport and metabolism # 
Function: ABC-type Mn/Zn transport systems, ATPase component # Organism: Synechocystis # 16 260 19 267 289 204 38.0 1e-52 MDKERKPHKHTEAATSSPLLSIRGLSAAYDGRTVLHDVDLEVYEHDFLGIIGPNGGGKTT LIKCILGLLRPTGGEIIKYHEPLTTGYLPQYNSIDRSFPISVLEVVLSGLSSKKSLTGRF NDRHREKARQVIHRMGLEGLEHRAIGQLSGGQLQRALLGRAIISDPQLLILDEPSTYIDK RFEARLYQLLAEINRDCAIILVSHDIGTVLQQVKNIACVNETLDYHPAASVNTEWLERNF NCPIELLGHGTLPHRVLGEHCHCHEDGKA >gi|226332035|gb|ACIB01000021.1| GENE 33 34379 - 35137 807 252 aa, chain - ## HITS:1 COG:no KEGG:BF0090 NR:ns ## KEGG: BF0090 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 252 1 252 252 462 98.0 1e-129 MKTELKGWLADNTVTTDNKEDKILVLESAGNLTLSDVLDEMKKEDTGLRAETLKHAVDLF QRTVSELVLNGYSVNTGLFRAVPQFRGVIDGGVWNPEKNSIYVSFNQDKDLREAIARTGV KILGAKGDPAYFIGGEDAATRATDGSATAGRNYRLQGKNIKVTGTDSAVGIVLIDEKGTE TKLPMDMIAVNNPSEVLVLLPADLKDGVYELRLTTQFSTGNKLLKVPRTVIRSLVIGLPS GGGGDIVDDPTA >gi|226332035|gb|ACIB01000021.1| GENE 34 35166 - 36473 520 435 aa, chain - ## HITS:1 COG:no KEGG:BF0089 NR:ns ## KEGG: BF0089 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 435 104 538 538 761 97.0 0 MELTEDTLPRVSTRATVPAGVYFRLIVFRKSGNNYVFQSVADYASNGTGTPVLKQGKLLT RSGTIRVVGYSFNTTADLGDMPSTYAYNSSTVSIPDMSKDFMTFDSGDITNVNSLSHNLL PVSFSQKLCKLTITISPTGFPSNTITNCTGVYVKQGGNSTSWKIGPSTNVVAANTNNTAA FSPKTTLSTTIRMVPFAGARTITVHFNTLTISGRTVPNNTEITSTQSIQLKEGKSYTMKI QFKKTIGINVPSGSINLTTNGCSSADKTALAKLIWADGNLKSTGSASYVWGTYSEYGYYY TWKSTYTGNTSLNNTDPCPKLKSDYGTGWRTPSKDEMDKLSRCTNKAVVTSGGNKGFWFM NSTIGLFLPFGGSHGGATGSKTTPPGNAGSEGAYWCLDANGSTYGYLLYFTTGGESRTGH YPKTNGQSVRCVKNK Prediction of potential genes in microbial genomes Time: Tue May 17 22:42:08 2011 Seq name: gi|226332034|gb|ACIB01000022.1| Bacteroides sp. 
3_2_5 cont1.22, whole genome shotgun sequence Length of sequence - 10255 bp Number of predicted genes - 7, with homology - 7 Number of transcription units - 2, operones - 2 average op.length - 3.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 67 - 126 3.5 1 1 Op 1 . + CDS 338 - 3697 3841 ## BF0076 hypothetical protein 2 1 Op 2 . + CDS 3714 - 4328 537 ## COG0726 Predicted xylanase/chitin deacetylase 3 1 Op 3 . + CDS 4328 - 5257 573 ## COG1600 Uncharacterized Fe-S protein + Term 5470 - 5519 5.0 - Term 5168 - 5211 5.3 4 2 Op 1 . - CDS 5291 - 6535 1142 ## BF0073 hypothetical protein 5 2 Op 2 9/0.000 - CDS 6562 - 7353 182 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 6 2 Op 3 1/0.000 - CDS 7420 - 8262 899 ## COG3717 5-keto 4-deoxyuronate isomerase - Prom 8320 - 8379 6.4 7 2 Op 4 . - CDS 8386 - 9966 1047 ## COG3119 Arylsulfatase A and related enzymes - Prom 10118 - 10177 7.0 Predicted protein(s) >gi|226332034|gb|ACIB01000022.1| GENE 1 338 - 3697 3841 1119 aa, chain + ## HITS:1 COG:no KEGG:BF0076 NR:ns ## KEGG: BF0076 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1119 1 1119 1119 2314 100.0 0 MKQYKTVNNLVGWITFLIAATVYCMTIEPTASFWDCPEFITTAYKLEVGHPPGAPFFMLT ANLFTQFVSDPALVAKMVNYMSALMSGACILFLFWSITHLVRKLVITDETNITRGQLITV MGSGLVGALAYTFSDTFWFSAVEGEVYAYSSMFTAIVFWLILKWEDVADQPHSDRWIILI AYLTGLSIGVHLLNLLCLPAIVLVYYYKKVPGANAKGSLLALAGSMVLVAAVLYGIVPGV VKVGGWFELLFVNSLGMPFNTGVIVYVALLAAAIIWGIYESYNEKSRTRMNLSFLLTIAM LGIPFYGHGASAVIIGILVLGVLAAYLFASKLNEKIRMSARTMNTALLCTMMIMVGYSSY ALIVIRSVANTPMDQNSPEDIFTLGEYLGREQYGTRPLFYGPAYSSKVALDVEDGYCVPR QKSTDTKYVRKEKTSPDEKDSYVELPGRVEYEYAQNMLFPRMYSSAHTAYYKSWQDITGY DVPYDQCGEMLMVNMPTQWDNIKFFFSYQLNFMYWRYFMWNFAGRQNDIQSSGEIEHGNW ITGIPFIDNLLYGDQNMLPQELKDNKGHNVFYCLPLILGIIGLFWQAWRGQKGIQQFWVV FFLFFMTGIAIVLYLNQTPGQPRERDYAYAGSFYAFAIWIGMGVAGIVHLLRNYMKEVPA AALTSAVCLLVPIQMASQTWDDHDRSGRYVARDFGQNYLMSLQESGNPIIYTNGDNDTFP LWYNQETEGFRTDARTCNLSYLQTDWYIDQMKRPAYDSPALPITWDRTEYMEGQNEYVPI 
RPDFKKQIDKAYKAAEEEVLNGKNPEALNNIRAQFGDNPYELKNILKYWVRTKDGQAVIP TDSIVVKIDKEAVRRSGMMIPEALGDSIPDYMHISLKDEKGNPKRALYKSELMMLEMLAN ANWERPIYMAITVGTDNQLNMREHFIQEGLTYRFTPFDTEALGATIDSEKMYDNLMNKFK FGGIDKPGIYIDENTMRMCYTHRRIFAQLITQLMKEGKKDKALAALEYAEKMIPAFNVPY DVQNGALEMAEAYYQLGNNTKADQIIDELANKSVEYLTWYLSLDDNHLLMSQREFIMHLS ALDMEVKMMEKYKSKLAGNYTPKVNELYNIYVGRMKAHQ >gi|226332034|gb|ACIB01000022.1| GENE 2 3714 - 4328 537 204 aa, chain + ## HITS:1 COG:all4345 KEGG:ns NR:ns ## COG: all4345 COG0726 # Protein_GI_number: 17231837 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted xylanase/chitin deacetylase # Organism: Nostoc sp. PCC 7120 # 25 201 105 284 305 124 40.0 1e-28 MFIEQPPWFFRAIYPDAIFRMDPDEKAVYLTFDDGPIPEVTPWVLELLDKHNIKATFFMV GDNIRKHPDVFRMVVERGHRIGNHTFNHIRGFEYLSSNYLANTDKANEMMKTDLFRPPHG HMRWMQYMTLKRHYKIIMWDLVTRDYSKKLRPPQVLANVMRYARNGSIITFHDSLKSWNN GNLQYALPRAIDFLKEEGYEFRLL >gi|226332034|gb|ACIB01000022.1| GENE 3 4328 - 5257 573 309 aa, chain + ## HITS:1 COG:PA4950 KEGG:ns NR:ns ## COG: PA4950 COG1600 # Protein_GI_number: 15600143 # Func_class: C Energy production and conversion # Function: Uncharacterized Fe-S protein # Organism: Pseudomonas aeruginosa # 7 304 19 319 361 229 42.0 5e-60 METKINSKTLKAEALRLGFSACGIAPAEPIDQAHQNALKMWLDADRQAGMTYMANHFDKR CDPALLVEGTRCVVSVALNYYPATRIPDEEYQFAWYAYGKDYHDLMREKLAALFRFIQES DVPELNGRMFCDTAPVPERYWAWRAGLGWIGKNTQLIIPHAGSTFFLGELFLNTEADTYD RPQPNRCGRCNRCLQACPTKALETPYNLNAHRCLSYLTIENKSEIPDSIAPFMGNRVYGC DECQKACPWNRFATPCRTPELQPSPEFMNMKKEDWKQLSEEKYRALFKGSAVKRAKYSGL IRNIRQMED >gi|226332034|gb|ACIB01000022.1| GENE 4 5291 - 6535 1142 414 aa, chain - ## HITS:1 COG:no KEGG:BF0073 NR:ns ## KEGG: BF0073 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 414 1 414 414 828 100.0 0 MKKVFLFVAAVLLCLACNEAGRTVSVTVSNATSLERSGEMVEVSMGEVSSKLHLPDTAQI VVVDAEGQQVPYQITSDEKVIFPVTVQANGSAVYTIKVGIPQECPVKACGRYYPERVDDV AWENDLTAFRAYGPALQETGERAFGYDIWTKYNTTEPVVEARYEGELNPDMKAKIAELGK 
TDPKAAQELYRSVSYHVDHGNGLDCYKVGPTLGGGTTALMAGDTIIYPYCYATQEILDNG PLRFTVKLVYNPLVVKGDSTIIETRIITLDAGSYCNKTVVSYTNLKETMPLAVGIVLHEP DGAIVADAANGYMTYVDPTDNAGGDNGKIFVGAAFPTLVKEAKAVLFPEKEKKELRGGAD GHVLAISEYEPGADFTYYWGAAWSKADIKDSAAWNAYMADFAQKVRNPLTVTVK >gi|226332034|gb|ACIB01000022.1| GENE 5 6562 - 7353 182 263 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 9 253 4 238 242 74 25 3e-13 MVNFSLEGKVAWVTGASYGIGFALATAFSEAGAKIVFNDISRELVDKGLAAYKELGIEAR GYVCDVTSEEQVNALVAQIEKEVGVVDILVNNAGIIKRIPMCEMTAEQFRQVIDVDLNAP FIVSKAVIPSMIKKGHGKIINICSMMSELGRETVSAYAAAKGGLKMLTRNIASEYGEFNI QCNGIGPGYIATPQTAPLREVQPDGSRHPFDSFIISKTPAARWGTPEDLMGPAVFLASDA SNFVNGHVLYVDGGILAYIGKQP >gi|226332034|gb|ACIB01000022.1| GENE 6 7420 - 8262 899 280 aa, chain - ## HITS:1 COG:YPO1725 KEGG:ns NR:ns ## COG: YPO1725 COG3717 # Protein_GI_number: 16121985 # Func_class: G Carbohydrate transport and metabolism # Function: 5-keto 4-deoxyuronate isomerase # Organism: Yersinia pestis # 6 280 2 278 278 303 50.0 2e-82 MKTNYEIRYAAHPEDARSYDTKRIRRDFLIEKVFSADEVNMVYSMYDRMVVGGAMPVKEA LKLEAIDPLKAPYFLTRREMGIFNVGGPGVVRAGDAIFQLDYKEALYLGAGDRDVTFEST DAAHPAKFYFNSLAAHRNYPDKKVTKADAVVAEMGTLEGSNHRNINKMLVNQVLPTCQLQ MGMTELAPGSVWNTMPAHVHSRRMEAYFYFEVPEEHAVCHFMGEVDETRHVWMKGDQAVL SPEWSIHSAAATHNYTFIWGMGGENLDYGDQDFSLITDLK >gi|226332034|gb|ACIB01000022.1| GENE 7 8386 - 9966 1047 526 aa, chain - ## HITS:1 COG:SMc00127 KEGG:ns NR:ns ## COG: SMc00127 COG3119 # Protein_GI_number: 15964702 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Sinorhizobium meliloti # 49 481 4 430 512 171 30.0 3e-42 MKRRDFLKCSLAVGAGLAASPSTYAFNGESKETGNDSSKLSKAPAIKGGKPHIIFIMSDQ HRGDALHCMGNKAVISPNIDKLAQEGSLFVCGYSSAPSSTPARAGLLTGMSPWHHGMLGY GKVASKYKYEMPQMLRDLGYYTFGIGKMHWFPQKALHGFHATLVDESGRSETRDFISDYR EWFQLQAPGKNPDLTGIGWNNHNAGTYKLEERLHPTAWTGQTACELIRNYDSDQPLFLKV SFARPHSPYDPPKRYLDMYEKVDIPVPFVGDWCGKYAERKDPERVSKDAAFANLGEEYAV 
NSRRHYYANVTFIDDQIGQIIQILKEKGMYENAIICYTADHGDMLGDHYHWRKTYAYEGS AKIPYIIKWPSAMTTQAIRGKRIEQPVELRDFLPTFIELAGGTVPDDMDGKSLVALASGN KNGWRKYIDLEHATCYSADNYWCALTDGKMKYIWFIHTGEEQLFDLSSDPGEQKNLSGNS RYADRLVEMRKAMVDHLQERGTEFVKDGKLAVRDQTLLYSPNYPKD Prediction of potential genes in microbial genomes Time: Tue May 17 22:42:46 2011 Seq name: gi|226332033|gb|ACIB01000023.1| Bacteroides sp. 3_2_5 cont1.23, whole genome shotgun sequence Length of sequence - 77623 bp Number of predicted genes - 71, with homology - 69 Number of transcription units - 34, operones - 16 average op.length - 3.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 49 - 207 128 ## - TRNA 52 - 122 48.0 # Gln TTG 0 0 2 1 Op 2 . + CDS 286 - 3558 2517 ## COG0793 Periplasmic protease + Term 3576 - 3622 14.0 - Term 3563 - 3610 14.2 3 2 Op 1 . - CDS 3672 - 4979 737 ## PROTEIN SUPPORTED gi|163739624|ref|ZP_02147033.1| 50S ribosomal protein L32 4 2 Op 2 . - CDS 5032 - 5637 452 ## COG0084 Mg-dependent DNase 5 2 Op 3 . - CDS 5674 - 5895 155 ## COG0759 Uncharacterized conserved protein 6 2 Op 4 . - CDS 5892 - 6272 296 ## BF0065 ribonuclease P (EC:3.1.26.5) - Term 6273 - 6342 8.1 7 3 Op 1 . - CDS 6363 - 7115 629 ## BF0075 uroporphyrinogen-III synthase 8 3 Op 2 . - CDS 7120 - 7950 302 ## BF0063 hypothetical protein 9 3 Op 3 . - CDS 7947 - 8534 660 ## COG1611 Predicted Rossmann fold nucleotide-binding protein - Prom 8559 - 8618 2.5 - Term 8567 - 8617 4.6 10 4 Tu 1 . - CDS 8675 - 9967 1471 ## COG0192 S-adenosylmethionine synthetase - Prom 9987 - 10046 8.5 + Prom 9923 - 9982 4.9 11 5 Tu 1 . + CDS 10176 - 10370 149 ## BF0060 hypothetical protein 12 6 Op 1 . - CDS 10350 - 10811 302 ## PROTEIN SUPPORTED gi|148994682|ref|ZP_01823786.1| 50S ribosomal protein L13 13 6 Op 2 . - CDS 10811 - 11869 1076 ## COG0809 S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) 14 6 Op 3 . - CDS 11918 - 12619 572 ## COG0130 Pseudouridine synthase 15 6 Op 4 . 
- CDS 12627 - 13427 751 ## COG1968 Uncharacterized bacitracin resistance protein 16 6 Op 5 . - CDS 13434 - 13667 348 ## BF0067 hypothetical protein 17 6 Op 6 . - CDS 13708 - 14589 757 ## COG2177 Cell division protein 18 6 Op 7 . - CDS 14589 - 15488 605 ## COG0500 SAM-dependent methyltransferases - Prom 15547 - 15606 5.9 19 7 Op 1 . - CDS 15638 - 17647 1250 ## BF0052 hypothetical protein 20 7 Op 2 . - CDS 17722 - 18765 566 ## BF0051 hypothetical protein 21 7 Op 3 . - CDS 18845 - 19066 157 ## BF0048 hypothetical protein 22 7 Op 4 . - CDS 19026 - 19280 102 ## BF0050 hypothetical protein 23 7 Op 5 . - CDS 19267 - 19884 227 ## BF0050 hypothetical protein 24 7 Op 6 . - CDS 19922 - 20893 426 ## BF0049 hypothetical protein - Prom 20942 - 21001 8.9 - Term 20977 - 21030 9.2 25 8 Tu 1 . - CDS 21045 - 21296 247 ## BF0046 hypothetical protein - Prom 21371 - 21430 3.2 26 9 Op 1 . - CDS 21566 - 21799 170 ## BF0041 hypothetical protein 27 9 Op 2 . - CDS 21881 - 23020 261 ## BF0040 hypothetical protein 28 10 Op 1 . - CDS 23347 - 23505 95 ## BF0050 hypothetical protein 29 10 Op 2 . - CDS 23478 - 23597 76 ## BF0047 hypothetical protein 30 10 Op 3 . - CDS 23594 - 24712 558 ## BF0046 hypothetical protein - Prom 24738 - 24797 5.7 - Term 24789 - 24834 -0.8 31 11 Op 1 . - CDS 25075 - 25209 58 ## 32 11 Op 2 . - CDS 25178 - 25405 125 ## BF0035 hypothetical protein - Prom 25467 - 25526 5.4 33 12 Tu 1 . - CDS 25889 - 26956 553 ## BF0037 hypothetical protein - Prom 26991 - 27050 1.8 34 13 Tu 1 . - CDS 27096 - 27314 172 ## BF0036 hypothetical protein - Prom 27355 - 27414 8.6 35 14 Tu 1 . - CDS 27419 - 28636 318 ## BF0035 hypothetical protein - Prom 28667 - 28726 3.3 + Prom 29269 - 29328 7.2 36 15 Tu 1 . + CDS 29385 - 29546 64 ## BF0033 hypothetical protein + Term 29769 - 29805 1.0 37 16 Tu 1 . + CDS 30004 - 31338 454 ## BF0032 two-component system response regulator - Term 31586 - 31639 11.1 38 17 Op 1 . - CDS 31730 - 33844 1868 ## COG3250 Beta-galactosidase/beta-glucuronidase 39 17 Op 2 . 
- CDS 33900 - 35318 1157 ## COG3669 Alpha-L-fucosidase - Prom 35369 - 35428 3.5 - Term 35359 - 35414 9.3 40 18 Tu 1 . - CDS 35436 - 36809 457 ## PROTEIN SUPPORTED gi|227395721|ref|ZP_03879044.1| SSU ribosomal protein S12P methylthiotransferase - Prom 36962 - 37021 6.3 + Prom 36952 - 37011 5.6 41 19 Op 1 . + CDS 37140 - 37469 229 ## BF0027 hypothetical protein 42 19 Op 2 . + CDS 37506 - 39005 1562 ## COG0427 Acetyl-CoA hydrolase + Term 39040 - 39088 8.7 - Term 39023 - 39079 9.0 43 20 Tu 1 . - CDS 39081 - 40490 970 ## COG2027 D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) - Term 40512 - 40575 0.3 44 21 Op 1 . - CDS 40594 - 41937 645 ## PROTEIN SUPPORTED gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1 45 21 Op 2 . - CDS 41944 - 42342 357 ## BF0023 hypothetical protein + Prom 42143 - 42202 4.0 46 22 Tu 1 . + CDS 42450 - 44021 1587 ## COG0029 Aspartate oxidase + Term 44055 - 44096 8.8 - Term 44039 - 44088 8.5 47 23 Tu 1 . - CDS 44105 - 44683 648 ## COG1592 Rubrerythrin - Prom 44895 - 44954 6.0 + Prom 44814 - 44873 7.2 48 24 Tu 1 . + CDS 44944 - 46623 1875 ## COG0659 Sulfate permease and related transporters (MFS superfamily) + Term 46648 - 46701 13.9 + Prom 46707 - 46766 8.2 49 25 Op 1 2/0.000 + CDS 46824 - 49238 1672 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 50 25 Op 2 . + CDS 49239 - 49865 302 ## COG3182 Uncharacterized iron-regulated membrane protein + Term 49873 - 49940 11.2 - Term 49868 - 49920 6.2 51 26 Op 1 . - CDS 49969 - 51582 1197 ## COG3119 Arylsulfatase A and related enzymes 52 26 Op 2 . - CDS 51594 - 52109 433 ## COG1853 Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 53 26 Op 3 . - CDS 52151 - 52924 689 ## COG0731 Fe-S oxidoreductases 54 26 Op 4 . - CDS 52982 - 55009 1039 ## BF0013 hypothetical protein - Prom 55086 - 55145 8.6 + Prom 55044 - 55103 4.8 55 27 Tu 1 . 
+ CDS 55123 - 55728 517 ## BF0012 hypothetical protein + Term 55756 - 55812 -0.8 56 28 Op 1 13/0.000 - CDS 55845 - 58028 219 ## PROTEIN SUPPORTED gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 57 28 Op 2 . - CDS 58033 - 59391 685 ## COG0845 Membrane-fusion protein 58 28 Op 3 . - CDS 59388 - 60056 313 ## BF0009 putative glycosyltransferase 59 28 Op 4 . - CDS 60037 - 61155 544 ## COG0438 Glycosyltransferase - Prom 61245 - 61304 3.1 60 29 Tu 1 . - CDS 61312 - 63123 524 ## BF0007 hypothetical protein - Prom 63199 - 63258 3.9 - Term 63209 - 63246 3.5 61 30 Op 1 . - CDS 63313 - 65118 517 ## BF0006 hypothetical protein 62 30 Op 2 . - CDS 65179 - 65712 350 ## BF0005 hypothetical protein 63 30 Op 3 . - CDS 65746 - 66585 439 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 64 30 Op 4 . - CDS 66599 - 67570 382 ## BF0003 hypothetical protein 65 30 Op 5 . - CDS 67563 - 69032 649 ## BF0002 putative outer membrane protein TolC - Prom 69091 - 69150 11.7 66 31 Tu 1 . - CDS 69187 - 69768 185 ## PROTEIN SUPPORTED gi|163764761|ref|ZP_02171815.1| ribosomal protein S11 + Prom 69997 - 70056 7.6 67 32 Tu 1 . + CDS 70171 - 71163 957 ## COG0379 Quinolinate synthase + Term 71219 - 71279 11.3 - Term 71473 - 71517 13.2 68 33 Op 1 . - CDS 71754 - 72338 305 ## PROTEIN SUPPORTED gi|71274727|ref|ZP_00651015.1| Ham1-like protein 69 33 Op 2 . - CDS 72349 - 73248 1099 ## COG1284 Uncharacterized conserved protein 70 33 Op 3 . - CDS 73263 - 76094 2926 ## COG0495 Leucyl-tRNA synthetase - Prom 76173 - 76232 7.8 - Term 76355 - 76405 12.1 71 34 Tu 1 . 
- CDS 76410 - 77486 696 ## BF4585 hypothetical protein - Prom 77514 - 77573 3.5 Predicted protein(s) >gi|226332033|gb|ACIB01000023.1| GENE 1 49 - 207 128 52 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLSYQDSNLDKQNQKLLCYHYTIGQAIALSVKKRCKDNAMQAIFQIFLDFFV >gi|226332033|gb|ACIB01000023.1| GENE 2 286 - 3558 2517 1090 aa, chain + ## HITS:1 COG:VCA0045 KEGG:ns NR:ns ## COG: VCA0045 COG0793 # Protein_GI_number: 15600816 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Vibrio cholerae # 715 1070 22 379 394 289 42.0 2e-77 MRKLFVCIALGLTTLTGNAASPLWMRDVQISPDGTEIAFCYKGDIYKVSAGGGTAIQLTT QPSYECTPIWSPDSKQIAFASDRNGNFDIFVMPATGGTAQRLTTHSSSELPSAFTPDGKY ILFSASIQDPSQSALFPTTAMTELYKVPVNGGRTEQVLGTPAEAVCYAPSGEFFLYQDRK GFEDEWRKHHTSSITRDIWLYDTKTGKHTNLTNHAGEDRNPVLSPDGKSVYLLSERKGSF NVYSFPLDNAQDLKAVTSFKTHPVRFLSMSHGGTLCYAYDGEIYTQKDNATPQKINIDIV RDDQDKIADLTFTNGATSGTVSPDGKQIAFIVRGEVFVTSTDYATTKQITHTPAREAGLT FAPDNRTLAYASERNGNWQLFLAKIARKEEANFPNATIIEEEVLLPSATVERAYPQFSPD GKELAFIEERNRLMVINLDTKKVRQITDGSTWFSTDGNFDYQWSPDGKWFTLEFIGNRHD PYSDIGLVSAKGDSPITNLTNSGYMSGSPRWVLDGNAILFTTERYGMRAHASWGSQNDAM LVFLNQDAFDKFRLSKEDYELQKELEKEQQKDKEKASIDPKKDKKKDPQTDTEKKDEIKN ILVELNGLEDRIIRLTPNSSNLGSTIISKDGETLYYLSAFEGGFDLWKMDLRKKETKLLH KMNAGWASMDMDKDGKSLFVLGGNTMQKMDLSGETLKPINYKAEMKMDLAAEREYMFDHV YKQQQKRFYNANMHGVNWDTMSAAYRKFLPHINNNYDFAELLSEWLGELNVSHTGGRFSP SIPGDATASLGVLTDWNYKGKGASIMEVIEKGPFDHARSKVKAGTIIEKINGQEITPETD YHTLLNDKANKKTLVSLYNPQSGERWEEVVIPIGNGILNNLLYKRWVKQRAADVDKWSDG RLGYVHIQSMGDDSFRSVYSDILGKYNNREGIVIDTRFNGGGRLHEDIEVLFSGKKYFTQ VVRGREACDMPSRRWNKPSIMLTCEANYSNAHGTPWVYSHQKLGKLVGMPVPGTMTSVSW ERLQDPSLVFGIPVIGYRLPDGSYLENTQLEPDIKVANSPETIVKGEDTQLKTAVEELLK ELPAGKGKKH >gi|226332033|gb|ACIB01000023.1| GENE 3 3672 - 4979 737 435 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163739624|ref|ZP_02147033.1| 50S ribosomal protein L32 [Phaeobacter gallaeciensis BS107] # 1 432 2 414 418 288 37 6e-77 TYYHSMNFVEELRWRGMVHDMMPGTEELLAKEQVTAYVGIDPTADSLHIGHLCGVMILRH 
FQRCGHKPLALIGGATGMIGDPSGKSAERNLLDEETLRHNQACIKKQLAKFLDFESDAPN RAELVNNYDWMKEFTFLDFAREVGKHITVNYMMAKESVKKRLNGEARDGLSFTEFTYQLL QGYDFLHLYETKGCKLQMGGSDQWGNITTGTELIRRTNGGEAYALTCPLITKADGGKFGK TESGNIWLDPRYTSPYKFYQFWLNVSDADAERYIKIFTSLDKAEIDGLVAEHNEAPHLRV LQKRLAKEVTVMVHSEEDYNAAVDASNILFGNATSDALKKLDEDTLLAVFEGVPQFEISR DALAEGVKAVDLFVDNAAVFASKGEMRKLVQGGGVSLNKEKLAAFDQVITTADLLDEKYL LVQRGKKNYYLIIAK >gi|226332033|gb|ACIB01000023.1| GENE 4 5032 - 5637 452 201 aa, chain - ## HITS:1 COG:VC2353 KEGG:ns NR:ns ## COG: VC2353 COG0084 # Protein_GI_number: 15642350 # Func_class: L Replication, recombination and repair # Function: Mg-dependent DNase # Organism: Vibrio cholerae # 24 162 78 224 283 73 27.0 3e-13 MPVELGQAIQNCQPAEFDPLAGAYYSVGIHPWYLTRENLDRQWEMLLAAIQCPQVLAIGE AGLDKLVRTDYMLQQEVFEKQAMLAHEMKYPLVIHAVRSANEIICLRKKMKPSNPWIIHG FRGKKELALQYIREGIYVSLGEKYQEEVLWGIPLEYLFLETDESMIDIHCLYERAALLLE IPLCKLMQQVRQNINNVFFRQ >gi|226332033|gb|ACIB01000023.1| GENE 5 5674 - 5895 155 73 aa, chain - ## HITS:1 COG:FN0003 KEGG:ns NR:ns ## COG: FN0003 COG0759 # Protein_GI_number: 19703355 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 8 73 4 69 82 92 66.0 3e-19 MKRLLSYILLLPIYFYRACISPMTPPSCRFTPTCSQYAIEAIKKHGPFKGLYLAVRRILR CHPWGGSGYDPVP >gi|226332033|gb|ACIB01000023.1| GENE 6 5892 - 6272 296 126 aa, chain - ## HITS:1 COG:no KEGG:BF0065 NR:ns ## KEGG: BF0065 # Name: rnpA # Def: ribonuclease P (EC:3.1.26.5) # Organism: B.fragilis # Pathway: not_defined # 20 126 1 107 107 169 100.0 3e-41 MANTLCKAERLNSKILIEKMFAGGSKSFSIFPLRVVYMPVENQDVQASILLSVSKKRFKR AVKRNRVKRQLREAYRMHKHQLLQILTDKQQQLAIAFIYLSDELTSSAEIEEKMKILLAR ISEKLV >gi|226332033|gb|ACIB01000023.1| GENE 7 6363 - 7115 629 250 aa, chain - ## HITS:1 COG:no KEGG:BF0075 NR:ns ## KEGG: BF0075 # Name: not_defined # Def: uroporphyrinogen-III synthase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 250 1 250 250 486 100.0 1e-136 MKIKKVLVSQPKPASEKSPYYDIAEKYGVKIDFRPFIKVESVSAKEFRQQKVSILDHTAV 
IFTSRHAIDHFFHLCTELRVTIPETMKYFCVTEAVALYIQKYVQYRKRKIFFGATGKIED LIPSIVKHKTEKYLVPMSDVHNDDVRDLLDKNNIQHTECVMYRTVSNDFMEGEEFDYDML VFFSPAGVSSLKKNFPDFDQKDIKIGTFGSTTAQAVRDAGLRLDLEAPNVKAPSMTAALD LFIKENNKGK >gi|226332033|gb|ACIB01000023.1| GENE 8 7120 - 7950 302 276 aa, chain - ## HITS:1 COG:no KEGG:BF0063 NR:ns ## KEGG: BF0063 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 276 1 276 276 494 99.0 1e-138 MIFVQDSLGAQEADTVQHVISAAGAPQDTLGSRVDLQAVSETVTGAEGMPIPYSPRTDDG LAMILLGCFFVSAYVLARSKKFLLQQVKDFMLHRERTSIFASSTAADMRYLLLLIVQTCV LGGVCIFNYFNDIRPALMERVSPHILLGVYVAVCLLYLLFKWILYSFLGWVFFDKSKTDI WLESYSTLIYYLGFALFPFVLFLVYFDLNVTFLVSIGCVLVIFTKILMFYKWLKLFSCNI YGVFLLILYFCALEIVPCLIVYQGMIQLNNVLIINF >gi|226332033|gb|ACIB01000023.1| GENE 9 7947 - 8534 660 195 aa, chain - ## HITS:1 COG:RSc2087 KEGG:ns NR:ns ## COG: RSc2087 COG1611 # Protein_GI_number: 17546806 # Func_class: R General function prediction only # Function: Predicted Rossmann fold nucleotide-binding protein # Organism: Ralstonia solanacearum # 4 183 1 180 194 164 44.0 7e-41 MNQITSVCVYCASSTKIDQTYFDAAMKLGHLLANRHIRLINGAGNIGLMRSVADAVLQNG GEVTGVIPHFMVDQGWHHTGLTELIEVESMHERKRLMAEKSDAVIALPGGCGTLEELLEI ITWKQLGLYLNPIVILNTNGFFDPLMEMLENAIEGNFMRKQHGDIWHVAHTPEEAVELVY SIPVWDGSIRKFAAI >gi|226332033|gb|ACIB01000023.1| GENE 10 8675 - 9967 1471 430 aa, chain - ## HITS:1 COG:TM1658 KEGG:ns NR:ns ## COG: TM1658 COG0192 # Protein_GI_number: 15644406 # Func_class: H Coenzyme transport and metabolism # Function: S-adenosylmethionine synthetase # Organism: Thermotoga maritima # 1 430 1 395 395 416 52.0 1e-116 MGYLFTSESVSEGHPDKVADQISDAVLDKLLAYDPSSKVACETLVTTGQVVLAGEVKTGA YVDLQLIAREVIQKIGYTKGEYMFESNSCGVLSAIHEQSADINRGVEREDPMNQGAGDQG MMFGYATNETENYMPLSLDLAHRILLVLADIRREGKEMTYLRPDAKSQVTIEYDDNGTPV RIDTIVVSTQHDEFILPADNSAAAQLKADEEMLAVIRKDVIEVLMPRVIASINHPKVLAL FNDHIIYHVNPTGKFVIGGPHGDTGLTGRKIIVDTYGGKGAHGGGAFSGKDPSKVDRSAA YAARHIAKNLVAAGVADEMLVQVSYAIGVARPINIYVNTYGRSNVKMSDGEIARKIDELF 
DLRPKAIEDRLKLRYPIYSETAAYGHMGREPQMVTKHFQSRYEGDRTMEVELFTWEKLDY VDKVKAAFGL >gi|226332033|gb|ACIB01000023.1| GENE 11 10176 - 10370 149 64 aa, chain + ## HITS:1 COG:no KEGG:BF0060 NR:ns ## KEGG: BF0060 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 64 1 64 64 115 100.0 6e-25 MKNKLKTAFTLLVYIAVTVGIYALICHLNHQPFDDLRILYAVLIGCVAYLPRHLMVRKSR KSQK >gi|226332033|gb|ACIB01000023.1| GENE 12 10350 - 10811 302 153 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148994682|ref|ZP_01823786.1| 50S ribosomal protein L13 [Streptococcus pneumoniae SP9-BS68] # 2 152 121 268 278 120 42 2e-26 MKVYLGLGTNLGDKELNLRVALQKIEERIGKIISLSAFYATAPWGFQSENNFLNAAVGVE TVLSPVGILESTQRIEQEIGRLHKSRDGVYSDRLIDIDLLLYGDKILQDERLIVPHPLMT DRKFVLEPLAEIAPDVVHPVFHKTIKELFLALS >gi|226332033|gb|ACIB01000023.1| GENE 13 10811 - 11869 1076 352 aa, chain - ## HITS:1 COG:SA1466 KEGG:ns NR:ns ## COG: SA1466 COG0809 # Protein_GI_number: 15927220 # Func_class: J Translation, ribosomal structure and biogenesis # Function: S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) # Organism: Staphylococcus aureus N315 # 1 350 1 341 341 293 42.0 3e-79 MKLSQFKFKLPEEKIALHPTKYRDESRLMVLHKRTGEIEHKMFKDILNYFDDKDVFVFND TKVFPARLYGNKEKTGARIEVFLLRELNEELRLWDVLVDPARKIRIGNKLYFGDDDSMVA EVIDNTTSRGRTLRFLYDGPHDEFKKALYALGETPLPHTILNRPVEEEDAERFQSIFAKN EGAVTAPTASLHFSRELMKRMEIKGIDFAYITLHAGLGNFRDIDVEDLTKHKMDSEQMFV TEEAVKIVNRAKDLGKNVCAVGTTVMRAIESTVSTDGHLKEYEGWTNKFIFPPYDFTVAN AMVSNFHMPLSTLLMIVAAFGGYDQVMDAYHIALKEGYRFGTYGDAMLILDK >gi|226332033|gb|ACIB01000023.1| GENE 14 11918 - 12619 572 233 aa, chain - ## HITS:1 COG:MT2862.1 KEGG:ns NR:ns ## COG: MT2862.1 COG0130 # Protein_GI_number: 15842331 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Pseudouridine synthase # Organism: Mycobacterium tuberculosis CDC1551 # 8 222 8 220 298 173 43.0 2e-43 MNFKEGEVLYFNKPLGWTSFKVVGHARYHMCRRMKVKKLKVGHAGTLDPLATGVMIVCTG KATKRIEEFQYHTKEYVATIQLGATTPSYDLEHEIDATYPTEHITRELVEKTLKTFVGEI 
QQIPPAFSACKVDGARAYDLARKGQEVELKPKLLVIDEIELLECNLPEIKIRVVCSKGTY IRALARDIGEALQSGAHLTGLIRTRVGDVKLEQCLDPAKFAEWIDQQDVEISD >gi|226332033|gb|ACIB01000023.1| GENE 15 12627 - 13427 751 266 aa, chain - ## HITS:1 COG:ZbacA KEGG:ns NR:ns ## COG: ZbacA COG1968 # Protein_GI_number: 15803599 # Func_class: V Defense mechanisms # Function: Uncharacterized bacitracin resistance protein # Organism: Escherichia coli O157:H7 EDL933 # 6 262 10 271 273 137 35.0 2e-32 MEWFEALILGLIQGLTEYLPVSSSGHLAIGSALFGIEGEENLAFTIVVHVATVFSTLVIL WKEIDWIFRGLFKFEMNSETRYVINILISMLPIGIVGVFFKDEVEAIFGSGLLIVGCMLL LTAALLSFSYYAKPRQKENISMKDAFIIGLAQACAVLPGLSRSGSTIATGLLLGDNKAKL AQFSFLMVIPPILGEALLDGMKMIKGEAIAGDIPTLSLIVGFIAAFVSGCLACKWMINIV KKGKLIYFAIYCAIVGVVTIVVSQLQ >gi|226332033|gb|ACIB01000023.1| GENE 16 13434 - 13667 348 77 aa, chain - ## HITS:1 COG:no KEGG:BF0067 NR:ns ## KEGG: BF0067 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 77 1 77 77 132 100.0 4e-30 MGDKQKFAFDKTNFILLAIGMAVVILGFILMTGPSSSETVFQADIFSVRRIKVAPVVCFL GFIFMIYGVMRKPKTKE >gi|226332033|gb|ACIB01000023.1| GENE 17 13708 - 14589 757 293 aa, chain - ## HITS:1 COG:L2 KEGG:ns NR:ns ## COG: L2 COG2177 # Protein_GI_number: 15672955 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Cell division protein # Organism: Lactococcus lactis # 23 285 29 311 311 84 28.0 3e-16 MKSKSRNNAVSYFDMQFITSSISTTLVLLLLGLVVFFVLAANNLSVYVRENINFSVLISD DMKETDILKLQKRLNNEPFVKETEYISKKQALKEQTEAMGTDPQEFLGYNPFTASIEIKL HSDYANSDSIAKIEKLIKRNTNIQDVLYQKDLIDAVNENIRNISLVLLALAVMLTFISFA LINNTIRLAIYSKRFLIHTMKLVGASWGFIRRPFLKRNIWSGVLAAFIADTILMGAAYWL VSYEPELIRVITPEVMLLVSGAVLVFGVVITFLCAYLSINKYLRMKASTLYYV >gi|226332033|gb|ACIB01000023.1| GENE 18 14589 - 15488 605 299 aa, chain - ## HITS:1 COG:alr2865 KEGG:ns NR:ns ## COG: alr2865 COG0500 # Protein_GI_number: 17230357 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent 
methyltransferases # Organism: Nostoc sp. PCC 7120 # 6 221 4 211 318 68 22.0 1e-11 MNKLTINACPLCGGAHLKRAMTCTDFYASGEQFDLYTCEDCGFTFTQGVPVEAEIGRYYE TPDYISHSDTKKGAMNAIYHHVRQYMLGRKARLVMKESHRKTGRILDIGTGTGYFAHTMQ NRGWEVEAVEKSGQARNFAREHFGLNVRPEAALKELVPGTFDVITLWHVMEHLEHLDETW ELLRELLTEKGVLIVAVPNCSSYDAMKYGKYWAAYDVPRHLWHFTPATIQQFGAKHGFIL AARHPMPFDAFYVSMLTEKHKGSAYSFVKGMWTGTAAWLSAQAKKERSSSMIYVFRKKR >gi|226332033|gb|ACIB01000023.1| GENE 19 15638 - 17647 1250 669 aa, chain - ## HITS:1 COG:no KEGG:BF0052 NR:ns ## KEGG: BF0052 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 11 669 1 659 659 1354 99.0 0 MLRGEKSYMNMKIMKYIGLGLLPLVCSCGGRDRQVEEALSLSGNNRNELEAVLKHYEGDG RKLEAARFLIGNMPGSYGANPIVEQDCSAFYEAYDSLGQKYDYRVGTEWGKQVDSLWQNF SNRHRVRQELNYDITRMKAEDLIREIDLAFRAWVENVHSRNCSFEDFCEYILPYRRQNGL LIDNARREFNKRHQGKYFVKEGKDWQQEIDSLLYEYKYLTHSGFWGTKIPIWNAATLEKM RHGLCAQRCWYNSLLLSSLGIPVAIDFVPAWGNRNNSHTWNVVLINGESHAFEAFWDNDR WKYKRIYNNRNDDELWGRFRLPKVYRYTYSNHIEGPLADVEVDKADIPELFRSVKKVDVS SEYFETADVTVELTGEAPQGVKYAYLAVFGYQDWHPVQWGKVENGRAVFREIGKDMVYLP VYYKRGGLLPAAEPFRLRNDGTMEKLSGNEETEEVAVRMVTGAPAYDQNREYLGCMKGSR IVGLLDGKSEEELCRWTDSLALQSVVRKVSARLPYRFVRLLLPSDSIALGELSFYTEEGR IGNVRIITSMRATGRNEVPGMITDGLGATGYRGRVAERLVDIDLGKEYMVSHIGMTSYLK TQLFCPDEFELRYWDNGWKTVERKQADHKGYLVFERVPRGALLMLKNCRWKGKTAERIFT YEKGDVKWE >gi|226332033|gb|ACIB01000023.1| GENE 20 17722 - 18765 566 347 aa, chain - ## HITS:1 COG:no KEGG:BF0051 NR:ns ## KEGG: BF0051 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 347 1 347 347 693 97.0 0 MKQKYILFLSLCFVFSSCRKNDVGSVYSFDTVCEIVDYNLICSDENAPWGAIMNMEIVDS ILILQHAMDEYAFSFINVNNGELLSQWGRTGEGPEEFIDFGSGFEIVDSRIVFLDRMKKE RISVLISDILSKKEHPDITREAYPYNVDFRVLEINAVGNKKIVTGGFKEGYWGALDSQNH IIPNVAELPFDAGEVSGLEKGTVFGGILKANSKQSKFVLSIRASDIFEIYRVSDDGINRV YVSPFKHIPKTWKKGGGYAIDYNQSIGGIKNIAVSDDLICFSLFLQNYNEAAKTDFASNE LFCFDWDGNKVKKYVLPFPIGNFCIDGTHIYGVRNFEDKIIIYRFNM >gi|226332033|gb|ACIB01000023.1| GENE 21 18845 - 19066 157 73 aa, chain - ## 
HITS:1 COG:no KEGG:BF0048 NR:ns ## KEGG: BF0048 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 8 73 285 350 350 143 95.0 2e-33 MFMRFTVGQIGKNKSIANEVYVFDWEGRAVKKVILDRWGVCISVDSNDERLCLMTKETDG GEERYHYYCYRLN >gi|226332033|gb|ACIB01000023.1| GENE 22 19026 - 19280 102 84 aa, chain - ## HITS:1 COG:no KEGG:BF0050 NR:ns ## KEGG: BF0050 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 79 207 285 350 155 96.0 4e-37 MKPDKKRFAYLAYECDLLSIQKVVNDTCLESVVHLNTYTPLFENQSTNEVSSVDVSADSP KGFLRGVATENYVYALYSGANWEK >gi|226332033|gb|ACIB01000023.1| GENE 23 19267 - 19884 227 205 aa, chain - ## HITS:1 COG:no KEGG:BF0050 NR:ns ## KEGG: BF0050 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 199 4 202 350 393 97.0 1e-108 MRFNLVVLFVILLSFSSCGREEKTVYDFPLEQSLKSDKEVSLNKELLAPYLVCSYDSTLC LIDWTANPMVHVYNMNTGKEMVAFGNKGMGPDDFLSISQMYVDMGKRSLVLYDQSLQTIS SFQIDSLAQGSLSKIDCVSAPKLGMNRVYAYSDSIFYGSGTFESGLIAKCNQKEILNQYL PFPQTEQAVNRDVNYLLFRSYYEAG >gi|226332033|gb|ACIB01000023.1| GENE 24 19922 - 20893 426 323 aa, chain - ## HITS:1 COG:no KEGG:BF0049 NR:ns ## KEGG: BF0049 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 323 29 351 351 660 99.0 0 MNSIDIKTNRINKDEVIARGAFPFEMIDSLFFLFNGDPSSGALVLCESNASELGHFLQKG NGFGECITPGYIGHCNDTIYVSERSRTRRMTYLLSNHNDSLQYKCLEDVSPKMNSEFYYQ ICRLQSGLFVGARLFGKEHLFTLLDESLDTLTTFARVPIDIEENANNKLAPFIGHLCIDD NTVYYASNDFSYMAAYDILSEKEIKPVFERMYISPIIQKSANGISLDKYKHLLGFGDIRV YQNYIFATYIGKPDITMDQENDTSALVPTHLLVFNKDGVPIVKFKFPFKIRSFVFTKSKM YLLDVDCNIESVDLAELWKHLPD >gi|226332033|gb|ACIB01000023.1| GENE 25 21045 - 21296 247 83 aa, chain - ## HITS:1 COG:no KEGG:BF0046 NR:ns ## KEGG: BF0046 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 6 83 3 80 80 134 96.0 1e-30 MSKKIFAALIVAVVATFAGYNIYQSQRVESIMSDLTMANVEALAGSEINDEDCVSASNRY CSVLIVTPNGNYLETYFDQKTKY 
>gi|226332033|gb|ACIB01000023.1| GENE 26 21566 - 21799 170 77 aa, chain - ## HITS:1 COG:no KEGG:BF0041 NR:ns ## KEGG: BF0041 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 77 1 77 77 154 98.0 7e-37 MKNYMKLIFGLGLIALVGYWPAAKTPKRVNSLFLQNVEALAGGEHVTNLGCLGDGSVDCP INHIKVEYVVQGFSLGE >gi|226332033|gb|ACIB01000023.1| GENE 27 21881 - 23020 261 379 aa, chain - ## HITS:1 COG:no KEGG:BF0040 NR:ns ## KEGG: BF0040 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 379 1 379 379 725 99.0 0 MTKNKLLSCVIGVIILSLLVGAYFYQRNKVAVHQQAERLFVQMLQEEIERKERNLNLFHL FSESSSDTLPLKICIITEEGKKEYEVDSLKSKKNISQNLRNRSIHSILCEKSHLLPDSLN EHWQSMLKKDHIDTESTIHVRMENLQGKIISSSSHDGVWDTSSGIITSYIGNRCEIEVIG LLAFSWKTILWYHWQPFGWIVICLLLMLLFICFYYKMVNRPPELKEVPYEVVVEKEVVVE KEVIREIIVEKETSPEKKAPLIKQICKVEGQLYGLRYGVVFDAQNRVLNCNGKKMSLSPQ QCQILKLFLDAPDYTVTDEDIIKFIWKGQSNVQINTFCSAGNKLGKRLEQAGCGVCFRRF GSDRYRMLFIDDLVDNDLT >gi|226332033|gb|ACIB01000023.1| GENE 28 23347 - 23505 95 52 aa, chain - ## HITS:1 COG:no KEGG:BF0050 NR:ns ## KEGG: BF0050 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 2 52 301 350 350 100 90.0 2e-20 MEWEGGTIVKKVILDRWDICISVDSNDERLCLMTKETDGGEERYHYYCYQLN >gi|226332033|gb|ACIB01000023.1| GENE 29 23478 - 23597 76 39 aa, chain - ## HITS:1 COG:no KEGG:BF0047 NR:ns ## KEGG: BF0047 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 39 1 39 289 81 97.0 8e-15 MKNLILVLGCFFFLISCQQTEKEKLEELVKNWNGKEVLL >gi|226332033|gb|ACIB01000023.1| GENE 30 23594 - 24712 558 372 aa, chain - ## HITS:1 COG:no KEGG:BF0046 NR:ns ## KEGG: BF0046 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 372 1 372 372 686 99.0 0 MKNILLTLLLFILFSCKSTGDKTDCEVLHVDLVERPVPTEELFSKISVIPLETNDSSFLV RPVKVIIKDNRYYIVDEGVPAVFSFDEEGHLLHKIGKKGQGPGEYREIYDAVIKEKENAV 
YMLSPFGSLYVYSLDGKFIKEIKLPTRSNYQLIEELDSKYFVTWTLPASENENCISVISK ESFKNMKEFWHVPPVLTTLNSKPFYNYEHKVYFSNPYQNEVYEVRTDSLRVAYRWDFGKD NLDLKEYGFTLLEDQKVEEYKLMLQYLRDSTVPYLLRHQFQNKKYYYTMLTFGLRHRINL FYRKDDGKSFFFEKTAEGVLLHPLALNEDFLTCIVFNEDFPNYEKVLPSEEYKKLEERLE DDNPCLIKFYFK >gi|226332033|gb|ACIB01000023.1| GENE 31 25075 - 25209 58 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MWFKDLVLGSEDSFYINLITLESFIALSIRVRKGVERSTSWEKS >gi|226332033|gb|ACIB01000023.1| GENE 32 25178 - 25405 125 75 aa, chain - ## HITS:1 COG:no KEGG:BF0035 NR:ns ## KEGG: BF0035 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 75 1 75 75 153 100.0 1e-36 MKNYMKLIFGLIVLIGYWPTTKIPKRVNSLFLQNVEALAGSEHVTNLGCLGDGSVDCPIN HIKVEHVVQGFSLGE >gi|226332033|gb|ACIB01000023.1| GENE 33 25889 - 26956 553 355 aa, chain - ## HITS:1 COG:no KEGG:BF0037 NR:ns ## KEGG: BF0037 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 355 1 355 355 652 97.0 0 MGKYISILILIETILLGCHSTKEKIEFSNRVICSDSISRELVVLNDTFLFSYPLQIECID SMLLVLDNVNNNFFHLFTLKGVPIKSFGEKGQGPIDFINVESFNLSEDRKIMYAYDTSLR KIVKYDVSSFLKDSLKSEVIQVNYDSLPQTEVPTIIYDMLSLKDSNFLVKANHKGLRFGL LKDGKVTQLYNSFSDCVNTNDDEEVWSVFCSNTKTKLRPDRTKMLNATYLGGVLELFDLD DNCSLSLAKILYIYEPKYGIAEGAIPKYVVFNETTQIGFEDIYVTNNSIYTLLHSIGSET LPSEITVFDWAGIPITKIKTGCSLSNIAVDGKDNTIYVIAENEQNAYELSCLSLN >gi|226332033|gb|ACIB01000023.1| GENE 34 27096 - 27314 172 72 aa, chain - ## HITS:1 COG:no KEGG:BF0036 NR:ns ## KEGG: BF0036 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 71 1 73 79 111 72.0 9e-24 MKMKNYLKVTVFWVAVLSVWCLKPTKKSQDTLLLQNVEALASGEEPSQIHCYWRGSVDCP VSHDKVEVVYEY >gi|226332033|gb|ACIB01000023.1| GENE 35 27419 - 28636 318 405 aa, chain - ## HITS:1 COG:no KEGG:BF0035 NR:ns ## KEGG: BF0035 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 403 1 400 402 651 85.0 0 MKRSRLFLWIFGILMQAFLISMHFYQRNMEAMYAETEYLLKEVLNEELHRKQQELNLFYI 
SKVTIDTIPLTIRVTTSKGVKTFTVDAKKSKKNISQSMAERSWHSAACMKSRLSTDTLNL LWNRRLKSQQIFAKTDVHITTTHLDNTISYCKCKNCKDYCFGTHKFTFYVGNRCEIEVIA FCSYLRWAVYQYHSIPFEVIWSVTAVLIIILCSWYLIKKYISKIRNDKKHLANDRDRERK VRIQLEKDQKRLEVKQKEYEKRIKDFSAKGEEYEEERKSMEKILKEYENQIQKLKELRES GKEPLLYRLSPKVTFDSYAKVLICSDQTISLTSQACQLLDAFLNASEYILTYEELLRYLW EDGTGDMIRLRVAISRLRVALSIDPEISIFQKDINKYQLVLPEKR >gi|226332033|gb|ACIB01000023.1| GENE 36 29385 - 29546 64 53 aa, chain + ## HITS:1 COG:no KEGG:BF0033 NR:ns ## KEGG: BF0033 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 53 1 53 53 87 96.0 2e-16 MTKGSLLRKRYTSQFCTKYYPFLLKDYTQSGCSKNHILAATTIVKIFIGIKNV >gi|226332033|gb|ACIB01000023.1| GENE 37 30004 - 31338 454 444 aa, chain + ## HITS:1 COG:no KEGG:BF0032 NR:ns ## KEGG: BF0032 # Name: not_defined # Def: two-component system response regulator # Organism: B.fragilis # Pathway: not_defined # 1 444 136 579 579 889 99.0 0 MSCYQNIAVFDDKENELHPGSSPIEFELYTFITSIVNQCRAYADTRQIKLNINKDFSYIS CRVDEITMTAALQCLLNKMIEATPCKGCINMDVSHSIKHWNLQITNGPECRQSHKKMLSF ISTFMLIHYCGSLQIIKKIIRLHGGKLIGSYHGRSITLRVTVPINGYCNTIQCPEVVPPV MKDDKIIRPDKKQHHILLVMADTELSNYLHKAFSILFRITILENPEQILHFSGDRLPDII VIDETVNGIRGKEICSKIKSNTSMVHIPVILLISNNDNGSYLAHADCGVDKLEPRAINIC RLKMDIQILINKHERIMKLLEKNLSDNLPSPTAKSEEDALFINKVNKLLEKNLSTESYTV DMLSADMGMCRTKFYTKIKEITDKTPTEYMHYFKMNKAKILLVTQQYTVTEIATFLGFCN AKYFGKRFKKFYKVPPTQYIKEVF >gi|226332033|gb|ACIB01000023.1| GENE 38 31730 - 33844 1868 704 aa, chain - ## HITS:1 COG:SSO3036 KEGG:ns NR:ns ## COG: SSO3036 COG3250 # Protein_GI_number: 15899743 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Sulfolobus solfataricus # 56 587 38 554 570 157 23.0 7e-38 MRKLSYLIIVCLCLCSGVIPLMARQVTAFNTGWQFKKGPFATDPMRAASQWDGKWETVEI PHTWNAMDMQVQSGSFYEGAGYYRKTQFFPHDLEGKRVFLRFEGVGACAEVYVNGKLAGT HKGGYSAFACEIGTALKLGAENEIIVKADNKARPDVIPVNQNLFGVYGGIYRPVWLIVTE QNNITVTDCASPGVYITQKDVSKKSADITVKVKLDNAGLQPAAVTLENTIYTQEGRKVGT 
HSRSFDLSPQGTQTYLSTFKLKNPHLWQGRKDPYLYKVVCRLMADGKVIDEVVQPLGVRK YEIVAGKGFFLNGEKYPMYGVTRHQDWWGLGSALKNEHHDFDLAAIMDVGATTVRFAHYQ QSDYLYSRCDTLGLIIWAEIPCVNRVTGYETENAQSQLRELIRQSFNHPSIYVWGLHNEV YQPHEYTAALTRSLHDLAKTEDPDRYTVSVNGYGHMDHPVNLNADIQGMNRYFGWYEKKI QDIKPWVEQLEKDYPYQKLMLTEYGADANLAHQTEYLGDALNWGKPFYPETFQTKTHEYQ WSIIKDHPYIIASYLWNMFDFAVPMWTRGGVPARNMKGLITFDRKTKKDSYFWYKANWSE EPVLYLTQRRNADREKRTTAVTVYSNIGIPKVYLNGQELSGIRNGYTDVHYVFDNVSLAD GKNILKAVVSTKGKEYTDEIEWNYSGEKNREIDSYENKNEHSGF >gi|226332033|gb|ACIB01000023.1| GENE 39 33900 - 35318 1157 472 aa, chain - ## HITS:1 COG:TM0306 KEGG:ns NR:ns ## COG: TM0306 COG3669 # Protein_GI_number: 15643075 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Thermotoga maritima # 38 373 16 357 449 115 26.0 1e-25 MKNKLFILFAFCISVHVYAQQPSREIPLKYGATNIGKRQDDAMKRFRNNRLGEFIHWGLY AIPGGEWKGKVYNEAAEWLKSWAKVPAADWLELMKQWNPVKFDARQWARMAKEMGVKYVK ITTKHHEGFCLWPSQYSQYTVAQTPYRKDILGELVKAYNDEGIDVHFYFSVMDWSHPDYR YEITSKEDSIAFSRFLTFTDHQLKELATRYPTVKDFWFDGTWDASIKKNGWWTAHAEQML KELVPGVTVNSRLRADDYGKRHFDSNGRLMGDYESGYERRLPDPVKDLQVTKWDWEACMT VPENQWGYHKDWSLSYVKTPIEVIDRIVHAVSMGGNMVVNFGPQPDGDFRSEEKELAMAL GCWMKRYGECIYGCDYAGWDKQDWGYYTRKGQEVYMVVFNRPYSGLLKVKIPKGTEIERA VLPDGQVVKVTETARNEYNVAMPSQDPGEPFIIKLQVKEASGAADGYRDALT >gi|226332033|gb|ACIB01000023.1| GENE 40 35436 - 36809 457 457 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|227395721|ref|ZP_03879044.1| SSU ribosomal protein S12P methylthiotransferase [Haliangium ochraceum DSM 14365] # 20 450 1 450 461 180 29 2e-44 MNELTGADFKSATADDNKKLFIETYGCQMNVADSEVIASVMQMAGYSVAETLEEADAVFM NTCSIRDNAEQKILNRLEFFHSMKKKKKHLIVGVLGCMAERVKDDLIEHHHVDLVVGPDA YLTLPELIASVEAGEKAMNVELSTTETYRDVIPSRICGNHISGFVSIMRGCNNFCTYCIV PYTRGRERSRDVESILNEVADLVSKGYKEITLLGQNVNSYRFEKEGGEVVTFPMLLRLVA EAAPGIRVRFTTSHPKDMSDETLEVIAQVPNVCKHIHLPVQSGSSRILKLMNRKYTREWY LDRVAAIKRIVPDCGLTTDIFSGFHSETEEDHRESLSLMEACGYDAAFMFKYSERPGTYA SKHLEDNVPEEIKVRRLNEIIALQNRLSAESNNRCIGKTYEVLVEGVSKRSRDQLFGRTE QNRVVVFDRGTHRIGDFVNVRITEASSATLKGEEVFS 
>gi|226332033|gb|ACIB01000023.1| GENE 41 37140 - 37469 229 109 aa, chain + ## HITS:1 COG:no KEGG:BF0027 NR:ns ## KEGG: BF0027 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 167 100.0 1e-40 MDDIVKVLVIMAAFALPLIRQIKKSKTERSAQKPFVPIPDTEEPEVLKVTRKYQPLHSQP TSQKVEVKKNKTVSQKIETTPANDPEFTIHSAEEARKAIIWSEILNRKY >gi|226332033|gb|ACIB01000023.1| GENE 42 37506 - 39005 1562 499 aa, chain + ## HITS:1 COG:ygfH KEGG:ns NR:ns ## COG: ygfH COG0427 # Protein_GI_number: 16130821 # Func_class: C Energy production and conversion # Function: Acetyl-CoA hydrolase # Organism: Escherichia coli K12 # 6 489 8 490 492 499 48.0 1e-141 MAFKSISAAEAASLVKHGYNIGLSGFTPAGTAKAVTSEIAKIAEAEHAKGNPFQIGIFTG ASTGDSCDGILSRVKAIRYRAPYTTNPDFRKAVNNGEIAYNDIHLSQMAQEVRYGFMGKV NVAIIEACEVTPDGKIYLTAAGGIAPTVCRLADQIIVELNSAHSKNMMGMHDVYEPLDPP YRREIPIYKPSDRIGLPYIQVDPKKIVGIVETNWPDEARSFAAADPITDKIGQNVADFLA ADMKRGIIPSTFLPLQSGVGNIANAVLGALGRDQTIPAFEMYTEVIQNSVIGLIREGRVK FGSACSLTVTNDCLQGIYDDMDFFRDKLILRPSEISNSPEVVRRLGIISINTAIEADIYG NVNSTHIGGTKMMNGIGGSGDFTRNAYISIFTCPSVAKEGKISSIVPMVSHLDHSEHSVN IVITEQGVADLRGKSPKERAQAIIENCAHPDYKQILWDYLKLAGNKSQTPHAIQAALGMH AELAKSGDMKNVNWAEYER >gi|226332033|gb|ACIB01000023.1| GENE 43 39081 - 40490 970 469 aa, chain - ## HITS:1 COG:BS_pbp KEGG:ns NR:ns ## COG: BS_pbp COG2027 # Protein_GI_number: 16078896 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) # Organism: Bacillus subtilis # 37 468 51 491 491 145 26.0 2e-34 MKKLNLFILFSFCFSIITWGQANFAAIDSLIKKELPQGSEVGISVYDLTARKTLYTYRDT KLSRPASTMKLLTTITALARPDADEPFRTEVWYKGTIEHDTLRGDIYVVGGFDPEFDDEG MNALVEEVITFPFSVLKGNIYGDISMKDSLYWGSGWAWDDTPSSFQPYLSPLMYHKGMVK VTAVPGATRGDSARLSFEPSSSYYTMTNETKTRTSSAGKFSVSRGWLENKNNLIVSGNVE NRRIGDVNVYSSQDFFMHTFVERLRNKGIEISNHYAFDSFRSDSLSICMARWECPVQDVI DQIMKESDNLSAEALLCRLGARATGKKQVSAKDGIEEIYRLIQDLGHDPDNYKIADGCGL SNYDYLSPALLVDFLKFAYSRTDIFRKLYKTLPVAGIDGTLKNRMKQGAAFKNVHAKTGS 
YTAINTLAGYLKMANGHQVAFAIMNQNILSAAKARNFQNKVCEILANHQ >gi|226332033|gb|ACIB01000023.1| GENE 44 40594 - 41937 645 447 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1 [Flavobacteriales bacterium ALC-1] # 2 447 3 448 458 253 32 3e-66 MKYQVIIIGGGPAGYTAAEAAGKAGLSVLLFEKQNLGGVCLNEGCIPTKTLLYSAKTYDG AKHASKYAVTVPEVSFDLPKIIARKSKVVRKLVLGVKSKLTSNNVTIISGEATILDKNTV RCGEETYECDNLILCTGSETFIPPIPGIDSVNYWTHREALDNKELPASLAIVGGGVIGME FASFFNSLGVKVTVIEMMDEILGGMDKELSALLRADYAKRGIQFLLSTKVVSLAQTEEGA VVSYENAEGAGSVIAEKLLMSVGRRPVTKGFGLENLNLQRTERGSIVVNGQMESSLPGVY VCGDLTGFSLLAHTAVREAEVAVHAILGKEDRMSYAAIPGVVYTNPEIAGVGQTEESLTA KGIAYRAVKLPMAYSGRFVAENEGVNGVCKVLLGEDDTILGAHVLGNPASEIITLAGMAV EMKLKAAEWKKIVFPHPTVAEIFREAL >gi|226332033|gb|ACIB01000023.1| GENE 45 41944 - 42342 357 132 aa, chain - ## HITS:1 COG:no KEGG:BF0023 NR:ns ## KEGG: BF0023 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 132 1 132 132 231 100.0 8e-60 MNRIFHARIVWYQYFLLVVLGVNAFGFLWCKNIILATLMMLFLIVVIEQIIHTVYTVTAD GLLLLNHGRFIRKKTIPIAEITSIRKVHSMKFGSFSVTNYLLIEYGKGKYASVLPVKEKE FMELIEKTRNLI >gi|226332033|gb|ACIB01000023.1| GENE 46 42450 - 44021 1587 523 aa, chain + ## HITS:1 COG:YPO2710 KEGG:ns NR:ns ## COG: YPO2710 COG0029 # Protein_GI_number: 16122914 # Func_class: H Coenzyme transport and metabolism # Function: Aspartate oxidase # Organism: Yersinia pestis # 6 519 22 532 545 475 46.0 1e-133 MVKKFDFLVIGSGIAGMSFALKVAHKGKVALVCKSGLEEANTYFAQGGVASVTNLLVDNF EKHIEDTMIAGDWISDRTAVEKVVREAPAQIQELISWGVNFDKNEKGEFDLHREGGHSEF RILHHKDNTGAEIQDSLIRAVQQHPNITVIENHFAIEILTQHHLGVTVTRQTPDIKCYGA YILDPKTGKVDTYLAKVTLMATGGVGAVYQTTTNPLVATGDGIAMVYRAKGTVKDMEFVQ FHPTALYHPGDRPSFLITEAMRGYGGVLRTMDGKEFMQKYDPRLSLAPRDIVARAIDNEM KNRGDDHVYLDVTHKDPEETKKHFPNIYEKCLSLGIDITREYIPVAPSAHYLCGGIKVDL NGQSSIERLYAAGECSCTGLHGGNRLASNSLIEAVVYADAAARHCLSVIDQYTYNEEIPE WNDEGTRSPEEMVLITQSMKEVNQIMSTYVGIVRSDLRLKRAWDRLDILYEETESLFKRS VASKEICELRNMINVGYLIMRMAMERKESRGLHYTVDYPHAGK >gi|226332033|gb|ACIB01000023.1| GENE 
47 44105 - 44683 648 192 aa, chain - ## HITS:1 COG:CAC2575 KEGG:ns NR:ns ## COG: CAC2575 COG1592 # Protein_GI_number: 15895835 # Func_class: C Energy production and conversion # Function: Rubrerythrin # Organism: Clostridium acetobutylicum # 3 192 2 195 195 203 58.0 2e-52 MTKSIKGTQTEKNLLTSFAGESQARMRYTYFASVAKKEGYEQIAAIFTETADQEKEHAKR MFKFLEGGMVEITASYPAGVIGNTLQNLQAAAAGEHEEWSLDYPHFADVAEQEGFPMIAA MYRNISIAEKGHEERYLAFVKNIEVASVFAKEGEVVWQCRNCGYIEVGKEAPEVCPACLH PQAYFEIKKENY >gi|226332033|gb|ACIB01000023.1| GENE 48 44944 - 46623 1875 559 aa, chain + ## HITS:1 COG:VCA0077 KEGG:ns NR:ns ## COG: VCA0077 COG0659 # Protein_GI_number: 15600848 # Func_class: P Inorganic ion transport and metabolism # Function: Sulfate permease and related transporters (MFS superfamily) # Organism: Vibrio cholerae # 20 554 16 543 553 403 45.0 1e-112 MKVLDFKPRLFSTLKNYSKETFMSDLMAGIIVGIVALPLAIAFGIASGVSPEKGIITAII AGFIISLLGGSKVQIGGPTGAFIVIIYGIIQQYGEAGLIVATLMAGILLILLGVFKLGAI IKFIPYPIIVGFTSGIAVTIFTTQIADIFGLNFGGEKVPGDFIGKWMIYFRHFDTVNWWN AVVSILSIIIIAITPRFSKKIPGSLIAIIVVTIGVYVLKTYAGIDSIDTIGDRFTIKSEL PEAAIPTLNWEAIKDLFPVAITIAVLGAIESLLSATVADGVTGDKHDSNTELIAQGTANL ITPLFGGIPATGAIARTMTNINNGGKTPVAGIIHTIVLLLILLFLMPLAQYIPMACLAGV LVIVSYNMSEWRTFKALLKNPKSDVTVLLITFFLTIIFDLTIAIEVGLVIACILFMRRVM ETTEISVIKDEIDPNDELDIAVCEEHLIIPAGVEVYEINGPYFFGIATKFEETMAQLGDR PKVRIIRMRKVPFIDSTGIHNLTSLCKMSQKEKITIVLSGVNEKVHKTLEKSGFYELLGK QNICPNINVALDRAKEIIN >gi|226332033|gb|ACIB01000023.1| GENE 49 46824 - 49238 1672 804 aa, chain + ## HITS:1 COG:alr2185 KEGG:ns NR:ns ## COG: alr2185 COG1629 # Protein_GI_number: 17229677 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Nostoc sp. 
PCC 7120 # 146 775 224 821 853 253 29.0 2e-66 MKQIYSTLLLLVLLIFPSLLFATEPESVDRVPAIRGVVYDETDTPLASATVQIEGTTIGT TTNSEGRFILRNLARKVYKINVSFVGYVTQTRTVDLTSRSVAQLSFTLLPDDNLLSTVEV FGERYKQPKKLDAITRMPLRPSEQIQSISVISEKSITEQGALTVTDVARNVPGVTLFGSY GGVRESMSIRGYRGVPILKNGVRIDSDFRTGSALSEMQGVESIQVIKGSAAVTQGIGNDL GSAGGVINVVTKTPKFTNEGEVSLRAGSWGLFRPTFDVQSVLDKNQTIAFRMNGAFERSD NYRPVIHSNRVYINPSLEWRPDDKTSVTIEMDYLNDNRTPYTSSVNLSKDTEENLYDMPH NKFLGFKNDNVNNKTLTYAARITRQLTDNISVRAAYFGSSYKVDNTSTSVKTVVNKEYNM RRRTISRSLRDDRNSTFQLDFIGRDIFTGPVKHTFQLGFDYKNTDLSITNYTPVNIDTIN VLAPSISNVLPVAVKFVPETPVESNSSSYGIMAQEVMTFNKYIKAILGLRYSYISSQDGT SAGPTTGDAWNPMLGIMLTPIKNINLFGSYTTTTSLLHAARRMENGDEIGPSKTRQFEVG IKSDWLNNRLRFNLTYFDILTKNLSYSTYHPGTTQPTGYFDKAGSLKRKGIETELSGSIL ENLQVMMGYAYLDAKYENSPAFKNGSAPMNTPKHTANGWIQYRFDKGVLKRLSAGIGVYF VGKRPVNDFAIKPDGHGSMTNEKPFDMPGYTTINAQLAYSIHKFTARVYLNNLFDALGYN SYYRGGYINQIDPRNFSAVISYHF >gi|226332033|gb|ACIB01000023.1| GENE 50 49239 - 49865 302 208 aa, chain + ## HITS:1 COG:PA4513_1 KEGG:ns NR:ns ## COG: PA4513_1 COG3182 # Protein_GI_number: 15599709 # Func_class: S Function unknown # Function: Uncharacterized iron-regulated membrane protein # Organism: Pseudomonas aeruginosa # 54 161 123 235 395 63 35.0 3e-10 MKIKKYCRYIHLWLSLPAGILISIICFTGAILVFKEELLTIMGYDSIRESPLMIVMKLHR WLMDDTRTTGKMIVGISTLFFIFILISGLTVYWPRKWKKSRLIIEHQKGRRRLMFDLHSV LGLYATLILLVCALTGLMWSFQWYRDIVSFIFDAEVKRGAPIWKIVRALHFGTYAGMFSK IVTFIAALIGTSLPVTGYWMYLKRKKLL >gi|226332033|gb|ACIB01000023.1| GENE 51 49969 - 51582 1197 537 aa, chain - ## HITS:1 COG:PA0183 KEGG:ns NR:ns ## COG: PA0183 COG3119 # Protein_GI_number: 15595381 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Pseudomonas aeruginosa # 35 510 4 526 536 261 31.0 3e-69 MKNNCLICSLLFASGIQNAWGAQITDRKANPDQAKPNIILIMCDDMGFSDLSCYGGEVHT PHIDFLAENGIRFSQFKNTGRSCPSRAALLTGRYQHEVGMGWMTAVDEHRPGYRGQISDR YPTIAEVFRENGYHTYMSGKWHVTVEGAFTQPNGSYPVERGFEKYYGCLSGGGSYYTPKP VFSGLQRITEFPKDYYYTTAITDSAVSFIRQHPVDEPMFMYLAHYAPHLPLQAPKERVEA 
CRERYKAGYDVLRKQRFERIRRNGLIDIEGELPVFEKEFGGKRPAWNSLTPQQQERWITE MATYAAMIEIMDDGIGEVIKATKEKGIFDNTIFLFLSDNGATNEGDMITQLRADLSNTPF RSYKQWCFQGGTSAPLIIMYGGGQPDGKKGAVRHEFTHIIDLFPTCLDMASIEYPREFRN HAIDAPGGRTILPVLKGKKLSKRDLFFEHQTSCGIISGDWKLVRANGKQPWELFNLLQDP FEQNDLSARYPDRVKTLEKKWNQWAEKQQVFPFEYRPWTKRINYYKSLYPDQSGKDL >gi|226332033|gb|ACIB01000023.1| GENE 52 51594 - 52109 433 171 aa, chain - ## HITS:1 COG:FN0320 KEGG:ns NR:ns ## COG: FN0320 COG1853 # Protein_GI_number: 19703665 # Func_class: R General function prediction only # Function: Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family # Organism: Fusobacterium nucleatum # 3 129 4 131 180 71 32.0 7e-13 MMKPIAVSQLSDNFFETISKEWMLVTAGNKDAFNTMTANWGGIGFLWNKPVVYVFIRPER YTFGFMEKSDYFTLSFLGEENKSIHKICGSKSGREVDKIKETGLKPMITDKGNVLFEQGR LSLECRKLYTDVLRKENFLDPSVYEQWYTTHGGLHHVYVAEITSAWIKDKS >gi|226332033|gb|ACIB01000023.1| GENE 53 52151 - 52924 689 257 aa, chain - ## HITS:1 COG:HP0117 KEGG:ns NR:ns ## COG: HP0117 COG0731 # Protein_GI_number: 15644747 # Func_class: C Energy production and conversion # Function: Fe-S oxidoreductases # Organism: Helicobacter pylori 26695 # 6 226 6 214 308 96 31.0 4e-20 MTIIFPSPIFGPVHSRRLGVSLGINLLPSDGKVCSFDCIYCECGYNGEHRPKSSLPTREE VRMALEEKLKEMKSNGPAPDVLTFAGNGEPTAHPHFPEIIEDTLALRDAYFPDAKVSVLS NATFINRPAVFDALNRVDNNILKLDTVDEEYIRTVDRPNGRYDLNGTVGLLKAFKGNCIV QTMFMKGKYKGKDVDNTSDKYVLPWLKVVKDIAPRQVMIYTIDRETPDQDLQKATHEELD RIVALLTKEGLSASASY >gi|226332033|gb|ACIB01000023.1| GENE 54 52982 - 55009 1039 675 aa, chain - ## HITS:1 COG:no KEGG:BF0013 NR:ns ## KEGG: BF0013 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 675 1 675 675 1340 99.0 0 MNESISILSIFLLVNMTLITSTCHAQNRSDYPWEEVMENLSISDEEGDIRNWENELEELT DLVNNPVNINSATKEQLQRFPFLNDVQIENLLAYIYIHGSMQTVYELQLVEELDRQTIQY LLPFVCVEPVDKKESVTLKQILKYGKHEAVTRMDVPLYKRKGYEKNYLGPAVYNSVKYGF HYREKVYAGIVAEKDSGEPFGALHNKQGYDYYSFYLLLHDIGILKTGIVGNYRLNFGQGL 
VLGQGSMFGKTAYSSSFTFRSTGIRRHTSTDEYNYFRGCGIALKWKQWTLSVFYSHRSLD GVIKGGEITSIYKTGLHRSEKEADKMNQLTMQMSGGNISYTGNSYQLGITGVYYCFNRSY EPELKDYSKYNLHGRSFYNLGMDYKYRFHRFSIQGEAALGISGMAFMNQVLYSPLQDIRL MLVHRYYSHDYWAMFAHSFSEGSSVQNENGWYLAASVNPFNRWTFFVSADLFSFPWWRYR ISKASKGVDLLFQANYVPSKTVDMYVNYRYKQKERDVTGTQGKVILPTYHHRLRYRLNYL PCSSLSLRTTVDYNHFHSSGKTAGQGYQLTQTAGWKLPWLPLTAELQGSYFHTDDYDSRI YIYEKGLLYSFYTPSFQGEGVRLAIHFRYDMNKHWTAIAKLGQTTYFDRDEIGSGNDLIR GNKKTDVQMQLRLKF >gi|226332033|gb|ACIB01000023.1| GENE 55 55123 - 55728 517 201 aa, chain + ## HITS:1 COG:no KEGG:BF0012 NR:ns ## KEGG: BF0012 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 3 201 1 199 199 370 100.0 1e-101 MNMRLTIGLLMLSIALLFSSESLAQEKTNLGGYLVPMCVYNGDTIPAFQIPTIHIFKPLK FRNRKEQMEYYKLVRNVKKVYPIAREINRTIIETYEYLQTLPNEKARQRHIKRVEKGLKE QYTPRMKKLSFAQGKLLIKLIDRQSHQSSYELVKAFMGPFKAGFYQTFAALFGASLKKQY DPEGEDKLTERVILLVESGQL >gi|226332033|gb|ACIB01000023.1| GENE 56 55845 - 58028 219 727 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 [Roseobacter sp. 
AzwK-3b] # 464 727 261 525 563 89 28 7e-17 MLLHRFPVEYQMDSQDCGPASLKIIAKHFGKFYSLQFMRDRCGITKEGVSLLDLSTGAES IGLRTLAIKCTIDDVVNSIPFPAIVFWNDSHFIVVYHSDRKYIWVSDPAKGRIKYTHEEF RKGWYQRDESQGVLLAVEPTTDFKNSKAEQEQKRNSFSSILKYFFPYKKSFGLIFIIMLV VTVLQGMLPFISKAVIDVGIKTSDRNFINMVLVGNICILLSVMIFNVLRDWILLHITARV NIALISDYLIKLMKLPVTFFENKLLGDILQRAQDHERIRSFIMNNSLALIFSTLTFAVFS IILLIYNSIIFYIFLSGSVLYACWVLLFLSIRKKLDWEYFELLSKNQSYWVETVSTIQDI KIYNYDKYRRWKWEEIQARLYHVNKRVLAITNAQNLGAQFIENIKNMAIVFFCAMAVIKG EITFGIMISTQFIIGMLNGPLVQFINFVVSAQYAKISFLRINEIRQLENEDELLSIGSTT ILPERKTILLENIHFQYTPNSPLVLRNIYLQIPENKITAIVGGSGSGKSTLLKLLVRLYK PSHGEIKMDKMNVSAINLRQWRNMCGVVMQDGKIFSDTILNNIVLDDEQINYTRLREVCR IAQIEDEINAMPKGFETTIGETGRGLSGGQKQRLLIARALYRDPKFLFMDEATNSLDSIN ERKIVNALNNAFEQRTVVVIAHRLSTIRNADQIVVLDKGFIVETGTHEILMEKKGHYFEL VSSQIQD >gi|226332033|gb|ACIB01000023.1| GENE 57 58033 - 59391 685 452 aa, chain - ## HITS:1 COG:alr1928 KEGG:ns NR:ns ## COG: alr1928 COG0845 # Protein_GI_number: 17229420 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Nostoc sp. 
PCC 7120 # 206 447 255 503 512 72 23.0 2e-12 MNSRIQKQEQPICSPKIILPNPNKKSDVIARSEEVQAIIDRMPTYWAKWVILCVGVLMGM IILLGFLIQYPDTVDGQISVTANAAPVRLVANSNGRITLFQPNKALLHKNDVISCIESGA DYKHILWIDSFLKTLSDKSTIRVALPDTLLLGEVSSAYNSFLLSFLQYERLLTSDIYSTM RQKLQQQIISDEAVIANFNNELRLKKQILDNSQNQLSKDSILLSMKGISEQEYQQKFSTH LSLKESQLNLQSNRQMKQSEISRNQLEIQRICLEETEAKEKAYSDYITRKNELSNAIKLW KEHYLQYAPVEGELEYLGFWRNNRFVQSGQELFSIIPDKTNILGEVVIPSFGAGKVEVGQ TVNVKMDNYPYDEYGLLKGVVKSVSRITNKIKTQNGDMDTYLVIISFPDGTLTNFGKILP LDFETKGTVEIITKRKRLIERLFDNLKSKGEK >gi|226332033|gb|ACIB01000023.1| GENE 58 59388 - 60056 313 222 aa, chain - ## HITS:1 COG:no KEGG:BF0009 NR:ns ## KEGG: BF0009 # Name: not_defined # Def: putative glycosyltransferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 222 1 222 222 456 99.0 1e-127 MGTNNSDFYLPVYVINLKERTERRQHIEEQFQGRVEFALHWIEAIEHSIGAVGLWQSMLK AVQTAIDKRDDIMIICEDDHIFTPAYNKDYLFANIIGANAQGSELLSGGVGGFGTAVPVD TNRYWMDWFWSTQFIIIFKPLFQKILDYDFKDTDTADGVLSVLAKDKMTIYPFISVQKDF GYSDVTVYNGTPGMISNYFSQANYRLRMIHHVSHKFKEQAKR >gi|226332033|gb|ACIB01000023.1| GENE 59 60037 - 61155 544 372 aa, chain - ## HITS:1 COG:MTH173 KEGG:ns NR:ns ## COG: MTH173 COG0438 # Protein_GI_number: 15678201 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Methanothermobacter thermautotrophicus # 122 360 139 376 382 68 25.0 2e-11 MNIAFLTTLNPADINNWSGTTFHLFHALSRKHHVKVIGQNTLLQAAYFTKDNYIKKNPLE NYVSVFGKLCTEQLTNYDLVFFGDLYLAPFLDVNVPVVHLSDVTYHSFQSYLNPLKNEEQ IKKIEMMEKKLLNKYTAIIYSSEWAKQSTINYYDIEPGKIHVVEFGANIPTPSDYKIDIQ TDICNLVFIGKNWQKKGGDKVLGAYRKLKSDGFRCTLTIIGSIIREPYDEDENLVIIPYL DKSQPEHLERFCNILQEAHFLVLPTEFDAFGIVFCEASAYAVPSIAANVGGVSQPVREGK NGYLLMPDATAEDYAEKIKSVFADKENYLKLRMSSRQEFETRLNWEVWSEKVNKILEEIV EEHHKNNGNKQQ >gi|226332033|gb|ACIB01000023.1| GENE 60 61312 - 63123 524 603 aa, chain - ## HITS:1 COG:no KEGG:BF0007 NR:ns ## KEGG: BF0007 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 603 1 603 603 1166 98.0 0 
MNSITKLHVLFFFVFIFYAVSCTAKLEKQTYTNVYDLHFAMRFDSAVVYPWRENGAYSNY TIPAYIQDSNRNLFAKKYFKGFPFSKRLRSEYEQRILLPNNNIKEAVIGFEGKGDNIKLV SIILDAIGKQENILFSDTLRFRPDSILSLFTQNINLNNAEMLNVRINVEGEIDKDAYIAF SRLDILIDGKPIDEFPVRTLSPLIVDKKINYTGINVDRKIGLEQINEINDKKIIGLGESV HGNDGIKNLAYQLIIQAVERLNCKLVLQEMPLEQSFAYNRFIQDDNYELDSSLVINHATI NFLNRLRSFNSGKTKDSKVKLYGMDYNSILSSTQSSAMDIFDFITVLNQKSQIPEVDQLS LLLMKKDRNCAINFLDTHRDKIKKLLTAEEIECILHILRVSKQAGDAGIERFIRRDSIMF VNARFLIDKFAKDENVKTVIYGHAGHINPISSYPAVPCIPFGRYMRKAYGESYSPLLFLI GSGEAMAYDEYYNRKDNWLSRPPENSMEYFLSLIDDNVFYTPLTVDFNELTLSRLQGSHH IPQEFYPFNLYQRFKGVFFIKSTDCTHKDEKEISFEKASDRLIMKIKQRQEKIKEIQKQI ENL >gi|226332033|gb|ACIB01000023.1| GENE 61 63313 - 65118 517 601 aa, chain - ## HITS:1 COG:no KEGG:BF0006 NR:ns ## KEGG: BF0006 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 601 1 601 601 1212 99.0 0 MKYLILLASVIFLAQSCSVAPSMRESARSYDWATYTNFSWQSKIDSAISSYPLLLYPSYE AKGSAGFTVPLFYHMDKKKRADIEVRIKYKTENCNDLYLKLSGISECREVTSVDTFRLSA ANTWRVARRSVDIASPLLLGVALEAQGEKPRKKDFPVDPLGSADNSFKPGEYSKIWIDSL DILIDGKYAVEPPSLNNGATASVRELDVIPVNGDDLKSLSFSDKRILAIGESVHGTGTMN DMGVEIIKNRIEHGKCRLVLLEMPLTLSFHINRYLEGDERFKPDSIASSFDKVLFSSSSF VSLMRWIKEYNRHSEEKVSFFGIDRNIYRLQSSIDLFYFFYTLRRGKGDEGLKAICNSLL LSDEKFPFKGADSVLHANHGFKGILTRREAEIMSYCLNAEEEATVDELNRFRGRDSGMYE NAKFLMKTMLKKDETTTVYCHLGHANYTSIAGWLGSDMRPFGEYMKGSYGDDYSAVGLLA GGGSYLTWVFPGKMGIRRLQSSSSAGLEYCIERSGISPCYLSMDKLSDADVLKMRYIGNT ESKIGQFQWVFPKCMMDGVLFTKNASAINKREEFFKMNLDYHVQTLFALMYLYEKKRKWI P >gi|226332033|gb|ACIB01000023.1| GENE 62 65179 - 65712 350 177 aa, chain - ## HITS:1 COG:no KEGG:BF0005 NR:ns ## KEGG: BF0005 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 177 1 177 177 295 100.0 3e-79 MKKLTRKSLNELAKTMPVIEESLQMSYVGGGNGTSANPYTQAEFDNMLSNDNWNGGYVEG MGYVATNTYIYGSSVYSGSVSQMYYTFPDYVTSISSTGWDRFLSEAVGLTPLGSLVSHVS QDITNMELSILRELLEKGYNASSSFNFVKTNIPYGGTQISVYDAATGQFVTSRTVGE >gi|226332033|gb|ACIB01000023.1| GENE 63 65746 - 66585 439 279 aa, 
chain - ## HITS:1 COG:CAC2174 KEGG:ns NR:ns ## COG: CAC2174 COG0463 # Protein_GI_number: 15895443 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Clostridium acetobutylicum # 3 237 6 246 336 119 30.0 8e-27 MCEISVVMPVYNAEMHIKDAIESVLEQSFVDFEFILIDDGSTDRTSSIIQSYNDKRVRLI QNSHNFIESLNLGIENSLGKYMARMDGDDIMHIDRLKIQYAIMQEYPDVTVCGTWMNSIG TYSQTNGLLSTLSGWVGQPLLKFTKGNFLFHPTTMIRMDFLKKNALKYENCPYAEDFKFW VEIAKSGGRFYIDSQPLLYYRISDSQVSSQKSSEQRATTESIINEVLEYLIELNKNEYPE LVAAYGDLCKLYEKQLLTKCEVLTLFQTLFSKNEKKLNL >gi|226332033|gb|ACIB01000023.1| GENE 64 66599 - 67570 382 323 aa, chain - ## HITS:1 COG:no KEGG:BF0003 NR:ns ## KEGG: BF0003 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 323 1 323 323 602 99.0 1e-171 MISDTTIRKLVDYISLNACSVNSSGFYNGKSGISLALFETAKCLQDTEIEDKAFSLFQES LIRKTNDYGFENGMSGIGYVLIYLITNKLIDADFEDLFGDQREAIIKHFENIDKQPDKLL VSYKIIYFLFVLDKLQKQDKRIYSIIEKIFQGLELYLSLQFFDWKNIYYINSKDYVLQMY EAYLKLVDFCNYKYFSKSLMDSYVTLYSEGRIASSLVRGYYLGSIITKNNMVGFNDVIRD HIRYGQKNINPAILFLDQKINLTGIIENADENRVKIQRIEMDLFEESLERIKRMVRPNCI HVGYQYGLARYLGFCANKKFPLL >gi|226332033|gb|ACIB01000023.1| GENE 65 67563 - 69032 649 489 aa, chain - ## HITS:1 COG:no KEGG:BF0002 NR:ns ## KEGG: BF0002 # Name: not_defined # Def: putative outer membrane protein TolC # Organism: B.fragilis # Pathway: not_defined # 1 489 1 489 489 842 99.0 0 MKTLLVYILVFSLCYTNAYCQSIPREVTLDEVINRLSLESSSAKIELLNFQNDLLRYENY KKSFLPAFVLNFNPINFNRSLRLLQQPIDGSYSYVEDNSNNTNFGTTVRQKISITGGELS IGSNINYLNEFSRKQNSFSTNPFFISYSQQLWGGGKLQRLENKIERAKNEVAVKQYCSNI AQIQQQALTLYLSAILSKMDSELAIDIKQSNDTLLHIAEIKLRNGSITEYDYKQMELQSL NLQYMYENAVKHYAESIQKLFTFLGIENNAEITIPDFDLPLTIDARLAIYYVKKNNPISN QQEIQQLEEEKKLFSIKLKNRFNGNISLNYGINQYAETLADAYRHGNTRQSVIIEFQIPI FQWGINKNNIRIAKNNYDASRLRIEKKNFEFENEVKEKINAYDHSVKLWLPASRAYALSK EQYKMLTKKFSLGKVSVYELATAQKERNDAMQRYYSAIKDSYESFFTLRNLALYDFKKNV ELEKILFND >gi|226332033|gb|ACIB01000023.1| GENE 66 69187 - 69768 185 193 aa, chain - ## 
PROTEIN SUPPORTED ## NR: gi|163764761|ref|ZP_02171815.1| ribosomal protein S11 [Bacillus selenitireducens MLS10] # 36 191 93 245 255 75 31 6e-13 MLTRKELLLQHTNRNDIIMRKLKITELNRISIEEFKEADKLPLVVVLDDIRSLHNIGSVF RTADAFRIECIYLCGITATPPHPEMHKTALGAEFTVDWKYVNNAVETVDNLRSEGYVVYS VEQAEGSIMLDELTLDRSKKYAVVMGNEVKGVQQEVIDHSDGCIEIPQYGTKHSLNVSVT AGIVIWDLFKKLK >gi|226332033|gb|ACIB01000023.1| GENE 67 70171 - 71163 957 330 aa, chain + ## HITS:1 COG:all4673 KEGG:ns NR:ns ## COG: all4673 COG0379 # Protein_GI_number: 17232165 # Func_class: H Coenzyme transport and metabolism # Function: Quinolinate synthase # Organism: Nostoc sp. PCC 7120 # 14 325 13 323 324 380 58.0 1e-105 MNREEWVNKGFVDEPVDKSIDLKAAINELKKEKNAVILGHYYQKGEIQDIADYIGDSLAL AQIAAKTDADILVMCGVHFMGETAKVLCPDKKVLVPDLNAGCSLADSCPADKFAEFVKAH PGYTVISYVNTTAAVKAVTDVVVTSTNAKQIVESFPKDEKIIFGPDRNLGNYINSITGRE MLLWDGACHVHEQFSVEKIVELKAQYPDAVVLAHPECKSVVLKLADMVGSTAALLKYAVN SDKQRFIVATEAGILHEMQKKCPQKTFIPAPPNDSTCGCNECNFMRLNTLEKLYNCLKYE FPEVTVDPEVAREAVKPIKRMLEISAKLGL >gi|226332033|gb|ACIB01000023.1| GENE 68 71754 - 72338 305 194 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|71274727|ref|ZP_00651015.1| Ham1-like protein [Xylella fastidiosa Dixon] # 1 193 1 197 200 122 40 8e-27 MKRKLVFATNNAHKLEEVSAILGNKVELLSLNDINCHTDIPETAETLEGNAYLKSSFIYR NYGLNCFADDTGLEVESLGGAPGVYSARYAGGEGHNAEANMLKLLHELEGKDNRRAQFRT AISLILDEKEYLFEGIIKGEIIKEKRGDSGFGYDPVFVPEGYDRTFAELGNEIKNQISHR ALAVNKLCEFLRSI >gi|226332033|gb|ACIB01000023.1| GENE 69 72349 - 73248 1099 299 aa, chain - ## HITS:1 COG:TM0177 KEGG:ns NR:ns ## COG: TM0177 COG1284 # Protein_GI_number: 15642951 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Thermotoga maritima # 10 297 1 281 283 139 30.0 6e-33 MISKPTKSDVMRELRDYIFITLGLISYALGWTAFLIPYQITTGGTTGIGAIIYYATGFPI QWSYFIINAVLMTFAIKILGPKFSIKTTYAIFMLTFFLWFFQLIIVDDKGAPLQLVGEGQ DFMACIIGAIMCGLGLGVVFNNNGSTGGTDIIAAIVNKYKDVTLGRMIMFCDIIIISSCY FIFNDWRRVIFGFVTLFIIGFVLDYVVNSARQSVQFFIFSKDYAKIADRITKETHRGVTV 
LDGLGWYSQNNVKVLVVLAYKRQSLDIFRLVKDIDPNAFISQSSVIGVYGEGFDRLKIK >gi|226332033|gb|ACIB01000023.1| GENE 70 73263 - 76094 2926 943 aa, chain - ## HITS:1 COG:SP0254 KEGG:ns NR:ns ## COG: SP0254 COG0495 # Protein_GI_number: 15900189 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Leucyl-tRNA synthetase # Organism: Streptococcus pneumoniae TIGR4 # 3 943 4 832 833 716 41.0 0 MEYNFREIEKKWQKIWVDNHTYQVNEDASKQKFYVLNMFPYPSGAGLHVGHPLGYIASDI YARYKRLQGFNVLNPMGYDAYGLPAEQYAIQTGQHPAITTVNNINRYREQLDKIGFSFDW NREIRTCDPEYYHWTQWAFIKMFNSYYCNDEKQARPIEELIEAFSTNGTQGMNVACGEEM DFTADEWNAKSEKEQQEILMNYRIAYLGNTMVNWCPALGTVLANDEVVDGVSERGGYPVI QKVMRQWCLRVSAYAQRLLDGLETVEWTDSLKETQRNWIGRSEGAEMNFKVKDSDIEFTI FTTRADTVFGVTFMVLAPESELVAKLTTPEQKAEVDAYLDRTKKRTERERIADRSVSGVF SGSYAINPLTNEPIPVWISDYVLAGYGTGAIMAVPAHDSRDYAFAKHFNLEIRPLIEGCD VSEESFDAKEGIMMNSPRPGAPEGGLVLNGLTVKEAIAKTKEYIKATGLGRVKVNFRLRD AIFSRQRYWGEPFPVYYKDGMPYMIDESCLPLELPEVAKFLPTETGEPPLGHATKWAWDT VNKCVTDNENIDNITIFPLELNTMPGFAGSSAYYLRYMDPRNHEALVSPAVDQYWKNVDL YVGGTEHATGHLIYSRFWNKFLHDWGISVAEEPFQKLVNQGMIQGRSNFVYRIKDTNTFV SLNLKDQYEVTPIHVDVNIVSNDILDLEAFKAWRPEYETAEFILEDGKYICGWAVEKMSK SMFNVVNPDMIVEKYGADTLRMYEMFLGPVEQSKPWDTNGIDGVHRFIKKFWSLFYDRNG EYLVKDEPATKEELKALHKLIKKVTGDIEQFSYNTSVSAFMICVNELSSLKCNKKEVLEQ LIVVLAPFAPHVCEELWDTLGNTTSVCDAQWPAFNEQYLVEDTVNYTISFNGKARFNMEF PANAASDAIQATVLADERSLKWTEGKTPKKVIVVPKKIVNIVI >gi|226332033|gb|ACIB01000023.1| GENE 71 76410 - 77486 696 358 aa, chain - ## HITS:1 COG:no KEGG:BF4585 NR:ns ## KEGG: BF4585 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 358 1 358 358 715 99.0 0 MKNKSACFFVLSLFVCSMFTSCNKESTTECQTIDFSTLFDGQPEKIPLKEWAKSIHFVQL ETNDSVLIGNIRATILHKDKILVHHNNLSLFDLSGKFICNIGSKGGGPTEYSGINNAWTD DEGIHIFDIANKIKTYNWNGKWIKTEPIPESNIKEVFPLASGNNIKAGYIQNITGNEPHK IYLFKDSTILAKIPYGKSFQKGEMTMVFYNECYPFHANGWTFFKEMFNDTIFSIDNQYQP VPRWYIELGKYKIAEDARYTLTDPRKSVFDNAATLTPIGKWDNKLFFSARANKQNYLFYY DLKEKKSNSIQISYPENSFAIPEEHSFIPKCMSDDGKYLISYEIQENDENPVIILAEK Prediction of potential genes in 
microbial genomes Time: Tue May 17 22:45:10 2011 Seq name: gi|226332032|gb|ACIB01000024.1| Bacteroides sp. 3_2_5 cont1.24, whole genome shotgun sequence Length of sequence - 57267 bp Number of predicted genes - 45, with homology - 45 Number of transcription units - 23, operones - 12 average op.length - 2.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 99 - 158 7.6 1 1 Tu 1 . + CDS 349 - 2964 2362 ## COG0249 Mismatch repair ATPase (MutS family) 2 2 Op 1 . - CDS 2965 - 3606 490 ## COG4845 Chloramphenicol O-acetyltransferase 3 2 Op 2 . - CDS 3603 - 4436 757 ## COG0682 Prolipoprotein diacylglyceryltransferase 4 2 Op 3 . - CDS 4447 - 5376 957 ## COG1893 Ketopantoate reductase - Prom 5542 - 5601 2.4 5 3 Tu 1 . - CDS 5672 - 6775 1224 ## COG0012 Predicted GTPase, probable translation factor - Prom 6864 - 6923 3.5 6 4 Tu 1 . - CDS 6996 - 8381 843 ## BF4364 putative metalloprotease - Prom 8464 - 8523 8.1 + Prom 8406 - 8465 8.8 7 5 Tu 1 . + CDS 8554 - 9366 784 ## COG0657 Esterase/lipase + Term 9393 - 9423 1.4 + Prom 9430 - 9489 4.8 8 6 Tu 1 . + CDS 9514 - 10461 832 ## PROTEIN SUPPORTED gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 + Term 10566 - 10597 -0.7 9 7 Tu 1 . - CDS 10559 - 11032 386 ## COG3467 Predicted flavin-nucleotide-binding protein - Prom 11065 - 11124 4.4 10 8 Op 1 . - CDS 11201 - 11809 373 ## BF4574 hypothetical protein 11 8 Op 2 . - CDS 11815 - 12300 423 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 12345 - 12404 4.0 - Term 12517 - 12556 2.1 12 9 Op 1 . - CDS 12710 - 12988 66 ## BF4571 hypothetical protein 13 9 Op 2 . - CDS 13001 - 15145 1237 ## PROTEIN SUPPORTED gi|15894003|ref|NP_347352.1| fused ribonuclease/ribosomal protein S1 - Prom 15232 - 15291 4.3 + Prom 15096 - 15155 4.8 14 10 Op 1 . + CDS 15249 - 16148 624 ## COG0053 Predicted Co/Zn/Cd cation transporters 15 10 Op 2 . + CDS 16170 - 17177 843 ## COG0451 Nucleoside-diphosphate-sugar epimerases 16 10 Op 3 . 
+ CDS 17168 - 18133 723 ## BF4567 hypothetical protein 17 11 Tu 1 . - CDS 18284 - 19258 463 ## PROTEIN SUPPORTED gi|148828154|ref|YP_001292907.1| ribosomal protein L11 methyltransferase - Prom 19342 - 19401 4.0 18 12 Op 1 . + CDS 19353 - 19811 460 ## BF4565 hypothetical protein 19 12 Op 2 . + CDS 19873 - 21114 1417 ## COG0826 Collagenase and related proteases 20 12 Op 3 . + CDS 21116 - 21520 517 ## COG0824 Predicted thioesterase 21 12 Op 4 . + CDS 21517 - 22638 1095 ## COG0758 Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake + Term 22655 - 22686 1.8 + Prom 23139 - 23198 3.3 22 13 Op 1 . + CDS 23238 - 24209 574 ## BF4348 putative transmembrane protein 23 13 Op 2 . + CDS 24216 - 25157 834 ## BF4559 hypothetical protein 24 13 Op 3 . + CDS 25209 - 26222 805 ## BF4558 hypothetical protein + Term 26245 - 26284 6.7 + Prom 26266 - 26325 9.3 25 14 Tu 1 . + CDS 26353 - 27198 538 ## BF4345 AraC family transcriptional regulator + Term 27229 - 27288 9.6 + Prom 27212 - 27271 5.2 26 15 Op 1 . + CDS 27332 - 29749 2210 ## COG4206 Outer membrane cobalamin receptor protein 27 15 Op 2 . + CDS 29759 - 30979 1234 ## BF4555 hypothetical protein 28 15 Op 3 . + CDS 30986 - 32116 510 ## COG3182 Uncharacterized iron-regulated membrane protein + Term 32131 - 32182 8.4 - Term 32123 - 32167 10.8 29 16 Op 1 36/0.000 - CDS 32197 - 32952 887 ## COG0479 Succinate dehydrogenase/fumarate reductase, Fe-S protein subunit 30 16 Op 2 . - CDS 32982 - 34925 2017 ## COG1053 Succinate dehydrogenase/fumarate reductase, flavoprotein subunit 31 16 Op 3 . - CDS 34963 - 35655 700 ## BF4551 fumarate reductase cytochrome b subunit - Prom 35755 - 35814 3.0 - Term 36094 - 36126 -0.6 32 17 Tu 1 . - CDS 36335 - 37210 676 ## BF4549 transcriptional regulator - Prom 37419 - 37478 4.9 + Prom 37240 - 37299 5.0 33 18 Tu 1 . + CDS 37326 - 37949 589 ## COG0671 Membrane-associated phospholipid phosphatase + Term 37958 - 38019 5.4 - Term 37945 - 38007 14.0 34 19 Op 1 . 
- CDS 38033 - 39655 1927 ## COG0793 Periplasmic protease 35 19 Op 2 . - CDS 39671 - 40129 293 ## PROTEIN SUPPORTED gi|163764798|ref|ZP_02171851.1| ribosomal protein S19 36 19 Op 3 . - CDS 40126 - 41997 1973 ## COG0187 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit 37 19 Op 4 . - CDS 42003 - 43139 1116 ## BF4544 hypothetical protein - Term 43157 - 43196 5.4 38 19 Op 5 . - CDS 43228 - 44184 855 ## COG0530 Ca2+/Na+ antiporter - Prom 44210 - 44269 5.4 + Prom 44172 - 44231 4.3 39 20 Tu 1 . + CDS 44311 - 46119 1584 ## COG0668 Small-conductance mechanosensitive channel + Term 46126 - 46170 9.9 - Term 46113 - 46158 10.1 40 21 Op 1 . - CDS 46191 - 47123 885 ## BF4541 acid phosphatase - Term 47222 - 47260 4.1 41 21 Op 2 . - CDS 47312 - 50074 2795 ## COG1629 Outer membrane receptor proteins, mostly Fe transport - Prom 50287 - 50346 8.1 + Prom 50034 - 50093 3.6 42 22 Op 1 . + CDS 50305 - 51951 1674 ## COG1621 Beta-fructosidases (levanase/invertase) + Term 52007 - 52052 -0.5 43 22 Op 2 . + CDS 52115 - 54871 2165 ## COG1879 ABC-type sugar transport system, periplasmic component + Term 54900 - 54937 4.8 - Term 54971 - 55012 10.4 44 23 Op 1 . - CDS 55041 - 55805 648 ## BF4534 hypothetical protein 45 23 Op 2 . 
- CDS 55831 - 57267 605 ## BF4533 hypothetical protein Predicted protein(s) >gi|226332032|gb|ACIB01000024.1| GENE 1 349 - 2964 2362 871 aa, chain + ## HITS:1 COG:MA0523 KEGG:ns NR:ns ## COG: MA0523 COG0249 # Protein_GI_number: 20089412 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS family) # Organism: Methanosarcina acetivorans str.C2A # 7 868 5 899 900 625 41.0 1e-178 MSNDIELTPMMKQFLDLKAKHPDAVMLFRCGDFYETYSTDAIIAAEILGITLTKRANGKG KTVEMAGFPHHALDTYLPKLIRAGKRVAICDQLEDPKTTKKLVKRGITELVTPGVSINDN VLNYKENNFLAAVHFGKSACGIAFLDISTGEFLTAEGPFDYVDKLLNNFAPKEILFERGK RGMFEGNFGSKFFTFELDDWVFTESSSREKLLKHFETKNLKGFGVEHLKNGIIASGAILQ YLDMTEHTQVGHITSLARIEEDKYVRLDKFTVRSLELIGSMNDGGSSLLHVIDKTISPMG ARLLKRWMVFPLKDEKPINDRLNVVEYFFRKPDFRELIEDELHRIGDLERIISKVAVGRV SPREVVQLKVALQAIEPIKEACQQADNPSLNRIGEQLNLCISIRDRIEKEINNDPPLLIN KGGVIKDGVDTELDELRQIAYSGKDYLLKIQQRESELTGIPSLKIAYNSVFGYYIEVRNV HKDKVPQEWIRKQTLVNAERYITQELKEYEEKILGAEDKILVLETRLYTELVQALSEFIP AIQINANQIARIDCLLSFANVAKENNYIRPVIEDNDVLDIRQGRHPVIEKQLPIGEKYIA NDVLLDNATQQVIIITGPNMAGKSALLRQTALITLLAQIGSFVPAESAHIGLVDKIFTRV GASDNISVGESTFMVEMNEASDILNNISSRSLVLFDELGRGTSTYDGISIAWAIVEYIHE HPKAKARTLFATHYHELNEMEKSFKRIKNYNVSVKEVDNKVIFLRKLERGGSEHSFGIHV AKMAGMPKSIVKRANEILKQLESDNRQQGISGKPLAEVSENRGGMQLSFFQLDDPILCQI RDEILHLDVNNLTPIEALNKLNDIKKIVRGK >gi|226332032|gb|ACIB01000024.1| GENE 2 2965 - 3606 490 213 aa, chain - ## HITS:1 COG:MA1703 KEGG:ns NR:ns ## COG: MA1703 COG4845 # Protein_GI_number: 20090555 # Func_class: V Defense mechanisms # Function: Chloramphenicol O-acetyltransferase # Organism: Methanosarcina acetivorans str.C2A # 3 212 6 206 209 89 29.0 6e-18 MKHIIDIKTWERKENYEFFLGFQNPTISITSEVECSGARTRAKAAGESFFLHYLYAVLRA VNEIKEFRFRIDSEGRVVYFDTVDMLTPIKVADNGRFFTVRLPWYPDFKTFYTEAKAIIS GIDPDKDPYEAEKTGGSDLLDVVLLSATPDLYFTSLTCTQEHRHGGNYPLMNAGKAVIRG GVLVMPIAMTIHHGFIDGHHLSLFYKKVEEFLK >gi|226332032|gb|ACIB01000024.1| GENE 3 3603 - 4436 757 277 aa, chain - ## HITS:1 COG:RP046 KEGG:ns NR:ns ## COG: RP046 COG0682 # Protein_GI_number: 
15603925 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Prolipoprotein diacylglyceryltransferase # Organism: Rickettsia prowazekii # 1 267 5 262 268 129 30.0 7e-30 MNNLLLSINWNPNPELFNLFGISIRYYGLLWAIGIFFAYIVVHYQYRDKKIDEKKFEPLF FYCFFGILIGARLGHCLFYDPGYYLNHFWEMILPVKFLPGGGWKFTGYEGLASHGGTLGL IISLWLYCRKTKMNYMDVVDMIAVATPITACFIRLANLMNSEIIGKVTDVSWAFVFERVD MQPRHPAQLYEAIAYFILFLVMMFLYKNYSKKLHRGFFFGLCLTAIFTFRFFVEFLKENQ VDFENSMALNMGQWLSIPFVIIGIYFMFFYGKKKSVK >gi|226332032|gb|ACIB01000024.1| GENE 4 4447 - 5376 957 309 aa, chain - ## HITS:1 COG:BH1763 KEGG:ns NR:ns ## COG: BH1763 COG1893 # Protein_GI_number: 15614326 # Func_class: H Coenzyme transport and metabolism # Function: Ketopantoate reductase # Organism: Bacillus halodurans # 7 297 1 287 304 103 24.0 5e-22 MESTNRLRYLIAGTGGVGGSIAGFLSLAGKDVTCIARGAHLQAIQQDGLKLKSDLKGEYA LRINACTAEEYNGKADVIFVCVKGYSVDSITELIKRAAHDRTIVIPILNVYGTGPRIQRL VPGVTVLDGCIYIVGFVSGPGEITQMGTIFRLVYGAHRGILVPTGLMEAVQRDLQESGIK VEISPDINRDTFIKWSFISAMAVTGAYFDVPMGEVQKPGKVRDTFIGLSTESAALGKKLG IEFKEDIVTYNLKVIDKLAPESTASMQKDIARGHESEVQGLLFDMITAAEEQGIDVPTYR EVAKKFIKQ >gi|226332032|gb|ACIB01000024.1| GENE 5 5672 - 6775 1224 367 aa, chain - ## HITS:1 COG:BS_yyaF KEGG:ns NR:ns ## COG: BS_yyaF COG0012 # Protein_GI_number: 16081144 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted GTPase, probable translation factor # Organism: Bacillus subtilis # 1 367 1 366 366 431 59.0 1e-120 MALQCGIVGLPNVGKSTLFNCLSNAKAQAANFPFCTIEPNVGVITVPDERLNKLAELVHP NRIVPTTVEIVDIAGLVKGASKGEGLGNKFLANIRETDAIIHVLRCFDDDNVTHVDGSVN PVRDKEIIDYELQLKDLETIESRIQKVQKQAQTGGDKAAKQAYDVLVQFKDALEQGKSAR TVTFETKDEQKIAKELFLLTSKPVMYVCNVDEASAVNGNKYVDMVREAVKDEDAEILVVA GKTEADIAELETYEDRQMFLAEIGLEESGVARLIKSAYKLLNLETYFTAGVQEVRAWTYE KGWKAPQCAGVIHTDFEKGFIRAEVIKYEDFLQYGSEAAVKEAGKLGVEGKEYVVQDGDI MHFRFNV >gi|226332032|gb|ACIB01000024.1| GENE 6 6996 - 8381 843 461 aa, chain - ## HITS:1 COG:no KEGG:BF4364 NR:ns ## KEGG: BF4364 # Name: not_defined # Def: putative metalloprotease # 
Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 461 1 461 461 939 100.0 0 MRKITLIMFTLLICAAQQVKAQTDSMLIRPTVDKRVELLSIIFRLTGNPEYNRNDFKLYT DRIESHFSPYKNHELISFARSLVKTDGVSYDAVMSMAINLDNQFNLPADYGSLDSRWNRN QVGPFIKLLKKFVKDSRFDAFYHSNENLYQEAVSRFMPIYKSIDTQWYNDFYGQKSNDRF HIILSMSNGPGNYGPSVTDKENVHNVFSVMGAWVTDSVGMVVYPPELILPILIHEFNHSF INFDPEMFRTSGEQIYAAVGEQMARQAYGQWSIVLTEAMVRAAVIKYMKDHNFPAVEITK ETVIQKTRGFVWISKLVDELEKYSSDRTTYPTLNSYMPRLAEAYTGFAQYTANYDSIRPK VVSIDEFTNGDTTVRSDIKTITVHFDRPLVGRGHSFNYGHLGMEAMPKIINVNYANDNRT VIIGVELLPGKEYGITLLGLSFRTPEGDAIKPYEISFKTAE >gi|226332032|gb|ACIB01000024.1| GENE 7 8554 - 9366 784 270 aa, chain + ## HITS:1 COG:DR0821_2 KEGG:ns NR:ns ## COG: DR0821_2 COG0657 # Protein_GI_number: 15805847 # Func_class: I Lipid transport and metabolism # Function: Esterase/lipase # Organism: Deinococcus radiodurans # 43 209 6 176 242 111 36.0 2e-24 MRKYLSFILLLIGLTLQAQETYKTVKDISYIPAGETDGYRKERCKLDVYYPVGKKDFPTI VWFHGGGLEGGGKYVPEMFMNQGFAVVAVNYRLSPKAQNPAYTEDAAAAVAWAYKHIEEY GGSPRRVFVTGHSAGGYLTLMVGLDKSYLQEYGVDADSIAAYLPISGQTVTHFTIRKERS LPEGIPVIDQYAPCNKARKDTPPFVLITGDRNLEMADRYEENALLASVLKNIGNKKVSLY ELQGFDHGQVYVPGCCLVANYIRNFIADGR >gi|226332032|gb|ACIB01000024.1| GENE 8 9514 - 10461 832 315 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 [Streptococcus pneumoniae SP6-BS73] # 4 311 3 307 308 325 56 5e-88 MGKIAKKLTDLVGNTPLLELSNYNASNGLKARVVAKLEYFNPAGSVKDRISLAMVEDAEA KGTLKSGATIIEPTSGNTGVGLAFVAAAKGYKLILTMPDTMSMERRNLLKALGAELVLTP GANGMKGAIAKAEELRAATPGSVILQQFENPANPAVHVRTTGQEIWRDTDGKVDIFVAGV GTGGTVSGVGAALKEHNPAVRIVAVEPVDSPVLSGGAPGPHKIQGIGAGFIPKTYHAAVV DEIRQVGNDDAIRTSRELAAKEGLLVGISSGAAVYAATELAKLPENEGKLIVVLLPDTGE RYLSTILYAFEEYPL >gi|226332032|gb|ACIB01000024.1| GENE 9 10559 - 11032 386 157 aa, chain - ## HITS:1 COG:MA2197 KEGG:ns NR:ns ## COG: MA2197 COG3467 # Protein_GI_number: 20091038 # Func_class: R General function prediction only # Function: Predicted flavin-nucleotide-binding protein # Organism: Methanosarcina acetivorans str.C2A # 5 
150 6 149 152 96 34.0 2e-20 MKTVIIENKEQVEEIISRCDICFVGITDLEGNPYVIPMNFGYQDGVIYLHSGPTGSSIDM LARNNRVCITFSVDHELVFQHPKVACSYRMRAKSVICRGRVNFIEDPEEKREALNILMRH YSSREFVYSDPAVKNVKIWEIPIDSVTAKEYAVPHTK >gi|226332032|gb|ACIB01000024.1| GENE 10 11201 - 11809 373 202 aa, chain - ## HITS:1 COG:no KEGG:BF4574 NR:ns ## KEGG: BF4574 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 202 1 202 202 379 100.0 1e-104 METNDIKETWKAGIERTIKPYPEERLNEMVVNSARKSIKTVYPGTVFRLVIIAVAVFLIV SQFVKEQNATRMYLDMGALAVLSVSYFLWERSAYKMRKYTHGMPVKEWLEYRIKEIEKSI RFNTKYDWIVYTCSFLSAIGFYVFYLMATNVVPGILNVIVIPLGMCIYLLIIKRSLKRNY RRTLQELKELYRQFEIEDQEGR >gi|226332032|gb|ACIB01000024.1| GENE 11 11815 - 12300 423 161 aa, chain - ## HITS:1 COG:DR0180 KEGG:ns NR:ns ## COG: DR0180 COG1595 # Protein_GI_number: 15805216 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Deinococcus radiodurans # 10 155 64 218 229 63 28.0 2e-10 MKSNKNEFLKLLTAYQGIIHKVNRIYFRSEADREDNFQETVYQLWRSFPALQNKEKPASW IYTVAINTSISKVRKDSRIEFRDSPPDGEPVDPWEQQEQDENWQRLVNALQKLNEIDKSI MLLYMEDYSYEEIAGIVGISSSAVGVKIHRLKGQLQKQFKK >gi|226332032|gb|ACIB01000024.1| GENE 12 12710 - 12988 66 92 aa, chain - ## HITS:1 COG:no KEGG:BF4571 NR:ns ## KEGG: BF4571 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 92 1 92 92 148 97.0 8e-35 MIIVFAQIGLFTAKSMGTKKNRFPYGLAIFFLLSQSKKKQPNRKQKQEKQEQNGRKKEYM KVFYLSGSLLHSRATGGAYSLILLQNRDKERT >gi|226332032|gb|ACIB01000024.1| GENE 13 13001 - 15145 1237 714 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|15894003|ref|NP_347352.1| fused ribonuclease/ribosomal protein S1 [Clostridium acetobutylicum ATCC 824] # 86 714 70 703 730 481 40 1e-135 MAKKKEKKAGKRMKKKELSKAVLDFFHAKQDEVISLKYIFSELKLTTHPLKMLCMDILSD LLADDYITEVDKNKYKLNNHGIEMTGTFQRKSNGKNSFIPEGGGDPIFVAERNSAHAMNN DKVRIAFYAKRRGCEAEGEVIEILQRANDTFVGTLEVEKSYAFLVTENRTLANDIFIPKD KLKGGKTGDKAVVKVTEWPDKAKNPIGQVLDILGKAGDNTTEMHAILAEFGLPYVYPQSV 
EKAADKIPAEISAEEIARREDFRKVTTFTIDPKDAKDFDDALSIRPLKDGLWEVGVHIAD VTHYVKEGSIIDKEAEKRATSVYLVDRTIPMLPERLCNFICSLRPNEEKLAFSAIFDITE KGEVRDSRIVHTVIESDRRFTYEEAQQIIETKEGDFKDEILMLDTIAKALREKRFTAGAI NFDRYEVKFEIDEKGKPISVYFKESKDANKLVEEFMLLANRTVAEKIGKAPKGKKPKVLP YRIHDLPDPEKLDNLAQFIARFGYRLRTSGTKTDVSKSINHLLDDIQGKKEENLIETVSI RAMQKARYSTHNIGHYGLAFDYYTHFTSPIRRFPDMMVHRLVTRYMDGGRSVSETKYEDL CDHSSNMEQIAANAERASIKYKQVEFMSERLGQTYDGVISGVTEWGLYVELNENKCEGMI PIRDLDDDYYEFDEKNYCLRGRRKNRIYSLGDAITVKVARANLEKKQLDFALVE >gi|226332032|gb|ACIB01000024.1| GENE 14 15249 - 16148 624 299 aa, chain + ## HITS:1 COG:MA0617 KEGG:ns NR:ns ## COG: MA0617 COG0053 # Protein_GI_number: 20089506 # Func_class: P Inorganic ion transport and metabolism # Function: Predicted Co/Zn/Cd cation transporters # Organism: Methanosarcina acetivorans str.C2A # 2 297 17 311 331 277 52.0 2e-74 MESEKSSREKGIYKVTIVGSIVNFLLLVFKFFAGIAGHSAAMLADAVHSLSDFITDIVVI VFVRIAGKPEDKGHDYGHGKYETLATAIIGLLLLCVGFGIFWNGASSIYTFLRGGQLESP GVVALVAALVSIVSKEILYQYTVIQGKKLNSQAVIANAWHHRSDALSSIGTAIGIGGAIL LGDHWRVLDPVAAVVVSFFIMKVSVRLLIPCVDELLEKSLPEDVEKEIEQTVLSFPGVSQ PHHLRTRRIGNYYAIELHVRMDGKITLEEAHSTATAIENKLKEMFGKGTHVGIHVEPTK >gi|226332032|gb|ACIB01000024.1| GENE 15 16170 - 17177 843 335 aa, chain + ## HITS:1 COG:PAB2145 KEGG:ns NR:ns ## COG: PAB2145 COG0451 # Protein_GI_number: 14520521 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Pyrococcus abyssi # 4 331 6 303 307 87 28.0 3e-17 MESILITGASGFIGSFIVQEALKRRFGVWAGIRASSSKKYLKERKIHFLELDFAHPNELR AQLSGHKGTYNKFDYIVHCAGVTKCADKSDFDRVNYLQTKYFVDTLRELNMIPKQFIYIS TLSVFGPIREKDYSPISEEDTPAPNTAYGLSKLKAELYIQSIPGFPYVIYRPTGVYGPRE ADYFLMAKSIRQHTDFSVGYKRQDLTFVYVKDIVQAIFLGIEKEVSRRAYFLSDGKVYKS RAFSDLIQKELGDPFVIHVKCPLIVLKVVSLLAEFIATRSGKSSTLNSDKYKIMKQRNWQ CDITPAVKELGYAPEYDLEKGVKETIAWYKNEGWL >gi|226332032|gb|ACIB01000024.1| GENE 16 17168 - 18133 723 321 aa, chain + ## HITS:1 COG:no KEGG:BF4567 NR:ns ## KEGG: BF4567 # Name: not_defined # 
Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 321 1 321 321 571 100.0 1e-161 MALDLFKRVESRKGLFAVEKITLIYNLLTSILILFMFQRMDHPLHMLWDRAVIAAMTFLL MYLYRLAPCKFSAFVRIAIQMSLLSYWYPDTFEFNRVFPNLDHLFATAEQWMFGGQPAVW FCHAFPQMWVSEPFNMGYFAYYPMILVVTLFYFIYRFDLFEKMSFVLVTCFFIYYLIYIF VPVAGPQFYFPAIGMDSVSQGVFPSIGDYFNHNQELLPGPGYQHGFFYSLVEGSQQVGER PTAAFPSSHVGVSTILMIMAWRASKKLFACLMPFYLLLCGATVYIQAHYLIDAIAGFVSA FVLYVLVTKMFKKWFAVPMFK >gi|226332032|gb|ACIB01000024.1| GENE 17 18284 - 19258 463 324 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148828154|ref|YP_001292907.1| ribosomal protein L11 methyltransferase [Haemophilus influenzae PittGG] # 1 287 1 284 326 182 37 3e-45 MKIGHIDLGERPVFLAPMEDVTDPAFRLMCKKFGADMVYTEFVSSDALIRSVNKTTQKLT ICDEERPVAIQIYGKDTEAMVGAARIVEEAQPDILDINFGCPVKKVAGKGAGAGMLQNIP KMLEITRAVVDAVKIPVTVKTRLGWDADHKIIVDLAEQLQDCGIAALAIHGRTRAQMYTG EADWTLIGEVKNNPRMHIPIIGNGDVTTAAGAKECFERYGVDAIMIGRGSIGRPWIFREV KHYLETGEELPRESFEWYLDVLREEVLNSVARLDERRGIIHIRRHLAATPLFKGIPNFRE TRIAMLRTESVEELFRIFDGLTTE >gi|226332032|gb|ACIB01000024.1| GENE 18 19353 - 19811 460 152 aa, chain + ## HITS:1 COG:no KEGG:BF4565 NR:ns ## KEGG: BF4565 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 152 1 152 152 268 99.0 5e-71 MKKRLFSIALCLTFILSATSVSAQDAAYREALSKMLEASGAMTTVKSMVPQMIGMMKHTY SNVPDEFWKSFEEQLSEKANTQFLDIYVSIYHRYLTISDLKKITAFYESPVGKKLAESTP VMTAEAMEAGQQIGMGIAKEIMANLKEKGYVQ >gi|226332032|gb|ACIB01000024.1| GENE 19 19873 - 21114 1417 413 aa, chain + ## HITS:1 COG:aq_1015 KEGG:ns NR:ns ## COG: aq_1015 COG0826 # Protein_GI_number: 15606313 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Collagenase and related proteases # Organism: Aquifex aeolicus # 1 405 7 401 409 265 36.0 1e-70 MAPVGSRESLAAAIQAGADSIYFGIENLNMRARSANTFTIEDLREIARTCDEHGMKSYLT VNTIIYDKDIELMRTIVDAAKEAGISAVIAADVAVMSYANKIGQEVHLSTQLNISNAEAL KFYAQFADVVVLARELNLEQVAEIYRQIQEEHICGPSGELIRIEMFCHGALCMAVSGKCY 
LSLHEMNHSANRGACMQICRRAYTVRDKDTDVELEVDNQYIMSPKDLKTIHFMNKMIDAG VRVFKIEGRARGPEYVRTVVECYKQAIRAYLDDSFTDEKIAAWDERLKTVFNRGFWDGYY LGQRLGEWTKNYGSAATERKIYVGKGIKYFSNIGVAEFLVEAAEVSVGDKLLITGPTTGA VFATLDEARVDLKPVETVKKGEHFSMKLDKIRPSDKLYKLVSTEELKKFKGLE >gi|226332032|gb|ACIB01000024.1| GENE 20 21116 - 21520 517 134 aa, chain + ## HITS:1 COG:CC3234 KEGG:ns NR:ns ## COG: CC3234 COG0824 # Protein_GI_number: 16127464 # Func_class: R General function prediction only # Function: Predicted thioesterase # Organism: Caulobacter vibrioides # 6 118 14 126 147 60 32.0 8e-10 MEKYIYELTLKVRDYECDLQGIVNNANYQHYLEHTRHEFLSSVGVSFAKLHEEGVDPVVA RINMAFKTPLKSGDEFVSKLYMKKEGIKYVFYQDIFRKSDDKVVVKSTVETVCVVNGRLS DSELFDQIFAPYLQ >gi|226332032|gb|ACIB01000024.1| GENE 21 21517 - 22638 1095 373 aa, chain + ## HITS:1 COG:FN1068 KEGG:ns NR:ns ## COG: FN1068 COG0758 # Protein_GI_number: 19704403 # Func_class: L Replication, recombination and repair; U Intracellular trafficking, secretion, and vesicular transport # Function: Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake # Organism: Fusobacterium nucleatum # 84 370 11 285 288 192 38.0 8e-49 MSTDEERIYSIALTQIPGVGHIGAKRLVDGMGSATDVFRYRTELPDRLPGVNRAVVAALD SPQALKRAEQEYEFARNNRISCFTLADEDYPSRLRECDDAPVVLFFKGKANLNALHIINM VGTRNATDYGKHICVNFLQELQMQCPDVLVVSGLAYGIDINAHRSALSVELPTVGVLAHG LDRIYPSLHRKTAIDMLRQGGLLTEFLSGTNPDKHNFISRNRIVAGISDATIVVESAAKG GSLITADIAGSYHRDCFAFPGRVTDEYSKGCNQLIQDNKAVLLESASDFVKAMGWDSDLK SVKPETVQRNLFPDLSAEELRIVDILGKLGDLQINALMVQADIPINKISAILFELEMKGV IRVLAGGVYQLLR >gi|226332032|gb|ACIB01000024.1| GENE 22 23238 - 24209 574 323 aa, chain + ## HITS:1 COG:no KEGG:BF4348 NR:ns ## KEGG: BF4348 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 323 17 339 339 652 100.0 0 MENRICILTLFVFLFLLSGESRAMVAMPDAWTRMMIGVNQKDWSLLRERVQADLHVPDRE DVLMLIDYHRDDRMKCENLLRRLNGGEAYRYIMNKILPLLYVYREHSPVPIPDDKAAGVF LSTVNVLPRVKFVHPQPVVPEEGKIETPVIIEQRTVLALKNNLLYDLALAPNIEVEIPLN 
RRWSVNAEYKCPWWLNSSREFCYQLLSGGVEGRCWLGNRKRRNRLAGHFIGAYAEGGIYD FQFKGDGYQGRYYAASGLTYGYAKQIARHLSFEFSLGIGYLTTEYKKYTPYEGDLVWKSS ARYNFIGPTKAKVSLVWLITARR >gi|226332032|gb|ACIB01000024.1| GENE 23 24216 - 25157 834 313 aa, chain + ## HITS:1 COG:no KEGG:BF4559 NR:ns ## KEGG: BF4559 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 313 1 321 321 414 67.0 1e-114 MKDITRLLCMVLLSFLVVTGCSRRDILDDYPVSGVEVRLDWSGVTEKLPEGVRIIFYPKG DNGRKVDTYLPVKGGHVGVPPGIYSVVVYNYDTETVLVRGDESYETIETYTGLCNFGIAG TEKMVWGPEDFYTTCVDELMVEKSDESLILKLSPKLVVKTYHFSIKVTGIKNISQVFGSV VGMADHYLLGKSFSLCDGCPIYFDVVRGKDTIEGSFTTFGISQIVRTRAENTEVSLNLLL LKVDDTVQEVKVDVTEVIHKSEAGGETDKPEIDVPIEDEIKVDDVETPPDGNGGMNGDVD DWEDETDIVIPVE >gi|226332032|gb|ACIB01000024.1| GENE 24 25209 - 26222 805 337 aa, chain + ## HITS:1 COG:no KEGG:BF4558 NR:ns ## KEGG: BF4558 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 337 1 330 330 151 38.0 4e-35 MKKILFAAVAALAITGCSQNEEIEKAAKTAEIGFNAIVKNTTRAAVTNMGNLGNFKVHSY ITGTAFDGTTAELGTAYMNGVLFETIDNTTWTKATSDTKTYYWPSDASKSVQFFAYPSTL ISDFSIPDTGTAGYPSFNYTVDKEADEPGDLVVAYESNKTATSEGVNAGKLTLNFKHILS RINFAYIPGNTNLIYTVTAVKIADIKGGTAKYTFSASNGAWDVTSGTSKEYTYTVTQSPN VVENKSYYMLGGEDASLMLFPQDVAKKVITVTYTSKDDDGVQVFSGDKTVTLPDNSKWEV GKNILYILTLPAGGTEMTVTPKVSEWNAAEDKEQTAQ >gi|226332032|gb|ACIB01000024.1| GENE 25 26353 - 27198 538 281 aa, chain + ## HITS:1 COG:no KEGG:BF4345 NR:ns ## KEGG: BF4345 # Name: not_defined # Def: AraC family transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 281 1 281 281 525 100.0 1e-148 MPSSVCGDPMMCQNCKKSVPNAIFHVYHRRGFHYPAQKCEENFILFLIKGEMLVNSREYA GTMLKAGEFMLQPISSKIEMLAMTDVECIYYQFNQPELFCDIRYNRIMKETDPPLIPSPL PIIPELQHFLESARTYLSEKKICRDLLSLKRKELAFILGYFYSDYDLASLVHPLSKYASS FECFVYQNYKKVKTVEEFAKLGGYSQTTFRRIFDNVFHEPVYEWMLSRRKEEIIYELQNT EATISEICYKFGFESLPHFSNFCKKSFGTSPRSIRLKRSSE >gi|226332032|gb|ACIB01000024.1| GENE 26 27332 - 29749 2210 805 aa, chain + ## HITS:1 COG:BMEI0657 KEGG:ns NR:ns ## 
COG: BMEI0657 COG4206 # Protein_GI_number: 17986940 # Func_class: H Coenzyme transport and metabolism # Function: Outer membrane cobalamin receptor protein # Organism: Brucella melitensis # 118 243 14 141 599 65 32.0 4e-10 MNHKVLYRVALFCLFSMLSAVLMHAGEQQQQSKGIVTGRIVDQQKQPYYPVAVAIEGVYI GGYTNENGVYHINDVPAGSQTIVVSGIGVKTKKVPIHVTAGKVNRIPDIEIDTQAEELEE VQVIGKSEARRQQEQAYAISVLDIKKAYNSAAPLNKLLNNVSSVRIREEGGMGSNYNFSL NGFSGNQVKFFLDGIPMDNFGSSFNLANISANMAERVEVYKGVLPVNLGADALGGAVNIV SRRDANYLDATYSFGSFNTHKVSVNGAYTHLKTGFTVRANAFYNYSDNDYKVFVPIIDLA TNKKIDERWVKRFNDAYRSGGIRLETGITNKPYADYLLAGIILSKNDKDVQTGATMDAVY GGVKMKSESVIPSIRYKKDDLFLDGLSLSLYGTYNSVNTFNVDTIARRYNWLGESVPSTS AGEGYYTDSKIKNREWLGNGNISYVIDGHQSLILNHVVSAMRRTMNDKVRPDDENNNVPQ QLTKNITGLGWQIRYDRWNANVFGKMYKLYSSTYKRLDEYTENARWEKVRDHKTNFGYGA AATYYILPSLQAKFSYEHAYRLPESIEMFGDGLIQQRNPDLKPESSRNLNLGLSFIQTFG AHQLSADGNFIYRYTTDFILKGVSLTSNPTTGYENLGKVLTKGVEAAVRYNYKDLFHTGA GFTYQDITDRQRYEKTKDSFVGEGITENITYKERLPNIPYLFANADAGVRFHDLIWRNSV LTFDYNLNYIHSYYLSFPGLGAKSSKKVIPEQFSHDLALGYSMDNGKYSVVVECTNLTNQ KLYDNYRLQKPGRAFNVKLRYFFSK >gi|226332032|gb|ACIB01000024.1| GENE 27 29759 - 30979 1234 406 aa, chain + ## HITS:1 COG:no KEGG:BF4555 NR:ns ## KEGG: BF4555 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 406 1 406 406 803 99.0 0 MITMYRIYMYLLAACLLLGVTACEEEGLGNEETPFAPYVLSLGINSNGTTTYYVVTAPEL MSGTINAVAKGIEQNGYRDYEQAGQTVFSIGGLGLTSATGIVRDANGYLTERGDFVFNSS LNAFTQMDGQNMIGLELPANKESGDQMTLYTVNISDVSITSQVKAPVFPLNQLEWPSITG MCYSEGNVYVTYFPMNPSTFETLYTDTTFVAVYSYPDMQFKTLMKDTRTGPAGSWNAFNG IFKVESGDMYIMSNSAIANGFSQSTKNAAFLRIPKGETHFDDYYFDFETVSGGLKPAHIK YIGNGLVFAEVSTIGPQTSADRWGDKSLKCCIIDLNNKTVRDIKEIPVHNGDGGRRFAAL VDGGYVYRPVTTSEGTYIYQVDPQAATAVRGAKVSTTFVGGFFRLD >gi|226332032|gb|ACIB01000024.1| GENE 28 30986 - 32116 510 376 aa, chain + ## HITS:1 COG:BH0982 KEGG:ns NR:ns ## COG: BH0982 COG3182 # Protein_GI_number: 15613545 # Func_class: S Function unknown # Function: Uncharacterized iron-regulated membrane protein # Organism: Bacillus halodurans # 11 
376 27 404 461 90 23.0 4e-18 MNKTFRKACAKLHLWLGLLSGIVVFIVCITGCMYAFKDEITDATQPWRFVAPLEKEFLPP SRLLAIADSVMGGASATAITYGERSDAAWVDYYQPEAGMSTVFINPYSGQVLKSVVNHNG DFDFFRLVLSGHRTLWLPREIGSPIVGYSVLIFLITLITGVILWWPRSWTRKALVQRLTL KRPFTFSRLNFDLHNVAGFYAALVLAVLCFTGLIFSLNWFSRGVYSITSGGEELKPYVLP VSDTLQVVNRVTNPLDRLYTQLRLEEPAAKTFYFALPGQADGVYRVSVVHKRGSYYRTDN LFFDRYTLVSLKGAGPYAGKYTEASPADKFRRMNLEIHDGRIWGLPGKIIMFLASLTGAS LPVTGFIIWYRKRRKR >gi|226332032|gb|ACIB01000024.1| GENE 29 32197 - 32952 887 251 aa, chain - ## HITS:1 COG:Cgl0368 KEGG:ns NR:ns ## COG: Cgl0368 COG0479 # Protein_GI_number: 19551618 # Func_class: C Energy production and conversion # Function: Succinate dehydrogenase/fumarate reductase, Fe-S protein subunit # Organism: Corynebacterium glutamicum # 5 243 1 238 249 241 46.0 1e-63 MDKNISFTLKVWRQKGPKAKGAFETYQMKDIPGDTSFLEMLDILNEQIINDGGEPIVFDH DCREGICGMCSLYINGHPHGPATGATTCQMYMRRFKDGDTITVEPWRSAGFPVIRDLMVD RSAYDKIMQAGGYVSVNTGAPQDANAILISKDIADEAMDAAACIGCGACVAACKNGSAML FVSAKVSQLNLLPQGKVEAARRAKAMLSKMDELGFGNCTNTRACEAECPKNISISNIARL NRDFIIAKLKD >gi|226332032|gb|ACIB01000024.1| GENE 30 32982 - 34925 2017 647 aa, chain - ## HITS:1 COG:Cgl0367 KEGG:ns NR:ns ## COG: Cgl0367 COG1053 # Protein_GI_number: 19551617 # Func_class: C Energy production and conversion # Function: Succinate dehydrogenase/fumarate reductase, flavoprotein subunit # Organism: Corynebacterium glutamicum # 2 646 26 673 673 619 49.0 1e-177 MTKIDSKIPEGQLAEKWSNYKAHQKLVNPANKRRLDIIVVGTGLAGASAAASLGEMGFRV FNFCIQDSPRRAHSIAAQGGINAAKNYQNDGDSVYRLFYDTIKGGDYRAREANVYRLAEV SNAIIDQCVAQGVPFARDYGGTLDNRSFGGAQVSRTFYARGQTGQQLLLGAYSALSRQVQ KGTVKLYTRYEMLDLVVIEGRARGIIARNLVTGEIERFAAHAVVIGTGGYGNAFFLSTNA MGSNGSVAIQCYKKGAYFANPCFAQIHPTCIPVHGDKQSKLTLMSESLRNDGRIWVPKKI EDAKALQAGTKKPTEIPDEDRDFYLERRYPAFGNLVPRDVASRAAKERCDAGFGVNNTGL AVFLDFKYAIDRLGEDVVRARYGNLFDMYEEITDENPYKTPMMIFPAIHYTMGGIWVDYE LMTSIPGLFAIGEANFSDHGANRLGASALMQGLADGYFVLPYTIQNYLADQIQVPRFSTD LPEFAAAEKEVKDKIQKIKAVNGKHSVDSIHKKLGHIMWDFVGMARTKESLQKALTGIEE 
VKKDFWTNVRIPGDVNELNVELEKALRLIDFIEVGMLMARDGLNREESCGGHFRTEYQTP EGEALRDDKNFSYVACWKYTGENSEPELIKEDLNYQFVKVQTRNYKS >gi|226332032|gb|ACIB01000024.1| GENE 31 34963 - 35655 700 230 aa, chain - ## HITS:1 COG:no KEGG:BF4551 NR:ns ## KEGG: BF4551 # Name: not_defined # Def: fumarate reductase cytochrome b subunit # Organism: B.fragilis # Pathway: Citrate cycle (TCA cycle) [PATH:bfr00020]; Oxidative phosphorylation [PATH:bfr00190]; Benzoate degradation via CoA ligation [PATH:bfr00632]; Butanoate metabolism [PATH:bfr00650]; Metabolic pathways [PATH:bfr01100]; Biosynthesis of secondary metabolites [PATH:bfr01110] # 1 230 1 230 230 380 100.0 1e-104 MWLSNSSVGRKVVMSVTGIALVLFLTFHMAMNLVAIISAEGYNLVCEFLGANWYALVATL GLAALFVIHIIYAFWLTIQNRAARGSERYAVVDKPKTVEWASQNMLVLGIIVILGLGLHL FNFWAKMQLPELVHNMGGVADTTYAADGVYHIMNTFSNPVYVVLYLVWLGALWFHLTHGF WSSMQSLGWNNKIWINRWKCISNIYSTIVVVCFALVVVVFFVKSLACGAC >gi|226332032|gb|ACIB01000024.1| GENE 32 36335 - 37210 676 291 aa, chain - ## HITS:1 COG:no KEGG:BF4549 NR:ns ## KEGG: BF4549 # Name: not_defined # Def: transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 291 1 291 291 597 100.0 1e-169 MARETQCNRQTDCAQCPKATEGILVHRKFPKGQHFPPDKCTQNCILFILQGELLVNSEEY PGTTLREGQFILQSIGSKLELLALTDVNYVVYWFNEPPLICEQRYHEILQQSEAPLTYTP LVMTQRITNFMKDICDYLDEQMPCGAFIDLKCQELMYLIICYYPIPQLSKFFYPISSYTE SFQYFVMQNYEKVKNVEEFAHLGGYTTTTFRRLFKNMYGVPVYEWILSKKREGILEDLQH TKQRITEISNRYGFDSLSHFAHFCKASFGDSPRALRTRAARGEKITALKTE >gi|226332032|gb|ACIB01000024.1| GENE 33 37326 - 37949 589 207 aa, chain + ## HITS:1 COG:MJ0374_2 KEGG:ns NR:ns ## COG: MJ0374_2 COG0671 # Protein_GI_number: 15668550 # Func_class: I Lipid transport and metabolism # Function: Membrane-associated phospholipid phosphatase # Organism: Methanococcus jannaschii # 86 182 60 154 168 71 44.0 9e-13 MKKFVLICFCVLSSVGGYAQNWDINTLHKINSLDSKFARNYSKAFSKSAPYIAVGVPVAM AVYAGIDKDKELLKDAIYIGTSVAEAVVITYGMKYAFDRERPYDRYPDRVDARSHESSPS FPSGHTAAAFSLATSLSIRYPKWYVIAPSAFWACSVGFSRMNEGVHYPSDVAAGAVIGAG CAVANIYVNRWLNKWLFGEKKKVTISY 
>gi|226332032|gb|ACIB01000024.1| GENE 34 38033 - 39655 1927 540 aa, chain - ## HITS:1 COG:XF2704 KEGG:ns NR:ns ## COG: XF2704 COG0793 # Protein_GI_number: 15839293 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Xylella fastidiosa 9a5c # 36 340 84 386 508 217 41.0 5e-56 MKRIIYLLIGLCSVLGLQAQNFGSPAMRKLQLAEFAISNLYVDTVNENKLVESAIIEMLA QLDPHSTYSDAEEVKKMNEPLQGNFEGIGVQFQMIEDTLLIVQPVSNGPSEKVGILAGDR IIAVNDTAIAGVKMGTEEIMGRLRGPKDSKVNLTIIRRGVKEPLLFNVKRDKIPILSLDA AYMIQPKIGYIRINRFGATTAEEFLKALKELQKKGMKDLILDLQGNGGGYLNAAIDLANE FLGQKELIVYTEGRSAQRSEFFAKGNGNFRNGRLVVLVDEYSASASEIVTGAIQDWDRGV VVGRRSFGKGLVQRPIDLPDGSMIRLTIARYYTPAGRCIQKPYDSSINEKPGKGKSSESS IEKYNQDLIDRYNHGEMVSADSIHFPDSLKCQTKKLGRTVYGGGGIMPDYFVPVDTTLYT DYHRNLVAKGVVIKTTMNFIEKNRKALLDKYKTFEKFNEKFEIDDQLLNYLREAADKEKI EFNEEQYNKALPLIKAQLKALIARDLWDMNEYFQVMNATNKSVERALEILNDKEYEKILK >gi|226332032|gb|ACIB01000024.1| GENE 35 39671 - 40129 293 152 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764798|ref|ZP_02171851.1| ribosomal protein S19 [Bacillus selenitireducens MLS10] # 1 151 1 152 164 117 38 1e-25 INMRRAIFPGTFDPFTIGHYSVVQRTLTFMDEVVIGIGINENKNTYFPIEKRVEMIRKFY KDEPRIKVESYDCLTIDFARQVDAQFIVRGIRTVKDFEYEETIADINRKLAGIETILLFT EPELTCVSSTIVRELLGYNKDISMFIPKGMEM >gi|226332032|gb|ACIB01000024.1| GENE 36 40126 - 41997 1973 623 aa, chain - ## HITS:1 COG:CT661 KEGG:ns NR:ns ## COG: CT661 COG0187 # Protein_GI_number: 15605394 # Func_class: L Replication, recombination and repair # Function: Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit # Organism: Chlamydia trachomatis # 16 617 7 602 605 561 50.0 1e-159 MEENELIPVDNNPVEYTDDNIRHLSDMEHVRTRPGMYIGKLGDGSHPEDGIYVLLKEVID NSIDEFKMQSGKKIEIRVEENLRVSVRDYGRGIPQGKLIEAVSVLNTGGKYDSKAFKKSV GLNGVGVKAVNALSSNFEVRSYRDGKVRCATFTKGELVTDHTEDTEEENGTYIFFEPDET LFLNYSFRPEFIETMLRNYTYLNTGLAIIYNGQRILSRNGLVDLLNDNMTATGLYPIVHL KGEDIEIAFTHTGQYGEEYYSFVNGQHTTQGGTHQSAFKEHIARTIKEFYNKNQDYTDIR NGLVAAIAVNVEEPMFESQTKIKLGSTNMSPGGITVNKFVGDFIKQEVDNFLHKHADIAE 
IMLQKIQDSEKERKAIAGVTKLARERAKKANLHNRKLRDCRVHLNDVKGKGLEEESCIFI TEGDSASGSITKSRDVNTQAVFSLRGKPLNSFGLTKKVVYENEEFNLLQAALNIEDGIEG LRYNKVIVATDADVDGMHIRLLLITFFLQFFPDLIKKGHVYILQTPLFRVRNKKKTLYCY TEEERVNAIKELSPNPEITRFKGLGEISPDEFRHFIGKDMRLEQVSLRKTDTVKELLEFY MGKNTMERQNFIIDNLVIEEDIA >gi|226332032|gb|ACIB01000024.1| GENE 37 42003 - 43139 1116 378 aa, chain - ## HITS:1 COG:no KEGG:BF4544 NR:ns ## KEGG: BF4544 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 378 1 378 378 749 100.0 0 MAITIKKVSTKRELKKFIRFNYELYKENPYSVPDLYDDMLNTFNKKKNAAFEFCEAEYFL AYKDGKIVGRIAGIINHRANATWNKKDVRFGWIDFIDDLEVSSRLLQTVEEWGKSKGMEN IQGPLGFTDFDAEGMLIEGFDQLSTMATIYNHPYYPQHMEKLGFEKDADWVEYKIYIPDA IPEKHQRISDLIQRKYNLKIKKYTSSRKIAADYGQAIFELMNEAYSPLYGYSPLSQRQID QYVKMYLPIVDLRMVTLITDAEDKLIAVGISMPSLSEALQKSHGRLLPLGWYYLLKALFM KRRAKMLDLLLVAVKPEYQNKGVNALLFSDLIPVYQKLGFIFAESNPELEMNGKVQAQWE YFKTEQHKRRRAFTKKID >gi|226332032|gb|ACIB01000024.1| GENE 38 43228 - 44184 855 318 aa, chain - ## HITS:1 COG:BH0465 KEGG:ns NR:ns ## COG: BH0465 COG0530 # Protein_GI_number: 15613028 # Func_class: P Inorganic ion transport and metabolism # Function: Ca2+/Na+ antiporter # Organism: Bacillus halodurans # 16 317 16 317 318 217 45.0 2e-56 MDILLLIGGLLLILIGANCLTDGAASVAKRFRIPSIVIGLTIVAFGTSAPELTVSVSSAL KGSADIAVGNVVGSNIFNTLMIVGCTALFAPIVITRNTLRKEIPLCILSSIVLLICANDV FLNKASSNILSISDGLILLCFFTIFLGYTFAIASPTNNTQPEEEIKSLPMWKSVLFILGG LAGLIFGGQWFVEGASNIARHLGVSESVIGLTLVAGGTSLPELATSIVAALKKNPEIAIG NVIGSNLFNIFFVLGCSASITPLRLTGINNFDLFTLVGSGILLWFFGLFFAKRTITRIEG SILVLCYIAYTTYLIYQI >gi|226332032|gb|ACIB01000024.1| GENE 39 44311 - 46119 1584 602 aa, chain + ## HITS:1 COG:sll0590_2 KEGG:ns NR:ns ## COG: sll0590_2 COG0668 # Protein_GI_number: 16331818 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Small-conductance mechanosensitive channel # Organism: Synechocystis # 354 596 1 245 264 252 50.0 1e-66 MKKNRLFGLLLFLLVTNNMCAQLGKAVREILTGDSVATRTAVRNDSDSVRMADMKKELEE 
ARLSEANMRMEMEQMRLKAFAADSVKLAQQRARIDSLRQFTPGVPVVVEGDTLFYLYTKR GGYTPLQRAEMIDAAIMQLGKRFTLHPDSVYIESSDIVTDLMYGNKVIASFTDQDGLWEG RSREQLATDKRKIVVQKLKELKEEHSLWQLGKRILYFVLVLAGQYLLFWLTGWLFRKLKV RIQKLKDTRLKPISIQNYELLDTQRQVNLLIFLSNLLRYVIMLLQLLITVPLLFAIFPQT KGLAYQIFSYIWNPIKNILVGIVDYIPNLFAILIICFAVKYLVRLVHYLSREVEAGRLKF GGFYPDWAMPTYHIIRFLLYAFMIAMIYPYLPGAKNGVFQGISVFVGLIISLGSSTVIGN VIAGLVITYMRPFKLGDRIQLNDTTGNVIEKTPLVTRIKTPKNEVVTIPNSFIMSSHTVN YSASAREYGLIIHSEVTIGYDVPWRQVHQLLIEAALNTPGVIDDPRPFVLETSLSDWYPV YQINAYIREADKLAQIYSDLHQNIQDRFNEAGVEIMSPHYMAMRDGNESTIPKDDLRPKT DK >gi|226332032|gb|ACIB01000024.1| GENE 40 46191 - 47123 885 310 aa, chain - ## HITS:1 COG:no KEGG:BF4541 NR:ns ## KEGG: BF4541 # Name: not_defined # Def: acid phosphatase # Organism: B.fragilis # Pathway: not_defined # 1 310 1 310 310 660 100.0 0 MKLKNLLILLFISIVATASAQLKDYSMFDKKFNFYIANDLGRNGYYDQKPIAELMGTMGE EIGPEFVLAAGDVHHFEGVRSVNDPLWMTNFELIYSHPELMIDWYPVLGNHEYRGNTQAV LDYSGVSRRWTMPARYYTKTFEEKGATVRIVWIDTAPLIDKYRNESATYPDACHQDMNGQ LAWLDSVLTVAKEDWVIVAGHHPIYAETPKDQSERSDLQSRLDPILRKHKVDMYICGHIH NFQHIRVPGSDIDYIVNSAGSLARKVKPIEGTQFCSPEPGFSVCSIDKQELNLRMIDKKG NILYTVTRKK >gi|226332032|gb|ACIB01000024.1| GENE 41 47312 - 50074 2795 920 aa, chain - ## HITS:1 COG:CC1113 KEGG:ns NR:ns ## COG: CC1113 COG1629 # Protein_GI_number: 16125365 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Caulobacter vibrioides # 108 920 46 881 881 216 24.0 1e-55 MKRVLKLVSSILLLSVMGGHLYADEKANVVKQGTIRGRIIDTSKQTLPGASIYIENLKTG VISDVNGFYTFANLNPGTYTVKVTYVGYAPVEMKITIPEGKTLERDVILNEGVELQEIVI GGAFQGQRRAINSQKSSLGIKNVVSADQVGKFPDSNIGDALKRISGINVQYDQGEARFGQ VRGTSADLSSVTINGNRIPSAEGDTRNVQLDLIPADMIQTIEVSKVVTSDMDADAIGGSI NLVTKNSPYKRTLTATAGSGYNWISEKAQLNLGLTYGDRFFNDKLGVMLSASYQNAPSGS YDTEFLWEKDDKGNVYINDYQIRRYYVTRERQSYSAALDWDISENHKLMFKGIFNNRNDW ENRYRTTLKDMDEEGKATVRVQTKAGTPDNRNARLERQRTMDFTLGGEHLFGPVSMDWNA SYAKATEERPNERYIDFQLKKQQFDMDLSNEREPLATPKSGSTMTLNKDFGLKELTEQQE 
DIKEKDLKFSMNFKLPFRNGNKLKFGAKVVRKTKDKEVDFYEYTPKDEEAFMANSLQNTV DQTNKNFMPDHKYQAGIYADKQYVGSLDLNNPSLFDKEQVQEELAGNFEARETVSSGYIR FDQKLTDNVELMTGLRIENTSLSYTGRTYDDETDQTSKTARETNSYINFLPSLLMKWNVN EDFKVRGSFTQTLSRPKYSALVPSVNIKRSDNEVTVGNPGLKPTLSYNFDLSADYYFKSI GLVSAGVFYKKIDDFIVNQVSTNYEYNGNLYNRFIQPKNAGNANLIGMELSYQRDFGFIA PALKCIGFYGTYTFTHSRVEDFNFEGRENEKDLSLPGSPKHTANASLYFEKNGLNLRLSY NFASAFIDEMGEDTFHDRYYDRVNYLDVNASYTFAKHYTLYAEANNLLNQPLRYYQGTQD RTMQAEYYGVKINAGFKINF >gi|226332032|gb|ACIB01000024.1| GENE 42 50305 - 51951 1674 548 aa, chain + ## HITS:1 COG:BS_sacC KEGG:ns NR:ns ## COG: BS_sacC COG1621 # Protein_GI_number: 16079757 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-fructosidases (levanase/invertase) # Organism: Bacillus subtilis # 106 548 24 513 677 328 40.0 2e-89 MHLTTKLFFTATLLAGNLFASFAGDVTLKITKKYINLPVSHQVDRKKMTFEVKGTPERNF VIRLADEKPDYWVFCDVSSWKGKTLRISYEGESAGLEKIYQDDVIAGQDSLYAEKNRPQF HFTTRRGWINDPNGLVFYEGEYHLFYQHNPYEREWENMHWGHAVSRDLVHWEELPDALHP DELGTIFSGSAVIDYDNTAGFNKKNEPALVAAYTVDNPEKQRQCIAYSLDKGRTFTKYEG NPVIDSKAKWNSKDTRDPKVFWYAPGKHWVMVLNERDGHSIYNSADLKNWEYKSHVTGFW ECPELFELPVDGDKNHTKWVMYGASGTYMLGSFDGQTFTPEAGKYYYYTGSMYAAQTYSN IPASDGRRIQIGWGRISHDGMPFNGMMLLPNELTLRTTSKGVRLFSVPVRETEQLFQPVG NWTSLSSDAANQRLQAFSSKDCLRIRTTIKLSHATSAGLNLYGQPLVDYDMNSNLINGVF YSPDDRTSMELTADIYIDRTSIEVFIDGGAYSYSMKRSPREGNREGLHFWGNNIEVKNLE VFSVKSIW >gi|226332032|gb|ACIB01000024.1| GENE 43 52115 - 54871 2165 918 aa, chain + ## HITS:1 COG:SMb20671 KEGG:ns NR:ns ## COG: SMb20671 COG1879 # Protein_GI_number: 16265126 # Func_class: G Carbohydrate transport and metabolism # Function: ABC-type sugar transport system, periplasmic component # Organism: Sinorhizobium meliloti # 31 312 32 317 322 177 36.0 7e-44 MKNTNLFRLAFLLFAGLSIFLSSCQPKEEGDKKYVIGFSQCTSDSWREAVLLEMQIEASN YRNVELVVYNAMDNSSRQVSQIRKLISQNVDVLIISPNEAVPITDVAVEAYRKGIPTIIH DRKIQSDEYTVSIGANNYNIGSAIGEYINGQLPTNSKILEIWGLEGSSPAMERHDGFIDH LRSDKNFQVTQVFGKWHYNSAYDAVNRLATFADIDLVYAHNDVMALAARDVIMKRDSVSG 
KRIRFIGIDGVYGDGAGLQAVADEKLEASFQYPTGGAISIQVAMQIINGEKVKKNYVLNT AIINRGNAKTILAQSEQLNHYQKRINRQKQEEDNLLSRFKFLRNSTILILALMLLIIPLL GYVMYMNLRVKNKNKELHDKNQLVEAQKEELAVKNSQIENISNQKLQFFTNISHEIRTPL TLILGPVNKLIKNSKLDPSIQEDVALMKRNVDRLYRIVNQILDFRRIDNDKMKLILRQVD LIGMVREVFDYFTGIAEEKQIHYRFSTNIDELNIYIDVNKIEQVLVNIISNAFKYSDSGG DISVRITGEAETVLLEVEDHGRGISKESMEHLFERFYTGNKTFGTVGFGIGLNLSKEYVD LHDGEIRAESQPGEYTLFSVRLYKDIAHYTHEYILEETDRFNLSYHDMEVDTTVVNEMLS KTYDYHVLVVEDDPDVRYSLRKELSANFQVEVAGNGNEALDLLGQGDAFHLILSDVLMPG MNGFQLVNRVKNDLAFSHIPIILLTALSEDSQRIYGIAEGADEYIPKPFNIDFLKIRIIN MISERQKMKEAYMKNLRAGTMDNVEVCKLMKVDELFRDKLLSIVDTQYENSDFSIEDLSE HLGLSRVHLYRKMKTLFGVSPTDYLRNYRLNKAMLLLKARQYNISEIAYMTGFTSPAYFT KCFRTLYGVTPTEAMVAN >gi|226332032|gb|ACIB01000024.1| GENE 44 55041 - 55805 648 254 aa, chain - ## HITS:1 COG:no KEGG:BF4534 NR:ns ## KEGG: BF4534 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 254 1 254 254 471 98.0 1e-132 MKTELKGWLADNTVTTDNKEDKILVLESAGNLTLSDVLDEMKKEDTGLRAETLKHAVDLF QRTVSELVLNGYSVNTGLFRAVPQFRGVIDGGVWNSEKNSIYVSFNQDKDLRETIARTGV KILGAKGDPAYFIGGEDAATRATDGSATAGRNYRLQGKNIKVTGTDPAVGIVLIDEKGTE TKLPMDMIAVNNPSEVLVLLPADLKDGTYELRLTTQYCHSSQTMLKTPRTVSRFINIGAS QGSGDDDIVDDPTA >gi|226332032|gb|ACIB01000024.1| GENE 45 55831 - 57267 605 478 aa, chain - ## HITS:1 COG:no KEGG:BF4533 NR:ns ## KEGG: BF4533 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 478 58 533 533 813 96.0 0 DYHSSSFNDASTRSHSPLIAEWVGVKAFSPTRTGEQPDYDGPRIASMELTEDTLPRVSTR ATVPAGVYFRLIVFRKSGSEYVFQSADDYTSNGTGTPVLKQGRLLTRSGTIRVVGYSFNT TTAADLGTMPSTYAYNSSTVSIPNMSKDFMTFDSGDITNVNSLSHNLPVSFNQKLCKLTI TISPTGFPSNTITNCTGVYVKQGGNSTSWKIGPSTNVVAANTNNTAAFSPSTTLSTTIRM VPFAGARTITVHFNTLTVGGRIVNNNTEITSTQSVQLKEGKSYTLKIQFKKGPGINVLES DINLTGNGCTAQDKKDLAKLIWADGNLKSTGNSNYVWTTSTDRGYYYTWYSTYTGNTSQN NTDPCSKLNVSTYGTGWRTPSRNELTKLSRCTNKAKVNNGMWFMNSSKGLFLPLAGHTPS ASGASTGGNAVVNGNRDGNYWCTEKYYRFLFGTNGHTVSDAASGAFGCSVRCVKGTKQ Prediction of potential genes in microbial genomes Time: Tue 
May 17 22:46:18 2011 Seq name: gi|226332031|gb|ACIB01000025.1| Bacteroides sp. 3_2_5 cont1.25, whole genome shotgun sequence Length of sequence - 1259 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 303 - 338 -0.9 1 1 Tu 1 . - CDS 371 - 1258 161 ## BF4531 hypothetical protein Predicted protein(s) >gi|226332031|gb|ACIB01000025.1| GENE 1 371 - 1258 161 295 aa, chain - ## HITS:1 COG:no KEGG:BF4531 NR:ns ## KEGG: BF4531 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 295 242 534 534 479 89.0 1e-134 QFGSNTFTNCTGVYVSQGGNASAWTIGPSTNNVSANTGNTPTFNIANNSTATVRLVPFSG SRAITVHIGTLKLSNYFNANNRNITSSQNVQLLPGKSYTITLKFELGIQLAASDINLTQN GCTASDKNDLAKLRWATGNLKSTGNVNYVWASSQTEGGHFYSFNKLYDGSTGDPCSKLNT AYYGTNWRTPSKNELEKLSRCTDLVHTNGGMWFMNNRLGLFLKAAGMRLESGAGLEGTGS GTGGVYVTSTVGRNGNNCYAMEFTTKPGIYVSDDGSWWCLQINGYSVRCVKGTKQ Prediction of potential genes in microbial genomes Time: Tue May 17 22:46:25 2011 Seq name: gi|226332030|gb|ACIB01000026.1| Bacteroides sp. 3_2_5 cont1.26, whole genome shotgun sequence Length of sequence - 11489 bp Number of predicted genes - 7, with homology - 7 Number of transcription units - 2, operones - 2 average op.length - 3.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 364 - 423 5.5 1 1 Op 1 . + CDS 509 - 1882 1355 ## COG0477 Permeases of the major facilitator superfamily 2 1 Op 2 . + CDS 1906 - 3558 1540 ## BF4528 hypothetical protein 3 1 Op 3 . + CDS 3610 - 6801 2847 ## BF4527 hypothetical protein 4 1 Op 4 . + CDS 6813 - 8558 1469 ## BF4526 hypothetical protein 5 1 Op 5 . + CDS 8575 - 8982 311 ## BF4525 hypothetical protein + Term 8984 - 9037 11.5 - Term 8979 - 9021 7.9 6 2 Op 1 . - CDS 9028 - 10227 811 ## BF4524 hypothetical protein 7 2 Op 2 . 
- CDS 10236 - 11027 289 ## COG2173 D-alanyl-D-alanine dipeptidase - Prom 11201 - 11260 2.7 Predicted protein(s) >gi|226332030|gb|ACIB01000026.1| GENE 1 509 - 1882 1355 457 aa, chain + ## HITS:1 COG:BS_ywtG KEGG:ns NR:ns ## COG: BS_ywtG COG0477 # Protein_GI_number: 16080636 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Bacillus subtilis # 3 438 2 439 457 262 37.0 1e-69 MNRENKPLFLYTFITVFGGLIVGLNMAGISGAVPFLQEQFMLDDMALGLVVSILTVGCLC GALLGGGFSDRYGRQKVMFSSAVFFIVSSLGCALSGNLVSLLVFRLICGLGIGVISAVAP IYISEISPARLRGTLVSYNQLAIVIGILIAYIVDYILLDYERNWRLMLGFPFFFSVAYLL LLGILPESPRWLSARGKAGRARQVASKLNLEAGEMTVSDTNTQEGRDRIKVTELFKGNLA KVVFIGSILAALQQITGINVIINYAPSIFEMTGVAGDIALVQSILVGVVNLLFTLIAVWL VDKVGRKILLLCGSLGMGISLLYLVYTFVVPAANGIGALIAVLCYIGFFAASLAPLMWVV TSEIYPSRIRGTAMSLSTGISWLCTFLTVQFFPWILNNLGGSVAFGIFAVFSIAAFAFIL FCVPETKGKSLEAIEKELGVDKEAEENVKEEHAFSKI >gi|226332030|gb|ACIB01000026.1| GENE 2 1906 - 3558 1540 550 aa, chain + ## HITS:1 COG:no KEGG:BF4528 NR:ns ## KEGG: BF4528 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 550 1 550 550 1130 99.0 0 MKQSKRNLWWSVCAGLLFCSWGCSSRVSDKPVTLETLLDEMVSVEEQALYPVPSYTCRQE SSYDRASVSPDSAGWFANSDGFGIKRVDTVAGRIEKVMFDEVGPGAITRIWITTIDKRGT WRFYFDGSDQPGWIIPAYDLMRINVPGLGKGMLQAHTSYTPEGKGGNTLFLPIPYARGCK VTFEDEPGVNPTPKYYHINFRKYPEGTQVETFSKEVVERAAQKIAEVDNRLLQPVAGRKG EILREEKSILPSDSLVIPLPAGENAVYEVKFNIRTDNPEQYAQLMRELVFSAGFDGKQTV WVPLSDFSGGGMGAPKVDSWYLTSDGKGNISSRWLMPYQKDGVLKVLNLSSRSVAATMEV NVAPLKWNKDRSLYFYASWRQENGIYIHDKPEEADQCIEWNFATLKGRGVYKGDLLSLYN HAPLWYGEGDEKIWVDDDTFPSHFGTGTEDYYNSSWAPVVPFYTPFGGAPRADLESSHGY NAFYRTRHLDGIPFNKSFKFDIEMLGWKRGEADYATTIYWYGDPEAQVFGTSGIEEARRQ LLPAVEASAD >gi|226332030|gb|ACIB01000026.1| GENE 3 3610 - 6801 2847 1063 aa, chain + ## HITS:1 COG:no KEGG:BF4527 NR:ns ## KEGG: BF4527 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1063 1 1063 1063 2079 100.0 0 MKHERPCLVASGSDWLQEVLRRLLVWVVVLSSGTLMAYAGNLDISQTKVITGTVTEKETG DPLIGVNVSVKGTSKGTITNLNGQYSVDVNSDKDILVFSFIGKATQEKAVKGLSQIDVVM ADDATMLGEVVIQTGYMTQRKADLTGSVAMANTSDLAQNPSTNALKSLQGKLPGVFITTD GSPGGNASIQIRGLTSLNAQPNPLIVLDGLAGDYNLRDINPANIESIQVLKDAASASIYG SRAAGGVIIIETKKGKKGESKISYEGRVQFSKWVNKPDLLNTDEYGRAIWQAYANDDKLG EIKQSVRFFDYDWSYDSNGYPVLNSVKPVEWLNTAQTMRSADTNWIDEISRTGVSQNHQI SVSSGTDKSRTFFNLGYENTEGIQIETFWKKYSARLNSEYDLLNGRLKVGENLELNYMNY REANHTQLAANEPPIIPVYTETGGWGGASLDVGMDDYRNPVKDLILGKDNVNKFLKVIGN TYADLMIIKGLNLRTSFGVDYRGSYYRAVDKKWSEADGSGRDEKFNYVRNDQTHFLEYQW TNQINYNGQFGKHSIGAVVGMEFTKAESEAFYARKDGLTLEDRDYAFLSSATGDKITEAT GSGDAYALLSYFGKFNYSYASKYLASATLRYDGSSKFGADNRWAVFPAFSLGWRIKNEPF LENVDFLSDLKLRFSWGRNGNSAIPSGYLQSSYVADYNGTSYAMNGQESGSLQSGFRKYL TGNSTLKWETVTQTNYGIDFGFFNQSLVGTIDYFYKKTTDMLYLPPYIGAFGEGGDTYVN GPSMENRGVEILLTYRNSLPSGFNYSITGNIATFKNKITELPDNVRSVYGGNGMLDDIIG RPRNSIYGYVADGIFKTQEEVDNSPQQAGKGLGRIRYKDLDGDGRITQDYDRTWIGVSDP DFTYGLNLQASYKNVDLALFFQGVHGGDVWDSWIEYSDFWNIQNVNNTNHLKGVFNAWSP QNPDSNIPALSTRNTNDEKRTSTYFLKDGSYLKLRTIELGYTFPESMVKKAMISRLRAYV SANNVFTIKKWWDSNRFSGPDPEIRDFGYVIPFTATVGVNITF >gi|226332030|gb|ACIB01000026.1| GENE 4 6813 - 8558 1469 581 aa, chain + ## HITS:1 COG:no KEGG:BF4526 NR:ns ## KEGG: BF4526 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 581 1 581 581 1154 100.0 0 MKTFYSLFLSSALLLSLLVTSCADQLNINPKGVLADDLLLGKPEHIDDFVTPCYSLIPYL PWSEAHAWWMHGSIRSDDAYKGGSGVSDQSAWHDMEVFSSVTANVGNNDGPWFKGYVAIS RYNLALYALSKVTDENYPLKGVRTGEVKFLRGATYFFMKTLWRYIPWVDEENGRTVEDVT NISNRPNGTDDTYLWEHIVADLEEAVRLLPEKQEEIGRINKNAARAMAAKALLFMAYKQD ARHQVVEVDKKILERALVYINEITDQEGGNVGLCEDFAENFLPEYDNATKEAIWEIQYSI NDGTSSGGNTNNGAELNAPSWEPYFPCCDFHKMSFNMANAFRTGTDGLPLFDTFNDAEMK GRFKEYFDENSFDPRLSHTAAIPGYPYKYNPDLLYEEKASRSPGDYGYLKSVKELVPAGC DCIIVNRNSMNVKQIRYAEVLLWKAEILIQLDRHKEARPIINKLRERANNSRIRLLMADG TPYMNYKVSLYTDEAAWTKDYAWKALMFENRLETACEGRRFFDLQRWGILEPTMNAYFKK 
EKTRFSWMNNAVFVAGRDEYKPIPQQQMNWAKGNYIQNPGY >gi|226332030|gb|ACIB01000026.1| GENE 5 8575 - 8982 311 135 aa, chain + ## HITS:1 COG:no KEGG:BF4525 NR:ns ## KEGG: BF4525 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 135 1 135 135 261 100.0 4e-69 MKRILYFFSVMLCILAVTGCQDRDIIDFKDGVSLPPVTDLKSSLTPDNDAVLEWKLPSAI PEEIQRPLSVYVQVYKGAVLEHQISLEGEPTSWEYTLKEPESKYRIVVKVQGMLKEKPYG QSDEIYSLGQTVSIN >gi|226332030|gb|ACIB01000026.1| GENE 6 9028 - 10227 811 399 aa, chain - ## HITS:1 COG:no KEGG:BF4524 NR:ns ## KEGG: BF4524 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 399 1 399 399 821 99.0 0 MYLSPETILFIREHRNDDVHSLALQAKRYPQVDMPLAIVQIAGWQATVSKIPSWHATEGL LYPRHLSLEQCSSEVTALYKASLVHGEGLVDLTGGFGIDCAFLATQFKTVTYIERQEELC ELAMHNFPLLGLKHIRVQNGDGVEYLQQMPAVDCIFLDPARRNEHGGKTVAISDCEPNVA TLEKLLLEKGKQVMIKLSPMLDLSLAIRDMQHVSEAHIVSVNNECKELLLLLRPGDDSPE IPSAMSQPIVCINFANQEIQRFVFTRESEQATECSYTHEIGTYLYEPNASILKAGAFRSI ASSFHLSKLHANSHLYTSNERIEKFPGRIFRITGYSSLNKKELKNILNGLDKANITTRNF PQSVAELRKRLKLTDGGDIYLFATTLNDERKIIIRCEKA >gi|226332030|gb|ACIB01000026.1| GENE 7 10236 - 11027 289 263 aa, chain - ## HITS:1 COG:ECs2092 KEGG:ns NR:ns ## COG: ECs2092 COG2173 # Protein_GI_number: 15831346 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: D-alanyl-D-alanine dipeptidase # Organism: Escherichia coli O157:H7 # 70 249 3 168 193 92 33.0 9e-19 MFLYFCIHIQKRNYMRLGLLTAFILLLSFTTSCQYKEAQPHPAFKDSIVLVTPVEPQPIA SKSQTAFYLDSIGLMNIAELDSTFIIRLMYATPDNFTGQLLYTDLKEAYLHPDAAKALLE AHLLLKAQYPSYRLIIYDAARPMSVQKKMWDMVKGTSKYMYVSNPSRGGGLHNYGLAVDI SIADSLGHPLPMGTEVDYMDAASHITNEAKLVREGKITQQERENRILLRQVMKSAGFRAL PSEWWHFNLCSRDEAKQKYKLIN Prediction of potential genes in microbial genomes Time: Tue May 17 22:47:07 2011 Seq name: gi|226332029|gb|ACIB01000027.1| Bacteroides sp. 
3_2_5 cont1.27, whole genome shotgun sequence Length of sequence - 32292 bp Number of predicted genes - 31, with homology - 31 Number of transcription units - 16, operones - 9 average op.length - 2.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 45 - 104 8.8 1 1 Tu 1 . + CDS 278 - 754 20 ## BF2751 hypothetical protein 2 2 Tu 1 . - CDS 1813 - 2964 1373 ## COG1820 N-acetylglucosamine-6-phosphate deacetylase - Prom 3120 - 3179 6.0 + Prom 3003 - 3062 9.2 3 3 Op 1 40/0.000 + CDS 3121 - 3798 863 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 4 3 Op 2 . + CDS 3816 - 5114 1118 ## COG0642 Signal transduction histidine kinase + Prom 5210 - 5269 6.0 5 4 Op 1 . + CDS 5433 - 5870 460 ## BF2738 putative periplasmic protein 6 4 Op 2 . + CDS 5886 - 6725 862 ## BF2739 hypothetical protein + Term 6730 - 6785 16.0 - Term 6718 - 6773 16.0 7 5 Op 1 . - CDS 6780 - 7961 847 ## BF2740 clostripain-related protein - Prom 7982 - 8041 4.8 - Term 7986 - 8029 1.9 8 5 Op 2 . - CDS 8052 - 8453 425 ## BF2758 hypothetical protein - Prom 8473 - 8532 5.7 9 6 Op 1 . - CDS 8554 - 9414 833 ## COG0568 DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 10 6 Op 2 . - CDS 9449 - 9646 72 ## BF2743 hypothetical protein 11 6 Op 3 . - CDS 9746 - 11293 1799 ## COG0265 Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain - Prom 11317 - 11376 6.8 - Term 11492 - 11564 9.8 12 7 Op 1 1/0.000 - CDS 11599 - 12933 1272 ## COG1305 Transglutaminase-like enzymes, putative cysteine proteases 13 7 Op 2 3/0.000 - CDS 12965 - 14116 1427 ## COG4948 L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 14 7 Op 3 . - CDS 14130 - 15332 989 ## COG0791 Cell wall-associated hydrolases (invasion-associated proteins) - Prom 15376 - 15435 5.4 + Prom 15990 - 16049 5.8 15 8 Tu 1 . + CDS 16081 - 17400 1090 ## COG1295 Predicted membrane protein 16 9 Op 1 . 
- CDS 17645 - 18184 512 ## COG0778 Nitroreductase 17 9 Op 2 . - CDS 18215 - 18817 657 ## COG0307 Riboflavin synthase alpha chain - Term 19117 - 19156 6.0 18 10 Op 1 32/0.000 - CDS 19215 - 19904 1048 ## COG0704 Phosphate uptake regulator 19 10 Op 2 41/0.000 - CDS 19980 - 20741 228 ## PROTEIN SUPPORTED gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P 20 10 Op 3 38/0.000 - CDS 20777 - 21652 902 ## COG0581 ABC-type phosphate transport system, permease component 21 10 Op 4 . - CDS 21654 - 22850 1146 ## COG0573 ABC-type phosphate transport system, permease component - Prom 23075 - 23134 5.7 + Prom 23029 - 23088 3.3 22 11 Op 1 . + CDS 23155 - 23967 979 ## COG0226 ABC-type phosphate transport system, periplasmic component 23 11 Op 2 . + CDS 24038 - 25777 1829 ## COG0008 Glutamyl- and glutaminyl-tRNA synthetases + Prom 25797 - 25856 2.5 24 11 Op 3 . + CDS 25876 - 27312 1592 ## BF2774 hypothetical protein + Prom 27342 - 27401 1.8 25 12 Op 1 . + CDS 27445 - 28086 666 ## COG0586 Uncharacterized membrane-associated protein 26 12 Op 2 . + CDS 28165 - 28365 71 ## BF2761 hypothetical protein + Prom 28370 - 28429 2.6 27 12 Op 3 . + CDS 28449 - 29066 669 ## BF2776 hypothetical protein + Term 29112 - 29139 0.1 - Term 29135 - 29180 8.1 28 13 Tu 1 . - CDS 29214 - 29714 608 ## COG2077 Peroxiredoxin - Prom 29787 - 29846 8.1 + Prom 29694 - 29753 4.5 29 14 Tu 1 . + CDS 29808 - 30389 713 ## BF2764 hypothetical protein 30 15 Tu 1 . - CDS 30528 - 31121 609 ## COG1961 Site-specific recombinases, DNA invertase Pin homologs - Prom 31307 - 31366 6.5 + Prom 31266 - 31325 5.8 31 16 Tu 1 . 
+ CDS 31356 - 32276 810 ## BF2780 putative recombinase/integrase Predicted protein(s) >gi|226332029|gb|ACIB01000027.1| GENE 1 278 - 754 20 158 aa, chain + ## HITS:1 COG:no KEGG:BF2751 NR:ns ## KEGG: BF2751 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 158 1 158 158 184 99.0 9e-46 MEKEKAGEEKVHIIILYRGNLSGRGKRGIVAKGRERRCRREGNLPRKGGKGAFRGRERDV SGKGKRRFGEENRGGNWDRERDKLKSRRRIEGRWTGIERKLSRDQTGIKRKSNKNGTESC VNIVNAIGETFISEMERTKLLLLKDLFFEDKRLVLTVR >gi|226332029|gb|ACIB01000027.1| GENE 2 1813 - 2964 1373 383 aa, chain - ## HITS:1 COG:lin2213 KEGG:ns NR:ns ## COG: lin2213 COG1820 # Protein_GI_number: 16801278 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetylglucosamine-6-phosphate deacetylase # Organism: Listeria innocua # 44 378 46 378 380 189 32.0 7e-48 MLTQIINGRIFTPQGWLNEGSVLMRDGKILEVTNCDLALIGANLVDARGMYIVPGFVCMH AHGGGGHDFTECTEEAFRTAIAAHMKHGATSFFPTLSSSPFSEIRKAVDICEKLMAEPDS PILGLHVEGPYLNRKMAGEQFANQVKEVDVAEYTSLLESTDCIKRWDASPELPGALDFAR YLKSKGIVGAVSHTEAEYDGIKEAYEAGFTHAAHFYNAMPGFHKRREYKYEGTVESVYLT DGMTIELIADGIHLPSTILKLAYKLKGVEHTCLVTDALSYAAAEGKAIDDPRIIIEDGVC KLADRSALAGSIATMDQLVRTMVKADIPLADAIRMASETPARIMGVYDRKGSLQKDKDAD ILILDRDLNVKAVWAMGQLVKES >gi|226332029|gb|ACIB01000027.1| GENE 3 3121 - 3798 863 225 aa, chain + ## HITS:1 COG:BH3157 KEGG:ns NR:ns ## COG: BH3157 COG0745 # Protein_GI_number: 15615719 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Bacillus halodurans # 2 222 4 232 239 140 34.0 2e-33 MKILIIEDEPSLRELIQRSLEKERYVVEAAADFQSGLRKIEDYDYDCVLLDIMLPDGNGL NLLEQLKKMRKRENVIIISAKDSLDDKVLGLELGADDYLPKPFHLAELNARIKSVIRRQR RDGEMDIRLANIRIVPDTFQVFVDDKEIELNRKEYDILLYFANRPGRLVNKNTLAESVWG DHIDQVDNFDFIYAQIKNLRKKLKDAGALAELKAVYGFGYKMTVE >gi|226332029|gb|ACIB01000027.1| GENE 4 3816 - 5114 1118 432 aa, chain + ## HITS:1 COG:mll7952 KEGG:ns NR:ns ## COG: mll7952 COG0642 # 
Protein_GI_number: 13476585 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 160 400 178 423 452 95 28.0 2e-19 MKLIYRIIIRIALMLTVVLGAWAVFFYIAVIDEVNDEVDDSLEDYSETIIIRALAGEELP SKNNGSNNQYYLRKVSKEYADEREDICYKDSMVYIVEKEETEPARILTTLFKDDEGQYYE LTVSTPTIEKDDLKDAMLGWIIFLYIVLLLTILIVCIWIFHRSMKPLYNLLRWLDAYRIG RPNRPLKNETQITEFRKLNEAAIRNAERSEHIFEQQKQFIGNASHEMQTPLAICRNRLEM LMEDDSLSEAQLEELIKTHQTLEHITKLNKSLLLLSKIDNGQFSDTRTVEFNSMLKRYIE DYKEVYGYREIELTLDEQGIFRAEMNESLAVALITNLLKNAFVHNVDGGHIRIEITGHSM TFRNSGAGRPLDATHIFERFYQGSKKEGSTGLGLAIADSICKLQHLTLRYYFEKDEHCFE LRKNNFTTDYAD >gi|226332029|gb|ACIB01000027.1| GENE 5 5433 - 5870 460 145 aa, chain + ## HITS:1 COG:no KEGG:BF2738 NR:ns ## KEGG: BF2738 # Name: not_defined # Def: putative periplasmic protein # Organism: B.fragilis # Pathway: not_defined # 1 145 1 145 145 275 100.0 3e-73 MKKLLLLFVCLFTLQTIARADDDKPIQVSQMPQKAQQFIKQHFAGSNIAMAKVESDFLQK SYDVIFTDGNKVEFDKKGNWTEVNCKFSVVPQGIIPSPIQKYTATNYPDAKVLKIERDKT DYEVKLSNGWELKFDSKFNLIDIDN >gi|226332029|gb|ACIB01000027.1| GENE 6 5886 - 6725 862 279 aa, chain + ## HITS:1 COG:no KEGG:BF2739 NR:ns ## KEGG: BF2739 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 279 1 278 278 477 98.0 1e-133 MKLKMYFLLLALGALGLQSCNDDDDHLSSVPTELKNAFTEKYPSVNNEKWETKGNYYIAE FRQQNYETSAWFTPNGVWQMTETDLPYQALPAAVKSAFESSEYAKWKVDDVDMLERPDME KVYVIEVESGKQEFDLYYSEEGILVKSVADTDNDSENYLPAEIPAAIETFIKKQYPNAHL VEIEVEHGMTEVDIIDGNISKEIVFNSSNEWISTSWDVRRNELPETVTHAIASSEKYAGY QIDDADFVETPDEGEYYLVELEKGELEVKVKVNAEGEFI >gi|226332029|gb|ACIB01000027.1| GENE 7 6780 - 7961 847 393 aa, chain - ## HITS:1 COG:no KEGG:BF2740 NR:ns ## KEGG: BF2740 # Name: not_defined # Def: clostripain-related protein # Organism: B.fragilis # Pathway: not_defined # 1 393 1 393 393 791 99.0 0 MKLKITLYSVCLLMLLAACQQDGPTPEPSVGSRTVLVYMIAQNSLAPLASADIEEMKEGM RQVDATSGNLLVYIDDYSAPRLIRLGKDKKGKVVEETIENYPEQNSADANVMKKVISTAF 
NQYKAEKYGMVFWSHGEGWIPSPAKTRWFGQDGNNYMDIADLHAALQVAPDLDFLFFDAC FMEAVEVAYALRDCGSYLISSPTEIPGPGAPYQTVVPAMFSAENAVLKIASCYYDYYQSR YNDGIGMSNEDWTGGVSVGVAKMSELENLAVATSKVLPRYITGKQNFDLSGVMCYDRRTD KQYYYDLDRFIYQITAGNGDYDSWREAFDKVMVYWKSTPRNYSAYAGMFTMNQDAKGLST YIPRMSAPSLNTSYQQTEWYKVSGWADTGWYKN >gi|226332029|gb|ACIB01000027.1| GENE 8 8052 - 8453 425 133 aa, chain - ## HITS:1 COG:no KEGG:BF2758 NR:ns ## KEGG: BF2758 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 133 1 133 133 248 100.0 4e-65 MKYVKILFAVALVFTLCSAFSLKKGGHKPVYAFGVSASFTDTVIYYTEIQMLDSVALDKN GFLPHRELYSYQLKNYLEFDKGLPNRTCMIYFSENKKKLGKEAAKVVGKFKKNKTVAVEK IDPQNFRFSKPEE >gi|226332029|gb|ACIB01000027.1| GENE 9 8554 - 9414 833 286 aa, chain - ## HITS:1 COG:lin1491 KEGG:ns NR:ns ## COG: lin1491 COG0568 # Protein_GI_number: 16800559 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) # Organism: Listeria innocua # 21 285 108 373 374 223 45.0 4e-58 MRQLKITKSITNRESASLDKYLQEIGREDLITVEEEVELAQRIRKGDRVALEKLTRANLR FVVSVAKQYQNQGLSLPDLINEGNLGLIKAAEKFDETRGFKFISYAVWWIRQSILQALAE QSRIVRLPLNQVGSLNKISKAFSKFEQENERRPSPEELAGELDIPVDKISDTLKVSGRHI SVDAPFVEGEDNSLLDVLVNDDSPMADRSLVNESLAREIDRALSTLTDREKEIIQMFFGI GQQEMTLEEIGDKFGLTRERVRQIKEKAIRRLRQSNRSKLLKSYLG >gi|226332029|gb|ACIB01000027.1| GENE 10 9449 - 9646 72 65 aa, chain - ## HITS:1 COG:no KEGG:BF2743 NR:ns ## KEGG: BF2743 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 65 28 92 92 124 98.0 8e-28 MLLCVLCGAYGHTSESPAEKVFGSLIIAKEIEKRVPAINQSVLFLFILVDNNGEKVVKIY TLTNK >gi|226332029|gb|ACIB01000027.1| GENE 11 9746 - 11293 1799 515 aa, chain - ## HITS:1 COG:PA0766 KEGG:ns NR:ns ## COG: PA0766 COG0265 # Protein_GI_number: 15595963 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain # Organism: Pseudomonas aeruginosa # 60 489 25 452 474 
232 34.0 1e-60 MKQTTKNILGIGAVVLLSAGVAGVTTYTMLKPENRDSLSFNEQFRQNPGARLAAYDAINA QPVDLTQAAENSLHAVVHIKSTQQAKEQTVTVRDPFAEIFGDIFGNGGRQQRRVQTQPRV GFGSGVIISKDGYIVTNNHVIDGADEIIVKLNDNREFKGRMIGTDPNSDLALVKIEGDDF PTIPVGDSDALKVGEWVLAVGNPFNLTSTVTAGIVSAKARTLGVYGIGGVESFIQTDAAI NQGNSGGALVNAKGELVGINAVLSSPTGAYAGYGFAIPTSVMTKVVSDLKQYGTVQRALL GIKGTSLAGDGDMMSDQPIDKSGATLSDKRKEFGVVDGVWVREIVDGGSAAGSDIKVDDV IIGIDGKKVQNFADLQEAIAQHRPGDKVTVKVMRDKKEKNINITLKNEQGTTKIVKDAGM EILGAAFKELPDDLKKQLNLGYGLQVTGVTSGKMADAGVRKGFIILKANDQPMRKVSDLE EVMKAAVKSPNQVLFLTGVFPSGKRGYYAVDLTQE >gi|226332029|gb|ACIB01000027.1| GENE 12 11599 - 12933 1272 444 aa, chain - ## HITS:1 COG:TM0007 KEGG:ns NR:ns ## COG: TM0007 COG1305 # Protein_GI_number: 15642782 # Func_class: E Amino acid transport and metabolism # Function: Transglutaminase-like enzymes, putative cysteine proteases # Organism: Thermotoga maritima # 34 430 52 426 438 193 31.0 6e-49 MRKLLYIVMIFIGVSACTSRQESKPYDWDDDLHQRLLADFCLTESQVKDYIRKYIPDVTD EQMRQWEESKALECRVIDGEKRYFRNAGPNLFRIDSACCAVKIEKEGTSLSTSEQVNKEH LPEVMAAVRKEKTPVVQPKRMRVTYTLTVDSNAVPAGKMIRCWLPYPRTDQPRQQNVKLL HVSEPRYTLSPPSCRHSTLYMEKQAIPGEPTVFSETFEYTSCAEWHPLKPGNILPYDTAG ALYKEYTAERETHIRFTPRIKELAANLTVGETNPLLKAQRIFRWINDHFPWASAREYSTI ENIPEYVLDNRHGDCGQVSLLFITLCRCSGIPARFQSGFMMHPRAWNLHDWAEVYFEGAG WVPVDQSFGISTFADNPEEKMFFMGGIDSWRMIVNSDYSMPLVPEKKYPRSETVDFQRGE VEWEGGNLYFPQWSYHMDIDYLNY >gi|226332029|gb|ACIB01000027.1| GENE 13 12965 - 14116 1427 383 aa, chain - ## HITS:1 COG:all3532 KEGG:ns NR:ns ## COG: all3532 COG4948 # Protein_GI_number: 17231024 # Func_class: M Cell wall/membrane/envelope biogenesis; R General function prediction only # Function: L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily # Organism: Nostoc sp. 
PCC 7120 # 46 380 1 343 350 207 36.0 3e-53 MQNRRDFLKTAAFAAIGSGLSLQGAFAGEKAPVSFAINQLGLGAKMKLRFFPYELKLKHV FTVATYSRTTTPDVQVEIEYDGVIGYGEASMPPYLGQTVDSVMGFLKKVDLEQFDDPFRL EDILAYVDGLTPGDTAAKAAIDIALHDLVGKLLGAPWYRIWGLDKAKAPSTTFTIGIDTP EVVREKTLEVAGQFNILKVKLGRENDKQMIETIRSVSDLPIAVDANQGWTDKKYALDMIQ WLKEKGIVMIEQPMPKTQLDDIAWVTQHSPLPVFADESLQRLSDVAGLKGAFTGINIKLM KCTGMREAWKMVTLARALGMKVMVGCMTETSCAVSAAAQFSPAVDFADLDGNLLIANDRF KGMEVVKGKITLNDLPGIGVAKI >gi|226332029|gb|ACIB01000027.1| GENE 14 14130 - 15332 989 400 aa, chain - ## HITS:1 COG:BH3007 KEGG:ns NR:ns ## COG: BH3007 COG0791 # Protein_GI_number: 15615569 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell wall-associated hydrolases (invasion-associated proteins) # Organism: Bacillus halodurans # 116 350 71 306 336 90 27.0 5e-18 MKNMIPFVSLLLAVSACTGIPENKKNALPQDVERLSGEVRDRFIPDKRVILWDVDYDVSG KTITVKGATTSPEAKAALLSGLEEKAYEVKDSLQLVPDSAALEGKMYGIVNLSVCNMRVE DDFSSEMTTQALMGMPVKVLQHRNWYRIQTPDNYIAWVHRVGIHPVTKAGLDAWNKADKI VVTSHYGFTYQQPDAKSQSVSDVVAGNRLKYEGKQGGFYKVSYPDGRQAYISQSISMPEK EWRASLKQDASSIIRTAYTMMGIPYLWAGTSSKGVDCSGFVRTVLFMHDIIIPRDASQQA YVGEHIDIAPDFGNVQPGDLIFFGRKATAEKRERVVHVAIYLGDKKFIHSQGDVHVSSFD PADADFDEYNLNRLLYAVRVLPSIDKEETLNTTVTNPYYN >gi|226332029|gb|ACIB01000027.1| GENE 15 16081 - 17400 1090 439 aa, chain + ## HITS:1 COG:FN1154 KEGG:ns NR:ns ## COG: FN1154 COG1295 # Protein_GI_number: 19704489 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 55 335 19 294 396 159 33.0 9e-39 MNQRLSDIWKFITYDIWRITESEVTRTKFSIYNIIKTIYLCVNRFNKDRIVNKASALTYS TLLAIVPILAIVFAIARGFGVSTLMESQFRDGFGGSTEATDIILQFVDSYLSQTKNGIFI GVGLVMLLWTVLNLVSNIEITFNRIWQVKKGRSMYRKITDYFSMFLLMPILIVVSGGLSI FVGTMLKSMADFVLLAPILKFLIRLIPFVLTWLMFTGLYIFMPNTKVKFKHALISGILAG SAYQAFQFLYISSQLWVSKYNAIYGSFAALPMFLLWLQISWTICLFGAELTYAGQNIRNF SFDRDTQNISRRYRDFISILIMSLIAKRFENNETPYTAEEISEEHRIPIRLTNQILYQLQ EIHLIHEVVTDQKSEDIAYQPSIDINQLNVALLLDRLDTYGSEDFKVDKDEEFSEQWKVL LDSREEYYKKASKVLLKDL >gi|226332029|gb|ACIB01000027.1| GENE 16 
17645 - 18184 512 179 aa, chain - ## HITS:1 COG:CAC2311 KEGG:ns NR:ns ## COG: CAC2311 COG0778 # Protein_GI_number: 15895578 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Clostridium acetobutylicum # 3 150 2 145 187 75 32.0 7e-14 MNDFLQLVNARQSDRAYDKSRLVEADKLERILEAGRLAPSACNAQPWRFVVVTDPSLAEK VGKAAAGLGMNKFAKDAPVHILVVEESANITSRLGGKLKGKHFPLIDIGIVAAHIVLAAE SEGLGSCILGWFDEKEIKSLTGIPSSKRVLLDILIGYPVKEKRKKIRKESGKIISYNSY >gi|226332029|gb|ACIB01000027.1| GENE 17 18215 - 18817 657 200 aa, chain - ## HITS:1 COG:L0164 KEGG:ns NR:ns ## COG: L0164 COG0307 # Protein_GI_number: 15672976 # Func_class: H Coenzyme transport and metabolism # Function: Riboflavin synthase alpha chain # Organism: Lactococcus lactis # 1 196 1 192 216 149 40.0 3e-36 MFSGIVEEYATVVALVKDQENIHFTLKCSFVNELKIDQSISHNGVCLTVVSMTEDTYTVT AMKETLDRSNLRLLKVGDKVNVERSMMMNGRLDGHIVQGHVDQTAECIDIKDADGSWYFT FKYAFDKEMAKRGYITVDKGSVTVNGVSLTVCNPTDDTFQVAIIPYTYEHTNFHTFGKGS VVNLEFDIIGKYISRMIQYK >gi|226332029|gb|ACIB01000027.1| GENE 18 19215 - 19904 1048 229 aa, chain - ## HITS:1 COG:VC0727 KEGG:ns NR:ns ## COG: VC0727 COG0704 # Protein_GI_number: 15640746 # Func_class: P Inorganic ion transport and metabolism # Function: Phosphate uptake regulator # Organism: Vibrio cholerae # 8 220 14 223 236 102 31.0 5e-22 MVKFIESELVLLKKEIDEMWTLVYNQLDRAGEAVLTLDKELAQQVIVRERRVNAFELKID SDVEDVIALYNPVAIDLRFVLAMLKINTNLERLGDFAEGIARFVVKSEEPVLDEELLKRL RLEEMQKQVLSMLEVAKRALNEESLELATSVFAKDNLLDEINAEATAVLAEYIKEHPEST LSCLNLVGVFRKLERSGDHITNIAEEIVFFIDAKVLKHSGKVEEHYPAK >gi|226332029|gb|ACIB01000027.1| GENE 19 19980 - 20741 228 253 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P [Thermanaerovibrio acidaminovorans DSM 6589] # 1 238 124 354 398 92 29 3e-18 MDTTVKIDARDVNFWYGDFHALKGISMEIEEKSVVAFIGPSGCGKSTFLRLFNRMNDLIP ATRLTGEIRIDGENIYDKGVQVDELRKNVGMVFQRPNPFPKSIFENVAYGLRVNGVKDNA FIRQRVEETLKGAALWDEVKDKLKESAFALSGGQQQRLCIARAMAVSPSVLLMDEPASAL 
DPISTAKVEELIHELKERYTIVIVTHNMQQAARVSDKTAFFYMGQMVEFGDTKKIFTNPE KEATQNYITGRFG >gi|226332029|gb|ACIB01000027.1| GENE 20 20777 - 21652 902 291 aa, chain - ## HITS:1 COG:MA0889 KEGG:ns NR:ns ## COG: MA0889 COG0581 # Protein_GI_number: 20089773 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type phosphate transport system, permease component # Organism: Methanosarcina acetivorans str.C2A # 11 290 28 307 307 266 51.0 3e-71 MEILNNTKAKRRSQGIAFGIFRLLSLCIVLILFAILGFIIYKGIGVISWDFLTTAPTDGM TGGGIWPAIVGTFYLMVGSALFAFPVGVMSGIYMNEYAPKGKLVRFIRVMTNNLSGIPSI VFGLFGMALFVNYMDFGDSILAGSLTLGLLCVPLVIRTTEEALKAIPDSMREGSRALGAT KLQTIWHVILPMGMPNIITGLILALGRVSGETAPILFTCAAYFLPQLPTSILDQCMALPY HLYVISTSGTDMEAQLPLAYGTALVLIVIILLVNLLANALRKYFEKKVKMN >gi|226332029|gb|ACIB01000027.1| GENE 21 21654 - 22850 1146 398 aa, chain - ## HITS:1 COG:MA0888 KEGG:ns NR:ns ## COG: MA0888 COG0573 # Protein_GI_number: 20089772 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type phosphate transport system, permease component # Organism: Methanosarcina acetivorans str.C2A # 109 392 8 290 296 253 49.0 6e-67 MKKVFERIIEGILTCSGFVTSITILLIVLFLFTEAFGLFSSKVIEEGYVLALNKDNKVSE LTPMQIKDVFDEEITNWKELGGEDLPIRVFRLEDITHYYSEEELGEAYENAGEKIMELIQ KTPGIVAFIPQKFVVRPDAVHFIKDNTISVKEVFAGAEWFPTATPAPLFGFLPLITGTLW VSLFAILIALPFGLSVSIYMSEVADSKVRSWLKPVIELLSGIPSVVYGFFGLIVIVPLIQ KVFDLPVGESGLAGSIVLAIMALPTIITVTEDAMRNCPRAMREASLALGASQWQTIYKVV IPYSISGITSGVVLGIGRAIGETMAVLMVTGNAAVIPTTILEPLRTIPATIAAELGEAPA GGPHYQALFLLGVVLFFITLIINFSVEYISSKGVKRSK >gi|226332029|gb|ACIB01000027.1| GENE 22 23155 - 23967 979 270 aa, chain + ## HITS:1 COG:MA0887 KEGG:ns NR:ns ## COG: MA0887 COG0226 # Protein_GI_number: 20089771 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type phosphate transport system, periplasmic component # Organism: Methanosarcina acetivorans str.C2A # 23 269 70 315 317 199 47.0 5e-51 MKVRTILFAALSLVALHANAQRIKGSDTVLPVAQQTAERFMNREPDARVTVTGGGTGVGI 
SALMDNTTDIAMASRPIKFSEKMKAKAAKRDIDEVIVAYDALAVVVHPSNPVKKLTRRQL EDIFRGKITNWKQVGGDDRKIVVYSRETSSGTYEFFKESVLKNKNYMSSSLSMPATGAII QSVSQTKGAIGYVGLAYVSPRIKTLSISYDGEHYATPTVENATNKTYPIVRPLYYYYDAK NKTQIAPLLEFILSPEGQDIIKKSGYIPVK >gi|226332029|gb|ACIB01000027.1| GENE 23 24038 - 25777 1829 579 aa, chain + ## HITS:1 COG:VC0997 KEGG:ns NR:ns ## COG: VC0997 COG0008 # Protein_GI_number: 15641012 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Glutamyl- and glutaminyl-tRNA synthetases # Organism: Vibrio cholerae # 10 579 4 555 556 590 52.0 1e-168 MTDIKNEEAGEKKSLNFIEQAVENDLKAGKNGGKVQTRFPPEPNGYLHIGHAKAICLDFG IAAAHGGVCNLRFDDTNPTKEDMEYVEAIQEDIRWLGFQWGNVYYASDYFQQLWDFAVTL IKEGKAYVDEQTSEQIAQQKGTPTQPGVESPYRNRPIEESLALFEKMNSDEAKEGSMVLR AKIDMASPNMHFRDPIMYRILHVAHHRTGTQWKAYPMYDFAHGQSDYFEGVTHSLCTLEF VPHRPLYDLFIDWLKEGKDLDDNRPRQTEFNKLNLNYTLMSKRNLLILVKEGLVNDWDDP RMPTLCGFRRRGYSPESIRKFIDKIGYTTYDALNDFALLESAVREDLNARATRVSAVLNP VKLIITNYPEGQVEELEAINNPEDPTAGSHTIEFSRELWMERDDFMEDAPKKYFRMTPGQ EVRLKNAYIVKCTGCKKDENGTVTEVYCEYDPNTRSGMPDANRKVKGTLHWLSCNHCLPA EVRLYDRLWKVENPRDEMAAIREAKGCDALEAMKEMINPDSLTVLPHCYIEKYVADMPAL SYLQFQRIGYFNIDKDSTPGHLVFNRTVGLKDTWGKINK >gi|226332029|gb|ACIB01000027.1| GENE 24 25876 - 27312 1592 478 aa, chain + ## HITS:1 COG:no KEGG:BF2774 NR:ns ## KEGG: BF2774 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 478 1 478 478 889 100.0 0 MGRKNSPSERERELQNLIAQYEAVKAKNESLYLDGDQLADIADLYASERKFKEAQEVITY GLGLHPGHTDLMVEQAYLFLDLNQPQKAKEVAELITDTYSSNVKLLLAELLLNEGKLDAA DQMLDSIEEEEKNDLGILVDIVYLYTDLGYPEKGVQWLKRGAEMYKDDEDFLAATADCYG AAGAEYIEQAIVVFNKLIDKNPYNPAYWVGLAKCQFATKDFDKAIESCDFAIAADEEFGE AHIIKAHSLFHLENIEGAIVEYRKALKYKTLSPEFTYMFIGLAYTQQENWAEANESYSMA LRAIEENGNGSSPLLSDIYSNKALCASRQGDSEEAHRLCRLAKELAPQDAEPYLLEGRIY MEEDNFDLARAEWALALRYAPEADTWMEIGNYSLEFRMLENARFCFEQVLEEDPEYPKIC EQLAAVCLVLQDHEGFKKYNAMSGDSINLDSLRDTILEMGVDGEQMLRELDDFLKDEK >gi|226332029|gb|ACIB01000027.1| GENE 25 27445 - 28086 666 213 aa, chain + ## HITS:1 COG:Cj1168c KEGG:ns 
NR:ns ## COG: Cj1168c COG0586 # Protein_GI_number: 15792492 # Func_class: S Function unknown # Function: Uncharacterized membrane-associated protein # Organism: Campylobacter jejuni # 2 166 3 161 200 136 48.0 3e-32 MESVAFIQWCLDHLNYWTITLLMTIESSFIPFPSEVVVPPAAYKAAVNEELNIYLVVLFA TLGANLGAIINYYLARWLGRPIVYKFANSRFGHMCLIDEAKVQHAEEYFDKHGALSTFIG RLIPAVRQLISIPAGLARMKLHTFLIYTTLGAGLWNTILAAIGYYLSTVPGIESEEQLLA KVTEYSHELGYCFIVIGVFIVGFLVYKGMKKKK >gi|226332029|gb|ACIB01000027.1| GENE 26 28165 - 28365 71 66 aa, chain + ## HITS:1 COG:no KEGG:BF2761 NR:ns ## KEGG: BF2761 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 66 1 66 66 84 100.0 9e-16 MVVNSKNFVLLCITLYSLLLRETLWYSVVKLRLHPSLKKNLSFFSSSMQYLYIFCIYLLY VHQTSY >gi|226332029|gb|ACIB01000027.1| GENE 27 28449 - 29066 669 205 aa, chain + ## HITS:1 COG:no KEGG:BF2776 NR:ns ## KEGG: BF2776 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 205 1 205 205 352 99.0 4e-96 MKKDFNLTKLFYSFAIAFSVVTLSSCNNDDNSPLPPPSTNDVAGTYNGKVLITPVTPATV KENAGEAPQGQDVNATVKNDTVFFDKLPVTELITSIVGDKDKAEAIVKAIGDVKYKVGYK PALNTEKDSIYLAFDPKPLTLQLPAAVEGQEGQTVTVTISSPDKGSFAYKKNQLKLKLSA DKVELAGVAVPVPQTLFNFDMTKKK >gi|226332029|gb|ACIB01000027.1| GENE 28 29214 - 29714 608 166 aa, chain - ## HITS:1 COG:YPO2342 KEGG:ns NR:ns ## COG: YPO2342 COG2077 # Protein_GI_number: 16122566 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peroxiredoxin # Organism: Yersinia pestis # 3 166 4 167 167 187 58.0 7e-48 MATTNFKGQPVKLIGEFIQVGKVAPDFELVKSDLSSFALKDLKGKNIVLNIFPSLDTGVC ATSVRKFNKMAAGMKDTVVLAISKDLPFAQGRFCTTEGIENVIPLSDFRFSDFDESYGVR MADGPLAGLLARAVVVIGKDGKVAYTELVPEITQEPDYEKALAAVK >gi|226332029|gb|ACIB01000027.1| GENE 29 29808 - 30389 713 193 aa, chain + ## HITS:1 COG:no KEGG:BF2764 NR:ns ## KEGG: BF2764 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 193 1 193 193 306 100.0 3e-82 
MKTLFDEMEHAVKNWWLSLILGILYIIVALCLLFAPGSSYIALSVIFSISMLISGIIEII FSISNRRGISSWGWYLAGGIIDLILGIYLVAYPLLSMEVIPFIVAFWMMFRGFSATGYSM DLKRYGTREWGWYMGFGILAIICSLIILWQPAVGALYVIYMLAFTFLIIGFFRVMLSFEL KSLHKRSTVMNGK >gi|226332029|gb|ACIB01000027.1| GENE 30 30528 - 31121 609 197 aa, chain - ## HITS:1 COG:ECs5249 KEGG:ns NR:ns ## COG: ECs5249 COG1961 # Protein_GI_number: 15834503 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinases, DNA invertase Pin homologs # Organism: Escherichia coli O157:H7 # 2 197 5 191 191 82 30.0 4e-16 MVIAYLRVSTEKQFLANQKEEIMRFAEKNGLSIDKWYTETVSGSVSTKDRKLSELLKRMH PGDTLIVTEISRLSRTLLEIMTILNFCIKKQVVLYSTKEGYVFQDDINSKVLGFAFGLMA EIERNLISMRTKEALARRKQEGMTLGRKKGDTPKIKLLRANKRVLTKELDKGTTYSELAE KMGVSRTTLFRFMKTMY >gi|226332029|gb|ACIB01000027.1| GENE 31 31356 - 32276 810 306 aa, chain + ## HITS:1 COG:no KEGG:BF2780 NR:ns ## KEGG: BF2780 # Name: not_defined # Def: putative recombinase/integrase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 306 74 379 379 578 100.0 1e-164 MEIEKFIKSLARKAKLGGRYSTANTYLYTLHSFQKFAGKASLTFEEITPESIKEYEQYLI LNGKRYNTISLYMRMLRSICNQASEQNIASLNTRELFENVFIGNEPTAKRAISPVLISRL LEADFSKNSRLDFARDLFLLSFYLRGIPFVDLVHLRKTDVQGNMLVYFRQKTGQQLTVII ENCAKVILRKYASLCKESVYLLPVISAAGEEGHKQYRSALRVYNKRLNQISGILKLKTPL TSYVARHSWATTALQKGVPVSVISAGMGHASEKVTYIYLASFDNKTLSNANKKVIAAVRF KKEEEE Prediction of potential genes in microbial genomes Time: Tue May 17 22:47:54 2011 Seq name: gi|226332028|gb|ACIB01000028.1| Bacteroides sp. 3_2_5 cont1.28, whole genome shotgun sequence Length of sequence - 34008 bp Number of predicted genes - 33, with homology - 32 Number of transcription units - 10, operones - 6 average op.length - 4.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 2/0.000 + CDS 87 - 1490 923 ## COG2148 Sugar transferases involved in lipopolysaccharide synthesis + Term 1599 - 1637 -0.6 + Prom 1541 - 1600 2.3 2 1 Op 2 2/0.000 + CDS 1646 - 2434 753 ## COG1596 Periplasmic protein involved in polysaccharide export 3 1 Op 3 . 
+ CDS 2448 - 4853 2325 ## COG0489 ATPases involved in chromosome partitioning + Term 4926 - 4973 10.2 - Term 4913 - 4961 14.2 4 2 Op 1 . - CDS 4983 - 5432 240 ## COG3023 Negative regulator of beta-lactamase expression - Prom 5621 - 5680 4.4 5 2 Op 2 . - CDS 5713 - 6165 493 ## BF2771 hypothetical protein - Prom 6278 - 6337 4.2 + Prom 6263 - 6322 5.3 6 3 Tu 1 . + CDS 6357 - 6605 331 ## BF2772 hypothetical protein + Term 6737 - 6782 2.2 - Term 6718 - 6774 13.1 7 4 Op 1 . - CDS 6875 - 9250 1749 ## BF2789 hypothetical protein 8 4 Op 2 . - CDS 9247 - 9444 109 ## BF2774 hypothetical protein 9 4 Op 3 . - CDS 9475 - 9627 111 ## - Prom 9677 - 9736 9.1 + Prom 9624 - 9683 6.6 10 5 Tu 1 . + CDS 9910 - 10428 486 ## BF2775 putative transcriptional regulator UpxY-like protein + Prom 10480 - 10539 2.0 11 6 Op 1 . + CDS 10601 - 12145 643 ## BF2791 putative transmembrane protein 12 6 Op 2 . + CDS 12147 - 13184 145 ## BF2792 putative transmembrane protein + Term 13217 - 13253 2.6 13 7 Op 1 . + CDS 13264 - 14349 374 ## BF2793 hypothetical protein 14 7 Op 2 . + CDS 14346 - 15440 309 ## BF2794 hypothetical protein 15 7 Op 3 11/0.000 + CDS 15474 - 16556 459 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 16 7 Op 4 11/0.000 + CDS 16562 - 17551 310 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 17 7 Op 5 4/0.000 + CDS 17561 - 18550 271 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 18 7 Op 6 . + CDS 18564 - 19115 333 ## COG1045 Serine acetyltransferase + Prom 19120 - 19179 3.2 19 8 Tu 1 . + CDS 19388 - 20158 381 ## BF2799 glycosyltransferase + Term 20244 - 20290 4.5 + Prom 20207 - 20266 3.3 20 9 Op 1 . + CDS 20437 - 21324 410 ## BF2800 putative transmembrane protein 21 9 Op 2 . + CDS 21366 - 22394 600 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 22 9 Op 3 . 
+ CDS 22399 - 23421 356 ## BF2802 putative acyltransferase transmembrane protein 23 9 Op 4 5/0.000 + CDS 23434 - 23997 292 ## COG1045 Serine acetyltransferase 24 9 Op 5 . + CDS 24012 - 25043 435 ## COG0438 Glycosyltransferase 25 9 Op 6 . + CDS 25030 - 26292 504 ## BF2805 putative 4Fe-4S binding protein 26 9 Op 7 . + CDS 26323 - 27486 664 ## BF2806 hypothetical protein 27 9 Op 8 3/0.000 + CDS 27480 - 28700 478 ## COG0438 Glycosyltransferase + Prom 28756 - 28815 2.2 28 9 Op 9 . + CDS 28878 - 29597 600 ## COG1922 Teichoic acid biosynthesis proteins 29 9 Op 10 . + CDS 29607 - 30311 419 ## BF2809 hypothetical protein 30 9 Op 11 . + CDS 30317 - 31558 870 ## BF2810 hypothetical protein 31 9 Op 12 . + CDS 31584 - 32870 1255 ## BF2929 hypothetical protein 32 9 Op 13 . + CDS 32910 - 33485 763 ## BF2812 hypothetical protein + Prom 33505 - 33564 3.1 33 10 Tu 1 . + CDS 33592 - 34006 179 ## BF2931 hypothetical protein Predicted protein(s) >gi|226332028|gb|ACIB01000028.1| GENE 1 87 - 1490 923 467 aa, chain + ## HITS:1 COG:wcaJ KEGG:ns NR:ns ## COG: wcaJ COG2148 # Protein_GI_number: 16129987 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sugar transferases involved in lipopolysaccharide synthesis # Organism: Escherichia coli K12 # 81 456 72 454 464 244 37.0 2e-64 MKQVLRFNKVIKRIVFTGDLILLNGTFLSLYTLLGSKFFADPFIHSLPQVLVLLNLCYLV SNMSSGIILHRRVVRPEQIVWRALRNSAGHALFFSCALTFGNFGILSARFFLLFYIAFTL LLVCYRLLFRKILKSYRKHGGNSRSIILVGSNSNIIELYHQMTDDVTSGFRVIGYFDDQP GSRFPEKVNYLGKPGKIVDRLKQGGVEQVYCCLPSARSEEILPIIDYCENHLIRFFSVPN VRSYLKRRMYFELLGNVPVLCIRQEPLSFAENRFRKRVFDIAFSLLFLCTLFPIIYVIVG LTIKITSPGPIFFKQKRSGEDGREFWCYKFRSMKVNTQSDTLQATLHDPRKTRFGNFLRK SSIDELPQFINVLMGDMSVVGPRPHMLKHTEQYSQLINKYMVRHFVKPGVTGWAQVTGFR GETHELWQMEGRVQRDIWYIEHWTFMLDLYIIYKTVRNALEGEKEAY >gi|226332028|gb|ACIB01000028.1| GENE 2 1646 - 2434 753 262 aa, chain + ## HITS:1 COG:PM1016 KEGG:ns NR:ns ## COG: PM1016 COG1596 # Protein_GI_number: 15602881 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 
Periplasmic protein involved in polysaccharide export # Organism: Pasteurella multocida # 41 227 80 251 387 65 30.0 1e-10 MTKRLLFFTLTCILLASCQSYKKVPYLQDPGEAQRAVAEAKLYDARILPKDLLTIVVSCS DPELAEPFNLTVSPPVSNTQKSLTSQPALQQYLVDNRGNIDFPVLGTLHIGGLTKGEAES LIREKLKGYIKENTIVTVRMANYKISVIGEVNRPGTFTISNEKVNLFEALAMAGDMTVYG LRDNVRLIREDADGHQHIITLNMNRADIIQSPYYYLQQNDILYVTPNKTKAKTADISAST TIWFSVVSTLVSLASLIITIAK >gi|226332028|gb|ACIB01000028.1| GENE 3 2448 - 4853 2325 801 aa, chain + ## HITS:1 COG:CAC3040 KEGG:ns NR:ns ## COG: CAC3040 COG0489 # Protein_GI_number: 15896291 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: ATPases involved in chromosome partitioning # Organism: Clostridium acetobutylicum # 559 776 2 215 232 122 35.0 2e-27 MKEDLYDDLYEEKEEKIDFHALLFRYVIRWPWFVASVIICLAGAWLHLRQTTPVYNISAS VIIKDDKKGGNSGGNLAALEGLGLVNSVSNIDNEIEILRSKTLVKHVVSELNLYTTYSVK GSFNEVELYKSSPVLVGLTPQEADRLPGPAVFELTLSPGNRLDVKATVGETSYNKKFSKL PGLLVTPAGTFTFTLAGDSAGVSEPQTLTAVVSNPMQTAKRYAAALSVEPTSKTTSIVIV SLKNTNKRRGEDFINRLIEVYNRNTNNDKNEVAEKTEEFIAGRIRIINDELFSTEKELET FKRDAGLTDLASDAQLAVSENSAYEKQRVENGTQLNLVRYLAEYISAPDKINAVLPVNVG LPDQSLSSLIGQYNEMVLQRNRLLRNSSESNPVIVNLDSGIRAMRENILTTIHSVQKGLL ITKADLDRQASKFNRRISNAPAQERQFVSISRQQEIKAGLYLMLLQKREENSIALAATAN NAKIVDEAMADNGPVSPKTKTIYMIALVMGMGIPVAIIYVMGLLQFRIEGRADVEKLTSA PIIGDIPLAEEGNGKAGGIAVRENENSLMAETFRGIRTNLQFMLGEENKVILVTSTISGE GKTFVATNLAISLSLLGKRVVIVGLDIRKPGLNKVFNLSQKEKGITQFLAGPQTTDLMSM VQPSGISRTLSILPGGTVPPNPTELLARQALVEAIDILKKHFDYIVLDTAPIGMVTDTQI IARVADLSVYVCRADYTHKADYTLLEDLRLGNKLPNLCTVINGLDMKKRKYGYYYGYGKY GRYYGYGKKYGYGYGYGQKHN >gi|226332028|gb|ACIB01000028.1| GENE 4 4983 - 5432 240 149 aa, chain - ## HITS:1 COG:HI1494 KEGG:ns NR:ns ## COG: HI1494 COG3023 # Protein_GI_number: 16273395 # Func_class: V Defense mechanisms # Function: Negative regulator of beta-lactamase expression # Organism: Haemophilus influenzae # 46 142 2 98 116 109 50.0 1e-24 MRKIDLIVIHCSATREDRCFTEFDLDVCHRRRGFNGPGYHFYIRKDGRIVSTRPVEKIGA 
HAKGHNATSIGICYEGGLDARGRPKDTRTEWQVHSMRVLVKTLLKQYPGSRVCGHRDLSP DLNANGEIEPEEWIKQCPCFNVIEDKKLH >gi|226332028|gb|ACIB01000028.1| GENE 5 5713 - 6165 493 150 aa, chain - ## HITS:1 COG:no KEGG:BF2771 NR:ns ## KEGG: BF2771 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 150 1 150 150 238 100.0 5e-62 MSKEVTYSVVARKNMLKKDDPAKYYAQAQASGDVGLDEISTRVEKACTVHSADVVAVLKA LEDEMVDGLSRGEIVRLGNIGTFQVGLRSRGAEKAEDFKAANISKARVNFRPGPVLADAM KTLNFSKVSTRAAQKGDGGGDGDIVDDPTA >gi|226332028|gb|ACIB01000028.1| GENE 6 6357 - 6605 331 82 aa, chain + ## HITS:1 COG:no KEGG:BF2772 NR:ns ## KEGG: BF2772 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 82 1 82 82 152 98.0 3e-36 MNQRKEEDTTEADFIIRSYTKAELAQLYCPGLAPVLALQKLYRWMRKNTALTQALSDVNY NKYRHSFLKREVRLIVYYLGEP >gi|226332028|gb|ACIB01000028.1| GENE 7 6875 - 9250 1749 791 aa, chain - ## HITS:1 COG:no KEGG:BF2789 NR:ns ## KEGG: BF2789 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 786 1 786 791 1430 88.0 0 MKKREFKISFFLHVWEKKAEEISLEEFHNDLRGARWKVLAESYRRWMRTGMTEEGKRLKG ALNAVVVAGKCRGGHAANQVTELNGLALFDFDHCLEMLAGMKEKAGALPYVVGAFVSISG EGLKLIVRIDAENAGQYAVAYPVVARELERVLGHPCDMSCRDLGRACYASYDPEAYYNPG AGVFPWREQVDGLLQAEGECSEQSVGKACPAGVASEAGDGFMQVFLNDFDARNPFVAGGR HAFVLKLGRVARYKGFSPEEMRLLQKAVVEKYAQADFGSGEIEKTLSSGYQYVSARRADA VMASQGPKVQGPLYAPEEGESEEDMEDVLFGKSEEFRRVAPYFPEEVFEHLPDLLAQGVK AAGNYRERDMLLMAMITNISACLPEVRVLYDQVYYSPHLYYMVIAHAGGGKGVVSLAGLL PGEIHRYYEKQNEEMRLVYDKAFFEWELELKKAQAEKRSPDFSLRPKEPVRKLLTLSPNV SKSMLISALEESGKLGCCINATELDMVSGAIRNDYGKHDDVFRAAFQHEVVSADFKVNGR QVVAHNPHLALCLAGTPNQLVRFIPSLENGLYSRFLVYTGQSDWCWRSAAPREGGEDHRA MFARLSGRLLELHQFLLQSPTEVTFTAAQWEEHTSRFSSHLSEVVNERDDSPGAIVLRHG LMASRIAGVLTALRKGECAWAMPQYVCSDEDFHTAMLMTDVLLEHSLLLSTSVRKSESKS GPLKPYFRLRPVLQTFSGTFTYKDAMDRAVEMGIPVTTFKRLFRKTIELKIIDKEGDMYI RTRRGWRETDA >gi|226332028|gb|ACIB01000028.1| GENE 8 9247 - 9444 109 65 aa, chain - ## HITS:1 COG:no KEGG:BF2774 NR:ns ## 
KEGG: BF2774 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 65 1 65 65 107 98.0 1e-22 MAKVGCGWVWRLSEETFTAVLTFGSVKSLLFFCIMVHCRGCAVARVLLCALGGGSGLKTL KFNTQ >gi|226332028|gb|ACIB01000028.1| GENE 9 9475 - 9627 111 50 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLKELSGQVFALVRELPKPLSREEMRELKRLCRFLNNTVKDQERKQEVRK >gi|226332028|gb|ACIB01000028.1| GENE 10 9910 - 10428 486 172 aa, chain + ## HITS:1 COG:no KEGG:BF2775 NR:ns ## KEGG: BF2775 # Name: not_defined # Def: putative transcriptional regulator UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 1 172 1 172 172 325 100.0 3e-88 MKSWLAAYVRLYHEKKTRDRLTAMGIESFLPVQEEIHQWSDRRKKIERVVIPMMIFVHVD PAERAEVLTLSSVSRYMVLRGQSTPAVIPDEQMERFRFMLDYSEEAIEVCSSPLAPGEQV RVIKGPLAGLEGELVTIDGKSKVAVRLDMLGCAHVDMPVGFVERVGKMEAVR >gi|226332028|gb|ACIB01000028.1| GENE 11 10601 - 12145 643 514 aa, chain + ## HITS:1 COG:no KEGG:BF2791 NR:ns ## KEGG: BF2791 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 514 1 514 514 862 99.0 0 MPSDSQNNKRIAQNTLLLYFRMLFLMLVSLYTSRVNLNALGIEDFGIYNVVGGLVAMFSI ISGSLVSSISRFITFELGTENKEKLKKVFSTAVSIQFFLVIIVVILAETIGLWFLNNKMV IPEERILAANIIYQFSIISFALSLMSIPYTGTIVAHEKMSAFAYISIFDVIGKLAVALTI SIAPIDKLIWFAGFIVFNSTIIQSIYIFYCKHHFEECTYHFIFDKSLLKNMFGFAGWNFI GSIAAILRDQGGNIVINMFCGPAVNAARGVAMQVNNAVSGFVSNFQTALNPQITKSYASG NYDYMMQLIFQGARLSYYILLILALPIISNTHFILQLWLGQVPKHTVLFVQLVLFFTMSE SLANPLINAMLATGKIKKFQIIVGGLNLVNLPLSYICLRLGCIPESVVIIAIIISMICEM ARVIMLRNMIHFPARSFLKKVYFNVIFVTITASILPLYLHFILEENIYTFTLISVVSFSC TLLSILYIGCNSEERVMVFSKVKVIVNKVSKRYK >gi|226332028|gb|ACIB01000028.1| GENE 12 12147 - 13184 145 345 aa, chain + ## HITS:1 COG:no KEGG:BF2792 NR:ns ## KEGG: BF2792 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 345 1 345 345 575 99.0 1e-163 MSLVNNFTVKKVRSSNLESLRLLSMFFVLVVHADFQALGMPDRTEMTILPLFSVFRIIIE 
AFAIVCVNSFVLLSGWFGINFHIKSLCNLLFQCAFFLIGIYTFTILIGIEPLSISGIKRC LMLTNNVWFVKCYLGMFIMAPILNAFVEKTDKRTFSTVLLSFFIFQTIYGWFSNGAPYFE KGYSAFSFMGLYLLARYVRIYQPSYTQWSKSKNLLMYISLSTFTALMLIITCYFDKVGYF CMFWTYTSPLVIAGALYLLLFFNRFAFQNKGINWIAASCFAVYLFHFFIWEYFMKPHIQQ LAVTYNGISCLLVITGLLLTFFTAAILIDKIRLYIWNHFVSKYIS >gi|226332028|gb|ACIB01000028.1| GENE 13 13264 - 14349 374 361 aa, chain + ## HITS:1 COG:no KEGG:BF2793 NR:ns ## KEGG: BF2793 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 30 361 1 332 332 680 99.0 0 MQEDEQGFLYPHINTTQCIDCGLCNKVCPVFRYDVISNKKNVNPHILALHHHNEKTWISS SSGGVFASLTDYALRQAGVIFGAIYDDNFVVVHRRAETQEDTLKFRGSKYVQSNLTGIYQ QVKFYLQEQRFVLFSGTPCQVEGLKGYLQRSYDNLLTVDILCHGVPSPRVFHDYLNFIRQ NSDFHFTGIFMKDKTFGWRYQNSRLFFGKNASQFNTVLSRLWNDIFYSHLTTRPSCHACR FTNYLRPGDITIGDFWGIEKHHPQFTDNRGISLIMLNNTKAEIVWNHIKDDFNYLESNIK ECIQPNLKYPVPEPVNKATFWQDYASMPFFQIMNKYYRITHQDLLKNRFYMILLTLKKRF T >gi|226332028|gb|ACIB01000028.1| GENE 14 14346 - 15440 309 364 aa, chain + ## HITS:1 COG:no KEGG:BF2794 NR:ns ## KEGG: BF2794 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 364 1 364 364 743 99.0 0 MKIGILTFHDAHNYGAMLQCYALQQFLFKKGYNVEVIDYRPAFYQKQYHRHSLCPWIGKN PVRTMKSIYYNYYLFNKRCAAFSDFHNRHLYMSIPATRTNIPQSYDAYIVGSDQIWNPQL TNGFHDVYFCDFCFPKGKSRFIAYAPSMEISRLSTQEAEYLTRVLNCFDALSVRESSLIP ILQPLVSQPIQQVLDPTLLLDATAWNPLIGKCPENRPYVVLYQVRENPAVRMKAFEIAQS IGGIVVELTARIDCHYSTKYQTASPADFVTYIRYATYVVTTSFHGTAFSLIFNRPFYTFS LGDNFDSRSASLLESVNLTERLVCPDEQFEISLIDFKQANKRLKHLRKQSCDFLRQSLHE RQEQ >gi|226332028|gb|ACIB01000028.1| GENE 15 15474 - 16556 459 360 aa, chain + ## HITS:1 COG:BS_yveT KEGG:ns NR:ns ## COG: BS_yveT COG0463 # Protein_GI_number: 16080481 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Bacillus subtilis # 3 201 5 211 344 135 36.0 1e-31 MKVSIIVPCYSVASKLPRCVHNLLAQTFTDWELILIDDGSTDNTWDICNSFTKNNAHIHA 
VHKENGGVSSARNTGIEIAKGEFITFIDSDDYVKPDYLQKLVEGQEADLVLCGFRSSTGI DFTPEPQYLIGDDLSKNIQAIVENDYLLYSPWCKLFRRDIIQKHQHRFDPEIRLGEDTIF CYKYLLYCSSIKVVASNSYFYDGVWGGYKKYVLTRQEVEYLDKAEITTLHNINQHFNCRI DLTYRGYHVAMLKGLYEKFRDYDTFEMYTRTHDVLPPEHFFANHKLSYIFWGIVELETLY VDKEYCAGKLFMQRLHHFFTIPTNQLLSYSYKMRLIHYLVKSKHYTMAHVLLIFLSILKH >gi|226332028|gb|ACIB01000028.1| GENE 16 16562 - 17551 310 329 aa, chain + ## HITS:1 COG:SP1365 KEGG:ns NR:ns ## COG: SP1365 COG0463 # Protein_GI_number: 15901219 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 1 250 1 247 328 127 35.0 3e-29 MSSPKISIIIPVYMAESYLHRCVDSIITQTFTDWELLLVDDGSPDHSGELCEKYASAYKS RIKAFHKPNGGVASAREFGMQQARGEYSIHVDPDDWIDSNTLEELYQQAVKEQADMVICD FMMEYPNRQIHNCQKPQLLDSSSFMHQLLQQERHGSLCNKLIRTELYHKYQLHFPEKMIC WEDLYICCSILLHGCKLAYVPHALYHYDFYTNDNSMVRHTDMRGLQAQIDFCRLMQAKIS PEYLPELNELKGITLITAFRNQLLNEQAIRSLFPEINDWYVTRYGHDYEKSNYYGLTLVL RGYNFKTARRRMFVAKFLVQIKNKITRML >gi|226332028|gb|ACIB01000028.1| GENE 17 17561 - 18550 271 329 aa, chain + ## HITS:1 COG:SP1764 KEGG:ns NR:ns ## COG: SP1764 COG0463 # Protein_GI_number: 15901595 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 7 217 4 221 301 135 34.0 1e-31 MNNTNPLVSIIVPVYKVERYIHRSIDSLLRQTYNNLEIILVDDGSPDNCPFICDKYAMQD NRIRVIHKSNGGLSDARNAALNVMTGQYVTFVDSDDYIADNMIEKFIETVKLYQCNMVVA GMNIIDKNNQIYDYRRTLTSTLSTGIDITRKLLKDVFPFNFVCAKIFESSLFEEIRFPVG RHYEDTATTYKLTHKCERIYCIADCLYFYELEREGNITSELNTSKAIKSYIDGCLNSREQ IFFCQKHKDFSDLLPVLTKRLQLWALLAIQSAISLGYKEYMFYYHQIRIYIRSLSYTQRM THLQIALTFPIIYYSLYPLLSRLKKMYKN >gi|226332028|gb|ACIB01000028.1| GENE 18 18564 - 19115 333 183 aa, chain + ## HITS:1 COG:AGc2882 KEGG:ns NR:ns ## COG: AGc2882 COG1045 # Protein_GI_number: 15888884 # Func_class: E Amino acid transport and metabolism # Function: Serine acetyltransferase # Organism: Agrobacterium tumefaciens strain C58 
(Cereon) # 55 182 76 200 209 131 53.0 7e-31 MKKIKTAFALPLFWLLFKVSKDMHKIQVDGQAWCQWQHKNFTLWNMCSLFIGFKEFRNLF YYRIGYLHHLIEWIFPRMTNLYITTPRSDVDSGLIIQHGFATIISAKQIGKNCKIYQQVT IGYDHTLQAPIIGDNVEICCGAKVIGGVTIGNNVIIGANAVVIKDVPNNCIVAGVPAKII KKI >gi|226332028|gb|ACIB01000028.1| GENE 19 19388 - 20158 381 256 aa, chain + ## HITS:1 COG:no KEGG:BF2799 NR:ns ## KEGG: BF2799 # Name: not_defined # Def: glycosyltransferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 256 93 348 348 508 99.0 1e-143 MEKFFPMLHFAQKVKRYKFNIPLYAMVHLVPSRLEKGFSDKKTFDEWTICIDKFLTLGHS LTQFLITKGLPEDKVVTTFHYVDEYYHNKRPVRLHKNIRVIAMGNQMRNLKLLKTIVDNN PNVNFTICQGVNDLSSYFLKNTNVELIPFVEESELRQHMANADISLNVMEDTVGSNVIVT SLAMGLAMICSNVGSIKDYCDDSNTIFCNNSNVEEFSQAITALQTDRIRLNTMQQSAANM GLQFTIEKFVRQISAL >gi|226332028|gb|ACIB01000028.1| GENE 20 20437 - 21324 410 295 aa, chain + ## HITS:1 COG:no KEGG:BF2800 NR:ns ## KEGG: BF2800 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 295 94 388 388 512 100.0 1e-144 MVYAFTFALCAMIFMKDYRTYARYMFPLFLIGFMNFEESMIRQAFSYSFFFLYLKYLFKL KFNKPKDILHNHKKLIYCIIFAILTLAIHTGNIISLFVITTLYIFWRKPFQPQFAIPIYV ACVYILPHIFNFNWLEPILSFAADTNERAAEYVKNADYWFSEKGENDQYDKNFIVEIIQV IGSSALMYFGYRLIIEKLPKHYALITMLNTFIIGLCIESIFVKLEILHRIGQTLDIVGYF ALAIVVSYKTIKLKPIQKVAYVCLLWFVYYYVKYLFFSGRTMFIWDTHYPFFKFI >gi|226332028|gb|ACIB01000028.1| GENE 21 21366 - 22394 600 342 aa, chain + ## HITS:1 COG:BS_yveT KEGG:ns NR:ns ## COG: BS_yveT COG0463 # Protein_GI_number: 16080481 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Bacillus subtilis # 1 230 1 219 344 145 39.0 1e-34 MIPKVSVIVPIYNVEKYLDQCVQALLAQTLSDIEIILIDDESPDNCPKICDDYAAQYPNI KVIHKKNAGLGMACNSGLDVATGEYVAFCDSDDYVDSDMYMTMYNVAQKYTCDAVFTGLK RITMAGIPTGTVTHQKEFKLYKSKNEIHTLLKDLIASDPYAREERAIQVSAKVVLYRRNL IEKKHLRFVSERILPSEDLIFNVDVLANSNIVCVLPQTFYNYRTNPISISHTIKKDKFSL FKQLYIEITDRCHRLGVEDNVQLRIQRMFLGYTRNYICNILNSSITNIEKKQITSSICKD 
GIWKPIWKTYPLSVMPLPHRIFTFAMRHNFYSLLLVLAKIKK >gi|226332028|gb|ACIB01000028.1| GENE 22 22399 - 23421 356 340 aa, chain + ## HITS:1 COG:no KEGG:BF2802 NR:ns ## KEGG: BF2802 # Name: not_defined # Def: putative acyltransferase transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 340 1 340 340 621 99.0 1e-176 MYKAPPRDISIDILKCIAAIMITNSHMDILYGDYSYLATGGAIGDALFFFCSGYTLFLGR DMDFFNWYKRRINRIYPTVFTWALIAAVFFDQRYDMPEILIKGGGWFVSCIMIYYVVLWF IRKFLKKHLKWVFFMAATITLLWYFVLGFGEKSGNNMYGWNYFKWCYYFLFMLLGSMMGL KRNLNKKTYPFFEGLYDKHIHIIPIFLKLLICIVLFFGLCWFKRQTGIGWELLQISSLIP LLGTTYYFWRLSNTKLLTQAYHHHLTGPIIRFISGLCLEIYLVQYNLFTDKLNNIFPLNL LVIFSTILVTAYLLRCLSRIWSQTFKDGDYDWRDVTKLYS >gi|226332028|gb|ACIB01000028.1| GENE 23 23434 - 23997 292 187 aa, chain + ## HITS:1 COG:MA3442 KEGG:ns NR:ns ## COG: MA3442 COG1045 # Protein_GI_number: 20092254 # Func_class: E Amino acid transport and metabolism # Function: Serine acetyltransferase # Organism: Methanosarcina acetivorans str.C2A # 2 182 24 206 232 86 35.0 3e-17 MIQSKKDLKYYLSEDLKRFNNHKPNIKDWLLHNEIWYIFHYIRHLRYVEYYKNTNKNKIL FFYHFFRYKRLGFKLKITIYPNTIGAGLRIYHVGDFIHIGAQCHIGHNCTLLPGVVFGNK YEKATDTQIIAGNNCYFGLGAKIFGSIIIGNNVTIGANAVVTKDIPDNAIVGGIPAKVLR FKEINIL >gi|226332028|gb|ACIB01000028.1| GENE 24 24012 - 25043 435 343 aa, chain + ## HITS:1 COG:VC0925 KEGG:ns NR:ns ## COG: VC0925 COG0438 # Protein_GI_number: 15640941 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Vibrio cholerae # 5 343 14 362 365 199 31.0 9e-51 MPKVLVVATSRKTKGGITSVVKAHETGEQWKKFHCKWIETHRDGNSVRKLWYLATALIEY ICLLPFYDIVHIHVGLRTSVNRKLIFARIALLFRKKIIVHFHPATEKHLFDPMFSGNIKH LFELSNKLLVLSPKWIEWINEAYRGNKYNIQVLYNPCPSVKRSIQRENYILYAGILSDRK GYNRLIEAFSKIAAKYPDWKIKFAGNGEIEKGKSLAVKFGIEQQTEFLGWIAGNTKESIF QHASIYCLPSWGEGFPMGVIDAIAYGIPVITTPVGGLEKVFHDGIDAMIYETYDLKMLAD KLEQLIKSETYRNSIVNEADKLVCNDFNIITICNKIEKIYENM >gi|226332028|gb|ACIB01000028.1| GENE 25 25030 - 26292 504 420 aa, chain + ## HITS:1 COG:no KEGG:BF2805 NR:ns ## KEGG: BF2805 # Name: 
not_defined # Def: putative 4Fe-4S binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 420 5 424 424 855 99.0 0 MRTCSPRLAKEKYCTGCMACSDACTHDAIKIVEKNCMPFVKVDTHKCMNCHLCEKACPII TPVKKNRAEEMNVYGGWANDEQTRIDAASGGGFSGLAQSFFHLHKEDKVAVIGATLANNR VYHQLIEQEKDIVLLTNSKYIQSDTQGIYKEVIERLKTGYWILFSGCPCQIAGLYGFLGK KRDSERLITIEVVCHGIASYEALDLHLKYYNSSRIYRFRDKRHGTQDWEQSQCTTIEQNG EEIKLKRKDDMFYAIYAGWMLDRKSCSNCQFAEINRVADITIADFWGLQVPDYYKQGVSL IIANNNKADTMIKAADAIYTFKESLRTAINGNPHLFTGYKLIQFHPIVMWPDFFRKILPK RIRFKILTNRMPYKLFWALYKLGTIYLVKYQKKQLISKFQKDDNLLKLLNNANRGGGKMA >gi|226332028|gb|ACIB01000028.1| GENE 26 26323 - 27486 664 387 aa, chain + ## HITS:1 COG:no KEGG:BF2806 NR:ns ## KEGG: BF2806 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 387 1 387 387 782 100.0 0 MKIGIITICKVNNYGAELQAFATQKKLEQMGHNAEIINYLYYKDWHFKDTPLSQPFVPLD MKGKLSYWIKYRLMSWVVNKVLPIFNGNMRRRLHNYQSFIDSERFSAQYKSMDELYKTYP KYDIYMVGSDQVWNPSASSSIEPYFLTFAPKNAPKVTYASSFGVASIAPNLSKRYAKLLN NLDTIAVREQSGVELVKQLTGREAKLVVDPTLLLSKADWEPYMKPLAKISTQYILIYQLF PSQTVIDVALKIGKEKNLPVYNICKRAYGMKKIVGINNILDAGPSEFLWLIANATCMVTN SFHGTAFSVNFATPFCCVLNRKRKNNGRMISFLDKVDMSNRILYEDSIAELNVMTACSEV TNNHLRLLVNNSIDYLKSIIENKEQKC >gi|226332028|gb|ACIB01000028.1| GENE 27 27480 - 28700 478 406 aa, chain + ## HITS:1 COG:PAB0827 KEGG:ns NR:ns ## COG: PAB0827 COG0438 # Protein_GI_number: 14521452 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Pyrococcus abyssi # 223 400 189 361 371 63 30.0 8e-10 MLSILINAYACSPNMGSEPGMAWNWCINLARHCELYIITEGEFRDKIEAVLPTLPQGKHM HFYYNPVSEEIRKMCWNQGDWRFYKHYKKWQWKTYEMAQEIIVKQHIDIVHQLNMIGFRE PGYLWKLDKPFVWGPVDAKEKFPTAYLRDAGIKANLFIRLKNHITGLQLRYSLRVKKAVK KASVVTSASSESQKSFKKYFHIDAPLLNETGCYPKTTIINSTKEKGDLNLLWVGKLDFRK QLPLAIKAIARLANPHIKLHIVGGNNNSYQKLAMELNISHQCIWHGVISHNEVQELMQKA DIFFFTSIAEGTPHVVLEAINNNLPVICFDICGHGDSINEQVGIKIPLSTPQQSINDFAE KITYLFNHRDVLKQMSENCRVRQEELSWDNKAKQMVSLYKKVLSQE >gi|226332028|gb|ACIB01000028.1| GENE 28 28878 - 
29597 600 239 aa, chain + ## HITS:1 COG:CAC2317 KEGG:ns NR:ns ## COG: CAC2317 COG1922 # Protein_GI_number: 15895584 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Teichoic acid biosynthesis proteins # Organism: Clostridium acetobutylicum # 39 220 50 231 250 111 33.0 1e-24 MLKLKTVSLLKHRSDLSILPEGKLLINTINAHSYNTALKDAGFAEALLKGGALIPDGASM VLAFRWLRKESIERTAGWDLFEYEMERLNRKGGICYFLGSSKNTLKLIKEKAKTVYPNIR IETYSPPYKPEFTEEENQMMIDAINAVKPDLLWIGMTAPKQEKWAYTHLDALEVNGHIGT IGAVFDFFAGTVERAPVRWQEHGLEWLYRLIKEPRRMWRRYIIGNALFLWNITKEKFSI >gi|226332028|gb|ACIB01000028.1| GENE 29 29607 - 30311 419 234 aa, chain + ## HITS:1 COG:no KEGG:BF2809 NR:ns ## KEGG: BF2809 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 234 1 234 234 476 100.0 1e-133 MRLKKAVLTVCVMMITSGIKAQYTMGTTGMMNIPTAEMQQTGTFMIGGNYLPEELNPFKY NSGNYFVNITFFSFLELNYRCILLKSDYMAKKPKFNQQDRSLSVRLRPLKEGKYWPAIVI GSNDPFKDKGYNYFASVYGVATKSFMIGEHRLAATAGYYYPLSKDKYTLQDGIFGGLSYT PSFCKPLSIMAEYDSDGFNVGAAAKLWKHLSLNVFTREFKCISGGIRYECVLIH >gi|226332028|gb|ACIB01000028.1| GENE 30 30317 - 31558 870 413 aa, chain + ## HITS:1 COG:no KEGG:BF2810 NR:ns ## KEGG: BF2810 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 413 1 413 413 818 99.0 0 MKKQLTIIIGLLLSSSITVHAQVAQKLRELGMENIRTIETGGTTVAAFEDNVYRGTYRGV GKAIIAGMEGMGNGNLELVALDGNGIPQLSISLPDTLIAGYKSGGISLKEVYERMEMSYD TDRPMGLLKGSAGVINRSAWKADIVLYPEVSLENSTFDKLYSYRVNLSPAVEMDLWKGAK ATAQVVFPIATNMKGEYKKIRPGVMTISQEIRFRNNFLARIVAGNFTDHRIGAQAEVKYR TGNGRVELGAQIGTTGYSAITDDGWYIGTRQRINAAVKGSLYVPQFNTQLDLQAGRYLYG DYGLRGDCTRHFGEYAVGVYAMYVEGEVNGGFHFAIPLPGKKWNRNHAVRMKPAEFFAAE YSMVSWGEYADRKMGYTYQTRPAENQSSGFFQPEYIRHFLIKSIEKERNKKQF >gi|226332028|gb|ACIB01000028.1| GENE 31 31584 - 32870 1255 428 aa, chain + ## HITS:1 COG:no KEGG:BF2929 NR:ns ## KEGG: BF2929 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 428 1 428 428 671 100.0 0 
MKKSDLFKIGVLLMATTLGTTGCSFGEDEKKPEIVVDPAEKTIEYYIAGKVTEGTTALSG VEVKAGEVTATTDAEGAYKLTVDSKKVYTVTFSKEGYMSIDNATATIADNAANRSMVSLS VKLSKKAPEKEVKADAEEEVVVTDKGDSNISQAEAAVIIPPKAIETTTTVSVTPYEEPAA VTTTVTPGNNVETPVAIANIEVETAKEVTLAKPVTLAIINKASEHTTFENVEVYNQKTTT RAGENWNKVADAIYDSETNSYKFTLPAGASLSGKYSMRVKSSKTTGKELVGETNKEEKKS NEGNMTAIPEYKINFEATAGWEYTVSPEKALMNAGVDAADAQGMATTINSAIEAQEGTTG TYKVAHELIAGISGNHILYYLNQAKYCEKTYTFKISGGRTVTITLKFYTGMQITYTNVEA SQHSGGKI >gi|226332028|gb|ACIB01000028.1| GENE 32 32910 - 33485 763 191 aa, chain + ## HITS:1 COG:no KEGG:BF2812 NR:ns ## KEGG: BF2812 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 191 1 191 191 325 99.0 7e-88 MKLNNIKTGLLLAGLLLLFGNVKAQDNPFKPEWFVSGGINGVTALNGSANNILGGKVSGG VWLNKIIGLRVDAEAGNVWLKGGYNAVTVGAGADILVNLMKNYTDENRKFRLNAIFGLGY NYYSFGDDYPRLSKTNTMSGNFSLQAAFRLNSHLSIFAEPGIKISTKFYDIENKDDVFAG GMMTVGVIYKF >gi|226332028|gb|ACIB01000028.1| GENE 33 33592 - 34006 179 138 aa, chain + ## HITS:1 COG:no KEGG:BF2931 NR:ns ## KEGG: BF2931 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 138 1 138 170 281 100.0 6e-75 MKLSKFLSSRTGKRFYNLCYCWGACLVILGAVFKIAHMPYDNLFLMIGLFTEVFIFFISG FDEPAREYKWERVFPLLNDKNANINPHTGVSDTLMTEKYIQQLKRLENNVCKLNETYEAQ IKGMTEHAKSLNEMNSEE Prediction of potential genes in microbial genomes Time: Tue May 17 22:49:34 2011 Seq name: gi|226332027|gb|ACIB01000029.1| Bacteroides sp. 3_2_5 cont1.29, whole genome shotgun sequence Length of sequence - 77155 bp Number of predicted genes - 69, with homology - 68 Number of transcription units - 31, operones - 14 average op.length - 3.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 108 - 1424 923 ## BF2814 hypothetical protein 2 1 Op 2 . + CDS 1432 - 2529 968 ## BF2815 hypothetical protein 3 1 Op 3 . + CDS 2557 - 2985 507 ## BF2934 hypothetical protein 4 1 Op 4 . 
+ CDS 3001 - 4050 765 ## COG0836 Mannose-1-phosphate guanylyltransferase + Term 4151 - 4194 12.4 - Term 4313 - 4361 13.4 5 2 Op 1 4/0.000 - CDS 4478 - 5713 1279 ## COG1883 Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit 6 2 Op 2 . - CDS 5713 - 7548 1766 ## COG5016 Pyruvate/oxaloacetate carboxyltransferase 7 2 Op 3 . - CDS 7583 - 7840 321 ## BF2938 putative oxaloacetate decarboxylase gamma chain 1 - Prom 7881 - 7940 6.7 - Term 7932 - 7991 16.2 8 3 Tu 1 . - CDS 8014 - 9651 1272 ## COG0737 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases - Prom 9673 - 9732 10.6 9 4 Tu 1 . - CDS 10560 - 11210 346 ## BF2823 hypothetical protein - Prom 11237 - 11296 3.7 + Prom 11225 - 11284 6.6 10 5 Tu 1 . + CDS 11309 - 11434 60 ## + Prom 11507 - 11566 5.9 11 6 Op 1 . + CDS 11814 - 14726 898 ## COG1061 DNA or RNA helicases of superfamily II 12 6 Op 2 . + CDS 14737 - 15312 324 ## BF2944 hypothetical protein 13 6 Op 3 . + CDS 15318 - 15632 366 ## BF2945 hypothetical protein + Term 15647 - 15689 4.9 + Prom 16384 - 16443 4.0 14 7 Tu 1 . + CDS 16519 - 16983 553 ## BF2948 hypothetical protein + Prom 16995 - 17054 7.0 15 8 Op 1 . + CDS 17130 - 17798 473 ## COG0325 Predicted enzyme with a TIM-barrel fold 16 8 Op 2 . + CDS 17854 - 18828 948 ## COG0167 Dihydroorotate dehydrogenase + Term 18841 - 18888 14.3 - Term 18828 - 18876 10.7 17 9 Op 1 . - CDS 18942 - 19736 823 ## COG3409 Putative peptidoglycan-binding domain-containing protein 18 9 Op 2 . - CDS 19758 - 20096 301 ## BF2828 hypothetical protein 19 9 Op 3 . - CDS 20103 - 20609 199 ## BF2829 hypothetical protein 20 9 Op 4 . - CDS 20621 - 20857 168 ## BF2954 hypothetical protein 21 9 Op 5 . - CDS 20854 - 21468 594 ## BF2955 hypothetical protein 22 9 Op 6 . - CDS 21485 - 22297 738 ## BF2956 hypothetical protein 23 9 Op 7 . - CDS 22326 - 23198 933 ## BF2957 hypothetical protein 24 10 Op 1 . - CDS 23332 - 23733 436 ## BF2958 putative two-component system response regulator 25 10 Op 2 . 
- CDS 23774 - 24118 125 ## BF2959 hypothetical protein - Prom 24285 - 24344 12.3 - Term 24259 - 24324 5.8 26 11 Op 1 . - CDS 24349 - 25242 681 ## BF2836 hypothetical protein 27 11 Op 2 . - CDS 25247 - 27676 1364 ## COG1020 Non-ribosomal peptide synthetase modules and related proteins 28 11 Op 3 2/0.200 - CDS 27689 - 30055 1288 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 29 11 Op 4 10/0.000 - CDS 30052 - 30888 845 ## COG0642 Signal transduction histidine kinase 30 11 Op 5 . - CDS 30911 - 33748 1723 ## COG0642 Signal transduction histidine kinase 31 11 Op 6 . - CDS 33745 - 35346 658 ## BF2965 hypothetical protein 32 11 Op 7 . - CDS 35343 - 35771 451 ## COG2172 Anti-sigma regulatory factor (Ser/Thr protein kinase) 33 11 Op 8 . - CDS 35786 - 36091 403 ## BF2843 putative anti-anti sigma factor 34 11 Op 9 . - CDS 36110 - 37279 910 ## COG2208 Serine phosphatase RsbU, regulator of sigma subunit - Prom 37360 - 37419 5.8 35 12 Op 1 . + CDS 37627 - 38250 305 ## BF2845 hypothetical protein 36 12 Op 2 . + CDS 38286 - 38621 363 ## BF2970 hypothetical protein 37 12 Op 3 . + CDS 38636 - 39136 403 ## BF2847 hypothetical protein + Term 39162 - 39206 9.2 - Term 39148 - 39193 5.2 38 13 Tu 1 . - CDS 39227 - 41131 1844 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases - Prom 41206 - 41265 4.7 - Term 41236 - 41277 7.0 39 14 Tu 1 . - CDS 41293 - 42618 1512 ## COG1875 Predicted ATPase related to phosphate starvation-inducible protein PhoH - Prom 42731 - 42790 8.2 + Prom 42645 - 42704 4.0 40 15 Tu 1 . + CDS 42748 - 44124 1069 ## COG0285 Folylpolyglutamate synthase 41 16 Tu 1 . - CDS 44092 - 44466 503 ## COG0251 Putative translation initiation inhibitor, yjgF family - Prom 44562 - 44621 5.8 - Term 44698 - 44741 10.8 42 17 Op 1 . - CDS 44764 - 46470 1877 ## COG0457 FOG: TPR repeat 43 17 Op 2 . - CDS 46491 - 47231 385 ## PROTEIN SUPPORTED gi|163764761|ref|ZP_02171815.1| ribosomal protein S11 44 17 Op 3 . 
- CDS 47292 - 48977 1806 ## COG0497 ATPase involved in DNA repair 45 17 Op 4 . - CDS 48981 - 50174 1033 ## COG0452 Phosphopantothenoylcysteine synthetase/decarboxylase 46 17 Op 5 . - CDS 50177 - 50950 867 ## COG0847 DNA polymerase III, epsilon subunit and related 3'-5' exonucleases - Prom 51013 - 51072 5.0 47 18 Tu 1 . - CDS 51081 - 52205 1163 ## COG0592 DNA polymerase sliding clamp subunit (PCNA homolog) - Prom 52316 - 52375 6.5 + Prom 52162 - 52221 6.9 48 19 Tu 1 . + CDS 52358 - 52732 434 ## BF2982 hypothetical protein + Term 52753 - 52813 9.2 - Term 52742 - 52798 12.3 49 20 Op 1 . - CDS 52810 - 54447 1262 ## BF2859 hypothetical protein - Prom 54471 - 54530 6.2 - Term 54473 - 54519 2.1 50 20 Op 2 . - CDS 54540 - 55298 507 ## COG1235 Metal-dependent hydrolases of the beta-lactamase superfamily I 51 20 Op 3 . - CDS 55359 - 56357 936 ## COG0812 UDP-N-acetylmuramate dehydrogenase 52 20 Op 4 . - CDS 56377 - 57219 751 ## BF2986 hypothetical protein - Term 57229 - 57269 -0.9 53 21 Tu 1 . - CDS 57327 - 58280 1029 ## COG0451 Nucleoside-diphosphate-sugar epimerases - Prom 58336 - 58395 4.7 + Prom 58309 - 58368 4.7 54 22 Op 1 . + CDS 58417 - 59604 1392 ## COG0156 7-keto-8-aminopelargonate synthetase and related enzymes 55 22 Op 2 . + CDS 59654 - 60058 413 ## BF2989 hypothetical protein + Term 60098 - 60143 5.0 - Term 60086 - 60131 1.2 56 23 Tu 1 . - CDS 60198 - 61109 845 ## BF2991 hypothetical protein - Prom 61157 - 61216 2.5 - Term 61161 - 61200 3.4 57 24 Op 1 . - CDS 61227 - 62273 884 ## BF2867 hypothetical protein 58 24 Op 2 . - CDS 62287 - 64407 1284 ## BF2868 putative lipoprotein - Prom 64434 - 64493 4.5 59 25 Tu 1 . - CDS 64510 - 65532 692 ## BF2869 putative lipoprotein - Prom 65612 - 65671 3.1 60 26 Op 1 . - CDS 65678 - 67234 993 ## BF2870 hypothetical protein 61 26 Op 2 . - CDS 67218 - 68270 962 ## BF2996 hypothetical protein - Prom 68340 - 68399 2.8 62 27 Op 1 . - CDS 68450 - 70111 1384 ## BF2997 hypothetical protein 63 27 Op 2 . 
- CDS 70137 - 71540 853 ## BF2874 hypothetical protein 64 27 Op 3 . - CDS 71549 - 72106 459 ## BF2875 hypothetical protein 65 27 Op 4 . - CDS 72143 - 72517 330 ## BF2876 hypothetical protein - Prom 72763 - 72822 7.5 - Term 72725 - 72781 0.3 66 28 Tu 1 . - CDS 72908 - 73846 729 ## BF2877 tyrosine site-specific recombinase - Prom 73874 - 73933 6.0 + Prom 73924 - 73983 5.8 67 29 Tu 1 . + CDS 74063 - 74437 245 ## BF3039 hypothetical protein + Term 74603 - 74649 -1.0 68 30 Tu 1 . - CDS 74666 - 76243 1359 ## BF2880 hypothetical protein - Prom 76277 - 76336 6.2 - Term 76745 - 76776 2.5 69 31 Tu 1 . - CDS 76994 - 77155 93 ## BF2883 hypothetical protein Predicted protein(s) >gi|226332027|gb|ACIB01000029.1| GENE 1 108 - 1424 923 438 aa, chain + ## HITS:1 COG:no KEGG:BF2814 NR:ns ## KEGG: BF2814 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 438 1 438 438 842 99.0 0 MAKYTLPPRQKMINLLYVVLIAMLAINISSDVLEGYGRMNNDYLPQIKKLEEYNRTLLER INSRNDKAALSAQNIDAAAGKLMDTLEELKEDIARKADKEKYEAGKLKAKDDLNAVPEVF LSVTGGKGKALRLSLDTFKEDALSLIKNDAHRQLVGTYLNTESPGTGISWEKETFSYLPA IGGVTFINKMQEEVLMCVNEVYRSLLYEEAEDGKGGAFVFINEDQMIVNKDGTVDLPVVQ ITPALTSILYTDYENPLNILTAGIPFNEVTFRMTNGKILKRGNHCIAVPDEKAQTATVTA TQIKNGVARQLAEYRYTVKALPDPTPYILCTDENGRTVQYRGNVPINKRLVSNMTQLGAS ISDGPKANYEISSFEMVLIKGSSKAVTSIPNTGNKFSARQMELIRQLEKGDKFYITSIVV TGPGNKKKQIASINVVLI >gi|226332027|gb|ACIB01000029.1| GENE 2 1432 - 2529 968 365 aa, chain + ## HITS:1 COG:no KEGG:BF2815 NR:ns ## KEGG: BF2815 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 365 1 365 365 719 98.0 0 MTNILGLKQNRWIVGAVLLLTTCNLQAQEQPNAREKFERGNWFVSGALNGEWLTKTIGNV YAGGKISGGVYLTPLSGFRATAEIGKNRIGNDTEATQLSANLDYMLTLIGNNGFKRFNLA AILGAGFNYYDFGDNDPKYTRVNTISGNFSIQASYNVNRKFSIFIEPGLKVLPKYYSKEL NNKIYMQSNLTIGLAYTFRDKYRKSVDNSIHPLYLPEVDLLEIKEKIGMLCEEVMQMKQE LKERRKITDGQNLMIVPQKDALSIDIMFDEFSSFVSEEQGQKIDGIGEWMKNNNASIRII AFSDNLTDKKADQELRKRRSEAIRKILIEKYHISPERISESTPEAMGYENKTGCNAMIVY 
IPENK >gi|226332027|gb|ACIB01000029.1| GENE 3 2557 - 2985 507 142 aa, chain + ## HITS:1 COG:no KEGG:BF2934 NR:ns ## KEGG: BF2934 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 290 100.0 2e-77 MTIAVDFDGTIVEHRYPKIGEEIPFATETLKILAQERHKLILWTVREGELLEEAIEWCRQ RGVFFYSVNKDYPEEEKSHNGFSRKLKADLFIDDRNLGGLPDWGTIYQMIHEQKPYEPVL CDRQKPTGDLSWIEKLLGKRNK >gi|226332027|gb|ACIB01000029.1| GENE 4 3001 - 4050 765 349 aa, chain + ## HITS:1 COG:CAC3058 KEGG:ns NR:ns ## COG: CAC3058 COG0836 # Protein_GI_number: 15896309 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Mannose-1-phosphate guanylyltransferase # Organism: Clostridium acetobutylicum # 6 342 5 340 350 288 43.0 1e-77 MNNHVVIMAGGIGSRFWPMSTPECPKQFIDILGCGKTLIQLTVERFGNVCPQENMWVVTS EKYIDTIREQLPGIPESNILAEPCPRNTAPCIAYACWKIKKKYPEANIVVTPSDQVVIDT TEFRRVIEKALLFTDKSSAIITLGIKPARPETGYGYIAAGEPITRDKEIFHVEAFKEKPD KETAEKYLAAGNYFWNAGIFVWNVRTITAVMRVYAPGIAQIFDRIYPDFYTEREEESVKK LFPTAESISIDYAVMEKAEEIYVLPAQMGWSDLGTWGALHTLLPKDKEGNATVGPDIRMY ESRNCMVHASQEKRVVIQGLNDYIIAEKDNILLICQLSEEQRIKDFSKE >gi|226332027|gb|ACIB01000029.1| GENE 5 4478 - 5713 1279 411 aa, chain - ## HITS:1 COG:AF2084 KEGG:ns NR:ns ## COG: AF2084 COG1883 # Protein_GI_number: 11499666 # Func_class: C Energy production and conversion # Function: Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit # Organism: Archaeoglobus fulgidus # 24 410 5 353 354 294 48.0 3e-79 MNGIFEKLYDMTAFSNIVAEPQFLVMYVIAFVLLYLGIKKQYEPLLLVPIAFGVLLANFP GGGMGVIQADENGMILVNGVMKNIWEMPLHDIAHELGLMNFVYYMLIKTGFLPPIIFMGV GALTDFGPMLRNLRLSIFGAAAQLGIFTVLLVAILMGFTPSEAASLGIIGGADGPTAIFT TIKLAPHLLGPIAIAAYSYMALVPVIIPLVVRLLCTKKELSINMKEQEKKYPSKTEIKNL RVLKIIFPIVVTTVVALFVPSAVPLIGMLMFGNLVKEIGANTFRLFDAASNSIMNAATIF LGLSVGATMTSEAFLNWTTIGIVVGGFLAFALSIAGGIFFVKLVNLFTKKKINPLIGATG LSAVPMASRVANDIALKYDPKNHVLQYCMASNISGVIGSAVAAGVLISFLS >gi|226332027|gb|ACIB01000029.1| GENE 6 5713 - 7548 1766 611 aa, chain - ## HITS:1 COG:AF1252m KEGG:ns NR:ns ## COG: 
AF1252m COG5016 # Protein_GI_number: 18677784 # Func_class: C Energy production and conversion # Function: Pyruvate/oxaloacetate carboxyltransferase # Organism: Archaeoglobus fulgidus # 9 515 9 477 480 184 29.0 4e-46 MKREVKFSLVFRDMWQSAGKYVPRVDQLVKVAPAIVEMGCFARVETNGGGFEQVNLLFGE NPNKSVRAWTKPFHEAGIQTHMLDRALNGLRMSPVPADVRKLFYKVKKAQGTDITRTFCG LNDVRNIAPSITYAKDAGMISQCSLCITHSPIHTVEYYTNMALELIKLGADEICIKDMAG IGRPVSLGKIVANIKAAHPDIPIQYHSHAGPGFNMASILEVCEAGCDYVDVGMEPLSWGT GHADLLSVQAMLKDAGFQVPEINMEAYMKVRALIQEFMDDFLGLYISPKNRLMNSLLIAP GLPGGMMGSLMADLETNLESINKYKAKHNLPFMTQDQLLIKLFNEVAYVWPRVGYPPLVT PFSQYVKNLSMMNVIAMEKGKERWGMIADDIWDMILGKAGRLPGKLAPEIIEKAEREGRK FFEGNPQDNYPDDLEKYRKLMKEKKWETGEDDEELFEYAMHPAQYEAYKSGKAKEDFLAD VAKRRAEKEQVPTEEAKPKTLTVQVDGQAYRVTVAYGDTELPVAPASQPSAAAGEGKEVL SPLEGKFFLVKNAQETPLKVGDAVKEGDVLCYVEAMKTYNAIRAEFGGTVTAISANPGDT VSEDDVLMKIG >gi|226332027|gb|ACIB01000029.1| GENE 7 7583 - 7840 321 85 aa, chain - ## HITS:1 COG:no KEGG:BF2938 NR:ns ## KEGG: BF2938 # Name: not_defined # Def: putative oxaloacetate decarboxylase gamma chain 1 # Organism: B.fragilis # Pathway: not_defined # 1 85 1 85 85 121 100.0 6e-27 MENLETALLLMVVGMATVFAILLIVIYLGKLLISLVNKYAPEEQLPAKQGAQSPVPIPGN IVAAITAAVNVVTQGKGKVAKIEKI >gi|226332027|gb|ACIB01000029.1| GENE 8 8014 - 9651 1272 545 aa, chain - ## HITS:1 COG:BH0026_2 KEGG:ns NR:ns ## COG: BH0026_2 COG0737 # Protein_GI_number: 15612589 # Func_class: F Nucleotide transport and metabolism # Function: 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases # Organism: Bacillus halodurans # 53 498 2 429 471 144 30.0 5e-34 MEKLHIRKIASLGLMLCFFTGVGAQTPVKVEKRKEHKSNTVIPVVKGNVTDTLSLVSFND FHGAFACDKGVPGAGQLVQTVLTQKEKNKNTIVLSVGDNFSGSYFSRITRGNPLPEMFQE MDVKMSAVGNHEFDWGLPYLTDTAKVYMNFVAANIITDRGDTLEWAKPYRIVTLNLKNGG TVRVAFVGLTTTDTAHKTSPENIKGLAFVHPVYAARVETACRLKKEGKVDMVVLLMHIGT NMKNRDIIEEENAKLLPFLKGVDAIISGHSHEVVLSKVNDVPIIQAGVNGTHIGKLDFRV VKEEGGNRISYIGGDTIRTEGPSNAHIDSLVDKVLAVYGLSEKLILAKDALIHDRTINKW EYTPVGAYVTAAYVRSFLENEKVPAELKVLPVIGVNHFGGLRASIPAGEVTRLRAGNVLP 
FGGNVVAYEFTGERLKQLLDDGRRNKNGFLQTSYLKLHLNSKGNVETITDCRTNKIIRDN DKCIVVLDAFITSGGDGYDSKLFSGYEIGDFNSQNLETTVAFIDYLKSLKGFVSSEAAPI PIVQK >gi|226332027|gb|ACIB01000029.1| GENE 9 10560 - 11210 346 216 aa, chain - ## HITS:1 COG:no KEGG:BF2823 NR:ns ## KEGG: BF2823 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 216 2 217 217 411 96.0 1e-113 MRFILFIIFSLCLPLVMQGQGQNVWTQNDSLKLKKILAGQAEIRINREALRELDRVFSSP RRLFKSQSHSAILLIKDFLLYRPNVFTGKYHISTFKVNELNIRQDSLFMNVHMLGNKQLF IKSQLDIGDPRVLIKRKTDIQFKFTDHLAYRVYGGYTIDKNRTVILPTTATPYYVGTGFS YSLNKKLQLKTGIEYQYNVVYKRWEWVWNSGIRFNF >gi|226332027|gb|ACIB01000029.1| GENE 10 11309 - 11434 60 41 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKLRFVPTHPIIFNTIKYYFTSVWNTELKATIFSLNYHFHI >gi|226332027|gb|ACIB01000029.1| GENE 11 11814 - 14726 898 970 aa, chain + ## HITS:1 COG:Z1129_2 KEGG:ns NR:ns ## COG: Z1129_2 COG1061 # Protein_GI_number: 15800650 # Func_class: K Transcription; L Replication, recombination and repair # Function: DNA or RNA helicases of superfamily II # Organism: Escherichia coli O157:H7 EDL933 # 397 814 2 400 407 324 39.0 5e-88 MENLQEKYDALIKRYNTLLDENEELKSILLQHGIIYPARNITNESAFSSITFPPIKLSLD EKVALFRSFFKGREDVFARRWFSKTTEKGGYQPVCINEWYKGACDKKRNKCTECPNRNFA PLTNQNIYRHLEGKDENGCDVIGLYAITLDNKCSFLCADFDDKNCTHGYKEDVLAFIAVC RDWEIPYSIERSRSGNGAHIWTFFKEPIPSYKARKLGNTILTEAMKLNGRITFDSYDRFF PNQDRLPEGGFGNLIALPLQGRARKAGNSVFVDEEFLPFKDQWAYLYGIKKIDESVVDGL LAQYRQEDFGALATSSETKPWESPVIQNITRNDFDGRLKINKSDKIYIPLNSISDTVVNH LKRIAAFKNPEFYSKQAMRISTYNVPRIICRADFTDEYIVMPRGCEDAITAMLSSLGVTY EIVDRTNQGKHIAVAFKGKEREEQLNAINTLMAHTNGVLSATTAFGKTVTAAALIARKKI NTLILIHSKALLLQWHERLSEFLDIDFTEPDIPKKRGRKNVFSPIGCLDSTSNTLHGVID IALMQSCLENGEVKPFVQEYGMVIVDECHHVSSVTFENVLKNITSHFVYGLTATPIRKDG LQPIIFMQCGPIRYSSDAKAQIEKQSFRRYLVPRFTSYRPITDDKQSFTALSQSLSESEI RNTLIVEDVLHNVAESRTPIILTNRTSHVKLLAEMLEPHIANVIQLTGEGSTRNKREAFQ RLYDIPQDAPLVIVATGKYIGEGFDYPRLDTLFLALPISWKGLVAQYAGRLHREREGKTD VRVYDYVDIHEPVCENMYRKRLKGYAAIGYSVLSKDNLTLFDNIDSLQSSSYEGQIFNGS 
TFRQSFTKALKSSKQSVVVSSPKLYHTERNAFVKILKELQTNGIEVAIITSASNEQTEYL KKQGLFVRVVPNLSLCSCVIDKSIVWYGSINLLGYPTEEDSIIKCPDFKLATELLDIIFC SDKQHDITTQ >gi|226332027|gb|ACIB01000029.1| GENE 12 14737 - 15312 324 191 aa, chain + ## HITS:1 COG:no KEGG:BF2944 NR:ns ## KEGG: BF2944 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 60 191 1 132 132 231 99.0 1e-59 MVFFHLFHRYSTRSSSFEQREHFLLLNALFIIIQLRNGTQKAPYREVVFYINLFYFFGKM FLYMEPFAYICIKIKVMEVKAKFKVVYSKEADDFLNHLPSKIKDKIIYNIAKAKFVIDPE LFKKLDDTDIWEFRTFYKKVKYRLLAFWDKDNGADTLVIATHGFIKKTQKTPPKETTKAE EIRKAYFNSKK >gi|226332027|gb|ACIB01000029.1| GENE 13 15318 - 15632 366 104 aa, chain + ## HITS:1 COG:no KEGG:BF2945 NR:ns ## KEGG: BF2945 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 104 1 104 104 189 100.0 3e-47 METMKFYTEEEILDRHIGEKDTPKRDKFEADLHSFLIGEAIKQARQSKNLTQEELGNLIG VQRAQISRIENGKNLTFSTISRVFKAMGISAKLEIGNLGKVALW >gi|226332027|gb|ACIB01000029.1| GENE 14 16519 - 16983 553 154 aa, chain + ## HITS:1 COG:no KEGG:BF2948 NR:ns ## KEGG: BF2948 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 154 1 154 154 286 100.0 1e-76 MSMHTWFECKIRYEKLMENGMNKKVTEPYLVDALSFTEAEARIIEEMTPFITGEFTVSDI KRANYSELFPSEEEAADRWFKCKLIFITLDEKSGAEKKTSTQVLVQAADLRDAVKKLDEG MKGTMADYQIGSVAETAIMDVYPYSSEPNDKPEV >gi|226332027|gb|ACIB01000029.1| GENE 15 17130 - 17798 473 222 aa, chain + ## HITS:1 COG:CAC2121 KEGG:ns NR:ns ## COG: CAC2121 COG0325 # Protein_GI_number: 15895390 # Func_class: R General function prediction only # Function: Predicted enzyme with a TIM-barrel fold # Organism: Clostridium acetobutylicum # 1 222 1 218 221 156 40.0 2e-38 MNVTDNLKQVLAELPSGVRLVAVSKFHPNEAIEEAYRAGQRIFGESKVQEMTGKYESLPK DIEWHFIGHLQTNKIKYMAPYVSMIHGIDSYKLLAEVNKQAIKAERVIRCLLQIHIAQEE TKFGFSFDECKEMLNAAEWKALANIQICGLMGMATNTDSKEQIEREFRSLNCFFHEVKKQ YFANEPTFCELSMGMSHDYHLAIKEGSTLVRVGSKIFGERVY >gi|226332027|gb|ACIB01000029.1| GENE 16 17854 - 18828 948 324 aa, chain + ## 
HITS:1 COG:alr1912 KEGG:ns NR:ns ## COG: alr1912 COG0167 # Protein_GI_number: 17229404 # Func_class: F Nucleotide transport and metabolism # Function: Dihydroorotate dehydrogenase # Organism: Nostoc sp. PCC 7120 # 3 319 2 323 343 199 34.0 5e-51 MADLKTTFAGLTLKNPVIISSSGLTNSAAKNAKLEAAGAGAIVLKSLFEEQIMMEADRLR NPSYYPEGSDYLAEYIRNHKLAEYLELIKESKKICTIPVIASINCYTDAEWVDFAKQIEE AGADALEINILALQSDIQYKYGSFEQRHIDILSHIKKTIRIPVIMKLGSNFTNPVALIDQ LYANGAAAVVLFNRFYQPDIDVEKMEHTSGDVFSNASDLSTTLRWIGISSSLVSKIDYAA SGGIHKPDGIVKAILAGASAIEICSAIYQNTNLFVGEMNRFLSAWMERKGFKHISQFKGK LNAKDVEGINMFERTQFLKYFSEK >gi|226332027|gb|ACIB01000029.1| GENE 17 18942 - 19736 823 264 aa, chain - ## HITS:1 COG:RSc1931_1 KEGG:ns NR:ns ## COG: RSc1931_1 COG3409 # Protein_GI_number: 17546650 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Putative peptidoglycan-binding domain-containing protein # Organism: Ralstonia solanacearum # 1 163 1 159 159 62 31.0 1e-09 MKTIKLGYEGEEALLLCRELKRNGYSVKESRTFTQEMKESVVDFQQKSQLDADGIVGYRT WESLFFTGRPTTERLTEEDFILVARLLDVEVAALKAVQQVETGGRGGFFAPGKPAILFEG HIFWNQLKKRNVNPESHVKGNENILYPKWEKGHYKGGMGEYDRLEQARKINHEAADASAS WGMFQIMGFNYAACGEKSVDSFVKAMCMSECRQLVLSARFIKQSGMLSALQAKDWAEFAK RYNGPAYEQNQYDKKLAAAYQKFS >gi|226332027|gb|ACIB01000029.1| GENE 18 19758 - 20096 301 112 aa, chain - ## HITS:1 COG:no KEGG:BF2828 NR:ns ## KEGG: BF2828 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 112 1 112 112 220 100.0 1e-56 MNPILNKMGANANEQKKLLMECVSMLEKYVNRFPAEKGCASFSGEDMKLWKEVYFPKLVQ TDILLDGKFFCGTSSGNSGIGTDGYFTGYEFFQFIYRAYKALYELEKASQMR >gi|226332027|gb|ACIB01000029.1| GENE 19 20103 - 20609 199 168 aa, chain - ## HITS:1 COG:no KEGG:BF2829 NR:ns ## KEGG: BF2829 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 168 1 168 168 337 100.0 1e-91 MKLTFFKRMGEKIRHPFRKEIPKTIPVVETAPQPVADNATEATAEESSVIRSADQCGEQA RYFLLRNNKPVGKPFSYYHPEIRIVHVGSFVNAFLFFLRMCDQRLLTYRQTGEYLHCTAV 
FPDESGNLYFTNKVTCRNKENTVAVLKIDYVGLKPKITEIRFELNIKK >gi|226332027|gb|ACIB01000029.1| GENE 20 20621 - 20857 168 78 aa, chain - ## HITS:1 COG:no KEGG:BF2954 NR:ns ## KEGG: BF2954 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 78 1 78 78 138 100.0 6e-32 MNNVAEHAREQKAGMKCPQCGAFIETSIFELLTSNALQCPSCHLRLNIDRMKSKAAFDAL RKVQNAQENLERKSKFNG >gi|226332027|gb|ACIB01000029.1| GENE 21 20854 - 21468 594 204 aa, chain - ## HITS:1 COG:no KEGG:BF2955 NR:ns ## KEGG: BF2955 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 204 1 204 204 340 100.0 2e-92 MDNKKVRSTSSQVMELQQLIAGPLIATIEADSLSSQRYLDYLMKIAFESYDPVTGRTGKI RMLTFNYQSQDAGGGRTQSVSIPILTLVPLPLLQVQEADFDFDIKILDALSETAEEKFSL EEGKSLNEPQSGGGFKLRASLAPKQGEGSSTSNVQQSLSANMKVKVKMRQADMPAGLSNL LHLTASNMQVEETEAEEITEGGNK >gi|226332027|gb|ACIB01000029.1| GENE 22 21485 - 22297 738 270 aa, chain - ## HITS:1 COG:no KEGG:BF2956 NR:ns ## KEGG: BF2956 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 270 1 270 270 483 100.0 1e-135 MAQLSSVIGSILRDIVSAQHEANLYSLSLGDSYGKDGKAKDFQLPNVMVSDMELDLKYGV KSASESQQQFNIKYDKFRQFLKELCEQVARVAISSAVTTVMTSDIERNEGEKHFFERLKK ENKLHQEFCTFLSRNMRNSFRNNLYDAVDSSNGSVNNDVVISRLTDVVRKKFLYDTDLDD LFAGEDGEKLRDTAEKNIIKAMEAIVKKLSVDANFKSLHSFPQLDVAITADELMNMPEEA IHSFKIKFSPRNYSVSQTDDDSLLEDFVMR >gi|226332027|gb|ACIB01000029.1| GENE 23 22326 - 23198 933 290 aa, chain - ## HITS:1 COG:no KEGG:BF2957 NR:ns ## KEGG: BF2957 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 290 1 290 290 501 100.0 1e-141 MAVDKTPSQVATSALQAIPFGSIIGGPLKACVEAQALAAKTTWEFIQNVGINVNPETGEK TAVNVSFSFIQGGRLVQLNIPLLTIVPIPYIAIREVDINFKANISASSSTTAETNESVSK DAALKASASMRFGCFKMNADMNASYSSKKDSKATSDSKYSVEYTMDVAVKAGQDSMPAGL SKILELLGNSLDVSDPAGTLEVNSNLLLIENGKETVRLIATYKDGSGLLAPDKLTITGTG ANAADFKVSGDSKIIDLPEGTYTIKAEGSKKEVIVEVKKDTATPQEEVTE >gi|226332027|gb|ACIB01000029.1| GENE 24 23332 - 23733 436 133 aa, chain - ## 
HITS:1 COG:no KEGG:BF2958 NR:ns ## KEGG: BF2958 # Name: not_defined # Def: putative two-component system response regulator # Organism: B.fragilis # Pathway: not_defined # 1 133 1 133 133 275 100.0 4e-73 MKENSIKPYCYCGESESSLVDNAIFVYFGDEYRRVLLDEILWLEASGSYCVLCMENGAEI TVSYPLDRIFNNDLPRGKFQRIHRSYAINVFKVTGFAGNYVHIGKKMLPVSESHKKNFLA CFHKIYSKRALGK >gi|226332027|gb|ACIB01000029.1| GENE 25 23774 - 24118 125 114 aa, chain - ## HITS:1 COG:no KEGG:BF2959 NR:ns ## KEGG: BF2959 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 114 1 114 114 231 100.0 8e-60 MGKNRNHYSFFLYFVFIIFVLLGCRPVRVQMPDGMYGRWKSSAGRPDITLGTDSIGSFAI VHHRIYDGKICPVRYPLHLNSPTKGYIRAEGCILFYYDSLKCVLYFSPGGDYTQ >gi|226332027|gb|ACIB01000029.1| GENE 26 24349 - 25242 681 297 aa, chain - ## HITS:1 COG:no KEGG:BF2836 NR:ns ## KEGG: BF2836 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 297 1 297 297 488 99.0 1e-136 MQTENFTLGKKELLILLGLTLVLPFPINWALIELNPLTLMAGADDVPTWIGFGGSYIGGI IGGVISIMILNKTLQQTGVLHYDLKLLQLDTIIYTQQQDWFTGFKQELAENLKSIDLYVL NMVVSSISMKNYAYAKEVLTEINKGLEYQMVAASFSFTSTHLSSEEKKYMNLVRRVQTEY SSFIKDILYYIPLAETISSRKKLNYDQLMDYTLSQYDYLQGQEGESTASGAIIPRVMEID PSDDIRKELERIMEERLTGRTYLYHLKSDLAEATHQLIIYEENRINSILHKPEREMN >gi|226332027|gb|ACIB01000029.1| GENE 27 25247 - 27676 1364 809 aa, chain - ## HITS:1 COG:BS_srfAC_1 KEGG:ns NR:ns ## COG: BS_srfAC_1 COG1020 # Protein_GI_number: 16077420 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Non-ribosomal peptide synthetase modules and related proteins # Organism: Bacillus subtilis # 18 562 479 1028 1050 252 32.0 2e-66 MKTNIQENLSRSLSTWGDKPAVVAKDGTVSYQMLERYSANIAQSIRQRLSVTDPSGLRNI CIGVCMERNKILVPAILAIFRLGATYLPIDPSLPDNRKRYMAENADMVLLLTDSSNEVGG IPSVRQLYLNGEQLSEPVVGDYTEVLPNDCAYIIYTSGTTGNPKGVRISYRNLDTFTRNL VDKKLYHLSDPANRYLAFASISFDASILELMMCIPVGGTLILAGEDERRDISLLDELIRR EKVNIAFFPPSLLGMFADLDFPSFKTLLFGAEAIGEKLFNRLKQQPYRLMNVYGPTENTV 
LSTIRIVGKDTSYDDIGYPLKGTVCYVLSENLQQITLGATGELCLGGPQVSLGYIGSVQL NEKSFISYDGERLYRTGDLVQQQPDGSIRFIGRKDTQVKIRGFRIELTEIAERLNRDPDV ERAHVVVVERNGRQLLGAYLQPSVSGNFHPEEVKERLRAELPYYMIPNLWQVVDHFQRTI NDKIDVRALPAFTSLELKYVPPVHPGEVILCGILKEMLGLPQVSVEADLIDDLGLTSLDM LRLVTEANKKGCPINVSAVYTARNLRRLLLVPRQEPIYWYRQQNLKKPVIILVCGSAYFN HLYFNLADRLYPKYDFLVVDAIYDHFNKVATDIPELLEYYVRTVQPMIRERTVYALTGFC MGAELAVGLAEMLHRSMGIMPKVFALDGQAWQNPALCQNYPLLVFPGDTDEMVRERNEII NIYFRTTPNLIYQGEVIVLLSGLFHQQAGLSPEESWTEENYRIFRTEYDNCERLWNKYYP NAPVLRLPADHWHFLEGESLEQLITVFLK >gi|226332027|gb|ACIB01000029.1| GENE 28 27689 - 30055 1288 788 aa, chain - ## HITS:1 COG:RSp0811 KEGG:ns NR:ns ## COG: RSp0811 COG1629 # Protein_GI_number: 17549032 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Ralstonia solanacearum # 115 593 29 468 652 76 23.0 2e-13 MKRCLLYILCCFGVVQLSAQPADNIRNSFVKAETFYESGDIDEACRILEENLSSFQGTMH TEACRLAALCCLALDRFPEAEKYVSLLLKDEPYYYISLQDPERFADMVRKHRETKVTLVT ASQQVETPEEAPVPVTLITEEMIRVIHARCLRDVLIAYVPGISGLSSNEEMNLAMRGVYS PEQENILIMQDGQRLNSYITNAVSPDYGISLAKVKQIEVLRGPASSLYGSVALTAVINIV TKDGVDVRNGSVSVSAGNRGQLAADLLLGKHDMNMDFMAWFSLYRATGESVFVPAEKQYA LYPRDGFIRLDNYSGFPAMDGGIKLQRGNLLFSFSMNYAKKRQPYSMWLFSSPYSYERFR TFDGSGPGYSRWSAREQAVYSRTWQRITFNTAFYTDWNKNVHYETSGDTLQDYPIFPNYD YQPIIYPTRGTFQYIRWMDFNVGFNGRVNYAYDWGKLGKGNMLGGVEWNRYTLYDSEYLE GMNFKEIVRTWIEKRLYTGHEMNTDAFLQIKHNLHQNWIVNAGIRYDYKRRSNKRTLQAF SPRLSLIYLRNGLNIKASYSRAFVDAPYYYRNNEMDTYSGGENLQAEYLSSYQVTCAYHH SPSHIDVECNLFYNRASHFLFTQPETRVYENAGSLNMGGVEVVARYKADRLSLDGNLCFQ KVLNYTNFFVTDGSVNNVPGFSMNLVANYFLLKKKVQSWSAHLKLNCSSHCYTQVSVLED GLNGDATNYTIRLPGYAVFSFTTRYKYKRVEGSLGIENLFNNRYECGGATVPIRQKGRWI SASILYNF >gi|226332027|gb|ACIB01000029.1| GENE 29 30052 - 30888 845 278 aa, chain - ## HITS:1 COG:all2282_4 KEGG:ns NR:ns ## COG: all2282_4 COG0642 # Protein_GI_number: 17229774 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. 
PCC 7120 # 5 277 12 288 301 207 39.0 1e-53 MEKNISQEKRIEELEARLKEQEKLGSLGLLSAGIAHEIQNPLNFVINFSKLSGQLLADLA DRVDEAKARIPEAELEDIKDILSGLRENVDKIKEHGDRAISIIRGILLYSRGKEDEFMPT QIQKLTKEYVWLAYHAMRANYKNFNIAIREDYAKDIPSIRLIPQDFSRAVLNVMNNACYA VWKKWQNAPEGYSPEIMVTLEKLDKRVVLSIRDNGEGMSPEVKRKLFDSFFTTKPVGQGT GLGMAIVRDIIENKHHGKVLFDSVLHEYTCFRFIIPIA >gi|226332027|gb|ACIB01000029.1| GENE 30 30911 - 33748 1723 945 aa, chain - ## HITS:1 COG:all2282_4 KEGG:ns NR:ns ## COG: all2282_4 COG0642 # Protein_GI_number: 17229774 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. PCC 7120 # 667 939 12 288 301 139 30.0 3e-32 MKKIVFLIMCLCSVYANSQEGIPFFVNYPASVYQAHNRNFDVVCDSCGNVYFANFEGILH YDYNRWETIYTPGFSRVTRLFRDSEGRIWVGGYNVFGRIERDGRGCITLRTLLSDLDTNS LGELEDMAEIDKRIYLKATSGRYYTVQSDSILAPVQVLPAQLQEKWRNQSSVSMNRAFSL PGGETISINSAHGLIMDDSGKKERFSVTERNGLCSNAVSGIAADGRGNLWGATDNGVFHV FIPSLFSRYTSGEGLKGEVISAVSYKGMIYTGTLQGLYVLKQNTFVPVQGISQACWQLCL SPQGELYAASGDGVYVIRDYNHSEKLTDMAAYSLTFIGHTNLLMGTMDGIYQYSADEERI KKISDVEKVVRLEVKKDRSVWAKTLYGEIYLREEGKSSFVLQDRESEEIMTEYTDNDGCH WQTNLKGKEVQVHHSQIDTEKFNQCLYAIRNYVVRVIYIEEDRAAWFGGDFGLIRMDLEK ARTFTPVVPRIYLREVCLNRDSVYWGGDLPEESGGADWQINSTVPRLGNDVRSIRFSFAT DAPCFTGSNEYRYRLVGYDPEWSSWDSGTVKEYANLSSGTYTFCVRARDIYGTESEMKQF RFSLLPPFYLQWYCLILYTVAFGMLLFVLFKWRMRSLLKEKERLEALVGQRTKQLVQQKN EIEEKSLKLEKALKELGQAQDELVRQEKMATVGKLTQGLIDRILNPLNYINNFSHLTSGL LKDLYQNLESVKELLDEDTYLDSVDVINMMRDNLEKIEEHGSNTTRVLKAMEEILRDRNR QLEKTELIGLCRKDMELLSSYYQKEITAMHIAVRTSLPDNPLFIDGNAEQLGKTIMSLLN NGMYAIAKKYGKKAYPAEIGLALESKDGQAVIRLYDNGVGIEQSILDKIFDPFFTTKTTG EAAGIGLYLSKEIILNHHGQIAVRSEKDELTEFTITLPLWEEKSV >gi|226332027|gb|ACIB01000029.1| GENE 31 33745 - 35346 658 533 aa, chain - ## HITS:1 COG:no KEGG:BF2965 NR:ns ## KEGG: BF2965 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 533 1 533 533 1094 99.0 0 MIKLNENIIRWIIAVPCILLTVCLVLYRAYRHSASDRERLALSLEEFRDKEKKSVLIQRV SQQMEEIAYQQKDISEKRRQEAVAQTQKANRMQQQAELARERALAAQLEAEKAYRMADEQ 
KNLAMERQLQAEQSKRVADTLAFLALGRSLGTLSVTQYRSGNMELASLLAYTAWWFTKRY DGDVYHPAVFNALSLSCRQDQLWFGHKGAVTGIKLLSGGEEKARAGRNPLPGLVTVGKYG EMIRWKSENGQTYTSELLLNDPCYDFRDVLLRSDSQMYALSYCGQLISTDSAGIVTRFLP KGEYRQLISIASNLVLAVSTTGIYSFDCINHEIQPYFQPEHSITCVGQMEGYLYVFFKNG TVALLSSDGTFLRFIDSLPENSVVTAFCPLTNERYAIGCREGVILICDKNGNIRQKLIGH QAAVTAISRKGNKLFSSSYDCTLRLWDLRKGKTESAVIVTSPGWIHTFCFHPDGTSVFIG DEQGRLYRVPVSPDHMATVVRQGLQRDFTREEWEYYIGKQIPFESYYLKTYQP >gi|226332027|gb|ACIB01000029.1| GENE 32 35343 - 35771 451 142 aa, chain - ## HITS:1 COG:slr1861 KEGG:ns NR:ns ## COG: slr1861 COG2172 # Protein_GI_number: 16330247 # Func_class: T Signal transduction mechanisms # Function: Anti-sigma regulatory factor (Ser/Thr protein kinase) # Organism: Synechocystis # 1 125 1 129 143 61 31.0 6e-10 MTLLQFRPVDGRLDEILSMLMHSREVASHPSLSYAIRLVSEEIIVNILNYAYPQQAEGYL TLCLWDEDGEITLEFIDGGIPFNPLDKADPDISLPLEQREIGGLGIFLVREMMDDVAYTY VNKENRLTIKKKYLQPTDEPVS >gi|226332027|gb|ACIB01000029.1| GENE 33 35786 - 36091 403 101 aa, chain - ## HITS:1 COG:no KEGG:BF2843 NR:ns ## KEGG: BF2843 # Name: not_defined # Def: putative anti-anti sigma factor # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 101 1 101 101 193 100.0 2e-48 MEKYEIHFVGSVLDSNTSGDEQAKIVALIEQGHSVALDLSGCSYVSSAGLRVMLYAFKLA KAKSRDVCLVGVSQEVKDVMHMTGFDKFFRFYQTLDELSQP >gi|226332027|gb|ACIB01000029.1| GENE 34 36110 - 37279 910 389 aa, chain - ## HITS:1 COG:FN1091 KEGG:ns NR:ns ## COG: FN1091 COG2208 # Protein_GI_number: 19704426 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Serine phosphatase RsbU, regulator of sigma subunit # Organism: Fusobacterium nucleatum # 141 389 204 447 447 151 35.0 2e-36 MAVKILSVDDELDLEVLLTQYFRRQIRKGEYEFAFAHNGLEALQKLLETPDFDIILSDIN MPEMDGLTLLAKVNELKNPAMKCIMVSAYGDMDNIRSAMNKGAFDFATKPIDLDDLSRTI EKAIEQVRYIRESQQEHNQLESIKNDLAIAGEIQQTILPRSFPPFPELTEVVDIYASMTP AKDVGGDFYDFFQIDDERIGLVIADVSGKGVPASLFMAVSRTLLRATALRGVSSAECLTY ANKLLCKESLDSMFVTVFYGIYHYKTGMMDYTNAGHNPPYLLRGGRTVECLPVASNFVVG 
VFDDIEFESNTLTFGIGDTLLLYTDGVTEAFNDKREQFSESNLQDILASMHESSSAKEVV TSVLQSVKTFSGDYPQSDDITLLSLQRIK >gi|226332027|gb|ACIB01000029.1| GENE 35 37627 - 38250 305 207 aa, chain + ## HITS:1 COG:no KEGG:BF2845 NR:ns ## KEGG: BF2845 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 207 1 207 207 401 100.0 1e-111 MLIILKQDDSCQVAQTTCTTNSLEIRYKDYSVLKKTLDLNTKSINSNSCIELIIYGTHGN EKEICKLTKPDIKDLSYFFKGKVCRLVLDACHSACHIPHFKEMLTDDGIILCHVGICQVN SIETKTGKDTYERWINGNKEGLKVWSDGIVPAIYTRKNNTLTYYECSKFEVDINHPFTGT EFKTYCLNLKSKGIKMVIKNHSDFDKI >gi|226332027|gb|ACIB01000029.1| GENE 36 38286 - 38621 363 111 aa, chain + ## HITS:1 COG:no KEGG:BF2970 NR:ns ## KEGG: BF2970 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 111 1 111 111 210 99.0 1e-53 MWELEKAGGKFDNFNSKQQKAIYELIDGLTEIPTKPNKDQYRVDNQSQYTKIVGRTYYNG EYALYAIQKGKTMYASIACPAIGPDISIDNIKEGFKLSLKEGKRAVWTKVK >gi|226332027|gb|ACIB01000029.1| GENE 37 38636 - 39136 403 166 aa, chain + ## HITS:1 COG:no KEGG:BF2847 NR:ns ## KEGG: BF2847 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 166 1 166 166 324 98.0 6e-88 MLIQSVAFKDVFLRMDGKGISTACGAGAGKVNCQRSMAPTGAFKVQKQTNGTFTIESSKY PGVFLRMDGNGIHAFAGSGSGRVNCQFGASAWERFKLHEQSDGSYTIESAAFPNVFLRMD GNNPSRKEEDFGTVNCQYGAGAYEKFYLQNMPEIQNVKDMFNKITK >gi|226332027|gb|ACIB01000029.1| GENE 38 39227 - 41131 1844 634 aa, chain - ## HITS:1 COG:BH3718 KEGG:ns NR:ns ## COG: BH3718 COG1086 # Protein_GI_number: 15616280 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Bacillus halodurans # 22 577 16 555 608 359 37.0 8e-99 MKQNFFHRYLSAKVLPIWTILLIDIFIIVASCLLAYSLRYDFRSIFLDSSTIDKTILWTV VANLIFFRVFRTYSNVLRFSSFVDIMRIFVSLTVSYGVLMILSLLLDAYLGIRIGAISVL FMAYVINFAMMACSRIVVKMFFEVLNFDGSHTTNVFIYGAKEAGVNIAKSLRVNLRNHYR LRGFIADEPELIGKVMMGAKVFPNDEALIENMNDRDVHTIIVSPAKMEKLKKSDMIDTLL 
SNNVKLLTAPPLSEWGGQALNKTQLKEIQIEDLLQREPIEVDIHKIASHLEGKRVMITGA AGSIGSEIMRQVASFNPYKLILIDQAETPLHDIRLELQDRWRDIDAETIVADISNATRME AIFREYKPQYIFHAAAYKHVPMMEDNVSESIQINVSGTRTLADLAVKFGSEKFVMISTDK AVNPTNVMGCSKRICEIYVQSLAKKLQKEGTRSVQFITTRFGNVLGSNGSVIPRFRDQIQ RGGPVTVTHPEIIRYFMTIPEACRLVLEAGSMGNGGEIYIFDMGKPVKIVDLAKRMISLS GRTDVKIEFTGLRHGEKLYEELLNVKELTKPTYHEKIMIATVREYDYDEVKERIQKLIDV SYTYDQMKIVAAMKDIVPEFVSKNSCFEALDKKD >gi|226332027|gb|ACIB01000029.1| GENE 39 41293 - 42618 1512 441 aa, chain - ## HITS:1 COG:BH2629 KEGG:ns NR:ns ## COG: BH2629 COG1875 # Protein_GI_number: 15615192 # Func_class: T Signal transduction mechanisms # Function: Predicted ATPase related to phosphate starvation-inducible protein PhoH # Organism: Bacillus halodurans # 4 441 2 442 442 290 40.0 5e-78 MGTKKNFVLDTNVILHDYNCLKNFQENDIYLPIVVLEELDKFKKGNEQINYNAREFVREL DLITDDSLFTHGAPLGEGLGKLFIVTGDSEAPKVHESFPARKPDHQILAVAEYLARKYPK MKNILVTKDVNLRMKARSIGILCEDYITDKVVNVDIFEKSNEVFENIEPDLIDRIYSSRE GLDISEFDFKDIIRPNECFILKSDRSSVLARYNPFTHTVCRVNKTKNYGIEPRNAEQSFA FEILNDPDIKLVALTGKAGTGKTLLALAAALGKLTEYKQILLARPIVALSNKDLGFLPGD ATEKVAPYMQPLFDNLNVIKRQFASNSTEVKRLEDMQKSEQLVIEALAFIRGRSLSETYC IIDEAQNLTPHEIKTIITRAGEGTKMVFTGDIQQIDQPYLDSQSNGLVYMIDRMKDQNLF AHVNLVKGERSELSELASNLM >gi|226332027|gb|ACIB01000029.1| GENE 40 42748 - 44124 1069 458 aa, chain + ## HITS:1 COG:CAC2398 KEGG:ns NR:ns ## COG: CAC2398 COG0285 # Protein_GI_number: 15895664 # Func_class: H Coenzyme transport and metabolism # Function: Folylpolyglutamate synthase # Organism: Clostridium acetobutylicum # 1 449 1 426 431 236 34.0 7e-62 MNYQETLDYLYNSVPMFQQVGSSAYKEGLENTYALDEYLGHPHTAFQSIHIAGTNGKGSC SHTLAAILQSAGYRVGLYTSPHLVDFRERIRINGEPIPQEYIIRFVEDHRLFFEPLHPSF FELTTSMAFRYFADQHIDVAVIEVGLGGRLDCTNIIRPDLSIITNISFDHMQFLGNTLAK IATEKAGIIKRGIPVVVGETTEETKPVFYRKAQEMEAPVTFAEEEQRLKGATKFKTRADR KPGQYTDHQPNTAAEKDTEYITGWIYENDAYPGLEGVLGGSYQLKNTNTLLSALPVLKSL GYKIEDHDVRNGFLQVDKMTGLQGRWQKLSDSPTVICDTGHNVAGISYIVEQLKQMKYNR LHMVIGMVNDKDVSGVLSILPENAVYYFTKASVKRALPEAQLQQIGASAGLQGKAYPDVQ 
SAVKAAQEKSLPEDLIFVGGSSFIVADLLSCRDALDLD >gi|226332027|gb|ACIB01000029.1| GENE 41 44092 - 44466 503 124 aa, chain - ## HITS:1 COG:PH0854 KEGG:ns NR:ns ## COG: PH0854 COG0251 # Protein_GI_number: 14590714 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation initiation inhibitor, yjgF family # Organism: Pyrococcus horikoshii # 1 124 12 136 137 132 56.0 1e-31 MKKVICSEKAPGAIGPYSQAIEANGMVFVSGQLPIDAATGVMPDGVEAQARQSLENIKHI LEAAGLTMADIVKTTVFLQDMSLFAGMNGVYATYFDGAFPARSAVAVKALPKDALVEIEC IAAR >gi|226332027|gb|ACIB01000029.1| GENE 42 44764 - 46470 1877 568 aa, chain - ## HITS:1 COG:FN1787 KEGG:ns NR:ns ## COG: FN1787 COG0457 # Protein_GI_number: 19705092 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 248 564 41 337 628 68 26.0 2e-11 MKKSILFTFVLCLLSQWSVAQAPKWVEKAKRSVFSIVTYDKDDKIQNTGNGFFVTEDGVA LSDYSLFKGAQRAVIINSEGEKMPVECILGANDMYDIIKFRVGITVKKVPALQVAALAPA VGAEVYLLPYSTQKGGNVTRGKVKKVDNIGGDKYHYYTLDMVLKDKMVSCPVTTADGKVF GVAQKSSGQDTASISYAAGAAFAMSQNISALALSDPALNAIGIKKGLPEDEDQALVYLFI ASTQSTPEAYAIALDDFIKTFPNSADGYLRRAGNYVFADKDENHMDKAAADLEHALKVAQ KKDDTYYNIAKLIYNYQLSKPETVYKDWTYDKALENVRSAIAIQSLPVYQQLEGDILFAK QDYAGAFASYDKVNQTELASPASFFSAAKAKELSKAAPEEVIALLDSCIARCQTPITSDL APYLLERAQMYMNVEKYRLALADYDAYFNAVKGSVNDLFYYYREQAAFKAKQFQRALDDI AKAIELNPEDLTYRAEQAVVNLRVGRYEEAEKVLKDALAIDPKYAEGYRLLGICQIQLKQ EKAACASFAKAKELGDPNVDELIKKHCK >gi|226332027|gb|ACIB01000029.1| GENE 43 46491 - 47231 385 246 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764761|ref|ZP_02171815.1| ribosomal protein S11 [Bacillus selenitireducens MLS10] # 6 244 7 246 255 152 34 4e-36 MIDKSEMIFGVRAVIEAIQAGKEIDKILVKKDIQSDLSKELFTALKGTLIPVQRVPVERI NRITRKNHQGVVAFISSVTYQKTEDLVPFLFEEGKNPFFVMLDGITDVRNFGAIARTCEC AGVDAVIIPAKGSVTVNADAMKTSAGALHTLPVCREQNLKTTLQYLKDSGFRIVAATEKG DYDYTKADYTGPMCIIMGAEDTGVSYDNLALCDEWVKIPMLGSIESLNVSVAAGILIYEG VKQRTN >gi|226332027|gb|ACIB01000029.1| GENE 44 47292 - 48977 1806 561 aa, chain - ## HITS:1 COG:PA4763 KEGG:ns 
NR:ns ## COG: PA4763 COG0497 # Protein_GI_number: 15599957 # Func_class: L Replication, recombination and repair # Function: ATPase involved in DNA repair # Organism: Pseudomonas aeruginosa # 1 550 1 552 558 321 37.0 3e-87 MLRSLYIQNYALIEKLDIRFETGFSVITGETGAGKSIILGAIGLLLGQRADVKAIRQGAS KCVIEARFDISAYHMEAFFEENELEYEPECILRREVQSSGKSRAFINDTPASLTQMKELG EQLIDVHSQHQNLLLNKEGFQLNVLDILSHNEEALDVYHHLYQDWKKLCKELDELIVLAE QSKTDEDYIRFQLEQLEEAHLTAGEQEELEQEAETLAHAEDIKAGLYRVGQTLASDEGGL LPVLKESLGALSGLQKVYQPAGELAERMESTYIELKDIAQEISDQGEGIEFNPLRLEEVN DRLNLIYSLEQKHRVQTVEELIALSEQYATRLAAITSYDDRIVKLTERCDAQYNKVKKQA AVLTKARTEIAREVEQQMAARLIPLGIPNVRFQVEMGLKKEPGPQGADTVNFLFSANKNG TLQSVSSVASGGEIARVMLSIKAMIAGAVKLPTIVFDEIDTGVSGEIADRMADIMQEMGE QNRQVISITHLPQIAARGRAHYKVYKRDNDTETNSHIRRLTDEERVEELAHMLSGATLTE AALSNARALLEPSLKERKIIK >gi|226332027|gb|ACIB01000029.1| GENE 45 48981 - 50174 1033 397 aa, chain - ## HITS:1 COG:BH2510 KEGG:ns NR:ns ## COG: BH2510 COG0452 # Protein_GI_number: 15615073 # Func_class: H Coenzyme transport and metabolism # Function: Phosphopantothenoylcysteine synthetase/decarboxylase # Organism: Bacillus halodurans # 1 396 1 395 404 324 43.0 1e-88 MLNGKKIILGITGSIAAYKACYIIRGLIKQGAEVQVVITPAGKEFITPITLSALTGKPVI SEFFAQRDGTWNSHVDLGLWADAMLIAPATASTIGKMANGIADNMLITTYLSAKAPVFVA PAMDLDMFAHPSTQKNLDTLRSYGNHIIEPASGELASHLVGKGRMEEPENIIRVLDEFFS STGELAGKKVLITAGPTYEKIDPVRFIGNYSSGKMGFALAEECARRGADVVLIAGPVQQK TYHSHITRIDVESAQDMYEAAMAQYPLVDAGILCAAVADFTPDAVADKKIKREGDELLLH LKPTHDIAAALGKIKTPGQKLIGFALETNDEQRNAEGKLIRKNFDFIVLNSLNDAGAGFR YDTNKISILSCRGRTDYPLKSKTEVARDIIDRMIKEM >gi|226332027|gb|ACIB01000029.1| GENE 46 50177 - 50950 867 257 aa, chain - ## HITS:1 COG:CT261 KEGG:ns NR:ns ## COG: CT261 COG0847 # Protein_GI_number: 15604982 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, epsilon subunit and related 3'-5' exonucleases # Organism: Chlamydia trachomatis # 9 240 4 214 232 93 32.0 5e-19 MKLNLKNPIVFFDLETTGTNINSDRIVEICYLKVYPNGNEESKTLRINPEMPIPAESSAV 
HGIYDADVADCPTFKEVAKSIANDIEGCDLAGFNSNRFDIPVLAEEFLRAGVDIDMSKRK FVDVQVIFHKMEQRTLTAAYKFYCGRNLEDAHTAEADTRATYEVLMAQLDRYPEELQNDM SFLADYSSYNKNVDFAGRMVYDDNGVEVFNFGKYKGQSVSEVLKKDPGYYSWILNSDFTL NTKAMLTKIRLRELTGK >gi|226332027|gb|ACIB01000029.1| GENE 47 51081 - 52205 1163 374 aa, chain - ## HITS:1 COG:BMEI1942 KEGG:ns NR:ns ## COG: BMEI1942 COG0592 # Protein_GI_number: 17988225 # Func_class: L Replication, recombination and repair # Function: DNA polymerase sliding clamp subunit (PCNA homolog) # Organism: Brucella melitensis # 1 370 26 395 397 162 30.0 1e-39 MKFIVSSTALFSHLQAVSRVINSKNALPILDCFLFQLEDGTLSVTVSDSETTMVTSVEVN ESDSNGKFAVAAKTLLDALKEIPEQPLTFDIKPDTYEITVQYQNGKYSLMGQNADEFPQS ATLGDNAVRVEMEAQILLGGINRSVFATADDELRPVMNGIYFDITTEDITMVASDGHKLV RCKTLAARGNERAAFILPKKPATLLKNLLPKESGMVVIEFDERNAVFTLESYRMVCRLIE GRYPNYNSVIPQNNPHKVTVDRMQLMGALRRVSIFSSQASSLIKLRMQENQIVVSAQDID FSTSAEETQVCQYAGAAMSIGFKSTFLIDILNNISADEVVIELADPSRAGVIIPVEQEEN EDLLMLLMPMMLND >gi|226332027|gb|ACIB01000029.1| GENE 48 52358 - 52732 434 124 aa, chain + ## HITS:1 COG:no KEGG:BF2982 NR:ns ## KEGG: BF2982 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 124 1 124 124 207 100.0 8e-53 MEFTGKIIAILQPRGGVSKTSGNEWKAQEYVIENHDQYPKKMCFDIFGADKIEQFNIQMG EELTVSFDIDARQWQDRWFNSIRAWKVERVGAGAPMAPGAPVPPPAPSSAPEFIAGDAKD DLPF >gi|226332027|gb|ACIB01000029.1| GENE 49 52810 - 54447 1262 545 aa, chain - ## HITS:1 COG:no KEGG:BF2859 NR:ns ## KEGG: BF2859 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 545 1 545 545 1109 100.0 0 MKLRTIVKIAVISSVVLLCTGFAMFSFFRLSAAEGRKDFNLYTLVPGSATVVLETDDLAG MIQGINELSCSKDRHFLYVSKLFSYLKLHLYTLLEDTPHGLSKQMNKVLLSFHEPDNDRN QVLYCSLGNGDYELVEKFIRKYCSSSFPSKLFDYKGEEIRIYPMPDDSFLACYFTSDFLV VSYQKKLIEQVIDARLSKKSLLTDASFAKVHEDKRARVAATIYARMQPLSMGKATDGIRS CTQLGGWTEFDMKMNGDAIYFSGVSHDTDTCLTFMNVLRQQQPVEDFPGDILPASTFFFN KRSVTDMQAMLDFTARQEYTTSTYSDYIRDRDGELLAYLKENAGGEIVTCLFHSTDTLSN 
PCAVMSIPLRDGQQAERVLQGMLRTAPKEVDGPPKPRTTFCKTPLRAYTLYVLPRNTLFT QLTGITESALYIYACFYEGRLVLAPDVESLTAYLRHLDKKEILDDTPGYEEAVVNLSPSY NFMMVADLGETFSQPENYVRLIPAFFFRNQEFFRHFILSAQFTCTDGIVYPNVVLIYKGE SDDVS >gi|226332027|gb|ACIB01000029.1| GENE 50 54540 - 55298 507 252 aa, chain - ## HITS:1 COG:BB0533 KEGG:ns NR:ns ## COG: BB0533 COG1235 # Protein_GI_number: 15594878 # Func_class: R General function prediction only # Function: Metal-dependent hydrolases of the beta-lactamase superfamily I # Organism: Borrelia burgdorferi # 6 250 6 251 253 182 35.0 4e-46 MKIRILGSGTSTGVPEIGCTCAVCTSKDPRDCRLRTSALVYTDDATILLDCGPDFREQML RVPFGKIDAVLISHEHYDHVGGLDDLRPFCRFGEVPIYAETYTAERLRSRMPYCFVEHSY PGVPNIPLREIEPNRPFLVNHTEVLPLRVMHGKLPILGYRIGKLGYITDMLTMPDESFEQ LQGIEVLVMNALRIAPHNTHQSLSEALEAVKRIGAKETWLIHMSHHIGLQADVEKQLPPH VHFAFDGLELEC >gi|226332027|gb|ACIB01000029.1| GENE 51 55359 - 56357 936 332 aa, chain - ## HITS:1 COG:PA2977 KEGG:ns NR:ns ## COG: PA2977 COG0812 # Protein_GI_number: 15598173 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramate dehydrogenase # Organism: Pseudomonas aeruginosa # 1 331 5 338 339 254 45.0 2e-67 MEQKYSLLSHNTFGIDVSAACFLEYASVDELRGLIGSGRVTSPYLHIGGGSNLLFTKDYE GTILHSRIGGVEIVAETDDDIVVRVGAGVVWDDFVDYCVQRHWYGVENLSLIPGEVGASA VQNIGAYGVEVKDLIVRVETLNIEGKEHVYDVTECGYSYRDSIFKRPENKSVFVTYVSFR LSKREHYTLDYGTIRRELEKYPGVTLDVVRRVIIAIREEKLPDPRVMGNAGSFFMNPIVG REQFEALQAEYPQMPFYEIDTDRVKIPAGWMIDQCGWKGKALGPAAVHDKQALVLVNRGG AKGADVIALSDAVRASVRAKFGIDIHPEVNFI >gi|226332027|gb|ACIB01000029.1| GENE 52 56377 - 57219 751 280 aa, chain - ## HITS:1 COG:no KEGG:BF2986 NR:ns ## KEGG: BF2986 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 280 5 284 284 542 100.0 1e-153 MAGVTVLVLFASCGNSNKTDADPFASITHLVDSAMVNKTDSIDREKTSDEPKPIEADESF DDFIYNFASDDALQRQRVVFPLPYYNGERASKIDRKYWKHDDLFAKQSYYTLLFDREEDM DLVGDTSLTSVQVEWIFVKKRMVKKYYFERIKGAWMLEAINLRPIEENENEDFVEFFGHF ATDSIFQSRRIRQPLVFVTTDPDDDFSILETTLDLNQWFAFKPALPADKLSNINYGQQND 
DNASHKILALKGIGNGFSNILYFQRKDSGWELYKFEDTSI >gi|226332027|gb|ACIB01000029.1| GENE 53 57327 - 58280 1029 317 aa, chain - ## HITS:1 COG:SA0511 KEGG:ns NR:ns ## COG: SA0511 COG0451 # Protein_GI_number: 15926231 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Staphylococcus aureus N315 # 1 315 1 314 321 345 53.0 6e-95 MKNILVIGATGQIGSELTMELRKRYGNTHVVAGYIHGAEPKGELKESGPSAVVDVTDQDM IASVVREYNIDTIYNLAALLSVVAESKPKLAWKIGIDGLWNVLEVARENKCAVFTPSSIG SFGESTPHVQTPQDTIQRPRTMYGISKVTTELLSDYYFNKYGVDTRAVRFPGIISNVTPP GGGTTDYAVDIYYSAVKGEKFICPIPEGTLMDMMYMPDALNAAISLMEADPTKLVHRNAF NIASMSFAPETIYAAIKKHVPDFEMEYKVDPLKQRIANSWPDSMDDSCAREEWGWKPAYD LESMTVDMLEKLRAKLK >gi|226332027|gb|ACIB01000029.1| GENE 54 58417 - 59604 1392 395 aa, chain + ## HITS:1 COG:YPO0059 KEGG:ns NR:ns ## COG: YPO0059 COG0156 # Protein_GI_number: 16120412 # Func_class: H Coenzyme transport and metabolism # Function: 7-keto-8-aminopelargonate synthetase and related enzymes # Organism: Yersinia pestis # 2 394 11 402 403 479 59.0 1e-135 MYGKMKEHLSNTIAEIKEAGLYKEERLIESAQQAAITVKGKEVLNFCANNYLGLSNHPRL IEGAKKMMDRRGYGMSSVRFICGTQDIHKELEAAISDYFKTEDTILYAACFDANGGVFEP LFTDEDAIISDSLNHASIIDGVRLCKAKRYRYANADMADLERCLQEAQAQRFRIIVTDGV FSMDGNVAPMDKICDLAEKYDALVMVDESHSAGVVGATGHGVSEQYNTYGRVDIYTGTLG KAFGGALGGFTTGRKEIIDLLRQRSRPYLFSNSLAPGIIGASLEVFKMLKESNEIHDKLV DNVNYFRDKMTAAGFDIKPTQSAICAVMLYDAKLSQIYAARMQEEGIYVTGFYYPVVPKD QARIRVQISAGHEKEHLDKCIAAFIKVGKELGVLK >gi|226332027|gb|ACIB01000029.1| GENE 55 59654 - 60058 413 134 aa, chain + ## HITS:1 COG:no KEGG:BF2989 NR:ns ## KEGG: BF2989 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 195 100.0 4e-49 MEELTLTTPALLFSAVSLILLAYTNRFLSYAQLVRILRDRYMEDPSDINVAQIENLRKRL NLTRMMQVFGIASLFFCVVTMFLIYIGLFLLSIYIFGLALLLLIASLGVSLREIQISTRA LDIYLSTMEGKLKH >gi|226332027|gb|ACIB01000029.1| GENE 56 60198 - 61109 845 303 aa, chain - ## HITS:1 COG:no KEGG:BF2991 NR:ns 
## KEGG: BF2991 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 303 1 303 303 612 98.0 1e-174 MNNNDPMKRFGYIVFSICLFALSACTPHEQMDQEEGIVKVSMGLTAASFTDDDATTRAEQ PMAPDYENLISNLWILQFDREGILTGSEHKVLPTPVLNTTLEGIALRTGRGTVCVVGNLA DGEIAAWPDNLSGFKSLVVDMGWLKERNTDRNVCLFGYYEGEIAAGTTAVNVVLGRLVCR LNIAVSAKTAGIFSNVRIQLQNAQTKGYLFPSDVYLSPEGGGNYTEEVVIGDDKVLGTAP LYRYYYMAENVTEGTDSGERTRLQIKAKKGGAEYTKAIDLGRSDIHDYSLRRNNNYTFNI VLE >gi|226332027|gb|ACIB01000029.1| GENE 57 61227 - 62273 884 348 aa, chain - ## HITS:1 COG:no KEGG:BF2867 NR:ns ## KEGG: BF2867 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 348 1 348 348 674 99.0 0 MKRMIVYKSCSYVIMALLCTACAAGSPEEDTEDRVRIDPVAGGYYPSISPSAQTRGATPD GETLKDRPIFLLEDGSTIRLVVYDDAKNLLEEYSKAYLVRNAGTSGSSLLYPCEVDDNGA VISSSSTPLYMKAGTYYFRILSPAKALNSKGFVNIGNGEYLLATDDRYTQTAMTAVTITK IDEGGTLNNVQTLYLPPIINQTARMQFTVRAGEGVHTLEMLAEGIEISGIQQPLDNTTSF DWVNGDVLPVKVGDQSASVRITHATQNADNSLVAHTGVLPTDARSHSISVLLNLKVNGNP TQYQMLLTGLYLTAGHSYNYTATVKISNGVTVLTWQNRSWTENVVMDK >gi|226332027|gb|ACIB01000029.1| GENE 58 62287 - 64407 1284 706 aa, chain - ## HITS:1 COG:no KEGG:BF2868 NR:ns ## KEGG: BF2868 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 706 1 707 707 920 66.0 0 MKTIIRFFLSTVAVLVTGCTPEEEHIPAQGGDSFTLSSSEEQVITRAGDVSVNHFTAGTK YRLYGLIGSSWSGRLSTMNGQEGTEVVGADGTHQIEYLSTGTDEDRFEGRILDFYALTYG NTTLPICGSEADGIPVCNVTSSAGGVLPDLRRAVLTGQSGINSGVITLPFKHTLSKLKFE VVKQKPENGSAGVDVLAGIELKGITVSDYSEGALLLSTGEYSYMGGKSDRAVLGSGFSQA VTESVQFVTIDGQSGSAAAECLIFPTLTTAAEGLKIKVATKGTNSGDRTDTYEIRVPVIG DDGEVRKDENGKTVTGPFRFLPNYEYTLTITITNSDVQIITIIPRRYDWIEHNDDSQYMG QPLTFNGVMWMDRNLGATSADMTTPEGWEAGRGYYYQVGRDIPYFVKRTVNFKGSDGKTY TRPYCRGYSSGAAPYPVITGKEGVSAISGPQNPTNVAVTFDDVKEGKTVAWIVSKADATD WDNTHAISINRWKTPLNDPCPKGWRLPTYIEFAGIMPLEDKSGDITFLQHSGTSWIETMS NDPESGYTSVYIGERANTGTMYGVIYALKYQGTSRAYRIKWEPKFVSQEDVSDGSGVFIR RAYMQISKYSCTSADRLTLESKSGMDWEHPSDVINIPIYGFIWARSAMLINDAFEAMYLT 
STTQGNMVRAAKLKFDADTSTRYLNMSNYTPADGFQIRCVRDVTSQ >gi|226332027|gb|ACIB01000029.1| GENE 59 64510 - 65532 692 340 aa, chain - ## HITS:1 COG:no KEGG:BF2869 NR:ns ## KEGG: BF2869 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 327 1 329 342 528 89.0 1e-148 MNKEEKKSRFCRACLLLCLCLLSASCVSGEGEYEQEIPMEVALTRTDGVAGSEGTPLLLF WKETDFSTGTLGSPYKTSTPAGGIGDYADAPYNTETDYPTDNSRVVVMGLCPSAMTTADS HTTYSLPSRAGLTDVLVTPNGVTGSRKQPFGRALEFAHAAVRIVLKARRTDNTVGKLYIK NLKLTVPSRLVCGGVKWDTGVKHYVAAPASADLVIDHPGQILSTEDTQIAVFYLIPCATN DLLTDISSSGTPGLTLTADVAKDVNFTTGARKASFPVLDGNLNFMDQSGVPVTQLQSGES YGTTLNFDIDSFTLEGEEKEWEDGGKMTIPVVVPEPVPSA >gi|226332027|gb|ACIB01000029.1| GENE 60 65678 - 67234 993 518 aa, chain - ## HITS:1 COG:no KEGG:BF2870 NR:ns ## KEGG: BF2870 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 2 518 1 517 517 959 99.0 0 MMHNCKGSIWIIFFRFLLCLWACGCSDDAGMPASPAGADRVTACVRLSVRVVGETTRAAV QADVTVEEKISSVAVFLVSVDGSGKEDWNDVQYEVVYGTTSSAADVYQARIPTTPGEKKV YVGANLLPGQVHAICGQAEGKGLYTAAGAGYDEVIRQFARGTEGIAMTGKAGTSVTVSAG TDVNVDLTASPITLERVVAKVLLVCDTYTDDGGAYVRMSGNAARPSADPGWIRLSDVRYA LNTVNRKVYLNAPPDGKDPNHEVDPYVVKDDGGNYVSAPDVAEQQFVYRTLGSVWAEGIA PEAYEEVKFNATKATPPESGAYTGGLYCPENTTATSTASLAGGLDLTGTLNSTQAEIPRL VTTHLLVAAKFVPKQIITGAGTTQTLTSPADAATYLPATTTPVDAEAHAAGTYFTNGSDY YSYAGMKAAIGAGTLKRTDFTAYEGGIGYYYTYIDGTSATDGTIAFSADSGILRNRYYLL RITRFSLPSAALPQPMRATMKVTDWVTSSGNQIIVRPT >gi|226332027|gb|ACIB01000029.1| GENE 61 67218 - 68270 962 350 aa, chain - ## HITS:1 COG:no KEGG:BF2996 NR:ns ## KEGG: BF2996 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 350 1 350 350 699 99.0 0 MSDKFQTFCFSHSGSWFLPFLWLSLLAGLFACSWTGDDRSDCPSGFRIRLQPALHAQIQP DSGTGVITDEIDTLSLYVFDAQGQFVCLHTENRQSLTENDYIITLPLEYKDGDVYELVFW AGGDNRHYRMPQLTPGSSTRDELTLRLERDGDGRQDDELGHLWYGHLRLSRIQPSELTSV SVPMLKDSNRFVITLHDTSGQGLDADDYDFTLLADNGRMNADNEVMTGDRVTYAAYHTES ASETEPAATRTGEVSLARARLNTLRLLADQEARLVVTDRVSGQKVVDIDLTRYLLMTRPL 
FEESNGVELSDQDYLDYEDRFNVIFYLTPMGKLEALNINGWIIRLNDAQL >gi|226332027|gb|ACIB01000029.1| GENE 62 68450 - 70111 1384 553 aa, chain - ## HITS:1 COG:no KEGG:BF2997 NR:ns ## KEGG: BF2997 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 553 1 553 553 879 95.0 0 MKKLKYMSMMGLAALLLTTWAACSDDTDASGGENPEEARAYTTVTIAVPNGVAETRASDP TADTDDTNMDIGLTDEYKVTKANLYLFPGGTGSSFGSAKLTEIISISQFTQTTTTTTDQK TIVWTSKKTALTPGDYRIYIVVNGTVNGVGDSDKETLTEADFLAKTTAAATSVIAAVPSD GLVMASRSPNSNNSNTLPYIAQEITKDPEQTIAATVERVMGKITVTAGGTSASSAATVNK YTSFSTTVTAIGNITDITLKNYYVVNARKEGYYFRHVDKESTVTNPLTEANYGNSSATLP YVTDPKTYNKTYTSTPALANSYGDWYLQGSSAFGLSSFGTFSGTYTDMPGYSSGAVETKV AAYCYENTMLKDKQKNGYTTGIVFKAEIAPSKMMQKKPLGDGVEETTTIGSIGEIFYHSG IFYKDIEALKEAGVLLADGTTSSSASGAPADLKKNDVQCFKKESADGKFICYYPYWIKHL PSADTAEDVMEFGIVRNNVYQVTVTGIQGVGKDGVTENIITDTETDDPTTVLLNVKLSIK PWVVRANSAVLGR >gi|226332027|gb|ACIB01000029.1| GENE 63 70137 - 71540 853 467 aa, chain - ## HITS:1 COG:no KEGG:BF2874 NR:ns ## KEGG: BF2874 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 467 1 467 467 916 99.0 0 MKEKIRSTRLRMCFRKMPPAVCFLTLFALCFGALSVRAADPSGRVALTAVRMQRAGGQVY VSFAVKIAPRAVRARHRWVITPCLGNASDSVLLGPFVVTGRIMAREENQRRLLAGLPDRD VNHRWTARNGDTFLYTDTLRYAPWMENGLNLRLDIDREGCCRVQTVGSIVSSGAFPVALP YRPSVSELTPRVSRTVAEHADDYPFLCEAGSRPLHESGIGIRFRAASAVVDTLYSANAGN LRRITEAIGLLRADSCAFLQGISISGYASPEGTTELNRKLSAKRAEALRHALSVRMNLPV SLFELNAGGVDWDRLAELVNGSDMTYKEEVLAILRSHPEEERNDRLKALAGGRPYRSVLD VLYPQLRDACYIRVQYANRPDSVADTVNRAIEAIRGRKYEEAFRLLKTVEADERSWNVRG VCHLLCGDDKEAGLWLHRAVKAGNREAEENLKKMNAERRAATIGITQ >gi|226332027|gb|ACIB01000029.1| GENE 64 71549 - 72106 459 185 aa, chain - ## HITS:1 COG:no KEGG:BF2875 NR:ns ## KEGG: BF2875 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 185 1 185 185 348 98.0 5e-95 MKKQLFILLLLVSGNVLAQRAAVKTNLLHDVATAPSLAAEWAFASRWTLDVAGSACPWNF ADNRKWKFWMTQGEARYWLCQSFYGHFLGVHAGGGEYNLSRVHLPFVSRSVSYRYEGWAL 
MAGFSYGYSWVLGKRWNLEATIGAGWVHAQYKRFNCPVCGEYRGANKKNFLAPTRAGISL IYMLK >gi|226332027|gb|ACIB01000029.1| GENE 65 72143 - 72517 330 124 aa, chain - ## HITS:1 COG:no KEGG:BF2876 NR:ns ## KEGG: BF2876 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 124 1 124 124 240 99.0 1e-62 MLTPLTLTAISVLRHISSHETGNLQDLECGCKPLDELLDQLESAGLIRVRTDQSGSGIPK TYELTRPLFRISLLDLMEAVDQHLNCNQPDCEKMYARYHYAAHKLGIVNQMTRAYLSEIP LTDL >gi|226332027|gb|ACIB01000029.1| GENE 66 72908 - 73846 729 312 aa, chain - ## HITS:1 COG:no KEGG:BF2877 NR:ns ## KEGG: BF2877 # Name: tsr25 # Def: tyrosine site-specific recombinase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 9 312 1 304 304 596 100.0 1e-169 MNETLTVFMISEIERLREEGREPARRIYHNMLRTLRESAGKEEIGFEEVTPVFLGKYEHW LLGKRLSWNTVSTYMRALRAGYNRGMKGRPGYVTGLFDKVYTGTRSDVKRAVDARTVGRM IRMSGCPDESASAKAVDWFVLMFMLRGIPFVDLAHLRRSNLDKGVLTYCRHKTGQEVSIS VPREAMEIINRRMVENCHPSYLLPILGQPRTRKRHTKKALTPYQEYQYALRNLNRRLERV SVDLRLGGRLSSYTARHTWATIAFHQETPVGVISRGLGHSSVKVTETYLKPFGDREVDRT NRKILNYVLNAV >gi|226332027|gb|ACIB01000029.1| GENE 67 74063 - 74437 245 124 aa, chain + ## HITS:1 COG:no KEGG:BF3039 NR:ns ## KEGG: BF3039 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 124 1 124 124 196 98.0 3e-49 MKKSIQDLKESLRKKEEKQQRLHEQEETVKKDIRVLRYLIAKEEEAASPQAPRKRKIPQK QINDFFTRIALFYEDLQRRLNLRITYRCFCRWLCTRYEFESRYHDRHKLSPCTILGYFKR ERGG >gi|226332027|gb|ACIB01000029.1| GENE 68 74666 - 76243 1359 525 aa, chain - ## HITS:1 COG:no KEGG:BF2880 NR:ns ## KEGG: BF2880 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 524 1 524 525 1048 99.0 0 MDTPINAPLRKLPIGIQSFEKLRSENFLYVDKTALIYRLAQAGNPCFLSRPRRFGKSLLL SSFEAYFQGKKELFRGLAIEKLEKEWLQYPVLHLSLNAEKYDSREGLIDILERQLRQWEE LYRTGGEGITHSGRFMTVIRRACEQTGRRVVVLIDEYDKPLLRSFDSQELQHDFRETLTA FYTVLKDADPWLQFVFITGVTKFAQMGIFSNLNQLNDISFDLDYNTLCGMTRAEIEATFS PELEALAVKSNATCRQVMDRLTRQYDGYRFTKDEEFTSMYNPFSVLSALQKRSYGNYWFA 
SGTPTFLVEMLQKTDFDLREMEGIEVNEASLSDDRADINNPIPMIYQSGYLTIKDYDERF RMYTLGFPNEEVKYGFLNFVSPFYTPIAQTDTSFYIGKFIRELESGDVDAFLTRLRCFFA GIPYDLNDRTERHYQTVFYLVFQLMGQFTETEVRSARGRADAVVKTPDYIYVFEFKLNDS AEAALRQIDEKGYLLPYQADGRKVVKVGVAFEKEERNIGEWVIGE >gi|226332027|gb|ACIB01000029.1| GENE 69 76994 - 77155 93 53 aa, chain - ## HITS:1 COG:no KEGG:BF2883 NR:ns ## KEGG: BF2883 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 53 247 299 299 100 100.0 2e-20 GRQEGLAEGRQEGLAEGRMEEKQANARRMKALNLPVETICQVTGLSAGEIENL Prediction of potential genes in microbial genomes Time: Tue May 17 22:52:27 2011 Seq name: gi|226332026|gb|ACIB01000030.1| Bacteroides sp. 3_2_5 cont1.30, whole genome shotgun sequence Length of sequence - 80790 bp Number of predicted genes - 72, with homology - 69 Number of transcription units - 34, operones - 18 average op.length - 3.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) 2 2 Tu 1 1/0.250 - CDS 869 - 2032 1389 ## COG0019 Diaminopimelate decarboxylase - Prom 2128 - 2187 2.4 3 3 Op 1 . - CDS 2189 - 3508 1349 ## COG0527 Aspartokinases 4 3 Op 2 . - CDS 3540 - 4238 278 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 5 3 Op 3 24/0.000 - CDS 4223 - 4834 683 ## COG0139 Phosphoribosyl-AMP cyclohydrolase - Prom 4911 - 4970 2.4 6 3 Op 4 23/0.000 - CDS 5023 - 5775 688 ## COG0107 Imidazoleglycerol-phosphate synthase 7 3 Op 5 25/0.000 - CDS 5877 - 6596 727 ## COG0106 Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase 8 3 Op 6 . - CDS 6636 - 7226 607 ## COG0118 Glutamine amidotransferase - Prom 7307 - 7366 4.1 + TRNA 7577 - 7650 85.5 # Asp GTC 0 0 + TRNA 7681 - 7757 86.1 # Asp GTC 0 0 9 4 Op 1 . + CDS 7998 - 9230 986 ## BF2058 tyrosine type site-specific recombinase 10 4 Op 2 . + CDS 9268 - 9768 109 ## Rfer_3556 hypothetical protein + Term 9986 - 10029 5.3 11 5 Op 1 . - CDS 9999 - 10292 271 ## BF2114 DNA-binding protein 12 5 Op 2 . 
- CDS 10328 - 10618 225 ## BF2115 hypothetical protein 13 5 Op 3 . - CDS 10629 - 10922 216 ## BF2116 hypothetical protein 14 5 Op 4 . - CDS 10919 - 11221 265 ## BF2063 hypothetical protein - Prom 11363 - 11422 2.4 15 6 Tu 1 . + CDS 11697 - 12146 253 ## BF2118 hypothetical protein + Term 12169 - 12219 1.2 - Term 12152 - 12211 11.0 16 7 Op 1 . - CDS 12212 - 12538 327 ## BF2066 hypothetical protein - Prom 12561 - 12620 1.9 - Term 12609 - 12646 -0.5 17 7 Op 2 . - CDS 12669 - 13154 174 ## BF2067 hypothetical protein 18 7 Op 3 . - CDS 13042 - 13281 158 ## BF2068 hypothetical protein 19 7 Op 4 . - CDS 13299 - 14180 573 ## COG2961 Protein involved in catabolism of external DNA - Term 14223 - 14255 2.0 20 7 Op 5 . - CDS 14258 - 14827 181 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases - Prom 15070 - 15129 3.2 - Term 15473 - 15534 17.1 21 8 Tu 1 . - CDS 15564 - 17639 1508 ## COG1479 Uncharacterized conserved protein - Prom 17800 - 17859 8.0 + Prom 17984 - 18043 5.3 22 9 Tu 1 . + CDS 18095 - 18268 64 ## gi|253564838|ref|ZP_04842294.1| predicted protein + Term 18440 - 18485 3.1 - Term 18402 - 18444 -0.8 23 10 Tu 1 . - CDS 18592 - 18999 273 ## COG3871 Uncharacterized stress protein (general stress protein 26) - Prom 19075 - 19134 7.5 24 11 Op 1 . - CDS 19210 - 20031 303 ## COG2207 AraC-type DNA-binding domain-containing proteins 25 11 Op 2 . - CDS 20074 - 20439 368 ## COG3324 Predicted enzyme related to lactoylglutathione lyase - Term 20495 - 20524 0.0 26 11 Op 3 . - CDS 20545 - 21351 683 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 21545 - 21604 10.0 - Term 21579 - 21620 7.4 27 12 Op 1 . - CDS 21736 - 25497 2805 ## COG3534 Alpha-L-arabinofuranosidase 28 12 Op 2 . - CDS 25522 - 28101 1946 ## COG1472 Beta-glucosidase-related glycosidases 29 12 Op 3 . - CDS 28120 - 29124 662 ## COG0407 Uroporphyrinogen-III decarboxylase 30 12 Op 4 . 
- CDS 29133 - 29816 406 ## BF3066 putative 5-methyltetrahydrofolate-homocystein methyltransferase 31 12 Op 5 . - CDS 29818 - 30453 703 ## COG5012 Predicted cobalamin binding protein 32 12 Op 6 . - CDS 30477 - 31727 985 ## BF3068 putative methyltransferase CmuC 33 12 Op 7 . - CDS 31741 - 34326 2267 ## COG1472 Beta-glucosidase-related glycosidases - Prom 34432 - 34491 1.9 - Term 34416 - 34462 1.0 34 13 Op 1 . - CDS 34498 - 36186 1358 ## BF2906 hypothetical protein 35 13 Op 2 . - CDS 36207 - 39542 2436 ## BF2907 hypothetical protein - Prom 39566 - 39625 2.0 36 14 Tu 1 . - CDS 39673 - 40704 959 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 40726 - 40785 2.9 37 15 Tu 1 . - CDS 40914 - 41510 458 ## BF2910 putative ECF sigma factor + Prom 41532 - 41591 11.0 38 16 Tu 1 . + CDS 41813 - 41968 60 ## + Prom 42275 - 42334 3.7 39 17 Op 1 . + CDS 42361 - 42597 244 ## BF2912 hypothetical protein 40 17 Op 2 . + CDS 42599 - 43192 536 ## COG0693 Putative intracellular protease/amidase 41 17 Op 3 . + CDS 43212 - 43328 70 ## gi|255010491|ref|ZP_05282617.1| hypothetical protein Bfra3_15239 - Term 43465 - 43501 -0.1 42 18 Op 1 . - CDS 43547 - 43723 106 ## 43 18 Op 2 . - CDS 43773 - 44165 311 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes - Prom 44287 - 44346 6.9 + Prom 44200 - 44259 5.6 44 19 Op 1 3/0.250 + CDS 44331 - 46283 1320 ## COG1401 GTPase subunit of restriction endonuclease 45 19 Op 2 . + CDS 46300 - 48528 903 ## COG1700 Uncharacterized conserved protein 46 20 Tu 1 . - CDS 48515 - 48763 198 ## COG3326 Predicted membrane protein - Prom 48973 - 49032 3.5 + Prom 48735 - 48794 4.7 47 21 Tu 1 . + CDS 48884 - 49801 457 ## COG3129 Predicted SAM-dependent methyltransferase + Term 49965 - 50020 1.3 - Term 49624 - 49658 -0.8 48 22 Tu 1 . - CDS 49679 - 49951 127 ## BF3082 hypothetical protein - Prom 49988 - 50047 3.9 + Prom 49972 - 50031 5.3 49 23 Tu 1 . + CDS 50081 - 50554 271 ## BF3083 arsenate reductase + Term 50715 - 50749 -0.8 50 24 Op 1 . 
- CDS 50560 - 54225 3276 ## BF2922 hypothetical protein 51 24 Op 2 . - CDS 54209 - 54769 750 ## BF3085 hypothetical protein - Prom 54794 - 54853 2.2 52 25 Op 1 . - CDS 54893 - 56107 1067 ## BF3086 hypothetical protein 53 25 Op 2 . - CDS 56139 - 57071 870 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily + Prom 57025 - 57084 4.4 54 26 Tu 1 . + CDS 57157 - 57453 182 ## BF3088 hypothetical protein 55 27 Op 1 2/0.250 - CDS 58098 - 58367 120 ## COG0168 Trk-type K+ transport systems, membrane components 56 27 Op 2 . - CDS 58409 - 59596 1082 ## COG0168 Trk-type K+ transport systems, membrane components - Prom 59648 - 59707 2.5 57 28 Tu 1 . - CDS 59800 - 60255 125 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 60496 - 60555 4.9 + Prom 60590 - 60649 4.0 58 29 Tu 1 . + CDS 60786 - 61958 677 ## COG0477 Permeases of the major facilitator superfamily + Prom 61978 - 62037 5.5 59 30 Op 1 . + CDS 62074 - 62838 197 ## PROTEIN SUPPORTED gi|163797523|ref|ZP_02191474.1| 50S ribosomal protein L9 60 30 Op 2 . + CDS 62895 - 63737 584 ## COG2207 AraC-type DNA-binding domain-containing proteins 61 30 Op 3 . + CDS 63759 - 64028 259 ## BF3095 hypothetical protein 62 31 Op 1 . - CDS 64317 - 64580 361 ## COG2388 Predicted acetyltransferase 63 31 Op 2 . - CDS 64615 - 65940 1145 ## COG1090 Predicted nucleoside-diphosphate sugar epimerase 64 31 Op 3 . - CDS 65989 - 66846 836 ## COG3757 Lyzozyme M1 (1,4-beta-N-acetylmuramidase) - Prom 66886 - 66945 6.0 65 32 Op 1 . - CDS 66983 - 68686 1350 ## BF3099 hypothetical protein - Term 68699 - 68760 12.1 66 32 Op 2 . - CDS 68783 - 70426 1975 ## COG0205 6-phosphofructokinase - Prom 70493 - 70552 2.6 67 33 Op 1 . - CDS 70641 - 73460 2061 ## BF3101 hypothetical protein 68 33 Op 2 . - CDS 73475 - 73609 71 ## - Prom 73755 - 73814 3.2 69 34 Op 1 . - CDS 73829 - 74725 669 ## BF2940 hypothetical protein 70 34 Op 2 . - CDS 74747 - 76657 1626 ## BF3103 hypothetical protein 71 34 Op 3 . 
- CDS 76667 - 80044 2442 ## BF2942 hypothetical protein 72 34 Op 4 . - CDS 80082 - 80711 379 ## BF2943 hypothetical protein Predicted protein(s) >gi|226332026|gb|ACIB01000030.1| GENE 1 252 - 731 642 159 aa, chain - ## HITS:1 COG:MTH158 KEGG:ns NR:ns ## COG: MTH158 COG1528 # Protein_GI_number: 15678186 # Func_class: P Inorganic ion transport and metabolism # Function: Ferritin-like protein # Organism: Methanothermobacter thermautotrophicus # 1 159 1 161 171 150 44.0 6e-37 MISEKLQNAINEQISAEMWSSNLYLSMSFYFEREGFSGFAHWMKKQSQEEMGHAYAMADY IIKRGGIAKVDKIDVVPTGWGTPLEVFEHVFEHERHVSKLVDALVDIAAAEKDKATQDFL WGFVREQVEEEATAQGIVDKIKRAGDAGIFFIDSQLGQR >gi|226332026|gb|ACIB01000030.1| GENE 2 869 - 2032 1389 387 aa, chain - ## HITS:1 COG:SMc00723 KEGG:ns NR:ns ## COG: SMc00723 COG0019 # Protein_GI_number: 15966402 # Func_class: E Amino acid transport and metabolism # Function: Diaminopimelate decarboxylase # Organism: Sinorhizobium meliloti # 11 377 23 388 422 280 43.0 5e-75 MKTVFPIHKFRELPTPFYYYDTKVLRDTLACVNREVARYDNFSVHYAVKANANPKVLTII RESGLGADCVSGGEIRAAIKAGFPAGKIVFAGVGKADWEIDLGLDYDIFCFNVESVPELE VINELAAAKGKVANVAFRINPNVGAHTHANITTGLAENKFGISMHDMDKVIDVALELKHV KFIGLHFHIGSQILDMGDFIALCNRVNELQDKLEARHILVEHINVGGGLGIDYDHPNRQP IPDFADYFRTYDEHLKLRPHQTLHFELGRAITGQCGSLISKVLYVKQGTNKQFAILDAGM TDLIRPALYQAHHKMENLTSEDPVVETYDVVGPICESSDVFGKAVDLNKVKRGDLIALRS AGAYGEIMASGYNCRELPKGYISEELV >gi|226332026|gb|ACIB01000030.1| GENE 3 2189 - 3508 1349 439 aa, chain - ## HITS:1 COG:VC0391 KEGG:ns NR:ns ## COG: VC0391 COG0527 # Protein_GI_number: 15640418 # Func_class: E Amino acid transport and metabolism # Function: Aspartokinases # Organism: Vibrio cholerae # 3 436 34 476 479 250 35.0 3e-66 MKVLKFGGTSVGSAQRMKEVAKLITDGERKIVVLSAMSGTTNTLVEISDYLYKKNPEGAN EIINKLEAKYKQHVDELYATEEYKQKGLEVIKSHFDYIRSYTKDLFTLFEEKVVLAQGEL ISTAMVNYYLQECGVKSVLLPALEYMRTDKNAEPDPVYIKDKLQAQLDLYPDAEIYITQG FICRNAYGEIDNLQRGGSDYTASLVGAAIHASEIQIWTDIDGMHNNDPRIVDKTAPVRQL HFEEAAELAYFGAKILHPTCIQPAKYANIPVRLLNTMDPHAPGTLISNDTEKGKIKAVAA 
KGNITAIKIKSSRMLLAHGFLRKVFEIFESYQTSIDMICTSEVGVSVSVDNTKHLNEILD DLKKYGTVTVDKDMCIICVVGDLEWENVGFEAKALDAMRDIPVRMISFGGSNYNISFLIR ECDKKVALQSLSDMLFNGK >gi|226332026|gb|ACIB01000030.1| GENE 4 3540 - 4238 278 232 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 20 220 16 218 245 111 30 1e-23 MDETLIHYTDVEIHQQELCVLSEVNLQLHKGEFVYLVGKVGSGKTSLLKTLYGELDVTAG EAEVLGYRMTSIKRKHIPQLRRKLGIVFQDFQLLTDRTVNDNLEFVLRATGWKNKQEIKE RIAEVLNLVGMENKGYKLPNELSGGEQQRIVIARAMLNSPEIILADEPTGNLDVETGKAI VELLHNICQAGSLVVMTTHNLQLVAEYPGQVYRCAEHRIVNVTDEFAKKENN >gi|226332026|gb|ACIB01000030.1| GENE 5 4223 - 4834 683 203 aa, chain - ## HITS:1 COG:hisI_1 KEGG:ns NR:ns ## COG: hisI_1 COG0139 # Protein_GI_number: 16129967 # Func_class: E Amino acid transport and metabolism # Function: Phosphoribosyl-AMP cyclohydrolase # Organism: Escherichia coli K12 # 2 100 9 107 112 144 64.0 1e-34 MDLDFDKMNGLVPAIIQDNDTRKVLMLGFMNKEAYEKTVETGKVTFFSRTKNRLWTKGEE SGNFLNVVSIKEDCDKDTLLIQVNPVGPVCHTGTDTCWGEKNEEPVMFLKLLQDFIDRRH EEMPEKSYTTSLFQSGINKIAQKVGEEAVETVIEATNGTDDRLIYEGSDLIYHLIVLLTS KGYRIEDLARELQIRHSDSWTKH >gi|226332026|gb|ACIB01000030.1| GENE 6 5023 - 5775 688 250 aa, chain - ## HITS:1 COG:aq_181 KEGG:ns NR:ns ## COG: aq_181 COG0107 # Protein_GI_number: 15605750 # Func_class: E Amino acid transport and metabolism # Function: Imidazoleglycerol-phosphate synthase # Organism: Aquifex aeolicus # 1 250 1 250 253 288 56.0 7e-78 MLAKRIIPCLDIKDGQTVKGTNFVNLRQAGDPVELGRAYSEQGADELVFLDITASHEGRK TFAELVRRIAANISIPFTVGGGINELSDVDRLLNAGADKISINSSAIRHPQLIDDIAKHF GSQVCVLAVDAKQTENGWKCYLNGGRIETDKELTAWTKEAQERGAGEVLFTSMNHDGVKT GYANEALAELASQLSIPVIASGGAGQMEHFRDAFTLGKADAALAASVFHFGEIKIPELKS YLCGQGITVR >gi|226332026|gb|ACIB01000030.1| GENE 7 5877 - 6596 727 239 aa, chain - ## HITS:1 COG:PM1203 KEGG:ns NR:ns ## COG: PM1203 COG0106 # Protein_GI_number: 15603068 # Func_class: E Amino acid transport and metabolism # Function: Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide 
(ProFAR) isomerase # Organism: Pasteurella multocida # 3 234 5 241 249 188 39.0 6e-48 MIELIPAIDIIDGKCVRLSQGDYGSKKVYNENPVEVAKEFEANGIRRLHVVDLDGAASHH VVNYRTLDLIASRTSLIIDFGGGLKSDEDLIIAFENGAQMVTGGSIAVRNPDLFCRWIDR YGSGKIILGADVKDRRIAVNGWKDESTCELFPFLKDYTQKGIEKVICTDISCDGMLAGPS LDLYKEILAEHPTLYLIASGGVSSIADIEALHEAGVPAVIFGKALYEGRITLKELQAFL >gi|226332026|gb|ACIB01000030.1| GENE 8 6636 - 7226 607 196 aa, chain - ## HITS:1 COG:VC1136 KEGG:ns NR:ns ## COG: VC1136 COG0118 # Protein_GI_number: 15641149 # Func_class: E Amino acid transport and metabolism # Function: Glutamine amidotransferase # Organism: Vibrio cholerae # 3 196 5 203 203 182 43.0 5e-46 MKVAVIKYNAGNIRSVDYALKRLGVEAVITSDKEVLKAADKVIFPGVGEAETTMLHLKES GMDRFIKELRQPVLGICLGMQLMCRFSEEGNVDCLGIFDTDVKRFAPRKHEEKVPHMGWN TISCLKSDLFKGFTRDEFVYFVHSYYVPVSEFTAAETDYIRPFSAALHKDNFYATQFHPE KSGEAGERIIKNFLEL >gi|226332026|gb|ACIB01000030.1| GENE 9 7998 - 9230 986 410 aa, chain + ## HITS:1 COG:no KEGG:BF2058 NR:ns ## KEGG: BF2058 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 406 1 406 409 745 94.0 0 MGAVKRNTLSVLFIIKKSKLLKNGEAPICMRITVNKRVAEVMIKRSIPVDLWNQKKECSK GKDRVANELNHYINTVRAKILQIHRELEIDNKTITADIIKDCFYGRDKVQRSLLEVYAEH NEKCRALIGKEYTESTVTKFDTSINRLKEYIRSRYHRDDMMLAELDGQFIRDFDFWLKTD KHCQNNSALKYLKNLKKVVRIALANGWIKKDPFYGIRFKQEEVNVEFLSREELDILMNKE FAIKRLEQVRDIFVFCCFTALAFVDVQQLSREHLIKDNNGALWIRKARQKTNQMCNIPVL SIPQRILGKYEDNAECIKKGVLLPVISNQRMNAYLKEIADLCGIAKRLTTHVARHTAATV VFLANDVSMENVSKILGHSNIRMTQHYAKVLDSSIMREMRNVEKNFSYGD >gi|226332026|gb|ACIB01000030.1| GENE 10 9268 - 9768 109 166 aa, chain + ## HITS:1 COG:no KEGG:Rfer_3556 NR:ns ## KEGG: Rfer_3556 # Name: not_defined # Def: hypothetical protein # Organism: R.ferrireducens # Pathway: not_defined # 6 161 1 157 162 149 44.0 3e-35 MNKLLLTFQEIKNRITGFSFPLFGVSWQPNESEIKIAQNIINQLEDRRVLYSPYELERPH YCIESILRIRECLTQEICKVSQNQNIYQDIQLLRAACRKFLDTIQPIQDEVHNCDSFTTI SGWIFLSALGELRGVFGIIISKLSVSYGIHINGELIKIIPSNDIDE >gi|226332026|gb|ACIB01000030.1| GENE 11 9999 
- 10292 271 97 aa, chain - ## HITS:1 COG:no KEGG:BF2114 NR:ns ## KEGG: BF2114 # Name: not_defined # Def: DNA-binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 97 1 97 97 143 74.0 2e-33 MELINGNSEPVKEFFQSLECLLDGINRLAKENKPSLGGDSFLSNREVSKLLKVSIRTLQE WRDTGIIPYIQIRGKVIYRQSDIDRLLQSCYNEERQE >gi|226332026|gb|ACIB01000030.1| GENE 12 10328 - 10618 225 96 aa, chain - ## HITS:1 COG:no KEGG:BF2115 NR:ns ## KEGG: BF2115 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 2 96 6 100 100 137 73.0 8e-32 MKKETETNCSFRLSAISGTWESLNLHPAIMIYPSRRKYLLSMLRVSDNGQARPATYEIQK EKNRYFIVEGFKRLYIGYDKVKDILSISYYGSYLRD >gi|226332026|gb|ACIB01000030.1| GENE 13 10629 - 10922 216 97 aa, chain - ## HITS:1 COG:no KEGG:BF2116 NR:ns ## KEGG: BF2116 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 97 1 97 101 133 70.0 2e-30 MKVVAVEEQTFQLVCRRFSTFASQVKSICMESSRKSEEWLSSREVCALLGISLRSLQNYR DSGKLGYSQIGNKMYYKATDIERLVAVYTENKKSNHK >gi|226332026|gb|ACIB01000030.1| GENE 14 10919 - 11221 265 100 aa, chain - ## HITS:1 COG:no KEGG:BF2063 NR:ns ## KEGG: BF2063 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 96 1 91 95 113 62.0 2e-24 MEEVKQQPLQGMGSRERDGYKSLFLKKRTVCTRQSVYVSGEIHGCIARMVGVIAGKRISI GNFIDNVLEHHLNSYKEVISSLYREEADKGIINPPKGNQA >gi|226332026|gb|ACIB01000030.1| GENE 15 11697 - 12146 253 149 aa, chain + ## HITS:1 COG:no KEGG:BF2118 NR:ns ## KEGG: BF2118 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 149 1 149 149 243 87.0 2e-63 MEQKKKRFNKGGRKPLLNPKVHRYSFNLDDVENAKFLSFFDRSGYTVKAHFIKNCIFGKS FKVVVRDKSKVDYYIQLTQFYSQFRKIANNYNQAVKELHSNFSERKALALLYRLEQCTVE LIKTNEMIVLLCKRFEQSYQREGIISADE >gi|226332026|gb|ACIB01000030.1| GENE 16 12212 - 12538 327 108 aa, chain - ## HITS:1 COG:no KEGG:BF2066 NR:ns ## KEGG: BF2066 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 107 1 107 107 177 86.0 2e-43 MYIENYEFREWMQKLLDKLDEVGKGVRSLQNNPEVMPGDKLLDNQDLCLLFRVSTRTLQR LRAKKKLPFMMISGKAYYRASDVREFIRERFDVGTLRKFEKEHGDNKQ >gi|226332026|gb|ACIB01000030.1| GENE 17 12669 - 13154 174 161 aa, chain - ## HITS:1 COG:no KEGG:BF2067 NR:ns ## KEGG: BF2067 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 34 161 1 128 128 223 90.0 1e-57 MNRIIVFCFAIFLLDCTDLNHTVCGLSKKKIPYLYLIDEAIELIKTEVRIVNLRIKYPEQ FQQHANSLYLSPLHLADKTSLINIMEIVDGLFLSQRIIYQNGTSVHLTDLGKAFEWLFNI KLGDYHQKYMDVIKRKPAKLTEFLNELANLIRKEHINKGYR >gi|226332026|gb|ACIB01000030.1| GENE 18 13042 - 13281 158 79 aa, chain - ## HITS:1 COG:no KEGG:BF2068 NR:ns ## KEGG: BF2068 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 79 12 90 90 103 79.0 1e-21 MTQLSQQRFFRLLSEYSQHKVSASELVEAIEELAIHLANFSTNEQNYSVLLRYFSFGLHR LKSYRMRFEQEKNTLSVLD >gi|226332026|gb|ACIB01000030.1| GENE 19 13299 - 14180 573 293 aa, chain - ## HITS:1 COG:NMB1061 KEGG:ns NR:ns ## COG: NMB1061 COG2961 # Protein_GI_number: 15676945 # Func_class: R General function prediction only # Function: Protein involved in catabolism of external DNA # Organism: Neisseria meningitidis MC58 # 1 243 1 246 281 67 24.0 3e-11 MSTYKHFGNQPDVLKHLVLCEILQNENPSTYIETNSACAIYQMEHTPEQQYGIYHFLERA NDENGLKDSMYYKLEKSEMLKGNYLGSPGLAMNVLKGVNDFIFFDIEKSALDNVSSYAGQ IKIHSDVRLLNMDSLNGMIDLLPTLPKSTFIHIDPYEIDKKGASDLTYLDIFVKATQSGM KCLLWYGFMTNQDKGHINQYVIRSLQQENIKNYTCVELIMKSIKENTVSCNPGVLGSGIL AANLSQKSHNMIFDYSRKLVELYKNAKYNGYDGSLYKDIVKNVSRRNSFKFRL >gi|226332026|gb|ACIB01000030.1| GENE 20 14258 - 14827 181 189 aa, chain - ## HITS:1 COG:alr4010 KEGG:ns NR:ns ## COG: alr4010 COG0664 # Protein_GI_number: 17231502 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Nostoc sp. 
PCC 7120 # 33 184 40 191 196 79 29.0 3e-15 MDKFNVEKEILDTSELERLFLREGIFTTIKRNEYLIRQNEMTNQIGFVVSGIFRLSRIDV NGNEWIIGYSFKNDFVCDYPSFINKMGATVNIQASTDCEVYLLSLNRLNQFWETDMNTQR LGRRIAETMFAEIYQRLLGFYCDTPEQRYQALMKRCPDLQGKLSLKEIAHFLGITPETLS RIRKKILLK >gi|226332026|gb|ACIB01000030.1| GENE 21 15564 - 17639 1508 691 aa, chain - ## HITS:1 COG:MA2417_1 KEGG:ns NR:ns ## COG: MA2417_1 COG1479 # Protein_GI_number: 20091248 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Methanosarcina acetivorans str.C2A # 1 582 1 590 597 579 51.0 1e-165 MKASSNNLLSIIKGPRQFVIPIYQRTYSWQLVQCNQLLNDILRISNDSAVQGHFIGSIVY FQESIHTVSDVPKLLVIDGQQRLTTVSLLIAAIADFIKENAVEIDTSFTKLQNYYLINPE EDNELRYKLLLTRRDKDTYINLLKGIPRSEGMSQRIIENYDFFKSKINKENVVAIYSGVQ RLFVVDVALEKEKDNPQLIFESMNSTGLDLSQADLIRNYVLMGQEVHLQTSLYESYWYPM EQGYGSEYAALFNSFMRDYLSVKTGTIPRIDMVYDAFKAYVIGGKAPDTISEVVKDIYTY SGYYVNMVLHKEPDKLLCGAFKRISQLRVDVSYPFLLPLYNDYVNEVISRDEFYEALCLV ENYVFRRAICGIPTNSLNKTFATLYKSFDKANYMDGLKAAFLLLDSYKRFPNDTEFTTFL QTKDVYNFRNRNYLLNRLENFQRKEMVNISDYTIEHVMPQNPNLSHEWQEMLGEGWVEVQ GKYLHTLGNLTLTGYNSELSDRPFQEKKSMEGGFDDSPIRLNSYLRRISSWNEEQILVRA GQLAEKAKEIWRFPLLSPETLEVYRSAERDPAEYTLEHYEYLKGDILTLFQALRRRIMNI DPSVKEELKKLYIAFKAYTNFVDVVPQKSRLRLSLNVAFADILDPKGLCKDVSNLGRWGN GDVEVGISNMNELDDIMELIQQAFDKQMEAN >gi|226332026|gb|ACIB01000030.1| GENE 22 18095 - 18268 64 57 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253564838|ref|ZP_04842294.1| ## NR: gi|253564838|ref|ZP_04842294.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 57 1 57 57 92 100.0 7e-18 MLVLASKTKDVSFVIRQKEETQGNTVEIYSITLLKDIHTNRTDYAILLNVPDTVIAY >gi|226332026|gb|ACIB01000030.1| GENE 23 18592 - 18999 273 135 aa, chain - ## HITS:1 COG:DR1146 KEGG:ns NR:ns ## COG: DR1146 COG3871 # Protein_GI_number: 15806166 # Func_class: R General function prediction only # Function: Uncharacterized stress protein (general stress protein 26) # Organism: Deinococcus radiodurans # 9 128 40 161 193 63 28.0 1e-10 MKSLNERAADLLQGCETVILSSVNQEGYPRPVPLSKIASEGISEIWMATGEHSVKTKDFR SNPKAGLCFYEQGNSVALTGEIEVVTDAGLKQKYWQDWFIAHFPKGPTDPEYVLLKFRSE HATFWIDGQFVHRNI >gi|226332026|gb|ACIB01000030.1| GENE 24 19210 - 20031 303 273 aa, chain - ## HITS:1 COG:mlr1196 KEGG:ns NR:ns ## COG: mlr1196 COG2207 # Protein_GI_number: 13471273 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Mesorhizobium loti # 43 253 68 269 276 75 28.0 1e-13 MYREYQPCGLLAPYVDKIWEFKGSPEYGMRINVLPDGCTDLIFALGGITQPVGNEGRIMP SCRSFFVGPMKRYSELVAYTETVHMVGIRFHPCGLFRFMDLPLQELGGQRISSADLGIKL FDDSFTERLYELPDLRSRIQCIETVLVRSMHKHDVVDKQIVFAVNHIHLYHGQREIRLLA EDTCLCQRHLERRFKLFTGFTPKEYSRIVKFRQAIDLLKNTTEANNLLSVAVNAGYYDVS HFLKEVKTLSGGTAESFLSPTLPQEGLLTYIEK >gi|226332026|gb|ACIB01000030.1| GENE 25 20074 - 20439 368 121 aa, chain - ## HITS:1 COG:RSc2671 KEGG:ns NR:ns ## COG: RSc2671 COG3324 # Protein_GI_number: 17547390 # Func_class: R General function prediction only # Function: Predicted enzyme related to lactoylglutathione lyase # Organism: Ralstonia solanacearum # 1 118 1 118 131 73 39.0 1e-13 MEKFIAFFEIPAADFHRAVGFYETVLDIKLAVSEYEEEKMACFMEQGEAVGAVSWAPDFL PSERGTLIHFYCEEIGKSLERVLQKGGRVITPETEIDAEGRGHFAVFADSEGNHIGLYSD K >gi|226332026|gb|ACIB01000030.1| GENE 26 20545 - 21351 683 268 aa, chain - ## HITS:1 COG:CC2573 KEGG:ns NR:ns ## COG: CC2573 COG2207 # Protein_GI_number: 16126811 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Caulobacter vibrioides # 8 259 8 263 270 87 25.0 2e-17 
MQSFRVIKPTAALVPYVRHYWVLSDDALAPVSERTLPVGCVQMVFHRGRQLFSLTEGRLQ PSSFISGQAFGYSDVESTGMLEMIVVVFQPFAAKAFLHMPVSEFRGMNVNTEEMGDPLLV DLGRRIADMPDRVACIRLIEEFLLSRLYAFPEYNLKRVSTVLEAVNLHPHIRTAQLADVA CLSNKQFGRVFAEYVGATPKEFLRIVRIQRALYTLQCQPGISFAQLAYECGFFDQSHMIK EFKFFSGYTPAEYLAVCAPYSDYFFVEE >gi|226332026|gb|ACIB01000030.1| GENE 27 21736 - 25497 2805 1253 aa, chain - ## HITS:1 COG:TM0281 KEGG:ns NR:ns ## COG: TM0281 COG3534 # Protein_GI_number: 15643050 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-arabinofuranosidase # Organism: Thermotoga maritima # 214 401 25 211 484 111 35.0 7e-24 MNKIVSFLFICLLLADLHAQNTIKIIDKDKTSTPVSPYLWGSFFEMGFGRSDLLWGELLF NRSFENTKPVSESNSWYTCYRGDVKEAKWWHSGYEEPKWYLLADGRKEEKLPLIFNNYWP SAHGKYFLQIDNRKKQTPTLLVQERIYIEKGKGYTVSGLFSSGGYLSEEKYSKESVPVTI AIYKEGDFHTPLSKTELAVNTNQFILYSSSLDATEYEGWCTYTVEVPAGGCVGIDLLSLM ADDVIKGWKRESVERIKNELRPRTMRMPGGCFTSLYDWRSGIGPREERPVSYDTWWGCEL LNDVGTFELVDLCEAVGAEPFFCVPVMFNNEYSAADWVDFCNNPTNAQRIAYGRTQPLNV KYWELENEPYRRFDAVTYANRCVDFAKAMKAKDPSIKIAVGNYWLFNKKFKEMLEIVGPY VDLITNRGGTPEEMRADIVILDAYNKAHGTDIKLCHTEFRAPVTRNEGNTDGLNQKDTGG EETLFNASIRWGFAMNMVEQYIAYQNMGGSFFTANYTNLSDGWGECLINTPKEGTYLNAP GVAFALLNSLDIAYPQIIEQEKENQDIVIQAAWNKRRDKLTLVVLNFSQNTQSCKIDFSQ IKKSFRVRKGMKIAPQSDLSFNTLQHPEEVKVESFVPSTGKMMKLGLPGNSLIVVELQAE RSHGIHVNASTGNDASIGSLAYPLKTIQAAADMAEPGDTVIVHEGIYRERVSPSRGGESE EKPIVFMAAKGENVEIKGSEVMKGWKKVNDTTWEVGIPNKFFGGFNPYAETLHGDWFERG KWCHTGEIYLNDIALMENPSLSNVLQNKGDSLLWFCKVEQDTTRLYANFGDKNPNQELVE INVRQSVFYPERPYVNYIVVNGFKLSQAATPWAPPTAEQIGLLGTHWSKGWVIENNTITH SKCVGITLGKYGDEWDNKSESEEGYVNCVKRALRHNWNREHIGGHLVRNNTVAYCGQAGI AGSLGAIFSKIKNNTVHDISTQNLFWGYEMAGIKIHAAVDVEISGNHIYRVEGGIWLDWM AQGARVTRNLLHDNRVVEVSFEVNHGPILVDNNLFLSPELAQIKLSQGMAFVHNLIVWKV WKLNNVDPRKTPYLAPHGTEIMGYHDCPCGNVSYFNNIFTRAEMTEYDDCVLPVQMEKNC YWGEAVSSGLDKNATVNSGFDADIQVIEKTDGWYLQINVPENWKDEKLRDKVSTKDLGRA SIPDQSFNKENGTVIDLIEDYWGQNRKGQKKYYPGPIDFTTNGGKVMLKVYDK >gi|226332026|gb|ACIB01000030.1| GENE 28 25522 - 28101 1946 859 aa, chain - ## HITS:1 COG:TM0076 KEGG:ns NR:ns ## COG: TM0076 
COG1472 # Protein_GI_number: 15642851 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Thermotoga maritima # 26 759 4 758 778 543 40.0 1e-154 MKKLIFLSLMAICFCVRLYAQTNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYS IMENGKLNEEKLEKMIGGQNYGFIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTLTE SLHGSVHDGSTIFPQAIALGSTFNPILAYEMTSAIAKELSAQGITQSLTPVIDVCRDLRW GRVEECFGEDPFLVSRMGVSQVRGYLDNQVSPMIKHFGAHGTPQGGLNLASVSCGQRELL SIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRWDFQGYVYSDWGAIG MLNYFHKTAQNSAEAAIQALTAGLDAEASDNSYAELQQLVENGMLDVKYIDQAVARILTA KFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNENNILPLQMNKLKSIAV IGPNADQVQFGDYTWSRDNKDGVTLLEALKERVSNQLTLNYAKGCDLVTDDCSGFKEAVD VAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTGVQEDLVEAIHATGKPVIVV LLSGKPFAMSWIKENIPGIVVQWYPGEQGGLALADMLLGKVNPSGKLNYSFPQSVGHLPC YYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGHGLSYTDFEYLSATTSKEDYACE DVIEVTIAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQVIIKIP VSELALYNKEMKKVVEPGAFELQIGRASDDIRIKKVITVERASEKYIPTLRDKEKKVSST KNMTATPVVVKGTIRDVQANLLPQVTVKVGKEEVVTNSKGEYSIRAMSTDTLIVSGSKFK TEHISIEGRQVINIRMLNR >gi|226332026|gb|ACIB01000030.1| GENE 29 28120 - 29124 662 334 aa, chain - ## HITS:1 COG:MA0146 KEGG:ns NR:ns ## COG: MA0146 COG0407 # Protein_GI_number: 20089044 # Func_class: H Coenzyme transport and metabolism # Function: Uroporphyrinogen-III decarboxylase # Organism: Methanosarcina acetivorans str.C2A # 60 330 68 337 339 116 29.0 7e-26 MNVDEWIEYIIVSDKKVALPIMTHPGIELLNKRVLDAVTDGEVHFKAVEALNMNYPQSAA CTVIMDLTVEAEAFGAQIQFSENEVPNVIGRLVSNYEEVVGLKIPTLDIARIPQYLEVNR LAAKGLDKPVLGGCIGPYSLAGRLFDMTEIMMAIYTEPETALLLLDKCTQFITQYCRAIK DCGSAGVIIAEPAAGLLSNEDCMQYSSVFVKRIVEEVQDSHFAVILHNCGNMGQCTQAMV ATGAKGYHFGNKIDMLAALEECPSNALVMGNLDPVGIFKQATAEEVYRQTYTLLQKTTAY PNFVISTGCDVPPEIPLDNIRAFYEAVKDYNQKQ >gi|226332026|gb|ACIB01000030.1| GENE 30 29133 - 29816 406 227 aa, chain - ## HITS:1 COG:no KEGG:BF3066 NR:ns ## KEGG: BF3066 # Name: not_defined # Def: putative 5-methyltetrahydrofolate-homocystein methyltransferase # Organism: 
B.fragilis # Pathway: not_defined # 1 227 1 227 227 462 99.0 1e-129 MDFRYKELSWKDICLDWDDFVFSLGKGYKMEGEVLTMFSALEREVTQFCKPRWGYRVFSL DAFDKSRIILNGCRLQTGRIITPYLENAELCALFVATAGEEFERFQQTVKKSGQIVEEFL LDALGSAIAEATVREACKAIEKEFGAKGLGISYPYSPGYCGWKVSDQQILFSLLPNQPCG VSLTASSLMCPIKSVSGVVGIGRQMTRQKYGCELCGKKDCYKNRLNK >gi|226332026|gb|ACIB01000030.1| GENE 31 29818 - 30453 703 211 aa, chain - ## HITS:1 COG:mlr1231 KEGG:ns NR:ns ## COG: mlr1231 COG5012 # Protein_GI_number: 13471298 # Func_class: R General function prediction only # Function: Predicted cobalamin binding protein # Organism: Mesorhizobium loti # 6 208 26 228 238 171 45.0 6e-43 MNLTDLYDAILNGKLDRAVAVTNEAISEGVLPNEIITNYMIKAMEEIGNRFEAGKVFVPN LLMSARAMKGALDILKPLLQGETDAYVGKIVIGTVKGDLHDIGKNLVASMFEGCGFEVVN LGVDVSSEKFVEAARTNNADIICMSALLTTTMNYMKVVVDDLKTAGLYGKVKVMVGGAPI NEAFAHSIGADAYTSNANAAVIMAKKLIGAA >gi|226332026|gb|ACIB01000030.1| GENE 32 30477 - 31727 985 416 aa, chain - ## HITS:1 COG:no KEGG:BF3068 NR:ns ## KEGG: BF3068 # Name: not_defined # Def: putative methyltransferase CmuC # Organism: B.fragilis # Pathway: not_defined # 1 416 1 416 416 885 100.0 0 MSTSRDRVRQALSHQGSDRIPVDFGATAVTGIHCRVVEALRKHYGLSYKPVKIVDTFQML GEVDRDLADAMGVDCIGVGGTRDIFDHDTECMHEQVTPWGQKVLVPIQLDLTQDKEGDVY VYAGGDKNYPPSAVMPNGCFFINAIERQQPIDEDKLNPMDNLEEFNSITDEELDAYKKKV NEASATGRAVVASFGGTALGDVAFVPGMGLKEPKGIRSVVEWYMSTVMRQEYLHEIFRRQ TDIAIANYEKLWAVLGDKVDVVLTCGTDFGSQESQFCSVEVFNELWLPHYRRMNDWIHEH TTWKVFKHSCGAIVPILPGLIEAGFDIINPVQINAKDMDSGMLKREFGSHLTFWGGGVDT QKMLPFGTPDEIRRHVLGQCEILGKDGGFVFNSVHNIQANVPVENVIAMLDALKTV >gi|226332026|gb|ACIB01000030.1| GENE 33 31741 - 34326 2267 861 aa, chain - ## HITS:1 COG:TM0076 KEGG:ns NR:ns ## COG: TM0076 COG1472 # Protein_GI_number: 15642851 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Thermotoga maritima # 25 752 4 750 778 503 38.0 1e-142 MKNYIFTAVLLFLTISLSAQPIPKYKDASQPIEVRVQDLLSRMTIEEKVAQLRHIHEGSI LNDDHTLNLEKMKNIIGTLGWGAVEGLTLEGIEMARYTYQIQKHCIENTRLGIPIFTISE 
SLHGAVQGGATIFPQAVALGSTFNPDLAYQMTKAISGELNAMGVNHVLSPTIDVIRELRW GRVEESFGEDPFLVSQMGVHEINGYIDGGISPMLKVFGPHGVPTSGLNLASTEANERDLR EVFLKPYEVAVKQTGVNSVMTAYNSTNRIPNTASKWLLTDLLRTEWGFKGYTYSDWGAVS MLYGFHKVASNVNEAVKMALMAGTDLEASSDCYANIPAMVRLGELDVKYVDLACSRVLYA KFKAGLFENPYGLPIEEYEKKVRTKENVALSRRISEESVVMVKNEGNLLPLDMKKLKSVA VIGPNANQVQFGDYTWSRNNKDGITPLQGIQNLVGNKLAVHHVVGCDLVSDDKSGFADAV ATAKKSDVVFLFVGSASASLARDYSNCTCGEGYDLTDLNLTGVQGDLVKEIYATGKPVVL ILVTGRPFSITWEKEHIPAILFQWYGGEREGEVIADVLFGKVNPSGSLCYSIPQSVGHLP IHYNRLPSDKGIYRSPGTINKPGRDYVFSTPEPLWPFGYGLSYSDFEYSDFCFDKENYGL TDTVRIQVNVKNKSAIEGKSTVQVYVRDLAGSVVMPMKQLKGFSKVTVPAYGSTLAEISV PVSELGLYDMNMRYVVEPGDFDFMIGTSSDSICFKKTIHVGEVDAKAERVSSAASKETQT VKTGKKMIVKGVVRDVQAKTLDGVKVSVKGRKGSVITNGKGEYSIEATSASVLVFSRKGY DMQEVEVNSQKTLNITLLNKI >gi|226332026|gb|ACIB01000030.1| GENE 34 34498 - 36186 1358 562 aa, chain - ## HITS:1 COG:no KEGG:BF2906 NR:ns ## KEGG: BF2906 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 562 1 562 562 1152 99.0 0 MNKIAYMTCVLSLAFTMQACDDFLDSFPQDTVTNENFWKSKEDADKVLVDIYASLLPKDA IFFDEAMSDNAYLVWDWWGGAQQVANGSYTTSGEIPTNRWNGSYEVIRKCWFLLEGIEKI EDISEQDKNKIIGETYFMLAYNYYVLTSYFGDVPLVTETLAIPESKQLVRTPKAEVVDYA INKLKEAAVMLEGLSQEKGRVTADACRFLIARMYLYNGDYSNVLETVKLLEGKYQLYREG DTPYEDLFSGVAENSCEVILSVVCDKRVGEIYTSHGGNGIMLLKGITGEDPYRGVTPSGS LVDAYPMADGRLIHEAGSSYDPKKPYEGRDPRFYQSIVYPTGQIKYLDVETGTVKERLYD PEDPTTVPEHQYNYSQPSATGYMWNKYIDYSVYAMNSVWDCTNDIIVFRYADVLLMKAEA LLQTKGESAKEEVCNLIDQLRDRCSCGRVHRENYNSKDELMELLKNERRIELANEGLRYM DLIRWKDAEKNTIVTGVGLTGQMYGAYMRKDGVGKDDKTVDVDNTPRRYIETRYFNASKG YLFPIPQKERDLNPNLTQNPNW >gi|226332026|gb|ACIB01000030.1| GENE 35 36207 - 39542 2436 1111 aa, chain - ## HITS:1 COG:no KEGG:BF2907 NR:ns ## KEGG: BF2907 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1111 1 1111 1111 2189 99.0 0 MKKKCFYSGMTYLFCHSGANLLLVRTQILLFFFVFLLPLSGVASPLQEIRITIQQKNVPL SKVFKEIEEKTDCSFLIRNNDVNTNEKVSIDAKNKTVAEILGILFDGKGIKYEVNGKRIS 
VYKAVRQHTIGGKRKVTGQVTDNMEEAVIGASVFVMGASNGTITDMNGHFSLELPDDNAK LQVSYIGYKTQVINVGNKSSVNIVLVEDSKALEEVVVVGYGTQKKVNLTGAVETVKSDRL ANKPVTSIASALTGEAAGVTVTQNSGQPGPNQGTVRVRGIGTWGDASPLVLVDGVSMSLN DVIPSEVESVSVLKDAASAAIYGSRAANGVILITTKKGKEGKLTFNYSGNVGFQFATRVP ESVTSWQYAELYNQMQYNEGKSSSLFPQDRIDRMKAGGDPDKLEGNTDWYDELLRSGAPQ HNHQLTVSGGSDKITYMISAGYSDQQGIIPSTDYERYNLRVNTTSKLTSWLKLDVNMAYL NSTQEESAAGAAEAYRRTMRALPYLPVQFSDGTYSYDAAPSNPVRMVNGDYGMRRKNNDC MTLLIAPEINILDGLNIRGTFGYESNIYKEKIFNKTVTYGSFEPAGQSGLTEVSRNKQTD RWDQYRNLTANVTASYEKTIGKHDFKVMAGGSLETFKWAYTKASRMDFPNDDFGEINAGD ATTAAAEGNSTYSALASLFGRANYVYADRYLFEFTARYDGSSKFARGHRWGFFPSVSAGW RISEEAFFEPLKKHVQNLKLRASWGELGNQRINDYQFISNVGNGGSYLFGGTPIIGYKEA LMGNEIITWESSRNLDFGIDFALFDNRLQTTFDWYCRTTSDILLNLEAPGALGIKPAMEN AGKMENKGWDLTVSWRSNIGKDFKYNIGFNLSDVKNKVIDLRGYKSSTTELTAKIEGQPL NAIFGFETLGICDNQELYDKYAPMMQKYNPKWGMGDIIIKDRTGEGVINDEDRTVIGNSI PRFTFGLNLGFEYKGFDFSCFFQGVGKADGYVTMEAIQPMGINGARKEHYKESFNPQDPK PGAYFPRILSSDYNYAYMSHWVQDASYIRLKNLQIGYSFKIKGLNQLRVYASGENLFTAT KYRTWDPETPVGARGFYPNVAVYSMGVNLNF >gi|226332026|gb|ACIB01000030.1| GENE 36 39673 - 40704 959 343 aa, chain - ## HITS:1 COG:SMc04204 KEGG:ns NR:ns ## COG: SMc04204 COG3712 # Protein_GI_number: 15965785 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Sinorhizobium meliloti # 140 305 157 319 354 84 32.0 4e-16 MEMMEKKLDTFEELMLDFLAGKLSEDGERKLLHFLQSDISYQQRYKEMARTRAKSFIGKF EQEKQADYEALSVKLGIKKKSEKKRIPLWSTFSQVAAIALLILTTSIAGYYIYNDVAESN QEMALCQMEVPLGSQTKVILPDGSVVCLNSGSVLKYDPAFLRKKNREVYLIGEGYFEVQK NPEKPFIVHADDINVKVLGTVFNVRSYPEDSEIEVSLIKGKVNVFSTSETRDNVILAPDE QLTYDKRSGKMNHHHVDALQTSQWTTGRLSFVNASVPEIMKAIERKYDVRIVIHSKYLDK EVFSGSISPKLTVEEILDYMDVDNKYSWSRSGNVITITDKLIK >gi|226332026|gb|ACIB01000030.1| GENE 37 40914 - 41510 458 198 aa, chain - ## HITS:1 COG:no KEGG:BF2910 NR:ns ## KEGG: BF2910 # Name: not_defined # Def: putative ECF sigma factor # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 198 1 198 198 390 100.0 
1e-107 MTTTRISEQIIDDINEGKESAFSALYDCYYSYLCAYATTYVFDPDEAKEIVNDVFMNIWS SRGQLSFPIHNYLLRSVQNRCLNYIRTLHTRERVLDEYREELIAFQEEFCKNDNNPLQLL EIEELKSQVNTVIDSLSVKCRLVFEKYLYEGMSPQEIADEQAISVNTVRVHIKNAMDHIK LQLGPTAGILLLFLYGRL >gi|226332026|gb|ACIB01000030.1| GENE 38 41813 - 41968 60 51 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNPISFKLYKRVLFLDEESCKKHLKKQREKQGIIYLKCGRVHHYWNKNRNK >gi|226332026|gb|ACIB01000030.1| GENE 39 42361 - 42597 244 78 aa, chain + ## HITS:1 COG:no KEGG:BF2912 NR:ns ## KEGG: BF2912 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 78 1 78 78 117 100.0 9e-26 MCKEKEFPMNFVPVKSIHLSLVEEKDKFFIIDEVSSKVLITFNNEINAQNAMKQITDNLS TTINHILTISQQNIQILE >gi|226332026|gb|ACIB01000030.1| GENE 40 42599 - 43192 536 197 aa, chain + ## HITS:1 COG:BS_ydeA KEGG:ns NR:ns ## COG: BS_ydeA COG0693 # Protein_GI_number: 16077578 # Func_class: R General function prediction only # Function: Putative intracellular protease/amidase # Organism: Bacillus subtilis # 3 195 2 184 197 125 38.0 5e-29 MKQEVLFIILNEYADWESAFLAASLHSGLMPGSEIKYVVKTVAPTLEAVRSLGGFRTLPD YSFDTMPSDYAGLVLIGGMQWQSPEAERIFPIVQDAFEKGKVIGGICNAASFLCAHGFLN HVKHTGNTLAVLKQWGGERYTNEDGYLEKQAVGDKNIVTANGTGYLEFTRELLLALKADT QEKIEAFYDFSKNGLVR >gi|226332026|gb|ACIB01000030.1| GENE 41 43212 - 43328 70 38 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|255010491|ref|ZP_05282617.1| ## NR: gi|255010491|ref|ZP_05282617.1| hypothetical protein Bfra3_15239 [Bacteroides fragilis 3_1_12] # 3 37 89 123 243 66 85.0 6e-10 MGQHKLSDDSLLAIEGNFMETRMSNWKRGEENILFTIP >gi|226332026|gb|ACIB01000030.1| GENE 42 43547 - 43723 106 58 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MHVYLFFYSILTPKGYVKSQVLIGHAKDYFMLTFFTLYIRDSSVCIESEGFNQTISYY >gi|226332026|gb|ACIB01000030.1| GENE 43 43773 - 44165 311 130 aa, chain - ## HITS:1 COG:MA1602 KEGG:ns NR:ns ## COG: MA1602 COG0494 # Protein_GI_number: 20090460 # Func_class: L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases 
including oxidative damage repair enzymes # Organism: Methanosarcina acetivorans str.C2A # 1 127 1 129 132 124 46.0 4e-29 MKSIEVVAAVIRLGEKYLCVQRGQTKFSYTSFRYEFPGGKVEEGESLQEALQREIMEEMD YVIEVGEKLLTVHHTYPDFEITMHAFLCHPVGQRYVLKEHIAAQWLSTREMAILDWAEAD KPIVRKISEQ >gi|226332026|gb|ACIB01000030.1| GENE 44 44331 - 46283 1320 650 aa, chain + ## HITS:1 COG:mcrB KEGG:ns NR:ns ## COG: mcrB COG1401 # Protein_GI_number: 16132167 # Func_class: V Defense mechanisms # Function: GTPase subunit of restriction endonuclease # Organism: Escherichia coli K12 # 96 419 14 332 465 130 29.0 1e-29 MDFEKITKEAVVQALLEIKKNGIPKNAHSSTYDILYKGKRYPPKLVMEYAYQHSTGKQIT RNDFEGGEKTPCFNRLKELGFTIVHKEKNPNFYETLTKFLEQTKTKSQKYKDYPKEFLGL NISVSFGIGGPSKTPWISFLYDGQKTQHGIYPVYLYYKEYNTLFLAYGVSETTPPNIEWK FNTSPETIFDFFKRKGIKPKRYGNSYVFKDYDINNLNPSTINKDLFEIIENYKNLMASYT EISTNTNKMQSPSINIDYQSIPANLRQFVTAIKSKPFILLAGISGTGKSRIVRQLAYATG GENPEKVQKPYNYEMISVHPNWHDSTELLGYVTRVSGNPEYIVTDFLKFIAKAWFYEGIP FFLCLDEMNLAPVEQYFAEYLSVVESRKLRNNKIVTDPIVPPLTTWDKADQKTLVSDQIL KELFHEFWNNEGWNNSTIAARIEELKEQFKNTGISIPQNLIVMGTVNMDETTYSFSRKVL DRAMTIEMNHVDLNSGLSKAQDNITPITAKALLPEVVEGYDVYEQNPEICKSIIKYLQQV NEKLEDSPFKIAYRTRNEFLIYVLANLPYQGKETQEQCITRALDEITCMKILSRIEGDKN KVGTVLDDLYTIIQKRIETSGINSEKSISLDKIRRMKKKLETAYYCDFWN >gi|226332026|gb|ACIB01000030.1| GENE 45 46300 - 48528 903 742 aa, chain + ## HITS:1 COG:MTH502 KEGG:ns NR:ns ## COG: MTH502 COG1700 # Protein_GI_number: 15678530 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Methanothermobacter thermautotrophicus # 140 582 130 561 567 82 24.0 2e-15 MEALLTLNHPDYKLMVYCPSYDKIFKKAQTCMRVKQNDETLYSIYTWDKNAFTQINGEQL IEGKRHPSIFFENTDYQFWINFKRIDIEDAWINTPLRNIQDNFMFNQENQILFGHLNYGN DIGRAEFNIDYKLKNGEIRHSKLGYDVLSIKLDYHKDLKKIVDDIEKEYRMLSIDFLRKT YHTFDVDVVGETPDIIWWNLFKDIQTNFIQAVKTIVDCPRNRLVQKETYLRADKLKRLTP QLEIQLTEHRKNPAHLYRTEHPIACVDTMENRFLKYCIIFIAEKFSDLKCRIINSYKQLT EKYIINLNQQQEELQRLVHHPFFRTVGEFRGFTQESLILKQATGYSEIYHDWIILSCGYD LKEGANSLELKDIAKLYEIWCFIELKNIIKELLGNEVETNYSKRPEKKFITQLGKGKLSK 
VIFSKHNIELAKICYNPKSYVGEEQLSTIIPCTTSYTVSQQPDIILQLTRRDIKQGIKLT YLFDAKYRIGDTQDNVDTPPDDAINQMHRYRDAIYYIDQDTKQLKKEVIGGYILFPGNGE DSAVEEMNFYKSIGKVNIGAFPLRPQDNESKELLRGFLKRLIWECPTYQILEQTFTHKET MLTFNQPGSVLLVPIAKSRSYFKDFNERSKVIHYYFGKIKMGGVYEHLNFQKYDYFIPVI EGKIRDVYRIEYASVRHRNDIPGIEPKENVDLSDFRIYMKLTSPSYLTGAKRLIPIRHSR IDYKCFFKEYNSITEVINLITE >gi|226332026|gb|ACIB01000030.1| GENE 46 48515 - 48763 198 82 aa, chain - ## HITS:1 COG:BS_ysdA KEGG:ns NR:ns ## COG: BS_ysdA COG3326 # Protein_GI_number: 16079936 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Bacillus subtilis # 1 73 10 84 89 58 46.0 2e-09 MNIVAFVMYGIDKRHASRKRWRTPEVRLQGIAVVGSLGARVGMYIFRHKTWYFKFKYGIP VIEGLQVGIAVYYWYLTISIQL >gi|226332026|gb|ACIB01000030.1| GENE 47 48884 - 49801 457 305 aa, chain + ## HITS:1 COG:YPO2519 KEGG:ns NR:ns ## COG: YPO2519 COG3129 # Protein_GI_number: 16122739 # Func_class: R General function prediction only # Function: Predicted SAM-dependent methyltransferase # Organism: Yersinia pestis # 3 305 25 321 336 314 50.0 1e-85 MAERNELHKRNRHNGQYDFSRLTEEYPPLKKFIVLNAYGTTSIDFFNPRAVKALNKALLI SYYGIRYWDIPKNYLCPPIPGRADYIHYIADLIQPDISDESTGLKTAVPNTRQYRCLDIG VGANCIYPIIGQTEYGWTFVGSDIDPVSIDNARKIVTCNPALAHKIELRLQRDSRKIFEG IIAPNEYFDVTLCNPPFHSSKEEAEDGTLRKLSSLKGKKVTKARLNFGGNANELWCEGGE LRFLLTMIEESRNYRKNCGWFTSLVSKEKNLGKLTAKLKSTDIAEHRIIEMHQGTKTSRI LAWRF >gi|226332026|gb|ACIB01000030.1| GENE 48 49679 - 49951 127 90 aa, chain - ## HITS:1 COG:no KEGG:BF3082 NR:ns ## KEGG: BF3082 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 90 1 90 90 176 100.0 2e-43 MLKHAFRNLLVCFDKQLNVNDIFRWQSLSCEDRLPKGFESILADLITLSFSKSPCQDPAG FSTLVHLNDSMFGYIGRFQLGGQFTQVLFF >gi|226332026|gb|ACIB01000030.1| GENE 49 50081 - 50554 271 157 aa, chain + ## HITS:1 COG:no KEGG:BF3083 NR:ns ## KEGG: BF3083 # Name: not_defined # Def: arsenate reductase # Organism: B.fragilis # Pathway: not_defined # 1 157 1 157 157 326 98.0 2e-88 
MNLLIISNADSCRSRIAQALLSSFGKGMKVYSAGTMPAAEIHPLVLKLIKETGIEPNTQP PHSIREYTNENWDHIIVLSGTADDIRNLFRKEVKHWYHLPFEDLFSTAAPSEAELWDRLI RLKADIQRKMYELYRDDLREQLLPRCSCGANDFCRCE >gi|226332026|gb|ACIB01000030.1| GENE 50 50560 - 54225 3276 1221 aa, chain - ## HITS:1 COG:no KEGG:BF2922 NR:ns ## KEGG: BF2922 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1221 1 1221 1221 2065 99.0 0 MRYLNKIIFINSARIQYAEIQIDGNVHFIGTQGVGKSTALRALLFFYNADKTKLGISKEK KSFDEYYFPYVNSYIIYEVVVDDASYCVLAFRSQGRVCFRFLGTGYKKEYFISPEGKAYE EWDQIRDALGSFVYKSRRIETYEEYRDIIFGNGRGLPPEFRKFAITESRQYQNIPRTIQN VFLNSKLDAEFIKQTIIMSLNEEDVRIDLGQYAHHLRRFDEEVTDISKWFRKNKNGEVTV RRQADRVIELYREMHYLEQQARTLAGELNYAFRTAREVLPSLQKQKEELLKEVAKEKRQL EELSGKFQSERDRLLGVVKVQNENLKTARERKERYESQDIHHVRERVNAESEAVLHKQML EEQLTMLTARFDDINSQYKLLKEQAASAFERFRNGKNAELNTLHLRAIERKEAIRKEFDK ILKEVREQEVGKLTLLREQTEKKKEGIYQLKMEREKCLHRTLYEEELQACRTERAALEKE NHEHRLKQKEAEQQMELVRRRWELDQTACEQQFTARQKDIEQQITLAKERVAEIDRLLDN RQGSFYEWLSQNCQGWEDTIGKVVDEKQVLFSKELKPQSVPDAGQDSFYGIKLDLSAIRK EVKSVQEYAADQEIAGKEVSEWQQQLEKLGEEKEKELALIRKRHQATLSTCKEDVAQSTY CMEQNDKRIRLLKADEIRWEQKAGEEKRVLLEQLDKQLAEATRSLQGTVAELEQFNHLLE TRVRQKEKERNQRMQEEDALIRNKQEEIHLSIASEKQKTDELLATMDKDLLHELSGKGVD TERITALRNEVASTEQELVFIDQHRRLVYDYEKDKREFFDRMDEFRNEKQLAEKELEGEK EKFRLKEEELNLKVTGLNKQLAETNTRLKHLDEDIRETENFKTLNICPPVLQTGVETENA KRCKKVIEELTQNHYRQIEQEKDFREAVGRFSGNFSEQNTFNFRTRLTKREEFMTFAADL KEFIDNDKIIDFERRSNEAYTDLIHRIGKETTDMLSKEGLIRKTISDINSDFVERNFAGV IKSIALQIVPSGNRIMQHFVTIKEFCDRSQPGGPGMFSLFGQEDFVERNRQAVSLLQSLV KELALNKEKELTLSDTFELQFRIVENDNDSGWVEKLANVGSDGTDILVKAMINIMLLNVF KEKASHQFNDFKLHCMMDEIGKLHPNNIKGILDFANNRNIILVNSSPTSYRASDYKYTYI LNKDKRNITSVTRLIKQEVRE >gi|226332026|gb|ACIB01000030.1| GENE 51 54209 - 54769 750 186 aa, chain - ## HITS:1 COG:no KEGG:BF3085 NR:ns ## KEGG: BF3085 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 186 1 186 186 343 100.0 2e-93 MIKQTAEIFEILSKGGFISSDSTNPNIRQLYTVIEDNQSELYDFFAAINFVLESGNEYYY 
FSRRENKVDLERKLEIAVRWIDVLDFIKTYDAAFSSGFRFQPADMVVKVGTDLELKEKLT GLKKLTGREKHEEMIDKIVNDLKRDGFIELENEITSTYKVVAAFGYLEELVACINIPEEI QNEIPE >gi|226332026|gb|ACIB01000030.1| GENE 52 54893 - 56107 1067 404 aa, chain - ## HITS:1 COG:no KEGG:BF3086 NR:ns ## KEGG: BF3086 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 404 1 404 404 692 99.0 0 MTTFHSIDELLKMMSREQLLLKQMFGKRKQQSFRREYALELTEYKLQRIQSLIDHGVLRE NGSFLEMEDIYLHFFEQVLEVNEEINTSFVNEHISYLKDTISYYQQENHEKRKTTYLRTI KRILRNIALTTLRNVIDLKRNIDSTFKNEPNYQIKKKKLVRLDEKRRDIEALIRVSEELL VTEEDRFFRRVPDDELVLVVANVRIQLNECFHNLIEIQKQIISYLNRIEYQNKIRVKIRQ LKYLKDQFELEERTDICRVLMQKDSVWFEPAPAYPLRLSVEYLRNDDEVLESIRKVIATV GKRAILARNVADSIAEESLETHISEEAYINLEEVKKRFMQADGDLFSFVTAYPFTQEVSF DERITFYCQIVSQFYEELRITDEYGRMDGVEYALIYPERTGEIV >gi|226332026|gb|ACIB01000030.1| GENE 53 56139 - 57071 870 310 aa, chain - ## HITS:1 COG:FN1744 KEGG:ns NR:ns ## COG: FN1744 COG0697 # Protein_GI_number: 19705065 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 16 286 12 285 293 75 25.0 2e-13 MNINSKAKGFVCGAVAAATYGMNPLFTLPLYKEGMSVDSVLFYRYGFAVLILGILMKVQG QSFALKKNEVLPLIVGGLLFSASSLLLFLSYKHMDAGIASTILFVYPVMVALIMFLFFHE KVSLLTVFCILLALSGIGLLYKGEGGETLSLVGMLLVILSSLSYAVYIVGVNHSTLKLMS TAKLTFYALLFGLSIYIVRLNFCSDLQAVPSLPAWGNILAMAFLPTVISLVCTAVSIHTI GSTSTAILGALEPVTALFFGVMIFGERLTPRLMLGILMILVAVTFIVVGKPLMTFLKEEV AGGMKRKLTR >gi|226332026|gb|ACIB01000030.1| GENE 54 57157 - 57453 182 98 aa, chain + ## HITS:1 COG:no KEGG:BF3088 NR:ns ## KEGG: BF3088 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 98 1 98 98 174 100.0 9e-43 MSKNRIDQEKRVVELMIRLYCRKKEKNVTLCPRCEELLHYAHARLDHCPFGEKKKACKQC SIHCYKPAMREQMRRVMRFSGPRMLIYAPWEAIKHLLG >gi|226332026|gb|ACIB01000030.1| GENE 55 58098 - 58367 120 89 aa, chain - ## HITS:1 COG:VC0042 KEGG:ns 
NR:ns ## COG: VC0042 COG0168 # Protein_GI_number: 15640074 # Func_class: P Inorganic ion transport and metabolism # Function: Trk-type K+ transport systems, membrane components # Organism: Vibrio cholerae # 5 88 397 480 481 68 45.0 3e-12 MGMFFAFYLIIVILGWVVLLFLGVGFSESIGTVISSIGNVGPGLGSCGPAYSWNGLPDAA KWVLSFLMLIGRLELFSVLLLFYPGFWKS >gi|226332026|gb|ACIB01000030.1| GENE 56 58409 - 59596 1082 395 aa, chain - ## HITS:1 COG:MA1483 KEGG:ns NR:ns ## COG: MA1483 COG0168 # Protein_GI_number: 20090342 # Func_class: P Inorganic ion transport and metabolism # Function: Trk-type K+ transport systems, membrane components # Organism: Methanosarcina acetivorans str.C2A # 44 393 26 370 476 236 42.0 4e-62 MEEYLSTDIYVSKRNNSLINGKMIGRVLGVLLFIEAGMFVLCSGISVVYGESDYKYFLYT VGINLLSGALLMFYGRGAENRLSRRDGYCIVTLSWVFFTLFGMLPFYLSGSIDSLTNAFF ETMSGFTTTGATILDDIESLSHGMLFWRSLTQWIGGLGIVFFTIAVLPVFTSGGVQLFSA ESTGVIHDRTHPKINVMAKWLWTVYLILTLAETILLMLGGMSLFDAVCQSFATTATGGYS TKQASISYWNSPFIEYVVAIFMLLSGVNFALFLMCLRGKVSRLLRDEELRWFLGSVAILT FLITFALVFQNHYDWETAFRKSLFQVATAHTSCGFATDDYNLWPAFTWLLLLIAMLSGGC TGSTSGGIKNMRLLIIARSIRNEFKHLLHPNAVCR >gi|226332026|gb|ACIB01000030.1| GENE 57 59800 - 60255 125 151 aa, chain - ## HITS:1 COG:PA0248 KEGG:ns NR:ns ## COG: PA0248 COG2207 # Protein_GI_number: 15595445 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 23 148 155 285 288 60 30.0 1e-09 MYYHFIYHRYLDAECPSEVIKGMLFSLVLEVCRMYSGRNISVEMSRQDKLVDGFFSLLHK YCTQERMAAFYASRLCISDKYLMRSIKKQTGQTFHYWMADFILREAKLMLRSTDLSVTEI ADKLSFPNSSSFARFFRKYTGFSPVQFRNEA >gi|226332026|gb|ACIB01000030.1| GENE 58 60786 - 61958 677 390 aa, chain + ## HITS:1 COG:ECs1866 KEGG:ns NR:ns ## COG: ECs1866 COG0477 # Protein_GI_number: 15831120 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli 
O157:H7 # 14 374 10 371 387 223 40.0 7e-58 MTKRIGNSRLYILIFIGIVSAFGPFVTDFYLPALPVLSEYFDTTASLVQLSLTFSMVGLA VGQLIIGPLSDKYGRKLPLMVSLVIFCISTVGCLYSPEIHGFIFARLLQGLSGAGGVVIS KSIAIDLYQGKELTRFFAMLSSVQGLAPVCAPVLGGILLGAMDWKGIFWILLAIGILLIV ALSAFKESLEIKKRQKGNVFSTFKYYLPVLRNRQFMRYVLIQAFAMGVMFTYIAASPFIF QNHFGTSPFAYSLCFGVNALGIMLGSLAVSRFKDATAALRFGVAGFTTMSLPVAAALIFS PSVWIVEGTLFFLLAFLGLILPGSTTLALDMERKNSGNASALLGFLMFVFGGLLSPLTGI GNMLYSTGIIIVACCVGTWFFTYKATSSAR >gi|226332026|gb|ACIB01000030.1| GENE 59 62074 - 62838 197 254 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163797523|ref|ZP_02191474.1| 50S ribosomal protein L9 [alpha proteobacterium BAL199] # 5 244 8 257 259 80 27 3e-14 MKRAIIIGATSGIGREVAKQLLLQGWRLGIAGRRLPALEALQSSAPDLIEIAVLDVTQPD ATPKLNNLIRRVGGMDLFLLSSGIGYQNMELNPDIELDTARTNVEGFMRMADTAFHHFRE HGGGQLAVISSIAGTKGLGVAPAYSATKRFQNTYIDALEQLAGMQKLNIRFTDIRPGFVS TDLLNDGKHYPLLMRPEKVAKRIVRALNHHQRVVVIDWRYAIIVFFWRMIPRRIWKQLPI RTGQKPTEKNSKIG >gi|226332026|gb|ACIB01000030.1| GENE 60 62895 - 63737 584 280 aa, chain + ## HITS:1 COG:RSp1247_1 KEGG:ns NR:ns ## COG: RSp1247_1 COG2207 # Protein_GI_number: 17549468 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Ralstonia solanacearum # 1 113 32 144 153 87 40.0 3e-17 MKSDTEEKYLQQVNRVIDYINSHLNEPLRVETLAREVCLSEYHFHRIMRAYLHEPLATYI ARQRVERAVMYLQMKNIRLAQVAEMVGYETPQSLSKAFKQFFGISPTAYRKRRAERYEEF STLKKESLKPEILTEPELKLVYIRIIGRYGEEEPYIEAWRKLRDFLQINGLLTPSTRWIG ISFDDPTVTKTEQCRFYACATVEHDVSPQGAFGMKTIPQGRYAVYTLRGSYSGLQEMYDR IYSYPLPTAFRDATSFEEYLNCEPDTEEKDYVTRIYIPIE >gi|226332026|gb|ACIB01000030.1| GENE 61 63759 - 64028 259 89 aa, chain + ## HITS:1 COG:no KEGG:BF3095 NR:ns ## KEGG: BF3095 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 89 1 89 89 162 100.0 2e-39 MEQKFCQSCGMPLNPEVLGTEKDGSKNEEYCTYCYADGHFTVECTMDEMINQCAQFVDEF NKGSEVKMTKEEAIANMKQFFPMLKRWKQ >gi|226332026|gb|ACIB01000030.1| GENE 62 64317 - 64580 361 87 aa, chain - ## HITS:1 COG:CC0893 KEGG:ns NR:ns ## COG: CC0893 
COG2388 # Protein_GI_number: 16125146 # Func_class: R General function prediction only # Function: Predicted acetyltransferase # Organism: Caulobacter vibrioides # 8 84 7 83 89 63 42.0 1e-10 MEEYEVIHRPERNRFELEKNGMTAFVEYEVEDGALDIMHTIVPPPLEGKGIAAALVEATY KYASAQGLKPKATCSYAVAWLKRHPAE >gi|226332026|gb|ACIB01000030.1| GENE 63 64615 - 65940 1145 441 aa, chain - ## HITS:1 COG:all2390 KEGG:ns NR:ns ## COG: all2390 COG1090 # Protein_GI_number: 17229882 # Func_class: R General function prediction only # Function: Predicted nucleoside-diphosphate sugar epimerase # Organism: Nostoc sp. PCC 7120 # 1 278 1 298 306 175 35.0 1e-43 MNIAISGASGFIGKHLTEYLTEAGHRVIPLGRPMFREGTSGHLIQALSHCDVIINLAGAP ISKRWTPEYKKELYDSRIKVTHCIIRAMDAVKTKPRLMISASAVGYYPEEGTFDEYTNTR GSGFLAELCHAWEKEARRCPSQTRLVITRFGIVLSPDGGAMEQMLRPLRITRVAGVIGPG TQPFPWIAIQDLCRAMEFIISHEEASGVFNLVAPQQVSQYAFTQAMAARYHAWMKVVVPR WFFRMRYGEAASFLASGQNVRPTRLLEAGFRFAKPTIEDFFESTDHAAVDRLDLHRYMGT WYEIARFDHRFERGLSEVTATYTLMPDGTVRVENRGCKHKKPYDVCKTAEGRAKIPDPSQ PGKLKVSFFLSFYSDYYVLELDQDEYNYALIGSSSDKYLWILSRAPILPEEIKKKLLDAA TRRGYDVKKLIWVEQREKKIM >gi|226332026|gb|ACIB01000030.1| GENE 64 65989 - 66846 836 285 aa, chain - ## HITS:1 COG:yegX KEGG:ns NR:ns ## COG: yegX COG3757 # Protein_GI_number: 16130040 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lyzozyme M1 (1,4-beta-N-acetylmuramidase) # Organism: Escherichia coli K12 # 73 280 58 265 275 184 44.0 1e-46 MKSSRNTSTSGKPASGRRTPARKTKAKKKTTRTMPVWMRNTLALIVVGVFSLTFYYFVIR PYSYRWKECYGRKEYGVCIPCGYEVHGIDISHYQGNIDWKELKQNRETDFPLHFIFMKAT EGGDHGDDTFKDNFEQARRYGFIRGAYHFFTPRTDALKQADFFIRTVKLDSGDLPPVLDV ELTGKRPKKELQQNIKKWLDRVEAHYGVKPILYTSYKFKTRYLDDSLFNAYPYWIAHYYV DSVRYEGKWHFWQHTDIGSVPGIHHDVDLNVFNGSLEDLRKMTMR >gi|226332026|gb|ACIB01000030.1| GENE 65 66983 - 68686 1350 567 aa, chain - ## HITS:1 COG:no KEGG:BF3099 NR:ns ## KEGG: BF3099 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 567 1 567 567 1201 99.0 0 
MKRLSTLCLLAAGFFIPGQAHQARPTVLDSVYAPLKVATPPSDAYIGLSLLDNGEIRHYN YGEQAVAGTVYLSSTDHGLTWKRVNRPKEMPFADCQSPVSKEYIRLVDMGAMGVYCIRTS GGLTGGRTLTKVADRNSIMIKPPVFIRNGKRIVVAAHGGVTPKGCYTYFSDDDGLTWKCS NTVTSPDHQGGGFHKGIRWNHGAVEPTVVELKDGTLWMLMRTSQDFHYQAFSKDGGQTWG ESEPSPFYGTITMPTLGRLANGRLLLFWCNTTPLPEKEGTDGVWDDVFTNRDVTHVAVSD DDGKTWKGFRELYMDPMRNDIDYAVHGGGIDRGVHQAQFVEVAPGKVLASIGQHPLHRAM MMFDVNWLYEKSRFNDFTDSLSQWSTFNYMNGIKGHCAYNRIQGCMLEPHPRKEGRQVLH LTYRPDASLVADTRGAVWNFPAMKKGTFTVSLRIPEGSHSVSLLLNDRWMNPSDTVARYQ SMYELPLTRKQLGVKDDRWHEVSLEWDLLQKRPQARVRVDGRLRPLRLPLKNKSQNGISY VHFIAPPAEANPGVYLEWVRAEADGAI >gi|226332026|gb|ACIB01000030.1| GENE 66 68783 - 70426 1975 547 aa, chain - ## HITS:1 COG:TP0542 KEGG:ns NR:ns ## COG: TP0542 COG0205 # Protein_GI_number: 15639531 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphofructokinase # Organism: Treponema pallidum # 1 545 1 559 573 644 55.0 0 MTKSALQIARAAYQPKLPKALKGSVKAVEGAATQSVADQEAIQKLFPNTYGMPLIKFEEG EAIQLPAMNVGVILSGGQAPGGHNVISGLFDGIKTLNKDNKLYGFILGPGGLVDHNYMEL TADIIDEYRNTGGFDIIGSGRTKLETPEQFEKGLEIINKLGIKALVIIGGDDSNTNACVL AEYYAAKNAGVQVIGCPKTIDGDLKNEMIETSFGFDTACKVYSEVIGNIQRDCNSARKYW HFIKLMGRSASHIALECALQVQPNVCIISEEVEAKDMSLDDVVTYIAKIVADRAAQGNNF GTVLIPEGLVEFIPAMKRLIAELNDFLAANAVEFANIKRSRQREYIISKLSKENAEIYAS LPEGVARQLSLDRDPHGNVQVSLIETEKLLSEMVGTKLAQWKEEGKYVGKFAAQHHFFGY EGRCAAPSNYDADYCYSLGYTASMLIANGKTGYMSSVRNTTAPAAEWIAGGVPITMMMNM ERRHGEMKPVIQKALVKLDGKPFLTFAAKRDQWALNTDYVYPGPIQYFGPTEVCDQPTKT LQLEQAK >gi|226332026|gb|ACIB01000030.1| GENE 67 70641 - 73460 2061 939 aa, chain - ## HITS:1 COG:no KEGG:BF3101 NR:ns ## KEGG: BF3101 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 939 1 939 939 1900 99.0 0 MKKHLILMISAAIIPLLLYACSAEEERALTSTTLEVAQSAIDFKSDAGTRDIAIVTNADH WTARSDKDWCSVAVNESTLTVNVSGYDGKETREAVIKVTADGLAETVNVRQLGSEPAILI SQQIFTVEASGSDIAFDVTTNVSVTITLPEWIKEKPAGTRASEMVTTTHNYIVTANPEDS ERTGNITVKEVGGELEALVSVTQKGLGEYESGNLEGIKDDIKVPVESGEASSFQGGSNID 
KSFDGDMSTIYHSNWNNAGDHYFPITLTYNFAAGSDMDYLIYYPRTSGPNGNFKEVEIRV KSNANTRGTDEWNTVMTKNFGGTNAAVRVNFPKAQIGVTSVQFIVKSGSGDGQGFAACAE MEFYKKNPDAFDPLTLFTDGTCSELKPGLTDEEIENCPYSFYKNIAYYMKQGKYPAEFRI QEYKAWPHPDAQSETHKTSPYSLRDNPTGISVKDGEQLMIFVGDTHGQTVSAVIQNLDVP GGDGFGGTSYPLSEGANKITARNKGLMYILYHTPDYETAQPVKIHIASGQVNGYFDVAKH QASDWNKLLSNAVDKYFDVVGHYAHLTFPTERFRTHTTDGKALIDAYDQIVNSEMELMGL YKYNKLFKNRMYLHVMYTSYMYATSYHTAYNDGTLAELCNVDKLKTSACWGPAHEIGHCN QTRPGLKWLGTTEVTNNIMSEYIQTTIFGQPSRLQTEDMGDGSRNRYSKAWTQIIAAGAP HGNFGSDSDVFCKLVPFWQLELYFGKVLGRTPLQQSDKGGFYPDVYEYIRTHDNLRTAGE QQTEFVYICSLIAKANLLDFFTKWGFLTPVDITVDDYGTGKLTVTQARIDEIRSRVEALG YPKPDVALEYITDNSVELYKDKPGIVAGTATRSGSTFTMTNWKNVAAYEVVDETGKKVCI SDGLLAPSGTATFTMKTAWKDGFKVYAVSATGARTAVTF >gi|226332026|gb|ACIB01000030.1| GENE 68 73475 - 73609 71 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNLATYGCQIDAKYENQVIIKKEIQVKSFMRTKFVFPLFYLSYV >gi|226332026|gb|ACIB01000030.1| GENE 69 73829 - 74725 669 298 aa, chain - ## HITS:1 COG:no KEGG:BF2940 NR:ns ## KEGG: BF2940 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 298 3 300 300 588 100.0 1e-167 MKKILLGLCVILGLINMIGCSSDEKYPMPTSIDQNSLSAEAKAGAIKLKWTVPADSNYYY VKVTYTLPEDGKKCMRLASVNSDTMLVDNLLHRYGDINFTLQPCNRAGEASQSCSIMAQA LPALKQIKTDRNPITLSAKQLYTDDQESSEGPIANLVDGRNDTYFHMSWSSPTPFPHYIV VDLGEENALSTFLFSYVCRDNNNKDNPKEMDILGSNTFDGKNYDESQTTLLASLSNLPNT KAASYESDIIKAGASYRYLWFKVKSSTSGSNWIALAELGLSKVIVKTYDPETGETTIE >gi|226332026|gb|ACIB01000030.1| GENE 70 74747 - 76657 1626 636 aa, chain - ## HITS:1 COG:no KEGG:BF3103 NR:ns ## KEGG: BF3103 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 636 1 636 636 1301 100.0 0 MKIIYKLLFLFLLTIGYSSCDFLDIVPDEITTEADAFKDRNAAKNFVYSCYGYLPQSNVA SGSLDLLTGDEVITAFEHETFASFPKGNYTASSPVISYWNTFFQGLRQCYIFLENVDKVP DLTESVKTDYIAQVKFLIAYYHYQLARCYGPIILIKELPDVNASQAEYLPRTSYDECVDW ICNLLDEAASSLPAIRTNKEDYGLATSVGAKAVKAKMLLYAASPLFNGNSSFYSNFTDKS GSQLMPLTYDPNKWVKAKTAIKEAIDLAEQNGHALYVKQDYKIGNEDANPYPAAGPVRCL 
RTSLVDWDSRNPEVLLAETRSEGSYGIQNKSLPFVTDGWAWNGVGPTWTMLNRFYTKNGL PWDEDPEYKDKDKLKIVNVDASHADEAHEGSKTLLFNLDREPRFYAWVAFQGGYYEVMNG STNPAYVMSNGKKDNSDSRLICDFVLGGNCSRGTATVQRPGNYTPSGYLNKKGVDPNTVV STNATKLNQYPWPIIRLADLYLAYAEACVETNDLETAKQYLNKVRERAGIPTVETAWSGV AALDQTKLRQIVRQERMNELYLENQNFWDMRRWLLAGQYFNVKAKGLNIAATTIEDYAIV KTIDFERKFEAPTQYLLPIPSADINRNEKLVNNPGY >gi|226332026|gb|ACIB01000030.1| GENE 71 76667 - 80044 2442 1125 aa, chain - ## HITS:1 COG:no KEGG:BF2942 NR:ns ## KEGG: BF2942 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1125 1 1125 1125 2205 99.0 0 MNKFLFSCQKRCLKYIVMALLLYPLSALAAQGQIVVKGQSLTIPQAIRLIEKSSQYTFFY NANDLKNTTLKSIDCKGSIDDVLNEVFKGSNISYVIKGNEVILKVEKTESTQQKKAKIIG IVTDSKTGEPIIGATVQLLGTTTGVITDVDGKFELAAFPKNEIQISYIGYVTKKVKVGSQ KVMSITLAEDAQQLDEVVVTAFGTGQKKETITGSIQSVRPSDLLVPSANLSSSFAGRLSG VIAYQRSGEPGQNSADFFIRGVATMNGATSPLIILDGVEVSKADLNSLDPEVIESFSVLK DATASAMYGTRGANGVLIVKTKSGSDLDRPIIGVRLEGYVNTPTKKPEIVDGPTYMRLYN EAVTNQGTGAVLYSDEKINGTIHNLNPYIYPNVDWYKEVFKDATFNQKANFNVRGGTSKI TYFMNVNMNHETGMLKDRSSDFFSYKNNIDYMKYAFQNNVDFHLSKSSTISLHLNVQLND MHGPLTTKDGNGVGDIFSAIMGTNPVDFPVMFPQGSDTWYHWGGILAGNYQPLNPVALSS VGYKDTFESTVVANVNWDQKLDFITKGLSFRALVSFKNWSYNQKFRLQGYNSYQLSDYKQ NEDGSYDFTNTPIGEPSNHTMDAFFGTNGDRRFYIQGYLNYERSFGLHNVSGMLLYNQDD YNTNVNSSLIASLPKRKMGVAARLSYDYDHRYMLEVNAGYNGSESFAKGHRWGLFPSISL GWNISEEKFWKPIKPVISNFKVRGSYGLVGNDQIGSDRFAYLAIVNLTKSPSYTTGYGGS TTSLSGPTYNRFQNNELTWEVGNKLNVGVDLQLFNSLNITVDGFREIRDNIFQQKNSIPN YLGTASTKIYGNFAKVKNTGFDLALDYGKQLNRNFSIQMKGTFTYAHNEVLKYDEAAGLR PALSQVGKSLNSIWGYVADGLYIDEADIANNPQSTIGNIAIAPGDVKYVDQPDASGNYDG KITSDDRVVLGYPTIPEIIYGFGPSITWKNWDFSFFFQGQARVSFMMSGFEPFGTQSKNN VLKWISDDHWSKDNQNPNARYPRLTQYNNNNNTASSSYWLRNASFLKLRNAEIGYRFKWA RIYVNGSNLLTFSPFKLWDPEMGGGAGMKYPTQRTYNVGIQLTFK >gi|226332026|gb|ACIB01000030.1| GENE 72 80082 - 80711 379 209 aa, chain - ## HITS:1 COG:no KEGG:BF2943 NR:ns ## KEGG: BF2943 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 209 17 225 225 
414 100.0 1e-114 MDKYIEITAQKSRMYLLPDSSKVWMQPGSSIRFAEDFKKHRNVWLKGNSLFEVHKNMGRK FRVYIDKAFIEVKGTCFLIKQNNPSANEITLFNGSIEFNVESTQHKIEMKPLQELVYNPA DAGTQLRQIENIEWQNGRYNFTQFNLEHLTRIINQMYGSRIIISDKVNKNCAFTGSIRYD ESLEDVIDKICFSLNLRKKEINHEIIIYN Prediction of potential genes in microbial genomes Time: Tue May 17 22:56:03 2011 Seq name: gi|226332025|gb|ACIB01000031.1| Bacteroides sp. 3_2_5 cont1.31, whole genome shotgun sequence Length of sequence - 235115 bp Number of predicted genes - 190, with homology - 189 Number of transcription units - 96, operones - 50 average op.length - 2.9 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 41 - 100 4.2 1 1 Tu 1 . + CDS 200 - 748 474 ## BF3107 RNA polymerase ECF-type sigma factor - Term 619 - 654 -0.5 2 2 Op 1 9/0.000 - CDS 767 - 2143 464 ## PROTEIN SUPPORTED gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 3 2 Op 2 27/0.000 - CDS 2147 - 5293 2930 ## COG0841 Cation/multidrug efflux pump 4 2 Op 3 . - CDS 5387 - 6577 1267 ## COG0845 Membrane-fusion protein + Prom 6696 - 6755 4.5 5 3 Tu 1 . + CDS 6896 - 9106 1870 ## COG1752 Predicted esterase of the alpha-beta hydrolase superfamily + Term 9152 - 9218 11.0 + Prom 9259 - 9318 5.4 6 4 Op 1 . + CDS 9355 - 9624 335 ## BF3112 hypothetical protein 7 4 Op 2 . + CDS 9627 - 10184 485 ## COG0204 1-acyl-sn-glycerol-3-phosphate acyltransferase 8 4 Op 3 . + CDS 10191 - 10961 403 ## COG0388 Predicted amidohydrolase + Prom 10965 - 11024 4.2 9 5 Op 1 12/0.000 + CDS 11099 - 12265 1140 ## COG1820 N-acetylglucosamine-6-phosphate deacetylase + Term 12290 - 12339 9.1 + Prom 12428 - 12487 6.5 10 5 Op 2 . + CDS 12545 - 14536 2003 ## COG0363 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase + Term 14566 - 14610 12.5 + Prom 14538 - 14597 2.9 11 6 Tu 1 . + CDS 14631 - 15353 606 ## COG2365 Protein tyrosine/serine phosphatase + Prom 15372 - 15431 3.1 12 7 Tu 1 . + CDS 15455 - 18364 2140 ## BF3118 xanthan lyase + Prom 18746 - 18805 7.2 13 8 Tu 1 . 
+ CDS 19001 - 19213 121 ## BF2957 hypothetical protein + Prom 19372 - 19431 4.7 14 9 Op 1 . + CDS 19466 - 19786 446 ## BF2958 hypothetical protein 15 9 Op 2 . + CDS 19783 - 19998 136 ## BF3123 hypothetical protein 16 9 Op 3 . + CDS 19995 - 20339 379 ## BF2960 hypothetical protein 17 9 Op 4 . + CDS 20344 - 20820 493 ## BF3125 hypothetical protein 18 9 Op 5 . + CDS 20846 - 22702 1576 ## COG1032 Fe-S oxidoreductase - Term 22672 - 22720 11.3 19 10 Tu 1 . - CDS 22727 - 26104 3059 ## COG1197 Transcription-repair coupling factor (superfamily II helicase) - Prom 26141 - 26200 2.9 20 11 Op 1 . + CDS 26297 - 27049 771 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 21 11 Op 2 . + CDS 27053 - 28432 1314 ## COG0044 Dihydroorotase and related cyclic amidohydrolases 22 11 Op 3 . + CDS 28499 - 28771 133 ## BF2966 hypothetical protein 23 11 Op 4 . + CDS 28808 - 29746 816 ## COG1410 Methionine synthase I, cobalamin-binding domain 24 11 Op 5 . + CDS 29803 - 30723 814 ## BF3132 hypothetical protein 25 12 Op 1 . - CDS 30912 - 31799 911 ## BF3134 hypothetical protein 26 12 Op 2 . - CDS 31848 - 32693 834 ## BF3135 hypothetical protein 27 12 Op 3 . - CDS 32742 - 33878 1026 ## BF3136 hypothetical protein - Prom 33931 - 33990 7.6 + Prom 33926 - 33985 5.9 28 13 Tu 1 . + CDS 34170 - 34598 151 ## BF3138 hypothetical protein 29 14 Tu 1 . - CDS 34666 - 35148 398 ## BF3139 hypothetical protein - Prom 35275 - 35334 4.4 + Prom 35123 - 35182 4.0 30 15 Op 1 . + CDS 35243 - 35440 67 ## BF3140 hypothetical protein 31 15 Op 2 . + CDS 35437 - 35949 642 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 32 15 Op 3 . + CDS 35953 - 36537 530 ## BF3142 hypothetical protein 33 16 Op 1 . + CDS 36649 - 37815 850 ## COG1408 Predicted phosphohydrolases 34 16 Op 2 35/0.000 + CDS 37812 - 39536 258 ## PROTEIN SUPPORTED gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P 35 16 Op 3 . 
+ CDS 39539 - 41305 175 ## PROTEIN SUPPORTED gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P - Term 41460 - 41486 -1.0 36 17 Tu 1 . - CDS 41611 - 41829 76 ## BF2980 hypothetical protein 37 18 Op 1 28/0.000 - CDS 41919 - 44882 2571 ## COG0419 ATPase involved in DNA repair - Prom 44941 - 45000 3.5 38 18 Op 2 . - CDS 45030 - 46262 1223 ## COG0420 DNA repair exonuclease - Prom 46341 - 46400 4.7 + Prom 46261 - 46320 3.1 39 19 Tu 1 . + CDS 46351 - 47109 650 ## COG0204 1-acyl-sn-glycerol-3-phosphate acyltransferase + Term 47143 - 47204 14.5 - Term 47128 - 47194 17.0 40 20 Op 1 9/0.000 - CDS 47239 - 50442 2833 ## COG0841 Cation/multidrug efflux pump 41 20 Op 2 9/0.000 - CDS 50463 - 51956 1624 ## COG1538 Outer membrane protein 42 20 Op 3 27/0.000 - CDS 51959 - 55036 2680 ## COG0841 Cation/multidrug efflux pump 43 20 Op 4 . - CDS 55063 - 56133 1074 ## COG0845 Membrane-fusion protein - Prom 56153 - 56212 3.0 - Term 56274 - 56309 -0.5 44 21 Op 1 . - CDS 56534 - 58222 993 ## BF3153 hypothetical protein 45 21 Op 2 . - CDS 58222 - 59112 541 ## COG0681 Signal peptidase I 46 21 Op 3 . - CDS 59140 - 60168 431 ## BF3155 hypothetical protein - Prom 60216 - 60275 7.5 47 22 Op 1 . - CDS 60277 - 60843 279 ## BF2991 hypothetical protein 48 22 Op 2 . - CDS 60840 - 61850 530 ## BF2992 hypothetical protein - Prom 62030 - 62089 2.5 49 23 Op 1 . - CDS 62157 - 62615 195 ## Cpin_2456 hypothetical protein 50 23 Op 2 . - CDS 62612 - 63778 309 ## Cpin_2455 hypothetical protein - Prom 63803 - 63862 6.3 - Term 63808 - 63860 1.1 51 24 Op 1 . - CDS 63901 - 64146 265 ## gi|253564944|ref|ZP_04842400.1| predicted protein - Prom 64224 - 64283 2.8 52 24 Op 2 . - CDS 64393 - 66363 1283 ## BF3158 hypothetical protein - Prom 66464 - 66523 9.9 - Term 66428 - 66476 1.1 53 25 Tu 1 . - CDS 66528 - 68489 1091 ## BF3159 hypothetical protein - Prom 68529 - 68588 2.7 54 26 Op 1 . - CDS 68636 - 69502 441 ## BF2999 hypothetical protein 55 26 Op 2 . 
- CDS 69512 - 70399 405 ## BF3000 hypothetical protein - Prom 70419 - 70478 1.7 56 27 Op 1 . - CDS 70515 - 71507 571 ## BF3162 hypothetical protein - Term 71518 - 71560 -0.8 57 27 Op 2 . - CDS 71575 - 71856 291 ## BF3163 hypothetical protein - Prom 71876 - 71935 4.1 58 28 Tu 1 . + CDS 71893 - 72060 58 ## 59 29 Op 1 . - CDS 72020 - 73081 554 ## BF3164 hypothetical protein 60 29 Op 2 . - CDS 73088 - 73297 196 ## BF3165 hypothetical protein - Prom 73381 - 73440 8.1 + Prom 73278 - 73337 5.3 61 30 Tu 1 . + CDS 73528 - 74631 377 ## BF3166 hypothetical protein 62 31 Tu 1 . - CDS 75117 - 75503 417 ## BF3006 hypothetical protein - Prom 75595 - 75654 7.4 63 32 Op 1 . - CDS 75669 - 77429 606 ## BF3007 hypothetical protein 64 32 Op 2 . - CDS 77473 - 78201 625 ## BF3169 hypothetical protein - Prom 78411 - 78470 4.7 + Prom 78250 - 78309 5.9 65 33 Op 1 . + CDS 78440 - 79675 1088 ## COG0641 Arylsulfatase regulator (Fe-S oxidoreductase) 66 33 Op 2 . + CDS 79717 - 81882 2211 ## BF3010 hypothetical protein 67 33 Op 3 . + CDS 81885 - 84050 1809 ## BF3011 hypothetical protein - Term 84036 - 84104 11.3 68 34 Op 1 . - CDS 84157 - 84339 249 ## BF3012 hypothetical protein 69 34 Op 2 . - CDS 84336 - 85460 709 ## COG1819 Glycosyl transferases, related to UDP-glucuronosyltransferase 70 34 Op 3 . - CDS 85457 - 86263 630 ## COG2908 Uncharacterized protein conserved in bacteria - Prom 86387 - 86446 5.8 + Prom 86315 - 86374 5.1 71 35 Op 1 4/0.000 + CDS 86438 - 86734 370 ## COG0526 Thiol-disulfide isomerase and thioredoxins 72 35 Op 2 . + CDS 86744 - 87211 419 ## COG0526 Thiol-disulfide isomerase and thioredoxins + Term 87228 - 87293 6.4 73 36 Tu 1 1/0.125 - CDS 87592 - 88227 521 ## COG0778 Nitroreductase - Prom 88249 - 88308 6.7 - Term 88296 - 88343 10.1 74 37 Op 1 . - CDS 88365 - 88922 728 ## COG1592 Rubrerythrin 75 37 Op 2 . - CDS 88944 - 89381 295 ## COG0735 Fe2+/Zn2+ uptake regulation proteins - Prom 89408 - 89467 6.5 - Term 89463 - 89510 12.0 76 38 Op 1 . 
- CDS 89533 - 89880 389 ## COG3152 Predicted membrane protein - Prom 89963 - 90022 10.4 77 38 Op 2 . - CDS 90061 - 92172 2185 ## COG0339 Zn-dependent oligopeptidases - Prom 92228 - 92287 4.0 + Prom 92190 - 92249 5.8 78 39 Op 1 . + CDS 92283 - 94208 1549 ## COG0171 NAD synthase 79 39 Op 2 . + CDS 94278 - 94814 530 ## BF3183 hypothetical protein + Prom 94821 - 94880 2.2 80 40 Op 1 . + CDS 94920 - 98075 2908 ## BF3184 hypothetical protein 81 40 Op 2 . + CDS 98106 - 99578 1299 ## BF3185 hypothetical protein 82 40 Op 3 . + CDS 99611 - 100615 776 ## COG1409 Predicted phosphohydrolases - Term 100666 - 100729 9.3 83 41 Op 1 13/0.000 - CDS 100754 - 101878 954 ## COG0131 Imidazoleglycerol-phosphate dehydratase 84 41 Op 2 19/0.000 - CDS 101884 - 102921 897 ## COG0079 Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 85 41 Op 3 18/0.000 - CDS 102918 - 104204 1242 ## COG0141 Histidinol dehydrogenase 86 41 Op 4 . - CDS 104240 - 105091 928 ## COG0040 ATP phosphoribosyltransferase - Prom 105127 - 105186 11.7 - Term 105131 - 105167 4.0 87 42 Tu 1 . - CDS 105341 - 105832 415 ## BF3191 hypothetical protein - Prom 105914 - 105973 4.3 + Prom 105799 - 105858 7.4 88 43 Tu 1 . + CDS 105954 - 106661 380 ## COG1741 Pirin-related protein + Term 106688 - 106735 -0.7 - Term 106675 - 106723 11.1 89 44 Op 1 . - CDS 106746 - 108755 1852 ## COG4232 Thiol:disulfide interchange protein - Prom 108795 - 108854 8.3 90 44 Op 2 . - CDS 108856 - 109155 467 ## BF3194 hypothetical protein - Prom 109175 - 109234 5.8 + Prom 109136 - 109195 8.8 91 45 Op 1 . + CDS 109257 - 109871 480 ## COG0572 Uridine kinase 92 45 Op 2 . + CDS 109868 - 111259 1152 ## COG4623 Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein - Term 111253 - 111301 10.0 93 46 Op 1 . - CDS 111319 - 112872 1232 ## COG0591 Na+/proline symporter 94 46 Op 2 . 
- CDS 112869 - 112994 81 ## gi|265766058|ref|ZP_06094099.1| conserved hypothetical protein - Prom 113061 - 113120 8.6 - Term 113084 - 113128 5.8 95 47 Op 1 . - CDS 113148 - 113753 619 ## COG0778 Nitroreductase 96 47 Op 2 . - CDS 113768 - 116518 2412 ## COG1410 Methionine synthase I, cobalamin-binding domain 97 47 Op 3 . - CDS 116538 - 116990 611 ## COG0691 tmRNA-binding protein 98 47 Op 4 . - CDS 117000 - 117542 396 ## BF3201 hypothetical protein 99 47 Op 5 . - CDS 117574 - 118374 788 ## BF3042 putative lipoprotein - Prom 118396 - 118455 3.3 - Term 118391 - 118440 2.1 100 48 Tu 1 . - CDS 118457 - 118957 528 ## BF3204 hypothetical protein - Prom 118977 - 119036 4.1 - Term 119250 - 119292 2.2 101 49 Tu 1 . - CDS 119496 - 119900 195 ## BF3205 hypothetical protein - Prom 120030 - 120089 7.1 + Prom 119963 - 120022 5.7 102 50 Op 1 . + CDS 120140 - 120841 764 ## COG0822 NifU homolog involved in Fe-S cluster formation 103 50 Op 2 . + CDS 120862 - 121872 1205 ## BF3207 hypothetical protein + Term 121901 - 121939 6.4 + Prom 122224 - 122283 6.9 104 51 Tu 1 . + CDS 122374 - 123387 744 ## COG0332 3-oxoacyl-[acyl-carrier-protein] synthase III + Prom 123400 - 123459 3.2 105 52 Op 1 . + CDS 123494 - 124939 1261 ## COG0366 Glycosidases 106 52 Op 2 . + CDS 124941 - 125876 648 ## COG1295 Predicted membrane protein + Term 125916 - 125955 1.9 107 53 Tu 1 . - CDS 125890 - 126210 239 ## BF3211 hypothetical protein - Term 126499 - 126537 1.2 108 54 Tu 1 . - CDS 126579 - 127415 735 ## BF3212 putative ferredoxin 109 55 Tu 1 . + CDS 127690 - 129756 2111 ## COG1158 Transcription termination factor + Term 129779 - 129839 16.6 + Prom 129795 - 129854 4.5 110 56 Op 1 10/0.000 + CDS 129981 - 131738 1480 ## COG0642 Signal transduction histidine kinase 111 56 Op 2 . + CDS 131735 - 133690 1283 ## COG0642 Signal transduction histidine kinase 112 56 Op 3 . + CDS 133765 - 135087 1280 ## COG0534 Na+-driven multidrug efflux pump + Term 135089 - 135120 -0.1 + Prom 135122 - 135181 3.3 113 57 Tu 1 . 
+ CDS 135239 - 136558 1790 ## COG0541 Signal recognition particle GTPase + Term 136653 - 136692 6.8 114 58 Op 1 . + CDS 136963 - 138048 999 ## COG0526 Thiol-disulfide isomerase and thioredoxins 115 58 Op 2 . + CDS 138068 - 138949 1048 ## COG0190 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase + Term 138957 - 139002 11.1 116 59 Op 1 . + CDS 139017 - 140648 1201 ## BF3220 thiol:disulfide interchange protein 117 59 Op 2 . + CDS 140661 - 141779 639 ## COG2843 Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) + Term 141935 - 141986 16.2 + TRNA 141859 - 141934 69.3 # His GTG 0 0 - Term 142032 - 142079 12.5 118 60 Op 1 . - CDS 142124 - 143152 895 ## BF3222 hypothetical protein 119 60 Op 2 . - CDS 143186 - 143857 598 ## BF3223 hypothetical protein 120 60 Op 3 . - CDS 143883 - 144770 912 ## COG1131 ABC-type multidrug transport system, ATPase component 121 60 Op 4 . - CDS 144798 - 145373 426 ## BF3064 hypothetical protein 122 60 Op 5 . - CDS 145357 - 145941 727 ## BF3226 hypothetical protein 123 60 Op 6 . - CDS 145963 - 147513 896 ## BF3227 hypothetical protein 124 60 Op 7 . - CDS 147552 - 148766 1099 ## BF3228 putative lipoprotein 125 60 Op 8 . - CDS 148793 - 151531 2073 ## BF3229 putative outer membrane receptor protein - Prom 151614 - 151673 8.0 - Term 151807 - 151868 14.2 126 61 Op 1 . - CDS 151900 - 152880 589 ## BF3230 hypothetical protein - Prom 152905 - 152964 3.3 127 61 Op 2 . - CDS 152982 - 154826 1476 ## COG2812 DNA polymerase III, gamma/tau subunits - Prom 154874 - 154933 3.9 + Prom 154795 - 154854 3.1 128 62 Op 1 . + CDS 154975 - 155277 455 ## BF3072 putative septum formation initiator-related protein 129 62 Op 2 . + CDS 155274 - 155618 377 ## BF3073 hypothetical protein + Term 155621 - 155657 0.0 - Term 155656 - 155681 -0.8 130 63 Op 1 . - CDS 155687 - 156094 451 ## COG4704 Uncharacterized protein conserved in bacteria 131 63 Op 2 . 
- CDS 156108 - 158015 1299 ## BF3075 hypothetical protein - Prom 158039 - 158098 3.4 - Term 158063 - 158100 3.0 132 64 Tu 1 . - CDS 158154 - 159053 461 ## BF3236 hypothetical protein - Prom 159118 - 159177 6.9 - Term 159126 - 159171 6.4 133 65 Op 1 . - CDS 159197 - 159802 656 ## BF3237 hypothetical protein 134 65 Op 2 . - CDS 159830 - 160063 184 ## BF3238 hypothetical protein - Prom 160115 - 160174 5.2 + Prom 160141 - 160200 7.1 135 66 Tu 1 . + CDS 160225 - 161691 1296 ## COG2195 Di- and tripeptidases + Prom 161743 - 161802 1.6 136 67 Tu 1 . + CDS 161859 - 162893 518 ## BF3080 hypothetical protein - Term 162721 - 162761 7.4 137 68 Tu 1 . - CDS 162878 - 164047 1028 ## COG0668 Small-conductance mechanosensitive channel - Prom 164068 - 164127 6.0 - Term 164115 - 164160 8.6 138 69 Op 1 . - CDS 164186 - 164842 654 ## COG0176 Transaldolase 139 69 Op 2 . - CDS 164908 - 166725 2020 ## COG3669 Alpha-L-fucosidase 140 69 Op 3 1/0.125 - CDS 166775 - 169837 2993 ## COG3250 Beta-galactosidase/beta-glucuronidase 141 69 Op 4 . - CDS 169915 - 172230 1533 ## COG3525 N-acetyl-beta-hexosaminidase 142 69 Op 5 . - CDS 172275 - 173861 1659 ## COG3119 Arylsulfatase A and related enzymes - Prom 173891 - 173950 3.7 143 70 Tu 1 . + CDS 174273 - 175739 1372 ## COG3119 Arylsulfatase A and related enzymes 144 71 Tu 1 . - CDS 175876 - 177525 1685 ## COG3525 N-acetyl-beta-hexosaminidase - Prom 177623 - 177682 3.9 + Prom 177481 - 177540 5.2 145 72 Tu 1 . + CDS 177703 - 180627 1853 ## COG2207 AraC-type DNA-binding domain-containing proteins + Term 180642 - 180686 7.4 146 73 Tu 1 . - CDS 180703 - 181755 1125 ## COG1830 DhnA-type fructose-1,6-bisphosphate aldolase and related enzymes - Prom 181814 - 181873 2.6 147 74 Tu 1 . - CDS 181914 - 182660 851 ## COG0588 Phosphoglycerate mutase 1 - Prom 182740 - 182799 5.3 + Prom 182725 - 182784 5.3 148 75 Tu 1 . + CDS 182817 - 185159 1729 ## BF3092 hypothetical protein - Term 185193 - 185257 0.0 149 76 Tu 1 . 
- CDS 185300 - 186754 208 ## PROTEIN SUPPORTED gi|90020817|ref|YP_526644.1| ribosomal protein S16 - Prom 186777 - 186836 6.5 150 77 Tu 1 . - CDS 186896 - 188314 1119 ## COG3119 Arylsulfatase A and related enzymes - Prom 188334 - 188393 3.8 151 78 Tu 1 . - CDS 188426 - 189982 1001 ## COG3525 N-acetyl-beta-hexosaminidase - Term 189998 - 190039 5.5 152 79 Op 1 . - CDS 190083 - 191948 1428 ## BF3257 hypothetical protein 153 79 Op 2 . - CDS 191963 - 195400 2819 ## BF3097 hypothetical protein - Prom 195420 - 195479 1.6 154 80 Op 1 6/0.000 - CDS 195538 - 196536 366 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 196556 - 196615 4.4 155 80 Op 2 . - CDS 196627 - 197187 510 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 197297 - 197356 5.9 - Term 197197 - 197253 3.3 156 81 Op 1 . - CDS 197428 - 199428 1703 ## COG1523 Type II secretory pathway, pullulanase PulA and related glycosidases 157 81 Op 2 . - CDS 199488 - 200054 411 ## COG0817 Holliday junction resolvasome, endonuclease subunit 158 81 Op 3 . - CDS 200051 - 200353 320 ## BF3263 hypothetical protein - Prom 200486 - 200545 77.3 + TRNA 200469 - 200542 62.5 # Ala GGC 0 0 - Term 200571 - 200608 7.1 159 82 Tu 1 . - CDS 200706 - 202046 609 ## BF3264 hypothetical protein - Prom 202169 - 202228 6.0 + Prom 202565 - 202624 3.6 160 83 Tu 1 . + CDS 202670 - 203689 1188 ## COG0016 Phenylalanyl-tRNA synthetase alpha subunit + Prom 203711 - 203770 1.9 161 84 Op 1 1/0.125 + CDS 203801 - 204997 1325 ## COG0477 Permeases of the major facilitator superfamily 162 84 Op 2 . + CDS 204994 - 205671 728 ## COG0177 Predicted EndoIII-related endonuclease + Prom 205681 - 205740 4.3 163 85 Op 1 . + CDS 205761 - 207020 1558 ## COG0126 3-phosphoglycerate kinase 164 85 Op 2 . + CDS 206941 - 208050 537 ## COG0715 ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 165 85 Op 3 . + CDS 208067 - 209104 1025 ## BF3109 hypothetical protein 166 85 Op 4 . 
+ CDS 209128 - 210318 987 ## COG2311 Predicted membrane protein 167 85 Op 5 . + CDS 210322 - 212520 1931 ## COG0457 FOG: TPR repeat + Term 212545 - 212607 -0.8 + Prom 212659 - 212718 5.9 168 86 Tu 1 . + CDS 212766 - 213278 429 ## BF3274 hypothetical protein + Term 213487 - 213524 1.2 - Term 213515 - 213558 11.2 169 87 Tu 1 . - CDS 213603 - 214442 630 ## COG2273 Beta-glucanase/Beta-glucan synthetase - Prom 214521 - 214580 5.5 170 88 Op 1 . + CDS 214810 - 216330 999 ## COG1020 Non-ribosomal peptide synthetase modules and related proteins 171 88 Op 2 . + CDS 216345 - 216587 299 ## BF3277 acyl carrier protein 172 88 Op 3 . + CDS 216591 - 217973 1119 ## COG1696 Predicted membrane protein involved in D-alanine export 173 88 Op 4 . + CDS 217992 - 219032 876 ## BF3118 hypothetical protein + Prom 219038 - 219097 4.3 174 89 Tu 1 . + CDS 219137 - 219529 273 ## BF3280 hypothetical protein 175 90 Op 1 . - CDS 219722 - 220303 552 ## COG0424 Nucleotide-binding protein implicated in inhibition of septum formation 176 90 Op 2 . - CDS 220335 - 220856 589 ## COG1778 Low specificity phosphatase (HAD superfamily) 177 90 Op 3 . - CDS 220877 - 221668 854 ## BF3283 hypothetical protein 178 90 Op 4 . - CDS 221665 - 221997 204 ## BF3284 hypothetical protein 179 90 Op 5 . - CDS 222000 - 222536 554 ## COG0778 Nitroreductase - Prom 222556 - 222615 3.5 - Term 222572 - 222627 3.1 180 91 Tu 1 . - CDS 222703 - 223245 546 ## COG0288 Carbonic anhydrase - Prom 223337 - 223396 6.0 181 92 Tu 1 . + CDS 223363 - 224346 646 ## BF3287 hypothetical protein + Prom 224486 - 224545 4.1 182 93 Op 1 2/0.000 + CDS 224579 - 224983 498 ## COG0346 Lactoylglutathione lyase and related lyases + Prom 225030 - 225089 3.5 183 93 Op 2 . + CDS 225109 - 226662 1796 ## COG4799 Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) 184 93 Op 3 . 
+ CDS 226683 - 227597 960 ## BF3129 hypothetical protein 185 93 Op 4 9/0.000 + CDS 227623 - 228054 535 ## COG0511 Biotin carboxyl carrier protein 186 93 Op 5 . + CDS 228057 - 229217 1370 ## COG1883 Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit + Term 229243 - 229286 10.1 + Prom 229238 - 229297 5.1 187 94 Op 1 . + CDS 229365 - 230696 1097 ## BF3293 hypothetical protein 188 94 Op 2 . + CDS 230711 - 232003 1058 ## BF3133 hypothetical protein + Prom 232350 - 232409 1.5 189 95 Tu 1 . + CDS 232529 - 234379 1821 ## COG0366 Glycosidases + Term 234410 - 234457 6.5 + Prom 234441 - 234500 4.4 190 96 Tu 1 . + CDS 234537 - 234977 403 ## BF3135 MarR family transcriptional regulator + Term 235067 - 235096 -0.4 Predicted protein(s) >gi|226332025|gb|ACIB01000031.1| GENE 1 200 - 748 474 182 aa, chain + ## HITS:1 COG:no KEGG:BF3107 NR:ns ## KEGG: BF3107 # Name: not_defined # Def: RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 182 1 182 182 318 100.0 7e-86 MQEKETVRKLKHGDQEAFAFLYNHYWKQVYNFTRLYFTASMDIEEIVQEVFVKVWESHHF LDENKSFEGYLFIITRNVIFNHSRRYYKETALKITAIQAVEESYDMEGELDAADLKKYID ELVMQLPPRQREVFRMSRELHMSNREIAEHFSITEKAIERHINLALKFLKKNLNLFMLFM AA >gi|226332025|gb|ACIB01000031.1| GENE 2 767 - 2143 464 458 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 [Campylobacter concisus 13826] # 51 457 45 458 460 183 29 6e-45 MKRSYLYITLLLVVLILGSCKIGKSYVRPELNLPDSLAQHQDSLSFGDKEWWQIYTDSTL RSLIDRALEHNKDMLIAAARVKEMAAQKRISTANLLPDIRGTVTAERENENHGGDAFKRS DTFEAQLLFSWELDLWGNLRWARSASIAEYLQSVEAQRALRMTIIAEVAQAYYELVALDT ELDIVRQTLKAREEGMRLARIRFEGGLTSETSYRQSQVELARTATLVPDLERKISLKEND IAFLAGEYPNKITRSRLLQEFNFPQELPVGLPSGLLERRPDIRQAEQKLIAANARVGVAY TNMFPRISLTGRVGAESTALSELLKSPYSIMEGALLTPIFGWGKNRAALKAKKAAYEAEV HSYEKAVLTAFKETRNSIVNFNKIKEVYDLRAKLERSAKSYVDLAQLQYINGVTNYLDVL DAQRGYFDAQIGLSNAIRDELIAVVQVYKALGGGWQVQ >gi|226332025|gb|ACIB01000031.1| GENE 3 2147 - 5293 2930 1048 aa, chain - ## HITS:1 COG:SMa1662 KEGG:ns 
NR:ns ## COG: SMa1662 COG0841 # Protein_GI_number: 16263363 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Sinorhizobium meliloti # 6 1030 7 1032 1044 869 43.0 0 MKVSFFIDRPVFSIVISILIVIIGIIGLTMLPVDQYPQITPPVVKISASYPGASALTVSQ AVATPIEQEINGTPGMLYMESNSSNSGGFSATVTFDVSADPDLAAVEIQNRVKLAESRLP AEVIQNGISVEKQAPSQLMTLTLMSSDPKFDEIYLSNFATINVLDVIRRIPGVGRVSNIG SRYYAMQIWAEPDKLANFGLTVQDLQNALKDQNRESAAGVLGQQPVKGLDVTIPITTQGR LSTVEQFENIVIRANTNGSIIRLRDVARVSLEASSYSTESGINGKNAAVLGIYMLPGANA MEVAKSVKEAMDEISKNFPEGLSYEVPFDMTTYISESIHEVYKTLFEALILVVLVVYLSL QSWRATLIPIVAVPISLIGTFGFMLIFGFSLNILTLLGLILAIGIVVDDAIVVVENVERI MEEEKLPPYEATKKAMNGLAGALIATSLVLCAVFVPVSFLSGITGQLYRQFTITIAVSVL ISTVVALTLSPVMCSLILKPDNGKKKNIVFRKINHWLNVGNHKYVIAIRRVIGNPRRVLA GFGVVLIGILLIHRLIPTSFLPVEDQGYFKIELELPEGATLERTREVTDRAITYLEKNPY IAYVQNVTGSSPRVGSNQARSELTVILKPWEDRKDTSIDEIMSNVRHDLSEYPECKVYLS TPPVIPGLGTSGGFEMQLEARGEATFENLVQAADTLMYYASQRKELTGLSSSLQSDIPQL YFDVDRDKVKMLGVPLADVFSTMKAYTGSVYVNDFNMFNRIYKVYIQAEAPYREHKDNIN LFFVKASNGAMIPLTSLGNASYTTGPGSIKRFNMFTTAVFRGEAAQGYSSGQAMEIMEQI ARYHLPDNIGLEWSGLSYQEKKAGGQTGLVLALVFLFVFLFLAAQYESWTVPIAVLLSLP VAALGAYLGVWVCGLENDVYFQIGLVMLVGLAAKNAILIVEFAKVQVDRGGDLIQSAIHA AQLRFRPILMTSLAFVLGMLPMVLATGPGSASRAAIGTGVFFGMIFAIVVGIILVPFFFV LVYKTKAKLKNVPNVNVKLPFKSKKKTD >gi|226332025|gb|ACIB01000031.1| GENE 4 5387 - 6577 1267 396 aa, chain - ## HITS:1 COG:mll6731 KEGG:ns NR:ns ## COG: mll6731 COG0845 # Protein_GI_number: 13475614 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Mesorhizobium loti # 36 388 49 397 402 208 36.0 2e-53 MRLFFTRKELKLRRKRTIAGIVCLALVAGIYWILTRPHKVEPEVPTVIVEPAERDNVEIF GEYVGRIRAQQFVEVRARVEGYLESMLFAEGTYVNKNQVLFVINQDQYRAKADKARAQLK KDEAQALKAKRDLERIKPLYAQNAASQLDLDNAEAAYESAVATVAMSEADLAQAELELGY TLVRSPLSGHISERNVDLGTLVGPGGKSLLATVVKSDTVLVDFSMTALDYLKSKERNINI GQQDSSRSWQPNITITLADNTVYPYKGYVDFAEPQVDPQTGTFSVRAEMPNPKQVLLPGQ FTKVKLLLDVREGAIVVPHKAVTIEKGGAYIYVMRRDSTAEKRFIELGPEFGNKLVVERG LGAGEEVVVEGYHKLTPGMKVRATLPQPSAENKETE 
>gi|226332025|gb|ACIB01000031.1| GENE 5 6896 - 9106 1870 736 aa, chain + ## HITS:1 COG:PA3339_1 KEGG:ns NR:ns ## COG: PA3339_1 COG1752 # Protein_GI_number: 15598535 # Func_class: R General function prediction only # Function: Predicted esterase of the alpha-beta hydrolase superfamily # Organism: Pseudomonas aeruginosa # 26 293 22 296 308 209 41.0 2e-53 MKKHLSLILVLLPVLFLALPALAQERKKVGVVLSGGGAKGVAHIQALKVIEEAGIPIDYI VGTSMGSIIGGLYSIGYTPQQLDSMVRKQDWMFLLSDRVKRSAMSLNEREKSEKYVFSFP FTKSPKDAVSGGIIKGQNLANLFTELTVGYHDSVDFNKLPIPFACVSQNIVNGEQIVFHN GILATAMRASMAIPGVFTPVRKDSMILIDGGMINNYPVDVARSMGADIIIGVDVQNNLKG IDKLNSAPDILSQIIDLTTKNNHQSNVGLTDTYIKVNVEGYSSASFTPAAIDSLMHRGEV AARKQWASLLALKKKIGIADTFVPQSHGPYTMFSKDRTLHVKEITFSDVEENDKKWLMKK CKLQENSRISMRQIEQALFILRGNQSYSNASYTLTDTPEGYKLNFLLEKKYEKTINVGIR FDSEEIASLLINATAQLKTHIPSKVSVTGRLGKRYMARVDYTLEPMQQRNVNFSYMFQYN DINIYDHGDRTYNTTYKYHSGEFGFSDVWYKNFRFGFGARIEYFKYKDFLFKKPEFTMNV NSEYFISYFAQLRYNTFDKGYFPSKGSNFSGAYSLYTDNFARYNGHAPFSTLSASWESVF SISNRLTLIPALYGRVLIGQEIPYAYENALGGDVFGRYLPQQLPFAGIYNIELTHNSVAV ASLKLRQRMGSKHYITLVGNFALSDDNFFKILKGNRIYGCSIGYGLDSMFGPLEASLGYS NQSKDVGFYVNLGFSF >gi|226332025|gb|ACIB01000031.1| GENE 6 9355 - 9624 335 89 aa, chain + ## HITS:1 COG:no KEGG:BF3112 NR:ns ## KEGG: BF3112 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 89 1 89 89 157 100.0 1e-37 MAKRRELKKNVNYIAGELFSECLINSKFIPGTDKKKADELMVEIMKMQDEFISRISHTEP GNVKGFYKKFRSDFNAKVNEIIDAIAKLN >gi|226332025|gb|ACIB01000031.1| GENE 7 9627 - 10184 485 185 aa, chain + ## HITS:1 COG:CC1900 KEGG:ns NR:ns ## COG: CC1900 COG0204 # Protein_GI_number: 16126143 # Func_class: I Lipid transport and metabolism # Function: 1-acyl-sn-glycerol-3-phosphate acyltransferase # Organism: Caulobacter vibrioides # 11 177 16 182 196 129 42.0 2e-30 MKKAIYSFIYYHLLGWKTNVTVPNYDKCVICAAPHTTNMDLFIGKLFYGAIGRKTSFMMK KEWFFFPLGILFKAVGGIPVNRGRKSSLVEQMAEVFAKRPKFHLAITPEGTRKRNPNWKK GFYYIALKAQVPIVLIGIDYNTKTVTSTKAIMPSGDIEKDMREIKLYFKDFKGKHPENFS IGDVE 
>gi|226332025|gb|ACIB01000031.1| GENE 8 10191 - 10961 403 256 aa, chain + ## HITS:1 COG:STM0308 KEGG:ns NR:ns ## COG: STM0308 COG0388 # Protein_GI_number: 16763691 # Func_class: R General function prediction only # Function: Predicted amidohydrolase # Organism: Salmonella typhimurium LT2 # 1 255 4 255 255 233 45.0 3e-61 MKISIVQTDIIWENKQENLRLLREKLSPLRGTTEIVVLPEMFTTGFSMNSRLLAEPVSGS TLRSLKNYAIEFHLSLAGSFICEEQGSYYNRAFLITPDGQEFYYDKRHLFRMGHEAEHFS AGSRKVIIPYNGWNICLQVCYDLRFPVWSRNVNNEYDLLIYVASWPTPRIQAWNTLLCAR AIENQCYVCGVNRIGQDGNGLCYPGYSALYGPKGENLAGTPDSEEKIQTIELSLEALTTF RHKFPCWKDADPFLLY >gi|226332025|gb|ACIB01000031.1| GENE 9 11099 - 12265 1140 388 aa, chain + ## HITS:1 COG:lin2213 KEGG:ns NR:ns ## COG: lin2213 COG1820 # Protein_GI_number: 16801278 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetylglucosamine-6-phosphate deacetylase # Organism: Listeria innocua # 3 379 13 378 380 201 32.0 3e-51 MLTQLINARILTPQGWMKDGSVLIRDNKILEVTNCDLAVIGAELIDVKGMYVVPGGVEIH VHGGGGRDFMECTEDAFRAAVHTHMKHGTTSIFPTLSSSTVPMIQQAAETCTKLMEEKNS PILGLHLEGHYLNMKMAGGQIPENIKNPDPNEYIPIVEQYHCIKRWDAAPELPGAMQFGK YIAAKGILPSVAHTQAEFEDIRTAYEAGYTHATHFYNAMPGFHKRREYKYEGTVESIYLL DDMTVEVVADGIHVPPTILRLVYKIKGVERTCLITDALACADSDSKEAFDPRVIIEDGVC KLADHSALAGSVATMDRLIRTVVQKAEIPLEDAVRMASETPARIMGVYDRKGSLQKGKDA DILVLDEDLNVRAVWAMGKLVPETNTLF >gi|226332025|gb|ACIB01000031.1| GENE 10 12545 - 14536 2003 663 aa, chain + ## HITS:1 COG:BS_nagB KEGG:ns NR:ns ## COG: BS_nagB COG0363 # Protein_GI_number: 16080555 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase # Organism: Bacillus subtilis # 41 280 9 239 242 175 40.0 2e-43 MKTNLSSQITLNRVSPRYYRPENAFERSVLTRLEKIPTDIYESVEEGANHIACEIAQVIR DKQKAGRFCVLALPGGNSPRSVYAELIRMHKEEGLSFRNVIVFNMYEYYPLSQDAINSNF NALKEMFLDHVDIDKQNIFTPDGTIAKDTIFEYCRLYEQRIESFGGIDIALLGIGRVGNI AFNEPGSRLNSTTRLILLDSGSRNEASKIFGTIDNTPISSITMGVATILAAKKIYLLAWG EEKAHMVKECVEGNVTDTIPASYLQTHNNAHVAIDLSAASNLTRIQRPWLVTSCEWNDKL 
IRSAIVWLCQLTGKPILKLTNKDYNENGLSELLALFGSAYNVNIKIFNDLQHTITGWPGG KPKADDTYRPERAKPYPKRVVVFSPHPDDDVISMGGTIRRLVEQKHEVHVAYQTSGNIAV GDEEVVRFMHFINGFNQIFINSEDQVISEKYAEIRKFLKDKKDGDMDTRDILTIKGLIRR GEARTACTYNNIPLERCHFLDLPFYETGKIQKNPISEADVEIVRNLLREVKPHQIFVAGD LADPHGTHRVCTDAVFAAVDLEKEEGAEWLKDCRIWMYRGAWAEWEIENIEMAVPISPEE LRAKRNSILKHQSQMESAPFLGNDERLFWQRSEDRNRGTAALYDSLGLASYEAMEAFVEY IPL >gi|226332025|gb|ACIB01000031.1| GENE 11 14631 - 15353 606 240 aa, chain + ## HITS:1 COG:lin1914 KEGG:ns NR:ns ## COG: lin1914 COG2365 # Protein_GI_number: 16800980 # Func_class: T Signal transduction mechanisms # Function: Protein tyrosine/serine phosphatase # Organism: Listeria innocua # 10 229 46 284 298 125 34.0 8e-29 MKQEKFFRLLPIEGAYNIRDLGGYPTSDHKHVKWKTFIRSGDLDKLTESDLDYLTSLHIR TDIDFRSMQEKKAAADKIPSTVTQYIPLSIEAGDMTDMTHFNLNNIPGILEQAYVYIIQN AQDTYREFFRIVSEERNTPLLFHCSAGKDRTGIAAALLLGALGVDREVIMEDYMLSAEYI KGKYDAIVQAHPGFAPLTTVRKEYLEAAFQTIDTDYQGMDNYLKNQLGVDTHRLRMLYTE >gi|226332025|gb|ACIB01000031.1| GENE 12 15455 - 18364 2140 969 aa, chain + ## HITS:1 COG:no KEGG:BF3118 NR:ns ## KEGG: BF3118 # Name: not_defined # Def: xanthan lyase # Organism: B.fragilis # Pathway: not_defined # 1 969 1 969 969 1966 99.0 0 MKKLCIFLLLLFAATGILFAQEIEKSVKERLSNYFETYTPASANTGSCKLKSVDIDFEGR KLSIYASESFAYQPFVPETVDEIYHQIEELLPGPVRFFRTTIYANNQPIEELIPNFFRGK KKKDKSRLSNAEYKGAPWVINTSRPYEITKGLQNRHISLWQSHGKYYKNDKGEWGWQRPR LFCTTEDLFTQSFILPYVIPMLENAGANVYTPRERDTQKNEVIVDNDTRNGSIYLEMKSR KARWEKTGGYGFAQRKPVYEDGENPFLTGSARFTRTEKKKNKAFAEWIPTIPETGSYAVY VSYQTLPNSVSDAKYLVFHKGGVTEFKVNQRIGGGTWVYLGTFEFDKGSNDYGMVVLSNE SSENGVICADAVRFGGGMGNISRGTVSGLPRYLEGARYSAQWAGMPYDVYGGKQGTNDYA DDINARSNTINYLSGGSVFNPGQKGLGVPFEMNVALHSDAGYSKTNDIVGSLSIYTTDFN NGLLNSGNSRYASRDLADLLLTQIQKDIRAKFNIQWTRRSMWDRNYSETRLPATPSTIVE LLSHQNFADMKLGHDPNFKFTVGRAIYKAVLQFISSQHNKEYVVQPLPVSNFAIEFGKKR NTLELSWQGENDPLEPTARPREYMVYTRIGYGGFDNGVRVNKPSYTLKIEPGLVYSFKVT AVNHGGESFPSEILSAYKAKQEHARVLIINGFNRLSGPTVIDTPDEAGFDLEQDPGVAYQ YNISLCGAQTGFDRSQAGKEGKGSLGYSGNELEGMKIAGNTFDYPFVHGKAIQAAGNYSF 
VSCSDEAVENGRIQPEHYPIVDFILGLEKDDILSNPARKTYYKTFSSPMQRILTAYCQSG GNLLVSGSYIGSDMSNSQGNREFTEKILKYGFQGSLKDTRSRQITGLGRTLQIPRLPNEK AYAVTAPDCIVPVDSAFPVFVYQPGQYSAGIAYKGNYRVFAMGFPFESIESETDRAIVMA AILKFFGEK >gi|226332025|gb|ACIB01000031.1| GENE 13 19001 - 19213 121 70 aa, chain + ## HITS:1 COG:no KEGG:BF2957 NR:ns ## KEGG: BF2957 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 70 75 144 144 140 100.0 2e-32 MIGAMVWECDNPPVGIEQEESYIFGTMIILRIKGVAAGRYQVDMKWYNPLNPGQPPLGRQ TIEVIVQPWP >gi|226332025|gb|ACIB01000031.1| GENE 14 19466 - 19786 446 106 aa, chain + ## HITS:1 COG:no KEGG:BF2958 NR:ns ## KEGG: BF2958 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 98 1 98 106 171 100.0 7e-42 MYTIQANPSGTRSMEISEENLVTIEKYSLFQHLIDSNGIVDEAVLEKLKLNIRSLIASQE EDSKDLLDLCIDVIYHNNMKAFGLQQLIKLYLTWLSKQEAEEEEEA >gi|226332025|gb|ACIB01000031.1| GENE 15 19783 - 19998 136 71 aa, chain + ## HITS:1 COG:no KEGG:BF3123 NR:ns ## KEGG: BF3123 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 71 1 71 71 142 100.0 4e-33 MITVDTCGMTNYSPLIPAIKAMCNANPGDKMEIVTDQVAAFQDLKEYLSEQGIGFREIYD GERMTLQFTIL >gi|226332025|gb|ACIB01000031.1| GENE 16 19995 - 20339 379 114 aa, chain + ## HITS:1 COG:no KEGG:BF2960 NR:ns ## KEGG: BF2960 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 114 1 114 114 219 100.0 2e-56 MSEIQNQIKKWPVTAIKKIKSTFGSAEKFYATVYLIARNEHHCQMMGVAGAEQRLKTIHA YQGMIRFMLDEEGLNGKEILDTIAGEYLEDFVNYREQDFGMTNEEFIAIIKRIG >gi|226332025|gb|ACIB01000031.1| GENE 17 20344 - 20820 493 158 aa, chain + ## HITS:1 COG:no KEGG:BF3125 NR:ns ## KEGG: BF3125 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 158 1 158 158 243 100.0 2e-63 MDIFLIILGSICLLVGLAGCIVPMLPGPPVSYLALVFLHFTDKVSFTIPQLFFWLFIVVL IQILDYFIPMFGVKRLGGTPWGKWGCIIGTFAGIFLFAPWGVFIGPFVGAVVGELLGGKE 
TKYALKAGFGAFAGFLLGTVLKVAVCGWFIFCFIRALV >gi|226332025|gb|ACIB01000031.1| GENE 18 20846 - 22702 1576 618 aa, chain + ## HITS:1 COG:PA4928 KEGG:ns NR:ns ## COG: PA4928 COG1032 # Protein_GI_number: 15600121 # Func_class: C Energy production and conversion # Function: Fe-S oxidoreductase # Organism: Pseudomonas aeruginosa # 9 610 23 667 747 518 41.0 1e-146 MKEYRLTDWLPTTKKEVELRGWDELDVILFSGDAYVDHPSFGAAVIGRILEAEGLRVAIV PQPNWRDDLRDFRKLGRPRLFFGISAGCMDSMVNKYTANKRLRSDDAYTPDGRPDMRPEY PSIVYTQILKKLYPDVPVVLGGIEASMRRLSHYDYWQDRLKKSILCESGADMLIYGMGEK PICELVRRLTALCDNQDGVISSSDIHSPALSSIPQTAYLTRKYESDENDITLYSHEECLA DKKKQATNFRHIEEESNKYAAARIVQAVDGKTVVVNPPYPPMTEKELDRSFDLPYTRLPH PKYKGKRIPAYDMIKFSVNIHRGCFGGCAFCTISAHQGKFIVSRSKESILKEVKEVVQLP DFKGNLSDLGGPSANMYKMGGKDLSLCKRCKRPSCIHPKVCPNLNTDHRPLLDIYYAVDS LPEIKRSFIGSGVRYDLLLHQSKDATVNKITAEYTRELIARHVSGRLKVAPEHTSDRVLS IMRKPAFSQFGEFKKIFDRINRELGLRQQLIPYFISSHPGCKEEDMAELAVITKQLDFHL EQVQDFTPTPMTVATEAWYTGFHPYTLEPVFSAKTQREKLAQRQFFFWYKPEERRNIINE LRRIGRADLIDKLYGKRK >gi|226332025|gb|ACIB01000031.1| GENE 19 22727 - 26104 3059 1125 aa, chain - ## HITS:1 COG:BS_mfd KEGG:ns NR:ns ## COG: BS_mfd COG1197 # Protein_GI_number: 16077123 # Func_class: L Replication, recombination and repair; K Transcription # Function: Transcription-repair coupling factor (superfamily II helicase) # Organism: Bacillus subtilis # 34 1048 31 1097 1177 633 34.0 0 MTITELQHQYAGHPNVEALNKLLGEPAVRHIYCGGLYASAASLFASALVEKSPCPFVFIL GDLEEAGYFYHDLTQVLGTERILFFPSSFRRSVKYGQKDAANEILRTEVLSRLQKGEEGL CIVTYPDALAEKVVSRQELSENTLKLHVGERVDTGFITDVLHSYGFEYVDYVYEPGQYAV RGSIIDVFSFSSEYPYRIDFFGNDVESIRTFEVDSQLSKEKKESIVIVPDLAVTGKVTTS FLDFIPKDTTLAMRDFLWLRERIQVVHDESLTPQALASQEAEENGGITLEGKLIDGSEFT VRALDFRRMEFGNKPTGTPDATLTFHTTAQPIFHKNFDLVAESFKEYLNRGYALYICSDS TKQTDRIKAIFEDRGDRIQFTAVERTLHEGFADDTLKLCLFTDHQLFDRFHKYNLKSDKA RSGKVALSLKELNQFTPGDYVVHTDHGVGRFSGLVRIPNGDTTQEVMKLVYQNEDVVFVS IHSLHKVSKYKGKEGEAPRLNKLGTGAWEKLKERTKTKIKDIARDLIKLYSQRREEKGFA YSPDSFLQRELEASFIYEDTPDQSKATADVKQDMERDMPMDRLVCGDVGFGKTEVAIRAA 
FKAVADNKQVAVLVPTTVLAYQHFQTFRDRLKGLPCRVEYLSRARTAAQAKAVIKGLEAG DVNILIGTHRILGKDVKFKDLGLLIIDEEQKFGVSVKEKLRQMKVNVDTLTMTATPIPRT LQFSLMGARDLSVISTPPPNRYPIQTEVHTFSEEVIADAINFEMSRNGQVFLVNNRIANL PELKAMILRHIPDCRIAIGHGQMEPAELEQIIFGFVNYDYDVLIVTTIIESGIDIPNANT IIINQAQNFGLSDLHQMRGRVGRSNKKAFCYLLAPPLSSLTPEAKRRLQAIENFSDLGSG IHIAMQDLDIRGAGNMLGAEQSGFIADLGYETYQKILSEAVHELKTDEFAELYADELKGE GVISGEEFVEECQVESDLELLLPANYVTGSSERMLLYRELDGLTLDRDVDAFRSRLEDRF GPIPPETEELLRIVPLRRLAARLGVEKVFLKGGRMTLFFVNNAESPYYQSAAFGKMIDYM MKYTRRCDLREQNGRRSMLVKDIPNVETAVSVLLEIVALPVKEKE >gi|226332025|gb|ACIB01000031.1| GENE 20 26297 - 27049 771 250 aa, chain + ## HITS:1 COG:Rv2051c_2 KEGG:ns NR:ns ## COG: Rv2051c_2 COG0463 # Protein_GI_number: 15609188 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Mycobacterium tuberculosis H37Rv # 7 239 3 229 264 217 46.0 2e-56 MQTSDSIVIIPTYNERENIENIIRAVFGLEKTFHILIIEDGSPDGTAAIVKTLQQEFPDR LFMIERKGKLGLGTAYITGFKWALEHSYEYIFEMDADFSHNPNDLPRLYEACAVQGGDVA IGSRYVSGVNVVNWPMGRVLMSYFASKYVRIVTGLPIHDTTAGFKCYRRQVLETIDLDHI RFKGYAFQIEMKFTAYKCGFKIIEVPVIFINRELGTSKMNSSIFGEAVFGVIKLKVNSWF HTFPQKTKMN >gi|226332025|gb|ACIB01000031.1| GENE 21 27053 - 28432 1314 459 aa, chain + ## HITS:1 COG:XF0988 KEGG:ns NR:ns ## COG: XF0988 COG0044 # Protein_GI_number: 15837590 # Func_class: F Nucleotide transport and metabolism # Function: Dihydroorotase and related cyclic amidohydrolases # Organism: Xylella fastidiosa 9a5c # 1 459 1 449 449 398 46.0 1e-110 MKRTLIQNATIVNEGRSVRGSVVIEGEKIAEVLEKGQKPAIPCEETINANGCYLIPGVID DHVHFRDPGLTHKADISTESRAAAAGGVTSIMDMPNTNPQTTTLDALNAKFDLLAEKCSV NYSCYFGATNNNYTEFDKLDKNCVCGIKLFMGSSTGNMLVDKMNSLLNIFNGTDLLIAAH CENHETIKKNTEKYVKEYIEKYPHQYYHVHHETLPMGYHAKIRSIAACYESSELAVRLAR IADARLHILHISTARELSLFDNDIPLEEKRITAEACVSHLLFDSSDYPELGARIKCNPSI KTKTNRDALRQAVNSNLIDVIATDHAPHLLKEKEGGPLKAMSGMPMIQFSLVSMLELVNE GIFTIEKVVEKMCHAPAQIYNIHNRGFIRPGYQADLVLVRPDALWTVSADQILSKCGWSP LEGRTFEWKVEKTFANGHLLYTDGQVDETYRGQEIYFER 
>gi|226332025|gb|ACIB01000031.1| GENE 22 28499 - 28771 133 90 aa, chain + ## HITS:1 COG:no KEGG:BF2966 NR:ns ## KEGG: BF2966 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 90 10 99 99 165 100.0 5e-40 MAFAIQIVNLHKYPNKRKAYSLSDQILISGTAIGVLQKETECAESNADFIHNIASPQKNV TKPFSGLNCYLKQNIYPKQNIQACLQTQMN >gi|226332025|gb|ACIB01000031.1| GENE 23 28808 - 29746 816 312 aa, chain + ## HITS:1 COG:AGc3907_2 KEGG:ns NR:ns ## COG: AGc3907_2 COG1410 # Protein_GI_number: 15889436 # Func_class: E Amino acid transport and metabolism # Function: Methionine synthase I, cobalamin-binding domain # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 21 311 616 904 919 200 37.0 3e-51 MEKHNPSSFTVDSSSPAHRSSFAIHDLTPYINWIYFFHAWGFQPRYAAIANIHGCDSCRA IWLTTFPEEERSKASEAMQLYKEANRMLNELDRDFEVKTIFKLCPANADGDNLIINGITF PLLRQQVKKKENEPFLCLSDFVRPLSSGITDVVGAFASSIDADMEGLYEKDPYKHLLVQT LSDRLAEAATEKMHEYVRKEAWGYAKDENLSIPDLLVEKYQGIRPAVGYPSLPDQSVNFI LDEILDMKQIGIHLTENGAMYPHASVCGLMFAHPASQYFSVGKIGEDQLADYAGRRGKTV EEMRKFLAANLQ >gi|226332025|gb|ACIB01000031.1| GENE 24 29803 - 30723 814 306 aa, chain + ## HITS:1 COG:no KEGG:BF3132 NR:ns ## KEGG: BF3132 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 306 1 306 306 590 100.0 1e-167 MKKVLILLLWLLPSCSVVIFAQTQRIAGTVVVGDGQPAVGAAIVVKGTNIGAVTNVDGKF VIAEAPASAQRLVVYHVGMERKEVDVAPNVYIQLQPVEQGFSWNVKAGVSLFSYRRPDGL DERTGFSVGAGVEYNFSRHWAIQSGLMITSKGATAEYEGYSPLSSSTDPISYKAKISPIY LDIPILAAFKIDISNHVKFAINIGPYLSVGMGGKYELTNTGGHKNESHNPFKSYSSTAEM KDKDALLKRFDIGVQGGIGFEIWKHYLINASFQHGFMDPIKGGTLYAGDTEKHYPIGGIL TVGYRF >gi|226332025|gb|ACIB01000031.1| GENE 25 30912 - 31799 911 295 aa, chain - ## HITS:1 COG:no KEGG:BF3134 NR:ns ## KEGG: BF3134 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 295 1 295 295 585 100.0 1e-165 MKKYIALAWVCLALTDVMAQKQKVVEKVVDEDYYPVEGAVVTLKRTNQNATTDKDGVFIL EEVPVYFDSLQVKKGKRSGYIDLPMRIQMRTQVMQRFSWSVKAGIGAGKFMQGPESKLKD 
GFSIYGGVGADIRMSKHWAFQPSLLLVSRKMKGYDFYGYYAENNGDGSSSASYEQVSGTY NPFYLEIPLLFALKYRVGNDMNLVFSFGPYIDLGLSGDEKREYIKHYYDTMEGNKTESVT NSRSLFGKRFTGGFAYGIGVEYGRFLIGGTGRVGCTSWNHSEGFIDVAFEIGYRF >gi|226332025|gb|ACIB01000031.1| GENE 26 31848 - 32693 834 281 aa, chain - ## HITS:1 COG:no KEGG:BF3135 NR:ns ## KEGG: BF3135 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 281 8 288 288 583 99.0 1e-165 MIACICAITVMAQVQTHDIKGVVFDRRQQPIVGALVTAKGTNISTITDVDGKFLLQEVPL SVKKVVVTSIGMETREVDLNVPVQLTGKRKKVSFVAHAGLSMSKYTIYGSDFKVGYEFGL GIEVRMSKRWAFQPTLQICNHGAEFNAERYGVKYQETWNPVSLDLPMLFILRCPIARKMN LAFSMGPVFSYGFAGKVKASETGKPDEEYDIYSSEYEYDYSGGKHSLLHPFSFGVAYGIG VEYKKWLAGISGKSMCLGQDDEGFEAKEHNLVLTLGVTYRF >gi|226332025|gb|ACIB01000031.1| GENE 27 32742 - 33878 1026 378 aa, chain - ## HITS:1 COG:no KEGG:BF3136 NR:ns ## KEGG: BF3136 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 378 1 378 378 700 100.0 0 MNKRKLLGLLCLMTLLATSCDNKGDYWGAMESSKATLTLERICDMATLSQDSVELLSNIL GMNTEELYRTDVVMIGKVTSEETGFYQYPRFLIAKDREMKEVLTEAYVHRDTEGTFYAFL ESNMLPVGETYYCAMVDYNYGYNGRPGLLDHVLGGNTRGERYSEVKPFRLSGLPRLVVHD AHFTGYSFYLSAEVRFKSNGGIIEQGACYSSTKRIPTVDDQKTLARETRNYDYSFLEVEV TDLLPNTHYYIRPYVTTEEGTGYGPVVEFTTEPGTEPIINQFSLYSYDDTSVELYASFYT NDYQITNYGYSYGIYSQETGTVKDEQMIEVPFGDDYGQELSKIVTGLRPGTRYAFRVYAK NGVGITYSGYRTVKIPVE >gi|226332025|gb|ACIB01000031.1| GENE 28 34170 - 34598 151 142 aa, chain + ## HITS:1 COG:no KEGG:BF3138 NR:ns ## KEGG: BF3138 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 272 100.0 2e-72 MMQNPDYFERTRSRLHPKKYRFYFFDYLYYCGDRWSKRNSRVWGSGVIFNYWTFCIWGPV AFWTRLNGIHLFSESIDVTIVFAGMLLPFVCTRLRYRKDRVSAIRHHYRRSAWRSIIPPR LVVFGWFIILLLEVIGAKLCEA >gi|226332025|gb|ACIB01000031.1| GENE 29 34666 - 35148 398 160 aa, chain - ## HITS:1 COG:no KEGG:BF3139 NR:ns ## KEGG: BF3139 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 160 1 160 160 315 
100.0 3e-85 MEVPLVDKDYLLERTPGNGGWTYAPIPEVPQDKKAPFGWVKVKGSIDGVEIKKHHLMPMG NGELGLSVKAEIRKKIKKQAGDYVHVVLYLDEEPSEIPEELQLCLQDEPRALEFFNSLAE NERHNYVKWIYSAKTDRAKVARMAKAIDRLASNLKYYDKG >gi|226332025|gb|ACIB01000031.1| GENE 30 35243 - 35440 67 65 aa, chain + ## HITS:1 COG:no KEGG:BF3140 NR:ns ## KEGG: BF3140 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 65 1 65 65 117 98.0 1e-25 MGMVWVDKSLLIFMLTLHCSLIHMASKFNKYPETNPNYHQFSLLFCDYFLFFATNQIKVI LKEDQ >gi|226332025|gb|ACIB01000031.1| GENE 31 35437 - 35949 642 170 aa, chain + ## HITS:1 COG:CC3310 KEGG:ns NR:ns ## COG: CC3310 COG1595 # Protein_GI_number: 16127540 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Caulobacter vibrioides # 43 166 34 158 166 77 33.0 8e-15 MNLNPAHINEPVQKEFLSVIKEYERVIYKVCYLYTTRNATLGDLYQEVILNLWKAYPKFR KECKISTWIYRIALNTCISFIRKEKNVPEIVALTREADWMTEEKDELTEMLRQLYRMINQ LGQLDKSIVLLYLEEKSYEEIAEITGLTVTNVATKLSRIKDKLKKMKKEE >gi|226332025|gb|ACIB01000031.1| GENE 32 35953 - 36537 530 194 aa, chain + ## HITS:1 COG:no KEGG:BF3142 NR:ns ## KEGG: BF3142 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 194 1 194 194 366 100.0 1e-100 MELDDLKKSWNALDEHLKNKEFIEEKEIAQLLGRARNKMNSIDRFNRKLRFASIGILTLA VLFWICADTLTDLFYWIALSLCIPALCWDLYSAHYLSRTRIDEMPLVTVISRINRYHRWM VREWIIGILYLLAMATFFFFHRQVWQYGAAGIIVSLIVWAIGLGICLWVYRRNIRHIKEI KKNLNELKELNHTA >gi|226332025|gb|ACIB01000031.1| GENE 33 36649 - 37815 850 388 aa, chain + ## HITS:1 COG:BS_ykuE KEGG:ns NR:ns ## COG: BS_ykuE COG1408 # Protein_GI_number: 16078469 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # Organism: Bacillus subtilis # 133 386 40 283 287 118 33.0 2e-26 MLQRVLGFLIVILVLPDIYIYRTFIKQLTLSLFWRILYFFPTLFLMAGVVSLAFFANYEY AEQHTLWIGRFAVVFFLFASPKLIFTICSIIGRPFNRWLHWSRKPFVATGLVLATLNAAL ILYGSMVGKDRFEVKEVTFRSPRLPEAFNGYRIVQLSDIHIGSWQGNAKSLQRMVDLVNA 
QKPDLIVFTGDLVNNRAAELDGFEEILSQLHATDGVYSILGNHDYGPYYRWKSKRDQVNN LNDLKKRQADMGWILLNNEHTLLHRGNDSIALIGVENEGEPPFSQHGDLTKAQAGTNGLF KLLLSHNPTHWRREVLPQSDIDLMLAGHTHAMQLAIGHHSPASWIYPEWGGMYMEDNRGL YVNVGMGFVGLPFRFGAWPEITVITLDK >gi|226332025|gb|ACIB01000031.1| GENE 34 37812 - 39536 258 574 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P [Thermanaerovibrio acidaminovorans DSM 6589] # 335 556 131 351 398 103 30 5e-21 MKKYWQILKKYKISLLACPLLVLVSVMCETVQPMYMADIIDNGVMQRDLSVITAVGGKMI LISIVGLIFSIANVYVSSHASIGFGTDLRTGLFGKIQQLSFFDIDRFSTASLITRLTSDI SRIQQVIMMSMRLMLRSPLMLVMAVFFVVRINLELAGVLLAAIPILGFSVFFILRKGFPF FLKVQQKVDQLNEVVRENLINIRVVKSFVREDFEAHKFKDKSESLRDTVIHASNIIVSIF PVMQLVMNLSIIAILWMGGHKVMTGELKVGELISFVNYLGQVLMSLMMLSMIIMSYARAS ASSKRILEVLDTQPSLTDTPEGMRSTREIEKGEIAFEKVSFRYGGGETDVLRNISFHIRP GETVAIAGATGSAKSSLVQLIPRLYDVSAGEIRIDGIPVQDYNLRELHARIGMVLQKNEL FTGTIAENLRWGKPDATQEELEVAARAAEAHEFICSLPAGYDTLLGRGGINLSGGQKQRI CIARALLRKPKILILDDSTSAVDSETELRIRNNLNAWLRDTTVLIITQRIYTMQSANRVI LLDDGEIESIGTPEELLERSEMYREIYYSQQIVI >gi|226332025|gb|ACIB01000031.1| GENE 35 39539 - 41305 175 588 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P [Thermanaerovibrio acidaminovorans DSM 6589] # 367 574 147 353 398 72 28 2e-11 MAHGDHLKYSGKPKAGKKTFLRLISYVACDRRLLIVIGVLIVISIAANLTGSYMLRPIIN EYILPGDFQGLVRILLFLAAIYLTGVAATYIEYILLNKIGQRTVTRMREELFGKMERLPV RYFDTHQHGDVMSRYTNDIDRISDALTDSLSDMLSSALTVIGIFCLMIFISPILTAVTLI TVPLMFLSAKGIVKRSRKYFKAQQEALGMMNGYAEEMISGQKVVKVFGHEQKVETDFGIL NQSLKDKSLKAQFYSGLMMPVMQNLNTLNYVIITIVGALLAIFRGFDVGGLAAFLQYSRQ FGRPINELASLYNSIQAAIAGAERIFEIIDEAPEKADVPEAVTLKNIKGDVALKNVYFGY RPEKTILKGVSLHAPAGKKIALVGATGAGKTTILNLLPRFFDIQSGEITIDNHPIDRIER NSLRRSMAIVLQDTHLFTGTVRENIRFGRLSATDDEVVAAARLTAAHSFIKRLPQGYDTL LENDGANLSQGQRQLLNIARAAVADPAILLLDEATSNIDTRSEILIQRGLDQLMQGRTSL IIAHRLSTIRNADTILVLEHGEIIEQGSHQELLALKGKYYSLNEEQFK >gi|226332025|gb|ACIB01000031.1| GENE 36 41611 - 41829 76 72 aa, chain - ## HITS:1 COG:no KEGG:BF2980 NR:ns ## KEGG: 
BF2980 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 72 14 85 85 134 97.0 1e-30 MDGDSARFLIECHKIGAIASLKYFLEGKPCLSVVPLSASDTSQPERKTQESGEKNRTIDL MDKNNRRYLKKK >gi|226332025|gb|ACIB01000031.1| GENE 37 41919 - 44882 2571 987 aa, chain - ## HITS:1 COG:ZsbcC KEGG:ns NR:ns ## COG: ZsbcC COG0419 # Protein_GI_number: 15800123 # Func_class: L Replication, recombination and repair # Function: ATPase involved in DNA repair # Organism: Escherichia coli O157:H7 EDL933 # 1 975 1 1031 1047 256 28.0 2e-67 MKILTIRLKNLASIEGTFEIDFQAEPLRSAGIFAISGPTGAGKSTILDALCLALYDKTPR FSASVESLYMSDIGESRVNQADVKNILRRGTGEGFAEVDFLGASGHCYRSRWSVRRTGSR ANGALRSQTIQVTDLTANQELQGTRKELLAQLVTLVGLTYEQFTRTVLLAQNDFATFLKS RESAKAELLEKLTGTEIYSRISSEIYLRSKTADAELNQLKSNATLIELLSEEEITLLRTE KESLTNLREQGSKALIDLNAQLSVLHTLKLQQEQRDKKVQDMRLDEEKSKKLREEYTRQS DSLIRFRGQCEAVQPDLSRARELDVQIQSLVSQSKQVEEILQGAEKAVNAQANKLQSVQG ALHTSCHSLKNLTGEIELPVTEETGLFLESVRNRLKEQEDQLAILQEKNEARVNRLNAFG IEAVTDEQARWMQEQTRLQNARQQMLEWRKAGTEAERLKAQQEEMGHKQEQMRKEITLLT TRLSEKEAELKVLQRLFENARIAMGKDVRTLRQNLRENEPCPVCGGTDHPYRNEEQVVHS LYQNIEQEYQTASAEYQQLNNRNIALKQDLLHLSELSGEITVQLQAFLQEAEQKRPSSEE EQNPDYFEKQLHTVQGKLNLLAEKMHQYHQLYKEWQQHEGQIRTARSACEALREGVARCH LLMQQVLAAKEQFELLKTAETTAREQFRVVSEQLITLRQERAPLLKGKSVEDAEAAIRKK EKQLNDSVEQVRKEGEEVQSRISGMQGEIRQLNSSIDELMLRKEQIADPEHLPETIARRQ ATNQETERRLSTVEARLLQQEQNRKKLKQLEQELTEKQETANRWGKLNKLIGSADGTKFK VIAQSYTLNLLLMHANKHLSYLSKRYRLQQVPGTLALQVIDCDMCDEVRTVYSLSGGESF LISLALALGLSSLSSNNLKVESLFIDEGFGSLDADSLRTVMEALEQLQMQGRKIGVISHV QEMSERIAVQVQLHRAANGKSAITLTN >gi|226332025|gb|ACIB01000031.1| GENE 38 45030 - 46262 1223 410 aa, chain - ## HITS:1 COG:PA4281 KEGG:ns NR:ns ## COG: PA4281 COG0420 # Protein_GI_number: 15599477 # Func_class: L Replication, recombination and repair # Function: DNA repair exonuclease # Organism: Pseudomonas aeruginosa # 2 409 1 401 409 292 41.0 7e-79 MIRILHTADWHLGQTFFGYDRTQEHEHFLDWLAGVLTKNKIDVLIVAGDVFDVSNPSAAS QRMFYRFIHRVTTENPRLQLVVVAGNHDSAARLESPLPLLQEMRTEIKGIVRKQNGKIDY 
EHLLVELKNAAGEVEALCLAVPFLRQGDYPVVETEGNPYAEGVKELYARLLKYALKKRTD GQALVAVGHLLATGSEIAEKDHSERIIIGGLESVSPESFPEQIVYTALGHIHKAQRVSGR ENIRYAGSPLPMSFAEKHYHHGVVKVTLDEGWAVEIEKLEYTPLVRLLSIPATEAAAPDE VLDELRGLELPEDEPMPYLEVKVKLSEPEPMLRQQVEEILEGKPVRLARIVSFYRQAAEG SVEEETLTAGLQEMNPLQIVKATFENSYQTEMPEELVNLFQEACRTINLE >gi|226332025|gb|ACIB01000031.1| GENE 39 46351 - 47109 650 252 aa, chain + ## HITS:1 COG:TM1693 KEGG:ns NR:ns ## COG: TM1693 COG0204 # Protein_GI_number: 15644441 # Func_class: I Lipid transport and metabolism # Function: 1-acyl-sn-glycerol-3-phosphate acyltransferase # Organism: Thermotoga maritima # 55 216 59 219 247 109 36.0 6e-24 MKILYYIYQICIALPILLVLTILTAVVTIVGSLLGGAHIWGYYPGKIWSQLICLFLLIPV KVHGREKLHERTSYIFVPNHQGSFDIFLIYGFLGRNFKWMMKKSLRKIPFVGKACESAGH IFVDRSGPKKVLETIRQAKDSLKDGVSLVVFPEGARSFTGHMGYFKKGAFQLADDLQLAV VPVTIDGSFEILPRTGKWIHRHRMILTIHDPIPPKGQGADNMKATMAEAYTAVESALPDK FKGMVKNEDQDR >gi|226332025|gb|ACIB01000031.1| GENE 40 47239 - 50442 2833 1067 aa, chain - ## HITS:1 COG:BB0140 KEGG:ns NR:ns ## COG: BB0140 COG0841 # Protein_GI_number: 15594485 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Borrelia burgdorferi # 17 1058 12 1024 1036 176 22.0 2e-43 MDNPSKIKTQTKASSFTLIVAFICVALIGLALIPLLPVKLNPSRTLPGFTVQFSMPGTSS RVVEIEATSKLEAMLARIKGIKNIYSTSDNGSGSITIELDKYADIDAVRFEASTIIRQTW PQLPDGVSYPYIRMKRPDENASRPFMSFTLNAPSTPILIQQYADEHIKTRLAQIQGIYKI DLSGATPMEWVLEYDSEQLRRLGITLSDIQQAVSRYYLKEFLGTYNVESSTGGKEWIRLA LMPETKDEGFDASRIRVKSAEGKLISLDELVTVSHMEEAPQSYYRINGLNSIYLSITAEE TANQLQLSKQVKEEMEAIQKVLPAGYEIHTSYDATEFIHEELNKIYLRTGLTVLILLFFV LIITLNPRYLFLIVVSLSINIAVAVIFYYLFGLEMQLYSLAGITVSLNLVIDNTIVMTDH ILHRRNLKAFMSILAATLTTMGALVIIFFLDEKIRLNLQDFAAVVIINLAVSLFVALFFV PALIEKIGLKKRKRRRTQSRFFLLRASLPRRITVYFTRFYGWMIRKLCRWRVAVCILLIL LFGLPVFMLPDKVEGEGRATEWYNKTLGSSTYKEKIKPIVDKALGGSLRLFIQKVYNGSY FTRNEEVVLYVYANLPNGSTLEQMNELIKKMEIYLSQFKEIKQFQTSVYNARRGNINIYF TKEHQNSGFPYTLKANIISKALQLGGGSWGVYGLQDQGFSNDVREGAGSFQVKMYGYNYD ELYEWAEKLKAKLLTHRRIKEVIINSYFSYWKDDYQEFYFNLNRERMAQENINANILFST 
IRPIYGKNMEIGSVVAENGSEKIKLSSKQSQEYDIWAMQYFPYGTDDKQYKLSELATMEK GQMPQQVAKENQQYRLCLQYEYIGSGEQGNKILKRDLEEFNKELPMGYTAQSERESWGWG KKDNKQYLLLLVVIAIIFFTTSILFNSLKQPLAIIFIIPVSYIGVFLTFYWFKLNFDQGG FASFVLLCGITVNASIYILNEYNAIRRRHPRMSALRAYTKAWNAKILPIFLTVVSTILGF IPFMVGTDKEAFWFPLAAGTIGGLVMSIIGIFFFLPVFVLKKRVGKR >gi|226332025|gb|ACIB01000031.1| GENE 41 50463 - 51956 1624 497 aa, chain - ## HITS:1 COG:VC1565 KEGG:ns NR:ns ## COG: VC1565 COG1538 # Protein_GI_number: 15641573 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Vibrio cholerae # 186 474 123 403 419 63 23.0 1e-09 MKKRYYIVIAALLFGASVAKAQDHIKLDLQKTIQLANDSSLEAFRTQNMYLSGYWEYRTY KANRLPSLTLNMTPAEYNRDITKRYDSEKDLDVYRSQQSFYASGNLAIQQNFDLTGGTFY LQSQLGYMRSFGGNKTTQFTSVPIRLGYSQSLVGYNSFKWERKIEPLKYEKVKKEFVYNV EAVSVQATTYFFNLAMAQAEYNLAKENMVSSDTLYSIGVQRQKIAAISKADLLTLKLDVV NARNTLQNKASALKRAMFSLVSFLNLDKNTVIDIDLPVRPQELVIPVDKALQMAHENNPQ LLGLKQNVLEAERNVDKTKKESRFNASVNASIGFNQVADNFGDVYHKPMQQDLVSVSVSI PLVDWGVRKGKYNMARNNLNVVKTSARQDEISLDEEVIMTVNDFNIQQNMITSAEEALDL SILAYNETRQRFIIGKADINSLTLSLNRQQEAQQNYISALQNYWLNYYKIRKLTLHDFAT GISLTDKFDYAGGQLVR >gi|226332025|gb|ACIB01000031.1| GENE 42 51959 - 55036 2680 1025 aa, chain - ## HITS:1 COG:aq_786 KEGG:ns NR:ns ## COG: aq_786 COG0841 # Protein_GI_number: 15606161 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Aquifex aeolicus # 1 1015 1 992 1000 308 25.0 3e-83 MIKFLIQRPIAVLMAFTACFIVGLVTYFTLPVSLLPDISIPEITVQVSAKNTSARELENT VVKPVRQQLIQVAALKDMTSETRDGAGIIRLSFDFGTNTDLAFIEVNEKIDAAMNYLPKD TDRPKVIKASATDIPVFYLNLTLKTDSAYEETDQQAFLNLCEFSESVIKRRIEQLPEVAM VDVTGLLERQLQIVPDMDKLAMLELSIEDIETALAQNNVEPGSMTVRDGYYEYNIKFSTL LRTAEDVENIYIRKGDRIIQLKEFCRIAIVPVKEKGVSVSNGKRAVTLAIIKQADENMDN MKDALSETMDYFKKIYPDIEFSVSRNQTELLDYTISNLQQNLSLGFVFICIVAVLFLGDV KSPFIIGLSMVVSIVISFLFFYLCKMSLNIISLSGLILALGMMIDSSIIVTENISQYREK GYSLRRACVAGTSEVVTPMLSSSFTTIAVFVPLVFMSGIAGAIFYDQAFAVTVGLMVSYF 
TGIMLLPVLYMLVYRTGIKGGPKWLRLKINNPLKEHTLDRFYDKGIDWVFSHKTLSVLFC AISFPLCIFFFYFIDKERMPDIDENELITRIEWNENIHVDENQRRVDELFRELQGASVEQ TASIGLQDYILNREQELSSSEAELYFKTETSKEIVPLQEQIYQKLKERYPLAVISFSPPE TVFEKLFVTGEADIVAELYARNKDRAPGPGTLRGLEQTFGQKTGIPPTGIAFENQLNLSI NQEKLLLYQISYNELYRVLRTAFKENSVAMLHSYQQYLPISIAGDEKTVNQVLQETLIQT QPDSKTGEVNFIPLRELIKVTPAEDLKSITAGRNGEYIPYKFYGVENAEKLMTQVKETSS ETGDWDIAFSGSFFSNQKMLDELVVILFISLLLMYFILAAQFESFMQPLLVLMEIPIDVA FALVLLWVCGHTLNLMSAIGLIVTCGIVINDSILKLDAINELRKEGVPLLEAIHEAGRRR LRPIIMTSLTTIFAMVPLLFSFDLGSELQKPLSIAMIGTMTIGTLVSLFIIPLLYWFIYR NKEKR >gi|226332025|gb|ACIB01000031.1| GENE 43 55063 - 56133 1074 356 aa, chain - ## HITS:1 COG:BMEII0380 KEGG:ns NR:ns ## COG: BMEII0380 COG0845 # Protein_GI_number: 17988725 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Brucella melitensis # 39 356 53 382 390 75 23.0 2e-13 MKNQYPFYTLCLALTMLTACSGEKKESASEKGVETVLPDTKNEVSVMTLKKQIFNHELVS NGKISARGMADLRFESGEVIAHIWVKNGDRVRKGQKLAELDKFKLDNQLSQSEDALKKSE LELRDVLISQGYPADDISQVPEETMKLAKVKSGYDQSKSQYEMSKYNAEHATLTAPFDGV VANLFSKPYNLASTSDVFCTVIDMQGMEVDFTVLESELPLIKNGDKVVIKPYSDAATVHE GSISEINPLVDDKGMVKVKARVNGAGKLFSGMNVRVSVHRSLGEQLVIPKSAVVLRSGKQ VVFTLKDGKMAQWNYIHTALENADSYSVADGLTEGDTVIVSGNINLAHEAPVTIIE >gi|226332025|gb|ACIB01000031.1| GENE 44 56534 - 58222 993 562 aa, chain - ## HITS:1 COG:no KEGG:BF3153 NR:ns ## KEGG: BF3153 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 562 2 563 563 1128 99.0 0 MREKVDYMHIILRGGLLAMVCALLSVVWVNDPMLPAGELSGQWLYLAKVAMGAAVGWVVL AFLYYRKGYDMGADFYQVVIWSFIVLAASEAIYGLRQLYGFTSSHHSLYSLTGSFFNPGP YSGYLAMIFPLCLDQWLRLRKRENKNWMEWTGYYGAVAVLFLILCVLPAGMSRSAWVAAL ISGIWVYGAHKSWPVRLKRVWMRKKTKVLAVTSVLCIVVLVGGVCLFNLKKDSASGRLFM WKISSRAIAEKPFTGYGQGNFALAYGTSQEAYFAEGSYSPQEELVAGSPEYAFNEYLQIA LEWGIPVLLCCLAFAGFCLRRGVELKRWGPCGSVISLLVFAFSSYPMQLPAFVIAFLILL MACIAGRFLAWQLAFACVLGFSGYHWWQADVHQECKDWANCRMLYQAGAYQSAKEGYERL YPKLKGRGAFLFEYGHCLHKLKEADASILLLKEASTRSCDPMILNIIGKNFQEKGEYAQA 
EKWLLRSTHLLPGRIYPYYLLAKLYASPGYLQPDKFHEMADLVLTKEPKVQSTAVREMRD EIRKLVIDVSNEKIVNSRNVND >gi|226332025|gb|ACIB01000031.1| GENE 45 58222 - 59112 541 296 aa, chain - ## HITS:1 COG:NMB0765 KEGG:ns NR:ns ## COG: NMB0765 COG0681 # Protein_GI_number: 15676663 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Signal peptidase I # Organism: Neisseria meningitidis MC58 # 33 275 108 324 339 82 30.0 8e-16 MKKEVWIKLVKRIGNWIVNICFYSCVAFVAWMVLQVFCLTSFKIPSNSMEPALLSGDKIL VDKWTGGARLFNIFASLRGEEVDIYRLPGFGSFQRDDVLVFNFPYQDGSDSIGFDIMKYY VKRCIALPGDTLEIRKGYYHIKGITDSVGNVQAQHRIARVRREDSHGIVMDAFPWDGRLG WTIQEFGPLPVPAKGQVVKIDTLSCLLYGRLIHWEQKKRLRQKGEAVCLGDSAITEYKFT ENYYFVSGDNMENSKDSRYWGMLPESYIVGRAFTIWRSDDPLRGKIRWNRVFKRIK >gi|226332025|gb|ACIB01000031.1| GENE 46 59140 - 60168 431 342 aa, chain - ## HITS:1 COG:no KEGG:BF3155 NR:ns ## KEGG: BF3155 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 342 1 342 342 716 100.0 0 MWKKLSLYVCLITILCSCQKQRSAYAPPTFPEVKKIHAHRLSDELLISYPLDMAVSEDYI FILALADNAWLQVYDKTTGQLLGSFVTRGQGPGEATTANMCYYNAREKKISVYDESSMKL LTYQFDKDADNWGALIEERSFYDLGGTLRRVWELRNGRFLVDGQLGTKSDQQKRFQMLAD AKVVADYNDFPIDTPKERSVWSSPAIAISPDCKKMAVGTLYGGILELFDLSQNIELRAIR KFYPPVVQYLSGTIQNTEETVWGFSALCATDERIYSVFIGDKNPNLFNNLSVFDWDGREL IKYNTDCLVLRICASTQEPNKLYGIAFSETHEFYLVSFSLDS >gi|226332025|gb|ACIB01000031.1| GENE 47 60277 - 60843 279 188 aa, chain - ## HITS:1 COG:no KEGG:BF2991 NR:ns ## KEGG: BF2991 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 188 1 188 188 375 98.0 1e-103 MKWMILIFLNFLFCAQLVGQVSRPDRNLLRGETYVIEVPKGWKRPSAVHSCNDEPLKRVN GKYETTKFMRVYSKRKDRCGAVLTIMEIQKCASFQEIFKEDSIWASTDTTQVKVIYKSVN SKNGGKKMAFTSYKAERHPETNELSALQKAEWYLQGRENVYYISFTSCSLFLELLPQIKD IVASLKEL >gi|226332025|gb|ACIB01000031.1| GENE 48 60840 - 61850 530 336 aa, chain - ## HITS:1 COG:no KEGG:BF2992 NR:ns ## KEGG: BF2992 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # 
Pathway: not_defined # 1 336 1 336 336 686 97.0 0 MNVRCFLWGILFITVSSCIESDRIMHYAQFEHTINLKSDRIQVPSVLLYPRSLVLCDSNL IVFNEKMDTMFQCFHLPDLTFQYGFGTQGQGPNDFVLPSITPVKYQKNGFVMLDGINLKH ISVEKDKAIVQTSTLNYGFNCFNDLISISDSSYCCNGGFENEKEFRFLYPDGNHESWGEY PETEERFGSVLGRNQAYIKMTVAKPDKSCFVSFYQHIRRFRIYGKDGELKRDVILDILPG QERPDVDDYLRFIHPISIYATDSYIYTLNLDMTTEEIENRKTTPNIQVFDWEGKPLTQYK LDCFINTFVVDEVANKIYGAFVEDEDHIYVFNLPRL >gi|226332025|gb|ACIB01000031.1| GENE 49 62157 - 62615 195 152 aa, chain - ## HITS:1 COG:no KEGG:Cpin_2456 NR:ns ## KEGG: Cpin_2456 # Name: not_defined # Def: hypothetical protein # Organism: C.pinensis # Pathway: not_defined # 11 145 9 140 152 72 31.0 7e-12 MRKLNIVWKYCVLFFLLCTSCHIGDTKEKSALDRMVKQLKRISNYNKGTFTHIVVIPNVG CGGCISESEAFLKRNTNDSIFFVFTNISSLKALRLRMGDKLQQKNVYVDENNDFLFNDER IDSYPIVLQVNNPKKVEWDFLEPGNSFEQTLK >gi|226332025|gb|ACIB01000031.1| GENE 50 62612 - 63778 309 388 aa, chain - ## HITS:1 COG:no KEGG:Cpin_2455 NR:ns ## KEGG: Cpin_2455 # Name: not_defined # Def: hypothetical protein # Organism: C.pinensis # Pathway: not_defined # 40 388 50 401 404 93 26.0 2e-17 MTIIKHIILGTLLGGLVCSCSYKKNDRVHRTEDYSLIVKADKCFNLDSETVQTTEYLQLF KSGEEIIFSFVNEYDNSIVLYDYATGENMRKIKFEREGANGINSVTSYLIINEDSIYLYD RTTHLLSLANDRSIVKDKKRINIVRCLKGDSIFAPSELFPRTNSPILKIGDELLLSGTLF YEFEGENDSNRPVMAFYNLQKNTIRYSDSYPSMYHSGNWGGSFTYRFPYYTLSPNNELVI SFAADHNIRVHHVDSLQYHEFYAGTKEDIVIEPVEKSLDFEHFSPEADRDHYVHSLNYGC IHYDSYREVYYRLAGHPDSSIDPKEGVLRKPMSVTILDKNFQIVGETMLPQELYLLNQCF VGPDGFHIQVESEDDDIMRFKTFELLKL >gi|226332025|gb|ACIB01000031.1| GENE 51 63901 - 64146 265 81 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253564944|ref|ZP_04842400.1| ## NR: gi|253564944|ref|ZP_04842400.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 81 1 81 81 129 100.0 5e-29 MGKKILTAMIVAVVAVVAGYNIYAAQRTVGLSDIALTDTEALAACEVSVVHYNAGHCVKD VSLQNEYCAEYSGALPCYRTI >gi|226332025|gb|ACIB01000031.1| GENE 52 64393 - 66363 1283 656 aa, chain - ## HITS:1 COG:no KEGG:BF3158 NR:ns ## KEGG: BF3158 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 656 1 656 656 1386 99.0 0 MTIMKLRLGVRGMMWLTLVIMMWGIISCRTQEEKRLEEVLSLPLANKEELQKVLDHYKDD SLKYQAVCFLIRNMPFHAGYEGNALKHYYQYFDIYAQGKLGPHEVIDSLKENSFSVSQLK RIEDIANIDSSLLVQNVDWAFKVWREQPWGKNVSFDNFCEFVLPYRLGDEPLGFWREDIY KRYNPILDSIRLLPQAQDPLVAAKVLMDSLVVEQSHFTGLFPAGPHLGPSVVSWRAGSCR EFADLVVYVMRALGIPCGTDYMAMRGDNNVPHFWNFTLDKDGKTYITEFPDPNWKRAVSM YNPKAKVYRNTYGLNWKDVKRQQGKMMHPAFRKPLYQDVTAVYADSLNRDLVVSSDILCK EVHKGDIVYFCLSTRMDWVPIAWTVFEEDSLRFQDTEGSVIGCLATWNGKRLVMQSEPFT YDKMSGTIALLTPQSEKEDITLYFKFPLFCDLGILRMPGGVFEGSNDSQFRSADTLYYVK QWPFRLNNTIFPEKEKSYRYVRYKGPKGSYCNIAEMAFFEDTSDTLALKGRIIGTPGCFQ KDGSHDYYKVYDGNPYTYMDYKTPDEGWVGLDFGIPRRIKKFAYIPRNSDNFIHKGDVYE LFYWHDKKWNSLGRQVAKADSLNYVIPKGVALFLKNHTEGKDERIFKKTDGRQQFW >gi|226332025|gb|ACIB01000031.1| GENE 53 66528 - 68489 1091 653 aa, chain - ## HITS:1 COG:no KEGG:BF3159 NR:ns ## KEGG: BF3159 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 653 1 653 653 1362 99.0 0 MTGVAEKKKRMIKIYLLWVLLVGSLCCSCTGNKRLEYALEFAGENRGELEKVLEHYNDSG LKQDAARFLIENMPRYFSYEGWQLDTLKAIHAATEHTDGWVNKKDRKKWEHFSFRTLKKV YDAKVIKAEFLIHHIDQAFEVFEKRSWNKYLPFDDFCELILPYRIGDEPLEEWRGWYRER YESILDSLYQGTDVVEATDRLGAYLRQEKDFGYSVELDLPHLGAGFLLANRVGSCEASCD FTVYVLRALGIPAATDIYHYGPGKGAGHVWNVLRDTTGGYVPFWFIQTKVERGGSDKREK GKVYRRCFGAQQEKVSGIRRDRSVPFPLKDPYLKDVTSDYFPANQVTIEIDPQVDKKYIC LGVFTLEGCMPIDITVQKGNKATFMNVEPGILFQPLYDNGMKWVAAGYPFLVDEKGEVKY HKPDCAVKGSMDLNRKFLLRQYLKDYLSAVVGDKIEGANHSDFSDACLLHQIVDTPKVSY QVAYPQSRKRYRYIRYTSTPEKTLQLAELQLFRKVDDQEKITAKVIDGSNAFIADDRFDR FKVNDGDGLTFFLTKEKGAFVTLDLGKPEKIEKIVYMPRNDDNFIRLGDQYELFYQDGFR GWISLGRQVASELTLHYDNIPQNSVLWLRNLSRGREETVFRNEDGRQVFFVKW >gi|226332025|gb|ACIB01000031.1| GENE 54 
68636 - 69502 441 288 aa, chain - ## HITS:1 COG:no KEGG:BF2999 NR:ns ## KEGG: BF2999 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 288 1 288 288 582 99.0 1e-165 MKYLYVLLAFSFLFSCKDENKKHAESVLREWMNKEIVFPNKMYFSIQGKENVDFRIKDTE YKIVAYVDSAGCTSCKLHLSKWKELIHYVDSTQSERVQFLFFFFPKNGRDIYHTMRMDKF TYPVCVDTLDSFNKLNHFPDDVRFQTFLLNKENKVVAVGNPIHNPNIRDLFLNIISGGTS LPDEKRPQTEVKIEALSMDLGMFDWKKEQKCIFTVENTGKELLVIDDINTSCGCTTVEYS REPVQSGKTVDITVVYKAEYPEHFNKTITVYCNSPVSPLQLKIKGDAK >gi|226332025|gb|ACIB01000031.1| GENE 55 69512 - 70399 405 295 aa, chain - ## HITS:1 COG:no KEGG:BF3000 NR:ns ## KEGG: BF3000 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 295 43 337 337 589 99.0 1e-167 MLSKVEMDPMQVNARCMVWDGDKRVLVRTSTTDSIYAVFAYPEMKFLSYTGSLSEYKQIL AKCNEGFYLVKDDSLYLYHLTDKDLLQKTTTHFLYNSNKIRLSKIKKLNDKMYTAHAYTD PSYNDIRLNEFYMLDAENNILYPKGHYPERTEVRFKTIFDFKFAYAHEVWPKPDGSRILV NYVRTRRFRIYDLSARLLHDVCLDYASNKYVVDADPKRWTTFIRDCFVTDKYIYLLCPEG EQSSLVIVDWEGRPIARYRLDEKIFFFFIDPDRNLFCGINSNNGQSFYFLDLDIN >gi|226332025|gb|ACIB01000031.1| GENE 56 70515 - 71507 571 330 aa, chain - ## HITS:1 COG:no KEGG:BF3162 NR:ns ## KEGG: BF3162 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 330 1 330 330 675 100.0 0 MHKIILYIISVLTVFTSCTTTDVPDKVSLQPQVMNDSLLTTMPGDLLLIDDYLVWSDPFS DNKFLHVHRSSDGKYIGSMGQKGEGPQEFVSPLINRFSINRCIAAHDANGKTRGYLSIDS LIVGKEPFMSLSDFDRNIRMAKLDEQLYLTETENGENDYFKVSSNGKKSTFGVYPIREVK HHMGTYKTYDKDRGLLAFGPFNFSYLALYKKEGDNFKLLWERMPEKENYSVVDGAIRFDR SVMGVRDICMTKDYIVTLERDREVDPLDERTVGRNASKCPRTVFVYDYDGKLLKIVNLGM PVMRIAADGRSNALYVIGVNPDFALAKYDL >gi|226332025|gb|ACIB01000031.1| GENE 57 71575 - 71856 291 93 aa, chain - ## HITS:1 COG:no KEGG:BF3163 NR:ns ## KEGG: BF3163 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 93 1 93 93 152 100.0 5e-36 MKKKYYAALLAVVVIAFTGYNVYQSQKADASLSDLAMANVEALANGELSNGNCEGSWSQE 
CCKCDYIHYTYACAIEVTGNSCYTVSGCSHYTN >gi|226332025|gb|ACIB01000031.1| GENE 58 71893 - 72060 58 55 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSPFSEDVPVKLPFLKVVRSGKIASPSTSWLYPLLFEPVTLDFSQSGTPQWVGRY >gi|226332025|gb|ACIB01000031.1| GENE 59 72020 - 73081 554 353 aa, chain - ## HITS:1 COG:no KEGG:BF3164 NR:ns ## KEGG: BF3164 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 353 1 353 353 742 99.0 0 MKYCLTFLFLLVIFTGCTSDLPKDRMLYASFPKEETLHSKVIQLDSVYMRYPFRVHVSGD QAVVLDLHGTDVYCHLFHYPDFHYLSSFGRRGDSPEEMLSVETVKCIDGSFWTLDANKGE LTRFEFVSDRDSLLRAEAISFDKDSILRALDFVAFNDTTFLIPDYSGDSRFCWVNRQGKF LKKSGVIPSLNEEALKEARPALAQAWRSFIDYNPHNGVLVAATQLGEVLEIYNLQNDFHR VCLGPKGEPEFKLAGGYAIPDGIMGFSDVQVTDEAIYAVFHGHTFKEIMAQHQKEGRATD GGQYIYVFNLQGEPLCKYTLDRYITGFHVDERNKTITATDVNNDQPIVEFRFG >gi|226332025|gb|ACIB01000031.1| GENE 60 73088 - 73297 196 69 aa, chain - ## HITS:1 COG:no KEGG:BF3165 NR:ns ## KEGG: BF3165 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 69 1 69 69 104 100.0 8e-22 MKIKILAVLAVVTVVIVSMFMREKREDPSSLVMMNVEALATGEGTSPAMCVGYGSVVCPN DGSKVLYYY >gi|226332025|gb|ACIB01000031.1| GENE 61 73528 - 74631 377 367 aa, chain + ## HITS:1 COG:no KEGG:BF3166 NR:ns ## KEGG: BF3166 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 367 1 368 368 651 99.0 0 MNKRIFLVIGVIILFIMIAIGASTYIIHSLIQKEKEAFKPQVENILKEAVANNTIQKCKD IPLNGFNNSPNKIGTYETRTFCSRDTLFTYQHKIQDVDSEILFARQLGLLMMDSLQSSDI QALIIKDLNKNDIKGYINTGIIVSKHLQREIWSQPSNSIPRNAEMITYRLENEIVSVDYI MYIDYSFSTLWKRMPKTNIYINLVVEVILIYTITLFVLYYRKQQKNRSVSTVDITSDPNI ITDPISVDNTVETEKQTNSTIKEELSFKDQFVFEKDFVLFNDRPIKMPNQQQKILIFFLN RPNYRVNKHELKEEFWPKNSDPTNNMTSAINKLKKILEEINSKYTIITDKTNEEYYVLIR DKSAEKI >gi|226332025|gb|ACIB01000031.1| GENE 62 75117 - 75503 417 128 aa, chain - ## HITS:1 COG:no KEGG:BF3006 NR:ns ## KEGG: BF3006 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 128 1 128 128 232 
100.0 4e-60 MKLRVILSLIVVLFIGQSMCAMSTQILRRPIVLDGEIIEEGNRSINPLIPISADIDGTTL FIEFTKVIGNVDITVKDDTKKEVYSSSVDVTAANQATSFSIADLAPGTYLLEFTNSNGGY VYGQFIVE >gi|226332025|gb|ACIB01000031.1| GENE 63 75669 - 77429 606 586 aa, chain - ## HITS:1 COG:no KEGG:BF3007 NR:ns ## KEGG: BF3007 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 586 3 588 588 1104 100.0 0 MNTHSLFGYLFIALFSLLVVSCYSTPDGVMSSLSQAEKIMESRPDSAMAILQHIPTPETL HGKAQADYSLLMTQAMDKNYINFTSDSLIKFAVGYYGGHTEDLVAKGKSFYYYGRVMESL DKVEDAMTFYLKAKDVLQSSDQFKLLGLVTEKIGDLNRRQKLLDAALNDYKESFDFYASI PDSLCMLYAYRNLGRGFLYKNQIDSAYYYYDKALYILNLKKYSAVGSILLELGVIHRSEK DYVGAEQYFLSFIEKEKDPEKLFSGYLALGNLYLYMDRLKDAERYLLLCLGSSNLVIKRD ACECLYDLEKELNNFKGAIGYKDIADSLRIITQDIDIQNSIATLQSRYNSEKWQRESLQS SIEKKNILLISSFVSFIAIMVIIYIYYKYRTNQKLVKDINERIRKNDADIKMYQRQILNY QDLQRETLQDYRNQIGELHGKMSVLEDQNKALSLRLTEKKHDIPESEADDLYAIYMQALH ILIMLRGKNIENTSGKKLLLDADWDKLFHLSNAIHGDFITRIKNDFPTLTKHDIEICCLL RFGIEHEVLGSIFLTETDSVTKAKRRMKKRLNLSASDDLDVFLLKY >gi|226332025|gb|ACIB01000031.1| GENE 64 77473 - 78201 625 242 aa, chain - ## HITS:1 COG:no KEGG:BF3169 NR:ns ## KEGG: BF3169 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 242 1 242 242 436 100.0 1e-121 MNKVLPFLLLLFVFTSCSRKYKIEGASSVTSLDGKMLFIKVLQNGEWLNIDSAEVVHGLF SMKGKVDSVVMATLYIGDESIMPLVIEKGNIQVSITNTELVAKGTALNNALYAFIDKKNS LDVQIEELQRKEARMVMDGADLADIHEQLTHEGDSLMQDMNGFIKKFISDNYETVLGPSV FMMLCSTLPYPVMTPQIEDIMKDAPYSFKNNKLVKDFITKAKSNMELIEEHQRMEQNATL NH >gi|226332025|gb|ACIB01000031.1| GENE 65 78440 - 79675 1088 411 aa, chain + ## HITS:1 COG:MA2647 KEGG:ns NR:ns ## COG: MA2647 COG0641 # Protein_GI_number: 20091470 # Func_class: R General function prediction only # Function: Arylsulfatase regulator (Fe-S oxidoreductase) # Organism: Methanosarcina acetivorans str.C2A # 11 400 9 393 446 407 47.0 1e-113 MSTYAPFAKPLYVMLKPVGAVCNLACDYCYYLEKSRLYQENPKHVMSDELLEKFIEQYIN SQTMPQVLFTWHGGETLMRPLSFYKKAMELQKKYARGRSIDNCIQTNGTLLTDEWCEFFR 
ENNWLVGVSIDGPQEFHDEYRKNKLGKPSFVKVMNGINLLKKHGVEWNAMAVVNDFNADY PLDFYHFFKELGCHYIQFAPIVERIFPHQDGRHLASLAQREGGELAEFSVTPEQWGNFLC TLFDEWVKEDVGDYYIQLFDSTLANWVGEQPGVCSMAKTCGHAGVMEFNGDVYSCDHFVF PEFKLGNIYNQTLVEMMYSERQTAFGQMKQKSLPTQCKECEFLFACNGECPKNRFCRTAN GEPGLNYLCKGYHQFFKHVAPYMDFMKNELMNQRPPANVMDAIKENKLIID >gi|226332025|gb|ACIB01000031.1| GENE 66 79717 - 81882 2211 721 aa, chain + ## HITS:1 COG:no KEGG:BF3010 NR:ns ## KEGG: BF3010 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 721 1 721 721 1442 100.0 0 MNRLKLYLLALTALAVCSAKADEGMWLLQLMQQQHSIDMMKKQGLKLEAQDLYNPNGVSL KDAVGIFGGGCTGEIISPEGLILTNHHCGYASIQQHSSVEHDYLTDGFWATSRDKELPTP GLKFTFIERIEDITDIVNLRIAAKEITESESFSSTFLNKLAKELFEKSDLKGKKGIVPQA LPFYAGNKFYMFYKKVYPDVRMVAAPPSSIGKFGGETDNWMWPRHTGDFSMFRIYADANG EPAEYSASNVPLKTKKHLNISIKGLKEGDYAMIMGFPGSTSRYLTVSEVKERMEASNAPR IRIRGTRQDVLKEAMNASDKVRIQYANKYAGSSNYWKNSIGMNKAIIDNNVLGTKAEQEA KFAKFAKEKNNTDYMNVVAKIDEAVAKTSPIKYQQTCLTETFFGGIEFGSPFMVMDKLKE ALEQKNDSSIEANIKVLKEVFNDIHNKDYDHEVDRKVAKALLPLYAEMIPAGQRPAIYDV IEKEYKGDYNAYVDAMYDTSILANQANFDKFIKKPTVKAIEKDIATQYSRAKFDKYTNLA EQMGKLPEELALLHKTYIRGLGEMKLPVPSYPDANFTIRLTYGNVKPYSPKDGVYYKYYT TTDGILEKENPEDREFVVPAKLKELIEKKDFGRYALPNGEMPVCFLSTNDITGGNSGSPV LNENGELIGCAFDGNWESLSGDINFDNNLQRCINLDIRYVLFILEKLGGCGHLINEMTIV E >gi|226332025|gb|ACIB01000031.1| GENE 67 81885 - 84050 1809 721 aa, chain + ## HITS:1 COG:no KEGG:BF3011 NR:ns ## KEGG: BF3011 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 721 2 722 722 1494 100.0 0 MKRNLLSAAFALMALAVSADEGMWMLTDLKAQNEAAMMDLGLQIPIEEVYNPDGIALKDA VVHFGGGCTGEIISAEGLVLTNHHCGYGAIQQHSSVDHDYLTNGFWAMNRNEELPCKGLT VTFIDRILDVTTYVNEQLKKDDDPNGINYLSPKYLATVADRFAKAENIQITPATRLELKP FYGGNKYYLFVKTVYNDIRMVGAPPSSIGKFGADTDNWMWPRHTGDFSLFRIYADKNGQP AEYSKDNVPLQVKKHLTISLAGVKEGDFTFVMGFPGRNWRYMISDEVKERMQTTNFMRHH VREARQAVLMDQMLKDPAVRIHYASKYASSANYWKNAIGMNEGLVRLKVLDTKEKQQEQL LAMGREKGDDSYQKAFDEIRSIVAHRHDAMYHQQAISEALVTALDFMKIPSTDGLKKALE 
SKNATKIKEETDKLKAEADKYFASVPFPEVERLVGKKMLETYAGYIPEDQQIGIFKVIDS RFKGNKDAFIDACFKYSIFGSKENFNKFIAHPTLNKLDKDWMILFKYSITDGLLKTALAM KDANKNYDAAHKVWVKGMMDMRQVAGTPIYPDANSTLRLTYGQVLPYEPADGTVYNYYTT LKGVMQKEDPDNWEFVVPQKLKQLYHAKDFGHYAMENGEMPVCFIVNTDNTGGNSGSPVF NGKGQLIGTGFDRNYEGLTGDIAFRPSSQRAAVVDIRYTLFIIDKYAGASHIIKELDIVE E >gi|226332025|gb|ACIB01000031.1| GENE 68 84157 - 84339 249 60 aa, chain - ## HITS:1 COG:no KEGG:BF3012 NR:ns ## KEGG: BF3012 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 60 1 60 60 105 100.0 5e-22 MKLNKTDYMLERTSDGGYYAWLTVNMQCNAYGDSPEEAVKNLEQTMEDLVEEMYLVEDFI >gi|226332025|gb|ACIB01000031.1| GENE 69 84336 - 85460 709 374 aa, chain - ## HITS:1 COG:MTH884_2 KEGG:ns NR:ns ## COG: MTH884_2 COG1819 # Protein_GI_number: 15678904 # Func_class: G Carbohydrate transport and metabolism; C Energy production and conversion # Function: Glycosyl transferases, related to UDP-glucuronosyltransferase # Organism: Methanothermobacter thermautotrophicus # 2 361 1 345 348 78 22.0 3e-14 MKFLFIVQGEGRGHFTQAITLEDMLLRNGHQVVEVLVGKSSSRTLPGFFNRSIQAPVKRF TSPNFLPTAENKRADLKKSFAYNLIHVPEYFRSMCYINQRIKETGAEVVINFYELLTGLT YALFRPSVPYVCIGHQYLFLHNHFEFPRKSVIQLSMLRFFTRMTSLRASRRLALSFRKME SDRTERISVVPPLLRREVTAMQPEQGNYIHGYMVNSGFADSVEAFHALHPEIPMHFFWDK QDADEVTKVDATLSFHQIDDVKFLNRMAGCRAYASTAGFESICEAMYLGKPVLMVPAHIE QDCNAYDARQAGAGIIGESFDLESLLRFAGTYVPNREFIRWVRSCERQIIGELERLADQH SAVTVPTLTNYFPI >gi|226332025|gb|ACIB01000031.1| GENE 70 85457 - 86263 630 268 aa, chain - ## HITS:1 COG:CC3344 KEGG:ns NR:ns ## COG: CC3344 COG2908 # Protein_GI_number: 16127574 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Caulobacter vibrioides # 6 243 9 244 281 197 43.0 2e-50 MHLRTYYPTVVLSDIHLGTQHSKTEEVTHFLKSINCDRLILNGDIIDGWHLQKSGLGKWK AKHTDFFKVIMKMMENFGTQVIYVRGNHDDFLDNLAPLNFYNIRIVKDCIYESHGRRYYV THGDIFDTVTTQMKWLAKLGDTGYTFLLWLNKVYNLRRTKQGKPYYSLSQSIKNRVKTAV SYISDFEKELVGLARAKKCDGVICGHIHHPANTFYEDIHYLNSGDWVETLSALTEDEDGN 
WTIRYFDSGLLKEDNHKEKQTISITIAS >gi|226332025|gb|ACIB01000031.1| GENE 71 86438 - 86734 370 98 aa, chain + ## HITS:1 COG:DRA0164 KEGG:ns NR:ns ## COG: DRA0164 COG0526 # Protein_GI_number: 15807833 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Deinococcus radiodurans # 4 96 45 137 142 94 41.0 5e-20 MEKFEDLIQSQSPVLVDFFAEWCGPCKAMKPILEDLKQQVGEKARIVKIDVDTHEELAVK YRIQAVPTFILFKKGEAVWRHSGMIQANELKGVIEQYT >gi|226332025|gb|ACIB01000031.1| GENE 72 86744 - 87211 419 155 aa, chain + ## HITS:1 COG:BB0061 KEGG:ns NR:ns ## COG: BB0061 COG0526 # Protein_GI_number: 15594407 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Borrelia burgdorferi # 39 151 3 115 117 115 42.0 4e-26 MKKVLSLVALAMISTIMFAVNDGVKADQNKKEAKSGEVIVMNKEMFINDVFDYQNSKEWK YKGDKPAIIDLYADWCGPCRMTAPIMKSLAKEYDGKIVIYKVNVDKEKELAALFNATSIP LFVFIPMEGEPQLFRGAADKATYKKAIDEFLLKQK >gi|226332025|gb|ACIB01000031.1| GENE 73 87592 - 88227 521 211 aa, chain - ## HITS:1 COG:PAB1763 KEGG:ns NR:ns ## COG: PAB1763 COG0778 # Protein_GI_number: 14521107 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Pyrococcus abyssi # 24 208 3 192 196 90 34.0 2e-18 MKKLQFLVCFLLLSVASYAAERTIQLPKPDMNRAGLLMKALSERHSTREYASKALSNTDL SDLLWAANGINRSSEGKRTAPSAMNRQDIDIYVVLPQGTYLYDAKGHKLNLISEGDHRSA VAGGQAFVNNAPVSLVLVSDLSKLGDAKSNHVQLMGAMDAGIVSQNISLFCSAARLATVP RASMDLARLKTALKLKDTQMPMMNHPVGYFK >gi|226332025|gb|ACIB01000031.1| GENE 74 88365 - 88922 728 185 aa, chain - ## HITS:1 COG:CAC3598 KEGG:ns NR:ns ## COG: CAC3598 COG1592 # Protein_GI_number: 15896832 # Func_class: C Energy production and conversion # Function: Rubrerythrin # Organism: Clostridium acetobutylicum # 1 182 1 180 181 229 69.0 2e-60 MKKFRCTVCGYVCEGDAAPEKCPLCKAPASKFVEVVEEEGGALTFVDEHVIGVAKGCDEE 
MIKDLNNHFMGECTEVGMYLAMSRQADREGYPEVAEAFKRYAWEEAEHASKFAELLGDCV WDTKTNLEKRMNAEAGACEDKKRIATRAKALNLDAIHDTVHEMCKDEARHGKGFEGLYNR YFGKK >gi|226332025|gb|ACIB01000031.1| GENE 75 88944 - 89381 295 145 aa, chain - ## HITS:1 COG:FN2045 KEGG:ns NR:ns ## COG: FN2045 COG0735 # Protein_GI_number: 19705335 # Func_class: P Inorganic ion transport and metabolism # Function: Fe2+/Zn2+ uptake regulation proteins # Organism: Fusobacterium nucleatum # 11 138 12 137 142 114 43.0 5e-26 MERRKKMESYNRLLEHNIKPSMQRIAIMNYLMEHKTHPSADEIYTELSPSMPTLSKTTVY NTLRLFSEQGAAQMLTIDERNTNFDADTSQHAHFLCKRCGRIYDLKCQVEMKQVEGLQMD GHEVSEVHYYYKGVCKKCLNDIRID >gi|226332025|gb|ACIB01000031.1| GENE 76 89533 - 89880 389 115 aa, chain - ## HITS:1 COG:PA0563 KEGG:ns NR:ns ## COG: PA0563 COG3152 # Protein_GI_number: 15595760 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Pseudomonas aeruginosa # 7 115 8 116 117 103 44.0 8e-23 MKEYIDVLKKWKDFDGRARRREYWMFVLFMAIFAIVASIIDAILGTICVFVGIYYLAMLL PMIAVSIRRMHDIGKSGWWLFITFVPVIGSLWYLFLTIQDGQPGSNQYGENPKGI >gi|226332025|gb|ACIB01000031.1| GENE 77 90061 - 92172 2185 703 aa, chain - ## HITS:1 COG:XF1944 KEGG:ns NR:ns ## COG: XF1944 COG0339 # Protein_GI_number: 15838538 # Func_class: E Amino acid transport and metabolism # Function: Zn-dependent oligopeptidases # Organism: Xylella fastidiosa 9a5c # 6 703 1 715 716 567 43.0 1e-161 MKKMLMAAGMAAVMTACGTAGQKAATDAGNPFLAEYSTPFGVPPFDLIKIEHYKEAFLKG MEEQKKEIDAIVNQRSVPDFDNTIAAFDQSGELLNKVSTVFSGLNSCNTNDEMQAFNKEI TPLLSAHRDDISLNPALFARVKEVYERREKLGLDKEQNKLLEETYKKFVRGGANLDSVDQ AKLRQLNSEISMLQLTFGQNLLKETNAFELVIDKKEDLAGLPESLVASAAEAAKGAGMEE KWLFTLHNPSVMPFLQYADNRELREKIFKGYINRGNNGNEADNNEIVKKLVALRLEKAKL MGYADYASFILEDRMAKNEENVYRLLNQIWTPAVAKAKEELFDIQAEIKKEGANFTPEGW DWRYYFEKAKKAKFSLDENEVRPYLELNNVREGAFYVANRLYGITFTEIKDIPKPHEEAQ AFECKDKDGTHLGVLYMDFFPRNSKRGGAWCGTYRSQTYRDGKRLAPVVTIVCNFTKPSS GQPALLSADEAGTLFHEFGHALHNLFKDVHFHAVSGVPRDFVELPSQVMEHWVFEPEVLK IYAKHYRTGEVIPAALIEKLDKSGKYGQGFATTEYLAASLLDMDYHVLKEIPRNMDVTEF 
EAAVLKERGLLSQIPPRYRTTYFNHIMNSGYTAGYYSYIWAEVLDSDAFEAYKETGDLFN QEVASRFRRYILTPGGIDDAMDMYKNFRGKEPGIEPLLRNRGL >gi|226332025|gb|ACIB01000031.1| GENE 78 92283 - 94208 1549 641 aa, chain + ## HITS:1 COG:CAC1050_2 KEGG:ns NR:ns ## COG: CAC1050_2 COG0171 # Protein_GI_number: 15894337 # Func_class: H Coenzyme transport and metabolism # Function: NAD synthase # Organism: Clostridium acetobutylicum # 327 634 2 309 310 456 67.0 1e-128 MNYGFVKVAAAVPRVKVADCKFNSERLEGLITIAEGKGVQILTFPEMCITGYTCGDLFAQ QLLLEQAEMALIQILNSTRQLDIISILGMPVVVNSTVINAAVVIQKGKILGVVPKTYLPN YKEFYEQRWFTSALQVSENSVRLCGQIVPMGNNLLFETAETTFGIEICEDLWATVPPSSS LALQGAEIIFNLSADDEGIGKHNYLCSLISQQSARCISGYVFSSSGFGESTTDVVFAGNG LIYENGYLLARSERFCLEEQLIINEIDVECIRAERRVNTTFAANKANCPGKEAIRISTEF VNSKDLNLTRTFNPHPFVPQGSELNSRCEEIFSIQIAGLAQRLLHTGAKTAVIGISGGLD STLALLVCVKTFDKLGLSRKDILGITMPGFGTTDRTYHNAIDLMNSLGVSIREISIREAC IQHFKDIGHDLNIHDVTYENSQARERTQILMDIANQTWGMVIGTGDLSELALGWATYNGD HMSMYGVNAGIPKTLVKHLVQWVAENGMDEASKATLLDIVDTPISPELIPADENGEIKQK TEDLVGPYELHDFFLYYFLRFGFRPSKIYFLAQTAFSGVYDDETIKKWLQTFFRRFFNQQ FKRSCLPDGPKVGSISISPRGDWRMPSDASSAAWLKEIAEL >gi|226332025|gb|ACIB01000031.1| GENE 79 94278 - 94814 530 178 aa, chain + ## HITS:1 COG:no KEGG:BF3183 NR:ns ## KEGG: BF3183 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 178 1 178 178 309 99.0 4e-83 MKRHLILAFALLASSVALNAVNSLPDDDKSDNKNKTELNSVVKKTWEFYSAIKQPSADAL ANAGNYKFGQEAGYLYNQFMKIYVVREEVVPGDPTRRTVIRKPTIYNAVRSIEKQLNKEL KSNKMTREQVAAEFTNVLKVAISAYDSESESFEDALQTNRKNATDLLSVFQNVKLTEI >gi|226332025|gb|ACIB01000031.1| GENE 80 94920 - 98075 2908 1051 aa, chain + ## HITS:1 COG:no KEGG:BF3184 NR:ns ## KEGG: BF3184 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1051 1 1051 1051 2025 99.0 0 MNSKFLLLLCSMLLCTSLAFAQSIKVTGTVTDKMGAVIGATIMVKNSSNGTVTDIDGRYS IEVPKNATLLFSFVGYSTVEKEVGNNTVINVELSDDIQAIDEVVVTAIGIKQQKKKIGYT TQQINSEVLNATPSLNVGSALSGQVAGLLVANPTGIFQAPSFKLRGNAPLVVLDGVPVET 
DFFDISSENIESVNVLKGTAASALYGSRGKNGAILITSKTAKKEGLEINFSTNNMITAGF AVLPETQHQYGSGSNGKYEFWDGADGGISDGDMTWGPKLNVGTKVAQWNSPIRDKVTGKE IPWWGDVKGTQYDDKSRYERIPIDWVSHDNLKDFLQTGLVTNNNISIAYKGEKARYFVTG QYAYQKGQVPSTEMHSGGINFNSTFDLAKNLQLDANLAYNKIVAPSYPRYGYGPKNHMYT IVVWMGDDVNGKELQKHKYVPGQEGYRQASYNYAWYNNPYFAAEELQQSESRDVVNGQLR LNYQILPNLNIQGRAALRQKTILQEMKVPKTYMNYGDSREGDYKVWNDRQTNVDADVLAT YTQDLTPDILFTLNAGTSVFYRNYRQEYQSTDGLIVPFVYSIKNTQGPSITDANRNEKSI RSIYGSINLDLYKYAYLTLTGRNDWSSTLAKGSNSYFYPSVALSTMVSEYIKLPTFMDYL KMYGSWAVVSTDLSPYQIMSTYTKDSNYGSNPSISYPSSLVNYYIKPQKTTSWEAGLSTA FFRNRLSFDLTYYHTIDENQIIDLNISNASGFTSRKVNGNQYTTNGWEIMANVQAIKNKD FQWDFSLNWSKSVKKLTEIYGGQKKFGDLKVGDRADAFYGSQWQKSADGELILDENGMPT KDAYKQYLGHLDPNFRMGMQNTFRYKDFTLSVDLDGAYKGVIYSVLSEKLWWGGKHPESV EYRDAQYAVGHPIYVPNGVVVTGGELKRDIDGNVISDTRTYKRNTTAVDWQQWCQNYPYQ AYVSSKENAKFANVFDRSYIKLRRVALTYNFTKLLSKQSPVKGLTATVFGNNLAVWKKVP FVDPDYTGDSNDGGANDPTARYIGMGVNIKF >gi|226332025|gb|ACIB01000031.1| GENE 81 98106 - 99578 1299 490 aa, chain + ## HITS:1 COG:no KEGG:BF3185 NR:ns ## KEGG: BF3185 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 490 1 490 490 983 99.0 0 MKKIILLIVSVWMCVSCGNLEEMNIDPDNATQTHPKLLLTQICMNAFKRGTDGMYATKKV IQADGESADQYYKWTRGSFGYYDNLRNVQKMGEEAERINAPVYTALTKFFRAYYFYELTL RFGDIPYSQALKGEKEEIYTPEYDAQEDVFAGILQELREADEILANDASVIDGDIIYNGN STQWRKLINSFRLKVLMTLSNHTTVGNINIASEFKNIATNSPLMNSLADNGQLVYLDQQG NRYPQFNAQWSGYYMDDTFIQRMRARRDPRLFIFSAQTNKGKTEGKPIDDFSSYEGGDPA APYSDAIIKVSEGTISPINDRFRTDPIVEPTMLMGYAELQQILAEAVVRGWISGNAQTYY EKGIRASFSFYETHAKDYAGYLNENAVAQYLKEPLVDFTQASGTEEQIERIIMQKYLVTF YQGNWDSFYEQLRTGYPDFRRPAGTEIPKRWMYPQGEYDNNGTNVETAITRQFGAGNDKI NQATWWQKKS >gi|226332025|gb|ACIB01000031.1| GENE 82 99611 - 100615 776 334 aa, chain + ## HITS:1 COG:CAC2806 KEGG:ns NR:ns ## COG: CAC2806 COG1409 # Protein_GI_number: 15896061 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # Organism: Clostridium acetobutylicum # 1 311 2 312 317 169 35.0 5e-42 
MKKIILSSVLLLSGFFIQAQQAPDKISFNSNGEFKIAQFTDMHLGHDQEKDRIVGDMIKE VLDSEKPDLVIFTGDNTTMDEVRQAWEAISAELSARRIPWTAVLGNHDDEYAVKRDEIIR IIREQPYCMMKQVAEGIKGEGNHILPIYSSKDGNKTAALLYCLDTNAYSKIKTVKGYDWI GRSQIDWYSRESRKYTERNEGQPLPALTFLHIPLPEYTQAWESFETKRYGDRNEKECSPN INSGMFANMLECGDVMGVFAGHDHVNDYIATLYNIALGYGRASGGKNTYGDKTPGSRIIV LKEGKREFDTWLREKGNMAKLNVCTYPGSFVKEK >gi|226332025|gb|ACIB01000031.1| GENE 83 100754 - 101878 954 374 aa, chain - ## HITS:1 COG:XF2217_2 KEGG:ns NR:ns ## COG: XF2217_2 COG0131 # Protein_GI_number: 15838808 # Func_class: E Amino acid transport and metabolism # Function: Imidazoleglycerol-phosphate dehydratase # Organism: Xylella fastidiosa 9a5c # 184 374 5 211 211 221 52.0 1e-57 MKKKVLFIDRDGTLVIEPPVDYQLDSLEKLEFYPKVFRNLGFIRSKLDFEFVMVTNQDGL GTSSFPEETFWPAHNLMLKTLAGEGITFDDILIDRSMPEDCASTRKPRTGMLTKYISNPE YDLEGSFVIGDRPTDVELAKNIGCRAIYLQESIDLLKEKGLETYCALATTDWDRVAEFLF AGERRAEIRRTTKETDILVALNLDGKGTCDISTGLGFFDHMLEQIGKHSGMDLTIRVKGD LEVDEHHTIEDTAIALGECIYQALGSKRGIERYGYALPMDDCLCRVCLDFGGRPWLVWDA EFKREKIGEMPTEMFLHFFKSLSDAAKMNLNIKAEGQNEHHKIEGIFKALARALKMALKR DIYHFELPSSKGVL >gi|226332025|gb|ACIB01000031.1| GENE 84 101884 - 102921 897 345 aa, chain - ## HITS:1 COG:YPO1547 KEGG:ns NR:ns ## COG: YPO1547 COG0079 # Protein_GI_number: 16121820 # Func_class: E Amino acid transport and metabolism # Function: Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase # Organism: Yersinia pestis # 4 341 7 352 382 221 37.0 1e-57 MKTLQELTRPNIWRLKPYSSARDEYSGAAASVFLDANENPYNLPHNRYPDPMQRDLKLEL SKIKKVAPAHIFLGNGSDEAIDLVFRAFCEPGRDNVVAIDPTYGMYQVCADVNDVEYRKV LLHDDFQFSADELLAVADERTKMIFLCSPNNPTGNDLLRSEIIKVINDFEGLVILDEAYN DFSDEPSFLSELDKYPNLIILQTFSKAFGCAAIRLGMAFASEGIIGVLNKIKYPYNVNQL TQQQAIEMLHKYYEIERWVKTLKEERGYLEEAFVELPCVLQVFPSNANFFLARVTDAVKI YNYLVGEGIIVRNRNSISLCGNCLRVTVGTRAENAKLIGALKKYQ >gi|226332025|gb|ACIB01000031.1| GENE 85 102918 - 104204 1242 428 aa, chain - ## HITS:1 COG:ECs2821 KEGG:ns NR:ns ## COG: ECs2821 COG0141 # Protein_GI_number: 15832075 # Func_class: E Amino acid transport and 
metabolism # Function: Histidinol dehydrogenase # Organism: Escherichia coli O157:H7 # 11 428 16 432 434 407 52.0 1e-113 MKLIKYPDRSQWNEILKRPILETENLFDTVRNIINRVRAGGDRVVMEYEAVFDKAELTSL AVTSAEIEEAEKEVPIELKAAIYLAKRNIETFHSAQRFEGKKVDTMEGVTCWQKAVAIEK VGLYIPGGTAPLFSTVLMLAIPAKIAGCKEIVLCTPPDKNGKVHPAILFAARLAGVSKIF KVGGVQAIAAMAYGTESIPKVYKIFGPGNQYVTAAKQLVSLRDVAIDMPAGPSEVEVLAD ESANPVFVAADLLSQAEHGVDSQAMLVTTSEKLQTEVVYEVERQLGYLTRRDIAEKSLAN SKLILVKDMEEALELTNAYAPEHLIIETKDYMEVAGQIVNAGSVFLGAFSPESAGDYASG TNHTLPTNGYAKAYSGVSLDSFIRKITFQEILPSGMSAIGPAIEVMAANEHLDAHKNAVT VRLEEIRK >gi|226332025|gb|ACIB01000031.1| GENE 86 104240 - 105091 928 283 aa, chain - ## HITS:1 COG:PM1195 KEGG:ns NR:ns ## COG: PM1195 COG0040 # Protein_GI_number: 15603060 # Func_class: E Amino acid transport and metabolism # Function: ATP phosphoribosyltransferase # Organism: Pasteurella multocida # 2 282 7 298 299 251 47.0 1e-66 MLRIAVQAKGRLFEETMALLEESDIKLSTTKRTLLVQSSNFPVEVLFLRDDDIPQSVATG VADLGIVGENEFVERQEDAEIIKRLGFSKCRLSLAMPKDIEYPGLSWFNGKKIATSYPGI LDAFMKSNGVKAEVHVITGSVEVAPGIGLADAIFDIVSSGSTLVSNRLKEVEVVMRSEAL LIGNKNMSKEKKEILDELLFRMDAVKTAEDKKYVLMNAPKDKLEDIIAVLPGMKSPTVMP LAQDGWCSVHTVLDEKRFWEIIGKLKALGAEGILVLPIEKMII >gi|226332025|gb|ACIB01000031.1| GENE 87 105341 - 105832 415 163 aa, chain - ## HITS:1 COG:no KEGG:BF3191 NR:ns ## KEGG: BF3191 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 163 1 163 163 333 100.0 1e-90 MKKIINPWKGMEGYNCFGCAPNNEAGVKMEFYEDNDEVISIWRPRPEYQGWIDTLHGGIQ AVLLDEICAWVILRKLQTTGVTSKMETRYRKSISTNDSHVVLKAHIKEVKRNIVIIEARL YNKDEELCTEALCTYFTFPKEKAREEMHFLSCEVEDEEILPLI >gi|226332025|gb|ACIB01000031.1| GENE 88 105954 - 106661 380 235 aa, chain + ## HITS:1 COG:RSc2208 KEGG:ns NR:ns ## COG: RSc2208 COG1741 # Protein_GI_number: 17546927 # Func_class: R General function prediction only # Function: Pirin-related protein # Organism: Ralstonia solanacearum # 8 233 7 232 232 168 38.0 8e-42 MKTVVDKASSRGYFNHGWLKTHHTFSFANYYNPSRMHFGVLRVLNDDSVDPEMGFDTHPH 
QNMEVISIPLKGYLRHGDSVKNTRTITPGDIQVMSTGKGIFHSEYNGSDKEQLEFLQIWV FPRIENTEPEYNNYDIRPLLKRNELALIISPDGKVPASIKQDAWFSMGTFDAGKSFEYKL HQEGNGVYLFIIEGDVEVAGNRLSRRDGIGLWDTKSFKVEITQEATLLLMEVPMR >gi|226332025|gb|ACIB01000031.1| GENE 89 106746 - 108755 1852 669 aa, chain - ## HITS:1 COG:NMA1719 KEGG:ns NR:ns ## COG: NMA1719 COG4232 # Protein_GI_number: 15794612 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol:disulfide interchange protein # Organism: Neisseria meningitidis Z2491 # 73 577 68 552 613 103 23.0 2e-21 MKKLSSFLLLLLVVFTAQAQIQEPVKFKTELKTLSGAEAEIVFTGTIDAGWHVYSTDLGD GGPISATFNVEKMSGAEVVGKLTPRGKEVSDFDKLFEMKVRYFEKTAQFIQKIKFTGSDY SIEGYLEYGACNDENCLPPTQVPFKFSGKAAATAEVSAKETPATPVKEPVATVTDSIVEP TATTVTTAIGSVDLWKPVINDLKKFGEANSQEDMSWIYIFITGFLGGLLALFTPCVWPII PMTVSFFLKRSKDKKKGIRDAWTYGASIVVIYVALGLAITLIFGASALNALSTNAVFNIL FCLMLIVFAASFFGAFELTLPAKWSTAVDSKAEATSGLLSIFLMAFTLSLVSFSCTGPII GFLLVQVSTTGSVVAPAIGMLGFAIALALPFTLFALFPSWLKSMPKSGGWMNVIKVTLGF LELAFALKFLSVADLAYGWRILDRETFLALWIVIFALLGFYLLGKIKFPHDDDDTKVSVS RFFMALVSLAFAVYMVPGLWGAPLKAVSAFAPPMKTQDFNLYTNEVHAKFDDYDLGMEYA RQHNKPVMLDFTGYGCVNCRKMELAVWTDPKVSSIINNDYVLITLYVDNKTPLTEPVKIM ENGTERTLRTVGDKWSYLQRVKFGANAQPFYVLIDNEGNPLNKSYAYDEDISKYINFLQT GLENYRKEK >gi|226332025|gb|ACIB01000031.1| GENE 90 108856 - 109155 467 99 aa, chain - ## HITS:1 COG:no KEGG:BF3194 NR:ns ## KEGG: BF3194 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 99 1 99 99 189 100.0 2e-47 MVKHIVLFKLRDDVPVEEKLVVMNSFKEAIEALPAKISVIRKIEVGLNMNPGETWNIALY SEFDNLDDVKFYATHPEHVAAGKILAETKESRACVDYEF >gi|226332025|gb|ACIB01000031.1| GENE 91 109257 - 109871 480 204 aa, chain + ## HITS:1 COG:BH1275 KEGG:ns NR:ns ## COG: BH1275 COG0572 # Protein_GI_number: 15613838 # Func_class: F Nucleotide transport and metabolism # Function: Uridine kinase # Organism: Bacillus halodurans # 2 201 6 205 211 219 54.0 3e-57 MLIIGIAGGTGSGKTTVVRKIIESLPAGEVVLLPQDSYYKDSSHVPVEERQNINFDHPDA 
FEWSLLSKHVALLKEGKCIEQPTYSYLTCTRQPETIHIEPREVVIIEGILALCDKKLRNM MDLKIFVDADPDERLIRVIQRDVVERGRTAEAVMERYTRVLKPMHLQFIEPCKRYADLIV PEGGSNQVAIDILTMYIKKHIGRP >gi|226332025|gb|ACIB01000031.1| GENE 92 109868 - 111259 1152 463 aa, chain + ## HITS:1 COG:VC0866 KEGG:ns NR:ns ## COG: VC0866 COG4623 # Protein_GI_number: 15640882 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein # Organism: Vibrio cholerae # 35 456 34 460 530 182 32.0 2e-45 MKRHLIIYSLLFLLFCVLSCRNKQAVATEESSAHDLEQIKDSGELVVLTLYSSTSYFIYR GQDMGFQYELSEQFAKSLGVKLRIEVAKNVPELIRKLLNGEGDIIAYNIPITKELKDSLI YCGEEVITHQVIVQRTNGKTKPLKDVTELVGKNIYVKPGKYYERLVNLNKELGGGILIHQ VTNDSITAEDLITQVAQGKIPYTVADNDVAKLNATYYPNLNTSLSISFDQRASWAVRKDC PQLAAAADEWHKQNMTSPAYTASMKRYFEISKAMPHSPILSLKEGKISHYDNLFKKYAQE IGWDWRLLASLAYTESNFDTTAVSWAGAKGLMQLMPATARAMGVPPGKEQNPEESIKAAV KYIAATDRSLSMVPDKQERIKFILASYNAGLGHIFDAIALADKYGKNKTVWTDNVENYIL LKSNEEYFTDPVCKNGYFRGIETYNFVRDINSRYESYKKKIKS >gi|226332025|gb|ACIB01000031.1| GENE 93 111319 - 112872 1232 517 aa, chain - ## HITS:1 COG:MTH1856 KEGG:ns NR:ns ## COG: MTH1856 COG0591 # Protein_GI_number: 15679844 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Na+/proline symporter # Organism: Methanothermobacter thermautotrophicus # 1 513 1 512 526 488 50.0 1e-138 MNTFTLGLIVIAYLLSLAYLGFLGYKKTSSASDYLVGGRQMNPFVMALSYGATFISASAI VGFGGVAAAFGMGIQWLCFLNMFIGVVIAFIFFGLRTRRMGAKLNVSTFPQLLGRHYRSR GIQVFVAAVIFLGMPLYAAVVMKGGAVFIEQIFQIDFNISLLIFTLVIAAYVIAGGMKGV MYTDALQAVIMFGCMLFLLFSLYRVLDMGFTEANQALTDIAPLVPEKFKALGHQGWTAMP VTGSPQWYTLVTSLILGVGIGCLAQPQLVVRFMTVESSKQLNRGVFIGCFFLIITVGAIY HAGALSNLFFLKTEGVVATEAVKDMDKIIPYFINKAMPDWFAALFMLCILSASMSTLSSQ FHTMGASVGSDIYGTYKPRSRGKLTNVIRLGVLFSILVSYIICYMLPNDIIARGTSIFMG ICAAAFLPAYFCALYWRRATRQGVMASLWIGTIGSLFALAFLHQKEAAAMGVCRWLFGKD VLIEAYPFPMIDPILFALPLSVAAVIIVSLLTEKGKK >gi|226332025|gb|ACIB01000031.1| GENE 94 112869 - 112994 81 41 aa, chain - ## HITS:1 COG:no 
KEGG:no NR:gi|265766058|ref|ZP_06094099.1| ## NR: gi|265766058|ref|ZP_06094099.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] # 1 41 1 41 41 72 95.0 6e-12 MFGISDSLIILSYLLSVVCVLFAAWFGFKYWNKDNEKDKTR >gi|226332025|gb|ACIB01000031.1| GENE 95 113148 - 113753 619 201 aa, chain - ## HITS:1 COG:MA0330 KEGG:ns NR:ns ## COG: MA0330 COG0778 # Protein_GI_number: 20089228 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Methanosarcina acetivorans str.C2A # 33 201 3 174 179 152 42.0 4e-37 MVIKKAIYVWVIGILGMISFAACSTASQGEAPSTSNAALDNIFARKSVRAYLDKEVEKEK IDWMLRAGMAAPSGKDIRPWEFVLVTDRVALDSMAAALPYAKMLTQARYAIVVCGDVAQS SYWYLDCSAAAQNILLAAEAQGLGAVWTAAYPYEDRIRVVRKYTELPGNIVPLCVIPFGY PATTQEPKQKFDEKKIHYDKF >gi|226332025|gb|ACIB01000031.1| GENE 96 113768 - 116518 2412 916 aa, chain - ## HITS:1 COG:VC0390_2 KEGG:ns NR:ns ## COG: VC0390_2 COG1410 # Protein_GI_number: 15640417 # Func_class: E Amino acid transport and metabolism # Function: Methionine synthase I, cobalamin-binding domain # Organism: Vibrio cholerae # 324 915 1 590 899 722 61.0 0 MKKTIQQLVLERILILDGAMGTMIQQYNLREEDFRNERFAHIPGQLKGNNDLLCLTRPDV IRDIHRKYLEAGADIIETNTFSSTTISMADYHVQEYVREMNQAAVKLAREVADEYTVLNP DKPRFVAGSVGPTNKTCSMSPDVNNPAYRAVTYDEMADAYQQQMEAMLESGVDALLIETI FDTLNAKAAILAAERAMKATGVKVPVMLSVTVSDTGGRTLSGQTLEAFLASVQHADIFSV GLNCSFGARQLKPFLEQLAARAPYYISAYPNAGLPNSLGKYDQTPADMAHEVKEYVHEGL INIIGGCCGTTDAYIAEYPALIAGAKPHIPVCKPDCMWLSGLELLEVKPEINFVNVGERC NVAGSRKFLRLINEKKYDEALSIARKQVEDGALIIDVNMDDGLLDAKEEMTTFLNLVASE PEIARVPVMIDSSKWEVIEAGLKCLQGKSIVNSISLKEGEEKFLEHARTVRQYGAAVVVM AFDEKGQADTATRKIEVCERAYHLLVDKIGFNPHDIIFDPNVLAVATGIEEHNNYAVDFI EATAWIKKNLPGAHISGGVSNLSFSFRGNNYIREAMHAVFLYHAIQKGMDMGIVNPGTSV LYTDIPADVLERIEDVVLNRRSDAAERLIELADRLKEASAGNTSAGQPVKHDAWRDGTVE ERLQYALVKGIGDFLEEDLAEALPKYDKAVDVIEGPLMNGMNHVGELFGAGKMFLPQVVK TARTMKKAVAILQPIIESEKVEGTASAGKVLLATVKGDVHDIGKNIVSVVMACNGYDIID LGVMVPAESIVQKAIEEKVDMIGLSGLITPSLEEMVHVAMELEKAGLDIPLLIGGATTSK LHTALKIAPVYHAPVVHLKDASQNAGVAARLMSPKSKEELAKELSGEYEALRDKSGMMKR 
ETVSLKEAQENRLKLF >gi|226332025|gb|ACIB01000031.1| GENE 97 116538 - 116990 611 150 aa, chain - ## HITS:1 COG:TM0254 KEGG:ns NR:ns ## COG: TM0254 COG0691 # Protein_GI_number: 15644629 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: tmRNA-binding protein # Organism: Thermotoga maritima # 10 149 16 155 158 138 50.0 5e-33 MKLAPINIKNKRATFDYELIDTYTAGIVLTGTEIKSIRLGKASLVDTFCYFAKGELWVKN MHIAEYFYGSYNNHAARRDRKLLLSKKELNKLERGTKDAGFTIVPVRLFINERGLAKVVV ALAKGKKQYDKREALKEKDDRRDMDRMFKR >gi|226332025|gb|ACIB01000031.1| GENE 98 117000 - 117542 396 180 aa, chain - ## HITS:1 COG:no KEGG:BF3201 NR:ns ## KEGG: BF3201 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 180 1 180 180 330 100.0 1e-89 MSLISSPAKAWEEIRLEDRRAVLTVFVYPMIGLCGLSVFIGALWTNGWGGPQSFQLAMTQ CCAVAVALFGGYFLAAYAINQMGIKMFGMTNDIPLAQQFAGYALVVTFLLHIVTGLLPDF SIIGWLLQFYIVYVVWEGARVVMLVEEKNRLRYTIFSSILLILCPAVIQVVFNKLTAILN >gi|226332025|gb|ACIB01000031.1| GENE 99 117574 - 118374 788 266 aa, chain - ## HITS:1 COG:no KEGG:BF3042 NR:ns ## KEGG: BF3042 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 266 1 267 267 444 98.0 1e-123 MKQLKLMVLTLTLLMGTMFTSCMDSGESGPQQWAGVVKVNDRMGYVTFTDAAGTELIPTN TIPVTLNARMAYIDCQVDEGQDLSTNPKSIKITLLADPTGIDATAITTPKVESSDVTTNA PVGSLSFASGYSTVAPFQFSENTIVLPVLYRVKNVTTTEDIKNELAKHTFTLVCYTDDIK SGDTILKLYLRYKVEDEPAAIAERATRTPSFKAYEISQILREYTLKSGQTKPAKITIVAQ QNEYNNKLEDTSTIEKVYEIEYKTAE >gi|226332025|gb|ACIB01000031.1| GENE 100 118457 - 118957 528 166 aa, chain - ## HITS:1 COG:no KEGG:BF3204 NR:ns ## KEGG: BF3204 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 166 1 166 166 284 100.0 7e-76 MIEVTDASLQKAAGEGMDEFIQVFTDKYKEVIGGELTAETMPLLTGEQHSLLAYQIFRDE VMFGGFCQLIQNGYGGYIFDNPFAKVMRLWGAEDFSKLVYKAKKIYDAHRHDLEKERTED EFMAMYEQYEAFDDLEEEYLDIEEEVTALVASYVDDHLELFAKIVK >gi|226332025|gb|ACIB01000031.1| GENE 101 119496 - 119900 195 134 aa, chain - ## 
HITS:1 COG:no KEGG:BF3205 NR:ns ## KEGG: BF3205 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 233 100.0 2e-60 MNLNEVDIHYLIAAISVITSALVFYTIGVWGERLQKRLKFWHLVFFLLGLLADSVGTALM ENIARLTHLHDEIHTVTGIIAILLMFIHAMWAIWTYVKGSERAKEHFNRFSIVVWCIWLI PYCIGVYLGMSLHH >gi|226332025|gb|ACIB01000031.1| GENE 102 120140 - 120841 764 233 aa, chain + ## HITS:1 COG:CAC2565 KEGG:ns NR:ns ## COG: CAC2565 COG0822 # Protein_GI_number: 15895825 # Func_class: C Energy production and conversion # Function: NifU homolog involved in Fe-S cluster formation # Organism: Clostridium acetobutylicum # 1 233 1 230 230 373 80.0 1e-103 MTYSHEVEHMCVVKKGPNHGPAPIPEEGKWVKSKEIVDISGLTHGVGWCAPQQGACKLTL NVKEGIIQEALVETIGCSGMTHSAAMAAEILPGKTILEALNTDLVCDAINTAMRELFLQI VYGRTQSAFSEGGLIIGAGLEDLGKGLRSQVGTLYGTLAKGPRYLEMAEGYIKTIALDKN DEICGYEFVHMGKFMDEIKKGTDANEALKKVTGTYGRFTAEQGAVKHIDPRHE >gi|226332025|gb|ACIB01000031.1| GENE 103 120862 - 121872 1205 336 aa, chain + ## HITS:1 COG:no KEGG:BF3207 NR:ns ## KEGG: BF3207 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 336 1 336 336 652 99.0 0 MIREVKFESQDRRIKGIIEALNANGIKDIEEANAICEAAGVDPYKTCEETQPICFENAKW AYVVGAAIAIKKGCKNAADAAEAIGIGLQAFCIPGSVADDRKVGIGHGNLAAMLLHEETK CFAFLAGHESFAAAEGAIKIAAKADKVRKEPLRCILNGLGKDAAQIISRINGFTYVQTQF DYFTGELKVVREIAYSDGPRAKVKCYGADDVREGVAIMWKEGVDVSITGNSTNPTRFQHP VAGTYKKERVLAGKPYFSVASGGGTGRTLHPDNMAAGPASYGMTDTMGRMHSDAQFAGSS SVPAHVEMMGFLGIGNNPMVGCTVACAVDVAQALAK >gi|226332025|gb|ACIB01000031.1| GENE 104 122374 - 123387 744 337 aa, chain + ## HITS:1 COG:aq_1099 KEGG:ns NR:ns ## COG: aq_1099 COG0332 # Protein_GI_number: 15606369 # Func_class: I Lipid transport and metabolism # Function: 3-oxoacyl-[acyl-carrier-protein] synthase III # Organism: Aquifex aeolicus # 8 327 5 309 309 277 40.0 2e-74 MKQINAVITGVGGYVPDYILTNDEISRIVDTTDEWIMGRIGIKERRILNEEGLGTSYMAR KAVKQLMQRTQSNPDDIDLVIVATTTPDYRLPSTASILCERVGLKNAFAFDMQAVCSGFL 
YALETGANFIRSGKYKKVIIVGADKMSSVIDYTDRATCPIFADGAAAFLLEPTTDHLGVI DSVLRTDGKGLPFLHMKAGGSVCSPSYFTVDNHMHYLHQEGRTVFKYAVANMSDACESII ERNQLTKDEIDWVVPHQANQRIISAVAQRLDVPLEKVMINIEHYGNTSAGTLPLCIWDFE NKLKKGDNLIFTAFGAGFAWGAVYVKWGYDGKTNNAC >gi|226332025|gb|ACIB01000031.1| GENE 105 123494 - 124939 1261 481 aa, chain + ## HITS:1 COG:SP1382 KEGG:ns NR:ns ## COG: SP1382 COG0366 # Protein_GI_number: 15901236 # Func_class: G Carbohydrate transport and metabolism # Function: Glycosidases # Organism: Streptococcus pneumoniae TIGR4 # 1 477 1 479 484 482 49.0 1e-136 MENGVMMQYFEWNLPNDGNLWKQLKEDASHLHEIGVTAVWIPPAYKADEQQDEGYATYDL YDLGEFDQKGTVRTKYGTKEELKEMIDELHKNHISVYLDVVLNHKAGGDFTEKFIVVEVD PNDRTQALGKPFEIQGWTGYSFHGRKDKYSDFKWHWYHFSGTGFDDAKKRSGIFQIQGEG KAWSEGVDNENGNYDFLLCNDIDLDHPEVVTELNRWGKWVSKELNLDGMRLDAIKHMKDK FIAQFLDAVRSERGDKFYAVGEYWNGDLNTLDAYIKSVGHKVNLFDVPLHYNLFQASQEG KNYDLQNILKNTLVEHHCDLAVTFVDNHDSQSGSSLESQIEDWFKPLAYGLILLMKDGYP CLFYGDYYGVKGENSPHTQIINILLDTRRKYAYGDQIEYFDHPSAIGFIRTGDEEHVGSG LVFLMSNDEAGSKKMDLGEEHKGEIWHEITGNIQQEITLDEKGSGEFSVNTRNIAVWIKK N >gi|226332025|gb|ACIB01000031.1| GENE 106 124941 - 125876 648 311 aa, chain + ## HITS:1 COG:alr3393 KEGG:ns NR:ns ## COG: alr3393 COG1295 # Protein_GI_number: 17230885 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Nostoc sp. 
PCC 7120 # 1 297 1 300 314 178 33.0 1e-44 MEIHSERKKRLSLSLLFKIIKDTVWGFIDDSVMRLSASLAYATLFSIIPFLSLLVTVDVF FHMDLANQLYVQLQPIVGPEVTEALRSIIENAENTDPSRSAAFVSLGISIFGATTIFAEI QSSLNSIWGIKAVPKKSWLKFIKNRILSFSIILVFAFILLITFTITNIIGELSQKFIFKY PEVADSLVKVVGIIINMSVTTIIFTLIFKILPDAKIKSKDVCIGAVVTTILLLIGQWGIS FYIGIANVGTVYGAAAFMVVFVTWIYYSSIIIYTGAEFTKAWANEMGSKIFPDEYAVATK TIEIHEDKPIE >gi|226332025|gb|ACIB01000031.1| GENE 107 125890 - 126210 239 106 aa, chain - ## HITS:1 COG:no KEGG:BF3211 NR:ns ## KEGG: BF3211 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 106 28 133 133 201 100.0 8e-51 MLTFTDNFENDKELILRDHLALERTKLANERTLFAYIRMALYLLTVGIGIFQIESISRLD GLAWGCIIAGIFLFFLGFVRFEQMRKHLKQYTKTCRDTENESSRKK >gi|226332025|gb|ACIB01000031.1| GENE 108 126579 - 127415 735 278 aa, chain - ## HITS:1 COG:no KEGG:BF3212 NR:ns ## KEGG: BF3212 # Name: not_defined # Def: putative ferredoxin # Organism: B.fragilis # Pathway: not_defined # 1 278 1 278 278 558 100.0 1e-158 MTANEVHLIYFSPTHTSKQVGEAIVRGTGITNVINTNLTQQATQDLVIAESALAIIVVPV YGGRVAPLAMDRLASVRGSNTPAVIVVVYGNRAYEKSLMELDYWAIQQGFKVIAGATFIG EHSYSTEKYPVAAGRPDERDLAVAADFGKQISDKIASATEPEKLYAVDVRKIRRPRQPFF PLFRFLRKVIALRKSGVPLPRTPWVEDESLCTHCGACAKMCPVSAIAKGDELNTDAERCI KCCACVKGCPQKARVYDTPFAVLLSQCFVKQKDPCTLV >gi|226332025|gb|ACIB01000031.1| GENE 109 127690 - 129756 2111 688 aa, chain + ## HITS:1 COG:AGc5136 KEGG:ns NR:ns ## COG: AGc5136 COG1158 # Protein_GI_number: 15890078 # Func_class: K Transcription # Function: Transcription termination factor # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 320 688 52 421 421 468 64.0 1e-131 MYNIIQLNDKNLSELQAIAQELGIKKTDSLKKEELVYKILDEQAIAGATKKVAADKLKEE RKEDKKKRSRVTVKKENADKVFSSTKNGELTKTDAKTPAAKTQPQPKTTEPTPETAKEAN AETNATPAESVKVTPYATPKKKPGRPRKNQVETEAKPAEETTEKPETVPSAQEEKPAAQP ETEKRPISKPILKPKPAVVDEESSILSDIDADDDFIPIEDLPSEKVELPTELFGKFESTK AEAATAPEPVAQPQRPRVIRPRDNNNNNNYNNNNNNQRNNNQRQPVQQRPMSQQNAAEAA PVQERRVIEREKPYEFDDILTGTGVLEIMQDGYGFLRSSDYNYLSSPDDIYVSQSQIKLF 
GLKTGDVVEGVIRPPKEGEKYFPLVKVSKINGRDAAFVRDRVPFDHLTPLFPDEKFKLCK GGYSDSMSARVVDLFSPIGKGQRALIVAQPKTGKTILMKEIANAIAANHPEVYMIMLLID ERPEEVTDMARSVNAEVIASTFDEPAERHVKIAGIVLEKAKRLVECGHDVVIFLDSITRL ARAYNTVSPASGKVLSGGVDANALHKPKRFFGAARNIENGGSLTIIATALIDTGSKMDEV IFEEFKGTGNMELQLDRNLSNKRIFPAVNIVASSTRRDDLLLDKQTLDRMWILRKYLSDM NPIEAMDFVKDRLEKTKDNDEFLMSMNS >gi|226332025|gb|ACIB01000031.1| GENE 110 129981 - 131738 1480 585 aa, chain + ## HITS:1 COG:MA4377_3 KEGG:ns NR:ns ## COG: MA4377_3 COG0642 # Protein_GI_number: 20093164 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Methanosarcina acetivorans str.C2A # 340 584 13 265 311 152 37.0 2e-36 MSALLDLIDNSLNYSKEAPNDSIIQWGNELAPLLKKQKEYKTLFQLKQLIVTAYASRGDM NMAIDHARRMYKEAKELNSPIGIALSSRAIGDAYLNANMQEPAIESYKEALELLDKIPGS EILEQEILPKFILTLIQTSHMDEVRIYLQKFENLYADNPNPTFHFFICACNAYYNIESGD PEKGKAELDKARKIHEQLNYLYLRSIYNYILAQYYQAVGKYELALQQYERLTKVPKAPAP NKHIGLQLECAQLLTQMGRTEEAYRIYQKANRQKDSLNALSYARQINDLRGMYQIDRMEI RNQIQRNQIILWIIIASIFILMLVLLLIVRIRQESNRLLRSKEELEIARKYAENSIRTKS LFLSNMSHEIRTPLNALSGFSSILTDESIDNDTRYQCNDIIQQNSELLLKLINDVIDLSN LDPGKLTFNFKECDAVNICRNVINTVEKVKQTQAGVSFVTSLDKLTLRTDEARLQQVLIN LLINATKFTTEGSITLTLEKESETIALFTVTDTGCGILREKQDQIFNRFEKLNEGAQGTG LGLSICQLIIEQIGGRIWIDPDYTEGARFRFTHPVRPAKEKEAER >gi|226332025|gb|ACIB01000031.1| GENE 111 131735 - 133690 1283 651 aa, chain + ## HITS:1 COG:AGc3465 KEGG:ns NR:ns ## COG: AGc3465 COG0642 # Protein_GI_number: 15889187 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 422 574 241 395 511 92 33.0 3e-18 MKRLILIIIVCCRALGWCHANTQTETDSLYRVAQSLPHDSTRLEMFKRLAQIEQLTPRCI TFSGLLREEATLQKNDRYNAIAAYLHTVYYYNQNNRDSVKKWLDTMEPYARKSQTWDLYF DALRFQIDLCTYEEQYELAINEANLMYERAPKVNCARGLIGAKQCLGNVYISTERWDEGM KALEAAYQLSLQTDNAVVRISILCQLISITKDQKNNQLLSEYLAKLKETLHHHTSTNPML KEAFYDVYLFCEVYYTYYYLYAGQPEQAHKNLVNAGKFLNGNTFFLYRVLYYDAYAAYFR 
ACKAYDRALAKIDSTIILLQEDFNSNYIHQKLTKADLLAEAGRSAEAIPLYIETLHLKDS IETTVLDKQMQQIKAKYNIDKVALEEERLKSYIQLGTLIVVVIILIILVAFMLRISHVRK ALERSEKETRETTRMAEEANEMKNRFLSNISYHIRIPLNGVVGFSQLIASEPNMPDELRK EYSSIIQKNSEELMRLVNDVLDLSRLEAGMMKFNIQEYGLAELCNEATYMARMHSEGCTV IRLENEIDTDLNIRVDTVRFTQALLSALTYPQKYKEKREIDFKVTLDTEKNFINFRITNS PLADERFTSQEVCIRHEINRLLFEYFGGNYKVQTNPDGKPTILFTFPSGRN >gi|226332025|gb|ACIB01000031.1| GENE 112 133765 - 135087 1280 440 aa, chain + ## HITS:1 COG:PAB0243 KEGG:ns NR:ns ## COG: PAB0243 COG0534 # Protein_GI_number: 14520582 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Pyrococcus abyssi # 5 393 6 397 463 111 24.0 3e-24 MYTNKQIWSVSYPILLSLLAQNVINVTDTAFLGRVSEIALGASAMGGLFYICIFTIAFGF STGSQIVIARRNGEARYGDVGPVMIQGVLFLLVMALLLFGFTKAFGGNIMRLLVSSESIY DATMEFLDWRIFGFFFSFVNVMFRALYIGITRTKVLTINAVVMALTNVVLDYALIFGHFG LPEMGIKGAAIASVIAEAASLLFFLIYTYITVNLKKYGLNRLRSFDPVLLMRILSISCFT MLQYFLSMATWFVFFVAVERLGQRELAIANIVRSIYIVMLIPVNALATTTNSLVSNAIGA GGINYVMPLINKIGRFSFLIMLGLVIITALFPQALLSVYTNETALINESVSSVYVICVAM LIASVANIVFNGISGTGNTQAALMLEAITIAIYGSYIIFIGMWVKAPIEWCFTIEILYYT LLLATSYIYFKKAKWQNKKI >gi|226332025|gb|ACIB01000031.1| GENE 113 135239 - 136558 1790 439 aa, chain + ## HITS:1 COG:BH2484 KEGG:ns NR:ns ## COG: BH2484 COG0541 # Protein_GI_number: 15615047 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Signal recognition particle GTPase # Organism: Bacillus halodurans # 2 428 3 432 451 436 53.0 1e-122 MFDNLSERLERSFKILKGEGKITEINVAETLKDVRKALLDADVNYKVAKGFTDTVKEKAL GQNVLTAVKPSQLMVKIVHDELTQLMGGETVEIDTKGQPAVILMSGLQGSGKTTFSGKLA RMLKTKKNKRPLLVACDVYRPAAIEQLRVLAEQIDVPMYSEIDSKDPVSIAQNAIKEARA KGYDLVIVDTAGRLAVDEQMMNEIAAIKEAIQPNEILFVVDSMTGQDAVNTAKEFNERLD FDGVVLTKLDGDTRGGAALSIRSVVNKPIKFVGTGEKLDAIDQFHPARMADRILGMGDIV SLVERAQEQYDEEEAKRLQKKIAKNQFDFNDFLSQISQIKKMGNLKELASMIPGVGKAIK DIDIDDNAFKSIEAIIYSMTPEERSNPGILNGSRRTRIAKGSGTTIQEVNRLLKQFDQTR KMMKMVTSSKMGKMMPKMK >gi|226332025|gb|ACIB01000031.1| GENE 114 136963 - 138048 999 361 aa, chain + 
## HITS:1 COG:BH1577 KEGG:ns NR:ns ## COG: BH1577 COG0526 # Protein_GI_number: 15614140 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Bacillus halodurans # 225 356 39 166 176 79 28.0 9e-15 MKHILFTLALASIAGIANAQHNGHSYTVEGLINDSTLNGQTLYISRYDDGMKVDSTQVTN GQFKFTGKADIPCFCRIDAGREYANFILEGGNIQVNVLTHNDPKGTPMNEEYTHISDKTS ELVTELRKRHAAIEQQTEDKSEQLRLKKEYADTYWHPTYTRLYKDMFMKNPDNALGEFAI RELAMCALPEEMDTIFAASGPWLKSLSVYHRIEKQFQGMKATAVGQKFTDFSGKTIDGAA SSLSDFVGKGQYTLVDFWASWCGPCRSESPHIAELYNTYKDKGLTVLGVAVWDKPENTKK AIKELNIDWPQIIDTGMTPMDLYGVKGIPFILLFGPDGTIIARDLRGEGMKNKVAEVLNN K >gi|226332025|gb|ACIB01000031.1| GENE 115 138068 - 138949 1048 293 aa, chain + ## HITS:1 COG:lin1397 KEGG:ns NR:ns ## COG: lin1397 COG0190 # Protein_GI_number: 16800465 # Func_class: H Coenzyme transport and metabolism # Function: 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase # Organism: Listeria innocua # 3 289 4 279 284 272 51.0 5e-73 MTLIDGKAISEQVKQEIAAEVAEIVAHGGKRPHLAAILVGHDGGSETYVAAKVKACEVCG FKSSLIRYESDVTEDELLAKVRELNEDDDVDGFIVQLPLPKHISEQKVIETIDYRKDVDG FHPINVGRMSIGLPCYVSATPNGILELLKRYRIETSGKKCVVLGRSNIVGKPMAALMMQK AYPGDATVTVCHSRSKDLVKECREADIIIAALGQPNFVKAEMVKEGAVVIDVGTTRVPDA SKKSGFKLTGDVKFDEVSPKCSFITPVPGGVGPMTIVSLMKNTLLAGKKAIYQ >gi|226332025|gb|ACIB01000031.1| GENE 116 139017 - 140648 1201 543 aa, chain + ## HITS:1 COG:no KEGG:BF3220 NR:ns ## KEGG: BF3220 # Name: not_defined # Def: thiol:disulfide interchange protein # Organism: B.fragilis # Pathway: not_defined # 1 543 1 543 543 1110 99.0 0 MNTIFKTTGLCVFLFACTPWAAAQNFSIDYPSYQKRNTDALEISRIVRNDTATILYMDAY SRPNYWIRLASELSLHGEQSGKNYPVIRSQGFELDKQVYMPASGNVTFTLQFAPIDPQDR TIDFVESNNEEDFRIEGICLDASVPKKKVHCHLAGKMANRPQTSRLILIESNKNTRGTPW ISIPVRNGVFEYDFYTDHEKAYELICWDDLLNGCWYPVTFFAENGTVSFTISAIDARPPY RIETLNPLTTALRLFEAEGEDKFSFDELNARRDTLEKYGRFESEAMQALWKKFDQVKDNP 
EERNKLFRERDRLEESGEAFTEEAKALIKEREQLFRKKVIWETEQASQHPSLVGLFILKM KVENARKDEDVIPYLEVFQKVYASRYPEHPYTREMQMFIDNNQPKTGNRFIDFSAPDLNG NMVQLSEQIRGKVALIDLWASWCGPCRTTSKQLIPIYEKYKDRGFTVIGVAREQNSDIRM REAIRKDGYPWLNLIELNDVQQIWSKYRIPNAAGGTFLVNAQGIILAVNPTAEEVERILQ KEL >gi|226332025|gb|ACIB01000031.1| GENE 117 140661 - 141779 639 372 aa, chain + ## HITS:1 COG:SPy0818 KEGG:ns NR:ns ## COG: SPy0818 COG2843 # Protein_GI_number: 15674859 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) # Organism: Streptococcus pyogenes M1 GAS # 4 330 50 379 430 135 30.0 1e-31 MKHLILLFLFFCSVSCSSNAQEQPVPTSSADTIPAQKITLLFAGDLMQHQAQIDAARTAT GYDYTDYFKLIKEEIGKADIAIGNLEVTLGGKPYRGYPAFSAPDEYLTAIKDAGFNVLIT ANNHCLDRGKKGLERTILMLDSLQIPYAGTYTDSTARASRYPLLLEQNGFRIVLLNYTYG TNGIKVSAPNIVNYIDKDIMARDIETAKALNPDALIACMHWGIEYQSLPNKEQTSLADWL LSRGVTHVIGSHPHVVQPMELRTDTLSGQQNVVLYSLGNFISNMSARKTDGGLLFKLELT KNSIGTSVSNCGYSLIWTARPTLSKKKNYVLYPASIPTDSLSAEERNHLKIFINDTRELF RKHNRGINEYIF >gi|226332025|gb|ACIB01000031.1| GENE 118 142124 - 143152 895 342 aa, chain - ## HITS:1 COG:no KEGG:BF3222 NR:ns ## KEGG: BF3222 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 342 1 342 342 675 99.0 0 MIRFSKMFFYVTVAVLLVWQLPWCYAFLTLKPAKTPFTMYSSVLGDFVITQLDENKQLHR YDTKGNTYTQQQVDSLLPSLYVRQLTADERFPDTICGRAVSPKDIQLTNFTFKSVPSAIN APQTGLYFLMESMSKRVDLKMPEDAFRFTDKGIEFIRMETNRIDEAKSELFTEMLVQKGF AFPASYASGNPTTRKDYDEGYLVLDANHKLFHLKCTKGRPYVKLIQLPEGVLPEYVFITE FRSRRTLGYMVDSQHHFYVINSDGSLVKSALPGFDPTKDELTIFGNMFDWTVKLSTDKDD YYYALDATDYSLIKEYAYKDMRRSVPGLSFTSPDDKFVMPRF >gi|226332025|gb|ACIB01000031.1| GENE 119 143186 - 143857 598 223 aa, chain - ## HITS:1 COG:no KEGG:BF3223 NR:ns ## KEGG: BF3223 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 223 1 223 223 383 100.0 1e-105 MIQAIFYKEWIKTRWYLLLATLFMIGITGYSMLRIGRTIAIQGIDHLWVVMVQKDAIFID LLQFVPLLTGILLAAVQFFPEMQRKCLKLTLHLPYSQKKMVMAMLAFGVLALFTCFATSF 
IIMGVYLPQHFTSELVQRILLSAAPWFLAGFAGYLLVSWICLEPTWKRRVLNLIIAALIF RVYFLAPGAEAYNSFLPCLTLYTLLIASLSWISVVRFKAGKQD >gi|226332025|gb|ACIB01000031.1| GENE 120 143883 - 144770 912 295 aa, chain - ## HITS:1 COG:CAC0866 KEGG:ns NR:ns ## COG: CAC0866 COG1131 # Protein_GI_number: 15894153 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase component # Organism: Clostridium acetobutylicum # 1 294 1 309 313 129 26.0 5e-30 MSAIIECKNLTHYYGQRKIYENLSFEVPQGRILGLLGKNGTGKTTTINILSGYLQPHSGE CRIFGENIQTMDPALRRNIGLLLEGHVQYQFMNITQIEKFYASFYPGQWKKEAYYDLMNK LKVAPGQRISRMSNGQRSQVALGLILAQNPELLILDDFSLGLDPGYRRLFVDYLRDYARS ENKTVFLTSHIIQDMERLIDDCIIMDYGSILIQQPIETLMKGLRKYTCTVPEGYQPQLPV TCYHPAVIRQTLETYSFLPPSDVEGLLKENQVPFTGLQHENVGLEDAFIGLTGKY >gi|226332025|gb|ACIB01000031.1| GENE 121 144798 - 145373 426 191 aa, chain - ## HITS:1 COG:no KEGG:BF3064 NR:ns ## KEGG: BF3064 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 191 1 191 191 385 100.0 1e-106 MKSRNKGTVFRKWSRIIHRDLSFFFAGMILIYAISGIVMNHRDSINPHYTVTRTEYKITE DLSDKSKVNEKVILALLEPLNEAGNFTKYYYPKPDRIKVFLKGGSSLVVNTRTKDAVYEG VKRRPLISSMVQLHFNPGKWWTWFADAFAVSLIVITVSGMVMIKGPKGLWGRGGIELVGG ILIPILFLMCF >gi|226332025|gb|ACIB01000031.1| GENE 122 145357 - 145941 727 194 aa, chain - ## HITS:1 COG:no KEGG:BF3226 NR:ns ## KEGG: BF3226 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 194 1 194 194 333 100.0 2e-90 MKIKFMVAATLIAALVTTTSCGNSNKQSQSEKTELAAPAALSVDNLLAHADSLANREVTI EGICTHTCKHGATKIFLMGSDDTKTIRVEAGPLGSFDTKCINAIVTVTGTLKEQRVDEAY LQNWESKLKAQTEKSHGETAAGCDSEKKARGETANTPEARIADFRAKIAERKAATGKDYL SFYYMEASSYEIAE >gi|226332025|gb|ACIB01000031.1| GENE 123 145963 - 147513 896 516 aa, chain - ## HITS:1 COG:no KEGG:BF3227 NR:ns ## KEGG: BF3227 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 13 516 21 524 524 915 98.0 0 MMRIRIIITLLIALFVESRQEAFGQTADTLSLSDKVIRTASFATGFRGEVWRNPALYYYY 
TPYTWTRLDVNGAYHDKGKASLKQEGDKDTHIGVDVHSFVILSGRDRVFGSAGYRSEKQE NVLWNENIDWRLIAPYVTGDSIGGFLKGETYYFNGGYASESGSWTWGITGGYRASHNYRD KDPRPRNTASDLSFALGAGYRLGAYRLGVSTDFRLYQQKSEISFLADKGSTSVYHMLGLG MDYVRFAGNQTGTKHQGTGWGGSIGILPVDTEKGISATVSVDRMSMDKKLSNANNLTLLN LATTDLKGNVTWMRKLQQSEHLAVKLDAGYTVRKGTENLYGEAGGSSYGALISTSPGMKV TNCQIAASGLWERLLTDKSVWGGAIIPSIIFHRSETDYSAISRFVHLSALESSLRARLQY QKRLLRLTAEANGGYYANLSAEHSLPGLNLAKSASQALLANIDYLSDSYGMVGIRLQGDY PIMKQYNLSLSVQWQAAYYKKSGTTRYVACSLGIFF >gi|226332025|gb|ACIB01000031.1| GENE 124 147552 - 148766 1099 404 aa, chain - ## HITS:1 COG:no KEGG:BF3228 NR:ns ## KEGG: BF3228 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis # Pathway: not_defined # 1 404 1 404 404 781 100.0 0 MKMKKNQLTMAILFATMLGFTACSDDDKVNISTVSITTTVDTTIEGLQLTGGTYTFENIN TSVKTDVAYPLQSIELADGLYNVTFIGKGTYSQNGTSVEVDVQGVQQNVTVSGGSCKLEL TVHVLNTGEPGFVIAEIFIPGTYNEAGKQYNGDQYVRIYNNSDKVLYADGLIFMESQFQT TQKYQSVDPDIMNEAIAVGSVVAVPGSGTDHPVQPGESFILCDNAINHKEANPNSIDLSK ANFEWYIESKQDVDNPAVPNLDIYYCYSKTIWVLNKQGNRAYAIGRLPQGMTKEKYISDY AYNYTYIMQNGTASKPQSKYKFPNEWIIDAVNVGASNEWQWNVTSTGLDMGHTYVGVNNT IAENIGKCVMRKVAYKDGEREVLQDTNNSTVDFTPAATPSLFNK >gi|226332025|gb|ACIB01000031.1| GENE 125 148793 - 151531 2073 912 aa, chain - ## HITS:1 COG:no KEGG:BF3229 NR:ns ## KEGG: BF3229 # Name: not_defined # Def: putative outer membrane receptor protein # Organism: B.fragilis # Pathway: not_defined # 1 912 1 912 912 1835 99.0 0 MIRKIAYTFFSFLICCNVSLAWGQTFAFRGTVLDEQTHKGLDYATIQLFVEKQFAYGGIT DANGHFELLHIHPGTYRIIISYLGYDSTEKEIKVVGNTSDIFYLKPSNMALNEVVVTASE SKRATSASIVDRTAMKHLQPSSFSDLMELVPGGKSADPQMGQANLIRIRETGKTEDISSL GVGFYIDGIFQNTDANLQYMPSSTSAVNATSTMSKGVDMRTIPTDNIEKVEIIRGIPSVA YGNVANGAVIIQRKTSESPLSARFKADKTSKLFSVGKGFRLDGNGRYVLNTDLSYLDSKI DPRNSVKNYTRLTASARLDGKWLWNERNIHWNLSTDYTGSFDDAKRDKDATVKEDSYKSD FSSFKMAGKWNLKFSNHSWIREIHAATSVSWQWEKMRETKSVSLNRPAAIATQTETGESD GIYLPYNYVAQMEIDGKPLYVTVSARTHLAFPLGGLQNRMNLGVEWNYQKNLGKGQVFDV TRPISEGLSTRPRRFKDIPGLQPFAFYAEEVLNLPVKRHKLAFTAGIRLQSLLGLDRKYE 
MQGKIYPDLRLDLQWSLPTSNGWNIAFSGGLGWISRMPTTAQLYPDFKYVDLIQLNYYHN HPDYRRINMMTYKWDNTNYQLEPARNMKWEVRADIGYKGNRLSATYFRERMNNAFDDLTY YKSLAYKLYDPASIDGSALTAPPELSQLTYANEYNLDVYSTQGNGMKVHKEGVEFQFASR RIESLKTRVTVYGAWIKTVYSSDSPKYKASSILLDNKQLKYVGLYQGENGTESQAFNTNF MFDTYIQRLGLTFSTSAQCTWYTNRRNLWNNGVPVSYIDQSGETHLFREEDKNNIQLQHL VEKYSATYFERTTVPFYMDINLKASKRIGKYLNLAFYVNRLLGIYPDYTLRGVLQRRTSE SPYFGMEMNLTF >gi|226332025|gb|ACIB01000031.1| GENE 126 151900 - 152880 589 326 aa, chain - ## HITS:1 COG:no KEGG:BF3230 NR:ns ## KEGG: BF3230 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 326 1 326 326 541 99.0 1e-152 MKAICNKGICVFLFLSLLMSATMVNAQRVITASGKYITKNIKVTRFDQIYLKGSPTIEYT QSPGASKVQIAGSDNLVDLVECRVEGSTLIVNMKSRTNISYGKEGRLKILVSSPMLKSAS LQGSGDIHLGSLKVEGLDVSLTGSGDIVAESITCNGDFSALLQGSGDIDVKGQLRAKSVN LNLQGSGDLKVAGVTGSEISAMLQGSGDLKVGSTNITSTVTAKLSGSGDMDVLDIRANSV SGQLDGSGDMTLSGSACNATLVLNRSGELSARKLDAENVTAHVNGSGEISCTATKTLETN IQGSGEISYKGNPSIRSTGKNHLNRL >gi|226332025|gb|ACIB01000031.1| GENE 127 152982 - 154826 1476 614 aa, chain - ## HITS:1 COG:BH0034 KEGG:ns NR:ns ## COG: BH0034 COG2812 # Protein_GI_number: 15612597 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, gamma/tau subunits # Organism: Bacillus halodurans # 3 407 2 391 564 290 40.0 7e-78 MENYIVSARKYRPSTFESVVGQRALTTTLKNAIATQKLAHAYLFCGPRGVGKTTCARIFA KTINCMNLTADGEACNECESCVAFNEQRSYNIHELDAASNNSVDDIRQLVEQVRIPPQIG KYKVYIIDEVHMLSASAFNAFLKTLEEPPRHAIFILATTEKHKILPTILSRCQIYDFNRI SVDDTVNHLTYVASKEGITAEPEALNVIALKADGGMRDALSIFDQVVSFTGGNITYKSVI ENLNVLDYEYYFRLTDCFLENKVSDALLLFNDVLNKGFDGSHFITGLSSHFRDLLVSKDA ATLQLLEVGAGIRQRYQEQAQKCALPFLYRAMKLCNDCDMNYRASKNKRLLVELTLIQVA QLTVEGDDGSGGRGPKQAIKPVFTQPAAAQQPQVAPIASPSQSMNAATPVAPQAVSQQAG SSPAVNVRPGGAVSPSGAMPDAVRMAQFKEEKKIPVMKKSSLGLSIKHPQKEEEQRGAGV VHTAQMSTQQIEEDFIFNERDLNYYWQEYAGRMPIEQKAIAMRMQNMRLSLLNDTTFEVV VDNEIVAKDFTALIPGIQAYLRGSLKNRKVTMTVRVSEATENVRAVSRVEKFQMMAQKNN ALLQLKEEFGLELY >gi|226332025|gb|ACIB01000031.1| GENE 128 154975 - 155277 455 100 aa, 
chain + ## HITS:1 COG:no KEGG:BF3072 NR:ns ## KEGG: BF3072 # Name: not_defined # Def: putative septum formation initiator-related protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 100 1 100 100 177 100.0 9e-44 MGKLITIWEFIGRHKYWITVVAFGVIIGFLDENSMIRRIGYAREISRLQGEIDKYRAEYE ENTERLNELSTNPEAIEQIAREKYLMKKPNEDIYVFDEEE >gi|226332025|gb|ACIB01000031.1| GENE 129 155274 - 155618 377 114 aa, chain + ## HITS:1 COG:no KEGG:BF3073 NR:ns ## KEGG: BF3073 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 114 1 114 114 195 100.0 4e-49 MKQLIPALFAVGAVMALIGAAVFITGWVYAPYIYTIGAGFVALAQVNTPLRAKSKTLRRL RIQQIFGALALILTGAFMFTTRGNEWIACLTIAAILELYTAFRIPQEEEKELSK >gi|226332025|gb|ACIB01000031.1| GENE 130 155687 - 156094 451 135 aa, chain - ## HITS:1 COG:RSp0211 KEGG:ns NR:ns ## COG: RSp0211 COG4704 # Protein_GI_number: 17548432 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Ralstonia solanacearum # 24 135 34 150 150 75 33.0 2e-14 MKTMIISIIVVLASVAVSGQSLTLTVKDVEHVEGTLYVAIYSSKENFMKKPLFGFRVAVK DRTMTIPCKGIPAGTYAISLFQDENGNGKLDTGSFGRPLEKFGFSNDAEGIMGAPSYEKC CFEFKRDTTVVIHLK >gi|226332025|gb|ACIB01000031.1| GENE 131 156108 - 158015 1299 635 aa, chain - ## HITS:1 COG:no KEGG:BF3075 NR:ns ## KEGG: BF3075 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 635 46 680 680 1238 99.0 0 MKGFYVPAALSLMLLSLPIAAQSVATDTILLATFNDSISLDEVVIKAHKTPRANSRWSDL QPVDLVTVGGSNGDLYRALQTLPGVQLQGESGRLLVRGGNSNETQTYIDGMHVLNPYTTT GTDTPARGRYSTFMFSGVNLASGGQSQEYGEALSAVLPLETKDYSTVNKFGMNVSTVGMG GGGTRAFNRSSLSLNLDYQNLVPYDRVYPSRIDFKRPYRMLSGATQFRYTPNEKTLFKFY VGYDRTDFSNYTDIDHHLFGLGENNIYLNTTFRKRTAFDWNWFIGTAYSFYDRKVKGAVK DRDVWNERQQEFHLKAKFFKLFTSRLRLDMGVETFVRSYRNHYQLETLRDMHQMYPTIYA GFLSSAFYLSENLKTEISLRPEYTSLNRTMNWSPRAAVSYTWNHLLVSVVAGQYTQLPEN DYLIRNISLPSNVCRQVLFSLQYEQGGRFYKAEFYYKNYKKLELSVPDGITPDGYGYSKG IDLYFCDNALWKNFEYRLSYSYNLSKRKYREYTELTVPQYATRHNASLVLKYSVPRLRTI 
FSVTDGVASGRPYHNPELSGLMNDEVKPYHSLDLGITVLAGKKVIVHASATNLLGRKNEY GRIDGEAVRTSSDHFFYLGVYITLGKKVAYDVSNF >gi|226332025|gb|ACIB01000031.1| GENE 132 158154 - 159053 461 299 aa, chain - ## HITS:1 COG:no KEGG:BF3236 NR:ns ## KEGG: BF3236 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 299 1 299 299 555 99.0 1e-157 MGSHWGEKIRRYLQSPYPVAYQRWKVVLISSLVVFLILLVLQPFGISGIRQHKFWILVGF MGVTAVSLSIPMYVFGKLFPKFYKEETWTVWKQIVNLLQILFFIAIGNWIYSTLVFGWGL RWDVFCAFALFTLVIGLFPTVLFILLNQNRLLAIHLKEATEMNLHLQRSVLPAESVETTQ DSPFLLFQGGIRESLELDSKDLLYVESNGNYIRVNYQKAGKNVQCLLRATMKQAEEVTAV CPLVLKCHRAFLVNVRKVVKVNGNSQGYRLLLEGCPEEIPVSRGYSKQVKELIEGISGD >gi|226332025|gb|ACIB01000031.1| GENE 133 159197 - 159802 656 201 aa, chain - ## HITS:1 COG:no KEGG:BF3237 NR:ns ## KEGG: BF3237 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 4 201 1 197 197 289 97.0 4e-77 MKRMNYLINGFAALAFLFLFSQCAGKADNAAAPAASGNANATSGLKIAYVEVDTLLSQYN FCKDLNADMISKEENSRMVLNQKANELRKSQQEFQKKYESNAFISPERAQQEYARLGKLE QDLQALQNKLATEMASENAKNSQILRDSINAFLKEYNKTKGYNLIISNTSFDNLLYADST LNITKEIVDGLNARYTPVAKK >gi|226332025|gb|ACIB01000031.1| GENE 134 159830 - 160063 184 77 aa, chain - ## HITS:1 COG:no KEGG:BF3238 NR:ns ## KEGG: BF3238 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 77 1 77 77 150 100.0 1e-35 MLYTILITLLIVAICLGLLGIKVFFTKGGKFPNGHVSGNKALRERGISCAQSQDREAQKK RRFSIDEIEKALNDSMN >gi|226332025|gb|ACIB01000031.1| GENE 135 160225 - 161691 1296 488 aa, chain + ## HITS:1 COG:VC2279 KEGG:ns NR:ns ## COG: VC2279 COG2195 # Protein_GI_number: 15642277 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Vibrio cholerae # 2 487 51 533 534 473 47.0 1e-133 MEKSELKPAGVFHFFNEICQVPRPSKKEEKMIAYLKAFGEKHNLETKVDEAGNVLIKKPA TPGKENLKTVILQSHVDMVCEKNNDTDHDFLTDPIETEIDGEWMKAKGTTLGADNGIGVA TELAILADDSIEHGPIECLFTVDEETGLTGAFALKEGFMSGEILLNLDSEDEGELYIGCA 
GGIDTVAEFQYENEMTPISHLCFRITVKGLKGGHSGGDIHLGRGNANKILNRFLYQMMTT YQEDFHLYEFNGGNLRNAIPREASAVFSVPEHYKHDIRTALNVFTAEIENELHRVEPDLN ILLETEPHRDWSIDSSTSYRLITSLYGCPHGVYAMSQDIPGLVETSTNLASVKMKPENTI RIETSQRSSILSSRDDIATTVRAVFRLAGAQVNWGEGYPGWKPNPDSEILKVAEESYKRL FGVDAKVKAIHAGLECGLFLDKYPALDMISFGPTLTGVHSPDERMHIPSVDKFWKHLLDV LAHIPAKN >gi|226332025|gb|ACIB01000031.1| GENE 136 161859 - 162893 518 344 aa, chain + ## HITS:1 COG:no KEGG:BF3080 NR:ns ## KEGG: BF3080 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 19 344 19 344 344 668 99.0 0 MLRLFFILLISLSSLFLSAQNRPPFRVVFWNTENFFDTRHDSLKNDMEFLPHSMRHWNHR RYKKKLDNVARTLTAIGEWNFPALIGLCEVENDTVMRDLTLYSPLKEAGYRYVMTHCSDL RGINVALLYQRDRFKLLSYSALSVGNFKGHRPTRDILHVSGLLLTGDTLDIMVAHLPSRS GGVRQSEPYRLYAAQKLKDAADSLINVRPSAKLIIMGDFNDYPTDKSVVQVLQALSPEVS THHDRLYHLLARKAKDRNFGSYKYQGEWGLLDHLIVSGTLLDISGTLFTEEKKANVARLP FLLTKDEKYGGMQPFRTYVGMKYQEGYSDHLPVYVDFETNQSEY >gi|226332025|gb|ACIB01000031.1| GENE 137 162878 - 164047 1028 389 aa, chain - ## HITS:1 COG:BMEI0944 KEGG:ns NR:ns ## COG: BMEI0944 COG0668 # Protein_GI_number: 17987227 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Small-conductance mechanosensitive channel # Organism: Brucella melitensis # 27 386 12 391 408 244 35.0 2e-64 MIDLGSWMNKILIGWGVDPKIANTFDETIIAILMIFIAVGLDYLCQAIFVGGMKRLARKT SYKWDTLMVKHKVIHHLIHILPGILMYMLLPMAFVHGKTLLLVSQKICVIYMIFALLLAL NSSLLMLLDILSAKEKLKDRPMKGFIQVLQVLVFFIGGIVIVAIIVDKSPATLFAGLGAS AAILMLVFKDSILGFVAGIQLSANDMIRPGDWVTIPSTNANGIVEEITLNTVKIQNFDNT ISTVPPYSLVNGSFQNWRGMTESGGRRVMKSIFLDLTTLKFCTPEMLDTFRKEIPLLADY QPEEGVIPTNSQVFRVYVERYLCSLPVVNQDLDLIISQKEATEYGVPIQIYFFSRNKIWK EYERIQSDIFDHFFAMIPKFELKVYQYSD >gi|226332025|gb|ACIB01000031.1| GENE 138 164186 - 164842 654 218 aa, chain - ## HITS:1 COG:TM0295 KEGG:ns NR:ns ## COG: TM0295 COG0176 # Protein_GI_number: 15643064 # Func_class: G Carbohydrate transport and metabolism # Function: Transaldolase # Organism: Thermotoga maritima # 1 218 1 214 218 241 50.0 7e-64 
MKFFIDTANLDQIREAHDLGVLDGVTTNPSLMAKEGIKGVENQRRHYVEICNIVQGDVSA EVIATDYEGMVREGKELAALNPHIVVKVPCIADGIKAIKHFSGKGIRTNCTLVFSTGQAL LAAKAGATYVSPFVGRLDDICEDGVGLVADIVRMYRFYNYPTQVLAASIRSSKHIMECVE AGADVATCPLSAIKGLMNHPLTDAGLKKFLEDYKKVNE >gi|226332025|gb|ACIB01000031.1| GENE 139 164908 - 166725 2020 605 aa, chain - ## HITS:1 COG:SP2146 KEGG:ns NR:ns ## COG: SP2146 COG3669 # Protein_GI_number: 15901959 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Streptococcus pneumoniae TIGR4 # 50 477 10 448 559 303 38.0 5e-82 MKKLILSTALLAAICTAGQAQETNDYYVKHVEFPQGATLEQKVDMAARLVPTPQQLEWQQ MELTAFLHFGINTFTGREWGDGKENPALFNPTDFDAEQWVRSLKEAGFKMAILTAKHHDG FCLWPTKTTGHSVAASPWKDGKGDVVRELRDACDKYGIKFGVYLSPWDRNASCYGDSPKY NEFFIEQLTELLTNYGEVHEVWFDGANGEGPNGKKQEYDWTAILSTIRRLQPRAVTAIMG DDVRWVGNERGLGRETEWSATVLTPGTYARCEEQNKALGVKATSKDLGGRDMLVNAKELF WYPSEVDVSIRPGWFYHQQEDNQVKSLKHLTDIYFKSVGYNSVLLLNIPPDQRGRISDAD VNRLKEFADYRKEIFADNRVKGGLKAWTARPGDTRVYQLKPKSEINVVMLREDISKGQRM EAFTVEALTADGWKEIAKGTTVGYKRLIRIPAVEARQLRVKVDACRLAANISEVAAYYAR PLEESAAKEDWNDLPRTAWKQVTAAPLVIDLGKAVDMTGFVYAPANAEAKPTMAFRYKFY ISTNGRDWKEVPTIGEFSNIMHNPVPQTVSFGNKVSARYIKLDATTPDATPARVDLKEIG IRLQK >gi|226332025|gb|ACIB01000031.1| GENE 140 166775 - 169837 2993 1020 aa, chain - ## HITS:1 COG:TM1193 KEGG:ns NR:ns ## COG: TM1193 COG3250 # Protein_GI_number: 15643949 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Thermotoga maritima # 24 1017 2 984 1087 666 39.0 0 MNKPIKLTFLLMLACLMLLPVNAQNTPEWQSQYAVGLNKLAPHTYVWPYASATDVKKGDY EQSPYYLNLNGKWKFHWVKNPDLRPKDFYKPSFYTGGWADINVPGNWERQGYGTAIYVNE TYEFDDKMFNFKKNPPLVPYKENEVGSYRRTFTVPAGWKGRRVVLCCEGVISFYYVWVNG HFLGYNQGSKTAAEWDITDQLEEGENTIALEVYRWSSGSYLECQDMWRLSGIERDVYLYS TPKQYIADYKVNATLEKERYKDGIFGLDVTVGGPADGVASVSYTLNDPLGRPVLSGEMPV KSRGLSNFITFGEQRLKDVKRWSAEHPNLYTLVLELKNAGGQVTEVTGCEVGFRTSEIKD GRFCINGVPVLVKGTNRHEHSQLGRTVSKELMEQDIRLMKLYNINTVRNSHYPTDPYWYR LCDRYGLYMIDEANIESHGMGYGPASLAKDSTWLTAHMDRTHRMYERSKNHPAIVIWSLG 
NEAGNGINFERTYDWLKSVEKSRPVQYERAEQNYNTDIYCRMYRSVDEIKAYLAQKDIYR PFILCEYVHAMGNSVGGLKEYWDVFENNPMAQGGCVWDWVDQSFREIDSNGRWYWSYGGD YGPKGIPSFGNFCCNGLVSADRVPHPHLLEVKKIYQNIKCTLINKNNLTVRVKNWFDFSN LNEYILHWQVVGDNGKLLAEGNKEVNCAPHATADVTLGKVALPANVREGYLNLSWTRKEA SPMVGTDWEVAYDQFVLPGTKGSTAYLPAKAGQTAFTVDKETGALNSLTLDGQELLATPV TLSLFRPATDNDNRDRNGAYLWRKAGLNQLTQKVVSLKDGKKAATAKVEILNAKGMKVGD ADFAYSLNSAGALKVKVTFRPDTAVVKSMARLGLTFEMNDTYGNVAYLGRGDNETYSDRM QSGKIALYQTTAERMFHYYVTPQSTGNRTDVRWMKLTDETGQGIFVDSNRPFQFSVIPFA DDVLEKARHINDLERNGHVTVHLDAEQAGVGTATCGPGVQPQYRVPVTEQSFEFTLRTVK >gi|226332025|gb|ACIB01000031.1| GENE 141 169915 - 172230 1533 771 aa, chain - ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 20 624 26 612 757 372 35.0 1e-102 MKHIIYILLLAFCPLATLQADNLTPIVSVVPCPVQMVPGTGNFLFSGSTVLRVENAEQAR VARNFVNLFTRAAGFTPVLKTGKGKGDVCFVTDPLLKSEAYRLSVSPEQISIEASDAKGF FYALQTIRQLLPPDIENSRTVNAMWTVPCLTIQDEPRFGYRGLMLDVSRFFIPKESVLRI IDCMAMLKINKFHFHLVDDNGWRLEIKKYPRLTEIGAWRVDHTDVPFHSRRNPLPGEPTP VGGFYTQEDMKEMIAYAADRQVEIIPEIEMPAHTNSSLAAYPQLACPVVDQFIGVLPGLG GDHASIIYCAGNDSVFTFLQNVIDEVAALFPSRYIHLGGDEAQKTYWKKCPLCQERMKKE HLAHEEDLQGYFMKRIGEYVRSKGKEVMGWDELTNSFIPEGAVIFGWQGMGNAALKAADR GHRFVMTPARVMYLIRYQGPQWFEPLTYFGNNTLKDVYQYEPVQKNWKPEYASLLMGVQA SLWTEFCNRPEDVDYLVFPRLAALAEVGWSRPEQKNWDLFLKAMDRYNEHLDVKGIGYAR SMYNIQHTSTPVDGALQIKLECIRPDVEIRYTTDGSEPEATSTLYTRPLEFHTAQTLKCA TFAAGRQMGKTLVLPLLHNKATAKPIFTQGASGASVLTNGVRGSLKQTDFEWCSWSKSDC ISFTLDLQKEENIHTFTLGSITVYGMAVHKPESISVALSSDNVNFTEVGEKHFTPEEIFR EGTYVEDLKFDLGTAKARYVRVTARGVGKCPPDHVRPGQEARIFFDEIIVE >gi|226332025|gb|ACIB01000031.1| GENE 142 172275 - 173861 1659 528 aa, chain - ## HITS:1 COG:PM0598 KEGG:ns NR:ns ## COG: PM0598 COG3119 # Protein_GI_number: 15602463 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Pasteurella multocida # 51 519 1 456 467 137 24.0 
5e-32 MMNNLPSGILYSLTGAAAVASLTSCATGKQKEEQKPLNIVYIMTDDHTAQMMSCYDTRYI ETPNLDRIARDGVRFTNSFVANSLSGPSRACMITGKHSCANKFYDNTTCVFDSAQQTFPK LLQKAGYQTALVGKWHLESLPSGFNYWEIVPGQGDYYNPDFITQDNDTVQKHGYITNLIT DDAIDWMENKRDESKPFCLLIHHKAIHRNWMADTCNLALYEDKTFPLPDNFFDDYEGRPA AAAQEMSIVKDMDMIYDLKMLRPDKDSRLKSLYQKFLGRMDEGQRAAWDKFYGPVIDDFY KQNLSGKELADWKFQRYMRDYMKTVKSLDDNVGRVLDYLEKKGLLDNTLVVYTSDQGFYM GEHGWFDKRFMYEESMRTPLIMRMPKGFDRRGDITEMVQNIDYAPTFLELAGAPVPADIQ GVSLLPLLKGEQPKDWRNALYYHFYEYPAEHMVKRHYGIRTERYKLIHFYNDINWWELYD MQADPTEMHNLYGQKEYEPVVKELKEQMLKLQEQYNDPVRFSPERDKE >gi|226332025|gb|ACIB01000031.1| GENE 143 174273 - 175739 1372 488 aa, chain + ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 10 462 7 462 497 171 31.0 4e-42 MNQKLLFSSALLVGIAGTQQALAQKKKVQDQKRPNVVFILADDLGFGDLSCYGQEKFETP NIDKLAQEGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTAIRGNVELDPEGQFPLPADAQT IFHDFQNAGYKTGAFGKWGLGFIGSTGDPKKQGIDEFYGYNCQLLAHSYYPDHLWDNDKR VELKDNTLDVQYGKGTYSQDLIHSKALDFLDRMGKSGESFCMWYPTIIPHAELIVPEDSI IKKFRGKYPEKPFHGTEPGNPAFRKGGYCSQFYPHATFAAMVYRLDVYVGQIVQKLKEMG VYDNTIIIFASDNGPHMEGGADPDFFNSNGIWRGYKRDLYEGGIRVPMIISWPGRVQPST QTDFMCSFWDVMPTFREILNPKAKNQQMDGVSLLPLLENRKGQKEHEYLYFEFQEMNGRQ AVRKGPWKLVHMNVRGKNPYYELYNLNSDPSERHNVLNQYPEKVTELKAIMQSSHIPNPN FPLLPGEK >gi|226332025|gb|ACIB01000031.1| GENE 144 175876 - 177525 1685 549 aa, chain - ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 31 527 28 514 757 375 40.0 1e-104 MNKKLLSRLAPGLFAVVLFTACRPAATVKGNLDVIPQPQEIVLARDTTPFIIDRSTTIVY PATNEKMHRTADFLATFIKEMTGTEVRVSDKEKSSNAIILAVDSTMGHPEGYKLQITPEK VLLTGGSEAGVFYGIQTIHKALPILKDGKVAAALPAGTVTDFPRFRYRGFMIDVGRHFFP VSYLKQMIDLMALHNINYFHWHLTEDQGWRIEIKKYPKLTEIGSKRDSTIIDWETKKFDG 
KPHSGFYTQDEAREIVRYAADRFITVVPEIDLPGHTTAALASYPELGCTGGPYKVLCSFG VFPDVLCAGNDQTLQFTKDVLDEIMDIFPSEYIHIGGDECPKSRWEKCPKCQAKIKELGI KALPKHSKENQLQTYFMSELEKEINAHGRRMLGWDEVLEGGLTPNSTIMSWRGIQGGIEA ARQHHDVIMTPIQRLYFSNPRINKMTGFEWMNRVYNFEPVPAELTDAEKKFVIGTQGCIW TEWTADSTKMEWQILPRMAALSEIQWTLPEHKNFERFMERLPEMLKIYSSLDYGYREDVF AADTLKTHK >gi|226332025|gb|ACIB01000031.1| GENE 145 177703 - 180627 1853 974 aa, chain + ## HITS:1 COG:BH3443 KEGG:ns NR:ns ## COG: BH3443 COG2207 # Protein_GI_number: 15616005 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Bacillus halodurans # 810 966 45 199 207 71 27.0 9e-12 MKNNPYTGFLTWLTVLFTVCCLPLKASHYYYKQISLKEGLPSTVRCVYTEPKGFVWIGTN AGLGRFDGQKLRKYVHRQEDVHSLPHNYIHQITEDIQHNIWILTDGGIAQYRRSSDDFAI PLDDRGHPILAYSACLTEQGVIFGGRNRIYRYDYDSRSIKLLLDFSSDPYFAISAISRWD EETLLCCSRWQGLRLINLRSDERRLPPFDCGKEIMALLIDSHNRIWLAPYNEGLRCFNPE GRLLASYTTDNSGLSNNVVLSMAERDSHIWVGTDGGGINIIHPDSHRITVLEHIPGDNYS LPVNSILSLYNDNYNNMWAGSIRKGLINIREVSMKTYTDVFPGSTQGLSDPTVLSLYQDE PNGRIWIGTDGGGVNSLDPVTEEFRHDRSTWGDKVVSITGFTRESILLSVFSKGLFVYNK ENGKRKPLPIDHPDLKQYIYYSGMAVNIYRDEPGSVLLLAGHTYRYDIGSQKIRVVNEEE GMEIAGSMNAIAHNERFTYLHDSRTLYELDRTGNRLKKLFSCTGDTLLYSVSMDEKGDFW IGSNTGLGQYSIRTRQYHPLITSLFGEASSVICDHRGKVWIGADHMLFAWMLQSRKFILF GESDGVIPNEYLAKPRLVSGKGEVYMGGVNGLLCIDNRFPATSSNYPEVVLTDVRVNGEP ATNRTAGNPDKLTLPQDSRAITLRVMSHEEDIFRKKRYRYRIDGLNEEPIESYDPELVIR SLPAGNYRIQAACSTQNGDWTPFHPILSLTILPPWYRSGWFIICLLLFVSGGITAIIFAI LRRRKNRLKWELKERELQEYEEKIRFLVNVSNELLPSFTEKGERELQIVELIRNRLRNGE KSKAPAEIASSPNIVKEELSQPDETFLRKLNQLITDHLDSPELDVTFLCTEMGLSRASLY NKLKAMTNMGANDYINKFRMEKAIQLISTTDLTFTEIAEKIGFTTSRYFSTSFKQYTGET PTQYKEKIRKSSKV >gi|226332025|gb|ACIB01000031.1| GENE 146 180703 - 181755 1125 350 aa, chain - ## HITS:1 COG:all3735 KEGG:ns NR:ns ## COG: all3735 COG1830 # Protein_GI_number: 17231227 # Func_class: G Carbohydrate transport and metabolism # Function: DhnA-type fructose-1,6-bisphosphate aldolase and related enzymes # Organism: Nostoc sp. 
PCC 7120 # 2 350 9 360 360 496 68.0 1e-140 MNKIIELLGNQAEYYLNHTCKTIDKSLIHVPSPDTIDKIWIDSDRNIQTLRSLQTLLGHG RLANTGYVSILPVDQDIEHTAGASFAPNPIYFDPENIVKLAIEGGCNAVASTFGNLGAVA RKYAHKIPFVVKLNHNELLSYPNTYDQVLFGTVKEAWEMGAVAVGATIYFGSEQSRRQLV EIAEAFDYAHELGMATILWCYLRNNEFKKDGIDYHAAADLTGQANRLGVTIKADIVKQKL PTNNGGFKAIHFGKTDERMYTELTTDHPIDLCRYQVANGYMGRVGLINSGGESHGASDLK DAVVTAVVNKRAGGMGLISGRKAFQKPMNEGVELLHAIQDVYLDASVTIA >gi|226332025|gb|ACIB01000031.1| GENE 147 181914 - 182660 851 248 aa, chain - ## HITS:1 COG:STM0772 KEGG:ns NR:ns ## COG: STM0772 COG0588 # Protein_GI_number: 16764136 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoglycerate mutase 1 # Organism: Salmonella typhimurium LT2 # 1 248 3 250 250 290 57.0 1e-78 MKKIVLLRHGESAWNKENRFTGWTDVDLTEKGIAEACKAGELLKENGFNFDKAYTSYLKR AVKTLNCVLDRMDQDWIPVEKSWRLNEKHYGDLQGLNKSETAAKYGDEQVLIWRRSYDIA PNALSEDDPRNPRFENRYQEVPDAELPRTESLKDTIERIMPYWKCIIFPNLKTADEILVV AHGNSLRGIIKHLKHISDEEIVKLNLPTAVPYVFEFSDELNLEKDYFLGDPEEIRKLMEA VANQGKKK >gi|226332025|gb|ACIB01000031.1| GENE 148 182817 - 185159 1729 780 aa, chain + ## HITS:1 COG:no KEGG:BF3092 NR:ns ## KEGG: BF3092 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 780 1 780 780 1634 99.0 0 MRKLFFPLLLFVSGLLSAQTEITLYVSPSGSDHHPGTAEKPMATLEYAWKKASRQAGRRS ITIYCEGTNYLSAPILITNETSGTPEHPIRFSSYPGQKAVISGSRILRNLRWKEYKNGIM QAKVEEELIPDQLFVNGKKQISARYPNFDPDIRIFNGYAADACSPERVKNWSNPAGGYLH AMHSREWGGYQYSIEGKDAKGELILKGGFQNNRQMGMHHTYRMVENIFEELDAEGEWYFD KETHTLYFYPPRELDLQTALFEVPQAENLFILKGKTGSPVRHVSVDHLELTQTLRTFMKT NEPLLRSDWKIYRGGALIIENAEKCSVNGCYLHDIGGNAIFFSNYNRNHRVSQNHITRIG ASAVCFVGSPDAVRSPLFEYGKSQTWEQMDKGTGPLTPDYPSDCLVDDNLIHSIGETEKQ GAGIQLSMSARITIRNNSIYDLPRAGINVSEGTWGGHLIEGNDVFDTVLETGDHGSFNSW GRDRYWHPDRNVMDEFAKEHPQMVFRDATETTVIRNNRWRCDHGWDIDLDDGSSNYHIYN NLCLHGGLKLREGFARTVENNIMVNNTFHPHVWFANSHDIFRHNIVTTPYRPIQVNEWGK ETDTNFFVTKQGLEQAQKRGTDLHSLYGDPLFIAPEKGDYRVKENSPALKTGFRNFDMEH FGVQCPHLKALAATPKLPVFKIPEEKPETVQTYSWKGLTLKEVSTEGERSATGLDKIRGI 
LVVQVEKGITALQANDVILRINGKPVDNRTDMETEIRKSPEGNKFRIIFFRNQKENAVTM >gi|226332025|gb|ACIB01000031.1| GENE 149 185300 - 186754 208 484 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020817|ref|YP_526644.1| ribosomal protein S16 [Saccharophagus degradans 2-40] # 265 472 13 208 318 84 31 3e-15 MEQYTFNIAGGVARNPLVRLAQPVTAQIATGEHIAIVGPNGGGKSLFVDTLLGKYPLREG TLDYDFSPSSTRTVYDNVKYIAFRDTYGAADANYYYQQRWNAHDQEDAPTVREMLGEIKD ERLKEELFELFHIEPLLDKKIILLSSGELRKFQLTKTLLTAPQVLIMDNPFIGLDAPTRE LLFSLLERLTRLSSVQIILVLSMLDDIPSFITHVIPVEDLHVLPKMEREAYLASFCVTDE VEVLDALQQRIAGLPYDGANYDSGEVVKLNKVSIRYDDRTILKELDWTVRRGEKWALSGE NGAGKSTLLSLVCADNPQSYACDISLFGRKRGTGESIWEIKKHIGYVSPEMHRAYLKNLP AIEIVASGLHDSIGLYKRPQESQMAACEWWMDVFGIVALKDKPFLQLSSGEQRLALLARA FVKDPELLILDEPLHGLDTYNRRRVKKIIEAFCRRQDKTMIMVTHYESELPSTITDRLFL KRNR >gi|226332025|gb|ACIB01000031.1| GENE 150 186896 - 188314 1119 472 aa, chain - ## HITS:1 COG:SPBPB10D8.02c KEGG:ns NR:ns ## COG: SPBPB10D8.02c COG3119 # Protein_GI_number: 19111838 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Schizosaccharomyces pombe # 2 454 20 519 554 148 28.0 2e-35 MDDLGYGDIGCYGSEKIETPNIDRLYKDGISFTQHYTGSPVSAPARCVLMTGMHSGHAQI RANDEMAYRGAIMNYDSMYVHPGLEGQYPLKAHTMTLGRMMQQAGYVTGCFGKWGLGAPG TEGTPNKQGFDSFYGYNCQRQAHSYYPAFLYKNEDRVYLANKVLDPHTTKLDAGADPRDE AAYAKFSQKEYANDLIFDELISFVGQNRKKPFFLMWTTPLPHVSLQAPEKWVKYYVGKFG DEAPYIGKAGYMPCRYPHATYAAMISYFDEQIGKLIEKLKKERLYDNTVIMFTSDNGPTF NGGSDSPWFDSGGPFRSEYGWGKCFVHEGGIRIPAIVTWPGKIKPSTQSDHICGFQDVMP TLADIANIACPETDGISFLPALLGETERQKEHEYLYWEYPDPTIGLKAIRMGKWKGIVNN IRKGNSTMELYDLESDLREEHDVAAEHPDIVRKLTRLMEKSHTEPENPKFRF >gi|226332025|gb|ACIB01000031.1| GENE 151 188426 - 189982 1001 518 aa, chain - ## HITS:1 COG:XF0847 KEGG:ns NR:ns ## COG: XF0847 COG3525 # Protein_GI_number: 15837449 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Xylella fastidiosa 9a5c # 90 494 166 578 841 247 38.0 3e-65 
MKYLILILLSLCSYSIDLSGQIIMPTPGKIERADGRLRLQGKIRMYAEESPGSFIRLFYE KLVPESAVEWCKEEVNSHISWKKDVTLPAEGYRIRVTPERIIVGAADDAGFIYAIQSLRQ WNTGEERGLIFPCVEITDFPRVKWRSFMLDSGRQYQKVSTIKKYIDMASMLKMNYFHWHL TEGLGWRIEIKRYPFLTRIGAFVGQGPEQQGFYSQEEVKEIISYAADRGITVVPEIDMPG HAEAALNAYPRLGCFNVAVKVPQSGFTQNIFCAGKDSTLIFLKNVLDEVCRMFPSAYIHL GGDEAPKGNWDKCPDCRSRIEKEKLKDSHDLQLWFSAQMADYLKQKGRKAIFWGDVIYKD GYPLPDNVVIQWWNWRGHRDLALKNAVRHNYPVICGTNYYTYLNFPLTPWKGYTQARTFD LEDVYLRNPSYRPREENPLILGMSSALWTDDGVTESMIDRRVFPRILALAEQMWHSGNPE NFDEFYGKVLSKQLWFEQQGYSFGPALKEDAGTNYKWD >gi|226332025|gb|ACIB01000031.1| GENE 152 190083 - 191948 1428 621 aa, chain - ## HITS:1 COG:no KEGG:BF3257 NR:ns ## KEGG: BF3257 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 621 1 621 621 1294 100.0 0 MKKYTFALFIFCAIAFTGCLNNDFMERYPLGNPTEETAFVTYDNFKAYAWGLYETLPKLG YGDTSTDDISYNQTRGSSESNWIRGLVTVPDKRNDTAWDYYAFIRRVNLMLGRIDSSQMT EAEKAHWKSVGYFFRSYRYFSLLSAYGGVPWIDRVLSDNDTELINGPRASRDEIAKHILE DLQYAEQHINVNGDGNNTINRAVVQAFISRFCLFEGTWRKYHGLNDAETYLKECKRVSAE VMNSFPEICSNYDDLFCSLELKDVPGVILYREFSNAENVIHATSIGGTTGASYYNPTRDL VDSYLCSDGKTRWNSPLYKGDKDMYDEFSCRDHRLWLQVTPPYRIDRSASTDAWGNKWQF TEVAKDRSFIDSLNIRLGIGYGSAKERQKTLPFRQGYDGGILGAVPHFDFYLENQPWYKS AFGYNNWKYYCTYLSMGSQRNEETDMPLFRVEEVMLNYAEAMCELGEFDQSVADRTVNKL RSRANVAPMKVAEINDSFDPKRDLGNPAYPGDYAVNPLLWEIRRERRIELFSEGFRFDDL RRWKKCHYALKKKLGMYVKASDFPAGTKVTVDGGGTEGYLEFHPAQNHTWPDYYYLNPIP RNERVLNPQLEQNPGWDDGIK >gi|226332025|gb|ACIB01000031.1| GENE 153 191963 - 195400 2819 1145 aa, chain - ## HITS:1 COG:no KEGG:BF3097 NR:ns ## KEGG: BF3097 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1145 1 1145 1145 2288 99.0 0 MKLKIKLRNILSVKIHKIVLLLLALCIYNTAYGQNQRISVYGNNQSLQKVFEQIEEQTQL SITYNQTRLDVNRKIKGNFVDKELSFVMDALLKNTGFTCRYEAEHIVITPVEQKKEKAAA GQATQMKKITGKVTGVTGEPLIGANVLAVGGKQATITNLEGEFSLEVSDNSKLQVTYIGY LPQEVSTNGKTHFVIQLKEDSQLMDEVVVVGFGTQKKINLTGAVTAVNIQETLGDRPITN VTAALQGVVPGLKIEGTTGTPGDNLSYNIRGTTSINGGGPLVLVNNVPMDINMIDPQDIE 
SVSVLKDAASAAIYGARAAFGVILITTKQGKKDMAPKFNYNNNFSFSKALELPQKAGPLE SILAYKEMGWPNDTYVDGKNIPQWETYIRDYMANPSQYPNGYVFDEDGNLFLMRENDMFA DMMDNYGFMQNHSFSVSGGSSRTNYRIGLGYTNEDGILITDKDRYERANLSSFLSVEVNK WLTTQLDIRYANSTQNKVEQGGRNGIWGSAMQLPSYQNISPYEEDGIIYPAETSATYIRY GEPRIVKKTDLRALGRIIISPLKNLKITGEYTYNRVTNYNRMYVNQYKYIGMNFTGVLNN TENTRYALTQGFTNYNAINIFANYDFSIGNHHISVMGGFNQEENHAESQWTERKDVLLSN LPSISGATGTTTATDTFNEYALRGLFYRVNYSYKDRYMLEANGRYDGTSRFPKNNRFGFF PSFSAGWRISEEPFMTATRDVLSNFKFRASWGSIGNQIILLADGSPDNYPYIPEMAPGLT NWLVDGQRPTTLSTPPMVSSAFTWEKVYTLDFGVDFGFFDNRLNGTFDWYRRDTKGMLAP GMDLPWVVGVAAAKQNAANLKTYGWELELNWRDRIKDFNYRIGFNLYDSQSEITKFNNET NLLGTYRKGQKIGEIWGYVTDRFYREDDFNADGTLKEGIPIPKGVGKVYPGDILYKNFDD DASTIWSGKGTADDPGDQRIIGNSTPRFHYGINAGLSWKGFDLSVFLRGVGKRDFWRTDQ IAWPTGTWGSLFKETLDFWTPEHTDAYFPRVYANNGVNTASNRWKQTKYLANAAYLKLQN ITLSYTLPKTWAKQICFDEVKVFFSGENLHTWDHLPEGLESDMLSKGAWEYPFMKKFSLG FNVTF >gi|226332025|gb|ACIB01000031.1| GENE 154 195538 - 196536 366 332 aa, chain - ## HITS:1 COG:SMc04204 KEGG:ns NR:ns ## COG: SMc04204 COG3712 # Protein_GI_number: 15965785 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Sinorhizobium meliloti # 18 287 62 317 354 70 22.0 5e-12 MSDAYKIIHNFMAFIASPDLKNKVWTWLLDPSGKQRKEEALLQIWDEYRPEADAGTRHSL RKFQKRAGLSSARPAYLHRWAHIAAILLIPLISIITAYIYVEHHTQEGRFVECIVPKGEQ KQITLPDGSIITLNSGSIFLYPTQFTGDTRSVYLSGEGHFAVAPNKKLPFVVATNHLDIC VLGTQFNLQAYPFDRRTITTLESGSVAVRKKNQPNNFITLEPNQQLDYENRSGRFNKTDI DASVYSGWTKGEMNFISQSLREILRTLERSYAVSIQLSSDLMESNRINSDLYTIKFKRRD DIFHVLDIVTKTVGGITYKVEKDESISICPLK >gi|226332025|gb|ACIB01000031.1| GENE 155 196627 - 197187 510 186 aa, chain - ## HITS:1 COG:PA3410 KEGG:ns NR:ns ## COG: PA3410 COG1595 # Protein_GI_number: 15598606 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 13 177 2 167 171 59 27.0 4e-09 MKFIRKFPVTDADSLESPQKFEGFFLDYYPRVKGFINGLLQDAEEAEDLSQDIFMSLWQN 
RGNLKQIDNLDAYLFRIARNAVFRYIERSLLFKNYQFRQLSDDNSDLYEIESELNAKELE LIIAIAVERMPSQRRKIYQMSREQGLSNENIARELNISKRTVENHLTQALADIRKILFWV IMATFV >gi|226332025|gb|ACIB01000031.1| GENE 156 197428 - 199428 1703 666 aa, chain - ## HITS:1 COG:TM1845 KEGG:ns NR:ns ## COG: TM1845 COG1523 # Protein_GI_number: 15644588 # Func_class: G Carbohydrate transport and metabolism # Function: Type II secretory pathway, pullulanase PulA and related glycosidases # Organism: Thermotoga maritima # 45 635 229 812 843 495 44.0 1e-139 MKMNYLAFIGVTAATAVVSCTPAKKEYASYELYPVRSGSLTEMEYSPESTKFTLWAPTAD EVRLMLFDSGEGGHAYETIPMEPGEEGTWMATVSKDLMGKFYTFNVKVNDKWMGDTPGIN AKAVGVNGKRAAIINLKSTDPQGWEADQRPPLKSPADAIIYEMHHRDFSVDSTSGIQHKG KFLALTEHGTMNSAKLLTGIDHLIELGVTHVHLLPSYDYASVDETKLDENKYNWGYDPQN YNVPDGSYATDPYDPAVRIREFKQMVQALHKAGIRVVLDVVYNHTYNTDGSNFERTVPGY FYRQKPDGSLANGSACGNETASNRPMMRKYMIESVLHWIKEYHIDGFRFDLMGIHDIETM NEIRKAVSAVDPSIIIYGEGWAAEAPQYPGDSLAMKVNTCKMPGIAAFSDEIRDALRGPF NDNHKGAFLAGIPGEEESIKFGIVGAVSHPQVNNDSVNYSKAPWASQPTQMISYVSCHDD MCLVDRLKSSIPGITPEQLVRLDKLAQTAVLTSQGIPFIYAGEEVMRDKKGVHNSFESPD SINAIDWNRKTTHEDVFAYYKRLISIRKAHPAFRMGDAEMVRKHLEFLPVDGGNLVAFRL KDRANGDTWDNIIVALNARSVPAKLAIPEGKYTVVCRDGVIDERGLGSLYGPEVTVPAQS ALIIHQ >gi|226332025|gb|ACIB01000031.1| GENE 157 199488 - 200054 411 188 aa, chain - ## HITS:1 COG:VC1847 KEGG:ns NR:ns ## COG: VC1847 COG0817 # Protein_GI_number: 15641849 # Func_class: L Replication, recombination and repair # Function: Holliday junction resolvasome, endonuclease subunit # Organism: Vibrio cholerae # 9 160 3 150 173 107 41.0 1e-23 MIQPVKEKIILGIDPGTTIMGYGVLRVCGTRPEMIAMGIIDLRKFGNHYLKLRHIHERVL SIIESYLPDELAIEAPFFGKNVQSMLKLGRAQGVAMAAALSRDIPITEYAPLKIKMAITG NGQASKEQVADMLQRMLHFAKEDMPVFMDATDGLAAAYCHFLQMGRPVMEKGYSGWKDFI AKNPERVK >gi|226332025|gb|ACIB01000031.1| GENE 158 200051 - 200353 320 100 aa, chain - ## HITS:1 COG:no KEGG:BF3263 NR:ns ## KEGG: BF3263 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 100 1 100 100 202 100.0 4e-51 
MLIYNTTFQVDDEVHDNFLIWIKESYIPEVEKHGALRAPRICRVLSHRDEGTSYSLQWEV DDSGVLHRWHQDQGARLNQELVKIFKDKVVGFPTLMEVLE >gi|226332025|gb|ACIB01000031.1| GENE 159 200706 - 202046 609 446 aa, chain - ## HITS:1 COG:no KEGG:BF3264 NR:ns ## KEGG: BF3264 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 446 1 446 446 760 100.0 0 MADWKSVKSISDKYGISVERVIAWVRESQITTSVVGSVVLIDDGSVCELVEKEKRLAHLK TNYEQLCAKFEQRIEAELRADEDAGLKMRLLDDFLPVLHRLLCVMIDKLNTEDRKLFNSA FTAAPLYRIAREMGFESVKDYLIAYRKVTRRLPESSDELIKRLQTDLERSRDSFKEMEHT LLRARQNHEDANRMCSDLRDKEIYYIERCERLKEKNRENEKRVERLKKRLLKWEKCPNVG EAGNTGEDESLQELSDEVGNLHIREMELRAQRRELITRVARLEREMYKYNSTENSFKSRL GEKGAWRKIESEFERCARTYFGRDVEVVSRKEAQEIRQLQIKALVDENYAQSGAWKEMMA ATEAKFGMPGETMPDAGKLENEEVETGTEPAAKDEVEMIIEGIASLDEMGASLDRYKKET SEMIEKYKANEEKTEGFWNRLRSMFR >gi|226332025|gb|ACIB01000031.1| GENE 160 202670 - 203689 1188 339 aa, chain + ## HITS:1 COG:lin1184 KEGG:ns NR:ns ## COG: lin1184 COG0016 # Protein_GI_number: 16800253 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Phenylalanyl-tRNA synthetase alpha subunit # Organism: Listeria innocua # 8 337 1 344 350 315 46.0 8e-86 MIAKINQLLEEVGALKAANAEELEVLRIKYLSKKGAINDLMADFRNVAAEQKKEVGMKLN ELKTKAQEKINALKEQFDNQDNGQDDLDLTRSAYPVELGTRHPLSIVRNEIIDIFARLGF NIAEGPEIEDDWHVFSALNFAEDHPARDMQDTFFIESHPDVLLRTHTSSVQSRVMEVSQP PIRIICPGRVYRNEAISYRAHCFFHQVEALYVDRDVSFTDLKQVLLLFAKEMFGADTKIR LRPSYFPFTEPSAEMDISCNICGGKGCPFCKHTGWVEILGCGMVDPNVLDANGIDSKVYS GYALGMGIERITNLKYQVKDLRMFSENDTRFLKEFEAAY >gi|226332025|gb|ACIB01000031.1| GENE 161 203801 - 204997 1325 398 aa, chain + ## HITS:1 COG:ECs3121 KEGG:ns NR:ns ## COG: ECs3121 COG0477 # Protein_GI_number: 15832375 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli O157:H7 # 3 388 2 381 396 133 25.0 6e-31 
MSKDRLVTPSYVLILAANFLLYFAFYLILPVLPFYLTEIFQTGNAAVGVILSCYTVASLC IRPFSGYLLDTLARKPLYLIAYFLFITVFAGYILAGVLSLFIMLRVVHGLAFGMVTVAGN TIVIDITPSSRRGEAVGYYGLMNNTAMSFGPMIGLFMHDTCSFETIFLCSLLSGTVGFAA ACLVKTPVKEPVKREPISLDRFVLIKGIPAGVALLLLSIPYGMTSTYIAMYAKEIGITLS SGLFFTFMAVGMAVSRMFSGRQVDKGRITQVITLGLYLVCACFFTLAACGKLMSLSPELT DVLFFMVALLLGVGFGTMFPAFNTLFVNLAPNSQRGTATSTYLTSWDVGIGIGLILGGYI AQLTSFDMAYFFGACLTVVSTFYFNLKVAPHYHKNKLR >gi|226332025|gb|ACIB01000031.1| GENE 162 204994 - 205671 728 225 aa, chain + ## HITS:1 COG:CAC0689 KEGG:ns NR:ns ## COG: CAC0689 COG0177 # Protein_GI_number: 15893977 # Func_class: L Replication, recombination and repair # Function: Predicted EndoIII-related endonuclease # Organism: Clostridium acetobutylicum # 3 214 2 211 211 196 47.0 2e-50 MTKKERYEKVIAWFQENVPVAETELHYNNPYELLIAVILSAQCTDKRVNMITPRIYQDFP TPEALAATTPEVIFEYIRSVSYPNNKSKHLVGMARMLVNDFNSEVPDTLEELIKLPGVGR KTANVIQSVVFNKAAMAVDTHVFRVSHRIGLVGNSCTTPFSVEKELMKNIPDELIPIAHH WLILHGRYVCQARTPKCETCGLQLMCKYYCEKYKVSKDTPKRKNK >gi|226332025|gb|ACIB01000031.1| GENE 163 205761 - 207020 1558 419 aa, chain + ## HITS:1 COG:all4131 KEGG:ns NR:ns ## COG: all4131 COG0126 # Protein_GI_number: 17231623 # Func_class: G Carbohydrate transport and metabolism # Function: 3-phosphoglycerate kinase # Organism: Nostoc sp. 
PCC 7120 # 8 419 13 399 400 387 51.0 1e-107 MQTIDNFNFAGKKAFVRVDFNVPLDENFNITDDTRIRAALPTLKKILADGGSIIIGSHLG RPKGVADKFSLKHIVKHVSELLGVEVQFANDCMGEEAAVKAAALQPGEVLLLENLRFYAE EEGKPRGLAEDATDEEKAAAKKAVKESQKEFTKKLASYADCYVNDAFGTAHRAHASTALI AKYFDTNNKMFGYLMEKEVKAVDKILNDIKRPFTAIMGGSKVSSKIEIIENLLNKVDNLI ITGGMTYTFTKAMGGQIGLSICEDDKLDLARELMQKAKDKGVNLVLAVDAKIADAFSNDA NTKFCPVNEIPDGWEGLDIGPKSEEIFAEVIKNSKTILWNGPTGVFEFENFTHGSRSVGE AIVEATKNGAFSLVGGGDSVACVNKFGLANGVSYVSTGGGALLEAIEGKVLPGIAAIQE >gi|226332025|gb|ACIB01000031.1| GENE 164 206941 - 208050 537 369 aa, chain + ## HITS:1 COG:AF0088 KEGG:ns NR:ns ## COG: AF0088 COG0715 # Protein_GI_number: 11497708 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components # Organism: Archaeoglobus fulgidus # 82 357 23 299 300 116 27.0 8e-26 MFLQAVEHCSKQSKEKFFRVLPLSRNKTEVNHYKTEKATPLGVAFFKSRKWGESLLFSIL FLFFLSACTGKKEKHSALPELQTLTFGLMPSFDGLPSLVAVRQGIYDSLDIKIDFITYAS ATDRDAAFLSGKLDGMLTDYPGATLLQAKGSRLRLVMETDGSLSLIASKKSNVRNPDDLK GKNISVSGNTFVEYATDEVIKRACLHPGEVNKPEINNIPLRLMMLEDGQISASFLPGPAT AIALNDGHTALLNTRQMGLRCTGIVFSEKAITEKDEEIRRFITGYNLGVKYLQTRPQNEW TEILVKEFGLQEKAAMQIDLPDYRPATRPSYHDIEKIIAWLKSKGAIPDYYPGENLVDTT FIPGTLKPQ >gi|226332025|gb|ACIB01000031.1| GENE 165 208067 - 209104 1025 345 aa, chain + ## HITS:1 COG:no KEGG:BF3109 NR:ns ## KEGG: BF3109 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 345 1 345 345 705 99.0 0 MKNTFLLTFALILLAGSPLKAQEKEEAATLNKVVNTLKERITLAGYAQLGYTYDDAAKKM NTFDIKRIIFMAHGKITDRWTCDFMYDFYNGGMLLEVYTDYRILPGLKVRIGEFKVPYTI ENELSPTTVELINCYSQSVCYLAGVSGSDVACGMTSGRDIGAMVHGGLLNDLLCYKLAIM NGQGLNIKDKNNQKDIIGNLMVNPLKWLSVGGSFIKGTGHAIADSEITGIRAGENYTKNR WSIGGVITTTPFSLRSEYLAGKDGGVKSDGFYATGCYRMLRNFDLVASYDYFNANKAVSR KQTNYIAGLQYWFYPKCRLQAQYTFCDRNKGKDSNLFQAQVQVRF >gi|226332025|gb|ACIB01000031.1| GENE 166 209128 - 210318 987 396 aa, chain + ## HITS:1 COG:BS_yxaH KEGG:ns NR:ns ## COG: BS_yxaH COG2311 # 
Protein_GI_number: 16081049 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Bacillus subtilis # 9 390 10 391 402 92 25.0 1e-18 MEHKISEPNARVDVADVLRGLAVMGIIILHSIEHFNFYSFPDTVPCEWMKFTDKAIWNGL FFVFSNKAYAIFALLFGFSFYIQDNNQQRRGKDFRLRFIWRMALLFIIGQFNAAFFTGEI LTMYAMLGLILPLSCRLSDRSIAIFATLLIIQPIDWCKVIYALCNPDYVAGPSLASHYFG IAFDVQKNGTFLETIRMNLWEGQLANLTWALEHGRILQTPALFLFGMLVGRKGLFLYSEQ NEQHWLKALGISLICFFPLYGLNNMLPEFITRNAIRVPLQLIISSLSNLSFMVLLVSGLL ITFYRIKDRRFLMRFTSYGKMSLTNYIGQSVIGSLLFYHWGFELGRFLGITYSFFFGILF VLLQMAFCSWWLSHFKHGPFEGLWKRLTWIGNNKRK >gi|226332025|gb|ACIB01000031.1| GENE 167 210322 - 212520 1931 732 aa, chain + ## HITS:1 COG:MJ0798 KEGG:ns NR:ns ## COG: MJ0798 COG0457 # Protein_GI_number: 15668984 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Methanococcus jannaschii # 513 697 115 302 334 71 29.0 8e-12 MNEQLINEQYQYILRLIGQKRLKEALTQLESFLWKCPEWSLRTRLERIQTSYSYMLQYMR QGVEDPERRKLYQKLLTDTLEITDQARITLLDSVSNHYYHQYRTRLSEEVSPLTLEMLMH TLESFNDDLAVSGFVSDQNMEEVLKRHEDSLKTLFLQTWTHTNWTAEEVAAAQAMLQSEL LPVNDLCLFTSAVTLSVMECFDLKKLLWLIDAYRHPNVQVSQRALVGITFILHAYSPRIS FYPEINLRITALMEETAFERDLLRIHIQILLSQETEKIDKKMREEIIPEMLKSMSPMRNM KFGFEESDEEKDDTNPDWADAIEKSGLGDKLREMNELQLEGADVYMSTFSQLKSYPFFRD ISNWFYPFDKQQSDVIKEFRHRGKEGGSLLEIILQSGFFCNSDKYSLFFTMQQLPQSQRD MMLNQLTDQQIEELADQSKAETLKKFSERPDTVSNQYLHDLYRFFKLYARRLEFRDLFKE SICLYNEPDLIDILFNPEAMEAIANFHFKKKHWEEAAEAYDTIVDMLMDEEEAEICQKQG YSLQKLKRYDEAIKAYRKADILKSNNVWTNHHLGTCYRLNRQFKEALKYYRKVEEVLPED TKVIFYIGSCLAEMWNFEEALNYFFKLDFLESNCVKAWRAIGWCSFMIAKREQAMKYYDK VIDQSPMPVDYLNAGHVAWSLGDVEKTLSLYTKAAELYGSKELFLEVFRKDEEIITSKGV TPDDIPLVLDLI >gi|226332025|gb|ACIB01000031.1| GENE 168 212766 - 213278 429 170 aa, chain + ## HITS:1 COG:no KEGG:BF3274 NR:ns ## KEGG: BF3274 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 170 1 169 169 302 99.0 4e-81 MKTITFFKRVLASAALVSVCGFACATNNGKQFIHNDTMEGGKLVCREIYAMNDAASGILN 
PVKMYKYSYDTDQQKTVKSTYAWNIFKNTWETESRTVISRYETETSVEYSVWNKEKGSFD LSKKYIYIITDNNNQLIAQYAYKMNSRTNQWILEKDALTPIYENIYATTR >gi|226332025|gb|ACIB01000031.1| GENE 169 213603 - 214442 630 279 aa, chain - ## HITS:1 COG:TM0024 KEGG:ns NR:ns ## COG: TM0024 COG2273 # Protein_GI_number: 15642799 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucanase/Beta-glucan synthetase # Organism: Thermotoga maritima # 42 273 213 455 642 121 36.0 2e-27 MKRFLVLSCSVAISALFFHACSSSDDSMIDAPLTRADAVSEKIVFQDDFNQADSIPDRNK WSLCKKGSPAWSKYLSESYDQAYVHDGKLVLVAEKVNGVYKTGGVQSLGKAEFQYGKIEI CARFTKTAKGGWPAIWMMPAKPVYSGWPACGEIDIMEQLNHDGIVYQTIHSHYKNDLGFT KPVPTKTVPYNKGQFNIFGIEWTPEALTFKVNGATTLVYPNLHLADESVKKQWPFDTSFY LILNYALGGPGTWPGTITDSELPAKMEIDWVKVSQPTGR >gi|226332025|gb|ACIB01000031.1| GENE 170 214810 - 216330 999 506 aa, chain + ## HITS:1 COG:RSp0641_7 KEGG:ns NR:ns ## COG: RSp0641_7 COG1020 # Protein_GI_number: 17548862 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Non-ribosomal peptide synthetase modules and related proteins # Organism: Ralstonia solanacearum # 23 495 477 951 1054 211 32.0 2e-54 MDMTTSSQKDIINDILSAMQCYPENEAFIIDDKHYTYAQLGEITASITHSLSEIKDEKIG IVAENRIETYAAILAVLAGGKTYVILHPAYPEERNLKIAALAGLRTLLCTSDTDRSAFGT GHFRIIDTDRLPGKTLSEQQSHSSDEERNAYIIFTSGSTGEPKGVPITRANLNAFYRAYS SLDWNLDEHDRMLQMFELTFDVSVVSLLYPLTLGAAVYTVGHQDVKHFKVFELLEKYQLT FATVTPSLLQLLSPYFDEINLPSLKYLGVSAEASQTELLERFRKSAPNATFINLYGPTEA TIYCTCYRIPASDKCKHYNGMVAIGKPFPGIRAIIADEEGNELPQGETGELWVSGRQVMK GYLDDPEKSALVLIHRPDGQIYYRTGDLCILDGDGDIIYCGRKDYQVKIQGFRIELSEIE YTAQSFFKTPCSVAAVPLLCDGICNELHLAVETTECTQSALIEYLKEKLPKYMLPKQIHC ISQFPVTNSNKTDRKKIAELIKEKKL >gi|226332025|gb|ACIB01000031.1| GENE 171 216345 - 216587 299 80 aa, chain + ## HITS:1 COG:no KEGG:BF3277 NR:ns ## KEGG: BF3277 # Name: not_defined # Def: acyl carrier protein # Organism: B.fragilis # Pathway: not_defined # 1 80 1 80 80 147 100.0 1e-34 MNREEVLKQLQSIFRDILKKDNVCIDESSTSKDVDGWDSLTHMQIIAQIEKHFGVRFNFR EVIKFKNVGDLCSALLTKME 
>gi|226332025|gb|ACIB01000031.1| GENE 172 216591 - 217973 1119 460 aa, chain + ## HITS:1 COG:FN1672 KEGG:ns NR:ns ## COG: FN1672 COG1696 # Protein_GI_number: 19704993 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted membrane protein involved in D-alanine export # Organism: Fusobacterium nucleatum # 1 460 1 486 486 181 29.0 2e-45 MQFLSFSFLALFTLCFFLYYAVKGRARNLILLVTSCIFIGWYYLPFLLTAVVVALFTFFW AQWMESRAKAGKKTKPVYIAGIIALIGGWLLLHGTVIDDIIFPLGMSFYTFQAISYLTDV YWQEQRSERNWVDFLIYMLFFMKFLSGPIERGGDLLPQLKDPRPFIYSNAVTGLKYILLG LIKKLLIANQISPQTDVMFHSIHDLSGVQLLMTCLLYPIELYADFSGYTDIAIGGAYMFG IKLSPNFNRPFAARSTADFWRRWHMSLSFWVRDYLYVPLTAGTRNWGQWGIYFSLLITFL ALGLWHGAGLTFAIYGLIQGVLICWEMKTAAFRNNLPQYIGKYAADSLLIVRTYLLFALS LIFFRVQSLSDAWYFLRNISFEVHSSWKEMNIGIRDHNCIVAGSALLLVLIYEYYASKYN LMETLERQPAWLRWSVYYLLVFALLMLGKFDTETFIYLQF >gi|226332025|gb|ACIB01000031.1| GENE 173 217992 - 219032 876 346 aa, chain + ## HITS:1 COG:no KEGG:BF3118 NR:ns ## KEGG: BF3118 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 346 1 346 346 685 99.0 0 MLHHSSRISPKTRFCLKLLLLILPLIPIVVVYFMFDPYRVLHPYKRFDDSPMLLNEAHVG WQNYLQNRDSIAYNSFILGNSCTMAFLTGEWEKYLDKNDHAVRFYDNGESLGGVRQKLQL LDSVGAPLKNILIVLDKKSLDKNAPLSGNNHLFSAEAAGISQLGFQLRFLQEFLYPDRMI PYIDYLIRHKYAPYMKGVINPGDPVREPYTNNFINPREKEIAQDGEIYWSRHEKEFKKRT NAGMEELPVIFASQIQVLRSIKKICDKHHTNLKFIIGPDYYQKKTSREDIKILKAILGDS AVWDFTGINEYTADIHHYYEPGHYRPLLGARLLKAIYQDQDTCHRQ >gi|226332025|gb|ACIB01000031.1| GENE 174 219137 - 219529 273 130 aa, chain + ## HITS:1 COG:no KEGG:BF3280 NR:ns ## KEGG: BF3280 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 130 1 130 130 231 100.0 7e-60 MKKILLLLSVCISFSSCCTLFSSSKQDITFTGMNGTKIYEASTKQKIAEIKEDNSVTVQI KKKREDKQLVAKKEGYQTTPFVLESTFNNACLWNILFWPGFLVDLGTQKMNKWDNTIINI EMEKENTENK >gi|226332025|gb|ACIB01000031.1| GENE 175 219722 - 220303 552 193 aa, chain - ## HITS:1 COG:BS_maf KEGG:ns NR:ns ## COG: BS_maf COG0424 # Protein_GI_number: 16079857 # 
Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Nucleotide-binding protein implicated in inhibition of septum formation # Organism: Bacillus subtilis # 10 193 5 185 189 130 41.0 1e-30 MLANLDRYKIVLASNSPRRKELMTGLGVDYVVKTLPDVDESYPDTLQGEEIPLFIAREKA AAYQSMIGPEELLITADTIVWHEGKALGKPVGRQDAIEMLRSLSGKSHQVITGVCLTTRE WQKCFAAVTDVRFAILDEDEIAYYVDHYQPMDKAGSYGVQEWIGFVGVESISGSYFNVMG LPIQKLYRELKQL >gi|226332025|gb|ACIB01000031.1| GENE 176 220335 - 220856 589 173 aa, chain - ## HITS:1 COG:FN0213 KEGG:ns NR:ns ## COG: FN0213 COG1778 # Protein_GI_number: 19703558 # Func_class: R General function prediction only # Function: Low specificity phosphatase (HAD superfamily) # Organism: Fusobacterium nucleatum # 8 165 1 158 168 116 37.0 2e-26 MSTINYDLTRIKALAFDVDGVLSANVIPLHPSGEPMRTVNVKDGYAIQLAVKKGLRIAII TGGRSDVVRKRFIGLGVSDLYFGSAVKIHDYRGFRDKHGLTDEEILYMGDDVPDMEVMRE CGLPCCPKDAVPEVKAIARYISYADGGYGCGRDVVEQVLKAQGQWLSDDAFGW >gi|226332025|gb|ACIB01000031.1| GENE 177 220877 - 221668 854 263 aa, chain - ## HITS:1 COG:no KEGG:BF3283 NR:ns ## KEGG: BF3283 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 263 1 263 263 507 99.0 1e-142 MKRSIEDTPIVFIGAGNLATNLAKALYRKGFRIVQVYSRTEESARELAQKVEAEYTTDLA EVNPYAKLYIVSLKDSAFAELLQGIVEGKREEALMVHTAGSIPMNVWEGHVSHYGVFYPM QTFSKQREVDFKEIPFFIEASSAEDAAFLKAIASTLSNRVYDADSEQRKSLHLAAVFTCN FTNHMYALAAELLKKYNLPFDVMLPLIDETARKVHELEPKTAQTGPAIRYDENVIGNHLR MLADDPAMQRLYELLSRSIHERQ >gi|226332025|gb|ACIB01000031.1| GENE 178 221665 - 221997 204 110 aa, chain - ## HITS:1 COG:no KEGG:BF3284 NR:ns ## KEGG: BF3284 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 110 1 110 110 140 100.0 2e-32 MSAVKANNKDTYKATAVTLIVMGVLYLIDKLIHFSTLGIPWVMNKDNLLLYTAVIFLFIK RDKSVGLVLLGLWLVLNIGLIVSLLGSMSGYLLPLALLIIGGILYLISTR >gi|226332025|gb|ACIB01000031.1| GENE 179 222000 - 222536 554 178 aa, chain - ## HITS:1 COG:CAC3555 KEGG:ns NR:ns ## COG: CAC3555 COG0778 # Protein_GI_number: 15896791 # 
Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Clostridium acetobutylicum # 6 175 3 172 174 152 44.0 3e-37 MENFSELIKNRRSMRKFTGEELSQEDVVALLKAALMAPTSKRSNSWQFIAVDDKKLLSEL SHCKEQASAFIADAALAIVVTADPLASDVWIEDASIASIMIQLQAEDLGLGSCWVQVRER YTATGMPSDEFVRGVLDIPLQLQVLSVIAIGHKGMERKPFNEDHLQWEKIHINKFGGK >gi|226332025|gb|ACIB01000031.1| GENE 180 222703 - 223245 546 180 aa, chain - ## HITS:1 COG:BS_ytiB KEGG:ns NR:ns ## COG: BS_ytiB COG0288 # Protein_GI_number: 16080121 # Func_class: P Inorganic ion transport and metabolism # Function: Carbonic anhydrase # Organism: Bacillus subtilis # 1 176 3 178 187 213 56.0 2e-55 MLEEILEFNKKFVENRGYEKYITNKYPDKKIAILSCMDTRLTELLPAALGIHNGDVKIIK NAGAVISHPFGSVIRSLLVAIIELGVEEVMVIAHSDCGACHMNSDEMIAHMKKRGIKSET IDMIRYCGVDFNSWLGGFDDPVKSVRGTVRSIENHPLIPKDVRVHGFIIDSLTGELTRVE >gi|226332025|gb|ACIB01000031.1| GENE 181 223363 - 224346 646 327 aa, chain + ## HITS:1 COG:no KEGG:BF3287 NR:ns ## KEGG: BF3287 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 327 1 327 327 662 100.0 0 MPLKLTTYYQGSEIPDFPGTNTFHSKELFQIYEETPGYTPLLIMASEDGKPVARLLAAIR KSIRIFPPAFIKRCEIYGTGEYLIAEADKERIFGEMLEHLTTEALRNSFLIEFRNLENAM FGYKYFRSNRYFPVNWLRVRNSLHGIENAEQRFSPSRIRQIKKGLKNGAKVDEARTVEEI REFSNMLRHLYSSRIRKHFPNIIFFQHMDTRLIHSRQAKIFIVRYKDKIIGGSACIYSGN DAYLWFSGGMRKTYALQYPGILAVWKALQDAHQNGFRHMEFMDAGLPFKKHGYRDFVLRF GGKQSSTRRWFRFRWQWLNDLLIKIYV >gi|226332025|gb|ACIB01000031.1| GENE 182 224579 - 224983 498 134 aa, chain + ## HITS:1 COG:PH0272 KEGG:ns NR:ns ## COG: PH0272 COG0346 # Protein_GI_number: 14590197 # Func_class: E Amino acid transport and metabolism # Function: Lactoylglutathione lyase and related lyases # Organism: Pyrococcus horikoshii # 6 133 8 133 136 121 53.0 4e-28 MKISHIEHLGIAVKSIEEALPYYENVLGLKCYNIETVEDQKVRTAFLKVGDTKIELLEPT CPESTIAKFIENKGAGVHHVAFAIEDGVANALAEAESKEIRLIDKAPRKGAEGLNIAFLH PKSTLGVLTELCEH >gi|226332025|gb|ACIB01000031.1| GENE 183 225109 - 226662 1796 517 aa, chain + ## HITS:1 COG:RC0960 KEGG:ns 
NR:ns ## COG: RC0960 COG4799 # Protein_GI_number: 15892883 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) # Organism: Rickettsia conorii # 11 517 12 514 514 649 61.0 0 MSNQLEKVKELIELRAQARLGGGEKAIEKQHAKGKYTARERIAQLLDEGSFEELDMFVQH RCTNFGQEKKHFLGDGVVTGYGTIEGRLVYVFAQDFTVFGGSLSETMAQKICKVMDMAMK MGAPVIGINDSGGARIQEGINALSGYAEIFQRNIMASGVIPQISGIFGPCAGGAVYSPAL TDFTLMTEGTSYMFLTGPAVVKTVTGEDVSQEDLGGASVHASKSGVTHFTAETGEEGLAI IRKLLSFIPQNNLEEAPLVNCTDPIDRMDDLLNEIIPDSPNKPYDMYEVIGAIIDNGEFL EVQKDYAKNLIIGFARMNGQSVGVVANQPKYLAGVLDSNASRKGARFVRFCDAFNIPLVT LVDVPGFLPGTGQEYNGVILHGAKLLYAYGEATVPKVTVTLRKSYGGSHIVMSCKQLRGD MNYAWPTAEIAVMGGAGAVAVLYAKEAKDQENPAQFLADKEAEYTKLFANPYNAAKYGYI DDVIEPRNTRFRVIRALQQLQTKKLTNPAKKHGNIPL >gi|226332025|gb|ACIB01000031.1| GENE 184 226683 - 227597 960 304 aa, chain + ## HITS:1 COG:no KEGG:BF3129 NR:ns ## KEGG: BF3129 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 304 1 304 304 614 100.0 1e-174 MNKKRIGIFIALLAMVCMGLRAQSATSLRINEVLVVNDQNYQDDYGLHNAWIEIFNTSFA SVNLEGCFLTNDKNNPTKYPIPKGDVLTLIKPRQHALFWADGMPNRGTFHVNFTLDPNKE NYIALYDSNGKTLIDEVTIPAGQLADRSYAREKDGSANWVVKGEGEHSYVTPSTNNMTID KNPKIENFKKHDSIGIGMAIIAMSVVFIGLVLLYLSFKAVGNVAVRLGKKNAMKATGITD KTEAKEKNLGSHTGEETAAIAMALHEYLNDAHDVEDMILTINKVKRTYSPWSSKIYTLRQ TPKR >gi|226332025|gb|ACIB01000031.1| GENE 185 227623 - 228054 535 143 aa, chain + ## HITS:1 COG:FN0200 KEGG:ns NR:ns ## COG: FN0200 COG0511 # Protein_GI_number: 19703545 # Func_class: I Lipid transport and metabolism # Function: Biotin carboxyl carrier protein # Organism: Fusobacterium nucleatum # 27 143 5 134 134 67 42.0 6e-12 MKQYKYKINGNLYNVTVNDVEDNIANVEVNGTSYKVELDKPVKAAPKPVTRPAAAPKTET GAPVVTKQPTASKKDGVKSPLPGVILDIKVKEGDTVKRGQTIIILEAMKMENNINANKDG KVAEIKVNKGDSVLEGTDLVIIE >gi|226332025|gb|ACIB01000031.1| GENE 186 228057 - 229217 1370 386 aa, chain + ## HITS:1 COG:TM0880 KEGG:ns NR:ns ## COG: TM0880 COG1883 # Protein_GI_number: 
15643642 # Func_class: C Energy production and conversion # Function: Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit # Organism: Thermotoga maritima # 12 385 17 383 384 347 53.0 2e-95 MGDFVSFLGNNLVDFWGYTGFANATPGHLFMLLIGLFFIYLAVAKEFEPMLLIPIGFGIL IGNIPFNMEAGLKVGIYEEGSVLNILYQGVTSGWYPPLIFLGIGAMTDFSALISNPKLML IGAAAQFGIFGAYIIALLIGFEPNQAGAIGIIGGADGPTAIFLSSKLAPNLMGAIAVSAY SYMALVPVIQPPIMRALTTKHERLIRMKPPRIVSHTEKVIFPIVGLLLTCFLVPSGLPLL GMLFFGNLLKESGVTRRLANTASGPLIDVITILLGLTVGASTQASQFLTWDSILIFILGA FSFIIATASGVMFVKFFNLFLKKGNKINPLIGNAGVSAVPDSARISQVIGLEYDPTNYLL MHAMGPNVAGVIGSAVAAGILLGFLM >gi|226332025|gb|ACIB01000031.1| GENE 187 229365 - 230696 1097 443 aa, chain + ## HITS:1 COG:no KEGG:BF3293 NR:ns ## KEGG: BF3293 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 443 1 443 443 810 100.0 0 MRTRTMKPLFLIFCLCLSPLLMGQNQNKIQEAMANYDYETAVELINKEPPTIPLLMQKAK ALKGIGSNAEALRSIRQVIAEDSTNQQAHIEAAECCKAMAKYNDALDYYRKAISLNPKNK YARLQFINLLCNTKNYAEAFGESSMMSETDSSAVVLHLQAQSLEGMQQILPAIGCYEVIQ DKYPDDYLAAAKLGNLNIVAGYPEYAIQATERYRERDSTNLIVNQQNALAYCHAEQYPTA IKRYEKLCQQGDSSTQTLYYLGVSYYADEYYYEAHACFSKLQKEMEENPNLLYYLGRCCA KTSWKKEGIEHLEKAIELTIPKDSTMIRLYKGLVDCCKLAQDTPKQIQALRELYKYDKTN HKLLYDIAWNYSYQLKDNKSAERYLQAFLKTRKANARKEEPVSEKGELVLGLENYYNAAE NWLKDLQKEKFFKEGIPLESQKQ >gi|226332025|gb|ACIB01000031.1| GENE 188 230711 - 232003 1058 430 aa, chain + ## HITS:1 COG:no KEGG:BF3133 NR:ns ## KEGG: BF3133 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 430 1 430 430 796 99.0 0 MKHWILLLYLGFSVVLQAQNTASVQEAMANYDYKTVIRLIDEESASPQLLIQKAKALKGL GRTAEALSTLQHIIIELPENQQALVEAAECCRQLSKFNEALGYYRKVMELNPEHIYAHLQ YTRLLYNYQRYGDALRESIALARKDSSATVLRLMAESMEGAGMPVESMFCYLSIIRKYPS DYLSVAKLGSIFNTMKDYEGAIALTEAYRRTDSTNVEVNRQNALAYCLRKEYPTAIKRYQ DLTARGDSTLLTCYYLGVSYYATKEYYKARDWLLKAKSCQSPGANLLYYLGRSCSKTLWK QEGIGYLNQAIALTIPADSIMERLYSGLADCYRQTKQTRDQIKATKEQYKYAPDKHILLY NLAFLSDKIEDTKATEHYLQAFLRTKSKQNAVLQQPDDDEEEPAPYIDYYKSATNKLESI 
RKEKFLKGEK >gi|226332025|gb|ACIB01000031.1| GENE 189 232529 - 234379 1821 616 aa, chain + ## HITS:1 COG:BH2927 KEGG:ns NR:ns ## COG: BH2927 COG0366 # Protein_GI_number: 15615490 # Func_class: G Carbohydrate transport and metabolism # Function: Glycosidases # Organism: Bacillus halodurans # 127 615 136 578 578 167 26.0 4e-41 MKKLLLLLGSFLLSLTTYAAMNISKIDPPCWFTGMNNPELQLMVYGEGIGQASVSVNYPG VSLSSIVKLESNNYLLVYLHLDKEVKPGKMPITFTVGKKKLVKEYELKARSKAGVDHKGF DASDALYLLMPDRFANGNPDNDRIEGMAEYKVDRNDPNARHGGDLAGIEQNLDYFTDLGV TALWFTPVLENNMKGGSYHGYATTDYYKVDPRFGTNEEYRSLIAKAHNRGIKVVMDMIFN HCGVEHPWIKDMPSKDWFNHADFKNNFVQTSYKLTPHVDPYASEYDFDQMNNGWFVEAMP DLNQKNPHVYKYLLQNSLWWIEYADIDGIRMDTYPYADYDAMSNWMKELNEEYPNYNTVG ETWVTEPAYTAWWQKDSKLSAPRNSHLKTVMDFSFFDKVNTAKNEQTDTWFKGLDRVYNN FVYDYLYPNPASVLAFIENHDTDRFLGEGENLDMLKQASTLLLTTRRIPQLYYGTEVMMN GVKSKSDGYVRKDFPGGWADDKENALTPEGRTRLQNESYNFYRNLLNWRKGNDVIAKGSM KQFMVQHGVYAYARQYKGKTVFVLLNGTDKEVKLPLKYYAEVLKDKTQGKDVISGKVTAL NEELTMAPRQSMVIEL >gi|226332025|gb|ACIB01000031.1| GENE 190 234537 - 234977 403 146 aa, chain + ## HITS:1 COG:no KEGG:BF3135 NR:ns ## KEGG: BF3135 # Name: not_defined # Def: MarR family transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 146 1 146 146 253 98.0 1e-66 MTNNESARELVLQTLRTRMAFRQAVQRVLKRHNVDMTFEMLQVMNCLWNKQGISQQSLAE KTAKDKACLTNLINNLEKKNWVIRKEGPSDRRNRLIFLTPQGEELALTVKPLINDIYAQT GAEMEASRITECIEDLKRLHEVLNEI Prediction of potential genes in microbial genomes Time: Tue May 17 23:02:33 2011 Seq name: gi|226332024|gb|ACIB01000032.1| Bacteroides sp. 3_2_5 cont1.32, whole genome shotgun sequence Length of sequence - 70763 bp Number of predicted genes - 47, with homology - 47 Number of transcription units - 25, operones - 12 average op.length - 2.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 1 - 942 655 ## COG1566 Multidrug resistance efflux pump 2 1 Op 2 . + CDS 947 - 2542 1338 ## BF3137 putative transport-related membrane protein - Term 2547 - 2586 6.8 3 2 Op 1 . 
- CDS 2637 - 3641 1157 ## COG0191 Fructose/tagatose bisphosphate aldolase - Prom 3668 - 3727 2.7 - Term 3666 - 3709 -0.9 4 2 Op 2 . - CDS 3822 - 4841 932 ## BF3300 hypothetical protein - Prom 4861 - 4920 6.6 + Prom 5117 - 5176 4.5 5 3 Tu 1 . + CDS 5221 - 5472 432 ## PROTEIN SUPPORTED gi|53714587|ref|YP_100579.1| 50S ribosomal protein L31 type B + Term 5501 - 5542 5.0 - Term 5195 - 5255 4.2 6 4 Tu 1 . - CDS 5441 - 5686 84 ## BF3302 hypothetical protein - Prom 5719 - 5778 3.9 + Prom 5708 - 5767 7.3 7 5 Tu 1 . + CDS 5869 - 7395 1776 ## BF3303 hypothetical protein + Term 7419 - 7463 7.7 - Term 7451 - 7494 6.5 8 6 Tu 1 . - CDS 7506 - 10367 2329 ## COG0296 1,4-alpha-glucan branching enzyme - Prom 10390 - 10449 4.1 - Term 10386 - 10426 8.2 9 7 Op 1 . - CDS 10472 - 12070 1754 ## BF3144 putative lipoprotein 10 7 Op 2 . - CDS 12106 - 13725 1517 ## BF3306 hypothetical protein 11 7 Op 3 . - CDS 13736 - 16744 3039 ## BF3307 hypothetical protein - Prom 16920 - 16979 7.8 + Prom 16975 - 17034 5.4 12 8 Op 1 4/0.000 + CDS 17060 - 18070 675 ## COG1609 Transcriptional regulators 13 8 Op 2 . + CDS 18091 - 19407 1168 ## COG0477 Permeases of the major facilitator superfamily + Term 19479 - 19525 6.4 14 9 Op 1 . + CDS 19758 - 20672 602 ## COG1554 Trehalose and maltose hydrolases (possible phosphorylases) 15 9 Op 2 . + CDS 20509 - 22065 1312 ## COG1554 Trehalose and maltose hydrolases (possible phosphorylases) + Term 22197 - 22252 -0.8 + Prom 22318 - 22377 2.5 16 10 Op 1 27/0.000 + CDS 22408 - 23469 991 ## COG0845 Membrane-fusion protein + Prom 23491 - 23550 6.8 17 10 Op 2 9/0.000 + CDS 23612 - 26647 2933 ## COG0841 Cation/multidrug efflux pump 18 10 Op 3 . + CDS 26675 - 27949 1440 ## COG1538 Outer membrane protein + Term 27969 - 28021 18.3 + Prom 28073 - 28132 7.5 19 11 Tu 1 . 
+ CDS 28262 - 28513 157 ## BF3314 hypothetical protein + Term 28567 - 28624 12.2 + Prom 28572 - 28631 7.9 20 12 Op 1 3/0.000 + CDS 28694 - 32398 4235 ## COG0046 Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain + Term 32421 - 32460 8.2 + Prom 32473 - 32532 8.6 21 12 Op 2 . + CDS 32570 - 36562 2699 ## COG0642 Signal transduction histidine kinase 22 12 Op 3 7/0.000 + CDS 36590 - 37132 509 ## COG2059 Chromate transport protein ChrA 23 12 Op 4 . + CDS 37158 - 37706 610 ## COG2059 Chromate transport protein ChrA + Term 37926 - 37966 -0.0 24 13 Tu 1 . - CDS 37876 - 39384 1402 ## COG1649 Uncharacterized protein conserved in bacteria - Prom 39405 - 39464 4.4 + Prom 39369 - 39428 7.2 25 14 Tu 1 . + CDS 39554 - 42331 2809 ## COG0178 Excinuclease ATPase subunit - Term 42399 - 42438 -0.1 26 15 Op 1 . - CDS 42500 - 42982 645 ## BF3321 hypothetical protein 27 15 Op 2 . - CDS 42990 - 43220 316 ## BF3322 hypothetical protein 28 15 Op 3 . - CDS 43240 - 44667 1474 ## COG1966 Carbon starvation protein, predicted membrane protein - Prom 44687 - 44746 4.4 + Prom 44651 - 44710 6.9 29 16 Op 1 . + CDS 44945 - 46465 1212 ## COG1649 Uncharacterized protein conserved in bacteria 30 16 Op 2 . + CDS 46558 - 48819 1748 ## COG0642 Signal transduction histidine kinase + Term 49045 - 49071 -1.0 31 17 Tu 1 . - CDS 48824 - 50011 805 ## BF3165 hypothetical protein + Prom 50520 - 50579 4.8 32 18 Op 1 . + CDS 50781 - 51959 1232 ## COG1373 Predicted ATPase (AAA+ superfamily) 33 18 Op 2 . + CDS 51978 - 52538 515 ## BF3167 hypothetical protein 34 18 Op 3 . + CDS 52575 - 56129 3729 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit + Term 56157 - 56200 8.2 + Prom 56240 - 56299 4.9 35 19 Tu 1 . 
+ CDS 56332 - 56835 352 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases - Term 56779 - 56829 -0.9 36 20 Op 1 14/0.000 - CDS 56877 - 57731 816 ## COG2113 ABC-type proline/glycine betaine transport systems, periplasmic components 37 20 Op 2 16/0.000 - CDS 57751 - 58575 916 ## COG4176 ABC-type proline/glycine betaine transport system, permease component 38 20 Op 3 . - CDS 58572 - 59798 1257 ## COG4175 ABC-type proline/glycine betaine transport system, ATPase component - Prom 59933 - 59992 5.2 + Prom 59753 - 59812 4.6 39 21 Tu 1 . + CDS 59836 - 60045 75 ## BF3337 hypothetical protein - Term 59868 - 59921 12.4 40 22 Op 1 . - CDS 60133 - 60477 232 ## COG3695 Predicted methylated DNA-protein cysteine methyltransferase 41 22 Op 2 8/0.000 - CDS 60508 - 63231 2387 ## COG1879 ABC-type sugar transport system, periplasmic component - Prom 63352 - 63411 4.9 - Term 63305 - 63373 3.8 42 22 Op 3 2/0.000 - CDS 63418 - 64308 907 ## COG0524 Sugar kinases, ribokinase family 43 22 Op 4 . - CDS 64340 - 65506 1130 ## COG0738 Fucose permease 44 22 Op 5 . - CDS 65552 - 67420 1745 ## COG1621 Beta-fructosidases (levanase/invertase) - Prom 67465 - 67524 4.9 - Term 67442 - 67479 -0.6 45 23 Tu 1 . - CDS 67545 - 69833 2188 ## BF3178 hypothetical protein - Prom 69953 - 70012 9.3 46 24 Tu 1 . - CDS 70084 - 70440 315 ## BF3344 hypothetical protein - Prom 70534 - 70593 3.1 - Term 70541 - 70586 13.1 47 25 Tu 1 . 
- CDS 70595 - 70762 123 ## BF1194 hypothetical protein Predicted protein(s) >gi|226332024|gb|ACIB01000032.1| GENE 1 1 - 942 655 313 aa, chain + ## HITS:1 COG:mll0995 KEGG:ns NR:ns ## COG: mll0995 COG1566 # Protein_GI_number: 13471111 # Func_class: V Defense mechanisms # Function: Multidrug resistance efflux pump # Organism: Mesorhizobium loti # 2 307 77 381 417 158 36.0 1e-38 YARYEITNDAVVDQYIAPLNIRIPGYIKEVRFTEHQYVHAGDTLLILDDREYRIRLKDAE AALMDALGSKEVLNSGIRTSGVNVAVQEANIAETLAHLSQQEAELNRYTRLLKKQAVSQQ DYELAKANFEATRARYNALLRQKEAAQSQYSETSKRSTGAEANILRKEADLEMARLNLSY TVVTAPYDGYTGRRTLEPGQFVQGGQTLSYLVRGNDKWVTANYKETQIAHIYIGQKVRIK VDAFSGRTFHGTVTAISEATGSKYSLVPTDNSAGNFVKVQQRIPVRIDLNDASPEEMECL RAGMMVETEAVRP >gi|226332024|gb|ACIB01000032.1| GENE 2 947 - 2542 1338 531 aa, chain + ## HITS:1 COG:no KEGG:BF3137 NR:ns ## KEGG: BF3137 # Name: not_defined # Def: putative transport-related membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 531 1 531 531 906 99.0 0 MVKLNLPTRPWVPQWLGIVTMFIVIFPITLLNGAYTGSMVEVSNTLGVLSEDITMAYYSA SVGMAVAYPIVPKIRTIATPKTLLLTDLLLQIFFSLVCAKTGSMDVIMVCSFCMGFLKAF VMLEFIILIRPLFSPKNVRSEFYAYFYPIVFSGGQLSMAITAQLAYHYQWQHMYYFVTIL LLIALLFVICFFRYARRPMHIPFKEMDGRSMFIIATAFLLTLYTFTYGKTLDWFASPKIR VYVFVIPLLIVLFIHRQRTQGKPFVSLKPLFLHKSIIGYGFMVLAMFLTATSSLVTNYMN SIIRVDSIHANSLSLWLLPGYVVGAVICFWWFRWQRWRFRFLISGGMFCYVIYLAILYFG ITPYGTYEMLYLPILFRGVGMMVIFIAFGVFVVEDLDPRLTLSNAFFLISFRSVLAPVLS ASFFNNMLYYLQAKGMNILSENMTLTNPLAEQKYNQALSNALAQGHEFGEAGQLAANSLY STLQQQSLLLALKTLIGYVLILALVIAVIAAFIPFHKTLKVAVVKTGDDMV >gi|226332024|gb|ACIB01000032.1| GENE 3 2637 - 3641 1157 334 aa, chain - ## HITS:1 COG:TP0662 KEGG:ns NR:ns ## COG: TP0662 COG0191 # Protein_GI_number: 15639649 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose/tagatose bisphosphate aldolase # Organism: Treponema pallidum # 1 328 1 328 332 458 70.0 1e-129 MVNYKDLGLVNTRDMFAKAIKGGYAIPAFNFNNMEQMQAIIKAAVETKSPVILQVSKGAR QYANATLLRYMAQGAVEYAKELGCKNPEIVLHLDHGDTFETCKSCIDSGFSSVMIDGSHL 
PYDENVALTKKVVEYAHQFDVTVEGELGVLAGVEDEVSSDHHTYTEPDEVVDFVTKTGCD SLAISIGTSHGAYKFTPEQCHIDPKTGRMVPPPLAFDVLDGVMKELPGFPIVLHGSSSVP EEEVATINQFGGALKAAIGIPEEELRKAAKSAVCKINIDSDSRLAMTAAIRKVFAEKPAE FDPRKYLGPARDNMEKLYKHKIINVLGSDNKLAE >gi|226332024|gb|ACIB01000032.1| GENE 4 3822 - 4841 932 339 aa, chain - ## HITS:1 COG:no KEGG:BF3300 NR:ns ## KEGG: BF3300 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 339 40 378 378 695 99.0 0 MKKLIIAFNFLLLLAVGGRAQDHADRLYSVAFYNLENLFDTIHDTGKNDYEYLPDAAKGW NSEKYRSKLKNLSKVLGELSRDKVPAGPAAIGVAEVENRRVLDDLIRQPELAAGGYRYIH YEGPDKRGIDCALLYDPKQFTPHATALVLSTPFEGDTIHKTRGFLIVGGELAGDKVCLIV NHWPSRGAAEPVRVHAAMQVKALKDSLMRTDPDLKLIIMGDLNDDPMDLSLAVLGAKKHV EDMTEDDLYNPWWVTLEDKGVGTLLYRGKWNLFDQIILSPTLLQATKGLKYDHNEVFMRD YLFQQDGKYKGAPLRTYGGRVWLDGYSDHLPTIIYMRKQ >gi|226332024|gb|ACIB01000032.1| GENE 5 5221 - 5472 432 83 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53714587|ref|YP_100579.1| 50S ribosomal protein L31 type B [Bacteroides fragilis YCH46] # 1 83 1 83 83 171 100 1e-41 MKKGLHPESYRPVVFKDMSNGDMFLSKSTVATKETIEFEGETYPLLKIEISNTSHPFYTG KSTLVDTAGRVDKFMSRYGNRKK >gi|226332024|gb|ACIB01000032.1| GENE 6 5441 - 5686 84 81 aa, chain - ## HITS:1 COG:no KEGG:BF3302 NR:ns ## KEGG: BF3302 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 81 23 103 103 154 100.0 7e-37 MISFSMPEWCMAYPFQYFSIRPLGAINIKQGMDFIPYALGSSFLCGVALGQKNKLPQYWG SLFFINRMIRPIISYGYRSGS >gi|226332024|gb|ACIB01000032.1| GENE 7 5869 - 7395 1776 508 aa, chain + ## HITS:1 COG:no KEGG:BF3303 NR:ns ## KEGG: BF3303 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 508 1 508 508 982 99.0 0 MKRLTILFMLFLTSCMLYAKDRKGNDLSGPIALKGELTEKYQNYTKGTPVVIRGVRKMRI SPDETAGITYAVEIDGLQYPVGAEEAGKIIRLKPATVLEYWQGVYLSQGMYDYYNRKGYR YKMRQELDEECLDYLDKLNEIAYKDAYIQDYVQAIFAKVNPGEIDTNRPERLNVRIIQSP EPDAYMLPNGAMLVSTGLLCTIDSEEELEAIIANEMVHFILDHQVDNVSRAETRAKRAAF WADVLGTVAMAADDTNWMYGYDERVGAIELAASIGTIAALINVRTVNRLGMDYNSKQELQ 
ADRIARDYLAFKGMNPNALSSAINKIKEFYGSVHRYDNLTRYGSYGLLKERLAKLGETES IHSHMFEKMTSDIVTFNAAMYQGDKRYKMAEQLAQKNIDNRVASDHDYVILVKARMAQEN TPESNEACMKLLEKAREIATARNLDINKQEILLLMRMNKQAKAADKLREYLDLLAEYKQQ NDMNTQESEWIGEELDWASKMLSKISLL >gi|226332024|gb|ACIB01000032.1| GENE 8 7506 - 10367 2329 953 aa, chain - ## HITS:1 COG:MA3032 KEGG:ns NR:ns ## COG: MA3032 COG0296 # Protein_GI_number: 20091850 # Func_class: G Carbohydrate transport and metabolism # Function: 1,4-alpha-glucan branching enzyme # Organism: Methanosarcina acetivorans str.C2A # 466 950 124 623 627 132 24.0 2e-30 MKYLTFPFLLLLLPLIGFGCSSEEKETDSLILSSDSEIFFEQGIDFAATSGTRNLSFSSG RPWRISLTTDTDTRRAADWCTVSPSSGTAGDASVTISIQENADYDSRSVKLTLVAGGIEK SFTVSQKQKDALTLTASRFEVGKEGGTVQVEVKANITFEVEIPEVDRSWISQANTRGLVV TNLAFTVAPNEGVAGREGEIVIRSGSLSEKIRITQEGRCDDGLSFRPETPDADRQLTLYF KATKTSPLYGYAGDVYVHTGVVSEGTWMYVPAEWNTNVDKCKMVRVADNIWSITLAPSIR QWFGSNETPVRQLGVVIRSADGSKKGTDGDSFVSVTDHLYKPFEPAAVRYASMPGGLQEG INLIDASTVTLVLYDKDKKGGHKDFAHVVGDFNDWKLSNESNSQMNRDDAAGCWWITLTG LQPTREYAFQYYVGTRAGEILRLADAYSRKILDPDNDKYIPSSTYPDAKEYPKGAVGIAS VFKIQGDSYEWKVKNFRIPDKNNLMIYELLLRDFTATGDLNGAMEKIGYLKSLGFNAVEL MPVQEFDGNDSWGYNPCFYFALDKAYGTDHMYKAFIDKCHEAGMAVLFDVVYNHASGSHP FARLYWDTKNNRTAADNPWFNVKEPHPYGVFHDFNHDSPLVRAFVKRNLKFLLEEYRIDG FRFDMTKGFTQNSSTEATAGNYDASRIAILKDYNETVREVNPEAVVILEHFCDEKEESKL AEEGMQLWRNLNNAYCQSAMGYPSNSDFTPLVTFGTTMPYGGWVGFMESHDEERTAFKQI AYGEGPLKSDINVRMKQLAANASFFFTAPGPKMVWQFGEMGYDVSIEEGGRTGRKPLHWE YLDNEARKGLCNTYAKLLKLRREHSELFNPGSTFSWLVKTANWTGGRFLTLAATNGKRLV VVGNFTAKPIEAITSFPVTGVWTNYLDGTKLHVTSIPTGLTIPAHECRVYINF >gi|226332024|gb|ACIB01000032.1| GENE 9 10472 - 12070 1754 532 aa, chain - ## HITS:1 COG:no KEGG:BF3144 NR:ns ## KEGG: BF3144 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 532 1 532 532 982 99.0 0 MKKIYFYTLLLGLLAFTACEDEKSPVMELQKASAFEPFSQSDFTFNDENAAAEFPEIKWT AADYGVKAVVNYDVTLTNDANAKTVLLGETGTTSLKFTNGQMNTMMAKVGAYPGQTYNFT ITLTSKAYDMTADPASNSITFKATLFDPNAVDWKFAYVAVGYPDWDYTNAYLLGDPDGDG 
VYQGYANFDADGVSYAIIDGSDLTKVLAKDQTAAKKGFYGIKVDAEGKVEQTEPLVWGVV GDATSGGWDKDTQMDYDATTRLWTVTTSLLDKEFKFRSNNNWDSDNYGAVSGKESELEGE LVAGPNNFKVLKASPYVITMNLTNAGKYSYSMVETTIELSSAEMALPGSYQGWDATKDDC YKVQSAARDFIYTGTFYFDADTKFKFWDAGVWIGMMGPISWDEAKNIGTFVLMPSDGDNI KIETAGYYRVGADMKKLTASLTKTGWEIIGDATPGGWDKGTVMNYDPATKLWSVDVTLVA GEMKFRWDGAWTVNLGGSLGALTQDGANMKVTAGAYTIVLNPDAKTATMTKK >gi|226332024|gb|ACIB01000032.1| GENE 10 12106 - 13725 1517 539 aa, chain - ## HITS:1 COG:no KEGG:BF3306 NR:ns ## KEGG: BF3306 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 539 1 539 539 1117 99.0 0 MKTIKFKYLLPCALLGMMLGVASCVNDLDTVPLDKDELVSDVVFGNEPLAYEQSLAKIYA GMAIGGNSGGDSDQDVVGIDGGSQASFLRVLWNMQDLPSDIAHCAWNDPGIPEFNHISWG ASSPWIKGSYYRLFYQINVANAYLRETTEDKLDARGCDASLKASIKTWRAEARFLRALSY EYALDLYRNVPFVDENSPIGSIPPKQIMAADLFNWIEKELTECVEDMLEPTVGYSQDYGH ANKAAAWALLSRLYLNAETYVGQNKYTECITYAKKVIETGYQLEPVYVDMFKADNHLSDE MIFPVRYEGDQTMTWGGMTAFLCWGATATQEEVNAKGAWQGVRAKSSLYNLFLKESGSDA DTRKAMLRTDLTTSLEIIDENTFQNNGIPVTKYFNVNKDGTLPPSKEAYVDFPLFRLGEI YLTYAEAVLRGGQGGDRATALRYVNDLRKRAYSDKTLAPISDSDLTLNFIIDERGREFFF EGQRRTDLVRFGLFTTAGYVWPWKGGTAKGKAVENFYNVFPIPSDDIGSNTNLKQNEGY >gi|226332024|gb|ACIB01000032.1| GENE 11 13736 - 16744 3039 1002 aa, chain - ## HITS:1 COG:no KEGG:BF3307 NR:ns ## KEGG: BF3307 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1002 1 1002 1002 1908 100.0 0 MKQVNLRVCRMILPLLFGLFLSLSAYAQQVSVKGHVKDSTGEPVIGANVVVKDNSSIGTI TDLNGNFVLSVPQNSTLVISFIGYKPVEMKAAPSVTVTLHEDAVMLQEAVVIGYGTVKKN DVTGSVMAIDADKMVKGMATSASDLLVGKAAGVSVITDGGAPGAGATIRVRGGSSMSASN DPLIVIDGVPVDNTEIKGMGNPLSTVHPNDIETFTILKDASATAIYGSRASNGVIIITTK KGQSGRVKVDYSGTFSISTKSNTVDVMKAEDFRNFVIEKFGENSLQANALGKTSTDWQDE IFRTAFSTDHNVSVSGAVPHMPYRVSVAYTNENGILKTSNMQRLTGAINLNPNFFDKKLN IQLNVKGVYNKNRFADRAAIGLATQYDPTQPVYMEGNPYGNGYFMYMKQEGDKASPIDIG LANPVAMLEEKDDKSTVYRSIGNAQIDYKFHFLPELRANLNLGYDVSKSKGDVIIADNSP LTYCTGNFKNGFGENSHYTQLKRNTLLDFYLNYANTFGVNYIDVMAGYSWQHFYNSTTNS 
YPYSAAYAEKTGEEFYKKGDDYASESYLVSFFGRLNYTLLNRYLVTFTLRNDGSSRFSPD NRWGLFPSVALAWKLNEESFLKNVNAISDLKLRLGYGVTGQQNLGNGDYPYMARYMYSKA GANYYFGDTEYSLIAPQPYDQNLKWEETTTWNVGIDYGFLNGRITGTIDYYFRKTKDLLN TVTAPAGTNFSNQLLTNVGTLENKGFEFSINAHAVSTQDWNWNIGYNISYNKNKITKMTF NDDPNYAGVIHGGIDGGTGYNALIHRVGEAFNSFYVFEQIYGPDGKPIEGAYVDQNGDNQ INDADLICFKKAAPDVFMGLTSQLSYKNWDFSFALRGSFGNYVYNNVQSNREAYEGANMY DQTGFLKNRLTSARSTDFKNAQYRSSYYVQNASFVRMDNISLGYTFNKLFNDKQSARVYA TVQNPFVITKYKGLDPEISGEGIDNNIYPRPRVFMIGLNLNF >gi|226332024|gb|ACIB01000032.1| GENE 12 17060 - 18070 675 336 aa, chain + ## HITS:1 COG:YPO0108 KEGG:ns NR:ns ## COG: YPO0108 COG1609 # Protein_GI_number: 16120455 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Yersinia pestis # 7 335 11 337 342 173 32.0 4e-43 MSKPQITIKDIARELGVSPSTVSRALKDNPDISQETRDAIHKYAREHNYKPNVLALNLRT SRSNTIGVIIPQLVHHFFSCVLSGIERTAAEAGYNILVAQSNEEYEREVKIVHSFLAARV CGVITSLAKDTSRYDHYQELLDNNIPIVFYDRICTGINTERVVVDDYAGSFAAVEYMIQT GCKRIFFYSAAPHLEISKNRRNGYMDAMKKYRIPVDQSMIKLCDSREKAIAITPALLERP DHPDGFFAINDETASGILYSCKLTGRKVPDEVSICGFTDGAIAQSTDPKLTTVEQHGEEV GKSTIRLLIDKLEGNDEGKSGNKIVRTNLVVRGTTK >gi|226332024|gb|ACIB01000032.1| GENE 13 18091 - 19407 1168 438 aa, chain + ## HITS:1 COG:NMA2100 KEGG:ns NR:ns ## COG: NMA2100 COG0477 # Protein_GI_number: 15794975 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Neisseria meningitidis Z2491 # 5 432 14 447 451 439 55.0 1e-123 MKVKPDLSFWKLWNISFGFFGVQIAYALQSANISRIFATLGADPHSLSYFWILPPLAGII VQPIIGAASDKTWTRFGRRIPYLFAGSLLAVWVMCLLPNAGSFGMAVGTAMIFGLVALMF LDTSINMAMQPFKMLVGDMVNEKQKGLAYSIQSFLCNAGSLMGYLFPFIFTFIGISNTAD KGTVPDSVIYSFYIGAAILILCVIYTTVKVKEMPPKEFEEFHGITADEKKEKADFISLLK HAPKVFWTVGLVQFFCWAAFMYMWTYTNGAIAATVWGTTDVQSAGYQEAGNWVGVLFAVQ AIGSVIWAVILPLFKARKQAYALSLVLGGIGFISTLFFHNEYLLFISYLLIGCAWAAMLA 
MPFTILTNSVSGKNMGAYLGLFNGTICVPQIAAALVGGGLLHLVGGHQVNMLVLAGVLLI AGAVCVYFIKETMAHHEA >gi|226332024|gb|ACIB01000032.1| GENE 14 19758 - 20672 602 304 aa, chain + ## HITS:1 COG:CAC2685 KEGG:ns NR:ns ## COG: CAC2685 COG1554 # Protein_GI_number: 15895943 # Func_class: G Carbohydrate transport and metabolism # Function: Trehalose and maltose hydrolases (possible phosphorylases) # Organism: Clostridium acetobutylicum # 1 241 1 241 757 212 41.0 6e-55 MKQFLKVDEWNIIEEGFHPDNMRASESIFSLGNGRFGQRGNFEETYSSDSLQGSYVAGIT FLDRTRVGWWKNGYPRFFSRVPNAPDWSGIYLRLIDEELDLAHWDVEAYRRRLDMREGIS YRDFRVTSPKGHTLEVHVEHINSLANQNLCLIKYSVTSVNYEGKISLVPFLNGDVKHENS NFDEKMWNILRAEATNEYAYLWVQTKHEDSQICLGMTYQFYKNSKPTHISPIKIEKEKLT GFSAGADVKPGDSGKICCHPLLSAMRPSGVGGVCGGRSASRQRAGLGSTGRRAQTAMGRN LGGK >gi|226332024|gb|ACIB01000032.1| GENE 15 20509 - 22065 1312 518 aa, chain + ## HITS:1 COG:CAC2685 KEGG:ns NR:ns ## COG: CAC2685 COG1554 # Protein_GI_number: 15895943 # Func_class: G Carbohydrate transport and metabolism # Function: Trehalose and maltose hydrolases (possible phosphorylases) # Organism: Clostridium acetobutylicum # 1 513 254 755 757 511 52.0 1e-144 MTLEKYVAILSSLQCDRQELVEYAVDEAQAAKEQGWEALVGAHKQQWEEIWEESDVMIEG DPAAQQGIRFNIFQLNQSYRGDDARLNISPKGFTGEKYGGNTQWNTELCCVPYFLLSTPR EISRKLLLYRYNQLPKAIENARKLGFGGGAALYPMVTIHGEECHNEWEITFEEIHRNNII VYAIMQFSRVTGNKEYIAYYGLEVMIAISRFWSQRVSFSEARQKYVLLGVTGPNEYENNV NNNWYTNYSCVQCLQSTIECLEMVAHEYPEEYNRIRRSTEFRHAEETARWKEIIEKMYLP EDKERGIFVQDDGYPDKVLGTVNDIPVNERPINQHWSWDRILRSCYIKQSDVLLGLFLYY EHFDRETIRRNFRFYEPRTVHESSLSPFVHAILAAWIGDTEEAYRLFLHSTRLDLDDYNN EVHQGLHVTSMAGSWQIIVRGFAGMKILDGQLDFTPIIPEAWDSYTFKVNFRNCTLQMKV GKQEIKISLLEGYELNIRISEVVYNLKKGKDLIVPKQN >gi|226332024|gb|ACIB01000032.1| GENE 16 22408 - 23469 991 353 aa, chain + ## HITS:1 COG:VC1756 KEGG:ns NR:ns ## COG: VC1756 COG0845 # Protein_GI_number: 15641760 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 11 350 19 355 364 139 30.0 7e-33 
MGNMKKNYGCILAALILLTACGQKKEDSVTTVRPVKTARVESRSEIRKDFSGIVEAVDYV KLAFRVSGQIINLPVVEGQRVKKGQLIAAIDPRDLALQYAADKSAYETAAAQVERNKRLL ARQAISVQEYEISVSTFQQKKSAYELSSNNMRDTRLTAPFDGSIEKRLVENYQRVNSGEG IVQLVNTKKLRIKFTIPDAYLYLLRSKDQRFRVEFDTYRGHIFNAKLEEYLDISTDGTGI PVTITIDDPAFDHALYEVKPGFTCSIRFSADVGPFLEESMTVVPLSAIFGESEGNKTYVW VLNGNQVNRREVTVYSPMGDAQAFISKGLKAGETVVTAGVTQLVEGETVKELK >gi|226332024|gb|ACIB01000032.1| GENE 17 23612 - 26647 2933 1011 aa, chain + ## HITS:1 COG:VC1757 KEGG:ns NR:ns ## COG: VC1757 COG0841 # Protein_GI_number: 15641761 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Vibrio cholerae # 1 1010 1 1012 1016 584 33.0 1e-166 MNLAKYSLDNTKVIYFFLAVLLIGGVFSFGKLGKKEDAPFVIKSAVIMTRYPGAEPAEVE RLITEPISREIQSMSGVYKIKSESMYGISKITFELLPSLPASSIPQKWDELRRKVLNIQP QLPSGSSVPTVSDDFGDVFGIYYGLTADDGFSYEEMRNWAERIKTQVVTADGVMKVALFG TQTEVVNISISVNKLAGMGIDPKQLAGLLQSQNQIINTGEITAGEQQLRVVANGMYTTVD DIRNQVITTRAGQVKLGDIAVIEKGYMDPPSTIMRVNGKRAIGIGVSTDPQRDVVLTGEM VDKKLAELLPLMPVGLNLESLYLENVIAKEANNGFIINLIESILIVIVIIMLVMGMRAGV LIGTSLVFSIGGTLLIMSFMGVGLNRTSLAGFIIAMGMLVDNAIVVTDNAQIAIARGVDR RKALIDGATGPQWGLLGATFIAICSFLPLYLAPSSVAEIVKPLFVVLAISLGLSWVLALT QTTVFGNFILKSKAKNAGKDPYDKPFYHKFEKILSVLIRRKIVTLGSMIVLFVVSLVVMG MMPQNFFPSLDKPYFRADVFYPDGYGVNDVAREMKKVEAHLLKLPEVKKVSITFGSTPLR YYLASTSVGPKPNFANVLVELNDSKYTKEYEEKFDVYMKANFPNAITRTSLFKLSPAVDA AIEIGFIGPNVDTLVALTNQALEIMHRNPDLINIRNSWGNKIPIWKPIYSPERAQPLGVS RQGMAQSIQIGTNGMTLGEFRQGDQVLPILLKGNSVADSFRINDLRTLPVFGNGPETTSL EQVVSEFDFRYRFSNVKDYNRQLVMMAQCDPRRGVNAIAAFNQIWSQVQKEIKIPEGYTL KYFGEQESQVESNEALAKNLPLTFFLMFTTLLLLFKTYRKPTVILLMLPLIFIGIVLGLL LLGKSFDFFAILGLLGLIGMNIKNAIVLVDQIDIENQSGLDPRKAVIKATISRIVPVAMA SGTTILGMLPLLFDAMFGGMAATIMGGLLVASALTLFVLPVAYCAIHRIKG >gi|226332024|gb|ACIB01000032.1| GENE 18 26675 - 27949 1440 424 aa, chain + ## HITS:1 COG:FN1273 KEGG:ns NR:ns ## COG: FN1273 COG1538 # Protein_GI_number: 19704608 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer 
membrane protein # Organism: Fusobacterium nucleatum # 58 422 43 412 413 62 22.0 2e-09 MMKNKLILLFALGLCAQVQAQVPHLSRETYRERVEAYSQVLKQQHLKSMASTDARKIAFT GFLPKVDISAEGTLNLKEMDSWDGPAGQYRNHTYQGIFVVSQPLYTGGALQAQNRIAKAD EKLDQLSEELTRDQIHYQSDAFYWNASSARAMLNASAQYQEIVEKQYEIIQDRFKDGAIS RTDLLMISTRRKEAELQYINARQNYTLALQKLNILMGEEPNAPVDSLCAIGVVCPPVTLL PLDDVLQRRADFASTEVNIQKSEAQRKAALSQYNPQVSMYVTGGWATASPNMGYDVKFTP IVGMSVNIPVLRWGARFKTNRQQKAYTGIQKLQQSYVVDQINQELAAALTKLKETEEQVK TAEENKELAEENLDLITFSYNEGKASMVDVLSAQLSWTQAHTSLINAYLAEKMAVAEYRK VISE >gi|226332024|gb|ACIB01000032.1| GENE 19 28262 - 28513 157 83 aa, chain + ## HITS:1 COG:no KEGG:BF3314 NR:ns ## KEGG: BF3314 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 2 83 349 430 430 174 98.0 1e-42 MTNSYAGEAGGNMRTDIKYCSTDNFVWGIKIPVAIPHPIEKIDIMQVYSKFRSWITEPNH SDPSSPDFNENWFKYYDTSKVIG >gi|226332024|gb|ACIB01000032.1| GENE 20 28694 - 32398 4235 1234 aa, chain + ## HITS:1 COG:HI0752_1 KEGG:ns NR:ns ## COG: HI0752_1 COG0046 # Protein_GI_number: 16272693 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain # Organism: Haemophilus influenzae # 49 928 101 1011 1011 507 37.0 1e-143 MILFFRTPSKSVIAVESNHELNANDNNKLCWLFGEAMPESEENLQGCFVGPRREMITPWS TNAVEITQNMGLDGISRIEEYFPVKDENADHDPMLQRMYKGLNQNVFTTNRQPEAIIYID DLEAYNEKEGLALSKEEMDYLKKVESDLGRRLTDSEVFGFAQINSEHCRHKIFGGTFIID GVEQESSLFQMIKKTTQENPNKIISAYKDNVAFAEGPVVEQFAPADHSKPDFFRIKDIKS VISLKAETHNFPTTVEPFNGASTGTGGEIRDRMGGGKGSWPIAGTAVYMTSYPRTEEGRE WEDILPVRKWLYQTPEQILIKASNGASDFGNKFGQPLICGSVLTFEHTENNETYAYDKVI MLAGGVGYGTQRDCLKGQPEAGNKVVVIGGDNYRIGLGGGSVSSVDTGRYSSGIELNAVQ RANAEMQKRANNVVRALCEEDENPVVSIHDHGSAGHVNCLSELVEECGGLIDMSKLPIGD KTLSAKEIIANESQERMGLLIKEEAIEHVRQIAERERAPMYVVGETTGDHRFAFQQADGV RPFDLAVDQMFGSSPKTYMIDKTVERHYTMPEYETSKLHEYLTQVLQLEAVACKDWLTNK VDRSVTGKVARQQCQGEIQLPLSDCGVVALDYRGEKGIATSIGHAPQAALADPAAGSILS VSEALTNLVWAPLAEGLDSVSLSANWMWPCRSQEGEDARLYTAVKALSDFCCALQINVPT 
GKDSLSMTQKYPNGEKVVSPGTVIVSAGGEVSDIKKVVSPVLVNDAKTTLYHIDFSFDNL KLGGSAFAQSLGKVGSEVPCVQDAEYFRDAFLAVQELVNKGLILAGHDISAGGLITTLLE MCFANVEGGLEISLDKMKETDIVKILFAENPGIVIQISDKHKDEVKKILEDAGVGFIKLG KPTDERHILVNKEGATYQFGIDYMRDVWYSSSYLLDRKQSMNGCAKKRFENYKMQPLEFA FMPGFKGKLSQYGITPERRTPSGIRAAIIREKGTNGEREMAYSLYLAGFDVKDVTMTDLI SGRETLEDVNMIVYCGGFSNSDVLGSAKGWAGGFLFNPKAKEALDKFYAREDTLSLGICN GCQLMMELGLVNPEHEKKGKMLHNESHKFESTFVGVTIPTNRSVMFGSLSGSKLGIWVAH GEGKFSLPYDEDQYNVVAKYSYDEYPGNPNGSDYSIAALASADGRHLAMMPHLERAIFPW QNACYPADRVHNDQVTPWIEAFVNARKWVEEKKK >gi|226332024|gb|ACIB01000032.1| GENE 21 32570 - 36562 2699 1330 aa, chain + ## HITS:1 COG:all4963_3 KEGG:ns NR:ns ## COG: all4963_3 COG0642 # Protein_GI_number: 17232455 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. PCC 7120 # 800 1047 8 252 294 121 31.0 1e-26 MKKILFLLLLLPLMVSAQTYKYIGVEDGLSNRRVYYIQKDKKGYMWFLTHEGVDRYDGKE FKRYKLMDDGEELNSLLNLNWLYLDHESTLWEIGKKGKVFRYDPLRDQFTLVYKLPEEKI KDRPAPISYSFIDRNNNIWLCNEETLYLYNTHTLQTLQIKNEIGEDITDIEQIDDTHFFI GTEQGIHHAELKNQGLHLLPCDKLDNLPIQINELYFHRPSRKLFIGTFERGIYVYDMNTK QSTQPHMSLTDVSITRIKPLNNKELLVATDGAGVYKININSYLTTPYIVADYNQYNEMNG NNISDFYVDDEQRIWLANYPIGITVRNNRHSSYNWIKHSIGNKQSLINDQVNSIIEDSER DLWYATNNGISYYNSETGVWHSFMSSFEKNGGNKNHIFVTLCEVEPGIIWAAGYSSGIYQ INKRTLSVEYITPSSLYGVNIRPDKYIRSIIKTADGDIWSGGYYNLKRIDFHKKTLRLYP KLNSITSILEKDSKQMWIGTATGLYLLEKESGKYQRIELPVESMYIYSLYQARNGLLYIG TSGSGLLIYDPEKRTFTHYHRDNCALISNNIYTILSDTDDDIIMSTENGLSSYYPAEKLF HNWTKDQGLMASHFNAGSGTLRKNGNFIFGSSDGAIEFNKEMKIPRKYSSKMVLSDLTIF YQTVYPGDENSPLSTDIDDTKELELSYSQNIFSLKVSSINYDYPSNILYSWKLEGFYDQW SRPGNENIIRFTNLSPGEYTLHIRAVSNEDKRIVLEERTMKISIAQPIWLSFWAMLVYAI VLAVIAIITLRIIILRKQRKVSDEKIHFFINTAHDIRTPLTLIKAPLEEIREREALTKDG ISNMNTALRNVNALLRLTTNLINFERADVYSSELYISEYELNTYLTETFNAFRPYASVKH INFTYESNFRYLNVWIDKEKMDSILKNIISNALKYTPENGSVHIYASETNDSWNVEVNDT GIGIPANEQKKLFKIHFRGSNAINSKVTGSGIGLMLVWKLVHLHKGKINLSSVEHQGSSI KVSFPKDSKHFHKAHLATRTRELSTEQVPHVSPAEIYEKAKKQHDQNLQRLLIVEDNDEL 
RNYLTHTLSDNYTIQTCSNGKEALTIVKEYMPELIISDIMMPEMRGDELCAAIKNDIETS HIPIILLTALNDEKNILEGLKIGADEYIVKPFNIGILKATIANLLTNRALLKSKYANLEV SEEEEVSPNCATDLDWKFIATVKKSVEENIDNPAFNVDVLCNLLNMSRTSFYNKLKALTD QAPADYIRLIRLKRAAALLKTGQHSVTEISELTGFNDVKYFREVFKKHFKVSPSKYCKEG GKEDVEEQQE >gi|226332024|gb|ACIB01000032.1| GENE 22 36590 - 37132 509 180 aa, chain + ## HITS:1 COG:FN0712 KEGG:ns NR:ns ## COG: FN0712 COG2059 # Protein_GI_number: 19704047 # Func_class: P Inorganic ion transport and metabolism # Function: Chromate transport protein ChrA # Organism: Fusobacterium nucleatum # 2 169 4 171 186 155 50.0 3e-38 MNIYLEAFGIFFKIGAFTIGGGYAMVPLIENEIVTKRNWISKDDFIDLLAIAQSAPGILA VNISIFIGYKLRGIRGSLVTALGTVLPSFVIILAIAMFFHNFKDNPIVERIFKGIRPAVV ALIAAPTFSMAKSAKVNRYTLWIPVVSALLIWLLGFSPIWIIIAAGVGGFCWGKWKQSHP >gi|226332024|gb|ACIB01000032.1| GENE 23 37158 - 37706 610 182 aa, chain + ## HITS:1 COG:FN0713 KEGG:ns NR:ns ## COG: FN0713 COG2059 # Protein_GI_number: 19704048 # Func_class: P Inorganic ion transport and metabolism # Function: Chromate transport protein ChrA # Organism: Fusobacterium nucleatum # 1 182 1 173 176 118 46.0 7e-27 MIYLQLFYTFFKIGLFGFGGGYAMLSMIQGEVVTRYDWVSTQEFTDIVAISQSTPGPIGI NAATYVGFTATGSIWGSVIATFAVVLPSFILMLTISKFFLKYQKHPAVEAVFSGLRPAVV GLLASAALVLMNVENFGSPTDDTYTFVISIIIFLVAFIGTKKYHANPILMIIACGIAGLI LY >gi|226332024|gb|ACIB01000032.1| GENE 24 37876 - 39384 1402 502 aa, chain - ## HITS:1 COG:BS_yngK KEGG:ns NR:ns ## COG: BS_yngK COG1649 # Protein_GI_number: 16078889 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 8 501 11 510 510 363 39.0 1e-100 MNLRKLILLLALFLATGVGAQIQQQSPYPKREFRGAWIQAVNGQFRGIPTEKLKQTLIDQ LNSLQGAGINAIIFQVRPEADALYASQLEPWSRFLTGVQGQAPSPYWDPMQFMIDECHKR GMEFHAWINPYRVKTSLKSELSPNHLYNIHPEWFVTYNNQLFFDPALPESRRHICMVVAD IVSRYDVDAIHMDDYFYPYPAKGMDFPDDASFARYGGGFTNRADWRRSNVNILIQKIHET IRGLKPWVKFGISPFGIYRNEKNDPLGSKTNGLQNYDDLYADVLLWARNGWVDYNIPQIY WQIGHPAADYETLVKWWAKNTENRPLFIGQSVMNTIQNADPKNPSMNQLPRKMALERAYQ 
TIGGSCQWPASAVVENAGKYRDALVQEYHKYPALVPVFDFMDDKAPGKVRKVKKVWTEDG YMLFWTAPKAKDEMDRAVQYVVYRFDDKEKVNIDDASYIVAVTRNNFYKLPYKDGKNKYR YVVTALDRLHNESKSVSKKVKL >gi|226332024|gb|ACIB01000032.1| GENE 25 39554 - 42331 2809 925 aa, chain + ## HITS:1 COG:CAC0503 KEGG:ns NR:ns ## COG: CAC0503 COG0178 # Protein_GI_number: 15893794 # Func_class: L Replication, recombination and repair # Function: Excinuclease ATPase subunit # Organism: Clostridium acetobutylicum # 7 922 5 939 939 856 47.0 0 MTDSKYISIKGARVNNLKNIDVNIPRNKLVVITGLSGSGKSSLAFDTLYAEGQRRYVESL SSYARQFLGRMSKPECDFIKGIPPAIAIEQKVNSRNPRSTVGTSTEIYEYLRLLYARVGK TYSPVSGEEVKKHSIEDIVNCMLSYPEGTRYTVLTQIYLHDGRTLEQQLEIDRKQGFNRL EVNGEMVRIDEYAAGKEDVVYLLVDRMTAAKSKDAISRLTDSAETAMYEGDGMCMLRFYQ PDGTTRLHTFSTKFEADGMTFEEPNDQMFSFNSPIGACPVCEGFGKVIGIDEHLVVPNRS LSVYDGAIVCWRGEKMGEWREELIHNAEKFNFPIFTPYYELTDEQRRVLWEGNQYFHGIN DFFKMLEENQYKIQYRVMLARYRGKTLCPKCHGTRLKPEAGYVRVGGKNISELVDLPITE LQKFFDSLTLNEHDQAVARRILTEINSRIRFLIDVGLGYLTLNRLSNSLSGGESQRINLA TSLGSSLVGSLYILDEPSIGLHSRDTDRLVHVLRQLQQLGNTVVVVEHDEEIIRAADYII DIGPNAGRLGGQVVYEGDMKDLKKGSNSYTVKYLLGEETIPVPEHRRPWNNYIEIKGARE NNLKGVDARFPLNVMTVVTGVSGSGKSTLVRDIFYRALKRELDECSERPGEFVSIEGDLR NLRNVEFVDQNPIGKSSRSNPVTYIKAYDEIRKLWADQPLAKQMGYTAGHFSFNNEGGRC EECKGDGTITVEMQFMADLVLECESCHGKRFKADTLEVKFQDKNIYDVLEMTVNQAIEFF TKHGQKKVVKKLIPLQDVGLGYIKLGQSSSTLSGGENQRVKLAYYLSQEKADPTLFIFDE PTTGLHFHDIRKLLDAFDALIRRGHSIVIIEHNMDVIKCADYVIDLGPEGGDKGGNIVAT GTPEEVAACAASYTGQFLQEKLNHL >gi|226332024|gb|ACIB01000032.1| GENE 26 42500 - 42982 645 160 aa, chain - ## HITS:1 COG:no KEGG:BF3321 NR:ns ## KEGG: BF3321 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 160 1 160 160 302 100.0 2e-81 MKKLMWIVCLLVVSVTAQAQFEKGKWIVNPSVTGLGLSYSKSEKTQFGLQAQGGAFLVDN VALMLTAGANWSKPEDKYTLGVGGRYYFDKCGIYLGAGLKMNRYNWKVGDTTDFAFGAEA GYAFFLTRTVTIEPAVYYDLSFKDSDLSKFGLKVGFGFYF >gi|226332024|gb|ACIB01000032.1| GENE 27 42990 - 43220 316 76 aa, chain - ## HITS:1 COG:no KEGG:BF3322 NR:ns ## KEGG: BF3322 # Name: not_defined # Def: hypothetical 
protein # Organism: B.fragilis # Pathway: not_defined # 1 76 1 76 76 97 100.0 1e-19 MKKVKKSTWVTVALLIYVSATAAYLLPRNHEVSDTEKYLTLAASYVIVLVLWLVLRKKEQ MQQRRREEEHMNNLKK >gi|226332024|gb|ACIB01000032.1| GENE 28 43240 - 44667 1474 475 aa, chain - ## HITS:1 COG:VC0687 KEGG:ns NR:ns ## COG: VC0687 COG1966 # Protein_GI_number: 15640706 # Func_class: T Signal transduction mechanisms # Function: Carbon starvation protein, predicted membrane protein # Organism: Vibrio cholerae # 1 470 1 477 494 410 49.0 1e-114 MITFTICLLALIAGYFIYGRLVERVFGPDDRKTPALTHADGVDYIPLPTWKIFMIQFLNI AGLGPIFGAIMGAKFGTASYLWIVLGSIFAGATHDYFAGMLSLRNGGESLPEIVGRFLGM TTKQVMRGFTVVLMILVGAVFVAGPAGLLAKLTPDSLDTTFWIIVVFIYYILATLLPVDK IIGKIYPLFAAALLFMAVGILVMLYVHHPVIPELWDGLQNTNPEAAVLPIFPIMFVSIAC GAISGFHATQSPLMARCMTSERHGRPVFYGAMITEGIVALIWAAAATYFYREHGMEENNA SVIVDAITKDWLGAVGGVLAILGVIAAPITSGDTAFRSARLIVADFLKLEQKSIRRRLYI CVPMFIVAIGLLLYSLRDKDGFDMIWRYFAWANQTLSVFTLWAITVYLARARKWYGLTLV PALFMTDVCSTYICIAPEGLGLSHWVSYVTGGLCTLIGAVWFIVWKKGIGYSYKK >gi|226332024|gb|ACIB01000032.1| GENE 29 44945 - 46465 1212 506 aa, chain + ## HITS:1 COG:BS_yngK KEGG:ns NR:ns ## COG: BS_yngK COG1649 # Protein_GI_number: 16078889 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 28 506 20 510 510 231 32.0 3e-60 MNTIYKSHNDSFSVKKRAGRGTSLLFLFLLSLLPLHAQTSPKHEVRAAWITAVYGLDWPR TKATTPAGIRRQKEELIDILDRLKEANFNTVLFQTRTRGDVLYRSDIEPFNSILTGKTGG DPGYDPLAFAVEECHKRGMECHAWMVTIPLGGKKHVASLGKQSVTRRERDICVPYKNEYF LNPGHPATKEYLMKLVREVVSRYDVDGVHFDYLRYPENAPRFPDSYDFRRYGKGRTLAQW RRDNLTDIVRYIYNGVKAMKPWVKVSTCPVGKYRDTSRYSSKGWNAFFTVYQDPQGWLGE GIQDQIYPMMYFRGNGFYPFALDWQEQSNGRHIVPGLGIYFLHPDEGNWTTEEVERQMHF IRSNRLAGEAHYRVKYLMDNTQGVYDLLLDHFYQYPALQPPMPWIDNVSPTAPSALKAVS AGDGYTRLNWKAATDNDKQNAPMYVVYASDTYPVDITNPENILVQNLRDTHYIYAPILPW TARKYFAVTAVDRCGNESKAAQQEEK >gi|226332024|gb|ACIB01000032.1| GENE 30 46558 - 48819 1748 753 aa, chain + ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # 
Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 485 742 42 304 328 142 36.0 2e-33 MRSIHSLIRCCMLLLLLTSCYSKEERRVLVIHSYEKDYQGYAEFNKLIKKEFAKAHIPVE LTFFYLNCEINNEQQEIDKINNFLDSISKWKPEVLLVNDDQATYSLLETHHPLLKGIPIV FSGVNYPNWELIGQYNNVTGFHDKIDFRKNLEMVHKLTGKNHIYTILDFTFLDRKVRNDI DNQLKSTDIISNLDWHLDKNDTRKEAEKGHIIINALSARNLSKNQNKDQTKGGDFIWSIS KYSTLPYLQTKFDYTTVTMASLSTRQRFTTINELFDCGHDFLGGYITPMHIQVEESVHAA ARILNGENVADIPIQESAKGYFIDWNAMQKEHLTIADIPHEYTIINIPFKTRHPIVWWFA LLGSITAIVSLLSGITYLYWRETKRKRSILYELEDEKESLALAVEGSDTYAWRLKDDTMV FEYAFWKNLGMAPHPLTIDGFLSFVDADYLDTTQALLTKNATNGKHFIKLKCDFNGTGYQ WWELRCSTMKSALGGQKTTGLLLNIEDYKKREQELIEARKMAEKAELKESFLANISHEIR TPLNAIVGFSTLLASPDDTDITPEEKEQYIDTINRNSELLLKLINDILELSRIESGYMSF DCDDYPLDALIRDTYQTHKILIPGQLHFLLEEGEKGLIVHVDKNRLVQVITNFLNNASKF TREGSIKLGWSYMPQTEEVEIYVEDTGIGIPQSEQKMIFGRFYKQNEFAQGTGLGLSICK LIIEKLQGRLSLRSEAGTGSRFSIFFSCKKQSI >gi|226332024|gb|ACIB01000032.1| GENE 31 48824 - 50011 805 395 aa, chain - ## HITS:1 COG:no KEGG:BF3165 NR:ns ## KEGG: BF3165 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 395 1 395 395 798 98.0 0 MTNLLYLFNPDQDLALASGEVNYMPPASARRMAEELALLPVWFADGPCSVLAPSAYNQSF LEEMLDLFPLPASLRTQAEDFSEVRSVVPWGWNPALRKRLLSLGVPDAALPSMEDIGRLR DLSHRLQAVRLLPGLQVDEVFCGESCYLTALSDCRAFVESLERCLLKAPLSGSGKGLNWC KGAFTPLIERWCARVIEQQGGVVGEPIYNKVEDFAMEFYSDGKGRIIFAGYSLFRTNAGG AYEGNRLLPDAEIERRLSAYVPVAALHRLREELQRRLSVSLGTEYAGYLGVDMMICRFAL LPEFRIHPCVEINLRMNMGLVSRMLYDRYVRPGAGGTFRISYHPSDGQALQEHDAMTVAH PLHTRNGRVVKGYLPLVPVRKSSRYRAYILVETGG >gi|226332024|gb|ACIB01000032.1| GENE 32 50781 - 51959 1232 392 aa, chain + ## HITS:1 COG:MJ1637 KEGG:ns NR:ns ## COG: MJ1637 COG1373 # Protein_GI_number: 15669833 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Methanococcus jannaschii # 28 290 68 342 473 81 28.0 3e-15 METFYRTHAYLVEHTNAPVRRDLMDEINWSDRLIGIKGTRGVGKTTFLLQYAKEKFGTDR 
SCLFINMNNFYFSGHSIVDFANEFQKRGGKVLLIDQVFKHPDWSRELRMCYDRFPNLKIV FTGSSVMRLKEENLELRDIVKSYNLRGFSFREFLNLQTGMKFRHYTLEEILSSHEQIAKG VLSKVRPLDYFQDYLHHGFYPFFLEKRNFSENLLKTMNMMVEVDILLIKQIELKYLSKIK KLLYLLAVDGPKAPNVSQLASDIQTSRATVMNYIKYLADARLINLVYPKGEEFPKKPSKI MLHNPNLMYSIYPVKVEEQDVLDTFFVNTMWKDHKVHKGDKNTSFMVDEVMPFKICCEGA KIKNNPGVTYALHKAEIGRGNQIPLWMFGFLY >gi|226332024|gb|ACIB01000032.1| GENE 33 51978 - 52538 515 186 aa, chain + ## HITS:1 COG:no KEGG:BF3167 NR:ns ## KEGG: BF3167 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 186 1 186 186 303 100.0 2e-81 MDYEKLEQLKRLHDSGALNDEEFEKEKKKILDEEATAKPTAVLPMGLTENAYLALMNFAM FIPYVGWIAPIVFWIMGKENSILVNRQGKYILNWYISWFLYGIALTVLFVIFMFSGILSI NDMDYANDQTSPLAVLLGIFGGGAGILLLLILGIGCFLCLLFPIIGGIKGLNGKTWKYPL SIPFLK >gi|226332024|gb|ACIB01000032.1| GENE 34 52575 - 56129 3729 1184 aa, chain + ## HITS:1 COG:CAC2499_1 KEGG:ns NR:ns ## COG: CAC2499_1 COG0674 # Protein_GI_number: 15895764 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Clostridium acetobutylicum # 5 411 3 408 413 592 70.0 1e-168 MTKQKKFITCDGNQAAAHISYMFSEVAAIYPITPSSTMAEYVDEWAAAGRKNIFGETVLV QEMQSEGGAAGAVHGSLQAGALTTTYTASQGLLLMIPNMYKIAGEFLPCVFHVSARTLAS HALCIFGDHQDVMSARQTGFAMLAEGSVQEVMDLAGVAHLATIKARVPFMNFFDGFRTSH EIQKIEMLENEDLAPLVDQEALAEFRARALNPMNPVARGMAENPDHFFQHRESCNNYYEA VPAIVEEYMNEISKITGRKYGLFDYYGAEDAERVIIAMGSVTEAAREAIDYLTSQGEKVG LVAVHLYRPFSAKHFLAAVPKTAKTIAVLDRTKEPGANGEPLYLDVKDCFYGAENAPVIV GGRYGLGSKDTTPAQIIAVFKNLAMPMPKNHFTIGIVDDVTFTSLPQEAEIALGGEGMFE AKFYGLGADGTVGANKNSVKIIGDNTDKHCQAYFSYDSKKSGGFTCSHLRFGDDPIRSTY LVNTPNFVACHVQAYLHMYDVTRGLRKNGSFLLNTIWESEELAKNLPNKVKKYFAQNNIS VYYINATQIAQEIGLGNRTNTILQSAFFRITGVIPVDQAVEQMKKFIVKSYGKKGEDVVN KNYAAVDRGGEYKTLTVDPAWANLPDDAKVENNDPAFINEVVRPINAQDGDLLPVSAFKG IEDGTWYQGTSKYEKRGVAAFVPEWNAENCIQCNKCAYVCPHASIRPFVLDAEEQKGANF EMLKAVGKQFDGMTFRIQVDVLDCLGCGNCADICPGNPKKGGKALTMKHLESQLAQADNW 
TYCADNVKSKQHLVDIKANVKNSQFATPLFEFSGACSGCGETPYVKLISQLYGDREMVAN ATGCSSIYSGSVPSTPYTTNAKGHGPAWANSLFEDFCEFGLGMELANEKMRARIVKLFNE ILAADNAPAEAKEVLKAWIENMYDADKTKELAPQIEAIIEQGIAAGCPISKELKGLTQYL VKRSQWIIGGDGASYDIGYGGLDHVIASGKDVNILVLDTEVYSNTGGQSSKATPVGAIAK FAASGKRVRKKDLGLMATTYGYVYVAQIAMGADQAQTLKAIREAEAYPGPSLIIAYAPCI NHGLKAGMGKSQEEEEKAVKCGYWHLWRYNPALEAEGKNPFTLDSKEPNWDDFKGFLKGE VRYASVMKQYPAEAEELFQAAEDNAKWRYNNYKRLANQAWGAAE >gi|226332024|gb|ACIB01000032.1| GENE 35 56332 - 56835 352 167 aa, chain + ## HITS:1 COG:CAC2751 KEGG:ns NR:ns ## COG: CAC2751 COG0454 # Protein_GI_number: 15896008 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Clostridium acetobutylicum # 12 162 10 165 167 100 32.0 1e-21 MNITIRPTRLEELGQVMPFYERARRFMATNGNANQWINGYPSPDDIRQDIENGSSYVFVN ENQELEGCFAFIRGEDPNYKVIKDGAWLNDAPYGVIHRIASGGRVKGLMDLCLEWCSSHC PNLRVDTHRDNKVLQNILLKNGFTYCGIIYVKNGTERLAYQRTAVTR >gi|226332024|gb|ACIB01000032.1| GENE 36 56877 - 57731 816 284 aa, chain - ## HITS:1 COG:MA2147 KEGG:ns NR:ns ## COG: MA2147 COG2113 # Protein_GI_number: 20090990 # Func_class: E Amino acid transport and metabolism # Function: ABC-type proline/glycine betaine transport systems, periplasmic components # Organism: Methanosarcina acetivorans str.C2A # 29 283 58 311 315 187 37.0 3e-47 MRKYRIIGSLLLLAVMCVACNPDSGKKKISIAYANWLEGIAMSHLAKVVLEEKGYEVELL NADVAPVFASVSREKADVFMDAWLPVTMKDYIDQYGDQIEFIGEVYDSARVGLVVPQYVT IDSIGELAAYKGQFSSEIVGIDAGAGIMKTTDKAIGDYGLDGYKLLTSSSSTMLASLQKA MEKEAWIVITGWTPHWMFDRYPLKFLHDPKGTYGNVESIYVIGWKGFTEKDPFAAKFFSN IKFTTEEISSLMKALKDARMDEEDLVRKWRDEHRELVESWIPES >gi|226332024|gb|ACIB01000032.1| GENE 37 57751 - 58575 916 274 aa, chain - ## HITS:1 COG:YPO2646 KEGG:ns NR:ns ## COG: YPO2646 COG4176 # Protein_GI_number: 16122855 # Func_class: E Amino acid transport and metabolism # Function: ABC-type proline/glycine betaine transport system, permease component # Organism: Yersinia pestis # 2 274 94 366 388 260 49.0 2e-69 
MINIGQYIETLINWMMIHFSTFFDALNLGIGSFINGFQHILFGIPFYLTIAAMVLLAWAK AGRGVAVFTLLGLLLIYGMGFWEATMQTLALVLSSTCLALIVGVPIGVWTANSNRAEKVI HPILDLMQTMPAFVYLIPAVLFFGLGVVPGVFATIIFAMPPVIRLTGLGIRQVPKNVVEA SRSFGATRWQLLYKVQLPLALPTILTGVNQTIMMSLSMVVIAAMIAAGGLGEIVLKGITQ MKIGLGFEGGIAVVILAIILDRITQGMVRRKDKK >gi|226332024|gb|ACIB01000032.1| GENE 38 58572 - 59798 1257 408 aa, chain - ## HITS:1 COG:MA2145 KEGG:ns NR:ns ## COG: MA2145 COG4175 # Protein_GI_number: 20090988 # Func_class: E Amino acid transport and metabolism # Function: ABC-type proline/glycine betaine transport system, ATPase component # Organism: Methanosarcina acetivorans str.C2A # 3 397 6 401 491 381 52.0 1e-105 MSKITIKDLYLIFGNDKQHALRMLKEEKSKSEILKATGCTVAVKDANLSIREGEIFVIMG LSGSGKSTLLRCINRLIKPTSGEVIINGRDISNVTDKELLRIRRKELAMVFQHFGLLPHR SVLHNIAFGLELQGVKKQEREQRAMESMQLVGLKGYENQMVGELSGGMQQRVGLARALAN DPEVLLMDEAFSALDPLIRIQMQDELLTLQSKMKKTIVFITHDLNEAIKLGDRIAIMKDG EVVQVGTSEEILTEPANAYVERFVQSVDRSKIITAASVMVDKPLVARLKKEGPEVLIRKM LERNLTVLPVVDANNVLVGEVRLKDLLRLRQDQVKSIESVVREEVHSVLGDTILEDILPL MTKTNSPIWVVNETREFQGVVPLSSLIYEVTGKNKQEINEIIQNAIEL >gi|226332024|gb|ACIB01000032.1| GENE 39 59836 - 60045 75 69 aa, chain + ## HITS:1 COG:no KEGG:BF3337 NR:ns ## KEGG: BF3337 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 17 69 1 53 53 98 98.0 7e-20 MTYTHENKHFFIARKLMPGKRRFRRSPVKYDFFTRLLQTGFLSTDSLLYPESQFHICTQL SSKDAYADN >gi|226332024|gb|ACIB01000032.1| GENE 40 60133 - 60477 232 114 aa, chain - ## HITS:1 COG:lin0580 KEGG:ns NR:ns ## COG: lin0580 COG3695 # Protein_GI_number: 16799655 # Func_class: L Replication, recombination and repair # Function: Predicted methylated DNA-protein cysteine methyltransferase # Organism: Listeria innocua # 15 107 6 98 98 110 53.0 7e-25 MKDYKENKSNLSETFCNEVYSVVREIPAGSVTTYGDIAALLGMPQYSRMVGRALRQLPAG LNLPCHRVVNAIGRLVPGWDEQRRLLDAEGVSFRKNGCVDLRLHRWRDDSVLTD >gi|226332024|gb|ACIB01000032.1| GENE 41 60508 - 63231 2387 907 aa, chain - ## HITS:1 COG:SMb20671 KEGG:ns NR:ns ## COG: SMb20671 COG1879 # 
Protein_GI_number: 16265126 # Func_class: G Carbohydrate transport and metabolism # Function: ABC-type sugar transport system, periplasmic component # Organism: Sinorhizobium meliloti # 25 300 34 313 322 186 39.0 3e-46 MNRILFLLGLMLSGLLVSCSPDAPRYTIGVSQCSDDTWRHKMNDEIQREALFYGGVKVET RTAHDDSRRQIGDIRYFIRQKVDLLIVAANEGMALTPVVEEAFDKGIPVIMVDRRILSDK YTAYIGADNYELGKAVGNYIVHRLKGRGKVVELSGLVGSTPAIERHQGFMSAISQYPDMT LLASEDAGWLQQPAEAKMDSLLQRFPEIDAVYGMNDRMAAGAFRAAGRRGREKEMIFVGI DALPGKGNGVELVLDSVLDATFIYPTEGDKVVQLAMDILEKKPFERETKLKTAVVDAVNA HVMELQTNHIGELDGKIETLNDRVGIYLSRVATQRIVLYGGLIILLLIVGLLIVVYKSLR SKNRLNRELSRQKEQLEEQRDQLIELSHQLEKATHAKLVFFTNISHDFRTPLTLVADPVE QLLADPSLEGDRRRMLLLVQRNVQILLRLVNQILDFRRYETGKMEFTPVPLDLLQCFVEW NDSFQAAARRKHIHFSFDSMPDMDYHTQADAEKLERIYFNLLANAFKFTPENGKVTVRLA ALQKDGAPFFRFTVANTGSLISAEHIRSIFDRFYKIDRHHTGSGIGLALVKAFVEIHGGS ISVESDERLGTVFTVDLPVRTCEEGTYVVTAPMEESAPDRVDSLLREDEAENLNDPSKPS VLVIDDNADIRAYVHTLLNSEYSIIEAADGTEGIRKAMKYVPDVIISDVMMPGIDGIECC RRLKGELQTCHIPVILLTACSLDEQRIQGYAGGADSYISKPFSSQLLLTRIRNLIESRQR MKQFFGDRQTLAKEDICDMDKDFVERFKSLIEVKMGDSELNVEDLGKEMGLSRVQLYRKI KSLTNYAPNELLRMARLKKAASLLASSDMTIAEVGYEVGFTSPSYFAKCYKEQFGESPTE FLKRSGL >gi|226332024|gb|ACIB01000032.1| GENE 42 63418 - 64308 907 296 aa, chain - ## HITS:1 COG:MA1840 KEGG:ns NR:ns ## COG: MA1840 COG0524 # Protein_GI_number: 20090690 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar kinases, ribokinase family # Organism: Methanosarcina acetivorans str.C2A # 9 294 35 322 326 181 37.0 2e-45 MNSIIVGMGEALWDVLPEGKKIGGAPANFAYHVSQFGFDSRVVSAVGNDELGDEILEIFR EKQLKHQLERVNYPTGTVQVTLDNGGVPCYDIKEGVAWDNIPFTDDLKRLALSTRAVCFG SLAQRDEVSRATINRFLDTMPDMEGQLKIFDINLRQNFYTKEVLRESFRKCNVLKINDEE LIIISRMFGYPGIDLQDKCWILLAKYNLKMLILTCGTNGSYVFTPGTVSFQETPKVPVAD TVGAGDSFTAAFCSSVLKGKSIPEAHRLAVEVSAYVCTQSGAMPVLPEALRNRLND >gi|226332024|gb|ACIB01000032.1| GENE 43 64340 - 65506 1130 388 aa, chain - ## HITS:1 COG:NMB0535 KEGG:ns NR:ns ## COG: NMB0535 COG0738 # Protein_GI_number: 15676441 # Func_class: G Carbohydrate transport and 
metabolism # Function: Fucose permease # Organism: Neisseria meningitidis MC58 # 4 382 24 417 426 89 26.0 1e-17 MENTKNNSYMKLIPVMLCFFAMGFVDLVGIASNYVKADLGLTDSQANIFPSLVFFWFLIF SVPTGMLMNRIGRKKTVLLSLVITFASLLLPVFGDSYLLMLVSFSLLGIGNALMQTSLNP LLSNIVSGERLASTLTFGQFVKAIASFLAPYIAMWGAIEAIPTFDLGWRVLFPIYMVVAV VAILLLNATQITEEPEEGKPSTFGQCLALLGKPFILLSFLGIMCHVGIDVGTNTTAPKIL MERLGMTLADAGFATSLYFIFRTAGCFLGAFILQKMAAKTFFAISVLCMLAAMFGLFVFQ DQAMIYVCIALIGFGNSNVFPIIFSQAMLYMPDKKNEVSGLMIMGLFGGTIFPLAMGVAS DAVGQSGAVAVMLVGVLYLMFYTWRIKK >gi|226332024|gb|ACIB01000032.1| GENE 44 65552 - 67420 1745 622 aa, chain - ## HITS:1 COG:BS_sacC KEGG:ns NR:ns ## COG: BS_sacC COG1621 # Protein_GI_number: 16079757 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-fructosidases (levanase/invertase) # Organism: Bacillus subtilis # 121 622 22 509 677 407 44.0 1e-113 MKTTFMNVSRGVIGALAFSLGISSCQSSQSKMTFEQEGDSLTVIHITNPTQYLLLPVEEK TPEAQVCIASDSVPVDMDVRLSREKVDYFVPFALPKGEKEVAVRIRHLPKEALCWKELKL SDTFDTTNTDQYRPLYHHTPLYGWMNDANGLVYKDGEYHLFYQYNPYGSMWGNMHWGHSV SKDLVHWEHLEPALARDTLGHIFSGSSVVDDANTAGYGAGAIVAFYTSASDKNGQIQCMA YSTDNGRTFTKYEKNPVLTPFDGLKDFRDPKVFWYAPDQKWVMVVSADKEMRFYSSENLK EWTYMSGWGEGYGVQPSQFECPDMVELPVDGNPDHKKWALIVNVNPGCYFGGSATQYFIG DFDGEKFVCDNKPETVKWLDWGKDHYATVCFSNTGDRTIAVPWMSNWQYANIVPTRQFRS ANALPRELSLYTQDGDIYMAAAPVEETKSLRKESREIPAFEVGDAYHVDSLLSDNKGAYE IELELATGSAEIMGLKLFNEKGENVDIYISLPEKKLVMDRTKSGIVDFGKDSAPHAIEAH DRRKQNSINYVDDFALGTWAPVQKAGNYKLDIFVDKCSVEIFLNGGKIAMTNLIFPTTPY NQMSFYSRGGAFKVDRCKIYRL >gi|226332024|gb|ACIB01000032.1| GENE 45 67545 - 69833 2188 762 aa, chain - ## HITS:1 COG:no KEGG:BF3178 NR:ns ## KEGG: BF3178 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 762 22 783 783 1489 99.0 0 MMKNQKRAVCLLACLLIAGFTAAQEKKESAQTASGSKEEGNRNVMLNASSANGPREISIG LPGGDVNVLENGLPVVYTSNPHNVNTHWRGDSSLGHVGLLKISETAITTGNIGYAVNSFT QLGTEKFRGIVNYSTNHFGKQQFDANISGGMGKGWFYSGSVYQNFDPGSFKLRFTSNQDR TQIYKAALTKNYNEGRGQLSAIYHYSNSRWPSNEVTSAPFIYVGDGSVKEIPGFSLGTSS 
YLPTTGEMAYRDMRTGELKETNLYDATLNKGNQLTLLNTYTWDNGLNWKINLKYDHALGS YVYQTPMAMEQKDASAGYYLKAVDGTLKPYEGYVQSRMSCLNRGKIDEFFATSELSRSYR NTTWRIGVNEWHYKVDYASNTTMYDHTVGEYPERLVREGNTDGVYYDFNKNASEYYKGHE NKLAVYATHDWDISPKWNVYYGARLEWQHLEGENAAVYDAEGNAVGRFPDYYLGAVSDKG VRITPRLFDYDWMNMAFTAAATYKLTREFGFTADFTYNTQRPGLSNFAPATMPNTDKISV PLGRAGVYYNTDWISLTSLFSYISKTNNNSTLNLINPNDQTEILAAPLSYDIQTIGWTTD AVIKPFKGFNFHFLFTYQSPTYKKYETSVTFKDGTVGEIDATGNIVTEIPKVLVELDPSY NITNDLRVWASFRYFSKTYANIKNAYYFNGRWETFGGINWTVNKHLALSATVINFLNQTG AKGSIAGAELVDKKDAASYNGHWMAGSYIRPFTVELAASIRF >gi|226332024|gb|ACIB01000032.1| GENE 46 70084 - 70440 315 118 aa, chain - ## HITS:1 COG:no KEGG:BF3344 NR:ns ## KEGG: BF3344 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 118 1 118 118 197 100.0 1e-49 MKKSNIYIGEVIKQVIAEKQVTKAELARRLGVKPQSVDYLLTRKSIDTDTLYSLSLALDY DFAVLYSIKKEHALATDEESPFKVGNAKISLEIELRPDEMLKLNLKQKIADLLEGKGK >gi|226332024|gb|ACIB01000032.1| GENE 47 70595 - 70762 123 55 aa, chain - ## HITS:1 COG:no KEGG:BF1194 NR:ns ## KEGG: BF1194 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 55 203 257 257 110 100.0 1e-23 YFVLPALYIGAQIDPFAYTYNKTTYNPQAGLGDLSADSHNYSVLAAPTFKIGFKF Prediction of potential genes in microbial genomes Time: Tue May 17 23:04:02 2011 Seq name: gi|226332023|gb|ACIB01000033.1| Bacteroides sp. 3_2_5 cont1.33, whole genome shotgun sequence Length of sequence - 62994 bp Number of predicted genes - 48, with homology - 45 Number of transcription units - 29, operones - 14 average op.length - 2.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 15 - 1703 1547 ## BF3181 putative lipoprotein - Prom 1875 - 1934 9.2 2 2 Tu 1 . - CDS 2214 - 2477 67 ## - Term 2770 - 2812 2.1 3 3 Op 1 . - CDS 2931 - 3059 57 ## 4 3 Op 2 . - CDS 3064 - 4119 743 ## BF3348 hypothetical protein 5 3 Op 3 . - CDS 4116 - 4715 454 ## BF3349 hypothetical protein - Prom 4753 - 4812 5.7 + Prom 4659 - 4718 4.5 6 4 Tu 1 . 
+ CDS 4743 - 4892 83 ## + Prom 4896 - 4955 3.8 7 5 Tu 1 . + CDS 5011 - 6276 1044 ## COG0738 Fucose permease + Term 6382 - 6419 -1.0 8 6 Op 1 1/0.333 - CDS 6273 - 7001 405 ## COG0778 Nitroreductase 9 6 Op 2 . - CDS 7020 - 7718 715 ## COG0693 Putative intracellular protease/amidase - Prom 7742 - 7801 1.7 10 7 Op 1 . - CDS 7846 - 8685 758 ## COG4667 Predicted esterase of the alpha-beta hydrolase superfamily 11 7 Op 2 . - CDS 8753 - 10942 1630 ## COG0475 Kef-type K+ transport systems, membrane components - Prom 10970 - 11029 7.9 12 8 Tu 1 . - CDS 11647 - 12693 1212 ## BF3187 hypothetical protein - Prom 12843 - 12902 4.3 + Prom 12652 - 12711 5.7 13 9 Op 1 . + CDS 12891 - 14216 995 ## BF3357 hypothetical protein 14 9 Op 2 . + CDS 14213 - 15418 954 ## BF3189 hypothetical protein + Term 15592 - 15623 2.2 - Term 15150 - 15181 0.1 15 10 Op 1 . - CDS 15420 - 15878 623 ## COG3015 Uncharacterized lipoprotein NlpE involved in copper resistance 16 10 Op 2 9/0.000 - CDS 15891 - 17270 1399 ## COG1538 Outer membrane protein 17 10 Op 3 27/0.000 - CDS 17288 - 20413 3072 ## COG0841 Cation/multidrug efflux pump 18 10 Op 4 . - CDS 20413 - 21480 633 ## COG0845 Membrane-fusion protein - Prom 21542 - 21601 3.7 + Prom 21431 - 21490 6.3 19 11 Op 1 . + CDS 21651 - 22529 491 ## COG2207 AraC-type DNA-binding domain-containing proteins + Prom 22533 - 22592 3.5 20 11 Op 2 . + CDS 22619 - 23803 1387 ## BF3364 aminopeptidase C - Term 23868 - 23914 -0.8 21 12 Op 1 9/0.000 - CDS 23985 - 24722 716 ## COG3279 Response regulator of the LytR/AlgR family 22 12 Op 2 . - CDS 24719 - 25888 360 ## COG2972 Predicted signal transduction protein with a C-terminal ATPase domain - Prom 25916 - 25975 4.5 + Prom 25957 - 26016 3.0 23 13 Tu 1 . + CDS 26078 - 26758 504 ## BF3367 hypothetical protein 24 14 Tu 1 . - CDS 26923 - 27063 151 ## BF3368 hypothetical protein - Prom 27134 - 27193 7.0 + Prom 27516 - 27575 1.7 25 15 Op 1 . + CDS 27652 - 30807 3432 ## BF3199 hypothetical protein 26 15 Op 2 . 
+ CDS 30822 - 32513 1757 ## BF3370 hypothetical protein + Term 32529 - 32563 5.2 27 16 Tu 1 . + CDS 32590 - 34470 1739 ## COG3669 Alpha-L-fucosidase + Term 34483 - 34537 4.3 28 17 Op 1 29/0.000 + CDS 34870 - 35742 1072 ## COG2086 Electron transfer flavoprotein, beta subunit 29 17 Op 2 3/0.000 + CDS 35758 - 36777 1103 ## COG2025 Electron transfer flavoprotein, alpha subunit 30 17 Op 3 . + CDS 36781 - 38484 2118 ## COG1960 Acyl-CoA dehydrogenases + Term 38509 - 38542 4.5 + Prom 38520 - 38579 3.7 31 18 Tu 1 . + CDS 38717 - 42742 4220 ## COG3250 Beta-galactosidase/beta-glucuronidase + Term 42763 - 42829 18.3 - Term 42819 - 42851 0.3 32 19 Op 1 . - CDS 42887 - 43198 404 ## BF3206 hypothetical protein 33 19 Op 2 . - CDS 43214 - 44155 1123 ## COG2214 DnaJ-class molecular chaperone - Prom 44218 - 44277 4.4 - Term 44392 - 44423 -0.7 34 20 Tu 1 . - CDS 44457 - 46088 1291 ## COG0642 Signal transduction histidine kinase - Prom 46110 - 46169 4.2 35 21 Op 1 . - CDS 46387 - 48087 1528 ## COG2194 Predicted membrane-associated, metal-dependent hydrolase 36 21 Op 2 . - CDS 48074 - 49294 975 ## BF3380 hypothetical protein - Prom 49351 - 49410 2.2 + Prom 49957 - 50016 2.3 37 22 Tu 1 . + CDS 50037 - 52250 1698 ## BF3211 hypothetical protein - Term 52265 - 52328 1.5 38 23 Op 1 . - CDS 52370 - 53383 1056 ## COG2008 Threonine aldolase - Prom 53410 - 53469 2.7 39 23 Op 2 . - CDS 53473 - 53748 272 ## BF3385 hypothetical protein - Term 53750 - 53789 -0.9 40 23 Op 3 . - CDS 53844 - 54395 483 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 54526 - 54585 6.3 + Prom 54410 - 54469 2.8 41 24 Tu 1 . + CDS 54526 - 54954 528 ## BF3215 hypothetical protein 42 25 Tu 1 . - CDS 55077 - 55295 354 ## BF3388 hypothetical protein - Prom 55477 - 55536 5.1 + Prom 55285 - 55344 4.0 43 26 Tu 1 . 
+ CDS 55536 - 57275 1630 ## COG0028 Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] - Term 57271 - 57313 1.6 44 27 Tu 1 . - CDS 57496 - 58071 638 ## COG0655 Multimeric flavodoxin WrbA - Prom 58178 - 58237 6.6 45 28 Op 1 . - CDS 58269 - 59552 466 ## BF3220 hypothetical protein 46 28 Op 2 . - CDS 59578 - 60735 442 ## BF3221 hypothetical protein - Prom 60771 - 60830 4.5 - Term 60817 - 60858 8.8 47 29 Op 1 41/0.000 - CDS 60881 - 62518 1691 ## PROTEIN SUPPORTED gi|167855908|ref|ZP_02478658.1| 50S ribosomal protein L28 48 29 Op 2 . - CDS 62561 - 62833 309 ## COG0234 Co-chaperonin GroES (HSP10) - Prom 62857 - 62916 9.1 Predicted protein(s) >gi|226332023|gb|ACIB01000033.1| GENE 1 15 - 1703 1547 562 aa, chain - ## HITS:1 COG:no KEGG:BF3181 NR:ns ## KEGG: BF3181 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 561 1 510 512 135 33.0 5e-30 MKKKLMMVAVLLGALSLGACVDNDESASVEAVRNAKAEQLKGAAALANAQAEAETIRANA EAKLKEAEAAYQDAKTEADKAKWAAKLTVIQAEAARDIAQAQRDQKDAEMDIITNQDQWI NNTLWNNYSNASSLLINLNSELINATADEIALKAGVVSAQKAAEKIAVVKNQIIAQQTER KEQLSKLSSIASDRAALEKKFETLRVEQTELTAAKGKTADAAKVAQAKFDEAQAKICYDD YGNTDKLSSDLAKVGAELTYIEVEYNNQTYYFSVTTSNFVEVGENSNNAQSVRFYELGLK SETTSATLAFQNKIKYIKENIVGVPSDTDKGIQASGAYTNIEYWAAQKKQKEAEKAAATT DAEKAAIQAEIEALQPDIERADEALEAAKKELAEAEAEFKAFQDAVALFANADAKAAYDK DIADAKTLAEALVAASDADDAAQLPLSANQTALQDVQTLLNSTQNIDQMIADCDVQIAKA KEDIAKANSTNTIYVWSNQAYLLNGRWYPGYVVDQNNIGEDTAEALHAYMVDRIEALNAQ IESQGQIVEKYKKQLDDAIAAL >gi|226332023|gb|ACIB01000033.1| GENE 2 2214 - 2477 67 87 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKKGISFWNPITNGQHANSILKSFRYFRLRFCYDIRQQSFIVIRISSKYKMRELCYRKI HTFLSYPFPLGYKKALLRILKIQNDRL >gi|226332023|gb|ACIB01000033.1| GENE 3 2931 - 3059 57 42 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFADTLYGAQVSKDIWLEGNAGGSLQANVMQGCHGEGVLSNT >gi|226332023|gb|ACIB01000033.1| GENE 4 3064 - 4119 743 351 aa, chain - ## HITS:1 
COG:no KEGG:BF3348 NR:ns ## KEGG: BF3348 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 351 1 351 351 681 99.0 0 MSKWIDFSLKERKAMIQGVVEARQIDEVAAEKDWWVTTVLYALFHTSVSEYLLFKGGTSL SKGWDIINRFSEDIDLVLSRDYFLNVKKLSCANCTSNTQIHNLREKGQDFLFGEFKDELE AKLAELRLNVTVLSDNDILDENGEPRKVPHDKDPSVLYVQYSSIYNSQAAYAIPTVKIEI SILSMSEPYEMRRISSLIEQTYVGEDVDSDLVQTIRTVSPTRTFLEKAFLLNEEFQKEKP RTRRMTRHFYDLEKLMNTPYADLAINDAALYHEIVEHRRKFYHVGYVDYDKDLPDSITIL PKEVLLPQYETDYKEMQNSFIYGASLEFGELMERLKVLQARFRSIEFGDKD >gi|226332023|gb|ACIB01000033.1| GENE 5 4116 - 4715 454 199 aa, chain - ## HITS:1 COG:no KEGG:BF3349 NR:ns ## KEGG: BF3349 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 199 1 199 199 367 99.0 1e-100 MENTVIQQIRKRITRSKFGEIFFVSSFSQYDVEYVTKLLAQFEKEGLITRIAKGVYVKAR KTRFGTLYPSAYELVTEIAKRDKAKVIPTGATAANRLGFSTQVPMNTIFLTTGSGRKLKL GNRTVTLKHGAPKNFAFRGRLMSELVQALRSIGENNITPKDETRIGQLFTETPEADTIEY DLLLAPVWMRQVIKKGLKR >gi|226332023|gb|ACIB01000033.1| GENE 6 4743 - 4892 83 49 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MIANISDYKATAFISIDNGLDHKILFFDRNREVLLSFKSYFHEIVYVLY >gi|226332023|gb|ACIB01000033.1| GENE 7 5011 - 6276 1044 421 aa, chain + ## HITS:1 COG:BMEII1053 KEGG:ns NR:ns ## COG: BMEII1053 COG0738 # Protein_GI_number: 17989398 # Func_class: G Carbohydrate transport and metabolism # Function: Fucose permease # Organism: Brucella melitensis # 20 419 28 411 412 150 29.0 6e-36 MNQKRNQTKVVNVFTVFMVMLILYFIVGLFTVINQQFQIPLQTAMLPHDGNITNALVTML NFSWFLAYPLSEGFGTRWLEKYGYRKTSYLALLILIAGLAIYEAAVLFHIYTPMQVSIIG NHISVGFFIFLIGSFVIGVAATILQVVLNLYLTVCRIGKTTALQRQMIGGTSNSIGMAIA PLVISYLIFHGTPLHDIVTKQFIIPLIILILIMLIITLLVGKTQMPSIDNVRQAPGENLD KSVWSFRNLKLGVWGIFFYVGIEVAVGANVNMYASELGGSFASNATHMAALYWGLLLLGR FLGSFIKQVPSEKQLVIASIGAIVLLVLAMLTANPWILTLIGFFHSIMWPAIFTLATDQL GKYTTKASGVLTMGVIGGGIIPLLQGIFADVMGGNWLWTWLLVIAGEAYILYYGLNGYKQ H >gi|226332023|gb|ACIB01000033.1| GENE 8 6273 - 7001 405 242 aa, chain - ## HITS:1 COG:PM0161 KEGG:ns NR:ns ## COG: PM0161 
COG0778 # Protein_GI_number: 15602026 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Pasteurella multocida # 1 241 1 241 242 328 62.0 4e-90 MNLEEVLNYRRSVRVFDKTKPLDPEKVKHCLELATLAPNSSNMQLWEFYQVIQPELLAKI SKACLDQTATSTASEVVVFVTRQDLYRSRAKFVLDFERGNVRRNSPKERQEKRIKDRELY YGKLMPFLYARFFRILGLLRSVLAKAIGLFRPIVREVSESDMRVVVHKSCALAAQTFMIA MANEGYDTCPLEGFDSKQMKKLLKLPHGAEVNMVIACGIRDGNKGIWGERGRVPFDEVYH RV >gi|226332023|gb|ACIB01000033.1| GENE 9 7020 - 7718 715 232 aa, chain - ## HITS:1 COG:PA2719 KEGG:ns NR:ns ## COG: PA2719 COG0693 # Protein_GI_number: 15597915 # Func_class: R General function prediction only # Function: Putative intracellular protease/amidase # Organism: Pseudomonas aeruginosa # 5 224 4 223 228 195 43.0 7e-50 MDALKMLIVVTGTDMYADGNLQTGLWLSELTHIYHCAEEAGYEITVASPKGGNVPVDPES LKPMMLDKLSKDYWDDLEFRRELQHAKSLAEVSGQLFDCVYLAGGHGAMYDFPDDTVLQA IIEKHYESDKAVAAICHGVSGLLNVKLSGGEYLIKDKKITGFSWFEESLAGRKKEVPFDL EAALEKKGADYEKALIPMTSKVVVDCNLITGQNPFSSKEMAEVVMRQLSREK >gi|226332023|gb|ACIB01000033.1| GENE 10 7846 - 8685 758 279 aa, chain - ## HITS:1 COG:CAC2424 KEGG:ns NR:ns ## COG: CAC2424 COG4667 # Protein_GI_number: 15895690 # Func_class: R General function prediction only # Function: Predicted esterase of the alpha-beta hydrolase superfamily # Organism: Clostridium acetobutylicum # 5 279 2 276 283 231 42.0 1e-60 MTIDNQTGLVLEGGGMRGVFTCGVLDYLMDHDIRFPYTIGVSAGACNGLSYMSRQRGRAK YSNIDLLEKYHYIGLKYLLKKRNILDFDLLFTEFPEHILPYDYQAYFDSPERYVMVTTNC LTGEADYFEEKKDKNRVIDIVRASSSLPFVCPIAYVDGIPMLDGGIVDSIPLQRAIHDGY RNNVVVLTRNRGYRKENKDIRIPPFVYRKYPKMREALSRRCAAYNEQLEMVERMEEEGDI LVIRPQKPVVVDRIERDIQKLTDLYEEGYECAKRQLEVL >gi|226332023|gb|ACIB01000033.1| GENE 11 8753 - 10942 1630 729 aa, chain - ## HITS:1 COG:SPAC105.01c KEGG:ns NR:ns ## COG: SPAC105.01c COG0475 # Protein_GI_number: 19114377 # Func_class: P Inorganic ion transport and metabolism # Function: Kef-type K+ transport systems, membrane components # Organism: Schizosaccharomyces pombe # 62 468 29 433 898 279 36.0 2e-74 
MQRAKKNYLIYAVMLLLFGALIYMAIEEGDRFSHHAVASSTVAEDTPFTMFCQFVTDNLH HPLSILLIQIIAVLLMVRLFGFLFKHIGQPGVIGEIVAGIVLGPSVLGYFFPDVFQALFP PESLTNLELLSQVGLVLFMFVIGMELDFSVLKNKINETLVISHAGILVPFFLGIVASYWI YEEYAAAQTAFLPFALFIGISMSITAFPVLARIIQERNMTKTSLGTLAIASAANDDVTAW CLLAVVIAIAKAGTFASALYAIGLTALYIIIMFMVVRPFLKKVGEVYANQEVINKTFVAL ILLILIISSTLTEIIGIHALFGAFMAGVVMPPSIGFRKVMMEKVEDIALVFFLPLFFAFT GLRTEIGLINSPALWGVCLLLITVAVAGKLGGCAVAARLVGESWKDSFTIGTLMNTRGLM ELVALNIGYEMGVLPPSIFVILVIMALVTTFMTTPLLHLVERVFARREERLSAKLKLVFC FGRPESGRSLLSIFFLLFGKKMKAAQVVAAHFTVGTDLNPLNAEQYARDSFSLVDEKASE LGLSVENRYRVTDKLVQDMIRLARKERPDMFLLGAGSKYRPDTAGSNGVLWLSLFRDKID DVMEQVKCPVAVFVNRGYSGSSPVSFVLGGVIDAFLLTYLESMLEGGAQVHLFLFDTDDE AFRQSTDPILAKYSSQIRTQPFSGAANLTSAAKDGLLVMSHLSYTKLSEEEEVFRDLPSL LVIRRPKKG >gi|226332023|gb|ACIB01000033.1| GENE 12 11647 - 12693 1212 348 aa, chain - ## HITS:1 COG:no KEGG:BF3187 NR:ns ## KEGG: BF3187 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 348 1 350 350 613 97.0 1e-174 MTGIEKTVTNRSGAGVLVLCTILFCLCACTEDASYTAGVWYRRSDFDGVARTDAAGFTIG NKGYICGGYNGKTTRLADTWEYDIDNDWWTQRADMPGTVRNAATGFPVGNKGYITTGYNP DQKYLADTWEYDPETNTWRQMDDFKGGARYYALGFGIDNYGYVGTGYNDNYLKDFYRFDP TAAAGSQWTIVNGFGGQKRQGATAFVINGKAYVCGGQNNNSDVSDFWRFDPSAATPWTQL RDIANTSDDDYDYTSIVRSYGVSFVIDGKAYLTLGSTAGGSYYSNYWIYDPETDLWEGDD LTAFEGSTRIHAVCFSTGTRGIIATGGSGSSSYFDDTWELKPYEYEEE >gi|226332023|gb|ACIB01000033.1| GENE 13 12891 - 14216 995 441 aa, chain + ## HITS:1 COG:no KEGG:BF3357 NR:ns ## KEGG: BF3357 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 441 5 445 445 862 99.0 0 MKRKNLSLFFITLLISLLASCRDELSTAGGKWVESSLRTIQTDTCTVRLSTILSDSLATS GDTVCQIGTIDDPVWGKIKAAFYAEYDVPTVSFSENADYRFDSITIRFYSSGNYLGDTLS PQRISLHSLSENLSLDEGYLYTTSKVSYHSTPLASFTFTPTPGETIREHEIRLPDEWGVE WFEHFQAGSREMESQEYFRDYFKGIAFIPEEGGNCVNGFMVNDSSLCITLYYHQTETDAT ELSADFLPNSDLRFNQVSCDRSRTALSSLQSGLNNGLPSEKSEHQSYLQGLTGMYINIDF PFLNDLRAEGRLVTIESALLRLYPVKGTYGEQYPLPESLTLYTADENNVTEDVVTDISGS 
SVQTGSLVTDEMMGEDTYYSFDITSFLQSNLGTVGYNRKILQLMLPDNLFFTTLNGVVFG DTGHPDSNPVKLTLLYKTYNP >gi|226332023|gb|ACIB01000033.1| GENE 14 14213 - 15418 954 401 aa, chain + ## HITS:1 COG:no KEGG:BF3189 NR:ns ## KEGG: BF3189 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 401 1 401 401 724 99.0 0 MKRNLILLLTLCHSTLLIYSQNTTNSPTSMFGLGELSTGEGGQYSGLGGAGIALQSYNFL NTANPASLTAIEGQRFLIDAGVMGAYKVYTQTGTSNHSLVGNLNNLSIGCRITPRWYGAV FMAPVSSVGYAITLDQDITGTGSSTVSSLFEGEGGLSKMGISTAYRLFKGFSVGANLSYV TGTIKQTETQGSISVEESSYKHAFYADFGLQYKFSLSRNKYLVAGAVYGYSQDLAQDNTL SVSSTSGNESIDESQRHVRQCLPQFVGAGLAYNSPRWTLTAEYKYTDWSRMKSSQSNVRF ENQHRLSAGTAYTAGNIYRNPVKLLLGAGVSNSYIVIQKKKATNYYVSAGSNFTLYNGNV LSLGVKYSDQLHLPNGMQRERGVTLFFNFTFSERTYRAKIQ >gi|226332023|gb|ACIB01000033.1| GENE 15 15420 - 15878 623 152 aa, chain - ## HITS:1 COG:VC1962 KEGG:ns NR:ns ## COG: VC1962 COG3015 # Protein_GI_number: 15641964 # Func_class: M Cell wall/membrane/envelope biogenesis; P Inorganic ion transport and metabolism # Function: Uncharacterized lipoprotein NlpE involved in copper resistance # Organism: Vibrio cholerae # 1 147 2 163 163 80 34.0 9e-16 MKKNLYWMAAAFITLTAVGCTNAKKANVSAAGSDTTQVVDMHTAETSLDYYGVYKGTVPA ADCPGIELTLTLKKDRTYTYHWAYIDRKDADFDETGTFTVKDNLLTLTEKGGEVSYFKVQ EGSLVMLNNEKQPATGALADAYVLKQEEVFLD >gi|226332023|gb|ACIB01000033.1| GENE 16 15891 - 17270 1399 459 aa, chain - ## HITS:1 COG:VC1606 KEGG:ns NR:ns ## COG: VC1606 COG1538 # Protein_GI_number: 15641614 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Vibrio cholerae # 4 422 8 422 476 68 22.0 3e-11 MKKIIILSLCFMLSDFCAPAFAQETLSLQECREMALKYNKEMAASIKQTESTHYTAKSYK GNFFPNFTASGTGLYSNADGSYGIAGGNLPTFLPDATGQLIPNGGFAYFPGINLDYKIGW VYMGGIQLEQPLYMGGKITAAYKMSVLGKQMAQMNETLTATEVILKTDQAYALVVKAKEM KVVADAYHTVLTELKKNVESAYKHGLKPQNDVLKVQVRLNESELAVRKAENAFRLATMNL CHLIGKPLTADIHVSGNFPEIEQGLEIQVLDITARPEYTILDKQVAIAKQQVKLNRSELL 
PKIGIKGSYDYVHGLELNDKNFLDNASFSVLLNVSIPLFHFGERSNKVRAAKAKLEQTRL QQQSLNELMLLELTRAANNLDEAKLESELADRSLQQAEENRRVSKSQYEVGLETLSDHLE GQALWQQAYETKVNAHFQLYLNYVAYLKAAGILYNKINL >gi|226332023|gb|ACIB01000033.1| GENE 17 17288 - 20413 3072 1041 aa, chain - ## HITS:1 COG:VC1757 KEGG:ns NR:ns ## COG: VC1757 COG0841 # Protein_GI_number: 15641761 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Vibrio cholerae # 1 1039 1 1012 1016 632 35.0 1e-180 MDISKWAFHNRNLIYFLIAVLMFGGAYSCYQMSKLEDPEIKVKLAMVVTTYPGASAHQVE LEVTDVLEKNIRTMGNIDNIESYSYNDLSLIQIELLSTVPDDDVEQCWDMLRRKVNDARA SLPEGVSAPIVKDDFGNVYGMFYALTGDGLSDRELSDYAELIKREVGELEGVDRIDLYGK RPECINISLLQDRMANLGVKPAEVLATLNGQNKTTYTGYYDNGDNRIRVTVNDKFKTVED IGKMLIQGHDDDQLRLSDIAQIEKGYEEPVRNEMYYDGERALGILIAATSGSDIVKVGHA VEARLAELKAERLPAGVEYQKVFYQPERVGESLGTFVINLIESVIIVVLILMIAMGFKSG VIIGISLVVTVFGSFLFLYSAGGTMQRVSLAAFVLAMGMLVDNAIVIIDGILVDLKAGKD RMEAMTAIGRQTAMPLLGATLIAIIAFLPIYMSPDTAGVYTRDLFIVLAVSLLLSWVLAL VHVPLMANRRLHFAVEADSGGKRVYKGKIYAALRTALRFGLAHRWSFVFTMVGLLALSVF GYQYMRQGFFPDMVYDQLYMEYKLPEGNNYTRVEQDLKEIEAYLKGRKEITHVTASIGGT PGRYNLVRSVANPSLSYGELIIDFTSPEELVDNMDEIQRYLSATYPDAYIKLKRYNLMFK KYPIEAQFLGPDPAVLHQLADSARNIMKNTPEVCLITTDWEPQIPVLTIEYDQPAARALG LSRSDVSMSLLTAAGGIPIGSFYEGIHKNNIYLKCLDKEGQPIEDLGNTQVFSALPSLSG LLNEETMVKLKTGTLSKEDLVESMMGSTPLQQISKGIDIRWEDPVVPRYNGQRSQRVQCS PAPGIETEKARLTIADKIEQIRLPDGYSLVWQGEKIASDQSMKYLFKNFPLAIILMIAIL IMLFKDYRKPIIIFCCLPMIFVGVVGVMLLTGKVFNFVAIVGTLGLIGMLIKNGIVLMDE ITLQINAGIEPVTALIDSSQSRLRPVMMASLTTILGMIPLLSDAMFGSLAAAIMGGLLCS TLITLLFIPILYALFFKIRND >gi|226332023|gb|ACIB01000033.1| GENE 18 20413 - 21480 633 355 aa, chain - ## HITS:1 COG:VC1674 KEGG:ns NR:ns ## COG: VC1674 COG0845 # Protein_GI_number: 15641678 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 13 348 23 368 369 122 27.0 1e-27 MKQIYLILTGIFLMLFISCARHTKENKDCQTVKIDTVIAADKQKFLQFPGRVKAAQDISL SFRVSGTINKIYVKDGAQVRAGQLLAELDPTDYQVQLDATEAEYRQVKAEAERVMALYKE 
NGTTPNANDKAVYGLKQITAKYKHHQDQLAYTRLYAPFSGYVQKRLFEAHETIGAGMPVI SMVSAGAPEVEINLPAAEYIRRNRFNRYHCTFDIYPGETYPLQLISVTPKANANQLYTMR LQLIPGKQAVPSPGMNAMVTIFCDTDRSGTLSVPTSAILQKDGKSYVFIYNASDHTVHNC EVSVLRLTNDGYSLISSDGLQPGDKIVSSGVHHIENGETVKTLPEITHTNIGGLL >gi|226332023|gb|ACIB01000033.1| GENE 19 21651 - 22529 491 292 aa, chain + ## HITS:1 COG:lin0157 KEGG:ns NR:ns ## COG: lin0157 COG2207 # Protein_GI_number: 16799234 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Listeria innocua # 178 280 169 271 277 62 25.0 7e-10 MIEQQMPFRRILLTSDTFQILKEGQIISTFNKCGIFYCQRGSVEVSLEGCHYHIKPGDVY IYMASTLVHLLHKSEDAEGIMVEVDFYYILPIVNKVINVESQLFMRKNPCVSLSGEQCAH FEYLLNNLWDRINAEDCQKENVQYQHLKLELIKSMGQTICYEILNMYFTNQPLQPLQQGK KDVVFQNFMLSLFRFYRKERDVSFYARMQHITPRYFSAIIKEKTGDSALQWIVRMVITEA KQLLEESDLSIKEIADQLNFPTQSFFGKYFKQYVGVSPKEYRNNAATTRIKR >gi|226332023|gb|ACIB01000033.1| GENE 20 22619 - 23803 1387 394 aa, chain + ## HITS:1 COG:no KEGG:BF3364 NR:ns ## KEGG: BF3364 # Name: not_defined # Def: aminopeptidase C # Organism: B.fragilis # Pathway: not_defined # 1 394 1 394 394 810 100.0 0 MKKTILLAALGLISLSALAQDKPQEEGFVFTTVKENPITSIKNQNRSSTCWSFSSLGFLE SELLRTGKGEYDLSEMFVVHHTMVDRAVNYVRYHGDSSFSPGGSFYDIMFCMKNYGLVPQ DAMPGIMYGDSLPVHNELDATAGAYVNAIAKGNLKKLTPVWKKGLCAIYDTYLGQCPEKF TYKGKEYTPMTFAQSLGLNPDDYVSLTSYTHHPFYSQFAIEIQDNWRNGLSYNLPLDEFM AVMDNAVKNGYTFAWGSDVSEEGFTRDGIAVVPDAAKGAELTGSDMARWTGMTAADKRKE LTSKPLPEMKITQEMRQTAFDNWETTDDHGMIIYGIAKDQNGKEYFMVKNSWGTNNKYKG TWYASKAFVAYKTMNILVHKDALPKDIAKKLGIK >gi|226332023|gb|ACIB01000033.1| GENE 21 23985 - 24722 716 245 aa, chain - ## HITS:1 COG:VCA0850 KEGG:ns NR:ns ## COG: VCA0850 COG3279 # Protein_GI_number: 15601605 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Vibrio cholerae # 5 213 9 233 261 107 29.0 3e-23 MIKCIAVDDEPLALEQLTGYIARVPFLQLIASCQDAFSAMQVLSEEEVDLMFVDIHMPDL NGLDLVRSLVVKPLIVFTTAYPEYAVEGFKVDAVDYLLKPFEFQDLLKAADKARRQFEYH 
LQDNGGGTETDLLEKDGSLFVKSEYKIIRINVADICYIEGMSEYVRIYTDAADKPVVTLL SMRKLEERLPQEMFMRVHRSYIVNLRKITEVSRLRIIFNKNIYIPVGDNYKERFTEYINK ICVSS >gi|226332023|gb|ACIB01000033.1| GENE 22 24719 - 25888 360 389 aa, chain - ## HITS:1 COG:BH2727 KEGG:ns NR:ns ## COG: BH2727 COG2972 # Protein_GI_number: 15615290 # Func_class: T Signal transduction mechanisms # Function: Predicted signal transduction protein with a C-terminal ATPase domain # Organism: Bacillus halodurans # 195 389 378 586 597 112 31.0 1e-24 MKQINEQKLLEQMVYLVIWLTVISVPLVGDYLFASISPVHTFSWQTIRMAWLLTLPFILL FVVNNYFLAPRLLLRKRYWAYALSLAGVVTLLFILYPSINPPQHKQFQNLMPMQPRRYPE GKVLPDRDREFPNTLPEHSSPLLLPKQELDMRWRGPHPLPRYFLGRLSLALLVVGINVAI KLLFKSMRDEEALKELEHQHLQSELQYLKYQINPHFFMNTLNNIHALVDMDAGKAKRTIV ELSKLMRYVLYEASNRTILLSREIQFLDNYIALMKLRYTGRVRIECCMPDEVPEVQIPPL LFISFVENAFKHGVSYQEESFIRVFMSVEDGRLAFRCSNSNHGRSVEQHHGIGLENIRKR LRLLFGQDYTLSINERDDSFNVLLIIPLL >gi|226332023|gb|ACIB01000033.1| GENE 23 26078 - 26758 504 226 aa, chain + ## HITS:1 COG:no KEGG:BF3367 NR:ns ## KEGG: BF3367 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 226 1 226 226 439 99.0 1e-122 MKKIWILAVLTICSVATQAQEVFINADLVSSYIWRGMKNGNASVQPTLGVEWKGWTLSAW GSTEFRNENNEIDLTLEYEYKNLQLCLNNYFYQSEDAPFKYFHYTPRTTGHTFEAGAVYT VSERFPLSIGWYTTFAGNDYRENEERAWSSYCEFSYPFTVKGVDLAVEAGFTPWEGEYAD KLNVVNVGLSATKTLNISSGFTPAIFGKLIANPYENRFYFVFGISL >gi|226332023|gb|ACIB01000033.1| GENE 24 26923 - 27063 151 46 aa, chain - ## HITS:1 COG:no KEGG:BF3368 NR:ns ## KEGG: BF3368 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 46 13 58 58 70 97.0 2e-11 MIYGFKSNVNQGLLRKRDVIGLLKNEEIISNKIFNIFINEELDLLY >gi|226332023|gb|ACIB01000033.1| GENE 25 27652 - 30807 3432 1051 aa, chain + ## HITS:1 COG:no KEGG:BF3199 NR:ns ## KEGG: BF3199 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1051 1 1051 1051 2059 99.0 0 MLKITRQVTLLLLAGALSFPAYSYATQATEVLVPEVTQEKVTGTVEDALGPVIGASVMVK 
GTTNGVITDLEGKFSLNDVKKGDIIVISYIGYVTQEIPYTGKPIQVKLAEDSKALEEVVV VGYATVKKANLTGAVSAVDGKVLEDRPIVNLGQGLQGAIPNLNVTTSGRPGQGSSFNIRG TTAMSGSSPLVLVDGVEMDPNLINPQDVKSVSVLKDAASASIYGARAAYGVVLITTKGGR KDQPTQVSFDASVSFNGPTTRPTYMNSMQYATWMNTAQQNTVGRDYFDAEWMQHIEAYYK DPVNNSPVFIHSDPSISKNGTKYTYAGNTNWMKELYKKNYPVQKYNVNISGGGKKATYYT SLGYTDQGSLIRFGNEQFKKFNVMNNINYDVNDWLHLSMKTSFNRTKLRGLNQDNVHGDN FMGGDTRPIMPVKHPDGNWAGQGDFTNFPAILEDGGSRLTNKNDLWNTITMKLTPIKGMS INMDYTFNYYSENNKVHMKSFDEYGANGQFLQTFAWTNPNSVSQSQANDTYNAFNFFGDY EKTLGKHYLKGMIGYNQESKHTTGFNAGREQLISNDLGSLSYATGDRWVGSSDNSWATRS GFFRINYGYDERYLLEVNGRYDLSSKFPKHDRAVFNPSFSAAWRLSNESWFKSWTNSFFD ELKIRGSYGSLGNQALNNGWYAYLSNYSTGQISWIMGSNQPQYVVPGGLVSSSITWETVT QWDLGLDFNFLNSRLKGAFDYYQRRTSDILAAGKILPGVLGANEPQENAAESLTKGWEFE ISWNDQLANGFHYTVGFNLSDYQSEVTKFDNESKELGNWYVGQKQGEIWGYETYGLFQSE QEIAGAANQDKVSGGIKLMPGDIRFVDRNNDGVIDWGDNTVDNPGDKKIIGNSTPRYHYG INLGADWKGFDLGIFFQGVGKRDLYLPGTSFRSHYGSEWQVPSAYNNDYWTEENTGAYFP RARFNGGSAINQAQTRYMVDASYCRLKSVSIGYTLPKVLTQKASIEKIRIYFTGENLFTI SDTPDGLDPELDNPYTYPMQRSLSVGLSLTF >gi|226332023|gb|ACIB01000033.1| GENE 26 30822 - 32513 1757 563 aa, chain + ## HITS:1 COG:no KEGG:BF3370 NR:ns ## KEGG: BF3370 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 563 1 563 563 1148 100.0 0 MKIKYLFYIGVATLALSGCNDGFLERAPEAINDKTFWNTTGDLETYANQFYSYLPGGVTS IADGESDNQVPNSIPQFFWNQLSTPAEAASWCNWSKGGWQPIRLVNYFLTHYQTVSGKES EINQYVAEVRFFKAMQYAGLMRTFGDIPWLDKDLGTGDTDILYGPKLKRYEVMDKIIEEF DFAIQWLPEKPATGRIGKDVARQLKARTCLHEGTYYKYHTELGWADKADRLLKMAADETD AIMATGKYEIYNTGHPEKDYYDVFVMEDKTNLKEAILPVTYLDGKRKHGMSRTLAEANTG FSKDFVESYLCLNGKPITGNDQYKGDTNMKDETTDRDPRLKQTILTWDFPTRVTVATNDS TYIEKEEDFISQYCLTGYKSIKYFIPTDKAFEANNNTYDGIAYRYAETLLINAEAKAELG TITQADLNKTINVLRDRAGMPHLTLEVGFTDPNWPAWGYSLTPLLQEIRRERRIELAGEG FRWDDLARWKAGAICNNVKTYIGKREPYKEGQYAIVYPAYTNDNYSYEAGKSRTWNDRLY LRPIPTGELQRNDNLLPQNPGWE >gi|226332023|gb|ACIB01000033.1| GENE 27 32590 - 34470 1739 626 aa, chain + ## HITS:1 COG:SP2146 KEGG:ns NR:ns ## COG: SP2146 COG3669 # Protein_GI_number: 15901959 # 
Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Streptococcus pneumoniae TIGR4 # 31 482 9 448 559 275 34.0 3e-73 MKINKLLYAGAALTLLGACAPAVKAPEAILPVPEEKQVDWQKMETYAFVHFGLNTFNDRE WGYGDSEPKTFNPTKLDCEQWVKTFVESGMKGVILTAKHHDGFCLWPTQLTEYCIRNTPY KDGKGDIVGELAAACKKYGIKFAVYLSPWDRHQANYGTPEYVDYFHKQLTELMTNYGEVF EVWFDGANGGDGWYGGAKDSRTIDRKTYYNYPRIYEILDKLQPQAIVFSDGGPGCRWVGN ENGFAGATNWSFLRAGEVYPGYPKYRELQYGHADGNQWVPAECDVSIRPGWFYHPEEDDR VKTVEQLTDLYYRSVGHNATLLLNFPVDRDGLIHPIDSANAVNFHKNVQKQLAHNLLAGI RPKASDERGGQFSAKAATDESWDTYWATNDGVTAADIEFDFPKTEKVNRMMIQEYIPLGQ RVKSFIVEYDKDGKWLPVKLNEETTTVGYKRLLRFETVSTDKLRIRFTDARACLCINNIE AYYAGETADTFTVEAKELKSYPFTLVGVPEEETKKCMDKDKNTTAFVEGDALVIDLGEER TITSFHYLPDQSEYNKGLISSYELSVGTEANAVNRIVAQGEFSNIKNNPILQSVYFTPVK ARYLSLKPTKMVTEGETMGFAEIGIQ >gi|226332023|gb|ACIB01000033.1| GENE 28 34870 - 35742 1072 290 aa, chain + ## HITS:1 COG:mll5862 KEGG:ns NR:ns ## COG: mll5862 COG2086 # Protein_GI_number: 13474882 # Func_class: C Energy production and conversion # Function: Electron transfer flavoprotein, beta subunit # Organism: Mesorhizobium loti # 3 284 1 263 283 181 41.0 1e-45 MSLKIVVLAKQVPDTRNVGKDAMKADGTINRAALPAIFNPEDLNALEQALRLKDAHPGST VTILTMGPGRAADIIREGLFRGADNGYLLTDRAFAGADTLATSYALATAIKKIGEYDIII GGRQAIDGDTAQVGPQVAEKLGLTQITYAEEILKVGDGSITVKRHIDGGVETVEGPLPIV ITVNGSAAPCRPRNAKLVQKYKHAKTITEKQQGNLDYTDLYDTRDYLNLVEWSVADVNGD LKQCGLSGSPTKVKAIQNIVFQAKESKTISGSDREVEELIVELLENHTIG >gi|226332023|gb|ACIB01000033.1| GENE 29 35758 - 36777 1103 339 aa, chain + ## HITS:1 COG:CAC2709 KEGG:ns NR:ns ## COG: CAC2709 COG2025 # Protein_GI_number: 15895966 # Func_class: C Energy production and conversion # Function: Electron transfer flavoprotein, alpha subunit # Organism: Clostridium acetobutylicum # 4 335 9 332 336 266 45.0 6e-71 MNNLFVYCEIEDGIVADVSLELLTKGRSLANQLGCQLEAVVAGTGLKDIEKQILPYGVDK LHVFDGEGLYPYTSLPHTSILVNLFKEEQPQICLMGATVIGRDLGPRVSSALTSGLTADC TSLEIGDHEDKKEGKVYKNLLYQIRPAFGGNIVATIVNPEHRPQMATVREGVMKKAILAA 
DYKGEVIHHDVKKYVADTDYVVKVIERHVEKAKNNLKGSPIIIAGGYGVGSKENFNLLFD LAKVLNAEVGASRAAVDAGFVEHDRQIGQTGVTVRPKLYIACGISGQIQHIAGMQESGII ISINNDPSAPINTIADYVINGTIEEVVPKMIKYYKQNSK >gi|226332023|gb|ACIB01000033.1| GENE 30 36781 - 38484 2118 567 aa, chain + ## HITS:1 COG:CC3393 KEGG:ns NR:ns ## COG: CC3393 COG1960 # Protein_GI_number: 16127623 # Func_class: I Lipid transport and metabolism # Function: Acyl-CoA dehydrogenases # Organism: Caulobacter vibrioides # 45 445 47 459 603 187 33.0 5e-47 MANFYLDTPELKHHLNHPLMKRIVELKERNYADKDKFDYAPVDFEDAMDSYDKVLEIVGE ICGDIIAPNAEGVDHEGPVCADNRVTYASGTTRNLDACRKAGLMGMAMPRRFGGLNFPIT PYIMAADIVSRSDAGFENLWGLQDCAETIYEFANEEQKQRYITRVCQGETMSMDLTEPDA GSDLQSVMLKATYSEKDQCWYLNGVKRFITNGDADIHLVLARSEEGTHDGRGLSMFIYDK RNGGVNVRRIENKMGIKGSPTCELVYKNAKAELCGDRKLGLIKYVMALMNGARLGIAAQS VGLSQAAYNEALAYAKDRKQFGKAIIEFPAVAEILSLMKAKLDASRSLLYETARFVDVYK ALDDIAKERKLTPEERAEQKTFAKLADAFTPLGKGMGSEFANQNAYDCIQIHGGSGFMKD YACERIYRDSRITSIYEGTTQLQVVAAIRYVTTGAYLARIQEYENMPVAPELEGLQNRLK SMASKYAACVTQITEAKDQELLDFCARRLVEMAAHIIMGHLMVQDASKSDLFSESAQVYV RYAEAEVEKHINFIRKFDKDDLAYYRK >gi|226332023|gb|ACIB01000033.1| GENE 31 38717 - 42742 4220 1341 aa, chain + ## HITS:1 COG:TM1193 KEGG:ns NR:ns ## COG: TM1193 COG3250 # Protein_GI_number: 15643949 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Thermotoga maritima # 39 1117 7 984 1087 584 34.0 1e-166 MRKITLGLILCSMVTLCFAGQRPLEGFKYASEKAPVGNEWESPENIALNKEQPRAWFFSF QDVESARKVLPENSKYWLSLNGDWKFNWAPDPDSRPKDFYQTTFDVSGWDNIPVPSSWNI YGIQKDGSLKYGVPIYVNQAVIFMHKVKVDDWRGGVMRTPPTNWTTYKYRNEVGSYRRDF DIPQDWDGREVFINFDGVDSFFYLWINGQYVGFSKNSRNTASFNITPYLQKGKNTVAAEV YRSSDGSFLEAQDMFRLPGIFRTVALYSTPKVQVRDLVVIPDLDETYTNGSLAISADIRN FGKKAAKGYQMAYTLYANKLYSDENTPVANAVASATVNLVNPNETVEAEKAIMNVQSPNK WSAEFPHLYTLVAELKDKKGKTIETVSTTVGFRKVEIKDTPASADEFGLAGRYYYVNGKT VKLKGVNRHESNPAVGHAITREMMEKEVMLMKRANINHVRNSHYPDDPYWYYLCNKYGLY LEDEANIESHEYYYGAASLSHPVEWKNAHVARVMEMVHANVNNPSIVIWSLGNEAGPGKN 
FVAAYDALKAFDLSRPVQYERNNDIVDMGSNQYPSIGWMRGAVKGNYDIKYPFHVSEYAH SMGNACGNLVDYWEAIESTNFFCGGAIWDWVDQSMYNYDKKTGERYLAYGGDFGDTPNDG QFVMNGIVFGDLEPKPQYYEVKKVYQHIDVKAIDVEKGRFEVFNKYYFKNLSDYDVKWSL YENGKEAQSGLLSIGEVAPRTRTQITVPYQFSKLKADSEYFVKIQFLLKDNMPWADKNFA QAEEQILVKEATARPSIATVAAEGDKPEVMMTKASDIITIKGNGYTAQFDIKTGTIYSLT YGNEKVITDGNGPKLDALRAFTNNDNWFYSQWFDNGLHNLKHSATGFNMTTKEDGTVVLS FTVQSQAPNAAKILGGTSSGKNKIEELTDKKFGSSDFKFTTNQVWTVYKDGSIELEASIT SNQPSLVLPRLGYMVRVPQQYANFTYYGRGPIDNYADRKVGQFIEQHKNTVAGEFVNFPK PQDMGNHEDVRWCALTNNAGNGAVFIATDRLSASALPYSALDLILASHPYQLPKAGDTYL HLDAAVTGLGGNSCGQGGPLEQDRVFASHHNTGFIIRPAGKDLTVTANVAPAGEMPLSIT RNRAGVVSVSSQKKDAVILYTVDKSKSKTYTEPIALRNGGTVTAWFKDAPFIKASMTFDK IESIQTEVIYASSEESDGGEAKNLTDGDPNTIWHTMFSVTVAKHPHWVDLDAGEVKTIKG FTYLPRQDSSNGNVKDYTIHVSMDGKEWGEPILKGTFARDLKEKKVMFDKPVKARYIRFT ALSEQRGQDYASGAELTILAE >gi|226332023|gb|ACIB01000033.1| GENE 32 42887 - 43198 404 103 aa, chain - ## HITS:1 COG:no KEGG:BF3206 NR:ns ## KEGG: BF3206 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 103 1 103 103 185 100.0 4e-46 MQTELIIVSEYCHKCHIEPSFIDLLEEGGLIEVRTEGGEHYLLASQLPDVERYSRMYYDL SINMEGIDAIHHLLERMEIMRREISSLRNQLIVFKREGIMEDW >gi|226332023|gb|ACIB01000033.1| GENE 33 43214 - 44155 1123 313 aa, chain - ## HITS:1 COG:all1488 KEGG:ns NR:ns ## COG: all1488 COG2214 # Protein_GI_number: 17228981 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: DnaJ-class molecular chaperone # Organism: Nostoc sp. 
PCC 7120 # 3 309 8 299 315 173 35.0 4e-43 MAYIDYYKILGVDKNASQDDIKKAFRKLARKYHPDLNPNDPSAKDKFQEINEANEVLSDP EKRKKYDEYGEHWKHADEFEAQKKARQHTGGGGGGFSGFGGDGGSYWYSSDGEGFSGGDA GGFSDFFESMFGHRGGGGRGNAGFRGQDFNAELHLSLRDAARTHKQVLNVNGKQVRITIP AGVADGQVIKLKGYGGEGINGGPAGDLYITFKIAEDSVFKRLGDDLYVDVEMDLYTAVLG GEKVIDTLEGKVKLKIKPETQNGTKVRLKGKGFPVYKKEGQFGDLIITYSVKIPTNLTDR QKELFRELQQSMN >gi|226332023|gb|ACIB01000033.1| GENE 34 44457 - 46088 1291 543 aa, chain - ## HITS:1 COG:slr2098_3 KEGG:ns NR:ns ## COG: slr2098_3 COG0642 # Protein_GI_number: 16330584 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Synechocystis # 290 530 22 270 280 185 40.0 2e-46 MTTQLLLVTGVTDCLWLYVLSLLLFLTVCFLLYQNFLLRKSASADDDYRRFLFDILDNLP FPIMVKDIQEQFKYTYWNKESELQSGISRKDAVGFSDVDIFGEERGKQYRKIDEDLVRAG IPYKAEERYVTADGKVHDTLVMKSIISSGKLGKWLLVARWEITQLKMYERELLAAKEQLE GAVRKQKLALRSIDFGLIYIDRNYLVQWEETGNIKNLVSGRHYTPGTVCYRTTGQGTQPC GKCAFREAIGTGKIVRHITHVDHVDFEITATPVYDDTGNEIIGGLLRFEDITEKLKVERM LQEAKEKAEESNRLKSAFLANMSHEIRTPLNAIIGFSDLICQTDDAEEKEEYIRIVTSNN ELLLQLIDDILDLSKIEAGTMDFSYAPTDINELMEDICLQMQQKNQRPEVQIMFTEKEPG CVINTDRLRLSQVIMNLMNNAMKFTSEGSITLGYRLTRQKDELYFFVKDTGIGIPADQAG KVFERFVKLNTFVKGTGLGLAICRVIIERLGGTIGVETREGKGSCFWFRLPVREDMLLES PVR >gi|226332023|gb|ACIB01000033.1| GENE 35 46387 - 48087 1528 566 aa, chain - ## HITS:1 COG:RC0454 KEGG:ns NR:ns ## COG: RC0454 COG2194 # Protein_GI_number: 15892377 # Func_class: R General function prediction only # Function: Predicted membrane-associated, metal-dependent hydrolase # Organism: Rickettsia conorii # 172 529 168 522 522 158 30.0 3e-38 MKLFNSIKKWFGNQENLFYLFLFVLIVPNVVLCFTEPLPLVAKIANVLLPLGCYYLIMTL SRNCGKMLWILFLFVFFGAFQIVLLYLFGQSIIAVDMFLNLATTNSSEAMELLDNLLPAL ITIVILYIPALILGMISIVRKRMLSVRFIHRERRRAWVVLGAGLVSLGAAFLLDKKYEMT SDLYPVNVCYNVVLAVERNARTLDYEETSKDFTFNAAATHPAEDREIYVLVVGETSRALN WSLYGYDRETNPKLSEVSGLTAFTNVLTQSNTTHKSVPMLMAAVSAENFDSIYHQKGIIT AFKEAGFKTAFFSNQRYNNSFIDFFGKEADHCDFIKEDSLTAGQNLSDDYLLALVQEELA 
KGNRKQFIVLHTYGSHFNYRERYPAEAAFFQPDSPADAEFKYRDNLINAYDNSIRYTDDF LSRLIGLLQQQDAGSAMLYTSDHGEDIFDDHRHLFLHASPVPSYYQLHVPFIVWTSDTYR EKYPEHMDALQKNRHKSVASNRVVFHSVLDLAGVTTTYVNDSLSVASPSYTEFPRFYLND HNEPRSYDDIGLRKEDFEMFGKMGIR >gi|226332023|gb|ACIB01000033.1| GENE 36 48074 - 49294 975 406 aa, chain - ## HITS:1 COG:no KEGG:BF3380 NR:ns ## KEGG: BF3380 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 406 1 406 406 804 100.0 0 MRPKHKNWLLLGCFSIVWTASLAQGENHNDSLSPITHIPVIEDSASVSVTADSVAVKRSF FKKFLDYFNDANKEKKNKKFDFSVIGGPHYSSDTKLGLGLVAAGLYRTDRADTLLPLSTV SLYGDVSTVGFYLLGVRGSHIFPKDKYRLNYNLYFYSFPSLYWGVGYRNAVNDENESSYK RFQAQVKVDFMFRMAKNFYLGPMASFDYIDGRNFEKPELWQGMDARTSNVSAGLSLVYDS RDFLTNAYKGYYLRIDQRFSPAFLGNDYAFSSTELTTSYYRRVWKGGILAGQFHTLLTYG NPPWGLMATLGSSYSMRGYYDGRYRDKNVVDMQVELRQHVWKRNGVAVWVGAGNVFPDFS SFKVKHILPNYGFGYRWEFKKRVNVRLDLGFGKGQTGFIFNINEAF >gi|226332023|gb|ACIB01000033.1| GENE 37 50037 - 52250 1698 737 aa, chain + ## HITS:1 COG:no KEGG:BF3211 NR:ns ## KEGG: BF3211 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 737 1 737 737 1480 99.0 0 MKSKLLFLLLLLGSSYITAQTVIENPPFETRQGSIHTISKIELSPTETRLTIRTVFRPQW WTSLDSLTYIYAPESKKQLYPLRIEGRKFGEQVTTPASGIIEDDCVITYPPLPEGTTRID WMDDNLNSETNTYGILLVKPQETKEQASLRKIHGNWQQADAQNGWDIGIYDSLAIMDNRF WKYDLCRKQGKKFKLNLKDETGEQCLLEITQEKDGNLCIGKNGGKANKYIRAGRPQRNYA ATTAATAPQTAFFRKDTVHIRGFLDGYSPKLGFTTVLIYTDNHITNEGTPAVVTIHPDGR FESDFVVNYPGVHHLSMGNNWMTFYIRPGETLMLYINWEDHLDYLRQRRLKPMLTETLYM GPSSSINQELMPCEPLFSKDYHIIQNACKTLTPSEFKTQQEPMYKLWMHRVDSLEQSKTL QPEAMQMLKNDVMINYGAWLLEFILMRDMDARKDTTNTILKIKETPDYYDFLKAMPLNDV RSIGCNNFSTFINRIEFMNPFLPASWQIKNGTDDRMEKYAESWRKKKEILQDTTGMPFPV VGELILTRSYPFLAKTLENEKKAFALLDTLKGYLHDPFLVAEAERMYRQVYPVQGNKPQE LPAGKGTDIIRKLTAPYLGKFVIIDFWATSCGPCRASIEQHADLRKDYRNSPDIKFIFVT SNQDSPEKAYENYVEKHLKEETIFRLPQSDYNYLRELFHFNGIPRYVLLNRDGKLLDENF PMYNIELFLKESKIRKE >gi|226332023|gb|ACIB01000033.1| GENE 38 52370 - 53383 1056 337 aa, chain - ## HITS:1 COG:PA5413 KEGG:ns NR:ns ## 
COG: PA5413 COG2008 # Protein_GI_number: 15600606 # Func_class: E Amino acid transport and metabolism # Function: Threonine aldolase # Organism: Pseudomonas aeruginosa # 2 336 6 341 346 253 41.0 3e-67 MRSFASDNNSGVHPAIMEALTRANRDHALGYGDDLWTEEAVRKIKETFVADCEPLFVFNG TGSNVIALQLMTRPYNSILCAETAHIYVDECGSPVKMTGCQIRPIATPDGKLTPQLITPY LHGFADQHHSQPGAIYLSECTELGTIYTPDELKAITSLAHQYGMRVHMDGARIANACASL GLSLRALTVDCGIDVLSFGGTKNGLMMGECVIVFDDSLKSEARFIRKQSAQLASKMRYLS CQFTAYLTDELWLKNATHANAMAKRLADALEQVPGVRFTQRVESNQLFLTMPRAETDRML QTYFFYFWNEEADEIRLVTSFDTTEEDIDTFIRILKK >gi|226332023|gb|ACIB01000033.1| GENE 39 53473 - 53748 272 91 aa, chain - ## HITS:1 COG:no KEGG:BF3385 NR:ns ## KEGG: BF3385 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 91 32 122 122 154 100.0 9e-37 MMEQIRLEAEKQQKRKERVMLCGMIAGILLLLGVGVYTLVFKLEFNFKEYLSGMDFSHAD SSLLAFYSYIATLVLLLLGLDYWLRKKKFHS >gi|226332023|gb|ACIB01000033.1| GENE 40 53844 - 54395 483 183 aa, chain - ## HITS:1 COG:mll5118 KEGG:ns NR:ns ## COG: mll5118 COG1595 # Protein_GI_number: 13474268 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 10 175 12 174 188 82 32.0 3e-16 MVQNDETQYIARILDGDTECFSAFLDRYSRPLYVLIVQIVGCSEDAEELVQDVFLKAFRC LGSYRGECRFSTWLYRIAYTTAVSATRKKKQEFLYIEENTINNVPDEKADDILYPTDDEE RTARLIQAIDLLNVEEKALITLFYYEEKSIEEIGEVLKLSPGNVKVKLHRTRKKIYVLMN GKE >gi|226332023|gb|ACIB01000033.1| GENE 41 54526 - 54954 528 142 aa, chain + ## HITS:1 COG:no KEGG:BF3215 NR:ns ## KEGG: BF3215 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 142 1 142 142 231 100.0 5e-60 MMDFITAPLIVGIITLGIYKLFELFACRRERITLIEKLGEKMSQTDLELNGKICLPDFNR PQLSFGALKGGCLLLGVGLGLLVGFILSYVSFSPYDLDRLDRGYTREMVGVIYGSCTLLF GGAGLVASFLIEQNFAAKKKEK >gi|226332023|gb|ACIB01000033.1| GENE 42 55077 - 55295 354 72 aa, chain - ## HITS:1 COG:no KEGG:BF3388 NR:ns ## KEGG: BF3388 # Name: not_defined # Def: hypothetical 
protein # Organism: B.fragilis # Pathway: not_defined # 1 72 1 72 72 111 100.0 1e-23 MDKNEIGLNAGKVWQLLSNNDKWSYGNLKKKSGLKDKDLGAALGWLAREDKIEFEQEEEE LYVYLCVNVYIG >gi|226332023|gb|ACIB01000033.1| GENE 43 55536 - 57275 1630 579 aa, chain + ## HITS:1 COG:STM0935 KEGG:ns NR:ns ## COG: STM0935 COG0028 # Protein_GI_number: 16764297 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] # Organism: Salmonella typhimurium LT2 # 1 572 1 570 572 565 48.0 1e-161 MAKKVAEQLVDTLIEAGVRRIYAVTGDSLNEVNEAVRQNEAMKWIHVRHEETGAYAAGAE AQLTGLPGCCAGSSGPGHVHLINGLYDAHRSGAPVIAIASTIPTGEFGTEYFQETNTVKL FNDCSFYNEVATTPEQFPRMLQSALQTATTRKGVAVVGLPGDLAKKPAVKVESSEQIYPL ASSVCPAEEDLIRLAGMLNHYERITLFCGIGCKGAHEEIIRMSETLNAPVAYTFKGKMEV QYDNPYEVGMTGLLGMPSGYYSMHEAEVLVMLGTDFPYSAFLPDDIKIVQVDIKPERLGR RAKVDLGLCGDVRSTLRALLPMLQQKKNDSFLRKQLKRYEGVKKDLAAYTEDKGKMDQIH PEYVMSEINNISSDDAIYTVDTGMTCVWGARYLQATGKRHMLGSFNHGSMANALPQAIGA ALACPDRQVIALCGDGGLSMTLGDLETVVQYKLPIKIIVFNNRSLGMVKLEMEVDGLPDW QTDMLNPNFAQVAEAMGMTGFNVSDPEEVLNTLCNAFELEGPVLINVMTDPNALAMPPKI ELGQMVGFAQSMYKLLINGRSQEVIDTINSNFKHIREVF >gi|226332023|gb|ACIB01000033.1| GENE 44 57496 - 58071 638 191 aa, chain - ## HITS:1 COG:MA0327 KEGG:ns NR:ns ## COG: MA0327 COG0655 # Protein_GI_number: 20089225 # Func_class: R General function prediction only # Function: Multimeric flavodoxin WrbA # Organism: Methanosarcina acetivorans str.C2A # 1 190 1 190 191 213 52.0 2e-55 MKIVAFNGSPRKGGNTELLIKEVFKPIQEAGIETELVQLGGKLLRGCASCYTCFKTKDGK CAIKTDPMNEFIQKAQEADGIILASPTYYGSVSAEMKAFMDRLGLTTIGQGRTLTRKVGA AVISVRRGGAVTVYDELNRFMLGSGMIVPGSTYWNFGIGEMPGEVFDDAEGLRNMKDLGV QLAWLLKAIHN >gi|226332023|gb|ACIB01000033.1| GENE 45 58269 - 59552 466 427 aa, chain - ## HITS:1 COG:no KEGG:BF3220 NR:ns ## KEGG: BF3220 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: 
not_defined # 1 167 1 167 183 330 95.0 5e-89 MLTKYLRVILFLCVLFWGRSVVDAQTKGIVVDGVKGRSLSDVNIYLQKDSVGIGSTDRNG EFMFSRDQITISDTIVFSHVGYFLLKCTLSELQHLGYKVVLHEHPQLLHEVVVSRERLPF FLEWTSLSPLPKPLYSFGGFLHAGKIYVVAGDETLVRMVTDKHRQGTEAWEYRSSNMYVY DIATDVWRKGAKGFVPRAGHAAHFYNGKIFVLGGKRFSTNRQLEYTDATMEIYDPDKDTL YVDPVNPHQAVDFTSFIYDDCLYAMGGAIREKAYSNKIHTLDLKRGVWYELEGTLPAGRY GRMNGILVGDKVYFWGGYHTAPMWTAASYDLRTGEWRRLCDLKDGVSYPGLASDGSYIYI FENRNLQVYHIETDTMRIYELAALDVENAGLFYWRNTLYIVGGCNRQGIYVVPHRDVIAI DVSQINP >gi|226332023|gb|ACIB01000033.1| GENE 46 59578 - 60735 442 385 aa, chain - ## HITS:1 COG:no KEGG:BF3221 NR:ns ## KEGG: BF3221 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 385 49 433 433 805 99.0 0 MKTHLIFLWFITFGVMACTSQPGINTGENKVEGDTIPVLDFASAIHKQVPDTFMWNSVAR KITYIPLASSHLMDGHPVIEYLDDDMCIIMEGKSQWINCVDYKGNFLSTFRHVGNGPGEY VNLSSVVYHSKDSTIRIFDNGSYKHIIYNKQGKFLREISLADSEFNYLLHMQSDTYFFRG SFSKGKSEIIVTDTTLRVKFPILPFDSTADYITRGAIMLNTTGEECAPDICLFNHIYSDS VFLLTPDGLKLDFILRKGKYAPSLEDVKQFMKWNQYDPFIKGLFIKTFPGYYYLQYTYKE QVLGEIWSRKTNQIVSRSILTRPNQFTTLRGIRFRFPSGTVIRLLPDYISGNKIAFFIPA DEAMGEIPGVKIREDDNPILMVMEL >gi|226332023|gb|ACIB01000033.1| GENE 47 60881 - 62518 1691 545 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|167855908|ref|ZP_02478658.1| 50S ribosomal protein L28 [Haemophilus parasuis 29755] # 2 545 3 547 547 655 61 0.0 MAKEILFNIEARDQLKKGVDALANAVKVTLGPKGRNVIIEKKFGAPHITKDGVTVAKEIE LTDAYQNTGAQLVKEVASKTGDDAGDGTTTATVLAQAIIAEGLKNVTAGASPMDIKRGID KAVAKVVDSIKHQAEKVGDNYDKIEQVATVSANNDPVIGKLIADAMRKVSKDGVITIEEA KGTDTTIGVVEGMQFDRGYLSAYFVTNTEKMECEMEKPYILIYDKKISNLKDFLPILEPA VQSGRPLLVIAEDVDSEALTTLVVNRLRSQLKICAVKAPGFGDRRKEMLEDIAVLTGGVV ISEEKGLKLEQATIEMLGTADKVTVSKDNTTIVNGAGAKENIKERCDQIKAQIAVTKSDY DREKLQERLAKLSGGVAVLYVGAASEVEMKEKKDRVDDALRATRAAIEEGIVAGGGVAYI RAIESLDGLKGENDDETTGIAIIKRAIEEPLRQIVANAGKEGAVVVQKVSEGKGDFGYNA RTDVYENMHAAGVVDPAKVTRVALENAASIAGMFLTTECVIVEKKEDKPEMPMGAPGMGG MGGMM >gi|226332023|gb|ACIB01000033.1| GENE 48 62561 - 62833 309 90 aa, chain - ## HITS:1 
COG:RC0969 KEGG:ns NR:ns ## COG: RC0969 COG0234 # Protein_GI_number: 15892892 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Co-chaperonin GroES (HSP10) # Organism: Rickettsia conorii # 1 89 5 98 99 98 56.0 2e-21 MNIKPLADRVLILPAPAEEKTIGGIIIPDTAKEKPLKGEVVAVGHGTKDEEMVLKAGDTV LYGKYAGTELEVEGTKYLIMRQSDVLAVLG Prediction of potential genes in microbial genomes Time: Tue May 17 23:06:00 2011 Seq name: gi|226332022|gb|ACIB01000034.1| Bacteroides sp. 3_2_5 cont1.34, whole genome shotgun sequence Length of sequence - 48867 bp Number of predicted genes - 37, with homology - 36 Number of transcription units - 20, operones - 11 average op.length - 2.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 22 - 81 5.1 1 1 Op 1 . + CDS 221 - 1420 706 ## BF3224 putative lipoprotein 2 1 Op 2 . + CDS 1451 - 2590 529 ## BF3398 hypothetical protein 3 1 Op 3 . + CDS 2604 - 3737 586 ## BF3399 hypothetical protein 4 1 Op 4 . + CDS 3755 - 4837 413 ## BF3227 hypothetical protein 5 1 Op 5 . + CDS 4837 - 5739 568 ## BF3228 putative lipoprotein 6 1 Op 6 . + CDS 5765 - 7801 1327 ## BF3402 hypothetical protein 7 1 Op 7 . + CDS 7805 - 8347 463 ## BF3403 hypothetical protein - Term 8393 - 8433 -0.2 8 2 Op 1 . - CDS 8534 - 9415 781 ## COG2326 Uncharacterized conserved protein 9 2 Op 2 . - CDS 9453 - 9614 74 ## BF3405 hypothetical protein - Prom 9670 - 9729 4.5 - Term 9643 - 9682 1.3 10 3 Op 1 . - CDS 9744 - 10322 370 ## BF3406 hypothetical protein 11 3 Op 2 . - CDS 10334 - 10807 343 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 10877 - 10936 4.9 12 4 Op 1 . - CDS 11624 - 11785 238 ## gi|255010895|ref|ZP_05283021.1| hypothetical protein Bfra3_17282 13 4 Op 2 4/0.000 - CDS 11868 - 12887 842 ## COG0392 Predicted integral membrane protein 14 4 Op 3 . - CDS 12917 - 14068 686 ## COG0438 Glycosyltransferase - Prom 14302 - 14361 4.5 + Prom 14286 - 14345 5.6 15 5 Op 1 . 
+ CDS 14371 - 16554 2128 ## COG3537 Putative alpha-1,2-mannosidase 16 5 Op 2 . + CDS 16578 - 19115 2361 ## COG1506 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases + Term 19271 - 19318 14.4 - Term 19259 - 19306 3.1 17 6 Tu 1 . - CDS 19371 - 20735 1570 ## COG0124 Histidyl-tRNA synthetase - Prom 20966 - 21025 5.7 + Prom 20877 - 20936 5.8 18 7 Tu 1 . + CDS 20978 - 22219 1012 ## BF3239 hypothetical protein + Term 22298 - 22333 -0.6 - Term 22257 - 22290 -0.5 19 8 Tu 1 . - CDS 22322 - 23296 437 ## BF3240 putative lipoprotein - Prom 23400 - 23459 3.5 - Term 23705 - 23746 11.1 20 9 Tu 1 . - CDS 23770 - 24456 740 ## COG2738 Predicted Zn-dependent protease - Prom 24482 - 24541 7.0 - Term 24504 - 24556 11.2 21 10 Tu 1 . - CDS 24575 - 25879 1332 ## COG3669 Alpha-L-fucosidase - Prom 25956 - 26015 4.7 - Term 25886 - 25943 3.2 22 11 Op 1 . - CDS 26017 - 27288 1366 ## COG0104 Adenylosuccinate synthase 23 11 Op 2 . - CDS 27285 - 27773 214 ## COG0735 Fe2+/Zn2+ uptake regulation proteins - Prom 27819 - 27878 6.2 + Prom 27789 - 27848 4.7 24 12 Tu 1 . + CDS 27869 - 29536 1080 ## COG1807 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family - Term 29553 - 29617 14.3 25 13 Op 1 . - CDS 29634 - 30296 701 ## BF3424 hypothetical protein 26 13 Op 2 . - CDS 30300 - 32330 2185 ## BF3425 putative dipeptidyl-peptidase III - Prom 32360 - 32419 5.6 - Term 32371 - 32417 3.4 27 14 Tu 1 . - CDS 32444 - 32908 494 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 32978 - 33037 6.2 28 15 Tu 1 . + CDS 33142 - 34947 1354 ## COG0514 Superfamily II DNA helicase + Term 35078 - 35128 -0.8 - Term 35257 - 35306 -0.9 29 16 Op 1 . - CDS 35342 - 36211 951 ## COG0457 FOG: TPR repeat 30 16 Op 2 . - CDS 36268 - 37218 740 ## PROTEIN SUPPORTED gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 - Prom 37259 - 37318 6.8 + Prom 37073 - 37132 2.0 31 17 Op 1 . 
+ CDS 37309 - 39147 1487 ## COG1368 Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 32 17 Op 2 . + CDS 39172 - 39837 478 ## BF3432 putative membrane-associated phospholipid phosphatase + Term 39858 - 39906 8.2 33 18 Tu 1 . - CDS 39876 - 40343 -8 ## - Prom 40411 - 40470 4.5 + Prom 40291 - 40350 4.6 34 19 Op 1 . + CDS 40434 - 42572 1052 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 35 19 Op 2 . + CDS 42622 - 43638 1107 ## BF3435 hypothetical protein + Term 43754 - 43809 3.5 36 20 Op 1 . - CDS 43860 - 45479 1327 ## BF3257 hypothetical protein 37 20 Op 2 . - CDS 45492 - 48671 2693 ## BF3437 hypothetical protein - Prom 48773 - 48832 2.4 Predicted protein(s) >gi|226332022|gb|ACIB01000034.1| GENE 1 221 - 1420 706 399 aa, chain + ## HITS:1 COG:no KEGG:BF3224 NR:ns ## KEGG: BF3224 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 399 1 399 399 730 97.0 0 MKSFKIMLITVLSVMTLSCSEKTTTEIPETELKNISCSTQLVTHSSTQTRGAAPTTLNEN DFVLYAFKKNGEGNFSYEEAKHETSVIDGGVWKYNAAFPVGTYKFIAFYNLDEKNQAALN TTITGISNQTWENILKSIVITHFPSATDQCHEDMNEIFCGKTKDAIDISSGIGGDDNEIK IKLTLERIVSRIDIKFIKVASDDDHIEVPYATGNIFGGTGTTSLTSLIFTSSNVPLKYNL NGENPDYVTGVQCQVTYAAPSFQYGEANADAVKALDYQPFPKNADAINNNMETNIKKGIA KGGAYFMGSYLLPFPSSSQTLNANIQLNKANQERTIKVPGFTVTSNYVSIITVRLKSSTD AGDNNNGNDDEHLFNPKSTFIVEIEKTYDGIHNTNVDVE >gi|226332022|gb|ACIB01000034.1| GENE 2 1451 - 2590 529 379 aa, chain + ## HITS:1 COG:no KEGG:BF3398 NR:ns ## KEGG: BF3398 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 379 1 378 378 736 99.0 0 MTMKRYIRQILLGTLLLGSACTNHQEVIEEDTSVSVVFKTRQLPSTKRENDDEISVLIFR KEKDQFTKVESIQESPWLFEQEQLQKNILLPTGIYRFLLAKGFSNSPGNSGKISFVTDNN PGNLVSTSYDRNYYFQYPTRTLNGQTELNHCTTALFTDKNEESAQNSQTEYDLSQTNRTL IARKISHLQGQLCFIIQRAKKENGQYTLIDSPENSFDNALKKISAINLTIEGSSQCCYLS ETGLIFKSPMSYHCLVRLQNGDYSFKRFDADQFVSLFSPIIEKTDYQRYEGAAYCEGPLL FPAPLDQTIRVTIDIDYTGTLKNKTFTLENIPLERNKMYLFTLWLLNEDVDISISPDADL 
DLEELKFNNQITGNDGFWN >gi|226332022|gb|ACIB01000034.1| GENE 3 2604 - 3737 586 377 aa, chain + ## HITS:1 COG:no KEGG:BF3399 NR:ns ## KEGG: BF3399 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 377 1 377 377 683 98.0 0 MKPHTLFLTLGILLLGFSACEPLNETPETPAFSLNLTSPDGEPSTDNDNAVLQKLVYHLF IFRSNTANPSPDNDGSNYTFWRHTGDLTLKQIREYTLSIPSESTDRSYLLLVHATPKEKP ESEIISKEGMTFSESEISMIKENDNNYVPLSKDNYYAIQQLTPEDIAQGKTSIEFKLKRA VGELVFDVMKCDEKSHNPIDIDTECSSTLDRVFRIDIAVNGVIPKVSLTNETRNPERINI CFSKEIVLKSDYTPDFANNTGVIEPLTNAPLDTNEKAVKGATRICGPYLFSKMTLDYPDE GATDPEEGIKTILNFSYYDTTPLPNGSYSTKKLILSLTDKPLTIVKDHYTVTNIRLRNNR IIDLSVSGDFGIDWKWD >gi|226332022|gb|ACIB01000034.1| GENE 4 3755 - 4837 413 360 aa, chain + ## HITS:1 COG:no KEGG:BF3227 NR:ns ## KEGG: BF3227 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 2 360 1 359 359 699 98.0 0 MMKNKLLSIWIGCGSVLCSYTSCSKEMEFIDSPNEQVTIQLTTTIDGQGFSTDNEIIPMK TRSTRSTDYYVTAIYDSRLVIIKETGQSAWSIEKIEDNVIPQGYIKSSDACKYTMQTALR PGNYKFMLLVNGPKNEALHLYQSFSEEDVPWLTDTEKDYSGYEIFFATSGIVTIKKTDRL EECGQYHPLPTPLVLKRYSSLIRLCATGDDFWGSLERATDISYSIKEQSINGINLLGQIV SKDTELSFGTLTASIRKNFECQLGDKKLFFSYFGDEDEAKQLQNILTPSDGKDITVTIFQ KEKQTSPIETTTQAKPNQVTTLILDKTESVLNWNQSPIIGTPIDWDPDNIPWGHLELNFL >gi|226332022|gb|ACIB01000034.1| GENE 5 4837 - 5739 568 300 aa, chain + ## HITS:1 COG:no KEGG:BF3228 NR:ns ## KEGG: BF3228 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 300 1 300 300 568 99.0 1e-160 MKTKYILYLLSLVFLLSSCWKEDLKNCWMGDVTLTIVAEKFQQPATDEGKLEENLSTRIN SIRYYLYKDNVLIHSGIIDNVTDLNTDAYKLTFPKLAFGDYCLALATNVSEEELPDTTTP EALNLNYPGVEQTKDYFTSCYDFTVDCECGYQDFVILRRTQGVTQFQLKQLPENITGIGI EIHKVAATCQIDTLYKGETIAEYHTTVAEVTNAEEVADFSIGTFPTSGPENASITLKLYA DDHSGSPIYQKTLNNISILRNQLTRISTDFNHSILGNSGFSIAINPDWDGIHDDDETIIP >gi|226332022|gb|ACIB01000034.1| GENE 6 5765 - 7801 1327 678 aa, chain + ## HITS:1 COG:no KEGG:BF3402 NR:ns ## KEGG: BF3402 # Name: not_defined # Def: hypothetical 
protein # Organism: B.fragilis # Pathway: not_defined # 1 662 19 680 696 1268 99.0 0 MKKRIQILIIVILYALMFVPLSAQTPHTISGIVKDRASVPISGVNIRVEGQKWKASTGKE GKFTLEIPQNSTITFSSVGYEPVTIHSGEKKWIEITLKESQSLLPEITVTSTLKNSMKFV FAPSDLELIKDMLYLKTRYKIPSKRFQSDSRVIIQPILSNNSRGTQKNFSPIVYDGKNYD ILLRRGNVCGDRAEKEYYSRFAQVIDPDSICNQTLTYADSCTVDDINDLYTTEVRIKIST FCQDEYRDTIRITNGIIYPMRFFNYNLSAMDLDNSYIPKQTPLNFNEKGEMHLRFRPEDA NIYENEGKNAEELRKMKKALDDIDKDRTKTLTTFQIIGYTSPEGTYEYNLKLAKKRMKNA EGKVFENISEETIRKAKVDNDAVVESWTTVCELMEKDSIQEVSQLKELIKRARGNHNEIS WGARRMKIYPLIRDRYLPRLRRVEYFYEYSELRTLNKDEIDALYKKDPKKLTASEFWSYI MSQKDATDEKREALYREALSIHPDLMIAANNLASLLIKQNRADTTLLKPFITQDAPSAIL VNQTVAYLQKRDFKRANHFAELLPDNKDTEIVKALAAAMDGKYQEAYPIFEKQGGINQAI LLLSMKQNSKAWEVLKKIEDTSPDTEYVKAIAANRLNNVNEAVIHLRNAITQKPSLKEIA QKDGDVLDLLDLLDLDKK >gi|226332022|gb|ACIB01000034.1| GENE 7 7805 - 8347 463 180 aa, chain + ## HITS:1 COG:no KEGG:BF3403 NR:ns ## KEGG: BF3403 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 180 1 180 180 318 99.0 7e-86 MKKRAFFLLLLLLIPLTSILSQEIALKTNVLSWATTTINLGAEFKISPRLTAGADIMYKG WSFLSDNRKMGGFLVQPEAKYWFCIPFYKHFMGLHAHYGQYNGGFSKYRYQGDLYGIGLS YGYQWIWKRRWNIEVSAGIGYASMNYDKYERPKCGLFLGKDHSNYFGLTKLGVSLIYILK >gi|226332022|gb|ACIB01000034.1| GENE 8 8534 - 9415 781 293 aa, chain - ## HITS:1 COG:all2088 KEGG:ns NR:ns ## COG: all2088 COG2326 # Protein_GI_number: 17229580 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Nostoc sp. 
PCC 7120 # 9 287 6 280 289 314 56.0 1e-85 MKKEILKRLLAKPGEEHSVSEFDARYTGDLTKSEAEALLAENIGKLSSLQDKLYAQDRYA VLVIFQAMDAAGKDGTIKHVMSGINPQGCQVYSFKQPSAEELDHDYLWHINRCLPERGRI GIFNRSHYEDVLVAKVHPEIVLSAKLPGIMGPGDITPKFWKKRYRQINDYERYLTENGTV IIKFFLNVSKEEQKRRFLSRLEDEAKNWKFSVSDLKERSYWDDYMKAYSDMLTHTSTEEA PWYVIPADNKWFMRYAVGQILCDRMNELDLHYPEMPVEARHQIEDFKRALLNE >gi|226332022|gb|ACIB01000034.1| GENE 9 9453 - 9614 74 53 aa, chain - ## HITS:1 COG:no KEGG:BF3405 NR:ns ## KEGG: BF3405 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 53 1 53 53 70 100.0 1e-11 MDTNELKQRQHFILIRTLLVVTALALLVYIAFSMAYSSRRFQESTSIIEWVND >gi|226332022|gb|ACIB01000034.1| GENE 10 9744 - 10322 370 192 aa, chain - ## HITS:1 COG:no KEGG:BF3406 NR:ns ## KEGG: BF3406 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 192 1 192 192 355 99.0 7e-97 MELDELKQKWTELSEQVEKNELLNRQIIIDMIQSKKETHLQQQLRVEKMAFGVLGLFLGI VCYTFWRNVAPGWISWYLLGMVIWLLLMQTLMFRIIYTLKTVTEHVEQQYKRLQSYKVLM NLTYIFSYVIITPVIIAFFYIWHNPLFRTVLCVMILAGFLGDYFIYHKTGDRLKGFRDAV RALQDLKSGKQE >gi|226332022|gb|ACIB01000034.1| GENE 11 10334 - 10807 343 157 aa, chain - ## HITS:1 COG:CC3310 KEGG:ns NR:ns ## COG: CC3310 COG1595 # Protein_GI_number: 16127540 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Caulobacter vibrioides # 6 154 7 158 166 67 30.0 1e-11 MEIEKDFIDLLTEHKALIYKVCFMYASNQEDLNDLYQEVVVNLWCSYPKFRYESKLSTWI YRVALNTCISDLRKKKVLDYVPLSVDIGVYDDCLRNDSLKEMYQLICQLDRYERMLVLLW LDENSYDEIASITGSNRNTVAVKLHRIKDKLKKMSNQ >gi|226332022|gb|ACIB01000034.1| GENE 12 11624 - 11785 238 53 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|255010895|ref|ZP_05283021.1| ## NR: gi|255010895|ref|ZP_05283021.1| hypothetical protein Bfra3_17282 [Bacteroides fragilis 3_1_12] # 1 53 1 53 53 81 90.0 2e-14 MSREELQQKSRFAMIGTLMTIVSLVFLFYVGSSLVNTTKKYKELAIEIACINK >gi|226332022|gb|ACIB01000034.1| GENE 13 11868 - 12887 842 339 aa, chain - ## HITS:1 
COG:AF2231 KEGG:ns NR:ns ## COG: AF2231 COG0392 # Protein_GI_number: 11499813 # Func_class: S Function unknown # Function: Predicted integral membrane protein # Organism: Archaeoglobus fulgidus # 50 330 33 313 328 75 25.0 1e-13 MSDSIKTFKTGYILLPVLIGLSVVGWLFYREFNPELFSGIRFSWHLVGGLLLAVLFMFGR DGGLMWRFRFITDRELTWRQAFRVNMLCEFTSAVTPSAVGGSSLIVLFLNKEGINAGRST ALTISCLFLDELFLVLACPFALLLFSFDDLFGNVAILSSGIEVLFFLVYGVVTIWTFLLY LALFRRPEWVKRLLLTIFRLPLLRRWHKAIETLTDNLVLSSREMSQKSFTFWLKAFGITC LAWTSRYLVVNALLIAFTTSGSQLLAFVRQLILWVVMTISPTPGGSGVSEYMFREYYADF FDVAGMALVVAFVWRIITYYMYLLIGAIIIPGWVKKLRD >gi|226332022|gb|ACIB01000034.1| GENE 14 12917 - 14068 686 383 aa, chain - ## HITS:1 COG:lin2700 KEGG:ns NR:ns ## COG: lin2700 COG0438 # Protein_GI_number: 16801761 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Listeria innocua # 2 338 3 342 427 118 26.0 2e-26 MIGLFNDCFPPIMDGVSLTTQNYAYWLHRKAGNVCVVTPKSPDARDAEEYPVYRYSSVPI PMRKPYRLGFPRIDWPFHERISRLSFELVHAHCPFSSGALAMQIAKEQHVPIVATFHSKY RADFERAIPSRLLVNYLIKKVIRFYEAADEVWIPQAAVEETLREYGYKGRIEVVDNGNDF AGTPFLQSVRQEARRTLGIRSGEFMFLFVGQHIWEKNLGFLLDSLVRLSDVPFRMYFVGS GYASHELKQKVVELGLASKVSFVGSIVEREILKRYYVAADLFLFPSLYDNAPLVIREAAA LGTPSVLIRDSTASEIISDSVNGFLSPNSTEAYSNRIREILCSTAIIKQVGEEASRTIAR SWEDVAGEVYDRYNRLIKRNGNK >gi|226332022|gb|ACIB01000034.1| GENE 15 14371 - 16554 2128 727 aa, chain + ## HITS:1 COG:XF0842 KEGG:ns NR:ns ## COG: XF0842 COG3537 # Protein_GI_number: 15837444 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Xylella fastidiosa 9a5c # 18 726 33 761 790 518 39.0 1e-146 MKFKSTFMTACLGIGLCTSCTPETPTAPQDYTQYVNTFIGAADNGHTFPGACLPFGLIQA SPETNAIGWQYCSGYNYQDSLIWGFSQTHLNGTGCMDLGDLLVMPVTGQRVRDDYKSGFS KKTESATPGYYTVELDKYKVKAELTATDHVALHRYTYQNADSASLLLDLQHGLVWNPQQY KSHVKACEINWEDAQTLTGHVRSSVWVNQDLYFVMKFNKPVTDSIYLPMEETEKGKRLIM SFDMKPDEQLLMKVAISTVGVDGAHKNMEKELADWDFDGTRQKAKDSWNSYLSRIEVTGT PDELENFYTSFYHALIQPNNIADVDGRYRNAKDSIVKSSSGVYYSTFSIWDTYRAAHPFY 
TLAVPERVDGFINSMIEQNQAQGYLPVWTLWGKETNTMIGNHSVSVIAEAYKKGFRGFDA EKAFDAIKQTLTVSHPKSDWETYMKYGYYPTDKVDAESVSRTLESVYDDYAAATMAGLMG KKEDAEYFGKRSEFYKNLFDKETQFMRPRYADGRWKTPFNPSDLAHAESRGGDYTEGNAW QYTWHVQHDVPGLIELFGGKEVFLNKLDSLFTIELKGSGLADVTGLIGQYAHGNEPSHHV TFLYALAGKPERTQELIREIFDTQYKNKPDGLCGNDDCGQMSAWYMFNAMGFYPVDPVSG HYVFGAPQMPKIVLHLPDGKTFTVIAENLSKEHKYIDSITLNGEPYTKNYISHEDIVKGG TLVYKMK >gi|226332022|gb|ACIB01000034.1| GENE 16 16578 - 19115 2361 845 aa, chain + ## HITS:1 COG:PAB1300 KEGG:ns NR:ns ## COG: PAB1300 COG1506 # Protein_GI_number: 14521796 # Func_class: E Amino acid transport and metabolism # Function: Dipeptidyl aminopeptidases/acylaminoacyl-peptidases # Organism: Pyrococcus abyssi # 484 830 291 629 631 161 32.0 5e-39 MDKKLLMMALLITTGLATHAQEKLTRYQVRNAITVRTPIMNDSINPKGEKHTAKALLQTP VVLDLANAPTQMTAADTAGLVTFAKADKDNLLYLIKTQLRAERFMKGKLKVTSPVRWELF INGESKMVKDASEDSISKAATKEVALRLEPEMDYEIAIKLLSTPDDKTVPSLKCELVKDD KFKEVACSTDPEQKHRFSLDNTVYGNRAIAVSVSPDGKYLLTRYWDNHSLKRSRTYCELT ELKTGKVLLTNLRDGMRWMPKSNKLYYTVVAPEGNDVITLDPVTLKEEVLLRGIPEQGFS WSPNEDFLIYYPREEGVKDEGPLKRIVSPADRIPNTRGRSFLARYDIASGTSERLTYGNH STYMQDISPDGKYLLYSSSKENITQRPFSLSSLFQVNLETLAVDTLFFEDRFLGGASYSP DGKQLLLTASPEAFDGIGKNCGNHPIANDFDSQAFIMDLATRKIDPITKEFNPSVNFLQW NKGDGCIYFSTNDEDCRNIYRYSPKDRKFEKLNLETDVTSAFAMSENNPSLAAYIGQGCY NAGVAYVYDLKKKTSRLIADPMKPTLERIELGEMKPWNFTASDGTEIKGMMCLPPSFDPN KKYPLIVYYYGGTTPTERGISNPYCAQLFASRDYVVYVIQPSGTIGFGQEFSARHVNAWG KRTADDIIEGTKQFCKEHPFVNDKKIGCLGASYGGFMTQYLQTQTDIFAAAVSHAGISNV TSYWGEGYWGYGYNAIAAADSYPWNNPELFTKQGSLFNADKINTPLLLLHGTVDTNVPIG ESIQLFNALKILGKTVEFITVDGENHFIADYPKRVQWHNSIMAWFARWLQDSPQWWEDMY PERHL >gi|226332022|gb|ACIB01000034.1| GENE 17 19371 - 20735 1570 454 aa, chain - ## HITS:1 COG:HP1190 KEGG:ns NR:ns ## COG: HP1190 COG0124 # Protein_GI_number: 15645804 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Histidyl-tRNA synthetase # Organism: Helicobacter pylori 26695 # 5 450 4 436 442 246 33.0 5e-65 MAAKPGIPKGTRDFSPVEMAKRNYIFNTIRDVYHLYGFQQIETPSMEMLSTLMGKYGEEG 
DKLLFKIQNSGDYFSGITDEELLSRNAAKLASKFCEKGLRYDLTVPFARYVVMHRDEITF PFKRYQIQPVWRADRPQKGRYREFYQCDADVVGSDSLLNEVELMQIVDTVFTRFGIRVCI KINNRKILTGIAEIIGEADKIVDITVAIDKLDKIGLDNVNKELAEKGISEEAIAKLQPII LLSGTNAEKLATLKTVLSDSETGLKGVEESEFILNTLQTMGLKNEIELDLTLARGLNYYT GAIFEVKALDVQIGSITGGGRYDNLTGVFGMAGVSGVGISFGADRIFDVLNQLELYPKEA VNGTQLLFINFGEKEAAFSMGILSKARAAGIRAEIFPDAAKMKKQMSYANVKNIPFVAIV GENEMNEGKAMLKNMESGEQQLVTAEELIGALTK >gi|226332022|gb|ACIB01000034.1| GENE 18 20978 - 22219 1012 413 aa, chain + ## HITS:1 COG:no KEGG:BF3239 NR:ns ## KEGG: BF3239 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 413 50 462 462 813 100.0 0 MKTVFLIITMLCLGLSGMAQNNEKKEFERKYSALTQEVANARKQKDYKKAEKLNWEKLKL YQGLPQQLQENAKNITGLVYYDLACYQALQGKKKAALKNFEKAFNSGWNDYNHAKGDSDI DNLRKEKKYLEIMAKMRLDSDYLYILQQAEGYNRTETKPICYNESVTDTLPRFTYMNPND SNLVQLRKHFNLDSIAGSGDEISKIKNLLYWVHNIVPHDGNSRNPEERNTIAMVELCKKE NRGVNCRMMAQMLNECYLAMGFKSRFITCMPKVMINDCHVINAVYSNTLDKWLWMDPTFN AYVTDEKGNLLGIGEVRERLRNNQPVVLNEDANWNNKNKQTKEYYLDYYMAKNLYYVTCP LQSEYNAETNYPGKKWPMYISLVPEGYSSNGKPGATAYDSHNDSYFWQSPYQE >gi|226332022|gb|ACIB01000034.1| GENE 19 22322 - 23296 437 324 aa, chain - ## HITS:1 COG:no KEGG:BF3240 NR:ns ## KEGG: BF3240 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 324 61 384 384 640 98.0 0 MPETNEEALINNIDRVYESNGRYFILDKRMKQVLCYTLSGKHLFTIHAVGNGKGEYVDLF DIAINEAENKLLLLTYPSQILYYDLEGTYLSSYPLDDMYQSFAVDKGFIYLRNDTYANGV VSDYSLIVINKETGKQTSLLEPLYETAPFCSSGNYQITNTASVLFTRKFDNHIYRITGES IEPLYMVDWKDKAFPESDKQRQFQCNDLNQFCYQGKYVYTMTDLCDTPSYLLFRTNQPGM CLLSKATSTVNNYQVIINTDYQLPLPNYMSVEGKQSWIFFIYSSEVLCEQKKLSAEEDIN EKMSSLLGQIKEGDNPVIFTYHVK >gi|226332022|gb|ACIB01000034.1| GENE 20 23770 - 24456 740 228 aa, chain - ## HITS:1 COG:BH1677 KEGG:ns NR:ns ## COG: BH1677 COG2738 # Protein_GI_number: 15614240 # Func_class: R General function prediction only # Function: Predicted Zn-dependent protease # Organism: Bacillus halodurans # 8 225 5 220 224 175 43.0 5e-44 
MVIGFQWIIFIGIALVSWLVQMNLQNKFKKYSKIPTGNGMTGRDVAIKMLQDNGIYDVQV THTPGQLTDHYNPANKTVNLSEGVYDSNSIMAAAVAAHECGHAVQHARAYAPLTLRSKLV PVVSFASQWMTWLLLAGILLLEPFPQLLFAGIILFAMTTLFSFITLPVEIDASKRALVWL SASGITNSYNHRQAEDALRSAAYTYVVAALGSLATLIYYIMIFMGRRE >gi|226332022|gb|ACIB01000034.1| GENE 21 24575 - 25879 1332 434 aa, chain - ## HITS:1 COG:TM0306 KEGG:ns NR:ns ## COG: TM0306 COG3669 # Protein_GI_number: 15643075 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Thermotoga maritima # 37 352 24 358 449 133 31.0 7e-31 MKKHFITFLLLVGMTASLTAQQKYQSTEANLKARSEFQDNKFGIFLHWGLYAMLATGEWT MTNNNLNYKEYAKLAGGFYPSKFDADKWVAAIKASGAKYICFTTRHHEGFSMFDTKYSDY NIVKATPFKRDVVKELADACAKHGIKLHFYYSHIDWYREDAPQGRTGRRTGRPNPKGDWK SYYQFMNNQLTELLTNYGPIGAIWFDGWWDQDINPDFDWELPEQYALIHRLQPACLVGNN HHQTPFAGEDIQIFERDLPGENTAGLSGQSVSHLPLETCETMNGMWGYKITDQNYKSTKT LIHYLVKAAGKDANLLMNIGPQPDGELPEVAVQRLKEVGEWMSKYGETIYGTRGGLVAPH DWGVTTQKGNKLYVHILNLQDKALFLPIVDKKVKKAVVFADKTPVRFTKNKEGIVLELAK VPTDVDYVVELTID >gi|226332022|gb|ACIB01000034.1| GENE 22 26017 - 27288 1366 423 aa, chain - ## HITS:1 COG:PM0938 KEGG:ns NR:ns ## COG: PM0938 COG0104 # Protein_GI_number: 15602803 # Func_class: F Nucleotide transport and metabolism # Function: Adenylosuccinate synthase # Organism: Pasteurella multocida # 5 418 6 425 432 384 49.0 1e-106 MKVDVLLGLQWGDEGKGKVVDVLTPKYDVVARFQGGPNAGHTLEFEGQKYVLRSIPSGIF QGDKVNIIGNGVVLDPALFKAEAEALEASGHNLKERLHISKKAHLILPTHRILDAAYEAA KGDAKVGTTGKGIGPTYTDKVSRNGVRVGDILHNFEQKYAAAKARHEQILKGLNYEDDLT ELEKAWFEGIEYLKQFQLVDSEHEINGLLDNGKSILCEGAQGTMLDIDFGSYPFVTSSNT VCAGACTGLGVAPNKIGDVYGIFKAYCTRVGSGPFPTELFDKTGDQICTLGHEFGSVTGR KRRCGWVDLVALKYSIMVNGVTKLIMMKSDVLDTFETIKACVAYKMNGEEIDYFPYDITD EVEPIYVELPGWQTDMTKMQSEDEFPEEFNAYLSFLEEQLGVQIKIVSVGPDREQTIIRY TEE >gi|226332022|gb|ACIB01000034.1| GENE 23 27285 - 27773 214 162 aa, chain - ## HITS:1 COG:Cj0400 KEGG:ns NR:ns ## COG: Cj0400 COG0735 # Protein_GI_number: 15791767 # Func_class: P Inorganic ion transport and metabolism # Function: Fe2+/Zn2+ uptake 
regulation proteins # Organism: Campylobacter jejuni # 1 154 1 157 157 75 30.0 4e-14 METQNVKDTVRQIFTEYLNANGHRKTPERYAILDTIYSIDGHFDIDMLYSQMMNQENFRV SRATLYNTIILLINARLVIKHQFGTSSQYEKSYNRETHHHQICTQCGKVTEFQNEALQNA IENTKLSKFQLSHYSLYIYGICSKCDRANKRKRVNNNNKKEK >gi|226332022|gb|ACIB01000034.1| GENE 24 27869 - 29536 1080 555 aa, chain + ## HITS:1 COG:FN1262 KEGG:ns NR:ns ## COG: FN1262 COG1807 # Protein_GI_number: 19704597 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family # Organism: Fusobacterium nucleatum # 35 390 30 376 519 159 29.0 1e-38 MTRVPFTQNYRWHLPLLLLVFCCSFFINNGAIFADIMESRNIITAREMVYDHNWLVPTMN GELRLEKPPLPTWIAAITEMISPDNLPLQRAMAGFAAVMLVLFFYKFATKLTDNRTYALV SSLVLCTSYNIILMGRTATWDIYCHAFMMGAIYYLYLALRQNPCKWTYFIGAGIFMGLSF LGKGPVSFYALLLPFVCAYILYYRKETQMKGKWIALAVMILIGIVLSTWWYAYIYIYHQE MASYVFHKESSSWSNHNVRSWYYYWQFFLETGVWSLLTLTTLLVPFWKKRVESSKEYLFC LSWMLLILFFLSLLPEKKTRYLLPILLPAALTMGHLFVYWIRQAKQKMPQLKDRVLYRIN AYLIVVAALALPIALYLFMYREGRMGTGMFVWLVVLFLTVAVWLFRSAFKLQPFSFLMGI VALFAVAELFVMPYIGSFVSNSDPKSISATRENPELQPLPFYHSKDEVLRIELVYEAHKK IGDMDLTNKEEIIKALPFVLISQKPAELLIPDSIRKDLNLRFIDCYDNNRWAKGHKRYDS VFISNVTIVEPIKEQ >gi|226332022|gb|ACIB01000034.1| GENE 25 29634 - 30296 701 220 aa, chain - ## HITS:1 COG:no KEGG:BF3424 NR:ns ## KEGG: BF3424 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 220 1 220 220 428 99.0 1e-119 MDVKEQLKDIKTQLRLSMNGVVSQSMREKGLDYKLNFGVELPRIKSIAAAYEKSHDLAQA LWKENIRECKILAGLLQPIDTFFPEIADIWVEDIRNIEIAELTCMNLFQNLPYAPAKTFQ WIADEAEYTQVCGYLTIARLLMKKGDMAQRPAGELLDQAICAVQSGSYHVRNAAMLAIRK YMQHSEEHAFQVCRLVEGMENSEKEAEQMLYAMVKAEIND >gi|226332022|gb|ACIB01000034.1| GENE 26 30300 - 32330 2185 676 aa, chain - ## HITS:1 COG:no KEGG:BF3425 NR:ns ## KEGG: BF3425 # Name: not_defined # Def: putative dipeptidyl-peptidase III # Organism: B.fragilis # Pathway: not_defined # 1 676 1 676 676 1351 99.0 0 
MAATVILLTACGGAKKTNTAEADNFKYTVEQFADLQILRYRVPGFENLTLKQKELVYYLT QAALEGRDILFDQNGKYNLTIRRMLETIYTDYAGDRNSPDFVNLTTYLKRVWFSNGIHHH YGSEKFVPGFTPEFLKQALLSVDASKLPLAQGQTVEQLFEELSPVIFDPKVMPKRVNQAD GEDLVLTSASNYYDGVTQQEAEAFYNALKNPKDETPVSYGLNSRLVKEDGKIIEKVWKVG GLYTQAIEKIVYWLKKAEGVAEDDAQKAAIGKLIEYYETGDLKTFDEYAILWVKDLNSRV DFTNGFTESYGDPLGMKASWESIVNFKDLEATRRTELISSNAQWFEDHSPVDKQFKKEKV KGVTAKVITAAILGGDLYPSTAIGINLPNSNWIRSHHGSKSVTIGNITDAYNKAAHGNGF NEEFVYSDTEKQLIDKYADLTGELHTDLHECLGHGSGKLLPGVDPDALKAYGSTIEEARA DLFGLYYVADPKLLELGLVPSEDAYKAEYYTYLMNGLMTQLVRIEPGNSVEEAHMRNRQL IARWVFEKGKADKVVEMVQKDGKTYVVVNDYQKLRHLFGELLAEIQRIKSTGDFDAARSL VETYAVKVDPELHSEVLTRYKKLNLAPYKGFVNPRYDAVIDEQGNIVDVQVTYDEGYAEQ MLRYSRDYSPLPSIND >gi|226332022|gb|ACIB01000034.1| GENE 27 32444 - 32908 494 154 aa, chain - ## HITS:1 COG:VCA0926 KEGG:ns NR:ns ## COG: VCA0926 COG2207 # Protein_GI_number: 15601680 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Vibrio cholerae # 42 151 261 364 365 62 32.0 4e-10 MSDLENKTQEETPKKRPYNLREKKEKKAAYRSLIRPELADELYDRILNIIVVQKKYRDPD YSAKDLAKELKTNTRYLSAVVNSRFGMNYSCLLNEYRVKDALHLLTDKRYADKNVEEISA MVGFANRQSFYAAFYKNVGETPNGYRKKHAEKKK >gi|226332022|gb|ACIB01000034.1| GENE 28 33142 - 34947 1354 601 aa, chain + ## HITS:1 COG:PM1427 KEGG:ns NR:ns ## COG: PM1427 COG0514 # Protein_GI_number: 15603292 # Func_class: L Replication, recombination and repair # Function: Superfamily II DNA helicase # Organism: Pasteurella multocida # 2 599 27 616 632 525 44.0 1e-149 MIQTLKTYFGYDSFRPLQEEIIHNLISKKDSLVLMPTGGGKSICYQLPALLMEGTAIVIS PLISLMKDQVETLRANGIPAGALNSSNDETENANLRRACISGQLKLLYISPEKLLSEADY LLRDMTLSLFAVDEAHCISQWGHDFRPEYARMGFLRNQFPNVPMIALTATADKITREDIV RQLQLRQPQIFISSFDRPNLSLSVKRGYQPKEKSKAIVDFITRHRGESGIVYCMSRSKTE TVAQMLQKHGIRCGVYHAGLSARQRDETQDDFINDRIEVVCATIAFGMGIDKSNVRWVIH YNLPKSIESFYQEIGRAGRDGMASDTILFYSLGDLILLTKFATESNQQNINLEKLNRMQQ YAESDICRRRILLSYFGETTTEDCGNCDVCRNPPERFDGTIIVQKALSAIARTNQQVSIT LLIDILRGNPTTEITEKAYTELKTFGAGRDIPARDWQDYLLQMLQMGYFEIAYNENNHLK 
ITGSGSDVLFGRKKAMLVVIRREEPATAKRRKKSAATPTQAWAEESRSKEEDLFEALRAL RKQLADQEALPAYIVLSDKVLHLLCLSRPTTVEAFGNINGIGEYKKKKYGKDFVALIRQF V >gi|226332022|gb|ACIB01000034.1| GENE 29 35342 - 36211 951 289 aa, chain - ## HITS:1 COG:alr0622 KEGG:ns NR:ns ## COG: alr0622 COG0457 # Protein_GI_number: 17228118 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Nostoc sp. PCC 7120 # 71 271 317 513 547 74 26.0 2e-13 MNRKTIMIPFREREGVRRKFLFLLFSCLLSVSLSAQTYQELSEKAVECVGKDSLVQAEDL LKQALKLEPKNAHNALLFSNLGLVQRKLGRYNDAVESYTYALNIAPLAVPILLNRAAIYL EQGMQDKAYVDYCQVMDVDKKNTEALLMRAYIYMLRRDYKGARLDYQRLLEIDPKNYNGR LGLGTLEQKENKFREALDILNQLLVEFPEDAVLYVARADVERDMKHDDLALVDLDEAIRL APDSIDAYLLRGDIYLDQKKKSLAKADFEKAISLGVPPADVHEQMQQCK >gi|226332022|gb|ACIB01000034.1| GENE 30 36268 - 37218 740 316 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 [Streptococcus pneumoniae SP6-BS73] # 4 311 3 307 308 289 49 2e-77 MAKIAKKLTELIGHTPLMELSGYSRKYGLQENIVAKLESFNPAGSVKDRVALSMIEDAEE RGVLQPGATIIEPTSGNTGVGLAMVATIKGYRLILTMPETMSLERRNLLKALGAQIVLTN GQKGMAGSIAKAEELKKSIPGSVILQQFENPANTEVHARSTGEEIWQDTDGEVAVFVAGV GTGGTVCGVARALKKHNPNVYIVAVEPASSPVLEGGKAASHRIQGIGANFVPGIYDASVV DEVMPVPDDEAIRGGRELASTEGLLVGISSGAAVYAARQLAGRPEFKGKMIVTLLPDTGE RYLTTELFAFDAYPLD >gi|226332022|gb|ACIB01000034.1| GENE 31 37309 - 39147 1487 612 aa, chain + ## HITS:1 COG:VCA0802 KEGG:ns NR:ns ## COG: VCA0802 COG1368 # Protein_GI_number: 15601557 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily # Organism: Vibrio cholerae # 198 591 204 617 657 179 29.0 2e-44 MKRRLIQFVTTYFLFVLIFVLQKPIFMGYYHTLYNKVSWTDYFSVMGHGLPLDFSLAGYL TAIPGLLLIASVWIQPAVIRQIRRGYFMIIAILLSCIFIGDLGLYEYWGFRLDATPLFYL FSSPKDALASVSIWVVMAGLAAMAVYAALLYLIFYRVLIYQKQPVKIPFHRLSVSGVLLL ATALLFIPIRGGFTVSTMNLSKAYFSSNQRLNHAAINPCFSLMESLSRQDNFDKQYRFMP AEEADKLFAELKDQPVAPTDSIPQLFTTERPNVILIILESFSSKLMETLGGESNVAINMD 
QFGREGVLFTHFFANSFRTDRGLAAIISGYPAQPTTSIMKYPKKTQHLPSIPSSLKKAGY DLQYYYGGDADFTNMRSYLIQAGIDNIVSDKDFPLSERLSKWGAHDHVVFNRLLDDLKQH TPQKPFMKILQTSSSHEPFEVPFRRLENPRLNAFAYADSCAGDFVRQFKETPLWKNTVIV LVPDHLGAYPQDIDNLTVDRYRIPLIFIGGAVKEPRQIGTYGSQIDIAATLLGQLGLPHE EFIFSKNMLNPNSPHFGFFTFPNAFGMMTPENEVVFNCESNSIVSDEGTHKGENLPKAKA YLQKLYDDLAKR >gi|226332022|gb|ACIB01000034.1| GENE 32 39172 - 39837 478 221 aa, chain + ## HITS:1 COG:no KEGG:BF3432 NR:ns ## KEGG: BF3432 # Name: not_defined # Def: putative membrane-associated phospholipid phosphatase # Organism: B.fragilis # Pathway: not_defined # 1 221 1 221 221 430 100.0 1e-119 MTNDMIQLLVNTDQNLLLYLNGFHNAFGDYFMSTFTGKWIWVPMYASILYVLLKNFNWKI TLCCLTAIALTILFADQVCASLIRPAVERLRPSNPANPISDLVHIVNNYRGGRYGFPSCH ASNSFGLAFFLVFLFRKRWLSLFILLWATLNCYTRIYLGVHYPGDLIVGAIIGCCGAAFM CYLLKKTARGASFGKVKHTEITIYVGLLTTIGILVYASIMA >gi|226332022|gb|ACIB01000034.1| GENE 33 39876 - 40343 -8 155 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLYCHKGRGTEGMKDKGSRETSFFLLKIPHRNAYRQANIFKVFGFSEGGESKKANCTTFL RIASVLYTGRHALRPVMCMERFYPKEWHESQQQQICCQNSVLLYMFYPFHLSGKYRFLCG SVQYLYVGISLLDIVRRIRTWSVTLCRIYKLPEIG >gi|226332022|gb|ACIB01000034.1| GENE 34 40434 - 42572 1052 712 aa, chain + ## HITS:1 COG:XF0384 KEGG:ns NR:ns ## COG: XF0384 COG1629 # Protein_GI_number: 15836986 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Xylella fastidiosa 9a5c # 40 638 44 612 681 87 22.0 1e-16 MIFETYFKHVYLYVACLYATSGTGLYAQTLHQHAGTQRDTVPVTFPVQHLEEVVITAGRP GITAGAVSNRISSSEIHRAAGSSLATLLERISGVSSLSTGTTVSKPVIHGMHGNRILIIN NGARQTGQQWATDHAPEVDINESGNILVIKGADGVRYGSDALGGIIVMEQPPFAFGQEHP KGRIATFYGSNGHRYAATGSLEGTLPFLRNIAWRMQGTYSNSGDRSTAHYLLNNTGTRGL HFSASAGYDSGRLRIEGIYSHFGEQTGVMFGAQMGSEDLLAERIRLGRPVYTDPFTRHIS YPYQKVVHRTAIGKVRYNAGAAGVFYWQTSWQKDDRRENRIRRMNHSDIPAVALLLSSIQ NTFRWKLDYGPWQTEIGGQMIFTDNHSKAGTGIVPVIPNYTEMQAGAYGIQKYRYERTAV EAGIRLDRQETRAGGYDWTGNYYGGNRKFCNFTYGLGGHYRLSKYWELTSNFALTWRAPH VHELYSNGNELGSGMFVRGDASMNAERSHKWITSVSYRDKVFHIRLDSYLQWIKGYIYDE 
PLKENITVISGTYPVFRYRQTPAFFRGADFDFRFMPAASWEYHLIASFIRANEQGTGNYL PYIPSFHLSHELAWTHQTKSHILFRLTARHKFVAKQNRFNPATDLIPYTPPAYHLFGAEA SMECPVKYGNKLTLTVTADNLLNREYKEYTNRSRYYAHDMGRDIRCTLNWNF >gi|226332022|gb|ACIB01000034.1| GENE 35 42622 - 43638 1107 338 aa, chain + ## HITS:1 COG:no KEGG:BF3435 NR:ns ## KEGG: BF3435 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 338 1 338 338 692 100.0 0 MKTITSKTILMLLTGAFLVLLFNSCTKDPVIPEDETKNKLHEDPAKVTVRLVECHLHADW NEIQTNGGPHQNPESPARHIKRIQDITYELKAGQGWTLAEGSQKKFYVQKNGEYKNQGRF TPAPVYLMFIYYYNAKGELMNNQFVENGQENIHQHFFTPENIKPTFDGQIEADDNDPQKL IDYLYVDTTPWDKTYHSGEAEITGRDNPVGLKGIIRFLKDRKEFDLKVRLYHGYESKKNP QTGTFDPFYKPSGVLIQRGTWDINLNLPVVVFWSREEFIDIDEEANLEKVGEDSLDEGSN RTVHSIMETFNLTWKEALEEFITYTYKAGDAEGGAIWL >gi|226332022|gb|ACIB01000034.1| GENE 36 43860 - 45479 1327 539 aa, chain - ## HITS:1 COG:no KEGG:BF3257 NR:ns ## KEGG: BF3257 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 539 1 539 539 1110 99.0 0 MKKNIHILLAATLLFGATSCEDFLQKDPPSSPSQSVFWQKKSDFESALAGTYSVMYSGVF SQIMPCLDGLTDNAIVQHSEGTYGWAKTIAQGDLTPNQGGFVTDIYGNCYKGIARVHILM EQLDQYTGSDISADEKKFMLAQCKALRGYFYSWLYQCYKEVPVVTQSLDLNTMYQPKATR AEVLNRVMTDYDEAISELPDKLYSDSQTSGRFTVSAVKALKARILLFDAYDSNGKAISSK MEEVLTLLQSIKGYSLAARVRDNFISEKQLASPEIMFSVRFLRPNLTHSMDLYYGAWAVL DPTRNMVDAFECTDGKPWGESLLTVRPDESILYGNDDALKKAERAKMFMNRDRRLYESVN HSMMANFVDDGFKDEEVQVNESNNKGPTGFSALKYVQPTDVTPGYSTVSDADIVVLRYAH VLLMIAEAENEAHGATTTALNAINEVRTRSGQPAIEAGISQDDLRERIRNEWRIETCFEG LRYFQLKRWKLMDKRVNGVEDPAYPGYIKVYKPAFEFFPIPQSEIDKAGGVLKQDPAYE >gi|226332022|gb|ACIB01000034.1| GENE 37 45492 - 48671 2693 1059 aa, chain - ## HITS:1 COG:no KEGG:BF3437 NR:ns ## KEGG: BF3437 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1059 1 1059 1059 2098 100.0 0 MTKRTNLFPSLIKTREMNCLKIAGASLLLLCISPQFAVADGLKQDAVTIMQQQNLKVSGV VTDEAGEPLIGVSVLVKGTTLGNITDLNGRFSLDVPEGSILEISYIGYKTQSIKAQREPM 
NIVLKEDAQKLDEVVVVGFGTQKKVNLTGSVSAVTGDDISKRPVANAAILLQGQIPGLRV NQGLGQPGGEGTSFRIRGQGTFSSAGSDPLILINGVPGSMTNLDPSVIESVSVLKDAASA AIYGARAANGVILVTTKQGAVGDKVHISYHGNVGLHTPTKLYDRVTNSVEYMELANLAWK NSGTGKQYTQDQINLYRNNVGDPQYPNFDWQDYMFRTAVVQTHNLSMAGSTEKTTYNVAL NFVDQPGTMRGFKYRKYNATIDLTARITNFIKVGTYANLMYGETEQPRQGQNDAFLSTLS QAPTYMPWLPDDGTGIRRWTSSAYSFESHNKNMPAIIGDNAMKRDNNFDINAQLWLEINL AKGLTWYTKGAARLQSNKSKDWRGSTTYTYDYHTGERSSELDKGGLGLSVGDGRRFYTNL YSYLKYDLSLVDNAHNFSLMVGYNQESEKYETLNAYRKDFAFDLPVLNAGGTADWSNSGG EEEWAIQSLFGRFNYDFKERYLFEANMRYDGTSRISDENRWGVFPSFSVAWRATEEEFIK NLNLNWLNNFKLRGSWGQLGNQNIGLYPYQAMISGVDDYPFTKTSDGVIIGYQQTAYANR NIKWETTTITDIGFDLQVFDGLSVTFDWYKKTTDDILRSSQVSSLLGLSAPTVNNGSVEN KGIEVALNYANMVKGGTFRGFRYNAGVYFDRSRNKLTEFGAEEIGSYSIKREGLPYDEYY MLECIGVFADQAEINASPKQFNDNTQPGDLKYKDISGPDGKPDGVIDNYDRRTFSGRFPG FEYGINASATWKGFDLSLIGQGVADKKYYTTDWGVQPFMQGSSPNKDYIKHMWTEENPYG AKHPKLYWQDMGGGKNTRPNSYYLKDASFFRLKNLTLGYTLPRVWTEKANISKVRIYFSG DNLLTLTPYKGLDPERNGDGRDAIYPQNRIYSFGLNVEF Prediction of potential genes in microbial genomes Time: Tue May 17 23:07:39 2011 Seq name: gi|226332021|gb|ACIB01000035.1| Bacteroides sp. 3_2_5 cont1.35, whole genome shotgun sequence Length of sequence - 11432 bp Number of predicted genes - 10, with homology - 10 Number of transcription units - 3, operones - 2 average op.length - 4.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 209 - 270 10.7 1 1 Op 1 . - CDS 294 - 1304 1085 ## COG0451 Nucleoside-diphosphate-sugar epimerases 2 1 Op 2 1/0.000 - CDS 1318 - 2871 972 ## COG1807 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 3 1 Op 3 2/0.000 - CDS 2861 - 3508 375 ## COG3952 Predicted membrane protein 4 1 Op 4 . - CDS 3513 - 4271 659 ## COG0463 Glycosyltransferases involved in cell wall biogenesis - Prom 4293 - 4352 7.1 - Term 4390 - 4432 8.2 5 2 Op 1 11/0.000 - CDS 4465 - 5526 1093 ## COG0473 Isocitrate/isopropylmalate dehydrogenase 6 2 Op 2 . 
- CDS 5539 - 7068 1613 ## COG0119 Isopropylmalate/homocitrate/citramalate synthases 7 2 Op 3 30/0.000 - CDS 7047 - 7637 419 ## COG0066 3-isopropylmalate dehydratase small subunit - Prom 7657 - 7716 6.5 8 2 Op 4 6/0.000 - CDS 7750 - 9144 1390 ## COG0065 3-isopropylmalate dehydratase large subunit 9 2 Op 5 . - CDS 9192 - 10688 1760 ## COG0119 Isopropylmalate/homocitrate/citramalate synthases - Prom 10816 - 10875 6.8 - Term 11014 - 11065 12.4 10 3 Tu 1 . - CDS 11090 - 11362 222 ## BF3449 hypothetical protein Predicted protein(s) >gi|226332021|gb|ACIB01000035.1| GENE 1 294 - 1304 1085 336 aa, chain - ## HITS:1 COG:BH3709 KEGG:ns NR:ns ## COG: BH3709 COG0451 # Protein_GI_number: 15616271 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Bacillus halodurans # 1 335 1 334 343 383 55.0 1e-106 MKALVTGAAGFIGSYTVKALVAQGCEVVGLDNINSYYDVQLKYDRLADTGITKESIEKDI LLPSAKYPSYRFIKMDLTDREGLTNLFKDEHFDIVVNLAAQAGVRYSIENPYAYIESNIV GFLNLLECCRHYPVNHLVYASSSSIYGLNDKVPYAETDKADSPVSLYAATKKSNELMAHA YSKLYSIPTTGVRFFTVYGPWGRPDMAPCLFMKAILNGDPIKVFNNGQMRRDFTYIDDII AGLMKIIAHPSADPIPFYIYNIGNSAPVELMDFISVIEKTAGKTAIKQMMGMQPGDVVCT YADTGRLEKDFGYKPSTSIEEGIQKFYDWYVGYFNK >gi|226332021|gb|ACIB01000035.1| GENE 2 1318 - 2871 972 517 aa, chain - ## HITS:1 COG:YPO2418 KEGG:ns NR:ns ## COG: YPO2418 COG1807 # Protein_GI_number: 16122638 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family # Organism: Yersinia pestis # 8 316 15 321 554 77 25.0 4e-14 MKSKFLPFLLLLAVLPVLIFRDYTPSNELRYLSIVDEALRNGDIFTFTNHGIQYADKPPL YFWILMLGKWLLGNHAMWFASLFSFIPALVIMLVMDRWVEREVSVANRLSAQLMLMSCGL FLGLAVVLRMDMLMCMFIVLALRTFYQMLKGQGSKNWNQFLFPFYIFMAVFSKGPVGILV PLVSTFIFLLITGRVKTFGRYWGWKTFAVLLLGCFIWFGGVCWEEGGLTYLHDLLFRQTV GRAVNAFDHSAPFYYYFISVWYSLAPWALFLVGIIIAGACRRLIRSDMERFFMVIILTTL LMLSCFSGKLAVYLAPTFPFFVYLAVLLLSHFRWNQWLALTLLLPAVVFVAGLPALIVLG 
RMPGTEFLGQKLFYVAGGILTVSGGTALYFLYRKKSLNKTINVLALGLFCAVFVGGWDVP AINGELGYSELCRKAVELSKEKNVSGYCVLNVRRSENMDVYLHERVKEVTEEEVLDNKYQ NTILMISNKKIRSNKKLEEFVNGKEHYVIGRFSVMVL >gi|226332021|gb|ACIB01000035.1| GENE 3 2861 - 3508 375 215 aa, chain - ## HITS:1 COG:CT411_1 KEGG:ns NR:ns ## COG: CT411_1 COG3952 # Protein_GI_number: 15605136 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Chlamydia trachomatis # 6 207 9 219 227 73 29.0 2e-13 MKGSGFIYVIGFLAQAFFSARIIFQWILSERAKKVVSPSIFWILSIAGSYLLCIYGWLRD DFSIIFGQFISYYIYLWNLNEKGIWNKLHGALKTLLVITPVIAAAFMLHDAQHFIDSFFR NEEVPLWLLIFGSMGQIIFTLRFVYQWAYSFHHKESLLPAGFWIISLVGSSVIVAYGVFR LDPVLILGQSVGFVAYFRNLMIGRKSSKQSVAYEK >gi|226332021|gb|ACIB01000035.1| GENE 4 3513 - 4271 659 252 aa, chain - ## HITS:1 COG:aq_1899 KEGG:ns NR:ns ## COG: aq_1899 COG0463 # Protein_GI_number: 15606924 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Aquifex aeolicus # 17 238 2 224 322 142 37.0 4e-34 MRDIYNEAMNKTINYQLTIVVPVYNEEDNIYSLEQKLGEFLPKSICTACVLFVNDGSRDN SKQRIMEVCARNKDFFYMDLAKNSGLSAAMKAGIDYTESVYVGYMDADLQTTPEDFNLLL KDIADYQLVMGIRANRKDSFFKNLQSKIANGFRRMMTNDGVQDTGCPLKVLHTAYAKRIP FFTGMHRFLPALILLQDAKIKQIPVRHFPRVAGTSKYHLWNRLVSPFLDCFAYRWMKKRY INYRIGDNNLIA >gi|226332021|gb|ACIB01000035.1| GENE 5 4465 - 5526 1093 353 aa, chain - ## HITS:1 COG:aq_244 KEGG:ns NR:ns ## COG: aq_244 COG0473 # Protein_GI_number: 15605790 # Func_class: C Energy production and conversion; E Amino acid transport and metabolism # Function: Isocitrate/isopropylmalate dehydrogenase # Organism: Aquifex aeolicus # 3 352 4 358 364 352 49.0 5e-97 MDFKIAVLAGDGIGPEISVQGVEVMSAVCEKFGHKVNYEYAICGADAIDKVGDPFPEETY RVCKNADAVLFSAVGDPKFDNDPTAKVRPEQGLLAMRKKLGLFANIRPVQTFKCLVHKSP LRAELVEGADFLCIRELTGGMYFGEKYQDNDKAYDTNMYTRPEIERILKVGFEYAMKRRK HLTVVDKANVLASSRLWRQIAQEMAPQYPEVTTDYMFVDNAAMKMIQEPKFFDVMVTENT FGDILTDEGSVISGSMGLLPSASTGESTPVFEPIHGSWPQAKGLNIANPLAQILSVAMLF 
EYFDCKAEGALIRKAVDASLDANVRTPEIQVEGGEKFGTKEVGAWIVDYIRKA >gi|226332021|gb|ACIB01000035.1| GENE 6 5539 - 7068 1613 509 aa, chain - ## HITS:1 COG:MK0391 KEGG:ns NR:ns ## COG: MK0391 COG0119 # Protein_GI_number: 20093829 # Func_class: E Amino acid transport and metabolism # Function: Isopropylmalate/homocitrate/citramalate synthases # Organism: Methanopyrus kandleri AV19 # 6 504 4 491 499 239 33.0 1e-62 MGKGVKIEIMDTTLRDGEQTSGVSFVPHEKLMIARLLLEELKVDRVEVASARVSDGEFDA VKMICDWAARRNLLQKVEVLGFVDGHTSLDWIHATGCRVINLLCKGSLKHCTCQLKKTPE KHIEDILAVVDYANELDIEVNVYLEDWSNGMKDSPEYVFRIVDALKETTIKRFMLPDTLG ILNPLQVIEFMRKMKKRYPDTHFDFHAHNDYDLAVSNVLAAVLSGVKGLHTTINGLGERA GNAPLASVQAILKDHFNAITNIDESRLNDVSRVVESYSGIVIPANKPIVGENVFTQVAGV HADGDNKSNLYCNDLLPERFGRVREYALGKTSGKANIRKNLESLGLELDEESMKKVTERI IELGDKKELVTQEDLPYIISDVLKHDGMSNKVKLKSYFVTLAHGLKPMATLSIEIDGQVY EESSSGDGQYDAFVRALRKIYKVTLGRKFPMLINYAVSIPPGGRTDAFVQTVITWSFGEK VFRTRGLDADQTEAAIKATIKMLNIIEEY >gi|226332021|gb|ACIB01000035.1| GENE 7 7047 - 7637 419 196 aa, chain - ## HITS:1 COG:HI0989 KEGG:ns NR:ns ## COG: HI0989 COG0066 # Protein_GI_number: 16272927 # Func_class: E Amino acid transport and metabolism # Function: 3-isopropylmalate dehydratase small subunit # Organism: Haemophilus influenzae # 3 193 2 193 200 178 47.0 6e-45 MKAKFNILTSTCVPLPLENVDTDQIIPARFLKATTKEGFGENLFRDWCYDKQGNKIDSFV LNDPTYGGQILVAGKNFGSGSSREHAAWAIADYGFRVVVSSFFADIHKNNELNNFVLPVV VTEEFLAELFDSIFKNPKMEVEVNLPEQTITNKATGKSEHFEINAYKKHCLMNGLDDIDF LLTNKDKIEQWEKASK >gi|226332021|gb|ACIB01000035.1| GENE 8 7750 - 9144 1390 464 aa, chain - ## HITS:1 COG:NMA1450 KEGG:ns NR:ns ## COG: NMA1450 COG0065 # Protein_GI_number: 15794355 # Func_class: E Amino acid transport and metabolism # Function: 3-isopropylmalate dehydratase large subunit # Organism: Neisseria meningitidis Z2491 # 3 461 5 466 469 513 56.0 1e-145 MNTLFDKIWDAHVVTTVEDGPTQLYIDRLYCHEVTSPQAFAGLRARGIKVFRPEKVYCMP DHNTPTHDQDKPIEDPVSKTQVDTLTKNAADFGLTHYGMMDKRNGIIHVVGPERGLTLPG MTIVCGDSHTSTHGAMGAVAFGIGTSEVEMVLASQCILQSRPKTMRITVDGKLGKGVTAK 
DIALYMMSKMTTSGATGYFVEYAGEAIRSLTMEGRLTLCNLSIEMGARGGMIAPDEVTFE YIKGRENAPQGEEWDQAVQYWKTLKSEDDAVFDKEVHFDAADIEPMITYGTNPGMGMGIT QHIPTTDGMNETTKASFLKSLDYMGFQPGEALLGKKIDYVFLGACTNGRIEDFRAFASIV KGHQKAEHVIAWLVPGSWMVDAQIREEGLDKILEDAGFAIRQPGCSACLAMNDDKIPAGK YSVSTSNRNFEGRQGPGARTLLASPLVAAAAAITGVIADPRELM >gi|226332021|gb|ACIB01000035.1| GENE 9 9192 - 10688 1760 498 aa, chain - ## HITS:1 COG:VC2490 KEGG:ns NR:ns ## COG: VC2490 COG0119 # Protein_GI_number: 15642486 # Func_class: E Amino acid transport and metabolism # Function: Isopropylmalate/homocitrate/citramalate synthases # Organism: Vibrio cholerae # 1 498 1 500 516 461 50.0 1e-129 MSDRLFIFDTTLRDGEQVPGCQLNTVEKIQVAKALEVLGVDVIEAGFPISSPGDFNSVIE ISKAVTWPTICALTRAVQKDIDVAVDALKFAKHKRIHTGIGTSDSHIKYKFNSTREEIIE RAVAAVKYARRFVDDVEFYAEDAGRTDNEYLARVIEAVIKAGATVVNIPDTTGYCLPSEY GAKIKYLVDHVAGIENAIISTHCHNDLGMATANTMAGILNGARQVEVTINGIGERAGNTA LEEIAMIIKSHHEIDIETNINTQKIYPTSRMVSSLMNMPVQANKAIVGRNAFAHSSGIHQ DGVLKNVQTYEIIDPHDVGIDDNSIVLTARSGRAALKNRLSILGVTLDQEKLDKVYEEFL KLADRKKDIHDDDILVLAGADRSGNHRIKLEYLQVTSGVGVRSVASLGLNIAGEKFEAAA SGNGPVDAAIKALKRIIDRHMTLKEFTIQAISKGSDDVGKVHMQVEYDNQMYYGFGANTD IIAASVEAYIDCINKFTK >gi|226332021|gb|ACIB01000035.1| GENE 10 11090 - 11362 222 90 aa, chain - ## HITS:1 COG:no KEGG:BF3449 NR:ns ## KEGG: BF3449 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 90 1 90 90 90 100.0 2e-17 MKTKFFLAAAMVAFSFAMVSCGGNKTANSDAAADSTSVAVEAAESAATCCKSDSVACDST KACCKDKAACDQKTECPQKEDCAKKACDKK Prediction of potential genes in microbial genomes Time: Tue May 17 23:07:44 2011 Seq name: gi|226332020|gb|ACIB01000036.1| Bacteroides sp. 3_2_5 cont1.36, whole genome shotgun sequence Length of sequence - 10331 bp Number of predicted genes - 16, with homology - 15 Number of transcription units - 8, operones - 5 average op.length - 2.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 15 - 74 4.4 1 1 Op 1 . + CDS 107 - 1264 614 ## COG0582 Integrase 2 1 Op 2 . 
+ CDS 1338 - 1550 179 ## gi|253565242|ref|ZP_04842697.1| predicted protein
3 2 Op 1 . - CDS 2698 - 2832 87 ##
4 2 Op 2 . - CDS 2867 - 3787 606 ## BDI_1256 clindamycin resistance transfer factor BtgB
5 2 Op 3 . - CDS 3792 - 4376 396 ## BF0644 clindamycin resistance transfer factor BtgA
- Prom 4412 - 4471 3.0
- Term 4503 - 4541 -0.7
6 3 Tu 1 . - CDS 4631 - 4981 141 ## BF0646 hypothetical protein
- Prom 5058 - 5117 1.9
- Term 5048 - 5087 9.5
7 4 Op 1 . - CDS 5138 - 5779 386 ## BF0647 hypothetical protein
8 4 Op 2 . - CDS 5793 - 6023 88 ## gi|253565247|ref|ZP_04842702.1| transcription regulator
- Prom 6251 - 6310 5.1
9 5 Op 1 . - CDS 6479 - 7009 205 ## PROTEIN SUPPORTED gi|124009622|ref|ZP_01694295.1| nucleotidyltransferase plus glutamate rich protein grpb plus ribosomal protein alanine acetyltransferase
10 5 Op 2 . - CDS 7084 - 7314 140 ## BT_1587 hypothetical protein
11 5 Op 3 . - CDS 7298 - 7654 73 ## BF3291 hypothetical protein
- Prom 7691 - 7750 3.8
12 6 Tu 1 . - CDS 7755 - 8324 317 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases
- Prom 8354 - 8413 7.3
+ Prom 8337 - 8396 7.9
13 7 Op 1 . + CDS 8545 - 8859 240 ## BF0653 hypothetical protein
14 7 Op 2 . + CDS 8846 - 9139 234 ## BT_2337 hypothetical protein
15 7 Op 3 . + CDS 9193 - 9486 239 ## BT_2336 hypothetical protein
+ Term 9711 - 9756 2.0
- Term 9694 - 9750 4.0
16 8 Tu 1 .
- CDS 9771 - 10331 359 ## BF0656 tyrosine type site-specific recombinase Predicted protein(s) >gi|226332020|gb|ACIB01000036.1| GENE 1 107 - 1264 614 385 aa, chain + ## HITS:1 COG:Ta1314 KEGG:ns NR:ns ## COG: Ta1314 COG0582 # Protein_GI_number: 16082303 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Thermoplasma acidophilum # 228 379 110 275 283 67 31.0 4e-11 MNTRISYKFVLKAEANSRGLYPVYMRAFLHGKKIEIATSITIVEGDWSETKQRVKRRNKL NEKHNMILEAFEKKALKCILDNFVYEETPLTLRQFKDYMLSIGQTENSFTDYILNYLNEN KSRLRSESWWSYKSQITKLLKFRKHISFADLTEKFINEYQHYMLYTLHNNENTVSKSLRS LRTFINIAMRYGLIKTNPFKYITIKKVDGKRDFLSAEELSKLSDAYISNKIIDEQEKEVL QYFLFSCYTGLRYSDLKMLKTSSIKNNILHINMHKTNCLVSIPLSQKALQLLPNKINSES DYVFRVYCNKVTNRVLKQVGKRYGIPKKLTCHVARHTFATVSIILGIPIEVVSKLLGHSS LKTTQIYAKIVDSVKEREMEKWDKL >gi|226332020|gb|ACIB01000036.1| GENE 2 1338 - 1550 179 70 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565242|ref|ZP_04842697.1| ## NR: gi|253565242|ref|ZP_04842697.1| predicted protein [Bacteroides sp. 3_2_5] # 1 70 1 70 70 104 100.0 2e-21 MAGHTLKLMIYTIVLRDGEKKDSNFKKLHQAIYESTEKEKAKLFDIYKDRFLKSFKEKFV LSITTQKALQ >gi|226332020|gb|ACIB01000036.1| GENE 3 2698 - 2832 87 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MECFKYCKSLVYLTVNNIIDTQRNSRVDLHLFIRKLKVNFHLIA >gi|226332020|gb|ACIB01000036.1| GENE 4 2867 - 3787 606 306 aa, chain - ## HITS:1 COG:no KEGG:BDI_1256 NR:ns ## KEGG: BDI_1256 # Name: not_defined # Def: clindamycin resistance transfer factor BtgB # Organism: P.distasonis # Pathway: not_defined # 1 294 1 294 306 411 81.0 1e-113 MHIDFAPPSNGAYNNAESSRQLANYMEHEDLERMEKGIYTEGFFNLTDDNIYKSKVIKDI DTNIGQLLKTDAKFFAIHVSPSESELRVMGNTEQEQAEAMKRYIREVFISEYANNFNKGL SEADIKFYGKIHFNRSRSDNKLNMHCHLIVSRKDQVNKKKLSPLTNHKNTKKGTVTGGFD RVNLFKQAEQGFDKLFNYNRQQTESFDYYNTMKNSSISEQLEIQNQNFTGEKKKERFQSC EKENNISCNLDSKQDNKYSNNQQNNSGGDSLLSIFSLDDNNNYDVTLAEELQVQKRKKKK QRGIRR >gi|226332020|gb|ACIB01000036.1| GENE 5 3792 - 4376 396 194 aa, chain - ## HITS:1 COG:no KEGG:BF0644 NR:ns ## KEGG: BF0644 # Name: not_defined # Def: 
clindamycin resistance transfer factor BtgA # Organism: B.fragilis # Pathway: not_defined # 1 194 1 194 194 326 97.0 2e-88 MPNSSRKTIFTTISIDKETAALVEKICKRYSLKKSEVVKLAFGYIDKAHINPAEAPESVK SELAKINKRQDDIIRFIRHYEEEQLNPMIRVTNSIALRFDAIGKTLETLILSQLEASQEK HTAVLKKLSEQFCNHADVINNQSKQINALYQIHQRDNKKLLHLIQLYSELSACGVMDSKR KENLKTEITNLINT >gi|226332020|gb|ACIB01000036.1| GENE 6 4631 - 4981 141 116 aa, chain - ## HITS:1 COG:no KEGG:BF0646 NR:ns ## KEGG: BF0646 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 110 146 74.0 2e-34 MYIDNENFDKWMEKLSKKLNEIGQNLQSLINTDTVLDPNDKLLDNQDLAFLLKVSYRTLQ RYRASGKLPYFMISHKTYYRASDIRIFVQENADCKSYERFKKENQLDKQTDAKQGG >gi|226332020|gb|ACIB01000036.1| GENE 7 5138 - 5779 386 213 aa, chain - ## HITS:1 COG:no KEGG:BF0647 NR:ns ## KEGG: BF0647 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 81 210 14 142 143 96 44.0 6e-19 MMEWDLTEKRIYQLLKHPYQFDKSFVEDMSSAYLEFVEELFDYLNRVNDNKLRIRQLNMN FVDFGTLKALEESCPTENSKLKLVFIDKLLSLITMEQELIYRQMEYPKFFINIESEWKSP FYLNNEVIKLVDMMELVCGIFYISGGIVRIDHKEIFLSDVARIFEKMFNVNFGDIYKKEI AVIKRKPIKITEFLDRLKAAIIQKSKDEGYYQP >gi|226332020|gb|ACIB01000036.1| GENE 8 5793 - 6023 88 76 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565247|ref|ZP_04842702.1| ## NR: gi|253565247|ref|ZP_04842702.1| transcription regulator [Bacteroides sp. 
3_2_5] # 1 76 76 151 151 127 100.0 2e-28 MVSKVLRTLEKKQYISRTGSIHDTRIRIISLTENGTEISQKSINIVEAVDTKFFSILNND LQIFLKCMNTLSNQDD >gi|226332020|gb|ACIB01000036.1| GENE 9 6479 - 7009 205 176 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|124009622|ref|ZP_01694295.1| nucleotidyltransferase plus glutamate rich protein grpb plus ribosomal protein alanine acetyltransferase [Microscilla marina ATCC 23134] # 1 154 1 154 174 83 30 6e-16 MKQYFETQRLIFRSWQEEDISYLARLNSDDKVMEYFLKKLSYQQTIALYNQIQEEFTIYG FGAYSVEEKETGVFIGFVGLHNVTFEVDFAPAVEILWRLLPEFWGKGYATEAAIACLNYA KEELKLKEIVSFTSLLNKRSEHVMQKIGMTRIKEFNHPLVEPEHPLYRHILYKIVL >gi|226332020|gb|ACIB01000036.1| GENE 10 7084 - 7314 140 76 aa, chain - ## HITS:1 COG:no KEGG:BT_1587 NR:ns ## KEGG: BT_1587 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 76 174 250 250 112 68.0 4e-24 MTIYYDMQYPYVYQYIETITQYCEMNDFSVSLIQVDALQKAKELPCVFNNFAMFYKGVFE TVNLLNIDYLKRILRK >gi|226332020|gb|ACIB01000036.1| GENE 11 7298 - 7654 73 118 aa, chain - ## HITS:1 COG:no KEGG:BF3291 NR:ns ## KEGG: BF3291 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 113 1 113 115 165 70.0 5e-40 MKKSTVTRTDLLTVHLHDKQSLSKVEIKKIVIPQKGKADYHLHSCPVVGYVVSGTLLFQI EGQSSYLIKAGEAFYEPKNQPILHFDNASDSTPLIFVAYYLLEGDEDLITLLPHDDLL >gi|226332020|gb|ACIB01000036.1| GENE 12 7755 - 8324 317 189 aa, chain - ## HITS:1 COG:alr4010 KEGG:ns NR:ns ## COG: alr4010 COG0664 # Protein_GI_number: 17231502 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Nostoc sp. 
PCC 7120 # 30 183 36 190 196 109 37.0 2e-24 MDIKKLIIEEIGLSNSSYERLIEVTERFSLKKKDFLLQQGKVCTFIGFVEKGTLRSYIEK DGEEYTSDFYTDGSFTTSYRSFLTKEPSIGSIQALENSLILSLSKSNYELLLQESNEWYK LGKYIADTLFIRKCGKESSLLMDSALDRYKLLLRTFPHIEQHVSQYHIASYLGIKPESLS RLKSLNIGQ >gi|226332020|gb|ACIB01000036.1| GENE 13 8545 - 8859 240 104 aa, chain + ## HITS:1 COG:no KEGG:BF0653 NR:ns ## KEGG: BF0653 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 104 1 104 104 173 95.0 2e-42 MEVVTIEKRTFSYVCERFTEFAKRIESLCSTHTQKVENWLDSQEVCLLLGFSKRTLQYYR SSGRLAYSQIGNKIYYKSSDIERIIADSETQNQSPKQTTPYEKN >gi|226332020|gb|ACIB01000036.1| GENE 14 8846 - 9139 234 97 aa, chain + ## HITS:1 COG:no KEGG:BT_2337 NR:ns ## KEGG: BT_2337 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 97 1 97 97 162 90.0 4e-39 MKRTKEDYPSFNLFSIVGTWESINLNPTIIIYRSDKEYLLSIIYVSETTKQASPATYEIQ QDGSQYFITSASKRLYVDYDPAKDVLSISSLGDYLRN >gi|226332020|gb|ACIB01000036.1| GENE 15 9193 - 9486 239 97 aa, chain + ## HITS:1 COG:no KEGG:BT_2336 NR:ns ## KEGG: BT_2336 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 97 1 97 97 172 96.0 5e-42 MELINKDTLQIKEFISSLDSMLNGIESIVKYYKPHLNGERFLSNNEVSKKLNVSLRTLQE WRDTGLIPFIQIKGKIIYRQSDIDKLLQKHYFESWKE >gi|226332020|gb|ACIB01000036.1| GENE 16 9771 - 10331 359 186 aa, chain - ## HITS:1 COG:no KEGG:BF0656 NR:ns ## KEGG: BF0656 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 186 221 406 406 360 95.0 1e-98 DPEFLTMDEIKIILAKEFTIKRVEQVRDVFVFCIFTGLAFSDVKDLSPEHLVKDNKGELW IRKNRQKTKIMCNIPVLPVAASILDKYKDVAECTGKLLPVLCNQRMNSYLKEIADVCGIH KNLSTHTARHSYATSICLANGVSMENVAKMLGHADTNVTKHYARVLDQNIFKDMQKVNSC LSELAI Prediction of potential genes in microbial genomes Time: Tue May 17 23:09:39 2011 Seq name: gi|226332019|gb|ACIB01000037.1| Bacteroides sp. 
3_2_5 cont1.37, whole genome shotgun sequence Length of sequence - 281435 bp Number of predicted genes - 233, with homology - 230 Number of transcription units - 114, operones - 55 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) - TRNA 905 - 977 70.0 # Lys TTT 0 0 + Prom 1008 - 1067 6.1 2 2 Tu 1 . + CDS 1124 - 2782 1771 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains + Term 2800 - 2835 6.1 + Prom 2886 - 2945 9.6 3 3 Op 1 . + CDS 2978 - 5767 2988 ## BF1991 putative TonB-dependent outer membrane protein 4 3 Op 2 . + CDS 5791 - 7668 1689 ## BF1992 hypothetical protein + Term 7693 - 7739 3.1 - Term 7685 - 7722 1.4 5 4 Tu 1 . - CDS 7783 - 8868 725 ## COG3021 Uncharacterized protein conserved in bacteria - Prom 8892 - 8951 2.5 6 5 Op 1 . - CDS 9009 - 11708 2546 ## COG0642 Signal transduction histidine kinase 7 5 Op 2 1/0.071 - CDS 11721 - 12941 804 ## COG1215 Glycosyltransferases, probably involved in cell wall biogenesis 8 5 Op 3 2/0.000 - CDS 12938 - 13783 505 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 9 5 Op 4 4/0.000 - CDS 13806 - 14717 714 ## COG1442 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 10 5 Op 5 11/0.000 - CDS 14714 - 16981 1332 ## COG0438 Glycosyltransferase 11 5 Op 6 8/0.000 - CDS 16978 - 18432 1114 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid - Prom 18541 - 18600 6.4 12 5 Op 7 . - CDS 18644 - 19594 692 ## COG0463 Glycosyltransferases involved in cell wall biogenesis - Prom 19748 - 19807 1.9 + Prom 20134 - 20193 12.0 13 6 Tu 1 . + CDS 20274 - 20567 118 ## + Term 20583 - 20612 -0.5 14 7 Tu 1 . - CDS 21086 - 22306 1077 ## COG1215 Glycosyltransferases, probably involved in cell wall biogenesis - Prom 22363 - 22422 2.6 - Term 22407 - 22471 1.1 15 8 Op 1 . - CDS 22521 - 23411 526 ## COG1216 Predicted glycosyltransferases 16 8 Op 2 . - CDS 23457 - 24794 881 ## BF2060 putative transmembrane surface-related protein 17 8 Op 3 . 
- CDS 24787 - 27003 1876 ## BF2061 putative transmembrane protein 18 8 Op 4 . - CDS 27016 - 27765 489 ## BF2062 putative outer membrane protein - Prom 27841 - 27900 7.2 19 9 Tu 1 . - CDS 28077 - 29303 1146 ## BF2063 hypothetical protein - Prom 29386 - 29445 5.0 + Prom 29388 - 29447 4.9 20 10 Tu 1 . + CDS 29494 - 30891 1333 ## COG3579 Aminopeptidase C + Term 31008 - 31052 4.3 + Prom 31298 - 31357 6.3 21 11 Op 1 7/0.000 + CDS 31487 - 32839 1673 ## COG1726 Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrA 22 11 Op 2 9/0.000 + CDS 32871 - 34049 1449 ## COG1805 Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrB 23 11 Op 3 9/0.000 + CDS 34064 - 34741 825 ## COG2869 Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrC 24 11 Op 4 9/0.000 + CDS 34758 - 35390 763 ## COG1347 Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrD 25 11 Op 5 7/0.000 + CDS 35419 - 36045 757 ## COG2209 Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrE 26 11 Op 6 . + CDS 36068 - 37339 1488 ## COG2871 Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrF + Term 37354 - 37413 15.1 + Prom 37352 - 37411 4.2 27 12 Tu 1 . + CDS 37436 - 38755 1133 ## COG0513 Superfamily II DNA and RNA helicases + Prom 39096 - 39155 5.1 28 13 Op 1 6/0.000 + CDS 39269 - 40336 1083 ## COG1932 Phosphoserine aminotransferase + Prom 40363 - 40422 4.8 29 13 Op 2 2/0.000 + CDS 40450 - 41370 1242 ## COG0111 Phosphoglycerate dehydrogenase and related dehydrogenases 30 13 Op 3 . + CDS 41383 - 42630 1468 ## COG4198 Uncharacterized conserved protein + Prom 42734 - 42793 3.2 31 14 Op 1 . + CDS 42878 - 43279 315 ## COG0545 FKBP-type peptidyl-prolyl cis-trans isomerases 1 32 14 Op 2 . + CDS 43307 - 43924 626 ## COG1739 Uncharacterized conserved protein 33 14 Op 3 . + CDS 44002 - 45090 1120 ## BF2023 type II restriction enzyme HpaII 34 14 Op 4 . + CDS 45115 - 45684 495 ## COG0778 Nitroreductase 35 15 Op 1 . 
- CDS 45827 - 48676 2888 ## COG1003 Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 36 15 Op 2 . - CDS 48719 - 49357 472 ## COG0491 Zn-dependent hydrolases, including glyoxylases 37 15 Op 3 . - CDS 49377 - 50051 681 ## COG0357 Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division 38 15 Op 4 . - CDS 50091 - 50945 912 ## BF2028 hypothetical protein - Prom 51015 - 51074 6.9 + Prom 50883 - 50942 6.1 39 16 Tu 1 . + CDS 51045 - 51479 244 ## BF2029 hypothetical protein + Prom 51587 - 51646 1.8 40 17 Op 1 . + CDS 51668 - 53875 2232 ## BF2030 putative TonB-dependent outer membrane receptor protein 41 17 Op 2 . + CDS 53926 - 54237 356 ## BF2031 putative heavy-metal binding protein 42 17 Op 3 . + CDS 54289 - 56499 2376 ## COG2217 Cation transport ATPase 43 18 Tu 1 . - CDS 56675 - 57514 586 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 57534 - 57593 6.8 + Prom 57492 - 57551 9.0 44 19 Tu 1 . + CDS 57580 - 58245 572 ## COG0321 Lipoate-protein ligase B 45 20 Tu 1 . - CDS 58249 - 58527 58 ## BF2089 hypothetical protein - Prom 58750 - 58809 4.3 - Term 58534 - 58576 7.1 46 21 Op 1 . - CDS 58818 - 59594 921 ## BF2036 putative xylanase 47 21 Op 2 . - CDS 59632 - 60462 526 ## COG0657 Esterase/lipase 48 21 Op 3 . - CDS 60492 - 62480 2265 ## COG1297 Predicted membrane protein - Prom 62506 - 62565 4.3 - Term 62552 - 62589 4.8 49 22 Tu 1 . - CDS 62611 - 63006 385 ## BF2093 hypothetical protein - Prom 63033 - 63092 4.7 50 23 Op 1 13/0.000 - CDS 63190 - 64509 939 ## COG0642 Signal transduction histidine kinase 51 23 Op 2 . - CDS 64529 - 65899 1097 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains - Prom 66071 - 66130 6.1 + Prom 66002 - 66061 8.9 52 24 Tu 1 . + CDS 66089 - 67447 767 ## BF2042 hypothetical protein + Term 67507 - 67554 5.3 + Prom 67526 - 67585 3.9 53 25 Op 1 . + CDS 67629 - 68996 802 ## BF2097 hypothetical protein 54 25 Op 2 . 
+ CDS 69007 - 71421 1491 ## BF2098 putative transporter permease protein 55 25 Op 3 . + CDS 71438 - 72112 305 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 + Term 72118 - 72173 16.2 - Term 72104 - 72161 16.1 56 26 Op 1 . - CDS 72165 - 72764 566 ## BF2047 hypothetical protein 57 26 Op 2 . - CDS 72780 - 73385 591 ## COG0218 Predicted GTPase 58 26 Op 3 . - CDS 73400 - 74851 928 ## COG0591 Na+/proline symporter - Prom 74900 - 74959 2.9 59 27 Tu 1 . - CDS 75348 - 75863 389 ## COG1755 Uncharacterized protein conserved in bacteria - Prom 75884 - 75943 6.9 60 28 Tu 1 . - CDS 76147 - 76557 366 ## BF2052 hypothetical protein - Prom 76664 - 76723 5.0 + Prom 76746 - 76805 9.4 61 29 Op 1 . + CDS 76827 - 77450 531 ## COG0353 Recombinational DNA repair protein (RecF pathway) 62 29 Op 2 . + CDS 77489 - 77938 410 ## BF2107 putative transmembrane protein 63 29 Op 3 . + CDS 77942 - 78478 351 ## PROTEIN SUPPORTED gi|229254479|ref|ZP_04378409.1| acetyltransferase, ribosomal protein N-acetylase + Term 78691 - 78725 2.3 64 30 Op 1 . - CDS 78481 - 79071 476 ## COG1678 Putative transcriptional regulator 65 30 Op 2 . - CDS 79143 - 80459 988 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase - Prom 80652 - 80711 80.3 + TRNA 80632 - 80708 86.1 # Asp GTC 0 0 - Term 80863 - 80902 8.2 66 31 Op 1 . - CDS 80926 - 81453 276 ## gi|253565325|ref|ZP_04842780.1| predicted protein 67 31 Op 2 . - CDS 81464 - 81907 368 ## gi|253565326|ref|ZP_04842781.1| predicted protein 68 31 Op 3 . - CDS 81919 - 82368 299 ## gi|253565327|ref|ZP_04842782.1| predicted protein - Prom 82400 - 82459 3.6 69 32 Tu 1 . + CDS 82367 - 82516 96 ## + Term 82741 - 82771 2.0 - Term 82630 - 82686 0.1 70 33 Tu 1 . - CDS 82701 - 84047 792 ## BVU_2464 mobilization protein - Prom 84143 - 84202 4.2 71 34 Tu 1 . - CDS 84230 - 85198 316 ## BF1279 DNA primase - Prom 85271 - 85330 3.4 72 35 Op 1 . - CDS 85406 - 86476 765 ## BVU_2466 hypothetical protein 73 35 Op 2 . 
- CDS 86488 - 86796 124 ## BVU_2467 hypothetical protein - Prom 86917 - 86976 4.3 74 36 Op 1 . - CDS 87018 - 88052 561 ## BVU_2468 hypothetical protein 75 36 Op 2 . - CDS 88066 - 89352 903 ## BVU_2469 tyrosine type site-specific recombinase - Prom 89373 - 89432 6.0 + Prom 89923 - 89982 5.8 76 37 Op 1 . + CDS 90171 - 91304 895 ## COG0667 Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 77 37 Op 2 . + CDS 91312 - 92304 817 ## COG1853 Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family + Prom 92307 - 92366 5.7 78 37 Op 3 . + CDS 92400 - 93251 717 ## COG0656 Aldo/keto reductases, related to diketogulonate reductase + Term 93272 - 93322 8.2 - Term 93357 - 93404 0.0 79 38 Tu 1 . - CDS 93413 - 93598 182 ## BF2076 hypothetical protein - Prom 93706 - 93765 1.8 + Prom 93599 - 93658 3.0 80 39 Tu 1 . + CDS 93817 - 94326 485 ## COG0656 Aldo/keto reductases, related to diketogulonate reductase + Prom 94659 - 94718 6.3 81 40 Op 1 . + CDS 94943 - 95809 653 ## BF2131 hypothetical protein 82 40 Op 2 . + CDS 95829 - 97262 1092 ## COG0534 Na+-driven multidrug efflux pump 83 40 Op 3 . + CDS 97266 - 98429 1169 ## COG1168 Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities + Prom 98452 - 98511 3.4 84 41 Op 1 . + CDS 98549 - 99493 895 ## COG0667 Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 85 41 Op 2 2/0.000 + CDS 99510 - 100517 728 ## COG2159 Predicted metal-dependent hydrolase of the TIM-barrel fold 86 41 Op 3 4/0.000 + CDS 100547 - 101374 699 ## COG1028 Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) + Prom 101376 - 101435 4.9 87 41 Op 4 1/0.071 + CDS 101516 - 102328 642 ## COG2207 AraC-type DNA-binding domain-containing proteins + Prom 102332 - 102391 3.6 88 42 Op 1 1/0.071 + CDS 102459 - 103292 661 ## COG0599 Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit 89 42 Op 2 . 
+ CDS 103294 - 103866 465 ## COG0716 Flavodoxins 90 42 Op 3 . + CDS 103878 - 104144 235 ## BF2140 hypothetical protein 91 42 Op 4 . + CDS 104146 - 104523 315 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 92 42 Op 5 . + CDS 104528 - 105592 516 ## COG2220 Predicted Zn-dependent hydrolases of the beta-lactamase fold + Term 105649 - 105704 13.7 + Prom 106133 - 106192 4.2 93 43 Tu 1 . + CDS 106258 - 106629 150 ## BF2143 hypothetical protein + Term 106835 - 106876 1.6 + Prom 106749 - 106808 7.2 94 44 Op 1 2/0.000 + CDS 106905 - 107657 231 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 95 44 Op 2 . + CDS 107683 - 108846 1155 ## COG0667 Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) + Prom 108860 - 108919 2.9 96 45 Op 1 . + CDS 108941 - 109618 609 ## COG1985 Pyrimidine reductase, riboflavin biosynthesis 97 45 Op 2 1/0.071 + CDS 109626 - 110267 643 ## COG1073 Hydrolases of the alpha/beta superfamily + Prom 110353 - 110412 1.6 98 46 Op 1 . + CDS 110472 - 110696 207 ## COG1073 Hydrolases of the alpha/beta superfamily 99 46 Op 2 . + CDS 110773 - 112008 704 ## COG2311 Predicted membrane protein 100 46 Op 3 . + CDS 112048 - 113064 912 ## COG0667 Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) + Term 113137 - 113174 5.2 + Prom 113867 - 113926 6.1 101 47 Tu 1 . + CDS 113979 - 114464 155 ## BF2088 hypothetical protein + Prom 114870 - 114929 4.6 102 48 Op 1 11/0.000 + CDS 114972 - 116180 904 ## COG0845 Membrane-fusion protein 103 48 Op 2 . + CDS 116199 - 119312 2674 ## COG3696 Putative silver efflux pump 104 48 Op 3 . + CDS 119335 - 120489 953 ## BF2093 outer membrane efflux protein 105 48 Op 4 . + CDS 120501 - 120737 124 ## BF2094 hypothetical protein - Term 120780 - 120837 -0.4 106 49 Op 1 . - CDS 120933 - 122609 719 ## BF2157 putative lipoprotein 107 49 Op 2 . - CDS 122621 - 123331 464 ## BF2096 hypothetical protein - Prom 123462 - 123521 5.0 108 50 Tu 1 . 
+ CDS 124102 - 126858 1219 ## BF2098 hypothetical protein + Term 126902 - 126942 1.7 + Prom 126880 - 126939 5.7 109 51 Op 1 . + CDS 126972 - 128390 999 ## BF2099 hypothetical protein 110 51 Op 2 . + CDS 128432 - 128935 440 ## BF2100 hypothetical protein + Term 128996 - 129047 0.4 + Prom 128997 - 129056 4.8 111 52 Op 1 . + CDS 129231 - 129878 596 ## BF2101 hypothetical protein 112 52 Op 2 . + CDS 129915 - 132272 1771 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 113 52 Op 3 . + CDS 132269 - 132985 661 ## BF2165 lipoprotein + Prom 132995 - 133054 5.4 114 53 Op 1 . + CDS 133076 - 133996 926 ## BF2104 hypothetical protein 115 53 Op 2 . + CDS 133998 - 134897 634 ## BF2105 hypothetical protein + Term 134938 - 134992 16.5 - Term 134925 - 134980 18.2 116 54 Tu 1 . - CDS 135004 - 135324 459 ## BF2168 hypothetical protein - Prom 135346 - 135405 4.3 117 55 Op 1 3/0.000 + CDS 135651 - 136262 460 ## COG1309 Transcriptional regulator + Term 136309 - 136352 8.7 + Prom 136333 - 136392 4.1 118 55 Op 2 . + CDS 136412 - 137230 551 ## COG3315 O-Methyltransferase involved in polyketide biosynthesis 119 55 Op 3 35/0.000 + CDS 137246 - 139057 1214 ## COG1132 ABC-type multidrug transport system, ATPase and permease components 120 55 Op 4 . + CDS 139061 - 140791 248 ## PROTEIN SUPPORTED gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P + Term 140887 - 140919 1.4 121 56 Tu 1 . - CDS 141092 - 142627 1001 ## COG0642 Signal transduction histidine kinase 122 57 Tu 1 . + CDS 142902 - 143774 384 ## COG2207 AraC-type DNA-binding domain-containing proteins 123 58 Tu 1 . - CDS 143818 - 144282 282 ## BF2116 hypothetical protein - Prom 144500 - 144559 4.5 + Prom 144210 - 144269 7.7 124 59 Tu 1 . + CDS 144412 - 144879 408 ## COG1522 Transcriptional regulators 125 60 Op 1 . - CDS 144921 - 145244 134 ## BF2118 hypothetical protein 126 60 Op 2 . - CDS 145264 - 146112 298 ## BF2119 transcription regulator - Prom 146279 - 146338 6.2 + Prom 146212 - 146271 11.6 127 61 Op 1 . 
+ CDS 146313 - 146708 348 ## BF2120 hypothetical protein 128 61 Op 2 8/0.000 + CDS 146705 - 148009 1008 ## COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase 129 61 Op 3 . + CDS 148006 - 148551 500 ## COG1475 Predicted transcriptional regulators + Prom 148673 - 148732 6.5 130 62 Op 1 . + CDS 148888 - 149454 528 ## BF2182 hypothetical protein 131 62 Op 2 . + CDS 149481 - 150932 1304 ## BF2183 hypothetical protein 132 62 Op 3 . + CDS 150974 - 152533 1467 ## BF2126 hypothetical protein 133 62 Op 4 . + CDS 152565 - 153545 816 ## BF2185 hypothetical protein + Term 153592 - 153644 13.3 134 63 Tu 1 . - CDS 153660 - 154619 447 ## BF2186 hypothetical protein - Prom 154769 - 154828 3.2 - Term 154777 - 154817 8.3 135 64 Tu 1 . - CDS 154869 - 155828 724 ## BF2187 hypothetical protein 136 65 Tu 1 . + CDS 156080 - 156397 176 ## BF2131 hypothetical protein + Prom 156405 - 156464 5.9 137 66 Op 1 . + CDS 156538 - 157476 525 ## COG0451 Nucleoside-diphosphate-sugar epimerases 138 66 Op 2 . + CDS 157511 - 158638 922 ## COG0642 Signal transduction histidine kinase + Term 158711 - 158759 1.5 - Term 158750 - 158801 14.4 139 67 Op 1 . - CDS 158883 - 161756 1890 ## BF2191 hypothetical protein 140 67 Op 2 . - CDS 161753 - 164926 1998 ## COG1074 ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) 141 67 Op 3 . - CDS 164950 - 166527 1308 ## COG3525 N-acetyl-beta-hexosaminidase - Prom 166599 - 166658 4.6 + Prom 166617 - 166676 6.7 142 68 Tu 1 . + CDS 166784 - 169423 2030 ## BF2137 hypothetical protein + Prom 169473 - 169532 7.4 143 69 Op 1 . + CDS 169552 - 172857 2536 ## BF2195 hypothetical protein 144 69 Op 2 . + CDS 172880 - 174211 1115 ## BF2139 hypothetical protein + Term 174245 - 174286 2.4 - Term 174574 - 174603 1.4 145 70 Op 1 . - CDS 174714 - 176084 583 ## COG2425 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 146 70 Op 2 . 
- CDS 176059 - 176226 186 ## BF2198 hypothetical protein - Prom 176447 - 176506 4.9 147 71 Tu 1 . - CDS 176521 - 177633 702 ## COG0714 MoxR-like ATPases - Prom 177805 - 177864 3.7 + Prom 177677 - 177736 3.0 148 72 Op 1 . + CDS 177821 - 178603 722 ## BF2199 hypothetical protein 149 72 Op 2 . + CDS 178581 - 178964 216 ## BF2200 hypothetical protein 150 73 Op 1 . - CDS 178970 - 179689 419 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) 151 73 Op 2 . - CDS 179664 - 180653 517 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) - Prom 180673 - 180732 4.9 152 74 Tu 1 . - CDS 180849 - 182315 932 ## COG3263 NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain - Prom 182341 - 182400 8.1 + Prom 182329 - 182388 6.1 153 75 Tu 1 . + CDS 182484 - 183674 859 ## PROTEIN SUPPORTED gi|168182407|ref|ZP_02617071.1| 50S ribosomal protein L18 154 76 Tu 1 . - CDS 183801 - 185432 1625 ## COG1151 6Fe-6S prismane cluster-containing protein - Prom 185499 - 185558 3.1 155 77 Tu 1 . - CDS 185948 - 186613 485 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases - Prom 186667 - 186726 2.5 156 78 Op 1 8/0.000 - CDS 186729 - 188042 927 ## COG5000 Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 157 78 Op 2 . - CDS 188039 - 189370 967 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains - Prom 189390 - 189449 4.8 158 79 Op 1 . - CDS 189509 - 190219 376 ## BF2151 hypothetical protein 159 79 Op 2 . - CDS 190194 - 191003 602 ## BF2152 calcineurin superfamily phosphohydrolase - Prom 191024 - 191083 7.6 + Prom 191179 - 191238 6.1 160 80 Op 1 . + CDS 191258 - 193546 1099 ## BF2210 putative ABC transport system, membrane protein 161 80 Op 2 . + CDS 193566 - 194246 333 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 162 80 Op 3 . 
+ CDS 194260 - 196569 1363 ## BF2212 putative ABC transport system, membrane protein 163 80 Op 4 . + CDS 196585 - 197259 338 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 164 80 Op 5 . + CDS 197276 - 199582 1416 ## BF2214 putative ABC transport system, membrane protein 165 80 Op 6 . + CDS 199613 - 201907 1105 ## BF2158 ABC transporter permease protein + Term 201921 - 201975 19.4 - Term 201914 - 201958 15.5 166 81 Tu 1 . - CDS 201991 - 203355 1208 ## COG0534 Na+-driven multidrug efflux pump + Prom 203346 - 203405 4.3 167 82 Op 1 . + CDS 203463 - 204125 714 ## COG0637 Predicted phosphatase/phosphohexomutase 168 82 Op 2 . + CDS 204197 - 205192 816 ## COG0167 Dihydroorotate dehydrogenase 169 82 Op 3 . + CDS 205207 - 205935 742 ## COG0657 Esterase/lipase + Prom 205944 - 206003 3.8 170 83 Op 1 . + CDS 206027 - 206848 904 ## COG0413 Ketopantoate hydroxymethyltransferase 171 83 Op 2 . + CDS 206856 - 208022 680 ## COG0477 Permeases of the major facilitator superfamily + Term 208044 - 208105 16.1 - Term 208032 - 208093 16.1 172 84 Op 1 2/0.000 - CDS 208111 - 208497 303 ## COG0818 Diacylglycerol kinase 173 84 Op 2 . - CDS 208500 - 210713 2004 ## COG0317 Guanosine polyphosphate pyrophosphohydrolases/synthetases - Prom 210814 - 210873 4.7 + Prom 210740 - 210799 5.1 174 85 Tu 1 . + CDS 210821 - 212338 1653 ## BF2167 hypothetical protein + Term 212350 - 212401 11.2 - Term 212338 - 212390 12.1 175 86 Tu 1 . - CDS 212408 - 212653 294 ## COG0724 RNA-binding proteins (RRM domain) - Prom 212673 - 212732 10.0 + Prom 212650 - 212709 8.3 176 87 Tu 1 . + CDS 212788 - 212922 61 ## BF2169 hypothetical protein + Term 212947 - 212999 2.0 177 88 Tu 1 . + CDS 213301 - 214467 959 ## COG0027 Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) + Prom 215027 - 215086 7.4 178 89 Op 1 42/0.000 + CDS 215110 - 216627 1608 ## COG0055 F0F1-type ATP synthase, beta subunit 179 89 Op 2 . 
+ CDS 216639 - 216884 296 ## COG0355 F0F1-type ATP synthase, epsilon subunit (mitochondrial delta subunit) 180 89 Op 3 . + CDS 216918 - 217346 351 ## BF2173 hypothetical protein 181 89 Op 4 . + CDS 217330 - 218481 995 ## COG0356 F0F1-type ATP synthase, subunit a 182 89 Op 5 . + CDS 218508 - 218765 445 ## CFPG_368 F-type ATP synthase C subunit 183 89 Op 6 38/0.000 + CDS 218781 - 219278 672 ## COG0711 F0F1-type ATP synthase, subunit b 184 89 Op 7 41/0.000 + CDS 219288 - 219848 520 ## COG0712 F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) 185 89 Op 8 42/0.000 + CDS 219862 - 221445 1773 ## COG0056 F0F1-type ATP synthase, alpha subunit 186 89 Op 9 . + CDS 221467 - 222339 916 ## COG0224 F0F1-type ATP synthase, gamma subunit + Term 222359 - 222430 16.3 + Prom 222395 - 222454 3.7 187 90 Tu 1 . + CDS 222497 - 225031 1596 ## COG0507 ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member 188 91 Tu 1 . - CDS 225189 - 225545 486 ## COG0526 Thiol-disulfide isomerase and thioredoxins - Prom 225687 - 225746 4.6 + Prom 225480 - 225539 6.3 189 92 Op 1 13/0.000 + CDS 225778 - 227040 1153 ## COG1538 Outer membrane protein 190 92 Op 2 11/0.000 + CDS 227054 - 228145 746 ## COG0845 Membrane-fusion protein 191 92 Op 3 . + CDS 228161 - 231286 2682 ## COG3696 Putative silver efflux pump 192 92 Op 4 40/0.000 + CDS 231321 - 231995 813 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 193 92 Op 5 1/0.071 + CDS 232013 - 233350 962 ## COG0642 Signal transduction histidine kinase + Prom 233373 - 233432 5.7 194 93 Tu 1 . + CDS 233534 - 234208 516 ## COG2095 Multiple antibiotic transporter 195 94 Tu 1 . - CDS 234398 - 236149 1403 ## BF2190 putative acetylhydrolase - Prom 236170 - 236229 3.2 196 95 Tu 1 . - CDS 236338 - 237045 438 ## COG2243 Precorrin-2 methylase - Prom 237080 - 237139 3.2 + Prom 236757 - 236816 3.4 197 96 Op 1 . 
+ CDS 236927 - 237145 69 ## gi|253565469|ref|ZP_04842924.1| predicted protein 198 96 Op 2 33/0.000 + CDS 237159 - 238307 1118 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component 199 96 Op 3 35/0.000 + CDS 238316 - 239353 1091 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 200 96 Op 4 . + CDS 239360 - 240388 1178 ## COG1120 ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components 201 97 Op 1 . - CDS 240522 - 241412 713 ## COG1091 dTDP-4-dehydrorhamnose reductase 202 97 Op 2 . - CDS 241425 - 241958 464 ## BF2196 hypothetical protein 203 97 Op 3 . - CDS 242022 - 242810 569 ## BF2197 hypothetical protein 204 97 Op 4 40/0.000 - CDS 242823 - 243518 697 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 205 97 Op 5 . - CDS 243515 - 245092 1045 ## COG0642 Signal transduction histidine kinase - Prom 245218 - 245277 4.1 + Prom 245064 - 245123 4.0 206 98 Tu 1 . + CDS 245250 - 247367 1292 ## BF2200 hypothetical protein + Term 247568 - 247603 -0.8 - Term 247319 - 247361 2.3 207 99 Tu 1 . - CDS 247394 - 249403 836 ## COG0642 Signal transduction histidine kinase - Prom 249539 - 249598 7.5 208 100 Tu 1 . + CDS 249572 - 251239 1357 ## COG2759 Formyltetrahydrofolate synthetase + Term 251321 - 251388 8.1 - Term 251463 - 251509 5.1 209 101 Tu 1 . - CDS 251535 - 252227 498 ## BF2203 hypothetical protein - Prom 252322 - 252381 5.0 - Term 252338 - 252381 5.4 210 102 Tu 1 . - CDS 252418 - 253698 1334 ## COG0112 Glycine/serine hydroxymethyltransferase - Prom 253735 - 253794 4.6 211 103 Op 1 . - CDS 253806 - 254558 366 ## BF2205 hypothetical protein 212 103 Op 2 . - CDS 254531 - 255100 413 ## COG1853 Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 213 103 Op 3 19/0.000 - CDS 255186 - 255647 472 ## COG1781 Aspartate carbamoyltransferase, regulatory subunit 214 103 Op 4 . 
- CDS 255652 - 256578 864 ## COG0540 Aspartate carbamoyltransferase, catalytic chain - Prom 256669 - 256728 9.0 215 104 Op 1 . + CDS 257010 - 257288 339 ## BF2209 hypothetical protein 216 104 Op 2 . + CDS 257285 - 257905 388 ## COG2431 Predicted membrane protein - Term 257766 - 257804 0.5 217 105 Op 1 . - CDS 257906 - 258823 595 ## BF2211 hypothetical protein 218 105 Op 2 . - CDS 258847 - 260226 849 ## BF2265 putative lipoprotein - Prom 260442 - 260501 9.6 + Prom 260200 - 260259 7.3 219 106 Tu 1 . + CDS 260461 - 261423 578 ## BF2214 hypothetical protein + Prom 261426 - 261485 5.7 220 107 Tu 1 . + CDS 261615 - 263774 1068 ## BF2267 hypothetical protein 221 108 Tu 1 . - CDS 263778 - 264728 878 ## COG2837 Predicted iron-dependent peroxidase - Prom 264805 - 264864 5.1 222 109 Op 1 . - CDS 264949 - 267033 1583 ## BF2218 putative outer membrane protein involved in nutrient binding 223 109 Op 2 . - CDS 267050 - 270319 2496 ## BF2270 hypothetical protein + Prom 270403 - 270462 6.3 224 110 Tu 1 . + CDS 270656 - 271144 430 ## BF2271 hypothetical protein + Term 271155 - 271197 -1.0 225 111 Op 1 . - CDS 271232 - 272548 350 ## BF2221 hypothetical protein 226 111 Op 2 . - CDS 272475 - 273776 672 ## BF2273 hypothetical protein 227 111 Op 3 . - CDS 273773 - 274603 465 ## COG1212 CMP-2-keto-3-deoxyoctulosonic acid synthetase - Prom 274631 - 274690 4.8 + Prom 274559 - 274618 9.8 228 112 Tu 1 . + CDS 274768 - 274899 56 ## 229 113 Tu 1 . + CDS 275460 - 275927 646 ## BF2275 hypothetical protein + Term 275989 - 276043 13.2 + Prom 276057 - 276116 1.7 230 114 Op 1 . + CDS 276138 - 278471 2153 ## COG5009 Membrane carboxypeptidase/penicillin-binding protein 231 114 Op 2 . + CDS 278485 - 278853 302 ## BF2228 putative 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase 232 114 Op 3 . + CDS 278916 - 280202 1124 ## COG0612 Predicted Zn-dependent peptidases 233 114 Op 4 . 
+ CDS 280253 - 281413 1134 ## COG4642 Uncharacterized protein conserved in bacteria Predicted protein(s) >gi|226332019|gb|ACIB01000037.1| GENE 1 3 - 660 480 219 aa, chain - ## HITS:1 COG:no KEGG:BF0656 NR:ns ## KEGG: BF0656 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 219 1 219 406 414 97.0 1e-114 MARKSFSVLFFIKKGKLLKNGEAPVCMRITVNGCMVDISIKRSCPVNLWNQAKENSKGKD RMSVELNHYLEITRSHVHQIYRELETSGKVITVDLVRKLFYGVDEDSKTLLQVFREHNEQ SRKLIGKDFVSKTVQRYETTTRYLEEFIKKEYQLSDIALNNLEANFISKFDVFLKIEKGC AQNSAITRLKNLKKIIRIALENDWIKKDPFAYYRFKPEK >gi|226332019|gb|ACIB01000037.1| GENE 2 1124 - 2782 1771 552 aa, chain + ## HITS:1 COG:RSc2913 KEGG:ns NR:ns ## COG: RSc2913 COG0488 # Protein_GI_number: 17547632 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Ralstonia solanacearum # 1 550 8 554 555 637 57.0 0 MVGVSKAFQPNKNVLKDIYLSFFYGAKIGIIGLNGSGKSTLLKIIAGLEKSYQGEVVFSP GYSVGYLAQEPYLDNTKTVKEIVMEGVQPIVDALNEYEEINQKFGLPEYYEDQDKMDQLF ARQGELQDIIDATDAWNLDSKLERAMDALRCPPEDQSVANLSGGERRRVALCRLLLQKPD ILLLDEPTNHLDAESIDWLEQHLQQYEGTVIAVTHDRYFLDHVAGWILELDRGEGIPWKG NYSSWLEQKTKRMEMEEKTASKRRKTLERELEWVRMAPKARQAKGKARLNSYDKLLNEDV KEKEEKLEIFIPNGPRLGNKVIEAKHVAKAYGDKLLFDDLNFMLPPNGIVGVIGPNGAGK TTLFRLIMGLETVDKGEFEVGETVKVAYVDQQHKDIDPNKSVYQVISGGNDLIRMGGRDI NARAYLSRFNFSGADQEKLCGVLSGGERNRLHLAMALKEEGNVLLLDEPTNDIDVNTLRA LEEGLEDFAGCAVVISHDRWFLDRICTHILAFEGDSNVFYFEGSYSEYEENKLKRLGNEE PKRVRYRKLIAD >gi|226332019|gb|ACIB01000037.1| GENE 3 2978 - 5767 2988 929 aa, chain + ## HITS:1 COG:no KEGG:BF1991 NR:ns ## KEGG: BF1991 # Name: not_defined # Def: putative TonB-dependent outer membrane protein # Organism: B.fragilis # Pathway: not_defined # 1 929 1 929 929 1726 99.0 0 MRRHLIHFLLVAVLTVCSAATAIAQITVKGQVVDAETGEPLIGAAVTIVGTTQGSVTNLD GMFTQKAASGSTLLIKYLGYKEFSKKITQKGGTEDLGVIKMAADAMVLNDVIITSSVAVS RKTPVAVSTVDPVFIEEKLGTQEFPEVLKSTPGIYATKQGGGFGDSKVNIRGFKTENSAM 
MINGVPMNDMEWGGIYWSNWAGLSDVTRSMQVQRGLGASKVAAPSVGGSINIVTNTIDAN KGGFVSYGMGNDGLNKILFKVSTGLTKSGWAMTLLGGKTWADGYIQGTNYEAYNWFVNIT KRFNDNHQLSFTAFAAPQWHNQRGNKDGLTIEGWQEVAKNYMNGEKPYRYNPTYGFGLNG QRKSSAYNVYNKPQLSLNHLWQINEKSSLSTALYASIGRGYGYSGQGLTSADRSNWYGSN NGNLNMTFRKADGTFAYDEIYALNEASENGSVMAMSKSKNFHNWYGLLSTYTTKFGDYFD FYGGIDYRYYKGTHTNELVDLYGGDFYVDSSSRKSVLASNNAAAAAGSSFVNQKLKVGDI VYRDFDGYVMSEGVFAQGEYNRDKLSAFISGSVSNTGYWRYDRFYYDKAHAKSKTVNFIG WNAKGGLNYNLTENHNVFANIGYISRAPFFSGGAFLNSTVSNATNPDAVNEKVFSFEIGY GYRSSFLTVNINAYHTRWMDKTTTRSQDITNYYEGSLSEPYDASKLVSTKSVINMQGVNA LHQGVELDFVAKPFQWLDLSGMFSIGNWRWDSNASGSFTVEGQFVNSASIKGSDGKDVTV LVNAAANGLEPGTMKLNLKDVKVGGSAQTTAALGATFKIDKALRLGIDWNLYARNYADWS LNSNDLVMNSEKDFSTPWRIPTASTFDLNASYKFNFGKLNAVLSGNVNNLFDQTYISDAT DGSNHDWKTAYNVFYGFGRTYSLRLKVNF >gi|226332019|gb|ACIB01000037.1| GENE 4 5791 - 7668 1689 625 aa, chain + ## HITS:1 COG:no KEGG:BF1992 NR:ns ## KEGG: BF1992 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 625 1 631 631 1082 94.0 0 MNRNLLMPVAVSLILLSGCKYNDDNFEGLDDMTQPTNLMKIEYTLTDADYATISSNSTNK KIATDAGVSKDLENVKTNMYLTEKITGADYIPAFLLDKYYTADKGSSAKITYKYKEAMSS LLSEYASVKYLKPTDAEYKLVYGENAFAPYLNEKTEGQMSKILNEKFKDAEKGTAVFVDY KLGEGQLENPLMWQNFEALPTGDLKELKGWFISSTGDTQWKVTSYDDNQYVQYSANGTKG ACVGWMVTPAISVTAGDYLAFDVTVGYYNASCLSVLISENFDGENVGTANWVDVTSDFSI PTKPTSGYGTFASAGKVPLSAYAGKKVYVAFKYEGDGANKKTTTYQIDNIMVGTSIPANS LSTPTYAVKVYDGKNWKNKSNSVYVLTYADYGDMGQSKRYFTSDVPAVNYLPAYLSKMVA YPVDGDARVVVYRYYNGTDLKIYSDEYTYSAEKARWELNTRIVDKTEQFVLSDGKWNFDP STVITLKAKGDAETSTFYQTIVDWVKEHYSEYVTSYGNNEYYYGSSAYQNNFDFRPDKWK VQNPAAYGTMSDDDLKKLMFERLPEAFLPALQSLYGDADVVEGVDVIYTINFGIYDGSDA QYTIKYKVTGKGQFEYVADSLKKVE >gi|226332019|gb|ACIB01000037.1| GENE 5 7783 - 8868 725 361 aa, chain - ## HITS:1 COG:DR0632 KEGG:ns NR:ns ## COG: DR0632 COG3021 # Protein_GI_number: 15805659 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Deinococcus radiodurans # 152 359 144 325 329 63 25.0 4e-10 
MGKKAVSKLFYCTSIALTFVLAGITIAGAFAGHIPPEHSTLMPFIGLALSGLLLINLAAA IYWGIRRRFWIIIPLIAIAANWQYLSRIFQPPFTAGEKEANTLKIATYNVDSFGNEQSGY SCKELAAYMKEHRVDIICFQEFAGNRYFTPDSIRNAFADWQYAVIPQAPDSTPILQVALF SKYPVKDSRLITYPDSRNCSMWCDLNVDGQTIRVFNNHLQTTEVSQNKRRLERELAKNEL TGREEAVAKQLLEGLNENFRKRAAQAKTLEQLIRTTPYPVLVCGDFNSLPSSYTYSTVKG DNLQDGFQTCGHGYMYTFRYFKRLLRIDYIFHSKEFKGVDYYSPDLDLCSDHNPVVMEVK M >gi|226332019|gb|ACIB01000037.1| GENE 6 9009 - 11708 2546 899 aa, chain - ## HITS:1 COG:MA2348_2 KEGG:ns NR:ns ## COG: MA2348_2 COG0642 # Protein_GI_number: 20091183 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Methanosarcina acetivorans str.C2A # 487 728 173 423 427 164 38.0 5e-40 MDRILHDVDNAHKILQLTSDTLILVDKNGTCLDIDPHSDLWFLQEDRLLGKNLFNLLPDH TFQKLLPDFRRVTQQGITVNRNYRLPLEGGETYYFKCIMQPYDGDKVLCQYRDITARSNV KLQLERTNYELKEIQKAAQIGQWKYSSRERTFYYRGYNGIVCTEEERSINFQDYYETILS EDLPAVNTWMEANRRELLKEYIEYRILLEGQVYYMRQQCYLRNEEEDGNIVLEGYIQNIT DIQRKRNDINTLTHAINNAKESVYAARRDGTLIFANRQFRLNHRIAEQADLSLIRVFDVV GDMTCIEDWEERYRSIREGQTLNFLAYQPLKHDKNTLAFEGTMYSVTTDDGEETFWSFTH DISERIRYESQIKRFNRIMDTTMENIPAGIVVKDIENDFRYIYRNRESYNRDISSENAIG MNDFDYYPPEMAQQKRKEDMEIAATGKGMHWIMEGKDKNGNLLILDKQKIMVESEDFSPI IVSIEWDITQLELMRRELIESKEKAETSDKLKSAFLANMSHEIRTPLNAIVGFSRIISES DNAEERREYYEIVDANNERLLQLINEILDLSKIESGIVEFTYGPVRLHTLCKEIHDAHVF RCPQGVELRFDSPDEALSIHSDKNRIFQVFSNLIGNAFKFTTEGSVSYGYKQEGERVVFY VKDTGLGIEPEKLGRVFQRFAKLNNFAQGTGLGLSICKTIIERLGGEIAVSSEVGTGTTF TFWLPLENVIQDTETGTNSHLPGEAVGTQPSEVLPAKEDTPRPKEETTEKEEDLRTTAVE TEKATILIAEDTDSNFDLLNAILGRKYRLVRAKDGMEAVTMYDEVNPDLILMDIKMPNLD GLEATRIIRQLSAEVPIIAQSAYAYEHDRNAAEEAGCNDFISKPIAQEKLKEKIKKWLK >gi|226332019|gb|ACIB01000037.1| GENE 7 11721 - 12941 804 406 aa, chain - ## HITS:1 COG:PAE0419 KEGG:ns NR:ns ## COG: PAE0419 COG1215 # Protein_GI_number: 18311929 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases, probably involved in cell wall biogenesis # Organism: Pyrobaculum aerophilum # 56 310 42 298 365 77 26.0 4e-14 
MNNTITTWIEILFWLSLFLVFYTHLGYGILLYMLVKLKELFVKPVRRQLPPDKRLPDVTL FITAFNEEEVVDEKMRNCLGLDYPADKLHIVWTTDGSNDSTNQRLENWPQATVHFQPLRQ GKTAAMTRGMMLVDTPLVVFTDANTMLNREAIREIVRAFEDPKVGCVAGEKRIAVQEKDG AAAGGEGIYWKYESTLKALDARLYSAVGAAGELFAVRRELFTVMEPDTLLDDFILSLRIA MQGYKIAYCTQAYAIESGSADMREEQKRKVRIAAGGLQSIWRLRELLNPFRYGVLTFQYV SHRVLRWSLAPVLLFALLPLNIAILLVGGSPVCYGTILALQLLFYIMGGWGYYLSTRQVK NKLLFIPYYFLFMNINVMKGVNYLRKKKGTGAWEKAKRTKTESLNQ >gi|226332019|gb|ACIB01000037.1| GENE 8 12938 - 13783 505 281 aa, chain - ## HITS:1 COG:PM0777 KEGG:ns NR:ns ## COG: PM0777 COG0463 # Protein_GI_number: 15602642 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Pasteurella multocida # 36 204 212 393 477 61 30.0 2e-09 MEWYTKYLSIFGLTLSEIPGDTLSEIGTLLHEKQSDTPLVSVVVIAHNEEPHILSCLWSL GNNEYSYPIEILVVNNHSTDRTEQALQAVGATYFNELKKGPGYARQCGLDHAKGRYHICI DADTMYPPHYIDTHVRNLMKPGVACTFSLWSFIPDKRHSRLGLWLYECLRDLHLSIQAIK RPELCVRGMTFGFNTELGRLFGFRTDIRRGEDGSLALAMKPYGRLIFITSRKARALTSNS TIDADGSLARSFGVRLLKALKGATGLLYKKTTYKDKDSNLL >gi|226332019|gb|ACIB01000037.1| GENE 9 13806 - 14717 714 303 aa, chain - ## HITS:1 COG:SP1767 KEGG:ns NR:ns ## COG: SP1767 COG1442 # Protein_GI_number: 15901598 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases # Organism: Streptococcus pneumoniae TIGR4 # 40 302 550 812 814 187 40.0 2e-47 MNGLQIIKTLALKIRYASIVEWYNFWSIVLQHHQNIVPTVASIDETIRHITEGNRSISRF GDGEMLLTSPSKSIGFQEGSPLLAKRLREVLVSHEEGHLVAIPDVFSGLNRYRRKCRRFQ RTHFFIYGKWWDQLLIPGRKYENAFLSRPYMDYTSKEHCARWFRELKTIWEGRDIVFIEG AMSRLGVGNDLFDNAGSIRRILCPPRNAFERYDRILNEALKVEKEVLFLIALGPTATVLA YDLHKAGYQAVDIGHIDVEYEWWRMKARRKVKLEKKYVNEAFGNKRVTDAGEGYRKEIIA QIS >gi|226332019|gb|ACIB01000037.1| GENE 10 14714 - 16981 1332 755 aa, chain - ## HITS:1 COG:SMb21250 KEGG:ns NR:ns ## COG: SMb21250 COG0438 # Protein_GI_number: 16264502 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # 
Organism: Sinorhizobium meliloti # 369 754 41 408 427 152 28.0 2e-36 MKALFLIFHGFEEANGISKKIRYQVKALKECGMDVHTCYLNEENGHKCRMIDNHTLRDYG SGIKGKLRKRFELQSIVKYILQENIRLVYMRSYHNANPFTISMVKQLKRQGVKVVMEIPT YPYDQEYITRRMKLDLLVDRCFRRKLAAQLDGIVTFSDAETIFGGHTIRISNGIDFDAIP QKITRNDTSRELHLIGVAEVHYWHGFDRIIKGMADYYATHPSYKVYFHIVGALTGERERR EILPVIAQYKLEPFVILHGQQHGEQLDKIFEQSDFGIGSLARHRSGITTIKTLKNREYAA RGLPFIYSETDTDFDDKPYVLKAPANETPVRIQAIVDFYQTQTWDPAGIRQSISHLSWHA QMQKVIGKYQVASEKKLKIAYCIPSIHCPGGMERVISLKVNYFTKKFGYDIHLILTDGKD KAPYYPLHPSITLHQLDINYDEMDGMPVLRHITGYVKKQKLFKKRLDACLNELKPDITIS TLRRDINIINNMTDGSIKLGEIHFNKSNYRELSDKPLPAFLKKWIRSYWMKQLIRKLRKL KRFIVLSYEDAAEWTELNNVTVIHNPLPFFPDRQSDGSRKQVIAAGRYVPQKGFDMLIKA WKIVSEQHPDWTLRIYGDGFREELQKLIDGLGISRSCILEHTVSNIVDKYCESSIFALSS RYEGFGMVLVEAMVCGVPPVSFACPCGPRDIIDDGNDGLLVPKENINKLAEKIGYLISHE NIRKEMGQRARIHVERFKIDHIASQWKELFNSLIS >gi|226332019|gb|ACIB01000037.1| GENE 11 16978 - 18432 1114 484 aa, chain - ## HITS:1 COG:L13324 KEGG:ns NR:ns ## COG: L13324 COG2244 # Protein_GI_number: 15672194 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Lactococcus lactis # 4 469 1 466 475 216 29.0 7e-56 MTSLKHQLFSGVFYTALAKYSGIGVSLVVAGVLARLISPDDFGVMAVATVIIAFFNLFTD VGLSPAIIQHKTLTGENLSGLFSFTVWTGIGLALLFAAASWPIAAYYDREILRPLCQLLA VNLFFASATIVPNALFYRNKEFKFIALRSFVIQIATGTAAVVAALCGAGLYALIIGPILS GILIFAVSIRRYPQRLKFTLGLDVLRRIFSYSAYQFLFNIINYFSRNLDKLLIGKYMGMS PLGYYEKSYRLMMLPLQNITQVITPVMHPIFSDYQDDLERLASGYERIVRFLAFIGLPLS VLLYFTAGEVTLIIFGDQWTPSIPVFRILTLSVGVQIILSSSGSIFQAAGDTKNLFMCGL FSSILNVTGILLGIFWFGTLEAVATCITLTFTVNFAQCYWMMYRMTLHRSLRHFAIQLIS PLMVSILLIAVLYPLSLMTEGGNIFLTLIVKSIIFFCIFGCYIQLTGEYDITGKAKSIIN KNKR >gi|226332019|gb|ACIB01000037.1| GENE 12 18644 - 19594 692 316 aa, chain - ## HITS:1 COG:YPO0187 KEGG:ns NR:ns ## COG: YPO0187 COG0463 # Protein_GI_number: 16120528 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Yersinia 
pestis # 10 258 6 255 329 115 30.0 1e-25 MEQPNKHPKVSVIIPIYNTCHYVQEALESICRQTLTELEIIAIDDGSTDSSRKVVEQVAA TDMRIRVYSQANQGQSITRNNGMQYVQGKYVYFMDSDDRLEADTLETCYSACEAGQLDFV CFDADILNKDHPCARHFNYDRSACARPKQVYKGVELLQRQLSARVFSPSPCLNLISTSYL KTSGLTFYPRIIHEDQLFTSQLYLKAEKVGYIPQKFFLRRFRTDSTMTRQFTWRNMEGYL TVTDELLKAAPSYPENVRRLIRQFLCQMLDAAVWQAHTLPLRQRIGLFLLCLRHYRPYVR MRTLMVLMLKKKEEGR >gi|226332019|gb|ACIB01000037.1| GENE 13 20274 - 20567 118 97 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MYLRHQPGYVACYTRSSLEHLAQCSRGTVELFSNVLLNFGGSQGHASGVDSGPSLVRNRG RNSLCGSACNSLWDRKKNSEKPIIQYKTRKDKIKQKQ >gi|226332019|gb|ACIB01000037.1| GENE 14 21086 - 22306 1077 406 aa, chain - ## HITS:1 COG:CAC1691 KEGG:ns NR:ns ## COG: CAC1691 COG1215 # Protein_GI_number: 15894968 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases, probably involved in cell wall biogenesis # Organism: Clostridium acetobutylicum # 48 278 50 271 425 75 28.0 2e-13 MILILTLIDWILFVPLALCVAYLLVYAIASKFYRAPHYPEADKLHRIAVFFPAYREDKVI IDSVRSFLEQDYPEEMYDVYVISDHMQDSTNKALSRLPIRLLVATYEDSSKAKALLLAIS TIEEDGKAKGLGNRQLAFMYDIAVVMDADNVTVPGFLSEVNRAYSAGVQAMQAHRTGKNL NTDIALLDGVSEEINNGFFRSGHNALGLSAGLAGSGMAFDYFWYYDAVQSLETAGEDKEL ELTLLECGMHTVYLEHLPVYDEKTQKKENIKNQRRRWMAAQFGILCEGLSFIKSVKQMEG WWRWWPSLDLVDKIIQWMLPPRLVQLVAVFGFTLLATLVYRPAASKWWILSAAQVAAMFI PVPARLLNGRLLKALTQVPSLALGTIASLFHLKGANKKFIHTEHGE >gi|226332019|gb|ACIB01000037.1| GENE 15 22521 - 23411 526 296 aa, chain - ## HITS:1 COG:CAC3069 KEGG:ns NR:ns ## COG: CAC3069 COG1216 # Protein_GI_number: 15896320 # Func_class: R General function prediction only # Function: Predicted glycosyltransferases # Organism: Clostridium acetobutylicum # 8 249 4 247 299 167 35.0 2e-41 MKQIPPLISFITICYNGLDDTCVLIESLRDTISSVSYEIIVVDNASRQDEASLIQERYPF VRTLRSEKNLGFSGGNNLGIQIAQGKYLFLINNDTYLTEDGLPALIERLESDPRIGAVSP KIRFAFPPQNIQFAGYTKLSRYTMRNKALGMGCPDDGTFDTPRPSAYLHGAALMLKREVI WKAGLMPEIYFLYYEELDWCTSMTRAGYQLWYEPRCTIFHKESQSTGRQSPLRTFYMMRN RMLYAWRNLPGMERCLAIAYQMLIVAPKDSLCFVLKGEPKLAKAVWHGVKGFWKLC 
>gi|226332019|gb|ACIB01000037.1| GENE 16 23457 - 24794 881 445 aa, chain - ## HITS:1 COG:no KEGG:BF2060 NR:ns ## KEGG: BF2060 # Name: not_defined # Def: putative transmembrane surface-related protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 2 440 3 441 444 632 66.0 1e-180 MNDELQKKRDPDQSEIAFYLLFSVQLVLIAIGIFVPIRMGVTTLPLVIALSMITLMRCTG QETDWKRGQNVMFGLFMVWGAYCLFEIGNPNHIQAAWNIAITHYWVYPLVFAFVVPLAIR NYKGIEWLLLIWSVFILIAAFKGYWQKSHGFNAKELYFLYTLGGYRTHLIWSGIRYFSCF PDAANFGVHSAMAVTTFGIATFYVRKRWMKVYFAIIAICAIYGMGISGTRAAMAVPFGGI ALFILVSKSWKSFALSTLAFIAVFSFFNFTTIGDGNQYIRKMRSAFRPSEDASYQVRVEN RKKMKGLMDEKPIGYGIGLSKGAQYGPKEVMPYPPDSWLVAVWVETGIVGLMLYLLIHGV LFAWCSWLLMFRIMNQRLRGLLAAWLCMNAGFFVAAYANDVMQYPNSIIVYTGFALCIAG PYIDPVMQKEDEELERERKEKKKRD >gi|226332019|gb|ACIB01000037.1| GENE 17 24787 - 27003 1876 738 aa, chain - ## HITS:1 COG:no KEGG:BF2061 NR:ns ## KEGG: BF2061 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 730 1 730 738 1159 80.0 0 MDYVLYLLRALYRVKWWILLGTALITAIVYFRTGNLRGGYNVEATLYTGVVSGYGVEEST KVDWALAQNSMDNLINIIQAESTLKRVSLRLFSRILVKGSPDKDNEGITAASYNYTYNHM KNSPDGKALIALIDRTSEDKTMENFQKYERQDKNNYIYGLFYFQHVYYSYQALKNIKVER KGSSDLLKVSYQSGDPGIAYNTIEILMKEFVNEYQALRYEGTDKVIEYFKAELKRIGGDL TKYEDDLTQYNVENRIINYYDETKEIAAINKEFELREQNVLFEYNSSRAMLEELERQMDA NSKRIIHNVELVDKLKQATNLTGKIIEMETISTAGDSTGMKLTEYKNRLIQSRRDLSTIA NQYVAGQQTKEGVAKATIVEQWLDQLLLFEKAKAELKIVQRSRSDLNAKYTHFAPVGTTI KRKERTISFTEQNYLTNLKSYNDALLRKKNLEMTAAILKVLNPPAYPINEEVASRKKIVM MAAAGSFIFLVALFLLIEAIDRTLRDSTRTRKQTGSIVLGAYPAPLKLSPISKQCEEIAT RYMSSAILRFFTERKEGMPFILNLLSTEQGSGKTYLAEQLQGYWESIGLKVRRLTDGTDF NSNSSAFTLAKNLTDLYTPGTEDILIVEYPSLEKANIPTPLLQDAQLNLLVASAVHGWKA TDKVLLQKLKSQLGTSPYLYLNRAPKYEVETYTGMLPPYTFVHKQMYRLSQLALTESLFN WKKNSRKKPQDIDDDDDE >gi|226332019|gb|ACIB01000037.1| GENE 18 27016 - 27765 489 249 aa, chain - ## HITS:1 COG:no KEGG:BF2062 NR:ns ## KEGG: BF2062 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: 
not_defined # 1 249 1 249 249 397 85.0 1e-109 MKRVLFILCSIPLLLFSPVSFANAQETSDFSKLNPEDYTNISLPPLDLLFENAKGGPIYE LASVKEQIERKLLAKERRSFLQFFSVRGSYQWGRFGVDNTFTDVATPIMYNYSTSKQKMY TVGGAINIPFNELFDLVPRVRRQKLTVKTAVLEREVKFEEMKREIIELYATATSQLNVLK LRAEALELANMQYDIAEKNFVNNTINTGDLSVEKERQSTALEAFEKSRFEVTKSLMILEV VTRTPILKK >gi|226332019|gb|ACIB01000037.1| GENE 19 28077 - 29303 1146 408 aa, chain - ## HITS:1 COG:no KEGG:BF2063 NR:ns ## KEGG: BF2063 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 408 10 417 417 807 99.0 0 MAIMPAAFVTAQDKGISTVKDGQTAVGNTFNTDTAKQDSTPALSKRELRRQRVARRNLHY NILGGPSYTPDFGLLIGGSALMTFRMNPSDTTQQRSVVPVAIALMFNGGLNLFSKPQLFF KGDRFRIFGQFSYKNTQENFYGIGYSTNKDYVRSDTTSQYRYSGLQINPWFLFRLGESNF FAGPQVDLNYDHMYDPAKYLVDQPSYKAAGGTDKGYKNFSSGVGFLLTYDTRDVPANAYR GMYLDFRGMMYQKFLGSDNNFYRLEIDYRQYKTLRKRGVLAWTAQTKNVFGDVPLNKYAL SGTPFDLRGYYMGQYRDKSSHVVMAEYRQMINTDKGNWVKRMLNHVGYVAWAGCGFMGPN PGKIEGVLPNMGVGLRIEVQPRMNVRLDLGRNMVNKQNLFYFNMTEAF >gi|226332019|gb|ACIB01000037.1| GENE 20 29494 - 30891 1333 465 aa, chain + ## HITS:1 COG:SPy1651 KEGG:ns NR:ns ## COG: SPy1651 COG3579 # Protein_GI_number: 15675522 # Func_class: E Amino acid transport and metabolism # Function: Aminopeptidase C # Organism: Streptococcus pyogenes M1 GAS # 34 465 11 444 445 275 35.0 2e-73 MNKRILSVFALSAAFLTVCAQQDKGGISPEMLNQIKQSYQGTTTDKAIRNAIGNNDIRKL ALNQDNLKGMDTHFSIKVDSKGITDQKSSGRCWLFTGLNVMRAKAIARYGLGSFEFSQNY NFFWDQLEKANLFLQGVIDTREKPMDDKMVEWLFRNPLSDGGTFTGVADIVSKYGLVPKE VMPETNSSENTSRMAGLIAEKLREYGLQLRDRAAKGAKQPELEKDKTEMLGTVYRMLVLN LGVPPTEFTWTRKDEKGNPVETAQYTPMSFLEKYGDKNLLDNYVMLMNDPSREYYKCYEI DFDRHRYDGRNWTYVNLPADEIKEMAIASLKDSTMMYFSCDVGKFLNSERGLLDVNNYDY ASLMGTTFGMDKKQRVQTFASGSSHAMTLMAVDVKDNKPVKWMVENSWGATNGYQGHLIM TDEWFDEYMFRLVVEKKFATPKVLEILKQKPIRLPAWDPMFAPEQ >gi|226332019|gb|ACIB01000037.1| GENE 21 31487 - 32839 1673 450 aa, chain + ## HITS:1 COG:YPO3240 KEGG:ns NR:ns ## COG: YPO3240 COG1726 # Protein_GI_number: 16123399 # Func_class: C Energy production and conversion # Function: 
Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrA # Organism: Yersinia pestis # 4 448 1 446 447 275 34.0 1e-73 MANVIKLRKGLDINLKGKAAEELSTVKEPGFYALVPDDFPGVTPKVVVKEQEYVMAGGPL FIDKNHPEVKFVSPVSGVVTSVERGARRKVLNIVVEAAAEQDYEEFGKKDVSKLDGEAVK AALLEAGMFAFMKQRPYDVIADPTVAPRAIFISAFDSNPLAPDFEYVLKGEEANFQTGLD ALAKIAKTYLGISIKQKSTALTQAKNVTVTVFDGPNPAGNVGVQINHVAPVVKGETVWTI GAEAVIFIGRLFNTGRVDLTRTVAVTGSEVVKPAYCKLKVGALLTHVFAGNVTKDKELRY ISGNVLTGKQVKPNGFLGAFDSQLTVIPEGDDIHEMLGWIMPRFNQFSVNRSYFSWLMGN KKEYVLDARIKGGERHMIMSGEYDKVFPMDILPEFLIKAIIAGDIDRMEALGIYEVAPED FALCEFVDSSKLELQRIVRAGLDMLRAEMM >gi|226332019|gb|ACIB01000037.1| GENE 22 32871 - 34049 1449 392 aa, chain + ## HITS:1 COG:PA2998 KEGG:ns NR:ns ## COG: PA2998 COG1805 # Protein_GI_number: 15598194 # Func_class: C Energy production and conversion # Function: Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrB # Organism: Pseudomonas aeruginosa # 3 386 2 401 403 319 44.0 6e-87 MKALRNYLDKIKPNFEEGGKLHAFRSVFDGFETFLFVPNTTSKSGAHIHDSIDSKRIMSI VVISLIPALLFGMYNVGYQHFTHTGAQGGFIEMFIYGFLAILPKIIVSYVVGLGIEFVVA QWKKEEIQEGFLVSGILIPMIVPVDCPLWILAIATAFAVIFAKEVFGGTGMNVFNVALVT RAFLFFAYPTKMSGDAVWVAQDSIFGLGNTVDGLTAATSLGVASTATDPNGFPAFSWDMV TGLIPGSIGETSVIAILIGAVILLWTGIASWRTMLSVFVGGAFMGWIFNTVGPDTAMAHM PWYEHLVLGGFCFGAVFMATDPVTSARTETGKYIFGFLIGAMAIIIRVLNPGYPEGMMLA ILLMNIFAPLIDYCVVQSNIKLREKRAIKSNN >gi|226332019|gb|ACIB01000037.1| GENE 23 34064 - 34741 825 225 aa, chain + ## HITS:1 COG:PA2997 KEGG:ns NR:ns ## COG: PA2997 COG2869 # Protein_GI_number: 15598193 # Func_class: C Energy production and conversion # Function: Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrC # Organism: Pseudomonas aeruginosa # 2 224 3 255 261 89 29.0 7e-18 MNTNSNSYTIIYASVMVVIVAFLLAFVSSSLKDIQNKNQELDTKKQILSALNIRDVKDAD AEYNKYVKGDMLMNVDGTLTENTDGFSISYEKEAKENNRLHVFVCEVDGETKYVVPVYGA GLWGAIWGYVALNADKDTVYGVYFSHASETPGLGAEIATTAFQNEFSGKKVLKDGQVALA VEKNGKVTDPAYQVDGISGGTITSKGVDAMIKACLSQYDKFLTNN >gi|226332019|gb|ACIB01000037.1| GENE 24 34758 - 35390 763 210 aa, chain + ## 
HITS:1 COG:HI0168 KEGG:ns NR:ns ## COG: HI0168 COG1347 # Protein_GI_number: 16272134 # Func_class: C Energy production and conversion # Function: Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrD # Organism: Haemophilus influenzae # 10 210 8 208 208 206 56.0 3e-53 MSQLFSKKNKEVFATPLGLNNPVTVQVLGICSALAVTAKLEPAIVMGLSVTVITAFSNVV ISLLRKTIPNRIRIIVQLVVVAALVTIVSEVLKAFAYDVSVQLSVYVGLIITNCILMGRL EAFAMANGPWESFLDGVGNGLGYAKILIIVAFFRELLGSGTLLNFRIIPESFYKMGYINN GLMLMPPMALIICACIIWYQRSRCKELQEK >gi|226332019|gb|ACIB01000037.1| GENE 25 35419 - 36045 757 208 aa, chain + ## HITS:1 COG:HI0170 KEGG:ns NR:ns ## COG: HI0170 COG2209 # Protein_GI_number: 16272135 # Func_class: C Energy production and conversion # Function: Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrE # Organism: Haemophilus influenzae # 1 208 1 198 198 207 62.0 8e-54 MEQLLSLFVRSIFVDNMIFAFFLGMCSYLAVSKNVKTAVGLGIAVTFVLVVTLPVNYLLQ TKVLAANAIIEGVDLSFLSFILFIAVIAGIVQLVEMVVERFSPSLYASLGIFLPLIAVNC AIMGASLFMQQRITMDPSNPQAITGVGSAVVYALGSGIGWLLAIVGLAAIREKMAYSDVP APLKGLGITFITVGLMAMAFMCFSGLKL >gi|226332019|gb|ACIB01000037.1| GENE 26 36068 - 37339 1488 423 aa, chain + ## HITS:1 COG:PA2994 KEGG:ns NR:ns ## COG: PA2994 COG2871 # Protein_GI_number: 15598190 # Func_class: C Energy production and conversion # Function: Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrF # Organism: Pseudomonas aeruginosa # 5 423 6 407 407 431 50.0 1e-120 MTSLILASIGVFLVVIILLVIILLVAKSYLSPSGEVTITMNGEQQLKTSQGGTLLGTLSA NNVFLSSACGGKGSCGQCRCQVLEGGGEILPTETGFFSRKEQADHWRLGCQVKVKQDMSI KIDESILGVKEWECEVISNKNVATFIKEFIVALPPGEHMDFVPGSYAQIKIPTFSMDYDK DIDKSLIGDEYLPAWEKFGLLGLKCRNDEPTIRAYSMANYPAEGDRIMLTVRIATPPFKP KDQGPGFMDVMPGIASSYIFTLKPGDKVTMSGPYGDFHPILDSKNEMMWIGGGAGMAPLR AQIMHLTKTLHITDRTMNYFYGARALNEVFYLEDFLQIEKDFPNFKFHLALDRPDPAADA AGVKYTAGFVHNVIYETYLKNHEAPEDIEYYMCGPGPMSKAVEKMLDDLGVPSKNLMFDN FGG >gi|226332019|gb|ACIB01000037.1| GENE 27 37436 - 38755 1133 439 aa, chain + ## HITS:1 COG:VC2564 KEGG:ns NR:ns ## COG: VC2564 COG0513 # Protein_GI_number: 
15642559 # Func_class: L Replication, recombination and repair; K Transcription; J Translation, ribosomal structure and biogenesis # Function: Superfamily II DNA and RNA helicases # Organism: Vibrio cholerae # 8 430 19 451 460 215 36.0 2e-55 MNGLKDILERLKIEQLNPMQEASVEAFNKGGEDLILLSPTGSGKTLAFLLPLVGSLKADV KGVQAVVLVPSRELALQIEQVFKAMGTEFKAMSCYGGRPAMEEHRTMKGMQPAVIIGTPG RMNDHLSKQNFDASTVSLLVIDEFDKCLEFGFQEEMATVIGQLPDLKRRFLTSATDAEEI PQFTGLNRTIKLDFLTNDVEESRLRLMKVVSPAKDKIETLYKLLCTLGSSSSIVFCNHRD AVDRVSALLTEKGVSNERFHGGMEQPDRERALYKFRNGSCPVLVSTDLAARGLDIPEVEH IIHYHLPVNEEAFTHRNGRTARWDATGTSYLILNPEEHVPDYIPSELEIFDLPENAPRPA KPQWVTIYIGKGKKDKLSKIDIAGFLYKKGNLAREDVGAIDVKDHYAFVAVRRPKMKQLL TLIRGEKIKGMKTVIEEAD >gi|226332019|gb|ACIB01000037.1| GENE 28 39269 - 40336 1083 355 aa, chain + ## HITS:1 COG:BS_serC KEGG:ns NR:ns ## COG: BS_serC COG1932 # Protein_GI_number: 16078066 # Func_class: H Coenzyme transport and metabolism; E Amino acid transport and metabolism # Function: Phosphoserine aminotransferase # Organism: Bacillus subtilis # 5 353 6 356 359 330 47.0 2e-90 MKKHNFSAGPSILPREVIEETAKAILDFNGSGLSVLEVSHRGKDFQAVMDEAVALFKEIL NIPEGYSVLFLGGGASMQFCMVPYNFLEKKAAYLNTGVWAKKAMKEAKGFGEVVEVASSA DANYTFIPKDFTIPADADYFHVTTNNTIYGTELKGDLDSPVPMVADMSSDIFSRPVDVSK YICIYGGAQKNLAPSGVTFVIVKDDAVGKVSRYIPSMLNYKTHIDGGSMFNTPPVLPIYS AMQTLRWIKAQGGVKEMDRRATEKADMLYAEIDRNKMFVGTAAKEDRSRMNICFVMAPEY KDLEADFLKFATDKGMSGIKGHRSVGGFRASCYNAMPKESVQALIDCMQEFEKLH >gi|226332019|gb|ACIB01000037.1| GENE 29 40450 - 41370 1242 306 aa, chain + ## HITS:1 COG:MJ1018 KEGG:ns NR:ns ## COG: MJ1018 COG0111 # Protein_GI_number: 15669207 # Func_class: H Coenzyme transport and metabolism; E Amino acid transport and metabolism # Function: Phosphoglycerate dehydrogenase and related dehydrogenases # Organism: Methanococcus jannaschii # 39 296 32 303 524 156 36.0 4e-38 MKVLVATEKPFAKIAVDGIKKEIEGAGFELVLLEKYTDKAQLLDAVKDANAIIIRSDIID AEVLDAAKELKIVVRAGAGYDNVDLNAATAHGVCVMNTPGQNSNAVAELVFGLLVYAVRN FYNGTSGTELMGKKLGIHAYGNVGRNVARIAKGFGMELYAYDAFCPKDVIEKDGVKAVDS 
AEELYKTCNIVSLHIPATAETKNSINHDLLANMPKGAILVNTARKEVINEDELIQLMEER PDFKYITDIMPAANTKFAELFAGRYFSTPKKMGAQTAEANINAGIAAARQIVGFLKEGCE KFRVNK >gi|226332019|gb|ACIB01000037.1| GENE 30 41383 - 42630 1468 415 aa, chain + ## HITS:1 COG:CAC0016 KEGG:ns NR:ns ## COG: CAC0016 COG4198 # Protein_GI_number: 15893314 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Clostridium acetobutylicum # 1 415 1 414 414 414 49.0 1e-115 MATIKPFKGIRPPQDLVEQVASRPYDVLNSEEARAEAAGNDKSLYHIIKPEIDFPVGTDE HDEKVYAKAAENFRLFRDKGWLVQDDKENYYIYAQTMNGKTQYGLVVGAYVPDYMNGVIK KHELTRRDKEEDRMKHVRVNNANIEPVFFAYPDNAVLDAIIRKYTAQKPVYDFIAPGDGF GHTFWVIDNSEDIAVITKEFAAMPALYIADGHHRSAAAALVGAEKAKQNPNHRGDEEYNY FMAVCFPANQLTIIDYNRVVKDLNGLTPAEFLTALEKNFEIEEKGKEIYKPNALHNFALY LDGKWYSLTAKPGTYDDNDPIGVLDVTISSNLILDEILGIKDLRSDRRIDFVGGIRGLGE LSRRVDSGEMKVALALYPVSMKQLMDIADTGNIMPPKTTWFEPKLRSGLVIHELE >gi|226332019|gb|ACIB01000037.1| GENE 31 42878 - 43279 315 133 aa, chain + ## HITS:1 COG:CC3636 KEGG:ns NR:ns ## COG: CC3636 COG0545 # Protein_GI_number: 16127866 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerases 1 # Organism: Caulobacter vibrioides # 12 132 46 167 177 112 43.0 1e-25 MGRKEEYKLQNEQFMQTLRTEADVHELPCGILYKVLEEGTGAATPRSNSVVSVHYKGTLI NGREFDNSWKRNCPEAFRLNEVIEGWQIALQKMRVGDHWIVYIPYNMGYGTRTSGPIPAF STLIFEVQLLGIA >gi|226332019|gb|ACIB01000037.1| GENE 32 43307 - 43924 626 205 aa, chain + ## HITS:1 COG:NMB2153 KEGG:ns NR:ns ## COG: NMB2153 COG1739 # Protein_GI_number: 15677966 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Neisseria meningitidis MC58 # 5 177 4 176 203 185 49.0 5e-47 MTEDTYKTITEVSEGTYTEKRSKFIAIALPVRTLEEIKVHLEAYQKKYYDARHVCYAYML GHERKNFRANDNGEPSGTAGKPILGQINSNELTDILIIVVRYFGGIKLGTSGLIVAYKAA AAEAIAACTIVEKTVDEEVTVLFEYPFMNDVMRIVKEEEPEILSQSYDMDCSMTLRIRQS AMPRLRSRLEKVETARIADEQEGNG >gi|226332019|gb|ACIB01000037.1| GENE 33 44002 - 45090 1120 362 aa, chain + ## HITS:1 COG:no 
KEGG:BF2023 NR:ns ## KEGG: BF2023 # Name: not_defined # Def: type II restriction enzyme HpaII # Organism: B.fragilis # Pathway: not_defined # 1 362 1 362 362 702 99.0 0 MAFEATKREWSELYAFFRLLADGKVSLGTPQAKKEDEKYRPIAMIQREEHDGTRRYYIEE EVIRMEGEKVEKSIPREDFATVADLILDALKNSSADEVTSPDGVEEFLDEAGIFDLEART EDRTDFSIAFWHPEAPLAGFNVRSRLSAMNPLLDGGRAANLKLEQSGIKFATPTVNKINA LPESPTEVAERMMMIERLGGVLKYSDVADRVFRCNLLMIDLHFPRVLAEMVRMMHLDGIT RVSELTEQMKIINPLKIKEELISKHGFYEFKMKQFLLVLALGMRPAKIYNGTDSAVEGIL LVDGKGEVLCYHKSEKKTFEDFLYLNSRLEKGSVDKDKYGFLERENGVYYFKLNVKIGLI KR >gi|226332019|gb|ACIB01000037.1| GENE 34 45115 - 45684 495 189 aa, chain + ## HITS:1 COG:MA1774 KEGG:ns NR:ns ## COG: MA1774 COG0778 # Protein_GI_number: 20090624 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Methanosarcina acetivorans str.C2A # 3 171 31 202 220 97 34.0 2e-20 MKTNEVLETIKARRSVHAYDRKQIPADDLNAILEAGAYAPSGMHYETWHFTAVRNTVKLE ELNERIKGAFAKSDDKHLRERGHSETYCCYYHAPTLVIVSNEPKQWWAGMDCACAIENMF LAATSLGIASCWINQLGTTCDDPEVRAYLTSLGVPENHKVYGCVALGYKAEGALLKEKTV KAGTITIVE >gi|226332019|gb|ACIB01000037.1| GENE 35 45827 - 48676 2888 949 aa, chain - ## HITS:1 COG:YPO0905_2 KEGG:ns NR:ns ## COG: YPO0905_2 COG1003 # Protein_GI_number: 16121210 # Func_class: E Amino acid transport and metabolism # Function: Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain # Organism: Yersinia pestis # 465 944 4 487 494 578 59.0 1e-164 MKTDLLACRHIGVNKADAEVMLRKIGVASLDELIDKTIPANIRLKAPLALPAPMTEYEFA RHIAELAGKNKLFTTYIGMGWYNTITPAVIQRNVFENPVWYTSYTPYQTEVSQGRLEALM NFQTAVCDLTAMPLANCSLLDEATAAAEAVTMMYGLRSRNQQKAGANVVFIDENIFPQTL AVITTRAIPQGIEIRTGKFRDLEFTDDLFACVLQYPNANGNAEDYREFTEKAHTANCKVA VAADILSLALLTPPGEWGADIVFGTTQRLGTPMFYGGPSAGYFATRDEYKRNMPGRIIGW SKDKYGKLCYRMALQTREQHIKREKATSNICTAQALLATMAGFYTVYHGQEGIRNIASRI HSITVFLEKSIGKLGFKQVNRQYFDTLRFILPDSVSAQQIRTIALSKEVNLRYFDNGDVG LSIDETTDVAAANILLSIFAIAAGKDFQKVDDIPEATIISEELKRQTPYLTHEVFSKYHT ETEMMRYIKRLDRKDISLAQSMISLGSCTMKLNAAAEMLPLSCAEFMCMHPLVPEDQAAG 
YRELIHNLSEELKVITGFAGVSLQPNSGAAGEYAGLRTIRAYLESIGQGHRNKVLIPASA HGTNPASAIQAGFTTVTCACDEHGNVDMDDLRAKAEENKDDLAALMITYPSTHGIFETEI VEICQIIHACGAQVYMDGANMNAQVGLTNPGFIGADVCHLNLHKTFASPHGGGGPGVGPI CVAEHLVPFLPGHGLFGNSQNEVSAAPFGSAGILPITYGYIRMMGAEGLTMATKTAILNA NYLAACLKDTYGIVYRGANGFVGHEMILECRKVYEETGISENDIAKRLMDYGYHAPTLSF PVHGTLMIEPTESESLSELDNFVLTMLTIWNEIQEVKNGEADKEDNVLINAPHPEYEVVS DQWEHCYTREKAAYPIESVRENKFWVNVARVDNTLGDRKLLPTCYGCFD >gi|226332019|gb|ACIB01000037.1| GENE 36 48719 - 49357 472 212 aa, chain - ## HITS:1 COG:VC1270 KEGG:ns NR:ns ## COG: VC1270 COG0491 # Protein_GI_number: 15641283 # Func_class: R General function prediction only # Function: Zn-dependent hydrolases, including glyoxylases # Organism: Vibrio cholerae # 11 210 13 210 218 155 40.0 5e-38 MKIKRFEFNMFPVNCYVLWDETNEAVVIDPGCFYDEEKQALKNFIVTNNLNIKHLLNTHL HLDHIFGNPFMLREFGLSAEANQADEYWIDEAPKQSRMFGFQLNEAPVPLGKYLHDGDII TFGNTTLEAIHVPGHSPGSLVYYCRADNCMFSGDVLFQGSIGRADLAGGNFDELKEHICS RLFVLPNETIVYPGHGAPTTIGIEKAENPFFR >gi|226332019|gb|ACIB01000037.1| GENE 37 49377 - 50051 681 224 aa, chain - ## HITS:1 COG:SA2499 KEGG:ns NR:ns ## COG: SA2499 COG0357 # Protein_GI_number: 15928295 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division # Organism: Staphylococcus aureus N315 # 28 161 16 154 239 100 36.0 2e-21 MNTTEQTSSLPSAGGENGVKLLLKYFPDLTEEQRKQFAALYELYIDWNSKINVISRKDIE NLYEHHVLHSLGIARIIRFRAGSSVMDLGTGGGFPGIPLAILFPDTKFHLVDSIGKKVRV ATEVANAIGLKNVTFRHARAEEEKQTFDFVVSRAVMPLADLIKIIRKNISPKQQNALPNG LICLKGGELEHEAMPFKHKTSMHNLNEDFDEEFFQTKKVVYVTI >gi|226332019|gb|ACIB01000037.1| GENE 38 50091 - 50945 912 284 aa, chain - ## HITS:1 COG:no KEGG:BF2028 NR:ns ## KEGG: BF2028 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 284 1 284 284 554 100.0 1e-156 MKKQYVSLLAIILATSGFLFSCGDKMNKNTGALEFDSIQVNETAHLFGDTAKPACNLTIN FAYPVKSTDNKLKDSLNSYFIAACFGEGYIGEKPAQVVKEYTEHYVKEYRTDLEPMYAED 
EKNKESEGSIGAWYSYYKGIESHVQLYYKNLLVYRINYNEYTGGAHGIYMTTFLNMDLIN LRPLKLDDIFTGDYKEALTDLLWNQLMADKKVTTHEALEDMGYGSTGDIAPTENFYLDKD GITFYYNVYDITPYAMGPVEIKIPYEMMEHMLGSNPIIGEMKSK >gi|226332019|gb|ACIB01000037.1| GENE 39 51045 - 51479 244 144 aa, chain + ## HITS:1 COG:no KEGG:BF2029 NR:ns ## KEGG: BF2029 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 14 144 1 131 131 243 100.0 1e-63 MAGWLNLLIFATVMKRFRYILAVILSLLIVYVGAGVSVAQYCCSGCETANCCCADKCGFC GKFDFEFHKSCRGEGCTATIYKLDLVKQAFESSVPAPVSLLLCDQVSDLLCALFRDEVLD PPYVIPPPKTSSRHYLALYSTLLI >gi|226332019|gb|ACIB01000037.1| GENE 40 51668 - 53875 2232 735 aa, chain + ## HITS:1 COG:no KEGG:BF2030 NR:ns ## KEGG: BF2030 # Name: not_defined # Def: putative TonB-dependent outer membrane receptor protein # Organism: B.fragilis # Pathway: not_defined # 13 735 16 738 738 1464 100.0 0 MLLTGLLLGSLTVQAQVSGTVKDQAGEPIIGANVFWKNIPGGVATREDGTFSISKPDKSN HLIVSFIGYENDTIQVNDKKAVLDVVLREGMELSEVQIVSRKLSTLKLRSSVMNEEIITS DELCRAACCNLGESFVTNPSVDVSYSDAATGAKQIKLLGLSGTYVQMLTENIPNYRGAAS PYGLGYVPGPWMHSIQVSKGISSVKNGYEALTGQINVEFKKPQLPEADWVSANLFASTTN RYEANADATVKLSKRWSTSLLAHYENETKAHDGNDDGFADIPRIEQYNFWNRWAYMGDHY VFQAGIKALDESRKGGQVSHSGVPAADRYEIDIDTRRYEAFTKNAYIFNKEKNTNLALIL SGTLHNQDALYGRKIYNVDQSNAYASLMFETEFTKEHNLSAGFSYNYDGYDQHYRLTNNA ETPLTKAFARESVGGAYAQYTFNLDNKFVLMAGLRGDHSSEYGFFVTPRAHIKYNPNDFV HFRLSAGKGYRTNHVLAENNYLMASSRKVSIADHLDQEEAWNYGASISGYIPLFGKTLNL NLEYYYTDFLKQVVVDMDTNPHEVAFYNLDGRSYSQVFQVEATYPFFQGFSLTAAYRWTD AKTTYNHQLMEKPLTGKYKGLVTASYQTPLGLWQFDATWQMNGGGRMPNPYTLADGTSSW DARYKGFSQLSAQVTRYFRRWSIYIGGENLTNFKQKNPIIDAADPWGDRFDSTMIWGPVH GAKGYIGVRFNLARD >gi|226332019|gb|ACIB01000037.1| GENE 41 53926 - 54237 356 103 aa, chain + ## HITS:1 COG:no KEGG:BF2031 NR:ns ## KEGG: BF2031 # Name: not_defined # Def: putative heavy-metal binding protein # Organism: B.fragilis # Pathway: not_defined # 1 103 1 103 103 135 99.0 6e-31 MKTKKMIATLVVALLSVTAVMAKDFRIVVFKVAQMECANCERKVKNNIKFEKGLKNFTTD LKERTVTITYDAEKTNVEKLKEGFRKFKYEAVVIKEAKETDKK 
>gi|226332019|gb|ACIB01000037.1| GENE 42 54289 - 56499 2376 736 aa, chain + ## HITS:1 COG:alr1627 KEGG:ns NR:ns ## COG: alr1627 COG2217 # Protein_GI_number: 17229119 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Nostoc sp. PCC 7120 # 1 735 1 746 753 606 45.0 1e-173 MSNITKKAFPVLNMHCAGCANNVEKTVKKLAGVVDASVNFATNTLSVSYEADKLTPGEIR AAVLAAGYDLIVEEALKEERQEEAQEKHYRLLKRQVIGAWIFVVPMLLFSMVLMHVPFSD EIQLILALPVMIFFGGSFYVNAWRQARLGRSNMDTLVALSTSIAFLFSVFNTFFPEFWYS RGLEPHVYYEAAVVIIAFVLTGKLMEERAKGNTSTAIRKLMGLQPRVARVLREGIEEDIL IDQLQTGDLVVVRPGEQIPVDGRLSEGESYVDESMISGEPIPVEKKVGDRVLAGTINQKG AFVIKASGVGSETVLARIIRMVQEAQGSKAPVQRIVDRVTGIFVPVVLCIAVLTFVIWLL VGGTDYFSHALLSAVSVLVIACPCALGLATPTALMVGIGKAASNHILIKDAVALEQMRKV DVVVLDKTGTLTEGHPTVSGWLWAQVQEEHFKNVLLAAELKSEHPLAGAIVSSLQEVEKI VPAQLESFESITGKGIKVVYQGDTYWVGSHKLLKDFSASLSDVLAEMMVQYESDGNSIVY FGRGTEVLAVVAIADQIKPTSAEAVKELKRQGIDICMLTGDGQRTALAVSGKLGIDRFVA DALPDDKEEFVRELQMQGKTVAMVGDGINDSQALALADVSIAMGKGTDIAMDVAMVTLMT SDLLLLPRAFELSKQTVKLIHQNLFWAFIYNLIGIPIAAGILFPVNGLLLNPMLASAAMA FSSVSVVLNSLSLARK >gi|226332019|gb|ACIB01000037.1| GENE 43 56675 - 57514 586 279 aa, chain - ## HITS:1 COG:BMEII0641 KEGG:ns NR:ns ## COG: BMEII0641 COG2207 # Protein_GI_number: 17988986 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Brucella melitensis # 171 277 183 291 307 70 33.0 4e-12 MKFDFPQVDLPCEILAWNDVTEDILNIYKQSCRLKAGIFAICTEGKMTATINLIDYEIKP NDLITLLPGTIIQFRERTEKVRLCFAGFSSECVERINLIKSMVSSFSKITECPIVELQED IASYLIDYFSLLARVTCDEKLSLPSEMTEVSLRSILTAVGLIYQRYSSKNHNTNRKEEIC RELVGLVTEHYTEERRAQFYADKLGISLQHLSTTVKQVTGRNVLDVIAYVVIIDAKAKLK SSNMTIQEIAYSLNFPSASFFGKYFRRYVGMSPLEFRNS >gi|226332019|gb|ACIB01000037.1| GENE 44 57580 - 58245 572 221 aa, chain + ## HITS:1 COG:MT2274 KEGG:ns NR:ns ## COG: MT2274 COG0321 # Protein_GI_number: 15841708 # Func_class: H Coenzyme transport and metabolism # Function: Lipoate-protein ligase B # Organism: Mycobacterium tuberculosis CDC1551 # 11 205 30 208 
240 150 44.0 1e-36 MKTITTDWELIPYSEAWSRQTEWFDALVHAKQNGENYENRIIFCEHPHVYTLGRSGKENN MLLGEEQLKTIGATLYHIDRGGDITYHGPGQLVCYPILNLEEFGLGLKEYVHLLEEAVIR VCASYGVVAGRLEKATGVWLEGDTSRARKICAIGVRSSHYVTMHGLALNVNTDLRYFSYI HPCGFIDKGVTSLQQELGRSIDMAEVKEQLGRELLAALLSK >gi|226332019|gb|ACIB01000037.1| GENE 45 58249 - 58527 58 92 aa, chain - ## HITS:1 COG:no KEGG:BF2089 NR:ns ## KEGG: BF2089 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 92 1 92 92 150 95.0 1e-35 MFSYFTSIVRVSEVKRERLFSGCTPEADKSELTEREPHALITHLFIPIASRHRHYDSLQR HLISKTKLLTRKEGDFIRSVHHWTEIEQSREV >gi|226332019|gb|ACIB01000037.1| GENE 46 58818 - 59594 921 258 aa, chain - ## HITS:1 COG:no KEGG:BF2036 NR:ns ## KEGG: BF2036 # Name: not_defined # Def: putative xylanase # Organism: B.fragilis # Pathway: not_defined # 1 258 1 258 258 521 100.0 1e-146 MKTIRTLLLCTSLLAGTVAAQESSLVLSNGLGFVDTPYKAGTLEVDDTEDLIINCDEVDC TTFVEYALAMALCPQQGDEMQEGDFARNLQRIRYRDGKIDGYTSRLHYISDWINNAVRQG LLEDVTAAYSPFKQKLSLSYMSTHPELYKSLKNSPENVAQMAKYEKALSGKEVHYLPKDK LEPDGLPWIKNGDIIALTTNTPGLDVSHMGIAIYIKGQLHLLHASSKEGKVVVGKTALSQ MLKDRKSLTGIRVLRMKK >gi|226332019|gb|ACIB01000037.1| GENE 47 59632 - 60462 526 276 aa, chain - ## HITS:1 COG:CC2313 KEGG:ns NR:ns ## COG: CC2313 COG0657 # Protein_GI_number: 16126552 # Func_class: I Lipid transport and metabolism # Function: Esterase/lipase # Organism: Caulobacter vibrioides # 5 251 34 304 328 176 37.0 4e-44 MATGLSAQKPVELPLWPNGAPNDNGLKGEEVMSAPYRLTNVTQPTITVYRPATPNGMTII MCPGGAYALLAMDHEGHDMAPWFNSLGITYVVLKYRMPNGHCEVPLSDAEQAIRIVRKHA KDWNIRTDRVGIMGASAGGHLASTLATHYSSEDTRPDFQILLYPVITMESGDTHGGSRHN LLGPDATPELTRKFSNEQQVTDHTPQAFITLSSDDEGVPPANGVNYYLALQKHKVPATLH IYPTGGHGWGFYDSFTYKRQWTEELEKWLRDGVRFQ >gi|226332019|gb|ACIB01000037.1| GENE 48 60492 - 62480 2265 662 aa, chain - ## HITS:1 COG:PH0361 KEGG:ns NR:ns ## COG: PH0361 COG1297 # Protein_GI_number: 14590271 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Pyrococcus horikoshii # 25 626 3 595 626 241 30.0 
2e-63 MKQEEDKFTGLPENAFRELKPGEVYNPLMSPKKTYPEVNLWSVAWGIAMAILFSAAAAYL GLKVGQVFEAAIPIAIIAVGVSGAAKRKNALGENVIIQSIGACSGVIVAGAIFTLPALYI LQAKYPDMSVTFMQVFISSLLGGVLGILFLIPFRKYFVSDMHGKYPFPEATATTQVLVSG EKGGSQAKPLLMAGLIGGLYDFIVATFGWWNENFTTRVCGVGEMLADKAKLVFKVNTGAA VLGLGYIVGLKYASIICFGSLAVWWIIVPGMSLFFGDSVLNQWNPDITATVGSMSPEQIF SHYAKSIGIGGIAMAGVIGIIKSWSIIRSAVGLAAKEMGGKSDAEKNIIRTQRDLSMKII AIGSIITLILVVLFFYFDVMQGNLVHTLVAILLVAGISFLFTTVAANAIAIVGTNPVSGM TLMTLILASVVMVAVGLKGPSGMVASLVMGGVVCTALSMAGGFITDLKIGYWLGSTPAKQ EAWKFLGTIVSAATVGGVMIILNKTYGFTSGQLAAPQANAMAAVIEPLMSGVGAPWMLYG IGAVLAIVLTLLKVPALAFALGMFIPLELNIPLVVGGAINWYVTTRSKDASLNTERGEKG TLLASGFIAGGALMGVVSAAMRFGGINLVNDAWLNNTLSQLAALIAYALLILYFIKASMK VK >gi|226332019|gb|ACIB01000037.1| GENE 49 62611 - 63006 385 131 aa, chain - ## HITS:1 COG:no KEGG:BF2093 NR:ns ## KEGG: BF2093 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 131 1 131 131 253 100.0 2e-66 MKKILFFITMMVLSIGLAQAQGKAEIKFDKTTHDFGTFSENNPVVSCTFKFTNIGDAPLV IHQAVASCGCTVPEYTQEPIMPGKTGTIKVTYNGTDKYPGHFKKSITLRTNAKTEMIRLF VEGDMTAKDAK >gi|226332019|gb|ACIB01000037.1| GENE 50 63190 - 64509 939 439 aa, chain - ## HITS:1 COG:BH1920 KEGG:ns NR:ns ## COG: BH1920 COG0642 # Protein_GI_number: 15614483 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Bacillus halodurans # 215 436 315 537 548 100 31.0 6e-21 MKQKWARKPLILTAALLICCYTTVWLGMHGFYISLPVSVCLLLYTAYRIYRYILRSTRAM AQFIWSVRYSEFLSSPVQSEESLRSLPAELLNEMNQALDFYKQNLQKKESKLQYFQALAN HIDMSVLVYTPSGRIEWMNEAAKRLLDNHNLKSIDELKYFHSELPARLYSLKAGDIAVLQ AKKEEETIQLALSGMEFVIQGRPLTVASMKNIDSVLDSQETEAWQKLIRVLTHEIMNSIT PVTSLSELLEHQIEDFDGNEEERAEMLRMLQTIRRRGDGLIRFVNSYREVSHLPQPLLKI YTSQELLTGVVRLMYREPNDLHLILPPKAQRLMADKDLIEQVLINLIKNARENDATDIRI SAGLSSGERPYIRIEDNGTGIEQEVLDRIFIPFFTTKPTGSGIGLTISRQIMHLHRGTIT VSSEPGKGSIFTLLFPGVF >gi|226332019|gb|ACIB01000037.1| GENE 51 64529 - 65899 1097 456 aa, chain - ## HITS:1 COG:atoC KEGG:ns NR:ns ## COG: atoC COG2204 # 
Protein_GI_number: 16130157 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Escherichia coli K12 # 7 456 6 456 461 283 36.0 5e-76 MTLKRGKILIADDNEDILFTLKMLLRPITESISITTDPRELLPILSRTHYDVILLDMNFR NDAVSGREGFHWLEEILKLDSGAVVIFITAYADTEKAVRAIKLGATDFIAKPWQNDKMIA TVSAALQLSFSRTEVESLKEQKEALTAPSPEPARIIGESAAMQAIFQTIRKFADTDANLL LLGENGTGKDLIARYVYEQSPRKGEIYVPIDLGSIPETLFESELFGFEKGAFTDARKKKP GRLEVASGGTLFLNEIGNLSLPLQAKLLSVIEQRKSSRLGSTTSYPVDVRLICATNTDLY TAIDNGLFRQDLLYRINTIEIRIPPLHERGNDLFLLADHFLQRYRKKYKKEVRGISKEAR RLMQLYRWPGNVRELEHTIERAVILSGNPMLMPNDFMLRTSPQGQVSEKEKYNLERQERE TISEVLRLCAGNITLASEMLGITRTSLYRRIEKHGL >gi|226332019|gb|ACIB01000037.1| GENE 52 66089 - 67447 767 452 aa, chain + ## HITS:1 COG:no KEGG:BF2042 NR:ns ## KEGG: BF2042 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 452 1 452 452 919 99.0 0 MIGSKMKSTWMLGLLLLLGSCQQKGPASGRVEMKGESDSLSVRHGQENALSVSDEYPEGY VPVPGIRYQANYLTGNPERLDVKAALRNVRPLKLSQLGSQLDFMVFKDVKGGIFRLIPIE EGCLGVGIGGIWLLDGNLKMIRMLFRNDVDITEENGFTNFQTKRYINPDYYEKTSRTLIG VLYDQRDKKNPKSFVRLSLKDLLSASRPLTPDDIHKRAGISGNYLLGMADGYATWTRFTN DVYTFNLRGDTLCHFTIGEKVNYPPRGNGSYRSGEQATVYHYKGCPMIIMPYGNTVYRMK DAFTLEPVYELDFGTLHRATGEEVVGGADVDNAYFLSDWVETDSYVFMHVEKGYDCPYAR DKKQVTLYSLIYDKRSKAFFSLPQAEETVHPCPEPDLPDGIPFWPRKGFSEKQLATFTTG WKLSKSSPEVFKRIPALQGTEKETTSLIITLK >gi|226332019|gb|ACIB01000037.1| GENE 53 67629 - 68996 802 455 aa, chain + ## HITS:1 COG:no KEGG:BF2097 NR:ns ## KEGG: BF2097 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 455 1 455 455 922 99.0 0 MIKRYYYWGNLLLMGGLCLSACGNKNPGVSKEELRREFAKSATPAADVEDSLAGWGDNDS LQVFPEDYQAPSGIKYDSHITDHGMIVLNVESALRNVRPMKLTDLGQQVQYRIMEANDDW TSLIAAGDDFLIPDEEGVWLLDKDLRKKRLLIRSDLQVIHTPGGGVGIGGGHLMQNFYYD SERKQLRFTCYNTDKRRSYLVVASLTGLMASSQPAELKTFRNRLPIGNSYLFSMPGGYGM AVRHSDELYTFGMKGDTLCHFVLGNSERYKPRLFEAERDIVYHVGGRTLFYHAYGNKIYR 
VKDASTLEVVYKLDFGSLSRITGADVARGGNVRSSYFVTSCMETDHFLFLNVDKGYDSEN ARRKGEVQLYSLVYDKRNGEFFSLPEVTGGGHPESPLIAAGLQEDLPFWPNLNLNGDPAF VVGKAVLEKYYPIQLRGNDKLGEFNETDLILMTVK >gi|226332019|gb|ACIB01000037.1| GENE 54 69007 - 71421 1491 804 aa, chain + ## HITS:1 COG:no KEGG:BF2098 NR:ns ## KEGG: BF2098 # Name: not_defined # Def: putative transporter permease protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 2 804 1 803 803 1587 99.0 0 MMINHYFKVALRLIKRSFLFSSIHILGFVWGMAAAFLIYLWVIDEFTFEDHNPDAGRIFR VIEANCSESGEVTENPYTSKLLADAFRKEFPQVEEATYLGNVDMTSLRSGDKFLSLNWMR VDTAFFDVFHFPVVEGDPGRLKSGFNHIVLSETLAKKFFGNEPAVGKEVMYNRGMDGESV LRIVGVVKVPRKSHIQFDAIVGQSFFDKIDVNIVRMSSPWDVRDAMVYVKMRPGTSVSDS DRVRMSRILSKHTHTERLLRFQPLRDIHLKTDFADISVKNHGSMASIYLFIILAVLIIFM GAFNFTTLSTARAALRYKEIGVRKVTGAKRKTLIVQFLSESLVQAFISLILALALTELLL PVFNRIMDKDITLQASWSVLVYVVLGIIGVGCLSGSYPAFYLSAVNPLIAFKGGQKNGKK GGLIRGLLCVQFVIAITLLLCTGIVFKQLNYLQNKDLGLEKENVVSIYTGLWYNVDGFKQ EILKNPNVRSVSMGAEITDYLEGDKSQGDVLRWTDERGETDSLRMMCIWADGDFVNTFGL KLLKGEGLKADGGAYFSGTYDFPVIINEAARKAMKVADPIGMEISGGFGVGTNKKRIVGV VQDFNFQSLRQKIKPAYLMYSPECLGNIHIKIAPEHKQETLNFIQKKFEEMAPFFIKEFK YKFFSDALNRNYEQERQQSRMLLAFTILAVVIAMMGVFGLVTLSTRQRTKEIGIRKVNGA HSGGIVKMFCLEYLKWVGIAFMPACPLGYLFMYHWLGEFAYRTTMSWWLFLGGGLIIAGI TLLTVIGQTWRTASQNPVRSLRYE >gi|226332019|gb|ACIB01000037.1| GENE 55 71438 - 72112 305 224 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 221 1 220 245 122 34 2e-26 MIKAIGLTKIFRTESVQTIALNEISIDISEGEFVAIMGPSGCGKSTLLNILGLLDNPTSG ELWFIGKEVSRYSENDRTDMRNGNIGFVFQSFNLIDELTVFENVELPLLYAGVSVHERVD RVNKALERMQIGHRTEHYPQQLSGGQQQRVAIARAIVTNPKIILADEPTGNLDSTNGNEV MLLLKELNQDGATVVMVTHSEENAREAGRIVRMMDGCILTENRR >gi|226332019|gb|ACIB01000037.1| GENE 56 72165 - 72764 566 199 aa, chain - ## HITS:1 COG:no KEGG:BF2047 NR:ns ## KEGG: BF2047 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 199 1 199 199 348 100.0 6e-95 
MKRKLLSFAVLITLLLVPTVNRAQSIKDLFNKDNISKVVNAVTGHTETVDMTGTWRYTGS AIEFESENLLKKAGGTVAASAAEQKLDEQLAKVGIKEGQLSFTFNADSTFVSTLGKRKLN GTYSYDAGTQMLHLRYMKLIPMNAKVNYTTQQMDLLFEADKLLKLITFLSSKSSSATLKA ISSLADSYDGMMLGYELKR >gi|226332019|gb|ACIB01000037.1| GENE 57 72780 - 73385 591 201 aa, chain - ## HITS:1 COG:CAC2636 KEGG:ns NR:ns ## COG: CAC2636 COG0218 # Protein_GI_number: 15895894 # Func_class: R General function prediction only # Function: Predicted GTPase # Organism: Clostridium acetobutylicum # 1 200 1 199 200 142 42.0 3e-34 MEITNAEFVISNTDVKKCPAGTFPEYAFIGRSNVGKSSLINMLTGRKGLAMTSATPGKTM LINHFLINNSWYLVDLPGYGYARRGQKGQEQIRTIIEDYILEREQMTNLFVLIDSRLEPQ KIDLEFMEWLGENGIPFAIIFTKADKLKGGRLKINISAYLRELRKQWEELPPYFITSSEE RLGRTEVLNYIESINKELNSK >gi|226332019|gb|ACIB01000037.1| GENE 58 73400 - 74851 928 483 aa, chain - ## HITS:1 COG:sll1087 KEGG:ns NR:ns ## COG: sll1087 COG0591 # Protein_GI_number: 16330938 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Na+/proline symporter # Organism: Synechocystis # 1 422 3 423 512 109 27.0 1e-23 MMILATIVCYFAILLLIARITGRKGGSNAAFFKGENQSPWYVVAFGMIGASISGVTFVSV PGMVKAMDMTYMQTVFGFFFGYLAVAHILLPLYYKLNLTSIYTYLDTRIGKRAYRTGASF FLLSRMLGTAAKLYLVCLILYTYVFRDMGIPFWSIAAGSVALVWIYTHKSGIKTIVWTDT LQTFCLIAALISILVFVTAKLNLDFSGVIQTISSNEHSRIFVFDDWMSRQNFFKQFLSGI FIVIVMTGLDQDMMQKNLSCRSLRDAQKNMYCYGFAFAPLNLLFLGLGILLLVLAQEMQL ELPAAGDDILPLFATQGYLGEGVLILFTIGIIAAAFSNSDSALTAMTTSFCIDLLDTGKD TEEEARRKRNRVHIGLSVLLIFFICLVDALNNQSVIDAIYIIASYTYGPLLGMFAFGLFT QRKTNDRWVPFIAIASPLICYAADRFARQETGYQFGYELLMLNGILTFAGMWIVSKKQLK NEF >gi|226332019|gb|ACIB01000037.1| GENE 59 75348 - 75863 389 171 aa, chain - ## HITS:1 COG:PM0984 KEGG:ns NR:ns ## COG: PM0984 COG1755 # Protein_GI_number: 15602849 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Pasteurella multocida # 1 169 1 169 172 177 58.0 9e-45 MQNIIITFIAFFVLRLLSLSYSIRNEKRLLKSGAVQYGKVNSLLLTLAHIVYYFSALYEA 
YTSGTTFNYFSVCGVFIMGFAYAMLFYVIYKLHDVWTVKLYIIPDHRIEKSFLFRTVRHP NYYLNIIPELIGVALLCNAWYTLLIGLPIYACLLAIRIRQEERAMKELLEN >gi|226332019|gb|ACIB01000037.1| GENE 60 76147 - 76557 366 136 aa, chain - ## HITS:1 COG:no KEGG:BF2052 NR:ns ## KEGG: BF2052 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 136 1 136 136 248 99.0 4e-65 MDLKKTTFYLFTLFSLMLISCSNDDENKNDAQVTVTVVSADGKPLPNEIVQMFDEKTYEE FKKDNRTTPTAYALTNSTGVATFIFTYDKWFESNKDRFFTFAVQYGSGTENYEIWSAGRT VRLGSVTQIELKLKPL >gi|226332019|gb|ACIB01000037.1| GENE 61 76827 - 77450 531 207 aa, chain + ## HITS:1 COG:DR0198 KEGG:ns NR:ns ## COG: DR0198 COG0353 # Protein_GI_number: 15805234 # Func_class: L Replication, recombination and repair # Function: Recombinational DNA repair protein (RecF pathway) # Organism: Deinococcus radiodurans # 6 204 2 197 220 198 48.0 5e-51 MYMNQQYPSTLLEKAVGEFSKLPGIGRKTAMRLVLHLLRQDTSVVEAFGSSIITLKHEVK YCKVCHNISDTETCQICANPQRDASMVCVVENIRDVMAVEATQQYRGLYHVLGGVISPMD GVGPGDLQIESLVRRVAEGGINEVILALSTTMEGDTTNFYIYRKLEKMGVKLSVLARGVS IGDELEYTDEITLGRSIVNRTTFTGTV >gi|226332019|gb|ACIB01000037.1| GENE 62 77489 - 77938 410 149 aa, chain + ## HITS:1 COG:no KEGG:BF2107 NR:ns ## KEGG: BF2107 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 149 1 149 149 267 100.0 8e-71 MEEQIKRIVKSQKVQYISFWIIPLLLVLLGEAGVLPVGIKADNVRAVYVFETVGILMTAV CIPLSLKLFSFVLTKKIDQLTFPVALSRYMLWGAVRLALLEFVVVFNLAGYYFTLSSTGA LCALIGLTASFFCLPGEKRLRAELHIDKE >gi|226332019|gb|ACIB01000037.1| GENE 63 77942 - 78478 351 178 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|229254479|ref|ZP_04378409.1| acetyltransferase, ribosomal protein N-acetylase [Capnocytophaga ochracea DSM 7271] # 11 172 2 162 166 139 43 9e-32 MKQSFLANERIYLRAVEPEDLDLMYEMENDPSMWDISSFTVPYSRFVLKQYIEGSQSDMF ADKQLRLMIMRRKDNCTLGTVDITDFVPLHSRGAVGIAVHSNYRQEGYASDALKLLCEYA FNFLFIKQLYAHIAVDNEPSLRLFNSCGFTQCGVLKEWLLTHEGYKDAVLVQCMNPKR >gi|226332019|gb|ACIB01000037.1| GENE 64 78481 - 79071 476 196 aa, chain - ## 
HITS:1 COG:CPn0139 KEGG:ns NR:ns ## COG: CPn0139 COG1678 # Protein_GI_number: 15618063 # Func_class: K Transcription # Function: Putative transcriptional regulator # Organism: Chlamydophila pneumoniae CWL029 # 19 196 10 188 188 97 32.0 1e-20 MNINTDIFKIQSNNVMPSRGKILISEPFLHDVTFGRSVVLLVDHTEEGSMGLIINKPLPL MLNDIIKEFKYIEDIPLHKGGPIGTDTLFYLHTLHEIPGTLPINNGLYLNGDFDAIKKYI LQGNPIKGKIRFFLGYSGWECEQLIQEIKENTWIISKEENTYLMNEDIKGMWKEALGKLG SKYETWSRFPQVPSLN >gi|226332019|gb|ACIB01000037.1| GENE 65 79143 - 80459 988 438 aa, chain - ## HITS:1 COG:L0098 KEGG:ns NR:ns ## COG: L0098 COG0436 # Protein_GI_number: 15673812 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Lactococcus lactis # 36 271 23 251 393 76 25.0 8e-14 MKDTPIKRHLIDETIEEFQITDFSKATIREVKAIAAKAETASGVEFIKMEMGVPGLPPST VGVKAEIEALQNGIASLYPDINGLPELKSEASKFIKAFIDIDLKPEGCVPVTGSMQGTFA SFLTCSQCDEKKDTILFIDPGFPVQKQQLVVMGQKYETFDVYDYRGDKLKEKLESHLKKG NISAVIYSNPNNPSWICLKDEELKIIGELATQYDVIVLEDLAYFAMDFRQDLSTPYHAPY QPSVAHYTDNYILLISGSKAFSYAGQRIGVSCISDKLYHRHYPGFDKRYGGGTFGTVFIH RVLYALSSGTSHSAQFAMAAMLKAANEGKYNFLNEVRIYGERARKLKEIFLRYGFHLVYD KDLEDPVADGFYFTIGYPGMTSGELAKELMYYGVSAISLVTTGSQQQGLRACTSFIKEHQ YAQLDERMKLFAENHPIS >gi|226332019|gb|ACIB01000037.1| GENE 66 80926 - 81453 276 175 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565325|ref|ZP_04842780.1| ## NR: gi|253565325|ref|ZP_04842780.1| predicted protein [Bacteroides sp. 3_2_5] # 17 175 1 159 159 310 100.0 3e-83 MKNILLLLSFLMSIFTMHSQNIQQLEAKPSFKGITIGMPISEISNKLSFEKSSNGYSIYK VTDAYYYSIFNVTMNYVRVIGLNGKVHAIEVIKMVKATNEHATVFDASELDVIQAGLTRL YGDPQYKLTENHSQYNRIGVQWISNSKEANCFIDFYGTFVGYKLQFSLCEHSEDF >gi|226332019|gb|ACIB01000037.1| GENE 67 81464 - 81907 368 147 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565326|ref|ZP_04842781.1| ## NR: gi|253565326|ref|ZP_04842781.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 147 10 156 156 207 100.0 2e-52 MKTLFTIVLLCISTLLWGQNYTQKYNELYDRTEFYNSYGNLIGYAKYNSLYDRLEYYNAN GDLLKTEQYNSLYNRKDIKDQYGNQQGYEKRNNLYNQNEEYDSYGNVKYKKKWNDLYQRY EIYDTYGNMVGYYKWNELYRRWEFISK >gi|226332019|gb|ACIB01000037.1| GENE 68 81919 - 82368 299 149 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565327|ref|ZP_04842782.1| ## NR: gi|253565327|ref|ZP_04842782.1| predicted protein [Bacteroides sp. 3_2_5] # 1 149 1 149 149 218 100.0 1e-55 MTKKLLCFVFLTVSIFANAQNRYDTPANATFTNTYVPMTHEEMMLRAAAEVYREKRARED FDKYSRTAYEYLQKKQIGYFTSYANAALSTGYYNSQLYYNLGISYYLSGQKRKGKKFLKK ALKKGFLEANRALFAIKKKEILSYSWFIY >gi|226332019|gb|ACIB01000037.1| GENE 69 82367 - 82516 96 49 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MYYNIFILPSSMKKFIKTRSVELQCHVLDEGRRKNTLSTDMDIAAHAIA >gi|226332019|gb|ACIB01000037.1| GENE 70 82701 - 84047 792 448 aa, chain - ## HITS:1 COG:no KEGG:BVU_2464 NR:ns ## KEGG: BVU_2464 # Name: not_defined # Def: mobilization protein # Organism: B.vulgatus # Pathway: not_defined # 1 448 1 446 446 697 85.0 0 MGYVVLHLEKAKGTDSRMSAHIERTVHPKNADRTRTHLNRELVQFPEGVRNRTQAIAHRI ETAGIRRKVGTNQVKAIRVLLTGSNKDMKQMEADGQLDEWCNDSLQWLRETYGERNLVSA VLHMDEKTPHIHATIVPIVTGERRKAKKEEQNGKKKYRKKSTQDVRLCADDVMARHKLKH YQDTYAQAMGKYGLQRGIDGSLAKHISTMQYYKELVEQQDSLQENIETLLGLEEEAQKRL KQVKGEINVQKMKGAAVNATTAIADGVSSLLGGSKVRRLEAENEGLKQDVVNLQKQVQAE QREQKNMENRHNSEIIRIDRSYQQKIAGYDNRLKQIDTYFPIVTELLPIAEQCREVGFTE ELTRRIVSLQPVGFKGRLYSKEHKEKFRTEHSTATVERNPQEKGKFQLCIDGMPILEWFR MKFKELKEKLGVSHTQKEENRPKRGLRL >gi|226332019|gb|ACIB01000037.1| GENE 71 84230 - 85198 316 322 aa, chain - ## HITS:1 COG:no KEGG:BF1279 NR:ns ## KEGG: BF1279 # Name: not_defined # Def: DNA primase # Organism: B.fragilis # Pathway: not_defined # 1 322 1 322 322 533 82.0 1e-150 MNTNEAKQIRIEEYLHSLGYDPVRKQGDSLWYKSPFRNEREPSFKVNTERNLWYDFSAGR GGNIIALAQELYASDSLPYLLERIAEQAPGVHPVSFYFGKQALSKPSFQQLEVVPLSSPA LYSYLRQRGINTELAKRECREVRYLTDGKPYFAVGFPNRSGGYEIRNKFFKGCIAPKDIT HIRQEQPRETCCLFEGFMDYLSFLTLRLERCPNCPDLDGQDYIVLNSTSNLSKAIRPLGG 
YGHIHCFLDNDKAGMEAVQKLREEYGLRVRDASHIYGDYNDLNDFLCGKRSGQAERRQEK PEPEREQRQARQPERKGRGFRM >gi|226332019|gb|ACIB01000037.1| GENE 72 85406 - 86476 765 356 aa, chain - ## HITS:1 COG:no KEGG:BVU_2466 NR:ns ## KEGG: BVU_2466 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 356 1 357 357 598 85.0 1e-169 MENKERNITPEEAIILWHASRLDLSEDYEQAPEILKVQGSVIGTLGNFSASIGKAKSKKT FNVSAIVAAALKNGTVLNYTAELPENKRKILYVDTEQSSYHCAKVARRSLRMAGLPTGSN HENLEFLVLRKYTPEERIAIMREAIYRTENIGLVVIDGIRDMVYDINSPGESTKVISLLM TWTGERNIHIHTILHQNKGDENARGHIGTELSNKAETVLQVEKDSKNPDISTVKTAHIRA VDFEPFAFRINEEALPELLEDYQFKDKDETKGNREKFDPYKDITERQHRIALEAAFTLKA EYGYQELAEALQEAYASVGVTLGDNKVVKLITVLKSKRMIVQENGRKYTFNPDFHY >gi|226332019|gb|ACIB01000037.1| GENE 73 86488 - 86796 124 102 aa, chain - ## HITS:1 COG:no KEGG:BVU_2467 NR:ns ## KEGG: BVU_2467 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 102 1 102 102 176 89.0 2e-43 MEIRDLLSKPVWQMTGEEFIFLNRHALQESETKPAQPAADKDKKYVYGIGGIARLFGCSI PTANRIKKSGRIDRAITQIGRKIIVDADMALELAGRKSGGRR >gi|226332019|gb|ACIB01000037.1| GENE 74 87018 - 88052 561 344 aa, chain - ## HITS:1 COG:no KEGG:BVU_2468 NR:ns ## KEGG: BVU_2468 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 1 344 1 338 338 450 73.0 1e-125 MEERITSMIPRYGKLNKIYAEIISGGSFSFEKQQFISDFYREYGDTQTFETALISLMLEA DAAHFSILLNSLKREIEGNISTYSACKEFFDCLDTGYVCRQHEERFDWSIDRQMKVTNGY YRELMAANGSLEAVGFREHDRQEEELLERRYDRCKREYDKEKAKLDELYRQKEQARREAL QCLQNRCGDICRLGGSLLAILDKYLTDQKKKEGEEKEIPSSGTTPASPPAYFPMRLLSAI YEKCNDKQFEAVPEADFYAGMNLQPCRNRLKIRPGEKARVCYLIFLMGETLSNQDREKWK DEIMRLLDIDIKYYKSKYKAPVPRTDLASDSNQEFAKEMRLIFR >gi|226332019|gb|ACIB01000037.1| GENE 75 88066 - 89352 903 428 aa, chain - ## HITS:1 COG:no KEGG:BVU_2469 NR:ns ## KEGG: BVU_2469 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.vulgatus # Pathway: not_defined # 1 428 1 430 430 685 80.0 0 
MNIKRNIIFSLESRKKNGVPIVENVPIRMRVIFASQRIEFTTGYRIDAAKWDTDKQRVKP GCTNKLKQSASEINTDLLRYYTEIQNIFKEFEVQGTMPTTAQVKEAFNSLHSEKREEEQQ KPLTFAPMEVFGEFIKECGTQNGWSDATYEKFAAVRKHLEKFDKELTFETLDEPKLTSYV NFLKDVEGLRNTSTMKQIAYLKWFLRWCTKKGYCMNNAYESFNPKLKSVQKKVIFLTWEE LNKLKDYKIPPTKQYLERVRDVFLFCCFSGLRYSDVYNLKRSDIKPDHIEVTTVKTADSL IIELNNHSKAILEKYKDVYFEDHKALPVISNQKMNDYLKELGELAEINDPVRETYYKGNE RIDTVTPKYALLGTHAGRRTFICNALALGIPAQVVMKWTGHSDYKAMKPYIDIADNIKAN AMNKFNQL >gi|226332019|gb|ACIB01000037.1| GENE 76 90171 - 91304 895 377 aa, chain + ## HITS:1 COG:YPO2806 KEGG:ns NR:ns ## COG: YPO2806 COG0667 # Protein_GI_number: 16123004 # Func_class: C Energy production and conversion # Function: Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) # Organism: Yersinia pestis # 52 377 1 329 329 358 55.0 1e-98 MDRRNFLRTASSFALLAAGATTGVSRVFTESPISSLSGNLSDKNTPNADDTMEYRKLGEL DVSAIGLGCLPMVGYYGGKYDKKDMIALIRRAYDKGVTFFDTAEVYGSYTSEEWVGEALA PFRDKVKIGTKFGFGVEEKQPTAINSRPDHIRQAVEGSLKRLRTDHIDLLYQHRVDPAVP MEDVAGTVKDLMQEGKVLHWGLSEASARSIRRAHAVCPLSAVQSEYAIWWREPETKIFPT LEKLGIGFVPYCPLGRAFLTGIINENSRFYEGDRRWNLPQFTPEALKHNMPLIALVRKWA ERKGVTLAQFALLWMLSRKSWIAPIPGTTNPAHLDDLLGAGTVRLSAWEMEEFDKEYAKI DLMGHRADPFTESQIDK >gi|226332019|gb|ACIB01000037.1| GENE 77 91312 - 92304 817 330 aa, chain + ## HITS:1 COG:MA3965 KEGG:ns NR:ns ## COG: MA3965 COG1853 # Protein_GI_number: 20092760 # Func_class: R General function prediction only # Function: Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family # Organism: Methanosarcina acetivorans str.C2A # 31 207 1 181 202 112 38.0 1e-24 MVKIILGVLSLLVMLSCSTAVKENTTQPDIMETNKKNLGNLLALYPKPMTVVGAEVEGKV NWLVVGHTGVIGHDRILVSMSKSHYTNQGIKKSKRLSVNLVSREMLPKADYVGSVSGATV DKSEVFAYHIGENDTPVIDASPLTMECEVVDIYETDGFDNFICAIVNTYAASDVLDSDGK LDYTKLKPVLFEFPTYSYLATGEIIGKCLNPDKPGMCVKEPMTTDGIVRLSKIEVYPQYL DEYMNYATEVGEISLRTEPGVLTMYAVGEKENPCKVTILETYASREAYEQHIASEHFQKY KQGTLHMVKSLVLSDQTPLNPANKLNNFMQ >gi|226332019|gb|ACIB01000037.1| GENE 78 92400 - 93251 717 283 aa, chain + ## HITS:1 
COG:YPO2805 KEGG:ns NR:ns ## COG: YPO2805 COG0656 # Protein_GI_number: 16123003 # Func_class: R General function prediction only # Function: Aldo/keto reductases, related to diketogulonate reductase # Organism: Yersinia pestis # 1 278 15 292 297 310 54.0 2e-84 MDFKELNNGVKMQIQGFGVFQIPDATECERVVTDALAVGYRLIDTASVYGNERAVGMAIR KSGIPREELFITTKAWISEMGYERTLRALDTSLARLGLDYLDLYLIHMPFGDYYGAWRAM EKLYAKGRVRAIGVCNFEPDRLLDLCHNANVIPAVNQIEVHPYTPQTDATRTMQELGIQA EAWGPLAEGRNGLFTDDILTGIARKYDKSAAQVVLRWHLQRGVVAIPKSVHRQRMQENFN IGDFMLTPEDMAAIASMNMGYNMILDLHAPEEVQRLYGIECPA >gi|226332019|gb|ACIB01000037.1| GENE 79 93413 - 93598 182 61 aa, chain - ## HITS:1 COG:no KEGG:BF2076 NR:ns ## KEGG: BF2076 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 60 1 60 61 120 100.0 2e-26 MHFIKGWALLFSPDLLHGTPLGNHIKDFSFFSYQSNEAGEHIRCHIIDMAKSKLINGEQI M >gi|226332019|gb|ACIB01000037.1| GENE 80 93817 - 94326 485 169 aa, chain + ## HITS:1 COG:SA0658 KEGG:ns NR:ns ## COG: SA0658 COG0656 # Protein_GI_number: 15926380 # Func_class: R General function prediction only # Function: Aldo/keto reductases, related to diketogulonate reductase # Organism: Staphylococcus aureus N315 # 1 157 14 173 279 123 39.0 1e-28 MPQLGVGTSTLKETAAECVKHAIGLGYRLVDAAQGYDNEAEVWYGIKESGIGRSEVFIIS KVSPDAVRSGKVRESLDRTIEAFGGTYVDLMLIHWSVARKVKERWRIMEKYVDVGKIRAI GVSNFNPHHVDELLAYARIKPVVNPIKIHPYMKHQEVVGNTFAKGIQVQ >gi|226332019|gb|ACIB01000037.1| GENE 81 94943 - 95809 653 288 aa, chain + ## HITS:1 COG:no KEGG:BF2131 NR:ns ## KEGG: BF2131 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 288 1 288 288 607 99.0 1e-172 MKVISNAEFGGERPLFESHDLRLENVIIRAGESAIKECSNIEAVDCRFEGNYPFWHVHGF VIDRCFFDVGGRSALWYSDNLKMTNTRIDAPKMFREMHDIEIENVEIKDADEVFWRCKNL DIKNLKLHGGTYPFMFSSNIRIDGLEGDSKYVFQYVKNVELRNAKITTKDAFWEVENVTI YDSELNGEYLGWHSHNLRLVNCHITGEQPLCYAHDLVLENCTFGPDCDRAFEYSSVQATI KGAIGGVKNPRTGCITAESYGEIILDENIKAPADCKLKLWDEKTCFTD >gi|226332019|gb|ACIB01000037.1| GENE 82 95829 - 97262 
1092 477 aa, chain + ## HITS:1 COG:CAC3444 KEGG:ns NR:ns ## COG: CAC3444 COG0534 # Protein_GI_number: 15896685 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Clostridium acetobutylicum # 33 475 21 462 462 206 31.0 1e-52 MGTGKGKTDYLLSLIREGKQMTLGQQLRLTAYLSVPAIMAQISSIAMQYIDASMVGSLGA NAAASIGLVSTTTWLFWELCAAAATGFSVQVAHKIGAGDFVGARKILRQSIAATLVFSSL LAAVGISISGMLPGWLGGDEAIRSDSSLYFWIFALFLPALQLNFLAGGMLRCSGNMHVPS MLNVLMCLLDIVFNFFLIFPSRQVEWLGVEFTAPGAGLGVEGAILGTVLAELITAGGMMW YLCHRSPMLRLSGEWGSFLPQKETLRKAFRISLPMGFEHMAICGAQIATTVIVAPLGIIA IAANSFAITAESLCYMPGYGISEAATTLVGQSLGANRIRLLRRFANITVWSGMLIMGVMG ALMYMAAPQIIGVMTPVEEIRTLGIEILRIEAFAEPMFAASIVAYGIFVGMGNTFVPSLM NFGCIWGVRLTLAAWLAPTMGLRGVWFAMCIELCFRGVIFLARLWGSNWIYKLRINR >gi|226332019|gb|ACIB01000037.1| GENE 83 97266 - 98429 1169 387 aa, chain + ## HITS:1 COG:YPO3006 KEGG:ns NR:ns ## COG: YPO3006 COG1168 # Protein_GI_number: 16123185 # Func_class: E Amino acid transport and metabolism # Function: Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities # Organism: Yersinia pestis # 1 377 1 382 393 372 44.0 1e-103 MRYDFDTIVPRRGTNSYKWDTPEEENVLPMWVADMDFRTAPAIIDALQKRVEHGIFGYTK VPETYYDAVVWWFEDRHRWRIDPRWIIYTSGVVPALSAIIKALTVPGDKVIVQTPAYNCF YSSIRNDGCELSANNLVYRNGRYSIDFDDFEAKAADPKAKLLLLCNPHNPVGRVWTPEEL RHIGDICLRNGVFVVADEIHCELTYEGHDYTPFASLSERFQQNSITCVSPSKAFNLAGLQ IANIIAADDDVRRRIDRAININEVCDVNPFGVIATIAAYNEGGEWLDALRKYLRGNYEYL CHFFAERLLQYPVLPLEGTYLVWIDCRALGIGSDATTLRLQEQQKLMVNSGTMYGPGGEG FIRLNIACPRALLADGLERMARVLEYS >gi|226332019|gb|ACIB01000037.1| GENE 84 98549 - 99493 895 314 aa, chain + ## HITS:1 COG:all1225 KEGG:ns NR:ns ## COG: all1225 COG0667 # Protein_GI_number: 17228720 # Func_class: C Energy production and conversion # Function: Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) # Organism: Nostoc sp. 
PCC 7120 # 7 298 16 311 315 160 35.0 4e-39 MEKLPSIALGTWSWGTGFAGGDRVFGNNLGVEELKPVFDEAMANGLNLWDSAVVYGMGVS ETVLSTFTKNCKREDVFISTKFTPQIAGDSENPVADMLAGSLDRFATDYIDIYWIHNPAD VEKWTPYLIPLVKSGKVKRIGVSNHNLAQIKRAEEILSKEGVHIFAVQNHYSLLYRSSEK AGILDYCKENGIDFWAYMVLEQGALSGKYDTAQPLPAGSQRGETYNPLLPQIEKLVAVMR TVGNKYGITPAQVALAWAIAKGTTPIIGVTKPSQVQDALQATKVFLTADEMKALEEAAES TGVDTRGSWEMPMV >gi|226332019|gb|ACIB01000037.1| GENE 85 99510 - 100517 728 335 aa, chain + ## HITS:1 COG:SA2366 KEGG:ns NR:ns ## COG: SA2366 COG2159 # Protein_GI_number: 15928159 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolase of the TIM-barrel fold # Organism: Staphylococcus aureus N315 # 45 334 39 331 336 76 24.0 7e-14 MKCYLIGLLSMLSVHAAMGQQAIDVHCHNILPTFKELLDRHGATLEETFPLPDWDVASHL KFMKEAGIETSVLSMPAPQPWFGDVEESRRAVRQYNETCSRLKADYPGKFLFCASLPLPD VDAAIKEAVYVLDTLGADGIKLATNSRGQYVGDAALDTLMQVLNEHHAVVMLHPHKPSPV NDGIIATAPLAVYEYPAETTRTVVNLIARNVPARYPNLKFVVPHCGSFLPLALPRMKVVY LVMAAKGLMEPIDWNANLKAFYYDLAGGATPEVVKVLLTITTPDRLLYGSDYPYQPATVL TGNLKQLRTWITDDAELTPFAEKILHDNALKLFGK >gi|226332019|gb|ACIB01000037.1| GENE 86 100547 - 101374 699 275 aa, chain + ## HITS:1 COG:RSc0215 KEGG:ns NR:ns ## COG: RSc0215 COG1028 # Protein_GI_number: 17544934 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) # Organism: Ralstonia solanacearum # 3 273 2 272 275 298 56.0 9e-81 MKKEVMILTGAGQIGMAIARRMGYGKKIIVGDLNPESAEAVCRILNEAGFDAVPVEMNLS SRESILNLIAVAREHGEISMLVNAAGVSPSQASIETILKVDLYGTAVLLEEVGKVIKEDG VGVTISSQSGHRMPALTIEEDMLLATTPTEELLSLDMLQPDSIKDTLHAYQMAKRCNVKR VMAEAVKWGERGARINSISPGIIVTPLAIDEFNGPRGDFYKNMFAKCPAGRPGTADEVAN VAELLMSDRGAFITGADFLIDGGATASYFYGPLKP >gi|226332019|gb|ACIB01000037.1| GENE 87 101516 - 102328 642 270 aa, chain + ## HITS:1 COG:all3171 KEGG:ns NR:ns ## COG: all3171 COG2207 # Protein_GI_number: 17230663 # 
Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Nostoc sp. PCC 7120 # 174 268 202 297 306 69 31.0 7e-12 MERLDVFDCSNVLIASYFTDDRGCAHENREHTLIYLCSGELEIEERGKKTVLHPGDCAFM RRDNRMWLQKKVEDGKPYRSVVLKFSRPFLREFYQTLNRQQIPTDSEREKVSLRVLPSNR LDIRSLFESVIPYFEAGEKPSEDVLKLKMVEGIYVLLNTDRNLYASLFDFVEPWKIDILD YLNENYMCDLSLEEIASYTGRSLATFKRDFAKVSNLTPQKWIIKRRLEVAHGLIKSGKKK VTEACFDVGFKNLSHFSKIYKEAYGVAPSW >gi|226332019|gb|ACIB01000037.1| GENE 88 102459 - 103292 661 277 aa, chain + ## HITS:1 COG:Cgl1022 KEGG:ns NR:ns ## COG: Cgl1022 COG0599 # Protein_GI_number: 19552272 # Func_class: S Function unknown # Function: Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit # Organism: Corynebacterium glutamicum # 29 129 6 106 107 142 67.0 6e-34 MKTKVASLALLLTLIFPIMAKSQVKIQQTAGRDALGEFAPEFARLNDDILFGEVWSRNDL LSLRDRSIVTVVALMSQGLTDSSFKYHLESAKKNGVTRTEIAEILTHAAFYAGWSKAWAA FRMAKEVWTDGNADSVAAGSLEAYAQTIIFPVGKPNDAYAKYFIGQSYTAPVVTDGVPVV NVTFEPGCRNNWHVHKATKGGGQTFVCVGGRGYYQEWGKEPVELRPGDAINIPAGVKHWH GAAPDSWFSHLAIEVPGENNSTEWLEPVGDEEYSKLK >gi|226332019|gb|ACIB01000037.1| GENE 89 103294 - 103866 465 190 aa, chain + ## HITS:1 COG:YPO2003 KEGG:ns NR:ns ## COG: YPO2003 COG0716 # Protein_GI_number: 16122245 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Yersinia pestis # 11 165 54 202 235 98 34.0 5e-21 MKQLVLIFMSLLTFSCPSKAQKQTADTDRPSDKKILVAYFSCTGTTEKVATAIAKETGGK LYRITPATAYTSSDLDWNDKASRSSVEMTDEKSRPALGGETIDLKDYDVVFLGYPLWWDL CPRPVNTFLEKYDFAGKTVIPFATSGGSSITGSVKQLKKLYPKIEWEEGRLFNSGTVNVA GWSKQIIEKL >gi|226332019|gb|ACIB01000037.1| GENE 90 103878 - 104144 235 88 aa, chain + ## HITS:1 COG:no KEGG:BF2140 NR:ns ## KEGG: BF2140 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 88 1 88 88 159 98.0 3e-38 MENFAYYTPTRLIFGKGLAIITPRWMRHILNDHTLERFVKFSVDIYGLLRLRKRTFTKYL KHHCRIYEEYNRGRIFGQGVCLFIIYYK >gi|226332019|gb|ACIB01000037.1| GENE 91 104146 - 104523 315 125 aa, chain + 
## HITS:1 COG:SA2342 KEGG:ns NR:ns ## COG: SA2342 COG0110 # Protein_GI_number: 15928134 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Staphylococcus aureus N315 # 13 123 18 184 199 63 28.0 7e-11 MEEFRIDCLYQTEEELVVTKELRQNIFRLNHTMPDTEEYRELLHKVFPHLGENCRIETPF SGVRTANVKFGRNVIVMPGCLMMSAGGITIGENAVVGAGSVVTHDVEPDTLVAGNPAKFI RKIKS >gi|226332019|gb|ACIB01000037.1| GENE 92 104528 - 105592 516 354 aa, chain + ## HITS:1 COG:XF1739 KEGG:ns NR:ns ## COG: XF1739 COG2220 # Protein_GI_number: 15838340 # Func_class: R General function prediction only # Function: Predicted Zn-dependent hydrolases of the beta-lactamase fold # Organism: Xylella fastidiosa 9a5c # 23 352 27 355 385 300 45.0 3e-81 MKQKMIICVSIFIVLVLTVIIVLNHPAFGRTPRGERLVRIERSPNYKEGQFVNQEPTPFM TTDKSRWRIMWDNLTEKKPDNLVPSESIRAVKTDLKQLDLSKDAVVWFGHSSYLLINGGK KILVDPVLTTGFPASLMMKPFKGTDIYSPEDIPEVDYLIITHDHYDHLDYGTVKAIRDKV NKVICPLGVGEHLEYWGYSAEKIVEMDWNEVYTSEPGFRITCLPARHFSGRFLRQNPSLW ASFMLEGHSTVYIGGDSGYGAHFSEIGKRFPHIDLAILENGQYNEDWRYIHTMPEQLPVE VHELDAAKVLPVHNSKFSLSRHAWDEPIHRIEQAATKDSSLHVIHGIIGSPITY >gi|226332019|gb|ACIB01000037.1| GENE 93 106258 - 106629 150 123 aa, chain + ## HITS:1 COG:no KEGG:BF2143 NR:ns ## KEGG: BF2143 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 123 14 136 136 211 100.0 8e-54 MINEKYQMTLDDTLVLRSISILIIILHNYIHRFSNVVLENQHVYYPERNKELIDSFLEFD SGLFLDLISHYGHYGVPVFVFQSGYGLVMKYEKKEVSLKFRKFMKRHADKLWLLLLPDHA CSE >gi|226332019|gb|ACIB01000037.1| GENE 94 106905 - 107657 231 250 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 4 246 1 239 242 93 31 8e-18 METMNEKVAMVTGAAAGIGLASAEAFAKAGATVVLVDINEPKEQVEKLVSEGYKAVAYRC DVSDTRAVKEMIDWIVATYGRLDAALNNAGIQTPQRPMAEITDEEFDRTMAVDLKGVWNC MRYEIIQMLKQGGGAIVNTSSHGGVTGFPGQAAYIACKHAVIGLTRTAAIDYSAKGIRIN AICPGVIRTPMAEELIRRNPDLEKELVRDIPAGRLGKPEEIANAVLWLCSPQASFVDGHA LLVDGAFSIH 
>gi|226332019|gb|ACIB01000037.1| GENE 95 107683 - 108846 1155 387 aa, chain + ## HITS:1 COG:TM1006 KEGG:ns NR:ns ## COG: TM1006 COG0667 # Protein_GI_number: 15643766 # Func_class: C Energy production and conversion # Function: Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) # Organism: Thermotoga maritima # 63 383 9 329 333 304 47.0 2e-82 MEENNKMDISRRGFLKTAALAGAAMAMPSGLGKVFASEAKQAETSDVDTDAARIKGHRVL GTGKAAFEVSALGFGVMGMTYNRSQHPDKKECIRLLHEAVDRGVTLFDTAIIYGPLTNEN LAEEALSEFKGRINVTTKFGHEVIDGKGTGRQDSRPATIRRYCEESLRRLRLDSLPMFYQ HRADPNTPAEEVAATIADLIKEGKVQRWGMCEVSAETIRKAHAICPLTAIQSEYHLMHRL VEENGVLDVCRELGIGFVPYSPINRGFLGGCINEYTVFDVNNDNRQTLPRFQPEAMRANT RIVNALQAFGRTRGMTSAQVALGWLLQKAPWIVPIPGTTKLSHLEENLRTLDFNISSGEW KELEDAVAAIPVVGDRYNAEQQRQVGR >gi|226332019|gb|ACIB01000037.1| GENE 96 108941 - 109618 609 225 aa, chain + ## HITS:1 COG:XF1748 KEGG:ns NR:ns ## COG: XF1748 COG1985 # Protein_GI_number: 15838349 # Func_class: H Coenzyme transport and metabolism # Function: Pyrimidine reductase, riboflavin biosynthesis # Organism: Xylella fastidiosa 9a5c # 2 224 3 231 237 100 28.0 2e-21 MRPYIISHMMTSVDGRIDCPMVGQLSTDEYYIALEKLGPCSKLSGRITTALECSAVKEES TPMEGTPIGHKSVYVASKSDEYTIIVDTYGKLRWQEGEADGHPLLCIVSEQVSEEYLETL RTLGISWIAAGTERIDLPEAMELLHEHFGVERLAIVGGGHICGGFLEAGLIDEVSIMVAP GIDGRKRQTVVFDGISRMECNPYKLKLESVEQWEADIVWLRYKIK >gi|226332019|gb|ACIB01000037.1| GENE 97 109626 - 110267 643 213 aa, chain + ## HITS:1 COG:PA2218 KEGG:ns NR:ns ## COG: PA2218 COG1073 # Protein_GI_number: 15597414 # Func_class: R General function prediction only # Function: Hydrolases of the alpha/beta superfamily # Organism: Pseudomonas aeruginosa # 23 213 38 228 367 247 64.0 9e-66 MKLQAIAILTFLIFENVMAQETTTAKYINSTDMEALKLTQEWDKTFPQSDKVEHTKITFH NRYGITLAADLYKPKNTQGRLAAIAVSGPYGAVKEQVSGRYAQTLAERGFLTIAFDPSYY GESGGTPRYLTSPEISTEDFSAAVDYLTSRADVDPERIGILGICGWGGFALNAAANDPRI KATVTSTMYDMSRVNANGYFDAMSSDDRYKLRE >gi|226332019|gb|ACIB01000037.1| GENE 98 110472 - 110696 207 74 aa, chain + ## HITS:1 COG:ECs0310 KEGG:ns NR:ns ## COG: 
ECs0310 COG1073 # Protein_GI_number: 15829564 # Func_class: R General function prediction only # Function: Hydrolases of the alpha/beta superfamily # Organism: Escherichia coli O157:H7 # 1 72 306 378 378 91 57.0 3e-19 MLTYISEIRSAVLMIHGEKAHSRYFSEDAYKRLTGSNKELLIIPGANHVDLYDNLNVIPF DKIDAFFKNALKEK >gi|226332019|gb|ACIB01000037.1| GENE 99 110773 - 112008 704 411 aa, chain + ## HITS:1 COG:BH3693 KEGG:ns NR:ns ## COG: BH3693 COG2311 # Protein_GI_number: 15616255 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Bacillus halodurans # 17 405 12 398 405 154 30.0 3e-37 MNKEIDIKDMAPVKASERHVILDALRGFALLGICFANFPEFSLYTFQKPEITEAMPTAEI DKVIRFLQYLFVDGKFYTIFSLLFGIGFSIIISNAVKKGTDGFRIFYRRMIVLAAIGFLH LMFIWSGDILLLYALLGMLLPLFRHVSDRVLLGTSAVLLLLPILIDWLAGTFGVSRSAPA VRMQQHYCNLYGITEYNFGIWLRDAENYGGVFQFLVQGAWVRLQEFIDGNRYFKVLGLFL LGFYIGRKQIYADLEANRVLLKKTVTYGFLLGLPLSVLYAWSAVNGHPFGTTAHTAIYTA SVYPLGFAYVSAICLLYLHGREWRLWRCLAAPGRMALTNYVGQSVWGMVLFYGIGFGLGA GIGLTGTESIAFYVFLVQMAFSALWLSYFRFGPLEWGWRMLTYGKWLKIRK >gi|226332019|gb|ACIB01000037.1| GENE 100 112048 - 113064 912 338 aa, chain + ## HITS:1 COG:YPO2806 KEGG:ns NR:ns ## COG: YPO2806 COG0667 # Protein_GI_number: 16123004 # Func_class: C Energy production and conversion # Function: Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) # Organism: Yersinia pestis # 1 318 1 318 329 274 47.0 2e-73 MDKRKLGQLEVSPIGMGCMGFSHGYGQVPPEAYAIEAIRGAYDYGCTHFDTAEAYGKEQF YAGHNEELVGKAIEPFRKKVVLATKFHIGELSKPDETNLYQEVRRHLEDSMSRLRTDYID LYYLHRISEAVRLEDVATVMGRLIQEGLIRGWGLSQVSADQIRAAHKITPLSAVQNIYSM VERDCETEIFPVCLEKGIGVVPFSPIASGFLSGKVTAQEQFGFDDVRKFVPQLSKENIEA NQPILDLLYRFAVEKNATNAQISLAWMLHKYPNVVPIPGSKNQERILENLGAWNVTLSDD EFRQLQSALDECKVHGHRGCVETEQTSFGKQWSEETDK >gi|226332019|gb|ACIB01000037.1| GENE 101 113979 - 114464 155 161 aa, chain + ## HITS:1 COG:no KEGG:BF2088 NR:ns ## KEGG: BF2088 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 161 1 161 161 317 99.0 8e-86 
MFATWLQEIYSIFVPKVKTKQMKRIFFVYPLAIATLFLIVLSAIPHHHHKEMMCTVMELC EQDDIYNDGHTDHEAGQDAHNENTCVSQAGYIFPSSVDKSNLHDGSLMNIHLPVLYLFAD ILTIHFDIPIPENTYDRYVVSYTSVVLGESSGLRAPPYFFS >gi|226332019|gb|ACIB01000037.1| GENE 102 114972 - 116180 904 402 aa, chain + ## HITS:1 COG:AGl3090 KEGG:ns NR:ns ## COG: AGl3090 COG0845 # Protein_GI_number: 15891660 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 151 398 173 429 452 73 26.0 6e-13 MKIYIFIILAAATSISLISCDSKQSDTRSASSSEVHRNDDGHDHRESYGDNHSEIENSGK GHEDEIIFTRQQAEAIGLEIYNVVPGSFAQVIRTSGQIQAAQGDEETIVATTNGVVSFPG QNIIEGATVGVGSTIVTISAKNLYEGDPVAKAKIAYETALKEYQRAEGLVKDKIISAKEF EQTRMKYENARTAYEAQAANVTVSGVKVTSPISGYVKNRLVSQGEYVTVGQPVATISKNR RLQLRADVSENYFNELKKIRGANFMVSYNNKVYRLEDLHGRLLSFGKAAAESSFYIPITF EFDNIGDFIPGSYVEVYLLTTPQNNVFSIPVTALTEEQGIYFVYLQIAEEEFVKREVGIG ESDGKNVRILSGLKGGERVVVKGAYQVKLASSSSVLPEGHSH >gi|226332019|gb|ACIB01000037.1| GENE 103 116199 - 119312 2674 1037 aa, chain + ## HITS:1 COG:all7631 KEGG:ns NR:ns ## COG: all7631 COG3696 # Protein_GI_number: 17158767 # Func_class: P Inorganic ion transport and metabolism # Function: Putative silver efflux pump # Organism: Nostoc sp. 
PCC 7120 # 1 1037 1 1031 1041 744 42.0 0 MLNKIIKFSLNNRIVVLIGALLLIIAGTYTAVNMEVDVFPDLNAPTVVVMTEAKGMAPEE VERLVTFPVETAVNGATDVRRVRSSSTTGFSIVWVEFNWGTDIYRARQIVSEKLAVLGDA LPSNVGKPTLGPQSSILGELLIIGLTSDSVSLQDLRTMADWTIRPRLLSTGGVAQVTVLG GDIKEYQILLDPGKMKHYGIGLNEVIDVVKDMNQNAAGGVLYEFGNEFIVRGVLSTNKVE ALRKAVIKNVDEVPITLENIAEVKIGAKAPKMGLASERGKAGILLTVTKQPNTSTLDLTG KLDKSLEDLQKVLPKDVKVTTDIFRQSRFINNSIDNVQKSLYEGGIFVIIVLLLFLMNIR TTVISLITIPLSVIVAILTLKTLGLTINTMSLGGIAIAIGSLVDDAIVDVENVFKRLREN RQKPKEEQKNVLTVVFEASKEVRMPILNSTLIIVASFVPLFFLSGMEGRMLAPLGITFVI SLFASTVVALTLTPVLCSYLLNKPKNNGEEHDPYLVRKLKSGYGMALRWTLCHKKVVLGA IGAILIVSLVMMASFGRSFLPPFNEGSFTVSISTLPGISLEESDKIGRMAEDILLSVPEV QTVGRKTGRAELDEHALGVNTSEIEAPFILDKRSKDEVLTEIREKIKVIPGVNIEIGAPI THRINAMLSGSRANIAIKLFGTDLNRMYEIGNQIKNSIQDIEGVADLNVEQQVERPQLKI EPKREMLAKYGVTLPQFGDIVNVMLGGEAVSQVYEENRSFDLTLKVNDASRASAERIRKL IVDANGRKVPLENIANVTSSMGPNTISRENVARKIVISANVAGRDLRGVVNDIQSKVDSE IQLPEGYHIEYGGQFESEQAASRIITITSIFSILVIFLLLFKEFKSATQSLVILLNLPLA LIGGVFSIYFTSGILSIPAIIGFISLFGIATRNGMLLIDRYNSLRASGMSVNDSILHGSL DRLNPILMTALSSGLALIPLALGGELPGNEIQSPMAKVILGGLLSSTILNGFIIPIMYLY ISRKQEKKIETIELTED >gi|226332019|gb|ACIB01000037.1| GENE 104 119335 - 120489 953 384 aa, chain + ## HITS:1 COG:no KEGG:BF2093 NR:ns ## KEGG: BF2093 # Name: not_defined # Def: outer membrane efflux protein # Organism: B.fragilis # Pathway: not_defined # 1 384 8 391 391 695 99.0 0 MALLASVSLVAQENIGSILFSIEENNSTLKALREETNAQKLGNKTGIYLSDPDVEFGYLL GNPGKIGNRQDFSIKQTFDIPTLTGMRSRLAGNQNKLVELQYASERINLLLEAKQYCIDL VYYNGLKKELKVRLRHAQAIADAYRQRLDRGDASILEYNKVQLNLSTVQGEMSRIEVERN ALLSELKRLNGGMDVIFEASNYSPASLPVNFEDWYLSAQQKNPLLQYVKQQIEVSKEQVK LGKAMTLPKFSAGYSLERTLGQKYQGISVGISIPLWENKNRVKQAKAGVVAAQAREQDSK QQFYDRLRNLYMRASGLQQTAIAYRESLKALNNTALLMKALDVGEISLLNYIVEIGLYYD TVNQTLAAERDFEKALADLSAVEL >gi|226332019|gb|ACIB01000037.1| GENE 105 120501 - 120737 124 78 aa, chain + ## HITS:1 COG:no KEGG:BF2094 NR:ns ## KEGG: BF2094 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 78 1 78 78 151 100.0 
9e-36 MRKLKKPYSQNWRGLSAKWLFILSTVLGIIAGTCFWIFISRCGNEECFLNYDPLPEMFIG GVIGAFLYVIIWGMTSDM >gi|226332019|gb|ACIB01000037.1| GENE 106 120933 - 122609 719 558 aa, chain - ## HITS:1 COG:no KEGG:BF2157 NR:ns ## KEGG: BF2157 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 558 1 558 558 1093 99.0 0 MKAKKYNWTLRHSFLLFLVIFISSSCVEDVTESTKEPTRYTANDIKSYSDLFEVFWNTMN QRYNYFYEQSSFNWETVYNEYAPKFKKLKTFNRDKQYSKAEISEDCNKAIEYFTEIIDPI IDRHFYVKISLPVSHSFIRNVYFHGGMKSKEKIYTYPFELKYEYMRSKIQSETGVFGQAN DMLGGFLSDNPDIYYFSFKSFTISNHYILSFGSEYLVIDDKSPYYLTEKEIRDTVEANKI KDPAVKSALIEKSIEYMNKFNSFMRSEIAQDAIKKIADFNQSENPDNSFIEALSKAKENA PDINIELSQLSGLKEFRLNPNYTTWFKQRSTEHLQLACEYTVFLSNIDNVINNQYKIDFY RNFLVPLKVGKIKKIILDLRGNGGGMVLDARTFTDRFITKDAIFGYQRFKEDNNPFSYTP WIPCMTKTTGIGIKKEIPIVILLDNNSASMSEISTLMLKSQGKHVTVVGGYSAGATAGLG DSDQFNGGIRGKVSDYLEFYMPLLAMQDATHTVIEGIGIKPDLLVDPLTEDEVREMALSP FTHIDRTLKQAIEVLSNN >gi|226332019|gb|ACIB01000037.1| GENE 107 122621 - 123331 464 236 aa, chain - ## HITS:1 COG:no KEGG:BF2096 NR:ns ## KEGG: BF2096 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 236 9 244 244 427 99.0 1e-118 MKKQLVIFILLGIIAIPSSVKAQKLKVSLSAGLSKSILNSDVSNLVNTRYNTKTGVATRV NLEYNFFKDFIVGTGLGFIQKNYEYKKTDNITGTHTLYKNNFMDIPLNVGLYLFNNPHKE NGIWLKVQGGVFYEYFTRMHRKGEYPIFAQLQEDGSYIKAQVNETYDFKRNENNLKRNLF GIEGTGEVGYSFNRIDVFASYTYQYGLTDIYKAKTSSNRKSKRISNIISLGVAYKF >gi|226332019|gb|ACIB01000037.1| GENE 108 124102 - 126858 1219 918 aa, chain + ## HITS:1 COG:no KEGG:BF2098 NR:ns ## KEGG: BF2098 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 918 5 922 922 1765 99.0 0 MNILLCFLWLFSATLSAQQVKIVKGHVCVLSEDSIERSLPYASVVVLEGKDSAFVKGMAS DANGSFKLEFIPQKRMEYLLRISYIGMSTIFRKLDPHMMDTDCGHILMKSGIKLAEVLVT APVKEIDMVGDTTVINADAYRTPEGSNLEELVRKIPGLEYDDRSKSLSYNGLVISEISVN GEAFFSGNHVLALENLPADLISRIKVYDKRSEMEKFTGVRTVDENYVLDLQTKKELNGTL ITSVAVGKGNKKKKEAELISNFFKADGENLSVIAKSGNRDITTENKKNRQDNIAVNFLKK 
FGETVHINGNIMYNNAINGINGTSYYEQYLMTGNRYRYATDNGYHTNRMISAMLSVRWNI DKKTLLNISGSFNALKGINDDSNRQATYNADPKLDVTSPFNRDASEQIEDSIRVNDICMT SRSSSINRLYSIGADITRRLNAKGTSLGLTVQYSEERGNNKAFSLSSTTYYQLQNVKGND SILYRNQYQESPNRNRTIKLGLILTQILRKNLRAQLSYIFKLDNQSRDRNTYVLSPVIDG EEHTPTGNFPKGYEVEYTDSLSNKSRSHTMAHGVSLNLNYTDKTWEITTGLSVTPKRQTL DQKTGWTQADTLRYSVNYHPTLTILWRKRKTWVQLSYEGNTQQPGLAELLTLTDNSDPLN IIRGNPDLKSSYTQKVRFEVRDTKTGLSGDVNWTSMLNNVTRAVIYDSQTGGIESRPVNV NGNWNIKAAMRYQKRINHYFNLSARTGTSFIQNVSLVNDGQREQPERSVTHNRLYNAGLR VGYQPKWGGFDLSGDWRFRHSTNLLRETNNYIRDYSFGLTAYAVFPGNIRLKSETTHTFR NGTNINRSEDNEVVWNLHLSWSFLKYKKAEFSVYWADILSQKKSYSRNVTSHGLSERYTQ QIGSYFIVSFKYRFNRQL >gi|226332019|gb|ACIB01000037.1| GENE 109 126972 - 128390 999 472 aa, chain + ## HITS:1 COG:no KEGG:BF2099 NR:ns ## KEGG: BF2099 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 472 1 472 472 892 100.0 0 MNNLRNYLGLSALTMGLCLMSCNDDNTPSYSQTTMKNSELKTILQQKGYQFNEQGNLLLD DLANNTTTLDLSGTKLSNLSELDILPNLTEVKLSDNDYGPVFDFSKLPKQITGIDLTGND IYDYDNLVNVVVEENGNETVTNLHDITKLYLPRTAKDNIKDLVRFYIKNKDAITNGKIDM KIKDESGTLQTYTTLREVPDENLRTYLQANFSDLFNGDQIDLSKHLGYAQKTTILLIQAN AGVTNFEGIQYIIQNPYWEGAAVALYSAAQSGANMPSVKLGKYVTNLVLNNLNVRSLDLS NAGSLFVLNIGTVAGLSTLDLTHTIWGQREKEIEAEESKGSYLIVYDCPSLKEIKLPKKD ELKTCFLDLECLDALETFDISNLKMVKNLIFGNLPENFNLVYPELTVFYSPEGRSATSFC CSESTFNRESTKTFLDRYYTKGTGVEKLGFSISMSCNKNDGYNWRKALKKKS >gi|226332019|gb|ACIB01000037.1| GENE 110 128432 - 128935 440 167 aa, chain + ## HITS:1 COG:no KEGG:BF2100 NR:ns ## KEGG: BF2100 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 167 1 167 167 312 99.0 3e-84 MKKGILFIALFAASFGFMQSSFAAPSMSANSITNVLAISDYEGTYSGTMDNIIMRGKPYE SRAATYKIEGGRLKCDFPQIGSMPGTITISLAVEVDEETGEITAYNGDEAGTLSLPLGIK VKLYLDDLRDAKITDNGSSKQIEFTLDVSGTFLGANFPASVHFVGTK >gi|226332019|gb|ACIB01000037.1| GENE 111 129231 - 129878 596 215 aa, chain + ## HITS:1 COG:no KEGG:BF2101 NR:ns ## KEGG: BF2101 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 215 1 215 215 424 100.0 1e-117 MYNKIRFDTFMLMCFFMITILGLSACDSDEKITQEPPSQTYVKKAKEILAGDIVLSTRAT MNGVDKTLLKSGCPTKFNFSWREDGMMILNLSDFSVGAMPFAISFKCATKIMQLNSWEQD EYPGDGWIKFVGTDGNVTTSGDDAEDNQEGSGARVDGYLNVNTNQIEFIVDYNMMNVRTE TFLQTIDKTRIDRFKEEFAQYEKDLEEAKKDQGKA >gi|226332019|gb|ACIB01000037.1| GENE 112 129915 - 132272 1771 785 aa, chain + ## HITS:1 COG:AGl1858 KEGG:ns NR:ns ## COG: AGl1858 COG4771 # Protein_GI_number: 15891046 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 103 745 46 693 707 121 21.0 4e-27 MKRVTTILFLMLLICVHTAAQQKVKLEVLEKGTEQPIIAANVIYADNEALRNPQYAITNT SGQAELKLPSKGICYYKVTYIGYVPVTGKIGGTQDEKVIYMKEDDLGINEVVVTGSRTAR PIKMSPVTTQVLGGKALVDAGYSNLQQALQQETPGLNIQKVGFGNEISMQGLDARHVLFL MDGERMTGDMAGNLDYERFNLHAIDRVEIVKGASSTLYGSRAAGAVINLITKKTDKPLSI DAGIRYGQMNERNYKHPQPKDFLYMFEQNADRPNLQSWVSAGFKAGKFTSQTDVWYSESD AFYMYQAENDKKVYTKEANPFLPHDIIVVSNAVRPPMGIEGKEHITVSQKLYYNPNPNLS VLVYGSSFFMNTYDLIQDMTFSQARDWTAGTKVTYHVKDWFSVTGSLHADFYDRFKRHER IDKRQKDYESSIYQPRLTVTSNYFNGHSLILGMEHTSDELTSDRFSGNANHDLKTRALKE TEYFLQDEWTINPRWMISAGIRTNFSKAFGFMGMPKVAAKYSPDKHWSLRANYSMGYRSP SIKELFFNWDHLGMFMIRGNENMRPEKNNYFSLGAEYSNDRLFVSGTAYGNYFRDKIEGV WRIYDMQYNFEYTNLSQQRLLGLEVLARWSVLDCLTLNGTYSFVDVSKNKGIQVNTTSPH AATASMDYKYMKKNYRLNAVFSASYMGGKKFDVQDRVFVKEENKSYDAYFRCDLPQYVLC NLSVSQTFWNKVKLTLGMDNLFNYVPKTLGSGITMFNVPATAGARGWVQVEFMLDDVINS LKKKK >gi|226332019|gb|ACIB01000037.1| GENE 113 132269 - 132985 661 238 aa, chain + ## HITS:1 COG:no KEGG:BF2165 NR:ns ## KEGG: BF2165 # Name: not_defined # Def: lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 238 1 238 238 493 100.0 1e-138 MKHTGLFKTLCFCAGCLLLSACVDYSDIQPFDGKTLPRKSGYTTGVTNDWIYFNLRTGEI FNALGVNRDIKEGGQMNRTDWDLAFCGYVMRTNSGTSGIGRGGAADLGYGNYENWTSVAQ LPSDLKWVEDNQEVYVTMSQNDWNHYLIENGLDFNSNPWFDPNNGPQKTTTNANPVLAQA MSFAGPPPVYTPSYHTYVVRTADGKHYFKIQIISWYDANVEIGDEGGRLSYYCDELQP >gi|226332019|gb|ACIB01000037.1| GENE 114 
133076 - 133996 926 306 aa, chain + ## HITS:1 COG:no KEGG:BF2104 NR:ns ## KEGG: BF2104 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 306 1 306 306 552 99.0 1e-156 MEQITDVKTTEMNLMPSAGSKRYWWKAIASFLMVLFTMPLGHALMIIMEHVMNETVLHYS AFAMGAVGMVMVIIGVFAQGDTRQTLWGFFGGLLFWTGWVEFLFMYFANRFGTQPELDPV TGEIVTRPEYLILPASFGFWMMVMVMYLFSTKNGCNFINWWQRLLLRGRKADIAARPMTR HVSIITFMELMMILWTSYLVLMFCYDDVFLGEHHPVTLLVGLGCLVGAFFIFVKQLRIAS WGANIRMAIATVVVFWTPVEILGRMNLFSEIWIDPMNHVMEMGIILAVFIILTVYLWYMS AKKKKR >gi|226332019|gb|ACIB01000037.1| GENE 115 133998 - 134897 634 299 aa, chain + ## HITS:1 COG:no KEGG:BF2105 NR:ns ## KEGG: BF2105 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 299 1 299 299 560 98.0 1e-158 MKDSKLKTAGMRKLHWGKAVMSIVVTLAAMPLTHSLARVLKEGTTGVEQFYAGMGMGAFG LFMVIAGVFVKGHIRQTLLGLFGGMFYWMGAVDFLFMYFANRFGTQAQLDPVTGEVVSRP EYLLLPATFGFWVMVMILYLFCTRNGCNFLNWWQKLFFGKHKKEIVVRAMTRHTSIVAFM EVITMLWTCYLVLMFCYDERFFGDHHPVTLLVGMLGLIGSIFMFAKLLRHASWDMSLRFG FATVIIFWIAVEVFDRIHLFPGLWENPGGHKQELLLIAASIIFTGCCLVYNNLLVLKNK >gi|226332019|gb|ACIB01000037.1| GENE 116 135004 - 135324 459 106 aa, chain - ## HITS:1 COG:no KEGG:BF2168 NR:ns ## KEGG: BF2168 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 106 1 106 106 149 100.0 4e-35 MKKVLVAVALVMGLGSSVAFAQEVENSTAVETQAQAPQDEFTKIDAKKLPDAVMNALAKS YEGASIKEVYSADKETGKIYKVILTTKDSQEVTVLLDEKGEEIKEA >gi|226332019|gb|ACIB01000037.1| GENE 117 135651 - 136262 460 203 aa, chain + ## HITS:1 COG:FN0473 KEGG:ns NR:ns ## COG: FN0473 COG1309 # Protein_GI_number: 19703808 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 2 125 3 120 189 69 33.0 5e-12 MQTLKSDIRNRILSAAKEQFVQRGYLKTSMREIADAVDVGVGNLYNYFENKDELFCVILR PVSDALERMLQEHHGAKGADIMLICSEEYLKSAVDEYISLINKHGELMKILLFHSQGSSL ETFREDYTNRSTEMVKTWFAEMKEKHPEINVVVSDFMIHLQAVWMFTLFEEMLKHAIDSK EMEYIVHEYILFEIQGWRALLRV 
>gi|226332019|gb|ACIB01000037.1| GENE 118 136412 - 137230 551 272 aa, chain + ## HITS:1 COG:MA3472 KEGG:ns NR:ns ## COG: MA3472 COG3315 # Protein_GI_number: 20092284 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: O-Methyltransferase involved in polyketide biosynthesis # Organism: Methanosarcina acetivorans str.C2A # 1 248 1 251 274 165 33.0 1e-40 MKQKHEFENIVAETLLIPLYMRAKENRRKNPILCDKLAEQLVENIEYDYSRFDGAKLSEV GCVIRGWYFDHAIRRFIDTHTRPVVVNVGCGLDTRYQRVGNDGKAVFYELDLPEVIAIRR RLIPEPENDCYLSASLLETDWMDRIRLLHPNGDFIFVVEGVLMYFREEQVRTFLHNITMR FEGGELWFDVCGTMMSRCGVKPDSLREHKAQIRSGIDDGHMVELWEPGLHLLEQANYMKF FRSRWGFFFGQILGRMTKLCYKFSSMLGYKIG >gi|226332019|gb|ACIB01000037.1| GENE 119 137246 - 139057 1214 603 aa, chain + ## HITS:1 COG:YPO0771 KEGG:ns NR:ns ## COG: YPO0771 COG1132 # Protein_GI_number: 16121084 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase and permease components # Organism: Yersinia pestis # 12 597 11 581 590 381 37.0 1e-105 MVNKKKEGLSRLFEIAGQKKSLLLLAGLLSAGSAVCMLIPYWAIYRILYELLNHSRELSS IDETNMIRWGWIAFGGLIGGLLLLYASLMSSHVAAYRILYGLRVRLTEHIGRLPLGYLNG TSTGAIKKTMEQNVEKIENFIAHTIPDLVNVMATVVVMFLIFFSLDGWLAGVCLAVIVLS IFLQFSNFMGKKAREFTRIYYNAQEQMSASAVQYVRGMPVVKIFGQSVRSFRQFNAEIEA YKTYALKVCDTYESGMTYFTVLLNSIVTFILPVGILLMQNDSRSLTLAAVWLFFIILGPG VASPVYKLMYLGSSTREINEGVSRIDRILENQPVSEPACPKIPATYDIEFRHVSFSYENK EQATRTEALHDLCFTAPQGKITAFVGPSGSGKSTVANLIPRFWDVEQGEILIGNVNVKDI ATEQLMDLVSFVFQDTFLFYDTLYENIAVGSSKATRDTVIAAARAAQCHEFIEKLPNGYE TRIGDKGVFLSGGEAQRVCVARAILKNAPILVLDEATAFADPENEYKMQQALKSLIKDKT VIIIAHRLSSIVSSDRIIVLKDGRAVQCGRHEELSSQEGVYKKMWNAYTSAFRWQLNVKQ EKE >gi|226332019|gb|ACIB01000037.1| GENE 120 139061 - 140791 248 576 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P [Thermanaerovibrio acidaminovorans DSM 6589] # 336 554 132 351 398 100 32 8e-20 MSAIRNITIGRTERLYKPVGYTMLANLVNIVPFCLSIEAIRIIFRAFNGGGQSLDTTRLW CIFGCMTGYIAVMVLAERAAYRANFRGAYEMSASGRISLAEHLRKLSLGFLGKRDPGDLS 
SMLITDFTMAETGISHYLPQLMGALVMPVLAFVSLLWIDWRMAVAMFVALPFAMGILWLS TSVQERLSGRQIKAKVNAGNRLEEYLQGIRVMKAYNLLGDRFVRLRDAFAELRRACIRLE ALLGPFVLLAITLVRAGLTLMVLCGTYLLLGGQLSILTFVMFLVVGSRVFDPLTSALTNF TEFRHFSISGGRILSLMNEPEMKGTKEAPEDGNIIFENVSFGYQEKEVLHGISVILSRNS LTALVGPSGSGKSTVMKLCARFYDPTKGRILFGGVPVREIEPEKLMSRISMVFQDVYLFQ DSIRNNIRFGKSDATDEEIVAAAKKACCHDFIMHLPHGYDTMVGEGGCTLSGGEKQRLSI ARAMLKDAQIVLLDEATASLDPENEVEIQKAIDTLIKGRTVIVIAHRLKTIMGADHIVVL SDGKVEEQGTHSELMCRDGLYRKLWNIQESTLGWTL >gi|226332019|gb|ACIB01000037.1| GENE 121 141092 - 142627 1001 511 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 249 505 49 311 328 150 36.0 6e-36 MSNSSIKSKTALLRNGETQKGNGYPEANDYSARILETMQTGVIFFNTEQIISGINNLACE DLQIPRDPSGHKITDIISIIHQEKDIFPELIARLKSSETDMEKLPIDTLIRSLETKVQFF ASGCIMQLETGRYLLAFRNTMDEVTHEHLLSMILARTKIFPWFFDLKRNKMLIDAHWFSY LGIPAGDCEITIEKFFSRVHPNERDMLADALQKQLSEKEIPDSFSYRLQRGDGSWEWFSE QSMYLSKTNDGSPYRIVGVCHSIQEHKNTEDKLRAARNKAQESDRLKSAFLANMSHEIRT PLNAIVGFSNLIAGGIVDLDTEEARDYSALISKNCNYLLTLVSDVLDLSCIESDTMTFKF TIYPLTRLLTEIYQKYENRIPQEVQFNLLLPTDNVEIETDAVRLRQVIEHLLDNAAKFTV KGHIDIGYALSDHGEKIYVFVADTGCGIPSDQYKKVFERFYKINSFVQGAGLGLSVCKTI VEGLGGTINVYSQLKEGSRFSVILPLNRLHK >gi|226332019|gb|ACIB01000037.1| GENE 122 142902 - 143774 384 290 aa, chain + ## HITS:1 COG:PA0248 KEGG:ns NR:ns ## COG: PA0248 COG2207 # Protein_GI_number: 15595445 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 183 286 183 286 288 64 31.0 2e-10 MADNVKANKMNSQADDDIGFGIYTDIADLPMTGCPSYIEEGIGGVCESGTATIVVFDVPF QIVPNVVITLMPWQLVFIKEISEDFRITFFKISKDMFSETLSTLWRPASGFLLYMRKHIV SIPDGELIGRFLAYCNLLVYRMKHTPQNCRQESIMQLLRVYFWDVYTVYINDPQAEKSLK FTRKDEYVYQFVRLIIEDHSPDKDVAYFAQKLGISPKRLTNLIRSISGQSAREWIVYYTI LEIKSLLRESSLDIKSIAARVNFPDQTTLSRYFRHYTGVTPSQYRKNIYF >gi|226332019|gb|ACIB01000037.1| GENE 123 143818 - 
144282 282 154 aa, chain - ## HITS:1 COG:no KEGG:BF2116 NR:ns ## KEGG: BF2116 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 10 154 1 145 145 281 100.0 6e-75 MQVQKYIEYMETKILSNATHKCVLVIDNAQPTGIVANIASVLSMTLGCRVSNIVSHDVYD KQGERHLGITQLPIPILGASQEKIKELRNYFHSLEIEDLVLVDFSTIAQQSRTYDEYERE MYSANEDDLHYVGIGICAEKKAINKATGSLSLIR >gi|226332019|gb|ACIB01000037.1| GENE 124 144412 - 144879 408 155 aa, chain + ## HITS:1 COG:mll9538 KEGG:ns NR:ns ## COG: mll9538 COG1522 # Protein_GI_number: 13488398 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Mesorhizobium loti # 4 151 6 153 155 122 36.0 4e-28 MGQLDKTDVEILQVLQKDAKVNTKELSEKLHISKTPIYERIKRLENDGYIKGYVALVDNK KVGLPLIVFCNVSLAVHDDEHIKRFQEEIKEIDEIMECYSTGGIYDFFIKVVLKDLDAYN RFVFEKLTKVHGIVKMQSSFVLSEIKHTTVLNIDR >gi|226332019|gb|ACIB01000037.1| GENE 125 144921 - 145244 134 107 aa, chain - ## HITS:1 COG:no KEGG:BF2118 NR:ns ## KEGG: BF2118 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 107 1 107 107 207 100.0 1e-52 MEKIEIVLRREQNNRNGIYLNYINGYWYTYEWSAFLLCMLHPEVEVRKCIGVQPDENYAI ARVNKKIIKKLERKYQTSMMDDSIKILLPPFNEDENIFLNWKALLPP >gi|226332019|gb|ACIB01000037.1| GENE 126 145264 - 146112 298 282 aa, chain - ## HITS:1 COG:no KEGG:BF2119 NR:ns ## KEGG: BF2119 # Name: not_defined # Def: transcription regulator # Organism: B.fragilis # Pathway: not_defined # 1 282 1 282 282 541 100.0 1e-153 MKLLHIERHTTCLNYVSDYNICFIHQRLFSGGDFKIDNHHHSCILFLLKGEILTSCSEFH DQHIVEGHMVLFPQNDPNQSKSMTETEFILLFFDNQVNLHSKMSIELSAIHLESEKSCFY SLSICPPLRHVLDSICFYLKQKVQCSHMHELKQKEIFMVFGTFYNRTDMAHFLMPITGRD PNFKSFVLENYLQIRNIKQFAQLYHCSERSFNRKFKSCFHDTPYNWILNQKTRHIKGQLA NRNIPISEIARTFHFASPSHFTTYCKKRLGITPSEFREKIAK >gi|226332019|gb|ACIB01000037.1| GENE 127 146313 - 146708 348 131 aa, chain + ## HITS:1 COG:no KEGG:BF2120 NR:ns ## KEGG: BF2120 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 131 1 131 131 259 100.0 2e-68 
MIQTIQVQGTEKRLYQLIAPLVMNPDVLSANNNYPFKTTEQYVWFIAIDKKSVVGFMPVE HRRSGCVINNYYVSGDNRETLSLLISSVLEAIGKEVRLFAVVMVNHQAVFEEHGFIMEKA WKRYVKMQKDE >gi|226332019|gb|ACIB01000037.1| GENE 128 146705 - 148009 1008 434 aa, chain + ## HITS:1 COG:lin1347 KEGG:ns NR:ns ## COG: lin1347 COG3969 # Protein_GI_number: 16800415 # Func_class: R General function prediction only # Function: Predicted phosphoadenosine phosphosulfate sulfotransferase # Organism: Listeria innocua # 10 433 4 434 434 435 49.0 1e-121 MMTKTTTVPKNVYELAQERLRIVFNEFDNVYLSFSGGKDSGVLLSLCIDYIRRNNLKIKL GVFHMDYEIQYKMTIDYIARMLEDNKDILEVYRVCVPFRVATCTSMYQSFWRPWEDSKKD LWVRPLPENAMTKEDFPFYNTQMWDYEFQMRFASWLHEKKDAVRTCCLIGIRTQESFNRW RCIYLNRKYQMYHRYRWTSKVGNDIFNAYPIYDWKTTDVWTANGKFKWDYNKLYDYYYWA GVNLERQRVASPFIGEAQESLALYRAIDPNTWGKMIGRVNGVNFTSMYGGTHAMGWQSIK LPEGYTWREFMYFLLSTLPDRARNGYLRKLQVSVQFWRNKGGCLSDETIRRLNEAKVPII VMDNSNYKTTKKPVRMEYQDDIDIPEFREIPTYKRMCICILKNDHACKYMGFSPTKEEMS KRNQVIEQYKNILQ >gi|226332019|gb|ACIB01000037.1| GENE 129 148006 - 148551 500 181 aa, chain + ## HITS:1 COG:L69383 KEGG:ns NR:ns ## COG: L69383 COG1475 # Protein_GI_number: 15673430 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Lactococcus lactis # 6 165 7 166 180 221 63.0 4e-58 MSIKKSPVYNVIAVPVEKVQANDYNPNVVAPPEMRLLELSIWEDGFTMPCVCYYDKEKDV YILVDGFHRYSVLKTSKRIFQRENGMLPIVVIEKDLSNRMSSTIRHNRARGTHNIELMCH IVAELDKAGMSDQWIMKNIGMDRDELLRLKQISGLADLFANRDFSVPEDDQPGNVDKKPT R >gi|226332019|gb|ACIB01000037.1| GENE 130 148888 - 149454 528 188 aa, chain + ## HITS:1 COG:no KEGG:BF2182 NR:ns ## KEGG: BF2182 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 188 1 188 188 370 98.0 1e-101 MKNLILFGAAISISVAVSAQHVALKNNLLYDATTTPNLALEVGLGKKTTLDLYGGYNPFT FGNHKRFKHWLAQPEFRYWTCERFNGTFWGVHLHGGEFSVAGISLPFKIFPSLKDHRYEG YFYGGGVSVGHQWLLSKHWSLEASVGVGYAYWVYDKYRCVNCSPKIKSGHKNYVGPTKAA VSLVYFIR >gi|226332019|gb|ACIB01000037.1| GENE 131 149481 - 150932 1304 483 aa, chain + ## HITS:1 COG:no KEGG:BF2183 NR:ns ## 
KEGG: BF2183 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 483 1 483 483 978 99.0 0 MKKFYFLILGIALCDMVAQAQNSYEGHIGFGQHHVVRKGGELNVEVSLDLGAVKLAAQQM IVLTPVLRSTEGEEQQLLAPVVIAGPRRYRVLKRSLAFGTDNFEMSPMLVEKRKSGTPQT VNLHFGLPYHEWMRRAELILREEVTGCADCPVSQGDHTVITSVFDEQFTPRYELSYVTPP VEPLKQRSETHTAYLNFEVDKYVLLRNYKNNANVLADVDRIVNEIQNDSNLTVTEFRVTG YASPEGNYSRNMKLSENRALAFVGYLQNHGGVDESLLTVDWKGEDWSGLRREVAASSLID KGAILAVIDGYTDFATRKNRLQALNGGTTYRMLLRDYYPPLRRNEYTISYVARAFNVDEA KQLIKTKPQYLSLNEMFLVANTYPKDSGEFKEVFDIAARIYPDDPVARLNTAALELENGA IDAAIVRLQKSDMPEAWNNLGIAYILKQDYKKGGEYLEKAIDAGIQAAAYNMGQLAAWLK TQE >gi|226332019|gb|ACIB01000037.1| GENE 132 150974 - 152533 1467 519 aa, chain + ## HITS:1 COG:no KEGG:BF2126 NR:ns ## KEGG: BF2126 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 519 1 519 519 1015 99.0 0 MKVEWKKLMFGLAIGAGMMMMASCSDEENFDPGGSTGKETPEPTEKAFLSMRLSTGSLAG TRVAVDPTTSGRTEEQVVNHVRMVLYETKNNTVRYSWDLNVSTDGMNEFTGGDVVRGEDV PSATPTVSRFVTVGREVVKQDYELLILINPPGELLEITGQGNPRSYLSRAANMTKESLIQ PYGIAADNNFYMTNHQDLIFVPEVELRDNQRMAEENPVRVEVERAVAKVVVSGVPEVVPH GDRIDNLKWGLDVTNMYTYWMRKMTFIANSGGVPNEMEQLNAGYREERYAEDPNFTRFSS WNGGNPVGQFEYLSGTPELSKNFDDYDYTLENTMDAADQRHDVTTRVVISGTYTPNGFGS VATRNGGGISFYYFKGNAIRVEAMRDMVNDRGQIPQELRDAGLEQAIENVLAWNPNAFNS PTASFSEGGINFYYQGVCYYTVLIRHFSNNMVPVLMGYGRYGVVRNNVYQLSINKIIGPG QPVINPPGTDPDDEDTSWISADVNIMRWYIRNQNVEELL >gi|226332019|gb|ACIB01000037.1| GENE 133 152565 - 153545 816 326 aa, chain + ## HITS:1 COG:no KEGG:BF2185 NR:ns ## KEGG: BF2185 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 326 1 326 326 642 99.0 0 MNRFIGYIQVACCCLLLCASCVRDGMDEDCNSYVRFVYDYNLEYIDLFHKQATKMNLYVF DEKGVFVTELKEESGAFAPDYLMTLPGAMAGRRYIFVAWSGLYGESYDKVTLTPGVSTLE DLEVSVNNLKTWIGGGVVDRELHLLWHGKQTEVSPQYNNDITTVSLLKNTKKFRIIMQML DDSSIHVDDYDFRIISPNGRYNHENGLLGDETDEKVEYTAYHTEDDPETGAIAELNTLRL MTDTENRLVITHKSSGNVILDIPLNKYLNALRLQQYADIPLQEYLDRADKHGIILFFKGM 
DGNGNYISVDVQINGWLIRKQEVDGV >gi|226332019|gb|ACIB01000037.1| GENE 134 153660 - 154619 447 319 aa, chain - ## HITS:1 COG:no KEGG:BF2186 NR:ns ## KEGG: BF2186 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 319 1 319 319 610 100.0 1e-173 MKNYIVNELIAAMKERIPRGINLANYLTDALCMGKEAVYRRLRGEVAFTFDEIAMISCKL GISIDQIIGNHQSNRVTFDLNLLHSPDPLESYYEIIERYLRIFNYVKDDISTKIYTASNV IPFTLYSSYEYLSKFRLCRWIYQNGKIRTPNSLSGMHIPDKAVHAHKLLSEAVKACRKTC FIWDSNVFYSFVKEMKYFAGLNLISETDLIHLKNELELLLHELEQISAKGEFSNGNKVAI YLSNIDFEATYSYIEKKDFQISLLRVYSINSMDSQSPRICGIQKDWIQSLKRHSTLISES GESQRITFLEQQKSFIDTL >gi|226332019|gb|ACIB01000037.1| GENE 135 154869 - 155828 724 319 aa, chain - ## HITS:1 COG:no KEGG:BF2187 NR:ns ## KEGG: BF2187 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 319 1 319 319 577 99.0 1e-163 MITNELNIGLIEAAKEKMPTGTNLANTLMDILYIGKEAIYRRLRGEVPFTLAEAAVISRK MGISLDKMIGVSFSNNAVFDLNVVHHTNTFETYHDILTKYVDAFDNIREDPTTEMATSSN ILPQALYLKHDVLSKFRLFKWMYQNENIKCKHFDELEIPHKIYNIQKDFVNMTQQMKTTD YIWDNTVFEHVVRDIQFFSEIHLVSEEDKELIKDDLLLLTDELEELAGKGKYETGNDVRI YISNIKFDATYSYVATSNSHISMIRIYSINAITTQDDGMFRSLKEWVQSLKKFSTQISES GEMQRIRFFNEQREIINTL >gi|226332019|gb|ACIB01000037.1| GENE 136 156080 - 156397 176 105 aa, chain + ## HITS:1 COG:no KEGG:BF2131 NR:ns ## KEGG: BF2131 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 105 1 105 105 204 100.0 1e-51 MDLEKVLIREINNDSRIFLYKEGDCWSAHDNSARHLCFLYSQFNAYDRIYQAYEIVLKCV MLSNAMIEKFIEHTLVSTVHEDEIEICIPKEKRAEFESWRSTSGV >gi|226332019|gb|ACIB01000037.1| GENE 137 156538 - 157476 525 312 aa, chain + ## HITS:1 COG:XF0611 KEGG:ns NR:ns ## COG: XF0611 COG0451 # Protein_GI_number: 15837213 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Xylella fastidiosa 9a5c # 3 309 22 328 329 473 66.0 1e-133 
MKRILVSGGAGFIGSHLCTRLINEGHDVICLDNFFTGSKENIIHLMDNHHFEVVRHDITF PYSAEVDEIYNLACPASPIHYQYDAIQTIKTSVMGAINMLGLARRLNAKILQASTSEVYG DPEVHPQPESYWGNVNPIGIRSCYDEGKRCSETLFMDYHRQNNVRIKIVRIFNTYGPRML PNDGRVVSNFLIQALKNDDITIYGTGEQTRSFQYIDDLVEGMIRMMNTGDDFIGPINLGN PNEFSMLQLAEKIIQKTGSKSKITFKPLPHDDPQQRKPDIRLAQEKLGWQPTILLDEGLD RMIDYFKMKYKL >gi|226332019|gb|ACIB01000037.1| GENE 138 157511 - 158638 922 375 aa, chain + ## HITS:1 COG:CAC3391 KEGG:ns NR:ns ## COG: CAC3391 COG0642 # Protein_GI_number: 15896632 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Clostridium acetobutylicum # 130 372 336 576 579 125 36.0 9e-29 MNMEINPSEYKILIVDDVMSNVLLLKVLLTNEKFNIVTASNGNQALDQVKKENPDLILLD VMMPDMSGFEVSQKLKADPEAAHIPIIFLTALNSTADIVKGFQVGGNDFISKPFNKEELI IRVSHQISLVAAKRIIEAKTEELKKTIIGRDKLYSVIAHDLRSPMGSIKMVLNMLILSLP KEKIGEDMYELLTMANQTTEDVFSLLDNLLKWTKSQIGKLKVVYQDIDMVEVVEGVGEIF AMVAGLKNIRLRIESPECQAVHADIDMIKTVIRNLISNAIKFSNEGSEVLIKVEESDGMS VVSVKDSGCGIDEESQKKLLHTDTHFSTFGTNNEEGSGLGLLLCQDFVVKNGGKLWFTSV KDEGSTFYFSIPLKK >gi|226332019|gb|ACIB01000037.1| GENE 139 158883 - 161756 1890 957 aa, chain - ## HITS:1 COG:no KEGG:BF2191 NR:ns ## KEGG: BF2191 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 957 1 957 957 1883 99.0 0 MKTFLQLVAQDLYCKIGNDLSRTAIIFPNKRASLFFNEHLANQSDQPLWSPAYLSISELF QHLSVLKLGDPIRLVCELYKIFREETNSDESLDDFYFWGELLISDFDDVDKNLVHADKLF TNLQDLKNVMDDYEFLDQEQEQAIQQFFQNFSIEKRTLLKEKFISLWDKLGDIYHRYHKK LEELGFAYEGMLYRNVIEQLEPDSLKYDCYVFVGFNVLNKVETHFFQQLQNAGKALFYWD YDVFYTQLPSRQKQRHEAGEFINRNLKLFPNELPAELFNELIKPKKVRFISSPTENAQAR YLPQWVHENLSNEEKENAVVLCNEALLLPVLHSIPEVVRNVNITMGFPLAQTPVYSFINA ILELQTSGYRTDSGRYIYDAVQTVLKHPYTRRLSDKAEPLQRELTKTNRFYPFPSELKKD KFLDILFTPRNGIRELCVYITELLKEVSVLYRQEQESDDIFNQLYRESLFKSFTLVNRLL NLIDNNELQVRIETLKRLLNKILNAANIPFHGEPAIGMQIMGVLETRNLDFRNLLLLSLN EGQLPKSGGESSFIPYNLRKAFGMTTIEHKNAVYAYYFYRLIQRAENITLMYNTSSDGLN RGEWSRFMLQFLIEWPHEISREYLEAGQSPQNSKEIRITKTPEIIDRLYRTYDFSRNPDA 
LILSPSALNTYLDCRLKFYFRYVARLKAPDEVSAEIDSALFGTIFHRSAQLVYLDLTANK RDVHKEDLERLLRDNIRLQNYVDIAFKEIFFHVPIDEKPEYNGIQLINSKVITSYLRQLL RNDLQYAPFRMMGMEQEVVEDIRIEGPVGKLSLRIGGTIDRMDSKEGTLRIVDYKTGGSP KVPANIEQLFTPAEGRPNYIFQTFLYAAIMARQQALKVAPSLLYIHRAASESYSPVIEIG EARKPKLPVDDFSVYEDEFRERLLKLLEEIYDDKEEFTQTEDTKKCEYCDFKAMCKR >gi|226332019|gb|ACIB01000037.1| GENE 140 161753 - 164926 1998 1057 aa, chain - ## HITS:1 COG:Cj1481c KEGG:ns NR:ns ## COG: Cj1481c COG1074 # Protein_GI_number: 15792796 # Func_class: L Replication, recombination and repair # Function: ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) # Organism: Campylobacter jejuni # 4 821 7 711 921 125 22.0 3e-28 MSELIVYKASAGSGKTFTLAVEYIKLLIRNPRAYRQILAVTFTNKATAEMKERILSQLYG IQIGDPDSDAYLKRIIAETGHSEDEIRTTAGIALGYMLHDYSRFRVETIDSFFQSVMRNL ARELELSPNLNIELNNVEVLSDAVDSMIEKLGPNSPVLVWLLDYIDERIADDKRWNVSDE IKSFGRNIFDEGYIEKGDGLRRRLRDPNVIHNYRKTLKEMETAALEQMKEFAQQFENVLS SQSLKPTDLKNGAKGIGSYFNKLKNGILGDEIVNATVIKCLDDETNWAAKTSKQYTDIIL LASSILMPLLQNAEQYRSRNNRIVNSCRLSTQHLNKVRLLTNIDEEVRQLNRENNRFLLS DTNALLHQLVKDGDSSFVFEKIGTNIRNVMIDEFQDTSRMQWDNFKLLLLEGLSQGADSL IVGDVKQSIYRWRNGDWGILNGLNKQLGYFSIRTETLKTNRRSETNIIRFNNSIFSAAVD YLNEMYNKQLGSICEPLINAYADVEQESLRNKQQGYVKVEFLEPDEEHDYTEQTLISLGM EVEHLLQSGVKLNDIAILVRKNKSIPRIADYFDKQLNYKIVSDEAFRLDASLAICMMLDA LRYLSDPENRIVKAQLATNYQLQILHSEYDLNSLLLHKAEELFPPAFLERMAELRLMPLY ELLEELFSLFELHRIEQQDAYLFAFFDAVTDYLQSHSSDPDSFIRYWNETLSGKTIPSGE VEGIRIFSIHKSKGLEFHTVLLPFCDWKLENETNNQLVWCVPQEAPFNELDIVPVNYSSA MAESVYRTDYLHERLQLWVDNLNLLYVAFTRAGKNLIIWSRKGQRNTMAELLTGALPQAA NKLDQEWDEEQVYELGDLCPSENEKKIDSGNKLTRKPEKLPINMESMHPDIEFRQSNRSA DFIKGLSEEESDDRFINHGQLLHTLFSAIETKDDIEPAIQRLIFEGIIGSKEAEEQIRSL TVKAFSLPEVQEWYSGEWRLFNECAIIYKDKGVLQTRRPDRVMMKNEQVVVVDFKFGKAN KKYNKQVKGYMQLLSRMGYKNITGYLWYVEEEIIEKV >gi|226332019|gb|ACIB01000037.1| GENE 141 164950 - 166527 1308 525 aa, chain - ## HITS:1 COG:XF0847 KEGG:ns NR:ns ## COG: XF0847 COG3525 # Protein_GI_number: 15837449 # Func_class: G Carbohydrate transport and metabolism # 
Function: N-acetyl-beta-hexosaminidase # Organism: Xylella fastidiosa 9a5c # 25 506 89 579 841 248 34.0 2e-65 MKNYLGLIFLLFAFTATAQNNRSALLPMPNHIEQVQGKPFSLTGKNITIHPGQPELKFAA TTLQSILKDRMQVDIPLSGSRQSPIRLIIDPQLEGKEHYQLKVDQKGMTISGASAAAVFY GVMTVDQVLLGDVCSSNRKEMTPISIDDAPRFGYRALMLDPARHFLPIEDVKFYIDQMVR YKYNVLQLHLTDDQGWRIEIRKHPKLTAGQSFYTQEELADLIRYAAERHVEIVPELDIPG HTVAVLAAYPELGCTHTDTIAKNVGETVNLMLCANNEKVYEVYNDIIDEVSALFPSRYIH LGGDEAVIEKNWTKCERCQKMMKELKYEKASQLMIPFFSRMLSFVEADGKYPILWCELDN IRMPANDYLFPYPKNVTLVSWRYGLTPTCQKLTQQHGNPLIMAPGEFAYLDYPQFKGDLP EFNNWGMPVTTLETCYQFDPGYGKPAAEQAHILGVMGTLWGEAIKDINRVTYMTYPRGLA LAEAGWTQMEHRNWDSFKERLYPNLNNLMKKGVSIRVPFEIVKRK >gi|226332019|gb|ACIB01000037.1| GENE 142 166784 - 169423 2030 879 aa, chain + ## HITS:1 COG:no KEGG:BF2137 NR:ns ## KEGG: BF2137 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 879 1 879 879 1782 99.0 0 MRLKTILLTTMATGSFLCEPVAAMCIEPPATPDMGWFLKKKKKSNPQDSIKVKNEYEKLT GSDSVVRRGMFNVYQKKNDYYFEIPSTLLGRDMLVVNKLQRVPAELNEAGVNRGTNYENQ MIRFELDKSANKLLIRQSRPLPISPSEDAISQSVKDNYISPLIAGFKVEAYNNDSTSMLI KVNDIYDGTETSINNVFTNINLGTSAIKNLSRILSIKSFDNNVVATSELTTRVTEGTTTI YVTVEVSSSILLLPEVPMTGRLDNPRVGYFTNPLTNFSDGQQRVNKKQFITRWRLEPRPE DRAAYLRGELVEPRKPIVFYIENSTPYRWRKYIRQGIEDWQVAFERAGFKNAIIAKDITE DMEVDMDDVNYSVLTYAASTKANAMGPSILDPRSGEILEADIMWWHNVLSMLQEWITVQT GVVRPEARGVALPDSLMGDAMRFVACHEVGHSLGLRHNMMGSWAFPTDSLRSKTFTDRMN STSSSIMDYARFNYVAQPGDGIKALSPHIGPYDMFAIEYGYRWYGKQTPEEEKELLQDFL AKHTDRLYKYSEAQDPRDAVDPRAQNEDLGDDPIRSSQYGIANLKCIVPQIIQWTTTGEK GQTYEEASRLYYAVINQWNNYLYHVMANIGGIYIENTTVGDGEKTYTFVEKEKQQAALRF LLDEVLCYPKWLFDPEIAQYTYLLKNTPLGVVENAPTQVLKNAQAYVFWDLLSNNRLMRM LENESVNGKKAFTAVELMDGLHKSIFAVTERGGLPDVMTRNLQKGFVDALITAAAESEGV KVNKKLIDNHFLFDLQTPICSCDDHAHRSAHTDRMGARRELNFYGSQINRISDAISVKRG ELLRIKDLLQSRLGTSDVATKYHYKDLILRINTALGISK >gi|226332019|gb|ACIB01000037.1| GENE 143 169552 - 172857 2536 1101 aa, chain + ## HITS:1 COG:no KEGG:BF2195 NR:ns ## KEGG: BF2195 # Name: not_defined # Def: hypothetical protein # Organism: 
B.fragilis_NCTC9343 # Pathway: not_defined # 1 1101 1 1101 1101 2162 99.0 0 MKRHVFILLLSFAGVLTSAFAASRQVQGVVISSEDNMPLIGASVYIKAEDLSKDGNSPTI TGVITDIDGKFNISVPEGVTRLFCSYVGHEVQELKLVPGKNQYEITLFPSAQMLDAVVVT GYQTVERRKLTAAVGKLNISDETIGAVKSIDQALAGQIAGLSVTSTSGAPGAPAKIRIRG TSSLNGTQDPLWVLDGIPLEGTDVPQSNVLNDVSNIQQSSIAGLNPADIENITVLKDAAA TAIYGARAANGVIVITTKKGKVGKPVINFSSKFTYIPTLSTNRLNMLNSQEKVDLELELL RSNFAYGDNKGGVSKIISGYGLTDAYKKGGWGALTPEAQTDISRLRNTETDWGDILFRDA FNQEYSLSLSGGNERVTYYTSIGYYQENGNVKGVGLDRLNVVAKTSYKVNRMLKFGVSLF VNRRNNKTYLTDTYGLVNPVYYSRKANPYYQPFDANGNYVYDFDVQNNSDTDLGFNIFEE RKNTSNEETINALSSIFDAELRFNDKLKFTTQLGLQLDKASKEQIADKESFSMRIIRKNS KYWDSASQSNKYFIPDGGVHKAYENTNSQITWKAMGEYRDSFNDIHELEVMVGTELRKTW YETLFSAGYGFDRQTLTTKPVVFPDEDRARQFPLHQKTYKENAYVSFFSTASYSLMNRYT FGGSIRFDGSDLFGVDKKYRYLPLYSVSGLWRLSNEPFMQGTRKWMDNLAFRVSYGIQGN IDKNTSPFLLGKYIVDNILPGGSEHMIDINSAPNKKLRWEKTQSVNVGLDFSVLNQALNL SVDYYYRKGTDLIGKQMLPLETGFVSTNINWASMVNKGVEVSLSTRNVATKNFSWYTNLN FAYNNNKVLREAIPEAQTIPGREGYPVDAIFAIKTAGLDEEGYPLFYDKEGKKVTLKELY RLQDPFGLGFTVNSDVTPAEERSFYSYIGSQDTPYTGGLINTFSYKNWELTANLSFNLGG YVRTTPSYNFINFDRGQNVNSDILDRWTPENTDGRLPALITSEKRADEYYWYDQKSEIYK NLDIWVKKLNYFRLQNLRLGYRLPEKMTKSLGMGSASVAIEGRNLLVFGSSYKNFLDPES MYNPYAPPIPKSITFSLNLNF >gi|226332019|gb|ACIB01000037.1| GENE 144 172880 - 174211 1115 443 aa, chain + ## HITS:1 COG:no KEGG:BF2139 NR:ns ## KEGG: BF2139 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 443 1 443 443 863 100.0 0 MKKIYVLALLSCLLMLSACDSYLDIRPVGSVIPQTAEEYRALLARAYLNVPNDRGLACLR SDEMLVNDNEYDRNSYGDIERWNDVSPFPGTSQFTWSNFYNVLFIANQVIESQKEITEGT PEVVNQLVGEAHLLRAYLHFVLVNLHGQPYTKSGALNSKSIPLKLDTDLEKTLGRNTVEE VYTSILSDIEHARELINKEKWETVFSYRFNVLSVDALQSRVSLYMGAWPKCLESAEAVLA KKSVLVDMNETPLALPNHFESVESITALEQVMGSSVNNAVWVPATFLALYQEGDKRLAAY FAAPDENGNRKSSKGGKREFSCTFRVGELYLNAAEAAANMDKLPHARMRLLELMRKRYTP EAYAKKENAVNVMDKNALISEILNERARELAFEGHRWFDLRRTTRPRMVKVLQGKTYILE QDDPRYTIPIPRDAIAANPGLAN >gi|226332019|gb|ACIB01000037.1| GENE 145 174714 - 176084 583 456 aa, chain - ## HITS:1 
COG:VCA0762 KEGG:ns NR:ns ## COG: VCA0762 COG2425 # Protein_GI_number: 15601517 # Func_class: R General function prediction only # Function: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain # Organism: Vibrio cholerae # 127 454 125 467 481 148 29.0 2e-35 MDWKIRNIRLQELREIYQEKLKNIAYRVYESHFQNGIVKQEELEGEIMSYYQHTQPSLQE FYSHYATQWEHFYEGHELTDSAFLRFLENSAYPLQMKYNRGDLNLQYYIDRFHTLKKRSK EWKHLRNLFFDKWYHLLANNEYNYQIERINNLCERFYRLQKNIADQLPQRGNARLMWLLR THQELAKQLFHYDEIAKNHPAIRELTKILGKQHYGKEKKFRMVAGIHREQIITHATKSDI TGVCEGNDLNSLLPIEYCYLSDPALQPLFFERFNKKKLQMMDYESKDQHRIKDIKIQGNE IVEEQSGPFIICVDTSGSMSGEREEFVKSAILAIAELTEQQDRKCYLINFSNDIACIEIE RLGQNIQELANFLCQSFHGGTDLTPALLHAIHILKTKSYRNADLVMMSDFEMPPLNEELS EEIKKIKQNKTHLYALSVHKQSENTYLNVCNKFWFV >gi|226332019|gb|ACIB01000037.1| GENE 146 176059 - 176226 186 55 aa, chain - ## HITS:1 COG:no KEGG:BF2198 NR:ns ## KEGG: BF2198 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 55 470 524 524 107 100.0 1e-22 MTDIELTYCKEHLFMDDKQRNMVKQILNRQKEMLEIYQNEIREIAYTHGLENKEY >gi|226332019|gb|ACIB01000037.1| GENE 147 176521 - 177633 702 370 aa, chain - ## HITS:1 COG:VCA0763 KEGG:ns NR:ns ## COG: VCA0763 COG0714 # Protein_GI_number: 15601518 # Func_class: R General function prediction only # Function: MoxR-like ATPases # Organism: Vibrio cholerae # 8 354 19 367 552 296 43.0 4e-80 MKTVKTHITQLLHAMNKGIFEKEHPIALSLLSAISGESIFFLGPPGVAKSLIARRLKLAF DQSTAFEYLMSRFSTPDEIFGPVSISKLKDEDKYERIIEGYLPSATIVFLDEIWKAGPSI QNSLLTVINEKVYRNGQYTIQLPLKGLIAASNELPAQGEGLEALWDRFLIRYFIGNIEQE FAFDQMIASVNDMEAEIPTGLSITEEQYTDWRTQISQIKIHYTVFELIHSIKRQIEKYNI QKEEVPHSTLYISDRRWKKIVSLLRTSAFLNETDTIRFSDCTLLLHCLWNEIEQIPIIEQ MVSSALDECISHYLCGERTLEQKLSSIREDMKSEHSLRETKDTALQIVDTFYHQIERYPV AGNLLIFASD >gi|226332019|gb|ACIB01000037.1| GENE 148 177821 - 178603 722 260 aa, chain + ## HITS:1 COG:no KEGG:BF2199 NR:ns ## KEGG: BF2199 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 260 1 
260 260 517 100.0 1e-145 MELRTVNVTRYIMPLREGGSLPALAEADDSFKYVVKFRGAGHGTKALIAELIGGEVARVL GFRVPELVFLNLDEAFGRSEGDEEIQDLLQGSRGLNMGLHFLSGALPFDPVVTEVDEKLA SQVVWLDALLTNVDRTVKNTNMLMWHKELWLIDHGASLFFHHSWVNWHKHALSSFTQVKD HALLPLAGKLDEVDAEFRKLLTSEKIREIVDLIPDSWIEWRDKDETPQDIRDIYYRFLKE RIEHSEIFVKEAQHARKAYL >gi|226332019|gb|ACIB01000037.1| GENE 149 178581 - 178964 216 127 aa, chain + ## HITS:1 COG:no KEGG:BF2200 NR:ns ## KEGG: BF2200 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 127 1 127 127 246 100.0 2e-64 MPEKHIYEYAVVRIVPKVEREEFINVGVILFSKQAAFIRMRYEINKKRLEALSPEPDIDS FRKYLEAFSKVCAGCPTGGVIAKLEVPERFRWLTAHRSSCIQTSRPHVGYSDNLEETLER LFEELVL >gi|226332019|gb|ACIB01000037.1| GENE 150 178970 - 179689 419 239 aa, chain - ## HITS:1 COG:aq_999_1 KEGG:ns NR:ns ## COG: aq_999_1 COG1022 # Protein_GI_number: 15606303 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Aquifex aeolicus # 2 233 275 499 600 145 35.0 6e-35 MEIFGGNFDEIIIGGAPFNAEVEAFLKQIGFPYTIAYGMTECGPIICSSRWETLKQASCG KATSRMEVKIDSPDPENIAGEIICKGTNLMLGYYKNTEATSQIIDVNGWLHTGDLATMDS EGYVTVRGRSKNMLLTSSGQNIYPEEIESKFNNMPYVSESLVLLQKDKLVALIYPDFDDA FAHGLLQSDIEKIMETNRIELNQQLPAYCQITKIKIHFEEFEKTAKKSIKRFMYQEAKG >gi|226332019|gb|ACIB01000037.1| GENE 151 179664 - 180653 517 329 aa, chain - ## HITS:1 COG:VC2341 KEGG:ns NR:ns ## COG: VC2341 COG1022 # Protein_GI_number: 15642338 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Vibrio cholerae # 36 320 45 313 563 89 27.0 1e-17 MCRNYKMEQEQRFIGYIEQSIINNWDANALTDYKGITLQYKDVARKIAKFHIILEMAGIQ PGDKIAVCGRNSAHWAVTFLATVTYGAVIVPILHEFKADNIHNIVNHSEAKLLFVGDQVW ENLNEDRMPLLEGISSLTDFTPLVSRNDKLTYAHEHRNEIYGQRYPKNFRPEHISYRKDM PEELAVINYTSGTTGYSKGVMLPYRSLWSNIAYCHEMLPVKPGDHIVSMLPMGHVFGMVY DFLYGFSAGAHLYFLTRMPSPKIIAQSFAEIKPRVIACVPLIVEKIIKKDILPKLDNKIG KLLLRVPIVNDKIKAAARQGSNGNFWWKF >gi|226332019|gb|ACIB01000037.1| GENE 152 180849 - 182315 932 
488 aa, chain - ## HITS:1 COG:BH4038 KEGG:ns NR:ns ## COG: BH4038 COG3263 # Protein_GI_number: 15616600 # Func_class: P Inorganic ion transport and metabolism # Function: NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain # Organism: Bacillus halodurans # 1 486 1 482 490 300 37.0 3e-81 MIFTAENILLIGSILLFVSIVVGKTGYRFGVPALLLFLLVGMLFGSDGLGLQFHNAKIAQ FIGMVALSVILFSGGMDTKFKEIRPILSPGIVLSTVGVFLTALFTGLFIWYLSGMSWTNI HFPLITSLLLASTMSSTDSASVFAILRSQKMNLKHNLRPMLELESGSNDPMAYMLTIVLI QFIQSDGMGTGNIIGSFIIQFLVGAAAGYILGKLAILILNKINIDNQSLYPILLLSFVFF TFAITDLLRGNGYLAVYIAGMMVGNHKITFRKEIATFMDGLTWLFQIIMFLMLGLLVNPH EMIEVAVVALLIGVFMIVIGRPLSVFLCLLPFRKITLKSRLFVSWVGLRGAVPIIFATYP VVANVEGSNMIFNIVFFITIVSLIVQGTSVSFVARLLHLSTPLEKTGNDFGVELPEEIDT DLSDMTITMEMLNEADTLKDMNLPKGTLVMIVKRGDEFLIPNGTLKLHVGDKLLLISEKN KQETVKNE >gi|226332019|gb|ACIB01000037.1| GENE 153 182484 - 183674 859 396 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|168182407|ref|ZP_02617071.1| 50S ribosomal protein L18 [Clostridium botulinum Bf] # 1 396 4 423 447 335 42 1e-90 YMDSNHLSPLRKGVVGVQFLFVAFGATVLVPLLVGLDPSTALFTAGIGTLLFHLVTKGKV PIFLGSSFAFIAPIIKATELYGLAGTLSGMVGVAMVYFVMSALVKWQGIRLIERLFPPVV IGPVIILIGLSLAGTGVNMAKENWTLALLSLFTAVIVSIRAKGLLKLIPIFCGIIVGYIA ALIFYDVDMSGVRNAAWLGFPQFVFPQFSWEPILFMMPVAIAPVIEHIGDVYVVNTVTGK DYVKDPGLHRTLLGDGLACLCAGLLGGPPVTTYSEVTGAMSLTKVTNPQVIRIAAITAIL FSVIGKVSALLKSIPSAVLGGIMLLLFGTIACAGIANLVNNCIDLSRTRNIIIVSLTLTI GIGGAVLAWGEFSLSGIGLAALVGVGLNLVLPREER >gi|226332019|gb|ACIB01000037.1| GENE 154 183801 - 185432 1625 543 aa, chain - ## HITS:1 COG:CAC2750 KEGG:ns NR:ns ## COG: CAC2750 COG1151 # Protein_GI_number: 15896007 # Func_class: C Energy production and conversion # Function: 6Fe-6S prismane cluster-containing protein # Organism: Clostridium acetobutylicum # 1 541 1 527 530 737 64.0 0 MSMFCFQCQETAKGTGCILSGVCGKTPEVANMQDLLLFVVRGIAVYNQALRKDGRSSARA DKFIFDALFTTITNANFDKHAIIEKIKKGLELKKDLSNQVTIEHAPDECTWYGDETEFEE KAQTVGVLRTSDEDIRSLKELVHYGIKGMAAYVEHAYNLGYENPEIFAFMQYALAELTRE 
DITVDELITLTLATGNHGVQAMTQLDTANTSHYGNPEISEVNIGVRNNPGILVSGHDLKD IEELLQQTEGTGIDIYTHSEMLPAHYYPQLKKYKHLAGNYGNAWWKQKEEFESFNGPILF TTNCIVPPRPNATYKDRIYTTGATGLEGATYIPERKDGKQKDFSVIIEHARRCQPPVAIE SGKIVGGFAHAQVIALADKVVEAVKSGAIRKFFVMAGCDGRMKSRSYYTEFAEKLPADTV ILTAGCAKYRYNKLPLGDINGIPRVLDAGQCNDSYSLAIIAMKLQEVFGLKDINDLPIVY NIAWYEQKAVIVLLALLALGVKKIHLGPTLPAFLSPNVKQVLIDNFGIGGISTADEDIAK FLA >gi|226332019|gb|ACIB01000037.1| GENE 155 185948 - 186613 485 221 aa, chain - ## HITS:1 COG:CAC0884 KEGG:ns NR:ns ## COG: CAC0884 COG0664 # Protein_GI_number: 15894171 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Clostridium acetobutylicum # 1 221 6 226 229 117 28.0 2e-26 MTYLATNPLFHGISPETLSRDFDGIVSHLRMFRKGDILARQGDVCNRLMILLKGSVRGEM IDYSGRLIKVEDIIAPRAIAPLFLFGADNRYPVEVTANEATEVFEIPKESVLKLFRRNEK FLENYMNLSANYARTLADKLFFMSFKTIRQKLASYLLRMLKQQGDSPIQLDRSQQELADY FGVSRPSLARELAHMQDDGLIKTDRKLVHILRKEDMMQLIQ >gi|226332019|gb|ACIB01000037.1| GENE 156 186729 - 188042 927 437 aa, chain - ## HITS:1 COG:CC1742 KEGG:ns NR:ns ## COG: CC1742 COG5000 # Protein_GI_number: 16125986 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation # Organism: Caulobacter vibrioides # 112 436 363 698 716 118 26.0 2e-26 MKIHLKLLTERYWFRLGLSLCFAITAALSYADRDFIWMGLSLCLLLFSIWWQLSLYRIHT KRVLFMIDALENNDSAIHFPEEQIMPETREVNRALNRVGRILYNVKSETVQQEKYYELIM DCVNTGVLVLNENGAVYQKNNEALRLLGLNVFTHIRQLNKVDIQLMKKIEFCRPGDKIQT IFNNERGTINLSIRVSGITVREEQLRILAFNDINSELDEKEIDSWIRLTRVLTHEIMNSV TPITSLSETLLSLADTRDEEIRRGLQTISTTGKGLLSFVESYRRFTRIPTPEPSLFYVKA FIDRMVELARHQNKCDNITFHIDIAPADLIVYADENLISQVVINLLKNAIQAIDAQADGK IEIQGRCNAAEEVLIEIKNNGPAIPSDIADHIFIPFFTTKEGGSGIGLSISRQIMRLSGG SITLLQGKETKFILKFK >gi|226332019|gb|ACIB01000037.1| GENE 157 188039 - 189370 967 443 aa, chain - ## HITS:1 COG:STM4174 KEGG:ns NR:ns ## COG: STM4174 COG2204 # Protein_GI_number: 
16767428 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Salmonella typhimurium LT2 # 4 440 8 441 441 306 41.0 4e-83 MGTIIIVDDNKGVLTAVQLLLKNHFSKVITLSSPVSLSTVLREENPEVVLLDMNFTSGIN NGNEGLFWLHEIKRQYRDLPVVLFTAYADIDLAVRGIKEGASDFVVKPWDNQKLLETLLN AASQAKDGKKKNRKKESSPVSAMYWGESSAMQQLRTLIEKVATTNANILITGENGTGKEM LAREIHALSPRSAESMISVDMGAITESLFESELFGHVKGSFTDAHADRTGKFEAADRSSL FLDEIGNLPFHLQAKLLTAIQQRSIVRVGSNQSIPVDIRLICATNRNLQEMVDKGLFRED LLYRINTIHVEIPPLRKRKEDIVPLAERFIARFCKQYDKASISLSPAACEKLTAHAWYGN IRELEHAIEKAVIISDGETIPAEMFQLVQKTENPETETSTLEDMEKAMIRKALDKCGGNL SAVAAQLGITRQTLYNKMKKFGL >gi|226332019|gb|ACIB01000037.1| GENE 158 189509 - 190219 376 236 aa, chain - ## HITS:1 COG:no KEGG:BF2151 NR:ns ## KEGG: BF2151 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 236 3 238 238 469 99.0 1e-131 MIMKWLNFNSIIGMAVLSLLFYTENVAAQTDKNDTKQKIDTIQTTQSKYSKYDKRIHRFR KGWNSLIPTHNKIQYAGNMGMFSFGTGWDYGKRDQWETDLFFGFIPKHDSHRAKMTMTLK QNYMPWSLELGKGFSTEPLACGIYFNTVFGHEFWVHEPSRYPEGYYGFSSKIRTHIFLGQ RLTYDIDRERRFFAKSVTLFYELSTCDLLLISRVTNSYLRARDYLSLSFGLKFQWL >gi|226332019|gb|ACIB01000037.1| GENE 159 190194 - 191003 602 269 aa, chain - ## HITS:1 COG:no KEGG:BF2152 NR:ns ## KEGG: BF2152 # Name: not_defined # Def: calcineurin superfamily phosphohydrolase # Organism: B.fragilis # Pathway: not_defined # 1 269 1 269 269 552 99.0 1e-156 MRQIKGITAIFLCCLLVAGCDLIDYHPYDVDIKGERDINAKNIQKIEAKCLGKSTIRFIA MGDSQRWYDETVDFVNAVNKRDDIDFVVHGGDFSDFGLTDEFLWQRDIMNKLKVPYVGLI GNHDCLGTGEDAFRQIFGDTNFSFIAGGVKFVCLNTNAMEYDYSEPIPDFDYIERQLTER ADEFNKTVFCMHARPLCDQFNNNVAKVFQMYVRQFPGLQFCTVAHEHRISASDVFDDGVM YYGSNCMKNRSYLVFTIKPDGYDYEVVEF >gi|226332019|gb|ACIB01000037.1| GENE 160 191258 - 193546 1099 762 aa, chain + ## HITS:1 COG:no KEGG:BF2210 NR:ns ## KEGG: BF2210 # Name: not_defined # Def: putative ABC transport system, membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 762 1 
762 762 1523 99.0 0 MRQLYYTFRTLLRGRGVNLTKIVSLTLGLLVGILLFARVAFELSFDGHYKEVDRLCVINA VYYVGGEKGEAHQVVLAPVPGAIAEGFPDEIESSTVVRIRYGDNTLFYNNVRQCPKMILS DSLFFKTMGIDIISGDPRELVNPETMFVSRSFARRIFGDESPIGKTLLYNRTLPMTIKGV YEDIPENSSLYHEVVISFSTIEKHHWECMGWECGDSFQGYIRLKNASDLDKVNSRIDPII EKHLPFRPEEGFAIRYSLQPIRGVHASAPVIQKMVMIMSLLGLVILFIAAMNYVLISISS LARRAKAIGVHKCNGASERNIFSMFLWETGIIIMISLILVSVLVLNFREDIEYLASASIG ALFTWETLWVPICVIVILFIVAGIIPGHLFSSIPVTHVFRNYTERKGGWKRTLLFLQFTG VTFVLGILCVVLLQYNRITTKPIGYNPEGVAFTYHNFADSESALDNLRRLPMVSDVSDSE SSIISGYGGLAVTDENGIILFTTRLNACLYNYVPFMGIRIKEGRNLNGPDQALVNEEFVR QMRWTDGAVGKKYENFTIVGVMENFPVNSYYEEQDPVAFIGQQQINNCYHVRLKQPYEEN LRSLNKSMEEMYPTEDIVFKSLSYSIEDQYQNVRRFRDAVMLAFVVILLISLMGLIGYIS DEIQRRSKEIAIRKVNGAEVSHILNLLSKDIIWTASPAVLFGTAGAYFVGIQWLGQFAER ISLQVYWFVLIAIVVLAMIFLSAVIKVWHIANENPVKSIKSE >gi|226332019|gb|ACIB01000037.1| GENE 161 193566 - 194246 333 226 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 3 201 4 199 223 132 37 1e-29 MMMIQISDISKVFRTSEVETVALNHVSLNVEEGEFVAIMGPSGCGKSTLLNILGLLDNPT EGSYKLMGQEVAILHEKERTRVRKGKLGFVFQSFNLIDELNVYENVELPLTYLGIKASER RQMVNNILHRMNISHRAKHFPQQLSGGQQQRVAIARAVVTNPKLILADEPTGNLDSKNGA EVMNLLTELNHEGTTIIMVTHSQHDASFAHRTVHLFDGSIVASVKA >gi|226332019|gb|ACIB01000037.1| GENE 162 194260 - 196569 1363 769 aa, chain + ## HITS:1 COG:no KEGG:BF2212 NR:ns ## KEGG: BF2212 # Name: not_defined # Def: putative ABC transport system, membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 769 1 769 769 1464 99.0 0 MMKQFYYTIQTLLRGQGNNMIKIVSLTLGLFVGILLFSRVAFELNYDSYYQEPENLFLTL RTVVSQGEKKEPVCSNYGKLPAAIRENFPDEVEDATLIDLFSRSSLYHEGQEKKDAILAT SRSHIFSTLGVKVLSGNVSELDNMDALFISRSLAQSLFADADPIGKTVMINIDYPLTVRG VFEDIPENAEFRFDGVYSFVTRANRFRDERGGWRGDISYTCMVRFRHPEDVEKVAARMPD MMKKYIQYNKDWFEEFSFITPSQFHLQKKESRKIISILSILGFAILLIAGMNNVLISISS LPQRAKSVGVHKCSGASTGHIYRMFLWESALLILVSLLGVIVLLLYFKPEIEDLSGALLA TLFTWRTLWVPALVTIFLFLLIGLLPGKLFSSIPVTQVFHRFTGYRASWKYPLLFVQFTG 
VAFILGLLMIILLQYNQVMNRSMGYKIDNLVIGWGPFDSMDKIDGILRGLPFIEASCNSA SFIYNGHTRQSFTDINGKRFMGCIDFIDEHYVPILGLQILQGRNVHQDGEVLVNEELLRQ VGWTDSPIGRKLMEDNYEWGTVVGVVKDYVAQSAYQPQASVALVNSLERRWEANKRNLIL KEPFKENLSRIRALMKETFPTEDIQFRSARQEVDNLYQVVRRFRNIVIVASVSIVVIVLM GLFGFVNDEVQRRSKEIAIRKVNGAESGNIINLLNHNIFWIALPAIFVGIALAYVVGHKW VEQFTDQINLNAGHFLLLLLLILLLILGSVTGESWRIANENPVNSIKNE >gi|226332019|gb|ACIB01000037.1| GENE 163 196585 - 197259 338 224 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 199 4 199 223 134 37 3e-30 MIEINDISKVFRTSEVETVALNHVSLNVEEGEFVAIMGPSGCGKSTLLNILGLLDNPTEG SYKLIGQEVADLHEKERTRVRKGKLGFVFQSFNLIDELNVYENVELPLTYLNIKASERRR MVDNILHRMNISHRAKHFPQQLSGGQQQRVAIARAVVTNPKLILADEPTGNLDSKNGAEV MNLLTELNREGTTIIMVTHSQHDASFAHRTIHLFDGSIVASVKA >gi|226332019|gb|ACIB01000037.1| GENE 164 197276 - 199582 1416 768 aa, chain + ## HITS:1 COG:no KEGG:BF2214 NR:ns ## KEGG: BF2214 # Name: not_defined # Def: putative ABC transport system, membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 768 1 768 768 1490 99.0 0 MKQLYYTIQTLLRGRGSNIIKVISLTLGLLVGILLFARVAFELNYDSYYSEPENLCITLR TGISDGIKKEPNIDNYGKLAEAIRENFPDEVEDATVLDLFSQSPLYHEGKKMEHVILATS KSHVFSTLGIKVLLGDVSQLDRVDVIFISCSLARRLFADANPIGKTINIEIDYPLTVQGV FEDIPENTEFHFDGVYSFDTRSKQYGSERGGWGYDSSYHCMVRFRHPDEREKVEVRMPEM LKKYIHNYKGQSEEYSFMTPSQYHLQKPESRKIVMILSILGFAILLVAGMNNVLISVSSL AQRAKSVGIHKCNGASDGHIYRMFLYESALLILLSLLFVTVLLFTFKLEIEDLSGASLKA LFTWQTLWVPILVSLVLFLVIGLFPGKLFAAIPVTQVFHRFTAHRFVWKRSLLFIQFAGI AFILGLLMVILLQYHQVMTRDMGYKVDNLAVGWSPYREIDKMDGILRGLPIVEEFCNAST IIYGGYMGQPYTDAHGKEFMGRIEFVDEHYVPVMGLQIIKGRNIQQDKEILINEEMVRQI GWTDSPIGKNLEDGKNNFGTIVGVVKDYVVQSAYMPQAPVALMSNLEWMNVLNKRNIILK EPFGENLAKINTLMKEAFPTVDIVFRSARQEIDKQYQEVRRFRNVVIIASIAILLIALMG LFGFVNDEIQRRSKEIAIRKVNGAEVPDILRLVSGNIFWTALSAVLVGIVFAYIVSNKWL EQFSDRISVNGGHFLVVIIIILLLIMGSVIGRSWNVANENPVNSIKNE >gi|226332019|gb|ACIB01000037.1| GENE 165 199613 - 201907 1105 764 aa, chain + ## HITS:1 
COG:no KEGG:BF2158 NR:ns ## KEGG: BF2158 # Name: not_defined # Def: ABC transporter permease protein # Organism: B.fragilis # Pathway: not_defined # 1 764 1 764 764 1571 99.0 0 MKQFHYTIQTLIRDRRSCVIKVISLSLGLLVSIILFSRVAFELSYDNCFQDVDNLYIVKT EWIKDGVIKGNAGSYTLIPIASTVAEEFPKEVESAVCSSISFEAIFKIGNRKMNKSFILS DSLYFRTMGIEVISGNPNDLTNPDVLFLSQSVAREAFGEENPIGKTLHMMVWGTPVEALV KGVFADLPYNVSLERHEAVLSFASHSKYGWGRPGWTSGGNYNAFIRLKDGERSADVINTD IDKVIAKHIPSDMNMHLHMFVVPLRTIHLEHSDVKRTILILSLLGFAILFAATMNYVLIF VSSLSQRAKGIGIHKCNGASDKAIFSMFIYETALIIGVSLVLMIIFLFQFQEKIEELAEV PLSSLFTWHNLWAPLSVVTFLFVIGGILPGKVFSLIPVTQVFHPYIKKNRGWKRILLFIE FAGVAFIFGLMCVAYLQCHYIINRDMGYQPKGVASCKHDFAEPDNARNNLKSLPYVEGVA SIRGSMTWFGNREVTDEGGKLLFTPRCAAFDKDFVPLLGLHIKAGRNFTGERQFLVNQPY VEKMGWKGSGVGEIVPNRGTVVGVLAPFCCGVLPADDEPLEIEYGTNLRNVHVRLKEPFT ENLHRLNNEMKKIYPQEDIEFRSLEQDLERYYRPTIIFRDATFLAFITILFITLMGLIGY INDEVRRRSKEIAIRKINGAEARSILFLLSKDIFWVAILSVAIGTYGAYYMSQLWISQFE DTICVYAGWYVVTAICLLAFIFVFIIGRSWHIANENPVNSIKSE >gi|226332019|gb|ACIB01000037.1| GENE 166 201991 - 203355 1208 454 aa, chain - ## HITS:1 COG:CAC0883 KEGG:ns NR:ns ## COG: CAC0883 COG0534 # Protein_GI_number: 15894170 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Clostridium acetobutylicum # 6 431 3 429 448 335 42.0 1e-91 MAQQTDPRILGTEPIGKLLLQYSIPAIIGMTITSLYNIIDSIFIGHGVGPMAISGLAITF PLMNLVVAFCVLISAGGATISSIRLGQKDIKGATDVLGNTLMLCLTNAVLFGGLAYLFLD PILFFFGASTGTLPYARDFMQVILLGTPITYTMIGLNNVMRATGYPKKAMLTSLVTVIAN VIIAPIFIFHFGWGIRGAAMATVLSQFIGMIWVVNHFRNKESFVHFMPGFWKMKKRIIGS IFSIGMSPFAMNVTACIIVILINNSLQKYGGDMAIGAYGIINRLLMLYVMVVMGLTMGMQ PIVGYNYGAQKIDRVKHTLRLGIIVGVLITSSGFIICELFPHTVSAIFTDSDELIDMASS GLRICTLMFPFVGAQIVISNFFQSIGMAKISIFLSLSRQLVYLLPGLLLLPPLYGVKGVW ISMPVSDGLAFVTAVVILMVYIKKVKEKTSGQKL >gi|226332019|gb|ACIB01000037.1| GENE 167 203463 - 204125 714 220 aa, chain + ## HITS:1 COG:L150333 KEGG:ns NR:ns ## COG: L150333 COG0637 # Protein_GI_number: 15672725 # Func_class: R General function prediction only # Function: Predicted 
phosphatase/phosphohexomutase # Organism: Lactococcus lactis # 12 217 7 216 222 75 30.0 8e-14 MFMDATKKITALFDCDGVIVDTEGQYTVFWNEMGQKYVNDENFGSKVKGQTLVQIYDKYF AGEPEKQRDITEALNRFEIKMNYDYVPGIVEFIADLRRHGVKIALVTSSNTAKMENVYHA HPEFKSLFDEILTAERFKRSKPDPECFLLGMTIFGSDSKDSYVFEDSFHGLQAGRSSGAI VVGLATTNSREAIADKADYVIDDFRGMTYEKLLTITSRYI >gi|226332019|gb|ACIB01000037.1| GENE 168 204197 - 205192 816 331 aa, chain + ## HITS:1 COG:YPO1415 KEGG:ns NR:ns ## COG: YPO1415 COG0167 # Protein_GI_number: 16121695 # Func_class: F Nucleotide transport and metabolism # Function: Dihydroorotate dehydrogenase # Organism: Yersinia pestis # 1 330 1 336 336 213 36.0 6e-55 MYKQTIRPVLFLMEPEKVHALLVSCLKCYRHLPWCRCWIRHLYTCSDKQLIWNHLTFRNR IGLSAGFDKGAEIFDELADYGFGFIEVGTVTPDSQDGNPRPRIFRLPQCESLISRTGFNN PGLDVIKRRLEQKSGSYVLGVNINKNPSSEGEQAVADFLRLYKELHPHVGYFTLNWGSVD VALMKQVLQGLAAFRVEQNIHVPLLLKLPADITEEGMDDVIDCTRLYRVDGVIATGPTME RSCLKGYSPAQLQKIGSGGISGRGIGERSLKAVSYLRAHAGKSLLIVGAGGIITPADARR MLDAGANLIQIYSSFIYEGPGIVKKMIQEIK >gi|226332019|gb|ACIB01000037.1| GENE 169 205207 - 205935 742 242 aa, chain + ## HITS:1 COG:AGc2981 KEGG:ns NR:ns ## COG: AGc2981 COG0657 # Protein_GI_number: 15888930 # Func_class: I Lipid transport and metabolism # Function: Esterase/lipase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 23 239 70 303 310 128 33.0 1e-29 MEQSFIEYSLGKDASSAVLWVYPVRKPRGKAIIMCPGGGFNQIASDHEGHDFAAWFNNQG ITYAVLNYRMPHGDVEVIREDIREAIRLIRRQSAEWGIHQLGVMGASIGGYIAATAATLY TGTDRPDFQVLLYPVISMTDRLTHWPSRERMLGETISEGLKETLSLELHVTADTPPTFIV LAEDDQAVSPLNSIVYYTALLKHGVSAGLHIYPEGGHSFGFRDSFIYKKLWTDELQKWLL TF >gi|226332019|gb|ACIB01000037.1| GENE 170 206027 - 206848 904 273 aa, chain + ## HITS:1 COG:Cgl0115 KEGG:ns NR:ns ## COG: Cgl0115 COG0413 # Protein_GI_number: 19551365 # Func_class: H Coenzyme transport and metabolism # Function: Ketopantoate hydroxymethyltransferase # Organism: Corynebacterium glutamicum # 8 273 5 269 269 245 49.0 5e-65 MAGYISDDTRKVTTHRLIEMKQRGEKISMLTSYDYTMAQIVDGAGIDVILVGDSASNVMA 
GNVTTLPITLDQMIYHGKSVVRGVKRAMVVVDMPFGSYQGNEMEGLASAIRIMKESHADA LKLEGGEEVIDTVKRILSAGIPVMGHLGLMPQSINKYGTYTVRAKDDAEAEKLIRDAHLL EEAGCFGLVLEKIPAALASRVASELTIPVIGIGAGGDVDGQVLVIQDMLGMNNGFRPRFL RRYADLYTVMTDAISHYVSDVKNCDFPNEKEQY >gi|226332019|gb|ACIB01000037.1| GENE 171 206856 - 208022 680 388 aa, chain + ## HITS:1 COG:STM2280 KEGG:ns NR:ns ## COG: STM2280 COG0477 # Protein_GI_number: 16765607 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Salmonella typhimurium LT2 # 1 186 1 186 396 81 30.0 3e-15 MATKLWTLHFMRICLANLLLFISLYLLYPVLPVMMASRLGVPVSQTGVIFIFFTLAMFFI GPFHAYLVDVYKRKYICMLSFGVMVAATAGYTLVQNATHLLMLCIVQGVSFGMAATAGIT LAIDITNSTFRSAGNVVFSWAARLGMIIGAALGVYLFRTHGFETLLYVAVALGALGILSV SRVYVPFRAPIGMKVCSMDRFLLPRGLIPAFNLILIAFIPGLMLPVLAGAPSDVPVGGET VPFFALVGCGFLLSVLIVKLFFRYDNKMWLQIVVGLVTVIGSMAMLFSPETSWNAPAAVL MGLGLGLVTPEFLMMFVKLSQHCQRGTANTTHLLAWELGVGLGIASACHLHLTANEQAVY RVGLLSAIVSLAFFVLLTYPYFKRKKVR >gi|226332019|gb|ACIB01000037.1| GENE 172 208111 - 208497 303 128 aa, chain - ## HITS:1 COG:SA1398 KEGG:ns NR:ns ## COG: SA1398 COG0818 # Protein_GI_number: 15927149 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Diacylglycerol kinase # Organism: Staphylococcus aureus N315 # 10 123 1 114 114 82 40.0 2e-16 MEKFSTRKRIRSFGYAWKGIRSFVSKEHNAWIHCTAIIIVTVAGFCFGITRNEWIAIILC FGVVLAAEGFNTAIERLVNLVSPERNPIAGDVKDIAAGSVLICAIVAAIVGIIIFMPYVL AVLLCNMG >gi|226332019|gb|ACIB01000037.1| GENE 173 208500 - 210713 2004 737 aa, chain - ## HITS:1 COG:lin1558 KEGG:ns NR:ns ## COG: lin1558 COG0317 # Protein_GI_number: 16800626 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Guanosine polyphosphate pyrophosphohydrolases/synthetases # Organism: Listeria innocua # 61 736 51 735 738 391 33.0 1e-108 MERDEFFTKEERELLFSLYKKLLRLTGETLQKGDCRKLKKHLIDSTQNNTMQRDSFGLNP 
VIKDMQTAVIVAEEIGMKRASILGIMLHTPVRCHSYTIEYIQQEYGEDVAGIIRGLIKIN DLYDKSPTIESENFRNLLLSFAEDMRVILIMIADRVNVMRQMKDAENDEARRRVANEAAY LYAPLAHKLGLYKLKSELEDLSLKYTEHDIYYHIKEKLNETKKSRDRYIANFIAPIQQKL EEAGLHFHMKGRTKSIHSIYQKMKKQKCQFENVYDLFAIRIILESQFEKEKQECWQAYSI VTDMYQPNPKRLRDWLSVPKSNGYESLHITVMGPEGKWVEVQIRTERMDDIAERGLAAHW RYKGVKGESGLDEWLTSIREALENTENDLEMMDQFKLDLYEDEVFVFTPKGDLFKLGKGA TVLDFAFHIHSKLGCKCIGAKVNGKNVQLRQKLNSGDQVEIMTSNTQTPKQDWLNIVTTS KARTKIRQALKEMVARQHDFAKETLERKFKNRKMEYDEAVMMRLIKRLGFKNVTEFYQKI ADEVLDINDILDKYIEQQKRDSERDEVTYRSAEEYNLQNQIDETTVTKEDVLVIDQNLKG LDFKLAKCCNPIYGDDVFGFVTVSGGIKIHRNDCPNAGQMRERFGYRIVKARWAGKSEGT QYPITLRVVGHDDIGIVTNITSIISKENGISLRSIGIDSNDGLFSGTLTIMVSDTGRLEA LIKKLRTVKGVKQVSRN >gi|226332019|gb|ACIB01000037.1| GENE 174 210821 - 212338 1653 505 aa, chain + ## HITS:1 COG:no KEGG:BF2167 NR:ns ## KEGG: BF2167 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 505 1 505 505 1010 99.0 0 MITPEDKELLAKKGISEVQIAEQLACFQKGSPYLKLDAAASVEKGILAPDAEEQKAYLAA WDAYTNSDKTIVKFVPASGAASRMFKNLFEFLDADYTEPTTKFEQTFFESIERFAFYDDL NTACVRTEGKGIPTLIAEGNYKAVVSGLLNVAGLNYGALPKGLLKFHKYEEGSRTPLEEH LAEGAMYAAGKSGKVNVHFTVSTEHRELFKSLVTEKVDAFAKRYGVDYNITFSEQKPSTD TIAADMENQPFRDNGKLLFRPGGHGALIENLNDLDADVIFIKNIDNVVPDKLKGDTVLYK KLIAGVLVSLQKQAFQYLELLDSGRYTHEQVMDILQFVQKKLFCKNPETKDLEDAELVIY LKNKLNRPMRVCGMVKNVGEPGGGPFLAYNSDGTISLQILESSQIDMNNPEAKEMFEKGT HFNPVDLVCAVRDYKGHKFDLAKYVDKATGFISYKSKSGKDLKALELPGLWNGAMSDWST VFVEVPLSTFNPVKTVNDLLREQHQ >gi|226332019|gb|ACIB01000037.1| GENE 175 212408 - 212653 294 81 aa, chain - ## HITS:1 COG:asl4022 KEGG:ns NR:ns ## COG: asl4022 COG0724 # Protein_GI_number: 17231514 # Func_class: R General function prediction only # Function: RNA-binding proteins (RRM domain) # Organism: Nostoc sp. 
PCC 7120 # 1 80 1 80 94 82 55.0 2e-16 MNMYVGNLSYNVKESDLRQVMEEYGVVESVKLITDRETRRSKGFAFVEMPESSEASNAIK ELNGAEYAGRPMVVKEALPRN >gi|226332019|gb|ACIB01000037.1| GENE 176 212788 - 212922 61 44 aa, chain + ## HITS:1 COG:no KEGG:BF2169 NR:ns ## KEGG: BF2169 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 44 25 68 68 64 97.0 1e-09 MLQSKALIPDNTPYPLLFILLNILYCVIRAKINISLWLYVSYLL >gi|226332019|gb|ACIB01000037.1| GENE 177 213301 - 214467 959 388 aa, chain + ## HITS:1 COG:alr1299 KEGG:ns NR:ns ## COG: alr1299 COG0027 # Protein_GI_number: 17228794 # Func_class: F Nucleotide transport and metabolism # Function: Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) # Organism: Nostoc sp. PCC 7120 # 2 387 9 388 391 427 57.0 1e-119 MKKILLLGSGELGKEFVISAQRKGQHIITCDSYAGAPAMQVADECEVFDMLNGEELERIV KKHRPDIIVPEIEAIRTERLYDFEKEGIQVVPSARAVNYTMNRKAIRDLAAKELGLKTAK YYYAKSLEELKEAAEKIGFPCVVKPLMSSSGKGQSLVKSAAELEHAWEYGCNGSRGDIRE LIIEEFIKFDSEITLLTVTQKNGPTLFCPPIGHVQKGGDYRESFQPAHIDPAHLKEAEDM AEKVTRALTGAGLWGVEFFLSHENGVYFSELSPRPHDTGMVTLAGTQNLNEFELHLRAVL GLPIPGIKQERIGASAVILSPIASQERPQYRGMEEVTGEEDTYLRIFGKPYTRVNRRMGV VLCYAPNGSDLDALRDKAKRIADKVEVY >gi|226332019|gb|ACIB01000037.1| GENE 178 215110 - 216627 1608 505 aa, chain + ## HITS:1 COG:CAC2865 KEGG:ns NR:ns ## COG: CAC2865 COG0055 # Protein_GI_number: 15896119 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, beta subunit # Organism: Clostridium acetobutylicum # 1 501 1 465 466 614 65.0 1e-175 MSQIIGHISQVIGPVVDVYFEGTESDLILPSIHDALEIKRHNGKKLIVEVQQHIGENTVR TVAMDSTDGLQRGMKVFPTGGPITMPVGEQIKGRLMNVVGDSIDGMKELNRDGAYSIHRD PPKFEDLTTVQEVLFTGIKVIDLLEPYSKGGKIGLFGGAGVGKTVLIMELINNIAKKHNG FSVFAGVGERTREGNDLLREMIESGVIRYGEAFKESMEKGHWDLSKVDYNEVEKSQATLV FGQMNEPPGARASVALSGLTVAESFRDMGAKSGARDILFFIDNIFRFTQAGSEVSALLGR MPSAVGYQPTLATEMGAMQERITSTKTGSITSVQAVYVPADDLTDPAPATTFTHLDATTV LSRKITELGIYPAVDPLESTSRILDPHIVGQEHYDVAQRVKQILQRNKELQDIISILGME 
ELSDADRLVVNRARRVQRFLSQPFTVAEQFTGVPGAMVAIEDTIKGFKMILDGEVDYLPE PAFLNVGTIEEAIEKGKKLLEQANK >gi|226332019|gb|ACIB01000037.1| GENE 179 216639 - 216884 296 81 aa, chain + ## HITS:1 COG:HI0478 KEGG:ns NR:ns ## COG: HI0478 COG0355 # Protein_GI_number: 16272425 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, epsilon subunit (mitochondrial delta subunit) # Organism: Haemophilus influenzae # 1 71 1 71 142 62 38.0 2e-10 MKELHLNIVSPEKEVFNGEVKSVTLPGTSGVFSILPQHAPIVSSLQEGTVSYTTTDGEEH TLDIHSGFVELSNGEASVCVS >gi|226332019|gb|ACIB01000037.1| GENE 180 216918 - 217346 351 142 aa, chain + ## HITS:1 COG:no KEGG:BF2173 NR:ns ## KEGG: BF2173 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 213 100.0 2e-54 MNMSITKRNFLGYLSILTLVGGGLGALVLHYLEPGHYFGGYPLIPVYFYIFGVFYIYMFD ACRRHAPEKMVMLFLVAKVLKMIVSVFLLIIYCVAVPDSAIEFLLTFLAFYLGYLIYESW FFFVFEWNQKLKKKSKKYETVA >gi|226332019|gb|ACIB01000037.1| GENE 181 217330 - 218481 995 383 aa, chain + ## HITS:1 COG:BMEI1546 KEGG:ns NR:ns ## COG: BMEI1546 COG0356 # Protein_GI_number: 17987829 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit a # Organism: Brucella melitensis # 119 376 21 276 277 87 30.0 3e-17 MKQLRNIVAGMLVLIGGMLPATTFAQEPVPGDTTATLQQEIIVGKDTINQEANQVDVKGI VFGHIGDSYEWHITNIGKTSICIPLPIIVYSETSGWHAFLSSRLEENGGKYEGFYIAPAG SKYEGKVVERNATGEEVRPWDISITKVTLSLFINSAILLAIILSVAHWYRKREQGAYAPG GFIGFMEMFIMMVHDDVIKSCVGPNYKKFAPYLLTAFFFIFINNIMGLIPIFPGGANVTG NIAITLVLALFTFVIVNIFGTKHYWKDIFWPDVPWWLKVPIPMMPFIEFFGVFTKPFALM IRLFANMLSGHMAMLVLTCLIFISASMGPAINGSLTVASVLFNIFMNLLEVLVAFIQAYV FTMLSAVFIGLAQEGGKKEEVKE >gi|226332019|gb|ACIB01000037.1| GENE 182 218508 - 218765 445 85 aa, chain + ## HITS:1 COG:no KEGG:CFPG_368 NR:ns ## KEGG: CFPG_368 # Name: not_defined # Def: F-type ATP synthase C subunit # Organism: A.pseudotrichonymphae # Pathway: Oxidative phosphorylation [PATH:aps00190]; Metabolic pathways [PATH:aps01100] # 1 83 1 82 82 69 80.0 4e-11 
MLLSVLLQAAAAGVGLSKLGAALGAGLAVIGAGIGIGKIGGSAMEGIARQPEASGDIRMN MIIAAALVEGVALLALVVCLLVLFL >gi|226332019|gb|ACIB01000037.1| GENE 183 218781 - 219278 672 165 aa, chain + ## HITS:1 COG:lin2677 KEGG:ns NR:ns ## COG: lin2677 COG0711 # Protein_GI_number: 16801738 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit b # Organism: Listeria innocua # 9 149 15 155 170 61 32.0 7e-10 MSLLLPDSGLIFWMLLSFGIVFAVLAKYGFPVIIKMVEGRKTYIDESLEVAREANAQLSR LKEEGEAIVAAANKEQGRIMKEAMQEREKIIYEARKQAEIAAQKELDEVKRQIQIEKDEA IRDIRRQVALLSVDIAEKVIRKNLDDKQEQMGMIDRMLDEVLTKN >gi|226332019|gb|ACIB01000037.1| GENE 184 219288 - 219848 520 186 aa, chain + ## HITS:1 COG:sll1325 KEGG:ns NR:ns ## COG: sll1325 COG0712 # Protein_GI_number: 16329328 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) # Organism: Synechocystis # 6 174 10 177 185 60 25.0 2e-09 MEVGIISMRYAKALMAYAEERGAEERLYHELVTLAHSFRTVKGFCAVLDNPIVSVNEKFN LICTAADGDHKPSEEFIRFIRLVLKERRETYLQFMSLMYLDLYRKKKHIGVGKLITAVPV DKATEERIRQTAAHILHAYMELETVVDPSIEGGFVFDINDYRLDASIATQLKKVKQQFID KNRRIV >gi|226332019|gb|ACIB01000037.1| GENE 185 219862 - 221445 1773 527 aa, chain + ## HITS:1 COG:TM1612 KEGG:ns NR:ns ## COG: TM1612 COG0056 # Protein_GI_number: 15644360 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, alpha subunit # Organism: Thermotoga maritima # 5 517 3 496 503 580 58.0 1e-165 MSENIRVSEVSDILRQQLEGIETKVQLDEIGTVLQVSDGVVRIYGLRNAEANELLEFDNG IKAIVMNLEEDNVGAVLLGPTDKIKEGFTVKRTKRIASIRVGESMLGRVIDPLGEPLDGK GLIGGELYEMPLERKAPGVIYRQPVNQPLQTGLKAVDAMIPIGRGQRELIIGDRQTGKTS IAIDTIINQRSNYEAGDPVYCIYVAIGQKGSTVASIVNTLRQYGAMDYTIVVAATAGDPA ALQYFAPFAGAAIGEYFRDTGRHALVVYDDLSKQAVSYREVSLILRRPSGREAYPGDIFY LHSRLLERAAKIINQEEVAREMNDLPESLKGKVKGGGSLTALPIIETQAGDVSAYIPTNV ISITDGQIFLDTDLFNQGNRPAINVGISVSRVGGNAQIKAMKKVAGTLKIDQAQYRELEA FSKFSGDMDPVTALTIDKGQKNARLLVQPQYSPMPVEKQIAILYCGIHGLLRNVPLDKVE 
DFEAAFLNTLALDHQADVLDVLKTGVINDEVTKAIEETAAMVAKQYS >gi|226332019|gb|ACIB01000037.1| GENE 186 221467 - 222339 916 290 aa, chain + ## HITS:1 COG:BH3755 KEGG:ns NR:ns ## COG: BH3755 COG0224 # Protein_GI_number: 15616317 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, gamma subunit # Organism: Bacillus halodurans # 1 290 1 283 285 176 35.0 6e-44 MASLKEVKTRINSVQSTRKITSAMKMVASAKLHKAQGAIENMLPYQRKLNKILTNFLSAD LPVESLFCVERPVKRVAIVAFSSNSSLCGAFNANVLKMFLQTVGEYRELGQDNILIYPVG KKIEEAVKKLGFFPQGSYQKLADKPSYDEAAALAKLLMELFLEKNIDRVELIYHHFKSMG VQELLRERYLPIDLSAVQNDEERGGVVNDYIIEPSAAQLIADLIPQVLSQKIFTAALDSN ASEHAARTLAMQIATDNANELIQELTKQYNKTRQQAITNELLDIVGGSMA >gi|226332019|gb|ACIB01000037.1| GENE 187 222497 - 225031 1596 844 aa, chain + ## HITS:1 COG:SPBC887.14c KEGG:ns NR:ns ## COG: SPBC887.14c COG0507 # Protein_GI_number: 19113280 # Func_class: L Replication, recombination and repair # Function: ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member # Organism: Schizosaccharomyces pombe # 21 407 328 758 805 156 30.0 1e-37 MEENHEIELAWQVIENTGTHLFLTGKAGTGKTTFLRRLKELTPKRMVVVAPTGIAAINAG GVTIHSFFQLNFAPYIPESTFNSAQQGFHKFGKEKINIIRSIDLLVIDEISMVRADQLDA IDAVLRRYRDRSKPFGGVQLLMIGDLQQLAPVVKEEDWSLLSSYYDTAFFFGSHSLKETE YITIELKKVYRQSDTEFVGLLNKIREKEADDAVLEELNKRYLPGFRPREEEGYIRLTTHN YQAQQYNDRQLLSLSGRAFSFQAKVEGTFPESAYPADEMLTVKEGAQIMFIKNDSSGEHR YYNGMIGLVTAVSKDGIRVKGNGESQDFLLETEEWTNSKYSLNPQTKEITEEVEGTFRQY PIRLAWAITIHKSQGLTFERAIIDANASFAHGQVYVALSRCKSLQGLVLSSPLRRESIIS DDTIDEFTRNAGEMTPDKHKLALLRQHYFYELLCEQFDFHPIEQHFLRLLRLLDEHLYRL YPKLLERYKTTADLYKTQIMKVADTFKLQYSALLMGAEDYTANPKLNERVMAGAHYFRQH LEDLLTPLITSTKVETDNKELKKKFSEAADAMKTALHVKLGTLCYTEKEGFSVSAFLKQK AVLTLSVSGGEAASSSGRSERKSRTAEKIEVPTDILHPELYKQLIAWRNSEAAKAGLPVY TIIQQKAILGIVNLLPNDAASLIRIPYFGKRGAEKYGDALLEMVNRYVEEHGIERPQMPT ATLTVNNGIKTSKEPKPLKEAKSVKEPKPDTKEVTYRLFRQGKSIEEIAKERELVSGTIA GHLEHYVRSGEVKIEQLVAREKITKIIRYVQAHGSDKGLTVIKAALGDDVSYADIRLVLA AGIK >gi|226332019|gb|ACIB01000037.1| GENE 188 225189 - 225545 486 
118 aa, chain - ## HITS:1 COG:BB0061 KEGG:ns NR:ns ## COG: BB0061 COG0526 # Protein_GI_number: 15594407 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Borrelia burgdorferi # 4 115 3 113 117 100 38.0 6e-22 MKVIDLTKESFVEKVAEFQEYPNKWDFKGDKPCLVDFHAPWCVYCKALSPILDQLAVEYD GKIDIYKVDVDQEPELEAAFAIRTIPNLLLCPMGGKPSMKLGTMNKTQLKALIEEVLL >gi|226332019|gb|ACIB01000037.1| GENE 189 225778 - 227040 1153 420 aa, chain + ## HITS:1 COG:PA2522 KEGG:ns NR:ns ## COG: PA2522 COG1538 # Protein_GI_number: 15597718 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Pseudomonas aeruginosa # 26 347 41 353 428 66 23.0 1e-10 MIRKFFILFFLGFFGFAEAQQPSVGLTLKEAEQRFLKCNLSLLAERYNVDIAQARLLQAG LFDNPVISFEQNVYNRLNGKYFDFGKKGESVVEIEQVIRLAGQRNKQIRLEKINKEIAGY QFEEVMRTLRQELGEAFTEVFYLSKSLSVYDKEINSLEHLLTGIKEQHAKGNISLMEMAR LESMLLSLKKDKNECESNYLSRRGELNLLLNLPADFRTEPVIDEGDLRQLNMDRLSYADL QERVHGRPDQKLARSCVTASQADLKLQKALAFPEFAVKGSYDRQGNFINNYFAIGFSMSV PIFNRNQGNIKMARFNLLKADREQEYSRNKAEAELYAAYTALEKACQLYQSTDMGLEQNF EKLIAGANENFIKRNISLLEFIDFYDSYKETCIRLYEIKKNVLLGIENLNAVAGQPIFNY >gi|226332019|gb|ACIB01000037.1| GENE 190 227054 - 228145 746 363 aa, chain + ## HITS:1 COG:PA2521 KEGG:ns NR:ns ## COG: PA2521 COG0845 # Protein_GI_number: 15597717 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Pseudomonas aeruginosa # 42 360 158 475 484 114 24.0 4e-25 MNWTKYLPCLLILGMGSGCSSEVKHPGENQDLCLTDSLLKIVSVDTVHLHDVADELTLNG RVTFNQEQVAHVYPMFGGTVTELRAEVGDYVRKGDILAILRSGEVADYERQMKEAEQQVI IARRNVNATRDMFDSGLASDKDVLQARQELINAEAEEDRIKEIFSINNFSGRSFYEVKSP VSGFIVEKSVSRNMQLRPDQGEEIFTVSGLEHVWVMADVYESDISKVAEGASVHITTLAY PGKVFSGNIDKIYHMLNTESKTMNVRVKLCNEDYLLKPGMFTTVNVECKSSGKQMPRINA HALIFEGGKNYVVTVTPDNRLKVKEVDVYKRQNQECYVRSGLSEGDRVLNQNVLLVYNSL NAD 
>gi|226332019|gb|ACIB01000037.1| GENE 191 228161 - 231286 2682 1041 aa, chain + ## HITS:1 COG:RSp1040 KEGG:ns NR:ns ## COG: RSp1040 COG3696 # Protein_GI_number: 17549261 # Func_class: P Inorganic ion transport and metabolism # Function: Putative silver efflux pump # Organism: Ralstonia solanacearum # 5 1027 2 1018 1038 722 37.0 0 MHKFIDNIVAFSLKNKFFIFFCTAIAVIAGVVSFKHTPIDAFPDVTNTKVTIITQWAGRS AEEVEKFITIPVEIAMNSVQKKTDIRSTTLFGLSVINVLFEDHVDDFVARQQVYNLLNDA DLPDGVTPEVQPLYGPTGEIYRYTLRSDKRSVRELKTIQDWVIDRNLRAVSGVADIVSFG GEVKTFEVSVNPNQLINYGITSLELYDAIAKSNINVGGDVITKSSQAYVVRGIGLINDLE ELRNIVVKNINGTPILVKNLADVRESCLPRLGQVGRMDEDDVVEGIVVMRKGENPGEVIS GLKAKIDELNENILPSDVEIVPFYDREDLVDLAVHTVTHNLVEGILLVTFIVFIFMADWR TTAVVAVIIPLALLFAFICLHIMGMSANLLSMGAIDFGIIIDGAVVMVEGIFVALDRKAK EVGMPVFNRMSKMGLIRSTAKEKAKAVFFSKLIIITALIPIFSFQKVEGKMFSPLAYTLG FALLGALIFTLTLVPVMSSILLKKNVRERSNFLVHFIREKSAALFAIFHAHRKLSIGLAS LAGGVGLFLYSFLGMEFLPQLNEGAIYIRATLPQSISLDESVSLANRMRRELLAFPEVRQ VLSQTGRPNDGTDATGFYNVEFHVDIFPEKEWESGLSKGQLIEKMQSALALYPGTDFNFS QPITDNVEEAASGVKGSIAVKVFGKDLYEAEKLAVQIDKVLGTVQGIEDLGVIRNIGQPE LRIELNEKQLARYGVSKENVQSIIEMAIGGKSASLLYEDERKFNIMVRYNEEFRRNEEQI GKILVPAMDGTMIPIKELADIRTITGPLLIFRDSHARFCAVKFSVRGRAMGSAVAEAQKK VAASVHLPDGYTLKWTGDFENQQRASKRLAQVVPVSIAIIFVILFILFSNARDAGLVLLN VPFAAAGGIIALLITHFNFSISAGIGFIALFGICIQNGVIMISGIKSNVRSHIPLSEAVK NTVRSRVRPVVMTAAMAAIGLMPAALSHGIGSESQRPLAIVIIGGVLCDTFFTLFIFPLI VEVIYGKTLYDKEGKLRQRRV >gi|226332019|gb|ACIB01000037.1| GENE 192 231321 - 231995 813 224 aa, chain + ## HITS:1 COG:ECs0609 KEGG:ns NR:ns ## COG: ECs0609 COG0745 # Protein_GI_number: 15829863 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Escherichia coli O157:H7 # 3 222 2 221 227 191 46.0 1e-48 MAKILLVEDEVNIASFIERGLKEFGHSVSVANDGDAGWELIRQEAFDLLILDIIMPKMNG LELCRLYRQQYGYLTPVIMLTALGTTEDIVKGLDSGADDYLVKPFSFQELEARIKAILRR GREDSVQQLVCDDLVLNCNTRRARRKEVEIELTVKEYRLLEYFMTHQGMVLSRLTLLKDV 
WDKNFDTNTNVVDVYVNYLRGKIDKEHDKKLIHTVVGSGYIMYA >gi|226332019|gb|ACIB01000037.1| GENE 193 232013 - 233350 962 445 aa, chain + ## HITS:1 COG:RSp1043 KEGG:ns NR:ns ## COG: RSp1043 COG0642 # Protein_GI_number: 17549264 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Ralstonia solanacearum # 157 437 167 455 466 150 34.0 4e-36 MKIGRKIALFYTLVTVLTTMAVIGVFYLFSSRYIDGLYRSYLREKAFLTAQKHWERDEVD EQSYQTIQRKYDELLPEAKEILLDMDSDAVVRDTLNKYLSASQQKQLLESKEMSSVSFVY QDMLGAALYYPDNEGNFIVIIMSHNSYGVKIQEHLLLLSAFLVLCSSVLIFFIGQLYSTR ILIPLQHVLLQLKQIRGNSLNRRLKTTGNKDELDHLIETLNSMLDRIDTAFRAEKSFVSH ASHELNNPITAIQGECEISLLKERSTDEYIEALRRIAGESKRLSNLIRHLLFLSRQEEDI RKNNVEEIRLADLLEEAGAANPRIRLQYPEDAARYAVVSASPYLFKIALQNVIDNACKYS QGEVLIRLYKEDERWGIAVKDSGIGIPADEMELIFQSFYRGSNTREYAGQGIGLSLSMKI FSVYRGKVSIRSEEGKGTEVRVVFA >gi|226332019|gb|ACIB01000037.1| GENE 194 233534 - 234208 516 224 aa, chain + ## HITS:1 COG:CC1662 KEGG:ns NR:ns ## COG: CC1662 COG2095 # Protein_GI_number: 16125908 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Multiple antibiotic transporter # Organism: Caulobacter vibrioides # 4 218 9 218 228 76 28.0 3e-14 MSLSNLLIIFSSSFMALFPVVNPLGNGFVVNGFFTDLDPKQRKTAIRKLILNFIIIGVGT LVIGHLFLLMFGLAIPVIQLGGGILICKTAMELLGDSNSPDQEESSRNMDSLKWKNIEQK IFYPITFPISIGPGSISVIFTLMASASVKGKLLQTGINYFVIALVIVCMAAILYIFLSQG QRIIQKLGPVGNQIINKLVAFFTFCIGIQISVTGISQIFHLSIL >gi|226332019|gb|ACIB01000037.1| GENE 195 234398 - 236149 1403 583 aa, chain - ## HITS:1 COG:no KEGG:BF2190 NR:ns ## KEGG: BF2190 # Name: not_defined # Def: putative acetylhydrolase # Organism: B.fragilis # Pathway: not_defined # 1 583 1 583 583 1179 99.0 0 MIMKTTRLQLSLLALFLGCASLQAQYKWADPLKQDFHTVRGQAWQDELKDSYARLPQRAE DKVRKPLWDLSRQSAGLSVAFRSNASEIKVRYVVKGGLSMPHMPATGVSGIDLYATDNNG QERWCAGNYSMGDTIVYNFRGLSYAAKSGNGFEYQLFLPLYNSVSWMEIGVPADASFRFL PVSQEKPLVIYGTSIAQGACASRPGMAWGNILNRKLGHPVINLGFSGNGKLEEALFDLLS EIDARLYIIDCMPNLAGKEASAVVYQRTLEGVKKLREKSRAPILLVEHDGYSNEFSSESA 
EESYRVANAELRKAYETLQKEQVPAVYYLTKEEIGMPMDAMVDGVHSTDLGMQQYADSYR KKIGEILHEESEGPTSCIPCKQQRDPYDWYGRHEEILKLNKQSAPEVVMIGNSITHFWGG EPIAHNQFGTESWDKLFKGKRVRNLGFGWDKTENVLWRIYHGELDGFQAQNIFLLIGTNN LLFNTDDEVIEGICRVVKAIRERQPRTKLCVMGILPRKEMETRIAQIDAALQERLNGKDC TFINLAPQLTHKDGTIDHSLFRDGLHPNAEGYKRIAKVLKGYL >gi|226332019|gb|ACIB01000037.1| GENE 196 236338 - 237045 438 235 aa, chain - ## HITS:1 COG:slr1879 KEGG:ns NR:ns ## COG: slr1879 COG2243 # Protein_GI_number: 16330281 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-2 methylase # Organism: Synechocystis # 10 233 9 233 242 85 30.0 8e-17 MITTHPIRFVSLGPGEPDLITLKGLKALQGADCIFCPATMTQDGKSSSRALSILNTLGFS DTVQCFRLPMDKDRTLALRSYEAVYESSKILRAEGQNVVIVAEGDAGLYSSIHYIYDELQ QDDIPVEQIAGIPAFIASGAMAGLHIVSQEERLIVIPGHVTAKELDDYLKHQTVVVIMKL SQCIDEVHQCIINHLEYQYHYFENVGTEKEYYSCSTEELREKRYPYFSVMIIRFG >gi|226332019|gb|ACIB01000037.1| GENE 197 236927 - 237145 69 72 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565469|ref|ZP_04842924.1| ## NR: gi|253565469|ref|ZP_04842924.1| predicted protein [Bacteroides sp. 3_2_5] # 34 72 1 39 39 75 100.0 9e-13 MVAGQKMQSAPCSALSPFNVIRSGSPGPKDTKRMGCVVIICYRFIILMFCKSEENIPMKQ ELLSIFDRCNLN >gi|226332019|gb|ACIB01000037.1| GENE 198 237159 - 238307 1118 382 aa, chain + ## HITS:1 COG:alr4031 KEGG:ns NR:ns ## COG: alr4031 COG0614 # Protein_GI_number: 17231523 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Nostoc sp. 
PCC 7120 # 4 381 43 424 426 215 31.0 1e-55 MKQFLIGCALLACISLAGCRPKNNQNVTIVTGEVSDTSSVRITPVYAKGFKVTYTPTCRL VDISDPQKEGGESFHYALVPGGTKPENIPAGYTVIETPVKSVICMTSLQLSNFIKLDALN HVVGITSTRHLFNKEMNERLKQGRTAKIGIEGNFDNEVIMSVNPDVIFISPFKRGGYDAM REVGIPLVPHLGYKEMTPLGQAEWIKFIGLFIGEEETANRKFAAIEKHYNELKERVAHVK KRPVVFSGEIRGGNWYAVGGKSFLAQLFRDAGADYFLKDDPRSGGVTLDFETVYSQAESA DYWRIVNSFDGTFSYDVLKSEDPRYADFRAFREKGVIYCNMREKPFYESMPTQPEVVLED LIKAFHPDLLPDYTPVYYERLN >gi|226332019|gb|ACIB01000037.1| GENE 199 238316 - 239353 1091 345 aa, chain + ## HITS:1 COG:alr4032 KEGG:ns NR:ns ## COG: alr4032 COG0609 # Protein_GI_number: 17231524 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Nostoc sp. PCC 7120 # 20 338 36 354 362 250 47.0 3e-66 MKRPTIPLMLLIAASIFVFFLLNLLLGSVSIPIGSVWNILWGGTDESVIWQNIIWKSRVP QALTALVAGAGLSISGLQMQTVFRNPLAGPSVLGISSGASLGVAFVVLLSGAFGGVALSK LGYMGEVALSLAAVIGALSVMALIVYVSQKVKGNVTLLIIGVMIGYVASAVIGVLKYFSV EEDIRAYVIWGLGSFARVSGDQMTLFVCIMVVLIPLSFLLIKTMNLMLLGDGYARNLGLN IKRARLLVISCSGVLVAIVTAYCGPIMFIGLAVPHLCRAIFHTSDHRILMPATLLAGASL ALVCNLVARMPGFEGALPVNSVTALVGAPVVASVLFRKRKSELSE >gi|226332019|gb|ACIB01000037.1| GENE 200 239360 - 240388 1178 342 aa, chain + ## HITS:1 COG:alr4033 KEGG:ns NR:ns ## COG: alr4033 COG1120 # Protein_GI_number: 17231525 # Func_class: P Inorganic ion transport and metabolism; H Coenzyme transport and metabolism # Function: ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components # Organism: Nostoc sp. 
PCC 7120 # 10 293 6 290 333 261 46.0 1e-69 MKQKTIHIENLSIGYLGKTDVKVVADRINAGINCGELTCLLGANGVGKSTLLRTLSAFQP KLGGKIEIVGKEIDAYTDKELSTVISVVLTEKCDIRNMTVHELVGLGRSPYTGFWGTLRG EDKEVVERSIALVKIQNLAHRMVHTLSDGERQKVMIAKALAQETPVIFLDEPTAFLDFPS KVEMMQLLHRLSRQTNKTIFLSTHDLELALQIADKIWLMDKMNGVTIGTPEDLSLSGKLS SFFARKGIVFDLETGLFRVDNEYTSQIRLVGHGQKYAMVRKALQRNGILANRTVESDTYI ETGDLKDGNGFILHPQEGEAVTLNSIEELLERLQAGSAERAV >gi|226332019|gb|ACIB01000037.1| GENE 201 240522 - 241412 713 296 aa, chain - ## HITS:1 COG:alr4490 KEGG:ns NR:ns ## COG: alr4490 COG1091 # Protein_GI_number: 17231982 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose reductase # Organism: Nostoc sp. PCC 7120 # 4 284 5 275 294 133 32.0 3e-31 MKNILIIGANGFTGRRILNDLSVNPIYHVTGCSLRDDICPGKDYRFVRTDIRDENEVRKL FKECRPDIVINTSALSVPDYCETHHAEAEATNVTAVETIAHVCEQYGSRFIHLSTDFVFD GKSTRLYKEEDEAIPVNYYGVTKLKAEKIIASICSNYAIVRVVVVYGKALPGQHGNILQL VANRLRNGETIRVVSDQWRTPTFVGDISVGVEKLMFHTANGIYHICGSECLTIAEIAYRV ADFLKLDRSLIEPVTTEEMKEVTPRPRFSGLSIEKAKAEIGYTPRTLEEGMEASLF >gi|226332019|gb|ACIB01000037.1| GENE 202 241425 - 241958 464 177 aa, chain - ## HITS:1 COG:no KEGG:BF2196 NR:ns ## KEGG: BF2196 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 177 1 177 177 318 100.0 5e-86 MKKLIFFLLLLLSTASLSAQSEQPSDSIRHLPSTVKQYGDFLIDMGLFIAAPPKLPKYKF ELPDASKDYNRIFSLNPDVIMTQGLSNVFTPSLSYGFGWGGHDFFSSPQQLQMGSFKLKN GMRLNTYGEYNADGKKVPNPAAMPWEKNNFKGAFEMKSSDGNFGIRIEVQQGRNYPY >gi|226332019|gb|ACIB01000037.1| GENE 203 242022 - 242810 569 262 aa, chain - ## HITS:1 COG:no KEGG:BF2197 NR:ns ## KEGG: BF2197 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 262 2 263 263 535 99.0 1e-151 MKNRIYCTFILAIITVMVCAQTEQPKDSIKGHRETVPEIRQGLHINDRNIQTDGTLPDNT RQAAAGDSMTFRIPYPESIPSGWIVPQLGSYSNPFIWDYDRYYNFNLSPNSNLSTFSNYN TYLSIGTIVKAGAAYSFSPNDRWIFSGGVFVAKYTLPSLRIPTAPMAGSRFDAGIHGKIT YRLTDHLYLNVFGQYSLNGQRNSKKGYMVPDLYMQNHFGGTVDYMFNGKFGITGGAIHEF NPVKGRWETNPVFGPVIHLKKK 
>gi|226332019|gb|ACIB01000037.1| GENE 204 242823 - 243518 697 231 aa, chain - ## HITS:1 COG:MT1062 KEGG:ns NR:ns ## COG: MT1062 COG0745 # Protein_GI_number: 15840463 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Mycobacterium tuberculosis CDC1551 # 5 228 51 272 276 147 35.0 2e-35 MKEKIKVLLAEDELTLAMIIKDTLEEDDFILTIAANGEEGLRMFFDIRPDVLVADVMMPR MDGFEMVRRIRQTDKQTPVLFLTARSAINDVVEGFELGANDYLKKPFGMQELIIRIKALV GKAFSFNEEKKTTTRFEIGNYLFDSLAQTLTHAGLKQELSHRESEILKRLCENQNQVVNT QNVLLDLWGDDNFFNSRSLHVFITKLRHKLSADERIRIVNVRGIGYKLILN >gi|226332019|gb|ACIB01000037.1| GENE 205 243515 - 245092 1045 525 aa, chain - ## HITS:1 COG:BS_phoR_3 KEGG:ns NR:ns ## COG: BS_phoR_3 COG0642 # Protein_GI_number: 16079962 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Bacillus subtilis # 293 525 41 271 279 129 35.0 2e-29 MKLPIKHIAILVIASLMAIFAYQAYWLVSMYHTMKSDMEHSIIEAMRTSDYNEMMLRIER MRRDDQDHGEISVSAGYNDEGGTLVRSSTMVHKEKIPGSTGIQQDSILKTERKLDTLIVV RNDQKEIKIVPQDSTSQEVKAAMLQTKGGLDILLKDQNTMLELATYFQRGLHSGLDVIMD PDFQSYDSLLTLSLQERGISLPYRIEYLHFGNTPDSSLLFTDTLGMSGTVNYIPGPDAHT YDYTFDIHSHSLYRLRMDSVAGVIVHQMAGILITSFIILLILGFSFWFLIRTLLKQKTLE EMKSDFTNNITHELKTPIAVAYAANDALLNFGQADEKAKRDKYLSISQEQLQRLSGLVEQ ILTMSMEQRKTFRLRPEEITLATLLESLTEQHKLKAQNPIDIDYRVEPANLTVFADRTHF SNILSNLIDNAVKYSPGKAVIRIHCRETADNGKVEISVSDEGTGIAQEKQKHIFDKFYRV PTGNLHNVKGYGLGLYYVKTMIEKHGGTVCVKSEPGHGSTFTITL >gi|226332019|gb|ACIB01000037.1| GENE 206 245250 - 247367 1292 705 aa, chain + ## HITS:1 COG:no KEGG:BF2200 NR:ns ## KEGG: BF2200 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 705 1 705 705 1415 99.0 0 MKRIYLTFVFAGAVCFQVAAQNVDEDRILSADSLITMDGRLDSLYRSLPEVMVVGERPVV KAMQGKLVYDLPRLIQNFAVDNAYDAIKELPGVSEMNDNLTLGGKSVTVVLDGKVTTMTQ EQLNTLLKSIPAGRIEKAEVMYSAPARYQVRGAMINICLKQGDSGKSTLQGEFFSDYRQR 
HYEYLTERASLLYSGHKFSADFLYSYSHGRNYFLTDKEALHALSDGTIHPISTNESQRSR SNRHNLRLAGDYVFAANHQLSVTYNAQFVNGFNLSTVDGTQISNARTRMTDQLHNARLDY HMPFGWKAGVEYTYYHSPSSQLLHSRMGSDELDFRVKDSQRINRWKLFLSGEHDLGNGWG MNYGAVYTTSLDNSHQYYHDPETDEIISGNSNMKSRRREQTLNFYAGFNKAFGDKLSLDA SLAAEQFHTTVWNEWSLYPTFNINYAPAPGNVWQLSLSSDKSYPDYWATQDAVSYMGGGY SEIHGNPYLKPEINYQLQLTYVLNSKYMFNAWYSHTKDNATQTLYQSPERLVEIYKYFNF DFEQQAGVQAVVPFAIGRWLKSRFTLTGVYDRQKDSDFWDIPFDRKAYYAMLNMNHTVTL FSHPDIKLIVSGMIRSKAIQGIYDLPASGNLDIALRYGFANGKALLTLRCNDLLETGQIS PRIYYAMQNVTNHYSAFREFGVSLTYKFGGYKEKKREGVDTSRFK >gi|226332019|gb|ACIB01000037.1| GENE 207 247394 - 249403 836 669 aa, chain - ## HITS:1 COG:rcsC_1 KEGG:ns NR:ns ## COG: rcsC_1 COG0642 # Protein_GI_number: 16130155 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Escherichia coli K12 # 361 668 370 674 700 114 29.0 8e-25 MMRYIILLVLCAWTGIALAIDNHAKDSLLLQLKQTTIAVERINIYRNLADICFETPDEKT YLLNMYREAQKAGDTSGMLNALNDLVCGETKEYRMDSAYHYMELIKAIREPQETAPVLAY LQMRFFDTLCSHNETEEAITKELQFIEEKKSDEASLYNKIVQAYITGGSLHYNDMIKEAL PYLETAMNLSKQLPPEKQVLFTSTIVWKLSNTYGILNRCDSAIRILDENLQTQQQYYKDH YQAQRPFYNMAIHELHFYTSLLANAILEAPEKMDYYWQQIVKLNKKLTNPYDRYNYFLSM NNYYLNQKPQPHYEKALMANDSLIQIAQEIIPNNLPGLYDIQSQTYEAMGNFKEALAYLR IATQYKDSLTTENMQKQLGELQIKYEVNKLNNEKSQLEIKNKRILVICLSIILIIVIFVC LYLYHDLKKEKKMKFHLRNLKQKAEESEKMKTAFINSICHEIRTPLNSIVGFSDLIFNDE IDKETREGFSQEIQKSTILLTSLIDNMLEISSLDVSQEKLPCKEANLNSICIQEMTLLNR SHKPDIDYRIDLPEHPIILTTHEKYLSLVIEHLLNNANKFTEKGIITLHCHIDEARQQVH ISVTDTGCGIPADKHKEVFERFSKLNAFTPGNGLGLYLCQLIIRRLSGKISIDPTYTGGT RITVILPVQ >gi|226332019|gb|ACIB01000037.1| GENE 208 249572 - 251239 1357 555 aa, chain + ## HITS:1 COG:SP1229 KEGG:ns NR:ns ## COG: SP1229 COG2759 # Protein_GI_number: 15901091 # Func_class: F Nucleotide transport and metabolism # Function: Formyltetrahydrofolate synthetase # Organism: Streptococcus pneumoniae TIGR4 # 1 554 1 555 556 571 54.0 1e-162 MKSDIEIARSVELKKIKQVAESIGIPRDEVENYGRYIAKIPEYLIDEEKVKKSNLILVTA 
ITATKAGIGKTTVSIGLALGLNKIGKKAIVALREPSLGPCFGMKGGAAGGGYAQVLPMEK INLHFTGDFHAITSAHNMISALLDNYLYQNQSKGFGLKEILWRRVLDVNDRSLRNIVVGL GPKTNGITQESGFDITPASEIMAILCLSKDVDDLRRRIENILLGYTYDNKPFTVKDLGVA GAITVLLKDAIHPNLVQTTEGTAAFVHGGPFANIAHGCNSILATKMAMTFGDYVITEAGF GADLGAEKFYNIKCRKSGLQPRLTVIVATAQGLKMHGGVSLDRIKEPNLEGLREGLRNLD KHVRNLRSFGQTVIVAFNKFASDTDEEMELLREHCEQLGVGYAINNAFSEGGEGAVDLAN LVVETIENKPSEPLQFTYNDEDSVQQKIEKVATNLYGASVVTYSTLTRNKIKLIEEMGIG HYPVCIAKTQYSFSADPKVYGAVDNFELHIKDIVINNGAEMIVAIAGEIMRMPGLPKEPQ ALHIDIVDGNIEGLS >gi|226332019|gb|ACIB01000037.1| GENE 209 251535 - 252227 498 230 aa, chain - ## HITS:1 COG:no KEGG:BF2203 NR:ns ## KEGG: BF2203 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 230 7 236 236 453 99.0 1e-126 MVAATLLVTAGAMAQNQDCAFFFPNQEGEQITRNCYTADGKLTNILVYRVDQAYEYPSGM EVVANYTFADAAGKTLNSGQMVARCSDGNFSMSMGDVATFPTALNMMNADVYMMGDLMNY PDAFSNPMNPGDDDEFDDGTLRLYQKGNKNNRAEISVFDREFVTTETINTPAGAFYCTKV KYEMNIWTPKETIKGYGYEWYAPNIGIVRSEQYNNKKELQSYSVLERIKK >gi|226332019|gb|ACIB01000037.1| GENE 210 252418 - 253698 1334 426 aa, chain - ## HITS:1 COG:aq_479 KEGG:ns NR:ns ## COG: aq_479 COG0112 # Protein_GI_number: 15605959 # Func_class: E Amino acid transport and metabolism # Function: Glycine/serine hydroxymethyltransferase # Organism: Aquifex aeolicus # 1 424 5 410 428 476 57.0 1e-134 MKRDDLIFDIIEKEHQRQLKGIELIASENFVSDQVMEAMGSCLTNKYAEGYPGKRYYGGC EVVDQSEQIAIDRLKEIFGAEWANVQPHSGAQANAAVFLAVLNPGDKFMGLNLAHGGHLS HGSLVNTSGIIYTPCEYNLKQETGRVDYDQMEEVALREKPKMIIGGGSAYSREWDYKRMR EIADKVGAILMIDMAHPAGLIAAGLLDNPVKYAHIVTSTTHKTLRGPRGGVIMMGKDFPN PWGKKTPKGEIKMMSQLLDSAVFPGIQGGPLEHVIAAKAVAFGECLQPEYKEYQKQVQKN AAVLAQALIDRGFTIVSGGTDNHSMLVDLRSKYPTLTGKVAEKALVSADITVNKNMVPFD SRSAFQTSGIRLGTPAITTRGAKEDLMLEIAEMIETVLSNVENEEVIAQVRARVNKTMEK YPIFAY >gi|226332019|gb|ACIB01000037.1| GENE 211 253806 - 254558 366 250 aa, chain - ## HITS:1 COG:no KEGG:BF2205 NR:ns ## KEGG: BF2205 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 9 250 1 242 
242 481 99.0 1e-134 MVSRKENKVKRIVLLFISIWIAFAAYGEDNHKKERHPDDRKINVGIRGGFNSSMFLVSDL KIKDVTIDEIQNNYKIGYFGALFMRLNMKRHFIQPEISYNISRCEISFDKLGSQHPDIEP DYASVSSTIHSIDFPLLYGYHVVKQGPYTMSLFAGPKLRYLWNKKNKITFENFDQQGIHE KLYPFNVSAVIGVSVNISRIFFDFRYEQGLHNLSKSVTYDNIDMEGRPEVSNITFRRRDN VLSFSLGVIF >gi|226332019|gb|ACIB01000037.1| GENE 212 254531 - 255100 413 189 aa, chain - ## HITS:1 COG:FN1468 KEGG:ns NR:ns ## COG: FN1468 COG1853 # Protein_GI_number: 19704800 # Func_class: R General function prediction only # Function: Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family # Organism: Fusobacterium nucleatum # 19 184 20 185 197 153 46.0 2e-37 MKQDWKPGTLIYPLPAVLVSCGSDESEYNILTVAWTGTICTNPPMCYISVRPERHSYPII KKNMEFVINLTTRDMAFATDWCGVRSGKDYHKFEEMKLTPGKCSVVNAPLIEESPLCIEC RVKEIVSLGSHDMFIADVVNIRADDRHLNRETGKLELAEANPLVYVHGGYYNLGEKIGKF GWSVEKKTK >gi|226332019|gb|ACIB01000037.1| GENE 213 255186 - 255647 472 153 aa, chain - ## HITS:1 COG:PAB1499 KEGG:ns NR:ns ## COG: PAB1499 COG1781 # Protein_GI_number: 14521525 # Func_class: F Nucleotide transport and metabolism # Function: Aspartate carbamoyltransferase, regulatory subunit # Organism: Pyrococcus abyssi # 8 150 4 148 152 136 46.0 1e-32 MSENKQALQVAALKNGTVIDHIPSEKLFTVVSLLGLEHMTTNITIGFNLDSKKLGKKGII KIADKFFCDEEINRISVVAPHVKLNIIRDYEVVEKKEVRMPDELKAIVKCANPKCITNNE PMATLFHVIDKDNCVIKCHYCEKEQKREDITII >gi|226332019|gb|ACIB01000037.1| GENE 214 255652 - 256578 864 308 aa, chain - ## HITS:1 COG:VC2510 KEGG:ns NR:ns ## COG: VC2510 COG0540 # Protein_GI_number: 15642506 # Func_class: F Nucleotide transport and metabolism # Function: Aspartate carbamoyltransferase, catalytic chain # Organism: Vibrio cholerae # 4 304 29 330 330 320 55.0 2e-87 MENRSLVTIAEHSREKILYMLEMAKQFEKNPNRRLLEGKVVATLFFEPSTRTRLSFETAA NRLGARVIGFSDPKATSSSKGETLKDTIMMVSNYADVIVMRHYLEGAARYASEVAPVPIV NAGDGANQHPSQTMLDLYSIYKTQGTLENLNIYLVGDLKYGRTVHSLLMAMRHFNPTFHF IAPEELKMPEEYKIYCKEHNIKYVEHTDFNEEVIKDADILYMTRVQRERFTDLMEYERVK 
NVYILKAKMLENTRSNLRILHPLPRVNEIAYDVDDSPKAYYFQQAQNGLYARQAILCDVL GITLQDIL >gi|226332019|gb|ACIB01000037.1| GENE 215 257010 - 257288 339 92 aa, chain + ## HITS:1 COG:no KEGG:BF2209 NR:ns ## KEGG: BF2209 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 92 1 92 92 116 100.0 3e-25 MFSIISIMFLGIGIGYLLRNLKFLEKVEKSTSLTIFLLLFVLGLSIGSNSLIVNNLGKFG WQAIVLATSSILGSMLASFLVLRLFFKKGGKL >gi|226332019|gb|ACIB01000037.1| GENE 216 257285 - 257905 388 206 aa, chain + ## HITS:1 COG:FN1083 KEGG:ns NR:ns ## COG: FN1083 COG2431 # Protein_GI_number: 19704418 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 5 206 2 195 198 86 32.0 3e-17 MKSSLIVVAFFCIGCLVGIFNDFQFDMHNLSMYILYALMLQVGISIGSNKNLKFLIKSLR PNMLLVPIATIVGTLLFSAFASLLLSQWSVFDCMAVGSGFAYYSLSSILITQFKEASVGL QLATELGTIALLANIFREMMALLGAPLIRKYFGKLAPISAAGVNSMDVLLPSITLYSGKD MIPVAIFHGILIDMSVPFFVSLFCSL >gi|226332019|gb|ACIB01000037.1| GENE 217 257906 - 258823 595 305 aa, chain - ## HITS:1 COG:no KEGG:BF2211 NR:ns ## KEGG: BF2211 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 305 5 309 309 587 96.0 1e-166 MFTCKIEAGFRVFIFIILSFVLTNCIREDMSGCPRDDGGLALKFSYDTNADSRQSTAQGI DRLAVFIFDEKGLFISQVNDSMTSINDDYVMELPYKQGSYQFVAWGGYDKSTYQTSECVP GKTYIDDFFLSVKRQEDNRVINQPKLLYHGMHDIVGLNSKEKTIVLINLKQMTNHIRVIA HNLNQDRSDNIYIEDNNGKYGYDSQFSDDDQISYIPIYEASSEQSNPLIADFNVMRLEKD REPRLRIADKTGTIRYDENLIGKLIGGNPNIDFEHNHDFTIEISFDNYIPVIIKINGWEI VNEEI >gi|226332019|gb|ACIB01000037.1| GENE 218 258847 - 260226 849 459 aa, chain - ## HITS:1 COG:no KEGG:BF2265 NR:ns ## KEGG: BF2265 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 459 1 459 459 825 97.0 0 MKTSKIFMTTLVALTMAACSNDNDWVDQPSNPDVIASDAYASFSISIPHASKTRAVSTDP GIAAENTVKSLHVFIYDAESPNTPTVAEFTVAGGTLTQKPAGSSTWVTSQPVSTKKTDKY IFAGVNLNAEIVNYITSNGLGAFSYKDFTQEITKLADQTNGFVMFNSAYPTLTPATDLYE 
KKSEAENNHITISVNRVTAKAAVFQSQSFVVNGGGTMTDLKFGWRNLNKKFYFIQDNRDA LIKDYNWANYTAADFTRGTDAINVYASADVPTSFSYATENAFQYISGTSNVDAATFISVS GVFTPTNIISAKINPPTVAADFEIIVNPKPSDKTFFVVRTADGVANYFIDGPTAEKFAEL CAANTPQMPSINGTYLLSENTYSNGLCYYHIFVNGDAVTPQAPYNIYRNQYFKININSIQ APGNPSDNFDSGEPIKPNSWIGVDIQIIPWEVIEEDHDL >gi|226332019|gb|ACIB01000037.1| GENE 219 260461 - 261423 578 320 aa, chain + ## HITS:1 COG:no KEGG:BF2214 NR:ns ## KEGG: BF2214 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 320 1 320 320 546 98.0 1e-154 MIMRNINTDLIDAMKIYLPKGNNLANTLMDILYLGKEATYRRLRGEVPFTFAEVATISQH MGISLDKIVGADLNDNAIVNLNMLQCQRPAETYYSIIDSYIKLFGQLIERESSERSTSSN TVPQTLYLKYEALSKFQLFKWIYQHESTYAGRHYEDLEIPEKLIDKQKEFVNLSQLFQTT NYIWDKEIFIRLVNEIKFFLNINLISEDSVKRIKKELLILLNELEKISAQGKYSSGKDVK IYISDINFESTYSYVETDIYHQCLIGVFSINSITSKDDLLFQHLKVWIQSLKKYSTLISQ SGEVQRIHFFNRQQELVKSL >gi|226332019|gb|ACIB01000037.1| GENE 220 261615 - 263774 1068 719 aa, chain + ## HITS:1 COG:no KEGG:BF2267 NR:ns ## KEGG: BF2267 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 719 5 723 723 1332 98.0 0 MVILITGLLYMSCTDSMEVPGDADRNNTEDVSVRLVITIPASTTYARTRGTFATTDHESK ISEIQVLVFEEGKYKYRVPGISINNTSSAASFKALLKSSSSPLKLLILANATDAVIANEP SVDDSEDLVKKNINLRFNNITSDFPMYGEYELPGGLEATVINNITGIKMLRSIARVDVKA TEVANFKLSGVKAYRANDHLQIIPDETGVVRVTLPSVPAGSTGNVNSVLYPVSAENLNEF SAQLYLPEADSPAPDNRVSQATCIVVEGYYEGSDQPGYYRMDFDPDNVENAFGQVLRNHK YIFNIKKVSGPGWGSPDEAANNRSAHIVAEVQAWDDYTIDMSFDGEHHFGVSTREIVLKN KAGSAGIINVSTDLPDYTLQWADAAGTPTGTGSQSLANEYFTVTKAQNGSQLVITALQSN STNDTSRIQNFVITAHRWRILVNIQQKYDVAAYQTIHLLTFNAGLGYLGTNIIGSGSAEA RATGLRGILNNQNNFGPTGTVECGGYNLIGVNANYNNLTDALFASFDVVYVHYMGNLLFG TSDAQKAHNWVKSKKNRVLIVSYDALDVSQNLLKEILGGNNGISFLTSNTGPYPLATPEN GNSYFSTTGPFTSAPYTPINTDFSLRNYDAYHGEIQLNTEASKGITPILMGPAGGIVLGI DYSRRIVYWGDTDISSSASSSTSTTDNRINNNSGTINNNASKLIANVFAWIVETVLYGE >gi|226332019|gb|ACIB01000037.1| GENE 221 263778 - 264728 878 316 aa, chain - ## HITS:1 COG:MT0820 KEGG:ns NR:ns ## COG: MT0820 COG2837 # 
Protein_GI_number: 15840211 # Func_class: P Inorganic ion transport and metabolism # Function: Predicted iron-dependent peroxidase # Organism: Mycobacterium tuberculosis CDC1551 # 13 311 8 306 335 280 47.0 2e-75 MNSHQELFGGNIPQDVTGKQGENAIFIVYGLKQSDNTIKQVKDLCANFSALIRSMRNRFP EMQFSCTIGFGADAWKQLFPEQGNPKELKTFETIKGAKYIAVSTPGDILLHIRAKQMGLC FEFASIIDEKLKGVVDSIDETHGFRYMDGKAIIGFVDGTENPAVDENPYHFAVIGDEDPD FIGGSYVFVQKYIHDMTTWNSLPVEAQEKVIGRHKYNDVELSDEEKPQNAHNAVTNIGDD LKIVRANMPFANTSKGEYGTYFIGYASTFSTTHKMLENMFIGDPVGNTDRLLDFSTPITG TLFFAPSYDLLAKLGE >gi|226332019|gb|ACIB01000037.1| GENE 222 264949 - 267033 1583 694 aa, chain - ## HITS:1 COG:no KEGG:BF2218 NR:ns ## KEGG: BF2218 # Name: not_defined # Def: putative outer membrane protein involved in nutrient binding # Organism: B.fragilis # Pathway: not_defined # 1 694 1 694 694 1395 99.0 0 MKIKLKHIYFCSLIGMGGLALTSCNDFLDRSPISDITPEDYFNTVDQVGSYVINYYDDYL ENSNTTKMYHQRAWNSGVVRNDANTDNLLSDDGNLDYFAGNKQVPEGKNIQEPLNRIRVW NYLFEKVLPKEKEGTIPGDAELLKQYIGEAYFFRALAYYNALVRFGDYPIITEVLPDDSE TLIKKSQRAPRNEVARFILKDLDEAVSRLKERGFQNNQRINKQAALVLKSRVALFEATFE KYHQGTGRIPGDPTWPGAAMSYNSGKTFDIAGEINFFLTEAMQAAVAVADHVQLAENSHV MNPPYNTLYGWNPYFEMFSQPDLSNVEEVLLWKQYNLSLTVSHCVGARLKNGDRTGLTRS LIKTFLMKDGLPIYASNSTIDDRTVSDEKKDRDERLQLFVWGEKDAWMTDERADTVKNYN KDQAGNSVTNPVPVPWVKSTVISDQEQTRDITGYRSRKFYPYDDEQSKSDELLGTNACPI FRASEAYLNYIEACYEKNGTLDSKAQEYWKAIRRRAGVDEDYQKTIARTDLGREDDLGVY SGDRMVDATLYNIRRERRCEFIAEGMRWDDLKRWRSWDRLFTEPYIVEGINFWDEAYKLY TTVDKDGNEKSAVVADGSTSANMSPKSDGKYVRPLRRTQTNNQLYDGYTWKKAYYLEPLG LQDLQLSATNPEDINTSMMYQNPYWPAGTGKALE >gi|226332019|gb|ACIB01000037.1| GENE 223 267050 - 270319 2496 1089 aa, chain - ## HITS:1 COG:no KEGG:BF2270 NR:ns ## KEGG: BF2270 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1089 1 1089 1089 2179 99.0 0 MTKKSNLSLRFTRVWEKHCLKTAAATLLLCCVLPVQAKADTYKTQETASIQQQTVKTTGV IIDNTGEPLIGVSVKVQGTNTGTITDLDGKFSIDTRPKALLEFSFIGYKTITMEVTGKEL HITMQEDSKQLDEVVVVGYGSQKKVNVTGSVSMVNADVLESRPVQNVSQALQGVIPGLNM 
SVGSSGGTLDGKLNVNIRGAGTISDGSSSSPLVLIDGIEGDMNTVNPNDIESVSVLKDAA SSSIYGARAAFGVILITTKSGKSGKTRVNYSGNVRFSDAIQLPDMVDSYTFAQYFNRAST NGGESPTFDEKALQNILDFQNGKFTDPSTPEYYGVEAGPDGKWKSYAGSFANTDWFKEFY KSWTPSTEHNLSISGGTEKLTYMISGSFLNQNGLIRHGEDNFNRYTMNAKISAKPAEWVT LNYTSKWTREDYDRPTYMTGAFFHNIARRWPTCAPMDPNGHYMPNMEIIQLEEGGVQTSQ RNWYTNQLQAIFEPVKDWRIVVEGSMRTYTRKQHWAVLPIYGYDVNNKPYLLSWNGGAAG YSEVQDEREDEDYFSGNIYSDYAKTIGNHYFKVMGGFNAELFRPSGMTGFGTDLISSNVP SLGLTQDNQKANAWARERAIAGFFGRVNYNYKERYMLEANLRYDGSSRFVGDKRWGLFPS FSAGWNIAREDFFRPLAGVIGTLKLRGSWGQLGNNNTDKANAWYPFYQNMITGSANSGWL IDSKKQNTAQLPGIVNSLMTWETIESWDIGLDFGLLDNRLTGSVGYYNRYTYDMIGPAPI LPPVLGALPPQVNNCDMKSYGWELELSWRDRISEFDYSARFVLSDGKRKILRYPNPTNSL SSDVYYNGQILGDIWGYKTIGIAQTQEEMNAHLANGGTPNWGTNWGAGDVMYANLDGKEG VNNGSNTLEDHGDLTIIGNNTPRYNFGLTLTGAWKGFDFSVFLQGVMKRDYWLDGPYFWG ANGGLWQSTAFKEHMDYWRPEGDPLGANTNAYYPKPYFNTDKNQKVQSGYLQNAAYCRLK NAQIGYTLPKTWTRKAAMESVRVYVSGDNLVTFSGISGVFDPELLGSDWGDGKLYPLQRT ISIGLNVNF >gi|226332019|gb|ACIB01000037.1| GENE 224 270656 - 271144 430 162 aa, chain + ## HITS:1 COG:no KEGG:BF2271 NR:ns ## KEGG: BF2271 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 162 32 193 193 304 99.0 7e-82 MKKLKFNLLTGLLLLISVSVFAAGFPPPDKVQETFQKMYPKVTTVDWQRKGDYHIADIRV DGRELNVWFSDKGKWLMTEVDVETLEAVPAAVAKAFMQSTMASMQLEDVRIITFPKQPAV IVIEVEEYNTDSEFQLFYAPDGKLLQTLNVSDTGGEIYPGLF >gi|226332019|gb|ACIB01000037.1| GENE 225 271232 - 272548 350 438 aa, chain - ## HITS:1 COG:no KEGG:BF2221 NR:ns ## KEGG: BF2221 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 438 5 442 442 933 99.0 0 MVIHSGLALPGMSSLMNYMQIVATENTIYTSPDASKIFCYTPSILVTPTGRLIVSFDLGG EGVKSIEGHKSSRAGGSRFGQGKIFISDDNGQKWTFVQNFPFWHARLFTIGNSIYLIGHA GDICIMKSEDNGESWSDTYFLTHGEKWHSSACNVLFSNGNVYLAMEQRCRLNEVTGWDVA GLSPTLFRACVEDNLCLASSWSRSEKFIYKEVFDGAKLDFFGIPFYDCETNKPKEIATGI NNAPLGWLEANVVKFVDKDHIWHTDLKEVFHLFLRAHTGGVNYAHLFKIEIQDDQSMIPS LEHTPSGQKISYIPFPGGHLKFFIIYDELTRFYWLVSNQATDSMRRVSSLSNIKRYGLPN 
NERHRLQLHFSRNCVDWCFAGMVACSTNELYSRNYPSAVIKGDDLHIVCRSADEHALNPQ YNNMITHHIVSNFRQLIY >gi|226332019|gb|ACIB01000037.1| GENE 226 272475 - 273776 672 433 aa, chain - ## HITS:1 COG:no KEGG:BF2273 NR:ns ## KEGG: BF2273 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 433 1 433 433 852 99.0 0 MKMEDIAISITGYSYSNIKETIPDGVDKEEIAAVYEEIIDEYLQKGIPREIPALINVSGV PGAGKSTFCKKLLAMPENSSAIYIGFDAIMENERLPYIREEVNHAEEAFKRWELSARIAG YELLKRAIENKYLIIFDHSSALSQHIDLFNLLLSEGYEVHFNFIFIPEEEARRRAKNRKR YIPPYYIEERSKTLQYLLPEYKRICTTFKQIEPMRTRLIIARHGNTFRPEETPTRVGAKT DLPLVEEFKGRSIGRYLKEHDIIPDVIYAAPLLRTMQTARLAVQTIGLDSDISSLNAFVE IDYGVDENKTEEEVRLRLGNGNIEKGKKIIEDWDKNAVVPDGWKVDPDQIIHTWLDFAEK TVIPHQTTLLVTSNGIIRFAPYLTGDFEKFAQEHKIKVAPGGLCIFDKNDGDSFWTCSAW NVKPYELYADSRY >gi|226332019|gb|ACIB01000037.1| GENE 227 273773 - 274603 465 276 aa, chain - ## HITS:1 COG:HI0058 KEGG:ns NR:ns ## COG: HI0058 COG1212 # Protein_GI_number: 16272032 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: CMP-2-keto-3-deoxyoctulosonic acid synthetase # Organism: Haemophilus influenzae # 8 266 5 252 254 133 35.0 4e-31 MKKNKILIIIPSRFASTRLPEKPLVKIAGKEMVLRVAEIANYVCNKVEGCNYIVATDHEK IVNFCKENNIAVMMTSENCKSGTERCWDVTTKIAEKPDFIVNLQGDNPLCPPWFIEQLIE AWKNDKEGQVFTPSLHLSWEEYDRMKESKKITPYSGTTVEVDKFGYALAFSKAMIPVIRN EEKVRKILDKSPVRRHIGLYSYTYDALKKYFEVEASPYELPEGLEQMRFLHNRIPVKMID VDYRNRKSMSGVDSPEDIERAEKIIAEFGEFNLSPE >gi|226332019|gb|ACIB01000037.1| GENE 228 274768 - 274899 56 43 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MENLSDDKEREQKHSFRPIFKHICWLEAKFPAWRIDRLNCGFF >gi|226332019|gb|ACIB01000037.1| GENE 229 275460 - 275927 646 155 aa, chain + ## HITS:1 COG:no KEGG:BF2275 NR:ns ## KEGG: BF2275 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 155 1 155 155 274 100.0 7e-73 MSVIFKTVKRPSDPRVENSPKRYYPQLITLGRSVDLKFIAQKIQDRSSLSVGDIKSTIQN FVEKLKEQLLEGKAVNIEGLGVFMLAAKSKGSEKQEDITAKSVDSVRIYFQANKELKITK TATRAGEKLDLISLDDYLKGAADGGNGDIVDDPTA 
>gi|226332019|gb|ACIB01000037.1| GENE 230 276138 - 278471 2153 777 aa, chain + ## HITS:1 COG:aq_624 KEGG:ns NR:ns ## COG: aq_624 COG5009 # Protein_GI_number: 15606057 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane carboxypeptidase/penicillin-binding protein # Organism: Aquifex aeolicus # 2 748 1 679 726 280 30.0 6e-75 MIKKIVKGLWVFFALMVLAGIAVFASIAYGWIGYMPPVEELENPNYKFATEILSEDGKVL GTWSLSKENRVYTSYNELSPNIVNALIATEDVRFTEHSGIDAKALIRAVVKRGLLMQKNA GGGSTLSQQLAKQLFTDEVARNTLQRLFQKPIEWVIAVKLERYYTKEEILSMYLNKFDFL NNAVGIKTASYTYFGCEPKDLKIEQAATLIGMCKNPSLYNPVRFNERSRGRRNTVLDQMR KAGYITAEECDSLQNLPLELVYHRVDHKEGLATYFREYLRGVMTAPKPVRSNYRGWQMQK FYEDSIDWEMNPLYGWCEKNKKKDGSNYNIYTDGLKIYTTINSHMQRYAEEAVEEHVGEY LQPLFFKEKKGRKKAPYSNQLTQEEIDRILDRAVKQTSRYQTMKEAGISEAEIKKAFNKP ESMSVFTWHGVKDTIMSPMDSIRYYKHFLRAGFMSMDPINGQVKAYVGGPNYTYFQYDMA MVGRRQVGSTIKPYLYALAMENGFSPCDETRNVEITLIDENGKPWSPKNTSKGHYGEMVT LKWGLANSNNWISAYLMSKLNPYALARLIHSFGVRNKEIQPTVSLCLGPCEISVGEMVSA YTAFANKGIRVAPLFVTKIEDSEGNVLATFSPQMEEVISASSAYKMLVMLRAVINEGTGG RVRRYGITADMGGKTGTTNRNSDGWFMGFTPSLVSGCWVGGEERDIHFDTMTYGQGASLA LPIWTKYMHKVYADQTLGYDPKETFKLPDGFDPCKDFSISGDSIIDEPESGLDDLFN >gi|226332019|gb|ACIB01000037.1| GENE 231 278485 - 278853 302 122 aa, chain + ## HITS:1 COG:no KEGG:BF2228 NR:ns ## KEGG: BF2228 # Name: not_defined # Def: putative 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase # Organism: B.fragilis # Pathway: not_defined # 1 122 1 122 122 215 99.0 4e-55 MHSCLICIGSNYNRKENLLLARRRLTALFPSIRFTGEQETRPLFFRNPALFSNQMARFYT DADAERVVKELKTIEREAGREQEDKKKEKVCLDIDLLVFDDRILRPEDLQREYVRKGLEE LK >gi|226332019|gb|ACIB01000037.1| GENE 232 278916 - 280202 1124 428 aa, chain + ## HITS:1 COG:alr2744 KEGG:ns NR:ns ## COG: alr2744 COG0612 # Protein_GI_number: 17230236 # Func_class: R General function prediction only # Function: Predicted Zn-dependent peptidases # Organism: Nostoc sp. 
PCC 7120 # 1 413 1 408 427 112 24.0 2e-24 MNRTLQPEIQELVQFNILPPVRTVMPNGVPLTIINAGEQDVVRVDILFGGGRWQQSQKLQ ALFANRMLREGSRKYTAAEIAEKLDYYGAWLELSSSAEYAYITLYSLNKYFAETLDVLES IIKEPLFPEKELGTVIDANIQQYQVNASKVDFLAHRSLLRALYGEEHPCGRYVEEMDYHH ITPALLREFYDAYYHSGNCYVYLSGKVTDEITHRIEAAFGTTHFGNHQQVAVKKDFPFVS IPEKRLFIEREDAMQSAVKLGTTTIMRTHPDYLKLRVLITLFGGYFGSRLMSNIREEKGY TYGISAGIMFYPGSGLLGISTETANEYVEPLIQEVYKEIDKLQNDKVTPEELAMVRNYML GEMCRNYESPFSLADAWMFILTSGLDDDYFARSLQAVKEVTPEEIRELAGRYLCKESLKE VIAGKKLT >gi|226332019|gb|ACIB01000037.1| GENE 233 280253 - 281413 1134 386 aa, chain + ## HITS:1 COG:slr1485 KEGG:ns NR:ns ## COG: slr1485 COG4642 # Protein_GI_number: 16329198 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Synechocystis # 44 357 27 341 349 163 35.0 5e-40 MKTTRYIYTALILAALSQGNAVAQNKGGFFGKVRDTFSTEIKIGNYTFKDGSVYTGEMKG RKPNGKGKTVFKNGDVYEGEYVKGKREGFGVYTFPDGERYEGQWYQDQQHGNGIYYFMNN NRYDGMWFQDYQHGPGTMYYHNGDIYVGDWVNDKREGKGTYTWRDGSKYVGDWKNDKKDG KGVLVWNDGCKYDGDWKNDVREGKGTFEYTNGEKYVGDWKDDLQHGKGIFFLGGDRYEGS YLQGERTGPGIYYHANGDKYVGNFKDGMQDGEGTFTWANGAVYEGEWKDNKRNGHGIYKW SNGDVYEGEWKNNQPNGKGTLTLTNGTKYKGGFVNGMQEGNGVEEDKNGNRYEGFFKQGK KNGPFVETDKNGKVIRKGTYKFGRLE Prediction of potential genes in microbial genomes Time: Tue May 17 23:18:19 2011 Seq name: gi|226332018|gb|ACIB01000038.1| Bacteroides sp. 3_2_5 cont1.38, whole genome shotgun sequence Length of sequence - 197430 bp Number of predicted genes - 169, with homology - 165 Number of transcription units - 82, operones - 44 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 29 - 967 1004 ## COG0462 Phosphoribosylpyrophosphate synthetase - Prom 1023 - 1082 6.2 + Prom 948 - 1007 5.3 2 2 Op 1 . + CDS 1098 - 5573 2760 ## COG0642 Signal transduction histidine kinase 3 2 Op 2 . + CDS 5602 - 6330 863 ## BF2233 two-component system response regulator 4 2 Op 3 . 
+ CDS 6338 - 7774 1049 ## COG2978 Putative p-aminobenzoyl-glutamate transporter + Term 7939 - 7982 3.1 + TRNA 7865 - 7941 76.3 # Arg TCT 0 0 + Prom 8256 - 8315 6.2 5 3 Op 1 . + CDS 8388 - 8717 140 ## BF2236 hypothetical protein + Prom 8880 - 8939 5.4 6 3 Op 2 . + CDS 9040 - 9777 733 ## BF2238 hypothetical protein + Term 9788 - 9859 20.5 + Prom 9812 - 9871 6.0 7 4 Op 1 9/0.000 + CDS 9895 - 10878 476 ## COG0147 Anthranilate/para-aminobenzoate synthases component I + Prom 11072 - 11131 3.6 8 4 Op 2 . + CDS 11174 - 11458 134 ## COG0115 Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase + Term 11633 - 11668 4.1 + Prom 11648 - 11707 6.2 9 5 Op 1 . + CDS 11746 - 12414 529 ## COG5587 Uncharacterized conserved protein 10 5 Op 2 . + CDS 12483 - 14495 1341 ## COG0296 1,4-alpha-glucan branching enzyme + Term 14519 - 14566 8.3 - Term 14505 - 14554 9.4 11 6 Op 1 . - CDS 14571 - 16268 1345 ## COG0366 Glycosidases 12 6 Op 2 . - CDS 16279 - 17337 783 ## COG0673 Predicted dehydrogenases and related proteins 13 6 Op 3 . - CDS 17328 - 18134 763 ## COG1752 Predicted esterase of the alpha-beta hydrolase superfamily - Prom 18226 - 18285 6.7 - Term 18258 - 18302 3.5 14 7 Tu 1 . - CDS 18304 - 19518 1022 ## COG0477 Permeases of the major facilitator superfamily - Prom 19570 - 19629 6.4 + Prom 19634 - 19693 3.6 15 8 Tu 1 . + CDS 19737 - 21242 1200 ## COG0174 Glutamine synthetase + Term 21260 - 21307 11.6 + Prom 21244 - 21303 4.0 16 9 Op 1 . + CDS 21328 - 21909 345 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases 17 9 Op 2 . + CDS 21982 - 22695 487 ## COG0300 Short-chain dehydrogenases of various substrate specificities + Prom 22705 - 22764 5.5 18 9 Op 3 . + CDS 22840 - 23430 334 ## BF2252 hypothetical protein + Term 23434 - 23484 17.0 - Term 23421 - 23472 18.0 19 10 Tu 1 . - CDS 23511 - 23720 192 ## BF2253 hypothetical protein - Prom 23753 - 23812 4.4 - Term 24556 - 24592 4.8 20 11 Op 1 . 
- CDS 24618 - 24788 251 ## BF2349 hypothetical protein 21 11 Op 2 39/0.000 - CDS 24830 - 25699 802 ## COG0074 Succinyl-CoA synthetase, alpha subunit 22 11 Op 3 . - CDS 25696 - 26844 1037 ## COG0045 Succinyl-CoA synthetase, beta subunit - Prom 26864 - 26923 7.3 - Term 26971 - 27026 18.0 23 12 Op 1 . - CDS 27050 - 27937 1032 ## COG0331 (acyl-carrier-protein) S-malonyltransferase - Prom 27980 - 28039 3.9 24 12 Op 2 . - CDS 28041 - 28874 639 ## COG0351 Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase - Prom 29102 - 29161 11.2 + Prom 29059 - 29118 9.1 25 13 Op 1 . + CDS 29216 - 29947 666 ## COG1051 ADP-ribose pyrophosphatase 26 13 Op 2 1/0.056 + CDS 29999 - 30430 229 ## COG1070 Sugar (pentulose and hexulose) kinases 27 13 Op 3 11/0.000 + CDS 30262 - 31506 927 ## COG1070 Sugar (pentulose and hexulose) kinases 28 13 Op 4 . + CDS 31596 - 32915 1594 ## COG2115 Xylose isomerase 29 13 Op 5 . + CDS 32981 - 34429 1319 ## COG0477 Permeases of the major facilitator superfamily + Term 34562 - 34627 21.0 - Term 34833 - 34892 15.2 30 14 Tu 1 . - CDS 34984 - 36192 1151 ## COG1524 Uncharacterized proteins of the AP superfamily - Prom 36222 - 36281 6.3 + Prom 36178 - 36237 5.2 31 15 Op 1 . + CDS 36298 - 37836 1288 ## COG0388 Predicted amidohydrolase + Prom 37857 - 37916 2.1 32 15 Op 2 . + CDS 37955 - 38587 661 ## COG0450 Peroxiredoxin + Term 38612 - 38651 6.1 + Prom 38719 - 38778 10.1 33 16 Tu 1 . + CDS 38798 - 39043 311 ## COG0724 RNA-binding proteins (RRM domain) + Term 39090 - 39127 -0.9 - Term 39052 - 39095 8.7 34 17 Op 1 . - CDS 39116 - 40507 1361 ## COG1858 Cytochrome c peroxidase 35 17 Op 2 . - CDS 40523 - 41308 353 ## COG0755 ABC-type transport system involved in cytochrome c biogenesis, permease component 36 17 Op 3 . - CDS 41292 - 42179 534 ## BF2364 hypothetical protein - Prom 42415 - 42474 6.6 + Prom 42302 - 42361 3.8 37 18 Tu 1 . + CDS 42439 - 43704 559 ## BF2365 hypothetical protein 38 19 Tu 1 . 
- CDS 43839 - 44888 674 ## BF2274 hypothetical protein - Prom 45068 - 45127 5.4 + Prom 45032 - 45091 8.8 39 20 Op 1 . + CDS 45113 - 48538 3489 ## COG0060 Isoleucyl-tRNA synthetase 40 20 Op 2 . + CDS 48573 - 48953 578 ## BF2276 putative DnaK suppressor protein + Prom 48956 - 49015 1.8 41 21 Op 1 . + CDS 49071 - 49703 414 ## BF2277 lipoprotein signal peptidase 42 21 Op 2 . + CDS 49714 - 50760 477 ## BF2278 hypothetical protein 43 21 Op 3 . + CDS 50769 - 51098 201 ## BF2279 hypothetical protein - Term 50932 - 50989 1.6 44 22 Tu 1 . - CDS 51052 - 51813 553 ## COG0566 rRNA methylases - Prom 52024 - 52083 9.0 + Prom 51881 - 51940 8.3 45 23 Tu 1 . + CDS 52184 - 52696 369 ## BF2281 TonB + Term 52715 - 52757 9.2 + Prom 52729 - 52788 7.9 46 24 Tu 1 . + CDS 52818 - 55130 1821 ## BF2374 putative surface membrane protein + Term 55162 - 55206 7.1 - Term 55150 - 55194 7.1 47 25 Tu 1 . - CDS 55219 - 55773 616 ## BF2283 hypothetical protein - Prom 55832 - 55891 7.4 48 26 Tu 1 . - CDS 55895 - 56332 334 ## BF2284 hypothetical protein - Prom 56366 - 56425 3.9 + Prom 56616 - 56675 5.8 49 27 Tu 1 . + CDS 56725 - 56823 118 ## + Term 56842 - 56882 1.0 - Term 56825 - 56873 3.9 50 28 Tu 1 . - CDS 56881 - 57336 394 ## BF2286 hypothetical protein - Prom 57441 - 57500 7.3 - TRNA 57994 - 58069 81.9 # Lys CTT 0 0 - Term 58129 - 58185 1.1 51 29 Tu 1 . - CDS 58337 - 58492 78 ## - Prom 58655 - 58714 5.9 + Prom 58358 - 58417 6.4 52 30 Tu 1 . + CDS 58592 - 59734 751 ## COG3746 Phosphate-selective porin 53 31 Tu 1 . - CDS 59879 - 61090 715 ## COG0477 Permeases of the major facilitator superfamily - Prom 61274 - 61333 5.3 - TRNA 61343 - 61418 81.9 # Lys CTT 0 0 - Term 61303 - 61347 7.6 54 32 Op 1 . - CDS 61507 - 62310 778 ## COG1235 Metal-dependent hydrolases of the beta-lactamase superfamily I - Term 62322 - 62366 3.7 55 32 Op 2 . - CDS 62389 - 63777 810 ## BF2381 putative transport-related membrane protein - Prom 63846 - 63905 5.1 - Term 63848 - 63896 -0.8 56 33 Tu 1 . 
- CDS 63927 - 65333 956 ## COG1904 Glucuronate isomerase - Prom 65447 - 65506 6.8 - Term 65621 - 65666 2.9 57 34 Op 1 . - CDS 65725 - 66258 493 ## BF2383 hypothetical protein 58 34 Op 2 . - CDS 66269 - 67330 850 ## BF2295 hypothetical protein - Prom 67486 - 67545 5.5 + Prom 67335 - 67394 4.1 59 35 Op 1 . + CDS 67489 - 68037 609 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes 60 35 Op 2 . + CDS 68061 - 69374 1278 ## COG1004 Predicted UDP-glucose 6-dehydrogenase + Prom 69424 - 69483 7.3 61 36 Tu 1 . + CDS 69516 - 70319 519 ## BF2298 hypothetical protein + Term 70445 - 70473 -0.9 - Term 70172 - 70205 -0.6 62 37 Op 1 . - CDS 70390 - 71646 927 ## COG0513 Superfamily II DNA and RNA helicases 63 37 Op 2 . - CDS 71726 - 72949 1349 ## COG0560 Phosphoserine phosphatase - Prom 73005 - 73064 3.9 + Prom 73005 - 73064 5.3 64 38 Tu 1 . + CDS 73221 - 73730 393 ## BF2390 hypothetical protein + Term 73818 - 73869 9.1 - Term 73806 - 73856 8.1 65 39 Op 1 1/0.056 - CDS 73896 - 74993 1002 ## COG0795 Predicted permeases 66 39 Op 2 . - CDS 74998 - 76035 1064 ## COG0343 Queuine/archaeosine tRNA-ribosyltransferase 67 39 Op 3 . - CDS 76125 - 78593 2450 ## COG0466 ATP-dependent Lon protease, bacterial type - Prom 78720 - 78779 4.1 + Prom 78698 - 78757 3.1 68 40 Tu 1 . + CDS 78778 - 79491 417 ## COG4123 Predicted O-methyltransferase + Term 79559 - 79607 7.1 - TRNA 79602 - 79676 81.8 # Pro TGG 0 0 - TRNA 79704 - 79781 82.4 # Pro TGG 0 0 - Term 79924 - 79964 5.1 69 41 Tu 1 . - CDS 79992 - 80210 382 ## BF2360 hypothetical protein - Prom 80235 - 80294 7.2 + Prom 80312 - 80371 6.1 70 42 Op 1 . + CDS 80396 - 80548 181 ## gi|253565585|ref|ZP_04843040.1| predicted protein 71 42 Op 2 . + CDS 80557 - 80898 352 ## COG4828 Predicted membrane protein 72 42 Op 3 . 
+ CDS 80967 - 81140 241 ## BF2363 hypothetical protein + Term 81203 - 81237 1.8 - Term 81093 - 81128 1.0 73 43 Op 1 8/0.000 - CDS 81147 - 82424 1006 ## COG5000 Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation 74 43 Op 2 . - CDS 82421 - 83785 1453 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 75 43 Op 3 . - CDS 83761 - 84006 69 ## BF2366 hypothetical protein - Prom 84088 - 84147 6.2 76 44 Op 1 13/0.000 + CDS 84148 - 85620 1505 ## COG1538 Outer membrane protein 77 44 Op 2 . + CDS 85659 - 86909 1523 ## COG0845 Membrane-fusion protein + Prom 86947 - 87006 2.4 78 45 Tu 1 . + CDS 87053 - 89476 1582 ## BF2369 ABC transporter permease + Term 89561 - 89614 3.4 + Prom 89539 - 89598 5.0 79 46 Op 1 . + CDS 89626 - 91971 1363 ## BF2454 ABC transporter 80 46 Op 2 10/0.000 + CDS 91988 - 94315 1686 ## COG0577 ABC-type antimicrobial peptide transport system, permease component 81 46 Op 3 . + CDS 94330 - 96732 1256 ## COG0577 ABC-type antimicrobial peptide transport system, permease component + Prom 96825 - 96884 3.1 82 47 Op 1 . + CDS 96926 - 99154 1082 ## BF2373 putative ABC-transporter permease protein 83 47 Op 2 . + CDS 99165 - 101501 990 ## BF2458 hypothetical protein 84 47 Op 3 3/0.000 + CDS 101530 - 102204 336 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) + Prom 102277 - 102336 6.6 85 47 Op 4 . + CDS 102375 - 103541 417 ## COG0526 Thiol-disulfide isomerase and thioredoxins + Term 103580 - 103635 9.3 + Prom 103596 - 103655 5.9 86 48 Op 1 . + CDS 103675 - 104343 117 ## Cthe_3205 hypothetical protein 87 48 Op 2 . + CDS 104361 - 104987 462 ## Slin_3445 protein of unknown function DUF88 88 48 Op 3 . + CDS 104963 - 105742 467 ## Mevan_0112 hypothetical protein 89 48 Op 4 . + CDS 105739 - 107214 734 ## Mevan_0111 hypothetical protein 90 48 Op 5 . 
+ CDS 107211 - 107690 413 ## gi|253565605|ref|ZP_04843060.1| predicted protein 91 48 Op 6 . + CDS 107704 - 107955 180 ## Mevan_0105 hypothetical protein + Prom 108436 - 108495 8.5 92 49 Op 1 . + CDS 108570 - 109112 278 ## gi|253565606|ref|ZP_04843061.1| conserved hypothetical protein 93 49 Op 2 . + CDS 109117 - 110919 785 ## Mevan_0106 CRISPR-associated RAMP Crm2 family protein 94 49 Op 3 . + CDS 110912 - 112051 485 ## Mevan_0107 CRISPR-associated Cmr3 family protein 95 49 Op 4 . + CDS 112071 - 112910 454 ## COG1336 Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 96 49 Op 5 . + CDS 112923 - 113333 368 ## gi|253565610|ref|ZP_04843065.1| predicted protein 97 49 Op 6 . + CDS 113330 - 114280 640 ## Mevan_0110 CRISPR-associated RAMP Cmr6 family protein 98 49 Op 7 . + CDS 114300 - 116570 972 ## COG3344 Retron-type reverse transcriptase 99 49 Op 8 . + CDS 116564 - 116854 143 ## Ppha_2458 CRISPR-associated protein Cas2 + Term 116864 - 116910 1.5 + Prom 117976 - 118035 6.1 100 50 Tu 1 . + CDS 118164 - 118406 60 ## + Prom 118432 - 118491 1.9 101 51 Op 1 . + CDS 118531 - 118749 153 ## COG3666 Transposase and inactivated derivatives 102 51 Op 2 . + CDS 118807 - 118962 75 ## BF4243 ISNCY family transposase - Term 118952 - 118989 5.5 103 52 Tu 1 . - CDS 119018 - 120202 1136 ## COG0156 7-keto-8-aminopelargonate synthetase and related enzymes - Prom 120237 - 120296 6.4 + Prom 120253 - 120312 5.4 104 53 Op 1 . + CDS 120437 - 121480 772 ## COG1597 Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase + Prom 121483 - 121542 5.1 105 53 Op 2 . + CDS 121562 - 123319 1822 ## COG0173 Aspartyl-tRNA synthetase 106 53 Op 3 . + CDS 123320 - 123703 245 ## BF2380 hypothetical protein + Term 123728 - 123770 3.1 - Term 123697 - 123768 8.3 107 54 Op 1 5/0.000 - CDS 123783 - 124667 962 ## COG0388 Predicted amidohydrolase 108 54 Op 2 . 
- CDS 124679 - 125794 803 ## COG2957 Peptidylarginine deiminase and related enzymes - Term 125808 - 125875 12.1 109 55 Tu 1 . - CDS 125876 - 126403 549 ## COG4739 Uncharacterized protein containing a ferredoxin domain - Prom 126445 - 126504 4.0 + Prom 126249 - 126308 3.5 110 56 Op 1 . + CDS 126498 - 127118 186 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 111 56 Op 2 . + CDS 127122 - 127916 586 ## COG0390 ABC-type uncharacterized transport system, permease component 112 56 Op 3 . + CDS 127924 - 128196 201 ## BF2386 hypothetical protein + Prom 128198 - 128257 5.0 113 56 Op 4 . + CDS 128288 - 128671 340 ## COG0346 Lactoylglutathione lyase and related lyases + Term 128698 - 128736 -0.1 114 57 Tu 1 . - CDS 128664 - 129077 306 ## BF2472 hypothetical protein - Prom 129103 - 129162 6.1 + Prom 128698 - 128757 2.5 115 58 Op 1 3/0.000 + CDS 128938 - 130278 641 ## COG3323 Uncharacterized protein conserved in bacteria 116 58 Op 2 . + CDS 130284 - 131108 934 ## COG1579 Zn-ribbon protein, possibly nucleic acid-binding + Term 131111 - 131155 1.1 + Prom 131153 - 131212 3.3 117 59 Op 1 13/0.000 + CDS 131346 - 132752 501 ## PROTEIN SUPPORTED gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 118 59 Op 2 27/0.000 + CDS 132771 - 133910 986 ## COG0845 Membrane-fusion protein 119 59 Op 3 . + CDS 133915 - 137037 2671 ## COG0841 Cation/multidrug efflux pump + Term 137087 - 137138 6.1 - Term 137231 - 137298 19.7 120 60 Tu 1 . - CDS 137382 - 137831 354 ## BF2394 hypothetical protein - Prom 137873 - 137932 4.0 + Prom 137797 - 137856 4.0 121 61 Op 1 . + CDS 137896 - 138672 629 ## COG0775 Nucleoside phosphorylase 122 61 Op 2 . + CDS 138691 - 139710 887 ## COG1466 DNA polymerase III, delta subunit + Term 139732 - 139795 12.6 123 62 Tu 1 . - CDS 140302 - 140454 107 ## BF2397 hypothetical protein - Prom 140474 - 140533 4.3 + Prom 140781 - 140840 11.0 124 63 Op 1 . 
+ CDS 140915 - 141370 238 ## BF2398 hypothetical protein 125 63 Op 2 13/0.000 + CDS 141443 - 142219 466 ## COG0543 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases 126 63 Op 3 . + CDS 142207 - 143118 903 ## COG0167 Dihydroorotate dehydrogenase + Term 143287 - 143327 8.4 - Term 143262 - 143329 17.5 127 64 Tu 1 . - CDS 143336 - 144013 549 ## COG0336 tRNA-(guanine-N1)-methyltransferase - Prom 144098 - 144157 5.9 + Prom 143949 - 144008 5.7 128 65 Tu 1 . + CDS 144090 - 146087 1613 ## COG0272 NAD-dependent DNA ligase (contains BRCT domain type II) + Prom 146102 - 146161 1.9 129 66 Tu 1 . + CDS 146191 - 147084 832 ## COG0329 Dihydrodipicolinate synthase/N-acetylneuraminate lyase + Term 147139 - 147194 2.1 - Term 147127 - 147182 5.1 130 67 Op 1 . - CDS 147236 - 147862 333 ## COG4845 Chloramphenicol O-acetyltransferase - Prom 147885 - 147944 3.3 131 67 Op 2 . - CDS 147982 - 148245 235 ## BF2406 hypothetical protein - Prom 148459 - 148518 8.9 + Prom 148459 - 148518 8.6 132 68 Tu 1 . + CDS 148545 - 149438 458 ## COG1262 Uncharacterized conserved protein + Term 149503 - 149537 5.5 - TRNA 149529 - 149605 54.7 # Arg ACG 0 0 - TRNA 149628 - 149701 55.8 # Arg ACG 0 0 - TRNA 149720 - 149793 54.1 # Arg ACG 0 0 133 69 Tu 1 . - CDS 149874 - 152126 1598 ## COG1752 Predicted esterase of the alpha-beta hydrolase superfamily - Prom 152146 - 152205 5.4 - Term 152263 - 152309 9.5 134 70 Op 1 1/0.056 - CDS 152344 - 154389 2172 ## COG0326 Molecular chaperone, HSP90 family - Prom 154455 - 154514 2.8 - Term 154454 - 154488 5.3 135 70 Op 2 . - CDS 154520 - 157054 1959 ## PROTEIN SUPPORTED gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 - Prom 157176 - 157235 9.2 136 71 Op 1 . + CDS 157446 - 159983 2436 ## COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 137 71 Op 2 . + CDS 160016 - 161227 1272 ## BF2412 TPR repeat-containing protein + Term 161263 - 161315 12.4 138 72 Op 1 . - CDS 161361 - 161528 89 ## 139 72 Op 2 . 
- CDS 161577 - 162698 773 ## COG0589 Universal stress protein UspA and related nucleotide-binding proteins - Term 162719 - 162760 6.6 140 72 Op 3 . - CDS 162777 - 163061 259 ## BF2414 hypothetical protein - Prom 163091 - 163150 7.7 - Term 163194 - 163243 7.5 141 73 Op 1 . - CDS 163272 - 164108 682 ## BF2415 hypothetical protein 142 73 Op 2 . - CDS 164127 - 165971 1410 ## BF2416 hypothetical protein 143 73 Op 3 3/0.000 - CDS 165986 - 166696 709 ## COG0457 FOG: TPR repeat 144 73 Op 4 5/0.000 - CDS 166705 - 167730 1044 ## COG2304 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 145 73 Op 5 . - CDS 167744 - 168727 773 ## COG2304 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 146 73 Op 6 . - CDS 168779 - 169852 822 ## BF2502 putative membrane exported protein 147 73 Op 7 23/0.000 - CDS 169861 - 170730 616 ## COG1721 Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 148 73 Op 8 . - CDS 170787 - 171782 1064 ## COG0714 MoxR-like ATPases - Prom 171958 - 172017 4.9 - Term 171959 - 172017 9.0 149 74 Op 1 . - CDS 172026 - 173477 1013 ## BF2423 putative integration host factor IHF alpha subunit 150 74 Op 2 . - CDS 173470 - 173760 281 ## BF2424 DNA-binding protein HU 151 74 Op 3 3/0.000 - CDS 173835 - 175133 1226 ## PROTEIN SUPPORTED gi|229254937|ref|ZP_04378866.1| SSU ribosomal protein S12P methylthiotransferase 152 74 Op 4 . - CDS 175130 - 176089 739 ## PROTEIN SUPPORTED gi|163762490|ref|ZP_02169555.1| ribosomal protein L28 - Prom 176110 - 176169 8.4 - Term 176160 - 176218 4.1 153 75 Op 1 . - CDS 176239 - 176397 263 ## PRU_0750 hypothetical protein 154 75 Op 2 . - CDS 176417 - 176605 320 ## PROTEIN SUPPORTED gi|53713719|ref|YP_099711.1| 50S ribosomal protein L33 155 75 Op 3 . - CDS 176632 - 176892 456 ## PROTEIN SUPPORTED gi|53713720|ref|YP_099712.1| 50S ribosomal protein L28 - Prom 176912 - 176971 5.7 156 76 Op 1 . 
- CDS 177014 - 178246 1032 ## COG1058 Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 157 76 Op 2 . - CDS 178254 - 179273 648 ## PROTEIN SUPPORTED gi|227425790|ref|ZP_03908856.1| SSU ribosomal protein S18P alanine acetyltransferase + Prom 179212 - 179271 3.8 158 77 Tu 1 . + CDS 179306 - 183910 3160 ## BF2513 hypothetical protein - Term 183881 - 183917 4.0 159 78 Op 1 . - CDS 184006 - 184386 284 ## COG3304 Predicted membrane protein - Prom 184415 - 184474 3.6 - Term 184397 - 184460 8.4 160 78 Op 2 . - CDS 184479 - 185972 1732 ## COG0442 Prolyl-tRNA synthetase - Prom 186055 - 186114 5.6 - Term 186196 - 186232 -0.6 161 79 Tu 1 . - CDS 186267 - 186974 314 ## BF2436 hypothetical protein - Prom 187033 - 187092 5.1 162 80 Tu 1 . - CDS 187133 - 190366 2868 ## COG0793 Periplasmic protease - Prom 190386 - 190445 4.7 - TRNA 190517 - 190592 72.5 # Gly CCC 0 0 + Prom 190585 - 190644 5.2 163 81 Op 1 2/0.000 + CDS 190738 - 191256 499 ## COG2087 Adenosyl cobinamide kinase/adenosyl cobinamide phosphate guanylyltransferase 164 81 Op 2 11/0.000 + CDS 191279 - 192316 1062 ## COG2038 NaMN:DMB phosphoribosyltransferase 165 81 Op 3 6/0.000 + CDS 192329 - 193072 460 ## COG0368 Cobalamin-5-phosphate synthase 166 81 Op 4 . + CDS 193075 - 193620 558 ## COG0406 Fructose-2,6-bisphosphatase 167 82 Op 1 9/0.000 - CDS 193598 - 194563 518 ## COG1270 Cobalamin biosynthesis protein CobD/CbiB 168 82 Op 2 2/0.000 - CDS 194590 - 195603 1009 ## COG0079 Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 169 82 Op 3 . 
- CDS 195596 - 197059 1442 ## COG1492 Cobyric acid synthase - Prom 197214 - 197273 5.9 Predicted protein(s) >gi|226332018|gb|ACIB01000038.1| GENE 1 29 - 967 1004 312 aa, chain - ## HITS:1 COG:jhp0679 KEGG:ns NR:ns ## COG: jhp0679 COG0462 # Protein_GI_number: 15611746 # Func_class: F Nucleotide transport and metabolism; E Amino acid transport and metabolism # Function: Phosphoribosylpyrophosphate synthetase # Organism: Helicobacter pylori J99 # 7 310 13 317 318 306 50.0 4e-83 MSEKAPFMVFSGTNSRYLAEKICASLNCPLGNMNITHFADGEFAVSYEESIRGAHVFLVQ STFPNSDNLMELLLMIDAAKRASAKSVVAVIPYFGWARQDRKDKPRVSIGAKLVADLLSV AGIDRLITMDLHADQIQGFFNIPVDHLYASAVFLPYIQSLKLDELVIATPDVGGSKRAST FSKYLGVPLVLCNKSREKANEVASMQIIGDVKGKNVVLIDDIVDTAGTITKAANIMLEAG ANSVRAIASHCVMSDPASFRVQESGLTEMVFTDSIPYSKKCVKVKQLSIADMFAETIKRV MNNESISSQYII >gi|226332018|gb|ACIB01000038.1| GENE 2 1098 - 5573 2760 1491 aa, chain + ## HITS:1 COG:MA1957_2 KEGG:ns NR:ns ## COG: MA1957_2 COG0642 # Protein_GI_number: 20090805 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Methanosarcina acetivorans str.C2A # 657 879 52 253 270 70 32.0 3e-11 MNPRLLIPLFCMLLLTTFFSCVDTAPTKEVQHIDSLNSRAYMYRYRDLDSSCKAASQAYK EVSMYSQGKAEASNNLAFCAFMRMDFDTAERLHKGVYDLTKNELELLIADIGLMKIYQRT AMNKEFYDYRNSAIRRMKRIDEENNLFVDRHESARLDYAFTEFFIVSAVYYYYLQQRTEA MASLNEIQLNEDLATDTNQILYFHYIKGSAGLCEGKTPDERKLEEFDELYTTWKMASEQG YLYFEGNGLQGLTNLMASQESYELFLNRRSHALTGFGIPVDSLLPLRLGQMALERFRKYN DLYQIAGAYVSIGKYLNAHGRYTEALDTLAKALDCVNQHHMSYYHSVADTLDKLQVFVPN KDVAYTEVTWLGKEKVKTVPEWISRIREQLSVSYACLGMKPASDYNRNVYLDILRYTRQD KELESRYLSLEQESRQLNVVLFFVIIGLILVTAMFWLFNKRSKVRNRLHIARLRQTLDVC QKITASIPVDVTDESEIVYAISESIQPDMEQLFGATEIRIGIRNEETGDIEFNEECESEQ TDEPKSIGEVGIGSVFNLYVPDKTEPIGILKLFTRHRLNKDEQALVKVISPYIAWALDNG MTFISLGDERNKLEKQRYVYEQHIAGNKRQNLIKKSCLAIVDGINPYIDRIINEVHKLTE KGFINNDRIKKEKYQYIDELVTTINEYNDILALWIKMKQGSLSLNIENFELNELFELISK GRKAFEMKKQRLEIEPTKALVKADKALTLFMINTLAENARKYTPEGGIVKIYAKRTDEYV EISVEDNGRGLSTEDITKIIGEKVYDSQSIGMKDNPDKEELKRSKGSGFGLMNCKGIIEK 
YKKTNDLFRICTFNIESTPGKGSRFYFRLPLGVRKIIGICLCLVGLFGFFSCQGEPMPEK LKDIPVDSVTLAAEAEYERLLDEASRFADTVYYCNVIENFELALQYADSALNRLNAHYKK YARHPQRFMKLVGSGIPAELEWWVEPYNTDFHVILDVRNEASVAFLGLKKLDAYTYNNVA YTALYKLTGEDQSLEGYCRQLERSTTNKTVGIILCILLLVASLCGYYLLYVRKRLLNRLN LEQVLEINKKVFASSLVRTQESAEALQREEDTLKEIPQRIVNESFDSMNELLTIECLGIA VYNEMGHRLEFASTPRMDTPPEIIQQCFDNQTYLSDGDMQALPLLVDAGGKHQCVGVLYL EKQEDSMQEADRLLFQLISRYVGIVVFNAVVRLATKYRDIEAAHEETRRASWEDSMLHVQ NMVLDNCLSTIKHETIYYPNKIKQIIGRLNTHSLSGEEEKECVETISELIEYYKGIFRIL SSCASRQLEEVTFRRATIPVTDIMAYAGKYFKRISRGVDYKITLTIEPLEAKVIGDINQL RFLIENLIDEALSFHQDGELVLKAIMDGEYVRFLFTDRRREKQVEELNQLFYPNLARMTS GEKGELRGTEYLICKQIIRDHDEFAGRRGCRINAEPAQGGGFTVYFTVPKR >gi|226332018|gb|ACIB01000038.1| GENE 3 5602 - 6330 863 242 aa, chain + ## HITS:1 COG:no KEGG:BF2233 NR:ns ## KEGG: BF2233 # Name: not_defined # Def: two-component system response regulator # Organism: B.fragilis # Pathway: not_defined # 1 242 1 242 242 457 100.0 1e-127 MEDKKFKVIIVEDVKLELKGTEEIFRHEIPNAEVIGTAMTENEFWPLMETQLPDMVLLDL GLGGSTTIGVDICRNIFKRYPGVRVLIFTGEILNEKLWVDVLNAGADGIILKTGELLTKT DVQAVMDGKKLVFNYPILEKIVERFKKSVANDAKRQEAVISYDIDEYDERFLRHLALGYT KEMIANLKGMPFGVKSLEKRQNDLIGRLFPQGERVGVNATRLVVRALELRIIDLDNLEAD EE >gi|226332018|gb|ACIB01000038.1| GENE 4 6338 - 7774 1049 478 aa, chain + ## HITS:1 COG:FN0470 KEGG:ns NR:ns ## COG: FN0470 COG2978 # Protein_GI_number: 19703805 # Func_class: H Coenzyme transport and metabolism # Function: Putative p-aminobenzoyl-glutamate transporter # Organism: Fusobacterium nucleatum # 1 478 23 503 512 314 39.0 2e-85 MPHPATMFFLFTLAVIFLSWIFDIYGLRVQLPQTGAEIRVQSLLSPEGIRWMLRNAITNF TGFAPLGMVLIAMFGIGVAQHSGFIDACVRQGVKNRKNTRRIILWVIILGLLSNIVGDAG YIILLPIAATLFYSVGLNPVAGIITAYVSVSCGYSANVVLSTMDPLIARTTQEAAIDSGV YQGNTGPLCNYYFMSVSTFVIGAIIYRITCKRLIPSLGQYEGKQIFEGYKQLSRKERRAM TMAIVVGMLYAAIILWATFSSWGILRGVNGGLIRSPFIMGILFLLSLGAAIMGMVYGFSS GRYRSDNDVIEGLAQPMKLLGGYLVIAFFAAQMFACLEYSHLDKCVAIIGANLLSSVQAG PLWTLILFILFTATINLIMVSATAKWAFMAFIFVPVFARMGIEPDMTQCAFRIGDSATNA 
ITPFMFYMPLVLTYMQQYDKQATYGSLLKYTWRYSVYILIGWTMLLFIWYLTGLPLGL >gi|226332018|gb|ACIB01000038.1| GENE 5 8388 - 8717 140 109 aa, chain + ## HITS:1 COG:no KEGG:BF2236 NR:ns ## KEGG: BF2236 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 53 109 1 57 57 104 94.0 1e-21 MVKEFTPDMTNSMQKIVRKCFPRTLQIVNKIHVFTLAYEAIRDLRMAYRWQIMKNETAFS TAGIYDIWVDKDSGKQHATFSIITIVTDPLTDYIHNTKYRMPVIFVIQR >gi|226332018|gb|ACIB01000038.1| GENE 6 9040 - 9777 733 245 aa, chain + ## HITS:1 COG:no KEGG:BF2238 NR:ns ## KEGG: BF2238 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 245 1 245 245 391 99.0 1e-108 MKKIILLLALCFTANNFFAQTTDPNQLKNEGNDALNAKNYAVAFEKYSEYLKLTNNQDSV TAYNCGVCADNIKKYKEAADYFDIAIKKNYNLANAYIGKSAAYRDMKNNQEYIATLTEGI KAVPGNATLEKLYAIYYLKEGQKFQQAGNIEKAEENYKHATDVTSKKWKTDALYSLGVLF YNNGADVLRKATPLASSNKEKYASEKAKADAAFKKAVDYLGEAVTLSPNRTEIKQMQDQV KAMIK >gi|226332018|gb|ACIB01000038.1| GENE 7 9895 - 10878 476 327 aa, chain + ## HITS:1 COG:HI1170 KEGG:ns NR:ns ## COG: HI1170 COG0147 # Protein_GI_number: 16273094 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Anthranilate/para-aminobenzoate synthases component I # Organism: Haemophilus influenzae # 10 326 10 324 328 297 46.0 2e-80 MQFYSRNEAINRINKLAGAGKAFLFIIDYKQECSFIEKVDDIDSSELLYNLNGFTNCTSV VTPFRYPIIWQPQPISLSQYKKSFDIIRKNILSGNSFLTNLTCMTLVNTNLGLKDIFYRS RALYKLWLKETFVVFSPEIFIRIENGRISSYPMKGTIDATLPSATRLLMEDEKEAAEHAT IVDLIRNDLSIVADNVSVTRYRYVDTLYTNHGPILQTSSEISGVLPKSYVDHLGEILFRL LPAGSITGAPKHKTMEIIEQAEEYERGFYTGITGYFDGRKLDSAVMIRFIEEQNGQIFFK SGGGITCKSDLENEYNEMKQKVYVPIY >gi|226332018|gb|ACIB01000038.1| GENE 8 11174 - 11458 134 94 aa, chain + ## HITS:1 COG:HI1169 KEGG:ns NR:ns ## COG: HI1169 COG0115 # Protein_GI_number: 16273093 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase # Organism: Haemophilus 
influenzae # 7 77 115 185 188 65 43.0 2e-11 MHKREQDEILITRNGLLTDTSIANIALFNGKEWHTPKHPLLKGVQRAALIDKHLIREKEI TVDQLFNYSQICLFNAMIDFGKIKIDVNRELIRI >gi|226332018|gb|ACIB01000038.1| GENE 9 11746 - 12414 529 222 aa, chain + ## HITS:1 COG:XF2023 KEGG:ns NR:ns ## COG: XF2023 COG5587 # Protein_GI_number: 15838617 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Xylella fastidiosa 9a5c # 8 219 19 234 237 95 26.0 7e-20 MSNIPVIFRFLKDLTANNNREWFNEHREEYEIARLEFENFLSTVIARISLFDESIRGIQP KECTYRIYRDTRFSSDKTPYKNHFGGYINAKGKKSYHSGYYIHIQPEGCMLAGGSLCLPS NILKALRQSIYDNIDEYRSIVEDPEFQQFFPIVGGDFLKTAPKGFPKDFKYIDYLKPKEF TCAYSVPDSFFLTPDILDKIEEVFRQFKRFADFTNFTIDDFE >gi|226332018|gb|ACIB01000038.1| GENE 10 12483 - 14495 1341 670 aa, chain + ## HITS:1 COG:YEL011w KEGG:ns NR:ns ## COG: YEL011w COG0296 # Protein_GI_number: 6320826 # Func_class: G Carbohydrate transport and metabolism # Function: 1,4-alpha-glucan branching enzyme # Organism: Saccharomyces cerevisiae # 8 666 12 700 704 561 45.0 1e-159 MEKTLNLIKNDPWLEPYKDAIVGRFEHAMDKKAELTNGGKSTLSDFASGYLYFGLHRTDK GWIFREWAPNASHIYMVGTFSNWEEKPAYKLKRLKNGSWEIKLPIDTIQHGDLYKLHVYW EGGQGERIPAWANRVVQDDNTKIFSAQVWAPEKPFKFKKKTFKPSTDPLLIYECHIGMAQ QEEKVGTYNEFREKILPRIAKEGYNCIQIMAIQEHPYYGSFGYHVSSFFAASSRFGTPEE LKQLIDTAHGLGIAVIMDIVHSHAVKNEVEGLGNFAGDPNQYFYPGGRREHPAWDSLCFD YGKNEVMHFLLSNCKYWLEEYHFDGFRFDGVTSMLYYSHGLGEAFCNYGDYFNGHQDDNA ICYLTLANELIHEVNPKAITIAEEVSGMPGLAAKVEDGGYGFDYRMAMNIPDYWIKTIKE KIDEDWKPSSMFWEVTNRRQDEKTISYAESHDQALVGDKTIIFRLIDADMYWHMQKGDEN YIVHRGVALHKMIRLLTASTINGGYLNFMGNEFGHPEWIDFPREGNGWSCKYARRQWDLV DNKNLTYHYLGDFDADMLKVIKSVKNIQQTPVQEIWHNDGDQVLAYQRKDLVFVFNFNPS QSFTDYGFLVTPGTYEVVLNTDNIIYGGNGLSDDSVKHFTLPDPLYKKEKKEWLKLYIPA RTAMVLRRTK >gi|226332018|gb|ACIB01000038.1| GENE 11 14571 - 16268 1345 565 aa, chain - ## HITS:1 COG:TM1650 KEGG:ns NR:ns ## COG: TM1650 COG0366 # Protein_GI_number: 15644398 # Func_class: G Carbohydrate transport and metabolism # Function: Glycosidases # Organism: Thermotoga maritima # 3 360 2 262 422 89 24.0 
2e-17 MKNENKMIIYQVFTRLFGNNNNHCIYNGDISQNGCGKMADFTAKALGEIKKLGATHIWYT GIIEHASQTDYRRYNIRPDHPAIVKGKAGSPYAIKDYYDVDPDLATDVPGRMKEFENLVS RTHRAGLKVIIDFVPNHVARQYHSDAQPDGTTQLGANDDPNYSFSPYNNFYYIPQSELHG QFDMTGNALEPYHEFPAKATGNNRFDAYPNINDWYETVKLNYGVDYQNGGTCHFSPTPDT WTKMLDILLFWSSKNIDGFRCDMAEMVPVEFWEWAIPQVKQEYPNIIFIAEVYNPHEYKN YLFRGKFDFLYDKVGLYDTLRNVACGYDSATAITRSWQSLGGIEKRMLNFLENHDEQRIA SDFFAGDPRKGVPALIVSACMNTNPMMIYFGQEFGEMGMDSEGFSGRDGRTTIFDYWSVD TIRRWRNEGKFDGKMLTEEQKHLYAIYQRVLTLCNEEQAISNGVFFDLMYANENGWRFNE HKQYTFMRKYKNELLFIVVNFDNQPVNVAINVPSHAFDFLQIPQFDSYKAVDLLTDKVEE ISLLPYKATEIALGAYTGKILKIKF >gi|226332018|gb|ACIB01000038.1| GENE 12 16279 - 17337 783 352 aa, chain - ## HITS:1 COG:PM0652 KEGG:ns NR:ns ## COG: PM0652 COG0673 # Protein_GI_number: 15602517 # Func_class: R General function prediction only # Function: Predicted dehydrogenases and related proteins # Organism: Pasteurella multocida # 2 342 3 338 350 304 45.0 1e-82 MEIIKTGLAAFGMSGQVFHAPFISTNPHFELYKIVERSKELSKERYPQASIVRSFKELTE DPEIDLIVVNTPDNTHYEYAGMALEAGKNVVVEKPFTSTTKQGEELIALAKKKGLMLSVY QNRRWDADFLTVRDILAKSLLGRLVEYESTFARYRNFIKPNTWKETGESGGGLTYNLGSH LIDQAIQLFGMPEAVFADLGILREGGKVDDYFIIHLLHPSLAPNVKITLKASYLMREAEP RFALHGTLGSYVKYGVDKQEAALLAGEIPERPNWGEESEQEWGLLHTEINGKEICRKYPG IAGNYGGFYQNIYEHLCLGQPLETHAQDILNVIRIIEAAYQSHRDNKIVNLK >gi|226332018|gb|ACIB01000038.1| GENE 13 17328 - 18134 763 268 aa, chain - ## HITS:1 COG:aq_1386 KEGG:ns NR:ns ## COG: aq_1386 COG1752 # Protein_GI_number: 15606577 # Func_class: R General function prediction only # Function: Predicted esterase of the alpha-beta hydrolase superfamily # Organism: Aquifex aeolicus # 15 261 13 258 259 165 36.0 9e-41 MKMENSVLTGKPYNIGYALSGGFIKGFAHLGVIQALLEHDIKPDIISGVSAGALAGVFYA DGNEPYRVLDYFSGHKFQDLTKLVIPKVGLFALGEFIDFLKSNLKAQKLEDLKLPLIITA TDLDHGRSMHFHKGNIAERVAASCCMPVLFTPVKIGNTHYVDGGLLMNLPVSTIRNECEK VVAVNVSPLMAEKYKMNIVSIAMRSYHFMFRANTFPERDNCDLLIEPYNLEGYSNTELEK AEEIFEQGYNTASEVLDQLIEEKGKIWK >gi|226332018|gb|ACIB01000038.1| GENE 14 18304 - 19518 1022 404 aa, chain - ## HITS:1 COG:ECs0532 
KEGG:ns NR:ns ## COG: ECs0532 COG0477 # Protein_GI_number: 15829786 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli O157:H7 # 2 394 3 401 406 388 55.0 1e-108 MVQSQTQPIRRIAFPILIALSVSHCLNDLLQSVISAVYPLFKEDLSLSFAQIGLITLVYQ MSASVFQPLTGLIFDKRPIAWSLPIGMSFTLIGMLNLAFASNLNWLLASVFIIGIGSSVL HPEASRITFLASGGKRGLAQSLFQVGGNLGGSLGPLLVALLVAPYGRHHIALFAILALAA ICVMFPICRWYRSYLNHLKKRPIHAKAYIERPLPPQKTVFAITILMILIFSKYIYMASLN SYYTFYLIHKFNVSIQQSQLFLFVFLVATAIGTLMGGPIGDKVGRKYVIWGSILGTAPFS LLMPHAGLVWTIILSFCVGLMLSSAFPAILLYAQELLPNKLGLISGLFFGFAFGVAGIAS AVLGNMADKFGIDAVYNVCAFMPLLGLVTWFLPDLKKVRSEKQE >gi|226332018|gb|ACIB01000038.1| GENE 15 19737 - 21242 1200 501 aa, chain + ## HITS:1 COG:MA3382 KEGG:ns NR:ns ## COG: MA3382 COG0174 # Protein_GI_number: 20092196 # Func_class: E Amino acid transport and metabolism # Function: Glutamine synthetase # Organism: Methanosarcina acetivorans str.C2A # 1 499 1 504 506 570 54.0 1e-162 MMNQELLMSPNRLVTFLQKPAAEFTKADIINYIQQNEIRMVNFMYPAADGRLKTLNFVIN NASYLDAILTCGERVDGSSLFPFIEAGSSDLYVIPRFRTAFVDPFAEIPTLVMLCSFFNK DGEPLESSPEYTLHKACKAFTDVTGMEFQAMGELEYYVISEDDGLFPATDQRGYHESGPY AKFNDFRTQCMSYIAQTGGQIKYGHSEVGNFMLDGKVYEQNEIEFLPVNAENAADQLMIA KWVIRNLAYQYGYDITFAPKITVGKAGSGLHIHMRMMKDGQNQMLKDGVLSDTARKAIAG MMQLAPSITAFGNTNPTSYFRLVPHQEAPTNVCWGDRNRSVLVRVPLGWSAQTDMCALAN PLESDSNYDTTQKQTVEMRSPDGSADLYQLLAGLAVACRHGFEIENALAIAEQTYVNVNI HQKENADKLKALAQLPDSCAASADCLQKQRTVFEQYNVFSPAMINGIISRLRSYNDATLR KDIQDKPEEMLALVSKFFHCG >gi|226332018|gb|ACIB01000038.1| GENE 16 21328 - 21909 345 193 aa, chain + ## HITS:1 COG:all4541 KEGG:ns NR:ns ## COG: all4541 COG0664 # Protein_GI_number: 17232033 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Nostoc sp. 
PCC 7120 # 35 193 34 190 193 87 36.0 1e-17 MDEEVKGFNRYMSKVDFQPVTEFIFQNGQLTDYKKGEFFSRQNESCKMVGYVTEGSFRYC CTDSRGGSKIVGYTFDHSFVGNYPAFRLGDNSNVDIQAICNCSVYVINNRQLEEFYSRNE ANQKLGRQIAEILLWEVYERMISLYSMTPEERYTEILKRCPELLNLISLKELASYLMICP ETLSRLRRKLVQK >gi|226332018|gb|ACIB01000038.1| GENE 17 21982 - 22695 487 237 aa, chain + ## HITS:1 COG:CAP0051 KEGG:ns NR:ns ## COG: CAP0051 COG0300 # Protein_GI_number: 15004755 # Func_class: R General function prediction only # Function: Short-chain dehydrogenases of various substrate specificities # Organism: Clostridium acetobutylicum # 1 236 1 240 240 169 39.0 5e-42 MKKVIIIGATSGIGKGLAERFLREGNTVGITGRREDKLQEICSQNKNCFYSVSDVTKDTD TVRQLSNLVNRVGGMDILIFCSGIGELNPELDYLLEKPTLLTNVIGFTNVVDWAFHFFQK QEWGHLIVISSVGGMRGEGIAPAYNASKAYQINYTEGLRKKTAKLPYPIYITDVRPGFVD TAMAKGEGLFWITPLDKAVQQIYRAILRRRKVAYVSKRWKYVALLLRMIPASIYCKM >gi|226332018|gb|ACIB01000038.1| GENE 18 22840 - 23430 334 196 aa, chain + ## HITS:1 COG:no KEGG:BF2252 NR:ns ## KEGG: BF2252 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 196 1 196 196 352 100.0 4e-96 MSPRKLMFWLFACIFLVCALRAGLLTSADQYIYHLRNMHASTFAYRYDDFLPYLPIVAMF VLKLTGVKSRSNWKRMLVSTAFSYILMGTIVLTMKSLAGVLRPDGSDFLSFPSGHTATAF TAATLLYKEYGFKTPLAGIATFLPAVVTGFTRQLNNRHWLSDVLAGAIIGIMMVELAYFL TDRLLMKTGAQTCSKS >gi|226332018|gb|ACIB01000038.1| GENE 19 23511 - 23720 192 69 aa, chain - ## HITS:1 COG:no KEGG:BF2253 NR:ns ## KEGG: BF2253 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 69 1 69 69 127 100.0 2e-28 MELETIGENAGKVWRTLNEMRGEISIQELSRKINLSAEDVALAVGWLARENNIFIQRHNY LLYVSHDAF >gi|226332018|gb|ACIB01000038.1| GENE 20 24618 - 24788 251 56 aa, chain - ## HITS:1 COG:no KEGG:BF2349 NR:ns ## KEGG: BF2349 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 56 1 56 56 63 100.0 3e-09 MSRRRQLEHEVSLAQERIKKAPKDTPKEILKTWEQELVDLELELNNLVDDEEDNNE >gi|226332018|gb|ACIB01000038.1| GENE 21 24830 - 25699 802 289 aa, 
chain - ## HITS:1 COG:BS_sucD KEGG:ns NR:ns ## COG: BS_sucD COG0074 # Protein_GI_number: 16078673 # Func_class: C Energy production and conversion # Function: Succinyl-CoA synthetase, alpha subunit # Organism: Bacillus subtilis # 1 285 1 284 300 323 60.0 3e-88 MSILIDKSTRLLVQGITGRDGLFHAKKMAEYGTNVVGGTSPGKGGTMIDDTFPVFNTMHE AVRRTQANTSVIFVPARFAADAIMEAADAGIRLIICITEGIPTLDVIKAYRFVELKGAKL IGPNCPGLISPGESLVGILPGQVFTPGNIGVISRSGTLTYEIVSHLTAKGMGQSTAIGMG GDPVVGLYFRDLLGMLQNDPQTDAIVMIGEIGGNAEELAATYIREHVTKPVVAFIAGRSA PPGKQMGHAGAIISGSSGSATEKISALEAAGIRVAGEPSEIPDLLKGSF >gi|226332018|gb|ACIB01000038.1| GENE 22 25696 - 26844 1037 382 aa, chain - ## HITS:1 COG:SA1088 KEGG:ns NR:ns ## COG: SA1088 COG0045 # Protein_GI_number: 15926828 # Func_class: C Energy production and conversion # Function: Succinyl-CoA synthetase, beta subunit # Organism: Staphylococcus aureus N315 # 1 379 1 387 388 325 44.0 9e-89 MKVHEYQAKEIFSTYGIPVERHALCHTADGAVAAYHRMGVNRVAIKAQVLTGGRGKAGGV KLANNDRDVYQYAQTILEMTIKGYPVTKILLSEAVNIAAEYYISFTIDRNTRSVTLIMSA AGGMDIEEVARQSPEKIIRCSIDPLIGVPDYLAHKFAFSLFEQAEQANQMATIIQDLYKA FIEKDASLAEINPLVLTPVGTLLAIDAKMVFDDNALYRHPDLQKLSEPTEDEKLEAIAKE RGFSYVRMDGEIGCMVNGAGLAMTTMDMIKLYGGNPANFLDIGGSSNPVKVIEAMRLLLD DKKVKVVFINIFGGITRCDDVAIGLLQAFEQIQTDIPIIVRLTGTNGNMGRELLRKNNRF QVAQTMEEATKMAIESLKKESI >gi|226332018|gb|ACIB01000038.1| GENE 23 27050 - 27937 1032 295 aa, chain - ## HITS:1 COG:CAC3575 KEGG:ns NR:ns ## COG: CAC3575 COG0331 # Protein_GI_number: 15896809 # Func_class: I Lipid transport and metabolism # Function: (acyl-carrier-protein) S-malonyltransferase # Organism: Clostridium acetobutylicum # 3 294 5 297 308 242 45.0 5e-64 MKAFVFPGQGAQFVGMGKDLYETSALAKELFEKANDILGYRITDIMFNGTDEDLRQTKVT QPAVFLHSVISALCMGDDFKPEMTAGHSLGEFSALVAAGALSFEDGLKLVYARAMAMQKA CEATPSTMAAIIALPDEKVEEICASVTTEGEVCVPANYNCPGQIVISGSVPGIEKACELM KAAGAKRALPLKVGGAFHSPLMDPAKVELEAAINATEFHTPKCPVYQNVDALPHTDPQEI KKNLVAQLTASVRWTQTVKNMVADGATDFTECGPGAVLQGLIKKIDSTVSAHGIA >gi|226332018|gb|ACIB01000038.1| GENE 24 28041 - 28874 639 277 aa, chain - ## 
HITS:1 COG:CAC3095 KEGG:ns NR:ns ## COG: CAC3095 COG0351 # Protein_GI_number: 15896346 # Func_class: H Coenzyme transport and metabolism # Function: Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase # Organism: Clostridium acetobutylicum # 1 263 1 260 265 223 47.0 2e-58 MKTYPVVLSIAGSDCSGGAGIQADIKTISALGAYAASVITAVTVQNTRGVKAVHTVPAEI VQGQIEAVMEDLRPDALKIGMVSEPALVKIIAGCLLKYPHCPIVYDPVMVSTSGRKLMAK DAIQLIKEELFPLTSLITPNLDETEVLTGKKITTAEEMKEAARQLSEEYHTAVLVKGGHL EGNEMQDVLFTDGNAYIYKEKKIESRNLHGTGCTLSSSIATYLALGLPMDQAVGKAKSYV SKAIDAGKEIIIGHGNGPLCHFWGPEKARIWDDNKVE >gi|226332018|gb|ACIB01000038.1| GENE 25 29216 - 29947 666 243 aa, chain + ## HITS:1 COG:alr2484 KEGG:ns NR:ns ## COG: alr2484 COG1051 # Protein_GI_number: 17229976 # Func_class: F Nucleotide transport and metabolism # Function: ADP-ribose pyrophosphatase # Organism: Nostoc sp. PCC 7120 # 7 239 15 238 248 87 29.0 2e-17 MHDIPKQIPLANNHISVDCVVIGFDGEQLKVLLINRIGEENGKVYRDMKLPGSLIYMDED LDEAAQRVLFELTGIRNVNLMQFKAFGSKNRTSNPKDVHWLERAMQSKVERIVTIAYLSM VKIDRALDKNLDEFQACWVALKDIKTLAFDHNLIIKEALTYIRQFVEFNPSMLFDLLPRK FTASQLRILFELVYDKAVDVRNFHKKIALMDYVVPLEEKQTGVAHRAARYYKFDRKIYNK TRR >gi|226332018|gb|ACIB01000038.1| GENE 26 29999 - 30430 229 143 aa, chain + ## HITS:1 COG:CAC2612 KEGG:ns NR:ns ## COG: CAC2612 COG1070 # Protein_GI_number: 15895870 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar (pentulose and hexulose) kinases # Organism: Clostridium acetobutylicum # 2 90 3 87 500 72 40.0 3e-13 MFLLGYDISSSSVKASLVDAETGKCVASAFFPKTEAGIIAVRPGWAEQEPESWWENLKLS TRSILSESRVDAKDIKAIGISYQMHGLVCVWTNGNAHCVRPLSGAIHVRSPTGRGHSRQS VRSSVWHICLILPETLLLQNWHG >gi|226332018|gb|ACIB01000038.1| GENE 27 30262 - 31506 927 414 aa, chain + ## HITS:1 COG:CAC2612 KEGG:ns NR:ns ## COG: CAC2612 COG1070 # Protein_GI_number: 15895870 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar (pentulose and hexulose) kinases # Organism: Clostridium acetobutylicum # 3 396 87 487 500 143 30.0 6e-34 
MCVDKRQRTLRPAIIWCDSRAVSYGQRAFEAIGEKFCLAHLLNSPGNFTASKLAWVKENE PDIYEQIDKIMLPGDYIAMKLSGEVCTTIEGLSEGMFWDFRNNRPADFLMQYYGIDPSLI ADIRPTFAEQGRLTGTAARELGLQEGTPITYRAGDQPNNALSLNVFNPGEIASTAGTSGV VYGVNGEINYDPQSRVNTFAHVNHTATDPRLGVLLCINGTGILNSWIRRNVAPEGISYAE MNRFASSVPIGSAGISILPFGNGAERMLNNRATGCGIHGVDFNRHDKSHLIRAAQEGIVF SFKYGIDIMEEMGIPVKKIHAGHANMFLSSVFRETLAGTTGATIELYDTDGSVGAAKGAG MGAGIYKDHEEAFATLDKLTVVEPDAGKQQEYTDAYARWKQCLTQSMQTETENK >gi|226332018|gb|ACIB01000038.1| GENE 28 31596 - 32915 1594 439 aa, chain + ## HITS:1 COG:HI1112 KEGG:ns NR:ns ## COG: HI1112 COG2115 # Protein_GI_number: 16273037 # Func_class: G Carbohydrate transport and metabolism # Function: Xylose isomerase # Organism: Haemophilus influenzae # 6 438 4 439 439 456 51.0 1e-128 MATKEYFPGIGKIKFEGKDSKNPMAFRYYDAEKMINGRSMKDWLKFAMAWWHTLCAEGGD QFGGGTKQFPWNGDPDPVQAAKNKMDAGFEFMQKMGIGYYCFHDVDLVTEADSIEAYEAN LKELVAYAKQKQAETGIKLLWGTANVFSHARYMNGAATNPDFDVVARAAVQIKNAIDATI ELGGTNYVFWGGREGYMSLLNTDQKREKEHLAQMLTIARDYGRARGFKGTFLIEPKPMEP TKHQYDVDTETVIGFLKAHGLDQDFKVNIEVNHATLAGHTFEHELAVAVDNGMLGSIDAN RGDYQNGWDTDQFPIDNFELTQAMMQIIRNDGLGNGGTNFDAKTRRNSTDPEDIFIAHIA GMDAMARALESAANLLNESPYQKMLSDRYASFDAGKGKEFEEGKLSLEELVAYAKANGEP KQTSGQQELYEALVNIYSL >gi|226332018|gb|ACIB01000038.1| GENE 29 32981 - 34429 1319 482 aa, chain + ## HITS:1 COG:ECs5014 KEGG:ns NR:ns ## COG: ECs5014 COG0477 # Protein_GI_number: 15834268 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli O157:H7 # 11 480 9 479 491 430 49.0 1e-120 MNHTNEGSKLYLYSITSVAILGGLLFGYDTAVISGAEKGLEAFFLTATDFQYDKVMHGIT SSSALIGCVLGGALSGIFASRLGRRNSLRLAAVLFFLSALGSYYPEFLFFEYGKANMNLL ITFNLYRILGGIGVGLASAVCPMYIAEIAPSNIRGTLVSCNQFAIIFGMLVVYFVNYLIL GDHQNPVILKDAAGTLSVSSESDMWTVTEGWRYMFGSEAFPAAFFGMLLFFVPKTPRYLV MIDQDQKAYSILKKVNGATKAQEILAEIKATSQEKTEKLFTYGAAVIVIGILLSVFQQAI 
GINAVLYYAPRIFENAGAEGGGMMQTVIMGIVNIVFTLIAIFTVDRFGRKPLLIIGSIGM AVGAFAVALCDSMGIKGILPVLSVIVYAAFFMMSWGPICWVLISEIFPNTIRGKAVAIAV AFQWIFNYIVSSTFPALYDFSPMFAYSLYGIICVIAALFVWRWVPETKGKTLEDMSKLWK KR >gi|226332018|gb|ACIB01000038.1| GENE 30 34984 - 36192 1151 402 aa, chain - ## HITS:1 COG:CC1277 KEGG:ns NR:ns ## COG: CC1277 COG1524 # Protein_GI_number: 16125526 # Func_class: R General function prediction only # Function: Uncharacterized proteins of the AP superfamily # Organism: Caulobacter vibrioides # 19 398 65 451 451 223 34.0 6e-58 MRKFIISFCCYVFFIFTLAAQDKAPHYTVIVSLDAFRWDYPAMYDTPNLNQMAREGVKAT MLPSYPASTFPNHYTLATGLVPDHNGIINNTFWDVKRRRQYSMGDPATRNNPDYYLGEPI WITAQKQGVKTGNVYWVGSDIAIKGGYPTYYREYAEKPRLTFEQRVDSTIALLEKPEAER PRLVMLYFEEPDGVTHHHGPRSVEAAAIIHRMDSLVGMLRQGIASLPFGKDVNLIVTADH GMTEISDDRVVDMNKYLRPEWCEAVDGRTPTSIFTKPEYRDSVYNALKDVPHIHVWKKEE IPAELNYGSSDRIGDIVVAPELGWQFTDVPRALKGAHGYFPQSPDMQVMFRACGPDFKAG YESKGFVNVDIYPLLAHLLKITPEKTDGQFERIKDILKDVSF >gi|226332018|gb|ACIB01000038.1| GENE 31 36298 - 37836 1288 512 aa, chain + ## HITS:1 COG:BH1089_2 KEGG:ns NR:ns ## COG: BH1089_2 COG0388 # Protein_GI_number: 15613652 # Func_class: R General function prediction only # Function: Predicted amidohydrolase # Organism: Bacillus halodurans # 201 511 3 313 313 327 49.0 4e-89 MDYPHKINKVQIRNLQIEDYAQLSQSFTRVYSDGSDVFWTHEQIEKLIKIFPEGQIVTVV DEKIVGCALSIIVEYDKVKNDHTYAQVTGKETFNTHSPQGNILYGIEVFIHPEYRGLRLA RRMYEYRKELCETLNLKAIMFGGRIPNYHKYADKMRPKEYIDRVRQREIYDPVLTFQLSN DFHVRKVMTNYLPNDEESKHYACLLQWDNIYYQPPTQEYLAPKTTVRVGLVQWQMRSYKT LDDLFEQVEFFVDAVSDYKSDFVLFPEYFNAPLMSKYNDKGESQAIRGLAQYTEEIRDRF INLAISYNINIITGSMPLIKEDGLLYNAGFLCRRDGTYEMYEKLHVTPDEIKSWGLSGGK QLKTFDTDCAKIGILICYDVEFPELSRLMADQGMQILFVPFLTDTQNAYSRVRVCAQARA IENECFVVIAGSVGNLPRVHNMDIQYAQSGVFTPCDFAFPTDGKRAEATPNTEMILVSDV DLDLLNELHTYGSVRNLKDRRNDVYEVRFKKP >gi|226332018|gb|ACIB01000038.1| GENE 32 37955 - 38587 661 210 aa, chain + ## HITS:1 COG:STM0402 KEGG:ns NR:ns ## COG: STM0402 COG0450 # Protein_GI_number: 16763782 # Func_class: O Posttranslational modification, protein 
turnover, chaperones # Function: Peroxiredoxin # Organism: Salmonella typhimurium LT2 # 4 210 3 196 200 224 53.0 6e-59 MRSLIGKQAPKFDATAVINGHEIVQNFSLDQYKGKKYVVFFFYPMDFTFVCPTELHAFQE KLEEFEKRDVAVVGCSVDSEYSHFSWLQMPKNEGGIQGVKYPIVSDFSKSISESYGVLAG SYAPDENGNWVCEGTPVAFRGLFLIDKEGVVRHCVINDLPLGRNVDEVLRMVDALQHFEE YGEVCPANWSKGKDAMKATEDGVANYLSKH >gi|226332018|gb|ACIB01000038.1| GENE 33 38798 - 39043 311 81 aa, chain + ## HITS:1 COG:asl4022 KEGG:ns NR:ns ## COG: asl4022 COG0724 # Protein_GI_number: 17231514 # Func_class: R General function prediction only # Function: RNA-binding proteins (RRM domain) # Organism: Nostoc sp. PCC 7120 # 1 80 1 80 94 80 53.0 7e-16 MNMYIGNLSYRVKEADLRQVMEEYGTVDSVKLIIDRETRKSKGFAFVEMPNDDEAKNVIS ELNGAEYEGRQMVVKEALPRN >gi|226332018|gb|ACIB01000038.1| GENE 34 39116 - 40507 1361 463 aa, chain - ## HITS:1 COG:PM0939 KEGG:ns NR:ns ## COG: PM0939 COG1858 # Protein_GI_number: 15602804 # Func_class: P Inorganic ion transport and metabolism # Function: Cytochrome c peroxidase # Organism: Pasteurella multocida # 8 459 15 468 468 444 48.0 1e-124 MKKSTKFIIALLVTVGALAITYRVVNQAPSKDLAADAQMQEIITSGGCLQCHSGSPDLPF YANWPVASGMVQKDVTQGYRAFDMTEMAEALKAGKPVGKVALAKVEKVIMDGTMPKHAYY MVHWGSSVTDAKKEMAMAWVKQHRLAHYANGLAAAEFANEPIRPIADSIPVDMRKVILGD MLYHDTRLSADNTVSCASCHGLNTGGVDNKQYSEGVGGQFGGVNAPTVYNAAYNFVQFWD GRAGTLAEQAAGPPLNPVEMACQSFDEIIAKLEQDANFTKAFLAVYPDGYSEQNITNAIE EFEKTLLTPNSRFDLYLKGEKTAINDIELAGYELFKKYDCATCHVGETLGGQSYELMGVK RDYFADRGIELTEEDNGRFKQTRNERDKHRFKVPGLRNIALTAPYFHDGSMKTMKEAVDY MAKYQMDLNLSEDELNKIVAFLETLTGEYKGKPLTNDNQTKAL >gi|226332018|gb|ACIB01000038.1| GENE 35 40523 - 41308 353 261 aa, chain - ## HITS:1 COG:RSc2985 KEGG:ns NR:ns ## COG: RSc2985 COG0755 # Protein_GI_number: 17547704 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ABC-type transport system involved in cytochrome c biogenesis, permease component # Organism: Ralstonia solanacearum # 116 253 244 391 395 90 35.0 3e-18 MITWDNFYLFAVASICLWLTGAIFALRSSVRSRMAVVLTIAGITCLGIFIAGLWISLQRP 
PLRTMGETRLWYSFFMGIAGLLTYIRWKYRWILSFSTLLSTVFVIINLLKPEIHDQSLMP ALQSIWFIPHVTVYMFSYSVLGCAFIIALCGLVHHKEEYLVTADNLVYSGVAFLSIGMLL GSLWAKEAWGNYWSWDPKETWAVVTWMGYLLYIHLRLRRKFRKKMLYVILIFSFLALQMC WYGVNYLPSAQQSVHLYNRNN >gi|226332018|gb|ACIB01000038.1| GENE 36 41292 - 42179 534 295 aa, chain - ## HITS:1 COG:no KEGG:BF2364 NR:ns ## KEGG: BF2364 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 295 1 295 295 570 98.0 1e-161 MHKKLLVTAYFVIAALLQTLAGNFPLSFFAFPLNVIVAVIWIYSLWRLYKEGNKLPLTRF LLSSRTSVLSILLLIGGSLVIGLFPQLSEAEADSTPGVLASLGCYNFMISWIFIAILFLL LSNLAMVIIHAFYHCVPAKKRFILNHLGLWLALFAGFFGSSDVQTLRIPLYAGQPGREAY SMDGKAYYLDYELELYSFNTEYYPNGMPSRFAADVRIGNRRTTLEVNHPHSYRLGEDIYL TGYDTHNMGNTQYCILQIVRQPWKYVMVVGILMILTGAVLLFINGPKKLKHDNLG >gi|226332018|gb|ACIB01000038.1| GENE 37 42439 - 43704 559 421 aa, chain + ## HITS:1 COG:no KEGG:BF2365 NR:ns ## KEGG: BF2365 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 421 1 421 421 819 98.0 0 MKTPSLILMTIILCNLSIPINAQILTSRQQKEDFDTLYSLLHQVHPDLFLYQTQKEFEKK HDSIYSSLNKERNLSDFYFIVSPFVASVKDGHTNFTIPATQDRIDYLNNGGLTLPLRLKI VENKILVDFPLISCSIQENDEIICMNNINSQTILSQLYLLLGAEKGNAIKENQLTSYLST LLWYKYNWGEKYDFTIKRGKKIWKESLNGISQADAFPVLKARLGKSLPQFVYTLSSDKQT ATLQIMNLYQLPQLKQFCDSVFSVINREHVPNLVIDVRNNKGGSSAGVDMLLSYLSHDAY TLYIKTDLKISSYSKRYNEQKHPETYEEIKNLPDGSLFAIRDSFVEGNRDKADIYKGSVT VLVNESTYSGASTFASAIKKSHAGKVLGETGCPTVYFGNYMSFTLPNSRLEYYISLNKFY E >gi|226332018|gb|ACIB01000038.1| GENE 38 43839 - 44888 674 349 aa, chain - ## HITS:1 COG:no KEGG:BF2274 NR:ns ## KEGG: BF2274 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 349 1 349 349 602 99.0 1e-171 MNYGISVLFRAIPLAMALFCFGYGAFISAYGDDSNRLVAGPVVFSLGMICIALFATAATI IRQIIHTYGRGSLYALPIIGYLAAVVTIIGGICMFTRSTSTSSFVAGHVVAGVGLITTCV ATAATSSTRFSLIPANSKMIGNGIPEGAFTKGQERILKTIAITISLIAWIWAFVLLAKSD VHPAYFVAGHVMVGLACICTSLIALVATIARQIRNVYTDRERKRWPKLVLLMGTVSLLWG 
LFVIFSDSSTTNGVIGYIMIGLGLVCYSISSKVILLAKIWGREFALANRIPLIPVLTALA CLFLASFVFELGTTHDDYFIPARVLAGLGAICFTLFSIVSILESGTSSK >gi|226332018|gb|ACIB01000038.1| GENE 39 45113 - 48538 3489 1141 aa, chain + ## HITS:1 COG:CAC3038 KEGG:ns NR:ns ## COG: CAC3038 COG0060 # Protein_GI_number: 15896289 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Isoleucyl-tRNA synthetase # Organism: Clostridium acetobutylicum # 8 1136 2 1031 1035 852 40.0 0 MSKKFAEYSQFDLSKVNKDVLKKWDENQVFAKSMTEREGCPSFVFFEGPPSANGMPGIHH VMARSIKDIFCRYKTMKGYQVKRKAGWDTHGLPVELGVEKSLGITKEDIGKTISVAEYNA HCRQDVMKFTKEWEDLTHKMGYWVDMKHPYITYDNRYIETLWWLLKQLYKKGLLYKGYTI QPYSPAAGTGLSSHELNQPGCYRDVKDTTVVAQFKMKNPKPEMAQWGTPYFLAWTTTPWT LPSNTALCVGPKIDYVAVQSYNAYTGQPITVVLAKALLNAHFNPKAAELKLEDYKAGDKL VPFKVIAEYKGPDLVGMEYEQLIPWVNPGEGAFRVILGDYVTTEDGTGIVHIAPTFGADD AQVAKAAGIPPLQLVNKKGELRPMVDLTGKFYTLDELDEDFIKQRVNVDLYKEYAGRFVK NAYDPNLSDQDESLDVSICMMMKVNNQAFKIEKHVHNYPHCWRTDKPVLYYPLDSWFIRS TACKERMIELNKTINWKPESTGTGRFGKWLENLNDWNLSRSRYWGTPLPIWRTEDNSDEK CIESVEELYNEIEKSVAAGYMQSNPYKDKGFVPGEYNEENYNKIDLHRPYVDDIILVSKD GKPMKREADLIDVWFDSGAMPYAQIHYPFENKELLDSHQVYPADFIAEGVDQTRGWFFTL HAIATMVFDSVSYKAVISNGLVLDKNGNKMSKRLGNAVDPFSTIEQYGSDPLRWYMITNS SPWDNLKFDVDGIEEVRRKFFGTLYNTYSFFALYANVDGFEYKEADLPMNERPEIDRWIL SVLNTLVKEVDTCYNEYEPTKAGRLISDFVNDNLSNWYVRLNRKRFWGGGFTQDKLSAYQ TLYTCLETVAKLMAPIAPFYADRLYSDLIGVTGRDNVVSVHLAKFPEYNEKMVDKELEAQ MQMAQDVTSMVLALRRKVNIKVRQPLQCIMIPVVDEVQKAHIEAVKALIMSEVNVKEIKF VDGAAGVLVKKVKCDFKKLGPKFGKQMKAVAAAVAEMSQEAIAELEKNGKYTFDLGGAEA VIESADVEIFSEDIPGWLVANEGKLTVALEVTVTDELRREGIARELVNRIQNIRKSSGFE ITDKIKLTLSKNPQTDDAVNEYNSYICNQVLGTSLTLADEVKDGTELNFDDFSLFVNVVK E >gi|226332018|gb|ACIB01000038.1| GENE 40 48573 - 48953 578 126 aa, chain + ## HITS:1 COG:no KEGG:BF2276 NR:ns ## KEGG: BF2276 # Name: not_defined # Def: putative DnaK suppressor protein # Organism: B.fragilis # Pathway: not_defined # 1 126 1 126 126 212 100.0 3e-54 MAEKTRYSDAELEEFRAIINEKLELAQRDYEQLKLSLMGLDGNDTDDTSPTYKVLEEGAN TLSKEETTRLAQRQLKFIQGLQAALVRIENKTYGICRETGKLIPAERLRAVPHATLSIEA 
KNSGKK >gi|226332018|gb|ACIB01000038.1| GENE 41 49071 - 49703 414 210 aa, chain + ## HITS:1 COG:no KEGG:BF2277 NR:ns ## KEGG: BF2277 # Name: not_defined # Def: lipoprotein signal peptidase # Organism: B.fragilis # Pathway: Protein export [PATH:bfr03060] # 1 210 1 210 210 382 100.0 1e-105 MKKLLTKGQIAILVIFSVLIIDQVIKIWIKTHMYWHESIRITDWFYIYFTENNGMAFGME LFGKLFLTTFRIVAVGLIGWYLYKIVKRGLKTGYIICVSLILTGALGNIIDSVFYGVIFN ESTHSQIASFMPDGGGYSTWFYGKVVDMFYFPIIDTNWPTWMPFVGGEHFIFFSPIFNFA DAAISCGIIALLLFYSKYLNDSYHHSVTKK >gi|226332018|gb|ACIB01000038.1| GENE 42 49714 - 50760 477 348 aa, chain + ## HITS:1 COG:no KEGG:BF2278 NR:ns ## KEGG: BF2278 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 348 1 348 348 665 100.0 0 MKKFFRFQLCCICLLVLIVSACKVKRPDSVISESEMENLLYDYHIAKAMGENMPGGENYK KALYVEAVFKKYGTTEEVFDSSMVWYTRNTKILSEIYEKVNKRLKAQQNAINHLIALRDN KPKMSAPGDSIDVWAWQRIAQLTEAPLNNKFTFTLPSDTNFKKRDVLLWKMQYNFLSEIP DSTMAPIMAMQIVYENDTVTHSCVKHIFKSGIQNIRLQSDTMNIKEIKGFIFCPLSEESI TLLVSDISLTRYHANDSITQIGRDSLKTDSIKEKSKDDSIQKKTPKDTIQASSPHQRTNP NDLNRPNNDVRPIKPEQREKEMQIEKEKQQLERQQRTNPRRPLRRQNN >gi|226332018|gb|ACIB01000038.1| GENE 43 50769 - 51098 201 109 aa, chain + ## HITS:1 COG:no KEGG:BF2279 NR:ns ## KEGG: BF2279 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 212 100.0 3e-54 MKRFAAHYLFVPGSGFLKQYAIEIEGGYICHIFPFSEEIESVEWFPGVILLTPQEESDIN TLFNFTNIEKQSIYIPKVTIDMKWRAYLLYPFNFVTMQPVAETLHRQLQ >gi|226332018|gb|ACIB01000038.1| GENE 44 51052 - 51813 553 253 aa, chain - ## HITS:1 COG:VC0803 KEGG:ns NR:ns ## COG: VC0803 COG0566 # Protein_GI_number: 15640821 # Func_class: J Translation, ribosomal structure and biogenesis # Function: rRNA methylases # Organism: Vibrio cholerae # 2 250 13 255 257 175 39.0 7e-44 MSILSKNRIKYIRSLELKKIRKEEKVFLAEGPKLVGDVLGYFPCKLLIATSDWLEEHPAV QAAEVIEVTSEELSRTSLLKTPQQVLALFEQPEYEIDMEAIRNSLCLALDDIQDPGNLGT IIRLADWFGIEHIFCSPNTVDVFNPKTIQATMGGIARVKVYYTALPDLMHSLGNVPVYGT 
LLDGENMYEQPLSKNGIIIMGNEGNGISPEIEKLVNRKLYIPNYPAERETSESLNVAIAT AIVCAEFRRQAAL >gi|226332018|gb|ACIB01000038.1| GENE 45 52184 - 52696 369 170 aa, chain + ## HITS:1 COG:no KEGG:BF2281 NR:ns ## KEGG: BF2281 # Name: not_defined # Def: TonB # Organism: B.fragilis # Pathway: not_defined # 1 170 1 170 170 320 100.0 8e-87 MLNEKRTQRIMKSKFLIFLSAVAMLLLFSNCGSKTTSNDQATTEVKDTVTSKEEAVPDSV SILGDQVYDIVNTAPEFPGGMKACLEFLYKNITYPAQAIESKQEGQVVIQFVVTKNGKII DPKVVKSVSPSLDAEAIRIINLMPDWTPGKQKNGQEVNSRFTLPVRFTLK >gi|226332018|gb|ACIB01000038.1| GENE 46 52818 - 55130 1821 770 aa, chain + ## HITS:1 COG:no KEGG:BF2374 NR:ns ## KEGG: BF2374 # Name: not_defined # Def: putative surface membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 770 1 770 770 1464 99.0 0 MKKGILYTILLYLALSLASCSATKFVPDGSYLLDEVKIHTDNKEIKPSDMRLYVRQNPNS KWFSTIKTQLYVYNWSGRDSTKWFNRFLRKIGDAPVIYNESDAIRSQEEIAKAVQNLGYM GASVKRTTKTKKKKLKLFYEITSGKPYIVRTLKYDISDKKIAEYLRNDSTQSMLREGMLF DVNVLDAERQRITDYLLCNGYYKFNKDYITYTADTARNTHQVDLTLHLLPYKTYVGDTPK EHFQYKINKINFITDYDVLQSSALSSIEINDSLHYNGFPIYYKDKLYLRPKVLVDNLRFA SGDLYDERNVQKTYTYFGRLSALKYTNIRFFETQNGDSTQLNCYVMLTKSKHKSISFELE GTNSAGDLGAAASVSFQHRNLFRGSETFMVKFRGAYEAISGLQPGYKNHNYTEYGVETSI NFPNFLFPFLTSDFKRRIKATTEFGLQYNYQLRPEFSRTIASASWSYKWIQKQKIQHRID LLDISYLYLPWISSQFQEDYINKDKDNYILKYNYENRLIVRMGYNYSYNSAGGALVNNTI TTNSYSIRAGFESAGNILYGISKMINMRKNKDGEYAILGIPYAQYLKGDFDFAKNIIIDH RNSLAFHAGIGIAVPYGNAKVVPFEKRYFSGGANSVRGWSVRNLGPGSFAGDGNFMNQSG DIKLDASIEYRTRLFWKFRGAAFIDAGNIWTIREYENQPGGVFEFDKFYKQIAVAYGLGL RLDLDFFVLRFDGGMKAINPKYKKAKERYPIIHPRFSRDFAFHFAVGYPF >gi|226332018|gb|ACIB01000038.1| GENE 47 55219 - 55773 616 184 aa, chain - ## HITS:1 COG:no KEGG:BF2283 NR:ns ## KEGG: BF2283 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 184 1 184 184 339 100.0 3e-92 MKKNRLTLVAAIFLSGTILFSSCVGSFGLFNRISSWNQSIGTKFVNELVFLALNIVPVYG VAYLADALVINSIEFWSGTNPMANVGDVKKVKGENGDYLVKTLENGYSITKEGEDSAMEL IYNKEANTWNVVADGVSTELLKMNNDGTAEMNLPNGDKMNVTLDAQGMMAARQATMGGLL FAAR 
>gi|226332018|gb|ACIB01000038.1| GENE 48 55895 - 56332 334 145 aa, chain - ## HITS:1 COG:no KEGG:BF2284 NR:ns ## KEGG: BF2284 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 145 1 145 145 243 99.0 2e-63 MIYKFLFPLKPDSAGASLFLLILRISFGLLLMNHGIQKWSNFQELSTSFPDPLGLGSPLS LGLAVFAELACSMAFIIGFLYRLAMIPMIFTMVIAFFVIHANDIFAMKELALVYLIIFVL MYISGPGKYSVDYVIGRQLKNKRKL >gi|226332018|gb|ACIB01000038.1| GENE 49 56725 - 56823 118 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MDYLFFYLFFLFIIWTFIVKSIHDAPEMEDIE >gi|226332018|gb|ACIB01000038.1| GENE 50 56881 - 57336 394 151 aa, chain - ## HITS:1 COG:no KEGG:BF2286 NR:ns ## KEGG: BF2286 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 151 25 175 175 287 100.0 6e-77 MTLIKEDISGRKLVVIRDQAFDSEATVEIYSREVTIKTAWSRYTYRLFVLGDCVWCEYNG AYRGLLEQKLLPSITPKESLLDSEVLDSSLYGHEKKKLREYAEDNLKLKKFRRENFNENR TGVAPFDHPKKVYDEFIKEDYIAPSSKENNK >gi|226332018|gb|ACIB01000038.1| GENE 51 58337 - 58492 78 51 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MPNRLKYYNINVIKLMWRIISIVDLIKYYIHICWIIIVYVNYSIESIIKPI >gi|226332018|gb|ACIB01000038.1| GENE 52 58592 - 59734 751 380 aa, chain + ## HITS:1 COG:XF0975 KEGG:ns NR:ns ## COG: XF0975 COG3746 # Protein_GI_number: 15837577 # Func_class: P Inorganic ion transport and metabolism # Function: Phosphate-selective porin # Organism: Xylella fastidiosa 9a5c # 101 352 112 364 389 78 23.0 2e-14 MNLYLKHTLFYLLGISYALISSAQSNPDKLQCKVTGRMLLDGGVYLKNDNNFGNGVEFSD LRIGAKVAYQNWDMKLEIGYTGNKATIKDAFATYTYKNHSIQVGQFYEPFSLEMMCSTFD IRFNQSPGAVLALTNGRRMGITYSYRNKRHYMSGGAFMDNEVNNLKKASHGYALDGRVVY RPVLDSKKLIHIGFAANYRTPNESLNEEDKNIFIYKSPGVSTIDNRNIAMATIDHAAYQI KFGTELLVYYHRFCLQSEYIRTHVERDNAFKNYVAQGAYLQCSWLLSGETYLYDESIACA GRPEGKSLEVCSRFNYLTLNDEDAAIWGGEQKDISIGLNYYINKYIGIKLNYSYLMPGAN IKEISRKNFSVFQGRFQFIF >gi|226332018|gb|ACIB01000038.1| GENE 53 59879 - 61090 715 403 aa, chain - ## HITS:1 COG:RSp0310 KEGG:ns NR:ns ## COG: RSp0310 COG0477 # Protein_GI_number: 17548531 # Func_class: G 
Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Ralstonia solanacearum # 4 401 45 441 450 179 32.0 8e-45 MNNWKKKFIIIWTGQLFSILSSSIAQFSIVLWISLKTGSAEVLSFATIAALLPQALLGPF AGVFVDRWNRKWTMIGADSFVALCSGVIALLFYLDIIELWHIYLLLMLRSVGGAFHTPAM KSSVPLLAPEKELMRIAGINQAIQSICNIGGPALGAILLLAFDMSLVMLLDVLGAIIACT ALLFVYIPNPKQENTSAKNVLYDMRDGFNVIMRNKGVSWVMVTEVLVTFFVMPMVALMPL MTLKNFSGTAYQVSLIETLFGAGMLAGGALLGVWNPKIRKTLLIAISYFLLGAALAFCGI LPADGFVLFAALTVAQGIVVPFFSGPFTSLLQTQFKPAYLGRVFSLFDSVSLLPSIIGLF ITGFIADSLGIANIFICCGIAIVFTSILMMCIPAVRDLEKQSK >gi|226332018|gb|ACIB01000038.1| GENE 54 61507 - 62310 778 267 aa, chain - ## HITS:1 COG:CAC3538 KEGG:ns NR:ns ## COG: CAC3538 COG1235 # Protein_GI_number: 15896774 # Func_class: R General function prediction only # Function: Metal-dependent hydrolases of the beta-lactamase superfamily I # Organism: Clostridium acetobutylicum # 3 266 1 261 261 175 37.0 9e-44 MKIRFISLASGSSGNCYYLGTEKYGILIDAGIGIRTIKKSLKDINVTMDSIRAVFITHDH ADHIKAVGHLGEKLNIPVYTTARVHAGINKSYCMTEKLHGSVRYLEKEEPMQLEDFRIES FEVPHDGTDNVGYCIEIDGKVFSFLTDLGEITPTAARYICKAHYLIIEANYDEEMLRMGP YPTYLKERISSKTGHMSNIDTANFLAENIMEHLRYIWLCHLSKDNNHPELAYKTVEWKLK SKGIIVGKDVQLLALKRNTPSELYEFE >gi|226332018|gb|ACIB01000038.1| GENE 55 62389 - 63777 810 462 aa, chain - ## HITS:1 COG:no KEGG:BF2381 NR:ns ## KEGG: BF2381 # Name: not_defined # Def: putative transport-related membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 462 1 462 462 825 100.0 0 MNNSPQPAAKGFTRAFYVSNTVELFERMAYYAVFIVLTIYLSTILGFNDFEASMISGLFS GGLYLLPIFTGAYADKIGFRKSMLVAFSLLTAGYFGLGVLPTLLESTGLVSYGASTHFSG LTDSVFRWLIVPVLFIIMIGGSFIKSVISASVAKETTEATRARGYSIFYMMVNIGAFTGK TVIDPLRNMIGDQAYIYINYFSGFMTLIALLAVFFLYKSTHTVGEGKSMREIGQGFLRIV TNWRLLILILIITGFWMVQHQLYATMPKYVIRMAGETAKPGWIANVNPFVVVCCVSFVTR WMAKRSAITSMNIGMFLIPVSALLMACGNLLDNEVVSGMSNITLMMIAGIVVQGLAECFI 
SPRYLEYFSLQAPKGEEGMYLGFSHLHSFLSSIFGFGLAGVLLTKYCPDPTLFESREAWE VASVNAHYIWYYFAAIGLVAAIALLIFAKITDFIDKKKKTNV >gi|226332018|gb|ACIB01000038.1| GENE 56 63927 - 65333 956 468 aa, chain - ## HITS:1 COG:uxaC KEGG:ns NR:ns ## COG: uxaC COG1904 # Protein_GI_number: 16130987 # Func_class: G Carbohydrate transport and metabolism # Function: Glucuronate isomerase # Organism: Escherichia coli K12 # 1 466 1 465 470 537 55.0 1e-152 MKNFMDKNFLLQTETAQELYHNHAAKMPIIDYHCHLNPQMVADDYRFKSLTEIWLGGDHY KWRAMRSNGVDECFCTGKETSDWEKFEKWAETVPYTFRNPLYHWTHLELKTAFGIDKVLN PKTAREIYDECNEKLSSQEYSARGMMRRYHVETVCTTDDPIDSLEYHIRTRESGFEIKML PTWRPDKVMAVEVPSDFRTYIEKLSEISEITISDYNDMILALRKRHDYFAEQGCKLSDHG IEEFYAEDYTEGEIKTIFNKIYGGSELTKEEVLKFKSAMLIVLGEMDWEKGWTQQFHYGA IRNNNSRMFKLLGPDTGFDSIGEFATAKAMSKFLDRLNSKGKLTKTILYNLNPCANEVIA TMIGNFQDGSIPGKIQFGSGWWFLDQKDGMERQLNALSLLGLLSRFVGMLTDSRSFLSYP RHEYFRRTLCNLLGCDVENGEIPLSEMERVCQMVEDISYFNAKNFFHF >gi|226332018|gb|ACIB01000038.1| GENE 57 65725 - 66258 493 177 aa, chain - ## HITS:1 COG:no KEGG:BF2383 NR:ns ## KEGG: BF2383 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 177 1 177 177 350 100.0 1e-95 MKRILCPKCENYLSFDETKYSEGQSLVFVCEHCGKQFSIRLGKSKMKAPRKEEKLDEDVY KEEFGCIVVIENVFGFKQVLPLQEGDNIIGRRCVGTDINTPIETGDMSMDRRHCIINVKR NRQGELVYTLRDAPSLTGTFLMNEILGDKDRIRIDDGAIITIGATTLILRAAKKEEI >gi|226332018|gb|ACIB01000038.1| GENE 58 66269 - 67330 850 353 aa, chain - ## HITS:1 COG:no KEGG:BF2295 NR:ns ## KEGG: BF2295 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 353 1 353 353 672 99.0 0 MIELAQHIEVLLLENDCVIVPGFGGFIAHYAPAMRVAEENLFLPPTRTIGFNPQLTLNDG VLVQSYMAVYDTNFSDATKMVEKEVAELISALHEDGKTDLPNIGEIRYTIHNTYEFVPYD NKITTPYLYGLDSFEMKELSALRRPEKEQILPTVLKKKTSYEFRANWAFLRNAVAMIAAV ALFFFMSTPVENTYIEKGNYARLLPTDLFEKIEKQSVAMTPVMLKSVDAIPQTKPATAKK KSSTVRKVSVVKPVAVKEVKVNQPEKTMKATETKVVEKTFPYHIIIASVANTKDAEAMAG ELKAKGYTGAKVLTGDGKIRVSIMSCADREDANRQLLKLRENEAYKNAWMLAK >gi|226332018|gb|ACIB01000038.1| GENE 59 67489 - 68037 
609 182 aa, chain + ## HITS:1 COG:MA3780 KEGG:ns NR:ns ## COG: MA3780 COG1898 # Protein_GI_number: 20092576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Methanosarcina acetivorans str.C2A # 1 179 1 180 183 203 56.0 2e-52 MNYIQTEIDGVWIIEPKIFFDPRGYFMEAFKQQEFDATIGQINFIQDNESQSSFGTLRGL HYQKGTYSQAKLVRVIKGEVLDVAVDLRKSSPTFGKHISVLLSDENKRQLFIPRGFAHGF LVKSEIAIFTYKVDNIYAPQSEASILYNDPALAIDWPIADSQLVMSEKDKQAGAFREAEY FE >gi|226332018|gb|ACIB01000038.1| GENE 60 68061 - 69374 1278 437 aa, chain + ## HITS:1 COG:XF1606 KEGG:ns NR:ns ## COG: XF1606 COG1004 # Protein_GI_number: 15838207 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted UDP-glucose 6-dehydrogenase # Organism: Xylella fastidiosa 9a5c # 1 437 1 444 450 514 57.0 1e-145 MKIAIVGTGYVGLVTGTCFAEIGVDVTCVDTNSEKIEALKKGIIPIYENGLEEMVIRNTK AGRLKFTTSLESCLDDVEVVFSAVGTPPDEDGSADLSYVLAVARTIGQNMKKYKLVVTKS TVPVGTACKVRNAIQEELDKRGAKIEFDVASNPEFLKEGNAVNDFMSPDRVVIGVESERA EKLMTKLYKPFMLNNFRVIFMDIPSAEMTKYAANSMLATRISFMNDIANLCELVGADVNM VRSGIGSDTRIGRKFLYPGIGYGGSCFPKDVKALIKTAEQNGYQMRVLQAVEEVNENQKS LLFDKLVKQYNGNLEGKTVALWGLAFKPETDDMREAPALVLIDKLLKAGCKVRAYDPAAA NECKRRIGETIYYARDMYDAVLDADALMLVTEWKEFRLPSWAVVKKTMSQQVVMDGRNIY DKKEMEEQGFIYHCIGK >gi|226332018|gb|ACIB01000038.1| GENE 61 69516 - 70319 519 267 aa, chain + ## HITS:1 COG:no KEGG:BF2298 NR:ns ## KEGG: BF2298 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 267 1 267 267 520 100.0 1e-146 MIKYIATLLLTVLFVACNNGKGQQPSEENEDPKAKEILQGIWLDDETETPLMRIIGDTIY YSDAQSAPVYFKILKDTLYTYGKDVTHYQIDKQSEYSFWFHSLADNIIKLHKSEDPNDTL AFSFKSVEIIPTYTEVTKKDSVVMFDGVRYRAYVYINPSQMKVVKTTYSEDGISMDNIYY DNVMHICVYEGKKSLYAKDITKQMFVDVIPTDFLQQAILSDMNFTGIDRKGYHYQALVCI PESPVCNLVNLTISFDGKLNITAAKYK >gi|226332018|gb|ACIB01000038.1| GENE 62 70390 - 71646 927 418 aa, chain - ## HITS:1 COG:CAC3010 KEGG:ns NR:ns ## COG: CAC3010 COG0513 # Protein_GI_number: 15896262 # Func_class: L 
Replication, recombination and repair; K Transcription; J Translation, ribosomal structure and biogenesis # Function: Superfamily II DNA and RNA helicases # Organism: Clostridium acetobutylicum # 2 373 5 374 528 287 42.0 2e-77 MKFSELQLNDNVLEALDAMRFEECTPIQEQAIPVILEGRDLIAVAQTGTGKTAAFLLPIL NKLSEGGHPEDAINCVIMSPTRELAQQIDQQMEGFSYFMPVSSVAVYGGNDGILFEQQKK GLMLGADVVIATPGRLIAHLSLGYVDLSRVSYFILDEADRMLDMGFYEDIMQIVKYLPKE RQTIMFSATMPAKIQQLANTILNNPAEVKLAVSKPAEKIVQAAYVCYENQKLGIVRSLFA EEVPERVIIFASSKIKVKEVAKALKMMKLNVGEMHSDLEQVQREFIMHEFKSGRINILVA TDIVSRGIDIDDIRLVINFDVPHDSEDYVHRIGRTARANNDGVALTFVNEKEQTNFKNIE NFLEKEIYKIPVPAELGEAPQYNPRSYTNAGRGGRNFRNGNRKNNNGGRSTAPRSGRR >gi|226332018|gb|ACIB01000038.1| GENE 63 71726 - 72949 1349 407 aa, chain - ## HITS:1 COG:PA4960_2 KEGG:ns NR:ns ## COG: PA4960_2 COG0560 # Protein_GI_number: 15600153 # Func_class: E Amino acid transport and metabolism # Function: Phosphoserine phosphatase # Organism: Pseudomonas aeruginosa # 191 402 1 212 217 261 65.0 2e-69 MQPSKTELILIRITGEDRPGLTASVTEILAKYDATILDIGQADIHNTLSLGILCMTEEQL SGFMMKELLFKASSLGVTIRFYPITEEEYESWVNMQGKNRYILTLLGRKLTARQIAAVTR ILAEQDMNIDAIKRLTGRIPLDERKMHTRACIEFSVRGTPRDKEAMQGQLMKLASELEMD FSFQLDNMYRRMRRLICFDMDSTLIETEVIDELAIRAGVGAEVKAITERAMRGEIDFTES FRERVALLKGLDESVMQEIAESLPITEGVDRLMYVLKKYGYKIAILSGGFTYFGQYLQKK YGVDYVYANELEIVDGKLTGRYLGDVVDGKRKAELLRLIAQVEKVDIAQTIAVGDGANDL PMLGVAGLGIAFHAKPKVVANAKQSINTIGLDGVLYFLGFKDSYLNM >gi|226332018|gb|ACIB01000038.1| GENE 64 73221 - 73730 393 169 aa, chain + ## HITS:1 COG:no KEGG:BF2390 NR:ns ## KEGG: BF2390 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 169 1 169 169 323 100.0 1e-87 MRHVKWIFVVLLISSLTSFVEKDKPTGGLNVGDVAPDFTIESTSDAQYNFDLTDLKGKYV LLSFWASYDAQSRMQNASLSNALRSTSQDVEMVSVSFDEYQSVFQETIRKDQIVTPTCFA ETKGESSGLFKKYRLNRGFTNYLLDGNGVIIAKNISAAELSAYANKIKG >gi|226332018|gb|ACIB01000038.1| GENE 65 73896 - 74993 1002 365 aa, chain - ## HITS:1 COG:FN1030 KEGG:ns NR:ns ## COG: FN1030 COG0795 # Protein_GI_number: 19704365 # 
Func_class: R General function prediction only # Function: Predicted permeases # Organism: Fusobacterium nucleatum # 7 363 2 361 363 110 27.0 3e-24 MRSNRFIKRLDLYIIKKFLGTYVFAIALIISIAVVFDFNEKMDKFMERSAPWSAIIFDYY MNFIPYFANLFSPLFVFIAVIFFTSKLAENSEIIAMFSTGMSFKRMLRPYMISAGIIAIS TFILGSYVIPRGSVTRLDFEDKYVKKKKTTYVHNIQLEIDTGVIAYIDNYQDYNKTGNRF SLDKFVDKKLVSHLTARSITYDTTAVNKWTIKDYMIRNLDGLKETIVRGDKMDSIIPMEP ADFMIMRNQQEMLTSPQLSAYIDKQKQRGIANIKEFEIEYHKRIAMSFASFILTVIGVSL SSRKTKGGMGLHLGIGLGLSFSYILFQTVASTFAVNGNMPPMIAMWIPNLLYALIAFYLY RKAPK >gi|226332018|gb|ACIB01000038.1| GENE 66 74998 - 76035 1064 345 aa, chain - ## HITS:1 COG:BS_tgt KEGG:ns NR:ns ## COG: BS_tgt COG0343 # Protein_GI_number: 16079824 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Queuine/archaeosine tRNA-ribosyltransferase # Organism: Bacillus subtilis # 1 330 37 363 381 350 51.0 2e-96 MPVGTIGSVKGVHQTELKEDIQAQIILGNTYHLYLRPGLDVLEKAGGLHKFNGFDRPMLT DSGGFQVFSLSGIRKLREEGAEFRSHIDGSKHIFTPEKVMDIERIIGADIMMAFDECPPG DSDYAYAKKSLGLTHRWLDRCIQRFNETEPKYGYSQSLFPIVQGCVYPDLRKQSAEYIAS KDADGNAIGGLAVGEPVDKMYEMIELVNEILPKDKPRYLMGVGTPVNILEGIERGVDMFD CVMPTRNGRNGMLFTKDGIMNMRNKKWEADFSPIEADGASYVDTLYSKAYLRHLFHAQEL LAMQIASIHNLAFYLWLVGEARKHIIAGDFSTWKPMMVKRVSTRL >gi|226332018|gb|ACIB01000038.1| GENE 67 76125 - 78593 2450 822 aa, chain - ## HITS:1 COG:PM1978 KEGG:ns NR:ns ## COG: PM1978 COG0466 # Protein_GI_number: 15603843 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ATP-dependent Lon protease, bacterial type # Organism: Pasteurella multocida # 38 803 9 770 804 658 45.0 0 MKKERYLREMDDQNDNAFSLIADFDGNEDQVFDIKVGETLPVLPLRNMVLFPGVFMPVSV GRKSSLRLVREADKKKSYIAVVCQKMAETDEPAFEDLHPIGTIGKIVRVLEMPDQTTTVI IQGMKRLELKNITETHPYLKGEVNIIEEEIPSKDDKEFQALVETCKDLTIRYIKSSDTLH QESAFAIKNLTNHMFLVDFICTNLPLKKDEKIELLRIDSLRERTYRLLEILNREVQLAEI KASIQMRAREDIDQQQREYFLQQQIKTIQDELGGGGQEQEIEEMRQKAEHMKWSTEVRET FLKELAKLERTHPQSPDYSVQLNYLQTMLNLPWGVYTTDNLNLKNAEKTLNKDHYGLEKV 
KERILEHLAVLKLKGDMKSPIICLYGPPGVGKTSLGKSIAAALKRKYIRMSLGGVHDEAE IRGHRKTYIGAMPGRIIKNLIKAGSSNPVFILDEIDKVSADRQGDPSSALLEVLDPEQNT AFHDNFLDVDYDLSKVMFIATANNLNTIPGPLLDRMELIEVSGYITEEKVEIARKHLVPK ELEANGMKKTDIKIPKDTLEAIIESYTRESGVRELEKKIGKILRKSARQYATDGFFLKTE IKPTDLYDFLGAPEYTRDKYQGNDYAGVVTGLAWTAVGGEILFVETSLSRGKGGRLTLTG NLGEVMKESAMLALEYIKAHASLLNLDEEIFDNWNIHVHVPEGAIPKDGPSAGITMATSL ASALTQRKVKANLAMTGEITLRGKVLPVGGIKEKILAAKRAGIKEIIMSAENKKNIDEIQ DIYLKGLTFHYVNDVKEVFAIALTQEKVADAIDLSVKKANQE >gi|226332018|gb|ACIB01000038.1| GENE 68 78778 - 79491 417 237 aa, chain + ## HITS:1 COG:YPO2709 KEGG:ns NR:ns ## COG: YPO2709 COG4123 # Protein_GI_number: 16122913 # Func_class: R General function prediction only # Function: Predicted O-methyltransferase # Organism: Yersinia pestis # 6 235 20 250 252 196 41.0 3e-50 MSQPFFQFKQFTVWHDKCAMKVGTDGVLLGAWTPVESSARILDIGTGTGLVALMLAQRCS ASVIALEIDGTAAQQAAENITRSPWGSRIEVVCQDFRLYSNKNNSLKYDTIVSNPPYFTD SLKCPDSQRNTARHNDNLSYEELLKGVSNLLSPNGTFTVVIPMDASDSFKDIASSQGLYP SRQLLVITKPGAPPKRTLISFTFIKQDCKEEKLLTEVARHRYSDEYIKLTREFYLKM >gi|226332018|gb|ACIB01000038.1| GENE 69 79992 - 80210 382 72 aa, chain - ## HITS:1 COG:no KEGG:BF2360 NR:ns ## KEGG: BF2360 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 72 1 72 72 83 100.0 2e-15 MAKVINRDVPIAEENTTLTGQPATNMYDDWSEEMEDRADNVYDDTKKKSAGNKKSKEKKL KEIDEVVKEDLE >gi|226332018|gb|ACIB01000038.1| GENE 70 80396 - 80548 181 50 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565585|ref|ZP_04843040.1| ## NR: gi|253565585|ref|ZP_04843040.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 50 18 67 67 70 100.0 2e-11 MRLKTPKPIRLIALAGIALFIDSSVSIPSVSVKDWTEIDETGNPLYPLPK >gi|226332018|gb|ACIB01000038.1| GENE 71 80557 - 80898 352 113 aa, chain + ## HITS:1 COG:MA1608 KEGG:ns NR:ns ## COG: MA1608 COG4828 # Protein_GI_number: 20090466 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Methanosarcina acetivorans str.C2A # 6 111 11 111 111 57 34.0 5e-09 MLTQFLNLLATIISVISLLIVTYGALIAIISFIINELKRVTGAYTPTNIRKLRAVFGTYL LLGLEFLIASDILKTVLEPTMNELIILGGIVVIRTILSVFLNKEIKELETENN >gi|226332018|gb|ACIB01000038.1| GENE 72 80967 - 81140 241 57 aa, chain + ## HITS:1 COG:no KEGG:BF2363 NR:ns ## KEGG: BF2363 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 57 1 57 57 62 100.0 5e-09 MNTEKEKTSSEEQKKAEKVLKDKVPVQQTGTYSEATKKEVRDAVKELNPDMSGLDRG >gi|226332018|gb|ACIB01000038.1| GENE 73 81147 - 82424 1006 425 aa, chain - ## HITS:1 COG:NMB0114 KEGG:ns NR:ns ## COG: NMB0114 COG5000 # Protein_GI_number: 15676042 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation # Organism: Neisseria meningitidis MC58 # 110 423 362 699 706 122 26.0 1e-27 MKRYSISVVLHILLLVVLSIGGYLLFCYELWFSTLIVGILLIATGVHLYSIQMKLAGMMR RLTDCIRFNDMTQNFQPPFKSKMMVELADELSQTLRLFRGRLLEEEIKHQYYENLLNKVD TAVVVTDRSGRVEWMNRAAVALVGQESRLPQEWLTTSWNETQVVRIRQQGASVEMAVSCT LFAAQNKERLLVSLKNIHSVLERNEMEAWQKLIRVLTHEIMNSITPIISLSETLSERGIP ERLSEKEYGVMLQAMQTIHRRSKGLLGFVENYRRLTRIPTPACTSVAVDELFSDLRKLFP DSFIHFAATHRGATLYIDRAQIEQVLINLIKNAKESCGQNTAPQIEVELEQVPGKVCSLT VRDNGEGILPEVIDKVFVPFFTTKPSGSGIGLSLCKQIMNLHGGTITVSSEIGKGSCFTL MFPGR >gi|226332018|gb|ACIB01000038.1| GENE 74 82421 - 83785 1453 454 aa, chain - ## HITS:1 COG:hydG KEGG:ns NR:ns ## COG: hydG COG2204 # Protein_GI_number: 16131834 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Escherichia coli K12 # 
7 451 8 441 441 292 37.0 1e-78 MEQLGKILIVDDNEDVLFALNLLLEPYTEKIKVATTPDRIEYFMTTFGPDIILLDMNFSR DAISGQEGFESLEQILKIDPQAIVIFMTAYADTDKAVRAIKAGATDFIPKPWEKEKLLAT LSSGMKLRQSRHEVNMLKEQVEVLSGQGGPENEIIGESEAMQEVFSTINKLSETDANILI LGENGTGKDVIARLLYRCSPRYGKPFVTIDLGSIPEQLFESELFGYEKGAFTDARKAKAG RMEVATGGTLFLDEIGNLSLPMQSKLLTAIEKRQISRLGSTQSVPIDVRLICATNADIRA MVDEGNFRQDLLYRINTIEIHIPPLRERGNDVILLAEFFLERYARKYKKEMHGLTREAKN KLLKYNWPGNVRELQHTIERAVILGDGSLLKPENFLFHSSVRQKKEEEVLNLELLERQAV EKAMRLSEGNITRAAEYLGITRFALYRKLEKLGL >gi|226332018|gb|ACIB01000038.1| GENE 75 83761 - 84006 69 81 aa, chain - ## HITS:1 COG:no KEGG:BF2366 NR:ns ## KEGG: BF2366 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 81 1 81 81 146 98.0 3e-34 MDIICTFALRLKVSEYIRLTLASLHGKIRCTRCITGTRRVVQLVHDEMYNWYTTSCTTGI SYPLKESKLARNITWNSLEKY >gi|226332018|gb|ACIB01000038.1| GENE 76 84148 - 85620 1505 490 aa, chain + ## HITS:1 COG:VC1565 KEGG:ns NR:ns ## COG: VC1565 COG1538 # Protein_GI_number: 15641573 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Vibrio cholerae # 180 471 120 403 419 65 23.0 2e-10 MNLKTILIIAGCSYIFPATAQERTMELSLDETVKLAKLQSPDAQTARHSFRSAYWNYKYY RANYLPALSLTSDPNLNRAINKVTLGDGTVKFVEQNMLSTDLTLNLTQNIPWTGGSLFVE TAAQRMDIFSDHTTAWQTSPINIGYRQSLFGYNSLKWDRRIEPVRYREAKKSYVETLELV ATRATQKFFNLATAQSNYETATTNYANADTLYQYAQGRYNIGTITENEMLQLELNKLTEE TNRMNARIEMDNCMQELRSYLGIQSDEELKVKINDHVPDFSVELHEALLLANENSPEIQN MIRRKLESESNVSYARANAGLKADIYLRFGLTQTADKLGSAYKRPLDQQYVSLSVALPIL DWGRGKGKVRVARSNRDLVYTQVEQDKTDFELNIRKLVKQFNLQAQRVRIAARTDETAQR RSDVARKLYLLGKSTILDLNASITEKDQARRNYITALYNYWSLYYTLRSLTLFDFEGKTP LTENYDLLID >gi|226332018|gb|ACIB01000038.1| GENE 77 85659 - 86909 1523 416 aa, chain + ## HITS:1 COG:YPO1498 KEGG:ns NR:ns ## COG: YPO1498 COG0845 # Protein_GI_number: 16121771 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Yersinia pestis # 57 406 
56 410 420 96 25.0 9e-20 MDIKIEKKPWYIRYKFYIAGGIAFVAFLVYVIILSAGPRKLRIESENIQIAEVKDDKFME YVDVEGLIQPILTIKVNTREAGSVERIIAEEGSLLQKGDTILTLSNPDLLRSIEDQRDDW EKQRITYQEKEIEMEQKSLSLKQQTLETNYELARLKKSFTLDKEEFRMGIKSKAQLEVSE DEYNYKVKNAELQREGLRHDSAVTIIRKDLIRTDMERERKKYERATERLGNLVVKAPISG QLSFVKVTPGQQVGSSESIAEIKVLDQYKIHTSLSEYYIDRITTGLPATVNYQGKKYPLR ITKVVPEVKDRMFDVDLVFTGEMPDNVRVGKSFRVQIELGQPEQAIVIPRGNFYQATGGQ WIYKANASKTKAVRTPITIGRQNPQQYEITGGLEPGDYVVTTGYDTFGEAEELILK >gi|226332018|gb|ACIB01000038.1| GENE 78 87053 - 89476 1582 807 aa, chain + ## HITS:1 COG:no KEGG:BF2369 NR:ns ## KEGG: BF2369 # Name: not_defined # Def: ABC transporter permease # Organism: B.fragilis # Pathway: not_defined # 1 807 1 807 807 1608 99.0 0 MKTIRLAWKALARFRTYTFINILGLALSLACVLIILRYIHQEVTVNHFCKDLENTYLLYI EYEDGRRTISSNEDRNNDPNFIDPLNDPSVLKSTRWINFPEDRITVGKQIYNVKTVVTDS VFLQILPYPSVSGISSLKSPNDAIITRRLAERLFGKENPIGKTMTYSTGDIVTVTGVIGE PTTKSFLDFDLIISERLQHSWSRLSNSLVQLIPGTDFKKLNVKNEKFMKLRCHMDAPTRL QFFPLKDFYFDKTVRVYNNNIRKGNYNNILVLAVVTIALLIIGLFNFINIYTVMMLKRAR EFGVKKVYGAGAKDVFAQIFTENFILTGMALCISWCIIEITGGMMEHVLRIPQTSNTEFS ATLSVGILILLPLLTSIYPFIRYNYVSPSVSIRSVNAGGHSIVSRVLFLFVQYIITFVLI IVSLFFTKQVRFMLSADLNYTTKDIIQCQLYAERSSYDINISDEEWERRKQREKSNLAYI KEEMDHSPLFIRWEYGENPNQLDDNYINVRNAQRDEFKQVIYSSLSNKYIELFGFQLKEG RLWNDSVDQWTDYKMIINESAKSLLEIDNIETALIQPERRLWWSLSKSEEMKKNPPYQVI GVIKDFKIGHLSKATPPLFIVYEDPRGSYRDRLMAQIVPGKKQEAIAFLKKLRDEILGEG EFEYSFLEDEIATMYQEDKRTAEIYSLFSIIAILISCLGLFGLSMFDIRQRYREIALRKV NGATLKEIYPLLLKKYSIILGMAFIISAPLSWYIISKYLEGFANKAPISWWLFAIAAIVT AFISLATLIWQIRKAANINPAKVLKGE >gi|226332018|gb|ACIB01000038.1| GENE 79 89626 - 91971 1363 781 aa, chain + ## HITS:1 COG:no KEGG:BF2454 NR:ns ## KEGG: BF2454 # Name: not_defined # Def: ABC transporter # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 781 1 781 781 1563 99.0 0 MIRHYFKIAFRNLLKYKTQSIISIIGLAVGITCFALATLWIRYEMTYDTFHRRADDIYLV RAQLTITDGTLSNSMPYPAIEYLRKNISEIEDICGISPFKTNLRFKDKGGDLLAIEVDSA FIRMFDVRILQGNVNFLKKKSNEIAITEAVAKEWFGNESPLGKEIELGSRPCKVCAVVSG 
WSQHSNLSYGALLPARHHPSWQSNSAQIFVRILPDTDKSALQKRISSLDASSQEKESTLG KLNFTPITSLRYSDYLQKDEIVISFNYIRYFAMAGVLVIVCSLFNYLTLFVSRLRMRGRE LGLRKVCGSTNRSLFALLSVEYLIVLLAGSLLGMAFIEACLPHFIELAQISEATPLYTEV IIYILAVIVLSFGISQIPLYYFRSRTLQSSIRNKKGSPQRGIFRRLGLIAQLIISLGFIF CTTIMMKQLYYLKNTDLGIERHNIGNVAVWMKGDINEWSSKIANLPMVTEALPPHYFPIV PTGPMMYTDINGWDGLNETTDETYSVGLIPSGKEFFDFYGLQLTEGEWLSEKNSPGDVII NETAALTFGWRNPVGKQFYSEYEHNRTYYTVVGVVKDFSYLPPTIAPRPLAFVRTEEQKY LWSRASILFKFTEGSWEACKDTIRKMKEEDFPSSFLRLYNEEEEYNKYLRSESALMTLLG IVSIVCVIISIFGIFSQVTLSCEQRRKEIAIRKVNGATIGSILQMFIKEYFVLLLVAALI AFPASYGMMRVWIESYVRQTSTPFWIYIVLFAGIGIIIVISIFWRVWNAAKQNPAEVVKT E >gi|226332018|gb|ACIB01000038.1| GENE 80 91988 - 94315 1686 775 aa, chain + ## HITS:1 COG:YPO1365_2 KEGG:ns NR:ns ## COG: YPO1365_2 COG0577 # Protein_GI_number: 16121645 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Yersinia pestis # 496 730 108 352 395 61 23.0 7e-09 MIRHYLKIACRNLLKYKTQSIISILGLAIGFTCFALAVLWIHYEMTYDTFHEGFDRIHLV YQKSALSDTGVTTTIPYPVSTSLEKQFPEVEDACGFLFYEQEVTVDDGAIRQLYEINADS CFMHMFGIQVLSGSLDFLESEERIALTEHAAKELFGTENPIGKEIKLYGAPKTVCAIVNG WNRHTNLPFSILTGGIRQWHNAWYHGGFHVFIKLHKEVNAETFQKKLEQTKLEADGKGGI QNLMVMPISKCHYTVLADQNAIQFSYILFFSIVGGLVILCSLINYLSLFVSRLRMRSREL ALRKVCGSSDLHLFILLVTEYLLILLAAGLMGMALIELVLSPFKELSGVKEGDVYWESFL YFALVIGCSLATFLPVTFYFNKQTLQSNIQQKTVNRYGYLGRKISIVFQLSISICFIFCI SVIMKQLYYLSTTDIGIERKNIATLSMYPQNNLLPAADKIEQFPYVTQVLKGHFSLLPKT ASMAMHFKDWDGKQPGDAEIDMEVLMESEELAQFYGIRLLKGKMLKEGERDAGTIVINEA AAKALGWNDPIGKKLIRPNGTGTTVIGLVKDFHTTSPTTPIKPIAFIAKGFSGFDLGKGD VLIKYREGEWPKLKKDIEQLCQKEYPENKIRLSNMEETYDNYLKSEQTLLKLLSRVAIVC ILIAVFGVFSLVTLACEQRRKEIAIRKVNGATLGNILSIFIKEYLILLLCASFLAFPVSY MIMKAWLENYVEQISIGVSMYVTIFTGIGIIITACIGWRVWKAARENPAEVVKTE >gi|226332018|gb|ACIB01000038.1| GENE 81 94330 - 96732 1256 800 aa, chain + ## HITS:1 COG:YPO1365_2 KEGG:ns NR:ns ## COG: YPO1365_2 COG0577 # Protein_GI_number: 16121645 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide 
transport system, permease component # Organism: Yersinia pestis # 510 800 109 395 395 65 22.0 4e-10 MIQHYFKIACRNLLKHKVQNILSIVGLSIGFTAFLLGGYWHHWEYHFDSFHPQSSRTYAL TTTGIFKTADGSVGELNQIHQMVEKDLVTFPEIAKVCHVSEVKYEFEKDTKSWIGMKIDS TFFDIFQCKLIEGSYYKVPFNVNHVILTQKMANFYFGDSSCVGKELKINDKLSYTIAGVM ENYPQNSDFKFEYLILATPSPNQVKRNTTYVWLHPSADAAHLSKKIAAYRVKEPDTKWSK YSEWRFHLRPLPEIHTRCSPELKGRLQHIRILATAGILAFASALMNLLVLFIGQQQRKAR YNATFSTLGASIYSLIGKNLLELTLPLFIAFLLSMAFIEFLFPFYKDYTSLVAESSSYYN GVIQSITRQEVLKASYWIYPLCCLIFLVLSTVPIVGLLKRNSRGTSLALRNGLIIGQIFI GSLFLLTSCMFYSQYRFMSRTDKGLVTDHIWQIDLGFDATYNTDCTPFIEALKQNSAIDD VTALTQPLLVLRGEWYCSFITQFPIEGRNNVDEATEDNCIVVQKNFLSFFGMKMKEGEWI QDQGTRDIVINETGARELNIPSLTGRLILSDDEDSENHAVPTRISGILRDFYYCPMQYPL SKVFFMYQNNADAARGYNGFRYFYIKVHPDNEKQALQYARRIYSQYSKKEISEDMQIIQL STLMELFNRPEKTMFRIFLLLAVLCILISSFGVFFLVSLSTEQRKKEIAIRKVNGAQFSD ILYLFLKEYLWLTLVSNAIALPLGYLFIKRWLETYAYHTDIHGWLFVCVFLFTCIIVILS VMRQVVVAAKINPAESVKSE >gi|226332018|gb|ACIB01000038.1| GENE 82 96926 - 99154 1082 742 aa, chain + ## HITS:1 COG:no KEGG:BF2373 NR:ns ## KEGG: BF2373 # Name: not_defined # Def: putative ABC-transporter permease protein # Organism: B.fragilis # Pathway: not_defined # 1 742 1 742 742 1472 99.0 0 MILHYLKIVFRQMAKRKAQTAISILGITAGLLCFSVCNYYNRIFSTGNKDLATYENQAEI CIKERSYQVNIPIEDFEKKIGKDKFEAVAFYVNSSSTITLDETVYCKVDKTECNADYFKV FPTECIDGSLKQFGISGNEAVVTTEFVKQFCGGVPPLGKTILDQRGKIHTIIAVIKPYPA GMNNYHSSYDVFLPLPENASFGIHKLLLKRPEDAEHISQLLPKLGLFPNHPEWIPQIVLD SQTEHKAGAELWVAILGLLVLLVGMINYFSFSIGAFANRYKEISLRNTLGSTYWGLFILL FLEQAVIILICGIITLAITESLLPWFISTFSNEIQRNLYIDTHRLWVYECQYIGGLLLIS LLISFISSWHIAHKTIAQGLRGGTTTGQRHIIRNTLLSVQLLFSFLFIVGTVGIRMQMKE YDLSANPNLSTEVKKEIMVVNIGRYDRIREHQPELINFLRSRRWNAETAYTNRDYSQEYG FTELCFVSDDFFNLMNIKCHHKPGEPFCYVNEQLYQTLQADSTSESFRFQNQVYPVKGLV HIGPDSPSAKQLALLPLSAMNDEIGKIYIRLVLDAPRKEVKAEMSKEMNQYLPQNEPFEF ISLYEEQTGLGTISVMWLFVVCSSICLVITVLGVYGAISIDTIRKQKEVAIRKINGARLP DIYWLFAKNYLILFLIASVIGGLISLFVMVIGSQHRVILFDYADPWLWMGPLMLLIGIIT ATISWQIYYIARTNPAEVIKNE >gi|226332018|gb|ACIB01000038.1| GENE 83 99165 - 101501 
990 778 aa, chain + ## HITS:1 COG:no KEGG:BF2458 NR:ns ## KEGG: BF2458 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 778 1 778 778 1531 98.0 0 MNYMIQHYLKTAIRNLLKYKTHSIISAICLSVGMTCFSIIHFFINEIDGASRNMPNFEQR ISIRMINSNHEVGGWGWSLNSSEIQTLTEHPIPGIKQICFHSFQREAEVVFINREQEEKP YIISYMDTDPNFFSHYNASFLYGNVLPTTPEEVVLSESCARKVYGKSNPVGTLLKIVKLK ESEKDKSTYYKVVNVIRNLPKTLNVETDIYFSHLREENRQQGYITEGTLETADGLNKANE SLKGITTLHNNEMAYFIANKEADSYHDPQRMIGIAFITFLSSLILLSGMINFLKFIIQSF YNRNRELALRKSLGASPKSLFALLFTEAFWMLTFSLLLSLVLSECTCLLLTTYIPPKEMI PIDIQTLYGIQVKLYIGLLLICTLVMLYPIQRLQRSGLARHMKTNSHRHLFRNIMMCVQL CVCIFFLGMSIAIHLFNSVGSVLYLPLSDKETNSTLCLEMNSVTLGKNKDAILSQIKMLP GVENISSALMSGNYNSFLTSDYESADHRTLTIRVRQGDPSYFQFFRIPFRGEIVEPHTSN VVYISEAFQKQLENDSVSGNVKLGKENYRIAGTYKACYGENISEHNQYNISVFFPTEEAS VIYIRFRDDISFGKAKSEIERVCRNYVPESLPLDIQRLDIRRSTTQGIRDLMGDASLLLG IISALLVILSIYSAISMDTVSRQKEVAIRKINGATPKVIALMFGKAYIIQFILAYTITYP LLRLLVIDITKDSPISSITGFTWGIYLFILIGLLIFVTTAYKIYRIMHLNPAEIIKNE >gi|226332018|gb|ACIB01000038.1| GENE 84 101530 - 102204 336 224 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 199 4 199 223 134 37 4e-30 MIKTINLQKIFKTEEVETWALNNVSVEVKEGEFVAIMGPSGCGKSTLLNILGLLDNPTGG EYYLNGKEVSKYTESQRTNLRKGVIGFVFQSFNLIDELNVYENIELPLLYMGIPASERKQ RVEKAMERMAITHRSKHFPQQLSGGQQQRVAIARAVVANPKLILADEPTGNLDSKNGKEV MGLLSELNKEGTTIVMVTHSQHDAGFADRVINLFDGQVVTEVTI >gi|226332018|gb|ACIB01000038.1| GENE 85 102375 - 103541 417 388 aa, chain + ## HITS:1 COG:SP1000 KEGG:ns NR:ns ## COG: SP1000 COG0526 # Protein_GI_number: 15900873 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Streptococcus pneumoniae TIGR4 # 234 373 26 170 185 63 31.0 5e-10 MKTQFFTLFFTIICLSLQAQQPCIIEGNINGIPDGTVISLMRQQGTGMKRIANDTIDNGK 
FKFIIHTLNNQTEALRIVSKGEGFPNTWLDVYASPGETVSIIGSDKLLRTWNIVSNIKEQ QEENQYTNEGFRNLTDQRQRLQALSSDMWKKIAISDSPKEKIQMTDSIQNILYPQLDSLE LLLSKEEINLMKNLPVTSIWLDHLEALSRQSVYLKGFPISEAQVLYQQLTSTQRNSQIGK KIEACLTPTKAKIGDDMPDTELSNIDGNHHRLSDYKGKYLLLDFWSRSCGHCIESLPEME ILSDMWKEKVTFIGINIDDEKSWKEFSQRKNIKWIDLNDPKGAFGLYIRYKANGTPFYVL VTPDGKITDIWYGYNKDSLSERLKQGIK >gi|226332018|gb|ACIB01000038.1| GENE 86 103675 - 104343 117 222 aa, chain + ## HITS:1 COG:no KEGG:Cthe_3205 NR:ns ## KEGG: Cthe_3205 # Name: not_defined # Def: hypothetical protein # Organism: C.thermocellum # Pathway: not_defined # 40 221 38 216 222 104 33.0 3e-21 MKNTHVLLIKFKNKISDDEVQFFRSSIIQKLGDQPDILYHNHVEKNKYRYSYPLIQYKNI EQQATIVCIDQGTKAIEKFFSQCDFNFQLGNRKVNMKFASVTPYKLLIERQSKMINYHIH NWLPLNSDNYKKYQNISILSERINFLEKILIGNILSFTKGVNYFIDFPLQCKLLQLSFAK LISNKNIKLMSFDADFQCNLNLPDYIGIGKHTSIGYGTITRN >gi|226332018|gb|ACIB01000038.1| GENE 87 104361 - 104987 462 208 aa, chain + ## HITS:1 COG:no KEGG:Slin_3445 NR:ns ## KEGG: Slin_3445 # Name: not_defined # Def: protein of unknown function DUF88 # Organism: S.linguale # Pathway: not_defined # 5 167 7 168 338 126 40.0 5e-28 MIESITSIGIFIDGGYFTKINQALEEKLSLNIDITFFFKFIKEKIAYEYNLNTEFCQITE SHYFRGRYRVNDANNKHLLFSERKFEDSLIENDVIFHYKHLREIQKEGEINVIEKGIDVW FALEAYELSLFRKFDFVILITGDADHEMLIKKLKALKIHTILLTWDLSPESATARLLREE ACKHIELSEIAIEDKDLIKKICRSKQKR >gi|226332018|gb|ACIB01000038.1| GENE 88 104963 - 105742 467 259 aa, chain + ## HITS:1 COG:no KEGG:Mevan_0112 NR:ns ## KEGG: Mevan_0112 # Name: not_defined # Def: hypothetical protein # Organism: M.vannielii # Pathway: not_defined # 10 254 2 246 250 175 44.0 1e-42 MQKQAKEIKKHLFLLGGHDLEMQTIVQILTDRNVIFKDRYLQWDNALLSQYEEEIQQYGN KEPFIIYGVELKEDITPPTNYIRIDHHNEYATYPSALEQVASILDHPLNRYQTLVAANDK AYIPGMLEIGASHEEINLIRQEDRKAQGVIEDDEKLAQEAITNGTEKIGSLYVVFTTANK FSPICDRLYPYEKLLIYTPNELIYYGKGINSIQKILKRYTPISNIFWGGGINGFIGTVRN RLTTNEILNIVEQIKLLEL >gi|226332018|gb|ACIB01000038.1| GENE 89 105739 - 107214 734 491 aa, chain + ## HITS:1 COG:no KEGG:Mevan_0111 NR:ns ## KEGG: Mevan_0111 # Name: not_defined # 
Def: hypothetical protein # Organism: M.vannielii # Pathway: not_defined # 2 450 5 478 635 248 34.0 4e-64 MIYSYHIFYFPFKWEIMGLENQAFSDQVNLDNIQYNRNSHWERSQKPDPGEEESLYNEKN YYYTFVHNILYDEEHSPLNLIHHFERKEPKLSNHIYYYIKKKGRNNPYKLIVDAMNINLY ATGVGFLSFYLKNEDCTQNSPEDILAINQYGRRIMPPFFNDTRLRNEISEYIRIEGLNQT VYFEDFKSYTPYDSWQPSSSIKKLICELVTNLSIDPIIDDRMFVATWYKNNQLSQQFTNN AKAYFDSQDPFSDYWYRFLFIDGSNATCQNEKMKKELLEEHTYYRWQQWSSLYGISKYSL VYLTNNEVPDYLIEYFQTIYARMAELVLVQRASMLRFSGEITKVSQLSNQDVEAVSKRVS SLYKEYIRFVNQIYFREITAQDQGIEMYNKLHSCLQMESYIKDLDGEIEELHQYISLMED RERNKKASLLNDIATLFLPITVITGFWGMNQISEVMEENGELSTGFIIQSLLLIIGTLCA ICIIYKRKRKL >gi|226332018|gb|ACIB01000038.1| GENE 90 107211 - 107690 413 159 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565605|ref|ZP_04843060.1| ## NR: gi|253565605|ref|ZP_04843060.1| predicted protein [Bacteroides sp. 3_2_5] # 1 159 1 159 159 270 100.0 2e-71 MNWKLVECEIALIVSLTVIECVNMGQNSPKDITCLTVFFCIMIVLLPLIGVLQQWHLSCF QNRQKEKEYQAKQETDEKMKTWLLAREAIIKDKEKEELTNKVNGLQQKCDSLIENQENEL KKFYLSILSIIGTKDDLKSIEENFKKMKDFFEEYKKITK >gi|226332018|gb|ACIB01000038.1| GENE 91 107704 - 107955 180 83 aa, chain + ## HITS:1 COG:no KEGG:Mevan_0105 NR:ns ## KEGG: Mevan_0105 # Name: not_defined # Def: hypothetical protein # Organism: M.vannielii # Pathway: not_defined # 1 77 1 70 337 67 53.0 2e-10 MNQLTAILKQHTPMIHFQHNESGATLRASEVKPLLDKFILTKLGNGDIREGRLYAKKNNW LIDNEKNYALNYKLSISLQKKVD >gi|226332018|gb|ACIB01000038.1| GENE 92 108570 - 109112 278 180 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565606|ref|ZP_04843061.1| ## NR: gi|253565606|ref|ZP_04843061.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 180 136 315 315 321 100.0 1e-86 MKQLIKARGYELKGDHSPISGIRENDNSWNDPNPNGYNYAYIRAILGLAEQYEFQLETPY QKAIVKIKSANNCISRYKSPLLFKIINNSIYLVGNEINTEILNKPFQYSYIEQTKNKNMR TGKSEITERTMHINEIEMNYNNRINYHYTPTSFSLIDFMQYAMSYKKNGKNILNYIPLKQ >gi|226332018|gb|ACIB01000038.1| GENE 93 109117 - 110919 785 600 aa, chain + ## HITS:1 COG:no KEGG:Mevan_0106 NR:ns ## KEGG: Mevan_0106 # Name: not_defined # Def: CRISPR-associated RAMP Crm2 family protein # Organism: M.vannielii # Pathway: not_defined # 2 526 3 484 528 194 32.0 9e-48 MKYIAITLGPITRTIEMAESTKELWAASYFFSYLAKKIVEPFVKKNRTFQLPLINEEMQK PHCGAGLFPDRYIFKSEPGDLELLKQHSDQVLIEIAGHIASPSLPGTAKDVSQIYHYLKS YIKIYFIERTLESDDPYVVIPACEKYLNIIENQETFPEQEETMISHQKSDFLKFLITNVN GKIYRKDKNSIPRFTGSFLTRDAFGDMNGERLFESILEISASELNINIQQKALEVITANE KNKGEKYSDQIWDAEEIILNDNKAQLRPYHKYIAIIKSDGDSMGETIKSMGAYNIPITQL SKALLSFNIESINEIVAYGGKPIFIGGDDLLCFAPVCCNGNNVFNLVEKLSTCFDQCINQ HLQQYINACSEAQRPLPSLSFGISITYHKYPMFEALHTTDYLLEMVAKDNLFKYTLSNKN ILNENMKRFILKNKLAFSLQKHSGQIYHTAMSKKGKSYVKFNMLLQKYILKNKDMSKTQE SEKFLSSVIQMIRAHAEILQIILQNEDKRTEMLKNYFDNNFNESCHLGYTGLFEDIQTLL CLRYQENIQDYQNRNEIIQQNTILTSDEKEILIVSPAMDAIHTIFTALQFIHFINYNKDE >gi|226332018|gb|ACIB01000038.1| GENE 94 110912 - 112051 485 379 aa, chain + ## HITS:1 COG:no KEGG:Mevan_0107 NR:ns ## KEGG: Mevan_0107 # Name: not_defined # Def: CRISPR-associated Cmr3 family protein # Organism: M.vannielii # Pathway: not_defined # 4 377 3 356 362 101 28.0 5e-20 MNRHYLITLTPMDWFFFGGERTLDDGKSADYISHSNKFPQQSALLGMIRYQLLKQHNLLS QFPYTENKPTEKEIMKTLIGEQSFRMTERKAKSLGLGVIKQISPLMLIECKDDTSSRSIY FPLPLDDGYKVSFNETNNEDKVFYNGIECPIPNVYPASEEQDSGNQKRKFFDHKTYNNYL FWCTQGNNQIKKLLSDEIWISKMQIGITKHVEEGEDNNKSFYKQEFLQLKKSFIYAFYIT LSGESELSSDIIQLGGQRSVFRMEVESIEENSDIQEKYQTAAQFLTQSDRLLILSPTYVD NLKELSALCNFMWSDSIVFRNIQTTNASNFYGKPIKSSSKYHFLKPGSVLYFKQGKRKEV EKLLMDYTYLRLSGYNIYI >gi|226332018|gb|ACIB01000038.1| GENE 95 112071 - 112910 454 279 aa, chain + ## HITS:1 COG:TM1792 KEGG:ns NR:ns ## COG: TM1792 COG1336 # Protein_GI_number: 15644536 # Func_class: L Replication, recombination 
and repair # Function: Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) # Organism: Thermotoga maritima # 1 269 1 280 288 78 24.0 1e-14 MTTRMYVINTLSNMHVGSGEVNYGVIDNLIQRDSVTNLPNINSSGLKGAIREYFKENENL VRELFGSAPKDEKTLPGKVRFFEANLLSMPVRSDKVPFLMATSDEVLQELITKMKFFNCE EATQYISHLSTLLDNIKTQAQGTDFAYVFDPLLQGAIIEEVSIRATCPSHIPLQPSLKKL LGDRLVILSHKYFSILSDDNHLPVLSRNNLENGQSANLWYEQVLPRYSRLYFMLMDGNAQ SEYLKKFRDTLCTPSTIIQIGANASIGYGYCQISELSPF >gi|226332018|gb|ACIB01000038.1| GENE 96 112923 - 113333 368 136 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565610|ref|ZP_04843065.1| ## NR: gi|253565610|ref|ZP_04843065.1| predicted protein [Bacteroides sp. 3_2_5] # 1 136 1 136 136 239 100.0 5e-62 MKISKKQIEYAIEALRANNIITNDNQYPKVFKGYISSFGAAVIQSGLIPAIIFFENEDND ANADRHKIIGVLKDIINAMRQQYTVTDATILVSSQIPANYSMAQYIIEHGNTDQLLKEIT EAAVAMKLALRMYKSE >gi|226332018|gb|ACIB01000038.1| GENE 97 113330 - 114280 640 316 aa, chain + ## HITS:1 COG:no KEGG:Mevan_0110 NR:ns ## KEGG: Mevan_0110 # Name: not_defined # Def: CRISPR-associated RAMP Cmr6 family protein # Organism: M.vannielii # Pathway: not_defined # 29 313 39 285 334 141 36.0 4e-32 MIMPKNYTLQNASNLGWLFYKDYYRQEPNVDFISTQGKESDTTADFFRKTNQRITAYQLN SESPLVAAFNNHFGTPLQLKTIYPGLITGSGLPHQTGSKGEFKLGFQFDYTTGLPYIPGS SIKGTLRSMFPFSLKDKGSTKRILPEYRKERMEYIRDLIIEVTNINEISDTEIQALEYAI FTNSTPSGKTIEFSLEEKDVFYDAFVADSKDGVMLSDDYITPHGENPLKDPKPILFLKIR PDVTINFYFKLCTTHLYKEKVCSSKQIEEIKKQNDFSSSDYKMITAHQKRNLFEKILLCI GIGAKTNIGYGQLKKL >gi|226332018|gb|ACIB01000038.1| GENE 98 114300 - 116570 972 756 aa, chain + ## HITS:1 COG:SMb21167 KEGG:ns NR:ns ## COG: SMb21167 COG3344 # Protein_GI_number: 16264581 # Func_class: L Replication, recombination and repair # Function: Retron-type reverse transcriptase # Organism: Sinorhizobium meliloti # 14 347 55 396 453 137 26.0 5e-32 MPDYYHSITTLHALQNAWRAVRAKNAAGGIDGFTLSHFEKRLNDNLIELQHELISQTWNP EPYLRIEITKNETEKRKLGLLCIKDKIVQQAIKTAIEPQLEKTFLNLSYGYRPNKGPERA IKRVVHDLKKLKSGYVAKLDIDNYFDTINHERLFTRLANWLKDDETLRLIRLCIQTGIVT 
PQLQWQEINKGVPQGAILSPLLANFYLHPFDQFAANKVPMYIRYADDFLIATSTEKQIKE AVELVKEELESQFYLQLNTPIIHNFHDGIEFLGITISDTGLSITEKKKKTLQERINSIKF IKSSLSSQSKETLQGIKNYYAKLLPESTLKELDCFLMNRLNALIIRNQNSINNKKELVSN LQKIEFYSENSNKNKSQLIQQLCSTYIVHSTKSKTRLTSTHIDNTKLITQKKKEYQKREN EGAELVISIPGSYIGATYKGITVKLQGKIINKPSPALKHITVVGKGISLSSNAITYCMNH KIPIDFFDGRGKQYSTVLNPVFLDGTLWNKQVELPLEQKIKLATQIIIGKLKNQLNLIKY YHKYHKDILGGKLSEKYVEVVLKIDKLIEKAKNYSQRNEKYTAELMAIESQAAIAYWSYI RVLTADDGIDFIRREHQGATDLLNSLLNYGYAILYARVWKNILAAKLNPSIGVLHAKQDG KPTLVFDVVELFRAQMVDRVVISLIQKKVSLKMHDGLLNESSKRVLIRYILERLNRYEKY RGEEITFSQIILRQAQEIALFISGDNLIFKPYVAKW >gi|226332018|gb|ACIB01000038.1| GENE 99 116564 - 116854 143 96 aa, chain + ## HITS:1 COG:no KEGG:Ppha_2458 NR:ns ## KEGG: Ppha_2458 # Name: not_defined # Def: CRISPR-associated protein Cas2 # Organism: P.phaeoclathratiforme # Pathway: not_defined # 8 80 3 77 94 66 41.0 3e-10 MVKAKKIFCVVAYDIQDDRSRIQISKILEKYGTRINYSVFECMFTDRQFQKIQINLERWI NRRYDTVVYYPMCINCYTRIIYQPIRKKIIKTVEIV >gi|226332018|gb|ACIB01000038.1| GENE 100 118164 - 118406 60 80 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MDFGIAFNIGKMINKQEKKKRGRTNLLVTILISCGIAYQKYTKAIILRGCPKSKVPPKSR IAPFTIVYFGEKPHITVVKN >gi|226332018|gb|ACIB01000038.1| GENE 101 118531 - 118749 153 72 aa, chain + ## HITS:1 COG:BH1901 KEGG:ns NR:ns ## COG: BH1901 COG3666 # Protein_GI_number: 15614464 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Bacillus halodurans # 10 71 51 112 237 77 48.0 7e-15 MDISYLLSAYNGGGTNSYHPRMILKVLFYAYLNNIYSCRKTQKALQKNIHIMWLSGNSTP NFRTINDFRGKV >gi|226332018|gb|ACIB01000038.1| GENE 102 118807 - 118962 75 51 aa, chain + ## HITS:1 COG:no KEGG:BF4243 NR:ns ## KEGG: BF4243 # Name: not_defined # Def: ISNCY family transposase # Organism: B.fragilis # Pathway: not_defined # 7 49 321 363 543 70 74.0 1e-11 MSVWMYIAVTDPGYGNEQNDEFMKNMGIEAFIKYNYFHKEQKRTWNKDAKT >gi|226332018|gb|ACIB01000038.1| GENE 103 119018 - 120202 1136 394 aa, chain - ## HITS:1 COG:BS_kbl KEGG:ns NR:ns ## COG: BS_kbl 
COG0156 # Protein_GI_number: 16078763 # Func_class: H Coenzyme transport and metabolism # Function: 7-keto-8-aminopelargonate synthetase and related enzymes # Organism: Bacillus subtilis # 29 394 27 392 392 306 41.0 4e-83 MGLLQEKLAKYDLPQQIKAKGVYPYFRCIESEQNTEVIMSGRKVLMFGSNSYLGLTNHPK VIEAAVEATRKYGTGCAGSRFLNGTLDLHLQLEKELAEFVGKEDAIIYSTGFQVNLGVVS CVTGREDYVICDELDHASIVEGRRLSFSTILKFKHNDMESLEKELQKCRPDAVKLIVVDG VFSMEGDIANLPEIVRLSKKYDANIMVDEAHGLGVLGNHGRGTCDHFGLTKEVDLIMGTF SKSLAAIGGFIAADESIINYLRHNSRSYIFSASNTPAATAAARAALQIMKNEPERIEHLW DITNYSLKCFRELGFEIGHTSTPIIPLYVRDMEKTFMVTKMLFDEGVFVNPVVPPACSPN DTLIRFSLMATHSKEQIDFAIGKLVKCFKALDLL >gi|226332018|gb|ACIB01000038.1| GENE 104 120437 - 121480 772 347 aa, chain + ## HITS:1 COG:lin0768 KEGG:ns NR:ns ## COG: lin0768 COG1597 # Protein_GI_number: 16799842 # Func_class: I Lipid transport and metabolism; R General function prediction only # Function: Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase # Organism: Listeria innocua # 7 307 3 309 309 110 28.0 4e-24 MSENKKKIIFIVNPISGTQSKELVLSLLDEKIDKEMYTWEVVYTERAGHAIEIAADAADK NTDIVVAVGGDGTINEIARSLVHTNTALGIIPCGSGNGLARHLQISMDPRKALEILNDGI IDIIDYGKINGTDFFCTCGVGFDAFVSLKFANAGKRGLLTYLEKTLQESLKYQPETYELE TEDGTSKYKAFLIACGNASQYGNNAYIAPQATLTDGLLDVTILEPFTVLDVPALAFQLFN KTIDQNSRIKTFRCKKLCIHRSSPGVVHFDGDPMQADEDIKIELIQKGLRVVVPGDKKKD NPNVLQKAQEYVNGIKLINEAIVEDIAHKNKVILKKNKQLIQKLTKK >gi|226332018|gb|ACIB01000038.1| GENE 105 121562 - 123319 1822 585 aa, chain + ## HITS:1 COG:BH1252 KEGG:ns NR:ns ## COG: BH1252 COG0173 # Protein_GI_number: 15613815 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Aspartyl-tRNA synthetase # Organism: Bacillus halodurans # 3 581 4 585 595 599 51.0 1e-171 MFRTHTCGELRISDVNKQVKLSGWVQRSRKMGGMTFVDLRDRYGITQLVFNEEIDAELCE RANKLGREFVIQIVGTVNERFSKNSHIPTGDIEIIVSELNILNSAITPPFTIEDNTDGGD DIRMKYRYLDLRRSAVRSNLELRHKMTIEVRSYLDKLGFLEVETPVLIGSTPEGARDFVV PSRMNPGQFYALPQSPQTLKQLLMVSGFDRYFQIAKCFRDEDLRADRQPEFTQIDCEMSF 
VEQEDVITTFEGMAKHLFKVIRNIELAEPFPRMPWSEAMRLYGSDKPDIRFGMQFVELMD ILKGHGFSVFDNATYIGGICAEGAAGYTRKQLDALTEFVKKPQIGAKGMVYARIEADGTV KSSVDKFYTQEVLQQLKEAFGAKPGDLILILSGDDAMKTRKQLCELRLEMGNQLGLRDKN TFACLWVVDFPLFEWSEEEGRLMAMHHPFTSPKPEDIHLLDTNPAAVRANAYDMVINGVE VGGGSIRIHDSQLQNKMFELLGFTPERAQEQFGFLMNAFKFGAPPHGGLAYGLDRWVSLF AGLDSIRDCIAFPKNNSGRDVMLDAPAALDPSQLEELNLIVDIKE >gi|226332018|gb|ACIB01000038.1| GENE 106 123320 - 123703 245 127 aa, chain + ## HITS:1 COG:no KEGG:BF2380 NR:ns ## KEGG: BF2380 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 127 1 127 127 223 100.0 2e-57 MKESVRIFRFAVIGTLNALITAFVIWLMMDELSYDYIPANITAYIVAQIHNFIWSKYWIF PIENKKNNIWKQMLFFCSAFGLAYSAQFLFLVTLVECGDVNEYLAQFLGLFIYGTVNFIV NKKLTFR >gi|226332018|gb|ACIB01000038.1| GENE 107 123783 - 124667 962 294 aa, chain - ## HITS:1 COG:XF2443 KEGG:ns NR:ns ## COG: XF2443 COG0388 # Protein_GI_number: 15839034 # Func_class: R General function prediction only # Function: Predicted amidohydrolase # Organism: Xylella fastidiosa 9a5c # 4 294 6 295 295 409 64.0 1e-114 MRKIKVGIIQQANTSDIRINLMNLAKSIEACAANGAQLVVLQELHNSLYFCQTENTDLFE LAEPIPGPSTGFYSELAAANRIVLVTSLFEKRAPGLYHNTAVVFDRDGSIAGKYRKMHIP DDPAYYEKFYFTPGDIGFEPIQTSLGKLGVLVCWDQWYPEAARLMALKGAEILIYPTAIG WESTDTDDEKKRQLNAWIISQRAHAVANGLPVISVNRVGHEPDPSGQTNGILFWGNSFVA GPQGEYLAQAGNDRSENMIVEVDLERSENVRRWWPFLRDRRIDEYGNLTKRFID >gi|226332018|gb|ACIB01000038.1| GENE 108 124679 - 125794 803 371 aa, chain - ## HITS:1 COG:XF2442 KEGG:ns NR:ns ## COG: XF2442 COG2957 # Protein_GI_number: 15839033 # Func_class: E Amino acid transport and metabolism # Function: Peptidylarginine deiminase and related enzymes # Organism: Xylella fastidiosa 9a5c # 34 370 22 361 363 307 47.0 2e-83 MGIMVGLPTSGGTEKDLQLNFGLTVNDQVEMLAPFLPAEWFLQSGIQLTWPHAGTDWAYM LAEVQECFINIAREIAKRELLLIVTPYPEEVRKQIIGTVNMDNVRFLKCDTNDTWARDHG AITLMDTGGASLLDFTFNGWGEKFEARLDNQITRRAVEAGALKGQYKDCLNFVLEGGSIE SDGAGTLLTTSECLLSPHRNSPMNRVDIEEYLCRVFHLQRVLWLDHGYLSGDDTDSHIDT 
LARFCSPDTIAYVKCTDSEDEHYEALCKMEEQLKTFRTTSGAPYRLLALPMADKIEVEGE RLPATYANFLIMNDVVLYPTYNQPENDKLAKEVLCEAFPTYEVVGIDCRALIKQHGSLHC VTMQYPTGVIK >gi|226332018|gb|ACIB01000038.1| GENE 109 125876 - 126403 549 175 aa, chain - ## HITS:1 COG:AF2201 KEGG:ns NR:ns ## COG: AF2201 COG4739 # Protein_GI_number: 11499783 # Func_class: S Function unknown # Function: Uncharacterized protein containing a ferredoxin domain # Organism: Archaeoglobus fulgidus # 1 175 1 183 184 133 41.0 2e-31 MILNERDSRHEHVLNVARQMMTAARTAPKGKGIDIIETAIVTGEEIQQLSDTLKAMFEEF GMKFFLRDADNILQAECILLIGTREQAQGLNCGHCGYATCSGRSEGVPCALNSIDVGIAI GSACATAADLRVDTRVMFSAGLAAQRLEWLKGCHQVMAIPVSASSKNPFFDRKPK >gi|226332018|gb|ACIB01000038.1| GENE 110 126498 - 127118 186 206 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 197 1 202 245 76 27 1e-12 MLQIDNACIAFGEDILFSEFCMRLNKGETACIAGQSGRGKTSLLNAIMGFVPLRKGKIKV GGILLEPTTIDAIRRHIAWIPQELALPSEWVKEMISLPFALKANRHISFSEEKLFTCFDE LGLDKELYQKRVGEISGGQRQRIMIAVAAMLEKPLIIVDEPTSALDAGSTDKVLAFFRNQ AEKGTAILAVSHDRTFAYGCNQLITL >gi|226332018|gb|ACIB01000038.1| GENE 111 127122 - 127916 586 264 aa, chain + ## HITS:1 COG:STM0503 KEGG:ns NR:ns ## COG: STM0503 COG0390 # Protein_GI_number: 16763883 # Func_class: R General function prediction only # Function: ABC-type uncharacterized transport system, permease component # Organism: Salmonella typhimurium LT2 # 1 250 1 246 259 94 30.0 2e-19 MGTIDISYFNLLIGLLLLVIPLFYLWKFKTGLLKATLIGTARMIVQLFLIGMYLKYLFLW NNPWINFLWVIIMIFVAGQTALVRTGLKREILLIPISVGFLCSVVLVGMYFIGIVLQLDN VFSAQYFIPIFGILMGNMLSSNVIALNTYYSGLKREQQLYCYLLGNGATRQEAQTPFIRE AIIKSFSPLIANIAVMGLVALPGTMIGQILGGSSPNVAIKYQMMIMVITFTASMLSLMIT ISLASRKSFDEYGRILQVTKESQK >gi|226332018|gb|ACIB01000038.1| GENE 112 127924 - 128196 201 90 aa, chain + ## HITS:1 COG:no KEGG:BF2386 NR:ns ## KEGG: BF2386 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 90 1 90 90 172 98.0 3e-42 
MTSTDSILQLISEIHIPGFFITVDFLQIGEAIPQGISGFLKEKYDKISHGASGRKFIYQE SGWRMAFTFYPTDRVVDEKYAMKNKMIKKR >gi|226332018|gb|ACIB01000038.1| GENE 113 128288 - 128671 340 127 aa, chain + ## HITS:1 COG:BS_yyaH KEGG:ns NR:ns ## COG: BS_yyaH COG0346 # Protein_GI_number: 16081138 # Func_class: E Amino acid transport and metabolism # Function: Lactoylglutathione lyase and related lyases # Organism: Bacillus subtilis # 1 127 1 126 126 141 54.0 2e-34 MHISHIAIWTTRLEELRNFYITYFNGTSNEKYINPKKGFESYFISFDQGFASLEIMQRED ITTPALKDCLGLAHFSFSVGSKEAVLELTEQLRKDGFVIESEPRTTGDGYFESAILDPEG NIVEITI >gi|226332018|gb|ACIB01000038.1| GENE 114 128664 - 129077 306 137 aa, chain - ## HITS:1 COG:no KEGG:BF2472 NR:ns ## KEGG: BF2472 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 137 1 137 137 240 100.0 1e-62 MKAFLPLLFSFFFIISCQQHKEATISPIDEEDELQEEADSLPRATAIFWLDKYHMKELKK DDVLTFRTAKAKVVIRNDGTIDLLSFVEQQPGNAQRYIRYRLKDFKVKKILMDNGYINPG EQYVQLRYIPALARRVK >gi|226332018|gb|ACIB01000038.1| GENE 115 128938 - 130278 641 446 aa, chain + ## HITS:1 COG:BH1380_2 KEGG:ns NR:ns ## COG: BH1380_2 COG3323 # Protein_GI_number: 15613943 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus halodurans # 213 315 6 108 113 119 53.0 1e-26 MAVARGRLSASSCNSSSSSIGDIVASLCCWHEIIKKKEKSNGKNAFIGLSLYKLGGKYIK KSIKARQDAIFFLYLQANFLLKMKIKEIVSALERFAPLPLQDGFDNAGLQIGLTDAETTG ALLCLDVTEAVLDEAIASGCNLIISHHPLIFKGYKSITGKDYVERCILKAIKNDIVIYSA HTNLDNAPGGVNFKIAEKIGLKNVRILDPKESSLIKLVTFVPSAQAEEVRNALFTAGCGC IGNYDSCSYNTEGEGTFRAQEGSHPFCGTVGELHRETEVRIETILPEYKKGEVIRALLSK HPYEEPAYDLYPLHNSWAQVGSGIVGELEEPESELEFLKRIKKRFEVGCLKHNKLTGRLI QKVSLCGGAGAFLIPQAVRSGADVFITGEIKYHDYFGRETDILLAEIGHYESEQYTKEIF YSIIRDLFPNFALQFSKVNTNPIKYL >gi|226332018|gb|ACIB01000038.1| GENE 116 130284 - 131108 934 274 aa, chain + ## HITS:1 COG:TP0494 KEGG:ns NR:ns ## COG: TP0494 COG1579 # Protein_GI_number: 15639485 # Func_class: R General function prediction only # Function: Zn-ribbon protein, 
possibly nucleic acid-binding # Organism: Treponema pallidum # 21 244 10 232 273 65 23.0 1e-10 MAREAKNEPKELTVEQKLKALYQLQTTLSKIDEIKTLRGELPLEVQDLEDEIAGLSTRID KIKSEVDELKSAIAGKRVEIEAAKASVEKYKSQQDNVRNNREYDFLTKEIEFQSLEMELC EKRIKEFTAEEQEKSEEIEKNTKALEERQKDLDQKKNELDEIIEETKQEEEKLRDKAKDL ETKIEPRLLQSFKRIRKNSRNGLGIVYVQRDACGGCFNKIPPQRQLDIRSRKKIIVCEYC GRIMIDPELAGVEIEHKVEEAPVTTKRAIRRKAE >gi|226332018|gb|ACIB01000038.1| GENE 117 131346 - 132752 501 468 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 [Campylobacter concisus 13826] # 48 462 41 456 460 197 29 3e-49 MKLRIGSITFLLFLSSVAFPQATSRYLDKPLPQGWEEDTQIFQQVLPVDDQWWKAFQDPV LDSLISVAVKQNYSVLTAIDRINMAKANLRMERGNFFPTIGLNAGWTRQQSSGNTSDLPQ STQHYYDASLNMNWELDLFGSIRNRVKAQKENFAASKEEYTGTMISLCAQVASAYINLRE LQQELAVVQKNCASQEAVLKITEVRYNTGLVSKLDVAQAKSVFFSTKASIPQIESGINQY ITTLAILLGTYPQEVRPALTAPGTLPDYMEPIGVGLPADLLLRRPDIRSAERSVNAQAAL VGASKSDWLPQVFLKGSVGYAAKDLKDLTHHKSMTYEIAPALSWTLFKGTQLVNATKLAK AQLDEAINQFNQTVLTAVQETDNAMNAYRNSIKQIVALREVRNQGQETLTLSLELYKQGL TPFQNVLDAQRSLLSYENQLVQARGYSLLQLIAMYQALGGGWSGNLNN >gi|226332018|gb|ACIB01000038.1| GENE 118 132771 - 133910 986 379 aa, chain + ## HITS:1 COG:BMEII0914 KEGG:ns NR:ns ## COG: BMEII0914 COG0845 # Protein_GI_number: 17989259 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Brucella melitensis # 2 371 74 438 451 167 33.0 3e-41 MKKLMYIFLILPLIMSGCKGKKETERGGMPTPEISVAYPLVQNITLTKDYPGYLTTEQTV NLVARVNGALQSASFTPGTRVKQGQLLFVIEPTIYKDNVTQAEAQLKTALAQLEYARNNY SRMKEALKSDAVSRIQVLQAESNVAEATAAVSNAEATLNTAHTNLGYCYIRAPFNGTVSR SLYDVGSYISGAAQPVTLATIYKDDRMYTYFNVADNQWLSMLLSQNGKEKELPKNVIVRL GENGTQNYPATLDYLSPNVDLNTGTLNVRANLDNPKGILKSGLYVSITLPYAEAKQAVLV PEASIGTDQLGKYLYIVNDSNIVRYRHIEPGQLVNDTLRQIKSGLSPKEQYVTTALMKVR DGMKVKPVSVNHESSTSNR >gi|226332018|gb|ACIB01000038.1| GENE 119 133915 - 137037 2671 1040 aa, chain + ## HITS:1 COG:BMEI1629 KEGG:ns NR:ns ## COG: BMEI1629 COG0841 # Protein_GI_number: 17987912 # Func_class: V Defense 
mechanisms # Function: Cation/multidrug efflux pump # Organism: Brucella melitensis # 3 1028 2 1027 1051 860 44.0 0 MFSKFFINRPIFATVLALIIVVAGLVTLNILPVAQFPEITPPTVQVSAFYPGANAETVAQ TVGIPIEQQVNGVDGMLYMSSTASSSGAYSLTITFAVGTDIDMATVQVQNRVSVAQSSLP EPVIVQGVTVQKQSSNIVMFLTMQAQDSVYDGLYLTNYAQLNLVDQLTRVPGVGAVNVMG AGNYSMRIWLDPEAMRIRNLSPAQIYQAIQSQNIEVSAGYIGQPIGKNNNNAYQYTLNVQ GRLTSPEEFGNIIIRTEEGGKMLRLKDVARIDLGSSSYNVVSKLKGHPTAAIAIYQQPGS NSLDVSKGVKAKMQELAQNFPAGVSYNVTLDTTDVINASIDEVLVTFLETTLLVVLVIFL FLQNWRAVIIPCITIPVSLIGTLAVMAALGFSINTLTLFGLILAVAIVVDDAIVVVENAS RLLETGQYSPKEAVTKAMGEITGPIVGVVLVLLAVFIPTTLISGISGQLYKQFALTIAAS TVLSGINSLTLTPALCALFLEHNKPSNFFIYKGFNKVYDKTQNLYDRIVKGLLVRPGLAL ISYGIITAVAVILFMKWPSTFVPDEDDGYFIAVIQLPPASSLERTQAVGRKVNQILDSYP EVKDYIGISGFSIMGGGEQSNTGTYFVVLKNWDQRKGKEHTAAAVVERFNEMAYGIQEAQ IFAMVPPAIPGLGASGGLQLQLEDRNNLGPTEMQRAVETLMATYHTQPALASISSMYQAN VPQYFLNIDRDKVQFMGIQLDNVFSTLSYYMGAAYVNDFVQFGRIYQVKIEAGEQAQKVI DDVLKLSVPNAKGDMVPFSSFTKVEERLGMDQISRYNMYSTASITCNVASGSSSGEGIQQ MEDLIKDQLGNEFGYEWTSVAYQETQAGNTTTIVFIMALLVAFLVLAAQYESWTSPLSAI MGLPMALLGAMIGCSVMGTPVSIYTQIGIILLIALSAKNGILIVEFARDFRAEGNSIRDA AYEAGHVRLRPILMTSFAFVLGVMPLLFATGAGAQSRIALGAAVVFGMALNTLLATIYIP NFYELMQKFQENVLDRKKKK >gi|226332018|gb|ACIB01000038.1| GENE 120 137382 - 137831 354 149 aa, chain - ## HITS:1 COG:no KEGG:BF2394 NR:ns ## KEGG: BF2394 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 149 1 149 149 298 100.0 5e-80 MLSLNLPVFDTKIATRNGKNVIFDVIRRRYVALTPEEWVRQHFVHFLIVHKGYPSSLMAN EVLLNLNGTKKRCDTVLYKRDLSARMIVEYKAPHIEITQAVFDQITRYNMVLKVDYLVVS NGMQHYCCRMDYDTQSYSFLSDIPDYDAL >gi|226332018|gb|ACIB01000038.1| GENE 121 137896 - 138672 629 258 aa, chain + ## HITS:1 COG:CPn0894 KEGG:ns NR:ns ## COG: CPn0894 COG0775 # Protein_GI_number: 15618803 # Func_class: F Nucleotide transport and metabolism # Function: Nucleoside phosphorylase # Organism: Chlamydophila pneumoniae CWL029 # 3 255 6 263 293 205 39.0 8e-53 MKTKQEIVANWLPRYTKRNLEDFGEYILLTNFNKYVEIFAEKFNVPILGKDANMISASAE 
GITIINFGMGSPNAAIIMDLLSAISPKACLFLGKCGGIDKKNKIGDLILPIAAIRGEGTS NDYFPPEVPSLPAFMLQRAVSSAIRDYARDYWTGTVYTTNRRIWEHDDTFKEYLKRTRAM AVDMETATLFSCGFANHIPTGALLLVSDQPMIPEGVKTDKSDNIVTKNYVEEHVEIGIAS LRMIIDEKKTVKHLKFDW >gi|226332018|gb|ACIB01000038.1| GENE 122 138691 - 139710 887 339 aa, chain + ## HITS:1 COG:BS_yqeN KEGG:ns NR:ns ## COG: BS_yqeN COG1466 # Protein_GI_number: 16079610 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, delta subunit # Organism: Bacillus subtilis # 10 274 4 272 347 83 26.0 7e-16 MAKQELTCDDILKELRAKQYRPIYYLMGEESYYIDLIADYITDNVLTDTEKEFNLTVVYG ADVDVATVINAAKRYPMMSEHQVVIVKEAQAIRNIEELSYYLQKPLNSTILVVCHKHGAL DRRKKLAAEIEKTGILFESKKIKEAQLPAFISSYMKRKGIDMEPKATAMLADFVGTDLSR LTGELEKLIITLPGGQKRVTPEQIEKNIGISKDYNNFELRSALVEKDVLKANKIIKYFEE NPKTNPIQMTLSLLFNFYSNLMLAYYAPDKSEQGVATMLGLKTPWQARDYLTAMRKYTGV KTMQIVGEIRYADAKSKGVGNTSISDGDILRELVFKILH >gi|226332018|gb|ACIB01000038.1| GENE 123 140302 - 140454 107 50 aa, chain - ## HITS:1 COG:no KEGG:BF2397 NR:ns ## KEGG: BF2397 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 50 1 50 50 76 96.0 3e-13 MILLYKNKDITFVNIYCVTYTLHNCHILSLNMQQIVDIYLFFLVTGEPFK >gi|226332018|gb|ACIB01000038.1| GENE 124 140915 - 141370 238 151 aa, chain + ## HITS:1 COG:no KEGG:BF2398 NR:ns ## KEGG: BF2398 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 151 1 151 151 273 100.0 1e-72 MEIKDRIKIIMEKENMASGAFAESIGIQQSTLSHILNGRNNPSLDVIMKVHQKYNYVKLE WLLYGQGNISEESIQSASDFQPSLFAENAIIPPNGTVTPENRREMPLESSQNTPKEIVKQ EIRYIEKPSRKITEIRIFFDDNTYETFRGEK >gi|226332018|gb|ACIB01000038.1| GENE 125 141443 - 142219 466 258 aa, chain + ## HITS:1 COG:BH2535 KEGG:ns NR:ns ## COG: BH2535 COG0543 # Protein_GI_number: 15615098 # Func_class: H Coenzyme transport and metabolism; C Energy production and conversion # Function: 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases # Organism: Bacillus halodurans # 8 255 7 258 259 171 36.0 2e-42 
MKKFILDLTVTENLRLHTNYVLLKLTSQTVLPDMLPGQFAEIRIDGSPTTFLRRPISINY VDRQRNEVWFLIQLVGDGTKRLAQVNRGEIINVVLPLGNSFTIPEKPSDKLLLVGGGVGT APMLYLGEQLAKNGSKPTFLLGARSNKDLLQLEDFAAYGEVYTTTEDGSHGEKGYVTQHS ILNKIKFEQIYTCGPKPMMMAVAKYAKDNDINCEVSLENTMACGIGACLCCVENTTEGHL CVCKEGPVFNINKLLWQI >gi|226332018|gb|ACIB01000038.1| GENE 126 142207 - 143118 903 303 aa, chain + ## HITS:1 COG:aq_046 KEGG:ns NR:ns ## COG: aq_046 COG0167 # Protein_GI_number: 15605646 # Func_class: F Nucleotide transport and metabolism # Function: Dihydroorotate dehydrogenase # Organism: Aquifex aeolicus # 3 299 2 300 306 304 50.0 2e-82 MADLSVNIGKLQMKNPVMTASGTFGYGEEFADFIDITRIGGIIVKGTTLHKREGNPYPRM AETPSGMLNAVGLQNKGVEYFSNHIYPRIKDIQTHMIVNVSGSAIEDYVKTAEIINELDK IPAIELNISCPNVKQGGMAFGVTTKGVSEVVQAVRSAYKKTLIVKLSPNVTDIAEMARAA EANGADSVSLINTLLGMAIDAERKRPILSTVTGGMSGAAVKPIALRMVWQVAKAVNIPVI GLGGIMNWKDAVEFMLAGASAIQIGTANFIDPAITIKVIDGINDYLERHGCKSVSEIIGA LEV >gi|226332018|gb|ACIB01000038.1| GENE 127 143336 - 144013 549 225 aa, chain - ## HITS:1 COG:BH2479 KEGG:ns NR:ns ## COG: BH2479 COG0336 # Protein_GI_number: 15615042 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA-(guanine-N1)-methyltransferase # Organism: Bacillus halodurans # 1 224 1 225 246 243 49.0 2e-64 MRIDIITVLPEMIEGFFNCSIMKRAQDKGLAEIHIHNLRDYTEDKYRRVDDYPFGGFAGM VMKIEPIERCINALKAERDYDEVIFTTPDGEQFDQKMANSLSLSGNLIILCGHFKGIDYR IREHLITKEISIGDYVLTGGELAAAVMADAIVRIIPGVISDEQSALSDSFQDNLLAAPVY TRPAEYKGWKVPEILLSGHEAKIKEWELQQSLERTRRLRPDLLED >gi|226332018|gb|ACIB01000038.1| GENE 128 144090 - 146087 1613 665 aa, chain + ## HITS:1 COG:BH0649 KEGG:ns NR:ns ## COG: BH0649 COG0272 # Protein_GI_number: 15613212 # Func_class: L Replication, recombination and repair # Function: NAD-dependent DNA ligase (contains BRCT domain type II) # Organism: Bacillus halodurans # 4 665 7 668 669 563 45.0 1e-160 MTVKEKIEQLRLQLHQHNYNYYVLNAPEISDKEFDDLMRELQDLEQEHPEYKDENSPTMR VGSDINKNFTQVAHKYPMLSLSNTYSENEVTDFYDRVRKALNEDFEICCEMKYDGTSISL 
TYENGKLIRAVTRGDGEKGDDVTDNVKTIRSIPLVLHGDNYPEVFEIRGEILMPWEVFEA LNREKEAREEPLFANPRNAASGTLKLQNSAIVASRKLDAYLYYLLGDNLPTDGHYENLQE AAKWGFKISPLMRKCQTLQEVFDFINYWDVERKNLNVATDGIVLKVNSLKQQRNLGFTAK SPRWAIAYKFQAERALTRLNMVTYQVGRTGAVTPVANLDPVQLSGTVVKRASLHNADIIE GLDLHIGDMVYVEKGGEIIPKITGVDTSVRFMIGEKVKFITHCPECGSKLIRYEGEAAHY CPNETACPPQIKGKIEHFISRKAMNIDGLGPETVDMFYRLGLIHDTADLYRLTTDDIRGL DRMGDKSAENIIKGIMQSKEVPFERVIFALGIRFVGETVAKKIAKSFKDIEELENADLET LINIDEIGEKIARSILNYFANESNRKLVGRLKTAGLQLYRPEEDLSGHTDKLAGQSIVIS GVFTHHSRDEYKDLIEKHGGKNVGSISSKTSFILAGDNMGPAKLEKASKLGIKIMNEEEF LKLIS >gi|226332018|gb|ACIB01000038.1| GENE 129 146191 - 147084 832 297 aa, chain + ## HITS:1 COG:BH1742 KEGG:ns NR:ns ## COG: BH1742 COG0329 # Protein_GI_number: 15614305 # Func_class: E Amino acid transport and metabolism; M Cell wall/membrane/envelope biogenesis # Function: Dihydrodipicolinate synthase/N-acetylneuraminate lyase # Organism: Bacillus halodurans # 12 277 9 273 295 257 48.0 2e-68 MIQTRLKGMGVALITPFKEDESVDYDALMRLVDYLLQNNADFLCVLGTTAETPTLSEEEK KKIKKMVIDRVNGRIPILLGVGSNNTRAVVETLKNDDFTGVDAILSVVPYYNKPSQEGIY QHYKAIASATELPIVLYNVPGRTGVNMTAETTLRIAKDFQNVIAIKEASGNITQMDDIIK NKPANFDVISGDDGITFPLITLGAVGVISVIGNAFPREFSRMTRLALQGDFANALTIHHK FTELFNLLFVDGNPAGVKSMLNAMGMIENKLRLPLVPTRITTFEAIRKVLNELNIKC >gi|226332018|gb|ACIB01000038.1| GENE 130 147236 - 147862 333 208 aa, chain - ## HITS:1 COG:CAC0235 KEGG:ns NR:ns ## COG: CAC0235 COG4845 # Protein_GI_number: 15893527 # Func_class: V Defense mechanisms # Function: Chloramphenicol O-acetyltransferase # Organism: Clostridium acetobutylicum # 3 193 2 190 212 126 34.0 4e-29 MKQLIDLENWNRKEHFKFFSAFDDPFFGITTLVDFTNTYHQSKDEKKSFFLYSVHFLLQC VNEVEAFKLRIEGEQVVKYDFIHLSPTIGREDGTFGFGFFEYDADLEVFIQNAEKEIERV KNSTGLSFSENIGRLDLIRYSALPWFAFSEMKHAVSFGRGDSVPRISTGKLIKENGVYLL PISISGHHALMDGRNVAELIEKLETTKK >gi|226332018|gb|ACIB01000038.1| GENE 131 147982 - 148245 235 87 aa, chain - ## HITS:1 COG:no KEGG:BF2406 NR:ns ## KEGG: BF2406 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 87 1 87 124 179 98.0 2e-44 MNVEEFREYCLSFKGVHDRMPFKKATSEYDRDLLVFYVMDKWFCFVNIDAFDFCNIKCNA GQIEDLLDKYEGVQPGYHMNKKHWISV >gi|226332018|gb|ACIB01000038.1| GENE 132 148545 - 149438 458 297 aa, chain + ## HITS:1 COG:MA4278 KEGG:ns NR:ns ## COG: MA4278 COG1262 # Protein_GI_number: 20093067 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Methanosarcina acetivorans str.C2A # 11 287 9 267 270 160 37.0 2e-39 MKTLLNIKLHLSKKNIFTILVFILVLGGTTGCIQHKSDQKRLPALSFTVNGESFEMIPVE GGTFIMGGTSEQGNDCENNEKPTHEETLPFFYIGKYEVTQKLWKAVMGTDFDQSYNSGCE DCPAEYISWNDTQKFISKLNTLTNKTFRLPTDIEWEYAARGGKYSEKYKYSGSNDIDEVA WYIENYQKSKYGDKGTTHPVGMKKPNELGLYDMSGNVWEWCDNWYTQEYSQNGKSVHPGW PFNGTSAFFRRVLRGGSWGGTAKGCRVSYIDYDVPNYRDEYGGFRLVLVPDSVQTAN >gi|226332018|gb|ACIB01000038.1| GENE 133 149874 - 152126 1598 750 aa, chain - ## HITS:1 COG:PA3339_1 KEGG:ns NR:ns ## COG: PA3339_1 COG1752 # Protein_GI_number: 15598535 # Func_class: R General function prediction only # Function: Predicted esterase of the alpha-beta hydrolase superfamily # Organism: Pseudomonas aeruginosa # 7 286 24 304 308 166 35.0 2e-40 MLVQAQKVGLVLSGGGAKGLTHIGIIRALEENNIPIDYITGTSMGAIVGSLYAMGYSPDD METLLKSEDFKRWYSGEVEEKYMYYFKKNLPTPEFFNIRFSFKDSLSLKPQFLPTSVVNP IQMNLVFIDLYARATAACDGDFDKLFVPFRCIASDVYNKKQLILKRGDLGDAVRASMSFP FMFKPIEIDSMLAYDGGIYNNFPTDVMREDFHPDIIIGSVVSTNPGKPKENDLMSQIENM VMQKTDYSLPDSAGILMTFKYNDVSLMDFQRIDELEKIGYDRTMSLMDSIKSRIHRRVNV DNIRLRRLVYKSNYPELRFKNIYIDGANTHQQVYIKKEFHTSDDKEFTYEDLKRGYFRLL SDNMISEIIPHAVFNPEDDTYDLHLKIKMENEFSVRVGGNVSTTSSNQIYLGLAYQNLNY YSKEFTLDGQLGKIYNNAQFMAKVDFATTIPTSYRFIASISTFDYFKKDKLFSKNDKPAF NQKDERFLKLKVALPFLSSKRLELGFGIAQIEDRYFQNNVIDFDKDKYDKSGYRLFGGSV SFNGSTLNSRQFPIQGAREALVAQIFTGNESFRPGVNSENKKPVKEKHSWLQLSYMKEKY HKMGANWILGWYLDAVYASKNFSENYTATMMQASEFAPTAHSKLTYNEAFRANQYVAAGI RPIYRLNQMFHVRGEFYGFLPIFPIERNSINKAYYGKAFSRFEYLGEISVVCQLPFGAIS AYVNHYSSPRREWNVGLTLGWQLFNYRFIE >gi|226332018|gb|ACIB01000038.1| GENE 134 152344 - 154389 2172 681 aa, chain - ## HITS:1 COG:alr2323 KEGG:ns NR:ns 
## COG: alr2323 COG0326 # Protein_GI_number: 17229815 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone, HSP90 family # Organism: Nostoc sp. PCC 7120 # 1 575 2 595 658 458 41.0 1e-128 MQKGNIGVTTENIFPIIKKFLYSDHEIFLRELVSNAVDATQKLNTLASISEFKGELGDLT VHVSLGKDTITISDRGIGLTAEEIDKYINQIAFSGANDFLEKYKNDANAIIGHFGLGFYS AFMVSKKVEIITKSYKEGAQAVKWTCDGSPEFTLEEVEKADRGTDIVLYIDDDCKEFLEE SRISALLKKYCSFLPVPIAFGKKKEWKDGKQVETAEDNVINDTIPLWTKKPSELSDEDYK KFYRELYPMSDEPLFWIHLNVDYPFHLTGILYFPKVKSNIDLNKNKIQLYCNQVYVTDSV EGIVPDFLTLLHGVLDSPDIPLNVSRSYLQSDSNVKKISTYISKKVSDRLQSIFKNDRAQ FEEKWNDLKIFINYGMLTQEDFYDKAQKFALFTDTDGKHYTFEEYQTLIKDNQTDKDKNL IYLYANNKDEQFAYIEAAKNKGYNVLLMDGQLDVAMVSMLEQKLEKSRFTRVDSDVVDNL IVKEDKKSDVLEASKQEALSAAFKSQLPKMEKVEFNVMTQALGENGSPVMITQSEYMRRM KEMANIQAGMSFYGEMPDMFNLVLNSDHKLVKEVLADEEKECSAAIAPIQTELEDVTKRR DALKKKQEGKKDEDIPTAEKDELNDLDKKWDELKQQKDSIFAGYAGKNKVVRQLIDLALL QNNMLKGEALNNFVKRSIELI >gi|226332018|gb|ACIB01000038.1| GENE 135 154520 - 157054 1959 844 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 [Bacillus selenitireducens MLS10] # 1 840 2 815 815 759 46 0.0 MNNQFSQKVSDIIVYSKEEANRLKSSYIGPEHLLLGMLRDGEGKAIEILSKLKTNLTDIK KQIEAILKEHADDMLLPDADVPLSNGAAKILKLCILEARVMKSQVADTEHVLLAILKDKD NLAATVLEANHVNYQQVFEQLSLQPDISAGMGFTEDDDDEEEMNQSRSSHGSGERQQQAQ TASRKPTNDTPVLDNFGTDMTKAAEEGRLDPVVGREREIERLAQILSRRKKNNPILIGEP GVGKSAIVEGLALRIIQKKVSRILFDKRVVALDMTAVVAGTKYRGQFEERIRSILNELQK NPNVILFIDEIHTIVGAGSAAGSMDAANMLKPALARGEIQCIGATTLDEYRKNIEKDGAL ERRFQKVMVEPTTADETLQILRNIKDKYEDHHNVNYTDAALEACVKLTDRYITDRNFPDK AIDALDEAGSRVHLTNVSVPKEIEDQEKLIEEAKNNKNEAVKSQNFELAASFRDKEKELA VQLDVMKKDWEERLKDNRETVDEEEIANVVSMMSGIPVQRMAQAEGIKLAGMKEDLQSKV IAQDDAIKKLVKAILRSRVGLKDPNKPIGTFMFLGPTGVGKTHLAKELAKYMFGSSDALI RIDMSEFMEKFTVSRLVGAPPGYVGYEEGGQLTEKVRRKPYSIVLLDEIEKAHPDVFNLL LQVMDEGRLTDSYGRMVDFKNTVIIMTSNIGTRQLKEFGRGVGFATQSRLDDKEFSRSVI QKALNKSFAPEFINRVDEIITFDQLSLEAITKIIDIELKGLYNRIESIGYKLVIEDKAKQ 
FVASKGYDVQYGARPLKRAIQTYLEDGLSELIISADLNEGDTITVSLNEEKGELEMKNEA KTAE >gi|226332018|gb|ACIB01000038.1| GENE 136 157446 - 159983 2436 845 aa, chain + ## HITS:1 COG:BH0007 KEGG:ns NR:ns ## COG: BH0007 COG0188 # Protein_GI_number: 15612570 # Func_class: L Replication, recombination and repair # Function: Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit # Organism: Bacillus halodurans # 1 812 1 804 833 871 55.0 0 MLEQDRIIKINIEEEMKSSYIDYSMSVIVSRALPDVRDGFKPVHRRILYGMMELGNTSDK PYKKSARIVGEVLGKYHPHGDSSVYFAMVRMAQEWAMRYPLVDGQGNFGSVDGDSPAAMR YTEARLNKLGEEMMQDLYKETVDFEPNFDNTLMEPKVMPTRIPNLLVNGASGIAVGMATN MPPHNLSEVIDACEAYLDNKDVTVEELMEYVKAPDFPTGGYIYGISGVREAYLTGRGRVV MRAKAEIESGQTHDKIVVTEIPYNVNKAELIKAIADLVNEKRIEGISNANDESDREGMRI VIDIKRDANASVVLNKLYKMTALQTSFGVNNVALVNGRPKMLNLRDLIVYFVEHRHDVVI RRTQFDLRKAKERAHILEGLIIASDNIDEVIRIIRAAKTPNDAISGLMERFNLSEIQARA IVEMRLRQLTGLMQDQLHAEYEEVMKQIAYLESILADDEVCRKVIKDELLEVRAKYGDER RSEIVYSSEEFNPEDFYADDQMIITISHMGYIKRTPLTEFRAQNRGGVGSKGTETRDEDF VEHIYPATMHNTMMFFTQKGKCYWLKVYEIPEGTKNSKGRAIQNLLNIDSDDAVNAYLRV KSLNDQEYINSHYVLFCTKNGVIKKTSLEQYSRPRQNGVNAITIREDDRVIEVRMTNGNN EIIIANRNGRAIRFHEAAVRVMGRTATGVRGITLDDDGQDEVIGMICIKDLETESVMVVS EQGYGKRSDIEDYRKTNRGGKGVKTMNITEKTGKLVTIKSVTDENDLMIINKSGITIRLK VADVRIMGRATQGVRLINLEKRNDQIGSVCKVTSESLEDEVPEEEREGNIPSDPETNTPV NETEE >gi|226332018|gb|ACIB01000038.1| GENE 137 160016 - 161227 1272 403 aa, chain + ## HITS:1 COG:no KEGG:BF2412 NR:ns ## KEGG: BF2412 # Name: not_defined # Def: TPR repeat-containing protein # Organism: B.fragilis # Pathway: not_defined # 1 403 1 403 403 696 100.0 0 MKRVLFSMVLLMAVSFAFAQEKNVKEAKSIAGEVKPDFAKAEQLINEALTNPETKDNAAT WDVAGYIQKRINEKEMENAYLRKPYDTLKVYNSVLNMYNYYVKCDELAQIPNEKGKIKNK YRSANSKTILAERPNLINGGIQYFNLNKNEDALKYFAAYVDAATLPMMEKENLLEKDTIL PQVAYYATLAADRVGDKDAVMKYAQYALKDKENGQFAMQLLTDAYKAKGDTAKWVEKLQE GIVKFPENQYFFANLVDYYSSSNQNDKAMQFADDMLAKDPNNKLYLYVKAYLYHNMKDYE KAIEFYKKTLDIDPAYAEACSNLGLVYLLQAQEYADKAPADINDPNYATAQAEIKKFYEA AKPYYEKARELKPDQKDLWLQGLYRVYYNLNMGPEFEEIEKMM >gi|226332018|gb|ACIB01000038.1| 
GENE 138 161361 - 161528 89 55 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFNKKEAIYLYIVNCLFYLIPISILLILPSLALKKIRTCILTGFSHFIHHETNKK >gi|226332018|gb|ACIB01000038.1| GENE 139 161577 - 162698 773 373 aa, chain - ## HITS:1 COG:MJ0531 KEGG:ns NR:ns ## COG: MJ0531 COG0589 # Protein_GI_number: 15668711 # Func_class: T Signal transduction mechanisms # Function: Universal stress protein UspA and related nucleotide-binding proteins # Organism: Methanococcus jannaschii # 81 236 23 164 170 62 29.0 1e-09 MEEKLVTLAILTYTKAQILKNVLENEGIETYIHNVNQIQPVVSSGVRLRIKESDLPRALK ITESSAWLAESIVGEKTPKVEHRTKKVLIPVDFSNYSMKACEFGFNFAKSFDAEVILLHV YFTPIYASSLPYGDVFNYQISDEETVKNVLHKVHDDLNTLSEKIKQKVASGEFPDVKYTC VLREGIPEEEILRYNKEHRPRIIIMGTRGKNQKDIDLIGSVTAEIIERSHTTVLAIPENT PFNRFNEVKRIAFMTNFDQRDLIAFDSFINGLSPFHFSVSLIHLSDVKDTWNEIKLAGIK DYFQKQYPDLEIHYDVVMSNDFLNSLDNYIKTNQIDIITLTSYKRNIFSRLFNPGIARKM IFHSDTPLLVING >gi|226332018|gb|ACIB01000038.1| GENE 140 162777 - 163061 259 94 aa, chain - ## HITS:1 COG:no KEGG:BF2414 NR:ns ## KEGG: BF2414 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 94 1 94 94 176 100.0 2e-43 MRTITFNELRKIKDSLPSGSMHRIADELNLNVDTVRNFFGGHNFKEGKSVGIHLEPGPDG GLVMIDDTTVLDRALRILDELNMSKEEATESVQV >gi|226332018|gb|ACIB01000038.1| GENE 141 163272 - 164108 682 278 aa, chain - ## HITS:1 COG:no KEGG:BF2415 NR:ns ## KEGG: BF2415 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 278 1 278 278 521 99.0 1e-147 MKKILFIALGLLMAVTSFGQDSLTTDSTQMIQGDTVSIHNAEFSGSKLEDATKAEGDSAY IRNDFASAIQIYESLLRKGESADVYYNLGNSYYKINEIAKAILNYEKALLLQPGNGDIRA NLEIARGKTVDKVEVVPEIFFVTWTKALINSMSVDSWAIWGIVSFLLLIVSLYFFIFSKQ VVLKKVGFITGIIFLIVVVMANVFASKQKEELLNRDTAIIMSPSVTVRSTPSENGTSLFI LHEGHKVNIKDDSMKDWKEIRLEDGKVGWVPVGSIEII >gi|226332018|gb|ACIB01000038.1| GENE 142 164127 - 165971 1410 614 aa, chain - ## HITS:1 COG:no KEGG:BF2416 NR:ns ## KEGG: BF2416 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 614 1 
614 614 1167 99.0 0 MIKTSDEMRKLIFLLIALVAMTTQAQADGKVVFTASAPDAVVVGDQFRLSYTVNTIKVRD FRVPSIKGFEVLMGPNRSQRMQSINGVTNNSITFTYILMATAEGEYSIPGATITADGNQM VSNSVKIKVLPPDKTGNTADGKGTASSGNQSGMSSSVSNQDLLITATANKTNVYEQEAFL LTFKIYTRESQLRFENVKLPDFKGFHSQEIEMPANAKWSQEHYKGKNYFTTVYRQFVLFP QQSGKLTIEPARFDATIAKAVQSDDPFDAFFNGGSNYVNVSKVIVTPKITVNVNPLPAGK PANFSGGVGEFSITSSINSKEVKTNDAITIKLVISGTGNLKLIANPEIKFPEDFDVYDPK VDSKVRLTQEGLSGNKVIEYLAIPRHAGVYKIPGVSFSYFDIKSKSYKTLNTEDYEVKVE KGAGNADQVIANFTNKEDLKVLGEDIRYIKLNDVKLQPKDNLLFGSLLYWLFYIVPAVVF IVFFIVYRKQAAENANVAKMRTKKANKVATKRMKLAGKLLAENSKEAFYDEVLKALWGYI SDKLNIPVSRLSKDNVEEKLRNYGVSDELIKDFLNALNECEFARFAPGDESQAMDKVYSS SLEVMSKMENSIKR >gi|226332018|gb|ACIB01000038.1| GENE 143 165986 - 166696 709 236 aa, chain - ## HITS:1 COG:MA3260 KEGG:ns NR:ns ## COG: MA3260 COG0457 # Protein_GI_number: 20092076 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Methanosarcina acetivorans str.C2A # 43 135 232 321 395 58 35.0 8e-09 MSRNKYILFALLLSLSAGAFAQNAERDYIRKGNRLFKDSVFVDAEVNYRKALEANPKSTI SMYNLGNTLSQQQKFKDAMEQYVAATSIEKDKAKLGQIYHNMGVLFQSGKDYQKAVEAYK MSLRNNPKDDETRYNLALAQKLLKDQQQNQQNQDQNQDQNKDDQQKQQDKKDQNKQNDQN KDQQQQQPPKSEKNDNEMSKENAEQLLNSVMQDEKGVQDKVKKQQTLQGRRLEKDW >gi|226332018|gb|ACIB01000038.1| GENE 144 166705 - 167730 1044 341 aa, chain - ## HITS:1 COG:VCA0172 KEGG:ns NR:ns ## COG: VCA0172 COG2304 # Protein_GI_number: 15600942 # Func_class: R General function prediction only # Function: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain # Organism: Vibrio cholerae # 9 324 8 316 318 97 27.0 3e-20 MFRFEEPAYLYLLLLLPLLAAFYLYSNYRKRKAIRKFGDPVLMAQLMPDVSKYRPDVKFW LLFTAIGLFAVLLARPQFGSKLETVKRKGVEVMIALDISNSMLAQDVQPSRLEKAKRLIS KLVDGMENDKVGMIVFAGDAFTQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLAA RSFTPQEGVGRAIVVITDGENHEGGAVEAAKEAAKKGIQVNVLGVGLPDGAPIPIEGSND FRRDREGNVIVTRLNEAMCQEIAKEGNGIYVRVDNSNSAQKAINQEINKMAKSDVESKVY TDYNEQFQVIAWMILLLLLVEMLILDRKNPLFKNIRLFSNK >gi|226332018|gb|ACIB01000038.1| GENE 145 167744 - 168727 773 327 aa, 
chain - ## HITS:1 COG:VCA0172 KEGG:ns NR:ns ## COG: VCA0172 COG2304 # Protein_GI_number: 15600942 # Func_class: R General function prediction only # Function: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain # Organism: Vibrio cholerae # 3 318 4 313 318 157 33.0 3e-38 MVFANIEYLFLLLLLVPYIVWYIMKRKKTEPTLQISDARVYAHAPKSYKNYLLHVPFGLR IITLILIILVLARPQTTNSWQNSEIEGIDIMLAIDVSTSMLAEDLKPNRLEAAKDVAAEF INGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLFQGIQCDIIEDGTAVGMGIANAVTRL KDSKAKSKVIILLTDGTNNKGDISPLTAAEIAKSFGIRVYTIGVGTNGMAPYPVRVGGTT QYINTPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKLEKTKLNVKEYSKRQEE YRWFALAAFLCILLEVLLRNSILKKIP >gi|226332018|gb|ACIB01000038.1| GENE 146 168779 - 169852 822 357 aa, chain - ## HITS:1 COG:no KEGG:BF2502 NR:ns ## KEGG: BF2502 # Name: not_defined # Def: putative membrane exported protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 357 1 357 357 632 100.0 1e-180 MNKYILLIVLLFLVSGRIAAQSVTVDAKIDSLQILIGEQAKVQLQVAMDAKQRAVFPSFT DTLVRGVEIVDIAKPDTQYLNDRQRMLITQEYTVTSFDSALYYIPPMGVKIDNKEYKSKA LALKVYSMPVDTLHPDQFFGQKTVMKAPFAWEDWYGLIACSFLALPLLGLLIYLIIRIRD NKPIIRKVKVEPKLPPHQLAMKEIERIKTEKIWQKGQSKEYYTELTDALRTYIKNRFGFN ALEMTSSEIIDKLLEFNDKEAISDLKYLFQTADLVKFAKHDPQMNENDANLINAIDFINE TKQLEEENQKPQPTEITIIEKRSLRTKILLICGIVFLSAALIATFVYIGLQLYNLFG >gi|226332018|gb|ACIB01000038.1| GENE 147 169861 - 170730 616 289 aa, chain - ## HITS:1 COG:BB0175 KEGG:ns NR:ns ## COG: BB0175 COG1721 # Protein_GI_number: 15594520 # Func_class: R General function prediction only # Function: Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) # Organism: Borrelia burgdorferi # 3 276 8 278 291 139 31.0 5e-33 METSEILKKVRRIEIKTRGLSNNIFAGQYHSAFKGRGMAFSEVREYQFGDDIRDIDWNVT ARFNKPFVKVFEEERELTVMLMVDVSGSLEFGTVKQLKKDMVTEIAATLAFSAIQNNDKI GVIFFSDRIEKFIPPKKGRKHILYIIRELIDFKPDSRRTNIRLALEYLTNVMKRRCTAFI LSDFIDQESFKNAMTIANRKHDVVAIQVYDRRVAELPAVGLMRIKDAETGHEQWIDTSSA GVRRAHHEWWVNKQTELDETFTKSNVDSVSVRTDQDYVKALLNLFAKRN >gi|226332018|gb|ACIB01000038.1| 
GENE 148 170787 - 171782 1064 331 aa, chain - ## HITS:1 COG:Rv1479 KEGG:ns NR:ns ## COG: Rv1479 COG0714 # Protein_GI_number: 15608617 # Func_class: R General function prediction only # Function: MoxR-like ATPases # Organism: Mycobacterium tuberculosis H37Rv # 30 331 52 352 377 312 52.0 5e-85 MAESIDIRELNERIERQSAFVTNLTTGMDQIIVGQKHLVESLLIGLLSDGHVLLEGVPGL AKTLAIKTLASLIDAKYSRIQFTPDLLPADVVGTMVYSQKDESFQVKKGPIFANFVLADE INRAPAKVQSALLEAMQERQVTIGKETFLLPEPFLVLATQNPIEQEGTYPLPEAQVDRFM LKVIIDYPKQEEEKLIIRQNINGEKFNVKPILKAEEIIEARKVVRQVYLDEKIERYIVDI VFATRYPEKYDLKELKDMIGFGGSPRASINLALAARTYAFIKRRGYVIPEDVRAVAHDVL RHRIGLTYEAEASNVTSDEIVSKILNKVEVP >gi|226332018|gb|ACIB01000038.1| GENE 149 172026 - 173477 1013 483 aa, chain - ## HITS:1 COG:no KEGG:BF2423 NR:ns ## KEGG: BF2423 # Name: not_defined # Def: putative integration host factor IHF alpha subunit # Organism: B.fragilis # Pathway: not_defined # 1 483 1 483 483 733 99.0 0 MNEKLTIQDLVELLVNRHEVSQEDADVFVREFFLLIEQALDADQYVKIKGLGTFKLIGVN SRESVNVNTGERIKIEEHTKISFTPDPSLRDIINRPFSHFETVVLNENTVLEDTPIEELE EESGNISETTELPLITETVEREEAKAEEKVVETEANGKIEPETSKGQDVVSSDVEVAEDV SEVMKESERTEVVDDIDILETVEDVSIHKGSEAVVEGSSIAEVREEGGLDKVVENSEEPI QFTGDTGQETTDNLKKVIEDEGSPKLTAEEIIAREIQKAEVSTIPVKKEKRPKKEVKPEN QKSPVPYLIVIIVFVMSLCGAALVFIYYPDLFSKKESEQSITTETVEKKEPIREIPLDTV AKADTIVKVVAKTPNQQEIKQMSERVNVSEKVDKTSESESVSREKSTKTVAIPVKPDSVN YTITGTKATYTIKEGETLTRVSLRFYGTKDLWPYIVKHNRGVIKNPNNVPYGTVLKIPEL VKK >gi|226332018|gb|ACIB01000038.1| GENE 150 173470 - 173760 281 96 aa, chain - ## HITS:1 COG:no KEGG:BF2424 NR:ns ## KEGG: BF2424 # Name: not_defined # Def: DNA-binding protein HU # Organism: B.fragilis # Pathway: not_defined # 1 96 1 96 96 167 98.0 1e-40 MNNKEFTSELSRRLGYNTKYTSELITSLLSDITQELQEGNAIGIQGFGTFEVKKKAERIV INPVTKLRLLVPPKLVLAFKPSPILKDKFKETFPYE >gi|226332018|gb|ACIB01000038.1| GENE 151 173835 - 175133 1226 432 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229254937|ref|ZP_04378866.1| SSU ribosomal protein S12P methylthiotransferase [Capnocytophaga ochracea DSM 7271] # 1 430 6 431 433 
476 52 1e-133 MKRKTIDIITLGCSKNLVDSEQLMRQLEEAGYDVTHDSEKPTGEIAVINTCGFIGDAKEE SINMILEFAQEKEEGNLEKLFVMGCLSERYLKELAIEIPQVDKFYGKFNWKGLLQDLGKA YHEELHIERTLTTPKHYAYLKISEGCDRKCSYCAIPIITGRHVSRPIEEILDEVRYLVSN GVKEFQVIAQELTYYGVDLYKKQMLPELIERISEIPGVEWIRLHYAYPAHFPEELFRVMR ERDNVCKYMDIALQHISDNMLQRMRRHVTKKETYRLIEQFRKEVPGIHLRTTLMVGHPGE TEEDFEELKEFVRKVRFDRMGAFTYSEEEGTYAAANYEDSIPQELKQARLDELMAIQQGI STELSASKVGQKMKVIIDRIEGEYYIGRTEFDSPEVDPEVLIRCEGDNLMIGNFYQVQVI DSDEFDLFGEII >gi|226332018|gb|ACIB01000038.1| GENE 152 175130 - 176089 739 319 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163762490|ref|ZP_02169555.1| ribosomal protein L28 [Bacillus selenitireducens MLS10] # 5 317 2 322 336 289 46 7e-77 MGFFSFFSKEKKETLDKGLSKTKESVFSKIARAVAGKSKVDDEVLDNLEEVLITSDVGVE TTLNIIKRIEKRAAEDKYVNTQELNSILREEIAALLTENNSDDVADFDVPVEKKPYVIMV VGVNGVGKTTTIGKLAYQFKKAGKSVYLGAADTFRAAAVEQLVIWGERVDVPVIKQKMGA DPASVAFDTLSSAVANNADVVIIDTAGRLHNKVGLMNELTKIKNVMKKVVPDAPNEVLLV LDGSTGQNAFEQAKQFTLATEVTAMAITKLDGTAKGGVVIGISDQFKIPVKYIGLGEGME DLQVFRKKEFVDSLFGENA >gi|226332018|gb|ACIB01000038.1| GENE 153 176239 - 176397 263 52 aa, chain - ## HITS:1 COG:no KEGG:PRU_0750 NR:ns ## KEGG: PRU_0750 # Name: not_defined # Def: hypothetical protein # Organism: P.ruminicola # Pathway: not_defined # 1 51 1 51 52 80 92.0 2e-14 MAKKTVASLHEGSKEGRAYTKVIKMVKSPKTGAYIFDEQMVPNEKVQDFFKK >gi|226332018|gb|ACIB01000038.1| GENE 154 176417 - 176605 320 62 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53713719|ref|YP_099711.1| 50S ribosomal protein L33 [Bacteroides fragilis YCH46] # 1 62 1 62 62 127 100 3e-28 MAKKAKGNRVQVILECTEHKDSGMPGTSRYITTKNRKNTTERLELKKYNPILKRVTVHKE IK >gi|226332018|gb|ACIB01000038.1| GENE 155 176632 - 176892 456 86 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53713720|ref|YP_099712.1| 50S ribosomal protein L28 [Bacteroides fragilis YCH46] # 1 86 1 86 86 180 100 5e-44 MSKICQITGKKAMIGNNVSHSKRRTKRTFDLNLFNKKFYYVEQDCWISLSLCAAGLRIIN KKGLDAALNDAVAKGYCDWKTIKVVG >gi|226332018|gb|ACIB01000038.1| GENE 156 177014 - 178246 1032 410 aa, chain - ## HITS:1 
COG:alr4808_1 KEGG:ns NR:ns ## COG: alr4808_1 COG1058 # Protein_GI_number: 17232300 # Func_class: R General function prediction only # Function: Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA # Organism: Nostoc sp. PCC 7120 # 1 249 1 250 252 130 32.0 4e-30 MFAEIITIGDELLIGQVTDTNSAWMGRELNKVGIEVIRVVSVRDRADEIIEAVDASMKRA NIVLVTGGLGPTKDDITKQTLCKYFGTRLIFSEAAFENVKRVLAGKIPMNALNKSQAMVP EDCIVINNRVGSASVSWFEKDGKVLVSMPGVPQEMTTVMSEEVIPRLCAKFRTDAIIHRT FTVQNYPESVLAEKLESWEMALPVCLKLAYLPKPGLIRLRLTGRGQNRSEVEACVDTESA KLEAILGEDILDEEDTPIEILIGELLKKKNLTLSTAESCTGGSIAARITSVAGSSEYFKG SIVAYANEVKTELLGVSMETLEKRGAVSEETVIEMVKGAMKALKTDCAVATSGIAGPSGG TEEKPVGTVWIAAAYKSEICTMKQETNRGREMNVERASNNALLLLRKLVK >gi|226332018|gb|ACIB01000038.1| GENE 157 178254 - 179273 648 339 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|227425790|ref|ZP_03908856.1| SSU ribosomal protein S18P alanine acetyltransferase [Atopobium parvulum DSM 20469] # 4 326 479 804 832 254 43 3e-66 MSTIILGIESSCDDTSAAVIKDGYLLSNVVSSQAVHEAYGGVVPELASRAHQQNIVPVVH EALKRAGVTKEELSAVAFTRGPGLMGSLLVGVSFAKGFARSLNIPMIDVNHLTGHVLAHF IKEEGEANEQPDFPFLCLLVSGGNSQIILVKAYNDMEILGQTIDDAAGEAIDKCSKVMGL GYPGGPIIDRLARQGNPKAYTFSKPHISGLDYSFSGLKTSFLYSLRDWMKEDPDFIEHHK NDLAASLEATVVDILMDKLRKAAKQYKINEVAVAGGVSANNGLRNAFREHAEKYGWKIFI PKFSYTTDNAAMIAITGYFKYQDKDFCSIEQPAYSRVTL >gi|226332018|gb|ACIB01000038.1| GENE 158 179306 - 183910 3160 1534 aa, chain + ## HITS:1 COG:no KEGG:BF2513 NR:ns ## KEGG: BF2513 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1534 1 1534 1534 2932 99.0 0 MIKVGFITNYFIFLFSKSKQPPIRTLKKTVRWVIGIILGIYIGTIILLNIPYIQRNMTTF VTKELSRTLGTELTIGKIDIGLLNRIIIDDVLLDDQSGKEMLKITRLSAKFDIIPLFNGK ITISSVQLFGFNINLNKPAPHMEPNFKFVLDAFASKDTVKTKKDIDLRINSILIRRGKLS YDVLSEEETPGKFNPQHIKLHNIIANISLKALQNDSINAAIKRLSVDEQSGFELRKLSLK VIANNKGMKIENFAIEMPGTEMKMDTIRMEYDSLKALNHFADNVRFSFRTLPSHVTLNDI SAFVPALSNFKEKLDLNIDVEGTLNQLNCRTLEINAGDKFRLKGDVSLQDLSRPQDAYVY 
GHLANLSANKEGIGFLVRNLSPHYNGVPPVLQHLGNTSFHGEISGYFTDLVMYGLFRTDI GSVQTDLKLSSDKAKALFSYSGGVKTTDFELGQLLGNKQLGKITFNLDVRGNHYKSQYPS ITLKGLIASLEYSNYKYENITLDGEFKRGGFDGKVALNDENGSVHLNGNINVVEKVPTFN FNAVIDKIRPHDLNLTKEYPDAEFSLKLKANFRGGSIDEMMGEINIDSLQFTAPEKSYFL DNINITATRQDKENQLKLTSSFLKASIEGNYLYHTLPASVMNIMRRYIPSLIQPDKKPIK TNNNFSFDIHIFNTELLSTVFDIPLKIYSHSTVKGYFNDQAQRLRVEGYFPRLQYQNTFI ESGLVLCENPTDQFKAKVRFNNLKKESAVSISLDAQAKNDTINANINWGNNAISTYSGRL SAAASFFRAAEEKSPLKTVVDIKQTDIILNDTLWQVHPSQVVVDSGKIDVNDFYFSHQDR HIRINGRISEQAKDTLKVELKDINVGYVFDVVNFDDVDFKGDATGTAYASGILKEPVMNT RLHFKNFTFNDASLGAMDIYGAWKNDMRAIFLDAHMEEEGVSKTHVIGHVYPLKPESKLD LNIETEHTNIQFLQYFMRSIVEDLHGRTSGKAHFYGKFKALNIEGNLMTDASLKIGILNT SFTVTDTIRLSTSGISFDNIRIADMEGHQGTMNGKLNFRHFRDLSYHFEFNVNNMLLMNT KENPDINFYGKVYGTGNAMLIGNPQELQVNAAVTTNRNTNFVYITNATASAASNQFIKFV DKTPRRFVQDSINVMSEYDRLQQEMEEEESKTDIRLNLLIDATPDATMKIIMDPIAGDYI SGKGSGNIRTEFFNKGDVKMFGNYRINQGIYKFSLQEVIRKDFIIKDGSSITFNGPPLDA TLDIQASYTVNSASLNDLIPDASETITQQPNVKVNCIMNLTGILWRPNIKLGIELPNERD EIQTLVRNYISTDEQMNMQILYLLGIGKFYPQESTGGTQNSNMTSSALFSTLSGQLNNLL SQVFDNNNWNIGTNLSTGDKGWTDMEIEGILSGQLLNNRLLINGNFGYRDNPLANTNFIG DFEAEWLLNRSGDIRLKAYNETNDRYYTRTNLTTQGIGIMYKKDFNKWSELLFWNKWKLR NLRRKQAAAKDSIPNDSVKETQKAKSEMKREHPM >gi|226332018|gb|ACIB01000038.1| GENE 159 184006 - 184386 284 126 aa, chain - ## HITS:1 COG:MT0892.1 KEGG:ns NR:ns ## COG: MT0892.1 COG3304 # Protein_GI_number: 15840283 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Mycobacterium tuberculosis CDC1551 # 1 122 1 121 129 89 45.0 2e-18 MNPLLNLIWLLCGGIFTAIEYLISSLLMMITIIGIPFGFQTLKLAGLALWPFGKEVRSTP DSGGCLSIIMNIIWLFLGGIWISLSHLGWGILLCITIIGIPFGKQHFKLAGLALTPFGKV IVDKSF >gi|226332018|gb|ACIB01000038.1| GENE 160 184479 - 185972 1732 497 aa, chain - ## HITS:1 COG:BB0402 KEGG:ns NR:ns ## COG: BB0402 COG0442 # Protein_GI_number: 15594747 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Prolyl-tRNA synthetase # Organism: Borrelia burgdorferi # 8 497 5 488 488 471 49.0 1e-132 
MAKELKDLTKRSENYSQWYNDLVVKADLAEQSAVRGCMVIKPYGYAIWEKMQRQLDDMFK ETGHVNAYFPLLIPKSFLSREAEHVEGFAKECAVVTHYRLKNAEDGSGVVVDPAAKLEEE LIIRPTSETIIWNTYKNWIQSYRDLPILCNQWANVFRWEMRTRLFLRTAEFLWQEGHTAH ATREEAEEEAIRMLNVYAEFAEKYMAVPVVKGVKSANERFAGALDTYTIEAMMQDGKALQ SGTSHFLGQNFAKAFDVQFVNKENKLEYVWATSWGVSTRLMGALIMTHSDDNGLVLPPHL APIQVVIVPIYKNDEQLKLIDAKVEGIVAKLKQLGISVKYDNADNKRPGFKFADYELKGV PVRLVMGGRDLENNTMEVMRRDTLEKETVTCDGIETYVQNLLEEIQANIYKKARTYRDSR ITTVDSYDEFKEKIEEGGFILAHWDGTVETEEKIKEETKATIRCIPFESFVEGDKEPGKC MVTGKPSACRVIFARSY >gi|226332018|gb|ACIB01000038.1| GENE 161 186267 - 186974 314 235 aa, chain - ## HITS:1 COG:no KEGG:BF2436 NR:ns ## KEGG: BF2436 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 4 235 13 244 244 487 99.0 1e-136 MLLADKYEVEEFIMDDPIQFPHRYTDKADIEISGLIASWIATGNRKAIIKSGDRIDHELF LNAPYRYILSEEWRKYRGVTSSFYRYYSWNDFYILCQTLYAAYREHGDLESYLCHSLSSG TPLERLQSVFGHINGMPALSSASEAKKMCMFLRWMIRRDSPVDLGIWRSFSPSDLIIPLD THVHRISTDLGLTNARKCLKTARCITDALREIWPDDPVKGDFALFGFGINEPVKS >gi|226332018|gb|ACIB01000038.1| GENE 162 187133 - 190366 2868 1077 aa, chain - ## HITS:1 COG:TVN0895_2 KEGG:ns NR:ns ## COG: TVN0895_2 COG0793 # Protein_GI_number: 13541726 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Thermoplasma volcanium # 673 1064 1 387 393 225 35.0 4e-58 MNKKLILSIFVLAGAPMLLSAAGEARLLRFPATNGNEIVFSYAGDLYKVPASGGEAQRLT SHVGYEMFPRFSPDGKTIAFTGQYDGNTEVYTMPATGGEPLRITYTATNSRDDLGDRMGP NNIVMTWTPDGQRIVYRNRISDGFSGKLFTVDKEGGLSEVIPLPEGGFCSYSPDGKQLAY NRVMREFRTWKYYKGGMADDIWVYNPGNKTVENVTNNVAQDIFPMWIGDEIFFLSDRDRI MNIFAYNTKTKQTVKVTNFTEYDVKFPSVHGNTIVFENGGYIYKMDAAARKAEKVNITLA SDNIYARTDLKEGANYVTAASLSPDGARMVVTSRGEVFNLPVEKGVTKNITRSPGAHDRD AQWSPDGTQIAYISDATGETELYLQNAAGGEPMQLTHKNDTYIRDFKWSPDSKKIVYMDR KNRVNLLDVASGKVSLLLQDPVGVPGGVTFSPDSEWLTYTRMGKNEINVVYVYNIAEKKE YPVTDKWYNSSSPVFSADGKYLIFSSARDFNPTYGSLEWNHVYNNMYGVYIALLSKDTSS PFMQKDAEVAVSNATPKSGDKKPADKKEVADASLVKFDPDGITDRIVRLPLSPSYYGNFY 
SDGNKVYYWGRGGTKMYDLASQKEESIADGASMDVTYDGKKALFFKGRQIYVTNLPSGKT ELTAPVDLSNMKITVDYPKEWAQIFDEAWRAYRDGFYQESMHGVDWKAIKEKYAVLLPYV KTRLDLNYIIGEMIGELNCGHAYVNPGETEQPKRINTGLLGAEITRDKSGFFRLEKIFPG ASWSKELRSPLTEPGVDVKVGEYIVAIDGVPTNTVKDMYSLLVGKAEIPTEISLNAKPQL SGARKVVISPLANEYPLKHYNWVQDNIKKVDQASNGRIGYIYIPDMGPEGLNEFARYFYP QLDKEGLIIDDRANGGGNVSPMILERLSREPYRLTMGRGTSHVGTVPDAVQVGPKVCLIN KYSASDGDLFPWGFRALGLGKLIGTRTWGGIVGISGSLPYMDGTDIRVPFFTSYDPKTGK WIIENHGVDPDILIDNDPVKEWNGEDQQLNRAIEEVMKQLKDRKPLPPVPAPRDFSK >gi|226332018|gb|ACIB01000038.1| GENE 163 190738 - 191256 499 172 aa, chain + ## HITS:1 COG:BMEI0693 KEGG:ns NR:ns ## COG: BMEI0693 COG2087 # Protein_GI_number: 17986976 # Func_class: H Coenzyme transport and metabolism # Function: Adenosyl cobinamide kinase/adenosyl cobinamide phosphate guanylyltransferase # Organism: Brucella melitensis # 5 169 8 172 173 132 39.0 3e-31 MKQIILITGGARSGKSSHAERLALSLSPNPVYLATSRIWDEEFRQRVLRHQANRGPEWTN IEEEKELSRHTLEGRVVLIDCVTLWCTNYFFDLEADTDKALTAVKAEFDRLTQQDATFIF VTNEIGMGGTSENLIQRKFTDMQGWMNQYIASRANRVILMVSGIPVKVKDEK >gi|226332018|gb|ACIB01000038.1| GENE 164 191279 - 192316 1062 345 aa, chain + ## HITS:1 COG:RSc2397 KEGG:ns NR:ns ## COG: RSc2397 COG2038 # Protein_GI_number: 17547116 # Func_class: H Coenzyme transport and metabolism # Function: NaMN:DMB phosphoribosyltransferase # Organism: Ralstonia solanacearum # 21 343 20 344 354 275 46.0 6e-74 MKTFQITRPDETIREALTDKINNLTKPKGSLGTLEELALQIGLIQQTLTPELRHPQNIIF AADHGIVDEGVSLSPKEITWQQISNFLHGGAGVNFLCRQHGFELKIVDAGVDYDLPYEKG IINMKVRKSSRNYLYEAAMTEEEMNLCIERGAEVVRQCHAEGCNVLSLGEMGIGNTSSSS MWMTCFTHIPLELCVGAGSGLDNAGVRHKYNVLQQALDHYQGDGSAHDLIRYFGGLEMVM AIGAMLQAAELKMIILVDGFIMTNCILAASQLYPEVLHYAIFGHQGDEAGHKLVLDAMGA KPLLNLGLRLGEGTGAICSYPIIDSAIRMINEMDNFAHAAITKYF >gi|226332018|gb|ACIB01000038.1| GENE 165 192329 - 193072 460 247 aa, chain + ## HITS:1 COG:VC1238 KEGG:ns NR:ns ## COG: VC1238 COG0368 # Protein_GI_number: 15641251 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin-5-phosphate synthase # Organism: Vibrio 
cholerae # 4 247 13 255 261 107 34.0 1e-23 MNILAAFIFFTRLPFWRIREVPAECFKHVVPYWPLSGWLTGGIMAGVLWLSAQILPFSVA VLLALAARLLITGALHEDGLADFFDGFGGGTNRERILSIMKDSHIGSYGVIGLIFYFLLL WSLLMSLPLSFACITLIAGDTISKLTSSQIINFLPYARKEEESKAKVVYNRMSGGECAFG LLCGILPSALLLPYRYWMAIVFPLVMLYLLCTLMKRKLQGYTGDCCGALFLLSELSFYLG IVILMFI >gi|226332018|gb|ACIB01000038.1| GENE 166 193075 - 193620 558 181 aa, chain + ## HITS:1 COG:RSc2395 KEGG:ns NR:ns ## COG: RSc2395 COG0406 # Protein_GI_number: 17547114 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose-2,6-bisphosphatase # Organism: Ralstonia solanacearum # 1 145 1 149 192 80 32.0 2e-15 MEVILIRHTSVDVPKGVCYGQTDVPLRDSFEEEASITAQQLQNDVFDAVFTSPLSRCTRL ADHCGYPDAIRDARLKELNFGEWEMQEFDKICDPRLEEWYNDYFHVAATGGESFMMQLQR VSEFLNEVSGKEYKRIAVFAHGGVLICAQIYAGILRMEDAFNALTPYGGVVRLQLNSKTE E >gi|226332018|gb|ACIB01000038.1| GENE 167 193598 - 194563 518 321 aa, chain - ## HITS:1 COG:BH1588 KEGG:ns NR:ns ## COG: BH1588 COG1270 # Protein_GI_number: 15614151 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CobD/CbiB # Organism: Bacillus halodurans # 15 300 4 296 319 203 36.0 5e-52 MDGMFFWYISLVYFCRLFPLPLAWLFDRWQGDPSWLPHPVVGFGKLIAWGEKCLNAGRAR VWKGGMMSVALIAGVYFFTFLFFKVIGEYSIILTALIQTLLIFCCLAGTTLIREVRMVFE AVDRSLDEGRKQVARIVGRDTSALSAQEVRTAALETLAENLSDGVIAPLFWYAVLGVPGM MAYKMVNTLDSMIGYRNERYRQFGCIAARIDDVANYIPARLTALLMILVSGRFSLLRFVG KYGSRHASPNSGYPEAALAGILNCRFGGPHYYFGEEVWKPFIGNNERALTTEDMKKAVCV NRQAEVLMVVLVWLTILLSLS >gi|226332018|gb|ACIB01000038.1| GENE 168 194590 - 195603 1009 337 aa, chain - ## HITS:1 COG:BH1589 KEGG:ns NR:ns ## COG: BH1589 COG0079 # Protein_GI_number: 15614152 # Func_class: E Amino acid transport and metabolism # Function: Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase # Organism: Bacillus halodurans # 45 335 55 357 370 168 35.0 1e-41 MIEGHGDDSYKYRYPIRSNFSSNVYNKVNLDGLRAHLCGRISAISAYPEPEPYTLEARLA DRHALPAASVCVTNGATEAIYLIAQTFRGTNTAILMPTFSEYADACRMHGHKVTSLYTLD 
AVPEDVRMVWLCNPNNPTGEVRDKKYLTELIAKHPRVCFVIDQSYEYFTLKELFTAQEAA GFPNVILLHSMTKRYAIPGLRLGYVTAHPGLIGRLRTNRMPWSVNQLAIEAGLYLLSEGI PAGLSMKDYLAECARLKSSLEAIGGLEVWPTDTHFMLVCLRFGKAAALKEYLAREEGILI RDASNFEGLDERFFRIATQTPEENDELVGAIAKWMAE >gi|226332018|gb|ACIB01000038.1| GENE 169 195596 - 197059 1442 487 aa, chain - ## HITS:1 COG:STM2019 KEGG:ns NR:ns ## COG: STM2019 COG1492 # Protein_GI_number: 16765349 # Func_class: H Coenzyme transport and metabolism # Function: Cobyric acid synthase # Organism: Salmonella typhimurium LT2 # 1 483 6 499 506 410 46.0 1e-114 MLAGTGSDVGKSIIAAAFCRIFLQDGYHPAPFKAQNMALNSYATPEGLEIGRAQAVQAEA AGVPCHTDMNPLLLKPSSDHTSQVVLNGRPIGNRNAYEYFRREGREELRKEVHAAFDRLA VRYNPVVMEGAGSISEINLRDSDLVNLPMAMHAGADVILVADIDRGGVFASVYGSVMLLR PEERKHIKGILINKFRGDIRLFESGVKMLEDLCGVPVVGVVPYYKDIYIEEEDSVMLQTK NIRAGQGKVNVAVVLLRHLSNFTDFNVLERDPRVHLFYTNNTDELMKADIILLPGSKSTL SDLYELRRNGVAQAIVRAHREGATVMGICGGYQLMGREVCDPDHVEGEIERLPGLGLLPV STRMQGEKVTRQVRFRFLEDSAVCEGYEIHMGTTTPLADVPVSPLNHLADGREDGYFVDR TCMGTYIHGILDNPSVIDYLLEPFADKLKETAFDYKAFKEEQYDKLAAHVRKHVDLPLIY QILTDND Prediction of potential genes in microbial genomes Time: Tue May 17 23:23:07 2011 Seq name: gi|226332017|gb|ACIB01000039.1| Bacteroides sp. 3_2_5 cont1.39, whole genome shotgun sequence Length of sequence - 57353 bp Number of predicted genes - 45, with homology - 45 Number of transcription units - 13, operones - 7 average op.length - 5.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 170 - 229 6.0 1 1 Tu 1 . + CDS 272 - 916 328 ## BF2525 DNA-binding protein + Prom 1511 - 1570 4.6 2 2 Op 1 . + CDS 1658 - 2005 381 ## BF2526 hypothetical protein 3 2 Op 2 . + CDS 2002 - 2325 214 ## BF2496 hypothetical protein 4 2 Op 3 13/0.000 + CDS 2408 - 3901 1466 ## COG1538 Outer membrane protein 5 2 Op 4 9/0.000 + CDS 3932 - 4930 1063 ## COG0845 Membrane-fusion protein + Prom 4967 - 5026 2.4 6 2 Op 5 22/0.000 + CDS 5074 - 6246 862 ## COG0842 ABC-type multidrug transport system, permease component 7 2 Op 6 . 
+ CDS 6230 - 7420 1007 ## COG0842 ABC-type multidrug transport system, permease component 8 3 Tu 1 1/0.500 - CDS 7383 - 9821 1781 ## COG0642 Signal transduction histidine kinase - Prom 9869 - 9928 3.4 9 4 Op 1 . - CDS 9932 - 10504 535 ## COG2096 Uncharacterized conserved protein 10 4 Op 2 . - CDS 10504 - 11826 889 ## COG1797 Cobyrinic acid a,c-diamide synthase - Prom 11846 - 11905 3.3 11 5 Op 1 . - CDS 11910 - 12644 583 ## BF2504 hypothetical protein 12 5 Op 2 12/0.000 - CDS 12679 - 13164 336 ## COG3610 Uncharacterized conserved protein 13 5 Op 3 . - CDS 13161 - 13928 752 ## COG2966 Uncharacterized conserved protein - Prom 13954 - 14013 2.1 14 6 Tu 1 . - CDS 14055 - 15743 1774 ## COG2985 Predicted permease 15 7 Op 1 . + CDS 16890 - 18962 2293 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 16 7 Op 2 . + CDS 18966 - 19883 929 ## COG4822 Cobalamin biosynthesis protein CbiK, Co2+ chelatase 17 7 Op 3 . + CDS 19880 - 22234 2363 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 18 7 Op 4 . + CDS 22252 - 23706 744 ## BF2542 hypothetical protein 19 7 Op 5 . + CDS 23708 - 27646 2810 ## COG1429 Cobalamin biosynthesis protein CobN and related Mg-chelatases 20 7 Op 6 . + CDS 27651 - 28211 270 ## BF2515 hypothetical protein 21 7 Op 7 4/0.000 + CDS 28231 - 28848 584 ## COG0811 Biopolymer transport proteins 22 7 Op 8 . + CDS 28814 - 29140 374 ## COG4744 Uncharacterized conserved protein + Prom 29144 - 29203 4.5 23 8 Op 1 . + CDS 29241 - 30650 1443 ## COG1010 Precorrin-3B methylase 24 8 Op 2 5/0.000 + CDS 30707 - 32011 912 ## COG2242 Precorrin-6B methylase 2 25 8 Op 3 . + CDS 32018 - 33817 1524 ## COG2875 Precorrin-4 methylase 26 8 Op 4 . + CDS 33820 - 35748 1347 ## COG1903 Cobalamin biosynthesis protein CbiD + Term 35935 - 35968 -0.2 + Prom 35846 - 35905 4.1 27 9 Tu 1 . + CDS 35982 - 36431 411 ## BF2522 hypothetical protein + Term 36576 - 36621 1.1 - Term 36517 - 36541 -0.3 28 10 Tu 1 . 
- CDS 36659 - 37867 1135 ## COG5026 Hexokinase - Prom 37899 - 37958 5.0 29 11 Op 1 . - CDS 38349 - 38537 133 ## gi|253565715|ref|ZP_04843170.1| predicted protein - Prom 38557 - 38616 1.9 30 11 Op 2 . - CDS 38632 - 40842 1263 ## FIC_00184 hypothetical protein 31 11 Op 3 . - CDS 40878 - 41159 175 ## gi|265764091|ref|ZP_06092659.1| predicted protein 32 11 Op 4 . - CDS 41167 - 43629 1496 ## BT_2473 hypothetical protein - Prom 43652 - 43711 10.6 33 12 Op 1 . - CDS 43806 - 44951 688 ## gi|301163476|emb|CBW23027.1| putative exported transmembrane protein 34 12 Op 2 . - CDS 45001 - 46008 847 ## gi|301163477|emb|CBW23028.1| putative exported lipoprotein 35 12 Op 3 . - CDS 46069 - 47058 730 ## gi|253565721|ref|ZP_04843176.1| conserved hypothetical protein 36 12 Op 4 . - CDS 47107 - 47958 525 ## gi|253565722|ref|ZP_04843177.1| conserved hypothetical protein 37 12 Op 5 . - CDS 47955 - 48893 581 ## gi|301163480|emb|CBW23031.1| putative exported lipoprotein 38 12 Op 6 . - CDS 48957 - 50099 589 ## gi|301163481|emb|CBW23032.1| hypothetical protein 39 12 Op 7 . - CDS 50127 - 51299 827 ## BDI_2990 hypothetical protein 40 12 Op 8 . - CDS 51331 - 52317 682 ## gi|253565726|ref|ZP_04843181.1| predicted protein 41 12 Op 9 . - CDS 52333 - 53352 794 ## BF3558 hypothetical protein 42 12 Op 10 . - CDS 53387 - 54334 707 ## BF3558 hypothetical protein 43 12 Op 11 . - CDS 54350 - 55507 625 ## BF3849 hypothetical protein 44 12 Op 12 . - CDS 55531 - 55923 329 ## BF2876 hypothetical protein - Prom 55992 - 56051 8.0 + Prom 56503 - 56562 7.7 45 13 Tu 1 . 
+ CDS 56780 - 57353 493 ## BT_1928 transposase Predicted protein(s) >gi|226332017|gb|ACIB01000039.1| GENE 1 272 - 916 328 214 aa, chain + ## HITS:1 COG:no KEGG:BF2525 NR:ns ## KEGG: BF2525 # Name: not_defined # Def: DNA-binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 196 1 196 263 386 98.0 1e-106 MANWITLKQLSEKRGIAESDLRTWANLGYIASSRIENVLMIDDESLTQYLDVHQTKDLGE NYLEKIIKEKELEREVLLSQCDDELFLLKTQKLHQPLFHILIQELGQLITDDHEREIFLS VSGGEPIARVAKRNKMTYARVATCYSSILRTLGEHKGRIATFRSRTMELMFDKCNTVTPV NTPLSNLVGAHAYNVLSLSAKTETSSCHQRSLHW >gi|226332017|gb|ACIB01000039.1| GENE 2 1658 - 2005 381 115 aa, chain + ## HITS:1 COG:no KEGG:BF2526 NR:ns ## KEGG: BF2526 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 115 1 115 115 154 100.0 1e-36 MKEPEKYKQPEEETTRLSEPTVAYNSMAYLELEAEKAELIRTIANIDSKEIIDKVKQKLH DVLGLKEKTVVKKTVPCQLTEDEIKEEIEQAIDEIQQGQTISSQEMHTTFKHYLL >gi|226332017|gb|ACIB01000039.1| GENE 3 2002 - 2325 214 107 aa, chain + ## HITS:1 COG:no KEGG:BF2496 NR:ns ## KEGG: BF2496 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 107 1 107 107 199 100.0 2e-50 MMKIAWTIRAQKRVIEILEYSQQEFGNKAADRLARTIEQKTLQLINNPFRGPVELILAKE KQISCRYLLVNPYKIIYYVDEKEETIFIVTLFHTHQHPSKLNREVGL >gi|226332017|gb|ACIB01000039.1| GENE 4 2408 - 3901 1466 497 aa, chain + ## HITS:1 COG:VC1606 KEGG:ns NR:ns ## COG: VC1606 COG1538 # Protein_GI_number: 15641614 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Vibrio cholerae # 39 477 24 458 476 159 27.0 1e-38 MEKKKIPVALMIAAGMLLYNNTVAAQSLPPTQETSQHQLSFNEALQLLHKGNQSLKIADK GIDIARAERGKLNAFWMPSLQSTGAFVHLSEKIEVKQPLSQFTDPAKDFVHSILPDDKII SSILDQIGTNTLIFPLAPRNLTTVDLTAEWVLFAGGKRIHATKIGNTMIDLARENRAQTD ATQRTLLAESYYGLRLAQEIVGVRLESYKALKLHYENALKLESTGMIDKAARLFAQVNMD EALRELEAARKEEAVVQRTLKTLLNLETSGDISPSSPLFINDTLPPKMEFMQVVGISNYL 
LNQLSLQEHMAKQQVRIDQSGYLPNIALFGKQTLYSHGIQSNLLPRTMIGVGFTWNLFDG LEREKRIRQSRLTQQTLALGQEKARDDLSVGVDKLYTGLQKALDNVRALNTTIELSEELV RMRKKAFAEGMATSTEVVDAETLLSKTKVARLAAYYEYDVTLMNLLALCGIPEQFGSMKD VTSLPITENRRNEIEIE >gi|226332017|gb|ACIB01000039.1| GENE 5 3932 - 4930 1063 332 aa, chain + ## HITS:1 COG:HP1488 KEGG:ns NR:ns ## COG: HP1488 COG0845 # Protein_GI_number: 15646097 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Helicobacter pylori 26695 # 32 332 29 328 329 217 38.0 3e-56 MENSESKKGRTLSIAFIVVLVAVALFTVIGMIAMRHQPLVLQGQAEATEIRISGKLPGRI DTFLVEEGQWVKQGDTLVVINSPTVEAKYRQVDALKQVAVEQNKKIDAGTRKQIIATAQQ LWNKTQSDLTLARTTYNRILTLYKDSVVTSQRKDEVEAMYKAAQAAERAAYEQYQMAVDG AQSEDKASARSMVNAANSTVDEVSSLLVDARLIAPEDGQIATIFPKRGELVAPGTPIMNL VVMDDIHVVLNVREDLMPDFRMGGTFIGDVPALAQKGIGFKIYYISPLGSFATWKSTKQT GSYDLQTFEIHARPTKKVEGLRPGMSVLVEIK >gi|226332017|gb|ACIB01000039.1| GENE 6 5074 - 6246 862 390 aa, chain + ## HITS:1 COG:VC1608 KEGG:ns NR:ns ## COG: VC1608 COG0842 # Protein_GI_number: 15641616 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, permease component # Organism: Vibrio cholerae # 6 350 6 348 387 107 22.0 3e-23 MQHSPITRVIQREWQRMTSRRLYFGVCLVLPLFTLFFMATIFGNGQMENIPIGIVDRDNT ATSRDITRRMSAVPTFRVTRHFVDEAEARKAVQQKEIYGYLSIPPRFEQDMISGQDATLN YYYHYALLSVGGELMAAFESSLAPVALSPIVMKAVALGVNEQQIETFLLPVQANNHPIYN PSLDYSVYLSQPFFFVLFQVLVLLITVYAVGSEIKFGTAGQWLQAAGGDITVAVTGKLLP YTLIFSLIGILGNFVMFGILHIPFQGSWLLLNVMTVLFIIATQALALFIFSLFPAVAIII SIVSMVGSLGATLSGVTFPVLNMYPLVRDASYLFPVRHYTEITQTMLYYGGGFIHLWPSA VILCIFPLLALAMLPHLRRAIISRKYENIR >gi|226332017|gb|ACIB01000039.1| GENE 7 6230 - 7420 1007 396 aa, chain + ## HITS:1 COG:jhp1379 KEGG:ns NR:ns ## COG: jhp1379 COG0842 # Protein_GI_number: 15612444 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, permease component # Organism: Helicobacter pylori J99 # 13 383 6 359 376 134 28.0 2e-31 MKTSGKLSQISFIIAREFRAISTSYAVLLVLMGGIFVYGLLYNYMYAPNIVTDAPVAVVD 
NSHSSLSRQYIRWLDATPQVAVYAQAVDYREAREWMKEGKVQGILYIPHDFETRVFQGRE AVFSLYATTDAFLYFEALQEATSRVYLAINDAHRMDGAVFLPPQGLLAVAMAKPVNVAGT ALYNHTEGYGSYLIPAVMMVIIFQTLLMVIGMLTGDEYQHRATEPLLPGGRTADKSGLWG GAMRLVAGKTFVYCGLYTVFSMFLLGLLPHFFSIPNIGNGLYITAMMVPYLMATSFFGLA ASRYFTDSEAPLLMIAFFSVGLIFLSGVSYPLELMPWYWRMAHYILPAAPATLAFIKLNS MGADMADIQPEYITLWIQVIVYFGLSVWVYKKKLEA >gi|226332017|gb|ACIB01000039.1| GENE 8 7383 - 9821 1781 812 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 528 803 29 308 328 189 40.0 3e-47 MIISKNPLGDIAKLNRICASAQIGWWEVNFTTGKCFISETLLKSLEVSSEWLDIDELMST VRQDYRKRITDEFTSIPRKGVFEQTFPVTSGRGNVFWIHCALSMEEENEEGQLIATGYGQ RIESPETQGYQCAWNQRINNLLYCQNSIANSLLKLLSNDTGDELFEEMLADILYFFKGAR VYIVRYNWKNGNQSCLYEVAACNVITLKEKLQNICSEDAPWFYQQIHANRPVILNSPDEL PPLAVRDREVLAENGTNSMMLAPLMREEGVWGYMGIDIVDGYRKWNSEDYQWFSSLANII SICMELRMIKERVMHSEKLFHDIFTNIPVGLELYNKEGMLLDCNNRNLEIFGVGDKSRII GLNLFESPNMTRDIHESLRAGRPGTFHLKYDFDEERRLFQSERRGVMDLDIRSLMLYDAE DNLSNYLLVNIDNTERNNALSKVHDFENFFSIISDYSKVGYAKINLLDHTGFAVRQWYRN LGESHDTPLADIIGIFSHMHPDDRKSVLDFYEKAKAGTERFFDGDLRIRPADGSDRWNWI HKSSMVTAYQSPNPRLELVEVNYDITVQKETEAELRAARDKAEESNRLKSAFLANISHEI RTPLNAIVGFSDLLMTVDDPAEQEEFRRTIQKNNTLLLQLFSDIIDLSKIDAGSFEYMPK PVCLYQFCAMMVQKMRNKVPEGVELQIDEDSPLDAWFSADSGYLNQVVTNFMSNAIKFTH RGTITVGYRIDARQQLEMFVEDTGIGISIENQEAVFDRFMKVDSFVQGTGLGLPLCKSII EKMGGHIGVISELGKGSRFWFTLPAFSCIPTR >gi|226332017|gb|ACIB01000039.1| GENE 9 9932 - 10504 535 190 aa, chain - ## HITS:1 COG:lin1172 KEGG:ns NR:ns ## COG: lin1172 COG2096 # Protein_GI_number: 16800241 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Listeria innocua # 3 167 2 163 188 121 42.0 8e-28 MKRIYTRTGDRGTTGIHGGERVEKDDIRIEANGTIDELNAVIGIIRSLLPQEHDWQKLLH HLQRELMVVMSHVATPSAIRDKNPNVLSPGLAAFCEQEMDTMTAGLKENGYFLLPGGTPV SAQLQFARTVARRAERRLWTLNRQDAVPEEILSFINRLSDLFFVMARFDMQQQDWPEERW QAFAYKTKKK 
>gi|226332017|gb|ACIB01000039.1| GENE 10 10504 - 11826 889 440 aa, chain - ## HITS:1 COG:MA0106 KEGG:ns NR:ns ## COG: MA0106 COG1797 # Protein_GI_number: 20089005 # Func_class: H Coenzyme transport and metabolism # Function: Cobyrinic acid a,c-diamide synthase # Organism: Methanosarcina acetivorans str.C2A # 1 390 20 402 458 241 39.0 2e-63 MISQFLIAAPSSGSGKTTVSRGLMALLIKKGLKVQPFKCGPDYIDTKYHTAVCRRPSINL DTFMASAGHVKELYARYATGADACITEGMMGMYDGYDRDRGSSAEVAGLLNLPVILVVDA KSAAYSVAPLLSGFIHFRPEIRIAGVIFNRVGSPRHYEMLQEVCTELGIACLGYLPKQES LVQESRYLGLDFSHSKGTDALEELTGLMEKYIDYNRLLEETKLPAPIPPVSNISLQEDLK ISVACNSESFSFIYQEHLDVLCRLGTVILFNPEDNRPLPEGTDLLYLPGGYPEKHYEKLR QAWQRMQSIRNYAESGGRVLAECGGMIYLSKGILLDRSEHSDSEVGLQAGVLPFFISNRK ADRRLTLGYRQFDYNGQHLRGHEFHYTQFEPKPEESLESVTQVYNAKRMPVSTPVFRYKN VIASYTHLYWGEIDLLKLFE >gi|226332017|gb|ACIB01000039.1| GENE 11 11910 - 12644 583 244 aa, chain - ## HITS:1 COG:no KEGG:BF2504 NR:ns ## KEGG: BF2504 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 244 1 244 244 395 99.0 1e-109 MEIFWKTIAYYNSATWIYQLLIIVAGLLLTVMLIKNPRPWVKMGMKLYMIFLYLWIAIAY YAICCDERSYNGALAMFWVVMATIWVWDAITGYTTFERTYKYDILSYVLLILPFVYPLVS IARGLIFPGITSPVMPCSVTVFTIGLLLLFSRKVNMFLVLFLCHWSLIGLSKTYFFNIPE DFLLASATIPALYLFFREYFLNNLHADTKPKAKYINWLLVFVCVSIGILLTTTLFLELMP GKQP >gi|226332017|gb|ACIB01000039.1| GENE 12 12679 - 13164 336 161 aa, chain - ## HITS:1 COG:Cj1165c KEGG:ns NR:ns ## COG: Cj1165c COG3610 # Protein_GI_number: 15792489 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Campylobacter jejuni # 6 161 5 160 164 80 31.0 1e-15 MIALDILSDGFFAAIAGIGFGAISDPPLRAFKMIAILAAAGHACRYCLMTFLGVDIATAS LFGALVIGFGSLWLGRKVYCPMTVLYIPALLPMIPGKFAYNMVFSLIMSLQTMNEPERLG KYMETFFSNGLVTCTVIFMLAVGATFPMFLLPHKAFSLTRH >gi|226332017|gb|ACIB01000039.1| GENE 13 13161 - 13928 752 255 aa, chain - ## HITS:1 COG:Cj1166c KEGG:ns NR:ns ## COG: Cj1166c COG2966 # Protein_GI_number: 15792490 # Func_class: S Function unknown # Function: Uncharacterized 
conserved protein # Organism: Campylobacter jejuni # 10 247 9 249 258 148 37.0 8e-36 MTTNESLISISKFIAGYSAHLMGAGVHTSRVIRNSKRIGEAYGVDVKLSVFHKNIILTII DNETREACNEVIDIPPHPISFEHNSELSALSWEVYDKHLSLHELSDKFNKIISAPKIDPL FVLLLVGFANASFCKLFGGDIISMGIVFSATITGLFLKQQMQKKKINHYIIFIVSAFVAS LCASTALIFDTTSEIALATSVLYLVPGVPLINGVIDIVEGYILTGFARLTEAALLIVSIA IGLSFTLLMVKNSLI >gi|226332017|gb|ACIB01000039.1| GENE 14 14055 - 15743 1774 562 aa, chain - ## HITS:1 COG:ECs4625 KEGG:ns NR:ns ## COG: ECs4625 COG2985 # Protein_GI_number: 15833879 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Escherichia coli O157:H7 # 19 557 12 553 561 379 40.0 1e-105 MELLRNLFEGYPNLWGGGVAHSVLILSLVIAFGIMLGKIKVAGISLGVTWILFVGIVFGH FNLNLNEHLLHFLKEFGLILFVYSIGLQVGPGFFSAFKKGGFTLNMLAMIVVFAGVIITL ALHFITGIPITTMVGILSGAVTNTPGLGAAQQANSDLTGIDAPEIALGYAVAYPLGVVGC IMSLLGLKYLFRINIKQEEAEAEQGLGHLQELTVRPVSLEVRNEALHGKRIKDIRPLVNR NFVVSRIRHLNGKKESELVNSDTELHLGDEILVIATPIDIEAITAFFGKPIEVEWEQLNK ELISRRILITKPELNGKTLAQLKIRNNFGASVTRVNRSGVDLVASPQLQLQMGDRVTIVG SELAVSHAEKVLGNSMKRLNHPNLIPIFLGIALGCILGSIPFMFPGIPQPVKLGLAGGPL IVSILISRFGPQYKLITYTTMSANLMIREIGISLFLACVGLGAGDGFVETIIHEGGYVWI AYGMIITIVPLLLAGFIGRYAFKLNYYTLIGVLAGSTTNPPALAYSNDLTSCDAPAVGYA TVYPLTMFLRVLTAQLLILSLG >gi|226332017|gb|ACIB01000039.1| GENE 15 16890 - 18962 2293 690 aa, chain + ## HITS:1 COG:cirA KEGG:ns NR:ns ## COG: cirA COG4771 # Protein_GI_number: 16130093 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Escherichia coli K12 # 35 688 29 663 663 120 23.0 1e-26 MNKRCLLTILLGGCAAMGAFAQQHATGKKTEMSVDLNPVVVTGTGTHQRLKNTPAPVEVV TANEIKKAGITDFQQAMTMLVPSLSFSTNSMGSYLMMNGLSNKYVLILINGRKLTGDTSN NIDLSRIDMSRVKRIEVLDGAGSSLYGSDAIAGVINIITNEPKELMQVTSTTRYEDHRQL TQMVNADIATEKFGSYTSYKHEQSAGWQNSDLAYVTDKKGNVSTVPTVSATSNGFHSNLV NQKFTFEPTEKLSFYANGGYYWKLTDRPVYTADIDGGSKYDLHYESYNFGLGGRYKINKR SSIQLDLMNDNYMQNYKYTVADDKNKIAIGDYAKTKEQHFYDAELKGIFNFAKNNMTVLG 
VDYRSESLDRPSARVDKSVYTASAYAQHEIKLWNCLTGIAGVRYDYHELAGGRFTPKVAV MYSVGAFNIRGTYSTGYRAPGLDELYYYMNKGTTITQGNKDLKAEHSNYYSVNLEYNTNR LNVSVTGYLNYIGDMINSTTYKLADLPNGEELRAAAQKEFNLTDAEAKKLANYKLYGNLD KGVVRGFEVNAATNLGAGFSLNGNYAYAYARGKSVDGVWGNIERSVRHTGTVAGNYTHAW NDYMLNVNINGRFQSKRFHPGHSYGDAPGYGVWNLNVKHSLTGFQHLGLDFGMGLDNIFN KRDTRPNGVNYALLSPGRMAYVSLTLRFKK >gi|226332017|gb|ACIB01000039.1| GENE 16 18966 - 19883 929 305 aa, chain + ## HITS:1 COG:FN1263 KEGG:ns NR:ns ## COG: FN1263 COG4822 # Protein_GI_number: 19704598 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CbiK, Co2+ chelatase # Organism: Fusobacterium nucleatum # 38 300 13 279 283 218 44.0 1e-56 MKTYILSLFLLVSLFCSAHEGGNFVASDMLAGMQPGDKAALLMVHFGTTYDDTRTKTIDA INAKAKEAFPQMEMREAYTSRIVMRRLKARGIEKPNPLEALLKLLGDGYTHVIVQSTNII EGVEMESLRRDVASVARFFKEIRVGNPLLYSVEDAEAVVDILGAGKPEKGSVVLVGHGTY TPSTATYAMIDYMLKAKGLKNFHVGTIEGYPTFDTMLQQLKDNKTKQVTLVPFMFVAGDH ANNDIAVDWKEALEKEGLKVDVRMQGLGEIPAIQQLFIDHAQFMLKHEMVDIMKKKDKYS KDKDE >gi|226332017|gb|ACIB01000039.1| GENE 17 19880 - 22234 2363 784 aa, chain + ## HITS:1 COG:alr2185 KEGG:ns NR:ns ## COG: alr2185 COG1629 # Protein_GI_number: 17229677 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Nostoc sp. 
PCC 7120 # 53 763 214 828 853 176 26.0 2e-43 MKRIILAALGSALLLPSQAQQKNKEYTNFNDSVFSINEVVVATNYRRKTDALKLDVPAKF IPISTNSITSGMLEKRNIRDIQEASRFLPGVRFRTSYGAFTQFSIRGFDNSVIMVDGVRD ERSSIDNSYPFMDLSAVESIELLKGPASVLYGQSAVGGVLNIVRKAPVSKQSVYARLAYG SYYNKQATMALGGKLIGPLNYRASVNWQDQEGWRSNATKRLSGYLALGGHLTENDELDIR IGANRDFYPTEIGLPPTMSYDILSATDGSKYLSKGDALPGLNKKARYNSESDFMYNRGFN VSAMYKHTFSEAFKLMEKLSYTYDDIDYFGTESLDYLTSDRPIYDHYYMTKDKQGNDTKK YICLDSIYYSYPLRFSHIAKTVNNQLEASGKFYTGDVAHNYLGGYSFVSLMRDSYMAYGN GSTGATGPGTTGHGSVYNPHSIGWMEAPFRFVTAQKTFTHGFYLQDLVEFSDKLKMMLAG RYDLFMYKTANLNTSDGGRHYDKPDDDAYNKITNGAFTFRAGLVYLPIEKLSVYGSYGTY FKPIRAFYDANTIYIDKDGKEFTPVNGKEVFKPEKGFQVEVGARYEITRTLQTNVSLFYI NKDNIRQTLANKGDISNGVELDKKVVGQVGRMDSKGFDIDITWSPIYNLSMSAGYGYTDA KVRDLADNPYMPTTSSKGKQYAYIPKNTFYAFGAYTVSKGVLKGLGVNFSTSFQDKVYRN SDNTSSFDAYWLTDLGFSYTLKSNVRLGVNINNLFNKEYCNQALGNQLIPSMPRNFMLSA SYTL >gi|226332017|gb|ACIB01000039.1| GENE 18 22252 - 23706 744 484 aa, chain + ## HITS:1 COG:no KEGG:BF2542 NR:ns ## KEGG: BF2542 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 484 7 490 490 981 99.0 0 MYTIHRVLGTLLSILFLVWFLSAFVMMYHGFPRASQAEKLEKLEPLSPSLPSVSEITSRL PEGEKVKGIRLDRYLGQTIFHIRTDKGEHNLPADSVQALPVIDGSRIHRVASLWCNAPID RIDTLNRLDQWIPFGSLKREFPIYKFHFADTEKHQLYIGSQSGEVLQFTTRNERFWAWLG AIPHWVYFTWLRQDAALWSITVIWLSGIGCLMTIAGLWVGIDVWRRSRKQKGKFSPYRKK WYHWHYVTGIVFGLFVLTFCFSGMMSLAEVPAWISKPVLDRNPTREIKKGAPKPDQYLLD YRQILTEYPDVRQVEWSNFRSKPYYIVKRSEGDLYIDASDSLPHPLKLDKKQVTDAVRTI HGDSIHLKVELIDKFETYYRDMSRMYRDRSLLPVWKITVDDPDHSCYYIHPETATVRYVN STARWKYWMYTALHRLRIQGLNSSPTLRKSVLWVLLLGGTVCSLSGVVLGVRYIERKCRK KTRR >gi|226332017|gb|ACIB01000039.1| GENE 19 23708 - 27646 2810 1312 aa, chain + ## HITS:1 COG:MA4424 KEGG:ns NR:ns ## COG: MA4424 COG1429 # Protein_GI_number: 20093210 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CobN and related Mg-chelatases # Organism: Methanosarcina acetivorans str.C2A # 133 1293 258 1453 1518 476 27.0 1e-133 
MKVLTLFRHKRTLYIAGSVLLLAIAFTIGYRYWMAPTRILIVNPLPAQAADIVLNNDSRN IEVTCIQTEKLESFKGYDAVVLYGRSLNLNDRQMKEAERAASAGIPLFTISLRNFNTIIN RNITPEQEAMLMQYFGDACRQNYRNGLRYLRHIATPTRWNIETFDAPLRLPNNLFYHQEY GKYFETQKALEQYLRQKGIFHENGPKIAFISGVSFPMEGNRAHVDTLISKMTQAGFNVYP IAGKEKREKMLRSLHPDALVYLPMGRLGDDSLINWLHTENIPIFNPFPLIQSREEWLDPM KPVSGGTLTARVLVPEIDGGMTPLLIATQNLHKSGYYLHEPEMERVDNFISHVHKYLDLR TKPNSDKRIAICYFKTPGKDALLASGMEVIPSLYNFLKRLRTEGYDVSGLPATVEEFGKQ IYRDGAVMGSYAAGAQEKFLQTAHPVWLTKTQYEKWVHEAIEPDKYKEVTERYGDAPGHL LTGTNTQGEAQLAIACLRFGNILLFPQPRPALGDDDFKLVHGMPVAPPHSYLAPYLYVQK GFQADALIHFGTHGNLEYTPGKNVALSHNDWADALVGDLPHFYYYTTGNVGEGIIAKRRT HAVLVTHLTPPYVESGMRQRYTSLLEDIHKILSEDIEKNRTLGIRIKKEVIKLGLHRDLK LDSVSSRPYTAEELERIDLFAEEIANEKTIGAYYTLGETYSARDLLTTTLAVSADPLAYQ MAKRDRDKGKITTEQLQDFGYITHHYLPIAKQRLIPLLQNPPKDTTGIAPELQEALRYHA LLVSSTGNELNAMLRGLKGGTVFPAPGGDPVLNPNVLPTGRNMYSINVETTPGILSWEEG KRLAEATLKAYRENHNGEYPRKVSYSFWAGEFITTEGATLAQVFWMLGVEPIRDKMGRVV DLRLVPSSELGRPRVNVVVQVSGQLRDIAGSRLTMLTDAVRLASAADDKAYPNYVSSGTR LQEKLLVEKGVSPKRAREMSVMRVFGPVNSGYSTGMMAYTEKSDRWDHESELVDGYLNNM GAAYGDEESWGGMQKDLFASALSETDVVIQPRQSNTWGPLSLDHVYEFMGGLSLTVKTLT GKEPDALMADYRNRNNKRMQNINEAIAVEARATVLNPTFVKERMKGGATTAQMFGEIFRN IFGWHATRPSAMDKEIFNDLYKMYIVDENHLGIRDYFQRINPASYQAMTSVMLESARKGY WKASDEQLKVTARLHAQITREAGAACTEFVCDNRKLQQFVEGHLDNNDSESYRLVMQEVH QAGNEKGKDIVLKEEKLTKTENRKKNVVNGILTGVIVLLVFGGVIYLLKRKK >gi|226332017|gb|ACIB01000039.1| GENE 20 27651 - 28211 270 186 aa, chain + ## HITS:1 COG:no KEGG:BF2515 NR:ns ## KEGG: BF2515 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 186 1 186 186 289 100.0 3e-77 MEHIIHLLIGFIVLSFLLKTGFYPRWGIWLSALVYTVFLICIGPWATEQSPTEINSLLAS APHILTLSVYVTLEASIMIAFCFNCFADTSKQRTLFQRTVTYILNFYPGLLMAGILTYLL IQLFFAFPGVSFGLITGISSVAVFILISGLSLLLKNIVGERKLRLEILFITNLFIVLLSV VSTGNN >gi|226332017|gb|ACIB01000039.1| GENE 21 28231 - 28848 584 205 aa, chain + ## HITS:1 COG:MA4426 KEGG:ns NR:ns ## COG: MA4426 COG0811 # Protein_GI_number: 20093212 # Func_class: U Intracellular trafficking, secretion, and 
vesicular transport # Function: Biopolymer transport proteins # Organism: Methanosarcina acetivorans str.C2A # 8 201 11 211 273 129 37.0 4e-30 METISNALFWISNGLLVPVVVLLLLFFARAVLLAGGFFGEFYRRVHTQKSLAEQLEELTP DNIEEKADSLTGDRSTPVQRCVYKLYTHRDNAAYCERLLANFEVDAEQELGRSRTFVKLG PMLGLMGTLIPMGPALVGLATGDIASMAYNMQVAFATTVVGMVIAAIGVVTLQIRQRWYA REINDLEFISKTLIHGTKQTSTQPE >gi|226332017|gb|ACIB01000039.1| GENE 22 28814 - 29140 374 108 aa, chain + ## HITS:1 COG:MA0345 KEGG:ns NR:ns ## COG: MA0345 COG4744 # Protein_GI_number: 20089243 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Methanosarcina acetivorans str.C2A # 3 108 9 114 131 90 50.0 6e-19 MARNKLLHNQNDTDPMGTVANLFDVAMVFAVALMVALVSRFNMTEIFSKEDYTMVKNPGQ ENMEIITKEGKEIKRYTPSEQKESSGKRGKKVGVAYELENGEIIYVPE >gi|226332017|gb|ACIB01000039.1| GENE 23 29241 - 30650 1443 469 aa, chain + ## HITS:1 COG:lin1162 KEGG:ns NR:ns ## COG: lin1162 COG1010 # Protein_GI_number: 16800231 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-3B methylase # Organism: Listeria innocua # 6 245 2 239 241 226 48.0 1e-58 MKQSKIIVAGIGPGSEQDITPAVLAAVREADVVVGYKYYFRFIRDFVRPDAECIDTGMKR ERARAEQAFEYAEQGKTVCVISSGDAGIYGMTPLIYEMKRERQSNVEIIALPGISAFQKA ASLLGAPIGHDFCVISLSDLMTPWERIERRILAAAQADFVTAVYNPKSDGRYWQMYRLRE IFLREGRSPETPVGYVRQAGREEQEIHITTLAAFDPETVDMFTVVLIGNSQTYTFNQNII TPRGYYRETRSEATGIGQDIMIRSFRTIETELKNRDIPLDRKWALLHAIHTTADFEMERL LYTDPNAVASLYDAIRTGYLRTIVTDVTMAASGIRKGALQRLGVEVKCYLNDERVAEMAT SKGITRTQAGIRLAVEEHPDALFVFGNAPTALMELCDLIRKEKAQPAGIVAAPVGFVHVE ESKHMTKPFTRIPKLIVEGRKGGSNLAATLVNAILCYPDAEQLRPGRDV >gi|226332017|gb|ACIB01000039.1| GENE 24 30707 - 32011 912 434 aa, chain + ## HITS:1 COG:STM2030 KEGG:ns NR:ns ## COG: STM2030 COG2242 # Protein_GI_number: 16765360 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-6B methylase 2 # Organism: Salmonella typhimurium LT2 # 252 432 1 176 192 97 35.0 3e-20 MKTSETTRPTLSSLPVGKRAEEGTPHNLFTVIGLDDSPSPYLSPSVKALIDQGCVFSGGA 
RHHDIVTPLLPAGAKWIDITVPLDQVFARYAGHPHIIVFASGDPIFFGFANTIRRRLPDA EIRLYPSFNSLQTLAHRLVMPYDDMRTISLTGRPWHGFDRALIERTPKMGILTDREHTPA TIASRMLDYGYNDYTMYIGEHLGHPAKELIRGMTLEEAAAETFEYPNCLILTTGDGLQSV NGGILSPRFFGIPDEAFELLDGRARMITKAPIRLLTLSALELNRRTSFWDIGFCTGSVSI EARLQFPHLHVTSFEIRPEGKRLMEINSQRFGTPGITTVIGDFLETDTAIYPCPDAVFIG GHGGRLKEIISRVRHKLLPGARIVFNSVSEESKTHFIEAANESGLCFLGGTRVAINEYNP IEILVASAPDSPYL >gi|226332017|gb|ACIB01000039.1| GENE 25 32018 - 33817 1524 599 aa, chain + ## HITS:1 COG:MJ1578 KEGG:ns NR:ns ## COG: MJ1578 COG2875 # Protein_GI_number: 15669774 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-4 methylase # Organism: Methanococcus jannaschii # 356 599 9 254 259 253 51.0 7e-67 MKTAIIVISEAGIALAKTLEQELPESEIFSTGTDTDCHSISNLQEAVPEIFHKFDAIIFI GAMGICIRAIAPHIEDKHKDPAVVCVDSTGRYAVSVLSGHIGGANGLTRYVASILGAEPV ITTRSDRTGLWALDTLGKKYGWQTVPAESSDMNHLITLFVDCKPTALLLDIRDEGTTQLE HTLPPHVDVFYKFEDMDLRKYDLLLLVTPFIYNTSDTPALYYVPPVLHMGIGLARDAHPV DTVITHLMDVVVQANMIPLAIRTVSSIEEKKDEPVLKLLAEAYQTRLYTASQLSKIEVPT PSEVVNKHMGTPSVSEASALLSSGGGPLLLPKQKGTNFTVAIAMDAASVRQGHIEIVGAG PGDPELISVRGRRFLEEADLILYAGSLVPRKLTECAKAGATIRSSASMTLEEQFALMKEF YDRGQLVVRLHTGDPCIYGAIQEQMNFFDQYGMHYHITPGISSFQAAAAALQSQFTIPER VQTIILTRGEGRTPMPEKEKLSLLARSQSTMCIFLSAGVVDQVQRELLEHYPPTTPVAAC YHLTWKDERIFRGQLQDLAKIVNENHLTLTTMIVVGDAIDNREGLSRLYSHQFKHLFRK >gi|226332017|gb|ACIB01000039.1| GENE 26 33820 - 35748 1347 642 aa, chain + ## HITS:1 COG:PA2908 KEGG:ns NR:ns ## COG: PA2908 COG1903 # Protein_GI_number: 15598104 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CbiD # Organism: Pseudomonas aeruginosa # 268 566 11 283 366 187 40.0 4e-47 MILIFGGTTEGRAAVNVIEEAGKPYYYSTKGDEQDIYLHHGIRLSGAMTRRTLKAFCRQN DIRLLIDAAHPFAEKLHDTVTDVAHDLGIPCIRYERIYDRSYLNPIFEDNCDPDDLPFKF EYDNRDLLRELKKEKEGHRFLFLTGVQSIARFKSLWTKKKYECYFRILDRDSSREIARQA GFPEDHLVYYHPETENLSQLLQELSPQAVVLKESGKSGGFTEKKDMILEYGATPYILLHP ELEYYDITVDGVNSLRRTLEKMLPDYFPLRSGLTTGSCAAAAAIAAFRKLKNPILEDFNR 
NIHTVLPSGETIEIPCQSVSGTFSDEKIEVSATVIKDGGDDPDVTSGLPIVTTLTLNLAE AKQANNAPVQTPETWEFVFHGGPGVGTVTLPGLGLEVGGPAINATPRQMIIDNLRNCIRY YYRYLPNVPIHVTISVPGGEEVAARTFNPRLGVVGGISIIGTSGIVKPFSSEAFVRSIRK EMEVARATGACRIVINSGAKSEKYIRNLYPELPPQAFVHYGNFIGETIGIAAELGISRLT LGVMMGKAVKLAEGHLDTHSKKVTMNKEFLKEIARRCGCTPSSIEAIDHIILARELWNIL PETELQAFCSLLIEQCHRHCDVLLPNGELTILLITEEGKIIQ >gi|226332017|gb|ACIB01000039.1| GENE 27 35982 - 36431 411 149 aa, chain + ## HITS:1 COG:no KEGG:BF2522 NR:ns ## KEGG: BF2522 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 149 1 149 149 266 98.0 2e-70 MKFNRKEKTFIMKKTYLWTAMLCTAIAFSACKSNKAGQDTASEAKTEEAVIPGSDKDEHG CVGSAGYVWSEVKKDCIRPFEAGLKISETQKDNATYATYIVFAADSVQAELYTPESEGSI LLERADNQWKNDTISVSCKNGQWSISKQK >gi|226332017|gb|ACIB01000039.1| GENE 28 36659 - 37867 1135 402 aa, chain - ## HITS:1 COG:SPAC4F8.07c KEGG:ns NR:ns ## COG: SPAC4F8.07c COG5026 # Protein_GI_number: 19114777 # Func_class: G Carbohydrate transport and metabolism # Function: Hexokinase # Organism: Schizosaccharomyces pombe # 4 278 14 299 455 116 31.0 8e-26 MEKNIFKLDNEQLKGIAHTFREKVEEGLNKNNAEIQCIPTFILPKATDVKGKALVLDLGG TNYRVAIVDFSTEKPIIYPNNGWKKDMSIMKSPGYTREELFKELADLIVEIKREEEMPIG YCFSYPTESIPGGDARLLRWTKGVDIREMVGQFVGKPLLDYLNEKNKIRFTGVKVLNDTI ASLFAGLTDKSYDAYIGLIVGTGTNMATFIPSDKITKLDPECHVQGLIPVNLESGNFYPP FLTAVDDTVDATSDSLGKQRFEKAVSGMYLGDILKAAFPLEEFEEKFDARKLTAIMNYPD IHKDIYVQVAHWIYNRSAQLVAASLAGLIALLKSYNRDIHRVCLIAEGSLFWSESRKDKN YNILVMEKLQELLRELELEDVEVHINSMDNANLIGTGIAALS >gi|226332017|gb|ACIB01000039.1| GENE 29 38349 - 38537 133 62 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565715|ref|ZP_04843170.1| ## NR: gi|253565715|ref|ZP_04843170.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 62 3 64 64 127 100.0 2e-28 MNVDSMISRKLPIGIQNFEKLHTEGYFSNLNQLQDISLHPAYTTLCERTDVRGNDGKVYP PI >gi|226332017|gb|ACIB01000039.1| GENE 30 38632 - 40842 1263 736 aa, chain - ## HITS:1 COG:no KEGG:FIC_00184 NR:ns ## KEGG: FIC_00184 # Name: not_defined # Def: hypothetical protein # Organism: F.bacterium # Pathway: not_defined # 418 644 578 834 1036 70 28.0 3e-10 MDAVTRGVPAVPTKLYNLEILQYNRNGTYQNGNSYGTVELGTHLDVTLNVMNDCQLLVVA RGNKDAVKTLVGKNLEDTESTKGVKSMDIDASIINQIDPSTADAIDAMPYVLHLEHVNVV TGTDGKAVIQSPEGSYDTRLLLKRLAARLTVSWNYTVSGYELKQVLLQSVPLNYTLVPTA DSNGTYPSILDQFHTVEIDMSKGNSYSCWIPANVRGESPAANSDLQRTKANAPKGSSFLN FVAVNTTDPKKKLDYRVYIGGKTSSDFSLNNNTEYSYAVSFSHTGIPTNDKRVTYIDPVP ASENNDNPVPTANCFMVAPGGGFCFDPLVYQSDGTEKTNETLKGWCQGGGIVKVKLLWQT KEDGDIGEPVMGIVNSAEDHTNIVDIKRTDGTAVGQNPVTDKGQCRIYCRVAPGTTGGSG VIAAYDSSDNILWSWHVWVTDYHPDATGNVDVQEPLTKRKLKFTYGNHSDQRPMMDRDLG AMAGYTGVPPSDVEKFKTHGFQYQWGRKDPYPSSYSNKPIKTVNLPAKITEPIVGIMSLY GSDGVKFLPFDPSYNGRAGYQMAYRNPLTAYKPSGSQYWFTDDVTSSISGAWATVKTVHD PCPAGWRVAKAEEYYSLFSDKGYNGTLPSYSTNNMNMNNYNTQGADKGFVLRYDETDQSK TTYFRLCGYYADRVFVQIGYFDFIWCCNCAKNGNTYQARHLQLVSTASDQRRGINGINNE GTLSAMLPLRCIQEKD >gi|226332017|gb|ACIB01000039.1| GENE 31 40878 - 41159 175 93 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|265764091|ref|ZP_06092659.1| ## NR: gi|265764091|ref|ZP_06092659.1| predicted protein [Bacteroides sp. 
2_1_16] # 1 92 1 92 96 157 89.0 2e-37 MKQIWKNKWLMLLGGMTLLGGLSGCEDRPEGTVPFPEGEEVTVSLVFGFAEDSNNEGGQR LASTARLNGSPQRLASTARLNGSPQRLASTARR >gi|226332017|gb|ACIB01000039.1| GENE 32 41167 - 43629 1496 820 aa, chain - ## HITS:1 COG:no KEGG:BT_2473 NR:ns ## KEGG: BT_2473 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 495 794 405 685 706 70 26.0 4e-10 MKHTSKNLWKLPVLVCLLGIAACEERADVVPVDADTKPVEVSLCFGFADEEDGYNLSASA DTRGSDSGQDGAFTARPVPAVRTRATEPSHPDALYQFYLMQYKADGTLMGSVQSKEQITA GTDFSTLVTLTPATNCQLVVIVCGKGNTTPSIGGSLANVQQQVMDADLFKKTIPAEGFTQ DDINKMPYMLHLPCVNVTSDGKLQSPDGSYDARLLLRRLATRLTVNWEIDAALKNAGYAL KEVKLCQIPAAFRLLASPVETQWGMTYPSEVVEFIDYYRLTNASELAAGKKTVWIPANVR GTSAKATSPYYRTKENAPTAASYVELVVDNAVKKERLYYRAYLGGQESTDFNLYENKDYT WKLSVKSTAYQTDKRIQLLDQSPVQSTNLVETSNCFMMKPGTNICFNPYKHEAKMNGYNV PLSGWNTHLTDGSTLADNKKITDVKLVWQTKDDATSGDLVMGYAISGDDHSNLARITDGG DLQKARIHVKVPVSKGGNALIAAYSGSKIVWSWHLWITDYVPQGITSSVTYAQAQQLTQN GSVHQYATAAFKSSGMHVGKVIMDRNLCATAGGFPGENASLLEFARRIGYLYYWGRKDPF LGSTDGTANELNVIYDGEGRGVQLEKVAYSDITLVNGNTLQYVIEHPDHIITGSSSDQNQ SKCSWYSLNETTADYQYLYNNSKTLYDPCPAGWKIPHQTVYNGWGKSQAYWFNVNGTFVE NGSTHDRGGRLYNVSGGNGVPSPRTEDNTAWFPVTAYRSFSDGKLIFNGSAAGYEGTNTI AKNGNNYRIYYTKIAAGELSTPNNAWGMIGEPYPFRCVQE >gi|226332017|gb|ACIB01000039.1| GENE 33 43806 - 44951 688 381 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301163476|emb|CBW23027.1| ## NR: gi|301163476|emb|CBW23027.1| putative exported transmembrane protein [Bacteroides fragilis 638R] # 1 381 1 382 382 581 84.0 1e-164 MAKRQYIRRWTGKWALGCLLAVSGLWTVSGLLAGCSEESVETSEELRLVLKLPPELTVDT RGEINNIPVNTIWVLQYATESGVSKLKQKAFFGTIGNPGSNQTIEVNTKNDDVAFVQAES RFYVIANVDQSFLSGFTGTEAELKAKTVPFSGYRNKPSLLTSGPLEYKPDPNKPGVIPLV VPLRRAYATISLSWKKMGDLADPKNPSNLVVKSVSLYNVPTHMALYTRGGGSIKDKYPAD NDGSTAITSTNGTEITTNWASSSSFEFYMPENLRGMGTAASFMEKSIPAKGPDGTLDYCT YIKLSGKYVYAGAKDSIGVNYLLHLGGNLMTDYNIRRDYLYNLTVNISGANSADVRVTIT DGNVVMFDAVEVLPVNNVVFK >gi|226332017|gb|ACIB01000039.1| GENE 34 45001 - 46008 847 335 aa, chain - ## HITS:1 COG:no KEGG:no 
NR:gi|301163477|emb|CBW23028.1| ## NR: gi|301163477|emb|CBW23028.1| putative exported lipoprotein [Bacteroides fragilis 638R] # 1 335 1 335 335 560 89.0 1e-158 MNKRLKLWIGCCLCLLGAPGLTGCSEQAPGEPGTEEGDPVSLRFSLYRAEADEASTRADA ATDMADGKTFCIYAFPAGASTTTTEPLDHKVYTVKGGVATGELYLYRGTYDLYLVSYNSS TEVPELKTDGTIQVSNGKDFMYTSLKGIVVQPNQTGENMMDVVLPAPFKRLGAQIKVSVA AKSGTHPVTPTSLVVNSFKMGGLRASLPYTLGSTAWGTVANETFATTQTFTGFTYNTTGQ TVTIPRVSTPVVVLPVDGSAMISFDVNLTVGYKDNGDKTLTETYPAEIQKVLLPGMTYTF DFTLTFYGILDPADLTLAIGEYESTVTLDSDEMGK >gi|226332017|gb|ACIB01000039.1| GENE 35 46069 - 47058 730 329 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565721|ref|ZP_04843176.1| ## NR: gi|253565721|ref|ZP_04843176.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 329 9 337 337 605 100.0 1e-171 MKKPANRLYMALALGAGICLTASCASDDGFAADEGALELQRIEVEGEESITRAEATAITE VDVYATNTAHKAYGDNPLMTFSLKEGVWTPDKQTIMNSTSGDALLYAYYPPAAIQASGDG EHTTAVNLPSSLTDFLATGQADYLYGVGTDTGTAPVTATLSSRTVSFKMKHALAKVSFHI VKSASATEALKLIQVDVLSGTNRLRTGIGTMNLKSGLLNSLSDISTLTLAQASGVELKLK ENQKGPNVTCLVAPMAKEETVLSFRLKVCVDGEAAAKAHTFETQSLTAQWEAGKHYVYLI TVDKMGGTLSSVQIEDWKNDANQNTSIGI >gi|226332017|gb|ACIB01000039.1| GENE 36 47107 - 47958 525 283 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565722|ref|ZP_04843177.1| ## NR: gi|253565722|ref|ZP_04843177.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 283 1 283 283 490 100.0 1e-137 MKKKKNRILSVSGAALCLFFPVLLAACSQEDALPRPLEVSAEVGNPATRATDAHADDYDK REFVAGDVIRITDGTKSANYQRVVTGTTGTWQPASGQTALTTTGSETFTASYPTAFTRIL ADQHTAINFWQSNRLTAKGVLDGNKATFSFAPEAAKVTLVVKYGNNDSDKKLPGTASLEG TGIATDVSTSETINAYGASAAGTSALQHTYVAIVKPGSRTFVIKVKAGDSGETEKKYTDK AAHTLKAGYNYQYNFTSTNELILNGITVERFVETTETDVGNAT >gi|226332017|gb|ACIB01000039.1| GENE 37 47955 - 48893 581 312 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301163480|emb|CBW23031.1| ## NR: gi|301163480|emb|CBW23031.1| putative exported lipoprotein [Bacteroides fragilis 638R] # 1 312 1 310 310 532 95.0 1e-150 MKTDRLYQTVLGCLLLTAGCSDNALTPSLPEGTPLTVSAAICASGEGRESETRAAVAAGT RAVAADNGYDRSTFAAGDKIRIIRSRNGSSSTPVDYTLNSASSGNSTGEWKPSVTGTGTE LLVESGATYQASYPIEYSGIRADQRKAGGEDYRLSNLLETPEKVAIGRDGTLSFTGESAF VHKGVKLTLKFSGKHTLSKDFTSMTVTGKGLYSGGASKDETVYLYHPGGTGDAKYTWHGI IAPLTSQQSIQVSVTDANGVVYDITLTCARAANSHYTYTLTLKNNVLVPTGQEIKEWQSG ESQTGTLVDVTP >gi|226332017|gb|ACIB01000039.1| GENE 38 48957 - 50099 589 380 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301163481|emb|CBW23032.1| ## NR: gi|301163481|emb|CBW23032.1| hypothetical protein [Bacteroides fragilis 638R] # 1 380 1 380 380 556 97.0 1e-157 MKNVFILLLLGTASLFCIACSGENSVLTDGDAVLLQPGVHAGATLASTRSMINDVGTGAD RISTIGIYLAAADGSLYPGSNAGGATFTKAASGTNWTSTPSTYVSLAKATVYAWAPAATA LKSDGGSGSGGNSGSGSGSGSGGSSSTVLPALPVSVPAGQTFDGGNTYDCSTTDYLYGSG SATVGSATAVTANSLSASPTLYLQHALSQVVFRIQNANDRTPDPTYDYVKKIKLTAAGGA TPFYCTTASGSGSAGTMSLTDGALSGLDAVGEISFTPSARPQQVGTSGPVTVAYGLVAPK AEAASGTQVTLTLTLGEQSSDVTERDLTLSTNAFNPAWQKGYRYVYTLTLGERGISLQPV DIKGWTEVSGGSSDVDPGWQ >gi|226332017|gb|ACIB01000039.1| GENE 39 50127 - 51299 827 390 aa, chain - ## HITS:1 COG:no KEGG:BDI_2990 NR:ns ## KEGG: BDI_2990 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 191 378 140 309 322 69 30.0 2e-10 MMINQTKYHWLMALPALLLAVGCSESILPEGQDTGVAELQVSPQVVLTRGAIDAGSDASA VGNALGSIAVYANSTTTNKTTNNYGLYTYSSSAWGNSSATDKIYLSAEEATIYAHYPAYQ LDSNGEFKSSGTALKATDSSGEYTDNSTINISVFPGKGGEANATIDFQSTDNSESNSSGS 
AKILAASGEVDYMYADQSTAVKASYKNGDTNKTGKVTLAMKHALAMVSFRVYADHTYKNT GAFTRLVLANKSGGTTILNNGGSPTMKIKDGTITVGQSPAAVTYTRDIANYTLPKAESSD ASAATTAKNNAKKASILVLPESTVDKSTVEATLTIDNQDYKVALPSSNSAWEAGKNYLYT VKLSGSELVISSVTVTEWTTVASSTDLDIK >gi|226332017|gb|ACIB01000039.1| GENE 40 51331 - 52317 682 328 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253565726|ref|ZP_04843181.1| ## NR: gi|253565726|ref|ZP_04843181.1| predicted protein [Bacteroides sp. 3_2_5] # 1 328 1 328 328 622 100.0 1e-177 MSTKHSSIVPSLFVLLSVWVAGCADAGYSSLSEERVQVQLCARDGVLSRASDAGNLPEQE FEATVALSTAKGDYASLSEACEGTWNAAVKTDGKMEWKTAGGSDVPVYPSSGDWLYLTAF SPTAVPSGGVASFVLTGQTDLLYAPELRGNKWEGERFAGNRWKPDRPLQFSHLLTQLSFK ACKAIADGVEVQITKIKVNEAKPCVSLPLSTGIPAFTATTEHPVGLTLDVSSDGGKKVIG TTPVEVGKIQVPPLEGGSTYTLKVETSIGTFDNVSLTFGDSNASGKNPLQAGMSHVVTLN IGDHELGITSVTVQAWAPVTVDGTLDAD >gi|226332017|gb|ACIB01000039.1| GENE 41 52333 - 53352 794 339 aa, chain - ## HITS:1 COG:no KEGG:BF3558 NR:ns ## KEGG: BF3558 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 2 339 3 290 290 64 28.0 7e-09 MKTKLSVAAMALLLASCSSNDSDLLPENPVNATPIQVSQSVQGMVKTRAAVAEGSSVTAT VLMHDGTASDWTGFTAVAKNVLDASDNTLTTRATVSNATFTVGSSAVKLGLNPTLYYDVA DKTKNSFLVAVSPAGEVNGTVVTLLQTDGQQDVMYAPEAEAGNSNSVQTAALTFGHLTTQ LNFAMKLQPADSEGEWTDKTVSLKSIEVQEAYRPVSVDAKSGDVVWDNVNSGTLPVPGIS NNTLGKESVKVGVPVMIRPASKLTVCAIITVGGTDIPFHNVPVKDANDSGKDLTTETGKA HLITLTVKEPKKASGATVVSATATVTPWKSGAAGSADLE >gi|226332017|gb|ACIB01000039.1| GENE 42 53387 - 54334 707 315 aa, chain - ## HITS:1 COG:no KEGG:BF3558 NR:ns ## KEGG: BF3558 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 28 305 28 289 290 67 25.0 6e-10 MILKLKYILKAAFLSQLLFSCTERSLEPRPEPEPVPIRLFTGIRTRATVDVFGDTPVCVA YGVRTGQYDGSWDGIATDNEIRLMPERYYPSDGSALYLRGYYPPAPLTADGRLTYRLTGE EDLMLTGEQNGSLSAPFGESDSQSLLYRHLLTQLIFTLTVEAEDTGALRVRSLHLNGLTD EVTLSLAEESLLPGERTVSVPVYEAGEEAEGLPFDGNKLTLPGYVLVQPSATLTIDLVLS VDDDRTHDREYRALPVRFEGGEGEGGTAYTVRVELPDPVIPDPVEIVAVATVGRWQEGDS GSGELVDNRSAETEQ 
>gi|226332017|gb|ACIB01000039.1| GENE 43 54350 - 55507 625 385 aa, chain - ## HITS:1 COG:no KEGG:BF3849 NR:ns ## KEGG: BF3849 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 29 121 123 215 493 67 36.0 9e-10 MNRILLLLGYILLLSAPVCGRGVGTRAVSDCDSLHTFRFVQGRDVLYADYQGNNHELKRL FAAVSSFRGPIADGKMRIYVDGHCTSGGSVHDNLWLAYVRSSRVKSELITRKGMKEEYFV THNHAVCYPGEERDVVIVCLREPAGADILSAARRNYGMYASRYPLLPPAVPSAPDNRPCP PPILFVPVGPAGHPGTVAGSGAENPRRMLFSLKTNLADWSGLTAEGRLGAFRPNLALELH FARRWSVMASAAYSDWKGGRDHRFWGVSGYSLEPRLWLKGDGRYRWFYLGVYGQSGDFDY RPSPAGDTDATGRSRTGRYRHAGLSAGVYVPLSRHWGIEAGVRGGYLHSSAKAYDNEPPH AYYHHPASFVRWGLTGINLGLGYRF >gi|226332017|gb|ACIB01000039.1| GENE 44 55531 - 55923 329 130 aa, chain - ## HITS:1 COG:no KEGG:BF2876 NR:ns ## KEGG: BF2876 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 128 1 124 124 69 38.0 4e-11 MLTILTQEAIRVLRYIYYRDAGISCPPDSSDCAFRNVSVLLPLLERGGLIRCICPESPDS PVSYELCKPLGSIDLLSLLLILHEGVCPVSPDVDEQRVYGRYGSVASRMGVVNQMMRSIF SEIHLTELCL >gi|226332017|gb|ACIB01000039.1| GENE 45 56780 - 57353 493 191 aa, chain + ## HITS:1 COG:no KEGG:BT_1928 NR:ns ## KEGG: BT_1928 # Name: not_defined # Def: transposase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 191 1 191 409 376 100.0 1e-103 MKVEKFKVLLYLKKSEPDKTGKAPIMGRITLNRTMAQFSCKLSCTPGLWNARESRLNGKS REAVETNEKIERLLLAVHSAFNSLMERKRDFDAAAVRDMFQGNAGMQMTLLKLLDRHNGE MKARVGVDRAPTTLSTYLFTYRTLSEFIKAKFKVPDLVFGQLNEQFIRDYQDFILLEKGY AVDTLRGYLAI Prediction of potential genes in microbial genomes Time: Tue May 17 23:26:01 2011 Seq name: gi|226332016|gb|ACIB01000040.1| Bacteroides sp. 3_2_5 cont1.40, whole genome shotgun sequence Length of sequence - 24935 bp Number of predicted genes - 16, with homology - 15 Number of transcription units - 9, operones - 3 average op.length - 3.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 3 - 656 426 ## BT_1928 transposase 2 1 Op 2 . 
+ CDS 676 - 1887 829 ## BF3004 tyrosine type site-specific recombinase - Term 1877 - 1937 14.7 3 2 Op 1 . - CDS 1952 - 2245 262 ## BT_1930 hypothetical protein 4 2 Op 2 . - CDS 2462 - 2581 69 ## gi|253570444|ref|ZP_04847852.1| conserved hypothetical protein - Prom 2603 - 2662 4.1 + Prom 2918 - 2977 4.3 5 3 Tu 1 . + CDS 3123 - 6671 2651 ## COG1002 Type II restriction enzyme, methylase subunits + Term 6692 - 6718 -1.0 6 4 Tu 1 . - CDS 6598 - 7329 384 ## BT_1932 hypothetical protein - Prom 7385 - 7444 6.8 + Prom 7290 - 7349 7.6 7 5 Tu 1 . + CDS 7520 - 7792 231 ## COG3328 Transposase and inactivated derivatives - Term 7534 - 7570 -0.1 8 6 Op 1 . - CDS 7739 - 9931 1460 ## BT_1934 hypothetical protein 9 6 Op 2 . - CDS 9934 - 12135 1546 ## BT_1935 hypothetical protein 10 6 Op 3 . - CDS 12142 - 14289 1619 ## BT_1936 hypothetical protein 11 6 Op 4 . - CDS 14347 - 15909 1336 ## BT_1937 hypothetical protein 12 6 Op 5 . - CDS 15933 - 17204 1029 ## BT_1938 hypothetical protein 13 6 Op 6 . - CDS 17218 - 20079 1626 ## BT_1939 putative outer membrane receptor - Prom 20103 - 20162 9.3 + Prom 20071 - 20130 9.1 14 7 Tu 1 . + CDS 20176 - 20280 109 ## 15 8 Tu 1 . - CDS 20999 - 22879 793 ## BT_1940 hypothetical protein - Term 23952 - 23983 0.1 16 9 Tu 1 . 
- CDS 24137 - 24925 520 ## BT_1945 conjugate transposon protein Predicted protein(s) >gi|226332016|gb|ACIB01000040.1| GENE 1 3 - 656 426 217 aa, chain + ## HITS:1 COG:no KEGG:BT_1928 NR:ns ## KEGG: BT_1928 # Name: not_defined # Def: transposase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 217 193 409 409 429 100.0 1e-119 KKICRIAYKEGHSEKYHFCHFKLPKQKETTPKALSRENFEKLRDLEIPEKRRSHVITRDL FLFACYTGTAYADAVSITRKNLFRDDEGSLWLKYQRKKTDYLGRVKLLPEAVALIEKYRD DTRETLFPPQDYHTLRANMKSLRLMAGLSQDLVYHMGRHSFASLVTLEEGVPIETICKML GHSNIKTTQIYARVTPKKLFEDMDRFVEATRDLKLIL >gi|226332016|gb|ACIB01000040.1| GENE 2 676 - 1887 829 403 aa, chain + ## HITS:1 COG:no KEGG:BF3004 NR:ns ## KEGG: BF3004 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 403 1 403 403 800 100.0 0 MRSTFKLLFYINRNKVKSDGTTAVLCRISIDGKKSAVTTGVYCKPGDWDSKKCEIKTARE NNRLAAFRSRLEEAYGNLLRNQGVVTAELLKTTVSGANSVPEYLLQAGEVERERLRVRSK EINSTSTYRQSKTTQLNLRQFIESRGMKDIAFSDITEEFAESFKVFLKKELGHRNGHVNH CLCWLNRLIYIAVDREILRANPIEDVAYERKETPKLRHISRSELKRMMETPLPDPMMELA RRTFIFSSLTGLAYADTRALHPRHIGTTSEGRRYIRIRRAKTDVEAFIPLHPIAGQILEL YNTTDDDRPVFPLPVRDVLWYEVHGMGVALGMKENLSYHMARHSFGTLTLTAGIPIESIA RMMGHTNIDSTQVYAQVTDRKISSDMNRLMERRKPAAGKEAAG >gi|226332016|gb|ACIB01000040.1| GENE 3 1952 - 2245 262 97 aa, chain - ## HITS:1 COG:no KEGG:BT_1930 NR:ns ## KEGG: BT_1930 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 97 1 97 97 169 100.0 2e-41 MEGIIDKENERVRRFFALLDDMEKKVERLARDNRPPFNGERFLTDRELSGMLKISRRCLQ DYRDQGRIPYIQLGGKILYRQSDIERLLEENYHPALV >gi|226332016|gb|ACIB01000040.1| GENE 4 2462 - 2581 69 39 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253570444|ref|ZP_04847852.1| ## NR: gi|253570444|ref|ZP_04847852.1| conserved hypothetical protein [Bacteroides sp. 
1_1_6] # 1 39 17 55 55 67 100.0 3e-10 MEICYIEAGVLERMLARAENLSARVDRLYERNRCKEPGE >gi|226332016|gb|ACIB01000040.1| GENE 5 3123 - 6671 2651 1182 aa, chain + ## HITS:1 COG:jhp1409 KEGG:ns NR:ns ## COG: jhp1409 COG1002 # Protein_GI_number: 15612474 # Func_class: V Defense mechanisms # Function: Type II restriction enzyme, methylase subunits # Organism: Helicobacter pylori J99 # 6 1181 7 1164 1252 617 36.0 1e-176 MGLLKPNQVLNKAYRQVAIETTDFDLFKNALRTLRDNIVDGQREHTQKEHLRNFLSETFY KPYYMAPEEDIDLAIRLDKTIKSNIGLLIEVKSTTNKGEMISNDNLNRKALQELLLYYLK ERVNKKNNDIKYLIATNIHEFFIFDAHEFERKFYQNKQLRREFQDFVDGRKTSNKTDFFY TEIATTYIEEVKDSLEYTYFNLQDYQHLLDRTDSSASRKLIELYKIFSDTHLLKLSFQND SNSLNRGFYTELLHIIGIEERKENNKTVIVRKAVERRDEASLLENTINQLDAEDCLRHIN GRLYGNDYEERLFNVAMELCITWMNRILFLKLLEAQMLKYHNGDAIYKFLSITKIHDYDD LNTLFFQVLARDMGSRTHSIMRDFAYVPYLNSSLFEVTDLESKTIKINSLSQRTVLPVLA SSVLRNKKRNLQVNALPTLQYLFAFLDAYNFASEGSEEVQEEAKTLINASVLGLIFEKIN GHKDGSVFTPGFITMFMCREAITKTVLQKFNGYYGWNCTTRIELYNHIDNIVEANELINS LRLCDPAVGSGHFLVSALNELILLKYELGILVDATGKRIRKADYQLAIENDELIVTDTEG NLFAYNPLNAESRRMQETLFKEKRQIIENCLFGVDINPNSVKICRLRLWIELLKNAYYTA ESNYTYLETLPNIDINIKCGNSLLHRFALTDSIQTVLRESSISISQYKEAVAKYKNAQSK SEKQDLETFITEIKSKLKTEINRRDARLVRLNKRRSELANLQAPQLFEPTKKEKKASDKR IADLKKEIATLENIFEEIRSNKIYLGAFEWRIEFPEVLDAEGNFLGFDCIIGNPPYIQLQ SMGKSADVLECMGYITYARTGDIYCLFYELGMNLLTPNGFLCYITSNKWMRAGYGEALRG YFASKTNPIMLVDFAGIKIFDAITVEANILLSQKAANIFNTQACLVQDSNGLNNLSDFVQ QQGVKCNFADSIPWMILSPIEQSIKQKIESVGIPLKDWNIQINYGIKTGFNDAFIISTEK RDEILANCQTEDERVRTAELIRPILRGRDIKRYEYEWADLWIIATFPSRHYDIESYPAVK NYLLSIGIERLEQTGETHIVNGKKIKARKKTSNEWFETQDSISYWEDFSKPKIVWKIIGN QMAFAYDANNYVMNNACYIMTGDHLDYLLAVLNFPITEVTFV >gi|226332016|gb|ACIB01000040.1| GENE 6 6598 - 7329 384 243 aa, chain - ## HITS:1 COG:no KEGG:BT_1932 NR:ns ## KEGG: BT_1932 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 229 1 229 235 447 99.0 1e-124 MVKIQKISEIEPRLGFTEFDMLKKYRQSFATSELGRLHALFPFSELTRQMHLKSSALGRK SYFSPEGKIALMVLKSYTNFSDAQLIEHLNGNIHYQLFCGVQIDPLHPLTNPKIVSAIRQ 
ELAHRLDVEPLQLILAEHWKPYLENLHVCMTDATCYESHLRFPTDTKLLWEGIVWLHRHL CKHCQTLHIQRPRNKYLDVRRAYLAYSKLRKRRKSQTRMITRRLLQLLENSILPTDNPND RLS >gi|226332016|gb|ACIB01000040.1| GENE 7 7520 - 7792 231 90 aa, chain + ## HITS:1 COG:SMa0384 KEGG:ns NR:ns ## COG: SMa0384 COG3328 # Protein_GI_number: 16262658 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Sinorhizobium meliloti # 14 71 106 163 400 61 44.0 4e-10 MSLYSVDNIKEKLVISLYAKGMSVSDIEEEMREIYEIELSTSAISIITNKVNQAAQEWQN RPLDPVYLIVWMILPILTFQPINCFGNFVH >gi|226332016|gb|ACIB01000040.1| GENE 8 7739 - 9931 1460 730 aa, chain - ## HITS:1 COG:no KEGG:BT_1934 NR:ns ## KEGG: BT_1934 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 17 730 1 714 714 1436 100.0 0 MNKYILLAITALCLQDMQAQTVVHPSIKTKTTFAIVVDQKSYDEAKSEIDAYRTSIEKEG LGTYLLIDDWKRPEPIREQLVKLHENEKTPLEGCVFIGDIPIPMIRDAHHLSSAFKRSPK ANWQKSSVPSDRYYDDFGLKFDYIKQDSLIPDYHYMTLRADSKQYISPDIYSARIRPLHL EGENRYQMLRDYLKKAVAEKAKQNAFDQLTMARGHGYNSEDPLAWSGEQIALREQLPQIF KSGNTVKFYDFNMRYPMKPLYLNEIQREGLDVMLFHHHGGPTMQYINGYENGSGINLSIE NAKIFLRSKVPSYAKKHGREAAIKEYAKQYGVPESWCAEAFDEEKIKSDSIVNRNMDIYT EDIRLLTPNARFILFDACFNGSFHLDDNIVGSYIFNKGKTIATMGCTVNTIQDKWPDEFL GLLAAGMRIGQFTRFTCFLENHLIGDPTFHFTNNAGLDMDINQALVAQEGNVTFWKKQLN SPMADMQAMALRQLSMANYSGLVELLKKSYHESNYFVVRLEALRLLALNYPTEVADVLQT AMNDSYELIRRYAVEYVEKNCNPELLPAWIESYLLRGHENRHRFRIFSAINTFDHDMALN ELKKQAADWSFYDSSYVNELLEYLPRQKKGLERDFALIDSPESTTKQIQSEISRFRNKPI AKAIEPLLNIIKNESQEEELRILAAETLGWYNLYYNKADIIKELNTFRTSNQKLMNEVTK TINRLKSQNR >gi|226332016|gb|ACIB01000040.1| GENE 9 9934 - 12135 1546 733 aa, chain - ## HITS:1 COG:no KEGG:BT_1935 NR:ns ## KEGG: BT_1935 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 733 1 733 733 1449 99.0 0 MIKKIIYLAFLLPLAGNAQTTVIKPLVKQPTAFAIITDNQTYANTKDAMHQYKTAVEDDG LATYLISGDWQNPDQVKQIIIKTYQECPSLEGLVLIGDVPVALVRNAQHMTTAFKMNEKA FPWDQSSVPTDRFYDDLNLKFEFIRQDSVNHQHFYYKLTEDSPQRLNPTFYSARIKYPEK 
KEGDKYAAIASYLKKAAAAKADKHNQLDRVFSFNGASYNSDCLIVWMDDEKAYMENFPLA FGRQMGFKHWNFRMKHPMKYKLFSELQRKDLDLFMFHEHGMPTGQLINDELACTDFNNRY KMLKSTLYNAVMSHVGKRDKDTLRIQMQEKRQVNEVFFKDLDNPKFWEADSLHYADERIV TEDLMKRNLSTNPKMIMFDACYNGSFHENDYIAGQYIFNDGQTLVAQGNTRNVLQDRWTI EMIGLLSHGVRAGQYNKLIVSLEGHLFGDPTFRFAPIEANTLSTDITIHKDDKAYWKNLL NSPYADVQSLAMRMLADADTQKELSPLLLKKYRESGFNTVRMEAIKLLSRYQDDNFIEAL REGLNDTYEMVARQSAIYAGFVGDDSLLPAIVEALVEHNERLRVQMSANKALSLYPKEKV EKTIEDFYAKVDRLNENEEKKRLLRSLERMFVQEAKVHQTLMDVAAPEAKRISAIRNVRN YTFHFHVDDYLNVIRDAGNPQEVRVVMAEALGWFTNSVQRPHILEEIKKMQQTANLPEDL KAELEQTIKRLSL >gi|226332016|gb|ACIB01000040.1| GENE 10 12142 - 14289 1619 715 aa, chain - ## HITS:1 COG:no KEGG:BT_1936 NR:ns ## KEGG: BT_1936 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 715 1 715 715 1444 100.0 0 MLCSILSLRAQTFVKPAVKVKDTSFAVITDKGTFQACEAELKAYQEILGMEGLPTFIVYN EWNKPEDVKKVIVKLYKKDKLEGVVFVGDIPIPMLRKAQHMTSAFKMDEKNNDWRDSSVP SDRFYDDFDLQFDFLKQDSVENNFFYYNLAIKSPQQIRCDIYSARVKAVDNGEEPHAQIS RYFKKVVAEHQINNKLDQFFSYTGDGSYSNSLTAWTPETFTIREQMPGVFDKEGRARFIR YNFSDYPKDDVINMLKRTDLDLSIFHEHGMPERQYLSGSPATNRWNAHVDAMKYYYRGLA RRKQNNKKSFDEMLDMMKNTYGLDTTWIAGYDDPKVIAEDSLLDLRTGIILSEVTEFKPN SRMVIFDACYNGDFREKDYIAGRYIMSEGKCVTTFANSVNVLQDKMANEMLGLLGMGARV GQWAKLTNILESHITGDPTLRFQSINEVDANALFKEPYSESRMLELLQSPYADIQNFALH NLYRNDYPGISDLLRKTFETSSFMMVRFTCLALLEKISDKNFREVLHLAITDSYEFIRRT SVRMMQHVGLNEYVYPQIKAYVEDNLSERVAFNVSLGLQVFDQAAVQAAIDKVMAETYVL QDKEEMRKVLENANNSRSMQKELLSKETSERWRILYCNSLKNHMAHACVDGLLALLTDSS ESEKLKTCLLEAFAWFTHSYRKPDILRVCDQLRKDKSLSENLREEADRTYYRLKN >gi|226332016|gb|ACIB01000040.1| GENE 11 14347 - 15909 1336 520 aa, chain - ## HITS:1 COG:no KEGG:BT_1937 NR:ns ## KEGG: BT_1937 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 520 1 520 520 1001 100.0 0 MKKQYMNYMKSCLLVMIACLVSMVGAAQGFSPAAMEQLKTRRLWSHSQNAAGMPFDDIQN YSNVILGYDLQDGNYCRPQEGQKEAIVGVSSEGFINLKNAYVWGAFNFAQKNLTDAGYNA SIADPFRGMPYYVADQHLSKWRNQYYDLKFRAATPLLGNHWALGLEGNYVATLAAKQRDP 
RVDTRFYTLGLTPGITYKLNNSHKFGASFKYSSIKEDSRMSNVNSYVDQDYYILYGLGTA IKGIGSGVTSNYIGDRFGGALQYNFSMPSFNLLLEGSYDVKAETVQQSYTTPKKIAGVKD KTAHVSLTMIQEGKDYTNYMRTTYTNRNIDGIQYISQRDNSESQSGWVELYNNIRSTYKA QTASLNYALSRNRGNEYSWKAELNVNYTKQDDEYLMPNSVQNAENLSLGLGGKKNFVLGN SLNRRLLIDVHVAYNNNLGGEYVYGGSHADYPTVTELQQGLTNYYTCDYYRIGGSITYSQ QVRENRRMNLFAKVVFDRVNTSDYDYDGRTHLSISLGCNF >gi|226332016|gb|ACIB01000040.1| GENE 12 15933 - 17204 1029 423 aa, chain - ## HITS:1 COG:no KEGG:BT_1938 NR:ns ## KEGG: BT_1938 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 25 423 1 399 399 795 100.0 0 MKKYLIYLFTLASTLLIGCDSFRDMSGTAEVNPITVDVYLDITVENISTLKDLTVKFDNY DEDLHYVKEVTDNSVKVDGIIPGIYSVTVSGTAIDTENNEYYINGNSVNAALFKHGSALN IEVQGLKVSPLIFKEIYYCGSRPEKGGVYFRDQFYEIYNNSADILYLDGIYFANLTPGTA TTKLPIWPEADGNNYAYGERVWKFPGNGTEYPLAPGESCIISQFAANHQLDIYNPQSPID GSSSEFEFNMNNPNFPDQAAYDMQHVFYQGKAEMGSIPQYLTSVFGGAYVIFRVPEGEAW DPVNDENMKTTDLSKPNSNVYYAKIPIKYVLDAVEAVNNESKMNAKRVPGVLDAGITWVG ATYCGLGIARKLSTDEEGNPIIREETGTYIYQDTNNSTDDFERGVVPVMRRNGAKMPSWN HTL >gi|226332016|gb|ACIB01000040.1| GENE 13 17218 - 20079 1626 953 aa, chain - ## HITS:1 COG:no KEGG:BT_1939 NR:ns ## KEGG: BT_1939 # Name: not_defined # Def: putative outer membrane receptor # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 953 1 953 953 1825 100.0 0 MKKGILGCNIIYLFILICIGIALPIKSQNNYKVKLTGTVYEYDHNNKRLPLEFAAVSIPE IALGTTSDENGRYILENVPTGKIRMQIQYLGKVSIDTLINVNKDLVLNFTMRNEDFKLKE VTVTATNSRSGKSTSSHISRSAMDHMQATSLYDVMSLMPGGISQNQDMSSAQQINIRQVS SSSGPEAPMNAMGTAIIRDGAPISNNANLSAMSPTVLSGTETPASLAGGASPAGGTDVRS ISTENIESIQIVRGIPSVEYGDLTSGAVIINTKAGREPLRVKAKANPNIYQVSMGTGFEL GKKKGALNVSADYAYNTNNPISSYQHYQRATTKLLYSNTFFNNKLRTNSSFDFIYGKDQR ERNPDDEQTKTASEGRDIGFTLNTNGTWNINKGWLKTLRYVLSGTYMDKDSYYETVYSSA TSPYSMTTTNGAVLSNFAGQHIYDANGNQITNFGPEDINHYAVYLPSSYLGHYEIDSREV NLFAKVTSSLFKASGHVNNRILIGADFRSDGNVGKGKTYDPSTPPYRSQYGHNSSFRPRN YKDIPFINQFGAYVEDNFKWSISGTHDLNIQAGVRYDHTSVVGGIFSPRVNASIDLIPNL LSLQGGYGIAAKMPSLLYLYPENAYFEYININELTNENIPESQRLFMTTTEVRQVDNSDL 
KIAQNHKAEVGFNLRVGKTNLNVIAYKERLKDGYVMSQTFNTFNTFIYNEYQRTENGIEL SSSLPVLSTYAKPTNNLNIETKGLEFDLNIGRIDAIRTAFQINGSWMRTKSWRQGYSFYD NSEDAASARKPVAIYSQEGNASYKQQFVTTLRATHNIPRIGFVVTMTAQAIWQQSNWNTF GNDSIPVGYLALEDASVNMFPKGKYTTTQQVKDAGYGYMLNNVSHNNAIKESYSPYFCFN LNVTKEISNMLRVSFFANNMFRSYPRRESKRNPGSYIQLNNRFFFGLELSLTL >gi|226332016|gb|ACIB01000040.1| GENE 14 20176 - 20280 109 34 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLFLLVPLHNLTDLNRLLSNNIATALYLGKSNIY >gi|226332016|gb|ACIB01000040.1| GENE 15 20999 - 22879 793 626 aa, chain - ## HITS:1 COG:no KEGG:BT_1940 NR:ns ## KEGG: BT_1940 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 212 626 122 536 536 848 99.0 0 MKRKVLLSTLFLILHFTVALAQQVTISGQVLDEKSEPLIGATINIEGTTNAVITDLEGKF TIKVLPSEKLVISYLGYKPKTITIGKNRRFDIILDPSVTEMDEVVVVGYGSQRKSDIATA VASVNIKDIVNSSSTQTLQALQGKISGVQIIPTDGSLSSGMTFRIRGVNSVTGGTQPLFV IDGVPMPTQQITNEDTETVNNPLLGLNPNDIESNLNYWNPRPIKKDNPYLTVQRKLSPQT VEDFANRLFIYQVGKNNHIGFPFRKPSQMEILNFEMRNYFAETNTNYKAFATGGDKAQSC WMANFVPFDKVTDIYLFESAIDAMSFYEINHYTKETTCAFISTGGYVTKSQIENISRIFP SDKVKWNCCYDNDASGNGFDITTAYYLKGEECKAFARTNTGDTYKTIYLSFPDGNTQTFK EDAFSSGEYLKQHGIDNVNIIKPSRYKDWNELLVYYKRFDLNLGPGMKFIPAIEKTISQL NLRGYEQLANSISSSTKELVDSLLEQANYCISAPLAESGAYTLMVDCNIFMGLDTMVPVP SNLYVIEKCTQKKISAHAINEFLKKEYINIFRDMSSSDFKNFLEKDILTYTKGAVEKNFE KVILTFGWSLKPSILKKKSFDLEHGI >gi|226332016|gb|ACIB01000040.1| GENE 16 24137 - 24925 520 262 aa, chain - ## HITS:1 COG:no KEGG:BT_1945 NR:ns ## KEGG: BT_1945 # Name: not_defined # Def: conjugate transposon protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 247 1 247 260 499 100.0 1e-140 MMKKEKELLVAIASQKGGVGKSVFTVLLASVLHYRKDVRVAVVDCDSPQHSIALMRERDM ENVMKNDDLKVNLYRQYERIRKPAYPVIKSDPEKGVEDLRRYMDEKGETFDIVLFDLPGT LRSEGVVHTVAAMDYIFVPLKADNIVMQSSLQFTKVLEEELIAKGNCNLKGIRLFWNMVD RRGRKNLYDAWNRVIHRMGLRLLSSHIPNTLRYNKEADPVCKGVFRSTLFPPDPRQEKDS GLPELVEEICHAIGLEESDTER Prediction of potential genes in microbial genomes Time: Tue May 17 23:27:23 2011 Seq name: 
gi|226332015|gb|ACIB01000041.1| Bacteroides sp. 3_2_5 cont1.41, whole genome shotgun sequence Length of sequence - 22578 bp Number of predicted genes - 22, with homology - 21 Number of transcription units - 12, operones - 3 average op.length - 4.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 551 102 ## gi|160890441|ref|ZP_02071444.1| hypothetical protein BACUNI_02883 + Term 563 - 598 4.1 2 2 Tu 1 . - CDS 500 - 916 180 ## BT_1947 hypothetical protein - Prom 936 - 995 5.5 + Prom 874 - 933 8.2 3 3 Tu 1 . + CDS 993 - 1247 122 ## BT_1948 hypothetical protein + Prom 1549 - 1608 3.9 4 4 Tu 1 . + CDS 1630 - 1902 112 ## + Term 2016 - 2052 -0.5 - Term 1670 - 1713 2.3 5 5 Op 1 . - CDS 1831 - 2397 310 ## BF3025 hypothetical protein 6 5 Op 2 35/0.000 - CDS 2394 - 3185 195 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 7 5 Op 3 33/0.000 - CDS 3149 - 4129 677 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 8 5 Op 4 . - CDS 4130 - 5269 1000 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component 9 5 Op 5 . - CDS 5281 - 7359 1203 ## BT_1953 putative TonB-linked outer membrane receptor 10 5 Op 6 . - CDS 7378 - 8478 788 ## BT_1954 putative surface layer protein 11 5 Op 7 . - CDS 8507 - 10597 1166 ## BT_1955 putative cell wall biogenesis protein 12 5 Op 8 . - CDS 10612 - 12393 1108 ## BT_1956 putative cell surface protein - Term 12408 - 12446 4.0 13 5 Op 9 . - CDS 12458 - 13375 812 ## BF3033 hypothetical protein 14 6 Op 1 . - CDS 14109 - 14438 67 ## BF3035 hypothetical protein 15 6 Op 2 . - CDS 14485 - 15378 653 ## BF3036 tyrosine type site-specific recombinase 16 7 Tu 1 . - CDS 15808 - 16596 743 ## BF3038 tyrosine type site-specific recombinase - Prom 16624 - 16683 5.9 + Prom 16570 - 16629 9.2 17 8 Tu 1 . + CDS 16813 - 17187 283 ## BF3039 hypothetical protein 18 9 Tu 1 . 
- CDS 17947 - 19548 1299 ## BT_0374 hypothetical protein - Prom 19635 - 19694 4.3 + Prom 19592 - 19651 4.3 19 10 Tu 1 . + CDS 19671 - 19982 332 ## gi|253565777|ref|ZP_04843232.1| predicted protein + Prom 20262 - 20321 3.7 20 11 Op 1 . + CDS 20342 - 20962 595 ## BT_1515 hypothetical protein 21 11 Op 2 . + CDS 20986 - 22350 1142 ## COG0305 Replicative DNA helicase - Term 22289 - 22321 3.0 22 12 Tu 1 . - CDS 22368 - 22577 239 ## BF3047 hypothetical protein Predicted protein(s) >gi|226332015|gb|ACIB01000041.1| GENE 1 3 - 551 102 182 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|160890441|ref|ZP_02071444.1| ## NR: gi|160890441|ref|ZP_02071444.1| hypothetical protein BACUNI_02883 [Bacteroides uniformis ATCC 8492] # 104 182 1 79 79 158 98.0 1e-37 LKVKHYCFRNKNNAPIGGDPLSWGTSLYDRAGQGLLQNHVRRFPVFFQMPRDKVVDEPLQ SDLSVSRGSRQDGQFPVQGGGNMQRPTLHARNAMQVFLPVDGRLHGAIFPCGTYNVVTGR STSLPGTGRRFPHDLTDAVRVHPAKTNPLFYVLSCHDMPSWTMTTFYRRGVPPGCPSSSQ GR >gi|226332015|gb|ACIB01000041.1| GENE 2 500 - 916 180 138 aa, chain - ## HITS:1 COG:no KEGG:BT_1947 NR:ns ## KEGG: BT_1947 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 138 1 138 138 262 100.0 3e-69 MDSEKEKNRLSDIVLERVGLTGNLLSAPVSPSLEPVVEIPSHGSQVRAGKVTGPEEYKRR FLVPAPRAAEWKTAYIDGRLHRRIAMLVRAAGCGSISGFIIRLLELHMEEHREDIASLLG EVYRPWDEDGQPGGTPRR >gi|226332015|gb|ACIB01000041.1| GENE 3 993 - 1247 122 84 aa, chain + ## HITS:1 COG:no KEGG:BT_1948 NR:ns ## KEGG: BT_1948 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 84 1 84 84 149 100.0 2e-35 MKPPVTYSHPKSVIVMIHMTGGTEMPFRSDGSSYAEATSEARSGVEKRAGFRFLVHRSFL ILMGRRESAHPSGYGFLNYRSMGV >gi|226332015|gb|ACIB01000041.1| GENE 4 1630 - 1902 112 90 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MWSAPADSKLCFRLTKTGRFNGEYTDKPLLISSSSVAFYISNYNSFEITKQIEPNVVSLS VYFMSESLVCYVAKFVFIMKKSAYENKIFR >gi|226332015|gb|ACIB01000041.1| GENE 5 1831 - 2397 310 188 aa, chain - ## HITS:1 COG:no KEGG:BF3025 NR:ns ## KEGG: BF3025 # Name: 
not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 188 1 188 188 391 100.0 1e-108 MSIYTVENFTSDITVEGYIAEFRDEPHFLELCKQCTNYGKSWGCPPFDFDTESFLRQYKY AHLMATKIIPEDKDIPIEYTQKLILPERIRIESELLDMERKYGGRSFAYIGKCLHCSDNE CTRNCGTPCRHPEKVRPSLEAFGFDIAKTLSELFNIELLWGKDGKLPEYLVLVSGFFHNE YELCNIAY >gi|226332015|gb|ACIB01000041.1| GENE 6 2394 - 3185 195 263 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 12 227 1 217 245 79 25 2e-14 MGGFTQQIHHRMIELQHFSIGYKENSLLHEVNATIKKGQLTALIGRNGTGKSTLLRAIAG LNRCYSGKIILDGHDIACMKTEDMAKTLAVVTTERTRIANLRCKDVVAIGRAPYTNWIGR MQETDKEIVMQSLISVGMEAYANRTMDKMSDGECQRVMIARALAQDTPIILLDEPTSFLD MPNRYELVALLRRLVHDEKKCIMFSTHELDIALSMCDSIALLDTPNLSCLTASEMQKSGY IDRLFQNENIRFDSLCGTMILKQ >gi|226332015|gb|ACIB01000041.1| GENE 7 3149 - 4129 677 326 aa, chain - ## HITS:1 COG:alr4032 KEGG:ns NR:ns ## COG: alr4032 COG0609 # Protein_GI_number: 17231524 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Nostoc sp. PCC 7120 # 6 323 22 356 362 237 48.0 2e-62 MRSRSTILFSILITLTVGLFLLDLAVGAVNIPIRDVWAALTGGNCSRATEKIVLNIRLIK AIVALLAGAALSVSGLQMQTLFRNPLAGPYVLGISSGASLGVALVVLAGIGSSIGIAGAA WVGAAVVLLVITAVGQRIKDIMVILILGMMFSSGVGAVVQILQYLSKEESLKAFVIWTMG ALGDVTSGQLLILVPSVFAGLLLAVLTIKPLNLLLFGEEYAVTMGLNIRRSRSLLFLSTT LLAGTITAFCGPIGFIGLAMPHVTRMLFQNSDHHVLLPGTILSGASILLLCDIISKIFTL PINAITALLGIPIVVWVVLRNKSITA >gi|226332015|gb|ACIB01000041.1| GENE 8 4130 - 5269 1000 379 aa, chain - ## HITS:1 COG:alr4031 KEGG:ns NR:ns ## COG: alr4031 COG0614 # Protein_GI_number: 17231523 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Nostoc sp. 
PCC 7120 # 51 375 93 420 426 226 36.0 4e-59 MNALKNLSLILLLSLAFTGCHNKSSKINDFNLLLYAPEYASGFDIKGAGGKESVLITVRN PWQGADSVTTWLFIVRNGEEVPEGFAGQVLKGDAKRIVAMSSTHIAMLDAIGEVRCITGV SGIDYISNPDIQARRDSIGDVGYEGNINYELLLSLDPDLVLLYGVNGASAMESKLEELDI PFMYVGDYLEESPLGKAEWMVVLSEVTGKREKGEKAFAAIPVRYNALKKKVADSTLGTPS VMLNVPYGDSWFMPSTQSYVARLITDAGGRYIYQKNTGNASIPIDLEEAYLLASDADMWL NVGMANSLDDLKASCPKFTDTRCFKNGEVYNNNARTNTAGGNDYYESAVVNPDIVLRDLV KIFHPELVQEECVYYKQLK >gi|226332015|gb|ACIB01000041.1| GENE 9 5281 - 7359 1203 692 aa, chain - ## HITS:1 COG:no KEGG:BT_1953 NR:ns ## KEGG: BT_1953 # Name: not_defined # Def: putative TonB-linked outer membrane receptor # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 692 1 692 692 1381 100.0 0 MKRHLILLFVGVSLPFLLAAQQKNSVSITKRVLRIPEVTVVGKRPMKDIGVQRTRFDSIA MKENIALSMADVLTFNSSVFVKNYGRATLSTVAFRGTSPSHTQVTWNGMRINNPMLGMTD FSTIPSYFIDDASLLHGTSSVNETGGGLGGLVRLSTSPANHEGFGLQYVQGVGSFSTFDE FLRLTYGDKHWQSSTRVVYSSSPNDYKYRNRDKKENIYDEDKNIIGSYYPTERNRSGAYK DLHVLQEIYYNTGEGDKFGLNAWYINSNRELAMLSTDYGNDMDFENRQREQTFRGVLSWD RVREKWKVGVKGGYIHTWMAYDYKRDKGNGEMASMTRSRSKINTFYGSADGDYAPSEKWL FTAGVSVHQHLVESADKNIISQEGNKAVVGYDKGRVEFSGSVSAKWRPVDRFAASLVLRE DMFGTEWAPVIPAFFIDGVLSKKGNIVAKASISRNYRFPTLNDLYFLPGGNPDLKSEHGF TYDVGLSFSVGKENVYALSGGINWFDSHIDDWIIWLPTTKGFFSPRNLKKVHAYGAETNA HLDIMLGKDWKLDMNGTFSWTPSINESEPMSPADQSVGKQLPYVPEFSATVTGRLSWRTW SLLYKWCYYSQRYTMSSNDYTLTGYLPPYFMNNVTLEKQLSFRWADLSLKGSINNLFDEE YLSVLSRPMPGINFEIFIGITPKFGKNKNSKR >gi|226332015|gb|ACIB01000041.1| GENE 10 7378 - 8478 788 366 aa, chain - ## HITS:1 COG:no KEGG:BT_1954 NR:ns ## KEGG: BT_1954 # Name: not_defined # Def: putative surface layer protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 366 1 366 366 751 100.0 0 MIRVLFFIRMTMSRTIQRICLFLFCLPVFGSCMKWDYGEMEDFSVSASGLFITNEGNFQY SNATLSYYDPATCEVENEVFYRANGFKLGDVAQSMVIRDGIGWIVVNNSHVIFAIDINTF KEVGRITGFTSPRYIHFLSDEKAYVTQIWDYRIFIINPKTYEITGYIECPDMDMESGSTE QMVQYGKYVYVNCWSYQNRILKIDTETDKVVDELTIGIQPTSLVMDKYNKMWTITDGGYE GSPYGYEAPSLYRIDAETFTVEKQFKFKLGDWPSEVQLNGTRDTLYWINNDIWRMPVEAD 
RVPVRPFLEFRDTKYYGLTVNPNNGEVYVADAIDYQQQGIVYRYSPQGKLIDEFYVGIIP GAFCWK >gi|226332015|gb|ACIB01000041.1| GENE 11 8507 - 10597 1166 696 aa, chain - ## HITS:1 COG:no KEGG:BT_1955 NR:ns ## KEGG: BT_1955 # Name: not_defined # Def: putative cell wall biogenesis protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 696 1 696 696 1297 100.0 0 MKIYFLPMLLSLFFLGACDKNDEIIPEDADENFITSVVMTVDGKSYTADITDNTVTITVP YTVSLNNAEVEFKYTTSATIIPDPETVTDWDNERTFRVTSYNGDAREYTYKVVKSEIESD GDVELKTTEEVASFAATKTTVVKGNLIIGSDAEEAEKITDISALASLKEVTGNIVIRNSY NGADLTGLDNIVSAGGLQVGSTDVASKATELHMISMKALETLSGDISVYNDQVTYVLFEK LATIEGSVMFNASSLQSFEFPVLTTVGQDLNLQGLNEENTAAGSIASLEIPELTSVGGVL SVNNLAKLTSMSFLKLKETGGLDFHTVPVMLETINLPEIETVNGSIIMEANMEAPPTGSF VPQRNDVLQAFGGMDKLTTIKGQIKIKNFTALKQLPDWSKITTLGSITLDYLEDVSGTLL LPNARFETFGETAPQIEIINKVQLSKIETAEDLSNVNFVITSLTNNKFPEITFKNIKDFT CKPTTNNTDYTISTIQHVYGNLNVTGQMRSNAKFPDLEIIDGYGYIQIPMFASITMPVLK EVGGQFYLSGNFTSCNLPLLSKVCCSASPVYYKEGEGSLAISLQSKSLDIPELLHVGGEG LFVNKATGITCDKLQTIDGTLQIKSATSLSQETLSMEKLETLHGVVFDGLTKFTDYTFFG KFIENGMITGESWSVTKCGYNPTFQNMKDKQYTQQD >gi|226332015|gb|ACIB01000041.1| GENE 12 10612 - 12393 1108 593 aa, chain - ## HITS:1 COG:no KEGG:BT_1956 NR:ns ## KEGG: BT_1956 # Name: not_defined # Def: putative cell surface protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 593 1 593 593 1164 100.0 0 MHRFHYFIISACMLFTSCNKDEVITEEVGGQPIIELDSETGIYTVKVDHELTIAPTYQNV EDALFAWTIDGTLVSSGPSLQRTWNECGDFYVKLRVDNAEGYAEEELKVEVKELTPPVIS LALPSQGLKVVRNTDYTFTPDIQHSDVEGFKIEWVREGKIVSTENTYTFNEKELGVYTVT INASNIDGTTTKDVSVEVVETMPYVVKFPTPSYLQTSTDRYTFADRPVFLRPLLEYFDNP RFEWSVDGQVMEGEVERMFKFTPSAPGEYTVSCTVSEDTPTEKISRNIDKGKTAVTATVK VVCVDKKEQDGFRASGSSKLWNKVYEYTPAPGQFINETSTIGGMTGNETSPEAAVAWATQ RLKDKLHVSLGSFGGYIIVGFDHSIPNSGNQYDFCVQGNAFDGSSEPGIVWVMQDINGNG LPDDEWYELKGSEAGKEETIQNFEVTYYRPEGKKMDVQWISSDGRNGWVDYLSAYHTQDY YYPAWISENSYTLTGTCLAARNTQDSQTGYWDNQSYDWGYVDNFGNDQIEGGSTVDGSGQ RNGFKISNAIHADGTEANLQYIDFIKIQCGVLAKSGWLGEVSTEVFSFEDLTK >gi|226332015|gb|ACIB01000041.1| GENE 13 12458 - 13375 812 305 aa, 
chain - ## HITS:1 COG:no KEGG:BF3033 NR:ns ## KEGG: BF3033 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 305 1 305 305 592 99.0 1e-168 MKRKLRFLAGACLFTATALFSGCSSDDDFLMDPVDSGTSQTRAVTNSDGTLTITFDDFDP GMLAGPTSAGENLYSYQGYPQVTTIYDNTPEEYLFLSMFNTVGGSTEYSSGGIALSNWNI RSNQSGNTGDWWYSYLNQCSVYNTAVEAEGQNKEAGHSGSNFGVVYGYVDAYNQAWMAKP EFYFNVPRKLVGLWICNTSYTYGVITYGNQFGSTGVATPLKEMKGYFQVNLECYDANGGL IRTYKRLLADYRNGQQQVDPITTWDYWEINAEGVQSVKFNFEGSDSGAYGLNTPAYICID DITIQ >gi|226332015|gb|ACIB01000041.1| GENE 14 14109 - 14438 67 109 aa, chain - ## HITS:1 COG:no KEGG:BF3035 NR:ns ## KEGG: BF3035 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 200 100.0 1e-50 MQLRGEVTPHIGRHAFAVLAILKGMLLETLQKVFGHKSITYVLRIHSLVRRRRGNLFLEV AHLYDQRADQLSQLLIRACFNNCQFHTKILFKILNMRQSNAHPISFPNR >gi|226332015|gb|ACIB01000041.1| GENE 15 14485 - 15378 653 297 aa, chain - ## HITS:1 COG:no KEGG:BF3036 NR:ns ## KEGG: BF3036 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 297 1 297 297 606 99.0 1e-172 MGRITINGTQAGFSCKKEVSLALWDVKTNRAKGKSEEARTLNQELDNIKAQITRHYQYIC DHDSFVTAKKVYNRYVGFSEECHTLMNLFREQLEPYKKKIGIEKAESTYCGLVADYKSLL LFMKSKKNAEDIVIEELEKSFIEDYYNWMLGTCALANSTVFGRVNTLKWLMYIAQEKGWI RVHPFASFECMPEYKRRSFLSEEELQRIIHIEPRYKRQRAMRDMFLFMCFTGLSYVDLKA ITYDNIHTDSDGGTWLMGNRIKTGVAYVVKLLPIAIELIEKYRGTDEKKDSPNVSFR >gi|226332015|gb|ACIB01000041.1| GENE 16 15808 - 16596 743 262 aa, chain - ## HITS:1 COG:no KEGG:BF3038 NR:ns ## KEGG: BF3038 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 262 4 265 265 467 92.0 1e-130 MNETLTVFMISEIERFREEGRESARRIYHNMLRLLRESAGKEEIGFEEVTPAFLGNYEFL LLGRGLSWNTVSTYMRALRAGYNRGMKGRPGYVTGLFDKVYTGTRSDVKRAVDARTVGRM IRMSGCPDESASAKAVDWFVLMFMLRGIPFVDLAHLRRSNLDKGVLTYCRHKTGQEVSIT VPREAMDIINRRMAENDHPSYLLPILGQPRTGKRRQEKVLTPYQEYQCELRNLNRRLERV SVDLRLGGRLSSYTAKHHTISI 
>gi|226332015|gb|ACIB01000041.1| GENE 17 16813 - 17187 283 124 aa, chain + ## HITS:1 COG:no KEGG:BF3039 NR:ns ## KEGG: BF3039 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 124 1 124 124 175 87.0 4e-43 MKKSIQDLKESLRKKEQKQQRLHEQEDKIEKDIRVLRYLIEREEQAASAQAPRKRKIARK QVSDFFTRIALFYEDLQRRLNLRITYRCFCRWLCTRYEFESRYHDRHKLSPCTVLGYFKR ERGG >gi|226332015|gb|ACIB01000041.1| GENE 18 17947 - 19548 1299 533 aa, chain - ## HITS:1 COG:no KEGG:BT_0374 NR:ns ## KEGG: BT_0374 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 17 532 1 516 516 831 76.0 0 MYPQMGLLLISGTIKNMSDKIYPIGIQNFEKIRKEGFFYVDKTALVYQMVKTGSYYFLSR PRRFGKSLLVSTLEAYFRGKKELFEGLAMEKLEKEWIEHPILHLDLNIEKYDSPQSLEDI LEKAIVSWEKLYGAEPSERSLSLRFAGVIERACKLTGHRVVILVDEYDKPMLQSIGDEEL QKEFRKTLQAFYGAIKTMDGYIRFAFLTGVTKFGKVSVFSALNNLIDLSMDERYVALCGI TEEEIRTNLDQELYELADRQRMGYEEVCRELKACYDGYHFVEDSIGIYNPFSLLNTFYKM KFGNYWFETGTPTYLVELLQIHHYDLHKMAHVETDADVLNSIDSSSTDPIPVIYQSGYLT IKGYDREFGIYRLGFPNREVEEGFMKFLLPYYADTNKVEAPFEIQKFVQEVRAGDYDSFF RRLQSFFADTPYEMIRDRELHYQNVLFIVFKLMGFYTQVEYHTAEGRVDLVLKTDKFIYV MEFKLDGTAEEALRQINGKHYTQPFATDGRKLFKIGVNFSAQTRNIEKWIVES >gi|226332015|gb|ACIB01000041.1| GENE 19 19671 - 19982 332 103 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565777|ref|ZP_04843232.1| ## NR: gi|253565777|ref|ZP_04843232.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 103 1 103 103 215 100.0 7e-55 MKELNFDWLLNGSCLLSDVAMAYFTTCVYPRSAGKRMRDEIERYPNLYAELLEAGYKRPN TLLTPRQICIVIRHWGMPDTVYKWLREHPADRVQKLFADRKFD >gi|226332015|gb|ACIB01000041.1| GENE 20 20342 - 20962 595 206 aa, chain + ## HITS:1 COG:no KEGG:BT_1515 NR:ns ## KEGG: BT_1515 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 12 206 12 212 212 147 39.0 2e-34 MNASTETILKKGYILIPKSLIEDFLKAGLGTEGYLEAWIRVLALVNYSDTEICVQGNRMI CRRGETVYTYSHWEKVLGWSRYRTRHFFETLFKSGIMEAVENSAGITLLRVIDYDLWTGQ KKAAATRSNHATEGFDDFWDLYHRITQKDKINIARARKEWNKLTATEKKLALENIEEYYA HQKDIRFCKQAATYLADKAFLNEYEF >gi|226332015|gb|ACIB01000041.1| GENE 21 20986 - 22350 1142 454 aa, chain + ## HITS:1 COG:lin0047 KEGG:ns NR:ns ## COG: lin0047 COG0305 # Protein_GI_number: 16799126 # Func_class: L Replication, recombination and repair # Function: Replicative DNA helicase # Organism: Listeria innocua # 4 438 11 437 450 264 37.0 2e-70 MLQPQAPELEETILGACLIEKEGMALVDELLKPEMFYVTRHQLIYAALQAMFHAGTNIDI LTATEELRKRGKLDDAGGPFYITQLSSRVASSAHLEYHARIVHQKYIRREMIVGFSKLLT LSGDETIDLTDTLVDAHNLLDRLEGACGHNRQLRDMDSLMQATLVEAEGRMRKNKDGVTG IPTGLTELDKMTGGWQNNDLITIAARPAVGKTAFALHLAKVAAAAGHHTVVYSLEMQGER LGDRWLLSAGEVDPHLWRSGKVSQETWLQARQTAGELARLPICVDDNPDMSMDRVRSSAR LLQSKGKCDFIIIDYLQLCDMRTEQNNRNREQEVAQASRKAKLLAKELHIPVILLSQLNR GSENRAYGRLELADLRESGAIEQDSDLVVLLYRPALAHLATDKQSGYPTEGLGIAIIAKH RNGETGNVYFGHNRSMTKIGDYVPPTEWIRRNAK >gi|226332015|gb|ACIB01000041.1| GENE 22 22368 - 22577 239 69 aa, chain - ## HITS:1 COG:no KEGG:BF3047 NR:ns ## KEGG: BF3047 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 6 69 240 303 303 96 84.0 2e-19 GRQEGLAEGRQEGLAEGRQEGRQEGRQEGLAEGRMEEKQANARRMKALNLPVETICQVTG LSAGEIESL Prediction of potential genes in microbial genomes Time: Tue May 17 23:28:55 2011 Seq name: gi|226332014|gb|ACIB01000042.1| Bacteroides sp. 
3_2_5 cont1.42, whole genome shotgun sequence Length of sequence - 45005 bp Number of predicted genes - 45, with homology - 43 Number of transcription units - 22, operones - 11 average op.length - 3.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 215 - 274 5.9 1 1 Tu 1 . + CDS 322 - 801 572 ## BT_1517 hypothetical protein + Term 964 - 997 0.5 + Prom 835 - 894 3.8 2 2 Tu 1 . + CDS 1029 - 1469 439 ## COG3023 Negative regulator of beta-lactamase expression - Term 1486 - 1546 10.1 3 3 Op 1 . - CDS 1557 - 1859 416 ## BT_4733 hypothetical protein 4 3 Op 2 . - CDS 1825 - 2082 104 ## BT_4732 hypothetical protein - Prom 2192 - 2251 2.0 - TRNA 2310 - 2382 84.5 # Gly GCC 0 0 + Prom 2206 - 2265 3.7 5 4 Tu 1 . + CDS 2356 - 2598 142 ## - TRNA 2387 - 2471 51.5 # Leu CAG 0 0 - TRNA 2534 - 2609 85.2 # Gly GCC 0 0 - TRNA 2632 - 2715 47.8 # Leu GAG 0 0 - TRNA 2750 - 2834 51.5 # Leu CAG 0 0 - TRNA 2860 - 2932 84.5 # Gly GCC 0 0 6 5 Tu 1 . - CDS 3063 - 4163 815 ## COG0019 Diaminopimelate decarboxylase - Prom 4218 - 4277 1.8 7 6 Tu 1 . - CDS 4329 - 6689 2344 ## COG0210 Superfamily I DNA and RNA helicases - Prom 6765 - 6824 4.5 + Prom 6520 - 6579 3.4 8 7 Tu 1 . + CDS 6811 - 7428 435 ## PROTEIN SUPPORTED gi|15900660|ref|NP_345264.1| superoxide dismutase, manganese-dependent 9 8 Op 1 . + CDS 7797 - 7994 302 ## BF2528 ThiS protein involved in thiamine biosynthesis 10 8 Op 2 3/0.000 + CDS 7998 - 8612 624 ## COG0352 Thiamine monophosphate synthase + Prom 8670 - 8729 1.6 11 8 Op 3 . + CDS 8754 - 9530 862 ## COG2022 Uncharacterized enzyme of thiazole biosynthesis 12 8 Op 4 . + CDS 9616 - 11310 1657 ## COG0422 Thiamine biosynthesis protein ThiC 13 8 Op 5 . + CDS 11312 - 11866 369 ## BF2532 hypothetical protein 14 8 Op 6 . + CDS 11868 - 12992 971 ## COG1060 Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 15 8 Op 7 . + CDS 13015 - 13716 537 ## COG0476 Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 16 8 Op 8 . 
+ CDS 13778 - 14386 654 ## BF2535 thiamine phosphate pyrophosphorylase + Term 14413 - 14456 7.1 - Term 14397 - 14447 12.8 17 9 Op 1 . - CDS 14464 - 14976 694 ## BF2536 hypothetical protein - Prom 15002 - 15061 6.6 - Term 15103 - 15155 9.1 18 9 Op 2 . - CDS 15178 - 15747 752 ## BF2537 hypothetical protein - Prom 15769 - 15828 9.6 - Term 15899 - 15939 9.6 19 10 Tu 1 . - CDS 15960 - 18680 3010 ## COG0574 Phosphoenolpyruvate synthase/pyruvate phosphate dikinase - Prom 18852 - 18911 6.2 + Prom 18860 - 18919 4.5 20 11 Op 1 . + CDS 18947 - 20365 1341 ## COG2265 SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 21 11 Op 2 . + CDS 20376 - 21293 217 ## PROTEIN SUPPORTED gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit 22 12 Op 1 . - CDS 21456 - 21860 364 ## BF2541 hypothetical protein 23 12 Op 2 . - CDS 21883 - 22326 471 ## BF2571 putative lipoprotein - Prom 22348 - 22407 9.1 + Prom 22865 - 22924 5.4 24 13 Op 1 . + CDS 23153 - 24007 887 ## COG0024 Methionine aminopeptidase 25 13 Op 2 . + CDS 24008 - 25234 1173 ## COG1322 Uncharacterized protein conserved in bacteria 26 13 Op 3 . + CDS 25262 - 26008 512 ## BF2553 hypothetical protein - Term 26100 - 26159 7.7 27 14 Op 1 . - CDS 26208 - 27521 1188 ## COG3004 Na+/H+ antiporter 28 14 Op 2 . - CDS 27566 - 28744 962 ## BF2555 putative Na+/H+ exchange protein - Prom 28810 - 28869 2.7 - Term 28816 - 28849 -0.2 29 15 Tu 1 . - CDS 28890 - 30671 2070 ## COG0481 Membrane GTPase LepA - Prom 30699 - 30758 9.4 - Term 30716 - 30776 16.2 30 16 Op 1 . - CDS 30797 - 30997 345 ## BF2557 hypothetical protein - Prom 31057 - 31116 4.4 - Term 31011 - 31056 1.9 31 16 Op 2 . - CDS 31144 - 31608 479 ## BF2558 hypothetical protein - Prom 31628 - 31687 8.3 + Prom 31564 - 31623 8.9 32 17 Tu 1 . + CDS 31678 - 32088 384 ## COG0432 Uncharacterized conserved protein + Term 32132 - 32165 0.9 33 18 Op 1 . - CDS 32090 - 32851 749 ## COG0708 Exonuclease III 34 18 Op 2 . 
- CDS 32862 - 34115 918 ## COG1914 Mn2+ and Fe2+ transporters of the NRAMP family - Prom 34223 - 34282 4.2 + Prom 34108 - 34167 4.1 35 19 Tu 1 . + CDS 34258 - 34650 460 ## BF2587 putative lipoprotein + Term 34671 - 34720 14.1 - Term 34645 - 34718 18.6 36 20 Op 1 . - CDS 34800 - 35045 166 ## BF2563 hypothetical protein 37 20 Op 2 . - CDS 35045 - 35782 977 ## COG0217 Uncharacterized conserved protein - Prom 35802 - 35861 3.2 - Term 35784 - 35833 -0.9 38 20 Op 3 . - CDS 35878 - 38340 2478 ## COG0072 Phenylalanyl-tRNA synthetase beta subunit - Prom 38408 - 38467 4.9 - Term 38415 - 38459 5.4 39 21 Op 1 1/0.000 - CDS 38490 - 39443 619 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase - Term 39469 - 39508 -0.7 40 21 Op 2 12/0.000 - CDS 39561 - 40457 665 ## COG0451 Nucleoside-diphosphate-sugar epimerases 41 21 Op 3 26/0.000 - CDS 40466 - 41239 342 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 42 21 Op 4 1/0.000 - CDS 41215 - 42297 676 ## COG0438 Glycosyltransferase - Prom 42343 - 42402 9.6 43 21 Op 5 . - CDS 42688 - 43776 411 ## COG3754 Lipopolysaccharide biosynthesis protein 44 21 Op 6 . - CDS 43773 - 44327 88 ## gi|253565824|ref|ZP_04843279.1| predicted protein - Prom 44444 - 44503 6.8 + Prom 43903 - 43962 11.8 45 22 Tu 1 . 
+ CDS 44197 - 44439 108 ## Predicted protein(s) >gi|226332014|gb|ACIB01000042.1| GENE 1 322 - 801 572 159 aa, chain + ## HITS:1 COG:no KEGG:BT_1517 NR:ns ## KEGG: BT_1517 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 144 1 144 176 185 64.0 5e-46 MEVSVERYQRKKYVNQENSPVLYYIRQKAGTAKVVDIDALAEDIQTNCALTTGDVKHTIE ALVEQLRKVLTQGNKVKIEGLGTFHMTLTCLPGETEKECTVKNIKRVNVRFIPDKAMKLV NASRSMTRSPNNVNFALVSAGAPEGGGSGGGDIVDDPTA >gi|226332014|gb|ACIB01000042.1| GENE 2 1029 - 1469 439 146 aa, chain + ## HITS:1 COG:HI1494 KEGG:ns NR:ns ## COG: HI1494 COG3023 # Protein_GI_number: 16273395 # Func_class: V Defense mechanisms # Function: Negative regulator of beta-lactamase expression # Organism: Haemophilus influenzae # 46 142 2 98 116 105 49.0 3e-23 MREINLIVIHCSATREDRCFTEYDLEECHRRRGFDGAGYHFYIRKNGKIVTTRPVERIGA HAKGFNAHSIGICYEGGLDCNGRPKDTRTEWQKHSMRVLVKVLLKDYPGSKVCGHRDLSP DLNGNGEIEPEEWIKACPCFEVKTFF >gi|226332014|gb|ACIB01000042.1| GENE 3 1557 - 1859 416 100 aa, chain - ## HITS:1 COG:no KEGG:BT_4733 NR:ns ## KEGG: BT_4733 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 3 100 4 101 101 132 72.0 4e-30 MHENNNQLVDYDAVLDAKFGKIGTPSRIEAEEKAYAFYTGKIIEDARKKAKVTQAELARR IGSNRSYISRVESGQTDLRTSTLYRIMNALGCQIEFNMSL >gi|226332014|gb|ACIB01000042.1| GENE 4 1825 - 2082 104 85 aa, chain - ## HITS:1 COG:no KEGG:BT_4732 NR:ns ## KEGG: BT_4732 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 80 31 110 110 74 56.0 9e-13 MLLLKTQEKVTKKFVKSIKDGIFELRTEYGGNIYRVFFIFDEGHIVVLFNGFQKKTQKTP TVEIEKAIKIKEEYYARKQQSISGL >gi|226332014|gb|ACIB01000042.1| GENE 5 2356 - 2598 142 80 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLYQLSYFRNVVPRTGLEPACLSTHAPETCASTNSATWALTNQKPAVKKNGEEQITDVLV ERKTRLELATLTLARLCSTN >gi|226332014|gb|ACIB01000042.1| GENE 6 3063 - 4163 815 366 aa, chain - ## HITS:1 COG:sll0873 KEGG:ns NR:ns ## COG: sll0873 COG0019 # Protein_GI_number: 16330194 # 
Func_class: E Amino acid transport and metabolism # Function: Diaminopimelate decarboxylase # Organism: Synechocystis # 1 365 24 386 387 404 50.0 1e-112 MEEELLRKNLSLIKSVADDAGVEIILAFKSFAMWRSFPIFREYIGHSTASSAYEARLALE EFGSKAHTYSPAYTEADFPEIMRCSSHITFNSLSQFSRFYPLTVAEGSGISCGIRVNPEY SEVETELYNPCAPGTRFGITADLLPARLPQGIEGFHCHCHCESSSFELERTLQHLEEKFS PWFSQIKWLNLGGGHLMTRKDYDTRHLTGLLQGLKKRYPHLRIILEPGSAFTWQTGVLTS EVVDIVESRGIRTAILNVSFTCHMPDCLEMPYQPAVRGAVMGEEGPFVYRLGGNSCLSGD YMGAWSFDHELQAGERIVFEDMIHYTMVKTNMFNGIHHPAIALWTADGKAEIFRQFSYED YRDRMS >gi|226332014|gb|ACIB01000042.1| GENE 7 4329 - 6689 2344 786 aa, chain - ## HITS:1 COG:SPy1267 KEGG:ns NR:ns ## COG: SPy1267 COG0210 # Protein_GI_number: 15675225 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases # Organism: Streptococcus pyogenes M1 GAS # 8 780 8 767 772 520 40.0 1e-147 MPDYIEELNESQRAAVLYGDGPSLVIAGAGSGKTRVLTYKIAYLLENGYNPWNILALTFT NKAAREMKERIARQVGEQRARFLWMGTFHSVFSRILRAEASHIGFTSQFTIYDSADSKSL IRSIIKEMGLDEKTYKPGSVQARISNAKNHLVSPSGYAANKEAYEGDLAAKMPAIRDIYS RYWERCRQAGAMDFDDLLVYTYILFRDFPDVLARYREQFRYVLVDEYQDTNYAQHSIVLQ LTKENQRVCVVGDDAQSIYSFRGADIDNILYFTKIYPDTKVFKLEQNYRSTQTIVRAANS LIEKNERQIPKEVFSEKERGEAIGVFQAYSDVEEGDIVTNKIAQLRREHDYEYSDFAILY RTNAQSRVFEEALRKRGMPYKIYGGLSFYQRKEIKDIIAYFRLVVNPNDEEAFKRIINYP ARGIGDTTVGKIITAATDNNVSLWTALCEPITYGLSINKGTHTKLQDFRALIEQFMADVT VKNAYEIGTEIIRQSGIINEVCQDNSPENLSRKENIEELVNGMNDFCAMRQEEGNTNVSL IDFLSEVSLLTDQDSDKEGDGEKVTLMTVHSAKGLEFRNVFVVGMEENLFPSGMAGDSPR AMEEERRLFYVAITRAEEHCFLSFAKTRFRYGKMEFGSPSRFLRDIDTRFLQLPQEAALG RSIDEGAGRFRREMEEGYSRRSSSERFSARPSADRPERERPKVQIIAPTVPRNLKKVSGT TLSPSSASGAGVAGVQPGQTIEHERFGLGEVIRVEGTGDNAKATIHFRNAGDKQLLLRFA RFKVIE >gi|226332014|gb|ACIB01000042.1| GENE 8 6811 - 7428 435 205 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15900660|ref|NP_345264.1| superoxide dismutase, manganese-dependent [Streptococcus pneumoniae TIGR4] # 13 203 1 195 201 172 43 4e-42 MNTLLMSLIFTTMTYEMPKLPYANNALEPVISQQTIDYHYGKHLQTYVNNLNSLVPGTEY 
EGKTVEAIVASAPDGAIFNNAGQVLNHTLYFLQFAPKPAKNEPAGKLGEAIKRDFGSFEN FKKEFNAASVGLFGSGWAWLSVDKDGKLHITKEPNGSNPVRAGLKPLLGFDVWEHAYYLD YQNRRADHVNKLWEIIDWDVVEKRL >gi|226332014|gb|ACIB01000042.1| GENE 9 7797 - 7994 302 65 aa, chain + ## HITS:1 COG:no KEGG:BF2528 NR:ns ## KEGG: BF2528 # Name: not_defined # Def: ThiS protein involved in thiamine biosynthesis # Organism: B.fragilis # Pathway: not_defined # 1 65 38 102 102 127 98.0 1e-28 MKVQVNNKEVETTASTLAQLATQLQLPENGVAIAVNNRMIPRPQWDGFGLQENDNLIVIK AACGG >gi|226332014|gb|ACIB01000042.1| GENE 10 7998 - 8612 624 204 aa, chain + ## HITS:1 COG:sll0635 KEGG:ns NR:ns ## COG: sll0635 COG0352 # Protein_GI_number: 16329575 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine monophosphate synthase # Organism: Synechocystis # 3 202 144 335 343 123 37.0 3e-28 MLSLQFITHQTENYSYLESARMALEGGCKWIQLRMKEASPEEVEAVALQLKPLCKAKEAI LILDDHVELAKKLEVDGVHLGKKDMPIGEARQMLGEAFIIGGTANTFEDVKLHHAAGADY LGIGPFRFTTTKKNLSPVLGLEGYTSILAQMNEADIRIPVVAIGGIVAEDIPPIMETGVN GIALSGAILQAPDPIEETKRILNI >gi|226332014|gb|ACIB01000042.1| GENE 11 8754 - 9530 862 258 aa, chain + ## HITS:1 COG:YPO3742 KEGG:ns NR:ns ## COG: YPO3742 COG2022 # Protein_GI_number: 16123879 # Func_class: H Coenzyme transport and metabolism # Function: Uncharacterized enzyme of thiazole biosynthesis # Organism: Yersinia pestis # 2 258 62 326 333 313 61.0 2e-85 MEKLIIAGREFNSRLFLGTGKFSSNEWMEQSILASGTEMVTVAMKRVDMESTEDDMLKHI VHPHIQLLPNTSGVRNAEEAVFAAQMAREAFGTNWLKLEIHPDPRYLLPDSVETLKATEE LVKLGFVVLPYCQADPVLCKQLEEAGAATVMPLGAPIGTNKGLQTKEFLQIIIEQAGIPV VVDAGIGAPSHAAEAMEMGASACLVNTAIAVAGNPIEMAKAFKQAVEAGRTAYEAGLGMQ AIGFVAEASSPLTAFLNE >gi|226332014|gb|ACIB01000042.1| GENE 12 9616 - 11310 1657 564 aa, chain + ## HITS:1 COG:PA4973 KEGG:ns NR:ns ## COG: PA4973 COG0422 # Protein_GI_number: 15600166 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine biosynthesis protein ThiC # Organism: Pseudomonas aeruginosa # 7 563 23 595 627 774 63.0 0 MEQRIKFPRSEKVYLSGKLFPEIRVGMRKVEQVPSTTFEGEKKVITPNPHVYIYDTSGPF 
SDPDIEIDLKKGLPRLREEWILNRGDVEQLPEISSEYGRMRRDDGSLDHLRFEHIALPYR AKAGRHITQMAYAKQGIVTPEMEYVAIRENMNCEELGIETHITPEFVRQEIAEGRAVLPA NINHPEAEPMIIGRNFLVKINTNIGNSATTSSIDEEVEKAMWSCKWGGDTLMDLSTGENI HETREWIIRNCPVPVGTVPIYQALEKVNGKVEDLTWELYRDTLIEQCEQGVDYFTIHAGI RRHNVHLAEKRLCGIVSRGGSIMSKWCLVHDRESFLYEHFDDICDILAQYDVAVSLGDGL RPGSTHDANDEAQFAELDTMGELVVRAWEKNVQAFIEGPGHVPMHKIRENMERQIEKCHN APFYTLGPLVTDIAPGYDHITSAIGAAQIGWLGTAMLCYVTPKEHLALPDKEDVRVGVIT YKIAAHAADLAKGHPGAQVRDNALSKARYEFRWKDQFDLSLDPERAFSYFHAGRHTDGEY CTMCGPNFCAMRLSRDLKKTQKQK >gi|226332014|gb|ACIB01000042.1| GENE 13 11312 - 11866 369 184 aa, chain + ## HITS:1 COG:no KEGG:BF2532 NR:ns ## KEGG: BF2532 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 173 1 173 184 332 100.0 3e-90 MTVTERTAEYRKALDVPISQLETDRIVKEILDRPENFDNIYRLTSDDKLLVSWRALWICD KLCRQKPEWLIPFREELTGRLMSCGHDGSKRLLLSILYHAPATKVPSVALLNFCLDAMLS PQESIGVQSLAIRMAYRLCEPEPELLYELRTILESTETEMYSTAVKSAVRNTLKKINQKN KKKK >gi|226332014|gb|ACIB01000042.1| GENE 14 11868 - 12992 971 374 aa, chain + ## HITS:1 COG:VC0066 KEGG:ns NR:ns ## COG: VC0066 COG1060 # Protein_GI_number: 15640098 # Func_class: H Coenzyme transport and metabolism; R General function prediction only # Function: Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes # Organism: Vibrio cholerae # 2 370 3 369 370 372 49.0 1e-103 MFSDELEKISWEETTKAIYSKTDADVRRALSKEHCDVNDFMALISPAAAPYLETMARLSR KYTMERFGKTISMFVPLYITNSCTNSCVYCGFNHNNPMKRTILTEEEMVNEYKAIKKLAP FENLLLVTGENPAKAGVDYIERALLLAKPYFANLQIEVMPLKAEEYERLTHAGLNGVICF QETYNKANYNIYHPRGMKSKFEWRVNGFDRMGQAGVHKIGMGVLIGLEEWRTDITMMAYH LRYLQKHYWKTKYSVNFPRMRPSENGGFQPNVVMNDRELAQVTFAMRIFDHDVDISYSTR ESAAFRNHMATLGVTTMSAESKTEPGGYFTYPQALEQFHVSDERKAVEVDAALRSLGRIP VYKDWDTALTLPQC >gi|226332014|gb|ACIB01000042.1| GENE 15 13015 - 13716 537 233 aa, chain + ## HITS:1 COG:BMEI1940 KEGG:ns NR:ns ## COG: BMEI1940 COG0476 # Protein_GI_number: 17988223 # Func_class: H Coenzyme transport and metabolism # Function: Dinucleotide-utilizing enzymes involved 
in molybdopterin and thiamine biosynthesis family 2 # Organism: Brucella melitensis # 1 218 26 248 269 205 48.0 8e-53 MERYSRQTMLPEIGEVGQLKLKAAKVLIVGVGGLGSPIALYLAGAGVGTIGLADDDEVSL SNLQRQILYTEEEVGDLKAICASMRISALNREIKVNACPGRLSKENARDLIGQYDIIVDG CDNFATRYLLSDVCSELGKPYVYGAICGFEGQVSVFNYGEGTQRKTYRDLYPDEEGMLHM PPPPKGVVGVTPAVTGSVEACEVLKIICGFGEVLAGKLWTIDLRTLQSNIFSL >gi|226332014|gb|ACIB01000042.1| GENE 16 13778 - 14386 654 202 aa, chain + ## HITS:1 COG:no KEGG:BF2535 NR:ns ## KEGG: BF2535 # Name: not_defined # Def: thiamine phosphate pyrophosphorylase # Organism: B.fragilis # Pathway: not_defined # 1 202 1 202 202 408 100.0 1e-113 MKLIVVTTPTFFVEEDKIITALFEEGLDILHLRKPETPAMYSERLLTLIPEKYHKRIVTH EHFYLKEEFNLMGIHLNARNPKEPHDYSGHISCSCHSVEEVKNKKHFYDYVFMSPVYDSI SKEGYNSPYTAEELRLAAKDKIIDNKVMALGGITPDNILEVKDFGFGGAVVLGDLWGKFD ACSDQDYLAVIEHFKKLKRMAD >gi|226332014|gb|ACIB01000042.1| GENE 17 14464 - 14976 694 170 aa, chain - ## HITS:1 COG:no KEGG:BF2536 NR:ns ## KEGG: BF2536 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 170 1 170 170 327 100.0 1e-88 MKKIVLFLFVAIATLSVKAQDLYMGGTVGLWRNDDANTTSFKLAPEIGYNLSEQWALGVE LQFNHEYKEHISTNTFAIAPYARFSYYENKIVRLFVDGGFGFATTKVKDGGDAVNGFEIG LKPGIAIKLNQHFSLVAKCGFLGYKDDYMGNGFGFSASSEDLTFGFHYEF >gi|226332014|gb|ACIB01000042.1| GENE 18 15178 - 15747 752 189 aa, chain - ## HITS:1 COG:no KEGG:BF2537 NR:ns ## KEGG: BF2537 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 189 1 189 189 335 100.0 6e-91 MKRNLVFVLFALVSVVGFSQVSWNAKVGMNISNFTGDFDMNAKVGFKIGGGMEYGFNEIW SLQPSLFVSSKGAKKDELSVNAVYLELPVMAAARFKVADNTNIVLSAGPYFACGIAGNSK VDLGKGRLEVDTFGDDGLLKRGDVGLGIGVAAEFGKIIAGLDGQFGFVDVMDNVNGKNLN LSISVGYKF >gi|226332014|gb|ACIB01000042.1| GENE 19 15960 - 18680 3010 906 aa, chain - ## HITS:1 COG:mlr7532 KEGG:ns NR:ns ## COG: mlr7532 COG0574 # Protein_GI_number: 13476256 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoenolpyruvate synthase/pyruvate phosphate dikinase # 
Organism: Mesorhizobium loti # 4 897 3 877 892 1014 57.0 0 MDKKRVYTFGNGLAEGKAGMRNLLGGKGANLAEMNLIGVPVPPGFTITTDVCTEYYEMGQ EKVVSLLKEEVEKAIANIENLMRSKFGDVENPLLVSVRSGARASMPGMMDTILNLGLNDE VVEGLTRKTGNARFAWDSYRRFVQMYGDVVLGMKPVNKEDQDPFEAIIEEVKHAKGVKLD NELEVEDLKELVKKFKAAVKAQTGKDFPTCAYEQLWGAICAVFNSWMNERAILYRKMEGI PDEWGTAVSVQAMVFGNMGDTSATGVCFSRDAATGEDLFNGEYLINAQGEDVVAGIRTPQ QITKIGSQRWAQLAGVSEEERASKYPSMEEAMPEIYKQLDELQTKLENHYKDMQDMEFTV QEGKLWFLQTRNGKRTGAAMVKIAMDLFRQGMIDEKTALMRVEPNKLDELLHPVFDKSAL KQAKVLTRGLPASPGAATGQIVFFADDAAEWHAAGKRVVMVRIETSPEDLAGMAVAEGIL TARGGMTSHAAVVARGMGKCCVSGAGALNIDYKARTVEVDGVLLKEGDFISLNGSTGEVY QGKVETKAAELSGDFADLMKLADKYTRLQVRTNADTPHDAEVARNFGAVGIGLCRTEHMF FEGEKIKAMREMILAENAEGRRKALAKILPYQQADFKGIFKAMAGCPVTVRLLDPPLHEF VPHDLKGQQEMADTMGVSLQYIQQRVESLCEHNPMLGHRGCRLGNTYPEITQMQTRAILG AALELKKEGIETHPEIMVPLTGILYEFQQQESVIRAEADKLFEEVGDRIDFKVGTMIEIP RAALTADRIASSAEFFSFGTNDLTQMTFGYSRDDIASFLPVYLEKKILKVDPFQVLDQNG VGQLVRMATEKGRAIRPDLKCGICGEHGGEPSSVKFCHKVGLNYVSCSPFRVPIARLAAA QAAIEE >gi|226332014|gb|ACIB01000042.1| GENE 20 18947 - 20365 1341 472 aa, chain + ## HITS:1 COG:BH0687 KEGG:ns NR:ns ## COG: BH0687 COG2265 # Protein_GI_number: 15613250 # Func_class: J Translation, ribosomal structure and biogenesis # Function: SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase # Organism: Bacillus halodurans # 13 472 15 457 458 287 36.0 4e-77 MARKKKELPLLEKVTITDVAAEGKAIAKVDDLVVFVPYVVPGDVVDLQVKRKKNKYAEAE AVKFHELSPVRAVPFCQHYGVCGGCKWQVLPYAEQIKYKQKQVEDNLRRIGKIELPEISP ILGSAKTEFYRNKLEFTFSNKRWLTAEEVKQDVKYDQMNAVGFHIPGAFDKVLAIEKCWL QDDISNRIRNTIRDYAYEHNYSFINLRSQEGMLRNMIVRTSSTGELMVILICKITEEHEM DLFKQLLQYVADQFPEITSLLYIINNKCNDTINDLDVHVFRGNDHIFEEMEGLRFKVGPK SFYQTNSEQAYNLYKVARDFAGLTGDELVYDLYTGTGTIANFVSRQARKVIGIEYVPEAI EDAKVNAEINGIENTLFFAGDMKDILTQDFINQYGRPDVIITDPPRAGMHQDVVDVILFA EPKRIVYVSCNPATQARDLQLLDVKYRVKAVQPVDMFPHTHHVENVVLLELK >gi|226332014|gb|ACIB01000042.1| GENE 21 20376 - 21293 217 305 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|161507907|ref|YP_001577871.1| ribosomal 
protein large subunit [Lactobacillus helveticus DPC 4571] # 62 293 50 279 285 88 29 8e-17 MKRIKRTPAEKARAQYTGYLVKEPMELMDFLAAKMPDASRTKLKSLLSKRIVLVDNVITT QFNFPLQPGMKVLISKDKNKKEFRHPLLKIVYEDAYIIVVEKKEGLLSVGTERQKERTAQ HILSEYVGRSGRGNRIYVVHRLDRDTSGLMMFAKDEKTQYTLRDHWHDIVTDRRYVAVVT GEMEKDSDTVVSWLTDRTLYVSSSSYDDGGSKSITHYRTIKRANGYSLVELRLETGRKNQ IRVHMQDLGHPLIGDGRYGIDGGPNPLGRLALHAFKLCFYHPVTDQLMEFETPYPPTFKK LFLKK >gi|226332014|gb|ACIB01000042.1| GENE 22 21456 - 21860 364 134 aa, chain - ## HITS:1 COG:no KEGG:BF2541 NR:ns ## KEGG: BF2541 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 253 100.0 1e-66 MKGYWKILLILMLAVGFASCEDDQGEIEYVITGRAWTGDVGMNAHNGEPLFSTFEFGNDG FGVETQFYASDGLLYDQFRFQWYWEDSYNRNLVLNYGKNGISYMDDVRIYGDRITGAFYL SDDARGFNFELRME >gi|226332014|gb|ACIB01000042.1| GENE 23 21883 - 22326 471 147 aa, chain - ## HITS:1 COG:no KEGG:BF2571 NR:ns ## KEGG: BF2571 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 147 1 144 144 268 100.0 4e-71 MTLMKTLNFMKTLFLLVAIVGLSSCGDKYYSDDYLRNSNAKLCGKTWVNDSEKNDVDEWV RHTLKFDDNGRLAETYAYYHVNESQPYRTETNNLTWSWIDDTMEGIVFDYGVNGVTYFDN VWVREHNMSGKLNGKVVVFVDSKYNRN >gi|226332014|gb|ACIB01000042.1| GENE 24 23153 - 24007 887 284 aa, chain + ## HITS:1 COG:PA3657 KEGG:ns NR:ns ## COG: PA3657 COG0024 # Protein_GI_number: 15598853 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionine aminopeptidase # Organism: Pseudomonas aeruginosa # 39 283 5 248 261 266 52.0 5e-71 MKKFIKGVRFTPSNYPDEIEDKIQKYRKQGYKLPPRKVLRTPEQIEGIRESAKINTALLN HIAENIREGMSTEEIDRLVYDFTTSHGAIPAPLNYEGFPKSVCTSINDVVCHGIPSSTEI LKSGDIINVDVSTIYNGYFSDASRMFMIGEVSPEKQRLVQVTKECMEIGIAAAQPWARLG DVGAAIQEHAEKNGYSVVRDLCGHGVGIKFHEEPDVEHFGRRGTGMLILPGMTFTIEPMI NMGTYEVFVDSADDWTVCTDDGLPSAQWENMILINETGNEILTY >gi|226332014|gb|ACIB01000042.1| GENE 25 24008 - 25234 1173 408 aa, chain + ## HITS:1 COG:RSc1035 KEGG:ns NR:ns ## COG: RSc1035 COG1322 # Protein_GI_number: 17545754 # 
Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Ralstonia solanacearum # 21 408 74 457 457 302 48.0 9e-82 MELTLLLIIAALLVALLALTLTRNNRAQSEEMQRALRQQMQENREELNRSIRELRMEMTQ TLNQGLQQLQDAMHKNMMTTGELQRQKFDAMARQQETLIQSTEKRLDDMRVMVEEKLQKT LNERIGQSFEIVRSQLENVQKGLGEMKSLAQDVGGLKKVLSNVKMRGTFGEVQLGALLEQ MMSPEQYEANVKTKKSGTEFVEFAIKLPGKDDANSTVYLPIDAKFPKDVYEQYYDAFEAG DAALMESCGRQLETTIKKMAKDIHDKYVDPPFTTDFAILFLPFESIYAEVIRRTSLVETL QKDYKIVVTGPTTLGAILNSLQMGFRTLAIQKRTGEVWTVLGAVKTEFGKFGGLLEKVQK NLQSAGDQLEEVMGKRTRAIERKLRQVEELPHEESRRILPIDDGGEDD >gi|226332014|gb|ACIB01000042.1| GENE 26 25262 - 26008 512 248 aa, chain + ## HITS:1 COG:no KEGG:BF2553 NR:ns ## KEGG: BF2553 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 248 1 248 248 493 99.0 1e-138 MEITLKNQFITLWNTYFPQAGLPITFQYSADTQNLPIVEAPKGHRCIIAQLTQVQRGKTL CMQADSVGCRGGKRYTNFTDKMFPGFECFLSHNEQGEGERYKQTPELAAAALAQLPVLPV KGENLIFKRWDKLEAEDMPEVVIFFVSADILSGLFTLACFDNVAPDAVIAPFGAGCASII YHPYREQLDGTNRAVLGSFDPSARKCMKPDLLSFAIPFNKFKSMVSQMEESFLKTATWDV IKKRMGSS >gi|226332014|gb|ACIB01000042.1| GENE 27 26208 - 27521 1188 437 aa, chain - ## HITS:1 COG:jhp1447 KEGG:ns NR:ns ## COG: jhp1447 COG3004 # Protein_GI_number: 15612512 # Func_class: P Inorganic ion transport and metabolism # Function: Na+/H+ antiporter # Organism: Helicobacter pylori J99 # 6 436 13 433 438 274 41.0 2e-73 MTVLRSMKDFSSMNITASILLFVTAIAAAVIANSPAASVYQEFLSHELHFRIGGFNLLSH AGHNLTMIEFINDGLMTIFFLMVGLEIKRELLVGELSSFRKAALPFIAACGGMVVPVVIY SMVCAPGTEGEQGLAIPMATDIAFSLGVLSLLGKRVPLSLKIFLTAFAVVDDIGGILVIA IFYSSHVAYEYLLWAALLYVLLYFIGKKGATNKIFFLVVGVVIWYLFLQSGIHSTISGVI LAFVIPAKPQLNVGTYIERIRRIISTFPEMGANNIVLTNQQIAKLKEVESASDRVISPLQ SLEDNLHGAVNYLVLPLFAFVNAGVMFSGEGEVIGGVTLAVALGLLAGKFLGIYSFTWLA VKSGLTPMPLGMNWKNISGVALLGGIGFTVSLFIANLSFGSAHPVLLNQAKLGVLSGTVM AGILGYLVLHWVLPKRR >gi|226332014|gb|ACIB01000042.1| GENE 28 27566 - 28744 962 392 aa, chain - ## HITS:1 COG:no KEGG:BF2555 NR:ns ## KEGG: BF2555 # Name: not_defined # Def: 
putative Na+/H+ exchange protein # Organism: B.fragilis # Pathway: not_defined # 1 392 1 392 392 677 100.0 0 MRKVLSFSAFLIIGLLLSQYLPLLAGEGYATVKIVSNILLYICLSFIMINVGREFEVDKT RWRSYAGDYFIAMATAAMPWFLIAIYYVFVLLPPEFWNSWEAWKENLLLSRFAAPTSAGI LFTMLAAIGLKSSWIYKKIQVLAIFDDLDTILLMIPLQIMMIGLRWQLIVVVFIVFLLLS LGWKQLGRYNWRQDWKAIMGYSVLVFVATQAVYYFSKQLYGEEGSIHIEVLLPAFVLGMI MKHKEIDTPVEHKVSTGVSFLFMFLVGMSMPHFIGVNFAETHAGTHSVTGSQEMMSWGMI ALHVLIVSLLSNIGKLFPVFFYRDRKFSERLALSIGMFTRGEVGAGVIFIALGYNLGGPA LVISVLTIVLNLILTGIFVLWVKKLALRSYTT >gi|226332014|gb|ACIB01000042.1| GENE 29 28890 - 30671 2070 593 aa, chain - ## HITS:1 COG:BS_lepA KEGG:ns NR:ns ## COG: BS_lepA COG0481 # Protein_GI_number: 16079605 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane GTPase LepA # Organism: Bacillus subtilis # 3 593 13 606 612 728 57.0 0 MDKIRNFCIIAHIDHGKSTLADRLLEFTNTIQVTEGQMLDDMDLEKERGITIKSHAIQME YTYKGEKYILNLIDTPGHVDFSYEVSRSIAACEGALLIVDASQGVQAQTISNLYMAIEHD LEIIPIINKCDMASAMPEEVEDEIVELLGCKRDEIIRASGKTGMGVEEILAAVIERIPHP QGDESAPLQALIFDSVFNSFRGIIAYFKITNGVIRAGDKVKFFNTGKEYVADEIGVLKME MVPRKELRTGDVGYIISGIKTSKEVKVGDTITHVARPCDKAIAGFEEVKPMVFAGVYPIE AEEFEDLRASLEKLQLNDASLTFQPESSLALGFGFRCGFLGLLHMEIVQERLDREFDMNV ITTVPNVSYHIYDKQGNMTEVHNPGGMPDPTMIDHIEEPYIKASIITTTDYIGPIMTLCL GKRGELLKQEYISGNRVELFYNMPLGEIVIDFYDRLKSISKGYASFDYHPDGFRPSKLVK LDILLNGESVDALSTLTHFDNAYDMGRRMCEKLKELIPRQQFEIAIQAAIGAKIIARETI KAVRKDVTAKCYGGDISRKRKLLEKQKKGKKRMKQIGNVEVPQKAFLAVLKLD >gi|226332014|gb|ACIB01000042.1| GENE 30 30797 - 30997 345 66 aa, chain - ## HITS:1 COG:no KEGG:BF2557 NR:ns ## KEGG: BF2557 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 66 1 66 66 107 100.0 2e-22 MLKEKAGEIAGKIWNALNGTEGLTAKQIKKATKLVDKDLFLGLGWLLREDKISTQEIEGE LFVTLN >gi|226332014|gb|ACIB01000042.1| GENE 31 31144 - 31608 479 154 aa, chain - ## HITS:1 COG:no KEGG:BF2558 NR:ns ## KEGG: BF2558 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 154 1 154 154 260 100.0 9e-69 
MTKEERISRATELFKSGYNCSQSVVAAFADMYGFTEEQALRMAASFGGGIGRMRETCGAA CGMFLLAGLEKGAIDGADREGKAANYALVQELAAEFKKRNGSLNCGELLGLKKKAPVSSE PEARTEQYYAKRPCSKMVEEAARIWAEYLEKEKK >gi|226332014|gb|ACIB01000042.1| GENE 32 31678 - 32088 384 136 aa, chain + ## HITS:1 COG:DR2598 KEGG:ns NR:ns ## COG: DR2598 COG0432 # Protein_GI_number: 15807580 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Deinococcus radiodurans # 1 134 7 142 151 154 54.0 5e-38 MATTFDIQLPHYPRGFHLITCDILSLLPDLPENGLLVVFIKHTSAGITINENADPDVRHD FNTFFNKLVPDGAPYFVHTLEGPDDMSAHIKASLIGTSVSIPIRNHRLNLGTWQGIYLCE FRDGGDKRKLSITILE >gi|226332014|gb|ACIB01000042.1| GENE 33 32090 - 32851 749 253 aa, chain - ## HITS:1 COG:MTH212 KEGG:ns NR:ns ## COG: MTH212 COG0708 # Protein_GI_number: 15678240 # Func_class: L Replication, recombination and repair # Function: Exonuclease III # Organism: Methanothermobacter thermautotrophicus # 1 252 4 255 257 249 45.0 4e-66 MKIITYNVNGLRAAVNKGLPEWLAEENPDVLCLQETKLQPEQYPAEAFEALGYKAYLYSA QKKGYSGVAILTKVEPDHIEYGMGIEEYDNEGRFIRADFGDLSVVSVYHPSGTSGDERQA FKMVWLEAFQKYVTELRKSRPNLILCGDYNICHEPIDIHDPVRNATNSGFLPEEREWMTR FLSAGFIDSFRTLYPQKQEYTWWSYRFNSRAKNKGWRIDYCMVSEPVRSLLKEAVILNNA VHSDHCPMALEIG >gi|226332014|gb|ACIB01000042.1| GENE 34 32862 - 34115 918 417 aa, chain - ## HITS:1 COG:CAC0628 KEGG:ns NR:ns ## COG: CAC0628 COG1914 # Protein_GI_number: 15893916 # Func_class: P Inorganic ion transport and metabolism # Function: Mn2+ and Fe2+ transporters of the NRAMP family # Organism: Clostridium acetobutylicum # 3 417 5 415 417 451 62.0 1e-127 MKNIFKDLKSKDHKRYLGGLDVFRYIGPGLLVTVGFIDPGNWASNFAAGSEFGYSLLWVV TLSTIMLIILQHNVAHLGIVTGLCLSEAATQYTPKWVSRPILGTAVLASISTSLAEILGG AIALEMLLDIPIVWGAVLTTVFVSIMLFTNSYKKIERSIIAFVSVIGLSFIYELFLVDID WPMAVEGWVTPAIPKGSMLIIMSVLGAVVMPHNLFLHSEVIQSHEYNKQDTASIKKVLKY ELFDTLFSMIIGWAINSAMILLAAATFFKSGIQVEELQQAKSLLEPLLGSNAAIVFALAL LMAGISSTITSGMAAGSIFAGIFGESYHIKDSHSQVGVILSLGIALLLIFFIGDPFKGLI ISQMVLSIQLPFTVFLQVGLTSSRKVMGDYVNSKWSTFVLYTIAVIVTVLNIMLLFS 
>gi|226332014|gb|ACIB01000042.1| GENE 35 34258 - 34650 460 130 aa, chain + ## HITS:1 COG:no KEGG:BF2587 NR:ns ## KEGG: BF2587 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 130 1 130 130 217 100.0 1e-55 MKKIILGACAVLFTLASCQQAKQKVFELAAEQVNKQCPITVDEMTRMDSTTYSGKDNTFT YFYTLSGQADDPTMSEQLKKSLEETLPETIKNTEEMKVYRESDVTIKYIYLSGKTKEELI QVTVTPDMYK >gi|226332014|gb|ACIB01000042.1| GENE 36 34800 - 35045 166 81 aa, chain - ## HITS:1 COG:no KEGG:BF2563 NR:ns ## KEGG: BF2563 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 81 1 81 81 162 100.0 3e-39 MKYVYKTQGTCSTNIELEVENNIVKEVAFWGGCNGNLQGISRLVTGMPVSDVITKLEGIR CGARSTSCPDQLCRALHEMGF >gi|226332014|gb|ACIB01000042.1| GENE 37 35045 - 35782 977 245 aa, chain - ## HITS:1 COG:Cj1172c KEGG:ns NR:ns ## COG: Cj1172c COG0217 # Protein_GI_number: 15792496 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Campylobacter jejuni # 1 236 1 234 235 181 46.0 1e-45 MGRAFEYRKATKLKRWGNMARTFTRIGKQIAIAVKAGGPDPENNPHLRAVVATAKRENMP KDNVERAIKNAMGKDQKDYKEMNYEGYGPFGIAVFVETATDNTTRTVANVRSVFNKFGGT LGTSGSLDFMFSWKSMFTITKKEGVDMDDLILELIDYGVEEEYDEDEDEITLYGDPKSFA QIQKYLEENGFEVKGAEFTRIPNDEKDLTPEQRATIDKMVERLEEDEDVQNVYTNMKPAD NEGEE >gi|226332014|gb|ACIB01000042.1| GENE 38 35878 - 38340 2478 820 aa, chain - ## HITS:1 COG:FN2122_2 KEGG:ns NR:ns ## COG: FN2122_2 COG0072 # Protein_GI_number: 19705412 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Phenylalanyl-tRNA synthetase beta subunit # Organism: Fusobacterium nucleatum # 154 820 3 652 653 370 33.0 1e-102 MNISYNWLKEYVNFDLTPDEVAAALTSIGLETGGVEEVQTIKGGLEGLVIGEVLTCVEHP NSDHLHITTVNLGNGEPTQIVCGAPNVAAGQKVVVATLGTKLYDGDECFTIKKSKIRGVE SIGMICAEDEIGIGTSHDGIIVLPEDAVPGTLAKDYYNVKSDYVLEVDITPNRADACSHY GVARDLYAYLVQNGKQAALTRPSVDAFAVENHDLDIKVTVENSEACPRYAGVTVKGVTVK ESPEWLQNKLRIIGLRPINNVVDITNYIVHAFGQPLHCFDANKIKGGEVIVKTMPEGTTF 
VTLDGVERKLNERDLMICNKEDAMCIAGVFGGLDSGSTEATTDVFLESAYFHPTWVRKTA RRHGLNTDASFRFERGIDPNITIYCLKLAAMMVKELAGGTISSEIKDVCATPAQDFIVEL TYEKVHSLIGKVIPVETIKSIVTSLEMKIMDETAEGLTLAVPPYRVDVQRDCDVIEDILR IYGYNNVEIPSTLKSSLTTKGDCDKSNKLQNLVAEQLVGCGFNEILNNSLTRAAYYDGLE SYPSKNLVMLLNPLSADLNCMRQTLLFGGLESIAHNANRKNADLKFFEFGNCYHFDAEKK NPEKVLAPYSEDYHLGLWVTGKMVSNSWAHADENTSVYELKAYVENIFKRLGLDLHSLVV GNLSDDIYSTALTVNTKGGKRLATFGVVTKKMLKAFDVDNEVYYADLNWKELMKAIRSVK VSYKEISKFPAVKRDLALLLDKKVQFAEIEKIAYETEKKLLKEVSLFDVYEGKNLEAGKK SYAVSFLLQDESQTLNDKMIDKIMSKLVKNLEDKLGAKLR >gi|226332014|gb|ACIB01000042.1| GENE 39 38490 - 39443 619 317 aa, chain - ## HITS:1 COG:PA3145 KEGG:ns NR:ns ## COG: PA3145 COG0472 # Protein_GI_number: 15598341 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Pseudomonas aeruginosa # 4 267 9 289 339 83 29.0 5e-16 MYYLIILVLLFLAELFYFRIADKCNIIDKPNERSSHTRITLRGGGIIFYLGALAYFLTNQ FEYPWFMLALTLITFISFVDDIRSTSQGLRLVFHFTAMALMFYQWGLFSLPWWTILVALI VCTGIINAYNFMDGINGITGGYSWVVLLALAFINVQIVRFVEEDLIYTMLCAVLVFNFFN FRKKAKCFAGDVGSVSIAFVILFLIGKLIIRTENFSWIVLLVVYGVDSVLTIIHRLMLHE NIGLPHRKHLYQLMANELEIPHVVVSLIYMTSQAIIIVGYLLTPGWGYCYLLGTIVILSM VYILFMKKYFHLHPAMK >gi|226332014|gb|ACIB01000042.1| GENE 40 39561 - 40457 665 298 aa, chain - ## HITS:1 COG:ECs2847 KEGG:ns NR:ns ## COG: ECs2847 COG0451 # Protein_GI_number: 15832101 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Escherichia coli O157:H7 # 2 279 4 294 331 107 29.0 2e-23 MNLLFTGASGFLGSNLYSLLKDKYQIRTVGLTSRDNYTINLVSDVPKLNIKYDVVLHAAG KAHSIPKTGEEKQLFFDVNLQGTKNLCTALENSGIPKAFIFISTVAVYGCDSGENITEEH PLNGTTPYALSKIKAEKYLEGWCAMHNVKLSILRPSLIAGPNPPGNLGAMIRGIRNGKYL SIAGGKARKSVLMVQDIANLLPMLIEKGGIYNVCDSYQPSFRELEMVICKQLNKKRPISI PYWLAKSMAVIGDCLGEKAPINSLKLRKITSSLTFSNEKAVRELKWKPMNVLETFLIE >gi|226332014|gb|ACIB01000042.1| GENE 41 40466 - 41239 342 257 
aa, chain - ## HITS:1 COG:YPO3098 KEGG:ns NR:ns ## COG: YPO3098 COG0463 # Protein_GI_number: 16123272 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Yersinia pestis # 8 257 1 247 247 206 42.0 4e-53 MGENKKKMSFSLVTVTYNSAQTLRDTITSVLSQTHQAIEYIIIDGFSKDNTVAIIKEYEP LFNGRLKWISEKDNGLYDAMNKGFQMATGDVIGIINSDDLISDPNAIEKVIKCFESDTSI DAVYADLYYVAQNDISKIVRYWKSGGQRPFCKGWHPAHPTFYVKKEVYQRYGLFDLDFKF AADFELMLRLIDKEHIKLYYLPEPLVRMRLGGTTSKNLSNIRKGNLECINAFKKNGIKVS MLYPLYRLLPKIRQYFQ >gi|226332014|gb|ACIB01000042.1| GENE 42 41215 - 42297 676 360 aa, chain - ## HITS:1 COG:YPO3104 KEGG:ns NR:ns ## COG: YPO3104 COG0438 # Protein_GI_number: 16123274 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Yersinia pestis # 2 353 1 335 337 242 41.0 1e-63 MIKVVLDNIIFSLQRSGGISVVWSELLKRLQLGNLNFECLEYDVMSNINRRQLNLNSKSV QVRKKRFLSITRYFSPRVVKNEKFIFHSSYYRTCSNPNAINITTVHDFTYEYYYKGLKKR IHLWQKHRAISKSNFIICISENTKRDLLKFLPDINETKIRVIHNGVSDDYFPIKEREELE LPFELETYMLFVGSREKYKNFELAVKAVACNRLKLVIVGAPLLAEELDFLQTELGQSNFM EMGRVSNEELNCLYNGAMALLYPSEYEGFGIPVLEAQRAGCPVIAYNASSIPEIIGDTPL LLDVLSIESITKCFNVLKIKKEREDIIYKGIENAKRFTWDRMYEQVIALYKEVWEKIKKK >gi|226332014|gb|ACIB01000042.1| GENE 43 42688 - 43776 411 362 aa, chain - ## HITS:1 COG:CC0633 KEGG:ns NR:ns ## COG: CC0633 COG3754 # Protein_GI_number: 16124886 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipopolysaccharide biosynthesis protein # Organism: Caulobacter vibrioides # 3 353 222 561 818 246 37.0 4e-65 MNIIAYYLPQYHPTKDNDIWWGKGFTEWTNVAKAKPLFPGHYQPRIPADLGFYDLRMSDV RQKQVDLAIEAGISGFCYYHYWFGNGKQELELPFNEVVASGEPNFPFCLCWANETWSHKF WNEDGSIIGSETLIEQKYLGEDDDILHFNTLLPAFKDKRYMQIDGKLVFVIYKPLAFQDV SKFIELWNDLARKNGLNGFYFIGFTFNADKEGEQILKLGFDAIDSCRLNRNRIRGIGWFF RKIISIIFHTPRRVAYKKVIPTLIGELERNCDNYFPTIIPNWDHTPRSGVNGDLFTKSTP DLFEIHCMDVLSSVTKKNTNRQVCFLKSWNEWGEGNYMEPDLKYGKGYIYALRKVVDTLE SL >gi|226332014|gb|ACIB01000042.1| GENE 44 43773 - 44327 88 184 aa, 
chain - ## HITS:1 COG:no KEGG:no NR:gi|253565824|ref|ZP_04843279.1| ## NR: gi|253565824|ref|ZP_04843279.1| predicted protein [Bacteroides sp. 3_2_5] # 1 184 225 408 408 302 100.0 8e-81 MALAFFCVVFMYICSILCRKKILKSFTFFNVLLYVVVVTVILTMFQEYFDFFEERIDSID GAFDERNGQWIDTFKHSENIIFGTGLGSAGHKVLAFTKYHITDGALFKITAECGIVGLIF FLYIIIKCLVLKIKYFPCLFREYLIIVVCLAQSTGSNTLVFQQILPIFWFSLGSIANYYK TKDL >gi|226332014|gb|ACIB01000042.1| GENE 45 44197 - 44439 108 80 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MVKMTVTTTTYNKTLKKVNDFKIFFRHRIEHIYINTTQKKARAIDDLCAVKIALTNAIIY NNKYNNKLKNEYSFLTSQNL Prediction of potential genes in microbial genomes Time: Tue May 17 23:30:59 2011 Seq name: gi|226332013|gb|ACIB01000043.1| Bacteroides sp. 3_2_5 cont1.43, whole genome shotgun sequence Length of sequence - 251158 bp Number of predicted genes - 220, with homology - 212 Number of transcription units - 119, operones - 48 average op.length - 3.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 928 - 987 5.1 1 1 Tu 1 . + CDS 1058 - 1570 577 ## COG2193 Bacterioferritin (cytochrome b1) + Term 1586 - 1649 11.0 + Prom 1627 - 1686 7.1 2 2 Op 1 . + CDS 1808 - 3586 1702 ## COG0616 Periplasmic serine proteases (ClpP class) 3 2 Op 2 . + CDS 3595 - 4725 436 ## COG1663 Tetraacyldisaccharide-1-P 4'-kinase 4 2 Op 3 . + CDS 4664 - 5473 794 ## COG0005 Purine nucleoside phosphorylase 5 2 Op 4 . + CDS 5501 - 6535 962 ## COG0611 Thiamine monophosphate kinase + Term 6554 - 6599 6.1 + TRNA 6654 - 6726 82.1 # Phe GAA 0 0 6 3 Tu 1 . + CDS 6978 - 8195 684 ## BF2111 putative bacteriophage integrase - Term 8194 - 8243 1.5 7 4 Op 1 . - CDS 8338 - 8619 208 ## BF3280 hypothetical protein 8 4 Op 2 . - CDS 8624 - 8911 197 ## BF3281 hypothetical protein + Prom 9062 - 9121 4.0 9 5 Op 1 . + CDS 9249 - 9557 310 ## gi|253565837|ref|ZP_04843291.1| conserved hypothetical protein 10 5 Op 2 . + CDS 9538 - 10770 598 ## COG5545 Predicted P-loop ATPase and inactivated derivatives + Prom 10834 - 10893 5.3 11 6 Tu 1 . 
+ CDS 11104 - 12084 424 ## BF3283 putative transposon-related/mobilisation protein + Term 12135 - 12181 6.4 12 7 Op 1 . + CDS 12241 - 12624 288 ## BF3284 mobilization protein 13 7 Op 2 . + CDS 12590 - 13513 472 ## BF3285 mobilization protein 14 7 Op 3 . + CDS 13517 - 14245 487 ## BF3286 hypothetical protein - Term 14241 - 14284 5.2 15 8 Tu 1 . - CDS 14287 - 14619 307 ## BF3287 hypothetical protein - Prom 14639 - 14698 1.5 16 9 Tu 1 . - CDS 14750 - 14983 237 ## BF3477 hypothetical protein 17 10 Op 1 . - CDS 15123 - 15362 185 ## BF2068 hypothetical protein 18 10 Op 2 . - CDS 15434 - 15751 84 ## 19 10 Op 3 . - CDS 15748 - 16866 445 ## COG1902 NADH:flavin oxidoreductases, Old Yellow Enzyme family - Prom 16893 - 16952 4.3 + Prom 16867 - 16926 2.4 20 11 Tu 1 . + CDS 16993 - 17337 291 ## COG1733 Predicted transcriptional regulators - Term 17749 - 17797 5.5 21 12 Op 1 . - CDS 17948 - 18241 225 ## BT_2337 hypothetical protein 22 12 Op 2 . - CDS 18228 - 18542 252 ## BF0653 hypothetical protein - Prom 18740 - 18799 5.0 + Prom 18493 - 18552 5.3 23 13 Op 1 . + CDS 18765 - 19379 442 ## COG1309 Transcriptional regulator 24 13 Op 2 . + CDS 19369 - 20325 400 ## Coch_0727 hypothetical protein 25 13 Op 3 . + CDS 20391 - 21113 415 ## Sde_1498 hypothetical protein 26 13 Op 4 . + CDS 21110 - 21634 262 ## gi|253565854|ref|ZP_04843308.1| conserved hypothetical protein 27 13 Op 5 . + CDS 21712 - 21945 203 ## BF0648 hypothetical protein 28 13 Op 6 . + CDS 21965 - 22345 294 ## BF3477 hypothetical protein + Term 22398 - 22432 4.0 + Prom 22349 - 22408 3.1 29 14 Tu 1 . + CDS 22501 - 22833 209 ## BF0646 hypothetical protein + Prom 23023 - 23082 3.0 30 15 Op 1 . + CDS 23118 - 23702 333 ## BF0644 clindamycin resistance transfer factor BtgA 31 15 Op 2 . + CDS 23707 - 24657 498 ## BDI_1256 clindamycin resistance transfer factor BtgB + Prom 24698 - 24757 7.3 32 16 Op 1 . + CDS 24880 - 25083 170 ## BDI_2133 hypothetical protein + Prom 25101 - 25160 1.9 33 16 Op 2 . 
+ CDS 25182 - 28313 731 ## Bxe_A2569 hypothetical protein + Term 28344 - 28386 1.7 - Term 28681 - 28731 8.2 34 17 Op 1 . - CDS 28783 - 29421 472 ## BF3297 hypothetical protein 35 17 Op 2 . - CDS 29399 - 29926 262 ## COG0110 Acetyltransferase (isoleucine patch superfamily) - Prom 29971 - 30030 2.5 - Term 29953 - 29994 -0.6 36 18 Tu 1 . - CDS 30043 - 30552 385 ## COG0778 Nitroreductase - Prom 30576 - 30635 3.5 + Prom 30521 - 30580 8.6 37 19 Tu 1 . + CDS 30665 - 31504 292 ## COG2207 AraC-type DNA-binding domain-containing proteins 38 20 Tu 1 . - CDS 31538 - 35218 2069 ## COG0642 Signal transduction histidine kinase - Prom 35300 - 35359 2.2 39 21 Tu 1 . - CDS 35821 - 37770 1484 ## BF3492 putative alpha-glucosidase - Prom 37964 - 38023 6.8 + Prom 37835 - 37894 6.6 40 22 Tu 1 . + CDS 37990 - 39954 1133 ## BF3493 sialic acid-specific 9-O-acetylesterase + Term 39969 - 40023 8.6 - Term 39957 - 40011 9.4 41 23 Op 1 . - CDS 40068 - 41660 786 ## BF3304 hypothetical protein 42 23 Op 2 . - CDS 41666 - 42646 674 ## COG2152 Predicted glycosylase 43 23 Op 3 . - CDS 42745 - 44610 1578 ## BF3496 hypothetical protein 44 23 Op 4 . - CDS 44631 - 47786 2835 ## BF3307 hypothetical protein - Prom 47808 - 47867 2.4 45 24 Tu 1 . - CDS 47917 - 49116 712 ## COG4833 Predicted glycosyl hydrolase - Prom 49249 - 49308 6.5 + Prom 49610 - 49669 8.5 46 25 Tu 1 . + CDS 49701 - 50243 267 ## PROTEIN SUPPORTED gi|229255399|ref|ZP_04379326.1| acetyltransferase, ribosomal protein N-acetylase - Term 50095 - 50137 -0.4 47 26 Tu 1 . - CDS 50251 - 51537 599 ## COG0534 Na+-driven multidrug efflux pump - Prom 51764 - 51823 9.1 48 27 Tu 1 . + CDS 51850 - 52176 376 ## COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain + Prom 52180 - 52239 4.2 49 28 Tu 1 . + CDS 52285 - 52947 387 ## COG0692 Uracil DNA glycosylase + Prom 53295 - 53354 2.6 50 29 Tu 1 . + CDS 53390 - 53890 322 ## COG3449 DNA gyrase inhibitor + Term 53908 - 53951 1.9 + Prom 54047 - 54106 11.2 51 30 Op 1 . 
+ CDS 54171 - 56060 898 ## BF3314 putative lipoprotein + Term 56086 - 56146 -0.0 + Prom 56110 - 56169 6.1 52 30 Op 2 . + CDS 56250 - 56849 401 ## BF3506 hypothetical protein - Term 56888 - 56923 3.0 53 31 Tu 1 . - CDS 57086 - 57814 277 ## COG2846 Regulator of cell morphogenesis and NO signaling - Prom 57872 - 57931 4.0 54 32 Op 1 . - CDS 58058 - 58522 493 ## COG2030 Acyl dehydratase 55 32 Op 2 . - CDS 58586 - 58981 63 ## COG3011 Uncharacterized protein conserved in bacteria 56 32 Op 3 . - CDS 58991 - 59851 849 ## BF3510 hypothetical protein + Prom 60593 - 60652 4.8 57 33 Tu 1 . + CDS 60788 - 61567 240 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 - Term 61687 - 61727 6.5 58 34 Op 1 . - CDS 61971 - 64937 2308 ## BF3321 putative lipoprotein 59 34 Op 2 . - CDS 65006 - 65569 455 ## BF3513 hypothetical protein 60 34 Op 3 . - CDS 65566 - 65928 228 ## BF3323 hypothetical protein 61 34 Op 4 . - CDS 65947 - 66582 496 ## BF3324 hypothetical protein 62 34 Op 5 . - CDS 66595 - 68334 790 ## BF3325 putative lipoprotein 63 34 Op 6 . - CDS 68401 - 70095 990 ## BF3326 putative lipoprotein 64 34 Op 7 . - CDS 70122 - 71882 1053 ## BF3518 hypothetical protein 65 34 Op 8 . - CDS 71929 - 72828 564 ## BF3519 putative lipoprotein - Prom 72853 - 72912 4.6 - Term 72912 - 72948 0.9 66 35 Tu 1 . - CDS 72967 - 74142 1170 ## BF3520 hypothetical protein - Prom 74319 - 74378 7.1 67 36 Tu 1 . + CDS 74467 - 75345 551 ## BF3521 AraC family transcription regulator 68 37 Tu 1 . - CDS 75508 - 76719 843 ## BF3522 tyrosine type site-specific recombinase - Prom 76868 - 76927 6.6 69 38 Tu 1 . - CDS 77152 - 78018 478 ## BF3523 AraC family transcription regulator - Prom 78154 - 78213 5.7 + Prom 78347 - 78406 3.6 70 39 Op 1 . + CDS 78448 - 79383 506 ## BT_0234 putative transposase + Term 79429 - 79467 2.6 + Prom 79390 - 79449 3.6 71 39 Op 2 . + CDS 79525 - 80643 688 ## COG4974 Site-specific recombinase XerD + Term 80668 - 80717 15.2 + Prom 80645 - 80704 6.1 72 40 Tu 1 . 
+ CDS 80891 - 81748 424 ## BF4230 putative protein involved in transposition + Term 81809 - 81869 1.2 + Prom 81851 - 81910 4.7 73 41 Op 1 . + CDS 81948 - 82322 339 ## BF4232 excisionase 74 41 Op 2 . + CDS 82328 - 83425 562 ## BF4233 hypothetical protein 75 41 Op 3 . + CDS 83429 - 84544 668 ## BF4270 hypothetical protein + Term 84646 - 84698 8.3 + Prom 84562 - 84621 1.7 76 42 Tu 1 . + CDS 84827 - 86230 789 ## BVU_1439 mobilization protein + Term 86249 - 86304 12.5 - Term 86240 - 86289 9.1 77 43 Tu 1 . - CDS 86323 - 87288 537 ## BT_4507 beta-lactamase precursor - Prom 87366 - 87425 7.0 + Prom 87323 - 87382 6.8 78 44 Tu 1 . + CDS 87552 - 87764 92 ## BVU_2907 hypothetical protein + Prom 88023 - 88082 5.6 79 45 Tu 1 . + CDS 88313 - 89479 877 ## COG1488 Nicotinic acid phosphoribosyltransferase 80 46 Tu 1 . - CDS 89503 - 90714 803 ## BF3525 thiol:disulfide interchange protein - Prom 90871 - 90930 4.9 + Prom 90758 - 90817 7.7 81 47 Tu 1 . + CDS 90845 - 91006 60 ## BF3526 hypothetical protein - Term 91359 - 91390 -0.1 82 48 Tu 1 . - CDS 91463 - 93727 1618 ## COG0475 Kef-type K+ transport systems, membrane components - Prom 93825 - 93884 4.0 + Prom 93446 - 93505 2.3 83 49 Tu 1 . + CDS 93666 - 93980 112 ## - Term 93857 - 93891 1.1 84 50 Tu 1 . - CDS 94100 - 94354 95 ## - Prom 94445 - 94504 4.2 85 51 Op 1 . + CDS 94269 - 95783 1621 ## COG0439 Biotin carboxylase 86 51 Op 2 . + CDS 95788 - 96288 470 ## COG1038 Pyruvate carboxylase 87 51 Op 3 . + CDS 96308 - 97843 1593 ## COG4799 Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) + Term 97870 - 97922 10.4 - Term 97858 - 97910 10.4 88 52 Tu 1 . - CDS 97948 - 98619 785 ## BF3532 putative isochorismatase - Prom 98644 - 98703 1.5 + Prom 98818 - 98877 6.4 89 53 Tu 1 . + CDS 99017 - 100630 1360 ## COG1680 Beta-lactamase class C and other penicillin binding proteins + Term 100636 - 100681 1.4 + Prom 101100 - 101159 5.7 90 54 Op 1 . 
+ CDS 101224 - 102774 967 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 91 54 Op 2 . + CDS 102793 - 104274 543 ## BAA_A0205 pXO1-133 + Prom 104355 - 104414 3.2 92 55 Tu 1 . + CDS 104443 - 104622 140 ## + Term 104808 - 104851 -0.6 93 56 Tu 1 . - CDS 104828 - 106321 1955 ## COG0516 IMP dehydrogenase/GMP reductase - Prom 106360 - 106419 5.1 94 57 Tu 1 . - CDS 106552 - 106839 368 ## COG2350 Uncharacterized protein conserved in bacteria - Prom 106929 - 106988 5.2 + Prom 106802 - 106861 4.1 95 58 Tu 1 . + CDS 107017 - 107250 190 ## BF3346 hypothetical protein - Term 107055 - 107089 -0.3 96 59 Tu 1 . - CDS 107275 - 108120 528 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 108261 - 108320 4.1 + Prom 108085 - 108144 4.1 97 60 Op 1 27/0.000 + CDS 108271 - 109386 909 ## COG0845 Membrane-fusion protein 98 60 Op 2 9/0.000 + CDS 109469 - 112675 2670 ## COG0841 Cation/multidrug efflux pump 99 60 Op 3 . + CDS 112686 - 114113 398 ## PROTEIN SUPPORTED gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 + Term 114177 - 114223 0.5 - Term 114188 - 114240 9.1 100 61 Tu 1 . - CDS 114262 - 115542 1232 ## COG2873 O-acetylhomoserine sulfhydrylase - Prom 115660 - 115719 3.6 + Prom 115516 - 115575 4.6 101 62 Tu 1 . + CDS 115712 - 116752 714 ## BF3546 putative N-acetylmuramoyl-L-alanine amidase 102 63 Tu 1 . + CDS 116880 - 117836 833 ## COG2040 Homocysteine/selenocysteine methylase (S-methylmethionine-dependent) + Term 117927 - 117974 3.3 103 64 Tu 1 . - CDS 117884 - 118837 548 ## BF3354 hypothetical protein - Prom 118978 - 119037 5.5 + Prom 118830 - 118889 2.2 104 65 Tu 1 . + CDS 119036 - 119299 258 ## BF3549 hypothetical protein 105 66 Tu 1 . + CDS 119401 - 119676 354 ## BF3550 hypothetical protein + Term 119866 - 119914 -0.3 + Prom 119852 - 119911 2.4 106 67 Tu 1 . + CDS 119971 - 120405 457 ## BF3551 hypothetical protein 107 68 Tu 1 . 
+ CDS 120793 - 120966 68 ## BF3553 hypothetical protein + Prom 120997 - 121056 3.1 108 69 Tu 1 . + CDS 121207 - 123519 1173 ## BF3555 putative ABC-transporter permease protein 109 70 Op 1 . - CDS 123594 - 123800 68 ## BF3556 hypothetical protein 110 70 Op 2 . - CDS 123809 - 124495 639 ## COG1011 Predicted hydrolase (HAD superfamily) - Prom 124653 - 124712 6.3 + Prom 124513 - 124572 5.7 111 71 Op 1 . + CDS 124684 - 125556 589 ## BF3558 hypothetical protein 112 71 Op 2 . + CDS 125639 - 126463 694 ## BF3363 hypothetical protein 113 72 Tu 1 . - CDS 126464 - 127252 639 ## BF3560 hypothetical protein - Prom 127300 - 127359 6.5 - Term 127503 - 127537 -0.9 114 73 Op 1 . - CDS 127599 - 129245 1372 ## COG0845 Membrane-fusion protein 115 73 Op 2 . - CDS 129277 - 130782 1322 ## BF3562 hypothetical protein - Term 130895 - 130924 2.5 116 74 Op 1 . - CDS 130962 - 134720 3263 ## COG3696 Putative silver efflux pump - Prom 134740 - 134799 3.7 - Term 134801 - 134837 -0.9 117 74 Op 2 . - CDS 134912 - 135217 185 ## BF3369 hypothetical protein - Prom 135452 - 135511 8.0 + Prom 135310 - 135369 5.3 118 75 Op 1 . + CDS 135435 - 136412 1065 ## COG0741 Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 119 75 Op 2 . + CDS 136457 - 137356 519 ## COG1226 Kef-type K+ transport systems, predicted NAD-binding component 120 76 Tu 1 . - CDS 137346 - 143219 3733 ## COG1112 Superfamily I DNA and RNA helicases and helicase subunits - Prom 143273 - 143332 7.9 121 77 Tu 1 . - CDS 143338 - 143709 358 ## BF3568 hypothetical protein - Prom 143740 - 143799 5.0 + Prom 143749 - 143808 3.9 122 78 Tu 1 . + CDS 143934 - 144602 512 ## COG1285 Uncharacterized membrane protein 123 79 Op 1 1/0.214 - CDS 144605 - 145420 600 ## COG1573 Uracil-DNA glycosylase 124 79 Op 2 . - CDS 145386 - 146651 752 ## COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif 125 79 Op 3 . 
- CDS 146648 - 147547 656 ## BF3377 hypothetical protein - Prom 147644 - 147703 5.0 + Prom 147503 - 147562 4.0 126 80 Tu 1 . + CDS 147719 - 148804 727 ## COG0229 Conserved domain frequently associated with peptide methionine sulfoxide reductase + Term 148921 - 148953 2.0 - Term 148901 - 148948 6.4 127 81 Tu 1 . - CDS 148968 - 149144 313 ## gi|167754220|ref|ZP_02426347.1| hypothetical protein ALIPUT_02513 - Prom 149223 - 149282 6.0 + Prom 149204 - 149263 8.1 128 82 Op 1 . + CDS 149293 - 149934 516 ## BF3380 hypothetical protein 129 82 Op 2 . + CDS 149970 - 151349 826 ## PROTEIN SUPPORTED gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 + Prom 151364 - 151423 4.6 130 83 Tu 1 . + CDS 151450 - 152088 638 ## BF3577 hypothetical protein + Term 152139 - 152180 5.4 + Prom 152127 - 152186 6.4 131 84 Tu 1 . + CDS 152222 - 154321 1319 ## COG1509 Lysine 2,3-aminomutase 132 85 Op 1 . - CDS 154412 - 155737 1101 ## COG2233 Xanthine/uracil permeases 133 85 Op 2 . - CDS 155744 - 156451 792 ## BF3580 hypothetical protein - Prom 156512 - 156571 4.3 + Prom 156440 - 156499 5.8 134 86 Op 1 . + CDS 156558 - 157148 391 ## COG3663 G:T/U mismatch-specific DNA glycosylase 135 86 Op 2 . + CDS 157151 - 157504 461 ## COG1393 Arsenate reductase and related proteins, glutaredoxin family + Term 157652 - 157693 9.5 + TRNA 157570 - 157642 80.5 # Trp CCA 0 0 - Term 157639 - 157681 2.1 136 87 Op 1 . - CDS 157693 - 158862 1161 ## BF3584 hypothetical protein 137 87 Op 2 . - CDS 158859 - 159434 451 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 138 87 Op 3 5/0.000 - CDS 159435 - 160106 372 ## COG0204 1-acyl-sn-glycerol-3-phosphate acyltransferase 139 87 Op 4 4/0.000 - CDS 160116 - 161072 872 ## COG4589 Predicted CDP-diglyceride synthetase/phosphatidate cytidylyltransferase 140 87 Op 5 . - CDS 161069 - 161725 473 ## COG0558 Phosphatidylglycerophosphate synthase - Prom 161785 - 161844 8.1 + Prom 161693 - 161752 6.7 141 88 Op 1 . 
+ CDS 161916 - 163748 1984 ## COG2304 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain + Prom 163787 - 163846 3.5 142 88 Op 2 . + CDS 163951 - 164265 500 ## BF3394 hypothetical protein + Term 164282 - 164336 14.2 - Term 164418 - 164465 -0.2 143 89 Op 1 . - CDS 164466 - 165338 754 ## COG2240 Pyridoxal/pyridoxine/pyridoxamine kinase 144 89 Op 2 . - CDS 165409 - 165987 661 ## BF3592 hypothetical protein 145 89 Op 3 17/0.000 - CDS 165984 - 167369 1275 ## COG1139 Uncharacterized conserved protein containing a ferredoxin-like domain 146 89 Op 4 . - CDS 167366 - 168106 553 ## COG0247 Fe-S oxidoreductase - Prom 168131 - 168190 2.0 - Term 168157 - 168203 2.0 147 90 Op 1 22/0.000 - CDS 168239 - 168784 205 ## PROTEIN SUPPORTED gi|157803532|ref|YP_001492081.1| 50S ribosomal protein L35 148 90 Op 2 . - CDS 168771 - 169103 254 ## COG0720 6-pyruvoyl-tetrahydropterin synthase - Prom 169155 - 169214 5.3 149 91 Tu 1 . + CDS 169220 - 169582 216 ## COG2832 Uncharacterized protein conserved in bacteria 150 92 Tu 1 . - CDS 169549 - 170115 531 ## COG0775 Nucleoside phosphorylase - Prom 170135 - 170194 2.8 + Prom 170063 - 170122 6.4 151 93 Op 1 . + CDS 170268 - 170513 112 ## BF3402 hypothetical protein 152 93 Op 2 . + CDS 170386 - 171438 636 ## COG3274 Uncharacterized protein conserved in bacteria - Term 171433 - 171499 19.0 153 94 Op 1 . - CDS 171516 - 171998 433 ## BF3600 hypothetical protein - Prom 172035 - 172094 1.7 154 94 Op 2 . - CDS 172108 - 172863 643 ## COG3142 Uncharacterized protein involved in copper resistance - Prom 172893 - 172952 1.9 - Term 172889 - 172935 7.9 155 95 Tu 1 . - CDS 172955 - 174490 1991 ## COG1418 Predicted HD superfamily hydrolase - Term 174507 - 174570 13.5 156 96 Op 1 . - CDS 174599 - 174892 197 ## BF3603 hypothetical protein 157 96 Op 2 . - CDS 174908 - 175201 244 ## BF3604 hypothetical protein - Prom 175441 - 175500 7.6 + Prom 175285 - 175344 7.1 158 97 Op 1 . + CDS 175541 - 177829 2686 ## COG0281 Malic enzyme 159 97 Op 2 . 
+ CDS 177826 - 178002 186 ## BF3606 hypothetical protein 160 97 Op 3 . + CDS 178045 - 178170 57 ## 161 97 Op 4 . + CDS 178167 - 179501 1589 ## COG0334 Glutamate dehydrogenase/leucine dehydrogenase + Term 179534 - 179581 11.5 + Prom 179977 - 180036 9.7 162 98 Op 1 6/0.000 + CDS 180150 - 180689 309 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 163 98 Op 2 . + CDS 180736 - 181725 634 ## COG3712 Fe2+-dicitrate sensor, membrane component + Prom 181827 - 181886 4.7 164 99 Op 1 . + CDS 181906 - 185175 3183 ## BF3412 putative outer membrane protein Omp117 165 99 Op 2 . + CDS 185196 - 186824 1504 ## BF3612 hypothetical protein 166 99 Op 3 . + CDS 186850 - 187896 860 ## BF3613 putative endo-beta-N-acetylglucosaminidase 167 99 Op 4 . + CDS 187920 - 189101 1037 ## BF3415 putative lipoprotein 168 99 Op 5 . + CDS 189115 - 190146 691 ## BF3615 hypothetical protein + Term 190182 - 190226 8.6 169 100 Tu 1 . - CDS 190315 - 190635 316 ## BF3421 hypothetical protein - Prom 190790 - 190849 4.9 + Prom 190624 - 190683 7.5 170 101 Tu 1 . + CDS 190835 - 191290 486 ## BF3422 putative DNA-binding protein + Term 191339 - 191409 19.4 + Prom 191765 - 191824 7.9 171 102 Tu 1 . + CDS 192053 - 192409 286 ## BF3424 putative lipoprotein + Term 192431 - 192479 11.3 172 103 Tu 1 . - CDS 192419 - 192613 63 ## BF3620 hypothetical protein - Prom 192782 - 192841 3.2 - Term 193104 - 193137 -0.7 173 104 Op 1 . - CDS 193309 - 193815 356 ## BF3623 hypothetical protein 174 104 Op 2 . - CDS 193822 - 195042 920 ## BF3624 hypothetical protein - Prom 195220 - 195279 5.4 - Term 195226 - 195289 10.1 175 105 Op 1 . - CDS 195296 - 195448 174 ## 176 105 Op 2 . - CDS 195553 - 196230 545 ## BF3626 hypothetical protein 177 105 Op 3 . - CDS 196233 - 198365 1477 ## BF3627 hypothetical protein 178 105 Op 4 . - CDS 198377 - 199987 1332 ## BF3628 hypothetical protein 179 105 Op 5 . - CDS 200008 - 200733 560 ## BF3629 hypothetical protein 180 105 Op 6 . 
- CDS 200744 - 202534 1329 ## BF3432 hypothetical protein 181 105 Op 7 . - CDS 202547 - 203875 1137 ## BF3631 hypothetical protein 182 105 Op 8 . - CDS 203896 - 204705 664 ## BF3632 hypothetical protein - Prom 204765 - 204824 5.6 - Term 204866 - 204905 3.0 183 106 Tu 1 . - CDS 204929 - 207898 2995 ## BF3633 phosphoenolpyruvate synthase - Prom 207934 - 207993 3.3 + Prom 208227 - 208286 4.4 184 107 Op 1 . + CDS 208313 - 209650 1530 ## COG0334 Glutamate dehydrogenase/leucine dehydrogenase + Term 209708 - 209758 12.2 + Prom 209729 - 209788 6.4 185 107 Op 2 . + CDS 209821 - 210984 1245 ## COG0006 Xaa-Pro aminopeptidase + Term 211076 - 211127 19.2 - Term 211058 - 211122 24.0 186 108 Op 1 . - CDS 211146 - 213908 2632 ## BF3637 hypothetical protein 187 108 Op 2 . - CDS 213912 - 216014 2268 ## COG2319 FOG: WD40 repeat - Prom 216154 - 216213 6.4 - Term 216163 - 216212 15.6 188 109 Tu 1 . - CDS 216279 - 217724 1356 ## COG0617 tRNA nucleotidyltransferase/poly(A) polymerase - Prom 217859 - 217918 2.5 + Prom 217672 - 217731 5.6 189 110 Tu 1 . + CDS 217897 - 218745 917 ## BF3443 putative lipoprotein + Prom 218946 - 219005 3.0 190 111 Op 1 . + CDS 219121 - 222171 2302 ## BF3444 hypothetical protein 191 111 Op 2 . + CDS 222185 - 223633 884 ## BF3445 hypothetical protein 192 111 Op 3 . + CDS 223651 - 224853 904 ## BF3446 putative lipoprotein 193 111 Op 4 . + CDS 224840 - 227704 1591 ## COG0612 Predicted Zn-dependent peptidases 194 111 Op 5 . + CDS 227685 - 228584 770 ## COG3016 Uncharacterized iron-regulated protein 195 112 Tu 1 . - CDS 228812 - 229252 345 ## BF3643 putative DNA-binding protein - Prom 229348 - 229407 3.3 + Prom 229573 - 229632 2.0 196 113 Tu 1 . + CDS 229681 - 229920 67 ## 197 114 Op 1 . + CDS 231016 - 231225 197 ## gi|294808077|ref|ZP_06766850.1| ISSpo3, transposase family protein 198 114 Op 2 . + CDS 231274 - 231597 143 ## Fjoh_0919 hypothetical protein + Term 231837 - 231881 7.1 199 115 Tu 1 . + CDS 232051 - 232272 186 ## + Prom 232293 - 232352 4.2 200 116 Op 1 . 
+ CDS 232380 - 232667 182 ## gi|294808074|ref|ZP_06766847.1| conserved domain protein + Prom 232694 - 232753 1.5 201 116 Op 2 . + CDS 232773 - 233222 376 ## Pjdr2_0818 S-layer domain protein + Term 233327 - 233364 5.3 + Prom 233226 - 233285 5.7 202 117 Op 1 . + CDS 233483 - 234046 227 ## gi|253566040|ref|ZP_04843494.1| predicted protein 203 117 Op 2 . + CDS 234043 - 234378 201 ## gi|253566041|ref|ZP_04843495.1| predicted protein - Term 234591 - 234636 12.2 204 118 Op 1 . - CDS 234662 - 235378 505 ## BF3451 hypothetical protein 205 118 Op 2 13/0.000 - CDS 235375 - 235917 410 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes 206 118 Op 3 1/0.214 - CDS 235914 - 236801 902 ## COG1209 dTDP-glucose pyrophosphorylase 207 118 Op 4 11/0.000 - CDS 236815 - 237594 319 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 208 118 Op 5 . - CDS 237555 - 238283 157 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 209 118 Op 6 . - CDS 238280 - 238690 178 ## gi|253566047|ref|ZP_04843501.1| O-antigen polymerase - Prom 238719 - 238778 2.6 210 119 Op 1 . - CDS 239334 - 240437 502 ## COG0438 Glycosyltransferase 211 119 Op 2 . - CDS 240419 - 241396 264 ## PFL_5100 O antigen biosynthesis abequosyltransferase RfbV, putative 212 119 Op 3 . 
- CDS 241374 - 242744 368 ## Coch_0703 polysaccharide biosynthesis protein 213 119 Op 4 1/0.214 - CDS 242795 - 244531 771 ## COG0028 Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 214 119 Op 5 5/0.000 - CDS 244518 - 245156 374 ## COG0451 Nucleoside-diphosphate-sugar epimerases - Prom 245352 - 245411 4.0 215 119 Op 6 5/0.000 - CDS 245435 - 246514 838 ## COG0451 Nucleoside-diphosphate-sugar epimerases 216 119 Op 7 2/0.000 - CDS 246520 - 247296 630 ## COG1208 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 217 119 Op 8 . - CDS 247334 - 248677 1063 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 218 119 Op 9 . - CDS 248696 - 249793 839 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 219 119 Op 10 . - CDS 249845 - 250330 431 ## BF3465 putative LPS biosynthesis related transcriptional regulatory protein 220 119 Op 11 . 
- CDS 250375 - 251001 580 ## BF3667 putative transcriptional regulator UpxY-like protein - Prom 251085 - 251144 4.8 Predicted protein(s) >gi|226332013|gb|ACIB01000043.1| GENE 1 1058 - 1570 577 170 aa, chain + ## HITS:1 COG:PA4880 KEGG:ns NR:ns ## COG: PA4880 COG2193 # Protein_GI_number: 15600073 # Func_class: P Inorganic ion transport and metabolism # Function: Bacterioferritin (cytochrome b1) # Organism: Pseudomonas aeruginosa # 14 161 32 177 177 77 34.0 1e-14 MAKESVKILQGKLDVKSLIDQLNAALSEEWLAYYQYWVGALVVEGAMRADVQGEFEEHAE EERHHAQLIADRIIELEGVPVLDPKKWFELARCKYDSPTAFDSVSLLNQNVSSERCAILR YQEIANFTNGKDYTTCDIAKHILAEEEEHEQDLQDYLTDIARMKESFLKK >gi|226332013|gb|ACIB01000043.1| GENE 2 1808 - 3586 1702 592 aa, chain + ## HITS:1 COG:all4590 KEGG:ns NR:ns ## COG: all4590 COG0616 # Protein_GI_number: 17232082 # Func_class: O Posttranslational modification, protein turnover, chaperones; U Intracellular trafficking, secretion, and vesicular transport # Function: Periplasmic serine proteases (ClpP class) # Organism: Nostoc sp. 
PCC 7120 # 38 592 42 609 609 343 37.0 6e-94 MKDFFKFTLATVTGIVLSGIVLFIIGVVTLVGIISSSDTETVVKKNSVMMLDLKGTLVER TQESLEGLLGKFTGETADTYGLDDILASIKKAKENDNIKGIYIQASWLNASYASLQAIRK ALDDFKESGKFIVAYSDNYTQGLYYLSSVADKVMLNPKGMIEWRGLASAPIFYKDLLQKL GIEMQVFKVGTYKSAVEPFTATEMSPANREQVTAFIGSIWNQILDGVSASRKIGKDSLNM YADRMLMFYPSDESVKCRLADTLIYQNDVRDYLKTLVKIDEDDRLPILGLEEMVNIKKNV PKDKSGNILAVYYASGEITDYAGSAASDEGIIGSKMIRDLRKLKEDDDVKAVVLRVNSPG GSAFASEQIWHAVKELKAKKPVIVSMGDYAASGGYYISCAADSIIAEPTTLTGSIGIFGM IPNVKGLTEKIGLTYDVVKTNQFSDFGNLMRPVNSDERALLQMMIGQGYDLFVSRCAEGR HMSKDKIEKIAEGRVWTGEMAKKIGLVDELGGIGKALEIAAQKADLKGYTIISYPAKKDI LSTLFDVQPGNYVESQVLKSQLGDYYKDFSLLKNIKERAMIQARVPFELNVK >gi|226332013|gb|ACIB01000043.1| GENE 3 3595 - 4725 436 376 aa, chain + ## HITS:1 COG:aq_1656 KEGG:ns NR:ns ## COG: aq_1656 COG1663 # Protein_GI_number: 15606758 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Tetraacyldisaccharide-1-P 4'-kinase # Organism: Aquifex aeolicus # 12 214 6 205 315 110 36.0 4e-24 MEENSIKIHKWLYPASWLYGAGVALRNKLFDWGKLQSKSFNVPIICIGNIAVGGTGKTPH TEYLIKLLHDEFQVAVLSRGYKRHTKGFVLSTAESDARSIGDEPYQIQSKFSDIQVAVDE DRCHGIERLLTLKEPPVEVILLDDAFQHRYVKAGLNILLTDYHRLFCDDTLMPAGRLRES AQGKNRAQIVIVTKCPPDIKPIDYNIIKKRLNLFPYQQLYFSSFRYGNLRAVFPDCATVQ ERKLSSLQTEEQILLITGIASPDTIIRELEIHTRNIDLLAFSDHHNFSQRDLAQIKERFG KLRKGQRLIVTTEKDATRLICHQELDEGLKPFIYALPIEVEILQNQQDNFNQHIIGYVRE NTRNGSLPERKDAHKS >gi|226332013|gb|ACIB01000043.1| GENE 4 4664 - 5473 794 269 aa, chain + ## HITS:1 COG:BH1532 KEGG:ns NR:ns ## COG: BH1532 COG0005 # Protein_GI_number: 15614095 # Func_class: F Nucleotide transport and metabolism # Function: Purine nucleoside phosphorylase # Organism: Bacillus halodurans # 3 266 6 270 275 285 53.0 8e-77 MLEKIQETAAFLKGKMHTSPETAIILGTGLGSLANEITEKYEIKYEDIPNFPVSTVEGHS GKLIFGKLGNKEIMAMQGRFHYYEGYSMKEVTFPVRVMRELGIKTLFVSNASGGTNPEFE IGDLMIITDHINYFPEHPLRGKNIPYGPRFPDMSEAYDKELIRKADAIAAEKGIKVQHGI YIGTQGPTFETPAEYKLFHILGADAVGMSTVPEVIVANHCGIKVFGISVVTDLGVEGKIV EVSHEEVQKAADAAQPKMTTIMRELINRA >gi|226332013|gb|ACIB01000043.1| GENE 5 5501 - 
6535 962 344 aa, chain + ## HITS:1 COG:MTH1396 KEGG:ns NR:ns ## COG: MTH1396 COG0611 # Protein_GI_number: 15679395 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine monophosphate kinase # Organism: Methanothermobacter thermautotrophicus # 1 340 1 324 327 155 32.0 1e-37 MRTEIATLGEFGLIKHLTEGIKLENESSKYGVGDDAAVLSYPADKQVLVTTDLLMEGVHF DLTYTPLKHLGYKSAVVNFSDIYAMNGTPKQITVSLALSKRFSVEDMDEFYSGLRLACQQ YKVDIVGGDTTSSLTGFAISITCIGEADKDKVVYRNGAKDTDLICVSGDLGAAYMGLQLL EREKAVFKGEQDAQPDFSGKEYLLERQLKPEARKDIIEKLSAANIVPTSMMDISDGLSSE LLHICTQSKAGCRVYEEHIPIDYQTAVMAEEFNMNLTTCAMNGGEDYELLFTVPIADHEK VSEMEGVRLIGHITKPELGCALITRDGQEFELKAQGWNPLQENK >gi|226332013|gb|ACIB01000043.1| GENE 6 6978 - 8195 684 405 aa, chain + ## HITS:1 COG:no KEGG:BF2111 NR:ns ## KEGG: BF2111 # Name: not_defined # Def: putative bacteriophage integrase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 403 13 415 416 436 52.0 1e-120 MERKRFSVLFFIKRSKLLKNGEAPVRVRVTYDRLYVELQLKRSVKVPLWSQEKEKSTGKD RNSVELNHYIDALRVKFYQIYQDLELEGKIISARAIVNRYQGKDETFKTLYNVFKEHNDN CRKLIGTDYADITVRRYDNCLKYLMELVKRDYKVDDMLLREVNGELVRKFDLYLKTEKHC AQNTVIRYMKCFKKVINLAIANEWLTKNPFAGIKFHEVEVNKQFLSQAEINRIWQKEFRI ERLELVRDVFIFCVYTGLAFIDVYNLRPEHISEDSNGNLWIVKPREKTNNLCNIPLLSIS KQILEKYKDNPYCMDKGTLLPVPCNQKMNSYLKEIADLCGIKKNLTTHTARHSFASVIAL ANNVSLPNVAKMLGHSSTRMTQHYAKVLDQTILRDMQAVEKQLSV >gi|226332013|gb|ACIB01000043.1| GENE 7 8338 - 8619 208 93 aa, chain - ## HITS:1 COG:no KEGG:BF3280 NR:ns ## KEGG: BF3280 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 93 1 93 95 145 80.0 3e-34 MDLITKDSDTTLVLFSSLDRVLENVEYVVTNYRPVLNGEHYLTGDEVCRRLCISKRTLQD YRDTGLLGYVQLPGKIIYRESDIMDLLERFYQK >gi|226332013|gb|ACIB01000043.1| GENE 8 8624 - 8911 197 95 aa, chain - ## HITS:1 COG:no KEGG:BF3281 NR:ns ## KEGG: BF3281 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 95 1 95 95 156 82.0 3e-37 MEIVTIEKRTFELWKQRFENFVGRVDALCVPLRRKRDKWLDNCETCRLLNVSARTMQTYR 
DTGKLPYSQINNKIYYKASDVETFLLNQVRDNSKK >gi|226332013|gb|ACIB01000043.1| GENE 9 9249 - 9557 310 102 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565837|ref|ZP_04843291.1| ## NR: gi|253565837|ref|ZP_04843291.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 102 1 102 102 192 100.0 4e-48 MNNPFEEIFKRLENIEKMIAPVMGAQPEERQDGKEPVLVKISVASGITGYSVNYLYHLAS KGLIPCVKRGRTLRFDMEELKKWMQQQYVPASNRLPDEKEKK >gi|226332013|gb|ACIB01000043.1| GENE 10 9538 - 10770 598 410 aa, chain + ## HITS:1 COG:all8519 KEGG:ns NR:ns ## COG: all8519 COG5545 # Protein_GI_number: 17232892 # Func_class: R General function prediction only # Function: Predicted P-loop ATPase and inactivated derivatives # Organism: Nostoc sp. PCC 7120 # 83 399 348 650 836 70 23.0 7e-12 MKKKRSDDTRHIEGRQSKNERIESLLNVLYDFRFNTVKSRTEYRATSSSGLYQPVTKFVL NSFRRRLDATAGIVTSAENIRTILESDFARKVHPIREYFNALPLLNPTEHGHIGKLLNTV QVANPGKWEEYFTKWLIGVVANAMNDTGCQNHTCLVLTGDRQGQFKSWWLDNLCPTPLKN YLFTGKIDPQGKDILTLIAEYLFINIDDQLKELNKQNENALKNLITTPAVKYRRPYDVYI EEYPHLASFMASVNGNEFLTDPTGSRRFLPFEVLHIDKPTAESIHMDNVYSEIMYLYRQG VRYWFNDAEIGELHLNNAEFEVQTIEFEMLTQYFEKPTEEEEPHFFMTTAQILARLRDIC AMQLSEKRLGEALRKAGFKRVQKRINKQNYSVYGYRIKPVPASSTNDDYG >gi|226332013|gb|ACIB01000043.1| GENE 11 11104 - 12084 424 326 aa, chain + ## HITS:1 COG:no KEGG:BF3283 NR:ns ## KEGG: BF3283 # Name: not_defined # Def: putative transposon-related/mobilisation protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 326 1 344 344 431 64.0 1e-119 MTYKEANNISIKDYLNFLGIQPVTEKGSYGMYRSPLREDNTPSFKVDYNANLWCDYGTGE GGTLIDLVMKQHQCNAYGAICRLEQGNTASFSFHGKDLPERDTKRQATSPIEIRRIQPLQ NPALMRYLQERGISPGTAAPYVQEMYYRIGGKPYFALAFKNDSGGYELRNPRFKGSTSKD ITHIRQQGEPRDTCFVFEGFLDFLSFLTIRQQKSPNMSCTDWQDYVILNSTANTDKALYP LAGYGHIHCMLDNDEAGRKAVEAIRQEYKWRVRDASYLYSGHNDLNDYLRSLKVKQSQDL TVADKPQPEQDNRQNPGEKRKRGLRM >gi|226332013|gb|ACIB01000043.1| GENE 12 12241 - 12624 288 127 aa, chain + ## HITS:1 COG:no KEGG:BF3284 NR:ns ## KEGG: BF3284 # Name: bmgB # Def: mobilization protein # Organism: B.fragilis_NCTC9343 # Pathway: 
not_defined # 1 127 1 127 127 191 85.0 6e-48 MIEIRNKPGGRPAKSRIDKQNRVVSTKLTELQFYAIRKRATEAGLRVSEYVRQAVVSAEI TPRLNRQDADTIRKLAGEANNINQLAHRANAGGFALVAVELVKLKNRIVEIINQLSDDWK NKKGKRI >gi|226332013|gb|ACIB01000043.1| GENE 13 12590 - 13513 472 307 aa, chain + ## HITS:1 COG:no KEGG:BF3285 NR:ns ## KEGG: BF3285 # Name: bmgA # Def: mobilization protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 307 1 304 304 476 83.0 1e-133 MIGKIKKGSGFKGCVNYVLGKEQAALLHAEGVLAESNRDIIRSFILQAGMNPDLKKPVGH IALSYSPVDAPKLTDGKMIQLAQEYMREMKITDTQYIIVRHQDREHPHVHIVFNRIDNNG KTISDRNDMYRNEQVCKKLKAKHGLYFAKGKEHVKQHRLREPDKSKYEIYNAVKNEIGKS RNWQQLQQRLAEKGITIRFKYKGQTSEIQGISFSKGEYTFKGSEIDRSFSFSKLDKCFGY AGLNTAGNNRQTVFAPVQEPARTPGKADSPLLAGLGGLFSASSSPADETPDNPNERKKRK KKRHLKL >gi|226332013|gb|ACIB01000043.1| GENE 14 13517 - 14245 487 242 aa, chain + ## HITS:1 COG:no KEGG:BF3286 NR:ns ## KEGG: BF3286 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 242 1 241 241 315 72.0 1e-84 MKDNLILEGLLSMVTELKERQEKQVTPASREETINRLDVIEQRISDMQSKSGIPENTVQE ILNQIGSIRKGQSENQKQDLEDIKGLIVTSHRYFKERLKVLFPVDDTALTGEVTPVSWYG KLTYRVTPYLKPKFFLLSTGFIICITSLILNVRFTERMQRLQDNDIKYRYILMKGKADGS SLDLLETKFSRERDNAFVRSLTDSVKGFEYRSRKQAETLERARLLNEQAEQLKEEADKLG KP >gi|226332013|gb|ACIB01000043.1| GENE 15 14287 - 14619 307 110 aa, chain - ## HITS:1 COG:no KEGG:BF3287 NR:ns ## KEGG: BF3287 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 108 10 117 117 145 74.0 5e-34 MYIDNDDFSVWMQKLYAKLEELCKDVRVLRNADRVLPEDDNLLDNQDLCLLFKVSIKTLQ RYRAIGALPYFTISGKVYYKASDVREFIKERFSVTTLRQFEKEHCTKKKK >gi|226332013|gb|ACIB01000043.1| GENE 16 14750 - 14983 237 77 aa, chain - ## HITS:1 COG:no KEGG:BF3477 NR:ns ## KEGG: BF3477 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 74 55 128 139 139 90.0 2e-32 MEMVSGLFLSKDIVYQNGKPAYLVDLSKAFEWLFNIKIGDCYQKHEDVIKRKPGKLTEFL NGLAELIKKEHDKKGYR >gi|226332013|gb|ACIB01000043.1| GENE 17 
15123 - 15362 185 79 aa, chain - ## HITS:1 COG:no KEGG:BF2068 NR:ns ## KEGG: BF2068 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 79 12 90 90 114 86.0 1e-24 MKQLLQQRFFRLLSEYSQRKVSASEFVEAIEELATHVANFSINEQDYSVLLRYFSFGLHR LKSYRVRFEQEKNALFTFN >gi|226332013|gb|ACIB01000043.1| GENE 18 15434 - 15751 84 105 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKMLLLALVVILGTTLVAVGQNAENKKEEKTECCCEKCYCKECLCGINCTECKTCLKHK ECADCMNENKCNDSIVYKYRRYCEHDNHCNYGHSYKVRNNRRCCH >gi|226332013|gb|ACIB01000043.1| GENE 19 15748 - 16866 445 372 aa, chain - ## HITS:1 COG:MA1426 KEGG:ns NR:ns ## COG: MA1426 COG1902 # Protein_GI_number: 20090286 # Func_class: C Energy production and conversion # Function: NADH:flavin oxidoreductases, Old Yellow Enzyme family # Organism: Methanosarcina acetivorans str.C2A # 8 360 2 355 365 249 37.0 4e-66 MRMKRKALFEQVSFPKLTLKNRFVRSGVWMEMTDEQGHLTPDLINVYKALVDGGVGFIIT EYAYIDINDQPNPRMIGMYDDSFISEWKEVIDYAHAKGVKIACQIASGGSQSGLVASKYR RMIGPSAVLNRVTGITPEEMTKDDISHVIECHKRAALRVKQAGFDAVQIHAAHGYLLSQF LTPYYNRRKDEYGGSIHNRARLIYEVVSEVRSAVGEDFPVMIKVNFDDYMSAGEGLSFPE SLEIFKHLDTLGLDFIEPSGTNLSSGNGITQSFPHIARSIEKQSYFKKQVSEIAQNIQTP LILVGGNRNIAVMDDILNNDNIPLFSLARTLFSEPDLINKWEDNPNYTPKCISCNKCWET IPNSCILNRKRK >gi|226332013|gb|ACIB01000043.1| GENE 20 16993 - 17337 291 114 aa, chain + ## HITS:1 COG:MTH1285 KEGG:ns NR:ns ## COG: MTH1285 COG1733 # Protein_GI_number: 15679289 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Methanothermobacter thermautotrophicus # 12 103 26 117 131 94 46.0 6e-20 MYERKIPINLGCGIEVTMNIIGGKWKPWLINRIREGQHRPIEIQRAIPIADKRVLTQQLN ELENIGIVRKVVYPVIPTKVEYFLTDLGKSLLPIIDLMEEWGRNHRNLLKNEDL >gi|226332013|gb|ACIB01000043.1| GENE 21 17948 - 18241 225 97 aa, chain - ## HITS:1 COG:no KEGG:BT_2337 NR:ns ## KEGG: BT_2337 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 97 1 97 97 151 81.0 6e-36 
MKRTKENYPSFNLFSIVGTWESINLNPTVIIYRNDNDYLLSIIYVSETTKQASPATYEIQ KEGSLYFIAPAPKRFYIDYDPVKDVLNLSSLGDYLRN >gi|226332013|gb|ACIB01000043.1| GENE 22 18228 - 18542 252 104 aa, chain - ## HITS:1 COG:no KEGG:BF0653 NR:ns ## KEGG: BF0653 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 104 1 104 104 167 92.0 8e-41 MEVVTIEKRTFLYICERFTEFAKRIESLCNTHTQEVENWLDSQEMCLLLGFSKRTLQYYR SSGRLAYSQIGSKIYYKSSDVERIIADSETQNQSLKQATPYEKN >gi|226332013|gb|ACIB01000043.1| GENE 23 18765 - 19379 442 204 aa, chain + ## HITS:1 COG:CC2662 KEGG:ns NR:ns ## COG: CC2662 COG1309 # Protein_GI_number: 16126897 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Caulobacter vibrioides # 4 195 10 207 213 85 29.0 5e-17 MKGKSITGERDREATEKRLLDTIGKMIAEDGFEKIGINAIATQSGVSKILIYRYFGSVEG LMAAYIRQHDFWINFPLEYPSREKLPAFVKSMFQGQIEQLRNNPTLKRLYRWELSCNNDM IVKLREQREKVGIDLVKKVSELTGHPQKEIAAIASMLTASITYLVMLEDFCPVYNGIPLN ENSGWEQINEGIEVLINKVFQDEN >gi|226332013|gb|ACIB01000043.1| GENE 24 19369 - 20325 400 318 aa, chain + ## HITS:1 COG:no KEGG:Coch_0727 NR:ns ## KEGG: Coch_0727 # Name: not_defined # Def: hypothetical protein # Organism: C.ochracea # Pathway: not_defined # 4 314 25 336 344 298 48.0 2e-79 MKTDFIQIASYASKAPSGHNTQPWKFHITDSTITVLPNLDVALPVVDRNNRELFISLGCA VENLCIAASYFGYTTHIIECSIEAIILELTKNALTIEDSLFHQIEKRQTNRNIYNGNKIS DGILQQLQSIPKENGIQFYFTEINTPFANTITQYIMKGNEIQMADIAFKNELLSWMRFNK KQVEATHNGLSYLVFGNPPLPRILARPIVSLFLKPNAQNKSDRKKIDSSSHFVVCATQRD TIEEWINLGRTLQRFLLKVTEIGISYAFLNQPCEVAALAFDLREKLPVNKEHPTLIMRIG YAKQIPYSPRKKIETLLV >gi|226332013|gb|ACIB01000043.1| GENE 25 20391 - 21113 415 240 aa, chain + ## HITS:1 COG:no KEGG:Sde_1498 NR:ns ## KEGG: Sde_1498 # Name: not_defined # Def: hypothetical protein # Organism: S.degradans # Pathway: not_defined # 46 239 145 339 340 128 36.0 2e-28 MNKMGLTLIYLWLVSLCSCQQELIEYEKGDVKVCIEQGEQWLHDFPLFLGINKKNPPQIA VWLEDTQGNYLSTVYVTHKIATQSWQASGGNRRKEALPHWCYSRGIKYDDGLYLPTKKEP LTDGISGATPHGSFDIKLSPTTALKKFVVKIEINHSTDFNEAFPKLAKEGESNYSGGKEG 
SGQPAIVYTANVDLLSGEKSFEANLIGHSSPDGSSGEINEDTSGLTTALHIVKRITVTIQ >gi|226332013|gb|ACIB01000043.1| GENE 26 21110 - 21634 262 174 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253565854|ref|ZP_04843308.1| ## NR: gi|253565854|ref|ZP_04843308.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 174 1 174 174 324 100.0 1e-87 MKKIILLIIASCTVAMATAQKKEHFTFATSVGTGIDMSEPAATPFSLQVLGYYAINKRFS VGVGTGLSIYEKVLIPLFADTKFLIIKPRKFTPYIECGVGYSFAPNKNANGGFYLNPSAG VEYSICKSKKLFLALGYESQKFERLKTQKQSLFTAEFAEKLSHNAISIKIGFMF >gi|226332013|gb|ACIB01000043.1| GENE 27 21712 - 21945 203 77 aa, chain + ## HITS:1 COG:no KEGG:BF0648 NR:ns ## KEGG: BF0648 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 77 17 93 93 67 49.0 2e-10 MLMNDLSKTRIIILLTDSSQKVTDTEMQDAYDEFIRCIATIGNSKDNSNIFRMLNLTRIE IAPLKELYQCEQGEKCA >gi|226332013|gb|ACIB01000043.1| GENE 28 21965 - 22345 294 126 aa, chain + ## HITS:1 COG:no KEGG:BF3477 NR:ns ## KEGG: BF3477 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 2 122 11 128 139 76 34.0 3e-13 MLAIINVELELLSLKIKHPESFVSPISPTFESDLYVIPKSKDLGIIGIAEIVIGLSFLGE VVGKDGKPVPLVRLAHGFEVLFNLRFGSIYDKLDAIFMRKPFNLTKTLDALKNAINKEAR KRSNKH >gi|226332013|gb|ACIB01000043.1| GENE 29 22501 - 22833 209 110 aa, chain + ## HITS:1 COG:no KEGG:BF0646 NR:ns ## KEGG: BF0646 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 110 1 110 110 166 86.0 2e-40 MYIDNENFEKWMEKLSKKLTEIGKDLKSLINTDKVLDDNEKILDNQDLAFLLKVSFRTLQ RYRVSGLLPFFTIGKKTYYRAGDIRSFVRERADFQSYKQFEKANQLENQP >gi|226332013|gb|ACIB01000043.1| GENE 30 23118 - 23702 333 194 aa, chain + ## HITS:1 COG:no KEGG:BF0644 NR:ns ## KEGG: BF0644 # Name: not_defined # Def: clindamycin resistance transfer factor BtgA # Organism: B.fragilis # Pathway: not_defined # 1 193 1 193 194 318 94.0 6e-86 MPNSSRKTIFTTISIDKETASLVEKICKRYSLKKSEVVKLAFGYIDKAHINPSEAPESVK SELAKINKRQDDIIRFIRHYEEEQLNPMIRTTNSIALRFDAIGKTLETLILSQLEANQER 
QTAVLKKLSEQFCNHADVINSQSKQINALYQIHQRDYKKLFHLIQLYSELSACGVMDSKR KENMKAEISNLINI >gi|226332013|gb|ACIB01000043.1| GENE 31 23707 - 24657 498 316 aa, chain + ## HITS:1 COG:no KEGG:BDI_1256 NR:ns ## KEGG: BDI_1256 # Name: not_defined # Def: clindamycin resistance transfer factor BtgB # Organism: P.distasonis # Pathway: not_defined # 1 301 1 287 306 340 69.0 3e-92 MHIDFAPPSKGTYNNAGSSRQLANYLEHEDLERMEKDIYTEGFFNLTDDNIYKSMVIKDI DSNIGQLLKTDAKFYATHVSPSEKELRAMGSTEKEQAEAMKRYIREVFIPEYAKNFNKEL SASDIKFYGKIHFNRSRSDNELNMHCHLIVSRKDQSNKKKLSPLTNHKNTKNGVIKGGFD RVNLFQQAEQGFDKLFNYNRQLSEAFEYYNIMKNSTITNKLRLQKREQNTPKQNFTSEKK DSMQICKHTDKQADMIENIFTNQQENNLENKYDSNSTNFGLSSLFSTLLSTTDINSTDKQ QELITKKKKKQRPRLI >gi|226332013|gb|ACIB01000043.1| GENE 32 24880 - 25083 170 67 aa, chain + ## HITS:1 COG:no KEGG:BDI_2133 NR:ns ## KEGG: BDI_2133 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 2 65 6 69 80 78 67.0 7e-14 MELNRIKVVLVEKKKTGKWLAETLGKNEATVSRWCANVSQPSIETLFAIAKVLNVDMKDL LVSAKKT >gi|226332013|gb|ACIB01000043.1| GENE 33 25182 - 28313 731 1043 aa, chain + ## HITS:1 COG:no KEGG:Bxe_A2569 NR:ns ## KEGG: Bxe_A2569 # Name: not_defined # Def: hypothetical protein # Organism: B.xenovorans # Pathway: not_defined # 2 1043 13 1063 1063 619 32.0 1e-175 MINKLQFDEFQRAIGISKNDTFSLLLGAGCSINSDIPSAEDCIWEWKRDIYKTNNSSSFG WIDNYKNPKTQEIIQNWLNNQGIYPERGCKEEYSFYAYKCYPIDEHRRQYFQKICSGKKP SIGYKLIPLLARKGMLDSVWTTNLDDLVVTACIGNGIQAIEITLDSVQRLNNRPQNRHEL PVIKLHGDFKYGDLKNTEEELLNQDKTFRERLIEYVQDKHLIVLGYSGRDTSLMDTLKEA YSKQGGGILYWCGYGDNINSDIAELIQIATKNGRRAFYIPTDGFDSTLRKITQIVVEDDN NLKKELLELHQTSNINDTITPFDLKCERVNKLLKSNIFRISFPDEVFVFDVSISDKPWKF VDERTLERNDISAVPYNKQIWAFGRLDIIKDIFKDVMNSDIQRKPLANIKIYNTAVSRLL LTTICKILALQSNLKTDYKGKIWTENNSKSISGHIVYNAVLLSFDRISGEYYLSLNPDFV LANPNIEKSSIQTIGLFFFQKLWNQQFNEYINYWREILLKKNNEYEFPINSGTGFKFKIK NIPVFTNICDLNNPRINNHNVSSHHLLLQGVQFKEIPLLFSTNNGNRTATDTHPMRGLLI NKPYETGVNDFLEKSITLGIISPSQDALRFYQFLENQNSKIKKHNDKDNYIIDYEGFFAI 
YGVSLSFPTPNDNEWERINEPLIMGIKETAQQIKQLICDSIVKISSTTRRKIIVIYIPQR WEPYTSYQLDGESFDLHDYVKAFCAEKGIMSQLIREKTINDTIQKCQIHWWLSLSFFVKS FRTPWILANTNNTTAFAGLGYSVENKKDINGHIVLGCSHIYSSNGEGLKYKLAKISNDKI QWRHKKPHLCYDDAYEFGKSIVNLFYESMNELPKRVVIHKRTFYTDEEKQGIIDSISDNK KIESIDLIEINFENNIKYASSKIHDGKVDIDGFSVSRGTCIQLSSKEALLWAHGVIPSVI NPNWNFYPGGRYIPKPLRIIKHYGTGSLEQIANEILGLTKMNWNSLNMYSQLPATISSSN DIARIGKLIGANSMHEYDYRYFI >gi|226332013|gb|ACIB01000043.1| GENE 34 28783 - 29421 472 212 aa, chain - ## HITS:1 COG:no KEGG:BF3297 NR:ns ## KEGG: BF3297 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 212 1 212 212 423 97.0 1e-117 MIRLVAFDLDGTIGDTIPLCLAAFREATTPYVDHELSDDEIARTFGLNEEGMINQVVSGN SREKALNDFYVIYEEMHAILCPKPFEGMKTLIGQLHERNVIVVLITGKGIRSCDITLKQF GMDGCFDRIETGSSEKNRKSEAMKAISLYYGFKSNEMIYVGDAVSDIEACYSVKIQCLSA AWAASTDCEQLEKYNSGYVFSSVRLLRDFLLC >gi|226332013|gb|ACIB01000043.1| GENE 35 29399 - 29926 262 175 aa, chain - ## HITS:1 COG:all1011 KEGG:ns NR:ns ## COG: all1011 COG0110 # Protein_GI_number: 17228506 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Nostoc sp. 
PCC 7120 # 3 163 42 191 192 162 47.0 4e-40 MRYNSLSYEQKEEKYAILKEMFGSIGTEVSVGHSFLCDYGCNIHIGDNVTVNIGCVFVDC NKITVGNNVLIAPNVQIYTATHPIDLNERLTPVEAPEGVRYVRHTFALPVTVEDGCWIGG GVIILPGVTIGKGSVIGAGSVVTKNVPANSLAVGNPCRVIRQINKSEKYDPAGSF >gi|226332013|gb|ACIB01000043.1| GENE 36 30043 - 30552 385 169 aa, chain - ## HITS:1 COG:CAC1484 KEGG:ns NR:ns ## COG: CAC1484 COG0778 # Protein_GI_number: 15894763 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Clostridium acetobutylicum # 1 168 1 169 172 135 40.0 3e-32 MNFLELTKKRFSVRNYKSDRVEQDKIDYIIECARLAPSAVNYQPWHFMVVASEEQKQNLR QCYNREWFARAPVYIVVCADKSIAWVRKSDNKNHADIDAAIATEHICLAAAEIGLGSCWV CNFDPELFKANFGLSSERYPVAIVSLGYIQEQSDHFTTRKDKDEIVTFL >gi|226332013|gb|ACIB01000043.1| GENE 37 30665 - 31504 292 279 aa, chain + ## HITS:1 COG:CAC1451 KEGG:ns NR:ns ## COG: CAC1451 COG2207 # Protein_GI_number: 15894730 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Clostridium acetobutylicum # 22 275 33 285 295 68 24.0 2e-11 MKRENLHQPFEINFSEFDESMLKEHDHTFFELVYILSGTGIQWINNNKFSYHDGHLFMIT PGDSHSFEIHSTTKFINIKFNDIYIHSAVFGTENIKRLEFILQHANHQPGCILRNRTDKL LVKPMIEAIIREYVNRNLYSKEIITQLINTIIIVVARNIAMFLPEQVNECSEEKSLGILQ YIQTYIYQSEKIKTKAISQHFGISENYLGRYFKKHTNETMQQYILNYKLKLVENRLLHSE MRISEIVAELGFTDESHLNKLFKKYRGCSPTNFRKNNAV >gi|226332013|gb|ACIB01000043.1| GENE 38 31538 - 35218 2069 1226 aa, chain - ## HITS:1 COG:all4963_3 KEGG:ns NR:ns ## COG: all4963_3 COG0642 # Protein_GI_number: 17232455 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. 
PCC 7120 # 691 936 4 250 294 122 32.0 4e-27 MANNKDDMLFILKDDGYLYYYQREAQSFNRLEIPNLEFSHVLSMTVDSNDILWIFTSNEE TQSYRIENTKQGLTLTRKNLFTHSCKLYHAFAEKDMAYFIDETYALYEYDFNNRQAYYIA DLRGQIEVHGEVSSIIKQKNDYYIGFKNSGLIVLKYMSSQKMKYQIQKTEIHSGIFCLMK DKFQDIVWIGTDGQGVYMYYNDTFSITNTLLDTPVYQISNPVRALYYDQEQTLWIGTKGS GILRIKKYTPDNSMPMSSDRITPYNSALADNSVYCFAPGCKDKLWIGTENGINYYSYLSR QLKELPIIANGEKVKYVHSIKQPNDTTLWVSTVGEGIVKVIFDASTATPTVKSASRTVID NGRMASNYFFTSYQENDSILWFGNRGCGAYRMNVETGEMIPHRFDSIVSSQTANDIFAIY KNAKGYWLGTSSGLLHLEQDDSLYRKADLFLNNTVHGVLEDHQGDLWLSTNQGLIRFNPE TRTGQTYNSGNGLEITEFSDGAFYKDVVSETLFFGGTNGFISIQTNDCITEEYMPQITLK GLSIFGKEHNIHKFLYEKEGKTILQLDYSQNFFQLNFMAIDYINGNNYSFYYKLEEMSNQ WIENGTSTSAIFSNLAPGEYSLLVKYKNNINGQESQTQSVIIRITPPWYLSPLAYMVYFI LFALLCTAGIYRLIYIYRRKQHRMLDKMNREKKEEIYESKLRFFTNITHEFCTPLTLIYG PCEKILANPEIDAYTQKYAKMIQQNTEKLNNLILELLEFRRLETGNKVLSIQQLSVSEKV RSIAESFEELAENKEMNYQLHISPDIEWNTDISCFNKIVSNLISNAFKYTPDKGIITIEL KVENQSLIFCISNSGKGIAKENLSKIFDRYKILDSVEMNGKNSRTGLGLAICKSMITQLN GEISVNSVLYEMTTFTVTLPNLPITERETTQTTYETGLLHTATEEPIVLEKTAIEFDTGK RTIMVIDDDNSMLWFVSELFADKYNVCSFDNAKDALSSLEQKQPDLIISDVMMPGIDGLS FAQKIKQHKLWSHIPLILLSALHHEDDQVKGIESGAEAYVTKPFNVKYLEKIVYRLIKRE ADLKEYYSSIFSSFTMEHGNCIHKEDQEFLDKMLEIIEKNIANPDLQVELLSSEMGYSTR QFYRKLKPITEQSPADIIKEYRLTMAERLLIAKNLTIEEIMDQTGFNNRGTFYKLFSKRY GMPPRQYREQQKENVKKEKISGISDE >gi|226332013|gb|ACIB01000043.1| GENE 39 35821 - 37770 1484 649 aa, chain - ## HITS:1 COG:no KEGG:BF3492 NR:ns ## KEGG: BF3492 # Name: not_defined # Def: putative alpha-glucosidase # Organism: B.fragilis # Pathway: not_defined # 1 649 1 649 649 1325 100.0 0 MKKQLLLLLIFSISLYIQGQNIHKLASPDGNIQISVNLSDKIYYDVICRNETLLKQCHLA MEIGDQELGTNPKMTKVSHKNIDESLKPVIPLKFSSVSNRYNQLLLDFKGGYSVEFRAFN DGIAYRFITNKKGMINVKNETLQVNFPDNYLLHMQQSGSFKTAYEEEYTHLYSKEWKSSA SMALLPILIDTQKGSKILISETSLTDYPAVFLKSNGSNGMVSVFPRVPLEFGEDGDRSVK ILKEADYIAQTSGKRNFPWRYFVISTEDSQLIENTMSYRLAEKNILEDTSWIKPGLASWE WWNGATPYGPDVNFVAGCNLDTYKYFIDFAANYGIPYIIMDEGWAMSTRDPYTPNPAVDV HELIRYGKEKNVGIVLWLTWLTVENNFGLFETFEKWGVKGVKIDFMDRSDQWMVNYYERV 
AREAAKHHLFVDFHGAFKPAGLEYKYPNVLSYEGVRGMEQMGGCRPDNSIYLPFMRNAVG AMDYTPGAMLSMQPEIYCSERPNSASIGTRAYQMALFVIFESGLQMMADNPTLYYRNDEC TRYITQVPQTWDETIALKAKAGEYVIVAKRKGDKWYIGGMTNNRQQERTFELDFDFLKEG QSYRMTSFEDGVNANRQAMDYRKKEYTLKKGDKIIVRLARNGGFASVIE >gi|226332013|gb|ACIB01000043.1| GENE 40 37990 - 39954 1133 654 aa, chain + ## HITS:1 COG:no KEGG:BF3493 NR:ns ## KEGG: BF3493 # Name: not_defined # Def: sialic acid-specific 9-O-acetylesterase # Organism: B.fragilis # Pathway: not_defined # 1 654 1 654 654 1334 99.0 0 MKNKCMLVVILLILSGTAFAKIVLPPIFSDNMVLQQQTNAPIWGEAQPMKTVKVTTSWDG KTYAVQADKAGKWKVTVHTPVAGGPYEIALTDGKKVSLKNVMIGEVWICSGQSNMEMPLG GWGKITNYQKEITEAGHSNIRLLQIEQINSTQPETNIKVRNDSWQVCSPITIPEFSATAY FFGREISEKQNVPVGLIHTSWGGTNVESWISGEVLKEMPEFVKTVESIQKMPGDKKILKA EYLKELTAWNNRVDEGFAEGKPVRAAASLDDKDWESMNFPGEVGPQLAGFDGVMWVRKEI EIPASWAGKDVQLSLGAIDDNDITYWNGIEIGRTDGPTLQRKYIIPEKMVKAGKAILAIR VLDTGGNCGIWGDLYLRSTNDEQISLSGDWKYQVAADTHKVGALPIDRSVDPNLPTSLYN AMIHPLISYGIRGAIWYQGENNSSRAYQYRELFPLVIENWRRDWKQDFPFYFVQLANFMH EVSQPAESEWAELREAQMRALAVGNTGMAVIIDRGDANDIHPKDKQTVGHRLALIARAKT YGEKLPYSGPIYRSHQIVGNKIILSFDHTDGGLKSSDGKELKGFAIAGRNHEFHWAKAEI DGDKIIVSAPEAVPYPVAVRYAWANNPVCNLYNGAGLPASPFRTDDWRGITQKD >gi|226332013|gb|ACIB01000043.1| GENE 41 40068 - 41660 786 530 aa, chain - ## HITS:1 COG:no KEGG:BF3304 NR:ns ## KEGG: BF3304 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 530 1 530 530 1081 99.0 0 MQIKKLFIALGIVLPLHMQGQNFLIKDAPEVIESYVNQFNREDNELYKQDIPNCGASDFL RKNIPFFECPDKELEKTYYFRWWTYRKHIKKTPDGFVITEFLPDVPWAGKYNTISCAANH HFYEGRWLRNAEILSDYASFWFSGSGNPRLYSFGAADAIYNYYLIHNDKMLLADLYPKLK DNFAKWEEEKRDSTGMFWQVDDRDGMEMSVSGHLSEGGRGYRPTINSYMYGEAVALAKIA SIVDRDMEARTYQKKADKLKGIINRRLWDKQADFYKVIPLNGKMEFSYARELLGYIPWFY NIPPDNYSIAWKQLFDSKGFEAAYGPTTVEQRCPDFKISYEGHECQWNGPSWPYLTSMTL AAMANYFNSYDSPIITKKDYLSLLNIYSNSHRILSVNNDTICWIDENINPYTGDWISRTR LKSWKNGTWDDSKGGVERGKDYNHSSFCNLIISGLMGVRPQEDGSIIINPLVPDGCWDYF CLDNVYCQGKTITIIFDKKGKKYGRGKGFIVYVDDKCLSHTTRVQKVVIR >gi|226332013|gb|ACIB01000043.1| 
GENE 42 41666 - 42646 674 326 aa, chain - ## HITS:1 COG:PH1107 KEGG:ns NR:ns ## COG: PH1107 COG2152 # Protein_GI_number: 14590938 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted glycosylase # Organism: Pyrococcus horikoshii # 93 320 6 226 299 70 30.0 4e-12 MKKNVIQTLISLTVIIGICSCTTSPKKTIQKEEATGKWVKYENNPVLGGGDLGTVFDICV LKDSDSYKMYSSWHPQKSIALSTSKDGKNWSAPQIVLPPVEGSSWEADMNRPVVVYKDGL YHMWYTGQNDGKSWIGYAISKDGYNFERQSKEPVLSAEQPWEKVAVMCPHVIWDKHENIF KMWYSGGEQYEPDAIGYATSKDGLHWTKWDKNPIFKADPAQSWEQHKVTACQVIERENDY LMFYIGFYDINFAQIGMARSKDGINDWERYSENPIISPTEGGWDASATYKPFAIQEKDCW MLWYNGRNEHLEQIGLAIYDNHDLNF >gi|226332013|gb|ACIB01000043.1| GENE 43 42745 - 44610 1578 621 aa, chain - ## HITS:1 COG:no KEGG:BF3496 NR:ns ## KEGG: BF3496 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 621 1 621 621 1302 100.0 0 MKKLIYTLCLFTGMAISSCEDWLARDPLDQIADVNLTYTADECKLYVNQFYTSFWGSPNN YIYHIDRGSDNLLSDNYQDNKDLIEGLHQVPSSGEGWGTTEWGKIRSVNFLLDNCDKSPE PEKARKYIGEAYFFRAMLYYEQFLRKFGGAPWIDKTLDLESGELYGPRLKRHELADKILN DMDDAIDRLPTYSQQETGRVSKEAAMLYKARIALFEATWEKYHAGTPFAGEGNVQAYMEE AARMAKLVIDSQLFDLDNMGVEDGYHTLFNRWDYSSSKEIMLWKKFDRSLGFWHNDNRNP GRNGAGVGLTRALVDSYLCISEDGTQALPISLAENYAGDTNLLNVVANRDPRLAQTMFTP GRPRTINGKDTTVVFIKPNITLSGVEKCSTGYELAKGADPDANEQETISGSIKGSIIFRY AEALLIYAEARAELGNITQNDLDITINKLRDRVGMPHLTLSVGYTDPKGDFTAARGYEGV PVSNLLQEIRRERRIELACEGYRHDDLKRWRAHHLWNHDRIQGANAAQFENLDWLVKYFQ NDFHIPAAINKADFMEKVGHWSPERNQDNYWVDSEGYFEPYQRHIPDGHFHFDPTKAYLQ PIPTEQLVLNPDLKQNPGWEK >gi|226332013|gb|ACIB01000043.1| GENE 44 44631 - 47786 2835 1051 aa, chain - ## HITS:1 COG:no KEGG:BF3307 NR:ns ## KEGG: BF3307 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1051 1 1051 1051 2064 100.0 0 MKIKKHILIPLMVLLLFGSMDAMAAIEQAIQVKGKVVDNIGEPVIGANVVVKGTTTGVIT SIDGSFAIDAVKGSTIVVSFIGYLSQEVKVTGNFLNITLENNTELLDEVVVVGYGVQKKA NLTGAIATVGAKELEDRPVTNAAAALQGKVANLNISNGDGGPGKKASFNIRGYAGLDATY 
SPLVIIDGVTGYFDDLNPNDIETLTVLKDAAASAIYGAQAAYGVILVTTKSGKKNEKTVI NYNNNFSFNSPTVLPKTAGSLEFSRLFREADINGGGSGIIDLETMERIEKYYYDPTSIPN NVPQRDNPDRWADWGDGRSNANEDWQKAMFKNNQLNQQHNISIRGGSDKTTYVMSLGYLK DEGKLRYYDDHYQRYNALAKITTDVTKWLTVGLNVRYSHEKSVSPAYGMNPGGSVNDMIG WTQVVWPTIPVRDPNGHFSPAGRMVFIADANPQTSYTDNFWGTANVVIKPLKGWTINADF TYNKWMNKRSYSKGLIYSYSVSNEPYLEGGFGTEDTRVWQESNNDDFTSMNAYTTYEKEF KGHNFKIMAGMQSEYKKNFGLYANKMGMVLPGQPSISTSTGKIEAWDSLDHYATLGFFGR FNYDYKSRYLFEFDLRRDGSSRYAKGHQWGTFPAFSAGWNVAKEAFFEPYTSILSELRLR GSWGELGNMRGKNYQYISTVPYNATTDYIMGDKRISAFGAPNMIAYNTWEKNRTLDFGVD IAALNNRLTMSFDWYRRDIIGLITKGVTLPAVLGVNSPDTNNADIRNEGWELTLGWRDQF SLASKSFNYSVSFNLSDYQGTVLKYSNPKGLIYDYYVGKKMGTIWGYTTDHIMTDPAEAK AINDSGVQNKFGGDWVVGDIKYVNLDDDPNINDGNQTLEDHGDLSIIGNDTPRYNFGFGF NADWNGIDFSMFLQGTMKRDLWLEGPVAFGLGGGQWGSNVWKNTLDCWREDGSNPDPWLP RLYLWSTSKNLQKQTRYMDTGAYCRLKNIQLGYSLPGAIINKVGLEKVRIYFSGDNLLTF SGINENFDPEAPWGGAYPISKSISFGVNVTF >gi|226332013|gb|ACIB01000043.1| GENE 45 47917 - 49116 712 399 aa, chain - ## HITS:1 COG:lin0763 KEGG:ns NR:ns ## COG: lin0763 COG4833 # Protein_GI_number: 16799837 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted glycosyl hydrolase # Organism: Listeria innocua # 147 345 90 287 341 76 30.0 1e-13 MNSKKVLFIIFICALTVTGCDTQEQTTIDSGLTLIRATQTLDSLYANYSVSGTCLLRENY PSNVGNYTATYLASEEQKAIPNQYSYLWPYSGTFSAVNALCAATGDESYKTLLDNKVLRG LEKYLDISRSPVGYASYINTASQSDRFYDDNIWLGIDFTDAYLNTKEEKYLKKAQLIWKF IESGADDKLGGGIYWCEQKKVSKNTCSNAPASVFALKMFKAIRDSSYLIKGQELYEWTKK RLQDSTDYLYFDNISLDGKIDKSKYAYNSGQMMQAATLLYQLTGRSHFLKDAQNIAKACY NYFFINFIPEEGEPFKLLKKGDVWFTAVMLRGFIELYQTDHNKTYINSFNQNMDYAWEHV RDEKGLFDIDFSGRTHDDRKWLLTQAAMVEMYARLAVTK >gi|226332013|gb|ACIB01000043.1| GENE 46 49701 - 50243 267 180 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|229255399|ref|ZP_04379326.1| acetyltransferase, ribosomal protein N-acetylase [Capnocytophaga ochracea DSM 7271] # 6 170 4 169 175 107 36 5e-22 MHTTILETERLLLRPFRETDLQELFECCQNPNLGNNAGWEPHKSIEDSKEVLHTVFMGNE GVFAIILKEDNSLVGSIGIITDPKRENTRTRMLGYWLKECHWGKGMASEATRTILDYGFN 
VLGLHLISANCYPHNTRSRLLLERNGFVYEGILHEAEMTYDGHVYDHLCFYQKKGRVPMD >gi|226332013|gb|ACIB01000043.1| GENE 47 50251 - 51537 599 428 aa, chain - ## HITS:1 COG:HI1612 KEGG:ns NR:ns ## COG: HI1612 COG0534 # Protein_GI_number: 16273502 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Haemophilus influenzae # 11 341 19 356 464 75 22.0 2e-13 MNYTYRQIWLINYPVMLSILMEQLINITDAVFLGHVGEVELGASALGSIYYLAIYMLGFG FSLGVQVMVGQYNGKGHYTGTGGTFFQGLVFLCVLAILLSLSSYFFSPFLLRYFVHSSEI YCAVIDYLDWRCFGLLFSFPFLAFRSFYIGIVRTNALSWAALAAVLINIPCNYWFVFVLD RGIAGAAIASSLSEAASLLIIVVYTWLKIDRTDFGLKFIFDKCLLVELWHLSVWSILHAF ISVAPWFLFFVVIERLGEAQLAVSNIIRSVSTIFFVIVNSFAATTGALVSNLIGAGERQY LFPLCRKILKLGYAIGLPLVVLAMVFRRQITGFYTDSSQLINISSIPFAVMLLNYVFALP GYVYINAVTGTGKTRIAFWFQLITIVIYLFYLYLLSHYTASLAVYLTAEYLFVILLAVQS VLYLRISS >gi|226332013|gb|ACIB01000043.1| GENE 48 51850 - 52176 376 108 aa, chain + ## HITS:1 COG:MTH1452 KEGG:ns NR:ns ## COG: MTH1452 COG1917 # Protein_GI_number: 15679449 # Func_class: S Function unknown # Function: Uncharacterized conserved protein, contains double-stranded beta-helix domain # Organism: Methanothermobacter thermautotrophicus # 10 107 1 98 99 108 45.0 2e-24 MEQSFKKGIVLHLASLVEYSEGGIISKQLIKSPAGNITLFSFDKGEGLSEHSAPFDALVQ VLEGSANIVVNGQVFTVNAGESIVFPANAPHALTAIERFKMLLTMIKE >gi|226332013|gb|ACIB01000043.1| GENE 49 52285 - 52947 387 220 aa, chain + ## HITS:1 COG:PA0750 KEGG:ns NR:ns ## COG: PA0750 COG0692 # Protein_GI_number: 15595947 # Func_class: L Replication, recombination and repair # Function: Uracil DNA glycosylase # Organism: Pseudomonas aeruginosa # 3 220 8 226 231 242 52.0 4e-64 MNVKIESSWQQRLQEEFDKPYFEKLVNFVKNEYGKAHILPPGHQIFHVFNSCPFQNVKVV ILGQDPYPNPGQYYGICFSVPDGVAIPGSLSNIFKEIHQDLGKPLPNSGNLDRWVKQGVF PMNSVLTVRAHETGSHRNIGWETFTDAVIKKLSEERENLVFMLWGSYAKEKASLIDTDKH LILTAVHPSPRSADYGFFGCKHFSKANTFLRSRGIEEIDW >gi|226332013|gb|ACIB01000043.1| GENE 50 53390 - 53890 322 166 aa, chain + ## HITS:1 COG:BH0401_2 KEGG:ns NR:ns ## COG: BH0401_2 COG3449 # Protein_GI_number: 
15612964 # Func_class: L Replication, recombination and repair # Function: DNA gyrase inhibitor # Organism: Bacillus halodurans # 16 166 6 156 158 77 33.0 1e-14 MKPAIIKPDLELKKEIRELPQRNVIYIRLFGDYKLNDYAGTWMHLIQFVKEQNLPMGDPS PLCIYHDDPKVTPTDKLRTDVCMLLPSNARPKGNVGFKQLPAGRYATFLYKGSYDQLQAV YDTIYGKYLPEMECTLSDEPSAERYLNDPSCTPPEELLTEIYIPIR >gi|226332013|gb|ACIB01000043.1| GENE 51 54171 - 56060 898 629 aa, chain + ## HITS:1 COG:no KEGG:BF3314 NR:ns ## KEGG: BF3314 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 629 1 629 629 1248 100.0 0 MKNKSIASFMIKIVMMPLLFTISCVNEISEDPVDIPGEIPIRLSTQILCNHTRAINNEFQ EKDAIGLYVLTQLSTINQKRYIDNMRFTCSQATGFEPEETIYYPKGDGKCDFISYYPFQE TGINQDQSIMQVQIHTDQSSVSKHSLSDFMIATNSDITPSQNMVSMEYKHKLCKLKITIK PAPGEDIDELLNDNPSLSLNGFHSDASYDFLTDRFEPSGQTISVTPHGEWKIENNALTGK EVILIPEKIESDNHYINIDINGKSYSCPFPDNFQLASEKNCSIAIIYKSSEGIQINNFDH SITDWTEGDSGETTAQETSGVIHLSALKFSKSNVYKAINEGTQVAEICKEYLLADNIDAQ AIVVYPVLNGATDLNNGTVICLLDEPASIHGGKVSWNKVNNALDYTPGNQSIISDFYITQ DNSISISKPENPLPVRLQEEVLTDIRNTETQTYPIVKIGIQYWMGRSLEATHYTDGKAIT LKKDFTTTAGYYTGKFKDAPQDFYFYNSEAVISGKLSPQGWSIPTETEWELLKQYINNDA SKLKFGPWSSDENGDKLPILNITGFNGVPEGYILKSKNGYTNGLYTVVYWSTNSSGNQTN RAIYLLHTTNEIKDGNITDRALSVRCIRK >gi|226332013|gb|ACIB01000043.1| GENE 52 56250 - 56849 401 199 aa, chain + ## HITS:1 COG:no KEGG:BF3506 NR:ns ## KEGG: BF3506 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 199 1 199 199 375 98.0 1e-103 MQKVLFITTILLTLFALSIRAEHNPRKRPDVSAQFQTVSGHSISLNGIRLAYSYEYAFTP KGTIIFSAGSSYAFGRILQLKMESLREFHIDTKDYHLVTGDLAIEPRFYYNLKKRHRNGK RTWGNSGGYLSVNFGYSFPITITSGVKAAHIYVITPYWGFRRVWKHFLFDLSGGVGYIGS SNRTSAVYPGLRIGLGYRF >gi|226332013|gb|ACIB01000043.1| GENE 53 57086 - 57814 277 242 aa, chain - ## HITS:1 COG:RSp0958 KEGG:ns NR:ns ## COG: RSp0958 COG2846 # Protein_GI_number: 17549179 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Regulator of cell morphogenesis and NO signaling # Organism: 
Ralstonia solanacearum # 22 232 21 220 223 124 33.0 2e-28 MKDYKLMAVGQIVADCFDYAKVFNKYGIDFCCNGDVSLADACGKMGIDADCLLEELKQIK SERSLTLDFKSWPIDLLVDYILKFHHRNIRYQGPQILQLLDRVCEAHARKHPELYEVREL FQESWIDLNNHLTKEEMVLFPYIYDLFDAVAQHRPIPAFHCGSVSSPISVMMSEHDAEGE RFRRISGLTHGYLVPGDACSSYRLLLEMLRTFEDNLHHHIHLENNIVFPKAIELQENCER MC >gi|226332013|gb|ACIB01000043.1| GENE 54 58058 - 58522 493 154 aa, chain - ## HITS:1 COG:CC0942 KEGG:ns NR:ns ## COG: CC0942 COG2030 # Protein_GI_number: 16125194 # Func_class: I Lipid transport and metabolism # Function: Acyl dehydratase # Organism: Caulobacter vibrioides # 8 148 5 145 148 122 46.0 2e-28 MDKVIINSYEDFEKLIGQQIGVSDYLEVSQERINLFADATLDHQWIHVDTERAKVESPYH STIVHGYLTLSLLPHLWNQIIEVNNLKMMINYGMDKMKFGQAVLSGQSVRLKASLHSLTN LRGVAKAEIKFAIEIKDQPKKALEGIAIFLYYFN >gi|226332013|gb|ACIB01000043.1| GENE 55 58586 - 58981 63 131 aa, chain - ## HITS:1 COG:BS_yuxK KEGG:ns NR:ns ## COG: BS_yuxK COG3011 # Protein_GI_number: 16080202 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Bacillus subtilis # 3 119 10 127 137 111 43.0 4e-25 MNVILFDGICNLCNGAVTFVVKRDRKGLFRFVSLQSETGKSLLKRYAVESTNKTLYYFRN NRCYSKSTAILYILKDLGGFWQCLYPLILIPAKLRDAIYLLVSKYRYRIFGKADSCIKPF GFSSEESKRSD >gi|226332013|gb|ACIB01000043.1| GENE 56 58991 - 59851 849 286 aa, chain - ## HITS:1 COG:no KEGG:BF3510 NR:ns ## KEGG: BF3510 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 286 27 312 312 528 100.0 1e-148 MKKNLLLVGALVSAFLLASCSGGDKSKAPVVSTADIENAAEVIKYYNTSLGVLKDMVKEK DVNAVLDYMEQKGKAPALSAIVPPAVVSKDSAIVLNPGNCFNEETRQNLKQNYTGLFQAR TEFYANFDTYLSYLKKKDVTNAKKLLDVNYQLSTQMSEYKQNIFDILSPFTEQAELVLLV DNPLKAQIMSVRKMSSTMQSILNLYARKHRMDGPRIDLKVAELTQQLDAAKKLPVVNGHE GEMKSYQAFLSQVETFIKQVKKVREKGEYSDADYDMLTSAFETSII >gi|226332013|gb|ACIB01000043.1| GENE 57 60788 - 61567 240 259 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 7 247 4 238 242 97 31 7e-19 
MNRFENKIIIITGAAGGIGASTTRRIVSEGGKVVIADYSREKADQFAAELSNSGADVRPV YFSATELKSCKELITFTMKEYGQIDVLVNNVGGTNPRRDTNIETLDMDYFDEAFHLNLSC TMYLSQLVIPIMSTQGGGNIVNVASISGITADSNGTLYGASKAGVINLTKYIATQTGKKN IRCNAVAPGLILTPAALNNLNEEVRKIFLGQCATPYLGEPQDVAATIAFLASEDARYITG QTIVVDGGLTIHNPTINLV >gi|226332013|gb|ACIB01000043.1| GENE 58 61971 - 64937 2308 988 aa, chain - ## HITS:1 COG:no KEGG:BF3321 NR:ns ## KEGG: BF3321 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 988 14 1001 1001 1885 99.0 0 MVITACTPFGLLHKEQPRVTLALPARQETGPMNVVKRDSVSLLPVSNFTFVNSKGDSIPV GMSVEWDSIHKENLTTLALDEVVVSARSTRNTAERNGMVNIEFVVTVPQALQKDNWMLNI RPVLMRGDTPDSLKELRFTGQRFREAQERDYRHYDRFVEKIIPDSVNFYRTYVNYHSFER YLERLKWYKRGLEKRWAIQDARKRRPDPLLLRFDMFNRQVGRRDSLMKSRMLDNSQRMIT RQWWRYGRAWERMNDTLQFQSRHLLERFRFFNNKWADNAAFQSDGLIARKNYFRDKALST PMWQAKRALYKADPDAAIRIYASRFGYFNDKIERLDATLYRYYRTKGARAESREGVRFLR AFMVGRDTTSSYLNRNQLTEKYIRRYEKVKNFFPMFHFRRPDPDTLSPLWETRTRIDTMQ TRHTLLSKFSKEDIYEYYVRQQQGVSDRGMIGPFRGLLPLYTYHRDLPDSIVSRVPGRKT RRDFELSRFDSATTVNRYIGRYEFLRSTYPQYRLIRKLYNIHPPALRHAARQASYEERLA RINSLDSTSLIKMFYNTQKIARNEARKAMKDTKYRDIVRFPFNPEAQLDTVIYATDQVHF LYSQKVPADENSARMKVYVVGDVLNSNGSRFSLPYSDTLTYLVSSMTKFVDRTPRFVRKI VTRDAEANASVNFYFPKNSFRMDETIDVNRQGVKQVHNLTLALMTDPVYIIDSLTLLATS SPEGNWHVNGEIARKRAESIRNILVEDFKQLYDSLAIGAAIEMDEAGNIIRQEMKDGIPN LPELIKIRTVPEGWEKLRRLIVNDKNFQGNKGAILRIIDREQEPDRREWLIKSQYKTEYA YMFDKLYPAVRRVDFLFSLSRRGMRQDTLYTNEPDTMYARAVDYLEKRKYGQALEILRPY EDVNTAIAYMSLGYDKAALRILEQSSQTAETQYMQAILNARLGNEQRAVSLLLSAAEVDD RIRFRANLDPELSLLVKKYGLFKEDDLW >gi|226332013|gb|ACIB01000043.1| GENE 59 65006 - 65569 455 187 aa, chain - ## HITS:1 COG:no KEGG:BF3513 NR:ns ## KEGG: BF3513 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 187 1 187 187 337 100.0 2e-91 MRRIIRYCLLFFFLLFAGSLTSNLQAQFYSVQTNTLKLATTTFNAEGSMLLSTHWTLNLG FSYNPWNFSDTRKIKHFLIEPGARYWFWQTYAGSFISMYAMGARYNVAWDGLLGGDYRYQ GWGYGAGMTYGRSWLLSKRWNMEVEAGLGLLVAPYTKYRCEHCGDKVKSGTYFLPTPKLA FNLVYLF 
>gi|226332013|gb|ACIB01000043.1| GENE 60 65566 - 65928 228 120 aa, chain - ## HITS:1 COG:no KEGG:BF3323 NR:ns ## KEGG: BF3323 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 120 1 120 120 233 98.0 2e-60 MEIPVLAKTMELDNLYHIYLFYVDDRWCAFGCSAYYLSIMYPELDDFGEAFFTSDGDCLP FLPVTEPCLLNLSDYYNTLVSDTHIQVSVPPTVYSYRNGYDKWCAKLFVDKNKLHILKHQ >gi|226332013|gb|ACIB01000043.1| GENE 61 65947 - 66582 496 211 aa, chain - ## HITS:1 COG:no KEGG:BF3324 NR:ns ## KEGG: BF3324 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 211 4 214 214 432 99.0 1e-120 MSFKYYILLPIVWVCLCFPSDAFGQYIKKETVNSQVYAVIYAEGLAGSAQWVSGFKMMKN DATIRHQVSRGNGGNPIVNDRIPMRFIIAPTDVANVTWMQAEGAGDGNGNLNADFRSTAA TGCRSYKIAGDPNRKWRVPTQRELQLMWLFREPVGIIYPAAQMENVSSKIYWAATEEDAA NAWYFDFKQGVPQCSWQLKTTSSNVRCVSDY >gi|226332013|gb|ACIB01000043.1| GENE 62 66595 - 68334 790 579 aa, chain - ## HITS:1 COG:no KEGG:BF3325 NR:ns ## KEGG: BF3325 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 579 11 589 589 1098 99.0 0 MICPLLAGCTDDDMMPSSQMPLSSETVPVRLNFSTEAFNTPFQGDTRSGGESTVLSVSNQ DMDIELVKTPVTRDAVAAIDKENAVYNYTVLQFAGITETATLLGKATYPCKDGVITTADV ELQATTTGPGGTVVKHRFVVIANVDGTDFNTLQENTSTYSDLQNMHISQAGNQDFPLHKV TVNGVKKDAIIMSGLVDATINTGEAKQLSIALKRTVAKVTFNVKTDNPAFSNLKNWDLIL MSIPNKSYFNTLGRFAVFPTVDPLNQFSAYWFKPLTSTVGEALPLNEKSSYLPVNLQQSV AISTSGTRRDNAPIGGTYLQIMGREMSPDGVGSFPVVKDFVLYQIYLGKNLTTDFSVYSN NNLTYNITLKGRSDDDTNVIRFIPGYFSGELKAYDANDNALASKTDPSAVKWEYSKRLEA FFQDSKYTGQQVGGEENGRQDVRWQVIGSYNNRGATSLTDGYGNTRQLEANDIFYLHYPA AQACYGGLNGLVNGGDTSFSWYLPSVSELIGTWISSASTASQLSASYWSSTALGSPNAFI ITNKGEVKTAPVNSNDDRHYVRGFRDPDAVNTIHYNITH >gi|226332013|gb|ACIB01000043.1| GENE 63 68401 - 70095 990 564 aa, chain - ## HITS:1 COG:no KEGG:BF3326 NR:ns ## KEGG: BF3326 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 564 1 564 564 1137 99.0 0 
MDTQKKHNLSGTIILVSCFLLLAGACDNSDRIETETQANGVLLNFNASTIDATTTETRSF VPIEGFAKNEYIFGMSVTKDNASRGEIFEGSRNLKATMSRPSGDAPWNWEFSNNASGTVV TPRGPEGKPLRVIAYYPAIAGTENFTDGIPFDFTQTNNPQQKEILYNTNTSYTIPSSGGS DKVTIPLKFQHAYSWISIKVTKSVDKGSHKLSGVSIDNLSGGWIKNKGKINPATGMAMQG AIQGPIGEVRTAEVLDPSGDTATPIIYDFLVPAFMDRNVKDDDLVFTLIIDGKKEIFSLG REHLNNDGDTYGFKQGYANTYNVEFNNSSLNMRLLNWTSTRIDGDFGQNVTNPTNYQEMS FYYAEDNGGWTTANGKVFPPKYKDLPAGDRRYYNYLTTVKYGGNGEYVPAKPVTDPPPPK GIIIEDDANVATQEPACRLFQMTTKDVSIEPVPWEDEMGQLVAKELCRKYNGGGFKDWRL PRASELRALLVYAIYGAGTKQFINLKLTNDVNREKLYWTGTEESEDKAWAMLYYDDNTLV GRGPNISAQDKSTRLSVRCIRQLQ >gi|226332013|gb|ACIB01000043.1| GENE 64 70122 - 71882 1053 586 aa, chain - ## HITS:1 COG:no KEGG:BF3518 NR:ns ## KEGG: BF3518 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 586 1 588 588 888 76.0 0 MTNNPISKLLRLVMLLAVVLLLSSCRQDDDSYSSHPGPTTLVSLSVSASNSGAVAASDDP ASSIRDLCILQFNVNGTGFGNLRHVAKGSPASAGTFNATLLQSVNPDDKYKLVLLANLPD YGFLNSLSGKSYDQVQKTLLSEELSGTNNIPSFDGSRPFPMFGVANSGNSIEITENMSLS DVSLIRGVARVDIGIGIKNADDTWNKNGVKFNMTQIQIWKAGKQYAYMPSENNFSSTGRV DGVTITGPSPVGTTETKVYDITHIINNTYCSGKIYLPEADLNWGDVYDANHTDRLAVIVG GKYNGSQTETFYRVDFKNDVSGEKMDILRNHVYRFTVTKVTDDGYDTAELAYKSIPKDIS FTAELTPWTFPPAVSVPSIIGYRMVYQNTNGGMLLWNTATGLTIPKKRDTWKGTKMNFNY NGFYDETNNAYAITYPIEPRNGSLYHTIEVAFDYEGVYPSLMVSADDVTDVTGGDANPWK TGKTLTAFDICRNYEGDGFGDWRLPRLSELALLYLNRGSLEAMRGFAPLSGTYWSGSEYL VSDSKVDKRHSEQAWGINFDATNPGNAAPYDKTTKKFKIRCVRQTQ >gi|226332013|gb|ACIB01000043.1| GENE 65 71929 - 72828 564 299 aa, chain - ## HITS:1 COG:no KEGG:BF3519 NR:ns ## KEGG: BF3519 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis # Pathway: not_defined # 1 299 2 300 300 453 72.0 1e-126 MTGDYKFRFMPGILSLIFTAVLLSGCIAENTDDCFKGVSLQVKLPADVSGETMKDINLYV FDDKDLLLDILPISSSEFVVLDYPGIPVLHCITWCNTGDGAVSAGSLKKGDPLSAGFISL KPSAATRAQMSLFNPPADLFYGELILENTSTSNHMEEQELSVSRMVASMNITIRGLELLG ESNRGVYSLVVHETASRLDFEGRYGGDPASYAFTPSFEVGKDCTMPTFNLFPVMGSGGII IDIYHDGKLLRSVSTDSGNRPLVPVVGKTLNVLLNFKLDVDVQVEITGWGEKYIWKEYN 
>gi|226332013|gb|ACIB01000043.1| GENE 66 72967 - 74142 1170 391 aa, chain - ## HITS:1 COG:no KEGG:BF3520 NR:ns ## KEGG: BF3520 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 391 10 422 422 249 44.0 1e-64 MCAIAALASCSQNDDVAVPTDSAPPAKVTIRFAGSEGMTRAIGSDDAVVNNLTAFFFNTA GALIKTPMPVAGNDLKPKITLATTTDASQVVLVANVPAGTKFDAVTSLSKLKEFVVSSLG TEASGNFPINQTKTNLTMSGWGAINMNADDNTGTANVKLHFMAAKIETLKVTIGGKNVGH YADTEDGVTDDKWFTIKQAYLMMAQTNSVLLPATDLGAWTGAFTPATFAYAGGLAWGTRP WENPPVNPDPVKATDYLQTTIPAGATSNVIDNILVSAPWYVFENASPNATGVVLEVVCNV RKDNNTLEKESRYFTMYFGEKKTGDSGNQPILNAGQRYSINIALNGSFDPGDGTGGGGTT DPTKPSVDANVEITVAPAEWTAVAVINKEFN >gi|226332013|gb|ACIB01000043.1| GENE 67 74467 - 75345 551 292 aa, chain + ## HITS:1 COG:no KEGG:BF3521 NR:ns ## KEGG: BF3521 # Name: not_defined # Def: AraC family transcription regulator # Organism: B.fragilis # Pathway: not_defined # 1 292 1 292 292 537 98.0 1e-151 MNKKSSLNNLLYIQEHRSCRNYLEKVENGFKYIEFSHEEVILEKEISWNYLLFVLEGECV INCNQFRERLFQANCMVLLPKTAMVEIKVIAGTRLLSLSFDVPLNVCDKFILQSLTGLCR KLDYNFQSLAIRYPLPPYLEIVTYCLTNKMDCGHFHTLLQQELFFLLRVFYLKEELALLF HPIISAELSFKDFVIGNYFKVSNVNDLISLSNMCKSSFYCKFKEVFGMTAKQWLLKQRNT HILNKVMTSETTVGELMEEFRFESQAHFTHYCKQHFNCTPRELIMKYQVVNQ >gi|226332013|gb|ACIB01000043.1| GENE 68 75508 - 76719 843 403 aa, chain - ## HITS:1 COG:no KEGG:BF3522 NR:ns ## KEGG: BF3522 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 403 1 403 403 793 99.0 0 MTSVKLKLNKTRALKDGNYPVVFQLIHQKRKKIIYTKYRMKEEDFIIIAGNVVSGCHTGC KISRELLRIYKQLTARVRRLESRGEEYTINDITTVMFSKVTGKFLLLPYIDTQIEWKKSI MKNGTAAAYQSTYASLAKYIGKKEVKISQVNHRFVTCYRDFLSKNGATENTIGYYLRNFR ALYNLAVKDGLVSPCDYPFKEICTKPCKTVKRALDREQMVKLACLSLHSDAELKRSLDLF LFGFYAQGMAFVDIAYLKWKNISGNRIIYRRHKSKQLIQIVITPQIKSIIDEHGNNTNNA EEYVFSVIKNNANEYTQYRTALGRTNRHLKIISAKLKIDPPLTTYTARHTWATLAREYGA PVSAISAGLGHTKEEMTLVYLKELDLAPLHRINKMVNNLLERK >gi|226332013|gb|ACIB01000043.1| GENE 69 77152 - 78018 478 288 aa, chain - ## HITS:1 COG:no 
KEGG:BF3523 NR:ns ## KEGG: BF3523 # Name: not_defined # Def: AraC family transcription regulator # Organism: B.fragilis # Pathway: not_defined # 1 288 1 288 288 567 100.0 1e-160 MQKLNQQPDICPYFSPEFKVIPQYIIMKEGECMELNNRTTSFFIFILSGEITISFEQYTN RSVLENEMFFLPKNNCFKWKAVTQTVLILTGYNATIFPCTSVRARILYKIKAGVKFDCRG VVMKDEVKVVVNQMKHYLESGINCHHMYILKHKELYLMFKHFYTYEEIIQIFYLILGSNP LFNERVLDNYLKVKTVKELAGLLGYGIKTFEKLFRENFDESPYKWMQKRKALQIQQRLMN PAISLKQIMYEFKFATSSHFNFYCKQHLGAAPMQIRNSNKDDNMSTLP >gi|226332013|gb|ACIB01000043.1| GENE 70 78448 - 79383 506 311 aa, chain + ## HITS:1 COG:no KEGG:BT_0234 NR:ns ## KEGG: BT_0234 # Name: not_defined # Def: putative transposase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 309 1 311 316 428 69.0 1e-118 MKATRKCSFCGKSFVTRSGMQRYCSEACQAEAKRARVMQKNNLFKVAQPLMEIQHQEYLT FSKAAILMGCSRQYIYKLVAIGKLKASRISNRMAFIRRADIEQMLEGNPYHRILPGNTST PRKSSSSSLPAKREKREKESEEVLDFYSGEEVMSLFKVKQSWLYTSAKRNHIPICRIAGK NYYSKKHIDEFFGVAVDISEITDWLLTEEVEELFGMKPTALRAYTYRHKIPTKREYGRTY YSKSHLNELRRTDLVNDERYYTVEQVQQIYGLSSANICHIVKVKHIEKIKVGVKNLLLRS DVERVMAERNK >gi|226332013|gb|ACIB01000043.1| GENE 71 79525 - 80643 688 372 aa, chain + ## HITS:1 COG:PA3738 KEGG:ns NR:ns ## COG: PA3738 COG4974 # Protein_GI_number: 15598933 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinase XerD # Organism: Pseudomonas aeruginosa # 95 358 14 287 298 69 25.0 9e-12 MSKCKTVTLRKRKIKNGTQYSLCLDYYPGYRDNVTMRVITREALGIYIFAKPANQQERDF NARMMKKAVILRNQRYEAIFNENNGFFDKTKMKGDFLAYFKGLADRKNIKWQHVYKHFQR FVNGKCTFEEVDVDLCRKFMEYLLDAPQSIHTNQKLHINSAAGYWSTFRAVLHTAYRDRK IKENPNGFLDRIECIPTIREHLSQEELIRLAETPCEEEVLKKAFLFACLTGLRKSDIRQL TWQQIQPYTNGRMFVTTRMQKTKEIVHNPISDEAYGLLGERGEGLIFEDFKDKMLQGPLQ RWLTAAGITKKITFHCTRHSFGSLHVEMGTDMAVIQAYLGHKNITTTQIYSKIAAQQMCQ VVDKITLKRKEA >gi|226332013|gb|ACIB01000043.1| GENE 72 80891 - 81748 424 285 aa, chain + ## HITS:1 COG:no KEGG:BF4230 NR:ns ## KEGG: BF4230 # Name: not_defined # Def: putative protein involved in transposition # Organism: B.fragilis # Pathway: 
not_defined # 116 230 2 116 142 151 61.0 2e-35 MESSIKDKYIILGFVGFAIVLISSIATLVIADSFNQDNFVRWIVFVCCNLLGWLLYLSFQ TLIFDTYEIYKIKFGKKETIAEAIEVQEELSQNTLEEATSVPGPTSVPEPVPESSPTKEE TLIQTQPIELTIAPDLHEKNRANYASREQREKEERIRMVMEYCHYYLPRIADQETVNHIC TEVDKWMNLNTYTPKPIQRPFTKDINNIPLRHFVWNISERFLYKRYYNGDNRAKFIKALF PKSFADTDLSTIKNFKVEPLKTEIPIDEPENGKLDFHYPEDYVRN >gi|226332013|gb|ACIB01000043.1| GENE 73 81948 - 82322 339 124 aa, chain + ## HITS:1 COG:no KEGG:BF4232 NR:ns ## KEGG: BF4232 # Name: not_defined # Def: excisionase # Organism: B.fragilis # Pathway: not_defined # 1 101 1 101 136 157 81.0 1e-37 MEKSILTFNDLPEVVAQLRDEVMSLKSLLAEQRSVNNAKTVDTHVPMSVDEAAEYLGIPK GTLYMKLSEGTIPATKPGKRYCLYRDELDKWLETARKNPIPLSDEELNKSLSSSHRRKPN PRNW >gi|226332013|gb|ACIB01000043.1| GENE 74 82328 - 83425 562 365 aa, chain + ## HITS:1 COG:no KEGG:BF4233 NR:ns ## KEGG: BF4233 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 198 1 198 240 363 85.0 6e-99 MEEDKNYINLIRGDLTKASQAHNGMPDSVGMMNIKTANQTILEASLLPTPRALWDSFWYE GELSCLFADSNVGKSILAVQIADRIARTDNVLYLDFELSEKQFQLRYTNEHGELYTFPDK LYRVSIDCNQLLDANFEEAIIGGIEQMAVQTDCKIFIIDNLTYLCCAMEKGDAAGRLMIQ LNNLKKRYALSILVLAHTPKRSLDCPITSNDLAGSKRLYNFFDSVFTIGKSAQDGGLRYV KQLKVRYGTFSHDADNVIVYEIDKVDAFLQFVFRGYSTEKEHLKKLGDNESSQRDCQILQ LSQSGKSVREIASQVNCGKSTVNRIIQRSKESKNAGVPSVPLSQPLECGTMGQDGTADNQ PSKTD >gi|226332013|gb|ACIB01000043.1| GENE 75 83429 - 84544 668 371 aa, chain + ## HITS:1 COG:no KEGG:BF4270 NR:ns ## KEGG: BF4270 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 4 358 6 341 342 259 43.0 9e-68 MGNYSLQKYKGTATRHTCPKCGDRHSFVYYVDENNVPLHPSVGRCNHESGCGYHYTPKEY FQEHPEHRTTNDFSFDRQRAEQKKVKQQSKPTAIGYIPPHYVEKSQSERSNFFRFLFTLL TSYYGDKAKEVLKRLLEEYRLGATRDGSVIFWQIDRTGKVRTGKVMQYNPEDGHRIKGGQ TSAVNWIHSILKKQRVLAEDWQLSQCLFGEHLLKTHPDKVVVLVESEKSAVIGSAIFPDY VWLATGGKSQMREEKLRVLSGRTVLLFPDADAYAEWKQRAESMYFCKVVVSDIIERNATP KQKEAHIDIADWIIFQIREGKVMSTANHLVEAERILQRMIEKNPVLQKLIDDLDLVLVGA SPIGNDDEKPP >gi|226332013|gb|ACIB01000043.1| 
GENE 76 84827 - 86230 789 467 aa, chain + ## HITS:1 COG:no KEGG:BVU_1439 NR:ns ## KEGG: BVU_1439 # Name: not_defined # Def: mobilization protein # Organism: B.vulgatus # Pathway: not_defined # 1 467 1 467 467 675 75.0 0 MATKSSIHIKPCNIASSEAHNRRTAEYMRHIGESRIYVVPELSTDNEQWINPDFGSPDLR MHYDNIRQMVKEKTGRAMQEKERERKGKNGKIVKIAGCSPIREGVLLVRSDTTLADVRKF GEECQRRWGITPLQIFLHKDEGHWLNGQPEAEDRESFKVGDRWFKPNYHAHIVFDWMNHE TGKSRKLNDDDMMQMQTLASDILLMERGQSKAVTGKEHLERNDFIIEKQKAELQRMDAAK RHKEEQINLAEQELKQVKSEIRTDKLKKTATTAATAITSGVASLFGSGKLKELERANEKL QDEVSKRNTNIEKLQSQVQQMQKQHDTQIHNLREMHRQELDMKEKELSRLARIIDKAFRW FPMFREMLRMEKFCAMLGFSKEMTESLIVKKEALKCSGKIYSEQHRRNFDIKDDILRVEN DPDDESRLNLTINRKPIADWFREQWHRLRYGARVPQQEERKSRGFKL >gi|226332013|gb|ACIB01000043.1| GENE 77 86323 - 87288 537 321 aa, chain - ## HITS:1 COG:no KEGG:BT_4507 NR:ns ## KEGG: BT_4507 # Name: not_defined # Def: beta-lactamase precursor # Organism: B.thetaiotaomicron # Pathway: Biosynthesis of secondary metabolites [PATH:bth01110]; Two-component system [PATH:bth02020] # 44 311 22 287 293 204 41.0 5e-51 MEKNRKKQIVVLSIALVCIFILVFSLFHKSATKDSANPPLTNVLTDSISQIVSACPGEIG VAVIVNNRDTVKVNNKSVYPMMSVFKVHQALALCNDFDNKGISLDTLVNINRDKLDPKTW SPMLKDYSGPVISLTVRDLLRYTLTQSDNNASNLMFKDMVNVAQTDSFIATLIPRSSFQI AYTEEEMSADHNKAYSNYTSPLGAAMLMNRLFTEGLIDDEKQSFIKNTLKECKTGVDRIA APLLDKEGVVIAHKTGSGYVNENGVLAAHNDVAYICLPNNISYTLAVFVKDFKGNESQAS QYVAHISAVVYSLLMQTSVKS >gi|226332013|gb|ACIB01000043.1| GENE 78 87552 - 87764 92 70 aa, chain + ## HITS:1 COG:no KEGG:BVU_2907 NR:ns ## KEGG: BVU_2907 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 3 70 1 68 68 107 80.0 1e-22 MRMPNTWITDFSFREQTLYPQLCYVVYWLNSISMGNTFVADFKQLLSKYPSVRTRLLGFP HNWEQEPLWR >gi|226332013|gb|ACIB01000043.1| GENE 79 88313 - 89479 877 388 aa, chain + ## HITS:1 COG:MA2533 KEGG:ns NR:ns ## COG: MA2533 COG1488 # Protein_GI_number: 20091361 # Func_class: H Coenzyme transport and metabolism # Function: Nicotinic acid phosphoribosyltransferase # Organism: Methanosarcina acetivorans 
str.C2A # 2 383 1 395 404 301 42.0 1e-81 MIIRTILDTDLYKFTTSYAYIKLFPYAIGTFSFKDRDGTEYSDEFVERLRTEISQLSHVA LTEKELEYMIKNCRFLPRVYWEWLSSFRFQPEKIEIRLDENRQLHIEVNDYLYKATLYEV PLLAIVSEIKNQSSGNVANLEDILYKLSEKTELSNKHQLLFSEFGTRRRFSFDVQNQVIG HLKQTAHYCIGTSNCHFAMKYGMKPMGTHPHEWFMFHGAQFGYKHANYIALENWVNVYDG DLGIALSDTYTSAIFLSNLSRKQAKLFDGVRCDSGDEFRFIDQLTARYKELGIDPTTKTI VFSNALDFDKALDIQEYCQGKIRCSFGIGTNLTNDTGFKPSNIVMKLSQCKMNMNQEWRE CVKLSDDIGKHIGSPEEVRACLYDLRLE >gi|226332013|gb|ACIB01000043.1| GENE 80 89503 - 90714 803 403 aa, chain - ## HITS:1 COG:no KEGG:BF3525 NR:ns ## KEGG: BF3525 # Name: not_defined # Def: thiol:disulfide interchange protein # Organism: B.fragilis # Pathway: not_defined # 1 403 1 403 403 852 100.0 0 MKSILLSVISFLLVMNVCGKNDVDMRTYLKKVLGNLGKIESASYHEQSQSWQPGDTVAIT NFHRFIKEYTNPSDSTIGASYVCLDAKDTTRFEFGYDGNVRMISYHEHKGIMIDDFTTRS LPFRLVAPPFFCYTRNIIGYALSTGDSITTEWKDLGEAYYFKLVIHEDRQVEFFGKAYYI PKPPFDLGDPTSIYELWISKADGLPYKMRREMSHQISATTCSDVVLNQLSIAGLNLYDYV PQDYEIRKYGEQKKVTQPAATDLIGQKAPDWMLKDMYEKPVALSEFKSKVLLVNLTGIGC GACHASIPFLNGLKGKFNAGEFEVVSIETWGREPHSLRTYADKNQINYSFLCGDEGIVKS YRTYGAAPLFFILDQDRVIRKVIRGYRMGRTDEEITDAIKALL >gi|226332013|gb|ACIB01000043.1| GENE 81 90845 - 91006 60 53 aa, chain + ## HITS:1 COG:no KEGG:BF3526 NR:ns ## KEGG: BF3526 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 53 1 53 53 90 100.0 2e-17 MARTQIDEDIPILRSERNDFSLFNNTPSIQNLCNRKIQRIDMTFTYRNLHYIE >gi|226332013|gb|ACIB01000043.1| GENE 82 91463 - 93727 1618 754 aa, chain - ## HITS:1 COG:PA5529 KEGG:ns NR:ns ## COG: PA5529 COG0475 # Protein_GI_number: 15600722 # Func_class: P Inorganic ion transport and metabolism # Function: Kef-type K+ transport systems, membrane components # Organism: Pseudomonas aeruginosa # 3 429 2 427 585 347 42.0 7e-95 MSHLPTLIADLALILMSASIITLLFKWLKQPLVLGYIVAGLLAGPYVRIFPTVGDMENIN TWAEIGVVFLLFALGLEFSFKKLMNVGSAAFITATTEVISMLLIGFMVGHLLGWSTMNSV FLGGMLSMSSTTIIIKAFDDLGLRSQRFTGIVFGTLVVEDLIAILMMVILSTMAVSKEFV 
GEELLMSVLKVAFFLILWFLVGIFILPFFLKKAKRLMNNETLLIVSLGLCLAMVVLATRT GFSAALGAFIMGSILAETVEAEHIEHIIQPVKELFGAIFFVSVGMLVNPSVLLQYAWPVV IITLVTLVGKSIFSSLGVLLSGEPLKVSVKSGFSLAQIGEFAFIIAGLGASLKVLDPFVP PIIVAVSVITTFTTPYFIRLANPFSEWLYKVLSPRTREFLDRYASGKKTVNHDSDWKRLL KTIVGRVIIYSVLLTAIWLLSVQTVYPTINGMFDSPGRWLSVVMCLLTLLLMTPFLWALV SDKYNSPDVFLKLWNDDNYNHGRLVALILFRVSVAVFFIAGVVISYFSLNYGIGIVIAVA VLGLILLLRENLTQYSHLENHFLTNLNGREEAARNRYPLKSRFNSEFSDKDIELTSVWVS PYSSYIGKSLEELPFRREFGVNVVGIVRGERRIYIPQSDECIYPQDKLIVVGTDGQLQKF RSELDIRQDFPDEKNVRQEVTLHSFTVDEESPLLNKSIAQSRLGKQYDSLIVAIEREDEL ISMNQSTVFRLGDLVWIVGDREKIRQLLMRKKHV >gi|226332013|gb|ACIB01000043.1| GENE 83 93666 - 93980 112 104 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MILADISIKARSAIKVGKCDIVWYVYFVSTSFRFMLVYCTQRTIGSPFVFRIYDIYFEKK LRGHCTKLNFSCKKYPSYSIFLYICRRIGSESVVNGRRLKGNRV >gi|226332013|gb|ACIB01000043.1| GENE 84 94100 - 94354 95 84 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLVISISRQERITLTAISPRLATKILLIIFISQLNAYTYSRFSTLRTYVHPMKTQYFYDL GLNNDAFFPNPRGENALIHEWQVF >gi|226332013|gb|ACIB01000043.1| GENE 85 94269 - 95783 1621 504 aa, chain + ## HITS:1 COG:MA0675 KEGG:ns NR:ns ## COG: MA0675 COG0439 # Protein_GI_number: 20089560 # Func_class: I Lipid transport and metabolism # Function: Biotin carboxylase # Organism: Methanosarcina acetivorans str.C2A # 1 463 1 464 493 515 55.0 1e-146 MIKRILVANRGEIAVRVMRSCREMEITSIAVFSEADRTAKHVLYADEAYCIGPAASKESY LNIEKIIEVAKSCHADAIHPGYGFLSENATFARRCREEGITFIGPDPETMEAMGDKISAR IKMIEAGVPVVPGTQDNLKSVEEAVELCNQIGYPVMLKASMGGGGKGMRLIHHADEVAEA YTTAKSESLSSFGDDTVYLEKFVEEPHHIEFQILGDKHGNIIHLCERECSVQRRNQKIVE ESPSVFITPELRRDMGEKAVAAAKAVNYIGAGTIEFLVDKHRNYYFLEMNTRLQVEHPIT EEVVGVDLVKEQIKVADGQVLQLRQEDIQQRGHAIECRICAEDTEMNFMPSPGVIKQITE PNGIGVRIDSYVYEGYEIPIYYDPMIGKLIVWATTREYAIERMRRVLHEYKLTGVKNNIS YLRAIMDTPDFVEGHYDTGFIAKNGEVLQQCITRTSERAENIALIAAYMDYLMNLEENNS GLAADNRPISKWKEFGLHKGVLRI >gi|226332013|gb|ACIB01000043.1| GENE 86 95788 - 96288 470 166 aa, chain + ## HITS:1 COG:AGc4940 KEGG:ns NR:ns ## COG: AGc4940 COG1038 # Protein_GI_number: 15889978 # 
Func_class: C Energy production and conversion # Function: Pyruvate carboxylase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 88 164 1095 1171 1174 64 38.0 7e-11 MEIHIGNRVAEIELVSKEDNKVVLTIDGKTFEADVVMAENGNCNILMDGRSSNAQLIRKE NGKSYKVNTHYSSFNVEIIDSQAKYLRMRKKGEEEQNDRITSPMPGKVVKIPVSVGQEMK AGETVIVIEAMKMQSNYKVTSDCRIKEILVQEGDNIAGEQTLITLE >gi|226332013|gb|ACIB01000043.1| GENE 87 96308 - 97843 1593 511 aa, chain + ## HITS:1 COG:VNG1529G KEGG:ns NR:ns ## COG: VNG1529G COG4799 # Protein_GI_number: 15790513 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) # Organism: Halobacterium sp. NRC-1 # 13 511 12 516 516 619 59.0 1e-177 MEDINKAYATFAERDRIASLGGGAAKIDIQHESGKMTARERIDMLLDKGTFVELDKLMVH RCTNYGMDRNKIPGDGVVSGYGKIDGRQVFVYAYDFTVYGGSLSASNAKKIVKVQQLALK NGAPIIALNDSGGARIQEGIESLSGYADIFYQNTMASGVIPQISAILGPCAGGACYSPAL TDFIFMVKEKSHMFVTGPDVVKTVIHEEVSKEELGGAMTHSSKSGVTHFMANTEEELLMS IRELLSFLPQNNMDEARKQNCTDDVNREDASLNSIVPADPNVPYDMKEIIERVVDGGYFF EVMQNFAKNIIIGFARMGGRSVGIVANQPAYLAGVLDIDASDKASRFIRFCDCFNIPLIT FEDVPGFLPGYTQENNGIIRHGAKIVYAFAEATVPKLTVITRKAYGGAYIVMNSKQTGAD VNFAYPSAEIAVMGAEGAVNILFRKADAETKAQELNAYKEKFATPYQAAELGFIDEIILP KQTRKRLIQALEMTENKMQTNPPKKHGNMPL >gi|226332013|gb|ACIB01000043.1| GENE 88 97948 - 98619 785 223 aa, chain - ## HITS:1 COG:no KEGG:BF3532 NR:ns ## KEGG: BF3532 # Name: not_defined # Def: putative isochorismatase # Organism: B.fragilis # Pathway: not_defined # 1 223 1 223 223 468 100.0 1e-130 MEGINTPFVIDEHTAIVMTDPQNDFLSENGLGWGAFGENIQKNGTVENLRRIFEVAAAKG MLVFISPHYYYKHDHQWLFEGPIEKLMHDTGMFERRGQLTGEGFEGSGADWLDLYKPYIN EGTNIIVTAPHKLYGPENNDLILQLRKRGVNKVVVCGMSGNLCAESHLRELQERGFEAAV VFDATASAKLPGMDADTAAFINFTLLAEKVYTTDEFVNEMRQR >gi|226332013|gb|ACIB01000043.1| GENE 89 99017 - 100630 1360 537 aa, chain + ## HITS:1 COG:PH0142 KEGG:ns NR:ns ## COG: PH0142 COG1680 # Protein_GI_number: 14590084 # Func_class: V Defense mechanisms # Function: Beta-lactamase class C and other penicillin binding 
proteins # Organism: Pyrococcus horikoshii # 50 278 9 226 289 92 27.0 2e-18 MKKNLLILVALLTSATISAQNGGTIMKKYENQLPQAGRGNVHVAYQGKAIDQMIYDFMEE QGIPGMTLAIVQAPYIPRVAGYGVTDLEKGNLAAAKTLWPIGPISQGYAAVAVMQLYEKG KLDLNDPIGKYLKDIPENWKPISILQLMQHSSGIADYRNEKGFDVSADYRPEQLIETVAA IPLAFEPGTDVKQSATNFLLLTSIIEKTGKMPYHDFVKKYQIDYLGLKQTFFGKDLAKVK QEDVTLTGNVHQTFKKDKDYINPSETTTGYVEKEGRLVAAPAVSPTALKGFSDIWASAEN VSHWDIGLAGSALIEKPENRDMVYKPTRLANGKVVPAMAGWQFYNHNGLMDIKGNVSGHS AFLSRFTDASELVCVTLLANKEGVDLTNLGRRIAAAFDSDKMGTGANDNLLYTYESQFSV PETMTRIEQTLHTMGVPVFAKFDHGKNAEEVGLQLRPNQVIVFGSPKVGTKLMQDNPSIS IELPLKISVWEDKNGSVWATFPQMRTMAAEYGLEAEPVIGKMQELLEKIVIKGASVY >gi|226332013|gb|ACIB01000043.1| GENE 90 101224 - 102774 967 516 aa, chain + ## HITS:1 COG:MA2121 KEGG:ns NR:ns ## COG: MA2121 COG2865 # Protein_GI_number: 20090964 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Methanosarcina acetivorans str.C2A # 59 511 7 455 458 270 35.0 6e-72 MEEWKNLTESKCSELRGRTSSGYSTAIDKIYCKTNNLKSQQITIFVPLLQKFMLNTENIQ SLIDSGEGYNVEFKVRVPSKVRELTEEICAFANADGGYLLIGVDDNGQVVGTNLENDKRS AIQGSISEISPALHCELCPVSIEDKTVWVIDVPSGKDKPYIFSGSIYVREGANSQKLRTV EEMRSFFQECNKIFFDHIPCYWFNIYTDADEQMIKDFRTEAKLSPSTPGKQIFENLELFT ENGTAKNGAAMFFGKQPERKFPHAVTRCVLFKGTNKVYIIDDKTFGGSLYQQYLYAMAWL ESKLQVAYKIEGAGPREEIWEIPLTVFKEAIINALSHRDYYEQGASIMIEMFDDRVEISN PGGLLPVVAKDFGHKSMTRNPLIFGLFTRMHLVERVASGIPRMREAMRKANLPEPEFHTE GIFTAVFKRGISINHDTVNDTVNDTVNSKEQEVLNIIKQYPGLNSSKIAELIGKSVPTAK RYLNSLVRLELIEFKGAQRNGGYYWNNKNNETSGKN >gi|226332013|gb|ACIB01000043.1| GENE 91 102793 - 104274 543 493 aa, chain + ## HITS:1 COG:no KEGG:BAA_A0205 NR:ns ## KEGG: BAA_A0205 # Name: not_defined # Def: pXO1-133 # Organism: B.anthracis_A0248 # Pathway: not_defined # 2 492 3 483 485 250 35.0 8e-65 MRNINILSIIEAYRKLSNTLFQKLMNSYGITSGIKNYELNGIESFVDELLKANNNISIVN RYYLGYSIPQIGKEFDLLRFGHNYIINIEIKTESSIEKILKQQQKNKYYLSFLDKPLHIY 
TFISNENKLYKLVIRNNGDEIEEITFNELCNILMSQEVVTFNNIDDLFNPSDYLVSPFNS PEKFMSEGYFLTVQQEQIYKEIQTKLSDTATNFIALTGGAGTGKTLLTYHIAKETIQRGK KVLILHCAPLNSGHQILMDEYNWSIYMPKYAPNTIDFDLIIIDEAQRMYPYQFDKYIKEV RTLNKKCIFSYDEKQYLRDNEKQYHTKERIEKELLCTPYKLTDKIRTNKEIAYFIRQLFN IKKNIPNIDYTNIELTYCKDCYSAKLLLQELLERGWKTPNYTPGTRSFFHYEAYLSNDTE SAHSVVGQEFNNVVVVIDESFKYNSQGDLIADNTYYSQRQMLYQIITRTRKKLHIVIINN EVMLNRCIDILSK >gi|226332013|gb|ACIB01000043.1| GENE 92 104443 - 104622 140 59 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MQTNFRLPKIIYKIFGGNTREKSQAAIEQRSMTSPVFFYYTQACKTGFILLSRFVYPKR >gi|226332013|gb|ACIB01000043.1| GENE 93 104828 - 106321 1955 497 aa, chain - ## HITS:1 COG:lin0179_3 KEGG:ns NR:ns ## COG: lin0179_3 COG0516 # Protein_GI_number: 16799256 # Func_class: F Nucleotide transport and metabolism # Function: IMP dehydrogenase/GMP reductase # Organism: Listeria innocua # 230 495 4 269 276 441 75.0 1e-123 MAVYVEEVSRTFGEYLLIPGLTTKQCVPSNVSLRTPLVKHAAGTQAAIELNIPFVSAIMQ SVSGPELAIELARNGGLSFIFGSQPIASQAEMVRKVKKFKAGFVTSDSNLTPEHTLEDVL RLLRQTGHSTIGITDDGSPNGHLLGLVTSRDYRISRDPLDKKIKDFMTPFEKLIVGEVGL TLSEANQIIWDHKLNTLPIIDKEGRLAYFVFRKDYDSHKENPNEVSSPDKKLLVGAGINT RDYQERVPALVEAGVDVLCIDSSDGYSEWQYETLQWIKQQYGDKVLVGAGNVVDKEGFLY LAEAGADFVKVGIGGGSICITREQKGIGRGQATALQDVARARDEYQARTGIYVPICSDGG LVHDYHMVLALAMGADFLMMGRYFARFDESPTKKLCIKNNYVKEYWGEGSNRAQNWQRYD MGGSESLKFEEGVDSYVPYAGKMKDNLAATLSKIKATMCSCGAVTIPDLQQNAKITLVSS TSIVEGGAHDVILKEKG >gi|226332013|gb|ACIB01000043.1| GENE 94 106552 - 106839 368 95 aa, chain - ## HITS:1 COG:YPO1733 KEGG:ns NR:ns ## COG: YPO1733 COG2350 # Protein_GI_number: 16121990 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Yersinia pestis # 1 91 1 91 94 101 52.0 4e-22 MFVLILTYKAPIEKVIELLEAHCCYLDKYYAAGIFLASGPQVPRTGGVILCRAQSRAEVE KIIGEDPFNAVADYRVIEFEPNKSVEGFKELLKIG >gi|226332013|gb|ACIB01000043.1| GENE 95 107017 - 107250 190 77 aa, chain + ## HITS:1 COG:no KEGG:BF3346 NR:ns ## KEGG: BF3346 # Name: not_defined # Def: hypothetical protein # Organism: 
B.fragilis_NCTC9343 # Pathway: not_defined # 1 77 1 77 77 138 97.0 6e-32 MSQEMKAFPGMKASQKLNFNHSEVIINSAPDRKVILESTLLEQSFVEKGCGETTPHAWKT TSKESLKRPVFNEIWYT >gi|226332013|gb|ACIB01000043.1| GENE 96 107275 - 108120 528 281 aa, chain - ## HITS:1 COG:PA0248 KEGG:ns NR:ns ## COG: PA0248 COG2207 # Protein_GI_number: 15595445 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 174 276 183 285 288 65 35.0 9e-11 MEKRNPLTLTLDQPFVAGTDDFSPFYNRLHKLNCAIILYCRAGRGTMAIDLKKYEITVNT QVVLLPGAVISLDEKSDDFRVSFFASHIEMFREACIRFEPSFFHFIKEKPCYTLPSEFTA PINGLLHATSAIYADTDHRFRNQIARNHLQSFLLDVYDKVHRLFTHKEIEGGSRPNELFH KFVALVHEYCCSQRDVVFYAGKLCISTKYLTSICRSLTGHSAKKVIDDFTALEIKVLLQS TDLSIQEIADRLNFPDQSYLGRYFKRHEGVSPMEYRAELAG >gi|226332013|gb|ACIB01000043.1| GENE 97 108271 - 109386 909 371 aa, chain + ## HITS:1 COG:Cj0367c KEGG:ns NR:ns ## COG: Cj0367c COG0845 # Protein_GI_number: 15791734 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Campylobacter jejuni # 44 371 45 367 367 145 31.0 1e-34 MIEKVSKVKQAILFACCLAATGCKQAPQATVESGYKVITLAPTDRTLSSTYSATIRGRQD IEIYPQVSGTLTQVCVSEGERVKRGQSLFIIDQVPYEAALQTALANVEAAKASLATAQLT YDSKQELYKQNVVSAFDLSTAKNSLLAAQAQLAQMKAQEVNARNNLSYTLVKSPADGVVG TLPYRVGTLVSASLPEPLTTVSDNSDMYVYFSMTENQLLGLIRRYGSKEEALKQMPEIGL QLNDRSDYPQQGRIETISGVIDRNTGTVSLRAVFPNREGLLHSGGAGNVIVPTEKAGALV IPQAATFEVQDKVFAYKVVDGKAQSAPVQVTRVNGGQEYIVESGLQPGDVIVAEGVGLLR EGTEIKTINGE >gi|226332013|gb|ACIB01000043.1| GENE 98 109469 - 112675 2670 1068 aa, chain + ## HITS:1 COG:BMEI1629 KEGG:ns NR:ns ## COG: BMEI1629 COG0841 # Protein_GI_number: 17987912 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Brucella melitensis # 4 1025 3 1022 1051 760 41.0 0 MNLRTFIERPVLSAVISITIVVVGIIGLFTLPVEQYPDIAPPTIMVSTSYFGASAETLQK SVIAPLEEAINGVEDMTYMTSSATNAGTVSITVYFKQGTDPDMAAVNVQNRVSKATGQLP SEVNQVGVTTSKRQTSILQMFSLHSPDDSYDEAFLANYISINLKPEILRISGVGDMMIMG 
GDYSLRIWMKPDVMAQYRLIPSDVSTVLAEQNIESATGSFGENSDETYQYTMKYKGRRIT PEEFGEIVIRSTDDGQVLKLKDIATIELGQESYAYSGTTDGHNGVSCMLFQTAGSNATEV NNQINRFLEEARKELPQGVELTQLMSSNDFLYASIHEVVKTLLEAILLVILVVYVFLQDI RSTLIPLVGIIVSLIGTFAFMAIAGFSINLITLFALVLVIGTVVDDAIVVVEAVQARFDV GYKSSYMASIDAMKGISNAVITSSLVFMAVFIPVSFMSGTSGTFYTQFGLTMAVAVGISA VNALTLSPALCALLLKPYINEDGTQKQNFAARFRKAFNAAFDVVVEKYKGIVLFFIKRRW LTGSLLIASIALLVVLMNTTKTSLVPDEDQGVVFVNVSTAAGSSLRTTDDVMKRIEERME QIPQVKHVQKVAGYGLLAGQGSSFGMLILKLKPWDERPGKEDDVQAVIGQVYGRTADIKD ASVFAISPGMIPGYGMGNALELHMQDKTGGDVNEFFQTTQQYLGALNQRPEIAMAYSTFD VRYPQWLVEVDPSKCKRAGITPDQVLSTLSGYYGGQYVSNFNRFSKVYKVMIQSDPQYRL DEASLNNTFVRMSNGEMAPLSQFLTLTRTYGAESLSRFNMYNSIAVNAMPADGYSTGDAI RAVQETASTSLPKGYGYDYGGITREENQQSGTTAIIFGICFLMIYLILSALYESFLIPFA VLLSVPCGLMGSFLFARMFGLENNIYLQTGLIMLIGLLAKTAILLTEYAAERRKAGMGLI ASALSAAKARLRPILMTALTMIFGLLPLMVASGVGANGNRSLGTGAVGGMVIGTLALLFI VPSLFIAFQWLQERIRPVQIEPSHDWQIQTEQEVSEHEKEEAKNRPLK >gi|226332013|gb|ACIB01000043.1| GENE 99 112686 - 114113 398 475 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157165073|ref|YP_001466086.1| 30S ribosomal protein S12 [Campylobacter concisus 13826] # 42 465 21 455 460 157 27 3e-37 MKKLIISLAAILALSSCGIYSKYKPVTEIPDGLYGHGGTDTPGLTSADPEIQRSDTAANF GNLSWREVFTDPYLRVLIDSALVRNTDLRTAHLRVKEAEATLLSARLSYLPAFSLSPQGT ASSFDGGKATQTYSLPVSASWEIDIFGRLTNAKRRAQAVVAQSRDYEQAVQTQLIAAVAN NYFTLLMLDAQIEISTATEAAWKESVATTRAMKAAGMVTEAALSQTEATYYNICTTLLDL QEQLNQAENALSLLLADVPHRIPRGRLADQQLPENLSVGVPLQILSNRPDVRSAEQSLAQ AFYTTNAARSAFYPSITLSGSAGWTNSAGAMIVNPGKFLATAVASLTQPLFNRGANIAQL RIAKAQQEEARLSFEQTLLNAGSEVNNALVQYQTARDKSAYFNRQVASLENAARSTQLLM KHGNTTYLEVLTAQQTLLNAQLSQVANRFTEIQGVITLYQALGGGRETASGERNS >gi|226332013|gb|ACIB01000043.1| GENE 100 114262 - 115542 1232 426 aa, chain - ## HITS:1 COG:PM0738 KEGG:ns NR:ns ## COG: PM0738 COG2873 # Protein_GI_number: 15602603 # Func_class: E Amino acid transport and metabolism # Function: O-acetylhomoserine sulfhydrylase # Organism: Pasteurella multocida # 9 426 5 420 422 517 58.0 1e-146 
MAKQFKPETLCVQAGWTPKKGEPRVLPIYQSTTFKYETSEQMARLFDLEDSGYFYTRLQN PTNDAVAAKIAALEGGVGAMLTSSGQAANFYAIFNICQAGDHFVCSSAIYGGTFNLFGVT MKKLGIDVTFVSPDAGEEKISAAFRPNTKALFGETISNPSLEVLDIEKFARIAHSHGVPL IVDNTFPTPINCRPFEWGADIVVHSTTKYMDGHATSVGGCIVDSGNFDWEAHADKFPGLC TPDESYHGLTYTKAFGKGAYMTKATAQLMRDLGSIQSPQNAFLLNLGLETLHLRMPQHCG NAQKVAEYLAQNDKVAWVNYCGLPGNKYYELAQKYMPNGSCGVISFGLKGGRELSIKFMD SLKLAAIVTHVADARTCVLHPASHTHRQLTDEQLIEAGVRPDLIRLSVGIENADDIIADI EQALNA >gi|226332013|gb|ACIB01000043.1| GENE 101 115712 - 116752 714 346 aa, chain + ## HITS:1 COG:no KEGG:BF3546 NR:ns ## KEGG: BF3546 # Name: not_defined # Def: putative N-acetylmuramoyl-L-alanine amidase # Organism: B.fragilis # Pathway: not_defined # 1 346 1 346 346 657 99.0 0 MKRKLCLLLFFCLLLIAPALRAQQKATPRSGEGISTFLLRHNRAPKKYYNDFIELNKAKL GKSRTLKMGVTYLIPPVKKASAATSGKTTEAATEKTSAHHPRRTEVNEPLFGKWLSNVKV TSNRLAGTCFYVVSGHGGPDPGAIGRIGKHELHEDEYAYDIALRLARNLMQEGAEVRIII QDAKDGIRDEAYLSNSKRETCMGSPIPLNQVQRLQQRCDKINALYRKDRKKYKYCRAIFI HVDSRSKGTQTDVFFYHSNRKAESKRLAKNMKETFESKYDKHQPNRGFSGTVSGRNLYVL AHTTPASVFVELGNIQNTFDQRRLVIPSNRQALAKWLMEGFIKDYK >gi|226332013|gb|ACIB01000043.1| GENE 102 116880 - 117836 833 318 aa, chain + ## HITS:1 COG:slr1189 KEGG:ns NR:ns ## COG: slr1189 COG2040 # Protein_GI_number: 16332297 # Func_class: E Amino acid transport and metabolism # Function: Homocysteine/selenocysteine methylase (S-methylmethionine-dependent) # Organism: Synechocystis # 39 309 72 344 351 116 29.0 6e-26 MEQLSFIESFRTSPFILTEGAIVERLRHEFHISPDKHIAHAALIYDDSHREILASIYRQY LQIATEFRLPLMLMTPTRRANIEQIAASDYRHKNVLADTMAFLSRFRDEASTPVYIGGLA GCRGNAYDGRYYLSVEEAMEFHFPTVRTLAQSGADYLFAGIMPQLTEAIGMANAMAATGL PYIISFMVCRDGRLIDGTFIHDAIDAIEKETSTRPLCYMANCVHPDVLHQALLHPRNDTP LVRQRFQGIQANAANLSPEELDGCDHLISSSPEELADRLMTLLWDFPLKICGGCCGTNQQ HMHRFAEMLAYRRDNKAW >gi|226332013|gb|ACIB01000043.1| GENE 103 117884 - 118837 548 317 aa, chain - ## HITS:1 COG:no KEGG:BF3354 NR:ns ## KEGG: BF3354 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 305 19 323 335 
542 99.0 1e-153 MVMLMVGIAEWTGEKEIIFPEMAALAVGLWVIDKRVWKVGCWQLIGLMTAGAVAGVCIVR YSTLPLLCNLCLAFAFAACCLLFSRATLIPLISACMLPVLLHTETWIYPSAVFLLSAVLV AGQRLMEKGSLRRETDYVLPGREWKKEIFRWAALLFWVSLVAALSISCGCSYFIIPPLIV TFTEIVNSKAGFRNRPMQVFLFLVTGAALGTAFQIIGHTFLHLPETVVALLIICCLFAVF EWTGKYFAPAGALALIPLIVPQEGVHWLPLQAAAGAALFITIGMLVFQQCYKWSKAQLIF CFTPTLLRRYLNRRRKE >gi|226332013|gb|ACIB01000043.1| GENE 104 119036 - 119299 258 87 aa, chain + ## HITS:1 COG:no KEGG:BF3549 NR:ns ## KEGG: BF3549 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 87 1 87 87 158 98.0 5e-38 MTMSDFNPRPSDKLTASERKELDTSEFGIPQLREFPIHDAAHVRAAEAYFRYAPEEYKAQ LARNILAKAYLLGVNVKSPTILEWAEK >gi|226332013|gb|ACIB01000043.1| GENE 105 119401 - 119676 354 91 aa, chain + ## HITS:1 COG:no KEGG:BF3550 NR:ns ## KEGG: BF3550 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 91 1 91 91 151 100.0 5e-36 MSFSGFVFDMIRRNKENRDLLTLRRERMKDLQGKMYRKGTLQNPNVTLEELEKIEKATRE KEKAETQYYLRATLIFLTVTAILALILWWIL >gi|226332013|gb|ACIB01000043.1| GENE 106 119971 - 120405 457 144 aa, chain + ## HITS:1 COG:no KEGG:BF3551 NR:ns ## KEGG: BF3551 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 144 1 144 144 283 99.0 1e-75 MTPDFDYSEVPFGFNYCLNKSCKQASTCLRYRLYTHIPAACQTIRIINPVYAATIDDVCP FFMPDKKVRYALGITHLLDHVPYSDAVSIKRQMLAHFKQATYYRCRRKERMLDPSEQEYI RKLFVSKGIKELPVYDEYIEKYDW >gi|226332013|gb|ACIB01000043.1| GENE 107 120793 - 120966 68 57 aa, chain + ## HITS:1 COG:no KEGG:BF3553 NR:ns ## KEGG: BF3553 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 57 1 57 57 92 100.0 4e-18 MANHHLNEEKPSTKVSQSLLLGDNFLSPERKKFFTYTTQSETIEKGINTSPSTGFFH >gi|226332013|gb|ACIB01000043.1| GENE 108 121207 - 123519 1173 770 aa, chain + ## HITS:1 COG:no KEGG:BF3555 NR:ns ## KEGG: BF3555 # Name: not_defined # Def: putative ABC-transporter permease protein # Organism: B.fragilis # Pathway: not_defined # 1 770 1 770 
770 1441 99.0 0 MIKHYLKVALRNLMNFKVHSLISAICLAIGITCFSMMNYFIDAITGKVELSDNNKYSIRL SGASSQTAADIYLFKEDFDYLKELPIAGIDTLVASSSYSNGKEITAIDKKQRELPFLVSF QNVSSNYFTYNSLQLKYGNQEITAPDEVIVSRSFARKAFGEENPIGQVIRQETEAANPSD LMVYKIVNVALTEEKDFHGKTIDCYFPLSAKSRTPLCIRSRLTGQTTTESLNKQLKGLTW KHGDQDIYLYASLESEQNSSVQRTISILLARFIASLILLSGLINFLKFIIQMFYNRQREL VLRKCIGSDIKGLFALLFAEIFWMLSVAFLLSLAVTEITLSLVYTYIRPEDMISFSLVDL YGSQLGLYLALLLICMLAILYPIYRLRRLSVLHSVVQRQKRHVFRNFMIALQLAISILFT GGVFGITLLFNEMFEGMYRPLSTEEENRVISISVNTICMQKNMDAILSDIQSLSEITDRT SAFNTFDADVYTYMTYMKDGKPRGNVMMIQAEPHYFEFFKIPFSGKLVDKDAQGFVYISE QFKEQLQRDSIAGSVTLDGKEYRIAGTYRALNREDTQSSSVGSVFLVNPQAYTYYFKTLH SDITPVALEKITEICRRYVPETLPLNIRNTGDSKQSVMGTVALLQTASLLLAIVSILLLI LSIYSGISMDVINRQKEVAIRKINGATPRVIALLFGKIYLIIYLSVFVIIYPLVRLVLIS ITQKSNLQSIYSWSWGVLLFFTMALLIFLVTAYKIYKVMHLNPASVLKKE >gi|226332013|gb|ACIB01000043.1| GENE 109 123594 - 123800 68 68 aa, chain - ## HITS:1 COG:no KEGG:BF3556 NR:ns ## KEGG: BF3556 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 7 68 1 62 62 105 98.0 3e-22 MRLHKRMALKSRRCTSTALGRINTLTWLLCIAQERRGFHNSPHRIAQIYTKRFFYLLITN TAYLYTSV >gi|226332013|gb|ACIB01000043.1| GENE 110 123809 - 124495 639 228 aa, chain - ## HITS:1 COG:mlr6523 KEGG:ns NR:ns ## COG: mlr6523 COG1011 # Protein_GI_number: 13475450 # Func_class: R General function prediction only # Function: Predicted hydrolase (HAD superfamily) # Organism: Mesorhizobium loti # 3 227 6 232 238 195 44.0 5e-50 MQQIKVIAFDADDTLWDNQVFYDKVESEFCHLLAGYGTAEEISSRLFAIEMENMDIYKYG AKPFTLSMVEAAVKISRNRVPAEVIGRIVEMGKELLEMPIRLLGGVTEALETLKDDYKLV VATKGDLLDQERKLQRSGISHYFDHTEIMTDKAPQDYQRLISSLDVAPQSFLMVGNSLKS DVLPVLSLGGHAIHVPAEAMWKHEVISGTEREYLMIHSLAELPAVLRG >gi|226332013|gb|ACIB01000043.1| GENE 111 124684 - 125556 589 290 aa, chain + ## HITS:1 COG:no KEGG:BF3558 NR:ns ## KEGG: BF3558 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 290 1 290 290 433 99.0 1e-120 
MKKLMFAVIACAMGLSGCSSEDDCATNPEPAGDVTEIKLSAGIQGVATRSPVSTNDNITA PFVASATTGDYTTNAWSSTVTFVASPTPTAALSFSPARYYPVDNSPIYIRGYYPAGTLSG KTVTLAGDGTEDVMLTAQASGTKATAGALSFVFNHLLTQLQFKLVAGAGYPASGVNVTSL VIKQQKTPATLDLNTSSLTYTTKDLTLSGTFPILTAGSTINNYPMVKSGEALTVAVTTSD GVTYPESTISVTTEVGKSHLITLTFTPKEITSSISVTAWQTGGTGSSTLQ >gi|226332013|gb|ACIB01000043.1| GENE 112 125639 - 126463 694 274 aa, chain + ## HITS:1 COG:no KEGG:BF3363 NR:ns ## KEGG: BF3363 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 274 21 294 294 459 99.0 1e-128 MKRTILILLGWMLLSQGCSYGVGDDCLPPEEPVPIRMAIAEGKLSTRAPVTTIDASNLSN VGIYAVSEGSIAGQYPWTSTPFALNLVPSGISGSQLSFNPKLYYPLGGKRVIFYSYYPRT TATSGSNYITAPGNGVAPAYHFTLTGAEDIMYAAGTPTGSTSTMPVSLTFNHVLTQLQLN TSLLGALSSIKLLGVYNKGTLDIGNGNVTYDSSTTDITLTVPLLGSVTNTVMVPAGVASY KVEVVLLLSLLKRTYLVKPTSGNFQPGVIYTISL >gi|226332013|gb|ACIB01000043.1| GENE 113 126464 - 127252 639 262 aa, chain - ## HITS:1 COG:no KEGG:BF3560 NR:ns ## KEGG: BF3560 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 262 1 262 262 521 100.0 1e-147 MAKKHRKESLDALFREKVLLEMLKIAGVGMWTYSTETGRLRYIGFADTIHRHDELQYVDP ADYLKLMNGEDLQTVTQFMADIHAGRPPQEGIRYKIHGNGEDFYLENHVVTCQPLGNGFY YLKGYVKNITAQVMEANRLCIEKERTEQADRLRSAFFAHVSRELRGPLNRIEELASQMVH AESVEEREECFHRIQENRRMAMKIADALHAVSQLPVSTYEPYHSPMQLCLIYRDLLMENG YSHNPEDGEQEENANFLSLLFS >gi|226332013|gb|ACIB01000043.1| GENE 114 127599 - 129245 1372 548 aa, chain - ## HITS:1 COG:RSp0927 KEGG:ns NR:ns ## COG: RSp0927 COG0845 # Protein_GI_number: 17549148 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Ralstonia solanacearum # 66 430 65 420 513 154 31.0 4e-37 MKKISDILRKKQVLYPLIALAGFVLGWLLFSPSSSPESAGGTHAEAHNHDMHGTSHDLVQ DESGVWTCSMHPQIRQDKPGKCPICGMDLIPLKKNVISGGDAVSDPDAIRLSDEAMALAD VQTTRVSRSNPVKQVRLYGKIIPDERSLQSQTAYVGGRIERLDIEFTGETVRAGQTLATL YSPELFTAQQELLEAVRMQQPALVQAAREKLRLWNLTDAQIDAIQYSGQASPMVEIKSNT 
NGIVIAKRVNRGDYVSQGSILFDIANLSRVWAMFDAFEVDLPFLAKGDRVEFTLSAFPGK TYSGRISFIDPILNATTRTARVRVDVANPTLEMKPEMYATAQVAAPLKGYKDRIVVPQTA VLWTGKRAVVYVRLPDTDTPTFRMREVTLGPALGGAYVVLDGLSDGEEIVTNGVFSIDAS AQLEGKRSMMNEDTPGTAPMTGHQGHSMSGMSGSHAVSQESEHVLFAVRGSCDMCKERIE TAAKGVSGVRSALWDREKQMIHLQLDPSETSADAVAKAIAAAGHDTDKYKAVKAVYDALP GCCKYRDE >gi|226332013|gb|ACIB01000043.1| GENE 115 129277 - 130782 1322 501 aa, chain - ## HITS:1 COG:no KEGG:BF3562 NR:ns ## KEGG: BF3562 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 501 1 501 501 805 100.0 0 MINKKHISLLTLAIFLSLSHFAVSASAQPSVVEVPSVSVQPSVDSLSHYLEQAARSNPQV NADFMLYKASLEKIPQAGAFADPELEIGFFVKPMETLMGKQIADFTLMQMFPWFGTRKAA RSEASEMARMAYEQFRDTRNNLWYEVKAQWYQLSSLNEQYHITEANIRLLYQLEQLALNR FSASSAQAAPSSTSVTSVASMPSALSSGSGMSGMGGMGSPAPAGTPGVSAGSSSGRGMSG MGGGSMGNVAVGGMSDVLRIQMERAGLEDNLASLLSARLTVQARFNALLNRPSDASVCVP DSLTQRFYRIDGQLLLDSIFTRNPMLAMLEAEGEAYRAKAKMDRRMSYPMIGIGLQYSVV NKVADPMGMPDMNGKDMVMPMVKVSLPLFRRKYNARQRESHNYRMASELKRDNVQNQLQA EYINVRQQLDDAARKVSLYERQYALSLSTWQLMVREFTAGRQSLTDVIQVERQMLDYKLK KSEAVAAYNTTVAAIEKLIAD >gi|226332013|gb|ACIB01000043.1| GENE 116 130962 - 134720 3263 1252 aa, chain - ## HITS:1 COG:aq_1122 KEGG:ns NR:ns ## COG: aq_1122 COG3696 # Protein_GI_number: 15606386 # Func_class: P Inorganic ion transport and metabolism # Function: Putative silver efflux pump # Organism: Aquifex aeolicus # 46 532 34 494 1050 363 39.0 1e-99 MINKIIRYFLENRVITILLLTLVVVWGISTAPFNWHGGIVPRNPIPVDAIPDIGDNQQIV ATEWMGRSPKDIQDQITYPLTTSLLGIPGVKTIRSSSMFGMSFIYIIFEDDIEFYWSRSR ILEKLNSLPPGTLPENVQPTLGPDATALGQIYWYTLEGRDPKTGKPAGGWNAEELRTIQD FYVKYSLSAAEGVSEVASAGGFIKEYQIELNPDAMYSFNVSVMDVMNAVKKSNLDIGAET MEVNKVEYLIRGLGYVKNVADIENTVVTVREGVPVRISDIAFVNIGPGTRRGGLDKEGVE AVGGVVIARYGSNPLEVINNVKEKIHEMDAGMPQKTLADGTVSKVTVVPFYDRTGLIKET IGTLETSLSHEILICIIVIIVLVLNLRASVVIASMLPIAVLATFLLMRYTGIEANIVALS GIAIAIGVMVDVGVVFVESIIRYMEMPENKGITRGKPFVGLIYKAVSEVSGAIATAMITT IVSFLPVFAMQAQEGKMFSPLAYTKTYALASAFVLGLILLPTLAYILFSVRIDSRLIRRV 
MNCVLIAAGIVLFAVYDSVPALGLTAVGINNLLAYRWKNPKTGNYVNIGIALLVAVFYLS EEWLPMGPQRGLSVNVLFVAGCVAIILALLWILVIFYERILRWCLANRWKFMMIPAATVV CGFLIWRGIGQEFMPSLNEGSFLLMPTSMPHTGIEQNLDYVEKLDKRLAAIPEVETAIGK WGRVNSALDPAPVQMFENTINYRPEYIIGEDGKRARFRVNYDGAFLLKGGGTYNPANGFR LIPADSLVPDSRGDYFRQWRPEIKNANDIWQQIVNVTHLPGLTSAPKLQPIEARLVMLST GMRAPMGVKVYGPTLEDIEQGGKAIEQALKSVPSVIPSSVFYDRAVGAPYLEIKLNRESM ARYGVAVGDLQEVLSAAVGGMALTRTVEGRERFPIRLRYARELRDSPEALSMLLVPTATG IQVPLKELADIEYSRGAQMIQSENTFLVGYVIFDKLSGRAEVDVVKEASNLLEAKVKSGE LVLPKGVSYKFAGNYEQQQRATDRLMIVVPLALLIVLLVLYFQFRTVTASLIHFSGVFVA FAGGFILLWLYGQPWFMNFSIAGENMRDLFQMHPINLSVAVWVGFIALFGVATDDGVLMG TYIHHVFLERDPRTKYDIREAVVEAGLKRVRPAAMTTATTLIALLPVLTSTGKGADIMVP MAIPTFGGMLIQSMTMFVVPVLQCWWRETVERRREKKNGSEVSEPVTGSPGI >gi|226332013|gb|ACIB01000043.1| GENE 117 134912 - 135217 185 101 aa, chain - ## HITS:1 COG:no KEGG:BF3369 NR:ns ## KEGG: BF3369 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 101 30 130 130 207 100.0 1e-52 MGRLADVEWGSASVCASCGEKKMTSHCCKDEAHYVKLAVDQDVNHVPVTNLLPAVTELLP VMYSAFIPLEAESLRRSVASFNFPPWQTDIPLFVHHCTYLI >gi|226332013|gb|ACIB01000043.1| GENE 118 135435 - 136412 1065 325 aa, chain + ## HITS:1 COG:aq_1420 KEGG:ns NR:ns ## COG: aq_1420 COG0741 # Protein_GI_number: 15606599 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) # Organism: Aquifex aeolicus # 69 299 58 280 299 114 32.0 2e-25 MKHISINYILTTALALGIGISIPVLIGSTCLSEQHSVQSEVPYCVTPPTVPEQAVFDGDT IDLRRYDRRERMDRELMSFTYMHSSTMQMIKRANRYFPVIEPLLKANGIPDDFKYLMVIE SNLNPIARSPAGAAGLWQFMPVTAREFGLEVNDNVDERYHIEKATAAACRYFKQAYAKYG DWMAVSASYNAGQGRISSQLDKQLADHAMDLWLTEETSRYMFRLLAVKEVFGNPQRFGFL LKREHLYPAIPYKEVAVDTEISDLSNFAQKQGITYAQLRDANPWLRGSSLKNKTGKKYVL HIPTQEGMNYDPRKTVPHEHKWVID >gi|226332013|gb|ACIB01000043.1| GENE 119 136457 - 137356 519 299 aa, chain + ## HITS:1 COG:MA2034 KEGG:ns NR:ns ## COG: MA2034 COG1226 # Protein_GI_number: 20090882 # 
Func_class: P Inorganic ion transport and metabolism # Function: Kef-type K+ transport systems, predicted NAD-binding component # Organism: Methanosarcina acetivorans str.C2A # 17 279 19 279 279 265 49.0 1e-70 MKSDSWLHRFLHNQPLKHKLYVIIFESDTPAGKAFDVTLIICILLSILLAIIESLQGLPS WLSTPFIVLEYLFTVFFTFEYVTRIYCSPNPRKYIFSFFGIVDLLATLPLYLAFFLPGAR YLLIIRAFRIIRVFRIFKLFNFWLEGERLLTSLRESSKKIAVFFLFVVILVVAIGTLMYM IEGTQPNTQFNNIPNSIYWAIVTMTTVGYGDITPATALGKFLSACVMLIGYTIIAVPTGI VSASMMKEYKKLKDLQCPNCHKTGHEENATYCKYCGHKLKNDEIYRQENTATDPDRTTS >gi|226332013|gb|ACIB01000043.1| GENE 120 137346 - 143219 3733 1957 aa, chain - ## HITS:1 COG:MA3490 KEGG:ns NR:ns ## COG: MA3490 COG1112 # Protein_GI_number: 20092301 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases and helicase subunits # Organism: Methanosarcina acetivorans str.C2A # 381 1824 9 1465 1939 538 29.0 1e-152 MSQENPSSQLDVEFIYLPVINYSMQQNRIPVVRLLSIKNNTEHPLADLKVFLTLEPEFAS VSPVMVEKLASGEIITITGLNLMLDPSFFIQQTERLSGTIVLVVSDEENVFFQEKYPVDI LAFDQWGGIQVLPELLSAFVVPNHPVLTGVLSRASSILKEWSGNSSLDAYQSCNPNRVKL QLAALYEAIKEQHIAYCTPPSSFGDAGQRVRLSDNVLSGKLGTCLDLSLLYASCAEAMGL HPLLVIIQGHAFVGCWLIDGTFPDAVNDDPSLLTKRTADGINEVILLEATCMTDGNNVTF DTAVGLANDKMLAVNDFTCFIDVARSRFAHILPLPQRVMHGKAWTVGPEVAQIPKSGLYI SPVSAPEEIKQYDLDNQDSYVEFTKQLLWERKLLDLSLRNNFLNLRITRNALQVISADID KMEDAFSDGTEFQILGKPSDWDNPLYDFGLYGTLTESDPMIDLIKQELTQKRLRTYLTEQ DLKKSLTYLYRSSRIALEENGANTLYLALGLLRWYETEHSERPRYAPILLLPVEMIRKSV SKGYIIRAREEESMLNITLLEMLRQNFGITISGLDSLPKDENGTDVKRIFSIFRKAVMNE KRWDVEEQAILGTFSFSKFIMWNDIHSNAEELSKNKIVGSLMSGKMEWEVAEVDANAVEL DHALTPADIALPVSADSSQLEAVYEAVNEKSFILHGPPGTGKSQTITNIIANALYQGKRV LFVAEKMAALSVVQKRLMNIGLAPFCLELHSNKARKTDVLSQLKESTEIFRYKEPEEFKE ESERLFKMRQQINGYVEALHRIYPCGISVYEAITRYSSIDETEEIMIPASLLASLTKEQF NEWNHAVEELIGVGKVSGHSHQHPLTGINIMEYSSQLKEEADKLLKDYMILLQKMEEKMN RCFSNYGIGNKCTEKLLDNFVRFIRILMQLPGMTGNLMLLTDLDENVDKIGRIIEHGRKR DEFCNLLKQSFEDTFLTLPVQQKISEWKDITQSWFLPRLLKQRKFCKELSLFSLQGRVNK EQVLPALQQLLFYQQQKQEVDSSSRWFEDLFGNKSHPGEEQWDDIEVMSKAILQLNRLLV 
EVVDDPMSIRRVKEKLAEQLSGGYSLFRQMNRELLEGLVYDWECIKQLEKSMLQLLGVSK EVLHACEDDWLAGASRQCEIWARNTDKLKDWYRWLNVSRRFEACGLTSVFAAYTERNLPA ERMMNIYLKGFYHSFIEYAIGKEQVLQFFNGELFNDSIRRFRELNTEYQELIKKELYTKL ASNIPSFVLAAAQSSEVGILQKCIRSNGRGMSIRKLFDSISNLLSRMCPCMLMSPMSVAQ YINVNHDKFDLVVFDEASQMPTCEAVGAIARGNHVVVVGDPKQMPPTNFFTSNSVDEEHI EIEDLESILDDCLALSMPSKYLLWHYRSKHESLIAFSNSQYYDNKLLTFPSPDDLAAKVT LVPIEGYYDKGKSRQNQAEAQAVVDEIVRRLSDPELRNQSIGVVTFSSVQQTLIEDMLSD VVLQNAELENLAFNREEPVFIKNLENVQGDERDVILFSVGYGPDVNGRVSLNFGPLNRDG GERRLNVAVSRARYEMKVFSTLRADQINLKKTSSIGVAGLKYFLEYAGKGTGVLNHSYAA VSEEVEMGELIAVALREKGHQVKTKIGCSGFKVDVGIIDPNDTSRYMLGILCDGENYRAA KTVRDREIVQDSVLKMLGWNICKVWTLDWWEDSQKVIDHIEAELKRAECGKSQRSVPVIS LASGSGEVQKEGLLCHAQHCLVKLPEQVIREEVKTASPEIYEKTLLNSVNVSAWELMMPR REPRIRKQLNEIMRTEAPISRSLLSSRIFNAYGILRKTARLIEWMDGILDKTPYYKQEID GLVFYWNTKEEADSYTGFRIDSKREAVDLPPREVANAARNILEQQVALPLVDLMRVTAQL LGYARFGLNVETAMRRGVQILLDGEEVKIEGDKILMK >gi|226332013|gb|ACIB01000043.1| GENE 121 143338 - 143709 358 123 aa, chain - ## HITS:1 COG:no KEGG:BF3568 NR:ns ## KEGG: BF3568 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 123 1 123 123 181 100.0 7e-45 MNKSFFYAICFVVILVVEIIIGIYVRDNFVRPYMGDALVVVLIYCFIRIFIPNGLSQLPL YVLAFACFIEILQYFQLVDVLGISNRILRIALGSTFDLKDMVSYAGGYVFILLAEYFLDK KRK >gi|226332013|gb|ACIB01000043.1| GENE 122 143934 - 144602 512 222 aa, chain + ## HITS:1 COG:lin2751 KEGG:ns NR:ns ## COG: lin2751 COG1285 # Protein_GI_number: 16801812 # Func_class: S Function unknown # Function: Uncharacterized membrane protein # Organism: Listeria innocua # 6 222 4 220 220 246 61.0 2e-65 MMLNFDFVLRLLVAGILGAIIGLDREYRAKEAGYRTHFLVSLGSALIMIVSQYGFQEIIK ENSVTLDPSRVAAQVVSGIGFIGAGTIIFQKQIVRGLTTAAGIWATAGIGLAVGAGMYVI GIAAMVLTLIGLEVLSYLFKSIGMKSSMITFSTDNKQVLKGVADRFNSKDYLIVSYQMDT QKHGSIETYQVTMIIKSKRNNDEGHLLSLIQEFPEVTVERIE >gi|226332013|gb|ACIB01000043.1| GENE 123 144605 - 145420 600 271 aa, chain - ## HITS:1 COG:CC2333 KEGG:ns NR:ns ## COG: CC2333 COG1573 # Protein_GI_number: 16126572 # Func_class: L 
Replication, recombination and repair # Function: Uracil-DNA glycosylase # Organism: Caulobacter vibrioides # 101 268 78 231 479 70 29.0 3e-12 MKISYHLFLTIEIVMILFIYDKTLDGLLTALFDAYNRKTFPDVLLSKGDTLPLFYDDIFT VITDEEKAGRVWRGLQKKISASALSAITWCWLSELPEVGMLLFRYIRKAIDSPVSIETNF GDPDVLTLSKIWKRVDWERLRMLQFVRFQKAVDGTFFAAFEPQHNALPLTVGHFKDRFAD QRWLIYDMKRRYGFYYDLHTVEEVTFDDDGQAAHLITGMLDESLMDKDEKLFQQLWKTYF KSITIKERLNPRKHKQDMPVRYWKYITEKQK >gi|226332013|gb|ACIB01000043.1| GENE 124 145386 - 146651 752 421 aa, chain - ## HITS:1 COG:CAC3343 KEGG:ns NR:ns ## COG: CAC3343 COG4277 # Protein_GI_number: 15896586 # Func_class: R General function prediction only # Function: Predicted DNA-binding protein with the Helix-hairpin-helix motif # Organism: Clostridium acetobutylicum # 4 420 2 424 440 445 51.0 1e-125 MNANVLEKLKILAESAKYDVSCASSGTVRANKPGTLGNTVGGWGICHSFAEDGRCISLLK VMLTNYCIYDCAYCINRRSNDLPRATLSVSELVDLTIEFYRRNYIEGLFLSSGVVRNPDY TMERLVRVAKDLREVHRFNGYIHLKSIPGASRELVNEAGRYADRLSVNVEIPKEENLKLL APEKDHKSVFAPMLYIQQGVLESSEERKKFRYAPRFAPAGQSTQMIVGATAESDKDILFL SSALYQRPTMKRVYYSGYVSVNTYDTRLPALKQPPLVRENRLYQADWLMRFYQFKVNEIV DDTYPDLDLEIDPKLSWALRHPEQFPVDINKADYEMILRIPGIGVKSAKLIIASRRYSKL GYYQLKKIGVVMKKAQYFITCSELPMRTVNEMTPQTVRSLLLPKSAKKKTDENQLSFVFN D >gi|226332013|gb|ACIB01000043.1| GENE 125 146648 - 147547 656 299 aa, chain - ## HITS:1 COG:no KEGG:BF3377 NR:ns ## KEGG: BF3377 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 299 1 299 299 588 99.0 1e-167 MAKQQFNKLIWLVDTIFRNAPVTFAEINRRWRAGDDNRSDIPLRTFHNHRQAIEEIFDIN IECDKRNNTYYIADAEGMLGDKLKMWLLNSFSLNNVLQENVDMKNRIVFEEVPSGVQFMD LVIDAMRSGRVLAMEYQAYNWEHSKDVLLEPYFVKLFRHRWYLIGINREYKAFRSYSLDR IKRIALSEETFNFPRKFTPQDHFRDSFGIIRDENLLPQHTVLRTTISQAPYLRNLPLHSS QKEIAVTESYVDFELYISHTYDFIQELLSKGAAVEVLKPQSLRDKIKAEIQNMQTLYRL >gi|226332013|gb|ACIB01000043.1| GENE 126 147719 - 148804 727 361 aa, chain + ## HITS:1 COG:CAC1550 KEGG:ns NR:ns ## COG: CAC1550 COG0229 # Protein_GI_number: 15894828 # Func_class: O Posttranslational 
modification, protein turnover, chaperones # Function: Conserved domain frequently associated with peptide methionine sulfoxide reductase # Organism: Clostridium acetobutylicum # 217 361 2 144 146 208 71.0 1e-53 MKKKFTIFVVLLAGIIYTGHAQGVQWIYQQKQEKKNMKNLSEIYFAGGCFWGTDHFLKQI RGVKSTQVGYANGNIANPTYQQVCTGKTNFAETVKVEYNPQEVPLKLLIDLFFKTIDPTS LNRQGNDKGSQYRTGIYYTDEVDLPTIRTAIDELAKEYSKPIVIEVKPLSNFYKAEEYHQ DYLDKNPGGYCHINPALFELAKKANAQAEQPQTNYKKPDDATLRSKLTPEQYAVTQKNAT EPAFHNEYWDEKRDGIYVDITTGEPLFISTDKFDSGCGWPSFTKPIEKEVIKEKMDTSHG MIRTEVRSKTGDAHLGHVFTDGPKEKGGLRYCINSASLRFIPKEKMKEEGYGEYLKLLPQ K >gi|226332013|gb|ACIB01000043.1| GENE 127 148968 - 149144 313 58 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|167754220|ref|ZP_02426347.1| ## NR: gi|167754220|ref|ZP_02426347.1| hypothetical protein ALIPUT_02513 [Alistipes putredinis DSM 17216] # 1 58 33 90 90 67 75.0 2e-10 MKELVEKIATLVAEFNKDANAQIENGNKAAGTRARKASLEIEKAMKEFRKVSLEESKK >gi|226332013|gb|ACIB01000043.1| GENE 128 149293 - 149934 516 213 aa, chain + ## HITS:1 COG:no KEGG:BF3380 NR:ns ## KEGG: BF3380 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 213 1 213 213 399 99.0 1e-110 MKPLTDTDYIHALIAEGEHQQQDFKFEISDARKIAKTLSAFANTDGGRLLIGVKDNGKIA GVRSDEEKYMIEAAAQLYCRPEVDYSMQTFHVEGRSVLVVQIDESEHKPVFAKDENGKSL AYIRIKDENILATPVHLRVWQQSGSPAGELIRYTEREQLLLDLLAENTSLSLNRYCRQAG ISRRAAEHLLAKFIRFDIVEPIFENHKFYFRLK >gi|226332013|gb|ACIB01000043.1| GENE 129 149970 - 151349 826 459 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 [Haemophilus influenzae 22.4-21] # 1 436 7 445 456 322 42 7e-87 MNLINDILWTYILIAMLLGCAIWFTIKTRFVQFRMIGKMLRLLKDSAGKGEHGEKHISSF QAFAISLASRVGTGNLAGVATAIAIGGPGAIFWMWIIALLGASSAFVESTLAQLYKEKGK ESFIGGPAYYMKKGLKKTWMGILFAILITVTFGFAFNSVQSNTICAAVEHAFGISHVPMG IILTGLTLLIIFGGIQRIAKVSSIIVPVMALGYVGLALVIVLINITHLPDVIGTIVSHAF GWQQALGGGVAAALMQGIKRGLFSNEAGMGSAPNVAATAFVSHPVKQGLIQTLGVFTDTL IICSCTAFIILFSGAPLDGSTNGVQLTQHALNNEIGSIGGIFVAVALFFFAFSSIIGNYY 
YGEANIRYLTHKRWIVYLYRLLVGGMVWVGAMSTLDFVWGLADITMGLMAICNLIAIAFL GKYAFRLLQDYREQKKSGIKSPVFTKNKMKDIEKDIECW >gi|226332013|gb|ACIB01000043.1| GENE 130 151450 - 152088 638 212 aa, chain + ## HITS:1 COG:no KEGG:BF3577 NR:ns ## KEGG: BF3577 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 212 1 212 212 402 100.0 1e-111 MSIFANLFKKQADADAKVVGNVEDFVSLTRVYFQSVIAVNLGITNIRFLPDVANFKRLFK VATQGGKLGLAEKSASRKMLMQDYGISESFFKEIDTSIKKNCRTQNDVQAYLFMYQGFSS DLMMLMGNLMQWKFRMPAIFKKALRSMTEKTVHDVCTKTVWKADDVHKTAAAVRQYKERL GFSEQWMSEYVYNIVLLAKKEPKHKDEEAKAK >gi|226332013|gb|ACIB01000043.1| GENE 131 152222 - 154321 1319 699 aa, chain + ## HITS:1 COG:MJ0634 KEGG:ns NR:ns ## COG: MJ0634 COG1509 # Protein_GI_number: 15668815 # Func_class: E Amino acid transport and metabolism # Function: Lysine 2,3-aminomutase # Organism: Methanococcus jannaschii # 129 668 175 619 620 144 25.0 8e-34 MKQKKMLSLTLSQIRQLYCQELPELTAMAQQSLDDTDFKNHLQEFLTPYISGGNKAGEQI RLLISYDGKTVHELSNEQDMQIQTLSLLRRFLTGNLENAEIPTDLFLDLYYLFKRLEEPE SPLPSPQRIKNRTERWATGLDEDVIELRSENQERMLHLLIQKIENRKSKPSSRFHFEEGM SYEEKYQQVCQWWNDFRFHLSMAIKTPTELNRFLGNSLSSETMYLLSRARKKGMPFFATP YYLSLLNTSGEGYNDEAIRSYILYSPRLVETYGNIRAWEREDIVEAGKPNAAGWLLPDGH NIHRRYPEVAILIPDTMGRACGGLCASCQRMYDFQSERLNFEFDALRPKESWDKKLRRLM SYFEEDTQLRDILITGGDALMSQNKTLKNILEAVYRMAARKRKANQERPEGEKYAELQRV RLGSRLLAYLPMRINDELIEILREFKEKASVIGVKQFIIQTHFQSPLEVTPYTREAIRKI LSAGWLITNQLVYTVAASRRGHTTRLRQVLNSLGVVCYYTFSVKGFNENYAVFTPNSRSM QEQHEEKAFGKLTQEQADELYSLLETGEDIATRIRHFMKKHHLPFMATDRSVLNLPAIGK SMTFNLVGITEEGRRVLRFDHDSTRRHSSIIDKLGKIYIVENKSLAAYLRQLSKMGEDPE DYASIWSYTEGETEPRFKLYEYPEFEFHTTERMSNLEVL >gi|226332013|gb|ACIB01000043.1| GENE 132 154412 - 155737 1101 441 aa, chain - ## HITS:1 COG:VC2712 KEGG:ns NR:ns ## COG: VC2712 COG2233 # Protein_GI_number: 15642706 # Func_class: F Nucleotide transport and metabolism # Function: Xanthine/uracil permeases # Organism: Vibrio cholerae # 2 441 20 466 480 419 51.0 1e-117 MKTDLIYGIEDRPPFKDALFAALQHLLAIFVAIITPPLIIASALKLDVEKTGFLVSMSLF 
ASGVSTFIQCRRFGPIGAKLLCIQGTSFSFIGPIIATGLVGGLPLIFGVCMAAAPIEMII SRTFKYMRNIITPLVSGIVVLLIGLSLIKVGIISCGGGYTAMDNGTFASWENLSIAGAVL LSVLFFNRCKNKYLRMSSIVLGLCLGYGLAFVLGKVDMSALNVEMLMSFNIPQPFKYGLD FNVSSFIAIGLVYMITAIEATGDVTANSMISGLKIEGDDYLKRVSGGVMADGFNSFLAGI FNSFPNSIFAQNNGIIQLTGVASRYVGYYIAAMLILLGLFPIVGAVFSLMPDPVLGGATL LMFGTVAAAGIRIVASQNIGRKETLVLAVSLSLGLGVELMPDVLSQAPEAIRSIFSSGIT TGGLTAIIANIVIRVKEENEE >gi|226332013|gb|ACIB01000043.1| GENE 133 155744 - 156451 792 235 aa, chain - ## HITS:1 COG:no KEGG:BF3580 NR:ns ## KEGG: BF3580 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 235 1 235 235 460 100.0 1e-128 MRRTAIALFCLFLLSVGIGLRAQNIQLHYDFGRSLYDKDLKDRPVLTSTVEKFHPDKWGS TYFFVDMDYTSDGVAAAYWEIARELKFWKNPFSVHVEYNGGLAKGFSYQNAYLGGVTYTY NNTAFSRGFSLSAMYKYIQKHHSPNNFQLTGTWYMNFSNNLLTFSGFADWWREETAYGKT IFLTEPQFWVNLNRIKGISDKFKLSVGSEVELSNNFGGRDGFYVIPTLALKWTIN >gi|226332013|gb|ACIB01000043.1| GENE 134 156558 - 157148 391 196 aa, chain + ## HITS:1 COG:NMB0698 KEGG:ns NR:ns ## COG: NMB0698 COG3663 # Protein_GI_number: 15676596 # Func_class: L Replication, recombination and repair # Function: G:T/U mismatch-specific DNA glycosylase # Organism: Neisseria meningitidis MC58 # 3 188 33 220 229 186 47.0 3e-47 MDIEIENHPLEPFLPANARLLMLGSFPPQKKRWSMDFYYPNLNNDMWRIVGLLFFNNKDY FLNETRKAFCRERIISFLNDKGIALFDTASAIRRLQDNASDKFLEVVQPTDISRLLGHLP ECKAIVTTGQKATDTLRAQFEVEEPKVGDFSEFVFDGRPMRLYRMPSSSRAYPLALDKKA AAYRTMYQDLQMLNIE >gi|226332013|gb|ACIB01000043.1| GENE 135 157151 - 157504 461 117 aa, chain + ## HITS:1 COG:FN0052 KEGG:ns NR:ns ## COG: FN0052 COG1393 # Protein_GI_number: 19703404 # Func_class: P Inorganic ion transport and metabolism # Function: Arsenate reductase and related proteins, glutaredoxin family # Organism: Fusobacterium nucleatum # 4 116 5 117 120 126 62.0 1e-29 MKTLFLQYPACSTCQKAKKWLTENNIEYTNRLIVDDNPTVEELKAWIPLSGLPVKKFFNT SGVVYKELKLSSKLPTMTEEEQIALLATNGKLVKRPLVVTERFVLVGFKPEEWEKLK >gi|226332013|gb|ACIB01000043.1| GENE 136 157693 - 158862 1161 389 aa, chain - ## 
HITS:1 COG:no KEGG:BF3584 NR:ns ## KEGG: BF3584 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 389 1 389 389 727 99.0 0 MKLSDYIRGFRKGKEAHRLEKEAMRDPFLADALEGYSRVESGADEQIELLRRRVNMRVTQ KRRHTIAWSVAASLLIGVCIGSYFLLQENTLPDEARIAMEEVSHLEPLSVQKEEKKEDLV ATVRKDSATIQKGLITGNREKKVVVSPHTEVPQAMTQEWIDEALETTIAEEPLAATTSSA MKSIPANDSSLTAQVAVGGKVHGRVTDSSGYPIVGATVKLKGTNQGTISDVNGNFVLKTG GNRELAVDYIGYESVTLPADTTKSLLIAMNEDQATLDEVVVVGYGSQSKGSITGAVASLK MSGTPQPTIGKKAFRKYLKESLVHPSDKDCARAKGKVILTFRVDKEGRPQDITVKKGLCA SADEEAIRLIEEGPGWTMGEEPVEISIRF >gi|226332013|gb|ACIB01000043.1| GENE 137 158859 - 159434 451 191 aa, chain - ## HITS:1 COG:mll4824 KEGG:ns NR:ns ## COG: mll4824 COG1595 # Protein_GI_number: 13474039 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 17 186 9 178 179 75 25.0 4e-14 MLFFKRNISRLSDEELLIHYTKSGDTEYFGELYNRYIPLLYGLCLKYLHDEDRAQEAVMQ LFEDLLPKLGNYEIKVFKPWLYRVAKNHCLQLLRKENKEIPLDYTVNIMESDEFLHLLSE EESSEEQLKALHHCLEKLPEEQRTSITRFFLEEMSYADIVEQTGFTLNNVKSYIQNGKRN LKICIKNQLKG >gi|226332013|gb|ACIB01000043.1| GENE 138 159435 - 160106 372 223 aa, chain - ## HITS:1 COG:VC1937 KEGG:ns NR:ns ## COG: VC1937 COG0204 # Protein_GI_number: 15641939 # Func_class: I Lipid transport and metabolism # Function: 1-acyl-sn-glycerol-3-phosphate acyltransferase # Organism: Vibrio cholerae # 27 215 22 202 223 124 37.0 1e-28 MEKRFMQSVAMRIIYKGVFHWFLKLIVGVQFTDCQFLKKEKQFIILANHNSHLDTLSLLS SLPGNLLWKVKPVAAEDYFGKTRFQASISNFFINTLLIRRKGEKDSEHDPIRKMLEAIDA GYSLILFPEGTRGKSEQMGKIKSGIARILSLRPEVKYIPVFMTGMGRSLPKGKMILLPYK ASVYYGIPTLVKSTDTHEILDQITGDFEVMKEKYQVVIDEEEE >gi|226332013|gb|ACIB01000043.1| GENE 139 160116 - 161072 872 318 aa, chain - ## HITS:1 COG:VC1936 KEGG:ns NR:ns ## COG: VC1936 COG4589 # Protein_GI_number: 15641938 # Func_class: R General function prediction only # Function: Predicted CDP-diglyceride synthetase/phosphatidate cytidylyltransferase # Organism: Vibrio cholerae # 
22 311 14 303 310 261 47.0 1e-69 MKDLLDKIFPTLSDELIIVISLIIGLLVTASLILFLVKKIFPKTNISELAARTRSWWIMA GMFIGAVFISYNISYFFLAFLSFIAFRELYSVLGFREADRGALFWGILAIPIQYYLAYLA WYGAFIIFIPVVMFLVLPLRLVLKGDTHGITKSMALLQWILMLSVFGISHLAYLLSLPEL PGFSSGGRGLLLFLVFLTEINDVMQFIWGKLLGRHKILPKISPNKTWEGFLGGVISTTVI GYFLGFLTPLSAPNVILVSALIAIAGFSGDVVISAIKRDKGIKDMGNSIPGHGGVFDRID SLAYTAPVFFHLVYYIAY >gi|226332013|gb|ACIB01000043.1| GENE 140 161069 - 161725 473 218 aa, chain - ## HITS:1 COG:VC1935 KEGG:ns NR:ns ## COG: VC1935 COG0558 # Protein_GI_number: 15641937 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylglycerophosphate synthase # Organism: Vibrio cholerae # 2 206 27 233 252 164 45.0 9e-41 MKNEVDGRREIASRNTAWANIIARKLTHWGVTPNQISMMSVFFAMVGCLLLIGTVIYPGF NKYVAYILFIVCMQSRLLCNLFDGMVAIEGGKKSANGDLYNDMPDRFADALFIIPVGYVA GGFGIELGWLAALLAVMTAYFRWIGAYKTHQHFFNGPMAKQHRMALLTLTFVVATCTIYS GYDRMVCFIALIIINIGLIATLIHRLYLISHTTNTEIK >gi|226332013|gb|ACIB01000043.1| GENE 141 161916 - 163748 1984 610 aa, chain + ## HITS:1 COG:STM2315 KEGG:ns NR:ns ## COG: STM2315 COG2304 # Protein_GI_number: 16765642 # Func_class: R General function prediction only # Function: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain # Organism: Salmonella typhimurium LT2 # 139 610 111 591 593 415 46.0 1e-115 MKTNQFRAMVFALLMAVISLATVSAQAITVSGTVTDAKDGTPLVGCSVQIKGTTKGTVTN MNGQYTIQSKKGETLLFQYIGYKQEKRVVKSSTLDVKMKADELVLEECVVVGYGHELRAT KSMSTAYMAVCPASGIMYNAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDAASYSNMRRF INKGKLPPVDAIRTEELVNYFSYDYPKPTGSDPVKITMEAGTCPWNADHRLVRIGLKAKE IPTDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKDKVAIVTYAGNAGVKL EATPGSDKQKIREAIDELEASGSTAGGEGIMLAYKIAQKNFISGGNNRIILCTDGDFNVG VSSDKELEKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNLQEANRVL VNEFGATLHTVAKDVKLQVEFNPAQVQAYRLIGYESRLLADEDFNNDTKDAGEMGAGHTV TAFYEVVPTGVKSDFAGKIDDLKYQKKQKPSTPLNESDELLTIKLRYKTPDSNTSKKIEL PLIDHKSNRVSADFRFAAAVAMFGQLLRDSEFKGNATYDKVISLAKTGLENDEKGYKREF IRLAETAKSL >gi|226332013|gb|ACIB01000043.1| GENE 142 163951 - 164265 500 104 aa, chain + ## HITS:1 
COG:no KEGG:BF3394 NR:ns ## KEGG: BF3394 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 104 1 104 104 140 100.0 1e-32 MKKVLVAVALVMGLGSSVAFAQNAENANAVVATAQAPQDEYTKIEVKDLPAAVTETLGKA YPESTIKEASVTTKEEGKLYKVVVTQKDGTDVTVVLNEKGEEVK >gi|226332013|gb|ACIB01000043.1| GENE 143 164466 - 165338 754 290 aa, chain - ## HITS:1 COG:CAC1622 KEGG:ns NR:ns ## COG: CAC1622 COG2240 # Protein_GI_number: 15894900 # Func_class: H Coenzyme transport and metabolism # Function: Pyridoxal/pyridoxine/pyridoxamine kinase # Organism: Clostridium acetobutylicum # 6 290 5 290 290 311 50.0 7e-85 MFANKVKKVAAIHDLSGMGRVSLTVVIPILSSMGFQVCPLPTAILSNHTQYPDFTFLDLT DEMPRIIAEWKRLEVEFDAIYTGYLGSPRQIQIVSDFIRDFRRKDSLTVIDPVLGDNGKL YSNFIESMVVEMQHLVTHADVITPNLTELFYLLDRPYKESNTDQELKEYLRCLSDKGPEV VIITSVPVLDEPHKTSVYAYNRTGNRYWKITCPYLPAHYPGTGDTFTSVITGALLQGDSL PIALDRATQFILQGIRATFGYEYDNREGILLEKVLHNLDMPIQSSSYELI >gi|226332013|gb|ACIB01000043.1| GENE 144 165409 - 165987 661 192 aa, chain - ## HITS:1 COG:no KEGG:BF3592 NR:ns ## KEGG: BF3592 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 192 1 192 192 380 100.0 1e-104 MSSKEDILASIRKNTKNRYEYPDWQINATVYPDVVAKFCEMSFAVGGEAIVLGEGEDINA VIRRTYPDAGSIASDLEEITCATFNPDLLGRAQDLNGTDIAVVKGEIGVAENAAVWIPQT VKYKALYFIAEKLVIVLDKNRIVSNMHQAYERIKDEKYKFGTFISGPSKTADIEQALVMG AHGARDVLVILK >gi|226332013|gb|ACIB01000043.1| GENE 145 165984 - 167369 1275 461 aa, chain - ## HITS:1 COG:ykgF KEGG:ns NR:ns ## COG: ykgF COG1139 # Protein_GI_number: 16128292 # Func_class: C Energy production and conversion # Function: Uncharacterized conserved protein containing a ferredoxin-like domain # Organism: Escherichia coli K12 # 13 458 18 473 475 307 37.0 2e-83 MSTKHAKAAEKFLENPVMAAWHNETLWMVRAKRDKMSKEVPEWEELRDKACALKLYSNSH LEELLLEFEKNATANGAIVHWAKDAEEYRAIVYEILSSHGVKHFVKSKSMLAEECELNPF LIEKGIDVVESDLGERILQLMNLAPSHIVLPAIHIKREQVGELFEKEMGTEKGNFDPTYL THAARKNLRNIFLNAEAAMTGANFAVASTGDIVVCTNEGNADMGTSCPTLNIAAFGMEKI 
VPDMEALGVFTRLLARSATGQPITTYTSQYRNPREGGEYHIIIVDNGRSALLAHPDHIKT LNCIRCGACMNTCPVYRRSGGYSYTYFIPGPIGINLGMAHDPEKYYDNLSACSLCMSCSD VCPAKVDLAEQIYKWRQDLDKIGKANTGKKIMSGGMKVLMDHPTLFNAALWAAPVVNHLP RFMKYNDLDAWGKGRELPKFAGESFNEMWKKNKVQGKEENK >gi|226332013|gb|ACIB01000043.1| GENE 146 167366 - 168106 553 246 aa, chain - ## HITS:1 COG:BH1832 KEGG:ns NR:ns ## COG: BH1832 COG0247 # Protein_GI_number: 15614395 # Func_class: C Energy production and conversion # Function: Fe-S oxidoreductase # Organism: Bacillus halodurans # 1 245 1 238 244 171 38.0 1e-42 MKVGLFIPCYINAIYPQVGVASYKLLKNLGVDVDYPLDQTCCGQPMANAGFQDESLKMAI RFDDLFRKYDYIVGPSASCVAFVKENHPGILAKEGHQCQSAGKIYDLCAFIHDVIKPTKI PARFPHKVSIHNSCHGVRELFLSAPSELNIPYFNKLRDLLQMVEGIEVFEPSHVDECCGF GGMFAVEEQAVSVCMGRDKIRHHMETGAEYITGADSSCLMHMQGIIEREHLPIRMIHVVE ILASQL >gi|226332013|gb|ACIB01000043.1| GENE 147 168239 - 168784 205 181 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157803532|ref|YP_001492081.1| 50S ribosomal protein L35 [Rickettsia canadensis str. McKiel] # 3 180 19 224 225 83 31 7e-15 MRKINEIFYSLQGEGYHTGTPAVFIRFSGCNLKCDFCDTRHEEGEMMTDEDIVNEIGKYP AVMVILTGGEPSLWIDDAFIDLLHRAGKYVCIETNGTKPLPAAIDWVTCSPKQGVNLALN RMDEVKVVYEGQNIDVYEQLPAEHFFLQPCSCNNTAETVDCVMRHPKWRLSLQTHKLINI L >gi|226332013|gb|ACIB01000043.1| GENE 148 168771 - 169103 254 110 aa, chain - ## HITS:1 COG:aq_853 KEGG:ns NR:ns ## COG: aq_853 COG0720 # Protein_GI_number: 15606204 # Func_class: H Coenzyme transport and metabolism # Function: 6-pyruvoyl-tetrahydropterin synthase # Organism: Aquifex aeolicus # 5 101 7 106 114 68 39.0 2e-12 MFTVIKRMEISAAHKLILPYRSKCASLHGHNWIITVYCRSERLNADGMVVDFTQIKQAVK EKLDHRNLNEVLPFNPTAENIARWVCKQIPQCYKVEVQESEANTVIYEKD >gi|226332013|gb|ACIB01000043.1| GENE 149 169220 - 169582 216 120 aa, chain + ## HITS:1 COG:PA1439 KEGG:ns NR:ns ## COG: PA1439 COG2832 # Protein_GI_number: 15596636 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Pseudomonas aeruginosa # 8 117 21 129 135 89 42.0 2e-18 
MKTICIILGTISLGLGILGIFLPLLPTTPFLLLTAALYFKGSPRLYQWLLNHKYFGTYIR NFRENKAIPLRAKIISLLLMWGTMLYCIFFLIPLVWVKILLFLIAAGVTYHILSFKTLKK >gi|226332013|gb|ACIB01000043.1| GENE 150 169549 - 170115 531 188 aa, chain - ## HITS:1 COG:BS_yrrU KEGG:ns NR:ns ## COG: BS_yrrU COG0775 # Protein_GI_number: 16079781 # Func_class: F Nucleotide transport and metabolism # Function: Nucleoside phosphorylase # Organism: Bacillus subtilis # 13 173 32 215 231 58 28.0 5e-09 MLKILVTYAVQGEFTEIKWPDVEVYYVRTGIGKVKSAFHLSEAIQQVKPDIVINQGTAGT INHQVGDVFVCRHFVDRDMHKMTGLGMEYRIDSSELLAARGFCQHWTESATCNTGDSFLT ELTDIEGDVVDMEAYAQAFVCRAKEIPFISVKYVSDVIGQNSVKHWEDRLEDARAGLSHF FNVLKESI >gi|226332013|gb|ACIB01000043.1| GENE 151 170268 - 170513 112 81 aa, chain + ## HITS:1 COG:no KEGG:BF3402 NR:ns ## KEGG: BF3402 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 34 1 34 389 77 100.0 1e-13 MIQNSISDNRIVWLDVIRCVAMIMVIGVHCIDPFLHFSHNAGYSGIHALGSDLWFTAPSV GTSVCYDDGTASAPRKKTTTG >gi|226332013|gb|ACIB01000043.1| GENE 152 170386 - 171438 636 350 aa, chain + ## HITS:1 COG:RSc3292 KEGG:ns NR:ns ## COG: RSc3292 COG3274 # Protein_GI_number: 17548009 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Ralstonia solanacearum # 13 342 40 330 336 69 23.0 9e-12 MRAIPEYTHWAAIYGSLLRPSVPLFVMMTGLLLLPVKKQPLGKFYKKRIYRVLFPFLIWS VLYSMFPWVTGVLGLPKEIIGDFFCYTQGQESQSLIDSLKDVAMIPFNFSHKENHMWYIY LLIGLYLYMPFFSAWIENADRKTKRAFLLIWIISLFIPYLKEYVANCLFERSGYVFGTDT WNEFGLFYYFAGFNGYLLLGHYVKKGNDWSLMKTFILCILMFAVGYYITYTGFSTTASNP NATETEMELFFTFCSPNVLLMTLATFLLLQKVVITNSTVIKVLANMTQCGFGIYMVHYFV VGPFFLLIGPSSLPIPLQVPLMAICIFLCSWAFTALIYKLMPRKAVWFMG >gi|226332013|gb|ACIB01000043.1| GENE 153 171516 - 171998 433 160 aa, chain - ## HITS:1 COG:no KEGG:BF3600 NR:ns ## KEGG: BF3600 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 160 1 160 160 313 100.0 1e-84 MIKLRFSLLMALLVVAVSVFADNAPAKVQTALKKMYPKADGIAWSQDGGYYCADFMMNGY 
EKNVWFNAQGQWQMTQTEWGDTDELSATVYNAYASGPYSGWQVEDVTYVEFPKWQPIIVI KVGQQNVDIQYQLFYSPNGTLLRTRNVSYMDDILGPGTFL >gi|226332013|gb|ACIB01000043.1| GENE 154 172108 - 172863 643 251 aa, chain - ## HITS:1 COG:PM0526 KEGG:ns NR:ns ## COG: PM0526 COG3142 # Protein_GI_number: 15602391 # Func_class: P Inorganic ion transport and metabolism # Function: Uncharacterized protein involved in copper resistance # Organism: Pasteurella multocida # 7 245 4 242 244 209 46.0 4e-54 MKNYLFEVCTNSVESCIAAQEGGANRVELCAGIPEGGTTPSYGEIAMAREVLTTTRLHVI IRPRGGDFLYSPVEVKTMLKDIEMARQLGADGVVFGCLTTNGGIDVPVMKQLMEASKGLS VTFHRAFDVCRDASEALEQIIDLGCDRILTSGQQATAELGIPLLKELRERANGRITLLAG CGVNEKNICRIAKETGIQEFHFSARESIKSGMEYKNEAVSMGGTVHISEYERNVTTVKRV KDTIESITSSL >gi|226332013|gb|ACIB01000043.1| GENE 155 172955 - 174490 1991 511 aa, chain - ## HITS:1 COG:CAC1816 KEGG:ns NR:ns ## COG: CAC1816 COG1418 # Protein_GI_number: 15895092 # Func_class: R General function prediction only # Function: Predicted HD superfamily hydrolase # Organism: Clostridium acetobutylicum # 34 511 44 514 514 400 50.0 1e-111 MLVTIVASIACFIVGGILSYVLFKYGLKAKYDNVLKEAETEAEVIKKNKLLEVKEKFLNK KADLEKEVALRNQKIQQAENKLKQREMVLSQRQEEIQRKRAEADAVRENLEAQLGIVDKK KEELDKLQHQEIEKLEALSGLSADEAKERLVESLKEEAKTQAQSYINDIMDDAKLTASKE AKRIVIQSIQRVATETAIENSVTVFHIESDEIKGRIIGREGRNIRALEAATGVEIVVDDT PEAIVLSAFDPVRREIARLALHQLVTDGRIHPARIEEVVAKVRKQVEEEIIETGKRTTID LGIHGLHPELIRIIGKMKYRSSYGQNLLQHARETANLCAVMASELGLNPKKAKRAGLLHD IGKVPDEEPELPHALLGMKLAEKFKEKPDICNAIGAHHDEIEMTSLLAPIVQVCDAISGA RPGARREIVEAYIKRLNDLEQLAMSYPGVTKTYAIQAGRELRVIVGADKIDDKQTENLSG EIAKKIQDEMTYPGQVKITVIRETRAVSFAK >gi|226332013|gb|ACIB01000043.1| GENE 156 174599 - 174892 197 97 aa, chain - ## HITS:1 COG:no KEGG:BF3603 NR:ns ## KEGG: BF3603 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 97 1 97 97 147 100.0 1e-34 MNDKIKINLQIADSYYPLTINRDEEETVREAAKQVNIRLNAYREHYRNVAPEKIIAMVAY QFSLEKLQLLQRNDTQPYTAKIEELTEMLEEYFRNEE >gi|226332013|gb|ACIB01000043.1| GENE 157 174908 - 175201 244 97 aa, chain 
- ## HITS:1 COG:no KEGG:BF3604 NR:ns ## KEGG: BF3604 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 97 1 97 97 119 100.0 5e-26 MTEEEKKLLSTFEARLRHLIYLHDELKRENAELKQLLEDEKSELEKARAEYKALESSYTD LKTATTISLNGSDVKETKLRLSKLVREVDKCIALLNE >gi|226332013|gb|ACIB01000043.1| GENE 158 175541 - 177829 2686 762 aa, chain + ## HITS:1 COG:STM2472_1 KEGG:ns NR:ns ## COG: STM2472_1 COG0281 # Protein_GI_number: 16765792 # Func_class: C Energy production and conversion # Function: Malic enzyme # Organism: Salmonella typhimurium LT2 # 1 423 1 424 434 509 61.0 1e-144 MAKITKEAALLYHSQGKPGKIEVVPTKPYSTQTDLSLAYSPGVAEPCLEIEKNPQDAYKY TAKGNLVAVISNGTAVLGLGDIGALSGKPVMEGKGLLFKIYAGIDVFDIEVNEKDPDKFI EAVKAIAPTFGGINLEDIKAPECFEIERRLKEELDIPVMHDDQHGTAIISSAGLVNALQV AGKKIEDVKIVVNGAGASAVSCTKLYVSLGARLENIVMLDSKGVISKARTDLNEQKRYFA TDRTDIHTLAEAIKDADVFLGLSKGNTLSQDMVRSMAPMPIVFALANPTPEISYEDAMAA RPDVLMATGRSDYPNQINNVIGFPYIFRGALDTQAKAINEEMKIAAVHAIANLAKQPVPD VVNEAYHVNNFTFGPEYFIPKPVDPRLITEVSIAVARAAMESGVARKNIENWDDYKTHLR ELMGQESQLTRQLYDTARRNPQRVVFAEGGHPNMLKAAVEAKSEGICHPIILGNEERIEK LAKELDLSLDGIEIINLRHDREAERRERYAHILSQKRAREGATYEEANDKMFERNYFGMM MVETGDADAFITGLYTKYSNTIKVAKEVIGIRPEYKHFGTMHILNSKKGTYFLADTLINR HPDTSTLIDIAKLADQTVRFFNHTPVISMLSYSNFGSDQAGSPLKVHEAVAYMQQEYPEL AIDGEMQVNFAMNRELRDSKYPFTRLNGKDVNTLVFPNLSSANAGYQLLQAMDPDTEFIG PIQMGLNKPIHFTDIESSVRDIVNITAVAVIDAIVEKKKANK >gi|226332013|gb|ACIB01000043.1| GENE 159 177826 - 178002 186 58 aa, chain + ## HITS:1 COG:no KEGG:BF3606 NR:ns ## KEGG: BF3606 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 58 1 58 58 84 100.0 2e-15 MKRPLSKSQIVCIILLWVALCYIVLVYAERIDGPTILMLIISAALVFIPVYKSLKKNK >gi|226332013|gb|ACIB01000043.1| GENE 160 178045 - 178170 57 41 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MFYAVFLYKMLASSIFMFTFATSSTETIEFKNNHIKKKKEL >gi|226332013|gb|ACIB01000043.1| GENE 161 178167 - 179501 1589 444 aa, chain + ## HITS:1 COG:PA4588 KEGG:ns NR:ns ## COG: PA4588 COG0334 # Protein_GI_number: 15599784 
# Func_class: E Amino acid transport and metabolism # Function: Glutamate dehydrogenase/leucine dehydrogenase # Organism: Pseudomonas aeruginosa # 7 444 9 445 445 585 63.0 1e-167 MNAAKVLEDLKRRFPNEPEYHQAVEEVLSTIEEEYNKHPEFDKANLIERLCIPDRVYQFR VTWVDDKGNVQTNMGYRVQHNNAIGPYKGGIRFHASVNLGILKFLAFEQTFKNSLTTLPM GGGKGGSDFSPRGKSNAEVMRFTQAFMLELWRHIGPETDVPAGDIGVGGREVGFMFGMYK KLSHEFTGTFTGKGREFGGSLIRPEATGYGNIYFLMEMLKTKGTDLKGKVCLISGSGNVA QYTAEKVLELGGKVVTMSDSDGYIYDPDGIDRAKLDYIMELKNLYRGRIREYAETYGCKY VEGARPWGEKGDIALPSATQNELNGDDARKLVANGVIAVSEGANMPSTPEAIKVFQDAKI LYAPGKAANAGGVSVSGLEMTQNSIKLSWSSEEVDEKLKSIMKNIHEACVQYGTEPDGYV NYVKGANVAGFMKVAKAMMAQGIV >gi|226332013|gb|ACIB01000043.1| GENE 162 180150 - 180689 309 179 aa, chain + ## HITS:1 COG:VC2302 KEGG:ns NR:ns ## COG: VC2302 COG1595 # Protein_GI_number: 15642300 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Vibrio cholerae # 6 167 22 186 194 59 26.0 2e-09 MILNNESNKKKKFEQFFIMTYPKVKAFAWKLLKSEEDAEDIAQDIFAKLWTNPEIWENQE TWNSYIYTMVRNHIYNFLKHKSIRQTYQEQCTKEEPAISETDIHDQLYAKESELLIKLTI ANMPEQRRKIFRMSRTQEKSNQEIADELDISIRTVERHIYLALIDLKKVLLTLFFFYLG >gi|226332013|gb|ACIB01000043.1| GENE 163 180736 - 181725 634 329 aa, chain + ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 119 270 124 272 331 73 29.0 4e-13 MKNYIQRIIRLFATSDPDPKLTGEIHRWLLDQEHAGEKETALHDLWNETEGKVDRTTWDS LASVYTKVGANSGDRHQPRIRFAHYAAAIALLIVSVSVTFQMTKQHFAEAPLIENITPDG RLSSLRLPDGSIVQTNSGSILLYPEKFKGETRTVYLIGEANFKVKKNSGQPFIVRSGTMS VTALGTEFNVAAYPEENEMIATLIHGKIKVECDNGKESYIVTPGQQVTYRKSTGESRLAE ANIEDVTAWQKGMYVFRGVTMSEILNELERRYAVTFQYNANLFNDDKFNFRFREKSTLED ILNIMQEVVGGFSHELKGNICYIKPETKK >gi|226332013|gb|ACIB01000043.1| GENE 164 181906 - 185175 3183 1089 aa, chain + ## HITS:1 COG:no KEGG:BF3412 NR:ns ## KEGG: BF3412 
# Name: omp117 # Def: putative outer membrane protein Omp117 # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1089 1 1089 1089 2075 99.0 0 MNFYRLKSIFFIIFSCFILQTAAFAQNNVKITIKKKNITLQEALREVEKQSDYLIAFNES KLEKTKRVNLNINAESLDKTLASILSGTGLSYKIKDKYIMIIPQSKVEVESKKLSGIVKD DKGDPLIGVNVSFKGSPTGTVTGLDGRFSILAAKGNIIEFSYVGYTTQYIIVGDASSLTV VLEEDAKALDEVVVTALGIKRAEKALSYSVQQVKSDAINDVKDANFVNGLTGKVAGVSIN RSSSGIGGATRVVMRGAKSIVGNNNVLYVVDGMPIGNPSKGEINNDYSTPGGGEGISDFN PEDIESLSILTGPAAAALYGSSAANGVILINTKKGQEGKLKISISNNTEFMTPYVMPEFQ NRYGNAKGSYKSWGEMLQQPSTFRPKDFFKTGANIMNAANFSVGNKNNQTFVSVATTNST GIIPNNEYYRYNFTLRNTASMLNDKLHLDLGASYVLQGDQNMLSAGRYFNPLVPLYLFPR GEDFEAVKVYERYDTNRKFPIQEWSYGDQGLNLENPYWIVNREMFVSKKKRYMFYANVKY DILSWLNIAGRIRVDNTNTTSERKLHASTIKLHAQSDKGAYNRSMEEYQQTYADIMLNVN KNFGNFNLTANAGFSYEDHLTTGMGIGGKLFTVPNLFSAYNFDPASGPGSQSHTHTRNNS VFVSTELGYKSMLYLTLTGRQEWASQLVNSDQPTYFYPSVGVSGVISEMVSLPKFISFWK MRASFAEVGGPINYTGLTPGTVTDPMKGGVINPISVYPFPNFKAEQTKSYELGTNLRLFN NKINIDATVYLTDTYNQTFLSSMSPASGYSGFYVQAGKVRNKGIELSLGYNDQFGKVGYA TNLTYTANRNKIMKMVHDYKNPSDGSLFSITELTLQDKGGVYLREKDAIGDVYVKGILAR GKDGKLIEEEGGYKVDRSQRIKIGSVNPDFSIGWRHNVTWNNITLDLLFNGRFGGIVTSS TQAFLDDYGVSKDTYKARQNGGVWVNGTQYDAEKYYTTIGGEQLMAYYAYKATNIRLQEA SLSYTLPGKWFGNVINRLTVSAIGRNLWMIYNKAPFDPEMTSSTGTYNRGDVFMPPSLRS VGFSVKIEL >gi|226332013|gb|ACIB01000043.1| GENE 165 185196 - 186824 1504 542 aa, chain + ## HITS:1 COG:no KEGG:BF3612 NR:ns ## KEGG: BF3612 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 542 1 542 542 1086 100.0 0 MKLNKVKNMWRTLSVIALLGSTLCTSCISEDINRNPLLPTKEDEKMDGVIYGAYLPNLEK SVIPIGTASESTEPVNRYQIGVNLAGDAWAGYMSPRDNKFNGSKNFTNYFMYENWVNYVY SFMVTDVYSPWMQIKRISQDEGTRNDEIYALAQIIKIAALHRTTDMFGPIPYSQVGKGSF KVAYDSQESVYRSFLKELEEAVQTLDDYSNKSKEVLPAFDIVYNGDVNKWMRFANSLMLR LAIRVRFADAGLAKEYAEKAVKHPAGLIDSKELAAQMGKGAGLQMKNPLKVINEEYNDTR MGATIYSYLAGYNDARAAVYFVKNNGFKAVRCGIAKSGDAYNGFTRPNVHEDDPLYWMKA SEVCFLKAEGALAGFDMGGSAGDFYNAGIRMSFSENGLDNSSAETYLKDSTRKPANYTDT SNGELSANAPSSITIRWENGATEEEKLERIITQKYLAIFPNGQEAWTEWRRTGYPRQIVV 
AENKTNSAVLIGNGYDLGGVRRLPYPRTEYEQNGENLHNAISQYLGGVDNAATKVWWDKK SK >gi|226332013|gb|ACIB01000043.1| GENE 166 186850 - 187896 860 348 aa, chain + ## HITS:1 COG:no KEGG:BF3613 NR:ns ## KEGG: BF3613 # Name: not_defined # Def: putative endo-beta-N-acetylglucosaminidase # Organism: B.fragilis # Pathway: not_defined # 1 348 1 348 348 716 100.0 0 MNMRNTIFTFIGVALLVGFVACDDWTSPEKLDVENEAVGDLYSKRDSIKWAEEEKRHKEN EAAYEKYLENLRAYKSTKHPIMFGWFNAWQPDGAGKYPRLSLLPDSMDVVSIWGNWHSLS EEKIKELRSVQAKGTKVIIGWIIEDIGDQIKWGRDQWPADDTQAIKEYAQAIVDTINKYG YDGFDYDYEPSYASPFKPGNHCGNLTSCSRDYNKEKEILFMKTMRELLGPDKLFHLNGSI HWLDPRAAQYFDRFVVQSYNGSASSFERWTNDIQNRLNIKPEQLVFTESFQNKPGARSRF PGTYAGYVASKQGNVGGIGVFHINEDAFEDEAYVNIRKAISIMNPPVK >gi|226332013|gb|ACIB01000043.1| GENE 167 187920 - 189101 1037 393 aa, chain + ## HITS:1 COG:no KEGG:BF3415 NR:ns ## KEGG: BF3415 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 393 1 393 393 778 99.0 0 MKCNVSYKYLFIAIGILCPLFSACDYADGEGSGSENAVYMETPDNKGIVNFTLEPDGGIT YLTPRLANISQSPVTIQVGYDKEALDKYNKDNGASYEPLPPSAFKLADAEGNELSASEGI RVPAGDFSAKIMVKVGQLNSKDFPDSKKYAIPLSITGASNYSLIPSQRSAILLLNRSILS SVAKVSGGEGIRIKPVGMHTKAEWTIQMSAIYSSLTRSNLTTAYLSNGTGGAFYTRISST AGIQVKNGRDGDDTWTQIPLQAGKWLHITYVHKDKKTTVYVNGKVQKVFENSAITFGENS MIVVGNSGYRNDYLREIRLWDKALTESEINDYLYLPMDPATPHLISYLPLSKEMETKDLK APAGTENVTTKARIEYVENVKFPADELVIVNQE >gi|226332013|gb|ACIB01000043.1| GENE 168 189115 - 190146 691 343 aa, chain + ## HITS:1 COG:no KEGG:BF3615 NR:ns ## KEGG: BF3615 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 343 1 343 343 679 99.0 0 MKRRIKSAQSILWVLLLILSASCSKDEIQIVDQDTSWQMDNKYIEEDIREQLGIDPFTDL VYLGYYGNPYTQLEAINDLVNTTLVGKNELSFKVKVTKPYKEDIKVNLMKEDKLVTDFPE MAEGIPLFPSENCTFEGGVLKAGELETTVKLTIKDVEKLNNLSGYVMAIKLTMEGSHEHL AIARTRSAYFVKLNLSIRLDNIDSSNKKIEGKGFNKEISFKSDIRPDKLGSLNDGNFTAN NWYTSNANNYLTIILPEKQSLKGFRLDTNTSPSGSYMLKSCRVMVETPDGNWVNHGVFDR KSMDGIAYISFKKPVECTKVRFENMMAFNGRFSVDVNEVTAFR >gi|226332013|gb|ACIB01000043.1| GENE 169 
190315 - 190635 316 106 aa, chain - ## HITS:1 COG:no KEGG:BF3421 NR:ns ## KEGG: BF3421 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 106 23 128 128 196 100.0 2e-49 MIGLLTTKELDFLTKLAELLKEYSAIISYGHCSELRILVCAGDSEDVEKYPIIFEDSFDE NEIYDLLRKNRKRIEEIIEREVAEAVPEGELSQPDDQAGSMADHFH >gi|226332013|gb|ACIB01000043.1| GENE 170 190835 - 191290 486 151 aa, chain + ## HITS:1 COG:no KEGG:BF3422 NR:ns ## KEGG: BF3422 # Name: not_defined # Def: putative DNA-binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 151 1 151 151 239 100.0 2e-62 METEEEIQNVNVHHGHNIRRTRIEKNIKQDALAALVNMTQPNVSKYEKMRVIEDEMLNRF ARALNVPVEYLKTLEEDAPSVVFENITNNVHDNKDSSMGSTGYNNDSITNTFNPIDKITE LYERLLKEKDEKYAALEKRIQGLEQQNNNGK >gi|226332013|gb|ACIB01000043.1| GENE 171 192053 - 192409 286 118 aa, chain + ## HITS:1 COG:no KEGG:BF3424 NR:ns ## KEGG: BF3424 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 118 1 118 118 206 100.0 2e-52 MKKLRITWLCLILAGILLGVSSCSNDAEELQGKLVITFAKPTKGLKVAICSMENTKYPIL VESPNVNGVLKVSLNIGNYYIKPSDDSSDMYSDIGIQVRPDKTTTVTYGESRQVIGRE >gi|226332013|gb|ACIB01000043.1| GENE 172 192419 - 192613 63 64 aa, chain - ## HITS:1 COG:no KEGG:BF3620 NR:ns ## KEGG: BF3620 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 64 1 64 64 88 98.0 7e-17 MFRYCSRPCRLEEVILLLLVRNFMFLIKRRNILLLHIKGLGYLLSLLKDGFDRFKLNLSK PFWV >gi|226332013|gb|ACIB01000043.1| GENE 173 193309 - 193815 356 168 aa, chain - ## HITS:1 COG:no KEGG:BF3623 NR:ns ## KEGG: BF3623 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 168 1 168 168 315 99.0 5e-85 MSIDFFIAKCQTENIVDKEFGICDDEDEEKKTPAYVDRNQPDKWVAVVKNQTNQSINFTA VDNCVEMNRSDGTMDFRCDAMLTNGDNIVFVELKVQAADWIFHAVDEQLQTTIDHFKANH DLSRYKYKRAFVCNKRHPNFRVSYKDKMTSFYQKNGIRLNLVREIIFK >gi|226332013|gb|ACIB01000043.1| GENE 174 193822 - 195042 920 406 aa, chain - ## HITS:1 COG:no KEGG:BF3624 
NR:ns ## KEGG: BF3624 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 406 1 406 406 764 99.0 0 MNETVNLTVYQFGPISEADVVFDKYTVLIGKQGSGKSTIAKLYSMFTWLEKGLARRITSE KYITQYSRFQKIYCAYHRLESYFKRETVIHFYGLHYNFFYENEKFHVEAKGLPESYKVAK VMYVPAERNFLSTADDTDGLKSLPESLETLLEEFDKAKEAFKTGYRLPFNDTDFEYDALN KISWIKGSDYKIRLSAASSGYQSVLPLSLITRFLSDLVLDNANKEDLSIKEKKQIEKEVN KVMNDKSLTDGVKFAMLRNISSRFKYSCFVNIVEEMELNLYPESQRSVLFDLLSYANKIE LNRLVLTTHSPYVINYLTLAAKAFLLTQKISANETLQERIKEVVPADSAIDPARLRIYEL KDGGVFRLSTYEGLPSDENFLNIQLGVTNELFDQLLEIEQEFDYKN >gi|226332013|gb|ACIB01000043.1| GENE 175 195296 - 195448 174 50 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNRDRQLKKDMESREVQIHEEYEKNRHLMSEDKVKEHQKKNRGIKTTTQK >gi|226332013|gb|ACIB01000043.1| GENE 176 195553 - 196230 545 225 aa, chain - ## HITS:1 COG:no KEGG:BF3626 NR:ns ## KEGG: BF3626 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 225 1 225 225 424 99.0 1e-118 MKTYIYLFLLLLATASCSEQTAPDHSVGYLRVENIILSCDTETLPITRAVDAGLKLEIWQ GSECVRSYDPGAAELSKRIVLPVGEYTLKAFTPDQTEAPDNESGTPIYSVDYPFAIVSED VTLISVKAPQINIGVGVEYSDEFMANFTDFSITVSSPTGRQASLAGNVTDLLYFNVPTGG THLSYTLTATNADGETMTSEARPILQESGAELTSGNYKVRIGLVQ >gi|226332013|gb|ACIB01000043.1| GENE 177 196233 - 198365 1477 710 aa, chain - ## HITS:1 COG:no KEGG:BF3627 NR:ns ## KEGG: BF3627 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 710 1 710 710 1432 99.0 0 MHKSVLSLVCCLFFFLSCQEEIETMPNGSLNIVLTDEAAVTRTLPEALSDELRQQFTIEL LRDREGTIVPEYKGALRDFGDQRVFKVGSYQLKAYLGENPSLALDAPYYYGEVQDIAIEK GKATTVTVGCKVANALATFEIVNQEVFDKRLKDYYVEVSAGGESVTWKPGDATHPYFKAG GRVTMALIGTSVETGQEGSYALNPIETVKAGVKYNYKLSMKASNVSLEVTTETQQEPITI NETVPDSWLPKAKVFSEDFDENHVLTYTETADALSRAGIAYTALRPVQDVEFAFNFADKH LEHLNKTYLLSELSEEDRRALAAVNIVLPDLTAGSTEGVIDFAGVTSGMLTHDGGQDTDN IIAVRVKANDRWSDAGMYTIRTVKPVFKVGYYPGNVWTKEFTLNTLTADSVKTGNLDKFT DIAYEFSADGNSWEAMPGDLRKAGLSPGTSYYVRAKYRGEVPGEKVEVKTYEALSIPNSD 
FNAGYDVTYPKSENPLYTFKGDWIGTRNPLTCHTDGANAFYVSKSSTLPVVDGERNVAHM MTLGWGAGNTCSFGNKDYWLGNSVINHISAGIVCVGDYEAAGDVVNGKAAYIRPTSMSFV YKAAPYKDDEYLIEAYLENITGEVETIIGKAYLKSGTAYSSYQTQTLNFEYNNEHRNLPI SHVKIIFKAGTKEDRDHLEDKFRDAKVPYGDAYIIGSQFWLDSFTLHYDK >gi|226332013|gb|ACIB01000043.1| GENE 178 198377 - 199987 1332 536 aa, chain - ## HITS:1 COG:no KEGG:BF3628 NR:ns ## KEGG: BF3628 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 536 1 536 536 1023 99.0 0 MKSLQIYLFLFLSVFALGACIQNDIPYPYIKGEITAFEVEGQTGDAEINKNSRTIVVEVG DEVDIEELRITRFVVNEEATYSVDEQYCVSPNKFPSAGFSALADLPAGADTRVDFSKTVP VLLRTYQDYQWMITVKQTIERIVEVENQALPAIIDDKNHTVLVYVSQKQDLSAVKITKMI LGGSKATITPDPSTVTNFRRPQEFVVNRFDKEELWTVDVVRTTSTGTTGSADVWATRATL NGGMKQGTTPRVEYRKKSEDTWTVVPETDVKLESGTTFSTTLTGLQDGTDYVWRVVVEEV PSTESGFTTEKIQEIPNLNFDTWSQNPTGTFKKSWYPNADGSNSFWATGNDGVTSSLAGS RDSSTRPEEKSVVNGKAAYMVTLGSVPLVGVAAGNLFIGDYKTNAQSPKDSPKFGRSFTG ARPTGLKGWYKYTSKPVDYVGNPDNLKNDECHIYLRLWDDKDNEIGYGEFIGKETVTQYT QFRFDVTYTNKTAKPAKITIVATSSHYGGDFTGMKVTGSVGVGSELWVDEFELLYE >gi|226332013|gb|ACIB01000043.1| GENE 179 200008 - 200733 560 241 aa, chain - ## HITS:1 COG:no KEGG:BF3629 NR:ns ## KEGG: BF3629 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 241 1 241 241 454 100.0 1e-126 MKKIIIFLLLLCPVLVQAKKKFDRGIVKSVFVPKGQWFMGSTVSYSEQSADQYQFLVLAN IDAKGYTFRLSPFGGYFFADNMAAGGRFTYSRTYFNIGNVDINLGDDLSFHIKDDMYLEH NYSASGFLRTYMGLGSSKVFGFFNEVRLTYAYGQGKHSNGTGNDLTGYYQRSHTFEIGAA PGLAAFVSDFASVEVSVGVMGFSYKWVDQIHNQVNKASRHSASGNFKIDLFSINLGMTMY F >gi|226332013|gb|ACIB01000043.1| GENE 180 200744 - 202534 1329 596 aa, chain - ## HITS:1 COG:no KEGG:BF3432 NR:ns ## KEGG: BF3432 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 596 1 596 596 1157 100.0 0 MKRYIYLFVSVFTLVMASCQQEETSNGKTGFLQFSVEKNTSTILVPATRAEELPIALQVV DRAGTVVKETDDWHNWTSTPLELPLGSYTINAFSKGVDVGTAGFDEPYYWGQSEVTVVPK VNQSVNIECRLANVKVTVNYSADVKKYFSKLNCSVGNSSGKLVFGKDETRSGYFAVDKLN 
ISLALTNTDGRSFVFESEPITDVKERQHYRINYTMKANGAIGGVSVTLDPSTKEYNVNIA IPKESNPSVNVWSDFADVTLPLPDGVVTKECKYRISGTEDWNTVSNVEQADGKLVATITG LIPGTEYDFCFAINGVNGKITTATTELQKQLENGSFDEWNQNGKTWFPGTAAEASAKNSY WDTGNVGAATMSKNPSIGESNDVHTVGGKSAKLSSQFVGMFGIGKFAAGNIYIGRYMETY TSPMGARIRFGREFTSRPTQLKGWYKYTRGTSIDRGDHNVEELKNSGGDKCAIYIALTDN EGLVDDNGVKTAYEVNNNKEGSPTRYTVDLSEANKDVIAYGSITDEESKGSFDESGNVVW KQFTIDLKYRDLTRKPKYIIVVASASKYGDFFTGSESSVMLIDDFELVYGTPVTAN >gi|226332013|gb|ACIB01000043.1| GENE 181 202547 - 203875 1137 442 aa, chain - ## HITS:1 COG:no KEGG:BF3631 NR:ns ## KEGG: BF3631 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 442 1 442 442 793 100.0 0 MNAIKNIYAVLALIALCLSSCEMKNELEGKYNLKNDEGLLTLDLVNKTQATPTTKAESTS LTDELDVNTYKVEIVDNATEAVVRSFASYAKLKEALPLVVPVGNYKIVAKSGVLQDASRT PYFEGSSSIEVKQGMESKAEVLCKSATVKVSLNISEEFLNMFADDYVFTVSNGIGGVIYV KKEDLSSIYLSIPDGSTSIKIVAKVTEKDSGRDIETVYTVTKPDAEGLQGGDSFNVTVKP VEEGEDPNNPDVTPSDPKLGIQLDIDLTMDETGITVKVPTELIEESKPDEPDQPTDPDQP TVDGPEIIGADEVVEVDTNNPPTVQVTMKAPAGIQNLLVTITSDSNEFIGLISQMGLGET FDLANPGDLEDKLGGSLEDGSGIGLIDPNDPIKDKKEFIFDVSGFMPMLSPFGLQQHYFT IKLIDNNGKELAKKLTVKVVKN >gi|226332013|gb|ACIB01000043.1| GENE 182 203896 - 204705 664 269 aa, chain - ## HITS:1 COG:no KEGG:BF3632 NR:ns ## KEGG: BF3632 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 269 1 269 269 490 99.0 1e-137 MKRLSFMVSMAMLISCVASCDSGSDAVSCGVGTLSLGLSANPSFSTKTRSVNEAEYKKAD NYQVTVKDADGASLYNGLYKDMPLSIDLTAGKGYTVKAFYGENVNAGFDKLYVEGSQEFT VSEGEQKNVILYCKPANVKVSVVYTEDFLKYYSDCTVSLSTSHLTAPFEVNMKKDSGKDA YLKANADGEKVSITVGGFRDKEGNPVVMEALVAEKKVAPKTHLTITVDPEVITISTGTAS LDVTVDTGTEDKDVNIEIPEEYWPGNASK >gi|226332013|gb|ACIB01000043.1| GENE 183 204929 - 207898 2995 989 aa, chain - ## HITS:1 COG:no KEGG:BF3633 NR:ns ## KEGG: BF3633 # Name: not_defined # Def: phosphoenolpyruvate synthase # Organism: B.fragilis # Pathway: not_defined # 1 989 1 989 989 2012 99.0 0 MLSKFKLNQLYFKDTQFANLMTRRIFNVLLIANPYDAFMLEDDGRIDEKIFNEYTSLSLR 
YPPRFSQVSTEEEALRQLESVSFDLVICMPGTGDNDSFDIGRHIKEKYEQIPIVILTPFS HGITKRIANEDLSAFDYVFCWLGNTDLLVSIIKLIEDKMNLEHDVKEVGVQLILLVEDSI RFYSSVLPNLYKFVLKQSQEFSTEALNAHQRTLRMRGRPKIVLARTYEEAMDIYNKYTNN ILGVITDVRFPRVDKGEKDGMAGIKLCAEVRKKDPFVPLIIQSSETENAAYAAKYGATFI DKNSKKMDVDLRRIVSDNFGFGDFVFRNPETGVEIARVRNLKELQNILFAVPAESFLYHI SRNHVSRWLYSRAMFPVAEFLRPITWHSLQDVDAHRKIIFEAIVKYRKMKNQGVVAVFKR DRFDRYSNFARIGDGSLGGKGRGLAFIDNMVKRHPEFEEFENARVAIPKTVVLCTDVFDE FMDMNNLYQVALSDADDDTILRYFLKAKLPDRLVEDFFTFFDVVKSPIAIRSSSLLEDSH YQPFAGIYNTYMIPYLDDKYEMLRMLSDAIKGVYASVYFRDSKAYMQATSNVIDQEKMAV ILQQVVGNQYGDRYYPSMSGVARSLNYYPIGDEKAEEGTVNLALGLGKYIVDGGMTLRFS PAHPSKVLQTSELDIALKETQTRFYALDLKNAGDNFSIDDGFNLLKLHVKEAEKDGSLRY IASTYDPYDQVIRDGLYPGGRKVITFANILQHDVFPLARILRWVLRYGQQEMRRPVEIEF AVTLNHDRDKTGTFYLLQVRPIVDSKDMLDEDLTTIPDEDVLLRSNNSLGHGIMNEIHDI VYVKTDHYSASNNQNIAWEIEKINQQFLNEGKNYVLVGPGRWGSSDTWLGIPVKWPHISA ARVIVEAGLTNYRVDPSQGTHFFQNLTSFGVGYFTINAFMNDGVYNQEFLNAQPAVFETE YLRHVRFERPIVVKMDGKKKLGVVLMPDK >gi|226332013|gb|ACIB01000043.1| GENE 184 208313 - 209650 1530 445 aa, chain + ## HITS:1 COG:PA4588 KEGG:ns NR:ns ## COG: PA4588 COG0334 # Protein_GI_number: 15599784 # Func_class: E Amino acid transport and metabolism # Function: Glutamate dehydrogenase/leucine dehydrogenase # Organism: Pseudomonas aeruginosa # 2 445 4 445 445 539 58.0 1e-153 MNIEKIMSSLEAKHPGESEYLQAVKEVLLSIEDIYNQHPEFEKAKIIERLVEPDRIFTFR VTWVDDKGEVQTNLGYRVQFNNAIGPYKGGIRFHASVNLSILKFLGFEQTFKNALTTLPM GGGKGGSDFSPRGKSDAEIMRFCQAFMLELWRHLGPDMDVPAGDIGVGGREVGYMFGMYK KLTREFTGTFTGKGLEFGGSLIRPEATGFGGLYFVNQMLQTKGIDIKGKTVAISGFGNVA WGAATKATELGAKVVTISGPDGYIYDPNGISGEKIDYMLELRASGNDIVAPYADEFPGST FVAGKRPWEVKADIALPCATQNELNGEDAKNLIDNNVLCVGEISNMGCTPEAIDLFIEHK TMYAPGKAVNAGGVATSGLEMSQNAMHLSWSAAEVDEKLHSIMHGIHAQCVKYGTEPDGY INYVKGANIAGFMKVAHAMMGQGII >gi|226332013|gb|ACIB01000043.1| GENE 185 209821 - 210984 1245 387 aa, chain + ## HITS:1 COG:MA4232 KEGG:ns NR:ns ## COG: MA4232 COG0006 # Protein_GI_number: 20093022 # Func_class: E Amino acid transport and metabolism # Function: Xaa-Pro aminopeptidase # 
Organism: Methanosarcina acetivorans str.C2A # 23 384 20 385 388 174 31.0 3e-43 MLQPELKMRRDKIRVLMAQQNIEAALITCNVNLLYTYGRIVSGYLYLPLHSPALLFIKRP NNITGEHVFPIRKPEQIPDLIKENNLPMPQKLMLEGDELSFTEYNRLAAIFPESEIVNGT PLIREARSVKTPVEIELFRRSGVAHAKAYDQIPSVYRPGMTDLEFSIEIERLMRLQGNLG IFRVFGQSMEIFMGSVLTGDNAAYPSPYDFALGGEGLDPALPGGLNKTPLKEGQSVMVDL GGNFNGYMGDMSRVFSVGKLSDEAYTAHQVCLDIQEAVSSMAQPGVVCEDLYNAAINIVT KAGFADKFMGISQQAKFIGHGIGLEINEAPVLAPRMKQELEPGMVFALEPKIVIPGVGPV GIENSWAVTPEGVEKLTICNEEIIELQ >gi|226332013|gb|ACIB01000043.1| GENE 186 211146 - 213908 2632 920 aa, chain - ## HITS:1 COG:no KEGG:BF3637 NR:ns ## KEGG: BF3637 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 920 1 920 920 1775 99.0 0 MKRFFLLTIILTVVVTGKVGAQEVASASKRANQQYVLFESERDKGTNVTAMYDYLMDSYE NFMKVVEAPDNSQYIGGAKNRLRALYPYLLNGAVYYSEQKQPSKALDFAAAYIDMPQLKL FRSELLPKDNRYASVVYYAAVAAFNLEKNEKALRYFQEYLNTGTEAQQKDCYVYMNMIYQ KQKKYADQERVLEQAIAKYPVSLDFLYNLVNVHIATNNMEKLIGAIDRILAVDPNNDKVL PIKARILERQGKNVEALDIYKRLYALHPESFELMTGVARANFNCATEIVNNGATIANDTE YALVRQRASGYLMDAKDLFLKILQKDPSSKMYMQGLAGVYQYMDMKPEYEVLNKIVQDGA SYTAFPSRLAAYKEALKKTENVAQEQQAVPVPIEPAMLVIKVDQFTDANNNKVIDAGESF AIRFTIENQGKGDAYNVRLRLAEQQGYDQYFDGPRELDGGNIAAGKSKEYTFRYIAKKEL PTSLAKINIYAFEANGFDADPSELIVNTQEYAMPRLRVADHQFFASEGSSITLGKNGKLT VALQNFGTQTARKVKVNFTLPKNVYTTDTPEIIVDSIAPGDVAILDYGFLVNKRFDGDSI AVMLAVTESTRSASLNEAYKVKVGEYLSASASMNLSGNVAARKVNVKDFSLGFKSELMED VPVGAVNRHRYALIIGNEDYSMTGANAEINVPYAVNDAVLFREYCVRTFGVPDGQIKVVP NATAGMMHEQLDWLVNMASTDPQAELIFYYSGHGNNDEATKEPYLLPVDITGKNIRLGIS LSDLYKKLATYPVKGAYVFLDACFSGGYKSAAPLLAQKGVRVVPKVGLPQGNTLSFSSSS GDQTSSVYHEKKQGYYTYFLIKTIRDAKGNISMKELFDRTSAAVKRATALIDKIQEPQCM ASPTWIGWEDIKLETPVVTP >gi|226332013|gb|ACIB01000043.1| GENE 187 213912 - 216014 2268 700 aa, chain - ## HITS:1 COG:slr1409 KEGG:ns NR:ns ## COG: slr1409 COG2319 # Protein_GI_number: 16330230 # Func_class: R General function prediction only # Function: FOG: WD40 repeat # Organism: Synechocystis # 129 320 57 243 326 65 25.0 3e-10 
MNRMKCGFYTYRGIAFCFVAAMFCGQTMAQVVEKRGFDSQKKINAFDNTTFCTAYLSDGA LYTMRDIAINDVRKIERIVFNPTGSSIALLRAKNPISIYSFRDRNKKLFELKEKRKKLKA KPMPVSMCYSADARSFIVGNSLGEIVIYDTKEYMPLAYIQGEAPATALAMSSNNYFIAAA AGQNINIWNFQTKELRKAIPMPAVVKEVTFSPDAALLAVTTDDNHLTIIDTKNWDKVDIF DKLGGTLSSPSFHPEGKYISVVKDGKNIEIINLKNGVVEQDIVDPTGGVTGGRFFKNNQN SEVFLLTNRTKQMVFWDANGLNPFYGKIMGREVDAKMNEWVKMMQGESMEDYAIRVNDET RIKQQQLFAQEVATALAGDRISMDNPFIDGYDASNNMLNIGFKGLPSIGLEVPSNEAGDF KDGKMKFSNAVYVLNDKDEFELAYVEVTNETTNKVYIYDNIGRTKLTALEADENFVPLEI MQQATREEAQLAEIKEQVIEEKKQDKLITDNTQINVKTEVIPGVDANGKKILNYKVGYQY EVINKEFSAKEDFPSGGYNIERSNAAMSLMKIIKNAFEGDFAKYLSEGKQVKVIITGSAD AAPIRGRLAYDGRYGEFVDEPYYKDGNLDNITVTKAGGITQNEQLALMRAAGVKTYIEKN VTTLGNTKNEYEYHVEVAKERGGEFRKINVEFVIMDAFQQ >gi|226332013|gb|ACIB01000043.1| GENE 188 216279 - 217724 1356 481 aa, chain - ## HITS:1 COG:MT4026 KEGG:ns NR:ns ## COG: MT4026 COG0617 # Protein_GI_number: 15843539 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA nucleotidyltransferase/poly(A) polymerase # Organism: Mycobacterium tuberculosis CDC1551 # 27 450 34 457 480 229 34.0 7e-60 MIELTQEELKQHFSEPIFGQISETADALGLECYVVGGYVRDIFLQRPSKDIDVVVVGSGI AMAEALGKRLGRGAHVSVFKNFGTAQVKCHGTEVEFVGARKESYQRDSRKPIVEDGTLED DQNRRDFTINALAVCLNKGRFGELVDPFGGMNDLKEKIIRTPLDPDITFSDDPLRMMRCI RFATQLNFYIDDDTFESLCRNRERIEIISRERIADELNKIMLSPIPSKGFIDLDRSGLLE LIFPELVALQGVETRNGRAHKDNFYHTLEVLDNISRVTDNLWLRWSALLHDIAKPVTKRW EPKAGWTFHNHNFIGEKMIPNIFRKMKLPMNEKMKYVQKMVSLHMRPIVIADDVVTDSAV RRLLFEAGDDIDDLMTLCEADITSKNMERKQRFLNNFQLVRQKLKDLEEKDRVRNFQPPV SGEEIMEVFNLGPCRQVGSLKSAIKDAILDGVIPNEYEAAYAFMLQKAEKMGLKPVQNKE V >gi|226332013|gb|ACIB01000043.1| GENE 189 217897 - 218745 917 282 aa, chain + ## HITS:1 COG:no KEGG:BF3443 NR:ns ## KEGG: BF3443 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 282 1 282 282 496 100.0 1e-139 MKKLIVLTCTLSLLTACGDSIEKKAGEKLAAARAAFEHNDYNEAKLQIDSIKILYPKAFD TRKEGIKLMQQVELKEQQESLVYLDSMLQVKQEEFEAIKNKYTFEKNEEYQKIGNYFWPT 
QTVEKNLHRSFLRFQVNEQGVMTLTSIYCGPSNIHHVAVKVIAPDGSFAETPASNDSYET TDLGEKIEKADYKMGEDGNVLSFLYMNRDKKNIRVEYLGERKFSTTMTPSDREALVGTYE LAKLLSSIRQIQQEKEEANLKIEFVKRKMEQKAQEEAAEKQR >gi|226332013|gb|ACIB01000043.1| GENE 190 219121 - 222171 2302 1016 aa, chain + ## HITS:1 COG:no KEGG:BF3444 NR:ns ## KEGG: BF3444 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1016 1 1016 1016 1993 99.0 0 MKAVIISFFITLSTLTSHAQQRDIVLTGTVTDHQNGPLPGATIRIKGTQFGTVTDTDGHY LLRGKWKENDLILFSFIGMKEIRVKYTGQKVQDAAMQEDPKALDEVVIVARQNINELDIR AKSGVVQRVDVERLNSKPMIDMSLALQGAVPGLIITNTGDLGSKPEIRIRGNSSFRKGDM ANEPLYVMDGKVISSDAFMTLNPADIQEIKVLKDAVACALYGIKAANGVIEITSQRGNPD GRLTTSYSFNIGITTRGRRGVKMMDSEEKLELERRLQNISTPGYRYSEDYYRKYYATAPN LDELIAEGQQVLDSLKNIHTDWFDELIHRSIYQRHNLSIKGGTDKTSYYISTNYAKQGGR VPGNDTQRFTARMSLDQKLGNWGYFSLSTDAGYSATDTPNGSTHSPTDLIYQLNPYETKT GKLISYSEKSSEYTLNDLMSQYHSKSTDKRGGVSGSFNLRPFKGLEIDAVTGIDFLLNEA LTLVPSTSIAEREMGIAIAERGKLTKEKNTTTNISSNIRITYNKTFAGRHDLTIGGNMDY YLTQTDNISATGYGVGTQMSLNAINHSITGARKPTASSLLDKTAQLGFGVVMGYSFDSTY DLFATYKADASSVLPPDKRWNAAWAVGLGWTLSRYPFLKNNKVITLLNLKGSHGRMANLS GVSASATIGTFSYSTNYYGNARLLQLLGFYNTDLKPEQTSTTDFSLSIEFFKRLTLGLNL YRRETSDALLDVPIPLSNGFNTMKRNIGVLRNEGYELNAAIKVLDTPDWRVSLRGSLAYN RNKVISLYYTDRLYTSETALTPDYEVGKAYNMLYGLKSLGINPITGLPVFQGADGSEIPP TQNPARENFIVLGHSTPPYSGTFNLNFSYRNFDLDMDFYYVFGGIKPYNYSYVRSADSAN KNAIQKQLENMWFHRGDEGKIYHSPFYISPANASLQQPNTETVGKSDYLKLAMLSLRYRV PHTFLEKNCHFIKYANIAFQASNLFMITPYKESDPETGSLAGAMQPVLTINLSLTF >gi|226332013|gb|ACIB01000043.1| GENE 191 222185 - 223633 884 482 aa, chain + ## HITS:1 COG:no KEGG:BF3445 NR:ns ## KEGG: BF3445 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 482 1 482 482 949 99.0 0 MNLNPIINNFQRPIRKVQICCILIFSCAACSLNIPYENQYSDPDAITTVTAARELLASAY DAIPKPEFSLSVLSDDFQPTFLINLNADLNNLYKWHPKPMEDLSNSLWEKYYSAIAIANT VLERIHYVSPISDSDKQELQCIVSEAKTLKAYCYFNLLRLFAPTYPEGPEKDGIILKERF ELAFLKRSSISECVNAIRQLLTEAVTVDNRPSQVYWFSRQSAYYLLAALELYAENYDKAE 
EYALKILSSTNAYDVLAPVHYKKLWEQEPCEECIFSLHTTNSYYTDMNYERKKGDYFTIN STLTQLYTANDIRYEATVYPQEMPGESIGETLPVLYLGKYNKQNWNYKPIQYINKLRVSG ICFILAEAYCQDQKGHDALALEVINNYLAQREAPLLDTSLSGAPLLKAILQEKWKEFAGE GERYFDLKRYRKSLLSDWNLSATMKNKPIKPDDYRWLFPIPKGEYLYNENISQNAGWTKI EK >gi|226332013|gb|ACIB01000043.1| GENE 192 223651 - 224853 904 400 aa, chain + ## HITS:1 COG:no KEGG:BF3446 NR:ns ## KEGG: BF3446 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 400 1 400 400 769 98.0 0 MKTRMKKLFIYTVGICLLPAFTSCKDDREENKYTGINKIYLSADPPVITESENIPLTVNV DLTLTCEQDLVLNFELPDDKNGVLKLENNPVTIKAGKRSATFEVVSNQKNLLTEDTYFQV GISQFPTDNLMLNETLQVRVKPNPKIPELSPQQKALIKGYKTKYGIDLNKWLGVLSCHTT VQSPADGYLQPFAAAFTKEYDGKTVITLSEQATEDMPVLKMTDNPFGLTEYLYFVLRSET VADPEFWTQQPEPQKLMKLLNWNKESKETFAVSLDAIRLKDITSTSTRLEYLGPGKNYYG APITIVPFTYQFSALDRQNKLLESGNAEIKEINDQGASADPAYYLNCYDLVNSDYYDTPE NFIKSTGNIDFATDKMTYQFLLSHANAGGYTRITVVYEKK >gi|226332013|gb|ACIB01000043.1| GENE 193 224840 - 227704 1591 954 aa, chain + ## HITS:1 COG:pqqL KEGG:ns NR:ns ## COG: pqqL COG0612 # Protein_GI_number: 16129453 # Func_class: R General function prediction only # Function: Predicted Zn-dependent peptidases # Organism: Escherichia coli K12 # 39 740 36 727 931 181 22.0 6e-45 MKRNNKNRYLYIAAIFIAAMLPYTACPQSTPILLPPGTVEGRLPNGLHYLILHNASPASR VEFRLIMRVGSVQETEQEKGCAHFLEHITFGGTRHFPKRSLVEYLESLGMKYGQDINAFT GFDRTIYMFAVPTDFAKDEALDRSLLILHDWLDGVTIDPEKVENEKGIILEELRGFDPED DFYPLKIGQGIFSHRMPLGTTDDIRKVTPQVLKNYYHKWYVPSLATLVIVGDISPLEIES KIKERFKSLPGRPVNDFRNYPLEYTRGIHLASIRDSLQPRTKVELMIPHPCTVERTMEDA IAKEKGRLLVSAISSRFRARKLKTDVTDQWYLSDKNHFVLTVEGENRKEILTSISTTVSL LNDLIRNGWQEDELQDIKNNFCRRMKLSTDAPSRPSSMWCDDFADYVISGDRYLTDPSQQ QQLKEAMSRVSGQSLQTLLKEWMSYREETLLVACSTHPGLGAPLSETEIASAWAQGEQVE CTPFLYFRPEKQEEIDIETPPCLAARFPFDPASVLRQTEYPQNRIREVELKNGIRLVLKP TLEADSTLLITSFAPFGTSSLSDEEYPLLEGFAGYIDMGDIAKVDGQVLSDYLFRKEISL SMAVENHWHGFIGMSPTANAPELFNLIYEKIFDPELKYDEFEEIRQDLLENQDKETILEK MLQRSPDRLLSARINELTGTGFARSSQKLSSEQIKNLNLDSIAAFYKKLYTNPQGTTYVI 
CGNFNADTLMQQFVSVFGRIPVSSHLSRFSYPHFNFPVRKHIEGFPNDNDTQTLFDYLLP GHYQPGLKNTLTLKLMRDLIRNRLISVLREQKSLVYSPYISLMYEGIPQGIFYFDINASA DNDNMPQIEQLLKEILHQLKQQEVDNEELNTLKRSFLIAKREALNEESPSAWRTALVGLL KNGETISDFDHYEQCLDSITPAMLREAFRRYLDTENYILLYLSKNKLKNDTSNH >gi|226332013|gb|ACIB01000043.1| GENE 194 227685 - 228584 770 299 aa, chain + ## HITS:1 COG:VC2004 KEGG:ns NR:ns ## COG: VC2004 COG3016 # Protein_GI_number: 15642006 # Func_class: S Function unknown # Function: Uncharacterized iron-regulated protein # Organism: Vibrio cholerae # 34 282 48 292 325 111 30.0 1e-24 MILQTIKQISCLLCLCCFLQQSAFSQDDKDKPAYTLFDNTGKQISYGELIKRLSGYDVIF LGELHNCPITHWLEFEITRSIYNIHKDQLMLGAEMFESDNQLIFDEYMQQKISYDRFEAE ARLWDNYRTDYYPVVFFAKEHHIPFIATNIPRRYANIVKNKGFEALDSLSEEAKRYIAPL PIDFEYDEAQSAAAFSMMNMMGGRRAGDNRKLAQAQAIKDATMGWFIARNIKNKFLHING SYHSNRQGGIIPYLLRYRPNTSIVTVTSVRQESIRKLDDDHKGLADFYICVPEDMVNSY >gi|226332013|gb|ACIB01000043.1| GENE 195 228812 - 229252 345 146 aa, chain - ## HITS:1 COG:no KEGG:BF3643 NR:ns ## KEGG: BF3643 # Name: not_defined # Def: putative DNA-binding protein # Organism: B.fragilis # Pathway: not_defined # 1 146 1 146 146 246 98.0 2e-64 MVKKITTSKRDREKLKEQKRKEKQQRKEERQSNGPSSFEDMIAYVDQFGVLHSVPQERLE EEVDASHIEVSVPKQEDVEVAPLMGRIEYFNAAKGYGFVKDADNGEKYFFHISSAPTTIA EGDRVTFEIERGMRGMNAVRISIVTE >gi|226332013|gb|ACIB01000043.1| GENE 196 229681 - 229920 67 79 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLQPSYYYGRVSTELLSKVTDISHDNYLSQLYTLLYQRLCIMISCTNNKEFIQRFTFTRR TYSLDLAEETQNTPQYLPN >gi|226332013|gb|ACIB01000043.1| GENE 197 231016 - 231225 197 69 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|294808077|ref|ZP_06766850.1| ## NR: gi|294808077|ref|ZP_06766850.1| ISSpo3, transposase family protein [Bacteroides xylanisolvens SD CC 1b] # 1 69 118 177 311 62 49.0 9e-09 MLHRILHNIKDDDATFEEHTQVGENYAGGRTSEKDQRRKNHKSGKKSMGHSLAQKTIVMG LLSKGKVYA >gi|226332013|gb|ACIB01000043.1| GENE 198 231274 - 231597 143 107 aa, chain + ## HITS:1 COG:no KEGG:Fjoh_0919 NR:ns ## KEGG: Fjoh_0919 # Name: not_defined # Def: hypothetical protein # 
Organism: F.johnsoniae # Pathway: not_defined # 2 100 203 300 311 74 41.0 1e-12 MVKKGSTVISDSWSGYHSVKDSYTHEVVQHSLGIYVNKKGFHTNSIGGFRGHLKRMITGV YYVMSPKHLPKYCKESAFRYSTYKISDGERLSYSYSNQRNACTMESC >gi|226332013|gb|ACIB01000043.1| GENE 199 232051 - 232272 186 73 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MGKISTRAFCNTLKAGAFRSELTRCPTRSEIENAGLKVVSTYATRQLVMEESFISPNYVI IARNAGYIEKVEI >gi|226332013|gb|ACIB01000043.1| GENE 200 232380 - 232667 182 95 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|294808074|ref|ZP_06766847.1| ## NR: gi|294808074|ref|ZP_06766847.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b] # 1 89 1 95 231 64 41.0 2e-09 MVICTNLSLIGCDKEEIDEPVTPPIEKPDETIASPSANDIIKIKTGDINIIIGTINWQNI TYGNGRYVAVGRSGYIAYFIDEMNWTSKQVSSNCA >gi|226332013|gb|ACIB01000043.1| GENE 201 232773 - 233222 376 149 aa, chain + ## HITS:1 COG:no KEGG:Pjdr2_0818 NR:ns ## KEGG: Pjdr2_0818 # Name: not_defined # Def: S-layer domain protein # Organism: Paenibacillus # Pathway: not_defined # 3 149 176 327 1500 85 37.0 7e-16 MYLYGMTYGNEKYIAVGGSSSISYICYSTDGVNWTTKQVSCRYLYGATYGNGKYIVVGDG RYIAYSTDGINRTSKTVGSSSWQSTVYGNDKYVMVGNNGYIVYSTDGINWATKIFSSNSE LWESVAYSNGKYIAVGYNGYIASSANAVN >gi|226332013|gb|ACIB01000043.1| GENE 202 233483 - 234046 227 187 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566040|ref|ZP_04843494.1| ## NR: gi|253566040|ref|ZP_04843494.1| predicted protein [Bacteroides sp. 3_2_5] # 1 187 3 189 189 295 100.0 7e-79 MNNKIKIAIGIIMVFTLCMITYCYRDKLNIITYISIVGSYASLFGIWISYLQIRSVKEIA EITQQSIEQNISEVNQYLSYSDISKTIKIICEIENYILISKLEPALIRMRDLKLALIQIS QNGRLIKEIEKQKILKNHITNLGIDIANINEYIMNQHQLDFKVIHQNLENATTYLAELES KLKYQKL >gi|226332013|gb|ACIB01000043.1| GENE 203 234043 - 234378 201 111 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566041|ref|ZP_04843495.1| ## NR: gi|253566041|ref|ZP_04843495.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 111 1 111 111 199 100.0 4e-50 MIPQNYQALIDKLILGTNMKKISWKKTSRQTEFQTSVGSGTITTDNWKDDTTDMTYVDFV IWNDEGEAIDSISAFKGDEDYDSIMQLHECARRAYLKIDETINDIMNHLDF >gi|226332013|gb|ACIB01000043.1| GENE 204 234662 - 235378 505 238 aa, chain - ## HITS:1 COG:no KEGG:BF3451 NR:ns ## KEGG: BF3451 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 238 1 238 238 468 97.0 1e-131 MILSSLVHTDFQPYLLPNTDAWALPGDKALELLVYLEQQGVRQIYCVPPVKVENEGNVFS FLKDAFQYLQQQYSGNISLRLSARYRLDEGFPALLEKGDLLTIGGWKELLVDVSPLQQPE GLSEMIHAICQSGYIPVLMQPERSLYWGTEDYLHLRESGCRLMLNLYSLFGYNGDGALNY SRMLLRKEWYTYLCSGREDTKVMRYGESFSIEDDDDLAMKLQEIERNSRLLWSATENG >gi|226332013|gb|ACIB01000043.1| GENE 205 235375 - 235917 410 180 aa, chain - ## HITS:1 COG:MA3780 KEGG:ns NR:ns ## COG: MA3780 COG1898 # Protein_GI_number: 20092576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Methanosarcina acetivorans str.C2A # 1 176 1 177 183 211 57.0 7e-55 MNVIKTAIDGVVILEPRIFKDDRGYFFESFNQREFEEKVCKTTFVQDNESKSSYGVLRGL HFQQAPFAQSKLVRVVKGAVLDVAVDIRKGSPTFGQHVAVELTEDNHRQFFIPRGLAHGF SVLSEEVVFQYKCDNFYAPHSEGAITWDDPDLGIDWRIPADRVILSEKDSRHPRLKDLNL >gi|226332013|gb|ACIB01000043.1| GENE 206 235914 - 236801 902 295 aa, chain - ## HITS:1 COG:NMB0062 KEGG:ns NR:ns ## COG: NMB0062 COG1209 # Protein_GI_number: 15675999 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-glucose pyrophosphorylase # Organism: Neisseria meningitidis MC58 # 1 291 1 288 288 416 66.0 1e-116 MKGIVLAGGSGTRLYPITKGVSKQLLPIFDKPMIYYPISVLMLAGIREILIISTPDDLPA FRRLLGDGSDYGIRLEYAEQPSPDGLAQAFIIGEEFIGSDSVCLVLGDNIFYGQSFTRML NEAVRMAEVEQKATVFGYWVSDPERYGVAEFDKEGNVLSLEEKPEEPKSNYAVVGLYFYP NKVVEIAKKIEPSARGELEITTVNQEFLKDQELKVQLLGRGFAWLDTGTHDSLSEASTFI EVIEKRQGLKVACLEGIALRQGWISADKMRELAKPMLKNQYGQYLLKVIQELGLK >gi|226332013|gb|ACIB01000043.1| GENE 207 236815 - 237594 319 259 aa, chain - ## HITS:1 COG:jhp0094 KEGG:ns NR:ns ## COG: jhp0094 COG0463 # 
Protein_GI_number: 15611164 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Helicobacter pylori J99 # 8 257 2 248 260 244 50.0 9e-65 MALTQEKIRVSIITATYNSGNTLTDTVLSVLSQSYSNVEYIIVDGGSQDNTIEIIKMFEC RFNGNLKWISETDRGLYDAMNKGIQLATGDIIGVLNSDDFYTSQTVLSRVVAEFENKSLD AVYGDVHFVKADNLNKFVRYYSSRIFKPVLMKYGFMPAHPSFYCRKYCFFKYGLYKIDYK ICADFDLLLRYIYVDKISIKYIPLDMVTMRLGGVSTNGIGSHIRIMKEHLRSFRENGVKS NVFMLSIRYLYKITEFFIK >gi|226332013|gb|ACIB01000043.1| GENE 208 237555 - 238283 157 242 aa, chain - ## HITS:1 COG:DRA0037 KEGG:ns NR:ns ## COG: DRA0037 COG0463 # Protein_GI_number: 15807707 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Deinococcus radiodurans # 3 207 5 221 328 87 26.0 2e-17 MIRPLISVCIATYNGEKFIVEQLKTILSQLTDFDEIIISDDHSSDCTLSLIRSFRDPRIK IYLNENDKGYTSNFENALKKATGEIIFIADQDDIWKKDKVDISLNYLKKYDFIVSDACII DNSGNFMFESYIKQRNSYVSFWANVYKFSFLGCCYAFKRKILDIAIPFPPNHKLCTHDNW IFLIAASSFSYKIIPEKLICYRRHLGNTSTGGLKNNTSLYFKMKYRIYLIWHLLKRKLEF RL >gi|226332013|gb|ACIB01000043.1| GENE 209 238280 - 238690 178 136 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566047|ref|ZP_04843501.1| ## NR: gi|253566047|ref|ZP_04843501.1| O-antigen polymerase [Bacteroides sp. 
3_2_5] # 1 136 212 347 347 208 100.0 1e-52 MIPIDVIREKMQMYKALEEMGEDGFSQVNLFNPYYLFKLGIYYFFLTKYTFFERKDKYFT IYLKIFGLSLFLFPALSSITPLLGYRISELFGIIEIFLFPIACFLFRTHKQSLLVIFLYY SLLMSVNVYHKELIFL >gi|226332013|gb|ACIB01000043.1| GENE 210 239334 - 240437 502 367 aa, chain - ## HITS:1 COG:APE1191 KEGG:ns NR:ns ## COG: APE1191 COG0438 # Protein_GI_number: 14601239 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Aeropyrum pernix # 218 360 204 354 363 66 29.0 8e-11 MVYPYMKKVTFLLPRTGEVPIGGFKIVYEYGNRLAQRNVQVTFVYGIVSRKDLSFLLSFF YRVFRFFRFIKYKYFVDYKPTVWFKTNPEISHKLVYSLSERNIPKSDIIIATSWATAFWL NDYCCIEPSGKYYFIQHFEDWHAGKDKVITTWKMNLNKIVIAPWLQDIACNMGEKSVLVE NAFNQDEFYLSNPIADRNKYTAIMLWHDNPFKGCSIGLKALEIVLKRFPDFSVKLFGVPE KPVHLPPWVTYYRMPQKDLHRKLYNQSAIFIGTSYSEGWGLTLGEGMLCGCAIACTNNPG YTILAENNVTALLSEVGDAEGLANNIIKLVEDDLLRLKIAEAGWNRARSYTWERSFEKFV AAIKVFV >gi|226332013|gb|ACIB01000043.1| GENE 211 240419 - 241396 264 325 aa, chain - ## HITS:1 COG:no KEGG:PFL_5100 NR:ns ## KEGG: PFL_5100 # Name: not_defined # Def: O antigen biosynthesis abequosyltransferase RfbV, putative # Organism: P.fluorescens # Pathway: not_defined # 12 313 9 315 318 127 29.0 7e-28 MNGIKKDKYTLLSICIPTYNRYEILREGLNILLPQIKGLDIKVYVIDNNSTDDTVLIANE YSDIIYIRNEKNIGGDMNILKAYQIASQTSEYICVLGDSYRFKNRLDSIMDLLLPCDLNL LVLNRECEFSGIHSRYYYSADEILSDLGGGMDLIGSIVVNKKAVLEENYVPYLWSNFIHV GMVFNYLSSLDSLKCYFLREQVLYHTNLDKTKVSWYKDMFEIFAKTWMLTILSLPATLSI DSKLQCTKKHDIYTGVFRLKRLLYLRGFGYVKYKDIKKYSIYIPFVTDVPILYMYLISMI PQFIVQSLIFLNKMKSVMFLWYILI >gi|226332013|gb|ACIB01000043.1| GENE 212 241374 - 242744 368 456 aa, chain - ## HITS:1 COG:no KEGG:Coch_0703 NR:ns ## KEGG: Coch_0703 # Name: not_defined # Def: polysaccharide biosynthesis protein # Organism: C.ochracea # Pathway: not_defined # 13 454 14 451 453 333 44.0 6e-90 MDSSFIRTIQRYLGIDRAIFYTSVARILQAFGGVISVFFVAKYLTGIEQGFYYTFGSIVA IQVFFELGLNSIITQYVAHEVSYLSWKTPVELSGEEKYKSRLASLLHFCVKWYLGFAGIL LITLIIVGYSFFNRYGNHNDIDWHLPWLLLAFGTALNLLLAPVSAFLEGLGKVQEVAKMC 
LWQQMIGLLVVWGGLIIGAKLYVLGVNWLVGITLIVIFIVKTDFGNIIQNIWQITIKEKV NYRKEIFPYQWKIALSWISGYFIFQLFNPVLFATEGAVVAGQMGMTLAALNGIQSLSLSW MTTKIPLYSGLIAQKEYQRLDIVFNRTLKQSVFINGSALIIMFIFIYFVEHYHIVVGDIN LGDRFLKCWPMTLMMISLFANQFVNSWAIYLRCHKREPFLINSIVGGILCCLSTIFMGIY YGILGITGGYCCITLILTFWGYWIFKCKKNEWHKKR >gi|226332013|gb|ACIB01000043.1| GENE 213 242795 - 244531 771 578 aa, chain - ## HITS:1 COG:TM0548 KEGG:ns NR:ns ## COG: TM0548 COG0028 # Protein_GI_number: 15643314 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] # Organism: Thermotoga maritima # 1 569 6 565 584 324 35.0 2e-88 MKVSDYIISYIESRGVHVIFGYIGGMITHLVDSVSQNPNMQFIQTYHEQTAAIAAEGFAK ESGLFGVAISTSGPGATNMMTGIADAYFDSIPVLYITGQVNTYEYKYDKPVRQQGFQETD IVSMVKSVTKYAKLIDKAEDIKYELDKALYIALSGRKGPVLLDLPMDIQREEINPETLIG YSGESILNNPLIAWEEIRLLMESSHRPMLLLGAGCCNSDMVLLNDFIRRHHFPVITSLMG RGAIDETYDNYIGMIGSYGNRCANMGVANADLLIALGTRLDTRQTGARLDQFLSNGHIIH VDIDDNELEYHRLLNRKKVNCTIDCFLQKEKEMPISLGDISEWNFFLHGLKQRYGQDAEI ERFVENKSPYRFMQYFDSLTQTDDVICADIGQNQMWAAQTLRLKSGQKFVTSGGLAPMGF SLPVAIGCSFANPNKKVFSINGDGGFHMAIQSLMLISQYNLPIKVIILNNASLGMITQFQ HLYFDDRMCGTTLNGGYRVPDIKSLSTAYGLPYFRLTVDRLDDPDLREEMQAAHNCIIEC VVEGLTSVSPKLEYDKPISKPLPLLPEEEYKENMLSEV >gi|226332013|gb|ACIB01000043.1| GENE 214 244518 - 245156 374 212 aa, chain - ## HITS:1 COG:BH2304 KEGG:ns NR:ns ## COG: BH2304 COG0451 # Protein_GI_number: 15614867 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Bacillus halodurans # 4 199 109 302 315 58 25.0 9e-09 MNSLLYIAEKSNVSKFIALGSQAEYGDFDGIVSENAGLFPVNSYGYVKSMVSRMVGSFCD LRGIDWYWLRVFSVYGERESNQWLIPGLLTNMLDNMAGMDLTLGLQRYAYLYVKDFANAV MKVCSGKAPCGVYNLSSSTAIELRVLLEHLRDRLNPAFELRFGALPYRAGQPMLVQGDVS KFVKSFGHFENTPLNAGLEYTITYYKKQHESI >gi|226332013|gb|ACIB01000043.1| GENE 215 
245435 - 246514 838 359 aa, chain - ## HITS:1 COG:STM2091 KEGG:ns NR:ns ## COG: STM2091 COG0451 # Protein_GI_number: 16765421 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Salmonella typhimurium LT2 # 8 357 5 354 359 384 52.0 1e-106 MGIDIFDNFYRGKRVLVTGHTGFKGSWLSIWLHELGAEVIGVAQDPFTARDNFVLSGIGE KIKADLRADIRDGERIKAIFQEYQPEIVFHLAAQPLVRLSYDIPVETYETNVMGTIHVLE AVRSTDSVKVGVMITTDKCYENKEQIWGYRENEPMGGYDPYSSSKGAAEIAIASWRRSFF NPEQYDKHGKSIASVRAGNVIGGGDWALDRIIPDCIKALESGRTIDIRSPKAVRPWQHVL EPLSGYMLLAQKMWDAPTDYCEGWNFGPRSESISTVWDVATRVVSEYGRGELRDLSTPDA LHEARLLMLDISKARFCLGWEPRMNIGQTVGLTVDWYKRYREEEVYDVCVDQIKDYLLK >gi|226332013|gb|ACIB01000043.1| GENE 216 246520 - 247296 630 258 aa, chain - ## HITS:1 COG:alr2825 KEGG:ns NR:ns ## COG: alr2825 COG1208 # Protein_GI_number: 17230317 # Func_class: M Cell wall/membrane/envelope biogenesis; J Translation, ribosomal structure and biogenesis # Function: Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) # Organism: Nostoc sp. 
PCC 7120 # 1 258 1 257 257 339 59.0 3e-93 MKAVILAGGFGTRLSEATNLIPKPMVEIGGKPILWHIMKTYSHYGINDFVICCGYKQYII KEYFANYFRHNSDMTVDLSNNTTTILDNHSENWKVTMVDTGLNTQTGGRIRRVQKYLGNE RFLLTYGDGVTDLNIGDTLKAHESSGCLLSLTAYKPGGKFGALQLDLDTDKVLSFQEKPD GDRNWINAGYFVCEPEVFDYIPEGDSTIFERQPLESIAKAGRMHAFRHTGFWKPMDTLRD NTELNEMWDQGVAPWKVW >gi|226332013|gb|ACIB01000043.1| GENE 217 247334 - 248677 1063 447 aa, chain - ## HITS:1 COG:YPO3113 KEGG:ns NR:ns ## COG: YPO3113 COG0399 # Protein_GI_number: 16123279 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Yersinia pestis # 1 442 1 432 437 511 56.0 1e-144 MSQKDLLKQQILDLTREYYKEVHGSSRSFEPGKSFVNYGGRYFDDRELVNLVDSSLDFWL TAGPWARKFEIRFSEWLGVKYCSLTNSGSSANLLAFMALTSPQLGERRIRRGDEVITVAC GFPTTVTPCIQYGAVPVFVDVTIPEYNIDVTQLEAALSPKTKAVMIAHSLGNPFDLQAVK DFCDKHNLWLVEDNCDALGSTYTIDGVEKKTGTIGHIGTSSFYPPHHMTMGEGGAVYTDD PLLHKLVNSFRDWGRDCWCIGGVDNTCKYRFSKQFGDLPVGYDHKYVYSHFGYNLKVTDM QAAIGCAQLEKLDSIVEARRSNFAYLKEGLAGTSGLILPEAQKNSDPSWFGFLISVKEDA GFTRNDLSQHLESRKIQTRNLFAGNLLKHPAFDEMRSTGEGYRVIGNLEGTDYVMNHTLW IGVYPGMTRAMLDHMIGTIRDFVSSHK >gi|226332013|gb|ACIB01000043.1| GENE 218 248696 - 249793 839 365 aa, chain - ## HITS:1 COG:BS_tagO KEGG:ns NR:ns ## COG: BS_tagO COG0472 # Protein_GI_number: 16080606 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Bacillus subtilis # 6 332 9 311 358 131 32.0 2e-30 MNILFIILAFVISASVARLIIPRILLISLRKKLFDMPSERKVHKRAIPRLGGVSFFPTIL LSSCGVFALRILMGYDVPALRAVYLLPECLFLVCGMTLLYLTGIADDLVGVRYRQKFVIQ IICASFFPLAGLWINNFYGLFGLYALPAWIGMPFTVLLVVFVTNAINLIDGIDGLASGLS SVALLVFGFLFMQKELWTYSMLAFATFGVLVPFFYYNVFGSAERARKIFMGDTGSLTLGY VLSFLAIKYSQYNPDVTPYTEGAFVIAFSTLIVPAFDVIRVVMVRVRSGKSPFEPDKNHI HHKFLAMGFTPRKAMITILLISCAFSAVNILLVPVIDNTVMLLADIAVWVGLNLWFDRVR DKKQK >gi|226332013|gb|ACIB01000043.1| GENE 219 249845 - 250330 431 161 
aa, chain - ## HITS:1 COG:no KEGG:BF3465 NR:ns ## KEGG: BF3465 # Name: uphZ # Def: putative LPS biosynthesis related transcriptional regulatory protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 161 1 161 161 316 100.0 2e-85 MNSLDSQITALYTLAHELLYLGSDGSPIYSDHFSRLNGDVLSRANTLYPHHGSTDEEEAR LCLSLLMGYNATIYNNGDKEVRIQQILNRCWEVLDRLPASLLKVRLLTYCYGEVFDDDLS REAHSIIDSWGERALSGDECEIAEQLRSLEENPYPNWEVEE >gi|226332013|gb|ACIB01000043.1| GENE 220 250375 - 251001 580 208 aa, chain - ## HITS:1 COG:no KEGG:BF3667 NR:ns ## KEGG: BF3667 # Name: not_defined # Def: putative transcriptional regulator UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 1 208 1 208 208 395 99.0 1e-109 MNASKTDIRITSPDREVLSYSGAPKEHPFVKESPELFWYAVRVTYSRELALKEYLDGECI ENFIPMHYEYIVKNERRVRKLVPAVHNLVFIRSSRERIDRIKDEMGMTLPIRYIMDRESR QPIVVPTSQMRSFMAVAGSYDQQLVYLEPSAVAFRKGQRVRVTGGIFAGVEGEFIRVKND RRVMVSIQGVMAVATTYIHPSLIEPLDL Prediction of potential genes in microbial genomes Time: Tue May 17 23:41:24 2011 Seq name: gi|226332012|gb|ACIB01000044.1| Bacteroides sp. 3_2_5 cont1.44, whole genome shotgun sequence Length of sequence - 242915 bp Number of predicted genes - 218, with homology - 213 Number of transcription units - 101, operones - 53 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 138 - 197 6.2 1 1 Op 1 . + CDS 310 - 690 266 ## BF3468 hypothetical protein 2 1 Op 2 . + CDS 756 - 2915 1737 ## COG5545 Predicted P-loop ATPase and inactivated derivatives 3 1 Op 3 . + CDS 2899 - 3111 60 ## + Term 3250 - 3322 29.0 + Prom 3171 - 3230 6.9 4 2 Op 1 . + CDS 3421 - 4947 474 ## BF3470 hypothetical protein 5 2 Op 2 . + CDS 4952 - 5395 289 ## BF3471 hypothetical protein + Term 5452 - 5509 2.2 6 3 Tu 1 . - CDS 5596 - 5841 379 ## BF3675 hypothetical protein - Prom 5867 - 5926 7.5 + Prom 5831 - 5890 8.4 7 4 Tu 1 . + CDS 6109 - 6579 452 ## BF3676 putative non-specific DNA binding protein + Term 6719 - 6778 22.2 - Term 6707 - 6765 13.4 8 5 Op 1 . 
- CDS 6793 - 7494 602 ## COG0120 Ribose 5-phosphate isomerase 9 5 Op 2 . - CDS 7524 - 7664 109 ## 10 5 Op 3 . - CDS 7645 - 8649 754 ## BF3679 hypothetical protein - Prom 8670 - 8729 2.1 + Prom 8658 - 8717 3.9 11 6 Op 1 . + CDS 8737 - 9255 416 ## COG1247 Sortase and related acyltransferases 12 6 Op 2 . + CDS 9344 - 9766 330 ## BF3477 putative DNA-binding protein + Prom 9816 - 9875 2.9 13 7 Op 1 . + CDS 9939 - 10490 235 ## BF3478 hypothetical protein 14 7 Op 2 . + CDS 10453 - 10977 334 ## BF3688 hypothetical protein 15 8 Tu 1 . - CDS 11115 - 11720 783 ## COG0632 Holliday junction resolvasome, DNA-binding subunit - Prom 11824 - 11883 7.2 + Prom 11681 - 11740 7.3 16 9 Tu 1 . + CDS 11879 - 12778 1089 ## BF3690 meso-diaminopimelate D-dehydrogenase + Prom 12785 - 12844 3.2 17 10 Tu 1 . + CDS 12920 - 13570 493 ## COG1272 Predicted membrane protein, hemolysin III homolog + Term 13593 - 13628 -0.6 + Prom 13657 - 13716 3.2 18 11 Op 1 12/0.000 + CDS 13911 - 16304 2494 ## COG1328 Oxygen-sensitive ribonucleoside-triphosphate reductase 19 11 Op 2 . + CDS 16311 - 16769 450 ## COG0602 Organic radical activating enzymes + Prom 16924 - 16983 3.7 20 12 Tu 1 . + CDS 17004 - 18410 903 ## COG0477 Permeases of the major facilitator superfamily 21 13 Op 1 2/0.000 - CDS 18575 - 19663 1274 ## COG0075 Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase 22 13 Op 2 . - CDS 19669 - 20460 837 ## COG0637 Predicted phosphatase/phosphohexomutase - Prom 20595 - 20654 7.6 + Prom 20422 - 20481 6.8 23 14 Tu 1 . + CDS 20657 - 21535 913 ## BF3490 hypothetical protein 24 15 Op 1 17/0.000 - CDS 21663 - 23018 1301 ## COG0750 Predicted membrane-associated Zn-dependent proteases 1 25 15 Op 2 . - CDS 23038 - 24204 1183 ## COG0743 1-deoxy-D-xylulose 5-phosphate reductoisomerase 26 15 Op 3 . - CDS 24207 - 25067 800 ## COG0739 Membrane proteins related to metalloendopeptidases 27 15 Op 4 . - CDS 25133 - 25675 500 ## BF3701 16S rRNA-processing protein RimM 28 15 Op 5 . 
- CDS 25672 - 26976 1476 ## COG0766 UDP-N-acetylglucosamine enolpyruvyl transferase 29 15 Op 6 . - CDS 27024 - 27641 763 ## BF3703 hypothetical protein 30 15 Op 7 . - CDS 27693 - 28382 708 ## COG1214 Inactive homolog of metal-dependent proteases, putative molecular chaperone - Prom 28493 - 28552 7.5 + Prom 29005 - 29064 3.9 31 16 Op 1 . + CDS 29194 - 29406 131 ## BF3705 hypothetical protein 32 16 Op 2 8/0.000 + CDS 29415 - 30281 1118 ## COG1561 Uncharacterized stress-induced protein 33 16 Op 3 . + CDS 30294 - 30908 584 ## COG0194 Guanylate kinase 34 16 Op 4 . + CDS 30940 - 31545 362 ## COG1057 Nicotinic acid mononucleotide adenylyltransferase 35 17 Tu 1 . - CDS 31549 - 32601 698 ## COG1408 Predicted phosphohydrolases - Prom 32678 - 32737 5.3 + Prom 32561 - 32620 1.6 36 18 Tu 1 . + CDS 32701 - 33597 634 ## COG1575 1,4-dihydroxy-2-naphthoate octaprenyltransferase + Term 33833 - 33868 -1.0 - Term 33503 - 33556 8.1 37 19 Op 1 16/0.000 - CDS 33602 - 34741 1407 ## COG1088 dTDP-D-glucose 4,6-dehydratase 38 19 Op 2 . - CDS 34761 - 35624 938 ## COG1209 dTDP-glucose pyrophosphorylase - Prom 35646 - 35705 4.7 39 20 Tu 1 . - CDS 35801 - 36292 512 ## COG0622 Predicted phosphoesterase - Prom 36314 - 36373 3.6 + Prom 36243 - 36302 2.4 40 21 Op 1 . + CDS 36349 - 38424 1428 ## COG0855 Polyphosphate kinase 41 21 Op 2 . + CDS 38490 - 40739 2238 ## BF3715 putative phosphate/sulphate permeases 42 22 Tu 1 . - CDS 40904 - 41302 353 ## COG0784 FOG: CheY-like receiver - Prom 41324 - 41383 10.6 + Prom 41260 - 41319 8.6 43 23 Tu 1 . + CDS 41411 - 43045 2001 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains 44 24 Tu 1 . + CDS 43164 - 43439 307 ## BF3718 hypothetical protein + Term 43564 - 43605 -0.9 45 25 Tu 1 . - CDS 43457 - 43684 72 ## BF3719 hypothetical protein - Prom 43803 - 43862 3.6 + Prom 43582 - 43641 4.3 46 26 Tu 1 . 
+ CDS 43665 - 44069 87 ## BF3720 hypothetical protein + Term 44153 - 44213 3.7 + Prom 44071 - 44130 2.9 47 27 Op 1 11/0.000 + CDS 44241 - 45458 1303 ## COG0845 Membrane-fusion protein 48 27 Op 2 . + CDS 45517 - 48603 3252 ## COG3696 Putative silver efflux pump 49 27 Op 3 . + CDS 48623 - 49801 1231 ## BF3723 hypothetical protein + Term 49846 - 49902 2.6 50 28 Op 1 . - CDS 49965 - 50993 980 ## BF3724 hypothetical protein 51 28 Op 2 . - CDS 51071 - 51820 739 ## BF3725 hypothetical protein 52 28 Op 3 . - CDS 51856 - 53274 1121 ## COG0144 tRNA and rRNA cytosine-C5-methylases 53 28 Op 4 . - CDS 53271 - 53528 296 ## BF3727 hypothetical protein 54 28 Op 5 . - CDS 53476 - 53841 258 ## BF3518 hypothetical protein - Prom 53934 - 53993 4.3 + Prom 53777 - 53836 3.6 55 29 Op 1 . + CDS 53960 - 54586 666 ## BF3730 hypothetical protein 56 29 Op 2 . + CDS 54586 - 55134 439 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 57 29 Op 3 . + CDS 55118 - 55447 365 ## BF3732 hypothetical protein + Term 55584 - 55628 8.1 + Prom 55878 - 55937 3.5 58 30 Op 1 . + CDS 56002 - 57198 841 ## COG1502 Phosphatidylserine/phosphatidylglycerophosphate/cardioli pin synthases and related enzymes 59 30 Op 2 16/0.000 + CDS 57264 - 58058 747 ## COG0207 Thymidylate synthase 60 30 Op 3 . + CDS 58073 - 58573 331 ## COG0262 Dihydrofolate reductase - Term 58483 - 58536 3.8 61 31 Tu 1 . - CDS 58570 - 59046 431 ## COG1522 Transcriptional regulators - Prom 59137 - 59196 5.7 + Prom 59096 - 59155 6.2 62 32 Tu 1 . + CDS 59197 - 60477 1110 ## BF3737 hypothetical protein - Term 60914 - 60972 13.8 63 33 Op 1 . - CDS 60985 - 61449 505 ## BF3739 hypothetical protein 64 33 Op 2 . - CDS 61449 - 62039 615 ## BF3740 hypothetical protein 65 33 Op 3 . - CDS 62076 - 62546 427 ## BF3741 hypothetical protein 66 33 Op 4 . - CDS 62553 - 63374 841 ## COG0811 Biopolymer transport proteins - Prom 63409 - 63468 6.6 - TRNA 63453 - 63540 62.1 # Ser GGA 0 0 67 34 Op 1 . 
- CDS 63702 - 64478 756 ## COG0084 Mg-dependent DNase 68 34 Op 2 . - CDS 64482 - 65192 531 ## BF3744 hypothetical protein 69 34 Op 3 . - CDS 65200 - 66174 857 ## COG0142 Geranylgeranyl pyrophosphate synthase - Term 66188 - 66241 8.3 70 34 Op 4 . - CDS 66255 - 66938 590 ## BF3746 TonB - Prom 67009 - 67068 7.1 + Prom 66905 - 66964 8.1 71 35 Op 1 2/0.000 + CDS 67094 - 67783 220 ## PROTEIN SUPPORTED gi|15639271|ref|NP_218720.1| bifunctional cytidylate kinase/ribosomal protein S1 + Term 67832 - 67884 13.3 + Prom 67815 - 67874 3.0 72 35 Op 2 . + CDS 67896 - 68768 364 ## PROTEIN SUPPORTED gi|15895122|ref|NP_348471.1| 4-hydroxy-3-methylbut-2-enyl diphosphate reductase 73 35 Op 3 . + CDS 68842 - 69822 1227 ## COG0205 6-phosphofructokinase + Term 69843 - 69902 14.8 74 36 Op 1 3/0.000 - CDS 69879 - 70748 618 ## COG1028 Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 75 36 Op 2 . - CDS 70748 - 71980 1002 ## COG1902 NADH:flavin oxidoreductases, Old Yellow Enzyme family 76 36 Op 3 . - CDS 71995 - 72720 377 ## BF3752 3-oxo-5-alpha-steroid 4-dehydrogenase - Prom 72787 - 72846 4.7 - Term 72834 - 72869 -0.8 77 37 Op 1 4/0.000 - CDS 72962 - 74305 1470 ## COG0372 Citrate synthase 78 37 Op 2 1/0.050 - CDS 74318 - 75508 1205 ## COG0538 Isocitrate dehydrogenases - Prom 75600 - 75659 2.7 - Term 75544 - 75576 1.7 79 38 Tu 1 . - CDS 75671 - 77914 2175 ## COG1048 Aconitase A - Prom 78042 - 78101 8.1 + Prom 77898 - 77957 5.5 80 39 Tu 1 . + CDS 78173 - 80077 1604 ## COG1112 Superfamily I DNA and RNA helicases and helicase subunits + Term 80131 - 80175 11.8 - Term 80110 - 80171 19.0 81 40 Tu 1 . - CDS 80190 - 82109 1609 ## BF3757 hypothetical protein - Prom 82208 - 82267 8.2 - Term 82241 - 82288 12.9 82 41 Op 1 . - CDS 82312 - 83355 1132 ## COG0059 Ketol-acid reductoisomerase - Term 83363 - 83403 1.7 83 41 Op 2 . - CDS 83437 - 84021 660 ## BF3759 hypothetical protein 84 41 Op 3 . 
- CDS 84039 - 84782 801 ## COG3884 Acyl-ACP thioesterase 85 41 Op 4 32/0.000 - CDS 84783 - 85343 588 ## COG0440 Acetolactate synthase, small (regulatory) subunit 86 41 Op 5 6/0.000 - CDS 85357 - 87054 1537 ## COG0028 Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 87 41 Op 6 . - CDS 87110 - 88912 1804 ## COG0129 Dihydroxyacid dehydratase/phosphogluconate dehydratase 88 42 Tu 1 . - CDS 89236 - 90414 628 ## BF3764 hypothetical protein - Prom 90466 - 90525 9.3 - Term 90705 - 90762 14.2 89 43 Op 1 . - CDS 90801 - 92876 1637 ## BF3553 hypothetical protein 90 43 Op 2 . - CDS 92893 - 95043 1599 ## COG2931 RTX toxins and related Ca2+-binding proteins 91 43 Op 3 . - CDS 95062 - 97218 1426 ## BF3767 hypothetical protein 92 43 Op 4 . - CDS 97224 - 98531 862 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins 93 43 Op 5 . - CDS 98563 - 99411 467 ## BF3769 transcriptional regulator - Prom 99519 - 99578 8.0 + Prom 99478 - 99537 9.3 94 44 Tu 1 . + CDS 99739 - 100329 619 ## COG1047 FKBP-type peptidyl-prolyl cis-trans isomerases 2 + Term 100359 - 100406 10.5 - Term 100346 - 100394 11.1 95 45 Op 1 . - CDS 100421 - 100939 389 ## BF3771 hypothetical protein - Prom 100971 - 101030 2.2 96 45 Op 2 . - CDS 101048 - 102334 1167 ## COG3681 Uncharacterized conserved protein - Prom 102444 - 102503 6.6 + Prom 102324 - 102383 3.5 97 46 Tu 1 . + CDS 102510 - 103637 912 ## BF3561 hypothetical protein + Prom 103704 - 103763 4.5 98 47 Op 1 . + CDS 103787 - 104881 752 ## BF3562 hypothetical protein 99 47 Op 2 . + CDS 104892 - 105674 706 ## BF3563 hypothetical protein + Term 105706 - 105752 8.6 - Term 105879 - 105930 12.4 100 48 Op 1 . - CDS 105946 - 106341 197 ## gi|253566158|ref|ZP_04843612.1| predicted protein 101 48 Op 2 . - CDS 106377 - 107948 1036 ## BT_1798 hypothetical protein - Prom 108085 - 108144 10.0 102 49 Tu 1 . 
- CDS 108560 - 109498 683 ## BF3777 hypothetical protein - Term 109513 - 109564 14.2 103 50 Op 1 . - CDS 109577 - 110500 930 ## BF3569 hypothetical protein 104 50 Op 2 . - CDS 110514 - 112970 1527 ## COG1520 FOG: WD40-like repeat - Term 113008 - 113049 1.1 105 51 Op 1 . - CDS 113075 - 114868 1663 ## BF3780 hypothetical protein 106 51 Op 2 . - CDS 114881 - 118255 2962 ## BF3781 hypothetical protein - Prom 118310 - 118369 5.0 - Term 118385 - 118422 0.3 107 52 Op 1 . - CDS 118491 - 119513 621 ## COG3712 Fe2+-dicitrate sensor, membrane component 108 52 Op 2 . - CDS 119587 - 120144 556 ## BF3574 putative RNA polymerase sigma factor - Prom 120164 - 120223 6.5 109 53 Tu 1 . - CDS 120225 - 120362 59 ## gi|265766917|ref|ZP_06094746.1| predicted protein - Prom 120437 - 120496 5.8 - Term 120825 - 120867 1.1 110 54 Op 1 . - CDS 120995 - 122836 1600 ## BF3784 hypothetical protein 111 54 Op 2 . - CDS 122859 - 126074 2668 ## BF3785 hypothetical protein - Prom 126192 - 126251 2.8 - Term 126814 - 126872 14.0 112 55 Op 1 . - CDS 126887 - 127231 308 ## BF3579 hypothetical protein 113 55 Op 2 . - CDS 127191 - 128207 827 ## BF3787 putative chitobiase 114 55 Op 3 . - CDS 128236 - 130122 1534 ## BF3788 hypothetical protein 115 55 Op 4 . - CDS 130152 - 133604 2595 ## BF3581 hypothetical protein - Prom 133636 - 133695 5.9 116 56 Op 1 . - CDS 133733 - 134929 645 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 134968 - 135027 3.1 - Term 134940 - 134974 -0.1 117 56 Op 2 . - CDS 135029 - 135607 305 ## BF3791 RNA polymerase ECF-type sigma factor - Prom 135653 - 135712 8.5 + Prom 135636 - 135695 7.7 118 57 Op 1 . + CDS 135868 - 136944 1031 ## COG0082 Chorismate synthase 119 57 Op 2 . + CDS 136957 - 138321 1166 ## COG0624 Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases + Term 138342 - 138396 14.2 - Term 138330 - 138384 13.4 120 58 Tu 1 . 
- CDS 138406 - 139461 1025 ## COG3049 Penicillin V acylase and related amidases - Prom 139593 - 139652 5.2 - Term 139466 - 139511 10.1 121 59 Tu 1 . - CDS 139760 - 140077 202 ## COG1012 NAD-dependent aldehyde dehydrogenases - Prom 140190 - 140249 5.3 122 60 Op 1 . - CDS 140291 - 141013 586 ## BF3797 putative integral membrane protein - Prom 141036 - 141095 4.7 123 60 Op 2 . - CDS 141103 - 143253 2163 ## COG0550 Topoisomerase IA - Prom 143387 - 143446 3.8 + Prom 143220 - 143279 6.3 124 61 Tu 1 . + CDS 143380 - 144780 1108 ## COG3669 Alpha-L-fucosidase + Term 144830 - 144870 9.1 - Term 144811 - 144865 11.1 125 62 Op 1 7/0.000 - CDS 144886 - 147033 2201 ## COG1884 Methylmalonyl-CoA mutase, N-terminal domain/subunit 126 62 Op 2 . - CDS 147035 - 148933 2170 ## COG1884 Methylmalonyl-CoA mutase, N-terminal domain/subunit - Prom 149157 - 149216 7.9 + Prom 148983 - 149042 5.1 127 63 Tu 1 . + CDS 149182 - 150846 1825 ## COG2985 Predicted permease + Term 150867 - 150926 9.3 - Term 150859 - 150910 8.1 128 64 Tu 1 . - CDS 150934 - 151563 533 ## BF3595 hypothetical protein - Prom 151673 - 151732 5.8 + Prom 151621 - 151680 5.9 129 65 Op 1 1/0.050 + CDS 151729 - 153081 1242 ## COG0534 Na+-driven multidrug efflux pump 130 65 Op 2 . + CDS 153122 - 153691 487 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases + Prom 153786 - 153845 1.5 131 66 Tu 1 . + CDS 153976 - 154467 344 ## BF3806 hypothetical protein 132 67 Tu 1 . - CDS 154685 - 156895 2085 ## COG1752 Predicted esterase of the alpha-beta hydrolase superfamily - Prom 156998 - 157057 4.2 133 68 Tu 1 . - CDS 157084 - 158445 1247 ## COG0534 Na+-driven multidrug efflux pump - Prom 158498 - 158557 3.4 + Prom 158436 - 158495 4.5 134 69 Op 1 . + CDS 158583 - 160310 1970 ## COG1190 Lysyl-tRNA synthetase (class II) + Term 160339 - 160370 0.0 135 69 Op 2 . + CDS 160410 - 161405 981 ## COG0240 Glycerol-3-phosphate dehydrogenase 136 69 Op 3 . 
+ CDS 161453 - 162790 1527 ## COG0166 Glucose-6-phosphate isomerase + Term 162815 - 162866 15.6 - Term 162796 - 162859 15.9 137 70 Tu 1 . - CDS 162875 - 163381 511 ## BF3605 putative lipoprotein - Prom 163403 - 163462 3.3 + Prom 163371 - 163430 4.6 138 71 Tu 1 . + CDS 163479 - 164207 816 ## COG0637 Predicted phosphatase/phosphohexomutase + Term 164227 - 164267 8.1 139 72 Tu 1 . + CDS 164779 - 165771 497 ## PG0838 integrase + Term 165817 - 165858 4.7 140 73 Tu 1 . + CDS 165915 - 166106 175 ## gi|253566200|ref|ZP_04843654.1| predicted protein - Term 166166 - 166219 1.1 141 74 Op 1 . - CDS 166464 - 168341 528 ## CHU_3478 hypothetical protein 142 74 Op 2 . - CDS 168353 - 168877 455 ## CHU_3477 hypothetical protein - Prom 168897 - 168956 5.6 - Term 169409 - 169445 5.1 143 75 Tu 1 . - CDS 169679 - 170398 255 ## gi|253566203|ref|ZP_04843657.1| predicted protein - Prom 170433 - 170492 7.8 144 76 Op 1 . - CDS 170533 - 170997 186 ## BF2444 hypothetical protein 145 76 Op 2 . - CDS 170994 - 171488 431 ## COG3023 Negative regulator of beta-lactamase expression - Prom 171718 - 171777 5.8 146 77 Tu 1 . - CDS 172374 - 172808 455 ## gi|253566206|ref|ZP_04843660.1| predicted protein - Prom 172952 - 173011 7.2 147 78 Op 1 . - CDS 173097 - 173429 255 ## gi|253566207|ref|ZP_04843661.1| predicted protein 148 78 Op 2 . - CDS 173433 - 173717 357 ## gi|253566208|ref|ZP_04843662.1| predicted protein 149 78 Op 3 . - CDS 173768 - 178231 1618 ## BF2447 hypothetical protein 150 78 Op 4 . - CDS 178228 - 180276 987 ## BF2448 hypothetical protein 151 78 Op 5 . - CDS 180273 - 183365 2480 ## COG5281 Phage-related minor tail protein 152 78 Op 6 . - CDS 183367 - 183984 516 ## gi|253566212|ref|ZP_04843666.1| predicted protein - Term 183989 - 184027 4.3 153 79 Op 1 . - CDS 184051 - 184530 432 ## BF2452 hypothetical protein 154 79 Op 2 . - CDS 184562 - 184930 252 ## gi|253566214|ref|ZP_04843668.1| predicted protein 155 79 Op 3 . 
- CDS 184930 - 185385 174 ## gi|253566215|ref|ZP_04843669.1| predicted protein 156 79 Op 4 . - CDS 185382 - 185705 253 ## gi|253566216|ref|ZP_04843670.1| predicted protein 157 79 Op 5 . - CDS 185702 - 186010 249 ## gi|253566217|ref|ZP_04843671.1| predicted protein 158 79 Op 6 . - CDS 186014 - 187258 1108 ## NMC0858 putative phage-related protein 159 79 Op 7 3/0.000 - CDS 187269 - 187865 252 ## COG3740 Phage head maturation protease 160 79 Op 8 2/0.000 - CDS 187868 - 189118 810 ## COG4695 Phage-related protein 161 79 Op 9 . - CDS 189159 - 190781 836 ## COG4626 Phage terminase-like protein, large subunit 162 79 Op 10 . - CDS 190778 - 191161 393 ## gi|253566222|ref|ZP_04843676.1| predicted protein - Prom 191191 - 191250 5.7 - Term 191381 - 191433 5.6 163 80 Op 1 . - CDS 191463 - 191855 299 ## gi|253566223|ref|ZP_04843677.1| conserved hypothetical protein 164 80 Op 2 . - CDS 191904 - 192122 225 ## gi|253566224|ref|ZP_04843678.1| predicted protein 165 80 Op 3 . - CDS 192137 - 192439 229 ## gi|301164546|emb|CBW24105.1| putative endonuclease 166 80 Op 4 . - CDS 192393 - 192722 178 ## gi|253566226|ref|ZP_04843680.1| predicted protein 167 80 Op 5 . - CDS 192697 - 192882 57 ## gi|301164548|emb|CBW24107.1| hypothetical protein 168 80 Op 6 . - CDS 192915 - 193193 185 ## BF2464 hypothetical protein 169 80 Op 7 . - CDS 193208 - 193759 478 ## gi|253566229|ref|ZP_04843683.1| conserved hypothetical protein 170 80 Op 8 . - CDS 193779 - 194060 253 ## BF2465 hypothetical protein - Prom 194155 - 194214 5.1 171 81 Tu 1 . - CDS 194256 - 195197 443 ## BF2466 hypothetical protein - Prom 195218 - 195277 2.9 - Term 195724 - 195755 3.2 172 82 Op 1 . - CDS 195866 - 197077 264 ## BF2468 hypothetical protein 173 82 Op 2 . - CDS 197153 - 197572 277 ## BF2469 hypothetical protein 174 82 Op 3 . - CDS 197641 - 198396 508 ## COG0863 DNA modification methylase 175 82 Op 4 . - CDS 198396 - 198770 381 ## BF2471 hypothetical protein 176 82 Op 5 . 
- CDS 198743 - 198955 135 ## BF2407 hypothetical protein 177 82 Op 6 . - CDS 198978 - 199418 248 ## BF2472 putative recombination protein 178 82 Op 7 . - CDS 199393 - 199743 230 ## BF2473 hypothetical protein 179 82 Op 8 . - CDS 199798 - 200226 266 ## gi|253566240|ref|ZP_04843694.1| predicted protein 180 82 Op 9 . - CDS 200245 - 200478 234 ## BF2474 hypothetical protein 181 82 Op 10 . - CDS 200462 - 200776 225 ## gi|253566242|ref|ZP_04843696.1| conserved hypothetical protein 182 82 Op 11 . - CDS 200793 - 201350 341 ## BF2475 hypothetical protein 183 82 Op 12 . - CDS 201383 - 201622 63 ## gi|301164567|emb|CBW24126.1| hypothetical protein 184 82 Op 13 . - CDS 201629 - 201835 270 ## gi|301164568|emb|CBW24127.1| hypothetical protein - Prom 201913 - 201972 9.0 + Prom 201799 - 201858 9.3 185 83 Tu 1 . + CDS 201989 - 202378 259 ## gi|253566245|ref|ZP_04843699.1| predicted protein + Prom 202402 - 202461 2.3 186 84 Tu 1 . + CDS 202482 - 202997 277 ## gi|253566246|ref|ZP_04843700.1| predicted protein 187 85 Tu 1 . - CDS 203088 - 203267 202 ## - Prom 203496 - 203555 5.5 + Prom 203288 - 203347 6.4 188 86 Op 1 . + CDS 203450 - 204703 381 ## BT_2451 putative pyrogenic exotoxin B 189 86 Op 2 . + CDS 204700 - 205356 284 ## gi|253566248|ref|ZP_04843702.1| predicted protein + Term 205394 - 205430 5.0 - TRNA 205564 - 205635 35.7 # Arg CCG 0 0 + Prom 205732 - 205791 4.2 190 87 Op 1 . + CDS 205855 - 206892 1045 ## COG2502 Asparagine synthetase A 191 87 Op 2 . + CDS 206899 - 207561 471 ## COG0692 Uracil DNA glycosylase + Term 207595 - 207667 27.1 - Term 207583 - 207654 26.3 192 88 Op 1 . - CDS 207672 - 210410 2302 ## BF3817 hypothetical protein - Prom 210436 - 210495 1.7 193 88 Op 2 . - CDS 210502 - 211038 506 ## COG1418 Predicted HD superfamily hydrolase + Prom 211394 - 211453 4.8 194 89 Tu 1 . + CDS 211474 - 213780 1936 ## COG3525 N-acetyl-beta-hexosaminidase + Prom 213913 - 213972 5.0 195 90 Op 1 . + CDS 213999 - 216092 1243 ## BF3612 hypothetical protein 196 90 Op 2 . 
+ CDS 216130 - 218364 1859 ## COG1472 Beta-glucosidase-related glycosidases 197 91 Op 1 . - CDS 218711 - 219166 245 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 198 91 Op 2 . - CDS 219184 - 220101 717 ## BF3822 putative sodium-dependent transporter - Prom 220225 - 220284 5.0 199 92 Op 1 . + CDS 220209 - 221453 983 ## COG0860 N-acetylmuramoyl-L-alanine amidase 200 92 Op 2 . + CDS 221473 - 222366 759 ## BF3824 hypothetical protein + Prom 222486 - 222545 3.5 201 93 Tu 1 . + CDS 222715 - 224145 995 ## COG0593 ATPase involved in DNA replication initiation - Term 224005 - 224051 8.9 202 94 Tu 1 . - CDS 224233 - 225012 624 ## COG0778 Nitroreductase - Prom 225132 - 225191 5.5 + Prom 224979 - 225038 7.1 203 95 Op 1 . + CDS 225183 - 227804 2085 ## COG0209 Ribonucleotide reductase, alpha subunit + Term 227842 - 227885 10.1 + Prom 227921 - 227980 5.1 204 95 Op 2 . + CDS 228002 - 230704 2391 ## COG1640 4-alpha-glucanotransferase + Term 230736 - 230794 16.3 + Prom 230743 - 230802 2.4 205 96 Op 1 . + CDS 230834 - 231865 716 ## COG3594 Fucose 4-O-acetylase and related acetyltransferases 206 96 Op 2 . + CDS 231916 - 232284 345 ## COG1539 Dihydroneopterin aldolase 207 96 Op 3 . + CDS 232308 - 233729 1102 ## BF3831 hypothetical protein + Term 233851 - 233888 8.0 - Term 234182 - 234252 16.1 208 97 Op 1 . - CDS 234294 - 234812 533 ## COG1803 Methylglyoxal synthase 209 97 Op 2 . - CDS 234817 - 235848 818 ## COG1216 Predicted glycosyltransferases 210 97 Op 3 . - CDS 235841 - 236737 905 ## COG1560 Lauroyl/myristoyl acyltransferase 211 97 Op 4 . - CDS 236740 - 238059 764 ## PROTEIN SUPPORTED gi|16079597|ref|NP_390421.1| hypothetical protein BSU25430 - Prom 238080 - 238139 3.4 + Prom 238283 - 238342 5.6 212 98 Tu 1 . + CDS 238488 - 240161 1589 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) + Prom 240165 - 240224 2.8 213 99 Tu 1 . 
+ CDS 240320 - 240538 285 ## - Term 240784 - 240844 14.3 214 100 Op 1 27/0.000 - CDS 240862 - 241305 711 ## PROTEIN SUPPORTED gi|60683080|ref|YP_213224.1| 50S ribosomal protein L9 215 100 Op 2 11/0.000 - CDS 241317 - 241589 460 ## PROTEIN SUPPORTED gi|53715145|ref|YP_101137.1| 30S ribosomal protein S18 216 100 Op 3 . - CDS 241592 - 241936 577 ## PROTEIN SUPPORTED gi|53715146|ref|YP_101138.1| 30S ribosomal protein S6 - Prom 241961 - 242020 9.9 + Prom 241896 - 241955 9.8 217 101 Op 1 . + CDS 242098 - 242544 525 ## COG1846 Transcriptional regulators + Prom 242596 - 242655 2.8 218 101 Op 2 . + CDS 242686 - 242862 274 ## Predicted protein(s) >gi|226332012|gb|ACIB01000044.1| GENE 1 310 - 690 266 126 aa, chain + ## HITS:1 COG:no KEGG:BF3468 NR:ns ## KEGG: BF3468 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 126 5 130 130 214 100.0 1e-54 MGNTPYVVIVNEKSDEGQKVLEALEENIEKMDIGSHRELVIFFFVWLNHQQKDPKKRKNI RELAKIMHRSLFFGQKHNSNEEMKPDSIETEIFKILRILKSMKKAEDKDLIINLLDDISL FLDENV >gi|226332012|gb|ACIB01000044.1| GENE 2 756 - 2915 1737 719 aa, chain + ## HITS:1 COG:XF2121 KEGG:ns NR:ns ## COG: XF2121 COG5545 # Protein_GI_number: 15838712 # Func_class: R General function prediction only # Function: Predicted P-loop ATPase and inactivated derivatives # Organism: Xylella fastidiosa 9a5c # 382 602 116 341 501 76 28.0 2e-13 MKSISITTYRGFSKVKGKCSLQELIGWVRSRQYANLIEKIGRLVSEGKTKEAENVKRQLD YFTVTANYHECRLAHSIAAYNDTSTIDIDKLREEELERIRALIEADEATLACFLTAKQHG FKILAYLTDLEAEAWRNSFFKTATITYDRLEQYHAGIYELTRKHYEKLLQTEVDTSGKDL SRGVFASYDPKAFYSAERVARIPERTLTIEAPEPAQRGRKKKKEPETGQTGDISAYTCME FNKCLCSTQRLMKYTEGSRNSFLFTLGNKCFRKGLEESEVKRLAAERLGDGGGMDTDTPI GNAYTYTDRTERAEEKKKIPLVEQVIDYLNKNYAFRRNTVLDRLEMCDLSQTEEKSFYAM RNKDFNSIFLNISRQGIAYPLNSLKSVIDSDYSPEFNPFTHYFEGNARWDRKTDHIRKLA DTIQAEDQEFWREGFRRWIVAMVASALRPGKANQEALVLHGAQGKGKSTWIRHLLPPELA EYYRNGMIDPANKDDLLLLSTRLLINMEEFEGVKTGDIAELKRIIGQENVTIRKVYDTQA QLYPRRASFIGSTNNMQFLKDYGGNRRFLVIPVKTIDYRTPVDHKGVYAQAVQLIEDGFR 
YWFEGNEIDDINTRNERHRMKDPLEENLYVYFRPAGEKDFEVKWKPAAAILATLSVYGRT QANAQTQQVLVQILERDAFGKRVNIHGITEYAVVELTQQEVSENFRKREKGKEMDELPF >gi|226332012|gb|ACIB01000044.1| GENE 3 2899 - 3111 60 70 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNSRFKSTGGKEIEEKCPVRIYVIEITYILTGQDIYWRFIDKKILAILIRLHTYYKRHTY QQFYPYRLDI >gi|226332012|gb|ACIB01000044.1| GENE 4 3421 - 4947 474 508 aa, chain + ## HITS:1 COG:no KEGG:BF3470 NR:ns ## KEGG: BF3470 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 508 1 507 507 1008 99.0 0 MKYYLLFYSLIFLFIFCSCYDESLNFDHDIIQKDLDSNYTFIMNEADALDLVSYFINSME NSPATRVASLDTREVEGIVTFPKTRANLLPEWLQEFNKRFYIVNMKNDKGYAIISKDERA FPFYAILDHGRFNFNSIDSVSTSIYQGFIRRNKIDIERSDSLYLNKSLFPLTRSSLDDKT FFQIKELKKGLFRGFGRSSNYLLKTRWTQNVTRPNIYLQKSGATYYDVYGNTTRSATINT GDGSIQSTRLFGCTPVAFGQVLYGLRSHKGVKDLCYSNGQPVLWDRMEKGMNPEVERFLG WITMNCSPKVVDITFLGTSLTKPGVMVHNVDAKNFIKKILGDYLDIQYDNCVIWAGDLNG FTGNNEGKKIAEKFYTTNKECFAIFTGSQKTMPSDYHSYVVDRMCEILMKHRGVIVTPKE IVESFTPTQTNIMKEQGLLVIETNPNRIIYHEPYSVTICHVNYGWGNKHNGYYYYFNERT DGGLKYEGNEGFNNNFNHNLAYTIITPK >gi|226332012|gb|ACIB01000044.1| GENE 5 4952 - 5395 289 147 aa, chain + ## HITS:1 COG:no KEGG:BF3471 NR:ns ## KEGG: BF3471 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 147 1 147 147 304 100.0 6e-82 MKELKRQNQMEILKKGYKIFVLLFSIIIIAYSCRDGGVAAVVNFKPNPIRFSKEGGEYTI KVKGSRGPQWDYTNCTTHSGKEETMKMDTIDNIVHISNDWIEFMFNSGANCKYIDVKVKP NNSDMRRSLIFNVWSVALPTQLAVYQD >gi|226332012|gb|ACIB01000044.1| GENE 6 5596 - 5841 379 81 aa, chain - ## HITS:1 COG:no KEGG:BF3675 NR:ns ## KEGG: BF3675 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 81 1 81 81 151 100.0 7e-36 MKDLEDDEREFPIRVYTKVELALLYAPHLSENAALNNLSRWMRHNKLLMAALEEVGYYKY RHSFTPKEVRLIFRYMGEPGA >gi|226332012|gb|ACIB01000044.1| GENE 7 6109 - 6579 452 156 aa, chain + ## HITS:1 COG:no KEGG:BF3676 NR:ns ## KEGG: BF3676 # Name: not_defined # Def: putative non-specific DNA binding protein 
# Organism: B.fragilis # Pathway: not_defined # 1 156 1 156 156 277 100.0 8e-74 MAVPYKKIARKDPRKTDAIEKFYPQLVTLGQSASLESIAYEMKEKSSLSSGDIKSVLTNF VEAMRTSLYNGQSVNIRDFGVFSLSARTKGVDTEKECTAKNIMAVKINFRPSSSVRPNLT STRAGDKIEFIDIKAALEGKESEKGGDGDIVDDPTA >gi|226332012|gb|ACIB01000044.1| GENE 8 6793 - 7494 602 233 aa, chain - ## HITS:1 COG:AF0943 KEGG:ns NR:ns ## COG: AF0943 COG0120 # Protein_GI_number: 11498548 # Func_class: G Carbohydrate transport and metabolism # Function: Ribose 5-phosphate isomerase # Organism: Archaeoglobus fulgidus # 19 232 2 216 225 151 41.0 1e-36 MKWESSLVEHLQWSDTITNRESKEQLARMIAMRVQEGEIIGAGSGSTVYLALLAIAERIR TENLHVTVIPASMEISMECVRLGIPQTTLWMCRPDWTFDGADEVDPDHNLIKGRGGALFK EKLLICSSNRTFILVDESKQVSFLGSRFPVPIEVFPMALPYVEREVLTMGALRTELRLAK GKDGPVITENGNLILDAWFGNIHSSLEKEIKSITGVVENGLFMGYDVEVMVAK >gi|226332012|gb|ACIB01000044.1| GENE 9 7524 - 7664 109 46 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKSGNSSARPGSGRPKASGGKKPSGKRIGKSKPSKANNQLNRRVK >gi|226332012|gb|ACIB01000044.1| GENE 10 7645 - 8649 754 334 aa, chain - ## HITS:1 COG:no KEGG:BF3679 NR:ns ## KEGG: BF3679 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 334 1 334 334 675 99.0 0 MKLQKIYRQLWMEVHPRPQSADTDQWYVDFANRLLPLFEKSELTGQMIHKNRAVLYFTWY LEDCVNNSGGWNKFIRLHKRLYGRFLPFYTLTGAYADDEINFEDVSFLLWSLLSPVTDDS PVPWNPTDKSLLRLATDIYALLEAHFEQAPLTDDESMDWLPEIRALLPPPGPVLDIFPEM ELPHDVTKFLNATQGKQLVYFEDYAGLRRFCVDALEWADEDDSLMPELADEENFVFFANP KGILLAPNIGACFRDERNSTYNPGIAEQEGAELLYVPGLCPIDLLHYAMQHDLLSEVTFP FEDGRRVLHENWDFIARRYLGKYYNEDFYEEERK >gi|226332012|gb|ACIB01000044.1| GENE 11 8737 - 9255 416 172 aa, chain + ## HITS:1 COG:BMEI0379 KEGG:ns NR:ns ## COG: BMEI0379 COG1247 # Protein_GI_number: 17986662 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sortase and related acyltransferases # Organism: Brucella melitensis # 1 164 1 164 164 84 30.0 1e-16 MDITLRTLTENDLPFVKDIYDYYTLHTTVVYFVHCASIDELKNYIPVGDPVYRSFIIETP EGAPCGFCYFARFKPREAFRISVELTLYLKPEFTGRGYGKQAILRLEEIIRQEGFSNIMA 
LISGENEASIRLFEKCGFECCANIRQVAEKFGKKLDLRMYQKIISDNSHLTP >gi|226332012|gb|ACIB01000044.1| GENE 12 9344 - 9766 330 140 aa, chain + ## HITS:1 COG:no KEGG:BF3477 NR:ns ## KEGG: BF3477 # Name: not_defined # Def: putative DNA-binding protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 140 1 140 140 249 99.0 3e-65 MWQYLYIQEIIGTFATEIRRRAVMKTVNQIIGENLKKIRELSGFTQEQVAQSIKIERSTY SNYEGGTREIPYTILEDISNLFGCEPFILFEDNIQTNNEIMATAFRISNLGENDLKEIAA FKDIVKSYLKMERIAQNEAE >gi|226332012|gb|ACIB01000044.1| GENE 13 9939 - 10490 235 183 aa, chain + ## HITS:1 COG:no KEGG:BF3478 NR:ns ## KEGG: BF3478 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 183 63 245 245 371 100.0 1e-102 MLINSNQPRGRQHFTIAHELYHLYIEKKPTPHKCNPGCASKDPIEQCADMFASSLLMPEG GICQLIPEMELKTKNISMATVLKLEHYFSVSRSALLYRLQNIGLITESTRSQLAEIKVKY SAKCFGYDTALYEPANEGLVIGDFGEKARKLFEQEKISEGHYIELLHKININGTQENEDS TRC >gi|226332012|gb|ACIB01000044.1| GENE 14 10453 - 10977 334 174 aa, chain + ## HITS:1 COG:no KEGG:BF3688 NR:ns ## KEGG: BF3688 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 174 1 174 174 333 100.0 2e-90 MEHKKTKIVLDADVIIHFMEANYFSILPDIFPEYEYLILDVVYNEISQNSGTKDFIDKYL HFFPKLKKEVFSPKRESMKEFFLLQRTLGKGESACMIYCRDNRDVLGSSNLKDIKEYCSK NNITYLTTLDFLYYAYCRKKMTEQECKEFMQEVNNAGSKLPIIDITQYTCTVQI >gi|226332012|gb|ACIB01000044.1| GENE 15 11115 - 11720 783 201 aa, chain - ## HITS:1 COG:PA0966 KEGG:ns NR:ns ## COG: PA0966 COG0632 # Protein_GI_number: 15596163 # Func_class: L Replication, recombination and repair # Function: Holliday junction resolvasome, DNA-binding subunit # Organism: Pseudomonas aeruginosa # 1 199 1 198 201 111 35.0 7e-25 MIEYIKGEIAELSPATAVIDCNGLGYAVNISLNTYSAIQGKSSCKLYIYEAIREDAYVLY GFADKQERELFLLLISVSGIGGNTARMILSALSPAELVNVISTENANMLKTVKGIGLKTA QRVIVDLKDKIKTGAMAATAVGGAAGALLPAMNAEVQEEAIAALTMLGFAATPSQKAVLA ILKEEPDAPVEKVIKLALKRL >gi|226332012|gb|ACIB01000044.1| GENE 16 11879 - 12778 1089 299 aa, chain + ## HITS:1 COG:no 
KEGG:BF3690 NR:ns ## KEGG: BF3690 # Name: not_defined # Def: meso-diaminopimelate D-dehydrogenase # Organism: B.fragilis # Pathway: Lysine biosynthesis [PATH:bfr00300] # 1 299 1 299 299 598 99.0 1e-169 MKKVRAAIVGYGNIGRYVLEALQAAPDFEIAGVVRRAGAENKPAELNDYAVVKDIKELQG VDVAILCTPTRSVEKYAKEILAMGINTVDSFDIHTGIVDLRRELGACAKEHGAVSIISAG WDPGSDSIVRTMLEAIAPKGITYTNFGPGMSMGHTVAVKAIDGVKAALSMTIPTGTGIHR RMVYIELKDGYKFEEVAAAIKSDAYFVNDETHVKQVPSVDALLDMGHGVNLTRKGVSGKT QNQLFEFNMRINNPALTAQVLVCVARASMKQQPGCYTMVEVPVIDLLPGDREEWIGHLV >gi|226332012|gb|ACIB01000044.1| GENE 17 12920 - 13570 493 216 aa, chain + ## HITS:1 COG:lin1978 KEGG:ns NR:ns ## COG: lin1978 COG1272 # Protein_GI_number: 16801044 # Func_class: R General function prediction only # Function: Predicted membrane protein, hemolysin III homolog # Organism: Listeria innocua # 1 205 1 199 210 112 36.0 4e-25 MKNKRYSRGEELSNTLSHGAGTLLGITTGYFLLEKALANPHPYWATGCVLAYLVGMLASY ISSTWYHGSRPGKRKELLRKFDHGAIYLHIAGTYTPFTLLVLRHAGGWGWGIFTFVWLSA IVGFILAFKKLKEHSNLETICFVGMGSAILVALKPLMDCLSAIGASPAFWWLLGGGASYI MGAVFYSLRKPYMHAVFHLFCLGGSIGHIIAIWLIL >gi|226332012|gb|ACIB01000044.1| GENE 18 13911 - 16304 2494 797 aa, chain + ## HITS:1 COG:CAC1209 KEGG:ns NR:ns ## COG: CAC1209 COG1328 # Protein_GI_number: 15894492 # Func_class: F Nucleotide transport and metabolism # Function: Oxygen-sensitive ribonucleoside-triphosphate reductase # Organism: Clostridium acetobutylicum # 8 796 5 689 699 419 35.0 1e-116 MNYAEICIIKRDGKREDFSISKIKNAVSKAFSATGINDEQQLIADITMNVIGQFASPTIT VEEIQDLVEKELMKVRPEVAKKYIIYREWRNTERDKKTQMKHVMDGIVAIDKNDVNLSNA NMSSHTPAGQMMTFASEVTKDYTYKYLLPKRFAEAHQLGDIHIHDLDYYPTKTTTCIQYD MDDLFERGFRTKNGSIRTPQSIQSYATLATIIFQTNQNEQHGGQAIPAFDFFMAKGVSKS FRKHLASFISFYVQMNKGEEIEEKAIRTVIAEHLSSIKASELERETLRMALTALQINIDK EHLNQIIEKAFVQTQKDTHQAMEGFIHNLNTMHSRGGNQVVFSSINYGTDTSAEGRMVIE ELLKATVEGLGTRGEVPVFPIQIFKIKNGVSYTEADYEKAMANFDAAMEGKLKFEAPNFD LFLKACHTTAKALFPNFMFLDAPFNQNEKWDANDPQRYRYELATMGCRTRVFENVAGEKS SLGRGNLSFTTLNMPRLAIEARIKAENMTEDGHNKDAVERKAKEIFIESIHETATLIAEQ 
LYERYQYQRTALARQFPFMMGNDVWKGGAALNPNEQVGDVLRSGTLGIGFLGGHNAMVAL YGEGHGHSQKAWDTLYEAVLEINKVADEYKAKYNLNYSVLATPAEGLSGRFTRMDRRKYG KIKGVTDNDYYVNSFHVDVKEPISIVEKIKREAPFHAITRGGHITYVELDGEAQKNVRAI AKIVKVMFDEGIGYGSINHPVDTCHNCGYKGVIYDKCPVCQSENILRMRRITGYLTGDLS SWNSAKRAEEKDRVKHG >gi|226332012|gb|ACIB01000044.1| GENE 19 16311 - 16769 450 152 aa, chain + ## HITS:1 COG:CAC0481 KEGG:ns NR:ns ## COG: CAC0481 COG0602 # Protein_GI_number: 15893772 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Organic radical activating enzymes # Organism: Clostridium acetobutylicum # 10 152 12 153 153 137 49.0 5e-33 MNILYTYPETIVDGEGIRYSIYLAGCRHGCPGCHNPESWNPQAGEELSEGRLASIIREIN SNPLLDGVTFSGGDPFYDPEAFLPVIRRVKTETGQNIWCYTGYTYEEIESDPKLAAILPY IDVLVDGRFKQELYSPHLEFRGSSNQRIIKLK >gi|226332012|gb|ACIB01000044.1| GENE 20 17004 - 18410 903 468 aa, chain + ## HITS:1 COG:YPO1712 KEGG:ns NR:ns ## COG: YPO1712 COG0477 # Protein_GI_number: 16121972 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Yersinia pestis # 13 461 6 454 455 469 58.0 1e-132 MILHTQTPVVDETDGLPLPHRIWAVVGISFALCMSVLDINIINVVLPTLSHDFGTSPAVT TWIINGYQLAIVISLLSFSSLGEIYGYRKIFLSGIAMFIVTSLICALSHSFWTLTIARIF QGFSASAITSVNTAQLRTIYPRKQIGRGMGINAMVVAISAAAGPSVASGILSVASWHWLF AINVPLGLVALTLGLKYLPRKEERSNRKFDKLSAIANAITFGLLIYTLDGFAHHENNDYI VIQLAVLAVVGTYYVRRQLNQPSPLLPLDLLGIPIFRLSILTSICSFTAQMLAMVSLPFF LQNFLGYSEVMTGLLLTPWPIATLVTAPAAGYLVERIHPGILGSIGMALFCIGLYSLSTL TADSSVTGIILRLMLCGAGFGLFQTPNNSTIISSAPTRRSGGASGMLGMARLLGQTFGTT LVALLFSFVVHEKSTAVCLIAGSGFAFVAAVVSSMRLSQPSTLKTKPR >gi|226332012|gb|ACIB01000044.1| GENE 21 18575 - 19663 1274 362 aa, chain - ## HITS:1 COG:VCA0604 KEGG:ns NR:ns ## COG: VCA0604 COG0075 # Protein_GI_number: 15601362 # Func_class: E Amino acid transport and metabolism # Function: Serine-pyruvate aminotransferase/archaeal aspartate 
aminotransferase # Organism: Vibrio cholerae # 4 362 5 363 367 476 60.0 1e-134 MKPYLLLTPGPLTTSETVKETMMTDWCTWDEDYNLHIVEALRKELVGIATRNTEEYTSVL LQGSGTYCVEAVIGAAIGKNDKLLICSNGAYGDRMGNIAEYYHIDYELLAFDETEQVSVD YVDDYLSNNSDVTHVAFVHCETTTGILNPLKELAHVVKMHGKKLIVDAMSSFGGIPMDVS ELGIDFLISSANKCIQGVPGFGFIIARRSELVRCKGVARSLSLDIYDQWETMEKGHGKWR FTSPTHVVRAFKQALTELIEEGGVEARHRRYCENHRVLVEGMRSLGFVTLLDDAIQSPII TSFLYPKTGFDFKAFYTALKSKGFVIYPGKISKADTFRIGNIGDVHPEDFARLVEVVRET EY >gi|226332012|gb|ACIB01000044.1| GENE 22 19669 - 20460 837 263 aa, chain - ## HITS:1 COG:STM0432 KEGG:ns NR:ns ## COG: STM0432 COG0637 # Protein_GI_number: 16763812 # Func_class: R General function prediction only # Function: Predicted phosphatase/phosphohexomutase # Organism: Salmonella typhimurium LT2 # 1 261 2 264 270 182 36.0 7e-46 MKKIECIIMDWAGTAVDYGCFAPVAAFIKAFAGKGLTIDVEQTRKPMGLPKIQHIRELLT MPEVNEQFINRYRRAWTEEDVVELNHLFEKYLFASLKEYTDPIPGVIPTLEKLRAEGLKI GSTTGYTREMMDVVLPEAQAKGYRVDYCATPNLLPAGRPAPYMIFENLTKLAVPDPDTVV KVGDTIADIQEGVHAKVWSVGVVLGSNEMALTEEETHALPAAELENRIAEVKQRMLAAGA SYVIRSIEELPALIQLINSKLNH >gi|226332012|gb|ACIB01000044.1| GENE 23 20657 - 21535 913 292 aa, chain + ## HITS:1 COG:no KEGG:BF3490 NR:ns ## KEGG: BF3490 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 292 11 302 302 534 100.0 1e-150 MNELEDIYKRIEYLRNNGVKMKEIADWIHMAPSVLSALYSSVLPTYLNLQKTKPREEALD EALALVNNVSKKRLLGNLGEMKERLFDLEPNQEANITGNAFLKLLEKEMQESVGEVYNYS GTYLSYSLSSSTDSLKAEPYLICASENNDYVKVGMINAYKSVHWGSGIISNHQNSYLMFN ERELPQFALVTIYLQLPHYEFPHMLKGLYLCLDYNHNPIARRIVLVKQSDSTDIDEFLKM EGKLISRSELTPEEEIYYNYTCQEGDYIKTCTVPSPKLDESDLEREKKMLKI >gi|226332012|gb|ACIB01000044.1| GENE 24 21663 - 23018 1301 451 aa, chain - ## HITS:1 COG:aq_1964 KEGG:ns NR:ns ## COG: aq_1964 COG0750 # Protein_GI_number: 15606963 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted membrane-associated Zn-dependent proteases 1 # Organism: Aquifex aeolicus # 9 450 4 428 429 144 29.0 3e-34 
METFLIRALQLIMSLSLLVIIHEGGHFLFARLFKVRVEKFCLFFDPWFTLFKFKPKKSET EYAVGWLPLGGYVKIAGMIDESMDTEQMKQPEQPWEFRSKPAWQRLLIMVGGVLFNFLLA LFIYSMILFKWGDQYIPVQKAPLGMDFNETAKAVGFQDGDILLSADGVDFVRYDPDMLSQ IADAREVTVLREGKKASVYIPEDMMQRLLGDSVRFAEFRFPYVVDSVMVNSPAAMAGIQP GDSIIALDGKPVSYTDFLAAMAERRQNAKALQNDSINPHQISLTYVRDGKTDVLTLTTDS AFKIGVAVNPYTDQLLPVIRKEYGFFESFPAGVALGVKTLKGYVGNMKYLFSKEGAKQLG GFGTIGSIFPATWNWHQFWYMTAFLSIILAFMNILPIPALDGGHVLFLFYEIIARRKPSD KFMEYAQMAGMILLFGLLIWANFNDILRFFF >gi|226332012|gb|ACIB01000044.1| GENE 25 23038 - 24204 1183 388 aa, chain - ## HITS:1 COG:alr4351 KEGG:ns NR:ns ## COG: alr4351 COG0743 # Protein_GI_number: 17231843 # Func_class: I Lipid transport and metabolism # Function: 1-deoxy-D-xylulose 5-phosphate reductoisomerase # Organism: Nostoc sp. PCC 7120 # 7 380 3 380 399 345 46.0 1e-94 MNEIKKKQIAILGSTGSIGTQALQVIEEHPELYEVYALTANNKVDLLIAQARKFMPEAVV IANEEKYAQLKEALSDLPVKVYAGAAALCQIVESGPIDVVLTAMVGYAGLKPTMNAIRAG KAIALANKETLVVAGELINQLARQYRTPILPVDSEHSAVFQCLAGEVGNPIEKVILTASG GPFRTCTMEQLKTVTKVQALKHPNWEMGAKITIDSASMMNKGFEVIEAKWLFGVQPGQIE VVVHPQSVIHSMVQFEDGAIKAQLGMPDMRLPIQYAFSYPDRINSSFDRLDFSKCTNLTF EQPDTKRFRNLALAYESMYRGGNMPCIVNAANEVVVAAFLRDEISFLGMSDVIEHTMGQV SFVQTPTYDDYVATDAEARRIARELICK >gi|226332012|gb|ACIB01000044.1| GENE 26 24207 - 25067 800 286 aa, chain - ## HITS:1 COG:NMB1483 KEGG:ns NR:ns ## COG: NMB1483 COG0739 # Protein_GI_number: 15677336 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Neisseria meningitidis MC58 # 163 286 295 415 415 93 41.0 4e-19 MPKKKRSKAFWNNIKFKYKLTIINENTLEEVVGLHVSKLNGLSVLLSVLTVLFLFAAAII TFTPLRNYLPGYMNSDIRAQVVENALRVDSLQQLVDRQNMYIMNIQDIFSGTVRVDTVQS MDSLTTMREDSLIARSEREEAFRRQYEETEKYNLTSITAQPDVNGLIFYRPTRGMISDHF DAEKKHFGTDIAANPNESVLATLDGTVILSTYTAETGYLIEVQHNQDFVSVYKHCGSLLK REGDTVKGGEAIALVGNSGTLTTGPHLHFELWHRGRPVNPEKYIVF >gi|226332012|gb|ACIB01000044.1| GENE 27 25133 - 25675 500 180 aa, chain - ## HITS:1 COG:no KEGG:BF3701 NR:ns ## KEGG: BF3701 # Name: rimM # Def: 16S 
rRNA-processing protein RimM # Organism: B.fragilis # Pathway: not_defined # 1 180 1 180 180 344 100.0 1e-93 MIKREDVYKIGLFNKPHGIHGELSFTFTDDIFDRADCDYLICRLDDIFVPFFIEEYRFRS DSTALVKLEGVDTAERARMFTNVEVYFPVKHAEEAGPGELSWDFFVGFRVEDVRHGALGK VTDVDTSTVNTLFVVDRDGDELLIPAQEELIAGIDQKHKIITVDLPEGLLSLDECDDEES >gi|226332012|gb|ACIB01000044.1| GENE 28 25672 - 26976 1476 434 aa, chain - ## HITS:1 COG:BB0472 KEGG:ns NR:ns ## COG: BB0472 COG0766 # Protein_GI_number: 15594817 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine enolpyruvyl transferase # Organism: Borrelia burgdorferi # 1 434 16 439 442 383 45.0 1e-106 MASFVIEGGHRLSGEIHPQGAKNEVLQIICATLLTAEEVTVNNIPDILDVNNLIQLMRDM GVTVAKTGVDSYSFKAANVDLAYLESDNFLKKCSSLRGSVMLIGPMVARFGKAMISKPGG DKIGRRRLDTHFIGIQNLGADFTYNEEREIYEISAEELKGTSMLLDEASVTGTANIVMAA VLAKGKTTIYNAACEPYLQQLCKMLNRMGAKISGIASNLLTIEGVEELHGTDHTVLPDMI EVGSFIGMAAMTRSEITIKNVSYENLGIIPESFRRLGIKLEQRGDDIFVPAQDCYQIESF IDGSIMTIADAPWPGLTPDLLSVMLVVATQAKGSVLIHQKMFESRLFFVDKLIDMGAQII LCDPHRAVVIGHNHGFTLRGGNMTSPDIRAGIALLIAAMSAEGISRIHNIEQIDRGYQNI EGRLNAIGARITRI >gi|226332012|gb|ACIB01000044.1| GENE 29 27024 - 27641 763 205 aa, chain - ## HITS:1 COG:no KEGG:BF3703 NR:ns ## KEGG: BF3703 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 205 1 205 205 371 100.0 1e-102 MQYNTQQKRMPLPEYGRSIQNMVDFALTIQDRSERQRCANTIINIMGNMFPHLRDVPDFK HKLWDHLAIMADFKLDIDYPYEIIRKDNLVTKPDPIPYPSTKIRYRHYGRTLEILIKKAC EFQEGDEKKNLVALICNHMKKDYMAWNKDTVDDRKIAEDLAEFSGGKLQMDDEILRLMSE RIAQNYRPRTNNNNNQRNNNQRRKF >gi|226332012|gb|ACIB01000044.1| GENE 30 27693 - 28382 708 229 aa, chain - ## HITS:1 COG:XF1533 KEGG:ns NR:ns ## COG: XF1533 COG1214 # Protein_GI_number: 15838134 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Inactive homolog of metal-dependent proteases, putative molecular chaperone # Organism: Xylella fastidiosa 9a5c # 4 134 3 127 229 81 39.0 1e-15 
MSCILHIETSTAVCSVAVSEDGQNIFVKEDLKGPSHAVSLGVFVDEALSFIDSHAIPLDA VAVSCGPGSYTGLRIGVSMAKGICYGRNVPLIGIPTLEVLSVPVLLYHELPEDALLCPMI DARRMEVYAAIYDRALNVKREISADIVDENSYLEYLEQHPVYFFGNGAAKCREKITHPNA HFIDDLHPLAKMMFPLAEKAVAINDYKDVAYFEPFYLKEFVASQPKKLL >gi|226332012|gb|ACIB01000044.1| GENE 31 29194 - 29406 131 70 aa, chain + ## HITS:1 COG:no KEGG:BF3705 NR:ns ## KEGG: BF3705 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 70 30 99 99 125 100.0 5e-28 MSKRKLATQFEEEPFSYVFGCRTFVIEIPDSKIIMAENSKQFLLTKMSFLSIYSDYLKII PYFCTKLLVI >gi|226332012|gb|ACIB01000044.1| GENE 32 29415 - 30281 1118 288 aa, chain + ## HITS:1 COG:CAC1716 KEGG:ns NR:ns ## COG: CAC1716 COG1561 # Protein_GI_number: 15894993 # Func_class: S Function unknown # Function: Uncharacterized stress-induced protein # Organism: Clostridium acetobutylicum # 1 287 5 291 292 132 32.0 6e-31 MTGYGKATAELPDKKINVEIKSLNSKAMDLSARIAPAYREKEMEIRNEIARVLERGKVDF SLWVEKKECADAATPINQVLVEGYYNQIKAISENLHIAVPTDWFQTLLRMPDVMTRTETQ ELSEEEWGIVYAAVKEAVSHLVDFRKQEGAALEKKFREKIANIHRLLESVTPYEKERVDK VKERITDALEKTLNVDYDKNRLEQELIYYIEKLDINEEKQRLGNHLKYFISTLESGSGQG KKLGFIAQEMGREINTLGSKSNHAEMQKIVVQMKDELEQIKEQVLNVM >gi|226332012|gb|ACIB01000044.1| GENE 33 30294 - 30908 584 204 aa, chain + ## HITS:1 COG:FN2033 KEGG:ns NR:ns ## COG: FN2033 COG0194 # Protein_GI_number: 19705324 # Func_class: F Nucleotide transport and metabolism # Function: Guanylate kinase # Organism: Fusobacterium nucleatum # 20 196 8 181 185 143 42.0 2e-34 MNPTERITTPHQTGEAKVIIFSAPSGSGKSTIINYLLAQKLNLAFSISATSRPPRGNEKH GVEYFFLSPDEFRQRIANNEFLEYEEVYTDRFYGTLKAQVEKQLAAGQNVVFDVDVVGGC NIKKYYGERALSLFIQPPCIDELRRRLIGRGTDTPEVIESRIAKAEYELSFAPKFDKVII NDDLETAKAHALKVIKEFLGIDTE >gi|226332012|gb|ACIB01000044.1| GENE 34 30940 - 31545 362 201 aa, chain + ## HITS:1 COG:BS_yqeJ KEGG:ns NR:ns ## COG: BS_yqeJ COG1057 # Protein_GI_number: 16079618 # Func_class: H Coenzyme transport and metabolism # Function: Nicotinic acid mononucleotide adenylyltransferase # Organism: Bacillus subtilis 
# 5 195 3 189 189 117 35.0 2e-26 MAKTKTGIFSGSFNPIHIGHLALANYLCEFEGLDEVWFMVTPHNPFKNQADLWPDELRLQ LVQLAIEGYPRFRVSDFEFHLPRPSYTIHTLNRLKQEYPEREFQLIIGSDNWMVFDRWFE SERIVSENKILVYPRPGFSVDKSQLPPNVHVADSPIFEISSTFIREALATGKDIRYFLHP AVYKRIIQQTGSIDSSHSCHT >gi|226332012|gb|ACIB01000044.1| GENE 35 31549 - 32601 698 350 aa, chain - ## HITS:1 COG:mll3894 KEGG:ns NR:ns ## COG: mll3894 COG1408 # Protein_GI_number: 13473337 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # Organism: Mesorhizobium loti # 87 344 27 311 312 107 30.0 4e-23 MKKIIYLLAALLLLLSSCKSKKNLVSPIPRPVLNVDSVRPDSSDVVARLFSPDTSELKKI SVKRKREKTHVASPVITRSAPSIVGRGTRITSSAVSVSSVYPGIDRVKKYEFTHRDVPDA FDGFRIAFISDLHYKSLFKEKGLESLVRLLNAQHADVLLMGGDYQEGCQYVPELFAALAK VKTPMGTYGVMGNNDYERCHDEIIREMQRYGMRPLEHQIDTLRRDGAQIILAGVRNPFDL ANNGVSPTLSLSPSDFVILLVHTPDYAEDVSVANSDLVLAGHTHGGQVRILGYAPIIPSH YGSRFLTGLKYNSAKIPMIVTNGIGTSNKNIRIGAPAEIVIITLHRLRNE >gi|226332012|gb|ACIB01000044.1| GENE 36 32701 - 33597 634 298 aa, chain + ## HITS:1 COG:VNG1075G KEGG:ns NR:ns ## COG: VNG1075G COG1575 # Protein_GI_number: 15790173 # Func_class: H Coenzyme transport and metabolism # Function: 1,4-dihydroxy-2-naphthoate octaprenyltransferase # Organism: Halobacterium sp. 
NRC-1 # 10 296 11 311 311 180 37.0 3e-45 MEEVKRNSLQAWILAARPKTLAGAITPVMIGCALAFADGKFNWIPALICCLFAGLMQVAA NFINDLFDFLKGTDREDRLGPERACAQGWISAAAMKQGIFITVGLACLIGCTLLFYAGWE LILIGALCVLFAFLYTTGPYPLSYKGWGDVLVIVFFGFVPVGGTYYVQALNWTPNVTVAS LVCGLIVDTLLVVNNYRDRDADRKSGKKTVVVRFGESFGRYFYLLLGITAAWLCFWFLFN GHLYATLLPQLYLFFHIRTWKKMVQIHSGKKLNSILGETSRNMLLMGILLSIGLVING >gi|226332012|gb|ACIB01000044.1| GENE 37 33602 - 34741 1407 379 aa, chain - ## HITS:1 COG:FN1667 KEGG:ns NR:ns ## COG: FN1667 COG1088 # Protein_GI_number: 19704988 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-D-glucose 4,6-dehydratase # Organism: Fusobacterium nucleatum # 1 379 1 399 399 499 61.0 1e-141 MKTYLVTGAAGFIGANYLKYILAKHSDIKVVVLDALTYAGNLGTIANDIDNERCFFVKGD ICDRELADRLFGEYKFDYVVNFAAESHVDRSIENPQLFLMTNILGTQNLLDAARRAWVTG KDEYGYPTWRKGVRYHQVSTDEVYGSLGAEGYFHETTPLCPHSPYSASKTSADMVVMAYH DTYKMPVTITRCSNNYGPYHFPEKLIPLIIKNILEGKKLPVYGDGSNVRDWLYVEDHCKA IDLVVREGVEGEVYNVGGHNEKTNLEIVKLTIATIHRLMAEHPEYREVLKKKEKNADGEI SIDWINEDLITFVKDRLGHDQRYAIDPTKITNALGWYPETKFEVGIVKTIEWYLNNQEWV EEVTSGDYQKYYERMYSKR >gi|226332012|gb|ACIB01000044.1| GENE 38 34761 - 35624 938 287 aa, chain - ## HITS:1 COG:MTH1791 KEGG:ns NR:ns ## COG: MTH1791 COG1209 # Protein_GI_number: 15679779 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-glucose pyrophosphorylase # Organism: Methanothermobacter thermautotrophicus # 1 286 1 286 292 406 65.0 1e-113 MKGIILAGGSATRLYPLSKAISKQMMPVYDKPMIYYPLSTLMLAGIREVLVISTPRDLPL FRELLGSGEEFGMSFSYLVQEQPNGLAQAFVLGADFLNGEPGCLILGDNLFYGQGFSAML RRAATIEKGACIFGYYVKDPRAYGVVEFDGSGKVVSLEEKPAVPKSNYAVPGLYFYDATV TEKALALEPSARGEYEITDLNKLYLDEGTLKVELFGRGFAWLDTGNCNSLLEASNFVATI QNRQGFYVSCIEEIAWRNGWISSAQLKALGEKLEKTEYGKYLIDLAN >gi|226332012|gb|ACIB01000044.1| GENE 39 35801 - 36292 512 163 aa, chain - ## HITS:1 COG:PA0351 KEGG:ns NR:ns ## COG: PA0351 COG0622 # Protein_GI_number: 15595548 # Func_class: R General function prediction only # Function: Predicted phosphoesterase # Organism: Pseudomonas aeruginosa # 3 137 9 
134 157 61 31.0 6e-10 MTKVGLLSDTHSWWDEKYLQYFETCDEIWHAGDIGSVEVAQKLAAFRPFRAVYGNIDGQE IRRMFPQVNRFTVDGAEVLMKHIGGYPGNYDPSIKGSLLVHPPKLFISGHSHILKVKYDK TLDMLHINPGAAGMSGFHKVRTMVRFAIDNGVFKDLEVIELAG >gi|226332012|gb|ACIB01000044.1| GENE 40 36349 - 38424 1428 691 aa, chain + ## HITS:1 COG:ECs3363 KEGG:ns NR:ns ## COG: ECs3363 COG0855 # Protein_GI_number: 15832617 # Func_class: P Inorganic ion transport and metabolism # Function: Polyphosphate kinase # Organism: Escherichia coli O157:H7 # 1 684 1 682 688 422 36.0 1e-118 MENKYQYFKRDISWLSFNYRVLLEADDDRLPLYERINFISIYSSNLEEFYKIRVADHKAV ASGVTDGTEESLQSAKDLLEEINREVNRQLEDRIHIYEKKIIPALRKNHVIFYQSRNVEP FHQQFVKDFFREEIFPYLQPVPVSKDKVISFLRDNRLYLAVRLFLKGTNKEDADHLQYFV MKLPYSKVPRFIELPKHGRDYYLMFIEDIIKANIDVIFPGYEVDCSYCIKISRDADIMID DTINSVDLVEQVKKKIKKRKIGAVCRFVYDRAMPDDFLNFLVDAFRIRHEELVPGDKHLN LEDLRHLPNPNHSIPRIERPIPMKLNRLNDKESIFSYVEKKDLLLYYPYHSFDHFIHFLY EAVHNPETREIMVTQYRVAENSAVINTLIAAAQNGKKVTVFVELKARFDEENNLATAEMM KAAGINIIYSIPGLKVHAKVALIRRRSFTGEKIHSYAYISTGNFNEKTATLYADCGLFTS NPVIVHDLTNLFRTLRGKENPRFTRLLVARFNLIPELNRLIDKEIELAEKGRGGRIILKM NALQDPIMIDRLYEASQKGVKIDLIVRGICCLIPGQEYSCNIRVTRIVDSFLEHARIWYF GNAGHPKVYMGSPDWMRRNLYRRIEAVVPILDNELREEIVDMLHIQLSDNQKACFVDDKL NNIFKFKTNAAPVRAQYTFYNYLKEKNETFL >gi|226332012|gb|ACIB01000044.1| GENE 41 38490 - 40739 2238 749 aa, chain + ## HITS:1 COG:no KEGG:BF3715 NR:ns ## KEGG: BF3715 # Name: not_defined # Def: putative phosphate/sulphate permeases # Organism: B.fragilis # Pathway: not_defined # 1 749 1 749 749 1428 99.0 0 METIYLCIIIFLFVLAVFDLMVGVSNDAVNFLNSAVGAKAASFKTILFIAGAGIFIGASL SNGMMDIARHGIYQPEHFYFAEIMCILLAVMLTDVVLLDVFNSMGMPTSTTVSMVFELLG GTFALALIKVHNSDTLGLGDLINTDKALSVIMAIFVSVAIAFFFGMLVQWLARMVFTFNY KSNIKYSIALFGGIASTAIVYFMVIKGLKDSSFMTPENKQWVQENTMMLVSCFFVISTIL MQILHWLKVNVFKVVVLLGTFALALAFAGNDLVNFIGVPLAGYSSFIDYTANGTSVGPDG FLMTSLMGSAKTPWYFLIGAGAVMVYALCTSKKAHAVIKTSVDLSRQDEGEENFGSTPIA RTLVRFSLTLANGISRITPPSAKRWIDTRFRKDEAIIADGAAFDLVRASVNLVLAGLLIA VGTSLKLPLSTTYVTFMVAMGTSLADRAWGRDSAVYRITGVLSVIGGWFITAGAAFTICF 
FVAMVIHFGGSIAIIALIGLAAFTLIRSQLMYKKKKEKEKGNETLKQLMQATSSHKALEL MRKHTREELSKVLEYAEQNFELTVTSFLHENLRGLRRAMGSTKFEKQLIKQMKRTGTVAM CKLDNHTVLEKGLYYYQGNDFASELVYSISRLCEPCLEHIDNNFNPLDAIQKGEFGDVAE DITYLIQQCRQKLEGNNYSNLEEDLHRANDLNSQLSHLKRQELQRIQSQTGSIKVSMVYL TMIQEAQNVVTYTINLMKVSRKFQIETDI >gi|226332012|gb|ACIB01000044.1| GENE 42 40904 - 41302 353 132 aa, chain - ## HITS:1 COG:all4097_4 KEGG:ns NR:ns ## COG: all4097_4 COG0784 # Protein_GI_number: 17231589 # Func_class: T Signal transduction mechanisms # Function: FOG: CheY-like receiver # Organism: Nostoc sp. PCC 7120 # 11 119 1 110 133 80 43.0 6e-16 MDIEEIKDFRPLILVAEDDDSNFKLIKAIIGKKCDILWAKNGEEMLNLYREHTQDAHAIL MDIKMPIMNGLEATRIIREEGASLPIIMQTAYAFSSDRENAMQAGASEVLVKPITVSALR GCLSSYFPEIKW >gi|226332012|gb|ACIB01000044.1| GENE 43 41411 - 43045 2001 544 aa, chain + ## HITS:1 COG:all4183 KEGG:ns NR:ns ## COG: all4183 COG0488 # Protein_GI_number: 17231675 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Nostoc sp. 
PCC 7120 # 1 533 1 531 564 405 41.0 1e-112 MISVDGLAVEFGGTTLFSDISFVINEKDRIALMGKNGAGKSTLLKILAGVRQPTRGRISA PKECVIAYLPQHLMTEDGRTVFEETAQAFAHLHEMEAEIERMNKELETRTDYESDSYMEL IENVSTLSEKFYAIDATNYEEDVEKALLGLGFMREDFHRQTSDFSGGWRMRIELAKLLLQ KPDVLLLDEPTNHLDIESIQWLEDFLINNGKAVIVISHDRKFVDNITTRTIEVTMGRIYD YKVNYSKYLELRKERREQQQKAYEEQQKFIAETKDFIERFKGTYSKTLQVQSRVKMLEKL EILEVDEEDTSALRLKFPPSPRSGNYPVIMDGVGKSYGDKIVFRNATLTVERGDKVAFVG KNGEGKSTLVKCIMNEIEHDGTLTLGHNVQIGYFAQNQASLLDENLTVFQTIDDVAKGEI RNKIRDLLGAFMFGGPEESMKKVKVLSGGERTRLAMIKLLLEPVNLLILDEPTNHLDLKT KDILKQALKDFDGTLIVVSHDRDFLDGLVTKVYEFGNQKVTEHLCGIYEFLDKKKMDSLR ELEK >gi|226332012|gb|ACIB01000044.1| GENE 44 43164 - 43439 307 91 aa, chain + ## HITS:1 COG:no KEGG:BF3718 NR:ns ## KEGG: BF3718 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 91 1 91 91 75 97.0 4e-13 MKKLVFCAAFIAAMCMAGTTTAQAQDVKKKEVKKEQCDKKDSKACCKKEEKACCKKEADK KTTDGCKHKADCKAKAGCKDSKCTKDKGGKK >gi|226332012|gb|ACIB01000044.1| GENE 45 43457 - 43684 72 75 aa, chain - ## HITS:1 COG:no KEGG:BF3719 NR:ns ## KEGG: BF3719 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 46 1 46 53 82 93.0 5e-15 MYRLFLILNLDYCEAKIVYHWIKNKQCYLKNKKTDASYFLPRSITEPFFVNDQRPKIDSM LLCVLCGEFRHTCLK >gi|226332012|gb|ACIB01000044.1| GENE 46 43665 - 44069 87 134 aa, chain + ## HITS:1 COG:no KEGG:BF3720 NR:ns ## KEGG: BF3720 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 263 100.0 2e-69 MRNKRYIVSFLLFISMIMLVVPVIPHHHHADGVICMKNDLTPEPQCPKHYHHPGNDSCCN DGCMTRLNSPTPSVQADNNPHYLFTAILFTDFIIENLFRPQERRIKNYYAYRESLHGTAV NRAFGLRAPPYPVV >gi|226332012|gb|ACIB01000044.1| GENE 47 44241 - 45458 1303 405 aa, chain + ## HITS:1 COG:RSp1041 KEGG:ns NR:ns ## COG: RSp1041 COG0845 # Protein_GI_number: 17549262 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Ralstonia solanacearum # 128 384 110 375 382 77 26.0 6e-14 
MKKLIFMGILGLFILGSCNSKSGGNHEGHDHGTEAHDHEHEGEDHEGHDHEGDEHSRSSE PATGHSDEIILPKAKAEAAGVKTSIIEPEVFEQVIKTSGQVLAAQGDESVAVATVAGVVS FRGKVTEGMSVGKGTALVTISSSNIADGDPVQRARIAYDISRKEYERMQALVKNKIVSDK EFAQAEQNYENARISYEALAKNHSAGGQAVTSPISGFVKNILVKEGDYVTIGQPLVSITQ NRRLFLRAEVSEKYYPSLRTIGSANFKTPYDNKVYELKELNGRLLSFGKSAGENSFYVPV TFEFDNKGDIIPGSFVEVYLLSSPMENVLSLPRTALTEEQGLFFAYLQLDEEGYKKQEVT LGADNGKSVQVLSGIKAGDRVVTQGAYQVKLASASNAIPAHSHEH >gi|226332012|gb|ACIB01000044.1| GENE 48 45517 - 48603 3252 1028 aa, chain + ## HITS:1 COG:all7618 KEGG:ns NR:ns ## COG: all7618 COG3696 # Protein_GI_number: 17158754 # Func_class: P Inorganic ion transport and metabolism # Function: Putative silver efflux pump # Organism: Nostoc sp. PCC 7120 # 1 1020 1 1019 1058 765 42.0 0 MLNKIIHFSLQNRILVLVASVLLLIGGTYTAMHTEVDVFPDLNAPTVVIMTEANGMAAEE VEQLVTFPVETAVNGATGVRRVRSSSTNGFSVVWVEFDWGTDIYLARQIVSEKLAIVGEE LPSNVGKPTLGPQSSILGEVLIIGLTADSTSMLDLRTIADWTIRPRLLSTGGVAQVAVLG GELKEYQIQLDPERMRHYGVSMNEVMTVTRGMNLNANGGVLYEYGNEYIVRGVLSTANIE QLGKAVVKSIDSVPVLLEDIADVRIGPKAPKLGTASERGKPAVLMTVTKQPATSTLELTD KLEASLQDLRKNLPPDVKVSTDIFRQSRFIDSSISNVKKSLFEGGIFVVIVLFLFLANVR TTIISLVTLPLSLLVSILTLHFMGLTINTMSLGGMAIAIGSLVDDAIVDVENVYKRLREN RLLPENERLSVIQVVFNASKEVRMPILNSTLIIVVSFVPLFFLSGMEGRMLVPLGIAFIV ALFASTIVALTLTPVLCSYLLGKEKGDKLPKEAFVARWMKGVYEKALTWVLIHKRLTLGS TIGLFIITLGFFFTLGRSFLPPFNEGSFTINISSLPGISLEESDKMGHRAEELLLSIPEI QTVARKTGRAELDEHALGVNVSEIEAPFELKDRSRNELMADVREKLGTITGANIEIGQPI SHRIDAMLSGTKANIAIKLFGDDLNKMFSLGNQIKEAIGNIPGIADLNVEQQIERPQLKI TPKREMLAKYGITLPEFSEYINVALAGEVISQVYEQGKSFDLIVKVKNNFRDEAEKIRNL MVDTQDGKKVPLSYIADVASSMGPNTINRENVKRKIVISANVADRDLRSVVNDIQKQVDE QIKLPEGYHIEYGGQFESEQAASRTLALTSFMSIVVIFLLLYHEFRSVKESAVILINLPL ALIGGVFALLITTGEISIPAIIGFISLFGIATRNGMLLISHYNHLQQVEGLGVYESVIRG SLDRLNPIVMTALSSALALIPLALSGSLPGNEIQSPMAKVILGGLLTSTFLNGFIIPIVY LMMNGKRK >gi|226332012|gb|ACIB01000044.1| GENE 49 48623 - 49801 1231 392 aa, chain + ## HITS:1 COG:no KEGG:BF3723 NR:ns ## KEGG: BF3723 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 392 1 392 392 683 100.0 0 MKRITILAATLFALSGLQAQTGIDGVLRNIETNNKELQANAQLIASQKLETRTDNNLPDP TLSYAHLWNNKDKNNTIGELVVSQSFDFPSLYATRNQLNRLKAGAFDGQKSVFRQGILLQ AKDVCLDIIMLRKQQQILTERLRNAEELSAMYAKRLQTGDANVIETNKINLELLNVKTEA SLNETALRNKIQELTALNGNIPVVFEDADYPAVIFPSNYEELKTEVLASDYTLQALNSES AAARKQIAVNKSQWLPKLELGYRRNTESGEPFNGVVVGFSFPLFENRNKVKIAKAQSLNV DLQRANTSVQVESELTQLYREAHTLRTSMEEYEKTFQAQQDLSLLKQALTGGQISMIEYF VEVSVVYQSKQNYLQLENQYQKAMAKIYKNKL >gi|226332012|gb|ACIB01000044.1| GENE 50 49965 - 50993 980 342 aa, chain - ## HITS:1 COG:no KEGG:BF3724 NR:ns ## KEGG: BF3724 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 342 5 346 346 701 100.0 0 MIASCLLACSGFVSAQMTGGNPEEVKQTAPAPLYRDPVYDGVADPVVVWNKEDRSWWMLY TQRRANVNAGNVAYCYGNDIGIASSRDHGRTWVYRGVLDLNMERGKNTFWAPEVVNFNGV YHLFVSYIEGVRTDWGGHARMAHYTSKNMWDWKFEGFVKLSSDKTIDATFFRMPDGKWRA WYKDETRNAAIMTAESDDLFHWTLNDTPVIDQSRQEGPKVFRFGGYYWMLTDEWHGMRVY RSKDATTWEKQGVILDKPGTRPEDTPSGAHGDVVVVGDKAYVIYFTHPGRKAHSEETKDE DGNIPYHLRRSSAQVAELLIKDGQLVADRSPEFNFYLPDMEE >gi|226332012|gb|ACIB01000044.1| GENE 51 51071 - 51820 739 249 aa, chain - ## HITS:1 COG:no KEGG:BF3725 NR:ns ## KEGG: BF3725 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 249 1 249 249 495 99.0 1e-139 MKTNFCRFLIGLVAFVVGIPVMAQTGPDKVKPFSHLSVSLNAGTLGGGLQVAAPLNDYLG LRAGFSLLKFKCNYDYDGIRDDQLIQDAGTRTGYNPDKYYTVPLKAKANMTNGMLLLDYF PFKRSVFHVTAGLLFGTSSILKVSGQTDERIEVGDIIIEPGADGRVEAALKTNAVKPYVG IGFGRSVAHSRVGFKFELGAMFHGNPKIEATTGKIVEEAIDQDLSRFNKFLKNFKAYPVL NFQLSYRIF >gi|226332012|gb|ACIB01000044.1| GENE 52 51856 - 53274 1121 472 aa, chain - ## HITS:1 COG:SP1402_1 KEGG:ns NR:ns ## COG: SP1402_1 COG0144 # Protein_GI_number: 15901256 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA and rRNA cytosine-C5-methylases # Organism: Streptococcus pneumoniae TIGR4 # 1 293 1 280 280 173 35.0 8e-43 MKLPISFIESTRALMGDEEYQELSVALEQEPPASVRLNSKFPGLAACSSISGRIPWAAEG 
YYLNQRLTFTFDPLFHAGCYYVQEASSMFVEQVLRRYVTTPVKMLDLCAAPGGKSTHARS VLPEGSLLVANEVIRNRSQILAENLTKWGYADVVVTNNDPSDFSRIGSFFDVILTDVPCS GEGMFRKDPGAIEEWSPENVEICWQRQRRIITDIWPCLKPGGILIYSTCTYNTREDEENI TWIRQEFGAEPLPLAVPAEWNITGSLLVGVDAPVYRFLPHKTQGEGFFLAALRKPEEDRE ADTDLFSPKKKKSKGDTAASPVSKENLAIAKSWLASSDEYNLLVNGTAITAFPVYYQNDL ALLRQSLRIVQAGTEVAEVKGKDLIPAHGLAMSRLLKTSVFSTEALTYEQAIAYLRKEAI VLPPSVPKGYVLVTYKEIPLGFVKNIGNRANNLYPQEWRIRSGYLPDDIRTL >gi|226332012|gb|ACIB01000044.1| GENE 53 53271 - 53528 296 85 aa, chain - ## HITS:1 COG:no KEGG:BF3727 NR:ns ## KEGG: BF3727 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 85 1 85 85 145 100.0 3e-34 MRAFVAVRQLVSALPVNDITRLQDEIKELKEYVEAAFADYNDINEDTRMQLELINQAIAE LQAKDKQAGGKTRNPIGFISYNKEK >gi|226332012|gb|ACIB01000044.1| GENE 54 53476 - 53841 258 121 aa, chain - ## HITS:1 COG:no KEGG:BF3518 NR:ns ## KEGG: BF3518 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 121 2 122 122 243 96.0 1e-63 MELQIIQSKIYGIRGQKVMLDFDLAGLYQVETHVLNQAVKRNSKRFPTDFMFRLNTEEWE ILKSQIVISSWGGTRKLPFAFTEQGLAMLSGVLNSDRHRSEYFHYASFCSCPPIGLGSSG E >gi|226332012|gb|ACIB01000044.1| GENE 55 53960 - 54586 666 208 aa, chain + ## HITS:1 COG:no KEGG:BF3730 NR:ns ## KEGG: BF3730 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 208 1 208 208 420 100.0 1e-116 MKRIIIALMVAVTFCSLAMAQNPTITQDNKNSSDSTIVTEYTDIVDSSTVDTDYQSSSFD FGNEFPFNIDKGAINGGILTGLVVIILIFGFPFFIVFIAFYFRYKNRKAKYRLMEQALAT GQPLPEGIFKDTLPQDYRTKGIKNICTGIGLFIFLWAITDEFSIGCIGLLVMFTGIGQWI ISRNQQHERPEDPFTRPTHKDETLNEQK >gi|226332012|gb|ACIB01000044.1| GENE 56 54586 - 55134 439 182 aa, chain + ## HITS:1 COG:RSc1055 KEGG:ns NR:ns ## COG: RSc1055 COG1595 # Protein_GI_number: 17545774 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Ralstonia solanacearum # 1 174 1 188 199 95 34.0 6e-20 
MSQLNDISLVAQVVVFRNTRAFDQLVQKYQSPVRRFFLNLTCGDSELSDDLAQDTFIKAY TNIANFKNLSSFSTWLYRIAYNVFYDYIRSRKETTDLDAREVDAANSTEQVNVGEQMDVY QSLRTLKEIERTCITLFYMEDVSIDKIAGITGCPVGTVKSHLSRAKDKLAIYLKQNGYDG NR >gi|226332012|gb|ACIB01000044.1| GENE 57 55118 - 55447 365 109 aa, chain + ## HITS:1 COG:no KEGG:BF3732 NR:ns ## KEGG: BF3732 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 214 100.0 9e-55 MTETDDKLLKQFFGEQKQEIEDNGFSRRVMRNLPGRNHRLVQAWGAACAVVCVILFFTLG GLQATISTLREVFVSMVQQSATTGFDPKSLYIAALVLAFFGARKAWSMA >gi|226332012|gb|ACIB01000044.1| GENE 58 56002 - 57198 841 398 aa, chain + ## HITS:1 COG:BS_ywnE KEGG:ns NR:ns ## COG: BS_ywnE COG1502 # Protein_GI_number: 16080712 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine/phosphatidylglycerophosphate/cardioli pin synthases and related enzymes # Organism: Bacillus subtilis # 15 398 103 482 482 261 37.0 1e-69 MQREDTTQFIRSDSLVLQFLEYSNIPITDNNKVKLIKSGREKFEDLFEAIRGAKHHIHLE YFNFRNDSIANALFDLLGEKVKEGVKVRAMFDAFGNWSNNKPLKKRHLKAIREKGIEIVK FDPFKFPYINHAAHRDHRKIAVIDGKIGYTGGMNIADYYINGLPKIGTWRDMHIRIEGDA VNILQEIFLDIWNKTTKQNIVGEEYFPNHPERADSCNTVISIVDRTPKRNSRMLSHTYAM SIYAAQHDVRIVNPYFVPTSSIRKALKRALNRGTKVEIMISSKSDIPFTPDASLYAVQKL MKKGAMIYLYNGGFHHSKIMMVDDLFCTVGTANLNSRSLRYDYETNAFIFDKDITQQLND VFEADMLHCTRLTPEMWKQKSAWKKFKCWFANLFTPFL >gi|226332012|gb|ACIB01000044.1| GENE 59 57264 - 58058 747 264 aa, chain + ## HITS:1 COG:BH3451 KEGG:ns NR:ns ## COG: BH3451 COG0207 # Protein_GI_number: 15616013 # Func_class: F Nucleotide transport and metabolism # Function: Thymidylate synthase # Organism: Bacillus halodurans # 1 264 1 264 264 422 70.0 1e-118 MKQYLDLLDRVLTEGIEKGDRTGTGTISVFGHQMRFNLDEGFPCLTTKKLHLKSIIYELL WFLKGDTNVKYLQDHGVRIWNEWADETGDLGHIYGYQWRSWPTYNGGFIDQISEAVDAIK HNPDSRRIIVSAWNVGDLDHMNLPPCHAFFQFYVANGRLSLQLYQRSADIFLGVPFNIAS YALLLQMMAQVTGLQAGDFIHTLGDAHIYLNHLEQVKLQLSREPRPLPQMKINPDVKNIF DFQFEDFELVNYDPHPHIAGKVAV >gi|226332012|gb|ACIB01000044.1| GENE 60 58073 - 58573 331 166 aa, chain + ## 
HITS:1 COG:RSc0946 KEGG:ns NR:ns ## COG: RSc0946 COG0262 # Protein_GI_number: 17545665 # Func_class: H Coenzyme transport and metabolism # Function: Dihydrofolate reductase # Organism: Ralstonia solanacearum # 1 163 1 161 167 126 42.0 1e-29 MSRISIIAAVDSRMAIGFQNKLLFRLPNDLKRFKALTTGNTIIMGRKTFESLPKGALPNR RNVVLSSNPAAECPGAEVFTSLEAALESCQAEEKVYIIGGASVYRQTISLADELCLTEVN DTAPEADAFFPAVDTTIWHEKSREVHPADEKHLCSYAFVDYVRGID >gi|226332012|gb|ACIB01000044.1| GENE 61 58570 - 59046 431 158 aa, chain - ## HITS:1 COG:VC0071 KEGG:ns NR:ns ## COG: VC0071 COG1522 # Protein_GI_number: 15640103 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Vibrio cholerae # 5 149 6 151 153 118 42.0 3e-27 MGHHQLDTLDEQILKLIADNARIPFLEVARACNVSGAAIHQRIQKLTNLGILKGSEYVID PEKIGYETCAYIGIYLKDPESFDSVTKALEAIPEVVECHFTTGKYDMFIKIYARNNHHLL SVIHDKLQPLGLARTETLISFHEAIKRQMPIMVDTDED >gi|226332012|gb|ACIB01000044.1| GENE 62 59197 - 60477 1110 426 aa, chain + ## HITS:1 COG:no KEGG:BF3737 NR:ns ## KEGG: BF3737 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 426 1 426 426 895 99.0 0 MKLILTLFTCLFVTGCAYAQNFSDYFTNKTLRIDYLFTGNADKQSICLDELSELPVWAGR RHHLSELPLEGNGQIVMRDVASGKVIYTTSFSSLFQEWLETDEAKEVTKGFENTYLLPYP IKPAEVEITLRNNKREVSANLKHVVKPDDILIHKKGLTHITPHKYLLKSGNEEQCIDVAI LAEGYTTSEMETFYKDAAIACEALFSHEPFQSMKNRFNIVAVASPSADSGVSAPKQGAWK HTAFGSHFDTFYSDRYLTTSRVKAINDALAGIPYEHIIILANTEQYGGGGIYNAFTLTTA HHPNFRPVVVHEFGHSFGGLADEYFYDENVMNGLYPLNIEPWEQNITTRINFASKWEDML TKATPVPTPVADKDKYPIGVYEGGGYSAKGIYRPAFDCRMRTNEYPTFCPVCQRAIQRII EFYTGK >gi|226332012|gb|ACIB01000044.1| GENE 63 60985 - 61449 505 154 aa, chain - ## HITS:1 COG:no KEGG:BF3739 NR:ns ## KEGG: BF3739 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 154 1 154 154 274 100.0 8e-73 MGKFNKTGKRGMPTLNTSSLPDLIFTLLFFFMIVTTMREVSLKVEFKIPQGTELEKLEKK SLVTFIYVGKPTAEFRKKLGSESRIQLNDAYAEVDEIQAYVTNERSSMKEEDQPFMTVSL KIDQDTKMGIVTDIKQALRQAYALKINYSARARE >gi|226332012|gb|ACIB01000044.1| GENE 64 
61449 - 62039 615 196 aa, chain - ## HITS:1 COG:no KEGG:BF3740 NR:ns ## KEGG: BF3740 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 196 1 196 196 362 100.0 5e-99 MAKGNRKVPEINSSSTADIAFLLLIFFLITTSMDTDRGLARQLPPPPEKDQVDNDVVVKD RNVLQIFLNFQNQLMCGGEIVDVKQLREKAKEFIANPYNDEKLPEKHAKDVPFFGNVMVT ENHVISLQSDRGSSYQAYFDVQNELVAAYNELRDELAQEKWQKNYADLNEDQQKAIRDIF PQKISEAEPKGVKKNN >gi|226332012|gb|ACIB01000044.1| GENE 65 62076 - 62546 427 156 aa, chain - ## HITS:1 COG:no KEGG:BF3741 NR:ns ## KEGG: BF3741 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 156 1 156 156 224 100.0 5e-58 MSKLTYKVSYYVLYAMFALIVIVLGLFYFGGQMETPIVYDMDNPANTDALLYLMYGLFGI AVVATVVAAIFQFGSALKDNPKGAIRSLLGLILLVLVLVVAWSMGSGETLTIQGYEGTDN VPFWLKLTDMFLYSIYFLMLVTVLAIIGSSIKKKLS >gi|226332012|gb|ACIB01000044.1| GENE 66 62553 - 63374 841 273 aa, chain - ## HITS:1 COG:MTH1022 KEGG:ns NR:ns ## COG: MTH1022 COG0811 # Protein_GI_number: 15679040 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport proteins # Organism: Methanothermobacter thermautotrophicus # 81 254 26 198 279 72 30.0 1e-12 MKKLFAIVAVMGVLTFGSTQLVQAQDAAATEQAAPAADKPVADAVAEAAEASAPLAGAEE GGIHKEIKVKFIEGTASFMSLVAIALVIGLAFCIERIIYLSLAEINTKKFMSSIEAALEK GDVEAAKDIARNTRGPVASIYYQGLMRIDQGIDVVEKSVVSYGGVQAGYLEKGCSWITLF IAMAPSLGFLGTVVGMVMAFDKIQQQGDISPTVVAGGMKVALITTIFGLVVALILQVFYN YILAKIEALTSEMEDSSISLLDMVIKYNLKYKK >gi|226332012|gb|ACIB01000044.1| GENE 67 63702 - 64478 756 258 aa, chain - ## HITS:1 COG:VC0103 KEGG:ns NR:ns ## COG: VC0103 COG0084 # Protein_GI_number: 15640135 # Func_class: L Replication, recombination and repair # Function: Mg-dependent DNase # Organism: Vibrio cholerae # 2 258 1 255 255 201 40.0 2e-51 MLVDSHSHLFLEEFAEDLPFVMERARAAGVTHIFMPNIDSTTIEPMLSVCDTYRDFCFPM IGLHPTSVNESYEKELEIVAANLETSGRFVAVGEIGIDLYWDKTWLKEQLIAFEKQVQWA LHYQLPIVIHCREAFDYIYKVLQPYKNSGLTGIFHSFTGTPGEAARMMEFPGFKIGINGV 
VTFKKSTLPETLKSVPLERIVLETDSPYLTPVPNRGKRNESANVKDTLIKVAEIYNEDPE KVAELTAVSALKVFGMLK >gi|226332012|gb|ACIB01000044.1| GENE 68 64482 - 65192 531 236 aa, chain - ## HITS:1 COG:no KEGG:BF3744 NR:ns ## KEGG: BF3744 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 236 1 236 236 468 100.0 1e-130 MPYRRLPNTDQARIRALKTAVVKGDMCDVYDLPVSLKTLGEARIFLSKFETAHSYYVHCY GEQSRNSKKHQANVKTARLYISHFIQVLNLAVIRMEIKESHKALYGLPVDNFSVPDLSSE ASLAEWGQKIIEGERKRTSQGGIPIYNPTIAKVKVHYDIFMEGYEKQKSLQSLTNRSLEQ LASMRVQADRLILDIWNQVEAKFQDVSPNEKRLEKCRDYGLIYYYRTGEKQNKEIL >gi|226332012|gb|ACIB01000044.1| GENE 69 65200 - 66174 857 324 aa, chain - ## HITS:1 COG:MK0774 KEGG:ns NR:ns ## COG: MK0774 COG0142 # Protein_GI_number: 20094211 # Func_class: H Coenzyme transport and metabolism # Function: Geranylgeranyl pyrophosphate synthase # Organism: Methanopyrus kandleri AV19 # 11 323 16 323 324 192 36.0 8e-49 MYTAFQLLDKINSHISDIQFTRTPAGLYDPIKYVLSMGGKRIRPVLMLMAYNLYKEDVSS IYDPATAIEVYHNYTLLHDDLMDRADMRRGKTTVHKVWNDNTAILSGDAMLVLAYQYMAA SSSEHLKEVMDLFSLTALEICEGQQMDMNFESREDVKEEEYLEMIRLKTAVLLAASLKIG AILGGASPEDAENLYDFGMQIGVAFQLQDDLLDVYGDPAVFGKNIGGDILCNKKTYMLIK ALERADRDQLEGLNHWLSATSFCPEEKISAVTELYTQIGIKAVCENKMREYYTRAMTSLG AVSVIEDKKSELKKLMKHLMYREM >gi|226332012|gb|ACIB01000044.1| GENE 70 66255 - 66938 590 227 aa, chain - ## HITS:1 COG:no KEGG:BF3746 NR:ns ## KEGG: BF3746 # Name: not_defined # Def: TonB # Organism: B.fragilis # Pathway: not_defined # 1 227 1 227 227 387 100.0 1e-106 MEVKKSPKADLEGKKTQWLLIGYVVVLAFIFVAFEWTERDIKIDTSQAVAQIEFEEEMIP ITQQEEKPAPPPVEVPKQAEILKIVDDEADVQETAIASTEDTGQKVEVKYVPVEVKEEEP SEQEIFEVVENAPEFPGGMPACLQFLYKNIKYPPIAQENGTQGQVVLQFVVERDGSIGDI KVVKSVDPYLDKEALRVVKTMPKWKPGMQRGKPVRCRFTLPVRFRLQ >gi|226332012|gb|ACIB01000044.1| GENE 71 67094 - 67783 220 229 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15639271|ref|NP_218720.1| bifunctional cytidylate kinase/ribosomal protein S1 [Treponema pallidum subsp. pallidum str. 
Nichols] # 1 222 32 282 863 89 27 1e-16 MKKITIAIDGFSSCGKSTMAKDLAKEIGYIYIDSGAMYRAVTLYSIENGIFHGDTIDTDE LKRRIGDIHISFRIDPETGRPNTYLNGVNVENKIRTMEVSSKVSPISALGFVREAMVAQQ QEMGKAKGIVMDGRDIGTTVFPDAELKIFVTASAEIRAQRRYDELKAKGQETGFEEILEN VKQRDHIDQTREVSPLKKADDALLLDNSHLTIAEQKEWLMAEYQKAIKA >gi|226332012|gb|ACIB01000044.1| GENE 72 67896 - 68768 364 290 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15895122|ref|NP_348471.1| 4-hydroxy-3-methylbut-2-enyl diphosphate reductase [Clostridium acetobutylicum ATCC 824] # 1 290 1 287 642 144 31 3e-33 MVKVEIDEGSGFCFGVVTAIHKAEEELAKGVTLYCLGDIVHNSREVERLKEMGLITINHE EFKQLHNAKVLLRAHGEPPETYIIAKENNIEIIDATCPVVLRLQKRIKQEYMQEDLDEKQ IVIYGKNGHAEVLGLVGQTTGKAIVIEKLDEARRLDFSKSIRLYSQTTKSLDEFWEIVEY IKEHISPDVTFEYYDTICRQVANRMPNLRKFAASHDLIFFVSGKKSSNGKMLFEECKKVN PNSHLIDSADEIDDSLLPGVNSIGVCGATSTPKWLMEEISEAIKAQIKRQ >gi|226332012|gb|ACIB01000044.1| GENE 73 68842 - 69822 1227 326 aa, chain + ## HITS:1 COG:BH3164 KEGG:ns NR:ns ## COG: BH3164 COG0205 # Protein_GI_number: 15615726 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphofructokinase # Organism: Bacillus halodurans # 4 326 1 319 319 311 51.0 8e-85 MGTVKCVGILTSGGDAPGMNAAIRAVTRAAIYNGLQVKGIYRGYRGLVTGEIKEFKSQNV SNIIQLGGTILKTARCKEFTTPEGRKMAYDNMVKEGIDALVVIGGDGSLTGARIFAQEYD IPCIGLPGTIDNDLYGTDTTIGYDTALNTILDAVDKIRDTATSHERLFFVEVMGRDAGFL ALNGAIASGAEAAIIPEFSTEVDQLEEFIKSGFRKSKNSSIVLVAESELTGGAMHYAERV KNEYPQYDVRVTILGHLQRGGSPTAHDRILASRLGAAAIDAIMEDQRNVMIGIEHDEIVY VPFSKAIKNDKPVKRDLVNVLRELSI >gi|226332012|gb|ACIB01000044.1| GENE 74 69879 - 70748 618 289 aa, chain - ## HITS:1 COG:MT2323 KEGG:ns NR:ns ## COG: MT2323 COG1028 # Protein_GI_number: 15841754 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) # Organism: Mycobacterium tuberculosis CDC1551 # 4 202 15 215 317 121 37.0 2e-27 
MEHKLAVITGADGGMGTEITRAVACAGYDVIMACYSSSKAETKCRELVKETGNEKIEVWQ IDLASLASVRAFADRMLRQKTPVALLMNNAGTMETGLHITEDGLERTVSVNYVGPYLLTR LLLPLMGEGTRIVNMVSCTYAIGKLDFPDFFLWGRKGSFWRIPIYSNTKLALLLFTIELA ERLRARGITVNAADPGIVSTNIIRMDQWFDPLTDIFFRPFIRTPLQGAATAIGLLLDAEV EGRTATFNLNNHCRLLPEKYTRPDRRAQLWEETERILSEKGFLPINDKR >gi|226332012|gb|ACIB01000044.1| GENE 75 70748 - 71980 1002 410 aa, chain - ## HITS:1 COG:MT3467 KEGG:ns NR:ns ## COG: MT3467 COG1902 # Protein_GI_number: 15842955 # Func_class: C Energy production and conversion # Function: NADH:flavin oxidoreductases, Old Yellow Enzyme family # Organism: Mycobacterium tuberculosis CDC1551 # 6 382 8 385 396 296 42.0 6e-80 MNTHSASKLFTPVTIGPLTLRNRTIRSAAFEGMCPGNAPSPMLLDYHRSVAAGGVGMTTI AYASVTRSGLSFPRQLWLRPEIIPGLREVTAAIHAEGAAASIQIGHCGNMSHKSICGVTP ISASSGFNIYSPTWVRGMKREELPEMAKAYGRAVCLAREAGFDAVEVHAGHGYLISQFLS PYTNHRKDEYGGSLENRMRFMTMVMNEVMKAAGKDLAVFVKMNMRDGFRGGMEIEETLEV ARRLESLGAHALVLSGGFVSKAPMYVMRGAMPIKTLTHYMDCWWLKFGVKLAGRMMIPTV PFKEAYFLEDALKFRSEIKIPLIYVGGLVSREKIDEVLNEGFEAVQMARALLNEPGFVNR MRREEKARCNCGHSNYCIGRMYTIEMACHQHLTEELPLCLKKEIEQLENK >gi|226332012|gb|ACIB01000044.1| GENE 76 71995 - 72720 377 241 aa, chain - ## HITS:1 COG:no KEGG:BF3752 NR:ns ## KEGG: BF3752 # Name: not_defined # Def: 3-oxo-5-alpha-steroid 4-dehydrogenase # Organism: B.fragilis # Pathway: not_defined # 1 241 13 253 253 436 100.0 1e-121 MGLVALVVFIALYFVRAGYGMFRSRSWGISVNNKIAWILMEAPVFLVMGWLWWHSDRRFL PVELTFFLFFQLHYFQRAFVFPFLMKGKSKMPVLILLMGVIFNILNGLMQGEWIFYLAPS DYYTPGWLLTPCFWIGTLLFFAGMGINWHSDSVIRHLRAPGDTRHYLPQRGMYRYVTSAN YLGEIVEWVGWAILTWSLSGCIFAWWTVANLVPRADAIWHRYREEFGTQVGLKKRIFPFL Y >gi|226332012|gb|ACIB01000044.1| GENE 77 72962 - 74305 1470 447 aa, chain - ## HITS:1 COG:L67186 KEGG:ns NR:ns ## COG: L67186 COG0372 # Protein_GI_number: 15672652 # Func_class: C Energy production and conversion # Function: Citrate synthase # Organism: Lactococcus lactis # 12 447 8 441 441 372 44.0 1e-103 MKKEYLIYKLSEAMKDCTRIDNELFPKFDVKRGLRNEDGTGVLVGLTKIGNVVGYERIPG 
GGLKPIPGKLFYRGYDLEDLAHAILKEKRFGFEEVAYLLLSGSLPDKEELASFRELINDN MPLEQKTKMNIIELEGNNIMNILARSVLEMYRFDPNPDDTSRDNLMRQSIDLISKFPTII AYAFNMLRHATFGRSLHIRHPQENLSIAENFLYMLKRDYTELDARTLDLLLVLQAEHGGG NNSTFTVRVTSSTGTDTYSAIAAGIGSLKGPLHGGANIQVADMFHHLQENIQDWTNVDEI DTYFTRMLNKEVYNKTGLIYGIGHAVYTISDPRAVLLKELARDLAHEKGREREFAFLELL EERAIATFGKIKNNGKTVSSNVDFYSGFVYEMIGLPQEIYTPLFAMARIVGWCAHRNEEL NFEGKRIIRPAYKNVLEEEQYVPLKKR >gi|226332012|gb|ACIB01000044.1| GENE 78 74318 - 75508 1205 396 aa, chain - ## HITS:1 COG:SA1517 KEGG:ns NR:ns ## COG: SA1517 COG0538 # Protein_GI_number: 15927272 # Func_class: C Energy production and conversion # Function: Isocitrate dehydrogenases # Organism: Staphylococcus aureus N315 # 3 395 5 422 422 506 60.0 1e-143 MSKITKQQNGVLSVPDVPTIPFITGDGVGAEITPAMQAIVDTAVDMSYGGVRRIEWKEVL AGERAFRATGSWLPDETMEVFKEYLIGIKGPLTTPVGGGIRSLNVALRQTLDLYVCLRPV RWFRGVVSPVKEPEKVNMYIFRENTEDIYAGIEWEAGTPEAKKFYRFLHEEMGVSKVRFP ETSSFGVKPVSREGTERLVRAACRYALQHELPSVTLVHKGNIMKFTEGGFKKWGYELAER EFADALASGKLVIKDCIADAFLQNTLLIPEEYSVIATLNLNGDYISDQLAAMVGGIGIAP GANINYDSGHAIFEATHGTAPNIAGKNVVNPCSLILSAVMMLEYIGWQEPADLIVEALEA SFAEGKATNDLARFMHKGQPLSTSEFTREIINRIKN >gi|226332012|gb|ACIB01000044.1| GENE 79 75671 - 77914 2175 747 aa, chain - ## HITS:1 COG:SPAC24C9.06c KEGG:ns NR:ns ## COG: SPAC24C9.06c COG1048 # Protein_GI_number: 19114943 # Func_class: C Energy production and conversion # Function: Aconitase A # Organism: Schizosaccharomyces pombe # 12 745 41 769 778 897 58.0 0 MVYDLNMLKSFYASYKGKMEHVRAALKRPLTLAEKILYTHLYNVADLKNYERGEDYVNFR PDRVAMQDATAQMALLQFMNAGKEAVAVPSTVHCDHLIQAYRGAERDIETATQTNREVYD FLRDVSSRYGIGFWKPGAGIIHQVVLENYAFPGGMMVGTDSHTPNAGGLGMVAIGVGGAD AVDVMTGMEWELKMPKLIGVRLTGELNGWTAPKDVILKLAGILTVKGGTNAIIEYFGPGT ASLSATGKATICNMGAEVGATTSLFPYDERMAVYLKATGREEVAAMADSVAADLRADDEV MARPDHFYDRVIEINLSELEPYINGPFTPDAATPISEFAEKVVTNGYPRKMEVGLIGSCT NSSYQDISRAASVARQVNEKNLGVAAPLIVNPGSEQIRATAERDGMMDVFEKMGATIMAN ACGPCIGQWKRHTDDPTRKNSIVTSFNRNFAKRADGNPNTFAFVASPEIVLALTIAGDLC FNPLKDRLVNHDGEKVKLSEPQGDELPSAGFVAGNQGYQAPGGEKNEIRVAPDSQRLQLL 
TPFPAWDGNDFLNMPLLIKAQGKCTTDHISMAGPWLRFRGHLENISDNMLMGAVNAFNGE TNKVWNRLTNTYETVSGTAKQYKADGISSIVVAEENYGEGSSREHAAMEPRFLHVKVILA KSFARIHETNLKKQGMLAVTFADKADYDRIREHDLISVVGLKEFSPGRNLEVILHHEDGT EERFAVQHTYNEQQIGWFRAGSALNAK >gi|226332012|gb|ACIB01000044.1| GENE 80 78173 - 80077 1604 634 aa, chain + ## HITS:1 COG:aq_2057 KEGG:ns NR:ns ## COG: aq_2057 COG1112 # Protein_GI_number: 15607027 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases and helicase subunits # Organism: Aquifex aeolicus # 192 630 49 524 530 246 35.0 1e-64 MNDNKIKSSSPLSDLQHQQLLLRMEYEYEKEEFRRQTETMGIARKVKRGLCWYPVGTGRS FYNSLNQLVIEIERKEDKDIEHVFEFGRPVCFFEQGYDGKIHYFNSICTVSYADEERMVV AVPGAGSLLEIQGAERLGVQLYFDETSYRTMFEALEDVIRAKGNRLAELRDILLSKQPSC WRATYPVRFPWLNSTQEAAVNKVLCAKDVAIVHGPPGTGKTTTLVEAIYETLHRENQVLV CAQSNTAVDWIAEKLVDRGVPVLRIGNPSRVNDKMLSFTYERRFEGHPAYTELWGIRKSI REMGNRMRKSSYSEREAARSRINHLRERATELEIQINEDLFSGARVIASTLVSSNHRILT GRRFTTLFIDEAAQALEAACWIAIRKADRVIFAGDHCQLPPTIKCIEAARNGLEQTLMEK VAANKQETVSLLKVQYRMHQSIMQFSSEWFYQGELQAAPEVTNRGILDLDLPMSWIDTSE MEFHEEFIGESFGRINKPEANLLLQELEAYIRKIGEKRVLEERIDFGLISPYKAQVQYLR GKLKGCSFFRPFRSQITIHTVDGFQGQERDVIFISLVRANEDGQIGFLNDLRRMNVAITR ARMKLVILGDAVTMSKHAFYKKLIGYIRHISQGL >gi|226332012|gb|ACIB01000044.1| GENE 81 80190 - 82109 1609 639 aa, chain - ## HITS:1 COG:no KEGG:BF3757 NR:ns ## KEGG: BF3757 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 639 1 639 639 1252 98.0 0 MKNRIYTMKGMLATLLSAFLLGYAVTGCIDEKDHYKPDDKTSGVPNSFDFATTQDVQLDL KYDVPVKDYQVLFELYFENPLTTDAEGQVVKRTDITPKVTRMTDGTGKYRAKETVPAYGE EVYIYTSYIGVPMLYKTKIVGNTITADINWDTAAEESVQTRAEGEYQTVPQGFYTLGSWN VKGRPNYLDSEGVIELTSSFYQTINQTIPEGGNCPRKYRQSADIVINDELGAEVKVRFVG GTSAAYSAFGYYCYPEGAAKKQIENARKYVVFPNTKTGVGIKGGECVKLHYIDENGEDQG TTFPKGTKIGWFISNDAFTKKGEKTGSVGKGLGMFYSTTALNSDGRTHTAAFKINDFIVL SFEDWTDQDYNDVMFNIWSNPIEAIAPDVPSVDPIDPDDASVAYRMTYKGILAFEDNWPS KGDYDLNDVIVKYSSILEFNTKNQVLSAEDTFTAMWSGALFKNGFAYQLNTDRSNVECSI 
LEGKSGWDKQGLDKDLEQATISVFANAIEETGENTKTSTFKVQNKFKQPVDHETFGVAPY NPFIFLHQNTDKNRTEVHLVNHGPTSKENMDLFNTHQDLSDKDKGIYYVSDQNYPFAIHL SDVESFSTTEKEAIDKSYPRFASWAQSGGTTDKDWYLKK >gi|226332012|gb|ACIB01000044.1| GENE 82 82312 - 83355 1132 347 aa, chain - ## HITS:1 COG:YLR355c KEGG:ns NR:ns ## COG: YLR355c COG0059 # Protein_GI_number: 6323387 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Ketol-acid reductoisomerase # Organism: Saccharomyces cerevisiae # 1 346 48 394 395 417 59.0 1e-116 MAQLNFGGVTENVVTREEFPLEKAREVLKNETIAVIGYGVQGPGQSLNLRDNGFNVIVGQ RPGKTYEKAVADGWVPGETLFGIEEACEKGTIIMCLLSDAAVMSVWPTIKPYLTAGKALY FSHGFAITWSDRTGVVPQKDIDVIMVAPKGSGTSLRTMFLEGRGLNSSYAIYQDATGRAM ERTIALGIGVGSGYLFETTFVREATSDLTGERGSLMGAIQGLLLAQYEVLRENGHTPSEA FNETVEELTQSLMPLFAKNGMDWMYANCSTTAQRGALDWMGPFHDAIKPVVQKLYNSVKT GNEAQISIDSNSKPDYREKLEAELKALRESEMWQTAVTVRKLRPENN >gi|226332012|gb|ACIB01000044.1| GENE 83 83437 - 84021 660 194 aa, chain - ## HITS:1 COG:no KEGG:BF3759 NR:ns ## KEGG: BF3759 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 194 1 194 194 371 98.0 1e-102 MKKIVFLLLLCVACCMNAQEFHLIPKVGLNLANTTGEVDSKVRSGLNIGLGGDVMLTERF GIETGVYYSMQGSKYKGGGVSYTDKLDYINVPVYAKEFIYKGLYVFGGPQFSFNVNSENK SSSDYGSTVIGMNVIRKFDCGVGLGAGYQFERGLLISLNYNIGLINVNKSWASDSNPKAN NSVVQLNVGWRFAL >gi|226332012|gb|ACIB01000044.1| GENE 84 84039 - 84782 801 247 aa, chain - ## HITS:1 COG:CAC3591 KEGG:ns NR:ns ## COG: CAC3591 COG3884 # Protein_GI_number: 15896825 # Func_class: I Lipid transport and metabolism # Function: Acyl-ACP thioesterase # Organism: Clostridium acetobutylicum # 17 220 15 216 248 88 25.0 1e-17 MSDDKKIGSYKFIAEPFHVDFNGRLTMGVLGNHLLNCAGFHASERGFGIATLNEDNYTWV LSRLAIDLEEMPYQYEEFTVQTWVENVYRLFTDRNFAIIDKDGKKIGYARSVWAMINLNT RKPADLLTLHGGSIVDYVCDEPCPIEKPSRIKVATDQPCAKLTAKYSDIDINGHVNSIRY IEHILDLFPIDLYKSKRNQRFEMAYVAESYYGDELSFFEEEVSENEYHVEIKKNGSEVVC RAKVKFV >gi|226332012|gb|ACIB01000044.1| GENE 85 84783 - 85343 588 186 aa, chain - ## HITS:1 COG:MA3791 
KEGG:ns NR:ns ## COG: MA3791 COG0440 # Protein_GI_number: 20092587 # Func_class: E Amino acid transport and metabolism # Function: Acetolactate synthase, small (regulatory) subunit # Organism: Methanosarcina acetivorans str.C2A # 6 159 3 154 161 73 32.0 1e-13 MDKTLYTLIVHSENFAGLLNQVTAVFTRRQINIESLNVSASSIKGVHKYTITAWTDKDTI EKVVKQIEKKIDVLQAHYFTEDEIYFHEIALYKVSMPEFQSQPEASKVIRRYNARIVEVN PVFAIVEKNGISEEITSLYEELSALNCVLQFVRSGRVAITTSCFERVNEFLADRETKYNL SKKEEK >gi|226332012|gb|ACIB01000044.1| GENE 86 85357 - 87054 1537 565 aa, chain - ## HITS:1 COG:ECs4702 KEGG:ns NR:ns ## COG: ECs4702 COG0028 # Protein_GI_number: 15833956 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] # Organism: Escherichia coli O157:H7 # 6 562 1 544 548 496 45.0 1e-140 MSKDIISGGEALMRSLEYHGVKTIFGYPGGSIMPVFDALYDHRETLNHILVRHEQGAAHA AQGFARASGEVGVCLVTSGPGATNTITGIADAMIDSTPIVVIAGQVGTAFLGTDAFQEVD LVGITQPITKWSYQIRRAEDVPWAVARAFYIAKSGRPGPVVLDFAKNAQVEKAEYMPAKL DFIRSYVPVPETDPEAVKAAAELINGAERPLVLVGQGVELGNAQQELRAFIEKADMPAGC TLLGLSALPTEHPLNKGMLGMHGNLGPNMNTNKCDVLIAVGMRFDDRVTGNLATYAKQAK VIHFDIDPAEINKNVHADVAVLGNCKETLSAVTALLQPNEHKEWLDSFLPYEQVEEEKVI RPELHPMGDTLSMGEVVRAVSDATNHEAILVTDVGQNQMMSARYFKYSKDRSMITSGGLG TMGFGLPAAIGATFGRPDRTVCVFMGDGGLQMNIQELGTIMEQKAPVKIIVLNNNFLGNV RQWQAMFFNRKYSFTPMLNPDYIKIASAYDIPAKRVFTREELADAIDEMIATDGPFLLEA CVIEEGNVLPMTPPGGSVNQMLLEC >gi|226332012|gb|ACIB01000044.1| GENE 87 87110 - 88912 1804 600 aa, chain - ## HITS:1 COG:NMB1150 KEGG:ns NR:ns ## COG: NMB1150 COG0129 # Protein_GI_number: 15677026 # Func_class: E Amino acid transport and metabolism; G Carbohydrate transport and metabolism # Function: Dihydroxyacid dehydratase/phosphogluconate dehydratase # Organism: Neisseria meningitidis MC58 # 4 597 3 612 619 798 65.0 0 MKKQLRSSFSTQGRRMAGARALWAANGMKKEQLGKPIIAIVNSFTQFVPGHVHLHEIGQL 
VKKEIEKLGCFAAEFNTIAIDDGIAMGHDGMLYSLPSRDIIADSVEYMVNAHKADAMVCI SNCDKITPGMLMAAMRLNIPTVFVSGGPMEAGELDGQHLDLIDAMIKSADESVSDEEVSK IENRACPTCGCCSGMFTANSMNCLNEAIGLALPGNGTIVATHANRTQLFKDAAKLIVENT YKYYRDGDESVLPRSIATREAFLNAMTLDIAMGGSTNTVLHLLAIAHEAEADFKMDDIDM LSRKSPCLCKVAPNTQKYHIQDVNRAGGIMGIMGQLAKAGLIDTSVVRIDGMTLGEAIDK YDITSPNVCEEAIKKYKSAAAGKFNLVLGSQDVYYKELDTDRAEGCIRDIEHAYSKDGGL AVLKGNIAQDGCVVKTAGVDESIWKFTGPAKVFDSQEAACNGILGGKVVSGDVVVITHEG PKGGPGMQEMLYPTSYIKSRHLGKECALITDGRFSGGTSGLSIGHISPEAAAGGNIGKIK DGDIIEINIPERTINVRLTDEELAARPMTPVTRERYVPKSLKAYASMVSSADKGAVRLID >gi|226332012|gb|ACIB01000044.1| GENE 88 89236 - 90414 628 392 aa, chain - ## HITS:1 COG:no KEGG:BF3764 NR:ns ## KEGG: BF3764 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 392 1 392 392 773 99.0 0 MKNIIGILGVFILAISCSGGKKESADAGLELTEDSVVYLLADNVTLGIKALFPFIDKDGH EYLTFQNQLEPEICVYDLQSGEFVKSIFFDREGANGVGMFGGYHIIDFDEIYLPSLQQSK VFVMEESGKKKREIITEKTDDGIPLLPFGAITFAYRPIYFNNGKMYIPQTINMRLGNKVM EKSPVYVVVDTVKNVLSPFPIKFPPIMSSDDVTKPSLGNELSYSCCLNDKDQFVFSFFFD EDIYVVSLQDGEMKKIKVKSRYIDKPAIKENPPQDFDGAMKASSEIPCYGNLIYDKYRKV YYRFVYLKADLDGEKNYLNIWQYGRKSFSIMILNEDFDVIGETRFSDFTYISTLHYIGKD GLYLSDSHYKNPSFDENKLRFRRFKLVHYNKK >gi|226332012|gb|ACIB01000044.1| GENE 89 90801 - 92876 1637 691 aa, chain - ## HITS:1 COG:no KEGG:BF3553 NR:ns ## KEGG: BF3553 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 691 14 704 704 1308 99.0 0 MNHIIKLLFLCLFFATACSQEEMNEVLKNHAEMEGQANSIEELCKNMNRDIVALKLIINS SESGDYITGFKELADGSGYTISFFKSGTIVIKNGEKGDDGKKGEDGQDGQNGQNGTDGKD GQDGTDGKDGQDGIDGQDGQDGTDGKDGQDGTDGKDGQDGTDGKDGQDGTDGTDGPEGDK GQNGTAPVVTMKMDTDGHLYWAIKNADGTSSFLLDNNGQKVRASGTDGIVPVIGVNAAGY WTLDYGSGPVELKDAAGNPMKAKGVSGDPMFRKVVSEDGYIVFYLTDGTIFKVPEWGGAQ VELAASGVTYFHRSESLTFTFTCSGVNKSIAPVCTAPVGWTVTAQYTSDTTGEMTVIAPD KSSADGALSGTAILELTAKDGKKLSATLGVTLLLEFKVPDSSRSFVYELYVEGVKVGELC HEFIPDYSFGSGKSASVFYPYNVYNKTFGEGMVLDNGGKVDYLTTVYTPGTNTQPFTSLF 
TEDGVSFITGDYIGYSDSQANGLRPYLTTDNEGNSYRVMKIGTQYWMADNLRTITNSAGV ISSEWRKDANPRYAVYGFPAGITEDSRTIRNQYGLLYNAGVFSGSTSLVTEGWKIPNHLG DWDKLRNFLGGDSKVAEALSIAGFVGKPGGRRDADEPFAFQEKDEVGYWWFSSVDGYNCW ALSINPSNVSVPQSTNTYSRGYGFSIRFIRQ >gi|226332012|gb|ACIB01000044.1| GENE 90 92893 - 95043 1599 716 aa, chain - ## HITS:1 COG:NMB1415 KEGG:ns NR:ns ## COG: NMB1415 COG2931 # Protein_GI_number: 15677274 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: RTX toxins and related Ca2+-binding proteins # Organism: Neisseria meningitidis MC58 # 70 276 1393 1607 1829 68 29.0 4e-11 MKRYKKYMFIILSVLLAGGCVQEDINEALRREAELNVRMGKLREWISDTNNEVRLLATIV NTIPTHDYIESVTELPDGSGTLIVFHKGGDILIKNGLTGNKGENGNDGANGENGSDGANG ADGTDGKDGVDGQNGADGQNGSNGTDGTDGHNGSDGQDGANGEDGKDGVDGNDGNDGVKG PAGDTPLLGAKRDTDGNYYWTVQIGDGAVEWLTDGEGNKVIMTPVNGVTPSVGIAKDTDG EYYWQVTVGGVTKWITGGSGEKIVATVTDGSPSIFASVNTGDPEYVVFTMTDGSIYKVAK TGDSKLIFNETGPLFFRRGVTQDVTFDCSSFRQIVKVAGTWDATINYQPKALQGRLTVKA PAAGSALPASERITIEGTDALGNTVRADILVTLYYRIELPKFNSSQVYDVLMEGVKVGEI CREYVPAYSTTDRATVVYSYNAASRTFGSGLILENGASIQHDGTGYMPVAGNPSSTAILT EDGSRYVMAADGADGLVSGTVCGLKASLAVDVQGYAYRVVKIGSLYWMGENLKTTQFNDG TPIPTGFATSLDWEIQCGADLPACQVDRAVNANAPGAMQYRNLYGVLYNQFAMKGNIAPQ GWHVPSKVELQALVDFVGADARKLKSTAVGKGIGNWLEDTGIEANNLTGFTALPANTVNG GNGSTGAQETTGQWWSTETSYRIRMSYKSNSVELLTSGNALAGEGNSIRCVRDANY >gi|226332012|gb|ACIB01000044.1| GENE 91 95062 - 97218 1426 718 aa, chain - ## HITS:1 COG:no KEGG:BF3767 NR:ns ## KEGG: BF3767 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 718 4 721 721 1281 99.0 0 MIRKKIYLLLLLIPLLHGCKESDFNDLLDRQGDQRKELQELTDLCKKLNEDIYNLQVIVN TDRIGDNITHIEELADGAGYTISFSKSAPITIRNGKKGDTGPDGDAGKDGIDGTDGKDGV DGTDGKDGVDGTDGTDGKDGENGTDGKDGADGTNGTDGKDGVDGTDGKDGVDGTNGNKGE QGDPGQNPVVGIMQDPADSEYYWTIKIGSGEPYYLADNDGNRIKATSTVHDGQTPQLGVK QWETSDGGDDNYYWTQKIGTGPETWIEADGKKIVANAKNAVSVFEKVDLKEPDYVEFTLS GGATRFRLPIGRPVIEVPEGRKLFFFNRGGSQEIAFSCKGISKEQLSVDVPKGWKATVDF EAGALTLEAPSAGTADVALSGDVVLRSTNTAEETARTSFTVSMLYKIMLPDFRGSYVYNI 
RLWGKKVGELCREYQRELGLSATVVYPYLTGGVYGSGLITNTGASVKHDGTGYHASVSDR PAVMYIYTENGSEFYTDSSLKGREPDFGDGLQAERLTDADGNAYRIVKIGKQYWTAENLR TLTYPDGTPVATGLSGRAWRESGFDDSGSGACAVYDYENAAAAGAFANKVSYGVLYNRVA ARKVVPAGWKLPTESDIVSTLRAFLGSNAGTLLKESGTEHWLTGGGTDLTGFSGVGGGYR GADGLSFNDFQQVGIWWSSASLESVGAVFRLSANSSVLEFNSGDNNSTGYSVRLLKED >gi|226332012|gb|ACIB01000044.1| GENE 92 97224 - 98531 862 435 aa, chain - ## HITS:1 COG:VC2213 KEGG:ns NR:ns ## COG: VC2213 COG2885 # Protein_GI_number: 15642211 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: Vibrio cholerae # 326 409 204 286 321 63 44.0 7e-10 METGCRRAILFLLFSPLFGTGFAQTETPVKVDKPLKSYSHWYFGAEYGVPFLFGDFTSFS ADKTYVGSQFGGFAGYQVNSWIGIEASARTGYTRMGAKSYAGDYLMNADGMTYYTNQDFN TWKYKDVFSKVHFTNIGLQMNLNVNNFFGPNRGNRRWTVLLGPAVYAQHFSTELINKADK SPLSGKKTDKWNIGIGGDVSLRYKISRAFDVQLRTGIIWVNNNKMDGISTLIKSKDHFMT SAGLSLIWKVGKKKEDNVLYASRRAADVEIRYIEERAVSLPTPACCVEDSIEKERMKREI ASLNMQLQQAHTVVKEKTGSDPILGFNELPPVYFKRGSAYLNVALYKNELCRIVQTLKKY PELKVILSGHADHTGNPDINQKISLQRAEALAAYLEKKGIDGKRIAVKGECIDMLTSDPN NYSVLARRVIVEIQK >gi|226332012|gb|ACIB01000044.1| GENE 93 98563 - 99411 467 282 aa, chain - ## HITS:1 COG:no KEGG:BF3769 NR:ns ## KEGG: BF3769 # Name: not_defined # Def: transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 282 1 282 282 530 100.0 1e-149 MKTLIENLTSDLLPNQIFQTGFSFLIILKGNSLLKLDSNVLIFIMSGTMKVSSAQQELAT VRERHIFFWDKEDDYTCEMLSDSQVILFAFGDLIVHDLLTFRPFGAISDSSVSKDVGLKF AEPLNSFLQLLAQYMEMNLYDLSLYIAKQRELFYILNSVYNEQELAILFSSLTEQSSRFK EQILENYLSAKNVGELASLLGYGVTNFRAKFKEQFGVSVYRWLLNRKSQHIIYRITVYGD EFSQIIDDFGFSSPSHFNKFCRSQYGLTPCELRKKLKTNNNS >gi|226332012|gb|ACIB01000044.1| GENE 94 99739 - 100329 619 196 aa, chain + ## HITS:1 COG:FN1875 KEGG:ns NR:ns ## COG: FN1875 COG1047 # Protein_GI_number: 19705180 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerases 2 # Organism: Fusobacterium nucleatum 
# 1 156 1 149 164 86 35.0 4e-17 METAENKYITVAYKLYTTEDGKRDLVEETAAEHPFQFISGLGTTLEAFESQIVNLHKGDK FEFTIPFAEAYGEYDEEHVIDLPKNIFEIDGKFDNEHIYPGNIIPLMNSEGQRLNGSVVE VKADTVVMDMNHPLAGEDLTFVGEVTESRPATNEEIQEMIKMMTGEGGCSCGSCGDGCGD DCGDSCGDSCGCGHCH >gi|226332012|gb|ACIB01000044.1| GENE 95 100421 - 100939 389 172 aa, chain - ## HITS:1 COG:no KEGG:BF3771 NR:ns ## KEGG: BF3771 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 172 1 172 172 325 100.0 5e-88 MKKHFVWLLLLFPLLLTGCYGEEDKEKDQGDYIWDFINYNIYFSVKDAAGNNLLDPQVAS NILGNEITVEYGDKSFPLENSVDTRFNMPRPLGLRKEVLGEAKERVLSFGEFSPEHQYKG ETFTIHWGDGTKDVVKFDLYITWKKQNPTIHKRLYLNDKEYSKDSFLIKIVK >gi|226332012|gb|ACIB01000044.1| GENE 96 101048 - 102334 1167 428 aa, chain - ## HITS:1 COG:ECs3990 KEGG:ns NR:ns ## COG: ECs3990 COG3681 # Protein_GI_number: 15833244 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Escherichia coli O157:H7 # 7 427 11 435 436 323 43.0 3e-88 MTESERKQIIALIQREVIPAIGCTEPIAVALCVAKATETLGAKPEKIKVLLSANILKNAM GVGIPGTGMIGLPIAVALGALIGKSDYQLEVLKDSTPEAVEEGKKLIDEKRICISLKEDI TEKLYIEVTCEAGGEQATAIISGGHTTFVYVAKGDEVLLNKQQTSGEEEEEETLELTLRK VYDFALTAPLDEIRFILETARLNKKAAEQSFQGDYGHALGKMLRGTYEHKIMGDSVFSHI LSYTSAACDARMAGAMIPVMSNSGSGNQGISATLPVVVYAEENGKSEEELIRALMMSHLT VIYIKQSLGRLSALCGCVVAATGSSCGITWLMGGSYKQVAFAVQNMIANLTGMICDGAKP SCALKVTTGVSTAVLSAVMAMENRCVTSVEGIIDEDVDQSIRNLTRIGSQGMNETDRVVL DIMTHKGC >gi|226332012|gb|ACIB01000044.1| GENE 97 102510 - 103637 912 375 aa, chain + ## HITS:1 COG:no KEGG:BF3561 NR:ns ## KEGG: BF3561 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 375 1 375 375 791 99.0 0 MNKGFILKAQFLLCFLFSQTAQGQSFTPGEIWPDNHQVHINAHGGGILYENGTYYWFGEH KTEGEAGNLANVGVHCYSSDDLYHWKDCGIALSVIENDPGHPISKGCILERPKVIYNPLT KKYVMWFHLEPKGAGYSGALSGIALSDRVTGPYTFLKAVRPNAGSWPINVLPIHKTTRRP SAEEERQCTGGSLPAHPDSLNILGRDMEQGQMARDMNLFVDDDGKAYHIYSSEENSTLHI AELDPTYTGYTGKYIRVFINRFMEAPAMFKKDGNYYLIMSGCSGWNPNAARSAIASSIWG 
EWKELGNPCIGQDADLTFHSQSTYILPVQGKKNQFIYMGDRWTPQNAIDGRYIWLPIHFE GPKPIIEWKDSWTLD >gi|226332012|gb|ACIB01000044.1| GENE 98 103787 - 104881 752 364 aa, chain + ## HITS:1 COG:no KEGG:BF3562 NR:ns ## KEGG: BF3562 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 363 1 363 365 449 63.0 1e-125 MKLKNLIACFFLGFIMVSCIQDEAPNAEADIIACSVPGDVLNRDPIIENNKVTLIVKAGT DITALAPKFTLTPGASIIPNSGTTLDFTTPQYYEVTSEDKKWKKKYEVGVAFSGITNTTY HFENIKFDSEGKYHIFYETDAQGKETMTWASGNAGFAFTGVKATAEEYPTSQSTNGVEGK CLKLTTCETGYWGSLLGMPIAAGNLFLGSFEVRPTDILRSTKFGTPFSNIPTSITGYYKY KAGKSFQIDGKPVDNKKDICDIYAVFYETDEKLSTLDGTNILAEDNTHIVSVARIKDAKE TDEWIKFDLPFEYRTGKSVDMEKLKAGKYNLAIVFSSSIRGDHFEGAPGSTLYIDEVLLN FTNK >gi|226332012|gb|ACIB01000044.1| GENE 99 104892 - 105674 706 260 aa, chain + ## HITS:1 COG:no KEGG:BF3563 NR:ns ## KEGG: BF3563 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 260 1 261 261 421 77.0 1e-116 MKKYILIGLLAIISTITSVQAQEERNHGIIWSALHGLEYEIKAGFNVGGASPLPLPAEIR ALTGYSPTICFAIEGNTTKWFGKDSKWGMTLGLRLETKGMEARARVKNYSMEIIGDGGER LAGYWTGKVRTKYRGSYFSVPITAAYKISQRVKINAGPYVSFMTSGDFNGHVNDGYLRKD TPTGEKAEFEGDKIAPYDFSKDLNNFQWGVQAGAEWKAFKHLNVYADLVWGLNDIFKKEF KTITFNMYPIYLNVGFGYAF >gi|226332012|gb|ACIB01000044.1| GENE 100 105946 - 106341 197 131 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566158|ref|ZP_04843612.1| ## NR: gi|253566158|ref|ZP_04843612.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 131 1 131 131 224 100.0 1e-57 MKKIFTLLMLLCTISVFAANAAGTYTGRLTISVDGNAPTVKDGQNVIVTESDIVTLTIPD FSYSGYPKADVVITASKDAAGNLTLKTIKYGFLRLSAKFNDGSKVSDNNCNISLSISAVL QKVEVTFVGTK >gi|226332012|gb|ACIB01000044.1| GENE 101 106377 - 107948 1036 523 aa, chain - ## HITS:1 COG:no KEGG:BT_1798 NR:ns ## KEGG: BT_1798 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 145 459 39 399 448 152 32.0 2e-35 MKKNLLYLFALICSVSLFTACSDDDDNSWQELPKGEIKAENVDFQLNGTNTTGTVNFEAT SLQSATVGFKNVVDGYSDITVDVTMEKQADGSFKFNGTKDIMTKPVTRETSQPAPLLKVT VDGMITPEGKVTLNVSATGAGLYIGTYKGETLVLTYGETTLTGKEVVFDATDGANVSILL KDIIPGETETTLTGVQVANEGFSGSTKTNTSTIEYTGSRKDKVLTLNLKVTMNDPKGWAK TYTLGEYTIGTLDVDGTPMPNSVLTSSLYSNWEVKDAYYSTFFPAVLRTIGGLILPQVLQ SVTLEADGNISAKYSSGSVTFDPNWAMGLIFGGGAPGVDVLNKLIPTDGWQQSPKNLAYW FPKDDKLYLKLNVSTIISQAMGSNAESLAPIISEILNGDAATVKKLIGTMLKVDMSSISD ETFEMLLGWVNNGIPLNVKTTDKGHTYIYLDKTAFDPIMVDKEMSADSSDFGTGSDLFKL WKIMMDAKIIPEDAAAAIILLIGLPQNWPSTTGFDLGLDLLAK >gi|226332012|gb|ACIB01000044.1| GENE 102 108560 - 109498 683 312 aa, chain - ## HITS:1 COG:no KEGG:BF3777 NR:ns ## KEGG: BF3777 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 312 1 312 312 652 99.0 0 MNKFIYIFVAIVAISFASCTQERAKEKELKVLSWNVWHAGHAKNYPEKGCEGTIGILRKS QADVILMIETYGAAPMVADSLGYDYVLLSDNLCIYSRYPIKKTYLFPDSISTFNFGGVEI DMDGTPVRLFDTWLHYLPDMRLVPTEQSETDILAWDDAGTRDNEIRRILSVLQPMIRQTD SIPMIMGGDFNVHSHLDWTDATKDMYHHGGAVVEWTVSKEMQNAGFKDSFREIHPEPEKN IGTTWIYDNEDKPLRSDRIDFIYYQGKTIRAITSESYNQELIKPLKFMGEEFFYPSDHGF VMTTFKISPLEK >gi|226332012|gb|ACIB01000044.1| GENE 103 109577 - 110500 930 307 aa, chain - ## HITS:1 COG:no KEGG:BF3569 NR:ns ## KEGG: BF3569 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 307 1 307 307 625 100.0 1e-178 MKKISILFIFSLILGLFVSEVSAAGPRLKKRPKHVVLVAFDGLSAVAIRNHPMPNFNRLM KEGASTLNNRSILPSSSAPNWASMFTGVGPELHGYTTWGSKTPEIPPFITNQYGRFPGLY GLLRDTHPKAELGYIYEWDGMKYLVDSLAINHFVHAPQTKDHPKGATQFAVNYLKEKKPM 
YCAVIFEYPDHTGHTYKWESKEYYEKLDELDGYLGEIVAAIEEAGMMDETVIILTADHGG IGTNHGGKTLNEMETPLVFYGKGVKKNYKITESTMVIDVPATEAWLLGVEPHEAWLGNPV TTAFFTK >gi|226332012|gb|ACIB01000044.1| GENE 104 110514 - 112970 1527 818 aa, chain - ## HITS:1 COG:MTH1485 KEGG:ns NR:ns ## COG: MTH1485 COG1520 # Protein_GI_number: 15679482 # Func_class: S Function unknown # Function: FOG: WD40-like repeat # Organism: Methanothermobacter thermautotrophicus # 485 775 103 371 407 63 23.0 2e-09 MLKLFISMAALFLGVGSSVFAQIEGKVYIDANGNGICDAGERGLKGVCVQDGLNVVKTTD DGHFILPGHKDTRFVTLTVPDGYQASTSHYLSFDGTGKKYELGICKTSVDTGNGYSFVQI TDTETSLYGDWIDNLKEYVKTNPTAFIIHTGDICYEAHQDFHGRYLRSVDLGIPTYYCVG NHDLRAGKYGEELWQSHFGPSWYSFDVGNVHYVVTPMLGGDHAPSYRRSDIIRWLKNDLA QTDKGKRIVLFNHDLWFWGDDLLFKDKNGEQIDFADYNLDAMIYGHWHNHYYKQLKSGLH TYCSSTPDKGGIDHGTSCFRIYNADTKGKLSSATRYTYIDGILTSAYPAEGETVSVPDGK MTVRINAYRTISDAKKVTASVERNGRLVSTVTLMPETDWGWSGAVRVSGGKQRLLVTAEF EDGTRLTKRVDYTVTKQPLSVIATSDVWAGLRGNAAHNQLVNDSVSLPLQTNWIQNVGSN IYMCSPIVAQNKVFIGTIDDDKAKKCYVKAYDATTGHLCWTFVTSNSIKNTIAYEDGRIF ASDASGMLYAIDAEKGTACWQTQLPVSLLPLLDEGLAVADGVVYAGHAKGTCAVQAVDGK ILWQNKAWDGGEGTTSTFTVGAGVLVASAHWNGLFGHDISNGALLWKKRDSKIRFRDGSA TFYDGNFYLASCENLYVINPRSGDILKMAETSYEFNSACAPLVTDKYLIVSTSNKGVVAF DRLTFKEVWNYRTGTSLFYTVPYSHNQECTVEVSPVLVGSAVLFGASDGYLHAVDLNTGA YRGKRALGAPVFSSVAVSGNCLFVADFGGNIYNFKLHN >gi|226332012|gb|ACIB01000044.1| GENE 105 113075 - 114868 1663 597 aa, chain - ## HITS:1 COG:no KEGG:BF3780 NR:ns ## KEGG: BF3780 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 597 1 597 597 1231 99.0 0 MCSKIKHILLTACCFTGAGLMTSCNDGFMDRFPETSITEKVFFSSPADLETYTNGMYGYI GASYSDTPSDNMLYPEDTDIYKMMRGEYRADNIGKWSWSNIRTVNFMLARTGRVEGDRGE IDHYIGLARMFRALVYYSKVKDYSDVPWYSHDLQTTDIDLLYKPQDPRALVVDSIMADLD FAVTHMKTTKSTTRIYRDAALAVQARIALHEGTFRKYHPELKLNDGDRFLKIAVEACQKI MDTKSYSLSTTKESGLPAYQSLFCSTDLTQNPEMILVADYDKALGRLHNAQAQFDYNTGL SRSLMEDYLVVKDGHTEYFHQVEGYKTKTVLEVFENRDPRLEQTFMKPGVLNVGTTEPHR TKLNLGGYPQIKFRPLTFDQIDWGKSYTDLPIIRYAEVLLMYAEAKAELGILTQDDVNQT 
INLIRQRAGMPDASLDDWLANIDPVQDERYSNVQSAQKGAVLEVRRERRIELACEGFRYG DLMRWGCGKLFEAAPEGAYIPGMGYYDVTGDGQPDVAIVEKKADIDKIPEEDKQKYKLTV YALEGNTIGLTEGTKGYIYLVAQHNKYTFVSPKYYYYPVATKDITVNENLYQNPFWE >gi|226332012|gb|ACIB01000044.1| GENE 106 114881 - 118255 2962 1124 aa, chain - ## HITS:1 COG:no KEGG:BF3781 NR:ns ## KEGG: BF3781 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1124 25 1148 1148 2226 99.0 0 MSFILFLFVFQGVYAQQTRINLHVKQVPLKQVLKSIESKSEYTFFYNDAEIDMNRKVTVQ ANNERIDVILSKILPDCKCVVENRKIILVPGAEKQNTPNDNTAKTKEITGTVTDTRGEML IGVNVTVLGTTTGVITNIDGKYSLKVPAGKSLKFSYVGYIAQTVKVGDKSVIDIVLEENS KALDEVVVVGYAVQKKVNLSGSVATVSTKAIEDRPVLNMGQALQGAVANLNVSVGDGEAD DSPSYNIRGTTSLNGGSPLVVIDGVVSTSDQLNRMNPVDIANISVLKDAASSAIYGSRAA FGVILVTTKDGSNEKLTVNYNNNFVLRTNTRMPEIITDPYLVATTRNTMAYPWYNLYNEE QLAYAKKCSEDPSTSPYFLNPDGSYTYFGRTNWVDEAYNDVGFSTIHNIDISGKTDRISY YFSGGYNRQNGMFKYGNDIYNRYNLRTKLQFKLTDWWSLNSNVSLTTSDYDYANAMTNTY KQMYRKNPMDMVKNPDGTWTDASVGTLGALAEGGRATDWKTNTNINLSTKIDVIKDVFFV QGTFAFSNTKTRSNWYNLPVTYRNGPELPVLTFNPISTVSDASSSNSDTKHILFDVYGTF QKTFAKKHAVTAVVGFNQEEYKYDYVKANRKELISSSLPTINLATGDMNMSQSITTWALR GAFARLGYIYNDKYIFEFNGRYDGTSRFPKNDRFVFNPSVSLGWVISREKFFEPLTGVVS FLKLRGSYGSLGNQDVDAYAYLATMGSGKISQILDKQQPVYVGAPGLVAGNLTWEKVTTT NLALDANFFDNRLSITGEVYVRRTKDMLTPGVTLPSVLGTDVPKQNAADLKTEGWELTVG WKDQFKLAGKPFYYDVNFNLADSRAYITKYENPKGLLGDYYVGKEIGEIWGVETLGFFTS EEDIKNHADQSWCTSYPGTRPLAPGDLKFKDENKDGKITDGAWTLEDHGDYKIIGNSRAR YTFGLSANAQWNGFDLSLFAQGVGKKDYYPGTGDLYFWGIYAQPWTNITKGNMYDHWTEE NPDAYFPRMKAYVAENTDRECGVVQTRYLQNAAYMRLKNLTVGYTLPKVLLNKIGIERLR IFFSGDNLCEFSGLYKHYKVDPESLGDIVYPLQRSYSFGLNVTF >gi|226332012|gb|ACIB01000044.1| GENE 107 118491 - 119513 621 340 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 132 339 123 327 331 70 28.0 4e-12 
MKNEIKDINEVIIRFLDGTATGEEKVFLFNWLKQSEKNRNEFSEVRDLWLLGNTIATDDL ETEIALERFKNRIQSTESGLRKNRFVFRKHFVPFLRVAAVFLMLFTVGSVFYYWGSSSVP KQPDVMNRLLTANGSKGRFVLPDSTVVWLNSNSLLEYPETFSSSAREVSLSGEAYFEVRR NEKLPFRVQAGEMKVEVLGTRFIVDNYRRKSGVEAVLVEGSVKIAGCKMNHSVVLTPGQL INYDKKSERTKVQMVNTDDYISWIQNELTFDNDKLADIIINLNKWYGVDIECPSEFAEKV FMSFSVRNGENLDEILKAMTLVAPIRYYWENGILHILPRK >gi|226332012|gb|ACIB01000044.1| GENE 108 119587 - 120144 556 185 aa, chain - ## HITS:1 COG:no KEGG:BF3574 NR:ns ## KEGG: BF3574 # Name: not_defined # Def: putative RNA polymerase sigma factor # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 185 6 190 190 297 100.0 1e-79 MYSNLESVEQLFRQYYKVLRVYAFRFVNDWDIAEDVVQDVFVALWNKRTDIEFDGAVKAY LFKAVYNKSLNILSSKKYTEEESVEQFSDQIEALQILENNQENSLFMKELQGEIETFIET LPTQVKKVFILSRSYGLKIKEISVQLDLSPKTVEKYLTRALLELRTHLKNKDLMSLLFLL YLCSK >gi|226332012|gb|ACIB01000044.1| GENE 109 120225 - 120362 59 45 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|265766917|ref|ZP_06094746.1| ## NR: gi|265766917|ref|ZP_06094746.1| predicted protein [Bacteroides sp. 
2_1_16] # 1 45 1 45 45 83 100.0 4e-15 MSTDSAFNGYGYLSPYADTCFYRDEKNGLFENCHNLFIAITKTVS >gi|226332012|gb|ACIB01000044.1| GENE 110 120995 - 122836 1600 613 aa, chain - ## HITS:1 COG:no KEGG:BF3784 NR:ns ## KEGG: BF3784 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 613 1 613 613 1255 100.0 0 MKVFKNLSTYILALGFSATLFSGCEDYLNVSDDLAAEMTMEEVFNNTSYARRFHRYIYTG IPDVSNIIITSAYADLTGLDNPWPAVSDELKSAQNNVKTIPTIGYHAGSATLSRWSLYKQ IRQANEFIAYAHVIPQNGDVADFIDEKELALLKNEARFLRAYYHYLLFELYGPIPIMTEI ADPSAADLDYYRNSVDEVVAFIDKELNECYDLLPEKELNPDGTINNERAAAPTKGAALAI LAKLHVYAASPLFNGGYPEAIALKDNQGKQLFPAKDDTKWKTALDALQRFIDYSKGRYSL YQVMKNGEIDPAESLYQLFQVSVNNSEAVWQSSKNSWGGVNGEGRERRCTPRAIFSGFSC VGVLQEAIDDFLMSDGKSIEESGLYKEEGIGEDGIPNMYKNREPRFYQDITYSGKVWQKT DKKIYFYKGMPDDNSKADMSYSGYLLYKGMNRDLLNQGNNPKSKYRAGMLFRLADFYLLY AEALNHVNPGDARIIQYVDSVRYRAGIPLLKDIKPEIIGNRELQEKAIRHERRIELFAEG QRYFDVRRWMCAEEEGYKQGGPVHGMDMNATDLEGFMKRTAFETRIFEKRMYLYPIPLAE IQKSKKLVQNPGW >gi|226332012|gb|ACIB01000044.1| GENE 111 122859 - 126074 2668 1071 aa, chain - ## HITS:1 COG:no KEGG:BF3785 NR:ns ## KEGG: BF3785 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1071 1 1071 1071 2120 99.0 0 MRKPFMLISVLPITVHISHDTDTGSNRIDGIPEKYVFKNKVLRSICFVLFLLSIGLNSAF AQVRNAAGLVVDENGQPLIGVQIKLEGTTTGVITDVDGNFSINAKKGDILLFSYVGYEPQ RITYKGEKILAIKMLPNTELLDEVVVIGYGKQKKNSVVSSINAIGPKELAVSSNRNLTNS LAGQVPGLIAVQRSGEPGYDNSEFWIRGVSSFKGGTNPLVLVDGVPRNMQDIEPDEIESF TLLKDAAATAVYGAEGANGVILITSKRGNSQKPKISLRAESTLLTPTRLPKFMNSVETLD LYNEALNNEGTASIRTAEEIAMYGPGADRDLYPDTDWLKEMLREHTYNMRYTLNVRGGSE RARYFVSGAFYQENGIFKEGKKNEYDNNIGLKRYNLRSNIDFDATKTTLVKVDISGQYLQ TNYPGTSTNTIFNSMCRTPSYLMPAEYSDGTIAGHPRPSGNRVNPYNLLMNSGYAKEWRT SIQSKLEVDQKLDFITKGLNWKGLISFDADMKYIAKRTKTPTQYLATGRDENGGLQFKKV VEGSDVLTEKLENSSNKKIYFETSFNYNRTFAQKHDVTAMVLYMQKETQYHNNALPYRKQ GLVGRATYGYDGRYFIEGNFGYTGSETFAKGYRFGFFPAMGLAWYVSNEPFYPEVLKKVV NKLKFRFSIGRTGNDDTGGDRFLYRGTMKQDNGGYDLGFSDTGGMGGIGNGITEARFEAP 
YLSWEIEEKKNYGIDLGLFDNRIDLQVDYFNNKRKSILLQRNTVSNVTGFQQMPWQNFGI VKNHGVDASLTLNQKIGQVNLSARGNFTFARNEIIEYDQVPQVYPWLEKRGTRLNSNKLY IAEGLYTYDDFIINGEGLNRTYELKPGVVSGLSSGVRPGDIKYKDLNGDGKIDSNDQKED VGNPTVPEVVYGFGFNAEWKGFYAGIFFQGAGNTSTVLGTGADATFFPFQWGVEESAVRS VVADRWTEQNPSQNVMFPRMHSNNFQNNTVASTWWLRDASFLRLKNIELGYNFDKKLLKK LNIEALRFYVQGNNLCVWDHIKMWDPEQGNSNGGFPYPLNRTFTFGLDFTF >gi|226332012|gb|ACIB01000044.1| GENE 112 126887 - 127231 308 114 aa, chain - ## HITS:1 COG:no KEGG:BF3579 NR:ns ## KEGG: BF3579 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 8 114 334 440 440 219 99.0 3e-56 MSRLSGIRGQGGSDPLPYEIIIDMNHRVKIAQIELLPRGRGSNNPIKVVRFEASEDGTNW ESIGQFGFTNQDAALKYYVKSSTARYIKLVIPDGVGNGTVAAIRELDVRGTVVN >gi|226332012|gb|ACIB01000044.1| GENE 113 127191 - 128207 827 338 aa, chain - ## HITS:1 COG:no KEGG:BF3787 NR:ns ## KEGG: BF3787 # Name: not_defined # Def: putative chitobiase # Organism: B.fragilis # Pathway: not_defined # 1 334 1 334 440 661 99.0 0 MNRFNYIGMLAYVALSMSSCDDSFDVSSKADGVLAVSQEGFNTLQSYNVGEKYTADLWIQ QGGLKSTASVVSFSVDKALLDSMNIADGTSYELLPADCYQLTKSSVDIPVNEWLLKGELT YDPAKIQELSGYDHLKYVLPLRATSSGMPFVSGRSVVLLGFKVSEPIVTIMNAGVEEINL AEVKELPVQIGVPFTNKWEISCRLESRQSVIDAYNTAHGTYFSMLPSDAYVAPETPILHS GVNQVTATYKLKDDVLPGNYMLPVQIAEVTSDATIRADKDVYAAYSIIKEGDKLSKTDWK IVSFTTEEASGEGSNNGHAKHLIDGNVETFWHSRAGRI >gi|226332012|gb|ACIB01000044.1| GENE 114 128236 - 130122 1534 628 aa, chain - ## HITS:1 COG:no KEGG:BF3788 NR:ns ## KEGG: BF3788 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 628 1 628 628 1280 100.0 0 MKKQNSIYTLLIALLCFVSSCDYLGVSDQLAGGLQNTEQVFDNVSYTKRWYANVFAGIPD YSGINSVNVGAFKNPWTGMCDELVVGYGNSSKYNNSDRNAANMGFHRYGDCYKYIRQANI FLQKAHPIMTTGTQGDQLLEDELTQMKANVRFMRAFYHYLLFEQYGPIILVKDKIYNATE DQDVPRNTVDEVIEYIDSELTAVASELTQEPIFEDKDYRAWPTKGVALAVRAKLWLYAAS PLLNGGYREALSVTNPDGTRLFPDYDAGKWEKALAACKDFIDYAEAGRYELYKEYKDDNG AVIDPDKSVYNLFQKYTHEIIWATANNDWGGMNGDAFDRRIAPRCEKNGLGSTGVTQELV 
DAFYMKDGFPVSATAYLPQSTLYQEEGYGTYKDQNDNFSKKYTNVTVSNRYLNREPRFYN TVFFNGRQWPVSCNQVLFYNGGNSGVQEGQATLTGYMLFKRFNRSVSLTNPGVASQFRPS IIFRLADFYLMYAEAANEVNPNDARVLKYLNLVRERAGLPDIETLNPAIRGNQELQRAAI QRERQIELATEGQRYFDVRRWMIADKNGEGRQNGYVHGMNVRGESNDKEDFNRIVEASQI VFNRKMYLYPMPDSEMRKTKNLVQNPGW >gi|226332012|gb|ACIB01000044.1| GENE 115 130152 - 133604 2595 1150 aa, chain - ## HITS:1 COG:no KEGG:BF3581 NR:ns ## KEGG: BF3581 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1150 9 1158 1158 2199 100.0 0 MNNHGIFSPDRTKVRTLLSIKSIFLFLLFTFAFEMAYASSVYSQTKVFTMQSAEKTVLQV FKEIEKNSEFIIFYRDGVIDLNRKVSVNVVNQSVDKILEQLLAHTDNGFTIKDRQIIIYK KETSATPSVSQQKNKIKVTGVVTDAKGESIIGANVVVKGNPTIGAITNMEGRYEVMLPSD DVILLVSYLGYNTEEIKVKGRRNINVVLHEDSKALDEVVIVGYGKQKKESVVVSMSSIKP KDIVVPSRSLNNSLAGQVAGLIAVQRSGEPGYDNAEFWIRGVSTFAGGTSPLVLVDGVPR NMSDIEPDEIETFSVLKDAAATAIYGAEGANGVVLVTTKRGRVEKAKISFKTEHTISSPT RLPEFVGSADYLSLYNEALRNDGEGPQFSDELIAHYRNNDDPDLYPNTNWIDELLRKNTF SHRYTLNVRGGTEKAKYFVSGAYYNESGLFKNRPNGIYDTNIGIDRFNLRSNIDMAVSST TTVGVDLAMQYLINNYPGTGTSTIFRSMLITPPYAFPAVYSDGTVATYAQERDANMRNPY NLLMNSGYAKEYRTGIQSKVNVNQKLDFITKGLSANLNVSYDYDSEMIIRREYNPTRYHA TGRDELGQLIFSTVVSGNPDIQDPKNSATSATKKIYIDASINYKRTFGKHDVGAMLLYMQ KETQQHNVPLPFRKQGFVGRATYGYDGRYFIEGNFGYTGSEAFAEGNRFGFFPAIGAAYY LSNESFYPEAIKKVVNKLKLRASVGRTGNDKTGQERFLYRPTFTTNAGGFTQGIGDTGGT NGIGNGIVEGRFAAPYLAWEIEDKQNYGFDLGLFDNRIDIIFDYFRSERRDILLQRRTVP QLGGLRQDPWQNFGKVRNQGIDMSMNLNQQIGKLKLSARGTFTFTRNKILEYDELPQKYG YQAVTGTRVSENTLYIADRLYTEDDFIVSTNANGLKTYKLRSELPRPTLGGLIGPGDIKY VDVNGDGVIDSYDQVRGVGNPSTPEIIYGFGLNAEYKGFYASIFFQGAGNTSVLLGGATS EGWYPFSWGVDQSNYRTFALDRWTENNPSQDVIIPRLHKNNANNANNRVASTWWLRNGSF LRLKNIEFGYQLPKKFMDKIGFEAARIYIMGYNLAVWDDIKYFDPEAGNANAGLNYPLPR TFTLGLDFTF >gi|226332012|gb|ACIB01000044.1| GENE 116 133733 - 134929 645 398 aa, chain - ## HITS:1 COG:AGl2871 KEGG:ns NR:ns ## COG: AGl2871 COG3712 # Protein_GI_number: 15891547 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate 
sensor, membrane component # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 201 379 136 310 331 67 31.0 7e-11 MEKNHYIHYTAADLLNDGTFLDSMQHPTEQSEKFWSQLEKENETFAGELRMARSFLMAVA ESPQKRMTDDEVGTLWERISRQVAMEKRVKRKKQLLFLRVAGIAACISVLALSSYFVFSL ISFYEESFCYSLAGISEPDSSNDIQLILSEGRKLVMDGKESRLHYKEGGKIAISSGKTQL DEENETGYNQLIVPSGKRSFITFSDGTRVAVNANTRIVYPSEFSGHKREIYVNGEVYLQV SPDKKHPFVVKTNRMEVEVLGTEFNVSAYDFTKNQSVVLVSGKVEVDTYKYPKKVLKPND MLTYDGQDDGLRVNTVDVSEYISWVDGYYCFNHEKIEIITEKLSRYYGKRVIPDSGLIGL TCSGKLDLRDDLRDVLEVLSKTIPAQIETRDNYFLLTK >gi|226332012|gb|ACIB01000044.1| GENE 117 135029 - 135607 305 192 aa, chain - ## HITS:1 COG:no KEGG:BF3791 NR:ns ## KEGG: BF3791 # Name: not_defined # Def: RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 192 1 192 192 358 99.0 4e-98 MYNDELIESSKIKWQSFLKGDDDAYAWLYSRYVQQLYQYGCRFTTDTEMVKDCVQDVFVN VYRQKDRYSSPPDNVKIYLMSSLRNSIFNVFNKGNLHDTYISNINYEFDLSVEEKLIETE DETSQKHTVAHLLNTLSPRQREIIYYRFFEGLDYSAICELMGLNYQSAYNLLQRSLSRLR EMYGILPFFFLF >gi|226332012|gb|ACIB01000044.1| GENE 118 135868 - 136944 1031 358 aa, chain + ## HITS:1 COG:sll1747 KEGG:ns NR:ns ## COG: sll1747 COG0082 # Protein_GI_number: 16330007 # Func_class: E Amino acid transport and metabolism # Function: Chorismate synthase # Organism: Synechocystis # 1 351 1 353 362 364 54.0 1e-100 MFNSFGNIFRLTSFGESHGKGIGGVIDGFPAGIVIDEEFVQQELNRRRPGQSVITTSRKE ADKVEFLSGIFEGKSTGCPIGFIVWNENQHSNDYNNLEKVYRPSHADYTYTVKYGIRDHR GGGRSSARETISRVVGGALAKLALRQLGIHITAYTSQVGPIKLEGNYTDYDLDLIETNPV RCPDPEKAKEMQDLIYKIKGEGDTIGGVLTCVIKGCPIGLGQPVYGKLHAALGNAMLSIN AAKAFEYGDGFKGLKQKGSEQNDVFYNNNGRIETRTNHSGGIQGGISNGQDIFFRVAFKP VATVLMEQETVNIDGIDTTLKARGRHDPCVLPRAVPIVEAMAAMTILDYYLLDRMTQL >gi|226332012|gb|ACIB01000044.1| GENE 119 136957 - 138321 1166 454 aa, chain + ## HITS:1 COG:DR2025 KEGG:ns NR:ns ## COG: DR2025 COG0624 # Protein_GI_number: 15807020 # Func_class: E Amino acid transport and metabolism # Function: Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases # 
Organism: Deinococcus radiodurans # 18 441 18 440 459 403 47.0 1e-112 MNEIQKYIAENESTMMEDLFSLIRIPSISALPEHHDDMLACAQRWTQLLLKAGADEAIVM PSKGNPIVFGQKIVDPNAKTVLIYAHYDVMPAEPLDLWKSQPFEPEIRDGHIWARGADDD KGQAFIQVKAFEYLVKYNLLENNVKFIFEGEEEIGSPSLEAFCEEHKELLKADVILVSDT SMLGADLPSLTTGLRGLAYWEIEITGPNRDLHSGHFGGAVANPINVLCGMLSKVIDTDGR ITIPGFYDAVEEVPQAEREMIAHIPFNEEKYKEAIGVKKLFGEKGYSTLERNSCRPSFDI CGIWGGYTGEGSKTVLPSKAYAKVSCRLVPHQDHHVISKLFADYIRQIAPATVEVKVTAM HGGQGYVCPISLPAYQAAEKGFEIAFGKKPLAVRRGGSIPIISTFEQVLGIKTVLMGFGL ESDAIHSPNENFSLDIFRKGIEAVVEFHLIYGKK >gi|226332012|gb|ACIB01000044.1| GENE 120 138406 - 139461 1025 351 aa, chain - ## HITS:1 COG:AGl573 KEGG:ns NR:ns ## COG: AGl573 COG3049 # Protein_GI_number: 15890402 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Penicillin V acylase and related amidases # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 25 346 28 353 355 369 54.0 1e-102 MRRKIMMATLIVIAVNFIWSGQPVKACTRAVYIGPDNMVITGRTMDWKEDIQSNLYLFPR GIKRAGYNKGNTVEWISKYGSIVATGYDIGTCDGMNEKGLVASLLFLPESIYVRPNDTRP VMGISIWTQYVLDNFATVSEAVEELKKQTFRIDAPDMPNGSASTLHMAITDETGNSAVLE YIDGNLIIHEGKEYQVMTNSPRYDLQLAVNDYWKEVGGLNMLPGTNRSSDRFVRASFYIH AIPQTSDAKIAVPSVLSVMRNVSVPFGITTPDKPYISSTRWRSVSDQKNRVYYFESTLTP NLFWIDLHKVDFSPNASIKKLSLTHGEVYAGDVVKDFKDSRSFTFMFELPQ >gi|226332012|gb|ACIB01000044.1| GENE 121 139760 - 140077 202 105 aa, chain - ## HITS:1 COG:HP0056_2 KEGG:ns NR:ns ## COG: HP0056_2 COG1012 # Protein_GI_number: 15644687 # Func_class: C Energy production and conversion # Function: NAD-dependent aldehyde dehydrogenases # Organism: Helicobacter pylori 26695 # 1 100 335 435 802 104 51.0 4e-23 MTGGTDTARSIAKAIPATPLSAETGGKNVIILTASGDRDHTIMNIVISVFGNAGQKCSAC SLLVERSVYEDKNFQKKLIDVASSMKAGSVWNPGNVVGPMITNKK >gi|226332012|gb|ACIB01000044.1| GENE 122 140291 - 141013 586 240 aa, chain - ## HITS:1 COG:no KEGG:BF3797 NR:ns ## KEGG: BF3797 # Name: not_defined # Def: putative integral membrane protein # Organism: B.fragilis # Pathway: not_defined # 1 240 1 240 240 419 99.0 1e-116 
MNTMFFNAVSCFFVCAGLIIALILAINVIFHRQSMKIMNIVWVLTGLWGHYFALFAYYTF GVRKDNMVATVPMENMKMDMKISMEMDMSEVRPQWQSITLSALHCGAGCTLADIIGEWFT YWVPLQIGGSLIAGSWALDFVLALILGVFFQFIAIREMEAISFREAVSRAFKADFFSLLA WQVGMYSWMAVATFILFKDESLEKTSWTFWFMMQIAMLLGFMVSYPVNVWLIKSGIKKGM >gi|226332012|gb|ACIB01000044.1| GENE 123 141103 - 143253 2163 716 aa, chain - ## HITS:1 COG:CAC3567 KEGG:ns NR:ns ## COG: CAC3567 COG0550 # Protein_GI_number: 15896801 # Func_class: L Replication, recombination and repair # Function: Topoisomerase IA # Organism: Clostridium acetobutylicum # 2 711 4 650 709 440 38.0 1e-123 MIVCIAEKPSVARDIADILGARERKEGYIEGNGYQVTWTFGHLCTLKEPHEYTPGWKSWS LGSLPMIPPRFGIKLIENPTYEKQFHIIEGLMQKADEIINCGDAGQEGELIQRWVMQKAG ARCPVKRLWISSLTEEAIKEGFAKLKDQKDFQPLYEAGMSRAIGDWLLGMNATRLYTVKY GQNRQVLSIGRVQTPTLALIVNRQLDIENFQPKQYWELKTIYRDTTFSALIRKSDEELAA EEEKSGGKAKKTENRGIDPIANREEGMALVERIKDLPFVVTSVAKKDGKEYAPRLFDLTS LQVECNKKFAYSADETLKLIQSLYEKKVTTYPRVDTTFLSDDIYPKCPNILKGLKDYEVL TTPLAGASLPKSKKVFDSSKVTDHHAIIPTGVHPQNLTDMERRVFDLIARRFIAVFYPDC KVSTTTVMGEVDSIEFKVTGKQILEPGWRVVFAKDVKDPNEEKEGEDENVLPAFVKGESG PHIPDLNEKWTQPPKPYTEATLLRAMETAGKLVDNDELRDALKENGIGRPSTRAAIIETL FKRNYIRKEKKNLIATPTGVELIQLIHEELLKSAELTGIWEKKLREIEKKTYDARQFLEE LKQMVSEIVNSVLSDNTNRRITIQEATAVEEEKKKKEPKRRERKSATPKEKKGKSEPATV DVSSTGQSVRADALKETDSLVGKLCPVCGKGMIIKGKTAYGCSEWKNGCTYRKAFE >gi|226332012|gb|ACIB01000044.1| GENE 124 143380 - 144780 1108 466 aa, chain + ## HITS:1 COG:TM0306 KEGG:ns NR:ns ## COG: TM0306 COG3669 # Protein_GI_number: 15643075 # Func_class: G Carbohydrate transport and metabolism # Function: Alpha-L-fucosidase # Organism: Thermotoga maritima # 31 372 7 358 449 136 30.0 1e-31 MKSYKLLLAFTLFLAFAFNMKAQYVHERSDQYTPPEDSLVIQKLHHWQDQKFGMLIHWGL YSVAGIVESWSICSEEADWIPRDSTMAYEDYKKWYWGLKDSFNPTRFDPEQWAQAAKSAG MRYAIFTTKHHDGFNMFNTAFSDFSIAKGSFQTDPRADVAKYVFEAFRNNDLMVGAYFSK PDWHSEYYWWPRYATPRRTQNYNIDKNPWRWNQFKEFTYNQIGELMHNYGPIDILWLDGG WVNNPGTKSVLDMDRISQMARQAQPGILFVDRTIHGKYENYQTPEQQIPDKQLPYPWETC MTLGVDWGYTPHAVFKSPVTVIAKLMEIVAKGGSLLLGVGPTPEGILQDEIVSRLQTIGG 
WMKKNGTAIYNTVTTPQYYSEGIWFTMNKDGKTMYALYSNPETEKFPDYIEWENNIPAKR SAIYCLQNGKKVKWEIKNNRVRVYLPQNLKTEVNALAFSFARQAIQ >gi|226332012|gb|ACIB01000044.1| GENE 125 144886 - 147033 2201 715 aa, chain - ## HITS:1 COG:BH2955_1 KEGG:ns NR:ns ## COG: BH2955_1 COG1884 # Protein_GI_number: 15615517 # Func_class: I Lipid transport and metabolism # Function: Methylmalonyl-CoA mutase, N-terminal domain/subunit # Organism: Bacillus halodurans # 29 587 22 582 582 850 73.0 0 MRKDFKNLDIYAAFQPANGAEWQKANGIEANWKTPEHICVKPVYTKEDLEGMEHLDYAAG LPPYLRGPYSVMYTLRPWTIRQYAGFSTAEESNAFYRRNLASGQKGLSVAFDLATHRGYD PDHERVVGDVGKAGVSICSLENMKVLFDGIPLNKMSVSMTMNGAVLPIMAFYINAGLEQG AKLEEMAGTIQNDILKEFMVRNTYIYPPAFSMKIISDIFEYTSQKMPKFNSISISGYHMQ EAGATADIELAYTLADGLEYLRAGVAAGIDIDAFAPRLSFFWAIGTNHFMEIAKMRAARM LWAKIVKQFNPKNPKSLALRTHSQTSGWSLTEQDPFNNVGRTCIEAMAAALGHTQSLHTN ALDEAIALPTDFSARIARNTQIYIQEETYICKNVDPWGGSYYVESLTNELAHSAWEHIQE IEKLGGMAKAIETGIPKMRIEEAAARAQARIDSGSQTIVGVNKYRLEKEDPIDILEVDNT AVRKEQIENLKRLKEGRNQAEVDKALAAITECVKTGKGNLLELAVEAARVRATLGEISYA CEQIVGRYKAIIRTISGVYSSESKGDADFKRACELTEKFAKKEGRQPRIMVAKMGQDGHD RGAKVVATGYADCGFDVDMGPLFQTPAEAAREAVENDVHVVGVSSLAAGHKTLVPQIIEE LKKLGREDIVVIAGGVIPAQDYDFLYKAGVAAIFGPGTPVAKAACQILEILMDED >gi|226332012|gb|ACIB01000044.1| GENE 126 147035 - 148933 2170 632 aa, chain - ## HITS:1 COG:BH2956_1 KEGG:ns NR:ns ## COG: BH2956_1 COG1884 # Protein_GI_number: 15615518 # Func_class: I Lipid transport and metabolism # Function: Methylmalonyl-CoA mutase, N-terminal domain/subunit # Organism: Bacillus halodurans # 8 470 9 468 525 225 31.0 2e-58 MADSKEKLFSDFSPVSTEKWMEKVTADLKGADFEKKLVWKTNEGFKVKPFYRMEDLEGLK TTDALPGEFPYLRGTKKNSNEWLVRQEIKVESPKEANAKALDILNKGIDSLSFHVKAKEL NAEYIETLLNDICAECVELNFSTCQGHVVELADLLVAYFQKKDYDLTKLQGSINYDFFNK MLAKGKEKGNMVQTAKALIEATAQLPKYRVLNVNALTLNNAGAYISQELGYALAWGNEYM NQLTEAEIPAAIVAKKIKFNFGISSNYFLEIAKFRAARMLWANIVASYNPECLRDCENKG PNGECRCAAKMKVHAETSTFNLTLFDAHVNLLRTQTEAMSAALGGVDSMTVVPFDKTYET PDEFSERLARNQQLLLKEESHFDKVIDPAAGSYYIENLTVSIAKQAWEIFLAVEEEGGFY 
AALKAGTVQAAVNESNKARHKAVAQRREVLLGTNQFPNFNEKAGDKKPVEATCCCGGHDT CEKDVATLNFDRAASQFEALRLETEASGKRPKAFMLTIGNLAMRQARAQYSCNFLACAGY EVIDNLGFETVEAGVEAAMAAKADIVVLCSSDDEYAEYAIPAFRALNGRAMFIIAGNPEC AEELKAAGIENFIHVRVNVLETLKEFNAKLLK >gi|226332012|gb|ACIB01000044.1| GENE 127 149182 - 150846 1825 554 aa, chain + ## HITS:1 COG:STM3807 KEGG:ns NR:ns ## COG: STM3807 COG2985 # Protein_GI_number: 16767092 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Salmonella typhimurium LT2 # 28 552 18 548 553 385 41.0 1e-106 MDWLQDLLWNPNSVAHIVFLYAFVVAAGVYLGKIKIFGVSLGVTFVLFAGILMGHFGFTG DTHILHFIREFGLILFVFCIGLQVGPSFFSSFKKGGMTLNMLAVGIVVLNIAVAMALYFI LGGRIELPMMVGILYGAVTNTPGLGAAQEALNQLSYSGPQIALGYACAYPLGVVGIIGSI IAVRYIFRINFAKEEENWNQETDGTHHKPHLMSLEVHNEAIYGKTLGTISSFLGRPFVCS RIRKNGHVSIPNHGTILEQNDQLFVVCSEEDSDAIVAFIGREVQVDWEKQDMPMVSRRIL VTKPEINGKKLGMLNFRSMYNVNITRVNRSGVDLFANPNLILQVGDRVMVVGSEDAVERV ASVLGNSLKRLNEPNIITLFVGIFLGILCGSLPIAFPGMPTPVKLGLAGGPLVVAILIGR FGHKLHLVTYTTQSANLMIREIGIVLFLASVGIEAGANFVDTVIHGDGLLYVGCGFLITI IPLLIIGVIARSYYKINYFMLMGLIAGSNTDPPALAYSNQATGSDAPAVGYSTVYPLSMF LRILAGQMILLLMM >gi|226332012|gb|ACIB01000044.1| GENE 128 150934 - 151563 533 209 aa, chain - ## HITS:1 COG:no KEGG:BF3595 NR:ns ## KEGG: BF3595 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 209 31 239 239 425 100.0 1e-118 MNGNYVINIGRQLGSGGKEIGEKLAARLGIDFYDKELINLASEESGLCREFFEKADEKAS QGIIGGLFGMRFPFISDGAMPCTNCLSNDALFKIQSDVIRHLAANKSCVFVGRCADYILR EHPRCANIFISASQEDRIARLCRIHGISEEAAAEKMNKADKKRSEYYNYYSYKTWGAAAT YHLCIDSSVLGIDETVRFIEEFVVKKLAL >gi|226332012|gb|ACIB01000044.1| GENE 129 151729 - 153081 1242 450 aa, chain + ## HITS:1 COG:lin2192 KEGG:ns NR:ns ## COG: lin2192 COG0534 # Protein_GI_number: 16801257 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Listeria innocua # 12 438 13 432 443 201 30.0 2e-51 MKDSIDFGNMEIPRLFRKLLIPTVLGMVFSAVFVITDGIFVGKGIGSDALAAVNITAPLF 
MITTGIGLMFGVGASVVASIHLSQGKRKVASINITQALAFSALLILVLSALCCYFAEPIG RLLGSSERLLPLVVEYINWYVPFLVFYLLLSAGMFYIRLDGSPNYAMMCNAVSAIINIIL DYVFIFQLGWGMMGAAFATSLGTMVGGLMTLIYLLRFSRNVGIYRIKLSWKSMRLTCRNI GYMIRLGSSAFISEASIASMMFLGNYVFISHLGESGVAAFSIVCYFFPIIFMVYNAIAQS AQPIISYNFGQQNPVRVARTIRLALKTALGCGIFFFAATLVFNHQIVGLFIDKSYQAYDI AVNGIPYFAVGYLFFALNIVGIGYYQSIERARRATVITLFRGTLFMLAGFLLLPPVLGVR GIWLAVPLAELLTLLLIIGIYLKDSFTVRR >gi|226332012|gb|ACIB01000044.1| GENE 130 153122 - 153691 487 189 aa, chain + ## HITS:1 COG:all4541 KEGG:ns NR:ns ## COG: all4541 COG0664 # Protein_GI_number: 17232033 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Nostoc sp. PCC 7120 # 2 187 3 187 193 68 29.0 9e-12 METFIEKFRNSYHLSENDTQTLLSYMEEIRFKKKEVIVHEGSKNGNLYLIKQGIWRAHYL KDGVDTTIWFAGAGEAAFSVWGYVENTASHITIEVMCDSIAYCIPGSTLNNLYASSLGLA NLGRQLMERQLLSLENWLISAGSPKAKERYLTLIKEHPELLQNVPLKHIASYLWITPQSL SRIRREMTM >gi|226332012|gb|ACIB01000044.1| GENE 131 153976 - 154467 344 163 aa, chain + ## HITS:1 COG:no KEGG:BF3806 NR:ns ## KEGG: BF3806 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 163 1 163 163 275 98.0 3e-73 MGLIYLVRKKKFRTAEGIRELYFAIQRKLQKRGGKNEEDLAEILSANSSRSKGEVLSILT DLPDVIEEILKNGESVSIRGLGSFHASITSNGFEHPEDVLPHEVRVSKVYFIADRKFTQR VSRMKFFRYPLSKYFPKDLLRPETIREEETREEEPEFIPDDDE >gi|226332012|gb|ACIB01000044.1| GENE 132 154685 - 156895 2085 736 aa, chain - ## HITS:1 COG:PA3339_1 KEGG:ns NR:ns ## COG: PA3339_1 COG1752 # Protein_GI_number: 15598535 # Func_class: R General function prediction only # Function: Predicted esterase of the alpha-beta hydrolase superfamily # Organism: Pseudomonas aeruginosa # 26 291 22 293 308 193 41.0 1e-48 MKKRICFVLILCISLFVFSPVHAQQRKSVAVVLSGGGAKGVAHIGALKVIEEAGIPIDYI VGTSMGSIIGGLYSIGYTPHQLDSMVNHQNWPLLLSDRISWEDQTMTERQNSETYVLSVP LKKNLKANVFGGVIKGQNLANLFSELTVGYHDSINFNKLPIPFACVSENIVNGDEVVFHN 
GVLATAMRASMAIPGVFTPVRLGDKILVDGGMKNNFPTNIAHAMGADVIIGVDVQNDLRT ADELNNLSEIFNQIINLTGQTRYEENIKLATVYIKVDVKGYSAASFNIPALDTLVNRGEE AAREQWTALQKLKKEIGLPENYVAPRHGPFSSLWSSKDIFVKEITFDGIEDSDKKWIMHR CHLKENSKMRMEQLYEALTTLRGSQAYSNVSYKLTDTPQGYQLHFILEEKYERNLNLGIR FDSEEIASLLLNVRSRLDTRVPSWVSVTGRLGKRYLARVEYTLAPMQMRNFNFAYQFEYN DINIYDHGRRSYNTTYKYHSGEFGFSDVWFRNLRFGAGLNFEFFKYKDFLYNTGGQRLEV KPQHFFSYFAQLHYNTYNKGYFPSKGTDVQGRYSLYTDNLTHYKGHAPFSALAASWAGVF SLTDRFALIPSLYGRVLIGKNIPYPYLNAMGGENFGHYLPQQLPFAGITNLEIVDNSVLV TSLKLRQRIGSKNYVTFTGNVAFRNDNFFDIWGAKPVWGGSVGYGYDSLFGPLEASLGYS SRSHKVGFYVNLGYVF >gi|226332012|gb|ACIB01000044.1| GENE 133 157084 - 158445 1247 453 aa, chain - ## HITS:1 COG:CAC0883 KEGG:ns NR:ns ## COG: CAC0883 COG0534 # Protein_GI_number: 15894170 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Clostridium acetobutylicum # 7 434 4 432 448 325 39.0 9e-89 MTGQKTPTALGTEKIGKLLMQYAIPAIIAMTASSLYNMVDSIFIGHGVGPMAISGLALTF PLMNLAAAFGSLVGVGAATLVSVKLGQKDYDTAQRVLGNVLVLNIIIGLAFTILTLIFLD PILYFFGGSDETVGYARDYMKVILYGNVITHLYLGLNAVLRSAGHPQKAMYATIATVVIN TILDPLFIYGFGWGIQGAAIATILAQVISLMWQFRLFSNKEELLHFHRGIFRLKRKIVFD SLAIGMAPFLMNLASCFIVILINQGLKKYGGDLAIGAFGIVNRLVFLFVMIVMGLNQGMQ PIAGYNFGARLYPRVTRVLKLTIYGATIVTTTGFLMGMLIPGLAVSIFTSHEELIRQSAE GLRIVVLFFPIVGFQMVTSNFFQSIGMASKAIFLSITRQVLILIPCLLILPRYFGQTGVW VSMPVSDLIASLISAGMLWWQFRLFRIHDRQAA >gi|226332012|gb|ACIB01000044.1| GENE 134 158583 - 160310 1970 575 aa, chain + ## HITS:1 COG:BS_lysS KEGG:ns NR:ns ## COG: BS_lysS COG1190 # Protein_GI_number: 16077150 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Lysyl-tRNA synthetase (class II) # Organism: Bacillus subtilis # 5 499 10 497 499 476 50.0 1e-134 MNILELSEQEIIRRNSLNELRAMGIEPYPAAEYVTNAFSTDIKAEFKDDETPRQVSVAGR MMSRRIMGKASFIELQDSKGRIQVYITRDDICPGEDKEMYNTVFKRLLDLGDFIGIEGFV FRTQMGEISIHAQKLTVLAKSIKPLPIVKYKDGVTYDSFEDPELRYRQRYVDLAVNEGVK DIFIKRSKVYSSMREYFNSKGYMEVETPILQAIAGGAAARPFMTHHNALDIPLYMRIASE LYLKRLIVGGFEGVYEIGKNFRNEGMDRTHNPEFTCMEIYVAYKDYNWMMEFTEKMIEKI 
CLDVNGTTEVKVGDNIINFKAPYKRVTMLGAIKEHTGYDLTGMNEEQIREVCKKLNMEID DTMGKGKLIDEIFGEFCEGTYIQPTFITDYPIEMSPLTKKHRDNPELTERFELMVNGKEL CNAYSELNDPIDQLERFEDQMKLSEKGDDEAMIIDKDFVRALEYGMPPTSGMGIGMDRLT MLMTGQSTIQEVLFFPQMRPEKVVPKDSASKFMELGIAEEWVPVIQKAGYNQVADMKEVN PQKFHQDICGINKKYKLELTNPSVNDVAEWIQKIK >gi|226332012|gb|ACIB01000044.1| GENE 135 160410 - 161405 981 331 aa, chain + ## HITS:1 COG:BS_gpsA KEGG:ns NR:ns ## COG: BS_gpsA COG0240 # Protein_GI_number: 16079340 # Func_class: C Energy production and conversion # Function: Glycerol-3-phosphate dehydrogenase # Organism: Bacillus subtilis # 6 314 3 312 345 154 31.0 3e-37 MKLPGKIAIMGGGSWATAIAKMCLAQEESINWYMRRDDRIADFKRLGHNPAYLTGVKFDM KRINFSSNINDVVKESDTLIFVTPSPYLKAHLKKLKTRIRDKFIITAIKGIVPDDNLIVS EYFNKEYGVPPENIAVLAGPCHAEEVALERLSYLTIACPDKDKARVFARRLGSSFIKTSV SDDVIGIEYSSVLKNVYAIAAGICSGLKYGDNFQAVLISNAIQEMNRFLNTVHPINRNVD ESVYLGDLLVTGYSNFSRNRTFGTMIGKGYSVKSAQIEMEMIAEGYYGTKCIKEINKHHH VNMPILDAVYNILYERISPMIEIKLLTDSFR >gi|226332012|gb|ACIB01000044.1| GENE 136 161453 - 162790 1527 445 aa, chain + ## HITS:1 COG:BH3343 KEGG:ns NR:ns ## COG: BH3343 COG0166 # Protein_GI_number: 15615905 # Func_class: G Carbohydrate transport and metabolism # Function: Glucose-6-phosphate isomerase # Organism: Bacillus halodurans # 2 445 5 449 450 494 54.0 1e-139 MISLNIEKTFGFISKESVSAYEAQVKAAQEALENGTGKGNDFLGWLHLPSSISKEHLADL KATAQVLRDNCEVVIVAGIGGSYLGARAVIEALSNSFTWLQEKKTAPVMIYAGHNIGEDY LYELTEFLKDKKFGVINISKSGTTTETALAFRLLKKQCEDQRGKEMAKKVIVAVTDAKKG AARVTADKEGYKSFIIPDNVGGRFSVLTPVGLLPIAVAGFDIEQLVNGAADMEKACGADV PFAENPAAIYAATRNELYKNGKKIEILVNFCPKLHYVSEWWKQLYGESEGKDNKGIFPAA VDFSTDLHSMGQWIQEGERTIFETVISVDKVNHKLEVPSDEANLDGLNFLAGKRVDEVNK MAELGTQLAHVDGGVPNMRIVIPELSEFSIGQLLYFFEKACGISGYLLGVNPFNQPGVEA YKKNMFALLNKPGYEEESKAIQARL >gi|226332012|gb|ACIB01000044.1| GENE 137 162875 - 163381 511 168 aa, chain - ## HITS:1 COG:no KEGG:BF3605 NR:ns ## KEGG: BF3605 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 168 1 168 168 297 100.0 8e-80 
MKKLFFPLLSLCLLTTACKDSVPSPTIEMQGLYVNDGEMDYTRQMEALPPLKVDDELEIS LKLDGNGEELNTFLVKEETPGSKETAVEIDFNDLPEETLSDDKEFTDKDNGKLGFKDGVS QTQIGIRAKVQQVTEDGVKLKFYLFSKPVNCEGAKYELEIKTSDKQDD >gi|226332012|gb|ACIB01000044.1| GENE 138 163479 - 164207 816 242 aa, chain + ## HITS:1 COG:MA0451 KEGG:ns NR:ns ## COG: MA0451 COG0637 # Protein_GI_number: 20089342 # Func_class: R General function prediction only # Function: Predicted phosphatase/phosphohexomutase # Organism: Methanosarcina acetivorans str.C2A # 20 238 2 212 218 122 36.0 6e-28 MFQEAIAQYLKQNHYESIQLKSVLFDMDGVLFDSMPYHAEAWHKTMKAHGLNLSQEEAYM HEGRTGAGTINIVCQRQLGRDATQEEIESIYLEKSIEFNKHPQAERMPGAWELLQKIKAE GIIPTVVTGSGQASLLDRLEHNFPGMFRQELMVTAFDVKYGKPNPEPYLMALEKGGLKPN EAIVIENAPLGVEAGHKAGIFTIAVNTGPLNGEILLNAGADLLFPSMQALCESWEKLVRL LH >gi|226332012|gb|ACIB01000044.1| GENE 139 164779 - 165771 497 330 aa, chain + ## HITS:1 COG:no KEGG:PG0838 NR:ns ## KEGG: PG0838 # Name: not_defined # Def: integrase # Organism: P.gingivalis # Pathway: not_defined # 94 322 195 422 432 95 28.0 3e-18 MESYLQKRKLSEVRERNFQVLLRSLKRYELFISACYKKDFKLDINKIDTDTIEDIESFLR NEHTLYNEYPEIYEKIPAVIGTIRKAPKPQPRGNNTICALFSKFRAFYNWCNKQGITNNR PFEKYSGNTTEKYGTPFYLTLDERNIIADFDLSARPQLAIQRDIFIFQCLIGCRVSDLLK MTQGNIINEAVEYIPHKTKDERPAVVRVPLNGRAKELIEKYKGIDTKKRLFPFISAQRYN DDIKDILRLCGIDRYVTILNPTTGKEEKRPIYEVASSHMARRTFIGNLYKKVKDPNLVGS LSGHAEGSRAFTRYREIDDDIKKELVDMLE >gi|226332012|gb|ACIB01000044.1| GENE 140 165915 - 166106 175 63 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566200|ref|ZP_04843654.1| ## NR: gi|253566200|ref|ZP_04843654.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 63 1 63 63 104 100.0 2e-21 MLLRDKDGISFRYTSNQEGEVYKKIANSKKLRQLILNLTFYFKSMIELFPKRKDQGLELS SQI >gi|226332012|gb|ACIB01000044.1| GENE 141 166464 - 168341 528 625 aa, chain - ## HITS:1 COG:no KEGG:CHU_3478 NR:ns ## KEGG: CHU_3478 # Name: not_defined # Def: hypothetical protein # Organism: C.hutchinsonii # Pathway: not_defined # 1 520 1 546 676 98 23.0 1e-18 MSSILEQCYDQFIEEFPESWLPNGNEEEVVFFSKNIQIESFFETCFILLSKSIISGEYIN VPNFLDVLNNFLEKTTVEYAPPSLSGAVNEKVDKLLSRYRDLNYSIYNALKHYNYFVTIS KNKFDTEEDRYEYGFYKLKNIKSTDIILKLFSEITMPLCLFDYRFPTSEDEFQKLLFSRD RLMEYISEGSSERRAILSVLLHKCHFIIRKIKETPLYINSESNIVCLNPKELDIGYYDEF VIEECSSESKVSELLDNINSANPKLKSFVLLMKYYKQNFIDKSDIKKMDLVLKQFSNIYQ IRRNTQAFISPSNSIEKYDRFSLNSIFNFLYNCRFSFYVQKCEPNLKQIKEELRHIENIQ ARTGVKNFHPYEKAIEAIIRCIELHIGKEDFDERLIEDKLEELSRVIDLYKEAYEWSRAH QFFPFQLPFEESMYSAGNESINLFVPSAYARYIDYNTLKERLEQFNRTKEYLRFRCDLSM ERKEITQIKNDIKTSDKKAYDLIAIFTAAITFLFGIVNIFINNTTLDLYQLTANTIGFGI LLLLFASLYLFISPLLIQRINWRQYLKTGRFIAGIVFIAIYVILVFTLAETSRSVIDKIG QTESIVKDSLHNKQKLEMQQIKVAQ >gi|226332012|gb|ACIB01000044.1| GENE 142 168353 - 168877 455 174 aa, chain - ## HITS:1 COG:no KEGG:CHU_3477 NR:ns ## KEGG: CHU_3477 # Name: not_defined # Def: hypothetical protein # Organism: C.hutchinsonii # Pathway: not_defined # 3 173 6 174 176 77 35.0 2e-13 MNKIYAFDYMLFLFEEWYNEENKEQNRGFENCSKLSMLKLLFLAAAPKGEGTGDLLNIFN KFFALPYGPVESDIYNAIQGNNLPSYTITERSITKKVIDTLPYRIEDYVKVEEAVNALKV KNKHLILLNAFDLVDITHKWESWKQSIDFAKLMEMSSYKMTVESIRNDRNKYFE >gi|226332012|gb|ACIB01000044.1| GENE 143 169679 - 170398 255 239 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566203|ref|ZP_04843657.1| ## NR: gi|253566203|ref|ZP_04843657.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 239 1 239 239 478 100.0 1e-133 MRKLFYATVAFVAFLIGLFLFMFASFTSCTSNNELTPPEMYVDVNEITLSTLWVDVDWPP SVHKINVYGTRAVSFKSENEKIARVSPDGKVVGMRAGSTNIVVQGDLKSIKVKVNVIPRP SNFLEPLYNFTLTKKELIQQKGDGYDMQVDPDIFYYRWGGVSPVGEYYFFDKQTGSLFTS YLIVNKSKVTEQDLYIFFEERYLALGKGGWKSLDGRLIVQISEYDDQHYKIKYSAKKEE >gi|226332012|gb|ACIB01000044.1| GENE 144 170533 - 170997 186 154 aa, chain - ## HITS:1 COG:no KEGG:BF2444 NR:ns ## KEGG: BF2444 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 154 1 149 149 109 45.0 4e-23 MKSLPYIIIIILVLFIVFRPTRVERVPGEVVRDTIITNRIDTVWGTVPIPVYESVVDSFP FVVPVPVPGDTVRDTVYLPITQKIYKDSLYTAYVSGYRAKLDSIEVYSKTRTMFIRERAK RKRFGLGVQAGYGFSGNKATPYVGIGVSYNLWEW >gi|226332012|gb|ACIB01000044.1| GENE 145 170994 - 171488 431 164 aa, chain - ## HITS:1 COG:HI1494 KEGG:ns NR:ns ## COG: HI1494 COG3023 # Protein_GI_number: 16273395 # Func_class: V Defense mechanisms # Function: Negative regulator of beta-lactamase expression # Organism: Haemophilus influenzae # 45 149 1 98 116 91 41.0 5e-19 MKTIDSIIIHCSATRFGQDLRAKDIDRMHKQRGFNQIGYNYVIDIDGTVENGRPLSVDGA HCNTKGDSGRSYNKHSIGICYIGGLDVNGKAADTRTEAQRIALRDLVEKLCRDYPIIEVL GHRDTSPDLNDNGEVEPFEYIKACPCFDVRKEFSNFMKPVIIRP >gi|226332012|gb|ACIB01000044.1| GENE 146 172374 - 172808 455 144 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566206|ref|ZP_04843660.1| ## NR: gi|253566206|ref|ZP_04843660.1| predicted protein [Bacteroides sp. 3_2_5] # 1 144 1 144 144 276 100.0 4e-73 MKNLKMIALIALPLSPLLELFERYVFGDWEFVKWLIVLVCVDTVLGFVKHWLSKDISSKA YGMIGRKLIIYSCVLILSHVMGNFSIAGQVVDSFVWFRYFACTALMIREALSIIENVEEI CPGFFPKAIINKLKGFDNVSGKKE >gi|226332012|gb|ACIB01000044.1| GENE 147 173097 - 173429 255 110 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566207|ref|ZP_04843661.1| ## NR: gi|253566207|ref|ZP_04843661.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 110 1 110 110 199 100.0 5e-50 MFQEESRTVQVNGKAVSGDYQYNVNYSVNNDNLSRLHCEIIKTVTEEIDTPTGKQPVTSG RYIGYLLLESGSKQMSLPESENVAAHFEVFDQITKEVKATLEPKPASKSK >gi|226332012|gb|ACIB01000044.1| GENE 148 173433 - 173717 357 94 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566208|ref|ZP_04843662.1| ## NR: gi|253566208|ref|ZP_04843662.1| predicted protein [Bacteroides sp. 3_2_5] # 1 94 1 94 94 170 100.0 3e-41 MKINFRRIKVKTAIDGEIKEFDVAKTVGNAIYCNTPDLGELEFAQRIYKEGEVEVDEQGA NIIRNYVDPAPILAVVKTAIYNELDKVIMNSQNQ >gi|226332012|gb|ACIB01000044.1| GENE 149 173768 - 178231 1618 1487 aa, chain - ## HITS:1 COG:no KEGG:BF2447 NR:ns ## KEGG: BF2447 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 4 157 2 172 1324 181 70.0 3e-43 MKDVKIKTTSIPAKPRSKNYPAGAVITRTAGGITVNGGGGGGASVDIVKATDTKSFTDSN VLSSLRTLLEIRSRIIAESDTTTELTDDNTLSSKRTLKEIDAAIKEALKKIDDLYLSKVK ADIAKEPITFLKGLFVGDGLTFINESGDTELQSLVARMKVKAATLEVTGSANVGTLHSEG NISTGADIWAKGDTHTLNLLVQALAKTYDLNVEHVATLFQTIVKDYISSERFIPGLMGEG MKLYKAINGDWNLEIDNAVVRKAMTIFELIISKVRAVNGGLVISSANGRVKSVSETSGDP AYYVLGIEGDMMFVADDLVRCQVYTSGHVKYYWVPVASVNDDSILILKSVFPNGTTPAVG DDLVQMGNLTNPNRQGILYLTASEDGKPRISVLDGVNSTSLAGKNKVILGCLDGMTDTDF PADFQPSGYGLYAMNCFLKGIFILRNGKSIEQEFSNIATELAAIPGKIELAIRSMKVADV NLLYDSNHKLNANPYQMGSYKYDVHLEAGKTYTLTVCYKCADSDVIRAYNNPSYGWIGTL PKSAEETVLSQPITPINPDGAYFYFYKFPQQESTETYIKWAVITEGSVGVANWIPSATER KLNIGGENLMLQSQQALDGSGAQYAFQLSKAWTDLKGKTLTISFDYAYSNLKMGSSQRFG LEKAIYKSGTSQYYYIGAFKYVDSTSPTADKGRYVHTIKVPEDIEDSLDTDIIAYIQLGA GSVCRINNFQIEIGDTATGWKPAPKDSFTESKKYTDTQILAVDGKIELSVKTKVENLGIG ANNLYSYTSSLLNTLYPSPTIERQMSLHGFYLVGSQGNGGAMRIPNIIPPIPGKYTVSGW IKGSQNTPVGFTIDVCDSENVIVKSTADNQWSYFKHTFNVTKNTEEQKDVYNFVDIERID WAYIWVKDFKVEAGEIATAWSPNFQDAVYKGAEYTNSQISVVEGKITSTVEKINTVDGRV TGLASRVEQTEKSITSVVGDIGVINSTTNRHISKRIDLRGWDNNKFFPLVISIPVYHKTR VEISRPLDAGYGKPSYGTHDGGFSMNLTFEMSGSGWGSLPAVTNIFDYTKAWTSAGAKIV VDLGQITETSTCRMGIRGGSMYDVTVDDTIDPNVINVYQTDYHGSYNTSFPVRTDGTEPV RTYGYYTEIKQTQESIALTANKVDDQGRRLSAAELTLSSDHAKLSVVEQAANSANSLAGT 
ANNKAEAADGRVTATQNGLVETGINITSRKIILKADNLLFQNNTGQQTAAINANGKLSAN VIEAAEVVAQAFSAQRITTGNLTVTDGAKIGAWNISGGSLVSASNSQAKILLNMSGNKFL RINEEGDSPTTSRTALMSIRNDNYSGLSIESYGSSGFALRCLANAGTANSIESYGSHIFA QRGGEKWNAPGMLCTGYVYQAGTVTNEWGNGCTLTSAQKIATGKYRIYHSLKHLQYAVLV QGLGGYGWVFGQVETQNNSYFEVLMLDANKGPRDCPFRVFVVGRNVW >gi|226332012|gb|ACIB01000044.1| GENE 150 178228 - 180276 987 682 aa, chain - ## HITS:1 COG:no KEGG:BF2448 NR:ns ## KEGG: BF2448 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 681 1 689 693 690 52.0 0 MSYGLIYTIPFASLRNKSCIIEIEKEGYVGAPTELVGAGNPFTVDIDDDDFLYVPSRFST ANIRIVGSDYLQSLFSTAYQQYRVTFKRDGVVTWCGFIKPELYTQDYSSTIFELELECVS AMSALEYIDYKPKNGTERGFVTLWELLTRCVSESRGCYSNVYIPHVYAKDKSNYTAWTNV LKDMMISEQNFFDEDDKPMKLKEVLEEICKFLNWTCVDWRGDLYFVDVDHAGDYYKYALD FSAYATVRGFTINVQKVGFSGDNHTLDILGGYNKVTVKDSNYPVGNLLPEESYEDAKVLS SRLNTNKDRKCYRQFLYPKNWNMYLYDGDTVITNDDLELRAYDAHKLIGGIQERYCNYKI VDGKPDISDYSFTNVIQARCLGAVGDLSMIGGLELLTKIMDFKGASSVYESGAFAVSGSY KTIADMDLIPWDNSRGTYMPLAACQLRIGNKYYGSTNGLAPFAWSANPNYFFRLPASEEN NKARLDYVSIENQKTIYMPYKGVSGVIIPIDTLLYGELEFTLYASKIHNAIFINGFLLKD FSFKYGKSTEAEKTTDNTDRYYENVVNEDYINELDEIEFKISSYNNDGACYSKVMIGEDY LRDNLYSVLVDRAIRPEEHLIQRIINRYSTTRIKLTQEIEETIGLTPISRLSDKSLVNKI FINAGGSIDYKMEQFRCIMIET >gi|226332012|gb|ACIB01000044.1| GENE 151 180273 - 183365 2480 1030 aa, chain - ## HITS:1 COG:ECs2240 KEGG:ns NR:ns ## COG: ECs2240 COG5281 # Protein_GI_number: 15831494 # Func_class: S Function unknown # Function: Phage-related minor tail protein # Organism: Escherichia coli O157:H7 # 40 276 66 304 1080 71 26.0 8e-12 MAGRLSFSIAINLLTENFKRGTNSVKNGLRVMQMQVLTFAAALGAGGLGLSNFVSRLIDV ARETSRVTTALKNVSGSMVRFADNQRFLLDMAKKYGIEINALTGNYAKFTAAASISGMSM MDQRKIFESVSRAVTAFGMSAEDSNGVFLALSQMMSKGKVSSEELRLQMGERLPIALQAM AKAAGVSVGGLDKLLKQGKLMSKDVLPKFAEALDKMIPNVDTDNLETSVNRLKNAFTEFV NGTEVQSKYKALIDWLTNAVKVAADNIRSVITYTVAAIMVMVTSRLVNKILLSISRAELA AKSAARRAAKDAGQKFNEIAWKAQRTSASIKMAFSKAAMSIRATLISMAPTAILTVIGAV VAKLYNAYRESKRIKGLFDEYQKRMNDVPSKTPEIIKIRALQEEYNKTNVTLSDKKRILA 
QINGILGTELIVNQDVNKVIEKRISLLESAARAELAAKEVADSENELGKIGGKSYNGKTI RSMAPDWAMARGDLVKEERFKKKYDVHTQDALGWENGLKDDLNAFIEHAKILKDAKGRLG KEIANSVATADSTPPEPDSKKTELQKAEEKYAKSLRELDARREVEKMSESEYYKAVDELG RKMLIEAKASGDKEILNSKYLKMLQDVIDHPLYDEAAAEMEKVQKEYNDKVKENKTLLSK GLISQKAFNEHLAGLSVEAAKSAASIKGIGERADAFIKDMLDQAISHIPSVKMKSRDTTF DYKKSKVDIASENLDKAKEYAKELQEQAKKVGKELSDELSNAIANVPTLEEALKLAKVKE DVKKFTKELDESLYSGIKDIATSSDRVVSAFTSLRDVMNDVDATGWEKIMAIWNAMINTI DSFTSIVRTIENISVLAKKLAGAKEAQQGLEKSTAGTVAGTVVKIAADEVATKMELENSQ KKSAAAVTEMASKSTAAYAGIPFVGAALAAGQIATMMAMIEAARISAPGFNSGGIYLGGT SFGDKGLARLNKGEMILNMTQQSNLFDAINSGNLGSSNRVQIEFGKAKVLGPDILLSINN TLKKQGKKPL >gi|226332012|gb|ACIB01000044.1| GENE 152 183367 - 183984 516 205 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566212|ref|ZP_04843666.1| ## NR: gi|253566212|ref|ZP_04843666.1| predicted protein [Bacteroides sp. 3_2_5] # 1 205 1 205 205 357 100.0 2e-97 MEARLTIKAVIRWEQLRGKSFSLMDYSDKEDVNALLYTSTIVAKGEVYTFDVFKKTLSNR KLVREMVLSLENRMSVLAQFQNKRAGTDKINSDTTPGMIGNIVSTLIMSGLDATYALEEM ELCDLPMYIEAYERKRKEEMEASRLWTFFTMLPHIDSKKMKNGAMDLITFPWEEVEAARE AERAINEDIDRFEQFMKEGKKLINK >gi|226332012|gb|ACIB01000044.1| GENE 153 184051 - 184530 432 159 aa, chain - ## HITS:1 COG:no KEGG:BF2452 NR:ns ## KEGG: BF2452 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 5 151 1 146 160 72 31.0 5e-12 MATKLDSSKDIYRGELMLFIGDEPIAFASSCGLDVSTEEIDISNKMMGDWAGSLPGKKSF TLSSESLLTRKEGAMSFDTLLSKQIAGEVLDFFLGSPASADKDNFGGTFTKDTKQKNYTG KVIITSLSIKSDNGQIVSCSASFKGIGALAPVEPVGVGG >gi|226332012|gb|ACIB01000044.1| GENE 154 184562 - 184930 252 122 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566214|ref|ZP_04843668.1| ## NR: gi|253566214|ref|ZP_04843668.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 122 1 122 122 225 100.0 6e-58 MNKFKVTTEVRTILQDSLGIKTMVGDKIFPLVAPNGTEGDFIIYQRDGFKQEYTKMAVAR QVPTIFVTAVSDNYTRSQELASLIYDALEGDFVDPVMKIRMEDSTEDYESGKYFQVLQFS ID >gi|226332012|gb|ACIB01000044.1| GENE 155 184930 - 185385 174 151 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566215|ref|ZP_04843669.1| ## NR: gi|253566215|ref|ZP_04843669.1| predicted protein [Bacteroides sp. 3_2_5] # 1 151 1 151 151 266 100.0 2e-70 MIQIRTIDRENIIYLVDQLETFEKDKAIKSGLRAAVNVFRVRGRSNLRSRLLHHGKQTGH LMNSFTTRVKRNKLGALAGFDRPGGNHSHLVDAGTRARTTTGKKSVRAGASRGLMPANRF WEDAKVSEEKKAMDALYAGIERAVQRINDRG >gi|226332012|gb|ACIB01000044.1| GENE 156 185382 - 185705 253 107 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566216|ref|ZP_04843670.1| ## NR: gi|253566216|ref|ZP_04843670.1| predicted protein [Bacteroides sp. 3_2_5] # 1 107 1 107 107 194 100.0 2e-48 MKAGLLREILEFREEVKSQDLNGFVSNRYETVLTCKASRRKMSAVADKSGVNAMEQFIGS IIVFQVRNYPAIKENQRVVYRGVEYAIKMIDPQRDNTLVITLEKLNI >gi|226332012|gb|ACIB01000044.1| GENE 157 185702 - 186010 249 102 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566217|ref|ZP_04843671.1| ## NR: gi|253566217|ref|ZP_04843671.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 102 1 102 102 183 100.0 3e-45 MKYVSLDLAKKHLYIETEYTDDDSIIGVYVAAAEGAVANHIRRDLDTLEDSEGKLPDPIL SAILLVAGGLFRDREVNFVAERARDKVGLLDYLLQPYIDYSK >gi|226332012|gb|ACIB01000044.1| GENE 158 186014 - 187258 1108 414 aa, chain - ## HITS:1 COG:no KEGG:NMC0858 NR:ns ## KEGG: NMC0858 # Name: not_defined # Def: putative phage-related protein # Organism: N.meningitidis_FAM18 # Pathway: not_defined # 10 409 218 619 627 138 29.0 4e-31 MPREKSITDLKDERTQLSIRAKAITDGARAEKRMLNEGENTELGEIQCRMTDINMEIATK EAENRGKGTPHVEPGQERFSLRRSLANYISGQGQHDADASVIEAATRLHNSAGVTRSSQN SLVIPMSLEKRAMFTAATESATGVVIDQEQQELLLPLQSSLVLAQAGARFMTGLQGDIYW PKYSGSNVFWEGENAKAKDGAGQFSKGDAYKPKRLTAYVDISEQLLVQENTSVEAIIRQT LAAAIAQKVEQTAFGTHAHNDNTPDGLFQTVPAINGVMDWAKIVELETNADINNALFGNL AYIMHPSLVGKAKTKVKDASGAGGFIFGDKGEGTLNGYKALRTNNLPKGLQTAKDEFGIV FGNWNDYFIGQWGALEIKVDPYSRMLEGVVRLVINSYWNMGMIRPESFSIASMK >gi|226332012|gb|ACIB01000044.1| GENE 159 187269 - 187865 252 198 aa, chain - ## HITS:1 COG:STM2236 KEGG:ns NR:ns ## COG: STM2236 COG3740 # Protein_GI_number: 16765564 # Func_class: R General function prediction only # Function: Phage head maturation protease # Organism: Salmonella typhimurium LT2 # 4 155 3 158 172 77 38.0 2e-14 MDEKREIRNTAYQVVSDEEKRTVEGYALLFGVSSDGLSFEEVIEHGALDGVIEKSDVFAL LNHDQSRGILARCNRGTGSLTLSIDSKGLRYRFEAPKTGLGDELMENIRRGEIAESSFCF DVEEETWEKKSDGTWKRTILKIDHLYDVAPVYNAAYSKTSVYMRGKEQAEEDFRKQEEQR KSGELDEYYENIEKLFNN >gi|226332012|gb|ACIB01000044.1| GENE 160 187868 - 189118 810 416 aa, chain - ## HITS:1 COG:RSc1682 KEGG:ns NR:ns ## COG: RSc1682 COG4695 # Protein_GI_number: 17546401 # Func_class: S Function unknown # Function: Phage-related protein # Organism: Ralstonia solanacearum # 20 397 24 401 407 160 28.0 6e-39 MKIPILNIEIRKASKQEVSNIAAWSSGGRSLLLSRDKPMLLSTVYRCVDLISDSVAVLPL KTYQLDEEGFKKECKWHPAYHVLNTEPNEDMTRYVFFKTLMASVLLTGNGYAYIERDGTD LQLIYVPSFQVGIEWIVDAKGIRRKRYRITGFKDLVQPKDMIHVLNFSYDGIIGVSTLTH ARQTLGIASDSEAHAAGFFKGGGNVAGILAFEGRLDKKQKDQIYETWENRTSSVGGKPNG IAVLEGNMKYQPITISPKDSQLLESREFNVVDLCRFFSVSPVKAFDLSKSSYSTVEATQL 
QYLTDTVLAVITKIEQEINRKVFLKSERGRILAEFDTSAILRTDKKAQAAYAKDMFYVAG MTPNEIRRENNLPRLENGDKAFVQVNTQTLDRAVADPVIDKNSKLSDSSVVNEEKD >gi|226332012|gb|ACIB01000044.1| GENE 161 189159 - 190781 836 540 aa, chain - ## HITS:1 COG:ECs1598 KEGG:ns NR:ns ## COG: ECs1598 COG4626 # Protein_GI_number: 15830852 # Func_class: R General function prediction only # Function: Phage terminase-like protein, large subunit # Organism: Escherichia coli O157:H7 # 1 530 1 528 553 286 33.0 7e-77 MKGYYQYAADVRDGKIVVGEFIKQAVERFYVLFERDDIDFRENRADYAIEFISLLRHYTG RHAGKSFTLLPWQEFAVASIYGFYKKDEDGSWCRLVSSVYIEMARKNGKSAFAAALCLYH LIADGESAAEVYLAANSKDQAKVSFTMCRNFVSGLDPKHRYLVSFRDQINFDKTLSFLKV LAADSSKLDGPNPSMFLLDEYHAAKNSGLKDVLQSGQGMRDDPMSIIITTAGFDKLGPCY QFREMCTEVLKGLKEDDTLFALIYALDEGDDWKNEKVWGKSNPNLGVTVKPKYLREQVQK AINSPSEEVGIKTKNINMWCDAETVWIPDHYILNASANLDFEQFRDMDCYAGIDLSSTSD LTCMSFMFPTQDKYYFKTLYYLPEAALQEKRFKDLYGDWRRQGLITITPGNVTDYDYILN DLMRIREIVFIQKVAYDAWNATQFVINATDQGLPMEEFSQALGNFNRPTKEMERLLLSGR AVIDNNVINRHCFRNVIMARDRNGNTKPSKQFEEKKIDGVIAKLEALGIYLMSPRYGEFY >gi|226332012|gb|ACIB01000044.1| GENE 162 190778 - 191161 393 127 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566222|ref|ZP_04843676.1| ## NR: gi|253566222|ref|ZP_04843676.1| predicted protein [Bacteroides sp. 3_2_5] # 1 127 1 127 127 217 100.0 2e-55 MVKFVMPDNLSDETQKFIKDVVKELNARKAIQNIDLGAIRMLATSYEMYMQATDILLKEG PVIEIKYEKAANPAQNIATKNYAQVMKIMTEYGLTIKSRGNIKAMKSEDKNDSPLDQFLK KGARERR >gi|226332012|gb|ACIB01000044.1| GENE 163 191463 - 191855 299 130 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566223|ref|ZP_04843677.1| ## NR: gi|253566223|ref|ZP_04843677.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 130 1 130 130 233 100.0 3e-60 MKTDIERLKERFANANTEAEIEAVDKEMKALADQDMDQFAEGLIECIKDTNKEADEILLR EKLESVLPFISVSALAKTYFKRSPQWFYQRLNGSIVNGKPIRFNDAELKTLAGALTDIGK KISQAAAFVF >gi|226332012|gb|ACIB01000044.1| GENE 164 191904 - 192122 225 72 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566224|ref|ZP_04843678.1| ## NR: gi|253566224|ref|ZP_04843678.1| predicted protein [Bacteroides sp. 3_2_5] # 1 72 1 72 72 103 100.0 3e-21 MEYLLNNNNVIIFVVSKNKSNMGEKPVSKERIKLEKDLLFYLRYYKELQDRGHYKQELDY QIELLTKKLKEM >gi|226332012|gb|ACIB01000044.1| GENE 165 192137 - 192439 229 100 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301164546|emb|CBW24105.1| ## NR: gi|301164546|emb|CBW24105.1| putative endonuclease [Bacteroides fragilis 638R] # 17 100 1 84 84 162 98.0 5e-39 MPTIYKPKKREQKSNNMYDDARRKIYNSERWRRLRAWKMVNNPLCEVCWQKGLATPAEDV HHIVSFMTTNDPLQRKSLAYDYDNLMSLCKQCHQNIHNSK >gi|226332012|gb|ACIB01000044.1| GENE 166 192393 - 192722 178 109 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566226|ref|ZP_04843680.1| ## NR: gi|253566226|ref|ZP_04843680.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 109 1 109 109 197 100.0 2e-49 MRNRYKRNSYYPKVAEAIGKNYLKLRSLCCVEFDTFHGSLSREDIFQDTVLYVIQDVEAS LLESEEDIIKHFCYRYKMIAFQIIQDSKQLREIPYADYLQTQKEGTEEQ >gi|226332012|gb|ACIB01000044.1| GENE 167 192697 - 192882 57 61 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301164548|emb|CBW24107.1| ## NR: gi|301164548|emb|CBW24107.1| hypothetical protein [Bacteroides fragilis 638R] # 1 61 13 73 73 102 100.0 8e-21 MGNKRRSVRFDEHTWMLLKEVSEKMGVNMSVVIRSMVARSLREITDDSGNLILNEKQVQA K >gi|226332012|gb|ACIB01000044.1| GENE 168 192915 - 193193 185 92 aa, chain - ## HITS:1 COG:no KEGG:BF2464 NR:ns ## KEGG: BF2464 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 92 46 136 136 154 94.0 1e-36 METTIDSNGLGGFQTRQDRILCIRSQINRSSEELARINEKLGAKDTPSLEEWLRLSDIRN NLMVSIHRKEEELSRLTDSRRLDQPRRANYNY >gi|226332012|gb|ACIB01000044.1| GENE 169 193208 - 193759 478 183 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566229|ref|ZP_04843683.1| ## NR: gi|253566229|ref|ZP_04843683.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 183 1 183 183 351 100.0 1e-95 MLDLNTLRNRAYQNACDHGFHDRELSDNHCFMLVITELSEAVEADRKGRRADKAEFESVV SSNSDHMSEAFVDAFERLVKDTVEAELADTVIRMLDMAGLRGINLNGIFIVAYIVSRKKS FTENCYAIIKDIVNYKYTTEECLNYAIRQVFELAEFYDMDLEWHIEQKMKYNEHCGKMHG KKY >gi|226332012|gb|ACIB01000044.1| GENE 170 193779 - 194060 253 93 aa, chain - ## HITS:1 COG:no KEGG:BF2465 NR:ns ## KEGG: BF2465 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 91 1 91 93 171 96.0 7e-42 MKCEAEGKILVELPSTGGVTKDGKDWEKREYIMETSERYHSKMRFSVCSFDGPVENPPKV GNKIRVNFTVEAREYKGNWYNEVRAHRTENIEC >gi|226332012|gb|ACIB01000044.1| GENE 171 194256 - 195197 443 313 aa, chain - ## HITS:1 COG:no KEGG:BF2466 NR:ns ## KEGG: BF2466 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 313 1 313 313 592 99.0 1e-168 MFDKITIKATIDTADIETIVLRNYLEECTEGDEVYYKSTAYANFDGCFIEVRGNRLRCTC SICKLYSKGKTGKLDNSRPITFAIAVRTIKELLLRLCVRIENAVVTYYEIGITMKMSLPA DSYIKQMYEVSGKLLWNDANYSAFKQQTTEKSKYFRKILKVYDKSFEAGEKGRNVGANIL RIETIYKHQSVSLMELTDNLFLSRIGRIFYKDWSEICFTRELSAAKGVKVSQLERAREIY RIGVTRYKERYKKLYLSGKLTKKQWETIRNFARSWPEEREKYVEEIGDMEREFKDKLLSG YQTGIFTPICRKI >gi|226332012|gb|ACIB01000044.1| GENE 172 195866 - 197077 264 403 aa, chain - ## HITS:1 COG:no KEGG:BF2468 NR:ns ## KEGG: BF2468 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 403 25 427 427 815 99.0 0 MLAWAKTDCLEHKGFATKSRVVCMDCGQRFSPDIVRRKLAVCPHCGAKLKVEQSRCTTDK QSRYVAIAEIHGEFQVIRNFEIRAYYKAGAVPKYFINEVLQHWIRQDGKNTVVALNHTVN WYCDSWGGDMEIRVEHRRGYYSSGVRYDIYPSRLHPDSEFRPDIGRYGIDHRLQGLTPLE AINMIPDNPKMETLLKARRYELLGYASNEKYKIERYWPSIKICLRNKYRIKDVKIWFDYL DLLRYFHKDLHNAHYVCPDNLKKEHDKLVIKKRQLQEKEEAERKRKRAIEDEAKFRALKA KFFGLRFTDGFIEVRVLESVREVMEEGDALHHCVFTNNYYLKPESLILSARIGDKRIETI EVDLKTLNVVQSRGACNQNTEYHDRIIGLVKKNTRLIKQKLAS >gi|226332012|gb|ACIB01000044.1| GENE 173 197153 - 197572 277 139 aa, chain - ## HITS:1 COG:no KEGG:BF2469 NR:ns ## KEGG: BF2469 # Name: not_defined # Def: hypothetical protein # 
Organism: B.fragilis # Pathway: not_defined # 1 139 1 139 139 219 99.0 2e-56 MKDNSFQAAIKSYLDERAKADELFAKAYNKENKSIDECCSYILGEAKKRGNAVAISDAEV FGMAVHYYDEDNIKVEKIPANTGSSVSGLSASTVLTEEDKEKAREAALRRLEEEQYALLK KKPTRAKKEIIEVQQMSLF >gi|226332012|gb|ACIB01000044.1| GENE 174 197641 - 198396 508 251 aa, chain - ## HITS:1 COG:PSLT059 KEGG:ns NR:ns ## COG: PSLT059 COG0863 # Protein_GI_number: 17233429 # Func_class: L Replication, recombination and repair # Function: DNA modification methylase # Organism: Salmonella typhimurium LT2 # 40 233 26 197 226 73 30.0 5e-13 MKDIELFNNHFQNYKVYGIPKAQLIIADVPYNLGNNAYASNPSWYVDGDNKNGESDLAGK EFFDTDKDFRPAEFMHFCSQMLMKEPKEKGKAPCMIIFCEFEDQFRYIELGKRYGLNNYI NLVFRKDFSAQVLKANMKIVGNCEYGLLLYRDKLPKFNNDGRMIFNCFDWVRDGETPKVH PTQKPVPLLRRLIEIFTDKGDVVIDPCAGSGSTLLAAAQLGRKAYGFEIKKQFFADANKL ILSRIQQSLFV >gi|226332012|gb|ACIB01000044.1| GENE 175 198396 - 198770 381 124 aa, chain - ## HITS:1 COG:no KEGG:BF2471 NR:ns ## KEGG: BF2471 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 9 124 1 116 116 191 100.0 5e-48 MQEINKENMIMKPKKQLIETAVKDGSIDRMNMLLSAAHLLNCEANSLIEEASDVMLAKGL LLGNLKKLHNDFVKCADRYFREFATLVTTDKSKMDMFGDLDGFDKSFREWAKVSADWEPK KEVE >gi|226332012|gb|ACIB01000044.1| GENE 176 198743 - 198955 135 70 aa, chain - ## HITS:1 COG:no KEGG:BF2407 NR:ns ## KEGG: BF2407 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 69 1 69 69 98 82.0 8e-20 MKVIHVYLIFKKKNYYFGSLSAIFEHLDENDIGIKKRTLLHRSDESTILTDRAIIIKSTL LRCRKSTKKI >gi|226332012|gb|ACIB01000044.1| GENE 177 198978 - 199418 248 146 aa, chain - ## HITS:1 COG:no KEGG:BF2472 NR:ns ## KEGG: BF2472 # Name: not_defined # Def: putative recombination protein # Organism: B.fragilis # Pathway: not_defined # 2 146 1 145 145 261 100.0 4e-69 MMWKKKTTDLKKKSPNLKNKLDTVFSRFIRLRDARKDGTFQCISCGRILPLDQADCGHYI NRQHMSTRFSEKNCNAQCRSCNRFDEGNMQGYRRGLILKYGEPAVLLLESMKTQTNKISD FEYSAMIKYYQGEVKRLKEEKQIRQI >gi|226332012|gb|ACIB01000044.1| GENE 178 199393 - 199743 
230 116 aa, chain - ## HITS:1 COG:no KEGG:BF2473 NR:ns ## KEGG: BF2473 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 116 1 116 116 224 100.0 8e-58 MKIQNFSIPPECRHASVEAVDNRLIITFEPENLSDFFCQETDHIEQTPRIGDLALFWDTA YRGSAIIARLIDEDRINGVQAYQAANDVWYENAIRFRSDEQYRLITQRHDVEKEND >gi|226332012|gb|ACIB01000044.1| GENE 179 199798 - 200226 266 142 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566240|ref|ZP_04843694.1| ## NR: gi|253566240|ref|ZP_04843694.1| predicted protein [Bacteroides sp. 3_2_5] # 1 142 1 142 142 264 100.0 1e-69 MKQKIEEVKGKINRYYSDFIEKCLEVHGIDLTTIISDCVTAGYESRSDEIIELRRELDSI EEMNSDGNKILDAIKRMAADDNKGLRMTTTIVDVKDDPRGSIVGFGTEKVCGDDALTQTM GLPGKYMACAFFIDREELKKYL >gi|226332012|gb|ACIB01000044.1| GENE 180 200245 - 200478 234 77 aa, chain - ## HITS:1 COG:no KEGG:BF2474 NR:ns ## KEGG: BF2474 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 77 2 78 78 136 97.0 3e-31 MKNMNSLSKHLLMVIISIVTVAGCIYAGNVEMNDDILSGMSFEKYQYIHDRIGDRATSSD VVKEYLRNRQFYDSIAY >gi|226332012|gb|ACIB01000044.1| GENE 181 200462 - 200776 225 104 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566242|ref|ZP_04843696.1| ## NR: gi|253566242|ref|ZP_04843696.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 104 1 104 104 176 100.0 4e-43 MSIKEILSSDSNLSVTIKSTDLKEFADHIIKQTIKEVLASNMKSDEEYLTVNETAKMLCV NRSTLWSWNKKGYLCPVEIGGKRRYKISDIDSILKNKRTDEEYE >gi|226332012|gb|ACIB01000044.1| GENE 182 200793 - 201350 341 185 aa, chain - ## HITS:1 COG:no KEGG:BF2475 NR:ns ## KEGG: BF2475 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 185 1 153 163 156 51.0 4e-37 MTTGTIIFLTLIAALVLVLGVAVIWQCCNIRNLIDESAKESRNDIYGTYSSLKHLLVKYI GESKKDLSPDSENASKSSLCPLDLDSTRSLRDCLEDICKFYGIPVHLLARGMQEAGKELN LKSESVSKSGLSPESATPNGTITPPSSLSDIELRKYCIEQTNKDQVYLRIEDAQRLYDYI LNGNQ >gi|226332012|gb|ACIB01000044.1| GENE 183 201383 - 201622 63 79 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301164567|emb|CBW24126.1| ## NR: gi|301164567|emb|CBW24126.1| hypothetical protein [Bacteroides fragilis 638R] # 3 79 1 77 77 130 98.0 2e-29 MTVEEKPNCIGNCRLCPDLCKCPPDHLHCEDCGIEIEPGEGISIEVEAIISGHPGTKMIT VCPVCFADHYQGDETIEFE >gi|226332012|gb|ACIB01000044.1| GENE 184 201629 - 201835 270 68 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|301164568|emb|CBW24127.1| ## NR: gi|301164568|emb|CBW24127.1| hypothetical protein [Bacteroides fragilis 638R] # 1 68 1 68 68 100 100.0 2e-20 METTNFVTKKSLTGTLANMSVKEVIEINIKDFKEYSIRNAAIKLKKKGYLFSVSSAGRID TTAVMRLK >gi|226332012|gb|ACIB01000044.1| GENE 185 201989 - 202378 259 129 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566245|ref|ZP_04843699.1| ## NR: gi|253566245|ref|ZP_04843699.1| predicted protein [Bacteroides sp. 3_2_5] # 1 129 1 129 129 213 100.0 2e-54 MTVKEKIKEFLDYKGISPTSAERELNWGVGAFTKPKSITVDRAKEFLLLYTDLSSEWLFR EIGEMIRPTINETKISPINVDGELTNTEMEKEIKRLRASIDALIEKNERLEAELAKYKEK ESLNKGLAS >gi|226332012|gb|ACIB01000044.1| GENE 186 202482 - 202997 277 171 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566246|ref|ZP_04843700.1| ## NR: gi|253566246|ref|ZP_04843700.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 171 7 177 177 346 100.0 3e-94 MEWIIMGLLVFIVAVILILKSDWQNEKDKILKSQEHISRRNELTESQQYYIGDKCIGLSA RKGYGKFPIAGAYYRDLPITMVGKFNGYAIAQTDNEYDQYAIAVYNDAGIHIGFLPRGNK KQHSYIIDEGEDKRVHAYGYLAWHGSGMYGEVCVETDKNAVTKRNKPYITD >gi|226332012|gb|ACIB01000044.1| GENE 187 203088 - 203267 202 59 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSDEELRIRCIELSVECFSWFKGNEHGIKGTPIALADLMYQFVKTGKSPDAKPYYPQPL >gi|226332012|gb|ACIB01000044.1| GENE 188 203450 - 204703 381 417 aa, chain + ## HITS:1 COG:no KEGG:BT_2451 NR:ns ## KEGG: BT_2451 # Name: not_defined # Def: putative pyrogenic exotoxin B # Organism: B.thetaiotaomicron # Pathway: not_defined # 102 405 91 401 426 137 31.0 1e-30 MNKNLKNKLYVIIFTSLLFYSCGENNVIASFDHSTKAESFKDISYKIPVSDAVFTVLDVI NSTKGSTRNAYSAEIENIEVVKTTPAITRSLKANSLSSDTLLYIINFKDGGFGVAAADSR TAPIYAYSDKGHFNLKDTCQVLALKMFIKSAIHTILYDINNQGNNPRFAETPETRNELLE QVGPFMDIEWSQNNPYNKECVIGTEYAKAGCVAIATAQICAYNKYPNTFEGYNYDWNTIY KIKSSSDQYKYPDATSQLAHFIRRVGLNVGMKYGVKESGAKSEKIPGLLRKMGYTCSDLI SYSDKGLVESLKAGHPVYQCGFDKESDYFIFQTHSDGHAWVVDGYRYEMLNIRICRPRRG EMDCDSEKRRFLFVRNNYGWGGLYNGWYQPFVTMPDTNGKRIPTFAFKPRMITNIHR >gi|226332012|gb|ACIB01000044.1| GENE 189 204700 - 205356 284 218 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566248|ref|ZP_04843702.1| ## NR: gi|253566248|ref|ZP_04843702.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 218 1 218 218 395 100.0 1e-108 MKKIYILLLSLLTLCSCEEQKITGALFSKSFLQGFIDIKEIVSGNISNNDELYFMFQGEN IARGNPQYDELCTKYGDVSYNRYMVPFSNPCLVDTITSLDLICNTDFDNNHKKGSSLNDI AILYYSSPLEFIKSGYKEYPKTEDTTPAPYRPSKEYYPYSKSFSDLKGEELILLYQMGYI KFLVHPLQSRQSITFRVQTKSGKTISTDFELNFAPTRK >gi|226332012|gb|ACIB01000044.1| GENE 190 205855 - 206892 1045 345 aa, chain + ## HITS:1 COG:FN0776 KEGG:ns NR:ns ## COG: FN0776 COG2502 # Protein_GI_number: 19704111 # Func_class: E Amino acid transport and metabolism # Function: Asparagine synthetase A # Organism: Fusobacterium nucleatum # 10 345 3 327 327 355 52.0 8e-98 MSYLIKPQNYKPLLDLKQTELGIKQIKEFFQLNLSSELRLRRVTAPLFVLKGMGINDDLN GIERPVSFPIKDLGDAQAEVVHSLAKWKRLTLADYHIEPGYGIYTDMNAIRSDEELGNLH SLYVDQWDWERVITAEDRNADFLKEIVNRIYAAMIRTEYMVYEMYPQIKPCLPQKLHFIH SEELRQLYPDMEPKCREHAICKKYGAVFIIGIGCKLSDGKKHDGRAPDYDDYTSKGLNDL PGLNGDLLLWDDVLQRSIELSSMGIRVDKEALLRQVKQENQEQRLELYFHKRLLNDTLPL SIGGGIGQSRLCMFYLRKAHIGEIQASIWPEEMRRECTALNIHLI >gi|226332012|gb|ACIB01000044.1| GENE 191 206899 - 207561 471 220 aa, chain + ## HITS:1 COG:PA0750 KEGG:ns NR:ns ## COG: PA0750 COG0692 # Protein_GI_number: 15595947 # Func_class: L Replication, recombination and repair # Function: Uracil DNA glycosylase # Organism: Pseudomonas aeruginosa # 3 220 8 226 231 277 60.0 8e-75 MNVQIEESWKTHLEPEFEKDYFRTLTEFVRSEYSQYQIFPPGKLIFNAFNLCPFDKVKVV IIGQDPYHGPGQAHGLCFSVNDGVAFPPSLVNIFKEIKEDIGTPAPSTGNLTRWAEQGVL LLNATLTVRAHQAGSHQRRGWEEFTDAAIRVLAEERENLVFILWGSYAQKKGAFIDRNKH LVLSSAHPSPLSAYNGFFGNKHFSKTNEYLKAHGKTEINW >gi|226332012|gb|ACIB01000044.1| GENE 192 207672 - 210410 2302 912 aa, chain - ## HITS:1 COG:no KEGG:BF3817 NR:ns ## KEGG: BF3817 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 912 1 912 912 1692 99.0 0 MTPLKANTIISSILLLILLMLPYEAMAQRRRVRELKGPVVYSPPKNDSLRVDTLKNDTLK ANVLKADTLSAESVKKKKQPLDAPVVYSANDSIVFTQGGFAHLYGDGKVNYENIELGAEI ITMNMDSSTVYARGVIDSVGVEKGKPVFKDGETPYETKAIRYNFKSKKGFINNVVTQQGE GYVVGNNAKKGANDELFMENGRYTTCDHHDHPHFYMQLTKAKVRPKKNVVTGPAYLVVED 
VPLPLAVPFFFFPFSSSYSSGFVMPSYMDDSSRGFGLTGGGYYFAVSDLMDLKLTADIFT KGSWAANMETNYNKRYKYSGSLQAGYQVTKLGDKGMPDYSVAKDFKIVWNHRQDQKASPN STFSASVNFATSSYERTNINNLYNSQLMTQNTKTSSVSYSRSFPDQKLTLAGTFNIAQTM RDSSIAVTLPDLNITLSTIFPFKRKRAVGEERWYEKISLRYSGRLTNSLKTKDDRLFKAG FREWENAMQHNIPVQATFTLFKYLQVVPSFNYTERWYTRKVMKSYDETTRKWDTHPGDTI HGFYRVFNYSASLALSTKMYGMYQPLFMKKKEIQIRHVFTPQISLNGSPSFGQFWEYYRD ADGNDQYYSPYSSQQFGTAPREKSGTVSFDVSNNLEMKYRNKKDSLVKVSLIDEIGFNLS YNMAAKKQPWSNLNMRLRMKFKKFNNYTLNMNAVFATYAYTFDKSGNVIVGDRTEWSYGR FGRFQGWGSSINYTFNNDTWKKLFGKDKDNDQKKKKDGADEEKGNSTGDEAVTEKRVEKA QADRDGYQVFKMPWSLNVNYSFRISEDRSKPINRNTMRYPFKYTQNINMSGNLKISNNWS FTFNSGYDFEAKEITQTSCTITRDLHCFNMSASISPFGRYRYYNFTIRATASILRDLKWD KRSQTQSNIQWY >gi|226332012|gb|ACIB01000044.1| GENE 193 210502 - 211038 506 178 aa, chain - ## HITS:1 COG:MJ0778 KEGG:ns NR:ns ## COG: MJ0778 COG1418 # Protein_GI_number: 15668959 # Func_class: R General function prediction only # Function: Predicted HD superfamily hydrolase # Organism: Methanococcus jannaschii # 14 161 16 149 169 61 31.0 8e-10 MKVLDLIDKYYPQDNELKHILNVHSRSVADKALWIAGKHPELNLDTVFLEEAAMLHDIGI FLTHAPGIQCFGTEPYICHGYLGAGLVRKEGFPRHALVCERHTGAGLSLKDIMDQKLPVP HREMLPVSMEEQVICFADKFFSKTHLDREKTVEGARKSIAKYGDEGLQRFNNWCKLFL >gi|226332012|gb|ACIB01000044.1| GENE 194 211474 - 213780 1936 768 aa, chain + ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 34 618 30 608 757 352 34.0 1e-96 MTKPSYKQWIAACTLTSTLLLGACGPAPVTRDASIVPLPNQIQQSGNAFVLTPNTTIGTT DPELQPAAQYLKEILSAATGYDLQVKEGKGTITLAKANIEGKEGAYTLSAKSDRIDITGN SYGGVIAGIESLRQLFPPQIESKQIVDSVAWAIPTAEIQDAPRFEWRGIMLDVSRHFYTK EEVKELLDLMALYKMNKFHWHLTDDQGWRIEIKKYPLLTEKGAWRTFNSHDRSCMKSAKS EDNPDFLIPENKLRIVEGDTLYGGYYTQEDIKEVIEYAKVRGIDIVPEIDMPGHMLAAVS NYSGVACTDKVGWGNTFSSPVCPGKESAMEFCKNVYSELIDLFPYKYVHIGGDEVEKANW KKCPDCQKRMRDNHLKTEEELQSWFIHDMEKFFNAKGKEMIGWDEIIEGGLSPTATVMWW 
RSWAKDAPAKTTQQGNSIIFTPNGQFYLDYQEDKNSVRNIYNFNPAIEGLTSEQQALVKG VQGNIWCEWIPSRERMQYMAVPRLLAIAELGWSQPSQKNWNDFAQRMANQFERLNIMGIN YRIPDLEGFHRNNAFIGEGTVKVTCLDPNAEIHYTTDGSTPTLQSPKYEGPIQVKETTDF TFRTFRPNGKAGDISRTRFIKSEYAPATTVTPSAKGLQAEWYEFKGNKCADITKAPLNGT YPIEKVMIPEKAKGNIGLVIKGFINVPKDDIYTFALLSDDGSTLVIDGEQVIDNDGPHGP REVIGQKALSKGYHPIEVRYFDQNGGQLKMTVTGTEGNEIPTSDLYAN >gi|226332012|gb|ACIB01000044.1| GENE 195 213999 - 216092 1243 697 aa, chain + ## HITS:1 COG:no KEGG:BF3612 NR:ns ## KEGG: BF3612 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 56 41 96 99 107 96.0 2e-21 MIKLKLSILVWAIGLSMTAFSQTTSSLRAKVLTLNDYPDALRLWELYNDSASVMDKATQL HAKVSLYYYFNRPDEMLQCVDSLLTLYPKECTTEQKLAYCYVKAEKLLEKGHYKKLNTWW KSLRKDKKLYREIEKQENFPCSEKAIQGLSDKDDFRMDFPESSSTVPTSYTYPLVLSVTI NGTTLPATIFDTGAPYTFLTKETATKCNVQCMGDTIPVKSMFGTSQATTGFVKTLQLGSI TFHNVTVHVSLLEKDPIFSGHDALLGLKELRGISALEFEFGKLTLKQKSLRSPLDPNMCF AETGCAFLFANGQNYLLDTGGEGSFSNTPDSVSTKVIDVNGYPVQFFNTYTTIPAAQKSG LLGFPFFSGFKICTLDFDRMNFSGEGYRLRKSYSELMNSGDMIGLDIEYERISKTTDEMG KWLTNASLEMMKNKPESCIQYTDSLLGKYQQELGGSIIYVLNLRAASLAYLGLYKEAGDL MKMCAQAVPDMINGYNKCMALTPFGAQQLSWEQPEVTLNTTFSEKGFLASAEINGNKNKL YFAPDQINSSISEADAGKLNMKIIEFEDHTTATGKKRMAIANELKLGNLLIKNVQFNLTE GNDIILGNSLLRLIPQFSIESQKLVLMQQVQSFTNAKQYPLLLINYTFCFRDPDDDTQKY SIGNPTPYTRKITLQDLCKSSGKIVFDMKDMKLLKIN >gi|226332012|gb|ACIB01000044.1| GENE 196 216130 - 218364 1859 744 aa, chain + ## HITS:1 COG:YPO2803 KEGG:ns NR:ns ## COG: YPO2803 COG1472 # Protein_GI_number: 16123001 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucosidase-related glycosidases # Organism: Yersinia pestis # 24 738 16 708 793 405 33.0 1e-112 MIMKKIITASMLSLLLGGTVQAQSLPVYLDDSKPIEDRIEDALSRITVEEKVALIHAQSK FSSPGVARLGIPEFWMTDGPHGIRPEVLWDEWNQAGWTNDSCVAFPALTCLAATWNPKAA LLYGQSIGEEARYRNKTVLLGPGVNIYRTPLNGRNFEYMGEDPYLASQMVVPYVKGVQQN GVAACVKHYALNNQEINRHTTNVIVDDRALYEIYLPAFKAAVQEGKAWAIMGSYNLYKNQ HNCHNRYLLNDILKGEWGFDGVVVSDWGGVHNTEEAIYNGMDMEFGSWTNGLSKGMGNAY 
DNYYLAHPYLKQIKEGKIGTKELDDKVRRILRLAFRTTMNRNRPYGAMLSEEHIAAARKI GEEGIVLLQNKKNILPIDLNRTKKIAVIGENALKMMTVGGGSSSLKVQYECSPLDGIKRR IGDGIEISYARGYVGDTGGQFDGVSSGQNLKDDRSARQLIEEAVRIAQSADYVIFIGGLN KSGHQDCEDTDRKGLELPYKQDKVIGALAKVNKNLIVVNISGNAVAMPWISEVPAVIQAW YLGTEAGNAIASILVGDVNPSGKLPFTFPEKLEDVGAHQLGDYPGRQREDGIFDEKYNES IFVGYRWTDKQKIRPLFPFGHGLSYTTFAYGKATVNKKVMKIDEQIAITVPITNTGKRIG SEIVQLYISDLKSSLPRPVKELKGFSKIQLAPGETQEVTFLIDKQALSFFNDSRHEWVAE PGKFEAQIAASATDIKSKVTFELE >gi|226332012|gb|ACIB01000044.1| GENE 197 218711 - 219166 245 151 aa, chain - ## HITS:1 COG:L69304 KEGG:ns NR:ns ## COG: L69304 COG0454 # Protein_GI_number: 15673990 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Lactococcus lactis # 2 151 3 151 152 167 51.0 4e-42 MDIISLRKSPQYLEEAIAYFQRKWADENSRMVYDNCFRTCLESESPLPQWYLLMNGEGEI IGCAGLATNDFNSRMDLYPWLVALFIEEQYRGHNYGNLLIKAVEEDTRRLGFGNLYLCTT HTGYYEHFNFVYIGDCYHPWGEHTRVYQKEI >gi|226332012|gb|ACIB01000044.1| GENE 198 219184 - 220101 717 305 aa, chain - ## HITS:1 COG:no KEGG:BF3822 NR:ns ## KEGG: BF3822 # Name: not_defined # Def: putative sodium-dependent transporter # Organism: B.fragilis # Pathway: not_defined # 1 305 1 305 305 558 100.0 1e-157 MLKFLKNWTLPIAMLVGAVGYPVFISFSFLTPILIFTMLLLTFCKVSPRDLKPKPLHLWL LLIQIFGSLIVYLLLYRFNKIVAEGAMVCVICPTATAAAVITSKLGGSAASLTTYTLIAN IGAAIAVPILFPLIEANPGISFIDAFLVILSKVFPLLICPFLAAWFLQRFIPKVHKVLLG YHELAFYLWGISLAIVTAQTLYSLINDPADGLTEIMIAVVALIACCLQFFLGKTLGSIYN DRISGGQALGQKNTILAIWMAHTYLNPLSAVGPGSYVLWQNIINSWQLWKKRKKESEKCS NNVSG >gi|226332012|gb|ACIB01000044.1| GENE 199 220209 - 221453 983 414 aa, chain + ## HITS:1 COG:aq_1681 KEGG:ns NR:ns ## COG: aq_1681 COG0860 # Protein_GI_number: 15606778 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: N-acetylmuramoyl-L-alanine amidase # Organism: Aquifex aeolicus # 30 255 130 353 359 124 36.0 2e-28 MELRRPHILYILICLWLLVSPSNVSSVWAKDFVVVIDAGHGGHDPGAIGKISKEKNINLK 
VALKLGNLIKQNCNDVKVVYTRSKDVFIPLDRRAEIANNAKADLFISIHTNALANNRTAK GASTWTLGLAKSDANLEVAKRENSVILYEDDYKTRYAGFNPNSAESYIIFEFMQDKYMEQ SVHLASLVQKQFRHHCKRVDRGVHQAGFLVLKASAMPSILVELGFISTPEEERYLNTEEG SSTLAKGIYRAFLSYKREHEIRLTGSSRTALPNDDEVTDTEVAQIDSTESENKKPQNTPR TDKLVTEAKTQRPIVVESTTNDSEITFKIQILTSSRPLSKNDKRLKGLKDVDYYKENGLY KYTYGASSDYNKVLRTRRNTVTPLFKDAFIIAFRNGEKMNINEAIANFKKRRNK >gi|226332012|gb|ACIB01000044.1| GENE 200 221473 - 222366 759 297 aa, chain + ## HITS:1 COG:no KEGG:BF3824 NR:ns ## KEGG: BF3824 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 297 1 297 297 557 100.0 1e-157 MKLITKEVRIGIAGVAALCLLVFGINYLKGINMFKPASYFYVKFHNVNGLAQSSPVFADG VRVGIVRDIAYDYNQPENVIVEVEVDTDLRIPKGSSAELVPELMGGVRMNILLANNPRER YTVGDTIPGTLNNGMMEKVAAMMPAVEKMLPKLDSILTSLNTIMADQSIPATLHSIEKTT ANLEVTSRQLKVLMNNDIPQLTGKLNTIGDNFVVISGNLKEIDYAATFKKIDATLSNVKM LTEKLNSKDNTVGLLLNDPQLYNNLNQTTINAANLLEDLKEHPKRYVHFSLFGKKDK >gi|226332012|gb|ACIB01000044.1| GENE 201 222715 - 224145 995 476 aa, chain + ## HITS:1 COG:BS_dnaA KEGG:ns NR:ns ## COG: BS_dnaA COG0593 # Protein_GI_number: 16077069 # Func_class: L Replication, recombination and repair # Function: ATPase involved in DNA replication initiation # Organism: Bacillus subtilis # 9 467 7 446 446 293 34.0 7e-79 MSESSHVGLWNRCLEIIRDNVPESTYKTWFVPIVPLKYEDKTLIVQVPSQFFYEFLEDKF VDLLRKTLYKVIGDGTKLMYNVLVDKSSGATVNQESTTRSTAIPQSGLPRVDERKAPGLL RAPAVQDLDPHLNPNYNFETFIEGYSNKLSRSVAEAVAENPAKTVFNPLFLHGASGVGKT HLANAIGTRIKELYPDKRVLYVSAHLFQVQYTDSVRNNTTNDFINFYQTIDVLIIDDIQE FAGVTKTQNTFFHIFNHLHQNGKQLILTSDRAPVLLQGMEERLLTRFKWGMVAELEKPTV ELRKNILRNKIHRDGLQFPSEVIDYIAENVNESVRDLEGIVISIMAHSTIYNKEIDLDLA QRIVRKVVHCETKAVTIDDIINVVCKHFDLESSAIHTKSRKREVVQARQVAMYLAKTHTD FSTSKIGKFIGNKDHATVLHACKTVKGQCEVDKGFRSDLENIETLLKKRNVSNGER >gi|226332012|gb|ACIB01000044.1| GENE 202 224233 - 225012 624 259 aa, chain - ## HITS:1 COG:BH1048 KEGG:ns NR:ns ## COG: BH1048 COG0778 # Protein_GI_number: 15613611 # Func_class: C Energy production and conversion # Function: Nitroreductase # 
Organism: Bacillus halodurans # 12 204 4 195 244 111 33.0 1e-24 MIILFISLQKKDMMDTVKNRRTIRKYQQKDITPDLLNDLLETSFRASTMGGMQLYSVVVT RDAEKKEMLSPAHFNQPMVKEAPVVLTFCADFRRFCKYCQERNAVPGYGNLMSFLNAAMD TLLVAQTFCTLAEEAGLGICYLGTTTYNPQMIIDALHLPELVFPITTVTVGYPAESPKQV DRLPIEGIIHEESYHDYTAEDINRLYAYKESLPENKLFIEENQKETLAQVFTDVRYTKKD NEFMSENLLKVLRRQGFMD >gi|226332012|gb|ACIB01000044.1| GENE 203 225183 - 227804 2085 873 aa, chain + ## HITS:1 COG:AF1664 KEGG:ns NR:ns ## COG: AF1664 COG0209 # Protein_GI_number: 11499254 # Func_class: F Nucleotide transport and metabolism # Function: Ribonucleotide reductase, alpha subunit # Organism: Archaeoglobus fulgidus # 51 873 7 752 752 271 29.0 5e-72 MQARLLMISKLLHIYLLIKKSKTHCIVEKQIYSYEEAFEESLRYFQGDELAARVWVNKYA VKDSFGNIYEKSPKDMHWRLANEVARIEAKYPNALSSEQLFELFDHFKYIVPQGSPMTGI GNDYQVASLSNCFVIGIDGSADSYGAIIKIDEEQVQLMKRRGGVGHDLSHIRPKGSPVKN SALTSTGLVPFMERYSNSTREVAQDGRRGALMLSVSIKHPDSEAFIDAKMTEGKVTGANV SVKLDDAFMSAAVEGRKYTQQYPIDSDHPTTVKEIEASNLWKKIVHNAWKSAEPGVLFWD TIIRESVPDCYADLGYKTVSTNPCGEIPLCPYDSCRLLAINLYSYVVNPFTKDAYFDFDL FHKHVALAQRIMDDIIDLELEKIERIIEKIDQDPENEEVKHTERGLWEKIYKKSGQGRRT GVGITAEGDMLAALGMRYGTEEATEFSEKVHKAVALGAYRSSVDMAKERGAFDVYDSERE KNNPFINRLREADPALYEDMKKYGRRNIACLTIAPTGTTSLMTQTTSGIEPVFLPVYKRR RKVNPNDTNVRVDFVDETGDAFEEYIVFHHKFVTWMEANGYDPAKRYTQEEIDELVAKSP YYKATSNDVDWLMKVRMQGKIQKWVDHSISVTINLPNDVDEELVNRLYVEAWKSGCKGCT VYRDGSRSGVLISAKSDKDKKEELPPCKPPTVVEVRPTVLEADVVRFQNNKEKWVALVGL LDGRPYEIFTGLQDDDEGIIIPKSVNTGRIIKNVDENGNKRYDFQFENKRGYKMTIEGLS EKFNKEYWNYAKLISGVLRWRMPIEQVIKLVGSLQLDSENINTWKNGVERALKKYVQDGT EAKGKKCPNCGNETLVYQEGCLICTTCGASRCG >gi|226332012|gb|ACIB01000044.1| GENE 204 228002 - 230704 2391 900 aa, chain + ## HITS:1 COG:L94405 KEGG:ns NR:ns ## COG: L94405 COG1640 # Protein_GI_number: 15672678 # Func_class: G Carbohydrate transport and metabolism # Function: 4-alpha-glucanotransferase # Organism: Lactococcus lactis # 406 896 3 487 489 448 46.0 1e-125 MIVTFHIEYRTSWGEEVRILGSVPELGKNNPEQAVALTTVDGIHWSNEISIQLPAEGVVE 
YSYHIYRDGKAIRTEWNSFPRRIYLPADVKKSLRINDCWKNLPEQQYFYSSAFTEALLAH RERSAIPKSYKKGLMIKAYAPRINSKYCLAICGNQKTLGNWDPDKAWPMSDANFPEWQAE LDASKLEFPLEYKFVLYNKEEKRAEAWENNPNRYLAAPEIKANETLVIADRYAYFNIPSW KGAGVAVPVFSLKSEKSFGVGDFGDLKRMVDWAVSTSQKVVQILPINDTTMTHTWTDSYP YNSISIYAFHPMYADLKQMGNLKDKETAAAFNRKQKELNALSAIDYEAVNRVKWEYFHQI FKQEGEKVLDSKAFRSFFEANKDWLQPYAVFSYLRDLYHTPNFREWPQYSEYNAQEIEEL CRPDTADYAHIAIYFYIQFNLHLQLLEATTYAREHGVVLKGDIPIGISRNSVEAWTEPYY FNLNGQAGAPPDDFSINGQNWGFPTYNWDVMENDGYKWWMKRFQKMAEYFDAYRIDHILG FFRIWEIPMNAVHGLLGQFVPALPMSREEIESYGLSFREEFLRPYIHEYFLGQVFGPHTD YVKQTFIEPTETYEVYRMRPEFDTQRKVEAFFAGKNDEDSIWVRDGLYALISDVLFVPDR KDPNLYHPRIGVQHDFIYRALNDWEKTAFNRLYDQYYYHRHNDFWQQQAMKKLPQLTQST RMLVCGEDLGMIPDCVAWVMNDLRILSLEIQRMPKNPAEEFGRLNEYPYRSVCTFSTHDM STLRGWWEEDYQQTQRYYNQMLGHYGTAPAIATPELCEEVIRNHLYSNSILCILSLQDWL SMDGKWRNPNVQEERINIPANPRHYWRYRMHLTLEQLMKAESLNEKIRELVKQTGRNPEK >gi|226332012|gb|ACIB01000044.1| GENE 205 230834 - 231865 716 343 aa, chain + ## HITS:1 COG:CAC3042 KEGG:ns NR:ns ## COG: CAC3042 COG3594 # Protein_GI_number: 15896293 # Func_class: G Carbohydrate transport and metabolism # Function: Fucose 4-O-acetylase and related acetyltransferases # Organism: Clostridium acetobutylicum # 7 279 2 268 337 68 25.0 2e-11 MGDDKNKRIDFVDLTKGVCIILVVMAHIGGAFEKLDYHSMIASFRMPLYFFISGIFFKSY EGLFGFFIRKINKLIIPFLFFYLSAFFLKYIVWKIAPGVFQLPVSWTELLVVFHDHALIK FNPPIWFLLALFNCNILFYLVHSLRNRRLGLMFALTLLIGTAGFYMGKHQIELPLYMDVA MSALPFYVAGFWIRRYNFFLFPHRFDKLIPLCILAALAVMYFTATFVGMRTNNYAGNIFQ FWPSAFAGIFMIMLFCKKFKKLPVISYMGRYSVITLGIHAPLLHFEYPVVSRFIHNEWGQ AIALLLLTLTVCIIATPIFLKLIPQAVAQKDFIKTKQSTQQGS >gi|226332012|gb|ACIB01000044.1| GENE 206 231916 - 232284 345 122 aa, chain + ## HITS:1 COG:lin0257 KEGG:ns NR:ns ## COG: lin0257 COG1539 # Protein_GI_number: 16799334 # Func_class: H Coenzyme transport and metabolism # Function: Dihydroneopterin aldolase # Organism: Listeria innocua # 7 121 4 119 124 88 41.0 3e-18 MTTQHYIFLENVRFYSYHGVAPQETAIGNEFIINLRLKTDFGKATETDEVEDTVSYADIY ATLKEEMELPSKLLEHVCGRIVKRLFRDFRKIREIEIKLAKRNPPMGADIDSAGVEMHCT 
RD >gi|226332012|gb|ACIB01000044.1| GENE 207 232308 - 233729 1102 473 aa, chain + ## HITS:1 COG:no KEGG:BF3831 NR:ns ## KEGG: BF3831 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 473 1 473 473 955 99.0 0 MKKLLLILLLVLAGEISAQQAATIVVKTTPDNEIAYWPTGKEHLFCIALGTKAQAGPLGE FVHQFRTDRPGMVQVWTQGSDSFTLYLTPGSKDTITVTKDTLIISGTNSAYNRCLKTVND YQKYSDKLVYMQPHELRGITSLEQYHRLADARMRQALDAVNASGLNEEFLAEQRAHIDYI RRSIFIHIARQLSRKEKLPEDWQRELTEVINSSVNGDYLRSYRGIGFFVNDLVMMQFTNL ENGDLKEIKDYASFLFDRYRKFFTGDNLQYMQAQLIYEDEFQGSKTPSIPQLYETYRAEF PNSPFLNVLEPGVKENLRFQNSRITDKDYHILTCDSTITSLEDAVKPFKGKVVYIDVWAT WCGPCLKEFQYLPALKEKAHNMDVVYLYISIDRPEERKKWEKTIAYHQLKGYHLLVNEKL GKSLYTELGNERQILSIPCFVIIDKTGKIVIRHAAAPSEPEKVIEQLSTYYNK >gi|226332012|gb|ACIB01000044.1| GENE 208 234294 - 234812 533 172 aa, chain - ## HITS:1 COG:TM1185 KEGG:ns NR:ns ## COG: TM1185 COG1803 # Protein_GI_number: 15643941 # Func_class: G Carbohydrate transport and metabolism # Function: Methylglyoxal synthase # Organism: Thermotoga maritima # 6 164 16 166 166 138 50.0 4e-33 MEKLIRRIGLVAHDAMKKDLIEWVLWNSELLMGHKFYCTGTTGTLIQEALKEKHPDVEWD FTILKSGPLGGDQQMGSRIVDGEIDYLFFFTDPMTLQPHDTDVKALTRLASVENIVFCCN RSTADHIISSPLFLDPDYERTHPDYSGYTKRFENKPVVTEAVESVKKRKRKK >gi|226332012|gb|ACIB01000044.1| GENE 209 234817 - 235848 818 343 aa, chain - ## HITS:1 COG:CAC2327_1 KEGG:ns NR:ns ## COG: CAC2327_1 COG1216 # Protein_GI_number: 15895594 # Func_class: R General function prediction only # Function: Predicted glycosyltransferases # Organism: Clostridium acetobutylicum # 3 249 6 251 378 134 34.0 2e-31 MNKISVVILNWNGCEMLRSFLPSVLRYSEAEGVEVCVADNGSTDQSVEMLRREFPSVRRI LLDGNHGFADGYNLALRQVEAEYVVLLNSDVEVTGQWLQPMAAYLDAHPEVAACQPKIRS WRQKEWFEYAGAAGGFIDRYGYPFCRGRVMGVVEADRGQYDTVLPIFWATGAAMFIRLAD YREAGGLDGRFFAHMEEIDLCWRLRARGRGIVCIPQSVVYHVGGATLKKENPHKTFLNFR NNLVMLYKNLPDRELAGVMRVRCWLDYIAAATFALKGQLPNAKAVVCARREFKLLRDSFR DARAENLRKTSSHFIPERIKSSILVQFYVKGRKFFSQLSDLKG >gi|226332012|gb|ACIB01000044.1| GENE 210 235841 - 236737 905 298 aa, chain - ## 
HITS:1 COG:XF1348 KEGG:ns NR:ns ## COG: XF1348 COG1560 # Protein_GI_number: 15837949 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lauroyl/myristoyl acyltransferase # Organism: Xylella fastidiosa 9a5c # 16 294 16 291 316 73 24.0 7e-13 MKSKLLYLLTYVGMWLLAVLPFPVLYALSDFIYFWLYHVIGYRRKVVRTNLKNSFPEKSA AELKAIERKFFHYLCDYMLEDVKMLRMSEKELCKRMTYENKETYLRMIDERGGIVLLIPH YANFEWITGMGMIMRPGDVPVQVYKPLKDVYLDGLFKYIRARFGGYNVPKHSTAREVIKL KRAGKKMAIGLITDQSPNMHEAHYWTTFLNQDTVFMDGAERIAKMMDFPVFYCELRKERR GYCRVDFDLVTDRPKETADGEITEIFARRLEQTIRKEPAYWLWSHKRWKLKRPEKKNE >gi|226332012|gb|ACIB01000044.1| GENE 211 236740 - 238059 764 439 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|16079597|ref|NP_390421.1| hypothetical protein BSU25430 [Bacillus subtilis subsp. subtilis str. 168] # 11 421 3 421 451 298 38 1e-79 MIDTTVFQDKTAVYYTLGCKLNFSETSTIGKILREAGVRTARKGEKADLCIVNTCSVTEM ADKKCRQAIHRLVKQHPGAFVVVTGCYAQLKPGDVAKIEGVDVVLGAEQKKDLLQYLGDL HKHEGGEAYTTATKDIRSFAPSCSRGDRTRFFLKVQDGCDYFCSYCTIPFARGRSRNGTI ASLVEQARQAAAEGGKEIVLTGVNIGDFGKSTGETFFDLVKALDRVEGIERYRISSIEPN LLTDEIIEYVSRSRSFMPHFHIPLQSGSDEVLQLMRRRYGTELFASKIAKIKEVMPDAFI GVDVIVGTRGETAGYFEKAYEFIHGLDVTQLHVFSYSERPGTQALKIDHVVTPEEKHQRS QRLLALSDEKTKAFYARHIGQTMPVLMEKPKAGAPMHGFTANYIRVEVESDAALDNKVVN VLLGDFNEEGTALKGTITQ >gi|226332012|gb|ACIB01000044.1| GENE 212 238488 - 240161 1589 557 aa, chain + ## HITS:1 COG:aq_999_1 KEGG:ns NR:ns ## COG: aq_999_1 COG1022 # Protein_GI_number: 15606303 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Aquifex aeolicus # 33 554 20 503 600 234 30.0 4e-61 MIKENFIKLYENSFRENWDLPCYTNYGEPESYTYGEVAEEIAKLHLLFKHCSLRRGDKIA VIGKNNARWCIAYMATITYGAIIVPILQDFNPNDVHHIVNHSESVFLFTSDTIWENLEEE RLTGIRAVFSLTDFRCLHQRDGETVQKFLKHIDQYMTDTYPKGFRKEDVLYTTLSNDKVM LLNYTSGTTGFSKGVMLTGNNLAGNVTFGIRTELLKKGDKVLSFLPLAHAYGCAFDFLTA TAVGTHVTLLGKVPSPKIIMKAFEEVKPNLIITVPLVIEKIYKNVIQPIISKKGMKWALS IPLLDNQIYGQIRKKLIDALGGRFKEIIIGGAAMNPEVEEFFHKIKFPFTIGYGMTECGP 
LISYAPWDKFVPSSSGKILDIMEARIYKENPEAETGEIQVRGENVMTGYYKNPEATQEVF TKDGWLRTGDLGTMDDEGNIFIRGRLKTMILSSSGQNIFPEEIEAKLNNLPFILESLVIE RNKKLVALVYADYEALDSLGLNHEDNLKTIMDENLKNLNNNVAAYEKVSQIQLYPTEFEK TPKRSIKRYLYNSIAED >gi|226332012|gb|ACIB01000044.1| GENE 213 240320 - 240538 285 72 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKKLVLMFVAIAAVSFASCGNKAADAEKATADSIRIADSIAAVEAAAAEAAAQAADTIAA DTTVVTETVVAE >gi|226332012|gb|ACIB01000044.1| GENE 214 240862 - 241305 711 147 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|60683080|ref|YP_213224.1| 50S ribosomal protein L9 [Bacteroides fragilis NCTC 9343] # 1 147 1 147 147 278 100 2e-73 MEIILKEDVVNLGYKNDIVTVKSGYGRNYLIPTGKAVIASPSAKKMLAEELKQRAHKLEK IKKDAEALAAKLEGVSLTIATKVSSTGTIFGSVGNIQIAEELAKLGHEIDRKIIVVKDAV KEVGAYKAIVKLHKEVSVEIPFEVVAE >gi|226332012|gb|ACIB01000044.1| GENE 215 241317 - 241589 460 90 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53715145|ref|YP_101137.1| 30S ribosomal protein S18 [Bacteroides fragilis YCH46] # 1 90 1 90 90 181 100 2e-44 MAQQVQSEIRYLTPPSVDVKKKKYCRFKKSGIKYIDYKDPEFLKKFLNEQGKILPRRITG TSLKFQRRIAQAVKRARHLALLPFVTDMMK >gi|226332012|gb|ACIB01000044.1| GENE 216 241592 - 241936 577 114 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53715146|ref|YP_101138.1| 30S ribosomal protein S6 [Bacteroides fragilis YCH46] # 1 114 1 114 114 226 100 5e-58 MNQYETVFILTPVLSDVQMKEAVEKFKGILQAEGAEIINEENWGLKKLAYPIQKKSTGFY QLIEFNAEPTVIDKLELNFRRDERVIRFLTFRMDKYAAEYAAKRRSVKSNKKED >gi|226332012|gb|ACIB01000044.1| GENE 217 242098 - 242544 525 148 aa, chain + ## HITS:1 COG:FN2010 KEGG:ns NR:ns ## COG: FN2010 COG1846 # Protein_GI_number: 19705306 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Fusobacterium nucleatum # 25 148 17 141 160 65 32.0 3e-11 MIEQFNFDIRLIFAILNGKVSAAINRKLYRNFRQNGLEISPEQWTVLIFLWEKDGVTQQE LCNATFKDKPSMTRLIDNMERQHLVVRISDKKDRRTNLIHLTRTGKELEEKARIIANRTL KEALHGITVEELSVSQEVLRKIFFNTKD >gi|226332012|gb|ACIB01000044.1| GENE 218 242686 - 242862 274 58 aa, chain + ## HITS:0 COG:no KEGG:no NR:no 
MKGLLKNLGLLLILVGVIILIACSLTGEVNNNAVLGGSIVLVVLGLITYIAINKRIAD Prediction of potential genes in microbial genomes Time: Tue May 17 23:50:53 2011 Seq name: gi|226332011|gb|ACIB01000045.1| Bacteroides sp. 3_2_5 cont1.45, whole genome shotgun sequence Length of sequence - 57276 bp Number of predicted genes - 46, with homology - 46 Number of transcription units - 25, operones - 13 average op.length - 2.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 40/0.000 - CDS 39 - 740 876 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 2 1 Op 2 . - CDS 742 - 2301 1203 ## COG0642 Signal transduction histidine kinase - Prom 2403 - 2462 9.4 - Term 2513 - 2549 0.3 3 2 Tu 1 . - CDS 2691 - 4847 2228 ## COG0480 Translation elongation factors (GTPases) - Prom 4960 - 5019 2.6 + Prom 5604 - 5663 3.9 4 3 Op 1 . + CDS 5683 - 6816 949 ## COG0635 Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 5 3 Op 2 . + CDS 6829 - 7386 626 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 6 4 Op 1 . + CDS 7525 - 7962 279 ## BF3871 hypothetical protein 7 4 Op 2 . + CDS 7974 - 8852 754 ## COG3712 Fe2+-dicitrate sensor, membrane component 8 4 Op 3 . + CDS 8855 - 11533 2079 ## BF3642 hypothetical protein 9 4 Op 4 . + CDS 11540 - 12574 778 ## BF3874 hypothetical protein 10 4 Op 5 . + CDS 12612 - 12899 368 ## BF3875 hypothetical protein 11 4 Op 6 . + CDS 12896 - 13498 347 ## COG2431 Predicted membrane protein + Prom 13534 - 13593 5.2 12 5 Tu 1 . + CDS 13763 - 14116 437 ## BF3877 hypothetical protein + Term 14139 - 14196 8.5 13 6 Op 1 . - CDS 14241 - 15371 651 ## COG1672 Predicted ATPase (AAA+ superfamily) 14 6 Op 2 . - CDS 15412 - 16368 377 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase - Prom 16389 - 16448 4.3 15 7 Tu 1 . 
- CDS 16597 - 16803 88 ## BF2592 putative LPS biosynthesis related DNTP-hexose dehydratase-epimerase + Prom 17429 - 17488 7.8 16 8 Tu 1 . + CDS 17646 - 18209 381 ## BF3651 hypothetical protein + Term 18262 - 18309 0.2 + Prom 18249 - 18308 11.2 17 9 Op 1 . + CDS 18495 - 18992 345 ## BF1545 hypothetical protein 18 9 Op 2 . + CDS 19111 - 19317 151 ## BF1525 hypothetical protein + Term 19492 - 19531 1.6 - Term 19480 - 19518 1.4 19 10 Tu 1 . - CDS 19659 - 19862 240 ## BF3882 hypothetical protein - Prom 20000 - 20059 6.6 - Term 19990 - 20045 13.6 20 11 Tu 1 . - CDS 20080 - 21129 1175 ## BF3653 hypothetical protein - Prom 21149 - 21208 5.3 + Prom 21106 - 21165 5.5 21 12 Op 1 . + CDS 21210 - 22430 963 ## COG0809 S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) 22 12 Op 2 . + CDS 22434 - 22964 539 ## COG1443 Isopentenyldiphosphate isomerase - Term 22905 - 22947 1.3 23 13 Op 1 . - CDS 23083 - 23625 487 ## COG0386 Glutathione peroxidase 24 13 Op 2 . - CDS 23688 - 26033 1739 ## COG3537 Putative alpha-1,2-mannosidase - Prom 26217 - 26276 8.2 + Prom 26109 - 26168 3.8 25 14 Op 1 11/0.000 + CDS 26212 - 30531 3390 ## COG3696 Putative silver efflux pump 26 14 Op 2 . + CDS 30553 - 31440 719 ## COG0845 Membrane-fusion protein 27 14 Op 3 9/0.000 + CDS 31469 - 32530 677 ## COG3275 Putative regulator of cell autolysis 28 14 Op 4 . + CDS 32527 - 33306 594 ## COG3279 Response regulator of the LytR/AlgR family + Prom 33314 - 33373 3.7 29 15 Op 1 . + CDS 33395 - 34864 1366 ## COG4624 Iron only hydrogenase large subunit, C-terminal domain 30 15 Op 2 . + CDS 34845 - 35882 684 ## COG0502 Biotin synthase and related enzymes 31 15 Op 3 . + CDS 35895 - 37313 1365 ## COG1060 Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 32 15 Op 4 . + CDS 37310 - 38485 595 ## COG1160 Predicted GTPases + Term 38493 - 38538 7.6 - Term 38477 - 38530 11.9 33 16 Tu 1 . 
- CDS 38606 - 39802 798 ## BF3896 hypothetical protein - Prom 39938 - 39997 9.0 - Term 40154 - 40192 2.3 34 17 Tu 1 . - CDS 40222 - 42531 1970 ## COG1874 Beta-galactosidase - Prom 42711 - 42770 10.9 - Term 42714 - 42757 8.5 35 18 Op 1 . - CDS 42776 - 44164 1731 ## COG1109 Phosphomannomutase 36 18 Op 2 . - CDS 44201 - 44845 545 ## BF3899 hypothetical protein - Prom 44871 - 44930 6.8 37 19 Op 1 . - CDS 44988 - 46019 751 ## COG0618 Exopolyphosphatase-related proteins 38 19 Op 2 . - CDS 46071 - 47978 774 ## COG0658 Predicted membrane metal-binding protein - Term 48094 - 48133 5.1 39 20 Op 1 1/0.000 - CDS 48180 - 48830 648 ## COG0036 Pentose-5-phosphate-3-epimerase - Prom 48851 - 48910 5.7 40 20 Op 2 . - CDS 48994 - 49968 926 ## COG0223 Methionyl-tRNA formyltransferase - Prom 49990 - 50049 4.1 - Term 49996 - 50042 3.5 41 21 Op 1 . - CDS 50063 - 51856 1497 ## COG0038 Chloride channel protein EriC 42 21 Op 2 . - CDS 51853 - 52416 575 ## COG0009 Putative translation factor (SUA5) - Prom 52462 - 52521 6.2 + Prom 52364 - 52423 4.9 43 22 Tu 1 . + CDS 52496 - 52930 330 ## COG0824 Predicted thioesterase + Term 52935 - 52977 7.0 - Term 52923 - 52965 7.0 44 23 Tu 1 . - CDS 52980 - 55052 1308 ## BF3907 hypothetical protein - Prom 55150 - 55209 8.7 - Term 55570 - 55624 10.4 45 24 Tu 1 . - CDS 55771 - 56250 501 ## BF3909 putative non-specific DNA binding protein - Prom 56372 - 56431 5.5 - Term 56481 - 56532 16.1 46 25 Tu 1 . 
- CDS 56571 - 57155 504 ## BF3910 putative phage-related protein - Prom 57216 - 57275 1.7 Predicted protein(s) >gi|226332011|gb|ACIB01000045.1| GENE 1 39 - 740 876 233 aa, chain - ## HITS:1 COG:lin2728 KEGG:ns NR:ns ## COG: lin2728 COG0745 # Protein_GI_number: 16801789 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Listeria innocua # 6 226 3 221 225 145 38.0 5e-35 MDEKLRILLCEDDENLGMLLREYLQAKGYSAELYPDGEAGFKAFLKNKYDLCVFDVMMPK KDGFTLAQEVRAANAEIPIIFLTAKTLKEDILEGFKIGADDYITKPFSMEELTFRIEAIL RRVRGKKNKESNVYKIGKFTFDTQKQILAIGDKQTKLTTKESELLGLLCAHANEILQRDF ALKTIWIDDNYFNARSMDVYITKLRKHLKDDDSIEIINIHGKGYKLITPEPES >gi|226332011|gb|ACIB01000045.1| GENE 2 742 - 2301 1203 519 aa, chain - ## HITS:1 COG:TM1654_2 KEGG:ns NR:ns ## COG: TM1654_2 COG0642 # Protein_GI_number: 15644402 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Thermotoga maritima # 273 514 29 269 272 119 29.0 1e-26 MKKSTIWILGIIMGLSFLSLLYLQVSYIEEMVKMRKEQFNTSVRNALFQVSKDVEYDETQ RWLLEDITEAERRALAQSSSTTEQKNGLIQQSERYRFKSPDGTLYSEFELKMITTEPSKV PKAMISERHGRNTIPQTSRSLTDAIKNRYMYQRFLLDDVALRMIYKASDKSIGERVNFKK LDNYLKSNFINNGVELLYHFSVIDKDGREVYRCSDYEDGGSEDSYTQPLFQNDPPAKMSI VKVHFPGKKDYIFDSVSFMIPSMIFTIVLLITFIFTIYIVFRQKKLTEMKNDFINNMTHE FKTPISTISLAAQMLKDPAVGKSPQMFQHISGVINDETKRLRFQVEKVLQMSMFDRQKAT LKMKELDANELITGVINTFALKVERYNGKITSNLEATNPVIFADEMHITNVIFNLMDNAV KYKKPEEDLVLNVRTWNEPGKLMISIQDNGIGIKKENLKKVFDKFYRVHTGNLHDVKGFG LGLAYVKKIIQDHKGTIRAESELNVGTKFIIALPLLKND >gi|226332011|gb|ACIB01000045.1| GENE 3 2691 - 4847 2228 718 aa, chain - ## HITS:1 COG:FN1546 KEGG:ns NR:ns ## COG: FN1546 COG0480 # Protein_GI_number: 19704878 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factors (GTPases) # Organism: Fusobacterium nucleatum # 1 710 3 685 690 426 35.0 1e-119 
MKVYQTNEIKNIALLGNDGSGKTTLTEALLYESGIIKRRGRITAKNTVSDYFPVEQEYGY SVFSTIFHVEWNGKKLNIIDCPGSDDFVGAAMTALNVTDTAILLLNGQYGAEVGTQNHFR YTEKLGKPVIFLVNQLDNEKCDYDMVLEQLKSIYGPKVVPVQYPLATGPNFNSLIDVLLM KKYSWAPEGGAPIIEEIPAEEKDKAMELHKALVEAAAENDEGLMEKFFEQDSLTEDEMRE GIRKGLAARGMFPVFCVCAGKDMGVRRLMEFLGNVVPGVSEMPKVHNTRGEVVEPDSNGP TSLYFFKTGVEPHIGDVQYFKVMSGKVHEGDDFTNADRGSKERVAQIYACAGANRIKVEE MVAGDIGCTVKLKDVHTGNTLNGKGAENRFNFIKYPNSKYSRAIKPVNEADTEKMMAILN RMREEDPTWVIEQSKELKQTIVHGQGEFHLRTLKWRLENNEKLQVKFEEPKIPYRETITK AARADYRHKKQSGGAGQFGEVHLIVEPYYEGMPVPDMYKFGGQEFKINVKGTEEVPLEWG GKLVFVNSIVGGSIDARFLPAILKGIMSRMEQGPLTGSYARDVRVIVYDGKMHPVDSNEI SFMLAGRNAFSEAFKNAGPKILEPIYDVEVFVPSDKMGDVMGDLQGRRAMIMGMSSENGY EKLVAKVPLKEMSSYSTALSSLTGGRASFIMKFASYELVPSDVQDKLIKDFESKQTEE >gi|226332011|gb|ACIB01000045.1| GENE 4 5683 - 6816 949 377 aa, chain + ## HITS:1 COG:SPy1040 KEGG:ns NR:ns ## COG: SPy1040 COG0635 # Protein_GI_number: 15675037 # Func_class: H Coenzyme transport and metabolism # Function: Coproporphyrinogen III oxidase and related Fe-S oxidoreductases # Organism: Streptococcus pyogenes M1 GAS # 5 368 9 369 376 245 34.0 1e-64 MAGIYIHVPFCKTRCIYCDFYSTTRSEWKGRYIEALCKELEMRYTYLKGKPIETLYFGGG TPSQLDEKDFRKVFDTVRRVYGMENCHEITLEANPDDLCPEFLQMLSELPFNRISMGIQT FDDTTLKLLKRRHNAAQAIRAVELCRAHGFRNISIDLIYGLPGETTERWEKDLQQAIALD VEHISAYHLIYEEGTPIYKMLQKHQVEEVDEDSSVRFFTLLIDRLHEAGYEHYEISNFCK PGMYSRHNTSYWQGVSYLGCGPSAHSFDGQTREWNCSSIEKYMSGIESGQRDFEREERDL ATRYNEFIITSVRTQWGISLERLSNDYGTQLEQYCLKMARPSLENGKLEIYEGALRLTRE GIFISDSIMSDLLWVEN >gi|226332011|gb|ACIB01000045.1| GENE 5 6829 - 7386 626 185 aa, chain + ## HITS:1 COG:mll8140 KEGG:ns NR:ns ## COG: mll8140 COG1595 # Protein_GI_number: 13476734 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 8 178 2 175 208 63 28.0 2e-10 MTESEVRKLLRRMKELDSQTAFRAFYDMTYDRLYRIAYYYVKREEWSQEIVLDVFLKLWE QRSSLPEVKSIEDYCFILVKNASLNYLEKENRRTTVSTETLPEPEAQSDSPEESMISEEL FAIYVKALDRLPERCREVFIRIREEKQSYAQVAEELGISTKTVDAQLQKATIRLKEAISM 
VNNDR >gi|226332011|gb|ACIB01000045.1| GENE 6 7525 - 7962 279 145 aa, chain + ## HITS:1 COG:no KEGG:BF3871 NR:ns ## KEGG: BF3871 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 145 1 145 145 259 100.0 1e-68 MKKLLRPTTFALLTMALCFTACENGNNSYPKEYLGFEKKAQDFSYQKNAEETELQVKIVA ADKTSEDRIVFIESPARPTKPGSSSAPFYKIKDNKITIKGGSKSAKATILVYPRKVGTNE YIQLICRPQNGKSETTKMSIRLVKK >gi|226332011|gb|ACIB01000045.1| GENE 7 7974 - 8852 754 292 aa, chain + ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 74 261 115 298 331 82 30.0 6e-16 MKYTDRELDRLLEKLIASTRSPRGKFSAAASYQILEKRLPRPALRLFSLRTLSAVAAIAL LCFIGWNTYCYLKPAALQTISTLADTRTIKLPDGTEVTLNHFSSLTYPEKFKGEHREVNL KGEAYFEVTKNRKHSFIVQTESVNVEVLGTHFNVESYPDDPEVKTTLLEGSVAVSNKSNS VRIVLKPNESAIYNKEKKSMTLEVSDRVAEEIAWRNGELIFTNLPLQEIARQLSNTFGVD ISITDTALQNYRITARFSSEEGLDQILDLLHTVGNFNYSYNNKEITITTKLN >gi|226332011|gb|ACIB01000045.1| GENE 8 8855 - 11533 2079 892 aa, chain + ## HITS:1 COG:no KEGG:BF3642 NR:ns ## KEGG: BF3642 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 892 1 892 892 1782 100.0 0 MRKSNPDFHHIRSYILMCVVSLTFSTIYAQTPEPKLTLKLQNTSLSEVIRQIEQDTGFSF IYGEEVKLDHKVSLNVQKKPLREVLNLLFANKPISYKITGKHILLQKTPPKPVSRKFTVS GYVTDGASAETLIGANILESRHHQGTTTNPYGFYSITLPEGEARLSFSYLGYTGQQHVLN LTADTLLNIRMKDNNMLQEVVIVSDKAESGVMATQMGASEIPMTQIKNTPSILGEADVMK AIQLMPGVQAGMEGSAGLYVRGGGPDQNLILLDGVPVYNVDHLLGFFSVFTPEAVKKVTL FKSSFPARFGGRLSSVIDVRTNDGDMKNYHGVVSVGLLTSKINFEGPIIKNRTSFNISAR RSYIDLLAKPFMPKDEKYSYYFFDVNAKINHKFSDRSRLFLSAYNGKDHFMTKYDDTYYG DEDKYRDGGKMNWGNTIVSGRWNYIFNNKLFSNTTVAFNNYKFDVSTFTKNEIHTNNQIT WNRYKADYKSGIRDWSAQIDFDYNPIPTHHVKFGVQYLHHSFRPEVSTSKIFDKTGETIE RDTTYYTTSNSEILAHEASAYLEDNFNLSSRLRMNLGLHFSTFQVQKKNYFSVQPRISAR 
YQLSKDVVLKASYTKMSQYVHLISSMPFAMPTDLWVPVTSKIKPMQAHQVSLGGYYTGID GWEFSIEGYYKGMKNVLEYKDGVSFLGSSTGWEDKVEMGHGRSMGVEFMAQKTIGKTTGW LAYTLAKSDRKFATGGINNGERFPYKYDRRHNIDLTLNHRFNERIDVSASWIFTTGGTTT IPTELTGIIRPGDNDSSIEEGDYVKHRNNYRLPATHRLNVGVNFSKKTKHGMRIWNVSIY NVYNAMNPTLIYRTYKKTEYTQGEQAQGNRIPVIRKFTLLPCIPSFSYTYKF >gi|226332011|gb|ACIB01000045.1| GENE 9 11540 - 12574 778 344 aa, chain + ## HITS:1 COG:no KEGG:BF3874 NR:ns ## KEGG: BF3874 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 344 1 344 344 701 99.0 0 MRRSILDIFLFLLVIISTAACNNDLPFDLKENPPKLVMNAIINADSTYNTLFLNLTGRNQ IGQIKGATVEVRINGSLSETLRPDPHSSDKGRFYINSAFHPGDVVRIDAMTDDGEHHAWA EVTVPQPIGKIEKVDTASIMRKPSNYGYGTPPRRHLRYQIKIKDRPGEKNFYRIIVEQRK YWKYYWEQNDQTCWDSAMQKSFKLQTNEDVVLTDGKPSTEEDDENGLFGTVNNKYAIFDD SRFTDGSYTMNVYNDIYGWGFWGQEYIWIKTDVYIRILSITEKEYYYLRALNLLDSDAYD NTLSEPIAFPSNVNGGTGMVGFSTETNYMLTVKNNAVPPMVPDL >gi|226332011|gb|ACIB01000045.1| GENE 10 12612 - 12899 368 95 aa, chain + ## HITS:1 COG:no KEGG:BF3875 NR:ns ## KEGG: BF3875 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 95 1 95 95 132 98.0 3e-30 MFTIIGIMLTGMLLGYLLRSKRLTWIHKVITLLIWLLLFLLGIDVGGNEAIVKGLHAIGL EALIITAAAVTGSTLAAWGLWYLLHTRYQKKEAKP >gi|226332011|gb|ACIB01000045.1| GENE 11 12896 - 13498 347 200 aa, chain + ## HITS:1 COG:FN1083 KEGG:ns NR:ns ## COG: FN1083 COG2431 # Protein_GI_number: 19704418 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 5 200 2 195 198 114 38.0 1e-25 MKGSLIIVSFFIVGTLCGLFQLIPYDFSQSKLSFYALCGLMFCVGISIGNDPQTLKSFRS LNPRLLFLPVMTILGTLAGCAVVSLFLSHRSPSDCMAVGSGFGYYSLSSIFITEYKGAEL GTIALLSNIMREIIALLCAPLLVKFFGKLAPISVGGATTMDTTLPIITRCSGQEFVIVSI FHGFIVDFSVPFLVTLFCSI >gi|226332011|gb|ACIB01000045.1| GENE 12 13763 - 14116 437 117 aa, chain + ## HITS:1 COG:no KEGG:BF3877 NR:ns ## KEGG: BF3877 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 117 1 117 117 195 100.0 
5e-49 MKRLGLTLVAALCLVATTFAAGNQPTVAKWEGNINVNKLGKYLNLSSVQAEEVANICNYF DEQMGRATTAKKNKDTMVRNAVYGNLKLMKKTLTDAQYTKYTTILNMTLKNKGIEVK >gi|226332011|gb|ACIB01000045.1| GENE 13 14241 - 15371 651 376 aa, chain - ## HITS:1 COG:MA1854 KEGG:ns NR:ns ## COG: MA1854 COG1672 # Protein_GI_number: 20090704 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Methanosarcina acetivorans str.C2A # 7 375 7 386 390 94 24.0 3e-19 MAVINPFVVGGYVSPRYFCDRVAETENLIRNLINGRNVALVSTRRMGKTGLIRHCFYQPL IKEGYYTFFIDIYATSSLKEFVFALGKGIFEKLKPQGNKFIDRFFSIITSLRIGFKLDSI TGEPILELGLGDIHAPETTLEEIFIYLEQADKPCIVAIDEFQQISSYPEKNLEAILRTKV QHCSNSNFVFAGSQRHIMMNIFNSPSRPFYQSVSMMHLGAIPLEVYKPFVKGLFEENGKK VTDQLVENVYTLFDGHTWYVQLMLNELFILTDKKGICDVPMINLALNNIIATQDFTFQEI FSRLPEKQKEIMIAIAKEQKAKGVTSAAFIKKYRLTSASSVQSGLKGLLEKDMLTQESGG YQVYDRLFNIWLRRNY >gi|226332011|gb|ACIB01000045.1| GENE 14 15412 - 16368 377 318 aa, chain - ## HITS:1 COG:PA3145 KEGG:ns NR:ns ## COG: PA3145 COG0472 # Protein_GI_number: 15598341 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Pseudomonas aeruginosa # 4 252 9 275 339 80 29.0 3e-15 MCYLLILVLLFLAELFYFRIADKCNIIDKPNERSSHTRITLRGRGIIFYLSALTFFLTNQ FEYPWFILALTLITFISFIDDIRSTSQGLRLVFHFTAMALMFYQWGLFSLPWWTLLVALI VCTGIINAYNFMDGITGITGGYSLVILVSLAYVNAEVISFTEQNFIYTMICSVLVFDFFN FRKRAKCFAGDVGSVSIAFVVLFLIGSLILQTKDFSWLVMLTVYGVDSVLTIIHRLLLHE NIGLPHRKHLYQIMVNELRIPHVVVSLVYMIVQIVIIIGYLYCRGYGDWYLLGCILLLSG IYIVLMRKYFHLHLLPKR >gi|226332011|gb|ACIB01000045.1| GENE 15 16597 - 16803 88 68 aa, chain - ## HITS:1 COG:no KEGG:BF2592 NR:ns ## KEGG: BF2592 # Name: not_defined # Def: putative LPS biosynthesis related DNTP-hexose dehydratase-epimerase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 54 193 246 298 104 90.0 1e-21 MVQDIANLVPLLIEKGGIYNVCDSYQLSFRELEMVICKQLNKKRPISIPYCLLKVWLFLV IVLGKIPQ >gi|226332011|gb|ACIB01000045.1| GENE 16 17646 - 
18209 381 187 aa, chain + ## HITS:1 COG:no KEGG:BF3651 NR:ns ## KEGG: BF3651 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 187 1 187 187 352 100.0 4e-96 MAQLESKQLNKIGLHENATNRNMDEKLLLKCENSIRAYKIVNLLNKHNIALRQHDESQDP RVGAYGAVTGIAIYVFAKDYEKALSIVSPILKDFNTISTFCPKCGSENVKPITGNHKYIT YLIFLCLFLILTPGIYIALPEDFGLRSSLINKIALMMVALGFILMPIINHYNVNYKCKKC GNRFRHY >gi|226332011|gb|ACIB01000045.1| GENE 17 18495 - 18992 345 165 aa, chain + ## HITS:1 COG:no KEGG:BF1545 NR:ns ## KEGG: BF1545 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 159 35 200 316 245 80.0 5e-64 MEVIEAKYGIKGPAIVLKLLCKIYKERCYILWDEEQCLIFAYKVEREVQTGEVQGIIVIL FIKGILDRSSCQENGILTSENIQKVWMEATKRRKKELSKLPYLMVKTEKENGKPDNTSAQ QEIEQPKPLKEGKVAVGTGNVAVSLGNVVRHVAVNSMHAIPDKVK >gi|226332011|gb|ACIB01000045.1| GENE 18 19111 - 19317 151 68 aa, chain + ## HITS:1 COG:no KEGG:BF1525 NR:ns ## KEGG: BF1525 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 68 237 304 304 132 91.0 4e-30 MTHNYPGLKDTLQRLGINEVSEVNAILRLSDYGGKGTTVWRLITNTCWSDIVSKGRYLIA ALNKAKRK >gi|226332011|gb|ACIB01000045.1| GENE 19 19659 - 19862 240 67 aa, chain - ## HITS:1 COG:no KEGG:BF3882 NR:ns ## KEGG: BF3882 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 61 13 73 92 112 95.0 4e-24 MCMNDTENLTAVQTAMLRVVANGEYRFNSIPVVRKYELGSAQTITRNKRMLTERDFIEKE GGTVCVF >gi|226332011|gb|ACIB01000045.1| GENE 20 20080 - 21129 1175 349 aa, chain - ## HITS:1 COG:no KEGG:BF3653 NR:ns ## KEGG: BF3653 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 349 1 349 349 680 99.0 0 MKIGDKVRFLSEVGGGIVKGFRGKDIVLVEDADGFDIPMQIRECVVIDTDDYNMTRKVAP APKKPEEPVKPVKPEIPVVRSAEVRGGDVLNVFLAYVPEDVKAISSTSFEAYLVNDSNYY LYYTYQSAEGKAWKTRSHGLVEPNTKLLLEEFTKDMLNEMEHVAVQFIAFKDGRTAPLKP AVCVELRIDTVKFYKLHTFRESDFFEQPALVYDIVKDDVPTRQVFVSAEELQSVLIQKKE 
VDKPKSQPIVKRGGKNEILEIDLHINELLDDTRGMGNAEILNYQLDKFREVMEKYKAKRE QKIVFIHGKGDGVLRKALIDELKRKYSNCRYQDASFQEYGFGATMVTIK >gi|226332011|gb|ACIB01000045.1| GENE 21 21210 - 22430 963 406 aa, chain + ## HITS:1 COG:HI0245 KEGG:ns NR:ns ## COG: HI0245 COG0809 # Protein_GI_number: 16272205 # Func_class: J Translation, ribosomal structure and biogenesis # Function: S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) # Organism: Haemophilus influenzae # 8 404 1 353 363 230 34.0 4e-60 MKENPKHIRISEYNYPLPDERIAKFPLPVRDQSKLLIYRHGEVSEDVFTSLPEYLPAGSL MIFNNTKVIQARLHFRKETGALIEVFCLEPIQPNDYVLNFQQTEHAAWLCMVGNLKKWKD GPLCREMTVKGFPITLTATRGECRGTSHWIDFRWDNPEVTFADILEVFGELPIPPYLNRD TEESDKETYQTVYSKIKGSVAAPTAGLHFTPRVLDALTEKGIDLEELTLHVGAGTFKPVK SEEIEGHEMHTEYISVSRSIIKKLIDHDACAIAVGTTSVRTLESLYHIGVTLANNPEATE EQLHVKQWQPYETECDVRPVVALQKILGYLDRHGMEALHTSTQIIIAPGYDYKIVKAMVT NFHQPQSTLLLLVSAFVKGNWHTIYDYALGHDFRFLSYGDSSLLIP >gi|226332011|gb|ACIB01000045.1| GENE 22 22434 - 22964 539 176 aa, chain + ## HITS:1 COG:MT1787 KEGG:ns NR:ns ## COG: MT1787 COG1443 # Protein_GI_number: 15841209 # Func_class: I Lipid transport and metabolism # Function: Isopentenyldiphosphate isomerase # Organism: Mycobacterium tuberculosis CDC1551 # 8 155 12 166 203 73 30.0 2e-13 MQSDNNQEMFPIVDEQGTITGAATRGECHSGSKLLHPVVHLHVFNSKGELYLQKRPEWKD IQPGKWDTSVGGHIDLGESVEIALKREVAEELGITDFTPELLTSYVFESARERELVFVHK TVYDGEIHPSDELDGGRFWSYEEIKTNLGKGVFTPNFESEIDKVEIFSSTSQKEAV >gi|226332011|gb|ACIB01000045.1| GENE 23 23083 - 23625 487 180 aa, chain - ## HITS:1 COG:BH2830 KEGG:ns NR:ns ## COG: BH2830 COG0386 # Protein_GI_number: 15615393 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Glutathione peroxidase # Organism: Bacillus halodurans # 23 168 2 146 157 157 51.0 1e-38 MKAFFSFVAALVLSLSMAAQNKSFYDFTVKTIDGKEYPLSGLKGKKVLVVNVASKCGLTP QYAELQELYDQYKDQNFVIIGFPANDFMGQEPGTNEEIAKFCSVNYDVTFPIMAKISVKG KDMAPLYHWLTEKKLNGKQDAPVQWNFQKFMIDENGNWVGFVAPKESPFSETIISWIEEK >gi|226332011|gb|ACIB01000045.1| GENE 24 23688 - 26033 
1739 781 aa, chain - ## HITS:1 COG:XF0842 KEGG:ns NR:ns ## COG: XF0842 COG3537 # Protein_GI_number: 15837444 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Xylella fastidiosa 9a5c # 6 779 23 781 790 524 39.0 1e-148 MKKKVLSLLAFLPAFTTVMAQQTAEVPDVCAFVNPIIGTNGMGHTFPGACAPFGLVQLSP DTDTIPHNIDGTYQKNAYEYCAGYQYHDPTIVGFSHTHLSGTGHSDLGDILIMPATGQLK LNPGRAGTPDEGYRSRFSHDTEVARPGYYEVELADYGIKAQLTATQRVGIHKYIFPDNAD GHIILDLIHGIYNYDGKTLWANLRVENDTLLTGYRITNGWARTNYTYFAISLSQPIKDYG YTDKGKALYKGFWRRFNTDRNFPEMTGRKLVAYFNFDTRQNPELVIKVALSAVSTEGAVK NLQAEAAGKTFNQLVAEANSAWNRELDVLEAKGTPDQLAMFYTSLYHTMINPSVYMDVDG RYRGLDHNIHTSEGFTNYTIFSLWDTYRAEHPFLNLLKPRQNTDMVQSMIRHQQQSVHGM LPVWSLMGNEGWCMSGYHAVSALADAVAKGADISVGEALMAMDHTANVPYYEGVEAYKRL GYVPFDQSGTAASTTLEYAYDDWTIYRTALLAGDDQLADLYKKRANNYRNVFDTSVGFAR PRYSNGEFRKEFDAMQTYGEGFIEGNSWNFSFHVPHDVAGLIRLMGGEKKFVSRLDTLFS MALPRKYYEKNEDIAEVSLVGGYVHGNEPSHHIPYLYAWTSQPWKTQYWLRTVMNRMYKN DIDGLGGNDDCGQMSAWYLFTAMGFYPVCPGTDQYVLGAPYLPYIRMNLPNGHTFEIKAP KVSDRNCYVRQVKLNGKVYDKMYITHADLLAGGTLEFDMAASPNKKRGLAKEAKPYSMSE E >gi|226332011|gb|ACIB01000045.1| GENE 25 26212 - 30531 3390 1439 aa, chain + ## HITS:1 COG:PA2520 KEGG:ns NR:ns ## COG: PA2520 COG3696 # Protein_GI_number: 15597716 # Func_class: P Inorganic ion transport and metabolism # Function: Putative silver efflux pump # Organism: Pseudomonas aeruginosa # 1 1042 1 1045 1051 808 43.0 0 MFKAIVRFSIKKKLFVGLTTLFLLIGGIYAMLTLPIDAVPDITNNQVQIVTVSPTLAPQE VEQLITMPIEIAMSNIMNVEDIRSVSRFGLSVVTVVFKESVPTLDARQLINEQIQSVAGE IPPELGMPEMMPITTGLGEIYQYILKVEPGYEDKYDAMELRTIQDWMVKRQLSGIPGIVE INSFGGYLKQYEVAVDPDALFSLNITIGEVFEALSKNNQNTGGSYIEKAKNAYYIRSEGM ISRTKDIEQIVVANRNGIPVHISDVGIVRFGAPKRFGAMTKDGKGECVGGIAMMLKGANA NVVTQELEKRVEKIQKLLPEGVSIEPYLNRSELVNRNISTVVHNLIEGAIIVFVVLIIFL GNIRAGLIVASVIPLAMLFAFILMRIFGVTANLMSLGAIDFGIVVDGSIVIVEGILAHLY SNKLKGRTLSGTEMDEEVEKGASGVVRSATFAVFIILIVFFPILTLSGIEGKYFTPMAKT LVFCIIGALLLSLTYVPMMASLFLKHTIMVKPTFADRFFEKLNVIYQRCLHFCLRFKWQT 
VTVAFATLIGSFLLFGRLGAEFIPTLDEGDFAMQMTLPAGSSLSESIEVSNQAEKLLMDR FPEIKHVVAKIGTAEVPTDPMAVEDADVMIVMKPFKEWTSASSRAEMVEKMKEALQPLEN RAEFNFSQPIQLRFNELMTGAKADIAVKLYGEDTHELYAKAKEAARFVEQVAGASDVIVE QTMGLPQLVVKYNRGKIARYGINIEELNTMIRTAYAGEVSGVVFENERRFDLVVRLNQEK VADLNLDKLFIRTSEGIQIPVSEVASIDLVNGPLQINRDATKRRIVIGVNVRDADIQQVV SEIQQILDKNIKLQPGYYFEYGGQFENLQNAIRTLTIVIPVALMLILLILFFAFKNVTYT LMVFSTVPLSLIGGILALWLRGLPFSISAGVGFIALFGVAVLNGILMINHFNDLRKQNKY AMTTNQIIKRGTPHLLRPVFLTGLVASLGFVPMAIATSAGAEVQRPLATVVIGGLILSTI LTLIILPVFYKIVNAASAGWKQPKRRLHLFILLPALLLSATATAQAPQTVSLEQAIEIAK QNHPRLKIATTAIRQAQAGRGEIVEATPTTFNYSWGQLNGENKKDKEMAFEQSLGSLLTP FYKNVLINRQVQTSTYYRQMVEKEITAEVKRAWAYYQYASGMSSLYHDQDRMAAELRRIG ELRYEQGEITLLEKNMMTTLAADLHNRLFQALEEKKVALARFQWSCYADDPITPKDTIMT LFPTDCEQRSTSEAHLGFFNSQASEARAILNVERSRFFPELSIGYSRQDILPLKNLNAWM VGVSFPIYFLPQKSRIKQAKLAVSAAQIQAEANIRELNNKITELSAALRRYEESLRFYTS SALKEADELVKTANLQLQHSETGIAEFIQSVSAAREIRKGYIETIYQYNIASLEYELYQ >gi|226332011|gb|ACIB01000045.1| GENE 26 30553 - 31440 719 295 aa, chain + ## HITS:1 COG:RSp0529 KEGG:ns NR:ns ## COG: RSp0529 COG0845 # Protein_GI_number: 17548750 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Ralstonia solanacearum # 61 294 30 265 344 73 24.0 4e-13 MKKIFYPVLLLVLAACKNPEQTSETIVPAPAIADIPTDTVTTQVDGITSATSKPNQVSFN GTIVLPPQRQATVALTMGGVVKHTSLLPGQQVRQGALLATLENPDFIALQQTYLDSHAQA EYLQAEYERQKTLSTEQAASQKKFQQSKADYLSMKSKLEATAAQLTLLGIVPEELLKSGI QPLLQVKAPISGYISDVAMNIGKYIQPGEALCEVIDKSAPLLCLTTYEKDLADMKVGSPV QFRVNGMGKTVFKATLVSIGQKVDEVSRSLEVYARIDDVNQQFRPGMYVTARIQK >gi|226332011|gb|ACIB01000045.1| GENE 27 31469 - 32530 677 353 aa, chain + ## HITS:1 COG:SA0250 KEGG:ns NR:ns ## COG: SA0250 COG3275 # Protein_GI_number: 15925963 # Func_class: T Signal transduction mechanisms # Function: Putative regulator of cell autolysis # Organism: Staphylococcus aureus N315 # 170 330 379 555 584 94 31.0 4e-19 MKRIRAIYNDKYLLSTLIISLAVAVLIHFPESVSLFDGFESHTLFPGMKFADVANEILFT FLSLLILFAVNTRLFHFNQTSMKITWQKIILSFVLTWILSNFLGQCFVYLHKTFDIPAID 
AMVHHYLHPLRDFIMSSLVTSSCYIIYLIRRQQQVIVENEQLKAENIRNQFEVLKNQLNP HMLFNSLNTLRSLVRENQDRAQDYIQELSRVLRYTLQGNESQCVTLREEMDFVSAYIFLL KMRFEDNLRFEINITHNSEEYLLPPMSIQMLIENAVKHNEISNRRPLTITIATDSEEGLL VSNPIQSKLTATTGTGIGLVNLDKRYRLLFRQEIQITEDRNFTVRIPLIRKDL >gi|226332011|gb|ACIB01000045.1| GENE 28 32527 - 33306 594 259 aa, chain + ## HITS:1 COG:SA0251 KEGG:ns NR:ns ## COG: SA0251 COG3279 # Protein_GI_number: 15925964 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Staphylococcus aureus N315 # 1 253 1 240 246 103 28.0 3e-22 MKATIIEDEKAAVRNLTSLLGEVCPQIEIVTELDSIADTIEWFTDHSMPDLVFMDIHLAD GSAFEIFEHVHITCPIIFTTAYDEYALRAFKVNSIDYLLKPIGRQDIEQALEKLTLFSSG NSKPTGPETNELVNLMRMLKKQESYKTHFLIPIKGDKLLPVSVDMILLFYIRDCQVKAVL TDGTEYSFPQTLDELTECLDPTLFFRVNRQYLLSREAIKDIDLWFNSRLSINLRHSVSGE KILVSKARVAEFKEWFSVK >gi|226332011|gb|ACIB01000045.1| GENE 29 33395 - 34864 1366 489 aa, chain + ## HITS:1 COG:CAC3230 KEGG:ns NR:ns ## COG: CAC3230 COG4624 # Protein_GI_number: 15896476 # Func_class: R General function prediction only # Function: Iron only hydrogenase large subunit, C-terminal domain # Organism: Clostridium acetobutylicum # 173 406 95 335 450 148 36.0 2e-35 MAFTNNVMIVRHKLLAKLVNLWKENKLTNEIDRLPIELSPRRSRPLGRCCIHKERAVYKY KLFPLLGFDMTDETDELTPLSEYARQALERKNKQKENILCVIDEACSSCVQVNYEVTNLC RGCVARSCYMNCPKDAIRFRKNGQAKIDHDACISCGKCHQSCPYHAIVFIPVPCEEACPV KAISKDENGIEHIDESKCIYCGKCLNACPFGAIFEISQAFDVLEGIRSGEKMIAIPAPSI LGQFNTSIEAVYGALRQMGFADVVEVAQGAMDTVSHEAAELKEKLEEGQPFMTTSCCPSY IELVNKHIPGMKPYVSSTGSPMYYAARIAKERHPDAKIVFIGPCVAKRKEARRDECVDYI LTFEEMASIFEGLDIQLEQTQPFSVLYTSVREAHGFAQAGGVMGAIKAYLGEEAKKFSAI QVSDLNKKNIGLLRAAAKTGKAQGQFIEVMACEGGCISGPSAHNDAIGGRRQLNQELIKR RESYEETHR >gi|226332011|gb|ACIB01000045.1| GENE 30 34845 - 35882 684 345 aa, chain + ## HITS:1 COG:CAC1631 KEGG:ns NR:ns ## COG: CAC1631 COG0502 # Protein_GI_number: 15894909 # Func_class: H Coenzyme transport and metabolism # Function: Biotin synthase and related enzymes # Organism: Clostridium 
acetobutylicum # 3 312 6 310 350 240 42.0 3e-63 MKKLIDKLREERTLTSEEFAHLLSHYDDEALAYINQQAREVATAHFGQGVYIRGLIEISN YCRNNCNYCGIRKDNWRADRYRLSKEMIWDCCEHGYKLGFRSFVLQGGEDPKRSDNDMEE IIAEIRRRYPECAITLSIGEKPAKAYERYFMKGADRYLLRHETFNRGHYYCLHPYEMSNE RRIKCLQELKRIGFQTGTGIMVGPPHQRVEFLIEDIRFIENFQPEMIGIGPFIPHQRTPF CDEKAGSVELTLLLLSIFRLMHPKALIPSTTALASLAPDGRIRGILAGANVVMPNLSPII VRNKYNLYDQKVAFGAEAAEGLALLEKQLTAVGYHIDYSRGDYNN >gi|226332011|gb|ACIB01000045.1| GENE 31 35895 - 37313 1365 472 aa, chain + ## HITS:1 COG:CAC1356 KEGG:ns NR:ns ## COG: CAC1356 COG1060 # Protein_GI_number: 15894635 # Func_class: H Coenzyme transport and metabolism; R General function prediction only # Function: Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes # Organism: Clostridium acetobutylicum # 1 472 1 472 472 716 74.0 0 MYKVDSPDAVDFINHQEIIETLEYARAHRNDRELIRNLIEKARLCKGLTHREAAILLECA EEDLTKEIFHLAKEIKQKFYGNRIVMFAPLYLSNYCVNGCVYCPYHLKNKTIIRKKLTQE EICREVIALQDMGHKRLALEAGEDPVRNSIDYILESIRTIYSIHHKNGAIRRVNVNIAAT TVENYRKLKDAGIGTYILFQETYHKENYEQLHPTGPKSNYAYHTEAMDRAMQGGIDDVGM GVLFGLNTYRYDFVGLLMHAEHLEAVYGVGPHTISVPRICSADDINAEDFENAISDEIFQ KIVAVIRISVPYTGMIISTRESQKTREKVLDLGISQISGGSRTSVGGYAEAETPEENSAQ FDVSDTRTLDEVVNWLLKLGYIPSFCTACYRAGRTGDRFMSLVKSGQIANCCSPNALITL QEYLEDYASEETKARGVAMIKQEMQHIPNPKIRERALENLKQIAAGERDFRF >gi|226332011|gb|ACIB01000045.1| GENE 32 37310 - 38485 595 391 aa, chain + ## HITS:1 COG:CAC1651 KEGG:ns NR:ns ## COG: CAC1651 COG1160 # Protein_GI_number: 15894928 # Func_class: R General function prediction only # Function: Predicted GTPases # Organism: Clostridium acetobutylicum # 1 388 4 390 411 327 45.0 2e-89 MKTSPRSNRLHIALFGKRNSGKSSLINALTNQNAALVSDIAGTTTDPVYQPMEIHGIGPC VFIDTAGFDDEGELGSLRIERTLQAADKADIALMVCCDTELSEEQRWIELLKERNIPYLL VLNKADLLEKPDEVADKLEQQTGQHPLIVSAKEKTGIDSIRQSILHRLPELNEQPDIVGD LANEGDVVLLVMPQDIQAPKGRLILPQVQTLRELLDKKCITLSCTTDQLDNALKVLSAPP SLIITDSQVFRTVYEKKPPQSRLTSFSVLFARYKGDIDYYTEGAYIIDQLTENSRVLIAE ACTHAPLSEDIGRVKLPRMLRKRIGEKLHIDIVSGNDFPKDLNCYDLIIHCGACMFNRKH VLNRIVKAKAQGVPMTNYGIVIAYINGIPFH 
>gi|226332011|gb|ACIB01000045.1| GENE 33 38606 - 39802 798 398 aa, chain - ## HITS:1 COG:no KEGG:BF3896 NR:ns ## KEGG: BF3896 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 398 2 399 399 801 100.0 0 MNTKSFFMVAIGVFVLASCNLNQVDQTGLLTASVRDALNSPATLSIADEIESVEYIPLEM TNDDASLIDGVVDFAITSKYIYVLVGKEARIVLFDRQGHFLRTFLRQGQGPDDFNGMIGF IQADEADNRFYVIGNKVGIYTLEGTFVEDLPVNSPIIYAHHLGNKRIGAIAMPLMPFQNG SFGIGVFQEGGEAIITKNDFYSPLVPREDSGFTFGVMGSPSDGKQASVLFKTASNDTIYR LSADTIQPVLVAGLSNSDEEVIRGLNIRDIKRFPANGDIFVSDIFETPRRFYLRMMLNEK YYVASVDKHSGETVVEQCDIPETSAYNLADINMQLGMVGSKGHNRFPVWGRVLGNNLVQV VTPYEIETFKEQTQITVPQELQKRNANENPIFIIYKIK >gi|226332011|gb|ACIB01000045.1| GENE 34 40222 - 42531 1970 769 aa, chain - ## HITS:1 COG:XF0840 KEGG:ns NR:ns ## COG: XF0840 COG1874 # Protein_GI_number: 15837442 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase # Organism: Xylella fastidiosa 9a5c # 4 599 6 602 612 432 38.0 1e-120 MKKLILLLILIFSLPVAAQNFTIGKSTFLLNGKPFTVKAAELHYTRIPAPYWEHRIEMCK ALGMNTICLYVFWNIHEQTEGQFDFTGQNDIAAFCRLAQKHGMYVIVRPGPYVCAEWEMG GLPWWLLKKKDIVLRTLDPYFMERTAIFMKEVGKQLAPLQITRGGNIIMVQVENEYGAYA VDKPYISAIRDIVKSAGFTEVPLFQCDWSSTFDRNGLDDLLWTINFGTGANIEQQFKRLK EARPETPLMCSEFWSGWFDHWGRKHETRPAKSMVQGIKDMLDRNISFSLYMAHGGTTFGH WGGANNPSYSAMCSSYDYDAPISEPGWTTDKYFQLRDLLKNYLPAGEQLPEIPEAFPVIE IPEVEFTQIAPLFSNLPEAKESMDIQPMEAFDQGWGTILYRTTLQEPVENGTTMKITEVH DWAQVFADGKLLARLDRRRGEFVLQLPALKKGTRIDILVEAMGRVNFDESIHDRKGITEK VELVRGKQSAELKNWTVYSFPVDYSFVQDKRYKNGTAQTMPAYYRTTFRLDKVGDTFLDM STWGKGMVWVNGLAIGRFWEIGPQQTLFMPGCWLKEGENEIIVLDLKGPEKASIRGLKKP ILDWLRNEGASTHRKEGEQLDLSRETPVAEGTFVPGNGWQEVCFDRQSIGRYFCLEALSA QKGKKIAAIAELDVLGADGKPISREKWRIRYADSEETRSGNCTGDKVFDLQESTYWMTVA KDAYPHQLVIDLGGDYTVTGFRYLPRAEKGYPGMIKDYRVYVKGEDFQY >gi|226332011|gb|ACIB01000045.1| GENE 35 42776 - 44164 1731 462 aa, chain - ## HITS:1 COG:PH0923 KEGG:ns NR:ns ## COG: PH0923 COG1109 # Protein_GI_number: 14590777 # Func_class: G Carbohydrate transport and metabolism # Function: 
Phosphomannomutase # Organism: Pyrococcus horikoshii # 12 448 6 441 455 253 38.0 7e-67 MTLIKSISGIRGTIGGGAGEGLNPLDIVKFTSAYATLIRKTCKSKSNKIVVGRDARISGE MVKNVVVGTLMGMGWDVVDIDLASTPTTELAVTMEGACGGIILTASHNPKQWNALKLLNE HGEFLNAAEGQEVLRIAAAEEFDYADVDHLGSYRKDLTYNKKHIDSVLALDLVDVEAIKK ADFTVAIDCVNSVGGIILPELLERLGVKHVEKLYCEPTGNFAHNPEPLEKNLGDIMNLMK GGKADVAFVVDPDVDRLAMICENGVMYGEEYTLVTVADYVLKHTPGNTVSNLSSTRALRD VTRKYGMEYNASAVGEVNVVTKMKATNAVIGGEGNGGVIYPASHYGRDALVGIALFLSHL AHEGKKVSELRATYPPYFIAKNRVDLTPEIDVDAILAKVKDIYKNEEINDIDGVKIDFAD KWVHLRKSNTEPIIRIYSEASTMEAAEEIGQKIMNVINELAK >gi|226332011|gb|ACIB01000045.1| GENE 36 44201 - 44845 545 214 aa, chain - ## HITS:1 COG:no KEGG:BF3899 NR:ns ## KEGG: BF3899 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 214 1 214 214 383 100.0 1e-105 MKKLLLLFFSLLAFGYMFQACDNSKTYAEMLDEEKDAVNAFIKKHNIQTISESDFEANGY KTDTTKNEYVAFSNGVYMQIVDKGIVTDKPENDSIKNNNIVAVRFVEHDIKANDTTCFNV VLPGFENYPNYYTYPDVFRYVDNGTSVAGVFTEGSMYAKYGTTDVPPGWLLALKYVTNYA HVRMIVPSKMGHQSANQYVNPYFYDIRKFQKALN >gi|226332011|gb|ACIB01000045.1| GENE 37 44988 - 46019 751 343 aa, chain - ## HITS:1 COG:aq_1630 KEGG:ns NR:ns ## COG: aq_1630 COG0618 # Protein_GI_number: 15606737 # Func_class: R General function prediction only # Function: Exopolyphosphatase-related proteins # Organism: Aquifex aeolicus # 24 342 19 322 325 140 31.0 2e-33 MLTKVIAQAHIDHFTKWFERADKIVIVSHVSPDGDAIGSSLGLYHFLDSQDKIVNVIVPN AFPDFLKWMPGSKDILLYDRYQEFADKLIMEADVICCLDFNALKRIDEMSDIVAASPGRK IMIDHHLYPEDFCRITISHPEISSTSELVFRLICRMGYFSDISKEGAECIYTGMMTDTGG FTYNSNNREIYFIISELLSKGIDKDDIYRKVYNTYSESRLRLMGYVLSNMKVYKDYNSAL ISLTKEEQGKFDYIKGDSEGFVNIPLSIKNVCFSCFLREDTEKKMIKISLRSVGKFPCNR LAAEFFNGGGHLNASGGEFYGTMEEAVKVFEQALEKYKPLLKE >gi|226332011|gb|ACIB01000045.1| GENE 38 46071 - 47978 774 635 aa, chain - ## HITS:1 COG:SMc02086 KEGG:ns NR:ns ## COG: SMc02086 COG0658 # Protein_GI_number: 15965263 # Func_class: R General function prediction only # Function: Predicted membrane metal-binding protein # Organism: Sinorhizobium 
meliloti # 96 460 219 608 801 104 28.0 5e-22 MFGCILYLFVFFGGAGGINQALQQTLYSFSEQKCVYRAVVLEQPEPKEHSFLCRAFLEER QDSVCTMPVNRKVLLYISKDSLSEGLRSGDELIFFAHVSPPSNNGNPDEFDYARYLRYKG ISGIAFVASGNWKITGYRFSRSCRQIALEYRDRILDQYRALKFNPDEFAVLAALTVGYKE ELSEDIRETYSVSGASHVLALSGLHIGFLYMMLLFFLKWLPGNAFGVRLFRAVVIITALW GFAFFTGLSPSVVRSVIMFSLLALSILSRRTGISLNTLALTACIMLVVHPFWLFDVGFQL SFSAVAAILLLYPWLFRQLPVGNSLLKKVWALMSVSLAAQIGTAPLVLLYFSRFPTHFLL TNLLVIPLVSGIMYATVALLVLTPFPMLYTGCSVVVRSLVDWLNTMVRWVEHLPLASIDR VWIYPTEAFAFYLVLLIGIRYKVVRSLKCLYVFGICILAMGSFHWVSRMMDRPVQSIVFY NVRGCPVVHCIEACGKSWLAYADSIPDERRLSRAVAGYWNRLHLDVPVAITDNFHSSGFW MQDHLLMFGNKRICMVSDNRWRNKTVAESLNIDYLYVCKGYTGKLESLVGLFHCREVILD SSLSAYYKEAYSEECRRLGLHFISLSDEGSVRFLL >gi|226332011|gb|ACIB01000045.1| GENE 39 48180 - 48830 648 216 aa, chain - ## HITS:1 COG:BH2502 KEGG:ns NR:ns ## COG: BH2502 COG0036 # Protein_GI_number: 15615065 # Func_class: G Carbohydrate transport and metabolism # Function: Pentose-5-phosphate-3-epimerase # Organism: Bacillus halodurans # 5 215 4 214 216 208 49.0 6e-54 MKPIISPSILSADFAYLAKDIEMINRSEADWVHIDIMDGVFVPNISFGFPVLKYVAKLTS KPLDVHLMIVNPEKFIPEVKALGAHIMNVHYEACPHLHRVVQQIREAGMQPAVTINPATP ITLLQDIIRDVYMVLVMSVNPGFGGQKFIEHSVEKVKELRELIERTGSKALIEVDGGVNL ETGARLIAAGADALVAGNAIFAAENPEGMIHAMKGL >gi|226332011|gb|ACIB01000045.1| GENE 40 48994 - 49968 926 324 aa, chain - ## HITS:1 COG:BH2508 KEGG:ns NR:ns ## COG: BH2508 COG0223 # Protein_GI_number: 15615071 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionyl-tRNA formyltransferase # Organism: Bacillus halodurans # 6 312 1 300 317 228 40.0 2e-59 MKKEDLRIVYMGTPDFAVEALQCLVEGGYNVVGVITMPDKPAGRGHKIQYSPVKQYALDH QLPLLQPEKLKDEEFIQALREWKADLQIVVAFRMLPEVVWNMPRLGTFNLHASLLPQYRG AAPINWAVINGDTETGITTFFLKHEIDTGEVIQQVRIPIADTDNVEIVHDKLMHLGGRLV IETVDAILEGKVKSIPQEEMAVAGELRPAPKIFKETCRIDWNQPVKRVYDFIRGLSPYPA AWSELVNPEGEAVVVKIFESEKLPKVHTLAPGSIVTDGKNFLRVAVPDGFVNVLSLQLPG KKRLKTDELLRGFYLTEAFKMKAV >gi|226332011|gb|ACIB01000045.1| GENE 41 50063 - 51856 1497 597 aa, chain - ## HITS:1 
COG:RSp0020 KEGG:ns NR:ns ## COG: RSp0020 COG0038 # Protein_GI_number: 17548241 # Func_class: P Inorganic ion transport and metabolism # Function: Chloride channel protein EriC # Organism: Ralstonia solanacearum # 22 449 28 447 461 145 27.0 3e-34 MMEKEKISLLQRFIIWRENKIKEKQFILILSFLVGIFTAIAALLLKFFIHTIQNFLTDNF NTTEANYLYLVYPVVGIFLAGWFVRNIVKDDISHGVTKILYAISRRQGRIKRHNIWSSTI ASAITIGFGGSVGAEAPIVLTGSAIGSNLGSMFKMEHRTLMLLVGCGAAGAIGGIFKAPI AGLVFTLEVLMIDLTMSSLLPLLISAVTAATVSYITTGQEAMFKFHLDQPFELERIPYVI LLGIFCGLVSLYFTRAMNSVEGVFGKLSNPYKKLALGGVMLSVLIFLFPPLYGEGYDTIE LLLNGVSNADWDTVLNNSLFYGYGNLLLVYLVLIILLKVFASSATNGGGGCGGIFAPSLY LGCIAGFVFSHFSNDFDFTSTLPEKNFALMGMAGVMSGVMHAPLTGVFLIAELTGGYDLF LPLMIVSVSSYLTIIVFEPHSIYSMRLAKKGQLLTHHKDKAVLTLMKVENVVETDFVSVR PEMDLGELVKAISTSHRNMFPVTDKDGVLLGVVLLDDIRNIMFRQELYHRFTVSKLMTSV PARLYDTDSMEQVMQTFDDTKAWNLPVVNEEGKYLGFVSKSKIFNSYRQVLVHFSED >gi|226332011|gb|ACIB01000045.1| GENE 42 51853 - 52416 575 187 aa, chain - ## HITS:1 COG:MK0635 KEGG:ns NR:ns ## COG: MK0635 COG0009 # Protein_GI_number: 20094073 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation factor (SUA5) # Organism: Methanopyrus kandleri AV19 # 5 187 13 193 212 98 32.0 8e-21 MIEDIKKACQVMSEGGVILYPTDTVWGIGCDATNEDAVRRVYEIKRRADSKAMLVLVDSP VKVEFYVQDVPSVAWDLIEVADKSLTIIYSGARNLASNLLAEDGSVGIRVTNEAFSRRLC QQFRKAIVSTSANVSGQPGAANFNEISEEIKSSVDYIVNFRQDDMSRPKPSSIIKLDKGG VIKIIRE >gi|226332011|gb|ACIB01000045.1| GENE 43 52496 - 52930 330 144 aa, chain + ## HITS:1 COG:TVN0706 KEGG:ns NR:ns ## COG: TVN0706 COG0824 # Protein_GI_number: 13541537 # Func_class: R General function prediction only # Function: Predicted thioesterase # Organism: Thermoplasma volcanium # 12 102 11 101 133 64 31.0 5e-11 MEEIEFHHSLPIQLRFNDVDKFGHVNNTVYFSFYDLGKTEYFASVCPGVDWEKDGIVVVH IEADFLAQIFSSDHIAVQTAVCEIGTKSFHLLQRVIDTETMEVKCICRSVMVTFDLERHE SKPLTEEWIKAICRFEGRDLRKKK >gi|226332011|gb|ACIB01000045.1| GENE 44 52980 - 55052 1308 690 aa, chain - ## HITS:1 COG:no KEGG:BF3907 NR:ns ## KEGG: BF3907 # Name: 
not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 690 1 690 690 1388 99.0 0 MKRNVSLLKYALLIALCCVACVNEKDLYEPSGEDPGETEELDLSFKFALRSDKQIHISVT RADGKAAEGIGVGVYLQQPYEEDGIISGKPLYMGYTDGNGQIDATISVPANSDKLYVASL TAGYPGVQEMDVQPSMTCNLTATVFQIKTATTRMVATRSETGLDVPVDQKLSNLYELYSP YTDSEIGKDGIPLLNASPLVTKEELSAKFLNLMNSWYPEQKNVQDVDLKKSSDLVVTDEL GAEVWATYVGDGGFYVNNATVYNVLAYYSYQEGELGRREDIQGHRMTLLLPNTHQQKCPS GLKVQLLYWDGKQYSKVFPKGARIGFAVARDGLNIANVNAANGGVNSKSSYKFKNQTFPN GDVNGFYYSTPSLNATKRTNAVIRNVPDYNCCIMGFDIRPYDDPKTDYDFNDVMIKLTAS PVSAIKPEEDIPVIDEFTPSEAVYGTLAFEDQWPKMGDYDFNDFVMNYSYELEKGDNNMI TALKLTFTPIAKGAASWTHIGVGIELPLSADNIDKAKSEGATLEEGNDRATFIVWNDVNT AFGTTEGYVNTEGAVVGVSAIPVEVTVRLKTPVSSLLTQKFNPFIFVNSRQREIHLVDYK PTKHADTSLFGTENDRSDPGAEVYYRMDNRYPWALDFPRKEDSSPAWNYPKERVIITKAY PNYEKWVLDQSNLSWFDASVSGNVNKEFLY >gi|226332011|gb|ACIB01000045.1| GENE 45 55771 - 56250 501 159 aa, chain - ## HITS:1 COG:no KEGG:BF3909 NR:ns ## KEGG: BF3909 # Name: not_defined # Def: putative non-specific DNA binding protein # Organism: B.fragilis # Pathway: not_defined # 1 159 1 159 159 274 99.0 8e-73 MALFYKAVKSTMATKSGDKKWHLNLVKVGKVVSTQQLAEMIAEKSSLTPGDVHNVVRNLM TAMRSALLDSKTVRLDGLGTFTMKARTRGRGVDKEEEVNPNQVTALLCHFTPEYTRPAAI GTTRALFQGVEFQKASGIGASGNNGSGGGDGDIVDDPTA >gi|226332011|gb|ACIB01000045.1| GENE 46 56571 - 57155 504 194 aa, chain - ## HITS:1 COG:no KEGG:BF3910 NR:ns ## KEGG: BF3910 # Name: not_defined # Def: putative phage-related protein # Organism: B.fragilis # Pathway: not_defined # 1 194 1 194 194 384 99.0 1e-105 MENLTENDFQRVADWLGVEVAVVKAVQTVETGGRGGFVAPGRPIILFEGHIFWRELKKRG LDPERYVVGNENILYPSWRREHYYGGIREYERLEKAREIHKEAADASTSWGMFQVMGFNY VMCGYGSVDEMVKDMCSGEDKQLEAFARFIKLAELRPNLERKDWTGFAKRYNGPGYAQNQ YDKKLEEAYRRFTK Prediction of potential genes in microbial genomes Time: Tue May 17 23:51:55 2011 Seq name: gi|226332010|gb|ACIB01000046.1| Bacteroides sp. 
3_2_5 cont1.46, whole genome shotgun sequence
Length of sequence - 13488 bp
Number of predicted genes - 13, with homology - 13
Number of transcription units - 3, operones - 3 average op.length - 4.3
N Tu/Op Conserved S Start End Score pairs(N/Pv)
1 1 Op 1 4/0.000 - CDS 1 - 235 194 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis
- Prom 271 - 330 5.0
- Term 273 - 307 -0.8
2 1 Op 2 3/0.000 - CDS 347 - 1498 418 ## COG0438 Glycosyltransferase
3 1 Op 3 10/0.000 - CDS 1534 - 2679 693 ## COG0381 UDP-N-acetylglucosamine 2-epimerase
4 1 Op 4 . - CDS 2717 - 3922 767 ## COG0677 UDP-N-acetyl-D-mannosaminuronate dehydrogenase
5 1 Op 5 . - CDS 3960 - 4748 191 ## MmarC7_0334 hypothetical protein
6 1 Op 6 . - CDS 4767 - 5429 308 ## gi|253566335|ref|ZP_04843789.1| conserved hypothetical protein
- Prom 5464 - 5523 8.3
7 2 Op 1 . - CDS 5835 - 7085 535 ## Hhal_0781 hypothetical protein
8 2 Op 2 . - CDS 7057 - 8322 247 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid
9 2 Op 3 . - CDS 8328 - 9422 370 ## M301_0757 glycosyl transferase group 1
10 2 Op 4 1/0.000 - CDS 9419 - 10684 870 ## COG0677 UDP-N-acetyl-D-mannosaminuronate dehydrogenase
11 2 Op 5 . - CDS 10714 - 11682 788 ## COG0451 Nucleoside-diphosphate-sugar epimerases
12 3 Op 1 . - CDS 12181 - 12666 500 ## BF3924 hypothetical protein
13 3 Op 2 .
- CDS 12725 - 13156 420 ## BF3699 putative transcriptional regulator - Prom 13389 - 13448 3.5 Predicted protein(s) >gi|226332010|gb|ACIB01000046.1| GENE 1 1 - 235 194 78 aa, chain - ## HITS:1 COG:SP1837 KEGG:ns NR:ns ## COG: SP1837 COG0399 # Protein_GI_number: 15901666 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 3 78 6 81 408 88 56.0 2e-18 MKIPFSPPYIDEAVINEVVDSLRSGWITSGPKVKALEEEIKSFSGAKEVLCVNSWTSGAI MMLRWLGVKEGDEVIVPA >gi|226332010|gb|ACIB01000046.1| GENE 2 347 - 1498 418 383 aa, chain - ## HITS:1 COG:SP0351 KEGG:ns NR:ns ## COG: SP0351 COG0438 # Protein_GI_number: 15900280 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Streptococcus pneumoniae TIGR4 # 7 369 16 393 409 127 29.0 3e-29 MGAPQNRLYEMAVGLKKFGADITVITGMPNYPTGKIFKEYSGKWFCKENLDGIDIFRFWL FASNVKRVLPRVLSMLSFSFSVLFSLKYVRKKRFDFIIVESPPLTLGLSGYFLSKVCKSK MIMNISDLWPLSARELGVLTDGVIYCMLEKLEYFLYKKSVACMGQSQEIVSYISQHGASR TYLFRNGVTPERFQNIPNKKRTNGNLIIVYAGLLGVAQGILEICQKIDFKSLGTEFHIYG AGGEQHLIEEFLLTNSERGISFHGRVSRDEIPSLLKQADVTLIPLVKNIFGAVPSKIYES MAAGVPILFAGEGEGQRIIEENCLGWVSRSRDYEKLIENIKLIRSNDIDMLQKRENCKNA AENLFNRPKQIKALFQYLSQLNT >gi|226332010|gb|ACIB01000046.1| GENE 3 1534 - 2679 693 381 aa, chain - ## HITS:1 COG:RSp1017 KEGG:ns NR:ns ## COG: RSp1017 COG0381 # Protein_GI_number: 17549238 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine 2-epimerase # Organism: Ralstonia solanacearum # 1 381 5 375 379 449 58.0 1e-126 MKKVMLVFGTRPEAIKMAPLVKEFQKDSIAFQTIVCVTGQHREMLDQVLHLFDITPDYNL NIMRSGQDLYDVTSRVLIGMRKVLKETLPDIVLVHGDTTTSTAAALAAFYQQIPVGHVEA GLRTFNIYSPWPEEMNRQITGRIATFHFSPTQLSRKNLLREGIADEKIIVTGNTVIDTLH VVVDRIKKDKLLNEELSEVLLLAGYDVNRLHKGKRLVLITGHRRENFGNGFVSICKAIKT LTEKYPDVDFIYPMHLNPNVRKPIYEVFGENQLPNIFFIEPLEYLSFVYLMEKSTIVLTD 
SGGIQEEAPGLGKPVLVMRDTTERPEALEAGTVKLVGKDYNKIVSEVSVLLDNQIYYDKM SKAVNPYGDGQASGRIVDVLR >gi|226332010|gb|ACIB01000046.1| GENE 4 2717 - 3922 767 401 aa, chain - ## HITS:1 COG:ECs4720 KEGG:ns NR:ns ## COG: ECs4720 COG0677 # Protein_GI_number: 15833974 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetyl-D-mannosaminuronate dehydrogenase # Organism: Escherichia coli O157:H7 # 6 400 9 420 420 404 50.0 1e-112 MKACFMGLGYIGLPTAIIAARHGIQVIGVDINPVVVDMTNQGQLHIIEPGMQEILQEVVG NGLLKASTAPEVSDAYFIVVPTPFKGNHEPDVSFVEAATRAVIPLLKEGDLYVIESTSPV GTTNMMAHLIFALRPELKDKIYIAYCPERVLPGNVIYELVHNDRVVGGIDEVSTDKAIGF YSKFVQGTLHRTHCKTAEMCKLTENSSRDVQIAFANELSLICDKAGINVWELISLANKHP RVNILQPGCGVGGHCIAVDPYFITADFPIESQLIAKAREINNYKAFWCAEKVENAMLRFE LEHHRRPTIAMMGLAFKPDIDDLREAPAKYITTKVMQSCNNADLLIVEPNVAEHKVFKLT DYKKAYEKADIVVFLVAHSAFKSLPYDNKKVILDFCGIYKK >gi|226332010|gb|ACIB01000046.1| GENE 5 3960 - 4748 191 262 aa, chain - ## HITS:1 COG:no KEGG:MmarC7_0334 NR:ns ## KEGG: MmarC7_0334 # Name: not_defined # Def: hypothetical protein # Organism: M.maripaludis_C7 # Pathway: not_defined # 5 259 8 259 265 120 35.0 4e-26 MNIDILICTFNEGINRVKEVLLDYREDIHYVVSHQITNSDYNYIPLELKREDVTVFHIES KGLSLNRNACFTRAKGDICFIADDDVKYTYQYIDTVKDIFLKDSSLDVCIGKIKTDTNVD YKSYGSYSRAIKKWNVTKISSIEIVVKRSSVLRFNIIFDIRFGLGSSLFAYGGEEAVFIM DCLNVGMNVRYYPVYISQHPFESSGKLVRHDNEFIQYHAALMKRLYGRFSLLLMGIVYLK NYWKYISPFLFAKNFLIGYNKI >gi|226332010|gb|ACIB01000046.1| GENE 6 4767 - 5429 308 220 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566335|ref|ZP_04843789.1| ## NR: gi|253566335|ref|ZP_04843789.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 220 136 355 355 289 100.0 1e-76 MNGIRQYFAMYFFTLSVLKMLNRNYIVMSLLMLASVLIHKSALLLSLFLLLFFFLKNIRT PKILYLIIIVGVIISAAPLYNSFIGFIMEYSFYDGYIENGYIQEVGINDKLIKYAFVPIF MYSIGKVKFLEIPCFHRYLFVIGIISFSVRLMALNLSLTNRFGMYFEFLMCIPLVYLIIT LKRQSTFKFLSLLLFIFMVYALKVTLFAKGEYLYNSYFFN >gi|226332010|gb|ACIB01000046.1| GENE 7 5835 - 7085 535 416 aa, chain - ## HITS:1 COG:no KEGG:Hhal_0781 NR:ns ## KEGG: Hhal_0781 # Name: not_defined # Def: hypothetical protein # Organism: H.halophila # Pathway: not_defined # 55 411 59 411 428 228 33.0 5e-58 MVIVKWLKSEIKKYPCLMHFINLILSFIKYLPHISFKYVDSKYELVRYEDGKNETFFGYY DKSPKNETGEYILYYSTSHPTKKKPDPNRPINLIVFDCINNVIVKRFLIYAYNWQQGSKV QWIEKYRFIFNNYDFVKKRYYSELYDVQTNISTLVDYPVYDVCNNKALSLNFNRLAILRP DYGYFNVAMSKEELEDISDDGVYIIDLERNSCTLLLSIERLLQTFPIRYDIEEVTSHKVN HIMFSPDGERFIFLHRFFINGKKIDRFFLYEFESQKITLLSDNDMVSHYSWIDNKRIIVY MRRFGIGDLYYVIDLYPVKITPVQNTELQSFGDGHPTVKNGLLVTDSYPDSSCTQYLLLY NFLTNKNLCLGTFHHSLLYSREMRCDLHPKWTFNGEEIFFDSIYNRGRHLYSLKLS >gi|226332010|gb|ACIB01000046.1| GENE 8 7057 - 8322 247 421 aa, chain - ## HITS:1 COG:PH0421 KEGG:ns NR:ns ## COG: PH0421 COG2244 # Protein_GI_number: 14590338 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Pyrococcus horikoshii # 5 418 6 423 432 94 23.0 5e-19 MIKNKVFSNIVYLVSSSLITKIVLFLFYIYVARILGPNGYGYISSSTEYLGLFLIFATFG LQMAIVREASRNSDNQIGLFNKILPARFVFSIISFVFCITICYFLYWDTVNFHLIMILAI MVLLQPLEDHCYSFFWVKQELKFVALGELVKIVVYIGSFILLNVLFGLQLTNLVVSTLLG FLCSILFKMKWLKRRYGYRYQFIIDIPYIRKIFYISFYFGIVSIIYIYSLKIDIQMLNVI CGSTEVGYYSVAWQMVQIGIVFIQSLSTSLFPNSVQKIHLRSFRMKLLKYITYLTIFVIL CALLVTFLSDSIIELLYGRAYSNSVLLLNLLVWYLPMRLFAVWGSQILESGAWYKKRVLI YLTPLVINIVLNYIYLPLYGAQAAAVIALISNFILVSLITIFAFVYSKKYYSNGDCKMVE V >gi|226332010|gb|ACIB01000046.1| GENE 9 8328 - 9422 370 364 aa, chain - ## HITS:1 COG:no KEGG:M301_0757 NR:ns ## KEGG: M301_0757 # Name: not_defined # Def: glycosyl transferase group 1 # Organism: Methylotenera_301 # Pathway: not_defined # 186 363 200 376 386 68 
28.0 4e-10 MNRGVRPKVCFVIPDQFGYAAGYYQYAKYLSEYGFGVTVLSVDKGLPRIEEIPDVTIKYL AINEKSVFSNTISYIRKSNRYLNTLDINTIVILKYIPFISVLLLSLRRKKRVFLDIRTGA VNKSRVKSILLNALLTFESYFFSRIYILSLELARILHLNMKKVVYLPLGADVYSVQSKSY ENDFNLLYVGTFNFRRIYDTIYGLKMFCDKYASRLRITYTIIGYGDEDEIRKIKNAINTC HLNDYVDYVGRKTYTEFSPYFDKANIGIAYVPMVKYYNNQPPTKVFEYALSGLICLATNT DANKQLICSKNGVLCNDSPDAFFLALEKLYKERSRFVFTDITQSMEQYSWKNIVREVIIP SLKG >gi|226332010|gb|ACIB01000046.1| GENE 10 9419 - 10684 870 421 aa, chain - ## HITS:1 COG:PM1003 KEGG:ns NR:ns ## COG: PM1003 COG0677 # Protein_GI_number: 15602868 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetyl-D-mannosaminuronate dehydrogenase # Organism: Pasteurella multocida # 1 421 3 424 424 506 58.0 1e-143 MDTIKIAVIGLGYVGLPLARLFSTKYKTIGFDMNQARVTALMDGHDTTLEVTDELLQSAI KNGFVCTFNIEDIRDCNFYVVAVPTPVDENNNPDLIPLYKASESVGKVITEGDIVVYEST VYPGVTEDECIPVVEKVSGLKYNVDFFAGYSPERINPGDKLHTVEKIKKVTSGSTPEIAR IVDDVYASVITAGTHSAPTIRVAEAAKVIENSQRDINIAFVNELSKIFTRMGIDTQDVLE AASTKWNFLPFKPGLVGGHCIGVDPYYLAQCAQRYGYNPEIILAGRRVNDGMGEYVANQV VKLMLKKGIQVLNSNILILGFTFKENCPDVRNTKVIDIYRTLKEYGVNIFVYDPWANPTI VEKEYGIKITNELLSSKFDAVILAVAHEDFKDLDINLFLNSSCVIYDVKGVLDPKIVASR L >gi|226332010|gb|ACIB01000046.1| GENE 11 10714 - 11682 788 322 aa, chain - ## HITS:1 COG:VNG0065G KEGG:ns NR:ns ## COG: VNG0065G COG0451 # Protein_GI_number: 15789399 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Halobacterium sp. 
NRC-1 # 4 318 5 308 309 244 42.0 2e-64 MKVLVTGGAGFIGSNLCEYLLKEGHQVRCLDNFITGKIENLLSLIKKYPDSFQLIIGDIR KLEDCQKAVEGMEYVLHEAALGSVPRSIKDPITTNEINIGGFLNMIIAARDAKVKRFVFA ASSSTYGDSQSLPKVESIIGNPLSPYAITKYVDELYADIFARTYNFEYIGLRYFNVFGRR QDPFGAYAAVIPLFVKKFMKYESPVVNGDGEYSRDFTYIDNVLQMNMLALTTTNPEAVNQ IYNTAFGERTTLNQLVNYLKIYLSEFDPEISHVKIVHGPNRQGDIPHSLASIDKAKTLLN YDPQYCMKDGLKEAVKWYWENI >gi|226332010|gb|ACIB01000046.1| GENE 12 12181 - 12666 500 161 aa, chain - ## HITS:1 COG:no KEGG:BF3924 NR:ns ## KEGG: BF3924 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 160 1 160 161 296 97.0 1e-79 MLSLQSEIDSLCAVSHELLHLGLDGEPIYSDRFRQLNTDVYHRCEHLFGSHGRTLEEEAS LCIALLTGYNATIYNHGDKEDKIQSVLNRSWDLLDTLPVSLLKCRLLVACYAEVFDEELA AEAHAIIDGWKDRELTREEFEIVEHLKSLEENPYPNSEIED >gi|226332010|gb|ACIB01000046.1| GENE 13 12725 - 13156 420 143 aa, chain - ## HITS:1 COG:no KEGG:BF3699 NR:ns ## KEGG: BF3699 # Name: updY # Def: putative transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 143 37 179 179 259 99.0 1e-68 MRYEYVFRGERKIRKLVPVVHNLVFVYATRSEVDEMKSTVGASLPIRYIMDRETRQPITI PEVQMRSFIAVAGNYDEQVVYLDPSVVSMKRGDRVRVTGGIFEGVEGEFVRIKGDRRVVV SIQGGMAVATAFIHPSLIELIKN Prediction of potential genes in microbial genomes Time: Tue May 17 23:52:28 2011 Seq name: gi|226332009|gb|ACIB01000047.1| Bacteroides sp. 3_2_5 cont1.47, whole genome shotgun sequence Length of sequence - 18674 bp Number of predicted genes - 15, with homology - 15 Number of transcription units - 7, operones - 4 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 653 - 1000 351 ## BF3701 hypothetical protein + Term 1041 - 1080 -0.8 + Prom 1037 - 1096 4.6 2 2 Op 1 . + CDS 1139 - 1480 387 ## BF3702 hypothetical protein 3 2 Op 2 . + CDS 1502 - 1972 338 ## BF3702 hypothetical protein 4 3 Tu 1 . + CDS 2079 - 2237 100 ## BF3929 hypothetical protein 5 4 Tu 1 . 
- CDS 2299 - 2871 411 ## PROTEIN SUPPORTED gi|157164512|ref|YP_001467500.1| 50S ribosomal protein L24 (BL23; 12 kDa DNA-binding protein; HPB12) - Prom 3053 - 3112 4.5 + Prom 2843 - 2902 5.0 6 5 Op 1 . + CDS 3070 - 4788 1570 ## COG0608 Single-stranded DNA-specific exonuclease 7 5 Op 2 1/0.500 + CDS 4785 - 6689 1621 ## COG0514 Superfamily II DNA helicase 8 5 Op 3 . + CDS 6752 - 7714 1151 ## COG0457 FOG: TPR repeat + Term 7733 - 7788 11.1 9 6 Op 1 . - CDS 7782 - 9017 1007 ## COG0477 Permeases of the major facilitator superfamily 10 6 Op 2 . - CDS 9095 - 10681 890 ## COG4409 Neuraminidase (sialidase) 11 6 Op 3 . - CDS 10705 - 11805 801 ## BF3710 hypothetical protein - Prom 11830 - 11889 3.9 - Term 11832 - 11883 5.3 12 7 Op 1 . - CDS 11898 - 13358 1113 ## BF3938 hypothetical protein 13 7 Op 2 . - CDS 13372 - 16668 2497 ## BF3712 hypothetical protein 14 7 Op 3 3/0.000 - CDS 16759 - 17667 758 ## COG0329 Dihydrodipicolinate synthase/N-acetylneuraminate lyase - Prom 17736 - 17795 2.0 15 7 Op 4 . 
- CDS 17873 - 18595 766 ## COG2186 Transcriptional regulators Predicted protein(s) >gi|226332009|gb|ACIB01000047.1| GENE 1 653 - 1000 351 115 aa, chain + ## HITS:1 COG:no KEGG:BF3701 NR:ns ## KEGG: BF3701 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 115 1 115 115 202 98.0 3e-51 MKQKKRPASQTEAMKLRWKKRIVFEKGYTEQCAEWMAERLEALTDHLQYGHAAIAYQKQN GDFRLVKATLIYYETEFHKKYDPTQIEGAVVYWNVDEQRWTTFQMENFMEWRPIV >gi|226332009|gb|ACIB01000047.1| GENE 2 1139 - 1480 387 113 aa, chain + ## HITS:1 COG:no KEGG:BF3702 NR:ns ## KEGG: BF3702 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 113 1 113 277 229 100.0 3e-59 MAIAYDGINYFPVGVNFMEENAMEVIEAKYGIKGSAIVLKLLCKIYKEGYFIRWDEEQCL IFANKAGREVQAAEVQGIIEILFIKGILDRNSYLANGILTSANIQKIWMEATK >gi|226332009|gb|ACIB01000047.1| GENE 3 1502 - 1972 338 156 aa, chain + ## HITS:1 COG:no KEGG:BF3702 NR:ns ## KEGG: BF3702 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 156 122 277 277 286 97.0 2e-76 MPYLLVNDLTQQETEAPEGENVTISPGNVVHDVTVNAKNACNSGQSKVKEKKAEENKELP PSAPPKGKEKEWEEVSAPLPIPGYAFNTMTHNYPGLTDTLKRLGITEVGEVNAILRLSDY GRKGTRVWQLIANTCWSDIGVKGRYLIAALNKAKRK >gi|226332009|gb|ACIB01000047.1| GENE 4 2079 - 2237 100 52 aa, chain + ## HITS:1 COG:no KEGG:BF3929 NR:ns ## KEGG: BF3929 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 52 1 52 52 82 98.0 3e-15 MIDSHFRAQYFNLQKLYFNVTLKFVFRLQIANSLNAKDLLFADKNKAIGKWH >gi|226332009|gb|ACIB01000047.1| GENE 5 2299 - 2871 411 190 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164512|ref|YP_001467500.1| 50S ribosomal protein L24 (BL23; 12 kDa DNA-binding protein; HPB12) [Campylobacter concisus 13826] # 8 188 3 182 185 162 44 1e-39 MQDIINGRCGWCGSDELYVKYHDQEWGKLVTDDKTLFEFLVLESAQAGLSWITILKKREG YRKAFCNFDAESVAQMTDEDVERLMHFDGIVKNRLKIKSTITNARSFLAVQKEFGSFYDY TLSFFPDRKPIVNTFQSLSEIPVSSPESDAMSKDMKKRGFKFFGTTICYAHLQASGFMND 
HLVDCICRKR >gi|226332009|gb|ACIB01000047.1| GENE 6 3070 - 4788 1570 572 aa, chain + ## HITS:1 COG:BH1240_1 KEGG:ns NR:ns ## COG: BH1240_1 COG0608 # Protein_GI_number: 15613803 # Func_class: L Replication, recombination and repair # Function: Single-stranded DNA-specific exonuclease # Organism: Bacillus halodurans # 4 530 7 523 562 356 37.0 6e-98 MNHKWNYRPITQEQAEISRALAQELGISPVLGRLLVQRGITKAQDAKKFFRPQLPDLHDP FLMKDMDIAVERLNMAMGKKERILIYGDYDVDGTTAVALVYKFIQQFYSNLDYYIPDRYN EGYGISKKGVDYAAETGVGLIIVLDCGIKAVEEIAYAKEKGIDFIICDHHVPDDVLPPAV AILNAKRLDNTYPYTHLSGCGVGFKFMQAFAISNGIEFHHLIPLLDLTAVSIASDIVPIM GENRILAYHGLKQLNGNPSVGLKAIIDVCGLSEKEITVSDIVFKIGPRINASGRIQNGKE AVDLLIEKDFSAALEKAGQINQYNETRKDLDKSMTEEANKIVAELEGLADRRSIVLYNED WHKGVIGIVASRLTEIYYRPAVVLTRTDDMATGSARSVSGFDVYKAIEHCRDLLENFGGH TYAAGLSMKVENVQAFTERFESFVSEHILPEQTSAVIDIDAEIDFKDITPKFFNELKRFN PFGPDNQKPVFCTHHVYDYGTSKVVGRDQEHIKLELVDNKSNNVMNGIAFGQSSHVRYIK TKRSFDICYTIEENTHKRGEVQLQIEDIKPIE >gi|226332009|gb|ACIB01000047.1| GENE 7 4785 - 6689 1621 634 aa, chain + ## HITS:1 COG:PA3344 KEGG:ns NR:ns ## COG: PA3344 COG0514 # Protein_GI_number: 15598540 # Func_class: L Replication, recombination and repair # Function: Superfamily II DNA helicase # Organism: Pseudomonas aeruginosa # 6 357 8 359 712 298 45.0 3e-80 MTYQEILKQYWGYDSFRDLQEDIITSIGNGKDTLGLMPTGGGKSITFQVPALAKEGLCIV ITPLIALMKDQVQNLKKRGIKAIAIYSGMTRQEIVVALENCIFGDYKFLYISPERLDTEI FRAKLRSMKISMITVDESHCISQWGYDFRPAYLKIADIRDLVPDAPVLALTATATPEVVK DIQERLRFREENVFRMSFERKNLAYIVRPTDNKNGELLHILNRIQGSAIVYVRSRRKTKE TTELLVNEGITADFYHAGLDNATKDLRQKRWQNGESRVMVATNAFGMGIDKPDVRIVIHL DLPDSPEAYFQEAGRAGRDGQKAYAVILYAKSDKTTLSKRIADTFPDKDYIKDVYEHLQY HYQMAMGDGLGCMYDFSLEEFCRKFKYFPVPADSALKILTQAGYLEYTDEQDNASRIIFT IRRDELYKLREMGEAAEKLIQMILRSYTGVFTDYAYISEQTLAVRTGLTRQQIYDLLVML SKRRIVDYIPHKKTPYIIYTRERIDLHYLQIPRAVYEERKERYETRIHAMVEYVTSENVC RSRMLLRYFGEKNEHNCGQCDVCLSHRAEPDISQSTFDGLREQICALLKEHPMTPAEIAS HINTDKEQLSEVIRFMLDEGLLSSENGLLTEKTS >gi|226332009|gb|ACIB01000047.1| GENE 8 6752 - 7714 1151 320 aa, chain + ## HITS:1 COG:alr1677 
KEGG:ns NR:ns ## COG: alr1677 COG0457 # Protein_GI_number: 17229169 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Nostoc sp. PCC 7120 # 133 312 63 242 280 81 29.0 2e-15 MPNFFKSFFAGKTENPEEEKQKNAKKNFEIFKYDGLRAQRMGRPDYAIKCFNEALAIEED FETLNYLSQLYIQTGEFGKAHELLERMIALEPELTSTYLTLANLCFMQEDYQEMADAAQK AIALEEGNAMAHYLLGKANHGLDNGIMTIAHLTKAIVLKDDFTEARLLRAEALYKMQQFA EAMEDIEAILAQNPDEEAALLLRGKIKEATGKEEEAETDYLHVTEINPFNEQAYLYLGQL FITQKKLTAAIELFDEAIELNPNFGAAYHERGRAKLLNGDKDGSIEDMKKSLELNPKEGE NLNGQFNNQQAETPPNVLGL >gi|226332009|gb|ACIB01000047.1| GENE 9 7782 - 9017 1007 411 aa, chain - ## HITS:1 COG:CC2486 KEGG:ns NR:ns ## COG: CC2486 COG0477 # Protein_GI_number: 16126725 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Caulobacter vibrioides # 7 369 40 421 519 100 25.0 4e-21 MKNRKIYPWIVVGLLWFVALLNYMDRQMLSTMKDAMQIDIIELQSATNFGRLMAVFLWIY GLMSPMAGIIADRVSRKWLIVGSLFVWSAVTYGMGYADTFNQIYWLRALMGVSEALYIPA GLSLIADWHQEKSRSLAVGIHMTGLYAGQAIGGFGATVAAAYSWHTTFHWFGIVGIVYAL VLIIFLRENEEHARVIRAMHTDKSKKIPLFKGVTLLFGNIAFWIILFYFAAPSLPGWATK NWLPTLYAENLDIPMAEAGPISTITIAVSSFIGVILGGLLSDRWVCKDIRGRIYTGAIGL GLTIPALLLLGLGNGFISIVGAGFLFGVGFGMFDANNMPILCQFVSAKYRATAYGIMNMT GVFAGAVVTSLFGKWTDGGNLGLGFAILGGIVLLALGMQLCFLRPHTDNME >gi|226332009|gb|ACIB01000047.1| GENE 10 9095 - 10681 890 528 aa, chain - ## HITS:1 COG:Cgl1519 KEGG:ns NR:ns ## COG: Cgl1519 COG4409 # Protein_GI_number: 19552769 # Func_class: G Carbohydrate transport and metabolism # Function: Neuraminidase (sialidase) # Organism: Corynebacterium glutamicum # 171 524 73 399 399 89 27.0 2e-17 MRYISILFLLSCFLLSTPLRAERVKVIVRQPIVPVLTKKEINPVLQLKLIKSCPGPCFVK EIGLSLKGTTLLTDLTHLSLYRVAGKRGLSDWEKCVDSVAPALKTVLDTPLELKSDTTIL WVTVKLKDKVDLTHRVTVSCDHVTTTCGKASVTSVRPIVALRTGVAVRQRGEDGVHTSRI PGITTSLKGTLMAIFDARYDSSRDLQGDIDIAMMRSMDGGMSWQPMQIVLDRKKWGGLPE 
KYNGISDACILTDEKNGTIYVAGLWMYGVLDPRSGKWVEGMTQDSTRWIHQWHAKGSQPG LGVKETCQFLITKSVDDGVTWSDPVNITAQTKKPEWWLYAPAPGHGITLKDGTLIFPTQG RDKDGIPFSNITYSKDGGKTWIASKPAYHNTTECMAVELQDGSVMLNMRDNRNHGNKKVN GRRICVTSDLGSTWTEHSTSRKALIEPTCMASIHRHTYQENGRQKTLLLFCNPESYDSRD HMTLKCSLDDGNTWDSGRKIMLDELGSFGYSCITSVNDSTIGVFYESS >gi|226332009|gb|ACIB01000047.1| GENE 11 10705 - 11805 801 366 aa, chain - ## HITS:1 COG:no KEGG:BF3710 NR:ns ## KEGG: BF3710 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 366 13 378 378 770 99.0 0 MKNTILKLFLLFCAVPSIGQNVHPNLPWIDISGQRERQVTIAPGRPDLYNGHPTTVMMDD HKTILCTWSYGHGGKASFIAESKDAGLTWKNGKTPADWQTMSNCPSIYKLTDKQGKERVF VFSAWPDMPMTYSEDGGKSWSPVRSLNKPCVMAFSSIVKLKNGDYLGLYHRGLNDRDRPP LTLWQSVSHDGGLTWSESVKVGEMEGRSPCEPCVFRAPDGKRLVCVARENNRVGNSLMMF SDDEGTTWSPLQETPWGLTGDRHVIKFTPDGRMIAVFRDMAPNSPTKGHFVAWVGNYKDL LEGTSGQYKIKLLHSYAGSDCGYPGLEILPDGSIVAITYVKMRPGPEQHSIVGVRFKLEE TDKMLY >gi|226332009|gb|ACIB01000047.1| GENE 12 11898 - 13358 1113 486 aa, chain - ## HITS:1 COG:no KEGG:BF3938 NR:ns ## KEGG: BF3938 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 486 1 486 486 988 100.0 0 MKATKILICILCCTWICSCSLLEVDPVSTITSQSFWKTPGDAKAYLTGIYNKVRDLNNTS YYGEDRGDAFKAGEIGPTSVAWAHTLLESNAPSYRSAYNIIHHANLLFNKIESLKFTNET EKNRIKAECHFLRAYTYFLIVRIWGDAPIITDPVLSDNVELKPRSPKEDVMKLILEDIEQ SVLLFPEDGYINKNLASKPAAYALKADVLMWKAKVLNGGNADLEEAIKAIDQVGGSGVSL LPDYAKVFANDNKKNNEIIFSFYFERYETGNLSIATNTTSRTDNLSMAVNLADAATSPNQ SRHVYAPSDKARELYLKYPGDRRYKVAMIDLVDKDGNLILTQTNKFRGKAYSDDRYFDDD LIAYRWGDLLLLRAEANAALNKISESLVDLNEVRDRAGLEPYDGPKDKIAVEKEICDERL RELFIEQKRWFDLVRFHCGGTIDIYKEVPNLNDKPGYPLYFPINYNDMVLNDKLVQTDGY ESNVER >gi|226332009|gb|ACIB01000047.1| GENE 13 13372 - 16668 2497 1098 aa, chain - ## HITS:1 COG:no KEGG:BF3712 NR:ns ## KEGG: BF3712 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1098 1 1098 1098 2141 99.0 0 MRKITRIKCLIVVLLTFIGTLAPVHAQQSGNFEISGVVKDTSGEPVIGATVIVKNTQIGT 
TTDVDGKFKLKVPHKSLLQISFIGMKTKDLKVTNNTFYEITLEDESVLLDEVVAIGYGTQ SKATVTSGVVSVKKAELMSSVSASPLNNLQGKVAGLDIRQTTGQPGAQPVVLIRGGSTDP ANDSPLFVIDGVVRSNMNGLNQEDIESMEVLKDAASAAIYGAKAANGIILITTKQGSSKD GKATISASYRLGVEQIRQHYPFSGARDYLYASRLAGSRGINDANVSGRLEGGAYPYSTGN INYKNGALEGYGYSRFTTEYLDDLISNMGQSYVDDLLNRQGYETMTDPYSGKQLIFKDNH YQKDVLFQTALTHNYDLSASGGNDKGNYYVSLGYINSEGIVLGTGYDRFSLTANGNYNLR SNLKLTVGLKHSTISNKATDPENSGTSTMDRSSRYPTTFRLYYDDGTPGIGEAGGSPRNR LHELYYQDISDKAYRNTIQLGLDWEILKGLHFKPSASYYMQENIYRFFEKYNEFNKGRKT LEKHDQYKQIMADAVFTYDKVFSDKHTLNAMIGMNYTQDDTYKLKGTGSEAPTDYVPTLA PTKPDLQRTTSSLDKEVLVGFFARVNYDYKRRYLLTVSARYDGASQFAEDHKFALFPAVS GGWNMHYEDWFPKTVVSRMKLRASWGQTGNNKLSYSNTQGEYASYIYAGNPGVLNSVLAN NSLVWETTSNVDYGFDAGFFDNRLELSVTGYNKLTSDRLYDKALPAQTGFSSIKANLGTV QNKGFELSLTAHPLSTTSPVNWDVTGTFSMNRTYMKKLPYNGRDKNRVQGGLVWDAKSGT YVETGGLAEGERIGGRWGYKYLGVYDKDEEAAKAPVDTKVSGSKMNKKKVAGDAIWADLD NNGVIDDKDIAFIGWANPDKKGALINNLSYKNFSLRLVVDFALGHSIANIWRCRANANAR NAIITTTDVTNGNIWWQTGDAATAKYPRYDVASDWDNGYRNHMRTIAYAGMNSNGNADNT AYYSKGDYLCFREVSLGYELPRSICSKMKVKGVSLNAGVTNIGYITAFDGLNPEQYDGQE TGEYFLPIQFNFGVRLTF >gi|226332009|gb|ACIB01000047.1| GENE 14 16759 - 17667 758 302 aa, chain - ## HITS:1 COG:VC1776 KEGG:ns NR:ns ## COG: VC1776 COG0329 # Protein_GI_number: 15641779 # Func_class: E Amino acid transport and metabolism; M Cell wall/membrane/envelope biogenesis # Function: Dihydrodipicolinate synthase/N-acetylneuraminate lyase # Organism: Vibrio cholerae # 5 302 2 297 298 236 39.0 5e-62 MKHYQRLEGMVAATFTPMDARGDINLSVIDKYADLMAESGMAGVFVCGTTGESHSLTTGE RKAILAQWIKSARKRFKVIAHVGSNCQLEAMELARHAQEVEADAFAAMAPCFFKPSSVKD LVDFFTPIAQSAPDLPFYYYNMPSMTGVSLSVPSFLIEGKKTMPNLVGTKFTHNNLMEMG ECLELNNGEFEVLHGYDEILIAGLALGAVAGVGSTYNYLPAVYQNLFDAFKKGDICTARR MQQKSIEIVKIIIKYGGGVRGGKAIMNLIGVDCGRCRLPVTPFGDDEYSSLKRDLEKIGF LN >gi|226332009|gb|ACIB01000047.1| GENE 15 17873 - 18595 766 240 aa, chain - ## HITS:1 COG:mll6865 KEGG:ns NR:ns ## COG: mll6865 COG2186 # Protein_GI_number: 13475721 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: 
Mesorhizobium loti # 8 226 6 218 250 74 22.0 2e-13 MDTLKLHGQAITLVDQVEDNLLTYFRTKDLRAGDAIPNEMELAAALGVARSVLREALSRL KMVGMIETRTRRGMILTEPSILGGMKRVVDPRILSEHSLFDLLGFRIALELGICSDLFQN ITPEDIVELKEIVRLGIAFENNEYAPISEFTFHAKLYEITGNATIREFQEIIHPVMVFVK DKFKELLEPINIEIKERGELVTHADLLGFLEKHDEAGYRKALEKHFAVYKIFMKRRIVNE Prediction of potential genes in microbial genomes Time: Tue May 17 23:53:07 2011 Seq name: gi|226332008|gb|ACIB01000048.1| Bacteroides sp. 3_2_5 cont1.48, whole genome shotgun sequence Length of sequence - 29851 bp Number of predicted genes - 26, with homology - 26 Number of transcription units - 11, operones - 9 average op.length - 2.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 109 - 168 10.5 1 1 Op 1 . + CDS 389 - 1234 604 ## COG0077 Prephenate dehydratase 2 1 Op 2 . + CDS 1209 - 2393 1258 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase 3 1 Op 3 5/0.000 + CDS 2416 - 3477 1306 ## COG2876 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 4 1 Op 4 . + CDS 3493 - 4266 817 ## COG0287 Prephenate dehydrogenase + Term 4286 - 4333 6.2 + Prom 4916 - 4975 3.4 5 2 Tu 1 . + CDS 5013 - 7016 2122 ## COG0358 DNA primase (bacterial type) 6 3 Op 1 . - CDS 7051 - 8643 1234 ## COG3119 Arylsulfatase A and related enzymes 7 3 Op 2 . - CDS 8670 - 10151 867 ## COG3119 Arylsulfatase A and related enzymes - Term 10161 - 10214 7.1 8 4 Op 1 . - CDS 10235 - 11842 1464 ## BF3950 hypothetical protein 9 4 Op 2 . - CDS 11856 - 15164 2797 ## BF3951 hypothetical protein - Prom 15332 - 15391 4.3 10 5 Op 1 . - CDS 15399 - 16331 693 ## COG3712 Fe2+-dicitrate sensor, membrane component 11 5 Op 2 . - CDS 16407 - 16985 468 ## BF3953 putative RNA polymerase ECF-type sigma factor - Prom 17023 - 17082 6.4 12 6 Op 1 . - CDS 17113 - 17697 692 ## COG0302 GTP cyclohydrolase I 13 6 Op 2 . - CDS 17700 - 18149 434 ## BF3955 hypothetical protein - Term 18157 - 18219 13.5 14 7 Op 1 . - CDS 18226 - 18981 974 ## COG0149 Triosephosphate isomerase 15 7 Op 2 . 
- CDS 19030 - 20361 881 ## BF3957 hypothetical protein 16 7 Op 3 . - CDS 20351 - 20914 591 ## BF3731 hypothetical protein - Prom 20934 - 20993 1.8 - Term 20928 - 20979 13.1 17 8 Op 1 . - CDS 20998 - 21879 665 ## COG0739 Membrane proteins related to metalloendopeptidases 18 8 Op 2 . - CDS 21902 - 22366 409 ## COG0105 Nucleoside diphosphate kinase - Prom 22411 - 22470 4.8 19 9 Op 1 . - CDS 22585 - 24681 1408 ## COG1200 RecG-like helicase 20 9 Op 2 . - CDS 24681 - 25340 287 ## PROTEIN SUPPORTED gi|163764767|ref|ZP_02171821.1| ribosomal protein L15 21 9 Op 3 . - CDS 25345 - 25896 624 ## COG0693 Putative intracellular protease/amidase 22 9 Op 4 . - CDS 25968 - 26810 966 ## BF3737 putative TonB exported protein 23 9 Op 5 . - CDS 26807 - 27187 338 ## BF3738 putative tansport related protein - Prom 27223 - 27282 3.4 24 10 Op 1 . - CDS 27300 - 28022 775 ## COG0811 Biopolymer transport proteins 25 10 Op 2 . - CDS 28022 - 28735 793 ## COG0854 Pyridoxal phosphate biosynthesis protein - Prom 28840 - 28899 5.2 + Prom 28704 - 28763 4.8 26 11 Tu 1 . 
+ CDS 28868 - 29740 647 ## COG0061 Predicted sugar kinase Predicted protein(s) >gi|226332008|gb|ACIB01000048.1| GENE 1 389 - 1234 604 281 aa, chain + ## HITS:1 COG:ECs3462_2 KEGG:ns NR:ns ## COG: ECs3462_2 COG0077 # Protein_GI_number: 15832716 # Func_class: E Amino acid transport and metabolism # Function: Prephenate dehydratase # Organism: Escherichia coli O157:H7 # 3 274 1 271 282 137 31.0 2e-32 MKRIAIQGTLGSYHDIAAHKYFEGEEIELICCANFEDVFAAIRKDSQTIGMLAIENTIAG SLLHNNELLRQSGTQIIGEYKLRISHSFVCLPEEDWNDITEVNSHPIALMQCREFLNQHP RIKVVEAEDTALSAEIIKRENLKGHAAICSRAAAERYGMKVLQEGIETNKHNFTRFLVVA DPWQVDEIRKQNTVLNKANIVFTLPHSEGSLSQVLSILSFYNINLTKIQSLPIIGREWEY QFYVDVAFNDYLRYKQSITAITPLTKELKILGEYAEGKSNV >gi|226332008|gb|ACIB01000048.1| GENE 2 1209 - 2393 1258 394 aa, chain + ## HITS:1 COG:aq_273 KEGG:ns NR:ns ## COG: aq_273 COG0436 # Protein_GI_number: 15605813 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Aquifex aeolicus # 13 392 5 385 387 295 42.0 8e-80 MQKENQTYKVAPADRLADVSEYYFSKKLKEVARMNAEGKDVISLGIGSPDMPPSEQTIET LCNNAHDPNGHGYQPYVGIPELRKGFADWYKRWYGVELNPATEIQPLIGSKEGILHVTLA FVNPGEQVLVPNPGYPTYTSLSKILGAEVVNYNLKEEDGWMPDFDELEKMDLSRVKLMWT NYPNMPTGANATPELYKRLVEFARRKNIVIVNDNPYSFILNDKPISILSVPGAKECCIEF NSMSKSHNMPGWRIGMLASNAEFVQWILKVKSNIDSGMFRAMQLAAAKALEADSTWYEGN NVNYRNRRHLAGEIMKTLGCTYDEKQVGMFLWGKIPASCADVEELTEKVLQEARVFITPG FIFGSNGARYIRISLCCKDAKLAEALERIKSIMK >gi|226332008|gb|ACIB01000048.1| GENE 3 2416 - 3477 1306 353 aa, chain + ## HITS:1 COG:DR1001_2 KEGG:ns NR:ns ## COG: DR1001_2 COG2876 # Protein_GI_number: 15806024 # Func_class: E Amino acid transport and metabolism # Function: 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase # Organism: Deinococcus radiodurans # 16 242 26 245 270 148 39.0 2e-35 MELESILLPGVEDKRPMVIAGPCSAETEEQVMSTATQLAAKGVKIFRAGIWKPRTKPGGF EGIGVDGLAWLKEVKKETGMYVSTEVATAKHVYECLKAGIDILWVGARTTANPFAVQEIA DALKGVDIPVLVKNPVNPDLELWIGALERINNAGLKRLGAIHRGFSSYDKKLYRNLPQWH 
IPIELRRRIPELPIFCDPSHIGGKRELVAPLCQQAMDLNFDGLIVESHCNPDCAWSDASQ QVTPDVLDYILNLLVIRKETQTTENLNVLRKQIDECDDNIIQELAKRMRVAREIGTYKKE HDITVLQTGRYNEILEKRGAQGEQCGMSAEFVKVIFEAIHEESVRQQMEIINK >gi|226332008|gb|ACIB01000048.1| GENE 4 3493 - 4266 817 257 aa, chain + ## HITS:1 COG:MJ0612_1 KEGG:ns NR:ns ## COG: MJ0612_1 COG0287 # Protein_GI_number: 15668793 # Func_class: E Amino acid transport and metabolism # Function: Prephenate dehydrogenase # Organism: Methanococcus jannaschii # 56 251 65 272 285 65 24.0 9e-11 MRILILGAGKMGSFFTDILSFQHETAVFDVNPHQLRFVYNTYRFTTLEEIKEFEPELVIN AATVKYTLDAFRKILPVLPKDCILSDIASVKTGLKKFYEESGFRYVSTHPMFGPTFASLS NLSSESAIIISESDHLGKVFFKDLYNSLNLNIFEYTFDEHDETVAYSLSIPFVSTFVFAA VMKHQEAPGTTFKKHMAIAKGLLSEDDYLLQEILFNPRTPSQVENIRTELKQLLEIITNK DAEGMKKYLTKIREKIK >gi|226332008|gb|ACIB01000048.1| GENE 5 5013 - 7016 2122 667 aa, chain + ## HITS:1 COG:BH1375 KEGG:ns NR:ns ## COG: BH1375 COG0358 # Protein_GI_number: 15613938 # Func_class: L Replication, recombination and repair # Function: DNA primase (bacterial type) # Organism: Bacillus halodurans # 2 441 5 447 599 281 37.0 2e-75 MIDQATIDRILDVAQIVEVVSDFVTLRKRGVNYVGLCPFHNEKTPSFSVSPSKGLCKCFS CGKGGNAVHFIMEHEQMSYPEALRYLAKKYNIEIKERELTNEEKEVQSNRESMFIVNNFA RDYFQNILKNHVDGRSIGLAYFRQRGFRDDIIDKFQLGFSTEGRDALAQEALRKGFKQEF LVKTGLCYETDDHKLRDRFWGRVMFPVHTLSGKVVAFGGRVLSTENKKLAKYVNSPESEI YHKSNELYGIYFAKQAIVKQDRCFLVEGYTDVISMHQSGVENVVASSGTSLTPGQIRLIH RFTNNITVLYDGDMAGIKASIRGIDMLLEEGMNIKVCLLPDGDDPDSFARKHNATEFQNF IQEHETDFIRFKAQLLMEDAGKDPMKRAELINDIVRSIAVIPEAIVRDVYIKECGQLLRI EDKLLVSEVAKRRELQAEKGNKPIASNNAPTPQPGEMPPPFPPEEMEADTYQSFIPQEGK EGQEFYKYERLIIQMIVRYGEKVMCNLTDEEGNEVPVTVVEYVINDLKEDELAFHNPLHR RILSEASEHIHDQEFASERFFVAHPDPKISTIATELASDRYQLSKYHSKTQKLVTDEERL YEMVPMLMINFKNAIVAEELKHIMYALQDPSIANDNAQCDAVMQRYKEMKEIQNLMAKRL GDRVVLR >gi|226332008|gb|ACIB01000048.1| GENE 6 7051 - 8643 1234 530 aa, chain - ## HITS:1 COG:CC1172 KEGG:ns NR:ns ## COG: CC1172 COG3119 # Protein_GI_number: 16125424 # Func_class: P Inorganic ion transport and metabolism # Function: 
Arylsulfatase A and related enzymes # Organism: Caulobacter vibrioides # 17 460 21 473 521 193 31.0 9e-49 MKFVTLSLLATALTVGGNAAEQARSSKQPNILFILADDFGWRDLACTGSRYYESPNIDGI ARNGVRFTQGYAACQVSSPSRASIMTGKFTARHGITNWIGEGSGEEWRKMGRHSKLLPAQ YVWQLPKEDITLPEALKAHGYKTFMAGKWHLGGEGSYPEDHGFDINIGGHEAGGPYPGGY FAPYGNPKMKEGPDGENLSMRLAHETASFIETHTRRNKKQPFFAFLSFYAVHAPIETTEA KWRHFRNKADSMGIAPVGFEVDRTLPVRLQQDNPIYAGLIQQMDDAVGVVLAKLHELGLD ENTIIVFTSDNGGVSSGDAYATSNYPLRGGKGRQWEGGIRVPLFIDFPGNTLKGDSCVVP VTGADLYPTFLDMAGIPLMPGQHQDGVSLLPLLQGKSIPERALYWHYPHYGNQGGEPSSI IRQGDWKLIHYYEDGRDELYNLRIDETESEPLNVQYPEKVEFLSKKLSVWLTEVGARYPE PDPQYNPAAEALYKKKTRERMMKTLEATRKKQLGKDFKPNADWWGSETKD >gi|226332008|gb|ACIB01000048.1| GENE 7 8670 - 10151 867 493 aa, chain - ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 1 469 1 478 497 212 32.0 2e-54 MKRLILPIACGICTVTSDAQTDKQPHPNVIFIYADDLGYTDLSCTGSRFYETPHIDKLAR EGVCFTQSYAACPVSSPSRAALLTGKYPARINLTDYIPGDRAYGPHKNQRLASLPFNLHL SKDEITMAEAFRQNGYSTFMAGKWHLAESAEYYPEQNGFDINIGGNNTGHPSKGYFSPYG NPQLKDGPEGEYLTDRLTDEVIRYISEPKEKPFFVYLSYYTVHLPLQAKAEKIAKYRRKL SRAVPADSSFVKKGETYHKLVQDIPAYAAMVESLDENIGRLLDTLHRSGLDERTIVVFTS DNGGMATSNTTRNIPTSNLPLRAGKGYLYEGGIKVPAIIRWSRHLKGRQVSDTPIIGTDY YPTLLDLCGLPLLPGQHVDGVSMKPVLQGGRLSRPSLFWHYPHYSGGLGGRPSAAIREGD YKLIEFFEDHHVELYNVIQDESEEKDLSQIYPEIADGLRKKLYLWYKEVGARMPVDNPHY VSPVKDSDSFENK >gi|226332008|gb|ACIB01000048.1| GENE 8 10235 - 11842 1464 535 aa, chain - ## HITS:1 COG:no KEGG:BF3950 NR:ns ## KEGG: BF3950 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 535 1 535 535 1069 99.0 0 MKNIYLFLLSVVILCSCNDFLDREPKTNLSPGSFWKSEEDLRLAANVFYQNMNRSYTLDN QSADGFANVGNLVSSGTHAPGNTDGIWTTAYKQIRHANNFLENYEKAEVSEATKNRYAGE VRFFRAYFYFNLVKRFGDVPYVLRTLDMDSEELMGPRVPKEQVIAGIIEDLEFAETHIPL KSKLPTDVGRLTKGAAQAMLARVALYIGTWNKFHGSGEYKSYLQIAKEASKRLIDSKEYS 
LYADYRNLFLLPGEDSNEHILSFRYSAEADTYNPRIRATIADLSHSPTKVLADAFLCKDG LPLEKSAYRVEYLPAGKEFENRDPRMALTIWKPGDPFLGKPFVPNLTSQTRTGYMFKKYG DEESYSNMKSRIDEILIRYAEVLLTFAEASFELDDRISDEDLNLSVNALRNRFEGDPNRL PDLTNAFVAEHGLSMRDEIRRERRVELAAESFRYDDIIRWKTAETELPAAILGAKFDPEL YPSTVPGKDVILDKNGFILVQNAESRTFDSSKDYLFPLPLREVSLNPNLKQNPNW >gi|226332008|gb|ACIB01000048.1| GENE 9 11856 - 15164 2797 1102 aa, chain - ## HITS:1 COG:no KEGG:BF3951 NR:ns ## KEGG: BF3951 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1102 24 1125 1125 2207 99.0 0 MRVTIFLLFFCVFSTIAGTVNSQNARVSIHKTNVPLEEILNEIEHQTDYLFMYSNNIDVK ERTSIRVSGKPVSEVLKKLLGNKIAYEMEGMHIILSDKRENQTAIAQQANKIQVKGNVTD MAGEPVIGASILEKGTTNGIVTDFNGDFSLNVNPGAVLVVSYIGYSAQEVKVIPGKVLKI KIKEDAELLDEVVVVGYGTQKKVNLTGAVETVDADVIENRPIRSATDALQGTVSGLTVTS GTGKPGEFASFKIRGNTSVNSAGALVIIDGMPGDINTVNPQDIETISVLKDAASAAIYGA RAAEGVVLVTTKKGTSEKVKVEYTGNFSFNTPTRLPESNTGLDHALLSNVAFTNAGLAVP FSQKAIDAIKDPNTIAIPNGKEWTYTSDMDWIDLMMDHSFQQTHNLTISKASDRLKYLFS IGWLDQNGMFSEYGPDNYDRINLRSNISVELIKDKLSLDSRISYSRGVNLYHAAEGSWSI PYITFIQAGPNMPVYDPNGNYSRHRMQLNPIQALKEGGEGRTRNQRIEGVFTLEYKPVKG LSLRAVGGANILDGQKKEWRRAYGKYGVDGLISTAFGQKSPNSVTQNNSHRQYLTGQLIA EYKGVFGKHDINVLGGWSAEENLYEDLQGKRTNIVGNELPALNLGDTDGWSNAADENEWA LLSGFMRASYAFAAKYLVEVNFRADASSRFSKKNRWGVFPSASVGWRITEEKFMQNQHIF DNLKLRASWGQLGNQNGLGLYDHIASYNINGYYPFKSELGQWAVISKLPSESRTWETVEV KNIAVDMAFLRNRLTVTGEYFIKKNKDMLVSIEIPSIIGIDVPTGNYGELKVKGWEVTVG WQDKIKDFSYGARFNLSDQKDKLVDYGVEYNGFVAGVNQKVQGYSLGSIFGYRTDGYFTS EEEVKNSAAFNKAITGVGDIKYIDKDGDGKISAPNDLEYLGTTTPRYTFGLNLTAAWKGF DLGVLLQGVGKRNFYLSSEVMNPYYATWNNFSYKMHNDYWTPENPNAAFPRYYAGANHNY QISDHWLQNAAYVRLKNLQLGYTISPKLTKSWGIQRLRVYFSGDNLCEYSKLNDNFDPEL SNINGYVYPIMRNFSFGINVTL >gi|226332008|gb|ACIB01000048.1| GENE 10 15399 - 16331 693 310 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: 
Pseudomonas aeruginosa # 108 256 124 273 331 69 32.0 7e-12 MKIEKEILYRYFNGDATPEEEHKIRQYLEASDENWKEYLRERKFFDTIILKEQVVSEEKQ MKRRQLIRKISLECLKVAAVLLIAFGTAFFWNNQSEKPATKSVVNTLKGQMANITLPDGS RVWLNSNTRIEYSQHFDDKREVQIDGEAYFEVVRNTGRPFIVYTPDDEQVEVLGTKFYVE AYSGTKKFETALIEGSVRVRAANSQFILQPSYKAVLKGGKMSVEKITDFDIYRWREGLIC FKNRHFSEILEELKKYYGVHIRFDAAKINNPVLTVKFRLSDGIEYALRVLQKDVKFKYVR NDEENTFVIK >gi|226332008|gb|ACIB01000048.1| GENE 11 16407 - 16985 468 192 aa, chain - ## HITS:1 COG:no KEGG:BF3953 NR:ns ## KEGG: BF3953 # Name: not_defined # Def: putative RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 192 1 192 192 373 100.0 1e-102 MSATINDLASFNKFFTENQHRFIRFAWTYTRDEVVAEDIVMESLMAYWENRDHMTPEINP AAYVLTVVKNKCLNYLRHLQLVNDVSDRVASHSEWELSNRIATLDACEPNALFASEVQDI IDRVISRLSATTARIFLLSRYDNKSHKEIAEIMGMTVKGVEFHISKATKELRVALKDYLV LFPFLCDFLSRN >gi|226332008|gb|ACIB01000048.1| GENE 12 17113 - 17697 692 194 aa, chain - ## HITS:1 COG:slr0426 KEGG:ns NR:ns ## COG: slr0426 COG0302 # Protein_GI_number: 16331608 # Func_class: H Coenzyme transport and metabolism # Function: GTP cyclohydrolase I # Organism: Synechocystis # 13 194 48 230 234 213 59.0 2e-55 MLEKEEIISPALEDLKNHYRSIITLLGEDAEREGLLKTPERVAKAMLTLTKGYHMDPHEV LRSAKFQEEYSQMVIVKDIDFFSLCEHHMLPFYGKAHVAYIPNGYITGLSKIARVVDIFS HRLQVQERMTLQIKECIQETLNPLGVMVVVEAKHMCMQMRGVEKQNSVTTTSDFTGAFNQ AKTREEFMNLIRQR >gi|226332008|gb|ACIB01000048.1| GENE 13 17700 - 18149 434 149 aa, chain - ## HITS:1 COG:no KEGG:BF3955 NR:ns ## KEGG: BF3955 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 12 149 12 149 149 273 100.0 1e-72 MKKLGLLLFLFVCAAVVRAQSNIVKSLERNVPGQGKVTIHQDSRIEALLGTARTGTGEQT VIKSSGYRVQAYAGNNTRQAKNEAHQVGTRVKEYFPELSVYTSFNPPRWLCRVGDFRSIE EADAMMRKLKATGVFKEVSIVKDQINIPL >gi|226332008|gb|ACIB01000048.1| GENE 14 18226 - 18981 974 251 aa, chain - ## HITS:1 COG:FN1366 KEGG:ns NR:ns ## COG: FN1366 COG0149 # Protein_GI_number: 19704701 # Func_class: G Carbohydrate transport and metabolism # Function: 
Triosephosphate isomerase # Organism: Fusobacterium nucleatum # 1 251 1 251 251 235 49.0 7e-62 MRKNIVAGNWKMNKTLQEGIALAKELNEALANEKPNCDVIICTPFIHLASVTPLVDAAKI GVGAENCADKESGAYTGEVSAAMVASTGAKYVILGHSERRAYYGETVEILKDKVKLALAN GLTPIFCIGEVLEEREANKQNEVVAAQLASVFDLSAEDFSKIVLAYEPVWAIGTGKTASP AQAQEIHAFIRSAVAEKYGKEIADNTSILYGGSCKPSNAKELFANPDVDGGLIGGAALKV ADFKGIIDAFN >gi|226332008|gb|ACIB01000048.1| GENE 15 19030 - 20361 881 443 aa, chain - ## HITS:1 COG:no KEGG:BF3957 NR:ns ## KEGG: BF3957 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 443 1 443 443 897 99.0 0 MKDRELHIIQKVWTNLCRFALAGVFIFSGFAKAVDPLGSEYKIQDYLDAFGMGTWLPAFF PLLAGIVLSAIEFSVGIFLFFGIRKTTATWLALLLMIFMTPLTLYLALANPVSDCGCFGD AWVLTNWQTFWKNIILLIAAISVFRWKHQMIRFISAKMEWLVSLYTFLYVFALSFYCLGN LPILDFRPYKIGKNIPEGMDVPEGAKPTVYESVFVLEKNGEKKEFSLENYPDSTWKFIEA RTIVKEKGYEPSIHDFSMTNLETGEDITEEVLSDKNYTFLLVAHRIEEADDSNIDLINEI YDYAVEHGYRFYCLTSSLDDQIEQWKDKTGAEYPFCLMDDITLKTMIRSNPGLMLIKEGT ILNKWSDSELPDEYALTDKLENLELGKQKEESDVHTIGYVLLWFAIPLLMVLGVDILVVK RLEKRRKRAAEKAVENSKSSSEA >gi|226332008|gb|ACIB01000048.1| GENE 16 20351 - 20914 591 187 aa, chain - ## HITS:1 COG:no KEGG:BF3731 NR:ns ## KEGG: BF3731 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 8 187 1 180 180 348 100.0 7e-95 MHTICKEMKDTKQQFEHVIALCRDLFSKKLHDYGPAWRILRPASVTDQIFIKANRIRSIE VKGVTLVDEGIRSEFIAIVNYGIVGLIQLELGYAESADITVEEALALYDKHAKEALELML AKNHDYDEAWRSMRISSYTDLILMKIYRTKQIESLAGQTLVSEGIDANYMDMINYSVFGL IKIEFEG >gi|226332008|gb|ACIB01000048.1| GENE 17 20998 - 21879 665 293 aa, chain - ## HITS:1 COG:HI0409 KEGG:ns NR:ns ## COG: HI0409 COG0739 # Protein_GI_number: 16272358 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Haemophilus influenzae # 98 205 328 442 475 105 48.0 7e-23 MNFNCIKTVMIAAAAMVSLNSFSQDLIARQAPIDRKLKSVDSLALQKQIRAEQSEYPALS LYPNWNNQYVHAYGKDAIIPDSYTIDLTGFHMPTPSTRITSPFGPRWRRMHNGLDIKVNI 
GDTIVAAFDGKVRIVKYERRGYGKYVVIRHDNGLETVYGHLSKQLVEENQLVKAGEPIAL GGNTGRSTGSHLHFETRFLGIAINPALMFDFPKQDIVADTYTFRKTRGYERNRAGSHDTN IASDGEIRYYKVKKGDSLSRIAKLRGVSVSTLCKLNRITTKTTLRPGQVLRCS >gi|226332008|gb|ACIB01000048.1| GENE 18 21902 - 22366 409 154 aa, chain - ## HITS:1 COG:AF0767 KEGG:ns NR:ns ## COG: AF0767 COG0105 # Protein_GI_number: 11498373 # Func_class: F Nucleotide transport and metabolism # Function: Nucleoside diphosphate kinase # Organism: Archaeoglobus fulgidus # 2 149 1 148 151 146 45.0 2e-35 MMEKTLVILKPCTLQRGLVGEITRRFERKGLRLAGMKMVQLTDEVLSEHYSHLSSKPFFQ RVKDSMMTAPVIVCCFEGVDAIQAVRALAGPTNGRLAAPGTIRGDYSMSFQENIVHTSDS PETAAVELNRFFKPEEIFDYKQATFDYLYANDEY >gi|226332008|gb|ACIB01000048.1| GENE 19 22585 - 24681 1408 698 aa, chain - ## HITS:1 COG:slr0020 KEGG:ns NR:ns ## COG: slr0020 COG1200 # Protein_GI_number: 16331409 # Func_class: L Replication, recombination and repair; K Transcription # Function: RecG-like helicase # Organism: Synechocystis # 18 670 146 805 831 496 43.0 1e-140 MFDLTTRDIKYLSGVGPQKAAVLNKELEIYSLHDLLYYFPYKYVDRSRIYYIHEIDGNMP YIQLKGKILGFETFGEGRQRRLLAHFSDGTGVVDLVWFQGIKYVTNKYKLHEEYIVFGKP TVFNGRINVAHPDIDSPADLKLSSMGLQPYYNTTEKMKRSFLNSHAIEKMMATVIGQIQE PLSETLSPKLIADHHLMSLTDALRNIHFPSNPELLRKAQYRLKFEELFYVQLNILRYAKD RQRKYRGYVFETVGETFNTFYSKNLPFELTGAQKRVLREIRQDVGCGKQMNRLLQGDVGS GKTLVALMSMLMALDNGFQACMMAPTEILANQHYDTIRELLFGMDVRVELLTGSVKGKKR EAILAGLLTGDVHILIGTHAVIEDTVNFASLGLAVIDEQHRFGVAQRARLWSKSVQPPHV LVMTATPIPRTLAMTLYGDLDVSVIDELPPGRKPIATIHQFDNRRESLYRSVRKQIEEGR QVYIVYPLIKESEKIDLKNLEEGYLHICEEFPDCKVCKVHGKMKPAEKDAQMQLFISGDA QIMVATTVIEVGVNVPNASVMIIENAERFGLSQLHQLRGRVGRGADQSYCILVTTYKLTE ETRKRLEIMVRTNDGFEIAEADLKLRGPGDLEGTQQSGIAFDLKIADIARDGQLLQYVRT IAEEITDADPGGVLPENAILWQQLRALRKTNVNWAAIS >gi|226332008|gb|ACIB01000048.1| GENE 20 24681 - 25340 287 219 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764767|ref|ZP_02171821.1| ribosomal protein L15 [Bacillus selenitireducens MLS10] # 5 219 6 223 234 115 38 4e-25 MCRTALIVAGGKGLRMGSELPKQFLPIGGKPVLMRTLEAFHRFDEKMQIILVLPREQQDF 
WRELCEEHGFDIKHQIADGGETRFHSVKNGLALVNGIGVVGIHDGVRPFVSQEVIARCFR EAVVRKAVIPVIDVVETVRHLTESGSETVSRNDYKLVQTPQVFDADLLKRAYEQEFTPFF TDDASVVEAMGVPVYLVEGNRENIKITTPFDLKVASALL >gi|226332008|gb|ACIB01000048.1| GENE 21 25345 - 25896 624 183 aa, chain - ## HITS:1 COG:CAC1629 KEGG:ns NR:ns ## COG: CAC1629 COG0693 # Protein_GI_number: 15894907 # Func_class: R General function prediction only # Function: Putative intracellular protease/amidase # Organism: Clostridium acetobutylicum # 7 182 6 180 188 139 43.0 4e-33 MGTVYAFFADGFEEIEALTTIDTLRRAGLDVEIVSVTPDEIVVGAHDVSVLCDKNFENCD FFDAELLFLPGGMPGAATLDKHEGLRKLILSFAEKNKPIAAICAAPMVLGKLGLLKGRRV TCYPSFEQYLDGADCTNEPVVRDGNIITGMGPGAAMEFALTIVDTLLGKEKVNELVEAMC VRR >gi|226332008|gb|ACIB01000048.1| GENE 22 25968 - 26810 966 280 aa, chain - ## HITS:1 COG:no KEGG:BF3737 NR:ns ## KEGG: BF3737 # Name: not_defined # Def: putative TonB exported protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 280 1 280 280 403 99.0 1e-111 MNIKRKSEYIGALGALLVHVAIIALLILVSFAIPHPDEEAGGVPVMMGDVDAAYGNYDPS TMVDVEVLPEEVPAPQPEPEVETEQEMITQTEEETVVVKPKAEPKKEKPKVAKKLEKTPE EKAAEAKKLAEEKAERERKAAAEAASKRVAGAFGKGSQMGGSKGTATSGEGVEGSKDGNS STGAKSGVGGYGTFNLGGRSIGEGGLPRPVYNVQEEGRVVVSITVNPAGHVIATSINRLT NTVNSTLRKAAEDAAKKARFNAVDGVNNQTGTITYYFNLK >gi|226332008|gb|ACIB01000048.1| GENE 23 26807 - 27187 338 126 aa, chain - ## HITS:1 COG:no KEGG:BF3738 NR:ns ## KEGG: BF3738 # Name: not_defined # Def: putative tansport related protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 126 20 145 145 236 100.0 2e-61 MASMTDVIFLLLIFFMITSTVVSPNAIKVLLPQGKQQTSAKPLTRVIIDKDLNYYAAFGN EKEHALGLEELTPFLQSCADKEPEMYVALYADETVPYREIVKVLNIANENHFKMVLATRP PETKKK >gi|226332008|gb|ACIB01000048.1| GENE 24 27300 - 28022 775 240 aa, chain - ## HITS:1 COG:FN1312 KEGG:ns NR:ns ## COG: FN1312 COG0811 # Protein_GI_number: 19704647 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport proteins # Organism: Fusobacterium nucleatum # 36 224 1 190 202 92 31.0 
5e-19 MNAMLLLAQVATNLADSVASANPVLTPVSAPAEMNMLDMAIKGGWIMIVLAVLSVVCFYI LFERNYMIRKAGKEDPMFMEKIKDYIHSGEIKSAINYCRTINTPSARMIEKGISRLGRPV NDVQVAIENVGNIEVAKLEKGLTVMATISGGAPMLGFLGTVTGMVRAFYEMANAGSGNID ITLLSGGIYEAMITTVGGLIVGIIAMFAYNYLVMLVDRVVNKMESRTMEFMDLLNEPAQK >gi|226332008|gb|ACIB01000048.1| GENE 25 28022 - 28735 793 237 aa, chain - ## HITS:1 COG:BMEI0621 KEGG:ns NR:ns ## COG: BMEI0621 COG0854 # Protein_GI_number: 17986904 # Func_class: H Coenzyme transport and metabolism # Function: Pyridoxal phosphate biosynthesis protein # Organism: Brucella melitensis # 3 236 4 244 246 203 50.0 2e-52 MTKLSVNINKVATLRNARGGNVPNVVKVALDCESFGADGITVHPRPDERHIRRSDVYDLR PLLRTEFNIEGYPSPEFIDLVLKVKPHQVTLVPDDPSQITSNSGWDTKVNFDFLTEVLDE FNGAGIRTSVFVAPDAEMIEYAAKAGADRVELYTEPYATAYPKDPAAAVAPFVEAAKAAR RLGIGLNAGHDLSLLNLNYFYKNIPWLDEVSIGHALISDALYLGLERTIQEYKNCLR >gi|226332008|gb|ACIB01000048.1| GENE 26 28868 - 29740 647 290 aa, chain + ## HITS:1 COG:PA3088 KEGG:ns NR:ns ## COG: PA3088 COG0061 # Protein_GI_number: 15598284 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted sugar kinase # Organism: Pseudomonas aeruginosa # 64 286 64 289 295 162 38.0 9e-40 MKFAIFGNTYQAKKSSHAATLFKLLEKHGAEICVCREFHRFLKSDLKLNVKADDLFDENN FDADMVISIGGDGTFLKAARRVGNKGIPILGINTGRLGFLADVSPEEMEETIEEVYQNHY TVEERSVLQLLCDDKHLQNSPYALNEIAILKRDSSSMISIRTAINGAHLTTYQADGLIIA TPTGSTAYSLSVGGPIIVPHSKTIAITPVAPHSLNVRPIVICDDWEITLDVESRSHNFLV AIDGSSETCKETTRLTIRRADYSIKVVKRFNHIFFDTLRTKMMWGADSRV Prediction of potential genes in microbial genomes Time: Tue May 17 23:54:08 2011 Seq name: gi|226332007|gb|ACIB01000049.1| Bacteroides sp. 
3_2_5 cont1.49, whole genome shotgun sequence Length of sequence - 78291 bp Number of predicted genes - 57, with homology - 56 Number of transcription units - 30, operones - 14 average op.length - 2.9 N Tu/Op Conserved S Start End Score pairs(N/Pv) + 5S_RRNA 3 - 96 98.0 # CR626927 [R:3201914..3202064] # # Bacteroides fragilis NCTC 9343 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides. + Prom 112 - 171 1.8 1 1 Op 1 27/0.000 + CDS 260 - 1303 725 ## COG0845 Membrane-fusion protein 2 1 Op 2 9/0.000 + CDS 1300 - 4332 2399 ## COG0841 Cation/multidrug efflux pump 3 1 Op 3 . + CDS 4329 - 5624 958 ## COG1538 Outer membrane protein 4 1 Op 4 9/0.000 + CDS 5667 - 6692 802 ## COG3275 Putative regulator of cell autolysis 5 1 Op 5 . + CDS 6685 - 7458 590 ## COG3279 Response regulator of the LytR/AlgR family + Term 7490 - 7531 9.9 - TRNA 7526 - 7599 73.7 # Met CAT 0 0 - Term 7473 - 7524 16.2 6 2 Op 1 . - CDS 7696 - 8007 218 ## BF4311 putative transmembrane protein 7 2 Op 2 . - CDS 8011 - 8475 495 ## COG1522 Transcriptional regulators - Prom 8563 - 8622 5.2 - Term 8593 - 8634 11.1 8 3 Op 1 . - CDS 8668 - 9537 1058 ## COG0545 FKBP-type peptidyl-prolyl cis-trans isomerases 1 9 3 Op 2 . - CDS 9558 - 10142 639 ## COG0545 FKBP-type peptidyl-prolyl cis-trans isomerases 1 - Prom 10225 - 10284 4.4 + Prom 10484 - 10543 3.2 10 4 Tu 1 . + CDS 10609 - 11319 787 ## COG0846 NAD-dependent protein deacetylases, SIR2 family 11 5 Op 1 26/0.000 - CDS 11253 - 12026 684 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 12 5 Op 2 . - CDS 12019 - 13284 977 ## COG0438 Glycosyltransferase + Prom 13097 - 13156 1.9 13 6 Op 1 10/0.000 + CDS 13389 - 14591 1304 ## COG0677 UDP-N-acetyl-D-mannosaminuronate dehydrogenase 14 6 Op 2 . + CDS 14646 - 15776 1011 ## COG0381 UDP-N-acetylglucosamine 2-epimerase + Term 15820 - 15848 -0.1 15 7 Op 1 . - CDS 15828 - 16475 593 ## BF4508 hypothetical protein 16 7 Op 2 . 
- CDS 16544 - 17743 1001 ## COG0438 Glycosyltransferase 17 7 Op 3 . - CDS 17743 - 18753 663 ## BF4300 putative exopolysaccharide biosynthesis protein 18 7 Op 4 11/0.000 - CDS 18763 - 19950 787 ## COG0438 Glycosyltransferase 19 7 Op 5 . - CDS 19954 - 21474 1223 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid 20 7 Op 6 . - CDS 21490 - 23529 2008 ## COG0143 Methionyl-tRNA synthetase - Prom 23617 - 23676 3.5 - Term 23618 - 23669 8.2 21 8 Op 1 . - CDS 23693 - 24781 481 ## BF4501 hypothetical protein 22 8 Op 2 . - CDS 24798 - 25121 149 ## BF4500 hypothetical protein - Prom 25141 - 25200 6.2 + Prom 25033 - 25092 8.1 23 9 Tu 1 . + CDS 25298 - 26161 234 ## BF4294 hypothetical protein + Term 26182 - 26238 7.1 24 10 Tu 1 . - CDS 26466 - 26618 65 ## BF3408 hypothetical protein - Prom 26702 - 26761 4.8 + Prom 26828 - 26887 4.0 25 11 Tu 1 . + CDS 27091 - 28485 938 ## BF4495 hypothetical protein + Prom 28591 - 28650 6.3 26 12 Op 1 . + CDS 28675 - 30735 1423 ## COG1042 Acyl-CoA synthetase (NDP forming) + Prom 30753 - 30812 5.1 27 12 Op 2 . + CDS 30835 - 31323 488 ## BF4493 hypothetical protein 28 12 Op 3 . + CDS 31334 - 32398 756 ## COG0535 Predicted Fe-S oxidoreductases + Prom 32400 - 32459 4.2 29 13 Tu 1 . + CDS 32500 - 36489 2351 ## COG3292 Predicted periplasmic ligand-binding sensor domain + Prom 36541 - 36600 4.8 30 14 Tu 1 . + CDS 36642 - 38675 1369 ## BF4490 hypothetical protein + Term 38700 - 38745 5.8 31 15 Tu 1 . - CDS 38791 - 39990 1022 ## BF4489 hypothetical protein - Prom 40093 - 40152 5.0 32 16 Op 1 . - CDS 40164 - 41846 851 ## BF4488 hypothetical protein 33 16 Op 2 . - CDS 41866 - 43404 831 ## COG0606 Predicted ATPase with chaperone activity 34 16 Op 3 . - CDS 43438 - 44508 784 ## BF4486 hypothetical protein - Prom 44551 - 44610 6.4 - Term 44542 - 44589 11.0 35 17 Tu 1 . - CDS 44620 - 46311 1963 ## BF4280 hypothetical protein - Prom 46388 - 46447 6.1 - Term 46349 - 46385 0.3 36 18 Tu 1 . 
- CDS 46522 - 47475 545 ## COG4974 Site-specific recombinase XerD - Prom 47545 - 47604 3.8 + Prom 47418 - 47477 4.3 37 19 Op 1 . + CDS 47549 - 47968 451 ## COG0757 3-dehydroquinate dehydratase II 38 19 Op 2 . + CDS 47996 - 49453 1448 ## COG0469 Pyruvate kinase 39 19 Op 3 . + CDS 49462 - 50100 501 ## COG4122 Predicted O-methyltransferase + Term 50199 - 50233 2.0 + Prom 50266 - 50325 7.0 40 20 Op 1 . + CDS 50421 - 50753 417 ## COG0858 Ribosome-binding factor A 41 20 Op 2 . + CDS 50750 - 51988 767 ## COG4591 ABC-type transport system, involved in lipoprotein release, permease component + Term 52001 - 52037 -0.8 - Term 52132 - 52180 2.5 42 21 Tu 1 . - CDS 52232 - 52783 362 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 52915 - 52974 5.4 + Prom 52713 - 52772 3.8 43 22 Op 1 . + CDS 52934 - 53893 600 ## BF4475 putative anti-sigma factor 44 22 Op 2 . + CDS 53933 - 57454 3153 ## BF4474 hypothetical protein 45 22 Op 3 . + CDS 57476 - 57883 348 ## BF4473 hypothetical protein 46 22 Op 4 . + CDS 57920 - 59425 1280 ## BF4473 hypothetical protein + Prom 59449 - 59508 4.9 47 23 Tu 1 . + CDS 59531 - 61123 1115 ## COG3119 Arylsulfatase A and related enzymes + Term 61147 - 61197 4.1 48 24 Tu 1 . - CDS 62053 - 64551 1997 ## BF4469 hypothetical protein - Prom 64577 - 64636 8.1 49 25 Tu 1 . - CDS 64673 - 66091 1468 ## COG0499 S-adenosylhomocysteine hydrolase - Prom 66209 - 66268 5.5 + Prom 66168 - 66227 6.8 50 26 Tu 1 . + CDS 66278 - 67846 1133 ## COG0642 Signal transduction histidine kinase + Term 67847 - 67905 19.0 - Term 67835 - 67893 19.0 51 27 Op 1 . - CDS 67902 - 69578 1334 ## COG0318 Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 52 27 Op 2 . - CDS 69588 - 70163 567 ## COG1396 Predicted transcriptional regulators - Prom 70222 - 70281 7.8 - Term 70251 - 70300 15.6 53 28 Tu 1 . 
- CDS 70322 - 70591 446 ## PROTEIN SUPPORTED gi|53715743|ref|YP_101735.1| 30S ribosomal protein S15 - Prom 70735 - 70794 4.1 + Prom 70542 - 70601 6.2 54 29 Tu 1 . + CDS 70747 - 72546 2144 ## COG1217 Predicted membrane GTPase involved in stress response + Term 72571 - 72612 8.1 - Term 72627 - 72668 8.1 55 30 Op 1 . - CDS 72780 - 74807 1898 ## BF4257 hypothetical protein 56 30 Op 2 . - CDS 74822 - 78082 3011 ## BF4460 hypothetical protein 57 30 Op 3 . - CDS 78099 - 78290 68 ## Predicted protein(s) >gi|226332007|gb|ACIB01000049.1| GENE 1 260 - 1303 725 347 aa, chain + ## HITS:1 COG:VC1674 KEGG:ns NR:ns ## COG: VC1674 COG0845 # Protein_GI_number: 15641678 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 1 340 14 362 369 125 28.0 9e-29 MKISVLLAALLLLFSCTDKDSKQSQDLIVKTAQAVSASGIKTTEFPFIAQPFRTSELSFR VGGPIDRLDVYAGNHYKQGSIIAEIDPRDFHIRKERAEAIYHQAKAEFERIEKLYEKNNV SASTYEKTKADYTTAKTAFDTASNELGDTRLTAPFDGYVGEVYIEKYQDVKPAQPVISFI DINRLKIEIYVTQNIAFASHPTDSVRIYFDAQPDKYYKAQIVEVSKGTTRNNLSYLLTAV LPNKEGKLLAGMSGKAIFDAPGTTDLTGVSIPQTALCYRPSEGEYVWVIDTNTRQVNRRT VKKGNLLPGGYVTITEGLRASETVATSGLRFLSDGMKVEISTKTNSL >gi|226332007|gb|ACIB01000049.1| GENE 2 1300 - 4332 2399 1010 aa, chain + ## HITS:1 COG:VC1757 KEGG:ns NR:ns ## COG: VC1757 COG0841 # Protein_GI_number: 15641761 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Vibrio cholerae # 1 1008 1 1012 1016 658 37.0 0 MKLVKYFLQKRAVTILLLVLVLAGGLFSYFKMGKLEDAPFTIKQALVLTSYPGASPAEVQ SQVTDILEEAIQSLGELYYLKTENRAGLSKITVYVKKEIRAEEMQQLWDKLRRKVNDVQD KLPAGAGTSIVNDDFGDVLGVFYGLTGDGHTYRELEDQAKFIKNELLKVKDVAKIEIYGV QTPTIDVLISPSVMAQSGVTTADIMRAFEAQNKMVDAGGIDAGTNRIRIESTGNFYSLDD IRDLTIVSRTGEHFRLADIAQIEEGYQTPPANQMRINGSPAVGIAISTVPTGNVVDMAEN VKMRIGELSQSMPDGYELISIYDQGYESAVANQGFILNLIISVITVVAILLFFIGFKNGL LIGSGLVFSIFATLIVMMACDIALQRMSLAAIIIAMGMLVDNAIVVSDSALINMERGMRK RVAIMRACSSTALPLLAATVIAILTFLPIYYSPHITGELLSSLVVVIGVSLMFSWVFALT 
QTPFFIQEFVRRPRPEELKASLFDGKYYHLFRKSLRWVLRHRTMTIASLVVLLLLSAWSF KFIPKVFVPALDKQYFTVDMWLPEGTTIGETDRIAGEISDYIRTHEETEMVSSYIGRTPP RYYLSNISFGPQSNYAQLLVKCKSSKESRALNALLQDSIRLKYPEPLIKVNKFELSPLTE AMIEARFLGPDPAVLDSLAGEAIEIMRRNPKVADARNEWGNMSMVMRPVYDPVKAGALGI TKSQMMESVKSISDGTRVGVYRDDEKKVPVLLKSEGADITDARSLGNFSVWNGEHSAPLS QVTERIETTWEWPQMRTYNRQLSMAAMCGVKSGYTMAEVHGEIRKEIEAMKLPEGYTFFW DSQYKDQREAMQAIGKFFPLAFLVLVVILVALFGNFRDPVIILCVLPLSIIGVAVGMLLT GFDFGFFPIAGWLGLLGMVIKNVIVLLDEIDVQRREGVVPYTAVIESTVSRTRPVLMAAT TTILGMVPLLFDIAFGGMAATIIFGLTFATLLTLFVTPALYAIFYKIKEK >gi|226332007|gb|ACIB01000049.1| GENE 3 4329 - 5624 958 431 aa, chain + ## HITS:1 COG:aq_1059 KEGG:ns NR:ns ## COG: aq_1059 COG1538 # Protein_GI_number: 15606342 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Aquifex aeolicus # 9 427 2 409 417 68 22.0 2e-11 MMMKKINRWLIFLLCVPTVAFAQQNSLLQKYRSMALDYNHDLKAADKNIAASIELEKAAQ KDLRPKLSGEANFQYTGNPLQLNIDLPSMQTPLAFEGRNMKYGASLSLLQPVYTGGRLLE SIRMAKHQQSLAIHQADYFRSAVSYQTDMQYWNTVARAEIVRVTTEYRNSVATFFQTIRE RVEAGLVDPQDLLMAEVKLNEAEYQLLQAKRNLETGRMALNSLIGVELHAPTEIEDTISV VRADKDLWGEGEIDRPELKMAYARIKIAESSKKLTYAKYKPQFYIGVDGSYSAPGYDFRS DLDPNYAIYAKVSVPLFEWGKRRDEKRASSFKVGMATDYLNQVTDQVKLEVETARVSLSQ TMEQVRLTEGSLSKAFENERMALERYTEGKASVIEVIEAQTYRQASQLNHVQAKVSAQGA YSELIRALNKY >gi|226332007|gb|ACIB01000049.1| GENE 4 5667 - 6692 802 341 aa, chain + ## HITS:1 COG:YPO3943 KEGG:ns NR:ns ## COG: YPO3943 COG3275 # Protein_GI_number: 16124071 # Func_class: T Signal transduction mechanisms # Function: Putative regulator of cell autolysis # Organism: Yersinia pestis # 165 339 377 560 565 83 31.0 5e-16 MKWNVLLYGLLFSGLGIFSYLLLVNYTELTPKVADVLYSKGAFVFFITAFNVLGYSTLRI SSWINTQYALNIRHRWKIIVIYVAVILLFLLLNYSLLIAAKLLAGIDNLFTFSNGGWRIL IVVWLVELVIVGLLLANRSIQNNLKLQQEAAKLQTENDTARYAALQSQLNPHFLFNSLNT LIAEIEYNPGNAVHFTKHLSSVYRYVLQCQDKTLVTLTEELEFLQSYLFLHKVRLGDCIS CNCCIASGYTSCMLPPLTLQLLAENVINHNSITLSKPMKIDLRLEEGYLVVSNPIQPKKS 
HESPGVGLKNLSNRCQLMLGKEIIVHNDEKVFIVKVPLLYE >gi|226332007|gb|ACIB01000049.1| GENE 5 6685 - 7458 590 257 aa, chain + ## HITS:1 COG:VCA0850 KEGG:ns NR:ns ## COG: VCA0850 COG3279 # Protein_GI_number: 15601605 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Vibrio cholerae # 1 233 2 243 261 90 27.0 2e-18 MNKINVAIIEDEIPAARLLHSMVSGLRPQWELTLIPGSVDEAVAWFDEHPHPDLIFLDIH LADGNAFDFLSAAHPSSVIIFTTAYDQYAIRAFTVNSIDYILKPIDEKRLSDAITKYESL LTNAVPRPEDYLGTLLEALQYKEKRYRTRFLISGVDRFWSLQVADIAYFYSENKVTFAVT RKGQEHILDLSLNKLMEQLDPERFFRANRQVLVCIDAIDHAEPFFNGKIVVTVRPPYKQK ITISEEKLSAFKLWLNH >gi|226332007|gb|ACIB01000049.1| GENE 6 7696 - 8007 218 103 aa, chain - ## HITS:1 COG:no KEGG:BF4311 NR:ns ## KEGG: BF4311 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 103 1 103 103 187 99.0 9e-47 MEWLNEYHLSGLLIGICTFLIIGLFHPVVVKAEYYWGTKCWWIFLILGIAGVTASLSTDN ILVSSLLGVFAFSSFWTIKEVFEQEERVKKGWFPKNPKRTYKF >gi|226332007|gb|ACIB01000049.1| GENE 7 8011 - 8475 495 154 aa, chain - ## HITS:1 COG:YPO0002 KEGG:ns NR:ns ## COG: YPO0002 COG1522 # Protein_GI_number: 16120355 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Yersinia pestis # 3 148 6 151 153 118 40.0 5e-27 MEKIDNLDRQILEIISQNARIPFKDVAAECGVSRAAIHQRVQRLIDLGVIVGSGYHVNPK SLGYRTCTYVGIKLEKGSMYKAVVAELQKIPEIVECHFTTGPYTMLTKVYARDNEHLMDL LNNKMQEIPGVTATETLISLEQSIKKEIPIHADK >gi|226332007|gb|ACIB01000049.1| GENE 8 8668 - 9537 1058 289 aa, chain - ## HITS:1 COG:ECs5185 KEGG:ns NR:ns ## COG: ECs5185 COG0545 # Protein_GI_number: 15834439 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerases 1 # Organism: Escherichia coli O157:H7 # 76 287 61 258 259 157 41.0 3e-38 MKKVSILAAVAMATGLASCTAQAPKATLKTDVDSLSYAIGISQTQGLKDYLSQRMEMDTT YMADFLKGVNDAANKTSKKDQAYLLGLQIGSQMAGPQAIKGMNHQLFADDSTMTVNKGDI 
LAGVFAGVLNKDMKMRPEEAQVLIQKMMESIKGKAAEKKYADNKAAGEKFLAENKTKEGV KTTASGLQYKVITEGKGEIPNDTCKVKVNYRGKLIDGTEFESTYERKEPFVTNVGGVIKG WTEALKMMPVGSKWELYIPQELAYGSRDMGQIKPFSTLIFEIELLDIEK >gi|226332007|gb|ACIB01000049.1| GENE 9 9558 - 10142 639 194 aa, chain - ## HITS:1 COG:YPO3532 KEGG:ns NR:ns ## COG: YPO3532 COG0545 # Protein_GI_number: 16123678 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerases 1 # Organism: Yersinia pestis # 5 194 14 206 206 182 48.0 4e-46 MDKFSYAIGLGIGQNLLGMGAKGIAVDDFAQAIKDVLEGNQTAISHQEAREIVNKYFEEL ETKMNAANIEQGKAFLEENKKRPNVVTLPSGLQYEVITEGTGKKAQATDQVKCHYEGTLI DGTLFDSSIKRGQPAVFGVNQVIPGWVEALQLMPEGSKWKLFIPSELAYGAQGAGEMIPP HSTLVFEVELIEVL >gi|226332007|gb|ACIB01000049.1| GENE 10 10609 - 11319 787 236 aa, chain + ## HITS:1 COG:jhp1180 KEGG:ns NR:ns ## COG: jhp1180 COG0846 # Protein_GI_number: 15612245 # Func_class: K Transcription # Function: NAD-dependent protein deacetylases, SIR2 family # Organism: Helicobacter pylori J99 # 1 231 1 225 234 240 51.0 2e-63 MKNLVVLSGAGMSAESGISTFRDAGGLWDRYPVEQVATPEGYARDPELVTHFYNERRKQL LEVEPNRGHELLAELEKDFQVTIVTQNIDNLHERAGSSHIIHLHGELTKVCSSRDPNNPH YIKELKPEEFEVKIGDLAGDGSQLRPFIVWFGESVPEIETAIDWVEKADVFVIIGTSMNV YPAAGLLNYVPRNAEIYLIDPKPVDVHSSRPIHVIQKGASEGVAELREKLLTTNHA >gi|226332007|gb|ACIB01000049.1| GENE 11 11253 - 12026 684 257 aa, chain - ## HITS:1 COG:jhp0094 KEGG:ns NR:ns ## COG: jhp0094 COG0463 # Protein_GI_number: 15611164 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Helicobacter pylori J99 # 8 218 3 215 260 142 37.0 5e-34 MHSHPSPKFSVITVTYNAEKVLEDTVQSVISQTYHHVEYIIIDGASKDGTLEIVNRYRDR IHQLVSEPDKGLYDAMNKGIALATGDYLCFLNAGDSFHEDDTLQKMVHSINGNELPDILY GETALVDAERHFLRMRRLSAPETLNWKSFKQGMLVCHQAFFPRHTLIEPYDLQYRFSADF DWCIRIMKKARTFHNTHLILIDYLAEGMTTQNHKASLLERFRIMTRHYGLLSTLAHHAWF VVRSFSRNSATPSDAPF >gi|226332007|gb|ACIB01000049.1| GENE 12 12019 - 13284 977 421 aa, chain - ## HITS:1 
COG:all4426 KEGG:ns NR:ns ## COG: all4426 COG0438 # Protein_GI_number: 17231918 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Nostoc sp. PCC 7120 # 1 416 1 412 417 227 34.0 3e-59 MRVLIINTSERLGGAAVAASRLMESLKNNGIKAKMLVRDKQTDQISVVGLQRNWWQVWRF VWERIVIWKANRFKKNNLFAVDIANTGTDITSLPEFQQADVIHLHWVNQGMLSLNDIRKI LKSGKPVVWTMHDMWPCTGICHHARECTNYHQECNHCPYLYGGGSKKDLSNRIFRKKQQL YKEAPITFVTCSQWLKGQAEKSALLTGETVISIPNPINTNLFKPRNKKEARSKCHLPQNG KLILFGSAKITDKRKGIDYLIESCKLLAEKHPELKDSLSVVVLGKQSEQLKPLLPFKVYP LNYVSNEHELVDVYNAVDLFVTPSLEENLPNTIMEAMACGVPCIGFNVGGIPEMIDHLHN GYVAQYKSSEDFANGIYWALTDPDYPSLSEQANRKVIANYSEGIIAKRYIDVYNKITGRY A >gi|226332007|gb|ACIB01000049.1| GENE 13 13389 - 14591 1304 400 aa, chain + ## HITS:1 COG:ECs4720 KEGG:ns NR:ns ## COG: ECs4720 COG0677 # Protein_GI_number: 15833974 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetyl-D-mannosaminuronate dehydrogenase # Organism: Escherichia coli O157:H7 # 4 381 6 399 420 423 54.0 1e-118 MKKVVFLGLGYIGLPTAAVVAGHGYEVVGVDVNPSVVETINQGKIHIVEPELDQIVKEVV RTGNLRAVSKPEQADAFFVVVPTPFKQNHRADITYVESATRSVIPYLREGNLFVIESTSP VFTTERMAEVIYKERPELKDKIYIAYCPERVLPGNTLYELVHNDRVIGGVNPESTAKAIE FYSAFVQGKLHPTNARTAEMCKLTENSSRDSQIAFANELSMICDKAGINVWELIELANKH PRVNILQPGCGVGGHCIAVDPWFIVSDYPEQAQIIKRARETNDYKADWCANKVMEACQQF VEKNDREPVVACMGLAFKPNIDDLRESPAKYIASRIVSESRAEVLIVEPNVASHASFHLT DYREAYQKADIVVWLVRHTPFVELPREESKLELDFCGVRK >gi|226332007|gb|ACIB01000049.1| GENE 14 14646 - 15776 1011 376 aa, chain + ## HITS:1 COG:STM3920 KEGG:ns NR:ns ## COG: STM3920 COG0381 # Protein_GI_number: 16767195 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine 2-epimerase # Organism: Salmonella typhimurium LT2 # 3 374 2 369 376 469 61.0 1e-132 MKKILLVFGTRPEAIKMAPLVKALQRDTEHFETKVCVTAQHRQMLDQVLEVFDIIPDYDL NIMAPNQDLYDITTKVLLGLRDVLKDFCPDTVLVHGDTTTSMAASLAAFYRQVAVGHVEA GLRTYDMLSPWPEEMNRQVTDRICTYYFAPTGKSKQNLLQENIDAKKIFVTGNTVIDALL 
MAVDIISKKPGIKEKLHQELRDKGYEVGQREYILVTGHRRENFGEGFLHICKAIRELAAL HPEMDIVYPVHLNPNVQKPVYELLSGVDNVYLISPLDYLPFIYAMQHSTLLLTDSGGVQE EAPSLGKPVLVMRNTTERPEAVEAGTVKLVGTDAEAIVSNVTELLRNKELYRRMSETHNP YGDGHACERILSALTR >gi|226332007|gb|ACIB01000049.1| GENE 15 15828 - 16475 593 215 aa, chain - ## HITS:1 COG:no KEGG:BF4508 NR:ns ## KEGG: BF4508 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 215 1 215 215 406 100.0 1e-112 MTDLTILIAVIALALWPIVFLISRILHERNKRAKPSGDTASAETEEVTEEMTTSALIMSI LQQLGCQPEVNEENHISFKYQGDDFLVAAEDGLRLIIVWNPWWASISIDNQALPYLKEII NAVNMNSLVTTVYALDEDEKTFGIHSKCHMLFAPEEEEPEKSFTDLLDSFFTTHNTIKEN LKQLGNGMPDMEKKERVRIKGFAAYKDNSTELKGE >gi|226332007|gb|ACIB01000049.1| GENE 16 16544 - 17743 1001 399 aa, chain - ## HITS:1 COG:DR1225 KEGG:ns NR:ns ## COG: DR1225 COG0438 # Protein_GI_number: 15806244 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Deinococcus radiodurans # 198 373 175 345 402 84 32.0 5e-16 MKVLYTFGGMPHYLNAMLNKLHNKGVEITVITPQKGNATIGKGVKMVEGGTYRHLTAIEK KAFYGKSSYPSLPEIVREEKPDILIMGWPYFLQVFFQPRLRKAMKECRTRLVIREIPFQT PPYGKIKEYFHENPMYDENMRLVSTGTGFYLKQWLTAKIRKYCYARITGTLNYSTAAYDI LPSYGVKQEQIHVTYNSTDTDALLKEKEAVLTSPPLLPPSSKRALHIGRLVKWKRVDLLI DAFTKVIASHPDAELVVVGDGPELDNLKKQAADLNLTEQVRFIGAVYSPKELGAYMNEST VYVLAGMGGLSINDAMTYGLPVVCSVCDSTERDLVTDGVNGLFFKEGNADSLSDKLNKLF ASPERCASMGRESERIIREKINIETVSERYLQAFRTFMQ >gi|226332007|gb|ACIB01000049.1| GENE 17 17743 - 18753 663 336 aa, chain - ## HITS:1 COG:no KEGG:BF4300 NR:ns ## KEGG: BF4300 # Name: not_defined # Def: putative exopolysaccharide biosynthesis protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 336 1 336 336 673 99.0 0 MKVTIYPYSINKGDIRVNPYISDFIHALQQNGITIANPPHKNPLFSLIPRKADSDAYIFH WPENVPDYKYGMLQTLAAIWLLFKIKCHHKKIIWFLHNKQPHVMKHRWSKKLLIHLLLRK ADLIVTHATEGIKVVQDQYPKAVAKTVFLHHPTKNRIEEYIPQPPETDLLIWGNISRYKG VPEFVRFATQHSLKLKTKIIGKCSSQELLEELHKESNEMISIEDRSIPFEELKQEIRRSR FVLVPYAAESILSSGILMDSLSFGAKVIGPAVGSFRDYATEPLLNVYTFHTFDDIQELVD 
KANEATDIAGYNRFLNAHSWNEFGKKFQNLLQKIIQ >gi|226332007|gb|ACIB01000049.1| GENE 18 18763 - 19950 787 395 aa, chain - ## HITS:1 COG:aq_1641 KEGG:ns NR:ns ## COG: aq_1641 COG0438 # Protein_GI_number: 15606745 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Aquifex aeolicus # 170 388 91 313 316 84 27.0 5e-16 MNILIQQTKAFPHRANSFYWFYAQLTEWLTAHGVDSKLYFSYLELADSEFENGLLLPDHY SQFYTPRNIEAICRFIIDKQIDVILDYSHVITGDTRKYYLEIKKRNPGIKICTMIHNCPS HTTQLKQYELSTLRFKDVHGPKKLFQWMLPQLYISLLKKVVSHQNRSAYDTLDEVVLLSP AYIPEFKKLIGKKDAWKLSAIPNAIKPVHSNIPIEEKDKEIIFVGRMATEKALPKLLKIW GMVQDKLPDWKLTLVGDGPQFGTCRQIIAEKKLKRVCLTGHQMSIPYIDRARILCLTSVI EGLPTVFTEAMSLGVIPIGFDSFNAIYDMIDDGIDGFIIPDNNYEQYAETILRLAQNDTL RCRIAYKAQKRKNRYDIEQVGPLWMETFRKHGLIK >gi|226332007|gb|ACIB01000049.1| GENE 19 19954 - 21474 1223 506 aa, chain - ## HITS:1 COG:BS_tuaB KEGG:ns NR:ns ## COG: BS_tuaB COG2244 # Protein_GI_number: 16080613 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Bacillus subtilis # 30 490 3 468 483 144 23.0 4e-34 MYSGLYERRPDTPFIFTNGRERKYNMAEQSLKEKTAKGLFWGGFSNGIQQLLNLLFGIII TRMLDSTDYGMIGMLAIFTAVANSIQESGFTAALANKQTFRHEDYNAVFWFSFLMGASLY LLLFFCAPFIAAFYKTPQLIPLSRFLFLGFLISSCGTAHNAVLFKKLMVKEKAKATITAL LCSGTIGIVMAYNGMAFWGLAVQQITYIFIANALLWYFSPWRPTFSFNFKPIREMLPFSS KLLITNVFHYINDNIFSVLLGRFYTSQDVGYYTQANKWTNMGFSLISNMINGVSQPVLVE TSSDAMRQKNIFRKMLRFTAFISFPAMFGLALIANEFIVIAVTAKWQACVPIMQILCIWG AFVPITYMYSNLLISKGKSNLFMWNTIAQSLVQLTMLLCTISQGILVMAVIYTVINIGWL LIWHYFVNKQIHITLWEVMKDITPYLLISGGVIGTSYFLTIGFHNLYVLLILKIGIAVSL YVFIMWIGKSVIFFESVQFIRKKLLK >gi|226332007|gb|ACIB01000049.1| GENE 20 21490 - 23529 2008 679 aa, chain - ## HITS:1 COG:PAB2364_1 KEGG:ns NR:ns ## COG: PAB2364_1 COG0143 # Protein_GI_number: 14521189 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionyl-tRNA synthetase # Organism: Pyrococcus abyssi # 7 546 3 553 562 558 49.0 1e-159 
MEKNFKRTTVTSALPYANGPVHIGHLAGVYVPADIYVRYLRLKKEDVLFIGGSDEHGVPI TIRAKKEGITPQDVVDRYHFLIKKSFEEFGISFDVYSRTSSKTHHELASDFFKKLYEKGE FIEKTSEQYYDEEAHQFLADRYITGECPHCHSEGAYGDQCEKCGTSLSPTDLINPKSAIS GSKPVMKETKHWYLPLDKHETWLRQWILEEHKEWRPNVYGQCKSWLDMGLQPRAVSRDLD WGIPVPVEGAEGKVLYVWFDAPIGYISNTKELLPDSWETWWKDPETRLVHFIGKDNIVFH CIVFPAMLKAEGSYILPDNVPSNEFLNLEGDKISTSRNWAVWLHEYLEDFPGKQDVLRYV LTANAPETKDNDFTWKDFQARNNNELVAVYGNFVNRAMVLTQKYFEGKVPAAGELTDYDK ETLKEFSDVKAEVEKLLNVFKFRDAQKEAMNLARIGNKYLADTEPWKLAKTDMERVGTIL NISLQLVANLAIAFEPFLPFSSERLRQMLNMDSFDWAELGRNDLLPAGHQLNKPELLFEK IEDATIEAQVQKLLDTKKANEEANYKAKPIRANIEFDDFMKLDIRVGTVLECQKVPKADK LLQFKIDDGLETRTIVSGIAQHYKPEELVGKQVCFIANLAPRKLKGIVSEGMILSAENND GSLAVVMPGREVKPGSEVK >gi|226332007|gb|ACIB01000049.1| GENE 21 23693 - 24781 481 362 aa, chain - ## HITS:1 COG:no KEGG:BF4501 NR:ns ## KEGG: BF4501 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 362 1 362 362 702 100.0 0 MRKALLTVISFTLCLYITSCSQSSKPEKARTIETIATDAQQTLSFNHEPLSIDPIGIGDI IVTDTFLILALNKEENMLHVYNLPHLQFLGSFQKIGNGPDEVILPSAFTQWFNKDGQIQL VMRSYQKFTGLLNISKSLIENKAIYDNKYTYNAPKGKNSFQQSSVSYLLGDSIFLINRSI IMRPQDNQNDFFEVYDYKNDSILRSFYASNFPKELLEHHGRDQAFQKDIAISNDCKKMVI AYRFLNMISIVNIEKEEINNLFTDGNKLNWEQVIEGTPKPYYTKVHCNNAYIWAMAIEGE DPSTFRSRLDIFDWKGNYLCKAHLDKWVSSFSIDERNQTMYAVTADDMLVRYNIKELLDQ LP >gi|226332007|gb|ACIB01000049.1| GENE 22 24798 - 25121 149 107 aa, chain - ## HITS:1 COG:no KEGG:BF4500 NR:ns ## KEGG: BF4500 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 107 11 117 117 209 100.0 2e-53 MKQTLSKAVVTIIIACTALYAWNHKQPVLTNVQLQNLEAIAAGEEGACIRWIEQTCYYSF SEEHDNEPHYECNGSSGQAGMTSCGVINNKKPTFGYVKGTCLICIEH >gi|226332007|gb|ACIB01000049.1| GENE 23 25298 - 26161 234 287 aa, chain + ## HITS:1 COG:no KEGG:BF4294 NR:ns ## KEGG: BF4294 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 287 25 311 311 571 99.0 1e-161 MRAIFIFVCILLFVSCQNSDKSRIAHMIDEWDGKEIIYPDDLVFTTMGEDTVKWFLKDSR 
YTIVTYADSIGCMGCKLKLPVWKDFISYLDSVSDHTVKVIFILHPRDKKEMVHLLVYNDF SYPVCLDIKGSFDKINKIPSNLAFQTFLLDNRNKVIAIGNPIHNPNVRTLYLNLILSDSI CESRDLIQTKISLPETIDMGVFDWSEEKEAMLIIKNLGVVSLTVENIVTSCGCTSVEYSR RPVSLKDSLVIKIKYKAESPGYFNKDIAIYCNVPSSPVYVKLSGKAQ >gi|226332007|gb|ACIB01000049.1| GENE 24 26466 - 26618 65 50 aa, chain - ## HITS:1 COG:no KEGG:BF3408 NR:ns ## KEGG: BF3408 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 50 1 50 50 82 92.0 6e-15 MLIEERYKDEDTGSGGVNSLPKLELSYSAGVCLFLLKQAKRTIINLKKKK >gi|226332007|gb|ACIB01000049.1| GENE 25 27091 - 28485 938 464 aa, chain + ## HITS:1 COG:no KEGG:BF4495 NR:ns ## KEGG: BF4495 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 464 1 464 464 967 99.0 0 MKHFTGILFLLLLCFSCTPVHDAPLEQALTLAGDNRKELQQVLGHYEGDSLKHKAACFLI ENMIGKGTIRYLLRESDSCYIQQEPEPDLTCITADYLIENIDLAFEVWQKYPWCKQLSFR EFCRNILPYRLKQEPLDRWRSYYYTRYKMTVDSLARAGATMREIVFFFNSQHGKKYLHDA AKIPGDFSIELIEKLGGGTCDHLALNAVQLMRAVGIPLNLDILPYHGKVNGGHAYNSFTD ENGKFFYFSPYEREPERNQWIAPLVQRVCYERQPEPKIGRNRWNAQLANRLLKEVTAEYY LSDSVRLPVHTSDTVAYIATFNRGAFKVVSQGRVESNSVLHRVLPYGLLYFQMADKKGKL VPTGNPFVMTPDSIHFITPIRQTTVLNGILTYDVKRVLELGDEAYTLYYWKDGWQPVKEV TSKDSRTLDFGEVPVRSLFLVCGNTYMGRMQRPFLLEDGKPVYY >gi|226332007|gb|ACIB01000049.1| GENE 26 28675 - 30735 1423 686 aa, chain + ## HITS:1 COG:AF1211 KEGG:ns NR:ns ## COG: AF1211 COG1042 # Protein_GI_number: 11498810 # Func_class: C Energy production and conversion # Function: Acyl-CoA synthetase (NDP forming) # Organism: Archaeoglobus fulgidus # 5 682 3 679 685 330 33.0 4e-90 MITTQLLRPQSIVVVGASNNTHKPGGAILKNLINGGYQGELRAVNPKETEVQGVPSFADA KELPDTDLAILAIPAVLCPEVVETLAAEKQTRAFIILSAGFGEETQEGALLEERILETVN KYGASLIGPNCIGLMNTWHHSVFTQPIPNLNPHGVDLISSSGATAVFILESAVTKGLQFN SVWSVGNAKQIGVEDVLQYMDENFDVRKDSKIKLIYIESIKNPDRLLFHASSLIRKGCKI AAIKAGSSESGSRAASSHTGAIASSDSAVEALFRKAGIVRCYSRGELTTVGCIFTLPELK GKNFAIVTHAGGPGVMLTDALSKGGLNVPKLEGPVAEELKSKLFPGASVGNPIDILATGT 
PEHLSIAIDYCEEKFENIDAILAIFGTPGLVTMFETYEVLHQKMLTCKKPLFPVLPSVRT AGEEVAFFLEKGHVNFADEVMLGTALSRIINAPKPAVPEIELFGVDVPRIRRIIDSIPQN GYIEPHYVQALLHSAGIPVVEEFVSGNKDEVLAFARRCGFPVVAKVVGPVHKSDVGGVVL NIKGEQHLAFEFDRMMQIPEARAIMVQPMLKGTELFIGAKYEEKFGHVVLCGLGGIFVEV LKDVSSGLAPLSYEEAYSMIHSLRAYKIIQGTRGQKGVNEDKFAEIIVRLSTLLRFATEI KEMDINPLLATEKEVVAVDARIRIEK >gi|226332007|gb|ACIB01000049.1| GENE 27 30835 - 31323 488 162 aa, chain + ## HITS:1 COG:no KEGG:BF4493 NR:ns ## KEGG: BF4493 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 162 1 162 162 306 99.0 1e-82 MKKKNEQWLRYSNRALSGLLMLFGFVSCDNGGGDIPVEYGMPSAKYRVKGKVIDADTQEP VPGIEVVTGAVHTGDGKEWLSYPDTLITDKDGAFATERTEFPSKKYRFIVRDVDGDANGT YSKDSVEVEAGGFTGGSYWYRGETSIDKTIEIKKETAKDNEK >gi|226332007|gb|ACIB01000049.1| GENE 28 31334 - 32398 756 354 aa, chain + ## HITS:1 COG:all2029 KEGG:ns NR:ns ## COG: all2029 COG0535 # Protein_GI_number: 17229521 # Func_class: R General function prediction only # Function: Predicted Fe-S oxidoreductases # Organism: Nostoc sp. 
PCC 7120 # 30 354 13 353 420 124 27.0 3e-28 MSLRKRLGLEIALKIRNNLRELHPLRQLFWECTLQCNLACKHCGSDCRKMSEQKDMPAAD FLQVVDSITPHVNPNEVNIIITGGEPLMRDDLEEVGMALYRKGYPWGIVSNGLYLTRERL DSLMAAGLHAVTISLDGFAEEHNWLRGNPDSYEKALEAIKMLVHEPELTWDVVTCVNRKN YSYLEELKAYLYTIGVRNWRIFTIFPVGRAANHLEFQLTDEEFTGVLEFIKKVRKEGRVH LSYGCEGFLGKYESEVRDHFYSCNAGISVASVLADGSISSCPSIRSNFHQGNIYEDDFME VWEKRFQVFRNREWARKGECADCSFFRYCEGNGMHLHNDNGDLLFCHQKRIVEL >gi|226332007|gb|ACIB01000049.1| GENE 29 32500 - 36489 2351 1329 aa, chain + ## HITS:1 COG:XF1330_1 KEGG:ns NR:ns ## COG: XF1330_1 COG3292 # Protein_GI_number: 15837931 # Func_class: T Signal transduction mechanisms # Function: Predicted periplasmic ligand-binding sensor domain # Organism: Xylella fastidiosa 9a5c # 21 762 21 739 740 135 22.0 5e-31 MKNTFCVLACFFITIFCQAQSVEEHYYFKNLSIRNGLSQNTVNAILQDRKGFMWLGTKDG LNRYDGLSFRKFKHDAANPRSIGNSFITSLYEDFNGNIWVGTDAGVYIYYPEKEAFEEFD CQSLEKTRIERSVSMIAGDKQGRVWIAVEAQGMFCYDTRQKLLRNYPLSEISSNIKCFTF DSGGTLWLGFYGDGLYYSKDNLATVHPYGSPEDGKREFEGGVITKIVQGNYNCLYIGSVK EGVSELNLTSGQVRNLLAIDESGESIFCRDLLPYSDNELWIGTESGIYIYNLRTAQFIHL RASLYDSYSLSDNAIYALYKDREEGLWIGSYFGGVDYYPRQYTYFAKYYPKNIANSLHGK RVREFCRADDGTLWIGTEDGGLNHFNPKTKEFHFFEPSAGFTNIHGLCMDGSHLWVGTFS KGLRVIDTRTGVVLRTYTEGHTPHSLNDNSIFSICRTSAGEIYLGTLFGLLRYNRTQDNF DRIPELNGKFVYDIKEDSYGNLWLATYANGAYCYDVSVRRWKNYVFDAEDEKSLPYDKVL SVFEDSYRQIWLTTQGGGFCLFHPDTETFTRYGLKDGLPNDVVYQIVEDDDRFLWLTTNN GLVRFDPKTMEMKVFSTANGLPTNQFNYRSGFKDEAGNIYLGSINGFVAFDPRTFAENRQ VPAVAITDFLLFNKEVPVGETDSPLKSSITFSDKVVLTADQNSFSFRIAALSYQAPRMNK LMYKLEGFDEGWLTIGESPLVTYSNLGYGDYVFKVKASNSDGVWNEQETSLHLSILPPFY LSGWAYCFYVLFFMGCLVCVIFYFKRRNYRKQHRQMEMLEQEKEREVYHAKIDFFTNVAH EIRTPLTLIKGPLENIILKKEVDSETKEDLYIMKQNTERLLNLTNQLLDFRKTETRGFRL NFTECDVVAVLRETYLRFTSLAKQKGLDFILELPQECFMADVNQEALTKIISNLLNNGVK YASTYLRISLETDEKVFHIRTFNDGEMIPDTMKEEIFKPFVRLDKEDEVTTGTGIGLALS RSLAELHQGSLMMEKGEEVNCFCLTLPVNQDSTITLSAENVSQVEENSCGWEQEETDTKE KKPMILVVEDNPDMLAFIRKQLTTEYSVLTAMNGIEALAVLDNHYVNLVVSDVMMPQMDG FELCKTIKSDLSYSHIPVVLLTAKTNIQSKIEGLELGADAYIEKPFSVEYLLANISSLIH 
NREKLRQTFAKSPFVAANTMALTKADEEFIWKLNDIIQANLHNPEFSMEDMADALKMSRS SFYRKIKGVLDLSPNEYLRLERLKQAAQLLKEGKSRVNEICYTVGFNSPSYFSKCFLKQF GVLPKDFIG >gi|226332007|gb|ACIB01000049.1| GENE 30 36642 - 38675 1369 677 aa, chain + ## HITS:1 COG:no KEGG:BF4490 NR:ns ## KEGG: BF4490 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 677 1 677 677 1373 100.0 0 MKMILKTMVCLAVAFSGTAGAANYSPEKSQASLALKVPGNPAVEYPLTLSKLSDSYFDYE WKAKEKIPVTIFQQISTVDDKQQVTVVLTAMEDVYFNFEERIRTDFRHDDCQFYLPGFWY RRNLRSPKEAPSFHTSDSWVVREDRLSTPLTGIFDEKQKKYMTVVRRAEYIQDALSTHKE GEVILSGTTSLGFTGFENLDGTAALAFGFPYKEAPKTYIRKLTLAPSVTAFQLLKKGESI SLTWEIHEGKGEDFAEFVSHTWEYCYDTFQPKPVETDYTPDYTKEILSHFFIESFVGDRP LNYNSGVHMRTDDCQNTGSAEVGFVGRVLLNAFNAWEYGWKNNRADLKENAAKVFDTYLV NGFSPAGFFKEFVDYRTGYEETVFSIRRQSEGIYAIFHYLDFEKRNGRKHPEWEAKVRKM LDVFLQLQNPEGSFPRKFKDDLSIVDKSGGSTPSATLPLVMGYKYFKDKRYLESAKRTAE YLEKELISKSDYFSSTLDANCEDKEASLYAATATYYLALVSQGKEREHYTGLTKKAAYFA LSWYYLWDVPFAPGQMLGDIGLKTRGWGNVSVENNHIDVFIFEFASILNWLSKEYSEPRF SQFAEVISTSMRQLLPYEGHLCGVAKCGYYPEVVQHTNWDYGKNGKGYYNDIFAPGWTVA SLWELFSPGRAEQFFRK >gi|226332007|gb|ACIB01000049.1| GENE 31 38791 - 39990 1022 399 aa, chain - ## HITS:1 COG:no KEGG:BF4489 NR:ns ## KEGG: BF4489 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 399 1 399 399 790 99.0 0 MKTRMIYHSLAICCLLPAFAFTTGENDPFIKSPTVAKLTNTPNGPLISCDLKALKDTVNF PLSQLTEELQIVKLDNRDEALIGGWIRTTVGEKYILVSNNKQTPYKLFDRTGKFITNIGS YGQGPNEYLNTYAEQLDEANNRIYILPWQSSKILVFDLKGNALDPIPLCLRVPKGKFRVN TAKSEVTVTVLPFPKWPAVVWTQDLKGKRKNFVAPGSLAMPQDFSNEVSMGNNTAAYDVM LMKIMPQPSVDTLYHYNTASNKLEGRFTVKYPSNDKIPWHAYYEIPKYFIGDVSFPIQID ESTFSGSKPAYYMVDKKTLHGNYVRLYNDFISTPSQTIYPSFNNGYYVTNMEPMALKEIL EKEVNKKGLTADKKKKVQNLIKTLNDNDNNIVMFAKLKQ >gi|226332007|gb|ACIB01000049.1| GENE 32 40164 - 41846 851 560 aa, chain - ## HITS:1 COG:no KEGG:BF4488 NR:ns ## KEGG: BF4488 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 560 1 560 560 1001 99.0 0 
MKKSFLIILCLALLSCVTGCKDSTQTLLKKSVEMEGISTDSMLFYLQQIQSPNHLSDKQR AEYCFQLYKATLWKTQKPKDSLLKVCIPLFLHVGDTAQWLQAQLEQANSFFYKDQPDSIL HSARELRDKTEYMTPTQQRYYYNIQKFTYFNQKKYPEALKLANKVLALNNPSNDTLSLFY DHRTQLEILRKMGKTDEVIEGYYKMLEWFAPSKEYHYLTYTIAEDIVNYYLGQQDFDKAL ESVQNLRLYRRNRYDIPYYQLIRGQIFQSLHQLDSAGYYYKQAATSTSPYIAIEATSRLY QLTNATQQPEQAYYLAKTEDILYKDLTSNLKAKETTRKYNEVKLQNELYQLRLTQQEKEL WMRGIAVILLLIALLILFFYHQEKKKRLVSERRLQAEQAGEEARRLQHENELLHKEAELS ALREKEIIMRNKESEMREALFRHISFLQKLPSLHIENSTDDNPNRKKITVSDAEWCEVKQ AVNDAFNNFVDRLQEDYPQLNEKDICFCCLVKINVNIQDLSDIYCVSKAAITKRKYRIKT EKMHISDETLSLDAILQNFG >gi|226332007|gb|ACIB01000049.1| GENE 33 41866 - 43404 831 512 aa, chain - ## HITS:1 COG:slr0904 KEGG:ns NR:ns ## COG: slr0904 COG0606 # Protein_GI_number: 16331658 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Predicted ATPase with chaperone activity # Organism: Synechocystis # 1 507 1 507 509 538 54.0 1e-153 MLIKVFGAAVQGIEATLITIEVNSSRGCMFYMVGLPDSAVKESHQRILSALQVTGYKMPT SNIVINMAPADIRKEGSSYDLPLAIGMLAAGETISCQKLSRYMMMGELSLDGTIQPIKGA LPIAIKAREEGFDGLIVPSQNAREAAVVNNLSVYGVNNIQEVIEFINGKRELTPTIVNTR EEFYACQSDFEYDFADVKGQENVKRALEVAAAGGHNLIMVGAPGSGKSMMAKRLPSILPP LSLGESLETTKIHSVAGKLGRNSSLISQRPFRDPHHTISQVAMVGGGSFPQPGEISLAHN GVLFLDELPEFNRSVLEVLRQPLEDRRITISRVKSTIDYPASFMLVASMNPCPCGYYNHP TKPCVCNPGQVQKYLNKISGPLLDRIDIQIEIVPVPFEKISDRQQGESSAAIRQRVIKAR QKQEERFSGYPGTYCNAQMTSKQLSSFAQPDTKGLLLLKNAMERLNLSARAYDRILKVSR TIADLEESEQIQPSHLAEAISYRNLDRENWAG >gi|226332007|gb|ACIB01000049.1| GENE 34 43438 - 44508 784 356 aa, chain - ## HITS:1 COG:no KEGG:BF4486 NR:ns ## KEGG: BF4486 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 356 1 356 356 696 100.0 0 MRKIGLLFITIFCLLACKNNSSRISLEGEIKGLTNDTLYLYGTDALYDRIDTIYAEKGKF SYNLNIDSTVIDTLTTAVLLINGHVEYPVFLDKGNQITIKGNIDDLSYLDINGNEPNTSL SLFMKEQRGLGSASDKMMQEKAETFIRQHNSSLASVYLLDKYFVQTPQPDYIKIKEITEA MTGALLDRPYIENISDYIDQLEKVTVGKSAPYFSLPNEKGEKLSRSAERFRNRYLLLNFW 
ASWCDPQPEANAELKRLNKEYKKNKNFAMLGISLDIDREAWETAIKKDTLSWDQVCDFTG LSSETAKQYAILTLPTNILLSPTGKILARDIQGEALTDKLKELLKTEEKKPGKSIR >gi|226332007|gb|ACIB01000049.1| GENE 35 44620 - 46311 1963 563 aa, chain - ## HITS:1 COG:no KEGG:BF4280 NR:ns ## KEGG: BF4280 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 5 563 1 559 559 987 99.0 0 MIKKLYLPLLMALVVALSSCSNKMGALSSDYFTTTPQVLEAVGGKVPVTINGKFPEKYFK KNAVVEVTPVLKWEGGQVKGQPAVFQGEKVEGNDQTISYKMGGNYTMKTTFDYVPEMAKS ELYLEFNAKVGKKTIAIPAVKIADGVISTSELINNTLSSANPALGDDAFQRIIKEAHNAN IMFLIQQANIRSSELKTAKEFNKEVANVNDAENKKISNIEISAYASPDGGVKLNTGLAEN REGNTTKLINKDLKKAKIEVPVDAKYTAQDWEGFQELVSKSNIQDKELILRVLSMYQDPE QRETEIKNISSVYKTLADEILPQLRRSRLTLNYEIIGKSDEEIANLAATDPKQLNIEEIL YAATLTNDPAKKADIYTKASQQFPNDYRAFNNLGKLAYQAGDLDKAQSYVKKAESIKSAP EVNMNLGLIALAKGDKAAAESYLGKAAGAKELNETLGNLYVAQGQYERAVNAFGDTKTNS AALAQILAKDYNKAKNTLAGIATPDAYTDYLMAVLGARTNNTSMLTSSLKSAVAKNPALA KKAATDLEFAKYYTNADFMSIVK >gi|226332007|gb|ACIB01000049.1| GENE 36 46522 - 47475 545 317 aa, chain - ## HITS:1 COG:SA1328 KEGG:ns NR:ns ## COG: SA1328 COG4974 # Protein_GI_number: 15927078 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinase XerD # Organism: Staphylococcus aureus N315 # 16 305 4 294 295 212 41.0 7e-55 MKLEEKTKKKDEQGLIIKKYQQYLKLEKSLSKNTLEAYLTDLEKLLSFLSAEGVEILEVS LTDLQRFAAGLHDIGIHARSQARIISGIKSFFHFLIIADYIEADPSELLEGPKIGFKLPE VLTVEEIDRIISTIDLSKNEGQRNRAILETLYSCGLRVSELTGLKLSDLYFDEGFIKVEG KGSKQRLVPISPKAIQEIKLYFLDRNRINIKKDHEDYLFLSRRGTHLSRIMIFHLIKELA DMAGITQNISPHTFRHSFATHLLEGGANLRAIQCMLGHESISTTEIYTHIDRNMLRSEII EHHPRNIKYRQEKKPFR >gi|226332007|gb|ACIB01000049.1| GENE 37 47549 - 47968 451 139 aa, chain + ## HITS:1 COG:sll1112 KEGG:ns NR:ns ## COG: sll1112 COG0757 # Protein_GI_number: 16329990 # Func_class: E Amino acid transport and metabolism # Function: 3-dehydroquinate dehydratase II # Organism: Synechocystis # 2 138 6 144 152 151 53.0 4e-37 MRIQIINGPNINLLGKREPSIYGSVTFEEYLAELRKKYPDVELGYFQSNVEGEIIDIIQQ 
TGFDVDGIILNAGAYTHTSIALQDAIRSVTSPVIEVHISNVHAREQFRHVSMIACACKGV ICGFGLNSYRLALEALLDK >gi|226332007|gb|ACIB01000049.1| GENE 38 47996 - 49453 1448 485 aa, chain + ## HITS:1 COG:BB0348 KEGG:ns NR:ns ## COG: BB0348 COG0469 # Protein_GI_number: 15594693 # Func_class: G Carbohydrate transport and metabolism # Function: Pyruvate kinase # Organism: Borrelia burgdorferi # 1 471 1 473 477 393 46.0 1e-109 MLLKQTKIVASISDRRCDVDFIKELFDAGMNVVRMNTAHASREGFEALIANVRAVSNRIA ILMDTKGPEVRTTANADPILYQIGEKVKIVGDPDRETTRECIAVSYPNFVHDLNVGGTIL IDDGDLELRVIDKTTEYLLCEVQNEATLGSRKSVNVPGVRINLPSLTEKDRNNILYAIEK DIDFIAHSFVRNRQDVLDIRGILDAHNSDIRIIAKIENQEGVDNIDEILEVADGVMVARG DLGIEVPQERIPGIQRMLIRKCILAKKPVIVATQMLHTMINNPRPTRAEVTDIANAIYYR TDALMLSGETAYGKYPVEAVKTMTKIAAQAEKDKLGENDIRIPLDENSNDVTAFLAKQAV KATTKLKIRAIITDSYQGRTARNLAAFRGKYPVLAICYKEKTMRHLALSYGVEAIYMPEL ANGQEYYFAALRRLLKEGRLHPTDMVGYLSSGKAGTQTSFLEINVVEDALKHAGDSVLPN SNRYL >gi|226332007|gb|ACIB01000049.1| GENE 39 49462 - 50100 501 212 aa, chain + ## HITS:1 COG:aq_1507 KEGG:ns NR:ns ## COG: aq_1507 COG4122 # Protein_GI_number: 15606661 # Func_class: R General function prediction only # Function: Predicted O-methyltransferase # Organism: Aquifex aeolicus # 7 211 9 211 212 137 36.0 1e-32 MKETDLLDEYILQHIDEEGEYLKSLYRDTHVKLLRPRMASGHLQGRMLKMFVRMIRPRQI LEIGTYSGYSALCLAEGLEEGGMLHTFEINDEQEDFTRPWLENSAYADKIKFYIGDALRL IPALGITFDLAFVDGDKRKYIEYYEMTLAHLSVGGYIIADNTLWDGHVLEEPHSNDHQTI GIKAFNELVAHDERVEKVILPLRDGLTIIRKK >gi|226332007|gb|ACIB01000049.1| GENE 40 50421 - 50753 417 110 aa, chain + ## HITS:1 COG:BS_rbfA KEGG:ns NR:ns ## COG: BS_rbfA COG0858 # Protein_GI_number: 16078728 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribosome-binding factor A # Organism: Bacillus subtilis # 3 109 2 107 117 62 36.0 3e-10 METTRQNKISRLLQKELSEIFLLQTKAMPGVLVSVSAVRISPDMSIARVYLSIFPSEKSE EMVKNINNNMKSIRFELGTRVRHQLRIIPELKFFVDDSLDYIEKIDALLK >gi|226332007|gb|ACIB01000049.1| GENE 41 50750 - 51988 767 412 aa, chain + ## HITS:1 COG:CC1930 KEGG:ns NR:ns ## COG: CC1930 COG4591 # 
Protein_GI_number: 16126173 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ABC-type transport system, involved in lipoprotein release, permease component # Organism: Caulobacter vibrioides # 7 407 17 416 426 129 26.0 1e-29 MNLPFYIARRYLFSKKKHNAINIISAISVCGVALATMAMVCTLSVFNGFQDTVADLFTVF DPELKVTIAEGKVFDAQDPRIQAIRQMPEVDVLTETLEENAMVRYKDRQTMAVIKGVQNN FEQLTAIDSILYGAGEFLLNDSIVDYGVMGIELVSTLGTGIRFVDPLQIYIPKRSGKVNM ANPAASFNMDYLYSPGVVFVVNQQKYDGQYILTSLDFARRLLDYTTEVSAIELKLKPGSD LSFVKAKIKKELGDDFVIRNRYEQQEDVFRIMEIEKLVSYLFLTFILVIACFNVIGSLSM LILDKKEDVDTLRKLGANDRLVSRIFLFEGCMISFYGAIIGIVLGLLLCWVQMTYGIISL GGGSAAGNFVVDAYPVSVHLWDVIVIFITVFAVGFLSVWYPVRYLSKRLLKM >gi|226332007|gb|ACIB01000049.1| GENE 42 52232 - 52783 362 183 aa, chain - ## HITS:1 COG:PA1300 KEGG:ns NR:ns ## COG: PA1300 COG1595 # Protein_GI_number: 15596497 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 16 174 6 169 175 65 25.0 4e-11 MNEKEAIIKLKTCNDKAAFAFLYRSYWAKVYNFTRLYITSAVDVEEIVQEVFIKIWENRE ALDEEQNFAGYLFITMRNLVFNRSRKNLNEPFYQLSVIEAVEESYDIEEELDAANLRTHI SALISMLPPRQQEVFRLSRDEELSYREIAERLQISERTVEHHISDALKFLRKNIKLYLLF LSL >gi|226332007|gb|ACIB01000049.1| GENE 43 52934 - 53893 600 319 aa, chain + ## HITS:1 COG:no KEGG:BF4475 NR:ns ## KEGG: BF4475 # Name: not_defined # Def: putative anti-sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 319 1 319 319 605 98.0 1e-172 MKISNKYDRLVRKLIAGESSSEEMEELAHWNVVETKMKKQFDAAKNIVENGAIERRIWDK IDSRCQAPVERSQKLQLRYWRVALAACITALLIIGGGIFFFDKGHTASQRIIEYTEVVSS NSRLYVLPDSSKVWMQAGSRLRFSQDFMSNREVWLEGVSTFEVTKRKGHNFKVYIDQAFV EVKGTVFRVQSTCQDGTEVTLFSGKVDFNVKASQRKVEMKPLQQIVFHPEKDEVILKNIG NISWDEGRYKFVDMRMDDLIEAIHDIYHIPVELDRKVARNDLFTGYMRYDDPASKVIEKI CINMNLKFKKETQKIIIYK >gi|226332007|gb|ACIB01000049.1| GENE 44 53933 - 57454 3153 1173 aa, chain + ## HITS:1 COG:no KEGG:BF4474 NR:ns ## KEGG: BF4474 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined 
# 1 1173 1 1173 1173 2296 99.0 0 MKDMRLCFSVCNRRFLTGVIMFMLLYPMSVFAVQGQITVQGKAMSIKQAIQVIEKNSKYT FFYKAADLSNAKIRDIHCEGSIEEVLNVLFKDSGISYVIKDNEVILKSTPVVVATPQQSN KIVVKGNIRDTLGESVIGATIMEKNNAQNGSISDINGDFSLSVSPGAVIVISYIGYVTQE LKAIAGAPLKVVLKDDSRTLDEVVVVGFGSQKKANLTGAVSSIKMDEIIGDRPIMTASDA LQGTVPGLLVSNSGNAPGSGKSFQLRGAYSVGIKNSDGSYGANVAPLILIDNVEGSLDML NPEDIETVTVLKDAASAAIYGARAAGGVVLVTTKRPKEATAFRLNYNNNFGFATATNLPK QASLMDYLQAYQDGGYSDAYWSYGSPSVSKWKEYLTQYRQDPSSIKTVGDGIFADTDGAL YYLNEHDPYKNFMETSFQMNHNLSVSGGTDKLRYRMSAGYVSTDGVLITDKDTYERLNIN SYISADITKWFTQELTMSYARTNQSQPNSGLGSMFGSNQVSYQPEGNMPSDVCSTISQDL PFNTPRNQVLLANKWKKSYDNPRVFVKSILKPFKGFEAVFEYTFDKNMYDYNFYTGKTQY TDIQGGNNIWNAAKDYLQKEKQFTDYNAFNIYGTYKFDLNKDHHFSVMAGFNQESKYTEG VNVLSYNQAVVEVPALGSGTGDLKATDSYNEYSVRGGFFRVNYNYMDKYLLEVNGRYDGS SKFPKDSRFGFFPSVSLGWNVAQEKFMEVTRNYIDGLKIRASYGVIGNQNVVNYAYFPTM SVSNKYNGWLSGGDYVTAINSLPNLVSTSFTWEKVATTDIGLDLNMFGNRMNVVFDWYQR DTKGMLAPGMQLPAVVGASSPFQNTADMRTRGWELAVNWRDRIGKFNYRVGFNLSDSYSE ITKYDDNAASKLLSNFYPGQRLGEIWGYEVDGFYTVDDFVDTNSWKLKDGVASIKGVSPR PGDLKFKNLRDDDKSTNQIDSGDGTLDNPGDQKVIGNSLPKYLYGITLGANYKGFDLNIM MQGTGQRDAWIANNLVFPMYIYSVNDIKYQPLFDGLTDYWRPVDAANGDYTAVNPNAKYP RMYGQNPTVGSNYGRKTDKYLSNAAYFRIKNVTLSYTVPKTWISRIGLNQLKGFVSVENL ATFSSLPSGIDPETLSWNYPAFRTVSFGINFTL >gi|226332007|gb|ACIB01000049.1| GENE 45 57476 - 57883 348 135 aa, chain + ## HITS:1 COG:no KEGG:BF4473 NR:ns ## KEGG: BF4473 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 135 1 135 649 279 100.0 2e-74 MKKIYLSLAILAGIGLAGCNDSFLEHAPVTSLTENNAFRSYDNFKSFAWPCYEIFKDNNI ANTINGTGQGSCYAGDMNAGYLESRANESGNDYAFGRVQSVASGNGWGFSGTFRRANILL ANIDKSEMTDAEKDH >gi|226332007|gb|ACIB01000049.1| GENE 46 57920 - 59425 1280 501 aa, chain + ## HITS:1 COG:no KEGG:BF4473 NR:ns ## KEGG: BF4473 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 501 149 649 649 1048 100.0 0 MELINRFGAVPWVNTALNENSPEAYGPRVDREIVADSVLNRLKWAEANIGDFEKQDGANT INRDCIRAAISRFALREATWRKYHGIDGAQKFFDECIRVSRLLMNDYPTLYYGTDGQPAA 
GYGEMWTTEDLGKVPGVILYMEFVQDIKMANFSALEHMDSHNVEMNQHTVDLYLCKDGKP IATSANYHGDKTPYATFRDRDPRLYHVVMPPYKVKAKVKTKEDPRTWDYTDDPADREYID IMGPNESCDNPGIGMKRLPGQNWSASLVPSSPNFMGGIGATGFVRSRSGYYFWKNWSNWE TNRNGGVTLNTSDKPIFKIEEVLLNYAEAMCETGQFTQAVADESINKLRRRAGVADMKVA DIDDSFDPNRGRYYPKGNEQGVLVDPVLWEVRRERIVELMGEGFGFYDIRRWRMAPWFLN RQFKGMWMTKDKFRHGAQFLLNETTGGPDPADGAMTEGYIYLQPDPIKAGEGWQERYYLY EVPTQEIILNPALAPNNPGWE >gi|226332007|gb|ACIB01000049.1| GENE 47 59531 - 61123 1115 530 aa, chain + ## HITS:1 COG:STM0035 KEGG:ns NR:ns ## COG: STM0035 COG3119 # Protein_GI_number: 16763425 # Func_class: P Inorganic ion transport and metabolism # Function: Arylsulfatase A and related enzymes # Organism: Salmonella typhimurium LT2 # 43 510 24 469 497 184 28.0 4e-46 MLSVVKLKPNGLETNPRMNQKIIYSSALLVGLGSLQAFAHKEKAHTPQKPNIIFIMCDDM GYGDLGCYGQPYISTPNIDNMAREGMRFTQAYAGSPVSAPSRASLMTGQHTGHCEVRGNK EYWTQASTVMYGDNKEFSVVGQHPYDPEHVILPEIMKDNGYTTGMFGKWAGGYEGSASTP DKRGIDEYYGYICQFQAHLYYPNFLNRYSPSLGDTGVVRVVMEENIKYPMYGPDYHKRTQ YSADLIHQKAMEWIEKQDGEQPFFGIFTYTLPHAELVQPEDSILNHYKTQFADDKAFGGQ KGSRYNAITHVHAQFAGMITRLDYYVGEVLKKLEEKGFDENTIVIFTSDNGPHEEGGADP TFFGRDGKLRGLKRQCYEGGIRVPFIVRWPGQVAAGTVNDHQCAFYDVMPTLCDLTGVKN FTKKYVNKKKEADYFDGISFAPTILGKKNQKKHDFLYWEFNETNQIGVRMGDWKMVVKKG TPFLYNLATDIHEDHNVAAENPDIVKKMVGIIHQQHTENPNFKVTLPPTM >gi|226332007|gb|ACIB01000049.1| GENE 48 62053 - 64551 1997 832 aa, chain - ## HITS:1 COG:no KEGG:BF4469 NR:ns ## KEGG: BF4469 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 832 1 832 832 1647 100.0 0 MKKFLPDLIAILAFIVISFIYFFPAITEDRILFQHDTVAGAGAGQEAKEYYERTGERTRW TNALFGGMPTYQMSPSYDSTEPLTFVQKVYHLFLPNYVWLTFIMMLGFYILLRAFGIPAW LAGLGGIIWGFSSYFFILIAAGHIWKFITLAYIPPTIAGIVLAYRKKYLLGGIITALFMA MQILSNHVQMTYYFLFVILFMVGAFFEDAWRKKELPQFFKATGVLIVAGLIGVSINLSNL YHTYEYSKETMRGKSELKYEGAAAKQTSSGLNRDYITQWSYGIGETFSLLVPNVKGGASV PLSRSEKAMEKANPMYSSLYSQLTQYFGDQPMTSGPVYVGAFVLMLFILGCFIVKGPMKW ALLGATIFSILLSWGKNFMGLTDFFIDYIPMYNKFRAVSSILVIAEFTIPLLAILTLKEI 
LTKPELLKEKLKYIYISFGLTGGLALLFAIAPRLFFPTYIPGNEMAALQNALPADQLSPI IANLEEMRVHLFTSDAWRSFFIVTIGTLLLLAYNAKKLKATWTVAAIALLCLGDMWSVNK RYLYDEQFIPKSEQTATFRKTQTDELILQDPSLDYRVLNFAGNTFEENNTSYWHKSVGGY HAAKLRRYQEMIDHHIAKEMQAAYQEVATAGGQMDSVNAAKFPVLNMLNTKYFIFPAGQQ GQTVPIENPYTFGNAWFIDKIQYVNNANEEIDAIGQVDLQQTAIVDSKFKEALKGVNEGY KDSLSTIRLTSYEPNQLVYETSSPQDGIVVFSEIYYPGWTATIDGKPADIARADYILRAM NVPAGKHTIEMRFDPQSLHITEGIAYGAMALLLVGVIILIWIYRKKYSENSK >gi|226332007|gb|ACIB01000049.1| GENE 49 64673 - 66091 1468 472 aa, chain - ## HITS:1 COG:XF1037 KEGG:ns NR:ns ## COG: XF1037 COG0499 # Protein_GI_number: 15837639 # Func_class: H Coenzyme transport and metabolism # Function: S-adenosylhomocysteine hydrolase # Organism: Xylella fastidiosa 9a5c # 33 472 1 446 446 649 69.0 0 MSTELFSTLPYKVADITLADFGRKEIDLAEKEMPGLMALREKYGESKPLKGARIMGSLHM TIQTAVLIETLVALGAEVRWCSCNIYSTQDHAAAAIAASGVAVFAWKGETLADYWWCTLQ ALNFEGGKGPTVIVDDGGDATMMIHVGYEAENNAAVLDKEVHAEDEIELNAILKKVLAED KERWHRVAAEVRGVSEETTTGVHRLYQMQEEGKLLFPAFNVNDSVTKSKFDNLYGCRESL ADGIKRATDVMIAGKVVVVCGYGDVGKGCSHSMRSYGARVLVTEVDPICALQAAMEGFEV VTMEDACKEGNIFVTTTGNIDIIRIDHMEQMKDQAIVCNIGHFDNEIQVDALKHYPGIKR VNIKPQVDRYYFPDGHSIILLADGRLVNLGCATGHPSFVMSNSFTNQTLAQIELFNKKYD INVYRLPKHLDEEVARLHLEKIGVKLTKLTPEQAAYIGVSVDGPYKADHYRY >gi|226332007|gb|ACIB01000049.1| GENE 50 66278 - 67846 1133 522 aa, chain + ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 225 522 4 312 328 179 34.0 2e-44 MGHYEHKTKEELLEIVCKLEKKVEGLSSELDSLKKGRFRERYSTRILDALPDMLTVFDHN ANIVELASSPTTNHVEGTTSDSIINSNVKDIVPEEAYESVRHNMDKVIHTGKSSTAEHSL MLDGVLHHYENRIFPLDDQYLLCMCRDVSQETEMAKINETQRSEIVRLNSLMNDILNNIP VYLFVKDTGNDFRYLYWNKAFAEHSGISVQRAVGSTDADIFPDKKDAEHFREDDLKVMKL GRIEYLENYTTMSGEVRTVTTMKTVVPSGGQHPYIIGVSWDVTEIKKTEKELIAARIKAE EADRMKSSFLANMSHEIRTPLNAIVGFSKLIIEADNECEKRQYADIVEHNSTVLLNLFND ILDLSALEAGSLELSYHPVRLKDICLQLYELHVKSVKPEVRLVLDEVDSQLCIQGDWERI 
SQVLVNLITNAIKFTSVGEIHFGFQKKADMVQFYMSDTGKGIPAERVATIFQRFGKIDDF VQGTGLGLTICRMLVEKMGGRIWVRSKQGEGTVFYFTLPMAK >gi|226332007|gb|ACIB01000049.1| GENE 51 67902 - 69578 1334 558 aa, chain - ## HITS:1 COG:MTH657 KEGG:ns NR:ns ## COG: MTH657 COG0318 # Protein_GI_number: 15678684 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II # Organism: Methanothermobacter thermautotrophicus # 10 553 1 545 548 729 62.0 0 MIALHITKNMQLYERTLGQWLEHWAETTPDKEYIVYSDRNLRFTWKQFNKRVDDMAKGLI AIGVERGTHVGIWAANVPDWLTLLYACAKIGAVYVTVNTNYKQAELEYLCENSDMHTLCI VNGEKDSDFVQMTYTMLPELKTCERGHLKSERFPYMKNVIYVGQEKHRGMYNTQEILLLG DNIEDTELNELKSQVDCHDVVNMQYTSGTTGFPKGVMLTHYNISNNGFLTGEHMKFTGND KLCCCVPLFHCFGVVLATMNCLTHGCTQVMVERFDPLIVLASIHKEKCTALYGVPTMFIA ELNHPMFDMFDMSSLRTGIMAGSLCPVELMKQVEEKMYMKVTSVYGLTEAAPGMTATRID DPFDVRCNTVGRDFEFTEVKVLDPETGEECPVGVQGEMCNRGYNTMKGYYKNPQATAEVI DKNNFLHSGDLGIKDEDGNYRITGRIKDMIIRGGENIYPREIEEFLYKLDGVKDVQVSGI PSKKYGEAVGAFIILHEGVTMQASDVQDFCRNKISRYKIPKYIFFIDEFPMTGSGKIQKF KLKDLGLKLCEEQGIQII >gi|226332007|gb|ACIB01000049.1| GENE 52 69588 - 70163 567 191 aa, chain - ## HITS:1 COG:MTH659 KEGG:ns NR:ns ## COG: MTH659 COG1396 # Protein_GI_number: 15678686 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Methanothermobacter thermautotrophicus # 7 190 6 189 190 194 57.0 7e-50 MDTSKIVGEKIKSLRENKGISIEELAERSGLAIEQIERIENNIDLPSLAPLIKIARVLGV RLGTFLDDQDETGPVVSRKMEATDTISFSNNAIHSRKHMQYHSLSKSKADRHMEPFIIDV APTQDSDFVLSSHEGEEFIMVMEGVMEISYGKSTYLLEEGDSIYYDSIVPHHVHAYEGQA AKILAVIYTPI >gi|226332007|gb|ACIB01000049.1| GENE 53 70322 - 70591 446 89 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53715743|ref|YP_101735.1| 30S ribosomal protein S15 [Bacteroides fragilis YCH46] # 1 89 1 89 89 176 100 3e-43 MYLDAAKKQEIFSKYGKSNTDTGSAEAQIALFSYRITHLTEHMKLNRKDYSTERALTMLV GKRRALLDYLKAKDITRYRDIIKELGLRK >gi|226332007|gb|ACIB01000049.1| GENE 54 70747 - 72546 2144 599 aa, chain + 
## HITS:1 COG:DR1198 KEGG:ns NR:ns ## COG: DR1198 COG1217 # Protein_GI_number: 15806217 # Func_class: T Signal transduction mechanisms # Function: Predicted membrane GTPase involved in stress response # Organism: Deinococcus radiodurans # 5 594 4 593 593 654 55.0 0 MQNIRNIAIIAHVDHGKTTLVDKMLLAGNLFRGNQTSGELILDNNDLERERGITILSKNV SINYNGTKINIIDTPGHSDFGGEVERVLNMADGCILLVDAFEGPMPQTRFVLQKALEIGL KPIVVINKVDKPNCRPDEVHEMVFDLMFSLDATEEQLDFPTIYGSAKNNWMSTDWKEQTD SIVPLLDCIVENIPAPEQLEGTPQMLITSLDYSSYTGRIAVGRVHRGTLKEGMNVSLAKR DGSIVKSKIKEVHVFEGLGRVKTTEVSSGDICALVGIDGFEIGDTICDYENPEALPPIAI DEPTMSMLFAINDSPFYGKDGKFVTSRHIHDRLTKELDKNLALRVRKSEEDGKWVVSGRG VLHLSVLIETMRREGYELQVGQPQVIYKEIDGVKCEPIEELTINVPEEYSSKIIDMVTRR KGEMTMMENTGERINLEFDMPSRGIIGLRTNVLTASAGEAIMAHRFKEYQPFKGDIERRT NGSIIAMESGTAFAYAIDKLQDRGKFFIFPQEEVYAGQVVGEHAHEKDLVVNVTKSKKLT NMRASGSDEKARLIPPVQFSLEEALEYIKEDEYVEVTPKAMRMRKVILDETERKRANKS >gi|226332007|gb|ACIB01000049.1| GENE 55 72780 - 74807 1898 675 aa, chain - ## HITS:1 COG:no KEGG:BF4257 NR:ns ## KEGG: BF4257 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 675 1 675 675 1378 99.0 0 MKIKMKYIVAQACLVAGASIALTSCNDFLDREPLDQVTPQEYFQTADHLAAYTISKYNSM FSTHSGWNAGTVNNDGGTDNMVVGGYNSTYYEKGIWKVPSNNGNWSNGLSLARYCNYFFE QVLPKYEAGKISGAEDDLKHYIGEMYFMRALIYYNQLRKYGDYPIITEVLPDDEAILIEK SVRQPRNKVARFILEDLDKAIGLMKNQGFMNNNRLNKQCALLIKSRVALYEATFEKYHQG TGRVPGDETWPGKSVHPNFSLDVTAEINFFLDQCMDAASQVADAITLTENTGVFNPLSDS EYSGWNPYFEMFSAEDMSGYSEVLFWKDYLRSGSINISHGAPNYIYSGGNNGMLRSYVQT FLMENGLPWYASNSGFKGDSRIQEEKEGRDQRLQLFLFGEKDKQAPRADGDGTMPEFGEP ALFELQEVRDLTGYRIRKCLTYKKDQIISGSDQSTTGCIIYRGVEAYLNYLEAYYLRNGK VDGKAAQYWTAIRKRGGVDTDYEKTIRNTDMSKEVDWAKYSGSTLVDATLFNIRRERRCE FIGEGMRWDDLMRWRAMDQLLTTKYIPEGFNFWDEAYDDYVAMKNDKGEPKYKIVSDGSS TANMSAKELGKYVRPYSIVKANNAVFDGYTFAKAYYLYPVPIRQMELLSPDRSVNNSVLY QNPYWPSKTSEPAIE >gi|226332007|gb|ACIB01000049.1| GENE 56 74822 - 78082 3011 1086 aa, chain - ## HITS:1 COG:no KEGG:BF4460 NR:ns ## KEGG: BF4460 # Name: not_defined # Def: hypothetical protein # 
Organism: B.fragilis # Pathway: not_defined # 1 1086 1 1086 1086 2159 100.0 0 MTEKTNLFPNLIRFRETNRLKMAIAASIMLWCATPQQAAADTYEKHEIASVQQQKVKTTG TVLDQNGEAMIGVSVKVKDNATMGTITDLEGKFSIDAPKGATLEISYIGYKTVTVKAEGT ALHITMKEDAEVLDEVVIVGYGSQKKVNVTGAVGMVNSEVLEARPVQNVSQALQGVVPGL NLSVGNSGGALDSSMSINIRGAGTIGDGSGSSPLILIDGIEGDLNSVNPNDIENVSVLKD AASASIYGARAAFGVILVTTKSGKSGKAKVSYNGNVRFSDALCVPEMMDSYQFALYFNRA AENAGDSGPFSQEALDRILAYQAGTLKETMTMNEQTRKWQAYGGANANTDWFKEFYNDWV PSQEHSLSISGGSEKTQYTISGSFLDQNGLLRHGSDNFQRYTMNGKITSQIADWFTVTYS TKWTREDFDRPSYLTGLFFHNIARRWPTNPAYDPNGHPVDGMEIEQLENGGKQINQKDLN TQQLQFIFEPIKNWRINVEGSLRTTNTNEHWDVLPVYAYNADNEPYLISWNGGALGLSQV NEYSYKENYYTTNIYSDYFKQFDSGHYFKVMAGFNSELYKTRYVQAQKSTLISSSVPTIN TATEDPKAWGGYAHNAVAGFFGRINYNYKDRYMVEANGRYDGSSRFIGDKRWGFFPSFSA GWNVAQEPFFERIAEKCSIGTLKLRASWGQLGNTDTKDAWYPFYQTMPTGSNYGWLLNGA LPNYANNPGIVSMKKTWETIETWDVGLDWGLFNNRLTGSFDYFVRYTYDMIGPAPELAAS LGTGVPKINNADMKSYGFELELGWRDRIRDFSYGVKFVLSDSQQKILKYPNEDYNIGTYY KGQKLNNIWGYKTIGIAQSQEEMDAHLAKVDQSALGSKWGAGDIMYADLDGDGKISTGSN KLGDTGDRVILGNSTPRFNYGLTIDASWKGIDFRAFFQGIGKRDYWLQGPYFWGSTGLGQ WQAAGFKEHWDFWRPEGDPLGANTNAYYPRVARNGGKNTNVQSRYLQNAAYCRLKNIQIG YTLPKTWTEKAGMSSVRVYVSGDNLLTFSDITGIFDPEAIGSTYDANNGKLYPLQRVISV GLNVNF >gi|226332007|gb|ACIB01000049.1| GENE 57 78099 - 78290 68 63 aa, chain - ## HITS:0 COG:no KEGG:no NR:no ITFGQTNLINKSAWFRKEQEKNDIHTVTSNNYAYEVEKTNPLQTLFRKQNKIRNPKTNYQ INV Prediction of potential genes in microbial genomes Time: Tue May 17 23:56:12 2011 Seq name: gi|226332006|gb|ACIB01000050.1| Bacteroides sp. 3_2_5 cont1.50, whole genome shotgun sequence Length of sequence - 16806 bp Number of predicted genes - 9, with homology - 9 Number of transcription units - 5, operones - 3 average op.length - 2.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 331 - 389 14.0 1 1 Op 1 . - CDS 413 - 2902 2174 ## BF4459 hypothetical protein 2 1 Op 2 . - CDS 2992 - 6039 2088 ## BF4458 hypothetical protein - Prom 6061 - 6120 5.9 + Prom 5990 - 6049 2.2 3 2 Tu 1 . 
+ CDS 6077 - 6382 129 ## BF4253 hypothetical protein + Term 6462 - 6511 1.4 - Term 6480 - 6517 2.0 4 3 Op 1 . - CDS 6685 - 7812 1070 ## COG4299 Uncharacterized conserved protein 5 3 Op 2 . - CDS 7825 - 9642 1141 ## BF4251 hypothetical protein - Prom 9662 - 9721 4.9 - Term 9732 - 9795 2.2 6 4 Op 1 . - CDS 9870 - 10349 590 ## BF4453 hypothetical protein 7 4 Op 2 . - CDS 10392 - 12245 1753 ## BF4452 hypothetical protein 8 4 Op 3 . - CDS 12275 - 15430 3183 ## BF4451 hypothetical protein - Prom 15462 - 15521 3.4 - Term 16557 - 16598 10.4 9 5 Tu 1 . - CDS 16628 - 16804 75 ## COG1866 Phosphoenolpyruvate carboxykinase (ATP) Predicted protein(s) >gi|226332006|gb|ACIB01000050.1| GENE 1 413 - 2902 2174 829 aa, chain - ## HITS:1 COG:no KEGG:BF4459 NR:ns ## KEGG: BF4459 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 829 1 829 829 1710 100.0 0 MKKLYSLLLTALFALPAAHVHATDYTQGLSIWFDTPNTLQGRAIWYGSRPDLWKGESKPE SAGDTARNPDANWESQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYW NVNKQSAHVLKEIRQAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYI ETGLSTVNMSDYKRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQN LTFSYSPNPVSTGSMSADGANGLAYTAHLDNNGMQYVVRIHAIAKGGTLSNANGKITVKD ADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQHYDDY AALFNRVKLQLNPDAQSANLPTGKRLQNYRKGQPDFYLEELYYQFGRYLLIASSRPGNMP ANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLIDFIRTLVKPGQKTAQ AYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVWEYYDYTRDKKFLKET GYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEHGPIDEGTTFVHAVIREILQDAIEASK VLGVDSKERKQWQEVLTHLAPYKVGRYGQLMEWSKDIDDPKDEHRHVNHLFGLHPGHTLS PITTPDLAKAARVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNLLKNGTLDNLW DTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALPDAWKDGSISGICAKGNFEVDLSW KNGQLAEATIFSKAGEPCTVRYGDKTLSFKTSKGKVYKLALDADRLVIK >gi|226332006|gb|ACIB01000050.1| GENE 2 2992 - 6039 2088 1015 aa, chain - ## HITS:1 COG:no KEGG:BF4458 NR:ns ## KEGG: BF4458 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1015 1 1015 1015 2118 99.0 0 
MRLHVLSLTCLLLTVGGTGPVAAQKPQNYPYGVPSEPWEESFGNHRAVLQIEKPAQIANL DFQWRRPDKDAGHRRFLIIHAATGDTIRNIRRIEVNDEHCRLQFGPVEQKGTYYFYYLPY QVQKGYGFYSGGYLPKENEPDAAWQAQGGSTLKSTRAKVVRVESRQAFDSFYPMEVAATA REKENYINRHKASLYLFAEDRRFPIRMRSNLPTKWLADKQGKLFRGEAAPNEYYTFQIGL WAAVNQADKIAYRASSLKCGREIIPATAITCFNVEGTDPYGKAFKKEVNVPKGEVQALWF GIDIPDGQKEGIYTGTITLSDASGAQSSIPLSIRIAGKALPDRGDSELWRCSRLRWLNST LGIADTPTAPYTAMTVNENRIGCLGRSITIDEGTGLPAQIRSWNNDVLSSPIEFVIQTAG GVKSLKAVPELTERTEGHVAGNWKAEDEDMTVSCKAIMEFDGWINYIYTITPKKQIQVKD VRLVLPVRNEIGTYFLGMGLPGQPTPQQYDGKWDAPEKTVNNFGVSIPTSKEQQWLWPFD SFWIGNEHAGIHCEFRGSTYSGPLLNLYRPAYPESWFNGGKGGFAIRKEADGVKAVAYSG ERMLEAGSSITFDFAMIITPVKMLDMKSQFTDRYYHNGPKPTPSQADIEAGVRIINVHQG NDYNPFINYPFLTVDKMKEFTKEWHARGCKIKIYYTLRELSNATAEIWAIRSLGNEILRG GNGGGFPWCREHFVTNYTPQWYEHFDNADKQGTTADASILTAEGDSRWYNYYIEGLRWMV QNMDIDGIYLDDVSFDRRILKRMRRAMESVKPGCLIDLHSNTGFSRGPANQYTEFFPYVD KLWFGESFLYDKMTPANWLVESSGIPFGLSGDMLYRGGNAWLGMQYGMTVRYPWYTEGVN CDPRQVWKIWDSFGIAEATMLGFWEEHPAVSTSDEAVKVTVYRKPEKVLLSLGNYSDEVK TVKLNIDWKQIGLNPQKAKLTTPEIPDMQQAHEWNVDTPIVTAPRKGWIIYLTSE >gi|226332006|gb|ACIB01000050.1| GENE 3 6077 - 6382 129 101 aa, chain + ## HITS:1 COG:no KEGG:BF4253 NR:ns ## KEGG: BF4253 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 101 1 101 101 192 99.0 3e-48 MRVPLPGISREQSVTLIIGPPQNRFFRTDWQSGRKKRALRVKSDITFPIFELVCFYFEGG MTVLPVNGSFSGLLPAFLSVYNALSAMVNGRKNSSAKTRFG >gi|226332006|gb|ACIB01000050.1| GENE 4 6685 - 7812 1070 375 aa, chain - ## HITS:1 COG:all1887 KEGG:ns NR:ns ## COG: all1887 COG4299 # Protein_GI_number: 17229379 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Nostoc sp. 
PCC 7120 # 11 375 2 375 375 105 27.0 1e-22 MKKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLLGLD PKHLYLYSNTLQAIATGYLIAAIIQLHCNFRWQLIVTALLLLIYWIPMTFLGDFTPEGNF AEKVDRLVLGHFRDGVFWNEDGSWSFSAHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKD NRRKVVQTLLIIGISLIAFSLIWSLQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDYK GHSRGLNWLKIYGMNSITAYILGEVINFRCIAASVSYGLEQYLGGYYQVWLSFANYLIVF LILRIMYRQKIFLKI >gi|226332006|gb|ACIB01000050.1| GENE 5 7825 - 9642 1141 605 aa, chain - ## HITS:1 COG:no KEGG:BF4251 NR:ns ## KEGG: BF4251 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 605 1 605 605 1260 99.0 0 MKKYLHILPACFLFYAAAHAQQKDTVYVTDFGAVPYSYENCVTQIQAAIDECKRTGAKVL SLPEGRYDIWPEGATRKEYYISNTSTEQECPSKVKTVGLMLHEIDDLTIEGNGATLMYHG KMTTIALEHCNGVRINNLHIDFERPAGSEIQYRKVTGGETEVTLHRDTRYEIVNGKIRLY GEGWRSNRNHCIEYDPDTESFTYSQGWNTLSASDAREIAPGIVRFNTPAEFMPKAGNTLT VRDIIRDQVGLFILESKNITLSRLQMHYMHGLGIVSQYTENITMDRVKCAPRPDSGRLLA ASADMMHFSGCKGKVIIDSCYFAGAQDDPVNVHGTNLRALEKIDAQTLKLRFMHGQSYGF NAYFKGDTVAFVRAATMERFASATVRDVRRISDRIVEVRFDRDIPTSLELNHDCVENMTC TPEVEIRNCYFTRTSTRGTLVTTPRKVVIENNTYYKTGMSAILIEADAEGWYESGPVKDV LIKGNTFIDCAYNGGPGHAVIAIHPSNKIIDAERPVHQNIRIEDNTFRTFDYPVLYAKST AGLLFRNNTIGRTETFPAASGNPYVFYLNGCKKAVIEGTVFEGETPRQSIKTENMKRKDL KTTIK >gi|226332006|gb|ACIB01000050.1| GENE 6 9870 - 10349 590 159 aa, chain - ## HITS:1 COG:no KEGG:BF4453 NR:ns ## KEGG: BF4453 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 159 1 159 159 312 100.0 2e-84 MKRIKLSDTKWFMLLLLMCVPIISSCSKEDLPAYEEAEITKVGAYHRFYSGDKDAITGEN IVAEKELDRTNNIDSEHGVATAVFTIPAAGGKFTEAERAKVSLSNLVVYVNVSTAARVTP LDGSPKFGVPADWTREHKYSVMAADGTKKIWTVKVTLNK >gi|226332006|gb|ACIB01000050.1| GENE 7 10392 - 12245 1753 617 aa, chain - ## HITS:1 COG:no KEGG:BF4452 NR:ns ## KEGG: BF4452 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 617 1 617 617 1262 99.0 0 
MKFIKYITPALFCAALTLQSCEHFLDTKPMESYSEDLVWSLKSTADAFVLQTYNNVLPFY HDIRTEEQWTLNSVMRQNCPNEARDLMNRDWSWGFNQFGTIRRCNLIIEKSQASTGLSDA DKKALVAEGKMLRAMTYYYIAKHCGRVIWVDHVLAETDEFNLPLTESIDKTYDLILKDID DAIAGLPETSLQGRINANAARALKSEVCLTAAAYSTDDTRKKSLFEQAVAAVDGIQGYTL DSNYGSMFNQDGARTSPEIILAQYYSKDNTNGAGTLMQEMLPNQSNKRLEDYGGRPFFNQ DLVFECWLEHSPSQNLVDDYLVIDQITNQGVKWNESTQFVNNTTPLSNADVADMAVDAKE LDDNSLAYQTNSNAPDVQINDFMYKNRDQRFYHSIQYDSCMFYNELITLHKGGNLHRTAV DGKTPGNGYIPITNYIWRKYIYVNAERVFWNNPTDYHYVIFRYGRALLNKAEALLGLAQY DASKVSNAVATFNRTRTTHGNLPPSIATSLADAWHDYKIERHVDLALEGDYYWSLLRWGM YGGEANHGKPAGDIIPELSSPATFVEISQDRHRMFVGTVGYTNADRTFSKKRYLFPITQS IINANSAINDSDQNPGW >gi|226332006|gb|ACIB01000050.1| GENE 8 12275 - 15430 3183 1051 aa, chain - ## HITS:1 COG:no KEGG:BF4451 NR:ns ## KEGG: BF4451 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1051 1 1051 1051 2076 100.0 0 MTKRTCQFPKHTCLREINFLKIAGISLLLFCTTSQIAVADSYEKNAVTATQQSKTEKITG KIVDESGEAIIGASVKVQGSTIGTITNMEGEFMIPNVPNKAVLEISYIGYKPLEVAVGKS KDLRITMEEDTKTLDEVIVVGYGTTSKRKTTAAIASVNTEDIIKAPTANITQSLAGRAPG LLVTTSGGGLNNFSSVSIRGGGTPLYVIDDVISEERDFRNLNAEDIDQITILKDAASTAV YGARAANGIVMIVTKQGKAGKMSVNYNFNYNWSQPANMPNKLDAHDAAFYKNMSMTNDGL APAYTDDELELFRNGSDPRRYPNTDWQKLCLKNFAPEMQHTLTVTGGSEKIKAYTSLGFY DQKSLYKFDVNSFKRYNFRTNIVADFKEIGLKVTSSIEAYKTDLRSPNAKSGDSYYHTWG HIQNKAPWEIAYNPNGQIFNTPDNPLMEISPDAGYTKNENLSAIANLALEWSVPYVPGLR LKALGNYRINNDKSKSWKKSPLAYDWDGNPNDPGKPSLSKSYSNWSSYTVQGFANYDRTF NQVHTISATAGIEAYKLFKDDASLSREEYLLDVDQIGAGPVSTAKNSSSEGEEARAGVVA RLKYDYASKYVAEASLRYDGSDNFPRGKRWGTFYAGSLAWVISEESFWQTLKDRHIFDQF KVRASYGEIGSDAIGRYAYLQSYGLNDRGYLLNGSWYPGFSEGALVSKDITWYTTRDFNI GFDFGSLNNRLSGSVDYFRKSTKGYLTSPSAVAYTDPLGIALPQVKSNGEFIRQGAEFIL QWKEKRGDFEYTLSGNFTYFDQYWNINPNEAETDTKNPYKRTTQAKGYWGIGYDCLGYYQ NQEDIMNSPKRQSSVNLGAGDLKYNDFNGDGIIDGSDQHRIGKNSMPRGQYGFSADLNYK GWFMNMLWQGATPADLYMGGMIQGSQSGSGYPPVIYDFQTDVWTPNNTGARYPRLRSTAS YNGSNNYGSSDFWLINTGYLRLKTLSIGYDFKHKLLKRVAWMNKCNVSLNGYNLLTFSKA NKFDIDPEIGDGNLYTYPVSRVYSISVNVGF >gi|226332006|gb|ACIB01000050.1| GENE 9 16628 - 
16804 75 58 aa, chain - ## HITS:1 COG:PM1542 KEGG:ns NR:ns ## COG: PM1542 COG1866 # Protein_GI_number: 15603407 # Func_class: C Energy production and conversion # Function: Phosphoenolpyruvate carboxykinase (ATP) # Organism: Pasteurella multocida # 1 57 481 537 538 93 77.0 8e-20 PTELPGVDPKILDPRDTYADPAQWNEKAKDLAGRFIKNFAKFTGNEAGKKLVAAGPKL Prediction of potential genes in microbial genomes Time: Tue May 17 23:57:39 2011 Seq name: gi|226332005|gb|ACIB01000051.1| Bacteroides sp. 3_2_5 cont1.51, whole genome shotgun sequence Length of sequence - 153270 bp Number of predicted genes - 106, with homology - 105 Number of transcription units - 55, operones - 30 average op.length - 2.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 1 - 1429 1172 ## COG1866 Phosphoenolpyruvate carboxykinase (ATP) - Prom 1665 - 1724 4.2 + Prom 1399 - 1458 7.5 2 2 Tu 1 . + CDS 1694 - 2347 512 ## COG0035 Uracil phosphoribosyltransferase + Term 2376 - 2434 9.3 + Prom 2427 - 2486 5.7 3 3 Tu 1 . + CDS 2581 - 3342 642 ## COG0584 Glycerophosphoryl diester phosphodiesterase + Term 3389 - 3450 -0.4 + Prom 3606 - 3665 7.8 4 4 Tu 1 . + CDS 3808 - 5388 780 ## BF4442 hypothetical protein + Term 5567 - 5607 2.0 - Term 5552 - 5598 11.4 5 5 Op 1 23/0.000 - CDS 5627 - 6637 908 ## COG1013 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit 6 5 Op 2 . - CDS 6641 - 8491 1747 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit - Prom 8657 - 8716 2.9 + TRNA 8998 - 9072 51.3 # Arg CCT 0 0 + Prom 9305 - 9364 10.2 7 6 Tu 1 . + CDS 9457 - 10698 700 ## BF4235 putative bacteriophage integrase 8 7 Tu 1 . - CDS 10945 - 11832 666 ## BF4437 transcription regulator - Prom 11880 - 11939 6.6 9 8 Op 1 . + CDS 12215 - 12775 496 ## BF4435 hypothetical protein 10 8 Op 2 . + CDS 12809 - 14524 1433 ## BF4231 hypothetical protein 11 8 Op 3 . 
+ CDS 14604 - 15668 944 ## BF4433 hypothetical protein + Term 15684 - 15727 8.4 + Prom 15670 - 15729 2.0 12 9 Tu 1 . + CDS 15751 - 16734 780 ## BF4229 hypothetical protein + Term 16787 - 16829 0.4 + Prom 16896 - 16955 3.3 13 10 Op 1 . + CDS 16982 - 20167 1232 ## BF4431 hypothetical protein 14 10 Op 2 . + CDS 20204 - 24727 2069 ## BF4430 hypothetical protein 15 10 Op 3 . + CDS 24768 - 25058 333 ## COG0776 Bacterial nucleoid DNA-binding protein + Term 25134 - 25176 8.5 - Term 25288 - 25340 15.5 16 11 Op 1 . - CDS 25359 - 28382 3319 ## COG0342 Preprotein translocase subunit SecD - Prom 28402 - 28461 4.1 17 11 Op 2 . - CDS 28562 - 29029 110 ## BF4426 hypothetical protein - Term 29091 - 29124 -0.5 18 12 Op 1 . - CDS 29194 - 30291 882 ## BF4425 hypothetical protein 19 12 Op 2 . - CDS 30295 - 31194 661 ## COG0705 Uncharacterized membrane protein (homolog of Drosophila rhomboid) 20 12 Op 3 . - CDS 31175 - 31849 619 ## COG0705 Uncharacterized membrane protein (homolog of Drosophila rhomboid) - Prom 32039 - 32098 9.7 + Prom 31986 - 32045 10.4 21 13 Op 1 . + CDS 32097 - 32369 290 ## COG0776 Bacterial nucleoid DNA-binding protein + Term 32392 - 32440 7.1 22 13 Op 2 . + CDS 32457 - 34250 1918 ## COG0018 Arginyl-tRNA synthetase + Term 34277 - 34336 14.1 - Term 34356 - 34413 17.5 23 14 Op 1 . - CDS 34436 - 35419 754 ## BF4420 hypothetical protein - Prom 35454 - 35513 6.0 - Term 35496 - 35557 5.1 24 14 Op 2 . - CDS 35591 - 37936 2339 ## COG0550 Topoisomerase IA - Prom 37994 - 38053 7.7 25 15 Op 1 . - CDS 38215 - 39918 1539 ## BF4418 putative TonB-dependent receptor 26 15 Op 2 . - CDS 39925 - 42933 3116 ## COG0457 FOG: TPR repeat 27 15 Op 3 . - CDS 42973 - 43494 395 ## COG1051 ADP-ribose pyrophosphatase 28 15 Op 4 . - CDS 43543 - 43668 109 ## - Prom 43709 - 43768 3.6 + Prom 43610 - 43669 4.5 29 16 Tu 1 . + CDS 43694 - 45352 917 ## BF4213 hypothetical protein + Term 45434 - 45466 3.9 + Prom 46032 - 46091 6.5 30 17 Tu 1 . 
+ CDS 46246 - 47388 749 ## BF4212 hypothetical protein + Prom 47424 - 47483 5.4 31 18 Op 1 . + CDS 47556 - 48749 549 ## BF4412 hypothetical protein 32 18 Op 2 . + CDS 48814 - 49044 57 ## BF4411 hypothetical protein 33 18 Op 3 . + CDS 49106 - 49465 228 ## BF4410 hypothetical protein + Prom 49488 - 49547 4.2 34 19 Op 1 . + CDS 49636 - 50946 783 ## BF4209 hypothetical protein 35 19 Op 2 . + CDS 51000 - 52301 637 ## BF4208 hypothetical protein + Term 52332 - 52375 2.8 + Prom 52317 - 52376 3.8 36 20 Op 1 . + CDS 52486 - 53748 846 ## BF4206 hypothetical protein 37 20 Op 2 . + CDS 53806 - 55023 506 ## BF4204 hypothetical protein 38 20 Op 3 . + CDS 55045 - 56160 638 ## BF4203 hypothetical protein + Term 56221 - 56267 5.3 39 21 Tu 1 . - CDS 56221 - 57570 512 ## COG1442 Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases - Prom 57638 - 57697 6.9 + Prom 57786 - 57845 2.9 40 22 Op 1 . + CDS 57926 - 58633 362 ## BDI_3162 hypothetical protein 41 22 Op 2 . + CDS 58640 - 59857 470 ## COG0438 Glycosyltransferase + Prom 59962 - 60021 4.2 42 23 Tu 1 . + CDS 60060 - 60296 260 ## gi|255011492|ref|ZP_05283618.1| hypothetical protein Bfra3_20275 + Term 60366 - 60422 14.4 + Prom 60359 - 60418 3.9 43 24 Op 1 . + CDS 60441 - 61694 920 ## BF4206 hypothetical protein 44 24 Op 2 . + CDS 61691 - 63160 534 ## COG0641 Arylsulfatase regulator (Fe-S oxidoreductase) 45 24 Op 3 . + CDS 63157 - 64380 361 ## BF4196 hypothetical protein + Term 64431 - 64468 2.2 + Prom 64415 - 64474 3.1 46 25 Op 1 . + CDS 64517 - 66322 1397 ## COG0079 Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 47 25 Op 2 . + CDS 66319 - 66615 237 ## COG3475 LPS biosynthesis protein 48 25 Op 3 . + CDS 66579 - 66944 211 ## BF4396 putative lipooligosaccharide cholinephosphotransferase 49 25 Op 4 . + CDS 66976 - 67578 572 ## BF4395 hypothetical protein 50 25 Op 5 . + CDS 67569 - 68330 326 ## BF4394 hypothetical protein + Prom 68381 - 68440 5.9 51 26 Op 1 . 
+ CDS 68469 - 69659 721 ## BF4191 hypothetical protein 52 26 Op 2 . + CDS 69668 - 70801 801 ## BF4392 hypothetical protein 53 26 Op 3 . + CDS 70828 - 72318 822 ## PROTEIN SUPPORTED gi|90021240|ref|YP_527067.1| ribosomal protein S32 + Term 72335 - 72383 11.0 - Term 72322 - 72370 11.0 54 27 Tu 1 . - CDS 72402 - 73127 513 ## BF4390 hypothetical protein - Prom 73255 - 73314 2.5 55 28 Op 1 . + CDS 73230 - 74036 588 ## COG0483 Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family 56 28 Op 2 . + CDS 74071 - 74763 478 ## COG1040 Predicted amidophosphoribosyltransferases + Term 74777 - 74816 5.1 - Term 74768 - 74799 0.8 57 29 Tu 1 . - CDS 74810 - 75091 233 ## COG1359 Uncharacterized conserved protein - Prom 75128 - 75187 1.7 58 30 Op 1 . - CDS 75199 - 77274 1405 ## COG3533 Uncharacterized protein conserved in bacteria 59 30 Op 2 2/0.000 - CDS 77271 - 78401 399 ## PROTEIN SUPPORTED gi|15900011|ref|NP_344615.1| aldose 1-epimerase 60 30 Op 3 . - CDS 78437 - 79756 808 ## COG0477 Permeases of the major facilitator superfamily 61 30 Op 4 11/0.000 - CDS 79769 - 81838 1387 ## COG1882 Pyruvate-formate lyase 62 30 Op 5 . - CDS 81835 - 82587 308 ## COG1180 Pyruvate-formate lyase-activating enzyme - Prom 82617 - 82676 5.7 63 31 Tu 1 . - CDS 82819 - 86847 2630 ## COG3292 Predicted periplasmic ligand-binding sensor domain - Prom 86927 - 86986 7.4 + Prom 87085 - 87144 2.7 64 32 Op 1 . + CDS 87167 - 90337 2677 ## BF4380 hypothetical protein 65 32 Op 2 . + CDS 90350 - 92329 1695 ## BF4379 hypothetical protein 66 32 Op 3 . + CDS 92342 - 93655 1195 ## BF4378 hypothetical protein + Prom 93659 - 93718 1.7 67 33 Op 1 . + CDS 93807 - 96554 2175 ## BF4175 hypothetical protein 68 33 Op 2 . + CDS 96556 - 99018 1991 ## COG3533 Uncharacterized protein conserved in bacteria + Term 99036 - 99088 10.1 - TRNA 99132 - 99208 79.9 # Asn GTT 0 0 - TRNA 99238 - 99311 79.3 # Asn GTT 0 0 + Prom 99305 - 99364 3.4 69 34 Op 1 . 
+ CDS 99394 - 100383 908 ## COG0524 Sugar kinases, ribokinase family 70 34 Op 2 . + CDS 100428 - 102113 1850 ## COG0793 Periplasmic protease 71 35 Tu 1 . + CDS 102288 - 105131 256 ## PROTEIN SUPPORTED gi|163788005|ref|ZP_02182451.1| 50S ribosomal protein L33 + Term 105155 - 105195 4.4 + Prom 105158 - 105217 4.2 72 36 Tu 1 . + CDS 105309 - 107603 2224 ## COG1472 Beta-glucosidase-related glycosidases + Term 107747 - 107784 7.3 + Prom 108165 - 108224 7.5 73 37 Tu 1 . + CDS 108350 - 111346 3017 ## BF4370 hypothetical protein 74 38 Tu 1 . + CDS 111453 - 112973 1621 ## BF4168 putative outer membrane protein + Prom 113094 - 113153 1.6 75 39 Op 1 . + CDS 113208 - 114500 1138 ## COG5368 Uncharacterized protein conserved in bacteria 76 39 Op 2 . + CDS 114522 - 116243 1452 ## BF4166 xylosidase/arabinosidase 77 39 Op 3 . + CDS 116256 - 117029 773 ## COG4099 Predicted peptidase + Term 117229 - 117273 5.4 78 40 Op 1 . - CDS 117222 - 119102 1669 ## COG0821 Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 79 40 Op 2 . - CDS 119107 - 119613 731 ## COG0041 Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase - Prom 119705 - 119764 4.2 80 41 Op 1 . - CDS 119805 - 120185 569 ## COG0509 Glycine cleavage system H protein (lipoate-binding) 81 41 Op 2 . - CDS 120216 - 120896 752 ## BF4362 hypothetical protein 82 41 Op 3 . - CDS 120924 - 122402 1837 ## COG1508 DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog - Prom 122582 - 122641 3.2 + Prom 122585 - 122644 5.6 83 42 Tu 1 . + CDS 122666 - 124039 1575 ## COG0006 Xaa-Pro aminopeptidase + Term 124111 - 124140 0.5 + Prom 124500 - 124559 5.1 84 43 Tu 1 . + CDS 124622 - 125083 373 ## BF4157 hypothetical protein + Term 125113 - 125151 4.3 + Prom 125240 - 125299 9.4 85 44 Op 1 . + CDS 125484 - 127421 1591 ## BF4156 hypothetical protein 86 44 Op 2 . + CDS 127439 - 129025 1250 ## BF4155 exported protein, ATP/GTP-binding 87 45 Tu 1 . 
+ CDS 129416 - 131242 1457 ## COG3568 Metal-dependent hydrolase + Term 131250 - 131292 0.9 88 46 Tu 1 . - CDS 131469 - 133022 1302 ## COG0642 Signal transduction histidine kinase - Prom 133176 - 133235 6.2 89 47 Tu 1 . - CDS 133260 - 134933 691 ## PROTEIN SUPPORTED gi|39938628|ref|NP_950394.1| ribosomal protein L13 - Prom 135003 - 135062 7.1 + Prom 134787 - 134846 5.1 90 48 Tu 1 . + CDS 135017 - 136714 1730 ## COG1283 Na+/phosphate symporter + Term 136784 - 136824 0.4 - Term 136716 - 136779 -0.8 91 49 Tu 1 . - CDS 136792 - 136953 171 ## COG1592 Rubrerythrin - Prom 137025 - 137084 6.0 + Prom 136955 - 137014 6.8 92 50 Op 1 1/0.000 + CDS 137088 - 138761 1529 ## COG0642 Signal transduction histidine kinase 93 50 Op 2 . + CDS 138825 - 141509 2480 ## COG0474 Cation transport ATPase 94 51 Op 1 . + CDS 141627 - 142250 538 ## COG1011 Predicted hydrolase (HAD superfamily) 95 51 Op 2 . + CDS 142270 - 143211 404 ## PROTEIN SUPPORTED gi|163762565|ref|ZP_02169630.1| ribosomal protein S2 + Term 143359 - 143400 -0.9 + Prom 143831 - 143890 7.3 96 52 Op 1 . + CDS 144125 - 144913 591 ## COG1266 Predicted metal-dependent membrane protease 97 52 Op 2 . + CDS 144949 - 145587 406 ## BF4343 hypothetical protein 98 52 Op 3 . + CDS 145616 - 145843 256 ## PROTEIN SUPPORTED gi|163756262|ref|ZP_02163377.1| 50S ribosomal protein L20 + Term 145994 - 146033 0.5 99 53 Op 1 . - CDS 145971 - 146399 465 ## COG2166 SufE protein probably involved in Fe-S center assembly 100 53 Op 2 . - CDS 146409 - 147398 910 ## BF4340 leucine aminopeptidase precursor 101 53 Op 3 . - CDS 147470 - 148384 756 ## COG1619 Uncharacterized proteins, homologs of microcin C7 resistance protein MccF 102 53 Op 4 . - CDS 148449 - 149243 817 ## COG2273 Beta-glucanase/Beta-glucan synthetase - Prom 149290 - 149349 6.1 + Prom 149328 - 149387 4.6 103 54 Op 1 3/0.000 + CDS 149426 - 150367 743 ## COG0280 Phosphotransacetylase 104 54 Op 2 . + CDS 150390 - 151505 1052 ## COG3426 Butyrate kinase + Term 151710 - 151754 3.5 105 55 Op 1 . 
- CDS 151616 - 152374 759 ## BF4335 hypothetical protein 106 55 Op 2 . - CDS 152400 - 152630 153 ## gi|253566574|ref|ZP_04844027.1| conserved hypothetical protein - Prom 152650 - 152709 5.5 Predicted protein(s) >gi|226332005|gb|ACIB01000051.1| GENE 1 1 - 1429 1172 476 aa, chain - ## HITS:1 COG:VC2738 KEGG:ns NR:ns ## COG: VC2738 COG1866 # Protein_GI_number: 15642731 # Func_class: C Energy production and conversion # Function: Phosphoenolpyruvate carboxykinase (ATP) # Organism: Vibrio cholerae # 2 475 10 481 542 716 72.0 0 MANLDLSKYGITGVTEILHNPSYDVLFAEETKPGLEGFEKGQVTELGAVNVMTGVYTGRS PKDKFFVKNEASENSVWWTSEEYKNDNKPCSEEAWADLKAKAVKELSNKRLFVVDTFCGA NEGTRMKVRFIMEVAWQAHFVTNMFIRPTAEELANYGEPDFVCFNASKAKVDNYKELGLN SETATVFNLKTKEQVILNTWYGGEMKKGMFSIMNYMNPLRGIASMHCSANTDMEGTSSAI FFGLSGTGKTTLSTDPKRKLIGDDEHGWDNEGVFNYEGGCYAKVINLDKESEPDIFNAIK RDALLENVTVAADGKINFADKSVTENTRVSYPIYHIENIVKPVSKGPHAKQVIFLSADAF GVLPPVSILNPEQAQYYFLSGFTAKLAGTERGITEPTPTFSACFGAAFLSLHPTKYAEEL VKKMEMTGAKAYLVNTGWNGSGKRISIKDTRGIIDAILDGSIDKAPTKVIPFFDFV >gi|226332005|gb|ACIB01000051.1| GENE 2 1694 - 2347 512 217 aa, chain + ## HITS:1 COG:MTH1114 KEGG:ns NR:ns ## COG: MTH1114 COG0035 # Protein_GI_number: 15679125 # Func_class: F Nucleotide transport and metabolism # Function: Uracil phosphoribosyltransferase # Organism: Methanothermobacter thermautotrophicus # 28 214 29 211 215 132 40.0 4e-31 MKIINLSETDSILNQYVSEIRNVEVQNDRLRFRRNIERIGEVMAYEMSKTFAYSVKEIQT PLGIAPVRTPDNPLVISTILRAGLPFHQGFLSYFDYAENAFVSAYRKYKDTLKFDIHIEY IASPRIDGKTLIITDPMLATGGSMELSYQAMLTKGHPSEIHVASIIASQRAVDHIASILP EDKTTIWCAAIDPEINEHSYIVPGLGDAGDLAFGEKE >gi|226332005|gb|ACIB01000051.1| GENE 3 2581 - 3342 642 253 aa, chain + ## HITS:1 COG:AGl1291 KEGG:ns NR:ns ## COG: AGl1291 COG0584 # Protein_GI_number: 15890770 # Func_class: C Energy production and conversion # Function: Glycerophosphoryl diester phosphodiesterase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 11 241 1 228 248 136 36.0 5e-32 MGACKSEPVRLPELSGHRGADCIAPENTLASADSCIKYKIDFMECDICISKDSVFYLLHD 
STLDRTTNGTGLIREWLSADIDTLDAGSWFGEKFSGQCVPRLDVLLRKAKQNGLKLTLDY RTGDFGQLLDLVRREGMLENCTFTFWSDKEAKAFRQVAPEIRTLQAYVGGGAELDKVIAE INPNIAVIRIDSLDKLLVERCHKKGLKVLALALGTDDVEESDRKAIELGVDVLATDRPEL FVKKYRPEHTWTK >gi|226332005|gb|ACIB01000051.1| GENE 4 3808 - 5388 780 526 aa, chain + ## HITS:1 COG:no KEGG:BF4442 NR:ns ## KEGG: BF4442 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 526 8 533 533 1035 99.0 0 MSIYITVLLWVLIPLLIINGLIFLTSDMLSVFVGGMLSLKLLCYSLFFSYSFLLGYKAQT GGGIIILVLLLILGFFLWQSFISWVEPGTLWREAVGAIIFLGVILSVGILCRYGIKAGGL LRLSSILAFAIVAFVYITVCLSTVCGAYSPIPEELFWKDAYPEKGKNGTVFFHHSKLHIN PFDSRVIGVFRLNDKKEYVGYSASMYESHEIVNSGFKVDDTIKASTCDFIHSSYEPLLSN FMSDCQDALYMKEGDYCVFLYTEKGALYRKEFDYHRNGIVKQARTYTYDRETGFHIEREK TIEFDELGFCDKSGRVRWNIKDFHQTDTVIDMESRYGQAYDPAARNRKYALPQQLIIDSF VTVESKTASPDSLRKYSSLDFPLDEYYKLGGVNGHAISVYVNTDEYKWRKIYLSDFTQLG NVEQMYSFIDALELSCLDTGMWVRIGEQSMVVGEDDECCSFVLKEVVKGGKALALYFFDD YAVTTDSRFCFYFNPSIHKDMEIQKRAGAAWNEMKKIFMEYKEKEN >gi|226332005|gb|ACIB01000051.1| GENE 5 5627 - 6637 908 336 aa, chain - ## HITS:1 COG:Rv2454c KEGG:ns NR:ns ## COG: Rv2454c COG1013 # Protein_GI_number: 15609591 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit # Organism: Mycobacterium tuberculosis H37Rv # 8 335 44 372 373 323 46.0 3e-88 MSETVYTAKDYKSGQPRWCPGCGDHAFLNSLHKAMAELGVAPHNIAVISGIGCSSRLPYY VNTYGFHTIHGRAAAVATGAKVANPDLTIWQISGDGDGLAIGGNHFIHAVRRNIDLNMIL LNNRIYGLTKGQYSPTSDRGFVSKSSPYGTVEDPFHPAELCFGARGRFFARCVAVDGPAS VEVLKAAANHKGASVVEVLQNCVIFNDGTHESVATKEGRSKNAIYLEHGKPMLFGENKEF GLMQEGFGLKVVKLGENGVTEKDILIHNAHSMDNTLQLKLALMEGPDFPIALGVIRDVEA PTYNDAVAEQIDEVKAKKKYHNFQELLMTNETWEVK >gi|226332005|gb|ACIB01000051.1| GENE 6 6641 - 8491 1747 616 aa, chain - ## HITS:1 COG:MT2530_2 KEGG:ns NR:ns ## COG: MT2530_2 COG0674 # Protein_GI_number: 15841979 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 
2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Mycobacterium tuberculosis CDC1551 # 213 606 1 389 425 371 50.0 1e-102 MADEMMVKELEEVVVRFSGDSGDGMQLAGNIFSNVSATVGNDICTFPDYPADIRAPQGSL TGVSGFQVHVGASKIFTPGDHCHVLVAMNPAALKTQIKFCKPQGLVITDSDSFGEKDLEK AQFKTGNPFEEMGITQQVLEVPISSMCKESLKDSGLDNKAMLRCKNMFALGLVCWLFNRN LSAAEKMLNEKFAKKPEIAAANIKVLNDGYNYGANTHASTSTYKIESKTPKAAGLYTDIN GNKATSYGLIAAAEKAGLELYLGSYPITPATDILHELAKHKSLGVKTVQCEDEIAGCASA VGAAFAGDLAVTTTSGPGVCLKSEAMNLAVIAELPLVVVNVQRGGPSTGMPTKSEQTDLL QALYGRNGESPMPVIAATSPTNCFDAAYMAAKIALEHMTPVVLLTDAFIANGSAAWKLPN MDEYPAINPPYVTPDMIGTWTPFQRNEKTGVRYWAVPGTEGFMHRIGGLEKSSETGVIST EPENHQKMTLLRQAKVDKIADSIPEQEVQGDADADLLVVGWGGTYGHLYSAVEHMRKNGK KVALAHFQYINPLPKNTAEILKKYKKIVVAEQNLGQFAGYLRMKVPGLNISQFNQVKGQP FVTRELVEAFTKLLEE >gi|226332005|gb|ACIB01000051.1| GENE 7 9457 - 10698 700 413 aa, chain + ## HITS:1 COG:no KEGG:BF4235 NR:ns ## KEGG: BF4235 # Name: not_defined # Def: putative bacteriophage integrase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 413 1 413 413 770 99.0 0 MEKKIGKLSIRVLLNTHRRNDCEIYPLIIRVVYHRRKSEYSLGWKIHTSNFSADRERVVY SSTGNLKRKDLGLINDAISQERERLLKIFAFLQQNMPGFSLSQLMDKYRMERNLRYVDAF IVREIERLRQEGRSGTAGLYLSGLYSLRRFLRGQKITFRELTYCFLTDYIHFLRMRGISE NTVNMYIRNLRAVYNKAQKQGINMGCESPFRELKLQTQETAKRALCKHDIARIVSVDLSS EPLLDRARDLFMFSFYARGMPFVDIVFLKHDSIINGIIYYERNKTGQRMQVRVIPPLVAL IEKYRSSYPYVLPYITSFSDRTSYMQYRYALGNVNRLLKRLGRRLHLPLVLTTYVARHSW ATIAKEEGFSIASISEGLGHTSEATTQIYLQSFNSEVIDKINEQVVASIGRHI >gi|226332005|gb|ACIB01000051.1| GENE 8 10945 - 11832 666 295 aa, chain - ## HITS:1 COG:no KEGG:BF4437 NR:ns ## KEGG: BF4437 # Name: not_defined # Def: transcription regulator # Organism: B.fragilis # Pathway: not_defined # 1 295 1 295 295 557 100.0 1e-157 MKLHYKKEHISCTNYKSESYEGFGIGTLTSGSNFNSQTLSVKTNFLIFILEGEVEIIPKE GKIKRVIAQEFFFISALSTYEIQVRVPGRYIYMSFLYNDIKLCEKHMLESYLKEVREASE EVGILSVRHPLNLFLELMDAYLRAGVNCKHLHSIKEKELFIILRTSYSKQEIVNLFHEII GTNMSFKAAVLLHVDRVNNREELAQAMGMSITDLARKFKVEFGESVYSWLLKQKNKKIIY 
RLAQPGASVKEIVYEFGFSSAASFNKYCKKNFGNSPRELVRQLKDKHIDNQNLKI >gi|226332005|gb|ACIB01000051.1| GENE 9 12215 - 12775 496 186 aa, chain + ## HITS:1 COG:no KEGG:BF4435 NR:ns ## KEGG: BF4435 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 186 1 186 186 356 99.0 3e-97 MAFKRSVFLFLLVVSSTFLGVETARGQVYSLQTNVLGWGTTNMNLEFGLKFSHRWTFHFP LQYNPFSFGDARLRNLSASPGVRYWIRESYGRSYFLGIHGVSTMYNVGGVFGDKYRYEGY GFGGGLSLGYNRPLSPHWNLEFEAGLGVLWTHYDKYVCKACGQKVATFKGARLIPTKLAV NIVYLF >gi|226332005|gb|ACIB01000051.1| GENE 10 12809 - 14524 1433 571 aa, chain + ## HITS:1 COG:no KEGG:BF4231 NR:ns ## KEGG: BF4231 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 571 1 571 571 1113 99.0 0 MIHQKKFVTISIGYRIYKMLGVLCLCVLVYSCGSKSPLTMRLFMQAGGATVELPNTSSAD SVFAVSETVGFRDSVVVDNPVSSVDSLEEDIWKSIDMDRIDIVAQRKSIERILERNGEVT LMFYVKAPKVLLDSCWKLTMYPELIEKDSSVLLSPVILRGKEFIRTQESQYKAYDDFLKG IIPESKYDSAFVDREGIRKDIFARQRLFWKVYEAERRRRLAYLKWKTLMDKRHGWISTKA EGNRNTLKQRMQRNVLERSVEKFIAGYDTVGIRASYQKKYDRRTDFWPAYRFPRAMTVKD VPSRFQELYLSGGRLEDIRNYSFTRRDSIEIASHRYFRKAIAENEYNRDHQDLIRDRIIR FPYADSAMVNQTADPSEDFSYLYMYRIPVREGMKKLHITLRGNVLATDHSVWSLPPADTL TFVIASVADLADATLARRFDIAGDSVNHLTPEREEYAQGLEALSNREYQRALGILEKYPD YNTAVALTCLGYHAKSEDLLKQLPQTAAVEYLRAIVNVRLEDYQAAAELLLEACRKDTKY VYRTEMDSDITALLPRFMGLKEELERIASEE >gi|226332005|gb|ACIB01000051.1| GENE 11 14604 - 15668 944 354 aa, chain + ## HITS:1 COG:no KEGG:BF4433 NR:ns ## KEGG: BF4433 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 354 1 356 356 566 95.0 1e-160 MRITKSILALMAATMTFAACSNDEKITESTPKTVALTVKLPEFGGASSRAIDPETTTGNK VTINGPVKIIARVSQGGAITNTISENITAGSMSTITVSGAAQWIEVEANGDNSTETDNVN TRQGSATSSKVRLFGGATINPGSGGNATCTPTINPDMARVEVKGSLAGPWTHLNDLKIKG IYINNVKLTRGASSLTRIVSAAWGTDYAPSGQFEKMFNTDLGAGVGTGVAQIAGGQADGY NFFPQQDLSSPTTKEDVMKKSIHVIMEVEFDKKVGGSGPETGWLNVVALKDNTATNYITD FEAGKVYFINLADIKDIMDVPVPPVTPDPDPETVSVDLTVSIGQWTVVQVKPEV >gi|226332005|gb|ACIB01000051.1| GENE 12 15751 
- 16734 780 327 aa, chain + ## HITS:1 COG:no KEGG:BF4229 NR:ns ## KEGG: BF4229 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 327 1 327 327 666 99.0 0 MIFVGKKNSTSFFLLISLILGLALYTALFSSCVYEDLSKCSRMFMLQPRYLLHTGEGDRF GEAVHHIDVYAFDSLGIYRKMFRDKGAHLKNGYRMPIDLPEGKWTLLCLGGKVGDYEIGI FKTGIPQQFSPAVSPGITTLSDFRVKAYFEQKENLEYSLGELFFGRLDSVEITADNGGTG VVDLMKNTNKIEVRVKGIADGSSARITSDNGRFNSENVTPADAGTIIYVPYYSASQADDT RVFQFDVLRLYTDGHLFLKLLNPDGTDVIPGFTKDLINAIMSSPAYHTQEDLDREDTYLI ELVLSKDGVIISLRVNGWETVSTTPEV >gi|226332005|gb|ACIB01000051.1| GENE 13 16982 - 20167 1232 1061 aa, chain + ## HITS:1 COG:no KEGG:BF4431 NR:ns ## KEGG: BF4431 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1061 1 1061 1061 2115 99.0 0 MSLLKKKYMRCVKGIYIFFLLLSFISCELEDKIGRYTSGDNPVEVELMTRNVTLGGSESG VTLKRLRVIAVAKNSGKVEMNRRADLSDPSSSGLEHTGGVFKLYLRPGDYDLFVIGNETT GMNSALTGEPSLSEIKSVPVTLPIDTAEFVVCKHLDIRIRNAASSTPVTGEVSVDEGLTW NSAFSVELERIASKASLEIRKNTDEDIKIRQVTVKQLPSFSFLSPSAYPLSGGLTVTDTR SFSTPVSIAGEKDVPSGYPYTSVVQGKNSFILPEYIWDNTAGKGRASFMEIEAERNGASE TWQILLGKSTDIPADYSLGRNTYYRYMLTVNPLNVDVNVSVEPWQNTAAYDTIPGAKIVF SRVNVGYSYASESVVTFTTKNLPPVSVSLSPVLYTYNSTGDPVSSKFDMIRTKINYSYDQ DTGTGMGTLTIKRNKVSLADTLLLMAGGFAHTIAVEGVEFAGSNIYFDRVLEKFTFDDTP APGCRAEHEKFQGASFYWGSLQGYNTGYNFLATRKSDRKWGYISTPVWISAFSVVGGDDE NLILQKGLHDPKNLKGDICRFITQREWGPGNKKWRMPTLDELKLLGDGVRETTASTPGWG YISGSSSEGSTVIGNGIRTNNRYYLPANDNGFRGRYWSASAAISASARGLYFEYLSTSVQ DWNTSSTVGVIRCVVDSENNYSKVYRVVYSNTDPLNQESHLYTVSPPALLNEGETIVLPA LSTKFNDYEVREGNKVVTKYFHSGWLVNGRHFDFGDSYVAGADGVGSTEVEIIPEWTKFC KIEYVCTPPYPGAAFYSAFPSFPNNIHFVKPGERYSLPNVQVAVMSSPVKIAVGLLVNGI RYNWGDEMPVFSDIKVEPCWANVPESSFASTNLKLSNSGAIDATLIFASEGETGTLFLWD ILRCNSVTLPGFTVNGVSIASPTVADYDRGRVIHSVSGLKRGVGDICRLVGIPLNEINAK LAAGILPDNGTWRLPTANELAALLHPFQDSSQGRVYSILNNTFFPYNAKYGTKDITYPTA EISIPHLPSIYAWGGFIRLGSRIAIRPYASTAETAIRCIRQ >gi|226332005|gb|ACIB01000051.1| GENE 14 20204 - 24727 2069 1507 aa, chain + ## HITS:1 COG:no KEGG:BF4430 NR:ns ## KEGG: 
BF4430 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1507 1 1507 1507 2863 99.0 0 MCLNKNIKSFLGILMCGILAFLPVSCSDDSPVTGSPLLPEAPEGKVNVFLSLHKGSDYES PATRSGTAADETIMGPPWVLVFLGNDKDATFLEAVQANMTSAGELYAQLSACNSSVVLLL ISNADELIQVKLGSLTTTTTLSDAVTNLLLYGDPSDSPVAGNSALAIPQATVPFTGKKIP MSAYCPLPQIITGTTVGTVGTPLRLKRIVSKLYVDASGAHASDGFILTGVSVINVPVQGA LAFEYGNVNETLPITAQFTDYGHKSGSASRLDNPIIASSTGFAGHITAGMGSEDYPVYVY ETAGGPADRSDVILAGKFDNGPVRYYRASLKNSKGEKLAFKRNYLYTLNLVRVEGGGYST MDEAIAAPSGSSGILCNVTVVDDSHEITGNGVYYLGLTNSSYVLYTDEEQKDVTVCVIGT NAYSRPGSTVTPGVVSMSSGIAGVTLKTTSISADSTAIKLDFAKGAQGETTLDVQVGGLR REIKLKAAGMGVSGNYASGSQGLLLGDFNQIRILESTSKSGLAISPASPDRDSEVISSVT SPSVPVYFFVEEAAGPQSAKLSGLAGESVVVIENRVYNDLPAASNIYWDAAQGRLTFDDV PSYVPAARNQKQGVYFLYGSLIALQGGSSATDVRQLDAAEVNPVWKTNQSPNIPKFNPAS LAPGADLENVLTHIHNPGNRIGDICRYLTQRGWAPPGKKWKMPVKVRLEAVLNSIYRTEG SWLPIGGLATDGTGEVVHGRLIKDRFYFPASGSRKEDGTFIEQPGVSGHYWSSSVISGSI GWQPASLWFQGGDAAIQATALDVALPVRCVVDNTSTDWPELATVAYYASPPGGATLTSEL PANHFVEQRKPFALPSTPLTFSDPSVTHTGWGNNLGLGENTTINTSVGSFHATFSQGASL EYDSNLPAGTALVSGTVPGVYVAAGGTSIKLSTNQLVCSNPGYVHTGWSIDGVYHGLGAN YVMHASGSRKAFAVWSRDCWVEYLAKAPAGAPAGVTVTGTLPSPVKVAQNSSVTLATEAQ GLQVCSDPRWKHTAWTAGAFGGNTIVTNDLKIYPEWTRYYQVTYSPQPPSGTVPGYPRSE IVAPGSSVMLPVQLHSSDPLQVHQAWLVGGQEKAPGTSVIVNGDMTITAKWLLYEVSYLA NAPVGTTSVTPLPALQKIAPGKMVTLSATRLVCSDPAWIHTGWSVGGVHYILGANIPVLS NMQVSAEWTKRFKIKYHLNLPSGTGHAGDMLPEDEYVMPGQNVTLAVPRLKCSNEDFFFT GWMINGQFYGLGSSYTPLPGQDTDVYAVWKYAQAMEFNVTVSNSGRGTDNATLVFTDSKE ETGAFFQSRGVVAWSNTGSPVTWFDPVSTGRTWSSNWMAFASEELHTYANLRNGGGDPCR LIGYSQKYVSTQISKGELPDNRTWRLPKGREYMQYGFTPENKVGSWANGWNFNNGKFLPA SGMRSVADGQLRESGVGYYWLSDYFTIIDDFGKQYMGRGSKVSSEKVIEEGVVYHSAAGF QIRCVRQ >gi|226332005|gb|ACIB01000051.1| GENE 15 24768 - 25058 333 96 aa, chain + ## HITS:1 COG:BH1309 KEGG:ns NR:ns ## COG: BH1309 COG0776 # Protein_GI_number: 15613872 # Func_class: L Replication, recombination and repair # Function: Bacterial nucleoid DNA-binding protein # Organism: Bacillus halodurans # 1 85 1 85 90 71 48.0 3e-13 
MKKIDLIRSIAVKSNLKKEQIAIVVEGVMEAIAEALHQGESVTLVGFGTFEVKERKARKG YNLSTGEIMTIPGKKTVRFKPGAKMNLEKKHQDTSR >gi|226332005|gb|ACIB01000051.1| GENE 16 25359 - 28382 3319 1007 aa, chain - ## HITS:1 COG:AGc2877_1 KEGG:ns NR:ns ## COG: AGc2877_1 COG0342 # Protein_GI_number: 15888881 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecD # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 391 664 275 548 562 241 44.0 4e-63 MQNKGFVKVFAVLLTLVCVFYLSFSFVTRHYTNKAKEFAKGDVKVEQDYLDSLSNEKVWL GNYTLKQCREMEISLGLDLKGGMNVILEVSVPDVIKALADNKPDEAFNKALAEAAKQATT SQDDVITLFIKEYHKVAPGAKLSELFATQQLKDKVNQKSSDAEVEKVLREEVKAAVTNSY NVLRTRIDRFGVVQPNIQSLEDKMGRIMVELPGIKEPERVRKLLQGSANLEFWETYNAKE VAPYLQAADSKLRAVLAHEATVNDTVAAVDSTALAAAEATPDKAVSAADSLAAALKGGEK KQQASSADLEQLKKEHPLLAILSVNPNGGPVVGYANYKDTATVNSYLAMKEVAAELPKDL RLKWGVSPFEYDPKGQTFELYAIKSTERNGKAPLEGDVVTDAKDDYDQYGKPSVSMSMNS DGARRWALLTKQNINKSIAIVLDNYVYSAPNVSNEITGGNSQITGHFTPEQAKDLANVLK SGKMPAPAHIVQEDIVGPSLGQESINAGIFSFVVALILLMIYMCSMYGFIPGMIANGALV LNFFFTLGILSSFQAALTMSGIAGMVLSLGMAVDANVLIYERTKEELRSGKGVKKALADG YSNAFSAIFDSNLTSIITGIILFYFGTGPIRGFATTLIIGILCSFFTAVFMTRLVYEHFM SKDKLLNLTFTSPISKKMLVNTHFDFMGGNKKWLTITGVILLICIGSLVTRGLSQSIDFT GGRNFKVQFENPIEPEQVRELISNKFGDANVSVISIGTDKKTVRISTNYRIQDEGNNVDS EIESYLYEALKPMLTQNITLATFIDRDNHTGGSIISSQKVGPSIADDIKTSAIWSVVFAL VAIGLYILIRFRNIAYSAGSVAALTSDTLMILGAYSLCWGWMPFSLEIDQTFIGAILTAI GYSINDKVVIFDRVREFFGLYPKRNVKQLFDDSLNTTLARTINTSLSTLIVLLCIFILGG DSIRSFAFAMILGVVIGTLSSLFVASPIAYMMLKNKKGSAPATTTEE >gi|226332005|gb|ACIB01000051.1| GENE 17 28562 - 29029 110 155 aa, chain - ## HITS:1 COG:no KEGG:BF4426 NR:ns ## KEGG: BF4426 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 155 1 155 155 233 96.0 2e-60 MPNRRPNRKKTGSLKGQSRGLSKNKSGFDTKSLFVFRKRSKHFLQKTVTFSKNAYFFALT SPYVLKTTFTAPENTTYSGSSCSNRLLSESPTGVSKRLKFPSLKSTGNIYFQIAFFLTEN KSRTKQNDKPGNEIETFFRPRLSRESKKRKLGGFI >gi|226332005|gb|ACIB01000051.1| GENE 18 29194 - 30291 882 365 aa, chain - 
## HITS:1 COG:no KEGG:BF4425 NR:ns ## KEGG: BF4425 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 365 1 365 365 756 100.0 0 MKHIGRLALFLCLAVNAFFIGMLLLTAYSPYINPEVHPVQSCLGLTFPIFLVINFCFLIF WLIVRYRFALVPLLGFLLCYPQLRTYMPVNPGTAGQPENSIKLLSYNIMSFGNMKKENGQ NPILNYIKNSNADIVCMQEYAGSETAKIHLSNKEIRQALKDYPYHNIKQVGKTGAGSQLA CYSKFPILSARMLDYRSNYNGSMVYEIKIGKDTVLLINNHLESNKLTREDKVVYEDMLKD PKAGKVKSGVRQLVNKLAEASAIRSAQARTIAQEIAHSPYPSVIVCGDFNDSPISYAHRV ISQDMDDAFTESGCGLGISYNQNKFYFRIDNILVSKNLKASGCTVDNSIKDSDHYPIWCY ITLPD >gi|226332005|gb|ACIB01000051.1| GENE 19 30295 - 31194 661 299 aa, chain - ## HITS:1 COG:MA3859 KEGG:ns NR:ns ## COG: MA3859 COG0705 # Protein_GI_number: 20092655 # Func_class: R General function prediction only # Function: Uncharacterized membrane protein (homolog of Drosophila rhomboid) # Organism: Methanosarcina acetivorans str.C2A # 65 214 68 212 226 86 32.0 5e-17 MGHIITDLKEAFRRGNVYIQLIFINVGVFVITTLIGILLQLFNRSAAGIFELLALPASFT RFAWQPWSIFTYMFMHAGFLHILFNMLWLYWFGALFLYFFSGKHLRGLYIVGGICGGLLY MISYNVFPYFRPMTAYSTMVGASASVLAIVVATAYREPNYPVRLLFFGNVRLKYLALIVV LTDLLFITSSNAGGHIAHLGGALAGLWFAASLNKGKDITSWVNKALDAIAALFSAKTWKR KPKMKVHYGNNTRQNDYDYNARKKAQSDEIDRILDKLKKSGYESLTTEEKKSLFDASKR >gi|226332005|gb|ACIB01000051.1| GENE 20 31175 - 31849 619 224 aa, chain - ## HITS:1 COG:XF0649 KEGG:ns NR:ns ## COG: XF0649 COG0705 # Protein_GI_number: 15837251 # Func_class: R General function prediction only # Function: Uncharacterized membrane protein (homolog of Drosophila rhomboid) # Organism: Xylella fastidiosa 9a5c # 1 213 9 203 224 143 43.0 3e-34 MPTVTKNLIIINVLLFLAQFVAQSYGINLSDYLGLHFFLADNFNPAQLFTYMFMHGGFTH IFFNMFAVWMFGRILEQVWGPKRFLFYYILCGVGAGLLQEGVQYIQYVTELSQYTSVNIG TGIIPMSEYLNMMTTVGASGAVYAILLAFGMLFPNQQLFIFPLPFPIKAKFFVIGYALIE LYAGFANNPGDNVAHFAHLGGMIFGFILIMYWRKKNRNNGTYYN >gi|226332005|gb|ACIB01000051.1| GENE 21 32097 - 32369 290 90 aa, chain + ## HITS:1 COG:BS_hbs KEGG:ns NR:ns ## COG: BS_hbs COG0776 # Protein_GI_number: 16079336 # Func_class: L Replication, 
recombination and repair # Function: Bacterial nucleoid DNA-binding protein # Organism: Bacillus subtilis # 1 89 1 89 92 85 58.0 2e-17 MNKAELINAMAAESGLSKVDSKKALEAFFSSVTKALSAGDKISLVGFGTFSVAERSARMG INPSTKKAIEIPAKKVAKFKPGAELTDAIK >gi|226332005|gb|ACIB01000051.1| GENE 22 32457 - 34250 1918 597 aa, chain + ## HITS:1 COG:TP0831 KEGG:ns NR:ns ## COG: TP0831 COG0018 # Protein_GI_number: 15639817 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Arginyl-tRNA synthetase # Organism: Treponema pallidum # 8 597 12 589 589 469 41.0 1e-132 MKIEDKLVTSVISGLKALYGQDVPAAQVQLQKTKKEFEGHLTLVVFPFLKMSKKGPEQTA QEIGEYLKANEPAVAAFNVIKGFLNLTVASATWIELLNEIHADAQYGIVSADENAPLVMI EYSSPNTNKPLHLGHVRNNLLGNALANIVMANGNKVVKTNIVNDRGIHICKSMLAWQKYG KGETPESSGKKGDHLVGDYYVAFDKHYKAEVAELMEKGMSKEEAEAASPLMNEAREMLVK WEAGDPEVRALWQMMNNWVYAGFDETYRKMGVGFDKIYYESNTYLEGKEKVMEGLEKGFF FKKEDGSVWADLTAEGLDHKLLLRGDGTSVYMTQDIGTAKLRFADYPIDKMIYVVGNEQN YHFQVLSILLDKLGFEWGKSLVHFSYGMVELPEGKMKSREGTVVDADDLMAEMIATAKET SQELGKLDGLTQEEADDIARIVGLGALKYFILKVDARKNMTFNPKESIDFNGNTGPFIQY TYARIRSVLRKAAEAGIVIPEVLPANIELSEKEEGLIQMVADFAAVVRQAGEDYSPSGIA NYVYDLVKEYNQFYHDFSILREENEDVKLFRIALSANIAKVVRLGMGLLGIEVPDRM >gi|226332005|gb|ACIB01000051.1| GENE 23 34436 - 35419 754 327 aa, chain - ## HITS:1 COG:no KEGG:BF4420 NR:ns ## KEGG: BF4420 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 327 1 327 327 642 99.0 0 MKKVILAVASIALWASCIEDEKDYSQIIETRVANCETSKDFSVPVKEGYTTFVTSGEDTL AMANEPITIRIPKNATISTRAEGDGINISYTILDEGSETTYAKVWQAIMFEDTQNGDYDY NDLIIHVKNTASNHAYQHPTETWQTIEIQPIALGSTKTIKLGCILSDGSTHMISDDVRTD LFGGRQGFINTVNDNDPIRYKLASTNIKNYAMPKKEKTSAAWVAWFIEVDGKRMYAASSD IDYKSYDMVNKENMPYGLAVSNGNGTFSYPQEKNSLFETYPGFSDWINGKISSIGSFQKE LVYKYCSGGIIGEDGKSHKIWDYLDLN >gi|226332005|gb|ACIB01000051.1| GENE 24 35591 - 37936 2339 781 aa, chain - ## HITS:1 COG:SMc01364_1 KEGG:ns NR:ns ## COG: SMc01364_1 COG0550 # Protein_GI_number: 15965053 # Func_class: L Replication, recombination and repair # Function: Topoisomerase 
IA # Organism: Sinorhizobium meliloti # 1 581 5 581 585 446 42.0 1e-125 MQKNLVIVESPAKAKTIEKFLGKDFKVLSSYGHIRDLKKKEFSIDIEKNFKPKYEIPAEK QELVDKLKEEASKAETVWLASDEDREGEAISWHLYEVLKLKPENTKRIVFHEITKTAILK AIEQPRDIDINLVNAQQARRILDRIVGFELSPVLWKKVKPALSAGRVQSVAVRLIVERER EIQAFKSEASYRVTAVFLVPDADGKLVEMKAELARRLKTKKEAQKFLESCKAATFSIEDI VTRPVKKSPAAPFTTSTLQQEAARKLGFTVAQTMMVAQKLYESGLITYMRTDSVNLSEYA TEGSKVAIAQMMGDQYVHPRHFATKTKGAQEAHEAIRPTYMENQTIEGSAQERKLYDLVW KRTIASQMADAELEKTTATIGISNGNDKFTATGEVIKFDGFLRVYKESYDDDNEQEDESH LLPPLKKGQMLEHQGIVATERFTQRPSRYTEASLVRKLEELGIGRPSTYAPTISTIQQRE YVEKGDKPGEERSFDILTLKDNQITDIKHTEIVGAEKSKLLPTDIGTVVNDFLTEYFPNI LDYNFTANVEKQFDEIAEGDKKWTSIMKDFYKDFHPSVETTLATKTEHKVGERILGEEPK TGKPVSVKIGRFGPVVQIGTADDTDKPRFAQMKKGQSMETITLEEALDLFKLPRKVGEFE DKTVTIGTGRFGPYVYHNSKYVSLPKTYDPLEVTLDEAIELILAKREAEAKKHIKKFDED AEMEILNGRYGPYIAYKGSNYKIPKDVVPVDLNYQTCLEIIKLQSEKAENAPKRGRYAKK K >gi|226332005|gb|ACIB01000051.1| GENE 25 38215 - 39918 1539 567 aa, chain - ## HITS:1 COG:no KEGG:BF4418 NR:ns ## KEGG: BF4418 # Name: not_defined # Def: putative TonB-dependent receptor # Organism: B.fragilis # Pathway: not_defined # 1 567 1 567 567 1113 99.0 0 MKYNNYILLGIAFTALPVSIQAQTQPKDTTVNRTVIVEQQYNPDIMDAAKVNVLPKVEEP SVSKKEVEYATFTTPATSIPAGTIGAYTGKEIQPGFIPGYVRLGYGNYGNLDVLANYLFR LSDRDKLNVNFKMDGMDGTLDMPFGDTRKWNAFYYRTRANVDYVHQFAKLDLNVAGNFGL SNFNYEPYGFKKQKFTSGDVHFGVKSTDETLPLQYRAETNLMLYGRQQCQLFGGVNETMV RTLATVSGSVSDEQTVAIGFAMNNLIYGNELKENKDRIKDIFKNRTTLDLNPYYELNNDS WRVHVGANVDLSFGNGKAVRVSPDVKAQYVFSDSYVLYAKATGGRQLNDFRRLETYNPYL DPGQEVKDTYEQLNAGLGFKASPTPGLWFDIFGGYQNLKDDLYQSADAWDGGDGANYIGL GQTHTDNFYAGIKASYEYKDLFAISAGGTYYHWNADAQTTGSKSSDYNEALLMKPEFDLG IHTEIHPIAALWLNAGYQYTRRAERYTGLYAKSIPAVSNLSLGATYRIFKGISAYVKADN LLNKKYQYYLYYPVEGINFVGGLSFRF >gi|226332005|gb|ACIB01000051.1| GENE 26 39925 - 42933 3116 1002 aa, chain - ## HITS:1 COG:MA1613 KEGG:ns NR:ns ## COG: MA1613 COG0457 # Protein_GI_number: 20090471 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Methanosarcina acetivorans str.C2A # 
82 762 394 1092 1885 65 22.0 7e-10 MKKKISRLICAVACCVPVALQAQTSEKITSPVNLYKEGKELFLQKNYAAAMPPLRTFVRQ KADVNLKEEAEYMLVCSAYELKDRNAIAQLRNYLDTYPDTPHANRIYALIASAYFYQGNY DEALALFNSSRLDLLGNEERDDMTYQLATCYLKVGNVKEAAIWFETLKASSPKYANDCSY YISYIRYTQKRYDEALKGFLPLQDDAKYKALVPYYIAEIYAVKKNYDKAQIVAQNYLSAY PQNEHAAEMYRILGDAYYHFGDYHKAVASFRNYLEKENTPRRDALYMLGLSYFQTGVFSK AAETLGEVTTESDALTQNAYLHMGLAYLHLAEKNKARMAFEQAAASNANLKIKEQAAYNY ALCIHETSYSAFGESVTVFEKFLNEFPNSEYAEMVSSYLVEVYMNTRSYEAALKSIDRIA HPGKRILEAKQRILFQLGTQAFANTQFEQAIGYFDRSLGLGQYNRQTKADALYWRGEAYY RLNRMEEAKRNFTDYLQLTQQTHNEMYALAHYNLGYIAFHQKDYTQAQNWFRKYISLEKG ENKTALADAYNRIGDCYLDVRNFDEAKHYYSQAEAMNTPSGDYSFYQLALVSGLQKDYSG KITLLNRLAGKYPASPYAISALYEKGRSYVLMDNNQQAIASFKELLAKYPESPVSRKAAA EIGLLYYQNEDYDQAINAYKQVVQKYPGSDEARLAMRDLKSIYVDMNRIDEFAALASAMP GNIRFDASEQDSLTYMAAEKIYIRGRVEQAKESFGKYLQTFPDGAFGLNAHYYLCLIGKE QKNYDMILEHSGKLLEYPDNPFSEEALIMRAEVQFNKVQFADALASYKMLKEKATTAERR LLAETGMLRAAYLLKDDTETIHAATALLSEAKLSPELKNEALYYRAKAYLNQKADKAAMG DLKELAKDTRNLYGAEAKFLVAQELYNSQNYAAAEKELLNFIDQSTPHAYWLARGFILLS DVYVAMDKKLDARQYLLSLQQNYHADDDIESMIESRLNNLNK >gi|226332005|gb|ACIB01000051.1| GENE 27 42973 - 43494 395 173 aa, chain - ## HITS:1 COG:RSc1618 KEGG:ns NR:ns ## COG: RSc1618 COG1051 # Protein_GI_number: 17546337 # Func_class: F Nucleotide transport and metabolism # Function: ADP-ribose pyrophosphatase # Organism: Ralstonia solanacearum # 9 131 2 121 195 67 32.0 1e-11 MEHPLNQFKYCPKCGSAAFEIHNEKSKQCTDCGFVYYFNPSSATVALILNEKDELLVCRR AKEPAKGTLDLPGGFIDMNETGEEGVSREVEEETGLKVKKATYLFSLPNIYIYSGFPVHT LDMFFLCQVEDTSHFEAMDDVADSFFVPLCQINPEEFGLGSIKKGLKRFLKER >gi|226332005|gb|ACIB01000051.1| GENE 28 43543 - 43668 109 41 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MAKVEKEKGKEEKGSGHYSSHFPFFINTLYYQTYNTGGHIT >gi|226332005|gb|ACIB01000051.1| GENE 29 43694 - 45352 917 552 aa, chain + ## HITS:1 COG:no KEGG:BF4213 NR:ns ## KEGG: BF4213 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 552 1 552 552 1137 100.0 0 
MKSLHSLLFCGVLLLHASCSGIKTSDEKSLDDCPLVATWKQAGTDSIVVLDVGLIKDTMQ IRLSQLVDDLEIIKLETRDTALVKSGYMAVSDRYMLLGSYLMPCKLFDKNGTFLRQIGGL GQGPGEYTNIYDAQIDEVNNRIYMLPWTSNQLLVFDLDGNILPPIPLPARVPKGVFRVDT KKNLLTMGILPFQDLENKFVLWQQDLKGNVLQSISSTPYYTYDDYSNEVSSNRNAGSFDF FIFNWSAVQDSLYHYDAKENRLVPVFTANFGTQDIPKHTYTEFPGHYWVNIITEVVNGQG MPPMNVLIDKHSLKGTYCTLVIDELGGIPVEYPYDCFQDGRFLMNLDPGDLIDELEKVLA RPERFSKEESDRLTKLKNSISVDDNNYILVGKFKSKGESLILSANPVQKITEQEQQPAKE EPVKTAVQSEVASEADTVWSVSPYSAILPDAIDYFRTHNKYKDWDPKKGKRVLVRGIAEK DGTITGVGISYGWDLDPQSGAMKNKSEGTCGLKELDEEALRLIRQAKLLPGMTDKKIPVR SKFVIVVDFPPK >gi|226332005|gb|ACIB01000051.1| GENE 30 46246 - 47388 749 380 aa, chain + ## HITS:1 COG:no KEGG:BF4212 NR:ns ## KEGG: BF4212 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 380 14 393 393 788 99.0 0 MKQNTFFVNVWTLMLILGLIACGEKKTQVKTELQRIEAFAFDVNDDYLQSYAGTFCYSTS ARIDGKECLIVYNGKLHSIDILNLADRRPLKQIALAKDGPDQILAPKGIGYYKDSFIILN TGGLYRVGQDGKVVSKKLLNDFPQIKEEGYGIAVPDLTVYFSVYSFFGFDAANGRVALPL YFYEKDTTGEYPKKVLIVSCDDWNIRDEVEIHCPDVIRKEGDMQLLGCVNVLPYGDRLIY NFPASSKVYVYDLSVKKSKEYDFPSTFTDPFFHLPDINGSEPGFGCLKTGYYFPLCYDAY HNVFWRIQQGPLDGHGVGGKPFSVMCISPDFLNSAEYVIPAGASIYPDLAFTDSLILLPY TGGDKIGENNMCFYGLQYRE >gi|226332005|gb|ACIB01000051.1| GENE 31 47556 - 48749 549 397 aa, chain + ## HITS:1 COG:no KEGG:BF4412 NR:ns ## KEGG: BF4412 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 397 1 397 397 823 99.0 0 MKVLYVIFICVLYSCMQNVEPDNKISVGAAMNALGKITAQTISSNVRYVALETNDSSLVG EMPDMRVLEHAILVSSVNQCLKLFDRSTGKFIRDIGHVGTDPQGYAKDAWGKVNYWVDYD QNIIYVLGWGNDLILYDLAGNYKDRICIDDNVRYNLQQSYLYVTYGKLWGHNKLYISNRT SPLFCIDENSNHITDIVGLPVDLLPMDDLQSVSELLGDYASYGGDLTMASFANGGKFYTA INSPSLWRFENDIRLKQTFNDTIYTLSDSKIKPYLIFELGDWAWQYQDRLEEGGCEKKIM IDYALENERCIYFHFHTGFYTKNRQAFCGLYYKADHRVVLMCGDRLLDTVNRQSLRVRGV SSDGCFIALLQPDELCDEVKKKTGSKEEDNPIVVILE >gi|226332005|gb|ACIB01000051.1| GENE 32 48814 - 49044 57 76 aa, chain + ## HITS:1 COG:no KEGG:BF4411 NR:ns ## KEGG: BF4411 # Name: 
not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 76 6 81 81 113 94.0 2e-24 MDHKKVFGRTVGIILVDIEGKSRYNGKIDLYYKLVTVRVFRVKIVRFMSIVLIYSGLYRI IFHEFVRVDNLRIGST >gi|226332005|gb|ACIB01000051.1| GENE 33 49106 - 49465 228 119 aa, chain + ## HITS:1 COG:no KEGG:BF4410 NR:ns ## KEGG: BF4410 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 119 1 119 119 221 100.0 6e-57 MDKIEFREDKITFSYKKSYGECLYQDLIAVEYSKPYCILQIGGRNSILFLISLRVILEKL PADFLLINRGIIVNKERIVDCVLQDGAYHIKMDNNKIYRISTRKLATVKKYFMVQNDTI >gi|226332005|gb|ACIB01000051.1| GENE 34 49636 - 50946 783 436 aa, chain + ## HITS:1 COG:no KEGG:BF4209 NR:ns ## KEGG: BF4209 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 436 3 438 438 863 99.0 0 MKHMSLLLIGVFVLLGCSSNKKQEPISKSGVPVINLSEDVSTVPSLLLSEAAEKLEIVSL EMTDESVLSDITEMQVTDHNIWIDHGREFYIYRFSRTGKFLNKIGSIGQGPGEYTTYSTF LVDEDKKEVYIIANTNGVLAYDFEGNFKRKIIDIQMILQLFSSPYDQYILNNQKFFATQN FGLYRPIDKDSLWSFVSLGDDFQKKKYFKNPAHVGREEQIIANRANMDRMVNYWREYLTS MDTYNAQLTLKYPDTDTIYCYDDATNQLSPQYAIFTDEEKGDYEATHLWFKDRKAFDYFS IFSYYPTKDFIYLVGSKGEEVYTYCYNKKDGSVRLQKRQSTITERDVPWFSFPLRQMKRD FVLDNDLGGGDFTVDSRSSGKYWIDILEPGGDENWIDIDQIKSSTVIDESKKKELIRVLE SATEDSNPILMIATLK >gi|226332005|gb|ACIB01000051.1| GENE 35 51000 - 52301 637 433 aa, chain + ## HITS:1 COG:no KEGG:BF4208 NR:ns ## KEGG: BF4208 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 433 1 433 433 840 99.0 0 MKSILLLIITLLGCSSNMKQEPISKSGIPVINLSEDVSTVPSLLLSESAEKLEIVPLEMT DQSMLGEIRRIQVTEHDIWIHDFNKFYIYRFSRTGKFLNRIGSIGQAPGEYVNFSTFLVD EYKKEVYIISNNNGILVYNFKGEFKKKIVDQQTINNLFSSVYSQYILYNGNFFAAQNIAL YKLIDKDSLWSFAFLDTAFQEKKLFKNPAHMGREEQIIANCVDKGRMINLWMEYQTSIDT YNNQLTLKYPDTDTIYCYDDATNDLLSQYVICTREEKGDYEVTHLWFKDRKAFDYFSIKS YYPTKAFIYLVGSKGEEVYTYCYNKKDGSVRLQKRQSAITERDVPWFSFPLRQMKRDFVL DNDLGGGDFTVDSRSSGKYWIDILEPGGDENWIDIDQIKSSTVIDESKKKELIRVLESAT EDSNPILMIATLK 
>gi|226332005|gb|ACIB01000051.1| GENE 36 52486 - 53748 846 420 aa, chain + ## HITS:1 COG:no KEGG:BF4206 NR:ns ## KEGG: BF4206 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 420 2 421 421 874 99.0 0 MEFFMRMILILGVMVLGFTACSNKKTVDPNPVVRDNIIHLSTAIKNVREEMMLSELVDSV SYIPLETNPNCMLGNYQRLTFSPQYIFYSNYCFDWNGQFLFRIGSQGQGACEDIYVHVAD IVYLNNHFYGNASKIIEYDDRGKCTGKELSWFTQKTMDTAPVGHLVNQVCFAPAGENLMF YNFPDTVYFINTDYEFVAKRSMMPWGRKGGAPSMSGGDPRFKYTSYYKDTTLFYNFYTDT VFTVTPTSLMPRWVVELDEELRFPTRYLYEDGLLSEAFKCWESGNLENAKMIKLLDHKYM VSGVFETERFVFLSVYECMPFRELRKLPETPPLTAIYNKRTGETFAVKQVVDDLGGMKAF FPSWGAYNEKLLATIWPYKLKEFIEEEQSAGRAVAPQILNLMQRVREDDNPVLIIAHLKK >gi|226332005|gb|ACIB01000051.1| GENE 37 53806 - 55023 506 405 aa, chain + ## HITS:1 COG:no KEGG:BF4204 NR:ns ## KEGG: BF4204 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 131 405 1 275 275 562 98.0 1e-159 MNKVRLALYLGILVHLIGCSSFHSEKIDLLAQVPSDCPIVEMTSVLDREDPISVKELVDS ISYVQLDNFVKLPVRASILSMAVTKDYIFVLAGSDAGVFKYDRKGKLIKKIISNDYSSLK LWIGADEFKELLYLNQEGGSEGTTEVYDYDGNYQGNIAELYNMPYHSQYVHRLNEKFLAA FSPWCMPLTDKNYFGGAVFSEKGEMVHQLNYFVPSDTLALCKVSYYSFTYQSDGSMLVWN NGGSGSPIVQPYTTLYKVTVDSIFPVYRLFSGNTRDKIDHVQGIGVSGNSLAIETNSSLY LCCYHPKAPHPCPTLCFMRYDIKNKVLQGVNYPPNGISGGYINHLNGDIPIRFQYSFPLQ KIYVSSINSGEIEQLRKSGYINANSDETLRSNKPGDNPILIYYHY >gi|226332005|gb|ACIB01000051.1| GENE 38 55045 - 56160 638 371 aa, chain + ## HITS:1 COG:no KEGG:BF4203 NR:ns ## KEGG: BF4203 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 371 1 371 371 726 99.0 0 MKSLLFFCLSTFLFYSCSKKENSLYPVIDLADAIENPVEKSVYDIAESVEVVQLETNDSL LIPYVSQLIMTDQYFIIGYGKKCSLFSHSGKFVCDIAQKGSGPEEYTMLMNLLYINNRVL ITDLNNKVNVYSLNGKFIESYQALPDMFSAIYPMSDKNFIGFKAQSGGDEKERLVFYRRD EKRGAIPYQQNYQAKRVCFFPREAQFSSLHDKLYFKQLLNDTIFSVDTVKHSLSPEYIID WGKLRSNDRLRYSLENPDEELFLHMPYVPLFSMTANGLVLGAITVDIDQKKQYYLTAFYD KANHQADLFELKLSKKEMDYFGEPTGKLPPYLEGDTFFPEYISQDGNYLISVRTQPLKEE LNPALIVVKLK 
>gi|226332005|gb|ACIB01000051.1| GENE 39 56221 - 57570 512 449 aa, chain - ## HITS:1 COG:HI0258 KEGG:ns NR:ns ## COG: HI0258 COG1442 # Protein_GI_number: 16272216 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases # Organism: Haemophilus influenzae # 1 243 35 281 330 127 33.0 3e-29 MKNTIPIFFAINEEYIDHCCTSIVSILENNRYLNISIYILTDYISLESKEFLQEIKNVFT CVTIQWEIIDSESFKQLKKKGGYITEHTLYRYAIADLFPNLDKALYLDADLVINGSIEPL WELDLEGYYCAGVDDIFIRRINYRKILELAEKDVYINAGVLLLNLKDLRKDKIQEKLLQH TSIYINRDRYQDQDAINCICKGKIKLIPNIYNFTTSETLHTPEMLSDIIIIHYTGSIKPW HQEYTWQVLKELYCKYNSSMNKIKNRLLSRWMERTIELFQLSQKTNDTELEEEADKLLNK IIDHCSLAVPITYENGLCGIGTGIEYLLQKKLVEGNSDEILHQIDSAVYSVIEQKSLTDL GLGKGVSGLAYYFYSRLCTRENFNTPTALKIKEYLFHLINWIAELLPDTNNRPVLCEVYL VLSLLHELNIPQAPIETLMRNSLSQITGY >gi|226332005|gb|ACIB01000051.1| GENE 40 57926 - 58633 362 235 aa, chain + ## HITS:1 COG:no KEGG:BDI_3162 NR:ns ## KEGG: BDI_3162 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 4 232 1 207 214 145 35.0 1e-33 MDRLEINKSAILSKIVNHLVLVSNSVHDIGLYYGKMGIVLFLYNYSRYVHNDLYEKIAGE LLDNVLEEVHNYLPYDLANGYCGIGWAIEYLSEQKFIEGNINEILRNLDEKIMERDVRRI SDETLSSGLEGVFLYVLTRSMGNLLGTPFDKIYLEDLYEVAKKQKIEKGDSHISIFRCKY MDWFEGKVVYRAPLLLSDVIDFPNIPQDEDLWKWGIGISNGCAGVGLRMMVELVN >gi|226332005|gb|ACIB01000051.1| GENE 41 58640 - 59857 470 405 aa, chain + ## HITS:1 COG:MJ1607 KEGG:ns NR:ns ## COG: MJ1607 COG0438 # Protein_GI_number: 15669803 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Methanococcus jannaschii # 169 342 151 329 390 82 32.0 2e-15 MKHFYLVVDRVDTENCGIGTYVEQMLQYLKGEIELSVTIVELDSSESELKEIEVDGLKYL KIPLRYKGLYRDSGVYYRTVAYLLCTMMDDDKINIFHFNYLHQYDIAYKMKELCPKAIIW VTVHFVNWVLALKGNVDKLHSLLREHILKESLILKDYIDSGNMLRLADKVIFLSVQTQKT LCDEYKLDMCKSVVIANGIADYDIKISKSVRNKIRNSFGFLQNDRIILYSGRLEEGKGVD DLIGAFLILLKKMPACRLVIAGSGDFNRYFHLVRFCSRITFVGKLNQKELYKIYQIADVG 
VLLSFTEQCSYTIIEMLMFGLPIIGTTAPGLSEMFEDGIHGIKIKMRKKRDGSFFYKKSE IVQAFLDFFSNDQVGIISRNCRAHYERMYSLKTMSDRMALLLKAL >gi|226332005|gb|ACIB01000051.1| GENE 42 60060 - 60296 260 78 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|255011492|ref|ZP_05283618.1| ## NR: gi|255011492|ref|ZP_05283618.1| hypothetical protein Bfra3_20275 [Bacteroides fragilis 3_1_12] # 1 78 1 78 78 118 100.0 9e-26 MKKLNSIKLNGLNKSNLESRDMANLYGGNYCYFGKENLQANTDTGKCSCACSNQYNEHNY YSDLGLKHEAEFFKSSNF >gi|226332005|gb|ACIB01000051.1| GENE 43 60441 - 61694 920 417 aa, chain + ## HITS:1 COG:no KEGG:BF4206 NR:ns ## KEGG: BF4206 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 415 6 420 421 834 93.0 0 MRMILILGAMILGFTACSNKKTVNPNPVVRDDIIHLSTAIKNVREEMMLSELVDSVSYIP LETNPNCMLGNYQRLTFSPQYIFYSNYCFDWNGQFLFRIGSQGQGACEDIYAHVADIVYL NNHFYGNASKIIEYDDRGKCTGKELSWYAQKTMDTAPVGRMVNKVCFAPAGENLMFYNFP DTVYFINTDYEFVAKRSMMPWGRKGGAPSMSGGDPRFKYTSYYKDTTLFYNFYTDTVFTV TPTSLMPRWVVELDEELRFPIQYLYEDGLFSDAFKCWESGNLENAKMIKMLDHKYIVSGV FETECFVFLSVYECMPFRELRKLPETSPLTAIYNKRTGETFAVKQIIDDLGGMKTFSPSW GAYNEKLLATIWPYKLKEFIEEEQSAGRTVAPQILNLMKRVREDDNPILIIANLKTK >gi|226332005|gb|ACIB01000051.1| GENE 44 61691 - 63160 534 489 aa, chain + ## HITS:1 COG:CAC0658 KEGG:ns NR:ns ## COG: CAC0658 COG0641 # Protein_GI_number: 15893946 # Func_class: R General function prediction only # Function: Arylsulfatase regulator (Fe-S oxidoreductase) # Organism: Clostridium acetobutylicum # 39 441 74 466 518 93 24.0 9e-19 MKSFTFFRTEEGNYYLYNSRKSSLLNVHPVIKVIEALDNNDGEETLFAKIMNQNLEIGDD EKRLLLDKYLFLRDNGFFEEINEEDFIDGKITPEIVEKQIAFIDNVLFQVTGNCNLRCQY CCYGDMYSDKVEKKNLSFETVETVLNYLMSLWTSNKNLSYEHPIRIGFYGGEPLVNFSLI EKIVSLCESITERTGLVFIYSMTTNALLLDRYKDFIVRHNFSLLISMDGNEAHNVLRIKA DGEQSFQTVYENVKKLQQQYPDYFQNKVEFNSVLNSHSSVDDIHDFIFNEFGKIPLIETI SHTALSDEQKYQEIAKDYKESPEMMIKRKDRSPVYKELGFFFYYQLDNAYKHYCEVLYGT IKQKKRIPTGTCLPFFKKMFVTADNKLLTCERISLHHVLGTVDNEVHLNFEEIASIYNSY YEKMRSQCRGCYLIENCGECFLQFPLKNGVPVCHVKMNEIQYQHYLSEIFGVLEKNPSLF EEVNKMVFA 
>gi|226332005|gb|ACIB01000051.1| GENE 45 63157 - 64380 361 407 aa, chain + ## HITS:1 COG:no KEGG:BF4196 NR:ns ## KEGG: BF4196 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 406 2 412 416 308 39.0 2e-82 MKKYWFIVETYVFLWKKEDHILVYNSISGKGYIYNCSPDLLSFIDQLMLPENLYCSQVDE ELLRRKSIGEFVISMQSNFCGSLFDVDEFPQKPIVAVPDVNIGEDIEGVDNSVFGGNVLR NLTDVFICLTGKCNKNCPDCNLIYKQMCWCHKSNESLAFEKLEAILKKLEYLNVFELHFT GGNMFLYPYWRELLIKLNEISYKKSFYVHYSQLRGYDEDIDHILNLENSVIRILVDFADF DVENLLLISKSRGPFEYLFKITSWNEYNEACDIIERLCLNAKIIPFYNVGNYSFFEDNVF LNMQDILSTKWTKNEIFANQKLNTNDFGKIRMLENGFVYANLNFSPIGKWHDDYREIVYN ELKRGTSWRRTRDSLPVCKECLCKYLCPSPSNYELAIGKSNLCHVKS >gi|226332005|gb|ACIB01000051.1| GENE 46 64517 - 66322 1397 601 aa, chain + ## HITS:1 COG:mlr5890 KEGG:ns NR:ns ## COG: mlr5890 COG0079 # Protein_GI_number: 13474906 # Func_class: E Amino acid transport and metabolism # Function: Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase # Organism: Mesorhizobium loti # 202 601 39 437 449 210 33.0 6e-54 MQAIILAAGMGRRLGELTRYDTKCMIEVNGIRIIDRLLANLAVARLSRIVLVIGFQGDKL RAYLGNEYCGIPIYYLENPYYAYTNNIYSLFLARHHLASDDTLLLESDIVFEKRILERVL EEPYPDVAVVDRYKSWMDGTMVTVDEKQFIVDFVSKHTFSYEKTPTYFKTVNIYRFSKEF SVGKYVPFLEAYCKCFDNSAYYEQILAVLSLLDKAGLKALPLEGEKWYEIDDMQDLDIAE TLFGKKEGLLPGYQKRYGGYWRFPFLFDFAYLVNPHFPTERMLEELKANFDKLLRQYPSG SYVNRRLVAKHWSIPAEAVAVGNGAAELIRKLMELLSGRMGVILPTFEEYLVGDDRIETF LPLGPGYRYTVSDLKTFFEDKGVTSLLLLNPDNPSGNSIPYEDLISLAGWTQERSIRLIV DESFIDFSEGGEETSLIDDKILKEYNHLVVIKSLSKSHGIPGLRLGIAVSGDHKLMDELQ QKLPVWNINSLAEYYLQILDKYTIDYRQACACFREERALFFEELQEVGYLAVFPSQANFF LCEVTHKYNARELAEVLLREHDILVKDCSLKRGMYGKQYIRIAIRSKEENRYLAGILKYK L >gi|226332005|gb|ACIB01000051.1| GENE 47 66319 - 66615 237 98 aa, chain + ## HITS:1 COG:SP1273 KEGG:ns NR:ns ## COG: SP1273 COG3475 # Protein_GI_number: 15901133 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: LPS biosynthesis protein # Organism: Streptococcus pneumoniae TIGR4 # 3 66 4 67 267 78 50.0 4e-15 
MNIDIKEAHRRMLYLLQSFDTVCKKHDIDYWLDYGTLLGAIRHQGFIPWDTDTDVGMLRS DYALFLEKGVPELPQDIFFRLRKQNRPWLHGAGWLRRG >gi|226332005|gb|ACIB01000051.1| GENE 48 66579 - 66944 211 121 aa, chain + ## HITS:1 COG:no KEGG:BF4396 NR:ns ## KEGG: BF4396 # Name: not_defined # Def: putative lipooligosaccharide cholinephosphotransferase # Organism: B.fragilis # Pathway: not_defined # 1 121 88 208 208 242 99.0 2e-63 MAPWSWLVEARLRDRHSRYVPDKKTPAEPMQFGGLQLDLFIYDWDGKYENALSNSFERNL SESRIHLRLDEVEYLDTARFEGVEFPVPSGYDTYLTRCYGDYRTFPPEEERQIPEVVAWS V >gi|226332005|gb|ACIB01000051.1| GENE 49 66976 - 67578 572 200 aa, chain + ## HITS:1 COG:no KEGG:BF4395 NR:ns ## KEGG: BF4395 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 200 10 209 209 368 100.0 1e-101 MESVRGIENPGMMGEMGKIIGFYRLYRQTAEEEWEEKAEVLLDEVMENCSLELPVTYGDG LCGVGVGIEYLLQEGFVEGDADEILWQIDCRVFNTINSRAIGTLGIGKGICGLAYYLYYR LSRRKGEEDIKVLRMKEHLIYLIDWIADSLPGVRESSLFEEVFFILCLLHRLNVFNAKVE KLMEYCEKGMIISGKEAVWI >gi|226332005|gb|ACIB01000051.1| GENE 50 67569 - 68330 326 253 aa, chain + ## HITS:1 COG:no KEGG:BF4394 NR:ns ## KEGG: BF4394 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 253 1 253 253 513 99.0 1e-144 MDLTRGVSIVIPLRVDNPERAENLRFILSLLLQQTEVSVDILEADTEQRFYLSETCERLR YRFVKDDDPVFYRTRYLNILLRSAKFPIAGIWDTDVIIPPVQLREAVGRICSGCVMCFPY DGRFIFLNEAMSDRIRKDVSVLEKVDATSIGMRPSVGGAFLVNRVAYLRAGGENEAFYGW GPEDAERVKRLEILELPIARVKGPLYHLHHPRGINSGFDMGERDKRNLEALLATCRRSKA EMLRWLEQPTFPD >gi|226332005|gb|ACIB01000051.1| GENE 51 68469 - 69659 721 396 aa, chain + ## HITS:1 COG:no KEGG:BF4191 NR:ns ## KEGG: BF4191 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 7 396 1 390 390 799 98.0 0 MKNTILLLNGLWLLGLVACQPKSSSLSECPIVNVRSALQEETPISMKEDVVSIQYIPLET TDSCLISNLLNLQVTTDYMFMYNGKTEEVLQFDRKGKFIRRVGRQGNGPGEYSMISELAV DDSNKELSIFQYGGDALVYSYDGTFLRNDTTAKQAGGMYVFADGKRALKGLVMKPFEQAP WAGALQQADGLLLKSKSLFPQGLSQDVCFMKEICFSPSTEGVLLFTACNDTVAGIYANGI 
EPAYVLKRENPTEYYMDIANINKFRDNTVETDQIIGVYDLFESPHYLYLRLYKGDAIFIQ RFDKKTGELKSQRIPDDYLECSAAIPGGNVIGMDNDIDGGIPFWPEYVMADGGRAQVVNA DILLALREKGYLKQAPDVLNIGDEANPVVILYTFKR >gi|226332005|gb|ACIB01000051.1| GENE 52 69668 - 70801 801 377 aa, chain + ## HITS:1 COG:no KEGG:BF4392 NR:ns ## KEGG: BF4392 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 377 1 377 377 764 99.0 0 MKSFTFCILLAHVLAFPLFAQKNAAAVTLNLAKAVTQSPKTVLMSELASDVRYFPLETTD NCLLGNECSIIYAGNSIIAGDAQTRSFYRFDKNGKFMNKIGRQGQGPEEYAVGLLFFTDP DNQKLYVQDFQDIICYGFNGKFLRRIPAPHLNMGTGAVDGQGSILYCDNNYFMRKDNPQQ LFLIDENGKKLKIWKGYMEPGKKYGVNLSTRDVMYRYGGDIYFKPALENLIYKIDANRKK TLAWKFDCSGKDVDVSANEIDPGKRFQSIAVQQVFESDRYFFVLYVLKNDSFVGLYDKQK KSFSNVIIKDDLAAGFDFTPPGTGLGNQLANARMVGYLSKGKRYSKALLPERKKELDELI NRLDEEDNPVMVVVTLK >gi|226332005|gb|ACIB01000051.1| GENE 53 70828 - 72318 822 496 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|90021240|ref|YP_527067.1| ribosomal protein S32 [Saccharophagus degradans 2-40] # 104 451 18 347 408 321 47 1e-86 MKITSYVSFLYCLCLMLASPTVQASEVRTAIFEGKPCINPPHVVGNYPATPFLFYIPTSG ERPIKWHAENLPKGLKLDKETGIIKGKVVEKGTYKVMLKAENALGTDTQELLINIGDELL LTPPMGWNSWNTFGRHLTEELLLQTADAMVENGMRDLGYAYINIDDFWQLPERGADGHIQ IDKTKFPRGIKYVADYLHERGFKLGIYSDAADKTCGGVCGSYGYEEIDARDFASWGVDLL KYDYCNAPAGRVEAMERYEKMGRALRATDRSIVFSICEWGQREPWKWAKKVGGHLWRVSG DIGDLWNRSTDEKGGLRGILNILEINAPLSEYARPGGWNDPDMLVVGIGGKSKSIGYESE GCTNEQYQSHFALWCMMASPLLCGNDVRQMNDSTLQILLNKDLIAIDQDPLGIQAERAIR ADHYDVWVKPLSDGSKAIACLNRISGPVDVELNVKTVEGLSLDRVYDVIEGSLVAEASTG WVVKLAPGECKVFICK >gi|226332005|gb|ACIB01000051.1| GENE 54 72402 - 73127 513 241 aa, chain - ## HITS:1 COG:no KEGG:BF4390 NR:ns ## KEGG: BF4390 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 241 1 241 241 456 99.0 1e-127 MQKFRLTMLFIICGNGFAYAQTFNETPIPAFILHKEMKTPQIFKLPEIKNTLSETNPAFN NSMPLVKQYELRKKFSYLDPVFTGYFNQQQYRLFNSRYFGYELYGSSYSLRGVGTQNMAG GRLVYRLNRQLAIRIGGNAYQYRSNGRMFNDFTLNADLTYRLNNWLTAYIYGQYRLDCNP 
NSGVQGFPLSPQSHYGASFRINLLERKEYGLDLNLGTDRSYNAATRQWENTYKIGPTIRL K >gi|226332005|gb|ACIB01000051.1| GENE 55 73230 - 74036 588 268 aa, chain + ## HITS:1 COG:PA3818 KEGG:ns NR:ns ## COG: PA3818 COG0483 # Protein_GI_number: 15599013 # Func_class: G Carbohydrate transport and metabolism # Function: Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family # Organism: Pseudomonas aeruginosa # 5 265 1 263 271 164 38.0 2e-40 MGLDLQQLTTEVCRIATEAGNFLRKERRSFSRERVVEKHAHDYVSYVDKESERLLVAQLS ALLPEAGFIAEEGSAVYKNEPYCWVIDPLDGTTNYIHDNAPYCVSIALRSCTELLLGVVY EVCRDECFYAWKGGKAWMNGDELHVSKIENIEEAFVITELPYNHRQYKRTAEYLLKQLYG VVGGIRMNGSAASALCYVAAGRFDAWAEAFIGKWDYSAAALIVLEAGGKVTDFFGSEYFI EGHHIIATNGPLHPVFQRLLKEMPPLEM >gi|226332005|gb|ACIB01000051.1| GENE 56 74071 - 74763 478 230 aa, chain + ## HITS:1 COG:DR1389 KEGG:ns NR:ns ## COG: DR1389 COG1040 # Protein_GI_number: 15806406 # Func_class: R General function prediction only # Function: Predicted amidophosphoribosyltransferases # Organism: Deinococcus radiodurans # 11 213 20 204 219 97 33.0 2e-20 MNTWFDSFWSLLFPRCCVVCGAPLSKEEECLCIRCNMNLPRTGFHLRKDNPVECLFWGRI PVLERASSFLFYRKGSDFRRILHLLKYSGYKELGEVMGRYMAAELISCGFFDHVDVIVPV PLHKKKQKLRGYNQSEWIARGISSVTGIPLNAKSVIREKNTETQTRKSTFERSENVEGIF KLCDVACFQGKHVLIIDDVLTTGSTTVACASTLFEVEGVRISVLTLAVAE >gi|226332005|gb|ACIB01000051.1| GENE 57 74810 - 75091 233 93 aa, chain - ## HITS:1 COG:NMB1575 KEGG:ns NR:ns ## COG: NMB1575 COG1359 # Protein_GI_number: 15677425 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Neisseria meningitidis MC58 # 1 80 1 80 97 65 45.0 3e-11 MEKKTIVARVEVLPGKEQAFLQAADALIKGTRAEEGNISYNLYQNPSQPVAFIFYEEYKD QRAMDIHAASPHFQAFGKAIKEMLASDLIIETF >gi|226332005|gb|ACIB01000051.1| GENE 58 75199 - 77274 1405 691 aa, chain - ## HITS:1 COG:mlr2247 KEGG:ns NR:ns ## COG: mlr2247 COG3533 # Protein_GI_number: 13472070 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Mesorhizobium loti # 104 688 
99 659 662 319 35.0 1e-86 MKTTKHLSVAAVLTVLMQMGCQSHTDNTRQTLHLPELNEVRIEDAFWSPKLDIWRKITTN DVLNKFEGKYTPFPGSTDTRNAFRNFDRVAEGQRDIKQHDGPEWYDGLVYESIRGIADFL ASHPNKELEKRIDGYVDRIYAAQQTEPTGYINTHTQLMENNHRWGDNGGLLRGQHDVYNA GMLIEAGVHYYQATGKTRLLEIATRFANYMADYMGPEPRMNIVPAHSGPEEAVMALYWLY KNEPELKDKLSIPVRESDYYNLATFWIENRGHHCGFPLWGTWGYRKSEKWIKDACYHQAE FGTHSRPSWGEYSQDSIPVLEQKTIEGHAVRATLMATGLTAAALENQSPQYIETAKRLWE NMAGKRMFITGGVGAIHEDEKFGPDYFLPTDAYLETCAAVGAGFFSQRMNQLTCNARYMD EVERVLYNNVLTGVSLSGDKYTYQNPLNTDKPDRWEWHVCPCCPPMFLKIMAAMPSYIYA YQGDNVYVNLFIGSEVRVPVGKSNSVRLKQLTSYPWHGALSIQVNPDKASTFSMKVRIPG WAQGTENPYDLYQSNLKAPVKLKVNQEDVLLRIVDGYAEINREWKKGDHIELELPMQPRL ITANKAVENLRGQVALASGPIIYCFEDADNPELQTFKLQAQTPLELSHDSNLLNGVNIIK CQGDIPAKAIPYYAVANREESHSYKVWIPQK >gi|226332005|gb|ACIB01000051.1| GENE 59 77271 - 78401 399 376 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|15900011|ref|NP_344615.1| aldose 1-epimerase [Streptococcus pneumoniae TIGR4] # 36 376 12 345 345 158 28 2e-37 MKTKLIILLILTTMIHTNTVNAQHSQLKRADFQQTIDGKQTDLYFLRNKNGIEIAITNFG GRVVEFWTPDKKGHFEDIVLGHDHVDKYLHYKGERFLGATIGRYGNRINKGKFTLNGQTY QLPINDTPNSLHGGFKGFDMVVWDVEQPDSQTLQLTYLSKDGEEGYPGNLQVSMSYKLTD KNEFIITHQAQTDKETVINLTHHSFFNLHGAGNKDINDHILMINADKFTPVDQTLIPTGI LQDVEGTPMDFRRPTPIGKRVNDSFEQLEFGHGYDHNWVLNRKTSNTPELAATVYEPASG RYLEVWTTEPGLQFYGGNFFDGTMTGKHEKKYNYRASLALETQHYPDSPNQPAFPSTTLL PGDTYKHICIYKINVQ >gi|226332005|gb|ACIB01000051.1| GENE 60 78437 - 79756 808 439 aa, chain - ## HITS:1 COG:ECs5014 KEGG:ns NR:ns ## COG: ECs5014 COG0477 # Protein_GI_number: 15834268 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Escherichia coli O157:H7 # 7 434 8 475 491 276 36.0 4e-74 MKNTAKNFMFYVAFVASLGGLLFGFDTAVISGAEKSIQVVYDLSDFSHGFTIAIALIGTI IGAFVCSKPVEKHGRLKALKIIAFLYFVSAVGSAAIIDWHSFLFFRFAGGLAVGASSVVG PMYIAEISPSRWRGRFVAFFQFNIVLGIVLAYFSNYWIHGIAHDWQWMLGVEAIPAIAFA 
LLLYTVPESPRWLVKQDREAEARHVIKKVSNADIEQEIHEIKESLVTIGASGEKLFQHKY RKPILYAFLIATFNQLSGINAILYYAPRIFEMSGVFTDSAMMQSIVIGLTNLTFTMIGMI LIDQVGRRKLLYIGSIGMTLSLALVAKGFYQDAFSGYYMLICLMGFIAFFAISLGAVIWV LISEVFPNNVRSKGQVLGSMTHWVWSALLSWMFPVFIRTGGTFIFSFFAIMMFLSFFFAL RLPETKNKSLEQIQKELTN >gi|226332005|gb|ACIB01000051.1| GENE 61 79769 - 81838 1387 689 aa, chain - ## HITS:1 COG:SPy2049 KEGG:ns NR:ns ## COG: SPy2049 COG1882 # Protein_GI_number: 15675819 # Func_class: C Energy production and conversion # Function: Pyruvate-formate lyase # Organism: Streptococcus pyogenes M1 GAS # 103 689 167 805 805 249 30.0 2e-65 MNERINYLKTYILDKRHHSQRRTPSSIGLDKLNTIYAQQGLSPVERATACFAALMNAELP VILPGEKIVFTRTLTQVPDIYTPEEWNEIKNKYYIHEKGTVCNISPNYAYTIQHGLEARK QEIRKRQENPSLNEKERVFLNSMYQCIISIQKLIEKYEQYAFLNNETEIAHTLHTIKTEG AQNFRQALQLLRILHFSIWEAGNYHNTLGRFDQYMYPFYQRDLENGTLTKEEAFDLLEEF FLVCNKDSDLYPGMQQGDNGQSLVLGGRDPEGKYLFNDLSRMCLQASYELKLIDPKINIR VDPKTPDEIFTLGSRLTKIGLGFPQYSNDDIIIPGLIRKGYSKEDAYNYVVAACWEFIIP NRAMDIPNIDAVSLIGCVDRCLEKLNTCSDYSSFYTLMEQEIQKEVNAIYEKHRNLYIIP SPMMSLLMDGTIERAKDISEGSYYNNYGIHGTGIATATDTLAALKKYYFEEQSLDYTTLL TAIRSNFKGYEELQKKLREEAPKMGQDDDYADLIAKDLLDSFDRSLADKRNERGGVYRAG TGTAMYYIFHSNQLRATPDGRNDGEMIPANYSPSLFLKQKGPISVIKSFTKQHLDRVVNG GPLTLEFDQSVFSNDETIEKLGMLVKTYIVLGGHQLQLNTVSRETLLHARKHPEQHKNLI VRVWGWSGYFVELDECYQNHVINRIEFGL >gi|226332005|gb|ACIB01000051.1| GENE 62 81835 - 82587 308 250 aa, chain - ## HITS:1 COG:AF1450 KEGG:ns NR:ns ## COG: AF1450 COG1180 # Protein_GI_number: 11499045 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Pyruvate-formate lyase-activating enzyme # Organism: Archaeoglobus fulgidus # 4 242 6 287 302 175 34.0 7e-44 MKTGTIFSVEEFAIHDGPGIRTTIFLKGCPLRCAWCHNPEGISPQPQYMIKKGVKSICGY QITVEELVTMIEKNRSIYTLNRGGVTLTGGEPLFQPDFVIELLRQLPDIHTAIETSGYAN THIFNEVTSLADLILFDIKHTDPEMHRKYTGVDNAIILENLALLCNSGRDFIIRIPLIPG VNDTRENMSAILEKIKDARNLIRVEILRYHRTAGAKYAMIGETYHPPFDTGKAPQIYNVF EENNIKNLIV >gi|226332005|gb|ACIB01000051.1| GENE 63 82819 - 86847 2630 1342 aa, chain - ## HITS:1 
COG:XF1330_1 KEGG:ns NR:ns ## COG: XF1330_1 COG3292 # Protein_GI_number: 15837931 # Func_class: T Signal transduction mechanisms # Function: Predicted periplasmic ligand-binding sensor domain # Organism: Xylella fastidiosa 9a5c # 26 636 28 631 740 110 23.0 2e-23 MKKSTFTLILFFSSVILYAQQNELMFHSLGSQHGLTYSAVRDILQDSKGYIWIATLKGLN RYDGYNIKQYYKSDDGLSSNCIEKLLLLGQDTLLMGTNEGLCLYDMMREKFTTIVPQTKA PLYVLDMAYDGRSVFIASDSGLYAYSKTEQSMPLLHKGLIVKVTLDMNGNVWAVSPNTIY CFRPNGQMTRKITATEVSPDYPVEFTSIYKDSQGTLWLGTTENGLYRYNKNYNQFVSVEF ASQDRKDMRYIRCIQEDMRGNLWIGTENGLFIYDYTDNSYIQYRQHAKDVQSGLTDNAIY TIYKSRGDIMWIGTFFGGVSYTSLTENNFHYLIADNGKQYLKGKAISNIIKDKNGALWFA SEDHGISILYPDGHIRYLNKSTHPSLNGDNVHALAEDHSGNIWIGNFIDGLQKVDLAKGY IRSYKNIAGGHAGLSNNSIYKLYVHNPDTMFIGTSQGVNIYHFRTDSFTPFLPDVFRLIR IDDITRDLKGNIWFSTHFNGIFRYHIPTHSIHRYQKGVTGCKTMTSDNIYCSFVDSKGEV WFGTSNGGLMKYNARADSIQAFGKENELRQRDIYSIQEDSFGYLWMSTDNGIFSFNPESR SFAHYKVSDNLVSNQFNACPGYKDPDGTLFFGSINGVCFFRPEGLNHNSPTNDIHLTFSD FRIFNKHVQPSPDGILQNNIDSTSAIRLPHGMNTLTFDFLVINYNENCQSQLSCEYYLEG METEWNATQQIPQSVTYTNLDPGTYQFHVRVIGKNGVVFDRRKITINIRPHFLLSGFMIT IYSLIGLLISFIIVRFYQVRMRDKMDIRIERMEKNNLRELNKHKLNFFTYITHEFKTPLS ILMAVFEDISIGRNNTITGEEMKIINRNIQRLQFLINQLLEFRSVETDHARIEYVKGDIM TYGRSIFELFIPVFRQKQIVFQYATSADSYYTVFDRDKIEKIISNLLSNAFKHSDPQSEI NFRIDVDKASGQLILSCHNSSSYIHPEQREAVMQPFHKTDSSDQKYSNTGIGLALVNGLV QLLSGTVEIESHQNSGTTFKVKLPLVEDSKDMIAPDETLDIVNSPDVVADTVYLLNNSGL KEDMNAANAEKKITVLLVEDNPDINNILKSKLLRLYKVKTAYNGQEAVELLKTHIIDIII SDIMMPYMDGYELSKYIKTSREYSHIPVILITSQPSKENELQGLSAGADAYIEKPFTFDE LNLRITNLLKAKNNIREHYHDMKIFQLNEELNNKDEEFIKSLTQFVIEHIENPELSVDQL TTHMNISRTQLYNKLKKLLNLSATEFINKIKIDVAKVKIIKTNLTIAEISWQLGFNNPSY FSKTFKRFCGVTPNEFKNGKSQ >gi|226332005|gb|ACIB01000051.1| GENE 64 87167 - 90337 2677 1056 aa, chain + ## HITS:1 COG:no KEGG:BF4380 NR:ns ## KEGG: BF4380 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1056 1 1056 1056 2070 99.0 0 MNLKDLNNLRADTEGRIKAVFLICMFVLVSAGGFAQNTKSISGTVREKGSNETVIGATVQ 
VKGTHNGVITNENGEYTIKNVSPGQVLVFSMIGMNTVEKTVGSQNRIDVLMDAGVLIDEV VVTGYQTQRKVDLTGSVSSLSSDQFMQTNPLSLEQALKGKISGVQVMNNDGAPGGGITIK IRGASSITAGSSPLYVIDGFPLPISDDPLESPLATISPDAIESISILKDVSSTAIYGAQG ANGVVLITTKKGSAGMSEISVKATYGISKLANSIPMLGAEDYMRAYMRDMIMSGRWQNAD FYQEYKDQIWNTNPSRFQFYPDLCLQNGTKQNYEVSYRGGTDRIQNSTIFSLMNEDGIAI NTGFKRFYFQTNNGIKLLPQLTLNTNLSYEHNIRSGAFWTEGNIFNEIQTFSPLVPKEWT FQEIDDNLYYTGKMDNPYRKLKDIDYSNKNNTFFGQAELVYNINDNWFVKGGIGVRIPKG EVKEFIPKTIQRGYDNNGLATYATQSGLNMRGVVQAGFNKVFNKVHSLSVNAVYEANTNK YETFNQEYSQFNTDLGWEGIYDAKSGNHVKSPGVSYEKIAMLSGVLMANYSYKGRYLLKA SMRADGSSKFSPDNRWGFFPSGALGWRVSEEEFFKNVSWLEKNVNNLKLRFSYGQVGNDQ IAPYAYAQTLSSSQRQAIFGDGAIPALYTSRMANPEISWEVTEEFNGGLDLDMFNNRLNI SLDLYTKTTRDMLLEQNLPRTSGFGKVTRNIGSVRNRGFEISVGGVLIDKKDFTWNATVN FSSNQSKVLSLGAETQMLEGRPVGSASGSENVLIKKGYPLGLFYGLQMEGIRSNWHSDYN GIGSADSPWWYATEREMPYGFPSFADTNGDGKVDMSDRVVIGDVNPVFIGGLNTRLRWKF IELAMDFSWSYGNDIINGNVYNLMNNGDIRNKSAVYYKDAWFANNPTGTFTGPGAIDWSG YMWAASNSEMVEDGSFLKMNNLAVTFRMPKNILKAWKIKDLALTYTINNVFCLTNYSGYD PEVRSGSSVNNRILPGVDISAYPYARSHIFSLNFKY >gi|226332005|gb|ACIB01000051.1| GENE 65 90350 - 92329 1695 659 aa, chain + ## HITS:1 COG:no KEGG:BF4379 NR:ns ## KEGG: BF4379 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 659 1 659 659 1389 100.0 0 MKKILLALLTSCALVSCEGYFDQLPKTELPSETFYTSYDAALRNVAILYANAGHVNDGIM TSDRFMMPSLMNEGPFDLTSTSGSVLNLWSKHYAYIAQANLILERLETNKEVIDENAGHS ALDKATITGSATEMLMGEVRFLRAYAYFTLYRYYGGVPLIIEPTGPKPDYVPRATRQEMF KFLYDEMAYALDKCLDNRSGIAYGRVTKGAVAGMLAKMKIFHASYIRRAEMYGNKINETT ADDVDKTTLYADAVKLCDDIIAGVYGTYKLEEYFPAVFTKRNNEIMFSVLAEEGVGTGNK IPMGFAGEGKYGATGGRNLTSWLTLLYDIPMWEHNYSFKDVCMDYGQVDRFNAESPKPGT TPYDLQNLYDRTGKYTITGDITRRMWSSVKGWVTGPNSGGAPIGLWVFEPAGRELGPEFY IEPGKVNDYTEEQLKVMDVALESHERAWWKNETNGQNKPNLWNCNWWQLGKFRNLNPSEL SSTFDINYGGVDYPVLRLAEIYLLKAEAQIMQGHVSEGVKTMNIIRDRACHQGSIKDMFV SQGDAPYTYQPGSVMMIPTNISPANALKELMYERLRELAAEDDCGWLDVARFPDVSMVDL ADICRYRDPLQFFDPFGDPARGEYLWHLFNEEKVYRVLMPIPFTELSFFPEMKQNPGYF >gi|226332005|gb|ACIB01000051.1| GENE 66 92342 - 93655 1195 437 
aa, chain + ## HITS:1 COG:no KEGG:BF4378 NR:ns ## KEGG: BF4378 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 3 437 1 435 435 855 100.0 0 MDMKTNKLIIGLLAVAATSLVSCEKFGADVETPSVSISVSPENPKVGEEVTVTINTDAQY LTLFTGDEGKNFERSRIKAIMEHDWDSFYEECYRVSYAKNGEKTLFYKYFKDYQSIEDVK KDFEFFGAIDNIQLVPYKKGDFPEALMEMSYIGTNQLKFTVTDRRIPSGIRMKPNIHIFG GLANQPGNTIIESRFVACDADRAIRKYSNDAWVAAYFGLHTEQLEDFQGYSAGYKYYTFQ DERSYGAFQTNDPLSRRPTEGFYKLGDMYNRDTYLQPFLDHGEKIILRQVDMYVNGRSTF VAADDSPYKYDLDGDGVLESYECELDPATGLPVHEADYTKYKGFQGDVYLSFIEMGTDEY EPWNTGVSLGSVYTTGGIQKTYKYIYTGAGDFTITAVATNVGDKDYKGIDYSEERSNSLD DYSHKRALSSVKVSVKP >gi|226332005|gb|ACIB01000051.1| GENE 67 93807 - 96554 2175 915 aa, chain + ## HITS:1 COG:no KEGG:BF4175 NR:ns ## KEGG: BF4175 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 915 3 917 917 1868 99.0 0 MKLKYCILSLLFFYLNISSIQAVIPQMEVSPDERGVSSLVFQGAGNVRNYVDHGKYLGDL SLTYEVRGKSYAVSLADITPLVLSNTPDKIQIFWQLPSDVRLYQTFTIKGEEVDWEIDFF NRSHHPVKVTDMWFALPVGALDESIQAHQNLNRHFSLNGNASFFYWTPLTGQGDILLMTM HKGTAIEYATQDGKYYLHSMNAVDRTNDSWRLPSTSKNVQPYEHYMTGFNFTLTGNHEEV KTKIYDKHGVVVKVAPGMVVTPEFEVYCALQSKLPVVELVAEYPEEIQITSLGQKEGDKY IYKFRFSHLGENLITVHYGDDLICFLDFFVTEPLETLIKKRARFIVDKQQHRDSSKWYNG LYSLWDMEKSELLSPDHLGDLREEFMVGGSDDPSNSKPVYVSEKNVIYPNKEEIASLEYY EENFVWGKLQRTDEEYPYPYGIYGSENWYQNRSGKYGGYEDGGSGKGRMWRTFDYTTHFA IYYNLYRIAEDNPEMVSYLDADGYLERAYRTAMAYFEVPYNILMGKQWAFHGWTDWAYKQ GNFHERYLLDIINALQQKGRLKDAAKLRREWEKKVTYMVYEDPWPFGSEMFVDRTAFESS YYVAEYAKLNPIKPEEQFWYDKNRKKWYSYTSFDISMIDRFMQSQLDGNLALRGLFEPGY ANLGTAWSGQYVNLDYMTQMGGVALLDYAYRFSDRPDRYINYGYNSLLASWALMNTGTKK TDFGYWYRGEQNDGAVGWAFSPYQNSRTYMNYIKVGRAPWRFDGEIDHGLTGGIHGSGVY LLDDPDFGLIGYGGNVRMDKDGTVSIIPFDGVRRQVRIMTPVRFSVELMQDGFRKDYPIT LRGTEELSFCIENRSDKPHNTTIRAEGMPEGKYTVMTDHKMITTFNIEAGNAHHPYYIEV PVTDKHTQVKLLKTN >gi|226332005|gb|ACIB01000051.1| GENE 68 96556 - 99018 1991 820 aa, chain + ## HITS:1 COG:TM0280 KEGG:ns NR:ns ## COG: TM0280 COG3533 # Protein_GI_number: 15643049 # 
Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Thermotoga maritima # 28 662 18 618 620 388 36.0 1e-107 MKNVLTGLFLLILATACSEEEPQTITPVPFNQVTLTDGFWKNRMQTEINVTVPFSVEQSA PAVERFRRCAAFLAGDSTALPETHRFISSDLYKVMEGVSYSLMIQPNKELEEFMDRVADL IAASQKDDGYLYISHICGNPDPREMGEKPYSWVVHSHELYNVGHLYEAAVAYYQATGKDK LLNVAIKSAKHVNKVFFEGGDPNYNGGKPINQAPGHEEIELALCKLYRVTNDPLYLDMAK KFLEIRGVTYRPEGEGVMAPTYAQQHAPVKEQTEAVGHAVRAAYLYTAMAQVDALTGLND YTKALNSIWTNLVTTRMHITGGLGAVEGMEGFGAPYELPNLTAYNETCAAVANVFFNYGM YLDSGDAKFLDIAELSLFNNSLAGINLHGDRFFYVNPLEADGVRRFNHGNGGRAKWFGCA CCPPNISRLILQVPGYMYAYSKDRVYLTLYGGSQTTIPLEGTRVKLEQTSAYPFDGKVRL TVQPEKGSKFSVCMRIPTWARSDEFVPGGLYPYKQPKQAEVELSVNGQKTDFKMEKGFAV IKRDWKPGDVVELNIPMPVRFVDCIPEVSENIGKTAVTRGPLVYCAEEVDNAGPVQRLYL GDTNEAQAKVANISSGVLQGLERITLPGMEKKVGGITSREITMIPYYAWCNRGDNRTMLV WLNEEASTASIGQQTMAYLRNVKGVNASSVAGGKNISVRALCDGKVSESSADKLTEQWLS EGSGKQWVQVDFREPFLLNSLSVYWLDDQDKITVPSGWSVEYKSDAGWTPLELYVTDSYQ MGTDRFNVVHPSGSLEVESVRILIEPQKGKSIGVSEIRFE >gi|226332005|gb|ACIB01000051.1| GENE 69 99394 - 100383 908 329 aa, chain + ## HITS:1 COG:SMc02846 KEGG:ns NR:ns ## COG: SMc02846 COG0524 # Protein_GI_number: 15963924 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar kinases, ribokinase family # Organism: Sinorhizobium meliloti # 4 308 6 313 330 189 36.0 8e-48 MDKIIGLGNALVDVLATLKDDTLLDEMGLPKGSMQLIDDAKLQQINERFSRMKTHLATGG AAANTILGLACLGAGTGFIGKIGNDAYGNFFRANLQRNGIEDKLLVSDLPSGVASTFISP DGERTFGTYLGAASTLKAEDLTLDMFKGYAYLLIEGYLVQDHDMILHAIELAKEAGLQVC LDMASYNIVAGDLEFFTLLINKYVDIVFANEEEAKAFTGKEDPKEALELISKKCSIAIVK VGGNGSYIRKGTEEIKVEAIPVKKVIDTTGAGDYFASGFLYGLTCGYSLEKCAKIGSILS GNVIQIVGTTIPGERWDEIKLNINEVLSE >gi|226332005|gb|ACIB01000051.1| GENE 70 100428 - 102113 1850 561 aa, chain + ## HITS:1 COG:aq_797 KEGG:ns NR:ns ## COG: aq_797 COG0793 # Protein_GI_number: 15606169 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Aquifex aeolicus # 39 409 36 398 408 228 37.0 2e-59 
MKKFLNRRNGVLLAAVLVAVAFFSFKSGDDRNFQIAKNLDTFNSIVKELDMFYVDTLDPN KTVREGIDYMLSSLDPYTEYYPEDDQAELQQMLNASFGGIGSLITYNQKLKRSMIAEPFE GTPAAKVGLKAGDILMEIDGKDLAGKNNQEVSQMLRGAVGTSFKLKVERPDEKGGTRPLE FDIVRQTIQTPMIPYDTIFNKNVGYINLSTFSGTPSKDFKKTFLKLKKEGITSLVIDLRG NGGGRLEEAVEIANFFLPRGKVIVTTKGKTKQASNTYKTLREPLDLDIPITVLVNGATAS ASEILSGAFQDFDRAVIVGSRTFGKGLVQTTRPLPYGGVMKLTTSKYYIPSGRCVQAIDY KHRNEDGSVGTIPDSLTTVFHTAAGREVRDGGGVMPDIEVKQEKLPNILFYLVRDNLIFD YATQYCLKHPSIPSPQEFKVTDADYNDFKAMVKKADFKYDQQSEKIMKTLKEAAKFEGYL DEASEEIKALEKKLTHNLDRDLDYFSKDIRSMIADEIIKRYYYTRGGIIQQLKDDDGLQA ALKILADPVKYKETLSAPVKK >gi|226332005|gb|ACIB01000051.1| GENE 71 102288 - 105131 256 947 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163788005|ref|ZP_02182451.1| 50S ribosomal protein L33 [Flavobacteriales bacterium ALC-1] # 787 944 466 620 622 103 36 6e-21 MKNLLLFLVFACTFSFSCLGSPVPFSPIVRNYSVLDYNAGNENWAVAQDECGVMYFGNNS GLLRYDGSRWKLFPLPTSGIVRAVYVASDRRIYVGSFEEFGYFEQNDLNLLEYHSLKEQV KGFDFHNDEIWTIVEQGGNIIFQSFGSYFIYDGKGTKGVRCPELPLNLFRIGDTLYSQLI NGGVCTFAGDKFIPLISRQELGDSDVLAGLPYPGGMLLLTRNSGGYIHTSSGIRPWHTDS DEELKRHTVNRAVMTKDSCYVIGTISNGLYAFSKEGHLLWKENADNQLENNTVLGLYCDM DNNIWTALDNGIAYVRNNSLIYHFEPVRRKVGMVYDVLVRDKDAYIASNQGLYRLEDTRL ELVPGLEEQAWTIGEWGGQVLCGHNKGTFQIKGMQARLLSDVRGAMCMRQAQINGEQLLI QGTYTFLNIYKKSAAGEWYFANSVGNFSHMAKNIEVDAHGNIWVQHMRKGLYRLRLDEEL KQVTDLKQYDSLSGNQGGNCCLFKVNGRVAFSDGRNFYTYDDMADSIIPYKAMNEQLATL RGIHTVDVMKGDLYWFLSDREAYLVRCTVSDFKVERRIPFSMFGNLPIEGLARMVYDRRN DCSYLCLNNSFARIAADSTGLYKSRQKPSLWVSGFSASDEQTGERIQLPVSGTDEIASAF NNISISLAYPVYNDFALNVRYRLEGLSASGKWTEGLPDLQKEFTRLPFGSYCFRAEVYDE NGVISSVDLPFRILRPWYLSYPAVAVYALSGIALLLGLLYGVYVYTKKKKDAVIERQRAR HEAEIEQQEKKIMELEKEQLEADLRFKSKELSGVVMTNIAHQEFLNSLKEELQQQKLSGQ YTRKNLDKLLSMINQNMVSDEENWNMFQSNFDRIHENFFRNLKEKFPDLTSGDLRLCALL RLNLPTKEIAKLMNISVRGVDAARYRLRKKLGLPPESSLTDFMIAFK >gi|226332005|gb|ACIB01000051.1| GENE 72 105309 - 107603 2224 764 aa, chain + ## HITS:1 COG:STM2166 KEGG:ns NR:ns ## COG: STM2166 COG1472 # Protein_GI_number: 16765496 # Func_class: G Carbohydrate transport and metabolism # 
Function: Beta-glucosidase-related glycosidases # Organism: Salmonella typhimurium LT2 # 28 761 31 764 765 697 47.0 0 MKHFVRRMQALAASLVVVAAGLQAQKAPRDMDRFIDQLMKKMTLEEKIGQLNLPVTGEIT TGQAKSSDVAKRIRNGEVGGLFNLKGVERIREVQRQAVEESRLGIPLLFGMDVIHGYETI FPIPLGLSCTWDMKAIEESARIAAVEASADGISWTFSPMVDVSRDPRWGRVSEGNGEDPF LGAAIARAMIRGYQGKDMSRNDEIMACVKHFALYGASEAGRDYNTVDMSRQRMFNEYMLP YQAAVEAGVGSVMASFNEVDGVPATGSKWLMTDVLRKQWGFDGFVVTDYTGINEMIDHGM GDQQTVAALALNAGVDMDMVSDAFSGTLKKSVEEGKVSAAAIDAACRRILEAKYKLGLFD DPYKYCDVNRPKKQIFTKEHRAIARKTASESFVLLKNEGVLPLSKKGTIAVVGPLANTRS NMPGTWSVAAVLDNAPSLVEGLREVVGDRAKVVTAKGSNLIGDADYEKRATMFGRELHRD NRTDRELLDEALKVAAGADVIVAALGESSEMSGESSSRTNLEMPDVQRALLQELLKTGKP VVLVLFTGRPLVLTWEEEHVPAILNVWFGGSEAAYAISDVLFGDVNPSGKLTATFPQNVG QIPLFYNHKNTGRPLQEGRWFEKFRSNYLDVSNEPLYPFGYGLSYTTFAYSDIHLSSTEM SADGELTATVTVTNTGSRDGAEVVQLYIRDLVGSITRPVKELKGFEKIFLKAGESRKVSF SITPELLKFYNYDLQFVCEPGDFDVMIGGNSRDVKKARFLLKGE >gi|226332005|gb|ACIB01000051.1| GENE 73 108350 - 111346 3017 998 aa, chain + ## HITS:1 COG:no KEGG:BF4370 NR:ns ## KEGG: BF4370 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 998 1 998 998 1904 99.0 0 MKKNLFTLCMMLLLSMLSFAQGSLTISGVVTEKKTGEPIIGASILLKGQSSGTITDFNGN YFISNVPSGATLVFSYIGMKTQEVKVTASSKLDISLEEDNQLIDEVVVIGYGIQRKSDLT GSVGAVKSKDLTKVATPNVANALQGRVSGVYISANGAPGSSPEVRIRGIGTTNNSNPLYV VDGMFMDDISFLSTHDIESMEVLKDASATAMYGSRGANGVIIVTTKQGTEGKAVVNFTAS EGFQFNNSSFEMANATEYATLLNEALVNTGGKPKFDDPASLGKGTNWFDEIFRVASVRDY QLSVSGGSEKVRYNLSAGYYQQDGIITGNTYNRFTLRANNSYKISKRLTLGHNLSASFSH KKNENSAVVKAAYTISPVKRPYNEDGSFMDSESASSANPVATLHYTNNDDWKERIVGSAF LNWNILNGLDFKTSLGIDYINGRRRNFVPKYYVSETQKNETNSLSKTWDRDFTWLWENLL TYDWKINDKNRLNLLGGITAQKRVYELLEGTGRDFFSDNENYWYLDQASAGSKSVANNGY HETMMSYLFRANYALMDRYLLTVSVRADGSSKFGPDNRWGYFPSVAAGWRVSEEAFLKDR VQWLSNLKLRGSWGQIGNDKIGNDKYRALANISPSYDAVFGGVFYPGGTITSLSNRSVHW ERSEQMDLGFDLGLLNNRFSLELDYYRRDTKDMLVTVDVPASVGLTPVETNVGAVRNSGV DFTVKWEDSLKDFRYGIRLTGTTIKNEVISLGGKRIASGDIGAGKSVQMTEEGKPIKYFY 
GYNVIGIFQNEAQIKEYNERAAAATGNAGQQYQNNVGPGDLIYEDVDGDGYITANDRKDL GSPTPKFIGGLGISASWKGFDLSIDFQGNFGNKIFNAKQVERFSGSDNWDRSFLDRWTPE NPNTMTPRMTLEGNNYQVSSRYVESGSYVKLQTVELGYTFPKSWMQKVSVQNLRVYFSGN NLAYFTGYNGFTPEVLGGIDRQIYPVTATCRFGLNVTF >gi|226332005|gb|ACIB01000051.1| GENE 74 111453 - 112973 1621 506 aa, chain + ## HITS:1 COG:no KEGG:BF4168 NR:ns ## KEGG: BF4168 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 506 1 506 506 1040 99.0 0 MLLCGEPKTITMKKIYSILLASALTLSSCADFLDVDSQGKLTEDVFFGEEEGALMSINAI YTQLRAWDIIGFSWFAITELPGDNSDTGSELADGSVARLNQFNDFTYDASTSEINGWWEG NYKAIASCNVALDNLGAVKNEELRVKCVAQARFFRGFFYFNLVRAFGGVPLVTKVLQPGE YNQPRATEEAVYQQIIEDLTYAAEHLPTRQEWGAKESGRATKGTAEGLLAKVYLFRQDYA NVKKYTGQVIGRGEYSLHRDYRDLFNPNSYYSDEVMLADQYLWGESTERNLESEYVKWQG IRGEMGWGMFSPSEALDQAYEAGDPRRTATIFYDGETLEGKGEIHFKKEVPPRANKKTIW PTGYWNENSFAKQNCHLIFLRYADVLLMYAEACNELGESREALDKLEMVRARARRTVHPA DMTVGLPEITETGKEKLREIIWNERRIELALEGHRFFDLIRADKMVPGYAEKMMKAHGKT NFSIAKHATFFIPQKQVDISQGVLKN >gi|226332005|gb|ACIB01000051.1| GENE 75 113208 - 114500 1138 430 aa, chain + ## HITS:1 COG:AGl3503 KEGG:ns NR:ns ## COG: AGl3503 COG5368 # Protein_GI_number: 15891871 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 36 426 14 404 425 358 45.0 1e-98 MKHTLLFLLMICACTLQATATGKQPRLTDDELMTLVQKQTFRYFWDFAHPESGLARERSN DRLEIATIGGSGFGVMAIIVGVERGFITREQGAERLLKIVEFLNKADSYHGIWAHWMDGA TGKTIPFSRKDDGADLVESAFMFEGLLAAHQYFTHDNPTENRIRGLINTLWHQAEWDWFT RGGEDVLYWHWSPNNGWAMNHQLKGQNECHITYILAASSPTYPIRESVYHKGWANSITFK NGKEYYGIRLPLGTDFGGPLFFTHYSYLGLDPRGLKDSYADYGEQMKAHTLINRAYCIDN PKKYKGYGRKCWGLTASDNHQGYSAHCPQNDLGVITPTAAISSIPYTPEHSLEAMRYFYE ELGDRLWGEYGFKDAFNLTENWFASSYLAIDQGPIIVMIENYRSGLIWKLFMSHPDVQRG LKRLGFGSEE >gi|226332005|gb|ACIB01000051.1| GENE 76 114522 - 116243 1452 573 aa, chain + ## HITS:1 COG:no KEGG:BF4166 NR:ns ## KEGG: BF4166 # Name: not_defined # Def: xylosidase/arabinosidase # Organism: 
B.fragilis_NCTC9343 # Pathway: not_defined # 1 573 1 573 573 1134 98.0 0 MKQLFLLLLALGSPALALAQDMRQDTYCNPLNIDYTYMIYNSDKDISYRSGADPAVVRFR GEYYMFVTRSNGYWHSKDLLDWDFIAPKDWYPQGSNAPAAHNYKDSVLYVTGDPSGSMSI LYSDNPKSGEWKATPAILHNLQDPDLFIDDDGKAYMFWGSSNVYPIRGMELDKNHRFLPK GEVKELFNLDMPRHGWERFGENHSDTVLGGYIEGPWLTKHNGKYYMQYGAPGTEFNVYAD GVYVADHPMGPYTYQKHNPVSYKPGGYMNGAGHGSTVQGPGGEYWHFASMALSINVNWER RLCMFPAGFDKDGIMYVDTRFGDYPRYAPAVPGKKGQFRGWMLLSYRKPVTASTAKGEFG PDALTDERTKSFWLAEANDERQWVLIDLEKPARVCAVQVNYHDYRSNLYGRIPGLRHRYV IEGSSDGETWNILVDRRSSYKDTPNDYVELEVPTTARYIRYKNIDVPTPNLAISELRVFG LGFGKAPRPPQKLALDRHTDRRDVTVRWESVKGAQGYNVLWGVAPDKLYSSWMVYGGNEL EMKSLTIDQDYYFAVEAFNENGVSLPSETKYVE >gi|226332005|gb|ACIB01000051.1| GENE 77 116256 - 117029 773 257 aa, chain + ## HITS:1 COG:TM0033 KEGG:ns NR:ns ## COG: TM0033 COG4099 # Protein_GI_number: 15642808 # Func_class: R General function prediction only # Function: Predicted peptidase # Organism: Thermotoga maritima # 34 256 170 395 395 159 38.0 5e-39 MKQFVLFFISLFLLGVGTRSQEMYQKKVFISSRGDSLNYRLLRPEVEKTGLQYPLVLFLH GAGERGSDNEKQLTHGGQMFLNPVNREKYPAFVLVPQCPEKDYWAYTSRPSSFIPSEMPA GQEITPVLRAVKELLDTYLDLPQVDRNRIYVVGLSMGGMATYDLAVRFPDTFAAAVPICG TVNPVRLADAANVRFRIFHGDADNVVPVEGSREAYKALKNLGVEVEYIEFPGCNHGSWNP AFNYPGFMEWVFAQRKR >gi|226332005|gb|ACIB01000051.1| GENE 78 117222 - 119102 1669 626 aa, chain - ## HITS:1 COG:CPn0373 KEGG:ns NR:ns ## COG: CPn0373 COG0821 # Protein_GI_number: 15618288 # Func_class: I Lipid transport and metabolism # Function: Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis # Organism: Chlamydophila pneumoniae CWL029 # 5 612 9 605 613 383 37.0 1e-106 MDLFNYSRRETSEVNIGAVPLGGPNPIRIQSMTNTPTQDTEACVAQAKRIVDAGGEYVRL TTQGVKEAENLMNINIGLRSTGYMVPLVADVHFSPKVADVAAQYAEKVRINPGNYVDPGR TFKQLEYTDEEYAAELQKIRDRFVPFLNICKENHTAVRIGVNHGSLSDRIMSRYGDTPEG MVESCMEFLRICVDEKFTDVVISIKASNTVVMVKTVRLLVSVMEQEGMSFPLHLGVTEAG DGEDGRIKSALGIGALLADGLGDTIRVSLSEEPEAEIPVARKLVDYITSRRNHPYIPGME APDFNYLSPVRRKTRPVRNIGGNHLPVVLADRMDGRMETHPQFTPDYIYAGRALPEQTEP 
GVQYILDADVWKGEPDTWPAFNYAQLELMETCAAELKFLFTPYMALTREVVACLKQHPEA VVVSQSNHPNRVGEHRALAHQLAVEGLQNPVIFFQHYAEDTAEDLQVKAGADMGALIFDG LCDGIYLFNQGKLSHAVIDATAFGILQAGRIRTSKTEYISCPGCGRTLFNLQSTIARVKE ATSHLKGLKIGIMGCIVNGPGEMADADYGYVGAGRGKISLYKQKECIEKNIPEEEAVEKL IELIKANGDYEEKTSSLSSPKEKEDK >gi|226332005|gb|ACIB01000051.1| GENE 79 119107 - 119613 731 168 aa, chain - ## HITS:1 COG:PAB1077 KEGG:ns NR:ns ## COG: PAB1077 COG0041 # Protein_GI_number: 14521838 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase # Organism: Pyrococcus abyssi # 3 163 7 168 174 173 58.0 2e-43 MTPIVSIIMGSTSDLPVMEKAAQLLNDLHVPFEMNALSAHRTPEAVEEFAKNARSRGIKV IIAAAGMAAHLPGVIAASTPLPVIGVPIKSSLDGMDALLAIVQMPPGIPVATVGINGALN AAILAVQMLSLEDKELETKFIAYKEGLKKKIVKANEELKEIKYEFKTN >gi|226332005|gb|ACIB01000051.1| GENE 80 119805 - 120185 569 126 aa, chain - ## HITS:1 COG:PA5214 KEGG:ns NR:ns ## COG: PA5214 COG0509 # Protein_GI_number: 15600407 # Func_class: E Amino acid transport and metabolism # Function: Glycine cleavage system H protein (lipoate-binding) # Organism: Pseudomonas aeruginosa # 2 126 3 127 129 123 50.0 9e-29 MNFPQNVKYTNEHEWIRLEGNVAYVGITDYAQEQLGDIVFVDIPTEGETLEAGEVFGTIE VVKTISDLFLPVAGEVVEQNPALEENPELVNKDPYGEGWLIKMKPANAADLDNLLDAEGY KAVVNA >gi|226332005|gb|ACIB01000051.1| GENE 81 120216 - 120896 752 226 aa, chain - ## HITS:1 COG:no KEGG:BF4362 NR:ns ## KEGG: BF4362 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 226 1 226 226 366 100.0 1e-100 MAEEEKIEQHIIADRTLIRTARIISGILTPFSIPFLAFLVLFLFSYLRIMPLQYKLIVLG IVYCFTILMPTMTIFLFRKVNGFARQDLSDRKKRYVPILLTIISYVFCLMMMHKLNIPWY MTGIILASLVVLVICIIVNLKWKLSEHMAGMGGIVGGLVSFSALFSYNPVWWLCLFILVA GILGSARIILQHHTLGEVLAGFTVGFVCSLLVLHPLSNVLFRIFLF >gi|226332005|gb|ACIB01000051.1| GENE 82 120924 - 122402 1837 492 aa, chain - ## HITS:1 COG:RSc0408 KEGG:ns NR:ns ## COG: RSc0408 COG1508 # Protein_GI_number: 17545127 # Func_class: K Transcription # Function: DNA-directed RNA polymerase 
specialized sigma subunit, sigma54 homolog # Organism: Ralstonia solanacearum # 19 492 15 497 499 221 33.0 4e-57 MAQGSRQIQSQAQQQIQTLSPQQILVVKLLELPAVELEDRVHAELLENPALEEGKEESTS DETTPAETSEGEMEGDTDYDSLSDYLTEDDIPDYKLQENNRSKGEQAEEIPFSDATSFYE ILREQLGERNLTEHQRELAEYLIGSLDDDGLLRKSLESIGDELAIYAGINATEEELEEAL RIVQDFDPPGLGARSLQECLLIQIRRKKQSHGTDPLLETEEEIISECYEEFTRKHWEKII KKLGLDEEHFYKALEEITKLNPRPGASLGEAIGRNLQQIVPDFIVDTYDDGTINISLNNR NLPELRMSRDFTEMVEEHTKNKANQSKESKEAMMFLKQKMDAAQGFIDAVKQRQNTLMTT MQAIIDLQRPFFLEGDESLLRPMILKDVAERTGLDISTISRVSNSKYVQTNYGIYPLKFF FSDGYTTEDGEEMSVREIRKILKECIDSEDKKKPLTDDELAEILKEKGYPIARRTVAKYR QQLNIPVARLRK >gi|226332005|gb|ACIB01000051.1| GENE 83 122666 - 124039 1575 457 aa, chain + ## HITS:1 COG:FN1949 KEGG:ns NR:ns ## COG: FN1949 COG0006 # Protein_GI_number: 19705251 # Func_class: E Amino acid transport and metabolism # Function: Xaa-Pro aminopeptidase # Organism: Fusobacterium nucleatum # 1 454 1 454 462 405 46.0 1e-113 MFSKETYTQRRALLKKTLGSGVLLFLGNDECGLNYEDNTFRYRQDSTFLYYFGLSFAGLS AIIDIDEDKEIIFGDELTIDHIVWMGTQPTLKEKSERVGVDITMPSADIVSYLHKAVQKG QAIHYLPPYRAEHKLKLMDWLGVPPARQEGSVPFIRAVIAQRNYKSSEEIVEIEKACDIT ADMHITAMKILRPGMREWEVSAAMEAVAHAAGGDLSFATIATVNGQTLHNHYHGNIVKPG DLFLIDAGAETEMGYAGDMSSTVPADKKFTRRQREVYEIQNAMHLESVKALRPGIPYMDV YDLSARVMVEGLKGLGLMKGNAEDAVREGAHALFYPHGLGHMMGLDVHDMENLGELWVGY NGQPKSTQFGRKSQRLAIPLEPGFVHTVEPGIYFIPELIDLWKGQKKFTDFINYDKVETY KDFGGIRNEEDYLITETGARRLGKKIPLTPDEVEALR >gi|226332005|gb|ACIB01000051.1| GENE 84 124622 - 125083 373 153 aa, chain + ## HITS:1 COG:no KEGG:BF4157 NR:ns ## KEGG: BF4157 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 153 1 153 153 261 88.0 7e-69 MSGLYYNLALRNSDVGNKESEKLYYCVLKNRGMVDQETFVNYLSQISGVQEALCLAMISK LGAAVINYLQDGRSVDIPHLGTFNLTASSTGVKTLDEAKASQIKAINLRFSTKEETDLVL SRCALTRSVSLTNLNDVIKEKDPGGDIVDDPTA >gi|226332005|gb|ACIB01000051.1| GENE 85 125484 - 127421 1591 645 aa, chain + ## HITS:1 COG:no KEGG:BF4156 NR:ns ## KEGG: BF4156 # Name: not_defined # Def: hypothetical protein 
# Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 645 1 645 645 1350 99.0 0 MRKLKNVLRASSCCAFSILMCLPAAGQNAWSEADCVTLLDSTSVTVQPNGSGSFAVYKSF KVQTPKGAVNNHVIKYDYDPLTAFARFKQVTVQRANGETIQVDVTKTCDYAAPARAIYWG ARQIMLELGRLEPGDVVSYEISKKGFTYALLAGAEDDDARFIPPMHGQFYDIVPFWVNEP TRRKVYVVSMPMEKELQFQFYNGACTSSMRYEDGRKVYTFAVDEVLPFRKEPNMVDLFDE APKLMMSSTPKWEDKSLWFHDLNEAYGSFAALPEAQKKVDELIKGKKTEMEKIAVLTHWV ADNIRYAGITMGEGEGFTLHNTKMNYTDRCGVCKDIAGTLISFLRMAGFEAYPAMTMAGS RIESIPADHFNHCVAVVKLANGTYMPLDPTWVPFCRELWSSAEQQQNYLPGIPGGSDLCL TPVSAPENHYVRITADNKIDAKGTLKGSFTITAEGQSDSSIRRIFTQGWQTEWQSTMESQ LLNVSPKARMLGVDYGKAPKDYQTGPIRITFRYEIPDYALVGDRELLLKPMVMNNLYASV LSFLRIDTGLETRQYGFRDACSRLVELDETIALPRGYRLAGEPRSEQKAAPAADFEGSLQ QVGNKLVLKQKLALKKRIYRAADWEGFRAAVNAYKSFADYLIVKL >gi|226332005|gb|ACIB01000051.1| GENE 86 127439 - 129025 1250 528 aa, chain + ## HITS:1 COG:no KEGG:BF4155 NR:ns ## KEGG: BF4155 # Name: not_defined # Def: exported protein, ATP/GTP-binding # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 528 1 528 528 955 99.0 0 MTKIYTLLLLCLFALTLPVTAREAEFKKIKESWTLQADGTQVYRQSKVLTLYTHTAMNRT YGESFITYDPRYQTLQIHESYTRQKDGNIVKTPANALVEVLPSAAANAPAFNALREMVVV HTGLELGATIYLDYSITTKPGYLPGIQICRLLEHSSPVKEYEISITVPRGQHLEYALNNH KAKASVTESEGMKQVSWTLKNLPALSREPQVSVIGGDLPLLTATTEEKPLSFLTSQWQPE GDAGIRALAGKLTAGAKDDKEKVTAIRDYVIDNLYYSSLPLAESGYRIRPAAEVIRSAYG TEAEKANLMAALLKAAGLKAGIGAVCAPSENTASLGVGSIRELFVWTDAGGSQQALSLKG KTPSAVSWQKDYAYVASLTQPFELNLPPVTLRKDYALQPDAANAKDGYLVFTLPVERGSL ADSQYVRFNSKRTSNLLLPGLADESFTYTVDTPAGMESVTRPMEKKMDNAVGTLTISIRP EGAKTRVTRSLKLKKQMIRPADYAAFRQLMTEYGAVDGLTLVYRKPLP >gi|226332005|gb|ACIB01000051.1| GENE 87 129416 - 131242 1457 608 aa, chain + ## HITS:1 COG:CC0523 KEGG:ns NR:ns ## COG: CC0523 COG3568 # Protein_GI_number: 16124778 # Func_class: R General function prediction only # Function: Metal-dependent hydrolase # Organism: Caulobacter vibrioides # 23 246 4 246 259 88 30.0 3e-17 MRKLLLLLFSALFVLSAQAEDVLRLMTYNVRNANGMDGICNYQRVANVINNARPDIVAIQ ELDSMTARSNRTDVLKELAERTQLHPCFAPAIDYDGGKYGIGILSKETPLRVQTFALPGR 
EEPRTLLVAEFPEYVFACTHLSLTEEDRMKSLEILKSVTANTRKPFFLAGDFNSDADSGF IKDLKSTFQILSNPKQPTYPASEPKETLDYLIALKQETPTFVVNSARVIDEPLASDHRPL LVEVRMAVPENRICRTKPYLQNPVGGGITVMWQTNVPAYCWVEYGTDPSNLKRARTLLDG QVVCGNTLHKIRLDSLQPGQKYYYRTCSQEILLYQAYKKVFGNTARTELREFTLPAAGDD SFTAVVFNDLHKQSKTFQALCEQIKGIDYDFVVFNGDCVDDPANHDQATAFISELTEGVE GDRVPVFFMRGNHEIRNAYSVGLHSLFDYVGDKTYGSFNWGDTRIVMLDCGEDKPDDHWV YYGLNDFTKLRNDQVGFLKRELASKEFKKAAKRILIHHIPLYGNDYENLCSGLWGKLLEK APFNISLNAHTHEYAYHPKGSLGNNFPVIIGGGYSMKSATVMVIEKKKDILKVKVLNTKG EVLLELTV >gi|226332005|gb|ACIB01000051.1| GENE 88 131469 - 133022 1302 517 aa, chain - ## HITS:1 COG:ZrcsC_1 KEGG:ns NR:ns ## COG: ZrcsC_1 COG0642 # Protein_GI_number: 15802771 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Escherichia coli O157:H7 EDL933 # 280 515 422 672 700 82 26.0 2e-15 MKISTLILTILVCFSTIVDAQGTQNRANELMKQAQSSLEQKEYTKARYLFIQAYGAFATQ ENYPKAVECGIQGAALYHRENYYKEAFDLCRGMDQLVWAGEQKQQKQFYPLRFQITKERL QMYIGLKNAAQAKIQLDKLAETVGLAKNDSLSENLLYTQASYYYTFGQNTQGDACFQKLI NQYKAQKNYDKVNECYKKLIAIGRKSNNASLVARTYENFIVWTDSVKAMTAQDELNVLKR KYDESQQIIQEKEDTLSGKQYMIVGLCTLVVILIAGLIFVAIVLLRFIAGNRKLKKSVQI ANEHNELKTQFIQNISAQMEPTLDTLASSAAELSDKAPQQAQQMQGQVAALKKFSNDIQE LSTLENSLTEPYEMKEINVNTFCEATMDKVKEFVQPEVSTVVNAAKLQIKTNPEQLERIL IHLLKNAAEYTESGKIFLDFKKRGAHTHQFIISDTGTGIPVEKQENLFKPFTEIKDLTEG DGLGLPICALIATKMNGSLTLDTSYTKGSRFILELHT >gi|226332005|gb|ACIB01000051.1| GENE 89 133260 - 134933 691 557 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|39938628|ref|NP_950394.1| ribosomal protein L13 [Onion yellows phytoplasma OY-M] # 4 557 1 546 546 270 29 2e-71 MKQMLQIYCKNNNISKEFPIGSSLLDIYYGFNLNFPYQVVSAKVNNRSEGLNFRVYNNKD VEFLDVRDSSGMRTYVRSLCFVLYKAVSELFPNGKLFVEHPVSKGYFCNLRIGRPVTLED VSQIKKRMQEIIAEDITYHRIECHTTEAVRVFSERGMNDKVKLLESSGSIYTYYYTLGGT ADYYYGNLLPSTGFIHLFDLVKYYDGLLLRIPNKENPSVLEDVVKQEKMLDVFKEHLRWN YIMGLNNVGDFNIACEEGHATDLINVAEALQEKKIAQIADSIFHRGENGTRVKLVLISGP SSSGKTTFSKRLSIQLMTNGLKPYPISLDNYFVDREDTPLDENGNYDYESLYALDLELFN 
TQLQALLRGEEVELPRYNFMLGKKEYKGDKLRIDEHTVLILEGIHALNPELTPQIPAANK FKIYVSALTTISLDDHNWIPTTDNRLLRRIIRDFNYRGYSAQETISRWPSVRAGEDKWIF PYQENADVMFNSALLFEFAVLRCHAEPILTSVPRNCPEYAEAYRLLKFIKYFTPVQDKEI PPTSLLREFLGGSSFKY >gi|226332005|gb|ACIB01000051.1| GENE 90 135017 - 136714 1730 565 aa, chain + ## HITS:1 COG:TP0771 KEGG:ns NR:ns ## COG: TP0771 COG1283 # Protein_GI_number: 15639758 # Func_class: P Inorganic ion transport and metabolism # Function: Na+/phosphate symporter # Organism: Treponema pallidum # 8 552 47 585 593 262 33.0 2e-69 MEYSFYDFLKLIGSLGLFLYGMKIMSEGLQKVAGDRLRGILTAMTTNRVTGVLTGVLITA LIQSSSATTVMVVSFVNAGLLTLAESISVIMGANIGTTVTAWIISIFGFKVDMAAFALPL LAIALPLIFSGKSNRKSIGEFIFGFSFLFMGLSYLKANAPDLNANPEMLAFVQNYTDMGF FSVLLFLFIGTILTMIVQASAATMAITLIMCANGWISLELGAALVLGENIGTTITANLAA LTANSQAKRAALAHFVFNVFGVIWVLIIFHPFMVFVNWVVDTFFHPGSAEVAISYKLSAF HSIFNICNVCLLIWGVKLIERTVCAIIRPKEEDEEPRLRFITGGMLSTAELSILQARKEI HLFSERIHRMFGMVQDLLHTEKDDDFNKLFSRIEKYENISDNMELEIANYLNQVSEGRLS SESKLQIRAMLREVTEIESIGDSCYNLARTISRKRQTNQDFTEKQYEHIHFMMKLTNDAL SQMIVVVEKPEHQSIDVNKSFNIENEINNYRNQLKNQNILDVNNKEYDYQMGVYYMDIIA ECEKLGDYVVNVVEASSDVKEKKAS >gi|226332005|gb|ACIB01000051.1| GENE 91 136792 - 136953 171 53 aa, chain - ## HITS:1 COG:alr1174 KEGG:ns NR:ns ## COG: alr1174 COG1592 # Protein_GI_number: 17228669 # Func_class: C Energy production and conversion # Function: Rubrerythrin # Organism: Nostoc sp. 
PCC 7120 # 2 49 182 229 237 75 60.0 2e-14 MKYICTVCDFIYDPEIGDPEGGIEPGTQFEDIPDDWVCPLCGVGKEDFEPYNG >gi|226332005|gb|ACIB01000051.1| GENE 92 137088 - 138761 1529 557 aa, chain + ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 138 405 38 308 328 178 38.0 2e-44 MMWQIVLSLVCVAFLGAILWEKRQGKLMQSASDRKLENVNCLFDAILTHIHAYVLVIDSD FKVLKTNYYDLTHLSPDGKEKRVGDLLQCRNALSAKGGCGTHAFCQECPVRGAIGNAFRQ KKEFTDLQSSLNIAIDEHEFVKCDVMISGVFMTLDSEERMVLTVHDITSIKQTEEALAEA KEKAENADRSKSAFLANMSHEIRTPLNAIVGFSELLAAANTEEEKQKYLEILHTNSELLL QLVNDILDLSKIEAGTLEFVYSDVDINLLLNDLEQLFRMKIGSNSPVQIITEPGLPSCMV HTDRNRIAQVVSNFVSNAIKFTTEGSIRIGYQSSENGLRFYVSDTGSGISADKLEGVFDR FVRLQSDKNGNGLGLSICKTIVNKLGGEIGAESEVGKGSTFWFTLPEHSDIKPKVIIEKE QEELPSAVRVPVIDAGSDKKLSILVAEDMEDNYRLCEAILASRYELHWAHNGEEAISLFL KFQPDIILMDIRMPEVNGYEATEAIRQMSATVPIIALTAFAYEEDRQKIMHSGFTDFLTK PISSKVLLGKLESLKKI >gi|226332005|gb|ACIB01000051.1| GENE 93 138825 - 141509 2480 894 aa, chain + ## HITS:1 COG:SPAPB2B4.04c KEGG:ns NR:ns ## COG: SPAPB2B4.04c COG0474 # Protein_GI_number: 19114802 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Schizosaccharomyces pombe # 25 888 212 1139 1292 457 34.0 1e-128 MTAKNDDYFHLGLTDQEVLQSREKYGANLLTPPKRPSLLKLYLEKFEDPVVRVLLIAAVF SLIISVIENEYAETIGIIAAILLATGIGFYFEYDANKKFDLLNAVNEETLVKVIRNGRIQ EIPRKDVVVGDIVVLETGEEIPADGELIEAISLQVNESNLTGEPVINKTIIEADFDEEAT YASNLVMRGTTVVDGHGSMKVLRVGDATEIGKVARQSTEQTTEPTPLNIQLTKLANLIGK IGFTVAGLAFLIFFIKDVVLYFDFGALNGWHDWLPVLERTLKYFMMAVTLIVVAVPEGLP MSVTLSLALNMRRMLATNNLVRKMHACETMGAITVICTDKTGTLTQNLMQVHEPNFYGLK DGGKLADDDISRLISEGISANSTAFLEETGKGEKPKGVGNPTEVALLLWLNSQKRNYLEL REGARVLDQLTFSTERKFMATLVKSPLIGKKVLYIKGAPEIVLGKCKEVILDGRRVDSVE YRSTVEAQLLGYQNMAMRTLGFAFRLVEDNEPDDCVALVSENNLNFLGVVAISDPIRPDV PAAVAKCQSAGIGIKIVTGDTPGTATEIARQIGLWKPEDTEHNRITGVAFAELSDEEALD 
RVMDLKIMSRARPTDKQRLVQLLQQKGAVVAVTGDGTNDAPALNHAQVGLSMGTGTSVAK EASDITLLDDSFNSIGTAVMWGRSLYKNIQRFIVFQLTINFVALLIVLLGSIVGTELPLT VTQMLWVNLIMDTFAALALASIPPSESVMNDKPRRSTDFIISKAMQHNIFGVGTLFLVVL MAMIYYFTNADGGMTVQRLTIFFTFFVMLQFWNLFNARVFGTTDSAFKGLTKSYGMELIV LAILGGQFLIVQFGGAVFRTEPLDWQTWLIIIGSSSLVLWIGELIRLVKRLTQK >gi|226332005|gb|ACIB01000051.1| GENE 94 141627 - 142250 538 207 aa, chain + ## HITS:1 COG:L111950 KEGG:ns NR:ns ## COG: L111950 COG1011 # Protein_GI_number: 15672092 # Func_class: R General function prediction only # Function: Predicted hydrolase (HAD superfamily) # Organism: Lactococcus lactis # 6 195 3 192 207 130 37.0 2e-30 MKRKGIKNLIFDFGGVLINLDRQRCIENFRKLGLEKVDELLGMYSQQGIFMQHEKGLISS AQFRDSIRGQITQEVTDEQIDAAWNSFLVDIPRFKLDMLLKLREKYVVYLLSNTNEIHWK WACEHAFPYRGFRVEDYFEKMYLSYEMKMVKPAEEIFRGVLDDANLDPRETFFIDDSEAN CQGAQALGISTYTVKPGEDWRPLFAEN >gi|226332005|gb|ACIB01000051.1| GENE 95 142270 - 143211 404 313 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163762565|ref|ZP_02169630.1| ribosomal protein S2 [Bacillus selenitireducens MLS10] # 11 303 15 309 317 160 34 4e-38 MQIIRNIPGVLPEPCVATIGFFDGVHMGHRFLIEQVRELAAARGLRSALITFPIHPRKVM NADYRPELLTTSEEKLALLEETGVDYCFMLDFTREVSHLTAHEFMSDILRDRYRVQTLVI GYDHRFGHNRSEGFEDYCRYGSEMGIEVIRARACVCDDLHISSSVIRKMLHQGEVDRAAR CLGYDYFLDGTVVSGYQVGRKIGFPTANLSVDDPDKLVPADGVYAVHVTFDGHTYNGMLN IGTRPTIGNGPERSIEVNILHFHSDIYDKFIRLSFVKYLRPELKFDGIEGLIAQLHQDAA DVEALLGKIQNVN >gi|226332005|gb|ACIB01000051.1| GENE 96 144125 - 144913 591 262 aa, chain + ## HITS:1 COG:RSc3402 KEGG:ns NR:ns ## COG: RSc3402 COG1266 # Protein_GI_number: 17548119 # Func_class: R General function prediction only # Function: Predicted metal-dependent membrane protease # Organism: Ralstonia solanacearum # 120 213 142 236 285 63 44.0 3e-10 MKTAIKLILIYLGIQLICGGLIGIPFTIIARMNGGSVDATRVSELTLAPSMLLSMAVMFF YLWKAHYIPKDKTSWSFISFPFLIITFLIGLSMTVLMDLLTAVLSWVPDILEQQFDALQS GWLGIVAITLLGPILEELLFRGGATKALLERYSPRKAIFLSALLFGVFHLNPAQIVAAFF GGLLLAWVYYRTRSLIPCILIHIVNNSISVMLSLTYPDADTIRDVTGTTPYYLLIAVAAM VFVGCFLRIKQVTVPGTWRKDE 
>gi|226332005|gb|ACIB01000051.1| GENE 97 144949 - 145587 406 212 aa, chain + ## HITS:1 COG:no KEGG:BF4343 NR:ns ## KEGG: BF4343 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 212 1 212 212 386 100.0 1e-106 MKRRLNILCLLVFLVFCYSVFNMGRDFGHGFSEGMKISHSGNQSVEQAINFRIATLMPKD FSSFKDSVYNEKTGSYVPVAYTQMVTNVKVDRSTWMMVAQVISGLLAVFAIIASTVLFIQ LIVAINKSDIFNWKNVRRLRWLGVALLLNFISEAVPALMNDYELSSVFSLSGYSMETSID SVMLVILGLVSLIVGEVFAIGLKMKEEQDLTI >gi|226332005|gb|ACIB01000051.1| GENE 98 145616 - 145843 256 75 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163756262|ref|ZP_02163377.1| 50S ribosomal protein L20 [Kordia algicida OT-1] # 1 65 1 65 67 103 73 6e-21 MAIIVNLDIMMARRKISLGELAEKIDITPANLSILKTGKAKAIRFSTLEAICRVLDCQPG DILEYQVDEEDGGTS >gi|226332005|gb|ACIB01000051.1| GENE 99 145971 - 146399 465 142 aa, chain - ## HITS:1 COG:XF0994 KEGG:ns NR:ns ## COG: XF0994 COG2166 # Protein_GI_number: 15837596 # Func_class: R General function prediction only # Function: SufE protein probably involved in Fe-S center assembly # Organism: Xylella fastidiosa 9a5c # 14 137 23 146 146 105 42.0 2e-23 MSINELQDEVVAEFSDFDDWMDRYQLLIDLGNEQEPLDEKYKTEQNLIEGCQSRVWLQAD EVDGKIIFKAESDALIVKGIIALLIKVVSGHTPDEILNSELYFIDKIGLKDHLSPTRSNG LLSMVKQMRMYALAFKAKEAKG >gi|226332005|gb|ACIB01000051.1| GENE 100 146409 - 147398 910 329 aa, chain - ## HITS:1 COG:no KEGG:BF4340 NR:ns ## KEGG: BF4340 # Name: not_defined # Def: leucine aminopeptidase precursor # Organism: B.fragilis # Pathway: not_defined # 1 329 6 334 334 688 100.0 0 MFILSAFILFSAFSCGTNGSKTTSTDITEKKVIVQVPQFDADSAYKYIQAQVDFGPRTPN SKGHVACGDYLAAKLAEHGAKVISQNAELPAYDGTLLKARNIIGSFKPESKKRIALFAHW DTRPWADNDPNEKNHHTPILGANDGASGVGVLLEIARQINQQQPELGIDIIFLDAEDYGA PQFYTGEHREDQWCLGAQYWARTPHVDGYNARFGILLDMVGGKDATFFKEVYSEKYAKGI NKKVWKKANDAGYGRYFINEVGGQITDDHLFINRLAGIPTIDIIPNDENCELSSFGPTWH TVNDNMDAIDRSTLKAVGQTVLEVIYNEK >gi|226332005|gb|ACIB01000051.1| GENE 101 147470 - 148384 756 304 aa, chain - ## HITS:1 COG:alr3273 KEGG:ns NR:ns ## COG: alr3273 COG1619 # 
Protein_GI_number: 17230765 # Func_class: V Defense mechanisms # Function: Uncharacterized proteins, homologs of microcin C7 resistance protein MccF # Organism: Nostoc sp. PCC 7120 # 6 299 67 363 368 145 30.0 9e-35 MNLQFPPFLHEGDKVAIVSPSSKIDSIFLKGAKTRLSSWGLTPVMGDHVRSSWGSYAGAT HQRLKDFQAAMDDEEIKAILCSRGGYGAVHLLDKLDFTRFRNHPKWLIGFSDITALHNLF QYNGFASLHAPMARHLAVTGEEDPCSLYLRDILFGKLPGYKCHRHKLNHLGQAKGILRGG NMAVFHGLRGTPYDIPPEGTILFIEDVGERPYAIERMMYNLKLGGVLEKLSGLIIGQFTE YKEDYSLKKDLYSTLDALVKEYDFPICYDFPVGHVTENLPLINGAEVEFVSGKKGVELLI NPPI >gi|226332005|gb|ACIB01000051.1| GENE 102 148449 - 149243 817 264 aa, chain - ## HITS:1 COG:CC0380 KEGG:ns NR:ns ## COG: CC0380 COG2273 # Protein_GI_number: 16124635 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-glucanase/Beta-glucan synthetase # Organism: Caulobacter vibrioides # 25 261 21 296 301 121 31.0 1e-27 MKTKTFLALFCLLCISSGFSSCQRHSSDQWKLVWEDNFDQKTGFDPQVWSKIPRGKSDWN NYMTDFDSCFDMRDGNMVLRGIINHSQPNDTAPYLTGGIYTKGKKAFSNGRLEIRAKLNG ARGEWPAIWMLPIDAPWPMGGEIDIMERLNHDTIAYQTIHTNYTHNLGIKDNPLSHSVGA INPDDYNVYSVEMYPDSIAFYINDTHTFTYPRIETDKEGQFPFDQPFYLLIDMQLGGSWV GAVDPKELPVEMYVDWVRFYQKEK >gi|226332005|gb|ACIB01000051.1| GENE 103 149426 - 150367 743 313 aa, chain + ## HITS:1 COG:CAC3076 KEGG:ns NR:ns ## COG: CAC3076 COG0280 # Protein_GI_number: 15896327 # Func_class: C Energy production and conversion # Function: Phosphotransacetylase # Organism: Clostridium acetobutylicum # 4 304 2 301 301 169 31.0 8e-42 MEPIRNFDQLTAHLKTLNRRKRIAVVCANDPNTEYAIARALDEEIAEFLMIGDSAILQKY PSLQKYPEYVKTLHIEDPDEAAREAVRIVREGGADILMKGIINTDNLLHAILDKEKGLLP KGKILTHLAVMQIPTYDKLLFFSDAAVIPRPTLQQRIEMIWYAICTCRRFGIEQPRISLI HCTEKVSAKFPHSLDYVNIVELAEAGEFGNVIIDGPLDVRTSCEQASGDIKGIVSPINGQ ADVLIFPNIESGNAFYKSVSLFAKADMAGLLQGPICPVVLPSRSDSGLSKYYSIAMACLT ASTRSAERGRCSE >gi|226332005|gb|ACIB01000051.1| GENE 104 150390 - 151505 1052 371 aa, chain + ## HITS:1 COG:CAC1660 KEGG:ns NR:ns ## COG: CAC1660 COG3426 # Protein_GI_number: 15894937 # Func_class: C Energy production and conversion # 
Function: Butyrate kinase # Organism: Clostridium acetobutylicum # 20 371 4 356 356 369 51.0 1e-102 MDKLNINSTPSSVKNGRGERLLVINPGSTSTKIAVYENETPLLVRNIRHTVEELSAFPRV IDQFEFRKSLVLRELEVNDIPFRFDAVIGRGGLVKPIPGGVYEVNEAMKRDTLHAMRTHA CNLGGLIADELAAALPGCRAFIADPGVVDELEEVARITGSPLMPRITIWHALNQKAIARR YAAEHGTRYEDLDLIVCHLGGGISVAVHRHGRAVDANNALDGEGPFSPERAGTLPAGQLI DLCFSGKFTKDELKKRISGRAGLTAHLGTTDIPAIIQSIEAGDDHARLVLDAMIYNVAKS IGAASTVLCGKVDAILLTGGIAYSDYVISRLRERISFLAPVFVYPGEDEMEALALNALGA LRGELPVQVYQ >gi|226332005|gb|ACIB01000051.1| GENE 105 151616 - 152374 759 252 aa, chain - ## HITS:1 COG:no KEGG:BF4335 NR:ns ## KEGG: BF4335 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 252 1 252 252 468 100.0 1e-130 MKTELKGWLADNTVTTDNKEDKILVLESAGNLTLSDVLDEMKKEDTGLRAETLKHAVDLF QRTVSELVLNGYSVNTGLFRAVPQFRGVIDGGVWNPERNSIYVSFNQDKDLREAIARTGV KILGAKGDSAYFIGGEDAATRATDGSATAGRNYRLQGKNIKVAGTDPAVGIVLIDEKGTE TKLPMDMIAVNNPSEVLVLLPADLTDGIYKLRLTTQYTSGNRQLKTPHVISQTIVIGNTT EGDGDIVDDPTA >gi|226332005|gb|ACIB01000051.1| GENE 106 152400 - 152630 153 76 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566574|ref|ZP_04844027.1| ## NR: gi|253566574|ref|ZP_04844027.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 76 214 289 289 138 100.0 1e-31 MNATKGLFLPAAGGRDNGTSGTSATYNPGKHGNYWVTESASSSKGYYLYFDGSLVVCGDN PKTKALTVRCVKGTKQ Prediction of potential genes in microbial genomes Time: Wed May 18 00:02:23 2011 Seq name: gi|226332004|gb|ACIB01000052.1| Bacteroides sp. 3_2_5 cont1.52, whole genome shotgun sequence Length of sequence - 1242 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 84 - 311 221 ## BF0078 hypothetical protein 2 1 Op 2 . 
- CDS 337 - 588 230 ## BF4334 hypothetical protein - Prom 611 - 670 4.0 Predicted protein(s) >gi|226332004|gb|ACIB01000052.1| GENE 1 84 - 311 221 75 aa, chain - ## HITS:1 COG:no KEGG:BF0078 NR:ns ## KEGG: BF0078 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 50 1 50 252 87 92.0 1e-16 MKTELKGWLADNTVTTDNKEDKILVLESAGNLTLSDVLDEMKEEDTIVEARDRSLSVYSK ECDIDTRSVLVCFLL >gi|226332004|gb|ACIB01000052.1| GENE 2 337 - 588 230 83 aa, chain - ## HITS:1 COG:no KEGG:BF4334 NR:ns ## KEGG: BF4334 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 83 455 536 536 154 97.0 7e-37 MWFMNNSIGLFLPAAGCRQDGKGSNTSPTTDSGTDGYYWSSDIGNGNNTGKRLYFGKRYN TADVADHAKNAGLTVRCVKGTKQ Prediction of potential genes in microbial genomes Time: Wed May 18 00:06:04 2011 Seq name: gi|226332003|gb|ACIB01000053.1| Bacteroides sp. 3_2_5 cont1.53, whole genome shotgun sequence Length of sequence - 138961 bp Number of predicted genes - 142, with homology - 137 Number of transcription units - 61, operones - 26 average op.length - 4.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 402 - 448 8.5 1 1 Op 1 . - CDS 512 - 1042 559 ## BF4330 hypothetical protein 2 1 Op 2 . - CDS 1042 - 2127 963 ## BF4330 hypothetical protein 3 1 Op 3 . - CDS 2140 - 5460 2968 ## BF4329 hypothetical protein - Prom 5512 - 5571 4.8 4 2 Tu 1 . - CDS 5721 - 6653 975 ## COG3712 Fe2+-dicitrate sensor, membrane component - Prom 6690 - 6749 4.4 - Term 6700 - 6734 -0.9 5 3 Tu 1 . - CDS 6754 - 7314 508 ## BF4327 putative RNA polymerase ECF-type sigma factor - Prom 7410 - 7469 6.2 - Term 7407 - 7451 10.2 6 4 Tu 1 . - CDS 7478 - 9604 1542 ## PROTEIN SUPPORTED gi|62291006|ref|YP_222799.1| polynucleotide phosphorylase/polyadenylase + Prom 9587 - 9646 8.6 7 5 Tu 1 . + CDS 9801 - 10952 1099 ## BF4325 hypothetical protein + Term 11052 - 11107 13.2 + Prom 10984 - 11043 6.4 8 6 Op 1 . 
+ CDS 11133 - 11597 641 ## COG0782 Transcription elongation factor + Term 11603 - 11640 5.2 9 6 Op 2 . + CDS 11648 - 12052 416 ## COG0537 Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases + Term 12076 - 12122 10.9 - Term 12170 - 12209 6.3 10 7 Tu 1 . - CDS 12221 - 13303 844 ## COG0836 Mannose-1-phosphate guanylyltransferase - Prom 13487 - 13546 80.3 + TRNA 13460 - 13546 50.8 # Leu CAA 0 0 + Prom 13462 - 13521 80.3 11 8 Op 1 . + CDS 13720 - 13899 150 ## BF4315 hypothetical protein 12 8 Op 2 . + CDS 13932 - 14204 211 ## BF4314 hypothetical protein 13 8 Op 3 . + CDS 14211 - 14528 151 ## BF4314 hypothetical protein + Term 14587 - 14635 4.1 - Term 14573 - 14622 2.1 14 9 Op 1 . - CDS 14631 - 15185 407 ## PROTEIN SUPPORTED gi|229873878|ref|ZP_04493445.1| acetyltransferase, ribosomal protein N-acetylase 15 9 Op 2 . - CDS 15210 - 15665 235 ## BF4312 hypothetical protein 16 9 Op 3 . - CDS 15670 - 16884 867 ## BF4311 hypothetical protein 17 9 Op 4 . - CDS 16963 - 17160 81 ## BF4117 hypothetical protein 18 9 Op 5 . - CDS 17163 - 17345 167 ## - Prom 17413 - 17472 5.3 + Prom 17275 - 17334 4.5 19 10 Tu 1 . + CDS 17554 - 17868 130 ## BF4309 hypothetical protein + Prom 18039 - 18098 4.3 20 11 Op 1 . + CDS 18195 - 18686 406 ## COG0716 Flavodoxins 21 11 Op 2 . + CDS 18707 - 20467 1063 ## COG1154 Deoxyxylulose-5-phosphate synthase + Term 20593 - 20633 7.4 - Term 20578 - 20624 2.0 22 12 Tu 1 . - CDS 20641 - 21537 417 ## COG2207 AraC-type DNA-binding domain-containing proteins 23 13 Tu 1 . + CDS 21909 - 22262 364 ## BF4112 hypothetical protein + Term 22293 - 22341 7.0 + Prom 22397 - 22456 5.2 24 14 Tu 1 . + CDS 22604 - 23122 349 ## COG0350 Methylated DNA-protein cysteine methyltransferase 25 15 Tu 1 . - CDS 23179 - 24015 400 ## BF4110 transcriptional regulator GerE - Prom 24069 - 24128 1.8 + Prom 23800 - 23859 4.1 26 16 Op 1 . + CDS 24024 - 24611 536 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 27 16 Op 2 . 
+ CDS 24636 - 25286 627 ## COG3506 Uncharacterized conserved protein + Term 25294 - 25346 7.6 - Term 25285 - 25331 4.6 28 17 Op 1 . - CDS 25378 - 25890 429 ## BF4107 putative transmembrane protein 29 17 Op 2 . - CDS 25908 - 28208 1613 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 30 17 Op 3 . - CDS 28205 - 29587 1025 ## COG1453 Predicted oxidoreductases of the aldo/keto reductase family 31 17 Op 4 . - CDS 29615 - 31120 816 ## COG1145 Ferredoxin 32 17 Op 5 9/0.000 - CDS 31144 - 31899 509 ## COG3279 Response regulator of the LytR/AlgR family 33 17 Op 6 . - CDS 31908 - 32930 630 ## COG3275 Putative regulator of cell autolysis - Prom 32961 - 33020 4.5 34 18 Op 1 . - CDS 33030 - 33122 76 ## 35 18 Op 2 . - CDS 33202 - 33759 531 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes + Prom 33778 - 33837 4.4 36 19 Tu 1 . + CDS 33951 - 36482 2022 ## BF4091 putative O-antigen related protein + Term 36489 - 36527 2.1 + Prom 36698 - 36757 4.7 37 20 Tu 1 . + CDS 36804 - 37289 212 ## BF4290 hypothetical protein - Term 37222 - 37259 -0.9 38 21 Tu 1 . - CDS 37381 - 38001 528 ## BF4289 hypothetical protein - Prom 38051 - 38110 2.7 + Prom 38210 - 38269 3.7 39 22 Tu 1 . + CDS 38332 - 40413 1474 ## COG5545 Predicted P-loop ATPase and inactivated derivatives + Prom 40566 - 40625 7.1 40 23 Op 1 . + CDS 40669 - 41682 591 ## BF4287 hypothetical protein 41 23 Op 2 . + CDS 41715 - 43457 1050 ## BF4286 hypothetical protein + Term 43493 - 43551 2.9 - Term 43483 - 43537 5.0 42 24 Tu 1 . - CDS 43562 - 43870 230 ## BF4089 hypothetical protein 43 25 Tu 1 . + CDS 43901 - 44065 84 ## 44 26 Tu 1 . - CDS 43984 - 45699 848 ## BF4089 hypothetical protein - Prom 45860 - 45919 4.2 + Prom 45655 - 45714 5.6 45 27 Tu 1 . + CDS 45875 - 46807 804 ## BF4283 tyrosine type site-specific recombinase + Term 46822 - 46858 2.7 46 28 Tu 1 . - CDS 47034 - 47204 75 ## BF4282 hypothetical protein - Prom 47431 - 47490 1.7 + Prom 47171 - 47230 7.8 47 29 Op 1 . 
+ CDS 47253 - 49694 2542 ## BF4087 hypothetical protein 48 29 Op 2 . + CDS 49719 - 50693 938 ## BF4086 hypothetical protein + Prom 50705 - 50764 1.9 49 29 Op 3 . + CDS 50784 - 51359 647 ## BF4085 hypothetical protein + Term 51378 - 51431 1.2 - Term 51315 - 51350 -1.0 50 30 Tu 1 . - CDS 51559 - 52518 808 ## BF4278 hypothetical protein - Prom 52551 - 52610 5.1 + Prom 52425 - 52484 7.5 51 31 Op 1 . + CDS 52621 - 53250 473 ## BF4083 hypothetical protein 52 31 Op 2 . + CDS 53299 - 53862 449 ## COG2096 Uncharacterized conserved protein 53 31 Op 3 . + CDS 53933 - 54154 363 ## PGN_1678 hypothetical protein + Term 54179 - 54234 16.3 - Term 54164 - 54227 18.4 54 32 Op 1 . - CDS 54230 - 55417 1288 ## BF4216 hypothetical protein 55 32 Op 2 2/0.000 - CDS 55450 - 56508 1121 ## COG0252 L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 56 32 Op 3 . - CDS 56554 - 57864 1425 ## COG2704 Anaerobic C4-dicarboxylate transporter 57 33 Tu 1 . + CDS 58183 - 59616 1611 ## COG1027 Aspartate ammonia-lyase + Term 59652 - 59699 7.1 + Prom 59654 - 59713 6.2 58 34 Tu 1 . + CDS 59763 - 60923 899 ## BF4036 hypothetical protein + Term 61017 - 61065 7.2 - TRNA 61105 - 61195 42.4 # Pseudo CGA 0 0 + Prom 61497 - 61556 8.1 59 35 Op 1 . + CDS 61604 - 62182 732 ## BF4035 putative transmembrane protein 60 35 Op 2 . + CDS 62200 - 64716 2170 ## COG1198 Primosomal protein N' (replication factor Y) - superfamily II helicase 61 35 Op 3 . + CDS 64758 - 66557 1667 ## BF4209 hypothetical protein + Term 66607 - 66653 3.5 + Prom 66751 - 66810 8.1 62 36 Tu 1 . + CDS 66842 - 68416 680 ## Aasi_1729 hypothetical protein + Prom 68483 - 68542 7.0 63 37 Op 1 . + CDS 68734 - 68946 180 ## 64 37 Op 2 . + CDS 68915 - 69826 574 ## gi|253566637|ref|ZP_04844090.1| predicted protein + Term 69844 - 69896 0.0 + Prom 69943 - 70002 9.8 65 38 Tu 1 . + CDS 70095 - 70724 399 ## COG1961 Site-specific recombinases, DNA invertase Pin homologs + Term 70725 - 70785 12.9 - Term 70724 - 70759 3.0 66 39 Op 1 . 
- CDS 70776 - 71267 248 ## gi|253566639|ref|ZP_04844092.1| predicted protein 67 39 Op 2 . - CDS 71301 - 71699 424 ## gi|253566640|ref|ZP_04844093.1| predicted protein + Prom 72030 - 72089 4.6 68 40 Tu 1 . + CDS 72111 - 72578 457 ## COG0394 Protein-tyrosine-phosphatase + Term 72743 - 72798 2.5 - Term 72415 - 72484 6.2 69 41 Tu 1 . - CDS 72553 - 74610 1853 ## COG1480 Predicted membrane-associated HD superfamily hydrolase - Prom 74644 - 74703 4.6 + Prom 74447 - 74506 3.6 70 42 Op 1 . + CDS 74705 - 76222 1596 ## COG0008 Glutamyl- and glutaminyl-tRNA synthetases 71 42 Op 2 . + CDS 76235 - 77455 905 ## COG1519 3-deoxy-D-manno-octulosonic-acid transferase + Term 77615 - 77654 -0.9 - Term 77366 - 77394 1.0 72 43 Op 1 . - CDS 77644 - 78156 602 ## COG0663 Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily - Prom 78187 - 78246 2.6 73 43 Op 2 . - CDS 78251 - 80029 1245 ## COG0006 Xaa-Pro aminopeptidase 74 43 Op 3 . - CDS 80010 - 80186 154 ## gi|265767526|ref|ZP_06095192.1| predicted protein - Prom 80248 - 80307 5.1 + Prom 80004 - 80063 6.1 75 44 Op 1 . + CDS 80203 - 80394 310 ## PROTEIN SUPPORTED gi|53715487|ref|YP_101479.1| 30S ribosomal protein S21 76 44 Op 2 . + CDS 80468 - 81349 682 ## COG4974 Site-specific recombinase XerD 77 44 Op 3 . + CDS 81364 - 81663 200 ## PROTEIN SUPPORTED gi|163755828|ref|ZP_02162946.1| 30S ribosomal protein S21 + Term 81708 - 81757 3.4 + TRNA 81762 - 81838 73.6 # Thr TGT 0 0 + TRNA 81910 - 81995 65.6 # Tyr GTA 0 0 + TRNA 82024 - 82096 68.9 # Gly TCC 0 0 + TRNA 82107 - 82178 81.6 # Thr GGT 0 0 78 45 Tu 1 . + CDS 82229 - 83413 1396 ## PROTEIN SUPPORTED gi|119502908|ref|ZP_01624993.1| Ribosomal protein S19 + Term 83427 - 83468 9.7 + TRNA 83467 - 83542 81.1 # Trp CCA 0 0 + Prom 83469 - 83528 80.2 79 46 Op 1 . 
+ CDS 83554 - 83745 157 ## BF4198 preprotein translocase SecE subunit 80 46 Op 2 45/0.000 + CDS 83762 - 84304 615 ## COG0250 Transcription antiterminator 81 46 Op 3 55/0.000 + CDS 84365 - 84808 738 ## PROTEIN SUPPORTED gi|53715481|ref|YP_101473.1| 50S ribosomal protein L11 82 46 Op 4 43/0.000 + CDS 84824 - 85522 1157 ## PROTEIN SUPPORTED gi|53715480|ref|YP_101472.1| 50S ribosomal protein L1 83 46 Op 5 47/0.000 + CDS 85538 - 86050 844 ## PROTEIN SUPPORTED gi|53715479|ref|YP_101471.1| 50S ribosomal protein L10 84 46 Op 6 28/0.000 + CDS 86099 - 86473 588 ## PROTEIN SUPPORTED gi|53715478|ref|YP_101470.1| 50S ribosomal protein L7/L12 + Term 86499 - 86543 10.1 + Prom 86499 - 86558 5.3 85 47 Op 1 58/0.000 + CDS 86578 - 90390 2924 ## PROTEIN SUPPORTED gi|163796927|ref|ZP_02190884.1| 30S ribosomal protein S12 + Term 90400 - 90457 11.1 + Prom 90401 - 90460 3.1 86 47 Op 2 . + CDS 90537 - 94820 4286 ## COG0086 DNA-directed RNA polymerase, beta' subunit/160 kD subunit + Term 94840 - 94895 11.1 87 48 Tu 1 . - CDS 94895 - 96031 646 ## COG3274 Uncharacterized protein conserved in bacteria - Prom 96057 - 96116 4.5 88 49 Tu 1 . - CDS 96161 - 96670 165 ## BF4189 hypothetical protein - Prom 96743 - 96802 4.2 + Prom 96611 - 96670 8.1 89 50 Op 1 . + CDS 96862 - 97107 287 ## BF4188 hypothetical protein + Term 97121 - 97160 -0.9 90 50 Op 2 . + CDS 97170 - 97430 479 ## BF4187 hypothetical protein + Term 97452 - 97518 5.3 + Prom 97501 - 97560 4.8 91 51 Tu 1 . 
+ CDS 97744 - 98055 493 ## BF4186 hypothetical protein + Term 98086 - 98157 6.9 + Prom 98218 - 98277 4.8 92 52 Op 1 56/0.000 + CDS 98303 - 98704 686 ## PROTEIN SUPPORTED gi|29348140|ref|NP_811643.1| 30S ribosomal protein S12 + Prom 98712 - 98771 6.7 93 52 Op 2 51/0.000 + CDS 98864 - 99340 809 ## PROTEIN SUPPORTED gi|53715469|ref|YP_101461.1| 30S ribosomal protein S7 94 52 Op 3 4/0.000 + CDS 99394 - 101511 1970 ## COG0480 Translation elongation factors (GTPases) + Term 101525 - 101565 4.3 + Prom 101513 - 101572 3.4 95 52 Op 4 40/0.000 + CDS 101593 - 101898 494 ## PROTEIN SUPPORTED gi|53715467|ref|YP_101459.1| 30S ribosomal protein S10 96 52 Op 5 58/0.000 + CDS 101917 - 102534 1073 ## PROTEIN SUPPORTED gi|53715466|ref|YP_101458.1| 50S ribosomal protein L3 97 52 Op 6 61/0.000 + CDS 102534 - 103160 1047 ## PROTEIN SUPPORTED gi|53715465|ref|YP_101457.1| 50S ribosomal protein L4 98 52 Op 7 61/0.000 + CDS 103177 - 103467 479 ## PROTEIN SUPPORTED gi|53715464|ref|YP_101456.1| 50S ribosomal protein L23 99 52 Op 8 60/0.000 + CDS 103473 - 104297 1419 ## PROTEIN SUPPORTED gi|53715463|ref|YP_101455.1| 50S ribosomal protein L2 100 52 Op 9 59/0.000 + CDS 104318 - 104587 465 ## PROTEIN SUPPORTED gi|53715462|ref|YP_101454.1| 30S ribosomal protein S19 101 52 Op 10 61/0.000 + CDS 104623 - 105033 679 ## PROTEIN SUPPORTED gi|167764367|ref|ZP_02436492.1| hypothetical protein BACSTE_02751 102 52 Op 11 50/0.000 + CDS 105039 - 105773 1256 ## PROTEIN SUPPORTED gi|53715460|ref|YP_101452.1| 30S ribosomal protein S3 103 52 Op 12 . + CDS 105797 - 106231 736 ## PROTEIN SUPPORTED gi|53715459|ref|YP_101451.1| 50S ribosomal protein L16 104 52 Op 13 . 
+ CDS 106237 - 106434 319 ## PROTEIN SUPPORTED gi|53715458|ref|YP_101450.1| 50S ribosomal protein L29 105 52 Op 14 50/0.000 + CDS 106431 - 106700 456 ## PROTEIN SUPPORTED gi|53715457|ref|YP_101449.1| 30S ribosomal protein S17 106 52 Op 15 57/0.000 + CDS 106703 - 107068 596 ## PROTEIN SUPPORTED gi|29348126|ref|NP_811629.1| 50S ribosomal protein L14 107 52 Op 16 48/0.000 + CDS 107089 - 107409 534 ## PROTEIN SUPPORTED gi|53715455|ref|YP_101447.1| 50S ribosomal protein L24 108 52 Op 17 50/0.000 + CDS 107409 - 107966 910 ## PROTEIN SUPPORTED gi|53715454|ref|YP_101446.1| 50S ribosomal protein L5 109 52 Op 18 50/0.000 + CDS 107973 - 108272 495 ## PROTEIN SUPPORTED gi|53715453|ref|YP_101445.1| 30S ribosomal protein S14 + Term 108292 - 108320 -0.9 110 52 Op 19 55/0.000 + CDS 108327 - 108722 664 ## PROTEIN SUPPORTED gi|53715452|ref|YP_101444.1| 30S ribosomal protein S8 111 52 Op 20 46/0.000 + CDS 108738 - 109307 969 ## PROTEIN SUPPORTED gi|53715451|ref|YP_101443.1| 50S ribosomal protein L6 112 52 Op 21 56/0.000 + CDS 109329 - 109673 561 ## PROTEIN SUPPORTED gi|53715450|ref|YP_101442.1| 50S ribosomal protein L18 113 52 Op 22 . + CDS 109679 - 110197 851 ## PROTEIN SUPPORTED gi|53715449|ref|YP_101441.1| 30S ribosomal protein S5 114 52 Op 23 . + CDS 110208 - 110384 281 ## PROTEIN SUPPORTED gi|53715448|ref|YP_101440.1| 50S ribosomal protein L30 115 52 Op 24 53/0.000 + CDS 110415 - 110861 736 ## PROTEIN SUPPORTED gi|53715447|ref|YP_101439.1| 50S ribosomal protein L15 116 52 Op 25 2/0.000 + CDS 110866 - 112212 891 ## PROTEIN SUPPORTED gi|163796899|ref|ZP_02190856.1| 30S ribosomal protein S11 117 52 Op 26 9/0.000 + CDS 112227 - 113024 680 ## COG0024 Methionine aminopeptidase 118 52 Op 27 . + CDS 113028 - 113246 239 ## PROTEIN SUPPORTED gi|15900168|ref|NP_344772.1| translation initiation factor IF-1 119 52 Op 28 . 
+ CDS 113255 - 113371 198 ## PROTEIN SUPPORTED gi|53715443|ref|YP_101435.1| 50S ribosomal protein L36 120 52 Op 29 48/0.000 + CDS 113405 - 113785 637 ## PROTEIN SUPPORTED gi|53715442|ref|YP_101434.1| 30S ribosomal protein S13 121 52 Op 30 . + CDS 113797 - 114186 665 ## PROTEIN SUPPORTED gi|29348112|ref|NP_811615.1| 30S ribosomal protein S11 122 53 Tu 1 . - CDS 114125 - 114277 124 ## - Prom 114471 - 114530 4.5 + Prom 114188 - 114247 5.7 123 54 Op 1 26/0.000 + CDS 114306 - 114911 1027 ## PROTEIN SUPPORTED gi|53715440|ref|YP_101432.1| 30S ribosomal protein S4 124 54 Op 2 50/0.000 + CDS 114923 - 115915 1033 ## COG0202 DNA-directed RNA polymerase, alpha subunit/40 kD subunit 125 54 Op 3 . + CDS 115919 - 116404 807 ## PROTEIN SUPPORTED gi|53715438|ref|YP_101430.1| 50S ribosomal protein L17 + Term 116428 - 116475 11.1 + Prom 116469 - 116528 4.8 126 55 Tu 1 . + CDS 116573 - 116950 331 ## BF3974 hypothetical protein + Term 116983 - 117029 5.3 + Prom 116966 - 117025 3.9 127 56 Tu 1 . + CDS 117059 - 117598 585 ## BF4152 hypothetical protein + Term 117620 - 117671 12.5 128 57 Tu 1 . - CDS 117630 - 119444 1263 ## COG0249 Mismatch repair ATPase (MutS family) - Prom 119493 - 119552 2.1 + Prom 119793 - 119852 4.6 129 58 Tu 1 . + CDS 120005 - 120637 618 ## BF3971 hypothetical protein + Term 120784 - 120831 0.0 + Prom 120726 - 120785 4.2 130 59 Op 1 1/0.000 + CDS 120835 - 122820 1828 ## COG2987 Urocanate hydratase 131 59 Op 2 1/0.000 + CDS 122889 - 123791 937 ## COG3643 Glutamate formiminotransferase 132 59 Op 3 1/0.000 + CDS 123798 - 125051 1259 ## COG1228 Imidazolonepropionase and related amidohydrolases 133 59 Op 4 1/0.000 + CDS 125086 - 125715 739 ## COG3404 Methenyl tetrahydrofolate cyclohydrolase 134 59 Op 5 . + CDS 125712 - 127208 1097 ## COG2986 Histidine ammonia-lyase + Term 127287 - 127335 10.2 + Prom 127331 - 127390 5.9 135 60 Op 1 . 
+ CDS 127519 - 128163 619 ## BF3965 putative TetR transcriptional regulator 136 60 Op 2 13/0.000 + CDS 128214 - 129566 1524 ## COG1538 Outer membrane protein 137 60 Op 3 27/0.000 + CDS 129595 - 130611 1140 ## COG0845 Membrane-fusion protein + Prom 130614 - 130673 2.6 138 60 Op 4 . + CDS 130712 - 133870 3412 ## COG0841 Cation/multidrug efflux pump 139 60 Op 5 . + CDS 133889 - 134182 380 ## BF4139 hypothetical protein + Term 134208 - 134256 16.5 + Prom 134201 - 134260 1.6 140 61 Op 1 . + CDS 134299 - 135672 1020 ## BF3960 putative transmembrane protein 141 61 Op 2 . + CDS 135641 - 136555 596 ## BF4137 putative periplasmic protein 142 61 Op 3 . + CDS 136542 - 138029 834 ## COG1696 Predicted membrane protein involved in D-alanine export + Term 138080 - 138122 1.1 Predicted protein(s) >gi|226332003|gb|ACIB01000053.1| GENE 1 512 - 1042 559 176 aa, chain - ## HITS:1 COG:no KEGG:BF4330 NR:ns ## KEGG: BF4330 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 176 363 538 538 360 100.0 8e-99 MTPAEVWFLRAEAALRGWSSESVKDCYEKGVKASFAQWGAAGAEAYLESDRKPSDYVDAF KAANNVKAVNTLTPKWDDAAGNEDKLGRIITQKWLAMFPEGGEAWAEQRRTGYPRLFPVL VNQSEGTVDTNLGPRRLNFFVGIKTTNPEQYTQLVNALGGIDNCGTRLWWDTGRNF >gi|226332003|gb|ACIB01000053.1| GENE 2 1042 - 2127 963 361 aa, chain - ## HITS:1 COG:no KEGG:BF4330 NR:ns ## KEGG: BF4330 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 325 1 325 538 654 100.0 0 MKNNIKYIAGILLGGLIGFSACTDSFESFNTNEAGFDNDSKKQDFNYYGIPLGIIQQGIY FNYDWGSGKNWPFQTMQNLGADLFSGYVHDFNPFNEGKNNSTYYMMDGWNGSTWDNTYGY IMPEVQKSETINEKDNIGFYGITKILKVELMHRLSDLYGPIVYTQFGSKTGSTPDTQQEA YKAFFNDLDTGIAKIREYQKANPDIESFAKFDILMPQGKRTFSEWIRFANSLRLRLAVRI AMADSKLAVAEAQKALTDEEGLLEGNDEVVAVSTSSGYTNPFGEINKAWGEVFMNANMES LLVGYEDPRMEKYFDKATGSDATSLSTIKVLTKVSAKEQALVIKTTTDIQKVPLPNRQTP Y >gi|226332003|gb|ACIB01000053.1| GENE 3 2140 - 5460 2968 1106 aa, chain - ## HITS:1 COG:no KEGG:BF4329 NR:ns ## KEGG: BF4329 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1106 1 1106 1106 2119 99.0 0 MKITLFLLFFVAFQAYSENGYSQSTKISIPRSSLKVSELLSKIESQTEYLFVYNKKNVDT RRWVNVQADNKAVSEVLDQAFKGTNIKYVMEGNNIVLTRNNDNAITAQQDRVTVKGVVTD QNGDPIIGANVLEKGTTNGCITDMEGNFTLNVPSNATLAITYIGYQPQNIQVNGRQSFNV KLQEEAMALEQVVVTAMGIKKKEASLTYSTQQVGGDELTRAKDPNMINALAGKTAGVQIT KSSSGLGGSAKVSIRGSRSISGNNQPLYVIDGVPMLNSSNEQASTAIGGTADAGNRDGGD GISNLNPDDIESMSILKGASAAALYGSQAANGVILITTKKGKAGIQKITFSSNLTVDHAI CLPEFQNNYAMDPEAKNSWGKNQTLKDYNNADNFFQNGVTAINSVSFMNGSEKMQTYFSY ANTTAKGIVEKNKMQKHNLNFRETANFFNDYLKLDANVNLMTQTIKNRPTSGGYYMNPLV GLYGFPRGEDLSEYKNNFEVFDPARNMNVQNWYTSYQDMEQNPYWITNRINSNDKRTRAI ASLTANVKITDWLNIQARGTADYTHDQYQQRMYASTAPALAGQNGRYIDLNHTETLYYGD VMAMVNKKWNDFSLNGAIGGSINSTNVNSLRLDSKTASLYYPNVFTVANIKMTQAAYIDE QMNQKRVLQSLFGTAQLGWKESLYLDVTARNDWSSTLAYTKHSSFFYPSVGLSWVLNNTV KLPEWISFGKIRGSWSKVGNDLPLFISNTIPGSNDIIGAGGAIVTYSKAPFNDLKPEMST SYEIGTEWRFFNYRLDFDITYYRTNTKNQLFTLPSSAGADYKYYMVNAGNIQNEGVEITL GATPVMNDAFRWKTQFNFSTNKNKIIKLHPDLKTFVYGDESFSSSYSMRLVEGGSFGDIY GKAFERDENGRVAFTTDKDGDKIPNVIGGGNTEKVGNCNPDFMLGWSNTFTYKGFSLYFL IDGRFGGDVLSQTQAELDQRGVSLNSGKARDAGYINIDGTQVKPRKFYTAVSGRDGCTEY YMYDATNIRLRELSLGYSLPQSWLAKAGGAFKDVQLSFVARNLFFISKKAPFDPDAVLST GNDNQGIDVFGMPTTRSLGFNIKFTF >gi|226332003|gb|ACIB01000053.1| GENE 4 5721 - 6653 975 310 aa, chain - ## HITS:1 COG:RSc2919 KEGG:ns NR:ns ## COG: RSc2919 COG3712 # Protein_GI_number: 17547638 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Ralstonia solanacearum # 99 277 64 237 274 68 28.0 2e-11 MEQELLYKYFKGTTSEEEERLILDWVDASPENRKAFQKERMLYDIALFTDEKQMNRKDRK ARIIPMLRWSARIAAVVIVAISFGFLFKNYQYEKSACQQTITVPAGQRAQITLADGTKVW LNSKSTLTYASNFGRKERNVELDGEAYFEVAKNKKIPFFVNTEINRVKVVGTHFNVCAYK GSNEFETTLIEGIVDIYPIGSDQVITRLTKDEFFGSYNGKYKKTTLPSYEYLRWKEGLYC FDDAPFNSLLNKLEKYYNVNISVRNLNILNYRCTGKFKEQDGIEHILKVIQKDHKFTYSI NEEKDSIIIE >gi|226332003|gb|ACIB01000053.1| GENE 5 6754 - 7314 508 186 aa, chain - ## HITS:1 
COG:no KEGG:BF4327 NR:ns ## KEGG: BF4327 # Name: not_defined # Def: putative RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 186 1 186 186 317 97.0 1e-85 MPSNNSLHTMDLFSQFFQENQKKFLSFAYSYTRNKAAAEDILMEAMVSLWENREKWEKDS NLHALLLTIIKNKSLNYLEHEQVRMKAEEAINTHKQRELDLRISTLEACEPATIFDTEIQ RIVYKTLEQLPEQSRHIFILSRYHNTPNKKIAEQLGISIKSVEFHITKTLKLLRLELKDY LISLLF >gi|226332003|gb|ACIB01000053.1| GENE 6 7478 - 9604 1542 708 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|62291006|ref|YP_222799.1| polynucleotide phosphorylase/polyadenylase [Brucella abortus bv. 1 str. 9-941] # 1 701 1 694 714 598 48 1e-170 MINPIVKTIELGDGRTITLETGKLAKQADGSVMLRMGNTMLLATVCAAKDAVPGTDFMPL QVEYKEKFAAFGRFPGGFTKREGRASDYEILTCRLVDRALRPLFPDNYHAEVYVNIILFS ADGVDMPDALAGLAASAALAVSDIPFNGPISEVRVARIDGKFVINPTFDQLEQADMDIMV AATYENIMMVEGEMSEVSEAELLEAMKVAHEAIKVHCKAQMELTEMVGKTVKREYCHEEN DEELRKAVHDACYDKSYAIAASGNRNKHERQDAFDAIRDEFKAQFSEEELEEKGALIDRY YHDVEKEAMRRCILDEGKRLDGRKTTEIRPIWCEVGYLPGPHGSAIFTRGETQSLTSVTL GTKLDEKIIDDVLAHGKERFLLHYNFPPFSTGEAKAQRGVGRREIGHGNLAHRALKRMIP EDYPYVVRVVSDILESNGSSSMATVCAGTLALMDAGVKIKKPVSGIAMGLIKNAGEEKYA VLSDILGDEDHLGDMDFKVTGTKDGITATQMDIKVDGLSYEILERALNQAKEGRMHILGK IEETISEPRTELKDHAPRIETMTIPKEFIGAVIGPGGKIIQGMQEETGATITIEEIDNVG RIEISGTNKKSIDDAIRLIKGIVAVPEVGEVYKGKVRSIMPYGAFVEFLPGKDGLLHISE IDWKRLETVEEAGIKEGDEIEVKLIDIDPKTGKFKLSRKVLLPRPEKK >gi|226332003|gb|ACIB01000053.1| GENE 7 9801 - 10952 1099 383 aa, chain + ## HITS:1 COG:no KEGG:BF4325 NR:ns ## KEGG: BF4325 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 383 1 383 383 748 100.0 0 MKKVTLVALVALALSSCNSDPKFNVKGDVSGADGKMLYLEASGLEGIVPLDSIKLKGDGS FSFKQLRPESPEFYRLRVEDKVINFSVDSTETVSIQAPYTDFSTAYTVEGSENSAKIKEL TLKQVRLQKDVDALVKAAQAHQLGNDVFEDSLAVLLKNYKDDVKINYIFAAPNTASAYFA LFQKLNNYMIFDPLNNKDDIKCFGAVATSLNNTYPHAVRSKNLYNIVIKGMKNTRTPQQK TIEIPEEKIAETGVIDIALRDMKGNIRKLTDLKGKVVLLDFTVYQSAVAAPHNLMLRDLY NKYASQGLEIYQVSLDADEHFWKTSADNLPWVCVRDENGIYSTNAALYGVQNLPAFFLIN RNNELRARGETVKDLEGTIKSML 
>gi|226332003|gb|ACIB01000053.1| GENE 8 11133 - 11597 641 154 aa, chain + ## HITS:1 COG:mll2568 KEGG:ns NR:ns ## COG: mll2568 COG0782 # Protein_GI_number: 13472314 # Func_class: K Transcription # Function: Transcription elongation factor # Organism: Mesorhizobium loti # 4 154 6 156 157 117 47.0 6e-27 MAYMSEEGYKKLMAELKELETVERPKISAAIAEARDKGDLSENAEYDAAKEAQGMLEMKI NKLKAVIADAKIIDESKLKTDSVQILNKVELKNTKNGMKMTYTIVSESEANLKEGKISVS TPIAQGLLGKKVGDVAEIQVPQGKISLEVVNISF >gi|226332003|gb|ACIB01000053.1| GENE 9 11648 - 12052 416 134 aa, chain + ## HITS:1 COG:Cgl2549 KEGG:ns NR:ns ## COG: Cgl2549 COG0537 # Protein_GI_number: 19553799 # Func_class: F Nucleotide transport and metabolism; G Carbohydrate transport and metabolism; R General function prediction only # Function: Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases # Organism: Corynebacterium glutamicum # 1 117 1 117 136 88 41.0 3e-18 MATIFSRIIAGEIPCYKVAENEKFFAFLDINPLVKGHTLVVPKQEVDYIFDLSDEDLAAM HVFAKKIARAIEKAFPCKKVGEAVIGLEVPHAHIHLIPIQKESDMLFSNPKLKLADDEFV SIAKAISSAYETAK >gi|226332003|gb|ACIB01000053.1| GENE 10 12221 - 13303 844 360 aa, chain - ## HITS:1 COG:CAC3072 KEGG:ns NR:ns ## COG: CAC3072 COG0836 # Protein_GI_number: 15896323 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Mannose-1-phosphate guanylyltransferase # Organism: Clostridium acetobutylicum # 9 340 5 337 350 250 41.0 2e-66 MTSKDNYCVIMGGGIGSRFWPFSRKTMPKQFLDFFGTGRSLLQQTFDRFNKIIPTENILI VTNAIYADLVKEQLPELDPKQILLEPARRNTAPCIAWASYHIRALNPNANIVVAPSDHLI LKEGEFLAAIEKGLDFVSKSDKLLTLGIKPNRPETGYGYIQIAEQEGDNFYKVKTFTEKP ELELAKVFVESGEFYWNSGLFMWNVNTIIKAGETLLPELASKLAPGREIYGTPEEKDFIE ENFPACPNVSIDFGIMEKADNVYVSLGDFGWSDLGTWGSLYDLSPKDEQRNVTLKCDSLI YNSNDNIVVLPKGKLAVIEGLEGFLVAESDNVLLICKKDEEHAIRKYVNDAQMKLGEDYI >gi|226332003|gb|ACIB01000053.1| GENE 11 13720 - 13899 150 59 aa, chain + ## HITS:1 COG:no KEGG:BF4315 NR:ns ## KEGG: BF4315 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 59 1 59 59 97 98.0 
1e-19 MEQGVWQEIEQLYQKFQKLGINEAVDYDKYYLFLLLPIPQPLKAQRLRNLIRSFFLTRE >gi|226332003|gb|ACIB01000053.1| GENE 12 13932 - 14204 211 90 aa, chain + ## HITS:1 COG:no KEGG:BF4314 NR:ns ## KEGG: BF4314 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 90 5 94 202 179 96.0 2e-44 MNEDLKQAYELAKTESSSLVQITPALLQRLNATLMRTTGSVRSVMGGSFDSSNGDFPLCG VTAGVGGHAYMNYLKVPAKVDELCAILQAK >gi|226332003|gb|ACIB01000053.1| GENE 13 14211 - 14528 151 105 aa, chain + ## HITS:1 COG:no KEGG:BF4314 NR:ns ## KEGG: BF4314 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 105 98 202 202 207 95.0 8e-53 MGTFREQYELSFNAYLNLVTIHPWVDGNGRMARLLMNYIQFCYHLFPTKIFKEDREEYIL SLRQCQDEETNQVFLDFMARQLKKSLSLEIERFNASQKRGFSFMF >gi|226332003|gb|ACIB01000053.1| GENE 14 14631 - 15185 407 184 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229873878|ref|ZP_04493445.1| acetyltransferase, ribosomal protein N-acetylase [Spirosoma linguale DSM 74] # 15 184 16 185 185 161 44 2e-38 MVNELNKEMDVDKYKIRSWSKDDFSTLAKYLNNKKIWDNCRDSLPYPYSENDAQQFILSV SSQNEQNNYCIEVNQEAAGNISFARGIDVERYNAELGYWLAEPYWGKGIMTQMLALAISS YFHHTDVMRICANVYAGNIASMRVLEKIGFRKCGIHRNACFKNGVFTDCHYFELLKEEFR NLVK >gi|226332003|gb|ACIB01000053.1| GENE 15 15210 - 15665 235 151 aa, chain - ## HITS:1 COG:no KEGG:BF4312 NR:ns ## KEGG: BF4312 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 151 8 158 158 283 100.0 2e-75 MNKVIVTFLLLIGIVASSCTTSKSVISQKADLSRYEYASIINNDTYHIPAQLMEYEIQLF DAIESSRLKLVNDIRIGELSPNQQSKLLMVKYGVDILEEESVVTVNFIDYLTGRPIASCR GAYTTLGFSVSADIRGAIKRVAKQIAETFPK >gi|226332003|gb|ACIB01000053.1| GENE 16 15670 - 16884 867 404 aa, chain - ## HITS:1 COG:no KEGG:BF4311 NR:ns ## KEGG: BF4311 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 404 6 409 409 809 100.0 0 MPFMTLVCLLFTACNKDDILPGGPMLWTYEILTPESVEYEGGTVGWIPKECFKANGNEGY IVMTCKNFDMLNPISGGSYTYDCGWATLKVEANQLKIHFPRQVSEAPDAYEEITISTNDG 
KRTASTIICLSRTFKDEGQPDPEPEPLPEEAKFKMKKAYFTPFMHLDTQFPAPLDLVTFR ITDINDNYTPLGFPEFTQYYDSIVWSAEGFPHTFRVYESNTTEGGMETHLATEWSSHFFK SGTIKNYLKGYRKGKVEYETSLAVRLYERDFLGIEWGTIVLQSPQNLTTYCLLDTDYEYQ VYDIVAKDYNPFSKIIPVNHKQLSDSDFPAAAQKAIKTLMENNIGEGQNAGGKENLFKCL PEEGVKAELYWENKTTRILMLHQLSTDPDDLTQEKYYLHVEPKQ >gi|226332003|gb|ACIB01000053.1| GENE 17 16963 - 17160 81 65 aa, chain - ## HITS:1 COG:no KEGG:BF4117 NR:ns ## KEGG: BF4117 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 65 1 65 65 117 100.0 1e-25 MYSDKGNSFQDIREKKKRYVYLVGENVGNFQTDVRDSIVVLTHIAWTVLVTSASKVSYEL HLDVA >gi|226332003|gb|ACIB01000053.1| GENE 18 17163 - 17345 167 60 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MTDLLAIIQNGNGNIKLEINDEDMAAMQTLAYDSLRKNPEKVEDSSYNSVLYNTKDQLNS >gi|226332003|gb|ACIB01000053.1| GENE 19 17554 - 17868 130 104 aa, chain + ## HITS:1 COG:no KEGG:BF4309 NR:ns ## KEGG: BF4309 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 104 1 104 104 164 97.0 7e-40 MVFANVFYQLFAVSKWRNEIFVCFLNTSFMNDFLFIKLVVQAFLKVLMGHFRMKQYSTNA INNRAISVSVAWVFNRVPIFSIRMSCLSIRSSGSVRGSALRHAV >gi|226332003|gb|ACIB01000053.1| GENE 20 18195 - 18686 406 163 aa, chain + ## HITS:1 COG:MA0407 KEGG:ns NR:ns ## COG: MA0407 COG0716 # Protein_GI_number: 20089301 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Methanosarcina acetivorans str.C2A # 3 161 4 175 179 97 34.0 8e-21 MIMNDRKILVAYFSCSGVTKAVAEKLAAITGADLYEIKPEVPYTEADLDWNDKKSRSSVE MRDALSRPAISGTLFHPEKYEVLFVGFPVWWYIAPTIINTFLESYDFAGKIVVPFATSGG SGIGNCEKNLHKAYPDIVWKDGKLLNGRITRDLVTEWFEKIRL >gi|226332003|gb|ACIB01000053.1| GENE 21 18707 - 20467 1063 586 aa, chain + ## HITS:1 COG:CAP0106 KEGG:ns NR:ns ## COG: CAP0106 COG1154 # Protein_GI_number: 15004809 # Func_class: H Coenzyme transport and metabolism; I Lipid transport and metabolism # Function: Deoxyxylulose-5-phosphate synthase # Organism: Clostridium acetobutylicum # 1 582 1 586 586 773 64.0 0 
MYIENINLPKDIKKLSVEQLSVLAEEVRTALIRKLSEHGGHIGPNLGMVETTIALHYVFN SPIDKIVFDVSHQSYVHKMLTGRMAAFLDPAKYDDVTGYTNPDESEHDFFTIGHTSTSVS LAMGLAKGRDLTGGKENIIAVIGDGSLSGGEAFEGLNNAAMLGSNMIIIVNDNDQSIAEN HGGLYKGLKELRDTNGESPDNIFKAMGLEYYYLGDGHDVSALIKLFTSVKDIDRAVVLHI HTIKGKGLKYAEENKEYWHAGGPFHIEDGSPKGPGWPVNETVRESVLDLIEKRSDIVAIT AGTPSVIGFTEDYRKRAGKQFVDVGIAEEHAVAMASGIARNGGTPIFGVFSPFLQRTYDQ LSSDLCLNNNPAVIMVFMASVYGMNSNTHLGIYDIPMISHIPNLVYLAPTSKEEYLAMFK YATTQKAHPIAIRIPMMMPETGIEDTTDYSLLNKYQVVRKGSGVAIIALGDFFELGVQIA DKYKILTGNDVTLINPKFITGIDEELLECLKTDHKLVLTLEDGIVEGGFGQTIASFYGLS DMKVKNYGIKKSFPTDFRPEELMRENGLSVEQIVEDIKSVCREHVM >gi|226332003|gb|ACIB01000053.1| GENE 22 20641 - 21537 417 298 aa, chain - ## HITS:1 COG:BH3443 KEGG:ns NR:ns ## COG: BH3443 COG2207 # Protein_GI_number: 15616005 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Bacillus halodurans # 208 294 112 198 207 60 37.0 4e-09 MKERILNIETVHQCNCCLGCKTLHPLVSVIDLSKSNLEQQIIKFDFYTILMMEGEIDGVL YGRKYYDYSNASLVFLTPGESIKINKSKALPSKGWLLAFHPDLISQTSLGEHIKDYSFFF YNPEEALHLSQREKAKAVECICNIEEELRHAIDCHSQILISRYIELLLDHCNRFYERQFI TRCEANKKIMKKTDVLLKDYILSGKLKYNTSPSLGYCAKILQLSSHYFNDLLKFESGKNI DEYFESKRLEMAKSMLLDSNNTVSVVTEKLGYPNIRYFSRLFKRITGVAPNNYRLSQN >gi|226332003|gb|ACIB01000053.1| GENE 23 21909 - 22262 364 117 aa, chain + ## HITS:1 COG:no KEGG:BF4112 NR:ns ## KEGG: BF4112 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 117 1 117 117 229 99.0 3e-59 MKNYQKMSVAQDARVELHDSLALTGAEVSINHLPAGAGVPFVHSHKQNEEIYGILSGKGF ITIDGEKIELQAGDWLRIAPDGKRQISAASDSPIGFICIQVKAGSLEGYTMTDGVVQ >gi|226332003|gb|ACIB01000053.1| GENE 24 22604 - 23122 349 172 aa, chain + ## HITS:1 COG:VCA1017 KEGG:ns NR:ns ## COG: VCA1017 COG0350 # Protein_GI_number: 15601770 # Func_class: L Replication, recombination and repair # Function: Methylated DNA-protein cysteine methyltransferase # Organism: Vibrio cholerae # 10 166 7 154 157 140 47.0 8e-34 
MENVIQIQYYQSPCGELILGAYREKLCLCDWKIEERRIIIDRRIQKELQASYKEGISEVI TRTIGQLDEYFAGRRTTFDIPLLLVGTDFQKTVWNELLNIPYGKTISYAGLSQKLGNPKA IRAIASANGANPISILVPCHRVIGSDRKLVGYGGGLPAKKILLDLESSDRLF >gi|226332003|gb|ACIB01000053.1| GENE 25 23179 - 24015 400 278 aa, chain - ## HITS:1 COG:no KEGG:BF4110 NR:ns ## KEGG: BF4110 # Name: not_defined # Def: transcriptional regulator GerE # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 278 1 278 278 551 99.0 1e-155 MFYDYLCKGKSLYLSTQCLLITISFPDMDIVDKLNKEFLTQPFCKNEQLPEELNEYKRIA YNYARIENSIAVLSDMHTNISYIYYGGTAETLGIARKGDNQNLESIWEKEVFKYIHPDDL AEKYVQELRFYHFLKQIPHKKRADYFLMSKLRMRDPSGKYIPILHRMFYVATHSNDSMWL ALCLYNLSVDPTMSCRVINSTNGQVIELEKQDCSRLLSDREKTILQLIDMGKTSHEIARE LFISKNTVSRHRQNILEKLQVKNSIEACRIAKELKLLF >gi|226332003|gb|ACIB01000053.1| GENE 26 24024 - 24611 536 195 aa, chain + ## HITS:1 COG:MA0513 KEGG:ns NR:ns ## COG: MA0513 COG0110 # Protein_GI_number: 20089402 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Methanosarcina acetivorans str.C2A # 3 190 10 197 199 288 71.0 5e-78 MESEKEKMGTGRLYDANYDTELIAERRACKELCYTLNHLPPSQIAEREAIIRRLFCKTKE RFLLEQPFYCDYGYNIEIGENFYANMNCVILDEAKVTFGDNVFIAPSCGFYTAGHPLDVE QRNRGLEYARPIRVGNNVWIGAQVCVLPGVTIGDNTVIGAGSVVNRDIPANVIAAGNPCR VIREITEEDKTKYLL >gi|226332003|gb|ACIB01000053.1| GENE 27 24636 - 25286 627 216 aa, chain + ## HITS:1 COG:all7165 KEGG:ns NR:ns ## COG: all7165 COG3506 # Protein_GI_number: 17233181 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Nostoc sp. 
PCC 7120 # 25 199 1 175 183 142 38.0 4e-34 MKKVCLSLLMGLMVQMTFGQTLEKMQWFNEPEQWEIKNNVLSMSVTPQSDYWRISHYGFT VDDAPFYYATYGGEFEAKVKVVGEYKERFDQAGLMLRIDHENYIKAGIEFVDGKFNLSTV VTHKTSDWSVITLDKTVPYIWIKAVRRLDAVEIFYSFDDKTYTLMRNAWLQDHIPVKVGL MAACPDGSGFNAKFEYFQVKHLPDQRRVEWLKKNAE >gi|226332003|gb|ACIB01000053.1| GENE 28 25378 - 25890 429 170 aa, chain - ## HITS:1 COG:no KEGG:BF4107 NR:ns ## KEGG: BF4107 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 170 1 170 170 286 100.0 2e-76 METTSIKLYSLNYNDTKTYLATLLFVVGNMALPQLFHLIPQGGITWLPIYFFTLIGAYKY GWKVGLLTALLSPVLNSLLFGMPQPVILPAILLKSTLLAIAAGYAAHRYKRISIPILLLV VLSYQVVGTLGEWILVNDFFSAVQDFRIGLPGMALQIFGGYLFISRLIYK >gi|226332003|gb|ACIB01000053.1| GENE 29 25908 - 28208 1613 766 aa, chain - ## HITS:1 COG:VC0475 KEGG:ns NR:ns ## COG: VC0475 COG4771 # Protein_GI_number: 15640502 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Vibrio cholerae # 174 678 33 558 652 105 26.0 5e-22 MMEKIIDLLHSGGYSCVIGNGTEIRTFTQRGVADLYDLFRQDPSFMKGAGIADKVIGKAA AGLMVLGGIRQVYADVISQPALALLHNANIKVSYVRLVPFIENRDKSGWCPLETACYGIE SIQEIFRIIENFLSKIRMKKNLLGILLVCAFLSSSLQAQVRKDTTQAGHNYEIDDVVVTG TRNETDIRHLPMTISVVNRETIGQRSEPSLLPTLTEQVPGLFTTSRGVMGYGVSNGAAGG ISLRGIGGSPTTGLLVLIDGHPQYMGLMGHPIADAYQSMLAEKVEVVRGPASVLYGSNAM GGVINIVTRRQREDGVRTGVRLGYGSYDTWMTEATNQVKKGRFNSIITGSYNRTNGHRPN MEFEQYGGYTRLGYELDHSWNVSADLNLTHFNASNPGTVSTPVFDNDSRITRGMTSFALE NRYERTSGALKFFYNWDKHHINDGYGTGEEPLDYRFKSKDRMMGISWYQSASLFSGNRLT AGIDYMQFGGEAWNRFIADKHKEGISDKSENEIAGYLDFRQAIGSYLTMDAGLRIDHHTV TGTEWIPQVGLSVQLPQNASLKAMASKGFRNPTIRELYMFRPANPDLLPERLWNYELSYS QRLLKGTFYYGVNLFYINGDNMIQTIRTDGRPLNVNTGKVENWGAEADIAYHIHPMWRLT ANYSWLHMEHPLIAAPEHKLYTGIDFTQKKWSFSTGIQYVTGLYTTVDPQEKKENFLLWN LRGSYRICSIADLFVKGENLLAQRYEINAGYPMPKATCMGGININF >gi|226332003|gb|ACIB01000053.1| GENE 30 28205 - 29587 1025 460 aa, chain - ## HITS:1 COG:TM1183 KEGG:ns NR:ns ## COG: TM1183 COG1453 # Protein_GI_number: 
15643939 # Func_class: R General function prediction only # Function: Predicted oxidoreductases of the aldo/keto reductase family # Organism: Thermotoga maritima # 52 456 1 379 379 234 35.0 2e-61 MEKQNNHIDRRGFLKIVGISAATTTAALYGCGSGTKSSQGRNASSPVPTDQMTYRSVGGI KDKVSLLGYGCMRWPTVPSPEGKGDLINQEAVNELVDYAIAHGVNYFDTSPVYVQGWSEK ATGIALKRHPREKLYIATKLSNFSNFSRENSLAMYHQSFKDMQVEYFDYYLLHAIGGGGM KVFNERYIDNGMLDFLLKEREAGRIRHLGWSFHGDVEVFDQVLAMHDTVKWDFVQIQLNY VDWRHATGNNVNAEYLYGELAKRNIAAVIMEPLLGGRLSNVPEHIVGRLKQRRPEDSVAS WAFRFAGSPELVLTVLSGMTYMEHLQDNIRTYSPLVPLTDDDKEYLEETAQLMMQYPTIP CNDCKYCMPCPYGIDIPAILVHYNKCVNEGNIPQSQSSENYKEARRAFLVGYDRSVPKLR QASHCIGCNQCTPHCPQSIHIPEELHRIDRFVEQLKQGTL >gi|226332003|gb|ACIB01000053.1| GENE 31 29615 - 31120 816 501 aa, chain - ## HITS:1 COG:napG KEGG:ns NR:ns ## COG: napG COG1145 # Protein_GI_number: 16130142 # Func_class: C Energy production and conversion # Function: Ferredoxin # Organism: Escherichia coli K12 # 318 482 20 196 231 70 30.0 1e-11 MLRKIRLTCGIICLTLITLLFLDFTGTLHGWFGWLAKIQFLPAVLALNVGVVVLLIILTG VFGRIYCSVICPLGVFQDVAAWIGKKRKKLPYSYSPALSLLRYGALAIFIITLVAGVSFI ATLFAPYSAYGRIANNLFQPIWLWGNNLFAHLAERAGSYAFYEVDIWIKSLPTFIVAAAT FVILILLAWRNGRTYCNTICPVGTVLGFLSRYSLFRITIDTEKCNKCGLCARHCKAACIN AKEHTIDYSRCVVCMDCLGKCKQKALSYQLRTTKARPAKAEENAHAASSKEVNEARRNFL TVTAMAATASALKAQEKKVDGGLAAIEDKKIPNRQTPITPPGSLSARNMAAHCTACQLCV SACSNQVLRPSTNLMNLMQPEISYERGYCRPECNDCSQVCPTGAIHPITAADKSSTQIGH AVWIKANCVSLTDGVKCDNCARHCPTGAIQMIVAEPEKETSPQIPAINTERCIGCGACEN LCPARPFSAIYVEGHERHRII >gi|226332003|gb|ACIB01000053.1| GENE 32 31144 - 31899 509 251 aa, chain - ## HITS:1 COG:BS_lytT KEGG:ns NR:ns ## COG: BS_lytT COG3279 # Protein_GI_number: 16079944 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Bacillus subtilis # 1 247 2 239 241 93 30.0 4e-19 MKVLIVEDETAAYENLTDILTEITPDIRIMANTESVTQTVGWLQSNPAPDLIFMDIHLSD GSAFAIFDRIELETPIIFTTAYDRYAIEAFKVNSIDYLLKPVKVEDVEHALEKYSKLTRQ 
DLLQYLSQLTLLKPAPRYKDKLLIAHKDKLLPVNIKNISYFYATGKNTYVCLKDGNRYPY SKTLEQIVSSLNPEDFIRANKQFIVARNSVTDITIWFDNRLLITLDTEVPERIYVSKNKA SEFKTWLVNDK >gi|226332003|gb|ACIB01000053.1| GENE 33 31908 - 32930 630 340 aa, chain - ## HITS:1 COG:SMb21546 KEGG:ns NR:ns ## COG: SMb21546 COG3275 # Protein_GI_number: 16264735 # Func_class: T Signal transduction mechanisms # Function: Putative regulator of cell autolysis # Organism: Sinorhizobium meliloti # 113 318 146 353 383 80 26.0 5e-15 MDSFDKYISRNLPLVSLISAVFIIYPNIACTPWELNSLEPSEYLGFFSYFIYRFLFFWGM IGFLINYNLRQIPTALFRKRLTHNFLFALTGYLFFASVSYTISSHGFHTDFLGSTLISQF FTLCFLCTLVGYISMLYTRQREKEQEIERLRFENLQSRCNALANQVNPHFFFNSLNGISS LIRKKNDENTLTYVNQLSDIFRYILQSDRKGVVTLREELEFIQSFRYVMEVRFANKLSFT IDVDEAQKDVLTLPVLSLLPLVDNVTVHNRIDSEHKMDISIRLNEQYELVVSNPIYPKLS PPDTNGTGLSNLENRFNLLMNKQIRIETDEKVFRVYLPLI >gi|226332003|gb|ACIB01000053.1| GENE 34 33030 - 33122 76 30 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKEFMLIASLVLSFCILILCRDYIVFMLKK >gi|226332003|gb|ACIB01000053.1| GENE 35 33202 - 33759 531 185 aa, chain - ## HITS:1 COG:CC3650 KEGG:ns NR:ns ## COG: CC3650 COG0494 # Protein_GI_number: 16127880 # Func_class: L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases including oxidative damage repair enzymes # Organism: Caulobacter vibrioides # 5 180 3 177 187 107 33.0 9e-24 MEKKDEVWQVVSSKYLFRRPWLTVRCDDMLLPNGNHIPEYYILEYPDWVNTIAITKEGKF VFVRQFRPGIGKQLYELCAGVCEKEDASPLVSAQRELLEETGYGKGNWKEYMVISANPST HTNLTHCFLATDVEQIDTQHLEDTEALTVHLLSLEEVKELLENGQIMQSLHAAPLWKYMA EHKQI >gi|226332003|gb|ACIB01000053.1| GENE 36 33951 - 36482 2022 843 aa, chain + ## HITS:1 COG:no KEGG:BF4091 NR:ns ## KEGG: BF4091 # Name: not_defined # Def: putative O-antigen related protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 842 1 842 844 1771 98.0 0 MKNAIVSLLLLLMVTQYVTAQKKVIKIACIGNSITYGVGTRNPAKDSYPAVLGQMLGDGY EVRNFGVSARTMLMKGDHPYMKEERYRQALAYNPDIVTIKLGTNDTKPQNWRYKSDFKKD METMIRTIRALPSKPEIYLCYPIPAYAVQWGINDSTIVHGVMPVIDQLAAKYQLKVIDLH 
TPLTGMKECFADHVHPNEKAAACIARVIYRQLTGKEAPEHVSQPFPGHKSKWQGFDQYTF TYQDRQAIVVCPERAAAGNPWIWRPAFFGAFASVDEALLKRGFHVVYYDLTHLYGSPRAR KSGTDFYWNMVQMYGLSPRVTLEGFSRGGLFAYNWAADHPDKVACIYVDAPVCDVFSWPG RSSGNAGLWKGMLDEWGLTEARMNTFPGNPIDRLKPLVDARIPVICVCGDSDRVVPFSEN SAVVRQRYTAMGAPFELILKPGGDHHPHSLENPTPVVDFIVRHQAGYEAGQCYTLRGNYQ NSYRKFEKERVGTVAFLGGSITEMKGWRDMICEDLKQRFPYTKFTFVAAGIPSTGSTPGA FRLTDDVLSKGKVDLLFVEAAVNDDTNGFSAIEQVRGMEGIVRHALVSNPSMDIMMLHFI YDPFIPKLDKGQMPDVILNHERVANHYLLPSVNLASEIAARMRSGEFTWEQFGGTHPNLL GHAYYAATINKVLDEMYAPCATAKDAAKPHALPAVPLDAYSYTNGRLVDIRQAHIGKGWQ LVAPWTPRLAAETRPGFVDVPMLETNRPGAKLTLDFEGTAVGIFCVSGPAAGILEYSVDG GPFKKLDTFTAWSGGLYIPWVYMFDTELPMGKHRLTLRMSKDHHPQSKGTSCQIRQFVVN ESF >gi|226332003|gb|ACIB01000053.1| GENE 37 36804 - 37289 212 161 aa, chain + ## HITS:1 COG:no KEGG:BF4290 NR:ns ## KEGG: BF4290 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 161 1 161 163 289 100.0 2e-77 MANWITLRQAAEKYETTIKDILSWSELPEITFTYINHTLLLNDDSVCYFIESHSLPAEPG KEYGCAVTFENIEIIAKQYVNALEKISLLEKQIELSNECISEQSKLFESIEESLTGMLSS MEDTDVECTELIKEEPKFGSFMKRILLSYEKFISIFHLPPF >gi|226332003|gb|ACIB01000053.1| GENE 38 37381 - 38001 528 206 aa, chain - ## HITS:1 COG:no KEGG:BF4289 NR:ns ## KEGG: BF4289 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 206 1 206 206 388 100.0 1e-107 MAARYDFREVPSKKGNGEEPKLYPHIVSGGTIRTRTILEEISEASTFTVGDLEGVLCALT EKISKYLVDGYHVELGKIGYFSASLKATHPVKDKKEIRAQSIYFDNVNFRASMWFRRHTS GYVERAGKYGSRSSSQLSEEEKLFRLERYLNENAFITRSDYTRLTGRLKNKALEDLNHFV SQGVIERRGRGSLLVFIKKTPSKEEI >gi|226332003|gb|ACIB01000053.1| GENE 39 38332 - 40413 1474 693 aa, chain + ## HITS:1 COG:L109011 KEGG:ns NR:ns ## COG: L109011 COG5545 # Protein_GI_number: 15672499 # Func_class: R General function prediction only # Function: Predicted P-loop ATPase and inactivated derivatives # Organism: Lactococcus lactis # 394 631 142 382 480 67 24.0 7e-11 MKITLIRKDETLIVQRTVKLETVLENMKTETKTSPVTALRRALKFAGGSGHIMAVDKLPR 
IYFSVEMGRKEEVSVMKAYHGLILLNVRGLAGFDEAVRLRDKLAGLPQTMITFVGSSGRS VKILVPFLRPDGSLPATVEEARLFHAHAYQWAVNFYRGQLLPEHRDITLENPVPEQSCRY SFDPGLFFNPDVYPIRLEQPLSMPEEVTYRETVEAETDPLQRMMPGYERSEIISTLFETS LYEALNAVDCDREEGSKLLIVELTRNCFRSGIPEEEVVGWTLIHFRGKVPEILVRETVHN VYTVEKRFGSNPCISPKQTMAVRTDEFMGRRYEFRYNTLAGVVEYRERKTFCFNFRPVTD RVLNTIAVNAISEGLELWDRDVKRWINSERVPIYSPITDFLYSLPRWDGKDRIRALASYV PCDNSHWPDFFHRWFLSMVAHWKGNDKQYANSVSPLLIGAQGCGKSTYCRNLLPPDLRAY YTDSIDFSRKRDAELYLNRFMLINMDEFDQVSANHQGFLKHILQKPVLNVRKPNESAVSE LKRYASFIATSNHSDLLSDPSGSRRFICINITGKIRNDAVINYPQLYAQAVQELRDGERS HFSSEEEAILVENNQEFELQTPVEQLFQQYFRSAREGEKCETLLAVDILGRIQKKSGFKL SATKIVHFGRILRKLGVPCKKMKNGNFYCVVEL >gi|226332003|gb|ACIB01000053.1| GENE 40 40669 - 41682 591 337 aa, chain + ## HITS:1 COG:no KEGG:BF4287 NR:ns ## KEGG: BF4287 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 337 1 337 337 710 99.0 0 MRFINVHTHVFTFNHVPLKFIPFMNVILWIHKYAPELLNKVLPEKIAAFLERGTRCDQLE ILRELTDYYPSDACFAIHTIDFEYMEAGKCRKGYGFMEQLDEVARIVASPEWKERIFPFI CVDPRRPDIAEIVKDYIENKGFCGVKLYPALGYYPQDERLDELWDWIEQKKIPVMVHCSK DGAVYNKKMGTQYCNRFSDPANFLSILEKHPDVKVCFAHFGGDKECIRFYKDGDNQKENW FACITQLIRKYKGVYADMSYSGGNSDLMALFHVYAQDVYDVNKKSSYQIGDKMLFGSDYY MAHLSRNERWFSINIRSCMGETTFWKLMKNAEEYLWG >gi|226332003|gb|ACIB01000053.1| GENE 41 41715 - 43457 1050 580 aa, chain + ## HITS:1 COG:no KEGG:BF4286 NR:ns ## KEGG: BF4286 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 580 1 580 580 1168 100.0 0 MEKNMLPKDEEKVLEINDVASCFRKFKEVQKGMEKLEADPKPYVPGKFIFRGVSDEKYHV LSSAGRRLKELNKQNDFIRYHVNLVSNARKMGYGKLDLQTELSDLEILAEIQHLGGATCL TDFTTNFLIALWFATSPKEDADGKLVWLDLGQPTNFRLINYCNGEDEKSSIQKLLKGLDF SLQTRNQNKAEPCFWLWEPPCLNSRIMKQNSVFLFGLRAFPTVSDPKDDNLKFGTILIPN EKKKEIREELEHFFGISAETVFHDFSGYSLNANDVKVPVSENILSTRNCVAAAKENIKKR EYSLAVNYLDEAIECLRCRNKDNYECKRKINGKKFCCYTLGEIAFWRGRANADRGYIDEA ILNYYEVVKHLLVNNENHGLIYDAYRELIYLYYDKDDFKSAMKMAECLWKLYLEHRDWEN NGHDVIFYLLELSILTCQEKLFDHYVKEGEHLKDKFVATNGSFLWVYFESIGKAIFSKDI 
SSLKGSCEEIESIIVAILNDDDHGKNNLIGHYYWDFEDMINWIKSDKNNIDKHCEELINS NIDKLSLFTEKANEAQSRLVNHVFANSVAFSEEIITSVTE >gi|226332003|gb|ACIB01000053.1| GENE 42 43562 - 43870 230 102 aa, chain - ## HITS:1 COG:no KEGG:BF4089 NR:ns ## KEGG: BF4089 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 102 611 712 712 216 97.0 3e-55 MCHGGKKGDYIEFEFSGFEDHEYSLNLFCTKAADYGNIKFYVNRQENGKQLDCYSQEVEA TGAINLGMHKPIDGKFILRIELTGQNALSTSTLFGLDCIRIE >gi|226332003|gb|ACIB01000053.1| GENE 43 43901 - 44065 84 54 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSAEPLSVITQPFRFGGWLDLAEVDSIGYATDGGLGGVGSKPASISSIPEEPAA >gi|226332003|gb|ACIB01000053.1| GENE 44 43984 - 45699 848 571 aa, chain - ## HITS:1 COG:no KEGG:BF4089 NR:ns ## KEGG: BF4089 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 568 1 568 712 1170 98.0 0 MNTKSKQPLLSFPKPKFTTGMIRLSITLLFSTLCIGSFGQKKSSITVETLLEEMTSYDEM TRYPALPYRSMQQSSYDRRSVSPDRPGWFANDDGEGFIRLEEHNGRKEKVLFEDEGPGAI TRIWLTTFGSIHTILRFYFDGKDEPGWVVPSYDLQKFGVRGLKKGLIEPDDKWIRGSLIY LPITYADGCKVTMEELTPERTNRHFLFNYRKYPTGTPVETFSRKVAERIPQSVEKTSATL YRNIDKGFDPQARYGKGALIHRQNLSLNKGEKQQLNLHKGKRAISLLQFNVKTDPDLKPG TDDFARLMRSLIISISFDGQQTVWAPLSDFAGSGMGAFASRSFFFYSDGKGIVCSKWLMP YRQNCEISVLNLSPYKTDVHIDIVSQPYKWDNRSLYFHTVWKQERGLPVVTWMEHEKCMD WNFTTISGRGVYRGDLLSLFNHTTEWYGEGDEKITVDHEAFPSHFGTGTEDYYSFDGYFK SQTPFAGQPRQDMKDFYGYNSFFRVRCLDGIPFNQQLKFDFELLGWGNGTVDYSSTVFWY GDLNSQAAGSSGIEEIEAGLLPTPPSPPSVA >gi|226332003|gb|ACIB01000053.1| GENE 45 45875 - 46807 804 310 aa, chain + ## HITS:1 COG:no KEGG:BF4283 NR:ns ## KEGG: BF4283 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 310 1 310 310 594 100.0 1e-168 MKEKDLIRFMDRMIEERKAEYALGTAHIYQASRNALSAFLKAHDIPFKRVRPELLKQFER FLRRRGNSWNTVSTYMRVLRAVYNRAVDRRLAPHVPHLFKAVYTGTQADIKRALKAEEMG QLLDTKCTRKQSELLQKTHHLFVLMFLLRGLPFVDLAYIQKKDLNGNILTYHRRKTGRQI TITVTKDAMNIIREYMDTTTESPYLFPILSAEGGEDTIYREYQQALRIFNYQLTKLGELL 
GLTTELTSYTARHTWATLAYYLEVHPGIIREAMGHSSIKVTETYLKPFNIKKLDETNLSI IGYAKRSFEG >gi|226332003|gb|ACIB01000053.1| GENE 46 47034 - 47204 75 56 aa, chain - ## HITS:1 COG:no KEGG:BF4282 NR:ns ## KEGG: BF4282 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 56 1 56 56 106 98.0 2e-22 MFSLADDGLASILCNTSIYSYFVGTPKQTIEGCKWRFAKRNSSKRESLFIIRYKKE >gi|226332003|gb|ACIB01000053.1| GENE 47 47253 - 49694 2542 813 aa, chain + ## HITS:1 COG:no KEGG:BF4087 NR:ns ## KEGG: BF4087 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 813 1 807 807 1186 86.0 0 MRKWTYLVAALLVGGATTTFTGCIDNDEPAGIEQLRGAKAEFIKAKAAFETVLTEIQRVK IEREQVSLESDKVNLELKKVALEKEQASAAWVKDSLQARQDTLAASLKEQLLAIQKKEAD TNADLQESLAALEVAMVTAKDEAFGEAIKDVKEALAGITEGELHTYGALDYLKDSNARLL KAKSDLLDFLSDNKYLEDKLNAGIDEAKAALATQEKVLEDMKTFAATPTSEWNTKLAEIS KQIAAVNADVVAKSEAIAKQTAEIQPVLADIEREKAKLNTKDKSFTIPVVDAALQNDLAG FVQESSVLTSDEFNKVFKQDNVTGEYTMIADLNLSGLSLSNYDEASQMLECIKIANIWRG SQLFSYGYINLFNVAYERVFSSNRNNSNIQPTDAEIAKAKGELARMAIDKADKYAIFKKD STEWMDAYKAYMTALTNYKNYQQTTTYDAIVAKVDAYRVLASAEQTKDKANALLADLKAY GQLRDAVDGATGKIYNVDNKEIRLYNVTIVDDSETPTGNQVTLSSFNNSMSNSNSNSWIL GSQQLVTSFSNNALSDFDGATQKLILASKTLFGTDDALSSIVEPKKVGDKYYLPEDVEAG NNTCSYYLYTTAMENVAIFTNIEKWIALDNSLTADLKKFDDAKKPIADNIATLQAGIADK QDAIWKAELEVKLLDYTQNMSNGNPYSVSNSSACQIQALNSLMTTIKNAINNGGQVTYVT YDPVNHSFETVKGTIEKLISDQESKIAIAKDAVATAEGKLEAYQTLGKDDKNRFESDLQA AITNAEQEVAFIQAEVDRLNATLKKLLDAYAAE >gi|226332003|gb|ACIB01000053.1| GENE 48 49719 - 50693 938 324 aa, chain + ## HITS:1 COG:no KEGG:BF4086 NR:ns ## KEGG: BF4086 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 324 1 324 324 654 100.0 0 MIKKLCIILLSVCTVAPLSAQQYSSSDDASFAPKKGQWQVSMVFGSSQMFNNSTEDYLLP RYWDGKSALAPNIGIGNGSGHQSADPAYYLTLGDLNNNSLVNIIGVQGKYFLTDRWDINL MFSMNIGVTPKKDYIEGDRTVQDMQIPAKQYLEGKIKNNWSINIGSNYYFNTKNERINLY VGGLLGWQMGRIQTTTPYTGIMVEDPDMSTDDADAPGGTDLNPSENPNSQDNAAVVDGSD 
VNGTPLEVYIPNTRTGQIFGLRAASVAGIEYSLGKGLILGFEVQPVAYRYDMIQICPKGM SAYKVGHHNINLFALPNLKLGFRF >gi|226332003|gb|ACIB01000053.1| GENE 49 50784 - 51359 647 191 aa, chain + ## HITS:1 COG:no KEGG:BF4085 NR:ns ## KEGG: BF4085 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 191 1 191 191 363 98.0 1e-99 MKRNLTGVLLMGLIMGACNHTPKEGAYHLRGVVTNEKLEGRTIYLRDAMEGVVRYDSTIV SEGRFVFNGKVTAPQVRELFIQENDSDRFPVTLPVVLEPGEINAKIGDIVLVEGTGLNEE MMQTLMALDEFRGRDFTGKEINEIKEAFGGFVLEQIVKHAGSPVGNYLYEAYQNKLSENQ QTEARKTLGIG >gi|226332003|gb|ACIB01000053.1| GENE 50 51559 - 52518 808 319 aa, chain - ## HITS:1 COG:no KEGG:BF4278 NR:ns ## KEGG: BF4278 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 13 319 13 319 319 602 99.0 1e-171 MKKLMLLSLLSTFIFYSCSDDDSCTTCKEDNGNLVTPDLSVTLSDTQSPMTGVLEAYPCQ AGGAIYYGNYIEGKLTPFPGMYYLQNGEIYGDKNREISLPVGTYNMIYWGTPKYEEPIYS TPVVVDPQITIGGDLSQQYFGLRKVSADTTYYPVFDLVYTVKPAHIGTEELSAAMQRVVA GLKVIVKNKNNGILSSSIAGMEVHVGGIAEKLNMYTAAPVNQTKTVSFPLVLSADGTQMS NATVMLFPSSAKPMFKLIIKLKNGNTKVYQQPLNAPLKANNKLTLTLTLGDIFSEETSGG FTIDNWQEENETIDIPTLE >gi|226332003|gb|ACIB01000053.1| GENE 51 52621 - 53250 473 209 aa, chain + ## HITS:1 COG:no KEGG:BF4083 NR:ns ## KEGG: BF4083 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 209 12 220 220 424 100.0 1e-117 MNGFFLLKRPFIWLARFRHRCGYGVHSPFAFDLITNVIYERTPYYAYSSLEAEQKKMSAN SGRKWKHESKKVNRLLFRLVNYIQPDTIVDAGTLSASSLYLQAGHAKADYVGASDLSELF LEKDTPVDFLYLHHYRNEEFVEQVFDLCASRTTGRGLFVIEGIRYTKKMKALWKKIQQDD RTGITFDLYDLGIVFFDRTKIKQHYLVNF >gi|226332003|gb|ACIB01000053.1| GENE 52 53299 - 53862 449 187 aa, chain + ## HITS:1 COG:SMc00731 KEGG:ns NR:ns ## COG: SMc00731 COG2096 # Protein_GI_number: 15966394 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Sinorhizobium meliloti # 1 182 2 186 192 148 44.0 6e-36 MKKSLVYTKTGDKGTTGLIGGTRVPKTHIRLEAYGTVDELNSNLGLLATYLMDEHDLNFV 
QSVQDKLFAIGSHLATDQEKVQLNDVSIITPAEVEAIEREIDAADEILPPLHSFIIPGGS RGSAVCHVCRTVCRRAERRILALSESCTISADLLAYINRLSDYLFVLSRKMNFNEGKDEI FWNNSCK >gi|226332003|gb|ACIB01000053.1| GENE 53 53933 - 54154 363 73 aa, chain + ## HITS:1 COG:no KEGG:PGN_1678 NR:ns ## KEGG: PGN_1678 # Name: not_defined # Def: hypothetical protein # Organism: P.gingivalis_ATCC33277 # Pathway: not_defined # 1 73 13 85 85 117 98.0 2e-25 MYWTLELASKLEDAPWPATKDELIDYAMRSGAPLEVIENLQEMEDEGEIYESIEDIWPDY PSKEDFFFNEEEY >gi|226332003|gb|ACIB01000053.1| GENE 54 54230 - 55417 1288 395 aa, chain - ## HITS:1 COG:no KEGG:BF4216 NR:ns ## KEGG: BF4216 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 395 1 395 395 752 100.0 0 MKKLMAMLLLAGSIQGVYAQKTEKKEMFLENKSLYEELTNVQKKTDKFNLYLNMQGSFDA NFRDGFDEGVFKMRQLRIEAKGNLNSWLSYRYRQRLNRSNEGGGMIDNIPTSIDYAGIGV KLNDQFSFFAGKQCTAYGGFEFDLNPIDIYQYSDMIENMSNFMTGLNIGYNITPTQQLNL QILNSRNSSFDKTYGITEDSEGKLPDLKSGKMPLVYTLNWNGNFNEVFKTRWSASVMSEA KGKNLYYYAVGNELNLDKFNMFVDFMYSQEGIDRNGTITGIVGNAGGHNAFNAGYLSVVT KLNYRFLPKWNAFVKGMYETASVTKAADGIEKGNYRTSWGYLAGVEFYPMKTNLHFFLTY VGRSYDFTHRAKVLGQENYSTNRLSLGFIYQLPMF >gi|226332003|gb|ACIB01000053.1| GENE 55 55450 - 56508 1121 352 aa, chain - ## HITS:1 COG:STM3106 KEGG:ns NR:ns ## COG: STM3106 COG0252 # Protein_GI_number: 16766407 # Func_class: E Amino acid transport and metabolism; J Translation, ribosomal structure and biogenesis # Function: L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D # Organism: Salmonella typhimurium LT2 # 7 352 1 348 348 369 60.0 1e-102 MKELKRLSFVVVTLLLSTMMAFAQKPNIHILATGGTIAGTGGSATSTNYTAGQVAISTLL DAVPELKDIANVTGEQIVRIASQDMSDEVWLILAKKINQLLKRPDIDGIVITHGTDTMEE TAYFLNLTVKSNKPVVLVGAMRPSTALSADGPLNLYNAVVTAGAKESIGKGVLIAMNGLI LGAESAIKMNTIDVQTFQAPNSGALGYIFNGKVFYNQAPLKKHTTQSVFDVTNLNSLPKV GIVYSYSNIDPDMVTPLLHHDYKGIIHAGVGNGNFHKNILPVLLEARKKGILVVRSSRVP TGPTTMDAEVDDTQYQFIASQELNPQKSRVLLILGLTKTNDWKQIQQYFNEY >gi|226332003|gb|ACIB01000053.1| GENE 56 56554 - 57864 1425 436 aa, chain - ## HITS:1 COG:HI0746 
KEGG:ns NR:ns ## COG: HI0746 COG2704 # Protein_GI_number: 16272687 # Func_class: R General function prediction only # Function: Anaerobic C4-dicarboxylate transporter # Organism: Haemophilus influenzae # 2 430 6 433 440 374 56.0 1e-103 MILQLAFVLTAIIIGARLGGIGLGVMGGVGLGILTFAFGLQPTAPPIDVMLMIAAVISAA SCMQAAGGLDYMVKLAEKLLRKNPSHVTILSPIVTYLFTFVAGTGHVAYSVLPVIAEVAT ETKIRPERPLGIAVIASQQAITASPISAATVALLGLLAGFDITLFDILKITIPATIIGVL VGALFSMKVGKELVDDPEYQKRLAEGYFNSKKIEIKDVHNRRNAMISVLIFILATAFIVF FGSFDGMRPTFLIDGETVTLGMSAIIEIVMLSAAALILLITKTDGIKATQGSVFPAGMQA VIAIFGIAWMGDTFLQGNMGQLTESIEGLVRQMPWLFGIALFIMSILLYSQAATVRALMP LGIALGISPYMLIAMFPAVNGYFFIPNYPTVVAAINFDRTGTTKIGKYVLNHSFMMPGLV STVVAIALGLLFIQIF >gi|226332003|gb|ACIB01000053.1| GENE 57 58183 - 59616 1611 477 aa, chain + ## HITS:1 COG:Cj0087 KEGG:ns NR:ns ## COG: Cj0087 COG1027 # Protein_GI_number: 15791475 # Func_class: E Amino acid transport and metabolism # Function: Aspartate ammonia-lyase # Organism: Campylobacter jejuni # 8 468 2 462 468 523 56.0 1e-148 MKEELSKATRTESDLIGEREVPETALYGVQTLRGIENFRISKYHLCEYPLFINALAITKM GAAMANFELGLLTEEQANAILRACKEILEGKHHDQFPVDMIQGGAGTTTNMNANEVIANR ALELMGHKRGEYQYCSPNDHVNRSQSTNDAYPTAIHIGMYYTHLKLVKHFKEVIDAFRRK GEEFAHVIKMGRTQLEDAVPMTLGQTFNGFASILQDEVKNLDFAAQDFLTVNMGATAIGT GITAEPEYASKCIAALRKITGLDIRLADDLIGATSDTSCLVGYSSAMRRIAVKMNKICND LRLLASGPRCGLGEINLPAMQPGSSIMPGKVNPVIPEVMNQIDYKVIGNDLCVAMSGEAA QMELNAMEPVMAQCCFESADLLMNGFDTLRTLCIDGITANEEKCRKDVHNSIGVVTALNP VIGYKNSTKIAKEAQETGKGVYELVLEHDILSKEDLDTILKPENMIHPVKLDIKPNH >gi|226332003|gb|ACIB01000053.1| GENE 58 59763 - 60923 899 386 aa, chain + ## HITS:1 COG:no KEGG:BF4036 NR:ns ## KEGG: BF4036 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 386 12 397 397 803 100.0 0 MKRLPFYFWVSLALASCQGGKEAVNQALPVIDMNEDYPEKEIVLQDIADISYIPLETNDE FLFDGSVEVVTDQYVITKGHRGNDVCFFSRQGKVLNRIHRVGNGPGEYKDIGSIDVNPAN GELYLKEMNRQQIHVYSLDGKFKHSFTFPEGKRMSRMCLFSPDYLIAEQESKVPDDQDAN FYPYLLVSTRDGHLDSLDYVQKRNILVKLIVNAENHSYAYLLEPSLIRNGSRFYIGNPDS 
DTLFAMNPDRTLEPLLVRTPSHSEEGNKYGLFLRGAAGAYFFLTKQPMEVPMNSIESLDL KSEEWLYDCRTQEVCRYLLKNKDDASKRVEGIMFFCYPEDCGLAVLKSEDLMDAYEAGQL SGELKEIAAGLKADDNPVLMLIHFKK >gi|226332003|gb|ACIB01000053.1| GENE 59 61604 - 62182 732 192 aa, chain + ## HITS:1 COG:no KEGG:BF4035 NR:ns ## KEGG: BF4035 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 192 1 192 192 367 100.0 1e-100 MKKIISALMIAVCIGMAMPAQAQLIKFGVKGGVNLAKADFNKSDLKTDNFTGFFIGPMAE VTIPLVGLGVDGALLFAQRGVKVGDESIRQNGLDIPINLKYTIGLGSALGIYVAAGPDFY FNFSGDKNYGELGRLNKKNAQVGINVGAGVKLLRHLQVGANYNIPLNKTAEWKEADFSYK TKMWQISAAYIF >gi|226332003|gb|ACIB01000053.1| GENE 60 62200 - 64716 2170 838 aa, chain + ## HITS:1 COG:BS_priA KEGG:ns NR:ns ## COG: BS_priA COG1198 # Protein_GI_number: 16078634 # Func_class: L Replication, recombination and repair # Function: Primosomal protein N' (replication factor Y) - superfamily II helicase # Organism: Bacillus subtilis # 32 837 15 801 805 465 34.0 1e-130 MINESPALASGIFCIFVLTMKKYVDVILPLPLPRCFTYSLPDEGAEEVQIGCRVVVPFGR KKYYTAIVRNVHHYAPTEYEVKEISTVLDTSPILLPGQFRFWEWLADYYLCTQGDVYKAA LPSGLKLESETIVEYNPDFEADAPLSEREQLVLDLLAKEPEQCVTKLEKESGLKNILTVI KSLLDKEALFVKEELRRTYKPKTEARVRLAADASGEENLRRIFDELERAPKQLALLMKYV ELSGVLGDGASKEVSKKELLQRASASPAIFNGLVEKQIFEVYYQEIGRLNRLVGKTVELN VLNEHQQRAYHEIMQSFQEKNVCLLHGVTSSGKTEVYIHLIEETLRQGRQVLYLLPEIAL TTQITERLKRVFGSRLGIYHSKFPDAERVEIWQKQLTEEGYDIILGVRSSVFLPFRNLGL VIVDEEHENTYKQQDPAPRYHARNAAIVLASMYGAKTLLGTATPSVETWQNATTGKFGWV ELKERYKEIQLPEIIPVDIKELHRKKRMTGQFSPLLLQYVREALDNKQQVILFQNRRGFA PMIECRTCGWVPKCKNCDVSLTYHKGINQLTCHYCGYTYQLPRSCPACEGVELMHRGFGT EKIEDDVKLIFPEASVARMDLDTTRTRSAYEKIIADFEQGKTDILIGTQMVSKGLDFDHV SVVGILNADTMLNYPDFRSYERAFQLMAQVAGRAGRKNKRGRVVLQTKSIDHPIIRQVMT NDYEDMVAGQLAERQMFHYPPYYRMVYVYLKNRNETLLDVMAHTMAEKLRALFGNRILGP DKPPVARIQTLFIRKIVVKIEQNAPMSRARELLLRVQREMIEDERFKSLIVYYDVDPM >gi|226332003|gb|ACIB01000053.1| GENE 61 64758 - 66557 1667 599 aa, chain + ## HITS:1 COG:no KEGG:BF4209 NR:ns ## KEGG: BF4209 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 599 15 613 613 1233 99.0 0 MKRITTLLLSCLVSTGPLLAQQGVTQCGTPTGQAPFPIQSYKELPDPVAPSEKEWAAVKA PQVQWGNTDTRYAKHAVPVIQPQKSITLEGWRGEKLHAQAVVWTGTDLKGLNYSLSEFKN SKGDVLPADAFSGGFVRYVMTDELNKDGRGGCGYRPDHSIYDSLLVADPIDHLLTSMPME AKSTQAIWINCQVPQTVSPGVYRGTVEVKDGDNRLSTLKMDIKVSSRVLPAPSQWAFHLD LWQSPFAVARYYQVPLWSQAHIDAMRPVMKMLADAGQKIITASIMHKPWNGQTYDYFESM VTWTKKVDGTWAFDYDVFDKWVEMMMSVGIDKQINCYSMVPWKLSFQYFDQATNSMQYVK TAPGEKAYEEMWVAMLKSFSKHLREKGWFDICTIAMDERPMEVMQKTLQVIRKADPEFKV SLAGNFHKELEADIYDYCIPIGASYPAEVLARRAQNNLPTTYYTCCTEAFPNTFTFSDPA EAAWMSYYSAKDHLDGYLRWAYNSWPKEPLLDSRFEAWAGGDTYLVYPGARSSIRFEKLI EGVQAHEKITILRKEFTDKKNKTGFKKLEKMLSTFNLRDFPEVPAAETVNKANKILNSL >gi|226332003|gb|ACIB01000053.1| GENE 62 66842 - 68416 680 524 aa, chain + ## HITS:1 COG:no KEGG:Aasi_1729 NR:ns ## KEGG: Aasi_1729 # Name: not_defined # Def: hypothetical protein # Organism: A.asiaticus # Pathway: not_defined # 8 473 3 401 404 102 24.0 3e-20 MKRTLPTITPILYTSKTLANGEHPIMLRVCYNGKRSYKSLGIYCKSTEWNKDKKRVKGSR ASKYNMVITRELTKASDYVLSLEGKDDYTAATIVKHLSKSFPTQVTLFMLFEERIAFFKE EKQSHNNAVGYRTLLNRIKRYTNNIDLELFEITSNWLSEFEEHLHCHYCDNSIRKFFDVF KAVFNYAKRQEYIKETPFVNFIFSKKLDTHTRKRALSLDEITKLMRYYYQRYGMLGIEDN NVFGEHDLKQYWVNQKFKLKGQNKLTPINAEQFSLALYLTSYLFQGLALVDIANLKLKDL HLLEVVDDEKYQRDAALRGVDYAEAHKRTVLHYDISTCRAKTHKNTHIVAECQNLMPYLN PFGSYFDDYDQLDEEDMERYLFPIFDHNNDDAETKFCRMTYMNYLVNVNLKRIAKRLGMP PMTFYSARHSYASQLYHANVPIGLIAQNMGRNTSEIETYLKEFNVNSIVSANNKSLIVGQ DLFKEMAKKQEKERQDNIREVLMANGNVEELERYEAYLKWREEH >gi|226332003|gb|ACIB01000053.1| GENE 63 68734 - 68946 180 70 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKEETKRKISMRLRGRKKSATHCKHISQSLQALKKTKEHKEHLSASLKEYHKNNRIKSER HEKEKEYNKH >gi|226332003|gb|ACIB01000053.1| GENE 64 68915 - 69826 574 303 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566637|ref|ZP_04844090.1| ## NR: gi|253566637|ref|ZP_04844090.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 303 1 303 303 566 100.0 1e-160 MRKKKSIISIDKLTLCYRATDEVIKKLNEYNELIDSDDSFRLVRVSSDEALFANSFHVMI KFPFGEAGHGFVERKFATLKTKLRSMGDDKVNYVWLYIENWAFYEVFSIYDSNKCNWLSS VDYIADELGLSLNNITDLHIALDTNIDFAKKISKAQFDDDYIVVLNGTIRSNKDEILDDI LHVKTGNQRRLKTLSVYVSPKKKDGLSLKIYDKKRELEKSNKNYIPQWNGLKNKNYRVEL TIKNEHLKEFYQWKGETFPDELLMATLASQSQSQDLLFDMLYYFTNRLLRFSYKGKSISI FQL >gi|226332003|gb|ACIB01000053.1| GENE 65 70095 - 70724 399 209 aa, chain + ## HITS:1 COG:VCA0795 KEGG:ns NR:ns ## COG: VCA0795 COG1961 # Protein_GI_number: 15601550 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinases, DNA invertase Pin homologs # Organism: Vibrio cholerae # 2 202 119 301 318 63 31.0 3e-10 MRAIIYARVSSTTDRQTTDRQVADLKSYAEYAKMEIVKVFEEKVSGAKRNTERPVLVEAL EFCRTERIDMLLVSELSRLGRNAFEVLETVKGLVDDKINLYMQKEKFTLLDDEKQPSMFA AIMLATLSTCAQLERENIKFRLNSGRQQYIAKGGRLGRKEGSTKSVDKKREEYKDVLKAL RQGLSVRQVAKLTDTSASTVQRLKKEFKL >gi|226332003|gb|ACIB01000053.1| GENE 66 70776 - 71267 248 163 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566639|ref|ZP_04844092.1| ## NR: gi|253566639|ref|ZP_04844092.1| predicted protein [Bacteroides sp. 3_2_5] # 1 163 1 163 163 294 100.0 1e-78 MKKYIYFFSITLLATLSFTLMSCGDDDENNETNGNTIEINGVMRTVSTIAGLEGSWSNGS GEFTLTVDNVKNGTNDLEYYMFSFQNAADLKKGDDVSKMQLTLSPPEASYWESYSYQSGK ATVMATNKEKREITIQFEQLEMVYKGDIYKFNGTATLMYDFGN >gi|226332003|gb|ACIB01000053.1| GENE 67 71301 - 71699 424 132 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566640|ref|ZP_04844093.1| ## NR: gi|253566640|ref|ZP_04844093.1| predicted protein [Bacteroides sp. 3_2_5] # 1 132 1 132 132 238 100.0 1e-61 MKKHVIGFALIGCIALSSCSTVNYMSTDKLKANYETKINHTTFLGKTLFKTDKKKKDTYE NVKVYMNENEVGKDFEVVALGSYTPWILPLVRPERPRLEKYLLWKAARKARKLGANGAII DNKNNFRVIKTK >gi|226332003|gb|ACIB01000053.1| GENE 68 72111 - 72578 457 155 aa, chain + ## HITS:1 COG:alr5068 KEGG:ns NR:ns ## COG: alr5068 COG0394 # Protein_GI_number: 17232560 # Func_class: T Signal transduction mechanisms # Function: Protein-tyrosine-phosphatase # Organism: Nostoc sp. 
PCC 7120 # 2 152 4 155 161 149 48.0 2e-36 MKILFVCLGNICRSSTAEGVMLHLIKEAGLEKEFVIDSAGILAYHQGELPDSRMRAHAAR RGYELVHRSRPVCTEDFYNFDLIIGMDDRNMDDLKEKAPSPAEWKKIHRMTEYCTRIPAD HVPDPYYGGAEGFEYVLDILEDACAGLLTSLTQDS >gi|226332003|gb|ACIB01000053.1| GENE 69 72553 - 74610 1853 685 aa, chain - ## HITS:1 COG:CAC1292 KEGG:ns NR:ns ## COG: CAC1292 COG1480 # Protein_GI_number: 15894574 # Func_class: R General function prediction only # Function: Predicted membrane-associated HD superfamily hydrolase # Organism: Clostridium acetobutylicum # 193 684 195 679 695 275 33.0 3e-73 MEHWKKKNSFSYKDLLYKALIFVGTVAFIVYFLPRDGKFNYQFDINKPWKYGQLMATFDF PIYKDEAVVKREQDSLLASFQPYFELDKEVEKSALAKLKENYHAHLKGILPSTDYIRYIE RGLKAIYQSGVVSTEEMRTLLHDSISSVMVIEDKLANQRTTDGIYTVKRAYENLISGDTA HYNRDILRQCALNEYITPNLIYDSVRTETARKELLDNYSWANGVVQSGQKIIDRGEIVNK QTYNILESLRKESIKRSESIGQKRLILGGQILFVGILILCFMLYLELFRKDYYERKGSLS LLFALIVFYCVITALMVTNNIFNVYILPYAMLPIIIRVFLDSRTAFLTHVITILICSITL RYPHEFILTQIAAGLVAIFSLRELSQRSQLFRTALLVILTYAAIYFAFELISENDLSKLN VSMYIYFIINGVLLLFAYPLLFLLEKTFGFTSNVTLVELSNINNDLLRRMSETVPGTFQH SMQVANLAAEAAIRIGAKSQLVRTGALYHDIGKMENPAFFTENQSGVNPHKNLSYEQSAQ VVISHVTDGLKLADKHNLPKVIKDFISTHHGRGKTKFFYISWKNEHPGEEPNEEVFTYPG PNPFSKETAILMMADSVEAASRSLPEYTEESISNLVDKIIDSQVQEGYFKECPITFKDIA TIKAVFKEKLKTIYHTRISYPELKK >gi|226332003|gb|ACIB01000053.1| GENE 70 74705 - 76222 1596 505 aa, chain + ## HITS:1 COG:BB0372 KEGG:ns NR:ns ## COG: BB0372 COG0008 # Protein_GI_number: 15594717 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Glutamyl- and glutaminyl-tRNA synthetases # Organism: Borrelia burgdorferi # 7 505 4 487 490 337 37.0 5e-92 MAERKVRVRFAPSPTGALHIGGVRTALYNYLFARQHGGDLIFRIEDTDSNRFVPGAEEYI LESFKWLGIQFDEGVSFGGEYGPYRQSERREIYKKYVQVLLDNGKAYIAFDTPEELDAKR AEIANFQYDASTRVGMRNSLTLPKEEVEALIADGKQYVVRFKIEPNEDIHVNDLIRGEVV INSSILDDKVLYKSADELPTYHLANIVDDHLMEVSHVIRGEEWLPSAPLHVLLYRAFGWE DTMPAFAHLPLLLKPEGNGKLSKRDGDRLGFPVFPLEWHDPKSGEISSGYRESGYLPEAV INFLALLGWNPGNDQEVMSMDELIRLFDLHRCSKSGAKFDYKKGIWFNHTYIQQKSDKEI 
AELFVPVLKEHGVEAPFEKVVTVVGMMKDRVSFVKELWEVCSFFFVAPTEYDEKTVKKRW KEDSAKCMTELAEVLAGIEDFSIEGQEKIVMDWIAEKGYHTGNIMNAFRLTLVGEGKGPH MFDISWVLGKEETLARMKRAVEVLK >gi|226332003|gb|ACIB01000053.1| GENE 71 76235 - 77455 905 406 aa, chain + ## HITS:1 COG:YPO0055 KEGG:ns NR:ns ## COG: YPO0055 COG1519 # Protein_GI_number: 16120408 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 3-deoxy-D-manno-octulosonic-acid transferase # Organism: Yersinia pestis # 49 404 51 416 425 147 28.0 4e-35 MFYDLAIGIYDLLVHLAAPFSRKPRKMMKGHWVVYDLLRQQVEKDERYIWFHAASLGEFE QGRPLIESIRERYPDYKILQTFFSPSGYEVRKNYRGADIVCYLPFDKPRNVKKFLDIVNP CMAFFIKYEFWKNYLDELHKRRIPVYSVSSIFRKDQIFFKWYGGTYRNVLKDFDHLFVQN EASKRFLAKIGITRVTVVGDTRFDRVLQIREQAKELPLVEQFKNGAFTFVAGSSWGPDED LFIEYFNSHPEMKLIIAPHVIDENHLVEIIGKLKRPSVRYTRADEKNVRKADCLIIDCFG LLSSIYRYGEIAYIGGGFGVGIHNTLEAAVYGIPVIFGPKYQKFMEAMQLIEAGGAYSIK DYNELKILLDRLLTDEAFLKKTGTNAGNYVIGNSGATEKVLHMINF >gi|226332003|gb|ACIB01000053.1| GENE 72 77644 - 78156 602 170 aa, chain - ## HITS:1 COG:RP516 KEGG:ns NR:ns ## COG: RP516 COG0663 # Protein_GI_number: 15604376 # Func_class: R General function prediction only # Function: Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily # Organism: Rickettsia prowazekii # 3 170 2 168 185 145 42.0 4e-35 MALIKSVRGFTPEFGENCFLADNATIIGDVKMGQNCSIWFSTVLRGDVNSIRMGDGVNIQ DGSVLHTLYEKSTIEIGNYVSVGHNVTIHGATVKDYALIGMGSTLLDHAVIGEGAIVAAG SLVLSNTIIEPGSIWGGVPAKFIKKVDPEQAKELNQKIAHNYLMYSDWYK >gi|226332003|gb|ACIB01000053.1| GENE 73 78251 - 80029 1245 592 aa, chain - ## HITS:1 COG:Cj0653c KEGG:ns NR:ns ## COG: Cj0653c COG0006 # Protein_GI_number: 15792013 # Func_class: E Amino acid transport and metabolism # Function: Xaa-Pro aminopeptidase # Organism: Campylobacter jejuni # 7 592 6 595 596 459 43.0 1e-129 MKQSISERIHALRMWFKPNIQAFIIPSTDPHLSEYVAPHWKSREWISGFTGSAGTVVITE KKAGLWTDSRYFLQAAEQLQGSGIDLYKEMLPETPSITKFLSDELQPGESVGIDGKMFSV EQVESMQAELSAKNIQIVFCPDPMDELWENRPPMLESPAFVYDIKYAGKSCSEKIAAIRT ELKKKSAESVMLSALDEIAWTLNLRGNDVHCNPVVVSYLLITEKKAVLFIAPEKVTEEVR 
NYLEEQQIEIQNYSDTEIYLSDLNSSSILMNPAKTNYSVFSSVNPQCRIIRGEAPVALLK AIRNEQEIKGIHAAMQRDGVALVKFLRWLESAVPSGTETELSIDRKLHAFRATQDLYAGE SFDTIAGYKEHGAIVHYSATEESNATLHPKGFLLLDSGAQYLDGTTDITRTIALGELTTE EKTDYTLVLKGHIALAMAVFPSGTRGAQLDVLARMPLWSHKMNFLHGTGHGVGHFLSVHE GPQSIRMNENPIVLQPGMVTSNEPGVYKGGSHGIRTENLTLVCSAGEGLFGEYLKFETIT LCPICKKGIIKELLTADEVDWLNNYHQQVYEKLSPKLNEEEKAWLKEATAAI >gi|226332003|gb|ACIB01000053.1| GENE 74 80010 - 80186 154 58 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|265767526|ref|ZP_06095192.1| ## NR: gi|265767526|ref|ZP_06095192.1| predicted protein [Bacteroides sp. 2_1_16] # 1 58 1 58 58 89 98.0 5e-17 MLWFKISAVKLRRKITHTFFIHKTFGNYFSKSMQKVQKVVIFGEYNDIIYIMYETVNQ >gi|226332003|gb|ACIB01000053.1| GENE 75 80203 - 80394 310 63 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715487|ref|YP_101479.1| 30S ribosomal protein S21 [Bacteroides fragilis YCH46] # 1 63 1 63 63 124 100 3e-27 MIVVPVKEGENIEKALKKFKRKFEKTGIVKELRSRQQFDKPSVTKRLKKERAVYVQKLQQ VED >gi|226332003|gb|ACIB01000053.1| GENE 76 80468 - 81349 682 293 aa, chain + ## HITS:1 COG:lin1316 KEGG:ns NR:ns ## COG: lin1316 COG4974 # Protein_GI_number: 16800384 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinase XerD # Organism: Listeria innocua # 2 293 7 300 300 194 38.0 2e-49 MLTDSFLDYLRYERNYSEKTVLAYGEDISQLREFAQERMEKFDPAEVKPELVREWIVSLM DQGYTSTSVNRKLSSLRSFYKYLLRQGEVSVDPLRKITGPKNKKPLPSFLKESEMNKLLD DTDFGEGLKGCRDRLIIEMFYATGMRLSELIGLDDKDVDFSASLLKVTGKRNKQRLIPFG DELKETMLEYVDIRNEMISGRSDAFFVRENGERLYKNLVYNLVKRNLSKVVTLKKRSPHV LRHTFATTMLNNDAELGAVKELLGHSSLATTEIYTHTTFEELKKVYKQAHPRA >gi|226332003|gb|ACIB01000053.1| GENE 77 81364 - 81663 200 99 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163755828|ref|ZP_02162946.1| 30S ribosomal protein S21 [Kordia algicida OT-1] # 1 97 4 100 102 81 43 2e-14 MEIRIQSIHFDATEQLQAFIQKKVSKLEKYYEDIKKVEVSLKVVKPETAENKEAGITVLV PNNDFHASKICDTFEEAVDLCVEALEKQLVKYKEKQRNK >gi|226332003|gb|ACIB01000053.1| GENE 78 82229 - 83413 1396 394 aa, chain + ## PROTEIN SUPPORTED ## NR: 
gi|119502908|ref|ZP_01624993.1| Ribosomal protein S19 [marine gamma proteobacterium HTCC2080] # 1 394 1 407 407 542 66 1e-153 MAKEKFERTKPHVNIGTIGHVDHGKTTLTAAITTVLAKKGLSELRSFDSIDNAPEEKERG ITINTSHVEYETANRHYAHVDCPGHADYVKNMVTGAAQMDGAIIVVAATDGPMPQTREHI LLARQVNVPKLVVFMNKCDMVEDAEMLELVEMEMRELLSFYDFDGDNTPIIQGSALGALN GVEKWEDKVMELMEAVDTWIPLPPRDVDKPFLMPVEDVFSITGRGTVATGRIETGVIHVG DEIEILGLGEDKKSVVTGVEMFRKLLDQGEAGDNVGLLLRGVDKNEIKRGMVLCKPGQIK PHSKFKAEVYILKKEEGGRHTPFHNKYRPQFYLRTMDCTGEITLPEGTEMVMPGDNVTIT VELIYPVALNIGLRFAIREGGRTVGAGQITEIID >gi|226332003|gb|ACIB01000053.1| GENE 79 83554 - 83745 157 63 aa, chain + ## HITS:1 COG:no KEGG:BF4198 NR:ns ## KEGG: BF4198 # Name: not_defined # Def: preprotein translocase SecE subunit # Organism: B.fragilis # Pathway: Protein export [PATH:bfr03060]; Bacterial secretion system [PATH:bfr03070] # 1 63 1 63 63 104 100.0 8e-22 MKKVVAYIKESYDELVHKVSWPTYSELTNSAVVVLYASLLIALVVFAMDFCFQNFMEKII YPH >gi|226332003|gb|ACIB01000053.1| GENE 80 83762 - 84304 615 180 aa, chain + ## HITS:1 COG:CC3205 KEGG:ns NR:ns ## COG: CC3205 COG0250 # Protein_GI_number: 16127435 # Func_class: K Transcription # Function: Transcription antiterminator # Organism: Caulobacter vibrioides # 7 179 12 183 185 152 47.0 4e-37 MSEIEKKWYVLRAISGKEAKVKEYLEADIKNSDLGEYVSQVLIPTEKVYQVRNGKKIVKE RSYLPGYVLVEAALVGEVSHHLRNTPNVIGFLGGSDKPVPLRQSEVNRILGTVDELQETG EDLNVPYIVGETVKVTFGPFSGFSGIIEEVNSEKKKLKVMVKIFGRKTPLELGFMQVEKE >gi|226332003|gb|ACIB01000053.1| GENE 81 84365 - 84808 738 147 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715481|ref|YP_101473.1| 50S ribosomal protein L11 [Bacteroides fragilis YCH46] # 1 147 1 147 147 288 99 7e-77 MAKEVAGLIKLQIKGGAANPSPPVGPALGSKGINIMEFCKQFNARTQDKAGKILPVIITY YADKSFDFVIKTPPVAIQLLEVAKVKSGSAEPNRKKVAEITWEQVRAIAQDKMVDLNCFT VEAAMRMVAGTARSMGIAVKGEFPVNN >gi|226332003|gb|ACIB01000053.1| GENE 82 84824 - 85522 1157 232 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715480|ref|YP_101472.1| 50S ribosomal protein L1 [Bacteroides fragilis YCH46] # 1 232 1 232 232 450 100 1e-125 
MGKLTKNQKLAAGKIEAGKAYSLKEAASLVKEITFTKFDASLDIDVRLGVDPRKANQMVR GVVSLPHGTGKQVRVLVLCTPDAEAAAKEAGADYVGLDEYIEKIKGGWTDIDVIITMPSI MGKIGALGRVLGPRGLMPNPKSGTVTMDVAKAVREVKQGKIDFKVDKSGIVHTSIGKVSF TAEQIRDNAKEFISTLNKLKPTAAKGTYIKSIYLSSTMSAGIKIDPKSVEEI >gi|226332003|gb|ACIB01000053.1| GENE 83 85538 - 86050 844 170 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715479|ref|YP_101471.1| 50S ribosomal protein L10 [Bacteroides fragilis YCH46] # 1 170 1 170 170 329 100 4e-89 MRKEDKNTIIEQIAATVQEYGHFYLVDTTAMNAAATSELRRACFKADIKLMVVKNTLLHK ALESIEGDFSPLYDSLKGTTAVMFCNVANAPAKLIKDKSKDGIPGLKAAYAEESFYVGAD QLDALVAIKSKNEVIADIVALLQSPAKNVISALQSGGNTLHGVLKTLGER >gi|226332003|gb|ACIB01000053.1| GENE 84 86099 - 86473 588 124 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715478|ref|YP_101470.1| 50S ribosomal protein L7/L12 [Bacteroides fragilis YCH46] # 1 124 1 124 124 231 100 2e-59 MADLKAFAEQLVNLTVKEVNELATILKEEYGIEPAAAAVAVAAGPAAGAAAAEEKSSFDV VLKSAGAAKLQVVKAVKEACGLGLKEAKDMVDGAPSVVKEGLAKDEAESLKKTLEEAGAE VELK >gi|226332003|gb|ACIB01000053.1| GENE 85 86578 - 90390 2924 1270 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163796927|ref|ZP_02190884.1| 30S ribosomal protein S12 [alpha proteobacterium BAL199] # 9 1269 16 1387 1392 1130 45 0.0 MSSNTVNQRVNFASTKNPLEYPDFLEVQLKSFQDFLQLDTPPEKRKKEGLYKVFAENFPI ADTRNNFVLEFLDYYIDPPRYTIDDCIERGLTYSVPLKAKLKLYCTDPDHEDFDTVIQDV FLGPIPYMTDKATFVINGAERVVVSQLHRSPGVFFGQSVHANGTKLYSARIIPFKGSWIE FATDINNVMYAYIDRKKKLPVTTLLRAIGFENDKDILEIFNLAEDVKVNKTNLKKVVGRK LAARVLKTWIEDFVDEDTGEVVSIERNEVIIDRETVIEPEHIDEIIDSGVQNILIHKEEP NQSDYSIIYNTLQKDPSNSEKEAVLYIYRQLRNADPADDASAREVINNLFFSEKRYDLGD VGRYRINKKLNLTTDMDVRVLTKEDIIEIIKYLIELINSKADVDDIDHLSNRRVRTVGEQ LSNQFAVGLARMSRTIRERMNVRDNEVFTPIDLINAKTISSVINSFFGTNALSQFMDQTN PLAEITHKRRMSALGPGGLSRERAGFEVRDVHYTHYGRLCPIETPEGPNIGLISSLCVFA KINDLGFIETPYRKVDNGKVDLSENGLVYLTAEEEEAKIIAQGNAPLNDDGTFIRNKVKS RQDADYPVVEPSEVELMDVAPQQIASIAASLIPFLEHDDANRALMGSNMMRQAVPLLRSE APIVGTGIERQLVRDSRTQIAAEGDGVIDFVDATTIRILYDRTEDEEFVSFEPALKEYRI PKFRKTNQNMTIDLRPTCNKGDRVTKGQILTEGYSTENGELALGKNLLVAYMPWKGYNYE 
DAIVLNERVVREDLLTSVHVEEYSLEVRETKRGMEELTSDIPNVSEEATKDLDENGIVRV GARIQPGDILIGKITPKGESDPSPEEKLLRAIFGDKAGDVKDASLKASPSLKGVIIDKKL FSRVIKNRSSKLADKALLPKIDDEFESKVADLKRILVKKLMVLTEGKVSQGVKDYLGAEV IAKGSKFSASDFDSLDFTAIQLSDWTNDDHANGMIRDLILNFIKKYKELDAELKRKKFAI TIGDELPAGIIQMAKVYIAKKRKIGVGDKMAGRHGNKGIVSRVVRQEDMPFLEDGTPVDI VLNPLGVPSRMNIGQIFEAVLGRAGKNLGVKFATPIFDGATLDDLNEWTDKAGLPRYCKT YLCDGGTGERFDQPATVGVTYMLKLGHMVEDKMHARSIGPYSLITQQPLGGKAQFGGQRF GEMEVWALEGFGASHILQEILTIKSDDVVGRSKAYEAIVKGEPMPQPGIPESLNVLLHEL RGLGLSINLE >gi|226332003|gb|ACIB01000053.1| GENE 86 90537 - 94820 4286 1427 aa, chain + ## HITS:1 COG:mlr0277 KEGG:ns NR:ns ## COG: mlr0277 COG0086 # Protein_GI_number: 13470543 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, beta' subunit/160 kD subunit # Organism: Mesorhizobium loti # 13 1395 18 1356 1398 1337 50.0 0 MAFRKENKIKSNFSKISIGLASPEEILENSSGEVLKPETINYRTYKPERDGLFCERIFGP IKDYECHCGKYKRIRYKGIVCDRCGVEVTEKKVRRERMGHIQLVVPVAHIWYFRSLPNKI GYLLGLPTKKLDSIIYYERYVVIQPGVKAEDGIAEFDLLSEEEYLDILDTLPKDNQYLED TDPNKFIAKMGAEAIYDLLARLDLDALSYELRHRAGNDASQQRKNEALKRLQVVESFRAS RGRNKPEWMIVRIVPVIPPELRPLVPLDGGRFATSDLNDLYRRVIIRNNRLKRLIEIKAP EVILRNEKRMLQESVDSLFDNSRKSSAVKTDANRPLKSLSDSLKGKQGRFRQNLLGKRVD YSARSVIVVGPELRMHECGIPKLMAAELYKPFIIRKLIERGIVKTVKSAKKIVDRKEPVI WDILEHVMKGHPVLLNRAPTLHRLGIQAFQPKMIEGKAIQLHPLACTAFNADFDGDQMAV HLPLSNEAVLEAQMLMLASHNILNPANGAPITVPSQDMVLGLYYITKLRKGAKGEGLTFY GPEEALIAYNEGKVDIHAPVKVIVKDLDENGNIVDVMRETSVGRVIVNEIVPPEVGYINT IISKKSLRDIISAVIKACGVARTADFLDGIKNLGYKMAFQGGLSFNLGDIIIPKEKETLV QRGYEEVEQVISNYNMGFITNNERYNQVIDIWTHVNSELSNILMKTISSDDQGFNSVYMM LDSGARGSKEQIRQLSGMRGLMAKPQKAGAEGGQIIENPILSNFKEGLSVLEYFISTHGA RKGLADTALKTADAGYLTRRLVDVSHDVIINEEDCGTLRGLVCTDLKNNDEVIATLYERI LGRVSVHDIIHPQTGELLVAGGEEITEDIAKKIQESPIESVEIRSVLTCESKKGVCAKCY GRNLATNHMVQKGEAVGVIAAQSIGEPGTQLTLRTFHAGGTAANIAANASIVAKNNARLE FEELRTVDIVDETGESAKVVVGRLAEVRFIDVNTGIVLSTHNVPYGSTLYVADGEVVEKG KLIAKWDPFNAVIITEATGKIEFEGVIENVTYKIESDEATGLREIIIIESKDKTKVPSAH ILTEDGDLIRTYNLPVGGHVVIENGQKVKAGEVIVKIPRAVGKAGDITGGLPRVTELFEA 
RNPSNPAVVSEIDGEVTMGKVKRGNREIIVTSKTGEVKKYLVPLSKQILVQENDYVRAGT PLSDGATTPADILAIKGPTAVQEYIVNEVQDVYRLQGVKINDKHFEIIVRQMMRKVTIDE PGDTRFLEQQVVDKLEFMEENDRIWGKKVVVDAGDSENLKAGQIVTARKLRDENSMLKRR DLKPVEVRDAVAATSTQILQGITRAALQTSSFMSAASFQETTKVLNEAAINGKIDKLEGM KENVICGHLIPAGTGLREFDKIIVGSKEEYDRILANKKTVLDYNEVE >gi|226332003|gb|ACIB01000053.1| GENE 87 94895 - 96031 646 378 aa, chain - ## HITS:1 COG:RSc3292 KEGG:ns NR:ns ## COG: RSc3292 COG3274 # Protein_GI_number: 17548009 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Ralstonia solanacearum # 9 331 1 289 336 79 25.0 8e-15 MELKKKENIGWIDLLRVLACFFVVFSHSCDAFIGQFDANRESFLTGVFLGSLMRPCVPIF VMMTGVLLLPVQTDMAAFYKKRIGRLIPPMIFWSLVLPVLYFIYLNYINPDTQNPLISMP DHSLEALWFKLYTFIFNFNFDTVPLWYLYMLIGLYLIMPIISGWLEKVSKKELKLFLGIW GISLIAPYVKMFAPALGYQGNYGNMGLWGVCDWNDYGTFYYFSGFIGYLLLAYYLTKYPL KWSWKKLLSITIPMFLTGYLITSYGYVITQNYFPGNYAYLEIVWYFAGINVFMMTFPVFV IVQKIKVPSNHRLSHMASLAFGIYLSHYVFVFIAYDLLDTELLPYTVRIICMACIVFLTC YAIVWLMSKSKLTNRFIR >gi|226332003|gb|ACIB01000053.1| GENE 88 96161 - 96670 165 169 aa, chain - ## HITS:1 COG:no KEGG:BF4189 NR:ns ## KEGG: BF4189 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 169 1 169 169 315 100.0 3e-85 MKNINYSIKKQFKISDEGICLLPMLLRALCRLTKKQLLLLEEIYSNYSSSTEGEIITQSH RTVNAKRIPTSFSYEGGITLDTETLHYSSPSEKFATTWKNILFHTISIRIIIGLHKSNME PSEWREIIHQNTYQKFHRKHRRRLPLSFRERYTQYKNKASDAFNQDAPK >gi|226332003|gb|ACIB01000053.1| GENE 89 96862 - 97107 287 81 aa, chain + ## HITS:1 COG:no KEGG:BF4188 NR:ns ## KEGG: BF4188 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 81 1 81 81 81 100.0 8e-15 MNFIWYILIGILAGYFAGKIMRGGGFGLLVNLLLGIIGGVLGGWVFALLGLAATGIIGSL ITSVVGAILFLWIASFFSRSR >gi|226332003|gb|ACIB01000053.1| GENE 90 97170 - 97430 479 86 aa, chain + ## HITS:1 COG:no KEGG:BF4187 NR:ns ## KEGG: BF4187 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 86 1 86 86 123 
100.0 2e-27 MGSGNAKFLVGLGIGSAIGALVYHFSRTAKAKKLKNDVFNALHEIEADAELAVVEAKDKA VKAGAKVAGKVADKATEVKEKLTPNS >gi|226332003|gb|ACIB01000053.1| GENE 91 97744 - 98055 493 103 aa, chain + ## HITS:1 COG:no KEGG:BF4186 NR:ns ## KEGG: BF4186 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 103 1 103 103 189 100.0 3e-47 MENQNPDNQLQIELKEEVAQGTYANLAIITHSSSEFVLDFVRVLPGLPKAGVQSRVILAP EHAKRLQRALEENIAKYERAFGPIRLQEDGVDTPPILDIKGEA >gi|226332003|gb|ACIB01000053.1| GENE 92 98303 - 98704 686 133 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|29348140|ref|NP_811643.1| 30S ribosomal protein S12 [Bacteroides thetaiotaomicron VPI-5482] # 1 133 1 133 133 268 100 8e-71 MPTIQQLVRKGREVLVEKSKSPALDSCPQRRGVCVRVYTTTPKKPNSAMRKVARVRLTNQ KEVNSYIPGEGHNLQEHSIVLVRGGRVKDLPGVRYHIVRGTLDTAGVAGRTQRRSKYGAK RPKPGQAAPAKKK >gi|226332003|gb|ACIB01000053.1| GENE 93 98864 - 99340 809 158 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715469|ref|YP_101461.1| 30S ribosomal protein S7 [Bacteroides fragilis YCH46] # 1 158 1 158 158 316 100 4e-85 MRKAKPKKRVILPDPVFNDQKVSKFVNHLMYDGKKNTSYEIFYAALETVKAKLPNEEKTA LEIWKKALDNVTPQVEVKSRRVGGATFQVPTEIRPDRKESISMKNLILFARKRGGKSMAD KLAAEIMDAFNEQGGAFKRKEDMHRMAEANRAFAHFRF >gi|226332003|gb|ACIB01000053.1| GENE 94 99394 - 101511 1970 705 aa, chain + ## HITS:1 COG:HP1195 KEGG:ns NR:ns ## COG: HP1195 COG0480 # Protein_GI_number: 15645809 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factors (GTPases) # Organism: Helicobacter pylori 26695 # 3 700 4 692 692 841 60.0 0 MAKNDLHLTRNIGIMAHIDAGKTTTSERILFYTGLTHKIGEVHDGAATMDWMEQEQERGI TITSAATTTRWKYAGDTYKINLIDTPGHVDFTAEVERSLRILDGAVAAYCAVGGVEPQSE TVWRQADKYNVPRIAYVNKMDRSGADFFEVVRQMKAVLGANPCPVVIPIGAEENFKGLVD LIKMKAIYWHDETMGADYTIEEIPANLVDEANEWRDKMLEKVAEFDDALMEKYFDDPSTI TEEEVLRALRNATVQMAVVPMLCGSSFKNKGVQTLLDYVCAFLPSPLDAENVVGTNPDTG AEEDRKPSEDDKTSALAFKIATDPYVGRLTFFRVYSGKIEAGSYIYNSRSGKKERVSRLF QMHSNKQNPVEVIGAGDIGAGVGFKDIHTGDTLCDETAPIVLESMDFPEPVIGIAVEPKT 
QKDMDKLSNGLAKLAEEDPTFTVKTDEQTGQTVISGMGELHLDIIIDRLKREFKVECNQG KPQVNYKEAITKTVNLREVYKKQSGGRGKFADIIVNIGPVDEDFTQGGLQFVDEVKGGNI PKEFIPSVQKGFQTAMKNGVLAGYPLDSLKVTLVDGSFHPVDSDQLSFEICAIQAYKNAC AKAGPVLMEPIMKLEVVTPEENMGDVIGDLNKRRGQVEGMESSRSGARIVKAMVPLAEMF GYVTALRTITSGRATSSMVYSHHAQVSSSIAKAVLEEVKGRADLL >gi|226332003|gb|ACIB01000053.1| GENE 95 101593 - 101898 494 101 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715467|ref|YP_101459.1| 30S ribosomal protein S10 [Bacteroides fragilis YCH46] # 1 101 1 101 101 194 100 1e-48 MSQKIRIKLKSYDHNLVDKSAEKIVRTVKATGAIVSGPIPLPTHKRIFTVNRSTFVNKKS REQFELSSFKRLIDIYSSTAKTVDALMKLELPSGVEVEIKV >gi|226332003|gb|ACIB01000053.1| GENE 96 101917 - 102534 1073 205 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715466|ref|YP_101458.1| 50S ribosomal protein L3 [Bacteroides fragilis YCH46] # 1 205 1 205 205 417 100 1e-115 MPGLLGKKIGMTSVFSADGKNVPCTVIEAGPCVVTQVKTVEKDGYAAVQLGFQDKKEKHT TKPLMGHFKKAGVTPKRHLAEFKEFENELNLGDTVTVELFDGADYVDVVGTSKGKGFQGV VKRHGFGGVGQSTHGQHNRARKPGSIGACSYPAKVFKGMRMGGQMGGDRVTVQNLQVLKV IAEHNLLLIKGSVPGCKGSIVLIEK >gi|226332003|gb|ACIB01000053.1| GENE 97 102534 - 103160 1047 208 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715465|ref|YP_101457.1| 50S ribosomal protein L4 [Bacteroides fragilis YCH46] # 1 208 1 208 208 407 100 1e-112 MEVNVYNIKGEDTGRKVTLNESIFGIEPNDHAIYLDVKQFMANQRQGTHKSKERSEISGS TRKIGRQKGGGGARRGDMNSPVLVGGGRVFGPKPRDYYFKLNKKVKTLARKSALSYKAQD NAIVVVEDFNFEAPKTKVFVEMTKNLKVSDKKLLVVLPEANKNVYLSARNIEGANVQTIS GLNTYRVLNAGVVVLTESSLKAIDNILI >gi|226332003|gb|ACIB01000053.1| GENE 98 103177 - 103467 479 96 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715464|ref|YP_101456.1| 50S ribosomal protein L23 [Bacteroides fragilis YCH46] # 1 96 1 96 96 189 100 8e-47 MGIIIKPLVTEKMTAITDKLNRFGFIVRPEANKLEIKSEVEALYNVTVVDVNTVKYAGKN KSRYTKAGIINGRTNAFKKAIVTLKEGDTIDFYSNI >gi|226332003|gb|ACIB01000053.1| GENE 99 103473 - 104297 1419 274 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715463|ref|YP_101455.1| 50S ribosomal protein L2 [Bacteroides fragilis YCH46] # 1 274 1 274 
274 551 100 1e-156 MAVRKFKPTTPGQRHKIIGTFEEITASVPEKSLVYGKKSSGGRNNEGKMTMRYLGGGHRK VIRIVDFKRNKDGVPAVVKTIEYDPNRSARIALLFYADGEKRYIIAPNGLQVGATLMSGE NAAPEIGNALPLQNIPVGTVIHNIELRPGQGAALVRSAGNFAQLTSREGKYCVIKLPSGE VRQILSTCKATIGSVGNSDHGLESSGKAGRSRWQGRRPRNRGVVMNPVDHPMGGGEGRAS GGHPRSRKGLYAKGLKTRAPKKQSSKYIIERRKK >gi|226332003|gb|ACIB01000053.1| GENE 100 104318 - 104587 465 89 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715462|ref|YP_101454.1| 30S ribosomal protein S19 [Bacteroides fragilis YCH46] # 1 89 1 89 89 183 100 3e-45 MSRSLKKGPYINVKLEKKVLAMNESGKKVVVKTWSRASMISPDFVGHTVAVHNGNKFIPV YVTENMVGHKLGEFAPTRTFRGHAGNKKK >gi|226332003|gb|ACIB01000053.1| GENE 101 104623 - 105033 679 136 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|167764367|ref|ZP_02436492.1| hypothetical protein BACSTE_02751 [Bacteroides stercoris ATCC 43183] # 1 136 1 136 136 266 100 5e-70 MGARKKISAEKRKEALKTMYFAKLQNVPTSPRKMRLVADMIRGMEVNRALGVLKFSSKEA AARVEKLLRSAIANWEQKNERKAESGELFVTKIFVDGGATLKRMRPAPQGRGYRIRKRSN HVTLFVGSKSNNEDQN >gi|226332003|gb|ACIB01000053.1| GENE 102 105039 - 105773 1256 244 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715460|ref|YP_101452.1| 30S ribosomal protein S3 [Bacteroides fragilis YCH46] # 1 244 1 244 244 488 100 1e-137 MGQKVNPISNRLGIIRGWDSNWYGGNDYGDSLLEDSKIRKYLNARLAKASVSRIVIERTL KLVTITVCTARPGIIIGKGGQEVDKLKEELKKVTDKDIQINIFEVKRPELDAVIVANNIA RQVEGKIAYRRAIKMAIANTMRMGAEGIKIQISGRLNGAEMARSEMYKEGRTPLHTFRAD IDYCHAEALTKVGLLGIKVWICRGEVFGKRELAPNFTQSKESGRGNNGGNNGGGKNFKRK KNNR >gi|226332003|gb|ACIB01000053.1| GENE 103 105797 - 106231 736 144 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715459|ref|YP_101451.1| 50S ribosomal protein L16 [Bacteroides fragilis YCH46] # 1 144 1 144 144 288 100 1e-76 MLQPKKTKFRRQQKGRAKGNAQRGNQLAFGSFGIKALETKWITGRQIEAARIAVTRYMQR QGQIWIRIFPDKPITRKPADVRMGKGKGSPEGFVAPVTPGRIIIEAEGVSYEIAKEALRL AAQKLPITTKFVVRRDYDIQNQNA >gi|226332003|gb|ACIB01000053.1| GENE 104 106237 - 106434 319 65 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715458|ref|YP_101450.1| 50S ribosomal protein L29 [Bacteroides 
fragilis YCH46] # 1 65 1 65 65 127 100 3e-28 MKIAEIKEMSTNDLVERVEAEVVNYNQMVINHSISPLENPAQIKQLRRTIARMRTELRQR ELNNK >gi|226332003|gb|ACIB01000053.1| GENE 105 106431 - 106700 456 89 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715457|ref|YP_101449.1| 30S ribosomal protein S17 [Bacteroides fragilis YCH46] # 1 89 1 89 89 180 100 4e-44 MISLMEARNLRKERTGVVLSNKMEKTITVAAKFKEKHPIYGKFVSKTKKYHAHDEKNECN VGDTVRIMETRPLSKTKRWRLVEIIERAK >gi|226332003|gb|ACIB01000053.1| GENE 106 106703 - 107068 596 121 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|29348126|ref|NP_811629.1| 50S ribosomal protein L14 [Bacteroides thetaiotaomicron VPI-5482] # 1 121 1 121 121 234 100 2e-60 MIQVESRLTVCDNSGAKEALCIRVLGGTGRRYASVGDVIVVSVKSVIPSSDVKKGAVSKA LIVRTKKEIRRPDGSYIRFDDNACVLLNNAGEIRGSRIFGPVARELRATNMKVVSLAPEV L >gi|226332003|gb|ACIB01000053.1| GENE 107 107089 - 107409 534 106 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715455|ref|YP_101447.1| 50S ribosomal protein L24 [Bacteroides fragilis YCH46] # 1 106 1 106 106 210 100 3e-53 MSKLHIKKGDTVYVNAGEDKGKTGRVLKVLVKEGRAIVEGINMVSKSTKPNAKNPQGGIV KQEAPIHISNLNPVDPKTGKATRVGRKVSSEGTLVRYSKKSGEEIK >gi|226332003|gb|ACIB01000053.1| GENE 108 107409 - 107966 910 185 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715454|ref|YP_101446.1| 50S ribosomal protein L5 [Bacteroides fragilis YCH46] # 1 185 1 185 185 355 99 8e-97 MSNTASLKKEYAERIVPALKSQFQYSSTMQIPVLKKIVINQGLGMAVADKKIIEVAINEM TAITGQKAVATISRKDIANFKLRKKMPIGVMVTLRRERMYEFLEKLVRVALPRIRDFKGI ETKFDGKGNYTLGIQEQIIFPEINIDSITRILGMNITFVTSAQTDEEGYALLKEFGLPFK NAKKD >gi|226332003|gb|ACIB01000053.1| GENE 109 107973 - 108272 495 99 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715453|ref|YP_101445.1| 30S ribosomal protein S14 [Bacteroides fragilis YCH46] # 1 99 1 99 99 195 98 1e-48 MAKESMKAREIKRAKLVAKYAEKRAALKQIVRTGGPAEAFEAAQKLQELPKNSNPIRMHN RCKLTGRPKGYIRQFGVSRIQFREMASNGLIPGVKKASW >gi|226332003|gb|ACIB01000053.1| GENE 110 108327 - 108722 664 131 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715452|ref|YP_101444.1| 30S ribosomal protein S8 
[Bacteroides fragilis YCH46] # 1 131 1 131 131 260 100 3e-68 MTDPIADYLTRLRNAINAKHRVVEVPASNLKKEITKILFEKGYILNYKFVEDGPQGTIKV ALKYDSVNKVNAIKKLERISSPGMRQYTGYKDMPRVINGLGIAIISTSKGVMTNKEAAEL KIGGEVLCYVY >gi|226332003|gb|ACIB01000053.1| GENE 111 108738 - 109307 969 189 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715451|ref|YP_101443.1| 50S ribosomal protein L6 [Bacteroides fragilis YCH46] # 1 189 1 189 189 377 100 1e-103 MSRIGKLPISIPAGVTVTLKDDVVTVKGPKGELSQYVNPAINVAIEDGHITLTENENAML DNPKQKHAFHGLYRSLVHNMVVGVSEGYKKELELVGVGYRASNQGNIIELALGYTHNIFI QLPPEVKVETKSERNKNPLILLESCDKQLLGQVCSKIRSFRKPEPYKGKGIKFVGEEIRR KSGKSAGAK >gi|226332003|gb|ACIB01000053.1| GENE 112 109329 - 109673 561 114 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715450|ref|YP_101442.1| 50S ribosomal protein L18 [Bacteroides fragilis YCH46] # 1 114 1 114 114 220 100 2e-56 MTTKIERRIKIKYRVRNKVSGTAARPRMSVFRSNKQIYVQIIDDLSGKTLAAASSLGMTE KLPKKEVAAKVGEIIAKKAQEAGITTVVFDRNGYLYHGRVKEVADAARNGGLKF >gi|226332003|gb|ACIB01000053.1| GENE 113 109679 - 110197 851 172 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715449|ref|YP_101441.1| 30S ribosomal protein S5 [Bacteroides fragilis YCH46] # 1 172 1 172 172 332 100 6e-90 MAGVNNRVKITNDIELKDRLVAINRVTKVTKGGRTFSFSAIVVVGNEEGIIGWGLGKAGE VTAAIAKGVESAKKNLTRVPVLKGTVPHEQSAKFGGAEVFIKPASHGTGVVAGGAMRAVL ESVGVTDVLAKSKGSSNPHNLVKATIMALGEMRDARMIAQNRGISVEKVFRG >gi|226332003|gb|ACIB01000053.1| GENE 114 110208 - 110384 281 58 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715448|ref|YP_101440.1| 50S ribosomal protein L30 [Bacteroides fragilis YCH46] # 1 58 1 58 58 112 100 7e-24 MSTIKIKQVKSRIGAPADQKRTLDALGLRKLNRVVEHESTPSILGMVDKVKHLVAIVK >gi|226332003|gb|ACIB01000053.1| GENE 115 110415 - 110861 736 148 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715447|ref|YP_101439.1| 50S ribosomal protein L15 [Bacteroides fragilis YCH46] # 1 148 1 148 148 288 100 1e-76 MNLSNLKPAEGSTKTRKRIGRGPGSGLGGTSTRGHKGAKSRSGYSKKIGFEGGQMPLQRR VPKFGFKNINRIEYKAINLETIQKLAEAKKLEKVGVNDFIEAGFISSSQLVKVLGNGTLT AKLSVEAHAFSKSAVAAIEAAGGNVVKL 
>gi|226332003|gb|ACIB01000053.1| GENE 116 110866 - 112212 891 448 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163796899|ref|ZP_02190856.1| 30S ribosomal protein S11 [alpha proteobacterium BAL199] # 13 442 19 447 447 347 42 1e-94 MRKAIETLKNIWKIEDLRQRILITILFVAIYRFGSYVVLPGINPGMLTQLHQQTSEGLLA LLNMFSGGAFSNASIFALGIMPYISASIVIQLLGIAVPYFQKLQREGESGRRKMNQYTRY LTIAILLVQAPSYLLNLKMQAGPSLNASLDWTLFMVTSTIILAAGSMFILWLGERITDKG IGNGISFIILIGIIARLPQSLFQELISRMTDKTGGLIMFLFEIVFLLIVIAGAILLVQGT RKIPVQYAKRIVGNKQYGGARQYIPLKVNAAGVMPIIFAQAIMFIPITFIGFSNTNNVSG FVHAFTDHTSFWYNFVFAVMIILFTYFYTAITINPTQMAEDMKRNNGFIPGIKPGKKTAE YIDDIMSRITLPGSFFLAFIAIMPAFAGIFGVKAEFAQFFGGTSLLILVGVVLDTLQQIE SHLLMRHYDGLLKSGRIKGRTGSSVAAY >gi|226332003|gb|ACIB01000053.1| GENE 117 112227 - 113024 680 265 aa, chain + ## HITS:1 COG:BH0156 KEGG:ns NR:ns ## COG: BH0156 COG0024 # Protein_GI_number: 15612719 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionine aminopeptidase # Organism: Bacillus halodurans # 1 251 1 246 248 264 50.0 1e-70 MIFLKTEDEIELLRQSNLLVGRTLAEVAKLVKPGVTTKELDKVAEEFIRDHGAVPTFKGF PNQYGDPFPASLCTSVNEQVVHGIPGDIVLKDGDIVSVDCGTYMNGFCGDSAYTFCVGEV DEEVRQLLKVTKEALYIGIQNAVQGKRIGDIGYAIQQYCESHSYGVVREFVGHGIGKDMH EDPQVPNYGKRGYGTLLKKGLCIAIEPMITQGDRQVIMERDGWTVRTRDRKCAAHFEHTI AVGAGEADILSSFKFIEEVLGDKAI >gi|226332003|gb|ACIB01000053.1| GENE 118 113028 - 113246 239 72 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15900168|ref|NP_344772.1| translation initiation factor IF-1 [Streptococcus pneumoniae TIGR4] # 1 72 1 72 72 96 61 5e-19 MAKQSAIEQDGVIVEALSNAMFRVELENGHEITAHISGKMRMHYIKILPGDKVRVEMSPY DLSKGRIVFRYK >gi|226332003|gb|ACIB01000053.1| GENE 119 113255 - 113371 198 38 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715443|ref|YP_101435.1| 50S ribosomal protein L36 [Bacteroides fragilis YCH46] # 1 38 1 38 38 80 100 3e-14 MKVRASLKKRTPECKIVRRNGRLYVINKKNPKYKQRQG >gi|226332003|gb|ACIB01000053.1| GENE 120 113405 - 113785 637 126 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715442|ref|YP_101434.1| 30S ribosomal 
protein S13 [Bacteroides fragilis YCH46] # 1 126 1 126 126 249 100 4e-65 MAIRIVGVDLPQNKRGEIALTYVYGIGRSSSAKILDKAGVDKDLKVKDWTDDQAAKIREI IGAEYKVEGDLRSEVQLNIKRLMDIGCYRGVRHRIGLPVRGQSTKNNARTRKGRKKTVAN KKKATK >gi|226332003|gb|ACIB01000053.1| GENE 121 113797 - 114186 665 129 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|29348112|ref|NP_811615.1| 30S ribosomal protein S11 [Bacteroides thetaiotaomicron VPI-5482] # 1 129 1 129 129 260 100 2e-68 MAKKTVAAKKRNVKVDANGQLHVHSSFNNIIVSLANSEGQIISWSSAGKMGFRGSKKNTP YAAQMAAQDCAKIAFDLGLRKVKAYVKGPGNGRESAIRTIHGAGIEVTEIIDVTPLPHNG CRPPKRRRV >gi|226332003|gb|ACIB01000053.1| GENE 122 114125 - 114277 124 50 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNSICCSRDYREEINAIQKQNQVSFLLKVNLKLYVFSEDDNRYAEAELRQ >gi|226332003|gb|ACIB01000053.1| GENE 123 114306 - 114911 1027 201 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715440|ref|YP_101432.1| 30S ribosomal protein S4 [Bacteroides fragilis YCH46] # 1 201 1 201 201 400 99 1e-110 MARYTGPKSRIARKFGEGIFGADKVLSKKNYPPGQHGNSRKRKTSEYGIQLREKQKAKYT YGVLEKQFRNLFEKAATAKGITGEVLLQMLEGRLDNIVFRLGIAPTRAAARQLVGHKHIT VDGQVVNIPSYVVKPGQLIGVRERSKSLEVIANSLAGFNHSKYAWLEWDEASKVGKLLHI PERADIPENIKEHLIVELYSK >gi|226332003|gb|ACIB01000053.1| GENE 124 114923 - 115915 1033 330 aa, chain + ## HITS:1 COG:AGc3518 KEGG:ns NR:ns ## COG: AGc3518 COG0202 # Protein_GI_number: 15889218 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, alpha subunit/40 kD subunit # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 8 325 11 320 336 234 40.0 3e-61 MAILAFQKPDKVLMLEADAKFGKFEFRPLEPGFGITVGNALRRILLSSLEGFAITTIRIE GVEHEFSSVPGVKEDVTNIILNLKQVRFKQVVEEFESEKVSITIENSSEFKAGDIGKYLT GFEVLNPELVICHLDSKATMQIDITINKGRGYVPADENREYCTDVNVIPIDSIYTPIRNV KYAVENFRVEQKTDYEKLVLEITTDGSIHPKEALKEAAKILIYHFMLFSDEKITLESNDV DGNEEFDEEVLHMRQLLKTKLVDMDLSVRALNCLKAADVETLGDLVQFNKTDLLKFRNFG KKSLTELDDLLESLNLSFGTDISKYKLDKE >gi|226332003|gb|ACIB01000053.1| GENE 125 115919 - 116404 807 161 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|53715438|ref|YP_101430.1| 50S ribosomal 
protein L17 [Bacteroides fragilis YCH46] # 1 161 1 161 161 315 100 7e-85 MRHNKKFNHLGRTASHRSAMLSNMACSLIKHKRITTTVAKAKALKKFVEPLITKSKEDTT NSRRVVFSNLQDKLAVTELFKEISVKIADRPGGYTRIIKTGNRLGDNAEMCFIELVDYNE NMAKEKVAKKATRTRRSKKTTEAAPAAEVPATEEPKAESAE >gi|226332003|gb|ACIB01000053.1| GENE 126 116573 - 116950 331 125 aa, chain + ## HITS:1 COG:no KEGG:BF3974 NR:ns ## KEGG: BF3974 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 125 11 135 135 236 99.0 2e-61 MLFAVALHSVANDYFAEKQAEQDIYMAMSTMKGDTHETVSSPQTPYFPDAELAGTGIQTH QIAMSRIQRIQAAESIFSLKALAQRLADRDAVLSQHWGKLYETTTSYCCHPVSEYYVFAL RRIIV >gi|226332003|gb|ACIB01000053.1| GENE 127 117059 - 117598 585 179 aa, chain + ## HITS:1 COG:no KEGG:BF4152 NR:ns ## KEGG: BF4152 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 179 1 179 179 327 100.0 1e-88 MTTQAIDATIFASSHPDIAKRTSVSGLIFSCIMLLAGVIAFVSTFEMEDRSSTISMGLMV LGTALFLIGVFRLFWKSKEIVYLPTGSVAKEQSIFFDLKHLDELTDMVKSGDFSMQSTAK GGTSGNLRLDVMLSEDRKFAAVQLFQFVPYTYNPVTSVRYFTNGEAASIAAFLTKTKGH >gi|226332003|gb|ACIB01000053.1| GENE 128 117630 - 119444 1263 604 aa, chain - ## HITS:1 COG:CAC3034 KEGG:ns NR:ns ## COG: CAC3034 COG0249 # Protein_GI_number: 15896285 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS family) # Organism: Clostridium acetobutylicum # 9 601 5 597 598 266 30.0 9e-71 MEQLNLIIDTYQRIILESESKLAKVKQHIYRIGTLRLILFAAGIAGIIYFWDEGRLVIGG IAAITFIPFILLVKLHNRLFHQKDYLEKKIEINRQELQALTYDTSAFDNGEEFINPSHLY TYDLDVFGEHSLFQYINRTATQPGKKRLAEWMNMHLKSKAEIEKRQEAVRELAPELEMRQ HFRVLGLLHKGKTADEEEIRNWASSPEYYRKKWYFRTLAILIPTANAVCIGLAIAGIISF TTWGIVFASIGLFSSSFSKGISRMQSVYGKKMLILTTYARLIHIIEEKKMRCSALKEIKE LVGGEKQTASQAVKRLTELMNALDQRNNMLMQFVLNGLFFWELRQVMKIEAWKENYAAHL PDWLEAIGEMDAYCSLACFAYNHPGYVFPEIASKPFCVEAEALGHPLMNRNKCVRNDIRI AKRPFFIIITGANMAGKSTYLRTIGVSYLLACIGAPVWAEKMKLYPAQLVTSLRTTDSLA DNESYFFAELKRLKLIIDKLNSGEELFIILDEILKGTNSMDKQKGSFALIKQFMALQANG 
IIATHDLLLGSLINLFPKDIHNYCFEADITNNELTFSYKLRDGIAQNMNACFLMKKMGIA VIDD >gi|226332003|gb|ACIB01000053.1| GENE 129 120005 - 120637 618 210 aa, chain + ## HITS:1 COG:no KEGG:BF3971 NR:ns ## KEGG: BF3971 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 210 1 210 210 420 99.0 1e-116 MKKLCFLLLMSWVVMDVTAQNVVYSNLKGLLACEGDTVASLKVEKRTKNHILMTGGADYK ISASPDDSMCKYLKSRCYAVQADTSLYVNCKRLRYKKFRFGGWYAPALRIGDHIYFSAIP LGSVAAGSDATMDVMLGGQFGDAIAASALISKRVYYEIDPETNKVGFVGKERMGELLGGH PDWKAAYLNENSESAKVTDKYLRLLKAEEK >gi|226332003|gb|ACIB01000053.1| GENE 130 120835 - 122820 1828 661 aa, chain + ## HITS:1 COG:SPy2082 KEGG:ns NR:ns ## COG: SPy2082 COG2987 # Protein_GI_number: 15675840 # Func_class: E Amino acid transport and metabolism # Function: Urocanate hydratase # Organism: Streptococcus pyogenes M1 GAS # 1 661 13 673 676 1011 69.0 0 MKMTLSNQLPEYPVFAEGIRRAPDRGYTLSPAQTVTALKNALRYIPVELHRKLAPEFLEE LRTRGRIYGYRFRPAGDLKAKPVDEYQGNCIEGKAFQVMIDNNLCFDIALYPYELVTYGE TGQVCQNWMQYRLIKQYLELLTREQTLVIESGHPLGLFHSRPDAPRVIITNSMMVGMFDN QHDWHEAAQMGVANYGQMTAGGWMYIGPQGIVHGTFNTLLNAGRLKLGIPQDKNLSGHLF VSSGLGGMSGAQPKAAEIAGAASIIAEVDRSRIETRYKQGWVEHVTTDLHTAFRMALSAA ERHESCSVAYHGNVVDLLEYAVQEDIPIELLSDQTSCHAVYEGGYCPAGVTFEERTRLLH ESPEAFRRLVDESLHRHFAVIKKLVSRGTYFFDYGNSFMKAVYDAGVKEISRNGTDEKDG FIWPSYVEDIMGPELFDYGYGPFRWVCLSGNPEDLARTDRAAMECIDVKRRGQDLDNYNW IRDAGKNRLVVGTQARILYQDAVGRLKIALRFNQMVRDGEVGPIMLGRDHHDVSGTDSPF RETSNIKDGSNVMADMAVQCFAGNCARGMSLVALHNGGGVGVGKAINGGFGMVCDGSLRV DEILRSSMLWDVMGGVARRSWARNEHAMETSEAFNLSHGDAYHITLPYLADEELIKRIVA E >gi|226332003|gb|ACIB01000053.1| GENE 131 122889 - 123791 937 300 aa, chain + ## HITS:1 COG:SPy2083 KEGG:ns NR:ns ## COG: SPy2083 COG3643 # Protein_GI_number: 15675841 # Func_class: E Amino acid transport and metabolism # Function: Glutamate formiminotransferase # Organism: Streptococcus pyogenes M1 GAS # 5 299 3 298 299 341 53.0 1e-93 MNWNKIVECVPNFSEGRDLEKIDRIVAPFRARSGVKLLDYSNDEDHNRLVVTLIGEPEAL 
RDAVIEAIGVAVELIDLNHHRGQHPRMGAVDVVPFIPIKNVTMDEAVSLSREIGEKVAGL YHLPVFLYEKSATAPHRENLAAVRKGEFEGMAEKMKLPEWHPDYGPAGCHPTAGVVAIGA RMPLVAYNINLSTDNLEIATKIAKNIRHINGGLRYVKAMGVELKERNITQVSINMTDYTR TALYRAFELVRIEARRYGVIILGSEIVGLVPMEALIDTASYYLGLENFSMRQVLESRIME >gi|226332003|gb|ACIB01000053.1| GENE 132 123798 - 125051 1259 417 aa, chain + ## HITS:1 COG:BH1984 KEGG:ns NR:ns ## COG: BH1984 COG1228 # Protein_GI_number: 15614547 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Imidazolonepropionase and related amidohydrolases # Organism: Bacillus halodurans # 21 411 24 409 426 343 46.0 3e-94 MSENLIIFNAKVVTPLGFSARKGKEMAELRILEKATVEVVDGIITYVGPNRGEVRDGYYH HFWHYNARGKCLLPGFVDSHTHFVFGGERAEEFSWRLKGESYMSIMQRGGGIASTVQATR ELSFIHLRSKAEGLLKKMTAMGITTVEGKSGYGLNRETELLQLKVMRSLNKDEGVRVDIV PTFLGAHALPDEYKERPDDYIDFLIRELLPVIQRDSLAEFCDVFCEEGVFSIEQSRRLLT AAGDYGFLPKLHADEIVPLGGAELAAELGAVSADHLLHASDAGIEAMARKGVVATLLPLT AFALKEPYARGRDMIDAGCAVALATDLNPGSCFSGSIPLTFALACIHMQLTVEEAITALT LNGAAALNRADSIGSIEVGKKGDFVVLDSDNYHILPYYVGMNCVNTTIKGGMLYPSV >gi|226332003|gb|ACIB01000053.1| GENE 133 125086 - 125715 739 209 aa, chain + ## HITS:1 COG:FN0739 KEGG:ns NR:ns ## COG: FN0739 COG3404 # Protein_GI_number: 19704074 # Func_class: E Amino acid transport and metabolism # Function: Methenyl tetrahydrofolate cyclohydrolase # Organism: Fusobacterium nucleatum # 2 207 3 212 212 133 38.0 2e-31 MLAELTVKEFLDKVAGSDPVPGGGSVAALNGAVASALTAMVAGLTIGKKGYEEHEELMKH ISRLSIRQQELFVEYIDRDSEAYDHVFGCFKLPKSTDEEKAARSAAIQEATRFAALVPMQ VARNACELMEIIADVARLGNQNAITDACVAMMAARSAVLGALLNVRINLGSLKDKTFVDE LKREADHLEQLACMREKELLEVVNQELNQ >gi|226332003|gb|ACIB01000053.1| GENE 134 125712 - 127208 1097 498 aa, chain + ## HITS:1 COG:FN1406 KEGG:ns NR:ns ## COG: FN1406 COG2986 # Protein_GI_number: 19704738 # Func_class: E Amino acid transport and metabolism # Function: Histidine ammonia-lyase # Organism: Fusobacterium nucleatum # 1 487 1 490 511 404 44.0 1e-112 MNNVYYVGSGELTFSIIERIINENLKLELAPEAEQRIRKCRDYLDRKIAASTEPLYGITT 
GFGSLCSKNISSDELNTLQENLVKSHACSVGEELRPVIIKLMLLLKAHALSLGHSGVQVI TVQRILDFFNNDVMPIVYDRGSLGASGDLAPLANLFLPLIGVGDVYYKGKKREAISVLDE FGWEPVRLMSKEGLALLNGTQFMSANGVFALLKARRLSKKADMIAALSLEAFDGRIDPFM ECIQQIRPHPGQIETGEIFRRLLHGSELIARTKEHVQDPYSFRCIPQVHGATKDAIRYVS SVLLTEINSVTDNPTIFPDEDQIISGGNFHGQPLALSFDFLAIAMAELGNISERRVAQLI MGLRGLPEFLVANPGLNSGFMIPQYAAASMVSQNKMYCYAASSDSIVSSNGQEDHVSMGA NAATKLFRIMDNLEHILAIELMNAAQGIEFRRPAKTSPILERYLAAYRKEVPFVKDDIVM YKEIHKTVAFLNRTRSEY >gi|226332003|gb|ACIB01000053.1| GENE 135 127519 - 128163 619 214 aa, chain + ## HITS:1 COG:no KEGG:BF3965 NR:ns ## KEGG: BF3965 # Name: not_defined # Def: putative TetR transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 214 1 214 214 386 100.0 1e-106 MIENAKENSQRVELKDRIIETASEAFTTHGIKSITMDDIAASLGISKRTLYEVFQDKESL LTQCILKRQEEMNAFLAETLANSKNVLEVILVCYQRSIETFHRTNKRFFEDIKKYPKVHS LLKNYRERDSDSTIEFFKMGIKQGIFRDDVNFAIVNLLVHEQLDLLMNTDICNKYSFLEV YEAIMFTYIRGISTEKGAKVLEDFITEYRKQRIH >gi|226332003|gb|ACIB01000053.1| GENE 136 128214 - 129566 1524 450 aa, chain + ## HITS:1 COG:aq_1332 KEGG:ns NR:ns ## COG: aq_1332 COG1538 # Protein_GI_number: 15606535 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Aquifex aeolicus # 36 442 21 413 415 77 22.0 6e-14 MNKMKCFLSKKMLLAVVVLLSVGYVQAQEAKDIVLLTLDKALEIALSENPTMKVAGQEIQ LKKEAKKEAYGGLFPEVSLTGSYSRTLKKQTMVMDFGGESQTIQVGSDNSYNGGLNVSLP VFAPTLYKSINLTKTDVELAVEKARSSKLDLVNQVTKAYYQLLLAQDSYKVLLQSYAQAE ANYEVVKAKYEQGTVSEYDKIRADVQVRSLKPSVVSAGNGVNLARLQLQVLMGMDTEVEI AADGNLKDYEMVMFRRQMESNQLNLNNNSDLKQLDLNADLLKKTLAVQRTNFMPTLAASF NYSYTSLNNDFKMSHYKWFPYSTVGLSLSIPLFKASNFTKVKQTKIQMQQLAENRTNVYR QLTMQATSYLDNMAASTEQVVSNKEGIVQAEKGRLIAQKRYEVGKGTILELNDSEVALTQ AQLTYNQSIYDYLVAKADLDQVLGREEVTE >gi|226332003|gb|ACIB01000053.1| GENE 137 129595 - 130611 1140 338 aa, chain + ## HITS:1 COG:VC0165 KEGG:ns NR:ns ## COG: VC0165 COG0845 # Protein_GI_number: 15640195 # Func_class: M Cell 
wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 1 336 1 353 368 119 27.0 8e-27 MKRCFQLVALLAVVLLGSCTGGKDKSAAEKTEEKPKVKLADVTARPVEQIQEYTATVEAE VKNNIAPSSPVRIDKIFVEVGDHVSKGQKLVQMDAANLKQTKLQLDNQEVEFNRIDELYK VGGASKSEWDAAKMAYDVKKTAYQNLLENTSLLSPISGVVTARNYDSGDMYSGGNPVLTV EKITPVKLLINVSEVYFTKVKKGAPVNVKLDVYGDEAFEGKISLIYPTIDPSTRTFQVEI QLPNQNQKVRPGMFARASLNFGTEENVVVPDLAIVKQAGAGDRYVYVYKDGKVTYNKVEL GRRMGTEYELKSGVPNNSQVVIAGQTRLINGTEVEVEK >gi|226332003|gb|ACIB01000053.1| GENE 138 130712 - 133870 3412 1052 aa, chain + ## HITS:1 COG:BH3816 KEGG:ns NR:ns ## COG: BH3816 COG0841 # Protein_GI_number: 15616378 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Bacillus halodurans # 7 1020 5 1005 1093 493 32.0 1e-138 MSLYEGAVKKPIMTSLCFLAVVIFGLFSLSKLPIDLYPDIDTNTIMVMTAYPGASASDIE NNVTRPLENTLNAVSNLKHITSRSSENMSLITLEFEYGNDIDVLTNDVRDKLDMVSSQLP DDVENPIIFKFSTDMIPIVLLSVQANESQSALYKILDDRVVNPLARIPGVGTVSISGAPQ REIQVYCDPGKLEAYNLTIETISSIIGAENKNIPGGNFDIGNETYSLRVEGEFDDSRQLE DVVVGSYNGANVYLRDVARVVDTVEERAQETYNNGVKGAMIVVQKQSGANSVDISKKVAE ALPKLQKNLPSDVKLGVIVDTSDNILNTIDSLAETVLYALLFVVIVVFLFLGRWRATLII CITIPLSLIASFIYLAVTGNTINIISLSSLSIAIGMVVDDAIVVLENVTTHIERGSDPKQ AAVHGTNEVAISVIASTLTMIAVFFPLTMVSGMSGVMFKQLGWMMCAIMFISTVAALSLT PMLCSQLLRLQKKQSRTFKLLFGPIEKGLDALDTGYARMLNWAVRHRPIVIFGCIVFFVV SLFCAKSIGTEFFPAQDNARIAVQLELPIGTRKELAQEVSEKLTNQWLNKYKGVMTVCNY TVGQADSDNTWASMQDNGSHIISFNISLVDPGDRDISLEQVCDEMREDLKKYPEFSKAQV ILGGSNTGMSAQASADFEVYGYSMEETDSVAARLKRELLNVKGVSEVNISRSDYQPEYQV DFDREKLALHGLNLATAGNYLRNRINGAIASKYREDGDEYDIKVRYAPEYRTSIESIENI LIYNSRGEAVRVKELGKVVERFAPPTIERKDRERIVTVSAVISGEALGNVVNAGNAIIDK MDIPSDVTIQVAGSYEDQQDSFRDLGTLAILIVVLVFIVMAAQFESLTYPFIIMFSLPFA FSGVLMALFFTKSTLSVMSLLGAIMLIGIVVKNGIVLIDYITLCRERGLAVLNSVVTAGK SRLRPVLMTTATTVLGMIPMAIGGGQGSEMWSPMAIAVIGGLTVSTILTLVLIPTLYCVF AGTGIKHQRKVLRRKRELEAYFEEHKDEMIKK >gi|226332003|gb|ACIB01000053.1| GENE 139 133889 - 134182 380 97 aa, chain + ## HITS:1 COG:no KEGG:BF4139 NR:ns ## KEGG: BF4139 # Name: 
not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 97 1 97 97 203 100.0 2e-51 MKSVLITFDQAYYERIIALLDRLGCRGFTYLERVQGRGSKTGDPHFGSHAWPSMCSAIIT VVDDKKVDPLLDALHRMDMQTEQLGLRAFVTNVERSI >gi|226332003|gb|ACIB01000053.1| GENE 140 134299 - 135672 1020 457 aa, chain + ## HITS:1 COG:no KEGG:BF3960 NR:ns ## KEGG: BF3960 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 457 24 480 480 961 100.0 0 MKNYLKYSLWLTLIVLFALIGMHWLPAITIDGHTMRRVDLLSDVRMPEPDKDEVVADSLP PVPVVKPAFVDTCRSGMTCIEDYSDSTMRGMTPFYRALDEIQSKGRLVRIAVFGDSFIEA DIFTADLREMLQKRFGGCGVGFVTITSMTSGYRPTVRHSFGGWSSHAVTDSVYFDKKKQG ISGHYFVPRERAYVELRGQNKYASLLDTCQIASIFFYNKGEVNLSVCVNRGEAEARDFST TGRLQQMKVNGRIGSVRWDINRADSTLFYGVAMDGTQGVVVDNFSLRGSSGLSLRSIPSK MLQEFNAQRPYDLIILEYGLNVATERGRNYDPYKKGLLTAINHLKECFPQAGFLLLSVGD RDYKTDTGELRTMPGVKNLIRYQQNIAAESGIAFWNMFEAMGGEGSMAELVHAKPSLANY DYTHINFRGGRHLAGLLYETLIYGKEQYERRKAYEAE >gi|226332003|gb|ACIB01000053.1| GENE 141 135641 - 136555 596 304 aa, chain + ## HITS:1 COG:no KEGG:BF4137 NR:ns ## KEGG: BF4137 # Name: not_defined # Def: putative periplasmic protein # Organism: B.fragilis # Pathway: not_defined # 1 304 6 309 309 613 99.0 1e-174 MKGERPMRLNRWLGVLILLLSGVWSVRAQDLLPACPQVDKGTKACKPMREPGSLGDTVSV KIVFPVAFKGVGRNEVVDSLGILAPVLEHLRLVQNGSSEDTVRIVHIGDSHIRGHIFPRT TGARLTETFGAISYTDMGVNGATCLTFTHPDRIAAIAALKPELLILSFGTNESHNRKYNS NVHYRQMEELLELLHDSLPDVPILMTTPPGSYESFRQRRRRRTYAINPRTVTAVNTIHDF ARRHKLVVWDMYNVVGGSLRACKNWSDAQLMRPDHVHYLPEGYTLQGELLYEAIIKAYNE YVSH >gi|226332003|gb|ACIB01000053.1| GENE 142 136542 - 138029 834 495 aa, chain + ## HITS:1 COG:PA3548 KEGG:ns NR:ns ## COG: PA3548 COG1696 # Protein_GI_number: 15598744 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted membrane protein involved in D-alanine export # Organism: Pseudomonas aeruginosa # 51 418 28 390 520 251 41.0 2e-66 MFPIDIDFSRLLEAFKYNPDAPMIFSSGIFLWLFAAFMVIYTLLQHCNTARILFVALFSY 
YFYYKSSGTYFFLLAIVTVTDFVIAWLMDRTDVRWKRKFCVVLSVSVNLGLLCYFKYTNF LGGVIASLMGGEFTALDIFLPVGISFFTFQSLSYTIDVYRKEIKPLTSLLDYAFYVSFFP QLVAGPIVRARDFIPQIRKPLYVSQEMFGRGIFLIVAGLFKKAVISDYISINFVERIFDN PTLYSGVENLMGLYGYALQIYCDFSGYSDMAIGIALLLGFHFNLNFNSPYKSASITEFWR RWHISLSSWLRDYLYISLGGNRHGKFRQYLNLIITMFLGGLWHGASWNFVLWGTFHGVAL ALHKAWMSIIGRKKGETSHGIRRVLGVIITFHFVCFCWIFFRNADFHNSMDMLNQIFTAF RPQLFPQLIEGYWRVFALMAVGFLLHFAPDSWENAVCRGVIKLPFLGKAIVMVAMIYLVI QMKSSEIQPFIYFQF Prediction of potential genes in microbial genomes Time: Wed May 18 00:10:06 2011 Seq name: gi|226332002|gb|ACIB01000054.1| Bacteroides sp. 3_2_5 cont1.54, whole genome shotgun sequence Length of sequence - 11424 bp Number of predicted genes - 9, with homology - 9 Number of transcription units - 3, operones - 2 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 63 - 2417 1789 ## COG3537 Putative alpha-1,2-mannosidase 2 1 Op 2 . + CDS 2434 - 3297 827 ## COG3568 Metal-dependent hydrolase 3 1 Op 3 . + CDS 3344 - 4486 1026 ## COG1785 Alkaline phosphatase 4 1 Op 4 . + CDS 4505 - 6109 1605 ## COG1626 Neutral trehalase 5 1 Op 5 . + CDS 6122 - 7573 821 ## COG3538 Uncharacterized conserved protein 6 1 Op 6 . + CDS 7657 - 9108 1028 ## COG1409 Predicted phosphohydrolases + Term 9153 - 9190 1.7 7 2 Tu 1 . - CDS 9122 - 10297 639 ## COG1609 Transcriptional regulators - Prom 10384 - 10443 4.2 - Term 10543 - 10578 1.0 8 3 Op 1 . - CDS 10611 - 11279 504 ## PROTEIN SUPPORTED gi|149175515|ref|ZP_01854136.1| ribosomal protein S1-like RNA-binding domain 9 3 Op 2 . 
- CDS 11248 - 11409 99 ## gi|265763136|ref|ZP_06091704.1| predicted protein Predicted protein(s) >gi|226332002|gb|ACIB01000054.1| GENE 1 63 - 2417 1789 784 aa, chain + ## HITS:1 COG:XF0842 KEGG:ns NR:ns ## COG: XF0842 COG3537 # Protein_GI_number: 15837444 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Xylella fastidiosa 9a5c # 2 549 224 781 790 412 39.0 1e-114 MRRTRGWTDNQYVYFAAQFSEPFQTVEFVQDKKIVSAETKQVGTDLQAILTFADKDGEPI IAKVGLSLVSVDNARKNLAEEVKDFNFDAVCAAARNDWEQALSSITVEGGGTDDLKNFYT AIYHAMVVPNVVSDVNGEYRRHNMQIGQLPKGKMQYSTFSLWDTFRAWNPLMTLIDTALV NNMVNSYLDIYDASGELPIWPLSAAETGTMIGYHSVSVIADAYLKGIRGFDAEKALDAMK VSSEKNKKGADYYIKYGFIPSNIKKESISCLLEFAYDDWCIARMAQEMGKEDVYRKYIER SQNYINVFDGSTKFFRGKRMDGNWETSFNPFEVGRSYTEATAWQYRFSVPYDVNGMVQLF GGKEKFITALDSIFIADPNVHGDLADITGLIGQYAHGNEPSHHIAYLYDYVGQPWKTQEM TRHLLDEMYQPTPGGISGNEDCGQMSAWYILSGLGIYSVCPGSNEFALTTPLFEKAVLKL ANGKRLTLLANDPKKNIYIHKVELNGKQIDTNFITYAQLMEGGELRFTLSDKPDKSRGIS EEASPYSYTKEKVVSIPYVDRDLNLFMDKVTVALATTTEGAELRYTLDGTEPTEKSLLYD KPFKVDMTTQIKAKGFKEGFRPSRTLSITATKAELKASLPVHPSRNGTSYKYFEGTYQKV ADIEKTPLLEVGVLPEPSIKEAKQKDHFGYIFSGLINVPEDGVYIFQTRSDDGSVLYIGN ELVVNNDGSHAAIPATGYIALEKGFHPYILYYFEDYEGEHLSWFWKLPSAKELAPIPTSA LFVK >gi|226332002|gb|ACIB01000054.1| GENE 2 2434 - 3297 827 287 aa, chain + ## HITS:1 COG:lin0348 KEGG:ns NR:ns ## COG: lin0348 COG3568 # Protein_GI_number: 16799425 # Func_class: R General function prediction only # Function: Metal-dependent hydrolase # Organism: Listeria innocua # 30 287 3 255 257 173 37.0 4e-43 MEKIILLVLPFFAASCGLVKQQASAPEPVNVMSFNIRYDNPEDSLDNWRYRKDRVANAIH FYDVDILGTQEVLHNQLEDLKLRLPEYGVVGVGREDGKEKGEYSALWYKKDRFNVLDSGY FWLSETPEVAGSKGWDGACERIASWVKLQDKVSDKEYFALNTHLDHVGGMARREGISLML DRVNELSDGLPVIVTGDFNSEPESDVIKHVTDSANPEHLTDARQASSIVYGPSWSFHDFG KIPYNKRPLIDYVFVRNGLKVLRYGILAETENNGFLSDHTPVLVTVE >gi|226332002|gb|ACIB01000054.1| GENE 3 3344 - 4486 1026 380 aa, chain + ## HITS:1 COG:TM0156 KEGG:ns NR:ns ## COG: TM0156 COG1785 # Protein_GI_number: 15642930 # 
Func_class: P Inorganic ion transport and metabolism # Function: Alkaline phosphatase # Organism: Thermotoga maritima # 52 331 20 310 434 176 38.0 5e-44 MKNILRNFVFIVWAVALLPVNVSAQNRRDKEQTYVLEQPYEVTKITPSQGKKIKNVILMI GDGMSLMHVYSAWTANRGKLFLDNCQAVGLSKTYCADKLITDSGAGGTAIASGQKTNYHY VGVDTLGHPLKSLVDFAAAKGKSTGIAVTCRLWDATPADFCCHNKDRDAESEIVTDYVNC NADYVFGGGAKLFENREDGRDLFKELREKGFRTPRSWDELAGIKSGKVFAVPYPVDTPLP AERGDLLARASLKGIDLLNQNKNGFFMMIEGSQLDDYGHFNDLDLLMQETHDFDRTIGAI YEWAAKDGETLVVVTADHETGGLTLVDGDLKEGKIVCKFSTGGHSGVMVPVYAFGPGAQE FTGIYENTAIFDKIKKLLDL >gi|226332002|gb|ACIB01000054.1| GENE 4 4505 - 6109 1605 534 aa, chain + ## HITS:1 COG:TP0931 KEGG:ns NR:ns ## COG: TP0931 COG1626 # Protein_GI_number: 15639916 # Func_class: G Carbohydrate transport and metabolism # Function: Neutral trehalase # Organism: Treponema pallidum # 124 531 62 468 476 139 26.0 2e-32 MLKHILFTCFFFFTAIPLLKAQGCGNDEKYHLPYKNTYVKEPLVAENEYRIAKPETVEPK SFEEARQILPNPIWAGHEKELEMYWRAWEIAVGNIRAPQQGSGFVSSYLDTAYNGNIFMW DSSFILMFARYGTRFFPFQRTLDNFYAKQHPDGFICREIKADGADCFERYDPVSTGPNLM PWCEMVYYYQFGDTERLHKIFPVLCAYYKWLKLNRTWRNGTYWSSGWGTGMDNMPRVPEG YSPIYSHGHMIWLDTNLQQLFTANLLLEMGFYLERWQEIEEFEDEAKMLGKYIHDNLWDE KTGFLYDQYADGTLCKTKGIGAYWTLLTDVLDDKQLDRMVKELDNPATFNRKFRIPSLSA DHPKYKENGRYWQGGIWPGTNYMVMQGLVKKGYHKLAREIALNHYAEVLEVYKNTGTFWE YYSPEKAEPGFMARKEFVGWTGLPPIAELIEFIIGIRGDYVNQQIIWDMNLTETNGIERY PFGSEGIINLKAEARRSANDEPRIAVDTNIGFELLVLYGGKEKKVNVTPGKHTY >gi|226332002|gb|ACIB01000054.1| GENE 5 6122 - 7573 821 483 aa, chain + ## HITS:1 COG:XF0843 KEGG:ns NR:ns ## COG: XF0843 COG3538 # Protein_GI_number: 15837445 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Xylella fastidiosa 9a5c # 1 479 19 505 516 475 48.0 1e-133 MKRNMSRRHFLKTGGLALAAMAMCPPLSFASSEMPVQKYISLRPPVGKRHFVSKAVEATI EQTRPKIKDEKLRWMFENCFPNTLDTTVRYKMKAGRPDTFVITGDIDAMWLRDSSAQVWP YLPLMKDDKELQLLIAGLINRQAECIRIDPYANAFNDGPLGSYWETDHTQHMVKELHERK WEIDSLCYPIRLAYHYWLLTKDISAFDADWHETMKLVVQTFKEQQRKQGLGPYSFTRDCD 
RPTDSQINNGWGAPVKPVGLIVSSFRPSDDATQYGFLIPSNMFAVVSLRQLAEIEREVYD NLPFAEECTALADEVDAAIRRYGTFNHPVCGRIYAFEVDGFGNALCMDDANVPCLLAAPY LGYCSFKDAVYRNTRKMIWSENNPYFFKGKAGEGVGGPHVGLNYIWPMSIIMKAFTTDAP EEIRSCLKQLRDTDGGTGFMHESFNSENAADFTRSWFAWTNTLFGELILKIIREYPGLLS QAL >gi|226332002|gb|ACIB01000054.1| GENE 6 7657 - 9108 1028 483 aa, chain + ## HITS:1 COG:CAC1961 KEGG:ns NR:ns ## COG: CAC1961 COG1409 # Protein_GI_number: 15895233 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # Organism: Clostridium acetobutylicum # 7 306 18 316 324 149 32.0 9e-36 MKKLIIFLFVLSGCVPAVVFAQQQFSFKDGKFKIAQFTDLHWTPRSLACTETEATICAVL KAEHPDIAILSGDVVTEDPAIDGWKSVIRIFDEAKVPFVVTMGNHDAEHMAKDDIYDLLL ESPYYAGAKGPEGIMGCGNCVIPVYGSRNREKVEALLYCMDSNDYQPDKLYGPYDWIHFD QIAWYRKQSARFTKENNGNPVPALAFFHIPLLEYNEIAGDGKTFGNNREGEVASANINSG MFASFIDMKDVMGVFAGHDHDNDYLGINKGIVLGYGRVTGADAYGELTRGARIIELYEGK FRFDTWITTPSGREATYYYPSGLNSEEERTADYLPAVKNVSSPKQGVAYTYYEGKCKRVA GIASCLKVKEGVMKNISIKEAAVADHFAYDFHTLIQIPEKGIYRFYTFSDDGSMLYIDGK LVVDNDGGHSARRAEGKIALEKGFHELHLLYFEDYMGQELEVGFSGLDFPEVPLQDEMLF LPN >gi|226332002|gb|ACIB01000054.1| GENE 7 9122 - 10297 639 391 aa, chain - ## HITS:1 COG:HI1106_1 KEGG:ns NR:ns ## COG: HI1106_1 COG1609 # Protein_GI_number: 16273032 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Haemophilus influenzae # 60 261 58 265 265 108 32.0 2e-23 MIRLILLTDFTESFSYNLLKGVLAYSKKHEPWVVCRMPPSYKLTYGIEGVLKWAKAWQAD AIIGRFDNDDNVELFRKNGIIAIAQDYESRFSNIPNITGDYHKTGRMAAEFFLSKGFRNF AFYGYRDTVWSQERCEGFYECIAEHGFGNNFYSYQEQSLDDSWFYEAPPLLTWLKSLPQP TALMACDDNQGNRITEICKVNNIRVPDKIAILGVDNDEIICNLSDPPLSSISQNIVRGGF EAAELIEHLLNDEECSYQDVVLQPVNIVNRLSTDFYSTTNTHIHTALKYIHRNLANDITV SDIVKQVPLSRRLLEIRFKEVTKQSIHKYILNLRIERFAQLLLASDAPIADVAEQVGINN LKNLSRQFKTLKNVSPYEYRKEHRMMSNDNY >gi|226332002|gb|ACIB01000054.1| GENE 8 10611 - 11279 504 222 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149175515|ref|ZP_01854136.1| ribosomal protein S1-like RNA-binding domain [Planctomyces maris DSM 8797] # 7 195 306 496 828 198 
51 1e-50 MPPSVKEQADDEAIRVFAENLRQLLLAPPLGQKRVMGIDPGFRTGCKVVCLDAQGNLVHN ENIYPHPPVDKKTEAASKLRKMIEAYKIEAIAIGNGTASRETENFVTHQQFDRPVQVFVV SEQGASIYSASKTARDEFPDYDVTVRGAVSIARRLMDPLAELVKIDPKPIGVGQYQHDVD QTKLKKSLDQTVENCGMSEITKGSVIKKRKLAIFLRHYSANG >gi|226332002|gb|ACIB01000054.1| GENE 9 11248 - 11409 99 53 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|265763136|ref|ZP_06091704.1| ## NR: gi|265763136|ref|ZP_06091704.1| predicted protein [Bacteroides sp. 2_1_16] # 1 53 1 53 53 70 98.0 3e-11 MQMDTEVSEDERARNAVRNLFARQAVISADAYRRLLKSSIETEFASLSKRTGR Prediction of potential genes in microbial genomes Time: Wed May 18 00:10:27 2011 Seq name: gi|226332001|gb|ACIB01000055.1| Bacteroides sp. 3_2_5 cont1.55, whole genome shotgun sequence Length of sequence - 66972 bp Number of predicted genes - 57, with homology - 54 Number of transcription units - 25, operones - 9 average op.length - 4.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 249 - 283 -0.8 1 1 Op 1 . - CDS 361 - 1209 779 ## BF1826 hypothetical protein 2 1 Op 2 . - CDS 1155 - 1544 266 ## BF1763 putative outer membrane protein 3 1 Op 3 . - CDS 1541 - 3670 1664 ## BF1827 hypothetical protein 4 1 Op 4 . - CDS 3681 - 4286 430 ## BF1765 hypothetical protein 5 1 Op 5 . - CDS 4352 - 5221 344 ## BF1829 putative transmembrane protein - Prom 5307 - 5366 5.1 + Prom 5161 - 5220 6.4 6 2 Tu 1 . + CDS 5333 - 5473 89 ## + Term 5541 - 5587 1.9 7 3 Tu 1 . - CDS 5431 - 5664 134 ## BF1768 hypothetical protein - Prom 5716 - 5775 7.5 8 4 Tu 1 . + CDS 6035 - 6214 126 ## + Prom 6229 - 6288 2.6 9 5 Tu 1 . + CDS 6319 - 7794 1142 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen + Term 7828 - 7880 2.8 + Prom 8572 - 8631 4.8 10 6 Tu 1 . + CDS 8817 - 8999 65 ## BF1844 hypothetical protein + Prom 9220 - 9279 3.4 11 7 Tu 1 . + CDS 9303 - 10517 1119 ## BF1845 putative integrase/transposase + Prom 10935 - 10994 5.6 12 8 Op 1 . 
+ CDS 11140 - 11706 537 ## BF1847 hypothetical protein 13 8 Op 2 . + CDS 11720 - 12832 712 ## BF1848 hypothetical protein 14 8 Op 3 . + CDS 12846 - 13973 885 ## BF1849 hypothetical protein + Prom 14122 - 14181 3.8 15 9 Tu 1 . + CDS 14212 - 15861 1784 ## BF1850 hypothetical protein + Term 15885 - 15923 7.4 16 10 Tu 1 . - CDS 15862 - 16050 80 ## 17 11 Op 1 . + CDS 15956 - 16879 927 ## BF1851 hypothetical protein 18 11 Op 2 . + CDS 16906 - 18471 1304 ## BF1852 ATP/GTP-binding chaperonin 19 11 Op 3 . + CDS 18496 - 19614 648 ## BF1853 hypothetical protein 20 11 Op 4 . + CDS 19642 - 20622 611 ## BF1854 hypothetical protein 21 11 Op 5 . + CDS 20667 - 21620 718 ## BF1855 hypothetical protein 22 11 Op 6 . + CDS 21647 - 22582 856 ## BF1856 hypothetical protein 23 11 Op 7 . + CDS 22595 - 23596 871 ## BF1857 hypothetical protein 24 11 Op 8 . + CDS 23609 - 24583 745 ## BF1858 hypothetical protein 25 11 Op 9 . + CDS 24608 - 25609 590 ## BF1859 hypothetical protein 26 11 Op 10 . + CDS 25623 - 26684 653 ## BF1860 hypothetical protein 27 11 Op 11 . + CDS 26700 - 27626 766 ## BF1861 hypothetical protein 28 11 Op 12 . + CDS 27645 - 29216 1070 ## BF1797 hypothetical protein 29 11 Op 13 . + CDS 29260 - 30330 642 ## BF1798 hypothetical protein 30 11 Op 14 . + CDS 30327 - 31283 686 ## BF1799 hypothetical protein 31 11 Op 15 . + CDS 31267 - 32856 997 ## BF1800 hypothetical protein 32 11 Op 16 . + CDS 32869 - 33816 418 ## BF1866 hypothetical protein + Prom 33825 - 33884 1.7 33 11 Op 17 . + CDS 33911 - 35095 1138 ## BF1867 hypothetical protein + Prom 35138 - 35197 6.3 34 12 Op 1 . + CDS 35232 - 37232 808 ## BF1803 hypothetical protein 35 12 Op 2 . + CDS 37238 - 37606 273 ## BF1804 hypothetical protein 36 12 Op 3 . + CDS 37648 - 38631 691 ## BF1805 hypothetical protein 37 13 Tu 1 . - CDS 38765 - 40888 183 ## PROTEIN SUPPORTED gi|84496588|ref|ZP_00995442.1| 30S ribosomal protein S1 - Prom 40981 - 41040 6.0 + Prom 40905 - 40964 2.8 38 14 Tu 1 . 
+ CDS 41006 - 42919 1503 ## COG0642 Signal transduction histidine kinase 39 15 Op 1 . + CDS 43036 - 44775 1760 ## COG0737 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 40 15 Op 2 . + CDS 44782 - 45255 326 ## PROTEIN SUPPORTED gi|15902812|ref|NP_358362.1| hypothetical protein spr0768 41 15 Op 3 . + CDS 45293 - 45631 223 ## BF1810 hypothetical protein + Term 45776 - 45810 0.6 + Prom 45634 - 45693 3.4 42 16 Op 1 . + CDS 45854 - 48127 2288 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 43 16 Op 2 . + CDS 48187 - 48948 698 ## COG1402 Uncharacterized protein, putative amidase + Term 48963 - 49019 18.3 - Term 49049 - 49079 0.0 44 17 Tu 1 . - CDS 49121 - 49702 568 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases + TRNA 49980 - 50066 59.3 # Ser GCT 0 0 + TRNA 50145 - 50219 57.4 # Glu CTC 0 0 + Prom 49991 - 50050 80.4 45 18 Tu 1 . + CDS 50275 - 51636 894 ## COG0534 Na+-driven multidrug efflux pump + Term 51758 - 51801 2.3 + Prom 52071 - 52130 6.3 46 19 Tu 1 . + CDS 52282 - 52728 452 ## BF1816 hypothetical protein + Term 52755 - 52803 11.4 + Prom 52730 - 52789 2.4 47 20 Op 1 . + CDS 52840 - 54192 1204 ## COG0534 Na+-driven multidrug efflux pump + Term 54214 - 54262 -0.1 + Prom 54195 - 54254 2.5 48 20 Op 2 . + CDS 54278 - 55939 1470 ## COG2985 Predicted permease 49 20 Op 3 . + CDS 55988 - 57982 2020 ## COG3855 Uncharacterized protein conserved in bacteria + Term 58003 - 58067 8.1 - Term 57989 - 58056 9.6 50 21 Tu 1 . - CDS 58074 - 59231 1144 ## BF1820 hypothetical protein - Prom 59274 - 59333 2.7 - Term 59243 - 59306 14.9 51 22 Op 1 . - CDS 59340 - 60998 1642 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) - Prom 61024 - 61083 4.8 52 22 Op 2 14/0.000 - CDS 61192 - 62262 993 ## COG0451 Nucleoside-diphosphate-sugar epimerases 53 22 Op 3 . 
- CDS 62267 - 63340 1212 ## COG1089 GDP-D-mannose dehydratase - Prom 63509 - 63568 10.2 + Prom 63475 - 63534 8.4 54 23 Tu 1 . + CDS 63554 - 64825 801 ## COG1373 Predicted ATPase (AAA+ superfamily) + Term 64924 - 64983 11.8 - Term 64916 - 64964 5.1 55 24 Op 1 . - CDS 65119 - 65922 566 ## BF1890 hypothetical protein 56 24 Op 2 . - CDS 65972 - 66319 437 ## BF1826 hypothetical protein - Prom 66357 - 66416 4.6 - Term 66398 - 66446 1.2 57 25 Tu 1 . - CDS 66460 - 66798 232 ## BF1827 hypothetical protein - Prom 66881 - 66940 12.7 Predicted protein(s) >gi|226332001|gb|ACIB01000055.1| GENE 1 361 - 1209 779 282 aa, chain - ## HITS:1 COG:no KEGG:BF1826 NR:ns ## KEGG: BF1826 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 5 282 111 388 388 543 100.0 1e-153 METGKQGNTNMHLAGANVDYLINISAFAARYNPKRVFEVIGALGLSYQATIVKDQKTIHS YGLRAGLQGKLNVSSAFNLFIEPQLALYPDRVDNQSSWKRYNLAGSLMAGITYKPAGYAT LTFLKGGFASLAAGTGNTGNVLFDTEFALGKWFDKFNGMRISAGSSTAFLDNEDSGSNRD FNISLNIDYLCSLTRLFSDRDSHVFNLIVAGGIGSYFPGAESSSSIILNGRIGLQGEIRL SAHSGLWLEPRINIFKDRSYRADLQEPIRGTVGLMVGTTYKF >gi|226332001|gb|ACIB01000055.1| GENE 2 1155 - 1544 266 129 aa, chain - ## HITS:1 COG:no KEGG:BF1763 NR:ns ## KEGG: BF1763 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis # Pathway: not_defined # 1 122 1 122 391 237 94.0 1e-61 MKNMGKSYKRILLLGIGFILCGTLAAQEQKQDSTEGKNNIKASEYLMPKRKGAEKFHSKR GSEHLFFSVGSGIGYLFNVGGGQTAHGPRASFMAGNWLTPVIGLRAGGEYTQWKQGNREI RICIWQEPM >gi|226332001|gb|ACIB01000055.1| GENE 3 1541 - 3670 1664 709 aa, chain - ## HITS:1 COG:no KEGG:BF1827 NR:ns ## KEGG: BF1827 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 709 1 709 709 1305 99.0 0 MNIKRFLLLGIMALYAIIPAWGQAQKVEIRGSVIDDEGEPAISIVIRDQNEKGDVYGITD LDGKFKIMADPNTTLHFSGFAYASKTVKLKGKTTINVVISYEASMIDEVVITAKKVVDKL LPEPTDIEIVGNQYIIHPKVKIPKEMYKPNTRIVVQPMLVNITRKTQSLFRPAVVTGKEY AITLERMMEFDLSRDPLAAFQEKTQKIDKNEVIAYVDSLYMDNPDDECRCDIYMYLVEYK 
KLAYKDTVVIAKGTVNPMRFFTYQADGMKIRDEKYIPKPQKQQRGDRGEVKLNFLINSAT IDEKDPNNQRELEKMRLRLQEIETDPNSEFLSFSVKGVSSPEGPYQSNLKLAQKRTDSTL KRIFGFLNGGTINAIKDSTYTEGVVASWEEVAELMERDSLPTDKLREIINCYPDNMASQY SRILRLPEYRNVILTTYLPRLRRVEYSFNYSVMRLLNDEEIRIMYKQDYKKLVPYEFWRI YLDADNDSTREVICRQALEQYPKFMIMANELAALLIEQKKADSKLLEPFVSRSAPTELLC NQVIALMDERAYNRADSIIDFLPDNDMTQDVRAIVGAYNGHFEDAYERFGTQGGINEVVL LMAMKQNEEAWEKAQELPDEPLSYYLRAACANRLDKVSEAYAFIKRALNEDPSLKEIAQI DGDVTDLLQQLEDEKKELKEKAEKTKEKNETEDTETEESGLNEEKTIKQ >gi|226332001|gb|ACIB01000055.1| GENE 4 3681 - 4286 430 201 aa, chain - ## HITS:1 COG:no KEGG:BF1765 NR:ns ## KEGG: BF1765 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 201 1 201 201 350 99.0 2e-95 MNTFNLKLDFPNLLWEIAGYNFPSLFGKRAILFIFIAISFQVSAQRMAIKTNTLEWLAAS PNLGVEFPLNDWMTAEISASANPWKITDKLFYRHGRVQAEAKYWLRNLLARHYIGITGFY SMFDVGINRRAYYGDAAAAGVTYGYNWILSRRWNLEVSGGVGVARYRLVRYQPGSTHGEP NESGWAPIPVKLSVSFIYIAK >gi|226332001|gb|ACIB01000055.1| GENE 5 4352 - 5221 344 289 aa, chain - ## HITS:1 COG:no KEGG:BF1829 NR:ns ## KEGG: BF1829 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 289 1 289 289 514 100.0 1e-144 MNSKYLSLPKTKKYIQKKYLQFKEIRNPKRQTIYFVYYGTVGCFSIGIIADLCVYLIRKD LLLALCNILSLGLFLLFTYLLIRKKKQITFLLKCTFYTIQSNILISMYCRIYLPPEETGF FLSQDLMIGMVTCGLASISVSRHTVMILSFAPILLYMFIGVYTSSELYLMSLPSLAVAYI FPPIMLARLQEILRTMQRQKARMTSELKLWAAFNALHLQPSSKEIQLCCLILENKTTEEI AALQYIATSTVRSNRSRLRQKLQLNQETDLQTFLSELIKKDFDYLESPD >gi|226332001|gb|ACIB01000055.1| GENE 6 5333 - 5473 89 46 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKIEGGRVKILKKCTEFVIRVIKAHHIPALSDDQSLNLCVKSNIKT >gi|226332001|gb|ACIB01000055.1| GENE 7 5431 - 5664 134 77 aa, chain - ## HITS:1 COG:no KEGG:BF1768 NR:ns ## KEGG: BF1768 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 77 205 281 281 142 94.0 5e-33 MFADAFRIFAELSWADIYNFEGLDYKEYTNKQKDSFAIFRSKKIFKFRITQKYRCSGKVV NGVFHVLMFDLTHKLSD 
>gi|226332001|gb|ACIB01000055.1| GENE 8 6035 - 6214 126 59 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MRSTFKVLFYLKRNAPKKNGLVPVMYHFSIYDNNTKLYSREQSRKEACAIDIRFLLNKK >gi|226332001|gb|ACIB01000055.1| GENE 9 6319 - 7794 1142 491 aa, chain + ## HITS:1 COG:MA2369 KEGG:ns NR:ns ## COG: MA2369 COG2865 # Protein_GI_number: 20091201 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Methanosarcina acetivorans str.C2A # 2 486 4 501 510 266 35.0 1e-70 MNIKEILNQSEGRRLEFKAELPEHSDLAKTVVAFANDAGGDLYIGVADDPREVVGLDEDK LVTIEEKISNIIFDRCYPAILPEIKFISEENKHLIQVTVFRGSTPPYYLKEKGKLQGTFI RVGSANRLADEAIISELERRRRNISFDSEVIPDKPVNDLNIDGFKAIFKEKTGEELSDQA LRKLDLVKDMQGAEYPTNALILFSDDPLRNSLFHYAKVECARFKGVSIDDFIDQKSITTN IATQAEEAYNFVLRHINKGASVEGVYTVSRWEYPVKAIREAIRYAVVHRDYSLTGKDVKI AIYDDMVEITSPGLLPPSIDYAAMESRQSDARNKVIAPVFKRLGIIDQWGNGLKLIADEM KEYPNIELRWREVGLSFQVQFVRLDYVLNAERIKDIQQELQQELQQELQQELRKATLYSE VLRCIVSNALSRQDISLALGQKKVSGQLNKVIQKLIANNLIERTIPEKPNHPAQKFRLTE RGQLFLGLLAK >gi|226332001|gb|ACIB01000055.1| GENE 10 8817 - 8999 65 60 aa, chain + ## HITS:1 COG:no KEGG:BF1844 NR:ns ## KEGG: BF1844 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 35 27 61 134 74 100.0 9e-13 MYSYIPKQAQVRIRFMSFYNVYTFLICHMGREMSFSPCPGCSETRLCDIPWFPCPTGEAG >gi|226332001|gb|ACIB01000055.1| GENE 11 9303 - 10517 1119 404 aa, chain + ## HITS:1 COG:no KEGG:BF1845 NR:ns ## KEGG: BF1845 # Name: not_defined # Def: putative integrase/transposase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 404 1 404 404 752 99.0 0 MTSITPRLNRSREGRDGSYPLVIQIIRHRKKREIYTPYRFWEAEFNTRLEMVENVGGNRR RLLIVREANEYLIYIKKELEAICGSLEADKGSAYTVDDIVNVYNYHNDLSQVLVYADSVI AGLENKGRQGTAANYRSARRAFEMFLDGSPFSFEELTPEVLDRFVTFLRERGNRPNTVSF YLRQWRAIYNRACADHVVFSDQKPFRRLNLKEEVTSKRAISREKIAQIECVDLTACHADM QLARDLFLFSFYTRGMSFVDMCYLNKENLQGNYLRYKRQKTGQELQIRIEKDLRVLIDRY 
ASSLSDYLLPMLRNGDRYQDYRRRQRRLNKLIRELGDRLQLDMPLTFYVARHSWATLAHE NDVPVSVISDCMGHTSEKTTRIYLDRIDTKRLDRANRLVINSLR >gi|226332001|gb|ACIB01000055.1| GENE 12 11140 - 11706 537 188 aa, chain + ## HITS:1 COG:no KEGG:BF1847 NR:ns ## KEGG: BF1847 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 188 1 188 188 367 100.0 1e-100 MKKILCIWVLTVTFLGAFPALADAQQWGLTANGLYWATATPNIGVEYAFHSKMSIAGLVQ YNPFTYAKNRKMKHLAGQLEYRYWLSDVFKGHYLGVHATGGIFNFGNLPLGILKDYRLEG QLYGGGLTYGYQWIISNRVNIGVDIGLGYLYVDYDKFYCPTCGERVDHYRTNYLGPTKVG VSIIYLLK >gi|226332001|gb|ACIB01000055.1| GENE 13 11720 - 12832 712 370 aa, chain + ## HITS:1 COG:no KEGG:BF1848 NR:ns ## KEGG: BF1848 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 370 1 370 370 741 99.0 0 MNIKQVIFINISFFISLTVLAADRPTQPVRTEVYRLERLDSVLLVDLAVDLTGVHLAPDC TVYLFPLLASENTGDSLSLPPIVLNGPQSDLMYRRRRALGTTSGLEKITPYTVLREGDHA LPRIHYRTEVPYAAWMDDVKVWMRDTNCNCDARLVPFAMHTEHIPPLVVERVDTIVIHDT IRLASVASGQSTVASDIPLRKKVTRIQAGYEADIYFPTNEMRILPDHELNRASWMHFVNQ VDSIEQDNRNSISGVTVTGYSSPEGYTSNNERLAEKRAKALQAFLENKYGERMEVAVEWV GEDWKQFEKDIEVSDLPERNEILSILRTVSDSNQRKSRLKALNKGKTFEILLREYFPKLR RVSCRIRYVK >gi|226332001|gb|ACIB01000055.1| GENE 14 12846 - 13973 885 375 aa, chain + ## HITS:1 COG:no KEGG:BF1849 NR:ns ## KEGG: BF1849 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 375 4 378 378 734 99.0 0 MYYIGYMLCLLLAGCVVGEKADGLLEQRLSDRTLLVYMGGDNDLADETDEKLSALTEAWD RFPGHLLIYQDKKGADSTRLLEVCLDEQGEKVTKILAKYKQENSAGASVFARVVNEAMAR YPSVDPGLIVFSHTSGWLPSGTAVVPAGITRSVIKDNHYEMSLQDFASAIPDGQFNFILF EGCFMAGLEVAYELKDKTQYIVGSSAEMLSPGFTPVYQQMFPLLYKKEADLPAVAAAYYD YYNSMEGDNRSATISVIQTSGLEMLKVQLRAAESRVERWEWIDCSGLQAFDRLSDGRHLF YDASAYIKRIGSVEESAAFDEALEQVIIYKAATENFMPESVGGFTIDGHCGMTLYIPDAT VPTLLTERKKLKLMQ >gi|226332001|gb|ACIB01000055.1| GENE 15 14212 - 15861 1784 549 aa, chain + ## HITS:1 COG:no KEGG:BF1850 NR:ns ## KEGG: BF1850 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 549 1 549 549 892 87.0 0 MKLNKLFTFTLAALAMAACSNDDEPGIDKGGQKGELIDAISIAFTSSSAPATRADKGEIE GTGSENDVYVAYLFAKENDPLHEGAKVGDWTVKRVAGDANAEDKDVTTAITGGDVATPGT KKNMCTFNGVRQGDSVYVVVNDPQMTLATAQTLAHQGDKSEAAIRAYISNLSKSYLNDLT VAIDGKQKGKYIMAGVSAIPTNPNIPNGSTVKVSIPLNRELAKVFFNASVTTNPVYEAYG KMAIEDTEWKPDGTTEDPDGIVVVRIPRRVSPFKAQARDWYFPQSADATAKDWNVENWLK AFAGEKESAPTVAEVAGTTPALNKGECNADAKEYRLTWVVGEKALADGGTPKETSIVYVK SDKLYSPYFYVTPNYADNAGCATVVVTQATYIGANTLLEPTITEEMLDKALQNDAFKTAT STDGTTKYDKLAADFWNTEKNVDALVAFLNTDEAYKLALRGETEIAKQRAAITIQKNDKR YYRADVANYSDDETTSMKITERNTFYHITGTITTLGAKSIEDAINSDNIDMLVQVVVKPW KYVVNNINM >gi|226332001|gb|ACIB01000055.1| GENE 16 15862 - 16050 80 62 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MCIYPAQASISSLIHEFSSRVTPKSIGSNFFILVSYSSLSDKKEISTGADFPSAPDYIKK RF >gi|226332001|gb|ACIB01000055.1| GENE 17 15956 - 16879 927 307 aa, chain + ## HITS:1 COG:no KEGG:BF1851 NR:ns ## KEGG: BF1851 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 307 11 317 317 641 100.0 0 MKKLLPILLGVTLLLNSCIKDDMDACAGYMHIYFSYIYGGANRFFETVSTPTQLHFYKQK HKYRELEIAVDEIGLTEPYRFLKNFDDTDSLELIAWTQDEAIDYVDTPDTPIGEGYVKLK EITDGSGICRPVDDLLYGRIAIDAGLRENRNIVTIPFVRAVCRTRITMIPQTVEDQVVEE GGTTRAASSIIPSPEDFKFHLYGTRSGVDYNNKANADEVVLEPQCYYEESTGNVLTPWFG SFSSEGKYLKVNVFIRDEQVASFDCAPIELTSVPGNFIDLVIDGHYVKPVMQVRVNGWKV ATIESNM >gi|226332001|gb|ACIB01000055.1| GENE 18 16906 - 18471 1304 521 aa, chain + ## HITS:1 COG:no KEGG:BF1852 NR:ns ## KEGG: BF1852 # Name: not_defined # Def: ATP/GTP-binding chaperonin # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 521 8 528 528 1018 99.0 0 MKRILICRLMVFLTVALVSCSDDTSESIPQNPDKDMYLRVNVPRTYASGADETDSKETVI NGIDVLVFAPGTASQSGLFLKSVSEGTPVTGKNTFQVTMPVGEGLEVHVFTNCHEELVRS GAYKGLGMKMDALLEKMISKVENNSTGTDCLPMHGFLSGVTVAKESVGKTLSVPVLRSVA AVQAKVDEAHTGNPGELLDSEGKVIFKLREFYAYFPADSGRLAPLKEVYETAVAGSEEEK KTRNVVKTTLPDKLGVRPIGDKLFLKSETPVALAGPLYLYENTYYSDNGFDQPGSVAGNT 
QVATTRLVVGGVYGEDTEVSYYRIDLTDPDDPKNLVEILRNHKYTFQIMNVSGSGYDNPD DAAVGVPMNIYVKVIDWIDVNTEIDFDRENWFSSATKKIVLSGYADSQKSVSIDSDVTFG PFWQLSFNTSSTVNGNASVVPVTVAEGAASAMISNDRYEITVTGNTLTVKAKKSYGDLPA GQAYDDDFYIKVKNLTVHFKLTQVDRSPDDWGNGGTQEDEL >gi|226332001|gb|ACIB01000055.1| GENE 19 18496 - 19614 648 372 aa, chain + ## HITS:1 COG:no KEGG:BF1853 NR:ns ## KEGG: BF1853 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 372 1 373 373 734 97.0 0 MKRLKKNMRQGCMLLSVFCLIGIIACSGEEGHDIQIPDPAICTLNYTIPDTEIPDTRASD TGGKISVARPEESAIKDVKLLFFIYDEHGNGLYAGSLDGTVDGNHLAKTGKISVTIPGSG SIDNHTDYDVLVLANASVYFSGMDWDAYCAGQTENAVRVRLRGEMPLHSPTDRTYKVTDD CLPMSGIAFKEAGKDMSVQLLRAAVRIDVRVVEDKKGTVLLKSAVLRNVSPDIPVFNDPQ DAAFTPLTYANTKSAQEQFSIIGGLYATEVLRTLHTPYLKQTQAVCLLLECEKSPSFSGW YRVNINIDKDDVQYLRRNNVYTVIIKDILSQGADTPDDAYGAGSFGGIQTVTVPTDWKVP DGIVTPPDVEVN >gi|226332001|gb|ACIB01000055.1| GENE 20 19642 - 20622 611 326 aa, chain + ## HITS:1 COG:no KEGG:BF1854 NR:ns ## KEGG: BF1854 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 326 1 326 326 635 98.0 0 MKIKKCKIDCFQKSCILLCMALFSCSQYDCVVPDNGKTGEGERASLGIGGVSTGRLEVLT RSVTAELTDATDAIGIFRKEDTGKGYEPVHNRKYTYGTPLWETDGEEVVLIDEPAELTAY YPYREDGASSVGLSSELYVASKEIYYCPFKASNSTGPITLNLRRAYALLRFNFIRGIADG TPTSAKGEYTGDGKISSFTFKATLRIAGILDLFTGTVEGEAREVTFNYNVPISIGTTAAP AVLDYMVVPSDFTGELSFTLMVDGKEMKGKISASDLCGTSGKLAEGTKYEINAIIRPTEV EIGTVEVEEWIGEVVKDPSGDPFVPQ >gi|226332001|gb|ACIB01000055.1| GENE 21 20667 - 21620 718 317 aa, chain + ## HITS:1 COG:no KEGG:BF1855 NR:ns ## KEGG: BF1855 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 317 1 314 314 590 99.0 1e-167 MKQMLTNIGMGILLTACGNGGMDPVDTGGEEEYVPVRVETVTDNGIGTRAVTLSNAAIGI FRTAPYSATFPQQYDVECIHDGTSWSLASKIYVGSTDASLHAYYPYGKVTFGANSTVTAL TVKDHTADNDFCYADAPLTAVNNRTPVASFLLKHAYAGLKFSIACHSTYPLKKCKLTRIV MQPVTSGKTFYVERSMDISKSAADAGQLGGSTASGWSLDTSALAVGTTGIAQGSTDESIS 
KLFPPQDFEDDTRLILTIDGSEYSVTIPYKVLKALKAGYLSTISLEIKGTGVDVSGVKVY PWDTSVVPGGDNDATLD >gi|226332001|gb|ACIB01000055.1| GENE 22 21647 - 22582 856 311 aa, chain + ## HITS:1 COG:no KEGG:BF1856 NR:ns ## KEGG: BF1856 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 311 1 311 311 589 99.0 1e-167 MKTKSDMKPFIIAALSLTLLVGCGNDDFPAEEPKVPLEITSAFVTGDVQTRAVTPELLTS GSIGVFLEGRSGTGYDKKDNIQYDYNVGWKPKTETIYLGGEDADVCAYKKTTKLDLSAKE AVPLVSQIYDAEIDLVYATNQTVNGTSARKSVEFTMGHAYSQIEFVFSREDYPNTCKVTE ITVKNANIIKTATLNLATGGYTPVTVEKVSYWTNVAKHEDGIAVPESGTVKSNVLMIPCT LANAGGTGVTLELTVDGKLMTVPIAHDKLGALTAGVIHQISLKLKGTALEISVKDTPWDN QPVDGEYNPEP >gi|226332001|gb|ACIB01000055.1| GENE 23 22595 - 23596 871 333 aa, chain + ## HITS:1 COG:no KEGG:BF1857 NR:ns ## KEGG: BF1857 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 333 1 333 333 644 99.0 0 MKRIYTHNPGFRLAGCLWLMCLLMVAWSSCSQAEVADSIEGGNGEAPLEVRAVTGAEVDT RAPAAVEMTEGTLGIFRLAEADYTALDNIRYDRVPGGNWHPVSTVIYVGGTYARLCAYAP YNSVEFKNSSLKTEAGLTMQTYAVEKDMRFAVSGGDEVWKKTPTANFELKRAYARLVLSV VRDATYPNTCKITQAKIKASTGNIITANTVDISTGTEGSGTQTPQYTHTVTTGLKDGFEI GVTDDTSFDWLLPQQTFSGGVILTLTVDGMDYSVTIPVDKLSTFVRGTKYTVSLVVKGGK LTLMSDKILIDRDWTEVQTGTGGSGSDYDTSFN >gi|226332001|gb|ACIB01000055.1| GENE 24 23609 - 24583 745 324 aa, chain + ## HITS:1 COG:no KEGG:BF1858 NR:ns ## KEGG: BF1858 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 324 4 327 327 609 97.0 1e-173 MKHLLLILVGGILLNACSDDREMDDGRWKASALRVPLEVSSAVIKQEVVTRLAPDPVPLT EGAIGIFLSGTEPESGQEDSGYKVIDNRKYVYSEGHWGPPTANDTIYLVGNDADVCAYYP YKDSYTDKTAIPLQSQDYVETEDIYYALNTMINGFTPAITFEMVHAYSLVELKISRDHYF MPCEISKITLKNSNLIKKGTINIAVDGSIHSSETGNYDLTTVTDASPHTLSVGESYVCCV LMIPVPLKIERTDAEGGEFGLSVSLVIDGQQMLVEIPYSELGEFGQGEKYVIGLKIKGTE IVPTVKALEWEDEEVNGGNKYPVE >gi|226332001|gb|ACIB01000055.1| GENE 25 24608 - 25609 590 333 aa, chain + ## HITS:1 COG:no KEGG:BF1859 NR:ns ## KEGG: BF1859 # Name: not_defined # 
Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 333 1 333 333 648 99.0 0 MKGVLYISFLVCILLGCSDSDGLPDGEKDKTARTELEIVSASIVEGNTQTRADVPFTPGS SIALFCHGQAAETSYTPVNNRKYTLNSGDSKWTAESEATTIYLGSEKADVCGYYPYVEGT VGTADTLYNVRTTIPMDTQDYDPSKELYYVANQEVDAKRYFVEMHFKHAYSRLVLNIGVE DAYPATCSINSVALANDSLFRKAVIDITSGTVGIHPDDTEKQFRGGYNITKDLPYLLDSP DKKYQADLLMIPTKLLPDPFEPTKGLSVIVNVSGFPATVIIPIDDLSDFESGKKYVVSLK LRGIAIKSVTVTTTDWDEKILNEGVDYEPLPQS >gi|226332001|gb|ACIB01000055.1| GENE 26 25623 - 26684 653 353 aa, chain + ## HITS:1 COG:no KEGG:BF1860 NR:ns ## KEGG: BF1860 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 353 1 352 352 674 98.0 0 MNSMIMKLNMNINTSVAAIYAMMLLLVACSQERTLFPTEKTDRVPLSVFTSFKADVTTRA TAQPLPEGSSIGVFLIADANYTPQSNVKYTCQPDDEWTSETSVWVGGARAGLCAYYPFGK VTFSADTRTTLKNQAYTEDADWWYAKDNAAVTVTNLSPEIEFLLEKPYSQLSLEMIRNGR YPLQCKITNVRIELENGESMVDQGTFDIKDGSLQNTTDVTSYSYPTSGPMHDTGLGVGEA ARDTTCNYLFVPQTLASGLKLTLTIDEREFSVIVPQTDLGELIAGKRHVVRLDIRGGIPI NIVEVTTKEWQSTSLDGGDATTIKDTTEIIIDGGKVGTEDWASDPTENGSGAL >gi|226332001|gb|ACIB01000055.1| GENE 27 26700 - 27626 766 308 aa, chain + ## HITS:1 COG:no KEGG:BF1861 NR:ns ## KEGG: BF1861 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 308 1 308 308 577 98.0 1e-163 MKTNLLFFICILFALVSCEQEDKVSGEKTLAVVSASSDDRPSTRGIINDNTYALGVFRTT ANTYAPLYNVKHIYSGGEWGADDVIKVDYRNASFFAYYPYHTATGNYAGLAGGTTLTLQA QLFNAGEDICYGAGEASGGGPVSVYNPFVEFLNMKHAYARLRLTLTRGEKFDKTKKCNIQ NITFKSNNANFYLTRSLDIASTAGATGGSAVAAGYVHNPNVNIATGKSVTYEYMFPPQPL AGSKLTILVTVDGVTRSCDISTLGSSLDSGKYYGVSLTFTDVGIILSSAAVTVNDFDGQG NTQVDTEL >gi|226332001|gb|ACIB01000055.1| GENE 28 27645 - 29216 1070 523 aa, chain + ## HITS:1 COG:no KEGG:BF1797 NR:ns ## KEGG: BF1797 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 523 1 523 523 962 98.0 0 MIQLKDKLKVNGGTRRYMEMIGLSLAFSLFAVSCTQDSDTFYPSSPEEEHPSVDESRMIP 
IQLSVDGVDQFYGGAVTRSGETVARLVQPLDSTYDTGYDVETTVESIPLENIVSTRANLG NVQFRVVAYRTDASSADHYAGTAVYKTNGSGVAAIVAKTATPVNSAGQWILGPGTYTFVC YSYGLNSAPTMLSGNWSATVSHNQDFMLCRKEGVVVAPDDRGEFTLGGISFTRQCALLQL AVTANDFTNNTVQQCAATVSNLNSNSITWNAGQTALSTGGTGGAVNFSWSSLNAATVNSN VYKVLPQNSRTLTIKLTTLRIGNIQYNNRVTVNVSGRQFAAGGNYKITVKITGNGITVGG ATWAKGNVYKSGSNFYFESSQSGYHSGTQGGSFFGWNTLSSTNNTYGGSSFSSDNDPCDK VAPQHTWCTPTANQLQNLGNSGYRSGYLDGKRGGYFGGNKVFLPAMGNRGKNNVNYWTET GYYRSSTGASGKRCYYLEFNQSYAVKNNYYWYWDGFPIRCVKR >gi|226332001|gb|ACIB01000055.1| GENE 29 29260 - 30330 642 356 aa, chain + ## HITS:1 COG:no KEGG:BF1798 NR:ns ## KEGG: BF1798 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 356 1 356 356 699 98.0 0 MNNLNDLIRIWCLLLILVACNQDDNMDGIPDYATVPCVINSVSVIEPHKVNTRATIEDGD SIGVFLTDDVAANGAGSTRKYSDRNNCKYTKVTGKWTPIDDENTVSLAEGVFANLWAIYP YHADKENGLDYTDRTAIPLKSRLYSAAKDLCYGKGTGATDETSFAQPAGSKVPVDISFLN MDHAYSWVVFRINRHLYTKEAKLSKVKVSNVLTYTTLDITNGTYAGNASTPEPGIAEKVD ETIPELETDYVEVGFLIVPCTLESIGTILGVADCGMKVDLTVDGDVYTLGIPRSKLPQLE AGKKYLIEITVQGAHGIVIAPDGVSTIDWPAPTTASGDLIGEATIHTNSVLNCFWS >gi|226332001|gb|ACIB01000055.1| GENE 30 30327 - 31283 686 318 aa, chain + ## HITS:1 COG:no KEGG:BF1799 NR:ns ## KEGG: BF1799 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 318 1 318 318 588 97.0 1e-167 MKINSMFFIAMCGLCISCGAEITTDTSSEVLSDRQIPIGISTVAVPEYGAETTRGLIANA TYQLGLFRTGDKGYPPQYDAPYTYTATGWTASTEVKVDHREVSLYAYYPYQSVSFVDNTT TAVLNAQLYSADKDMCYGKGLPVGSASMINNDAPEVEFTAMKHAYARLKLTLLRGANYDK SKACNIKNITLKSNNTDFYPQRHLDITTAAGATEGTAVTTGYVYNPNVNIAIGEQSSYDF LLPPQPLAGNTLTVLVTVDDEVRFLNIAGLGSSLDAGAYYNVSLTITDIQMVLTNALTVN DYGTQGNTNFDYKYEMHN >gi|226332001|gb|ACIB01000055.1| GENE 31 31267 - 32856 997 529 aa, chain + ## HITS:1 COG:no KEGG:BF1800 NR:ns ## KEGG: BF1800 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 529 15 543 543 1000 99.0 0 MRCTIDRTQTGILKSMIRLMACSCSLFLILLTVTGCDREEDERFPSGHHDPIEGDGEMVP 
INVKVDGVDEFLGGAVTRSGEILTKIVQPLDSTYDSGYDVETTIESLLPVNPVQTRGNMA NMQFRVVAYKNNSITAANYAGTAVYSTNASGIASIVANTATPAAVSGQWVLRPGTYAFVF YSYGTNSAPAALSGNWSTTVTHNQDFMLCQKTGVDVKADASGQCLLSGISFSRQCAQLQL CVVAKEFNNNTVQQCAATISGLSNSPVTWNASQTTLPVTGTSGTLNVAWTNPNTTTVNSN VYKVLPQTSRTLTIKFTTLKIGNGQMNNAITVSATSRIFSAAGNYKITVSIVPNYISVRS YKWARGNLYKSGSNFYFEAAQSNYHGGATEGGFIGWNTLDIGVGKYNSGNYSTANDPCYQ VVPPGTWETPSDGQLQDLINAGVTYSGSPKGFWFGGNNGVFLLALGSRDQSSTTINNTIS KNGAWAFYWSRTPSGKTAKGLNAMQGNNSAGIDNSWARPDGRTIRCVKR >gi|226332001|gb|ACIB01000055.1| GENE 32 32869 - 33816 418 315 aa, chain + ## HITS:1 COG:no KEGG:BF1866 NR:ns ## KEGG: BF1866 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 21 315 21 315 315 545 98.0 1e-154 MKQVLECCLLSLSLFLFFSCGKEEEGPIPTDRQVPIGVMEASLSTRVSTRGVINTDIYKL GIFRTDAQGYPPQYDAPYSYDGVAGWTATTEILVDHRSASLYAYYPYQSVSFADNTTKAI LVARIYDAAKDMCYGSATSANGSGMINNDHKGVRFIDMKHAYARLKLTFVRGTNVMSGRK CKIENIVLKNNSTNFYLQRQVDITTGTITEGTPAAEGYVHNPNVEIASGNQVYEYLLPPQ SLTDNKLTISVTVDGEVRTVTVTAFGGTLNAGGYYHVTLTINDVQIVPSAGVTDNGYTSG DDPKNPIQNNTPSVV >gi|226332001|gb|ACIB01000055.1| GENE 33 33911 - 35095 1138 394 aa, chain + ## HITS:1 COG:no KEGG:BF1867 NR:ns ## KEGG: BF1867 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 394 1 394 394 749 99.0 0 MKRELSMVLCLLLLPGMAGAGTGADSPEKKAASAVNNKRNVRQKITPVGTRKSTSVRPLS VCEVKVDLPTDSMLLSWVGFLSPSATEAPKVEEVRQTVSGSIVLDYSRGQKTKYDERNFV EAVYKLLAVTRPLRETPGVSLEGIRLTGYTAPDGDYRANERLGLQRALALKDYIRREKSF GTVPFEVNWIAEDWKGLTDLIVGSEMAFKESVLDIIRTVDVVDGRERMLMDLASGTAYHY LSSAFFGRLRRIEYEVTYVDGSFATQPSIGMTEGVSVVTSGKQEAFSVADFCRLAEAHAV GSAEFNDLMDLAGRLYPDSPVACINAAGVALLRRDTERARKYLQRFATLPEAGCNMGILC LLEGDRGKAEVYLSLAQAGGSVPAGRALRFLQSK >gi|226332001|gb|ACIB01000055.1| GENE 34 35232 - 37232 808 666 aa, chain + ## HITS:1 COG:no KEGG:BF1803 NR:ns ## KEGG: BF1803 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 666 1 666 666 1386 99.0 0 
MDIAEIATMAFIASGMTYNAVKYWRVNRKVHLAREWPVHLILKRTDDLSEECRRAVLLVF DRRGYLVRKYPEADVLKRRKRWRLRCTVQEGMLNFCVLFTPHALCLSGIHHRDDVSRLFA GHAGICIAETGLQEIIHGTSLELIVVGTETDEENEKVDSATLNALLPTEVITDAMAKELP VELKGCSEQWHTGSVHYAEGEPDNWLAVRREGDRLLMQFRTNFSAVEREATIEIGNGEGV HLLKVRQQVMGIHPTLTVSRRLYVSTGRKNEAVTLTVIPDNEQACWCVRSANANDGGCWY SVYPPVGLQQKGSQNLKVHLEAKPASVRSRSLVLTLETGTYPFSQTTDLLLMQGVCFDYY IEYPPEDPCARHSRVIETPPDYREEEGVRTYIVCVDSNQSWRIVSDKAADWVEVSEPELL QGHYDGRFTVKVHSNAGYRVRGGFPAARHTVLSLVNDTGVVRDILIYQGGYVRIRGKYWL DRNLAAGGKLAQVAIPLGLEVDTTLNQGTYFQFGCPTDRWEENFTPCRGSWYDGTAESPA RINELDPSPEGWRLPSRIEMEALMNSPAAPMELQREEDRTNICLLSDDGVPVYLPLCGHR SHINGCRIVIPHGHRYWTGSSQSPVYGYSLCVEPSRQMYLMHDMKKYGFPVRSIFNDERQ MVNDKL >gi|226332001|gb|ACIB01000055.1| GENE 35 37238 - 37606 273 122 aa, chain + ## HITS:1 COG:no KEGG:BF1804 NR:ns ## KEGG: BF1804 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 116 1 116 122 207 99.0 6e-53 MVEIRKIEEVWGGVDIPEITGVYDPLSGLRDGTITSQAPIVVSGCNLNRYALENIRLCLV THAKPEQVIDIRLVYTYSEGKVVVALPELKPGEYRPAVILKGDEKKVYVLPMRWVVRGRW RR >gi|226332001|gb|ACIB01000055.1| GENE 36 37648 - 38631 691 327 aa, chain + ## HITS:1 COG:no KEGG:BF1805 NR:ns ## KEGG: BF1805 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 327 1 327 327 671 98.0 0 MNAKKNLMAFILTVSSIALMVICLGLGMVKACAGGDGSEWKEKVAADTLHVVHYTRPDLP QIMTDPAERAVYYVKHYWDGYLTGDTAWVNSGDTEQLYVDFIDALKYVEPETGRKALHTM MVRMEADSTAYRRFGLLGEKYLNEPNSPMRNEDFYIAVLEQMLQSDRLQEWEKIRPADRL KQAHKNRPGMKAADFTYVTVHGDNSRMSRLKAKYTMLFFYDPDCSNCRKFEKLFAEIPAF VEMMENGTLRVLAIYPDENREEWAAKAVYMPQGWIVGWNKAGDIRTRQLYDIRATPTIYL LDGRKRVILKDTSMEQLIDYLATQAGE >gi|226332001|gb|ACIB01000055.1| GENE 37 38765 - 40888 183 707 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|84496588|ref|ZP_00995442.1| 30S ribosomal protein S1 [Janibacter sp. 
HTCC2649] # 627 705 200 279 502 75 46 9e-13 MEIFHKMISAALNLPEKQISNTLGLLAEGATIPFISRYRKEITGGLDEVQIESIKTQYDK LSELAKRKETILGTIGEQGKLTPELRQRIDATWDATALEDIYLPYKPKRKTRAEAARQKG LEPLALLLMMQRENNLGSRIPAFVKGDVKDAEDALKGARDIIAEQVSEDERARNAVRNLF ARQAVISAKVVKGKDEEAAKYRDYFDFSSPLKRCTSHRLLAIRRAEAEGLLKVSITPDDD ECLERLDRQFVRSNNECGRQVAEAVQDAYRRLLKPSIETEFASLSKEQADDEAIRVFAEN LRQLLLAPPLGQKRVMGIDPGFRTGCKVVCLDAQGNLVHNENIYPHPPVDKKTEAASKLR KMIEAYKIEAIAIGNGTASRETENFVTHQQFDRPVQVFVVSEQGASIYSASKTARDEFPD YDVTVRGAVSIARRLMDPLAELVKIDPKSIGVGQYQHDVDQTKLKKSLDQTVENCVNQVG VNLNTASSHLLTYISGLGPQLAQNIVAYRAANGAFASRKELMKVPRMGAKAFEQCAGFLR IAGGENPLDNTAVHPESYGIVQQMAKDLSCTVPQLIADKSLRTRIEMEKYITPTVGLPTL KDILQELDKPGRDPRDTIQVFEFDRNVRTINDLREGMTLPGIVSNITNFGAFVDIGIKEN GLVHLSQLANRFITDPTEVVSIHQHVTVKVLSIDLERKRIQLTMKEE >gi|226332001|gb|ACIB01000055.1| GENE 38 41006 - 42919 1503 637 aa, chain + ## HITS:1 COG:MA4377_3 KEGG:ns NR:ns ## COG: MA4377_3 COG0642 # Protein_GI_number: 20093164 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Methanosarcina acetivorans str.C2A # 400 637 13 255 311 158 40.0 3e-38 MPVSFKYILYLLLLVIGCCPPMAGHAATGEKPILMICSYNPGAYPTSANVSDFMDEYQRL GGKRGVVIENMNCKSFSDFPRWKGVMENILDKYRGSQEPALIILFGQEAWASYLSLNDSV TGEVPVMCALTSRNVVLLPDDGKDLAHWMPESSDFYEDSLKHQVCGGFLYEYDIASNIRM IRAIYPDTKNIAFISDNTYGGVTLQAHVRKEMKQFPDMNLILLDGREHTIYTIVDELRKL PKHTAVLLGTWRVDKNEGYFMRNATYSMMEAIPDVPTFTATSIGLGYWAVGGVVPVFRTF GKELAEEAVKLLDNPEDPNMRVEVVGTEALLDSKKVKEQKIDVAALPMKVKLVNESPSFY KQYRYQIWVGVGVLCILVIGLLVSIYFYLRTKRLKDDLERSQVALYEAKDRAEESNRLKS AFLANMSHEIRTPLNAIVGFSDVLASGGSSEEDQRNYFRIIQSNSDLLLRLINDILDLSR LEANKVILTPEDCDVVQLCRQALSSVEMSRRESGNRFVFETKTDSFVLQTDIQRLQQVLI NLLTNAAKFTKNGTITLQFEVEKEKNRVLFAVADTGCGIPKEKQKQVFERFEKLNEYAQG TGLGLSICKLTVDKWGGDIWIDPDYEGGARFVVSHPL >gi|226332001|gb|ACIB01000055.1| GENE 39 43036 - 44775 1760 579 aa, chain + ## HITS:1 COG:CAC0353 KEGG:ns NR:ns ## COG: CAC0353 COG0737 # Protein_GI_number: 15893644 # Func_class: F Nucleotide transport and metabolism # Function: 
5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases # Organism: Clostridium acetobutylicum # 27 558 542 1079 1193 224 29.0 5e-58 MKRLICMYAFLLCLVCVLPAQEREVKLKIVQTSDVHGNYFPYNFITQKEWGGSLARVYAL VQKNREVYKGNLILLDNGDILQGQPSAYYYNYIDTVAPHVCAEMMNFMGYDAGNMGNHDV ETGRAVFDRWIGECNFPVLGANIVETATGETHLPPYRVLERDGVKIVVLGMITPAIPAWL SENLWQGLRFDDMEETARKWMKVIREKENPDLVIGLFHAGQDAFVMSGKYNENASLNVAK NVPGFDMVLMGHDHARECKKVVNVAGDSVLVIDPASNGIVVSDIDVTLKLKDGKVVSKQI DGVLTDTKEYGVSESFMRHFALQYGAVEKFVSKKIGVFTEDMSTRPAYFGSSAFIDFIHS LQLDISGADISFAAPLSFDAEIKKGDIRVSDMFNLYKYENMLYVMKLSGKEIKDFLEESY YMWTNRMKSPEDHLLWLKEKRRAGAEDRASFQNFSFNFDSAAGIIYTVDVTRPKGEKVTI VSMADGSPFQMDHIYKVALNSYRGNGGGELLTKGSGIPQEKLKERIIFSTDKDLRFYLMQ YIEKKGTLDPRALNQWKFVPEEWVKPAAERDYEYLFGGK >gi|226332001|gb|ACIB01000055.1| GENE 40 44782 - 45255 326 157 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15902812|ref|NP_358362.1| hypothetical protein spr0768 [Streptococcus pneumoniae R6] # 8 155 2 150 165 130 44 2e-29 MAEELTFISGSKEEQYLSLLPQVRSLIEGEVDLVANLANVAAALKEAFDFFWVGFYLVKQ DQLVLGPFQGPVACTRIRKGKGVCGTAWQEGATLLVPDVEVFPGHIACSSLSRSEIVVPL IKDGKVWGVLDIDSDLLNFFDETDRKYLEEMCGYLSK >gi|226332001|gb|ACIB01000055.1| GENE 41 45293 - 45631 223 112 aa, chain + ## HITS:1 COG:no KEGG:BF1810 NR:ns ## KEGG: BF1810 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 112 1 112 112 159 99.0 3e-38 MGHIKKILLSFLLLTLFVTYQVSITMFTHVHYVNGVMIAHSHLYKGTHSHTASNIIVIAH FAAFHSLEVDVHYDFTPERPILFTVEIPESIPVTAGTHLQVISLRAPPAELA >gi|226332001|gb|ACIB01000055.1| GENE 42 45854 - 48127 2288 757 aa, chain + ## HITS:1 COG:PA1922 KEGG:ns NR:ns ## COG: PA1922 COG4771 # Protein_GI_number: 15597118 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Pseudomonas aeruginosa # 106 625 37 550 653 95 24.0 3e-19 MLVLACVSMSSYAVNPIKEGNMIAGHVIVKGTEESIPFATVMILGTNRGAVSNEEGQFEF RKLAAGKYTLRVQVMGYKTQEKTITVSAEATSVVHFQMEEVSFTTDEVVVSANRNEVSRK 
AAPVVVNVMSAKLFETVNSTDLAKSLNFQSGLRVENNCQNCGFPQVRINGLEGPYSQILI NSRPIISALSGVYGLEQIPVNMIERVEVVRGGGSALFGANAVGGTINIITKDPINNSFQV ASTMSNMNGKSWEQYMGGNVSLVAKDNSYGIALYETYRNRNPYDADGDGFSELGKLNMNT FGMRAYYRPNYFSRINVEYHTTNEFRRGGNKFNLQPHEADITEQTKHIINSGGVSYDRYW GEKHKMSVYGSIQHTDRNSYYGAQKDMNAYGKTNDLTWVVGGMYVGNMDRCLFAPATFTG GVEYQSNSLHDVMTGYHRDMQQDVRIAGGFVQNEWRLNRWTMLVGARLDKHNLIDHPIFS PRVNFLYKPSDNLQARLTYSTGFRAPQAYDEDLHVTAVGGEGVQIRLADGLREERSNSFS GSVDWSFPMGHWQSNILLEGFYTDLHHVFVLEDIGEDQNGDKIKERRNGSGAKVYGVNLD AKVAHGREAQLQLGFTVQRSRYNRAEVWTSEGEEEQTTKRMPRTPDYYGYFTFTSAPLKN FDFSLSGTYTGKMIVPHMAGYIEKSRMEHTPQFMDLNLKLNYTFVLKDHIKMQVNGGVQN IFNSFQKDLDKGEFRDAGYFYGPTQPRTYFVGIKIMN >gi|226332001|gb|ACIB01000055.1| GENE 43 48187 - 48948 698 253 aa, chain + ## HITS:1 COG:MK0183 KEGG:ns NR:ns ## COG: MK0183 COG1402 # Protein_GI_number: 20093623 # Func_class: R General function prediction only # Function: Uncharacterized protein, putative amidase # Organism: Methanopyrus kandleri AV19 # 17 248 12 221 224 90 31.0 2e-18 MNKEVDLSVSCLGKVKELKYDVIILPWGATEPHNLHLPYLTDCILPHDIAVEAAELALSR SGVRCMVMPPVPFGAHNPGQRELPFCIHTRYATQQAILEDIVSSLHVQGFRKLLILSGHG GNNFKGMIRDLAFEYPDFLIAAANWFEVVSPKGYFEAEIDDHAGESETSVMMHYHPELVN LAEAGDGESKPFAIASLNEKVAWAPRHWDKATVDSGVGNPKKATAEKGERYVKPIVEKLA GLFEEMAQHDLYE >gi|226332001|gb|ACIB01000055.1| GENE 44 49121 - 49702 568 193 aa, chain - ## HITS:1 COG:CAC3336 KEGG:ns NR:ns ## COG: CAC3336 COG0664 # Protein_GI_number: 15896579 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Clostridium acetobutylicum # 11 189 13 192 199 75 26.0 5e-14 MENLKETVDIVVNSRYPEMNREGRELLAQVLIRKELEKGEMLLNEGQISRHMVFVGKGML RQFYYKNGKDVTEHFSYEGCILMCIESLLKQEPTHLMAEALEPAVVYMLPYDVLQKLLEQ SKEINAFYRKVLEYSLIVSQIKADSWRFETARERYNLLLHHHPEIIKRAPLSHIASYLLM TPETLSRVRSGVL >gi|226332001|gb|ACIB01000055.1| GENE 45 50275 - 51636 894 453 aa, chain + ## HITS:1 COG:CAC0883 KEGG:ns NR:ns ## COG: CAC0883 COG0534 # 
Protein_GI_number: 15894170 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Clostridium acetobutylicum # 1 430 1 428 448 253 36.0 5e-67 MNDKYEERLGTDRMLPLVFRMALPAVIAQIVNLLYNIVDRIYIGHIPGIGTQALAGIGVA GSLIILISAFSAIVAGGGAPLAAIALGQGNRTHAGKILGNGFVLLLFFTLLTSGLSYLFM EPILLFTGASEQTLGYATAYLSIYLIGTLFVEVSVGLNTFINTQGRPGIAMLSIVIGALL NILLDPLFIFVFDWGVKGAALATIISQACSAGWVLFFLTSRRASLRLEPRYMRLDRKVVG AILALGASPFIMASTESLVGFVLNGSLKTFGDIYVSALTIMQSAMLFVSVPLAGFALGFV PIVSYNYGHGNRERVKECFKIVMTFMFLFNLVLILLMILFPSVIASAFTSDEKLIETVVQ VMPVFLAGMTIFGLQRACQNMFVALGQAKVSIFIALLRKVILLIPLALMLPYLMGVMGVY AAEAISDAAAAICCTVIFAVQFPRIMNKLTVRS >gi|226332001|gb|ACIB01000055.1| GENE 46 52282 - 52728 452 148 aa, chain + ## HITS:1 COG:no KEGG:BF1816 NR:ns ## KEGG: BF1816 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 148 1 148 148 273 100.0 2e-72 MKTLTFKYLKLFLLAVAMINLTSCEIEIDDFYDDDNIGGSYYNKSLDLCSRPWADTFYDA DGNYCYQELNFYLDRHGEDYIRVEYPNGRYSESVYSFTWNWEDRSQYSLRMVYGPGDVSY LDDVWIRGNVLSGYLDGHDNYVDFTGVR >gi|226332001|gb|ACIB01000055.1| GENE 47 52840 - 54192 1204 450 aa, chain + ## HITS:1 COG:BH2936 KEGG:ns NR:ns ## COG: BH2936 COG0534 # Protein_GI_number: 15615498 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Bacillus halodurans # 14 448 11 446 452 303 42.0 6e-82 MTHRIQQESIKRRLAKLAAPIFIETLLIMMLGAVDTIMLSRHSDNSVAAVGVVNQIIMLT FLVFEVINLGTSVLCSQYLGARLEKKVVQVVGVSLLVNLLVGGIISLILFTMARTILQWM GLGPELMADGMDYMRIVGAFAFFQAISLTLSASLRSANKAIYPMLVTVVVNILNIIGNYS LIFGKFGCPELGVEGAAISTAFSRGVSMVILFVILFRKHIHRFPLAYFRPFPFIELKNLM RVGLPSAGEQLSYSSSQVVITFFINMLGVEALATRTYCVNIIMFVYLFSISMAQGGAICI GHLIGEKKPHAAFLLGKYVMKKSVMITLILSGIIAASGHAILSWLTSNPEIIRMGVIVLL IDVVLEIGRPINIFATNALRAAGDVNYPFYVGLVVMWSVAVGIGYLFGIYWAWGICGMWV AFALDENIRGIIFVRRWYGMKWVNKSFVRS >gi|226332001|gb|ACIB01000055.1| GENE 48 54278 - 55939 1470 553 aa, chain + ## HITS:1 COG:STM3807 KEGG:ns NR:ns ## COG: STM3807 COG2985 # Protein_GI_number: 16767092 # Func_class: R 
General function prediction only # Function: Predicted permease # Organism: Salmonella typhimurium LT2 # 27 548 18 545 553 341 37.0 3e-93 MEWLYSLFIEHSALQAVVVLSLISAIGLGLGKIHVCGISLGVTFVFFAGILAGHFGLSID PQMLNYAESFGLIIFVYALGLQVGPGFFSSFRKGGVTLNMLAIAVVILGTFLAVVCSYTT GVSLPNMVGILCGATTNTPALGAAQQTLKQMGLESSTPALGCAVAYPLGVIGVILAVLLI RKLLVRREDLEVQEKDDANKTYIAAFQVHNPAIFNKSIKDIAHMSYPKFVISRLWRDGNV SIPTSEKIIKEGDRLLVVTSEKDALALTVLFGEQENTDWNKEDIDWNAIDSQLISQRIVV TRPELNGKKLGALRLRNHYGINISRVYRSGVQLLATPELTLQLGDRLTVVGEAAAIQNVE KVLGNAIKSLKEPNLVAVFVGIILGLALGAVPFSIPGISTPVRLGLAGGPIIVGILIGTF GPRLHMITYTTRSANLMLRALGLSLYLACLGLDAGAHFFDTVFRPEGLLWIGLGFGLTLV PTVLVGFFAFKIMKIDFGSVSGMLCGSMANPMALNYANDTIPGDNPSVAYATVYPLSMFL RVIIAQVLLMFLL >gi|226332001|gb|ACIB01000055.1| GENE 49 55988 - 57982 2020 664 aa, chain + ## HITS:1 COG:CAC1572 KEGG:ns NR:ns ## COG: CAC1572 COG3855 # Protein_GI_number: 15894850 # Func_class: G Carbohydrate transport and metabolism # Function: Uncharacterized protein conserved in bacteria # Organism: Clostridium acetobutylicum # 1 664 1 663 665 795 60.0 0 MTAQSNITPESIVADLRYLQLLSRSFPTIAAASTEIINLEAILNLPKGTEHFLTDIHGEY EAFQHVLKNASGAVKRKVNEIFGHTLREIEKKELCTLIYYPEEKLQLIKATETDIDDWYL ITLNQLVKVCQNVSSKYTRSKVRKSLPAEFSYIIQELLHESTIEPNKHAYINVIISTIIS TRRADDFIIAMCNLIQRLTIDSLHIVGDIYDRGPGAHIIMDTLCDYHNFDIQWGNHDILW MGAASGNEACMANVIRLSMRYGNLGTLEDGYGINLLPLATFAMDTYADDPCTIFAPKTNF ADSTYNEKTLRLITQMHKAITIIQFKLEANIINRRPEFGMGGRKLLEKIDFERGVFVYEG KEYPLRDTNFPTVDPADPYRLTDEEQELIEKIHYSFMNSEKLKKHMRCLFTYGGMYLVCN SNLLYHASVPLNEDGSFKHVNICGKEYWGKNLLDKIDQLIRTAYFDEDDEEEKRFAMDYI WYLWCGPDAPSFDKDKMATFERYFIADKSLHKETKGYYYALRNKEEICDRILEEFGVTGQ HTHIINGHVPVKTIKGEQPMKAGGKLLVIDGGFSKAYQPETGIAGYTLVYHSHGLQLVQH DPFQSTQKAIEEGQDIKSTTFVIEFNSQRMMVKDTDKGKELVTQILDLKKLLVAYRIGLI KEKV >gi|226332001|gb|ACIB01000055.1| GENE 50 58074 - 59231 1144 385 aa, chain - ## HITS:1 COG:no KEGG:BF1820 NR:ns ## KEGG: BF1820 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 385 1 385 385 664 99.0 0 
MKKIILVAALLSAAVCLPAQNKGGNKSGGINLSLWKKACTQPLDSTQTTYVNLGLFSAMH KLHGVGFNAFGSMVQNNMNGVQISGLANLAGGSMHGVQIGGISNVNGNNLAGLSVSGLVN ITGNKAKGVLITGLSNIAGDNMRGLMMSGIMNITGDKAAGVQLAGLANVTGEEYDGPMMS GLLNVVGEEMNGLQLSGLANVTGGQMNGVQLGLFNFASKAKGLQIGLFNYHKEDMKGLQL GLVNANPQTKVQLMVFGGNSTKINVGARFKNKLFYTILGGGTHYLDFDDKFSAALFYRAG LELPLYRNLFVSGDLGYQHIETFRNKKVEGIPARLYALQARLNLEYRFTNKFGLFVTGGY GGSRYYNKARTYDKGVIAEAGVVLF >gi|226332001|gb|ACIB01000055.1| GENE 51 59340 - 60998 1642 552 aa, chain - ## HITS:1 COG:aq_999_1 KEGG:ns NR:ns ## COG: aq_999_1 COG1022 # Protein_GI_number: 15606303 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Aquifex aeolicus # 25 546 14 499 600 212 29.0 1e-54 MEQSFIAYIENSIKNNWDLDALTDYKGATLQYKDVARKIEKLHIIFEESGIRKGDKIAVC GRNSSHWGVTFLATLTYGAVIVPILHEFKADNVHNIVNHSEAKLLFVGDVVWENLNESAM PLLEGILMMNDFTLLVSRSERLTHAREHLNEMFGKKFPKNFRKEHIEYHKDQPEELAVIN YTSGTTSYSKGVMLPYRSLWSNTAFAFEVLPLKAGDKIVSMLPMAHMYGLAFEFLYEFAV GCQIYFLTRMPSPKIIFQAFADVKPNLIVAVPLIIEKIIKKSVLPKLETPTMKLLLKVPI INDKIKATVREEMIKGFGGNFEAVIVGGAAFNQEVEQFLRMIDFPYTVGYGMTECGPIIC YEDWKRFKPGSCGKAALNMEVKVLSPDPENVVGEIVCKGPNVMLGYYKNEEATAQVIDKD GWLHTGDLALEDAEGNITIKGRSKNMLLSASGQNIYPEEIEDKLNNLPYVAESIIVQQND KLVGLVYPDFDEAFAHGLKNEDMERVMEENRITLNEMLPAYSQISKMKIYPEEFEKTPKK SIKRFLYQEAKG >gi|226332001|gb|ACIB01000055.1| GENE 52 61192 - 62262 993 356 aa, chain - ## HITS:1 COG:Cj1428c KEGG:ns NR:ns ## COG: Cj1428c COG0451 # Protein_GI_number: 15792746 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Campylobacter jejuni # 1 354 1 342 346 348 47.0 1e-95 MDKNAKIYVAGHHGLVGSAIWKNLQEKGYTNLVGRTHKELDLLDGAAVKQFFDEEMPEYV FLAAAFVGGIMANSIYRADFIYKNLQIQQNVIGESFRHQVKKLLFLGSTCIYPRDAEQPM KEDVLLTSPLEYTNEPYAIAKIAGLKMCESFNLQYGTNYIAVMPTNLYGPNDNFDLERSH VLPAMIRKIHLAHCLKKGDWEAVRKDMNLRPVEGISGANSNEEILRILRKYGITETEVTL WGTGMPLREFLWSEEMADASVFVMEHVDFKDTYKAGAKDIRNCHINIGTGKEITIRELAG 
LIVNTVGYQGKLTFDSSKPDGTMRKLTDPSKLHNLGWHHKIDIEEGVQRMYEWYLG >gi|226332001|gb|ACIB01000055.1| GENE 53 62267 - 63340 1212 357 aa, chain - ## HITS:1 COG:BMEI1413 KEGG:ns NR:ns ## COG: BMEI1413 COG1089 # Protein_GI_number: 17987696 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: GDP-D-mannose dehydratase # Organism: Brucella melitensis # 1 350 1 346 362 480 67.0 1e-135 MKKALISGITGQDGSFLAEFLLQKGYEVHGILRRSSSFNTGRIEHLYFDEWVRDMKQKRT VNLHYGDMTDSSSLIRIIQQVQPDEIYNLAAQSHVKVSFDVPEYTAEADALGTLRMLEAV RILGLEKQTRIYQASTSELYGKVQEVPQSETTPFYPRSPYGVAKQYGFWITKNYRESYGM FAVNGILFNHESERRGETFVTRKITLAAARIAQGEQDKLYLGNLDAKRDWGYAKDYVECM WLILQHDVPEDFVIATGEMHTVREFCTLAFAEIGINLRWEGEGVNEKGIDTATGKVLVEV DPKYFRPAEVEQLLGNPTKARTVLGWNPCKTPFPELVKIMVRHDMAKVKRMIATKHD >gi|226332001|gb|ACIB01000055.1| GENE 54 63554 - 64825 801 423 aa, chain + ## HITS:1 COG:FN1382 KEGG:ns NR:ns ## COG: FN1382 COG1373 # Protein_GI_number: 19704717 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Fusobacterium nucleatum # 25 416 4 396 402 201 33.0 2e-51 MNDIFDELVSSNYWNGKLPKVEFARKSYTNRIFGYVGNRLVKVLTGQRGVGKGCLLRQII YQLIADGVSSRNILYISKKFTDSGVLSGYRDLDDLLFIYREKIHPVGKVYIFIEEIQKVE GWEHFVHSHSQDFVDTCELFISGSNQDMLSGEAEKLLARHYVSFEIFPFSFSEHRILERV EATGKNYAEYMDKGALPILFNLPDAEAGWKAISALRDTILFRDIIRRYRIKDARLLEDVF VYLCTHLAELITIGQLVSHFAAQNRKTSYDTVANYICYLEDTSLIHRVERYQIRSKEILL GSCKYYINDWAFMRHLYPFFAGSRKSCFENQVYLELRRAGYTVFVGVLRGGKLVDFVGRK KDRIVYLQCAPLLNDDFQVEQMYNTLEMIQDNYEKWVVSMDDTTLPSKEGIRHIQVWQLP EIL >gi|226332001|gb|ACIB01000055.1| GENE 55 65119 - 65922 566 267 aa, chain - ## HITS:1 COG:no KEGG:BF1890 NR:ns ## KEGG: BF1890 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 267 1 267 267 528 100.0 1e-149 MATTYDGINYFPIAVNFMEENAMEVIEAKYGMKGLAIVLKLLCKIYKEGYYIPWGEEQCL IFANKTGKEADAEEVQGIIEIMIEKGMIDRGSYAEHKVLTSEAIQKVWIEATKRRKRNWT AMPYLLIKPKETCNEEKTVCTQNVEQDAGSDAKNACNTEQSKVKQSREFPPSAPPRGKEE 
EVNATPVSMPGYAFNTMTHNYVGLMGNLERFGITDEKEIEAILRLSDYGRKGTPVWKLIC STNWSNIGAKGKYMIAALNRAKKRSGT >gi|226332001|gb|ACIB01000055.1| GENE 56 65972 - 66319 437 115 aa, chain - ## HITS:1 COG:no KEGG:BF1826 NR:ns ## KEGG: BF1826 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 115 1 115 115 195 100.0 5e-49 MKKNRKAVSKNEMMKGKWKRQIMLEKDYTEQCSEWMAERLEALIEYMQYGHAAVAYMKQD GTFKLVKGTLVGYEKDFGKQYDPMEIKNTVVYRDVEQQRWMTFKIENFMEWRAIV >gi|226332001|gb|ACIB01000055.1| GENE 57 66460 - 66798 232 112 aa, chain - ## HITS:1 COG:no KEGG:BF1827 NR:ns ## KEGG: BF1827 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 112 1 112 112 186 100.0 3e-46 MNDNLVTIEVNGKVIFAGKADLQFSLNQKVREPLRITKGKGKLMEALTESFIKGGINSME NRPIEELQETIKEYLTFEYQRKGIATEPNKRSFLSELKKYARAFRKKRDESE Prediction of potential genes in microbial genomes Time: Wed May 18 00:13:39 2011 Seq name: gi|226332000|gb|ACIB01000056.1| Bacteroides sp. 3_2_5 cont1.56, whole genome shotgun sequence Length of sequence - 67811 bp Number of predicted genes - 65, with homology - 63 Number of transcription units - 29, operones - 15 average op.length - 3.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 128 - 652 433 ## BF1828 putative transcriptional regulatory protein UpxY-like protein 2 1 Op 2 . + CDS 656 - 1135 458 ## BF1829 hypothetical protein + Prom 1167 - 1226 6.8 3 2 Op 1 . + CDS 1274 - 2707 401 ## BF1835 putative flippase 4 2 Op 2 . + CDS 2704 - 3543 314 ## BF3688 putative alpha-1,2-fucosyltransferase 5 2 Op 3 5/0.000 + CDS 3550 - 4485 616 ## COG0451 Nucleoside-diphosphate-sugar epimerases 6 2 Op 4 1/0.000 + CDS 4482 - 5735 717 ## COG1208 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 7 2 Op 5 3/0.000 + CDS 5739 - 6719 636 ## COG2605 Predicted kinase related to galactokinase and mevalonate kinase 8 2 Op 6 . 
+ CDS 6721 - 7287 308 ## COG0279 Phosphoheptose isomerase 9 2 Op 7 1/0.000 + CDS 7290 - 8189 404 ## COG0463 Glycosyltransferases involved in cell wall biogenesis + Prom 8404 - 8463 7.8 10 2 Op 8 . + CDS 8486 - 9232 186 ## COG3774 Mannosyltransferase OCH1 and related enzymes + Prom 10067 - 10126 3.7 11 3 Tu 1 . + CDS 10184 - 10843 195 ## gi|253566798|ref|ZP_04844250.1| predicted protein + Term 10903 - 10948 -0.8 12 4 Tu 1 . - CDS 11002 - 11136 58 ## - Prom 11220 - 11279 5.8 + Prom 11142 - 11201 9.0 13 5 Op 1 4/0.000 + CDS 11278 - 11853 329 ## COG0438 Glycosyltransferase 14 5 Op 2 3/0.000 + CDS 11857 - 12879 873 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases 15 5 Op 3 . + CDS 12867 - 13997 919 ## COG0381 UDP-N-acetylglucosamine 2-epimerase 16 5 Op 4 3/0.000 + CDS 14017 - 14880 519 ## COG1091 dTDP-4-dehydrorhamnose reductase 17 5 Op 5 8/0.000 + CDS 14877 - 16088 502 ## COG0438 Glycosyltransferase 18 5 Op 6 1/0.000 + CDS 16111 - 17118 713 ## COG0451 Nucleoside-diphosphate-sugar epimerases 19 5 Op 7 . + CDS 17122 - 18072 605 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 20 6 Op 1 . - CDS 18232 - 18423 211 ## BF1849 hypothetical protein 21 6 Op 2 . - CDS 18407 - 18658 98 ## BF1850 hypothetical protein - Prom 18735 - 18794 6.9 + Prom 18875 - 18934 4.9 22 7 Tu 1 . + CDS 19060 - 19533 577 ## BF1851 putative non-specific DNA binding protein + Term 19708 - 19755 8.6 23 8 Tu 1 . - CDS 19724 - 20902 999 ## COG1301 Na+/H+-dicarboxylate symporters - Prom 20977 - 21036 7.3 + Prom 20805 - 20864 3.7 24 9 Op 1 . + CDS 21035 - 22510 1606 ## COG0362 6-phosphogluconate dehydrogenase 25 9 Op 2 15/0.000 + CDS 22525 - 24021 1259 ## COG0364 Glucose-6-phosphate 1-dehydrogenase 26 9 Op 3 . + CDS 24018 - 24734 168 ## PROTEIN SUPPORTED gi|163781723|ref|ZP_02176723.1| 50S ribosomal protein L13 + Term 24748 - 24785 3.1 27 10 Tu 1 . 
+ CDS 25091 - 25315 309 ## BF1856 hypothetical protein + Term 25361 - 25407 2.1 + Prom 25386 - 25445 5.2 28 11 Tu 1 . + CDS 25561 - 25875 304 ## BF1857 hypothetical protein + Term 25919 - 25954 3.4 29 12 Tu 1 . - CDS 25915 - 26046 60 ## - Prom 26073 - 26132 6.0 + Prom 26035 - 26094 8.4 30 13 Tu 1 . + CDS 26158 - 28224 1706 ## COG1649 Uncharacterized protein conserved in bacteria + Term 28246 - 28302 10.2 + Prom 28303 - 28362 5.2 31 14 Op 1 . + CDS 28385 - 28933 253 ## COG4413 Urea transporter 32 14 Op 2 . + CDS 28941 - 29237 129 ## BF1925 putative UreA transport protein + Term 29284 - 29340 4.5 + Prom 29400 - 29459 12.2 33 15 Op 1 . + CDS 29609 - 30856 846 ## BF1926 hypothetical protein + Term 30867 - 30916 -0.7 + Prom 30882 - 30941 9.9 34 15 Op 2 . + CDS 31053 - 31757 369 ## BF1861 hypothetical protein + Term 31936 - 31972 2.4 35 16 Op 1 4/0.000 - CDS 31871 - 33154 546 ## COG0389 Nucleotidyltransferase/DNA polymerase involved in DNA repair 36 16 Op 2 . - CDS 33154 - 33594 339 ## COG1974 SOS-response transcriptional repressors (RecA-mediated autopeptidases) - Prom 33735 - 33794 5.0 + Prom 33678 - 33737 6.7 37 17 Op 1 . + CDS 33902 - 34345 266 ## gi|253566821|ref|ZP_04844273.1| predicted protein 38 17 Op 2 . + CDS 34363 - 35049 505 ## gi|253566822|ref|ZP_04844274.1| predicted protein 39 17 Op 3 . + CDS 35118 - 35660 320 ## gi|253566823|ref|ZP_04844275.1| predicted protein + Term 35696 - 35752 4.0 + Prom 36026 - 36085 7.6 40 18 Tu 1 . + CDS 36158 - 36736 325 ## gi|253566824|ref|ZP_04844276.1| predicted protein + Term 36830 - 36861 1.1 - Term 36887 - 36923 -1.0 41 19 Op 1 9/0.000 - CDS 36944 - 37642 595 ## COG3279 Response regulator of the LytR/AlgR family 42 19 Op 2 . 
- CDS 37645 - 38697 802 ## COG3275 Putative regulator of cell autolysis 43 20 Op 1 36/0.000 - CDS 38830 - 40050 427 ## PROTEIN SUPPORTED gi|163788031|ref|ZP_02182477.1| 50S ribosomal protein L9 44 20 Op 2 24/0.000 - CDS 40064 - 40807 326 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 45 20 Op 3 13/0.000 - CDS 40810 - 42033 1365 ## COG0845 Membrane-fusion protein 46 20 Op 4 . - CDS 42070 - 43473 1360 ## COG1538 Outer membrane protein + Prom 43711 - 43770 9.1 47 21 Op 1 . + CDS 43926 - 44141 168 ## BF1874 hypothetical protein 48 21 Op 2 31/0.000 + CDS 44174 - 45727 1586 ## COG1271 Cytochrome bd-type quinol oxidase, subunit 1 49 21 Op 3 . + CDS 45753 - 46901 1161 ## COG1294 Cytochrome bd-type quinol oxidase, subunit 2 + Term 46956 - 47007 12.0 + Prom 46945 - 47004 4.6 50 22 Op 1 . + CDS 47025 - 48089 1181 ## COG3831 Uncharacterized conserved protein 51 22 Op 2 . + CDS 48094 - 49104 677 ## BF1940 hypothetical protein 52 22 Op 3 . + CDS 49091 - 49942 476 ## BF1941 hypothetical protein 53 22 Op 4 . + CDS 49939 - 51219 947 ## COG0635 Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 54 22 Op 5 . + CDS 51233 - 52744 651 ## BF1943 hypothetical protein - Term 52679 - 52719 4.1 55 23 Op 1 . - CDS 52747 - 53703 1094 ## COG1052 Lactate dehydrogenase and related dehydrogenases - Prom 53723 - 53782 2.5 56 23 Op 2 . - CDS 53787 - 55058 1219 ## COG2256 ATPase related to the helicase subunit of the Holliday junction resolvase 57 23 Op 3 . - CDS 55082 - 57676 1843 ## COG3250 Beta-galactosidase/beta-glucuronidase 58 23 Op 4 . - CDS 57712 - 60273 2094 ## BF1947 hypothetical protein - Prom 60324 - 60383 4.5 59 24 Tu 1 . - CDS 60387 - 61472 1068 ## COG0381 UDP-N-acetylglucosamine 2-epimerase - Prom 61495 - 61554 6.0 + Prom 61459 - 61518 5.7 60 25 Tu 1 . + CDS 61557 - 62183 611 ## COG2860 Predicted membrane protein 61 26 Tu 1 . 
- CDS 62197 - 62691 365 ## BF1888 hypothetical protein - Prom 62833 - 62892 3.1 - Term 62798 - 62841 3.2 62 27 Op 1 3/0.000 - CDS 62975 - 63943 939 ## COG0501 Zn-dependent protease with chaperone function 63 27 Op 2 . - CDS 63965 - 64525 709 ## COG1704 Uncharacterized conserved protein - Prom 64592 - 64651 8.5 64 28 Tu 1 . + CDS 64816 - 65319 474 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog + Term 65391 - 65438 11.5 + Prom 65393 - 65452 8.8 65 29 Tu 1 . + CDS 65568 - 67343 1690 ## COG5016 Pyruvate/oxaloacetate carboxyltransferase + Term 67373 - 67425 10.5 Predicted protein(s) >gi|226332000|gb|ACIB01000056.1| GENE 1 128 - 652 433 174 aa, chain + ## HITS:1 COG:no KEGG:BF1828 NR:ns ## KEGG: BF1828 # Name: not_defined # Def: putative transcriptional regulatory protein UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 1 174 1 174 174 333 100.0 1e-90 MSEQQKYWFAARTRDKQEFAIRDSLEKLKTELDLNYYLPTQFVIRQLKYRRKRVEVPVIK NLIFIQATKQDACDISNKYNIQLFYMKDLLTRAMLIVPDKQMQDFIFVMDLDPNGVSFDN DHLSVGSRVQVVKGDFCGVEGELASEANKTYVVIRIAGVLSASVKVPKSYLRVI >gi|226332000|gb|ACIB01000056.1| GENE 2 656 - 1135 458 159 aa, chain + ## HITS:1 COG:no KEGG:BF1829 NR:ns ## KEGG: BF1829 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 159 1 159 161 265 89.0 3e-70 MNTLTSQIEQLQSLAHELLYLGVDGAPIYTDHFRQLNKEVLEQSDALYPQRGATPEEEAN ICLALLMGYNATIYNQGDKEEKKQVVLNRCWDVLDQLPATLLKCQLLTYCYGEVFEEELA KEAHLIISGWDHSRLSNDEKEVFESLKILEENPYPYFEL >gi|226332000|gb|ACIB01000056.1| GENE 3 1274 - 2707 401 477 aa, chain + ## HITS:1 COG:no KEGG:BF1835 NR:ns ## KEGG: BF1835 # Name: not_defined # Def: putative flippase # Organism: B.fragilis # Pathway: not_defined # 1 466 23 490 497 258 39.0 4e-67 MMVGIILVPIYLVYIPKDLYGYWLATGNILTIISLLDPGIGGVVTQKVSYYYGKMDCRSV GCYSFNGVLLSVIIALLVFIVGILLSEKISLFLNVPYEYQEELINAFNYTLTGTSLMIIY YSIGAIAYGMLSSKCIGFINLFANISGLILTVVFFRLEYGLLSIGYASLIRSIIYILGSI LYITYRFVNEKIGFSFDLLLMKDFFKLSFYNFFGSLGQNLLSNMNSFICTKYISPIASAN 
LRFSQTVPDMGKTVALRIVSSFAPSISNLYGANQKKQLKTMIFLLTQILVWLLGLVFVGF LFLNESFILIWIGANNYCGGISNFLIIVLLMLSTVNKTISLIIFNLGDIKINNLVLFFQA VLYFALIIPIALYFKINGILLLSIGIEFLAFYFYYGRKIINIFDCKDDVKKIYQNIFQTI LVCAFVYIVLLVIDYFPTNWISFLMIVLFISLFYTLCLCLISVAFRQACFKFILRFR >gi|226332000|gb|ACIB01000056.1| GENE 4 2704 - 3543 314 279 aa, chain + ## HITS:1 COG:no KEGG:BF3688 NR:ns ## KEGG: BF3688 # Name: not_defined # Def: putative alpha-1,2-fucosyltransferase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 274 7 287 289 176 38.0 7e-43 MKKVIFSGGLGNQMFQYAFYLFLKKKGIKAVIDNSLYSEFKMHNGFELIKVFDIKESIYR TYFLKVHLIFIKLLMKIPPVRKLSCKDDVIPIGDHEFDPPYARFYLGYWQSKKIVNYVIE ELRAQFIFRNIPQMTIEKGDFLSSINSVSIHIRRGDYMGIPAYQGICNEIYYERAISFMK EHFLNPRFYVFSNDSIWAKLFLEKFDIDMEIIVTPPIYSYWDMYLMSRCRNHIIANSTFS WWAAVLNINKDKIVISPTIFKKDECIDIIFDDWVKISNI >gi|226332000|gb|ACIB01000056.1| GENE 5 3550 - 4485 616 311 aa, chain + ## HITS:1 COG:MA4460 KEGG:ns NR:ns ## COG: MA4460 COG0451 # Protein_GI_number: 20093246 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Methanosarcina acetivorans str.C2A # 2 302 10 310 320 286 47.0 3e-77 MKILVTGGAGFIGSHLCDLLVCNDNQVVALDNLSRGRKENIMHLVDDGHFSFIQEDLLNR SSLRQIFIQEDFDMVYHLAANSDIQKGSQDPTVDYDLTFNTTFNVLQCMKEFKVKKFFFA STSAIYGETSDWLKENYGPLLPVSNYGAAKLASEAFISAFSSMYNIQTWIARFPNVVGER FTHGVIYDFIHKLQKNPNVLTVLGDGEQIKPYLYVKDLIGGILFICKNSHEEINIFNLGS TTRTKVKEIAQMVIDEMGLSASIEYTGGTRGWIGDVPEFRYDLTKINTLGWSATYNSNDS VRIAIQKALGK >gi|226332000|gb|ACIB01000056.1| GENE 6 4482 - 5735 717 417 aa, chain + ## HITS:1 COG:alr2361_1 KEGG:ns NR:ns ## COG: alr2361_1 COG1208 # Protein_GI_number: 17229853 # Func_class: M Cell wall/membrane/envelope biogenesis; J Translation, ribosomal structure and biogenesis # Function: Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) # Organism: Nostoc sp. 
PCC 7120 # 1 237 1 237 375 168 37.0 2e-41 MKVVIIAGGKGTRIASLNSEIPKAMIPINGKPVLEYQIELAKRYGHTDIFLVIGYLGDKI RSYFGNGESFGVQIKYFEEHSPLGTAGALAELRNVLTEDFFLFYGDTVMDVALDKMYAFH TIQQADATLFLHPNDHPYDSDLVEIDKKGVIIGFHSKPHPKNFLYRNLVNAALYILSPRL LLHIPQGIKSDFGKDIFPKVLTENLRLIGYVSSEYIKDMGTPERYKKVCNDVLTGKVARL NKKYAQAAVFLDRDGVLVKEVDLLCKPEQLELLEGAADAIRYINESGYLAVVVTNQPVIA RNLCSIEELEFIHKKMETLLGFEHSYLDAIYYCPHHPDKGFPEERKEYKIKCTCRKPNPG MLLQAAQDLNINLKKSYMIGDRDSDIIAGQNAGVSASILIERNKPFALLNALRNFIK >gi|226332000|gb|ACIB01000056.1| GENE 7 5739 - 6719 636 326 aa, chain + ## HITS:1 COG:TVN0888 KEGG:ns NR:ns ## COG: TVN0888 COG2605 # Protein_GI_number: 13541719 # Func_class: R General function prediction only # Function: Predicted kinase related to galactokinase and mevalonate kinase # Organism: Thermoplasma volcanium # 1 323 1 321 324 239 44.0 8e-63 MIISRTPFRISFAGGGSDLSSFYSQQMGAVLSTSINKYVYIAIHPFFDSRKIQLKYSKTE LVSSFDEIQHPIFKEVLKMSDLTGIDLNSIADIPAGTGLGSSSAFTVGLLNAIYAYKYKA VGNEMLAKLACEVEIERLKSPIGKQDQYAAACGGLNLISFYPDETVNVEKIIMDPHKKQE LEDNLIMIYTGGTRSANSILKEQNREILEKDKFNNQKAMVKLAFDLKRSLEDNNIDDFGQ YLHEGWLLKKTLTGSISNSFVDDIYDLGLKSGALGGKLLGAGGGGFILFYCPKGIQENFR KKMSQFTEIDFRFDNYGSKIIYVGDR >gi|226332000|gb|ACIB01000056.1| GENE 8 6721 - 7287 308 188 aa, chain + ## HITS:1 COG:HP0857 KEGG:ns NR:ns ## COG: HP0857 COG0279 # Protein_GI_number: 15645476 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoheptose isomerase # Organism: Helicobacter pylori 26695 # 4 179 7 180 192 101 32.0 9e-22 MDYKNDIIVYFNKLKKTIDNISLEELNALMNILVEAKDAGRTVFIMGNGGSSATASHYVC DFNKGVSFEHNKKFKFVCLNDNIPSLMAYANDLSFEDIFIEQLKNFFQPNDVVIGISGSG NSKNVLRAIEYANINRGITVGLTGYDGGILKKIVQYSVHVPVNDMQITEDIHMVLDHCMM KILSNNKC >gi|226332000|gb|ACIB01000056.1| GENE 9 7290 - 8189 404 299 aa, chain + ## HITS:1 COG:CAC2174 KEGG:ns NR:ns ## COG: CAC2174 COG0463 # Protein_GI_number: 15895443 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Clostridium acetobutylicum # 1 219 1 
227 336 139 40.0 9e-33 MNIAPLVSVIMPSYNSKKYIKKAIDSVLEQTYSNFELIIVDGNSTDGTLDILDEYKKQDR RIKVIQDEGRGIGAALQLGCQIASGKFIARMDSDDIAINTRFEKQLKIFHSIPNLILVAS PVIYINEDDSIVGYSFPYTNKRIIQEKVYLIAHPTVMMKKDAYVKAGGYQPLLRAEDYFL WNRMRLMGEFYIFKEPLIKYRLLQDSLSHTLDDNFNKKLGRKLESYFIKPIISEIDIIEI NDFISTNLPKNRIISHMSSKRKIRLFSFLKIIFKFLPLNIFVVKFVISIKNVFGFLYVK >gi|226332000|gb|ACIB01000056.1| GENE 10 8486 - 9232 186 248 aa, chain + ## HITS:1 COG:FN1241 KEGG:ns NR:ns ## COG: FN1241 COG3774 # Protein_GI_number: 19704576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Mannosyltransferase OCH1 and related enzymes # Organism: Fusobacterium nucleatum # 1 220 1 211 243 86 31.0 5e-17 MIPKIIHYCWLSKEKQSPFIQGCIRSWRKIMPDYQIICWDVNRFDLNLSPFAKEAYEKRK WAFAADYIRLFALYNYGGFYLDSDVRVFKRFDAFLNHGFVSSIDIQQGLEGHLDFGIQAA IMGAEVHNPFVKDCLSYYENRHFIKEDGSLSIKPIAPDIIAMYADKYGLKRINQFQLLEP NIAIYPASVFAGHPIYWTKQSVAMHFCNNSWIPKSRIHKLTSYLYNNYRLPMLLKKIYFR LRRFYINV >gi|226332000|gb|ACIB01000056.1| GENE 11 10184 - 10843 195 219 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566798|ref|ZP_04844250.1| ## NR: gi|253566798|ref|ZP_04844250.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 219 227 445 445 427 100.0 1e-118 MLLISVILLGALGLFQGGMTFIRSNKSLDIIEIIQSINNSKNDGERSLLDEIDFRFGALT HYSTGFYRMVDRGYMAGWNPILNSLYSPIPRSLMEDKPVPCSVDGDLYSMGMYKTQAEIT HIDTNMVEFSTAAHAYWELHIFGLILFSIIPAIYVFLSIKLFRQFALLAPCFLMVVFKPW GYNDPKIWVSEIFLQLSQVIIPTLFILLFYSSIRKRTCK >gi|226332000|gb|ACIB01000056.1| GENE 12 11002 - 11136 58 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKISINYCIYFYWWWNISKTQYLTKGVSMMKIKSKPLYHRWIRK >gi|226332000|gb|ACIB01000056.1| GENE 13 11278 - 11853 329 191 aa, chain + ## HITS:1 COG:RP340 KEGG:ns NR:ns ## COG: RP340 COG0438 # Protein_GI_number: 15604208 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Rickettsia prowazekii # 31 164 182 313 338 62 32.0 6e-10 MQCMFDKPLTKIFLYPFWQSYGTSINEAFCRRDYVYVANYTPTKQHFLLIKVWEKLHELG FNLTLHLTLSNYPIDLEQELEKALKNGVKIINHGSLEKSNIQKLYNCSKATIYPSLNESF GLGIIEALTAGCDVIGPDLPYIHSVCIPSVVFSSFEVDNVVNAIIDYERGCGQKSQLTIT NNLSGLVDLLI >gi|226332000|gb|ACIB01000056.1| GENE 14 11857 - 12879 873 340 aa, chain + ## HITS:1 COG:PM1007 KEGG:ns NR:ns ## COG: PM1007 COG1086 # Protein_GI_number: 15602872 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Pasteurella multocida # 3 335 1 333 344 473 68.0 1e-133 MSLFKDKTLLITGGTGSFGNAVLKRFLDSDIKEIRIFSRDEKKQDDMRHHLQNKKVKFYI GDVRDKRSVDGVMNGVDYIFHAAALKQVPSCEFFPVQAVRTNVLGTENVLDSAIEHGVKN VVVLSTDKACYPINAMGISKAMMEKVAIAKGRQMGEGGQTTICCTRYGNVMASRGSVIPL WVEQMKAGKDITITDPNMTRFMMTLDDAVDLVIYAFQHGHNGDLFVQKAPAATLDTLARS LKELYKIDTPVRVIGTRHGEKLYESLVTREEMAKAEDMGNYYRIPCDARDLNYDKYFVEG QEKVSKFEDYHSHNTHRLDVDGMKQLLLKLDMIKEDVCLK >gi|226332000|gb|ACIB01000056.1| GENE 15 12867 - 13997 919 376 aa, chain + ## HITS:1 COG:RP334 KEGG:ns NR:ns ## COG: RP334 COG0381 # Protein_GI_number: 15604202 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine 2-epimerase # Organism: Rickettsia prowazekii # 1 374 1 376 376 462 59.0 1e-130 
MLKVMTIVGTRPEIIKLSRVMAELDKYTEHIMVHTGQNFDYELNEIFFQELRIRKPDYFL DAAGKNAAETIANVIRKSDELMDQVKPDALLLYGDTNSCISVISAKRRKIPIFHMEAGNR CFDQRVPEEINRKIVDHLSDINMPLSEHARKYLLAEGLRPETVIKTGSPMTEVLIYHKAE IEENDVLEKEGLKKGDYFIVSTHREENVDSEKNFSDLLSSLNAIVDKYHKKVIVSTHPRT RKKLESIGFINSNPMIEFMKPFGFMEYIKLQQNAFCVISDSGTITEESSILHFPAITIRQ AHERPEGMDEGTLIMTGLNSDRILESIEIVTSQYAEGADVIHSIPDYASDNVSKKVVRII LSYTDYINRTVWHKEI >gi|226332000|gb|ACIB01000056.1| GENE 16 14017 - 14880 519 287 aa, chain + ## HITS:1 COG:RP332 KEGG:ns NR:ns ## COG: RP332 COG1091 # Protein_GI_number: 15604200 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose reductase # Organism: Rickettsia prowazekii # 5 274 2 283 284 181 37.0 2e-45 MVKKKILLFGATGMAGHMAYYYLQCTGKYDLVNVVYRTQLTDDSIVVDVTDGDAVSQLVR EVRPDFILNCIGVLIRGSREHPDNAILINAWFPHLLKKLSDEVGAKLIHISTDCVFSGKK GNYSETDIRDADDVYGRSKALGEIINDKDLTIRTSIIGPELKENGEGLFHWFMHQHGCVN GFQTAIWGGVTTLELAKAIDVAIDQGVTGLIQLSNGLGISKYDLLHLFSRIWHKRDVEIL PFDGNGIDKSIAKSARFSYVVPGYEEMLREQYDWMQNKQELYRTLYL >gi|226332000|gb|ACIB01000056.1| GENE 17 14877 - 16088 502 403 aa, chain + ## HITS:1 COG:SP0351 KEGG:ns NR:ns ## COG: SP0351 COG0438 # Protein_GI_number: 15900280 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Streptococcus pneumoniae TIGR4 # 1 403 1 405 409 258 36.0 2e-68 MKILLVSQYFWPETFRVNDLAQELVLRGNEVTVLTGKPNYPQGAIYKGYSFWGYKKEYYK GIELIRVPLIPRGKGSGLRLALNYLSFVLFSCLYILFHKRKFDVSLTFAISPITQVYAAL LHKKLFGSRAYLWVQDLWPESVSAAGKMNSGLVYRMLTKMVQSIYQKVDGICIQSEAFSQ SILQKGDYKHKISYIPNWAEDLFTDDSLINKEHFKSLIPDGFVVMFAGNIGEAQDFDSIL KAAIRTKEYKDIKWVIVGDGRKKEFVEQQVKELNLCDTVFLLGRYPLEDMPDLFIHADVM LVSLKDQNIFSLTIPSKIQSYMAFGKPIISMINGIGNEIIKEANCGFTANAGDFESLANN VKRLSRMDKNMLYEKGRAGKEYYQLFFAKKKIIDNLIEVFQSE >gi|226332000|gb|ACIB01000056.1| GENE 18 16111 - 17118 713 335 aa, chain + ## HITS:1 COG:VC0262 KEGG:ns NR:ns ## COG: VC0262 COG0451 # Protein_GI_number: 15640291 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # 
Function: Nucleoside-diphosphate-sugar epimerases # Organism: Vibrio cholerae # 1 331 9 317 323 106 27.0 7e-23 MNILITGAYGFVGTNLINNLKIKHNLYGLDIVCSTREGVLKTFCWKDIDPESFPLQILSQ FDAIIHLAGKAHDTKNQSAAQSYFDINTGLTQKIFDFFLESSAKKFIFFSSVKAAADSVV GDMLTEDVIPAPVGPYGESKIRAEEYIKEHFAFPTVSSCECSPFREMTSVTEKQVYILRP CMIHGPGNKGNLNLLYNVVKKGIPWPLGDFENRRSFTSIDNLCYVIEGLLTKEVPTGIYH MGDDEALSTNELIAIMCEAMGKQPHIWKMNKGFMEGCAGLGTLLHLPLNTERLRKLTENY VVSNAKIKAALGIDKMPVTAKEGLIKTIRSFEETK >gi|226332000|gb|ACIB01000056.1| GENE 19 17122 - 18072 605 316 aa, chain + ## HITS:1 COG:PA3145 KEGG:ns NR:ns ## COG: PA3145 COG0472 # Protein_GI_number: 15598341 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Pseudomonas aeruginosa # 4 295 9 315 339 88 28.0 2e-17 MYYLIILVLLFLAELFYFRIADKCNIIDKPNERSSHTRITLRGGGIIFFFGALAYFLTNQ FEYPWFMLALTLITFISFVDDIRSTSQGLRLVFHFTAMALMFYQWGLFSLPWWTIVVALI VCTGIINAYNFMDGINGITGGYSLVVLTALAFINGVYVPFVEPTLIYTMLCAVLVFNFFN FRKQAKCFAGDVGSVSIAFVILFLIGMLIIRTENFSWIVLLAVYGVDSVLTIIHRLMLHE NIGLPHRKHLYQIMANELKIPHMVVSLVYMLVQAVVIVGYFLFPGNEYGYLSGTIIALSL VYILFMKRFFCLHQAK >gi|226332000|gb|ACIB01000056.1| GENE 20 18232 - 18423 211 63 aa, chain - ## HITS:1 COG:no KEGG:BF1849 NR:ns ## KEGG: BF1849 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 63 1 63 63 115 100.0 4e-25 MIQNLDVFLWNRKVGTLVAYKEKYVDKVISVVSNYQRYAHIAGVDAYWGHQIKEEINDRI GML >gi|226332000|gb|ACIB01000056.1| GENE 21 18407 - 18658 98 83 aa, chain - ## HITS:1 COG:no KEGG:BF1850 NR:ns ## KEGG: BF1850 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 4 83 1 80 80 130 100.0 2e-29 MARMSQKEMAEKSGVSLATISHFEQGVNQNMTLNNFISLLRIIGMEQRINDLLPELPMPL MALKQLNKFIPKRVRRNNNDTKS >gi|226332000|gb|ACIB01000056.1| GENE 22 19060 - 19533 577 157 aa, chain + ## HITS:1 COG:no KEGG:BF1851 NR:ns ## KEGG: BF1851 # Name: not_defined # Def: putative non-specific DNA 
binding protein # Organism: B.fragilis # Pathway: not_defined # 1 157 1 157 157 301 100.0 4e-81 MPLFYRARQSQLKTKEGKKQWHLTLVKVGKMVTSQQLAEVIAEKSSLTPGDVHNVIRNLM TAMRKELLNSRSVRLEGLGTFTMKACTQGHGVDQEEEVSPNQVAALRCLFTPEYTRPAAI GTTRALLQGVEFQKVSAIGGAINGGSGSGDIVDDPTA >gi|226332000|gb|ACIB01000056.1| GENE 23 19724 - 20902 999 392 aa, chain - ## HITS:1 COG:Cgl2969 KEGG:ns NR:ns ## COG: Cgl2969 COG1301 # Protein_GI_number: 19554219 # Func_class: C Energy production and conversion # Function: Na+/H+-dicarboxylate symporters # Organism: Corynebacterium glutamicum # 8 381 5 379 412 325 51.0 1e-88 MKKIHIGLLPRIIIAIILGIAIGNFLPTPLVRLFVTFNSIFGEFLNFSIPLIILGLVTIA IADIGKGAGRMLLVTALIAYGATLFSGFLSYFTGAAIFPSLITPGAPLDEVSEAQGILPY FSVAIPPLMNVMTALVLAFTLGLGLASLHSDALKNVARDFQEIIVRMISAVILPLLPIYI FGIFLNMTHSGQVFSILMVFIKIIGVIFILHIFLLVFQYCIAALFVRKNPFRLLGRMLPA YFTALGTQSSAATIPVTLEQTKKNGVSADIAGFVIPLCATIHLSGSTLKIVACALALMMM QGMPFDFSLFAGFIFMLGITMIAAPGVPGGAIMASLGILQSMLGFDESAQALMIALYIAM DSFGTACNVTGDGAIALIIDKIMGKRKTPESL >gi|226332000|gb|ACIB01000056.1| GENE 24 21035 - 22510 1606 491 aa, chain + ## HITS:1 COG:TP0331 KEGG:ns NR:ns ## COG: TP0331 COG0362 # Protein_GI_number: 15639322 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphogluconate dehydrogenase # Organism: Treponema pallidum # 8 491 4 488 488 556 55.0 1e-158 MATQNKTDIGLIGLAVMGENLALNMESRGWNVSVYNRTVPGVEEGVVERFINGRAKGKHI EGFTDIEAFVESIALPRKIMMMVRAGSPVDELMEQLFPYLSPGDILIDGGNSNYEDTNRR VKLAESKGFLFVGAGVSGGEEGALNGASIMPGGSEKAWEEVKPILQSIAAQAPDGTPCCQ WVGPAGSGHFVKMIHNGIEYGDMQLIAEAYWVMKELLDMTNEEMASVFTRWNEGKLRSYL IEITGNILRHKDKTGAYLIDKILDAAGQKGTGKWSVINAMELGMPLGLIATAVFERSLSA RKELREAAARQYQCRHSMAVYNKQDTEKEIFSALYASKLVSYAQGFAVLQRASDTFGWNL DLASIARMWRGGCIIRSVFLNDIAAAFEAKEKPKHLLLAPYFEEEIKGLLSGWKNLVAQA MREELPVPAFSSALNYFYSLVSADLPANLVQAQRDYFGAHTFERKDELRGVFFHENWTGH GGDTKSGTYNV >gi|226332000|gb|ACIB01000056.1| GENE 25 22525 - 24021 1259 498 aa, chain + ## HITS:1 COG:VCA0896 KEGG:ns NR:ns ## COG: VCA0896 COG0364 # Protein_GI_number: 15601650 # Func_class: G 
Carbohydrate transport and metabolism # Function: Glucose-6-phosphate 1-dehydrogenase # Organism: Vibrio cholerae # 6 498 10 501 501 555 54.0 1e-157 MSKFVMTIFGASGDLTKRKLMPALYSLYVAKRLPEEFEILGVGRTVYEDADYRTYIYNEM EKFVKSEEQNKEKMDAFVGHLHYLAIDPALESGYGQLRLRIEELSGDSRPDDLLFYLATP PSLYGVIPLHLKSVHLNKGRARIIVEKPFGYDLESAEKLNKIYASVFDEHQIYRIDHFLG KETAQNLLAFRFANGIFEPLWNRNYIDYVEVTAVENLGIEQRGGFYDTTGALRDMVQNHL IQLVALTAMEPPAVFNADNFRNEVVKVYESLTPLTETDLSEHIVRGQYTAGGNKRGYREE KNISPDSRTETYIAMKLGISNWRWSGVPFYIRTGKQMPTKVTEIVVHFRETPHQMFHCAG GNCPRANKLILRLQPNEGIVLKFGMKVPGPGFEVKQVTMDFSYDQLGGVPGGDAYARLIE DCILGDQTLFTRSDAVEASWHFFDPILRYWNEHPEAPLYGYPAGTWGPLESEAMMHEHGA EWTNPCKNLTNTDQYCEL >gi|226332000|gb|ACIB01000056.1| GENE 26 24018 - 24734 168 238 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163781723|ref|ZP_02176723.1| 50S ribosomal protein L13 [Hydrogenivirga sp. 128-5-R1-1] # 34 234 37 218 228 69 27 5e-11 MKPYIFPSSIETARALILHLVKLMLDEPDRTFCIAFSGGSTPALMFDLWANEYTDITPWE RLKVFWVDERCVPPENSDSNYGMMRSLLLSIVPIPYENVFRIQGEKNPKKEAARYSKLVM KEVPVENEFPVFDVVLLGAGNDGHTSSIFPGQEELLSTDHIYEANFNPNNGQKRIALTGL PILNARRIIFLITGRVKSPVVEDIFYSGDTGPAAYIAHHADNVELFMDNAAAEKVIRG >gi|226332000|gb|ACIB01000056.1| GENE 27 25091 - 25315 309 74 aa, chain + ## HITS:1 COG:no KEGG:BF1856 NR:ns ## KEGG: BF1856 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 74 1 74 74 126 100.0 2e-28 MEYKFDEQSVKELMEWAQTAQLPQELELSKAERIFDVKLCIESDLSCIRAHYPDAFYNPA ITRLYRIREKLEEK >gi|226332000|gb|ACIB01000056.1| GENE 28 25561 - 25875 304 104 aa, chain + ## HITS:1 COG:no KEGG:BF1857 NR:ns ## KEGG: BF1857 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 104 1 104 104 191 100.0 5e-48 MKKVLVALVIVMGLGFSVAKADEPLKKKSPKVEQRDSREDFTPIEVNNLPEAVIDELSCE GALIKEAFIAYSRSEGKLYKVIILSSDFHEQAVFLNERGNILNR >gi|226332000|gb|ACIB01000056.1| GENE 29 25915 - 26046 60 43 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLLFHSQKISQKPENYKEMSLEHISLAFCTDETKTPRLFRVGA >gi|226332000|gb|ACIB01000056.1| GENE 30 
26158 - 28224 1706 688 aa, chain + ## HITS:1 COG:all1210_2 KEGG:ns NR:ns ## COG: all1210_2 COG1649 # Protein_GI_number: 17228705 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Nostoc sp. PCC 7120 # 360 606 1 279 489 75 24.0 3e-13 MKQIQPVFLAMKFSFLIFFFFSLPVGAQSIFQKYGRFLTEPRSYVCYRTDGKLKIDGKLD EASWQKAKPTAPFVDISGEGFPTPKYETTAKMLWDDEYLYIGAMLQEDDIKARLTQRDTI IYYDNDFEVFIDPDWDGHNYFEIETNARGVIFDLMLDRPYRSGGNFMVQWDCPGLKLAIH REGTLNKSKDKDKYWSVEMAIPHKALTMNFNNPLKAGNCWRINFSRVQWLKAGGPEENWV WTPTGKVDMHMPDRWGYLFFADEKVGTPEHTFALPYNASVYKLLWAMFYVQQERYAKEKN YLRTEQDFFLTDAELKGLPQGAQISVEATRNTYQIAITVPGEGRRYIINNEGRFWTEKVA PRQVKNWVWTRINKSKSEADYRQWFALLKECGISGVMFEGYDENLYRMCKEAGLEAHFWK WTMNRAELLNVHPDWFAVNRKGESTHDKPAYVDYYRFLCPNHEGVAQYLADDYVKIAHLP YVDGVHLDYVRFPDVVLPVSLWKNYGIEQTSEHPEYDYCYCDVCRTKFKEQTGRDPLELK YPMEDQSWINFRLDAISRVVDQITKAVKADGKAISAAVFPGPSMAKKMVRQDWGNWSLDA YFPMIYNGFYYEGPEWIGRSVQESVKTVDGRAKVYAGLMFPDIKNDFEKALDEAFDNGAS GVSFFDGPSDEYLHQFKAYLDKKGLKTE >gi|226332000|gb|ACIB01000056.1| GENE 31 28385 - 28933 253 182 aa, chain + ## HITS:1 COG:YPO2672 KEGG:ns NR:ns ## COG: YPO2672 COG4413 # Protein_GI_number: 16122877 # Func_class: E Amino acid transport and metabolism # Function: Urea transporter # Organism: Yersinia pestis # 10 177 29 214 330 75 32.0 6e-14 MYKNILILGRGIGQVMFQNNALSGGLMLLGIAFNSWQLAVLSVLGTVVSTLTASLSGYDK EDIRNGLYGFNGTLVGIAIGVFMETNVTSILLLISGSAFSTWVARCFRYQNRVSGLTAPF IFVVWLLLVGCHYLYPSLLLSSSLEKPELTMDIFRSFCLNIGQVMFQGNILSGLFFLFRD PD >gi|226332000|gb|ACIB01000056.1| GENE 32 28941 - 29237 129 98 aa, chain + ## HITS:1 COG:no KEGG:BF1925 NR:ns ## KEGG: BF1925 # Name: not_defined # Def: putative UreA transport protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 98 186 283 283 166 100.0 3e-40 MNALYTLTGAILPLFMILYPHTDLAAWNLGLLGYNGVLCAIALGDKTGIGVVKAIFSIIL SIVLQLTGMHMGIVTLTAPFVFSVWITGGLFSVFRSKS >gi|226332000|gb|ACIB01000056.1| GENE 33 29609 - 30856 846 415 aa, chain + ## HITS:1 COG:no KEGG:BF1926 NR:ns ## KEGG: BF1926 # Name: not_defined 
# Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 415 1 415 415 823 100.0 0 MKMILNFSECIISFFSLCLLCVVFSCDEMDIDQGPPTSVTRNLMTSDGPGFKIDTVTYDK IPAEYARKILSLEEPTSVVTDKSKRSFRVNELFQIRKANDQLLSITSYSAKTVYDLTLEV YVEGGSQYVPIAYLDSIPGFSQFEFKPSLINGNFIYKKDNGVDTLSLSSLNEKRMKFRLL SDDKHFEMLSKIDAEWNISFSNYDWKPGYESGSWRELSAIYAREWVVIITNYAYMMTTPE YAFIMRNFSKIFGGELYDNNRVKFTPEKYLSEEKRFKQPHNFVCGRSKPSVGGLGGGNVW GVTHWNYYGHYASFSGWESITHEFMHCMGYGHSSNMTYASGGVGWTEFMWQLHTYLRGND WLPYTDRNLLGFHKPENAKYRDGGIDPDKLNDNKILQFYNKSKVTQYFLANPLSK >gi|226332000|gb|ACIB01000056.1| GENE 34 31053 - 31757 369 234 aa, chain + ## HITS:1 COG:no KEGG:BF1861 NR:ns ## KEGG: BF1861 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 234 31 264 264 478 100.0 1e-134 MLVPTMTTEEVCKEIKNDYPAFYEKMLDNKASNYRKFIKAVLFPVIHQFSWKSSSGNMWN VIMLARYRNERKCPGIVPYLKYENWGMGIIYPKNIYSNLSIIDFKPHFWKRYRERQLIPN GLEGISFDEQIKYFFLNSGLFTFDFREGSNKGHEGFVGYTKTGIFFGVVIKELDYLCVKT YVSANMLFDNQIESLDSADELREKILSHPDYFQKRGKLFHIMNDSSFWMDETIR >gi|226332000|gb|ACIB01000056.1| GENE 35 31871 - 33154 546 427 aa, chain - ## HITS:1 COG:ECs1679 KEGG:ns NR:ns ## COG: ECs1679 COG0389 # Protein_GI_number: 15830933 # Func_class: L Replication, recombination and repair # Function: Nucleotidyltransferase/DNA polymerase involved in DNA repair # Organism: Escherichia coli O157:H7 # 1 413 1 421 422 327 42.0 4e-89 MFGLMDCNNFYASCERVFNPALNGKPIVVLSNNDGCVIARSNEAKALGIKMGVPAYQIKD DIQKYGISVFSSNYTLYGDMSGRVMSILAEQVPEMEVYSIDEAFLNLEGIRDIQSLGTDI INKVIRGTGIPVSLGIAPTKTLAKVANKFAKKYPAYNRLCIIDTEEKRIKALQLTEIGDI WGIGHRQVAKLEKQGVKTAYDFTELPESWVRKNMTVVGERTWKELQGISCIDMETTPPAK KQICTSRSFGKMVEDIDTMSEAIATHASTCAKKLRQQKSYAMSLMAFIHTNNFRKDSPQY WRNTVIHLPIPTNDTLEIVHYALAGLKTIFMQGYQYKKTGVIITEITDSTQLGLFDSVDR EKRERLQQTIDKINGKHSRLVKLAIQGTGRNWKLKQKQLSGHYTTDINQIISINCTYPTA CQRKQYS >gi|226332000|gb|ACIB01000056.1| GENE 36 33154 - 33594 339 146 aa, chain - ## HITS:1 COG:ECs1678 KEGG:ns NR:ns ## COG: ECs1678 COG1974 # Protein_GI_number: 15830932 # 
Func_class: K Transcription; T Signal transduction mechanisms # Function: SOS-response transcriptional repressors (RecA-mediated autopeptidases) # Organism: Escherichia coli O157:H7 # 5 143 2 139 139 118 41.0 3e-27 MKRKLEIHKIDVSSSLPIPYADEGIRAGFPSPAQDYMEQAIDLNKELIKHPASTFFGRVV GDSMRDEGIEEGDILVIDKSLELQDDDLAVCFIDGDFTVKRVRIEPNAVWLIPANPKYSL IKVTKENEFIVWGIVTYTIKKNRRKR >gi|226332000|gb|ACIB01000056.1| GENE 37 33902 - 34345 266 147 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566821|ref|ZP_04844273.1| ## NR: gi|253566821|ref|ZP_04844273.1| predicted protein [Bacteroides sp. 3_2_5] # 1 147 1 147 147 285 100.0 4e-76 MSSYSQSRPDHLFNDIPSALTKGGIPDCFIIAFRGEQYVRELSGLRDKFVTAPMGNRKKL DVNSNNYATSLTAFRDYLNNLGNKTLVYNREEALAIVIDLFKNSIDTDVPAKLENENENQ IRRAIEKWQKYIIQVVDTIAQVATKFD >gi|226332000|gb|ACIB01000056.1| GENE 38 34363 - 35049 505 228 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566822|ref|ZP_04844274.1| ## NR: gi|253566822|ref|ZP_04844274.1| predicted protein [Bacteroides sp. 3_2_5] # 1 228 1 228 228 464 100.0 1e-129 MSVRVINKDFKYQERQSDHDNYCAGFALAAILSDLNGDGTICKGEGIYRSLQKYPVTGEW SSILHRLTGCIEGVDGVMTLPGAVVVGTVELTKRNVCVFIDDEKLIEAIDQLIHFLLDIG LGRDLDQQLESVKLHIPELIAEQKSIVGAQCIKPWKSIEEVLQESRYAIMLVNGVHWIAV KKTENGYNVYDPACGVRSAQLNTQYITLISDRYGDRQYTISGIIIAVS >gi|226332000|gb|ACIB01000056.1| GENE 39 35118 - 35660 320 180 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566823|ref|ZP_04844275.1| ## NR: gi|253566823|ref|ZP_04844275.1| predicted protein [Bacteroides sp. 3_2_5] # 1 180 1 180 180 334 100.0 2e-90 MEIYFLDSKSVYNKKEISDKEIPNNVIKVGGLGDYFKYHTILEVPNKRFMLKDLVGLTKL YNTEFAIYTCEDKLYVKKGNKPDLQHGKAGSVTIKKSEQMLIHVHPTYHSVRDHLNVDLQ VNSGQVEAIIDYDYNIIVYQNNEVFNKKDAEGIYQTLDIHDVHWPSFLEKNAGYNIEVYI >gi|226332000|gb|ACIB01000056.1| GENE 40 36158 - 36736 325 192 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566824|ref|ZP_04844276.1| ## NR: gi|253566824|ref|ZP_04844276.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 192 1 192 192 389 100.0 1e-107 MSMNELDFNLRFLDLLNRRDDIPTWGIPSVPQSIIVDENKYKLINCGVEYAPQRLYGVYW RQSNISYYRIAKGADAISFQFTGCYLAKVMYNSDYFVFHIHSSDSDSTADYWNQFIDDNR DRIAEITIFKPAVANEKDDVMFKYLNESDKGIFTIAGMITVDNECYEILLNNKTCKAEAV LRKTKLVDPHLR >gi|226332000|gb|ACIB01000056.1| GENE 41 36944 - 37642 595 232 aa, chain - ## HITS:1 COG:FN0219 KEGG:ns NR:ns ## COG: FN0219 COG3279 # Protein_GI_number: 19703564 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Fusobacterium nucleatum # 3 204 2 207 240 99 32.0 7e-21 MKLRCAIVDDEPLALSLLESYVNKTPFLELAGKYSSAVQAMKELPGNQIDLLFLDIQMPE LNGLEFSKMVSPRTRIVFTTAFGQYAIDGYRVNALDYLLKPISYVDFLQATNKALQWFEL VQKPEEVDSIFVKSDYKLVQVELKKILYIEGLKDYIKIYTEDAPKPILSLMSMKSMEELL PPARFMRVHRSFIVQKDKIRIIDRGRIVFDKTYIPVSDSYKQTFQTFLDERS >gi|226332000|gb|ACIB01000056.1| GENE 42 37645 - 38697 802 350 aa, chain - ## HITS:1 COG:SMb21546 KEGG:ns NR:ns ## COG: SMb21546 COG3275 # Protein_GI_number: 16264735 # Func_class: T Signal transduction mechanisms # Function: Putative regulator of cell autolysis # Organism: Sinorhizobium meliloti # 124 324 155 356 383 95 32.0 1e-19 MKQSLTSARRPLEILIHIISWGIVFGFPFFFIDRTGDSINWHAYLRHSAVPLSFVTVFYL NYFLLVPHLLFREQKNKYIIYNILLVCLIGLLLHIWQSLNAPAPTLKKPHMPPGWIFFVR DILSLIFTIGLSAAIRMSARWGQAEAARREAEKSRTEAELKNLRNQLNPHFLLNTLNNIY ALIAFDSDKAQQAVQELSKLLRYVLYDNQQNYVPLCKEVDFIRNYIELMRIRLSGNVEVI TQFDIQPDSRTEIAPLIFISLIENAFKHGISPTELSFIHILISENKEEIRCEIRNSYHPK TNTDKSGSGIGLEQVRKRLELSYPGRYQWDKAISPDGKEYISKLLIFNHP >gi|226332000|gb|ACIB01000056.1| GENE 43 38830 - 40050 427 406 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163788031|ref|ZP_02182477.1| 50S ribosomal protein L9 [Flavobacteriales bacterium ALC-1] # 8 406 10 413 413 169 30 5e-41 MNGTNLFKIALRALNNNKLRAFLTMLGIIIGVASVITMLAIGQGSKKSIQAQISEMGSNM IMIHPGADMRGGVRQDPSAMQTLKLTDYETLRDETSFLAAVSPNVSSSGQLIAGNNNYPS SVNGVGTEYLEIRQLSIDNGEMFSEADIQSSAKVCVIGKTIVDNLFPDGEDPVGRIVRFS KIPFRVVGVLKSKGYNSMGMDQDDIVLAPYTTVMKRLLAQTYLQGIYASALSEDMTDNAT 
EEITELLRRNHKLKEADDDDFTIRSQQELSSMLNSTTDLMTTLLACIAGISLVVGGIGIM NIMYVSVTERTREIGLRMSVGARGVDILSQFLIEAILISITGGLIGVIIGCGASWVVKSV AHWPIFIQPWSVFLSFAVCTVTGVFFGWYPAKKAADLDPIEAIRYE >gi|226332000|gb|ACIB01000056.1| GENE 44 40064 - 40807 326 247 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 5 232 1 226 245 130 34 3e-29 MNKTVIELQNIKRNFQVGDETVHALRGVSFTITEGEFVTIMGTSGSGKSTLLNTLGCLDT PTSGEYLLDGISVRTMSKPQRAILRNRKIGFVFQSYNLLPKTTAVENVELPLMYNSGVSA SERRRRAIEALQAVGLGERLEHKSNQMSGGQMQRVAIARALVNNPAVILADEATGNLDTR TSFEILVLFQKLHAEGRTIIFVTHNPEIAQYSSRNIVLRDGQVKEDSTNPDILSAAEALA ALPVQEE >gi|226332000|gb|ACIB01000056.1| GENE 45 40810 - 42033 1365 407 aa, chain - ## HITS:1 COG:AGc3332 KEGG:ns NR:ns ## COG: AGc3332 COG0845 # Protein_GI_number: 15889118 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 2 374 37 435 437 196 32.0 6e-50 MKKKKIILIAVSLAILAGGGVWLFGGSTAKHKVTYATATVSKGEISESVTATGTIEPVTE VEVGTQVSGIIDKIYVDYNAAVTKGQLIAEMDRVTLQSELASQRATYSGAKAEYEYQKKN YERNKGLHEKGLISDTDYEQSLYNYEKAKSSFESSQASLAKAERNLSYATITSPIDGVVI SRDVEEGQTVASGFETPTLFTIAADLTQMQVVADVDEADIGGVEEGQRATFTVDAYPNDV FEGIVTQIRLGDASSTSTSSSSTTVVTYEVVISAHNPDLKLKPRLTANVTIYTLDRKDVL SVPARALRFTPEKPLIGDNDIVKDCEGEHKIWTREGNTFTAHPVQIGITNGINTEITQGA SEGMVVVTEATIGNMPGGNVSPEGGQEGGGEQSPFMPSHPGSKKKGK >gi|226332000|gb|ACIB01000056.1| GENE 46 42070 - 43473 1360 467 aa, chain - ## HITS:1 COG:RSp0817 KEGG:ns NR:ns ## COG: RSp0817 COG1538 # Protein_GI_number: 17549038 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Ralstonia solanacearum # 48 461 68 468 491 81 22.0 3e-15 MTNVKRITLITVCLAAILPGNGLWAQQTEATGTTQTADSVSMPAQWDLQSCIDYALQQNI SIRRNRINAQSTQVDVKTAKAALFPSLSFSSSQNLVNRPYQESSSIISGSEVLKSSNKTT YNGNYGLNAQWTVYNGSKRLKTIEQEKLNNRVADLDVATSENDIEQSIAQVYIQILYAAE 
SVKVNENTLQVSEAQRDRGKQLLDAGSIARSDYAQLEAQVSTDRYQLVTAQATLQDYKLQ LKQLLELDGEQEMQVYLPALGDENVLSPLPTKTDVFRSAVALRPEIEASKLSVEASELGI GIAKSGYLPSVSLTAGIGTNHTSGSDFTFGEQVKNGWNNSIGLSISVPIFNNRQTKSAVE KAKLQYQTSQLTLLDEQKTLYKTIEGLWLDANSAQQRYAAAIEKLHSTQTSYELVSEQFN AGMKNTVELLTEKNNLLQAQQELLQSKYMAILNTQLLKFYQGDKITL >gi|226332000|gb|ACIB01000056.1| GENE 47 43926 - 44141 168 71 aa, chain + ## HITS:1 COG:no KEGG:BF1874 NR:ns ## KEGG: BF1874 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 71 1 71 71 126 100.0 3e-28 MKNTLLHIWNFYVEGFRSMTLGRTLWLIILVKLFIMFFILRLFFFPNFLSSQPTDKDKGE YVAGELIERGQ >gi|226332000|gb|ACIB01000056.1| GENE 48 44174 - 45727 1586 517 aa, chain + ## HITS:1 COG:Cj0081 KEGG:ns NR:ns ## COG: Cj0081 COG1271 # Protein_GI_number: 15791471 # Func_class: C Energy production and conversion # Function: Cytochrome bd-type quinol oxidase, subunit 1 # Organism: Campylobacter jejuni # 6 512 3 504 520 560 55.0 1e-159 MIESIDTSLIDWSRAQFALTAMYHWIFVPLTLGLAVVMAIMETLYYKTGNEFWKRTAKFW MKLFGINFAVGVATGLILEFEFGTNWSNYSWFVGDIFGAPLAIEGILAFFMEATFIAVMF FGWDKVSKRFHLISTWLTGLGATISAWWILVANAWMQHPVGMQFNPDTVRNEMVDFMAVA FSPVAVNKFFHTVLSSWVLGAVFVIGISCWFLLKKRDKEFAVASIKIGAVFGLVASLLTV WTGDGSGYAIAQTQPMKLAAVEGYYEGQNGAGLVAVGLLNPEKKTYDDGQDPFLFRIEIP KMLSLLAERKVDAFVPGIKNIIEGGYELKDGTKALSAAEKIEKGKKAIAALATYRTAKKE GDEAAAKEAYTTLQENVPYFGYGYIKDVNQLVPNVPLNFYAFRVMVILGGYFILFFILVL FFAYKKDLSKIRWMQYVALWTIPLAYIAGQAGWVVAECGRQPWAIRDMLPTSVSISKLDV GSVQTTFFIFLVLFTVMLIAEIGIMVREIKKGPTVNH >gi|226332000|gb|ACIB01000056.1| GENE 49 45753 - 46901 1161 382 aa, chain + ## HITS:1 COG:Cj0082 KEGG:ns NR:ns ## COG: Cj0082 COG1294 # Protein_GI_number: 15791472 # Func_class: C Energy production and conversion # Function: Cytochrome bd-type quinol oxidase, subunit 2 # Organism: Campylobacter jejuni # 7 382 10 374 374 306 50.0 4e-83 MGTYIFLQQYWWLVVSLLGAILVFLLFVQGGNSLLFCLGKTEEHRKMMVNSTGRKWEFTF TTLVTFGGAFFASFPLFYSTSFGGAYWLWMIILFSFVLQAVSYEFQSKAGNLLGKKTYQT FLVINGVVGPLLLGGAVATFFTGSDFYINKGNMVNEVMPVISHWGNGWHGLDALTNIWNV 
ILGLAVFFLARVLGALYFINNIADKELVAKCRRSLIANTVLFLVFFLAFVVRTLLADGYA VNPETKEIYMEPYKYFNNFIEMPVVLIVFLVGVVLVLFGIGKTLLKKTFDKGIWFVGIGT VLTVLALLLTAGYNNTAYYPSNTDIQSSLTLANSCSSQFTLKTMAYVSILVPFVIAYIFY AWRSIDNRKIDAKEMDEGGHAY >gi|226332000|gb|ACIB01000056.1| GENE 50 47025 - 48089 1181 354 aa, chain + ## HITS:1 COG:ZmolR.A_1 KEGG:ns NR:ns ## COG: ZmolR.A_1 COG3831 # Protein_GI_number: 15802594 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Escherichia coli O157:H7 EDL933 # 3 67 2 66 94 83 56.0 8e-16 MKRVFVFQDFKSQKFWSIDVRGTDVIVNYGKLGTDGQTQVKNFSSAGEAEKAAGKLIAEK TKKGYVETLEEVAKEMKVEAKKYALSYDEAEEGVNLMDKILKDKKLPSLKQITIGCWGYE GEDCSDIADGIVENKEKFAHFEGLFWGDIDFEEQEISWIEQVDLSPVLDAMPLLNNLKIK GTNNLSIGKKPRPNLKSLEIISGGLPDSVVEDILGSDLPNLEKLVLYVGVEDYGFDGDMN VFRPLFSKDRFPNLKWLGIVDAEEQNAVVEMFLESDILPQLETMDISAGVLTDEGARLLL DHVDKIKHLKFINMKYNYLSDEMKKELQKSLPMKIDVSDSQEYDDDYSYPMITE >gi|226332000|gb|ACIB01000056.1| GENE 51 48094 - 49104 677 336 aa, chain + ## HITS:1 COG:no KEGG:BF1940 NR:ns ## KEGG: BF1940 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 336 1 336 336 680 99.0 0 MRILVASNSTSKRTDYFIKAGRSLGADTCFVTYDELSAVLPDCRDTVVKLEPPVFREADF RKYNLLCEEYRSLLSRLADTDKSESVHFLNEPAAILCALDKVYTQRKLTGAGLKTTPLLS DALSTFDDLAAILCRQKRGGFLKPRYGSGAGGIMAVRYNHRRDEWVAYTTMSWEGGRVCN AKRICRLTNRKEIATLAEEVIRCGAVLEEWMAKEKLEGENYDLRVVCRGGEVDYVVVRCS DGAITNLHLNNKARLFEELSLAPSVREELFCRSITAMKALGLRYAGIDVLIARNTDTPYI IEVNGQGDHIYQDMYTENKIYANQIKTIESLFNGNR >gi|226332000|gb|ACIB01000056.1| GENE 52 49091 - 49942 476 283 aa, chain + ## HITS:1 COG:no KEGG:BF1941 NR:ns ## KEGG: BF1941 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 283 1 283 283 597 99.0 1e-169 MEIDELPSGTQDQKPDIDMNEIVGTHDILMLCFDTLRYDVSVAEEASGGTPVLNSCGNGW EKRHAPGNFTYPSHFAIFAGFLPSPAEPHMLRNRKWLFFPFQAGTGRIPPKGSYAFKEAT FVQSLAQVGYETICIGGVNFFSKRNDIGRVFPGYFNKSYWLPTFGCTDKNSAANQVDFAV DKLEKYPADRKVFMYINFSAIHYPNCHYVEGKKKDDKESHAAALRYVDSQLPRLFEAFRR 
RSDTLVIALSDHGTCYGEDGYEYHCISHEKVYTVPYKHFILRK >gi|226332000|gb|ACIB01000056.1| GENE 53 49939 - 51219 947 426 aa, chain + ## HITS:1 COG:STM4012 KEGG:ns NR:ns ## COG: STM4012 COG0635 # Protein_GI_number: 16767277 # Func_class: H Coenzyme transport and metabolism # Function: Coproporphyrinogen III oxidase and related Fe-S oxidoreductases # Organism: Salmonella typhimurium LT2 # 10 416 7 405 413 293 41.0 3e-79 MNEQQQISRYVSYMYSYPHKTAYRTLTPPVSLSPYLERLEGREASLYFHIPFCAHKCGYC NLFSQQCCDAERISLYLHTMRRQAEQLSVAAQGLKFTSFAVGGGTPLILDEGQLEELFCL AELFGVHPSRVFTSVETSPEYTQKSVLRQLRARGVERLSMGVQSFNETELKKLKRRPGLG TVVGALENIVEAGFPQFNLDLIYGIEGQTVESFMRSLNTALTYRPNELFIYPLYVRPGTR INVRSTDDIGYAIYKSARELLVGQGFVQTSMRRFVRRETTETEFSCGDEVMLSCGAGGRS YLGNLHYATPYAVRQQAIADEIDHYIRTTDFMTAANGFLLSTEEMQIRFIIKNLMYHRGV DLAEYEKRFGEKPDRNLFREFTDRGWIEETGRIVRLTEEGMAYSDYIGQAFISPVVRKLM SEYVYP >gi|226332000|gb|ACIB01000056.1| GENE 54 51233 - 52744 651 503 aa, chain + ## HITS:1 COG:no KEGG:BF1943 NR:ns ## KEGG: BF1943 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 503 1 500 500 1043 99.0 0 MSSLSTVRSIYYRGSLEHCNYTCSYCPFGRKSVSADTREDQEALDRFISRIGGWKYGSLR ILIIPYGEAMIHRYYREGIMRLAAMPHVIGVSCQTNLSFSVSRFLDEAEAEQADVSKFRF WASYHPEMVGVGEFASKVEMLRAAGIGVCAGAVGDPSAKEQIRKLRQLLDPSVYLFVNAM QGLRKPLSEEDIRFFGEIDNLFDYDRRNAKACLDGCVGGRETLFIDRKGDMYACPRSGIR MANFYDDSTSDFQPFCLRKVCDCYIAFSNLCDTPLRRMMGDGALWRIPERKKVEAVFFDV DGTLTDAQGRIPDRTVSVLEYMAKRLPLYLSTALPVSHAKKRLGNVFGLFSGGVFADGGL LCYGETIECVPIANPVTAGFPGCRVTRYTREGKVFKYAVLAPNTREAVRWLTELDEEAYQ LYQEGRLLTVVDSKAGKKNGLITLCARLGISLREVLVVGNTMHDWPMMSVAGYSCAVMDA EEKLRKLSGYVLNPDSIPVFFDI >gi|226332000|gb|ACIB01000056.1| GENE 55 52747 - 53703 1094 318 aa, chain - ## HITS:1 COG:CAC2945 KEGG:ns NR:ns ## COG: CAC2945 COG1052 # Protein_GI_number: 15896198 # Func_class: C Energy production and conversion; H Coenzyme transport and metabolism; R General function prediction only # Function: Lactate dehydrogenase and related dehydrogenases # Organism: Clostridium 
acetobutylicum # 1 318 1 324 324 333 50.0 3e-91 MKIVVLDGYAANPGDLNWDELRTLGECEIYDRTAPDEVLERSKDAEAILTNKVVITAEHM ASLPNLKYIGVIATGYNIIDVAAAKERGITVTNIPAYSTPSVGQMVFAHILNITQRVQHY ADEVREGRWTQSQDFCYWDTPLIELLGKKIGLIGLGQTGYNTARIAIGFGMKVWAYTSKS RLQLPPEIRKAELDQIFRECDIVSLHCPLTESTRDLVNTRRLELMKPNAILINTSRGPLV NEHDLAEALNNYKIYAAGLDVLSTEPPRADNPLLTARNCFITPHIAWATSAARERLMAIL VDNLKAYIGGKPVNNVAK >gi|226332000|gb|ACIB01000056.1| GENE 56 53787 - 55058 1219 423 aa, chain - ## HITS:1 COG:CAC0326 KEGG:ns NR:ns ## COG: CAC0326 COG2256 # Protein_GI_number: 15893618 # Func_class: L Replication, recombination and repair # Function: ATPase related to the helicase subunit of the Holliday junction resolvase # Organism: Clostridium acetobutylicum # 3 413 16 429 443 389 48.0 1e-108 MQPLAERLRPKTLDDYIGQKHLVGPGAILRKMIDAGRISSFILWGPPGVGKTTLAQIIAN KLETPFYTLSAVTSGVKDVREVIDRAKSNKFFTQSSPILFIDEIHRFSKSQQDSLLGAVE HGTVTLIGATTENPSFEVIRPLLSRCQLYTLKSLEKEDLLELLQRAITTDVVLKERKIEL KETGAMLRFSGGDARKLLNILELVVESETEETVIITDDLVTERLQQNPLAYDKDGEMHYD IISAFIKSIRGSDPDGAIYWLARMVEGGEDPAFIARRLVISAAEDIGLANPNALLLANAC FDTLMKIGWPEGRIPLAETTIYLATSPKSNSAYNAINDALALVRETGNLPVPLHLRNAPT KLMKQLGYGQEYKYAHNYEGNFVKQQFLPDEIKAKQLWQPQHNPAEQKHAERMKQLWGNE KNY >gi|226332000|gb|ACIB01000056.1| GENE 57 55082 - 57676 1843 864 aa, chain - ## HITS:1 COG:SP0648_2 KEGG:ns NR:ns ## COG: SP0648_2 COG3250 # Protein_GI_number: 15900551 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Streptococcus pneumoniae TIGR4 # 34 588 55 658 871 188 27.0 4e-47 MNFTLHTLNGIKLWLVLLLLSGAINVCRADSPRQTINFNRGWKYCQGDFDHAARPGFDDS EWEKIGLPHSFSTPYFMSKDFYVGYGWYRKAFPVKKEILGKKSFLEFDGVFQEAEIFVNG HLAGTHKGGYTGFSIDISAYLKEGKNLVAVRVNNCWRPDLAPRAGEHVFSGGIYRNVRLV IKPPTYIDWYGTWITTPDLAENKGKSGSVHIRTDVCNASGKTDTYRLLTTVVDAQGKEVS SVSTSQVLPDNATYTFEQQTKEIQAPQLWHPNHPALYKVISSLYHGQELIDRYETAFGFR WFEWTADRGFFLNGEHLYFKGANVHQDHAGWGDAVTETGMRRDIRLVKEAGFDLIRGSHY PHSPAFSQACDEIGMLFWAENAFWGIGGHKGDGYWNASAYPVNESDRAEFENSVKAQLKE 
LIRIHCNHPSIIVWSMSNEPFFTAPETINPMRKLLEETVKLSKQLDPTRPAAVGGAQRPL GEKRIDKLGDIAGYNGDGSYIPEFQQPGMPTVVSEYGSTTADRPGEYDPGWGDLAKNNAQ NGFPWRSGQAIWCAFDHGSIAGSALGKMGIIDYFRIPKRAWYWYRNAYKGITPPEWPQEG TPARISLVADRTDNIKADGTDDVMLSITILDASGKPVSNSPAVKLDILSGPGEFPTGTSI LFEKESDIRILDGKAAIEFRSYYAGETVIRATSPGLEPAEVKIRFTGSTPYTEAFKVKER PYTRFETPTKTDNLQTFGPNNPTFCSSSANGHSSAFAADGDESTYWQASENDLERSWTLD TEKGLSIRHIRIAFPDLAPYQYKVEVSMDREHWSLIPDQTNNKQNENIRMIQVVPGIQGR FVRISFTGEKAAITDVQVIGTVID >gi|226332000|gb|ACIB01000056.1| GENE 58 57712 - 60273 2094 853 aa, chain - ## HITS:1 COG:no KEGG:BF1947 NR:ns ## KEGG: BF1947 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 853 3 855 855 1701 100.0 0 MKQQYSKLLLVIFLLLSAPQAFAQLKGVITDSISHEPLMYISVFYEGKGVGGISNANGEY KVETRKGWNELTFSAVGYITKKVKIPAGAKELNVVLSPDDVMLEEVVVKPKKEKYSRKNN PAVEFMRKVIDHKKAQKLETNDYYQYSKYQKMKMSLNDVTPESLEKGIYKKFSFLKDQVE VSPETNKLILPISVQETSSKTIYRKDPESKKTIIEGMNSNGVEEFFSTGDMLGTILNDVF ADINIYDDDIRLLQRRFVSPIGSGAISFYKFYLMDTVMVDKNKCVHLTFVPQNPQDFGFT GHLYVLDDSTYAVKKCTMNLPKKTGVNFVENLDVVQQYEQLPNGNWVLTDDDMTVDLLVM KAIQGIQVKRTTKYSNYVFEPIEPRLFRLKGNVIKEADMLTKSDEYWAGVRQVPLTKTES SMDLFMNRLEQIPGFKYVIFGAKALIENYVETGTKKHPSKFDFGPINTMISSNYVDGTRF RLSGMTTAKLNPHWFFNGYGAYGLKDKKWKYEGNVTYSFRKCEFFPWEFPKHYISASYRY DVMSPMDKFLDTDKDNVFVAWKTTTVDQMSYVRDATLRYEMETLSGFSVAAMARHRKDTP AGKLQYIRNDAAKTIVPDITTAELGLTLRYAPGETFVNTKQRRRPVSLDAPIFTLSHTTG FKGVLGGEYNFNLTEASIWKRFWLSSWGKVDVTLKGGAQWNTVPFPLLILPAANLSYITQ KETFNLINNMEFLNDRYASLALTYDMNGKLFNRIPLIKKLKWREVFRFRALYGNLTEKNN PFKSNNPELFDFPQRDGSYTSYVMDPKVPYLEASVGIYNIFKLLHIEYVRRLTYLNHPGI NKQGIRFMIQVVF >gi|226332000|gb|ACIB01000056.1| GENE 59 60387 - 61472 1068 361 aa, chain - ## HITS:1 COG:MJ1504 KEGG:ns NR:ns ## COG: MJ1504 COG0381 # Protein_GI_number: 15669698 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine 2-epimerase # Organism: Methanococcus jannaschii # 1 358 1 361 366 188 32.0 2e-47 MKITIVAGARPNFMKIAPITRAIDAAKAQGKRISYRLVYTGVENDNSLDASLFADLHMKA 
PDAYLGVNGNNPTELTAGIMIAFERELTENPTHVVLVVDDLTATMSCAIVAKKQNIKVAH LVAGTRSFDMSMPKEINRMITDGLSDYLFTAGMVANRNLNQTGTENETVYYVGNILIDTI RYNRNRLIKPVWFSVLGLKEHEYILLTLNRHVLLNNKENLQELMETLLKKANGMPIVAPL HTYVRDAIKALGITAPNLHIMPTQSYLSFGYLMNQAKAIVTDSGNVAEEATFLGIPCITL NTFAEHPETWRTGTNELVGEDPAALGACMDKLMNGEWKQGTLPERWDGRTAERIVQILLG E >gi|226332000|gb|ACIB01000056.1| GENE 60 61557 - 62183 611 208 aa, chain + ## HITS:1 COG:PM0935 KEGG:ns NR:ns ## COG: PM0935 COG2860 # Protein_GI_number: 15602800 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Pasteurella multocida # 10 205 2 202 226 128 37.0 6e-30 MPTIFQIPTFIQILEFIGTFAFAISGIRLASAKQFDWFGAYVAGVAVAIGGGTIRDVLLD VTPFWMTNPIYLICSALALLWVIFFRKHLIHMHNTFFIFDSIGLALFTVVGISKTIDLGY AFWVAIIMGTMTGAAGGVIRDVFINEIPLIFRKEIYAMACVIGGVIYWGLDRLGVDAALT QVISGCCIFVVRALAVKYQICLPILKGE >gi|226332000|gb|ACIB01000056.1| GENE 61 62197 - 62691 365 164 aa, chain - ## HITS:1 COG:no KEGG:BF1888 NR:ns ## KEGG: BF1888 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 164 1 164 164 311 99.0 8e-84 MKTNRRIMLLILFIAGTGIFTLQAQNREERKELKEQTVKEKIESENYRIDINTAYPRRGR MIPLTSIYSVTIRNDSVFSQLPYFGRAYSIPYGGGQGLMFNAPIDQYTMAMGKRGAAKIN FTAKSPEDQFRFRITIYSNGSSSIDVDMQNRESISFSGDLILPE >gi|226332000|gb|ACIB01000056.1| GENE 62 62975 - 63943 939 322 aa, chain - ## HITS:1 COG:lin0962 KEGG:ns NR:ns ## COG: lin0962 COG0501 # Protein_GI_number: 16800031 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Zn-dependent protease with chaperone function # Organism: Listeria innocua # 74 320 52 302 304 169 38.0 1e-41 MQYVGIQTQQSRNNLRSGILLILFPCLVAVLTYLFCYLLITFTVEDDYGQYNTLAMTNQM FINLIPYIIGGVLVWFIIAYFTNSSIIKAATGARPLERKENKRIYNLVENLCMSQGMKMP KINIIDDDSLNAYASGINEQTYTITLSKGIIEKLNDEELEGVIAHELTHIRNHDVRLLII SIVFVGIFSMLAQIALRSVYYSSWTRSRNDKNNGAILILVLAMIVAAIGYFFATLMRFAI SRKREYMADAGAAEMTKNPLALASALRKISADPDIEAVEREDVAQLFIQHPGKQAKSALS GLSGLFATHPPIEKRIAILEQF >gi|226332000|gb|ACIB01000056.1| GENE 63 63965 - 64525 709 186 
aa, chain - ## HITS:1 COG:lin0961 KEGG:ns NR:ns ## COG: lin0961 COG1704 # Protein_GI_number: 16800030 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Listeria innocua # 4 186 5 185 185 150 45.0 1e-36 MNLLIILGIIIILVIIIASMYNSLVKLRNNRENAFADIDVQLKQRHDLIPQLVDTVKGYA AHEKETLERVIQARNGAVSARTIDEKITAENQLSSALAGLKITLEAYPDLKANQNFLQLQ EEISDVENKLAAVRRYFNSATKELNNAVQTFPSNLIANMFGFHKEMMFDLGTEQRANLEE APKIKF >gi|226332000|gb|ACIB01000056.1| GENE 64 64816 - 65319 474 167 aa, chain + ## HITS:1 COG:mll3697 KEGG:ns NR:ns ## COG: mll3697 COG1595 # Protein_GI_number: 13473184 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Mesorhizobium loti # 3 164 5 161 183 101 37.0 6e-22 MKSLSFRKDLIGVQEELLRFAYKLTANREEANDLLQETSLKALDNEDKFMPDTNFKGWMY TIMRNIFINNYRKIVRDQTYVDQTDNLFHLNLPQDSGFESTEGAYDLKEMHRVVNALPKE YKVPFSMHVSGFKYREIAEKLELPLGTVKSRIFFTRQRLQQELKDFV >gi|226332000|gb|ACIB01000056.1| GENE 65 65568 - 67343 1690 591 aa, chain + ## HITS:1 COG:AF1252m KEGG:ns NR:ns ## COG: AF1252m COG5016 # Protein_GI_number: 18677784 # Func_class: C Energy production and conversion # Function: Pyruvate/oxaloacetate carboxyltransferase # Organism: Archaeoglobus fulgidus # 1 460 1 434 480 186 31.0 1e-46 MKKEIKFSLVYRDMWQSSGKYQPRVDQLVRIAPLIIEMGCFARVETNGGAFEQVNLLYGE NPNKAVRAFTKPFNDAGIQTHMLDRGLNGLRMYPVPADVRRLMYKVKHAQGVDITRIFCG LNEVRNIIPSIHYALEGGMIPQATLCITFSPVHTVEYYTAIADKLIEAGAPEICLKDMAG VGRPAMLGQLTKVIKERHPEVLIQYHGHSGPGLSMASILEVCENGADIIDVAMEPMSWGK VHPDVISVQAMLKDAGFRVPDINMKAYMKARAMTQEFIDDFLGYFMDPTNKHMSSLLLKC GLPGGMMGSMMADLKGVHAGINMILKSNNQPELSIDDLLVMLFDEVEYVWPKLGYPPLVT PFSQYVKNVALMNVMARVKGEERWSMIDNNTWGMILGKSGRLPGPLDPEIVALAKEKGYE FTDEDPQKNYPDQLDEYRKEMQENGWESGPDDEELFELAMHDRQYRDYKSGVAKKRFEED LQRAKDAALAKQGFSEEDVKRMKRAKAEPITAMEKGRIIWEIDVESPSMPPEVGHKYEPD DVFCYIATPWNTYDRVLANFSGRIIEVCAKQGALVNKGDALAYVERCEEPA Prediction of potential genes in microbial genomes Time: Wed May 18 00:15:56 2011 Seq name: 
gi|226331999|gb|ACIB01000057.1| Bacteroides sp. 3_2_5 cont1.57, whole genome shotgun sequence Length of sequence - 88018 bp Number of predicted genes - 78, with homology - 77 Number of transcription units - 27, operones - 13 average op.length - 4.9 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 58 - 3435 3083 ## BF1894 outer membrane protein Omp121 2 1 Op 2 . + CDS 3457 - 5346 1722 ## BF1957 hypothetical protein + Term 5370 - 5414 10.4 3 2 Tu 1 . - CDS 5450 - 6196 646 ## COG0744 Membrane carboxypeptidase (penicillin-binding protein) - Prom 6223 - 6282 2.7 - Term 6212 - 6283 16.0 4 3 Op 1 . - CDS 6320 - 6994 854 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins - Prom 7051 - 7110 7.4 5 3 Op 2 . - CDS 7149 - 8492 1264 ## COG0534 Na+-driven multidrug efflux pump - Prom 8702 - 8761 5.4 + Prom 8443 - 8502 3.3 6 4 Tu 1 . + CDS 8581 - 9480 710 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily + Term 9658 - 9693 0.2 7 5 Op 1 . - CDS 9499 - 11058 1107 ## BF1900 hypothetical protein 8 5 Op 2 . - CDS 11083 - 11931 514 ## BF1901 putative anti-sigma factor 9 5 Op 3 . - CDS 11928 - 12479 335 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 10 5 Op 4 . - CDS 12532 - 13794 1094 ## BF1903 hypothetical protein - Prom 13850 - 13909 5.4 - Term 13874 - 13915 -0.9 11 6 Tu 1 . - CDS 13933 - 14475 573 ## BF1904 hypothetical protein - Prom 14506 - 14565 6.5 12 7 Op 1 1/0.000 - CDS 14831 - 15400 504 ## COG0716 Flavodoxins 13 7 Op 2 1/0.000 - CDS 15414 - 16181 503 ## COG0599 Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit - Prom 16201 - 16260 9.2 14 7 Op 3 . - CDS 16263 - 16826 307 ## COG0110 Acetyltransferase (isoleucine patch superfamily) - Prom 16920 - 16979 3.5 + Prom 16808 - 16867 6.4 15 8 Tu 1 . + CDS 17025 - 18302 963 ## BF1979 hypothetical protein + Term 18379 - 18422 4.5 16 9 Tu 1 . 
+ CDS 18798 - 19094 120 ## BF1980 hypothetical protein
17 10 Tu 1 . - CDS 19023 - 19226 78 ##
- Prom 19306 - 19365 4.7
+ Prom 19248 - 19307 6.0
18 11 Op 1 . + CDS 19331 - 19663 144 ## BF1981 hypothetical protein
+ Term 19670 - 19712 9.8
19 11 Op 2 . + CDS 19714 - 20652 409 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases
+ Term 20795 - 20819 -1.0
- Term 20611 - 20647 1.3
20 12 Op 1 2/0.000 - CDS 20755 - 22143 744 ## COG1232 Protoporphyrinogen oxidase
21 12 Op 2 . - CDS 22146 - 23510 642 ## COG0635 Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
22 12 Op 3 4/0.000 - CDS 23516 - 24745 491 ## COG0477 Permeases of the major facilitator superfamily
23 12 Op 4 . - CDS 24764 - 26833 945 ## COG1629 Outer membrane receptor proteins, mostly Fe transport
- Prom 26890 - 26949 1.6
+ Prom 27195 - 27254 10.2
24 13 Op 1 . + CDS 27277 - 29595 1654 ## BF1918 hypothetical protein
25 13 Op 2 . + CDS 29608 - 30807 798 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins
+ Term 30839 - 30880 7.2
+ Prom 31007 - 31066 7.8
26 14 Tu 1 . + CDS 31107 - 31973 449 ## BF1989 AraC family transcriptional regulator
+ Term 32006 - 32046 4.2
- Term 31994 - 32034 3.4
27 15 Tu 1 . - CDS 32115 - 32285 71 ## BF1922 hypothetical protein
- Prom 32432 - 32491 5.2
+ Prom 32486 - 32545 6.3
28 16 Op 1 . + CDS 32786 - 35017 1476 ## COG4772 Outer membrane receptor for Fe3+-dicitrate
29 16 Op 2 . + CDS 35073 - 38450 2323 ## BF1924 hypothetical protein
30 16 Op 3 . + CDS 38437 - 39966 1182 ## BF1993 hypothetical protein
31 16 Op 4 . + CDS 40005 - 41009 624 ## BF1926 hypothetical protein
32 16 Op 5 . + CDS 41016 - 42047 737 ## BF1927 hypothetical protein
33 16 Op 6 . + CDS 42070 - 42708 400 ## BF1996 hypothetical protein
34 16 Op 7 . + CDS 42726 - 43913 789 ## BF1929 hypothetical protein
35 16 Op 8 . + CDS 43917 - 45020 534 ## BF1930 chitinase
36 16 Op 9 . + CDS 45048 - 45968 597 ## BF1999 hypothetical protein
37 16 Op 10 . + CDS 45958 - 48207 1397 ## COG1596 Periplasmic protein involved in polysaccharide export
+ Term 48340 - 48388 3.3
38 17 Tu 1 . - CDS 48411 - 48647 140 ## BF1933 hypothetical protein
- Prom 48758 - 48817 2.8
39 18 Op 1 . - CDS 48820 - 51192 1370 ## BF1934 hypothetical protein
40 18 Op 2 . - CDS 51194 - 52087 805 ## COG3291 FOG: PKD repeat
41 18 Op 3 . - CDS 52084 - 52611 442 ## BF2004 hypothetical protein
42 18 Op 4 . - CDS 52624 - 53127 620 ## BF2005 hypothetical protein
43 18 Op 5 . - CDS 53133 - 54038 556 ## BF2006 hypothetical protein
44 18 Op 6 . - CDS 54091 - 55233 863 ## BF2007 hypothetical protein
45 18 Op 7 . - CDS 55261 - 56109 811 ## BF1940 hypothetical protein
46 18 Op 8 . - CDS 56158 - 56610 171 ## gi|253566901|ref|ZP_04844353.1| conserved hypothetical protein
47 18 Op 9 . - CDS 56607 - 58460 706 ## MXAN_0563 LysM domain-containing protein
48 18 Op 10 . - CDS 58489 - 59082 400 ## Sputw3181_0259 CHAP domain-containing protein
49 18 Op 11 . - CDS 59076 - 59315 147 ## gi|253566904|ref|ZP_04844356.1| predicted protein
50 18 Op 12 . - CDS 59312 - 59887 287 ## gi|253566905|ref|ZP_04844357.1| predicted protein
51 18 Op 13 . - CDS 59884 - 61710 1073 ## Fjoh_3259 hypothetical protein
52 18 Op 14 . - CDS 61724 - 62488 444 ## gi|253566907|ref|ZP_04844359.1| predicted protein
53 18 Op 15 . - CDS 62500 - 64356 1590 ## COG3501 Uncharacterized protein conserved in bacteria
54 18 Op 16 . - CDS 64382 - 66199 1076 ## BF1945 hypothetical protein
55 18 Op 17 . - CDS 66230 - 66670 318 ## BF2015 putative bacteriophage GP25 protein
56 18 Op 18 . - CDS 66667 - 67125 282 ## BF1947 hypothetical protein
57 18 Op 19 . - CDS 67130 - 67549 413 ## BF1948 hypothetical protein
58 18 Op 20 . - CDS 67552 - 67959 461 ## BF1949 hypothetical protein
- Prom 67979 - 68038 5.0
59 19 Op 1 . - CDS 68059 - 68421 252 ## BF1950 hypothetical protein
60 19 Op 2 . - CDS 68457 - 69452 520 ## BF1951 hypothetical protein
61 19 Op 3 . - CDS 69465 - 69869 242 ## BF1952 hypothetical protein
62 19 Op 4 . - CDS 69875 - 70303 363 ## BF1953 hypothetical protein
63 19 Op 5 . - CDS 70319 - 72802 1311 ## PROTEIN SUPPORTED gi|163764771|ref|ZP_02171825.1| ribosomal protein S8
64 19 Op 6 . - CDS 72815 - 74194 1321 ## BF1955 hypothetical protein
65 19 Op 7 . - CDS 74210 - 74659 456 ## BF1956 hypothetical protein
- Prom 74686 - 74745 5.1
- Term 74676 - 74720 5.7
66 20 Tu 1 . - CDS 74749 - 75138 334 ## BF1957 hypothetical protein
- Prom 75303 - 75362 8.7
- Term 75451 - 75504 2.3
67 21 Tu 1 . - CDS 75579 - 76322 563 ## BF1958 putative transcriptional regulator
- Prom 76508 - 76567 6.3
+ Prom 76579 - 76638 5.1
68 22 Tu 1 . + CDS 76877 - 77938 779 ## COG0156 7-keto-8-aminopelargonate synthetase and related enzymes
+ Term 77966 - 78014 9.6
- Term 78054 - 78101 14.1
69 23 Op 1 . - CDS 78160 - 81234 1566 ## COG1112 Superfamily I DNA and RNA helicases and helicase subunits
70 23 Op 2 . - CDS 81263 - 81448 223 ## BF1962 probabale oxalocrotonate tautomerase
- Prom 81517 - 81576 3.1
71 24 Tu 1 . + CDS 81840 - 82079 150 ## BF2031 hypothetical protein
+ Term 82141 - 82181 7.2
72 25 Tu 1 . - CDS 82446 - 82574 71 ## gi|253566929|ref|ZP_04844381.1| predicted protein
- Prom 82640 - 82699 4.5
73 26 Op 1 . + CDS 82655 - 82999 442 ## BT_2334 hypothetical protein
74 26 Op 2 . + CDS 82983 - 85028 1314 ## BT_2333 hypothetical protein
75 26 Op 3 . + CDS 85041 - 85394 375 ## BT_2332 hypothetical protein
+ Term 85480 - 85521 0.5
+ Prom 85556 - 85615 6.3
76 27 Op 1 . + CDS 85652 - 86545 293 ## gi|253566933|ref|ZP_04844385.1| predicted protein
77 27 Op 2 . + CDS 86573 - 87304 326 ## BT_2328 hypothetical protein
78 27 Op 3 .
+ CDS 87216 - 87827 232 ## BT_2327 hypothetical protein + Term 87907 - 87964 6.1 Predicted protein(s) >gi|226331999|gb|ACIB01000057.1| GENE 1 58 - 3435 3083 1125 aa, chain + ## HITS:1 COG:no KEGG:BF1894 NR:ns ## KEGG: BF1894 # Name: not_defined # Def: outer membrane protein Omp121 # Organism: B.fragilis # Pathway: not_defined # 1 1125 1 1125 1125 2185 100.0 0 MNNSKIINVRLMKKVLVLVLSFLSVTAFAQNITVKGIVKDGTGEPIIGGSVLVKGSSIGT VTDVDGNYTLSNVPADGVLEFSYIGMKKQDVKVSGKTVINVVLQEDTQILDEVVVTALGL KREQKALGYAVTEVKGDDLKAANTISPVAALQGKVAGVEIRQSDGGMFGATKIQIRGAST LKGNNQPIYVIDGVILDNSTSGNTTMDWDAGNNNANDYGNELKNLNPDDFETVSVLKGAA ATALYGSRGLNGAVVITTKSGKGFKGFGVSVSQTFGIDHAYRTPDIQTEYGVGLMPGWKD TDNNGSVWDPFQFKLDDKGDRTLIGAGSYGWGPKYDGQPIRNYDGTWTNYSPHKNNMLDL YQLGLNSNTNVAIRGGNDKTSYYTSLSYKKARSTSEKNTFERYSFLLKGSHKISDRVEVS AAMSFTNSNPKNSPRTVGERFVNPNGTIMTPMLDVNYFRDKYLGEHGGLASTSYGDKYGS VPGRDLFFMIDKYDYSQKETVVRPQMEVNVQILDWLRFKADANMNYYYTKFEEKQLGSGY ANEGGKYTMGQTTKEQATFGGTFTVNKQIQDFSVGGFARYEYYTSRSEAYKVYTDGGMVV PGQWFVDNSKNPKKSEASISNTKRMMSAVFALNLGWKNQVYLDVTGRNDWSSSLVYQNGM GTYSYFYPSVSGSWLLNETFDLPHWITFAKVRGSWAQVGNDTDPYYVNSVYGFETKEMYD GNIYVNTLDKTMKSLKLKPERKNAWEVGLDLRLFDSRLNFDFTYYKENTRDQIMSIEVPA ISGVNTQLINAGNIQNKGIEIAVNATPYKNKDWQWDVAMTYTKNKNTIISLHENVADYIA LSGYANDYDYHIGSVAKVGGDYGLLMSDILPAKNEKGETLLEWDDSWRGAYEARSGKVQE VGKMTPDFLGSLSTTLSWKNLSLHIATDMRFGGLVASYSNLYGTQAGWIKSSLKWRDPEH GGLSWTSQYGDSKGISYGDGVIPDGVFKNGTFATLVDGTKMDVSGMSYKQLVAEGKLEPT HAGTYHVNRAAWGQNTIFDTWVHELNYIALREITLSYRFPKSVASKFGAQGLGLSFSARN LGYLYNSLPNHLNPESVRGNTASEFRIRGYEPYTANYMMTINVDF >gi|226331999|gb|ACIB01000057.1| GENE 2 3457 - 5346 1722 629 aa, chain + ## HITS:1 COG:no KEGG:BF1957 NR:ns ## KEGG: BF1957 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 629 1 629 629 1271 100.0 0 MKQMMKKYLYMAAVAVVGTGFLMSSCKDEFAGQNTNPSTVSKPNVRYLFTQCAMSFQPAD YLQWFAGFDAMSTWVQATASGGGNSSKLNMVTQTGCGYQVNEVLRYTNEIKHQISLMSDD EKAKYEYIAYLCNPMLVYLGLEDSDMYGSRQYSEAEMARYGGTLTPKYDTQEELFELWLK 
QLDETINYLRENNPQDVLGAQDFIYRGKLDKWAKLANSLKLRIAARLINKDKARAIAIVN EAAQNPAGLILTLDDDFVFNKGKRDNNWNNDISVGAGTKQLIDFMVSNRDPRLFYFFQKN DYNSNVVQGFFDQKRALPSYVEANVNYTVDADGKKHFESWKAPGEPWVRYYGVPCQVDIN KKEEYKDYFDPNNELFYLLSKDGAKKTYTPIAYRNTENIKGLLIYTFPDVPDVAPVQDKE EYGWYGLYFSAGETNLLLAEFKLLGANLPMTAQQYLSAGVEMSVRGYDFVSAKNHIPYYD KTYTGDVHDKTISLKEGMIDEMLSHDAYHLTGDLSKDLEKVYIQQYIHYLMLPMDMFVTA RRSGVPMKNSTLLPYQDFDPLLGDQYVIPRRFPVSKPLDSDLLRDITIAAYQAQGYTYEG EMSNSPVTLSKERVWYDKEAPAFGTGPQQ >gi|226331999|gb|ACIB01000057.1| GENE 3 5450 - 6196 646 248 aa, chain - ## HITS:1 COG:CC0325 KEGG:ns NR:ns ## COG: CC0325 COG0744 # Protein_GI_number: 16124580 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane carboxypeptidase (penicillin-binding protein) # Organism: Caulobacter vibrioides # 24 215 27 212 229 210 54.0 2e-54 MKLRKPFRILRNLILFFFISSIGAVIFYRFVPVYVTPLMIIRSVQQLVSGEKVVCKHTWV PFDKISPSLPMAVIASEDNRFASHNGFDMIEIKKAMKENETRKKKRGASTISQQTAKNVF LWPQSSWIRKGFEVYFTFLIETCWSKERIMEVYLNSIEMGKGIYGAQATAKYKFKTTAAK LTRGQCALIAATLPNPIRFDSAHPSPYIKRRQGQILRLMNLVPKFPPVDKEKAKGQDTKK QKNKKKKK >gi|226331999|gb|ACIB01000057.1| GENE 4 6320 - 6994 854 224 aa, chain - ## HITS:1 COG:FN1265 KEGG:ns NR:ns ## COG: FN1265 COG2885 # Protein_GI_number: 19704600 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: Fusobacterium nucleatum # 47 208 38 196 202 64 32.0 2e-10 MKKIKFMALFLSMALVFGSCGSMNNTAKGGVIGGGSGAALGAIIGGIAGKGKGAAIGAAV GTAVGAGAGVLIGRKMDKKAAEAAKIKDAQVEQVTDNNGLAAVKVTFPSGILFAFNSSAL SAASKQSLAEFANILKEDPTVDVAIIGHTDKVGSYEANQKVSANRAYAVENYLQACGVKP YQFKKVEGVGYSQYNESETPEQNRRVEIFMYASEQMIKNAEAGK >gi|226331999|gb|ACIB01000057.1| GENE 5 7149 - 8492 1264 447 aa, chain - ## HITS:1 COG:TM0815 KEGG:ns NR:ns ## COG: TM0815 COG0534 # Protein_GI_number: 15643578 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Thermotoga maritima # 12 444 18 457 464 135 26.0 1e-31 
MQGIKNLTQGPINKQLFNLAMPIMATSFIQMAYSLTDMAWVGRLGSEAVAAVGSVGILTW MSGSISLLNKVGSEVSVGQSIGAQNHEDARNFASHNITIALIISLCWGGLLFLLARPIIG IYELEAHITENAIAYLRIISTGLPFIFLSAAFTGIYNAAGRSKIPFYISGTGLVLNILLD PLFIFGFGLGTNGAAYATWISQAAVFGIFIYQLRCRDALLGRFSFFTRLKKKYTHRILKL GLPVATLNTLFAFVNMFLCRTASEQGGHIGLMTFTTGGQIEAITWNTSQGFSTALSAFIA QNYAAGRTDRVIKSWHMTLLMTSIFGTLCTLLFVFFGNEIFALFVPEQAAYEAGGVFLRI DGYSQLFMMLEITTQGVFYGIGRTIPPAIISITCNYMRIPLAILFVRMGMGVEGIWWAVC VTTVAKGLILAGWFALIKRKVLSRPIL >gi|226331999|gb|ACIB01000057.1| GENE 6 8581 - 9480 710 299 aa, chain + ## HITS:1 COG:BS_ybfH KEGG:ns NR:ns ## COG: BS_ybfH COG0697 # Protein_GI_number: 16077290 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Bacillus subtilis # 1 292 1 290 306 194 41.0 2e-49 MKTTQRTAGWYHVMAAVTVMIWGTTFVATKVLIKYGLSPVDILFYRFLLAYICIWFFSPR VLLAKSWQDELRFVGLGLCGGSLYFVAENTALGMTLASNVSLIICTTPILTALLAPFFYK GDKLKARLIGGSLMALIGVGLVVFNGSFILQLSPAGDILTLIAALMWAFYCLLLRRMNTH YPTLFITRKVFFYGLVTLLPLFLVYPLQTDIHILFRPVVALNLLFLGVIASMLCYIMWNT AVKQLGVVCATSYIYVVPLITLLTSAIVIDETITIVALLGSALILSGVYIAERGVNLKK >gi|226331999|gb|ACIB01000057.1| GENE 7 9499 - 11058 1107 519 aa, chain - ## HITS:1 COG:no KEGG:BF1900 NR:ns ## KEGG: BF1900 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 519 1 519 519 1023 99.0 0 MKASKSLCLQCLFTCLLLFIAARVKADDSGILDRIIRLPKSEMTVYKLLSKITEETGYLF IYDSKLVDNERTVKLKGGKQTVRQAIYSIIGNDNLKLRSVDKHIIIYRPETALSISKEEG LCRDSTLSFTLEGTLIDQLSREPIPYATVGAEGSSIGSVTNQNGSFRLHLPDSLRNGRIR FSHLGYVPQTTDASLLAGRNGTFALEPKVIPLQEVIVRIVNPVRLLREMLQFRKKNYSKV PVYLTSFYREGIEQKNRFVSLTEGIFKIYKASSSTPEKTDQVKLLKMRRITNQAVKDTLI AKMKSGIHASIELDLIKSLPDFLLPDSKECVYVYTSSDLAVIDNRLAHVVSFEQRPSIKY PYYCGELYIDSENSALLRARFELTPRYIHKAANMLVEKRSRNIRIIPQKVVYTVSYKPWK QTYYIHHVRGDLHFKIKQKNKWLNNTSLHTWFEMVTCKTETDNVNRFDHNERLSVHTIFA ETPFVYDKSFWEDFNVIPPEKELSEAIEKISSKIEETEN >gi|226331999|gb|ACIB01000057.1| 
GENE 8 11083 - 11931 514 282 aa, chain - ## HITS:1 COG:no KEGG:BF1901 NR:ns ## KEGG: BF1901 # Name: not_defined # Def: putative anti-sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 282 1 282 282 520 98.0 1e-146 MNRIEKNKTQTDQAWNKLHNRLETDGLLPTVTEHRFATRPTAWIGIAAIAAIISLCVYLP TVLRTDRHLSGGELLVKANKEESILVTTLEDGSVVYLSEQTSLEYPKHFSKKRREVSLKG NALFDIAGNRARPFFIETGKVQIEVIGTAFHVRNSGNSPFELAVQRGEVKVTQKQNGQEI HVKAGETATLLGDEWQLTVTENSEQFTRYMQNMRFKDEQLDHILHAINLRQTEIHLQSSP ELGKHVLTVSFSEDSPEKMAELIGLALNLKCTRNQNIITLSE >gi|226331999|gb|ACIB01000057.1| GENE 9 11928 - 12479 335 183 aa, chain - ## HITS:1 COG:PA0149 KEGG:ns NR:ns ## COG: PA0149 COG1595 # Protein_GI_number: 15595347 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 44 169 38 161 181 65 33.0 4e-11 MLNDVFILTQIKEGNIKAFETLFRQYYTPLRLYAASITGEPDVAEEIVEELFYVFWKDRE KLEIFHSVKNYLYRSVRNRSIQYCEHQDVKRRYQDAILSVPVNIASPDPQEQIEYKELQQ IINRTLEKLPERRLHIFRLHHTEGKKYSEIASLLSLSVKTVEKEMTRALRTLRKEIENYI QIS >gi|226331999|gb|ACIB01000057.1| GENE 10 12532 - 13794 1094 420 aa, chain - ## HITS:1 COG:no KEGG:BF1903 NR:ns ## KEGG: BF1903 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 420 1 420 420 836 100.0 0 MKTQWIRSIGCILAVLILSGIMPLAAQDNAERYTTISGVVKDKLNKKKLEYVNVSIPGSS VGTVTNADGEFTLKIPESVQAKDIEASHVGYLNSRIPLKEENPTERIVWLTPYANLLSEI LVRARDPRSIVEEALRKIPANYSPQSNMLTGFYRELAQKGRRYINISEAVIDIYKTPYNE TAEHDRVQIYRGRRLLSQKQSDTLAVKLLGGPNMAIYMDIVKNPDCLLAQEDLLFYEFRM EDPTSIDDRSQYVISFRPRVKLSYPLCYGTLYIDKERLSFTRAEFNLSMDDKNKATQAIL RKKPFGLRFKPVEVSYLISYKNLGGITYLSYIRNNIRFKCDWKRKLFSTNYTILSEMVVT DRKENNITAIPYKAAFKQNHVFSDKVDNFTSDNFWGGYNIIEPTESLEHAVNKLKKQQKQ >gi|226331999|gb|ACIB01000057.1| GENE 11 13933 - 14475 573 180 aa, chain - ## HITS:1 COG:no KEGG:BF1904 NR:ns ## KEGG: BF1904 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 180 1 180 180 370 100.0 1e-101 
MIVQDLNFDQTVAMAKAKAESMIQNGLYSRFGLSEKERLDKALLGCIGELAFQKHLKNLG IPFELDQTDFQSHHSDEFDVKVNGAKIDIKVAKKTTANPPTDNWTYGYPQEQHPETKDYV VVGWVDFNRKEVGFYGWIRGKQIVEFKVVTQNSYAKYPYLTPNHEFKWGCLTKDLNEILK >gi|226331999|gb|ACIB01000057.1| GENE 12 14831 - 15400 504 189 aa, chain - ## HITS:1 COG:YPO2003 KEGG:ns NR:ns ## COG: YPO2003 COG0716 # Protein_GI_number: 16122245 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Yersinia pestis # 25 163 73 200 235 81 31.0 1e-15 MYKIIFVFLAIMGIATASCAQQKQGANRKQPNNKVLIAYFSATGTTAGAAEKLSKVTGGE LYEITPAQPYTNADLNWNNKQSRSSLEMNDPKSRPAIRKSSIDIADYDVIFVGYPIWWNL APRIINTFIESYHLKNKTIILFATSGSSSITNSMATLKKSYPDLIWKEGKLLNGMNENDI REWISKLDY >gi|226331999|gb|ACIB01000057.1| GENE 13 15414 - 16181 503 255 aa, chain - ## HITS:1 COG:Cgl1022 KEGG:ns NR:ns ## COG: Cgl1022 COG0599 # Protein_GI_number: 19552272 # Func_class: S Function unknown # Function: Uncharacterized homolog of gamma-carboxymuconolactone decarboxylase subunit # Organism: Corynebacterium glutamicum # 7 110 4 107 107 145 62.0 1e-34 MAQEKIKQTAGRDQLGDFAPKFAELNDDVLFGEIWSRTDKLSLRDRSLVTITSLISQGIT DNSLTFHLQSAKNNGISRTEISEIITHIGFYAGWPKAWAAFRLAKEVWAKDTTEVDAKAA FQREMIFPIGEPNTAYAQYFTGNSYLAPISHEQVNISNVTFEPGCRNNWHVHHAKKGGGQ MLIGIAGRGWYQEEGKPAVEILPGTVIHIPANVKHWHGATAESWFAHLAFEIPGEDSSNE WLEPVTNKEYNRLPQ >gi|226331999|gb|ACIB01000057.1| GENE 14 16263 - 16826 307 187 aa, chain - ## HITS:1 COG:MA0410 KEGG:ns NR:ns ## COG: MA0410 COG0110 # Protein_GI_number: 20089303 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Methanosarcina acetivorans str.C2A # 16 186 22 191 191 168 49.0 5e-42 MTIREFKEHVKTRKLLDTEEIHQFMDIMSNEARRITFQLNTTYHTPNEVRELLSELFGYR VPSSFRVFPPFYTDFGKNITIGEDVFINACCHFQDHGGITIGDGCQIGHNVVFATLNHGL LPEERKSTQPAPIVLGKNVWVGSNATILQGVSIGNNAIVAAGAVVTKDVPSDAVVGGVPA KFIKTIR >gi|226331999|gb|ACIB01000057.1| GENE 15 17025 - 18302 963 425 aa, chain + ## HITS:1 COG:no KEGG:BF1979 NR:ns ## KEGG: BF1979 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 425 1 425 425 872 99.0 0 MKRGKRLILPLLGTLILTSLFSCGVDRWPEYYPETGRDIWIDSVMRQEYLWYRDMPSPAA PDYFQKPEAFLKKAVASMDNGFSKIDSLLDEPIPSYGFDYTLYKVLDNDTAYNALISYVV PGSPAEEAGLQRGHWIMMMNGDYITKKVESELLQGSTRQLQIGVYKEVVGEDGEVTGGVV PIGETTMPASRSLADKPVHRFEIIPWNGKKVGYLMYNEFKAGPTTDSQAYNDDLRRAFRD FQTGGVNEFVLDLRYNTGGSLDCAQLLCTMLAPADKMNQLLALLRYSDKRVEANQDLTFN PELIQSGANLNLSTVYVLTTNATRGAAEMVINCLNPYMKVVLIGTKTAGEYVATKPFVHP TDRFILNLVVCNVYNAEEKSDYATGFKPTYEYNEDSYLSTYLPFGNTNETLLNAALKIMS GITDK >gi|226331999|gb|ACIB01000057.1| GENE 16 18798 - 19094 120 98 aa, chain + ## HITS:1 COG:no KEGG:BF1980 NR:ns ## KEGG: BF1980 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 20 98 1 79 79 151 98.0 6e-36 MWSQQERTFENFFYRDIYLLLTSCQLINQNPFTYELLVPILHQDTFVQMVLRGITYLPLL ILVMAVMDGLLCIIGLLPFADRNKGIGHEYDCNELFGR >gi|226331999|gb|ACIB01000057.1| GENE 17 19023 - 19226 78 67 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MITATMEGNAAAQDFLSHLPLKATLPFIPRGVALQFSAFLAPTHLSSEKFVAVVLMTNSF VTVCEWQ >gi|226331999|gb|ACIB01000057.1| GENE 18 19331 - 19663 144 110 aa, chain + ## HITS:1 COG:no KEGG:BF1981 NR:ns ## KEGG: BF1981 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 110 6 115 115 193 99.0 2e-48 MKQLIACCGLDCENCDARIATVRDDNELREKTAQKWSIMNNAPEITPATINCMGCRTDGA KFAYCNDYCPIRKCVNEKGYNTCGDCKELDDCQIVGAIFQHAPDAKENLL >gi|226331999|gb|ACIB01000057.1| GENE 19 19714 - 20652 409 312 aa, chain + ## HITS:1 COG:YPO3444 KEGG:ns NR:ns ## COG: YPO3444 COG0454 # Protein_GI_number: 16123592 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Yersinia pestis # 154 312 9 167 167 97 34.0 2e-20 MKQEYISRIRSFNRYYTKILGILNKYYLGSELGLPEVRIIQDVYLHPDRSSKDISNELNM DKGLLSRLLKQLEQKEYIFRKSTEKDNRMGLVNLTEKGCEVYYRLNTAANQSIERIFSHL EDRQLQRLVHCMDSIYKIINSVETGLTVDNNEPIVIRPIEESDNASIASVLRASVEEHGA 
PKVGTFYDDPHTDRMFQTFNIKGAEYWIVESNGVILGGGGFYPTKGLPHGYAELSKFHFR PELRGRGIGKRLLQFIEQRAVSAGYVYMYIVSYHQFGNAVSMYEKYGYEHIDNALDQSGL YQDAPFHMVKAL >gi|226331999|gb|ACIB01000057.1| GENE 20 20755 - 22143 744 462 aa, chain - ## HITS:1 COG:aq_2015 KEGG:ns NR:ns ## COG: aq_2015 COG1232 # Protein_GI_number: 15607001 # Func_class: H Coenzyme transport and metabolism # Function: Protoporphyrinogen oxidase # Organism: Aquifex aeolicus # 11 461 3 427 436 211 33.0 3e-54 MNQTVTENHRLRDTIIIGAGLTGLTTAYCLTRKGCDIEVIEQSPCVGGQIRTYHENGFTF ESGPNTGVISHPEVAELLAELSPTCRLETAREASRQRLIWKGDRFHPLPSGLFSAITTPL FSTKDKFNILGEPFRSKGNNPDETIGELVQRRLGISYLHYAVDPFISGVYAGDPMRLVTR HALPKLYQLEQTYGSFILGGIAKSFSHRSERDRLATRKIFSTYGGLSNLTKALEQAIGIK RFSLGATSTSLMPCEQGWVVSFTDSCGIVNRIHCRKVITTTPAFVLPSLLPFVPDKQMNR ISNLTYAPVMQVSVGLRNTYGKEFHAFGGLVPSCEQKPVLGILFPSACFDNRSPEGGALY SYFLGGTRHPELLEKSDDEIIRLITTGLNEMLDYPAGIVPDLIRIFRHKKAIPQYESSST DRFAAINELQKQYPGLVVAGNLKGGIGMADRIKQAFEIARER >gi|226331999|gb|ACIB01000057.1| GENE 21 22146 - 23510 642 454 aa, chain - ## HITS:1 COG:aq_2124 KEGG:ns NR:ns ## COG: aq_2124 COG0635 # Protein_GI_number: 15607073 # Func_class: H Coenzyme transport and metabolism # Function: Coproporphyrinogen III oxidase and related Fe-S oxidoreductases # Organism: Aquifex aeolicus # 4 440 8 441 456 312 38.0 8e-85 MYTEIINKYNVPVPRYTSYPPANYFEPFTNARYLEAVQQSNQASERALSFYLHIPFCRHL CHYCGCNSYPMARPEIIESYVEALHQEIDLILPLLDKDRPIAQIHYGGGSPTAIPVALIK ELNAHLLSSFPVIDRPEIAIECHPGYLSEKDWLQLTECGFNRLSIGVQDFNIEVLKTVNR RPSLLPMEDIFILLREKGISINLDFLYGLPKQTVENFTRNIEQAILLSPDRLVMFSYAHV PWINKRQLLLEKSGLPDNHEKRTMFDTAAGLLHKSGYQSIGMDHFVRPNDELSIAMQTKK LHRNFQGYCTRRTTAQVYGLGVTAISQLESAYAQNTKDIPHYIKTISKGELSITKGYALS PTEQLTREVIETLMCNGCIDWRDLSKRLHVSVSTLKAATAYDEKKLSGFADDGLIYYTDD YLEMTTAGSAFVRNVAASLDKLMLHSPHSYSKPL >gi|226331999|gb|ACIB01000057.1| GENE 22 23516 - 24745 491 409 aa, chain - ## HITS:1 COG:all4025 KEGG:ns NR:ns ## COG: all4025 COG0477 # Protein_GI_number: 17231517 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P 
Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Nostoc sp. PCC 7120 # 22 397 11 386 396 176 34.0 6e-44 MRINKLSPLSRKTLNLSTFFCLYIAQAIPMSFFSTAIQVLMRQADYSLSSIALLQLIKLP WILKFLWAPLVDRHCITLKDYKRCIITSEIVYALLILMVGLLDIQTDLYLIIGLVFLSLI ASATQDIATDTLAVLSFGKSDKSLVNSMQSMGSFGGTLIGTGILLLVLQHYGWHVVIPCL CIFVLLAIIPLLKNKHMRIIPKEPSKRAQFTDFIWFFARRNIWKQIGFLLLYYASIIGIL SVLRSYLVDLGYSMKEIGIMIGIGGTGAAFASSFLAGLLVRKIGRYHSRILFAIFILLTT LYFMCISCTVPSFSMLCLGIVLLWSAYGMATIVVYTTSMDCVRKGCEGTDFTIQTVLTHL SGLLIAFLSGVVADRTGYHGLFIFEVILASISLIYIFYFFRKESQPIHS >gi|226331999|gb|ACIB01000057.1| GENE 23 24764 - 26833 945 689 aa, chain - ## HITS:1 COG:all4026 KEGG:ns NR:ns ## COG: all4026 COG1629 # Protein_GI_number: 17231518 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Nostoc sp. PCC 7120 # 9 689 164 854 854 235 27.0 2e-61 MIKRLFFLLPFSTIVSANEPDTIQVKRIDLDEVTIVAFKQNTPNREPLSISTLDNRFLKE NEISGAKDLSSLLPNFYMPDYGSKQNSPVYIRGIGAKKDAPSVGFYVDGIPYFETSAFDI DLSDISSIEVLRGPQGTLYGRNSIGGTINVYTHSPLDYQGTYFRLGYGSYNDMRLIASNY TKVNEQLGLSFSGNYHHNDGFFTNLHTHKKADKLDNGAGRIGLTWKPAAHWTTRFITSYE YSNQGGYPYGLYNADKGTTEAVNYNNEGLYRRNLLTSGINIRYNGPHISFNSQTSYQYIQ DKMGIDQDFSPRNIFYGQNKIRQHMYCQEFTVKSVNKSRYHWITGAFVFRQTINRKVDLS RFTDTTRHLTNSGIPTQGIAFYHQSTLDLLQGLSCSVGLRYDYEHARCDFSKVQQPLNGN GETKSLEQFNRSLHFGQFTPRFSMQYLSSHNQLFYASVSKGYKAGGFNVSFLNNDDYLYS PEYNWNYEIGTKLSFLNNRLSADLSLFYIDWRNQQITNTIPTVGNVIRNAGRSRNKGIEA SFQARPTKSWMMYMNYGYTDARFVHYQKEERGILKDYGGNYLPMVPRHTFSLTTGYSFYD ICSWIDRLTLNAGVSVTGPIYWYEDNQAKQSPYALVNLRISINKGCFTWEAWSKNLTNTD YLSYYFVTSKAYAQKGKPITLGTSVSISL >gi|226331999|gb|ACIB01000057.1| GENE 24 27277 - 29595 1654 772 aa, chain + ## HITS:1 COG:no KEGG:BF1918 NR:ns ## KEGG: BF1918 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 772 1 784 784 1456 98.0 0 MNNAKLLHLMVYILTIILGQSCTEVDITMPKGPKGDRGMSAYEFWKENVENGVISWPKKE 
TEITDFFKYLKGKDGLDGKSAFELWKEEVATGALDNPHRPGSMWPVSQNNLRDFWYYLTG ASGENGQTPHIGNNMNWWIGNKDTGIRAQGRDGQNGEDAVPPVVTIGDNGNWLIDGVDTG KPSRGEEGVAGTTPTVTIGENGNWVINGKDTGKAAIGKDGRSPEVIIGTNGNWYINGKDT GIRAYGKDGINGKDGANGKDGINGKDGANGKDGINGKDGAAGKDGANGKDGANGKSAYEL WVESVEAGCNNTGPKVKNPHNPSLDWDCGKTTLSDFWEFLRGADGKDGADGKDGKPGVPG KPGAEVTIIKGVPNVIALYSQQEFGEYVRTTDGGVAYRVYDESGNKAPKAVVKGIPGLDP AKTYTANEEGEFIIPKEDLPQIDDIDARWGKVKEVTINNVTKESAENTYVPNRMQIRMIY IGTSPYLDYEHNLQFRVERKTDPGAEWKTLPSYLPNVNAGFTAYQVTNPEDPTSLDKTKK IESTTPNMSSTSMSINPNRYVKENPAGIKNGITDFWDGKDNYFSIVKDTPYYGETIYWNG VCKMAPYQIPPTLKTLALTKASAESGDDVFLNKAQGEFDFSTIDFNIISKRELVKTVKQN GIDYIEPEYYTLEEAKGLLLCYVKFTYTSPLGVQTATSEHNKSSYNKPEYAALSPYLGAT IYSVGANSAFIYSNSGSGVSLGVLKKKVDDGTYYVENTYKNMPAISVTYKDK >gi|226331999|gb|ACIB01000057.1| GENE 25 29608 - 30807 798 399 aa, chain + ## HITS:1 COG:PA1777 KEGG:ns NR:ns ## COG: PA1777 COG2885 # Protein_GI_number: 15596974 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: Pseudomonas aeruginosa # 94 345 74 305 350 71 27.0 2e-12 MNFSVIINRLLIGALAFTVFPFYTYAQNDKVERQTAHRTKAWEIGVGGALINWDRVTFSD FRQVDGNYLYRMNIDHLFGGIQLYAARELNPWFYLDLQGTLGLARKQVETGGRKFDFMYM AGPGLQFRLTPLFKSKYVEPYLRVGVNYLHHDFYAINAGKFENDPIGEAEWTSSNPWNKE KIGSKQSYFPLSFGAGVQAWLNDHWGVGLQGEYIMPVDKKQTRFVQASMRIMFRLGGSTK RPMPVVQYIDRPVDRIVERIVEKRIEVPAVVESHVCDLFDNIHFAFDKDVITSESEITLD KIADLLKSYPDNNFLITGYTDARGSDNYNIDLSKRRAKAVYSALLKRQVPQHMLKWRGVG YHASSVPASGPDKVRMGDRKVSIERVTNSDYWGWLTNEE >gi|226331999|gb|ACIB01000057.1| GENE 26 31107 - 31973 449 288 aa, chain + ## HITS:1 COG:no KEGG:BF1989 NR:ns ## KEGG: BF1989 # Name: not_defined # Def: AraC family transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 288 1 288 288 563 100.0 1e-159 MKLLYFKEHLSCINYQINVNTGFVYYNLEKDSVSKIDNSASPCILFLLDGEVSIDSGEYQ NVHIEKDKMVLIPQHVDNKIEVIYDAKCLLLFWNKDIRVCDKVYMNSLSSYKERKKEMCV LPIRDPLQAVLNSVVAYLYAKMQCKHMHLIKQQEVLLVLRGYYTKKELFTFFSSILGNTG 
HFEDFVMNNYRKVKSVKEFAGLYCTSERSFNRKFQNCFKESPYQWMQKKKAELIREKISE SDTPFQEIAMDFDFNSQAHFTSYCKRLFGMTPSKLRTESKKVAPDLEY >gi|226331999|gb|ACIB01000057.1| GENE 27 32115 - 32285 71 56 aa, chain - ## HITS:1 COG:no KEGG:BF1922 NR:ns ## KEGG: BF1922 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 56 1 56 56 94 100.0 8e-19 MAHYRDKSCTISLSKVLNCTLVYTETFLLSGINIVEWMQSTKNSKQLRASSDGTKK >gi|226331999|gb|ACIB01000057.1| GENE 28 32786 - 35017 1476 743 aa, chain + ## HITS:1 COG:SMc02721 KEGG:ns NR:ns ## COG: SMc02721 COG4772 # Protein_GI_number: 15966136 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for Fe3+-dicitrate # Organism: Sinorhizobium meliloti # 19 625 24 633 932 358 38.0 3e-98 MTNRKPLFIILFSLGYSGIYSQEQQVKKDSVYQLQEIVVSSQQILGSKFKARNRTGSAYY ISPEEIRRLGYTDINRMLKAVPGVNMYEEDGFGLRPNISLRGTKAERSERISIMEDGVLA APAPYSAPAAYYFPNVARMEAIEVLKGSSQVQYGPFTTGGAINLVSTPIPNSFSGKANIS YGSKNTFKSHTSVGSSWKHFGYMVEYLRYQSDGFKKYEDHAAKGFKRNDIIAKIRVKTDH VKGVNHALELKFGYADENSDETYVGLSADDFKKTPFLRYAGSQMDKLKTDHRQWVATYLL TFSNKLKITTNAYYNYFHRNWYKLNDVRAGITSKEKRSIADVLVDPETNIRYFDILTGKT DREEEALLVRANNRTYRSRGIQTRAEYRFNLNEFFFDLEFGLRYHADEEDRFQWDDSYSM KNKKMVLFMEGIHGTNANRVTSANALAGYLLAKLRYDAWTVTAGLRYEDVDLLKKDYTKE DLARSGKVRIETPNHARVLIPGVGLHYQLMPAASVFFGIHKGFAPPSAELYQKPESSVNM ELGTRVAIGNFRAELIGFYNNYSNMLGSDLAASGGAGTLEQFNVGEARVKGAEFLVQYQP LPKNCNVRLPLQVSYTYTDTEIRNSFESHSWGNVVRGDEIPYIFKHALNMQLGIECKWFY ANIGTRYNSDMRTSPGQGTIAEREKVPANLIFDASLNVFVNKYLTVRLNAINLTNRVYLT SRHPAGLRAGHPFGIYAGANVQF >gi|226331999|gb|ACIB01000057.1| GENE 29 35073 - 38450 2323 1125 aa, chain + ## HITS:1 COG:no KEGG:BF1924 NR:ns ## KEGG: BF1924 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1125 1 1125 1125 2230 99.0 0 MNTSSMIRIVISSLLMAIFSQTCPAQVAVEIKGTVRSVTGESLPGATVIVEGTARGVITD VEGRFTLKARKGQMLKVSFVGMKPRLIKASSGIMNIILQDNVQEIEGVVVTGYQQIKNRV FTGAAASVKLDEIKLNGVSDVSRMLEGRIAGLSIQNVTGSFGTAPRINIRGGASIMGNVQ 
PLWVIDGAVYEDLVSLTLDQLASGDAVTLISSAVAGLNASDIEDIQVLKDASATSIYGAR ALNGVIVITTRSGKRNAPNRFSYSYELSMRSIPSYTNFDLLNSQESMSVYQEMGRKGYFS LQNTLYGRRSGVYYQMYKALNTIDPATGQYYLENTDDVKRAFMREREYANTNWFKELFTH RPIHTHTVTFSGGGENSAMYASIGFYDDRGWTLADHVKRITANIKNSFYWNEDKIKATIS AQGNLRNQNSPGTIPQRRNTVIGTFERDFDINPFSYALSTSRTLRPHNADGEREFYRNNW APFNILNEYENNYLKTEVLDLKLQGELSYRLNDHIEVKGLAMVRHAVTKSSHFIAEASNV VQAFRANETPYVARENIYLLKNKDDPMQLPGVALTHGGIFNKTETSLRSYLGRLALDYNR QLSEHNIRAFGFTEIRYADRSMNPFQGYGIQYDRGNQVFSNPLVFEKLANEGDTYFSLTE RYERGVTLSGSATYGYAGKYIFNGVFNYEGANTAGKYSRSRWLPTWNIGAKWNLDQEKFM RKHTTISRLALRTSYGLTAKMNEQAVNSTTVFKHVMVNRTLLKDRENALRILHLENRDLT WEKMYELNFGLELGLFGNRISATFDVYQRNSFDLIDLIRTSGVGGQYYKYANFGDMRTRG VELAIQTQNILTDKFSWSTTVTISGMKQKITRLLNTPNTFDMVAGTGRGNIVGFPRGSLF SFNFQGLKSNGLPTFDFGLYPSNKGANSEISGADFLDAQYSKSYLIYHGPIEPQYIGGIS NTFKYKNWEFSCFVTMQAGNKIRMNPSFDPAFADLNVFSKEYYNRWLNPGDERKTNIPVI PSQDLIRNIGKENIEKAYNTYNYSQNRVADGSFVRMKNISLGYRLPQRFLSHLKIKQMNV KVNVTNPFLIYSDRKLNGQDPEFYRSGGVSLPTPKQYTMTLNVEF >gi|226331999|gb|ACIB01000057.1| GENE 30 38437 - 39966 1182 509 aa, chain + ## HITS:1 COG:no KEGG:BF1993 NR:ns ## KEGG: BF1993 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 509 1 509 509 1010 100.0 0 MLNFNWVKMKKTIFLSTIILLLTACQDYLETNPDSTFDVQIDSEDKIAELLAGAYPEASY FAFLEARTDNVGERTNGIHSRLNEAMYYWEDYDQEDLDTPLNYWNACYAGIAQANQALEL LSKYPKSDRVKALYGEAFLLRAYLHFMLVNIWAEPYGTTKSATAPGIPYLTRPEKNALVD YERGTVKEVYEKIEKDLKLGLSLVNDDYYVKPKFHFNKKAAYAFASRFYLIKGEWDLVVS YSDYVLGVDPKPVLRNWQKYKKEFNSNHKYLYIRYASVDEPANLLLTTTESRVARNIPSE KYGVTIQSAEKVYNEHGIDGCFNFRKMKMQSFFLFNYNDGRIDDGQYIAKFDELSLSGYT GIRPRGLYVTNVLFSTDEVMLNRMEAYTMLGEYDKAIDNLLVYLSVKYGVYPSCGRSTYT QTSSENYQIYTPFYGMSINQLALVKILLGFRRQEFLHEGLRWFDIRRFYISVKRTSKYKF YRPLEKEDSRKLLQIPAEAINRGLVPNPR >gi|226331999|gb|ACIB01000057.1| GENE 31 40005 - 41009 624 334 aa, chain + ## HITS:1 COG:no KEGG:BF1926 NR:ns ## KEGG: BF1926 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 334 1 334 334 654 99.0 0 
MKTIIRKVNFRYFFRRSLYLLVISFLLAGCEEEASLSARSVVEAGETEREKTELDKWILD TFTRPYGIEVEYRWDKNAVQNGSYIYPPEVANVKSVLNTIKTLWIDLYTAPELGGDKFLL GKNPLKIYMYGGRNVDGNGMELLDNLEATTNEMFLYNVNEFNPQDEDKVFILMRSVHHQF ARHLMELFPYDRSKFLSISRNKYIESTKSIAWIFKGETQGRRGFILAGYPNKKGFFTFHS LLSPEKDFAEIISLKLTYGPKDLLQALDRAKTPYNAGSDKDLQKEYDEQALQAYKELVEK QAFVEDYFSKEIKISLNYLQLISMKQVKEFINKK >gi|226331999|gb|ACIB01000057.1| GENE 32 41016 - 42047 737 343 aa, chain + ## HITS:1 COG:no KEGG:BF1927 NR:ns ## KEGG: BF1927 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 343 1 343 343 683 98.0 0 MKLGFQILILMVFCFSCKDEKVFDSSPSERNANHIGELRKELVNAPYGWAATYFPRTDSL LFTNVNELITLPKGIFEDKNKFGYGGHYFLMKFSENGIVETVADYNEESLTKKLQSEFEV SQNTFTQLSFTTYTYLHSLVNDRFTGSSDFLYTGKDVDGNLIFKTSSYIEPAREYIIFTK LKNDESWQEDIQKSYDNKLFFEKMKNPQLIIRRAGRVYFHSDVQMNVTYGGDGSVNGKQP PEVYQRYRLFLARDYFASQGWLGNKVKGLGSGYVGTADGLTFKPGIRYSETYIFYDFRRE GERFVCELVKVVDPYSKKIHWVSKHLAPYGEESGVIAEIRDEI >gi|226331999|gb|ACIB01000057.1| GENE 33 42070 - 42708 400 212 aa, chain + ## HITS:1 COG:no KEGG:BF1996 NR:ns ## KEGG: BF1996 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 212 1 212 212 432 99.0 1e-120 MKHVWIISCLLFVCSCNTVREDFVVNEDYEKLFPSKEIEKPENKRGELLVQLCDPDQALE NYKYPGTETPNGADQYKITLMCSFQEKRWDGNLTKDVSAQYKVKYINEKKELVTISCGKR NIGNAGEDADGLRPNVMFNGEKLEISFNVHSGFPLYLSVSGEGPRSSNIRASITAVSTDG LVEIPSLQTEQYQNEEGINPLRYPYCEYLILP >gi|226331999|gb|ACIB01000057.1| GENE 34 42726 - 43913 789 395 aa, chain + ## HITS:1 COG:no KEGG:BF1929 NR:ns ## KEGG: BF1929 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 395 1 395 395 814 100.0 0 MNYLLKCLFLSCLLISVVACTDGINETVISAYQLPVLPEPSYKFSRNGESSVNVLECGFL KSPIDRIFSEYMNEARMSTKRDYDEALRIYHEGNFGLKPQKEVSASSKHLKDRDKILKDI DAWFETSARIAGLGANIPSYEHRNREAVKGLTGYVGNGIGDKDICYVDERGIAVAEVYKY AIMGAIYLDKILNIHLSEQILENNEVLVRNDLTQLLPGHNYTELEHHWDLAYGYYDFWKT LAQSDGLPALKDCHLRISRSFVKGRALMTTSQYDEMRLQADTIRQELSRVVAVRAMHLLV 
GPNTLANLKENPRRAFRLLSQAYGLIYAAQFARNMQGKSFLTNEETGILLHELEKGDGLW DKERLLGREQTEGALYNLAVRIGEKFDVSPEDIKK >gi|226331999|gb|ACIB01000057.1| GENE 35 43917 - 45020 534 367 aa, chain + ## HITS:1 COG:no KEGG:BF1930 NR:ns ## KEGG: BF1930 # Name: not_defined # Def: chitinase # Organism: B.fragilis # Pathway: not_defined # 1 367 1 367 367 739 100.0 0 MRKIHKLLLPAIIFMMAIIQVKAQGTAGIDVIPNGGFEKWQDTGQPTGWRIVSSLNPERV QERRPESTGVYALKIWLNGGSVFLAQPVSVKAGKQYTLSFWNKGSVGNREIVVTLFWYDN GSIKSREKILSIRTVKDEWRRVESTVTIPENIHSMGMGIRTQSYQGYMLFDDMSMVLKES GPDISPVPEAPDNLRMKAYQNEMEISWNKVADETIKWEVVFDDQVETITSGNSYVKTKLK PGSTHHIKVRAVKGKEFSPYAERRGATERMREAENSEDRIPYLRTILPDGSCEGRFLKLY YNELANPNAKVSYKLDGVTIEPKDNTLEFPEFEGFYKRFRLEVYIDEGEGREWEILYPHL GVKRNEK >gi|226331999|gb|ACIB01000057.1| GENE 36 45048 - 45968 597 306 aa, chain + ## HITS:1 COG:no KEGG:BF1999 NR:ns ## KEGG: BF1999 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 306 1 306 306 618 100.0 1e-176 MIKTVITQLYIAFCLICFFACEKQKEEFPDIRIGKEGVVDELSLNKQTEKRLLLSGGNGK YIVNVENAQIATADISMDTLKVKGWLEGETFATIISHDKRIRLKINVVFPELGISHSVVQ LLPRFRSKFISISGGGELTKLEEDDPADIMDMKWDGSTGMLEIYPKYEGEARVIAISEDA KEKKVIHVKVRPEGKLEIPGWYSTNSSSYYLIQNNNMVVKRKGVGTWIVNSARPYGGGVM YNSSYIKIAPIMNPVQGDSIDLNILRHGSLKPQITEGIHRLYVEEVRESEVMLRGRGFKF LLPYEK >gi|226331999|gb|ACIB01000057.1| GENE 37 45958 - 48207 1397 749 aa, chain + ## HITS:1 COG:aq_505 KEGG:ns NR:ns ## COG: aq_505 COG1596 # Protein_GI_number: 15605977 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protein involved in polysaccharide export # Organism: Aquifex aeolicus # 82 356 83 356 725 141 33.0 6e-33 MKSEIFVRFVSCVAKKVSCLKDMILKKINDKGALLFALFILLAGGSRARQHRKDDHVIRG KQVAVFGRNIFTARNLSFEPSLDIPTPENYVLGSGDELIIDVWGASENTVREIISPEGTI HVAGIGPIFLSGMNIQDAERSLRREFSKIYAAIPQKTVHIELSLGRIRTIMINVMGEVKV PGIYRLSAFASVFHALYRAGGISDIGSLRDIRVVRDGKEIARVDVYDYIMKGKLTDNIRL SEGDVILVPPYQNLVSISGKVKRPMKYEMKSGETVATLLSYAGGFTGDAYRSAIRLFRMG 
GKAKQVYNVAQDDYQSYLLADGDKLSVEAVLERFSNKVEIRGAVYRAGIYQLDDSVTGTV RQLISKAEGLRGDAFLNRALLRRQQEDLTHEMIPVDLKKMMDDTSADLCLQKNDVLYIPS VKDIEKEGTLSIYGDVRVPGEFPYVKNTTIQDLIVKAGGLPESASMVRIDVSRRIKDPGS ILSSNVIGKSFTVELANGLLIGEDKGFELEPYDIVFVRRSPGYRKQANVTVEGEVAFTGN YALTKSNERLSSLIARAGGLSKEAYVRGARLIRRMTADEIRRKQDVVRLLVKGSEENSIS PVALETGSTYPVGIELEKALINPGSDEDMVLREGDVLFIPKYVSTVTISGAVMYPNTILY QKGSNLDYYIEQAGGFGNRALKRHIYVVYMNGMVSRLRKSAVCAIEPGCEIIVPSKENRK KTVPRDVAGMNTSIASIAAMVAAMVGMIK >gi|226331999|gb|ACIB01000057.1| GENE 38 48411 - 48647 140 78 aa, chain - ## HITS:1 COG:no KEGG:BF1933 NR:ns ## KEGG: BF1933 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 78 45 122 122 149 96.0 4e-35 MKGHARITSTFFHDEILCAGEVMFVPRGSEYSGVALSDVTLLVHKFNNTVCQTENCILSY LYSHKNIDSKIYCCQRRT >gi|226331999|gb|ACIB01000057.1| GENE 39 48820 - 51192 1370 790 aa, chain - ## HITS:1 COG:no KEGG:BF1934 NR:ns ## KEGG: BF1934 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 790 1 790 790 1587 98.0 0 MRKLFVLLISIVVLSACAPVHRFTQLRKLPREYSENYSIEGIKAPRSAAHRKPWIVYSDR VENAAYVNPGGRVKSAEVGLLDTFLVIKTKGDYLRLIKYNPTNIKHNRMTQRKKAEYVGW MHRSRLILSPSSITDVQSGLHDKLLTAITDTTAIMQSATYFTTADSLKVFGDPELQNQTG VVGIHAIIYALKYSVDRRSVLVSKTPSLSADKIGEQVIGWIPAVMLQEIGHQVFTGTAFS GVPALQKTLKYAPMIYPYHTDSTCSFVSGTLEPVIDKKDNHVFNINGKRISYTRGNQIKQ ELKRINILFAMEQSSRLPEQYPMLLNAIQNLGPFFAGSGESFSYQFGAAVATHRGMETIP LTADYEILIDRLVKIASHVADTENTSLPAWKAMRSALELIGNTPEAVNLIISVGETGEQQ ENAPSSIVKTLNEKNCRLLGWQLYASNDDKYNNYVLQLSNMIEHYAEYRTKNKRNMILYA DQLCRSNLLREAGPNFLMLDYPYASMTQGGFLFPEKGETLPMELFAGAVDSIVTQIKADH QLLSESIDRAFATVGNGKDRLDSLLIATYHLPQGVKPDKEFKKIFGDVAPVWYRKTGRIT VPDSLMRYYLLLSDPELKQTIERLETLCAIEVDVKDMNKPKKGKVKQLCRYLRERVRPDK VETLGASPANPESKVDMVYVSTGKIRRHLYHFYMSELRNCRICKNKRKEIRRYSLSYAHS QIFGVPANSPVLDDITVKDLKKKKQLTDKELDGLIQYFKERKENMAKKYGEEKITMEGQS YYYIASELLP >gi|226331999|gb|ACIB01000057.1| GENE 40 51194 - 52087 805 297 aa, chain - ## HITS:1 COG:MA1904_4 KEGG:ns NR:ns ## COG: MA1904_4 COG3291 # 
Protein_GI_number: 20090753 # Func_class: R General function prediction only # Function: FOG: PKD repeat # Organism: Methanosarcina acetivorans str.C2A # 39 179 338 480 930 60 29.0 3e-09 MMNKFLNTKVLYIFISVIILIMFFLLVMRACSQKQPIQATVTPSDPVVGEEIFFSDSTSG ARIWYWEFGNNETSTQRSGYHRFKQKGVYKIRLTVNGNLERYFDVRVKEKTNAEDLHLVH IIAPKEAIQGENIIFRGEGHDEQWRWEFGETGMIDSREKTALYAYTEPGEYEVLLNTENT RYPIRHRINILPYYSENDSTDVMVLIGLDIKEKLQNIADGKPFNVNYNYVVDKYFNNNPN TLVVINNNKYNDFYSYCQGLHHIGRKETIIQNVIVETEDEESGYITQITVMQIEKKK >gi|226331999|gb|ACIB01000057.1| GENE 41 52084 - 52611 442 175 aa, chain - ## HITS:1 COG:no KEGG:BF2004 NR:ns ## KEGG: BF2004 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 175 1 175 175 324 100.0 9e-88 MSITPKNKSSKTVSERERFIGFVYVLTLFIVITGACGFILFKYAGTRHIFSNKIMVIKKM ERQKEFQNIQSVQIVSADTLFSRIEQFEPGVNASYEENDIKFLINDLAKQWERNSFDKRN KMFWHLASVYEMWFADKKELWSKQDNIIKFKKNLEECEVGLQKKEGELKNKGGKP >gi|226331999|gb|ACIB01000057.1| GENE 42 52624 - 53127 620 167 aa, chain - ## HITS:1 COG:no KEGG:BF2005 NR:ns ## KEGG: BF2005 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 167 1 167 167 276 100.0 3e-73 MKAKNSEKIIRGYLEFAGGLLISTALSMALLTGFIHTNGSEYKLMESKTQEYDKIYARQI ALVDKVDSLYNYLVLMGSNDRLNQVVLQKVISTRKMELIEELQIMDSKDVLLYKKLASQI NVFLDTKEAIRKAVIEESLVRKDLMRCIQDNKQATRKLTLGNISVEK >gi|226331999|gb|ACIB01000057.1| GENE 43 53133 - 54038 556 301 aa, chain - ## HITS:1 COG:no KEGG:BF2006 NR:ns ## KEGG: BF2006 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 301 18 318 318 608 99.0 1e-173 MDTDFKAELYGALLSEAGFDTAQIMMVRDGNNLSNVSKDIRSVKHRNHVGIAEGAYIELK TNRRGIYDSLPEGLFHEALFPGKVKDLGLILEEMQQHRNEEFFIRRFFSLLESEVDREGI QAQLLELRYDKKNKYSDYAKLFAACWPVIHILSGQGALLFIKFMPHIHSIRGRLEEVSDA LSQILEAPVKVRPKMVQRTIRAQKPNRLGNMRLGANSVNVGVLNSAEADLHIHIGDLPTR EVERFLPGNRSRKALEMLADIFLGAWQEFDVTVSVSPDERKTYLKPTGDASPCYLGINTY L >gi|226331999|gb|ACIB01000057.1| GENE 44 54091 - 55233 863 380 
aa, chain - ## HITS:1 COG:no KEGG:BF2007 NR:ns ## KEGG: BF2007 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 380 1 380 380 778 98.0 0 MAKKNLVNWSDCMPLSAAIFNQHDDYFLNSIRDSIEVRTNSYNYGLLPARQNRDGENGIR ISQHVTGHIEVRLKYCDAVTASGIRIQFDATETGSELVKNYSVESDTRKNITQWDIILSV DPFHRVGSGDPNPEEVPPRHPNALPSYRLFVMPKGEINVSELGAHYLTIGRIRKDAERFM VDADFIPPCTTMKSHPELQEYHAKFGNMFRSLENYSKIIIAKIHNRDNRGELGAHISLIC REMLRYLATLQFTYTNKGLYNAPIDVLDSVSSLAHIMYVSFSYLSGTQKEETQKYFYEWS DVTPGSFDEQLADTLEMLYEHTDIRASMVRAYSFMYTLTELWQRLSTLEYIGQHKENIVV SERTTGNNTTGQNKTWSIMD >gi|226331999|gb|ACIB01000057.1| GENE 45 55261 - 56109 811 282 aa, chain - ## HITS:1 COG:no KEGG:BF1940 NR:ns ## KEGG: BF1940 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 282 1 282 282 512 97.0 1e-144 MNSFFQTLAFTYLLYPILGLLLVGLGIFIAKKNALLGNKRLVGYTIGAIVLLTLPALLGF LDYGFMPYGYIFLAMLYLLLGSYNIRMIAWVFKDDYKYRHEIILTGFILVVSMLFFTLVF NLCNELKYGLWASTCLLPFVIVSVFIRTYRIFIAIPIPVYEVWRYTDDTEQDGYFDPGSL QVLQIELYKQESDREPVKLSVKAPDEMQFGTWFHRMIDDYNLKSPQAPIDDYAAKEGGWI FYRKPTLLSPRHYIDFNLTVKDNRIRSRDVIVAKRVMEQSEN >gi|226331999|gb|ACIB01000057.1| GENE 46 56158 - 56610 171 150 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566901|ref|ZP_04844353.1| ## NR: gi|253566901|ref|ZP_04844353.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 150 1 150 150 274 100.0 1e-72 MKITILLFALFLFTSCTETESRPYEAIWLAITDTDSGYVVYNYPNFQDEKMIAPEYVAIR NDSLIYATWHDYPEKFSLKYAKIENGNNEQYSFVTEYHFSFMWVDKEKHIARWTISDIQK KKTISNYLYIDSRYNTFPLIDYQWDENEPQ >gi|226331999|gb|ACIB01000057.1| GENE 47 56607 - 58460 706 617 aa, chain - ## HITS:1 COG:no KEGG:MXAN_0563 NR:ns ## KEGG: MXAN_0563 # Name: not_defined # Def: LysM domain-containing protein # Organism: M.xanthus # Pathway: not_defined # 500 591 419 511 539 71 37.0 1e-10 MSDDKSQLTISIDIKAEDRKYQIVENSVTSILFKVSDTTTRYVYNIRMGEMRNESETTPE GLKNLNLKKECDLSVKIRTEADQAFVLSEKHCGNIMLVEFFKYEGIPVLCASCVLIVKPS DLKITQVIFKDSTGKEVGCGSVQYEANLKFALRVKYNRPLLRGEKAPKLKCKGYCTNAVT GEYEEISKFRVDENGAYTDTFYCDDGLQESHAGADYVFSFGINNPYGPPFMIEDTNVPVP SIQKIHLIGRNLKKPQITSVIWSSKEMIKFGEDSPRRKSINYNEDGFLHIHARGMYGQKV RVELFEKDSTGIKKLLLGLKDDVTILDNVVCVPVEMSGVYAKAAKGRHALAEGLSFEILA KVTPLDTSIAAFEQDDKSLIELQIYGKADEDKAAKSTVNGTMKFMIADVEEDEKGEEEKA IEEGVCPLCGKKHIDLRSKIDYQTQFDSRFGTKKEQNVACYKACKVILTNAGLSPNSAPN DNTVIQIGVEKSSTDNSSHSSSLTIDFVKASEGLNYINQQLETGYPILVGVDYKAGSPNS DKTTDHFIVIVGRGCKNNEVYYLFYEVGTGQQENGQYKGAHENNKLYLKKDNTLQGTPYH NSNKKYIVVQIRKNILS >gi|226331999|gb|ACIB01000057.1| GENE 48 58489 - 59082 400 197 aa, chain - ## HITS:1 COG:no KEGG:Sputw3181_0259 NR:ns ## KEGG: Sputw3181_0259 # Name: not_defined # Def: CHAP domain-containing protein # Organism: Shewanella_W3-18-1 # Pathway: not_defined # 21 195 23 198 215 199 52.0 6e-50 MVILKIIIIVSILFFVVVVIYIFSTYCNLNVNTQRGTVVDSLNSVYVYYNGGVNQTSGRN VVDGYNIGMKYQCVEFVKRYYYEYYHHKMPDSYGHAKSFFDKKLSNGEMNVSRGLIQYKN GEGILPQIGDIVVFDGYLFNPYGHVAIISAVGTNEVELIQQNSGCMNVSRKCLGLTKNNS GWEIQNKRILGWLHIAE >gi|226331999|gb|ACIB01000057.1| GENE 49 59076 - 59315 147 79 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566904|ref|ZP_04844356.1| ## NR: gi|253566904|ref|ZP_04844356.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 79 1 79 79 154 100.0 2e-36 MIYSVIIGETRRGYQSDMVCKKSIVVLDNKEIVNIYITKALCNDENYDDTEEGVVYLFYT ERWHKTIMNPNPYKAGELW >gi|226331999|gb|ACIB01000057.1| GENE 50 59312 - 59887 287 191 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566905|ref|ZP_04844357.1| ## NR: gi|253566905|ref|ZP_04844357.1| predicted protein [Bacteroides sp. 3_2_5] # 1 191 1 191 191 375 100.0 1e-102 MKSWFIIILITGVFFLFSCRDSKVSFIMCSNPDNEVSIDWRGGGSYTNRILVKSWPKDSI GLYKMMINYLYDNKTVLDTLTRNENIETIFIDFFKYNQTTKNAISTKEGDYKVLKNYLGG VTYSQTAECDKWTINADRNLGYSLQLENPTDYPACANIKLDNQCDTDFYEKHKDDEIVRY YHDLFGKNQEK >gi|226331999|gb|ACIB01000057.1| GENE 51 59884 - 61710 1073 608 aa, chain - ## HITS:1 COG:no KEGG:Fjoh_3259 NR:ns ## KEGG: Fjoh_3259 # Name: not_defined # Def: hypothetical protein # Organism: F.johnsoniae # Pathway: not_defined # 257 608 196 513 548 233 39.0 2e-59 MAGKEFVVDKAMCMCKYGAAPGKLMVTDNQFFRLNGTKLCASTMTLGNVIYPPGFGICKV NPMFPKPCVPAITQWNGQFSKITMMGGNPLTDKSKGTCSCGGPDCIEFMQTGQIPVPGSK QMQQATGEHQGELDAMGDPSALTKHPVDTPTSLLLKEGNILVKAVKGEAESFSGQTLIYE VEHYNTPIVSDEIRSHVKWKVTIGEKEETVDQPGTDVLELSVKEEWQGKKLCVQAYINQP SDNVKVNTQIKKWEFPIIVDRYKMPGLNDTGTDIADDMAYGYGVNTKKCVYSTLLIKQLI ESYEQKHENKKIDNILSNSIDYDPEPPMFSTSSLDTKQKMERYIRVKNAKAIYSKEDFPK ISSRIQEGVRFLRKGGDDFTDEELFADFEAMAKLAFSSLNSEMRGNIVRMIAKFRQNNGG VYEDSVLTDHIKKHPSTIRYCNQLETYIKKELQDSKGDVSTLEDIKIFFKGERDLLEDIL KKRVNDKYHKKDFSLTPVYDAGFISFKNIGRKTEQIKNATQGYTIALNDIWSTEVIIKKY VLNGNSYTVDYRVTLWDHFGLDAPDLEANKVAAYGAGFRAWFILQHFRGYKPFITKITFD KTFKGKIQ >gi|226331999|gb|ACIB01000057.1| GENE 52 61724 - 62488 444 254 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566907|ref|ZP_04844359.1| ## NR: gi|253566907|ref|ZP_04844359.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 254 3 256 256 474 100.0 1e-132 MYKFRNLKGQYSINVSTTFPGSQASLYSAIIEVIASYQNVDDYFIFELRSRQEVKLNGNP PSQMVDLFMLRLSNTYYPMKLKVSTSGKIMEVINFTDIKERWEAECAKIVEEIPCIAYEQ YIELSKSNMDTENVFLQALRKDSFIQFYFKEYLDDIDIVCYNFPRRGESTFYDLAIDSDG SFHKDIKTFHVKEYNSKRYSGKLICEYSEENDIFSLIAEFYYNTLDGQCKKKVSISVENR SVQKANKLKSFLFD >gi|226331999|gb|ACIB01000057.1| GENE 53 62500 - 64356 1590 618 aa, chain - ## HITS:1 COG:mlr6559 KEGG:ns NR:ns ## COG: mlr6559 COG3501 # Protein_GI_number: 13475477 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Mesorhizobium loti # 368 528 25 196 213 89 35.0 2e-17 MASTNLDAVSVEIKVAGKVCDYVTMELFQSVSTHHRFKIKVNYRPDKPSVWAIGPDVIFK QLGEKVSIIMTHHESGEKTEFHGLISDIHVEGFDGNQGFVILEGGSPTILLDRDPAMDCY VEQNLNTIVSDILDKSGVKMNVTNNPKHTDIIPYVARYKETSYGFLSRLLRSYGEWFYYN GETLQIGNPEIETESRAGYDVDLTGVSINATIRSLNHSTYEFDPVNDKFYYDYSGTPKGA TLGSRSAEKCSEPIFPTEAKLPSMRPAYSAMDLEHYGDAGFHRNYSQLSQIKASSRYCGI RLGELVVTRVPESFPGVKITDLGRYRITEITHTVDGQGRYSNTFCGVPGGTPVMPWGDAV MPVAYPEMARVVSNEDPKNQGRVKVQFMWQEVDGGESYWMRVQSPDAGKSDQVAKNRGFV FIPEPGDLVMVGFEQGNPDRPYVTGSLFYKANSQGAATDNTVKSIRTRSGHTLEFNDDEG GDWGITIKDRNGCMFHFDTKGKNIEITAPETMTLNAQNININAGEQLNTSSGKETVMQIG TDFQQDVGGNAEIAIGESLTESIAKDSTNSIAGNLSVTVDENLMYDAQDMTLTAQGGMKL LANAKIGLKSSEGVDIAQ >gi|226331999|gb|ACIB01000057.1| GENE 54 64382 - 66199 1076 605 aa, chain - ## HITS:1 COG:no KEGG:BF1945 NR:ns ## KEGG: BF1945 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 605 1 605 605 1176 99.0 0 MANLFTKESIKARMFKQAATLYDIRNIDGIDPLIRLLIEALSGEIFKLSGDMHAIESRLL EKVASALTPHTALVAKPAHAIAAARPYTPQATVSPTDLFSYKSTEIVKKYKMKNLFFTPL HETRIINAELKYLVTADEFCTITPEGERDATARFRSDVPVMGRKISIGMKIGNNVTTLND LPLYIDIPLVADKSSYLKLLPYCHCTIAGIPVEIKGGIEYDPTRSVSEKYDLGRLITEEI TSKYASHYLTLKAHGLKVKDLSRSRVPEEISFLLPSDFIAECDADTVWIDIEFPTAFSKE ILEQIKVQMNTFIVVNKYPAKITRKVDSVSAILPLEKTEFEYFLFVDSITDNHGDRLREI SSTQDEGRAGCYSVRRGGCERFNAMDAKDFLNRLTDLLYDESMAFSSTDKDGMKEVIEQI EERVSQLGDKNKDGIGGQEMLSYVVIDQRYDKDTRLTVDYTLTNGEFANDIYAGEPLNDC 
ANPDIDKDTLRFVTSAHGGEPSPSVKRRMDMYRFMLLSHGSIYTKEDIRNFCMARYGDSI RSVEVKLGYAAGKKESEGFIRTLDVYLRLSEGMQGLDREEFVVDLDSELRRLSPETYNYR VFINS >gi|226331999|gb|ACIB01000057.1| GENE 55 66230 - 66670 318 146 aa, chain - ## HITS:1 COG:no KEGG:BF2015 NR:ns ## KEGG: BF2015 # Name: not_defined # Def: putative bacteriophage GP25 protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 146 1 146 146 293 100.0 1e-78 MKQKYYKLPVRLEKLFEGDIRQLERCSELESIDQHIELLLTTCPGEHRFDPKYGCRIWEL DFERIVSTEQWKELFTGYVMQAISTYEKRISDITLSVNLQEVVREEVLDNRMIRKRVDIT VLATLNSTGAPCGFGYKLYMGPLSNE >gi|226331999|gb|ACIB01000057.1| GENE 56 66667 - 67125 282 152 aa, chain - ## HITS:1 COG:no KEGG:BF1947 NR:ns ## KEGG: BF1947 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 152 1 152 152 308 99.0 5e-83 MAIYSLHEMAGLLKTGLWQGGEVEKLGYIGTAVPHASPAPACELTGVYISYDTMLERCEV SMQGGGCELFISQYHPYTGTYMTYVLESNLSDRRLLLAKLEEILADRRNPYLWSHNLYKR NSFGIERRYDTGHSSMWGGRIAVPRQSKFFGR >gi|226331999|gb|ACIB01000057.1| GENE 57 67130 - 67549 413 139 aa, chain - ## HITS:1 COG:no KEGG:BF1948 NR:ns ## KEGG: BF1948 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 139 1 139 139 260 92.0 1e-68 MGFFDRLFKGKEVFADRRNPDMVCEITLCGQSHILSEFDIAYEPDNSSKEYMEAYAVFLE PVNAEAENWIMQSNRKENGVVRFYRNSDAMDEGALFEIKFSDASCVHYRNVSQGDTSVKT IVMTFPSVRMGGDEFELKR >gi|226331999|gb|ACIB01000057.1| GENE 58 67552 - 67959 461 135 aa, chain - ## HITS:1 COG:no KEGG:BF1949 NR:ns ## KEGG: BF1949 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 135 1 135 135 271 100.0 7e-72 MQTHLYLHKENTDLPHDEYRGKYPLKHFDYGLSRNVGRKGEITSGVCGGEIRIVIDGFAD ATLLGWLFDTFRKEDGAIVTLDEHETTFAKLQFSGASVRSYRMNYDSRVKKGVVTIIVIE AKEIVTDNDLFFKNK >gi|226331999|gb|ACIB01000057.1| GENE 59 68059 - 68421 252 120 aa, chain - ## HITS:1 COG:no KEGG:BF1950 NR:ns ## KEGG: BF1950 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 120 23 142 142 206 97.0 
2e-52 MGMRSHNFDGLPVLAGMMMVTPNMMLHLAILQVVLQTLEVHWFEELLALGWWGRIIYLGF FVGVYCYYWYNGRYKRIIEKYNLEKNTYWKRHPFVTILLYVITNFVAFFIVVCIKKGYIF >gi|226331999|gb|ACIB01000057.1| GENE 60 68457 - 69452 520 331 aa, chain - ## HITS:1 COG:no KEGG:BF1951 NR:ns ## KEGG: BF1951 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 331 1 331 331 636 97.0 0 MAYRNEYKTQSEFNVILERGDDYEGFVVGLGYTWMSSRVILPVNQNGWSPISRNVSVDES FHTIVSERKYDTSQYAYEKSLMQDPTKVSEKVRDLIVKNKGNNITEINLGQGKQYLPTDN SEISISINDTGSRYEIVISATDNSNGKTYEAKYESLTDLVSAVRDSGSPPAVNKEGPNLE GLAGLGFGIAETAGNWAEKIMDNRGAYLPKQMRFSPKTLPPIIRSPLGNYQVPAKAMSRV RGVGKALGWAGMVLTGYQVVRDVQNGRFAVAGTRIAVAGLAYGVTFIPYVGWVLAIGIGV ADYTWGDEFYDWIDNRASELEMWWDGVRLAL >gi|226331999|gb|ACIB01000057.1| GENE 61 69465 - 69869 242 134 aa, chain - ## HITS:1 COG:no KEGG:BF1952 NR:ns ## KEGG: BF1952 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 264 97.0 7e-70 MELFSLPKIDSDLTAWFILDGKEYEMSQFSISFGQSVDHKGQPQDEVRGGRMLVVLTQAL PDSMYRWAMTSSPKNGEIVFRSKTANSPLRIMFNNAYCVSFQRQVGNASGMISKLLISPD EILMNGISFDNHWG >gi|226331999|gb|ACIB01000057.1| GENE 62 69875 - 70303 363 142 aa, chain - ## HITS:1 COG:no KEGG:BF1953 NR:ns ## KEGG: BF1953 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 142 1 142 142 268 91.0 4e-71 MFGHKSFLRIGALSDSSISGLYKDSYELESCSFNFSQGVDTNGSPQTEVRGGTLYLTYGG LPQEDMLHWMLGSTKYEDGAIVVCDDNNEPLEKIFFEQAACVGLEIDYVQQGKGYIQTKI TLQARKIKVGETTLENRWTINK >gi|226331999|gb|ACIB01000057.1| GENE 63 70319 - 72802 1311 827 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 [Bacillus selenitireducens MLS10] # 1 818 1 804 815 509 35 1e-143 MDFLNINETVQSVIRISKGVAREYGNASYAPAHLLFALMHKEVGLRSFVESLGKDADYLR EWAEVRIEEYPKAAGAGEIMPDPKVDELFEQADNVRIKWGLLEINPLCLLAAIATPEAGF STDELRSFPIREREIHDLFTSGMGNMKKSVSTQTDLPGSEAPLWTAGAGNLDKYCTDKTA LAADKKVYPIVCRDRETRMMMEILGRMGKPNVLIIGDAGVGKTALVDGLACSIIEGTVPQ 
YLKDMTIYELDTGTLIAGATYKGEIEERLKGIIKELSAHGNAILFIDEIHTLIDPKSGNS GAASILKPELAKGNITVIGVTTVDEYRKLIEPDHALNRRFEVLQVSEPDLASTVNMIQAA LERYESYHGVGVDVDSLPECVALAKRYVKERRLPDAAIDLIDRTLSAVKMINQSGEKDIK GLAEQLAAIEAAEDQPEAERTEKLRLVNFTMKNRLSPILLGMLSDGQNEPESSDYGEWMD YILGVLDRLRELTKEKIGKITSHEVAAVVSSSTGIPIGKIESGEKEKLLNMEDILRRRVV GQDNALKVLTDAIVESRSGMNKPGQPIGSFFLLGPTGTGKTELAKALAEALFNDEKAMIR FDMSEFKEEHSAALLLGAPPGYVGYEEGGVLVNRIRRQPYAVVLFDEIEKAHSSVYDIFL QIMDEGKLHDRLGKEGDFSNSIVLFTSNVGSEWLSKQINEGKNPSTTDLMEVMGSYFRPE FLARLSEIVPFSPINETMLVRIFEIQLKGVIALLEKQHIDIEITEKAKNLLATRGFTPKY GARQVAGTIRNYIRRPISKMIVAGTLAAGNTVVVDVADNGDIDWNVK >gi|226331999|gb|ACIB01000057.1| GENE 64 72815 - 74194 1321 459 aa, chain - ## HITS:1 COG:no KEGG:BF1955 NR:ns ## KEGG: BF1955 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 459 1 459 459 874 100.0 0 MEDNKNKSEVQAQETVQVSYKEKNGESSLASALDVLSRFGGFNFLESTVDGVQNLNPERK ARKKIFLTDEQKQEEREVLKNKIDMWIDLLNSSSAVTEMIATSKAKSEAAASHLAKSQLV AVQSVRDMEQAYRGVMLFYKNTEADKVNNVTIVNASKEQLTDLDNPRFIDFVARELKQNY DKLDLRQNYSLLAIPGYLGSNKVLEKWAKIAYENKVMLYTDFADLDKPEDVVELFSSSNM ASGEAFKSNTCMTCNWLVGRAGYREIGEEEDLHVSPSIALMGKVYTTLMSQVTAGKKYGG INEIDAVVFPLKKSEISQLEKMGLIPMVNEYGKVMAFSAKTLFNGDNLGLQTYSVVRVFD YITKVLFDFLNRRSFENWSSKTERDLRGQIVQYLDSVQGADRLIEKFKIIRFEQDQKQKD RIYLDIHMTPFFPAKSFVIKLDGTKGDDGTNWNTNYEQA >gi|226331999|gb|ACIB01000057.1| GENE 65 74210 - 74659 456 149 aa, chain - ## HITS:1 COG:no KEGG:BF1956 NR:ns ## KEGG: BF1956 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 149 1 149 149 254 99.0 5e-67 MAILEYGIGGNEVKVDTSEAIANIPENRSLIVEQLTADEPVTPEAVKGLSTIEEVFAHFL PNIDVEFENEEGQPVKENFRFSSVADFSVKNMTQNSPFLHKLDTEKTFYEGMVTQLRSNK VLQRVLENPESKKAFISALEALSNELKTE >gi|226331999|gb|ACIB01000057.1| GENE 66 74749 - 75138 334 129 aa, chain - ## HITS:1 COG:no KEGG:BF1957 NR:ns ## KEGG: BF1957 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 129 1 129 129 257 100.0 7e-68 
MAFRATLSFAGKEFDVLDCTYSLKRDVDSKGRPSSNIYGGQIRLHVESTDDTSILENMTN QFKPHSGSIVFKKGDEEAKMKELTWENGYITEFTENIDIVGSQPMTITFVVSAQVIKIGG AQFEQNWPK >gi|226331999|gb|ACIB01000057.1| GENE 67 75579 - 76322 563 247 aa, chain - ## HITS:1 COG:no KEGG:BF1958 NR:ns ## KEGG: BF1958 # Name: not_defined # Def: putative transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 247 1 247 247 460 100.0 1e-128 MMGQGDKKIRIRRTNEQLDKEVISEFEKLVGEFGFGNVNLSALMKAADLEANVFYRRYGS MDNLYDRLAKQYDFWINNTIDISTLNTLGPKKFFAETFKTLFRNLSENSVMQKLLLYEMT TINSTTKRSAETRDVMNLNLITFYENLFAPAKINIKSIASILIGGIYYLILHKECAKICT IDYKTKEGENAFSEGIDFLTDIIFDRLEMYDRDKKAIRQMISDGISESKICKYMGINKND LKTLLSE >gi|226331999|gb|ACIB01000057.1| GENE 68 76877 - 77938 779 353 aa, chain + ## HITS:1 COG:aq_626 KEGG:ns NR:ns ## COG: aq_626 COG0156 # Protein_GI_number: 15606058 # Func_class: H Coenzyme transport and metabolism # Function: 7-keto-8-aminopelargonate synthetase and related enzymes # Organism: Aquifex aeolicus # 4 348 29 370 373 256 40.0 6e-68 MLMFGSNSYLDATGIPSVVEKAVRVITDYGVGSGGVPLLSGTTIFQNELEKEIAKLTGFD DTILFSSGFTANIGVIVGLIRPNNLLVYDRLNHASLIDGALMSGAKMVRYKHNDPKALEK ILKENAGQYKDGMMVVTDGVFSMDGDIADIPAILEITKKYNALLLIDDAHATGVIGEDGA GTLSYYDIKERENIIVTGTLSKAIGSIGGFITAKQNIIDYLRVYARSNMYSTSLPQSICA ASLEVIKEMRNTDIQNALKRNAEYVRNGLKALGFNTLNSMTPIIPVIVGDEYILTQITKE LYDRDIFTNAIFPPVVPPNMCRIRIGVMSSHTFEDCDRLINAFHEIGKKYGLI >gi|226331999|gb|ACIB01000057.1| GENE 69 78160 - 81234 1566 1024 aa, chain - ## HITS:1 COG:SA0089 KEGG:ns NR:ns ## COG: SA0089 COG1112 # Protein_GI_number: 15925797 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases and helicase subunits # Organism: Staphylococcus aureus N315 # 40 1019 64 1043 1050 392 31.0 1e-108 MNYMQILSCWHKLEHFSPAILPKDKSLKPLKELPWMRPLEAKDPKKTIQYTIYLGVFSQI SVSDFVKDFFKDERNNPNVTDAKVCYASLKLDNLGVYIQNTFGFSTMPWALRQLEAGKVN TNSWSEDFDKLRKNLLERLGENRKELAEDYSSYLSETQTLENLQQIQALIIQDLKWSTSP ETEIYVRIEEVYKKNNTSDKEEANADLLNSFYIDDLERIITSSVKGSYNTAFRNYLSACL 
NKDFVHFDLSLQPEILKECLVPENYPDGCWPSPHTASLMQQFAVNTVSKELSGEKQEGIF SVNGPPGTGKTTLLRDIIAAILVKRAKKMVNFTEPAKAFRKIGEVQVSEKYTPSIYEPDS SICDGGIVVASSNNGAVENISKELPLKKEARGYSDQVGYFRQVSEECVGEESWGLIAAVM GNKENQRKLIYSIWDGDSEEESYTLKQQLKDYKPTEEEWLNIVVSFKNKLEEVEIEKSRL TGFMKDAESIEKLRIQLEDAESHLSHVDKELEGLLEEKNLLSTEIKRGKQQKEDAMTELK LLQSTRPGFFIYWFNKTVRTQYKKALTATLTKYNQLSEEITKQKTSLQALDLRVEKQRKI QEQSQKDYDRINSDYARLSELTEAARQELKGAYADASFWKQIESKEVQEISPWYSKRLKQ LQSELFIEAMKVNELFILRANATSSRIKTTLDVFFNFLKTGGNLTEREIQAIWNTFWLIV PVVSSTFASIQRMFSQMKTGTIPWLFVDEAGQAVPQAAAGAIWRSKRAVIVGDPFQIEPV VTIPEQLVNNISHHFGLDKTQIQTSLSVQSMADRANPYGWITNDTWTGSPLRVHRRCVDP MFSIANEIAYNGMMYNSTLAESSQLFMRNGFLQVEGKVSGRHYVPEQGVLIRQMIIDEIH HLQDLPDLFVISPFSEIPSILKKELRQPIKQALATYKSIEDNELKKWLDAHIGTVHTFQG KQAAGVILCLGLDEKSKGAASWASSKPNLLNVALTRAKQRFVAVGDGDIWLRQPYFSKLK ALNR >gi|226331999|gb|ACIB01000057.1| GENE 70 81263 - 81448 223 61 aa, chain - ## HITS:1 COG:no KEGG:BF1962 NR:ns ## KEGG: BF1962 # Name: not_defined # Def: probabale oxalocrotonate tautomerase # Organism: B.fragilis # Pathway: not_defined # 1 61 1 61 61 107 98.0 9e-23 MPYITIEGGSLTREQKSELIRKVTEVASEVMQIPMEFFLCTVKELPDENIGIGGRTIDLI K >gi|226331999|gb|ACIB01000057.1| GENE 71 81840 - 82079 150 79 aa, chain + ## HITS:1 COG:no KEGG:BF2031 NR:ns ## KEGG: BF2031 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 79 21 99 99 138 97.0 8e-32 MGNYSTEDGTRTFYTELLHWIKEVLSENLDLMKSLLDNDLAEEKAKNLVGLDVTNDWSQE SIEILFKLVLMYANREPYY >gi|226331999|gb|ACIB01000057.1| GENE 72 82446 - 82574 71 42 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253566929|ref|ZP_04844381.1| ## NR: gi|253566929|ref|ZP_04844381.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 42 5 46 46 82 100.0 1e-14 MFFRFSEKEGELPHINGSIEVDGNAVGLMQHPPGARGKPIYI >gi|226331999|gb|ACIB01000057.1| GENE 73 82655 - 82999 442 114 aa, chain + ## HITS:1 COG:no KEGG:BT_2334 NR:ns ## KEGG: BT_2334 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 114 1 114 114 192 96.0 3e-48 MEVITFESEAYKALVGKIEKIAEYVAAAQLPSEEKKEAWLDSNQLAEALGISTRTLQRLR DENLISYSMLRGRCMYKLSEVERCLEERTIRCKPQTLEDFRKNYLMRTGNDKKG >gi|226331999|gb|ACIB01000057.1| GENE 74 82983 - 85028 1314 681 aa, chain + ## HITS:1 COG:no KEGG:BT_2333 NR:ns ## KEGG: BT_2333 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 681 1 681 681 1361 97.0 0 MIRKDDILKMTEKGISVFRYYLPVDFKVGKNFLNPFYKDTKASCNIYYERKAGVFKMKDF GNEDYSGDCFEMVGRLNGLNSKEPKEFVEIMEIINRDLHLGLSAHEGYHVSHSKVSQKNE VVSEEPKAKSVRPYTVVQKPFTAAELAFWGKSGIGENILKAYRTVSLKKFSSENQERKPF SCMTSVDEPMFGYMGKQHIKVYRPCSQMRFLYAGDFGDNYCFGLEQLPAKGDLLFITGGE KDVMSLAAHGFHAICFNSETAFIPAAVIHRLSFRFKHIILLYDVDSTGLKSSAKREEELK EYGVKRLLLPLAGTKTEKDVSDYFILGNSREDLIKLFLDYLETLYSETMSALKSCEVDFN NPPPIAQMIVSVNDVPLGTQGNLLCVTGGEGTGKSNYVAALIAGAIRPSGTDVDALGVTL HENSKNKAVLFYDTEQSEVQLYKNISNLLRRCGREAMPEWFKAYRLTGMSRKERLLSIIQ SLDKYHYQYGGVHLVVIDGIADLIKCANDEAESIAVVEELYRLAGIYKTCIVTVLHFIPN GLKLRGHLGSELQRKAAAILSIEKDSDPAVSVVKALKVRDGSPLDVPIMQFSWDKKKAMH VYLGEKPKEEKDKRKEDELVAVAKEVFSRRRFVTYVELAEEIQSILEVKERTAKSYIKFM REKEIILKSSDNQSYYVIGNF >gi|226331999|gb|ACIB01000057.1| GENE 75 85041 - 85394 375 117 aa, chain + ## HITS:1 COG:no KEGG:BT_2332 NR:ns ## KEGG: BT_2332 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 117 1 117 117 189 99.0 4e-47 MLVDKTEFEAWMERIMGELYRISRKLDKAETREKYLNYLNGERLYDNQEVCLLLRISKRT LQRYRNNGVLKFYSIYHKTYYKESDLHEFIRNNFDENEIKRQAREDKLSESYPLPKE >gi|226331999|gb|ACIB01000057.1| GENE 76 85652 - 86545 293 297 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566933|ref|ZP_04844385.1| ## NR: gi|253566933|ref|ZP_04844385.1| predicted protein [Bacteroides 
sp. 3_2_5] # 1 297 3 299 299 543 100.0 1e-153 MRYLNNKSPLFWGILYFLIIPVFATVFLFLPTETWSANSDLENWWDCLYFSMVTITTLGF GDIAPITTLGRLLVAIETVSGVLFIGLFLNALSLQQSKIISEEEKKKHEENIREKERAKL LRHYKLIEPIINEFIIAIYEITTPLSNRNFAKIIINENFHFNDMSDLYYQSLKLASNPTK TVIHNYYEVQNKLCHNIERLLLEIDLSYWKNIEIACLNFLQKCNELDYSDSILGNFKIKL GNQKAIDYYSAVIKKYTGKLQYRQSNTMNQFIALYELIKYNIRFIEEIRQDFDKIKA >gi|226331999|gb|ACIB01000057.1| GENE 77 86573 - 87304 326 243 aa, chain + ## HITS:1 COG:no KEGG:BT_2328 NR:ns ## KEGG: BT_2328 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 243 1 243 243 451 95.0 1e-126 MNKEQFTFYIDESCHLEHDHFPVMCIGYIKVPKEQTEEMKQCIKTIKREYNILHEIKWNT ISNTHIDMYKELIDYFFDSNMEFRCILVKYKDRLDNLSFNNGEHDNFYYKMIYYLLVNPY TNPPTMNNYRVFLDIKDTKGCAKLNKIQEIFFNKFHGKSPFLSFQHIRSHESQFIQLADF FIGAVAYKARGLHLKKDGSLAKKELINYIEMKSGYVLDEGTEPGEIKFNIFDHQPRKRND SGR >gi|226331999|gb|ACIB01000057.1| GENE 78 87216 - 87827 232 203 aa, chain + ## HITS:1 COG:no KEGG:BT_2327 NR:ns ## KEGG: BT_2327 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 50 203 1 154 154 281 93.0 1e-74 MFWMKVQNQEKLNLIFLITNLVSAMTVADELGALKPYFADGVAETEQEKMDELYGIFHRD FFENTVIIDGIPLKVKPYLYKNSEKDNLPVDFERYCEKFVHVVTRTIKGGKYKTSGKIRE FREERANRVHWIRPILENKEDKRITYFQYIEDDGTLRDYYWYRGKQYVVIVEYIQPDYAL ITGFCVDCDNQPYYQNKYINREK Prediction of potential genes in microbial genomes Time: Wed May 18 00:20:34 2011 Seq name: gi|226331998|gb|ACIB01000058.1| Bacteroides sp. 3_2_5 cont1.58, whole genome shotgun sequence Length of sequence - 52193 bp Number of predicted genes - 49, with homology - 47 Number of transcription units - 25, operones - 12 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 113 - 253 113 ## BT_2326 putative DNA methylase 2 2 Tu 1 . + CDS 332 - 691 279 ## BT_2333 hypothetical protein - Term 1079 - 1132 1.5 3 3 Tu 1 . - CDS 1356 - 1586 94 ## - Prom 1636 - 1695 7.0 + Prom 1455 - 1514 5.7 4 4 Op 1 . 
+ CDS 1551 - 2603 576 ## BF2019 hypothetical protein + Term 2748 - 2802 -0.1 + Prom 2610 - 2669 4.6 5 4 Op 2 . + CDS 2811 - 3089 277 ## gi|301163016|emb|CBW22564.1| putative transmembrane protein + Term 3091 - 3140 8.4 + Prom 3613 - 3672 4.5 6 5 Tu 1 . + CDS 3784 - 4461 346 ## BF2027 hypothetical protein + Prom 4655 - 4714 10.7 7 6 Tu 1 . + CDS 4753 - 4992 237 ## BT_1755 hypothetical protein + Prom 5004 - 5063 1.6 8 7 Op 1 . + CDS 5119 - 5817 532 ## COG2849 Uncharacterized protein conserved in bacteria 9 7 Op 2 . + CDS 5839 - 6165 251 ## gi|301163019|emb|CBW22567.1| hypothetical protein 10 7 Op 3 . + CDS 6244 - 6477 57 ## gi|301163019|emb|CBW22567.1| hypothetical protein 11 7 Op 4 . + CDS 6496 - 7149 320 ## gi|253566945|ref|ZP_04844397.1| predicted protein + Prom 7212 - 7271 9.4 12 8 Op 1 . + CDS 7299 - 8090 645 ## gi|253566946|ref|ZP_04844398.1| conserved hypothetical protein + Term 8145 - 8188 -0.7 13 8 Op 2 . + CDS 8202 - 8882 128 ## gi|253566947|ref|ZP_04844399.1| predicted protein 14 8 Op 3 . + CDS 8965 - 9156 144 ## gi|301163023|emb|CBW22571.1| hypothetical protein 15 8 Op 4 . + CDS 9169 - 10119 627 ## BF1273 hypothetical protein + Prom 10156 - 10215 4.1 16 9 Op 1 . + CDS 10239 - 10496 160 ## gi|301163025|emb|CBW22573.1| hypothetical protein 17 9 Op 2 . + CDS 10531 - 11148 337 ## BF2010 putative transmembrane protein 18 9 Op 3 . + CDS 11145 - 11591 270 ## BF2010 putative transmembrane protein 19 9 Op 4 . + CDS 11596 - 12498 225 ## BF2009 putative transmembrane protein + Prom 12651 - 12710 2.3 20 10 Tu 1 . + CDS 12770 - 13423 43 ## BF1941 hypothetical protein + Term 13610 - 13653 -0.8 21 11 Tu 1 . + CDS 13747 - 14349 179 ## gi|253566953|ref|ZP_04844405.1| predicted protein + Prom 14650 - 14709 3.6 22 12 Op 1 . + CDS 14739 - 15686 593 ## BT_0234 putative transposase 23 12 Op 2 . + CDS 15827 - 16948 767 ## BT_0233 integrase + Term 16968 - 17021 14.3 + Prom 17023 - 17082 3.6 24 13 Tu 1 . 
+ CDS 17121 - 17915 285 ## BT_0232 putative protein involved in transposition + Term 17944 - 17989 9.4 + Prom 17947 - 18006 3.6 25 14 Op 1 . + CDS 18153 - 18530 353 ## BT_0231 excisionase 26 14 Op 2 . + CDS 18533 - 19555 557 ## BF4233 hypothetical protein + Prom 19672 - 19731 2.5 27 15 Op 1 . + CDS 19841 - 20368 398 ## BT_0229 hypothetical protein 28 15 Op 2 . + CDS 20373 - 20900 266 ## BVU_3673 hypothetical protein + Prom 20978 - 21037 2.4 29 16 Tu 1 . + CDS 21155 - 22576 798 ## BVU_1439 mobilization protein + Term 22665 - 22714 5.1 + Prom 22613 - 22672 3.6 30 17 Op 1 . + CDS 22763 - 26986 2319 ## CHU_2948 hypothetical protein 31 17 Op 2 . + CDS 26994 - 28787 287 ## COG1112 Superfamily I DNA and RNA helicases and helicase subunits 32 17 Op 3 . + CDS 28819 - 28941 75 ## PRU_0534 divergent AAA domain-containing protein 33 18 Op 1 . + CDS 29099 - 30397 934 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 34 18 Op 2 . + CDS 30411 - 30692 388 ## gi|253566965|ref|ZP_04844417.1| conserved hypothetical protein + Term 30702 - 30747 5.0 - Term 30948 - 30989 1.1 35 19 Tu 1 . - CDS 31010 - 32185 647 ## PRU_0911 prophage PRU01 site-specific recombinase + Prom 32695 - 32754 6.4 36 20 Tu 1 . + CDS 32788 - 33408 562 ## BF2032 TetR family transcriptional regulator + Prom 33483 - 33542 5.0 37 21 Tu 1 . + CDS 33567 - 34481 553 ## COG0582 Integrase + Prom 34588 - 34647 4.5 38 22 Op 1 . + CDS 34775 - 35362 221 ## BF1967 hypothetical protein 39 22 Op 2 . + CDS 35451 - 36311 435 ## BF2036 hypothetical protein 40 22 Op 3 . + CDS 36336 - 36809 118 ## BF2037 putative transmembrane protein 41 22 Op 4 . + CDS 36852 - 37319 243 ## BF2038 putative transmembrane protein 42 22 Op 5 . + CDS 37363 - 37794 522 ## BF2039 putative transmembrane protein + Prom 37864 - 37923 4.8 43 23 Op 1 . + CDS 38012 - 38470 451 ## BF1973 hypothetical protein + Prom 38481 - 38540 3.8 44 23 Op 2 . 
+ CDS 38561 - 39349 669 ## BF1974 hypothetical protein 45 24 Tu 1 . - CDS 40674 - 40814 76 ## - Prom 40865 - 40924 6.9 + Prom 41157 - 41216 3.9 46 25 Op 1 . + CDS 41236 - 42810 702 ## COG1205 Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 47 25 Op 2 3/0.000 + CDS 42856 - 46494 1378 ## COG1205 Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster 48 25 Op 3 6/0.000 + CDS 46496 - 49420 1895 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family 49 25 Op 4 . + CDS 49433 - 52193 1155 ## COG1002 Type II restriction enzyme, methylase subunits Predicted protein(s) >gi|226331998|gb|ACIB01000058.1| GENE 1 113 - 253 113 46 aa, chain - ## HITS:1 COG:no KEGG:BT_2326 NR:ns ## KEGG: BT_2326 # Name: not_defined # Def: putative DNA methylase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 46 1863 1908 1908 73 80.0 3e-12 MPQSISIKEAMDTLDGKLVIGGVPKPGNKDDPDSDQPEKSKPKLKL >gi|226331998|gb|ACIB01000058.1| GENE 2 332 - 691 279 119 aa, chain + ## HITS:1 COG:no KEGG:BT_2333 NR:ns ## KEGG: BT_2333 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 3 118 377 492 681 213 87.0 1e-54 MSLGTQGNLLCVTGGEGTGKSNYVAALIVGAIRSSGTDVDALSVTLHENSKNKAVLFYDM EQSEVQLYKNIINLLRRCRRESILEWFKAYYLTGMSRKECLLSIIQSLDKYHYQYGGIW >gi|226331998|gb|ACIB01000058.1| GENE 3 1356 - 1586 94 76 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MIVYIFFIMIIHYLEMFIPFFILYNYKYIKSAVHHERHTALFSVNRTGFNIYSPPCLSFV AAWGGVCSALACRIAP >gi|226331998|gb|ACIB01000058.1| GENE 4 1551 - 2603 576 350 aa, chain + ## HITS:1 COG:no KEGG:BF2019 NR:ns ## KEGG: BF2019 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 27 345 6 324 325 428 88.0 1e-118 MDYHYKEDVNNHTNWELFFEVMDRIILFFLGSVLLLSGCKNSKQPEGYNSVNIIEYVNVY DENNRILTAQGSEYDYLYFGDNKDKGILAATNNFTKTYSYDNDSYCYTVEEPLSGSLLKT MRYTENSIEELVLENNKDTFSYSFSLYYDKNKPKYKKSIIILGDDPFSDSRYEEYYYYDN 
NGNKTKKIRHDLNTGEREETYKFNDTDYKEAVNLVPSSDYKQNIECSLKQTVNDTLISRI TLNGVLDRVMKEYTIGKKKIKEEFDNGMTLVNKETEYEENGLRVNVNHTLRNTGYSTDSI YYEGNKKVKHIYNSDYNGTITLEISEYDEQGNIIKKTKKLRWESDNEITR >gi|226331998|gb|ACIB01000058.1| GENE 5 2811 - 3089 277 92 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|301163016|emb|CBW22564.1| ## NR: gi|301163016|emb|CBW22564.1| putative transmembrane protein [Bacteroides fragilis 638R] # 1 92 57 148 148 181 100.0 2e-44 MIDDLVNNRLKVGMTYREILNLLGESYFVNEEDSMPGIRYEIDVEYQFLDIDPYKGKDLF IKFGKDSLVVSYKFIDWKSGSEDITKETECKQ >gi|226331998|gb|ACIB01000058.1| GENE 6 3784 - 4461 346 225 aa, chain + ## HITS:1 COG:no KEGG:BF2027 NR:ns ## KEGG: BF2027 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 225 22 246 246 362 81.0 6e-99 MSELEKLVAEQGFGNVNLSTLTKAAGMEANVFYRRYGSMDNLYDRLAKQYDFWMNNAIDI SNLNVLGPKKFFAETFKTLFRNLSDNPVMQKLLLYEMSVVNDTTKRTAETRDIMNLNLIT YYETLFKPAKVNIKSIAAILIGGIYYLILHKECAKICTIDFNTPEGEKAFSEGIDFLTDT IFNRLEAYERDRNAVRQMLADGISELKICKYMGISKNDLKVLLSK >gi|226331998|gb|ACIB01000058.1| GENE 7 4753 - 4992 237 79 aa, chain + ## HITS:1 COG:no KEGG:BT_1755 NR:ns ## KEGG: BT_1755 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 21 79 64 121 408 72 57.0 4e-12 MKKIMLLLLSGVIVLTSTAQEKVYQIEEITVINYGDGRMLFRLNNEEKTPLNGSHRLIDG YHSEYILADFKDGMYHGNY >gi|226331998|gb|ACIB01000058.1| GENE 8 5119 - 5817 532 232 aa, chain + ## HITS:1 COG:FN2119 KEGG:ns NR:ns ## COG: FN2119 COG2849 # Protein_GI_number: 19705409 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 224 70 292 338 79 30.0 5e-15 MDGVSRTYHSNGKVETEKVYKLGIEDGYDRRYGSDGTLTLDECYKDGKRDGKWTEHLSGN VGDVIRISFYKNGLPDGQWIETWKDGKPRSKSSYKDGKKEGVWIRYGKGGKPEKSTTYKN DEKNGEEITYFTDGTPEKSSNYLNGKLNGVTKEFYFGSGKCKSEYTFKNGQREGAYKRYF DTGELREEGCCEADSEVYRKEYYANGKLKSVAERKGGGWNTIERYDSEGNKE >gi|226331998|gb|ACIB01000058.1| GENE 9 5839 - 6165 251 108 aa, chain + ## HITS:1 COG:no 
KEGG:no NR:gi|301163019|emb|CBW22567.1| ## NR: gi|301163019|emb|CBW22567.1| hypothetical protein [Bacteroides fragilis 638R] # 1 108 8 115 219 215 99.0 8e-55 MRYKKTVRLILISVIIVGSIGLFYSNVLQPPFIHINDGKRLVNPRGTDSIYIYTEDILVA HPPKDTLERMKMMINYYDTAGVSLVDLKKQGDITFYYMGFSKNTCATR >gi|226331998|gb|ACIB01000058.1| GENE 10 6244 - 6477 57 77 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|301163019|emb|CBW22567.1| ## NR: gi|301163019|emb|CBW22567.1| hypothetical protein [Bacteroides fragilis 638R] # 1 77 143 219 219 151 100.0 1e-35 MEGSPDKWKISISYNLGTEPDADYIGCKTKDYILYDERDSNFYERHKDDEIVRYYHELQE RKRHTKSEPKSEQEIFI >gi|226331998|gb|ACIB01000058.1| GENE 11 6496 - 7149 320 217 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566945|ref|ZP_04844397.1| ## NR: gi|253566945|ref|ZP_04844397.1| predicted protein [Bacteroides sp. 3_2_5] # 1 217 1 217 217 439 100.0 1e-122 MNKIIMIAALSAACAFTGCNNASKQRTGNTSSETNLPSCQKTSQDKLNVIEIDGTPYLAL NKLLVLPDDAKNVAIPLPVSFLGYYNYGQENPGYQPADSLISFLKTLGFEGMEFHMCHTL PFRNKNILPVLLCLSLGDNDYYLLVTVDAERGEIIDHLEVGESNDNGRVSFRIDTAFNIE RFSAETVYNETDNTKEEVYKEKLGSYQIGSDGKICDK >gi|226331998|gb|ACIB01000058.1| GENE 12 7299 - 8090 645 263 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566946|ref|ZP_04844398.1| ## NR: gi|253566946|ref|ZP_04844398.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 263 29 291 291 554 100.0 1e-156 MKSIKILFCIGSLFTLSSCNSCASTNDQLLNGIDNLIGKVDRFLEEHKAVAKPQPPIDYE KEGVNTKHLFIFTDTTMTYNGKSFMPGMTIGKLCEIFGQYERLAEPGIYIWDSIGLTMTS LDESGKDSSPVNGILIDWNIELRGAISEDNIKWLRNRCPRHYFTGKIIVDGAVLGRGMHI DQFLKKTNLKFSNDPFPLLYFCDLYDWDYTKAPIHRKEKYYTYMIRKSNDGTDIESFNAA INSRGIGDPPYDGPEYEKYVNGE >gi|226331998|gb|ACIB01000058.1| GENE 13 8202 - 8882 128 226 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566947|ref|ZP_04844399.1| ## NR: gi|253566947|ref|ZP_04844399.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 226 1 226 226 406 100.0 1e-112 MKISIILKILGLSIALAIISFFILNYLARSRLNRIVNPTFDITKVVAIERASISEHSEVY NDGRFIEYECVYKDSCLDLVNLGRVDSLFHHVEDIVEVTNTPIPFPNSFFPYLIAAEPYS VSIQITSFYNVYSMPYSSLNKVKLYIKGQIKNKISESTTTVKYEVSANRIDIALNNSPEV GLSYNIGIDNNRTDAELVFNLHNGKLYVVVLTKALDVNNLLKKYLP >gi|226331998|gb|ACIB01000058.1| GENE 14 8965 - 9156 144 63 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|301163023|emb|CBW22571.1| ## NR: gi|301163023|emb|CBW22571.1| hypothetical protein [Bacteroides fragilis 638R] # 1 63 1 63 63 130 98.0 2e-29 MKLKTFIVMGVFVCVQMGCTHRRAVPFNRKGWNEWDGHYAKYGNRGCQFELRINDNVTAE VKL >gi|226331998|gb|ACIB01000058.1| GENE 15 9169 - 10119 627 316 aa, chain + ## HITS:1 COG:no KEGG:BF1273 NR:ns ## KEGG: BF1273 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 7 311 5 333 352 69 26.0 1e-10 MKTIYSLLYGMLVLFSTCGQSKVQEKFDTITTYSYTNSYGKDGRLAEVLIKQQSQFLENG AYISGRAVEITQRYNYPDNDTYMITETDDLSPNEISTVIKGKTFEKYYTLKNNDTIKFRM RTYTDATQSKPLVERENYTLSGFPITDEEENINIESNYYYDDKGDLIKIIKRDFNSGEVV ETYGLKEVIADTTIIYYRINTDTSYAVKRYIDNGKTIEITFDENQKPQNKITNYTANGFD IKVRESVKNGIITVDSIYQIKGKKVRSVSVSPENKSVTTLEYDGKGNITKEICKSKSDIT ISNEELNEMIQRYKMI >gi|226331998|gb|ACIB01000058.1| GENE 16 10239 - 10496 160 85 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|301163025|emb|CBW22573.1| ## NR: gi|301163025|emb|CBW22573.1| hypothetical protein [Bacteroides fragilis 638R] # 1 85 42 126 126 153 98.0 3e-36 MPLLGIGTIVMYETQANDYTALKENLTEKKKKEMPYIRWDEVIKDARKDYLWGVIGSIIL NIIYSGFIIFILWMIIYEGVSTIFS >gi|226331998|gb|ACIB01000058.1| GENE 17 10531 - 11148 337 205 aa, chain + ## HITS:1 COG:no KEGG:BF2010 NR:ns ## KEGG: BF2010 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 6 199 1 194 348 392 98.0 1e-108 MEQKTMKGHKHSLNKYHQWADEVQVGKEAEQREERKWKKIFFIMALCVSLSFCAVFFVTC SLFETATITNLALSVVMMLVITSVWLIMILYLIIPKIYPKIPLINNVQLKYYGCFDFGNK WVSENIEVFKQFDDKKIIIDATSPTRNGAAILYIAILITLALFDFYPKVYDNLLDGDIWG TTQAYTVYHRILDERGVLLQIMITS 
>gi|226331998|gb|ACIB01000058.1| GENE 18 11145 - 11591 270 148 aa, chain + ## HITS:1 COG:no KEGG:BF2010 NR:ns ## KEGG: BF2010 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 148 201 348 348 296 99.0 2e-79 MINKLSFASNMALASVYAIFILLALLSVLSPRRTVIFDREKKTITIPPRWKIHKAETIPF YDAVITFGAKVDKKSFKGDEEIVIANYKHLVKGVSLQFAGRTNGYCFARLIQAYMTEDDL SNHPEFEAFKQIQEEMRINKLRKEAEKQ >gi|226331998|gb|ACIB01000058.1| GENE 19 11596 - 12498 225 300 aa, chain + ## HITS:1 COG:no KEGG:BF2009 NR:ns ## KEGG: BF2009 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 300 1 300 306 468 78.0 1e-131 MKSHKEYKNEYQHWLDEIQALNKAVQHDAKKWKIVGCFFALCASLSFCGSFFAACFICEI ATITNLTLSVGVTILGFCFWLVLIFYKVAPKVFPKIPLINNDEFGHYGCFDFRDEWVYKY IDVFKQFDEKRIVIDASNHDKVGVFLTCLLAFVLLALLEMFPFRDGNNNFVDNFLIVYAY TIVLILGIPMLLSPRRTIVFDREKKTITIPRRWLIHKEETIPFQKAVISFGQDALRGIDG VVIANYQHLVGEVSLQFAGRTNGYCFARLIQAYMTEEDLSNYPEFETFRQIQEEININRL >gi|226331998|gb|ACIB01000058.1| GENE 20 12770 - 13423 43 217 aa, chain + ## HITS:1 COG:no KEGG:BF1941 NR:ns ## KEGG: BF1941 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 15 214 102 304 342 153 44.0 3e-36 MYLLFKEARNLTVWIPLKNGESTGNAQWLPDSSGIPVEPEYLSRCDDKQIVIDNDGISMR MVIFLIWLFVFLVSVCAGLIPSNLTTEDYGSILLLLELYLIILTAGISAILQPWRRIVFD RVSKTVTIPGRLLLHKKETIPYSQTELTIRYYRHSSRLAADIIISNANSSSLLSGVSLMP GDLDKARRFARFIQLYMEEEELPDIPEFEKYRSKVNV >gi|226331998|gb|ACIB01000058.1| GENE 21 13747 - 14349 179 200 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566953|ref|ZP_04844405.1| ## NR: gi|253566953|ref|ZP_04844405.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 200 27 226 226 365 100.0 1e-100 MAQGRLNRVVNPTFDITKVVAIERASISEHSEVYNDGSFIEYECVYKDSCLDLVNLGRVD PLFHHVEDIVEVTNTPISFPNSFFPNLIAPEPYSVSVQITSFYNVYSMPYSSLNKVKLYI KGQIKNKISESTTTVKYEVSANRIDIALNNSSEVGLSYNLGIDNNRVDAELVFNLHNGNL YVVVLTKALDVNNLLKKYLP >gi|226331998|gb|ACIB01000058.1| GENE 22 14739 - 15686 593 315 aa, chain + ## HITS:1 COG:no KEGG:BT_0234 NR:ns ## KEGG: BT_0234 # Name: not_defined # Def: putative transposase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 315 1 316 316 372 59.0 1e-102 MKTTKKCVFCGRAFTTNSGMRKYCSVRCADEAKKAKKKRQQDLLNAVEPVLEIQYQEYLT FSKAAILMGCSRQYIYKLVAQGRLRASRISNRMAFIRRADIEKMLEDNPYNRVMPGTRPK LDTATGQKKSGKERQAAVPASEEILSYISGEEVLSIYKVKKSWLYTTAKRNQIPICKIAG KNYYSKSHVEECLGLTTDIAAITDWLTTEQVFEVFGMKSKAIHAYAYRHGIPTKKEYGIL YYSKTHLDELRRIDLTEDENYCTVEDVSEKYGLSKANIHRIVKVHGIGKRKVGIRNFLLR CDVERAMAERAAKRL >gi|226331998|gb|ACIB01000058.1| GENE 23 15827 - 16948 767 373 aa, chain + ## HITS:1 COG:no KEGG:BT_0233 NR:ns ## KEGG: BT_0233 # Name: not_defined # Def: integrase # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 370 1 373 373 475 60.0 1e-132 MANICKTVNLYTRKIKGGKMLSYFLDYYPGYRDEATNKVMRHESLGIYIYAHPKTQREKD YNARLTEKAEALRCRRYEEIVNERYEFFDRNKLNGSFIDYFKTYASKKNSRYEQAFLHFD RFVGGKCTFGEVTVELCNKFLEYLRSTPQAIHQKRKLHTNTIASYWSAFLGTLHTAHRDR KIKENPCPYLERVQTIPSDKVGLSAEELARVAETPCEIPVLKTAFLFSCLTGLRKSDVKT FSWEMIQPEADGTLYITTRMQKTKQIIYNPIGEEALQLIDGSREGLVFPGFKDSMTQAPF KKWIKAAGITKKLTFHGARHTFCSLQLDAGTDSRTVQELVGHKNLATTQRYLDSVNSRKK EAANRITLKRAAE >gi|226331998|gb|ACIB01000058.1| GENE 24 17121 - 17915 285 264 aa, chain + ## HITS:1 COG:no KEGG:BT_0232 NR:ns ## KEGG: BT_0232 # Name: not_defined # Def: putative protein involved in transposition # Organism: B.thetaiotaomicron # Pathway: not_defined # 10 263 7 247 255 124 31.0 2e-27 MKNHNTIVNTENCLFYGLPGIVILAVSLMASTRIGADFGGDTFVKTVSFLGCNGLLWLLY LVLCQYLLADLLAICLPKKKTATVGVLEAKPEAGMTLSETVAEQPTTPTPVLSQEQYQSY CADFERQKQEAREELATSVLGYVNRKMAPFTTEENLAQLCNEVRTWCDNPLHSPKAIRLK PVSNPKDRLRTVDFKHFVWNIGARLGFENGYSVQVQAGFIKRLFPNELADIELTSLARSL 
TSEPDKGHIKLDKPLHTDNYIFHS >gi|226331998|gb|ACIB01000058.1| GENE 25 18153 - 18530 353 125 aa, chain + ## HITS:1 COG:no KEGG:BT_0231 NR:ns ## KEGG: BT_0231 # Name: not_defined # Def: excisionase # Organism: B.thetaiotaomicron # Pathway: not_defined # 3 125 1 123 123 173 73.0 2e-42 MNMENGLTFNDLPQVVAGLRDEVIGMRQMLTSLQKENGQRKENTHRPMSVEDAAEYLKIP LRTLYMKLGNDTIPATKPGKRYVLYQDELDKWLECNRKNPVPLTVEEENAAVLASNRRKP YNRNW >gi|226331998|gb|ACIB01000058.1| GENE 26 18533 - 19555 557 340 aa, chain + ## HITS:1 COG:no KEGG:BF4233 NR:ns ## KEGG: BF4233 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 12 210 2 198 240 199 51.0 1e-49 METGEKNDNIRERPFNVIAAIGAELDSQRQQAENIPAQSGMFSVKTANLTIQEAANRPNP VPLWLTLWYQGEVCCLFADSNLGKSIYAVQIAAAIAEERKVLYFDFELSDKQFQLRYTNE ATGQCHRFPDDLYRVEIARDCLCPQDDFENALIMEIEQLSVKMGCKTLIIDNLSYLCMTS EKGEDAGRLMSRLMELKRKYGLSILILAHTPKRQLNMPITQNDLAGSKKLYNFFDNVFAI GKSAKDENLRYIKQLKVRYGNFEYGSNNVIVCAIEKADDFLRFVTLGHAAESAHLKEITE EERGRMRNEARTLHSQGMSYRAIGEKLGISKTTVERYCKQ >gi|226331998|gb|ACIB01000058.1| GENE 27 19841 - 20368 398 175 aa, chain + ## HITS:1 COG:no KEGG:BT_0229 NR:ns ## KEGG: BT_0229 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 3 168 7 174 341 134 39.0 1e-30 MMYQLEKYKNRSSRHTCPKCGRPRCFTYYVDENGNPLDKSVGRCDHESGCGYHYPPKDYF KDHPDKDMPETRPFPSKAIRKGNHSNTPIDTIPMEYVTRSRSDNSHLAQFLFSLQKDNEA VLKRVLDDYRMGATRNGATIFWQIDRDNRVRGGKIIPYNKEGGHRIKDKGVNWGA >gi|226331998|gb|ACIB01000058.1| GENE 28 20373 - 20900 266 175 aa, chain + ## HITS:1 COG:no KEGG:BVU_3673 NR:ns ## KEGG: BVU_3673 # Name: not_defined # Def: hypothetical protein # Organism: B.vulgatus # Pathway: not_defined # 4 168 54 206 230 108 38.0 6e-23 MKKQGVFNQDWTLAQCLFGEHLLSLPANQDKVVAVVESEKTALVCSVQYPDCVWLATGGK SQLSVDKMKVLANRTVVFFPDMDGYHEWTECVKAFTFCRNAKVSDVLEQNATEADRLAKI DIAGLVLREWQSLRRYHEDTPLARVRRMIQEMTERNPVLQTLIDTFGLVPVVDDG >gi|226331998|gb|ACIB01000058.1| GENE 29 21155 - 22576 798 473 aa, chain + ## HITS:1 COG:no KEGG:BVU_1439 NR:ns ## 
KEGG: BVU_1439 # Name: not_defined # Def: mobilization protein # Organism: B.vulgatus # Pathway: not_defined # 3 351 4 353 467 235 39.0 3e-60 MAQTSDHIKPCNIGSSEAHNRRTREYLKHIGKEKFYIRTDLTPQNSSWVSPLMEGKDLTA YYNEIARMVKEKTGRAMQTRERTVTNKKTGKTKVVSGSSPLRESVVVCKADTTIDQLRKY CDRCHERWGITALQIFIHLDEGHYGIPGNSSTWKPNCHAHIVWDWMNHETGKSCKLGKAD MSLMQDMVAECLEMERGTRKEETGKAHLERTDFIIAKQKREVEEADNRKKSLDHENQVRE KIGAELDNEIARKQNKANRENGNTILSGLARLTGKGQFAETEKENAALKKRLADMVRRIK AMEQEYRSQLGQASERQASLEATIKRLQSEMAHLRKEAEEKDRQIACLDRLAYPQRYRLS SGAELTHIHVPNYLHPSLHIWTKVGNELFDDVKYGISYETAQRHLRGELTDEEFVNAVFE PQEQVSVAQAQLIAWGCFYVGKRWTGTGSCGYWRRRLAIGFAVERKEKKWLWC >gi|226331998|gb|ACIB01000058.1| GENE 30 22763 - 26986 2319 1407 aa, chain + ## HITS:1 COG:no KEGG:CHU_2948 NR:ns ## KEGG: CHU_2948 # Name: not_defined # Def: hypothetical protein # Organism: C.hutchinsonii # Pathway: not_defined # 17 610 20 597 861 203 30.0 4e-50 MSDKISAREAFDNATYKQAADKIRQILSAIRNNPASSAKRWVWELMQNAKDIPNKYGKVS VEIELISENELQFRHNGNPFGIKNITGLIRQVSSKDSLNSDEETTGKFGTGFICTHLLSD VIDVDGVLNYDTYRKFTLSLDRSGRSSEELMPRIKEVESVFYEPERHFDEITDYEANREE GDYDTVFTYHLTSVEKLESAQAGLKDLVNTLPITLVTQSKKIKQVHVIDRISKSDVIYKC NSVELDDNVTFSKISINDTVKMYLSYITEEVALTIEIKQTEDGYELIKRDSKQPVLYRDF PLIGSEKFYFPFTLNGFRLCPTEKRDNIPLNGEDNEEAKDNRTIIEHAVDTAIKFNEWLI AHNATNRYLLAFSRRPEPEVAYDERVALPWIKDLQSEWRRQLLSQPLTETKNGVYELSEI SVPSFAGYGESNAKSTNEKFFDLVDGFYLGRGQLPKKEHLQGWLDVLRPEYSTWNADLKY EKDDFLTDLESAKSVSQLCSQLNKSEAEIYNWLNDVYSFLIEQNCLNDFDKYSIIPNMEG TFMKLEELSSDNSEPIPSKLMDLYNKVMPSTINSKILNYSINSAVFGNSLKVFALKDIIE WFNKKITSKDTYISDGKSYYANFYLAYNIIELYPKGIDDSEYVSYRKQLYNFSNADGRNN PFSPIAVDNRDLWREADIYWFNDNYRYIADKKTVDNLSKTYFKEPKSIEDTLNWLNEYIQ FYRDNSKGDLIKDKCIFPNQQLDLKSLNYLRYDDDIEEVFKDLADYALNVENTNDKYRHV LLHRSISGYEKHNPLTLNEVYKYIKEDVFDKSSGNIRDVISKHAISIIKRSEDSESQETK LYGFVKTVFGDSIPKIAYVDQSTGFNWGFAQEYYLKKLCKTIAESVNLAGLKELSDGFVD YTDKDLIEWVDSLIEFLHSYKSKKYWTIITDSDNGYGIWLNQNSDFCRFQDIRKDENIPN ELIELVANNKHVAFDYKEQLYNLDAAYTSYLETNPVTIEEVGEYLDEKIEKYEGDKQDKD FAALIFAVGKLCSSHKELGKIMKYYSEKKNALIVGSLGEGKTLDLVGSIIQHGDEKLQAI 
NDILNNNTVEDLCEVSKILRGCPDGKIEKLKDFISKISGEEPTVIDEDTPIGGEDSMDLV LVPKTYEIENVENFEGQILNVKADQAQYAGLSLEEIEKYVSEAKDRVVKYFRELNDKHDS GYQFDSEKICKHSYSQLYGISDKDGNEIPIVVHSYKGPQYRYFDLNWYDWQLLSCKNSLL FVLTTTGLQCIPLYALPVRNVNIEIDNEMADMNRAALLTLAAVSKEYSSISFDFGNNMPH GFDKLLPFNFVPKEIKECVQSIKEVCDQNIPQISNMYNSARNIPLIRSTEGYSVALKEYE ETGKMRDEFEAPINDLKVPEVGTTYID >gi|226331998|gb|ACIB01000058.1| GENE 31 26994 - 28787 287 597 aa, chain + ## HITS:1 COG:VNG1501G KEGG:ns NR:ns ## COG: VNG1501G COG1112 # Protein_GI_number: 15790495 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases and helicase subunits # Organism: Halobacterium sp. NRC-1 # 185 544 447 802 821 85 26.0 2e-16 MNIDRKKHWQFLEDELKAETEEFKKIYLTTAISLLKNSGGMHVAQFLSFKDGEMIMKFSV SRPLPRKGDFLVCMVLPPQLQDYRTWGDKTYRDLYKARYNSTDCVCIWHSPANDPRYSLV GFSKVSVDFANYIKESPNIVLTFAPQRPPIDYAINLQKVVEDKWSEGVASVLDSSYHHKD WEPTLIKQNNVSEFVHSQLSLTDTMLLEGPPGTGKTYMIAELCAKLCAEGKSVLVTALTN RALMEIAEKQAVETLLNEHKIFKTNISIDEIREIRNLENIKCVAPMPGCLVLSTYFIISG FAAELSIEQPFDYVIMDEASQAILTMFAASRKIGKKNLWVGDINQLSPIVLLNKELVKLC NYKPMIEGLKLLADNSSSPIYQLTTAYRFGQRAADYTGIFYNNSLVAKESKRYNDLPSLS NILCKDGGPTLILTDMPIGDCTPTFAMYIVSFIVTNIFKDAKDKEVAVLTCMKKTVSTLQ MTIAQKVSSRKNLVVDTVARVQGLTTDVTIFVVPDYSYIRTLEPRLFNVATSRAKEHTII IADKYIFDCSTLDIRVRKYLEKLKANKCIYVPDPEHGLGKVTNILDYQDSIGRLLFS >gi|226331998|gb|ACIB01000058.1| GENE 32 28819 - 28941 75 40 aa, chain + ## HITS:1 COG:no KEGG:PRU_0534 NR:ns ## KEGG: PRU_0534 # Name: not_defined # Def: divergent AAA domain-containing protein # Organism: P.ruminicola # Pathway: not_defined # 1 37 1 37 530 77 94.0 1e-13 MALAININDLLNKQKIESNRIEFKKGWNPTSIYHSVCFRQ >gi|226331998|gb|ACIB01000058.1| GENE 33 29099 - 30397 934 432 aa, chain + ## HITS:1 COG:FN0830 KEGG:ns NR:ns ## COG: FN0830 COG2865 # Protein_GI_number: 19704165 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Fusobacterium nucleatum # 7 427 
110 512 522 93 23.0 7e-19 MPRTSVEEVDGKYVLVIWCPAGINRPYSVPENVTAKNGSKEYFYIRSGTSSIIAKGEVLD ELRELASRVPFDERGNPDIRLEDISTLLLREYLVKVESKLASDINIKPLHDILEQMDLFV GPKENRMLRNVVAMMFCENPSKFFKRTQVEIVYFPEGRLNNPSNLYEGAVITGSVPQIID RTLEYLKRMLVMQTIIKLKDDYRSKKFYTYPYQALEESVTNSLYHRDYREWEPVVITVEP DGITIQNVGGPDRSISAADISRCEILVSKRYRNRRLGEYLKELDMTEGRSTGIPTIQNVL ENNGSPRATVVTDEDRTFFRITIPCHEAVGNIIADIAYKDDSLKASRRGVLKSGLESTLE STLQSVLENAPKSTLEIIKQIQGNPKATYADIANATGYSRSWVAKTVKRLQEQGIIGRIG SDKTGYWEIINK >gi|226331998|gb|ACIB01000058.1| GENE 34 30411 - 30692 388 93 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|253566965|ref|ZP_04844417.1| ## NR: gi|253566965|ref|ZP_04844417.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 93 1 93 93 149 100.0 5e-35 MLLGKRIKELRDENGVLQRQLAALLEIDTPMFSKIERGDRRAKRTQVVQLANYFNVDEKE LLTLWVADKVLDAVEDEDEFKHDAIKVAQDVIE >gi|226331998|gb|ACIB01000058.1| GENE 35 31010 - 32185 647 391 aa, chain - ## HITS:1 COG:no KEGG:PRU_0911 NR:ns ## KEGG: PRU_0911 # Name: not_defined # Def: prophage PRU01 site-specific recombinase # Organism: P.ruminicola # Pathway: not_defined # 1 384 1 375 381 181 32.0 3e-44 MSKYVRLFFNRKKKATETVAGAIEIVVCLQRERCYFHSGHSVTLNQWEEGTVVNHPRAII INEEIQKEVTKFEHILIAMQVNKDDMTIAQFKTYLGTSSGNRRNFMAWLRERIENRPLRE GTRKGHFTTYKALERFGKFKTFDDVSLQHIHEFDLFIKEEQTCTTKGKPIVRSQAAIHNY HKRFKSYVSEAFRIGLIKENPYERFQDKRGEKSDRPHLTEAQLRKLIRMRDESTDNVMNR YLDFFLFQTFTGMAYSDAKSFDYARHVVTIDGKDYIDGHRIKTGSEFVTPILPITRKILE RNNYKMEITSNQKYNQFLKGIGLALNSSFPLTTHIARHTFACTVALGQGISKEVLQIMMG HTSIKTTEIYAKLPIQYVSQNIGDKMLKNWK >gi|226331998|gb|ACIB01000058.1| GENE 36 32788 - 33408 562 206 aa, chain + ## HITS:1 COG:no KEGG:BF2032 NR:ns ## KEGG: BF2032 # Name: not_defined # Def: TetR family transcriptional regulator # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 206 20 225 225 393 100.0 1e-108 MKITRDELLIAAFKLFMSVNYEKASFAELGKMLGMSKAGIFKYYKNKQELFIAVVDKFWF STQNPRNKFTETNGTFAEFIDEYVRGVQRTMDMLGDLIGAEREKVAQGKFTYHAQYFHFL FQLLQYDPDAKEKLRNLVEVDYAYWRAAIQRAIATGELREDVDVEDAVVMFRQVYMGLSF EMAFMGGLNTQRLAKHLHAVYSLLKR 
>gi|226331998|gb|ACIB01000058.1| GENE 37 33567 - 34481 553 304 aa, chain + ## HITS:1 COG:mll9328 KEGG:ns NR:ns ## COG: mll9328 COG0582 # Protein_GI_number: 13488149 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Mesorhizobium loti # 99 300 18 219 299 114 32.0 2e-25 MDASKQRGSIVLTKQNIDGQDYIRIEYTDNQTIALLLSQDKGIKTIGNGSAYIQAFTFPF PEFYVRYSPHAYIDYSRVYVRHPNLQREYVLPKGYLELLEQKCYSPSTIKTYRIYFSDFM EYHKGRNIDRLKVADINHYILYLVNEKKISVSQQNMRINAIKFYYEKVKGGKWQYYGGIT GAKEYKTLPEVFSRNEISRILSCLPNLKHHCMISLIYSVGLRRSELLNLIPKDIISERML VRIMGKRKKCRYSLLSEKVLNELRTYYKEYRPKKWLFEGDSPGEQYSASALVKVLKRSCR TCRY >gi|226331998|gb|ACIB01000058.1| GENE 38 34775 - 35362 221 195 aa, chain + ## HITS:1 COG:no KEGG:BF1967 NR:ns ## KEGG: BF1967 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 195 1 195 195 338 85.0 8e-92 MLIEQFSYFTFSCFQCSLENIVKTLSADFLENGKRKLSFRPFVFDLYDNSPLKGGAHFEK AYFFAPATNRNICVMYSNYSDGWNTLARCLSSKLQCDCYNFQITNIDSPDSMNSFQLIQN GIDVRTVYVMKDPKWIFYENGIVQWFEDKSYYKRRLVKNRVNKDILLSYCVKLGFAITEA KFWESKDAILFERIQ >gi|226331998|gb|ACIB01000058.1| GENE 39 35451 - 36311 435 286 aa, chain + ## HITS:1 COG:no KEGG:BF2036 NR:ns ## KEGG: BF2036 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 4 286 1 283 283 540 98.0 1e-152 MKIMEIKYIFCIVFCFWAMKTTMAQSDNNLFALEKQLCFIQDTLSILRQNYPYTDDNYCD SLQHRFSILLEELCAVDKEMKYDFIELRKKERQFTMAVSVDEDMRVFSRNTYFGGSMPLF ASYIQYKDKEHLYFFDINEDNDMGICYDTIYSIQALNKRYYLLSGTSQIVAAYPLAVMKA VSCANGELKKEIIFVSGNQQTDYLSISYRYVKDNIDTRLFISGNLTFPRIVYVETQEEIL KPVTVRDKDDIKYIGGEIDVYKLERNKNEIKFINNNESYHLNDDPF >gi|226331998|gb|ACIB01000058.1| GENE 40 36336 - 36809 118 157 aa, chain + ## HITS:1 COG:no KEGG:BF2037 NR:ns ## KEGG: BF2037 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 157 1 157 157 254 99.0 7e-67 MDIAKIIYVSLTMLTAWVFMSIAFPIDTYKHLLGVVGSYICVSCSALFASWFAWHWVHPF 
SYVKYLLCGLLTAFLFHAGLTIGCSIPIIFLSCCGVHVISTEPITLWEVLSYLFSIFIMS FYFVGIFTFGQCLLTASITKIIIWKLNIGNNITNNAR >gi|226331998|gb|ACIB01000058.1| GENE 41 36852 - 37319 243 155 aa, chain + ## HITS:1 COG:no KEGG:BF2038 NR:ns ## KEGG: BF2038 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 155 1 155 155 218 98.0 4e-56 MNKTVQSILSLTAVSLLLIVITFFLGGLGFGRLFGIYGIMEDSFLGMLMVIDMLVIYKVY KLYFRQPKAIMLLFGLECCSILLWLVFICLDQHLLDWQITNLLLQAVSLDGFAGDERGLN MLAVLCGMIYPLIGIGCLFTGLRFINKLVTTKDSI >gi|226331998|gb|ACIB01000058.1| GENE 42 37363 - 37794 522 143 aa, chain + ## HITS:1 COG:no KEGG:BF2039 NR:ns ## KEGG: BF2039 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 143 1 143 143 258 98.0 4e-68 MGDIDILNKFDNDKLIDVVKNYKRYGYDDELRDYAINLLGERGWSREDLQQFGYLTNYDY DEAEKQYKAYNRNSLIGICTLVFSGGILAVVYLIFLIQAYRNVAKFYNALGHNQDETALF NALGVLAYFHLKGRMKKELKGIR >gi|226331998|gb|ACIB01000058.1| GENE 43 38012 - 38470 451 152 aa, chain + ## HITS:1 COG:no KEGG:BF1973 NR:ns ## KEGG: BF1973 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 3 152 1 150 150 293 100.0 2e-78 MIMKKFLFILTVITCFFASSVEARRVRFTIPVDNSIPKVVTLPDSSYYKTDEGSHLDLGY IEEDGKRTLVLFSESKPNTYYDISDEYAEVICRDLNVEELDTLIPKPTFWDQWGGSILFY GIAALVVIGVVSYLKDFIFNLLGLSKKKEEEE >gi|226331998|gb|ACIB01000058.1| GENE 44 38561 - 39349 669 262 aa, chain + ## HITS:1 COG:no KEGG:BF1974 NR:ns ## KEGG: BF1974 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 262 1 262 262 527 100.0 1e-148 MTDFNWMLWIVTPIIIVYYLYRHVWPAVRKFIRLFQGIRINPRSHLTEAEYKKLSVGSLY ALQQGAYLNSLTLDIKDKLPTILADWWGICNAQDAKQTLEYLGKKGFAYYFPHVYQAFLL DDEEAKDRIFQQHMDSQEDYDKAVEQLHNLEDCYDELLECGTITCREDLLRYGVTGWDAG RLNFMARACYDMKYISEDEAWHYINHAYEMVHSRFSSWHDFAMSYVIGRALWGGKSASNS GMMYMAEDLLKSEKSPWTKIEW >gi|226331998|gb|ACIB01000058.1| GENE 45 40674 - 40814 76 46 aa, chain - ## HITS:0 COG:no KEGG:no NR:no 
MRISSKIMLLISAHCRVYAGYYTCVLQKTSLDLRIYEMISIELHDI >gi|226331998|gb|ACIB01000058.1| GENE 46 41236 - 42810 702 524 aa, chain + ## HITS:1 COG:DRB0135_1 KEGG:ns NR:ns ## COG: DRB0135_1 COG1205 # Protein_GI_number: 10957431 # Func_class: R General function prediction only # Function: Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster # Organism: Deinococcus radiodurans # 1 500 1 472 1102 245 33.0 2e-64 MKAFKNHQSIIEDYKKFLKSFFTISDRRIKEEVEKKFEENSSCDSYIPEPLIQFNPSYKT GLSLDDIPGAHSELKNILGKFSLYHHQVEAIKKGIENKGFIVTSGTGSGKSLTFLATIFN DVLKSPGKGIRAIIVYPMNALINSQEEEIKKYEINYLKSLLRPNIPLQQQNEGLDEQITE LKSLTKGTFPVTYAKYTGQENATTKEIIREEEPHIILTNYMMLELIMTRYEERVFRESMS KYLKFLAFDELHTYRGRQGADVAMLIRRIKNLCNNKLVCIGTSATMSSEGNADERKKAVG AVADLIFDESYSLNQIIGEQLETSTDYAEQIPSSSQLSRSVHESIDSNDGADKFKSHPLA IWLERKIALDDTNGYFERGKPQPLSAIIQQLSKDSGENSDDCKNAMDGLLGWIEQLNIKA ADENPRKSYLPFKIHQFISQTGTVQVTLDSRDARYITLNDELYTKINGAETLLYPVLFSR YSGVDFLCVKLINDKIMPRDPDNMDDLPPKITQEDIKADKVTGR >gi|226331998|gb|ACIB01000058.1| GENE 47 42856 - 46494 1378 1212 aa, chain + ## HITS:1 COG:DRB0135_1 KEGG:ns NR:ns ## COG: DRB0135_1 COG1205 # Protein_GI_number: 10957431 # Func_class: R General function prediction only # Function: Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster # Organism: Deinococcus radiodurans # 7 611 496 1099 1102 265 33.0 5e-70 MQDDDEDEVWNEDDKDYLPDSWFTETLKYGRRIKNFYEYRLPQLLHFDKNGNYSWKNDNK LPLKAWYIPAYLLFDPTSGIIFDLKTNENTKLSRLGNEGRSSATTILGANILKSLSENNI PLRNQKFLSFTDNRQDASLQAGHFNDFFTIARLRSAIYKSVRNNLDGLDSNRIGIEVFKE LNLREDEYAKYLSKNPNWPDEGNINAIQKYIIIRILYDLKLGWRYTTPNLEQTALVQIGY KKLNDFCKIDSFFSHLSWLKNMSSEEREYTFTQILNYFRTSFAFDHIYLVQKREEIETEL KNLLDEKKPWSLDFEEKIDVPYVLLPYSKGKAKTREYTASIGPASYLGKYIKRLVKDRTG QILKQDDLSKEIDTICQILVEGHYLNQIPVKTSEGTMHGYRIRVDHILWKPGDEVHTLRD EVRFMKYKNIESTPNLYFQDFYKQDFKTYERQGKEHTGQLGNAKRIEREDDFREGKLSAL FCSPTMELGIDISQLNIVHMRNVPPRPDNYSQRSGRAGRSGQSALVFTFCSQKSPHDRNY FKDPTKMVHGSVIPPRMDITNMELIRSHLDAFIMMELGIDIKVSISEIIDINKPELPVYE 
HIRLKIEEIQKHQFLQWAKQFDNILRNINKIDDTSWYSDKNWLIDQVQFFYKRFDNAFKR WRILYTNTKKTLEEAHSVISTPNSPKIGDAKRMQAIALRQRDLLLNKSNSSSGNESEFYV FRYLASEGFLPGYNFTRLPVRTFVGNKSAEQGEYVSRPRFIALKEFGPNNLIYHDGSKYK VVKMQLNQNGEQMHTIKISKETGYVFVGKEAENASNDPITRKELKDFDSFEKFSNVVELN ESEARIEERISCNEEYRTSQGYDIDHYFSFSKGIDNTQKAIIKAAEHDLLQVIYDQSATL LQINNKWKITKDDSFPIGTISGRWKSRIEAEQSNPDEPIAGVRIYTTTTSDILYIQPVKE LHLKEEGVCTLAYALKRAIEKVCQVEESEIGVWIMGKNESKNILLYESSEGSLGILKDLL NNAAQLQAIFKEAYNLLGFDPETLIDTRPDTPKASYDDLLSYYNQQYHEKLDRFSVKEAL EKLMRCNIDNQQGGRTLEEQYNYLLRTYDLNSSTEKPLIEYLYRNGLKLPDKAQMNIQNL YVNADFVYKISDNQFALIFCDGSVHDQYEIKKEDQMKRQNCRDAGYEVVEWHYTEPIEEF VKRNKHIFKVVR >gi|226331998|gb|ACIB01000058.1| GENE 48 46496 - 49420 1895 974 aa, chain + ## HITS:1 COG:DRB0136 KEGG:ns NR:ns ## COG: DRB0136 COG0553 # Protein_GI_number: 10957432 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Deinococcus radiodurans # 13 969 6 934 941 504 35.0 1e-142 MTESITNQDIKPGMLVYFRHRQWVVLPSNDKDIVQLKPIGGSDAEATAVYRPLQLPSDSM YKAEFQYPEKKDLADFQSAKILYNAAKLSFRNACGPFRCMGKLSFRPRSYQVVPLVMALK QKVVRLLIADDVGVGKTIEALMILKELIERGEIERFTIICLPHLCEQWQKELKDKLDIDA EIIRSSTIASLERKLQGDISVFKHYPYQVISIDYIKLKDKLGRREMFMNDCPELVIVDEV HTCARPAGKGDQLRYDLLKEVSSRPDRHIVLLTATPHSGKDEEFQSLIGLLNPEFAEYNI SGMDENKRKKLARHFVQRKRENLKRWRKHSEEQNPFPERDSKEIRYSLSEEYLSLYNGIL DFARGLSTQTGKKIKNTSPIKYWAALALLRGAMSSPAAGLSMLKNRKNKLSEEIQEDEED FYFRSTLFDKELNRDDSLTTNAINEYEVQEKNDDFLEILTQRAEELCNYPSDNKLSKAIS VLSDWTKGVQEIVDGVKQKVVYHPIVFCKYIQTAKYIGEKLKDYFGNKVQVVVATSELAD EQRRDLIDSIHPDKPRILVATDCLSEGINLQELFNAVLHYDLPWNPNRIEQRDGRVDRFG QESPRIKTFILLGENNDVDKIVWDVLIKKIYEIRNSIGVNISIGDDDSSVMEGLIKKLIT GEEEASNKQLSLFADDHITNVIEQMRRKAENIRSIFAHEAVSAEEIENELKEVDDAIGDI SSVESFVTSSIIALRGTCTPIYARKTNREDGIQSYKQGYRIDLTNLPTHLQSMLPKNGKN ITICFESPTPAGLVYVGRNHKFVEQLCQFMLSLAIDMNKNSAYPRVARSAVVLTQKVKTR TTLIQFRVRNVIREIGSKREVIAEEVFLWGYEGSHADSRILSEDESKQLLFEVESEENMS QESQENEFDYEQQRFSSKAEAFKEVAQQRADHLVEAHGRFRSLVGGKSYETVIPVLPPDI LGVYIFKPVATPLF 
>gi|226331998|gb|ACIB01000058.1| GENE 49 49433 - 52193 1155 920 aa, chain + ## HITS:1 COG:DRB0137 KEGG:ns NR:ns ## COG: DRB0137 COG1002 # Protein_GI_number: 10957433 # Func_class: V Defense mechanisms # Function: Type II restriction enzyme, methylase subunits # Organism: Deinococcus radiodurans # 32 638 37 587 609 270 32.0 1e-71 MNNTSVNIQGNIISSEIIEKIRSEEQGYYQRPSYFDTDHSLREEIGNAWINAKALWNIFK SKKERVKDEDSGTTETRKSWMEPFLAELNFTATKALVYQHAESGKRFDISHRDDELDGFP IHLVSFKQSLDKATVNGKSSPHALVQEYLNKVEHTYALVSNGIFLRLLRDSSRIVRISYY EFNLEKMMEEDLFSDFAILYRTLHSSRFKQKNDFPDGCIFETYHLQSLESGSRLRSSLSS AVINALVGYPNGIVSAKNQNFKKQTGLANGFIQNPHNLELREQIASGRIAADTFYAELLR LIYRFLFLIVTEERDLVYPETRDEEVLRKKQIYYRYYSIERLRKLAGKLLYADNSKDDLW EGMKSTFMLFENKYYGEKLGIQALSSGIFATDALRTLMTLRLDNCTLIDTIAGLCFFVNN QTGQTVRVNYSDLDVEEFGSVYEGLLEYSPAFNLKNETPHFQLKEGKFRSDSGSHYTPEE LCKPLIQYSLEYILEERLLQSKIDKTIPLHRIIGQERETAIRALLGIKVCDVACGSGHLL LSAARRIALEVARLYTGEEQPNPVAMRQAMRIVIKNCIYGVDKNPLAVELCKVALWLEAH NPGEPLNFLDHHIKCGDSIVGLGHKDELMNGIPDEAFKVLPEDDKEIAKLYRTENKKGAG LIQQIGLFENKVGDSLEELIKQFDSLNQMQENTPEQVAEKERQYRMLMNKSAMQRLRTLA DMQVAQFFVPKTDENKNKLVTNKDYFEYLRGGQIPMQLASGIIEVFTKYQFFHWFLEFPE VFTQGGFDCIIGNPPFLGGQKLSGAFGNSFLKFVKYAYKPAGAVDLVTYFFRRIFNILRP NGFQSLVSTNTISQGSAREGGLDVICSKEGVINHAVRSMRWPGDAAVEISLVTIHKGEWS KDIVLDKKKVERITPYLDDS Prediction of potential genes in microbial genomes Time: Wed May 18 00:24:24 2011 Seq name: gi|226331997|gb|ACIB01000059.1| Bacteroides sp. 3_2_5 cont1.59, whole genome shotgun sequence Length of sequence - 169207 bp Number of predicted genes - 143, with homology - 138 Number of transcription units - 74, operones - 35 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 198 - 257 4.5 1 1 Op 1 . + CDS 334 - 924 709 ## BF2734 ATP synthase subunit E 2 1 Op 2 . 
+ CDS 936 - 1775 780 ## BF2733 hypothetical protein 3 1 Op 3 16/0.000 + CDS 1812 - 3569 1813 ## COG1155 Archaeal/vacuolar-type H+-ATPase subunit A 4 1 Op 4 16/0.000 + CDS 3599 - 4918 1639 ## COG1156 Archaeal/vacuolar-type H+-ATPase subunit B 5 1 Op 5 4/0.000 + CDS 4931 - 5536 694 ## COG1394 Archaeal/vacuolar-type H+-ATPase subunit D 6 1 Op 6 16/0.000 + CDS 5533 - 7350 1760 ## COG1269 Archaeal/vacuolar-type H+-ATPase subunit I 7 1 Op 7 . + CDS 7395 - 7856 632 ## COG0636 F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K + Term 7876 - 7919 8.7 8 2 Tu 1 . - CDS 7879 - 8022 64 ## - Prom 8104 - 8163 5.1 + Prom 7946 - 8005 7.2 9 3 Op 1 . + CDS 8086 - 9747 1625 ## COG0438 Glycosyltransferase 10 3 Op 2 . + CDS 9781 - 12345 2605 ## COG0058 Glucan phosphorylase + Term 12383 - 12437 16.4 + Prom 12356 - 12415 6.3 11 4 Tu 1 . + CDS 12446 - 13096 581 ## BF2725 hypothetical protein + Term 13201 - 13253 0.2 12 5 Tu 1 . - CDS 13238 - 14341 1413 ## COG0526 Thiol-disulfide isomerase and thioredoxins - Prom 14362 - 14421 2.9 + Prom 14370 - 14429 9.3 13 6 Op 1 30/0.000 + CDS 14479 - 15864 1564 ## COG3842 ABC-type spermidine/putrescine transport systems, ATPase components + Term 15945 - 15971 -0.6 14 6 Op 2 36/0.000 + CDS 15980 - 16678 552 ## COG1176 ABC-type spermidine/putrescine transport system, permease component I 15 6 Op 3 25/0.000 + CDS 16672 - 17460 756 ## COG1177 ABC-type spermidine/putrescine transport system, permease component II + Prom 17462 - 17521 2.0 16 6 Op 4 . + CDS 17629 - 18963 1366 ## COG0687 Spermidine/putrescine-binding periplasmic protein + Prom 19000 - 19059 5.4 17 7 Tu 1 . + CDS 19179 - 19547 506 ## COG0724 RNA-binding proteins (RRM domain) + Term 19581 - 19636 13.5 + Prom 19613 - 19672 5.4 18 8 Op 1 . + CDS 19752 - 20987 845 ## BF2733 putative lipoprotein 19 8 Op 2 . + CDS 20987 - 21382 255 ## BF2732 hypothetical protein + Term 21423 - 21475 4.1 - Term 21418 - 21453 6.5 20 9 Op 1 . 
- CDS 21497 - 22042 901 ## PROTEIN SUPPORTED gi|60682204|ref|YP_212348.1| 30S ribosomal protein S16 - Prom 22101 - 22160 4.2 21 9 Op 2 . - CDS 22195 - 22557 312 ## BF2730 hypothetical protein - Prom 22793 - 22852 5.1 + Prom 22483 - 22542 2.9 22 10 Tu 1 . + CDS 22562 - 22732 57 ## + Prom 22752 - 22811 4.7 23 11 Tu 1 . + CDS 22872 - 24728 1728 ## BF2729 hypothetical protein 24 12 Tu 1 . - CDS 24741 - 26042 1235 ## COG1757 Na+/H+ antiporter - Prom 26108 - 26167 4.4 + Prom 26457 - 26516 5.8 25 13 Op 1 5/0.000 + CDS 26642 - 28039 1040 ## COG1690 Uncharacterized conserved protein 26 13 Op 2 . + CDS 28044 - 28664 512 ## COG1186 Protein chain release factor B 27 13 Op 3 . + CDS 28690 - 29403 548 ## BF2709 hypothetical protein 28 13 Op 4 . + CDS 29400 - 30512 743 ## COG0617 tRNA nucleotidyltransferase/poly(A) polymerase + Term 30679 - 30712 -0.9 29 14 Tu 1 . - CDS 30948 - 34109 2282 ## COG0642 Signal transduction histidine kinase - Prom 34258 - 34317 6.3 + Prom 34294 - 34353 4.4 30 15 Op 1 . + CDS 34458 - 34814 269 ## COG1733 Predicted transcriptional regulators 31 15 Op 2 . + CDS 34811 - 35257 339 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases - Term 35247 - 35305 3.3 32 16 Tu 1 . - CDS 35321 - 36385 839 ## COG1879 ABC-type sugar transport system, periplasmic component - Prom 36584 - 36643 8.9 + Prom 36422 - 36481 8.2 33 17 Op 1 8/0.000 + CDS 36639 - 37664 934 ## COG0524 Sugar kinases, ribokinase family 34 17 Op 2 . + CDS 37711 - 38379 691 ## COG0800 2-keto-3-deoxy-6-phosphogluconate aldolase + Term 38380 - 38444 19.0 - Term 38368 - 38431 17.2 35 18 Op 1 12/0.000 - CDS 38458 - 38982 449 ## COG3610 Uncharacterized conserved protein 36 18 Op 2 . - CDS 38979 - 39752 576 ## COG2966 Uncharacterized conserved protein 37 18 Op 3 . - CDS 39762 - 40010 234 ## BF2715 hypothetical protein - Prom 40192 - 40251 4.9 38 19 Op 1 24/0.000 - CDS 40439 - 41488 928 ## COG0208 Ribonucleotide reductase, beta subunit 39 19 Op 2 . 
- CDS 41514 - 44033 2271 ## COG0209 Ribonucleotide reductase, alpha subunit - Prom 44164 - 44223 3.0 40 20 Op 1 4/0.000 - CDS 44493 - 44819 394 ## COG4744 Uncharacterized conserved protein 41 20 Op 2 . - CDS 44816 - 45418 635 ## COG0811 Biopolymer transport proteins 42 20 Op 3 . - CDS 45463 - 46146 736 ## BF2710 hypothetical protein 43 20 Op 4 1/0.125 - CDS 46155 - 50474 4067 ## COG1429 Cobalamin biosynthesis protein CobN and related Mg-chelatases 44 20 Op 5 . - CDS 50559 - 52703 2242 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins 45 20 Op 6 . - CDS 52733 - 53401 546 ## BF2687 hypothetical protein 46 20 Op 7 . - CDS 53444 - 54289 865 ## BT_0498 hypothetical protein - Prom 54364 - 54423 5.0 + Prom 54652 - 54711 5.1 47 21 Op 1 . + CDS 54731 - 55075 266 ## BF2704 hypothetical protein + Term 55084 - 55121 2.2 48 21 Op 2 . + CDS 55123 - 56031 838 ## COG1230 Co/Zn/Cd efflux system component 49 22 Tu 1 . + CDS 56708 - 58306 1724 ## COG2985 Predicted permease + Term 58332 - 58394 8.5 + Prom 58309 - 58368 3.7 50 23 Op 1 . + CDS 58476 - 58574 77 ## 51 23 Op 2 19/0.000 + CDS 58650 - 59156 381 ## COG1566 Multidrug resistance efflux pump 52 23 Op 3 . + CDS 59153 - 60682 1249 ## COG0477 Permeases of the major facilitator superfamily + Term 60690 - 60722 -0.2 + Prom 60755 - 60814 4.8 53 24 Tu 1 . + CDS 60834 - 62021 1206 ## COG2311 Predicted membrane protein + Term 62022 - 62061 2.3 + Prom 62053 - 62112 5.3 54 25 Op 1 . + CDS 62251 - 62595 206 ## BF2676 hypothetical protein 55 25 Op 2 . + CDS 62660 - 64969 1977 ## COG4771 Outer membrane receptor for ferrienterochelin and colicins + Term 65014 - 65058 9.9 56 26 Tu 1 . + CDS 65387 - 66364 1037 ## COG0620 Methionine synthase II (cobalamin-independent) + Term 66389 - 66425 -0.8 + Prom 66433 - 66492 2.4 57 27 Tu 1 . + CDS 66523 - 67632 830 ## COG1858 Cytochrome c peroxidase - Term 67630 - 67676 3.0 58 28 Op 1 . - CDS 67685 - 68170 483 ## COG0526 Thiol-disulfide isomerase and thioredoxins 59 28 Op 2 . 
- CDS 68193 - 69227 875 ## BF2693 hypothetical protein 60 28 Op 3 . - CDS 69239 - 69454 264 ## BF2670 hypothetical protein - Prom 69666 - 69725 4.0 61 29 Op 1 . - CDS 69964 - 70326 414 ## BF2668 hypothetical protein 62 29 Op 2 . - CDS 70349 - 70858 543 ## COG0716 Flavodoxins - Prom 70881 - 70940 2.3 - Term 70903 - 70965 11.1 63 30 Tu 1 . - CDS 70986 - 72161 1427 ## COG0138 AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) - Prom 72345 - 72404 4.9 + Prom 72112 - 72171 3.0 64 31 Op 1 . + CDS 72371 - 73075 633 ## BF2687 hypothetical protein 65 31 Op 2 . + CDS 73065 - 73673 582 ## COG2197 Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain - Term 74092 - 74130 3.0 66 32 Op 1 . - CDS 74273 - 74902 595 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 67 32 Op 2 . - CDS 74918 - 75811 521 ## BF2684 hypothetical protein 68 32 Op 3 . - CDS 75829 - 76989 925 ## COG2148 Sugar transferases involved in lipopolysaccharide synthesis 69 32 Op 4 . - CDS 76990 - 77355 265 ## COG3947 Response regulator containing CheY-like receiver and SARP domains - Prom 77437 - 77496 4.8 - Term 77503 - 77542 4.4 70 33 Tu 1 . - CDS 77558 - 79984 1465 ## BF2681 hypothetical protein - Prom 80026 - 80085 5.6 71 34 Tu 1 . - CDS 80200 - 81246 929 ## COG0252 L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D - Prom 81279 - 81338 4.7 72 35 Op 1 . 
- CDS 81494 - 82285 350 ## PROTEIN SUPPORTED gi|148984260|ref|ZP_01817555.1| ribosomal protein L11 methyltransferase 73 35 Op 2 9/0.000 - CDS 82345 - 82932 537 ## COG0135 Phosphoribosylanthranilate isomerase - Prom 82985 - 83044 3.0 74 35 Op 3 21/0.000 - CDS 83056 - 83868 744 ## COG0134 Indole-3-glycerol phosphate synthase 75 35 Op 4 13/0.000 - CDS 83894 - 84889 974 ## COG0547 Anthranilate phosphoribosyltransferase 76 35 Op 5 35/0.000 - CDS 84936 - 85541 657 ## COG0512 Anthranilate/para-aminobenzoate synthases component II - Prom 85622 - 85681 2.4 - Term 85573 - 85613 -0.7 77 35 Op 6 . - CDS 85685 - 87091 1338 ## COG0147 Anthranilate/para-aminobenzoate synthases component I - Prom 87160 - 87219 2.7 78 36 Tu 1 . - CDS 87226 - 88398 1260 ## COG0133 Tryptophan synthase beta chain - Prom 88431 - 88490 2.2 - Term 88426 - 88474 16.3 79 37 Op 1 . - CDS 88601 - 88789 78 ## BF2649 hypothetical protein - Prom 88940 - 88999 5.4 80 37 Op 2 . - CDS 89007 - 89246 168 ## BF2648 hypothetical protein - Prom 89267 - 89326 4.7 81 38 Tu 1 . + CDS 89303 - 89413 82 ## + Term 89461 - 89500 5.1 + Prom 89519 - 89578 4.1 82 39 Tu 1 . + CDS 89619 - 90764 1265 ## COG1979 Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family + Term 90852 - 90903 0.3 - Term 90894 - 90930 -0.4 83 40 Op 1 11/0.000 - CDS 90957 - 91871 768 ## COG0248 Exopolyphosphatase 84 40 Op 2 . - CDS 91868 - 93934 1869 ## COG0855 Polyphosphate kinase - Prom 93983 - 94042 7.6 + Prom 93946 - 94005 6.9 85 41 Tu 1 . + CDS 94105 - 94671 697 ## BF2644 hypothetical protein + Term 94774 - 94824 -0.8 86 42 Op 1 . - CDS 94804 - 96036 1212 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase 87 42 Op 2 . - CDS 96089 - 96898 595 ## COG0253 Diaminopimelate epimerase - Prom 96996 - 97055 6.0 88 43 Tu 1 . - CDS 97341 - 98216 639 ## BF2664 hypothetical protein - Prom 98246 - 98305 6.3 89 44 Tu 1 . - CDS 98425 - 99738 795 ## BF2663 hypothetical protein - Prom 99842 - 99901 4.5 - Term 99884 - 99928 7.2 90 45 Tu 1 . 
- CDS 99958 - 100719 741 ## COG0584 Glycerophosphoryl diester phosphodiesterase - Prom 100758 - 100817 4.9 91 46 Tu 1 . - CDS 100970 - 102646 1505 ## COG0367 Asparagine synthase (glutamine-hydrolyzing) - Prom 102675 - 102734 5.2 + Prom 102997 - 103056 4.3 92 47 Tu 1 . + CDS 103201 - 103650 259 ## COG0449 Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains + Prom 103668 - 103727 3.3 93 48 Op 1 . + CDS 103771 - 105654 1863 ## COG0034 Glutamine phosphoribosylpyrophosphate amidotransferase 94 48 Op 2 24/0.000 + CDS 105689 - 106765 1074 ## COG0505 Carbamoylphosphate synthase small subunit + Prom 106788 - 106847 4.0 95 48 Op 3 . + CDS 106928 - 110158 3279 ## COG0458 Carbamoylphosphate synthase large subunit (split gene in MJ) + Prom 110199 - 110258 3.9 96 49 Tu 1 . + CDS 110358 - 110654 409 ## COG0526 Thiol-disulfide isomerase and thioredoxins + Term 110685 - 110730 12.1 97 50 Tu 1 . - CDS 111215 - 112102 579 ## COG2207 AraC-type DNA-binding domain-containing proteins - Prom 112171 - 112230 5.0 98 51 Tu 1 . + CDS 112434 - 113594 735 ## BF2632 hypothetical protein + Term 113653 - 113703 6.4 + Prom 113650 - 113709 4.4 99 52 Op 1 . + CDS 113806 - 115062 1113 ## BF2631 outer membrane efflux protein 100 52 Op 2 10/0.000 + CDS 115101 - 115997 715 ## COG0845 Membrane-fusion protein 101 52 Op 3 45/0.000 + CDS 116011 - 117471 344 ## PROTEIN SUPPORTED gi|225088774|ref|YP_002660041.1| ribosomal protein S16 102 52 Op 4 22/0.000 + CDS 117475 - 118575 799 ## COG0842 ABC-type multidrug transport system, permease component 103 52 Op 5 . + CDS 118572 - 119690 783 ## COG0842 ABC-type multidrug transport system, permease component + Prom 119741 - 119800 2.4 104 52 Op 6 . + CDS 119824 - 122892 2129 ## COG0642 Signal transduction histidine kinase + Term 122898 - 122954 14.1 - Term 122888 - 122939 0.1 105 53 Tu 1 . - CDS 122988 - 123416 360 ## COG0071 Molecular chaperone (small heat shock protein) - Prom 123594 - 123653 5.5 106 54 Op 1 . 
- CDS 123660 - 124124 361 ## COG0013 Alanyl-tRNA synthetase 107 54 Op 2 . - CDS 124169 - 124945 298 ## COG1305 Transglutaminase-like enzymes, putative cysteine proteases 108 54 Op 3 . - CDS 124989 - 125609 387 ## COG2949 Uncharacterized membrane protein - Prom 125679 - 125738 9.0 - Term 125676 - 125726 2.0 109 55 Tu 1 . - CDS 125912 - 127945 1712 ## COG0556 Helicase subunit of the DNA excision repair complex - Prom 128149 - 128208 6.7 + Prom 127962 - 128021 6.2 110 56 Op 1 3/0.000 + CDS 128062 - 129360 1073 ## COG1541 Coenzyme F390 synthetase 111 56 Op 2 . + CDS 129388 - 129813 463 ## COG4747 ACT domain-containing protein + Term 129828 - 129896 2.3 + Prom 129908 - 129967 4.4 112 57 Op 1 . + CDS 130001 - 130804 735 ## BF2617 hypothetical protein 113 57 Op 2 . + CDS 130820 - 131155 454 ## BF2616 hypothetical protein + Term 131181 - 131243 8.6 + Prom 131185 - 131244 3.9 114 58 Tu 1 . + CDS 131265 - 131714 386 ## BF2615 hypothetical protein + Term 131727 - 131787 11.1 - Term 131710 - 131777 19.0 115 59 Op 1 . - CDS 131797 - 132504 733 ## BF2614 hypothetical protein 116 59 Op 2 . - CDS 132501 - 134270 1313 ## BF2635 hypothetical protein - Prom 134381 - 134440 3.5 + Prom 134232 - 134291 5.7 117 60 Tu 1 . + CDS 134447 - 137275 2577 ## COG0178 Excinuclease ATPase subunit + Term 137383 - 137423 -0.9 + Prom 137330 - 137389 3.5 118 61 Op 1 . + CDS 137531 - 138010 380 ## COG2606 Uncharacterized conserved protein 119 61 Op 2 . + CDS 138072 - 139730 1643 ## COG3104 Dipeptide/tripeptide permease + Term 139761 - 139821 8.1 - Term 139750 - 139807 7.6 120 62 Tu 1 . - CDS 139842 - 140339 197 ## PROTEIN SUPPORTED gi|229884790|ref|ZP_04504247.1| acetyltransferase, ribosomal protein N-acetylase - Prom 140417 - 140476 5.3 + Prom 140376 - 140435 5.3 121 63 Op 1 . + CDS 140501 - 140836 370 ## COG1695 Predicted transcriptional regulators 122 63 Op 2 . + CDS 140849 - 141928 855 ## BF2629 hypothetical protein + Term 141958 - 142004 7.8 + Prom 141994 - 142053 4.9 123 64 Tu 1 . 
+ CDS 142080 - 143075 729 ## COG2220 Predicted Zn-dependent hydrolases of the beta-lactamase fold + Term 143112 - 143184 25.0 - Term 143112 - 143156 12.4 124 65 Op 1 . - CDS 143176 - 145284 2020 ## COG1506 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 125 65 Op 2 . - CDS 145358 - 146689 1102 ## COG0534 Na+-driven multidrug efflux pump - Term 146704 - 146753 11.1 126 66 Op 1 . - CDS 146778 - 148634 1635 ## COG0706 Preprotein translocase subunit YidC 127 66 Op 2 . - CDS 148683 - 150284 1863 ## COG0504 CTP synthase (UTP-ammonia lyase) - Prom 150323 - 150382 6.2 - Term 150393 - 150424 -0.8 128 67 Tu 1 . - CDS 150475 - 150582 68 ## - Prom 150608 - 150667 6.6 + Prom 150554 - 150613 7.2 129 68 Tu 1 . + CDS 150668 - 152101 1261 ## BF2601 hypothetical protein + Term 152139 - 152177 8.4 130 69 Tu 1 . + CDS 152230 - 152634 408 ## BF2600 hypothetical protein + Term 152726 - 152754 -1.0 + Prom 153230 - 153289 4.0 131 70 Op 1 10/0.000 + CDS 153310 - 154182 857 ## COG2878 Predicted NADH:ubiquinone oxidoreductase, subunit RnfB 132 70 Op 2 12/0.000 + CDS 154219 - 155556 1147 ## COG4656 Predicted NADH:ubiquinone oxidoreductase, subunit RnfC 133 70 Op 3 12/0.000 + CDS 155562 - 156554 973 ## COG4658 Predicted NADH:ubiquinone oxidoreductase, subunit RnfD 134 70 Op 4 13/0.000 + CDS 156581 - 157219 881 ## COG4659 Predicted NADH:ubiquinone oxidoreductase, subunit RnfG 135 70 Op 5 3/0.000 + CDS 157237 - 157824 697 ## COG4660 Predicted NADH:ubiquinone oxidoreductase, subunit RnfE 136 70 Op 6 . + CDS 157838 - 158410 675 ## COG4657 Predicted NADH:ubiquinone oxidoreductase, subunit RnfA + Term 158437 - 158493 9.3 + Prom 158472 - 158531 4.9 137 71 Tu 1 . + CDS 158615 - 159649 1072 ## COG1087 UDP-glucose 4-epimerase + Term 159786 - 159831 3.1 - Term 159923 - 159970 13.6 138 72 Tu 1 . - CDS 159975 - 161234 862 ## BF2612 putative lipoprotein - Prom 161292 - 161351 4.5 139 73 Op 1 . - CDS 161368 - 162612 675 ## BF2590 hypothetical protein 140 73 Op 2 . 
- CDS 162635 - 163459 515 ## COG1947 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase - Prom 163507 - 163566 3.6 + Prom 163492 - 163551 6.7 141 74 Op 1 . + CDS 163637 - 165184 1309 ## COG0305 Replicative DNA helicase 142 74 Op 2 . + CDS 165214 - 165543 404 ## BF2587 hypothetical protein 143 74 Op 3 . + CDS 165548 - 168397 2113 ## COG2605 Predicted kinase related to galactokinase and mevalonate kinase + Term 168409 - 168456 12.2 Predicted protein(s) >gi|226331997|gb|ACIB01000059.1| GENE 1 334 - 924 709 196 aa, chain + ## HITS:1 COG:no KEGG:BF2734 NR:ns ## KEGG: BF2734 # Name: not_defined # Def: ATP synthase subunit E # Organism: B.fragilis # Pathway: Oxidative phosphorylation [PATH:bfr00190]; Metabolic pathways [PATH:bfr01100] # 1 196 1 196 196 322 100.0 4e-87 MENKIQELTDKIYREGVEKGNEEARRLIANAQEEAKKIVEDAHKEAESIIASSRKSADEL TENTKSELKLFAGQAVNALKSEIATMVTDKIVTAPVKEFAQNKDFLNAFIVALASKWSVD EPIIISTSDAESLKKYFAANAKALLDKGVTIEQVNGIKALFSVSPADGSYKVNFGEEEFM NYFKAFLRPQLVEMLF >gi|226331997|gb|ACIB01000059.1| GENE 2 936 - 1775 780 279 aa, chain + ## HITS:1 COG:no KEGG:BF2733 NR:ns ## KEGG: BF2733 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 279 1 279 279 525 99.0 1e-147 MSKYYYLVAGLPELTLEDSKQSYTVADFKTEIYTGLSVSDQRLIDLFYLKFDNANVLKLL KDKEAAIDTRGNFSAEELAEYISTLKEGGEVSAKEFPAYLATFISAYFNTPAENAVLLED HLAALYYDYAMKCGNKFVASWFEFNLTINNILVALTARKYKWDVACNIVGDTEICEALRT SGARDFGLSGEVDFLDQLVKISEITELVEREKKLDALRWNWMEDAIFFDYFTIERIFAFL LKLEMIERWISLDKERGNQLFRSIIESLKNEVQIPAEFR >gi|226331997|gb|ACIB01000059.1| GENE 3 1812 - 3569 1813 585 aa, chain + ## HITS:1 COG:TP0426 KEGG:ns NR:ns ## COG: TP0426 COG1155 # Protein_GI_number: 15639417 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit A # Organism: Treponema pallidum # 3 585 4 576 589 526 46.0 1e-149 MATKGTVSGVIANMVTLVVDGPVAQNEICYISTGGDKLMAEVIKVVGTHVYVQVFESTRG LKVGAEAEFTGHMLEVTLGPGMLSKNYDGLQNDLDKMDGVFLKRGQYTYPLDKGSVWHFV 
PLVSVGDKVEASAWLGQVDENFQPLKIMVPFTQKGVCTVKSIVPEGDYKIEDVVAVLVDE EGNTVEVNMIQKWPVKRAMTNYKEKPRPFKLLETGVRVIDTVNPIVEGGTGFIPGPFGTG KTVLQHAISKQAEADIVIIAACGERANEVVEIFTEFPELVDPHTGRKLMERTIIIANTSN MPVAAREASVYTAMTIAEYYRAMGLKVLLMADSTSRWAQALREMSNRMEELPGPDAFPMD LSSIISNFYGRAGYVKLNNGESGSITFIGTVSPAGGNLKEPVTENTKKVARCFYALEQDR ADKKRYPAVNPIDSYSKYIEYPEFEAYIAEHINDEWIGKVNEIKTRLQRGKEIAEQINIL GDDGVPVEYHVIFWKSELIDFVILQQDAFDEIDAVTPMERQEAILNMVIDICHTEFEFDN FNEVMDYFKKMINICKQMNYSKFKSEQYEGFQKQLEELVAERSVK >gi|226331997|gb|ACIB01000059.1| GENE 4 3599 - 4918 1639 439 aa, chain + ## HITS:1 COG:TP0427 KEGG:ns NR:ns ## COG: TP0427 COG1156 # Protein_GI_number: 15639418 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit B # Organism: Treponema pallidum # 8 437 3 428 430 399 47.0 1e-111 MATKAFQKIYTKITQITKATCSLKATGVGYDELATVNGKLAQVVKIAGDEVTLQVFEGTE GIPTNAEVVFLGKSPTLKVSEQLAGRFFNAFGDPIDGGPEIEGQEVPIGGPSVNPVRRKQ PSELIATGIAGIDLNNTLVSGQKIPFFADPDQPFNQVMANVALRAETDKIILGGMGMTND DYLYFKNVFSNAGALDRIVSFVNTTENPPVERLLVPDMALTAAEYFAVQHNQKVLVLLTD MTSYADALAIVSNRMDQIPSKDSMPGSLYSDLAKIYEKAVQFPDGGSITIIAVTTLSGGD ITHAVPDNTGYITEGQLFLRRDSDIGKVIVDPFRSLSRLKQLVTGKKTRKDHPQVMNAAV RLYADAANAKTKLENGFDLTNYDERTLAFAKDYSNQLLAIDVNLDTTEMLDVAWGLFGKY FRPEEVNIKKELVDQYWKK >gi|226331997|gb|ACIB01000059.1| GENE 5 4931 - 5536 694 201 aa, chain + ## HITS:1 COG:TP0428 KEGG:ns NR:ns ## COG: TP0428 COG1394 # Protein_GI_number: 15639419 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit D # Organism: Treponema pallidum # 1 179 1 180 206 84 30.0 1e-16 MAIKFQYNKTSLQQLEKQLKVRVRTLPIIKNKESALRMEVKRCKSEAADLEERLEKQIQA YEAMFALWNEFDTSLIKVNDVHLGVKKIAGVRVPLLENVDFEIRPYSLFNAPKWYADGIH LLKELAHTAIEREFTLAKLGLLEHARKKTTQKVNLFEKVQIPGYQDALRKIKRFMEDEEN LSKSSQKILKSQQEKRKEAEA >gi|226331997|gb|ACIB01000059.1| GENE 6 5533 - 7350 1760 605 aa, chain + ## HITS:1 COG:BB0091 KEGG:ns NR:ns ## COG: BB0091 COG1269 # Protein_GI_number: 15594437 # Func_class: C Energy production and conversion # 
Function: Archaeal/vacuolar-type H+-ATPase subunit I # Organism: Borrelia burgdorferi # 1 605 1 605 608 180 25.0 8e-45 MITKMKKLTFLVYHKEYEDFLNSLRELGVVHIVEKQQGAAENAELQDNIRLSARLAAALK LLQNQKHEKDAVIAANGGSAERGLQVLDEIDGLQAEHSKLLQQQQTCGKEKDALEAWGNF EPEGIQRLKDAGYVVGFYTCSEGNYKEEWEAEYNAMIICRISSKVFFITVTKDGEEVDLD VEQAKLPSQSLAQLEAQYANTETALEENEKKLVALSETDIPSLKEALKQVQTEIEFSKVV LSTEQTAGDKLMLLEGWAPATSKVEIEAYLNDAHIYYEITDPTPEDNVPIELNNKGFFAW FEPICKLYMLPKYNELDLTPFFAPFFMIFFGLCLGDSGYGLFLFVGATAYRLLAKKLSQS AKSIISLIQILATSTFFCGLLTGTFFGANIYDLPWPFIQRLKSAVFMDNNDMFQLSLILG AVQILFGMVLKAVNQTIQFGFKYAIATIGWIILLLSLAFSALLPKVMPMGSTVHLIIMGI AGVMIFLFNSPGKNIFLNIGLGLWDSYNMATGLLGDVLSYVRLFALGLSGGILAGVFNSL AVGMSPDNVIAGPIVMVLIFVIGHAINIFMNVLGAMVHPMRLTFVEFFKNSGYEGGGKEY KPFRK >gi|226331997|gb|ACIB01000059.1| GENE 7 7395 - 7856 632 153 aa, chain + ## HITS:1 COG:SPy0149 KEGG:ns NR:ns ## COG: SPy0149 COG0636 # Protein_GI_number: 15674359 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K # Organism: Streptococcus pyogenes M1 GAS # 5 148 14 154 159 74 38.0 7e-14 MEMNLFIAYIGIAVMVGLSGIGSAYGVTIAGNAAIGALKKNDSAFGNFLVLTALPGTQGL YGFAGYFMFQTIFGILTPQITAIQASAVLGAGIALGLVALFSAIRQGQVCANGIAAIGQG HNVFSNTLILAVFPELYAIVALAATFLIGSALA >gi|226331997|gb|ACIB01000059.1| GENE 8 7879 - 8022 64 47 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLYAKNISKIAQTSVYNEISEHKKRLKCFRKTLKSFQKTLKRSVENA >gi|226331997|gb|ACIB01000059.1| GENE 9 8086 - 9747 1625 553 aa, chain + ## HITS:1 COG:YLR258w KEGG:ns NR:ns ## COG: YLR258w COG0438 # Protein_GI_number: 6323287 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Saccharomyces cerevisiae # 11 552 10 622 705 216 28.0 1e-55 MVKDLLTPDYIFEASWEVCNKVGGIYTVLSTRANTLQTNFRDKLFFIGPDVWQGKDNPLF IESDNLCAAWKEHALKKDDLSVRVGRWNIPGEPIVILVDFQPFFEKKNEIYTEMWNHFQV DSLHAYGDYDEASMFSYAAGKVVESFYRYNLTENDKVIYQAHEWMTGLGALYLQLAVPEI GTIFTTHATSIGRSIAGNDKPLYDYLFAYNGDQMAQELNMQSKHSIEKQTAHHVDCFTTV 
SEITNNECKELLDKAADVVLMNGFEDDFVPQGSAFTGKRKRARALMLNVANKLLGTHMDD DTLIIGTSGRYEFKNKGIDVFLESLNRLNRDKDLQKNVLAFVNVPGWVGEPREDLQTRLK SKEKFDTPLEVPFITHWLHNMTHDQVLDMLKYLGMGNRPEDKVKVIFVPCYLDGKDGILN KEYYDLILGEDLSVYPSYYEPWGYTPLESVAFRVPTITTDLAGFGLWVNSLKNQHGIDNG VEVLHRSDYNYSEVADGIKDTIALFSGKTDNEVKEIRKQAAAVAEQALWKHFIKYYYEAY DIALRNAMKRQLN >gi|226331997|gb|ACIB01000059.1| GENE 10 9781 - 12345 2605 854 aa, chain + ## HITS:1 COG:PH1512 KEGG:ns NR:ns ## COG: PH1512 COG0058 # Protein_GI_number: 14591294 # Func_class: G Carbohydrate transport and metabolism # Function: Glucan phosphorylase # Organism: Pyrococcus horikoshii # 21 746 17 734 837 618 44.0 1e-176 MKIKVSNVNTPNWKEVTVKSRVPAELEKLSELARNIWWAWNYEATELFRDLDPTLWKEAG QNPVLLLERMSYEKLEALSKDKVILKRMNDVYAKFRDYMDVTPDNKRPSVAYFSMEYGLN HVLKIYSGGLGVLAGDYLKEASDSNVDLCAVGFLYRYGYFTQTLSMDGQQIANYEAQNFG QLPIDRVTDADGKPLVVDVPYMDYYVHANVWRVNVGRVSLYLLDTDNEMNSEFDRPITHQ LYGGDWENRLKQEILLGIGGMLTLKALGIKKDIYHCNEGHAALINVQRICDYVATGLTFN QAIELVRASSLYTVHTPVPAGHDYFDEGLFGKYMGGYPVKMGISWDDLMDLGRNNPGDKG ERFCMSVFACNTSQEVNGVSWLHGKVSQEMFSSIWKGYFPEESHVGYVTNGVHFPTWSAT EWKQLYAKYFNENFLYDQSNPKIWEAIYNVPDEEIWKTRVTMKNKLVDYIRKQFRETWLK NQGDPSRIVSLLDKINPNALLIGFGRRFATYKRAHLLFTDLERLSKIVNNPDYPVQFLFT GKAHPHDGAGQGLIKRIVEISQRPEFLGKIIFLENYDMQLARRLVTGVDIWLNTPTRPLE ASGTSGEKALMNGVLNFSVLDGWWLEGYREGAGWALTEKRTYQNQEHQDQLDAATIYSIL ETEILPLYYARNKKGYSEGWIKSVKNSIAQIAPHYTMKRQLDDYYSKFYCKEAKRFKELS ANDNAKAKEIAAWKEEVVAKWDSIEIVSKEREADIAQGDIESGKEYTITVVVDEKGLNDA VGLELVTTYTTPEGKQHVYSVEPFSVIKKEGDLYTFQAKHSLSNAGSFKVAYRMFPKNQD LPHRQDFCYVRWFV >gi|226331997|gb|ACIB01000059.1| GENE 11 12446 - 13096 581 216 aa, chain + ## HITS:1 COG:no KEGG:BF2725 NR:ns ## KEGG: BF2725 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 216 1 216 216 424 99.0 1e-118 MRRIFIFIAFALCSLWQLRAQADTASFLFDKYEDAQVLLRTGGELKSKMNYSIVVNKFYF IDPQDKQVKELANPGDILLIKIAGRTFYPESNGAGIEMLPTKPVVYVQYKATARKEAPMG AYGTRSETTAVQSYGTITSNGQSYKLEGEKIIVSNRHHVYWVEKDDKMKQFRNFKQLAKI YSKHRAEVEKYIEDNKVNFEDVDQIVKLCAYADSLK 
>gi|226331997|gb|ACIB01000059.1| GENE 12 13238 - 14341 1413 367 aa, chain - ## HITS:1 COG:aq_152 KEGG:ns NR:ns ## COG: aq_152 COG0526 # Protein_GI_number: 15605725 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Aquifex aeolicus # 226 298 34 104 146 80 43.0 5e-15 MKKLTYLAVAAVALGVASCNTNKPGYVITGTVEGAADGDTVFIQERVNRQFNKLDTAIIA NGTFTFEGAQDSVVNRYITYSKDGDGVYVDFFLENGKIKVNLSKDDKSATGTPNNDAYQE IRNKINAIDQKQAAIYQAMGDSTLTDDQKMAKQKEFSELEEAYSQAIKEGVQKNITNPVG IMLFKQSFYENSTEDNDALLKQIPANYQNDETIVKIKDITEKQKATAAGQKFIDFEMLTP DGKPVKLSDYVGKGKVVLVDFWASWCGPCRREMPNIVEAYAKYKGKNFEIVGVSLDQDAD KWKDAIKKLNITWPQMSDLKGWQNEGAQLYAVNSIPHTMLVDADGTILARGLHGEKLQTK LEEVLNK >gi|226331997|gb|ACIB01000059.1| GENE 13 14479 - 15864 1564 461 aa, chain + ## HITS:1 COG:CAC0840 KEGG:ns NR:ns ## COG: CAC0840 COG3842 # Protein_GI_number: 15894127 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport systems, ATPase components # Organism: Clostridium acetobutylicum # 5 352 2 348 352 402 57.0 1e-112 MNMQESKSIIEVNGVSKFFGEKTALDHVTLNVKKGEFVTILGPSGCGKTTLLRLIAGFQT ASEGEIKISGKEITQTPPHKRPVNTVFQKYALFPHLNVYDNIAFGLKLKKMPKQTIEKKV KAALKMVGMTDYEYRDVDSLSGGQQQRVAIARAIVNEPEVLLLDEPLAALDLKMRKDMQM ELKEMHKSLGITFVYVTHDQEEALTLSDTIVVMSEGRIQQIGTPIDIYNEPINSFVADFI GESNILNGVMIHDKLVRFCNTEFECVDEGFGENMPVDVVIRPEDLYIFPVSEAAQLTGVV QSSVFKGVHYEMTVLCNGYEFLVQDYHHFEVGALVGLLVKPFDIHIMKKERVCNTFEGKL IDETHVEFLGCNFECAPVTGIEAGSEVKVEVGFDNVILQDNEEDGALTGEVKFILYKGDH YHLTVLSDWDENVFVDTNDVWDDGDRVGITIPPDGIRVIKN >gi|226331997|gb|ACIB01000059.1| GENE 14 15980 - 16678 552 232 aa, chain + ## HITS:1 COG:CAC0839 KEGG:ns NR:ns ## COG: CAC0839 COG1176 # Protein_GI_number: 15894126 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport system, permease component I # Organism: Clostridium acetobutylicum # 3 232 31 277 277 152 38.0 6e-37 
MVYAFTDDSGHLTLENFAKFFQHHEAINTFVYSIGIAIITTIVCILLGYPAAWILSNAKL NRSKTMVVLFILPMWVNILVRTLATVALFDFFSVPLGEGALIFGMVYNFIPFMIYPIYNT LQKMDHSYIEAAQDLGANPVQVFFKAVLPLSMPGVMSGIMMVFMPTISTFAIAELLTMNN IKLFGTTIQENINNSMWNYGAALSLIMLLLIAATSLFSTDDKDNTNEGGGLW >gi|226331997|gb|ACIB01000059.1| GENE 15 16672 - 17460 756 262 aa, chain + ## HITS:1 COG:CAC0838 KEGG:ns NR:ns ## COG: CAC0838 COG1177 # Protein_GI_number: 15894125 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport system, permease component II # Organism: Clostridium acetobutylicum # 1 257 1 252 260 191 44.0 2e-48 MVKKIFAQTYLWILLLLLYSPIVIIMIYSFTEAKVLGNWTGFSTKLYSSLFTTGTHHSLM NALINTVTIALIAATASTLLGSITAIGIFNLKARSRKAISFVNSIPILNGDIITGISLFL LFVSLGISQGYTTVVLAHITFCTPYVVLSVLPRLKQMNPNIYEAALDLGATPMQALRKVI VPEIRPGMISGFMLALTLSIDDFAVTVFTIGNEGLETLSTYIYADARKGGLTPELRPLST IIFVVVLALLIVINRRAGKEKK >gi|226331997|gb|ACIB01000059.1| GENE 16 17629 - 18963 1366 444 aa, chain + ## HITS:1 COG:lin0800 KEGG:ns NR:ns ## COG: lin0800 COG0687 # Protein_GI_number: 16799874 # Func_class: E Amino acid transport and metabolism # Function: Spermidine/putrescine-binding periplasmic protein # Organism: Listeria innocua # 21 326 28 302 357 181 34.0 3e-45 MNKLLITLCLAASFMLSGCYNSGEPREKVLKIYNWADYIGDGVLEDFQAYYKEQTGEDIR IVYQTFDINEIMLTKIEKGHEDFDVVCPSEYIIERMLKKDLLLPIDTVFPHSPDYMNDVS PYIREQIDKLSQPGRVASHYAVCYMWGTAGILYNKAFVPDADAESWSCLWDRKYAGKILM KDSYRDAYGTAIIYAHARDLAGGKVTVEDLMNDYSPRAMEIAEKYLKAMKPNIAGWEADF GKEMMTKNKAWLNMTWSGDAKWAIEEADAVGVDLDYVVPREGSNIWYDGWVIPKYAGNPE AASYFINFMCRPDIALRNMEASGYVSAVASPEIMEAKTDTTLTYYSDLSYFFGPGADSLQ IDKIQYPDRKVVERCAMIRDFGDKTKDVLEIWSRIKGDNLGVGITILIFVVVGLMSGWMI YKRIQRYKHKKMQRRRSRRRKKQR >gi|226331997|gb|ACIB01000059.1| GENE 17 19179 - 19547 506 122 aa, chain + ## HITS:1 COG:sll0517 KEGG:ns NR:ns ## COG: sll0517 COG0724 # Protein_GI_number: 16332012 # Func_class: R General function prediction only # Function: RNA-binding proteins (RRM domain) # Organism: Synechocystis # 1 102 1 100 101 71 45.0 4e-13 
MNIFIAGISYNLSNADLGELFEEFGEVISAKIVMDRETGRSKGFGFVEMPNDEEGNAAIA ALHEKEIDGKTLAVSVARPREEGPRRNSNYGGGNRGGYGNNRGGGYGGGNRGGGYGGGRD RY >gi|226331997|gb|ACIB01000059.1| GENE 18 19752 - 20987 845 411 aa, chain + ## HITS:1 COG:no KEGG:BF2733 NR:ns ## KEGG: BF2733 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 411 1 411 411 858 99.0 0 MKNIRQFHLLSSILGGVFLFSSCSVVPKAEEKNLSEQWPVVASYQWAGQDSVVVCDLSLL KDTVDLPFSFFLKDFQIIKLDNRDEAMVGENNLCVSENYILVYGSVYELHPCRLFTKKGE FVTNIGAIGQGPGEYRAVYKAEIDEKHNCIYLMPFDNSNAIYVYDLAGKPLRSIPLHQSV SKAVFKVDADKRELTVGALPFTGYPFVAWVQDFEGHLLDSVPAARHLSVLPDYSNEVMYG ANTEVFDLYISTFFELRPDTLYHYIRSESRLKPRFTLNIGDRKRSITTFYELPQAYVGRL MVEEQVGDGMWETKSPSNFIVDKASLRGTFFRVINDFAGGMPDRLWTPWSLRNKQYIRLV EPGVLKAEIESYLSSTDGRKGKNRKKLQELCESIGEEDNSYVIYAKQKGVQ >gi|226331997|gb|ACIB01000059.1| GENE 19 20987 - 21382 255 131 aa, chain + ## HITS:1 COG:no KEGG:BF2732 NR:ns ## KEGG: BF2732 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 131 1 131 131 213 100.0 2e-54 MKKVFGILLLVLFVCTSCGDKSEPSFKVYDEYSKEKKVEKKNNIGKATYIFWINKTEAKK LRGLTFNIKCKVTINEDGSIKILGYEKEQPYTVTKSLKKYLKTFRVDKEALADGRVKVGD QVVFLRYVVLK >gi|226331997|gb|ACIB01000059.1| GENE 20 21497 - 22042 901 181 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|60682204|ref|YP_212348.1| 30S ribosomal protein S16 [Bacteroides fragilis NCTC 9343] # 1 181 1 181 181 351 100 1e-95 MATRIRLQRHGRKSYAFYSIVIADSRAPRDGKFTEKIGTYNPNTNPATVDLNFERALHWV LVGAQPSDTVRNILSREGVYMKKHLLGGVAKGAFGEAEAEAKFEAWKNNKQSGLSALKAK EEEAKKAEAKARLEAEKKVNEVKAKALAEKKAAEEAAKAAAEAPAEEAAPAEEAATEAAA E >gi|226331997|gb|ACIB01000059.1| GENE 21 22195 - 22557 312 120 aa, chain - ## HITS:1 COG:no KEGG:BF2730 NR:ns ## KEGG: BF2730 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 120 1 120 120 230 100.0 1e-59 MIKEKATPHIGLVTDLTTGQIDGKITPGGMVLVTGCNIKIENGNKPVCEAIQLSHQNGEV ICIDPPFEMNEPHILKFKIPDSLPTGEYTLTIKTRFAGKDKRLLTQEQTLVYMLKLIREE 
>gi|226331997|gb|ACIB01000059.1| GENE 22 22562 - 22732 57 56 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLVVFKGLDYGGKSTHKESCPVEKTIDGLIDYWFKTRYLRLYTDFPALRLVLFSHK >gi|226331997|gb|ACIB01000059.1| GENE 23 22872 - 24728 1728 618 aa, chain + ## HITS:1 COG:no KEGG:BF2729 NR:ns ## KEGG: BF2729 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 618 45 662 662 1224 99.0 0 MTHHPGECIEHYMGLHDSLFEAGRYDMLTDIYSGMFLYMPSDPDGNPDTLRRQLLRIMPL YNQVLSKTGAYEAAVQLLDSIRLSGHPFLTGYCAYPLWAFEAQNSLMTDDNRRTEALADS FAVLLPPDDQSVVMLCCHMVSWAYHFSSARPNVACRMQERAVEAYRRGGETQDVGAILAR LGYYYRREGRYVKAVDLSLAAVEWYDKHPGIATDGMIRAYADLAALYSTLALTEKALEIN ARVIRMAAREDSMALCGAYRVRSSFFMDLEQVDSAAFYLGKEREVAQRMGERSLKTWRRD RAKYWLQMCPDSNAAALRDMEAIFADSAGVRPATHSGTRYWLGLALVRDGQEERGLAMME QAHREFAYMDWDEMEAFAAKGLLGIYASRHLGSRMLEFYPRYAALQDSLNEKDKLRYTAA ANVRYDTGRKEQEYRALMAEVELKERTLTYIGIVVVLLFMLLGLVVVYMLQRRRHYRREA HLHRERLSRLISIHQELNGRYESLNNELEKVAHADVIDNVRQKLNPMLLSGDDEIRFRQS FAALYPHYLPVLRCQCPELTRNDELLCMLIRLNQSTDEIALALGISRASVNSGRSRIRKK LGLGKDESLEAYLQNIKK >gi|226331997|gb|ACIB01000059.1| GENE 24 24741 - 26042 1235 433 aa, chain - ## HITS:1 COG:SA2117 KEGG:ns NR:ns ## COG: SA2117 COG1757 # Protein_GI_number: 15927906 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Staphylococcus aureus N315 # 2 430 20 448 459 327 44.0 3e-89 MINEEKIHKSTSGWWALSPLFVFLCLYLVTSILVNDFYKVPITVAFLISSCYAIAITRGL NLEQRVYQFSVGASNKNIMLMVWIFILAGAFAQSAKQMGAIDATVNLTLHILPDNLLLAG IFLASCFISLSIGTSVGTIVALTPVAIGLAEKTGIDLPFMVAVVVGGSFFGDNLSFISDT TIASTKTQGCVMRDKFRVNSMIVVPAALVVLGIYIFQGLSVSAPPLVQEIEWVKVIPYLI VLGTAIAGMNVMLVLIIGIFTSGVIGIATGSLGFFDWFGAMGTGITGMGELIIITLLAGG MLETIRYNGGIDFIIRKLTRHVNGKRGAELSIAALVSIANLCTANNTIAIITTGPIAKDI AKKFQLDRRKTASILDTFSCLIQGIIPYGAQMLIAAGLAQISPLSIIGNLYYPFTMGICA LLAILFRYPRKYS >gi|226331997|gb|ACIB01000059.1| GENE 25 26642 - 28039 1040 465 aa, chain + ## HITS:1 COG:DR0430 KEGG:ns NR:ns ## COG: DR0430 COG1690 # Protein_GI_number: 15805457 # Func_class: S Function 
unknown # Function: Uncharacterized conserved protein # Organism: Deinococcus radiodurans # 6 463 4 464 470 380 47.0 1e-105 MGIRLKDLSKLGYRDNVARSLAVALVGKYCKHETKEQILAALGDILKNPEVYKNSEIWSK LAEHLSPVFIEKRLTPYDLLDEPLGYKTYGSKYIETLAKQQMNLAMRLPITLAGALMPDA HAGYGLPIGGVLATDHAVIPYAVGVDIGCRMNLTLFDAGEDFLKRYSHHIKEALKEFTHF GMDGGLSFAQEHEVLDREEFRMTELLRGLHGKAVRQLGSSGGGNHFVEFGKMSLQAGNVL GVPEGNYVALLSHSGSRGLGAAIAKHYSLLARDLCRLPREAQHFAWLGLDTEEGQEYWLS MNLAGDYARACHERIHLNLSKALGLKPLANVNNHHNFAWKEEIAPGQTAIVHRKGATPAQ KGQPGLIPGSMATPGYLVCGKGISEALCSASHGAGRAMSRQKAKDNFTRSALRKYLSQAG VTLIGGSVEEMPLAYKDIDRVMKTQEELVEVQGTFMPCIVRMNKE >gi|226331997|gb|ACIB01000059.1| GENE 26 28044 - 28664 512 206 aa, chain + ## HITS:1 COG:STM0315 KEGG:ns NR:ns ## COG: STM0315 COG1186 # Protein_GI_number: 16763697 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Protein chain release factor B # Organism: Salmonella typhimurium LT2 # 6 201 4 197 204 165 46.0 7e-41 MEKTYLQITSGRGPVECCRVVALVLEKILREAQKRKLRVEILEKETGPVNRTLLSVVVAL EGAGCDVLADEWEGTVLWIARSPYRIHHRRKNWFVGVQTFLLSESREATEDDIRYETLRA SGPGGQHVNKTESAVRAVHIPSGISVVASDQRSQWQNKKLATERLLVKLTAWNIEQAMIQ AQTNWSNHNSLQRGNPVKIIQEELRF >gi|226331997|gb|ACIB01000059.1| GENE 27 28690 - 29403 548 237 aa, chain + ## HITS:1 COG:no KEGG:BF2709 NR:ns ## KEGG: BF2709 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 237 1 237 237 492 100.0 1e-138 MLSEKYGRTYHYPFSPGTTSDDRINHTYWEDIRQISTLVHTEKLDGENNCLSRYGVFARS HVAPTTSPWTSQLRQRWELLKNDLGDIELFGENLYAVHSIEYKRLETHFYVFAVRCLDKW LSWDEVKFYAALFDLPTVPELCTECVDGLTVASLEQHVVCLAQEPSVFGSCDAQTGLDCT REGVVTRNIGEYATADFAHNVFKYVRKGHVQTGEHWTRHWKRARLVWELKQEKGGNR >gi|226331997|gb|ACIB01000059.1| GENE 28 29400 - 30512 743 370 aa, chain + ## HITS:1 COG:CAC0753_1 KEGG:ns NR:ns ## COG: CAC0753_1 COG0617 # Protein_GI_number: 15894040 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA nucleotidyltransferase/poly(A) polymerase # Organism: Clostridium acetobutylicum # 7 208 8 210 228 121 
32.0 3e-27 MIWKLSERKDWNSLEQQFGWVRDMNQVPQHTVHHAEGSVAVHTRMVLEALLRQPAYLMLP EQEREILWAAALLHDVEKRSTSVDEGNGQVTSKNHAKRGETTVRTLLYRDIPAPFNIREH IASLVRHHGLPIWLMEREDPLKRACEASLRLDTSLLKQLTVADICGRISTDKEVLLEATE FFEMFCREQQCWGKAREFANGTARFHYFHTPRSYIDYVPHDDFKCEVTLLVGLPGMGKDY YIESRCADMPVVSLDAIRCKHKFSPTDKAANGWVAQTAKEQARIYLRKGQDFIWNATNVS RQRRTQLIDLFITYGARVKIVYIEKPYSVWRRQNSTREYEVPETVLDKMLGRLEVPQLTE AHEVVYVVEE >gi|226331997|gb|ACIB01000059.1| GENE 29 30948 - 34109 2282 1053 aa, chain - ## HITS:1 COG:mll3725_2 KEGG:ns NR:ns ## COG: mll3725_2 COG0642 # Protein_GI_number: 13473203 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Mesorhizobium loti # 661 927 55 326 328 177 37.0 1e-43 MTLFTNIKSFDKSFLLKLWLSLILYQLSVCPVSAQKDTMDIKDYILIINTYTESFPWSNR LISTATNFVKDDPKLAVYTEHMNMIMIDNDSILDQFKDSLFDRYGSHRPRMLLLLGNSSL ILKDDLRKMWGDIPMVLCAGKDYTGPEHYYLTKQPIPLSERVPLAELSQSCNLTYLYANL YIHENVEMMFRTLPRMKRFIYVGDERFVNQVNSQEIQEILRTKHPDVHYTFLSSRDIKKT NQLIDSLNFVDPRTTGILFSSWFHKRQFAGNMMLTMILPEIVSTVSPPIFALNMIELNDK ESGMVGGYTYDQNHFNEKLSNMFSEILSGKSPRDLPHYLPTDGTPLINYQVLVRKGLSPD EWPAHTRFLNKPITFWDKYKYFLPGTTVCIALLVWFFLYRIRTLTHLRQIQLKEIEAMAN YKNLIDNMPLLYMQEKLIVNEQGVADDLIYLNVNPHFEKHFFRREDVVGKRASELFPESL PEFLHFIQISLKENRAITFPYYFKKIDRFYDIVLKGAHQENVIDIFCVDSTELHRAQQKL NATNHKLSMALEVADIIPWKWDLLSKTILCDINKPIELSAQGNNVSEEQLAVPDSQYFSK IYKEDRIRVEQAYKDLIEGRLEKVKEEYRVINIRNHTHKIEWVEAQAAVETRDENGIPVT LVGSSQVITGRKKMEMELTSAKDRAEESNRLKSAFLANMSHEIRTPLNAIVGFSGILAST EEEEEKQEYVSIIENNNALLLQLISDILDLSKIEAGTLEFQYSDIELNTELKKLESTLKL KLKSDDVQLEFVPGLPVCPVCTEKNRLSQLIINLVTNAIKFTSRGSIRFGYEHRGKELYF YVTDTGCGIPKDKQESIFGRFVKLNSFAQGTGLGLSICRTLVEHMGGHIGVDSEQGKGST FWFSLPYKAASTSAGTMQKTEIQPISVEKDKLTILIAEDNESNYRLFESILGHDYHLIHA WDGREAVERFKRENPQIILMDINMPVMDGYEATQEIRKYSAKVPIIAVTAFAYTSDEQRV MENGFDGYMPKPINARQLKAQITEIMQKRIILL >gi|226331997|gb|ACIB01000059.1| GENE 30 34458 - 34814 269 118 aa, chain + ## HITS:1 COG:AGc3635 KEGG:ns NR:ns ## COG: AGc3635 COG1733 # Protein_GI_number: 15889290 # Func_class: K Transcription # Function: 
Predicted transcriptional regulators # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 10 112 36 137 147 108 51.0 2e-24 MKNFHPTGTCPIRDVLCRLGDKWSMLVLVTLNANGTMRFGDIHKTIDDISQRMLTVTLRT LEADGLVERKAYAEVPPRVEYCLTEMGHSLIPHVEALVGWALDHMTMIFEHREQQKGL >gi|226331997|gb|ACIB01000059.1| GENE 31 34811 - 35257 339 148 aa, chain + ## HITS:1 COG:CAC3445 KEGG:ns NR:ns ## COG: CAC3445 COG0454 # Protein_GI_number: 15896686 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Clostridium acetobutylicum # 1 145 1 144 147 180 61.0 9e-46 MKVEQVFSDRKRFLDLLLLADEQEDMIDRYLERGDMFALYDEDKLRAVCVVTNEGKGIYE LKNIATCPDSQRKGYGKSLIEYLFHHYSDRCSVMFVGTGDTPHTLLFYQSCGFVPSHRIK NFFTDHYDHPIYENGIRLRDMVYLKKEK >gi|226331997|gb|ACIB01000059.1| GENE 32 35321 - 36385 839 354 aa, chain - ## HITS:1 COG:mll7623 KEGG:ns NR:ns ## COG: mll7623 COG1879 # Protein_GI_number: 13476333 # Func_class: G Carbohydrate transport and metabolism # Function: ABC-type sugar transport system, periplasmic component # Organism: Mesorhizobium loti # 6 348 5 345 345 90 25.0 5e-18 MNKLPERIRIKDIARLANVSVGTVDRVLHGRSGVSEASRKRVEEILKQLDYQPNMYASAL ASNKKYTFACLLPKHLEGEYWTDVQKGIREAVTTYSDFNISANITHYDPYDYNSFVTTSQ AVIEEQPDGVMFAPTVPQYTKGFTDALNELGIPYIYIDSQIKDAPPLAFFGQNSHQSGYF AARMLMLLAVNDREIVIFRKIHEGVIGSNQQESREIGFRQYMQEHHPTCNILELNLHADL NIEDSRMLDDFFCEHPDVKHGITFNSKVYIIGEYLQQRRKSDFSLIGYDLLERNVTCLKE GTVSFLIAQQPELQGFNSIKTLCDHLIFRKEVTCTNYMPIDLLTKENIDYYHSK >gi|226331997|gb|ACIB01000059.1| GENE 33 36639 - 37664 934 341 aa, chain + ## HITS:1 COG:TM0067 KEGG:ns NR:ns ## COG: TM0067 COG0524 # Protein_GI_number: 15642842 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar kinases, ribokinase family # Organism: Thermotoga maritima # 4 341 2 339 339 358 53.0 8e-99 MGKKVVTLGEIMLRLSPPGNTRFVQSDSFDVVYGGGEANVAVSCANYGHDAYFITKLPEH EIGQSAVNALRKYGVKTDYIARGGERVGIYYLETGAAMRPSKVIYDRAHSAIAEAVAADF DFDKIMEGADWFHWSGITPAISDKAAELTRLACEAAKRHGVTVSVDLNFRKKLWTKEKAQ 
SIMKPLMKYVDVCIGNEEDAELCLGFKPDADVEGGHTDAEGYKGIFRQMMDEFGFSYVIS TLRESFSASHNGWKAMIYNGEEFYVSRHYDIDPIIDRVGGGDSFSGGVIHGLLTKRTQGE ALEFAVAASALKHTINGDFNLVSVAEVEALVGGDASGRVQR >gi|226331997|gb|ACIB01000059.1| GENE 34 37711 - 38379 691 222 aa, chain + ## HITS:1 COG:CC1495 KEGG:ns NR:ns ## COG: CC1495 COG0800 # Protein_GI_number: 16125742 # Func_class: G Carbohydrate transport and metabolism # Function: 2-keto-3-deoxy-6-phosphogluconate aldolase # Organism: Caulobacter vibrioides # 5 221 4 221 224 231 49.0 8e-61 MAKFDKIAVLNKIGSTGMVPVFYHEDVEIAKKVVKACYEGGVRAFEFTNRGDFAQEVFAG LVKFAVRECPEMAMGVGSVVDPATAALYIQSGADFVVGPLFNPEIAKICNRRLIAYTPGC GSVSEVGFAQEAGCDLCKIFPGDVYGPNFVKGLMAPMPWSKLMVTGGVEPTRENLTGWFG AGAFCVGMGSKLFPKDKVAAEDWGYVTKKCTEALEYIAEARK >gi|226331997|gb|ACIB01000059.1| GENE 35 38458 - 38982 449 174 aa, chain - ## HITS:1 COG:Cj1165c KEGG:ns NR:ns ## COG: Cj1165c COG3610 # Protein_GI_number: 15792489 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Campylobacter jejuni # 13 169 8 163 164 110 46.0 1e-24 MMNIDFITATVLDGAFAAVAAIGFAIISNPPRKAILISAFLAAVGHGLRYFLMHAHLFTM DIATASFFAAVSIGLLAIPFAKAIHCPAEVFSFPSLLPMIPGMFAYKSILALTKFMQTKD ETDSLRYLVDFCHNGSTTIFVLFALVVGAAVPVFIFHRQSFTATRLLKKLVKKG >gi|226331997|gb|ACIB01000059.1| GENE 36 38979 - 39752 576 257 aa, chain - ## HITS:1 COG:Cj1166c KEGG:ns NR:ns ## COG: Cj1166c COG2966 # Protein_GI_number: 15792490 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Campylobacter jejuni # 6 257 5 257 258 153 36.0 3e-37 MDIHQELKELSKFLSEYSTSLMAVGVQTSRIVRNTSRIAESFGFFCDMTIFQKTIIMTLR DADNSHSYSTVNKIKPMGLNFAINSALSTLSWEAYDEHLSLSELQRRYHEIVSKPRESKW LVLILVAFANASFCRLFQGDFISMGIVFVATLAGFFVRTELMGRHWNHLAIFIISSFIAS MIGSTGYLMHWGDTPDMALGTSVLYLIPGVPLINAIMDIIDGHVLAGTSRFINACLLIIC IAIGLSMTLLITGISTL >gi|226331997|gb|ACIB01000059.1| GENE 37 39762 - 40010 234 82 aa, chain - ## HITS:1 COG:no KEGG:BF2715 NR:ns ## KEGG: BF2715 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # 
Pathway: not_defined # 1 82 1 82 82 113 98.0 2e-24 MLVTNTMEATASTTDSCQVSEPVTATQETPRTIATYTATTPAMHRKSFKEQEPAIFSIIG FAVAYIAALIINRIIKTRKTHH >gi|226331997|gb|ACIB01000059.1| GENE 38 40439 - 41488 928 349 aa, chain - ## HITS:1 COG:TP0053 KEGG:ns NR:ns ## COG: TP0053 COG0208 # Protein_GI_number: 15639047 # Func_class: F Nucleotide transport and metabolism # Function: Ribonucleotide reductase, beta subunit # Organism: Treponema pallidum # 6 349 8 351 351 523 72.0 1e-148 METKKLKKNALFNPEGDTETRLRKMIGGNTTNLNDFNNMRYKWVSDWYRQAMNNFWIPEE INLTQDTKDYPHLTPAERTAYDKILSFLVFLDSLQSNNLPTLSEYITANEVNLCLHIQAF QECVHSQSYSYMLDSICSPEERNDILYQWKTDEHLLRRNTFIGNCYNEFQENRDGFALMK TLIANYILEGIYFYSGFMFFYNLSRNGKMSGSAQEIRYINRDENTHLWLFRNIILELKKE EPELFTPDKVKVYEEMMREGVKQEIAWGQYVIGDQIQGLNRQMISDYIHFLGNLRWSSLG YTPLYEDNRKEPESMHWVSQYSNANMVKTDFFEAKSTAYAKSTALEDDL >gi|226331997|gb|ACIB01000059.1| GENE 39 41514 - 44033 2271 839 aa, chain - ## HITS:1 COG:TP1008 KEGG:ns NR:ns ## COG: TP1008 COG0209 # Protein_GI_number: 15639992 # Func_class: F Nucleotide transport and metabolism # Function: Ribonucleotide reductase, alpha subunit # Organism: Treponema pallidum # 1 839 1 845 845 1119 63.0 0 MEITKRNGTTEPYDREKIAVAIRKSFASTGQEVSNEIIYSVVEEVENFVGKDTANRSVEQ IQDKVEQSLMEHGFYAEAKNYILYRWQRTERRKALNSIINELGDGTVMDVLKEIQKDFTT HEYSLVVLYEKFSGFCKQEMPHSERLAALIKAAVELTTQEAPDWEFIAARLLNFQLSKKL EKQAASAGIRSFYDKLRYLTEEGLYGSYILAAYSQQEIEEAAGFICPDRDKLFNYSGLDL LVKRYLIRTRLHEPAESVQEMYLGIALHLAMPEQKERMTWVKKFYDLLSRLEVTMATPTL SNARKPYHQLSSCFIDTVPDSLEGIYRSIDNFAMVSKFGGGMGMYFGKVRAAGGNIRGFK GVAGGVIRWMKLVNDTAVAVDQLGMRQGAVAVYLDVWHKDLPEFLQLRTNNGDDRMKAHD IFPSVCYPDLFWKMVKEDLNQPWYLFCPNEIMTIKGYCLEDYYGEEWEKRYHDCVNDTRL SKRSISIKDIVRLVLRSAVETGTPFTFNRDTVNRANPNAHRGIIYCSNLCTEIAQNMAAI ETVSTEIRTEEGDTVVVKTVRPGDFVVCNLASLSLGHLPLEDEEQIKEKVSTVVRALDNV IDLNFYPLPFAQITNQRYRSIGLGVSGYHHALAIRNIRWESEEHLRFVDRVFEQINYAAI EASADLAKEKGRYEYFEGSDWQTGAYFGKRGYTSPEWNRLAARVAENGMRNAYLLAIAPT SSTSIIAGTTAGTDPVMKRFFLEEKKGAMLPRVAPALSDKTFWLYKDAYTLDQKWSIRAA 
GTRQLHIDQSQSLNLYITNEFTMRQVLDLYLLAWECGVKTVYYVRSKSLEVEECESCAS >gi|226331997|gb|ACIB01000059.1| GENE 40 44493 - 44819 394 108 aa, chain - ## HITS:1 COG:MA4642 KEGG:ns NR:ns ## COG: MA4642 COG4744 # Protein_GI_number: 20093421 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Methanosarcina acetivorans str.C2A # 5 108 12 115 127 90 50.0 6e-19 MKRNLLRKEEDADPISVVSNLFDVAMVFAVALMVALVSRYNMTEVFSQEDYTMVKNPGKE NMEIITKEGQKINRYTPSEDQQKSGKKGKKVGIAYELDNGEIIYVPEE >gi|226331997|gb|ACIB01000059.1| GENE 41 44816 - 45418 635 200 aa, chain - ## HITS:1 COG:MA4426 KEGG:ns NR:ns ## COG: MA4426 COG0811 # Protein_GI_number: 20093212 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport proteins # Organism: Methanosarcina acetivorans str.C2A # 7 195 10 205 273 130 39.0 1e-30 MNIISDILYWISTGLLVPDIVLLIVLFGRALLLVGSFYGQYLSIRKTEVLLRNELNALTP ATVMELADKLPEKSSSLVISYIRQVLQAHESPAQIQRLLANFEIAADKDLAISKTLTKLG PILGLMGTLIPMGPALAGLASGDIASMAYNMQIAFATTVVGLVAGAVGFLTQQVKQRWYL QDMTNLEFLSELLNEKRAAR >gi|226331997|gb|ACIB01000059.1| GENE 42 45463 - 46146 736 227 aa, chain - ## HITS:1 COG:no KEGG:BF2710 NR:ns ## KEGG: BF2710 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 227 1 227 227 370 100.0 1e-101 MDTVVLVLMLLIAFNFLLKQTFWKTVAVGIIATVAALFAGLMWPYAIEQSKTQIADWLGN TALMLDTSVLLTIEVSLQMAYAMLAVHVASAYPVKPRTLLTYRFLRWFPGLLIFPVLFSG LVYLIFAFPGTPFTTVAWMYAACVLIAIPVGRWLLLYLLPEKELRLELFFLTNALVAILG IVATVNGRTSVAGVSEVNWGALAGVGGITIIGSAIGLVWRRVKKRIN >gi|226331997|gb|ACIB01000059.1| GENE 43 46155 - 50474 4067 1439 aa, chain - ## HITS:1 COG:MA0348 KEGG:ns NR:ns ## COG: MA0348 COG1429 # Protein_GI_number: 20089246 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CobN and related Mg-chelatases # Organism: Methanosarcina acetivorans str.C2A # 902 1384 1248 1733 1845 312 35.0 4e-84 MKKKQIITTCCVAAAILVGVFVWQAYFSATKIAFVNFQTINLGNISKANDNSFVKLREVS 
TDHLDELTGYDMVFVNGMGLRIVEEQRQQIQRAADKGIPVYTSMATNPANNICNLDSVQM SQIRQYLTNAGKVNYRNLLSYVRKEIDGKLISAPVPEAPVEKPTDILYHAGVKNPDDEME FLNVTDYEKFLRENGLYHEGARKVVITGQMADATGLILALEKAGHNVYPISSFTRFMEFV REIRPDAVINMAHGRMGDDMVEYLKERNIPLFAPLTVNSLVEEWENDPMGMSGGFLSQSV VTPEIDGAIRPFALFAQYKDDEGLQHSFAVPERLETFVNTVNNYLTLKTKPNSEKHIAIV YYKGPGQNALTASGMEVGPSLYNLLLRMKKEGYRVENLPESAKELEKMIQAQGAVFGMYA EGAFDEFMKTGNPELVTKEQYESWVKASLRPGKYAEVVAANGEFPGQYMTTPDGRLGIAR LQFGNVVLMPQMAAGSGDNAFQVVHGTNAAPPHTYIASYLWLQHGFKADAMIHFGTHGSL EFTPRKQVALCSDDWPDRLVGALPHLYIYSIGNVGEGMIAKRRSYATLQSYLTPPFLESS VRGIYRDLMEKIKIYNNTTGAKEKQSLAVKALTVKLGIHRELGLDSLPPRPYSEDEVARV ENFAEELATEKITGQLYTMGVPYEPERITSSVLAMTTEPIAYSLLSLDKQRGKATADVEK HRSLFTQRYLNPARALVEKLIANPALATDELICRTAGVSPEELAKAREIETSRNAPKGMM AMMMAAAAKNKTGDKTGKTADKMPEAMKKKMKEMGAHMDSSKAMEMAKKMGADPEALKKM EAKMNASKGDKPEADKAKGMSDMMAAMGKKTAQKEYSKEEINFALALTEVERTIRNVGNY QTELTASPEKELASLVNALNGGYTAPSPGGDPIANPNTLPTGRNMYAINAEATPSESAWE KGVALAKQTIETYQRRHNDSIPRKVSYTLWSGEFIETGGATIAQVLYMLGVEPVRDAFGR VSDLKLIPSAELGRPRIDVVVQTSGQLRDIAASRLFLINRAVEMAAAAKDDKYENLVAAS VVEAEKTLTEKGVSPKDAREMAAFRVFGGANGMYGTGIQEMVESGDRWEDESEIAATYLN NMGAYYGSEKNWEAFRKYAFEAALTRTDVVVQPRQSNTWGALSLDHVYEFMGGMNLAVRN VTGKDPDAYLSDYRNRNHMRMQEVKEAVGVESRTTILNPAYIKEKMKGGSSSAAEFAETV TNTYGWNVMKPAAIDKELWDNIYNVYVKDEYNLNVKEFFETQSPAALEEMTAIMLESARK GLWKASAEQVAELAKLHTETVNKYRPSCSGFVCDNAKLREYIASKSDAPAAAQYKENISK IREAKVSGDAKGMVMKKEEINRTPEEQKTTLSNIAVGAAVVVVILALVLFVRKRRKTSE >gi|226331997|gb|ACIB01000059.1| GENE 44 50559 - 52703 2242 714 aa, chain - ## HITS:1 COG:ECs3047 KEGG:ns NR:ns ## COG: ECs3047 COG4771 # Protein_GI_number: 15832301 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Escherichia coli O157:H7 # 109 689 32 644 659 155 25.0 4e-37 MQRLLKSAYCFFAALLLTTAAAAQHKISGRVIDTTQEPLVGATITLKEKPSVGTTTDTEG RYTLTLPDQKEYTVQVSFVGYVTATKKTTAARQSRLDFILKEDQMNLSTVVITGTRTPKL LKDAPIITRVITAGDLKKVDATHIGQLLQVELPGIEFSYSMDQQVKLNMQGFGGNAVLFL 
VDGERLAGETLDNIDYNRLNLDNVERVEIVKGAASTLYGSSAIGGVINIITKASDDPWNL NLNTRFGVHNDQRHGGTVGFNAGKFYSQTNVQYTNIDSIHVKQGDYTTINGNKTWNVKER LMFTPNEQLRLTARAGYYFRERDASSETKNRYRGFSGGLKGNYDFNTKSNLELAYTFDQY DKSDYLVSYKNDIRDYSNVQHSVRALYNYTFNDKNTLTVGGDYLRDYLMSYQFKENADYT MHSADAFGQFDWNPTEHFNVIAGLRFDYFSESNVRHFSPHLGLMYKIGNCSLRGSYAQGF RSPTLKEMHMNFYMANTMMIYGNPDLEPETSHNFSLSGEYTKNRYNFTLTGYYNLVHDRI EYTSFRDTDGMIAQKYINTPRVDIAGIDANASAKYPCGIGARISYTYIHEFMRDGQTKLS STRPHSATVRLEYGKTWDHYDFNLSLDGRALSQVKTNQYTSNDPNAGTEKVTYPGYTMWN LTLTQRVWKGINVNMAVNNLFNYRPDYYYANSPYTTGTNFSVGLSLDIDQMFRK >gi|226331997|gb|ACIB01000059.1| GENE 45 52733 - 53401 546 222 aa, chain - ## HITS:1 COG:no KEGG:BF2687 NR:ns ## KEGG: BF2687 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 8 222 9 223 223 424 99.0 1e-117 MKINSLLIAITLLGTMLISFSACNGILSSLYDEPETAKDFGFITIDHANHSGTVRVDATQ YTKWNYINLHTLQIDSAKVTAEGADDPDTWDLAIHRYDVKTNGGEVLETDYQSLSALKNA GSMPQGTFVADEWTTNKIAVDVSHMMEDNGYLIYAPSDFNPELSKWLNVDTSEMPPIYTP SNKVYLLRMKDGTMAAIRLVSYMNAAGIKGYMTFDYIYPYEP >gi|226331997|gb|ACIB01000059.1| GENE 46 53444 - 54289 865 281 aa, chain - ## HITS:1 COG:no KEGG:BT_0498 NR:ns ## KEGG: BT_0498 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 7 223 16 233 294 85 30.0 2e-15 MKIKSLLTMMFVLCACAACNDDKNEETPLNQVVAGKYDGYTKAVAQYFPAGQYADKQSIT LTPNDNGTVNIAYTSESFGEFSISNATVELKNDAYLVKGDGKTTMGMNGGTPKEYDCTFE GTISKDKKTSALEFNVPAVMNGLTVTFTQGEAPAAVMAAGSYKGYTKAVAQYFPNGQYAA DQTIKVTANEDGTVNVTYTSTSFGEFTINNATVKSENNAYTINGEGKSVMGMEGKEPKEY ACTLKGTIDAAKATPTFEISVPAVMNGLTITFATGDVPENK >gi|226331997|gb|ACIB01000059.1| GENE 47 54731 - 55075 266 114 aa, chain + ## HITS:1 COG:no KEGG:BF2704 NR:ns ## KEGG: BF2704 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 114 16 129 129 225 100.0 4e-58 MMELISRTRNGMITYCAHCKVYHLEFGNLFFRLSEAGFECLRNYIVSINGPYCERSNRKM NANRKIFLKLPAQNVYFCVCTAELEELKSLVLLQSPAVEETDGLIPFMHDISLN >gi|226331997|gb|ACIB01000059.1| GENE 48 55123 
- 56031 838 302 aa, chain + ## HITS:1 COG:CC0303 KEGG:ns NR:ns ## COG: CC0303 COG1230 # Protein_GI_number: 16124558 # Func_class: P Inorganic ion transport and metabolism # Function: Co/Zn/Cd efflux system component # Organism: Caulobacter vibrioides # 20 258 75 313 361 241 52.0 1e-63 MAHSHEHHHEHVHELTSLNKSFIIGITLNILFVLVEFGIGFYYDSLGLLSDAGHNLGDVA SLVLAMLAFRLAKVHPNSRYTYGYKKSTVLVSLLNAVILLVAVGIIITESIEKLFHPVPV EGAAIAWTAGVGVVINAVTAWLFMKDKEKDLNVKGAYLHMAADTLVSVGVVISGIIIMYT GWTLVDPIIGLVIAVIIVISTWSLLHDSLRLSLDGVPVGIDSEKIGRVILEQPGVESYHH LHIWALSTTETALTAHIVIGNLRRMEEVKREVKHELEHAGITHATLEFEYKGACCEGEDC IS >gi|226331997|gb|ACIB01000059.1| GENE 49 56708 - 58306 1724 532 aa, chain + ## HITS:1 COG:STM3807 KEGG:ns NR:ns ## COG: STM3807 COG2985 # Protein_GI_number: 16767092 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Salmonella typhimurium LT2 # 20 532 18 550 553 269 32.0 1e-71 MFTDLLHSSYFSLFLIVALGFMLGRIKIKGLSLDVSAVIFIALLFGHFGVIIPKELGNFG LVLFIFTIGIQAGPGFFDSFRSKGKTLIIITMLIISSACLTAVGLKYAFGIDTPSVVGLV AGALTSTPGLAVAIDSTNSPLASIAYGIAYPFGVIGVILFVKLLPKIMRIDLDKEARRLE IERRGQFPELGTCIYRITNPSVFGRSLMQINARAMTGAVISRLKHQEEISIPTAHTVLHE GDYIQAVGSEEALTQLAVLVGEREEGELPLENTQEIESLLLTKKDMINKQLGDLNLMKNF GCTVTRVRRSGIDLSPSPDLALKFGDKLMVVGEKEGIKGVARLLGNNAKKLSDTDFFPIA MGIVLGVLFGKLNISFPGGLSFSPGLTGGVLMVALLLSAIGKTGPILWSMSGPANQLLRQ LGLLLFLAEVGTSAGKNLVATFQESGLLLFGVGAAITLVPMLIAAFVGRLVFKISLLDLL GTITGGMTSTPGLAAADSMVDSNIPSVAYATVYPIAMVFLILFIQVIATVVY >gi|226331997|gb|ACIB01000059.1| GENE 50 58476 - 58574 77 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MDAANTSNPSDKKKFIWLVILLIIAIVVAVVW >gi|226331997|gb|ACIB01000059.1| GENE 51 58650 - 59156 381 168 aa, chain + ## HITS:1 COG:PA3136 KEGG:ns NR:ns ## COG: PA3136 COG1566 # Protein_GI_number: 15598332 # Func_class: V Defense mechanisms # Function: Multidrug resistance efflux pump # Organism: Pseudomonas aeruginosa # 34 165 224 350 355 92 37.0 4e-19 MAPLSAVFKQEGDSVKRGELLAVLDSTMTEDYRIVSPVDGLIAKQWVVPGDLLQPGENIF 
TLNEGKKLWVTVYLQETKFDEVRMGQQALFTLDAYPGLTFYGKIFYIGANTASEFSLIPP DNASGNYTKVAQRIPLKISIDRVEGKEKLKANLRLLSGMSANVKIVKE >gi|226331997|gb|ACIB01000059.1| GENE 52 59153 - 60682 1249 509 aa, chain + ## HITS:1 COG:SA2142 KEGG:ns NR:ns ## COG: SA2142 COG0477 # Protein_GI_number: 15927932 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Staphylococcus aureus N315 # 11 508 141 625 643 243 30.0 1e-63 MKESSSYKWIVLGNIMIGTFMAVLDSTVVNTGLPVIMGTLGADINVAEWVLTGYMLSMAS ILPAAGWLSERFGYKKIYFLSLLVFTAGSFMCGSSSTIEELIFWRVIEGFGCGMLLPVGM AIVSDVFPPEQRGTALGFWSIASAASVSFGPAIGGYLVDYMDWNYIFYVNIPIGILALIV TAIVQKEHVKGTGVPFDIPGFITSAIFLPVFMYGLSEVNSSTNSMGWSSPVVLGAMWIAV VTFILFLYFEFTVKTPLINLRLFVDRDFALSNLILFVFGIGMFGSTFLIPLYLQDNLGYS ALQAGMFFMPVGIIQGVASPLSGKLMQRINPKVFIVSGILLMALSFYMNYYLSFLTEKWY IMLSLYLRGLSMGLLFTPLLTLSLVNIRNSDMAQASSITNIVRQMGGSFGVAIFSHLLTQ RTAFHTQRYGEALNYTGDVYRQTLHSLSRFIEHTGGRTPDTAQEVAEMLILKRIDLEGYI SAINDDFFIAFIVTLLCVVPVLFLKTKKG >gi|226331997|gb|ACIB01000059.1| GENE 53 60834 - 62021 1206 395 aa, chain + ## HITS:1 COG:BS_yrkO KEGG:ns NR:ns ## COG: BS_yrkO COG2311 # Protein_GI_number: 16079697 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Bacillus subtilis # 11 391 17 385 405 87 24.0 5e-17 MRQTIASKTPRIEVVDALRGFAVMAILLVHNLEHFIFPVYPDAASQPGWLNILDEGVFSV IFSLFAGKAYAIFALLFGLTFYIQYTNQQKKGKDFGYRFLWRLLLLGGFATLNAAFFPAG DVLLLFCVVGIFLFIVRKWSDRTVFILAIFLLLQPVEWYHYVMNLFNPAHSLPDLGVGQM YGEVAEYTKEGDFWKFIWGNVTLGQKASLFWAIGAGRFLQTAGLFLLGMLIGRKQLFVAS EATIRFWVKALIWSAVLFGPLYQLKVQLMDAGQPDMIRQTVGVVMDMWQKFAFTIVLVAS FVLLYQTEKFRKLTADLRFYGKMSLTNYISQSIAGAIIYFPFALYLAPYCGYTVSLLIGF AFFLLQVRFCKWWLKAHKQGPLESIWHKLTWIRSE >gi|226331997|gb|ACIB01000059.1| GENE 54 62251 - 62595 206 114 aa, chain + ## HITS:1 COG:no KEGG:BF2676 NR:ns ## KEGG: BF2676 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # 
Pathway: not_defined # 1 114 1 114 114 220 100.0 1e-56 MKQGCLKIMRWFLPVLFISYMAGITLFTHSHVVNGVTIVHSHPFKKDSPHSHTTVEFQLI HLLNHVVTTGAGIFFLSLQFIACLLYTLSWRPDRAGYCSSVVGVVSLRAPPAGR >gi|226331997|gb|ACIB01000059.1| GENE 55 62660 - 64969 1977 769 aa, chain + ## HITS:1 COG:STM2199 KEGG:ns NR:ns ## COG: STM2199 COG4771 # Protein_GI_number: 16765529 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor for ferrienterochelin and colicins # Organism: Salmonella typhimurium LT2 # 122 564 33 462 663 88 24.0 4e-17 MKMKKYIFFLVSLCCALLPAMADQPEHPELKASDANIIGHVLDKKTGEHLSYITIALKGT TIGTVTDATGHYFLKNLPEGNFVLEASSVGYKTISRNVSLRKGKTLEENFELEEDAVALD GVVVSANRSVTKRRLAPTLVNVVDMKMFENTNSPTLSQGLNFQPGVRVETNCQNCGFQQV RINGLDGPYTQILIDSRPIFSALSGVYGLEQIPANMIERVEVMRGGGSALFGSSAIAGTI NIITKEPLRNSGQLAHTLTSIGGSSSFDNNTSLNASLVTDDHRAGLYVFGQNRHRDAYDH DGDGYSEMPKLKNQTVGFRSFLKTSTYSKLTFEYHHLQEFRRGGNLLNRPPHEADIAEQI QHSINGGGLKFDYFAPNEKHRLTVYTSAQHTDRDSYYGSKKDQNAYGKTTDLTFIGGSQY VYSFGKCLFMPADLTAGLEYNRDNLKDDMWGYNRYTKQTVNIGSAFLQNEWKNEKWSILL GGRLDKHNLINHVIFSPRANLRFNPSEDINLRLSYSSGFRAPQAFDEDLHIENVGGTVSM IERAKNLKEEKSQSFSASADMYHRFGAFQVNFLVEGFYTRLSDVFVLENIGERDGILIKE RRNGSGAKVLGLSMEGKMAYLSLFQLQAGVTLQQSRYDEPEKWSETAPAEKKIFRTPNTY GYFTATYMPIKPLSLSLSGTYTGSMLVQHMAGYIDKDVAVNTRDFFDMGVKVAYDFKLYK SVDLQLSAGVQNVFNAYQNDFDQGVERDSGYIYGPAAPRSYFAGIKISY >gi|226331997|gb|ACIB01000059.1| GENE 56 65387 - 66364 1037 325 aa, chain + ## HITS:1 COG:CPn0143 KEGG:ns NR:ns ## COG: CPn0143 COG0620 # Protein_GI_number: 15618067 # Func_class: E Amino acid transport and metabolism # Function: Methionine synthase II (cobalamin-independent) # Organism: Chlamydophila pneumoniae CWL029 # 11 321 14 373 374 171 31.0 2e-42 MENVVPPFKVDVAGTFLLPAALREAREQYRNEQISLLTLRAVEDAEIRNLVDRLKAEGLK VVTDGRFRSDAWPLDFMCGLDGIRFRDDRKTSVELTGRIDVHHHPVLDDFVFLTGVTGGD VIAKQVLPAPSRLLAELMKDANRTELDSVYPDREILLVDLAQTYQKLIMELYRSGCRYLQ LDDATRTVTDNAIRVNNMALENLPADLFIAFHSPTEMLFSLQGIHAFFLDYDSECCGKNR LLWFIREKQSVFGFVLSHYPVEEELEELRAKIDQIIRYIPSHRFSLCIPNAEVLPSESYE 
AAEEKQWHTLKMAEMVAGELWPEEG >gi|226331997|gb|ACIB01000059.1| GENE 57 66523 - 67632 830 369 aa, chain + ## HITS:1 COG:MA2908 KEGG:ns NR:ns ## COG: MA2908 COG1858 # Protein_GI_number: 20091729 # Func_class: P Inorganic ion transport and metabolism # Function: Cytochrome c peroxidase # Organism: Methanosarcina acetivorans str.C2A # 28 358 29 367 368 308 48.0 1e-83 MKTHKLLFVILLGTVCISCGHPSHKTGSEKEALGKLLFHDTSLSEPPGQSCATCHASSKG FADEQARAISEGAVQGLFSQRNSMSVCYAAFVPELHYDDDDENYVGGLFWDGRSPSLQDQ AGIPLLNPVEMGNRDKQMVAEKVKRTPYYNRIVQIYGETEHADSLFAHVTDALAAYQASR EINPFTSKYDAYKKGNYQLTEQEARGKELFKDKGQCAECHILDRDKRAHRTLFTDHTYDN LGIPKLPDHPHYKVAAEYFLLAADSVDLGLGAIVNAESENGKFRVPTLRNVELTAPYGHN GYFKTLEEIVHFYNVRDVSDEFPPAEYPATVNKEELGNLGLTQEEEADIVAFMKTLTDGY MKVHKSEKR >gi|226331997|gb|ACIB01000059.1| GENE 58 67685 - 68170 483 161 aa, chain - ## HITS:1 COG:BB0061 KEGG:ns NR:ns ## COG: BB0061 COG0526 # Protein_GI_number: 15594407 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Borrelia burgdorferi # 45 154 3 111 117 125 50.0 3e-29 MKKSILIPFLFMGLLAFASCKGTSENKSTTATDTVQAEKPQSGTIHLTRAEFLKKIADYE NHSKEWKYLGDKPAIVDFYADWCGPCKMVAPILEELSKEYAGKIYIYKVNVDKEPELARD FGIQSIPTIWFVPMKGEPQVNMGALSKEQLKGYIDKVLLKQ >gi|226331997|gb|ACIB01000059.1| GENE 59 68193 - 69227 875 344 aa, chain - ## HITS:1 COG:no KEGG:BF2693 NR:ns ## KEGG: BF2693 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 344 1 344 344 668 99.0 0 MTTTNEQEQSRAYILAGLFTLSLRLVVGWTYFSAFWRRLVLENKLIPDSAGYIGEKFNHF LPNSIGIRPVIEYLVSTPDMLWWAMVIFTLVEGIVGLLYMLGFFTRLMSIGVFSLATGIL LGSGWLGTTCLDEWQIGILGVAAGFTIFLSGGGKYSVDHLIERKFSLKKKAAWLSWLTSG ELPVSAKRFANVSVAGAIVIFTLSLYTNQEFHNGVWGPLHNKSVKPKIEISDAQIENNSL SFSVYRVEGVDVYGSFLIGISLKNADGDIVLEKKGEELADFPIGNIDNKCIARVAPGKHS LVIPLGSKATLTIDDTAIGSLPKGKYELVLTDISGITWKKEIIH >gi|226331997|gb|ACIB01000059.1| GENE 60 69239 - 69454 264 71 aa, chain - ## HITS:1 COG:no 
KEGG:BF2670 NR:ns ## KEGG: BF2670 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 71 1 71 71 127 100.0 1e-28 MIKAFLKKNRLTFIGLVIGAVGGFLYWKYVGCTSGTCPITSSPVNSTLWGAVMGGLLLNL FKTDSTPKKTN >gi|226331997|gb|ACIB01000059.1| GENE 61 69964 - 70326 414 120 aa, chain - ## HITS:1 COG:no KEGG:BF2668 NR:ns ## KEGG: BF2668 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 120 1 120 120 234 100.0 8e-61 MAIERKVPGEIRIFLNHVYEFKKGVRNMVLYTMNKEHEAFAIRRLERQNISYLIQEVNAN KINLFFGKAECMDAIRHIIIRPLNHLTPEEDFILGAMLGYDICQQCKRYCNKKGNIKIAG >gi|226331997|gb|ACIB01000059.1| GENE 62 70349 - 70858 543 169 aa, chain - ## HITS:1 COG:Cj1382c KEGG:ns NR:ns ## COG: Cj1382c COG0716 # Protein_GI_number: 15792705 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Campylobacter jejuni # 4 164 3 159 163 131 47.0 7e-31 MNKIGVFYGSTTGTTEDVAHRIAEKLNVPNGDIHDASKLNDELVKEYDVLVLGTSTWGAG ELQDDWYDGIKVLKKADLSHKFVALFGCGDSDSYSDTFCDGIGILYEELKDTHCTFCGAT DPSGYTFDSSVAVINGKFVGLPLDEVNEDGKTDERIAQWTEALKKECIN >gi|226331997|gb|ACIB01000059.1| GENE 63 70986 - 72161 1427 391 aa, chain - ## HITS:1 COG:CAC2445 KEGG:ns NR:ns ## COG: CAC2445 COG0138 # Protein_GI_number: 15895710 # Func_class: F Nucleotide transport and metabolism # Function: AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) # Organism: Clostridium acetobutylicum # 4 391 5 391 391 534 64.0 1e-152 MTNELELKYGCNPNQKPARIFIKEGELPIEVLNGRPGYINLLDALNSWQLVKELKEATGL PAAASFKHVSPAGAAVAVEMNDTLKKIYFVDDMELSPMATAYARARGADRMSSYGDFIAL SDICDEPTARIINREVSDGVIAPGYTPEALEILRNKRKGTYNVIKIDPAYRPAPIEHKDV FGITFEQGRNELKIDESLLKEMPTRNRIIPADAQRDLIIALITLKYTQSNSVCYAKDGQA IGIGAGQQSRIHCTRLAGNKADIWYLRQHPKVMNLPWKEKIRRADRDNTIDVYISDDYMD VLADGVWEQFFTRKPEVLTREEKRAWLDTQSGVALGSDAFFPFGDNIERAHKSGVSYIAQ PGGSVRDDHVIETCDKYDIAMAFTGIRLFHH >gi|226331997|gb|ACIB01000059.1| GENE 64 72371 - 73075 633 234 aa, chain + ## HITS:1 COG:no KEGG:BF2687 NR:ns ## KEGG: BF2687 # 
Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 234 2 235 235 471 100.0 1e-132 MDKLQKYKPTDKMIDLIGDNYSLLQVMSRFGLSLGFGDKTVKEVCEMNNVDCQTFLVVVN FMAEGFSRMDGSADDISIPALVDYLRQAHIYFLDYCLPAIRRKLIEAIDCSQDDVSFLIL KFFDGYMREVRKHMEYEEKTVFKYVDTLIHGNAPKNYQISTFSKHHDQVGERLTELKNII IKYCPAKANTNVLNAALFDIYACEEGLESHCKVEDYLFVPAILKLERRMRENEK >gi|226331997|gb|ACIB01000059.1| GENE 65 73065 - 73673 582 202 aa, chain + ## HITS:1 COG:BMEI1582 KEGG:ns NR:ns ## COG: BMEI1582 COG2197 # Protein_GI_number: 17987865 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain # Organism: Brucella melitensis # 125 193 140 208 213 65 52.0 9e-11 MKNNESIRIAVAETSVIIRSGLTLALKRLPNLKIQPVELLSVEALNDCLRTQFPDILVVN PTFGDFFDVARFREETAGKGIRVVALVSSFIDASLLSKYDASFSIFDDLEALANKINLLQ NIEPEEEEDSQENLSQREKEIVICVVKGMTNKEIAEKLFLSIHTVITHRRNISKKLQIHS AAGLTIYAIVNKLVELSDVKDL >gi|226331997|gb|ACIB01000059.1| GENE 66 74273 - 74902 595 209 aa, chain - ## HITS:1 COG:alr1276 KEGG:ns NR:ns ## COG: alr1276 COG0110 # Protein_GI_number: 17228771 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Nostoc sp. 
PCC 7120 # 44 191 111 263 275 89 37.0 6e-18 MALKEKIKQNPALKQAVHRFIMHPVKTRPNWWIRIFSFLYLKRGKGSVIYRSVRQDLPPF NLFSLGKYSVVEDFSCLNNAVGDLIIGEYTRIGLGNTIIGPATIGNHVNLAQNVTVTGLN HNYQDTGKRIDEQGVSTQPITIEDDVWVGANSVILPGVTLGKHCVVAAGSVVSRSIPPYS VCAGSPAKVVKQFNPESRTWEKTVSKNGK >gi|226331997|gb|ACIB01000059.1| GENE 67 74918 - 75811 521 297 aa, chain - ## HITS:1 COG:no KEGG:BF2684 NR:ns ## KEGG: BF2684 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 297 1 297 297 595 100.0 1e-169 MDTFIQTTDFILFLCFSLMTVYLGVLAIAASLRNDTPYPQAGKRHRFAILVPPGSTSLPL PHYPEELYQVFTYEDLTEAIAALNENDFDGVVVLGETTRIEPAFLEEINSVFDAGIQAIQ LRHITEKRSTRKQYFQALNEEITQALFGTGATRLGVSSALYGADMVLDLKWLKKNQKSRK SNLERRLVRQGIFVEYLEKVKVYSSDIRAPRYKVRFSKALRALPEAIFTAHWDYCNKILR WILPSRKTNLISIALIATALLCYDWSLSLKWWSLLYILMFIICLAIPDYLVRQKTKK >gi|226331997|gb|ACIB01000059.1| GENE 68 75829 - 76989 925 386 aa, chain - ## HITS:1 COG:all4420 KEGG:ns NR:ns ## COG: all4420 COG2148 # Protein_GI_number: 17231912 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sugar transferases involved in lipopolysaccharide synthesis # Organism: Nostoc sp. 
PCC 7120 # 142 383 250 441 445 139 34.0 8e-33 MLHLIYIGRHADTIEQFSKIAEGAFYAIQNSRKASEFIDKIREKYDIVILFEQRIISKDI PEIQYLRKKFPGVYIALVTEGINKEDRPAYLKAGINNSIPFNSTPETFKDITEFMMRRKQ QKINDIHKKGANLLFFKLPLWKRLFDIVFSSIAILCLSPILIITAVAIRLESKGAVVYKS KRVGSNYKIFDFLKFRSMYTDADKHLRDFNQLNQYQTEEEPELSGEEFLIGDDIELNEEE TMLISDDFIISEQNYTSKKSIEKKNAFVKLENDPRITKVGRIIRKYSIDELPQLVNILKG DMSIVGNRPLPLYEAELLTSDEYIDRFMGPAGLTGLWQVEKRGSSGKLSADERKQLDIKY AQTFSFLLDMEIILRTFTAFIQKENV >gi|226331997|gb|ACIB01000059.1| GENE 69 76990 - 77355 265 121 aa, chain - ## HITS:1 COG:DR2556 KEGG:ns NR:ns ## COG: DR2556 COG3947 # Protein_GI_number: 15807540 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver and SARP domains # Organism: Deinococcus radiodurans # 5 116 4 115 344 75 39.0 3e-14 MRKLILLVDDKETIAKVASIYLGKDYDIQYFPDPIHALEWLHEGKTPDLIISDIRMPLMR GDEFLHYLKCNELFKDIPVIMLSSEESTSERIRLLQEGAVDYILKPFNPMELKIRVKKII E >gi|226331997|gb|ACIB01000059.1| GENE 70 77558 - 79984 1465 808 aa, chain - ## HITS:1 COG:no KEGG:BF2681 NR:ns ## KEGG: BF2681 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 808 1 808 808 1525 99.0 0 MKTSINKLCIAASILVTAAGISACVDDNKTLYDPEYKTPNPMEEISAPTGFDWSSTHSIK FNVEVNDEFDGQYYYTVEIVDKNPLEATTEEPYNTLAKGVARKGETYQTEVVSSKDTKYL YVRQTDPRGRDRIKQVEIDESTSHIQCSFTGTSAIKTRAFATTRGNNGGIDIPKRTEQSY DISRAIPVTSPSQVLQGGQTYIVTGNFSGKFTDTSLSNSNKATVYIQGTWELAQVTQDFL DIIVLKDGKINGKYLMLQNTSTLTIQSGAEVSLSDQLICNTYSTICNFGDLKTKNMKLNT NDILYNGHKTDITNSLDASQGGNIHNFGKLDVENTIKLNTPSIVYNAPECKIEAKTYEAA GSTNVNFGEMEFDTYDSGGAGGSLYNNCMLFVEHMKAGGIVYLDHGVIAEEKEDDEENEL FEEADDIEFYDNAKVTLANGSMIKAKNIIAKSGLSVNGEGNETSLLKATEKVQIQNWDVR FNGRLCITGKISCSNPDMYQAGSEVTFSESPDVIITGCNGKAEVPDPAPEPSDPVFPIIV DDNHNYTYLFEDQWPLYGDYDMNDIVLEVKKRKISIDKHNKVTEFDLSVELRAVGAQKTI AAAIMFDEIPASAVTQAVTYADNYQPVSFELTDKNIEKGQEYAVVPLFDNAHTLMERPTG SFVNTVSGSDNNQKNTKTIHFTLRFDSSVAPSSDALNINNLNIFIITDRGSKRKEIHVAG YRPTQLANTELFGGNNDASSLNGKKYYISKDNLAWGIMVPTQFKWPLEYTQIQKAYSQFA 
GWVTTGGADNKKWWNDFDNTKVFQTNKN >gi|226331997|gb|ACIB01000059.1| GENE 71 80200 - 81246 929 348 aa, chain - ## HITS:1 COG:YPO2161 KEGG:ns NR:ns ## COG: YPO2161 COG0252 # Protein_GI_number: 16122393 # Func_class: E Amino acid transport and metabolism; J Translation, ribosomal structure and biogenesis # Function: L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D # Organism: Yersinia pestis # 7 344 5 335 338 294 48.0 2e-79 MKADYPSVLLIYTGGTIGMIENPETGALENFNFDHLLKHVPELKRFNYRISSYQFDPPID SSDMEPAFWAKLVEIINYNYDSFDGFVILHGTDTMAYTASALSFMLENLSKPVILTGSQL PIGTLRTDGKENLITAIEIAAAKNPDGTAIVPEVCIFFENHLMRGNRTTKINAENFNAFR SFNYPPLARVGIHIKYEPNLIRKPDLSKPLKPHYLFDTNVVILTLFPGIQEGIVSALLHV PGLKAVVLKTFGSGNAPQKEWFIRELKDATDRGIIIVNITQCASGAVEMERYGTGIQLLQ AGVISGYDSTPECAVTKLMFLLGHGLDNKEIRYKMNSCLIGEITKRVE >gi|226331997|gb|ACIB01000059.1| GENE 72 81494 - 82285 350 263 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148984260|ref|ZP_01817555.1| ribosomal protein L11 methyltransferase [Streptococcus pneumoniae SP3-BS71] # 27 240 33 242 258 139 35 8e-32 MNRINQLFDSNPRDLLSIYFCAGYPTLEGTTEVIRTLEKHGVNMIEIGIPFSDPMADGMV IQNAATQALRNGMSLRLLFEQLHDIRRDVKIPLILMGYLNPIMQFGFDNFCRQCAECGID GVIIPDLPFKDYQEHFRTIAERYDVKVIMLITPETSEERVREIDEHTDGFIYMVSSAATT GAQQDFDGQKRAYFKKIEKMNLRNPRMVGFGISNEATFRAACENASGAIIGSRFVTLLHE EKNPEKAITRLKAILNLSSNDLR >gi|226331997|gb|ACIB01000059.1| GENE 73 82345 - 82932 537 195 aa, chain - ## HITS:1 COG:SMc02767 KEGG:ns NR:ns ## COG: SMc02767 COG0135 # Protein_GI_number: 15963780 # Func_class: E Amino acid transport and metabolism # Function: Phosphoribosylanthranilate isomerase # Organism: Sinorhizobium meliloti # 1 195 10 212 215 101 32.0 8e-22 MREAENIREVEQLKVNMIGFIFYPKSPRCLYELPAYMPVKAKRVGVFVNEDKKEIEIFAD RFSLDYIQLHGNESPEYCHSLRATGLRLIKAFSIARRKDFENIGTYEESCDYFLFDTKCE QHGGSGNQFDWSMLNSYKGKKPFLLSGGINPYSPPTLKELRHPQLAGFDLNSRFETKPGL KDVERLRHFLEELRK >gi|226331997|gb|ACIB01000059.1| GENE 74 83056 - 83868 744 270 aa, chain - ## HITS:1 COG:XF0213 KEGG:ns NR:ns ## COG: XF0213 COG0134 # Protein_GI_number: 15836818 # 
Func_class: E Amino acid transport and metabolism # Function: Indole-3-glycerol phosphate synthase # Organism: Xylella fastidiosa 9a5c # 1 255 1 259 264 191 42.0 9e-49 MKDILSEIIANKRFEIDLQKQAIPSEQLQEKLSDEVQPGYSMKQALASSATGIIAEFKRR SPSKGWIYENACPEQVVPDYIAAGASALSILTDEKFFGGSLKDIRTARPLVNIPILRKDF IIDEYQLFQAKIVGADAILLIAAALEADQCHALAAKAHELGLEVLLEIHTAEELPFINKE IDMVGINNRNLGTFFTDIENSFRLAGQLPQDALLVSESGISDPETVKRLRKAGFRGFLIG ETFMKAQQPGQKLKEFINDLNSPQSDTESH >gi|226331997|gb|ACIB01000059.1| GENE 75 83894 - 84889 974 331 aa, chain - ## HITS:1 COG:MJ0234 KEGG:ns NR:ns ## COG: MJ0234 COG0547 # Protein_GI_number: 15668409 # Func_class: E Amino acid transport and metabolism # Function: Anthranilate phosphoribosyltransferase # Organism: Methanococcus jannaschii # 1 329 2 332 336 210 35.0 4e-54 MKQILYKLFEHQYLGRDEARTILQNIAQGKYNDAQVASLITVFLMRNISVEELCGFRDAL LEMRVPVDLSEFAPIDIVGTGGDGKNTFNISTAACFTVAGAGFPVVKHGNYGATSVSGAS NVMEQHGVKFTDHTDRLRRSMEKCNIAYLHAPLFNPALKAVAPIRKALAVRTFFNMLGPL VNPVIPTYQLLGVYNLPLLRLYTYTYQESATRFAVVHSLDGYDEISLTDEFKVATCGNEK IYTPESLGFNRCRESELDGGNTPEDATRIFDAVMEGTATEAQKNVVIVNAAFAIRVICPE KPIEECIALARESLESGKARETLKKFVELNG >gi|226331997|gb|ACIB01000059.1| GENE 76 84936 - 85541 657 201 aa, chain - ## HITS:1 COG:ECs4211 KEGG:ns NR:ns ## COG: ECs4211 COG0512 # Protein_GI_number: 15833465 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Anthranilate/para-aminobenzoate synthases component II # Organism: Escherichia coli O157:H7 # 15 199 2 186 187 181 47.0 1e-45 MNNEINTLNRQKGAILLLDNYDSFTYNLLHVVKEQGVTDIEVFRNDEITLDEVERFDKII LSPGPGIPEEAGLLLPIIRKYAATKSILGVCLGHQAIGEAFGATLENLTEVYHGVQTPVS ILKEDILFSGLGREIPVGRYHSWVVSRKDFPGCLEITAESREGQIMALRHRTYDVHGIQF HPESVLTPQGKEIIKNFLNNR >gi|226331997|gb|ACIB01000059.1| GENE 77 85685 - 87091 1338 468 aa, chain - ## HITS:1 COG:TM0142 KEGG:ns NR:ns ## COG: TM0142 COG0147 # Protein_GI_number: 15642916 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: 
Anthranilate/para-aminobenzoate synthases component I # Organism: Thermotoga maritima # 7 467 4 457 461 278 38.0 1e-74 MNAFNYTTHSKQVLGDLHTPVSIYLKVRDMYPQSALMESSDYHAGENSLSFIALCPLASI GINSGIVTTTYPDNTRREEPLSQSFRVENALNRFINRFHVEGDDKKFCGLYGYTTFNAVK YFEHIPVKESHDEQNDAPDLLYILYKYIIVFNHFKNELTLVEMLAEGEESNLSQLESAIE NRNYASYNFSVTGPVTSTITDEEHKANVRKGIAHCLRGDVFQIVLSRRFIQPYAGDDFKV YRALRSINPSPYLFYFDFGGYRIFGSSPETHCKVESGQAYIDPIAGTTRRTGDTIKDKEL TEALLADPKENAEHVMLVDLARNDLSRNCHDVRVVFYKEPQYYSHVIHLVSRVSGALNNG ANPLKTFIDTFPAGTLSGAPKVRAMQLISEIEPHNRGAYGGCIGFIGLNGELNQAITIRT FVSRNNELWFQAGGGIVARSQDEYELQEVNNKLGALKKAIDLAVKLKN >gi|226331997|gb|ACIB01000059.1| GENE 78 87226 - 88398 1260 390 aa, chain - ## HITS:1 COG:TM0138 KEGG:ns NR:ns ## COG: TM0138 COG0133 # Protein_GI_number: 15642912 # Func_class: E Amino acid transport and metabolism # Function: Tryptophan synthase beta chain # Organism: Thermotoga maritima # 10 389 3 379 389 459 61.0 1e-129 MKSFLVDQDGYYGEFGGAYVPEILHKCVEELQNTYLDVIESEDFKKEFDQLLRDYVGRPS PLYPARRLSEKYGCKMYLKREDLNHTGAHKINNTIGQILLARRMGKKRIIAETGAGQHGV ATATVCALMNMECIVYMGKTDVERQHINVEKMKMLGATVVPVTSGNMTLKDATNEAIRDW CCHPSDTYYIIGSTVGPHPYPDMVARLQSVISEEIKKQLQEKEGRDYPDYLIACVGGGSN AAGTIYHYIDDERVRIVLAEAGGKGIETGMTAATIQLGKMGIIHGARTFVIQNEDGQIEE PYSISAGLDYPGIGPMHANLADKKRAMVLAVNDDEAIRAAYELTRLEGIIPALESAHALG ALEKITFKPEDVVVLTVSGRGDKDIETYLG >gi|226331997|gb|ACIB01000059.1| GENE 79 88601 - 88789 78 62 aa, chain - ## HITS:1 COG:no KEGG:BF2649 NR:ns ## KEGG: BF2649 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 62 1 62 62 81 100.0 9e-15 MWSRIAKRTTTTDGCTKSKNTYNRHFKNNSFLGDKTKNKKEGNKKKKDFICYSKQSIYIC PR >gi|226331997|gb|ACIB01000059.1| GENE 80 89007 - 89246 168 79 aa, chain - ## HITS:1 COG:no KEGG:BF2648 NR:ns ## KEGG: BF2648 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 79 1 79 79 117 94.0 2e-25 MFKKSFLFKRSIILPYISCTYNVHLLGPKIQGPFYAPIESEKKNKKTTITANNQIVTNNN QQKQWTKKSRKVNSKKNTI >gi|226331997|gb|ACIB01000059.1| GENE 
81 89303 - 89413 82 36 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKQRKVLIAIAVVIFILLLLYWLFVAEDINAWLPVN >gi|226331997|gb|ACIB01000059.1| GENE 82 89619 - 90764 1265 381 aa, chain + ## HITS:1 COG:alr4566 KEGG:ns NR:ns ## COG: alr4566 COG1979 # Protein_GI_number: 17232058 # Func_class: C Energy production and conversion # Function: Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family # Organism: Nostoc sp. PCC 7120 # 1 380 1 383 384 422 53.0 1e-118 MDNFIFQNPVKLIMGRGMISRLSEEIPADKRVMITFGGGSVKKNGVYDQVKEALKNHFTV EFWGIEPNPSIETLRKAIALGKEEKVDYLLAVGGGSVIDGTKLISAGLLYDGDAWDLVLA GRPVTHTVPLATVLTLPATGSEMNSRAVISRHETKEKYPFYSNYPLFSILDPEVTFTLPP HQVACGIADTFVHVMEQYMTTPGQSRVMDRWAEGILQTLMEIAPKIRENQHDYQLMSDFM LSATMALNGFIAMGVSQDWATHMIGHELTALHGLTHGHTLVIVFPGTLRVLRKAKGDKIL QYGERVLGITSGSRDERIDEAIRRTEEFFRSLGLTTRLSEEGIGMETIDEIERRFNERGV RYGENEDVTGAVAKEILKSSL >gi|226331997|gb|ACIB01000059.1| GENE 83 90957 - 91871 768 304 aa, chain - ## HITS:1 COG:MA0083 KEGG:ns NR:ns ## COG: MA0083 COG0248 # Protein_GI_number: 20088982 # Func_class: F Nucleotide transport and metabolism; P Inorganic ion transport and metabolism # Function: Exopolyphosphatase # Organism: Methanosarcina acetivorans str.C2A # 7 296 13 317 543 109 29.0 5e-24 MKKVNYAAIDIGSNAVRLLIKSVNEEGSPEGLLRKVQLIRIPLRLGEDAFTTGEISEGKA EKLIRLMKAYKQLMKIFEVSDYRACATSAMRDARNGKEITRKIEKKTGIRVEIIDGQEEA HIVYDNHIEQLFASGQNYLYVDVGGGSTEINLICDSELKSSRSYNIGTVRMLSGMVKNEE KEQMRTDLQALAAEYAPIQIIGSGGNINKLFRLADKKDKKQSFLPIESLKEICETLKALS KEERIKQFKLKPDRADVIVPAAEIFLEVAKQVNATGITVPTIGLSDGIIDSLYTKNMRME TDAK >gi|226331997|gb|ACIB01000059.1| GENE 84 91868 - 93934 1869 688 aa, chain - ## HITS:1 COG:ECs3363 KEGG:ns NR:ns ## COG: ECs3363 COG0855 # Protein_GI_number: 15832617 # Func_class: P Inorganic ion transport and metabolism # Function: Polyphosphate kinase # Organism: Escherichia coli O157:H7 # 8 688 3 685 688 501 38.0 1e-141 MKSDEAKRKKLYVERDISWMYFNQRILLEAARPEVPLLEQLTFLGIYSNNLDEFFRVRVA TLNRIVEYDDKNIREERDTAARTLKQIGKLHNQYYRQFEETFASIMEKLKQENIHVIKDT 
ELTPEQEHFITSLYRNRLNGSTNPLFLTKTSRHTDDQTDEDIYLAIRLLRKDSEGKVKMK DYAVIGLPTAEFGRFIRLPDSEGKTYLMFLDDVIRYCLPMIFIGMNYTDYEAYTFKFTKD AEMEIDSDLRTGVLQKISKGIKSRKRGEPIRFVYDKEMPKDLLRKLTDRLNVDKNDTRVA GGRYHNFKDLMKFPDCGRSDLKYPAWPPVFKQELNGTESILRLIRKQDRSLHYPYQSFDT VIRVLREAAISKEVKSIKMTLYRLAKDSKVVKALICAAQNGKKVTVIIELLARFDEASNI SWSKRMQEAGIKVIFGVEGLKIHSKLVHIGTRFGDLACISTGNFHEGNARMYTDFTIMTA HRSIVREVDRVFDFIEKPYLPVNFKELLVSPNDMRKRLTALINQEIKNKRQGKEAYILAK VNHITDRILIQKLYEASTAGVQTDLVVRGNCSLVTGIPGVSDNIRINGIIDRYLEHPRIF IFANGGEEKYYIGSADWMPRNLDNRIEVLTPVYDKVIQAELKRVVCYGLRDTAKGRIVDG SGENKTWENSDAPFRSQEELYKYYKESE >gi|226331997|gb|ACIB01000059.1| GENE 85 94105 - 94671 697 188 aa, chain + ## HITS:1 COG:no KEGG:BF2644 NR:ns ## KEGG: BF2644 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 188 1 188 188 323 100.0 2e-87 MKSMNYSLIRTICALVIGLVLVLWPDAAINYIVITIGVLFLIPGFIVLIGYFGTKPEPGV SRRFPIEGVGSLLFGLWLVTMPGFFADVLMFLLGFILIMGGVQQIASLSMARRWTPVPGG FYVIPVLILIAGIVALFNPTGARNTAFMIIGVSSLVYAVSELINWFKFARRRPKTPLKGE IEDAEIIE >gi|226331997|gb|ACIB01000059.1| GENE 86 94804 - 96036 1212 410 aa, chain - ## HITS:1 COG:MTH52 KEGG:ns NR:ns ## COG: MTH52 COG0436 # Protein_GI_number: 15678081 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Methanothermobacter thermautotrophicus # 1 406 1 405 410 551 62.0 1e-157 MALVNEHFLKLPGSYLFSDIAKKVNTFKITHPKRDIIRLGIGDVTRPLPKACIEAMHKAV EEMTSAETFRGYGPEQGYDFLIEAIIKNDYAPRGIHLSPTEVFVNDGAKSDTGNIGDILR HDNSVGVTDPIYPVYIDSNVMCGRAGVLDTESGKWSNVTYMPCTAENHFIPAIPEKRIDI VYLCYPNNPTGTTLTKAELKKWVDYALANDTLILFDAAYEAYIREPDIPHSIYEIKGAKK CAIEFRSFSKTAGFTGVRCGYTVVPKELTAATLEGERIPLNRLWNRRQCTKFNGTSYITQ RAAEAIYTPEGKEQIQETINYYMTNARIMKEGLESTGLKVYGGVNAPYLWVKTPKGTSSW RFFDQMLYEANVVGTPGVGFGPSGEGYIRLTAFGERDDCIEAMRRIKNRL >gi|226331997|gb|ACIB01000059.1| GENE 87 96089 - 96898 595 269 aa, chain - ## HITS:1 COG:slr1665 KEGG:ns NR:ns ## COG: slr1665 COG0253 # Protein_GI_number: 16332245 # Func_class: E Amino acid transport 
and metabolism # Function: Diaminopimelate epimerase # Organism: Synechocystis # 4 267 2 279 279 222 44.0 7e-58 MARAIQFTKMHGTGNDYIYVNTLRFPIARPEKAAIEWSAYHTGIGSDGLVLIGHSDKADF SMRIFNADGSEAMMCGNASRCIGKYLYEYGLTSKNVITLDTLSGIKILELHLEGRTVETV TVDMGVPLETGTIDFDGEFPFLSTQVSMGNPHLVTFVDDIRTVNLSEMGPKLEKHPLFPD RTNVEFAQIMGKNTIRMRVWERGSGITQACGTGACATAVAAHLTGRTGRTVNVVMDGGTL TIEWDEATGHISMTGPAVKVFDGTIELRE >gi|226331997|gb|ACIB01000059.1| GENE 88 97341 - 98216 639 291 aa, chain - ## HITS:1 COG:no KEGG:BF2664 NR:ns ## KEGG: BF2664 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 291 1 291 291 580 99.0 1e-164 MKKLFALLFISMISGMIMAQIPEVSGTIVFKKDGQPVVGALVSVLGTDISTITDIDGRFT LKEIPEKAQRIRVKYIGMQPKDVKIRPIMNVTLDLEKKLAFFIQAGVGVSGYNLENNNED IYGGNNHLYIQGGVGLNYNFSTYFSLQPSVNVVAKGCKNMGRRGSDGQGEITVDPVYLEI PVLIAARFIISDFGRLIFNAGPYAAIGIAGKGKYTDEWEGTSMKFDLFSGDEAVLKRFDA GVQAGFAYEIKHIGFSYTFGYGLLKPFKEWNKEWGATPHNISHNFGLKYIF >gi|226331997|gb|ACIB01000059.1| GENE 89 98425 - 99738 795 437 aa, chain - ## HITS:1 COG:no KEGG:BF2663 NR:ns ## KEGG: BF2663 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 437 1 437 437 880 99.0 0 MKKILFLLTILVSVPAFSQSADERIGTFLNQADWFGLEKNYPILKDSMQADFLKLMSEAL IGYYFNRPDEALQNIHKLLVNHQAEIGGQNALNMAILACQIDGLKGNYATAAQNSRSIME QLKQQSAEQGMYKSLEGICYFYDQLKNIPAPGITCPQEDIIIPVDIEKVKLPTSIEPKGW RGTTILIPVTINGKTYQFIFDTGAGTSYMSQRMAKETGVRILSDSLIINSNLPGAMTGNF GTLENMQIGSITFHNSLITIAPPNAFDSIMIVDAVLGMDFIGLFDEVRIYPKDKKIVFPK SSTPLPSYGRNLMKVDRALKLKAQANGETLMLHFDTGNSTAGLFYQYYEKHKTEFESIGK KEKITGGGFNHVVTKDILRLPSFDMEIGDATAHLKNLAVDITPNGIPAEDDGNIGMDMIN QFDCVTINLKDMFLKLE >gi|226331997|gb|ACIB01000059.1| GENE 90 99958 - 100719 741 253 aa, chain - ## HITS:1 COG:SA0220_2 KEGG:ns NR:ns ## COG: SA0220_2 COG0584 # Protein_GI_number: 15925931 # Func_class: C Energy production and conversion # Function: Glycerophosphoryl diester phosphodiesterase # Organism: Staphylococcus aureus N315 # 28 244 1 217 242 60 23.0 2e-09 
MKIKKLLFASALLFSAYNASAQTQVIAHRGFWKTEGSAQNSIAALLKADSIGCYGSEFDV WLAADDQLVVNHDPTFKGKRMENSPSTALTAIKLDNGESLPTLAKYLKAAQPLHTRLILE LKAHSTPLRETKAIEKIVALVKDMGLEERMEYITFSLHATKEFIRLAPEGTPVYYLDGNL SPKELKELGCAGPDYHYTVFRKHPEWIQECHDLGMKVNAWTVNKTDDMKWLIDRKVDFIT TNEPVQLKNLLKK >gi|226331997|gb|ACIB01000059.1| GENE 91 100970 - 102646 1505 558 aa, chain - ## HITS:1 COG:VC0991 KEGG:ns NR:ns ## COG: VC0991 COG0367 # Protein_GI_number: 15641006 # Func_class: E Amino acid transport and metabolism # Function: Asparagine synthase (glutamine-hydrolyzing) # Organism: Vibrio cholerae # 1 558 1 554 554 768 67.0 0 MCGIAGIFNIKIQSRELRNKALRMARKIRHRGPDWSGMYCGGSAILAHERLSIVDPQSGG QPLYSSDRRQVLAVNGEIYNHRDIRAQYAGRYEFRTGSDCEVILALYRDKGIHFLEELNG IFAFALYDEEKDEYLIARDPIGVIPLYIGKDAEGHVYFGSELKALEGFCDEYEPFLPGHY YHSKEGTMKRWYTRDWMEYKEENDKQADSRSPTRQIQDALENAVHRQLMSDVPYGVLLSG GLDSSVISAIAKKYAAKRIETDGASDAWWPQLHSFAIGLKGAPDLIKAREVAEYIGTVHH EINYTLQEGLDAIRDVIYFIETYDVTTVRASTPMYLLARVIKSMGIKMVLSGEGADEIFG GYLYFHKAPDARAFHEETVRKLSKLHLYDCLRANKSLAAWGVEGRVPFLDKEFLDVAMQL DPEIKMAPGKVIEKKVLREAFADMLPSGIAWRQKEQFSDGVGYSWIDTLKEITATAVSDE QMAHAAERFPIHTPMNKEEYYYRSIFEEHFPSESAARSVPSIPSVACSTAEALAWDTAFK NLNDPSGRAVKGVHEEAY >gi|226331997|gb|ACIB01000059.1| GENE 92 103201 - 103650 259 149 aa, chain + ## HITS:1 COG:CAC0158 KEGG:ns NR:ns ## COG: CAC0158 COG0449 # Protein_GI_number: 15893453 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains # Organism: Clostridium acetobutylicum # 1 50 1 49 608 68 64.0 3e-12 MCGIVGYIGKREAYPILIKGLKRLKYRGYDSAGVAIINDNQLLNVYRPQGETFVFASDGI IETPTVILQEFPSQSYIFRNSTYTEVPVYSFSSFRPMHQFCKITASRAVRPETLKCSDSL IIREAFSDTLLQGGKDEATGIREESVTME >gi|226331997|gb|ACIB01000059.1| GENE 93 103771 - 105654 1863 627 aa, chain + ## HITS:1 COG:HI1207 KEGG:ns NR:ns ## COG: HI1207 COG0034 # Protein_GI_number: 16273127 # Func_class: F Nucleotide transport and metabolism # Function: Glutamine phosphoribosylpyrophosphate amidotransferase # 
Organism: Haemophilus influenzae # 36 510 17 420 505 126 25.0 2e-28 MEQLKHECGVAMIRLLKPLEYYEKKYGTWMYGLNKLYLLMEKQHNRGQEGAGLACVKLEA NPGEEYMFRERALGSGAITEIFETVQSNFKDLSKEQLHDAAFAKRTLPFAGEAYMGHLRY STTGKSGISYVHPFLRRNNWRAKNLALCGNFNMTNVDEIFARITAIGQHPRKYADTYIML EQVGHRLDREVERLFNLAEAEGLTGMGVTHYIEDHIDLANVLRTSSKEWDGGYVMCGLTG SGESFALRDPWGIRPAFWYQDDEIAVLASERPVIQTALNVPIGEIRELLPGQALLISKEG KLRTAQINKAREKKACSFERIYFSRGSDADIYKERKQLGEKLVPNILKAIDNDLDHTVFS FIPNTAEVAFYGMLQGLDNYLNEEKVRQIASLGHHPDHDELEVILSRRIRSEKVAIKDIK LRTFIAEGNSRNDLAAHVYDITYGSLRSGIDNLVIIDDSIVRGTTLKQSIIGILDRLGPK KIVIVSSSPQVRYPDYYGIDMAKMSEFIAFKAAIELLKDRDMKDVIASAYRKSKDQVGLP KEQMVNYVKDIYAPFTDEEISEKMVELLTPKGTKAKVQIVYQPLEGLHEACPNHTGDWYF SGNYPTPGGVKLLNEAFINYIEQVYQF >gi|226331997|gb|ACIB01000059.1| GENE 94 105689 - 106765 1074 358 aa, chain + ## HITS:1 COG:YJL130c_1 KEGG:ns NR:ns ## COG: YJL130c_1 COG0505 # Protein_GI_number: 6322331 # Func_class: E Amino acid transport and metabolism; F Nucleotide transport and metabolism # Function: Carbamoylphosphate synthase small subunit # Organism: Saccharomyces cerevisiae # 4 356 22 405 433 366 47.0 1e-101 MKNVTLILDDGSRFHGKSFGYEKPVAGEVVFNTAMTGYPESLTDPSYAGQLMTLTYPLVG NYGVPPFSIEPNGLATFMESERIHAEAIIVSDYSQEYSHWNAVESLADWLKREQVPGITG IDTRELTKVLREHGVMMGRIVFDDEPESAPEAVYSGVNYVDKVSCKEVIRYNEGADKKKV VLVDCGVKTNIIRCLLRRDVEVIRVPWNYDFNHLKFDGLFISNGPGDPDTCDAAVQNIRK AMQNPKLPIFGICMGNQLLSKAGGAKIYKLKYGHRSHNQPVRMVGTERCFITSQNHGYAV DNNTLGTDWEPLFINMNDGSNEGIRHKTNPWFSAQFHPEAASGPTDTEFLFDEFVKLL >gi|226331997|gb|ACIB01000059.1| GENE 95 106928 - 110158 3279 1076 aa, chain + ## HITS:1 COG:YJL130c_2 KEGG:ns NR:ns ## COG: YJL130c_2 COG0458 # Protein_GI_number: 6322331 # Func_class: E Amino acid transport and metabolism; F Nucleotide transport and metabolism # Function: Carbamoylphosphate synthase large subunit (split gene in MJ) # Organism: Saccharomyces cerevisiae # 7 1066 6 1062 1070 1211 57.0 0 MKENNIKKVLLLGSGALKIGEAGEFDYSGSQALKALKEEGIETILINPNIATVQTSEGVA DQIYFLPVTPYFVEKVIRKERPEGIMLAFGGQTALNCGVALYREGILEKYNVKVLGTPVQ 
AIIDTEDRELFVDKLNEIDVKTIKSEAVENAEDARRAARELGYPVIVRAAYALGGLGSGF CDNEEELDLLVEKAFSFSPQVLVEKSLRGWKEVEYEVVRDRFDNCITVCNMENFDPLGIH TGESIVIAPSQTLTNKEYHKLRELAIRIIRHIGIVGECNVQYAFDPESEDYRVIEVNARL SRSSALASKATGYPLAFVAAKLGLGYGLFDLKNSVTKTTSAFFEPALDYVVCKIPRWDLG KFHGVDKELGSSMKSVGEVMAIGRTFEEAIQKGLRMIGQGMHGFVENKELVIPDIDKALH EPTDKRIFVISKAFRAGYTIDQVHELTKIDKWFLQKLMNIMNTSEELHQWGNNHKQIADL PADLLKQAKRQGFSDFQIARAIGYEGDMEDGSLYVRNYRKSLGIVPVVKQIDTLAAEYPA QTNYLYLTYSGTANDVTYLGDHRSIVVLGSGAYRIGSSVEFDWCGVQALNTIRKEGWRSV MINYNPETVSTDYDMCDRLYFDELTFERVMDVLELENPHGVIVSTGGQIPNNLALRLDAQ NIHILGTSAQSIDNAEDRDKFSAMLDRIGVDQPEWRALTSLEDINSFVDKVGFPVLVRPS YVLSGAAMNVCSNQEELERFLQLAANVSKKHPVVVSQFIEHAKEVEMDAVAQNGEIIAYA ISEHIEFAGVHSGDATIQFPPQKLYVETVRRIKRISREIAKALNISGPFNIQYLAKDNDI KVIECNLRASRSFPFVSKVLKINFIELATKVMLGLPVEKPEKNLFELDYVGIKASQFSFN RLQKADPVLGVDMASTGEVGCIGSDTSCAVLKAMLSVGYRIPKKKILLSTGTPKQKVDML EAARMLQKKGYDIFATGGSSKFLTENGVENTRVYWPSEEGHPQALEMLHKKEIDMVVNIP KNLTAGELDNGYKIRRAAIDLNIPLITNARLASAFINAFCTMDIDDIAIKSWEEYK >gi|226331997|gb|ACIB01000059.1| GENE 96 110358 - 110654 409 98 aa, chain + ## HITS:1 COG:VNG6073G KEGG:ns NR:ns ## COG: VNG6073G COG0526 # Protein_GI_number: 16120037 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Halobacterium sp. 
NRC-1 # 1 97 20 117 119 57 33.0 4e-09 MNIAEQLDAVSGTNRLVLVEFYADWSPHYEWLEPVVRTYEKQVSEVIKVNIVEDKAVADS FNIETAPAFILLQRGHELWRQVGELTIDELKLVLEEFK >gi|226331997|gb|ACIB01000059.1| GENE 97 111215 - 112102 579 295 aa, chain - ## HITS:1 COG:PA0248 KEGG:ns NR:ns ## COG: PA0248 COG2207 # Protein_GI_number: 15595445 # Func_class: K Transcription # Function: AraC-type DNA-binding domain-containing proteins # Organism: Pseudomonas aeruginosa # 157 294 151 286 288 72 31.0 1e-12 MEKNKPLELALDLLKHSNIVRNNEYRFVTPDFGIVISFTHMESRLFRLGQPYRLKEGRII RGLRGKARITINLIEYNIYPQTIATFLPGSIIELLEFTPDCDFQMIAADKDFLPLLRGDQ LFESHTRQNLVMQVTDTEWKLSDAYFTLIWGTVQEVPFRREVVQHLLAALLHNIKYIRTE EANDTSRRFSHQEEIFWRFIALVNEFSKNERTVGFYADKLCLPPRYLNTLIRQVSHQTVM EWINQSVILEAKVLLKHSNLLVYQISDELHFPNPSFFCKFFKRMTGMTPQEYQKR >gi|226331997|gb|ACIB01000059.1| GENE 98 112434 - 113594 735 386 aa, chain + ## HITS:1 COG:no KEGG:BF2632 NR:ns ## KEGG: BF2632 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 19 386 19 386 386 773 99.0 0 MKTSYLLLSSALLLGCLSACNTSPRESKVMEANDGSVTTIESNGNKLTVCDLSAVKDTIE VPLSEFVEDCRIVRFETSEEAYFKAWFINATDKHIGIRQGNQDVFKLFDRDGKFLYNVGS VGSGPGEYDTTLYDECIDEKNGHIFFTPFVGKRIMMYDINGQWIKDIPLPMQINKAKIWV NEDGSLSVVHMPFEEGEPLAFRVDTEGNILNQIPATAATKVMNFDGEVFSYRNCGDFDFF HTGIDTLFTYDPAGNKLLPKFTMTFPNMNEKPIHLYYRLPHHFIVTYWGNKGEGGGDVLV DTEKNASSYFRLVNDFYGNLPIPAPGHSFYRGYFIQSLEPGQLIEKIEKQIASGKCSGQD EQKLKELAATLTENDNNVLFIGKVKK >gi|226331997|gb|ACIB01000059.1| GENE 99 113806 - 115062 1113 418 aa, chain + ## HITS:1 COG:no KEGG:BF2631 NR:ns ## KEGG: BF2631 # Name: not_defined # Def: outer membrane efflux protein # Organism: B.fragilis # Pathway: not_defined # 1 418 1 418 418 784 99.0 0 MKRVIFSLSFSLFALLMYGQITLEECQRKTRENYPLVRQYGLIEKTKEYNLANASKGYLP QFTLSGKASWQSEVTELPVQVPGVDIKGLPKDQYQVMLELKQNIWDGGEIRSRKEQVKAS SDVDREKLNVDMYALTERVNQVYFGILLLDEQLRQNQLFLEDLERTHKQISSYIENGIAS QSDLDAVSVEQLNTRQKRIELTFSRQAYLSMLALLTGEEMPAGISLQKPVPEWDIPVIAN NRPELIWFDAQNGRLQVQEEALKTQLMPRFGLFVQGAYGNPGLNMLKNEFSPYYVAGVRL 
SWNFGSLYTLRNDRRVIAANRQLLDSNRDVFLFNTRLQATQQGSAVESVRRQMADDDEII RLRTSIRKASEAKVANGTMTVTDMLRDMTSENLARQTKALHEVQLLMNMYQLKYTTNN >gi|226331997|gb|ACIB01000059.1| GENE 100 115101 - 115997 715 298 aa, chain + ## HITS:1 COG:PA5232 KEGG:ns NR:ns ## COG: PA5232 COG0845 # Protein_GI_number: 15600425 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Pseudomonas aeruginosa # 28 279 41 314 357 87 27.0 2e-17 MKSMKFIGCLYLLALLSACGSRTSDYDATGTFEATEVLVSAEASGKLLYFHVEEGTRLKA GEEVGLIDTLQLYLKKLQLQASMKSVESQRPDVNKQIAATRQQIATARREKRRVENLLKA GAANQKQLDDWEAQIALLERQLTAQMSSLQNSTNSLTEQGSSVAIQVAQVEDQLAKCHVV SPISGTVLAKYAEAGELAAVGKPLFKVADIDQMYLRAYITSEQLSQVKLGNRVTVFSDYG GDERKEYPGVVTWISDRSEFTPKTILTKEERANLVYAVKIAVKNDGLLKIGMYGGVKL >gi|226331997|gb|ACIB01000059.1| GENE 101 116011 - 117471 344 486 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|225088774|ref|YP_002660041.1| ribosomal protein S16 [gamma proteobacterium NOR5-3] # 236 486 3 262 312 137 33 4e-31 MKPAVSVNDVTKRYGEVEALKSATFSVNPGELFGIIGPDGAGKSTLFRILTTLLLADSGT ATVNGLDVVKDYKQIRQQVGYMPGRFSLYQDLTVEENLDFFATVFHTTIRENYDLVKDIY QQIEPFRKRRAGALSGGMKQKLALSCALIHKPEILFLDEPTTGVDPVSRKEFWEMLRHLK DQGITILVSTPIMDEARQCDRIAFINEGEIQGIDVPERILQRFSHILCPPGLERTEVHAG NDFAIEVDQLTKCFGHFTAVDHISFRVNRGEIFGFLGANGAGKTTAMRMLCGLSKPTSGM AWVAGFDVAAHPEEVKKNIGYMSQKFSLYEDLKVWENIRLFAGIYGMQDREIAEKTDALL DRLGFSGERDTLVKSLPLGWKQKLAFSVSIFHNPRIVFLDEPTGGVDPATRRQFWELIYQ AADRGITVFVTTHYMDEAEYCNRVSIMVDGRIEALDTPRGLKAHFHADTMDDVFQQLARK AVRKAD >gi|226331997|gb|ACIB01000059.1| GENE 102 117475 - 118575 799 366 aa, chain + ## HITS:1 COG:CAC3268 KEGG:ns NR:ns ## COG: CAC3268 COG0842 # Protein_GI_number: 15896513 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, permease component # Organism: Clostridium acetobutylicum # 2 365 4 374 378 218 33.0 2e-56 MKQFIAFVKKEFFHIFRDRRTMLILLGMPVVQIILFGFAITTEVKNVRVGVLDPSNDIVT RKIIDRLDASEYFSVKCLLHSPQEMERAFQENEIDMALVFSEQFADRLYTGDARVQVVSD ATDPNMATTQAGYATGVIAAVRQEMLPPGMSVPSVVPNVKLLYNPQMKSAYNFVPGVMGL 
ILMLICAMMTSISIVREKETGTMEILLVSPVKPLFIILAKAVPYFVLSFVNLTTILLLSV YVLDVPVAGSLFWLIMVSLLFIFVSLSLGLLISTVTRTQVAAMLASGLVLMMPTMLLSGM IFPIESMPLVLQLISDILPARWYIQAVRKLMIEGVDISFVWSEVSILALMAVLLITISFK KFKNRL >gi|226331997|gb|ACIB01000059.1| GENE 103 118572 - 119690 783 372 aa, chain + ## HITS:1 COG:SMb21204 KEGG:ns NR:ns ## COG: SMb21204 COG0842 # Protein_GI_number: 16264618 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, permease component # Organism: Sinorhizobium meliloti # 2 359 6 356 370 155 31.0 9e-38 MIKYLIEKEFKQLLRNSFLPRLIFIFPCMIMLLMPWAANLEIKNIRMNIIDNDHSVISRR LVDKITASTYFQTTALPDSYEKGMEAIEAGTADLLLEIPRGFEKDWVNGSAANVLLAVNA VNGTKGGLGSSYLSAIVNDYGEELRTESGTTSFSADGNLPRIDILTQNLFNARLDYKLFM VPALMVMLLTMLCGFLPALNIVGEKEADTIEQINVTPVGKFTFILAKLIPYWLIGFVVLT LCFVLAWTLYGIFPVGHFWVIYCFSIIFVLAVSGLGLVISNHSATMQQAMFVMWFCMLIL ILMSGLFTPIRSMPEWAQWITMLNPLKYFMQVMRMVYLKGSGFIDLLPQLGALLAFALLF NVWAVKSYRKSG >gi|226331997|gb|ACIB01000059.1| GENE 104 119824 - 122892 2129 1022 aa, chain + ## HITS:1 COG:slr2098_3 KEGG:ns NR:ns ## COG: slr2098_3 COG0642 # Protein_GI_number: 16330584 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Synechocystis # 604 861 1 268 280 189 39.0 4e-47 MNKQTSTPSQIHRLQQTIYENLPVGMELYDGDGYLIEINSVGLKMMGVKDKQDMLGINIF ENPNIPVEMKRRLRAGENIRFTVKYDFDLAKAHYPTVLSGISYFEITVSVVLDENKEIDK FLFIAQDVTEAVHTQEILRQSKQKTALAMQAAEVMLWEFDVCTRLFYAENEPLNGYDPTQ ALTIEDYKKQIHPEDWKKAEIILLDMLSGCDCLYEIDFRIRLSDTSEWQYCKLNCTPYER GTDGKVIKYVGFRKNNTELQRRKLLQENILNSIPLPIHIKDVEDDFRYVFCNEESMRMFG THEGETVCSVLDSEQAERMQKTDLEVFTTGKPYFGVERIILKDGRSYDTIVRKNIIYDGT KRLLLNIRWDQKLQNDLKRRAKVLSMSMEVMNAYTWFYEPSKQRVSFGEGFDKIGRNALD INSFEKFAGCVHPDDRQRFVDTMDAVLKQDSGEWDIEYRADLRGNGNYEWWKTRGVLETS ILNDHPYQYVSGMSISIESYKQTELTLLKNKEKLNKLIRQNELVLNNTNSGLAYITTDYV VQWENVSVCSSSLSFEAYKRGECCYKSAHNRTSPCDNCVLSKVLVSRQMEKIKFHLENNR IVEVLATPVFNESEEIDGIVIRVDDITDRERMIEELRQAKLLAEQSDKLKSAFLANMSHE IRTPLNAIVGFSDLLMNSEEQGDKEEYMQIINTNNELLLKLINDILDLSKLESGSVELKY 
EEFDLAEYFDSMASSMKQRVTNPKVQLVAVNPYSVCRVRLDKNRVAQVVTNYVTNAIKYT PQGTIEMGYEVVDTGIRLYVRDTGIGIPEEKKRKVFHRFEKLDEFAQGTGLGLSICKAIT ESMGGSVGFESEYSRGSLFWAVLPCDPEVQMRWESNIAIGQNTGKDDLIRSGNQYSALDR KTVLVVEDTSSNYLLISAMLSKHYNLLHAVNGEQAVAMVKEYKIDLLLMDMKMPVMDGLT ATAEIRKFDTNIPIVALTAHAFESDKVAALKSGCNDYLVKPVDKARLMSVLRKYCHPSTL IL >gi|226331997|gb|ACIB01000059.1| GENE 105 122988 - 123416 360 142 aa, chain - ## HITS:1 COG:TM0374 KEGG:ns NR:ns ## COG: TM0374 COG0071 # Protein_GI_number: 15643142 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone (small heat shock protein) # Organism: Thermotoga maritima # 3 142 12 146 147 70 37.0 1e-12 MMPVRRSQNWLPSIFNDFFDNELMAKANATAPAINVIETDKAYKLELAAPGMTKEDFSVR IDEENNLVISMEKKAENKEEKKDGRYLRREFSYSKFQQTMILPENVDKDHISAQVENGVL NIELPKLSEEEVKKPDRTIEVK >gi|226331997|gb|ACIB01000059.1| GENE 106 123660 - 124124 361 154 aa, chain - ## HITS:1 COG:BS_alaS KEGG:ns NR:ns ## COG: BS_alaS COG0013 # Protein_GI_number: 16079794 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Alanyl-tRNA synthetase # Organism: Bacillus subtilis # 6 138 553 686 878 74 32.0 6e-14 MEQQPQLNDHNKQEYPPMHTAEHLLNATMVKTFGCPRSRNAHIEKKKSKCDYELPTCPTE EQIHAIEEKVNEAIDRHLPVTCEFMTHEEAKSIVDLSKLPENASEILRIVRIGDYDACAC IGQHVENTSEIGLFKIISYDYADGKLRLRFKLIK >gi|226331997|gb|ACIB01000059.1| GENE 107 124169 - 124945 298 258 aa, chain - ## HITS:1 COG:mll0715 KEGG:ns NR:ns ## COG: mll0715 COG1305 # Protein_GI_number: 13470896 # Func_class: E Amino acid transport and metabolism # Function: Transglutaminase-like enzymes, putative cysteine proteases # Organism: Mesorhizobium loti # 157 237 158 237 263 60 33.0 4e-09 MQKLLFLFLLPLIPFSFPCHAQTTYRMVTRNYSPVTDSYPQEVSNFKKADSVYYFTVKVS KAYDDTLKRKVAEKVLYSPNDIYDGEVASYLNPTRLIDYSSPTIELITDSLFKGEDSIMT IIKKGLEFVSHYISFDDSLATAISRGDCKTLDVNHILQRKKGTCSEYTNLFTALMRKKGI PCRFVAGFIFIPEQKFYGCHAWAECYLKQYGWMAVDPQSGKSWLPTTIIQLFAGTDYTGC GLNSFMDLVPKSIEIVKE >gi|226331997|gb|ACIB01000059.1| GENE 108 124989 - 125609 387 206 aa, 
chain - ## HITS:1 COG:DR0187 KEGG:ns NR:ns ## COG: DR0187 COG2949 # Protein_GI_number: 15805223 # Func_class: S Function unknown # Function: Uncharacterized membrane protein # Organism: Deinococcus radiodurans # 35 206 51 222 222 192 55.0 3e-49 MGIPLLLIVLVITIYVCNRTIQKNSETYIYSTVSDIPYNKVGLLLGTSPKLKSGKANLYF DYRIKAATELYNAGKVKYILVSGDNRRNSYNEPEEMKKALIAAGIPDQRIILDYAGLRTL DSVVRAHLIFGLERFTLISQQFHNERAIYLAQQSHLQAIGYNAQDVSAYAGFKTNLRELL ARVKVFVDIVTNKAPKHLGEKVKIPE >gi|226331997|gb|ACIB01000059.1| GENE 109 125912 - 127945 1712 677 aa, chain - ## HITS:1 COG:BS_uvrB KEGG:ns NR:ns ## COG: BS_uvrB COG0556 # Protein_GI_number: 16080570 # Func_class: L Replication, recombination and repair # Function: Helicase subunit of the DNA excision repair complex # Organism: Bacillus subtilis # 3 668 5 658 661 736 56.0 0 MNFELTSAYKPTGDQPEAIAQLTEGVLEGVPAQTLLGVTGSGKTFTIANVIANINKPTLI LSHNKTLAAQLYSEFKGFFPNNAVEYYVSYYDYYQPEAYLPSSDTYIEKDLAINDEIDKL RLAATSALLSGRKDVVVVSSVSCIYGMGNPSDFYNNVIEIERGRTINRNVFLRRLVDSLY MRNDIELNRGNFRVKGDTVDIYLAYSDNLLRVTFWGDEIDGIEEVDPVSGVTIAPFEAYK IYPANLFMTTKEATLRAIHEIEDDLTKQVAYFESIGKEYEAKRLYERVTYDMEMIRELGH CSGIENYSRYFDGRAAGTRPYCLLDFFPDDFLIVIDESHVSVPQIRAMYGGDRARKINLV EYGFRLPAAMDNRPLKFEEFESMAKQVIYVSATPADYELVQSEGIVVEQVIRPTGLLDPV IEVRPSLNQIDDLMEEIQIRIEKEERVLVTTLTKRMAEELTEYLLNNNVRCNYIHSDVDT LERVKIMDDLRQGVYDVLIGVNLLREGLDLPEVSLVAILDADKEGFLRSHRSLTQTAGRA ARNVNGMVIMYADKITDSMRLTIDETNRRREKQLAYNEEHGITPQQIKKARNLSVFGNGA ETEDTQKGTRAYVEPSSPNIAADPVVQYMSKAQLEKSMERTRKLMQEAAKKLEFIEAAQY RDELLKMEDLMKEKWPG >gi|226331997|gb|ACIB01000059.1| GENE 110 128062 - 129360 1073 432 aa, chain + ## HITS:1 COG:MTH1855 KEGG:ns NR:ns ## COG: MTH1855 COG1541 # Protein_GI_number: 15679843 # Func_class: H Coenzyme transport and metabolism # Function: Coenzyme F390 synthetase # Organism: Methanothermobacter thermautotrophicus # 1 432 1 432 433 499 55.0 1e-141 MIWNESIECMERDNLHKIQSIRLKKIVEYVYHNTPFYRKKMQELGITPDDINGIEDISKL PFTTKLDLRDNYPFGLCAVPMSQIVRIHASSGTTGKPTVVGYTRKDLSTWAECLSRAFTA 
YGAGRSDIFQVSYGYGLFTGGLGAHAGAENIGASVIPMSSGNTEKQITLMHDFGSTVLCC TPSYALYLADAINDSGFPREEFKLKAGAFGAEPWTESMRKDIETKLGIKAYDIYGLSEIA GPGVGYECECQNGTHLNEDHFFPEIIDPHTLQPVEPGQTGELVFTHLTKEGMPLLRYRTK DLTALHYEKCSCGRTLVRMDRILGRSDDMLIIRGVNVFPTQIESVILEMAEFEPHYLLTI DRKNNTDTMELKVEVRPDYYSDEINKMLALKKKLTGRLQSVLGLGVDVKLVEPRSIERSV GKAKRVIDNRKL >gi|226331997|gb|ACIB01000059.1| GENE 111 129388 - 129813 463 141 aa, chain + ## HITS:1 COG:MTH1854 KEGG:ns NR:ns ## COG: MTH1854 COG4747 # Protein_GI_number: 15679842 # Func_class: R General function prediction only # Function: ACT domain-containing protein # Organism: Methanothermobacter thermautotrophicus # 1 141 1 143 143 110 39.0 1e-24 MVAKQLSIFLENKSGRLTEVTEVLAKEGINLSALCIAENADFGILRGIVSDPDKAYKALK DNHFAVNVTEVVGINCPNVPGALAKVLQYLSNEGVFIEYMYSFANNNSANVIIRPNDMEN CIRVLTEKKVDLLAASDLYKL >gi|226331997|gb|ACIB01000059.1| GENE 112 130001 - 130804 735 267 aa, chain + ## HITS:1 COG:no KEGG:BF2617 NR:ns ## KEGG: BF2617 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 267 1 267 267 452 100.0 1e-126 MKKNIIITLLAATVLSSCGEYNKLLKSTDYEYKYEAAKNYFAKGQYNRSATLLNELITIL KGGDKAEESLYMLAMSYYNQKDYSTAAQSFITYFNTYPRGQFSELARFHAGKALFLDTPE PRLDQSSTYQAIQQLQMFLEYYPQSSRKQEAQNMIFALQDKLVLKELYSARLYYNLGNYM GNNYLSCVITAQNALKDYPYTDYREDLSILILRAKYEMAVNSVEDKKMDRYRETVDEYYA FKNEFPESKYLKEAERIFKDSQKVIKD >gi|226331997|gb|ACIB01000059.1| GENE 113 130820 - 131155 454 111 aa, chain + ## HITS:1 COG:no KEGG:BF2616 NR:ns ## KEGG: BF2616 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 111 1 111 111 186 100.0 2e-46 MDYKKTNAPTNTITRDMMDLCADTGNVYETVAIIGKRANQISVEIKNDLSKKLAEFASYN DNLEEVFENREQIEISRYYEKLPKPNLIAAQEYVEGKIYYRNPAKEKEKLQ >gi|226331997|gb|ACIB01000059.1| GENE 114 131265 - 131714 386 149 aa, chain + ## HITS:1 COG:no KEGG:BF2615 NR:ns ## KEGG: BF2615 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 149 1 149 149 233 100.0 2e-60 
MIQRIQTIYLLLVTALLITSMCLPVGSFIGADAAMYVFKPLGVEMNGTLYSTWGVFGILL LSAIIAFATIFLFKNRMLQIRMTIFNSILLIGYYLTFLAFMFVLKKDLSATFQISWALCL PLISIILNWLAVRAIGRDEVMVKAADRLR >gi|226331997|gb|ACIB01000059.1| GENE 115 131797 - 132504 733 235 aa, chain - ## HITS:1 COG:no KEGG:BF2614 NR:ns ## KEGG: BF2614 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 235 1 235 235 416 100.0 1e-115 MKRDQLKSLLLGAILLAGITLPAKAQIGEQRHNFAIGINGGANYSTVSFQPTIKQNGLLG ITGGVTARYISEKYFAMICGAQLELNFSQRGWDEKFDPEQGFSSEDSYVRTMNYLEIPFL AHLAFGKDKGVQFFVNLGPQIGLLLNESEKRKGTWETANRAPNEQYGKWVENKFDYGIVG GGGIEVRTKAGNFLLEGRYYFGLADFYNSTKKDYFSRSANSTITAKITYLFDIKK >gi|226331997|gb|ACIB01000059.1| GENE 116 132501 - 134270 1313 589 aa, chain - ## HITS:1 COG:no KEGG:BF2635 NR:ns ## KEGG: BF2635 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 589 1 589 589 1162 99.0 0 MKSIKRIFFLLSFISVSYLSTFAQENQSYFLHTIEKGQSLYSISSMYGVSKADIIRLNPG CEDKIYAGQAIKIPQNKTAQKGETFHTIQPGETLYRLTTTYKVSAKAICDANPGLSADNF RIGQVIRIPSAAEAIDSTVEAVVAAPSEPAMQPAVKPRCKDMHKVKRRETIFSVSREYGI SEQELIAANPELKNGMKKGQFLCIPYPSEKPTVTVPKTDANIIPPSDSELFRENKEVPKS ISTIKAALLLPFDDKRMVEYYEGFLMAVDSLKRTGTSIDLYVYDCNKESSSLNSILAKSE MKNMNVIFGPAQQQHIKPLAAFAKKNDIRLVIPFSSKEGEVFNNPFIYQINTPQSYLYSE VYEHFTRQFPNANVILLESAVVDKDKVEFIKGLKQELGSKGIPVKTLKENAPVETLKAAL HNDKENFFIPTSGNDLTLLRIIPQLTLLVRDNPEARIHLFGYPEWQTYTKDHLESFFELD TYFYSSFYTNNLLPAAINFTQAYRKWYSKEMEERYPKYGMLGFDTGYFFLKGLSKYGSEL EKNLPQMDLTPIQTGFKFQRVNNWGGFVNKKVFFVHFTKNFELIKLDFE >gi|226331997|gb|ACIB01000059.1| GENE 117 134447 - 137275 2577 942 aa, chain + ## HITS:1 COG:BH3594 KEGG:ns NR:ns ## COG: BH3594 COG0178 # Protein_GI_number: 15616156 # Func_class: L Replication, recombination and repair # Function: Excinuclease ATPase subunit # Organism: Bacillus halodurans # 7 940 6 939 957 1014 55.0 0 MQETEYINVYGARVHNLKDIDAEIPRNSLTVITGLSGSGKSSLAFDTIFAEGQRRYIETF SAYARNFLGNLERPDVDKITGLSPVISIEQKTTNKNPRSTVGTTTEIYDYLRLLYARAGI 
AYSYLSGEKMVKYTEEQILELILKDYKGKKIYMLAPLVRSRKGHYKELFEQIRKKGYLYV RIDGEVREITHGLKLDRYKNHDIEVVVDKLIVEDKDDKRLKQSVATAMRQGDGLLMILDA QTESIRHYSKRLMCPVTGLSYREPAPHNFSFNSPQGACPRCKGLGVVSQIDVDKVIPNKE MSIYEGAIAPLGKYKNAMIFWQIGALLEKYDASLKTPVKDLPNDAIDEILYGSDERIKIK SSLIGTSSDYFVTYEGVVKYIQMLQEKDASATAQKWAEQFAKTTVCPECKGAKLNKEALH FRIHDKNINELSNMDINELYDWLMKVDKFLTDKQKTIAAEILKEIRTRLKFLLDVGLDYL SLNRSSVSLSGGESQRIRLATQIGSQLVNVLYILDEPSIGLHQRDNLRLINSLKELRDMG NSVIVVEHDKDMMLAADYVIDMGPKAGRLGGEVVFAGTPEEMLKTSTLTSQYLNGQTAIE VPSKRRPGNGKSLWLRGAKGNNLKNVDVEFPLGKLICVTGVSGSGKSTLINETLQPILSQ KFYRSLQDPLEYDSIEGLENIDKVVNVDQSPLGRTPRSNPATYTGVFSDIRSLFVGLPEA KIRGYKPGRFSFNVSGGRCEACSGNGYKTIEMNFLPDVYVPCEVCHGKRYNRETLEVRFK GKSIADVLDMTINRAVEFFENVPQILNKIKTIQDVGLGYIKLGQSSTTLSGGESQRVKLA TELSKRDTGKTLYILDEPTTGLHFEDIRVLMGVLNKLVDKGNTVIVIEHNLDVIKMADYI IDMGPEGGKGGGELLSFGTPEEVAKSNKGYTPKFLREELKIN >gi|226331997|gb|ACIB01000059.1| GENE 118 137531 - 138010 380 159 aa, chain + ## HITS:1 COG:lin0783 KEGG:ns NR:ns ## COG: lin0783 COG2606 # Protein_GI_number: 16799857 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Listeria innocua # 5 156 4 155 158 162 53.0 3e-40 MKINKTNAARLLDKAKIAYELIPYEVDESDLSAVHVAASLGENINQVFKTLVLHGDKTGY FVCVIPGDQEVNLKLAAKVSGNKSCDMIPMKELLSVTGYIRGACSPIGMKKHFPTYIHET CLGFPYIYVSAGQRGLQIKIDPKELINEVRAEVCVLYTV >gi|226331997|gb|ACIB01000059.1| GENE 119 138072 - 139730 1643 552 aa, chain + ## HITS:1 COG:yjdL KEGG:ns NR:ns ## COG: yjdL COG3104 # Protein_GI_number: 16131956 # Func_class: E Amino acid transport and metabolism # Function: Dipeptide/tripeptide permease # Organism: Escherichia coli K12 # 5 515 6 428 485 105 24.0 2e-22 MFEGQPKGLYALALANTGERFGYYTMLAIFTLFLQAKFGYTAAETSTIFGCFLAAVYFIP FFGGILADKFGYGKMVTMGIVVMFIGYALLAIPTSDNSGKIMMFGALALIACGTGLFKGN LQVMVGNLYDAPEYSSKRDTAFSIFYMAINIGAMYAPTAATKVTNYMLGKAGLSYVPQIP SLAHQYLDGTITPANQATLESLQAAQNFTGDIATFCTTYIDKLSEAYNYGFGVACVSLII SMLIYVAFRSTFKHADYNSKQAKPANVVEEKLTPEQTKQRIVALLLVFAVVIFFWMAFHQ NGLTLTFFARDYTAQEVTGLDRLGFNIVNLTFLLIVIYGLFSLFQGKTGKSKTIAGVVVM 
VALLCLGWSYTSMDPTINILPQIFQQFNPFFVIALTPVSLAVFGYLARKGKEPSAPRKIG IGMMIAVCGFLIMAIGSIGLPTPDALAASGIEKDALVSPNWLISTYLVLTFAELFLSPMG ISFVSKVAPPKYKGMMMGGWFAATAIGNYLVAIIGYLWGGMQLWMVWSVLIVCCLLAALF IFSIMKKLEKVA >gi|226331997|gb|ACIB01000059.1| GENE 120 139842 - 140339 197 165 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229884790|ref|ZP_04504247.1| acetyltransferase, ribosomal protein N-acetylase [Sebaldella termitidis ATCC 33386] # 4 161 5 160 169 80 31 5e-14 MFTIRKATSDDCKLINELANQVFPATYKEILSTEQLDYMMEWMYAPENIRKQMEEEGHVY FIAYQGDEPCGYVSVQPQDADVFHLQKIYVLPGFQGAHLGSKLFDHAVQYIKEIHPSPCL MELNVNRNNKALHFYEHKGMKKLREGDFPIGNGYYMNDYIMGLEL >gi|226331997|gb|ACIB01000059.1| GENE 121 140501 - 140836 370 111 aa, chain + ## HITS:1 COG:mll2486 KEGG:ns NR:ns ## COG: mll2486 COG1695 # Protein_GI_number: 13472254 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Mesorhizobium loti # 4 105 3 103 108 77 36.0 8e-15 MNVDNVKSQMRKGMLEYCIMLLLHKEPAYASDIIQKLKEARLIVVEGTLYPLLTRLKNDD LLSYEWVESTQGPPRKYYKLTGKGESFLGELEASWKELNETVNHIANRESI >gi|226331997|gb|ACIB01000059.1| GENE 122 140849 - 141928 855 359 aa, chain + ## HITS:1 COG:no KEGG:BF2629 NR:ns ## KEGG: BF2629 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 359 1 359 359 667 99.0 0 MKKTLTVNLGGTVFHIDEDAYRLLDNYLCNLRLHFRKQEGAEEIVNDIENRISELFAEKL SAGSQVITIADVEEVIARMGKPEDFGEDTGEEEPQKTTGQTGAQQGATIRRRLYRNPDDK ILGGVISGLAAYLNWDVTVLRLIMFVVLICGYGVLIPIYIICWLVIPEARTAAEKLNMRG EDITIENIGRTVTDGFERMANGVNNYVNSGKPRSFLQKVGDALVSIAGFFLKACLVVLAI ICSPVLFVLAIVFVALVIAAIAVAIGGGAALYQMLPSVDWSPLISTSPMMTIAGSIAGVV LAGIPLAAIIFVILRQIFNWSPMSSGLKWSLLIIWILAVVIFVINLSYLGWPYPFLWVG >gi|226331997|gb|ACIB01000059.1| GENE 123 142080 - 143075 729 331 aa, chain + ## HITS:1 COG:YPO1228 KEGG:ns NR:ns ## COG: YPO1228 COG2220 # Protein_GI_number: 16121515 # Func_class: R General function prediction only # Function: Predicted Zn-dependent hydrolases of the beta-lactamase fold # Organism: Yersinia pestis # 79 303 84 313 
342 158 35.0 1e-38 MGEHICFRRNERLATVNPHWRGNPVVRGKFVNRQHRFRPGMGSVLKWRLSPNPQRKEKKS VKWSPKLNYLRSLDGVVGNSLIWLGHNSFFLQLARKRIMFDPVFGDIPFVKRQSDFPANP DIFTDIDYLLISHDHFDHLDKQSVARLVKNNPGMKLFCGLGTGELIKGWFPELEVTEAGW YQQIEDDGLKITFLPAQHWSKRSVRDGGRRLWGAFMVQADGISLYYSGDTGYSRHFREIP DLFGAPDYALVGIGAYKPRWFMQPNHISPYDALTASTDMKAALTIPMHYGTFDLSDEPLH DPPLVFAAEAKKRKIDVYIPVLGEVVKLKRM >gi|226331997|gb|ACIB01000059.1| GENE 124 143176 - 145284 2020 702 aa, chain - ## HITS:1 COG:XF2260 KEGG:ns NR:ns ## COG: XF2260 COG1506 # Protein_GI_number: 15838851 # Func_class: E Amino acid transport and metabolism # Function: Dipeptidyl aminopeptidases/acylaminoacyl-peptidases # Organism: Xylella fastidiosa 9a5c # 51 694 52 707 709 344 31.0 3e-94 MRQVNLFIMSAAMMLASCGGTKDAGKTDQALIGKSDIKIEGKRMTPEALWAMGRIGGLAV SPDGKQIAYTVAYYSVPENKSNREVFVMNADGTDNRQITHTPYQENEVTWAADGSKLLFL SNDNGSSQLYEMNPDGSGRKQISKYDGDIEGYSISPDGKKILFIAQVKTVKSTADKYPDL DKATGIIITDLMYKHWDEWVTTAPHPFIADFDGKSISNIIDVLEGEPYESPMKPWGGIEQ LAWNTTSDKVAYTCRKKTGLAYAISTNSDIYVYDLNTKKTVNITEGMMGYDTNPQYSPDG KSIAWQSMERDGYEADQNRLFVMNLETGEKRFVSKAFDSNVDAFIWSRDAKTIYFTGVWH GETQIYSLDLANDSVRPVTSGMYDYEGVALFGDKLIAKRHSMSMGDEIYAIALDGQTTQL SQENKQIYDQIEMGKVEGRWMKTTDSKEMLTWVIYPPQFDPNKKYPTLLFCEGGPQSPVS QFWSYRWNMQIMAANGYIVVAPNRRGLPGFGLEWNEAISGDYGGQCMKDYFTAIDEMAKE PFVDSDRLGCVGASFGGFSVYWLAGHHDKRFKAFIAHDGIFNMEMQYLETEEKWFANWDM GGAYWEKQNPTAQRTFANSPHLFVEKWDTPILCIHGEKDYRILANQAMAAFDAAVMRGVP AELLIYPDENHWVLKPQNGVLWQRTFFEWLDQWLKPNETAQK >gi|226331997|gb|ACIB01000059.1| GENE 125 145358 - 146689 1102 443 aa, chain - ## HITS:1 COG:PAB0243 KEGG:ns NR:ns ## COG: PAB0243 COG0534 # Protein_GI_number: 14520582 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Pyrococcus abyssi # 8 443 6 463 463 120 25.0 7e-27 MTTKYTYKEIWIIAYPILISLIMEQLIGMTDTAFLGRVGEVELGASAIAGVYYLAIFMMA FGFSIGAQILIARRNGEQQYQAIGPIFYQGVYFLLSLAVVAFTLSLCFSPHILKKVISSE HIYDASISYINWRVFGFFFSFVGVMFRAFFVGTTQTKTLTLNSVVMVLSNVVFNYILIFG KFGAPQLGIAGAAIGSSLAEMVSVIFFVIYTWKRINCKKYALNHLPGFRPEMLKRILNVS 
FWTMIQNFFSLSTWFMFFLFVEHLGERALAITNIIRNVSGIPFMVTMAFAATCGSLVSNL IGAGEIKCVPGTIRQHIRIGYIFVLPLVILFALFPNLILSIYTDIPDLREASIPSLWVLC SAYLILVPANVYFQALSGTGNTRTALGLELCVLAIYLIYITYMILYLRVDVAVCWTTEHL YGICILFLSYLYIKKGNWQKKQI >gi|226331997|gb|ACIB01000059.1| GENE 126 146778 - 148634 1635 618 aa, chain - ## HITS:1 COG:BB0442 KEGG:ns NR:ns ## COG: BB0442 COG0706 # Protein_GI_number: 15594787 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit YidC # Organism: Borrelia burgdorferi # 86 560 74 534 544 138 27.0 3e-32 MDKNTITGLVLIGILLVGFSFLSRPSEEQIAAQKRYYDSIAVVQQQEEALRAKTEAALAN EKEETAADSASLFFSATKGKEAFTTIQNNLVEITLDNKGGRVYSALLKNYMGQDKKPVVL FNGSDASMNFNFYNKKGALQTKDFYFEAVNKTDSSVTMRLAADSASYIDFIYTLKPDNYL MSFVIKATGMDGKLAASTNYVDISWSQRARQIEKGYTYENRLADLTYKYTGDDVDNLSAS KDDEKSVSERLDWIAFKNQFFSSVFIAEQDFEKTTVKSKMEKQGSGYIKDYSAEMSTFFD PTGKQPTDMYFYFGPNHYKTLTALDKGREEKWELNNLVYLGWPLIRWINKWITINVFDWL SGWGLSMGIVLLLLTIMVKIVVFPATWKTYMSSAKMRVLKPKIDEINKKYPKQEDAMKKQ QEVMGLYSQYGVSPMGGCLPMLLQFPILMALFMFVPSAIELRQQSFLWADDLSTYDAFIT FPFHIPFLGNHLSLFCLLMTVTNILNTKYTMQQQDTGAQPQMAAMKWMMYLMPIMFLFVL NDYPSGLNYYYFISTLISVVTMIILRRTTDENKLLTELEAKKKDPKQMKKTGFAARLEAM QKQQEQLAKERANKQNKK >gi|226331997|gb|ACIB01000059.1| GENE 127 148683 - 150284 1863 533 aa, chain - ## HITS:1 COG:CAC2892 KEGG:ns NR:ns ## COG: CAC2892 COG0504 # Protein_GI_number: 15896145 # Func_class: F Nucleotide transport and metabolism # Function: CTP synthase (UTP-ammonia lyase) # Organism: Clostridium acetobutylicum # 3 530 2 529 535 616 55.0 1e-176 MGETKYIFVTGGVASSLGKGIISSSIGKLLQARGYKVTIQKFDPYINIDPGTLNPYEHGE CYVTVDGHEADLDLGHYERFLGIQTTKANNITTGRIYKSVIDKERRGDYLGKTIQVIPHI TDEIKRNVKLLGNKYKFDFVITEIGGTVGDIESLPYLESIRQLKWELGQNALCVHLTYVP FLSAAQELKTKPTQHSVKELQSLGVQPDILVLRTEHDLNTNLRKKVALFCNVAENAVVQS IDASTIYEVPLLMQEQGLDETILQKMGLPVGERPPLGPWKDFLNRRANATETVTIAMVGK YVELQDAYKSILESLSQAATYNDRKVKIEYVSSEHLTPDNVDEQLGHVNGVVICPGFGSR GIEGKFVAAKYTREHNIPTFGICLGMQCMAIEFARNVLGYADANSIEMDEKTKHNVIDIM 
EEQKAITNMGGTMRLGAYECVLKKDSKVYEAYKEEHIQERHRHRYEFNNDYRKQFEEAGM KCVGINPESDLVEIVEIPTLKWYIGTQFHPEYSSTVLHPHPLFVSFIKAAIDK >gi|226331997|gb|ACIB01000059.1| GENE 128 150475 - 150582 68 35 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNLYNNSAVLLERALSLRTIQIELRTAFPRLDSAS >gi|226331997|gb|ACIB01000059.1| GENE 129 150668 - 152101 1261 477 aa, chain + ## HITS:1 COG:no KEGG:BF2601 NR:ns ## KEGG: BF2601 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 477 1 477 477 925 100.0 0 MKTSHISTLAFAILFSSSVMAQKTNKPVSETVQSEVKADTLSETLQQYLVLKLNLDGPKP KIDTVSILYNKYIGELEYLNDPSVPMRYIKTDPDYYRLFVPLTYYNSPIAEYSTMHWKFK EPFVTPSLSSQLLPPYDTLQFSKAERASRLVNVALMDLYLNHPNLVVNTEDHIMSRKLYH GDKKIEVPKTEVKSLFRADKVEDNVGEAEMVISKPNWWVTGGNGSLQITQNYISDNWYKG GESNNAVMANLQLFANYNDREKVQFENLFEAKLGFNSSPSDEYHKYLVNTDQLRLYSKLG IQAANNWYYTITGEFKTQFVKGYKANSEELVAAFLAPADVIVSVGMDYKLKKKKFNLSVF MSPLTYNLRYIGNKNVDETKFGLDKGKCSKNDFGAQVQPTISWTIIPSIVVDSRLNYLTN YKWVRVEWENTFNFVLNRYLSTKLFVHARFDDSAKPTTGSSYFQLKELLSFGLNYKW >gi|226331997|gb|ACIB01000059.1| GENE 130 152230 - 152634 408 134 aa, chain + ## HITS:1 COG:no KEGG:BF2600 NR:ns ## KEGG: BF2600 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 134 1 134 134 215 100.0 3e-55 MIKQDYLIRMIQEIISLIVNAILNKKKFRKDEWTEYDCLTRQILGVSQEELLSMSLDEMI DCYEGDPNRMGKIELAAVTLLKVSDEVESDILQKSKLRQDGLSLLKYVQKESSTFSIQRT NLIRMIEINESLKM >gi|226331997|gb|ACIB01000059.1| GENE 131 153310 - 154182 857 290 aa, chain + ## HITS:1 COG:MA0664 KEGG:ns NR:ns ## COG: MA0664 COG2878 # Protein_GI_number: 20089551 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfB # Organism: Methanosarcina acetivorans str.C2A # 1 265 1 261 264 169 40.0 5e-42 MNLILIAVISLGAIALVLAAVLYVASKKFAVYEDPRIAQVSEVLPQANCGGCGYPGCSGF ADACVKAGSLEGKFCPVGGQPVMSQVASILGLDAGTAEPMVAVVRCNGTCTNRPRTNMYD GAKSCAIAASLYGGETGCSYGCLGCGDCVAACQFDAIHMNPETGLPEVDEEKCTACGACV KACPKAIIELRAKGKKSRRVYVSCVNKDKGAVARKACTVSCIGCGKCVKTCPFEAITLEN 
NLAYIDYNKCKSCRKCVEVCPQHTIIELNFPPRKPKEETPVAEAAKTAEA >gi|226331997|gb|ACIB01000059.1| GENE 132 154219 - 155556 1147 445 aa, chain + ## HITS:1 COG:FN1596 KEGG:ns NR:ns ## COG: FN1596 COG4656 # Protein_GI_number: 19704917 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfC # Organism: Fusobacterium nucleatum # 1 436 6 438 441 371 46.0 1e-102 MLKTFSIGGVHPHENKLSAHQPIVTAEVPAKAVILLGQHIGAPAKPIVAKGDVVKVGTKI AEANGFVSAAIHSSVSGKVAKIDSIVDASGYAKPAIFIDVDGDEWEESIDRSPELVKECE LTSEEIVKKIADAGIVGLGGACFPTQVKLCPPPSFKAECVIINAVECEPYLTADHQLMLE HAEEVMVGVAILMKAVKVNKAFIGIENNKPDAIQLMTKVASSYAGIEVVPLKVKYPQGGE KQLIDAITKRQVASGALPISTGAVVQNVGTAFAVYQAVQKNKPLFERVITVTGKSVAQPS NFLARIGTPMQQLIDACGGLPEDTGKIIGGGPMMGKALVNTDVPTAKGSSGILIMNRKEA KRGEVQNCIRCAKCVSACPMGLEPYLLSALAENTEFERMESERIMDCIECGSCQFTCPAN RPLLDYCRLGKGKVGAMIRARQAKK >gi|226331997|gb|ACIB01000059.1| GENE 133 155562 - 156554 973 330 aa, chain + ## HITS:1 COG:TM0245 KEGG:ns NR:ns ## COG: TM0245 COG4658 # Protein_GI_number: 15643017 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfD # Organism: Thermotoga maritima # 4 328 2 318 318 269 49.0 5e-72 MENKLIVSLSPHVHGGDSVQKNMYGVLIALIPAFIVSLVYFGLGALIVTATSVIACLFFE WAIGKFLLKKETTTICDGSAVITGVLLAFNLPSNLPVWIIILGALFAIGVGKMSFGGLGN NPFNPALAGRVFLLLSFPVQMTSWPVVGQLTAYTDATTAATPLALMKQAIHGDVSAFGQL PDAWSLFIGNNGGCLGEVSAAALILGLLYMLWKRIITWHIPVSILVTVFVFSGIMHLANP EAYVSPVIQLLSGGLMLGAIFMATDYVTSPMSKKGMLIYGVCIGLLTVVIRLFGAYPEGM SFAILIMNAFTPLINTYCKPKRFGEVAKKK >gi|226331997|gb|ACIB01000059.1| GENE 134 156581 - 157219 881 212 aa, chain + ## HITS:1 COG:YPO2241 KEGG:ns NR:ns ## COG: YPO2241 COG4659 # Protein_GI_number: 16122469 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfG # Organism: Yersinia pestis # 3 177 10 188 209 79 32.0 5e-15 MLLVLTGVTAISVALLAYVNELTKGPIADANAKTLNEALKEVLPEFTNNPVAECDTVFSE 
KDGKKIVDFIIYPAKNGDQWVGTAVEAKSMGFGGELKVLVGFDAEGKIYNYSLLSHAETP GLGSKAADWFKEGNKGSIKGMNPGEQPLTVSKDGGQVDAITASTITSRAFLNAVNAAYGA YKGDGGANGTTGASQKAVQKSETAIVDSVVIK >gi|226331997|gb|ACIB01000059.1| GENE 135 157237 - 157824 697 195 aa, chain + ## HITS:1 COG:FN1593 KEGG:ns NR:ns ## COG: FN1593 COG4660 # Protein_GI_number: 19704914 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfE # Organism: Fusobacterium nucleatum # 1 189 1 190 205 194 62.0 7e-50 MNNFKVLMNGIVKENPTFVLLLGMCPTLGTTSSAINGMGMGLATMFVLICSNVVISLIKN LIPDMVRIPSFIVVIASFVTLLQMVMQAYVPGLYATLGLFIPLIVVNCIVLGRAEAFAAK NNALSSMFDGIGMGLGFTIALTLLGAVREFLGTGKVFNLSIIPEEYGMLVFVLAPGAFIA LGYLIALINSLKKAN >gi|226331997|gb|ACIB01000059.1| GENE 136 157838 - 158410 675 190 aa, chain + ## HITS:1 COG:FN1592 KEGG:ns NR:ns ## COG: FN1592 COG4657 # Protein_GI_number: 19704913 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfA # Organism: Fusobacterium nucleatum # 18 189 21 192 194 176 61.0 2e-44 MEYILIFISAIFVNNIVLSQFLGICPFLGVSKKVDTALGMSAAVAFVLTIATIVTFLIQK FVLDAFGLQYLQTITFILVIAALVQMVEIILKKVSPALYQALGVFLPLITTNCCILGVAI LVIQKDYDLLTGVVYAFSTAIGFGLALVLFAGLREQMSLVKIPEGMKGTPIALITAGLLA MAFMGFSGVV >gi|226331997|gb|ACIB01000059.1| GENE 137 158615 - 159649 1072 344 aa, chain + ## HITS:1 COG:BS_galE KEGG:ns NR:ns ## COG: BS_galE COG1087 # Protein_GI_number: 16080937 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-glucose 4-epimerase # Organism: Bacillus subtilis # 5 335 3 328 339 338 50.0 1e-92 MKERILVTGGTGYIGSHTVVELQNSGYEVIIIDNLSNSNADVVDNIEKVSGIRPVFEKLD CLDFDGLDAVFNKYKGIKAIIHFAASKAVGESVEKPLLYYRNNLVSLINLLELMPKHGIE GIVFSSSCTVYGEPDELPVTENAPIKKATSPYGNTKQINEEIVRDTVASGAPINAILLRY FNPIGAHPTALLGELPNGVPQNLIPYLTQTAIGIREKLSVFGDDYDTPDGSCIRDFINVV DLAKAHVIAIARILEKKQKDKVETFNIGTGRGVSVLELINGFEKATGVKLNYQIVGRRAG DIEKVWANPDYANNELGWKAQETLEDTLRSAWAWQLKLRERGIQ >gi|226331997|gb|ACIB01000059.1| GENE 138 159975 - 161234 862 419 aa, 
chain - ## HITS:1 COG:no KEGG:BF2612 NR:ns ## KEGG: BF2612 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 419 1 419 419 861 99.0 0 MKRQIVSCGIFAMLLLASCNGAQQTSEVDLIDIAGGMEKLTELKVSDLGKTIRYIPLETT DSCLIGDFPNIKLLDDKIMVYNGKQCLLFDKETGKFICSVGHRGDDPEAYSSTCGYLNPQ NQLLYFNRDPNQLVKYNQKGNFAGVITIPSPETSTGNIQINRPGMNGFVFSDSIIIGHYV GGLGQRCPSSVLYFSDKGNYIDTIPNIIPELGTGQVGDINNISVRKGFGLLEGIIQIQYN NGKQSICVPGYTALWKCNDEIRFKELFTDTIYTLKHNQLEKYIIFNTGKYYWPASERTLT DNNSDRIMISYAIENDKLLYFQFIRGLYTDKHILYNGIYDKNTKVTNIALAENNFVDDLT HFMPFTPSFLNEKGECASLVQAADILTWLEEHPEVKIEKELGVLKNLNEESNPVVVLVK >gi|226331997|gb|ACIB01000059.1| GENE 139 161368 - 162612 675 414 aa, chain - ## HITS:1 COG:no KEGG:BF2590 NR:ns ## KEGG: BF2590 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 414 1 414 414 845 100.0 0 MKNYLITYSCAMFLLASCGSPSQQPKAPGQIDIAGKIESLTELKASDFIKEINYVALETT DSCLVNENPNIQVFKNNIIVNTNKQCLVFDKDSGKFLRSIGHIGNDPGGYSEATFWIDDI TGELYFIGWNGTLMRYDLQGNYLGDVKVASNLGVRNPACFVFTDSLIVSHQLSLLPIQGN EKKSPFLLLDKQGNVIDSIPSLLPVIPVTTNVVRLNILKSEKARELYGNIGVHGMMTAYF NNDDAIESPRVLSTLWKHNGKVRFKEAYLDTIYTLSENKLSPYLIFNTGKYHYPAEERYQ QKENDERVKIDYVMESTTLIIFQFRQRGEVYTGIYNKDTQITQIAKGQNFVNDIDHFMPL NPRNCNTDNEYVDLVQANTILEWMEEHPEVNPDGKFSFIKGINEESNPVVILMK >gi|226331997|gb|ACIB01000059.1| GENE 140 162635 - 163459 515 274 aa, chain - ## HITS:1 COG:NMA1092 KEGG:ns NR:ns ## COG: NMA1092 COG1947 # Protein_GI_number: 15794040 # Func_class: I Lipid transport and metabolism # Function: 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase # Organism: Neisseria meningitidis Z2491 # 4 249 10 245 281 139 39.0 5e-33 MITFPNAKINLGLNITEKRPDGYHNLETVFYPIPLEDALEITILNDSKQKFVLHQSGLEI SGEPETNLVVKAYLLLEQEFQLPPVDIYLYKHIPSGAGLGGGSADAAFMLKLLNEKFNLH LADEKLEEYAAILGADCAFFIKNKPTFAEGIGNIFSPVDLSLKGYQLVLVKPDVFVSTRD AFSQIQPHYPDHSLKEIIRHPVSEWKNCMFNDFEKSVFPQYPVIEEIKKELYSKGAIYAA MSGSGSSVFGLFSPEEKITKMDFEAAFCFQTELK >gi|226331997|gb|ACIB01000059.1| GENE 141 163637 - 165184 1309 
515 aa, chain + ## HITS:1 COG:lin0047 KEGG:ns NR:ns ## COG: lin0047 COG0305 # Protein_GI_number: 16799126 # Func_class: L Replication, recombination and repair # Function: Replicative DNA helicase # Organism: Listeria innocua # 24 464 8 440 450 375 45.0 1e-103 MAEQRRNTRSTAKSKVQPVNDYGRIQPQAPELEEAVLGALMIEKDAYSLVSEILRPESFY EHRHQLIYAAITDLAVNQKPVDILTVKEQLSKRGELEEVGGPFYITQLSSKVASSAHIEY HARIIAQKYLARELITFTSNIQSKAFDETLDVDDLMQEAEGKLFEISQRNMKKDYTQINP IIAEAYEQIQKAAARTDGLSGLESGYTKLDKMTSGWQKSDLIIIAARPAMGKTAFVLSMA KNIAVNFRNPVALFSLEMSNVQLVNRLISNVCEIPSEKIKSGQLAAYEWQQLDYKLKDLL DAPLYVDDTPSLSVFELRTKARRLVREHGVRIIIIDYLQLMNASGMAFGSRQEEVSTISR SLKGLAKELNIPIIALSQLNRGVESREGLEGKRPQLSDLRESGAIEQDADMVCFIHRPEY YKIFQDDKGNDLRGMAEIIIAKHRNGAVGDVLLRFKGEYTRFQNPDDDMVIPLPDAGAML GSRMNNTGTVPPPPAEFAPQNSNPFGGGNDGPLPF >gi|226331997|gb|ACIB01000059.1| GENE 142 165214 - 165543 404 109 aa, chain + ## HITS:1 COG:no KEGG:BF2587 NR:ns ## KEGG: BF2587 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 205 100.0 3e-52 MKRYCQTLELYNDSELIRAYVAEHQHVWPEIKAGIREVGILDMQIYIHEHTLFMIVDTVD EFDWIKDNERLAKLPRQAEWEAYMSRFQRSLPGQASHEKWKMMERIFKL >gi|226331997|gb|ACIB01000059.1| GENE 143 165548 - 168397 2113 949 aa, chain + ## HITS:1 COG:CAC3055 KEGG:ns NR:ns ## COG: CAC3055 COG2605 # Protein_GI_number: 15896306 # Func_class: R General function prediction only # Function: Predicted kinase related to galactokinase and mevalonate kinase # Organism: Clostridium acetobutylicum # 584 886 2 275 364 87 26.0 2e-16 MQKLLSLPPNLVQSFHELERVNRTDWFCTSDPVGKKLGSGGGTSWLLEECYNEYSDGATF GEWLEKEKRILLHAGGQSRRLPGYAPSGKILTPVPVFRWERGQHLGQNLLSLQLPLYEKI MSLAPDKLHTLIASGDVYIRSEKPLQSIPEADVVCYGLWVDPSLATHHGVFASNRKHPEQ LDFMLQKPSLAELESLSKTHLFLMDIGIWLLSDRAVEILIKRSHKESSEELKYYDLYSDF GLALGTHPRIEDEEVNTLSVAILPLPGGEFYHYGTSKELISSTLSVQNKVYDQRRIMHRK VKPNPAMFVQNAVVRIPLCAENADLWIENSHIGPKWKIASRHIITGVPENDWSLAVPAGV CVDVVPMGDKGFVARPYGLDDVFKGDLRDSKTTLTGIPFGEWMSKRGLSYTDLKGRTDDL 
QAASVFPMVNSVEELGLVLRWMLSEPELEEGKNIWLRSERFSADEISAGANLKRLYAQRE EFRKGNWQALAVNHEKSVFYQLDLADAAEDFVRLGLDMPELLPEDALQMSRIHNRMLRAR ILKLDGKDYRPEEQAAFDLLRDGLLDGISNRKSTPKLDVYSDQIVWGRSPVRIDMAGGWT DTPPYSLYSGGNVVNLAIELNGQPPLQVYVKPCKDFHIVLRSIDMGAMEIVSTFDELQDY KKIGSPFSIPKAALSLAGFAPAFSAVSYASLEEQLKDFGAGIEVTLLAAIPAGSGLGTSS ILASTVLGAINDFCGLAWDKNEICQRTLVLEQLLTTGGGWQDQYGGVLQGVKLLQTEAGF AQSPLVRWLPDHLFTHPEYKDCHLLYYTGITRTAKGILAEIVSSMFLNSSLHLNLLSEMK AHALDMNEAIQRGSFVEFGRLVGKTWEQNKALDSGTNPPAVEAIIDLIKDYTLGYKLPGA GGGGYLYMVAKDPQAAVRIRKILTENAPNPRARFVEMTLSDKGFQVSRS Prediction of potential genes in microbial genomes Time: Wed May 18 00:27:17 2011 Seq name: gi|226331996|gb|ACIB01000060.1| Bacteroides sp. 3_2_5 cont1.60, whole genome shotgun sequence Length of sequence - 10130 bp Number of predicted genes - 10, with homology - 10 Number of transcription units - 4, operones - 3 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 8 - 67 3.5 1 1 Op 1 . + CDS 148 - 666 453 ## BF2585 putative transcriptional regulatory protein UpxY-like protein 2 1 Op 2 . + CDS 702 - 1172 491 ## BF2584 hypothetical protein 3 1 Op 3 13/0.000 + CDS 1196 - 2083 699 ## COG1209 dTDP-glucose pyrophosphorylase 4 1 Op 4 . + CDS 2097 - 2675 351 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes + Prom 2685 - 2744 5.3 5 2 Op 1 2/0.000 + CDS 2799 - 3833 598 ## COG2089 Sialic acid synthase 6 2 Op 2 . + CDS 3845 - 5446 875 ## COG1083 CMP-N-acetylneuraminic acid synthetase 7 2 Op 3 . + CDS 5517 - 6182 181 ## COG0110 Acetyltransferase (isoleucine patch superfamily) + Prom 6255 - 6314 7.3 8 3 Tu 1 . + CDS 6348 - 7586 416 ## COG2244 Membrane protein involved in the export of O-antigen and teichoic acid + Prom 7602 - 7661 6.2 9 4 Op 1 . + CDS 7720 - 9183 670 ## ACICU_00082 hypothetical protein 10 4 Op 2 . 
+ CDS 9171 - 10128 342 ## COG0463 Glycosyltransferases involved in cell wall biogenesis Predicted protein(s) >gi|226331996|gb|ACIB01000060.1| GENE 1 148 - 666 453 172 aa, chain + ## HITS:1 COG:no KEGG:BF2585 NR:ns ## KEGG: BF2585 # Name: not_defined # Def: putative transcriptional regulatory protein UpxY-like protein # Organism: B.fragilis # Pathway: not_defined # 1 172 1 172 172 339 100.0 2e-92 MSQQQEYWFAARTRKDQELTTRDALEKIGVEYFLPTQFVIRQLKYRRRRVEVPVIRNLIF VHATKEFACAIANEYGVRLFYMRDFDTKSMLIVPDKQMKDFMFVMNLDPAAVILNDDCFA VGTKVQVIKGDFCGVEGELASLSNRTYVTIRIRGVLSASVKVPKSYLRILAP >gi|226331996|gb|ACIB01000060.1| GENE 2 702 - 1172 491 156 aa, chain + ## HITS:1 COG:no KEGG:BF2584 NR:ns ## KEGG: BF2584 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 156 5 160 160 288 100.0 5e-77 MTQIHELQHVAHELLYLGADGSPIYTDSFRQLNTEVLQKSDALFALKGENPEEEARLCLA LLMGYNATIYDYGDKESKKQVILDRSLLVLESLPSSLLKCQLLTYCYGEVFEEELAKEAH AIIDYWDNKTLSIDEQETVDMLKMIEENQYPNSYID >gi|226331996|gb|ACIB01000060.1| GENE 3 1196 - 2083 699 295 aa, chain + ## HITS:1 COG:NMB0062 KEGG:ns NR:ns ## COG: NMB0062 COG1209 # Protein_GI_number: 15675999 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-glucose pyrophosphorylase # Organism: Neisseria meningitidis MC58 # 1 291 1 288 288 407 65.0 1e-113 MKGIVLAGGSGTRLYPITKGVSKQLLPIFDKPMIYYPISVLMLAGIREILIISTPYDLPG FQRLLGDGSDFGVRFEYAEQPSPDGLAQAFIIGEKFIGGDSVCLVLGDNIFYGQSFTRML REAVHTAESENKATVFGYWVSDPERYGVAEFDEAGNVLSIEEKPTVPKSNYAVVGLYFYP NKVVEVAKSIQPSPRGELEITTVNQRFLSDRELKVQLLGRGFAWLDTGTHDSLSEASTFI EVIEKRQGLKVACLEGIALRQGWISPEEMKVLAGPMLKNQYGQYLLKVIDELSIK >gi|226331996|gb|ACIB01000060.1| GENE 4 2097 - 2675 351 192 aa, chain + ## HITS:1 COG:MA3780 KEGG:ns NR:ns ## COG: MA3780 COG1898 # Protein_GI_number: 20092576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Methanosarcina acetivorans str.C2A # 1 190 1 183 183 220 58.0 1e-57 
MNIIKTSIEGLVILEPRLFQDDRGYFFESFNQGEFESNVCQTTFVQDNESKSSYGVIRGL HFQKPPFAQSKLVRVIKGAVLDVAVDIRKGSPTFGKHVSVELTEDNHRQFFIPRGFAHGF SVLSEEVIFQYKCDNFYHPEAEGAIAWNDPDLNIDWRVPTRKVILSDKDTSHKCLGGVLD LFDYKKEIGYLL >gi|226331996|gb|ACIB01000060.1| GENE 5 2799 - 3833 598 344 aa, chain + ## HITS:1 COG:MJ1065 KEGG:ns NR:ns ## COG: MJ1065 COG2089 # Protein_GI_number: 15669254 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sialic acid synthase # Organism: Methanococcus jannaschii # 4 342 18 332 337 244 41.0 1e-64 MKPFVIAEIGVNFYDTAREKNISPLEAAKLYILEAFKAGVSAVKFQSYKADTIVSKNSPA YWDLSKEPTTTQHALFSKHDGFDKEDYQELCNYCKEVGVAFLSTPFDFNSADYLENMVDI YKISSSDLTNIPFIRYIARKGKPIFLSVGASYLFEIDEVVRAIKEEGNNDICILHCVLSY PTKNEDANLNVIQTLKKIYPDLKIGYSDHTLPDPTMTVLSTAYLLGADVIEKHFTLDKTL KGNDHYHAGDPLDFRTAIDNFNLIQTIRGIDEKTVLPCEIIPRREARRSIVLTRDLKAGT TLSMEHLTFKRPGTGIAPKYLDIVIGKQVKKDLLEDTVLTWDMI >gi|226331996|gb|ACIB01000060.1| GENE 6 3845 - 5446 875 533 aa, chain + ## HITS:1 COG:MA3766_1 KEGG:ns NR:ns ## COG: MA3766_1 COG1083 # Protein_GI_number: 20092564 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: CMP-N-acetylneuraminic acid synthetase # Organism: Methanosarcina acetivorans str.C2A # 2 216 3 226 227 111 35.0 4e-24 MKILAVIPARAGSKGIPNKNIRLIHNKPLIYYAIQNALNSRYITDVVVSTDSPEVEIIAS QMNVNVKKRNIALCGDSITLDSVVNDVASGYDCDYVVTMQPTSPTLTATTLDNAIEYTIK NELDTLISVVNHPHLSWGDEQGKRIPNYSKRLNRQYLPPYYLETGAFLISKAEVVTSHSR IGKKVDVYVIPEEEAIDIDTFSDLMVADVLLQKKKVAIYVNGNNKRGLGHIYRALEIADE FYTKPDIYYDINQTNVKMFGFTTHNLIGVNGFGELLSKLQEKEYNLLINDVLTTSIDYMI AIKKTLPNAKIVNFEDDGEGIYKADLVFNALYQQSNLSNVKVGEKYYIVSKLFMFYHPID IKENVRTIFISFGGADPQNYSDRMLNIISNKKKYEKLHFIVVLGRAKNNVEALMNYNKYD NIDVLFDVKNMPDLMSKCDIGITSRGRTGYELAVLGIPTIAMAQNEREEKHGFVNNDNGF TYLGLNPSDYIIESTLDMYINLSVQDRMKYQYKLLSHDLRNGRRHVMGLINDL >gi|226331996|gb|ACIB01000060.1| GENE 7 5517 - 6182 181 221 aa, chain + ## HITS:1 COG:AGc2119 KEGG:ns NR:ns ## COG: AGc2119 COG0110 # Protein_GI_number: 15888483 # Func_class: R General function prediction only # Function: Acetyltransferase 
(isoleucine patch superfamily) # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 36 219 8 199 210 117 35.0 1e-26 MILAKIIKHILLRKKGIFLDDKSDINWGLQLGVGFKRAKIVDSKLEINSMGNGCFIEHTI AYGKIELGNYVSISGPGTILHAVIGKIQIGNFSSIGQNVSINEFNHNIRLPSTYAMQLNF FSKNFKDDVTSKGDVIIEEDVWIGSNSVILSGVRIGRGAVIAAGSIVNKDVPPYAIVGGV PFKVIKMRFTANQIEYLEKIRWWEWDDKKIMDNCHFFETEL >gi|226331996|gb|ACIB01000060.1| GENE 8 6348 - 7586 416 412 aa, chain + ## HITS:1 COG:MA4461 KEGG:ns NR:ns ## COG: MA4461 COG2244 # Protein_GI_number: 20093247 # Func_class: R General function prediction only # Function: Membrane protein involved in the export of O-antigen and teichoic acid # Organism: Methanosarcina acetivorans str.C2A # 156 337 216 401 490 60 27.0 8e-09 MGVFLNYSIGRAKIDFKDDFEGYLSSIQGLSCFIGVLILIIIIPFVNSLAEFMEVDKILL ITMVVYLIFYPSIEYMQSKLRFEYRYKENVLIAVINTFSVVIVSIVLILSSGYEEKYIGR IQGIVYPSFLIATICFIFIFIRGKKVIVLNYWKYALNFSIPMIPHALAMIALAQIDRIMI VKMVGDREAGIYSFGYSYAIILSVITNAIINAWQPWLYNCANENKVDEIKKSNKEINKFV CILTILFVAIAPEVLIVLGTKDFVEAKYMVGPVIIGTFFQFLYSYFVLMEMYCKKTIIIA IGSIMAAIVNFYLNMFFIPILGYVVAAYTTMVGYILLMIYHWIAFKLVYKPKIFAEKQIL ILTFFTPIICFVVMYYYDNLVWRFLISFFIVLIYIYSNKYALFAVINRWSKK >gi|226331996|gb|ACIB01000060.1| GENE 9 7720 - 9183 670 487 aa, chain + ## HITS:1 COG:no KEGG:ACICU_00082 NR:ns ## KEGG: ACICU_00082 # Name: not_defined # Def: hypothetical protein # Organism: A.baumannii_ACICU # Pathway: not_defined # 206 411 225 422 480 95 33.0 3e-18 MYKECSINDTTYYAFKKISGITVYTSNDISEYFSHILDSDIIDEKYLNYIQERYTHFENL NSQILATQFFTRHYHYRNYMKSCSYNQQLNWLILNYKNINNIVDDFKPDVVIDTDNAELA RCVMREVCYERDIPYITIEYPRYSFYKSFSFNLNLSVDPIFVEGYQSNLNKGTQELAAEI SYVRDFRSKVSIMHEMYKNDVTSQYKPNSFMKLVRTLVGKISYFWEQDIKAGNLKLKRSN PILYNNSVEYLKFYAKYELIKQYLLRKNKYFYTPSKGEKYLYMPLHLIPESTTFTLAPHY INELTIIEAVSKSLPAGWWLYVKEHQAMVGERGLGFYQKVNKLPNVKMVQLNYYSDPKPW IVNSMGVITISGTTAYEAALLQRHAIIFSDVPFKLLDGVERCRSFEDLPELIRNFSTTLD NEKSCAAYIATVKQLGFSIDLKYLMNQGEKIIRNKSQQDSRYQENLNNLEQLYLLAFSLY KRVVCQK >gi|226331996|gb|ACIB01000060.1| GENE 10 9171 - 10128 342 319 aa, chain + ## HITS:1 
COG:BS_yveT KEGG:ns NR:ns ## COG: BS_yveT COG0463 # Protein_GI_number: 16080481 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Bacillus subtilis # 1 217 2 218 344 107 31.0 2e-23 MPKVSVIIPVYGVEKYIERCARSLFEQTLDDMEFIFIDDCTPDGSINILSNVLESYPLRI KQTQIVKMQMNSGQAEVRRLGTQLATGDYIIHCDSDDWVNADMYKKMWEKAVGEDLDIVV CDFYRSDGNNYQIFKGIYDGVYQNKTVYFSELLKGQVTTAVWNKLVSREIYFSNNIIYPT SNMWEDFVLNVQLTYYSRRIGYINLPLYFYFSNPLSICHSNIEQRISQVIDNSNLILRFL HLQGLDKIYKDELLYFKYYSRSELAVHVMEKKYRIMWKNIYPEINVKFCFSKAVSFKEKL KFMSIYSGIYPMIIRGIRI Prediction of potential genes in microbial genomes Time: Wed May 18 00:28:08 2011 Seq name: gi|226331995|gb|ACIB01000061.1| Bacteroides sp. 3_2_5 cont1.61, whole genome shotgun sequence Length of sequence - 140542 bp Number of predicted genes - 116, with homology - 115 Number of transcription units - 52, operones - 30 average op.length - 3.1 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 983 - 1042 8.7 2 2 Op 1 6/0.000 + CDS 1087 - 1653 323 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog + Term 1664 - 1700 -0.8 3 2 Op 2 . + CDS 1727 - 2695 865 ## COG3712 Fe2+-dicitrate sensor, membrane component + Prom 2810 - 2869 4.0 4 3 Op 1 . + CDS 2951 - 6322 3465 ## BF3971 hypothetical protein 5 3 Op 2 . + CDS 6345 - 7964 1524 ## BF3972 hypothetical protein 6 3 Op 3 . + CDS 8002 - 8376 372 ## BF3748 putative lipoprotein 7 3 Op 4 . + CDS 8384 - 10420 1611 ## BF3749 hypothetical protein 8 4 Op 1 . - CDS 10516 - 12708 1118 ## BF3975 putative transcriptional regulator 9 4 Op 2 . - CDS 12712 - 13086 458 ## COG3682 Predicted transcriptional regulator - Prom 13148 - 13207 5.3 - Term 13230 - 13272 7.7 10 5 Op 1 . - CDS 13301 - 16588 3165 ## COG0793 Periplasmic protease - Prom 16626 - 16685 1.8 - Term 16609 - 16655 6.3 11 5 Op 2 . - CDS 16695 - 17636 1245 ## COG0039 Malate/lactate dehydrogenases - Prom 17721 - 17780 6.0 + Prom 17608 - 17667 3.4 12 6 Tu 1 . 
+ CDS 17823 - 18695 990 ## BF3979 hypothetical protein + Term 18722 - 18782 19.1 - Term 18708 - 18768 19.1 13 7 Op 1 . - CDS 18818 - 19759 234 ## COG3712 Fe2+-dicitrate sensor, membrane component 14 7 Op 2 . - CDS 19740 - 20294 396 ## BF3981 RNA polymerase ECF-type sigma factor 15 7 Op 3 . - CDS 20334 - 22901 1621 ## BF3982 hypothetical protein - Prom 22922 - 22981 8.9 + Prom 23300 - 23359 8.9 16 8 Op 1 13/0.000 + CDS 23482 - 24936 1406 ## COG1538 Outer membrane protein 17 8 Op 2 9/0.000 + CDS 25014 - 26006 1220 ## COG0845 Membrane-fusion protein 18 8 Op 3 22/0.000 + CDS 26141 - 27184 699 ## COG0842 ABC-type multidrug transport system, permease component 19 8 Op 4 . + CDS 27184 - 28434 1189 ## COG0842 ABC-type multidrug transport system, permease component + Term 28456 - 28486 -0.1 20 9 Tu 1 . - CDS 28481 - 28729 88 ## + Prom 28494 - 28553 5.2 21 10 Tu 1 . + CDS 28611 - 30560 1663 ## BF3763 hypothetical protein + Prom 30625 - 30684 2.5 22 11 Op 1 . + CDS 30710 - 31390 649 ## COG2755 Lysophospholipase L1 and related esterases 23 11 Op 2 . + CDS 31411 - 32160 432 ## BF3990 hypothetical protein + Prom 32211 - 32270 3.4 24 12 Op 1 . + CDS 32361 - 32726 429 ## BF3991 transcriptional regulator 25 12 Op 2 . + CDS 32747 - 34480 1083 ## BF3992 TonB 26 12 Op 3 . + CDS 34523 - 36424 1493 ## BF3768 putative exported thioredoxin + Term 36455 - 36502 -0.5 - Term 36440 - 36493 15.1 27 13 Op 1 . - CDS 36515 - 37621 1284 ## COG0489 ATPases involved in chromosome partitioning 28 13 Op 2 . - CDS 37630 - 38388 856 ## COG0220 Predicted S-adenosylmethionine-dependent methyltransferase 29 13 Op 3 . - CDS 38398 - 39204 759 ## COG1237 Metal-dependent hydrolases of the beta-lactamase superfamily II 30 13 Op 4 . - CDS 39216 - 40235 1193 ## COG0115 Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 31 13 Op 5 . - CDS 40281 - 40493 280 ## BF3998 hypothetical protein - Term 40504 - 40553 2.6 32 14 Op 1 . 
- CDS 40589 - 41821 1296 ## COG1570 Exonuclease VII, large subunit 33 14 Op 2 . - CDS 41825 - 43180 1340 ## COG1404 Subtilisin-like serine proteases 34 14 Op 3 . - CDS 43208 - 44296 1111 ## COG0482 Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain - Prom 44317 - 44376 3.8 + Prom 45156 - 45215 2.5 35 15 Tu 1 . + CDS 45244 - 46104 442 ## BF3778 hypothetical protein - Term 46039 - 46079 6.8 36 16 Op 1 . - CDS 46101 - 47111 833 ## COG1409 Predicted phosphohydrolases 37 16 Op 2 . - CDS 47140 - 47619 624 ## COG0245 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 38 16 Op 3 . - CDS 47658 - 48272 609 ## COG0179 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 39 16 Op 4 . - CDS 48276 - 48932 558 ## COG2344 AT-rich DNA-binding protein - Prom 49051 - 49110 4.2 + Prom 49034 - 49093 5.1 40 17 Tu 1 . + CDS 49115 - 49465 411 ## COG0023 Translation initiation factor 1 (eIF-1/SUI1) and related proteins + Term 49603 - 49644 9.2 - Term 49587 - 49634 13.1 41 18 Op 1 38/0.000 - CDS 49656 - 50648 1312 ## COG0264 Translation elongation factor Ts - Prom 50678 - 50737 3.2 - Term 50706 - 50745 -0.3 42 18 Op 2 . - CDS 50772 - 51608 1391 ## PROTEIN SUPPORTED gi|53715295|ref|YP_101287.1| 30S ribosomal protein S2 43 19 Op 1 59/0.000 - CDS 51729 - 52115 642 ## PROTEIN SUPPORTED gi|53715296|ref|YP_101288.1| 30S ribosomal protein S9 44 19 Op 2 . - CDS 52122 - 52583 795 ## PROTEIN SUPPORTED gi|53715297|ref|YP_101289.1| 50S ribosomal protein L13 - Prom 52687 - 52746 3.9 + Prom 52335 - 52394 4.4 45 20 Tu 1 . + CDS 52602 - 52898 137 ## BF4014 hypothetical protein - Term 52817 - 52889 10.3 46 21 Tu 1 . - CDS 52918 - 53406 415 ## BF4015 hypothetical protein - Prom 53445 - 53504 8.6 - Term 53480 - 53533 7.1 47 22 Tu 1 . - CDS 53564 - 54967 1570 ## COG0017 Aspartyl/asparaginyl-tRNA synthetases - Prom 54988 - 55047 7.0 - Term 54996 - 55044 10.8 48 23 Op 1 . 
- CDS 55074 - 56474 1776 ## COG1187 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 49 23 Op 2 . - CDS 56556 - 57902 1545 ## COG0015 Adenylosuccinate lyase - Prom 57926 - 57985 5.4 - Term 58025 - 58089 6.4 50 24 Tu 1 . - CDS 58124 - 60010 1642 ## COG1874 Beta-galactosidase - Prom 60109 - 60168 5.7 - TRNA 60184 - 60258 90.6 # Val TAC 0 0 - TRNA 60310 - 60384 90.6 # Val TAC 0 0 - TRNA 60411 - 60488 91.2 # Val TAC 0 0 - Term 60616 - 60655 1.2 51 25 Op 1 . - CDS 60678 - 61229 309 ## COG0110 Acetyltransferase (isoleucine patch superfamily) 52 25 Op 2 . - CDS 61236 - 62009 477 ## COG3022 Uncharacterized protein conserved in bacteria - Prom 62112 - 62171 3.5 53 26 Tu 1 . + CDS 62131 - 64350 1982 ## BF4022 hyaluronoglucosaminidase precursor - Term 64492 - 64530 2.1 54 27 Op 1 . - CDS 64549 - 64932 225 ## BF4023 hypothetical protein 55 27 Op 2 . - CDS 64947 - 66605 876 ## BF3798 hypothetical protein 56 27 Op 3 . - CDS 66640 - 69114 1810 ## BF4025 hypothetical protein - Prom 69169 - 69228 2.0 - Term 69169 - 69211 7.2 57 28 Op 1 . - CDS 69230 - 71692 2166 ## BF3800 putative outer membrane protein 58 28 Op 2 . - CDS 71716 - 73566 1273 ## BF4028 hypothetical protein 59 28 Op 3 . - CDS 73591 - 74187 387 ## BF4029 hypothetical protein 60 28 Op 4 . - CDS 74217 - 75128 663 ## BF4030 hypothetical protein 61 29 Tu 1 . - CDS 75238 - 76491 952 ## BF3804 putative lipoprotein - Term 76664 - 76706 7.3 62 30 Op 1 . - CDS 76885 - 77829 605 ## BF4033 tyrosine type site-specific recombinase - Prom 77850 - 77909 4.0 63 30 Op 2 . - CDS 78045 - 78473 370 ## COG1664 Integral membrane protein CcmA involved in cell shape determination - Prom 78511 - 78570 7.9 - Term 78596 - 78644 12.3 64 31 Tu 1 . - CDS 78673 - 81894 3503 ## COG0458 Carbamoylphosphate synthase large subunit (split gene in MJ) + Prom 82078 - 82137 4.7 65 32 Tu 1 . + CDS 82218 - 83309 1143 ## COG0180 Tryptophanyl-tRNA synthetase + Term 83357 - 83397 7.5 - Term 83345 - 83385 11.3 66 33 Op 1 . 
- CDS 83415 - 84602 1218 ## BF4037 major outer membrane protein OmpA - Prom 84638 - 84697 2.9 67 33 Op 2 13/0.000 - CDS 84772 - 86091 1139 ## COG0845 Membrane-fusion protein 68 33 Op 3 . - CDS 86094 - 88304 2079 ## COG2274 ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 69 33 Op 4 . - CDS 88340 - 88939 595 ## BF4040 hypothetical protein 70 33 Op 5 . - CDS 88952 - 89617 318 ## BF4041 hypothetical protein 71 33 Op 6 . - CDS 89685 - 90680 471 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 72 33 Op 7 . - CDS 90677 - 91567 603 ## BF4043 putative alpha-1,3-fucosyltransferase 73 33 Op 8 . - CDS 91617 - 94091 1588 ## BF4044 hypothetical protein 74 33 Op 9 . - CDS 94102 - 94773 528 ## COG1083 CMP-N-acetylneuraminic acid synthetase 75 33 Op 10 . - CDS 94787 - 96001 508 ## BDI_2925 hypothetical protein 76 33 Op 11 . - CDS 96020 - 97495 657 ## COG0641 Arylsulfatase regulator (Fe-S oxidoreductase) - Prom 97571 - 97630 4.6 77 34 Tu 1 . - CDS 97671 - 98873 450 ## BF4066 hypothetical protein - Prom 98920 - 98979 4.3 - Term 98942 - 98986 2.3 78 35 Op 1 . - CDS 99003 - 99260 339 ## gi|253567223|ref|ZP_04844673.1| predicted protein 79 35 Op 2 . - CDS 99277 - 100467 696 ## BF4065 hypothetical protein - Prom 100543 - 100602 3.5 - Term 100543 - 100592 -0.3 80 36 Op 1 . - CDS 100617 - 100868 282 ## gi|253567225|ref|ZP_04844675.1| predicted protein 81 36 Op 2 . - CDS 100920 - 102134 577 ## COG0438 Glycosyltransferase 82 36 Op 3 . - CDS 102131 - 102727 516 ## BDI_3162 hypothetical protein 83 36 Op 4 . - CDS 102720 - 103499 356 ## BF4055 hypothetical protein 84 36 Op 5 . - CDS 103507 - 104127 330 ## BF3829 hypothetical protein 85 36 Op 6 . - CDS 104099 - 104503 254 ## gi|253567230|ref|ZP_04844680.1| predicted protein - Prom 104530 - 104589 6.2 86 37 Tu 1 . - CDS 104622 - 106499 962 ## BF3831 hypothetical protein - Prom 106627 - 106686 8.1 - Term 106594 - 106648 3.1 87 38 Op 1 . 
- CDS 106807 - 107118 179 ## BF4049 hypothetical protein 88 38 Op 2 . - CDS 107195 - 108415 805 ## BF3834 putative lipoprotein - Prom 108570 - 108629 6.7 89 39 Tu 1 . - CDS 109180 - 110388 480 ## BF3836 hypothetical protein - Prom 110417 - 110476 3.1 90 40 Tu 1 . - CDS 110491 - 111732 712 ## BF3837 hypothetical protein - Prom 111810 - 111869 5.5 91 41 Op 1 . - CDS 111903 - 113147 930 ## BF4064 hypothetical protein 92 41 Op 2 . - CDS 113172 - 114401 523 ## BF4066 hypothetical protein - Prom 114570 - 114629 4.5 + Prom 114537 - 114596 5.9 93 42 Op 1 . + CDS 114692 - 114934 132 ## COG4680 Uncharacterized protein conserved in bacteria 94 42 Op 2 . + CDS 114946 - 115311 218 ## BF4068 hypothetical protein + Term 115340 - 115382 -0.9 - Term 115753 - 115794 1.1 95 43 Op 1 . - CDS 115853 - 117730 1738 ## COG0323 DNA mismatch repair enzyme (predicted ATPase) 96 43 Op 2 . - CDS 117767 - 118063 432 ## BF4070 hypothetical protein 97 43 Op 3 . - CDS 118066 - 119769 1549 ## BF4071 hypothetical protein 98 43 Op 4 . - CDS 119782 - 121152 1642 ## COG0760 Parvulin-like peptidyl-prolyl isomerase 99 43 Op 5 . - CDS 121162 - 122061 743 ## BF4073 hypothetical protein 100 43 Op 6 . - CDS 122061 - 123614 1100 ## COG0760 Parvulin-like peptidyl-prolyl isomerase - Prom 123656 - 123715 5.1 - Term 123644 - 123703 4.4 101 44 Op 1 1/0.000 - CDS 123759 - 125234 1670 ## COG0516 IMP dehydrogenase/GMP reductase - Prom 125258 - 125317 4.4 - Term 125244 - 125304 6.9 102 44 Op 2 . - CDS 125321 - 127501 1771 ## COG0514 Superfamily II DNA helicase - Prom 127595 - 127654 3.9 103 45 Op 1 24/0.000 - CDS 127672 - 128919 241 ## PROTEIN SUPPORTED gi|163762510|ref|ZP_02169575.1| ribosomal protein S16 104 45 Op 2 29/0.000 - CDS 128922 - 129584 943 ## COG0740 Protease subunit of ATP-dependent Clp proteases - Prom 129661 - 129720 5.0 - Term 129636 - 129694 6.4 105 45 Op 3 . - CDS 129722 - 131077 1698 ## COG0544 FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) - Prom 131100 - 131159 7.8 106 46 Tu 1 . 
- CDS 131174 - 131626 -154 ## BF3896 hypothetical protein - Prom 131793 - 131852 2.1 + Prom 132037 - 132096 6.2 107 47 Tu 1 . + CDS 132317 - 132562 260 ## COG0724 RNA-binding proteins (RRM domain) + Term 132580 - 132638 11.2 - Term 132567 - 132625 12.1 108 48 Tu 1 . - CDS 132681 - 133442 270 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 - Prom 133466 - 133525 5.5 + Prom 133405 - 133464 4.5 109 49 Op 1 23/0.000 + CDS 133518 - 134261 664 ## COG0767 ABC-type transport system involved in resistance to organic solvents, permease component 110 49 Op 2 . + CDS 134258 - 135028 304 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 + Term 135052 - 135094 4.0 - Term 135103 - 135141 -0.7 111 50 Tu 1 . - CDS 135260 - 135907 685 ## COG2197 Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain - Prom 135968 - 136027 9.1 - Term 136011 - 136048 3.0 112 51 Op 1 . - CDS 136084 - 137397 1139 ## COG1160 Predicted GTPases 113 51 Op 2 . - CDS 137450 - 138331 921 ## COG1159 GTPase 114 51 Op 3 . - CDS 138419 - 139423 905 ## COG0332 3-oxoacyl-[acyl-carrier-protein] synthase III - Prom 139445 - 139504 1.9 - Term 139452 - 139490 5.3 115 52 Op 1 . - CDS 139521 - 139706 339 ## PROTEIN SUPPORTED gi|29349241|ref|NP_812744.1| 50S ribosomal protein L32 116 52 Op 2 . 
- CDS 139720 - 140295 500 ## BF4091 hypothetical protein - Prom 140428 - 140487 7.2 Predicted protein(s) >gi|226331995|gb|ACIB01000061.1| GENE 1 246 - 779 257 177 aa, chain - ## HITS:1 COG:no KEGG:BF3743 NR:ns ## KEGG: BF3743 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 160 1 160 160 258 100.0 6e-68 MEKYYSTNCPINTTTIIKTIIVAMLFIIAAIIVFASGGYWLGISIILLLLVIMVVTYFCI PRKIIVTDTDIVLYNHGFKRKIPKCDILKARSVTAKDRNGLWRKFAVEGVWGYCGIYASK IHKNLYIYASQNKNWILIETERKNYIVSPENLDIIDVINKWYYLQFSGYWSTFKISF >gi|226331995|gb|ACIB01000061.1| GENE 2 1087 - 1653 323 188 aa, chain + ## HITS:1 COG:PA1912 KEGG:ns NR:ns ## COG: PA1912 COG1595 # Protein_GI_number: 15597108 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Pseudomonas aeruginosa # 36 172 35 162 168 60 34.0 1e-09 MQSNHLLKTQKSFSDIYSIYYVRMLRFSQTYVIAEEDAENIVQDTFLYLWEHLELLEDID HLDAFLFTLIKNRCLNFLKHQSYIQAKTCSLKADEELESQLNLYALEQFDEAVSSISEVE NLLSRTMQKLPERCREIFLLSRIEGLKYKEIAERLDISVNTVENQISIALRKLRSELKEY LPLLVFII >gi|226331995|gb|ACIB01000061.1| GENE 3 1727 - 2695 865 322 aa, chain + ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 10 316 25 319 331 73 23.0 5e-13 MNDELEKYFAGTLSATEKTEFLNKLRDNPEAKKEFARMKAVWAVSGLMAQEGDPQKTVRG IAEFDKRLKRRSVHRFRIGFFKYAAMIVLLISTTWFIANWYTQKEQKKQYTEINVPKGQR VNMTLPDGTSVWLSPQSKIKIPNEFNRKNRMVELNGEGYFEVTKNAKKPFIVKTQLFNIQ VLGTRFNVFAYAGKESKFETCLVEGRVLVYNKNNKNEKVYLNPHEKVSLVNNRMVVSTSN FDNEEYLKSGIFSFRSKPFGEILSYLTLWYNVQFNFTGDVKLDERISGKIRQSEDVDNIL IALQGVYPFKFKKTDDEHYEIY >gi|226331995|gb|ACIB01000061.1| GENE 4 2951 - 6322 3465 1123 aa, chain + ## HITS:1 COG:no KEGG:BF3971 NR:ns ## KEGG: BF3971 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 
1123 24 1146 1146 2227 100.0 0 MKLSVFFIFVFLFQLQAGNVKSQEVRVTLSKSSLTFGELMREIEKQTNYLFMYRDAEIDL SQKIEVKNTSATVKEILTTALRNKKLTYKFSNNYISLYVDKEKAPETMVTQQERKIKIKG VVIDQVGEPIIGANISLKGQPGTGDITDIEGNFTLEVPEKAVLVISYIGYLTQEIPVNGK ASFNIQMKEDTKTLDEVVVVGYGSQKKQTVTASASTLKVSSLKNVPTANLASSLGGRVSG VLIQQTGGEAGYDDPTIIIRGSSSPTSSSPLIVVDGIIGRSMSQLDPSEIESMTVLKDAS AVAPYGARGANGVILITTKRGKSGKAQVDYNFKIGFGTPTRMPEIASSYDHARFMNDAWR NKEMDLGEDPGMYGIYTEEELQKFRDGSDPYGYPNTDWNKEVLLPRAWQQMHSLTASGGS DKVKYFAGFGYVKQDALYGDTRTNKSTSGFNRYNARVNIDANIVDKYLNLSADMAYRQED RNSIAGSTSDVFNNMHRNPQTDPGRFPDGNLGKVSLGVNPIGLATEGGWVKDRKSVLNTR FMLDFNVPGLEGLNLKGIFSYDKIFNSIKRWTTPVDFYVWNKITGEYDGHSPNREGAELK QTYSTSQAMTFEFQAAYNKTIAQDHQLGALFVFSRSEGADENFWASRTKYRIYSIQQLFA GPDKDKDNSGSASETGKVGSVFRLTYNYKEKYMLEANGRLDGSEKFPKSKRYGFFPSVSA AWRISAEPFMEPFSGVLSNLKLRTSWGRAGTDNIGQFQYMSAYGTGGDAVFGGQNPEIAP GYTETRFPNPNITWETSEMFNIGIDASFFNGKLNLEADWFYKKTNDILRERTDMPAILGY KLPAANVGKVDNRGIDLNITHRNHLRDFNYSIAGNLTWARNKVIDLLEPAGEKNNPRQRS TGHSMSQYFGYEALGLFQSDEEANNWPQPQFGKAKAGDIKYKDQNGDGIIDAEDEIAIGR SNYPELVYGLNLNAEWKGFDITFFFQGAALADFYYNGYLAFPYIEGRGGALLEHHIGNTW TPENPKAEFPRLYYGGNANNQLFSSFWMRNGSYLRLKNVEIGYDFKKLLLSKVSEIQGLR LYFSGSNLLTWSQIKYFDPELRSTDGSAYPQMKTFVFGANITF >gi|226331995|gb|ACIB01000061.1| GENE 5 6345 - 7964 1524 539 aa, chain + ## HITS:1 COG:no KEGG:BF3972 NR:ns ## KEGG: BF3972 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 539 1 539 539 1090 99.0 0 MKKINTALFCLFVLLFCSCDVLNKAPLDEIADDSFWSDETLVKYYVNDLYSEISVDGLQL QENRSDNSVSAQRDKYRASWFKFNYDMVSASDPQDDDVWEDYYVKVRKCNRFFERIGTST IEESEKSRLTGEVHFLRAMFYFEMVKRYGGVILLDKVLTMEDNWEIPRSSEKECYDFILE DLKKATEMLPASYGSREKGRATKGAAYALKSRVELYDKRYEDVIKSCAEVYKLGYELVDG TTPEKYRSIWWTTNKDNKEIIFDVQYKSPDVYNNMMVCNMVTYINDKYGDRGWGGLGPTQ ELIDAFEMADGTPATQYSQAPADQVFDINTCGIYEGREPRFYANIVFHGSQIFFNADKGA VTVDRYLMDTPDKGDGSLTGYNVWKWIDYDNYNYPYAGAGSPDFSTNWIILRYAEIYLND AEARLETGDVEGARKAVNMIRQRVGLPDLTESDPEKLRELIRKERRIEFAFEEQRFYDVR RWKIGPETQITLHGVRFVSPTEFKVTKTDIRTWNDRLYLTPVPHDEIVRSSVLKQNPGY 
>gi|226331995|gb|ACIB01000061.1| GENE 6 8002 - 8376 372 124 aa, chain + ## HITS:1 COG:no KEGG:BF3748 NR:ns ## KEGG: BF3748 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 124 1 124 124 240 100.0 1e-62 MNKIISILICCFIAGCLFVSCEQPDIPDVPNYDKTEILTFKVYNQDKEEVGTPVILSDEG VVTITVDEGTDLSNVFATCTLSSGATLSPALGGYQDWSGLNKEFTVTSASGKRSKPWTVI FKTK >gi|226331995|gb|ACIB01000061.1| GENE 7 8384 - 10420 1611 678 aa, chain + ## HITS:1 COG:no KEGG:BF3749 NR:ns ## KEGG: BF3749 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 678 1 678 678 1427 99.0 0 MKYVIILILCICSSLQMQGALSALKGGKSNLALHLDGKDNNVRTGMGILEPSWTLESWIK GDDCQWDSLEVIIGGGEYSELNWVDYLPLVVKEGKIHSSRANLSSPQTLDDQWHHVALTC DGKQTILYLDGKQVDKADTATAILPGAIGVHDVYYTFGGLIDEVRVWRSALPEQTIRRWM NRPVEATHPAFKSLWGYYNFDDLKDETSVNWVGKGHQAYHIRNGRNKYNEKAPLAHAVPN DNPAFKEFDGNQQLFNAVIIQSEWDADQGSKNDQALKLRIAVQGSKNPLKLTELKLDFTG TTDLADIEQIHIYSTGSEARSTQRKELFGNGHTPEQSLTLRPTHGEEILLQPGINYFLLT FDVRSKATPGHTLYASVPFFKLNGKKIIPETSAEEVRKQVTCNNQTQSNIVKVLQWNIWH GGIHLGNEGQQRVLDLIRSSRADVIMMQEAYGIQQMLADSLGYHLKTHSLKDNLAMYSRF PLEAIAWREPFKSNPAKITLPNGKRIMFVDCWLRYAYRPEYTSGYAEKGLDPSVWVAEDS ILALPDIRNIYTKDIAPNLETDMPVIVTGDFNSCSHLDWTERAKPLHHGYGPVAFPASRY MLENGFKDSFREKNPDEVVYQGGTVAAIYGQMQMSRIDFIYYKGGLKVLSSKIVRTAPEI DYVWASDHAAVLTVFEVE >gi|226331995|gb|ACIB01000061.1| GENE 8 10516 - 12708 1118 730 aa, chain - ## HITS:1 COG:no KEGG:BF3975 NR:ns ## KEGG: BF3975 # Name: not_defined # Def: putative transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 730 1 730 730 1500 99.0 0 MIDFDYSEWAILRFIGASFVLWLCYYLLFDRKAPFNQCRNYLLFSVLLAGAVSVLRIPVY PVEVVKPVKMEQIVVAQEAQTDKVSLMQIDNNGIQPDTLATAMLTDVNEAVTHQTEEEIV EEPFYVSWNYWQIAWIVYGSGVFILLVHLLVEMVRIWRLKRWGTCTTDADGICIVRNNEV VSPFSFYRMIFINRKLEGEVLRVVLLHEKAHIRNHHYRDTLFIEGLSILCWFNPFVWLVK RELRALHEFQVDRCLLSGEIELFEYQSILFEELMGYSPKVANGFHNSLIKKRFIMMKHQY KERLAGVRKIALLPLCIGVLALFSFTESPVLVEPVLPMVSVTIKAETPKVVLPEVTVDSS 
GNEKDFLLLDTPKIVHYVQSRDAHIVQSGAGQPLAQVSIFVPKALDTLSAESSDGTFSQE QNINHTVDIDLRADQVVLSRAPRKNNAYVRFIERSKEDTRVTLAIPIHFDRHWLQFEKGL SIVDEDSKDVYRIRSVTRGIELNRVYWVVGQEGQMLEFTLVFPPLDRKVKTVSIRDCFPE EKGLTPPNGGAWTLDNLKVDNFQPTAVRQAEYDREGRPLRSDKLEEVTLNANQLSVSSRH NGGRTQIQKIETLPDKTLVVLSVPIHYDRNWLVINKGLCIVDCKTGDEYPVQEEAHGIEM NKLLWVEGCQGRSVLLTLVFPKLPKRVKTIDFYNKYPDAGIISPTNGSSWNWWKIKIKDY QKEPYKRVIL >gi|226331995|gb|ACIB01000061.1| GENE 9 12712 - 13086 458 124 aa, chain - ## HITS:1 COG:CC1640 KEGG:ns NR:ns ## COG: CC1640 COG3682 # Protein_GI_number: 16125886 # Func_class: K Transcription # Function: Predicted transcriptional regulator # Organism: Caulobacter vibrioides # 5 119 23 138 144 64 31.0 5e-11 MNEQKELTKAELQIMQVLWQKENAFVNEILQELEEPRPAYNTVSTVLRVLQNKGFVAYRS FGKNYQYYPLVSKESYTNRFMNRVVDNFFSGSVKAMVSFFTKKEKMSVQEIDELIEMLKE NKKE >gi|226331995|gb|ACIB01000061.1| GENE 10 13301 - 16588 3165 1095 aa, chain - ## HITS:1 COG:VCA0045 KEGG:ns NR:ns ## COG: VCA0045 COG0793 # Protein_GI_number: 15600816 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Vibrio cholerae # 724 1080 21 379 394 305 44.0 2e-82 MKKLFMSAVALLFAGALWAQENPLWMRHCAISPDGTTIAFTYKGDIFTVPVSGGKATQIT TNPAFDTTPIWSPDSKQIAFASDRMGSMDVFIVSKDGGEPRRLTTFSGGETPVAFTDAGH ILFTADIMPSTEDAGFPSNGQFQQIYQIPVSGGRPVMFSSMPMECISINKEGTILYQDKK GYEDYWRKHQKSPIARDIWMLRPGQTPRYEKQTTFIGEDREPVWAPDGKSFYYLSEENGT FNVYQRTPGSDTSKQVTHHKQHPVRFLSMASNGNLCYGFDGEIYTLAPGGKPQKVSVKIL SDRNDKELIRQIKTSGATEMAVSPDGKEVAFILRGDVYVTSVEYKTTKQVTNTPCQERGI DFAPDGRTLVYASERGGLWQLYTSTIVRKDEKQFTYATELKEERLTNSDVASFNPKYSPD GKEIAFLENRTAIRVINLKTKKVRTVMDAQYQYSYSDGDQWFEWSPDSKWILSEFIGIGG WNNKDIVLLNADGKGEMHNLTESGYSDGNAKWVLGGKAMVWFSDRAGYRSHGSWGAQYDA YIMFFDVDAYDRFRMNKEDLALLEEAEKAEKAEKEKAEKKKKENKKDDKKKDGKEKNKKD GDEEKKEEVKPLKFDLDNRFDRIVRLTVNSSFMGDAVLTPKGDKLYYLAAFESGYDLWEH DLKENSTKILLKGVGGGSLLPDKKGENIFMCTGGGMKKIEIAGSKTTPIAFESFFDYQPG GERAYIFDHVWQQVDDKFYVKDLQGVDWPLYKKSYEKFLPYINNNYDFAELLSEMLGELN ASHTGARYSGAGGALATAALGVFYDDTYNGDGLKIKEIIEQGPFTLKKTDVKPGCIIEKV 
DGTLIKKGEDYFPLFEGKVGRKVLLSVYDPATKKRFEETVKAISYGAQRELLYKRWVERN RKKVEELSGGRLGYVHIKGMDSQSFRKMYSELLGRYRNKEAVVVDVRHNGGGWLHDDVVT LLSGKEYQRFVPRGQYIGSDPFNKWLKPSCMLVCEDNYSNAHGTPYVYKTLGIGKLVGTP VAGTMTAVWWERQIDPSLVFGIPQVGCMDMQGNYLENHTLEPDVLIYNEPAASLKGEDAQ LKAAVDCLLKELPKK >gi|226331995|gb|ACIB01000061.1| GENE 11 16695 - 17636 1245 313 aa, chain - ## HITS:1 COG:BH3158 KEGG:ns NR:ns ## COG: BH3158 COG0039 # Protein_GI_number: 15615720 # Func_class: C Energy production and conversion # Function: Malate/lactate dehydrogenases # Organism: Bacillus halodurans # 3 307 7 311 314 266 47.0 5e-71 MSKVTVVGAGNVGATCANVLAFNEVADEVVMLDVKEGVSEGKAMDMMQTAQLLGFDTTIV GCTNDYAQTANSDVVVITSGIPRKPGMTREELIGVNAGIVKSVAENLLKYSPNAIIVVIS NPMDTMTYLALKSLGLPKNRVIGMGGALDSSRFKYFLSQALGCNANEVEGMVIGGHGDTT MIPLARLATYKGQPVSTLLSEEKLNEVVASTMVGGATLTKLLGTSAWYAPGAAGAYVVES IIHNQKKMVPCSVMLEGEYGESDLCIGVPVILGKNGIEKIVELELNADEKAKFAASAAAV HKTNAALKEVGAL >gi|226331995|gb|ACIB01000061.1| GENE 12 17823 - 18695 990 290 aa, chain + ## HITS:1 COG:no KEGG:BF3979 NR:ns ## KEGG: BF3979 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 290 1 290 290 473 100.0 1e-132 MNKKNLLIIAVSVLVLALIGLTYLLFSEKQTNRELVQEFQLDKEDLENEYTRFAQQYDEL KLTVSNDSLADLLAQEQVKTQRLLEELRTVKSSNATEIRRLKKELATLRKVMIGYINQID SLNKLTAQQKQVIAEVTQKYNAASRQINNLSEEKKNLNQKVTLAAQLDATNIWIEPRNKR GKKAKKVKDIVKLAIGFTIVKNITAETGERTLYIRITKPDNDALTKSPSNTFPYENRTLT YSIKKYIEYNGEEQTINVFWDVEEFLYAGAYRVDIFADGTMIGSQKFTLD >gi|226331995|gb|ACIB01000061.1| GENE 13 18818 - 19759 234 313 aa, chain - ## HITS:1 COG:PA2388 KEGG:ns NR:ns ## COG: PA2388 COG3712 # Protein_GI_number: 15597584 # Func_class: P Inorganic ion transport and metabolism; T Signal transduction mechanisms # Function: Fe2+-dicitrate sensor, membrane component # Organism: Pseudomonas aeruginosa # 115 275 132 290 331 71 31.0 2e-12 MEQREDNKNDIASRRLKNLFGEALGDLSSVEETETAWQAFASRRRQERVRTLVFGFAAVA SVALLLFWGFSQENFLSQEVEVFASVNSPDKLVMTEDKGIIAVRTPPATTITIHLPDSTE 
VLLNANSRLEYPKAFTGDLRRVMLEGAARFNVQRDTLHPFIVETGSLQTRVLGTVFDVDS YGCGTTSKVVLYEGSVQVSDKANTKACKIKPGEQVYLDRVGDICISQADICMQKSWTEGL FIFDNVTLRYVMQEIGAWYNTNIVFRSHSLLEERIYFSASRHLPVGEILNVLNDLQIARF IVEGDKIVVSPLS >gi|226331995|gb|ACIB01000061.1| GENE 14 19740 - 20294 396 184 aa, chain - ## HITS:1 COG:no KEGG:BF3981 NR:ns ## KEGG: BF3981 # Name: not_defined # Def: RNA polymerase ECF-type sigma factor # Organism: B.fragilis # Pathway: not_defined # 1 184 1 184 184 325 100.0 5e-88 MAKLTIFRFRGHISREERFRQLFVEIYPRLLRYAIQLMSDREEAKDIVGEVMEEAWKCFD RLEAETQNAYFYTATRNTCLNRLKHLRVEQQHLDTLREVTRMDVNTGYRQHEAQLQQAET IACSLSEPTRTILRLCYWEKLTYRQVAERLEISPDTVKKHISKALRILRNEMNGKEETNG TERG >gi|226331995|gb|ACIB01000061.1| GENE 15 20334 - 22901 1621 855 aa, chain - ## HITS:1 COG:no KEGG:BF3982 NR:ns ## KEGG: BF3982 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 855 1 855 855 1730 100.0 0 MNWGIHDRRGWMAVVLLLCACPFSAQNTGQITLELRNKPLPAVLKLIEKAGEKHIIFSYN ETETYHVTASIHQRNESEALSIVLKSTPFIYKERENYFVIQKGSIDKRLTTIRGSVIDEN NEPLVCANVLLLDKADSAFVNGVVTNQDGSFRIPGEEGRDYLLKTSYIGYQTKIQPCGAM NKVCLFSDTQLMKEVVISVDHPLIVHKDNGLLANVVGTPLAKMGSAAEMISHLPFVTGGI GEYMVLGHGVPVIYINGRKIRDQGELERLRADDILSAEVITTPGVEYGSDVSSVIRIRTI RRRGQGISSGFRGVFSQGHDYNASENLYLNYRTGGLDLFVKGDLKHGNYYQESILNQETD ASSRWEVKGGAASFHKAVYFSGEVGFNYELDDKNSLGARYMPGANVGSVNRTNLGNNFVY KDGEKIEEISSLQHAHTYPTWTHSVNGYYNGVFGQWNVDFNADYLLGKNNSTNEVFNNDD KAAQSENEVRNYLYAMRMVVKRSFRKGTLSFGTEETFTNRHDVFVQSGFSDNADDHIKQS IYSVFADYSLHLDKFNVAVGLRYEHQKTDYYEYGVHQDEQSPVYNDIVPVALVGYEDKGW HASLSYRLIRNNPDYHMLSSSITYSSKYMYRSGDPLLVPQKHHVFILDAGSRWAFVNLFF DRTLDLYTRFLKPYNDETHPGVLLFTMASIPTTDTYGMNLNVSPKIGCWQPQLNGGMYFY DADVRSLGITRHWNEPQFYFELDNSFTFPDGWFLNVNGNISTAAKQSYSLIHREGTVNAR LSKSFLEDALMITLTADDIFHTRYHYMDGYGVRSHILTRSYNDNQRFGIQISYKFNATKS KYKGTGAGQSEKQRL >gi|226331995|gb|ACIB01000061.1| GENE 16 23482 - 24936 1406 484 aa, chain + ## HITS:1 COG:HP1489 KEGG:ns NR:ns ## COG: HP1489 COG1538 # Protein_GI_number: 15646098 # Func_class: M Cell wall/membrane/envelope biogenesis; U 
Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Helicobacter pylori 26695 # 12 462 37 481 510 79 23.0 1e-14 MKKLLFLFFLLTTPFSLKSQGILSLDSCRALAIANNKELLISGEKINAAHYQKKAAFTNY LPNFSATGAYMRNQKEFSLLNNDQKAALSGLGTSVSGPLQQAAQVIGQLHPELAPMLSQL GGAIVPALNEAGTAIVDAFRTDTRNVYAGAITLTQPLYMGGKIRAYNKITKYAEELARQQ HNSGMQEVILSTDQAYWQVISLVNKKKLAESYLKLLQKLDSDVEKMIAEGVATKADGLSV RVKVNEAEMTLTKVEDGLSLSRMLLCQLCGIDLSTPIVLADEQVDDLPLIPATTNFEIET AYANRPEIRSLELAAKIYQQKINVTRSEHLPSVALMGNYMVTNPSVFNSFENKFKGMWNV GVMVQLPIWHWGEGIYKVKAAKAEARIAQYQLEDAKEKIELQVNQSAFKVNEAAKKLAMA QKNLEKADENLRYATIGFEEGVIAPSNVLEAHTAWLSAQSEKIDAQIDVKLTEIYLQKSL GTLK >gi|226331995|gb|ACIB01000061.1| GENE 17 25014 - 26006 1220 330 aa, chain + ## HITS:1 COG:VC1607 KEGG:ns NR:ns ## COG: VC1607 COG0845 # Protein_GI_number: 15641615 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Vibrio cholerae # 17 329 6 323 324 211 44.0 1e-54 MTSQKSQNSNMLLAFLTLLGVIVLVAVVGFFMLRKGPEIIQGQAEVTEYRVSSKVPGRIL EFRVKEGQKVQAGDTLAILEAPDVIAKMEQARAAEAAAQAQNEKAIKGARQEQIQAAYEM WQKAIAGVDIAEKSYKRVKNLFDQGVMPAQKLDEVTAQRNAAIATEKAAKAQYTMAKNGA EREDKMAAAALVDRAKGAVAEVESYLKETYLIAQAAGEVSEIFPKVGELVGTGAPIMNIA IMDDMWVTFNVREDLLKNLTMGSEFDAIVPALDNQTIRLKVHYMKDLGTYAAWKATKTTG QFDLKTFEVKATPLEKVTNLRPGMSVIIKK >gi|226331995|gb|ACIB01000061.1| GENE 18 26141 - 27184 699 347 aa, chain + ## HITS:1 COG:VC1608 KEGG:ns NR:ns ## COG: VC1608 COG0842 # Protein_GI_number: 15641616 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, permease component # Organism: Vibrio cholerae # 4 309 45 348 387 124 27.0 2e-28 MDSGLPTNMPVGAVDLDNSASSRNILRNLDAFGQTAVVAHYSSVNEARTAMQEGKIYGFF YIPKGMSADAQSQRQPKLSFYTNNSYLIAGSLLFKDMKMMSELASGAAARSVLYAKGATE DQAMGYLQPIVIDTHPIQNPWLNYSVYLCNTLVPGVLMLLIFMVTVFSIGVEIKDRTARE WLRLSNNSIYIALAGKLLPQTVVFFIMGIFYNVYLYGYLHFPCNSGILPMLLATLCLVLA SQCCGILMIGTLPTLRLGLSFASLWGVISFSISGFSFPVMAMNPVLQALSNLFPLRHYFL IYVDQALNGYSMAYSWSNYMALLIFMLLPFFVVHRLKEALIYYKYVP 
>gi|226331995|gb|ACIB01000061.1| GENE 19 27184 - 28434 1189 416 aa, chain + ## HITS:1 COG:VC1609 KEGG:ns NR:ns ## COG: VC1609 COG0842 # Protein_GI_number: 15641617 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, permease component # Organism: Vibrio cholerae # 17 383 29 396 408 155 28.0 1e-37 MKDISLKDKITQGINDLFYIWKREFRTTFRDQGVLIFFVLVPLVYPLIYSFIYTNEVVRE VPAVVVDDSRSSLSREYLRKVDATPDIQIVAYCADMEEAKQMLKDRLAYGIIYIPKDFSS DIALGKQTQVSIYCDMSGLLYYKSMLLANTAVSLDMNEDIKIARSGNTTDRQDEITAYPI EYEDVAMFNPTNGFAAFLIPAVLILIIQQTLLLGIGLSAGTARENNRFKDLVPINRHYNG TLRIVLGKGLSYFMVYALVSVYVLCAVPRMFSLNQIGQPGTLALFILPYLMACIFFAMTA SIAIRNRETCMLIFVFTSVPLLFISGISWPGAAIPPFWKYFSYIFPSTFGINGFVRINNM GATLSEIPFEYKALWIQTGFYFLTTCWVYRWQIIKSRKHVIDKYKEMKNRGKEFFS >gi|226331995|gb|ACIB01000061.1| GENE 20 28481 - 28729 88 82 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSTDTLMAPLGDITSITLAYTLLPLITANASNRDIIFLFITCVLLLYVLCLHLLHEDIDA SNVLIIFVSSKSYSIYFYAELS >gi|226331995|gb|ACIB01000061.1| GENE 21 28611 - 30560 1663 649 aa, chain + ## HITS:1 COG:no KEGG:BF3763 NR:ns ## KEGG: BF3763 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 649 1 649 649 1357 100.0 0 MKRKMMSLLLALAVISGSSVYAKVIDVMSPNGAIKVSVDIKDRIYYSVSYDNDQLLKDCY LNLQLQNETLGTNPHLRSTKRGTIDESVKREIPFKNAIVRNHCNTLRMNFSGNYAVEFRV FDNGIAYRFVTDKKGDNIVMGEDFAINFPTNYKAHLSQPDGFKTSYECPYTHVDTEKYAA TDRMSYLPVLIETDKAYKILISEADLSDYPCMFLKSTGKNGMQSIFPKAPLAFGEDGDRS LKITEEADYIAKTDGKRSFPWRMMVISKEDKELIENEMVYNLSAPCVLEDYSWIKPGQVS WEWWHDARLYGVDFRSGFNMDSYKYYIDFASKFGIPYIIMDEGWAKNTRDPFTPNPTINL TELIKYGKDRNVKIVLWLPWLTVENHFDLFKTFADWGIAGVKIDFMDRSDQWMVNYYERV AKEAAKHKLFVDFHGAFKPAGLERKYPNVLSYEGVLGMEQGGNCKPENSIYLPFMRNAVG PMDFTPGSMISAQPEDNRSTRANAMGSGTRAFQMALFIIFESGLQMLADNPVYYYRELPC TEFITSVPVTWDETKVLYAKVGEAVVVAKRKGEQWFIGGITGNQPQNIEIDLGFIPAGQS FTLTSFEDGINADRQAMDYKKKESTVNNQTRMTLKMVRNGGWAGTIKMK >gi|226331995|gb|ACIB01000061.1| GENE 22 30710 - 31390 649 226 aa, chain + ## HITS:1 COG:CAC3448 KEGG:ns NR:ns ## COG: CAC3448 COG2755 # 
Protein_GI_number: 15896689 # Func_class: E Amino acid transport and metabolism # Function: Lysophospholipase L1 and related esterases # Organism: Clostridium acetobutylicum # 49 225 15 190 190 65 31.0 8e-11 MLMAVFCLGASLIGINAQEKDWANLQRYAQQNAELPKPDKNEKRVVFMGNSITEGWVNTH PDFFKSNGYIGRGIGGQTSYQFLVRFREDVINLSPALVVINAATNDIAENTGAYHEDRTF GNIVSMVELAKANHIKVILTTTLPAAAFGWNPAIKDAPQKIASLNARLKAYAQTNKIPFV DYYSSMVSGSNKALNPAYTKDGVHPTSEGYDVMENLIQQAINKTLR >gi|226331995|gb|ACIB01000061.1| GENE 23 31411 - 32160 432 249 aa, chain + ## HITS:1 COG:no KEGG:BF3990 NR:ns ## KEGG: BF3990 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 249 1 249 249 505 100.0 1e-142 MKPFISFCLLLCICLGKLYAQGVNIVYIGNSITQGALLKTPVTEAPPVQASQYIEQATKQ SVAFRNCGVSGTTTLDFLPIAERQFPNVKSAAQELSQRKGTLLFSIMLGTNDSACNGPFG SPVEPVSYYTNMKAIIDELLSLYPECKVVIHQPIWYSPNTYNGAMYLAAGLKRLKSYTPM IHKLVDYYSQANPNQVFWGDTAASDFFRNNYQSYFTPENGNAGTFYLHPNKEGAGILGKY WAEAILKAI >gi|226331995|gb|ACIB01000061.1| GENE 24 32361 - 32726 429 121 aa, chain + ## HITS:1 COG:no KEGG:BF3991 NR:ns ## KEGG: BF3991 # Name: not_defined # Def: transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 121 1 121 121 199 100.0 2e-50 MEKLTMQEEEVMIYIWELESCYVKDIVAKFEQPTPPYTTVASIVKNLERKKYVKAQRVGN TYLYTPSIKESEYKRSFMSGVVRNYFENSYKEMVSFFAKDQKISTNDLKEIIDMIEKGQE K >gi|226331995|gb|ACIB01000061.1| GENE 25 32747 - 34480 1083 577 aa, chain + ## HITS:1 COG:no KEGG:BF3992 NR:ns ## KEGG: BF3992 # Name: not_defined # Def: TonB # Organism: B.fragilis # Pathway: not_defined # 1 577 1 577 577 1158 99.0 0 MTPELAYFLKINIAIALFYAFYRLFFYRDTFFQWRRTALLCFLVISPLYPVLNIQEWIRS HEPMVAMVDLYATIVLPEIEITPEVRAADWQSIILSTVNIIYWSGVTLLLVRFFAQLASI LRLRLRCRKDQIEGIPVYLLDKESGPFSFFHWIFIYPQAHPQNELSEILTHEGTHARQRH SIDVIISELMCIACWFNPFMWLMKREVRNNLEYMADNRVLEAGHDSKSYQYHLLGLAHQK SAITLSNSFNVLPLKNRITMMNKKRTKEIGRTKYLLFIPLALALMIVSNIEAVARTTKSI AQEVMQTVEEQMEPENAVAVKESTSSPQAANHISQPQESGIAEQPAQEESREQIVFEVVE KMPEFPGGVRNMNHFINSHLRYPVIAQENGTQGQVIAQFVIQADGTLSDLKIVKSVDPLL 
DAEAMRVIKEMPKWQPGKQRGIAVATRVTVPIRFRLMDSDSAPSTDSNKNQENTIFDVVE HAPEFPGGMEACLKYMYKNIKYPAVAMEAGIQGQVVIQIVIDKDGKIHDPKIVRGVSPEL NAEAIRVISNMPQWIPGKQKGKNVATRFTLPVRFRLA >gi|226331995|gb|ACIB01000061.1| GENE 26 34523 - 36424 1493 633 aa, chain + ## HITS:1 COG:no KEGG:BF3768 NR:ns ## KEGG: BF3768 # Name: not_defined # Def: putative exported thioredoxin # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 633 1 633 633 1285 99.0 0 MKKITFIAILLLCICSLTKAKEKVIEQPPFIAWTSTSIQVDKVVLSDTATVLYIKAFYHP KQWIRISGQSFLKDNNGETYALRSGIGIKPDTEFWMPESGEGEFRLVFPPIPASATSIDF SEGDNVQGAFKIWGIQLKGKALPELLLPQEAIVHKIDINDELPEPKIEYKDATIKGRILD YRPGLVSKIVPIIFDPVKGAYESEEVKINNDGTFVTRVKVPTTTSAAIRLFGKMITFYAV PGEESSIIINTRELCRQQSKFHKDDKPYGEAVYFGGTLAGLSQEYSNCTLKTSILNDYRQ LFKDVAGMDAGAYKDFIYGKRANLLASIEKAPISKALKAVLGNQVDAEAAQAISLTEMVI KQAYTMEHKMSKEEAREYFNTAQIELPENYYANAFRPLASINTMAALYNSKLSELIPYYL RRRTDELTKAWGTDKGIYFDINKAGELYSSIKEFTPLTAEQEVILATLPPACQNEVRDAN NQLLKTLEANKKKTGFTINEVGDVSNEELFSSIISKYRGKVILVDFWATWCGPCRMANKA MLPLKEELKGKDIVYLYITGETSPLKTWENMIPDIHGEHFRLTDAQWSFLGDKFDIRGVP TYLIIDREGNVKHQKTGFPGVAQIKEELMKVYD >gi|226331995|gb|ACIB01000061.1| GENE 27 36515 - 37621 1284 368 aa, chain - ## HITS:1 COG:alr0652 KEGG:ns NR:ns ## COG: alr0652 COG0489 # Protein_GI_number: 17228148 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: ATPases involved in chromosome partitioning # Organism: Nostoc sp. 
PCC 7120 # 8 351 10 350 356 256 42.0 5e-68 MTIYPKLILDALATVRYPGTGKNLVEAGMVEDNIRIEGMKVSFSLIFEKPTDPFMKSVIK AAETAILTHVGKEVEIVGNISVKTVQAARPEVGKLLPHVKNIIGISSGKGGVGKSTVSAN LAVALAKLGYKVGLLDADVFGPSMPKMFQVEDARPYAEKIDGRDMIIPVEKYGVKLLSIG FFVDPDQATLWRGGMASNALKQLIGDAAWGDLDYFLIDLPPGTSDIHLTVVQTLALTGAV VVSTPQAVALADARKGINMFTNDKVNVPILGLVENMSWFTPAELPENKYYLFGKEGAKKL AEEMNVPLLGQIPIVQSICEGGDKGTPVALDENTVTGRAFLALAASVVRQVDKRNVEMAP TKIVELHK >gi|226331995|gb|ACIB01000061.1| GENE 28 37630 - 38388 856 252 aa, chain - ## HITS:1 COG:CAC2627 KEGG:ns NR:ns ## COG: CAC2627 COG0220 # Protein_GI_number: 15895885 # Func_class: R General function prediction only # Function: Predicted S-adenosylmethionine-dependent methyltransferase # Organism: Clostridium acetobutylicum # 30 215 23 206 211 118 35.0 1e-26 MGKNKLEKFADMASYPHVFEYPYSAVDNVPFDMKGKWHKEFFKNDNPIVLELGCGRGEYT VGLGRMFPDKNFIAVDIKGARMWTGATESLQAGMKNVAFLRTNIEIIDRFFAEGEVSEIW LTFSDPQMKKATKRLTSTYFMERYRKFLVSNGIIHLKTDSNFMFTYTKYMIEENGLPVEF ITEDLYHSDLVDDILGIKTYYEQQWLDRGLSIKYIKFLLPQEGELREPDIEIELDSYRSY NRSKRSGLQTSK >gi|226331995|gb|ACIB01000061.1| GENE 29 38398 - 39204 759 268 aa, chain - ## HITS:1 COG:MTH1101 KEGG:ns NR:ns ## COG: MTH1101 COG1237 # Protein_GI_number: 15679112 # Func_class: R General function prediction only # Function: Metal-dependent hydrolases of the beta-lactamase superfamily II # Organism: Methanothermobacter thermautotrophicus # 4 258 2 251 260 140 35.0 3e-33 MNYKITTLAENSVYGKGLQGEHGLSLLVEAGEHKVLFDTGASDLFLRNARLLGLDLSDVE YVVLSHGHRDHTGGLYAFLKMNSVAKVVCKREVFRKKFKNERENGMLHPEALDKSRFWLV DETTEIVPGVFAFPDVKVVDRNDTHFEHFFTEIEGEMRPDTFEDELALVLKGEKSLSVLS ACSHRGITNIIRAVQNAFPGLGLKLVMGGFHIHNAEEEKFNVISAFLGMKLPKRLGVCHC TGIDKYALFRQQFNDRVFYNYTGWVETI >gi|226331995|gb|ACIB01000061.1| GENE 30 39216 - 40235 1193 339 aa, chain - ## HITS:1 COG:L0086 KEGG:ns NR:ns ## COG: L0086 COG0115 # Protein_GI_number: 15673270 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Branched-chain amino acid 
aminotransferase/4-amino-4-deoxychorismate lyase # Organism: Lactococcus lactis # 4 339 5 340 340 342 51.0 7e-94 MKEIDWANLSFGYMKTDYNVRINFRNGAWGELEISSSEVLNLHMAATCLHYGQEAFEGLK AFRGKDGKVRIFRLEENAARLQSTCQGIMMAELPTERFKEAILKVVKLNERFIPPYESGA SLYIRPLLIGTSAQVGVHPADEYMFVVFVTPVGPYFKGGFSTNPYVIIREYDRAAPHGTG IYKVGGNYAASLRANKKAHDLGYSCEFYLDAKEKKYIDECGAANFFGIKDNTYITPKSTS ILPSITNKSLMQLAEDMGMKVERRPVPEEELSTFEEAGACGTAAVISPIERIDDLENGKS YVISKDGKPGPVCEKLYNKLRGIQYGDEPDTHGWVTIVE >gi|226331995|gb|ACIB01000061.1| GENE 31 40281 - 40493 280 70 aa, chain - ## HITS:1 COG:no KEGG:BF3998 NR:ns ## KEGG: BF3998 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: Mismatch repair [PATH:bfr03430] # 1 70 1 70 70 74 100.0 1e-12 MPAKKKETYSQAIERLEKIVRQIDSNELEIDELSEKIKEANEIIAFCTGKLTKADQEIEK LLQEKRLSEE >gi|226331995|gb|ACIB01000061.1| GENE 32 40589 - 41821 1296 410 aa, chain - ## HITS:1 COG:DR0186 KEGG:ns NR:ns ## COG: DR0186 COG1570 # Protein_GI_number: 15805222 # Func_class: L Replication, recombination and repair # Function: Exonuclease VII, large subunit # Organism: Deinococcus radiodurans # 5 410 28 416 416 150 33.0 3e-36 MQDTLSLYELNALVRRSLEQCLPDEYWVQAELSDVRTNSTGHCYLEFVQKDPRSNNLIAK ARGTIWANIYRLLKPYFEESTGQLFTSGIKVLVKVTVAFHELYGYSLTVQDIDPTYTLGD MARRRREILRQLEEEGVLTLNKELEMPLLPQRIAVISSATAAGYGDFCHQLQHNPRGFYF RTELFPALMQGNQVEESVLAALDAVNARVDEFDVVVIIRGGGATSDLSGFDTYLLAAACA QFPLPVITGIGHERDDTVLDSVAHTRVKTPTAAAELLIDRMEEAADRLGALAEELHARVF YRLEQERRRLALLQARIPSQVMRKLSESRIKLQMAKSNLSHAAETLLARQHHRLELLQNR IADASPQKLLKRGYSITLKDGKAVKSAACLQSGDELITRLYEGEVMSRVE >gi|226331995|gb|ACIB01000061.1| GENE 33 41825 - 43180 1340 451 aa, chain - ## HITS:1 COG:BS_aprX KEGG:ns NR:ns ## COG: BS_aprX COG1404 # Protein_GI_number: 16078789 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Subtilisin-like serine proteases # Organism: Bacillus subtilis # 154 436 138 431 442 88 29.0 2e-17 MKKLALIALVFVALGAPAQQDTLKYRISLKDKAATTYSLDHPEKFLSEKAIARRQRQQLP 
VDSTDLPVCRRYVDAIRDKGVKIVAMGKWDNFVTVSCNDSAVIGEIAALPFVRATEKVWV APSKPAAEDKRDSLANSPLKSENYYGPALRQIEISNGEKLHEAGFKGQGMTIAVIDAGYH NVDKIEAMKNIRILGTKDFVEPGSDIYAKGSHGMAVLSCMAMNDPYVMVGTAPEASYWLL RSEDEASEHLVEQDYWAAAVEFADSVGVDVVNTSLGYFTFDDSTKNYKYRDLDGHHALMS RQASKMADKGIVLVCSAGNSGASSWKKITTPGDAENVLTVGAIDRRGVLASFSSIGNTAD NRVKPDVVAVGLNSDVMGTNGNLRKASGTSFASPILCGMVTCLWQACPQLTAKEIIELVR QSGDRVDFPDNIYGYGVPDLWKAYQSVSKKK >gi|226331995|gb|ACIB01000061.1| GENE 34 43208 - 44296 1111 362 aa, chain - ## HITS:1 COG:CAC2233 KEGG:ns NR:ns ## COG: CAC2233 COG0482 # Protein_GI_number: 15895501 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain # Organism: Clostridium acetobutylicum # 6 345 3 354 355 263 38.0 5e-70 MKESKKRVLVGMSGGIDSTATCLMLQEQGYEIVGVTMRVWGDEPQDARELAARMGIEHYV ADERVPFKNTIVKNFIDEYRQGRTPNPCVMCNPLFKFRMLIEWADKLGCDWIATGHYSRL EERNGHIYIVAGDDDKKDQSYFLWRLGQDVLRRCIFPLGNYTKQTVRDYLREKGYEAKSK EGESMEVCFIKGDYRDFLREQCPELDAEVGPGWFVSSEGVKLGQHKGFPYYTIGQRKGLE IALGKPAYVLKINPQKNTVMLGDAGQLRAEYMVAEQDNIVDEDELFACPDLAVRIRYRSR PIPCRVKRLEDGRLLVRFLAEASAIAPGQSAVFYEGRRVLGGAFIASQRGIGLVIEQNKD WK >gi|226331995|gb|ACIB01000061.1| GENE 35 45244 - 46104 442 286 aa, chain + ## HITS:1 COG:no KEGG:BF3778 NR:ns ## KEGG: BF3778 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 286 1 286 286 572 100.0 1e-162 MANPKLPGIPEAEQALLYAKLNEYNRGRMSYKEAGAYFVVLPRPGHPTYSVWIYSPTLEK NRLLFIHELSADINESLRMASTLFFFSRRCLLIVEYNEKRMQSNGDDIISFGRYRGHYLH EILKVDPAYLSWIAYKYTPKIPKQERFVAIAQVYHSVHLDIMQRKARQKREAGRFLGNEG EKLEGLNLKVVRVRLEDDPYKTRVMGTSVQFFVRQIVTLTDPSGNLVVLRISSKTPSPVS CQLPALEHEFRPGEIVHIASARIARTYESYGSKYTRLSHVKFHPAD >gi|226331995|gb|ACIB01000061.1| GENE 36 46101 - 47111 833 336 aa, chain - ## HITS:1 COG:CAC2806 KEGG:ns NR:ns ## COG: CAC2806 COG1409 # Protein_GI_number: 15896061 # Func_class: R General function prediction only # Function: Predicted phosphohydrolases # 
Organism: Clostridium acetobutylicum # 1 311 2 312 317 156 36.0 6e-38 MKKILGTLLAVLFSFMAINDSAAQNTVLRFNKDGKFKIVQFTDVHFKYGNPASDVALERI GEVLDAEHPDLVIFTGDVVYSSPADKGMLQVLGQVEHRHLPFVVTFGNHDNEQGKTRAEL YDLIRGVAGNLLPDRGTSPSPDYILTVKSSADASKDAALLYCMDSHSYSSLKDVDGYAWL TFGQVSWYRAQSAAYTARNGGKPYPALAFFHIPLPEYNEAAANENAILRGTRMEKACAPQ LNTGMFAAMKEAGDVMGVFVGHDHDNDYAVMWKNILLAYGRFTGGNTEYNHLPNGARVIV LNEGTRTFDTWIRQKGGVVDSTSYPSDYVKDDWRKR >gi|226331995|gb|ACIB01000061.1| GENE 37 47140 - 47619 624 159 aa, chain - ## HITS:1 COG:NMB1512 KEGG:ns NR:ns ## COG: NMB1512 COG0245 # Protein_GI_number: 15677365 # Func_class: I Lipid transport and metabolism # Function: 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase # Organism: Neisseria meningitidis MC58 # 3 159 4 160 160 172 53.0 2e-43 MKIKVGFGFDVHQLVKGRELWLGGILLEHEKGLLGHSDADVLVHAICDALLGAANMRDIG YHFPDNAGEYKNIDSKILLKKTVELIAAKGYQIGNIDATICAERPKLKAHIPSMQQVLAE VMGIDADDISIKATTTEKLGFTGREEGISAYATVLINRV >gi|226331995|gb|ACIB01000061.1| GENE 38 47658 - 48272 609 204 aa, chain - ## HITS:1 COG:YPO2082 KEGG:ns NR:ns ## COG: YPO2082 COG0179 # Protein_GI_number: 16122321 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) # Organism: Yersinia pestis # 2 190 18 205 218 151 41.0 8e-37 MKIIAVGMNYARHNEELGHTLENKEPVIFMKPDSAILKDGKPFFIPDFSNEVHYETELVV RINRLGKNIASRFAHRYYDAVTVGIDFTARDLQRRFREAGNPWELCKGFDSSAAIGTFVP VERLADVQNFHFHLDIDGKTVQQGHTADMLFRVDDIIAYVSRFVTLKIGDLLFTGTPVGG GPVSIGQHLEGYLETEKLLDFYIR >gi|226331995|gb|ACIB01000061.1| GENE 39 48276 - 48932 558 218 aa, chain - ## HITS:1 COG:lin2178 KEGG:ns NR:ns ## COG: lin2178 COG2344 # Protein_GI_number: 16801243 # Func_class: R General function prediction only # Function: AT-rich DNA-binding protein # Organism: Listeria innocua # 8 208 3 203 215 140 35.0 2e-33 MNNQIQHKDSTKVPEPTLRRLPWYLSNVKLLKQKGERYVSSTQISKEINIDASQIAKDLS YVNISGRTRVGYEVDALIAVLEDFLGFTNMHKAFLFGVGSLGGALLRDSGLSHFGLEIVA 
AFDVNPSLVGTTLNGIPIFHSDDFQKKMQEYGVHIGVLTVPIEIAQCITDTMVAGGIKAV WNFTPFRIRVPEDIVVQNTSLYAHLAVMFNRLNFNEIE >gi|226331995|gb|ACIB01000061.1| GENE 40 49115 - 49465 411 116 aa, chain + ## HITS:1 COG:alr3795 KEGG:ns NR:ns ## COG: alr3795 COG0023 # Protein_GI_number: 17231287 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation initiation factor 1 (eIF-1/SUI1) and related proteins # Organism: Nostoc sp. PCC 7120 # 33 116 33 115 115 79 57.0 1e-15 MKNSDWKDRLNVVYSTNPDYNYEMDDDEEQVTLEPSQQNLRVQLDRKNRGGKVVTLITGF VGTENDLKDLGKLLKTKCGVGGSAKDGEIIVQGDFKQKIVELLKKEGYTKTKTVGG >gi|226331995|gb|ACIB01000061.1| GENE 41 49656 - 50648 1312 330 aa, chain - ## HITS:1 COG:HI0914 KEGG:ns NR:ns ## COG: HI0914 COG0264 # Protein_GI_number: 16272851 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factor Ts # Organism: Haemophilus influenzae # 3 225 4 221 283 119 38.0 1e-26 MAVTMADITKLRKMTGAGMMDCKNALTDAEGDFDKAMKIIREKGQAVAAKRSDREASEGC VLVKVEEGFGAIIALKCETDFVAQNADFVKLTQDILDAAVANKCKTLEEVLALPMGDATV AQAVTDRTGITGEKMELDGYMVLEGATIAAYNHMNRNGLCTMVAFNKKVDEQLAKQVAMQ VAAMNPIAVDEDGVSEEVKQKEIEVAVEKTKVEQVQKAVEAALKKANINPAHVDSEDHME SNMAKGWITAEDVAKAKEIIATVSAEKAANMPEQMIQNIAKGRLAKFLKEVCLLNQEDIM DAKKTVREVLKEADPELKVVDFKRFTLRAE >gi|226331995|gb|ACIB01000061.1| GENE 42 50772 - 51608 1391 278 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53715295|ref|YP_101287.1| 30S ribosomal protein S2 [Bacteroides fragilis YCH46] # 1 278 1 278 278 540 99 1e-152 MSRTNFDALLEAGCHFGHLKRKWNPAMAPYIFMERNGIHIIDLHKTVAKVDEAAEALKQI AKSGKKVLFVATKKQAKQVVAEKAASVNMPYVIERWPGGMLTNFPTIRKAVKKMTTIDKL TADGTYSNLSKREILQISRQRAKLDKTLGSIADLTRLPSALFVIDVMKENIAVREANRLG IPVFGIVDTNSDPTNIDFVIPANDDATKSVEVILDACCAAMIEGLEERKAEKIDMEAAGE APANKGKKKSAKARLDKSDEEAINAAKAAAFLKEDEEA >gi|226331995|gb|ACIB01000061.1| GENE 43 51729 - 52115 642 128 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53715296|ref|YP_101288.1| 30S ribosomal protein S9 [Bacteroides fragilis YCH46] # 1 128 1 128 128 251 100 1e-65 
MEVVNALGRRKRAIARVFVSEGTGKITINKRDLAEYFPSTILQYVVKQPLNKLGVAEKYD IKVNLCGGGFTGQSQALRLAIARALVKINAEDKPALRSEGFMTRDPRSVERKKPGQPKAR RRFQFSKR >gi|226331995|gb|ACIB01000061.1| GENE 44 52122 - 52583 795 153 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|53715297|ref|YP_101289.1| 50S ribosomal protein L13 [Bacteroides fragilis YCH46] # 1 153 1 153 153 310 99 2e-83 MDTLSYKTISANKATVTKEWVIVDATDQTLGRLGAKVAKLLRGKYKPNFTPHVDCGDNVI IINADKVKLSGNKWNDRVYLSYTGYPGGQREMTPARLIAKPNGEDRLLRKVVKGMLPKNR LGAKLLSNMYVYAGSEHKHDAQNPKAIDINSLK >gi|226331995|gb|ACIB01000061.1| GENE 45 52602 - 52898 137 98 aa, chain + ## HITS:1 COG:no KEGG:BF4014 NR:ns ## KEGG: BF4014 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 93 1 85 102 142 77.0 3e-33 MLLLKIFFTTYQPSPDKGRLTCYELISYPFEIIIGLQKYGFSLNWQTISVFFWIYPVDNE PQAICRKQRPTRQKKTAGNKPHTINATIHNRQLATCSP >gi|226331995|gb|ACIB01000061.1| GENE 46 52918 - 53406 415 162 aa, chain - ## HITS:1 COG:no KEGG:BF4015 NR:ns ## KEGG: BF4015 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 162 1 162 162 336 100.0 1e-91 MKKLYFFTMLAAMLFAVTNVMAQKANFKPANLKGIWQLCHYVSESPDAPGVLKPSNTFKV LSDDGRIVNFTIRPGADAIITGYGSYEQLSDHTYAESIEKNIHLPMLDNQDNVLTFELVD DKVLHLKYFIEKDLNGNELNCWYKETWKRIEMPDKFPEDIVR >gi|226331995|gb|ACIB01000061.1| GENE 47 53564 - 54967 1570 467 aa, chain - ## HITS:1 COG:sll0495 KEGG:ns NR:ns ## COG: sll0495 COG0017 # Protein_GI_number: 16332045 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Aspartyl/asparaginyl-tRNA synthetases # Organism: Synechocystis # 4 467 52 513 513 555 55.0 1e-158 MEKISRTKIVDLMKREDFGAMVNVKGWVRTRRGSKQVNFIALNDGSTINNVQVVVDLANF DEEMLKQITTGACLSVNGVLTESVGTGQKAEVQAREIEVLGTCDNTYPLQKKGHSMEFLR EIAHLRPRTNTFGAVFRIRHNMAIAIHKFFHEKGFFYFHTPIITASDCEGAGQMFQVTTM NLYDLKKDENGSIVYDDDFFGKQASLTVSGQLEGELAATALGAIYTFGPTFRAENSNTPR HLAEFWMIEPEVAFNEIQENMDLAEEFIKYCVRWALDNCADDVKFLNDMFDKGLIERLEG VLKEDFVRLPYTEGIKILEEAVAKGHKFEFPVYWGVDLASEHERYLVEDHFKRPVILTDY 
PKEIKAFYMKQNEDGKTVRAMDVLFPKIGEIIGGSERESDYNKLMTRIEEMHIPMKDMWW YLDTRKFGTCPHSGFGLGFERLLLFVTGMSNIRDVIPFPRTPRNADF >gi|226331995|gb|ACIB01000061.1| GENE 48 55074 - 56474 1776 466 aa, chain - ## HITS:1 COG:SA1324 KEGG:ns NR:ns ## COG: SA1324 COG1187 # Protein_GI_number: 15927074 # Func_class: J Translation, ribosomal structure and biogenesis # Function: 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases # Organism: Staphylococcus aureus N315 # 232 461 7 238 245 166 42.0 7e-41 MSTENEEWREDSKSENTDAGRDGNRSFNREGGYSRPSYNREGGDRPYRPRFNSNSEDRPQ RSYGDRPQRPSYNREGGDRPYRPRFNSEGGDRPQRSYGDRPQRPSYNREGGDRPYRPRFN SEGGDRPQRSYGDRPQRPSYNREGGDRPYRPRFNSEGGDRPQRSYGDRPQRPSYNREGGD RPYRPRYNNDNRSQGFSRPIRRTGDYDPNAKYSKKKQIEYKEQFVDPNEPIRLNKFLANA GVCSRREADEFITAGVVSVNGEVVTELGTKIKRADVVKFHDETVSIERKVYVLLNKPKDC VTTSDDPQARLTVMDLVKGACAERIYPVGRLDRNTTGVLLLTNDGDLASKLTHPKYLKKK IYHVYLDKNLTKADMDQIAAGIQLEDGEIHADAISYSDEVKRDQVGIEIHSGKNRIVRRI FESLGYKVVKLDRVFFAGLTKKGLRRGEWRYLTEQEVNFLRMGSFE >gi|226331995|gb|ACIB01000061.1| GENE 49 56556 - 57902 1545 448 aa, chain - ## HITS:1 COG:PA2629 KEGG:ns NR:ns ## COG: PA2629 COG0015 # Protein_GI_number: 15597825 # Func_class: F Nucleotide transport and metabolism # Function: Adenylosuccinate lyase # Organism: Pseudomonas aeruginosa # 1 447 1 447 456 453 51.0 1e-127 MKLDLLTAISPIDGRYRGKAEALAAYFSEYALIKYRVQVEVEYFITLCELPLPQLKGIDK SVFESLRNIYRNFTEADAQRIKDIESVTNHDVKAVEYFLKEEFDKLGGLEEYKEFIHFGL TSQDINNTSIPLSIKEALEQVYYPLIEELIAQLKTYATEWESIPMLAKTHGQPASPTRLG KEIMVFVYRLERQLATLKACPVTAKFGGATGNYNAHHVAYPEYDWKAFGNQFVAEKLGLE REEYTTQISNYDNLSAIFDAMKRINTVMIDMNRDFWQYISMEYFKQKIKAGEVGSSAMPH KVNPIDFENAEGNLGIANAILEHLAVKLPVSRLQRDLTDSTVLRNVGTPFGHIVIAIQSS LKGLRKLLLNETAIYRDLDNCWSVVAEAIQTILRREAYPHPYEALKALTRTNQAITETSI KEFIEGLDVNEEIKKELRVITPHSYTGI >gi|226331995|gb|ACIB01000061.1| GENE 50 58124 - 60010 1642 628 aa, chain - ## HITS:1 COG:XF0840 KEGG:ns NR:ns ## COG: XF0840 COG1874 # Protein_GI_number: 15837442 # Func_class: G Carbohydrate transport and metabolism # Function: 
Beta-galactosidase # Organism: Xylella fastidiosa 9a5c # 9 619 10 601 612 399 36.0 1e-110 MRTKSIFLLLLLAVMPLCVFSQSKSTFEIKNGHFYRNGKITPVLSGEMHYARIPHQYWRH RLQMMKGMGLNTVATYVFWNLHEPEPGKWDFTGDKNLAEFIKTAGEEGMMVILRPGPYVC AEWEFGGYPWWLQNVKGMEIRRDNPEFLKYTKAYIDRLYKEVGSLQCTKGGPIVMVQCEN EFGSYVAQRKDIPLEEHRAYNAKIKQQLADAGFNVPLFTSDGSWLFEGGATPGALPTANG ESDIENLKKVVDQYHDGKGPYMVAEFYPGWLSHWAEPFPQIGASGIARQTEKYLQNDVSF NFYMVHGGTNFGFTSGANYDKKRDIQPDMTSYDYDAPISEAGWVTPKYDSIRNVIKKYVK YTIPEAPAPNPVIEIPSIQLNKVADVLAFAEKQKPVSSDTPLTFEQLNQGYGYVLYTRHF NQPISGTLEIPGLRDYAVVYVDGEQVGVLNRNTKTYSMEIEVPFNATLQILVENMGRINY GSEIVHNTKGIISPVQIAGKEIVGGWDMYQLPMDEMPDLTKLKADTHKNVPSEVAKLKGC PVLYEGTFTLDKVGDTFMDMESWGKGIVFVNGVNIGRYWKVGPQQTLYVPGVWLKKGENK IVIFEQLNETPQTEVKTVKTPVLMKLKG >gi|226331995|gb|ACIB01000061.1| GENE 51 60678 - 61229 309 183 aa, chain - ## HITS:1 COG:BS_yyaI KEGG:ns NR:ns ## COG: BS_yyaI COG0110 # Protein_GI_number: 16081137 # Func_class: R General function prediction only # Function: Acetyltransferase (isoleucine patch superfamily) # Organism: Bacillus subtilis # 2 180 4 182 184 164 46.0 9e-41 MTEIEKMRSGELADMSAPELQVRFEHAKKLLARMRCLSTYDETYRGLLEELIPDLPATSV ICPPFYCDHGDGIRLGEHVFVNANCTFLDGAFITIGSHTLIGPCVQIYTPHHPMDYLERR NPKEYAYPVTIGEDCWIGGGAVICPGVTIGDRCVIGAGSVVTKDIPDDCVAVGNPARVIR CRM >gi|226331995|gb|ACIB01000061.1| GENE 52 61236 - 62009 477 257 aa, chain - ## HITS:1 COG:STM0005 KEGG:ns NR:ns ## COG: STM0005 COG3022 # Protein_GI_number: 16763395 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Salmonella typhimurium LT2 # 1 251 1 252 257 154 36.0 2e-37 MLILLSCAKTMSDVSKTKTPLTTFPGFRKEAAEVALQMSQFSVEELERLLKVNPKIAVEN YRRYQAFHSEGTRELPALLAYTGIVFKRVHPQDFSEEDFCYAQDHLRLTSFCYGLLRPLD MIRPYRLEGDVRLPEPGNRTMFDYWKPILTDRFIADIKKAGGVLCNLASDEMRGLFDWKR VEKEVRVITPEFHVWKNGKLATVVVYTKMSRGEMTRYILKNRIESVEQLKTFAWEGFEFN EQLSDETKYVFTNGKTE >gi|226331995|gb|ACIB01000061.1| GENE 53 62131 - 64350 1982 739 aa, chain + ## HITS:1 COG:no KEGG:BF4022 NR:ns ## KEGG: BF4022 # Name: not_defined # Def: 
hyaluronoglucosaminidase precursor # Organism: B.fragilis # Pathway: Metabolic pathways [PATH:bfr01100] # 1 739 1 739 739 1476 99.0 0 MKIKRLYLLGAWLLLGVSASAQITSIQPQPQQVLSQARNLSLPDTYLIVGDTEANTHAVS ALKTLLAGKHSDKNGFRIYIGEKGDKAIRKFARQIPNHKEGYYLAINDKEIVLAGQDERG TFYALQTLAQLLNDNQLPVVEIKDYPSVRFRGVVEGFYGTPWSHEARLRQLKFYGENKMN TYIYGPKDDPYHSSPNWRLPYPEKEALQLQELVKVANENEVDFVWAIHPGQDIKWNQEDR DLLLAKFEKMYDLGVRSFAVFFDDISGEGTNPNKQAELLNYIDEKFVKVKPDVTPLVMCP TEYNKSWSNPKGNYLTTLGEKLNPSIQIMWTGDRVISDITKDGIAWINARIKRPAYIWWN FPVSDYVRDHLLLGPVYGNDTQIADQMSGFVTNPMEHAEASKIAIYSVAGYAWNPEKYNS EQTWKDAIRTILPSAADELEFFAAHNSDLGPNGHKYRRDESVELQPLSQRFLDSYLKNGS YTEADFNALEATFGKMVESGDILMTNTGNRPLIVEMMPWLRQFKLLGETGQEVLAMAKAY KEGDNSLFIRKYRHVKALQQQMFQVDQTYNQNPYQPGVKTATKVIKPLIDQTFTTVTERY NKEHGTQLDAATDYMPHKLVSDVEQLRNQPLQIKTNRVLVSPANEVIKWGAGCTLTIELD QAYPGENLDIDFGKPDVAAWGQLEISADGKEWQKVDFKQEKNRITLNLKQTPVKAVRFSN VGNAEQEVYLRRFMITLDK >gi|226331995|gb|ACIB01000061.1| GENE 54 64549 - 64932 225 127 aa, chain - ## HITS:1 COG:no KEGG:BF4023 NR:ns ## KEGG: BF4023 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 127 1 127 127 254 100.0 6e-67 MTLEEILQIEAQNVDCIFLYQEEGAWYAYEHSAFYCYSLLGILDIDWLPCPDGVSSGQKT IRVRVSEPDKFLCTPLLRLMRKRKTEYVVLCKISCGGFYYWREQQQMKFRVLQERESSCT KINEHAE >gi|226331995|gb|ACIB01000061.1| GENE 55 64947 - 66605 876 552 aa, chain - ## HITS:1 COG:no KEGG:BF3798 NR:ns ## KEGG: BF3798 # Name: aapG # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 552 1 552 552 1064 98.0 0 MKIQYIIGILIAFLFASCSHEEEEQKPAYGKIDVAVSVTLPQPESVNTLTRAGGPYTDTD IKNADLLIFDKDAKFMERVKVESDRLVVTGTGINFTVRLDATSERRIIHLVANGRSADGT SDRLNFGDITPGMTENAAISSLQTASLEHVDEGESTLLNHVMPLVMWGRFALNGINIVTK AEGVKLLRSTACIQVKKGNGGGNTGLGDFVIEGITVHQGACHGFLAPTDCTGEVNTPVVA NPVTGGIYLDYRKGWGNGAEPSLYIYERNCSASDYMGVVIAARYKGKKGYYKVVMNGNDG SPLNIVRNHRYIVTVVGVNGPGYESPDIAVASAPSNALKVELTDEDTDLPCIVADGQYRM ASSNNVFSLYGKTDVTTSATGVDICTVYSSRGIQPVLTLPDDCNWLTNLSAQALGSNKYK 
ITGDFTSAANDAVATTLTMTCDNLSQPVRVSWNPIISDQKDTDSFVLDLVGSTDRNWTVR VLNPTSPGWLFLHPSAASPGALPGDGMVSELSSKYSSHAYLHVAFGASRRGTVQMTSASG GETVARKIVVIQ >gi|226331995|gb|ACIB01000061.1| GENE 56 66640 - 69114 1810 824 aa, chain - ## HITS:1 COG:no KEGG:BF4025 NR:ns ## KEGG: BF4025 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 824 52 875 875 1608 99.0 0 MGLLLFLAFACTDDRESPGGTNGNPSLVFRMTRASADIINNTQVYLFDGDGATAGQFRQK VPDVTYAADRLTMPVAAGTWDITLVSADGDVNSGLVQPVRGQARSSLKMWETRSVGGSLP SMPELRTAYITGQQVIAGQDNVASETALLSRNVALVKVVIADAGGLDINGTHTMKLTNVP TTLNWEGGLYPNKNNPTVSAEPMTGTFTIHNNTAMAGHQYSDTLRFIVPAHKGTDYLNAL PTDTTTSHLKLSVDLASEGGTRFEKTDVVIPRVPRVNGILLVRIFLGGKLDVSTEILNWQ DTQLEADLSQTQLYTDKASVGMAFKDTIHVNTNAREYTVEKAPEATWITSVKKLDGNAVE ITADLDSYVDNHPRTSYITIKANNVTKKIPVTQRPDRGTIKVSEHKLQFSPPHPTASLDV RSVGGNWKLLSQSSKATAATTQGTAGSTSVTFTRTSTSDDSLYDEYYGEGQAVFKNMMTL DTDTVHLSNLFIGIIDDLIEVTQPTVHPDTTCTVNNVKVYGGDTRDLTIINKPSWIHPAP ETDYNPATGIFTFVCDREPNEEERYGEITLGHIDDPDYTVKVSVLQAIIVRIPEFNYFVV QFVWSANDVDVKVGFTGNPTSVVVNGITYNTSSVDAFQNQWVGWSQGGVVNYDNKELLKW GGDATSGQGETAFFNAPVINSAPYPGQHGIDPNAPGLLPRTVTLQVNAGWYRGGGIPMTC NIYAYLGGTMSHVGTNFVNQGGTLVYTSTNKFNVQASGSKQYDHICDIVYDRKKHTAKIN WKGTLWTGTRSMRVPMRSVEETQKPYWTPTIVDTYSNSYRGQGK >gi|226331995|gb|ACIB01000061.1| GENE 57 69230 - 71692 2166 820 aa, chain - ## HITS:1 COG:no KEGG:BF3800 NR:ns ## KEGG: BF3800 # Name: aapE # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 820 1 820 820 1571 100.0 0 MKKTRIHRVWILFLLGIMACLQVRAQYMPVVFDKKYGDKNQIQLVCPLAGDEVAMVGKEG QKYNLTWIGREGEVLFSLPLAGFTAVNELTELEDNRILLVGQSAVMNTKGRKDNVTLSGR AVILDRSGHIVTNIYAGGQGSELMKGALLRSGSLILSGMEPKGGNSRQGILLKVDKSGRV IYQYKNAGSGYCDQFEVLGNTTEYICAAFSGDKEKEQTTVVRLDDKGKPYYVTVIPAKRF IVTGMNANINDGSVIVTGNSSTDGGIIYKIRPEGDIVFAKTLIPANQGSVMLNQLQVARN GNILVGGSGSKGYYALLRNDGTALYSGTSNGGVRGVGMNRTTGESVVTTYDVNARRGTFV RILPTGKAEFDRTIDGNFDKVKVTNNGEVLLLSSDEGRVCMYSATGEKEFDRYVTDNKPT VYRQALTASSGELLFLGSGSRLVKLGHGLYVSDVKITKPVNGTATAVFTVTLTGYATTKE 
GAPVPVSVGYATREASATTANNFTPVKGKLSFTPSRGTADRYLVKQDIEVPVKANDLIEG VKDFELLLSDVQQSYLVKPVGKAVIEDQQAVVKLVRTERGEEGSKDILYELGLFKTDGTP LTNATGANIIVDGIYGEGTADALDFDMGLTPRVIFANGSQKSSFNVRTLEDTRYELPKTV VVNFNKVHSLSGSNVAFDGELLSCSGIVVDQPARLAIASLGDHRVNNNVVSGFFTVSLLR ASDGALLTNATGSDIIVNCVTLPDATAKEGKDFVFTNLHDLRISGDGNHSSANVNGVVLF STDTLEKQVKLKIKSVNQPTGAQPISVSDAERTAEFTIRK >gi|226331995|gb|ACIB01000061.1| GENE 58 71716 - 73566 1273 616 aa, chain - ## HITS:1 COG:no KEGG:BF4028 NR:ns ## KEGG: BF4028 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 616 1 616 616 1238 100.0 0 MRKKKIQILLSAAAFLFLMGGTMPSVAQERKIGRVERRADRNFIRQKFDKAMAQYETAIR REKDEAGQAALHLKTARLYFMVREYGRASEHYDKAMSLRPDLLGVDDVCDYVDALRFQGQ ARKAEAICLDNAYKDIYSRYQRYQNTLEALAMRHSVQEDPGFSAKRLLLNTSNAEFWVGN YGEQPFYAISYSKFNDPGKLFFHRTHYYALDEPGETGVETQKPPRYYGYFRKIPADLQNG PVTFSPDMKSMVATVIEYDKEKTTVEMANRKLRPFRTKLFYSVLKNKKKRFTKYVPAFPQ EEMSSYAHPYLFNEGKSLLFTSDMPGGYGGFDLYVVHWDEEAQAWGTPVNLGPDVNTEGD EIFPVIYKGRLIFSSNGLPGFGGYDLFSAYYDKDGVIPGSISHFPYPVNSVFNDYYMCPL DLRTAYFVSDREMASRDDIYYLRTVEDLGTQQGMPFYGMSEENAILGGALLLNGTTETVS PESVTLKQYAPEGLLMTLYFDFDSDELTDESVRRLEQFINEMGTYQFSELRFDGFADEIG SDSYNYSLSERRAESVAEFLRNHGLNVRFGIEAHGRIKLSPEEVKEEIEYHRWPEGGIDW IQVNRRARRVEIYNKR >gi|226331995|gb|ACIB01000061.1| GENE 59 73591 - 74187 387 198 aa, chain - ## HITS:1 COG:no KEGG:BF4029 NR:ns ## KEGG: BF4029 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 198 1 198 198 388 100.0 1e-107 MKSVIRHILSVAVLFCLISGVQAQSVKVNIPFWLTGSPNVGFEYTLTRQLTVNGEVLWMP YLFKKHEEVFRALQSSVELRYYVNPRNFYTNDSWDGFYIGPYAMYGNFNIGLLKHNDPLQ SYRRKGWGVSGGISTGYKFAFNSRWGLDLNIGLGYAHLQYNKYYLGGEYVNFPLERKKTK RWIGPTKFGINLTYNIFR >gi|226331995|gb|ACIB01000061.1| GENE 60 74217 - 75128 663 303 aa, chain - ## HITS:1 COG:no KEGG:BF4030 NR:ns ## KEGG: BF4030 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 303 1 303 303 620 99.0 1e-176 MRKNRLLIVLFTGVAVLLSLASCTYDYFEDETNYQVFVPEVLNKTVSDCRVLVYNDAGTL 
VGARYATSPWDKDPRMEAGLFSFRLTPGEYKVYCYTNTDSLTFVDGQHLDASAFILKSSS TGPNRYVQPSDILFQKFVPAIVHPGILQTDTAALERYTGRITVRFKKFPGNVSHIKKVQL LAEGAPVMQYLKNDTLTGRLTPEDKMFHFGTLPVQEKADVLEVDHRFIPSVENEPMRLNY TFLDENGAVINHLPVEVTERETGLPLRLLHGKRIIIEIESYTVIKISVVGWNEDIESGDT DME >gi|226331995|gb|ACIB01000061.1| GENE 61 75238 - 76491 952 417 aa, chain - ## HITS:1 COG:no KEGG:BF3804 NR:ns ## KEGG: BF3804 # Name: aapA # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 417 1 417 417 710 91.0 0 MRKKSMFLWSLLAAMTMSACSEHNELPSGEVPGGNSKANFIVKLNFEGTSMEQGGKTRAA QSTAVPETSWSNIHQVQILLYDASNIVRFSDVVTPTTGNTTFTYTDVPVGTYTMVAIANA KSSTDAINTYLDGGTTPVEWSMWNVRQKQAQNMVMKYKPGTFPTFCAADLSANAAYAEPA EIFMGAVQGVTVSSDGPVTPSPIALKREVSLMRVRLNVKDKEGNTNNENTANGVDFAQDA SIMIYRLPDHLKVMAGNAGGVSATSTATNILSISGGEVFKTTDPTSGYNAGGKVLSGNFT MWRDVVVFPNNGGRANDSATTGTADRQRQYFIVVSGRGKAGHILGDGTALPNDATVYWSG VVKENFVPNVIREVNLTLRTGGSTTVPVTPTEYGGLEITVSAPTPWDSNIVNSDIIM >gi|226331995|gb|ACIB01000061.1| GENE 62 76885 - 77829 605 314 aa, chain - ## HITS:1 COG:no KEGG:BF4033 NR:ns ## KEGG: BF4033 # Name: not_defined # Def: tyrosine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 314 1 314 314 615 100.0 1e-175 MKKELLTRFMEKRIVELKKEQRNGTAHVYQSTLNRLKNFMNGREITFSQLTPEWLALFEQ KLLADQLKWNTISTYMRMLRSVYNQALERGVATYVPRLFNKVHTGIDCPVKRAVSPEVIC RLMTDRKPLPGKLAFSRDLFVLLFLLRGMPFVDLAFLRRCDLQGNVITYHRRKTSRKLTV VVGKEAMEIIQKYMYAIPDSPYLFPIIQNPGKDEYGQYARMLRLQNYRLTQVANILGIRD RLSTYTARHTWATTALRQNYNSSLICDAMGHSSVKVTETYFQPYRDDEVNRMNSSLITYI LSKKKRGTDKKIRS >gi|226331995|gb|ACIB01000061.1| GENE 63 78045 - 78473 370 142 aa, chain - ## HITS:1 COG:CT276 KEGG:ns NR:ns ## COG: CT276 COG1664 # Protein_GI_number: 15604997 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Integral membrane protein CcmA involved in cell shape determination # Organism: Chlamydia trachomatis # 23 117 83 177 194 64 33.0 7e-11 MFGKKKNEDYSVKTVKVGVDKLTTIALGTMVKGTITVEGDLRLDGIIEGNVSCRGKVVIG 
PQGRIKGNVTCTGAVLHGMLQGDIQVAEDLIMKSGCTMNGDVYTCKLEIESKARFNGTCN TTEKDTLVTSQVVKPETVDTEK >gi|226331995|gb|ACIB01000061.1| GENE 64 78673 - 81894 3503 1073 aa, chain - ## HITS:1 COG:YJL130c_2 KEGG:ns NR:ns ## COG: YJL130c_2 COG0458 # Protein_GI_number: 6322331 # Func_class: E Amino acid transport and metabolism; F Nucleotide transport and metabolism # Function: Carbamoylphosphate synthase large subunit (split gene in MJ) # Organism: Saccharomyces cerevisiae # 4 1055 4 1051 1070 1159 54.0 0 MEKEMKKVLVLGSGALKIGQAGEFDYSGSQALKALKEEGINSVLVNPNIATIQTSEGIAD KVYFLPVTTYFVEEIIKKERPDGILLAFGGQTALNCGAELYTKGILDKYGVKVLGTSVEA IMYTEDRDLFVKKLDEIEMKTPISQAVESMEDAIAAARRIGYPVMVRSAYALGGLGSGIC ANEEEFLKLAESSFAFSKQILVEESLKGWKEIEFEVIRDANDHCFTVASMENFDPLGIHT GESIVVAPTCSLDDKELKMLQELSTKCIRHLGIVGECNIQYAFNSDTDDYRVIEVNARLS RSSALASKATGYPLAFVAAKVALGYTLDQIGEMGTPNSAYVAPQLDYYICKIPRWDLTKF AGVSREIGSSMKSVGEIMSIGRSFEEIIQKGLRMIGQGMHGFVGNDDVHFEDLDKELSHP TDLRIFALAQAMEEGYTIERIHELTKIDPWFLGKLKNIVDYKAKLSAYDKVEDIPADVLR EAKVLGFSDFQIARFVLNPVGNMEKENLMVRARRKELGILPAVKRINTIASEHPELTNYL YMTYAVQGYDVNYYKNEKSVVVLGSGAYRIGSSVEFDWCSVNAVQTARKLGYKSIMINYN PETVSTDYDMCDRLYFDELSFERVLDVIDLEQPRGVIVSVGGQIPNNLAMKLYRQSVPVL GTSPISIDRAENRNKFSAMLDQLGIDQPAWQELTSLEDVKGFVEKVGYPVLVRPSYVLSG AAMNVCYDDEELENFLKMAAEVSKEYPVVVSQFLENTKEIEFDAVAQNGEVVEYAISEHI EFAGVHSGDATLVFPAQKIYFATARRIKKISRQIAKELNISGPFNIQFLARNNEVKVIEC NLRASRSFPFVSKVLKRNFIETATRIMLDAPYSRPDKSAFDIDWIGVKASQFSFSRLHKA DPVLGVDMSSTGEVGCIGDDFSEALLNAMIATGFKIPGKGVMFSSGAMKSKVDLLEASRM LFNQGYKIYATAGTAAFLNAHGVDTTPVYWPDEKPGAENNVMKMIADHKFDLIVNIPKNH SKRELTNGYRIRRGAIDHNIPLITNARLASAFIEAFCDLKLEDIQIKSWQEYK >gi|226331995|gb|ACIB01000061.1| GENE 65 82218 - 83309 1143 363 aa, chain + ## HITS:1 COG:L0358 KEGG:ns NR:ns ## COG: L0358 COG0180 # Protein_GI_number: 15672048 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Tryptophanyl-tRNA synthetase # Organism: Lactococcus lactis # 7 348 6 340 341 427 62.0 1e-119 MGKEKIILTGDRPTGKLHIGHYVGSLKRRVELQNSGSFDKTFIFIADAQALTDNIDNPEK 
VRQNVIEVALDYLACGLDPEKSTIFIQSQIPELCELTFYYMDLVTVSRLQRNPTVKTEIQ MRNFETSIPVGFFTYPISQAADITAFRATTVPVGEDQEPMIEQAREIVRRFNYIYGETLV EPEILLPDNAACLRLPGTDGKAKMSKSLGNCIYLAEEADEIQKKVMSMFTDPDHLRVQDP GKIEGNTVFTYLDAFCRPEHFERYLPDYPNLNELKAHYQRGGLGDVKVKRFLNSIMQEEL EPIRNRRKEFEKDIPAIYEMLRKGCEVARAAAADTLSDVRKAMKINYFDDAELINEQVKR FNK >gi|226331995|gb|ACIB01000061.1| GENE 66 83415 - 84602 1218 395 aa, chain - ## HITS:1 COG:no KEGG:BF4037 NR:ns ## KEGG: BF4037 # Name: not_defined # Def: major outer membrane protein OmpA # Organism: B.fragilis # Pathway: not_defined # 1 395 1 395 395 765 100.0 0 MKKILMLLALAGVTSVASAQQTTITGYEVIQVQDKYQVITNPFWDNWFFSIGGGAEALFG DNDHVGKFRDRISPTLNVAVGKWFTPGLGLRLQYSGLQGRGFAGSETADFVKSGKLANGY YKQKFNYMNLHGDVMFNLNALFGGYNSHRVYEIIPYLGAGFTHSYSKPHREAFAMNAGII NRFRVSSAVDINIEIGGMLAEDKFDGEIGGKHGYDGVASLTAGLTYRFPARGFARPMPQI ISEIELANMRRQMNDMAAANQSLQQQLVDAQNQPVAEVAEQVVVTDANIAPRTVFFTIGS SELSPREEMNLSYLAAKMKEFPDTQYTVYGYADSATGTPAFNKELSQKRAQAVVNALVKK YGVDSSRLKVDAGGGVDKFGKPIYLNRVVLVESVK >gi|226331995|gb|ACIB01000061.1| GENE 67 84772 - 86091 1139 439 aa, chain - ## HITS:1 COG:NMB1738 KEGG:ns NR:ns ## COG: NMB1738 COG0845 # Protein_GI_number: 15677583 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Neisseria meningitidis MC58 # 25 432 55 465 475 75 26.0 2e-13 MEKEHTYKEIELRSEEVQEVMNRVPAWILRSGITVLFVIVVALVAGSYWFKYPDVIAAEV TVSTQDPPAYVVARAAGRLENLYVQNGQEVEPDTNLGTIENTACASDVFSLQERMRKWKQ EGYTPESGKGLFLHSETDRWRLGEIQSAYAAFVSTLSEMVRMNELGYYAKKLQSQRELLE TQKKYYGQVRSQYFLIEKEYALAHASYTRDSILYHRQAMIITEFEQSGSRYLQSLQSRES ARMQLTQVDMQIEQSEETLLDLEKQAFEEKQTQAVNLRNATDQLQSQLTAWEQRYLLRSP VGGKVTFLNVWSVNQYVESGATVFVVAPEEESLPVGTALLPLQGSGKVKAGQRVNLRLNN YPDQEFGYVKGKVKSVSPLPTAEGMYVVDIALPDGLTTNYGKTLPLTREMKGSAEIITDD LRLLERLIMPLRKIFMEQK >gi|226331995|gb|ACIB01000061.1| GENE 68 86094 - 88304 2079 736 aa, chain - ## HITS:1 COG:alr2817 KEGG:ns NR:ns ## COG: alr2817 COG2274 # Protein_GI_number: 17230309 # Func_class: V Defense mechanisms # Function: ABC-type bacteriocin/lantibiotic 
exporters, contain an N-terminal double-glycine peptidase domain # Organism: Nostoc sp. PCC 7120 # 5 732 335 1044 1044 396 31.0 1e-110 MKLNSFPHYLQLDAMDCGPSCLRMIAKYYGKSYSLQTLRARSFITREGVSMLGISDAAES IGFRTSGVRISLEQLKKDVPLPCILHWNQNHFVVCYDIKKKRSGYRFYIADPARQLISYS EEEFKKCWLSTKVNGEEKGAALALEPGPEFQGQGDEEESGSRSLRFFLKYLSPYRKQLIQ LILGMLTASLLQLIFPFLTQSLVDVGIRDGNLNFITLILISQLVISVSQLSVEFIRSWIM LHMNTRINISLISDFLAKLMKLPLHYFDTKMIGDIMQRIGDHGRIESFLTGSSISTLFSF VNFFIFAFVLAYYNLGILAIFLVGNSLYICWILVFMKYRRELDIRRFAQAAGEQSSLIQL VTAMQEIKLNNCEKQKRWQWERIQVKLFKISVKGLALGQVQQVGSVFFNQTTNIVISFIA AKSVVEGNMTLGMMMSLTYIIGQLSGPIGSFIGFAQQLQDAKISLERLNEIHGQKDEEQD IASKLTVLPERRDIRIENLSFSYDGADRDYVLNDVNLNIPEHKVTAIVGASGSGKTTLIK LMLGFYTPNKGDIKIGETPLDVVNPHLWRAKSGSVMQDGFIFSDTIANNIAVGEEQVDVE RLRHAVTVANIRDFIDSLPLGYNTKIGMEGNGISQGQRQRLLIARAVYKNPEFLFFDEAT NALDANNEREIMEHLHTFYRGKTVVVVAHRLSTVRDADKIVVLDRGAVAEEGTHRELTEK KGLYYQLVKNQLELGS >gi|226331995|gb|ACIB01000061.1| GENE 69 88340 - 88939 595 199 aa, chain - ## HITS:1 COG:no KEGG:BF4040 NR:ns ## KEGG: BF4040 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 199 1 199 199 397 100.0 1e-109 MDAHIEQIAKSLYFSCKQFDIGLFYGKMGRCLFFFDYSRVTELRAFEELAGELLDEVMES VCLGMPVGLSFGWCGIGWGVEYLVRKGFVEDDDNEGRNKIDEKVMEYDVRRLGDYSLATG LEGISWYVLLRLSSGDKGVRIGEKNYLSDLKSACEKALKKGRYEGILLLLDFLNGKRANY PFGEFFSQIPGEAHYIPDM >gi|226331995|gb|ACIB01000061.1| GENE 70 88952 - 89617 318 221 aa, chain - ## HITS:1 COG:no KEGG:BF4041 NR:ns ## KEGG: BF4041 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 221 1 221 221 451 100.0 1e-126 MKDITMLNKPPLDFFGCNFCLVEKHRMAFVLISKCGLTFLENIAIYASAGMIPDTEDQTH FYIARVKPERFLVPVSEMSGYEREHKSYLKVAVWRDPVERLVSAYKYFILERTFNQYMYM CNLYQDCSFERFLSFVEFELGKANPLWQDEHIRRQSDFYTSADVDCIVPLSKLNRFLAER GVDMPEEKANATSVRFELKDEKQIAKIKELYRLDYEIPVGC >gi|226331995|gb|ACIB01000061.1| GENE 71 89685 - 90680 471 331 aa, chain - ## HITS:1 COG:YPO0187 KEGG:ns NR:ns ## COG: YPO0187 COG0463 # Protein_GI_number: 16120528 # 
Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Yersinia pestis # 9 246 6 245 329 140 35.0 4e-33 MKEVKRPVVSVVVPVYNTEPFLAECLHSLEKQTLTDIEIILVNDGSTDNSGRLLREYAGK DARFVYVEQENQGLSAARNTGMEHASGHYLAFLDSDDWLAENALQVLCAIAAKTRTDIVS GNTLAVHADGQVQSWERRGRELFATGTVVSGSTYFSRVMDCRCYVPMVYNYLYRRDFIEQ NGFRFEPGLVHEDELWTPQVLTTAQKITVADIDFYYYRQREGSIMTATAAGRRIASIQLI IEKLLEYSRKHLFEKKYREAKEALYVRLLQIYSTACTLHPDGTYTTLYDRAGEMLRVCEE LRRQESLGRWYSEEILNRMKLYYDRLQTMEG >gi|226331995|gb|ACIB01000061.1| GENE 72 90677 - 91567 603 296 aa, chain - ## HITS:1 COG:no KEGG:BF4043 NR:ns ## KEGG: BF4043 # Name: not_defined # Def: putative alpha-1,3-fucosyltransferase # Organism: B.fragilis # Pathway: not_defined # 1 296 1 296 296 620 99.0 1e-176 MDILILFYNTMWGFPLEFRKEDLPGGCVITTDRNLIAKADAVVFHLPDLPSVMEDEIDKR EGQLWVGWSLECEENYSWTKDPEFRESFDLWMGYHQEDDIVYPYYGSDYGKMLVTARREK PYKKKACMFISSDMNRSHRQEYLKELMQYTDIDSYGKLYRNCELPVEDRGRDTLLSVIGD YQFVISFENAIGKDYVTEKFFNPLLAGTVPVYLGAPNIREFAPGENCFLDICTFDSPEGV AAFMNQCYDDEALYERFYAWRKRPLLLSFTKKLEQVRSNPLIRLCQKIHELKLGGI >gi|226331995|gb|ACIB01000061.1| GENE 73 91617 - 94091 1588 824 aa, chain - ## HITS:1 COG:no KEGG:BF4044 NR:ns ## KEGG: BF4044 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 824 1 824 824 1644 98.0 0 MDRRDYYVEYYEVLDRVLSDYVNHLREEKEYWLKIGRHDVGVPLINLLFWQFTWHREEYD RLRKEGKSCGEALGSVKEILADKQVEEWERQEEEYKIYDHEWYEALAPYGGQLVIYIYNA RQLAYLTPLIERLEEPVLLLSEYEIPDETELPDFVTAITLEFTKTVPLVNPFLKEWFPLI FQYANTFDILMRILQPKGLIFLEGCHYQQLLLATIGRDYGIPTLCIQQGWPSLMHTAFRR MPYRYYLMWGEGFRTLWGKHNPLPDFVPTGYMYQVEPRNETKKECVTFFLQGPFFLSDKR YLQEMIRLIGTVAAEFPARRFLVREHPEFRIGEEVRMEWEQIPNIEMVTDEKLAEVFART RVGVAHYSSSLMEGVAHGAVPLVYDPTEGSRYSPDVEAEGLGMIAKTKEELTGGLSRILG NYEDFKQRIEKEQPLWFQATGGETLRNMVGFIKEKMPPVTLKEIYVVDTDTLTRERPVGV SGLLRCKNCEDFLEMCIDSCIDGLDELIAVYHDCTDRTPEILRQKAAQYPDKIRVFEYQP SVYPIDLDEEELEKAKLLPPDSIHTLAGYCNYALSKASYRYAVKIDADQVYFTDRLKHIC 
DAYRSDKKVRFNVAECISYNLYRAYVDSFNRIEMRPFRWLERIALWTHALYASYLEKMII RYKVPVSMSGINLFRKDREWMVGLGQEHPEPDSKEILPPFNGVRDTFFFEVSEDRIFRYV TETKPDGRHRGVEVMRCPNEILDAGFCWFHLRALMKEHEEGYRQSYRKHPERFIPLGTFV KLSYRNLQQRYKPFVAVRWAEPVFAYFFMTGKGRIPWKKLKEIE >gi|226331995|gb|ACIB01000061.1| GENE 74 94102 - 94773 528 223 aa, chain - ## HITS:1 COG:MA3766_1 KEGG:ns NR:ns ## COG: MA3766_1 COG1083 # Protein_GI_number: 20092564 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: CMP-N-acetylneuraminic acid synthetase # Organism: Methanosarcina acetivorans str.C2A # 1 216 1 217 227 115 28.0 7e-26 MNRTFCFVPVRKGSRGIPGKNLRMLGDKPLVCWIIDTVLASGIADEICVATNCDEMDSLI RGRYKEVVQIFRRSEWSARDEASSLEVVQEYLNYRKPDRNDDFILLQATSPFTTAQELRG LVEEMKRGEADSYVACCRLKKFRWSDEGRPLDYSFETKPRRQEYKGFLIESGAFYASTVG RILDSGQLLSGVVKVVEVGPAGMIDIDEEADWGLAEHYIETGL >gi|226331995|gb|ACIB01000061.1| GENE 75 94787 - 96001 508 404 aa, chain - ## HITS:1 COG:no KEGG:BDI_2925 NR:ns ## KEGG: BDI_2925 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 2 402 6 379 389 177 29.0 6e-43 MNYYLYIEPYTLFFRKKGECLFYNTLNKKVLKIDVSNDMYFILDKLETDKYTILSDENLQ TQTVSFWVNRLRETFNGDILPFSDGSVPPAIFPPFINNQRDFERLNTYEWVEKDNQVMNY LEEIYLYLNGCEEEDDSIWKQIPSYLCSDKEMDSRKLLQWLDGCIDKQISQVYLLGGDVL MYKDLNEIWGMLEQKSMSIQLYYRYDLFTENHKKLLNATIGQLTFVVPMWKFDEELFSSL SRTVEGIEQNKRWLFLITSDNEYELAEQLVSKYSLESRSIRPVYKEDNLSFFKDAVYLEE TDICNTCLEKRDFYVSQKINKNDFGRLTVLPDDKIYANVNHAEIGVMEKDTIASVLYKEM TEGHSWLRIRDQKPCCDCIYQWLCPSPSNYELAIGKPNLCHVKP >gi|226331995|gb|ACIB01000061.1| GENE 76 96020 - 97495 657 491 aa, chain - ## HITS:1 COG:CAC0658 KEGG:ns NR:ns ## COG: CAC0658 COG0641 # Protein_GI_number: 15893946 # Func_class: R General function prediction only # Function: Arylsulfatase regulator (Fe-S oxidoreductase) # Organism: Clostridium acetobutylicum # 54 441 95 465 518 93 24.0 1e-18 MKNDIAFSTPFHAYVYSFRHKEYLPLHPILKRIYTVMEEKKNIEEDEELRCYPKEQILHY LQKYKFLKENEFIGEKVKTEFGEITETMVRREVENLTVLTFEVTERCNLRCRYCAFGDLY 
YGYDERKGENLDFPKAKQILDFLFGIWEKKPHLSVARTLTVGFYGGEPLMNMDLIKQIVS YIDEHKPEGMKFAYNMTTNAMLLRVYQDFLVEHKFHLLVSLDGTKADDCHRVTVNGKSSF VQVFDQIKNLQFCYPEYFKKYVSFNSVIHSESNIERIVDFFRAEFDKQTSLSELNNSSIA QEGKYAEMRKSVFQSIALSPRRKEIDQQLMYNAPDISTVTYFLHHLSNEVFRDYRSMFYG KRNFKLLPTGTCIPFNRKMYVTVHGKILVCERIDHDFAVGHVTDEGVELNFAHVAENHRK YCSKLLSQCKQCYMQESCSQCMYYTNVLADKVVCRNFKNREMFAGYLAMNVDYLEHNRWA YSKVMKEIFIF >gi|226331995|gb|ACIB01000061.1| GENE 77 97671 - 98873 450 400 aa, chain - ## HITS:1 COG:no KEGG:BF4066 NR:ns ## KEGG: BF4066 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 400 1 407 407 648 80.0 0 MKKYLYFIVFVTLWSCSADKVNVKSEDNSFYSVDLRIIEKTKGTVMSLGDLMESYEIIRL DNRDEALIKTYPSSVYVTDNYILLQPDDVVSPVKLFTRKGRYVADIGGVGQGPGEYLYLF SWLVDEKENRIYLGPGRADKVLVYDLKGNYLPDEVIRFGEIVHKSQIWVDYDKKNVAVVT LPFSANVNSNFAINKNVCWVQNREGDIVHRIPVNHYGLIGDYSNGLVAHRNVDAISFSIF EDPMLRTRPDTLYHYDAVKNIITPCFTIDHVVSENQSACTVLYETSRSYWAQVTLYPNNL SSCVSSVRLPAFNVCVSKKDGNVRRIDRFTDPLLGLSYLFLAMKNGYVCISYDPLELMDA LDKVLTQTDLEPDVCKRATDLRNSLHENDNDILIIGKLKQ >gi|226331995|gb|ACIB01000061.1| GENE 78 99003 - 99260 339 85 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253567223|ref|ZP_04844673.1| ## NR: gi|253567223|ref|ZP_04844673.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 85 1 85 85 164 100.0 2e-39 MKTLRRIKLNSLSQEDLADREMNALRGGHTCGCACKQGAEFKATNYSANVADDKYSPEGN IICNWVGGSGSDMAVYGGSKVPGMP >gi|226331995|gb|ACIB01000061.1| GENE 79 99277 - 100467 696 396 aa, chain - ## HITS:1 COG:no KEGG:BF4065 NR:ns ## KEGG: BF4065 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 20 395 44 418 421 689 89.0 0 MNKYWGTWIVFIGVFFHSCKQEVKQNNISFYSVDLLEMEKMKGEEILLSDLIESLEIIKL DNREEALIATYSFGIDVSSNYILIEPDGVSALKLFTRKGRYVADIGGVGQGPGEYKYAVN RFLDEKQGRVAIAENKKMLFFDLKGQFLSEESISLPETITKSSIWIDLENEKAVVVVLPF ADIGNPKAPISKNLCWVQDFKGNILQKISAINYAIVPDYSNEVLAPRNVDAYSFSLCQVV GRTRPDTLYHYDIANNLLKPYFTLDNVMQEDKYIVTSLYETPEYYWSRVTIGPAKVLSDG APVRMTVFNVRVSKKDGSVKRIDRFTNDFLGLSYPFLTMRNGYVCITYEPLELMGALDKV LTQTDLEPDVRKRVADLRNSLHENDNDILIIGKLKQ >gi|226331995|gb|ACIB01000061.1| GENE 80 100617 - 100868 282 83 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253567225|ref|ZP_04844675.1| ## NR: gi|253567225|ref|ZP_04844675.1| predicted protein [Bacteroides sp. 3_2_5] # 1 83 3 85 85 148 100.0 1e-34 MKKLSKIKLTNLSQEDLADREMNALRGGHNCGCACSSTSKATNHSSNEDRDLHSPEGNVI CTWVGGAGSDISVYGGSKAPGMP >gi|226331995|gb|ACIB01000061.1| GENE 81 100920 - 102134 577 404 aa, chain - ## HITS:1 COG:MJ1607 KEGG:ns NR:ns ## COG: MJ1607 COG0438 # Protein_GI_number: 15669803 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Methanococcus jannaschii # 173 401 149 381 390 100 30.0 5e-21 MKHIYLIGNGSRAAQYGVGTYIRQMFEFFRQTSSVRLTIVELNSEVKEVTEECDNSGKVR YLKIPAQKSEGRKGDVAYCYRNIAYLLALHFLKDEQNVLHLNYLHHAPLADWLKKIGVEF YLLVTIHYLDWCFMLKGNTRLFRSIIHKEEQSNEWGEKIRNSYERDKRLFQHSDKVICLS QYTQNLLREDYGVEKEKLVVVYNGLKDEAIKLSKEERLEKRSALGFRETDKIILFVGRLD RIKGVQYLIEAFRQVIRKNPNSRLVIVGDGDYDKYLKQCAGIWSYVVLTGKVEKEVLYTF YQIADVGVLPSFHEQCSYVAIEMLMHGLPLIGTGSTGLKEMVEGMHCLPLKEEDDSVDLP IDLLVQWLIEDQEHLRSEKYRRRFEERYTLRKMSENMFSIYLNL >gi|226331995|gb|ACIB01000061.1| GENE 82 102131 - 102727 516 198 aa, chain - ## HITS:1 COG:no KEGG:BDI_3162 NR:ns ## KEGG: BDI_3162 # Name: 
not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 1 193 1 193 214 169 44.0 6e-41 MGKVNEALLKRIADHQMLHGSFRRDLGVLNGKMGIVLFFFHYARYTGRVLYEDFAGEMLE EVIQELHSDLPIRFSDGLCGIGWGVEYLIQNAFIEGDSDEILEDLDQKIMEWDPRRATDL SFESGLEGVACYASSRLKSTVRNRMPFDQVYLSELEMVVQQKGLRMRLELDDVFVRVIDT GNIEEEVSWKYGLKMIAL >gi|226331995|gb|ACIB01000061.1| GENE 83 102720 - 103499 356 259 aa, chain - ## HITS:1 COG:no KEGG:BF4055 NR:ns ## KEGG: BF4055 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 255 1 255 255 356 68.0 5e-97 MKYSEQITIGIPVRIDSPARMRNLQVLLRHLSLSGIKIHVWEAGDVRSELSGFASNKDTY TYERDESLVYHKTLYVNRLLKAASTPVVAIWDADILLPLSQIEASVLAIIEQGYLLSIPY DGVVKMLSEAQSEAFEYSGQGCDYLTMFAATYARLMRRPSCGGVFVVDREKYLHWGGDNE RFVSWGPEDAERIRRIEILGYPVHWVKEGPLYHLWHPRGENSGYATEELAFQNRMEFIKV CSMERNELREYIKAWKNNG >gi|226331995|gb|ACIB01000061.1| GENE 84 103507 - 104127 330 206 aa, chain - ## HITS:1 COG:no KEGG:BF3829 NR:ns ## KEGG: BF3829 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 13 206 5 198 201 262 73.0 5e-69 MVGSLLIGNKHFLVRELVQAGRSTSCLGLTNGKMGIAIALFRYGRLSGELAYEEVASELL DDVCQNLNYSMPISFNDGLCGIGWGIEYLIQHGYVDADSDEILRDIDLYLIRCIHIYGLS GLSLRNGIVGLGRYILIRITPTFLFGDTFSSALLKEYFIYLIDWLEEELKRVDEPVDDLL DFLFDLYPTGFYQTKVSDLIKYCMNK >gi|226331995|gb|ACIB01000061.1| GENE 85 104099 - 104503 254 134 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253567230|ref|ZP_04844680.1| ## NR: gi|253567230|ref|ZP_04844680.1| predicted protein [Bacteroides sp. 
3_2_5] # 1 134 1 134 134 252 100.0 5e-66 MKHLLSTIIFLFAVILKGNAESYSFPSPNEINSVINLDLKGIFSSTHTKSLPLTPVEASL VDNTLLNISFVVHSGEVTVRILSEKGILYSSCINSDQQNSLAISVEDFEKGDYKLELTTP AGGYVYGWFTINWE >gi|226331995|gb|ACIB01000061.1| GENE 86 104622 - 106499 962 625 aa, chain - ## HITS:1 COG:no KEGG:BF3831 NR:ns ## KEGG: BF3831 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 498 1 484 594 238 37.0 8e-61 MKSSYICCVLLFVLFLVSCTDGRSEGQVARLLRQAEMCMEECPDSALVYLHQIPDPEKLT GENQADYCLLLTQAMDKNDLPLSSDSLIQIAVGYYSNGKDRLKKGKASFYLGRVKSSRGM LEDAQKYFLEALSILDVTDNLKYQALVRNHLGQLYMNLDLYQDALIMNSQSVSLFQQLTD TANLVYAERDMGRIYLLEGRQDSASLYYQQAINDALSYSKSDIYKDIVSEWGQISMYLEM SPSAEQMLLSNLESDLVLDKTPVCLSLGIYYLSNKLYSKAEGYLLKAAASSRPYTRVSAY KYLGHLETRNLDLIYWDQYEQALDSLERQNLAYAVKEIQEKYNNAALQAHTFKLENERLH STISYLSVILLLLSITSVIYVFYMKERRRRQIEKEEFNRNMKAHEQERARLLAELSDSKL HVEKLEQLKSEQKQSSLEVQEKTNALLEQEKEHSCKISQQLEELKKKWKNQLSINIDLRA RNKDLSYKIKSLKDSDMENTPGLYASINLLVRILTCDLSEIKTLKVDDWEGLFQCIDLLY GNSLRKFIDQYQKQHDEELDRRVVAICYFEYIKVKHARQAAILRVSAQALSKRKQRLKIE LGVPDMASVGRVVKPVCPSNKKELM >gi|226331995|gb|ACIB01000061.1| GENE 87 106807 - 107118 179 103 aa, chain - ## HITS:1 COG:no KEGG:BF4049 NR:ns ## KEGG: BF4049 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 103 308 418 418 179 78.0 3e-44 MKSQRLGLQPIKNYERVVNPQKKRFHMSSRMNSHGKIIITKIADYESNYIKKAGLLEGDE IIAINEIPIKMITIEENTKLNRRGQGKSYKIPMVIDRNEVQGD >gi|226331995|gb|ACIB01000061.1| GENE 88 107195 - 108415 805 406 aa, chain - ## HITS:1 COG:no KEGG:BF3834 NR:ns ## KEGG: BF3834 # Name: not_defined # Def: putative lipoprotein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 406 1 406 406 805 99.0 0 MKNLYLILVLSALVACTGSSQKEKPACVNTPVEPFTGLPVFDLQEQYPEKEIILQDIADV RYVVLETGDSSLVGIRPILMTDSFLVTVNKKSDVIFFDKSGKYLHSFNHTGMSGEEYGDI VSGYCIDEKAREIFIYDGLQSRIQVYGYGGAYRRTLKLPQNRMFASSIFEYDENFLFGED YRLVDSQIGKYPVNKTPYYKISKKDGKLTSIPITVKERIRDGLSYTVGDGAFGYVSLLMS 
PVARFGSDILIADYSLDTAYVYRDDHLIPLTVRRNHTSENNIPILATVDVMTGRYLLWYT IVKDIDVKNNRVSDPVSYLYDRFTNEYCRVDLVNRDVVSATNIPAFQMRLSANYHVVPEN YAIQYYPAEELIELNGQGKLKGELKEIASKLNDEDNPVLLIAKFKE >gi|226331995|gb|ACIB01000061.1| GENE 89 109180 - 110388 480 402 aa, chain - ## HITS:1 COG:no KEGG:BF3836 NR:ns ## KEGG: BF3836 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 402 1 402 402 793 98.0 0 MDKIYLWFLLLISCSCSHTKEKVSNDMDSSLCVIDITCEYPVEKVNIHDVADVEYVPLET TRNSLLASDCSAFRISDDYITVASGVDNGNIFFFNRKGRYLWTFNRRGGSAEEYSSITAW DADFGMQEIYIYDSFRKKIYIYSLGGQYKRSHALPMKDCTFIDLYNYDKDYLIGYNRFYD FRKKKKVDTHPYYLIDKQSGEMSSIGIVVDKPISEKVHTEIVKFPGGAYKDQVLFLITAL IKNGDAFLIADYALDTIYSYRHHKLVPIAVQTPSVYASDPPVIVACELYTDSYLHFRIIP MYYNPSAPMSPMADAPELVLNRHTGKIAEWKMYDSNYSSDIERPVPTMILQSADRENYGI SMFTAERLIEQYQAGGLKGELKDIASRLSIDDNDILMICKYK >gi|226331995|gb|ACIB01000061.1| GENE 90 110491 - 111732 712 413 aa, chain - ## HITS:1 COG:no KEGG:BF3837 NR:ns ## KEGG: BF3837 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 61 413 1 353 353 660 90.0 0 MNKNIVLLFCALFFLSACTSKPSSSVPIEDDTLSTEETFKDIDLANNLKDCGKPLLLSDI VKDVEYVKLETQDNILVGSINQLKRTKKFVFIYSWHQNHVMMFDTSGKFIRKIGRVGQGP GEIANIHCFTVSDSLVFIYPFGHNGSLIGYDMRDNRFVRRISLKRPAFHSNMIDVMGDYL VYYPGTVPNGNERIFITACVVNGKGETLMEQIPYLPAGVDKSEMTLSSDASWIYQGRSNV YSQINDTIYGITCDSIYPRYHLSLGKYGLPLGRYDVKDLGLKDFIMMQSVCENKECLLFK FSYNKKMWFSRYDKKTEKIDSWEQTPLKGIGWMAIESPGITNDIDGSQSFDGIRYTGENS FYFAITPDNLDQVRRNVAEAKVKFPEKQAELLKLLDEMGEDDNPIIAFYKLKD >gi|226331995|gb|ACIB01000061.1| GENE 91 111903 - 113147 930 414 aa, chain - ## HITS:1 COG:no KEGG:BF4064 NR:ns ## KEGG: BF4064 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 414 1 414 414 845 99.0 0 MKLDYIALFLVLFFLAACGSKSSSPDMVDDNALPTEETFKEIDLANNLKDCGKPLLLSDI VKDVEYVKLETQDNILVGDIKQLKRTEKFVFIYSRNQRHVMMFDTSGKFIRKIGHVGQGP GEIANIHCFTVSDSLVFIYPFSRNGSLTVYDMRDNSFVKEFSLKWPVFFSWTIDVMGDCL 
VYYPGTVYLPDNSKTFISACVANGKGETLMEQIPYLSSGDKEIVQLSSDPSWLYRGRSNV YSFINDTIYGITCDSIFPRYHLSLGKYKLPSGRYDPTHDYGWGDFVLMQSVYETKEFLFL KFWFDKKLWFSRYDKKTEKIDSWEQTPFKAHYWMILDAPGITNDIDGSQSFKDFDNVGEN CFHFAITPDNLDQVRRNVTEAKVKFPEKQAELLKLLDEMGEDDNPIIAFYKLKD >gi|226331995|gb|ACIB01000061.1| GENE 92 113172 - 114401 523 409 aa, chain - ## HITS:1 COG:no KEGG:BF4066 NR:ns ## KEGG: BF4066 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 406 1 406 407 782 92.0 0 MKARHFFYPLLSLFSVAMFASCSSYVPKEGDIIEDSAFRSVDLRIIEKAEGIVMSLSDLV ESYEVIKLENRDEALIKTYPFGIFVTDNYILLNPDAISPIKLFTRKGQYVADIGGIGQGP GEYKTIHFCMIDEKQKRIYLGPGRANKILTYDLKGNYLSDEAIHFKEIVHKPCIWMDHDK KHVTVVGLPFSENENSNFEISNNVCWVQNREGDIVHRISANHYGLIGDYSNGLVACRNVD AISFSIFEDPMLRTRPDTLYHYDAVKNIITPRFTIDHVVSENQSACTVLYETSRSYWARV TLYPNDISSNSSPVRLTTFNVCVSKKDGSVKRIDRFTNDFLGLSYPFLTMRNGYVCISYD PLELMDALDKVLAQTDLKPEIRKRATGLKNSLHENDNDILMIGKLKSNY >gi|226331995|gb|ACIB01000061.1| GENE 93 114692 - 114934 132 80 aa, chain + ## HITS:1 COG:asl4856 KEGG:ns NR:ns ## COG: asl4856 COG4680 # Protein_GI_number: 17232348 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Nostoc sp. 
PCC 7120 # 12 79 17 84 85 73 47.0 7e-14 MLMYLCVIGIKKTEKADWSCLADTKQTFNSVDYVGNDRFVFNIKGNDYRLVATILFAAKK VFIRWIGTHKEYDNKDCSNA >gi|226331995|gb|ACIB01000061.1| GENE 94 114946 - 115311 218 121 aa, chain + ## HITS:1 COG:no KEGG:BF4068 NR:ns ## KEGG: BF4068 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 121 1 121 121 214 100.0 6e-55 MAKIKTEKQYKAACSRIEELLKVVSNDTPTDDKNFLELDLISDLVADYEEEHFPIEAPSL VDVIKLRMYEMGLTQTKLSELLNVSPSRISEYLSGKCEPTLKVAREISRKLNIDANIVLG V >gi|226331995|gb|ACIB01000061.1| GENE 95 115853 - 117730 1738 625 aa, chain - ## HITS:1 COG:SPy2121 KEGG:ns NR:ns ## COG: SPy2121 COG0323 # Protein_GI_number: 15675871 # Func_class: L Replication, recombination and repair # Function: DNA mismatch repair enzyme (predicted ATPase) # Organism: Streptococcus pyogenes M1 GAS # 1 625 1 645 660 309 32.0 1e-83 MSDIIHLLPDSVANQIAAGEVIQRPASVIKELVENAIDAEAQNIHVLVTDAGKTCIQVID DGKGMSETDARLSFERHATSKIREASDLFALRTMGFRGEALASIAAVAQVELKTRPESEE LGTKIIIAGSKVESQEAVSCPKGSNFSIKNLFFNIPARRKFLKANSTELSNILAEFERIA LVHPEVAFSLYSNDSELFNLPACHLRQRILSVFGKKLNQQLLSVEVNTTMVKVSGYVAKP ETARKKGAHQYFFVNGRYMRHPYFHKAVMDAYEQLIPAGEQISYFIYFEVDPANIDVNIH PTKTEIKFENEQAIWQILSASIKESLGKFNAVPSIDFDTEDMPDIPAFEQNLPPAPPKVH FNSDFNPFKPSSSSGGGNYSRPKVDWEDLYGGLEKASKMNQPFSDSDPESEEFAVIEEES IATAAPETLYAGEPAVIEKGTQHLQFKGRFILTSVKSGLMLIDQHRAHIRVLFDRYRAQI QQKQGFSQGVLFPEILQLPASEAAVLQSIMDDLSAVGFDLSDLGGGSYAINGVPSGIDGL NPVDLVRSMLHTAMEKGNDVKEEIQDILALTLARAAAIVYGQVLSNEEMVSLVDNLFACP SPNYTPDGRVVLTTIKEEEIDKLFR >gi|226331995|gb|ACIB01000061.1| GENE 96 117767 - 118063 432 98 aa, chain - ## HITS:1 COG:no KEGG:BF4070 NR:ns ## KEGG: BF4070 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 98 1 98 98 194 100.0 1e-48 MGMFNVRKPRGFNHQYIYVDERKEKLAKMEEDAKRDLGILPEKEFSPEDIRGKFIEGTTH LKRRKESGRKPAHLGVILAIIALLIFLWHYLQTGSWSF >gi|226331995|gb|ACIB01000061.1| GENE 97 118066 - 119769 1549 567 aa, chain - ## HITS:1 COG:no KEGG:BF4071 NR:ns ## KEGG: BF4071 # Name: not_defined # 
Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 567 1 567 567 1114 96.0 0 MFRFNKENKFQGRHRFLLAGTLCLLAVCFLMAQDKKPQHDKKAQPEQKVEPEKAQGKKKT RVDLLHADQGQADKLARPDVQVLIGSVKLRHDSMYMYCDSALIYEKTNSFEAFSNVRMEQ GDTLFIYGDYLFYDGMTQIAQLRENVKMINRNTTLLTDSLNYDRLYNLGYYFDGGTLMDE ENVLTSDWGEYSPATKLSVFNHDVKLVNPRFVLTSDTLKYSTDTKIATILGPSDIVSEQN HIYSERGIYNTVSGQAELLDRSVLTNDGKRLTGDSLFYDRKAGYGEAFDNVQMNDTVNKN MLNGDYCYYDELKQNALATKRAVAVDYSRGDSLFMHADTLLMNSYNLDTDSLFREMRAFH KVRMYSIDLQGVCDSLVFNTKDSCLTMYRDPILWNEGQQLLGEEIKVYMNDSTIDWAHII NQALTVEQKDSIHFNQISGKEIKAYFAEGEARKIDIIGNVLLNYYPEEKDSTMIGLNTSE TSLINLFLKDRKMVKMIMSPQSNCILYPMNQIPPDKMKLPTFSWFDYVRPLSKEDIFNWR GKKAGEALRKTERKAISGPKREIINMK >gi|226331995|gb|ACIB01000061.1| GENE 98 119782 - 121152 1642 456 aa, chain - ## HITS:1 COG:STM0092 KEGG:ns NR:ns ## COG: STM0092 COG0760 # Protein_GI_number: 16763482 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Parvulin-like peptidyl-prolyl isomerase # Organism: Salmonella typhimurium LT2 # 11 338 7 339 428 83 26.0 7e-16 MKKFVNFKFVVMFALALVANVASYAQDNVIDEVVWVIGDEAILKSDVEEARLAALYEGRK FDGDPYCVIPEELAVQKLYMHQAVLDSIEVPEAEVIQRVDYQINNYIQAMGTREKLEEYF NKTSTQIREAMRENARDGLIVQRMQQKLVGDIKVTPAEVRRYFKELPQDSIPYVPTQVEV QIITQQPKIPVAEIEDVKRRLREYTDRINKGESDFSTLALLYSEDRGSAIKGGETGFMGK GQMVPEYANVAFNLQDTKKISKIVESEYGFHIIQLIEKRGDRINTRHILLKPKVSDKELD EANARLDSIANDIRSDKFTFDQAASALSQDKDTRNNHGLMQNPQNQTAKFEMQDLPQEIA KVVDKMNIGEISKAFTMVNPKDGKEVCAIVKLKSRINGHKATITDDYQNLKEIVLDKRRE EALQKWIVEKQKHTYVRINPAWQRCDFKYPGWIKKD >gi|226331995|gb|ACIB01000061.1| GENE 99 121162 - 122061 743 299 aa, chain - ## HITS:1 COG:no KEGG:BF4073 NR:ns ## KEGG: BF4073 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 20 299 1 280 280 533 99.0 1e-150 MQRIIVLSSYLKLSIIQLTMRKTDLLLISLLFCASCADKHDHKGQTPLVEVDGNFLYKED LQAVLPAGLSKDDSLLFAEHYVRSWVEDVLLFNQAQSNIPDNGEIDKLVENYRKALIMHT YQQELISQKLSGEIPEQEIADYYEKNKELFKLDRPLMKGLFIKVPLTAPQLGNVRKWYKT 
ETQDAVEHLEKYSLQNAVKYEYFYDKWVRVADVLDMIPLKAESPEAYMDKNRHIELKDTA FYYFLNISDFRVAGEQEPYEFAQPKVKDMLVNIKRVDFMKQVKDDLYERAVKRKKIINY >gi|226331995|gb|ACIB01000061.1| GENE 100 122061 - 123614 1100 517 aa, chain - ## HITS:1 COG:AGl2623 KEGG:ns NR:ns ## COG: AGl2623 COG0760 # Protein_GI_number: 15891420 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Parvulin-like peptidyl-prolyl isomerase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 20 257 51 298 315 63 25.0 1e-09 MKKILVGTLTCLFGAIAGHAQQDPVLMRINGQDITRSEFERFCHRNKPSGIAGKETLKRC ADLFVDMKLKLSAAQKAGLDTVSDFRTEMENYHRALSRSYLTDSATDEAYAKKLYDQMKT RSAAGEVKVMRIFRYLPQTALPHHLREAQNLMDSLYHVLEAHPGIDFKTLVNKYSDDKKE FWMGWLQTSQEFEEVAFSLKDGEYSKPFFTPKGIQIVKVTGRREIPPFEQIRGELIHKLS RRPGTDKEIELWVNKLKSICQYTPDKAGMKELLASGRTSRTLFTLDGKSFTGKDFERFAD AHPMGIKRQLNAFVVKSILDYENNRLEQKYPDFRLALQQRRDDLLLAAITRRESRQVSLS DSVALKAFFKEHRTDYNWDSPRYRGAVLHGTHKKTLKSARKFLKKLPEEEWKDAIRLTFN TPASPATIRIEQGTFAEGDNVFVDKLVFKKGDFEPLKSYPFTVVLGEKKKGPESYHEIIP QLIRDYQNHLDALWTERLRASAKVEINQEVLKTVNNH >gi|226331995|gb|ACIB01000061.1| GENE 101 123759 - 125234 1670 491 aa, chain - ## HITS:1 COG:BH0020_3 KEGG:ns NR:ns ## COG: BH0020_3 COG0516 # Protein_GI_number: 15612583 # Func_class: F Nucleotide transport and metabolism # Function: IMP dehydrogenase/GMP reductase # Organism: Bacillus halodurans # 206 488 1 281 282 351 62.0 2e-96 MSFIADKIVMDGLTYDDVLLIPAYSEVLPRTVDLSTKFSRNIELKIPFVTAAMDTVTEAK MAIAIAREGGIGVIHKNMSIKEQAKQVATVKRAENGMIYDPVTIKQGSTVRDALALMAEY KIGGIPVVDDNRYLVGIVTNRDLRFERNMDKRIDEVMTKENLVTTNQSTDLEAASQILQY HKIEKLPVVDKEGKLIGLVTYKDITKAKDKPMACKDSKGRLRVAAGVGVTADTFDRMQAL VDAGADAIVIDTAHGHSKGVIDTLREAKKRYPDIDIVVGNIATGDAAKALVEAGADGVKV GIGPGSICTTRVVAGVGVPQLSAVYDVAKALKGTGIPLIADGGLRYSGDVVKALAAGGYS VMIGSLVAGTEESPGETIIFNGRKFKSYRGMGSLEAMENGSKDRYFQSGEMDVKKLVPEG IAARVPYKGTLYEVIYQLTGGLRAGMGYCGAPDIEKLHDAKFTRITNAGVMESHPHDVTI TSESPNYSRPE >gi|226331995|gb|ACIB01000061.1| GENE 102 125321 - 127501 1771 726 aa, chain - ## HITS:1 COG:alr0205 KEGG:ns NR:ns ## COG: 
alr0205 COG0514 # Protein_GI_number: 17227701 # Func_class: L Replication, recombination and repair # Function: Superfamily II DNA helicase # Organism: Nostoc sp. PCC 7120 # 6 720 6 712 718 500 39.0 1e-141 MAGKINLTDQLKKYFGFDNFKGNQEPIIQNLLDGNDTFVLMPTGGGKSLCYQLPSLLMEG TAIVISPLIALMKNQVDAMRNFSEEDGVAHFINSSLNKGAIDQVRSDILAGKTKLLYVAP ESLTKEENVEFLRSVKISFYAVDEAHCISEWGHDFRPEYRRIRPIINEIGKAPLIALTAT ATPKVQHDIQKNLGMVDAHVFKSSFNRPNLYYEVRPKTQNVDKDIIKFIKNNPEKSGIIY CLSRKKVEELAEILQANGINARAYHAGMDSATRTQNQDDFLMEKIDVIVATIAFGMGIDK PDVRYVIHYDIPKSLEGYYQETGRAGRDGGEGQCITFYTNKDLQKLEKFMQGKPVAEQEI GKQLLLETAAYAESSVCRRKTLLHYFGEEYTEENCGNCDNCLNPKKQVEAQELLCAVIET IIAVKENFKADYIIDVLQGRETSEVQAHLHEDLEVFGSGMGEEDKTWNAVIRQALIAGYL SKDVENYGLLKVTDAGKKFLKHPKSFKITEDNDFEEVEEETPARGGGSCAVDPVLYSMLK DLRKKLSKKLEVPPYVIFQDPSLEAMATIYPVTLEELQNIPGVGAGKAKRYGEEFCKLIK RHCEENEIERPEDLRVRTVANKSKMKVAIIQAIDRKVALDDIALSKGIEFSELLDEVEAI VYSGTKLNIDYFLDEIMDEDHMLDIYDYFKESTTDKIDDALDELGDEFTEEEVRLVRIKF ISEMAN >gi|226331995|gb|ACIB01000061.1| GENE 103 127672 - 128919 241 415 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163762510|ref|ZP_02169575.1| ribosomal protein S16 [Bacillus selenitireducens MLS10] # 155 405 248 453 466 97 29 3e-19 MAESKNNKKRCSFCGRSENEVGFLITGMNGYICDSCATQAYEITQEAMGAGKQSAGATRL NLKELPKPVEIKNFLDQYVIGQDDAKRFLAVSVYNHYKRLLQKDSGDDVEIEKSNIIMVG STGTGKTLLARTIAKLLHVPFTIVDATVLTEAGYVGEDIESILTRLLQVADYNVPEAEQG IVFIDEIDKIARKGDNPSITRDVSGEGVQQGLLKLLEGSVVNVPPQGGRKHPDQKMIPVN TKNILFICGGAFDGIEKKIAQRLNTHVVGYNASRKTATIDKNNMMQYIAPQDLKSFGLIP EIIGRLPVLTYLNPLDRNALRAILTEPKNSIIKQYIKLFEMDGVKLTFQPEVYEYIVDKA VEYKLGARGLRSIVETIMMDVMFEIPSEDQKEYEVTLDYAKHQLEKANLARLQTA >gi|226331995|gb|ACIB01000061.1| GENE 104 128922 - 129584 943 220 aa, chain - ## HITS:1 COG:sll0534 KEGG:ns NR:ns ## COG: sll0534 COG0740 # Protein_GI_number: 16332068 # Func_class: O Posttranslational modification, protein turnover, chaperones; U Intracellular trafficking, secretion, and vesicular transport # Function: Protease subunit of ATP-dependent Clp proteases # Organism: Synechocystis # 32 217 
25 210 226 229 58.0 2e-60 MDDFRKYATKHLGMNAMVLDDVIKSQAGYLNPYILEERQLNVTQLDVFSRLMMDRIIFLG TQIDDYTANTLQAQLLYLDSVDPGKDISIYINSPGGSVYAGLGIYDTMQFISSDVATICT GMAASMASVLLVAGAKGKRSALPHSRVMIHQPMGGAQGQASDIEITAREIQKLKKELYTI IADHSGTSFDKVWADSDRDYWMTAQEAKEYGMIDEVLIKK >gi|226331995|gb|ACIB01000061.1| GENE 105 129722 - 131077 1698 451 aa, chain - ## HITS:1 COG:PA1800 KEGG:ns NR:ns ## COG: PA1800 COG0544 # Protein_GI_number: 15596997 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) # Organism: Pseudomonas aeruginosa # 1 445 1 425 436 76 22.0 1e-13 MNVSLQNIDKVSALLTVKLEKADYQPQVDKSLKNIRQKAQVPGFRPGMVPMSLVKKMYGK SVIADEVNKLLSEKVYAYIKDNNINILGEPMPNEEKQPDIDFDTMEEFEFVFDIALAPEF KAEVSDQDKVDYYTIEVTDEMVENQIKAYTQRNGKYEKVDAYEENDMLKGLLAELDEEGN TKEGGIQVEGAVMMPSYMKNDEQKAIFANAKVNDVLVFNPNTAYEGNAVEMASLLKIDKE AAAEVKGNFSFQVEEVTRFVNGELNQEIFDQVFGKDVVKTEEEFRAKVKESIAAQFVADS DYKFLIDVRKVLTDKVGKLEFPDALLKRVMLVNNKDKGEEFVNENYDKSIEELTWHLIKE QLVKENDIKVEQDDVINMAKEATKAQFAQYGMLTIPDDILENYAKEMLKKKESIDGLVNR VVETKLAAALKGKVTLENKTVSMEEFNKMFE >gi|226331995|gb|ACIB01000061.1| GENE 106 131174 - 131626 -154 150 aa, chain - ## HITS:1 COG:no KEGG:BF3896 NR:ns ## KEGG: BF3896 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 150 41 190 190 315 99.0 4e-85 MYPSFSWEGACLYPERGLCSIRNLSLGSQRQLFPDPFLPVGSKSRKGESVKTAICHLFVS INPDYRYHCLFANLYGAIHSYPGISLMYRWIEFHPFADLCSGGFCGIFNIRVLLFWLIKS KTGSFGNRPMAFQACVWLLPAENGASLQMK >gi|226331995|gb|ACIB01000061.1| GENE 107 132317 - 132562 260 81 aa, chain + ## HITS:1 COG:all2777 KEGG:ns NR:ns ## COG: all2777 COG0724 # Protein_GI_number: 17230269 # Func_class: R General function prediction only # Function: RNA-binding proteins (RRM domain) # Organism: Nostoc sp. 
PCC 7120 # 1 81 1 81 99 84 54.0 5e-17 MNIYVGNLSYRVKEADLQQVMEDYGTVTSCKVIMDRETGKSKGFGFVEIADDAAGAKAIA ELNGAEYEGRTMVVKEARPRE >gi|226331995|gb|ACIB01000061.1| GENE 108 132681 - 133442 270 253 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 7 230 1 229 245 108 25 1e-22 MEESKMVLRTEDLVKKYGKRTVVSHVSINVKQGEIVGLLGPNGAGKTTSFYMTVGLITPN EGRIFLDDLEITKYPVYKRAQTGIGYLAQEASVFRQMTVEDNIASVLEMTNKPLDYQKDK LESLIAEFRLQKVRKNKGTQLSGGERRRTEIARCLAIDPKFIMLDEPFAGVDPIAVEDIQ QIVWKLKDKNIGILITDHNVQETLSITDRAYLLFEGKILFQGTPEELAENKIVREKYLSN SFVLRRKDFQLKD >gi|226331995|gb|ACIB01000061.1| GENE 109 133518 - 134261 664 247 aa, chain + ## HITS:1 COG:aq_355 KEGG:ns NR:ns ## COG: aq_355 COG0767 # Protein_GI_number: 15605864 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: ABC-type transport system involved in resistance to organic solvents, permease component # Organism: Aquifex aeolicus # 1 245 1 244 245 115 33.0 7e-26 MIKALRTVGRYIMLMGRTFSRPERMRMFFRQYIKEIEQLGVNSIGIVLLISFFIGAVITI QIKLNIESPWMPRWTVGYVTREILLLEFSSSIMCLILAGKVGSNIASELGTMRVTQQIDA LEIMGVNSANYLILPKITAMVTMIPILVTFSIFAGIIGAFATCWFGGIMTATDLEYGLQY MFVEWFVWCGIIKSLFFAFIIASVSSFFGYTVEGGSIEVGKASTDSVVSSSVLILFADLV LTKLLMG >gi|226331995|gb|ACIB01000061.1| GENE 110 134258 - 135028 304 256 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 237 1 239 245 121 30 2e-26 MIEVKGLYKSFEGKTVLHNIDATFENGKTSLIIGQSGSGKTVLMKCIVGLLTPEKGEVLY DGRNFLAMGKKEKKHLRREMGMIFQSAALFDSMSVLDNVMFPLNMFGTDTLREQTKRAMF CLDRVNLTEAKDKFPGEISGGMQKRVAIARAIALNPQYLFCDEPNSGLDPKTSLVIDDLI HDITREYNMTTIINTHDMNSVMGIGEKIIYIYQGTKEWEGTKDDIFTSTNEQLNNFIFAS DLLRKVKDVEIQNLEG >gi|226331995|gb|ACIB01000061.1| GENE 111 135260 - 135907 685 215 aa, chain - ## HITS:1 COG:RSc0292 KEGG:ns NR:ns ## COG: RSc0292 COG2197 # Protein_GI_number: 17545011 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response 
regulator containing a CheY-like receiver domain and an HTH DNA-binding domain # Organism: Ralstonia solanacearum # 7 213 4 210 210 98 29.0 9e-21 MEAKAEILLVDDHALVLEGMRRMLESVSDVRVADAVTSGAKAAELIGERDYDIYVLDVNL PDISGFDLVDMIREINESARIIISTMHEEIWIINRLIRQKVNAVILKSSEAVEFENAVKS VLEGNPYTCPRFQSIRQKLSLSPVQIHSKDIPTKRELDVLKAVARGCNTHEVAAELKISE NTVETFRKRLIQKFCAKNAIDMVVKAMSKGWIELE >gi|226331995|gb|ACIB01000061.1| GENE 112 136084 - 137397 1139 437 aa, chain - ## HITS:1 COG:lin2051 KEGG:ns NR:ns ## COG: lin2051 COG1160 # Protein_GI_number: 16801117 # Func_class: R General function prediction only # Function: Predicted GTPases # Organism: Listeria innocua # 4 437 5 435 436 372 45.0 1e-103 MGNLVAIVGRPNVGKSTLFNRLTKTRQAIVNDEAGTTRDRQYGKSEWLGREFSVVDTGGW VVNSDDIFEEEIRKQVLMAVDEADVILFVVDVTNGVTDLDMQVAAILRRAKSPVIMVANK TDNHELRYNAPEFYRLGLGDPYCISAISGSGTGDLMDLIVSKFKKESDEILDEDIPRFAV VGRPNAGKSSIVNAFIGEERNIVTEIAGTTRDSIYTRYNKFGFDFYLVDTAGIRKKNKVN EDLEYYSVVRSIRAIEGADVCILMVDATRGIESQDLNIFSLIQKNSKGLVVVVNKWDLVE NKTDKVMKTFEEAIRSRFAPFVDFPIVFASALTKQRILKVLEEAHKVYENRMIKIPTARL NEEMLPLIEAYPPPATKGKYIKIKYVTQLPNTQVPSFVFFANLPQYVKEPYRRFLENKMR EKWDLSGTPINIYIRQK >gi|226331995|gb|ACIB01000061.1| GENE 113 137450 - 138331 921 293 aa, chain - ## HITS:1 COG:SA1396 KEGG:ns NR:ns ## COG: SA1396 COG1159 # Protein_GI_number: 15927147 # Func_class: R General function prediction only # Function: GTPase # Organism: Staphylococcus aureus N315 # 2 293 4 297 299 255 44.0 8e-68 MHKAGFVNIVGNPNVGKSTLMNVLVGERISIATFKAQTTRHRIMGIYNTDDMQIVFSDTP GVLKPNYKLQESMLNFSTSALADADVLLYVTDVIETPDKNNEFIQKVRQQSAPILLLINK IDLTDQEKLVKLVEEWKELLPQAEIIPISAATKFNVDYVMKRIKDLLPDSPPYFDKDQWT DKPARFFVNEIIREKILLYYDKEIPYSVEVVVEEFKEDAKKIHIHAVIYVERDSQKGIII GKQGKALKKVATEARRDLERFFGKTVFLETYVKVDKDWRSSDKELRNFGYQLD >gi|226331995|gb|ACIB01000061.1| GENE 114 138419 - 139423 905 334 aa, chain - ## HITS:1 COG:CAC3578 KEGG:ns NR:ns ## COG: CAC3578 COG0332 # Protein_GI_number: 15896812 # Func_class: I Lipid transport and metabolism # Function: 3-oxoacyl-[acyl-carrier-protein] synthase III # 
Organism: Clostridium acetobutylicum # 4 327 1 323 325 281 45.0 8e-76 MEKINAVITGVGGYVPDYVLTNDEISKMVDTNDEWIMTRIGVKERRILNEEGLGTSYMAR KAAKQLMKKTGSNPDDIDLVIVATTTPDYHFPSTASILCDKLGLKNAFAFDLQAACSGFL FLMETAANFIRSGRYKKIIIVGADKMSSMVDYTERATCPIFGDGAAAFMVEPTTEDLGIM DAILRTDGKGLPFLHMKAGGSVCPPSYFTVDNKMHYIHQEGRTVFKYAVSNMSDVSAAIA EKNGLTKDNIDWIVPHQANMRIIEAVAHRMEVPMEKVLVDIQHYGNTSAGTLPLCIWDFE EKLKKGDNIIFTAFGAGFTWGAVYVKWGYDGKKE >gi|226331995|gb|ACIB01000061.1| GENE 115 139521 - 139706 339 61 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|29349241|ref|NP_812744.1| 50S ribosomal protein L32 [Bacteroides thetaiotaomicron VPI-5482] # 1 61 1 61 61 135 100 1e-30 MAHPKRRQSKTRTAKRRTHDKAVAPTLAICPNCGEWHVYHTVCGACGYYRGKLAIEKEAA V >gi|226331995|gb|ACIB01000061.1| GENE 116 139720 - 140295 500 191 aa, chain - ## HITS:1 COG:no KEGG:BF4091 NR:ns ## KEGG: BF4091 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 14 191 1 178 178 352 100.0 3e-96 MGKFDKYKIDLKGMQADSCKYEFLLDNLFFAHIDGPEVQKGKVNVELTVKKTSRAFELSF QTEGIVWVPCDRCLDEMEQPVTSSDKLMVKFGHEYAEEGDNLIVIPEEEGEINVAWFMYE FIALAIPMKHVHAPGKCNKAVTSKLNKHLRTSGDDDAEESFGAGEDIVVEDEAEEQIDPR WNELKKILDNN Prediction of potential genes in microbial genomes Time: Wed May 18 00:32:53 2011 Seq name: gi|226331994|gb|ACIB01000062.1| Bacteroides sp. 3_2_5 cont1.62, whole genome shotgun sequence Length of sequence - 17506 bp Number of predicted genes - 27, with homology - 23 Number of transcription units - 18, operones - 4 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 426 - 725 131 ## BF2921 hypothetical protein - Prom 797 - 856 4.3 + Prom 704 - 763 1.8 2 2 Op 1 . + CDS 797 - 1099 302 ## BF2920 hypothetical protein 3 2 Op 2 . + CDS 1123 - 1254 83 ## gi|294807225|ref|ZP_06766039.1| hypothetical protein CW3_0679 4 2 Op 3 . + CDS 1282 - 1611 281 ## BF2919 hypothetical protein 5 2 Op 4 . + CDS 1626 - 1886 202 ## BF2918 hypothetical protein + Term 1911 - 1950 6.7 6 3 Tu 1 . 
- CDS 2199 - 2369 57 ## gi|293369599|ref|ZP_06616177.1| hypothetical protein CUY_2504 - Prom 2552 - 2611 3.6 + Prom 2474 - 2533 3.9 7 4 Tu 1 . + CDS 2585 - 2965 408 ## BF2915 putative single strand binding protein + Term 2984 - 3022 1.1 - Term 2972 - 3008 -0.9 8 5 Tu 1 . - CDS 3107 - 3247 63 ## - Prom 3426 - 3485 8.5 9 6 Tu 1 . + CDS 3453 - 3980 469 ## BF2914 hypothetical protein + Term 4000 - 4054 7.2 - Term 3988 - 4041 9.4 10 7 Tu 1 . - CDS 4224 - 4445 66 ## - Prom 4666 - 4725 6.6 + Prom 4367 - 4426 5.2 11 8 Tu 1 . + CDS 4447 - 4761 354 ## BF2912 hypothetical protein + Term 4795 - 4824 0.3 - Term 4774 - 4821 3.3 12 9 Tu 1 . - CDS 4838 - 5152 103 ## - Prom 5227 - 5286 3.5 13 10 Tu 1 . + CDS 5180 - 5680 387 ## BF2910 hypothetical protein + Term 5718 - 5769 9.4 + Prom 6089 - 6148 5.2 14 11 Tu 1 . + CDS 6171 - 6425 291 ## BF2909 hypothetical protein + Term 6439 - 6480 3.7 + Prom 6803 - 6862 3.4 15 12 Tu 1 . + CDS 6897 - 7577 648 ## BF2908 hypothetical protein + Term 7609 - 7652 7.0 16 13 Op 1 . - CDS 7752 - 7856 100 ## - Prom 7876 - 7935 7.0 17 13 Op 2 . - CDS 7946 - 8590 358 ## BF2906 serine type site-specific recombinase - Prom 8825 - 8884 9.6 + Prom 8728 - 8787 10.2 18 14 Op 1 . + CDS 8826 - 9875 444 ## BF2905 hypothetical protein 19 14 Op 2 . + CDS 9893 - 10339 288 ## BF2904 hypothetical protein 20 14 Op 3 . + CDS 10350 - 10745 187 ## BF2903 hypothetical protein 21 14 Op 4 . + CDS 10742 - 11299 170 ## BF2902 hypothetical protein 22 14 Op 5 . + CDS 11324 - 12184 579 ## BF2901 hypothetical protein 23 15 Op 1 . + CDS 12391 - 13848 954 ## BF2900 hypothetical protein 24 15 Op 2 . + CDS 13902 - 15344 610 ## BF2899 putative outer membrane protein + Term 15388 - 15426 4.2 - Term 15376 - 15413 7.8 25 16 Tu 1 . - CDS 15474 - 15647 240 ## BT_2472 hypothetical protein - Prom 15736 - 15795 6.7 + Prom 15994 - 16053 3.0 26 17 Tu 1 . + CDS 16117 - 16587 327 ## BF2897 hypothetical protein + Term 16596 - 16637 5.1 + Prom 16594 - 16653 3.7 27 18 Tu 1 . 
+ CDS 16763 - 17119 166 ## BF2896 hypothetical protein + Term 17193 - 17230 2.1 Predicted protein(s) >gi|226331994|gb|ACIB01000062.1| GENE 1 426 - 725 131 99 aa, chain - ## HITS:1 COG:no KEGG:BF2921 NR:ns ## KEGG: BF2921 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 99 1 99 99 196 100.0 2e-49 MSLQQVAWLCMQGFTTKYYLCEAERGDFVVNLQVTTLHTIKTWQIPLRHNSHRADRKLIC EHVGVCSVRMCVKGGYEVKEYRSEKRKEARAYLLFWLYG >gi|226331994|gb|ACIB01000062.1| GENE 2 797 - 1099 302 100 aa, chain + ## HITS:1 COG:no KEGG:BF2920 NR:ns ## KEGG: BF2920 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 100 1 100 100 162 99.0 4e-39 MANYATNIFHARTENKTDLDKIEAFLDDTFSEFTNRYGDSVDAEFSSRWVYPEEEIKKLV ESLEDKDKVYIKILTYEFEDEYVSFRIFSQGEWKVKLITE >gi|226331994|gb|ACIB01000062.1| GENE 3 1123 - 1254 83 43 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|294807225|ref|ZP_06766039.1| ## NR: gi|294807225|ref|ZP_06766039.1| hypothetical protein CW3_0679 [Bacteroides xylanisolvens SD CC 1b] # 1 43 1 43 43 82 100.0 6e-15 MVYPTLCLFYKRHTDGTIVGDFDNKITVRFICGKERVLNTVMK >gi|226331994|gb|ACIB01000062.1| GENE 4 1282 - 1611 281 109 aa, chain + ## HITS:1 COG:no KEGG:BF2919 NR:ns ## KEGG: BF2919 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 109 1 109 109 207 99.0 7e-53 MTQIAMKFVQWDVPELEKLKDSKVYKLRERLDNGDKLSREEKNWLTRNVKECCHFKRGIA LMGYRFDFSDVLKRYFVKQHGHIAEYYTIDKTALRSVLYGRIEDIIEVQ >gi|226331994|gb|ACIB01000062.1| GENE 5 1626 - 1886 202 86 aa, chain + ## HITS:1 COG:no KEGG:BF2918 NR:ns ## KEGG: BF2918 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 86 1 86 86 170 100.0 1e-41 MKVTIEHSFCPYCDEVTELYFRIINTILFSGNEAELRESMRQLEKKTPLDEYFTYGYGAR HLWVCQRRPSDKTKIFEHRIMMVEFQ >gi|226331994|gb|ACIB01000062.1| GENE 6 2199 - 2369 57 56 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|293369599|ref|ZP_06616177.1| ## NR: gi|293369599|ref|ZP_06616177.1| hypothetical protein 
CUY_2504 [Bacteroides ovatus SD CMC 3f] # 1 56 2 57 57 70 100.0 4e-11 MIASIKSDYGKWETSKPVMIRKKSKSKCRHKGETSVRKSVKNSKAKARHAWLELRL >gi|226331994|gb|ACIB01000062.1| GENE 7 2585 - 2965 408 126 aa, chain + ## HITS:1 COG:no KEGG:BF2915 NR:ns ## KEGG: BF2915 # Name: not_defined # Def: putative single strand binding protein # Organism: B.fragilis # Pathway: not_defined # 1 115 1 115 126 211 100.0 7e-54 MKKIENNFTVTGFLGKDAEIREFTNSSVARFPLAVSRQERNAEETNRISAFMNIEAWRKN ENTGSFDQLTKGTMLTIEGYFKPEEWTDKVGVKHNRVVMVAVKFYPPIEKEDVPEKPVKP VKKGKK >gi|226331994|gb|ACIB01000062.1| GENE 8 3107 - 3247 63 46 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MDGGRNLTFGKKYGYKRDWKAAQRYGIERVFFFLLKIVICLFIKWS >gi|226331994|gb|ACIB01000062.1| GENE 9 3453 - 3980 469 175 aa, chain + ## HITS:1 COG:no KEGG:BF2914 NR:ns ## KEGG: BF2914 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 175 1 175 175 313 99.0 2e-84 MKQIIWSSDALLDETAREYYQNFKREELDDDAYKVSDEEWSDEVYNELGDERQNLNKDVN GVIIAFGDLGLWNGRKQGYQILGDNIAGILQSTQYDAEWYGDGYDIRGRMSHHDGTNYVL YRVAENRDDAERIAAKIYNYEIDENGFRQVTRSLHPYVAAVYGWKTLQDNLVQVK >gi|226331994|gb|ACIB01000062.1| GENE 10 4224 - 4445 66 73 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MPQILRLIFYLFPFDTSCTERLIIFLPTGLAARTGKEDAENTLFDISERKILQQQVSAVL DRTDSRQSNFAGK >gi|226331994|gb|ACIB01000062.1| GENE 11 4447 - 4761 354 104 aa, chain + ## HITS:1 COG:no KEGG:BF2912 NR:ns ## KEGG: BF2912 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 104 1 104 104 170 99.0 2e-41 MTFREFMLENGYELQTTFWNDFSIADRFGLSAIQDTFNRAFKEWKENYKYLTELVLVLNH KIWQYYETRPEIATLYNTLWAQASQYAMEYLKDDELSYYYDVTD >gi|226331994|gb|ACIB01000062.1| GENE 12 4838 - 5152 103 104 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFFFSLFSEAFRLLQSGFDFYVHSRLPLRKSRQGIWTEFLQLRYGAYTTFERLKIGKATE KLCSNSVAVLECFSVAIPLHRKKQVLTGSSLKRNNFVLYKGKTE >gi|226331994|gb|ACIB01000062.1| GENE 13 5180 - 5680 387 166 aa, chain + ## HITS:1 COG:no KEGG:BF2910 NR:ns ## KEGG: BF2910 # Name: not_defined # Def: 
hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 166 1 166 166 323 99.0 1e-87 MEIQFVIVRSENAEYLCHNVNGTYVDVSDPSTEFVSGEDDFRLVEPDSSLTRKEYEFRGE RFYLMPQFYGNGWLALTLQSVEDETEYIVLSVNLESMDALDLPDRTFIDVNHYPDAMEFL ETNNLATYSGYKRRSGFVEYPMAVLNLPLLYQHAPQIFQEANIECF >gi|226331994|gb|ACIB01000062.1| GENE 14 6171 - 6425 291 84 aa, chain + ## HITS:1 COG:no KEGG:BF2909 NR:ns ## KEGG: BF2909 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 84 1 84 84 149 97.0 4e-35 MTIEEVLQHDLKFRYMLLGRLQADCEYYLGFGNKSSRRLWAGSEKAQIEYMTKIHDSFRE NEKPEWLTMEQIKEYSNAMEVTQE >gi|226331994|gb|ACIB01000062.1| GENE 15 6897 - 7577 648 226 aa, chain + ## HITS:1 COG:no KEGG:BF2908 NR:ns ## KEGG: BF2908 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 226 1 226 226 434 98.0 1e-120 MPNYVTNRLEINADRETVQNVMDFLKGKTDEDSTPCYIDFNNIIPMPKDLLIEASTSGEF GMQYIIAQQRKPFNSQDDLKVIQWMEIQEEKVREEALQLGMTYLRNWGKYGYPTWYEWSI ANWGTKWNAFNQNFEEPNVLWFDTAWEGVPLLIQTLSEIFPDVEFQYAYADEDLGSNVGK GTIRNGETDMTFPDNGSNEAFEIVFFVKPGLEEYLELTDEGYRWKA >gi|226331994|gb|ACIB01000062.1| GENE 16 7752 - 7856 100 34 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MGDIIIVLLVFWVVGKLLKGVFGGFSKSSFKDDK >gi|226331994|gb|ACIB01000062.1| GENE 17 7946 - 8590 358 214 aa, chain - ## HITS:1 COG:no KEGG:BF2906 NR:ns ## KEGG: BF2906 # Name: not_defined # Def: serine type site-specific recombinase # Organism: B.fragilis # Pathway: not_defined # 1 214 1 214 214 434 99.0 1e-120 MAKVGYIFKENNDSFDAEREWMQRYGCVQIVEETVEHETLRPMWKQLMANLQRGDEIVIS KFSNAARGLRELAAFIELCRIKIVRVISIHDRVDTRGELFPGTTAADVLWIIGAFPEEIA ALRKYSAHVEKLRQNIKAPAVPKVLPKAERDKTIVDMYINGHSFDDIWAASGFSSKSSIW RILNKYGVKLDRGQTSGPRVKQNPKEDGTNEGNS >gi|226331994|gb|ACIB01000062.1| GENE 18 8826 - 9875 444 349 aa, chain + ## HITS:1 COG:no KEGG:BF2905 NR:ns ## KEGG: BF2905 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 349 1 349 349 682 100.0 0 
MKILKSYYKTPSFLTGRCIVSLAILGLISCSDRNGKSLSEATNDPAGIYREYLYNIRRQK DSSFQVLTKHILQWQTVKDSVFRYLRNDTLSHPHSNQREECIRLHDSIRTEFSRLALSKT RTYQELLALKGEFSPYNNDEELHHAAGEIRPFFNSLDNLPLHKGNKEEILAAYRMLLTRT IRNGIHSRNELITYITKEDAIFRAFLSHLHDFEGESMADITRGTEQCCSQIFFAAERKEI TYREAMLYLTMRTNRRQIQNMQICIEDVRNKKIKTSSQAHAYIWMLIHPYTSLDGFSMTL LSDKERKQLDRMAAQTPVAFKTLSRILQSESGQLTELPGMLMDIFIQTL >gi|226331994|gb|ACIB01000062.1| GENE 19 9893 - 10339 288 148 aa, chain + ## HITS:1 COG:no KEGG:BF2904 NR:ns ## KEGG: BF2904 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 148 3 150 150 271 99.0 7e-72 MLRHFFNDFMTFVPLQLPQLLDVTTMEEAQFYGDYALLTFPLRDPYDLEEVMDLFEDDME LITLYHHIPTHADKFGHSTCAYSNPAFGQMFKMNCKTDADGKVNSILVTIYDSLEQMYGE LCLDLDLHSKSGTFKYKKNKDDLLMDFL >gi|226331994|gb|ACIB01000062.1| GENE 20 10350 - 10745 187 131 aa, chain + ## HITS:1 COG:no KEGG:BF2903 NR:ns ## KEGG: BF2903 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 9 131 1 123 123 222 99.0 3e-57 MRDTLYRQMVYWIREYRTWIEVVDDNFYKEYALSRNGYINYIVSRTLVLRAYKDKGSYAK GMTWTIPEHKLDKALAAYRKQEHTFKQRIKKAAIYLSPRDAEVIILLATHNIVQLELMIS PIQIREKPYYL >gi|226331994|gb|ACIB01000062.1| GENE 21 10742 - 11299 170 185 aa, chain + ## HITS:1 COG:no KEGG:BF2902 NR:ns ## KEGG: BF2902 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 185 1 185 185 302 98.0 3e-81 MIWNILQLIFCITLFVLPLTLYKSHRSFMVRFYDAMIHSVKARKLYVQVVLILLLLFHYV YISGHVGEFGVFLSTAICATIYSFRRADRLLRGLCDRSCMFVILSLVALAISFVPHLYTT AVTAAYLLLAALFYPSVRVMTEFQDIGIISEWMKFPRLLAESYYDHHHAILPQDADSGNT DISAQ >gi|226331994|gb|ACIB01000062.1| GENE 22 11324 - 12184 579 286 aa, chain + ## HITS:1 COG:no KEGG:BF2901 NR:ns ## KEGG: BF2901 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 286 3 288 288 567 99.0 1e-160 MKTKQKIAVPILADREVFDYLKEKVGERKTKTEAFCDLLDKSLAGFVSPFLRNKGYELQP NQCHVTVSDLSSEWHWHRATVRSFLDVMEEFGLLNRIRLSKSVIITMTVQTSQSTESCNG 
QKKLNLAEQLREALSDWIIGKVSLDEIGIKCEQLVRRAMDEAGICDSCPSPDSITRINPA ADDDERAVKIRMVALECITFAAIQRALRKSRFDDSAEFMDYFRLELYGDWTGLIATSKGI AGLILDVDRDENSDYDEDDREFLKTLFKPFLAFAAKAQEATYQIGG >gi|226331994|gb|ACIB01000062.1| GENE 23 12391 - 13848 954 485 aa, chain + ## HITS:1 COG:no KEGG:BF2900 NR:ns ## KEGG: BF2900 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 485 1 485 485 875 100.0 0 MANQKQVLDVQVSKGITTAQSNEHLRDRSEKAEKYAMSKGNYDPTRKRLNFEIAPGGKIH PIDTSRSIPKRMADILSHRGIKDPNEGLLEPKYRTVVNIIFGGSRQRMQELAFGTQQVDF EKGADNTRIERKRDIERWAKDVYSFVCGRYGEQNIAAFIVHLDELNPHIHCTLLPIKDSR FAYKEIFAGKDKFEYSARMKQLHTDFFAEVNTKWGMSRGTSISETGARHRTTEEYRRMLS EECTTIEDNIKLHQQVLGELQSDIRLAERRVKGLTTMVSNLEKQKTEKETLLSAAEYNLK ENKGNAAELAIQIQMLEKELQGIIRQLADKQEKLQTADRQLIELKKDMGAIEERTEELKE EAYQYSRDVHSKVDSLFKDVLLESVISEYRNASAQMNVSERQLFDGSLVQSIAERGTEIM HCATMLFLGMVDDATTFAESHGGGGGGSDLKWGRDEDEDNRAWALRCMRMASRMMRSTIG KKSKR >gi|226331994|gb|ACIB01000062.1| GENE 24 13902 - 15344 610 480 aa, chain + ## HITS:1 COG:no KEGG:BF2899 NR:ns ## KEGG: BF2899 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis # Pathway: not_defined # 1 480 1 480 480 979 99.0 0 MTKQIIFIFALLCTLQAQASVQPVQKDTVRHTIHYEVAELLQPMQPVYLNGVLLPASRTG NWFVSISGGATVFLGTPLGCEDLFGRVKPSYSLAVGKWFTPLVGARVNYSGLQFKDAQLS TQDYHYIHADLLWNLLGRRYARQEQVRWRLAPFMGVGLLHNATNGNNPFALSYGILTQYR ISKRVSAMLELSNTTTFQDFDGYGYPNRLGDHMLSLTAGFTFHLGKVGWKRAVDTAPYIH RNELLVDYGNFLSEENRRYVGRHNQDKRTLVELKKILEIEGLLDTYSHIFDNDDITGCRY PINNYSGLNSLRARLKHSYWDGSSPLDTTILQTENGKPSYNYTASRNVQSAHQDTLAMDS TVLSYADGECIGTPIYFFFALNTTHLTDTSQRLNLDELARVAKKYSLSVRVTGAADSSTG TSSINDSLSISRAGFITAELEQRGIPAKRIIRVSKGGIADYTPVEANRHTKVELFFPKAK >gi|226331994|gb|ACIB01000062.1| GENE 25 15474 - 15647 240 57 aa, chain - ## HITS:1 COG:no KEGG:BT_2472 NR:ns ## KEGG: BT_2472 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 57 51 107 107 82 89.0 6e-15 MEKTFTQICELFDQFSKDANLQMEKGNKAAGTRARKVSLELEKLLKQFRKESLEASK 
>gi|226331994|gb|ACIB01000062.1| GENE 26 16117 - 16587 327 156 aa, chain + ## HITS:1 COG:no KEGG:BF2897 NR:ns ## KEGG: BF2897 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 156 1 156 156 322 98.0 3e-87 MGKYDFIKLGNLLYWHDPDSGLSNGVYQVASIPENIEEDSVILIASDTSEAEVFPSELSP IHTGRSHKEDFLRWKTEREAEGIEFYDHLSKVMDTENDLSVGDMVAFTNDYGVIFGPCEV LAFGNLCNSGRCVYIDSDSYWFPNRPDQLTIMRGAE >gi|226331994|gb|ACIB01000062.1| GENE 27 16763 - 17119 166 118 aa, chain + ## HITS:1 COG:no KEGG:BF2896 NR:ns ## KEGG: BF2896 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 118 1 118 118 207 97.0 1e-52 MKFTTETAWFPCDETLELVCEKFPTLCYFYQSEESGLAEYWTNDQESKYFPEKYIADLCT PDDKWYKEYFVNQTEVFKWFEVISGQSVESITEILAIAEQRKDENDNSFCNIYEYAAG Prediction of potential genes in microbial genomes Time: Wed May 18 00:34:23 2011 Seq name: gi|226331993|gb|ACIB01000063.1| Bacteroides sp. 3_2_5 cont1.63, whole genome shotgun sequence Length of sequence - 22682 bp Number of predicted genes - 23, with homology - 23 Number of transcription units - 8, operones - 5 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 420 - 614 265 ## BF2892 hypothetical protein 2 1 Op 2 . + CDS 619 - 903 211 ## BF2891 hypothetical protein 3 1 Op 3 . + CDS 914 - 1285 260 ## BF2890 hypothetical protein + Term 1309 - 1339 1.0 - Term 1285 - 1337 5.5 4 2 Op 1 . - CDS 1340 - 3022 1260 ## COG4227 Antirestriction protein 5 2 Op 2 . - CDS 3028 - 3639 438 ## BF2888 hypothetical protein - Prom 3731 - 3790 2.3 - Term 3761 - 3799 2.7 6 3 Op 1 . - CDS 3911 - 5467 1135 ## COG1705 Muramidase (flagellum-specific) 7 3 Op 2 . - CDS 5479 - 7728 1396 ## BF2885 putative DNA primase 8 3 Op 3 . - CDS 7748 - 9532 1376 ## BF2884 hypothetical protein - Prom 9560 - 9619 3.5 9 4 Op 1 . - CDS 9852 - 10682 498 ## COG0739 Membrane proteins related to metalloendopeptidases 10 4 Op 2 . - CDS 10709 - 11353 465 ## BF2881 hypothetical protein 11 4 Op 3 . 
- CDS 11366 - 12052 483 ## BF2880 hypothetical protein 12 4 Op 4 . - CDS 12065 - 12760 571 ## BF2879 hypothetical protein 13 4 Op 5 . - CDS 12741 - 13226 423 ## BF2878 hypothetical protein 14 4 Op 6 . - CDS 13239 - 14930 1242 ## BF2877 hypothetical protein 15 4 Op 7 . - CDS 14935 - 16470 1127 ## BF2876 hypothetical protein 16 4 Op 8 . - CDS 16529 - 16753 301 ## BF2875 hypothetical protein 17 4 Op 9 . - CDS 16774 - 17532 736 ## BF2874 hypothetical protein 18 4 Op 10 . - CDS 17606 - 17755 83 ## gi|256840294|ref|ZP_05545802.1| predicted protein - Prom 17980 - 18039 5.2 + Prom 17502 - 17561 8.2 19 5 Tu 1 . + CDS 17775 - 18413 228 ## COG0739 Membrane proteins related to metalloendopeptidases 20 6 Tu 1 . - CDS 18446 - 19282 319 ## BF2872 hypothetical protein - Prom 19447 - 19506 6.4 + Prom 20025 - 20084 5.5 21 7 Op 1 . + CDS 20279 - 20845 518 ## BF2868 putative ribose phosphate pyrophosphokinase 22 7 Op 2 . + CDS 20847 - 21350 195 ## BF2867 hypothetical protein + Term 21393 - 21425 -0.8 - Term 21630 - 21665 5.1 23 8 Tu 1 . 
- CDS 21692 - 22177 404 ## BF2865 hypothetical protein Predicted protein(s) >gi|226331993|gb|ACIB01000063.1| GENE 1 420 - 614 265 64 aa, chain + ## HITS:1 COG:no KEGG:BF2892 NR:ns ## KEGG: BF2892 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 64 1 64 64 130 100.0 1e-29 MRTKTLYRCDAQKIDISRFPNFHITGSITGMKKLYYGKNALLVRCGSWIYNVSSEPEVYY NIAH >gi|226331993|gb|ACIB01000063.1| GENE 2 619 - 903 211 94 aa, chain + ## HITS:1 COG:no KEGG:BF2891 NR:ns ## KEGG: BF2891 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 94 1 94 94 186 100.0 3e-46 MKKGYKKDFQSWKGIVTLKLLCCNIAAGRFDWKKYCTPQPYCGQEICVIPLHCSYGQIGY TVYFPYSDMPEVEYDWEMNKLTIDKENWENYLQN >gi|226331993|gb|ACIB01000063.1| GENE 3 914 - 1285 260 123 aa, chain + ## HITS:1 COG:no KEGG:BF2890 NR:ns ## KEGG: BF2890 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 123 1 123 123 238 100.0 7e-62 MAQNFYTKWQNAILADAGVYVSKKYRSFQTALVREISKYATAVGAKVTFNLKGHYNTSCF IERNGKFVYISHSSGLSRMGSGVKIELDSFLIRTAQHAKDYRGGHNQYCDITNLQSMIDN LLE >gi|226331993|gb|ACIB01000063.1| GENE 4 1340 - 3022 1260 560 aa, chain - ## HITS:1 COG:XF2061_1 KEGG:ns NR:ns ## COG: XF2061_1 COG4227 # Protein_GI_number: 15838653 # Func_class: L Replication, recombination and repair # Function: Antirestriction protein # Organism: Xylella fastidiosa 9a5c # 23 350 224 516 522 95 28.0 3e-19 MAGYKKQHTDGPNSEDKALDLFAEMMIEKIESIRKDWRKPWFTEEALQWPCNLSGREYNG MNAIMLLIHCEKEGYKIPRFCTFECVQRLNKSDKDNQEKPRVSVLRGEKSFPIMLTTFTC IHKDSGEKIKYDDYKKLSDNEKKEYNVYPKMQVFRVFNVAQTNLQEARPELWQKLEKEYS LPKIENGEYFSFAPVDALIKDNLWICPIKPQHQDNAYYSISRNEIVVPEKEQFKSGEAFY GTLFHEMTHSTGAEGVLDRIKPTTFGSAEYAREELVAELGSALVAQRYGMTKHIKEDSCA YLKGWLDELKESPQFIKTTLLDVKRAASLITQKVDKIALELEQNIDEEQTVAPKEKVYYS SVAYLQLTDDTMRLDAFKDKGDYEGLLTLAKEYYDGNGINEEYTYSSPIQNRGDNLLIED KDFAVVYNGSVGGTYEVMLKFTEKEVCDHIRRYGIEHAGDTLKGVAKEMAAEQFAIMTQQ KIPAFEMPNGDVLYVSYNKESDMIDIGPVTNAGLVAQHRFPYDHNASLDANLQTVNEKLN NMEEYREELQEAEYSGGMRR 
>gi|226331993|gb|ACIB01000063.1| GENE 5 3028 - 3639 438 203 aa, chain - ## HITS:1 COG:no KEGG:BF2888 NR:ns ## KEGG: BF2888 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 203 1 203 203 404 99.0 1e-112 MIKCNVTVCGVIGRDASIRTNKEGKTFLVFPLRVMIPDTDGKTMPIEVDVSKDTAGKEVS KYRNGSRIEVSGTMYLKHRGDKLYFNLFINEIRTATADAKDTVKGELVFRGKVGQHIEEK RDKKDQPYTMFSAFSTEKVEDGFEYQWVRFFCFGKEREAWLQPGVRVDAKGEISLSAYNG KLNVSCKVEELVQYVADSSNSNQ >gi|226331993|gb|ACIB01000063.1| GENE 6 3911 - 5467 1135 518 aa, chain - ## HITS:1 COG:SPy1438_1 KEGG:ns NR:ns ## COG: SPy1438_1 COG1705 # Protein_GI_number: 15675348 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Muramidase (flagellum-specific) # Organism: Streptococcus pyogenes M1 GAS # 22 158 18 155 174 76 37.0 1e-13 MSKNQQYAMKYAEYAMEQMRRYGIPASVTLAQGILESSNGQSRLAQNENNHFGIKATPAW IAEGGRYGIYTDDKPNEKFCSYDSVGDSYEHHSRFLKENSRYAQCFALSPDDYKGWTQNI EQAGYATGGEYAESLQRIIEQNGLQQYDKLVMQEMETQGKRFGTEHNPLRTSENSEYGAK YSFPVEREEFLFVTSPFGMRQDPMDNTKQQMHKGIDIRCNGDAVLATENNGKVVAVNQNK NTPGGKSLTVEYTRTDGSKVQCTYMHLKEVTVKVGDVVQAGGKLGTSGNTGTRTTGEHLH FGVTNFYADGTKRDIDPAAYLTEIAQKGNIKLEVLHNGNSLLTRYKGTEENAAGKNLSPD GWMKKLLSSEDSGVGMSGCNDPIVEMAMTAFSSLMLLAVQIDNKNEEEQKTAISKQMDSG RINLKSLLPGMKNCELAISENGKAILRVNNGELRMSRELTTAELSRLSATLNNNTLTEEA KRIRVTGMLNTVILSEAASQNFEQGMSQQQGQTENLKR >gi|226331993|gb|ACIB01000063.1| GENE 7 5479 - 7728 1396 749 aa, chain - ## HITS:1 COG:no KEGG:BF2885 NR:ns ## KEGG: BF2885 # Name: not_defined # Def: putative DNA primase # Organism: B.fragilis # Pathway: not_defined # 18 749 1 732 732 1422 99.0 0 MKEKSQIEKKAEEKQITLLSTALSEASNAGGHWLNASGKGYPRFYPKGVSVSAFNALFMT LHSDKNGCKTNQFTLFSDAKAQGASVRENEQGVPFLFYNWNKYVHRNNPEQVISRDDYMK LYEEEQKLYKGVHNREIRTLFNIDQTTLPYVDKERYETTLRRYGSAVERGYTEADNRRLH IQFNDFLLRMRDNLVPVRLDGSGVPHYETDKDAVYMPRQREFRHYHDYIQEALRQIVSAT GHQQRLAREGMVMKNGVAPSEDAVRQERLVVELASGIKMLELGLPARLSEESLKTVEYWC RELKENPNLMDALESDVNNAIEVINKAERGEKIEYATMRNRRDTSTMQEQMPKHYFVSNE 
IRQHPDKETKKIVLVIDPQAKTADVILPAGASTEADNEIPGMNKGRIMRALQKDGIEQVR FYNTDGALGYRPDDSYFAEKIIMLARLKNWAMEKLSTLDVASAVKQANEIGFDHVEMIQD DKKRWALYIKPENKSGYSIYPDKEDINRFFSTLKQAMDNIGKVRMELAHKYYALAEVKPD LKVDLFSSEMPEIDLNRIQRVSVFKTKQDGIQCVATIDGQKQPARSVTPQQWQRMWIAEE RDSYKRHLAATLFADVLQKGQSQEAHTGEKQQKEAELWPIETVAQERTESDNKGISPERQ LWDKLKANHPDALQLLRTKDGYRLYNEDAVQGAKILGITLKEYPEGDITASTEFSTEQLD NYLSKLVRAGARVAISDMEEQETHRGFHR >gi|226331993|gb|ACIB01000063.1| GENE 8 7748 - 9532 1376 594 aa, chain - ## HITS:1 COG:no KEGG:BF2884 NR:ns ## KEGG: BF2884 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 594 1 596 596 1121 98.0 0 MDGIKHSGRFAEMERLVNDYFNCHIAPIMSKTRTDLIRNQGEEMKEYSTSLGGILSMMAS SAQPMSDPYQALKVTGEWNSKTTEDYIEMCKTEITGSEEMQQDLAYMAGQWRDTVVQEIG RARYNELSEQLGCDLAYAYMDHRIEELMIDRLVKERMPKSSADYIIRKAAESSLLGLSQT LSRSPLTDEIEARGEAAYRPNRWEKGTGWMLGTAADTLMMGGTGSWTTLAKFTGADVAIA AITNHFEGKKPDTLSVEQCISKGVFGRDRNVFDDFRKEAAQIQIKENTAIGTDNKQLKKK IPVMDFGFMEWTQNQNSGLLWPNVQSKEEQKYEERYKDVPIVVAPGQEEAYLQSLEQCDK AKMVRTEQDEIMKEEKHETVVPANDAEQHVQTIQSAQVAQGNGWGGLLGMLGLDGIGNIT GNLGYVMAMLPDVLLGIFTGKTESLHLEDNMLPIASIVAGMFVRNPLLKMLLIGLGGMNL LNKAGHEALKERTEGKLNVTNENNVQYRRYTNETLNPRIVNPVLQDSTLIATIDRVPCTI QLSPIVAEAYRTGALPLNTLANAVLAKNDQLRQAAARNYEDGKLETIVRPRGIQ >gi|226331993|gb|ACIB01000063.1| GENE 9 9852 - 10682 498 276 aa, chain - ## HITS:1 COG:TM0409 KEGG:ns NR:ns ## COG: TM0409 COG0739 # Protein_GI_number: 15643175 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Thermotoga maritima # 29 135 140 254 271 62 33.0 7e-10 MKYTEEMILQSESGYCMPFEERKGKDVKLSLGYGEQTDPTTGKTYFHHGIDFDVRCYTLA AVASGIVSGIGNDPILGICQTIRYGEYEVTYGHLSNVFAQFGQRVKAGQTVALSGDKLHI GIRFKGEELNPLEFLTMLYGNIQALCHADGGEAATSPNMEMALTTDYEQDRQEIEELMLR FLPYYMEDLQRGAYRLPPHTEQSLRHVFTMGAVKEYFYENMPSISNPLGLGHKAMPLACK VQNLLIADFLHYLALRHGVYLSTMGDDVKKNSTTKP >gi|226331993|gb|ACIB01000063.1| GENE 10 10709 - 11353 465 214 aa, chain - ## HITS:1 COG:no KEGG:BF2881 NR:ns ## KEGG: 
BF2881 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 214 1 214 214 355 100.0 8e-97 MGIWDSILKYGGKAAKATGRSMGHAALHPSQTLRGTGQAVKTAAIGGAVGYVGWEKLTTD KSVVHIVSDAVIGKSATDTLADAADGVRELTSKAGEAVGSVSGTVAGIDSKLGGVSNFLR QVSNGGVSDMFGNFFRNLGQGNVSGLSIAGLVAAAFLIFGRFGWLGKIAGAFLGMMLIGN NAGIFRTPDTRSIQRIQTPALPVEEQTNSGGMKR >gi|226331993|gb|ACIB01000063.1| GENE 11 11366 - 12052 483 228 aa, chain - ## HITS:1 COG:no KEGG:BF2880 NR:ns ## KEGG: BF2880 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 228 1 228 228 431 99.0 1e-120 MKWTIWISILLLCLTGIGEVQAQNDPVLAGMIAVYTEKAEKELKNQEKVMLMQTTGHIWT KEEVQATTDLQREFNNYLNSFRSIVCYAAQTYGFYYEVSRLTDNMGDFTKQLKRSPANTL AVALSTQRNKIYRELMMNSVEIVNDIRTACLSENKMTEKERMEIVFGIRPKLKTMNTKLQ RLTKAVKYTTMGDIWREIDEGARPEADKRSIVDAAKRRWRQIGKNVRP >gi|226331993|gb|ACIB01000063.1| GENE 12 12065 - 12760 571 231 aa, chain - ## HITS:1 COG:no KEGG:BF2879 NR:ns ## KEGG: BF2879 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 231 1 231 231 432 100.0 1e-120 MKSRIISLGLVASLVCLLPQVAEAQIAASNPLEWTALAEGNELINGQIEKQIKGQTQTAL LQNSIATEFNQIHKWEKQYNSYLKTASGYASSLKACTHLYNDGVRIFLTLGKLGKAIQNN PQGIVASMNMNNLYIETATELVSVFTLLNDAVAKGSNENMLTGAERSKTLWALNDQLSDF SRKLHLLYLSIRYYTFNDVWNNVTAGMLDRDNGEAARMALSHWHRAAALVR >gi|226331993|gb|ACIB01000063.1| GENE 13 12741 - 13226 423 161 aa, chain - ## HITS:1 COG:no KEGG:BF2878 NR:ns ## KEGG: BF2878 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 161 1 161 161 273 99.0 1e-72 MQRTCHKTIVIHPHTGQFVINELPTLVLCGTVWVYGGMEGLPLTAIATVIALILSLLLLY RYLYLRRIRYCIGTEQLVSEYGIIRRKVDYMELYRIVDFQEHQSLLQQFCGLKTVRILSM DRNTPRLDLIGIFHRDDLVSIIRERVETNKRKKGIYEITNH >gi|226331993|gb|ACIB01000063.1| GENE 14 13239 - 14930 1242 563 aa, chain - ## HITS:1 COG:no KEGG:BF2877 NR:ns ## KEGG: BF2877 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 563 1 563 563 1144 99.0 0 
MRDGDLTYDDFLQRLNIQDVLIDAGYHLNRRDGLRYPSYVRLDSDGRRIRGDKFIVTQQG KCCFHAQQQKVYNIISFIKEHPHFFTEYHAGMSPDRLVNLVCNRLLNIPVTERKTRIVNP KRDVKPFDIADYDIHKFNPQNRETQKKFYPYFKSRGIDLYTQYAFHRHFYLATKHREDGA TYTNLSFPLTLPKGDGAIVGLEERGRARMDGSGSYKGKAAGSNSSEGLWIASPARTSLTS AKHIYWFESAYDAMAYYQLHQAQNKDLRKAVFISTGGAPSQQQFKGAIKATPHASHHLCF DHDRAGQVYAIHFALTHAGWNFSTCLSQTGRLIVQNNSEDYSQYEIELEPFNFEKITAIL GINDAKQNLKNGERDDMGIGDGYLQEMRMVCMDEYDMARDEGSASEEELEKMRSNLEAIE KAIDASISGPEATGCILYESAAEGYKDWNDQLLGKRIKPEKDNLDDWEISGKATLNHALS DLPEVNPEHIRNGLYDEADHEAVRKRLERADRVIFSFETNDQGMSDKGFQEMYKIREELA RLEVDITNSLSGMREDFHSRFHR >gi|226331993|gb|ACIB01000063.1| GENE 15 14935 - 16470 1127 511 aa, chain - ## HITS:1 COG:no KEGG:BF2876 NR:ns ## KEGG: BF2876 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 511 12 522 522 999 99.0 0 MDLQPEMRNLLMRNGLQAHVAFDGGGYRLIVQGHDSPLLVYPITERQMLALTDWGTNTAN KKAYNILTSIIGKDFYMPKNFVHARNANGRVAMGLHGYRIGIGEYGHMGRLGMPPPFLGW TPREQWGFHLRRVGGQLFFPGPSIVPERPDGRMKPGELQSGGYGFYYKGGQQEQPIVQQD VLKNLQEVITPLVSRPRSKEPAQAYKELIASPVYFSNEKWSECLTSHGLIVDMERRTLTV QSESVNADMVYDLTEEEVKKLATASIEEQPVEKRLDLLNGIIGADFADKVTMEQLNSEQR ISIGLHPEVRHELEERQRQEQELFMQQETPMRQEYIQGSIGAAVDGRDLQLLNESKGWYR EEKHGREVEVSDIAVQPAQTEGKYKMSAVIDGQVISHEISQKDYDKFLAVDDYHRMKLFS KIFNEVDMKTRPEANKGLGVKIFAALTAGAVVASEVAHGFHHHHSPEFYGERFSGPPHPY FKPGVDTPRDVAIRNFEAQMNQDINEMRRGR >gi|226331993|gb|ACIB01000063.1| GENE 16 16529 - 16753 301 74 aa, chain - ## HITS:1 COG:no KEGG:BF2875 NR:ns ## KEGG: BF2875 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 74 1 74 74 122 100.0 3e-27 MAQQKYYPEEILVEKMQSGEYGWLDYINHFSAEWQEEYAHYCEEKGLTVCEDSAAGFVRF KDEQLEAAMKYGNA >gi|226331993|gb|ACIB01000063.1| GENE 17 16774 - 17532 736 252 aa, chain - ## HITS:1 COG:no KEGG:BF2874 NR:ns ## KEGG: BF2874 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 252 1 252 252 515 99.0 1e-145 MKKEQFEFNELPYPTLARFGLTQEMIEDLPLCILKEIGKGGYSPVLPMRVTNENGEVIES 
RSRFAFIRMDSGEVDVVFYPTLKSSPLECYNEEQQKQLLDGKSIIADVAMADGRRSKAFV QIDEETKQVMYVPTPIIARNLKVLAEVMHLGTVEVNGMQHGEPLTVAVDGEPVTVGIDLH NKTGIRFCAGDAQKWKEQPKREWDKYTFGCYGCWVMNDDGNLDYVPEEEYTEELWNEQKK SGERNRMAGIHK >gi|226331993|gb|ACIB01000063.1| GENE 18 17606 - 17755 83 49 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|256840294|ref|ZP_05545802.1| ## NR: gi|256840294|ref|ZP_05545802.1| predicted protein [Parabacteroides sp. D13] # 1 49 17 65 65 82 97.0 1e-14 MAAKLARSSYGLSIADCFLSKYNYLINFEYDYLRKIIFKTLKIPLYNHL >gi|226331993|gb|ACIB01000063.1| GENE 19 17775 - 18413 228 212 aa, chain + ## HITS:1 COG:TP0864 KEGG:ns NR:ns ## COG: TP0864 COG0739 # Protein_GI_number: 15639850 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Treponema pallidum # 75 191 423 540 546 93 41.0 3e-19 MMTGVSLCITPAKAQFNTVAVTPTRYKMEVLDMGLDQAEPASESEISVQEVSTGIPVSAD MDKKKWMDRYLSVSYPLRYIKVTSPYGYRKDPFTGKSKFHGGLDLRARGDKVMAMMEGVV VKVGQDKTSGKYVTLRHGRYTVSYCHLSKILIVKGAIVHPRDVVGITGSTGRSTGEHLHI TCKLNGRSISPSLIFDYIQSIRQECISALAGL >gi|226331993|gb|ACIB01000063.1| GENE 20 18446 - 19282 319 278 aa, chain - ## HITS:1 COG:no KEGG:BF2872 NR:ns ## KEGG: BF2872 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 278 1 278 278 510 100.0 1e-143 MNARLLFILLLFTIGALWYIIGYWRRKAGEAAFAAARLTEKSEERERYCRLAVMAGHREA CRMFCLLHPERFDGHSPHKPFKLRGIRISFYGYYYPSRYNALLNDEQRAFCHSIYQFKQG DIHGIEFFKTCMNALQLKKRPYHIMFMPCSNWIKYGQRFKRLDWYIGKHRQDLTSGLYDV DICDARESLHEAKGGEKRILERNYLITGKIKGKEIIIIDDVLTTGQSVVDYKEEIERCGG KVVAAIFYGKTLSMPSMLLVQIHVWGNHIAHIIERMTK >gi|226331993|gb|ACIB01000063.1| GENE 21 20279 - 20845 518 188 aa, chain + ## HITS:1 COG:no KEGG:BF2868 NR:ns ## KEGG: BF2868 # Name: not_defined # Def: putative ribose phosphate pyrophosphokinase # Organism: B.fragilis # Pathway: not_defined # 1 188 1 188 188 373 99.0 1e-102 MAKTIDIELQKQLDKPQAWFCKYFPARIRNVSEREIADRKLVFDFKDGRAYEEVAQRTAA NMTERYGTSCTNIVFSPVPASTDKKNEIRYKAFCQRVCELTGAINGYDHVSVSGERLTIH 
ENRKAEKEIRKVNVIEFDSAFFNGRSVVVFDDVITKGLSYATYANQLESLGANVLGGIFL ARTHYKVK >gi|226331993|gb|ACIB01000063.1| GENE 22 20847 - 21350 195 167 aa, chain + ## HITS:1 COG:no KEGG:BF2867 NR:ns ## KEGG: BF2867 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 167 1 167 167 333 98.0 1e-90 MTQTKYDKAVSVCFSGHRNIPFLYRKQLKLQLKAAITKAYAGGYRHFYCGCAMGFDMLAA EVALALQSELSGLQVIAVVPYRGQSERWNDAMKARYDTILCNSDDVIILSEHYYHGCLLR RNDYMVFHSSSLIAWYDGNPKGGTFYTYRKATANGLKVLNLYGSSIV >gi|226331993|gb|ACIB01000063.1| GENE 23 21692 - 22177 404 161 aa, chain - ## HITS:1 COG:no KEGG:BF2865 NR:ns ## KEGG: BF2865 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 161 1 161 161 320 100.0 1e-86 MKQISLHVYQSIDGCPAPSDKHFDAAVDASSCVLIDEETYLRIYMNHLGWPITAKETLVV TNGGIDLTENERVKFIQGDAVAELRQMKEDGDGMVVAYGEEIGALLLDNGLADEITVTTV PVLIGGGEKALECGLNDGRAWIVRSNKVLVDGKMRTVYGKV Prediction of potential genes in microbial genomes Time: Wed May 18 00:35:35 2011 Seq name: gi|226331992|gb|ACIB01000064.1| Bacteroides sp. 3_2_5 cont1.64, whole genome shotgun sequence Length of sequence - 2901 bp Number of predicted genes - 5, with homology - 4 Number of transcription units - 3, operones - 2 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 30 - 89 2.8 1 1 Op 1 . + CDS 333 - 554 262 ## BF2863 hypothetical protein 2 1 Op 2 . + CDS 567 - 1139 394 ## BF2862 hypothetical protein + Term 1225 - 1274 9.2 + Prom 1358 - 1417 4.2 3 2 Tu 1 . + CDS 1520 - 1711 97 ## + Term 1887 - 1919 1.3 4 3 Op 1 . - CDS 1652 - 2137 588 ## BF2860 hypothetical protein 5 3 Op 2 . 
- CDS 2143 - 2700 568 ## BF2859 hypothetical protein - Prom 2750 - 2809 7.3 Predicted protein(s) >gi|226331992|gb|ACIB01000064.1| GENE 1 333 - 554 262 73 aa, chain + ## HITS:1 COG:no KEGG:BF2863 NR:ns ## KEGG: BF2863 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 73 1 73 73 140 97.0 2e-32 MKTEIRQNGKVILSSTDDISIPMIFKNLCGKNFSGNDYQNYLRTVCQDIGVTTGAIEYYA DNVLIEKATIPDF >gi|226331992|gb|ACIB01000064.1| GENE 2 567 - 1139 394 190 aa, chain + ## HITS:1 COG:no KEGG:BF2862 NR:ns ## KEGG: BF2862 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 190 1 190 190 394 100.0 1e-109 MKRKEKAAVDEFVALVGGKTWEPVRHKCMGKWSGMTDYGFVIDGRITLFVSNSMAYFKKR IREWIKSIHTFEAKKDCYLRLLREQIEKDNDKAKDEKLNPVRLIDIGILSPESNSPFDFF APYVLVEINGRRFKHQTAELSCAIMVDSLAGYLEECNCKDIYTARAVRTPDYIFCGVRFD SRDNMYKIGK >gi|226331992|gb|ACIB01000064.1| GENE 3 1520 - 1711 97 63 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MESGVDVFNDNLPKYKPSFANRNDSFLEILVSKPILSIPYDLTPLFNNAITSFSCIIEHL CLS >gi|226331992|gb|ACIB01000064.1| GENE 4 1652 - 2137 588 161 aa, chain - ## HITS:1 COG:no KEGG:BF2860 NR:ns ## KEGG: BF2860 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 161 1 161 161 291 99.0 4e-78 MDDKTGALEKLKAIMAKAEQVDMSSVKIGDIIYVPLDEEDGLILKDGYKDRNKYIVIIGF TPEGVAIGALLINSEIDSSKRSEELLDCQYPLMVRNYRDILDYDSWLDCSDIFELSKLKI TEKNGKLKGCLISEDRERVMQFLRETEVFDNATKRRYGIIK >gi|226331992|gb|ACIB01000064.1| GENE 5 2143 - 2700 568 185 aa, chain - ## HITS:1 COG:no KEGG:BF2859 NR:ns ## KEGG: BF2859 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 185 1 185 185 342 100.0 5e-93 MKTAIQIAKIRAVVLYIMQSFTQGVDYIKLFKILYFAQQDHLVKYGKVIVEDSFRALKHG PVPAYTYKALQIAEGKPLDGNFDEFLSDIEVRDKKVYTSAVPDMDYISGANKRCLDAAIA KYKDTDPYDLSDLSHDSAWEEAMTRIQDDPQKNFITIIDIARAGKATKDMVDYIREKQIV KNALS Prediction of potential genes in microbial genomes Time: Wed May 18 00:35:50 2011 Seq name: 
gi|226331991|gb|ACIB01000065.1| Bacteroides sp. 3_2_5 cont1.65, whole genome shotgun sequence Length of sequence - 2608 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 44 - 103 2.8 1 1 Tu 1 . + CDS 340 - 2608 1358 ## COG0827 Adenine-specific DNA methylase Predicted protein(s) >gi|226331991|gb|ACIB01000065.1| GENE 1 340 - 2608 1358 756 aa, chain + ## HITS:1 COG:pli0004 KEGG:ns NR:ns ## COG: pli0004 COG0827 # Protein_GI_number: 18450290 # Func_class: L Replication, recombination and repair # Function: Adenine-specific DNA methylase # Organism: Listeria innocua # 11 293 422 677 756 125 32.0 3e-28 MSYNKLKSLVANVEAIETAMKIQVQGRQATAEEKEILSRYSGFGGIKEVLNIGTDKPIGG DMQEPIQRLQELINAYPHFTEPMRHNVIEGIKASVLTAFYTPKFLVDAVVRQIHATFSEN GLKMRSFLEPSAGIGGFLPIAMSGTYDYAIEKDLISGMILSLLHENTLTRTAAFETIGEQ GFEHTTFDVIASNIPFGNFRVFDAELWKKGGMYEQATKTIHNYFFVKAMELLNEGGLLAF ITSRGIADTPGNKFVREYLVNHADLISAIRLPDMLFMQTSGIEVGSDLLIFQKHTHKTVL SQREQLFLQVGREKADAIGTMTEYANKLFTMPKTTLATGSRIVQNQYGKYVRKYQWQGNE NAMSQYLAALLKLDFGRYFRKSLFTGNGQGSEHMQMSLFGNVAMKQVEKGKRAYTDGVEA WMKDGAMVLFEGQVGTIQYRKSSLYQEVAIDFVPVDEGKVNTDRAKDYFPIRKAYFELSI KEREEQKEDNGLRRELNARYDAFVAKWGCFHENDNKEFIMLDSLGVEVFTIEMQLGKDLV KSDIMREPVAFKKIDPNKRLTPIEALASSLNFYGKVDMDYLMQSTDSAEEEIIGDLKGEI FYNPAIGEWEHKGKFLSGNVIAKCKEIGSYLSELTDREKDWTETAVRALADATPEAIPYE ELDINMGERWIDTKLYADFATELFKVETSVMYFDVNDTYMVRLQSYSPVAYNTYFVRNYN GEDLFVHALHDTVPEITKKIYRNGDKVRVPDEEAIQ Prediction of potential genes in microbial genomes Time: Wed May 18 00:35:53 2011 Seq name: gi|226331990|gb|ACIB01000066.1| Bacteroides sp. 3_2_5 cont1.66, whole genome shotgun sequence Length of sequence - 8249 bp Number of predicted genes - 7, with homology - 7 Number of transcription units - 3, operones - 2 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 2711 1655 ## COG4646 DNA methylase + Term 2732 - 2780 9.1 - Term 2679 - 2716 -0.4 2 2 Op 1 . 
- CDS 2953 - 4842 702 ## BF2857 hypothetical protein 3 2 Op 2 . - CDS 4853 - 5071 97 ## BF2856 hypothetical protein 4 2 Op 3 . - CDS 5065 - 5454 284 ## BF2855 hypothetical protein - Prom 5544 - 5603 4.6 5 3 Op 1 . - CDS 5787 - 7271 752 ## COG1401 GTPase subunit of restriction endonuclease 6 3 Op 2 . - CDS 7321 - 7794 222 ## gi|253567332|ref|ZP_04844781.1| conserved hypothetical protein 7 3 Op 3 . - CDS 7799 - 8239 194 ## YPTS_3419 YD repeat-containing protein Predicted protein(s) >gi|226331990|gb|ACIB01000066.1| GENE 1 3 - 2711 1655 902 aa, chain + ## HITS:1 COG:AGpT188_2 KEGG:ns NR:ns ## COG: AGpT188_2 COG4646 # Protein_GI_number: 16119916 # Func_class: K Transcription; L Replication, recombination and repair # Function: DNA methylase # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 1 757 433 1182 1315 315 31.0 2e-85 AATKIQEIRDRFNCWLDRQPIEVRDELVRVYNERFNCYVRPHYDGSAQTFPQLSFGQFPY DSLYPSQKDAIWMIKQNGGGICWHEVGTGKTMIMCVTAYEMKRLGLVHKPLIIGLKANVH EIADTFRKAYPTAKVLYPGKEDFTLANRQEVFSKIKNNNWDCIILTHDQFAKIPQSEETM IDIFTEELADVERNLEFLEQSTMRYRSGKMQEGLEKRKQNLGAKLQELRMKINNRKDDAV DFHTMGIDHIFVDECHIFKNLMFQTRHNRVAGIGNTKGSQRAMNLLFAIRDIQLRTGRDL GATFLSGTVVVNALTELYVMFKYLRPQELQRQRISCFDAWAAIFTKKTADYELNVTGSVK RKERFRTYIKVPELAMFLREITDYCTADMINLDVPEKNVRFLSYPPTIEQEEMIGRLVSF AGSGQWEDLGLDVPQPDNLDKAKMLVATNVARKMALDMRLLGCKFKDDADNKASICARTI YDYYIRSNDNRGTQFIFSDLGTYKPNEWNIYADIKEKLVQLGIPADEIQFIQCATTERAR KKLFEEMNNGKVRVLFGSTTMLGTGVNAQQRAVAVHHLEIPWRPADMEQRNGRAVRKGNT VKLWGGNVVDIVIYGTEKTLDAYKFNLLRNKQMFINQINNGTIAVRRIDEGGMDEDSGMN FAEFVAILSGNNDLLNKTKLDNKIMQLEKEQAIFKKERIRAERKIAAGQGEIEKAKRTEA DFKRDLEYINSYNGAKATLLLNLPQASTEEVGRELHRIAKTYRNGAYGTVGTYAGLNLLV HSEYNMDGTFDRNTFFVEGISGLKYRCGLSGALPLGFVESAQYPHGALSKLPSLIEKQQK AVERIESEIPTLQDIICRQWSKADELSRLKQECKELQHRIDESLKEAEQPQAAKHEAIAE AA >gi|226331990|gb|ACIB01000066.1| GENE 2 2953 - 4842 702 629 aa, chain - ## HITS:1 COG:no KEGG:BF2857 NR:ns ## KEGG: BF2857 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 629 1 629 629 1224 
97.0 0 MNTLILDFLNSYADNTNPQYAVMLKGKWGCGKTHLIKEWKKKFDGTADTDEEITLKPIYI STYGMDSVNDIKTAIDRELNPFFYSKTGRFIKGVLKLAGKVVFKTSMDFNEDSKEDGSFS ATLDSLSLLQVKDDSIKGVKFLIFDDIERCLIEMKELLGFINYFVEHCNCHVVVIGDENH LEKLPKAVWDEFKEKTIGREFEIQPDIEEAIEYFLGEVPVSDYLKEMRDFIIACFMCTKS DNLRVLRQCLYDFKSHLNKLPPELVEKDNIFLKNILGSFIAVYAEYNNSENKEVICNWSR DCRISLLQDNNEDKQRIHHLREKYHSLNKELIYNVLNPEYVTAIIQYIITGAPLVEFIVT EIRDKQKELKPWEMLSGFFDMEQQKLERICQDTITSILDREIKDAYQLGYSIAYLSYFHA IGIFYFIQAYVSLIKVRIAEMIDSQTCLEELYQLRGLFISGCNYVTTDSKTPITDDIVDY FLQKMKSKINELPDQMQKALRNLTEGTVEQLIIIDRLPYPDKSCTYELRAIFAYEDANAL FDAICKLSNKSRNTFTQFLAYHYNFDYDLQDVGDRYKADVPCLLKLKDLVGNEISISKGV DKLAFIRLKDALIEAIRRCEGKSDTLPPM >gi|226331990|gb|ACIB01000066.1| GENE 3 4853 - 5071 97 72 aa, chain - ## HITS:1 COG:no KEGG:BF2856 NR:ns ## KEGG: BF2856 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 72 6 77 77 127 98.0 2e-28 MVRLTIPASLYWIFQRRNMKFSCLDIIVLRNITNFVHCGCGDNGFAASFTISIFNVLIAT FLFFNGVKIELC >gi|226331990|gb|ACIB01000066.1| GENE 4 5065 - 5454 284 129 aa, chain - ## HITS:1 COG:no KEGG:BF2855 NR:ns ## KEGG: BF2855 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 129 1 129 129 234 99.0 1e-60 MAELKATVCLQGKDIEVISSHIEFNRKTDNKGRPVTNVIGGRITITVESTRETTILEAMV NSPFKAISGKVIYYNTKDNSIFRVVEFKYAYIVYYKEVYNVDRKRQMYSTITFSADIIMI GDAYLSNQW >gi|226331990|gb|ACIB01000066.1| GENE 5 5787 - 7271 752 494 aa, chain - ## HITS:1 COG:DRB0143 KEGG:ns NR:ns ## COG: DRB0143 COG1401 # Protein_GI_number: 10957435 # Func_class: V Defense mechanisms # Function: GTPase subunit of restriction endonuclease # Organism: Deinococcus radiodurans # 6 181 9 179 969 123 35.0 8e-28 MNCDFTWIPFYKELSDWLLGKQNSQPELISTLKEIGISGFRDGSEGGKEIVLEEIDPFTF FSYLNKFHSDERRVEILQDLRRKLNFSCPEPTDVSGIPTTHPMKVHLFPWKTIRGNNDIN VLWELFGQVKGGKVDERLFQTALNIKSVGKGKLSIVLFYANPERYVPLDSNTSSYLRSKK LGYTYDSFASYNELSEKIVKTLGKRPWEISYEAYNYTPESDSSSIGSIRTLFEKLEDELE DDMDYHIFYRGQSDKSFGLIPSIYREKFLIQNENRIFRDIIAQSPADFKGCTSTFEKLVK 
MQHYSLPTRLLDITTNPLVALYFACENDAVDGKLFRFEVQTSDIKYFDSDAVSVVSNIAK RPIDFSIEDLRELDRKEFNSEEEIQYLLHEIKYEKPHFQNVIDSKDIERVFCVKPMFDNP RIIRQSGAFFLYGINGNKSQPASLNFSYKVYIINKAQKRKIRKQLEALGIDKSTLFPEVE HVAEHIKDKYHLPK >gi|226331990|gb|ACIB01000066.1| GENE 6 7321 - 7794 222 157 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253567332|ref|ZP_04844781.1| ## NR: gi|253567332|ref|ZP_04844781.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] # 1 157 1 157 157 291 100.0 1e-77 MNIVAKIRVLISSEIPQLNEIVDLRRLYFELNQHIAILASHINLHDLGEWKLLVSINYRN TDKIGIFKRLKRFSSDKEYEISISIPIPDKRKAIYGIAQVDTSFYSQITESLFYSLEPAY DKYLDLNTYMKDSAKSSILFILKQGFKCNGIKIQFKA >gi|226331990|gb|ACIB01000066.1| GENE 7 7799 - 8239 194 146 aa, chain - ## HITS:1 COG:no KEGG:YPTS_3419 NR:ns ## KEGG: YPTS_3419 # Name: not_defined # Def: YD repeat-containing protein # Organism: Y.pseudotuberculosis_PB1 # Pathway: not_defined # 2 143 1340 1481 1494 103 40.0 1e-21 MGMYVSQDPIGLAGGILNLYGYVDDTNAWSDILGLHKNSNDTVGDWVLYLVYDNSTGEYA KVGIGKAEDVMADNSNRRAYTSARKVRQDPNFSNATFQILSTHTNITKGAMKEIEAAKVR QLRSEGHNLPYNRERDHRYHLPSYNK Prediction of potential genes in microbial genomes Time: Wed May 18 00:36:15 2011 Seq name: gi|226331989|gb|ACIB01000067.1| Bacteroides sp. 3_2_5 cont1.67, whole genome shotgun sequence Length of sequence - 3909 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 2 - 3908 2156 ## COG3209 Rhs family protein Predicted protein(s) >gi|226331989|gb|ACIB01000067.1| GENE 1 2 - 3908 2156 1302 aa, chain - ## HITS:1 COG:YPO0762 KEGG:ns NR:ns ## COG: YPO0762 COG3209 # Protein_GI_number: 16121077 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Rhs family protein # Organism: Yersinia pestis # 154 1302 137 1334 1438 316 27.0 2e-85 LKTGVGAQVKGISEGVKANSEAGVSPSVNALDTGVKAAAAIGGLADGLSEAAMLPVLGAM GMKGMACLPISKQLDPVIGVDIHLVTIPPSPVVPMPHPYVGILLRPQDFIAAAVSSFIPP PPTAEQTGDADSAKLAEVGHTVLTMAVGMLGATVKIGGFIPRAVASTPTRSIPHIPMGAG WAAPSAAIPKNNGHAFMGSLTVLADGMPFSGGGAHLHLDCNDVGIPSVHKVPGMFLPTGV INPIPPARQILTSPVPVPLNPMAALARKCTGAFGRFYKKKTRKLADRLHSKVNRTIKSES LKNMLHKAICTVTGHPVDVASGTFFTDEEDFWLDGPVPLSWERTWYSRSDYRGPLGNGWH HAYDMGVVADTEEGTLTLRMSDGIPVAFPLPTAEEPSFILSERKEARLEQDGGYCVWDMA EDLYYRFTRKEYDSVRLLESVTDCNGLGIRFDYTKEGLLRSITDSAGRRLRVEHDTRSGR ILEICGPHPEDPEKEITLASYEYDADGNMTLQRNAAGDVMTYEYAGRLIVKETWRNGLAW YFEYDGTGVGSRCVHTWGDGGIYDHRLTFREGVTEVLDSHGELTVYHHRGGLVWKKVDAN GGEHLWRYDDSRRLLAQTDPLGNSTLYRYDRWGNCTDSSDPCGGSVSAVYPAKGNLRNRP VSVTTPDGGTWEFGYDRSGNLVSRTNPEGAVTRMTYRNGAVASVKDPYGVVTRLAYDRFH NLTEASDSRGNTSLYGYDLLGRCVSVTNPKGAVQKREYDPVGRVVRVLDFDGNDIRLSYD GIDNLTEYHDNVQHVEYGYSGMWKLTRRRDHRGVVSFRYDREERLRRVTNERLQSYEFTL DAVGNVTAEKGFDGAVRRYLRDRGGRVVRETLPSGTEREYGYDACSRVTCVSYPTAGDPD QTYAYGLSGRLVQASRGESTVEFAYNSLGLPVRETADGNTILRTYDHTGRILTLDSTAGA SLRYTRNGYGELEGFTATGGSDADGAGSWESAHRHDTLGFEVERILPGGIVRSFAYDDIG RLVDARTRKDSRTRHMRRYRWGVADRLLSVEDSRRGETRYSYTPTGQLERAEYPDGRVQW RKSDQTGNLYPDPDMKLRRYLGGGRLEQDGEWHCEYDADGNLTERYLGTGRWLDGKKDRW RYRWNADGSLAKVVRPDKREVEFTYDALGRRLSKSFGTTVTRWVWNGNVPLHQWKQHREY SVMEDRWNTDTERRNMTVWLFDEESFVPVAMLKEGRSYSILTDQLGTPTEAYDAEGNEVW SRVLDMDGNVIEETGNKGMVPFLFQGQYYDRETGLAYNRFRY Prediction of potential genes in microbial genomes Time: Wed May 18 00:36:21 2011 Seq name: gi|226331988|gb|ACIB01000068.1| Bacteroides sp. 
3_2_5 cont1.68, whole genome shotgun sequence Length of sequence - 27945 bp Number of predicted genes - 26, with homology - 26 Number of transcription units - 11, operones - 5 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 57 - 344 292 ## BF1725 hypothetical protein 2 1 Op 2 . - CDS 310 - 1686 999 ## BF2846 hypothetical protein 3 1 Op 3 . - CDS 1693 - 3456 999 ## COG3501 Uncharacterized protein conserved in bacteria - Term 3479 - 3528 5.6 4 2 Tu 1 . - CDS 3567 - 3962 292 ## BF2844 hypothetical protein - Prom 4119 - 4178 5.5 - Term 4392 - 4435 3.4 5 3 Op 1 . - CDS 4476 - 5279 328 ## BF2843 hypothetical protein 6 3 Op 2 . - CDS 5285 - 7633 695 ## BF2842 hypothetical protein 7 3 Op 3 . - CDS 7630 - 8571 709 ## BF2841 hypothetical protein 8 3 Op 4 . - CDS 8582 - 9028 321 ## BF2840 hypothetical protein 9 3 Op 5 . - CDS 9025 - 9522 471 ## BF2839 hypothetical protein 10 3 Op 6 . - CDS 9543 - 10691 441 ## BF2838 hypothetical protein 11 3 Op 7 . - CDS 10695 - 11573 504 ## BF2837 hypothetical protein 12 3 Op 8 . - CDS 11587 - 12522 573 ## BF2836 hypothetical protein 13 3 Op 9 . - CDS 12507 - 14945 1050 ## PROTEIN SUPPORTED gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 14 3 Op 10 . - CDS 14942 - 16717 832 ## BF2834 hypothetical protein 15 3 Op 11 . - CDS 16725 - 17147 281 ## BF2833 hypothetical protein - Prom 17289 - 17348 8.8 + Prom 17200 - 17259 7.3 16 4 Op 1 . + CDS 17315 - 17782 481 ## BF2832 hypothetical protein 17 4 Op 2 . + CDS 17795 - 19174 914 ## BF2831 hypothetical protein + Term 19192 - 19238 13.4 + Prom 19204 - 19263 2.4 18 5 Op 1 . + CDS 19287 - 19901 168 ## BF2830 hypothetical protein 19 5 Op 2 . + CDS 19907 - 20479 343 ## BF2829 hypothetical protein 20 6 Tu 1 . + CDS 21049 - 21825 496 ## COG1309 Transcriptional regulator + Term 21961 - 21999 -0.4 + Prom 22527 - 22586 4.3 21 7 Tu 1 . 
+ CDS 22608 - 23021 224 ## BF2827 putative single stranded DNA binding protein + Term 23094 - 23142 3.1 + Prom 23138 - 23197 6.4 22 8 Op 1 . + CDS 23260 - 24015 704 ## COG1192 ATPases involved in chromosome partitioning 23 8 Op 2 . + CDS 24022 - 24276 289 ## BF2825 hypothetical protein - Term 24252 - 24300 5.4 24 9 Tu 1 . - CDS 24347 - 25666 1362 ## COG1672 Predicted ATPase (AAA+ superfamily) - Prom 25729 - 25788 6.0 25 10 Tu 1 . - CDS 25936 - 26172 145 ## gi|253567360|ref|ZP_04844809.1| conserved hypothetical protein - Prom 26193 - 26252 7.5 + Prom 26129 - 26188 3.4 26 11 Tu 1 . + CDS 26221 - 27943 1243 ## COG1475 Predicted transcriptional regulators Predicted protein(s) >gi|226331988|gb|ACIB01000068.1| GENE 1 57 - 344 292 95 aa, chain - ## HITS:1 COG:no KEGG:BF1725 NR:ns ## KEGG: BF1725 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 91 1 87 91 75 47.0 8e-13 MEWLTNILRNLEGLFTNATEYAYANPKVGYLVVIFLLLVWLVGLIFDWKWTYARPGSWGG NFFLDLLGPTGFRFWLGVIIVIAIVASAYLYFRVK >gi|226331988|gb|ACIB01000068.1| GENE 2 310 - 1686 999 458 aa, chain - ## HITS:1 COG:no KEGG:BF2846 NR:ns ## KEGG: BF2846 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 12 458 1 447 447 825 99.0 0 MSNAIQQIADKMLYQWEDAVRKPVKLIRIVINPGDESMLDAFYDYMLALDSEEEDMVFLI ELPFSSRTDFSKDVVGYIAQQVEYWNNSKKPEDIVFERVDWIADYKSEEAENDASVAVAN FNKLTESLVKGTDMKCSFVFNLKNTYDYDGCREWFEKALALPFHKQMVWGISDIKDYEQF GKLMAKHSNDAVSIYPPIDLDGAMEQLAEQAANEDKSDPAASNFRLALIKLMNSVKKGNA AQTEEFAKKCLDIALENVKKDINWISQFVTVYTILYTDQISRKDHKTAMYFADKAVEAAE IGEEKLDPSLACRLLGNTLLGKASIYVRESLWKEAAETYYRGAEAYSRCNDYLMQSEALR LCGWCRENNYEKSLAAECYVEGFRLSDKLSIDLIKNSSYPLLLLSLLNSSARSKLVSDDE INEVLTKVLGDNWENYLYEYKRNLGKYNGMADQHIAKS >gi|226331988|gb|ACIB01000068.1| GENE 3 1693 - 3456 999 587 aa, chain - ## HITS:1 COG:PA2373 KEGG:ns NR:ns ## COG: PA2373 COG3501 # Protein_GI_number: 15597569 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # 
Organism: Pseudomonas aeruginosa # 112 583 112 586 668 90 22.0 9e-18 MLEQKKVTLEIAGIPMPSFVQLMLKQSINEHHYFEITLDIQAIEAYGVEIPEASKDWVGK KVIMDFGGTIFVGVATMVGLHRSGGTHGNIKVTGYSSTFLLESDHTCASWCNKSLSDIVK ELTDKAGVQALVNPETKSKLEYECQYEETNFRFIQRLARQYQEWLYYDGQNLVFGKPQAG STTKLTYGEELSVLDVCSQTLARPIKGSSYHSVNDQTYNGQSPDTAAGQNTLGQAAFDSS LALFTAPAVQRAEPRITNKGELDAYFQRRQQSDSAASSFITGESDCRILTVGSIIDVHTA IHTGIGIHVKNSIGTYIITEITHVAGMGDSYQNYFTALPSSIPTLPCPDVPLPVAHTQQA VVVSNEDPKKLGRVQVKMNWQTGPMQTSWIRVLTPDAGTSDKVATNRGFVFIPEKGDQVM VAFRYDDPNRPFVLGSLFHGKSGTGGGSSNKTKSLTTRSGCTITLDDEKGSVMMKDKEGN SYTADGEGNITISASKSITLCVGENKIMIDSEGNISSNAEKDITESAGANIAQSAENIDC TAGSNYNISSTDFTATGSSSATVSGTTKATIDSSGTTAVAGTIVKLN >gi|226331988|gb|ACIB01000068.1| GENE 4 3567 - 3962 292 131 aa, chain - ## HITS:1 COG:no KEGG:BF2844 NR:ns ## KEGG: BF2844 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 131 1 131 131 238 100.0 4e-62 MAFRASLELNNKEFDVLYSNYEFSRNTDPKGMPSSSVLGGRVKATIESTEDTSVIEAMLN SPFKPVEGKIIYKKTEEDAKMKEIQFKNAYIVHYSETLDANNDVPMTITITFSAEEIIVG SAALDNRWPKK >gi|226331988|gb|ACIB01000068.1| GENE 5 4476 - 5279 328 267 aa, chain - ## HITS:1 COG:no KEGG:BF2843 NR:ns ## KEGG: BF2843 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 267 1 267 267 488 100.0 1e-137 MEPTREQLQAEILEILYQKCLQKPIWVSCAEIFWSISNLKVSERSVKDVLDWLVKNDLVI YQADKYQIAKREFMDISRRKALEKKETEENAEVRTVEYGTSIPPHTMNHHPWLEREYQQP VAKSSNTMFAIIALIAFLISGTLVGILLFNSFATERRNNAELTYLPDSIQVPSLQVSTPG YIRDAYTTNRNFKNIQTSLETQQRINSELMDICKAQQSQINILTTCSKQQAADIEHLEKE RKIYTWMIALALLVLSSLLICRYGRKN >gi|226331988|gb|ACIB01000068.1| GENE 6 5285 - 7633 695 782 aa, chain - ## HITS:1 COG:no KEGG:BF2842 NR:ns ## KEGG: BF2842 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 782 1 782 782 1510 98.0 0 MKQLNLIRNLFSVAVMITLACNVSAQVSIRDRIQIMDRDIFNYPELTEAPVKTKKSKTNR IVYSDRTGNQSYEDPYFQRKRSSHGIGTPYYIVGEKNGTYKLVQADPDITGKPKSIIGFL 
YNSKRHFKEPRKVNYAGWIPSENVLMYDHARINPRNNQPIRYRIGINSINKLFDIHQFFN GDTLKIYGEPFLKTTTDAVVVSGEVVYLYKLDKSGKSALISNVPALSDSTKRFLGWVPAD LLAEVGQNEVYHIDYSRYRDSLLCAVNLMYPDTLALHNANIQGTMLFNLDGNPAGPMTNG NIRLNYPLSVWDKNWNKIINIKGGDIMVSDVRKMEAETKNVNIHVLFNDADLPYLSPYIN VLQNLKLKMKPEYDYTFSATCISSNGENRHLAPTKDFSAWLDYIKQSISSKRKNEQEEQS SFYGFGGAIKQISRYDNESRFSNNVFVILGSKQTLSLHEQQLSWLATQPARLLFAQIDRA SGTSYQDFLLQAKSILDSHSTKYIDHISNYIADSRLVKTELFKNIEATDANLYLYDAPYN SLTVGGILFPKGRNRLESNALETALDSVLWQSFETDSLLLHSLKDYERNIGVLRSRPTSE LTHIFHHTENPDSMSLDDIDRNSVNDTYYIKASVADSIMDGYEYGYLLDDKEVMELLQSY RGLLPEFSDNIGKKELRVLRKQFNRQKKSINRSFYRKVLPRNPYLAQIFYCKTGVMANDS LLNSTKVKKLKKRKVELTDFNSGYTTLISKMKRLEDIYQRNLFRPVFITGKKYYFIPKQL IL >gi|226331988|gb|ACIB01000068.1| GENE 7 7630 - 8571 709 313 aa, chain - ## HITS:1 COG:no KEGG:BF2841 NR:ns ## KEGG: BF2841 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 313 1 313 313 609 100.0 1e-173 MDNKKTITIIILGLVAMAGILLFLFLPSKSHLANIDFYVYDTNDNFHYEVNERLELLVND TAAIKGKRLIWEMGNGDTLMRNTDVSYTYRKAGKYLITLKIDGKHSVNKYIKVISGLEHE AIDSIPRINGVSEGYQGEELVFSAEGYGMDTWLWEFGESGTVDAYEQQVVYSYDIPGEYT ISLQTNTTKYPVKHRITILPRFEKAEELVTVDSLVLVQNDIKKHLQAIADAKVSQKSVYY EHTNYIRNKYFCIDADQVVVVINDEKYNDFNGYCQGLHFLESNRKKRVSIDDVKIDDLNC VKSIHVTQSYIEK >gi|226331988|gb|ACIB01000068.1| GENE 8 8582 - 9028 321 148 aa, chain - ## HITS:1 COG:no KEGG:BF2840 NR:ns ## KEGG: BF2840 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 148 1 148 148 263 100.0 1e-69 MTKDEMLWGNIRFLLLLIFSVAAIYIILCRYILNVPTEDSSELINEINHSERIFEIQHTH MQQAQNIWNEIDSLDFNIHQVQKMDEVKDGIYQLQHIYKENNMNTKFLFGVLSSRMLKCQ FDIKEELNSLVHNNALIERDLEECKANL >gi|226331988|gb|ACIB01000068.1| GENE 9 9025 - 9522 471 165 aa, chain - ## HITS:1 COG:no KEGG:BF2839 NR:ns ## KEGG: BF2839 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 165 1 165 165 237 99.0 1e-61 MEAENKKARLKAKLRFTLVFAIALIVTTTGGIVTIVTAQKGISLLESKKAEYDNVFKKQA 
ELNFQIEELFRDLNNLKTKRRNSSEHKHMQKLITKKRLLMENDIAMQADKSKYEVYKAML EQIRVIQSSMDDLDRESKKRESNMEQLEKCRIKYQELTKNKLTKP >gi|226331988|gb|ACIB01000068.1| GENE 10 9543 - 10691 441 382 aa, chain - ## HITS:1 COG:no KEGG:BF2838 NR:ns ## KEGG: BF2838 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 382 1 382 382 775 99.0 0 MKTLTYLPVNWVNGLKLTSQHFFANQYCQTEALNREAGRSLTSYNYGLGEVLEGIGDNLE IEISGDTMSTLCVRLKSCNAITKGGLPIVYYDGLYGDEKPCATISESGLQAEDSEYMVLI SVDPYHLIPVGEPDPEEVPLHHPYVLPSIKLHIVPKNQVNKSFYSQNFLLVAEVCRQGNT YKINHQYIPPVQHSACHDGIKTFISQLARTLQSIKEDVKLIYSRNVADKRRDTLANNTFE LCKAFSAFYNSRIFFIEQIALEQPPIYLVQAVNELANGLNSALQSLSETEREQLLQYFYE WTNVTPSEFITKIESLTKLMYDHTNINQSLQTAGAFVTLLSNMFHKMSELEYIGMVRENI IVGDESHETIEAQKKRSWSFIG >gi|226331988|gb|ACIB01000068.1| GENE 11 10695 - 11573 504 292 aa, chain - ## HITS:1 COG:no KEGG:BF2837 NR:ns ## KEGG: BF2837 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 292 1 292 292 542 99.0 1e-153 MKILLYFFARYLLAPLFVAVMIFVLTGIKTIKSKLSLKKLIIFILLASIAVALPSLFGFL KNEYVWGGLTFTILSYILLGALFCKLSTSDLFGAIGIGSSRTAVILTLTTICALGGWCYY LLFELISKLPYSLWNTTNILWFAIPYLIMYSRTLFLDIPHPIYTPWELSYGTFDRKYWDN IDNFGFRTVKVKIKRNIKDPTYASLVVRLPNEISLGNWFNWVIEDQNRRFPQNKIETEKE DMQIGWMFYTSKWFNFPLFIRILDPTLTSESNKIKNNQTIYIRRVQVETKTS >gi|226331988|gb|ACIB01000068.1| GENE 12 11587 - 12522 573 311 aa, chain - ## HITS:1 COG:no KEGG:BF2836 NR:ns ## KEGG: BF2836 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 311 1 311 311 592 100.0 1e-168 MGTMLAQKKPLESIQRSQAINFVAEIFYDELQFILDRRLLDHTMVNCHGMNSRVKSRDVL SVSVQHIAATNADKEYILLNLARNSIYHQLPELLFHPLVLSTPGMSNKEIVEAIRANEKQ DKELIQFFAPFDTEFFKEKVRINNRHLNFFSDPDSKKNFIKMIEVMENVELSITSHQKYK LFLFLCNAERYKENLPAIEQLLLIVLGLKVKLRLEVHEIDETVYLSVGSGCVGQTLGLNG LMISETDDLTATIILDTPTDDYEEVKTHLSNVRRILEFFILSTRNIEVDYLVRGETDFIL GENRLGYNMNL >gi|226331988|gb|ACIB01000068.1| GENE 13 12507 - 14945 1050 812 aa, chain - ## PROTEIN SUPPORTED ## NR: 
gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 [Bacillus selenitireducens MLS10] # 14 811 14 811 815 409 31 1e-113 MNDSNYSPSLVSALKIARGMALQDKHNTFGVSHLVMAMFAENTGLKEILSSMQKDVGYIM EWFDTHREMYRSSGTSTEEVEPDEELSKVLDESERSKIKLGADSIDSICVFTAILREGVV YSHQQIEMLDVSEDDILSHYNALTPLHSYQGEEMQITASVPYADNLKKQETINAGGFVVG REKEVRTILECLERSENKGILIVGESGIGKSSIINAFVKDICENEDEMLKQISIVGLNTA KLLASTSSETEIAQKVVNLMHKLNQLEQAVLVIDDLQVLLENSVSGKASTLINILSAQIS EGAANLVLTLTNDSYRKNIEKHPIEGRLDIIKIEELDTATLESAIQLHKKRIENYYELRI SDACIKDSIALSKRYFKERSLPYAAIDLLDRTAAAVRLCNKNARASVSDLEADFEEIKSL DEKISEGPLYLLYRSVFSKISVVLTTKLSDNYVWDKEDDIAIKAGRLSGIIKELKALSDQ SIEEIRSSEIEAIVAECTNIPIGKIQAREKDRLLSIESKLQERVKGQNRAITTLSDAIIE SRSGLSDPKKPIGSFFFLGPTGTGKTELTKSLAELLFDDESAMIRFDMSEFKEEHSAALL YGAPPGYVGYEEGGLLVTQIRQKPYSVVLFDEIEKAHSSVYDVFLQMMDEGKIHDKLGRE GDFSNSIIIFTSNIGSQWIVEQIQSGHTPDSGKLIEVMAQYFRPEFLGRLTEVVPFAPID ENVAKQIFNLHFGRLQEQLMKQKNIQLNLSDEALQHLANKGYSPKYGARPIAGVIRTYIK KSVSRLIVSEQIKSGDNIVINYRNGELIWEQC >gi|226331988|gb|ACIB01000068.1| GENE 14 14942 - 16717 832 591 aa, chain - ## HITS:1 COG:no KEGG:BF2834 NR:ns ## KEGG: BF2834 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 591 1 591 591 1201 98.0 0 MSDKKQIAIKERILKQAMEFWGVSDIRDMDPVIDLLLEVFAHESNKLHQEIDQSDSRILH RLSRILIGNKWSLPKPAHALMTITPNHGECCELDAEDHFYAEKNIFGKGDIQVFITPLFS YKLVDAKVKAIAYNDSIRHSTDSFSTPTRFLGPKNRIADYCVWVGLDVSKEVLQNMDSVM FCIKPSDMGLLPFMNMAFFYDCTGNRLNAKTGLHVEDPYSNAHYFDEIRDFYSDLYFNVS IKDASKEKHTYREMFPGYVADCEVPDDSENLFWVKIVFPEIFTKESLENLEVYLNTFPVV NRQIVYKQHNFRSTGRIIPLKCPGRTQFLNIRSFQDNTGREYVNRLNQYEENPTGIFSLY FGDLERFDSDSARSLISKVLQLMKEDGSAFASMNPDALSTQLKELFNKINDIEKGLEATL KGDNKIKAFVLSVPQKDATNAEIKYWVTSGSLANGFNERTLIQQFNIEKYDASGIMLRTC MQGGTTHDSEQELINSLRYGLLSRERIISKEDIKSYLLHKLGKHVESVEVGNGVTISPDS KKGLIRVTEVKIKFGQFDKDEIPNLEELAHYLEKDLTERSVCNSNYKIKFV >gi|226331988|gb|ACIB01000068.1| GENE 15 16725 - 17147 281 140 aa, chain - ## HITS:1 COG:no KEGG:BF2833 NR:ns ## KEGG: BF2833 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # 
Pathway: not_defined # 1 140 1 140 140 257 100.0 9e-68 MTYLKMPLQLQSAVAKKLQRCSYQESIAQHIMLLILSHHGEVIGREDFGSMIWDLEFNQL VKISDWEEGVKNSLIKTIEKYEKRLRNVDVNVTLLEIEEENIDKVSHIRRKAQITVTGIM DRTNEKFSFNTSLYISPLSQ >gi|226331988|gb|ACIB01000068.1| GENE 16 17315 - 17782 481 155 aa, chain + ## HITS:1 COG:no KEGG:BF2832 NR:ns ## KEGG: BF2832 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 155 1 155 155 273 100.0 2e-72 MAIYNYGIGGNEVKVDANESIAEIPSNRTLIVQKLTDEAPAAPESVYGLETIEDVFQRFE PTVDLCHVDAEGNEIKEIMTFTGLGDFGAKKIKEKSDFLSKLDIEKEQGVRITRQLTSNR ALQKALENPEVKNAIMDVIESSIEEILANQKKVEL >gi|226331988|gb|ACIB01000068.1| GENE 17 17795 - 19174 914 459 aa, chain + ## HITS:1 COG:no KEGG:BF2831 NR:ns ## KEGG: BF2831 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 459 1 459 459 852 100.0 0 MKQETQQQTASTTTSQKVVSQQAQKSSASAIDKLKEFGGFAFLENIIDGYSNLNPARKAR RNIFLTDEQWENERKALVNRLSVWLKLLRENDTAEQMRDKAKETAENAENLLNHNLKNAL ARTRELESAYRTIAFFYKNTESDKVKNVTIVNATMEQLQDLDNTIFADHISNELKQNFDR LDLRRNYSLLVVPGYIGSNAILDKWSKMAYENKVFLVTDFQDLETPDDVVDIFFNANHTS GDVFKSNTIMTCNWLLGRQREESVGEEENLYIPPSSSLAGKIYNTLMSQVVAGKKFGGLN EVESVRFDLRKSEISELERMGLVPMVNEYSKVMAFSAKTLFNGDNLGLQTYSVVRVFDYI TKVLIDFLNRRAFENWTTRTEADLRGQVVKFLDSVMGPNKLIERFKVMKIEQDPNQKDRV LLDIHITPYFPAKSFVIQLAGHKGDNPEDAIWESEYHQE >gi|226331988|gb|ACIB01000068.1| GENE 18 19287 - 19901 168 204 aa, chain + ## HITS:1 COG:no KEGG:BF2830 NR:ns ## KEGG: BF2830 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 204 1 204 204 398 100.0 1e-110 MKLRDVDIIISGTKTGDTYYAKSYPCSDMDKNSKIELYGVPVYYVYIKGTDDKGQSVKYT WKALRFMPYYNPPNFSSYKTIGWVNSGLHKLNRQPAPEYKKAYEVHNTYSQHNGAIVLKG TFYIHAGPEDLTHIGWGAAGCVEIIGSFSEFKDQVKELSGSTQVDADSAISELVFYKKLY IEIEYATPPNIKANFYKEVSIKRR >gi|226331988|gb|ACIB01000068.1| GENE 19 19907 - 20479 343 190 aa, chain + ## HITS:1 COG:no KEGG:BF2829 NR:ns ## KEGG: BF2829 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: 
not_defined # 1 190 1 191 191 328 98.0 4e-89 MCNIKYILCTIFCMCAYLHAAAICSDSISDKFSYKVLENFNYCWDNNIVYEDTNSKDTAL FVSSHKYIISPILNAFCNNVNEQTELAKIGYCCKHYLIKVPNKSKKNIQYYSIEINILSN IKSHAPITSKIEELKKQDYAIIDIISEAPVMCLVFVEADYLVAVTYNMFYDFDDVLSFSK FFVKEHFSTK >gi|226331988|gb|ACIB01000068.1| GENE 20 21049 - 21825 496 258 aa, chain + ## HITS:1 COG:CC2662 KEGG:ns NR:ns ## COG: CC2662 COG1309 # Protein_GI_number: 16126897 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Caulobacter vibrioides # 6 173 10 181 213 58 29.0 9e-09 MKQKEKKARNRRTNEQIDKEVISELEKLVAEYGFGNVNLSALMKAANIEANVFYRRYGSM ENLYDRLAKQYDFWINDTIDVSSLNILGPKKFFAETFKTLYRNLSDNTVMQKLLLYEMSV INETTKRTAETRDIMNLNLIAFYDNLFKPAKINIKAIMANLIGGIYYLILHRRCAKTCTI DFSTQEGEKVFFEWIDFLTDAIFDKLEAYERNRKAAQEMLSDGISEFKICKYMGINKNDL KMLLSNSHRNSRELSKES >gi|226331988|gb|ACIB01000068.1| GENE 21 22608 - 23021 224 137 aa, chain + ## HITS:1 COG:no KEGG:BF2827 NR:ns ## KEGG: BF2827 # Name: not_defined # Def: putative single stranded DNA binding protein # Organism: B.fragilis # Pathway: DNA replication [PATH:bfr03030]; Mismatch repair [PATH:bfr03430]; Homologous recombination [PATH:bfr03440] # 1 137 1 137 137 278 99.0 4e-74 MLYVHTIGRIGKDCQVITGAHGSFIAFDIAVDDYSHGNSITTWVRVRSKKENHIRLSEYL TKGRLLLIEGTLSASLWKDKDENCQIQLSITADALEFINTGKREGTTSEAESPTDAASNA PVPPVAMPQEDKEDLPF >gi|226331988|gb|ACIB01000068.1| GENE 22 23260 - 24015 704 251 aa, chain + ## HITS:1 COG:Rv1708 KEGG:ns NR:ns ## COG: Rv1708 COG1192 # Protein_GI_number: 15608846 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: ATPases involved in chromosome partitioning # Organism: Mycobacterium tuberculosis H37Rv # 3 248 65 313 318 185 40.0 7e-47 MTKIIAVLNHKGGVGKTTTTINLAAALRQKKKRVLAIDMDGQANLTESCGLSIEEEQTVY GAMRGEYPLPVIELENGLAVVPSCLDLSAAESELINEPGRELILKGLIAKLLDSRKFDYI LIDCPPSLGLLTLNALTTADFLIIPVQAQFLAMRGMAKITSVIEIVKERLNPNLSIGGIV ITQFDKRKTLNKSVAEIINDSFCDKVFKTIVRDNVALAEAPIKGKNVFEYNKNCNGAKDY MALAQEVLKLK >gi|226331988|gb|ACIB01000068.1| GENE 23 24022 - 24276 
289 84 aa, chain + ## HITS:1 COG:no KEGG:BF2825 NR:ns ## KEGG: BF2825 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 84 1 84 84 133 100.0 2e-30 MGKSDLLKDSMKSGLDGLLSSTKKSPQKKEATPLKTEKEPAVHCNFVIDKSVHTRMKFLA IEKNMSLRDIVNEAMKEYLEKNGK >gi|226331988|gb|ACIB01000068.1| GENE 24 24347 - 25666 1362 439 aa, chain - ## HITS:1 COG:PAB1371 KEGG:ns NR:ns ## COG: PAB1371 COG1672 # Protein_GI_number: 14521702 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Pyrococcus abyssi # 3 413 10 417 472 186 34.0 1e-46 MRFFDRTEEIASLRKIREMAKNNAQFTVVTGRRRIGKTSLVWKAYEDEPILYFFVARKAE GDLCEDYRLEIENKLGVPTMGRAEHFTDVFEYLMKLSAERPITLFIDEFQEFFRVNKSVF SDMQRIWDLYSPKSRINLIVCGSIYSMMTKIFKDKKEPLYNRQSRFMTVRPFTPTVLKDI LSEYNPGYTTEDLLALYAFTGGVAKYVQLLVDAGATAKTTMLDQIIKADSIFLGEGKAIL IEEFGKDYGIYFSILSAIARGKTSRSEIENVVGKEIGGYLTKLEKEYEIISKKQPLFEKS SAKNVRYVIEDNFFTFWFRFIYKYSYMLEIENYGSVKMIIGRDYETFSGLMLERYFKRVL IERQVYTRIGGWWDRKGENEIDIVAENELDDTATFFEVKRKAENIDMEKLEAKAAAFMRA TGEFKGYSLSYKGLSMTDM >gi|226331988|gb|ACIB01000068.1| GENE 25 25936 - 26172 145 78 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|253567360|ref|ZP_04844809.1| ## NR: gi|253567360|ref|ZP_04844809.1| conserved hypothetical protein [Bacteroides sp. 
3_2_5] # 1 78 1 78 78 147 100.0 1e-34 MFGSLFCLNRLGFTDALQGLTNKRFPTLFPENYSFAERKNFIGNALQSDGIKRGSPPLRL ENQPAMAGMDRMSGIKRI >gi|226331988|gb|ACIB01000068.1| GENE 26 26221 - 27943 1243 574 aa, chain + ## HITS:1 COG:PA5562 KEGG:ns NR:ns ## COG: PA5562 COG1475 # Protein_GI_number: 15600755 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Pseudomonas aeruginosa # 1 170 24 193 290 133 41.0 1e-30 MEATAIQSVEKNITMVALANVQPSNYNPRKNFDEASLAELSESIRQQGVLQPIGVRPIED NRFEIVFGERRYRASLMAGLEDIPAVVMEISDEVAEEMAVTENLQRKDVTPIEEANAYQK LIDSGRHDVQSLAVQFGKNESYIRTRLKFVSLMPEIAQLLEQDEITISVASEICRYGEDI QKEVYNKHLKEGVQYNSWRGMKASDVARNIERQYTTDLERYAFDKTLCLSCPHNTNNMML FCEGGCGNCANRTCLAEMNAAYLTEKAVRLMEERPEVSLCYESFNSNEAVVERLTAMGYE VESLNYYAKAYPEQPEAPRKDEYDTTEEYEQAQSEFEQDLNDYTEECEEIRTRSEAGEII LYFRIESKDIVLCYVPKVTCTTNDTKQEQTLSPVEKLEKQDKRNKEIALEKTVEDTKKQI LEVDMSDCKFGQDEDKMIYFFLLSSLRKEHFEAVGIEEKKPYSYLTDEEKINIIANLTSK QKAIIRRDFLIANFKNAYGNNAIASLLLDFAQKHMPDLLADIKNGHNEVYEKRHQRIEEK KAVLLVQEQAKQEAEQSDKQQSEVGMQTEEQPQE Prediction of potential genes in microbial genomes Time: Wed May 18 00:37:42 2011 Seq name: gi|226331987|gb|ACIB01000069.1| Bacteroides sp. 3_2_5 cont1.69, whole genome shotgun sequence Length of sequence - 26997 bp Number of predicted genes - 32, with homology - 31 Number of transcription units - 10, operones - 5 average op.length - 5.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 345 - 404 2.5 1 1 Tu 1 . + CDS 518 - 700 276 ## BF2819 hypothetical protein 2 2 Op 1 . + CDS 808 - 1080 372 ## gi|237719948|ref|ZP_04550429.1| conserved hypothetical protein 3 2 Op 2 . + CDS 1114 - 1470 455 ## BF2817 hypothetical protein + Term 1520 - 1556 2.2 - Term 1508 - 1543 5.8 4 3 Op 1 . - CDS 1588 - 3774 1593 ## BF2815 putative mobilization protein 5 3 Op 2 . - CDS 3778 - 4515 569 ## BF2814 hypothetical protein 6 3 Op 3 . - CDS 4529 - 5185 369 ## BF2813 hypothetical protein 7 3 Op 4 . - CDS 5207 - 5713 303 ## BF2812 hypothetical protein 8 3 Op 5 . 
- CDS 5716 - 6561 689 ## BF2811 conjugate transposon protein TraN 9 3 Op 6 . - CDS 6574 - 7173 317 ## BF2810 hypothetical protein 10 3 Op 7 . - CDS 7223 - 7558 107 ## BF2809 hypothetical protein 11 3 Op 8 . - CDS 7571 - 8716 1061 ## BF2808 conjugate transposon protein TraM 12 3 Op 9 . - CDS 8727 - 9209 338 ## BF2807 hypothetical protein 13 3 Op 10 . - CDS 9224 - 9595 297 ## BF2806 hypothetical protein 14 3 Op 11 . - CDS 9608 - 10222 442 ## BF2805 conjugate transposon protein TraK 15 3 Op 12 . - CDS 10229 - 10618 343 ## BF2804 hypothetical protein 16 3 Op 13 . - CDS 10648 - 11781 712 ## BF2803 hypothetical protein 17 3 Op 14 . - CDS 11796 - 12566 599 ## BF2802 hypothetical protein 18 3 Op 15 . - CDS 12635 - 13405 156 ## COG0863 DNA modification methylase - Prom 13540 - 13599 2.8 19 4 Op 1 . - CDS 13610 - 14335 497 ## BF2800 hypothetical protein 20 4 Op 2 . - CDS 14332 - 16863 1386 ## BF2799 hypothetical protein 21 4 Op 3 . - CDS 16868 - 17506 429 ## BF2798 hypothetical protein 22 4 Op 4 . - CDS 17506 - 20217 1816 ## BF2797 hypothetical protein - Term 20229 - 20275 9.3 23 5 Op 1 . - CDS 20328 - 20621 388 ## BF2796 hypothetical protein 24 5 Op 2 . - CDS 20633 - 21010 352 ## BF2795 conjugate transposon protein TraE 25 5 Op 3 . - CDS 21071 - 21394 177 ## BF2794 hypothetical protein 26 5 Op 4 . - CDS 21396 - 21848 408 ## BF2793 hypothetical protein - Prom 22004 - 22063 8.0 - Term 22035 - 22066 0.0 27 6 Tu 1 . - CDS 22084 - 22971 543 ## BF2792 DNA primase - Prom 23003 - 23062 8.1 28 7 Op 1 . - CDS 23078 - 24184 696 ## BF2791 hypothetical protein 29 7 Op 2 . - CDS 24162 - 24560 315 ## BF2790 putative excisionase - Prom 24583 - 24642 4.3 - Term 24579 - 24627 1.1 30 8 Tu 1 . - CDS 24737 - 25078 212 ## BF2789 hypothetical protein - Prom 25105 - 25164 1.7 + Prom 25127 - 25186 2.7 31 9 Tu 1 . + CDS 25280 - 25438 94 ## - Term 25549 - 25613 12.2 32 10 Tu 1 . 
- CDS 25644 - 26774 784 ## COG0582 Integrase - Prom 26914 - 26973 6.6 Predicted protein(s) >gi|226331987|gb|ACIB01000069.1| GENE 1 518 - 700 276 60 aa, chain + ## HITS:1 COG:no KEGG:BF2819 NR:ns ## KEGG: BF2819 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 59 1 59 93 116 100.0 2e-25 MTMDNIKESKEYKLAKEWEMAVNSFSFNPKRFAAAIPDMHPTLQQSLYRLFKECIIVMAG >gi|226331987|gb|ACIB01000069.1| GENE 2 808 - 1080 372 90 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|237719948|ref|ZP_04550429.1| ## NR: gi|237719948|ref|ZP_04550429.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] # 1 90 1 90 90 164 100.0 1e-39 MKTQEVQFGGNNYPCRVVESNEGEELLIGSITLLDALQPGSFNDENEGFASKEAERIYDE VFFFTDMANLRLTDVELVAELKKDNPEWFE >gi|226331987|gb|ACIB01000069.1| GENE 3 1114 - 1470 455 118 aa, chain + ## HITS:1 COG:no KEGG:BF2817 NR:ns ## KEGG: BF2817 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 118 1 118 118 231 100.0 6e-60 MEKEYVIQIAQTIQEQLIGLTPMPVLMSWGIAEFAATIFKDLPALRIKVNGLLHTGYVII ALNGSDYYEVYLLKGKDAECVNEEVCYNELGDVIDRAIECGTDKEEYEKIAISNSPNY >gi|226331987|gb|ACIB01000069.1| GENE 4 1588 - 3774 1593 728 aa, chain - ## HITS:1 COG:no KEGG:BF2815 NR:ns ## KEGG: BF2815 # Name: not_defined # Def: putative mobilization protein # Organism: B.fragilis # Pathway: not_defined # 1 728 1 728 728 1451 100.0 0 MEESKELQGFYKIFRTVVYVSVLLEFFEYAIDPAMLDHWGGILTDIHGRIKQWMIYHDGN LVYSKIATVLLICITCVGTRNKKHLEFNARRQVVYPLTSGLLLLVLSVWLFGHTMETRLY TLPLNIILYMIASIAGVVLVHIALDNISKFIKEGLMKDRFNFENESFEQCEEKVENEYSV NIPMRYYYKGKFRKGWISVSNCFRGTWVVGTPGSGKTFSIIEPFIRQHSAKGFAMVVYDY KFPTLATKLYYHYKKNEKLGRVPQGCKFNMINFVDVEYSRRVNPIQAKYINNLAAASETA ETLLESLQKGKKEGGGGSDQFFQTSAVNFLAACIYFFVNYEREPYDANGKPLYAERQQDP QTKFWKPTGIVRDREGGNIVEPAYWLGKYSDMPHILSFLNESYQTIFEVLETDNEVAPLL GPFQTAFKNKAMEQLEGMIGTLRVYTSRLATKESYWIFHRDGDDFDLKVSDPKNPSYLLI ANDPEMESIIGALNALILNRLVTRVNTGQGKNIPVSIIVDELPTLYFHKIDRLIGTARSN KVSVTLGFQELPQLEADYGKVGMQKIITTVGNVVSGSARAKETLEWLSNDIFGKVVQVKK 
GVTIDRDKTSINLNENMDNLVPASKISDMATGWICGQTARDFVKTKTGMGGSMNIQESEE FKTTKFFCKTDFDMAEIKKEEAAYVPLPKFYTFKSREERERILYKNFVQVGQDVKEMIKD VQNKRNAK >gi|226331987|gb|ACIB01000069.1| GENE 5 3778 - 4515 569 245 aa, chain - ## HITS:1 COG:no KEGG:BF2814 NR:ns ## KEGG: BF2814 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 245 1 245 245 460 100.0 1e-128 MKDTDIFFSIALMILGIVLMYKSMKGNSCRQILAKLADSIASDKEDINFQIKRFGYLLDE MTADNEKNSILKRHADRLEELVTQLDRTRNELETSNLSMADINGELKRSNAELVKRATQL RNEIQQDELVIQKMQERLDSLKRIKVGLEIALNNIQAEEVHYLSEPVFSLGITPSIKSHL ESHGILYIGDLIHLNEQYLMEIWGIGPVTLDKIKTKLNENGAWFGMDVIRVGNHWYRRKQ GLITD >gi|226331987|gb|ACIB01000069.1| GENE 6 4529 - 5185 369 218 aa, chain - ## HITS:1 COG:no KEGG:BF2813 NR:ns ## KEGG: BF2813 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 218 1 218 218 409 100.0 1e-113 MIRKIHLVAAATAAVLCAACDTHIDVPDTAVRPGHILCEDGTALPYAQYEQSGKKAIAVV FDTEKRGDTEGDGYAVYLWDIAPQAFADSLGIAQGTSADIMAYDGNENTFALYDTQETAS PMAEAVFDLWRYGQSAYVPSVAEMRLLYTMRKIINPVIEQCGGEPLPLDENDCWYWTSTE VEKQQTAKAWLYSMGSGAMQETPKVQAHKVRPIITINE >gi|226331987|gb|ACIB01000069.1| GENE 7 5207 - 5713 303 168 aa, chain - ## HITS:1 COG:no KEGG:BF2812 NR:ns ## KEGG: BF2812 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 168 3 170 170 315 100.0 5e-85 MKKTIIVIITCVCIAASANAQQGSGRLSLGTGLLYKNGMDITLAYEHEMNYRHTWEFFVN GYLQWAECASCGHICPESFWRNYRSYGFGVAYKPCMTRGRNHYGSLRIGASAGSDMNKFL GGLHFGYEHNYVLRAGWTLYWQVKSDVMIKGADVLRAGVVLGVKLPIK >gi|226331987|gb|ACIB01000069.1| GENE 8 5716 - 6561 689 281 aa, chain - ## HITS:1 COG:no KEGG:BF2811 NR:ns ## KEGG: BF2811 # Name: not_defined # Def: conjugate transposon protein TraN # Organism: B.fragilis # Pathway: not_defined # 1 281 1 281 281 518 100.0 1e-145 MKRNLFGIMLLSGISILPKANAQTTYEEMEQLTVNEQITTVITASEPVRFVDISTDKVAG DQPIDNIIRLKPKESGHEDGEILAIVTIVTERYRTQYALIYTTRMKEAVTDKEILLQERN AYNNPAVSMSAADMTHHARRIWNSPAKIRNVATKAHRMVMRLNNIYSVGDYFFIDFSIEN 
KTNIRFDIDEIRIKLTDKKLAKATNAQTIELTPALVLESGKTFRHGYRNVIVVKKMTFPN DKLLTIEMTEKQISGRNISLNIDYEDILAADSFHADLLEEE >gi|226331987|gb|ACIB01000069.1| GENE 9 6574 - 7173 317 199 aa, chain - ## HITS:1 COG:no KEGG:BF2810 NR:ns ## KEGG: BF2810 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 199 3 201 201 399 100.0 1e-110 MKSSNYLKKIYGNPTDEKYTPGYGVLPIIKYIPEGKIVWCPFDTKHSEFVQKFKDAGFHV VYSHIYNGQDFFNYEPSQWDILVSNPPFSRKVEVFERCLKLGKPFALLMSNYWLNNVTPC RLFQNTDLELLMFDKRIQFGKGKNVPFNSSYFCHKILPKQIIFEQIDVTDKSPSCMQDDI PDKANINPQENKAIMNFQL >gi|226331987|gb|ACIB01000069.1| GENE 10 7223 - 7558 107 111 aa, chain - ## HITS:1 COG:no KEGG:BF2809 NR:ns ## KEGG: BF2809 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 111 1 111 111 219 100.0 2e-56 METDLSEYHKGTDGLYYADYIAPNKAETFIGKLVSTEWWHHRGQFALICNFRTEDRRRIA LFAFQKHTGFYGPRYGNVNFKTVEKGTLWQCEIQMTRTRRCTWARARQIKK >gi|226331987|gb|ACIB01000069.1| GENE 11 7571 - 8716 1061 381 aa, chain - ## HITS:1 COG:no KEGG:BF2808 NR:ns ## KEGG: BF2808 # Name: not_defined # Def: conjugate transposon protein TraM # Organism: B.fragilis # Pathway: not_defined # 1 381 10 390 390 683 99.0 0 MKILEKINFRQPKYMLPAILYFPLLGTSYFIFDLFQTETIEIQDKALQTTEFLNPELPGA QIKDDGIGSKYENMAKSWGKIQDYSAVDNIDREEPDKNKEEYESKYTQDDIDLLTEEQQE KAAAAEIASAKTREQEALAELEKALAEARLRGQNAAVPPAETDTANIAPPQGAAASGTIN EESRAVKTPSADEPPSEVVRKVKTTSDYFNTLSKNVREPKLIQAIIDENIKAVDGSRVRL RLLDDVEIGECVIARGTYLYATVSGFSSGRVKGNISSILVNDELVKVSLSLYDTDGMEGL YVPNSQFRETSKDVASGAMSGNMNMSMGSTTGNSLAQWGMQAVNNAYQKTSNAISKAIKK NKVKLKYGTFVYLVNGQEKRN >gi|226331987|gb|ACIB01000069.1| GENE 12 8727 - 9209 338 160 aa, chain - ## HITS:1 COG:no KEGG:BF2807 NR:ns ## KEGG: BF2807 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 160 1 160 160 340 99.0 1e-92 MNTRLEKSVRSSDEWYTPKEILDALGKFDLDPCAPIHPLWPTAEVMYDQNIDGLSQIWEG RVWLNPPYSRPLIELFVRKLAEHGNGIALLFNRCDSKMFQDVIFPKATGMKFLRHRIRFY RPGGARGDSPSRGSILLAFGEDNAEILRNCAIEGKYVQLN 
>gi|226331987|gb|ACIB01000069.1| GENE 13 9224 - 9595 297 123 aa, chain - ## HITS:1 COG:no KEGG:BF2806 NR:ns ## KEGG: BF2806 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 123 1 123 123 229 100.0 2e-59 MNIKGFKRMLFGEKMPDKDDPQYKERYEREVQAGHKFAKATRIDQAAAKVQGFANAHRTL FLVIVFTFVIGAFVWNAYRLVTVYRHSPVSRTATEKQDSVLRERHKLLQEVEIREHKNRG DKP >gi|226331987|gb|ACIB01000069.1| GENE 14 9608 - 10222 442 204 aa, chain - ## HITS:1 COG:no KEGG:BF2805 NR:ns ## KEGG: BF2805 # Name: not_defined # Def: conjugate transposon protein TraK # Organism: B.fragilis # Pathway: not_defined # 1 204 1 204 204 408 100.0 1e-113 MVIKHLENKIRLVGIICTAFLAGCIIISVSSIWTARTMVTDAQKKVYVLDGNVPILVTRT TMDETLDVEAKSHVEMFHHYFFTLAPDDKYIRYTMEKAMYLVDETGLAQYNTLKEKGFYS NILGTSAVFSIFCDSISFDKKNMEFTYYGRQRIERRSNILMRELVTAGQLKRVPRTDNNP HGLLIVNWRTLLNKDIEQKTKSNY >gi|226331987|gb|ACIB01000069.1| GENE 15 10229 - 10618 343 129 aa, chain - ## HITS:1 COG:no KEGG:BF2804 NR:ns ## KEGG: BF2804 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 129 11 139 139 234 100.0 1e-60 MNVQQKIEKWCRNERFVHYANERISEELVYAPNHRIDPEYEELDEAITWDNRYIVPMMTY LTYRLQLVKLQKNAKNRNRRVWWIFVHVIMREDYTQLFDGKFEKFLTELHDTVMTMLHDE YTRLSNKKK >gi|226331987|gb|ACIB01000069.1| GENE 16 10648 - 11781 712 377 aa, chain - ## HITS:1 COG:no KEGG:BF2803 NR:ns ## KEGG: BF2803 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 377 1 377 377 731 100.0 0 MADGNILSDFGINILEEEIDDVIFQTNEFLTDATFTGSQGPFWWILQMCMALAALFAIVM AAGMAYKMMVKHEPLDVLKLFRPLAVSIILCWWYPPADTGMAGSGSSWCFLDFLSYIPNC IGSYTHDLYEAEATQIADKFEEVQQLIHVRDTMYQSLQAQADVAHTGTSDPNLVEATMEQ TGVDEVTKMEKDAAELWFTSLTAGVIVGIDKIIMLIALIVYRIGWWATIYCQQILLGMLT IFGPIQWAFSLLPKWEGAWAKWLIRYLTVHFYGAMLYFVGFYVLLLFDIVLCIQVENLTA ITASEQTMAAYLQNSFFSAGYLMAASIVALKCLNLVPDLAAWMIPEGDTAFSTRNFGEGV AQQAKMTATGGIGSMMR >gi|226331987|gb|ACIB01000069.1| GENE 17 11796 - 12566 599 256 aa, chain - ## HITS:1 COG:no KEGG:BF2802 NR:ns ## KEGG: BF2802 
# Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 256 1 256 256 454 100.0 1e-126 MDRILLLVTVTIIATTAAKAQSVTYNHDSPKQNQVTVMETGTGALSPDLYYSILHNKYKK SAAVKNKLSFRTLAGVNLYNQTDEAEAIDSALVSRAKIEALNVADRQADIAWVAEGDKVN GQMVRFKRNIDRILPVGGTPEDKDRWTEYYHIYQCAIDATKDAYMPNAQRKKEYLRIYED ITRQNEILVGYLAKRQNTTITSTLLNATADRTLDKESIVRDAVNRWHESRFAVRGPQSGN NTGGNGDGDETVNKGN >gi|226331987|gb|ACIB01000069.1| GENE 18 12635 - 13405 156 256 aa, chain - ## HITS:1 COG:HP1352 KEGG:ns NR:ns ## COG: HP1352 COG0863 # Protein_GI_number: 15645965 # Func_class: L Replication, recombination and repair # Function: DNA modification methylase # Organism: Helicobacter pylori 26695 # 11 247 10 254 359 75 27.0 1e-13 MKKIVFEKDITLYKADCLEVMPLLPESSIDLVLCDPPFGITASQWDKIIPFSKMWEEIRR VRKDNAPTALFGSEPFSSLLRCGNLAEFKYDWVWEKSKASNFLLAKKQPLKAHELISIFC NGRTPYYPIMEEGEPYENRTKRGSNWTGVNKVPNPTFRNENKGTRYPRSVKYFKTAESEG KTIHVNQKPVALLKYLIKTYTKEGDTVLDFASGSMSTAIACIHTNRKCICIEKDDMHFLR GEERIRNEYNIKRNVE >gi|226331987|gb|ACIB01000069.1| GENE 19 13610 - 14335 497 241 aa, chain - ## HITS:1 COG:no KEGG:BF2800 NR:ns ## KEGG: BF2800 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 241 1 241 241 429 100.0 1e-119 MKRTILIAITLLALLPDVAKGQWTFDIVSVEAYINDHKKQRSLLLARSTLEYSNQLLHEY SREETGKYKEVNIDLDRYTRAFDVIDVMYQSLRTVLNVKDTYSSVSDRIGDYKTMLEAFH EKILKHGNIEPSDALILTINEKAIRDIANEGEHLYKSVSDLVLYATGAAACSTSDLLMVL ESVNKSLDSIEQHLNRAYIETWRYIQVRIGYWKSKIYRERTKREIIDGAFGRWRNAGRLD Y >gi|226331987|gb|ACIB01000069.1| GENE 20 14332 - 16863 1386 843 aa, chain - ## HITS:1 COG:no KEGG:BF2799 NR:ns ## KEGG: BF2799 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 843 1 843 843 1666 99.0 0 MKRLFIFWVILLPALTQHVHAQYYSVNYDARTVAAMAAAFGTEAVAESYYREQVDDILKH YTAAEVAAAGIFSSKFLEHKALSDLGIWCSSTENYYYRRIYHMVAEKIMPKIWVVAKLML RSPQTAIHWGSYLMKVCDDTKSLCMQFESVVTNSTLSFSDIAFLEIDRDIATLLNLAELG GTDWQRMLDNFTKVPGNFTIENLKGDIDNLYNMGVGLATSGMENLGDALLQSSAFHDLLG 
GKVNEIGNLYEHYSTLFEQAEHDIGSLLIDMVGGQDSVAALFNFSNYDLTSWMTDYMDNA VGNYYTQRWYIARRDQGSISLCDYYPPTDDNSILNGGAWTRFNTSDPGFYPNASQREQAL ANSERYAGWSRSRVQQLNNSNDGYTYTINTRQQAYIISKGNKQTKKAYAYEIHVTQSWNR TEVVYEDVFDSYSMDLNTFKAQLNARLSEFNDNEEGYVYYIASDARNYYQATDAAKLQGC ESVTISVTCSDGATLGQGSTQYKCRKCGGSLDAHSKECVMQTSVTENELDLSELDALIRE ADNQVAVLQSQISALEKENADLLKKIAEASVEDAAAYRQQYNSNRTRIEELKSELAEWQQ KQKEYADAKQEAEAENDVPTDDYYRLPAIMQDCKTAYSLTWQDGGTWSGYTFVRKATMPN INGIITFRATISIARKPKYFLGIKIHRAIIQISWELTSAYTDTHVADVLTLDPNLPNEEK TKIVNNRISEIAREYPNCKITTEYARTEPMEEVPNSDVYHLLWSSDRLEIAREVDSRITK IYADLVSLEKMMHYKRNIIDVLKDVLPGLDTDEGRRLTLVEECHDRWVENARTSRSGRKE VRP >gi|226331987|gb|ACIB01000069.1| GENE 21 16868 - 17506 429 212 aa, chain - ## HITS:1 COG:no KEGG:BF2798 NR:ns ## KEGG: BF2798 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 212 1 212 212 400 100.0 1e-110 MKTKRILITLSLGYGINMMGFESSLTREQISVSNPELTVLSLREFCMLSKENLLRMDDMT PDKVAAIERLLAEYSLRLGMSDVELEAYLNRYYEENPKEKEFYDMCDRLCNSKPVFDENR FREELFRELNSSPMSEKRLSDLGWLRYQTVRETYLNQPFFLRWFGSQEARIKRAIKDTTI IHDMFCRLVTENCIESERWYFNHKEPEYIKEV >gi|226331987|gb|ACIB01000069.1| GENE 22 17506 - 20217 1816 903 aa, chain - ## HITS:1 COG:no KEGG:BF2797 NR:ns ## KEGG: BF2797 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 903 1 903 903 1812 99.0 0 MTLYIILFFIALCTGMALSVYTFGTGGKRKHIFQNIYFSVEDTDGVGVLYTKTGEYSAVL KIENPVQKYSADIDSYYDFTHLFSALAQTLGEGYALHKQDIFVRKQFAKEPEHNQEFLSA SYFRYFNGRPYTDSLCYLAITQEAKKSRLFSYDSKKWRDFLVKIYKVRDLLRDSGVQVKF LNKAEASEYVDRYFAMNFKDRTVSMTNVKSDDETVSMGDKRCKVYSLVDVDCAALPSLIR PYTNIEVNNTEMPVDLVSVVDNIPNAETVVYNQIIFLPSQKRELALLDKKKNRHASIPNP SNQMAVEDIKQVQDVIARESKLLVYTHFNMVVGVPADTDLQKCTNHLENAFGRMGIHISK RAYNQLELFVSSFPGNCYSLNEEYDRFLTLSDAAVCLMYKERVQHSEETPIKIYYTDRQG VPVAIDITGKEGKNKLTDNSNFFCLGPSGSGKSFHMNSVVRQLHEQGTDVVMVDTGNSYE GLCEYFGGKYISYTEERPITMNPFRINREEMNVEKTGFLKNLVLLIWKGTQGTVTKTEDR LIEHVITEYYDAYFNGFEGFTPQQREDLRKSLVIDDRNSSEKRHESERERAVRIEGIIDE 
IEGRRKELKVEELSFNSFYEYSVQRIPDICEENRITGIDLSTYRYMMKDFYLGGNHEKTL NENMDSSLFDETFVVFEIDSIKDDPLLFPLVTLIIMDVFLQKMRIKKNRKVLVIEEAWKA IASPLMAEYIKFMYKTARKFWASVGVVTQEIQDIIGSEIVKEAIINNSDVVMLLDQSKFK ERFDTIKTILGLTDVDCKKIFTINRLENKEGRSFFREVFIRRGTTSGVYGVEEPRECYMT YTTERAEKEALKLYKRELQCSHQEAIEAYCRDWNTSGIGKALPFAQKVNEAGCVLNLTTK ITS >gi|226331987|gb|ACIB01000069.1| GENE 23 20328 - 20621 388 97 aa, chain - ## HITS:1 COG:no KEGG:BF2796 NR:ns ## KEGG: BF2796 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 97 1 97 97 168 98.0 6e-41 MINDGRYPDYPLFKGLQRPLELMGLQGRYIYWAAGVAGGAIVGFIAAYCLMGFVAGLVVL ATVLSAGIVLIILKQRKGLHSKNVKRGVYVYAYSHKV >gi|226331987|gb|ACIB01000069.1| GENE 24 20633 - 21010 352 125 aa, chain - ## HITS:1 COG:no KEGG:BF2795 NR:ns ## KEGG: BF2795 # Name: not_defined # Def: conjugate transposon protein TraE # Organism: B.fragilis # Pathway: not_defined # 1 125 1 125 125 196 100.0 2e-49 MFQKFKRMCRKAKKTIMQVSTKVRMLIIALLGGIPAMAQSTAGDYSAGTTALSTVAEEIV KYVPVMVKLCYAIAGVVAIIGAISVYIAMNNEEQDVKKKIMMVVGACLFLIAAAQALPLF FGINA >gi|226331987|gb|ACIB01000069.1| GENE 25 21071 - 21394 177 107 aa, chain - ## HITS:1 COG:no KEGG:BF2794 NR:ns ## KEGG: BF2794 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 107 23 129 129 171 99.0 8e-42 MSKAKKILCALCFVPYAAFAKSGSVNYSWGADALATMHDFVVTMMLYVLYICYAVASVFV VVAALQIYIKMNTGEDGVVKSIVSLVGACLFIIGASIVFPAFFGYRI >gi|226331987|gb|ACIB01000069.1| GENE 26 21396 - 21848 408 150 aa, chain - ## HITS:1 COG:no KEGG:BF2793 NR:ns ## KEGG: BF2793 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 150 51 200 200 271 99.0 5e-72 MKSYFIFTIVLTVAYLVYYAVIIVQDLYGKKGNGKPEEEVFDLGAPEDEQSVYVTESDTG FNVGNEKYETDVAPTASPAPQETETADNNGEIAVAEKLKRLKAQAEEQMEETETYLSDAY TADELYKAMLAKGKTGNRPELVWKPLKDRL >gi|226331987|gb|ACIB01000069.1| GENE 27 22084 - 22971 543 295 aa, chain - ## HITS:1 COG:no KEGG:BF2792 NR:ns ## KEGG: BF2792 # Name: not_defined # Def: DNA primase # Organism: 
B.fragilis # Pathway: not_defined # 1 295 1 295 295 607 100.0 1e-172 MTIDEAKRVRIVDFLAQLGHRAQYMKSEQYWYLSPLRKEVTPSFKVNDRLNEWYDFGEAT GGDLVELGKYLCGTKSVSEALAYIKRYVNGVSLPKTRALPATSRPVEADMKNLIIVPLRH HALLSYLHSRMIDSDIGRMFCKEVHYELRQRRYFALAFGNISGGYEVRNPYYKGCIKNKD ISLIPQSRGEAQSRVCLFEGFMDFLSYLTLKQTDDSAICINAPCDYLVMNSVSNLKRTLT YLQKYTYIHCYLDNDLAGQKTVETIAGMYGRCVYNESNCYAGYKDLNDYLRGKKQ >gi|226331987|gb|ACIB01000069.1| GENE 28 23078 - 24184 696 368 aa, chain - ## HITS:1 COG:no KEGG:BF2791 NR:ns ## KEGG: BF2791 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 368 1 368 368 752 99.0 0 MENERRTEYNVDMRPEEDFLSDILSASQIRATDTYETPPQIIWIDNSTIATLGNFSASTG KAKSKKTFNVSAIVAASLAGKQVLNYRAHLPEGKRKILYVDTEQSRFHCHNVLERILRLA GLPTTTDSENLDFICLREYSPAIRIGVIDYALRQRKGYGLVIIDGIRDLMLDINSTGESV EVINKMMEWSSKYDLHIHCVLHLNKGDNNVRGHIGTEMSNKAETVLVISKNNDCPNISEV HALHIREKEFKPFAFTVNEGGLPVLAEGHLFENAPHQKPKQRTGFMELSIEQHREALSAA FGDKPIRGFENMLQAMMTAYEAIGFKRGRNVMVKLLQYLTDTLKLVIKRDKLFYYDMTQA ETMLFDEE >gi|226331987|gb|ACIB01000069.1| GENE 29 24162 - 24560 315 132 aa, chain - ## HITS:1 COG:no KEGG:BF2790 NR:ns ## KEGG: BF2790 # Name: not_defined # Def: putative excisionase # Organism: B.fragilis # Pathway: not_defined # 1 132 1 132 132 226 100.0 2e-58 MQNNRLTFMERLSERLTSVEAILKKLDPIESLLERIALLEKNIYTTKQVFTFQEACMYIG ISESMLYKLTSGKEIPHYKPRGKMIYFAKEDLDEWLLQNYEPTVDEAARMANEAAATQPF FNQRRHGKRKKN >gi|226331987|gb|ACIB01000069.1| GENE 30 24737 - 25078 212 113 aa, chain - ## HITS:1 COG:no KEGG:BF2789 NR:ns ## KEGG: BF2789 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 113 145 257 257 209 99.0 2e-53 MIGITACANAYHLFCVSTLHVEDMEALLSCKEGFCIRVNNIRHVAILFDTLLEHSFIQAK WQSVLSSGRFLQTKDGKGFVSASSLSSALSALRNNMTSTGYGIRRAIDELREW >gi|226331987|gb|ACIB01000069.1| GENE 31 25280 - 25438 94 52 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNVHFMAVPADRMVTVDIPVAVYEILHVTVVLLIARHDITEIHCFCFNEQCE >gi|226331987|gb|ACIB01000069.1| GENE 32 25644 - 26774 784 376 aa, chain - ## HITS:1 
COG:CPn0024 KEGG:ns NR:ns ## COG: CPn0024 COG0582 # Protein_GI_number: 15617948 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Chlamydophila pneumoniae CWL029 # 181 364 106 301 312 79 30.0 1e-14 MPRTRKPIKVKEPIRLRTKELANGSKSLYLDIYRNGKRTYEYLKMYLIPETDRNARQQNE TTMAAANAIKSKRIIELTSGEAGIMNHKDKVYLLDWMQLYKEEQKKRGKKNIGQIKSVTG ILKEYAGERFTLNQIDLTFCHGYIDYMLTNYRPKGKPISASTRNTYYQIFNGALNAAVRA KRILKNPFNEMEKSEKPKMPESVRSYMTIEEVRSLIATPMQNEGVKSAYLFSCFCGLRIS DIIGLQWKDVFIDNGQYRLAVAMQKTKEPIYLPLSNEALKWMPERGDKTADDHVFDLPSG INQLIKPWAKAAGISKRFTFHTARHTFATMMLTLGADLYTVSKLLGHTSVKMTQVYAKIV NKKKDDAVNLTNGLFD Prediction of potential genes in microbial genomes Time: Wed May 18 00:39:46 2011 Seq name: gi|226331986|gb|ACIB01000070.1| Bacteroides sp. 3_2_5 cont1.70, whole genome shotgun sequence Length of sequence - 53709 bp Number of predicted genes - 41, with homology - 40 Number of transcription units - 16, operones - 9 average op.length - 3.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 1617 - 1667 1.2 1 1 Tu 1 . - CDS 1784 - 1981 76 ## - Prom 2001 - 2060 4.4 + Prom 1638 - 1697 4.4 2 2 Op 1 . + CDS 1878 - 2831 927 ## COG0685 5,10-methylenetetrahydrofolate reductase 3 2 Op 2 1/0.000 + CDS 2835 - 3944 875 ## COG2812 DNA polymerase III, gamma/tau subunits 4 2 Op 3 . + CDS 3961 - 5418 1416 ## COG1774 Uncharacterized homolog of PSP1 5 2 Op 4 . + CDS 5261 - 5866 357 ## BF4095 hypothetical protein 6 3 Op 1 19/0.000 - CDS 5861 - 7318 1058 ## COG0772 Bacterial cell division membrane protein 7 3 Op 2 . - CDS 7299 - 9155 1316 ## COG0768 Cell division protein FtsI/penicillin-binding protein 2 8 3 Op 3 . - CDS 9158 - 9655 220 ## BF3914 putative transmembrane protein 9 3 Op 4 22/0.000 - CDS 9652 - 10497 754 ## COG1792 Cell shape-determining protein 10 3 Op 5 . - CDS 10599 - 11621 1253 ## COG1077 Actin-like ATPase involved in cell morphogenesis 11 3 Op 6 . 
- CDS 11696 - 13219 1646 ## COG0138 AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) - Prom 13285 - 13344 6.6 - Term 13289 - 13345 14.1 12 4 Op 1 . - CDS 13371 - 15404 2003 ## COG3590 Predicted metalloendopeptidase 13 4 Op 2 . - CDS 15420 - 17375 1936 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains - Prom 17443 - 17502 5.1 - Term 17488 - 17538 2.0 14 5 Tu 1 . - CDS 17603 - 19993 1195 ## BF4104 hypothetical protein - Prom 20021 - 20080 5.9 - Term 20793 - 20836 9.4 15 6 Tu 1 . - CDS 20870 - 21772 1177 ## BF3922 hypothetical protein - Prom 21823 - 21882 3.3 - Term 21833 - 21885 -0.8 16 7 Op 1 . - CDS 21899 - 24250 2122 ## BF4108 hypothetical protein 17 7 Op 2 . - CDS 24278 - 28822 3847 ## BF3924 hypothetical protein - Prom 28849 - 28908 2.5 18 8 Op 1 . - CDS 28936 - 31236 1800 ## COG0642 Signal transduction histidine kinase 19 8 Op 2 . - CDS 31240 - 32565 1171 ## COG0534 Na+-driven multidrug efflux pump 20 8 Op 3 . - CDS 32549 - 33811 1059 ## COG0612 Predicted Zn-dependent peptidases 21 8 Op 4 . - CDS 33814 - 34653 887 ## COG0652 Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family - Prom 34674 - 34733 4.5 + Prom 34619 - 34678 5.0 22 9 Tu 1 . + CDS 34747 - 36090 1161 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains + Term 36101 - 36134 5.1 + Prom 36243 - 36302 4.9 23 10 Tu 1 . + CDS 36390 - 36587 325 ## BF4116 hypothetical protein + Term 36605 - 36662 8.1 - Term 36593 - 36650 9.7 24 11 Op 1 . - CDS 36736 - 37401 179 ## PROTEIN SUPPORTED gi|238855152|ref|ZP_04645474.1| pseudouridine synthase, RluA family 25 11 Op 2 . - CDS 37410 - 38156 276 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 26 11 Op 3 . - CDS 38169 - 38765 591 ## BF4119 transcriptional regulator + TRNA 39004 - 39079 74.4 # Met CAT 0 0 27 12 Tu 1 . 
- CDS 39567 - 41090 949 ## BF3934 hypothetical protein - Prom 41123 - 41182 2.3 - Term 41114 - 41153 1.5 28 13 Op 1 . - CDS 41223 - 41996 369 ## BF4122 hypothetical protein 29 13 Op 2 . - CDS 41993 - 42895 291 ## PROTEIN SUPPORTED gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein 30 13 Op 3 . - CDS 42883 - 43911 593 ## COG2312 Erythromycin esterase homolog - Prom 44103 - 44162 4.0 - Term 44204 - 44271 9.2 31 14 Op 1 . - CDS 44295 - 45092 626 ## BF3938 hypothetical protein 32 14 Op 2 . - CDS 45105 - 45473 375 ## COG1725 Predicted transcriptional regulators 33 14 Op 3 . - CDS 45489 - 46304 551 ## BF4127 hypothetical protein 34 14 Op 4 . - CDS 46301 - 47161 246 ## PROTEIN SUPPORTED gi|225084369|ref|YP_002657150.1| ribosomal protein S16 - Prom 47312 - 47371 5.4 - Term 47396 - 47440 5.1 35 15 Op 1 . - CDS 47467 - 48900 1683 ## BF4129 TPR repeat-containing protein 36 15 Op 2 . - CDS 48920 - 49867 817 ## COG0226 ABC-type phosphate transport system, periplasmic component 37 15 Op 3 . - CDS 49870 - 50685 742 ## BF4131 TonB 38 15 Op 4 . - CDS 50715 - 51365 620 ## BF4132 hypothetical protein 39 15 Op 5 . - CDS 51378 - 51983 520 ## BF4133 hypothetical protein 40 15 Op 6 . - CDS 52011 - 52823 753 ## COG0811 Biopolymer transport proteins - Prom 52849 - 52908 2.7 - Term 52911 - 52944 1.5 41 16 Tu 1 . 
- CDS 52963 - 53406 270 ## BF3948 hypothetical protein - Prom 53571 - 53630 9.2 Predicted protein(s) >gi|226331986|gb|ACIB01000070.1| GENE 1 1784 - 1981 76 65 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVWYNFSMPVPFKGGSISKEKAVFSLLCIKSITLIIFLFVFAILAEACILGQKYEKKMKR NRCRT >gi|226331986|gb|ACIB01000070.1| GENE 2 1878 - 2831 927 317 aa, chain + ## HITS:1 COG:aq_1429 KEGG:ns NR:ns ## COG: aq_1429 COG0685 # Protein_GI_number: 15606607 # Func_class: E Amino acid transport and metabolism # Function: 5,10-methylenetetrahydrofolate reductase # Organism: Aquifex aeolicus # 14 316 13 287 296 173 32.0 3e-43 MRVIDLIHSNEKTAFSFEILPPLKGTGIEKLYQTIDTLREFDPKYINITTHRSEYVYRDL GNGLFQRNRLRRRPGTVAVAAAIQNKYNITVVPHILCSGFTQEETEYVLLDLQFLGITDL LVLRGDKAKHETVFTPEGDGHHHALDLQQQINNFNKGIFVDGSEMKVTNTPFSYGVACYP EKHEEAPNIDMDIYWLKKKVEAGAEYAVTQLFYDNKKYFEFVEKVHQAGIDIPIIPGIKP FKKISQLNMVPKTFKVDLPEELTKEALKCQTDEEARQVGIEWCISQCKELMAAGVPSIHF YSIGAVDSIKEVAKAIY >gi|226331986|gb|ACIB01000070.1| GENE 3 2835 - 3944 875 369 aa, chain + ## HITS:1 COG:DR2410 KEGG:ns NR:ns ## COG: DR2410 COG2812 # Protein_GI_number: 15807400 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, gamma/tau subunits # Organism: Deinococcus radiodurans # 3 224 13 195 615 105 32.0 1e-22 MFFRDVIGQEEAKYRLIQEVSEGRIPHAQLFCGPEGVGKFPLALAYARYLSCTNRSDTDA CGVCPSCVKFNKLVHPDVHFVFPVVKNGRSDDYIVEWRKLVLNNPYFTINHWLNEINAEN AQAVIYTKESDEIMKKLSLKSSEGGFKITLLWLPEKMQQACANKLLKLLEEPPEKTIFLL VSEAPDLILQTILSRTQRFNLRKIEEECMAEALQSKYGVQQATSISIAHLANGNFIKALE TIHLNEENQLFFELFVSLMRLSYQRKIREMKLWSEQVAGMGRERQKNFLEYCQRMIRENF IYNLHRKELTYMTLEEQNFATRFAPFVNERNVMGIMDELSEAQKHIEQNVNAKMVFFDFS LKMIVLLKQ >gi|226331986|gb|ACIB01000070.1| GENE 4 3961 - 5418 1416 485 aa, chain + ## HITS:1 COG:BS_yaaT KEGG:ns NR:ns ## COG: BS_yaaT COG1774 # Protein_GI_number: 16077100 # Func_class: S Function unknown # Function: Uncharacterized homolog of PSP1 # Organism: Bacillus subtilis # 41 275 3 231 275 167 39.0 4e-41 MEFKLHNGSGGLCCKSCSRQDKKLNTYDWLADIPGNAEESDMVEVQFKNTRKGYYRNSNK 
IKLEKGDIVAVEATPGHDIGTVTLTGRLVPLQMKKANFKQDAEIKRIYRKAKAVDMEKYE EAKAKEHTTMIRARQIAASLNLDMKIGDVEYQGDGNKAIFYYIADERVDFRQLIKVLAEA FRVRIEMKQIGARQEAGRIGGIGPCGRELCCATWMTSFVSVSTSAARYQDISLNPQKLAG QCAKLKCCLNYEVDCYVEAQKRLPSREIELETKDGTYYFFKADILSNQISYSTDKNFAAN LVTISGKRAFEVIGLNKRGIKPDSLLEAERKPEPKKPVDLLEQESLTRFDRDRRRNAKDG NGKDEGGNGNRKKKKKNNNNRPQQAANGETQSSQSTPINAQENNGSQPVQNQQRTRGERP KNNNRNNNQPRKNNESREPRNNENRETRKNEPREQRPPREPRGPRNNEQPKHIEKAQENE KPAQE >gi|226331986|gb|ACIB01000070.1| GENE 5 5261 - 5866 357 201 aa, chain + ## HITS:1 COG:no KEGG:BF4095 NR:ns ## KEGG: BF4095 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 46 201 1 156 156 314 100.0 1e-84 MSLVNRVTTRTGKRVKTNPVNSALLANHADRATMNNLNISRKLRKMKSLLKNSICMLLTT WVLTACDENTVYHSYQSTPPDGWKKSDTLFFNVPLKDSLANLRLSVGVRNSSNYPYQNLN ILIHYNLEDSTVWKTDTLKFILTDREGKWTGTGWGSLYQSALPLKDCFVKHPGNYTFKIV HEMKNEQLTGISDVGLKIEHL >gi|226331986|gb|ACIB01000070.1| GENE 6 5861 - 7318 1058 485 aa, chain - ## HITS:1 COG:TP0501 KEGG:ns NR:ns ## COG: TP0501 COG0772 # Protein_GI_number: 15639492 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Bacterial cell division membrane protein # Organism: Treponema pallidum # 52 479 46 430 433 167 32.0 6e-41 METRSVSLWKTLDWVTIVIYLLLIIGGWFSVCGASYDYGDRDFLDFSTRAGRQFVWIICS FGLGFILLMLEERMYDMFAYLIYIGMILLLIVTIFIAPDTKGSRSWLILGPVSLQPAEFA KFATALALAKFMNAYSFNIKKWKCFLPLVAFILLPMLLIILQKETGSALVYLAFFLVLYR EGMPGVVLFSGVCAVVYFVVGIRFDQVFIADTPTPIGEFAVLLMILLFAGSMVWVYRKKW EPVRNMIGGSLLVLLIAYLVSEYLSPFNLVWVEWGLCVVTIGYLLYLSLSERQRAYLLIG LFALGSIGFLYSSDYVFDNILEPHQQIRIKVVLGMEEDLAGAGYNVNQSKIAIGSGGLTG KGFLNGTQTKLKYVPEQDTDFIFCTVGEEQGFVGSAAVLLLFLALILRLIATSERQTSTF GRVYGYSVVSIFLFHLFINIGMVLGLTPVIGIPLPFFSYGGSSLWGFTILLFIFLRIDAG RSRRL >gi|226331986|gb|ACIB01000070.1| GENE 7 7299 - 9155 1316 618 aa, chain - ## HITS:1 COG:RSc0062 KEGG:ns NR:ns ## COG: RSc0062 COG0768 # Protein_GI_number: 17544781 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell division protein 
FtsI/penicillin-binding protein 2 # Organism: Ralstonia solanacearum # 7 598 11 633 801 285 33.0 2e-76 MAKDYVLEKRKYVIGGVALAIVLIYLLRLFSLQIATDDYKKNADSNAFLNKIQYPSRGAM YDRNDKLLVFNQPAYDITMVPKEVENLDTLDLCQTLNITRAQFLKIMSDMKDRRRNPGYS RYTNQVFMSQLSAEECGVFQEKLFKFHGFYIQRRTIRQYSYNSAAHVLGDIGEVSMKDIE NDDYYIRGDYIGKQGIEKSYESYLRGEKGVEVLLRDAHGRIQGHYMDGKYDKRPIPGKNL KLGIDIDLQMLGERLLKNKIGSIVAIEPETGEILCMVSSPDFDPRLMIGRQRGKNHLMLQ RDPMKPLLNRSIMGVYPPGSTFKTAQALTFLQEGIIQTNTPAFPCAHGFNYGSLHVGCHA HGSPLPLIPAIATSCNSYFCWGLFRMFGDRKYGSSQNAITVWKDHMVSQGFGYKLGVDLP GEKRGLIPNAQFYDKAYRGRWNGLTVISIAIGQGEILATPLQIANLGATIANRGHFITPH IVKEIQDEQLDSLYRVPRKTSIDRKHYEDVVTGMRAAVTGGTCRVAGAILPDVEVCGKTG TAQNRGHDHSVFMGFAPMNNPKIAIAVYVENGGFGAVYGVPIGALMMEQYLKGKLSPENE IRAEEYSNRVIMYGNEER >gi|226331986|gb|ACIB01000070.1| GENE 8 9158 - 9655 220 165 aa, chain - ## HITS:1 COG:no KEGG:BF3914 NR:ns ## KEGG: BF3914 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 165 1 165 165 267 100.0 9e-71 MIINYIHRVGWFVGLILLQVLILNNVHIAGVATPFLYVYFILKFNSGTSRNELMVWGFCM GLAIDIFSNTPGMNAAATVLLAFLRPLFLRLFTPRDTLDSIVPSLKSMGIASFLKYLVVS VFVHHFMLLTLEFFSFTSIPLLLLRVVSSTILTITCIMAVEGVRR >gi|226331986|gb|ACIB01000070.1| GENE 9 9652 - 10497 754 281 aa, chain - ## HITS:1 COG:lin1582 KEGG:ns NR:ns ## COG: lin1582 COG1792 # Protein_GI_number: 16800650 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell shape-determining protein # Organism: Listeria innocua # 44 268 50 278 295 93 29.0 4e-19 MRNLLNFLLKYNHWFLFILLEVISFVLLFRFNHYQHSVYFSSANVVAGKVYEVSGGITSY FHLKSVNEDLLDRIMELEQQNRNLENALVKHLSDSTELNSIRNLSDTDYEIFKARVINNS LNLVDNYITLNRGSKDGIRPEMGVVDGNGVVGIVYETSSHYSRVISVLNSKSSISCKIVG SEYFGYLKWEYGDARYAYLKDLPRHAEFNLGDTVVTSGYSTVFPEGIMIGTVDDMADSND GLSYLLKVKLATDFGKVSEVRVIARTGQREQKELEQKSLAQ >gi|226331986|gb|ACIB01000070.1| GENE 10 10599 - 11621 1253 340 aa, chain - ## HITS:1 COG:CAC1242 KEGG:ns NR:ns ## COG: CAC1242 COG1077 # Protein_GI_number: 15894525 # Func_class: D Cell cycle control, cell division, 
chromosome partitioning # Function: Actin-like ATPase involved in cell morphogenesis # Organism: Clostridium acetobutylicum # 1 335 1 332 335 284 46.0 1e-76 MGLFSFTQEIAMDLGTANTIIITGGKIVVDEPSVVALDRRTDKMIAVGEKAKLMHEKTHE NIRTIRPLRDGVIADFYACEQMMRGLIKQVNTRNRLFSPSLRMVIGVPSGSTEVELRAVR DSAEHAGGRDVYLVFEPMAAAIGIGIDVEAPEGNMIVDIGGGSTEIAVISLGGIVSNNSI RIAGDDLTADIQEYMSRQHNVKVSERMAERIKINVGAALTELGEDAPEDYIVHGPNRITA LPMEVPVCYQEVAHCLEKSISKIETAILSALENTPPELYADIVHNGIYLAGGGALLRGLD KRLTDKINIPFHIAEDPLHAVAKGTGVALKNVDRFSFLMR >gi|226331986|gb|ACIB01000070.1| GENE 11 11696 - 13219 1646 507 aa, chain - ## HITS:1 COG:aq_1963 KEGG:ns NR:ns ## COG: aq_1963 COG0138 # Protein_GI_number: 15606962 # Func_class: F Nucleotide transport and metabolism # Function: AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) # Organism: Aquifex aeolicus # 10 507 3 506 506 436 47.0 1e-122 MSESKRIKTALVSVYHKEGLDEIITKLHEEGVEFLSTGGTRQFIESLGYPCKAVEDLTSY PSILGGRVKTLHPKIFGGILCRRGLEQDIQQIEKYEIPEIDLVIVDLYPFEATVASGADE AAIIEKIDIGGISLIRAAAKNFNDVIIVASQAQYKPLLDMLMEHGATSSLEERRWMAKEA FAVSSHYDSAIFNYFDAEEGSAFRCSANSQKTLRYGENPHQKGYFYGNLDEMFDQIHGKE ISYNNLLDINAAVDLIDEFDDVTFAILKHNNACGLASRPTVLEAWKDALAGDPVSAFGGV LITNAVIDKETAEEINKIFFEVVIAPDYDVDALEILGQKKNRIILVRKEAKLPRKQFRSL LNGVLVQDRDLNIETTADLKTVTDKAPTPEEVEDMLFANKIVKNSKSNAIVLAKGKQLLA SGVGQTSRVDALKQAIEKAKSFGFDLQGAVMASDAFFPFPDCVEIADKEGVTAVIQPGGS VKDQLTFDYCNEHGMAMVTTGIRHFKH >gi|226331986|gb|ACIB01000070.1| GENE 12 13371 - 15404 2003 677 aa, chain - ## HITS:1 COG:MA2001 KEGG:ns NR:ns ## COG: MA2001 COG3590 # Protein_GI_number: 20090849 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Predicted metalloendopeptidase # Organism: Methanosarcina acetivorans str.C2A # 32 677 16 665 665 542 41.0 1e-154 MKVIKYLPILAICCMATGCNSKKEAVLTSGIDLANLDTTALPGTSFYQYACGGWMKNHPL TDEYSRFGSFDMLAENNRQQLRGLIEGLAAEKHEAGSIAQKVGELYNIAMDSVKLNKEGA APIKPELEKIGAIKDKAEIYPLIVEMQKRGMYPYFILYVSADDMNSNENMVHTMQGGLGM 
GERDYYLEDDAQTKEIRDKYQQHVSKMFQLAGYDEATARKAVKAVMNIETRLARSARSQV ELRDPHANYNKKTLEELQKEYPSFAWDVFFSTAGLNNLKEVNIGQPDALKEVNAIIDTVS LEEQILYLQWNLINSAANYLSDDFIAQDFDFYGRTMSGKKEMQPRWKRAVSTVDGSLGEA VGQMYVEKYFPAAAKERMVALVKNLQESLGERIKGLSWMGEETKEKALEKLATFHVKIGY PDKWKDYSSLEIKDDSYWANIERANQWSYNEMIGKYGKPVDKDEWYMTPQTVNAYYNPTT NEICFPAGILQYPFFDMNADDAFNYGAIGVVIGHEMTHGFDDQGRQYDKDGNLKDWWTAE DAKNFEARAAVMANFFDSIEVAPGVHANGEFTLGENIADHGGLQVSYQAFKKATAAAPLK IENGFTPEQRFFLSYANVWAGNIRPEEILKRTKTDPHSLGKWRVDGALPQIGAWYEAFNI TEKDPMYLPVDKRVSIW >gi|226331986|gb|ACIB01000070.1| GENE 13 15420 - 17375 1936 651 aa, chain - ## HITS:1 COG:all4183 KEGG:ns NR:ns ## COG: all4183 COG0488 # Protein_GI_number: 17231675 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Nostoc sp. PCC 7120 # 1 532 1 531 564 395 42.0 1e-109 MISVEGLTVEFNATPLFEDVSYVINKKDRIALVGKNGAGKSTMLKILAGLQSPTRGVIAI PRDVTIGYLPQVMILADNHTVMEEAELAFEHIFELQADLERMNQELADRTDYDSEEYHKL IDRFTHENDRFLMMGGTNFHAEIERTLIGLGFSREDFNRPTSEFSGGWRMRIELAKLLLR KPDVLLLDEPTNHLDIESIQWLETFLSTRANAVVLVSHDRAFLNNVTTRTIEITCGQIYD YKVKYDEYIVLRQERREQQLRAYENQQKQIEDTEAFIERFRYKATKAVQVQSRMKQLEKI ERIEVDEVDNSALRLKFVCSSRSGNYPVICEDVKKAYGAHVVFHDVNLTINRGEKVAFVG KNGEGKSTLVKCIMSEIDYEGKLTLGHNVQIGYFAQNQAQMLDENLTVFDTIDRVAVGDI RLKIRDILGAFMFGGEASDKKVKVLSGGERTRLAMIKLLLEPVNFLILDEPTNHLDMRSK DVLKEAIREFDGTVIIVSHDRDFLDGLATKVYEFGGGVVKEHLGGIYDFLQKKKIENLNE LQKANPSSASPANGKKEEGAEEGISENKLSYEAQKELNKKIKKLERLVADCEAAIEQTES AIAILEEKMATPDGASDMSLYEQHQKLKQQLDHTVEEWERVSMELEEMNEK >gi|226331986|gb|ACIB01000070.1| GENE 14 17603 - 19993 1195 796 aa, chain - ## HITS:1 COG:no KEGG:BF4104 NR:ns ## KEGG: BF4104 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 12 796 12 796 796 1513 98.0 0 MKKLLCLIILLFSFIRSEAQSDHICPHEKLYVHTDRANYERGDTVRFRAYLMDTRHETMA YSRYVYAELLADSQVISREMVKDDHGVFSGYLSLGDTLRSGNYTLRFYTRHLSSLPAPRY FYRQIIVGGRSFQDYRKESVTRMAASYHVSFFPEGGRLPSGCVSRVAFKALSPDGLGTDV 
QGFVVNQRGDTVTTLRSVHRGMGFFNLEPVSGDSYTAVCRNHEGMELRFPLPPADPSAVS LRVDVRKDDFLVRLNSGVSPSPGHSLRVEYRDSILLHAAFSGSRPLRLPRSPLPPGVLRF VLLDGSGVPISGRTAFNQSPSVRADVDFSARLKEEKGRSFWDVSLGLRDSSGEPLGGTLS VSVTDDRYALQDTTVNILSSFLLSSDLQGYVEAPSFYFSGDDSRTSYLLDLLMLTQGWVK YSLVPDYGFFPVERSQCVSGKVVSEYSEKKCIVDAVVTLFSFDKKIMRQTTSDASGCFRF DSLSFPRGTHFLLQARKKKGGTDVALLVDRDSVPSVHSSLPVYADWFRAEMDEPVVSEQS GNDPSPTSANVFQRPSTAPFSMEQYLDEVVVSTKKIEKKKQYAIESLMAHSDWNKTYHVD EMTLSPYSSTKDLLINTPGVGWAVDSNSGDFFYITRLRAGRSSTPPPALLMVDGLETSYS ELVGIPVSIVESIELVKDAAQMAYIGSKASNGAILISTKSGLGTVGKKASNFRIIRPIGY QVKRESYSPSYPVVYGAINNGGNSFRTIYWSPDLLLDKAASHLEFGNPKQGRLTLVVQGI TPDGKLINLTRTLGNE >gi|226331986|gb|ACIB01000070.1| GENE 15 20870 - 21772 1177 300 aa, chain - ## HITS:1 COG:no KEGG:BF3922 NR:ns ## KEGG: BF3922 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 300 1 300 300 610 100.0 1e-173 MKPTLFVLAAGMGSRYGGLKQLDGLGPNGETIMDYSIYDAIRGGFGKVVFVIRKDFEQDF REKILSKYENHIPVELVFQALDNLPEGFTCPADRVKPWGTNHAVLMGKDVIKEPFAVINA DDFYGRDSFAVLGAELSQMDGKKNDYCMVGYRVGNTLSESGSVARGVCETNAEGYLTTVV ERTAIERIDGKVSFKDENGEMQTIGDNTPVSMNMWGFTPDYFAYSEEYFKEFLKENEGNL KSEYFIPLMVNKLVNEGTARVKVLDTTSKWFGVTYAADRQGVVDKIQALVDAGEYPDKLF >gi|226331986|gb|ACIB01000070.1| GENE 16 21899 - 24250 2122 783 aa, chain - ## HITS:1 COG:no KEGG:BF4108 NR:ns ## KEGG: BF4108 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 783 1 783 783 1593 99.0 0 MIRKETVHIIWVTIQAGLVMLLLGACSTTKHLPEGEILYTGSKTIVENEPQAPLTEDALT ELDAALDKAPSTKILGFVPIPFKMWAYNSLVRYKKGFGHWLFNRFAANPPVFISTVNPEI RAKVGTNLLHDYGYFNGTVRFQTVPDKKDSLKASIRYTVDMKDPYYIDTVYYTRFNPRTL RIMERGRRGSLLTPGEQFNVSDLDGERSRISTLLRNRGYFYFRPDYMKYQADTLLNPGHV SLRLIPVPGLPDAAQRPYYVGKTSVFLYGKGGEVPNSTLEYRGLDIHYYKKMQVRPNMLY RWLNYQAYVRNDSLRNSAHSRLYSQYRQTRIQERLSQLSIFRYLDLQYIPQDSTATCDTL NVRLQATFDKPYDAELEFNLTTKSNNQTGPGASFGLTRYNVFGGGETWNVKLKGSYEWQT GQNKGSSLMNSWEMGVSTALTFPRVVFPSFGGREYDFPATTTFRLYIDQLNRAKYYKLLA FGGNATYDFQPTRISRHSLTPLRVTFNVLQHTTKAFEEIADQNKALYRSLQNQFIPAMEY 
TYTFDNAALRGVRNPIWWQTTFTSAGNITSGIYRIFGKKFSQRDKKLFGVPFAQFLKVNS DFRYTWKIDKNQSIASRVAGGIIWAYGNTNTAPYSEQFYIGGANSVRAFTARSIGPGGFR PTKTSKGLYLDQTGDIRMEANVEYRFRIYGDLHGAVFLDAGNVWMLRKDEEAPEKQLRWK TFGKQIALGTGAGIRYDLDFLILRLDCGVPLHDPYDTGKKGYYNVTGSFWKGLGLHFAVG YPF >gi|226331986|gb|ACIB01000070.1| GENE 17 24278 - 28822 3847 1514 aa, chain - ## HITS:1 COG:no KEGG:BF3924 NR:ns ## KEGG: BF3924 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1514 1 1514 1514 2950 99.0 0 MKRKWIKWVSWILLTPLILFVILMVLLYIPPVQNFLRKEAAAYASEATGMQINVRRIDLR FPLNLLVRGVEVIQAPDTLLSLESLNVHVQALPLFRGKVEVDDISLQQVAVNSANLIDGM RLKGVLGSFRLESHGVDLPNEIAIINRAELSDTHVQLLLNDTTATPKDTAQSEVRWKVDL RHLKLKNVSFSMQLPADSMRLAAHVEEAQVNDAEADLKNLHYGLRSFLVSGTSVNYDVGT AEPAEGFDPSHIALRDIRIGLDSMYYRGRNMNAVIREFSMNDRSGLSVTSLTGRVFANDT IIQVPSLKLLTPHSEMDLTAQTYWELVNIPTTGRLTARFNAMIGKQDVLLLAGGLPDSFK EAYPFRPLVIRAGTEGNLKEMQITRFSAELPGAFSLSGGGELLNLTDSLERSAIIDLRMQ TQNLNFLTALGGTRPDSLLVIPNQMSLVAKARMKGPQYMAQLLLKEGEGMLNLDAAYNGS TEAYRADLKVDALQLHHFLPKDSIYELTTSVAAVGRGIDFTSYRTTASLKASLQSLHYGR YQISGIEVTGDVKNALATARLVSDNRLLKMNANAEYHLAKPYMDGKLDMDVTQLDLYELG IAPKPLKYPLAFNFTAEARRDRIFTHLTAGDMKLNLSARSSLDKLIKQSAHFADVLVKQI DKKELDHGELREALPTAIFSMSAGKENPLAYYLATKDIAFHDVGVKFGTAPDWGINGKAS IHALKMDTLQLDTIYFTVKQDTTRMSLHGGVINGPKNPQIVFKSSFAGEIRNDDAELTLR YENAKGETGVLFGVNVRPLVEGNGKGDGLAFTLIPENPIIAFRKFHFVDHHNWIYLHKNM RLYANVDMADDEDMGFRIKSNRSDTVSLQNIDVELQRIRLSEISEVLPYLPDLSGLFSAE ANYVQTATSLQVSAEANIDELTYERQRIGDVALGVTWLPGERGKHYISTYLTHEGEEILM ADGSLHPSVTGKDSIEVNAIMEHFPLKIANAFVPDQVVTLSGDMDGGLHITGDTDRPLVN GDLVLDSVSVFARQAGARFTFDNRPVQIKNSRLTFDKFAIFTTGKNPFTIDGTVDFRNLT DPRVNLSMLAENYMLLNAPRTKESLVYGKVFVDFNATVRGPVNALVMRGNMNLLGNTDVT YVLMDSPLTVQDRLGDLVTFTSFSDTTTVQKEEAPVVSLGGLDMIMTVQIDPGVRLKADL SADRSSRVELQGGGNLSMQYTPQGDLSLSGRYTLTGGMMKYALPVIPLKEFNINNGSYVE WTGNPMDPMLNLKATERLRASVGSENGQSRMVNFDVSIVVKNRLDNLSLAFEIDAPDDAE VQNQLASMSADERGKQAIAMLATGLYLANSGSSGGGGLNMGSALNSILSSQINALAGNLK NASFSMGVEDHDAADAGGKRTDYSFRYSQRFFNDRFQIVLGGKVSTGAQATNDVESFIDN 
ISLEYRLDTSGTRYIRLFHNKNYESVLEGEITETGIGLVLRKRIDRLGELFIFRKKKKTL PETQKAPSDSEAHQ >gi|226331986|gb|ACIB01000070.1| GENE 18 28936 - 31236 1800 766 aa, chain - ## HITS:1 COG:alr1285_1 KEGG:ns NR:ns ## COG: alr1285_1 COG0642 # Protein_GI_number: 17228780 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Nostoc sp. PCC 7120 # 261 490 222 455 483 147 35.0 1e-34 MAIPSRFTKVKIAAGYTLLLVVLLLSLLFVHREMEKLSDTDDVQSLQTDSLLLLLKEKDE NTIRMLQIINEANESMITPVELDSIIAEQDTVITQQRVQHRIITKRDSVITKRKKKGFFR RLGEVFAPSKEDTAVLVNTAVEFATDTILEPYNPIDSLHQRIRTVTKRKSRPVVRPNYNA RLRRMNKELTTRIDSMITTYEQVVTQRAMDNASEQQELRNRSTRTIGGIAIAAVLLSACF LIMIWRDITRSNRYRKELEEANKRAEALLEAREKLMLAITHDFKAPLGSIIGYTDLLTGL TTDERQRFYLDNMKSSSQHLLKLVSDLLDFHRLDLNKAEVNRVTFNPAQLFEEIRISFKP LTDAKHLTLSCSIDAELDGRFISDPLRIRQIVNNLLSNAVKFTAKGSIALNITYHSSSVR IEVVDTGKGMAPGDREKIFQEFTRLPGAQGEEGFGLGLSIVHKLVTLLEGSISVQSTLGE GSRFIVILPLYPVGPVTGEKREGNVSSVSTTDQAGEDGVMASPKLNRVLLIDDDRIQLAL TAAMLEQQGIQAVCCQQPDELIEQLRTATFDVLLTDVQMPAINGFDLLKLLRASNIPQAR TIPVIAVTARSEMNEQDFQEHGFAGCLHKPFTVKELLTIISGEEMTGSSAELTPDSLNFR ALTAFSEDDPEAASTIIQTFIEETEKNRNRMESAIRATDVDGIAGMAHKLLPLFTLLGAS EALPLLLWLEQRRGEAVSDEMIQKANEALRQVDIVMTEARRYVAGD >gi|226331986|gb|ACIB01000070.1| GENE 19 31240 - 32565 1171 441 aa, chain - ## HITS:1 COG:VC1540 KEGG:ns NR:ns ## COG: VC1540 COG0534 # Protein_GI_number: 15641548 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Vibrio cholerae # 5 438 8 446 461 175 30.0 2e-43 MRLIYKNHYKALFLLGLPIVIGQVGVIVLGFADTLMIGHHSTNELGAASFVNNMFTLAII FSTGFSYGLTPIVGGFYGTRRFASAGQALRCSLLANLLVGILLTVIMGILYLNVERLGQP EELLPLIKPYYLILLASLVFVLLFNGFKQFTDGITDTKTAMWILLGGNVLNIVGNYILIN GKLGFPELGLLGAGISTLFSRIVMVLVFAFVFFSSRRFLRYKLGFIRLGWSRTLFRQLNA LGWPVAFQMGMETASFSLSTVMVGWLGTIALASHQVMLTISQFTFMMFYGMGAAVAVRVS NFKGQNDIVNVRRTAYAGAHIILAMGVVLLSIVFLFRYQVGGWFTDNTEVSAMVVVLMVP FLAYQFGDGMQINFANALRGISDVKPMMLIAFIAYFIISLPAGYFFGFVMGWGLLGVWMA FPFGLSSAAIMLWLRFRYKTR >gi|226331986|gb|ACIB01000070.1| GENE 20 32549 - 33811 
1059 420 aa, chain - ## HITS:1 COG:CC3584 KEGG:ns NR:ns ## COG: CC3584 COG0612 # Protein_GI_number: 16127814 # Func_class: R General function prediction only # Function: Predicted Zn-dependent peptidases # Organism: Caulobacter vibrioides # 6 418 47 468 948 200 32.0 5e-51 MLQINRHILDNGLRLVHAQDTSTQMVALNILYNVGARDENPEHTGFAHLFEHLMFGGSVN IPDYDAPLQLAGGENNAWTNNDITNYYLTVPRQNVETGFWLESDRMLSLDFSERSLEVQR GVVMEEFKQRCLNQPYGDVGHLLRPLAYRVHPYQWPTIGKELSHIANATLEEVKDFFFRF YAPNNAVLAVTGNISFEEALHLTEKWFGPIPRREVPLRQLPPEPVQTEERRLVVERNVPL DSLFMAYHMCDRADSDYYAFDILSDILSNGRSSRLNQHLVQEKQLFSSIDAYISGTLDAG LFHISGKPAAGVSLEEAEAAVREELNELQSALIQEQELEKVKNKFESTQIFGNINYLNVA TNLAWFELNGRAEDMEKEVERYRAVTADRLNAVAQTAFREENGVVLYYKSSRGEKDETYI >gi|226331986|gb|ACIB01000070.1| GENE 21 33814 - 34653 887 279 aa, chain - ## HITS:1 COG:SPy0457 KEGG:ns NR:ns ## COG: SPy0457 COG0652 # Protein_GI_number: 15674576 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family # Organism: Streptococcus pyogenes M1 GAS # 24 279 69 262 268 98 30.0 2e-20 MKQNFWILLIILACSAVACKSGQKKDGNMEKETVLKIETSMGDIKVKLYNETPKHRDNFI KLAKDGTYNGTLFHRVIKDFMVQAGDPESKNAPKGKMLGSGDVGYTVPAEFVYPKYFHKK GALSAARQGDEVNPKKESSGCQFYIVTGKVFNDSTLLNMEQQKNQNKVTEAFNALAQKHM KEIYKMRKANDQDGLYALQDTLFIQAEAEAAKQPDFHFTPEQIKAYTTVGGTPHLDGEYT VFGEVVEGMDIVDKIQQVKTDRSDRPEEDVKIINVSVIE >gi|226331986|gb|ACIB01000070.1| GENE 22 34747 - 36090 1161 447 aa, chain + ## HITS:1 COG:STM4174 KEGG:ns NR:ns ## COG: STM4174 COG2204 # Protein_GI_number: 16767428 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Salmonella typhimurium LT2 # 4 440 8 439 441 318 39.0 2e-86 MRSILIVEDDITFGMMLKTWLGKKGFNVSSVSNIARAQKHIDAQPVDLILSDLRLPDHDG IHLLKWMGEKELHIPLIIMTGYADIQSAVQAMKLGAQDYIAKPVNPEELLKKMSEALQKK EAPLPKTPLTEKSPKTKQESHSYLEGESDAAKQLYNYVSLVAPTNMSVLINGASGTGKEY 
IAHRIHQLSKRSDKPFIAIDCGSIPKELAASEFFGHIKGSFTGALSDKTGAFVAANGGTI FLDEIGNLSYEIQIQLLRALQERKIRPVGSNSEITVDIRLVSATNENLEQAIEKGTFRED LYHRINEFTLRMPTLKERGGDILLFANFFLDQANKELDKQLIGFDANASKALLEYHWPGN LRQMKNIIKRATLLAQGSFIGLAELGSEILETQLSTPKMTLRDEDAEKEHILEALRQTGN NKSRAAQLLDIDRKTLYNKLKLYGIDL >gi|226331986|gb|ACIB01000070.1| GENE 23 36390 - 36587 325 65 aa, chain + ## HITS:1 COG:no KEGG:BF4116 NR:ns ## KEGG: BF4116 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 65 30 94 94 105 96.0 7e-22 MKKLVLVVAMFMFACGSFFTMAQDPVKRDPKKEVKDTAKTEPKKEVADTTAITVAASGAF AQFAE >gi|226331986|gb|ACIB01000070.1| GENE 24 36736 - 37401 179 221 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|238855152|ref|ZP_04645474.1| pseudouridine synthase, RluA family [Lactobacillus jensenii 269-3] # 3 210 83 279 287 73 28 2e-12 MTVIYEDNHIIVVNKTASEIVQGDKTGDTPLSETVKQYLKDKYQKPGNVFIGVTHRLDRP VSGLVIFAKTSKALSRLNEMFKNSEVKKTYWAIVKNCPKQPEGELVHYLVRNEKQNKSYA YDKEVPNSKKAILNYKLIGHSDHYFLLEVDLKTGRHHQIRCQLAKMGCPIKGDLKYGSAR SNPDGSICLHARHVRFVHPVSKELIELDAPVPDSNLWHGFQ >gi|226331986|gb|ACIB01000070.1| GENE 25 37410 - 38156 276 248 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 7 248 4 242 242 110 33 1e-23 MGLLDGKTAIVTGAARGIGKAIALKFASEGANIAFTDLVIDENAQNTAKEIEAMGVKAKG YASNAANFEDTAKVVEEIHKDFGRIDILVNNAGITRDGLMMRMSEQQWDMVINVNLKSAF NFVHACTPIMMRQKAGSIINMASVVGVHGNAGQANYSASKAGMIGLAKSIAQELGSRGIR ANAIAPGFIITDMTAGLSEEVKTEWAKKIPLRRGGTPEDVANIATFLASDMSSYVSGQVI QVDGGMNM >gi|226331986|gb|ACIB01000070.1| GENE 26 38169 - 38765 591 198 aa, chain - ## HITS:1 COG:no KEGG:BF4119 NR:ns ## KEGG: BF4119 # Name: not_defined # Def: transcriptional regulator # Organism: B.fragilis # Pathway: not_defined # 1 198 1 198 198 362 100.0 4e-99 MTVSKTKAKLVDVARQLFAKMGVENTTMNDIALASKKGRRTLYTYFKSKDEIYLAVVESE LDILSDMMKRVADKDISPDKKIIEMIYTRLDAVKEVVYRNGTLRANFFRDIWRVEKVRKR FDAKEMQLFKSVLKEGQDKGVFHVDDVEMTAALVHYCVKGIEVPYIRGHIGANLDMETRK RYVANIVFGALHRTEINQ 
>gi|226331986|gb|ACIB01000070.1| GENE 27 39567 - 41090 949 507 aa, chain - ## HITS:1 COG:no KEGG:BF3934 NR:ns ## KEGG: BF3934 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 507 1 507 507 1063 99.0 0 MIGYRNLLISLFCSMAVSAAGQPRLVKSLVPDMPSQAPDYFCTWNLQGYVASYKSTELTR AAMTEDYLFGDGLYQNWVDCYPAIRKDLYFVMDDSWDIPKDVNDSPNLYLGCVELSSDRF PSFRGDAVERLKQLSEQIKSKGWKGVGGWICAQKAETHAAIPEEEYWKQRIKTANAAGFD YWKVDWGKEDRNGEWRRKLTAIGKRYAPHLYIEHALRNEFIEFSDVFRTYDVENITAQPI TIRRICDLLPYKTVEGAKGIINCEDEPYIAVGLGCAIGVMRHPFAGTLPDGTQDFVFPPV GRDIKRRLDEVVRGVRWHRIAEPFAVGYGTFAIDSVKLTDHWILQENETWNKGRTVGADV TADAPARVARNMKLPEVSGAPLSVCPFVLASRYPNGAVAVSTIGRNVGREYVTEKVAVSI SVDRWDIPIGLFGYFKEVTMVFPSPLKTGKHTVFAQDLAGENPVDITSNVVIKDNRLIIP GEVISRVGLMNASEGDCSDPGMVIRVM >gi|226331986|gb|ACIB01000070.1| GENE 28 41223 - 41996 369 257 aa, chain - ## HITS:1 COG:no KEGG:BF4122 NR:ns ## KEGG: BF4122 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 257 1 257 257 393 100.0 1e-108 MRILRDLQNVIASEYYKTRHDVAAKLFLFFPVLLTVAFIVYDLWNLSQEGYDGTNLWIYN IGRTLFMFYGMLYPLMAALFCAAYIGKEFKNDNYLLLFLFPVPRGTVYVAKLIYLLSMTF LSVLIAYVAFMLSGFILGVCLPSMGFQNFDVRILVISVFFRVFIGLLPILVIQYVFSFLF KNYALALGFSFFMTVFSMIASNWRYINFIPYSSILHAYSSFMQQTVYYWKSFETINISYF IVFSIVGYILYRYKKWR >gi|226331986|gb|ACIB01000070.1| GENE 29 41993 - 42895 291 300 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein [Acinetobacter baumannii AYE] # 11 298 14 304 311 116 27 2e-25 MGKIEIRNLCFRYGKQMVLNNLNLDIPENALYGYLGNNGSGKTTTIQVLLGLARPVKGEV LYDGQPFRDQREKQLRKIGLCPGEPFYYDNLTGYEHLAYLDHIYHCGRTAINKVLAITGI ENARNKKLRHYSTGMIHRLGMAMALLHDPDILFLDEPLNGLDPEGIHSIRELLLQLHQEG KTVFLSSHLLDEVEKTCTHVGILQHGCLLYQGDLSELLNNIEKRIHIRLDKVDLLHSVCK EVQIDSRIKSESILEVILSDDTTYDRLIELLGQGGYHISAIQPLENTLESVYLKLTSQTK >gi|226331986|gb|ACIB01000070.1| GENE 30 42883 - 43911 593 342 aa, chain - ## HITS:1 COG:BS_ybfO KEGG:ns NR:ns ## COG: BS_ybfO COG2312 # Protein_GI_number: 16077300 # 
Func_class: R General function prediction only # Function: Erythromycin esterase homolog # Organism: Bacillus subtilis # 1 284 88 396 446 60 23.0 4e-09 MVRYLHEKLGYNVILYETGLYDMYLMNQDGRQRMNPSKAVWTFWWGSNETKSLWEYYRSH PSIALDGFDCQLTNYGQGRKHMESVEKYLNGYSSSLSDFPYVQRFFLQMSEFNGNWNYFG YRLDRMLKDSIVQDFNKLENRIHEESRYSMEDGLHQRYIAGLKLRYESIWKYRNVGDLTR MNLRDSIMADNLTWLVDSVYKDQKVIVWCANVHVFNRGRMQVDSTRFTSMGQRLKIHFGD RMYTIAFTSYARRNTDGGIRDPLSTLSLEYLLHQRKVGFAYLNFNELPVDSRWRQAFISG LDQGASLSEVWSEQMDMLFYIDLNYDVYYDKTFINKMYKWAK >gi|226331986|gb|ACIB01000070.1| GENE 31 44295 - 45092 626 265 aa, chain - ## HITS:1 COG:no KEGG:BF3938 NR:ns ## KEGG: BF3938 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 265 1 265 265 531 100.0 1e-149 MKRTTYIFIGILVSVLVILMAGVVYISFQKSETNSYTLTLSEKILRTEFSGIRAVKVYAN DTRRVWLDEACVNVVSSTDGKTRLLSPESEYLKISQNADTLVICLDLTTYDLPKQKKEYL IPALQAKGLQLTIEADSNLAFVANAIRGLRTRIDRLHADSLTTYIRGGELRLDSCEIRAF CVDGDGVTFNAHQSIIPHLYLDLDGVRNWGVHESVIGTEHLTGSGVYHNNLQRGECKKMI WTPKKEDAELRLTIREKGSLILQEE >gi|226331986|gb|ACIB01000070.1| GENE 32 45105 - 45473 375 122 aa, chain - ## HITS:1 COG:BH3492 KEGG:ns NR:ns ## COG: BH3492 COG1725 # Protein_GI_number: 15616054 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Bacillus halodurans # 3 115 5 117 129 76 31.0 1e-14 MNFKESKAIYLQIADRICDEILLGQYQEEERIPSVREYAAMVEVNANTAMRSFDYLQSQD IIYNKRGIGYFVSSGAKELIFSLRRETFLKDELEHVFRQLYTLGVSDDELLTMYRNFMMK QK >gi|226331986|gb|ACIB01000070.1| GENE 33 45489 - 46304 551 271 aa, chain - ## HITS:1 COG:no KEGG:BF4127 NR:ns ## KEGG: BF4127 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 271 1 271 271 491 100.0 1e-138 MIRDTFFSQTRFVNLCRKEMVENWKSNLLRVALMYGAMAVIMLWSGYLSYRAVGQDTDST WEFNLVIFMWGLCVFGCLSASFTMERMKSKTGRLSVLMTPATSFEKYFSRWLVFTVVFLI VFLITYKLADYTKVLVYSLVYPENNAIAITPLSHLFGENTDYYTVFKHTHTFVLMIASYF FCQSCFVLGSSVWPKNSFIKTFSAGMIIFIAYVLIVVGFAKLIWPDQISYNPDMSEETAF ACLSAIAVLFTLTNWTLAYFRFKESEIINRM 
>gi|226331986|gb|ACIB01000070.1| GENE 34 46301 - 47161 246 286 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|225084369|ref|YP_002657150.1| ribosomal protein S16 [gamma proteobacterium NOR51-B] # 21 277 25 294 309 99 27 4e-20 MKMITVENLSFLYRKSKRAVLHDFSLSLEKGRVYGLLGKNGAGKSTLLYLMSGLLTPKSG KVVYHDVDVRRRLPITLQDMFLVPEEFDLPPVSLISYIELNSPFYPRFSKEDMVKYLHYF EMDINIDLGALSMGQKKKVFMSFALATNTSLLLMDEPTNGLDIPGKSQFRKFIASGMTDD KTILISTHQVRDIDKVLDHVLIMDNSRVLLNESTMSICDKLFFTESENRELLQSSLFSTP SIQGNFLLLPNESGEDSEINLELLFNATLAVPERISALFHSKQTEL >gi|226331986|gb|ACIB01000070.1| GENE 35 47467 - 48900 1683 477 aa, chain - ## HITS:1 COG:no KEGG:BF4129 NR:ns ## KEGG: BF4129 # Name: not_defined # Def: TPR repeat-containing protein # Organism: B.fragilis # Pathway: not_defined # 1 477 1 477 477 837 99.0 0 MKRFQLFLVGAFVAAGPLFAQSADADWHSEVAKVKELIQTNPAQASEEAAQLLKGKNKKN TDLLIAIGQAYLEAGKINEAEAYAALGQKANSKSAAVSVLQGDIAVAKKDAGKACQLYEQ AIYFDPNYKEAYLKFADVYKGASPQLAIEKLEQLKNLDPSCVAADKKLAEVYYLNNKFDK AAEAYAHFINTPEATEDDLTKYSFALFLNHDFEKSLQIALMGLQKNPRDAAFNRLAMYNY TDLKRYDEAMKAADAFFKESDKADFSYLDYMYFGHLLNAVKKYDQAVEAYMKAITLDPAK TDLWREVSSSYELNNEFTKAIEAYKKYSESLSADKRTPDVQFQIGKLYYEKGTQSDTLTV SLDERKAALVSADSIFTEIAKVAPDSYLGNFWRARTNSALDPETTQGLAKPYYEEVAAFL IDKNDPRYNSALIECYSYLGYYYLVANKLPESKEYWNKILAIDPANATAKRALDGIK >gi|226331986|gb|ACIB01000070.1| GENE 36 48920 - 49867 817 315 aa, chain - ## HITS:1 COG:TM1264 KEGG:ns NR:ns ## COG: TM1264 COG0226 # Protein_GI_number: 15644020 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type phosphate transport system, periplasmic component # Organism: Thermotoga maritima # 37 303 21 271 274 83 23.0 4e-16 MTKKQFWLIGAWSLIALSACSSKPKDGLTDTYTSGVIAITADESFQPIVQEEIDVFEGLF PLAGIVPRYTTEVDAINQLLKDSVRLAITTRTLTPEEMNSFHSRKFFPREIKLATDGLAL IVNRQNADSLISVRDIRRILTGQVQKWKELYPASGLGDIQLVFDNKNSSTVRFAVDSICK GAPLSDKDVKALKTNQQVIDYVAHTPDAIGVIGVNWLGNRSDTTNLSFRDEIRVMSVSAD DVATVENSYKPYQAYLYYGNYPLARPIYVLLNDPRNALPWGFASFLTSDRGQRIILKSGL VPATQPVRIVDVKDE >gi|226331986|gb|ACIB01000070.1| GENE 37 49870 - 50685 742 271 aa, 
chain - ## HITS:1 COG:no KEGG:BF4131 NR:ns ## KEGG: BF4131 # Name: not_defined # Def: TonB # Organism: B.fragilis # Pathway: not_defined # 1 271 1 271 271 478 100.0 1e-133 MAKIDLTSFEWCELIFKGKNKAYGAYKMRADSPKRHNVAMVIVLIIALVGFSLPTLIKMA TPKQKEVMTEVTTLSQLEEPEVKQEEMKRVEPVAPPPPALKSSIKFTAPVIKKDEEVHED DEIKSQEELTQTKVAISIADVKGNDEANGKDIADLKQVVTQAEPAEEQVFDMVEQMPTFP GGTTELMKYIGEHLKYPPIAAENGTQGKVICRFVIGKDGQVRDVTIARSLDPYCDKEAIR VIKSMPKWIPGKQNGKAVAVNFTVPIVFKLQ >gi|226331986|gb|ACIB01000070.1| GENE 38 50715 - 51365 620 216 aa, chain - ## HITS:1 COG:no KEGG:BF4132 NR:ns ## KEGG: BF4132 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 216 1 216 216 357 100.0 1e-97 MSAEVQESGKKKGNSKQKKMTVRVDFTPMVDMNMLLITFFMLCTSLSKPQTMEISMPSND KNITEEQQSKVKASQAITLLLGPDDKLYYYEGEPNYKDYTSLKETTYKPDGLRGILLKKN ATAVRQVNDLKQKKLELKISEDEFTKQLSEIKSGKNTPTVIIKAMDNASYKNLIDALDEM QICNIGKYVITNIAEADEFLVKNFESKGELSQNIAD >gi|226331986|gb|ACIB01000070.1| GENE 39 51378 - 51983 520 201 aa, chain - ## HITS:1 COG:no KEGG:BF4133 NR:ns ## KEGG: BF4133 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 201 1 201 201 357 99.0 1e-97 MGRAKIKKKSTFIDMTAMSDVTVLLLTFFMLTSTFVKKEPVQVTTPASVSEIKIPEKNIL QILVDPNGKIFMSMDKQSDLKAVLESMGQEYGVTFTPEQEKKFMLASTFGVPMKNMKTYL DLPTDKQDAVLKNEGIPCDSLDNQFKSWVRNARAVNADLRIAIKADADTPYSVIKNVMNS LQDLRENRYNLITSLKTTSEN >gi|226331986|gb|ACIB01000070.1| GENE 40 52011 - 52823 753 270 aa, chain - ## HITS:1 COG:FN1312 KEGG:ns NR:ns ## COG: FN1312 COG0811 # Protein_GI_number: 19704647 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport proteins # Organism: Fusobacterium nucleatum # 63 265 8 200 202 84 28.0 2e-16 METTQKKSTKIVGIKNAGLVIICCFIIAVCIYHFILGNPTNFMNNDPNNHPLPGNFMGTI YKGGVIVPVIQTLLLTVLALSIERYFALRSAFGRGSLVKFVSNIKEALSVGDLRKAQEIC DKQRGSVANVVTSTLRKYEEMENESSLSKDQKLLAIQKELEEATALELPMMEQNLPIIGT ITTLGTLMGLLGTVIGMIRSFAALAAGGSADSMALSQGISEALINTAFGILTGALAVISY 
NYYTNKIDKLTYSLDEVGFSIVQTFAATHK >gi|226331986|gb|ACIB01000070.1| GENE 41 52963 - 53406 270 147 aa, chain - ## HITS:1 COG:no KEGG:BF3948 NR:ns ## KEGG: BF3948 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 147 1 147 147 265 100.0 5e-70 MRTSVKFILMFVLLMLSFSISGNALNITECSHKSVNSCQVSASSLDTNYSYRYGSDITGF DKSQTLSVSDVELGFKPVSETYSSNNLRLRRILEDSDLFKDTMRKWCLVRENLLVLDQSK SYYSDKDPHYASISCHYYIFALRRILI Prediction of potential genes in microbial genomes Time: Wed May 18 00:41:14 2011 Seq name: gi|226331985|gb|ACIB01000071.1| Bacteroides sp. 3_2_5 cont1.71, whole genome shotgun sequence Length of sequence - 7887 bp Number of predicted genes - 6, with homology - 5 Number of transcription units - 3, operones - 2 average op.length - 2.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 1 - 508 182 ## COG3537 Putative alpha-1,2-mannosidase 2 1 Op 2 . - CDS 531 - 2162 1421 ## BF1753 hypothetical protein 3 1 Op 3 . - CDS 2174 - 5320 2555 ## BF1752 hypothetical protein - Prom 5346 - 5405 7.7 - Term 5344 - 5389 3.4 4 2 Tu 1 . - CDS 5469 - 6326 281 ## PROTEIN SUPPORTED gi|163762640|ref|ZP_02169704.1| ribosomal protein L33 - Prom 6405 - 6464 5.0 5 3 Op 1 . - CDS 6491 - 7528 593 ## BF1750 1-phosphatidylinositol phosphodiesterase precursor - Prom 7548 - 7607 3.9 6 3 Op 2 . 
- CDS 7610 - 7864 66 ## Predicted protein(s) >gi|226331985|gb|ACIB01000071.1| GENE 1 1 - 508 182 169 aa, chain - ## HITS:1 COG:XF0842 KEGG:ns NR:ns ## COG: XF0842 COG3537 # Protein_GI_number: 15837444 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Xylella fastidiosa 9a5c # 29 169 49 194 790 148 52.0 4e-36 MKQNTFIYAAIAFFVCSSCTSGKYSPVDYVDPFIGTGFHGHTYPGATVPFGAVQLSPDTR AGNWDACAGYHYDDTTLKGFSHTHLSGTGCIDLGDILFRPTTLKPDLTAESICRPANFSH KDERASAGYYSVILKDEGIKAELTATTHTGMHRYTFPSGKPVTIIVDLA >gi|226331985|gb|ACIB01000071.1| GENE 2 531 - 2162 1421 543 aa, chain - ## HITS:1 COG:no KEGG:BF1753 NR:ns ## KEGG: BF1753 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 543 1 543 543 1072 99.0 0 MKKYIPLLALSALTFCSCSGFLNVQPEGNPTTTSYFLNDEQAIDAIDGLYAPIHQEKGFG RELFWEQGAACDIVWAKSRGFNSLATFNYNGDESPISGGFDLFYQNMARSNWIIKQLLAK EKKGGLSDVEHRSLGEAFFMRGMAHFWIAYRYGTKAQGVPFVRYEDFEGDYDNSIPPQQA SVIDNYKFIIEDMDNAISYLPKFEEYSDADKGRAHKAAAVAYKAKVYAYWATWDETQWNN VIAMVNSLETDYGRGLADTFAEVFSSEFTDFWNKEYIWSIPSNGGSTGGGVEFPGVILED KAWGVYNGWGHIKPSYDIYEEMAKDGAGNDRLVRSILEYNQEFEFFGEKRKFYTDTNLDV GFQINKYMDPFKHKDADTKGYVNTNGNWPTARVNFPLIRFAEMLLFRAEAYLMTDQPGKA KEDLNRIRRRSNLKELIDMPTMADLYHERRCELAFEYTDHLFDLKRWHRSSNVVIKELAA KELNAHPRIRKYADRSNPESTFTIEPYADYLNKTPYQDYMMVFPYPAEQITKSNGKLIQN DGY >gi|226331985|gb|ACIB01000071.1| GENE 3 2174 - 5320 2555 1048 aa, chain - ## HITS:1 COG:no KEGG:BF1752 NR:ns ## KEGG: BF1752 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1048 1 1048 1048 2023 99.0 0 MKQVNLRIYQTILTLLVGLFLSAGAYAQQISVRGIVKDLMGEPVIGANVLVKGTSNGVIT DIDGKFALSAAKNDILIISFVGFMSQEIPVTGKDLMVTLKEDTGLLDEVVVLGYGANARK QDLSAAVGVLSNTDDLTVRPVSSTESLLQGQLAGVTVQSNGGDPTSTPSIVIRGQGSQNG DNVLWVVDGVPGAPIASMSDIESIVVLKDAASAAIYGAQSGAGGVILVTTKKAKAGIPTL SYEGTYGIRQATNLPEPLNAEEELEMRKRSYANANVTLPDGWNIEKNPWIGTTRTDWMDE IFRTAFYQRHNIALNVGTDNYSSRLSFSFDNDEGVLINTYNKNYAIRYNGKFDLNKWVSI 
SEDLVWKNTENRSKDTNDAYTGPVLSAIYMPASATVYNPLDGTWGGTTTEDPEYIAKYGS NFAGAHGDAVNPVRLLRAENRFNRTSDVWSTTSLQIANIIQGLKFTSRFTYNLKTNNYKN FRPIQDEPGKPNNSNSLDVTNYRTDAWKTENTLTYDNSFGNHTVGALFSTTADHYNVRGL KVNGKNFADESPYLQYLAYAGTTSATDYLTGPDANVSLVARLAYSYDDRYFVTASWRRDY AGRLPKENNFGDFPAATLAWKISNERFFKKSDFIGMLKLRASWGRVGNLGSIDYNYKSLL LGTSYWQEQAQYGVINNATWNNFVYNSSAMNRNLTWETSEQWDLGLDVELFKKRLALSFD YFDKRTFNLIQKQTMNWPSSIGLDPLLINQGEIRNRGIEIQANWNDRVNKDFSYFVSGNF SYLKNWVSDIGVKNADGSPGVWTDSDSKFRNIPYTRQTAEGEPLNSYYLIKTDGIFQSDA EAAAYVDKNGKRIQPNAVAGDLKFIDYNNDGKIDDKDRQYCGSATPKTTYSFSFGATYKK FSFSAMFQGVGGAQAFYAAKSVILSDADGNFNRVKDILNAWSPTNTSSNIPRLSMNDPNS NFSTASDWYLESASYLRLKNLTLSYDLTDVLQKWSHLRERNSRMSVYFSGENLFTITDYS GMDPECGGWDAMKYPVSRVFSFGVKLTY >gi|226331985|gb|ACIB01000071.1| GENE 4 5469 - 6326 281 285 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163762640|ref|ZP_02169704.1| ribosomal protein L33 [Bacillus selenitireducens MLS10] # 5 272 8 312 323 112 26 7e-25 MKLSIDLGGTNVRIAQVENGICLNKMSVPCLAQQDASAVLDQLFQLITGMMNVQVDGIGI GVPSIVDVEKGIVYNVANISSWKKIHLKDILEKRFMVPVAINNDSNCFTLGESMFGEGKP YAHMVGVTIGTGIGAGVIINHRLYCGQYMGAGEIGSLPYLDSDFEHYCSSSFFKRHDTTG VVVAEKAERGDGAALEIWREFGTHLGNLMKVILFSYAPQAIILGGSIVSAFHFFKDTMKD AMQDFPYKILLDNVKIITSYLKDASLLGASALFEKQYLPISIINN >gi|226331985|gb|ACIB01000071.1| GENE 5 6491 - 7528 593 345 aa, chain - ## HITS:1 COG:no KEGG:BF1750 NR:ns ## KEGG: BF1750 # Name: not_defined # Def: 1-phosphatidylinositol phosphodiesterase precursor # Organism: B.fragilis # Pathway: Inositol phosphate metabolism [PATH:bfr00562] # 1 345 1 345 345 731 100.0 0 MGRKYLQSFYVLALIISALFFPAFSGDNNIKTAGANMSYGNRLVMQTGHKSANLVSGVEK VEWMKVLQDTLPVCKISIPGTHDSGSTKGGCMLKTQTADIPAQLQKGIRAFDIRLKEKNG KLGVFHSHAFQDIYWEEDVLLAFISFLQAHPSETLIVSLKKEGGEIKDYASLLSASLNTP AYQRYFIADFHPELTLKSCRGKILFLHRDHAMDNYPGAACIGWDDNTTCLLTLRNKDGKE ATVFLQDEYQYNSGKEAGKKIAACIRNFDRICEEQTFSRRWGISFVSATGLHSGIPLVFA NKVNKPVADYLKENRKRNCGIVFIDFIESSGGQKLVEYLIGGNIY >gi|226331985|gb|ACIB01000071.1| GENE 6 7610 - 7864 66 84 aa, chain - ## HITS:0 COG:no KEGG:no NR:no 
MEGHLVLKVRWFVPYWQHSLHYSLSGGSNIHHRGCGGIKVIAWILFFFRIWKYEMTIVKF QLSVLYGLASANDILKELLHICAK Prediction of potential genes in microbial genomes Time: Wed May 18 00:41:52 2011 Seq name: gi|226331984|gb|ACIB01000072.1| Bacteroides sp. 3_2_5 cont1.72, whole genome shotgun sequence Length of sequence - 38005 bp Number of predicted genes - 22, with homology - 22 Number of transcription units - 7, operones - 5 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 2 - 392 298 ## BF1749 putative metal-dependent membrane protease - Prom 492 - 551 6.4 + Prom 390 - 449 4.0 2 2 Tu 1 . + CDS 497 - 1501 783 ## BF1748 hypothetical protein 3 3 Op 1 . - CDS 1687 - 2754 893 ## BF1823 putative transmembrane protein 4 3 Op 2 . - CDS 2774 - 3196 178 ## BF1822 putative transmembrane protein 5 3 Op 3 1/0.000 - CDS 3193 - 3756 441 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog - Prom 3816 - 3875 4.9 - Term 3838 - 3899 11.0 6 3 Op 4 . - CDS 3919 - 5532 1874 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains - Prom 5557 - 5616 5.3 7 4 Op 1 5/0.000 + CDS 5768 - 6355 759 ## COG0576 Molecular chaperone GrpE (heat shock protein) 8 4 Op 2 . + CDS 6407 - 7591 1193 ## COG0484 DnaJ-class molecular chaperone with C-terminal Zn finger domain + Term 7643 - 7693 -1.0 - Term 7632 - 7680 -0.7 9 5 Op 1 . - CDS 7792 - 9423 1320 ## BF1740 putative outer membrane protein involved in nutrient binding 10 5 Op 2 . - CDS 9438 - 12182 2494 ## BF1816 putative outer membrane protein - Prom 12206 - 12265 4.0 - Term 12208 - 12263 12.1 11 6 Op 1 . - CDS 12279 - 14780 2457 ## COG3250 Beta-galactosidase/beta-glucuronidase 12 6 Op 2 . - CDS 14794 - 17043 2045 ## COG3537 Putative alpha-1,2-mannosidase 13 6 Op 3 . - CDS 17104 - 19176 1361 ## COG3525 N-acetyl-beta-hexosaminidase 14 6 Op 4 . 
- CDS 19244 - 20458 850 ## COG3055 Uncharacterized protein conserved in bacteria 15 6 Op 5 1/0.000 - CDS 20469 - 22793 2231 ## COG3525 N-acetyl-beta-hexosaminidase 16 6 Op 6 . - CDS 22813 - 25383 1874 ## COG3250 Beta-galactosidase/beta-glucuronidase 17 6 Op 7 . - CDS 25405 - 27477 1483 ## BF1809 sialate O-acetylesterase 18 6 Op 8 . - CDS 27474 - 28136 477 ## COG2755 Lysophospholipase L1 and related esterases 19 6 Op 9 . - CDS 28157 - 30169 1530 ## COG3525 N-acetyl-beta-hexosaminidase 20 6 Op 10 . - CDS 30201 - 31835 1262 ## COG4409 Neuraminidase (sialidase) - Prom 31912 - 31971 6.8 - Term 32039 - 32083 12.4 21 7 Op 1 . - CDS 32119 - 34134 1620 ## BF1727 hypothetical protein 22 7 Op 2 . - CDS 34146 - 37316 2619 ## BF1726 hypothetical protein - Prom 37340 - 37399 7.5 Predicted protein(s) >gi|226331984|gb|ACIB01000072.1| GENE 1 2 - 392 298 130 aa, chain - ## HITS:1 COG:no KEGG:BF1749 NR:ns ## KEGG: BF1749 # Name: not_defined # Def: putative metal-dependent membrane protease # Organism: B.fragilis # Pathway: not_defined # 1 130 1 130 289 223 100.0 2e-57 MEQQDHESFFSPKGIPAFASIIIFLVSFFIVMSLFHSVLNLFSVVRGYGMGYFFIGEGIM LLSVFIVTFLMMRFLDRRPFSDLGFSLKGRGKDILYGFLMAVLIYAIGFGVCLLTGQIEV VGVHLHWSDL >gi|226331984|gb|ACIB01000072.1| GENE 2 497 - 1501 783 334 aa, chain + ## HITS:1 COG:no KEGG:BF1748 NR:ns ## KEGG: BF1748 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 275 1 279 348 551 97.0 1e-155 MKIKHLFIGIALTANLFAATAQEVKKTYFVSKPGTLISMMTEEEANQVTHLTLTGKINAV DFRHLRDEFKNLQVLDIANASISMYSGKEGTYPDKFYIYMPNFVPAYAFCKMENGTAKGK STLKKIILSEKIKNIEDAAFMGCENLNICQIKKKTPPNLLPEALADSITAIFVPLGASDE YRLKNRWDNFAFIEGEPLEAKIEVGALSTLENEIQKAGLQPKEINFLTIEGKLDAADFKL IRDYMPNLVAVDIEKTNATAIPDFTFSQKKISAPHPSATWTKKHRTTGFQQLRPSLWNGR TTGKHYCYRIWSFHGMRQVAVRSCSWRQNHNDRR >gi|226331984|gb|ACIB01000072.1| GENE 3 1687 - 2754 893 355 aa, chain - ## HITS:1 COG:no KEGG:BF1823 NR:ns ## KEGG: BF1823 # Name: not_defined # Def: putative transmembrane protein # Organism: 
B.fragilis_NCTC9343 # Pathway: not_defined # 1 355 3 357 357 694 99.0 0 MKTNKLLSILLLAVSMVSCTTYYQVKTRIHPDGSAHREVYAFADSAFMAGDPMKNPFMFS LDSGWVVTRFDSVRTHNYFGEEGKINVCAGREEPSVSMFAEQVHPKDPMYRPLVTPQETL TKHFRWFYTYYTYTGIYPELADKGPVPLKNYLNESEQKLWFQGDDTAYRGMNGLEMKELL DRLEKKFYDWYNRSLYELSFEVIRPFIAEIDRGKYMSRLDEVKDSLYLGYQPKDDDPDPD PELICQLLDTHYHTDCFSLLYKEKQQEVDKRFDEETRPIELFGAVIQYELKMPGQMISAN TTFRDREYLVWKVDAYRLLAGEYSLTARSRVPNVWAFILTGVLILLGIGFWIKKR >gi|226331984|gb|ACIB01000072.1| GENE 4 2774 - 3196 178 140 aa, chain - ## HITS:1 COG:no KEGG:BF1822 NR:ns ## KEGG: BF1822 # Name: not_defined # Def: putative transmembrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 140 1 140 140 238 100.0 5e-62 MKKEEMNYEKWLEQLKKTPPVLENPDALTEDIMRAVKAAPFRNRPVRRLNFAAWCSAIAA TLLLGLWVAEAVAVDPILSSEVTRIPKPYQEQTLTDERSYERLMTGEKREIFFSASRSRK KEMLKRERLYTRYEQIMKNE >gi|226331984|gb|ACIB01000072.1| GENE 5 3193 - 3756 441 187 aa, chain - ## HITS:1 COG:BH0263 KEGG:ns NR:ns ## COG: BH0263 COG1595 # Protein_GI_number: 15612826 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Bacillus halodurans # 9 183 8 187 187 95 30.0 7e-20 MEQNEIQVLVEKSRRQDASAFALLVAEYQTFVFRLAFRLLCDEEEARDMVQETFLRVWLS LDKYRPEFRFSTWLYRVACNICYDRLRALQHSPAGALSDITFAELPVCSDDNIEATLVNR ELKAHILYFMQQLTPKQKLVFTLRDIEELEIKEIEKITGFTSVQIKANLYLARKSIRKKL NEINKER >gi|226331984|gb|ACIB01000072.1| GENE 6 3919 - 5532 1874 537 aa, chain - ## HITS:1 COG:BS_ykpA KEGG:ns NR:ns ## COG: BS_ykpA COG0488 # Protein_GI_number: 16078507 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Bacillus subtilis # 1 536 1 538 540 670 58.0 0 MITVSNVSVQFGKRVLFNDVNLKFTSGNCYGIIGANGAGKSTFLRTIYGDLDPTTGNITL GPGERLSVLSQDHFKWDAFTVMDTVMMGHTVLWDIMKQREALYAKEDFTDEDGLKVSELE EKFAELDGWNAESDAAMLLSGLGVKEDKHYTLMGELSGKEKVRVMLAQALYGNPDNLLLD EPTNDLDMETVTWLEEYLSNFEHTVLVVSHDRHFLDSVCTHTVDIDYGKINLFAGNYSFW 
YESSQLALRQQQNQKAKAEEKKKELEEFIRRFSANVAKSKQTTSRKKMLEKLNVEEIKPS SRKYPGIIFTPEREPGNQILEVSGLSKKTEDGVVLFNDVNFNVEKGDKIVFISRNPRAMT AFFEIINGHMKPDAGHFNWGVTITTAYLPLDNTEYFNTDLNLVDWLSQYGEGNEVYMKGF LGRMLFSGEEVLKKVSVLSGGEKMRCMIARMQLRNANCLILDTPTNHLDLESIQAFNNNL KTYKGNILFSSHDHEFIQTVANRVIELTPNGIIDKMMEYDEYITSDHIKELRAKMYK >gi|226331984|gb|ACIB01000072.1| GENE 7 5768 - 6355 759 195 aa, chain + ## HITS:1 COG:STM2681 KEGG:ns NR:ns ## COG: STM2681 COG0576 # Protein_GI_number: 16765996 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone GrpE (heat shock protein) # Organism: Salmonella typhimurium LT2 # 1 195 1 194 196 81 35.0 9e-16 MDPKEKKTKQEEELKVDDIQDTVEGQSQNEEATEATEPLTAEEKLEKELKEALAQIEDQK DKYLRLSAEFDNYRKRTVKEKAELILNGGEKSIKSILPVIDDMERALTTMETATDVNAVK EGVELIYNKFLSILSQDGVKVIETKDQPLDTDYHEAIAVIPAPTEEQKGKILDCVQTGYT LNGKVIRHAKVVVGE >gi|226331984|gb|ACIB01000072.1| GENE 8 6407 - 7591 1193 394 aa, chain + ## HITS:1 COG:YPO0469 KEGG:ns NR:ns ## COG: YPO0469 COG0484 # Protein_GI_number: 16120798 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: DnaJ-class molecular chaperone with C-terminal Zn finger domain # Organism: Yersinia pestis # 4 372 3 350 379 284 45.0 2e-76 MAEKRDYYEVLEVTKESTVEEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSN PDKRARYDQFGHAGMSGAAGNGGPFGGFSGGMSMDDIFSMFGDIFGGHSGGGFGGGFGGF GGFGGGGSQQRKFRGSDLRVKVKLNLKEISTGVEKKFKLKKYVPCSHCHGTGAEGNSGSE TCPTCKGSGSVIRNQQTILGTMQTRTTCPTCNGEGKIIKDKCKVCGGEGIEYGEEVVTVK IPAGVAEGMQLSMGGKGNAGKHNGIPGDLLILVEEEPHPELIRDENDLVYNLLLSFPTAA IGGAVEIPTIDGKVKVKIEAGTQPGKVLRLRGKGLPSVNGYGTGDLLVNVSVYVPETLSK EEKSTLEKLEESKNFKPSTSIKEKIFKKFRSLFD >gi|226331984|gb|ACIB01000072.1| GENE 9 7792 - 9423 1320 543 aa, chain - ## HITS:1 COG:no KEGG:BF1740 NR:ns ## KEGG: BF1740 # Name: not_defined # Def: putative outer membrane protein involved in nutrient binding # Organism: B.fragilis # Pathway: not_defined # 1 543 1 543 543 1104 99.0 0 MKSRYICMFACLWGLGFLFSCDGFLNENPKDKIPEEDAYKNLTDLYYNAVASLYNNIGGY 
SDSQGLQGTGRGIYDLNTFTTDEAIMPTRGGDWYDGGFWQGLFLHRWGVDNDAIQATWEY LYKVVSLCNQSLERIDTYQQTHNDEELPVYRAEVRAFRALYYYHLMDLFARVPLVLSSST PLKEVKQNSRKEVFDFVVKELQESEPLLEMAYSNRSGNYYGRITRPVACFLLAKLALNAE IYTDNNWTDGQFPDGKDIYFEVDGERLNAWQTVEAYCDAITAMGYRLEDNYEANFAVYNE SSVENIFTIPMSKTLYTNQMQYLFRSHHYNHAKAYGLGGENGSSATVEVLRTFGYDTEAV DPRFDKCYFAGIVYDLKGKVVTLDDGTQLEYFPWKVDVDISNTPYEKTAGARMKKYAIDE TATKDGKLMENDIVLYRYADVLLMKSEAKVRNGENGDEELNLVRSRVNAPFRTATLANLL AERQLEFAWEGWRRQDLIRFRQYTRTYTGRPQLSGEKNGYTTVFPIPEKVRLMNPNLTQN PGY >gi|226331984|gb|ACIB01000072.1| GENE 10 9438 - 12182 2494 914 aa, chain - ## HITS:1 COG:no KEGG:BF1816 NR:ns ## KEGG: BF1816 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 914 1 914 914 1819 99.0 0 MKQHRIICIPTLTFALLLMFTPSILQAQDKPVFPIDSLITVGYATGNKKNISGSVEKITE LGMNKDQITNPLEAIRGRVPGLTIQKGTNGPAALDAVRLRGTTSLTSGNDPLIIVDGVFG DLSMLTSIYPTDIESFTILKDASETAQYGSRGASGVIEITTKKGIRGKTRVSYNGSFGIS SVYKNLEMLSGNEFRNLARQQGVAILDKGNETDFLKEIEQTGLQQNHHVAFYGGTDASSY RVSLGFMDRQGIILNEDMKNFTSNMNMTQHVFDDFLVCELGMFGSIQKNHNLFDLQKTFY SAATFNPTYPNHRNPETDSWDQITNASQITNPLAWMEVKDHDATSHISTHARLTFNLFRE LKLILFGSYTYNIVENSQYLPTAVWAHGQAYKGTRKMESLLGNLMLTYKKKWSKHYFDVL ALAELQKETYSGFYTTVTNFNTDQFGYNNLQAGAIRLWEGTNSYFEQPRLASFLGRFNYT FADRYIFTVNARTDASSKFGANHKWGFFPSVSGAWVVSEEKFMKRIPLVDNLKLRIGYGL AGNQSGIDSYTTLNLVKPNGVVPVGSSQVVTLGNMRNTNPDLKWEVKRTFNAGVDLGMFG NRLLFSLNYYNSKTSDMLYLYNVSVPPFTYNTLLANIGSMRNSGTEIAVGLTPLKTQDME LNVNVNVTFQQNKLLSLSGMYGGENISAPEYKSLASLDGAGFHGGYNHIVYQMVGQPLGV FYLPHCKGLIADGNGGYTYDIADLNGGGVSLEDGEDRYVAGQAVPKTLLGSNISFRYKRF DLSVQVNGAFGHKIYNGTSLTYMNMNIFPDYNVMKKAPGRNIKDQTATDYWLEKGDYINF DYVTVGWNVPVTKWKKYVQSFRLAFTVNNLATITGYSGLSPMINSSTVNSTLGVDDKRAY PLARTYTLGLSINF >gi|226331984|gb|ACIB01000072.1| GENE 11 12279 - 14780 2457 833 aa, chain - ## HITS:1 COG:SP0648_2 KEGG:ns NR:ns ## COG: SP0648_2 COG3250 # Protein_GI_number: 15900551 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Streptococcus 
pneumoniae TIGR4 # 29 818 59 871 871 430 33.0 1e-120 MNKKIKIAFASMLAVPLLACAQVRTEQTFEKGWKFTREDSKDFSNSTYDDAKWQSVTVPH DWAIYGPFSINNDKQNVAISQDGQKEAMEHAGRTGGLPFVGVGWYRLNFDAPSFSKGKKA TLVFDGAMSHAHVYINGQEAGYWPYGYNSFYVDATPYLKPGEKNTLAVRLENENESSRWY PGAGLYRNVHLVVNEDAHIPTWGTQLTTPVVKDEFAKVNLKTKLDVPAGKAFEGYRIVTE LKDKDGKVVAANEKKGGPFDDNVFEQDFVVTSPALWTPDTPHLYSAVSKVYEGNTLKDEY TTSFGIRSIEIIPNKGFFLNGKKTMFKGVCNHHDLGPLGGIANDAGIRRQIRILKDMGCN AIRTSHNMPAPELIKACDEMGMMIMAESFDEWKAAKVQNGYHKVFDEWVEKDLVNLIHQY RNNPSVVMWCIGNEVPDQWNGDRGPKLSRFLQDICHREDPTRPVTQGMDAPDAVVNNNMA AVMDVAGFNYRPHKYQENYKKLPQQIILGSETASTVSSRGVYKFPVVRRAMQKYDDHQSS SYDVEHCGWSNLPEDDFIQHEDLPYCIGEFVWTGFDYLGEPTPYYTDWPSHSSLFGIIDL AGLPKDRYYLYRSHWNKDKETLHILPHWNWEGREGEVTPVFVYTNYPSAELFINGKSQGK RTKDLSVTVNNSGDSTSVANFKRQQRYRLMWMDTKYEPGTVKVVAYDKDGKAVAEKEIHT AGKPDHIELVADRSVIDANGKDLSFVTVKVVDKEGNLCPLADNEISFKVKGAGTYRAGAN GNPASLESFQTPKMKVFSGMMTAIVQSTEKAGKITLEATGKGLKKGTLLIESK >gi|226331984|gb|ACIB01000072.1| GENE 12 14794 - 17043 2045 749 aa, chain - ## HITS:1 COG:CC0533 KEGG:ns NR:ns ## COG: CC0533 COG3537 # Protein_GI_number: 16124788 # Func_class: G Carbohydrate transport and metabolism # Function: Putative alpha-1,2-mannosidase # Organism: Caulobacter vibrioides # 44 748 57 752 770 350 33.0 5e-96 MKKLPYFVSFAAAWLFIAVASAQENPVDYVNPFVGTTNYGTTNPGAICPQGMMSVVPFNV MGDKSVGNKIDKDSQWWSTPYEHTNTYFTGFSHVNLSGVGCPELGSLLLMPTTGKLNVDY LQYGSAYKDEKATPGYYSNVLTKYGIKNEVSATLRTGISRFTFPKGESNILLNLGEGLTN ETGATVRFVSDTEIEGSKLLGTFCYNPQAVFPIYFVMRINKTPRARGYWKKMRPMTVEAQ WDNTSGKNKLYTAYTKEMSGDDIGTWFTFDTDEGETIEVSMGVSFVSIANARLNLEKEQA GKGFDQIRAEARAMWQNDLSRILVEGGTEEQKRVFYTAMYHLLIHPNILQDVNGQYPAME GSEILTTKGNRYTVFSLWDTYRNVHQLMTLLFPDRQLDMVRTMVDMYKEHGWLPKWELYG RETLTMEGDPSIPVIVDTWMKGLRDFDVQAAYEAMYKSATTPGKDNLMRPDIDDYIAKGY VPLTEQYDNSVSHALEYYIADYALSRFAQALGKKEDAKLFYDRSLGYKHYYCKEFGTLRP ILPDGTFYSPFDPMEGANFAPSPGFHEGNSWNYTFYVPHDIAGLTRLMGGKKKFVDKLQK VFDEGLYDPANEPDIAYPYLFSYFKGEEWRTQKQVKRLLAEYFTDRPDGIPGNDDTGTMS AWAIFSMIGLYPDCPGMPDYTLTTPTFDKVTIQLDPKYYKEKELVITAVRPGDNADLIKE VKLNGKKHAGYRISHEELVHAGKIEFILK 
>gi|226331984|gb|ACIB01000072.1| GENE 13 17104 - 19176 1361 690 aa, chain - ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 28 521 30 518 757 381 41.0 1e-105 MIKPTMRICTFLLAAGLMIATAGMKAQSVIPIPLRMEQGSGTFQFSGETLLYTNLKGKEK KMMMDYLETLPIHFKSSKKQAKENVVSLLITKDNSQLPSPESYTLEVTPRKITVQATSGA GLFYGVQTLLQMAQPAMGDTWSVQATTIQDSPRFEYRGLMLDVSRHFRSKEFVKKQIDAL AYYKLNRLHLHLTDAAGWRIEIKKYPLLTEFAAWRPEANWKKWWNEGGRKYCRFDAPGAS GGYYTQDDIRELVNYARERHVTIIPEIEMPAHSEEVLTAYPELSCSGEPYKDADFCVGNE KTFTFLEDVLTEVMELFPSQYIHVGGDEAGKVAWKTCPKCQKRMQDEHLANVDELQSYLI HRVEVFLNAHGRKLLGWDEILQGGLAPNATVMSWRGEQGGIDAVKSGHQAIMTPGSHCYI DGYQDAPYSQPEAIGGYLPLEKVYSYNPIPASLTPDEAKLIYGVQANLWAEYIQTDEHCE YMIYPRILALAEVAWSAPERKSWTDFRPRALKAVAYLQSKGYHPFDLKKEIGNRPEASKP VQHLALGKTVKYNAPYNASYPAQGDKTLTDGIRGSWTYSDGVWQGFISRDRLDVTIDMGE STVLHSIGADFMQVVGPEVFLPTEVIISVSEDGKNFTELSRQTHKVVKSDAVVFKNYAWN GEAKGRYIRYQARAGEEFGGWVFTDEIVVE >gi|226331984|gb|ACIB01000072.1| GENE 14 19244 - 20458 850 404 aa, chain - ## HITS:1 COG:FN1470 KEGG:ns NR:ns ## COG: FN1470 COG3055 # Protein_GI_number: 19704802 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 47 403 32 369 372 103 26.0 6e-22 MTEIKTIRFLSIIGCLLAGMAASCSSRQVAPNEEGISVQKMTGFPEGEPGFSLGVSACYA GIYQGELLIAGGCNFPETPAAEGGKKKFYQGIYAADASADSVFVWRKVGQLPVAAAYGVS VSTPRGIVCVGGSNENGSLSAVYRLSLSDDKQAVIVDTLPSLPCTMDNMSGSVVDYTLFV AGGNVNGKPSNGLYCLNLGNPETGWQQLPDFPGAPRVQPVCVGQRKENETLLYLWGGFSG AFDGRSATLSTDGYCYSPSLQQWQPVSTPIGSDSVPVALGGGAGIALTDSLILCTGGVNK DIFLSALQREEMMKAAVTGGNQAAVDSLKSEAKTYMLHPAEWYRFNDRILIYNTRRDKWE EAVRSQDVARAGAALTGQGQTFFNINGELKPGIRTPEIAKIMID >gi|226331984|gb|ACIB01000072.1| GENE 15 20469 - 22793 2231 774 aa, chain - ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: 
N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 31 628 31 634 757 433 41.0 1e-121 MRNLFKIAGLLALTGFISSCNDKETTANYQVIPLPQEITTAQSQPFTLNGSVKIIYPEGN EKMQRNAQFLADYLKKATGKDYAVEAGTEGKGAILLKLGMESENPEAYQLSVNADGVTIA APTEAGVFYGIQTLRKSIPVAIGTTPSLPAVEISDYPRFSYRGAHFDVGRHFFTVDEVKT YIDMMALHNMNRLHWHLTEDQGWRLEIKKYPKLTEIGSKRSETVIGRNSGEYDGKPYGGF YTQEEAREIVAYAADRYITVIPEIDLPGHMQGALAAYPHLGCTGGPYEVWKIWGVSDQVL CAGNDSVLTFIDDVLTEVMDIFPSEYIHVGGDECPKTEWAKCPKCQARIKALGIKSDAKH SKEEYLQSFVINHAEKFLNEHGRQIIGWDEILEGGLAPNATVMSWRGEGGGIEAAKQKHD VIMTPNTYLYFDYYQTKDTENEPLAIGGYVPLERVYGYEPMPSSLTPEEQKHIIGVQANL WTEYIPTFSQAQYMVLPRWAALAEVQWSNPEKKNYENFLSRLPQLINIYDAEGYNYAKHV FDVKSEFVANSATGAVDVVMTTIDGAPIHYTLDGTEPTAASPVCDSILTIKESCTLKAVA VRPTGNSKMLTEQIAFSKSTSKPIKANQPVNKQYEFGGVSTLVDGLKGNGNYKTGRWIAF YKNDMDVTIDLQQPTEISSVAITTCVEKGDWVFDARSFSIEVSDDDKTFTKVASEAYPEM KETDRNGLYEHKLTFDPVKTRYVKVIATSEHSIPAWHGGKGNPGFLFVDEITLN >gi|226331984|gb|ACIB01000072.1| GENE 16 22813 - 25383 1874 856 aa, chain - ## HITS:1 COG:XF0846 KEGG:ns NR:ns ## COG: XF0846 COG3250 # Protein_GI_number: 15837448 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase/beta-glucuronidase # Organism: Xylella fastidiosa 9a5c # 43 835 59 860 891 557 38.0 1e-158 MMKKLLYGLVCCCSLAYGQNQDTSDVLMLNDDWSFSQVGTEKWLPATVPGTVHQDLIHHK LLPDPFYGTNEKKIQWVEDEDWEYKTCFVVTEEQLKRDAAQLFFEGLDTYADVYLNGSLV LKSDNMFVGYAVPVKQVLRKGENLLHVYFHSPIKQTLPQWSSNGFNYPADNDHHEKRLSV FTRKAPYSYGWDWGIRMVTSGIWRPVTLRFYDVATIADYHVKQLSLTDQVAKLSNELEIN SISEKEKSAEVLISYSLQGGKEVTVKENVTLKPGLNKIHIPLDIQNPVRWMPNGWGEPHL YDFSAQVICDGKTIASRQHRIGLRTIRVVNEKDKEGESFYFEVNGIPMFAKGANYIPDDA LLPCITTERYKTLFRDMKEANMNMVRIWGGGTYEDDRFYDLADENGILVWQDFMFACTAY PSDPTFLKRVEEEAEYNIKRLRNHASLAMWCGNNEILEGLKYWGWQKNYTPEVYENMFRG YDKLFRGLLPAKVQELDEGRFYKHSSPYFANWGRPESWGIGDSHNWGVWYGKKTFESLDT DLPRFMSEFGFQSFPEMKTIATFAAPEDYQIESEVMNGHQKSSIGNDLIRTYMERDYIVP EKFEDFVYIGLVLQGHGMRHGMEAHRRNRPYCMGTLYWQLNDSWPVVSWSSIDYYGNWKA LHYQAKRAFAPLLVNVIQEGDSLNIYLISDMLEKQSQLTLEMKVIDFNGKTLDKEVIKAV 
EVAMNTSSCIVRKPLDTWVNPEQRKSSFLLLSLKDKSGRKVAEEVYFFDKTKNLELPQTA ISMKVKQLDGKCELTLSSPKLAKDVFVQIPVQGARFTDNFFDLLPGENKKITITSPEIKK GESLNITVKHVRDTYN >gi|226331984|gb|ACIB01000072.1| GENE 17 25405 - 27477 1483 690 aa, chain - ## HITS:1 COG:no KEGG:BF1809 NR:ns ## KEGG: BF1809 # Name: estS # Def: sialate O-acetylesterase # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 690 1 690 690 1419 99.0 0 MKKILFLSLFLMVCTILSAQKRVKVACVGNSITYGYTLPNPATDSYPSQLQQLLGETYEV GNFGKSGATLLNKGHRPYMQQEEFKKAIAFAGDIVVIHLGINDTDPRDWPNYRDSFVKDY LALIDSFRVANPKCHIIIARLTPIADRHPRFESGTRDWHGEIQQSIETIAKYAGVQLMDF HEPLYPYPYLLPDAVHPNVEGAGILAKTVYSAITGDFGGLHLSELYTDNMVLQHGEPLAI RGKANAGEKVTVSIAKQKLTAKAASNGDWTVTIQPLKAGGPYTLTVSAGKQKQTFNNVLA GEVWLCSGQSNMEFYLSWSKTAKRDIPQAANDQIRLFDMKARWRTDAVEWDASVLDSLNH LQYYKDTEWTVCSPATAGSFSAVAYYFGKMLQDSLKVPVGLICNAIGGSPTEAWVDRSTL EYKFPAILRNWTQNDFIQDWVRGRAALNVKKAVNKQQRHPYEPCYLYEAGIRPLEQYPIK GIIWYQGESNAHNREAHEKLFKLLVESWRKNWENENLPFYYVQLSSINRPSWPWFRDSQR RMMYEIPHTGMAVSSDLGDSLDVHPKHKQPVGERLAHWALNQTYGKKNVTPSGPMFRNVE FRDGAAYVSFDCAEGMHASDGKPLQTFEVAETEDIYYPATAEIVGNQIKVYSKEVKNPLH VRYGWQPFTRANLVNGDGLPASTFRTDWGK >gi|226331984|gb|ACIB01000072.1| GENE 18 27474 - 28136 477 220 aa, chain - ## HITS:1 COG:all0976 KEGG:ns NR:ns ## COG: all0976 COG2755 # Protein_GI_number: 17228471 # Func_class: E Amino acid transport and metabolism # Function: Lysophospholipase L1 and related esterases # Organism: Nostoc sp. 
PCC 7120 # 68 216 92 243 249 104 35.0 2e-22 MKKILLMMLLLASMASNAQERKYSTFYYQRATLFEELPVTSKDIIFLGNSITNGGEWSEL LNNKHVKNRGISGDICMGVYDRLDAILKGKPAKIFLLIGINDVSRGTSADTIVNRIGMIT QKIKQDSPKTKLYLQSILPVTDHYKMFGGHTSRWQEVKKINEGLMYLADKENVTYIDLYS HFVDEKTGKMNIEYTNDGLHLLGKGYLKWVDIVKPYINKK >gi|226331984|gb|ACIB01000072.1| GENE 19 28157 - 30169 1530 670 aa, chain - ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 92 430 101 510 757 125 24.0 4e-28 MRNKQIFLISFIIYLCTISSVRAGEYHLLPEPQKFTPLGSSFVLGRTKLSTPVLRQEWEA FVVDRGGVIDDKARASIEVKLVPSLEEVPLNADEAYRLVVNNGKVTVEAVTEHGVYWAMQ TLAQLQDVQKKKATFNGCSIVDWPAFRIRGFMQDVGRTYMSIEELKREIAVLSRFKINVF HWHLTENQSWRLQSKIFPMLNDSVNTTRMPGKYYTLEEAKDLVAFCKAHNILLIPEFDMP GHSEAFVRTFRHDMQSPEGMKILKLLVDEVCETFDVPYLHIGTDEVRFTNPKFVPEMVEY VRAKGKKVISWNPGWHYKPGEIDMTHLWSYRGKAQKGIPAIDSKFHYLNHFDTFGDIIAL YNSRIYNVEQGSDDIAGTILAIWNDRYVANERNIILENNFYPNMLAIAERAWKGGGTEYF DKNGTILPSEGSPEFKEFADFENRMLWHKEHTFKGYPFAYVKQTNVKWNITDAFPNGGDL NKVFPPEQELKDSYLYEGKEYGVHPAIGAGIYLRHVWGKMVPTFYKDPQENHTAYAYTWV YSPKDQEVGLWAEFQNYGRSEMDLAPLQGKWDYKGSRIWINNEEIQPPVWTATHRTKSNE IALGNENCVARPPIAVHLNKGWNKVFLKLPVGKFNMPEVRLVKWMFTTVFVTPDGENAVD GLIYSPDKTK >gi|226331984|gb|ACIB01000072.1| GENE 20 30201 - 31835 1262 544 aa, chain - ## HITS:1 COG:STM0928 KEGG:ns NR:ns ## COG: STM0928 COG4409 # Protein_GI_number: 16764290 # Func_class: G Carbohydrate transport and metabolism # Function: Neuraminidase (sialidase) # Organism: Salmonella typhimurium LT2 # 203 528 66 393 412 113 32.0 8e-25 MKKAVILFSLFCFLCAIPVVQAADTIFVRETRIPILIERQDNVLFYLRLDAKESQTLNDV VLNLGEGVNLSEIQSIKLYYGGTEALQDSGKKRFAPVGYISSNTPGKTLAANPSYSIKKS EVTNPGNQVVLKGDQKLFPGINYFWISLQMKPGTSLTSKVTADIASITLDGKKALLDVVS ENGIEHRMGVGVRHAGDDNSAAFRIPGLVTTNKGTLLGVYDVRYNSSVDLQEHVDVGLSR STDGGKTWEKMRLPLAFGEFGGLPAGQNGVGDPSILVDTKTNNVWVVAAWTHGMGNQRAW WSSHPGMDMNHTAQLVLAKSTDDGKTWSAPINITEQVKDPSWYFLLQGPGRGITMSDGTL 
VFPTQFIDSTRVPNAGIMYSKDGGKNWKMHNYARTNTTEAQVAEVEPGVLMLNMRDNRGG SRAVAITKDLGKTWTEHESSRKALPESVCMASLISVKAKDNVLGKDLLIFSNPNTTKGRY NTTIKISLDGGVTWSPEHQLLLDEGNNWGYSCLSMIDKETIGILYESSVAHMTFQAVKLK DIIK >gi|226331984|gb|ACIB01000072.1| GENE 21 32119 - 34134 1620 671 aa, chain - ## HITS:1 COG:no KEGG:BF1727 NR:ns ## KEGG: BF1727 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 671 1 671 671 1376 99.0 0 MKLKIKNLFVQACFVAGGVMALTSCNDFLDREPLSSVTPEVYFQTVDHFAAYSIARYQNY FPSHGGYGAGIANNDGGTDNMVAGGRSSRYVKGLWKVPSSDGNWSFSNIRYCNYFFENAV PKYEAGEVAGADADIRHYIGEMHLMRALVYYNLLRTYGDFPIVTEVLPDDKTVLMEKGVR QPRNLVARFILKDLDDAANMMYSKGFKNNTRLNRETALLIKSRVALYEASFERYHKGTGR VPGDDNWPGKKVHSNFSLDVEAEVDFFLTEAMKAAEEVADKITLTPNTKVEDPASPSITS GWNPYFEMYANVDLSGYDEVLFWRQYAKTGSFSIMHGTPAWIASGSNHGLLKSYVESFLM QDGLPWYAASSAAPYKGDATLDDVKANRDNRLQLFMFGESNFVPVYSTEEPGTVKMFAPH PVSNREEMRDQTGYRIRKYASFDMAQNVWGKAESTTGCIIYRGVEAYLNYLEAYYMKNGN VTGKAAQYWRAVRERAGVDPDFTKTINATDLSQETDWGKYSGGQVVDATLLNIRRERRCE FIGEGMRWDDLVRWRSMDHLLTKNYIPEGCNFWDEMYKSANKDENGAEVTFKDSGEEGSN ISSRSFKYLRPYAILKTNNDVYDGYTWQKAHYLNPVPVREMELLSPDEKAETSVLYQNPY WSTKIGEVAEE >gi|226331984|gb|ACIB01000072.1| GENE 22 34146 - 37316 2619 1056 aa, chain - ## HITS:1 COG:no KEGG:BF1726 NR:ns ## KEGG: BF1726 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1056 18 1073 1073 2070 99.0 0 MKKVLFILLGCLLSFNVMAQVKAISGLVTDVTGEPVIGASVVEVGTTNGVITDLNGKFSL KVAPNSQFLVSYIGYKQQTIKVGSESTYNIVLKEDAEVLDEVVVVGYGSQKKVNVTGAVG MVGAEALEARPVANASQALQGVVPGLNLTVGNNGGALDGTLNMNIRGAGTIGDGSGSSPL VLIDGIEGDLNTVNPNDIESVSVLKDAASASIYGARASFGVILVTTKSGKSGKTNVSYSG SARFSDAIGVPDIMDSYTFAQYFNRASANKGGGDIFAPAVMERIKAYQEGTLKATTVDNG AGIWQKWANANGDTDWFEEFYDHWAPSQEHNLSINGGTDKTQYLISGSFLDQKGLMRHGK DKFQRYTLNGKITTAVTDWFKVTYSTKWTREDFERPSYLTGNFFHNLARKWPVHPAYDPN GFPMDEGEVEQMENGGKQNSQKDFYTNQLQLVFEPIKNWKINLDGSVRTTTQYQHWEVLP VYAYNVAGDPYYTVWDMGYGSYAAGSSRVNEYSWKENYYTTNIYSDYFKQFDNGHYFKVM AGFNAELYKTRNITAEKNTLITPGVPTINTATDDPQAYGGYADNSVAGFFARVNWSYKDR 
YMFEANGRYDGSSRFVGKERWGFFPSFSAGWNIAREPFMESFAEKINMGSLKLRASWGQL GNTNTNDAWYPFYQTMPVGSNYGWLVNGERPNYATNPGIVSSKKTWETVETWDVGLDWSF FNNRLSGSFDYFVRYTYDMIGPAPELSSLLGTSVPKINNSDMKSYGFELEVNWRDRIGEV SYGAKFVLSDDQQKILRYPNDSYDVGSYYKGEHLNDIWGLTTIGIAKSQEEMDAHLAKVD QSSVGTNWGVGDIMYADLDGDGKISNGTNKLGDTGDYRIIGNSTPRFKYGITLDAAWKGF DFSIFMQGIGKRDLWLDGCYFWGANGQGNEWQSTGFAEHWDFFRPEGDPLGANLNSYYPR VNFSGDRNTKVQTRYLQNGAYLRLKNVQLGYTLPRVWTEKAGISSVRVYVSGDNLATITS LSKIFDPEATGSLAGTGSGKLYPLQRVISVGVNVNF Prediction of potential genes in microbial genomes Time: Wed May 18 00:42:53 2011 Seq name: gi|226331983|gb|ACIB01000073.1| Bacteroides sp. 3_2_5 cont1.73, whole genome shotgun sequence Length of sequence - 5378 bp Number of predicted genes - 3, with homology - 2 Number of transcription units - 2, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 60 - 119 5.2 1 1 Tu 1 . + CDS 142 - 240 67 ## - Term 567 - 603 4.1 2 2 Op 1 . - CDS 653 - 2221 1074 ## BF1711 hypothetical protein 3 2 Op 2 . - CDS 2243 - 5377 1727 ## BF1710 hypothetical protein Predicted protein(s) >gi|226331983|gb|ACIB01000073.1| GENE 1 142 - 240 67 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKKEKGINPFHEIIKMLFLIECEKERQGKKQV >gi|226331983|gb|ACIB01000073.1| GENE 2 653 - 2221 1074 522 aa, chain - ## HITS:1 COG:no KEGG:BF1711 NR:ns ## KEGG: BF1711 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 522 1 522 522 1053 100.0 0 MKKITSILLFLSALLYVSCDALDLSPEDYYGSGNFWTKEAQVEGYMNGLHNNLRSSYTMF YVLGEARGGTSRYGTSSLGTSMSYSDPIKNNMLTKDNTGISNWYDLYGRIMQVNHFISEV SNGCSFLSESKKGFYLGQAYGLRALYYFMLYKTYGGVPLITDVKVLEGGKISADALYTER STPETILKFIKSDLEASELNFGNNVTIDRAMWTKYATLMLKAEVYMWSAKVTTGDHQATG NSDLAIAQTALQPLINQFSLLDNFSEVFSKKANDEIIFAIRFKDGEATNWAGPFIYYGNI FEGQRYGRDGKLMQDTLDLKGTVGQFLHEYKKALWDSYDDEDMRRDATFMDHYGSAQKEG FGIAMKKGIGSVNSNNQRIFDTDIIVYRYADVLLMMAEIENALSGKCANYVNEVRKRAYG KNWHPQFAYTDGSYADNELTILHERDKEFVWEGKRWFDVVRMHDANGKSLAFSVAANYPN NETPDERVPLIKESEAHKLLWPIDVNTLNNDPKLEQTPGYDK 
>gi|226331983|gb|ACIB01000073.1| GENE 3 2243 - 5377 1727 1044 aa, chain - ## HITS:1 COG:no KEGG:BF1710 NR:ns ## KEGG: BF1710 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1044 66 1109 1109 2006 99.0 0 GKIKVSYVGYQPQVLDVKGKNSFNIKLKEDSEMLDEVVVTGYGGKQLRTKVTNSIGKVKE DVLQKGLFSNPAQALSGAVSGVRVLQTSGDPGATPTIILRGGTDYNGTGSPLVLVDGQVR GSLSDINPEDIESMEVLKDAGATAIYGARANNGVILVTTKRGKEGKGEVSVKAKVGINYY NNPYEFMNAGDYIYWMRTAYQRSGQIYKDSKGNWVGTADMNSLNNATPYGTGNLYFDPST GAVLDGNKDVRAVWSTMKYTDDLAFLLKQGWQTMTDPVYGDQIIYKNTDPASFNLHTPSL SQDYNISISGGNDKGNYYAGIGYNNTDGTATGNWYKRLTFTFNADYKIKPWLTSSSSFNF ADATWYGLSPGSRGEVEYFNRMLSLPPTFRGYNADGEMLLGPNSSDGNQSFNLSKFKRDN NTDKFTMVQSFDIKLMKGLNLKLTANWYFDEAKYEAFNQDYLSSPNNMNTSRSTSAEFDR TLNQTYNAVLNYDYQITKDHYLAAMLGFEYYDAYQKGFNASGSGAPTDDFGDLQFTSNEE GKRNIDSWHSRQRIMSFFGRVNYDFQSKYLVSFVLRKDGYSKLAKDNRWGVFPGISAGWV FGKEKFMESLQQVVSFAKLRASYGLNGNVNKDWVGNYTVQGSYGSNKYNGNTGYLLGSLP IPYLQWERSQTFEVGMDLSFLENRINTNMTYYNRRTEDKYATIPLPSSSGVSGITSNNGK LQNQGLELEFGFKVLEKRDWKWNINLNAAYNINKILELPYNGLERNRQNAMEVYTGRKLD DGSYEKMWVGGYQEGQRPGDIYAYKAEGLYRSESEIPGNLIDKSTGNNSSNNKILYGPEA WAKLTDQEKSKGLPIQPGDVKWKDVNNDGVIDVYDQVKVGNTTPKWTGGFNTTVSWKDLT LSARFDYALGFTVIDWKTPWIMGNMQGTYNTISDTKNTWSPENPNAKYPTYTWADQLGKR NYARSSSMFTFNGNYLALRELSLAYRLPSQLIKKAGMNDVSFSITGQNLGYLTEAEHMHS PESSSNNGGYPLPRTIIFGVNVSF Prediction of potential genes in microbial genomes Time: Wed May 18 00:43:16 2011 Seq name: gi|226331982|gb|ACIB01000074.1| Bacteroides sp. 3_2_5 cont1.74, whole genome shotgun sequence Length of sequence - 8761 bp Number of predicted genes - 4, with homology - 4 Number of transcription units - 3, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 238 - 297 7.4 1 1 Tu 1 . + CDS 382 - 1704 505 ## COG1672 Predicted ATPase (AAA+ superfamily) + Term 1857 - 1908 -0.8 + Prom 2216 - 2275 6.8 2 2 Tu 1 . + CDS 2395 - 3645 720 ## BF1715 hypothetical protein - Term 3923 - 3967 9.5 3 3 Op 1 . 
- CDS 4001 - 5638 1122 ## BF1714 hypothetical protein 4 3 Op 2 . - CDS 5668 - 8760 2075 ## BF1713 hypothetical protein Predicted protein(s) >gi|226331982|gb|ACIB01000074.1| GENE 1 382 - 1704 505 440 aa, chain + ## HITS:1 COG:PAB1371 KEGG:ns NR:ns ## COG: PAB1371 COG1672 # Protein_GI_number: 14521702 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Pyrococcus abyssi # 3 402 10 407 472 184 32.0 4e-46 MKFYNRENELAELQRIQELSFEENSRLTVVTGRRRIGKTSLIMRAFEKTPTIYLFVGRKN EASLCREFITLVSQALDIYVPEEISTFKSLFRYIMEVATRQSFNLVIDEFQEFYNINKSI YSDIQDIWDQYRQKTHMNFVVSGSIYSLMEKIFHNEKEPLFGRADNIIKLSAFSLNVLKK IIKDYHPQYTNDDLLALYSFSGGVPKYVELFCDNRVLTVDGMIDFMVRDNSPFTDEGKNL LIEEFGKNYGTYFSILSAISGGYNTQTEIEALLGEKSLGGYLKRLIEDYNIVVRQRPVFS KEGSQTVRYGICDNFIHFWFNYFDRNRSLIEIKNFIGLRKLIKADYPTYSGKILEQYFKQ KYAESYEFRLIGSWWEPKGNQNEIDIVAIYLDNKSAIVAEVKRQKKNFKPELFQKKVEHL ENKVLAKYQINTVCLSLEDM >gi|226331982|gb|ACIB01000074.1| GENE 2 2395 - 3645 720 416 aa, chain + ## HITS:1 COG:no KEGG:BF1715 NR:ns ## KEGG: BF1715 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 416 1 416 416 855 98.0 0 MKNIFLIIGISLFFNGSLYAQSDDWSPKNHNLIKSVREDGRFSSSYGVVHAMLRNTEPRY AFHREFSPKEFRKWQKGLRHAMEEIMKFPQIKNSPAPVCIKREQREGYRLEKWEFYPLPE CVSTFLVLIPDNINKPVPAILCIPGSGGNKEGLAGEPGIAPKLNDRYKDPKLTQALNFVK EGYIAVAVDNPAAGEASDLERYTLGSNYDYDVVSRYLLELGWSYLGYASYLDMQVLNWMK TQKHIRKDRIVVSGFSLGTEPMMVLGTLDTSIYAFVYNDFLCQTQERAEVMTMPDKNGRR PFPNSIRHLIPDFWKNFNFPDIVAALAPRPIILTEGGLDRDLDLVRKAYAIAGTPDNVKI YHYKKFSDPDTRKNVEYLPEGLDRNEYFRMVNVDGPNHYFKSELVVPWLRKLLEER >gi|226331982|gb|ACIB01000074.1| GENE 3 4001 - 5638 1122 545 aa, chain - ## HITS:1 COG:no KEGG:BF1714 NR:ns ## KEGG: BF1714 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 538 1 537 538 790 73.0 0 MKRIKSTILYGLLAASSGVLCTSCADKLDLSPIDYYGSGSYWKTEAHVIGYMDGIHKHLR DAAFQHTFIWGEARGGALIPSGTSADGMGMLYGDIKLQNFDEDHTGGINKFGDIFGRLTN 
INLFIARVTDATYMDDTKKGYYLGQAYGLRAFYYFDLYRTYGGVPLRLTADVVEGVIDPN KLYMARATPKEVMDQIKKDLDKSMEYFGDNNSFDPNNRGNKKGYWSKAATECLMGEVYLW ISKVSTGDDSANESNLEIAKTHLQNVLSNYGLKMLDDFSSVFDAKNGKGNSEIIFAVRYA EGEATNNNNLFTYAMATGSTKDNYLANGEKFLDALNIANTGSQQLEYKHEIYNSFDVTDT RREATFIASYNKNVETNELTLRGTHVRKNIGYVNAQGSRIYCGDYIIYRLPLVYLMLAEI ENMQGGNVAQYINIVRKRAYGTNWNEAIYGYKNSDYTTNELAILHEKDKEFIQEGQRWWD IRRMTLTKGGKHLVFCKEGSIGTDTPTLDETTEAHKVLWPVDKTLLGNDPLIYQTPGYAT YKKQE >gi|226331982|gb|ACIB01000074.1| GENE 4 5668 - 8760 2075 1030 aa, chain - ## HITS:1 COG:no KEGG:BF1713 NR:ns ## KEGG: BF1713 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1030 66 1089 1089 1655 80.0 0 GKIKVSYVGYQPQVLDVKGKNSFNIKLKEDSEMLDEVVVTGYGGKQLRTKVTNSISKVKE ESLKVGLFSNPAQALSGAVAGLKVTQASGSPGATPTIVLRGGTNLDGTGSPLVMIDGQLR DGLSDINPEDIESMEVLKDAGATALYGARASNGVILVTTKNGKAGHREINFKAKLGLNYV NNPYEFLGGEDYINILRTTYAKSGYKCSDGEYVSIAPLSNLTGASPFGTGNVLGKSAWNI MGKTADNAYLLQKGWKEMVDPLDPSNMIIYKDINPADYNLNNPSFSQDYNINMSGGNDRG SYYAGLGYNRQEGLPVSTFYERYSFVLNASYKITDWLTSTSNFNYNRANWKNMPGSNGSE GNYFGRIMSTPPTARFEDEDGNPTLGPNVADGNQAFQPEKWQTFNQTDKFTMIQGFQIDI MKGLFIKGTANWYYSEGLYESFTKDYMDNMLTEHYTKTRTSTAKFERNFAQTYNAVLNFT HTFAKDHNVNLMLGMEYYDNYKRGFEAKGSGAPTDDFSDLSLTDNGKGKRTINSWHEQYR ILSYFGRLNYDYKSKYLLSAVFRQDGYSSLLGDNRWGFFPGVSAGWIFGQENFIKEAIPF LSFGKLRASYGVNGNASGIGAYTLQGSYNTAVYNGNTGFLIGTLPNPGLRWEKTKTAEVG MDLSFFENRLNANFTYYNRLTSDKYAAFSLPSTTGFSSITNNNGKFRNSGVEIELSGKIL KTKDWSWEASANISFNKNKVVALPDNGLELNRQGGQQIYTGEKFTNEKGEIEYAKKWVGG YQEGKEPGVMVVYKSEGIYRNWNEIPGDLVITSGNYYGKKMYGPEAWKKLTKEQQKNALP IMPGDMKWKDINEDGIIDSYDQIVAGNTTPHWIGGFNTTLRWKNFQLYGRFDFALDYWIY DHTLPEIFLACAQGTYNTTKEVFNTWSEENPNAKYPRYAYADVLTNANYARNSTMFAYKG NYLAIREISLSYSLPKAWANKAYCQKVDVSITGQNLGYITSANVATPEVSNAGSGYALPR TLLFGLNVTF Prediction of potential genes in microbial genomes Time: Wed May 18 00:43:44 2011 Seq name: gi|226331981|gb|ACIB01000075.1| Bacteroides sp. 
3_2_5 cont1.75, whole genome shotgun sequence Length of sequence - 9817 bp Number of predicted genes - 4, with homology - 4 Number of transcription units - 2, operones - 2 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 47 - 3316 2736 ## BF1725 hypothetical protein 2 1 Op 2 . + CDS 3344 - 4915 1303 ## BF1724 hypothetical protein + Term 4958 - 4995 2.0 - Term 4943 - 4986 9.0 3 2 Op 1 . - CDS 5030 - 6592 1223 ## BF1721 hypothetical protein 4 2 Op 2 . - CDS 6610 - 9816 2209 ## BF1722 putative outer membrane protein Predicted protein(s) >gi|226331981|gb|ACIB01000075.1| GENE 1 47 - 3316 2736 1089 aa, chain + ## HITS:1 COG:no KEGG:BF1725 NR:ns ## KEGG: BF1725 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 1089 1 1089 1089 2053 99.0 0 MKKTIFLILCILCSLGAMAQKKSITGVITDGAGESIIGASVVEVGTTNGTITNFDGEFSL TIATGAKFTVSYIGYKSQTITVGAENTYKIVLKEDTEVLDEVVITGYGGSQKRATLTTAI SKLDNSVLKNAAFSNAGQSLQGSVTGLRVVNKTGQPGSEPDITLRGGATITGDNSKALIV VDGIVRNSMSDINPSDIESIQVLKDAASTAIYGARANGGVILVETKSGKEGKASVNYKFK MGVNFARKGYDFCDAHDYIYYNRLGYKRTGRTNVDTQMGYGIGNNLFDIRYLTDENANLK NEGWASMADPFYDGKTILYKDYSGELDDVVFNNSALTQDHYVNITGGNDKGTFSASLGYY KEDGQIRGTGYERFNGALNGSYKVFPFLTVKANATYSWSTQPELWIGQYEFFYRTRSQRP TWNPWNEDGTPASGFGTGDGNPDYYRDKLTSENSTNKSTYSVGFALDILPKKLVLNGNAS LYRYDWQREKFNKSYQAQSSATPDNTRQAEAYVQKYNQIQLNGTLTYTDTFAEKHNLEAM LGTEYFTYDQFDFEAKTQNSPTDDIPTLNAGSTRTYTSTTKTAYRILSGFGRINYNYDMR YLISFVARYDGISKLKDNRWGFFPGVSVGWNIMEERFWKDSKISGVISNLKPRLSYGVNG NVNGIGNFDVYGAYSQVGAKTYGGSTAFYNSGLVNTGLRWEQSQSFEAGLDIGFLNNRLS FILDYYNRTTKDLLTKQALPGYTGFTEIMTNMGTLRNYGFEMEVRANILNNPKGLTWDVT ANLSSVANKIVKLPYNGNPNNRVGGYEVATGRKKADGTDETKWIAGRQEGGKLGELVAYK QNHIFKDWDDVKKYANNRIDEVANLYGPGLAAQYAGKEGWQPIEPGDVCWEDINGDGVIN GYDRQVVGNIFPKVTGGFSTTLGYKNLSLYARFDYALGHTIYNDLAARSLGQYQGSFNII KEVKKTWSETNTDTDLPAFYYADQLSKKNITRSNNGLTAIDNNSSRFYEKGDYLALRELT LNYNLPKTWISKVGMTDASVYVTGQNLFYITGYTGVSPEPAVDTTYGRGIDNGRYPTPRT VLFGLSVTF >gi|226331981|gb|ACIB01000075.1| GENE 2 3344 - 4915 1303 523 
aa, chain + ## HITS:1 COG:no KEGG:BF1724 NR:ns ## KEGG: BF1724 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 523 1 523 523 1014 99.0 0 MKKIFTYMLAGLSLLLFSRCDALDIDPTSSIADNNYWKTEAQFSTFNVGLHALLRECSFN FFLLGEPRADIYGDVPFGGEATQGMERLPFNTINKENTGISNFAGMYKVINQINLMIAKT KETTVLSEAGKNYYLGEAYGMRAYLYFHLLRSWGDVILYLDYTNGASLDLSNLSKAASPA EEVMKQIKEDITASEKGFGSDYSFKYGRYYWSMAATQMLKGEVYLWSGRQMGGGTADYTT AKTALQSIVSNANVSLQDDFSKVFAYNNKDNSEIIFSIRNAKDEYNMWDDKFRQNLVPQQ AYMTSTYCNKEGVSFKDLPEGQLNGLIRLQIRYDLYNKAFRDGDTRKDASMTAVYQKQQD GTVKYIAPFCNKYQGVLLDGASQRSFLNDYPIYRYADCLLLLAEAKALLGEDPTAEINQV RERAYGKEFFEANKATLAYPNDKGDFYTDNKYMSGDEDPLEAILKERMREFMFEGKRWYD LRLLGADYVTKRTSAVATRLLWPINESVLTDNPALKQTPGYQN >gi|226331981|gb|ACIB01000075.1| GENE 3 5030 - 6592 1223 520 aa, chain - ## HITS:1 COG:no KEGG:BF1721 NR:ns ## KEGG: BF1721 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 520 1 520 520 1039 99.0 0 MKKIIYIISLFVSSVLYTSCDALDLAPEDYFGSGNYWNNEAQVEGFMYGMHSQLRGNYDM FYCLGELRGGTQRVGSSSQNTSLDYAILRTNTLSQDNPGFTNWFGLYSPIMQVNHFIQKV ENECAFLSEADRKTYLGQAYGLRALYYFMLYKTYGGVPIVTTVELLDGKVTADKFYVERA TPEATLSFIKEDIGKSESYFGTTEINNKHDKTMWSKAATLMLKAEIYMWAAKVSINGYTA SGKSDLEIAKNALNGIIGKFQLLNKFSDVFSTSNRNNAEVIFTLHFADGEATNWGGMFLY QDAVFIGQVYGHDDKKIETDTLNLKGTGGVFRHEYTEDFWKSYDEKDTRRDATFLEYYTK KNKEGFGCVMQKAIGSINSNNNRIYDTDFIVYRYADALLMMAEVENGLGNPCAGYINEVR KRAYGSDYEANKYTEGNYAENELAILHERDKEFVWEGKRWFDVVRMHDANGKSLAFSATA NYPADKSILDPSEEYKLLWPIDVNVMSVNPLLKQTPGYEK >gi|226331981|gb|ACIB01000075.1| GENE 4 6610 - 9816 2209 1068 aa, chain - ## HITS:1 COG:no KEGG:BF1722 NR:ns ## KEGG: BF1722 # Name: not_defined # Def: putative outer membrane protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 1068 40 1107 1107 2030 99.0 0 SVVEVGTTNGVITDIDGKFTLMVDPNGKIKVSFVGYQPQTIDVKGKGSFNIQLKEDSEML EEVVVTGYGGKQLRTKVTNSIAKVKEETLKQGMFTNPAQALSGAVAGLSVSQTSGNPGSA PTLVLRGGTNFDGSGSPLILIDGQVRSSLSDINPDDIESMEVLKDAGATAIYGARANDGV 
VLVTTKRGKSGRAEVNLKAKFGLNYFKDSYNFMNAGDYLYWMRTGYMNAYVGDMKHPDGA GIKGWSSLSGLTTATPYGTGNLYFDKDGVTPLDGNKTSSAIWSPMLYSDNLSFLLDQGWQ TMIDPIYGDKIIYKNTNIADFNINTPSFSQDYNLSVSGGNDKGNYYAGLGYNKSEGTAVG NWYQRITFTFNADYKLKEWLTSNSSFNFADATWNGLPASQTAEANYFSRCLSLPPTFRGY NADGDMLLGPNSGDGNQQYNFKQFVRDNNTDKFTMNQSFTIDFMKGLSLKLGAIWYYSEE KTEAFNKDYLSSPGNLITSHSTSASYARTLDQTYNAVLNYNYQINKDNFLDAMVGFEYYD SYSKGFSASGSGAPTDDFMDLEYTSKEEGKRSIDSSHSRQRIMSFFGRVNYDYQSKYLLS LVLRRDGYSKLAEENRWGVFPGVSAGWVFSKEEFMKNTASILSFGKLRASFGLNGNVNKN FVGNYTVQGAYGSNRYNGSTGFLLSSIPNPYLMWEKSRTFEVGLDMGFLENRINANLTYY NRLTSDKYANITVPSTSGVSSVTSNNGEFQNQGFEFELGFRIIDAKDWKWNLNWNGALNK NKVVSLPDNGLERNRQSAYQVYTGNGDEKKWVGGYQEGQRPGDLYMFVAEGLYKSQDEIP ANLIDLTTGNNGSSGRPLYGGAEGYNKLTDSQKSNALPIQPGDVKWKDVNNDGVIDNYDM VKVGNTVPKWTGGINTTVSWKDLTLSARFDYALGFTAVDWKTMWIMSCAQGTYNTIEETK NTWTPENPNAKYPTYVWADQLGKRNYCRSSSMFAYNGNYIALRELSLSYKLPSILVQKAK LSNVELSITGQNLGYLTEAKHLFSPEKADNNGGYPLPRTVIFGINVSF Prediction of potential genes in microbial genomes Time: Wed May 18 00:44:20 2011 Seq name: gi|226331980|gb|ACIB01000076.1| Bacteroides sp. 3_2_5 cont1.76, whole genome shotgun sequence Length of sequence - 5584 bp Number of predicted genes - 9, with homology - 9 Number of transcription units - 6, operones - 1 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 46 - 105 3.5 1 1 Tu 1 . + CDS 324 - 1574 794 ## BVU_1439 mobilization protein - Term 1566 - 1632 20.4 2 2 Tu 1 . - CDS 1644 - 2414 444 ## gi|237717461|ref|ZP_04547942.1| predicted protein - Prom 2508 - 2567 7.9 + Prom 2382 - 2441 8.5 3 3 Tu 1 . + CDS 2506 - 2967 308 ## gi|198273874|ref|ZP_03206406.1| hypothetical protein BACPLE_00007 - Term 2851 - 2899 7.2 4 4 Tu 1 . - CDS 2971 - 4020 647 ## gi|194442170|ref|YP_002038843.1| rolling-cycle type replication protein - Prom 4170 - 4229 3.1 + Prom 3878 - 3937 8.4 5 5 Tu 1 . + CDS 4032 - 4301 65 ## gi|237717469|ref|ZP_04547950.1| predicted protein 6 6 Op 1 . - CDS 4236 - 4427 80 ## gi|237717468|ref|ZP_04547949.1| predicted protein 7 6 Op 2 . 
- CDS 4514 - 4912 334 ## BT_p548236 hypothetical protein 8 6 Op 3 . - CDS 4949 - 5269 173 ## COG2337 Growth inhibitor 9 6 Op 4 . - CDS 5263 - 5511 334 ## gi|189464117|ref|ZP_03012902.1| hypothetical protein BACINT_00452 Predicted protein(s) >gi|226331980|gb|ACIB01000076.1| GENE 1 324 - 1574 794 416 aa, chain + ## HITS:1 COG:no KEGG:BVU_1439 NR:ns ## KEGG: BVU_1439 # Name: not_defined # Def: mobilization protein # Organism: B.vulgatus # Pathway: not_defined # 3 408 4 452 467 146 28.0 1e-33 MAKTCIRVEACNIGSSERHNLRSKELDYIRPELTHRNEQWVECSIAEVHRDITEKYKEAT GQGLQKKATPIREGVIVISEETTIQQLQDLAEKLEERFGVHAFQIYTHKDEGANVWDGKE EAWKPNYHAHMIFDWTDGHTGKTVKLNRHDMAEMQTITAECLNMERGVSSDKKHLSAMQY KNKMETEKAEQLQKDIEQLNRAYTAGTEKITTVQKELSTAQKELNSMKTDIHINEAKEAA AKAGKTVFNAVTSVFNVGEVKRQAKEIETLKKENYNLSFKNQNLEGQLRTNNIEMRRVQE TTDRQIKAEASKLKPITNLFPEMENAQENIEELQTMGIQNKDIRQLLIGKEIYYTGNLYD KDKRKSYQVKNVKIDISKSQNGITTIWLNNIHFKNFLKELWNTLQKTLDNGNWLHR >gi|226331980|gb|ACIB01000076.1| GENE 2 1644 - 2414 444 256 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|237717461|ref|ZP_04547942.1| ## NR: gi|237717461|ref|ZP_04547942.1| predicted protein [Bacteroides sp. 
2_2_4] # 1 256 53 308 308 488 100.0 1e-136 MKKVMFMFISLLTVLGVCSCNDNEVIEPVVSDSVSDMEVLSRFVDVNEITNEYYFNENKK TRALSYVTGSDWQDLEKVSPLSIEKYKNNLQVLNAQVASAISNPNTAYVVFSVNGKTLVK KVKEDANFDFSVFRDVVTETRAVLPSLSINGGSQSTTGVFYDSSRTLKMQVDLNASIQNN YYFFEVLNPNAKPSPDDNITTPESVAFSGTGPLWSNTFTWTSYWDANVPGQGFKWEFKGK GTTPSFGFIANCTFSR >gi|226331980|gb|ACIB01000076.1| GENE 3 2506 - 2967 308 153 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|198273874|ref|ZP_03206406.1| ## NR: gi|198273874|ref|ZP_03206406.1| hypothetical protein BACPLE_00007 [Bacteroides plebeius DSM 17135] # 1 153 40 192 192 296 100.0 3e-79 MNRIIHISLLISTVSLFLISNTGCVEKQGYYNHGEESIISLICDITWAGKKTTDENGSVW QGTYKFNKNGTYTRTNIEIDKQGNKKEANIYGQWSFGDPSFSTIYFGGEHYWDIDELTKN KFSFYDRSGKFGDPFMNREYIELTPYQENNTTN >gi|226331980|gb|ACIB01000076.1| GENE 4 2971 - 4020 647 349 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|194442170|ref|YP_002038843.1| ## NR: gi|194442170|ref|YP_002038843.1| rolling-cycle type replication protein [Bacteroides fragilis] # 1 349 1 349 349 703 100.0 0 MCKDRTNITNCKESPLKNGSDSLKKKAKVKIISHNFTRKLSEKNPDSKLMKAYMDAHVCS NVYTVQGGKAHSHYCHRAFCPICQRIQTAQNVKKFLPVMKYLASEGREFYFVTLTLQNCV TDDARVLRDFMRRCNRMWADGIRTKHKFRSLGISGVIKKECTYHVVKKDLFSFHYHFHII VDSLEAAKYVVAQWKKLHGSTVADARFQKYKKIEDFENAAIEVFKYASKASVSKSKGRDG KKKIQINYKALDMIYTAMHGMQRMSSFGQFRTMIGDEALNDFRDEDLVLNIEGLNVEDGC YTWYMSVKDKVADWYNMETGEALCCYKVTRKDELLQDIYLDEDIFPNSG >gi|226331980|gb|ACIB01000076.1| GENE 5 4032 - 4301 65 89 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|237717469|ref|ZP_04547950.1| ## NR: gi|237717469|ref|ZP_04547950.1| predicted protein [Bacteroides sp. 2_2_4] # 1 89 2 90 90 158 100.0 9e-38 MWLSFGIRPAKNEHKRLLILKIQSPLVTAFGLSFFLSPKVGNSFQTAKHHSKNLHITSQK AFTTKLSSLSSMGARPHSPGCAILTQIHY >gi|226331980|gb|ACIB01000076.1| GENE 6 4236 - 4427 80 63 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|237717468|ref|ZP_04547949.1| ## NR: gi|237717468|ref|ZP_04547949.1| predicted protein [Bacteroides sp. 
2_2_4] # 12 63 37 88 88 87 100.0 3e-16 MSDIRQFTPVFFLQLSRIARQGLSLCTAKIVFFPEVSLSVVDLVVYLCQNCAAGGMGASP HTR >gi|226331980|gb|ACIB01000076.1| GENE 7 4514 - 4912 334 132 aa, chain - ## HITS:1 COG:no KEGG:BT_p548236 NR:ns ## KEGG: BT_p548236 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 132 1 132 132 183 62.0 2e-45 MKPTDYIEWDNLKDIPFFLCQVVEDREKQDLDIYYLGKRVLHDYDHVGHYLRTAVILFRR VKSRTADWVNLRNLWTLRNCVRENYNHGIGMNDLIFGENFDGDNLDTLTPLTKKRFDFLC KRIKELDPYATI >gi|226331980|gb|ACIB01000076.1| GENE 8 4949 - 5269 173 106 aa, chain - ## HITS:1 COG:NMA0400 KEGG:ns NR:ns ## COG: NMA0400 COG2337 # Protein_GI_number: 15793407 # Func_class: T Signal transduction mechanisms # Function: Growth inhibitor # Organism: Neisseria meningitidis Z2491 # 1 104 1 104 105 94 46.0 4e-20 MVEQYEVYWVELDPTRGGEMAKTRPCVVVTPSDLNMYLTTVVIVPITSTIRNYPYRVLCS VAGREGEIATDQIRTVDKSRLKRKIGDLNFSEIKRLREVFQQMFCE >gi|226331980|gb|ACIB01000076.1| GENE 9 5263 - 5511 334 82 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|189464117|ref|ZP_03012902.1| ## NR: gi|189464117|ref|ZP_03012902.1| hypothetical protein BACINT_00452 [Bacteroides intestinalis DSM 17393] # 1 82 1 82 82 153 100.0 3e-36 MVTNIIQIGNSKGIILPSEVLKQLRLSLKSAVSISLDGNNIVIKAQPRQGWAEAAKRAHE NGDDELLIPDVFEDEKFEDWTW Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:15 2011 Seq name: gi|226331979|gb|ACIB01000077.1| Bacteroides sp. 3_2_5 cont1.77, whole genome shotgun sequence Length of sequence - 1752 bp Number of predicted genes - 0 Number of transcription units - 0, operones - 0 average op.length - 0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + SSU_RRNA 300 - 1752 99.0 # CR626927 [R:3205531..3207063] # # Bacteroides fragilis NCTC 9343 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides. Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:16 2011 Seq name: gi|226331978|gb|ACIB01000078.1| Bacteroides sp. 
3_2_5 cont1.78, whole genome shotgun sequence Length of sequence - 3365 bp Number of predicted genes - 0 Number of transcription units - 0, operones - 0 average op.length - 0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + TRNA 1 - 77 86.4 # Ile GAT 0 0 + TRNA 106 - 179 84.8 # Ala TGC 0 0 + LSU_RRNA 330 - 3208 99.0 # AP006841 [R:3140885..3143762] # 23S ribosomal RNA # Bacteroides fragilis YCH46 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides. + 5S_RRNA 3307 - 3365 100.0 # AP006841 [R:3140678..3140786] # 5S ribosomal RNA # Bacteroides fragilis YCH46 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Bacteroidaceae; Bacteroides. Predicted protein(s) Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:20 2011 Seq name: gi|226331977|gb|ACIB01000079.1| Bacteroides sp. 3_2_5 cont1.79, whole genome shotgun sequence Length of sequence - 1639 bp Number of predicted genes - 0 Number of transcription units - 0, operones - 0 average op.length - 0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + SSU_RRNA 185 - 1639 99.0 # CP000140 [D:416000..417494] # 16S ribosomal RNA # Parabacteroides distasonis ATCC 8503 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Porphyromonadaceae; Parabacteroides. Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:22 2011 Seq name: gi|226331976|gb|ACIB01000080.1| Bacteroides sp. 3_2_5 cont1.80, whole genome shotgun sequence Length of sequence - 3468 bp Number of predicted genes - 0 Number of transcription units - 0, operones - 0 average op.length - 0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + TRNA 130 - 203 78.1 # Ile GAT 0 0 + TRNA 214 - 287 84.8 # Ala TGC 0 0 + LSU_RRNA 373 - 3142 99.0 # CP000140 [D:417969..420738] # 23S ribosomal RNA # Parabacteroides distasonis ATCC 8503 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Porphyromonadaceae; Parabacteroides. 
+ 5S_RRNA 3365 - 3468 100.0 # CP000140 [D:147281..147431] # 5S ribosomal RNA # Parabacteroides distasonis ATCC 8503 # Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Porphyromonadaceae; Parabacteroides. Predicted protein(s) Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:23 2011 Seq name: gi|226331975|gb|ACIB01000081.1| Bacteroides sp. 3_2_5 cont1.81, whole genome shotgun sequence Length of sequence - 4845 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 2 - 2791 2178 ## BF0594 hypothetical protein 2 1 Op 2 . + CDS 2809 - 4323 1278 ## BF0669 hypothetical protein + Term 4349 - 4386 7.1 Predicted protein(s) >gi|226331975|gb|ACIB01000081.1| GENE 1 2 - 2791 2178 929 aa, chain + ## HITS:1 COG:no KEGG:BF0594 NR:ns ## KEGG: BF0594 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis_NCTC9343 # Pathway: not_defined # 1 929 83 1011 1011 1764 99.0 0 VEVAIKPVVKITLKPDTEVLDEVVVTGYGNFKKSSFTGAASSITTEKLQDVPSLSVQDKL AGSVAGVQITSTSGQPGAVESVRIRGMGSINAGNDPLYVIDGVPVMTGDASEFTYSQSGN SLLSTINSNDIESMTVIKDAAAASLYGSRAANGVIVITTKKGASGKLKLNVRADWGFSNK AIDYRPILNGEDRRDILYMGLKNYALNSGKDETYAVNYADNNIDKYAAKPWSGYTDWEDV LFRNGSHQNYEVNAQGGNERTKFYTSFGYTKQEGITLQSGYERITGRANMSHKADRVTIE ASTMFTNSTQNVNSEGTSFSSPIMCLAMTASPSTFPYNEDGTFSTSFPALNGANPLQTAT YNYDRSTIVRTLNTLSATWNIWDNLNIKETLGYDFNQTNNRVWWDPRSNDGRSSKGVFQR YMMNRSKLNTQTQLTYNKTFAEHHNIDVLLGFETEDYKYDYTYTNGNTYPSYLPEITNAG VSRGASNINSYRMTSYLGRLNYDYAGKYYVSGSFRRDGSSRLSRDSRWGNFWSVSGSWRL SQEAFMESLSNVITDAKIRASYGVNGTQPKDYYGYMGVYEFGYNYNGNGGSSEARFYNPS LKWEKNYATNIGIDLTLFNRLTVSAEWYNRETKDLLMDKPISAAVGVINSSGVANMLVNV GSMRNRGFELELKSTNIQNKDLLWTTSLNIGHNKNKLTKLDGEQQEIISGVSIHRVGQPY YSIYAYEYAGVDPQTGKELYYINGEDGSRETTTNSAAANKTIIGSIEPKVQGGLTNYVSW KFIDFNLTLTYSLGGHAYDYATWLQSNGGTYHYLGNVPAYYKMEDTWQKTGDNAKLPQFA YGNANKASSRWLMSTDHLRVKNMTLGFTLPQSWSSKAGISKLRAYVSGNNLLTWKKKSLY VDPEVPVDGLCTFETPALRTVTFGLEIGF 
>gi|226331975|gb|ACIB01000081.1| GENE 2 2809 - 4323 1278 504 aa, chain + ## HITS:1 COG:no KEGG:BF0669 NR:ns ## KEGG: BF0669 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 504 1 504 504 953 99.0 0 MKKEFMKKIYKSITLVAAILSLSSCGNDWLDRKPADGIPSEDAITNYNDALTARTGMYDG IQGNSNATSYYGARMFYYGDVRADDMQARTQGMRSSSCYEMLYTVDDAPNMWNIPYNVIR RANRLIEAINEKKVTDATEAQIGKIYSEALVVRALVHFDLVRIYGMPYTADNGASLGVPV IVKPLERNDLPSRNTVAEVYTQVITDLTDAINSGYLAKDQTPGYINEWAAKALLTRVYLT KGDNENALKVAEDIITNSPYKLWANEEYVNAWYKSNGAHTNEMIFEVVNASNDDWTDRNG IAYLLNENGYADAIVTKSFMNMLSQDPKDVRIGMVLPAQYDKDLQEEYGDAKIFINKFPA DKDDVGEMRLNNLPLLRLSEVYLSAAEAAAKLGGHQDKAAKYLNEIVQRANPEAKAISEA DATVERIILERRKEMIGEGQRYFDALRNNETIVRYKDEGDKGYHYSLIKESQSFDRTYFR AILPIPVDETNVNPNLRAQQNPGY Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:40 2011 Seq name: gi|226331974|gb|ACIB01000082.1| Bacteroides sp. 3_2_5 cont1.82, whole genome shotgun sequence Length of sequence - 1310 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 1, operones - 1 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 2 - 242 58 ## gi|189462404|ref|ZP_03011189.1| hypothetical protein BACCOP_03090 2 1 Op 2 . - CDS 199 - 993 503 ## BT_2995 hypothetical protein 3 1 Op 3 . 
- CDS 990 - 1289 106 ## gi|189464125|ref|ZP_03012910.1| hypothetical protein BACINT_00460 Predicted protein(s) >gi|226331974|gb|ACIB01000082.1| GENE 1 2 - 242 58 80 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|189462404|ref|ZP_03011189.1| ## NR: gi|189462404|ref|ZP_03011189.1| hypothetical protein BACCOP_03090 [Bacteroides coprocola DSM 17136] # 1 80 1 80 150 142 100.0 9e-33 METNSHPEAAASNVKQGQYSPRKRKDTTPRVNPQLAVELVYQELKRVEVYTKRIEDATAR KVQIDGKSLESAENRLKNVL >gi|226331974|gb|ACIB01000082.1| GENE 2 199 - 993 503 264 aa, chain - ## HITS:1 COG:no KEGG:BT_2995 NR:ns ## KEGG: BT_2995 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 242 1 261 403 105 29.0 2e-21 MIGKGKSISHGVAALEYDLAKEINGQAVATEIARHELYGCTGAEMVQEMKPYHIDFPNVK NNCLRFEVSPSIEESATFTDADWAELGNDFMQRMGLANHQYIIIRHSGTESKKEQAHLHI LANRVSLSGELYRDNWIGKKATEAANAIAKERNFVQSQDIGKVNKAEIKEAMDGVLKKMQ GFDFTKFKEELGKRGFKVREARASTGKLNGYYVTARSGTEYKASEIGKGYTLAHIERTQS KLKCNSMNISHGNKLTPGSGSFQR >gi|226331974|gb|ACIB01000082.1| GENE 3 990 - 1289 106 99 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|189464125|ref|ZP_03012910.1| ## NR: gi|189464125|ref|ZP_03012910.1| hypothetical protein BACINT_00460 [Bacteroides intestinalis DSM 17393] # 1 99 1 99 99 148 100.0 1e-34 MELRRNEKITFRCTELEKDALAEQAARCSLSVSEYCRSLSLGGRPRERYTEEERQLLRDI AQLKGTLQRLNNYFGGRQYREVFEENRALITELKKILSR Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:54 2011 Seq name: gi|226331973|gb|ACIB01000083.1| Bacteroides sp. 3_2_5 cont1.83, whole genome shotgun sequence Length of sequence - 925 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
+ CDS 27 - 924 372 ## CFPG_P2-1 replication protein A Predicted protein(s) >gi|226331973|gb|ACIB01000083.1| GENE 1 27 - 924 372 299 aa, chain + ## HITS:1 COG:no KEGG:CFPG_P2-1 NR:ns ## KEGG: CFPG_P2-1 # Name: not_defined # Def: replication protein A # Organism: A.pseudotrichonymphae # Pathway: not_defined # 23 276 24 272 532 145 38.0 2e-33 MWGLCIFAANQSFMKKKLPITKNKDVVVSWVYTWSKQQDMSIHEQRIVLRILEACQAELK GVKLKDYAGTKRKFEHGLWDVDAQMHVSDVIFSGRDYNEIIAALDSLAGRFFTYEDDEEW WKCGFISNPKYKKRTGIITFRVSNDLWDVFTKFAKGYREFELNKALALPTGYSLRFYMLM SGQVYPLDISLENLKDRLGIPADKYKDKNGKDRIDHFEERVLKPAKAALDESCPYTFNYV KVRENPNNKRSKVTGFRFYPVYQPQFRDEELEGKELQAKVTARYQIDSHVYEYLRYSCG Prediction of potential genes in microbial genomes Time: Wed May 18 00:45:58 2011 Seq name: gi|226331972|gb|ACIB01000084.1| Bacteroides sp. 3_2_5 cont1.84, whole genome shotgun sequence Length of sequence - 663 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 2, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 2 - 211 161 ## gi|255692735|ref|ZP_05416410.1| conserved hypothetical protein 2 1 Op 2 . + CDS 231 - 461 283 ## gi|237707884|ref|ZP_04538365.1| predicted protein + Term 462 - 509 10.1 - Term 453 - 494 5.5 3 2 Tu 1 . - CDS 498 - 662 126 ## gi|298484485|ref|ZP_07002637.1| initiator RepB protein Predicted protein(s) >gi|226331972|gb|ACIB01000084.1| GENE 1 2 - 211 161 69 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|255692735|ref|ZP_05416410.1| ## NR: gi|255692735|ref|ZP_05416410.1| conserved hypothetical protein [Bacteroides finegoldii DSM 17565] # 1 69 82 150 150 135 98.0 8e-31 DFERQGYRMKNGGYVDKRISFYSILCAVISLLFACFMCYLWTDAAKDRDNYKQYYEYYQE QAREQKGNK >gi|226331972|gb|ACIB01000084.1| GENE 2 231 - 461 283 76 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|237707884|ref|ZP_04538365.1| ## NR: gi|237707884|ref|ZP_04538365.1| predicted protein [Bacteroides sp. 
9_1_42FAA] # 1 76 35 110 110 145 100.0 7e-34 MGTTEHEEPRFFFILNKGAKSGGEITHAVLNGSIVSKPAGWDAFHGLALAREKLSSEEIQ QQMKELGVEMEIVPLI >gi|226331972|gb|ACIB01000084.1| GENE 3 498 - 662 126 54 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|298484485|ref|ZP_07002637.1| ## NR: gi|298484485|ref|ZP_07002637.1| initiator RepB protein [Bacteroides sp. D22] # 1 54 301 354 354 105 100.0 1e-21 TSEEINRNKETFITAQEKITDLIGELALLNGKSREKNNPKGWIINALKGKIKDK Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:12 2011 Seq name: gi|226331971|gb|ACIB01000085.1| Bacteroides sp. 3_2_5 cont1.85, whole genome shotgun sequence Length of sequence - 1216 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 2 - 860 391 ## CFPG_P2-1 replication protein A - Prom 1042 - 1101 5.5 Predicted protein(s) >gi|226331971|gb|ACIB01000085.1| GENE 1 2 - 860 391 286 aa, chain - ## HITS:1 COG:no KEGG:CFPG_P2-1 NR:ns ## KEGG: CFPG_P2-1 # Name: not_defined # Def: replication protein A # Organism: A.pseudotrichonymphae # Pathway: not_defined # 10 263 24 272 532 144 37.0 5e-33 MKKKLPITKNKDVVVSWVYTWSKQQDMSIHEQRIVLRILEACQAELKGVKLKDYAGTKRK FEHGLWDVDAQMHVSDVIFSGRDYNEIIAALDSLAGRFFTYEDDEEWWKCGFISNPKYKK HTGIITFRVSNDLWDVFTKFAKGYREFELNKALALPTGYSLRFYMLMSGQVYPLDISLDN LKDRLGIPADKYKDKNGKDRIDNFEERVLKPAKAALDESCPYTFNYVKVRENPNNKRSKV TGFRFYPVYQPQFRDEELEGKELQAKVTARYQIDSHVYEYLRYSCG Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:17 2011 Seq name: gi|226331970|gb|ACIB01000086.1| Bacteroides sp. 3_2_5 cont1.86, whole genome shotgun sequence Length of sequence - 1781 bp Number of predicted genes - 4, with homology - 4 Number of transcription units - 2, operones - 1 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 7 - 66 4.2 1 1 Tu 1 . 
+ CDS 103 - 300 305 ## gi|189464126|ref|ZP_03012911.1| hypothetical protein BACINT_00461 + Term 419 - 463 6.1 + Prom 343 - 402 3.2 2 2 Op 1 . + CDS 494 - 793 56 ## gi|255012199|ref|ZP_05284325.1| hypothetical protein Bfra3_23867 3 2 Op 2 . + CDS 790 - 1584 493 ## BT_2995 hypothetical protein 4 2 Op 3 . + CDS 1541 - 1781 67 ## gi|255012201|ref|ZP_05284327.1| hypothetical protein Bfra3_23877 Predicted protein(s) >gi|226331970|gb|ACIB01000086.1| GENE 1 103 - 300 305 65 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|189464126|ref|ZP_03012911.1| ## NR: gi|189464126|ref|ZP_03012911.1| hypothetical protein BACINT_00461 [Bacteroides intestinalis DSM 17393] # 1 65 1 65 65 110 95.0 4e-23 MVEYCVYWLENGEPMHEVFSNLAAAEMYSCAIRGKENVEWVEVSEEEAIDLDELEDMFPD DFCGV >gi|226331970|gb|ACIB01000086.1| GENE 2 494 - 793 56 99 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|255012199|ref|ZP_05284325.1| ## NR: gi|255012199|ref|ZP_05284325.1| hypothetical protein Bfra3_23867 [Bacteroides fragilis 3_1_12] # 1 99 1 99 99 148 100.0 1e-34 MEVRRNEKITFRCTRYEKLALAEQAARCSMSTSEYCRSLSLGGRPRERYTEEERQLLRDI AQLKGTLQRLNNYFGGRQYREVFEENQALITELKKILSR >gi|226331970|gb|ACIB01000086.1| GENE 3 790 - 1584 493 264 aa, chain + ## HITS:1 COG:no KEGG:BT_2995 NR:ns ## KEGG: BT_2995 # Name: not_defined # Def: hypothetical protein # Organism: B.thetaiotaomicron # Pathway: not_defined # 1 242 1 261 403 104 29.0 3e-21 MIGKGKSISHGTAALEYDLAKEIDGQTAAIEIARHELYGCTGAEMVQEMKPYHADFPNVK NNCLRFEVSPSIEESATFTDADWAELGNDFMQRMGLANHQYIIIRHSGTESKKEQAHLHI LANRVSLSGELYRDNWIGKKATEAANAIAKERNFVQSKDIGKANKAEIKEAMDDVLKKMQ GFDLTKFKEELGRRGFKVREARASTGKLNGYYVTARSGTEYKASEIGKGYTLAHIERTQS KLKYNSMNISHGNKLTPGSGSFHR >gi|226331970|gb|ACIB01000086.1| GENE 4 1541 - 1781 67 80 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|255012201|ref|ZP_05284327.1| ## NR: gi|255012201|ref|ZP_05284327.1| hypothetical protein Bfra3_23877 [Bacteroides fragilis 3_1_12] # 1 80 1 80 150 147 100.0 2e-34 METNSHPEAAASTVKQGQYSPRKRKDTAPKVNPQLAIELFFQEMGRIEAYTKRIEDATSR KVQIDGKSLESAENRLKNVL Prediction of 
potential genes in microbial genomes Time: Wed May 18 00:46:36 2011 Seq name: gi|226331969|gb|ACIB01000087.1| Bacteroides sp. 3_2_5 cont1.87, whole genome shotgun sequence Length of sequence - 1619 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 1, operones - 1 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 5/0.000 - CDS 11 - 598 463 ## COG2148 Sugar transferases involved in lipopolysaccharide synthesis 2 1 Op 2 1/0.000 - CDS 630 - 866 247 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 3 1 Op 3 . - CDS 895 - 1617 603 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis Predicted protein(s) >gi|226331969|gb|ACIB01000087.1| GENE 1 11 - 598 463 195 aa, chain - ## HITS:1 COG:SP1838 KEGG:ns NR:ns ## COG: SP1838 COG2148 # Protein_GI_number: 15901667 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sugar transferases involved in lipopolysaccharide synthesis # Organism: Streptococcus pneumoniae TIGR4 # 18 195 49 230 230 157 45.0 9e-39 MIRFFDIVFSLLGILLLSPVFLLLYIAICLESKGGGFYKQLRVGRYGGDFYVYKFRSMRV GADKKGLITVGGRDPRITRTGYLIRKYKLDELPQLFNVLKGDMSLVGPRPEVRKYVDLYT DEQKKVLSVRPGITDYASIEYVDENMILGEASDPDRAYIEQIMPDKIRYNMKYICNRSVK EYFKIIFLTFWSIIR >gi|226331969|gb|ACIB01000087.1| GENE 2 630 - 866 247 78 aa, chain - ## HITS:1 COG:SP1837 KEGG:ns NR:ns ## COG: SP1837 COG0399 # Protein_GI_number: 15901666 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 1 74 330 403 408 75 41.0 2e-14 MIDEIAKSEVAVNVHFIPMPMLSFFKSMGYDIKDYPQAYQNFKSEISLPIYPQLDSEKLN FIIETVKAAYATVIAENR >gi|226331969|gb|ACIB01000087.1| GENE 3 895 - 1617 603 240 aa, chain - ## HITS:1 COG:CAC2260 KEGG:ns NR:ns ## COG: CAC2260 COG0399 # Protein_GI_number: 
15895528 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Clostridium acetobutylicum # 9 229 91 297 394 191 45.0 9e-49 TYSATALAVLHAGAKPVMVDSGTDFNISVEAVRKAITPKTKAIIPVDIAGFPCDYERIMA LVQEPEMVKLFRSESPVQEKLGRILVMNDAAHSLGARYSSRQRTGCETDVAIFSLHAVKN VTTAEGGAICLNLPKPFDNTELYKELRMTSLNCQTKDAFSKSKAGGWRYDIVGFGMKINM ADVNAAIGLAQIREYPELLKERKRVFNAYSDAFSACDWAIVPPSVDGEKGKLLSHLCFAY Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:37 2011 Seq name: gi|226331968|gb|ACIB01000088.1| Bacteroides sp. 3_2_5 cont1.88, whole genome shotgun sequence Length of sequence - 1576 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 2, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 58 - 435 327 ## gi|301311570|ref|ZP_07217497.1| putative transposase subfamily 2 2 Tu 1 . + CDS 561 - 1424 398 ## COG2801 Transposase and inactivated derivatives Predicted protein(s) >gi|226331968|gb|ACIB01000088.1| GENE 1 58 - 435 327 125 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|301311570|ref|ZP_07217497.1| ## NR: gi|301311570|ref|ZP_07217497.1| putative transposase subfamily [Bacteroides sp. 
20_3] # 1 122 1 122 144 219 100.0 4e-56 MRKKHEHYSEEEKLHLLHSYYQSGMSKTSFCKQHGISGITLLNKWLAKYESVVKEVSLAP CQAPTDMSDRSKEDYHDENARLKKRVKELEKALAFSRLETEARDLMITRAEEYFNIPIRK KPGAK >gi|226331968|gb|ACIB01000088.1| GENE 2 561 - 1424 398 287 aa, chain + ## HITS:1 COG:PA0257 KEGG:ns NR:ns ## COG: PA0257 COG2801 # Protein_GI_number: 15595454 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Pseudomonas aeruginosa # 1 242 23 260 263 161 39.0 1e-39 MLDTVREIRGEDPGIGGYKLWIMLTALFGERFMPGRDSFYTLLRRHRLMLPPRKARGTTN SNHRYHKWKNLIKGFVPTSPNQLWVSDITYIPLAGGDVCYLHLVTDAYSHKIVGWVLADS LRASATLEALRQAIDQAVEMTGSENLEGLIHHSDRGVQYCCDAYVERLRRHGIAISMTED YKPTDNAVAERINGIIKVERLYRQGLFETIERAASVIERYIYFYNYRRPHMSVGYKTPGI VHGEKETQIKMWRNKRKPVKSNEKEMDTIALQSRTTNLSEGLCSAQR Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:43 2011 Seq name: gi|226331967|gb|ACIB01000089.1| Bacteroides sp. 3_2_5 cont1.89, whole genome shotgun sequence Length of sequence - 1553 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 2, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 51 - 110 6.6 1 1 Tu 1 . + CDS 291 - 773 104 ## Cphamn1_0884 hypothetical protein + Term 833 - 869 -1.0 - Term 550 - 589 -0.2 2 2 Tu 1 . 
- CDS 781 - 1269 284 ## Cphamn1_0884 hypothetical protein - Prom 1314 - 1373 4.3 Predicted protein(s) >gi|226331967|gb|ACIB01000089.1| GENE 1 291 - 773 104 160 aa, chain + ## HITS:1 COG:no KEGG:Cphamn1_0884 NR:ns ## KEGG: Cphamn1_0884 # Name: not_defined # Def: hypothetical protein # Organism: C.phaeobacteroides_BS1 # Pathway: not_defined # 1 137 1130 1266 1290 157 51.0 1e-37 MSDVKIGLTKLYNQFHNSQLTIISIEDENLQDKIFEKKYGKESIWLKKHLTNRSCNFSYN NIVERIHKLRTLHIQMDNYILGIYGWNDIKLEHNFYEVSYLPESDRIRFTVHPAVREEIL NRLLKLNHQLHKMEISNPYSTQKESTKGNKDNVSKDKMLF >gi|226331967|gb|ACIB01000089.1| GENE 2 781 - 1269 284 162 aa, chain - ## HITS:1 COG:no KEGG:Cphamn1_0884 NR:ns ## KEGG: Cphamn1_0884 # Name: not_defined # Def: hypothetical protein # Organism: C.phaeobacteroides_BS1 # Pathway: not_defined # 1 153 1130 1282 1290 133 46.0 2e-30 MCDNRLGMTKLYNQFHNDRLLEISNLAHFPSNGKEFEKIYGKENFSFLKHLCKYSNNTSI KDACRDILELRELYCKIDISVLNAYGWSDISLKHDFYEVDYLPENDRVRFTIHPNARKEI LRRLLLLNHEQYSKEQESLPSKQNRRIKIKNDQSFIDLFSDV Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:49 2011 Seq name: gi|226331966|gb|ACIB01000090.1| Bacteroides sp. 3_2_5 cont1.90, whole genome shotgun sequence Length of sequence - 1219 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 10 - 1218 403 ## COG1002 Type II restriction enzyme, methylase subunits Predicted protein(s) >gi|226331966|gb|ACIB01000090.1| GENE 1 10 - 1218 403 402 aa, chain - ## HITS:1 COG:VNG6136H KEGG:ns NR:ns ## COG: VNG6136H COG1002 # Protein_GI_number: 16120088 # Func_class: V Defense mechanisms # Function: Type II restriction enzyme, methylase subunits # Organism: Halobacterium sp. 
NRC-1 # 3 360 55 395 661 71 21.0 3e-12 GGGFDITIGNPPYISAPTQIASPELNEQRSRIVASKKYKSLNEKWDLYVPFMELGMQLLC PNGIFSMIVPYPLTNQKYGKKLRKMIAEEYHLLEIADLNGTKIFENATVSNCIPFIKNTQ PEGELRITKIFEDKTIREVLSKSPEALKQDEKNYVWNLTEEERTGNRFANMNILGDFCYI SVGMVVNANEKNAQGAFKKEDLISDSYDAIHCRKYIEAKDIDKFQVKRVRYLEWNTERCP DKLRRPTFRELYDCPKLLINRLGVLKVYLDMDTHFLHSDSMFCAVLWKDLKGVYNKSLSS SIKKFCKHNRAVMESLSEKVDLYYLLGILNSSMADQLLADQRGGDYHIYPEHIRNLPIPA PQREAQNAIGVIVKEILHRREENIDYSELEEQLNGLVTALYQ Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:50 2011 Seq name: gi|226331965|gb|ACIB01000091.1| Bacteroides sp. 3_2_5 cont1.91, whole genome shotgun sequence Length of sequence - 998 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 11 - 505 255 ## ABBFA_002275 core protein 2 1 Op 2 . + CDS 508 - 948 311 ## lse_0573 hypothetical protein Predicted protein(s) >gi|226331965|gb|ACIB01000091.1| GENE 1 11 - 505 255 164 aa, chain + ## HITS:1 COG:no KEGG:ABBFA_002275 NR:ns ## KEGG: ABBFA_002275 # Name: not_defined # Def: core protein # Organism: A.baumannii_AB307-0294 # Pathway: not_defined # 1 141 1463 1605 1623 98 36.0 7e-20 MGMYVSQDPIGLAGGILNLYGYVIDVNTTVDILGLSSKKLGKNINAKIGDGMENHHLIPE EVWKQNQVMFDALGLDMDSADNGRLVPDSDKRRMQLGEAVYHRGSHPKYSHHVKSRIAKI RTNWKPGVNDDATRKRIRRLQIQLNGQIRKGNVPRGKCSYNKIG >gi|226331965|gb|ACIB01000091.1| GENE 2 508 - 948 311 146 aa, chain + ## HITS:1 COG:no KEGG:lse_0573 NR:ns ## KEGG: lse_0573 # Name: not_defined # Def: hypothetical protein # Organism: L.seeligeri # Pathway: not_defined # 8 132 6 124 138 64 30.0 1e-09 MENLNSYRLVDKSEFDEINRLAEEAKQTAKIATTDAIEQQLTKMKDFINNFIANKNNEED DIENYAYSFGSLFGNLIKTKYDWEWYQIIVDGEEFYCIASPKQRACCICHNYFYSILEGS HNNNFKLLFNMIKKDYPSSWNFMVLS Prediction of potential genes in microbial genomes Time: Wed May 18 00:46:56 2011 Seq name: gi|226331964|gb|ACIB01000092.1| Bacteroides sp. 
3_2_5 cont1.92, whole genome shotgun sequence Length of sequence - 910 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 3 - 837 738 ## BDI_1420 putative transcriptional regulator UpxY-like protein Predicted protein(s) >gi|226331964|gb|ACIB01000092.1| GENE 1 3 - 837 738 278 aa, chain - ## HITS:1 COG:no KEGG:BDI_1420 NR:ns ## KEGG: BDI_1420 # Name: not_defined # Def: putative transcriptional regulator UpxY-like protein # Organism: P.distasonis # Pathway: not_defined # 1 278 1 278 370 552 99.0 1e-156 MMNVLRDGRSERGTARVGKRHDLKTVRWYVLTLPTTGVARRDRISPAKSLDAELSRRKRR GETLFEYFAPSYVEVRKVDGKMVNTKRPLLFNYVFVRSSVEEIFQMKRTLPLYNFLPRVS SGGMTHFPYLSDDEMGNLRWVAESYSNELPVYVPDSDRLLKGDRVRITSGYFTGMEAEVV IQPGGGHKDVMARILDCMWVPLLEVKPGEYELIELNTKGKHVYTHLDNDRLREGLHDALG RYHASGNVSEEDTRLAREVLRSYGSLRAETDVMRCKIY Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:00 2011 Seq name: gi|226331963|gb|ACIB01000093.1| Bacteroides sp. 3_2_5 cont1.93, whole genome shotgun sequence Length of sequence - 764 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 2 - 443 287 ## BF0123 hypothetical protein 2 1 Op 2 . 
- CDS 465 - 764 331 ## BF0123 hypothetical protein Predicted protein(s) >gi|226331963|gb|ACIB01000093.1| GENE 1 2 - 443 287 147 aa, chain - ## HITS:1 COG:no KEGG:BF0123 NR:ns ## KEGG: BF0123 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 147 684 830 834 277 97.0 9e-74 MADYLRYMYKTVRKYFGEAIVVTQEVDDIISSPVVKESIINNSDCKILLDQRKYMNKFDQ IQALLGLTEKEKSQILSINMANNPSRLYKEVWIGLGGTQSAVYATEVSAEEYLAYTTEET EKVEVYRLAEQLGGDIEAAIRQLAERR >gi|226331963|gb|ACIB01000093.1| GENE 2 465 - 764 331 99 aa, chain - ## HITS:1 COG:no KEGG:BF0123 NR:ns ## KEGG: BF0123 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 99 577 675 834 191 97.0 8e-48 DDYRKELAERDIKVEKSDFNIDNMLTTMRQYYRGGRYDFLLNSTENIDLLGKRFIVFEID SIKDNRELFPVVTIIIMEAFINKMRRLKGVRKQLIVEEA Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:05 2011 Seq name: gi|226331962|gb|ACIB01000094.1| Bacteroides sp. 3_2_5 cont1.94, whole genome shotgun sequence Length of sequence - 712 bp Number of predicted genes - 2, with homology - 1 Number of transcription units - 2, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 281 88 ## + Term 310 - 349 2.0 - Term 298 - 337 2.0 2 2 Tu 1 . - CDS 478 - 687 175 ## gi|256841947|ref|ZP_05547452.1| predicted protein Predicted protein(s) >gi|226331962|gb|ACIB01000094.1| GENE 1 3 - 281 88 92 aa, chain + ## HITS:0 COG:no KEGG:no NR:no TLAGVSLPLCDRKNLPCFFAENIPNSKSAVIATPIAAKAYFTGEIKFNKNIIPLIIQPQE WFNTIWSLIKILSTIGLNTCDIISKHFSQRCS >gi|226331962|gb|ACIB01000094.1| GENE 2 478 - 687 175 69 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|256841947|ref|ZP_05547452.1| ## NR: gi|256841947|ref|ZP_05547452.1| predicted protein [Parabacteroides sp. 
D13] # 1 69 13 81 81 131 100.0 2e-29 MLDAIRYWQENDKGGLEEDVKAIDDAITFIACEHDAPGVLTEKESLSLIAALSFLKKRLC LFEGKEEPK Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:16 2011 Seq name: gi|226331961|gb|ACIB01000095.1| Bacteroides sp. 3_2_5 cont1.95, whole genome shotgun sequence Length of sequence - 701 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 1 - 700 285 ## COG0732 Restriction endonuclease S subunits Predicted protein(s) >gi|226331961|gb|ACIB01000095.1| GENE 1 1 - 700 285 233 aa, chain - ## HITS:1 COG:XF0296 KEGG:ns NR:ns ## COG: XF0296 COG0732 # Protein_GI_number: 15836900 # Func_class: V Defense mechanisms # Function: Restriction endonuclease S subunits # Organism: Xylella fastidiosa 9a5c # 40 233 224 421 442 105 32.0 8e-23 MAKQLYDYWFVQFDFPNEEGKPYKSSGGKMVWNEKLKREIPEGWDISLIKDIATTYSGGT PKSTNIEYYDNGEIAWINSGELNSPIITKTTNYITKCGLENSSAKLYPSNSILVAMYGAT AGKVSLLTFEACSNQAVCGVIPTIENMLYYVYFHISSLYSHFITLSTGSARDNISQDTIK NILLPIPTRNILKLFDEKIGSIYQTIVNNYQQIDSLTKQRDELLPLLMNGQVS Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:16 2011 Seq name: gi|226331960|gb|ACIB01000096.1| Bacteroides sp. 3_2_5 cont1.96, whole genome shotgun sequence Length of sequence - 692 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 132 - 191 5.8 1 1 Tu 1 . 
+ CDS 333 - 551 234 ## BDI_2920 TPR repeat-containing protein Predicted protein(s) >gi|226331960|gb|ACIB01000096.1| GENE 1 333 - 551 234 72 aa, chain + ## HITS:1 COG:no KEGG:BDI_2920 NR:ns ## KEGG: BDI_2920 # Name: not_defined # Def: TPR repeat-containing protein # Organism: P.distasonis # Pathway: not_defined # 1 69 36 104 588 118 94.0 8e-26 MPQHPDSALMLLEQIENKENLSRKDKAHYYLLLTEAQDKTFVKHETDSLITIATDYYEET DDLETKSESLVL Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:18 2011 Seq name: gi|226331959|gb|ACIB01000097.1| Bacteroides sp. 3_2_5 cont1.97, whole genome shotgun sequence Length of sequence - 677 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 1 - 676 205 ## COG0732 Restriction endonuclease S subunits Predicted protein(s) >gi|226331959|gb|ACIB01000097.1| GENE 1 1 - 676 205 225 aa, chain - ## HITS:1 COG:Cj1551c KEGG:ns NR:ns ## COG: Cj1551c COG0732 # Protein_GI_number: 15792859 # Func_class: V Defense mechanisms # Function: Restriction endonuclease S subunits # Organism: Campylobacter jejuni # 44 208 4 167 380 134 45.0 2e-31 MAKQLYDYWFVQFDFPNEEGKPYKSSGGNMVWNEKLKRNIPIGWNNGTLIDIANITMGQS PDGTSYNEIGEGVLFYQGSTDFGMRFPSVRQYTTAPSRFAKKGDILMSVRAPVGAVNIAN NDCCIGRGLSALNSKIGSTTHLYYILNDLRIAFDQRNAAGTTFGSITKEDLYNLPIVIPA KEVISAFDKICSPMFDRQMLLGEEIDTLIKQRDELLPLLLNGQVL Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:19 2011 Seq name: gi|226331958|gb|ACIB01000098.1| Bacteroides sp. 3_2_5 cont1.98, whole genome shotgun sequence Length of sequence - 659 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 2 - 659 655 ## BF3047 hypothetical protein Predicted protein(s) >gi|226331958|gb|ACIB01000098.1| GENE 1 2 - 659 655 219 aa, chain - ## HITS:1 COG:no KEGG:BF3047 NR:ns ## KEGG: BF3047 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 219 27 245 303 416 100.0 1e-115 VLEGFLTVLLGEPIRIVEILESEGNQLNETDKFNRVDIKARNSKDEIIIVEVQNTREIYY LERILFGVAKAITEHIELGQLYSEVKKVYSISILYFDIGRGTDYLYHGQNSFVGVHTGDL LEVSTKEKNAIVRKLPAEIFPEYFLIRVNEFNKVAVTPLEEWIEYLKTGVIHPDTKAPGL EEARRKLVYYNMNKAEQLAYDEHINAIMIQNDVLSTAAM Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:23 2011 Seq name: gi|226331957|gb|ACIB01000099.1| Bacteroides sp. 3_2_5 cont1.99, whole genome shotgun sequence Length of sequence - 643 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 2 - 605 611 ## BF3345 hypothetical protein Predicted protein(s) >gi|226331957|gb|ACIB01000099.1| GENE 1 2 - 605 611 201 aa, chain - ## HITS:1 COG:no KEGG:BF3345 NR:ns ## KEGG: BF3345 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 201 1 201 257 395 100.0 1e-109 MKIKRLLVLAVLPMLCLAVNAQNSSKDNTPKKGDFTVAATVGYNSYTSVTAPSGLLTDYE VRALSTNWADKKLMVGFEGGWFFKDQWKLNLGGGVSFTNNPGYPAVPGTIDDSNKNNSAD ENMGEIPNYRAVADAQSFAYNVSAGVDRYFNIKRVPNLMWYTGIRVGFAYGENEMKYDEE TSMGKSIAESWNLRGALTIGV Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:27 2011 Seq name: gi|226331956|gb|ACIB01000100.1| Bacteroides sp. 3_2_5 cont1.100, whole genome shotgun sequence Length of sequence - 631 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 1 - 631 279 ## BDI_3156 hypothetical protein Predicted protein(s) >gi|226331956|gb|ACIB01000100.1| GENE 1 1 - 631 279 210 aa, chain - ## HITS:1 COG:no KEGG:BDI_3156 NR:ns ## KEGG: BDI_3156 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 1 210 78 287 522 419 99.0 1e-116 EEWTKHPILHFDLNISHYDAPDSLYKILNDTLSRYEEEYGTRPTEETLPLRFAGIIDRAY RKTGQRAVILIDEYDKPLLQNLHDEEMQNRFRNMLKPFYGVLKTMDRAIRFALLTGVTKF GKVSVFSDLNNLDDISMREPYAAICGITETELRTHFDEDIHTLASALERTYEEARSLLRK RYDGYHFVAGGPGIYNPFSLLNTFKYMRLS Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:31 2011 Seq name: gi|226331955|gb|ACIB01000101.1| Bacteroides sp. 3_2_5 cont1.101, whole genome shotgun sequence Length of sequence - 624 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 2 - 376 314 ## BDI_2633 hypothetical protein + Term 542 - 586 9.3 Predicted protein(s) >gi|226331955|gb|ACIB01000101.1| GENE 1 2 - 376 314 124 aa, chain + ## HITS:1 COG:no KEGG:BDI_2633 NR:ns ## KEGG: BDI_2633 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 1 124 1028 1151 1151 200 84.0 2e-50 DIAPTYAAWLRMQNGCLVLSDDRSSNFDEMNTGGDDALIFNVEQGDDIATDNETIDAVEG ISVVAGNGTVTVQGAAGKSVVITNILGKVIAETVLTSDNATISVPAGIVAVAVDGEEAVK AIVK Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:34 2011 Seq name: gi|226331954|gb|ACIB01000102.1| Bacteroides sp. 3_2_5 cont1.102, whole genome shotgun sequence Length of sequence - 600 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 2 - 383 117 ## BDI_1594 putative LPS biosynthesis related glycosyltransferase 2 1 Op 2 . 
- CDS 398 - 598 141 ## BDI_0125 tyrosine-protein kinase Predicted protein(s) >gi|226331954|gb|ACIB01000102.1| GENE 1 2 - 383 117 127 aa, chain - ## HITS:1 COG:no KEGG:BDI_1594 NR:ns ## KEGG: BDI_1594 # Name: not_defined # Def: putative LPS biosynthesis related glycosyltransferase # Organism: P.distasonis # Pathway: not_defined # 1 127 1 127 379 199 99.0 2e-50 MQITLLLLSGFLFSVLFGMVIIPRILVISHKKRLYDVPDSRKVHTTPVPRLGGLSFFPVI LMSMFLVIGFRLYFWDMDSSSLSFNMLYEYLFLFVGMTLLYLVGVCDDLVGVGYRYKFAV QIVSALL >gi|226331954|gb|ACIB01000102.1| GENE 2 398 - 598 141 66 aa, chain - ## HITS:1 COG:no KEGG:BDI_0125 NR:ns ## KEGG: BDI_0125 # Name: not_defined # Def: tyrosine-protein kinase # Organism: P.distasonis # Pathway: not_defined # 1 32 754 785 822 69 96.0 3e-11 GFSYINVLRRERKFPKLATVINGLDMSKRKNSYGYGYGKKYGYGKGYGYGYGYGYGYGFE AGDKKK Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:38 2011 Seq name: gi|226331953|gb|ACIB01000103.1| Bacteroides sp. 3_2_5 cont1.103, whole genome shotgun sequence Length of sequence - 579 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 1 - 508 293 ## COG3525 N-acetyl-beta-hexosaminidase Predicted protein(s) >gi|226331953|gb|ACIB01000103.1| GENE 1 1 - 508 293 169 aa, chain - ## HITS:1 COG:CC0447 KEGG:ns NR:ns ## COG: CC0447 COG3525 # Protein_GI_number: 16124702 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetyl-beta-hexosaminidase # Organism: Caulobacter vibrioides # 8 159 69 219 757 130 43.0 1e-30 MKKNAEFLASYIKEITGYELATATGQPGKGISLVIDQSIQNPEGYQLTVSDNGIRIAGST DAGVFYGIQTLRKSIPATAQGMNVELPAATINDYPRFAYRGMMLDVSRHFFPVDSVKTYL DILALHNQNTFHWHLSDDQGWRIEIKKYPELTQIGSKRKETVIGHNSGT Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:39 2011 Seq name: gi|226331952|gb|ACIB01000104.1| Bacteroides sp. 
3_2_5 cont1.104, whole genome shotgun sequence Length of sequence - 573 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 1 - 358 316 ## BF0116 hypothetical protein - Prom 379 - 438 2.2 Predicted protein(s) >gi|226331952|gb|ACIB01000104.1| GENE 1 1 - 358 316 119 aa, chain - ## HITS:1 COG:no KEGG:BF0116 NR:ns ## KEGG: BF0116 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 119 1 119 328 233 94.0 2e-60 MRKVIMMFALAMGIATANAQENVTVETTNGSDQPTLTKEVYPQKEADGDLYHGLTRKLGF DRMVPPHGLEVTYDKTVHVIFPAEVRYVDLGSPDLIAGKADGAENVIRVKATVRNFPNE Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:42 2011 Seq name: gi|226331951|gb|ACIB01000105.1| Bacteroides sp. 3_2_5 cont1.105, whole genome shotgun sequence Length of sequence - 573 bp Number of predicted genes - 0 Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:42 2011 Seq name: gi|226331950|gb|ACIB01000106.1| Bacteroides sp. 3_2_5 cont1.106, whole genome shotgun sequence Length of sequence - 563 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 151 - 540 328 ## BDI_1357 hypothetical protein Predicted protein(s) >gi|226331950|gb|ACIB01000106.1| GENE 1 151 - 540 328 129 aa, chain - ## HITS:1 COG:no KEGG:BDI_1357 NR:ns ## KEGG: BDI_1357 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 1 129 7 135 135 250 93.0 1e-65 MSSLPSGVRLVRLLNEHLGEIMGRERTNQNSIHLYCTGPYWVAFECSAYQLRRVFPDSEV TPMRLLGYPFPVVMVSVTDRSLRSYARKHILRRDDKDYKQLTVPGFSLSDYQGWHKREVE GLPLLSETV Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:45 2011 Seq name: gi|226331949|gb|ACIB01000107.1| Bacteroides sp. 
3_2_5 cont1.107, whole genome shotgun sequence Length of sequence - 558 bp Number of predicted genes - 0 Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:45 2011 Seq name: gi|226331948|gb|ACIB01000108.1| Bacteroides sp. 3_2_5 cont1.108, whole genome shotgun sequence Length of sequence - 556 bp Number of predicted genes - 0 Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:46 2011 Seq name: gi|226331947|gb|ACIB01000109.1| Bacteroides sp. 3_2_5 cont1.109, whole genome shotgun sequence Length of sequence - 555 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 317 152 ## BDI_2777 hypothetical protein + Term 329 - 369 -0.7 Predicted protein(s) >gi|226331947|gb|ACIB01000109.1| GENE 1 3 - 317 152 104 aa, chain + ## HITS:1 COG:no KEGG:BDI_2777 NR:ns ## KEGG: BDI_2777 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 1 104 705 808 834 215 97.0 5e-55 VVDKRFADEVKGFTPQLDSTATITLDSYRPNKLVYTTKTNSEQLAVFSEIYYQPGWEATI DGKPAPHFRADWILRAMLVPAGEHQIVFEFRPQGYITAAYVTSF Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:48 2011 Seq name: gi|226331946|gb|ACIB01000110.1| Bacteroides sp. 3_2_5 cont1.110, whole genome shotgun sequence Length of sequence - 552 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 7 - 552 312 ## BDI_3156 hypothetical protein Predicted protein(s) >gi|226331946|gb|ACIB01000110.1| GENE 1 7 - 552 312 181 aa, chain - ## HITS:1 COG:no KEGG:BDI_3156 NR:ns ## KEGG: BDI_3156 # Name: not_defined # Def: hypothetical protein # Organism: P.distasonis # Pathway: not_defined # 2 181 343 522 522 354 100.0 9e-97 DLTIKDYEPEFKIYRLGFPNQEVEEGFMKYLLPFYTNIQASKSPFEIGRFVREVRAGDYD AFFHRLQSFFADTSYEAIIGRNPERDTELHYRNVLFIVFKLVGLYTQVEYHTSNGRIDLV LQTDRYVYIMEFKLNGTAEEALRQIEEKGYALPFAGDDREVLKIGANFSSETRNIERWLV G Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:52 2011 Seq name: gi|226331945|gb|ACIB01000111.1| Bacteroides sp. 3_2_5 cont1.111, whole genome shotgun sequence Length of sequence - 550 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 13 - 498 560 ## BF2893 hypothetical protein Predicted protein(s) >gi|226331945|gb|ACIB01000111.1| GENE 1 13 - 498 560 161 aa, chain - ## HITS:1 COG:no KEGG:BF2893 NR:ns ## KEGG: BF2893 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 161 1 161 161 318 100.0 6e-86 MHSRIFQISKMWIEKENYLNEDTLHQGDGSFYDYCAEIDDEERKEDIRYLVNTALPKDMF ELVGDDTIRYIGGVEQWKENFVTNIRKKAEAITTENMLEFVGPVYQLEKALENPLDIAYH FYLDGEGYQSFAEKSFAFMEFVCTLEPGTILYIGGVIDYHF Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:55 2011 Seq name: gi|226331944|gb|ACIB01000112.1| Bacteroides sp. 3_2_5 cont1.112, whole genome shotgun sequence Length of sequence - 541 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
+ CDS 2 - 539 280 ## BF0120 hypothetical protein Predicted protein(s) >gi|226331944|gb|ACIB01000112.1| GENE 1 2 - 539 280 179 aa, chain + ## HITS:1 COG:no KEGG:BF0120 NR:ns ## KEGG: BF0120 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 179 40 218 334 342 91.0 4e-93 VAYRVWQSLARAEPIDVFPMLRPFAVGLCIMFFPSVVLGTINSVLSPVVQGTAKLLETQT LDMNKYREQKDRLEYEAMVRNPETAYLVSNEEFDKQLEELGWSPSDMVTMAGMYIDRGMY KMKKGIRDFFREILELMFQASALVIDTIRTFFLVVLAILGPIAFAISVWDGFQSTLTQW Prediction of potential genes in microbial genomes Time: Wed May 18 00:47:58 2011 Seq name: gi|226331943|gb|ACIB01000113.1| Bacteroides sp. 3_2_5 cont1.113, whole genome shotgun sequence Length of sequence - 538 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 2 - 536 203 ## BF0123 hypothetical protein Predicted protein(s) >gi|226331943|gb|ACIB01000113.1| GENE 1 2 - 536 203 178 aa, chain + ## HITS:1 COG:no KEGG:BF0123 NR:ns ## KEGG: BF0123 # Name: not_defined # Def: hypothetical protein # Organism: B.fragilis # Pathway: not_defined # 1 178 398 575 834 358 96.0 5e-98 ETNYRSSLSPFGIKMVDRLTGKPLHLDISDLPMKRGITTNRNKFVLGPSGSGKSFFMNHL VRQYYEQGAHVVLVDTGNSYQGLCEMIRRKTGGTDGVYFTYTEEKPISFNPFYTDDYVFD VEKKDSIKTLLLTLWKSEDDKVTKTESGELGSAVSAYIERIRADRSIVPSFNTFYEYM Prediction of potential genes in microbial genomes Time: Wed May 18 00:48:02 2011 Seq name: gi|226331942|gb|ACIB01000114.1| Bacteroides sp. 3_2_5 cont1.114, whole genome shotgun sequence Length of sequence - 524 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 11 - 70 1.7 1 1 Tu 1 . 
+ CDS 172 - 523 151 ## BDI_3408 putative iron transport-related exported protein Predicted protein(s) >gi|226331942|gb|ACIB01000114.1| GENE 1 172 - 523 151 117 aa, chain + ## HITS:1 COG:no KEGG:BDI_3408 NR:ns ## KEGG: BDI_3408 # Name: not_defined # Def: putative iron transport-related exported protein # Organism: P.distasonis # Pathway: not_defined # 1 117 1 117 727 239 99.0 2e-62 MKRKCLCAALAVSCAINVSFAQVRLQGKVVDESNEPIPGANIRVSESLNGTTTDASGKFE LNLPDGRHRIRVTYLGYEQGVYQTDHSEKDVVIKLKEKYVNIDQVVVTGTGSHRRMS Prediction of potential genes in microbial genomes Time: Wed May 18 00:48:05 2011 Seq name: gi|226331941|gb|ACIB01000115.1| Bacteroides sp. 3_2_5 cont1.115, whole genome shotgun sequence Length of sequence - 501 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 57 - 119 6.7 1 1 Tu 1 . - CDS 141 - 500 158 ## Fjoh_4386 hypothetical protein Predicted protein(s) >gi|226331941|gb|ACIB01000115.1| GENE 1 141 - 500 158 119 aa, chain - ## HITS:1 COG:no KEGG:Fjoh_4386 NR:ns ## KEGG: Fjoh_4386 # Name: not_defined # Def: hypothetical protein # Organism: F.johnsoniae # Pathway: not_defined # 5 117 316 428 438 124 50.0 1e-27 KKYSEKDEKDYIAAKSRFFNLSFADGNIMVRVLESLSDFYKEGKLLHHCVFSNAYYKRED SLIMSATVDGRRMETVEFSLSRMEVCQCRGKSNQLSAYHDRILNLVRDNIPLIRERMVV