Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:06:09 2011 Seq name: gi|296155609|gb|ADVK01000001.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00001, whole genome shotgun sequence Length of sequence - 19920 bp Number of predicted genes - 21, with homology - 21 Number of transcription units - 11, operones - 7 average op.length - 2.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 1/0.750 + CDS 2 - 505 740 ## COG1954 Glycerol-3-phosphate responsive antiterminator (mRNA-binding) 2 1 Op 2 . + CDS 522 - 1577 1006 ## COG0598 Mg2+ and Co2+ transporters - Term 1549 - 1613 12.1 3 2 Tu 1 . - CDS 1614 - 2603 1375 ## COG4296 Uncharacterized protein conserved in bacteria - Prom 2630 - 2689 11.2 - Term 2660 - 2695 5.1 4 3 Op 1 59/0.000 - CDS 2718 - 3119 654 ## PROTEIN SUPPORTED gi|19703673|ref|NP_603235.1| SSU ribosomal protein S9P 5 3 Op 2 . - CDS 3135 - 3569 739 ## PROTEIN SUPPORTED gi|19703672|ref|NP_603234.1| 50S ribosomal protein L13 - Prom 3642 - 3701 13.0 + Prom 3937 - 3996 14.1 6 4 Tu 1 . + CDS 4068 - 5417 894 ## PROTEIN SUPPORTED gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 + Prom 5419 - 5478 9.9 7 5 Op 1 36/0.000 + CDS 5603 - 6169 346 ## PROTEIN SUPPORTED gi|163801060|ref|ZP_02194960.1| 50S ribosomal protein L35 8 5 Op 2 46/0.000 + CDS 6247 - 6453 359 ## PROTEIN SUPPORTED gi|19703669|ref|NP_603231.1| 50S ribosomal protein L35P 9 5 Op 3 . + CDS 6478 - 6828 566 ## PROTEIN SUPPORTED gi|19703668|ref|NP_603230.1| 50S ribosomal protein L20 + Term 6921 - 6979 2.2 + TRNA 6875 - 6958 64.5 # Ser TGA 0 0 + TRNA 6982 - 7070 66.2 # Ser GCT 0 0 - Term 7128 - 7176 5.4 10 6 Op 1 . - CDS 7202 - 8089 1294 ## COG3588 Fructose-1,6-bisphosphate aldolase - Prom 8148 - 8207 5.6 11 6 Op 2 . - CDS 8245 - 10068 2365 ## COG0326 Molecular chaperone, HSP90 family - Prom 10114 - 10173 10.1 - Term 10150 - 10205 5.6 12 7 Tu 1 . 
- CDS 10220 - 10762 759 ## COG1853 Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family - Prom 10808 - 10867 9.1 + Prom 10810 - 10869 14.1 13 8 Op 1 1/0.750 + CDS 10897 - 11934 1115 ## COG3053 Citrate lyase synthetase 14 8 Op 2 . + CDS 11952 - 12458 560 ## COG3697 Phosphoribosyl-dephospho-CoA transferase (holo-ACP synthetase) - Term 12426 - 12473 6.2 15 9 Op 1 . - CDS 12476 - 13387 1233 ## EUBREC_2750 hypothetical protein 16 9 Op 2 . - CDS 13390 - 14502 1422 ## Vpar_1397 hypothetical protein - Prom 14686 - 14745 13.7 + Prom 14683 - 14742 10.4 17 10 Op 1 . + CDS 14768 - 15598 954 ## FN2078 DeoR family transcriptional regulator 18 10 Op 2 . + CDS 15614 - 16414 925 ## COG1262 Uncharacterized conserved protein 19 10 Op 3 . + CDS 16411 - 17565 1080 ## FN0976 hypothetical protein 20 10 Op 4 . + CDS 17522 - 18385 1213 ## COG1397 ADP-ribosylglycohydrolase + Term 18407 - 18442 -0.6 21 11 Tu 1 . + CDS 18723 - 19913 2131 ## COG0133 Tryptophan synthase beta chain Predicted protein(s) >gi|296155609|gb|ADVK01000001.1| GENE 1 2 - 505 740 167 aa, chain + ## HITS:1 COG:FN0333 KEGG:ns NR:ns ## COG: FN0333 COG1954 # Protein_GI_number: 19703676 # Func_class: K Transcription # Function: Glycerol-3-phosphate responsive antiterminator (mRNA-binding) # Organism: Fusobacterium nucleatum # 1 167 20 186 186 273 100.0 9e-74 ITLEKALNSNSELVFIILSNIMNIKDYTDKLKKVNKKVYIHVDMIDGLNGTNNGVDYIVN TVKPDGILTTKSNVVAHAYKNNINVIQRFFILDSLSYKKALLNIKENKVVAVEIMPGLMP KIIKKLSQETHIPIITGGLIKEKEDVINAINAGALSVSTTETKLWED >gi|296155609|gb|ADVK01000001.1| GENE 2 522 - 1577 1006 351 aa, chain + ## HITS:1 COG:FN0332 KEGG:ns NR:ns ## COG: FN0332 COG0598 # Protein_GI_number: 19703675 # Func_class: P Inorganic ion transport and metabolism # Function: Mg2+ and Co2+ transporters # Organism: Fusobacterium nucleatum # 1 351 1 351 351 631 100.0 0 MPNSNRKLGLLPGSVVYTGENPNYNITVTVIYYSKTFHKKDVFSADDKIDIDLEFDGNIW INIDGINDVSLIKRIGKIFNIDSLSLEDIANPEQRVKIDDRDSYIHIILKMLQMEILTKD 
VQYEQLSLIIKDNILITFQETPYDLFESIRSRLENPRTKLASKDVSYLAYILIDTIVDNY LLVLDEVENEIDDIENKLIESADKEDLENILALKQNIAVLKKFISPIRELISKLQARSML NYFHEDMKYYLGDLNDHGIIVFDTVDMLNNRATELIQLYHSMISNTMNEVMKILAIISTI FMPLSFIVGLYGMNFEYMPELKWHYGYFITLGLMAGLVILMIVYFKKKKWF >gi|296155609|gb|ADVK01000001.1| GENE 3 1614 - 2603 1375 329 aa, chain - ## HITS:1 COG:CAC0781 KEGG:ns NR:ns ## COG: CAC0781 COG4296 # Protein_GI_number: 15894068 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Clostridium acetobutylicum # 23 284 26 274 276 59 25.0 8e-09 MQEQEFSLSQENNSGDKVEKEKEIIALINSKGTFTSEITSENKSLLVANAELIAYIDCAT NELHKTKTKIEWLVTSEESKKNFIYNIEKYHIYHLKVKEVDTDNFLLIDVLERDLENELL KETLKECEQKAAIVIEEPDLGKFILDKNLKAFLSQIEWLNPKKQIGISLNIGENTRIKAL EKVGAFFVTLEKLLTNKKEWDKKLKVYAAENLVDLANELRKNLKGMFKFIKIWKWRFIGK IELISLAINPNGEFVATFDDKKLFVGHKIVVNGNINGELLNSAIVENFNIEDYKKVELAL PASEEKNENTLDGIPAVDNNMNDKEESKE >gi|296155609|gb|ADVK01000001.1| GENE 4 2718 - 3119 654 133 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19703673|ref|NP_603235.1| SSU ribosomal protein S9P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 133 1 133 133 256 99 8e-68 MAEKITQYLGTGRRKTSVARVRLIPGGQGVEINGKAMDEYFGGRAILSRIVEQPLALTET LNKFAVKVNVVGGGNSGQAGAIRHGVARALLLADESLKEALREAGFLTRDSRMVERKKYG KKKARRSPQFSKR >gi|296155609|gb|ADVK01000001.1| GENE 5 3135 - 3569 739 144 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19703672|ref|NP_603234.1| 50S ribosomal protein L13 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 144 1 144 144 289 99 1e-77 MKKYTFMQRKEDVVREWHHYDAEGQILGRLAVEIAKKLMGKEKLTFTPHIDGGDYVVVTN VEKIVVTGKKLTDKVYYNHSGFPGGIRARKLGEILAKKPEELLMLAVKRMLPKNKLGRQQ LTRLRVFAGAEHSHVAQKPNKVEL >gi|296155609|gb|ADVK01000001.1| GENE 6 4068 - 5417 894 449 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 [Haemophilus influenzae 22.4-21] # 3 443 5 440 456 348 41 1e-95 MLNFIASINELFWGAILILLLVGTGIFYTLKLKFVQVRKFKKGVSQLTGNFNINGKDADH NGMSSFQALATAIAAQVGTGNLAGAATAIVSGGPGAIFWMWVSAFFGMATIYAEAILSQL FKRKVEGEITGGPAYYIEELFNKNFLSKILAVFFALSCILALGFMGNGVQANSIGEAMKN AFNISPYITGSVIALLGGFVFFGGVKRIASFTEKVVPLMAGLYILICFVIIIINYSNIIG AFEAIFVNAFSMKSILGGFLGMGVKKAIRYGVARGLFSNEAGMGSTPHAHAIAKVKNPVE QGNVALITVFIDTFIVLTLTALVILTSNIGNGTLTGITLTQRAFENALGYSGTIFIAIAL FFFAFSTIIGWYFFGEANIKYLFGKKAINVYRILVMAAIFIGSTQKVELVWELADLFNGL MVIPNLIALIVLYKLVVNTSNEHDKLYNL >gi|296155609|gb|ADVK01000001.1| GENE 7 5603 - 6169 346 188 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163801060|ref|ZP_02194960.1| 50S ribosomal protein L35 [Vibrio campbellii AND4] # 26 186 1 165 166 137 43 4e-32 MYFRTGFCLFYFFQWRCSVISDKTRINEKIRGKEFRIISFDGEQLGIMTAEQALNLASSQ GYDLVEIAPSATPPVCKIMDYSKYKYEQTRKLKEAKKNQKQVVIKEIKVTARIDSHDLET KLNQVTKFLEKENKVKVTLVLFGREKMHANLGVTTLDEIAEKFSETAEVEKKYADKQKHL ILSPKKVK >gi|296155609|gb|ADVK01000001.1| GENE 8 6247 - 6453 359 68 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19703669|ref|NP_603231.1| 50S ribosomal protein L35P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 68 1 68 68 142 100 1e-33 MPKMKTHRGAKKRIKVTGTGKFVIKHSGKSHILTKKDRKRKNHLKKDAVVTETYKRHMQG LLPYGEGR >gi|296155609|gb|ADVK01000001.1| GENE 9 6478 - 6828 566 116 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19703668|ref|NP_603230.1| 50S ribosomal protein L20 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 116 1 116 116 222 100 1e-57 MRVKTGIIRRKRHKRVLKAAKGFRGASGDAFKQAKQATRRAMAFATRDRKVNKRRMRQLW ITRINSAARMNGVSYSVLMNGLKKAGILLDRKVLADIALNNATEFTKLVEAAKSAL >gi|296155609|gb|ADVK01000001.1| GENE 10 7202 - 8089 1294 295 aa, chain - ## HITS:1 COG:FN0322 KEGG:ns NR:ns ## COG: FN0322 COG3588 # Protein_GI_number: 19703667 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose-1,6-bisphosphate aldolase # Organism: Fusobacterium nucleatum # 1 295 1 295 295 559 100.0 1e-159 MNEKLEKMRNGKGFIAALDQSGGSTPKALKLYGVNENEYSNDKEMFDLIHKMRTRIIKSP AFNESKILGAILFEQTMDSKIDGKYTADFLWEEKKVLPFLKIDKGLNDLDADGVQTMKPN PTLADLLKRANERHIFGTKMRSVIKKASPAGIARVVEQQFEVAAQVVAAGLIPIIEPEVD INNVDKVQCEEILRDEIRKHLNALPETSNVMLKLTLPTVENLYEEFTKHPRVVRVVALSG GYSREKANDILSKNKGVIASFSRALTEGLSAQQTDEEFNKTLAASIDGIYEASVK >gi|296155609|gb|ADVK01000001.1| GENE 11 8245 - 10068 2365 607 aa, chain - ## HITS:1 COG:FN0321 KEGG:ns NR:ns ## COG: FN0321 COG0326 # Protein_GI_number: 19703666 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone, HSP90 family # Organism: Fusobacterium nucleatum # 1 607 1 607 607 1023 99.0 0 MKKEEKIFKAETKELLNLMIHSIYTNKEIFLRELISNANDAIDKLKFQSLTDTDILKDND KFRIDISVDKDNRTLTISDNGIGMTYEEVDDNIGTIAKSGSKLFKEQLEEAKKGDIDIIG QFGVGFYSGFIVADKITLETKSPYSENGVKWISSGDGNYEIEEIAKQDRGTKITLHLKDG DEYNEFLEDWKIKDLVKKYSNYIRYEIYFGDEVINSTKPIWKKDKKELKDEDYNEFYKAT FHDWNDPLLHINLKVQGNIEYNALLFIPKKLPFDYYTKNFKRGLQLYTKNVFIMEKCEDL IPEYFNFISGLVDCDSLSLNISREILQQNAELQVISKNLEKKITSELEKILKNDREKYVE FWKEFGRSIKAGVQDMFGMNKEKLQDLLIFVSSHDDKYTTLKEYVDRMGDNKEILYVPAE SVDAVKYLPKMEKLKEQGREVLILTDKIDEFTLMAMRDYSGKEFKSINSSDFKFSDDKEK EEEVKKIADENKELIQKAKEFLKDKVSEVELSNNIGNSASSLLAKGGLSLEMEKTLSEMT NNNDMPKAEKVLAINPEHVLFNRLKSSVNTEDFNKLVDVLYNQALLLEGFNIENPAEFIK NLNSLIK >gi|296155609|gb|ADVK01000001.1| GENE 12 10220 - 10762 759 180 aa, chain - ## HITS:1 COG:FN0320 KEGG:ns NR:ns ## COG: FN0320 COG1853 # Protein_GI_number: 19703665 # Func_class: R General function prediction only # Function: 
Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family # Organism: Fusobacterium nucleatum # 1 180 1 180 180 334 100.0 5e-92 MTKRKINVLDYSSEILKALSKGILLTVKGEEKVNTMAISWGALGIEWNKLLFTAYIRENR YTKAILDKTLDFTINIPLDKMDSKVFNISGTKSGRDIDKIKEANLTLVDSEIVSSPAIKE LPITLECKVLYKQKQVLDKLPEEIVKRDYPQDVDGTAVGSNRDPHTAYYGEIVAAYIIEE >gi|296155609|gb|ADVK01000001.1| GENE 13 10897 - 11934 1115 345 aa, chain + ## HITS:1 COG:FN0319 KEGG:ns NR:ns ## COG: FN0319 COG3053 # Protein_GI_number: 19703664 # Func_class: C Energy production and conversion # Function: Citrate lyase synthetase # Organism: Fusobacterium nucleatum # 1 345 1 345 345 643 99.0 0 MYISKIYPNDKKSLKLIDELLAKEEIRKDNNLDYTCAMFDDDMNIVATGSCFKNTLRCLA VDNSHQGEGLMNQIVTHLVDYEFSRGLTHLFLYTKNKSMKFFKDLGFFEIVNIENQIVFM ENRRTGFSDYLDNLKKDLKVGKNIASLIMNANPFTLGHQYLVEKASSENDVLHLFIVSDD SSLVPFEVRKKLVIEGTKHLKNICYHETGDYIISSATFPSYFQKDEVAVIESQANLDIEV FTKIAKVLNINKRYVGEEPNSLVTNIYNQTMVKKLPENNIECVVVPRKKYSDNVISASTV RQIIKNGNLEDLKNLVPETTYNYFLSDEAKAIIDKIRSQDNVIHY >gi|296155609|gb|ADVK01000001.1| GENE 14 11952 - 12458 560 168 aa, chain + ## HITS:1 COG:FN0318 KEGG:ns NR:ns ## COG: FN0318 COG3697 # Protein_GI_number: 19703663 # Func_class: H Coenzyme transport and metabolism; I Lipid transport and metabolism # Function: Phosphoribosyl-dephospho-CoA transferase (holo-ACP synthetase) # Organism: Fusobacterium nucleatum # 1 165 1 165 171 270 98.0 6e-73 MQGIEVGIDEVLNCREKRVAIQNEMIRKYKKPVISFTMNIPGPIKTNNEIKKAFDIGKVL ILEKLKENNIEILEIQELNENTGNELFISTNSLAKKIKNITVAIEEGYDLGRLFDIDVID INFEKLSRKSFRKCLICEEQAQECGRSRKHSVEELQNKVEKILSKGAK >gi|296155609|gb|ADVK01000001.1| GENE 15 12476 - 13387 1233 303 aa, chain - ## HITS:1 COG:no KEGG:EUBREC_2750 NR:ns ## KEGG: EUBREC_2750 # Name: not_defined # Def: hypothetical protein # Organism: E.rectale # Pathway: not_defined # 1 302 1 306 307 382 64.0 1e-105 MSNIIAVIWDFDKTLVDGYMQDPIFEKYDVDSKKFWEEVNSLPDKYWDEQKVKVNKDTIY LNHFINKTKEGVFKGLNNDLLFELGKELKFYKGIPEIFEKTKKIIEENSIFQEYNIKVEH 
YIVSTGMKNMIEGSIIKKHVEGIWGCELIQIKNKDNNWEISEIGYTIDNTSKTRAIFEIN KGINKNPEYDVNAKINEGNRRVLFKNMIYIADGPSDVPAFSVIKKGGGSTFAIYPKSDLK AFKQVEKLREDNRVDMYAEADYSEGTTTYMWITNKIEELAQNIVSEEKSKLEASISDSPK HLN >gi|296155609|gb|ADVK01000001.1| GENE 16 13390 - 14502 1422 370 aa, chain - ## HITS:1 COG:no KEGG:Vpar_1397 NR:ns ## KEGG: Vpar_1397 # Name: not_defined # Def: hypothetical protein # Organism: V.parvula # Pathway: not_defined # 1 369 1 332 332 137 27.0 7e-31 MSEVWRLQTKTEVSKKNKLEDKVANELIKRKIVAIGWTLREDIYNELTESEKNKLEDNEK SIKNDFEEYKKIIKKKYSTIKGEKKNFFNGKVNTNLFRLNNLKENDLVWIRSKGKYYLGR VTEKSHYLYAYRESQKNSDILKLGINNQFTDIEWQKIGSESDIPGRILIAFYHKGTLIEI DEKSVLDISQILYNKRDNYYKIPNKIVNNKENFYDLLSPYDCEDLLYFYLYHKNKYIAIP STNKTSTQNYEFEMVNPKDRNEKIYIQVKNGYSKGSDLYLENFKGIDGIVYLLTTAGNIY ETKTKKKLLQIDFKQNYEFEEIGSTENNKKIYVVNPEALYEFAKKAYKNESVLMPQSILQ WFEYLEEEKE >gi|296155609|gb|ADVK01000001.1| GENE 17 14768 - 15598 954 276 aa, chain + ## HITS:1 COG:no KEGG:FN2078 NR:ns ## KEGG: FN2078 # Name: not_defined # Def: DeoR family transcriptional regulator # Organism: F.nucleatum # Pathway: not_defined # 1 276 1 280 280 134 33.0 5e-30 MKKVGFVIPSFMYEILVGDMEYFRLKLGELGNKILSYYLGKTILGKLDFKTNSSERVQFN LSKTNEKILEQLKKEKKLEKEGQYFRNIYFTYINNLRYIRERIIFNRNFEDIENAIKFNR KIIIEYHSKIRTVNPYHICIANKEERSYLFCYCEVANDYRAFRVSEIKDIKILDIELERK DSLYIKNVKESFDPFLSFNKKVKVRFTERGVKRYEKALVNRPRLISKENDIYTFQCSEKM AKVYFPQFYAEVEILEPISLREELKKDFQKILDLYK >gi|296155609|gb|ADVK01000001.1| GENE 18 15614 - 16414 925 266 aa, chain + ## HITS:1 COG:BH0900 KEGG:ns NR:ns ## COG: BH0900 COG1262 # Protein_GI_number: 15613463 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Bacillus halodurans # 3 234 33 257 286 111 33.0 1e-24 MREDMVFVKGGKYKLYENNNIEVETFDLEVSKYLVTQKLWTDIMGRNPSIVKDILGMKPV ENITWWHALQFCNKLSELEGLRPVYDLSQIKKGKLYINQLNNEKSIDYYEDFLIDKNGKI GDFSKTEGYRLPTSIEWEWFAGGGQKAINEGTFGHIFYKRDNLDKIAWYCDNSFFTTHDP GEKMANELGIFDYIGNVYEWCHDQRCEFKKFKEEKGNTFYNMYKVIMGGSFCSDPYDFYP FTISYPSIKSSVEIGFRFIKTFRRII >gi|296155609|gb|ADVK01000001.1| GENE 19 
16411 - 17565 1080 384 aa, chain + ## HITS:1 COG:no KEGG:FN0976 NR:ns ## KEGG: FN0976 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 96 358 31 305 305 107 31.0 6e-22 MSDIFLLKSDKKISIKKILKLTGEFDSFKFQDIPDYDIYFDEKDENYNKDLEIKDDDEDY KILSSINVGDNHPYFLSQEILDAIFFDKENYSDERKKINNYEENNERLDMDEELLEYPLS VVGAVRIWKKNCIRGFEAFYDRFSSNYGVRTFSPCARKDWEEAIKYIIKLSKMLNTDIRT EDGEIYARENIEKYSYDKSILAGLNYLSKHISTFIDTNINYIFFNKEIVEKIKKSKDQIK TFEQTVHMIKKELDEKEAERFERKELKSISYTIYENKEVILFLNPNLDFFNRNLSEKSQY KIDFALSIKDTQNKYFLSHSVEYNTFIKMLPKDSYRYIDARRILVKPLKKEDIYLLSKKC YKEFIRLGGKNDRSNNWRCYWKFL >gi|296155609|gb|ADVK01000001.1| GENE 20 17522 - 18385 1213 287 aa, chain + ## HITS:1 COG:SP1044 KEGG:ns NR:ns ## COG: SP1044 COG1397 # Protein_GI_number: 15900915 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ADP-ribosylglycohydrolase # Organism: Streptococcus pneumoniae TIGR4 # 1 255 1 271 284 207 40.0 2e-53 MIGAIIGDVIGSFYEGKIKKAKSKNFELFTPYSICTDDTIMTIAVGQALVNTYQEKEISI IQKELIKEMQRLGKIYPYSRYGKQFSHWLREENPKPYNSFGNGSGMRVSSVAWLYDNLED VNKYAEITASVSHNHPEGIKGACAIASAIYLASQKKSKDEIKKYIEEKFEYILKPISQVV EEESNYGTSSQITIPVAIQAFLEGKDFEDILRTALFAGGDTDTIACMACSIAEIYYEIPD NLLKFAYSRMDLPLKKPLKNILMLIKEKNRLNDNLKKVFALLKNENI >gi|296155609|gb|ADVK01000001.1| GENE 21 18723 - 19913 2131 396 aa, chain + ## HITS:1 COG:FN0317 KEGG:ns NR:ns ## COG: FN0317 COG0133 # Protein_GI_number: 19703662 # Func_class: E Amino acid transport and metabolism # Function: Tryptophan synthase beta chain # Organism: Fusobacterium nucleatum # 1 395 1 395 395 737 96.0 0 MATENKKGYFGEFGGSYVPEVVQKALDKLEEAYNKYKDDEEFLKEYHHYLKDYSGRETPL YFAESLTNYLGGAKIYLKREDLNHLGAHKLNNVIGQILLAKRMGKKKVIAETGAGQHGVA TAAAAAKFGMQCDIYMGALDVERQRLNVFRMEMLGATVHAVEKGERTLKEAVDAAFKAWI NNIDDTFYVLGSAVGPHPYPSMVKDFQRVISQEARKQILEKENRLPDMIIACVGGGSNAI GAFAEFIPDKNVKLVGVEAAGKGVDTNRHAATLTLGTVGVLDGMKTYALFNEDGSVKPVY SISPGLDYPGIGPEHAFLRDSKRAEYVTATDDAAVNALLLLTKKEGIIPAIESSHALAEV IKRAPKLDKDKIIIVNISGRGDKDVAAIAEYLKNKN Prediction of 
potential genes in microbial genomes Time: Sat Jul 9 19:06:32 2011 Seq name: gi|296155607|gb|ADVK01000002.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00002, whole genome shotgun sequence Length of sequence - 1782 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 39 - 98 5.6 1 1 Tu 1 . + CDS 152 - 1627 1608 ## COG3666 Transposase and inactivated derivatives Predicted protein(s) >gi|296155607|gb|ADVK01000002.1| GENE 1 152 - 1627 1608 491 aa, chain + ## HITS:1 COG:FN0028 KEGG:ns NR:ns ## COG: FN0028 COG3666 # Protein_GI_number: 19703380 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 491 1 491 491 858 99.0 0 MIKSTNNNRFFKFFQPKLFYINKDIDKDDPVRLFSTILEEMDFSNLMQVFPNKTKVHPVN MFAIIIYAYSRGIYSTRDIEYLCKDSQRAQYLLNSHNIPDYSTIARFLSKATDVIHELFT QFVEKLFKLSEISTETIYIDGTKIEAYANKYSFVWKKSTLKYKERLEENILELIDDFNRY FNKDLDNIFGVFSYLENLNIQKVHGKGKRKSKEQVLLEKAESFIERFEKYTNYLEILGER NSFSKIDKDATFMRMKEDYMRNGQLKPGYNLQIGVISEYIASYEIFHNPSDSKTLIPFLE KIKSQNIEIQNVVADAGYESLPNYEYLETNNYVSYIKPIYYEKSKTRKYKKDLNKVENLD YDEKENRLFRKDGLELEFLNYSKDKKSIYFKNPETEKIVRYNKEFRRLSKISKSNIETEI GKQLRMNRSIQVEGAFAVLKEDMKLRKLKVRGKESAKREIGLFCIAYNFNRYLAKLIRKK QGVILHPLKTA Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:06:33 2011 Seq name: gi|296155604|gb|ADVK01000003.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00003, whole genome shotgun sequence Length of sequence - 1557 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 114 - 1289 774 ## COG3547 Transposase and inactivated derivatives - Prom 1493 - 1552 11.8 Predicted protein(s) >gi|296155604|gb|ADVK01000003.1| GENE 1 114 - 1289 774 391 aa, chain - ## HITS:1 COG:FN1357 KEGG:ns NR:ns ## COG: FN1357 COG3547 # Protein_GI_number: 19704692 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 391 1 391 391 693 99.0 0 MFLLGIDIAKLNHVASCIDSSTNEIVFSNFKFKNDFKGFSTLLDKIKTFDTKNLIIGLES TSHYGENLINFLFKQHFKVALINPLQTSHLRKANIRDAKNDNLDSLNIAKSLIFAKLNFI SEKNINCFSLKKLTRFRSSLIKQRSKAKIQLTSLLDLLFPELQYLFKSKIHSKAIYSLLK KYPSAEEIAALKDDEISNLLYASSKGHFKKEKSIELKSLAKTTVGIKDTSISLHVIQLIE LIELYDKQIKDIVTKIADTVDKLDTKLLSVPGISIIACAIILGETNNFNNFSDSTKLLAF AGLDPKIRQSGNFNASSCRMSKKGSPYLRYALIFTAWNIVRHSEKFNKYYCLKRSQGKSH YNSLGHVAHKLVRVIFTLIKKNIVYQEENLE Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:06:53 2011 Seq name: gi|296155529|gb|ADVK01000004.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00005, whole genome shotgun sequence Length of sequence - 69905 bp Number of predicted genes - 75, with homology - 73 Number of transcription units - 20, operones - 16 average op.length - 4.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 13 - 53 -1.0 1 1 Op 1 1/1.000 - CDS 59 - 1177 1159 ## COG0053 Predicted Co/Zn/Cd cation transporters 2 1 Op 2 1/1.000 - CDS 1164 - 5591 5016 ## COG1112 Superfamily I DNA and RNA helicases and helicase subunits - Term 5605 - 5642 4.0 3 2 Op 1 4/0.000 - CDS 5657 - 6403 946 ## COG2099 Precorrin-6x reductase 4 2 Op 2 6/0.000 - CDS 6429 - 7178 1160 ## COG1010 Precorrin-3B methylase 5 2 Op 3 . - CDS 7171 - 8184 1259 ## COG2073 Cobalamin biosynthesis protein CbiG 6 2 Op 4 . - CDS 8197 - 8928 547 ## FN0953 hypothetical protein - Prom 8975 - 9034 4.7 7 3 Tu 1 . - CDS 9051 - 10298 1203 ## COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif - Prom 10429 - 10488 8.4 8 4 Op 1 . 
- CDS 10610 - 11137 711 ## FN0955 hypothetical protein 9 4 Op 2 . - CDS 11140 - 11388 408 ## FN0956 hypothetical protein - Prom 11410 - 11469 10.3 10 5 Op 1 . - CDS 11488 - 12261 1291 ## COG2875 Precorrin-4 methylase 11 5 Op 2 . - CDS 12299 - 12760 509 ## FN0958 hypothetical protein 12 5 Op 3 . - CDS 12772 - 12966 278 ## FN0958 hypothetical protein 13 5 Op 4 . - CDS 12981 - 13703 1156 ## COG2243 Precorrin-2 methylase 14 5 Op 5 . - CDS 13730 - 14155 424 ## FN0960 hypothetical protein 15 5 Op 6 . - CDS 14186 - 14290 209 ## 16 5 Op 7 . - CDS 14325 - 14828 431 ## FN0963 hypothetical protein - Prom 14859 - 14918 6.0 17 6 Op 1 1/1.000 - CDS 14932 - 15501 942 ## COG2242 Precorrin-6B methylase 2 18 6 Op 2 1/1.000 - CDS 15523 - 16488 1268 ## COG1052 Lactate dehydrogenase and related dehydrogenases 19 6 Op 3 6/0.000 - CDS 16475 - 17131 800 ## COG2241 Precorrin-6B methylase 1 20 6 Op 4 . - CDS 17118 - 18245 1717 ## COG1903 Cobalamin biosynthesis protein CbiD 21 6 Op 5 . - CDS 18261 - 19004 641 ## FN0968 hypothetical protein 22 6 Op 6 . - CDS 19001 - 19777 664 ## FN0969 hypothetical protein 23 6 Op 7 1/1.000 - CDS 19802 - 20452 991 ## COG2082 Precorrin isomerase 24 6 Op 8 1/1.000 - CDS 20476 - 21468 1131 ## COG3177 Uncharacterized conserved protein 25 6 Op 9 3/0.000 - CDS 21468 - 22802 1748 ## COG1797 Cobyrinic acid a,c-diamide synthase 26 6 Op 10 1/1.000 - CDS 22821 - 23894 1075 ## COG0079 Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 27 6 Op 11 1/1.000 - CDS 23968 - 24339 435 ## COG0346 Lactoylglutathione lyase and related lyases 28 6 Op 12 . - CDS 24344 - 25321 1135 ## COG1270 Cobalamin biosynthesis protein CobD/CbiB 29 6 Op 13 . - CDS 25341 - 26258 1145 ## FN0976 hypothetical protein 30 6 Op 14 1/1.000 - CDS 26285 - 27775 1994 ## COG1492 Cobyric acid synthase - Prom 27801 - 27860 10.5 31 6 Op 15 . - CDS 27865 - 29160 1316 ## COG1757 Na+/H+ antiporter - Prom 29234 - 29293 13.4 + Prom 29213 - 29272 9.2 32 7 Op 1 . 
+ CDS 29341 - 29709 327 ## FN0979 hypothetical protein 33 7 Op 2 . + CDS 29723 - 29980 288 ## FN0980 hypothetical protein + Term 29985 - 30042 10.0 - Term 29971 - 30028 10.0 34 8 Op 1 17/0.000 - CDS 30035 - 31315 1852 ## COG0151 Phosphoribosylamine-glycine ligase 35 8 Op 2 . - CDS 31332 - 32846 2119 ## COG0138 AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 36 8 Op 3 . - CDS 32872 - 33642 952 ## FN0983 hypothetical protein 37 8 Op 4 1/1.000 - CDS 33680 - 34489 926 ## COG3315 O-Methyltransferase involved in polyketide biosynthesis 38 8 Op 5 21/0.000 - CDS 34509 - 35051 753 ## COG0299 Folate-dependent phosphoribosylglycinamide formyltransferase PurN 39 8 Op 6 13/0.000 - CDS 35039 - 36058 815 ## PROTEIN SUPPORTED gi|169632702|ref|YP_001706438.1| phosphoribosylaminoimidazole synthetase 40 8 Op 7 2/0.000 - CDS 36058 - 37404 1959 ## COG0034 Glutamine phosphoribosylpyrophosphate amidotransferase 41 8 Op 8 4/0.000 - CDS 37449 - 38162 1177 ## COG0152 Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase - Prom 38200 - 38259 8.2 42 8 Op 9 1/1.000 - CDS 38291 - 38764 765 ## COG0041 Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase 43 8 Op 10 . - CDS 38776 - 42513 4844 ## COG0046 Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain - Term 42548 - 42576 -0.9 44 8 Op 11 . - CDS 42578 - 42673 154 ## - Prom 42708 - 42767 6.6 + Prom 42827 - 42886 8.9 45 9 Op 1 1/1.000 + CDS 42957 - 43748 1131 ## COG1183 Phosphatidylserine synthase 46 9 Op 2 1/1.000 + CDS 43751 - 44827 1023 ## COG0859 ADP-heptose:LPS heptosyltransferase 47 10 Op 1 . + CDS 44944 - 46281 1282 ## COG0168 Trk-type K+ transport systems, membrane components 48 10 Op 2 . + CDS 46316 - 47101 768 ## FN0994 hypothetical protein 49 10 Op 3 . + CDS 47119 - 47706 454 ## FN0995 hypothetical protein + Term 47743 - 47781 8.1 - Term 47727 - 47771 5.5 50 11 Op 1 . - CDS 47776 - 48474 417 ## COG5522 Predicted integral membrane protein 51 11 Op 2 . 
- CDS 48487 - 49137 820 ## FN0997 hypothetical protein - Prom 49310 - 49369 12.1 + Prom 49136 - 49195 10.0 52 12 Op 1 1/1.000 + CDS 49400 - 50902 2017 ## COG0747 ABC-type dipeptide transport system, periplasmic component 53 12 Op 2 1/1.000 + CDS 50916 - 51950 1440 ## COG1363 Cellulase M and related proteins + Term 51954 - 52017 14.7 + Prom 51953 - 52012 8.6 54 13 Op 1 4/0.000 + CDS 52035 - 53174 1264 ## COG0502 Biotin synthase and related enzymes 55 13 Op 2 12/0.000 + CDS 53164 - 53823 736 ## COG0132 Dethiobiotin synthetase + Prom 53826 - 53885 4.7 56 13 Op 3 1/1.000 + CDS 53909 - 55249 1632 ## COG0161 Adenosylmethionine-8-amino-7-oxononanoate aminotransferase + Prom 55251 - 55310 3.9 57 14 Op 1 1/1.000 + CDS 55345 - 56796 2190 ## COG2067 Long-chain fatty acid transport protein 58 14 Op 2 . + CDS 56815 - 57381 680 ## COG1309 Transcriptional regulator + Prom 57416 - 57475 9.8 59 15 Tu 1 . + CDS 57500 - 58057 723 ## FN1005 hypothetical protein + Term 58068 - 58112 7.5 60 16 Tu 1 . - CDS 58111 - 58524 678 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases + Prom 58547 - 58606 15.0 61 17 Tu 1 . + CDS 58708 - 59544 569 ## FN1007 hypothetical protein - Term 59548 - 59604 8.0 62 18 Op 1 . - CDS 59620 - 60138 685 ## FN1008 hypothetical protein 63 18 Op 2 . - CDS 60179 - 60670 641 ## Celal_1460 hypothetical protein 64 18 Op 3 . - CDS 60685 - 61056 513 ## FN1009 hypothetical protein 65 18 Op 4 . - CDS 61060 - 61191 93 ## gi|296327343|ref|ZP_06869895.1| conserved hypothetical protein - Prom 61318 - 61377 10.6 - Term 61334 - 61369 0.3 66 19 Op 1 1/1.000 - CDS 61389 - 61685 443 ## COG1799 Uncharacterized protein conserved in bacteria - Prom 61711 - 61770 12.4 67 19 Op 2 . - CDS 61772 - 62737 279 ## PROTEIN SUPPORTED gi|149007035|ref|ZP_01830704.1| 50S ribosomal protein L31 type B 68 19 Op 3 . - CDS 62757 - 64604 2112 ## COG1493 Serine kinase of the HPr protein, regulates carbohydrate metabolism 69 19 Op 4 . 
- CDS 64629 - 65414 1166 ## FN1013 hypothetical protein 70 19 Op 5 1/1.000 - CDS 65407 - 66642 1765 ## COG0285 Folylpolyglutamate synthase 71 19 Op 6 . - CDS 66654 - 67355 982 ## COG0775 Nucleoside phosphorylase - Prom 67405 - 67464 12.7 + Prom 67365 - 67424 9.3 72 20 Op 1 . + CDS 67475 - 68371 1113 ## COG1560 Lauroyl/myristoyl acyltransferase 73 20 Op 2 . + CDS 68368 - 69189 1158 ## FN1017 hypothetical protein 74 20 Op 3 . + CDS 69216 - 69494 242 ## gi|296327352|ref|ZP_06869904.1| hypothetical protein HMPREF0397_0097 75 20 Op 4 . + CDS 69516 - 69665 140 ## gi|289765197|ref|ZP_06524575.1| conserved hypothetical protein Predicted protein(s) >gi|296155529|gb|ADVK01000004.1| GENE 1 59 - 1177 1159 372 aa, chain - ## HITS:1 COG:FN0948 KEGG:ns NR:ns ## COG: FN0948 COG0053 # Protein_GI_number: 19704283 # Func_class: P Inorganic ion transport and metabolism # Function: Predicted Co/Zn/Cd cation transporters # Organism: Fusobacterium nucleatum # 1 372 1 372 372 589 98.0 1e-168 MKKIKEEKRENVIIKTSIIGIFINLLLVTFKAIIGLISNSIAILLDAVNNLSDALSSIVT IISTKIADLEPDKEHPLGHGRIEYLSAMIVAGIIFYAGITSLIESIKKIFVPVEVEYSNI TFIILIVSIIIKLLLGKFVKNIGEKFNSPSLVASGSDATFDAILSSSALVSAILYIFTDI NIEAYVGVLISIFIIKSGIEIFMDAVNEILGKRVDKETINEIKRTICKIENVYGAYDLML HNYGPDKYVGSVHIEVPDSMTAEEIDPLERKISNIVLEKHNIYLTGITVYSMNTKNMDIV KLRYKIYKIVMSNDGVLEFHGFYLEEKNKSIRFDIIIDYSIKNREEIYDKILNDVKKEYP DYSINIKVDIDI >gi|296155529|gb|ADVK01000004.1| GENE 2 1164 - 5591 5016 1475 aa, chain - ## HITS:1 COG:FN0949 KEGG:ns NR:ns ## COG: FN0949 COG1112 # Protein_GI_number: 19704284 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases and helicase subunits # Organism: Fusobacterium nucleatum # 51 1475 1 1425 1425 2468 99.0 0 MNKRESIIALYQYIAEVIKSLKTEKKDIYNEEWYYFLESLPKHSGITFNYLDNKNNLSYQ KILQVEKLPFLKPLAIDEELLEWISGDWGDYKSSVKLLSEKIIKENNSLKVANISDEEKE ILEKLLKDRQLWIEEQKKIENVRKLFDVLYSKYLSLDRDSDTLELLVGNGIVKVPNEDIC YPVLLKKVNFSLDAERNLITIVDSSDDDFITQELYLNFLAEVENVNLDNVFKLGDKIIEN 
NIHPISKNDVIKDFFREFIHNLNPRAEFIEDKKISDEDNIITIEWKPILFIRKKDDGKIE AINNIIKDIEEGGEVPGYLTELVGIIENDKKEIEDIPDILFTKETNNEQVEIIKNIYSHK AVVVQGPPGTGKTHTIANLLGHFLAEGKNVLITSQTRKALEVLKEKIPNEIQDLCISMLD DDSSDLGNSVESISEKLGYLNLEKLKNEYEEIERNRNDLKEDIKNIKRKIFNIKYQESKP IIYNNESISLKEAGEFLRKNERELDKIPGIVSSGVLCPVNNEELEFLKTGYKKSVSKEEE KEIELGLNKISDFWSLEEFEGMLKAKKEFKSEIEFLLENKKYHINDEILYIDDNIVIDLK KFKNYTNIDNIIPEELKLIEDWKKDVCIAGTENSGDRKIWLDFIKDIRRLYELTNNTKDK FFKKDIVYKDIDVSTAKKLIIALKDGLEKPGLFFKHKLRKAKKEIADRVTINKRILEIPY DCDVALEYTSLAELEENTKNSWKLLMTGNTLMDKESNNKNFFKQLYSYADQMEYLLNWYD KERKIFLNRVENAGFERLDISKKEGSPIYVDEINQIFDFIPKLEELITIGKVGLKYSEID KKRTEYLEKIEGIIKENSFLGSEIKNAIEKENTEKYSETLKKLEVLAGKEELYRKHKNLL KNIKTVANLWADELEKGLFNEKVENIYNTWRYKQISQTLKELIEKPYESLQEDILEKSEE LNKLTAELVTKKTWYNIVKFIEEKDNLAISQALRGWKQTIQKIGKGTGKNTVLYKKHAKE KMLLCQKVVPAWIMPLNKVFDTLNPVENKFDIVIVDEASQSDISSLILLYMAKKIIVVGD DKQVSPSDVGVNIDKINMFRRKYIKGNVANDDLYGVRASLYSIVSTTFQPISLREHFRSV PEIIGYSNKTSYDNQILPLRDSNSSILKPAIVEYKVDGKRDEKNKVNKVEAETIVTLIEA CLDMKEYKNSSFGVISLLGDEQAELIQNLIVKRIPAIEIENHKILCGNPASFQGDERDVM FISLVDSSEENKNLRLVGEGVEGATRKRYNVAISRAKDQLWLVHSIDKNSLKEGDLRKEL FEYINSVKENTFEKIINENTVISDFENEITKHLLERNYTVKQQWKVGSYDIDIVAIYRDK KIAIECDGKNLNHSQEKIIANLAEQEVLERCGWEFIRVRASQYFRNPDKALKELILQLDD KGIYPNNKENHSNGNELLNSIKSKALELMEKYEEN >gi|296155529|gb|ADVK01000004.1| GENE 3 5657 - 6403 946 248 aa, chain - ## HITS:1 COG:FN0950 KEGG:ns NR:ns ## COG: FN0950 COG2099 # Protein_GI_number: 19704285 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-6x reductase # Organism: Fusobacterium nucleatum # 1 248 19 266 266 435 99.0 1e-122 MIWVIGGTKDSRDFLEKFIRYDKDIIVSTATEYGAKLLENLPVKTLSKKMDKEAMLKFVE DNKITKVIDTSHPYAFEVSKNAMEVAEEKNIEYFRFEREKVDILPKKYKNFEEIEDLIEY VEKLDGNILVTLGSNNVPLFKDLKNLSNIYFRILSRWDMVKKCEDNNILPKNIIAMQGPF TENMNVAMMEQFNIKYLISKKAGDTGGEREKVSACDKLDVEIIYLEKQEIIYKNCYKDIS ILIKNLVQ >gi|296155529|gb|ADVK01000004.1| GENE 4 6429 - 7178 1160 249 aa, chain - ## HITS:1 COG:FN0951 KEGG:ns NR:ns ## COG: FN0951 COG1010 # Protein_GI_number: 19704286 # Func_class: 
H Coenzyme transport and metabolism # Function: Precorrin-3B methylase # Organism: Fusobacterium nucleatum # 1 249 1 249 249 480 99.0 1e-136 MNNGKIYVVGIGPGNMEDISIRAYNILKNVNVIAGYTTYVDLVRDEFSDKEFLVSGMKRE IERCKEVLEVAKIGKNVALISSGDAGIYGMAGIMLEVAMGSGIEVQVVPGITSTIAGAAL VGAPLMHDQAIISLSDLLTEWEVIKKRIECASQGDFAISLYNPKSKGRTEQIVEAREIML KYKLPTTPVALLRHIGRKEENYTLTTLEDFLNYEIDMFTIVLVGNSNTYVKDGKMITPRG YEKKSNWGK >gi|296155529|gb|ADVK01000004.1| GENE 5 7171 - 8184 1259 337 aa, chain - ## HITS:1 COG:FN0952 KEGG:ns NR:ns ## COG: FN0952 COG2073 # Protein_GI_number: 19704287 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CbiG # Organism: Fusobacterium nucleatum # 1 322 1 322 337 560 98.0 1e-159 MKLAFWTVTKGAGNIAREYKEKLKEHLKDYEIDVFTLKKYDVENTIQIKNFTCNINEKFS QYDGHIFIMASGIVIRKIASLIDTKDKDPAVLLIDEGKHFVISLLSGHLGRANELTYSLA NILKLVPVITTSSDVTGKIAVDTISQKLNAELEDLKSAKDVTSLIVNGQKVNILLPKNVK VTDKNSADGFILVSNKKNIEYTRIYPKNLILGIGCKKDTKAEDILSAIEDCLDKNNLDIK SVKKIATVDVKENEKGLIDAVKFLNLDLEIISREEIKKVQDQFDGSDFVEKNIGVRAVSE PVALLSSTGNGKFLVMKEKCNGITISIYEEEIEKIYE >gi|296155529|gb|ADVK01000004.1| GENE 6 8197 - 8928 547 243 aa, chain - ## HITS:1 COG:no KEGG:FN0953 NR:ns ## KEGG: FN0953 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 243 1 243 243 445 98.0 1e-124 MPNYYYDGSFDGLLTVIYMAYNDRESNMLRVNAKAEQLILELDDIHIITDFSKARRVEKA ICDKLSQDFLNKIRTCFLSNDKNKDTIIIHTVYKALKQGEEILNSLDEHAFYMNKLVKQV LNERHRYLGLLRFKEMKDGIMFSTIEPKNNVLPILISHFKNRMKREKIAIFDKGRKMIVY YDGKKAEIFFVESLEIEWSDEEIEYSELWKTFHKSISIKERENKKLQQSNIPKYYWKHLV EDM >gi|296155529|gb|ADVK01000004.1| GENE 7 9051 - 10298 1203 415 aa, chain - ## HITS:1 COG:FN0954 KEGG:ns NR:ns ## COG: FN0954 COG4277 # Protein_GI_number: 19704289 # Func_class: R General function prediction only # Function: Predicted DNA-binding protein with the Helix-hairpin-helix motif # Organism: Fusobacterium nucleatum # 1 415 1 415 415 805 99.0 0 MSKSIEEKLRILSDAAKYDVSCSSSGSSRKNTNNGLGNAAINGICHSWSADGRCISLLKI 
LMTNYCIYDCKYCINRKDNDIERAILTPDEIVKLTINFYRRNYIEGLFLSSGIIKNVDYT MELMIAVAKKLRLEEKFNGYIHMKVIPGASRQLINEIGLYVDRVSVNIEFAENRALKLLA PDKKPTDISTSMGLIRKNMLENAEDKRLFKSTPSFIPAGQTTQMIIGASGESDYAILSRS ENLYKNFDLKRVYYSGYVPVNKSGILVSADQSVPMIREHRLYQADWLLRFYDFRADEILN EKDPFVDPFLDPKTNWAIKNSHLFPIEINKASYKELLRVPGIGVTSAKRIVMTRKYSTIR YEHLKKLGIVIKRAKYFITVNGEFLGFKKENPELIRNALMEKEKMVAEQLKLFNV >gi|296155529|gb|ADVK01000004.1| GENE 8 10610 - 11137 711 175 aa, chain - ## HITS:1 COG:no KEGG:FN0955 NR:ns ## KEGG: FN0955 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 175 1 175 175 316 99.0 3e-85 MKRKFINVTKKYIEDLAPTDFCVELIQPAWETVNIYGTYEEYEETLKPYTIEQRYLLAMH WLGAEVANGGFQQFLGNSTAIVWEDAYKGYQAIGSEKLTYLIEELIKIYGRNIPFDREER ANMLENFSEEKLEEIDTVTDLYYEIEETEWRKVTLWVKADSEKFLIQAEINDYGN >gi|296155529|gb|ADVK01000004.1| GENE 9 11140 - 11388 408 82 aa, chain - ## HITS:1 COG:no KEGG:FN0956 NR:ns ## KEGG: FN0956 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 82 1 82 82 155 98.0 7e-37 MKALKNKTWQFEKSGIGGKVELFGVNIFDYKWEDTYTVAVLDPKYNNEHYLHVYRVIINE KEYEFATGEFSNGCWCFYLPKE >gi|296155529|gb|ADVK01000004.1| GENE 10 11488 - 12261 1291 257 aa, chain - ## HITS:1 COG:FN0957 KEGG:ns NR:ns ## COG: FN0957 COG2875 # Protein_GI_number: 19704292 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-4 methylase # Organism: Fusobacterium nucleatum # 1 257 1 257 257 463 95.0 1e-130 MEKYKEKVYFIGAGPGDPELITIKGQRIVKEADVIIYAGSLVPKEVIDCHKDGAEIYNSA SMTLDEVIDVTVKAIKENKKVARVHTGDPAIYGAHREQMDMLDEYGIEYEVIPGVSSFLA SAAALKKEFTLPNVSQTVICTRIEGRTAVPEKEGLESLAKHRASMAIFLSVHMIDKVVET LATSYPMTTPVAVVQRASWPDQKIVLGTLETIEQKVKEAGINKTAQILVGDFLGNEYEKS KLYDKYFSHEYRKGIKK >gi|296155529|gb|ADVK01000004.1| GENE 11 12299 - 12760 509 153 aa, chain - ## HITS:1 COG:no KEGG:FN0958 NR:ns ## KEGG: FN0958 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 2 153 75 226 226 239 92.0 3e-62 
MLEAYNRRRVEKIDDIKNRISYKDKKFDFVNNYFAELVEKNKAVVYDRKNEKEIYYLIKV KYDNAIYYYGGGGSLYNGYNFYADKEWTELALQSDIITQFGFEIHSSIGDNPYNRELSPE AKKNFENNKGFNQRKELYEKAMQTPNVTQSFSY >gi|296155529|gb|ADVK01000004.1| GENE 12 12772 - 12966 278 64 aa, chain - ## HITS:1 COG:no KEGG:FN0958 NR:ns ## KEGG: FN0958 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 62 1 62 226 102 90.0 5e-21 MKKLLAILFLIMTVQGIAETIVKGTYETKRKRYIELTETFEKEIFLNYTLPDNKKEIVTY KGLK >gi|296155529|gb|ADVK01000004.1| GENE 13 12981 - 13703 1156 240 aa, chain - ## HITS:1 COG:FN0959 KEGG:ns NR:ns ## COG: FN0959 COG2243 # Protein_GI_number: 19704294 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-2 methylase # Organism: Fusobacterium nucleatum # 1 240 9 248 248 416 93.0 1e-116 MTNKFYGIGVGVGDPEEITLKAVNTLKKLDVVILPEAKKDEGSVAYEIAKEYMKEDVERV FVEFPMLKSLEDRENARKENAKIVQKLLDEGKNVGFLTIGDTMTYSTYVYILEHLPEKYL VETIPGISSFVDMASRFNFPLMIGDETLKVVPLNKKTNIEFELENNDNIVFMKVSRNFEN LKQALIKTENIDRIIMVSNCGKENQKVYYDIKDLTEEDIPYFTTLIVKKGGFEKWKKFNI >gi|296155529|gb|ADVK01000004.1| GENE 14 13730 - 14155 424 141 aa, chain - ## HITS:1 COG:no KEGG:FN0960 NR:ns ## KEGG: FN0960 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 115 45 159 164 157 79.0 1e-37 MESDNHPIIETILFQPGSYLSIKAYYLDISKEGIYKKVEADIPDILAVFENIMSKVENKK EYRIPNHRSSKFKCYEYSYYENDKEIYAIATGIFMKGYLLKVNIISTIKKEVEKAMYYLF NIKEADPKEMEFLKKINSYKE >gi|296155529|gb|ADVK01000004.1| GENE 15 14186 - 14290 209 34 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MQIILLILKLFGFLIPEKKEFYIDDKWTIELPAN >gi|296155529|gb|ADVK01000004.1| GENE 16 14325 - 14828 431 167 aa, chain - ## HITS:1 COG:no KEGG:FN0963 NR:ns ## KEGG: FN0963 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 15 167 1 153 153 232 96.0 3e-60 MKLKKISLFTIILVLILSFTTYSIEKTEVFNIDNEWTISLPEDWQMEGKSSYQQYYSFYP NIPFYSTIKIYNYDIEENQNINLLEDLKKVHPNSNIKKKKLDLNKIKIKNTKVEAYEYSF DENGTKLYAISYFILIKGNLLIANFYSISKKEIEKMLKYFYSIEKIN 
>gi|296155529|gb|ADVK01000004.1| GENE 17 14932 - 15501 942 189 aa, chain - ## HITS:1 COG:FN0964 KEGG:ns NR:ns ## COG: FN0964 COG2242 # Protein_GI_number: 19704299 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-6B methylase 2 # Organism: Fusobacterium nucleatum # 1 189 1 189 189 360 100.0 1e-100 MHIYDKDFTQTELPMTKQEIRAISIAKLMLKPNSILIDVGAGTGTIGIEAATYMPQGKVY AIEKEEKGLDTIKLNAEKFNLDNFELIHGKAPDAIPNIPYDRMFIGGSTGGIEEIINHFL TYAKDEAILVINCITLETQSKSLEILKEKGFKDIEIVTVTIGRAKRVGPYTMMFGENPIC IIKVVKRNK >gi|296155529|gb|ADVK01000004.1| GENE 18 15523 - 16488 1268 321 aa, chain - ## HITS:1 COG:FN0965 KEGG:ns NR:ns ## COG: FN0965 COG1052 # Protein_GI_number: 19704300 # Func_class: C Energy production and conversion; H Coenzyme transport and metabolism; R General function prediction only # Function: Lactate dehydrogenase and related dehydrogenases # Organism: Fusobacterium nucleatum # 1 321 1 321 321 602 100.0 1e-172 MEKNKLKILFLDRNAVGPYELKGIFSKYGEYTELNLTNNDDIASYLKDYDVVILNRIRLG KKEFEKTPHLKLVLLTGTGYNHIDLVAAKEYGVTIANVANYSTNSVSQLTMTLLLNELTR AEKLSQEVKQNKWEEISNKMDRYYHVDTEGKVLGILGHGNIGKKVESYAKSFGMEVMIAK IPGREYTDNLENRFELDEVLEKCDIFSIHAPLTDLTRDLINLDRMKKMKKSAIILNLGRG PIINEEDLYYVLKNKIISSAATDVMTTEPPQNDCKLLELDNFTVTPHLAWKSLKSVERLF AAIENNLNLFLENKLIGLESK >gi|296155529|gb|ADVK01000004.1| GENE 19 16475 - 17131 800 218 aa, chain - ## HITS:1 COG:FN0966 KEGG:ns NR:ns ## COG: FN0966 COG2241 # Protein_GI_number: 19704301 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin-6B methylase 1 # Organism: Fusobacterium nucleatum # 1 218 12 229 229 374 99.0 1e-104 MQLNKINVVGLGPGNIKYLSTAGIDCIKQAEIIIGSTRQLSDLKTIISEKQEIYILGKLN ELITYLKENVERKITIIVSGDTGYYSLVPYLSKNLSKNILNIIPNISSYQYLFSKLGENW QNFRLASVHGREFDYIKNINDEDIEGLVLLTDDIQNPYEITKKLYNNGIRNLTVIVGENL SYDNEKITILEIEDYKKLNRKFDMNVLILKKGENYGKK >gi|296155529|gb|ADVK01000004.1| GENE 20 17118 - 18245 1717 375 aa, chain - ## HITS:1 COG:FN0967 KEGG:ns NR:ns ## COG: FN0967 COG1903 # Protein_GI_number: 19704302 # Func_class: H Coenzyme 
transport and metabolism # Function: Cobalamin biosynthesis protein CbiD # Organism: Fusobacterium nucleatum # 1 375 1 375 375 721 99.0 0 MEEKELKNGYTTGTCATAAVKVALEALIYGKKATEVDITTLNYTNLKIPVQKLRVRNNFA SCAIQKYAGDDPDVTNGISICAKVQLVKELPKVDRGAYYDNCVIIGGRGVGFVTKKGLQI AVGKSAINPGPQKMITSVVNEILDGSDEKVIITIYVPEGRAKALKTYNPKMGVIGGISVL GTTGIVKAMSEDALKKSMFAELKVMREDKNRDWVIFAFGNYGERHCQKIGLDTEQLIIIS NFVGFMIEAAVKLEFKKIIMLGHIAKAIKVAGGIFNTHSRVADGRMETMAACAFLVDEKP EIIRKILASNTIEEACDYIEKKEIYHLIANRVAFKMQEYARADIEVSAAIFSFKGETIGE SDNYQRMVGECGAIK >gi|296155529|gb|ADVK01000004.1| GENE 21 18261 - 19004 641 247 aa, chain - ## HITS:1 COG:no KEGG:FN0968 NR:ns ## KEGG: FN0968 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 247 1 247 247 410 100.0 1e-113 MKIISKFKDFYDYKVVKYGVDEKLIYNRKTYCEYFQGFFRDINIDYRISEDDFNKNLKEN TKVTDEKNIHKILFIGEKLIHLFFTENGVYTHFDIKNEENLRKLTNFEYKKEITFKDGEK FNIYSRFRSDWDYLLSYNRKKLINLNINKDDIILNEPILLIEYIGECNNEKAKRYYSPSL YKFIYNPNLSQMGVYIDTDFVWQSLVEFLSNKRSEKEISPEISNENKILSKGFDLKTSFR PNMKKKK >gi|296155529|gb|ADVK01000004.1| GENE 22 19001 - 19777 664 258 aa, chain - ## HITS:1 COG:no KEGG:FN0969 NR:ns ## KEGG: FN0969 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 258 1 258 258 431 100.0 1e-119 MKIISRFKDFYDYKVAKYGVDEKLVYNRKTYYDYYKILHINILTDENGRVSIEDFNNNLK ENIKYFKHNNHNKVLIVGEKIVHLFFTEDKVYTHFDIKNPEKIADHIYKYWAYYTDTKEI IFNDEKKFEFFITFNDIWNDLFSYNRKRFLSYLNTPNDDILFNEPMILIEYVGKADRKTV RFDNSIYKITYNPNLSQMGIYFDEDFIWQSLVEFLSNKRSEKEITPEVSNKNKILSKGFD LKTSFRPNMKKKKHKGDV >gi|296155529|gb|ADVK01000004.1| GENE 23 19802 - 20452 991 216 aa, chain - ## HITS:1 COG:FN0970 KEGG:ns NR:ns ## COG: FN0970 COG2082 # Protein_GI_number: 19704305 # Func_class: H Coenzyme transport and metabolism # Function: Precorrin isomerase # Organism: Fusobacterium nucleatum # 1 216 4 219 219 396 100.0 1e-110 MSYIKVPGDIEKRSFEIIEEELGDKVKKFSESELLIVKRIVHTSADFEYADLIEFQNNAI ESGLKALEKGCKIYCDTNMIVNGLSKPALAKYNCSAYSLVSDKEVIEAAKKEGLTRSIVG 
VRKAGKDSETKIFILGNAPTALYQLKEMIEKGEIEKPALVIGVPVGFVGAAESKEEFKKL GVPYITVNGRKGGSTIGVAILHGIIYQIYKREGFHA >gi|296155529|gb|ADVK01000004.1| GENE 24 20476 - 21468 1131 330 aa, chain - ## HITS:1 COG:FN0971 KEGG:ns NR:ns ## COG: FN0971 COG3177 # Protein_GI_number: 19704306 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 330 1 330 330 581 99.0 1e-166 MKKELSPPFKITNEILNFIYEIGELVGKISAEKEFEKNLTLRRENRIKTIYSSLAIEQNT LTLEQVTDVINGKRVLAPLKDIKEVQNAYEIYERIDELDENSVKDLLLAHKIMTSELIKE SGRFRSKNAGVYQGDKLIHMGTLPEYIPELINNLFLWLKNSKEHPLIKAAVFHYEFEFIH PFQDGNGRIGRLWHSLILSKWKKIFAWLPIESLVQKYQKEYYISINNSNRDGESTEYILF MLRIINETLIELVENKKMTDKMTDKMTDKNRERIKLVIKYLSQNNSINNKETQNLLNISE ATAKRFLNKLVKENILEAVGEYKARKYIKK >gi|296155529|gb|ADVK01000004.1| GENE 25 21468 - 22802 1748 444 aa, chain - ## HITS:1 COG:FN0972 KEGG:ns NR:ns ## COG: FN0972 COG1797 # Protein_GI_number: 19704307 # Func_class: H Coenzyme transport and metabolism # Function: Cobyrinic acid a,c-diamide synthase # Organism: Fusobacterium nucleatum # 1 444 1 444 444 878 100.0 0 MKAFMLAGVSSGIGKTTISMALMSVFNNVSPFKVGPDYIDPGFHEFITGNKSYNLDIFMM GEQGVKYSFYKHHKDISIIEGVMGLYDGMDNSLDNNSSAHIARFLGVPVILVLDGVGKST SIAAQVLGYKMLDPRVNISGVIINKVSSAKTYAIFKEAIEKYTGVKCLGFVEKNDKLNIS SQHLGLLQASEVEDLREKLSILKNLVLQNIDLKEIEKIASEQTRTFNENETEIEPPLYIS YLKDRYVGKIIAIAQDRAFSFYYNDNIEFLEYMGFKVKYFSPLKDSKVPECDIIYLGGGY PENFAEELSNNKEMFNSIRKNYEQGKNILAECGGFMYLSNGIEQIEGKVYQMCGLVPCVV NMTNRLDISRFGYILINNKNDIEIARGHEFHYSKLKAVLEDTRKFKAVKKDGRTWECIFN EKNLYAGYPHIHFFGSYKFIEEVF >gi|296155529|gb|ADVK01000004.1| GENE 26 22821 - 23894 1075 357 aa, chain - ## HITS:1 COG:FN0973 KEGG:ns NR:ns ## COG: FN0973 COG0079 # Protein_GI_number: 19704308 # Func_class: E Amino acid transport and metabolism # Function: Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase # Organism: Fusobacterium nucleatum # 1 357 1 357 357 651 99.0 0 MKDLHGGNIYKFQREGKNNILDYSSNINPLGVPQKFIDIAKENFDKLVNYPDPYYIELRK 
KIAEFNSLNMDNVIVGNGATEILFLYIRALKPKKVLILAPCFAEYARALKSVSAKMEYFE LKESDNFYPNIINLKKEIENNNYDLLLFCNPNNPTGQFIKLEDIKEIVKTCENKNTRIFI DEAFIEFIEKWKEKTVSLLKNRNVFIMRAFTKFFAIPGLRLGYGLGFDENILKKMWEEKE PWTVNTFANLAGLTMLDDKEYIKKSEKWILEEKKFMYKELSDFQYIKAYKTECNFILLKI YNISSASLRDKMIEKNILIRDASNFKFLDFHFVRLAIKDRESNLKLLETLDTIMEYK >gi|296155529|gb|ADVK01000004.1| GENE 27 23968 - 24339 435 123 aa, chain - ## HITS:1 COG:FN0974 KEGG:ns NR:ns ## COG: FN0974 COG0346 # Protein_GI_number: 19704309 # Func_class: E Amino acid transport and metabolism # Function: Lactoylglutathione lyase and related lyases # Organism: Fusobacterium nucleatum # 1 123 1 123 142 245 99.0 2e-65 MNKITCICLGVKNMEKSIKFYRDGLGYKTNCKENNPPVCFFDTPGTKFELYPLDLLVKDI DESTLKIGSGFSGFTLAYNVEKKEDVDKVIELVKNAGGKIIKEPQNVFWGGYHAYFSDLD NYF >gi|296155529|gb|ADVK01000004.1| GENE 28 24344 - 25321 1135 325 aa, chain - ## HITS:1 COG:FN0975 KEGG:ns NR:ns ## COG: FN0975 COG1270 # Protein_GI_number: 19704310 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CobD/CbiB # Organism: Fusobacterium nucleatum # 1 325 1 325 325 565 99.0 1e-161 MFNYFAVKFGLAYIIDLILGDPRWLYHPVIIIGKLISFLEKILYKAKNKIFSGAILNILT LSTTFIVSLFLARIGYVIEIFFLFTTLATKSLADEGKKVYRILKSGDIEKAKKELSYLVS RDTNTLSLDKIIMSVVETIAENTVDGFISPVFFAFVGSFFYVELFGKVVSLALPFAMTYK AINTLDSMVGYKNEKYIDFGKVSARVDDIANFIPARLTGLIFVPLSSLILGYNFKNSLKI FFRDRKKHSSPNSGQSESAYAGALGIQFGGKISYFGKDYEKQKIGDKLKEFDYEDIKKAV NILYAVSLIATISFILTSIIIVLLG >gi|296155529|gb|ADVK01000004.1| GENE 29 25341 - 26258 1145 305 aa, chain - ## HITS:1 COG:no KEGG:FN0976 NR:ns ## KEGG: FN0976 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 305 1 305 305 535 99.0 1e-150 MSISFYVKNKKKFLGYEAVLNVEEALTILDKELNSYNTGNIEVNDLLLSPVSNYECLLIG EDKVSARGFELSYDNKNKDYAVRIFTPSSREDWLLALEYIKALAKKFNSEIVNERGEVYT VDNIDKFNYERDILYGIEVITSNMKSGEADNYAIFGIDRVVSFNQEMLDKINNSDSPIDT FSNIVKEIQYLDAYSAHQQFYKNKTDGKIIGAYTLTQNLRTILPYKPSVEFENSDIVKND EISFWNIALVTINGDENDPNSYQVIRNLNYDDFIKKLPINKYKSIDASYIMVEPLSKEEI 
LDLLK >gi|296155529|gb|ADVK01000004.1| GENE 30 26285 - 27775 1994 496 aa, chain - ## HITS:1 COG:FN0977 KEGG:ns NR:ns ## COG: FN0977 COG1492 # Protein_GI_number: 19704312 # Func_class: H Coenzyme transport and metabolism # Function: Cobyric acid synthase # Organism: Fusobacterium nucleatum # 1 496 1 496 496 889 99.0 0 MKKANLMIVGTSSGAGKSLFVTALCRIFYKDKYKVSPFKSQNMALNSYITKDGKEMGRAQ VVQAEASGTEPNVNMNPILLKPSTINKIQIIVCGKSIGNMSGVEYNQYKKNLIPILKETY SKIENKNDIVVIEGAGSPAEINIKEEDISNFAMARIADAPVILVADIDRGGVFASIYGTI MLLHEEDRKRIKGIVINKFRGNKEVLKPGFEIIENLTGVKTLGVIPYTDIDIEDEDSLSE KYKSFKLNKNSNKIKISVIKLKHISNATDIDALSIYNDVEIQFVTERSQIGNEDFIIIPG SKNTIDDLKWLKESGIAEEIIKRARTETIIFGICGGFQILGNKVKDLYHIEGDIEELNGL GLLDLETIMENEKTLVQYKGKLVVDNGILKTLNDFEIKGYEIHQGITKGNEKNLTTDERT IFVNRDNIIATYLHGIFDNKDFTNTLLNEIRRRKGLEEVNDNISYEEYKLKEFDKLEKLV RENVDIDEIYKIIGLK >gi|296155529|gb|ADVK01000004.1| GENE 31 27865 - 29160 1316 431 aa, chain - ## HITS:1 COG:FN0978 KEGG:ns NR:ns ## COG: FN0978 COG1757 # Protein_GI_number: 19704313 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 1 431 1 431 431 640 99.0 0 MGSIIVILLFSLSLIVCLLLKYSVVYALIIGYLIFISYGFMKGHNLIVLIKKSFEGVLTV KNILLVFIFIGMITALWRASGTIAFIVYMGSKLISPSILILLTFLLCSILSVLIGTSLGT AATIGVICFSIGKAMGINPYYVGGAVLSGIYFGDRCSPMSTSALLIAELTKTNLYTNIKL MIKTSIIPFVMTCLFYLFLGFNSTVSNISVNVTEIFKQNYNINIIVIIPAILIIILSILK INVKKTMLLSIVVSFIIAMFSQKENIVSLINYCIYGYHHPNEKLNLMMKGGGILSMVNVG LIVGISSSYSGIFKETKMLVSLKKYLKDFSKKTSNYFVIFLSSIVSGAIACNQSLGIILT NELCEEIVDKQKRAIILENTVILLTGLIPWNTAMVVPLKTLDVGVMSGLFAFYLYFLPLW NLFLGVMKEKS >gi|296155529|gb|ADVK01000004.1| GENE 32 29341 - 29709 327 122 aa, chain + ## HITS:1 COG:no KEGG:FN0979 NR:ns ## KEGG: FN0979 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 122 1 122 122 223 100.0 3e-57 MSDSSKVYFHIYCAKNITIDNQKFSELEHIKGDDLEGVIKVSTDKHTITANYDLPSNSIF NSRIKAKDISISCPILEAGKHYVIAIYEVTPEIASTEENTYMDYVLAEELEKGYSICLYR MK >gi|296155529|gb|ADVK01000004.1| GENE 33 29723 - 
29980 288 85 aa, chain + ## HITS:1 COG:no KEGG:FN0980 NR:ns ## KEGG: FN0980 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 85 1 85 85 120 100.0 1e-26 MRIRLSGEVGGMIIVIFTGIIAAAIIDGILSFIEKHVVKDNEGGKKMFSLIRKVNWIFFI LFIVLDLTNVFPLIRTILFAILSRQ >gi|296155529|gb|ADVK01000004.1| GENE 34 30035 - 31315 1852 426 aa, chain - ## HITS:1 COG:FN0981 KEGG:ns NR:ns ## COG: FN0981 COG0151 # Protein_GI_number: 19704316 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylamine-glycine ligase # Organism: Fusobacterium nucleatum # 1 426 1 426 426 781 99.0 0 MKVLIVGSGGREHAIAWKISQNSKVDKIFAASGNAYNKVIKNCENINLKTSDDILNFAIK EKVDLTIVGSEELLVDGIVDKFQENNLTIFGPNKEAAMLEGSKAFAKDFMQKYGVKTAKY QSFTDKEKAIKYLDKISYPVVIKASGLAAGKGVVIAQNRKEAEETLNDMMTNKVFAAAGD TVVIEEFLDGVEISVLSITDSEVIIPFISAKDHKKISEKETGLNTGGMGVIAPNPYYTKT IEEKFIQNILNPTLKGIKAEKMNFVGIIFFGLMVANGEVYLLEYNMRMGDPETQAVLPLM KSDFLNVINSALNKDLKNIKIDWEDKSACCVVMAAGGYPVKYEKGNFISGLEKFDSNKSD NKIFFAGVKEENDKFYTNGGRVLNVVSIKDSLEKAIEEAYKNVKEISFKDNYYRKDIGTL YVPVKY >gi|296155529|gb|ADVK01000004.1| GENE 35 31332 - 32846 2119 504 aa, chain - ## HITS:1 COG:FN0982 KEGG:ns NR:ns ## COG: FN0982 COG0138 # Protein_GI_number: 19704317 # Func_class: F Nucleotide transport and metabolism # Function: AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) # Organism: Fusobacterium nucleatum # 1 504 1 504 504 907 94.0 0 MRKRALISVYDKTGILDFAKFLVSKGIEIISTGGTYKYLKENNIEVIEVSKITNFEEMLD GRVKTLHPNIHGGILALRDNEEHMRTLKERNIDTIDYVVVNLYPFFEKVKENLSFEEKIE FIDIGGPTMLRSAAKSFKDVVVISDVKDYELVKEEMNKSNDVSYETRKRLAGKVFNLTSA YDAAISQFLLDEDFPEYLNISYKKSMEMRYGENSHQKAAYYTDNMSDGAMKDFKQLNGKE LSYNNIRDMDLAWKVVSEFDEICCCAVKHSTPCGVALGDNVEEAYKKAYETDPVSIFGGI VAFNREVDEASAKLLSEIFLEIIIAPSFSKSALEILTKKKNIRLIVCKNKPSDKKELIKV DGGILIQDTDNRLYENLEIVTKAKPTSQEEKDLIFALKVVKFVKSNAIVVAKNLQTLGIG GGEVSRIWAAEKALERAKERFNVIDVVLSSDAFFPFKDVAELAAKNGVKAIIQPSGSVND EDSIEECDKNNISMIFSKLRHFKH >gi|296155529|gb|ADVK01000004.1| GENE 36 32872 - 33642 
952 256 aa, chain - ## HITS:1 COG:no KEGG:FN0983 NR:ns ## KEGG: FN0983 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 256 1 256 256 448 99.0 1e-124 MSFNLFIFERRENIKTSIDINNYIEEFTKYEEEEDYNSLEGCSETIVKFAKKMFEKFPPV NGKYLHLDEIAFTSKNSETHLTDYSLGKYGVFCALDYSVADEAISYIISLADEYKIGVYN PQSSEVIYPKNIEILKYRTEDRDDTFTDWYTIENSINTLDSLERGTSNRENAFVTVWFEK NGKDEDEYIQCTPNYLKKGFFKSKILIDKYYFEVMIDKKLYQTNVSDKKDLIRLMKEWCC ERKNPDVKNYEIIMEL >gi|296155529|gb|ADVK01000004.1| GENE 37 33680 - 34489 926 269 aa, chain - ## HITS:1 COG:FN0984 KEGG:ns NR:ns ## COG: FN0984 COG3315 # Protein_GI_number: 19704319 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: O-Methyltransferase involved in polyketide biosynthesis # Organism: Fusobacterium nucleatum # 1 269 1 269 269 485 96.0 1e-137 MEKLDGVANTLYVPLYGRIYVSKKFPEYFYDEMALKIEEKFTSGISKGSFEYTNMAYAAR YYNMDKMIIKFIKEHKISNIVLLGIGLETAYDRITQKCGLGEVNYYGIDLPEVIEIRKKY FGERKQETLIAGDMFEMKWKEQIDTSIPTLLIVSGVFQYFFEDKIIEFIKNLKKEFPYGE LIFDTARKKTGLRFANWYIRRTGNLEALMHFYIEDSVDFSKKTDTILVEELAFFRDAREL LRKKLNFITKLFMKIADYKRQALIIYLKW >gi|296155529|gb|ADVK01000004.1| GENE 38 34509 - 35051 753 180 aa, chain - ## HITS:1 COG:FN0985 KEGG:ns NR:ns ## COG: FN0985 COG0299 # Protein_GI_number: 19704320 # Func_class: F Nucleotide transport and metabolism # Function: Folate-dependent phosphoribosylglycinamide formyltransferase PurN # Organism: Fusobacterium nucleatum # 1 180 1 180 180 322 96.0 3e-88 MFKIIVLVSGSGTNMLQLIKNNIKIDCIIADRECKAKNIADEYKIDFVLLNRNKEISKNL LKIFEERKPDLIVLAGFLSILDGEILEKYKNKIINIHPSLLPKYGGKGMYGLKVHQAVFE NGDKESGCTVHYVTSNVDAGEIIGQEKVDISMAKSPEEIQKIVLEREWKLLPRVVKKLIK >gi|296155529|gb|ADVK01000004.1| GENE 39 35039 - 36058 815 339 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|169632702|ref|YP_001706438.1| phosphoribosylaminoimidazole synthetase [Acinetobacter baumannii SDF] # 4 336 13 344 356 318 48 5e-86 MINSYKDSGVDKEEGYKAVELMKKNVLKTHNKSVLTNLGSFGAMYELGQYKNPVLISGTD 
GVGTKLEIAMKQKKYDTVGIDCVAMCVNDVLCHGAKPLFFLDYLACGKLDAEIAAQLVSG VTEGCLQSYAALVGGETAEMPGFYKEGDYDIAGFCVGIVEKEDLIDGSKVKEGNKIIAVA SSGFHSNGYSLVRKVFTDYDEKISLKEYGENVTMGDVLLTPTKIYVKPILKVLEKFNVNG MAHITGGGLYENLPRCMGKDLSPVVFKDKVKVPEIFKLIAERSKIKEEELFGTFNMGVGF TLVVEEKDVEAIIELLTSLGEIAYEIGHIEKGDHSLCLK >gi|296155529|gb|ADVK01000004.1| GENE 40 36058 - 37404 1959 448 aa, chain - ## HITS:1 COG:FN0987 KEGG:ns NR:ns ## COG: FN0987 COG0034 # Protein_GI_number: 19704322 # Func_class: F Nucleotide transport and metabolism # Function: Glutamine phosphoribosylpyrophosphate amidotransferase # Organism: Fusobacterium nucleatum # 1 448 1 448 448 876 99.0 0 MGILALHSKKVRNDLVGIAYYGMYALQHRGQAGAGYTICDSKTNNEVRIKTIKNVGLVSD VFKVEDFQKFTGTILIAHTRYGSKNTVSSRNCQPVGGESAMGYISLVHNGDISNQAELKQ ELLNNGSLFQTAIDTEIILKLLSINGKYGYKEAVLKTVKKLKGCFTLGIIINDKLIGVRD PEGLRPLCLGRIVENDMYVLASETCALDAIGAEFVRDIEAGEMVIIDDNGVESIKYKESS KKASSFEYIYFGRPDSVIDGISVYDFRHQTGKCLYEQNPIEADIVIGVPDSGVPAAIGYS EASGIPYSAALLKNKYVGRTFIAPVQELRERAVKVKLNPIKELIKGKRVVVIDDSIVRGT TSKKLIDILFEAGAKEVHFRSASPVVIEESYFGVNIDPNNKLMGSYMSIEEIRKVIGATT LDYLSLKNLKKILNGGKDFYMGCFKEDE >gi|296155529|gb|ADVK01000004.1| GENE 41 37449 - 38162 1177 237 aa, chain - ## HITS:1 COG:FN0988 KEGG:ns NR:ns ## COG: FN0988 COG0152 # Protein_GI_number: 19704323 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylaminoimidazolesuccinocarboxamide (SAICAR) synthase # Organism: Fusobacterium nucleatum # 1 237 1 237 237 434 99.0 1e-122 MEKGKFIYEGKAKQLYETDDKDLVIVHYKDDATAGNGAKKGTIHNKGIMNNEITALIFNM LEEHGIKTHFVKKLNDRDQLCQRVKIFPLEVIVRNIIAGSMAKRVGIKEGTKINNTIFEI CYKNDEYGDPLINDHHAVAMGLATYDELKEIYDITGKINNLLKEKFDNIGITLVDFKIEF GKNSKGEILLADEITPDTCRLWDKKTGEKLDKDRFRRDLGNIEEAYIEVVKRLTEKK >gi|296155529|gb|ADVK01000004.1| GENE 42 38291 - 38764 765 157 aa, chain - ## HITS:1 COG:FN0989 KEGG:ns NR:ns ## COG: FN0989 COG0041 # Protein_GI_number: 19704324 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase # Organism: Fusobacterium 
nucleatum # 1 157 1 157 157 277 97.0 5e-75 MKVGIIFGSKSDVDVMKGAADCLKKFGIEYSAHVLSAHRVPELLEETLEKFEKEDYGVII AGAGLAAHLPGVIASKTVLPVIGVPIKAAVEGLDALFSIVQMPKSIPVATVAINNSYNAG MLAVEILAIGNKELREKLLEFRKEMKEDFKKNIHVEL >gi|296155529|gb|ADVK01000004.1| GENE 43 38776 - 42513 4844 1245 aa, chain - ## HITS:1 COG:FN0990_1 KEGG:ns NR:ns ## COG: FN0990_1 COG0046 # Protein_GI_number: 19704325 # Func_class: F Nucleotide transport and metabolism # Function: Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain # Organism: Fusobacterium nucleatum # 1 979 5 983 983 1832 98.0 0 MSNLRFFVEKKKGFDLDAKRLEKQFREELGVNVRDLRLINCYDIFNLNDNKEDIEKIKKM ILSEPVTDTITTELDLKGKKYFAVEFLPGQFDQRADSALQCIDIVSSEKQNADILTSRII ILNNELSDEELNKIKKFYINPIEMREKDLSVLKKEEILFNSEVITYDNFILFDDVDLEKI RTDLGLSMSFEDLKFIQGHYKEIGRNPTETEIKVLDTYWSDHCRHTTFETKINKVTFPDS EFGKQMEKEFNGYLKLKEDVSKKRDVSLMDMATIVAKYLKKEGKLDNLEVSEENNACSVY VDVEVEDFEGKKAIEKWLLMFKNETHNHPTEIEPFGGASTCLGGAIRDPLSGRAYVYQAI RVTGSGNPLETVEETLKGKLPQKKITTGAASGYASYGNQIGIATSLVSEIYHDGYKAKRM EVGAVVAAAPVENVVRKSPVPNDSIIIIGGKTGRDGCGGATGSSKEHNDKSLLLCGAEVQ KGNAPEERKIQRLFRNPNATKLIKKCNDFGAGGVSVAIGELADGVEVNLDLVPVKYDGLN GTELAISESQERMAVIVSKEDTEKFLKFVDEENLLGTVVGYVTDKNRLTLNWKGKAIVDI SRDFLNTNGVQQNIDIEVRDYKDENIFEKFKTSDSSLEKKWLHNIIKLNVVSQKGLVEMF DSSVGAGTILAPFGGKYQMTPTDVSIMKFPVLNKNTNTASAITWGFNPYISEWSTYHGAI YAVVESLAKLVAAGVDYKTARLSFQEYFEKLGKDSYKWSKPFLALLGAMKVQKDFDVAAI GGKDSMSGTFNDISVPPTLISFAVSTVNVTDVISTEFKKAKNKIYLVENKINEKDFLFDS QELKENFHFILKNIKDKKIVSAMVVKIGGLAEVLSKMSFGNRLGFEINNKNVDLFSLKPA SILIETTEELSYKNAIYLGEVTDKFEGKVNGESINLEEVEATWLNKLKPIFPYKLEEKIE TYDIKNKISEKKIYKSSITIAKPKVVIAAFPGTNSEYDMYNRFNENGGEAKITLLRNLTQ KHLTESVDEMCKDLRNSQIFVLPGGFSAGDEPDGSGKFMAAVLQNPKLMDEIKAFLDRDG LILGVCNGFQALVKSGLLPYGEIGNVHENSPTLTFNKIGRHISQIVKTKIVTNNSPWLSS FEIGETFDIPVSHGEGRFYASDGVLKQLFENGQIATQYVDFDLEATNEFRFNPNGSSFAI EGIISPDGKIFGKMGHSERYSKDTFKNIDGNKNQNLILNGIKYFK >gi|296155529|gb|ADVK01000004.1| GENE 44 42578 - 42673 154 31 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKLITKKNLNKIKFKVAVFYGIVANSCNCKL 
>gi|296155529|gb|ADVK01000004.1| GENE 45 42957 - 43748 1131 263 aa, chain + ## HITS:1 COG:FN0991 KEGG:ns NR:ns ## COG: FN0991 COG1183 # Protein_GI_number: 19704326 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine synthase # Organism: Fusobacterium nucleatum # 1 263 1 261 261 441 94.0 1e-124 MVKKKYIAPNLITAGNMFLGYLSITESIKGNYKMAILFILLAMVCDGLDGKTARKLDAFS EFGKEFDSFCDAVSFGLAPSMLIYAILTKNVPGSPFVVPVSFLYALCGVMRLVKFNIINV ASSEKGDFSGMPIPNAAAMVVSYIMICNVLETTFGLTIFNIRIFIAISVISASLMVSTIP FKTPDKTFSFIPKKLALIIILGLLASMYWTLEYSVFIISYTYVLLNILTYFYKRFGSEEE KDENEDELTEEFIEVDESEEKGE >gi|296155529|gb|ADVK01000004.1| GENE 46 43751 - 44827 1023 358 aa, chain + ## HITS:1 COG:FN0992 KEGG:ns NR:ns ## COG: FN0992 COG0859 # Protein_GI_number: 19704327 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose:LPS heptosyltransferase # Organism: Fusobacterium nucleatum # 1 358 1 358 358 630 99.0 1e-180 MFSQNDKINILVIRFKRIGDAILSLPLCHSLKLTFPNSKVDFVLYEDVAPLFEGHPYIDN VITITKEEQKNPFKYIKKVYNVTRKKYDIIIDIMSTPKSELFCMFSRKTPFRIGRYKKKR GFFYNYKMKEKESLNKVDKFLNQLLPPFEEVGFDVKKDYDFKFFAEKKEKEKYKQKMLET GVDFSKPVIAFSIYSRVAHKIYPIDKMKEVVKFLINRYSAQIIFFYSADQKDEIQKIHKE IGDFKNIFSSIETPTIKDLVPFLENCDYYIGNEGGARHLAQGVGIPTLAIFSPSADLKEW LPFPSEKNMGISPIDILEKNALSLDDFNKMNPEEQFSLIDIETIKEMSDELIEKNKRK >gi|296155529|gb|ADVK01000004.1| GENE 47 44944 - 46281 1282 445 aa, chain + ## HITS:1 COG:FN0993 KEGG:ns NR:ns ## COG: FN0993 COG0168 # Protein_GI_number: 19704328 # Func_class: P Inorganic ion transport and metabolism # Function: Trk-type K+ transport systems, membrane components # Organism: Fusobacterium nucleatum # 1 445 39 483 483 738 99.0 0 MAYIIPIIILCILSYFLSDKSPENQSFFSKEGLVIVSLSWLLISFFGALPFVISGSISNI VDAFFESVSGFTTTGASILPEVESLSRSILFWRSFTHVVGGMGVLVLVLAILPKGNNQAL HIMRAEVPGPTVGKIVAKMNYNSRILYIIYISMIIILIIFLLLGGMPLFDACIHAFGTAG TGGFSCKNTSIGFYNSAYIDYVTSIGMIAFGLNFNLFYLLILGNIKQVFKSEEAKYYLGI IFIATTLICINIYPIYSSISRMIRDVFFTVSSIITTTGFSTVDFDKWPTFSKTILMFLMF 
CGACAGSTAGGFKVSRLAILIKKFVREFKKIGHPNKVLNIKLEGKTLDKEMLEGVDSYFI LYSVILFILLLITAWDSDTFITALSAVLATFNNIGPGLGAVGPTLNFASYSAFTKIVLSL GMLLGRLEIIPLLILVSPRIYRKRD >gi|296155529|gb|ADVK01000004.1| GENE 48 46316 - 47101 768 261 aa, chain + ## HITS:1 COG:no KEGG:FN0994 NR:ns ## KEGG: FN0994 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 21 261 1 241 241 410 98.0 1e-113 MENYGKPKKVVGMKFILKFIMLVFFLLFSFFLFSLTKFFIKDFNRGYSASGTTYVIFVII AEIVLISFTFGLPYLLMKLYPKIYYYDDGFQVGKKNGKIFYEKLDYFFIPAYNRINSFMA IKYTDNEGNWKAIPAINYARNSFELFQQDFVNVNFPKAMKKLENNEVIEFLFNDPKKRLM AWGSKKYMKKKLEQSLKIKVTRESITFDAETYEWDKYKIFISLGSITVQEKDGAPILVLG GNALVHRVNLLEAIINTFGKN >gi|296155529|gb|ADVK01000004.1| GENE 49 47119 - 47706 454 195 aa, chain + ## HITS:1 COG:no KEGG:FN0995 NR:ns ## KEGG: FN0995 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 195 1 195 208 340 99.0 2e-92 MDAQKYMTLEKKDNYKLLTFFGVFIAFIVVTFFLYMLARKSETSVYAPIIAVAIPFLAAI GANYMGKKSDEKKNDNQDSWSNSDEALTKTKKSRFSKFGPTGPDLTHFGFILSLISTYLS IYLSEVLNLTKILQITHPNDSFSAIFIDVVKNIFKADWSRKYLIQYWLWATFFIIILIVT VILGQKNLVKIKRKE >gi|296155529|gb|ADVK01000004.1| GENE 50 47776 - 48474 417 232 aa, chain - ## HITS:1 COG:FN0996 KEGG:ns NR:ns ## COG: FN0996 COG5522 # Protein_GI_number: 19704331 # Func_class: S Function unknown # Function: Predicted integral membrane protein # Organism: Fusobacterium nucleatum # 1 232 1 232 232 363 98.0 1e-100 MEDKFVLFSNQHFITMGIGFFSCILLVFLGFFTEKKAAFAKIIAIIVLGVKIAELSFRHY YYGETVAQLLPLHLCPIVIILSIFMMFFHSEVLFQPVYFWSIGAFFAILMPDIRDGMSNF ASQSFFITHFFILFSTVYAFVHFRFRPTKSGFIYSFLMLVILAFIMYFINNRLGTNYLYV NHPPVTKSLVDFMGPWPYYIFSLAGIDIAISFFMYLPFRRNKKSKYGSWRSY >gi|296155529|gb|ADVK01000004.1| GENE 51 48487 - 49137 820 216 aa, chain - ## HITS:1 COG:no KEGG:FN0997 NR:ns ## KEGG: FN0997 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 216 1 216 216 425 99.0 1e-118 MAKQKYYAYFFDKKNNGIVDTWVECEKIVRGTKARYKSFIDRTVAQDWLDSGASYERNIG 
LNAPINTTLEKGIYFDSGTGRGIGVEVRITDENKENILDKISQSALKKLLKDTAWIKNEF GNIQLEAGKTNNFGELIGFYFALKCAELLKCNIISGDSRLVIDYWSLGRFHESNLELDTI SYINKVILLRKEFEKNKGIVKHISGDINPADLGFHR >gi|296155529|gb|ADVK01000004.1| GENE 52 49400 - 50902 2017 500 aa, chain + ## HITS:1 COG:FN0998 KEGG:ns NR:ns ## COG: FN0998 COG0747 # Protein_GI_number: 19704333 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 500 1 500 500 957 99.0 0 MKKIFYLLFVISLFLVGCGQNKTESPNGSTVVIGQGAKPKSLDPHMYNSIPDLLVSRQFY NTLFSREKDGSIKPELAESYEYKNDKELDIILKKGVKFHDGSELTADDVVFSFERMKDKP GSSIMVEEIDKVEKVNDYEIKLFLKNSSSALLYNLAHPITSIVNKKYVEAGNDLSVAPMG TGAFKLIAYNDGEKIELEAFKDYFEGAPKVEKITFRAIPEDTSMLAALETGEVDIATGMP PVSTQTIEANDKLDLISEPTTATEYICLNVEKVPFDNKDFRKALNYAIDKQSIIDSIFSG RGKVAKSIVNPNVFGYYDGLEEYPFNPEKAKELIEKSDVKDMSFSLYVNDSPVRLQVAQI IQANLKDVGIDMNIETLEWGTYLQKTGEGDFTAYLGGWISGTSDADIVLYPLLDSKSIGF PGNRARYSNPEFDKEVEMARIALKPEERKEHFKNAQIISQEDSPLIVLYNKNENIGINKR IKGFEYDPTTMHKFKNLEIK >gi|296155529|gb|ADVK01000004.1| GENE 53 50916 - 51950 1440 344 aa, chain + ## HITS:1 COG:FN0999 KEGG:ns NR:ns ## COG: FN0999 COG1363 # Protein_GI_number: 19704334 # Func_class: G Carbohydrate transport and metabolism # Function: Cellulase M and related proteins # Organism: Fusobacterium nucleatum # 1 344 4 347 347 662 99.0 0 MNIDLKYILNKTVELLNIPSPVGYTHNAIEWVKNELKKLGIKNYNITKKGALIAYIKGKD SNYKKMISAHVDTLGAVVKKIKKNARLEVTNVGGFAWGSVEGENVTIHTISGKTYSGTLL PIKASVHVYGDVAREMPRTEETMEIRIDEDVKTDEDVLKLGILQGDFISFETHTRILDNG YIKSRYLDDKLCVAQILSYIKYLKDNKLKPKTDLYIYFSNYEEIGHGVSVFPEDLDEFIA VDIGLVAGEDAHGDEKKVQIIAKDSRSTYDFTLRKKLQETASKNNIKYTVGVYNRYGSDA TTAILQGFDFKYACIGPNVDATHHYERCHNDGIIETVKLLIAYL >gi|296155529|gb|ADVK01000004.1| GENE 54 52035 - 53174 1264 379 aa, chain + ## HITS:1 COG:FN1000 KEGG:ns NR:ns ## COG: FN1000 COG0502 # Protein_GI_number: 19704335 # Func_class: H Coenzyme transport and metabolism # Function: Biotin synthase and related enzymes # Organism: Fusobacterium nucleatum # 
20 379 1 360 360 698 99.0 0 MLSYIQVIQILSARKGMRYMLKEKNSAGGGKFNFFNFLKEKDNNQAEPSNVKEFISYLKN KIINEKYEITREEAIFLSQIPNNDMETLNILFNAADQIREVFCGKYFDLCTIINAKSGKC SENCKYCAQSVHFKTGTDVYGLISKELALYEAKRNENEGAHRFSLVTSGRGLNGNEKELD KLVEIYKYIGEHTSKLELCASHGICTKEALQKLVDAGVLTYHHNLESSRRFYPNVCTSHS YDDRINTIKNAKEVGLDVCSGGIFGLGETIEDRIDMALDLRTLEIHSVPINVLTPIPGTP FENNEEVNPFEILKTISIYRFIMPKSFLRYCGGRIKLGEHAKTGLRCGINSALTGNFLTT TGTTIETDKKMIKELGYEI >gi|296155529|gb|ADVK01000004.1| GENE 55 53164 - 53823 736 219 aa, chain + ## HITS:1 COG:FN1001 KEGG:ns NR:ns ## COG: FN1001 COG0132 # Protein_GI_number: 19704336 # Func_class: H Coenzyme transport and metabolism # Function: Dethiobiotin synthetase # Organism: Fusobacterium nucleatum # 1 219 1 219 219 413 99.0 1e-115 MKFKDFFVIGTDTDVGKTYVSTLLYKALKKYNFQYYKPIQSGCFLKDGKLTAPDVDFLTK FVGIDYDDSMVTYTLKEEVSPHLASEMEGTRIEIENVKRHYEDLKKKHSNILVEGAGGLY VPLIRDIFYIYDLIKLFNLPVVLVCGTKVGAINHTMLTLNALDTMGIKLHGLVFNNYRGQ FFEDDNIKVVLALSKIKNYLIIKNGQKEISDKEIEKFFN >gi|296155529|gb|ADVK01000004.1| GENE 56 53909 - 55249 1632 446 aa, chain + ## HITS:1 COG:FN1002 KEGG:ns NR:ns ## COG: FN1002 COG0161 # Protein_GI_number: 19704337 # Func_class: H Coenzyme transport and metabolism # Function: Adenosylmethionine-8-amino-7-oxononanoate aminotransferase # Organism: Fusobacterium nucleatum # 1 446 7 452 452 907 99.0 0 MINNLSELQKKDLKYVFHPCAQMKDFEENPPLVIKKGDGLYLIDENGNKYMDCISSWWVN LFGHCNKRINRVITEQVNNLEHVIFANFTHEPAAELCEELTKVLPKGINKFLFSDNGSSC IEMALKLSFQYHLQTGNPQKTKFISLENAYHGETIGALGVGDVDIFTKTYRPLIKEGRKV KVPYIDSKLSNDEFVKLEDECIKELEDLIIKNHNEIACMIVEPMVQGAAGIKIYSAKFLK AARDLTKKYNIHLIDDEIAMGFGRTGKMFACEHAGIEPDMMCIAKGLSSGYYPIAMLCIT TDIFNAFYADYKEGKSFLHSHTYSGNPLGCRIALEVLRIFKEENILKTINEKGTYLRNKM KEIFEDKSYIKDIRNIGLIGAIELKDNLLPNVRIGKEIYNFALKKGAFLRPIGNSVYFMP PYVITYEEIDKMLHVCKESIEELLKI >gi|296155529|gb|ADVK01000004.1| GENE 57 55345 - 56796 2190 483 aa, chain + ## HITS:1 COG:FN1003 KEGG:ns NR:ns ## COG: FN1003 COG2067 # Protein_GI_number: 19704338 # Func_class: I Lipid transport and metabolism # Function: Long-chain fatty 
acid transport protein # Organism: Fusobacterium nucleatum # 211 483 1 273 273 495 97.0 1e-140 MKKLLLLTAILSSGLYAASIDHIQTYSPDYLANQAQTGMINGVSPYYNPAALGRLEKGKY LHAGFQFAHGHEKMSYKGKEHKANLNQAIPNIALTFVDDKGATFFNFGGLAGGGKLRYDG VSGVDVLTDLAQFKPLGIYDKGSSLTGSNKYEQITLGRAFNIDDKLSFSVAARVVHGTRR LSGNLNIGANPTAQYKQEKAQQVAQEVSKAVDAATQGKGLSAAQIAAIKTKKTNEALTKL QQRVQDLNQNGLTGDIDSRREAWGYGFQLGVNYKVNDKLNLAARYDSRIKMNFKAKGHEH QLETTDILKQTIGLSTFYPQYTINSKIRRDLPAILSVGASYKVADNYLVSTTANYYFNHQ AKMDRVTTFGEHEHGRDYKNGWEIAVGNEYKLNDKFTLIGSLNYANTGAKTASYNDTEYA LNSFTLGGGIRYQYDESLAITASVAHFIYQGAEGNFKEKYGVTENQKYKKEITAVGLSVT KKF >gi|296155529|gb|ADVK01000004.1| GENE 58 56815 - 57381 680 188 aa, chain + ## HITS:1 COG:FN1004 KEGG:ns NR:ns ## COG: FN1004 COG1309 # Protein_GI_number: 19704339 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 188 1 188 188 321 99.0 5e-88 MPKKVLFSKEFILDKSFELFKEEGIESISARNVGKILDASPAPIYKSIGSMKNLKKELIK KAKDLFIEYLTKKRTGIKFLDIGMGISIFAREEKQLFLQVFSKDNIEGSLIDEFLKLIRE EIKKDERLIRIDKEKQEELLVSCWVFAHGLSTLIATGFFKNPTDEFIEKVLRDAPARLFY EYIENHSK >gi|296155529|gb|ADVK01000004.1| GENE 59 57500 - 58057 723 185 aa, chain + ## HITS:1 COG:no KEGG:FN1005 NR:ns ## KEGG: FN1005 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 26 185 1 157 157 234 79.0 1e-60 MRKNFTILLLTLFSILASFSYANQTLINQQNAQRFLDSQLQGQLEVDRQRREREAAAAQQ YQYQEPRDDRIYHRYAVFVWNEDTGHYMGFPELLVAPTNNSKKAAIKLAKKEFTRWFKEN DGVDVKVTDYVSWNGGLRYIVVMGKNRKTGKWEAFTKFEVDDKNKEILINECSKTCDECQ FDWSN >gi|296155529|gb|ADVK01000004.1| GENE 60 58111 - 58524 678 137 aa, chain - ## HITS:1 COG:FN1006 KEGG:ns NR:ns ## COG: FN1006 COG0454 # Protein_GI_number: 19704341 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 6 137 1 132 132 208 95.0 2e-54 MEGDFMNYKIIKNDTNYNIDDLTKLLNTSYWAKDRKKETVKKTVEKSLCYFAYDTDKNKL 
IGFARAITDYTTNYYLCDIIVDEEYRGKGIGKKLVETLINDEDLVHVRGLLITKDAKKFY EKFGFYNKEDVMQKDKK >gi|296155529|gb|ADVK01000004.1| GENE 61 58708 - 59544 569 278 aa, chain + ## HITS:1 COG:no KEGG:FN1007 NR:ns ## KEGG: FN1007 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 278 17 294 294 419 96.0 1e-116 MIIWNILIFLGLSIFLFFSLSNKKVDFSDIREYIFLIIIIIILLISFATIIKKIKLLKVF SNKSDNIIKTSEKEIYAYKFLNMYKKLVYRRMLVIFAPILFSFVIVIFLYIATSKIQNRN EEIVSLIYVGGKITFFITLLPCLVIFLIIYISSSQKIRMLQKVYDFASKEEVNNLDETES LTELYTPKPKYIFTRKFFINWDGSLNIFILEDIEKVEYKKYNYFFIYGTKLLIHLKNGKR KKIRYAGPDENEWRKRNFIVKKNSNIEGKVEYHINLPI >gi|296155529|gb|ADVK01000004.1| GENE 62 59620 - 60138 685 172 aa, chain - ## HITS:1 COG:no KEGG:FN1008 NR:ns ## KEGG: FN1008 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 172 6 179 179 281 95.0 1e-74 MSNRLGILLSAGIVFIFASVFLIIAEISKRISENKIENFQGEAEGEVLEVIKSGRDGVGG KLLDTFVVYQYEVNKHKYIVKPYVLSKNAAINQKYFDSEIVTCITYMGTHGMSRQTKYRA GESITIKYNLENPKKHEILNDKDKMFAYKGFKIAGSIIMIIPIILVIVSFFV >gi|296155529|gb|ADVK01000004.1| GENE 63 60179 - 60670 641 163 aa, chain - ## HITS:1 COG:no KEGG:Celal_1460 NR:ns ## KEGG: Celal_1460 # Name: not_defined # Def: hypothetical protein # Organism: C.algicola # Pathway: not_defined # 46 156 123 218 235 87 44.0 2e-16 MEQIKKEKSDFIKTKIKELREKIARPCTEFETKKFQYDDDICPDPYPLKPKLNNTDFPIW DGGGFDFELAEEIDELERDCFYDEKTKELKSEDNPDKLDYYDDIADTHSYLHKFGGYPSY CQPGLGLEAIKDYHFMFQISSDSVANYNIVDSGSFIMKMKING >gi|296155529|gb|ADVK01000004.1| GENE 64 60685 - 61056 513 123 aa, chain - ## HITS:1 COG:no KEGG:FN1009 NR:ns ## KEGG: FN1009 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 123 1 114 114 207 100.0 1e-52 MKKIFILLGMLLLGIVSYAKEEDILGTWFIKENGKIVEIYKNKDGEYTGKIKENNFIFLK QNNDLTYSKERNSLAYFTLKFPDYKFSYHVWINIQKDGSLFLKGTGNTEVGKYVGEWHLI REK >gi|296155529|gb|ADVK01000004.1| GENE 65 61060 - 61191 93 43 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296327343|ref|ZP_06869895.1| ## NR: 
gi|296327343|ref|ZP_06869895.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 43 1 43 43 65 100.0 1e-09 MLVERNLQTVAWRKSDLNELKNYDDEELDGDFNFYRRIIRIEE >gi|296155529|gb|ADVK01000004.1| GENE 66 61389 - 61685 443 98 aa, chain - ## HITS:1 COG:FN1010 KEGG:ns NR:ns ## COG: FN1010 COG1799 # Protein_GI_number: 19704345 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 98 1 98 98 171 100.0 3e-43 MTDNIDIVFLKPTKFEDCVICANYIKEDKIVNMNLSQLDDNDSRRVLDYIAGAIFITKAE IVNVGNKIFCSIPSNRNFLNEMNRDTSHDEEEVEIVRG >gi|296155529|gb|ADVK01000004.1| GENE 67 61772 - 62737 279 321 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149007035|ref|ZP_01830704.1| 50S ribosomal protein L31 type B [Streptococcus pneumoniae SP18-BS74] # 5 321 1 308 311 112 28 7e-24 MKEFLEKFKKIKDTIEKNENIVLTAHINPDGDAVGSGLGLLLTLKENYENKNIRFVLQDN IPYTTKFLKGSEEIETYDKNKKYSCDLLIFLDSATRDRTGETGRNIESRTTINIDHHVSN PEYGDIACVIAYSSSTSEIIYNFIKYMEYKFSLSVAEALYLGLVNDTGNFSHSNVKVGTM QMATDLIAMGVNNNYIVTNFLNSNSYQTLKMMGEALKNFKFYPEKKLSYYYLDHETMKKY DAKKEDTEGIVEKILSYYEASVSLFLREEADGKIKGSMRSKYEINVNEVANLFGGGGHYK AAGFSSNLSADEILETVLKNI >gi|296155529|gb|ADVK01000004.1| GENE 68 62757 - 64604 2112 615 aa, chain - ## HITS:1 COG:FN1012 KEGG:ns NR:ns ## COG: FN1012 COG1493 # Protein_GI_number: 19704347 # Func_class: T Signal transduction mechanisms # Function: Serine kinase of the HPr protein, regulates carbohydrate metabolism # Organism: Fusobacterium nucleatum # 1 615 1 615 615 1163 100.0 0 MYTYTTIREIVDKLNLEILNEGNLDLKIDIPNIYQIGYELVGFLDKESDELNKYINICSL KESRFIATFSKERKEKVISEYMSLDFPALIFTKDAIITEEFYYYAKRYNKNILLSNEKAS VTVRKIKFFLSKALSIEEEYENYSLMEIHGVGVLMSGYSNARKGVMIELIERGHRMVTDK NLIIRRVGENDLVGYNAKKREKLGHFYLEDIKGGYVDVTDHFGVKSTRIEKKINILIVLE EWNEKEFYDRLGLDVQYEDFVGEKIQKYIIPVRKGRNLAVIIETAALTFRLRRMGHNTPL EFLTKSQEIIERKKKEREEYMNTNRLPVTKLINEFDLEIKYGEDKVSSTYINSSNVYRPS 
LSLIGFFDLIEEVKNIGIQIFSKIEFKFLENLPPIERVNNLKKFLTYDIPMIVLTVDANP PDYFFDLVSKSGHILAIAPYKKASQIVANFNNYLDSFFSETTSVHGVLVELFGFGVLLTG KSGIGKSETALELIHRGHRLIADDMVKFYRNTQGDVVGKSAELPFFMEIRGLGIIDIKTL YGLSAVRLSKTLDMIIELQAVDNSDYMSAPSAHLYEDVLGKPIKKRILEISSGRNAAAMV EVMVMDHMSGLLGEK >gi|296155529|gb|ADVK01000004.1| GENE 69 64629 - 65414 1166 261 aa, chain - ## HITS:1 COG:no KEGG:FN1013 NR:ns ## KEGG: FN1013 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 261 1 269 269 155 92.0 1e-36 MDKKKKETQKRKVVQVKPKKKKNTVDFSKFFNFLSFVVFVAFAWFMYKKVIQQEKIEQAI VENTSKQITSTMDIRNEDFYNGPKKEVKKEEEKEEKEEIVEDIKTPEEIVSVEDKPKITE TTSVASIEKKEEKKEIKAPEKDKKVSDSKKLEEGKEAAKKVIKDEVEKKAKKETKEKVEK KKEEKKDEQPKKEEKKEVKKSETPKENKNNEEVEKTQKKESQDSEIKTIKTKKEPVEQLT NEEVKAKLNNEIREIEGTYTP >gi|296155529|gb|ADVK01000004.1| GENE 70 65407 - 66642 1765 411 aa, chain - ## HITS:1 COG:FN1014 KEGG:ns NR:ns ## COG: FN1014 COG0285 # Protein_GI_number: 19704349 # Func_class: H Coenzyme transport and metabolism # Function: Folylpolyglutamate synthase # Organism: Fusobacterium nucleatum # 1 411 5 415 415 764 99.0 0 MHIDDLLEELYAYSMFSIRLGLDNIKEICKYLGNPEKSYKVIHITGTNGKGSVSTTVERI LLDAGYKIGKYTSPHILKFNERIIFNDKYISDEDVAKYYERVKKIIDEHNIQATFFEVTT AMMFDYFKDMKADYVILEAGMGGRYDATNICDNIVSVITNVSLEHTEYLGDTIYKIAKEK AGIIKNCPYTIFADNNPDVKKAIEETTDKYVNVLEKYKDSTYKLDFNTFLTNIFIDGNKY EYSLFGDYQYKNFLCAYEVVKYLGVDENIIKEAAKKVIWQCRFEVYSKNPLVIFDGAHNL AGVEELIKIVKQHFSKDEVTVLVSILKDKDRVSMFRKLNEISSNIVLTSIPDNPRASTAK ELYDYVENKKDFEYEEDPIKAYNLALSKKRKLTICCGSFYILIKLKEGLNG >gi|296155529|gb|ADVK01000004.1| GENE 71 66654 - 67355 982 233 aa, chain - ## HITS:1 COG:FN1015 KEGG:ns NR:ns ## COG: FN1015 COG0775 # Protein_GI_number: 19704350 # Func_class: F Nucleotide transport and metabolism # Function: Nucleoside phosphorylase # Organism: Fusobacterium nucleatum # 1 233 5 237 237 412 100.0 1e-115 MRIGIIGAMHEEIVELKNSMIDINEVQISNLKFYEGKLCSKDIVLVESGIGKVNAAISTT LLISQFKVEKIIFTGVAGAVNPEIKVTDIVIGTDLVESDMDVTAGGNYKLGEIPRMKSSY 
FKADPYLFTLAESVASKLYGTDKIHKGRIISRDEFVASSEKVNKLREIFSAECVEMEGAA VAHVCEIMNIPFVVIRSISDKADDEAGMTFDEFVKIAAKHSKSIVEGILSIIK >gi|296155529|gb|ADVK01000004.1| GENE 72 67475 - 68371 1113 298 aa, chain + ## HITS:1 COG:FN1016 KEGG:ns NR:ns ## COG: FN1016 COG1560 # Protein_GI_number: 19704351 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lauroyl/myristoyl acyltransferase # Organism: Fusobacterium nucleatum # 73 298 1 226 226 421 99.0 1e-117 MYYIQYIVARFFIFLLLLFPERLRFKFGNFLGTVAYKLIKSRRLTALINLRMAFPEKSEE EIEKIAKKSFKIMIKAFLCSLWFEKYLINPKNIKIINQESMENAYKKGKGVMAATMHMGN MEASTVSAGEHKIITVAKKQRNPFINNYITKLRGKANYMEVIEKNEKTSRVLISKLKEKK IYALFSDHRDKGAIVNFFGKETKAPSGAVSMALKFEIPFVLVYNTFNEDNTITVYVTDEI ELKRTGNFKEDVQNNVQYLINIMEDVIRKYPEQWMWFHDRWNNFREYKKHLKNRGNKK >gi|296155529|gb|ADVK01000004.1| GENE 73 68368 - 69189 1158 273 aa, chain + ## HITS:1 COG:no KEGG:FN1017 NR:ns ## KEGG: FN1017 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 17 272 17 272 272 427 99.0 1e-118 MKKSLKKALLILLTLLSMVFIVACGEKASKKEVVEKFVENSKNMKSADVTMTMKMEQKNS TNPAGISMEATGNISMILEPNLAMKMDLVIPFVNNKLSMYVKDDYSYVQSPTDNQWIKQS NKGFEEQFKKAYAQSNALYDFFMKNLDKIDLSEKDGNYLLTIKDFKEILKKEFSALDPTG KGLDGFEDLILTFTVDKKTFLPVNFTMVGAVNEGGIKMNFSFDVKYSNINNVKEIIIPKE ALEAKEEKSIEEQLQETNKIENTEKDDKVDKKF >gi|296155529|gb|ADVK01000004.1| GENE 74 69216 - 69494 242 92 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327352|ref|ZP_06869904.1| ## NR: gi|296327352|ref|ZP_06869904.1| hypothetical protein HMPREF0397_0097 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_0097 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 92 1 92 92 125 100.0 1e-27 MISIMSYSEDFILPKKEIIEKFTENSKNINSMNIVVEDTIINKNNNNLIIYKEASLILKP FSMELTVKMPSFDIYVYAKDGYIYTAESFDFN >gi|296155529|gb|ADVK01000004.1| GENE 75 69516 - 69665 140 49 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|289765197|ref|ZP_06524575.1| ## NR: gi|289765197|ref|ZP_06524575.1| conserved hypothetical protein [Fusobacterium sp. 
D11] conserved hypothetical protein [Fusobacterium sp. D11] # 1 48 134 181 264 76 93.0 6e-13 MVKEFQASVYNQNEVYDILKNNIDKIELEEIRGNYIITILDPYIIKSIY Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:08:45 2011 Seq name: gi|296155527|gb|ADVK01000005.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00006, whole genome shotgun sequence Length of sequence - 1007 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 1 - 1006 1183 ## COG2831 Hemolysin activation/secretion protein Predicted protein(s) >gi|296155527|gb|ADVK01000005.1| GENE 1 1 - 1006 1183 335 aa, chain + ## HITS:1 COG:FN0292 KEGG:ns NR:ns ## COG: FN0292 COG2831 # Protein_GI_number: 19703637 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Hemolysin activation/secretion protein # Organism: Fusobacterium nucleatum # 40 335 8 303 350 519 95.0 1e-147 FPQRENNILNIRDLDQGIDNLGDNSKIDIKASDKNEYSNIYIKRDNKPISFGINYNDLGQ FDTSRHRLRYFLGTHNIFGLNESLDFSYQNKLQRQYKERDTKNFSFGVSVPFKYWTFSYS YDNSEYLRTIKIQNTKYRATGKTKNQTFGLRKMLHRNENYKIDIGARITLKDSKNYIDDL RLVSSSRKLSVLTVDTTYTGRIFSGLLSTNVGVSFGLERFAANKDKEEWFRNEYTPKAQF RKYNVNISWYKPIDKFYYKINLGGQYSKDILYSQEKIGIGDDTTVRGFKDESTQGDKGFY IRNEIGYKGNQFLEPYIAYDYGRVFNNKVNEDKVE Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:08:51 2011 Seq name: gi|296155512|gb|ADVK01000006.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00007, whole genome shotgun sequence Length of sequence - 15385 bp Number of predicted genes - 19, with homology - 14 Number of transcription units - 8, operones - 5 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 1/0.000 - CDS 2 - 1913 2404 ## COG1404 Subtilisin-like serine proteases - Prom 2099 - 2158 10.4 - Term 2111 - 2163 7.9 2 1 Op 2 1/0.000 - CDS 2173 - 2700 675 ## COG2110 Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 - Prom 2881 - 2940 6.6 3 1 Op 3 1/0.000 - CDS 2985 - 3287 186 ## COG1115 Na+/alanine symporter - Prom 3434 - 3493 7.3 - Term 3288 - 3351 -0.1 4 1 Op 4 . - CDS 3565 - 3780 404 ## COG0584 Glycerophosphoryl diester phosphodiesterase - Prom 3858 - 3917 8.6 - Term 3882 - 3927 5.0 5 2 Op 1 . - CDS 3945 - 4319 384 ## FN1956 hypothetical protein 6 2 Op 2 . - CDS 4367 - 4615 376 ## FN1957 hypothetical protein - Prom 4710 - 4769 9.8 + Prom 4631 - 4690 4.9 7 3 Tu 1 . + CDS 4722 - 4949 544 ## - TRNA 4791 - 4865 72.4 # Gln TTG 0 0 - Term 4745 - 4775 1.2 8 4 Op 1 . - CDS 4903 - 5034 402 ## - TRNA 4906 - 4989 68.7 # Leu TAG 0 0 - TRNA 5002 - 5077 94.1 # Lys TTT 0 0 9 4 Op 2 . - CDS 5040 - 5159 539 ## - Prom 5342 - 5401 10.9 - TRNA 5081 - 5156 74.0 # His GTG 0 0 + Prom 4974 - 5033 2.6 10 5 Tu 1 . + CDS 5117 - 5431 1074 ## - TRNA 5167 - 5242 93.2 # Gly TCC 0 0 - TRNA 5250 - 5326 82.1 # Pro TGG 0 0 11 6 Op 1 13/0.000 - CDS 5434 - 7836 3822 ## COG0457 FOG: TPR repeat 12 6 Op 2 . - CDS 7848 - 8885 1550 ## COG0457 FOG: TPR repeat - Term 8902 - 8945 1.9 13 7 Op 1 . - CDS 8953 - 9165 443 ## FN1966 hypothetical protein 14 7 Op 2 . 
- CDS 9241 - 10191 1296 ## FN1967 hypothetical protein 15 7 Op 3 35/0.000 - CDS 10204 - 10977 175 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 16 7 Op 4 33/0.000 - CDS 10974 - 11999 955 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 17 7 Op 5 4/0.000 - CDS 12002 - 12844 1079 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component - Prom 12895 - 12954 5.7 18 7 Op 6 . - CDS 13072 - 15045 2745 ## COG1629 Outer membrane receptor proteins, mostly Fe transport - Prom 15082 - 15141 6.1 19 8 Tu 1 . - CDS 15168 - 15272 61 ## - Prom 15325 - 15384 8.8 Predicted protein(s) >gi|296155512|gb|ADVK01000006.1| GENE 1 2 - 1913 2404 637 aa, chain - ## HITS:1 COG:FN1950_1 KEGG:ns NR:ns ## COG: FN1950_1 COG1404 # Protein_GI_number: 19705252 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Subtilisin-like serine proteases # Organism: Fusobacterium nucleatum # 93 590 1 498 498 942 99.0 0 MREQILKNKLILIALSAVLLVSCGGSGGGGSSNAPINPGNNVSPTPKKSGIEDKINPTNP ESAVNPSVPTPSKPTNPTNNVKRESLEELRARMDETKVREFIFNEQNNSIENIPMDNRVL NGDKVKVGVLDGDFINKKNYLENKYGNIEILERNYNPKHSNHGELVLKVLREKNKLGIIA GSIGENDDFSGKEGTTVIPKLETYEKVLEKFNPNQKVKVFSQSWGVPETIGSYISKSEEE RMRAIAPGQTSSEGRKILDFYKKEVNKDTLFIWANGNTLVNNQNQVVLFNDAYYQGGLPH LYSELEKGWITVVGVKKKDGNMLNKHYTDTAHLAYPGNAKWWAISADAEVVDSKGVTHRG SSFAAPRVAKVAALVAEKYDWMIADQVRQTLFTTTDRTNIDENETYIRNIVTEPDEKYGW GMLNRERALKGPGAFINIRRQYQYSPSSNNIFKADITENKVSYFENDIFGDGGLEKLGKG TLHLTGNNSYERGSIVKEGTLEIHKIHANQINVENKGTLVLHSKAIIGYKVNSNSVIDEK EITASNITAKNLDNKGTVKVTGTTAVIGGDYIAHSGSTTEMDFSSKVRVLGKIDMQGGAI TLSSNRYTTLRETATIMEASNVQGNIANVETNGMRTA >gi|296155512|gb|ADVK01000006.1| GENE 2 2173 - 2700 675 175 aa, chain - ## HITS:1 COG:FN1951 KEGG:ns NR:ns ## COG: FN1951 COG2110 # Protein_GI_number: 19705253 # Func_class: R General function prediction only # Function: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 # Organism: 
Fusobacterium nucleatum # 1 175 1 175 175 336 100.0 1e-92 MYKNIIKLISGDITKIPEVEAIVNAANSSLEMGGGVCGAIFKAAGSELAQECKEIGGCNT GEAVITKGYNLPNKYIIHTVGPRYSTGENREAERLASAYYESLKLANEKGIRRIAFPSIS TGIYRFPVDEGAKIALTTAIKFLDKNPSSFDLILWVLDEKTYIVYKEKYKKLLEI >gi|296155512|gb|ADVK01000006.1| GENE 3 2985 - 3287 186 100 aa, chain - ## HITS:1 COG:FN1953 KEGG:ns NR:ns ## COG: FN1953 COG1115 # Protein_GI_number: 19705255 # Func_class: E Amino acid transport and metabolism # Function: Na+/alanine symporter # Organism: Fusobacterium nucleatum # 1 100 1 100 100 139 100.0 2e-33 MVVVLGGLKKLANISTLLVPIMSIFYVVVGLLVILLNIQQVPSVIKEIFTQAFSMKAVAG GTGGYIIARAMQYGIIMVCIQMKLEKEQQHLYFYFSVSQL >gi|296155512|gb|ADVK01000006.1| GENE 4 3565 - 3780 404 71 aa, chain - ## HITS:1 COG:FN1954 KEGG:ns NR:ns ## COG: FN1954 COG0584 # Protein_GI_number: 19705256 # Func_class: C Energy production and conversion # Function: Glycerophosphoryl diester phosphodiesterase # Organism: Fusobacterium nucleatum # 1 71 1 71 71 118 95.0 3e-27 MTKNFAHRGFSGKYSENTMLTFQKAIEVGAELKKNNIEINTWTVNGKDNINDLIDKKVDI LIGNYPDLVKK >gi|296155512|gb|ADVK01000006.1| GENE 5 3945 - 4319 384 124 aa, chain - ## HITS:1 COG:no KEGG:FN1956 NR:ns ## KEGG: FN1956 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 124 1 124 124 200 100.0 1e-50 MLEILNKSLNGILLGTKRNEIGEKILNDPNYFLEFDRKNKIESEASLITISVLDRKEFSL NGKIINFRNFSKFIKYEKNIVEEEDNAYSYIFPEHNLTLYVDYINQNFMQILIYDDSLKD LYER >gi|296155512|gb|ADVK01000006.1| GENE 6 4367 - 4615 376 82 aa, chain - ## HITS:1 COG:no KEGG:FN1957 NR:ns ## KEGG: FN1957 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 82 1 82 82 139 100.0 4e-32 MNKFSIHGTEEGNTTSIKLDEIAILADPETLLKIGEFIIKTAHVMKGYEVDYSQLQDEIS DFDNKNNTDIIIYNQDYDYKCD >gi|296155512|gb|ADVK01000006.1| GENE 7 4722 - 4949 544 75 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSLVIFIFQTKKLELFSSIKFYIWLGWLDSNQRITESKSVALPLGDTPIIRLKKAYWLYY FMVRRERLELSRLGH >gi|296155512|gb|ADVK01000006.1| GENE 8 4903 - 5034 
402 43 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSQVQILYDAPSFIYAGMAELADALDLGSSVPDVRVQVSLSAP >gi|296155512|gb|ADVK01000006.1| GENE 9 5040 - 5159 539 39 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVTVVQLVERQFVALVVASSSLVGHPICVISSVGRAHDF >gi|296155512|gb|ADVK01000006.1| GENE 10 5117 - 5431 1074 104 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MPQTGALPTELRSPYYSGAGNEVRTRDIQLGRLTLYQLSYSRTLMVGIARFELAAPCSQG RCATGLRYIPTYILIPRYYSPFFLFCQAIFLHFRGYKNKVLKNK >gi|296155512|gb|ADVK01000006.1| GENE 11 5434 - 7836 3822 800 aa, chain - ## HITS:1 COG:FN1964 KEGG:ns NR:ns ## COG: FN1964 COG0457 # Protein_GI_number: 19705260 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 92 800 1 709 709 1234 99.0 0 MKEEILQKIESLYDLEKHQEIIDMIEALPAEQLNTELISELGRAYNNTQQFEKALEILKS IEFEEANNFRWNCRIAYSYYFLDDFVNAEKYLLIANKLNPEDDFTCTLLIETYISLSRVE DENGNHDKAIEYALEAKKYVRDEEGENNADSFLAWLYDRYGNYTEAEMLLKNIINKSKND EWLYSELGYCLAEQGKQEEALESYFKAIELNRNDAWIFTRIGMCYKNMDRKEEAIEYYLK ALEQKEDDIFIMSDIAWLYDSLGEFEKALKYLERLEELGQNDAWTSTEYGYCLAKLKRFD EAIVKINRALEAEDEDKDTAYIYSQLGWCQRHLEKYDEAIETFLKAKKWARNDAWINIEL GHCYKAKNEKEKALEFYLKAEKFDKNDISLLSDIAWHYDALDRNEEALKYIKRVVRLGRD DAWINEEYGACLSGLGKYKEAIKKYEYALSLDEEGKDERYINSQLGWCYRQLEEYKKAIK FHKKAKELGRNDIWINMEIGMCYAKLEEYEKAAENYLIAYEMDRDDIFTLTELGWVNNAM EKYDDAIEFLLKAEKLGRNDEWINTEIGLNLGRSGKTQEGIERLEKSLTMVEDDDIEQKI FINSEIGWLYGRLEEPNVEEALRYLTVAKELGRDDEWLNSELGFELGYNPDTREEALKYF ERAIELGRNDAWVWEMRGTLLFDLRKYEEALDSFRKAYALNDDGWYLYSIGRCLRRLERY EEALENLLNSRQISLNEDDVVDGEDLELAFCYIGMGDKEKAKEYLKSAKDSIEKQGTLND YIKEEIDEIEKGILSLARLS >gi|296155512|gb|ADVK01000006.1| GENE 12 7848 - 8885 1550 345 aa, chain - ## HITS:1 COG:FN1965 KEGG:ns NR:ns ## COG: FN1965 COG0457 # Protein_GI_number: 19705261 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 1 345 1 345 345 615 100.0 1e-176 MDEKFWEKINTFGENGEFDKIVKEIKKLPEDKLDIEIINVLGRAYMNLGDYENALDTYLS YIGKAKEDVTNVDIWLYSECGWLCNEVGDYEQGLKYLLEAEKLGRDDEWLNTEMGQCLGR 
LERAEEGLERLKKSLKLIETEAPENINEKIFINSEIGYLYGFLENSEEALKYFHIAKDLG RKDDWIYMHLWFNLERSKGKEEALKYFENEAKTDDKNAIVWASLGQIYMNFFRNYEEAEK VFKKAFGLSGDGLYLYNRGMVLRILGRYKEAVEVLLQSRKISVQEGDVTDGEDLDLVRCY IALKDKENAKKYLEFAKEGMENIPEEHVDEFEDALKELEDLIAKI >gi|296155512|gb|ADVK01000006.1| GENE 13 8953 - 9165 443 70 aa, chain - ## HITS:1 COG:no KEGG:FN1966 NR:ns ## KEGG: FN1966 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 70 1 70 70 97 100.0 2e-19 MNRDAKFINFSEEHELDYILKKYGKETNKENRDLLKEFGKKAKEFLGKTMLEHDEFYKYL ADNSLVSKLK >gi|296155512|gb|ADVK01000006.1| GENE 14 9241 - 10191 1296 316 aa, chain - ## HITS:1 COG:no KEGG:FN1967 NR:ns ## KEGG: FN1967 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 316 1 316 316 546 99.0 1e-154 MYKLLNIADFYSNEELEKDMQYFSEKYGFDGFELIKFFDGDNSPLKKYIKGYHMRFFPSW MELYLEDFNSLYDELKDEKYFKSLCGGYSKKELIEYYKRELKIAKELEVEYMVFHACNVK VTEAMTYDFKYSDRKVLNAVISIINEIFEDGEYGFKLLFENLWWSGLRLTNKEEIEYLLN GVKYKNVGFILDTGHMINNNRDIKNSKEGIEYIKKNIKNIGEYKNLIYGIHLNYSLSGEY VNRAIKENKEKNLNIEEIMNNVYQHVGSIDYHDPFENEEIIDIISSLPIKYLVFELIGNT REESEDKIQRQWKIFN >gi|296155512|gb|ADVK01000006.1| GENE 15 10204 - 10977 175 257 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 218 1 219 223 72 25 2e-12 MKNILEVKNISYSVGENKILKDISFKCQSGEIIGIIGPNGSGKTTLLKTINGINPISSGD ILLNDKSTKEYTEKELARDISFMNQNTNIEFDFPCIDIVVLGRYPYLERFQEYSKKDMEL AEKYMELTDTLKFKDKSILQLSGGERQRVLFAKILTQESQVILLDEPTASLDMRHEEDLL KEVSKEKDKDKIIILVIHNLRTAIKYCSRLILLSNGNIIKDGTVEEVITEENLNNIFGIK TKVYYNEISKSLDFCII >gi|296155512|gb|ADVK01000006.1| GENE 16 10974 - 11999 955 341 aa, chain - ## HITS:1 COG:FN1969 KEGG:ns NR:ns ## COG: FN1969 COG0609 # Protein_GI_number: 19705265 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Fusobacterium nucleatum # 1 341 1 341 341 528 100.0 
1e-150 MKYRINFNLFLFFLLIGIIIFSLFYGAVRVPISDVIKIILNKTGLFNFEISKQSYIPIVF FVRFPRIMVAVIVGGALALCGCTMQSLLKNPIVDSGIIGISSGASLGAVIAVSLGLTATN IFAMPIFSGTFALIISAIIYKISTLRGKTDNLLLILSGIAIGSFVGAITSVILTSLAETE MKEYIFWAMGSLNGRRWEHFFFGLIPITILSPILFYYGKELNILLLGEEEAKSLGINIKK IRGKILIIIALLTAISVCISGNITFVGLIVPHILRKIIGSDNRKLLKSSFLAGACFLTFG DLLSRVVLAPKEISVGIVTALIGAPYFIYLIVKIRREGKTL >gi|296155512|gb|ADVK01000006.1| GENE 17 12002 - 12844 1079 280 aa, chain - ## HITS:1 COG:FN1970 KEGG:ns NR:ns ## COG: FN1970 COG0614 # Protein_GI_number: 19705266 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 280 1 280 280 485 99.0 1e-137 MLFSFTIVNAKGVQVKKYNCIVSLTLSGDEMLLGLVPENRIAGLSGKINEDKEISNIVDK AKKFPKVEGNEEVLMSLEPDLIIVADWLSKRITDIGAITGAKVYFYKTPNSYEEQKKLIR DLANLVEEKENGEKLIKNMDDRLKALQNKIAKNYKGAKPRILMYTSFSTTSGKNTTFNDM VKLINGVNVVAEAGIDGFKDISKEKVIELNPDIIIVPIAKKYDNVNKISKLFFEDPSFKN VKAIKNKKVYFIQYKDITPTSQYMINSIEELAKVVYQFKE >gi|296155512|gb|ADVK01000006.1| GENE 18 13072 - 15045 2745 657 aa, chain - ## HITS:1 COG:FN1971 KEGG:ns NR:ns ## COG: FN1971 COG1629 # Protein_GI_number: 19705267 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Fusobacterium nucleatum # 1 657 1 657 657 1223 98.0 0 MKKKFMLLALIVLGGMCAFAEESPVLELKQTVVTSDSFGTPVRETAKNMTVINAKEIKEK GAKTIADALRGVPGVVVRQMDGASPMLDLRGSGATSQFNTVILLDGIPVSGLAGFNLNTV PVDEIEKIEVLQGAGAVMYGDGAIGGVVNIITKAPTKKVVYGGAGLEVGSWRTIRENVYL GGKIGDKFLLNASYSGNSSKDYRDRSPQYENKKDKRDSLWLRGKYLLDNGSIAINYNHSE DKDYYTGSLSKEQFDKNPRQVGSWSGYTYGINDIINAKYNQKINDKIDIFLTSGYYHNKN KFQKNSTSEYFIRPEVKLTYAKDSYVTLGLDYRDGKRDFKDDVFVNGVNQKAPDDKRESF AGYVMNKTTFGNWQFTQGYRREKVKYEYSSKVYDPMTWQLKEIKPKSADYSNNDSFEFGV NYLYSDTGNVFFNYARALRTPTIQDAGAWYGPVKTQKNDIFEIGLRDAYKNTSISTSIFY VNSKNEIYYDKTNPFSSNNQNFDGKVRRIGAQLSLAHYFDKLTLRERVSYIVPKVTSGIY DGKEFAGVSRWTANVGATYNITKGLTANIDGYYQSNAYAEDDFDNYFSKGNNYVTVDASL 
SYAFENGIELYTGVSNLFDKKYANAVTSTRSTFGAGPRKVYYPANGRSVYAGIKYTF >gi|296155512|gb|ADVK01000006.1| GENE 19 15168 - 15272 61 34 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLIGLIKEVGCESHTAMLLYCGRNYNSHWETGKV Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:09:39 2011 Seq name: gi|296155486|gb|ADVK01000007.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00008, whole genome shotgun sequence Length of sequence - 29204 bp Number of predicted genes - 26, with homology - 25 Number of transcription units - 7, operones - 5 average op.length - 4.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 310 - 376 1.5 1 1 Op 1 1/0.000 - CDS 395 - 1216 902 ## COG3708 Uncharacterized protein conserved in bacteria - Prom 1236 - 1295 7.2 2 1 Op 2 1/0.000 - CDS 1299 - 1946 755 ## COG4122 Predicted O-methyltransferase 3 1 Op 3 1/0.000 - CDS 1940 - 3247 1467 ## COG0144 tRNA and rRNA cytosine-C5-methylases - Prom 3267 - 3326 12.4 4 2 Op 1 12/0.000 - CDS 3328 - 3834 459 ## COG0602 Organic radical activating enzymes 5 2 Op 2 3/0.000 - CDS 3839 - 6025 2714 ## COG1328 Oxygen-sensitive ribonucleoside-triphosphate reductase - Prom 6091 - 6150 18.6 - Term 6113 - 6172 10.2 6 3 Op 1 8/0.000 - CDS 6284 - 7354 1447 ## COG3839 ABC-type sugar transport systems, ATPase components 7 3 Op 2 21/0.000 - CDS 7369 - 9051 1936 ## COG1178 ABC-type Fe3+ transport system, permease component 8 3 Op 3 1/0.000 - CDS 9070 - 10128 1660 ## COG1840 ABC-type Fe3+ transport system, periplasmic component - Prom 10200 - 10259 8.7 - Term 10182 - 10235 9.1 9 4 Op 1 35/0.000 - CDS 10264 - 11049 227 ## PROTEIN SUPPORTED gi|90020817|ref|YP_526644.1| ribosomal protein S16 10 4 Op 2 33/0.000 - CDS 11052 - 12047 1044 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 11 4 Op 3 1/0.000 - CDS 12064 - 13116 1330 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component 12 4 Op 4 5/0.000 - CDS 13139 - 14794 1777 ## COG2710 Nitrogenase molybdenum-iron protein, alpha 
and beta chains 13 4 Op 5 1/0.000 - CDS 14797 - 15627 943 ## COG1348 Nitrogenase subunit NifH (ATPase) 14 4 Op 6 35/0.000 - CDS 15655 - 16419 252 ## PROTEIN SUPPORTED gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 15 4 Op 7 33/0.000 - CDS 16422 - 17468 1023 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 16 4 Op 8 . - CDS 17479 - 18513 1430 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component 17 4 Op 9 17/0.000 - CDS 18555 - 19166 270 ## PROTEIN SUPPORTED gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 18 4 Op 10 44/0.000 - CDS 19196 - 20026 238 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 19 4 Op 11 49/0.000 - CDS 20031 - 20891 790 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 20 4 Op 12 38/0.000 - CDS 20881 - 21858 234 ## PROTEIN SUPPORTED gi|167855436|ref|ZP_02478201.1| 30S ribosomal protein S21 21 4 Op 13 . - CDS 21892 - 23529 2184 ## COG0747 ABC-type dipeptide transport system, periplasmic component - Prom 23567 - 23626 4.3 - Term 23554 - 23591 -0.8 22 5 Tu 1 . - CDS 23672 - 23761 198 ## - Prom 23871 - 23930 9.5 - Term 23917 - 23963 4.1 23 6 Op 1 13/0.000 - CDS 23996 - 25774 2812 ## COG0173 Aspartyl-tRNA synthetase 24 6 Op 2 1/0.000 - CDS 25793 - 27034 1716 ## COG0124 Histidyl-tRNA synthetase 25 6 Op 3 . - CDS 27046 - 28269 1431 ## COG2256 ATPase related to the helicase subunit of the Holliday junction resolvase - Prom 28290 - 28349 14.5 + Prom 28314 - 28373 12.2 26 7 Tu 1 . 
+ CDS 28429 - 29196 875 ## FN0296 putative cytoplasmic protein Predicted protein(s) >gi|296155486|gb|ADVK01000007.1| GENE 1 395 - 1216 902 273 aa, chain - ## HITS:1 COG:FN0315_2 KEGG:ns NR:ns ## COG: FN0315_2 COG3708 # Protein_GI_number: 19703660 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 131 273 1 143 151 278 98.0 5e-75 MNIIKFFNQTIDYIETTLEDKIDEKKIVQLSGYSYPMFSRIFSILTAYSLSEYIRLRRLT KAAFDLRKNEEKVIDIAFKYQYESPDSFSLAFKKYHNCTPMEVKQGKDFKIFSPIHLSLT IKGGKAMEISIKKKEKFIVGGIKAENIETFQCPKVWEELFKKVSFNDLEKLGNGNSYGVC YETTSSKSINYIVAFDVKNVSEAKKLGLDTMEIPEAEYAVVKLKGKIPNCIHEGWKYVME VFFPEHGYKHAGTPDFELYSEGDMESDNYEMEL >gi|296155486|gb|ADVK01000007.1| GENE 2 1299 - 1946 755 215 aa, chain - ## HITS:1 COG:FN0314 KEGG:ns NR:ns ## COG: FN0314 COG4122 # Protein_GI_number: 19703659 # Func_class: R General function prediction only # Function: Predicted O-methyltransferase # Organism: Fusobacterium nucleatum # 1 215 1 215 215 367 99.0 1e-102 MLEELKEANEYISSKIDKYKSPNLELIKEIEKDAEINNVPIISKEIREYLKFIIRTNKNI KNILEIGTATGYSGIIMSEEIQGRNGTLTTIEIDEDRFKIAQSNFEKSNLKGIEQILGDA TEEIEKLNKNFDFIFIDAAKGQYKKFFEDSYKLLNEAGIVFIDNILFRGYLYKESPKRFK TIVKRLDEFVNYLYENFDFVLLPISDGVGIIHKPF >gi|296155486|gb|ADVK01000007.1| GENE 3 1940 - 3247 1467 435 aa, chain - ## HITS:1 COG:FN0313 KEGG:ns NR:ns ## COG: FN0313 COG0144 # Protein_GI_number: 19703658 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA and rRNA cytosine-C5-methylases # Organism: Fusobacterium nucleatum # 1 435 1 435 435 745 97.0 0 MNVKKVAINLISQVDRGAYSNIALNETFKTLNVNSKEKAFITEIFYGVIRNKKFLDYIIE KNTKEIKKEWIRNLLRISIYQITFMDSDDKGVVWEGTELAKKKYGIPISKFINGTLRNYL RNKDLELKRLDDEKNYEVLYSIPKWFYNTLEKQYGNENLKQAITSLKKIPYLSVRVNKLK YSEEEFEEFLKEKDIQIIKKVDTVYYVNSGLIINSEEFKTGKIIAQDASSYLAAKNLDAM PNELVLDICAAPGGKTAVLAENMKNMGEIIAIDIHQHKIKLIDTNMKKLRIDIVKAIVMD ARNVNKQGRKFDKILVDVPCSGYGVIRKKPEILYSKNRENVEELAKLQLEILNSAADILK DGGELIYSTCTITDEENTNNIKKFLEERKEFKVEKLYIPENVLGDFDSLGGFCINYKEEI 
MDNFYIIKLKKGEKC >gi|296155486|gb|ADVK01000007.1| GENE 4 3328 - 3834 459 168 aa, chain - ## HITS:1 COG:FN0312 KEGG:ns NR:ns ## COG: FN0312 COG0602 # Protein_GI_number: 19703657 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Organic radical activating enzymes # Organism: Fusobacterium nucleatum # 1 168 1 168 168 317 100.0 7e-87 MNYSGIKYADMINGKGIRVSLFVSGCTHCCKNCFNQETWNENYGKKFTEKEENEIIEYFK KYGKTIKGLSLLGGDPTYPKNIKPLLKFIKKFKENLPDRDIWIWSGFTWEEILEDENRFS LIKECDVLIDGKYVDSLKDLNLKWKGSSNQRVIDIKKSLEKNKVIEYI >gi|296155486|gb|ADVK01000007.1| GENE 5 3839 - 6025 2714 728 aa, chain - ## HITS:1 COG:FN0311 KEGG:ns NR:ns ## COG: FN0311 COG1328 # Protein_GI_number: 19703656 # Func_class: F Nucleotide transport and metabolism # Function: Oxygen-sensitive ribonucleoside-triphosphate reductase # Organism: Fusobacterium nucleatum # 1 728 1 728 728 1467 99.0 0 MKRVIKRDGSVVEFDRSRIINAIKKTFEQASREPNMKLIEKIASQVEDLPDKVLAVEQIQ DIVVKKLMGSSEKDIAMSYQSYRTLKAEIREKEKGIYRQIGELVDASNEKLLSENANKDA KTISVQRDLLAGISSRDYYLNKIVPEHIKLAHIKGEIHLHDLDYLLFRETNCELVNIEAM LKGGCNIGNAKMLEPNSVDVAVGHIVQIIASVSSNTYGGCSIPYLDRALVRYIKKTFKKH FLRGAKYIDDLNEKQIEELKKEDLEYSNEVIKNKYPKTYEYSVDMTEESVKQAMQGLEYE INSLSTVNGQTPFTTVGIGTETSWEGRLVQKYVLKTRMAGFGAKKETAIFPKIVYAMCEG LNLNEGDPNWDISQLAFECMTKSIYPDILFITPEQLKNETVVYPMGCRAFLSPWKDENGK EKYAGRFNIGATSINLPRIAIKNRGDEEGFYKELDRILEICKDNCLFRAKYLENTVAEMA PILWMSGALSEKNQKDTIKDLIWGGYSTVSIGYIGLSEVSQLLYGKDFSESEEVYEKTFN ILKYIADKVLEYKQKYNLGFALYGTPSESLCDRFARVDKQEFGDIKGITDKGYYDNSFHV SSRINMSPFEKLRLEALGHKYSAGGHISYIETDSLTKNLDAIPDILRYAKMVGIHYMGIN QPVDKCHICGYKGEFTATKEGFTCPQCGNHDSNEMSVIRRVCGYLSQPNARPFNKGKQEE IMHRVKHS >gi|296155486|gb|ADVK01000007.1| GENE 6 6284 - 7354 1447 356 aa, chain - ## HITS:1 COG:FN0310 KEGG:ns NR:ns ## COG: FN0310 COG3839 # Protein_GI_number: 19703655 # Func_class: G Carbohydrate transport and metabolism # Function: ABC-type sugar transport systems, ATPase components # Organism: Fusobacterium nucleatum # 1 356 1 356 356 682 99.0 0 
MASVTITGVTKSFGNVTVLQEFNQKFEDGEFITLLGPSGCGKTTMLRLIAGFEKPSSGEI YIGDKLVSSEKEFLPPEKRGIGMVFQSYAVWPHMNVFDNIAYPLKIQKINKNEIEERVKQ VLKIVHLEQYKDRFPSELSGGQQQRVALGRALVAQPEILLLDEPLSNLDAKLREEMRYEI KEITKKLKITVIYVTHDQIEAMTMSDRIVLINKGEVQQVAPPQEIYSKPKNMFVANFVGK VDFITGKVEGSKILLDNSNNQILSNTSSFKGKVVVAIRPENAILSDDGEITGKVYSKFYL GDCNDLRVEIGNGNILRIIARASTYNTLNEGDEVKIKILDYFVFEDDGKDQIKIMT >gi|296155486|gb|ADVK01000007.1| GENE 7 7369 - 9051 1936 560 aa, chain - ## HITS:1 COG:FN0309 KEGG:ns NR:ns ## COG: FN0309 COG1178 # Protein_GI_number: 19703654 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+ transport system, permease component # Organism: Fusobacterium nucleatum # 1 560 1 560 560 942 99.0 0 MVGQKKWRIDIKWIVILAIVAFLLIFEVFPLFYLLIKSLFSGGSFSWEAYRRVYTYDLNW IALKNTIITAGFTTILGVVIAFPLAFLVGRTDMYGKKFFRTLFVVTYMVPPYVGAMAWLR LLNPNAGVLNKFLMKIFGLGTAPFNIYTTSGIVWVLTCFFYPYAFITISRAMEKMDPSLE EASRISGASPLKTLFKVTIPMMTPSIIAAGLLVFVASASSYGIPSIIGAPGQIYTVTMRI IDFVHIGSEEGLTDAMTLAVFLMLISNIILYISTFVVGRKQYITMSGKSTRPNIVELGKW RLPITIIISVFSFFVIILPFITVAITSFTVNMGKPLTLSNLSLKAWEKVFSRASIISSTT NSFLTATAAAFFGILISCVMAYLLQRTNVKGKRIPDFLITLGSGTPSVTIALALIISMSG KFGINIYNTLTIMVVAYMIKYMLMGMRTVVSAMSQVHPSLEEAAQISGANWFRMLKDVTL PLIGASIVAGIFLIFMPSFYELTMSTLLYSSNTKTIGYELYIYQTYHSQQVASALATAIL LFVILVNYILNKLTKGQFSI >gi|296155486|gb|ADVK01000007.1| GENE 8 9070 - 10128 1660 352 aa, chain - ## HITS:1 COG:FN0308 KEGG:ns NR:ns ## COG: FN0308 COG1840 # Protein_GI_number: 19703653 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+ transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 352 1 352 352 642 99.0 0 MKKKFFWLVLSLMTLFLVACGGGEKKEETTDSANANTEISGKIVIYTSMYEDIIDNVSEK LKKEFPNLEVEFFQGGTGTLQSKIIAELQANKLGCDMLMVAEPSYSLELKEKGILHAYLS KNAENLALDYDKEGYWYPVRLLNMVLAYNPDKYKKEDLALTFEDFAKREDLAGKISIPDP LKSGTALAAVSALSDKYGEEYFKNLANLKVVVESGSVAVTKLETGEATEIMILEESILKK REEENSTLEVIYPEDGIISIPSTIMTVKEDMSANKNIKAAEALTDWFLSPAGQEAIVEGW MHSVLKNPEKAPYDAKATDEILKASMPINWEKTYKDREELRKMFEKFITKAN 
>gi|296155486|gb|ADVK01000007.1| GENE 9 10264 - 11049 227 261 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020817|ref|YP_526644.1| ribosomal protein S16 [Saccharophagus degradans 2-40] # 3 238 12 244 318 92 28 4e-18 MNLEIKNGNFSYTDGNPILKDINLKIDSGEIFTILGQNGIGKTTLLKCINGVLKWNSGEV FIDNKKVDSIKDLKDIAYVPQAHSFSFSYTVRELSIMGRAKYLNIFSTPSKSDYDIVEKV LDEMGILYLKDRKCSELSGGQLQLVFLARALVGEPKILILDEPESHLDFKNQTKILRTIV QLAKKKNITCIFNTHYPEYALRISDKSMLIGKDDYIIGKTSEVINEENLKKYFGINTKIV EIKDEKQKIKSVVITDNLEKE >gi|296155486|gb|ADVK01000007.1| GENE 10 11052 - 12047 1044 331 aa, chain - ## HITS:1 COG:FN0306 KEGG:ns NR:ns ## COG: FN0306 COG0609 # Protein_GI_number: 19703651 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Fusobacterium nucleatum # 1 331 3 333 333 525 100.0 1e-149 MNIYRKKMVFLIMVLILCILVSIFLGRFFISPKMFFDVLSDSIKGVENNPIESSIIFELR IPRIIMNILVGAGLSISGVAFQGIFQNPLVSPDIISVSSGSAFGAVLGILLFGMNSYVII LALLFGILSVIITYSLSKVRGESSVLSLILSGMVITALFSALISLVKYTADPYDKLPAIT YWLMGSFSSSSYNNIKIAVFPIITGIMILYFLRWRINILSLGDEEVKALGMNPVYIRGFI IVAVTMISATCVTLTGIIGWVGLLIPHICRMYIGADNIKLIPSSCIMGAVFMLIIDGIAR TATSSEIPIGILTSLVGAPFFIIIFKKYRSW >gi|296155486|gb|ADVK01000007.1| GENE 11 12064 - 13116 1330 350 aa, chain - ## HITS:1 COG:FN0305 KEGG:ns NR:ns ## COG: FN0305 COG0614 # Protein_GI_number: 19703650 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 15 350 1 336 336 608 99.0 1e-174 MKKIFSSMFILFLFLFANINAKTVTDLTGKKVTIKDNPSRIAIVPIPWASLAYAVDGDSS KIVGMHPSAKKSYEISMLKELAPNMKNVNSVFVDNNFNINYEELALLKPDIVVVWDYQND AIEKLAKLKIPAVAIKYGTLEDVQQGIKLLGDILNRQEKAQKLINYHKDTNKYFASKTEK LANTKRKKILYIRDSQLTVATGKSVNNIMIDMAGGVNVAKDITTGNWSKVTMEEIIKWNP DIIILSNFDKILPEDIYNNKFEGQDWSKINAVKNKKVFKAPIGIYRWDAPSAETPLMIKW IAKVANPELFNDYNMRKDIKDFYLEFFNYELSDEQLNFILNSKINKGLNL >gi|296155486|gb|ADVK01000007.1| GENE 12 13139 - 14794 1777 551 aa, chain - ## HITS:1 
COG:FN0304 KEGG:ns NR:ns ## COG: FN0304 COG2710 # Protein_GI_number: 19703649 # Func_class: C Energy production and conversion # Function: Nitrogenase molybdenum-iron protein, alpha and beta chains # Organism: Fusobacterium nucleatum # 137 551 1 415 415 711 99.0 0 MNIERCLKNEKNKMLKTLLNIPENIVISIGPTGCLNVLYNEAIKENKLENLYTFPISEID MVSANHIEKLEKYIVKIISENFEKIKSIIIYLTCADLILVSDFSFLMEKIKKDYGIIVKI LERGPIAKRKIMPEKRLEKLLVKLEYELKNTSKIKDKKISDFKIEIQHIVPPITSDYSGA CSTLYGENILKILISPSGCKTPVAYDEIRNIDCSLQYSTSLNELEIVTGEIKGLKDNIKE IISQNPKIELIAIISTVVPQIIGMNLESIVENIEETLDIPCVFINTNSFENYYSGISLTL KSLAKKFMVKNKKIKNTVNIIGYSPLTFGKIEKLEELFSLIKSLDLNILAVFSDNLSLEK IKNSTSAELNLVLSYEGLALAKYMEKEFSIPYVIINVVSKYGIENTENILKRFFYKIDNS FEKLEKRDKLDDRKVMIIASPFMAINIADSLRKDFSFDNILALSLIKESRKFKKIEYLEF LNIVNTEDDLKEKIKEYKPDILISDPVYKNLINDGLTFIPLLHYGYSTRLYLELDYEYCG KKAYEYFKKFI >gi|296155486|gb|ADVK01000007.1| GENE 13 14797 - 15627 943 276 aa, chain - ## HITS:1 COG:FN0303 KEGG:ns NR:ns ## COG: FN0303 COG1348 # Protein_GI_number: 19703648 # Func_class: P Inorganic ion transport and metabolism # Function: Nitrogenase subunit NifH (ATPase) # Organism: Fusobacterium nucleatum # 1 276 1 276 276 507 99.0 1e-143 MLKIAIYGKGGIGKSTISSNLSAMISKSGKKVLHIGCDPKGDSTRNLMGRKIPTVISILK EKNNLNREDIIYKGFNGIECVETGGPEAGVGCAGRGIITTMEELEDLKVFDEERDIIIYD VLGDVVCGGFAVPMREKYADVIYIVTSSEFMSIFAANNIMKSIKNFSKMKNIKFGGLIHN QRNNNSSINILKIFADMTKSKIIGEIPFSKELIKSELNGKTIAEMYPNSNLYNNFLELSE KILSNQDDLTFSPLSEEEMEYLAAEILKKNIYYEEE >gi|296155486|gb|ADVK01000007.1| GENE 14 15655 - 16419 252 254 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 [Roseobacter sp. 
AzwK-3b] # 3 227 279 509 563 101 29 5e-21 MKLEIKNLSFSYKNKEILNNISFEVYSGTLLSILGANGAGKTTLIKCINGILKLKKGEVL IDEKNFNNKSLKEKSKIMSYVPQITSSFDIDLTVFDTVLLGRVPHKTFKFTEWDKQIALN NIKKLDLEKYLFNYVGELSGGEKQRVLIARALTQEPKILILDEPISNLDLKFQLETMKIL KNLAKEDNLIVITILHDLNFAISYSDKILFLKNGKINNFGDTKKIVTTSNIKEIFSVEID IVQFKNKNYIIPLE >gi|296155486|gb|ADVK01000007.1| GENE 15 16422 - 17468 1023 348 aa, chain - ## HITS:1 COG:FN0301 KEGG:ns NR:ns ## COG: FN0301 COG0609 # Protein_GI_number: 19703646 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Fusobacterium nucleatum # 1 348 1 348 348 513 99.0 1e-145 MNSINSIHQYNSKKIFKILTAFFILLFVSYISVFKGIANINLKRVLVTIYKNLSFNNTEP LSPREMVVFLDLRLTRVVLGNIAGFLLAICGTVMQAITENKMSSPFTTGISSAASMGAAL SILFFTGKYVYFDLITIFFAFSFGIICSFLVYGISNVKGMNKSTLILTGIAFNYLFSSGN AALQFIANEDVLSSIVNWTFGNLSGVSWNKILILFLILLIFFPYFFINRYSYNLLLTGED SATSLGVDVKKFRFLSGIIVTLITSAVVSFIGIIAFVGIIAPHISRMLIGDDHKYSIILS GIIGAFLVVFSDYIGRNLLSPIIIPIGIVISFVGIPIFIYLIINSKRG >gi|296155486|gb|ADVK01000007.1| GENE 16 17479 - 18513 1430 344 aa, chain - ## HITS:1 COG:FN0300 KEGG:ns NR:ns ## COG: FN0300 COG0614 # Protein_GI_number: 19703645 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 14 344 1 331 331 638 98.0 0 MKKSIKNFLLFLFLFISSLSFAEISFKDDVGREIVLEKPLTKVVVASRYNNELIRAIGSI KNVISVDDNTAQDRVYWKDFDPKNSIGKGQNNLNYEKIIELAPEALITPRNSSYEKDIEQ LSKAGIKVIVVTGWDNAHMPEQIERLGKVFGNEKGAKKLIEFYNKNLNEVKKRVAKVKNK KTIYWEYGEPYTTAIPGTSNDGWVNMMRVAGGINIFDDPTIKGKTIDPEKILLEDPDLIM KTTSGAAYKNTGVYTAPSQEECKNIMNEMINRSGWKDLKAIKSKNVYITTGFCSGGLGKL IGVVYTAKWLYPEEMKDIDPDKVFEEWMTMQGVKVPKGHVYKLK >gi|296155486|gb|ADVK01000007.1| GENE 17 18555 - 19166 270 203 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 [Roseobacter sp. 
AzwK-3b] # 1 197 278 494 563 108 29 4e-23 MLEIKNISFNYLKGMPILKDISLNVKSGEIVGIFGSSGCGKSTLAKIIAGILKPSKGTIL VDGKELEKEGYCSVQLIYQHPEKATNPKWKMDKILNEGWMVDNETIKKMGIKNYWLTRWP QELSGGELQRFCITRALGPKTKYLIADEITTMLDALTQVKIWKNLIATAKEKNIGMIVVT HNKFLADRICDKIINLEKLNSVD >gi|296155486|gb|ADVK01000007.1| GENE 18 19196 - 20026 238 276 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 30 252 40 269 329 96 27 2e-19 MNILEVKNLNIGFNMYDKLLNQKLHQMIFNLNVTIKEGEILAIAGSSGSGKSLMAHAILG ILPKNAVVSAEIKFKNEIVDENRLYQLRGKEITFVPQSIAYLDPLMTIEDQLMRKDIDRQ DFFKVMDTLGFTKSDLGKYPFQLSGGMARRVLIANTILSKADLIIADEPTPGLSLDLAVE VLNHFRNMANDGKGILLISHDIDLVCNVADRMAIFYGGHILETLNTKDFLKGEKYIRHPL TKAFWKALPQNDFEETDMEDIRLQCKKLNFELPILE >gi|296155486|gb|ADVK01000007.1| GENE 19 20031 - 20891 790 286 aa, chain - ## HITS:1 COG:MA1912 KEGG:ns NR:ns ## COG: MA1912 COG1173 # Protein_GI_number: 20090761 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Methanosarcina acetivorans str.C2A # 1 286 1 285 285 280 53.0 2e-75 MIDNVLYQEKSSYRKLNLRKKTLLYLIISIVFLLIILIWGGFMDKNSYGVDFSVRNNPPS LKHIFGTDWMGRDMLARTIKGLSTSLIIGVVASLISCIFAVIVGSVCAIFGKKLDKFFLW LIDLFQGMPHLILLVLISILTGKGTVGILVGVALTHWTTLSRIIRAEILSIKGEPYIVIA KKLGTANVKIASKHILPHIIPQFIVGTVLLFPHAILHEAGITFLGFGLPPEEPAIGVILS ESMRYITSGYWWLAFFPGLALMLMVFAFDAIGHNLEKIINPNTGQE >gi|296155486|gb|ADVK01000007.1| GENE 20 20881 - 21858 234 325 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|167855436|ref|ZP_02478201.1| 30S ribosomal protein S21 [Haemophilus parasuis 29755] # 66 315 43 310 320 94 28 6e-19 MKKIIWQIAKLFFLLFAISIITFLLMKLTPIDPVSSYLGADNNVTQEQYDYLVKVWGLNQ PSIVQYFNWLKGVLSGNMGDSFIYNKPVFELISKAANNTFMLMLTSWLLSGIIGFVLGIV AAAYHGRIIDQIIQKISYLFASMPTFWFAILMLMFFAVRLGWFPVGMSAPIGVIEEDVSL WHRIHHMILPAITLSILGISKIALHTREKLIDVLNSEYFLYSKVNGEKLWEFIRKHGIRN ILLPAVTIQFASISELFGGSVLVENVFSYAGLGNITKIAGVKGDMPLLLAITLVSAIFVF 
VGNLCANILYPIIDPRIREGMHNDR >gi|296155486|gb|ADVK01000007.1| GENE 21 21892 - 23529 2184 545 aa, chain - ## HITS:1 COG:MA1915 KEGG:ns NR:ns ## COG: MA1915 COG0747 # Protein_GI_number: 20090764 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Methanosarcina acetivorans str.C2A # 43 541 53 548 553 375 41.0 1e-103 MRKFLKIGISVIAISLMFSFLGCGKNSSEVKETGRIERLENKEEVIVGVGQSMLEKGFDP CKGWGNYGVTLIQSKLLDFDFENNIIKDLAENYEVSEDGKTWIFKIREDVKFSDGQKLTA KDVAFTFNKTKEIGTTFDFKLLEKAEALDDKTVKFTFSAPCSTFIYNAANLGIVAEHAYK DTNTYSLNPIGSGPYKLVSYTQGQQLILDRNEEYYGTKPKFKRLTLVTMTPDTALASIKA GDIDIVNVSEAMAQEKIENYSILATKTMDFRAISMPIIKKSEKLTEKGNPMGNDVTSDIA IRKAINYGVDREEIIENVLYGYGEVIFDFFDSLPWGIRDEIRKEFKNGNIEKANEVLDKA GWEMKDDGIREKNGIKAEFRLLYPASDDTRQSCAEAFAVQCKKIGINVIPEGSDWTEMAK RHSSDACVIGGGQYTPEVVARFYFSERIGGPWSNIVRENNSTVDEHIRAAYLATDEKVAI ENWQKALWDGKEGGSILGDAPYCTICYLEHLYFVRDGLDLGRQKLHTHARDLSLMANIEE WDFKK >gi|296155486|gb|ADVK01000007.1| GENE 22 23672 - 23761 198 29 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MRYQNEPVAVIEFFLVLYHWIFIWEGKEI >gi|296155486|gb|ADVK01000007.1| GENE 23 23996 - 25774 2812 592 aa, chain - ## HITS:1 COG:FN0299 KEGG:ns NR:ns ## COG: FN0299 COG0173 # Protein_GI_number: 19703644 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Aspartyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 592 1 592 592 1153 98.0 0 MVYRTHNLGELRLKNIGEVVTLSGWVDTKRNVSTNLTFIDLRDREGKTQIVFNNELLSEK VLEEVQKLKSESVIKVIGEVKERSNKNPNIPTGEIEIFAKEIEILNTCDTLPFQISGVDD NLSENMRLTYRYLDIRRNKMLNNLKMRHRMIMSIRNYMDKAGFLDVDTPVLTKSTPEGAR DFLVPSRTNPGTFYALPQSPQLFKQLLMIGGVEKYFQIAKCFRDEDLRADRQPEFTQLDI EMSFVEKEDVMNEIEGLAKYVFKNVTGEEANYTFQRMPYAEAMDRFGSDKPDLRFGVELK DLSDIVKNSSFNAFSSTVQNGGLVKAVVAPNANEKFSRKVISEYEEYVKTYFGAKGLAYI KLTADGIASPIAKFLSEEEMKAIIEKTEAKTGDVIFIVADKKKVVHSALGALRLRIGKDL ELINKDDFKFLWVVDFPMFDYDEEEQRYKAEHHPFTSIKAEDLDKFLAGQTEDIRTNTYD LVLNGSEIGGGSIRIFNPQIQSMVFDRLGLSQEEAKAKFGFFLDAFKYGAPPHGGLAFGI 
DRWLMVMLKEESIRDVIPFPKTNKGQCLMTEAPNIVDEKQLEELFIKSTYEK >gi|296155486|gb|ADVK01000007.1| GENE 24 25793 - 27034 1716 413 aa, chain - ## HITS:1 COG:FN0298 KEGG:ns NR:ns ## COG: FN0298 COG0124 # Protein_GI_number: 19703643 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Histidyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 413 1 413 413 781 99.0 0 MKLIKAVRGTKDIIGEEAKKYIYISNVAQKMFENYGYNFVKTPIFEETELFKRGIGEATD VVEKEMYTFKDRGDRSITLRPENTASLVRCYLENAIYAKEEISRFYYNGSMFRYERPQAG RQREFNQIGLEVFGEKSPKVDAEVIAIGYKFLEKLGITDLEVKINSVGSKASRTVYREKL IEHFKSHLDDMCEDCRDRINRNPLRLLDCKVDGEKDFYKSAPSIIDYLFEDERKHYDDVK KYLDVFGIKYTEDPTLVRGLDYYSSTVFEIVTNKLGSQGTVLGGGRYDNLLKELGDKDIP AVGFATGVERIMMLLEENYPKNTPDVYIAWLGENTSETALKIAESLRDNDIKVYIDYSEK GMKSHMKKADKLETRYCVILGEDELNKGIVLLKDFSTREQKEIKIEEIINYIK >gi|296155486|gb|ADVK01000007.1| GENE 25 27046 - 28269 1431 407 aa, chain - ## HITS:1 COG:FN0297 KEGG:ns NR:ns ## COG: FN0297 COG2256 # Protein_GI_number: 19703642 # Func_class: L Replication, recombination and repair # Function: ATPase related to the helicase subunit of the Holliday junction resolvase # Organism: Fusobacterium nucleatum # 1 407 1 407 407 793 99.0 0 MNLFQKNYKNVEPLAYKLRPKSLDDFVGQEKLLGKDGVITRLILNSTLSNSIFYGPPGCG KSSLGEIISNTLDCNFEKLNATTASVSDIRNIVETARRNIELYNKRTILFLDEIHRFNKN QQDALLSYTEDGTLTLIGATTENPYYNINNALLSRVMVFEFKALTNEDILKLIDKGLNFL NICMGDKIKEIIVDISQGDSRIALNYVEMYNNIHSQMSEDEIFSIFKERQVSFDKKQDKY DMISAFIKSIRGSDPDAAVYWLARLLDGGEDPKYMARRLFIEASEDIGMANPEALLVANA AMNACEKIGMPEVRIILAHTTIYLAISSKSNSVYEAIDGALADIKKGELQEVPMNICHDN VGYKYPHNYTDNFIKQKYMNKKRKYYKPGNNKNEKLIAEKLAKLWDE >gi|296155486|gb|ADVK01000007.1| GENE 26 28429 - 29196 875 255 aa, chain + ## HITS:1 COG:no KEGG:FN0296 NR:ns ## KEGG: FN0296 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 255 1 255 255 483 99.0 1e-135 MKQNFNEIKQNWNFYMCRVDDKPASIRLNLALSNIAPVEDYKHRISIFIKMNNPTEDGLS SNEEYPMLCDIEDEVIDRLETLEDIFAGTVKTQGRLELYVFTKNPEKSEELCKEAFKKFP 
NYQWKSYIDEDKEWDFYFNFLYPDVYSYHAIMNRSVIDNLTNQGDNLEKKREIDHWIYFS SEENINIAIKKVEELGYKILSSKKLDDEKNYPYQLNISRMDSAIYSHVNQIVWELIEIAE PLNGYYDGWGCNITK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:09:53 2011 Seq name: gi|296155469|gb|ADVK01000008.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00009, whole genome shotgun sequence Length of sequence - 15033 bp Number of predicted genes - 16, with homology - 16 Number of transcription units - 9, operones - 5 average op.length - 2.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 58 - 117 12.8 1 1 Op 1 9/0.000 + CDS 201 - 1130 1337 ## COG2984 ABC-type uncharacterized transport system, periplasmic component + Prom 1153 - 1212 5.0 2 1 Op 2 13/0.000 + CDS 1248 - 2081 1162 ## COG4120 ABC-type uncharacterized transport system, permease component 3 1 Op 3 . + CDS 2084 - 2860 234 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 + Term 2871 - 2907 3.0 4 1 Op 4 . + CDS 2936 - 3778 978 ## FN2078 DeoR family transcriptional regulator + Term 3780 - 3823 7.1 - Term 3764 - 3816 9.5 5 2 Tu 1 . - CDS 3822 - 5252 1732 ## COG1757 Na+/H+ antiporter - Prom 5285 - 5344 15.4 - Term 5359 - 5397 4.0 6 3 Op 1 5/0.000 - CDS 5439 - 5972 830 ## COG0503 Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins 7 3 Op 2 . - CDS 5985 - 7052 1599 ## COG2252 Permeases - Prom 7155 - 7214 10.5 + Prom 7140 - 7199 8.6 8 4 Tu 1 . + CDS 7233 - 7484 339 ## FN2071 hypothetical protein + Prom 7561 - 7620 18.3 9 5 Op 1 1/0.000 + CDS 7664 - 9118 2069 ## COG1492 Cobyric acid synthase + Prom 9153 - 9212 13.3 10 5 Op 2 1/0.000 + CDS 9245 - 10600 645 ## PROTEIN SUPPORTED gi|145632256|ref|ZP_01787991.1| 50S ribosomal protein L27 + Term 10658 - 10713 9.3 + Prom 10638 - 10697 8.4 11 6 Op 1 1/0.000 + CDS 10720 - 12177 1679 ## COG1078 HD superfamily phosphohydrolases 12 6 Op 2 . 
+ CDS 12199 - 12738 832 ## COG0526 Thiol-disulfide isomerase and thioredoxins + Term 12799 - 12854 10.4 + Prom 12974 - 13033 9.3 13 7 Tu 1 . + CDS 13142 - 13318 160 ## gi|237740944|ref|ZP_04571425.1| transcriptional regulator - Term 13299 - 13347 4.5 14 8 Op 1 5/0.000 - CDS 13367 - 13783 411 ## COG2856 Predicted Zn peptidase 15 8 Op 2 . - CDS 13743 - 14210 655 ## COG1396 Predicted transcriptional regulators - Prom 14260 - 14319 13.3 + Prom 14308 - 14367 10.9 16 9 Tu 1 . + CDS 14394 - 14897 533 ## FN2064 hypothetical protein Predicted protein(s) >gi|296155469|gb|ADVK01000008.1| GENE 1 201 - 1130 1337 309 aa, chain + ## HITS:1 COG:FN2081 KEGG:ns NR:ns ## COG: FN2081 COG2984 # Protein_GI_number: 19705371 # Func_class: R General function prediction only # Function: ABC-type uncharacterized transport system, periplasmic component # Organism: Fusobacterium nucleatum # 11 309 1 299 299 489 99.0 1e-138 MKKLLTILGLMLGLTTLSVAAEKEIKVGITQIVEHPSLDATRKGVEKALKEKGKGKNIKI EYQSAQGDFGTAQLIAKSYSSSKKDVIVAISTPSAQAALNATKTIPIVYTAVTDGASAGL KGNNITGTSDMSPLDKQAELIKTLLPNAKKVGFLYNPSEQNSLLLLEKFKRIAKAKGLTV VEKGVSSVNDINLAIDSLLGQIDVLYIPTDNLVYSSASLVIQKANKKNVPVIASTNDIVE KGALATESIDYEKLGYQTGERVIDVLNGKNPKDIPVETLKQTTLVINQKIAKKYNISLDN PKLKNAVIK >gi|296155469|gb|ADVK01000008.1| GENE 2 1248 - 2081 1162 277 aa, chain + ## HITS:1 COG:FN2080 KEGG:ns NR:ns ## COG: FN2080 COG4120 # Protein_GI_number: 19705370 # Func_class: R General function prediction only # Function: ABC-type uncharacterized transport system, permease component # Organism: Fusobacterium nucleatum # 1 277 1 277 278 429 96.0 1e-120 MLQATIEQSLIFAIMVLGVYISFRILNFPDMTVDGTFPLGAAISAKLLTLGVNPYLTLIV TLVAGAVAGAITGLIHVKLKVKDLLAGILVMTALYSVNLRVMGKSNIPLFEEDNIFNTEY SMMITIVVLILISKFILDYLLKTKFGFTLKALGDNENLIVSLGLNEEKYKIYGLMIANAF VAFSGAILAQYQGFADVGMGTGIIVTGLASIIIGDTLFGKRRRLAGTTIVIIGSILYRGV IAVTLSMGMDASDLKLITSVIVIVILCIQKQKDKRRK >gi|296155469|gb|ADVK01000008.1| GENE 3 2084 - 2860 234 258 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S 
ribosomal protein L34 [Vibrio campbellii AND4] # 1 228 1 218 245 94 31 3e-19 MINISNIKKTFYSALGEEKAVFNGLNLEINQGDFISVIGSNGAGKTTLLNTITGNIDLDG GYIEVDGKNINNLAKHKRGEFISKVYQNPALGTAPSMTIFENLSMADNKGKMFGLSLGLN YSRKKYYMNLLKELDLGLENLLDTEVQYLSGGQRQCLALIMATLNQPKVLLLDEHTAALD PKTSKIIMDKTEEIVEKFEIPTLMITHNLQDAIKYGNRLIMLHNGKIILDIKGKEKEELT QDTLMEIFQKKATYSDVM >gi|296155469|gb|ADVK01000008.1| GENE 4 2936 - 3778 978 280 aa, chain + ## HITS:1 COG:no KEGG:FN2078 NR:ns ## KEGG: FN2078 # Name: not_defined # Def: DeoR family transcriptional regulator # Organism: F.nucleatum # Pathway: not_defined # 1 280 1 280 280 443 99.0 1e-123 MKKIRVTVPEDVWDIIKIDQEDFGINNNKFCNYILEKLKFNRKIETEKLLQAQGRTHKKI IQFDLNVNNKGIYYDILKSNEVEIEAEYFRELFEIYCSKFKYQRELFIYEDKLKSILDAI KDENKLKIKYFSEIIDIDPIFIRREDKGNENFLFCYVEKLNSYQNYKLKEMEIVAILPEK MKKRDKKFIESMKKKYDPFLGKAATIKVKLTTLGESLLKTFTEYRPKLIKNEKDIYYFET SEEQAKMYFRGFSKEAEILEPFSLREEIIKEYQEALSIYK >gi|296155469|gb|ADVK01000008.1| GENE 5 3822 - 5252 1732 476 aa, chain - ## HITS:1 COG:FN2077 KEGG:ns NR:ns ## COG: FN2077 COG1757 # Protein_GI_number: 19705367 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 1 476 1 476 476 829 98.0 0 MLKKEKIKPSLFVAALPFVFLIVVMLIGNIVYGAPAQLPLILGIAFTCILGHFLGYSYQE IEDSMIETNKMGLQANFIMLIVGCLIGSWIIGGVVPGMIYYGLKLFTPRIFLIILPIMCA IISVSTGSAWTTAGTMGTAAMGIGVGMGIPAPLVAGAVVTGASFGDKLSPLSDSTNLAAA TAEAGLFDHVRHMLKTTVPSFLIALLIYAFLGRNFGSANINSEAIESITSTLASNFKITP LIFIPPIIIIVIIFLKVPPVPGMLIGTLAGVGMCFYQGVDLQTILVALYEGPSIETGNAV VDKLLNRGGMLFMMETISLVICALAFGGAIKSIGCIDKIIETVLKHLRRRGSIITSNVLM CILCNFAAADQYMSIVIPGQMYKKVYKKLNLAPENLSRTLEDAGTLTSGLVPWSTCGAVY LATLGVSAFQYGRFHILGLVNPIVAIIYAYLLIFLNPLDKSKPIKDRLTDEDLKEL >gi|296155469|gb|ADVK01000008.1| GENE 6 5439 - 5972 830 177 aa, chain - ## HITS:1 COG:FN2073 KEGG:ns NR:ns ## COG: FN2073 COG0503 # Protein_GI_number: 19705363 # Func_class: F Nucleotide transport and metabolism # Function: Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins # Organism: 
Fusobacterium nucleatum # 1 177 1 177 177 282 93.0 2e-76 MKTYTLHVAGLTRELPIIKLSYDLSIASFVILGDTEIVRKTAPIIAKKLPEVDLIVTAEA KGIPLAYEISKVLNLNEYIVARKSVKAYMEEPIEVELHSITTTDSQKLYLNNQDARKIKG KKVALVDDVISTGQSLKALERLVEKAGGKVVAKAAILAEGDAKNRKDIVFLEALPTF >gi|296155469|gb|ADVK01000008.1| GENE 7 5985 - 7052 1599 355 aa, chain - ## HITS:1 COG:FN2072 KEGG:ns NR:ns ## COG: FN2072 COG2252 # Protein_GI_number: 19705362 # Func_class: R General function prediction only # Function: Permeases # Organism: Fusobacterium nucleatum # 1 355 1 355 355 494 96.0 1e-139 MTVNDVLAALGVVLNGIPQALLAATYGFASIPTAFGFIVGAVACLLYGSAIPISFQAETI ALAGMLGKDIRERLSIILFSGITMVILGLTGTLSTIVNFAGSTIINAMMAGVGIMLTRIA LSGLKESRIVTASSIVSAFITYFFFGQNLVYTIVVCVIFSSLVANIFKINFGGGIIENYK KIEIKKPIINLSVIRGALALACLTIGANIAFGNITASMTGKYEANIDHLTIYSGLADAAS SLFGGGPVEAIISATAAAPNPLTSGVLMMAIMAVILFFGLLPKISKYIPGHSVHGFLFIL GAIVTVPTNASLAFSGGTPQDYVVAATAMTVTAANDPFIGLLVALVVKYIFVFIG >gi|296155469|gb|ADVK01000008.1| GENE 8 7233 - 7484 339 83 aa, chain + ## HITS:1 COG:no KEGG:FN2071 NR:ns ## KEGG: FN2071 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 83 1 83 83 110 97.0 2e-23 MEFLRSLYFAFDEEVQVEIFKGLTFYSIAILILCILVYVGGKILSSSPETKGKVSAFVFG IVVANIIAVVYILYMLREIGIIQ >gi|296155469|gb|ADVK01000008.1| GENE 9 7664 - 9118 2069 484 aa, chain + ## HITS:1 COG:FN2070 KEGG:ns NR:ns ## COG: FN2070 COG1492 # Protein_GI_number: 19705360 # Func_class: H Coenzyme transport and metabolism # Function: Cobyric acid synthase # Organism: Fusobacterium nucleatum # 1 484 8 491 491 863 99.0 0 MIQGTGSSVGKTLIVAGLCRVFVQDGYRVSPFKSQNMALNSFVDIEGLELSRGTVIQAEA GYEIPRAFMNPILLKPNSDNNSQVIINGKIAYTADAKKYFSNSKELKKIALETYKNNIEN NFDIAVLEGGGSPAEINLREYDLVNMGMAELIDSPVILVGNIDIGGVFASIYGTVMLLDE NDRKRIKGYIINKFRGDSDLLKPAIDILDKKFRAEGLDIKFLGVLPYADLKIEEEDSLSD EDKRVYSNDKKYINISVIKTKKMSNFTDFHAFKQYDDVRVRYVYDVKDLGEEDIIVFPGS KNTITDLEDLKKRGIFEKVKELKEKGKIIIGICGGLQMLGKKIFDPKHLESDILETEGFN FFDYETTFDEIKKTEQVTKKIEVTEGILKDFNGYEIKGYEIHQGVTDISTSVISKDNVFA 
TYIHGVFDNSKFTNDFLNMIRREKNMPEQKETLSFNEFKEREYNKLAKLLRENLDIEEIY KILN >gi|296155469|gb|ADVK01000008.1| GENE 10 9245 - 10600 645 451 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|145632256|ref|ZP_01787991.1| 50S ribosomal protein L27 [Haemophilus influenzae 3655] # 4 451 3 445 456 253 33 7e-67 MTGIYKIVDSVNGLLWGKNILVFMLIGAALYFSFKTKFMQFRLFHKIIKVLFKNEKGKHG ISSLETFFLGTACRVGAGNIAGVVAAISVGGAGSIFWMWLVAMLGSATAFIESSLAVIYR KKEKDGSYTGGTPFIIEKRLNMRWLGVIYALASVVCYFGVTQVMSNSITGSIISVYTWGA DNKFFNLQNISSIAVAILVAYIIFFSKSKKDSIVESLNKIVPFMAIIYVVAVIYILVTNL THIPAMIGNIISQAFGTREVLGGTFGAVVMNGVRRGLFSNEAGSGNSNYAAAAVHIDIPA KQGMVQAFGVFIDTLVICSATAFIVLLAPESVISGLSGMGLFQAAMSYHLNGVGSLFVVI LMFFFCISTILAVAFYGRSAVNFIHESKYLNIVYQAILILMIYIGGIKQDIFIWSLADFG LGIMTVINILVIIPIAKPALDSLKSYEKELK >gi|296155469|gb|ADVK01000008.1| GENE 11 10720 - 12177 1679 485 aa, chain + ## HITS:1 COG:FN2068 KEGG:ns NR:ns ## COG: FN2068 COG1078 # Protein_GI_number: 19705358 # Func_class: R General function prediction only # Function: HD superfamily phosphohydrolases # Organism: Fusobacterium nucleatum # 1 485 29 513 513 916 99.0 0 MGVKVVKDLVYSYIEIDESVQKLIDTASFQRLKRIKQLSSSYIFPSTNHTRYEHSIGVMH LACNFFEVLEKDFRKYGLTEDRISYLRLHIKLAGLLHDVGHPPFSHLGEKFLDKNEIIAC IKNEYSHLVDVDKIFYNNGRLIGKEHELLSCYCILRKFYKILKEEIDKNIDVAFICRCII GNTYPDSENWDKNICVRIISSDSIDVDKLDYLTRDNHMTGEIAPKMDIKRLLACLTITEN KELKYVAKAIPAVQTVVDSRDILYLWVYHHHISIYTDYIIGRILKRCMTLYDEHRGQALE EMNREEYFSPKAITDYLITDDDIYSHLRKIYVLSLEGKVDDFNTIIIKQIFERDFLKPLW KTIYEYKDFEKNLVDKKIIKSYDELEDILKNEKNIEDITNTLLKKLNLKEGEVFIITKYN KFYNSNKEAPISLLLNGEERKLSDLLPQKEFGKFHTMAFFVFVPKKYKKEAKEIVIEELQ KISQT >gi|296155469|gb|ADVK01000008.1| GENE 12 12199 - 12738 832 179 aa, chain + ## HITS:1 COG:FN2067 KEGG:ns NR:ns ## COG: FN2067 COG0526 # Protein_GI_number: 19705357 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Fusobacterium nucleatum # 16 179 1 164 164 304 99.0 6e-83 
MRFKSKLVLILIFVIMSFSVFAAKSDKKDDVKLPNIVLYDQYGKKHNIEEYKGKVVVINF WATWCGYCVEEMPGFEKVYKEFGSNKKDVIILGVAGPKSKENLNNVDVEKDKIILFLKKK NITYPSLMDETGKSFDDYKVRALPMTYVINKNGYLEGFVNGAITDEQLRKAINETLKKK >gi|296155469|gb|ADVK01000008.1| GENE 13 13142 - 13318 160 58 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|237740944|ref|ZP_04571425.1| ## NR: gi|237740944|ref|ZP_04571425.1| transcriptional regulator [Fusobacterium sp. 4_1_13] transcriptional regulator [Fusobacterium sp. 4_1_13] # 1 58 222 279 279 85 81.0 1e-15 MKTILGLIKENSNITIKEIGLKLKVSRPTVYRDMKYLKENNILEYQESSKKGKWIIKK >gi|296155469|gb|ADVK01000008.1| GENE 14 13367 - 13783 411 138 aa, chain - ## HITS:1 COG:FN2066 KEGG:ns NR:ns ## COG: FN2066 COG2856 # Protein_GI_number: 19705356 # Func_class: E Amino acid transport and metabolism # Function: Predicted Zn peptidase # Organism: Fusobacterium nucleatum # 1 138 1 138 138 252 100.0 1e-67 MTDKRKKEILKLINNLYFEFGTKNPLRLCKGLGIEVVSANIEMKGLYTEVFSSKLIIIQN LLDDFAKLFVVGHELFHALEHDCEQVRFFRECTSFKTNIYEEEANFFATHLLEDCIPFHQ DEIADLEIAEELEKYLDI >gi|296155469|gb|ADVK01000008.1| GENE 15 13743 - 14210 655 155 aa, chain - ## HITS:1 COG:FN2065 KEGG:ns NR:ns ## COG: FN2065 COG1396 # Protein_GI_number: 19705355 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 155 1 155 155 240 99.0 7e-64 MKLISDFAERLRIALDFKKMKATELSELTGINKSTISQYLSKEYEPKRDRIELFAKTLNV NEVWLTGYDVPMEISSIDKNDSLVEEYELSPDELKEYENIKMTTSTLMFNGRPASENDKI ELEKVLKDFFVKALLKKRADEENDRQKKKRNSKIN >gi|296155469|gb|ADVK01000008.1| GENE 16 14394 - 14897 533 167 aa, chain + ## HITS:1 COG:no KEGG:FN2064 NR:ns ## KEGG: FN2064 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 167 1 167 167 294 98.0 8e-79 MKNIHTNFMTEYILKLTGEYASANRIHDVLNLSLSFTYTLANNNKVRNRVKNGRTEYNME DFLRSLELSYNNKPISNPLTKEDFEINNFHNWEARNDIEKYLEKLLLDELGQFTCIKDLV EMFKVSKTIWYEALEEGKIMYFTISSRKIIVTRSLLPFLREALSERY Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:10:20 2011 Seq name: 
gi|296155427|gb|ADVK01000009.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00010, whole genome shotgun sequence
Length of sequence - 34866 bp
Number of predicted genes - 43, with homology - 42
Number of transcription units - 12, operones - 7 average op.length - 5.4
N Tu/Op Conserved S Start End Score pairs(N/Pv)
1 1 Op 1 7/0.000 - CDS 7 - 846 1466 ## COG1250 3-hydroxyacyl-CoA dehydrogenase
2 1 Op 2 . - CDS 862 - 1638 1285 ## COG1024 Enoyl-CoA hydratase/carnithine racemase
- Prom 1681 - 1740 9.1
+ Prom 1704 - 1763 13.6
3 2 Tu 1 . + CDS 1945 - 4533 3208 ## COG0474 Cation transport ATPase
+ Term 4549 - 4607 2.5
4 3 Tu 1 . - CDS 4622 - 5086 712 ## COG3467 Predicted flavin-nucleotide-binding protein
- Prom 5116 - 5175 9.3
- Term 5186 - 5230 1.1
5 4 Tu 1 . - CDS 5234 - 5542 164 ## PROTEIN SUPPORTED gi|148826039|ref|YP_001290792.1| 50S ribosomal protein L35
- Prom 5610 - 5669 8.8
6 5 Op 1 1/0.750 - CDS 5686 - 6978 1672 ## COG2252 Permeases
7 5 Op 2 1/0.750 - CDS 7016 - 7885 167 ## PROTEIN SUPPORTED gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit
8 5 Op 3 1/0.750 - CDS 7885 - 8985 1056 ## COG0772 Bacterial cell division membrane protein
9 5 Op 4 1/0.750 - CDS 9005 - 9445 861 ## COG0756 dUTPase
10 5 Op 5 1/0.750 - CDS 9447 - 10673 1728 ## COG0612 Predicted Zn-dependent peptidases
11 5 Op 6 22/0.000 - CDS 10692 - 11783 1220 ## COG0795 Predicted permeases
12 5 Op 7 . - CDS 11783 - 12862 1077 ## COG0795 Predicted permeases
13 5 Op 8 . - CDS 12871 - 13410 636 ## FN1032 hypothetical protein
14 5 Op 9 . - CDS 13426 - 14607 612 ## PROTEIN SUPPORTED gi|223476703|ref|YP_002580685.1| ribosomal protein L11 methyltransferase, putative
- Prom 14632 - 14691 13.8
+ Prom 14542 - 14601 12.7
15 6 Op 1 1/0.750 + CDS 14781 - 15392 613 ## COG1309 Transcriptional regulator
16 6 Op 2 . + CDS 15407 - 15886 378 ## COG0655 Multimeric flavodoxin WrbA
17 6 Op 3 . + CDS 15899 - 16525 419 ## FN1036 hypothetical protein
18 6 Op 4 1/0.750 + CDS 16574 - 17020 533 ## COG0716 Flavodoxins
+ Term 17035 - 17103 1.2
+ Prom 17057 - 17116 14.4
19 7 Op 1 1/0.750 + CDS 17205 - 18113 1124 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily
20 7 Op 2 8/0.000 + CDS 18124 - 18876 538 ## COG1296 Predicted branched-chain amino acid permease (azaleucine resistance)
21 7 Op 3 1/0.750 + CDS 18873 - 19196 232 ## COG1687 Predicted branched-chain amino acid permeases (azaleucine resistance)
22 7 Op 4 1/0.750 + CDS 19208 - 20383 961 ## COG4552 Predicted acetyltransferase involved in intracellular survival and related acetyltransferases
23 7 Op 5 . + CDS 20431 - 21276 253 ## PROTEIN SUPPORTED gi|212640476|ref|YP_002316996.1| Uncharacterized protein conserved in bacteria containing two ribosomal protein S1-like RNA-binding domains
+ Prom 21278 - 21337 2.5
24 7 Op 6 . + CDS 21362 - 21493 115 ## Lebu_1563 hypothetical protein
+ Term 21509 - 21549 1.4
25 8 Op 1 . - CDS 21526 - 22254 669 ## FN1044 hypothetical protein
26 8 Op 2 . - CDS 22251 - 23060 790 ## FN1045 hypothetical protein
- Prom 23166 - 23225 12.0
+ Prom 23138 - 23197 10.6
27 9 Op 1 . + CDS 23219 - 23299 174 ##
28 9 Op 2 . + CDS 23299 - 24249 739 ## FN1046 hypothetical protein
29 9 Op 3 . + CDS 24268 - 25032 702 ## FN1047 hypothetical protein
30 9 Op 4 . + CDS 25052 - 26131 881 ## FN1048 hypothetical protein
31 9 Op 5 . + CDS 26152 - 26478 639 ## FN1049 hypothetical protein
32 9 Op 6 . + CDS 26495 - 26878 688 ## COG0346 Lactoylglutathione lyase and related lyases
33 9 Op 7 . + CDS 26899 - 27618 701 ## FN1051 hypothetical protein
34 9 Op 8 . + CDS 27638 - 28105 550 ## FN1052 hypothetical protein
35 9 Op 9 . + CDS 28141 - 28689 551 ## FN1053 hypothetical protein
36 9 Op 10 . + CDS 28713 - 29090 383 ## FN1054 hypothetical protein
37 9 Op 11 . + CDS 29115 - 29231 155 ## gi|34763313|ref|ZP_00144269.1| hypothetical protein
- Term 29487 - 29537 -0.8
38 10 Tu 1 .
- CDS 29558 - 30568 499 ## PROTEIN SUPPORTED gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 - Prom 30667 - 30726 11.4 + Prom 30662 - 30721 12.0 39 11 Op 1 . + CDS 30811 - 31410 530 ## FN1056 hypothetical protein 40 11 Op 2 . + CDS 31426 - 31884 564 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 41 11 Op 3 . + CDS 31920 - 32687 735 ## FN1058 hypothetical protein + Prom 32692 - 32751 6.5 42 11 Op 4 . + CDS 32781 - 34058 1507 ## COG1114 Branched-chain amino acid permeases + Term 34070 - 34123 0.4 43 12 Tu 1 . - CDS 34100 - 34864 1046 ## COG4884 Uncharacterized protein conserved in bacteria Predicted protein(s) >gi|296155427|gb|ADVK01000009.1| GENE 1 7 - 846 1466 279 aa, chain - ## HITS:1 COG:FN1019 KEGG:ns NR:ns ## COG: FN1019 COG1250 # Protein_GI_number: 19704354 # Func_class: I Lipid transport and metabolism # Function: 3-hydroxyacyl-CoA dehydrogenase # Organism: Fusobacterium nucleatum # 1 279 1 279 279 529 100.0 1e-150 MKVGIIGAGTMGAGIAQAFAQTEGFTVVLCDINNEFAANGKKKIAKGFEKRIAKGKMEQA DADKILERITTGTKDICGDCDLIIEAAIENMEIKKQTFKELDDICKPEAIFATNTSSLSI TEIGAGLKRPMIGMHFFNPAPVMKLVEIIAGLNTPTDIVDKIKKVSEDIGKVPVQVEEAP GFVVNKILIPMINEAIGIYAEGIASVEGIDTAMKLGANHPIGPLALGDLIGLDVCLAIMD VLYHETGDSKYRAHTLLRKMVRGKQLGQKTGKGFYDYTK >gi|296155427|gb|ADVK01000009.1| GENE 2 862 - 1638 1285 258 aa, chain - ## HITS:1 COG:FN1020 KEGG:ns NR:ns ## COG: FN1020 COG1024 # Protein_GI_number: 19704355 # Func_class: I Lipid transport and metabolism # Function: Enoyl-CoA hydratase/carnithine racemase # Organism: Fusobacterium nucleatum # 1 258 1 258 258 495 100.0 1e-140 MSVISYKQENFIGIVTIERPEALNALNSQVLDELSATFAGINLETTRVVLLTGSGSKSFV AGADIAEMSTLNSDEGTKFGYKGNEIFRKIETFPLPVIAVINGFALGGGCELAMSCDFRI CSENAIFGQPEVGLGITPGFGGTQRLARLIGLGKAKEMIYTANTIKADEALNIGLVNHVY PQETLMEEAMKLAGKIAKNAPFAVRACKKAINEGIDTDMDRAIIIEEKLFGSCFATEDQK VGMKAFLDKVKGVEYKNK >gi|296155427|gb|ADVK01000009.1| GENE 3 1945 - 4533 3208 862 aa, chain + ## HITS:1 COG:FN1022 KEGG:ns NR:ns ## COG: FN1022 COG0474 # Protein_GI_number: 
19704357 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Fusobacterium nucleatum # 1 862 1 862 862 1560 99.0 0 MKHFTKSKKQLFEEFETVSTGLTDEEVEKRRKKYGENKFIEKEKDGLIKIFFYQFKDSLV IILLVAAIISFFSGNEESSFVIVLVLILNSILGAYQTIKAQKSLDSLKKMSSPKCKVIRE HEQLEVDSSELVPGDIVIIEAGDIVPADGRIIENFSLLVNENSLTGESNSIEKTDEILQY EDLALGDQVNMVFSGSLVNYGRAKILITETGMSTELGKIATLLDQTEENVTPLQKSLDIF GKRLTLAIVILCIFIFGIYVYHGNTVLDSLLLAVALAVAAIPESLNPIITIVLSLETEKL AKENAIVKELKSIEALGSISVICSDKTGTLTQNKMTVKKIFINGKLDDEYSLNINKKIDK LLLDSFILCTDATDTIGDPTETALIHLTQKYDMSFRDERKDSKRISEIPFDSVRKLMTVL YEDKKGKYIIFTKGAFDSLITKFKYFIDENGDIQNINEEFIKKIEKVNNDLAEEGLRVLT FAYKYMDEPKDLTTQDEDSYIFHALVGMIDPPREESKVAVQECIRGGIKPVMITGDHKIT ARTIAKNIGIFKDGDMAIEGVELEKMSDEELENSVEKISVYARVSPEHKIRIVNAWQKLG KICAMTGDGVNDAPALKKANIGIAMGITGTEVSKNAASMILADDNFSTIVKAIITGRNVY RNIKNAIGFLLSGNTAAILAVLYSSLANLPVIFSPVQLLFINLLTDSLPSIAVGVEPKNE DILDEKPRDPNEAILTKRFSSKLLIEGVLIAIFIIIAFYIGLKDSVLKGSTMAFATLCLA RLFHGIDYRGQRNIFAIGFFKNKFSLIAFGVGFALLNLVLLIPSLYKIFGITRIEAVNFL EIYILSLIPTILIQIYKAIKYR >gi|296155427|gb|ADVK01000009.1| GENE 4 4622 - 5086 712 154 aa, chain - ## HITS:1 COG:FN1023 KEGG:ns NR:ns ## COG: FN1023 COG3467 # Protein_GI_number: 19704358 # Func_class: R General function prediction only # Function: Predicted flavin-nucleotide-binding protein # Organism: Fusobacterium nucleatum # 1 154 3 156 156 276 99.0 1e-74 MRKADREIKSKEEIIDIIKRCDVIRLAFNNGDYPYILPLNFGFEINENKIIFYFHSALEG TKVDIMKKDNRVSFEMDTKHKLQYYEEKGYCTMSYESVIGRGKIKILSEDEKINALKKLM RHYHKNEDTYFNPAAISRTLVYSLEVEEMTAKKK >gi|296155427|gb|ADVK01000009.1| GENE 5 5234 - 5542 164 102 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148826039|ref|YP_001290792.1| 50S ribosomal protein L35 [Haemophilus influenzae PittEE] # 1 93 4 96 96 67 34 9e-11 MTKKEFVNAFAEKGELKIKDSERLVNAFLETVENALLKGDGVRFIGFGSWEVKERSAREV KNPQTGKMIKVEAKKVVKFKVGKPLADKVAGQKGAKKATKKK >gi|296155427|gb|ADVK01000009.1| GENE 6 5686 - 6978 1672 430 aa, chain - ## HITS:1 COG:FN1025 KEGG:ns NR:ns ## COG: FN1025 COG2252 # 
Protein_GI_number: 19704360 # Func_class: R General function prediction only # Function: Permeases # Organism: Fusobacterium nucleatum # 1 430 6 435 435 681 99.0 0 MSFLDGYFRITERDSTVSREVMGGITTFLAMAYIIIVNPSILSLSGMDKGALITVTCLAS FIGTIIAGVWANSPIALAPGMGLNAFFTYTLTLEKQVPWQTALGIVFLSGCFFLILAIGG IREKIANSIPVPLRLAVGGGIGLFIAFIGLKSMGIVVANQATYVGLGEFTKTTCVSIIGL FIIAIMEIKRMKGGILLGIIVTTILGIIIGDVSLPEKIISLPPSPAPIMFKLDILSAMKL SLIGSIFSFMFVDLFDSLGTLMSCSKEMGLVNEKGEIKNLGRMLYTDAASTIMGASIGTS TVTAYVESAAGIVAGARTGLATTVTALGFLLSLFFTPLISIVPGYATAPALIIVGIFMFR QVAALDFSDFKILFPAFITIFTMPLTYSISTGLALGFLSYLIVHILVGDFKKINITLLFI GAICLLHLLV >gi|296155427|gb|ADVK01000009.1| GENE 7 7016 - 7885 167 289 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit [Lactobacillus helveticus DPC 4571] # 90 272 83 265 285 68 25 4e-11 MIEYIIDEEYEKVRVDRFLRKHLKNINLSEIYKMLRKGKIKVNNKKISQDYRLVLGDVVF IFLPENFKENNEEKFIELSQERKEKLKEMIVFENENLFVINKLLGDVVHKGSGHDISLLE EFRSYYSNNNINFVNRIDKLTSGLVIGAKNIKTAREVAKEIQASNIIKRYYILVNGKIEK DNFILENYLKKDEEKVIVSDIEKEGYKKSITYFKKIKEHNKYTLLEAELKTGRTHQLRAQ LNHIGNNIVGDTKYGKDEREDMMYLFSYYLKIDLYNLEIKLEIPNFFLI >gi|296155427|gb|ADVK01000009.1| GENE 8 7885 - 8985 1056 366 aa, chain - ## HITS:1 COG:FN1027 KEGG:ns NR:ns ## COG: FN1027 COG0772 # Protein_GI_number: 19704362 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Bacterial cell division membrane protein # Organism: Fusobacterium nucleatum # 1 366 1 366 366 603 99.0 1e-172 MQNNIYLKKISKFSVFFIVNILLLFIISLSTIYSATITKSEPFFLKEIVWFIISIFVFIG VSLVDYRKYYKYATAIYIFNILMLLSVLVIGTSRLGAKRWIDLGPLALQPSEFSKLFLIF TFSAYLINNYSDRYTGFRAMFMSFLHIFPVFFLIAIEPDLGTSLVIILIYGMLLFLNKLE WKCIATVFFTIAAFIPISYKFLLKGYQKDRIDTFLNPELDALGTGWNITQSKIAIGSGKI FGKGFLNNTQGKLKYLPESHTDFIGSVFLEERGFLGGSMLLLIYIVLLVQIIYIADTTED KFGRYVCYGIATIFFFHIFVNMGMIMGIMPVTGLPLLLMSYGGSSLVFSFLILGVVQSVR IHRGSK >gi|296155427|gb|ADVK01000009.1| GENE 9 9005 - 9445 861 146 aa, chain - ## HITS:1 COG:FN1028 KEGG:ns NR:ns ## COG: FN1028 COG0756 # 
Protein_GI_number: 19704363 # Func_class: F Nucleotide transport and metabolism # Function: dUTPase # Organism: Fusobacterium nucleatum # 1 146 1 146 146 253 98.0 9e-68 MKKVQVKVIREKGVELPKYETEGSAGMDVRANIKESITLKSLERILVPTGLKVAIPEGYE IQVRPRSGLAIKHGITMLNTPGTVDSDYRGELKVIVVNLSNEAYTIEPNERIGQFVLNKI EQIEFVEVEELDSTERGEGGFGHTGK >gi|296155427|gb|ADVK01000009.1| GENE 10 9447 - 10673 1728 408 aa, chain - ## HITS:1 COG:FN1029 KEGG:ns NR:ns ## COG: FN1029 COG0612 # Protein_GI_number: 19704364 # Func_class: R General function prediction only # Function: Predicted Zn-dependent peptidases # Organism: Fusobacterium nucleatum # 1 408 1 408 408 689 99.0 0 MENIKLKKLDNGITLITEKLPDMSTFSMGFFVKTGAMNETKKESGISHFIEHLMFKGTKN RTAKEISEFVDFEGGILNAFTSRDLTCYYIKLLSSKIDIAIDVLTDMLLNSNFDEESIEK ERNVIIEEIKMYEDIPEEIVHEKNVEYALRGVHSNSISGTVASLKKIDRKAILNYLEKYY VAENLVIVASGNIDEKYLYKELNKKMKNFRKTKKEEVLDLSYEIKKGKKVVKKPSNQIHL CFTTRGVSSKSELRYPAAIISNVLGEGMSSRLFQKIREERGLAYSVYTYLTRFENCGLLS VYVGTTKEDYKEVIKLIKEEFKNIKENGISERELRKAKNKYESAFTFSLESTSSRMNRLA STYIIYGKIISLDKVREDIEKVTLKDIKKAAEFLFDEQFYSQTIVGDI >gi|296155427|gb|ADVK01000009.1| GENE 11 10692 - 11783 1220 363 aa, chain - ## HITS:1 COG:FN1030 KEGG:ns NR:ns ## COG: FN1030 COG0795 # Protein_GI_number: 19704365 # Func_class: R General function prediction only # Function: Predicted permeases # Organism: Fusobacterium nucleatum # 1 363 1 363 363 647 99.0 0 MIKKMDIYISKYFIKFFLMNIIGFMGVFLLAQTFKIIKYINQGKLAGGEIFDYILNLLPK MFVETAPLSVLLAGLITISIMASNLEIVSLKTSGIRFLRIVRAPLIIAFVISLFVFFVNN SIYTKSLAKINFYRRGEIDETLKLPTTKENAFFINNTEGYLYLMGKINRETGLAENIEVV KFNTEISKPKEIITAKSAKFDTEENKWIFSNVNIYNVETKETTAKTEYKSNLYKDDPSNF IRASAEDPRMLTIKELKKTIKEQKNIGEDTRIYLAELAKRYSFPFASFIVAFIGLSVSSK YVRGGRTTMNLVICVVAGYGYYLVSGAFEAMSLNGILNPFIASWVPNILYLIIGIYFMNR AEY >gi|296155427|gb|ADVK01000009.1| GENE 12 11783 - 12862 1077 359 aa, chain - ## HITS:1 COG:FN1031 KEGG:ns NR:ns ## COG: FN1031 COG0795 # Protein_GI_number: 19704366 # Func_class: R General function prediction only # Function: Predicted permeases # Organism: 
Fusobacterium nucleatum # 1 359 1 359 359 600 100.0 1e-171 MKIINKYILDELKGPIILAVFVFTFIFLLDIVVTMMEHIIVKGISVFDVLRLLSFYIPPI LTQTIPIGMFLGIMICFTKFSRNSESVAMVSTGMSIRDILKPILAIAIGASIFIVFLQES IIPRSFIKLKYVGTKIAYENPVFQLKEKTFIDNLDEYSIYVDEVSSDGKAKNIIAFEKPE DKNKFPMVLTGEEAFWKDSAIIIKESQFVSFDEKGKKNLVGTFDEKRVVLTAYFQDLNIK IKDVEALSIIDLIKGLKKVEATEVIKYKIEIFRKLALVFSTVPLAVIGFCLSLGHHRISK KYSFVLAMIIVFAYIIFLNIGIVMATAGKLNPFIATWTPNLLLYLLGYKLYKAKEVRGI >gi|296155427|gb|ADVK01000009.1| GENE 13 12871 - 13410 636 179 aa, chain - ## HITS:1 COG:no KEGG:FN1032 NR:ns ## KEGG: FN1032 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 179 1 179 179 244 100.0 1e-63 MYLDILILIIFILGIFSGIKNGIFVEIISVFGFAVNLLITKIYTPVVLKFLKRSDASFEN NYVITYIVTFITVYLVVSMILIFVKKAFKGLKKGFFNKMMGGVAGFIKALIVSLVIILVY TYSTKLAPSLEKYSQGSSAISIFYEIVPSFEAYIPDILVEDFNKNATKKIIEKNINTML >gi|296155427|gb|ADVK01000009.1| GENE 14 13426 - 14607 612 393 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|223476703|ref|YP_002580685.1| ribosomal protein L11 methyltransferase, putative [Thermococcus barophilus MP] # 1 391 1 393 396 240 34 1e-62 MSKIIIKKEKEQKILNFYPNVYKDEIKDTIGNVKTGDIVDVITNDMKFLARGYVTEGTSA FVRILTTKDEKIDRKFIFERIKNAYEKREYLLDETNSVRAFYSEADFIPGLIIDKFDKYV SIQFRNSGVEIFRQDIIEAVKKYLKPKGIYERSDVENRVIEGVETKTGIIFGEIPERTIM IDNGVKYSIDIVDGQKTGFFLDQRNSRKFIAKYINNQTRFLDVFSSSGGFSMSALKNGAK EVIAMDKDSHALELCHENYKLNEFTADFSTIEGDAFLMLNSLATRNKKFDIITLDPPSLI KKKTEIYKGRDFFLDLCDKSFKLLEDGGILGVITCAYHISLQDLIEVTRMSASKNNKLLS VMGINYQPEDHPWILHIPETLYLKALWVKVEGR >gi|296155427|gb|ADVK01000009.1| GENE 15 14781 - 15392 613 203 aa, chain + ## HITS:1 COG:FN1034 KEGG:ns NR:ns ## COG: FN1034 COG1309 # Protein_GI_number: 19704369 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 203 8 205 205 315 97.0 3e-86 MEKSYHHGNLKEELIKKGIELINEVGENKLSLRKLAIICGVSNSAPYTHFKSKDELLKGM SLYILNLLKLELENTRKKYKNKENLLVMLGKTYVIFFLKNPKYYYFLSSRKDIEIDLSIK IDNNNMTALDILKEEAINKFSKLGISNEDIQNKILAMWSLVAGLVAVINMTSKNYLENWE 
CKIEEIIKASFITSSFITYCKEN >gi|296155427|gb|ADVK01000009.1| GENE 16 15407 - 15886 378 159 aa, chain + ## HITS:1 COG:FN1035 KEGG:ns NR:ns ## COG: FN1035 COG0655 # Protein_GI_number: 19704370 # Func_class: R General function prediction only # Function: Multimeric flavodoxin WrbA # Organism: Fusobacterium nucleatum # 1 159 1 159 159 298 98.0 4e-81 MEIIIHDLPEEKLKTIYGIIDNSLVITNNKKIKSCTGCFYCWTKNPGECRIKDGYDNLAE LYSKVEKIIIISRCCYGSYSPFIKNVLDRSIPYLLPFFKIKNKEMHHTIRYKKNLYFELY FYGEDIADEEKEIAKNMVKANCINLNVTNFTVSFLETIS >gi|296155427|gb|ADVK01000009.1| GENE 17 15899 - 16525 419 208 aa, chain + ## HITS:1 COG:no KEGG:FN1036 NR:ns ## KEGG: FN1036 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 208 5 212 212 360 100.0 1e-98 MKITIINGSPRFKKSNSEILKNYLLNFIKENEINEFFSFSIKLDDDIKTDIYNSDILIFI FPLYVDSIPANLLDLLVRFEDENVINSKTKIYCIVNNGFFEGVQNHLAISQIRCWSKKVN AQWGQGIGVGGGELLSHLKKVPLGQGPLKNLGIILEKFSKNILSLKGDEDIYINPNYPKI LYFLQANISWFIEGRKNKLKFRDLFKKI >gi|296155427|gb|ADVK01000009.1| GENE 18 16574 - 17020 533 148 aa, chain + ## HITS:1 COG:FN1037 KEGG:ns NR:ns ## COG: FN1037 COG0716 # Protein_GI_number: 19704372 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 148 1 148 148 264 100.0 3e-71 MEKICIVYDSMHNMNTEKLVLSLKENYNDVDIIKVNNFDINAIDNYPKIGLASGIYWGRF SKNIEELLDKILDSDVKNLFFIYTSGIGKVRYEKKLIKRLEEKNKICLGIFSCKGFDKYG PFKLIGGINKGKPNEKDIQNLIKFFENI >gi|296155427|gb|ADVK01000009.1| GENE 19 17205 - 18113 1124 302 aa, chain + ## HITS:1 COG:FN1038 KEGG:ns NR:ns ## COG: FN1038 COG0697 # Protein_GI_number: 19704373 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 302 2 303 303 520 100.0 1e-147 MNKKRYFGDLMLFLAAFIWGTAFVAQVTGMDRIGPFTFNMARSIIAIICLGAYLIITKSK LPKDIGVLLQGGLVCGFFIFMGTSLQQIGLQYTTAGKTGFITSFYILIIPFLTMIFLKHK 
IDLLTWISIIIGFIGLYLLAIPNLSDFTINRGDFIVFLGSFCWGGHILIIDYYSKKVSPV QLSFLQFVMLTILSGICALLFENETATMNNIFLSWKSIAYAGFLSSGIAYTLQMVGQKYT NPIIASLILSLEAVFAALAGYIMLDEVMTSREFLGCSIVFLAIIFSQIPKDIFKKKYVSL KK >gi|296155427|gb|ADVK01000009.1| GENE 20 18124 - 18876 538 250 aa, chain + ## HITS:1 COG:FN1039 KEGG:ns NR:ns ## COG: FN1039 COG1296 # Protein_GI_number: 19704374 # Func_class: E Amino acid transport and metabolism # Function: Predicted branched-chain amino acid permease (azaleucine resistance) # Organism: Fusobacterium nucleatum # 1 250 1 250 250 407 99.0 1e-114 MISYDIVKYNHFKRGVLMEEFRFALKRYFLISLAYFFIGVTFGLLMKEAGYGTIWSFLSA IFIYGGTIQLLLVGILKNHTPILTVGLISLLVNSRHMFYGLTYIDEFKKIRKKSFLKFLY LSLTLTDEVYSLYIGSKFPERLNRTKIMLWINSLAYSTWIFGCTMGGILYNFINFDLKGI DFIITEFFCIVVISQLVEDKSYISTSVGMISSIVAFLIVGSNFILLAIIFSMISLLLLKN KVSKKEVDKI >gi|296155427|gb|ADVK01000009.1| GENE 21 18873 - 19196 232 107 aa, chain + ## HITS:1 COG:FN1040 KEGG:ns NR:ns ## COG: FN1040 COG1687 # Protein_GI_number: 19704375 # Func_class: E Amino acid transport and metabolism # Function: Predicted branched-chain amino acid permeases (azaleucine resistance) # Organism: Fusobacterium nucleatum # 1 107 1 107 107 163 96.0 9e-41 MSNNLYTFLAIISAGIGMVICRILPYIIFANGKLPKIVKFYEKYLPFSLMAILFCLCLST VKFSVYPHGFPEILTLLIVGALQFWKKNVILSLFLGTAIYLVIIRYI >gi|296155427|gb|ADVK01000009.1| GENE 22 19208 - 20383 961 391 aa, chain + ## HITS:1 COG:FN1041 KEGG:ns NR:ns ## COG: FN1041 COG4552 # Protein_GI_number: 19704376 # Func_class: R General function prediction only # Function: Predicted acetyltransferase involved in intracellular survival and related acetyltransferases # Organism: Fusobacterium nucleatum # 1 391 1 391 391 595 97.0 1e-170 MKVRYAKKSEKEIAIKFWKDSFKDNEEQIKFYFDNIYNEKNYLVLEDNSKIVSSLHENDY IFNFNNNSIKSKYIVGVSSDITMRNKGYMSKLLISMLENSKRKGLPFVSLTPINPKIYRK FGFEYFSNIEYYNFSVEELANFKLPKENYSYIEINEENKKLYLNDLIKIYNFNMKDNFCY LERDKFYFDKILKEASSDKMKAFILYKDKKASAYIILGLYEENIEIRECMALDSISYKEI LTLIYGYRDYYKNVSLASPNNSSIEFLFDNQLNIEKNVKPFMMMRVLNPLAIFKNLKLEN 
SNIKIYIEDKILKENTGLYSLNNEISFSNITEEKASYDLKIDIADLIFLITGYFFIDELI KLGKIDIKNRNIFKKINKIFSKKNSYLYEFI >gi|296155427|gb|ADVK01000009.1| GENE 23 20431 - 21276 253 281 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|212640476|ref|YP_002316996.1| Uncharacterized protein conserved in bacteria containing two ribosomal protein S1-like RNA-binding domains [Anoxybacillus flavithermus WK1] # 30 280 24 280 285 102 29 4e-21 MIKVGKRQKLVINNFSSVGAYLFAGTDDDKDNILLPNNELEGKDLKEGDEVEVLIYRDSE DRLIATFRKTEALVGTLAKLEVVDDNPKLGAFLDWGLNKDLMLPNSQKETKVEIGKKYLV GLYEDSKGRVSATMKIYKFLMPSNDIKKGDIVNATVYRINDEIGVFVAVEDRYFGLIPKS ECFEKYSVGDELTLRVTRVREDKKLDLSPRKLLSDQMESDAELVLGKMRLLKEHFRFNDN SSAEDIKDYFGISKKAFKRAIGSLLKNGLIEKNGDYFILKK >gi|296155427|gb|ADVK01000009.1| GENE 24 21362 - 21493 115 43 aa, chain + ## HITS:1 COG:no KEGG:Lebu_1563 NR:ns ## KEGG: Lebu_1563 # Name: not_defined # Def: hypothetical protein # Organism: L.buccalis # Pathway: not_defined # 1 43 209 251 251 77 83.0 2e-13 MGSWNDSPPCYAAEKGLESEYQNLSSELLTQIRLALLYSVNEW >gi|296155427|gb|ADVK01000009.1| GENE 25 21526 - 22254 669 242 aa, chain - ## HITS:1 COG:no KEGG:FN1044 NR:ns ## KEGG: FN1044 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 242 1 242 242 305 89.0 1e-81 MIVAILISVHLLADFLFQTSAYSEKKRKMLKSLLLHCFIYFIVFEIIFFILFQCEKAFIL GLIISVLHFLINYTVNKLEKYFPKRRLQLLFFSLNQLIHIVMIVGMYYILNLENLTNNLY TKLQTYEDFKIIVLYTFVFSIILDPASVFIRKLFTSISPKIYPKTNLKELKAGNIIGKLE RIIIAILLLNNQFGVMGFVLTAKSIARFKQMENKDFAEKYLIGTLTSFLIALISVLILKE LL >gi|296155427|gb|ADVK01000009.1| GENE 26 22251 - 23060 790 269 aa, chain - ## HITS:1 COG:no KEGG:FN1045 NR:ns ## KEGG: FN1045 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 269 1 269 269 455 98.0 1e-126 MKRYSALMIDLKNSRSYSIQDRNNIQNSVLNSINILNKIFKNSIEKEVEFSAGDEIQGLF VSPKSAYLYYRLFSTVIFPIEIHSGIGFGTWDIKVDSASSTAQDGTVYHYARKAIDEAKK SLEYSVLFYSKSKNDIIVNSLINTSTLLSSKQSEYQNKLMLLAEILYPIASGDIIEYEKL KELLKFIQFEKKENLIIDIDYPVISTQLEKESFYITEGKKRGLSTQISKLLGVSRQSIEK 
AVKTGNIYELRNLTISILKAMESIEGENL >gi|296155427|gb|ADVK01000009.1| GENE 27 23219 - 23299 174 26 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKLQFELTNEQKFDSNYENLKIMGGE >gi|296155427|gb|ADVK01000009.1| GENE 28 23299 - 24249 739 316 aa, chain + ## HITS:1 COG:no KEGG:FN1046 NR:ns ## KEGG: FN1046 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 66 311 1 243 245 377 87.0 1e-103 MFFLGYLFYYSVLATLIDPSIFFSILSFVLGGMLYFNSGSPLQGIAICAFGLLHLISRLC YHSKKMPTKDSSIYSYAKHNYLSISIATVIFAIPIWYIIIKSKITLEISPVYIFIASLLI SWAILFKIVDRIFIHNRETQEVVLGDYFTIYRSIGSIRRSLTHIYIFKFKSSPDLYSTGM WRYRIFIDKLDSKFSCTFGKGIFGTTYITSVKLIEDARINAPENQNPYSTFSFRKRSLTK KDYTFPWHILLFASGIPVFCFGVFFLYVAHIGPQNSDVRLLEYIELYKVIGIGGIISGII LEIIAIFVFIKFIRKK >gi|296155427|gb|ADVK01000009.1| GENE 29 24268 - 25032 702 254 aa, chain + ## HITS:1 COG:no KEGG:FN1047 NR:ns ## KEGG: FN1047 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 17 254 1 238 238 334 89.0 3e-90 MNEPKKWGYIFDEKLNMYVPNLPRQKKLAKVFLILSLISFIAILIQIYFFDKTSYEKISF LTYTSIVVFLFLALYLVLKINIYLAEKRLQEVKELKLEKNFEIKALKNRRFLAYMMIWIL LIVMFVSHPNILKQFSTRYIFYLIFAIVAFIYNFYTLFKEFKNNKYSLIIIGKTIKIYYE NNEKEFITTDNISYAKFYAIARGRGRRDRNPTLQIFDSEEKKLVEMTIKAMDYYSLKKYF EKYNVTIDNQYKEF >gi|296155427|gb|ADVK01000009.1| GENE 30 25052 - 26131 881 359 aa, chain + ## HITS:1 COG:no KEGG:FN1048 NR:ns ## KEGG: FN1048 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 359 1 367 367 461 87.0 1e-128 MELLEKDEEYIISLLEQGKKVEAIVFVKDKTGMTLKDAKDYIDKKDISISEEDEKYISSL IDENKKLEAVVFLHKNRDMSLKDSKNYTDKLILKKNIETNKESTHKWSCIYDKKLNALVP NLPRQKKALKIMLNIFLALLVITLIQFLFLDRSSDIKMIILEFSILGILLFTITLPLVSL HIYSIENKLKKLENLELSNQFEVRAFISNLELFLEVLLILIFIIVIPILFVKTYKKVDCK DYKEIFYFLVLIVITIYGIYELLKMFKYKKYSLNISNREISLLYNKNEIKSIKIENLNYI NFYAKKSRRGISSNIPVIQIFDMEKNIFTEMKVKISDYILLKMYFEKYKVMVSDDFKMF >gi|296155427|gb|ADVK01000009.1| GENE 31 26152 - 26478 639 108 aa, chain + ## HITS:1 COG:no KEGG:FN1049 NR:ns ## KEGG: FN1049 
# Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 107 1 107 108 160 94.0 2e-38 MRAKEFAEMCYEEKEIQLKEYMNGNESLVAKLKNELALSNKQEKILYELLDTVLTDTYIT LLYALDGTASLGNGKQENFKLYGEDGNLIFNSGELEMAVYEIFYENKN >gi|296155427|gb|ADVK01000009.1| GENE 32 26495 - 26878 688 127 aa, chain + ## HITS:1 COG:FN1050 KEGG:ns NR:ns ## COG: FN1050 COG0346 # Protein_GI_number: 19704385 # Func_class: E Amino acid transport and metabolism # Function: Lactoylglutathione lyase and related lyases # Organism: Fusobacterium nucleatum # 1 127 1 127 127 234 98.0 2e-62 MFIEHIAMYVNDLEKTKEFFIKYLGAKSNNIYHNKKTDFKSYFLTFDSGCRLEIMTKPEL VDEKKDLKRTGFIHIAFSVGSKEKVDELTEILKADGFEVISGPRTTGDGYYESCIVGIEE NQIEITI >gi|296155427|gb|ADVK01000009.1| GENE 33 26899 - 27618 701 239 aa, chain + ## HITS:1 COG:no KEGG:FN1051 NR:ns ## KEGG: FN1051 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 239 1 239 239 355 97.0 1e-96 MNKDFDNNILDFTKRKKILKLILFLLVISLVILIVQLFFKDSLSFSEGNLTIINSFVAFI LLFLYITLTADMYVTIKRIKEREKIEVPNEFRVDGFKQTYFIILYIIVLLSALALVFASI ITGQNIVMTLMGIVITIIVSTYIYSMIKNRKFSLEVKNKNIKIFYKNQEIGTFEIKYISF VAFSGSGGQKVKKGDYPIMAVYSLRGEEFLRMPLSLRNYWLMKKYFLKYNIDIEDSYNL >gi|296155427|gb|ADVK01000009.1| GENE 34 27638 - 28105 550 155 aa, chain + ## HITS:1 COG:no KEGG:FN1052 NR:ns ## KEGG: FN1052 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 155 1 155 155 254 98.0 6e-67 MSKILLILPFVFVFIGIFTVIYIIYTTINEKRRKKLRDEEFKKIKETLFSYEFESTQKNA VNKNFDFKNYLYSGDYVKVIKDFKDYYGFTYQAGEKFYFACVYFLPYEDGYTLYISKNKL NISPIYLQNREETQGEICSHPEEYFEIIEQGRFKR >gi|296155427|gb|ADVK01000009.1| GENE 35 28141 - 28689 551 182 aa, chain + ## HITS:1 COG:no KEGG:FN1053 NR:ns ## KEGG: FN1053 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 182 1 182 182 296 99.0 2e-79 MYSYLYDNKFLLWTTIIFMFIGMITVTIILYIFFLSIVKRKKSKKEMEELLEILSTLKNN NRIERDKETKKLSETLSPYEFESTQLNYVNKSFNFEKYLYSGDYVKVIKAFKDYYGFTHQ 
VGEKWYFASQYSLLSEYGDVLYISTDKINVDTIYLEDRKDNLYTHPEEYFEILEQGRFKR EI >gi|296155427|gb|ADVK01000009.1| GENE 36 28713 - 29090 383 125 aa, chain + ## HITS:1 COG:no KEGG:FN1054 NR:ns ## KEGG: FN1054 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 125 1 125 125 193 99.0 2e-48 MRMAMIMMTLVMVIFIICSVALISLLGRKNTMECFYVEDNILYLNSLFVKKISLSDIRNI EFKTFRSRGSYSGKIIVNLKNAKVIKRYFQTSQVAFFVSEQMVLAEIEKITSILKKYYIP YTINK >gi|296155427|gb|ADVK01000009.1| GENE 37 29115 - 29231 155 38 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|34763313|ref|ZP_00144269.1| ## NR: gi|34763313|ref|ZP_00144269.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC 49256] hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC 49256] # 1 37 1 37 217 67 97.0 4e-10 MERIILNQENHSVKEIYQDFYMFLAEYFTPLGYKYRKL >gi|296155427|gb|ADVK01000009.1| GENE 38 29558 - 30568 499 336 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 [Streptococcus pneumoniae SP6-BS73] # 9 313 7 307 308 196 38 1e-49 MEQKKMKYLENLVGKTPMLELIFDYKGEERRIFVKNESYNLTGSIKDRMAFYTLKKAYEK NEIKKGAPIVEATSGNTGIAFSAMGAILGHPVIIYMPDWMSEERKSLIRSFGAKIILVSR KEGGFLGSIEKTKEFAKNNPDTYLPSQFSNLYNSEAHYYGIGLEIVNEMKSLNLNIDGFV AGVGTGGTVMGIGKRIKENFSNAKICPLEPLNSPTLSTGYKVAKHRIEGISDEFIPDLVK LDKLDNVVSVDDGDAIVMAQKLAKCGLGVGISSGANFIGALMLQNKLGKDSVIVTVFPDD NKKYLSTDLMREEKVKEDFLSKDITLKEIKNVLRVI >gi|296155427|gb|ADVK01000009.1| GENE 39 30811 - 31410 530 199 aa, chain + ## HITS:1 COG:no KEGG:FN1056 NR:ns ## KEGG: FN1056 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 199 1 199 199 308 100.0 7e-83 MTYEFKYSTSNSNGIFYYLFIFLALFIAIFINYLILKFFNKQILLTLDKIIYGDYKHSVL AFLLFFIVPFIFLYGIFPCFIAGLMINKVMYKNGSIDILNNYAVLHYKNEEIILEKENFS IETIERKNIYLLNTKFILMRLGKEEIYLYVIKTDNGKKYKLYKQTKGYFKSIYARCTNDT SLSIAMQKLSKISNIKKEK >gi|296155427|gb|ADVK01000009.1| GENE 40 31426 - 31884 564 152 aa, chain + ## HITS:1 COG:FN1057 KEGG:ns NR:ns ## COG: FN1057 COG0454 # 
Protein_GI_number: 19704392 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 7 152 1 146 146 254 98.0 3e-68 MIRKCRLEDKKDWVRLNKQFIEYEYKDENVWNSPLKFGNLEEDFELILNDTSTILFAIIE EEKMIGFMNIQCFYSIWSHGKVFFLDDFFIEENFRRKDYGEKALKDLQKYAKKSGIKRIQ LMAENTNPRAIKFYKKHKFNEQEIHLFLKYLI >gi|296155427|gb|ADVK01000009.1| GENE 41 31920 - 32687 735 255 aa, chain + ## HITS:1 COG:no KEGG:FN1058 NR:ns ## KEGG: FN1058 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 255 1 255 255 348 100.0 1e-94 MEMFGMILFIPLFFFFSIFLAFTIRKRIIEKIIFIENDENLKDLRAWDFFYNILRMEKSA KPIYYTEFFLLILDTIYILFAGYKEYLKEIEFVKEYPDFPISPISFVFTKFAIPIILWFI ILLLLLFALIMKKKENKRISEMLDNLEKSKLLNYAKEDFIKSDRIVGTGIVIQSDIKLGD KFLFSIYPAYIIPYSWIDNVKIDEIRGRGGRRLYLDFTLKKSYESIKIFFAKEDVAEKIK DFSLKTKNKNNRDKI >gi|296155427|gb|ADVK01000009.1| GENE 42 32781 - 34058 1507 425 aa, chain + ## HITS:1 COG:FN1059 KEGG:ns NR:ns ## COG: FN1059 COG1114 # Protein_GI_number: 19704394 # Func_class: E Amino acid transport and metabolism # Function: Branched-chain amino acid permeases # Organism: Fusobacterium nucleatum # 1 425 1 425 425 676 99.0 0 MYKTKDVLLTGFALFAMLFGAGNLIFPPMLGYETSSSWTLTMLAFIITGVGFPFLGILSV SIAGNGIKDFANRVSPTFSKIFAIISILAIGPMLAIPRTGATAYEITFLYNGMESPIYKY IYLICYFGIVILFSLRANKVIERVGKILTPILLILLFLIIIKGIFFANLSVKPDVYPHAF KKGFLEGYQTMDTIASIAYAGIILKAIKNGRNLTQKQEFSFLIKAGLVAILSLALIYGGF ALVGAKMHSVLATNDKIELLVKTTSYLLGNYGNLVLAICVAGACLTTAIGLIATVGEFFS SITSFKYEKIVAFTVIISFLLSILGVESIIRISVPILVFIYPIIISLIVLNLFGKYIKND YVYKGVVLFTGIIGLIESLDSLGIKNYYTDSVLEILPFSDYGLTWLFPGLVGYILFSFMF RKVKK >gi|296155427|gb|ADVK01000009.1| GENE 43 34100 - 34864 1046 254 aa, chain - ## HITS:1 COG:FN1060_2 KEGG:ns NR:ns ## COG: FN1060_2 COG4884 # Protein_GI_number: 19704395 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 75 254 1 180 180 321 100.0 1e-87 
MIQVLHFKNEKSDKFWFVETLDSEMMVNYGKTGATGKYEIKEFDSSEACEKEALKLINSK KKKGYEEFPEFDRNNHYYFDDEEYGLNPLTSHPTFRKYFLDNFYYDCGDEEAPFGSDEGN DTLYELQEAIQKKKKIDFFDFPKVIIEKLWEMKYLSPNVEKTDEELEEQAKSEFNGLLGE QIILQSDQVILAVAFGQAKITGKIDKKLLELALKSLNRIDRLNRLIWNWDKEEATYYIET MKKDLIKFKEDFQK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:12:08 2011 Seq name: gi|296155282|gb|ADVK01000010.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00011, whole genome shotgun sequence Length of sequence - 144913 bp Number of predicted genes - 147, with homology - 142 Number of transcription units - 57, operones - 36 average op.length - 3.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 365 414 ## FN0551 Crp family regulatory proteins 2 2 Op 1 1/0.909 - CDS 352 - 1461 1518 ## COG3616 Predicted amino acid aldolase or racemase 3 2 Op 2 4/0.000 - CDS 1481 - 2806 1813 ## COG3048 D-serine dehydratase 4 2 Op 3 . - CDS 2840 - 4189 1947 ## COG2610 H+/gluconate symporter and related permeases - Prom 4310 - 4369 12.9 + Prom 4202 - 4261 11.2 5 3 Tu 1 . + CDS 4431 - 5048 677 ## COG1396 Predicted transcriptional regulators - Term 5042 - 5108 9.2 6 4 Tu 1 . - CDS 5126 - 5746 851 ## FN0556 hypothetical protein - Prom 5780 - 5839 17.6 7 5 Tu 1 . - CDS 6157 - 6222 198 ## - Prom 6252 - 6311 6.1 - Term 6277 - 6323 6.2 8 6 Tu 1 . - CDS 6346 - 7080 1026 ## FN0557 hypothetical protein - Prom 7120 - 7179 12.4 + Prom 7122 - 7181 11.9 9 7 Op 1 . + CDS 7410 - 8129 1004 ## FN0558 TraT complement resistance protein precursor 10 7 Op 2 1/0.909 + CDS 8167 - 9855 2608 ## COG1109 Phosphomannomutase 11 7 Op 3 2/0.182 + CDS 9839 - 10942 1106 ## COG0635 Coproporphyrinogen III oxidase and related Fe-S oxidoreductases + Prom 11052 - 11111 8.0 12 8 Op 1 14/0.000 + CDS 11142 - 11813 910 ## COG0325 Predicted enzyme with a TIM-barrel fold 13 8 Op 2 1/0.909 + CDS 11834 - 12286 781 ## COG1799 Uncharacterized protein conserved in bacteria 14 8 Op 3 . 
+ CDS 12273 - 13274 1412 ## COG0482 Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain + Term 13281 - 13329 8.1 + Prom 13280 - 13339 8.6 15 9 Op 1 . + CDS 13396 - 13587 227 ## gi|296327470|ref|ZP_06870016.1| conserved hypothetical protein 16 9 Op 2 . + CDS 13620 - 14015 634 ## PTH_0968 hypothetical protein + Term 14025 - 14072 2.8 - Term 14007 - 14065 4.0 17 10 Op 1 . - CDS 14140 - 14682 643 ## FN0564 hypothetical protein 18 10 Op 2 . - CDS 14725 - 14940 203 ## FN0565 hypothetical protein - Prom 15054 - 15113 10.8 19 11 Tu 1 . - CDS 15172 - 16278 1016 ## FN0566 hypothetical protein - Prom 16298 - 16357 7.7 20 12 Op 1 . - CDS 16433 - 16942 638 ## FN0568 hypothetical protein 21 12 Op 2 . - CDS 16939 - 19275 2187 ## COG2849 Uncharacterized protein conserved in bacteria - Prom 19383 - 19442 10.9 + Prom 19277 - 19336 17.4 22 13 Op 1 . + CDS 19469 - 19960 268 ## FN0570 hypothetical protein 23 13 Op 2 . + CDS 19941 - 20867 975 ## COG0758 Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 24 13 Op 3 . + CDS 20925 - 21416 373 ## Tresu_2679 hypothetical protein 25 13 Op 4 . + CDS 21413 - 21589 378 ## FN0575 hypothetical protein + Term 21614 - 21648 1.3 + Prom 21594 - 21653 8.3 26 14 Op 1 . + CDS 21739 - 23307 2329 ## COG2304 Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 27 14 Op 2 . 
+ CDS 23322 - 24047 841 ## FN0577 hypothetical protein 28 14 Op 3 1/0.909 + CDS 24051 - 25871 1679 ## COG0514 Superfamily II DNA helicase 29 14 Op 4 7/0.000 + CDS 25868 - 30724 6207 ## COG2373 Large extracellular alpha-helical protein + Prom 30916 - 30975 7.4 30 14 Op 5 1/0.909 + CDS 31150 - 33396 2350 ## COG4953 Membrane carboxypeptidase/penicillin-binding protein PbpC 31 14 Op 6 23/0.000 + CDS 33400 - 34569 1472 ## COG4591 ABC-type transport system, involved in lipoprotein release, permease component 32 14 Op 7 1/0.909 + CDS 34544 - 35239 259 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 33 14 Op 8 1/0.909 + CDS 35254 - 35730 695 ## COG3212 Predicted membrane protein 34 14 Op 9 2/0.182 + CDS 35736 - 36029 469 ## COG2350 Uncharacterized protein conserved in bacteria 35 14 Op 10 40/0.000 + CDS 36080 - 36754 1015 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 36 14 Op 11 . + CDS 36732 - 38069 1380 ## COG0642 Signal transduction histidine kinase + Term 38119 - 38155 -0.7 37 15 Op 1 . - CDS 38205 - 38381 231 ## FN0587 hypothetical protein 38 15 Op 2 . - CDS 38411 - 38644 212 ## FN0588 hypothetical protein - Prom 38690 - 38749 9.1 + Prom 38641 - 38700 10.1 39 16 Tu 1 . + CDS 38725 - 39045 378 ## COG1733 Predicted transcriptional regulators + Term 39121 - 39166 2.3 - Term 39039 - 39090 8.2 40 17 Op 1 4/0.000 - CDS 39098 - 40279 1770 ## COG1473 Metal-dependent amidase/aminoacylase/carboxypeptidase 41 17 Op 2 . 
- CDS 40295 - 41815 1734 ## COG2978 Putative p-aminobenzoyl-glutamate transporter - Prom 41882 - 41941 15.5 + Prom 41902 - 41961 11.3 42 18 Op 1 1/0.909 + CDS 41997 - 44210 3176 ## COG0210 Superfamily I DNA and RNA helicases 43 18 Op 2 4/0.000 + CDS 44221 - 45054 1183 ## COG0774 UDP-3-O-acyl-N-acetylglucosamine deacetylase 44 18 Op 3 25/0.000 + CDS 45076 - 45501 660 ## COG0764 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases 45 18 Op 4 5/0.000 + CDS 45519 - 46292 1139 ## COG1043 Acyl-[acyl carrier protein]--UDP-N-acetylglucosamine O-acyltransferase 46 18 Op 5 5/0.000 + CDS 46292 - 47095 933 ## COG3494 Uncharacterized protein conserved in bacteria 47 18 Op 6 1/0.909 + CDS 47105 - 48175 1357 ## COG0763 Lipid A disaccharide synthetase 48 18 Op 7 . + CDS 48172 - 49923 202 ## PROTEIN SUPPORTED gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein + Term 50132 - 50174 1.1 - Term 50120 - 50160 1.5 49 19 Op 1 . - CDS 50341 - 50841 730 ## FN0600 hypothetical protein 50 19 Op 2 . - CDS 50921 - 51391 681 ## COG2849 Uncharacterized protein conserved in bacteria - Prom 51413 - 51472 12.9 + Prom 51406 - 51465 15.8 51 20 Tu 1 . + CDS 51518 - 52228 1270 ## FN0602 hypothetical protein - Term 52223 - 52270 5.8 52 21 Tu 1 . - CDS 52320 - 53264 826 ## COG0583 Transcriptional regulator - Prom 53428 - 53487 7.8 + Prom 53323 - 53382 14.7 53 22 Op 1 1/0.909 + CDS 53439 - 54647 1619 ## COG1301 Na+/H+-dicarboxylate symporters 54 22 Op 2 1/0.909 + CDS 54675 - 55886 1593 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase 55 22 Op 3 1/0.909 + CDS 55883 - 56308 649 ## COG0251 Putative translation initiation inhibitor, yjgF family + Term 56319 - 56389 7.5 + Prom 56345 - 56404 10.2 56 23 Op 1 1/0.909 + CDS 56433 - 57014 875 ## COG1713 Predicted HD superfamily hydrolase involved in NAD metabolism 57 23 Op 2 10/0.000 + CDS 57011 - 59113 1236 ## PROTEIN SUPPORTED gi|15894003|ref|NP_347352.1| fused ribonuclease/ribosomal protein S1 58 23 Op 3 . 
+ CDS 59130 - 59576 680 ## COG0691 tmRNA-binding protein 59 23 Op 4 . + CDS 59645 - 63151 4341 ## FN0610 hypothetical protein 60 23 Op 5 . + CDS 63177 - 65090 2789 ## COG0441 Threonyl-tRNA synthetase 61 23 Op 6 . + CDS 65103 - 65621 899 ## FN0612 hypothetical protein + Term 65638 - 65686 5.2 + Prom 65643 - 65702 7.4 62 24 Tu 1 . + CDS 65759 - 66190 564 ## FN0613 hypothetical protein + Term 66208 - 66251 6.0 + Prom 66201 - 66260 14.5 63 25 Op 1 35/0.000 + CDS 66338 - 68053 188 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 64 25 Op 2 . + CDS 68057 - 69781 2231 ## COG1132 ABC-type multidrug transport system, ATPase and permease components - Term 69855 - 69898 1.0 65 26 Op 1 . - CDS 70040 - 71584 1776 ## FN0616 hypothetical protein 66 26 Op 2 1/0.909 - CDS 71621 - 72715 1261 ## COG0592 DNA polymerase sliding clamp subunit (PCNA homolog) 67 26 Op 3 1/0.909 - CDS 72730 - 73758 1473 ## COG0687 Spermidine/putrescine-binding periplasmic protein 68 26 Op 4 . - CDS 73829 - 74698 1026 ## COG0668 Small-conductance mechanosensitive channel - Prom 74762 - 74821 14.7 69 27 Tu 1 . - CDS 74824 - 76128 1616 ## COG0427 Acetyl-CoA hydrolase - Prom 76272 - 76331 9.7 + Prom 76197 - 76256 8.9 70 28 Op 1 1/0.909 + CDS 76347 - 77000 874 ## COG1059 Thermostable 8-oxoguanine DNA glycosylase 71 28 Op 2 . 
+ CDS 77021 - 77977 1113 ## COG0679 Predicted permeases 72 29 Op 1 2/0.182 - CDS 78014 - 79336 1715 ## COG1757 Na+/H+ antiporter 73 29 Op 2 1/0.909 - CDS 79338 - 80534 1527 ## COG1168 Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities 74 29 Op 3 1/0.909 - CDS 80554 - 81081 610 ## COG4283 Uncharacterized conserved protein - Prom 81103 - 81162 10.4 - Term 81114 - 81154 4.3 75 30 Op 1 3/0.000 - CDS 81174 - 82202 1213 ## COG2222 Predicted phosphosugar isomerases 76 30 Op 2 3/0.000 - CDS 82218 - 83276 1102 ## COG0449 Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 77 30 Op 3 13/0.000 - CDS 83286 - 84110 1202 ## COG3716 Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID 78 30 Op 4 13/0.000 - CDS 84097 - 84876 1229 ## COG3715 Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC 79 30 Op 5 9/0.000 - CDS 84887 - 85348 554 ## COG3444 Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB 80 30 Op 6 . - CDS 85361 - 85786 529 ## COG2893 Phosphotransferase system, mannose/fructose-specific component IIA 81 30 Op 7 . - CDS 85811 - 85906 115 ## - Prom 86107 - 86166 15.2 + Prom 85918 - 85977 4.5 82 31 Tu 1 . + CDS 85997 - 87241 1076 ## FN0633 replication protein + Term 87254 - 87314 8.1 - Term 87242 - 87302 10.7 83 32 Op 1 1/0.909 - CDS 87309 - 89126 2895 ## COG1217 Predicted membrane GTPase involved in stress response 84 32 Op 2 . - CDS 89149 - 90012 1195 ## COG0130 Pseudouridine synthase - Prom 90154 - 90213 12.1 + Prom 90102 - 90161 10.7 85 33 Tu 1 . + CDS 90269 - 90628 418 ## FN0636 hypothetical protein + Term 90640 - 90688 11.0 - Term 90628 - 90676 12.5 86 34 Op 1 . - CDS 90685 - 91125 524 ## gi|296327540|ref|ZP_06870086.1| cell division protein FtsL 87 34 Op 2 . - CDS 91142 - 91501 389 ## FN0638 hypothetical protein 88 34 Op 3 . 
- CDS 91519 - 91623 57 ## 89 34 Op 4 1/0.909 - CDS 91626 - 91955 382 ## COG4997 Uncharacterized conserved protein 90 34 Op 5 . - CDS 91967 - 92848 922 ## COG1266 Predicted metal-dependent membrane protease 91 34 Op 6 . - CDS 92871 - 93506 736 ## FN0641 methyltransferase (EC:2.1.1.-) 92 34 Op 7 . - CDS 93519 - 93989 475 ## FN0642 hypothetical protein 93 34 Op 8 1/0.909 - CDS 93962 - 94399 650 ## COG3708 Uncharacterized protein conserved in bacteria 94 34 Op 9 6/0.000 - CDS 94415 - 95872 1865 ## COG0007 Uroporphyrinogen-III methylase 95 34 Op 10 4/0.000 - CDS 95896 - 96792 1339 ## COG0181 Porphobilinogen deaminase 96 34 Op 11 1/0.909 - CDS 96828 - 97817 1149 ## COG0373 Glutamyl-tRNA reductase - Prom 97847 - 97906 15.0 97 35 Op 1 1/0.909 - CDS 97918 - 99048 1054 ## COG3629 DNA-binding transcriptional activator of the SARP family 98 35 Op 2 1/0.909 - CDS 99060 - 100589 1874 ## COG1574 Predicted metal-dependent hydrolase with the TIM-barrel fold 99 35 Op 3 1/0.909 - CDS 100617 - 102245 2256 ## COG1574 Predicted metal-dependent hydrolase with the TIM-barrel fold 100 35 Op 4 . - CDS 102245 - 103669 1617 ## COG1757 Na+/H+ antiporter - Prom 103698 - 103757 9.2 + Prom 103734 - 103793 14.0 101 36 Tu 1 . + CDS 103892 - 104758 232 ## PROTEIN SUPPORTED gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit + Prom 104832 - 104891 9.8 102 37 Op 1 . + CDS 104940 - 105947 1799 ## COG0057 Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase + Term 105982 - 106015 4.0 103 37 Op 2 . + CDS 106026 - 106664 787 ## FN0653 hypothetical protein 104 37 Op 3 . + CDS 106685 - 107881 1840 ## COG0126 3-phosphoglycerate kinase 105 37 Op 4 . + CDS 107929 - 108288 572 ## FN0655 hypothetical protein 106 37 Op 5 . + CDS 108303 - 108683 704 ## FN0656 hypothetical protein + Term 108702 - 108751 9.3 + Prom 108700 - 108759 14.7 107 38 Tu 1 . 
+ CDS 108844 - 109365 289 ## PROTEIN SUPPORTED gi|50365462|ref|YP_053887.1| acetyltransferase of 30S ribosomal protein L7 + Term 109374 - 109411 2.3 - Term 109349 - 109410 7.5 108 39 Op 1 22/0.000 - CDS 109418 - 110203 1149 ## COG1464 ABC-type metal ion transport system, periplasmic component/surface antigen 109 39 Op 2 32/0.000 - CDS 110218 - 110919 1008 ## COG2011 ABC-type metal ion transport system, permease component 110 39 Op 3 . - CDS 110909 - 111916 1117 ## COG1135 ABC-type metal ion transport system, ATPase component + Prom 111886 - 111945 8.6 111 40 Tu 1 . + CDS 112059 - 112166 152 ## + Prom 112197 - 112256 9.8 112 41 Op 1 . + CDS 112304 - 112711 454 ## FN0661 hypothetical protein 113 41 Op 2 . + CDS 112730 - 113686 1307 ## COG0010 Arginase/agmatinase/formimionoglutamate hydrolase, arginase family + Term 113706 - 113762 7.6 + Prom 113740 - 113799 12.3 114 42 Op 1 . + CDS 113828 - 114295 530 ## FN0663 hypothetical protein 115 42 Op 2 . + CDS 114336 - 115484 1917 ## COG2070 Dioxygenases related to 2-nitropropane dioxygenase + Term 115518 - 115563 4.3 + Prom 115546 - 115605 9.8 116 43 Op 1 . + CDS 115633 - 116124 660 ## FN0665 N-acetylmuramoyl-L-alanine amidase (EC:3.5.1.28) 117 43 Op 2 . + CDS 116156 - 116794 764 ## FN0666 hypothetical protein 118 44 Tu 1 . 
- CDS 116801 - 118138 1281 ## COG0534 Na+-driven multidrug efflux pump - Prom 118180 - 118239 6.3 + Prom 118416 - 118475 10.8 119 45 Op 1 25/0.000 + CDS 118560 - 119486 1459 ## COG0803 ABC-type metal ion transport system, periplasmic component/surface adhesin 120 45 Op 2 42/0.000 + CDS 119510 - 120193 258 ## PROTEIN SUPPORTED gi|225084369|ref|YP_002657150.1| ribosomal protein S16 + Prom 120198 - 120257 11.1 121 46 Op 1 10/0.000 + CDS 120281 - 121198 1042 ## COG1108 ABC-type Mn2+/Zn2+ transport systems, permease components 122 46 Op 2 1/0.909 + CDS 121195 - 122037 1015 ## COG1108 ABC-type Mn2+/Zn2+ transport systems, permease components + Term 122061 - 122108 6.0 + Prom 122095 - 122154 9.7 123 47 Op 1 1/0.909 + CDS 122225 - 123541 1749 ## COG1373 Predicted ATPase (AAA+ superfamily) 124 47 Op 2 1/0.909 + CDS 123565 - 124029 650 ## COG2606 Uncharacterized conserved protein 125 47 Op 3 . + CDS 124054 - 124902 696 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily - Term 124928 - 124965 3.2 126 48 Op 1 41/0.000 - CDS 125001 - 126620 1595 ## PROTEIN SUPPORTED gi|167855908|ref|ZP_02478658.1| 50S ribosomal protein L28 127 48 Op 2 . - CDS 126638 - 126910 506 ## COG0234 Co-chaperonin GroES (HSP10) - Prom 127031 - 127090 12.0 + Prom 126995 - 127054 13.7 128 49 Tu 1 . + CDS 127082 - 128704 1904 ## Lebu_0003 hypothetical protein + Term 128712 - 128756 6.2 + Prom 128733 - 128792 15.2 129 50 Op 1 . + CDS 128949 - 129185 152 ## gi|296327582|ref|ZP_06870128.1| conserved hypothetical protein 130 50 Op 2 . + CDS 129234 - 129986 737 ## SNSL254_A2953 hypothetical protein 131 50 Op 3 . + CDS 129979 - 131106 1139 ## TERTU_1467 hypothetical protein + Prom 131177 - 131236 7.5 132 51 Tu 1 . + CDS 131273 - 131389 69 ## + Prom 131414 - 131473 5.9 133 52 Op 1 . + CDS 131524 - 131787 318 ## Tola_2268 hypothetical protein 134 52 Op 2 . 
+ CDS 131809 - 132909 738 ## gi|296327587|ref|ZP_06870133.1| ABC superfamily ATP binding cassette transporter permease subunit + Prom 132961 - 133020 13.2 135 53 Op 1 7/0.000 + CDS 133122 - 133724 832 ## COG2815 Uncharacterized protein conserved in bacteria 136 53 Op 2 10/0.000 + CDS 133749 - 134615 841 ## COG1162 Predicted GTPases 137 53 Op 3 1/0.909 + CDS 134608 - 135255 1132 ## COG0036 Pentose-5-phosphate-3-epimerase 138 53 Op 4 1/0.909 + CDS 135268 - 135945 943 ## COG1846 Transcriptional regulators 139 53 Op 5 . + CDS 135947 - 137572 1702 ## COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP + Term 137582 - 137633 8.6 - Term 137571 - 137622 5.1 140 54 Tu 1 . - CDS 137624 - 137869 438 ## FN0683 hypothetical protein - Prom 137896 - 137955 8.0 + Prom 137892 - 137951 13.8 141 55 Tu 1 . + CDS 138051 - 139751 2750 ## COG1151 6Fe-6S prismane cluster-containing protein + Term 139772 - 139815 5.6 - Term 139752 - 139809 5.2 142 56 Op 1 . - CDS 139827 - 141281 1794 ## COG4145 Na+/panthothenate symporter 143 56 Op 2 . - CDS 141294 - 141587 269 ## FN0686 integral membrane protein - Prom 141776 - 141835 11.4 + Prom 141635 - 141694 6.7 144 57 Op 1 . + CDS 141723 - 143126 1088 ## FN0687 hypothetical protein 145 57 Op 2 . + CDS 143177 - 143689 847 ## FN0688 hypothetical protein 146 57 Op 3 . + CDS 143767 - 144279 847 ## FN0689 hypothetical protein 147 57 Op 4 . 
+ CDS 144302 - 144811 621 ## FN0690 hypothetical protein Predicted protein(s) >gi|296155282|gb|ADVK01000010.1| GENE 1 3 - 365 414 120 aa, chain + ## HITS:1 COG:no KEGG:FN0551 NR:ns ## KEGG: FN0551 # Name: not_defined # Def: Crp family regulatory proteins # Organism: F.nucleatum # Pathway: not_defined # 24 120 1 97 97 129 98.0 3e-29 SGWGKRNNINEARYNAEKSYKKNVDLNSEIILSEIDGEKKKNIEIIEKLKELNIRKQNSE KLIEIFRSKEKVSCASLANYLDISERTANRLLLKLEENNLAVSDLVKINRGRPKIFFRFF >gi|296155282|gb|ADVK01000010.1| GENE 2 352 - 1461 1518 369 aa, chain - ## HITS:1 COG:FN0552 KEGG:ns NR:ns ## COG: FN0552 COG3616 # Protein_GI_number: 19703887 # Func_class: E Amino acid transport and metabolism # Function: Predicted amino acid aldolase or racemase # Organism: Fusobacterium nucleatum # 1 369 1 369 369 668 98.0 0 MKKKELKTPTILLNIEALKNNIKKYQKLCTEYKKELWPMIKTHKSMEIVEMQIKEGATGA LCGTLDEVEACCQIGIKKIMYAYPVASEENIKRIIEISKKTEFIIRLDSLEAAIKINKMA ETENIIINYNIIVDSGLHRFGVSLKNLLTFAEELKKLKYLKLKGISSHPGHVYSSTCEAD IQQYVLDECETLRKAKEILEKEGYYLEYITSGSTPTFEEAVKDLNINVYHPGNYVFLDSI QLSINKAKIKDCALTVLATIISHPSENLFICDAGAKCLGLDQGAHGNNSIVGYGTVIDHP EVIVSSLSEEVGKLKIEGETNLKIGDKIEIIPNHSCSTANLCSYYTVTEGDNVIKSIKVD VRGNSIRRI >gi|296155282|gb|ADVK01000010.1| GENE 3 1481 - 2806 1813 441 aa, chain - ## HITS:1 COG:FN0553 KEGG:ns NR:ns ## COG: FN0553 COG3048 # Protein_GI_number: 19703888 # Func_class: E Amino acid transport and metabolism # Function: D-serine dehydratase # Organism: Fusobacterium nucleatum # 1 441 1 441 441 835 99.0 0 MDIKNMITNNPLIKNMIDKKEVGWTNPKEMNYTEYEKKLPLKDQELKEAEERLKRFAPFI KKVFPETEETYGIIESPLEEIFNMQKELEKKYHTEILGKLYLKMDSHLPVAGSIKARGGV YEVLKHAEELAMEAGLLKLEDDYSILADKKFKDFFSKYKIQVGSTGNLGLSIGITSAALG FQVIVHMSADAKKWKKDMLRSKGVQVIEYESDYGKAVEEGRKNSDADPMSYFVDDEKSMN LFLGYTVAASRIKKQFDKKGIVINKEHPLIVYIPCGVGGAPGGVAYGLKRIFKENVYIFF VEPVLAPCMLLGMQTGLHEKISVYDVGIHGITHADGLAVARPSGLVGRLMEPILSGIFTV DDYKLYDYLRILNETENKRIEPSSCAAFEGVVSLLKYEDSKKYIENRIGKNINNVYHVCW ATGGKMVPQEDMEIFLNTYLK >gi|296155282|gb|ADVK01000010.1| GENE 4 2840 - 4189 1947 449 aa, chain - ## 
HITS:1 COG:FN0554 KEGG:ns NR:ns ## COG: FN0554 COG2610 # Protein_GI_number: 19703889 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism # Function: H+/gluconate symporter and related permeases # Organism: Fusobacterium nucleatum # 1 449 1 449 449 723 100.0 0 MNVTFTIFAILLSILLLVLLTIKVKLHPFFALTVSAFFFGLISGHSIPDIIGAYSDGLGG TIAGIGVVIAIGTVMGALLENSGAAETMAETILKITGKKNADIGLAVTGYFVSIPVFCDS AFVLLSPLAKRVSKDTGGSMTTMAVALAMGLHATHMLVPPTPGPLAVAGILGANLGLVIL CGMLVSIPVTIVAIIAGRIFGKKYHFLPEIEEVHTDEKAKNLPSPFMSFSPIIVPIILML LKTVGSLESKPFGTGVLYNIFDSLGQTIVALFIGLIIAFFTYKSVYPYDKNVWTFDGIFG ESLKTAGQIVLIVGAGGAFATVLKLSNLQEIVMNLFTGISIGIIVPYIIGAIFRTAIGSG TVGMITAASMLLPLLDILGFNTPMGLVIAMLACAAGGFMVFHGNDDFFWVVVSTSGMKPE VAYKTFPIISVLQSVTALICVFILKIIFL >gi|296155282|gb|ADVK01000010.1| GENE 5 4431 - 5048 677 205 aa, chain + ## HITS:1 COG:FN0555 KEGG:ns NR:ns ## COG: FN0555 COG1396 # Protein_GI_number: 19703890 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 7 205 1 199 199 349 98.0 2e-96 MKKDKIVYNKKELLRKDNIKMNKEINVGITIKNIRKSKKLLLKDVALKCGISSSMLSQIE KGNANPSLNTIKSIAQVLEVPLFKFFMDLEKEKYEFHLLKKDDRKIISTEYVTYELLSPD VETNIECMQMTLIGKNAETSVKPMAHKGEEIAVLLNGKVKLTIGKFSIVLSSGDSIHIPS MTPHKWTNLHTEKSVVIFSVSPPEF >gi|296155282|gb|ADVK01000010.1| GENE 6 5126 - 5746 851 206 aa, chain - ## HITS:1 COG:no KEGG:FN0556 NR:ns ## KEGG: FN0556 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 18 206 18 206 206 322 99.0 9e-87 MKKIFILLIVLLGLLVVSCGKKWDYEVTKKESIEIGNDITFYLLTLKEKESGNEIDSLQI TKEGFDKYNVKDKLTKEELDSIKISEDSLANSDLTLRGNYCIVENGSSTDLKFIPENTEF SILVIGKQEDVTPTIPTMILIDKEHKLFYVIILKNQWDAEEFEYTINKDDLKNIEARHMA TGNYDENDLSSIENLKNDIHFVEEEK >gi|296155282|gb|ADVK01000010.1| GENE 7 6157 - 6222 198 21 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MGMQRFRRGCEVIGSKSGLSL >gi|296155282|gb|ADVK01000010.1| GENE 8 6346 - 7080 1026 244 aa, chain - ## HITS:1 COG:no KEGG:FN0557 NR:ns ## KEGG: FN0557 # Name: 
not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 244 1 244 244 416 99.0 1e-115 MNKFFIALFVVVFGFSFSNKVFGKTVVKSKNNIVDVVFILDRSGSMGGLESDTIGGFNSM LEKQRKIEGKAFITTVLFDDQYELLHDRVNINKISNITEKEYFVRGSTALLDAIGKTIAK EKAIQDTLGKNEKADKVLFVIITDGLENASREYSSATVKKLIETQKEKYGWEFLFLGANI DAIATANSIGISAEKAVNYNSDSIGTKKNYDTLNKAVEEVRSGKELNKEWKADIEADYNE RNKK >gi|296155282|gb|ADVK01000010.1| GENE 9 7410 - 8129 1004 239 aa, chain + ## HITS:1 COG:no KEGG:FN0558 NR:ns ## KEGG: FN0558 # Name: not_defined # Def: TraT complement resistance protein precursor # Organism: F.nucleatum # Pathway: not_defined # 24 239 1 216 216 358 99.0 1e-97 MKKIFKTILFSLLLISVFVSCSTLHTVVSKRNLDVQTKMSDTIWLEPAAANEKTVFVQVR NTSGKNLNIEQKVINVLTSKGYRIVNDPAGAKYWLQANILKVDKVNLDSNNGFSDAVLGA GIGGVLGAQRSGGVTTALGWGLAGAAIGTLADALVDDTAYAMVTDILITEKTGRNVQTST RNSVKQGNSGSITSTSSASSNMEKYSTRVLSTANQVNLNFNSAIPILEDELGKVIGGIF >gi|296155282|gb|ADVK01000010.1| GENE 10 8167 - 9855 2608 562 aa, chain + ## HITS:1 COG:FN0559 KEGG:ns NR:ns ## COG: FN0559 COG1109 # Protein_GI_number: 19703894 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphomannomutase # Organism: Fusobacterium nucleatum # 1 562 19 580 580 1065 99.0 0 MFLDEYKKWLDSNMLSASEKEELKNIANDEKEIESRFYTDLSFGTAGMRGVRGIGRNRMN KYNIRKATQGLANYIIKATGEVGKQKGVAIAYDSRLDSVENALNTAMTLAGNGIKVYLFD GVRSTPELSFAVRELKAQAGIMITASHNPKEYNGYKVYWEDGAQIVDPQATGIVSSVEAV NIFNDIKLMEEKEAIDKGLLVYVGEKLDDRYIEEVKKNAINPNVENKDKVKFVYSPLHGV AARPVERVLKEMGYTNVYPVKEQEKPDGNFPTCDYANPEDTTVFKLSIELADKVGAKICI ANDPDGDRVGLAVLDNDGKWFFPNGNQIGILFAEYILNYKKDIPKNGTMITTVVSTPLLD TIVKKNGKKALRVLTGFKYIGEKIRQFENKELDGTFLFGFEEAIGYLVGTHVRDKDAVVA SMIIAEMATTFENNGSSIYNEIIKIYEKYGWRLETTVPITKKGKDGLEEIQKIMKSMRVK SHTEIAGVKVKEYRDYQKGVENLPKADVIQMVLEDETYLTVRPSGTEPKIKFYISVVDSD KKVAEEKLAKIKKEFINYAENL >gi|296155282|gb|ADVK01000010.1| GENE 11 9839 - 10942 1106 367 aa, chain + ## HITS:1 COG:FN0560 KEGG:ns NR:ns ## COG: FN0560 COG0635 # Protein_GI_number: 19703895 # Func_class: H Coenzyme transport and metabolism # 
Function: Coproporphyrinogen III oxidase and related Fe-S oxidoreductases # Organism: Fusobacterium nucleatum # 1 365 1 365 365 664 98.0 0 MLKIYNTYIHIPFCERKCNYCDFTSLKGTDSQIEKYVNYLLKEIEIYSKEYDLSEKQDTI YFGGGTPSLLPINSLEKILSKFSYDKNTEITIEVNPKTVDTNKLKEYRKLGINRLSIGIQ TFNDDNLKVLGRIHSSQEAIEVYNLARECGFDNISLDIMFSLPYQTLSMLQNDLEKLVSL NPNHISIYSLIWEEGTKFFRDLKSGKLKETDNDLEASMYEYIIEFLKSKDYIHYEISNFS KKDFESRHNSTYWENKKYLGVGLSAAGYLDNVRYKNFFNLKDYYNNLDRNILPIDEKEIL TEEDIEQYRYLVGFRLLNKIIIPNEKYLEKCMSLCKEGYLLEKENGYILSHKGLMLFNDF ISNFIDI >gi|296155282|gb|ADVK01000010.1| GENE 12 11142 - 11813 910 223 aa, chain + ## HITS:1 COG:FN0561 KEGG:ns NR:ns ## COG: FN0561 COG0325 # Protein_GI_number: 19703896 # Func_class: R General function prediction only # Function: Predicted enzyme with a TIM-barrel fold # Organism: Fusobacterium nucleatum # 1 223 1 223 223 364 99.0 1e-101 MSIKTNVEEILEDIKKYSPYPEKVKLVAVTKYSSVEDIEKFLETRQNICGENKVQVIKDK IEYFKEKNKKIKWHFIGNLQKNKVKYIIDDVDLIHSVNKLSLAQEINKKAEQSSKIMDVL LEINVYGEESKQGYSLDELKCDIIELQNLKNLNIIGVMTMAPFTDDEKILRMVFSELRKI KDELNKEYFNNNLTELSMGMSNDYKIALQEGSTFIRVGTKIFK >gi|296155282|gb|ADVK01000010.1| GENE 13 11834 - 12286 781 150 aa, chain + ## HITS:1 COG:FN0562 KEGG:ns NR:ns ## COG: FN0562 COG1799 # Protein_GI_number: 19703897 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 40 150 1 111 111 188 100.0 2e-48 MGLLKDIKELVGINTGDDEEDIEEMEQEQTSKALSKRQKMEAEEVDEFRYEDYSTIFIDP KQFEDCKKIATYIEKEKMITINLENIGPNVAQRIMDFLAGAMEIKNASFAQIAKHVYTIV PENMKVYYEGKKREKKLIDLEKGERFNGEN >gi|296155282|gb|ADVK01000010.1| GENE 14 12273 - 13274 1412 333 aa, chain + ## HITS:1 COG:FN0563 KEGG:ns NR:ns ## COG: FN0563 COG0482 # Protein_GI_number: 19703898 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain # Organism: Fusobacterium nucleatum # 1 333 1 333 333 637 98.0 0 
MEKIKALALFSGGLDSALAIKVVQEQGIEVIALNFVSHFFGGKNEKAESMAKQLGIRLEY IDFKKRHILVVEDPVYGRGKNMNPCIDCHSLMFKIAGELLEEYGASFVISGEVLGQRPMS QNSQALEKVKKLSGMEDLVLRPLSAKLLPPSKAEIEGWVDREKLLDINGRSRQRQMELME FYGLVEYPSPGGGCLLTDPGYSSRLKVLEDDGLLKDDHYWLFKLIKEARFFRFSQARYLF VGRNKESNDKIDEFRKEKNLDFYIQSSEVPGPHIIANTNLTDEEIDFAKKLFSRYSKVKG NEKVILNNSGNLEEVDVLDLKKLDEEIKKYQQL >gi|296155282|gb|ADVK01000010.1| GENE 15 13396 - 13587 227 63 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327470|ref|ZP_06870016.1| ## NR: gi|296327470|ref|ZP_06870016.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 63 1 63 63 122 100.0 6e-27 MPNIIPYKVLVKILLENGWELDHTTGSHEIYIKDGKTCPVKCTKKDIPNGTLASIKRITG LKF >gi|296155282|gb|ADVK01000010.1| GENE 16 13620 - 14015 634 131 aa, chain + ## HITS:1 COG:no KEGG:PTH_0968 NR:ns ## KEGG: PTH_0968 # Name: not_defined # Def: hypothetical protein # Organism: P.thermopropionicum # Pathway: not_defined # 2 130 3 132 137 112 43.0 4e-24 MKEKYIYPCIIYEEDGIYYANFKDFDACFTDGESIEEVIINAKDVLEGTIFSLLKNNLEI PEPALTKPNLENNEFLVYIDIWLTPIVDKVKNQTVKKTLTIPKWLNDEAEKHSINFSNLL QTAIKKYLNIQ >gi|296155282|gb|ADVK01000010.1| GENE 17 14140 - 14682 643 180 aa, chain - ## HITS:1 COG:no KEGG:FN0564 NR:ns ## KEGG: FN0564 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 180 4 183 183 259 96.0 3e-68 MTNNEKIIELMDIFEKAKEKFLKDEKEIIRIDINERTLSARLMFHLQTILLEDKLYREKY KTYSVDCEYNRINEIEYKILKVCEYIEKTKNFEEVDKKVYPDIIVHKRNENNNLIVIEMK KANSYIKKKENDKNRLKAMTNPRKLNNFNYILGVYFEVDTIGNNNHIIEFFVNGKEYKKN >gi|296155282|gb|ADVK01000010.1| GENE 18 14725 - 14940 203 71 aa, chain - ## HITS:1 COG:no KEGG:FN0565 NR:ns ## KEGG: FN0565 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 71 67 137 137 91 94.0 8e-18 MLFNAIYRFFGIGFVKFFYSFAVSAVYLEKFFSMDVFFMALKVDLAVILGSILGNLSGYL SCAFNKKYEEF >gi|296155282|gb|ADVK01000010.1| GENE 19 15172 - 16278 1016 
368 aa, chain - ## HITS:1 COG:no KEGG:FN0566 NR:ns ## KEGG: FN0566 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 57 368 15 326 326 512 98.0 1e-143 MKKIFLIMTLIFMFSVSIFAENKNNSNNNQIIKITAIWKDNELIEISNNNSNISENKQIL SQLSLVYKQIFYNRELKTEISIDKINLLVKFNTFNGKEIKEVEYLYEGKNNHGTIKYFDK EGKLKLKAAISTNNLLNGNSRLDSIEAYYENGNKIDMNLLRYPVEIDGLWKNNKLTGIVK GMIYKEDKLFVQLFVSPLEESEFYQNHKSKLEEVYDNQNIIFKEVLDGENQTLNTKVYNE DNVLMTEEILYEKNDSFFKISKQFYANGNVKEISNFKDGVQDGFHEIYNEDGSKKLLKNY SDGKLISQENFNDEGFFIKTFTTLKNIAYKIRNIFTFIADFWIVFVFLVLPVIGILALIY EAIFKRKK >gi|296155282|gb|ADVK01000010.1| GENE 20 16433 - 16942 638 169 aa, chain - ## HITS:1 COG:no KEGG:FN0568 NR:ns ## KEGG: FN0568 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 13 169 9 179 179 120 59.0 2e-26 MIKSENIKDLTKELIDIFEKAKNEFLEKEKTIIKNDTNERTLTQRLAFYLELQLRKNIKY ENYSVDCEYNRKEEDIKRLKFGKNTDKKEIYPDIIVHQRKIKNNLIAIEMKKTTSRNTDK IKDIEKLEALTDRKNGYHYTLGIYFELDITDNNNIIKFFVDGKEYNYFI >gi|296155282|gb|ADVK01000010.1| GENE 21 16939 - 19275 2187 778 aa, chain - ## HITS:1 COG:FN2119 KEGG:ns NR:ns ## COG: FN2119 COG2849 # Protein_GI_number: 19705409 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 497 778 69 336 338 106 28.0 2e-22 MKKIFLILTLFFVFCVNVCAYPSDFFKSYPNGNIKINIDWDNGKGYYDANFYDGKNNKVK AKKNWENFWDINSFQEFYDTLKNEYSNQKIKVLINITNEEHPIGMFLNAKFYNSKDILVR EANNDRFSHNTKIYDGNGKLINETENNIQQNEFIFKEYGKNSKLLNETYISFDYNQIGVK AYNNNNILIMESSYTIKNLDFDNVKIIMSNLFENSNYKDVLDGYEKFYDDNGNLELEKTY KNGELVDTKEYKINTYEEKSVSVTSDSDTNVDNNLDIENKSFFEKNRSFILYSLGFLLVM AIIIFFTVTGIKEKLDNKNVSYDYEDTEDKKILYCNDKPVNRKIRYIYLPDSEFTSISLE ITYKHGVPIFIKLHHYDPYVSQENTNFEILIKKISKNFEGSFTILYDEHKIIANGKFKLP PWLYGNIYYYKKDEYRKDIFTKFVEKRRGISCTIQMWWLVSKIHFSPFGGRNNIPRNKEE KIKYLNEKYFKFLESIVIKGNIQDYYSNNNLKKNINIENNEKDGLFETFYPNGELEEKGY YQNGERIGLFKIFNEDGTLKEALEYKDIEKESFYDTGELKEKITYNHNVKHGTFKVFYKN GELKEKGCYDEGKLMWRENYKDRKLNGLFESFYDTGELKEKRHYKDGEIEGIVEYFYKSG 
NVRRKEYYEKGKTERLIEYFFENGNLKKRGYYDYENSKKDTLMKMNGLVESFFEDGKLEE KAFYKNDKKYGLSEIFYENGNLKERGNYKEDKKHGIFEYFDEDGKLIKKENYREGKLI >gi|296155282|gb|ADVK01000010.1| GENE 22 19469 - 19960 268 163 aa, chain + ## HITS:1 COG:no KEGG:FN0570 NR:ns ## KEGG: FN0570 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 163 1 163 163 185 97.0 4e-46 MNEVIFIIMMMVFSFLTFLVDIVLLIRFITLIKQIFYKDYSLTIENQLLPNYFPKIKNKY IRLFFNYIVFCITIHIVAFYYPRRFHKLIFSLEYFWKSFVLTFIMTFIMLIVFSITRKIK FLNNKVNKFSYLELILIPLMLISSIYYFNIFFEWFERYGENYV >gi|296155282|gb|ADVK01000010.1| GENE 23 19941 - 20867 975 308 aa, chain + ## HITS:1 COG:FN0571 KEGG:ns NR:ns ## COG: FN0571 COG0758 # Protein_GI_number: 19703906 # Func_class: L Replication, recombination and repair; U Intracellular trafficking, secretion, and vesicular transport # Function: Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake # Organism: Fusobacterium nucleatum # 5 308 1 304 304 468 91.0 1e-132 MEKTMYSKEELLIFSFINSNYDISIQNLTYKIFNFSNKENINFFKLNRIKKIEFIKSFFS EDNIEKILSVFDKLALYKIETEKIIKNCEEKNIKIFYYSYENYPKNLMNIKESPYVIFVK GNLPSNEELEKSFAIVGTRKPSKEGIDFARDIGQYLFKNNIYNISGLALGIDTVGHNMSL QKTGAILGQGLNLEIYPRENIKLAEMILENNGFLLSELIPQTEISLFSLIKRDRLQSALT SGIVIAETGIKGGTVNTFKYAREQKKKIFISEINKEFIERHKKDLIVIKNSLDFEKKLKN NLIQKNLF >gi|296155282|gb|ADVK01000010.1| GENE 24 20925 - 21416 373 163 aa, chain + ## HITS:1 COG:no KEGG:Tresu_2679 NR:ns ## KEGG: Tresu_2679 # Name: not_defined # Def: hypothetical protein # Organism: T.succinifaciens # Pathway: not_defined # 13 158 30 170 174 86 33.0 5e-16 MLYLKRLSDNIKVRATIKEIKYSKSGFKNWLFDWSKTEKKAYKILALYVEGDNRIQGAIS IRENLQNRTIEIDIVESAPFNSSYNKKIKDKEYIGVAICLFVEVCKRSFENGYDGYVEFT AKSNLVKYYMDNMRAIPIDAQRMYINTSGAKWLIEKYYGGVDL >gi|296155282|gb|ADVK01000010.1| GENE 25 21413 - 21589 378 58 aa, chain + ## HITS:1 COG:no KEGG:FN0575 NR:ns ## KEGG: FN0575 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 2 58 1 57 57 97 100.0 2e-19 
MMLEDFGMMTEPMPDNRVQYDLRALDKYCSERKILPIDLSEEELKKFEIPKDKSVANF >gi|296155282|gb|ADVK01000010.1| GENE 26 21739 - 23307 2329 522 aa, chain + ## HITS:1 COG:FN0576 KEGG:ns NR:ns ## COG: FN0576 COG2304 # Protein_GI_number: 19703911 # Func_class: R General function prediction only # Function: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain # Organism: Fusobacterium nucleatum # 154 522 1 369 369 669 99.0 0 MKNTKKFTILLLILASLFLIACGKDEKKDDTENKTGDKVEANVSTNLSEEELQIAKGVNG DLPDPVYTYEAIVDEAGGLYQSPQPNEDNYVKKHDIWTEDVQKELKTIKPALDENASEEE IQHLFNQFLYIVGYDYTPFETIDRFSYVIFKNDMENPFTHEKIEENMNVNVEIVLDASGS MVKKIGDKTMMEIAKESIKKVLSEMPANAKVGIRVFGHKGDNTASKKDESCGSNELIYPI GDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLKALDGEKTLNILYIITDGIETCGGNP VEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDANKLTGELYRINE LAFSDYKWEVLNDNLITRVKGMHNEILIFNKAAYGSKGITEKVDLSTAILYGGISSSNDP KFAGLYKIYGKVDKRLKELSEERKNKIDAIFEEEYNKRKKESEEYIAYLESRKGEMVAYV PSTSRVSPRSAYYTGTSNKGGTREDAKKDAEKIKAEKEAAKQ >gi|296155282|gb|ADVK01000010.1| GENE 27 23322 - 24047 841 241 aa, chain + ## HITS:1 COG:no KEGG:FN0577 NR:ns ## KEGG: FN0577 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 241 1 241 241 407 99.0 1e-112 MSNIEEFKKLYNFEFEEIKAKDYKEIEKKYLASYKEGKEKGFTPVFLVLDDVLLEKFELD MEDENTDNIMDIVKSNLEKYKDINAVEFLKNFQEENTEDYFTKKNYKYDNREKYNLELLS TLFNSSKKNKSDVVLVKVSTKNPYEVLGYFGMGGYNDCPLPAEQVAVAKYWYEKYGAVPA VITYDEIEFYVEKPVQTLEEAKKLAVEHYAFCYDIVEQCYGTFEKLVDGLYKNIQWYFWW D >gi|296155282|gb|ADVK01000010.1| GENE 28 24051 - 25871 1679 606 aa, chain + ## HITS:1 COG:FN0578 KEGG:ns NR:ns ## COG: FN0578 COG0514 # Protein_GI_number: 19703913 # Func_class: L Replication, recombination and repair # Function: Superfamily II DNA helicase # Organism: Fusobacterium nucleatum # 1 606 9 614 614 1134 99.0 0 MKSEALKILKEYYGYDNFREGQEKIIDAILEKRNVLGIMTTGAGKSICYQIPALIFEGLT IIISPLISLMKDQVDSLRLIGIEASYLNSALTSDEYNKILFRIKRGKIKLLYISPERLEN KFFLNFIKTVKISMVVVDEAHCVSQWGENFRRSYLRIADFVKYLSGETQIQTLAFTATAT 
PKIKVDIIEKLNIINPFIYIDYFNRDNIYFKVVDNSGLDKDLDIDSKPFIIDYLRKHKGK SGIIYCSTRKNVDDIYSYLVGFDRSVTKYHGGMSKEEREKNQKLFLDDDVEIMVATNAFG MGINKSNIRYVIHANIPADLESYYQEAGRAGRDGGSAEAILIYNEKDRDIQRYLMEKEAE GKTDRNYLNKRLKNFNKMIEYAELKTCYREYILKYFGEKMIRNYCGYCENCKKEKNIKDF SLEAKKIISGVGRAKESLGISTLANMLMGKADTKMLNKGLDKISTFGIMKEDKQEWIESF INYMISEKYLIQSAGSFPVLKLGEKYKDILNGNIKVIRKEDEKIDFDYYENDLFKELNSL RKEISKQENIAPYIIFSDMTLIEMAEKKPRNRWDMLKIKGIGNQKFKSYGEKFLERIINY CMEERI >gi|296155282|gb|ADVK01000010.1| GENE 29 25868 - 30724 6207 1618 aa, chain + ## HITS:1 COG:FN0579 KEGG:ns NR:ns ## COG: FN0579 COG2373 # Protein_GI_number: 19703914 # Func_class: R General function prediction only # Function: Large extracellular alpha-helical protein # Organism: Fusobacterium nucleatum # 8 1618 1 1611 1611 2831 99.0 0 MKKILKLVFILSILAIAFVACKKDKQEQQNDTGQSETSQGYDGYQQVLYVNNAKFNISGD IVVLFSEELDKNQDFKKLVEVEGLDGDITIMPFINKLIIKGNFKKDNPYNVKVSKDIKGI SGTSLTNDYIKYNLYLGKKEPSLSFVDSGNVLPSINNKKINFNSVNISKVKLEIVKVYTN NITQYLKLYSNEYGVNEWELKDDLGDIVYTKEYEIDSKEDQVVKNSIDLSNTIDTKGIYY VKISAIGQDSIDYDIGKYGEPNSFGYDGDVIYARAEKTIILSDIGIVANSNNSKLDLKLL NLNTLNPIPNARLEFINSKNQTLEEGTTNSNGEYKSKTNLDKVFYVLVKSGNEFNVLYLA GSKINYSDFDIGGSLDGSDLKLYTYTDKGYYRPGDEINVSLIARSKEKMNDNQPFEYSFT GPDGSTKINNEIVKESKNGFYTFKIKTDMNDLTGAWTLSIKFGGKEITQKVFIESKIANT IAIDADEDKIYTKADVKDKAIDFKFAFKYLSGANVDKGTIVNFDYAVIEKDTQSKKYREY NFSNPSNYRYQFRNFAETTLDDGSGEVNLKLDMPDTLQSKNLYLSTIVNVSDANGRYSTE HKVFKIINRENSVGVQKVSQNDNETSVKYILLNEKTDSLVAGKKLKYRIYNKEYNWWYDY YYNDGEKSFKENIETVLLEEGEITSGSSPELLKVTKLGDGTNFIEIEDEETGHSSGVFVY KFHYGDKRHGTIENLNITADKEKYNIGDIAKIKYSGAVGSKALVTIEKDGKIIKEYWKTL TVKDNEETIVIEKDFFPNAYVSISVFQKYVDKQNDRPLRLYGSVPLMVDDKSRMLTLQVD TKTEVLPGGDLKIKLSNKENKKMYYEVFLVDEGVLRMTDYKKPDPYKFFYEKRAKLVQSY DNFSNIIERYSDKVANRLKTGGGDAEEDALDSPMAAEVAYKKEDMQLFGDAQRFANLTIF RGVAESDENGNAVVDVKLPNFFGRMRLFVVAVSDESYGSAEKSISVKAPVIVETSAPRVL KVGDKFTVPITLFPIEKAIGNSEITLTYNGKTYNKKVNVKDGKSEKVLFELEAPATVGTT KIDIDFKSSKYSYKDTINLNVDTNYPYQYNEKSIVLEPNQEFTLSASEYKDFINGSVKSK LVLSSYQKLGIEKLIKSLLDYPYICLEQISSKGMAMLYIDNLTTDPIEKNDAKNEINTII 
GKLNNNYQLRNGAFAYWPGSQEEGISTVYAIRFLIEAKEKGYYVPETMFEKSKEYLNSIA MRTDVPKIEVLYLLAAIGDPNVSEMNIVFDRYYKNISVVEKWRLLAAYSKIGEKDFARKE ADKLPRKAERKDGSYYADDNAEILKYYTVIYGTADAELYNSVLAIAKSDAWLTTYEKANI VQALAGDGKVSPEKKNISFKLIVDGKEQNLELKDGEYTFRNLGTKENVKKIVIKNTSSSK LYVNSFYKGKPVKYDEKDESKNITITRKFVDMAGKEIDVKSLKAGTRFKMILTSKLANAD SPDISLLQILPSGWEFDNTQTNIQSAGGDMIPVPVDVEEVDNEEYGESSSSNSVNYVDIK DDRVAYFYPLYSGEDKVIEINLIAVTPGTYRLPGTKVESMYNNNYRAYLKGFEVKVKE >gi|296155282|gb|ADVK01000010.1| GENE 30 31150 - 33396 2350 748 aa, chain + ## HITS:1 COG:FN0580 KEGG:ns NR:ns ## COG: FN0580 COG4953 # Protein_GI_number: 19703915 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane carboxypeptidase/penicillin-binding protein PbpC # Organism: Fusobacterium nucleatum # 25 748 1 724 724 1296 98.0 0 MLKNINFKKVIIFFITLFILLFIYLIKVYITYNPKKLVKEINYSKVVLDRKGEILSVFLN NEEEFHIKYDGEVPETLKTAVINYEDKKFYSHSGVDYPRILKSFFNNITGGKKMGASTIS MQVVKLLEPKKRTYFNKLVEVVKAYKLESEFSKEEILKIYLNNVPYGSNIVGYSGAIKMY FNKEVKDLSYAEATLLAVLPNSPGILNLKKNNDKLETKRNRLLKTLLDRKLIDERQYKFS LLEKFPNKIYYYEKKAPQFSIFLKNRYPEKIIKSTLDYNLQKKLEKIVHDYSNAMKDVGI NNAAVLVVNNKTKEVLAYVASQDFYDKRNNGEIDGLQAKRSPASLLKPFLFALSIDDGLI VPDSVYPDVPIYFGNFYPKNSSDTFTGMVKIEEALIKSLNIPFVKLLSDYGVDRFYYFLE NNDNYPEDRFDKYGLSLILGTREMRPVDIAKLYMGLANYGKVSNLKYTLTEDKPREYQQF SRGASYLTLDTLSKVVRPGNENLYSEQRPISWKTGTSYGMKDAWSVGVSPDYTVLVWLGN FNQKSIFSLSGVETAGNLLFKVFNIVDINSRTFEKPTDDLKEIEIDEKTGYRKFYDVESK KVLYPKNAKLLRISPYYKKIFVDEDDMEIDSRSQNFDKRKEKIVIEYPIEVSNYFFLNGV RENKNVKIAYPVQNLNIFVPKDFDGYKKVAMKLYNPNNEYVYWYLDEDYVGYSNEKEKFF ELDIGKHKLTIVTESGAREEVKFNINKR >gi|296155282|gb|ADVK01000010.1| GENE 31 33400 - 34569 1472 389 aa, chain + ## HITS:1 COG:FN0581 KEGG:ns NR:ns ## COG: FN0581 COG4591 # Protein_GI_number: 19703916 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ABC-type transport system, involved in lipoprotein release, permease component # Organism: Fusobacterium nucleatum # 1 389 1 389 389 625 97.0 1e-179 MIEFFIAKKQMFERKKQSILSIVGVFIGITVLIVSLGVSNGLDKNMINSILSLTSHITVY 
SPENISDYEEISKDIETIKGVKGVVPTIETQGIVKYEGGIEPYVSGVKVVGYDLEKAIKT MKLDTYIIDGKIDLEDKKGVLIGNELAKATGATVGDKIKLITSEETDLEMNVAGIFQSGF YEYDINMVLIPLTTAQYITYSDNTVGRLSVRLDNPYDAQKLVLDVARKLPETYFIGTWGE QNKALLSALTLEKTIMLVVFSLIAIVAGFLIWITLNTLVREKTKDIGIMRAMGFSKKNIM LIFLIQGIILGIIGIILGIIVSLILLYYIKNYAVDLVSNIYYLKDIPIEISLKEIAIIVG ANFIVILISSIFPAYRAARLENVEALRYE >gi|296155282|gb|ADVK01000010.1| GENE 32 34544 - 35239 259 231 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 1 218 1 225 329 104 31 3e-21 MWRHLDMNNIIMKLEDIDKFYMETGNKLHILKKLNLEVKRGEFVSILGKSGSGKSTLLNI MGLLDKIDGGKIWIDNKEVSSLNEVERNNIKNHFLGFVFQFHYLMSEFTALENVMIPALL NNFKNKVEIEKEAKELLEIVDLAERIKHKPNQLSGGEKQRVAIARAMINKPKLILADEPT GNLDEDTGELIFSLFRKINKEHNQSIVVVTHARDLSQVTDRQIFLKKGVLE >gi|296155282|gb|ADVK01000010.1| GENE 33 35254 - 35730 695 158 aa, chain + ## HITS:1 COG:FN0583 KEGG:ns NR:ns ## COG: FN0583 COG3212 # Protein_GI_number: 19703918 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 14 158 1 145 145 244 99.0 4e-65 MKKIKNIGILLFLLISTLSFSYQVNYDDVVDIVLRNYPQSRVTKIEIFNYKDKTVYEGEA FDKGQKIEFIIDVNTGEVFKIAPDYDDKYNPNYNLPITFEQASRIGLDNSFNGKVKSIEL KNINKRAYYIVEVKEDKSEKEIRIDANSGKVIGIKEEE >gi|296155282|gb|ADVK01000010.1| GENE 34 35736 - 36029 469 97 aa, chain + ## HITS:1 COG:FN0584 KEGG:ns NR:ns ## COG: FN0584 COG2350 # Protein_GI_number: 19703919 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 96 1 96 97 144 98.0 3e-35 MKKYVVILSGKKKNTLTEKLLIDHVKHLKEINKKGQLFICGPLVDSDKAIKIINANSMEE ALKIVNADPFTINSYYPDVEVLALEEANEENEYLLKR >gi|296155282|gb|ADVK01000010.1| GENE 35 36080 - 36754 1015 224 aa, chain + ## HITS:1 COG:FN0585 KEGG:ns NR:ns ## COG: FN0585 COG0745 # Protein_GI_number: 19703920 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix 
DNA-binding domain # Organism: Fusobacterium nucleatum # 1 224 1 224 224 388 100.0 1e-108 MKILVVEDEKDLNNIISKHLKKNNFSVDSVFDGEEALEYLSYGDYDVIILDVMMPKMNGY EVVKNLRANKNETAVLMLTARDGIDEKIKGLDLGADDYLVKPFDFRELIARIRALVRRKY GNISNELQIDDLIVDTSKKSVTRAGKNIELTGKEYEVLEYLIQNKGRVLSRDKIRDGVWD YAYEGESNIIDVLIKNIRKKIDLGDSKPLIHTKRGLGYVLKEDE >gi|296155282|gb|ADVK01000010.1| GENE 36 36732 - 38069 1380 445 aa, chain + ## HITS:1 COG:FN0586 KEGG:ns NR:ns ## COG: FN0586 COG0642 # Protein_GI_number: 19703921 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Fusobacterium nucleatum # 1 445 1 445 445 771 100.0 0 MFLKKMNRLLSRIPVSIRVTVWFTAVIVFLFSIVLSSAILLEDRYINNSSTEELVIAVEK IYEDPDEFENFNDGIYYIKYNENNEIIAGKIPKDFDLTLAFSIEDINTYQIENKKFLYYD TRLKNTGDWIRGIYPLNRFQNDISKMWDIVFYYISPLFIAFVAFVGYKIVKNAFKPVKKI SETALAIKKSKNFSRRIELDYSEDEIHKMASAFNEMLDTVEEVFIHEKQFSSDVSHELRT PITVISAQSEYALDYVETIDEAKESFEVINRQAKKMTNLINQIMELSKMERQNEIEKDRI NFSNIILQLLEDYKNLLENNNIELIINIEKDLRIYGNKIMLERLFINLFTNAMKFTKTTI KVSLNRINKEVILQIKDDGVGIAKKDQKYIWDRFFQTNDSRNKDKNKGSGLGLSMVNRIV QLHSATIEVESETGEGSCFIVRFPI >gi|296155282|gb|ADVK01000010.1| GENE 37 38205 - 38381 231 58 aa, chain - ## HITS:1 COG:no KEGG:FN0587 NR:ns ## KEGG: FN0587 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 58 1 58 58 101 100.0 1e-20 MNEKQTKIIGWMGTTLSILMYVSYIPQIIGNLNGNKTSFIQPLVAAINCTIWVSYGLF >gi|296155282|gb|ADVK01000010.1| GENE 38 38411 - 38644 212 77 aa, chain - ## HITS:1 COG:no KEGG:FN0588 NR:ns ## KEGG: FN0588 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 77 1 77 77 91 100.0 1e-17 MGLVYHKNNNIINPNNIISIQFSDKIKTGRQEIKNKDIKLIVNILNIAKYDSDFNDGKSI KMEIKFKFYLKKLLKVV >gi|296155282|gb|ADVK01000010.1| GENE 39 38725 - 39045 378 106 aa, chain + ## HITS:1 COG:FN0589 KEGG:ns NR:ns ## COG: FN0589 COG1733 # Protein_GI_number: 19703924 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: 
Fusobacterium nucleatum # 1 106 2 107 107 187 99.0 3e-48 MKKDLPACPVELTLLLISNKWKVLIIRDLLDGTKRFSELKKSINNISQKVLTSNLREMEE NDLLTRKVYPEVPPRVEYTLTDIGYSLKTLLDDMDKWGTWYRSEVS >gi|296155282|gb|ADVK01000010.1| GENE 40 39098 - 40279 1770 393 aa, chain - ## HITS:1 COG:FN0590 KEGG:ns NR:ns ## COG: FN0590 COG1473 # Protein_GI_number: 19703925 # Func_class: R General function prediction only # Function: Metal-dependent amidase/aminoacylase/carboxypeptidase # Organism: Fusobacterium nucleatum # 1 393 1 393 393 810 100.0 0 MEEKIKKLSEKYLERVMELRRELHQYPELGFDLFKTAEIVKKELDRIGIPYKSEIAKTGI VATIKANKPGKTVLLRADMDALPITEESRCTFKSTHDGKMHACGHDGHTAGLLGAGMILN ELKDELSGTIKLLFQPAEEGPGGAKPMIDEGVLENPKVDAAFGCHVWPSVKAGHIAIKDG DMMTHTTSFDVIFQGKGGHASQPEKTVDPVIIACQAVTNFQNVISRNISTLRPAVLSCCS IHAGDAHNIIPDKLVLKGTIRTFDEGITDQIVDRMDEILKGLTTAYGASYEFLVDRMYPA LKNDHELFVFSKNALEKILGKDCIEVMDDPVMGSEDFAYFGKQVPSFFFFVGINDEQLEN ENMLHHPKLFWNEKNLITNMKTLSQLAVEFLNK >gi|296155282|gb|ADVK01000010.1| GENE 41 40295 - 41815 1734 506 aa, chain - ## HITS:1 COG:FN0591 KEGG:ns NR:ns ## COG: FN0591 COG2978 # Protein_GI_number: 19703926 # Func_class: H Coenzyme transport and metabolism # Function: Putative p-aminobenzoyl-glutamate transporter # Organism: Fusobacterium nucleatum # 1 506 1 506 506 836 99.0 0 METNKKIEKEKLSLLNKMLNKVEVVGNKMPDPTTIFVILCILIFIISFILSKFGVSVEHP GTKEIIKAENLLSSDNLKAVLVSTVKVFQTFPPLGAVLVTMIGIGLADKSGYLEVLLTLS IKKVPKKLIYFTVVFAGLVFTAIGDGGFIVLPPLAAIIFINIKKNPLVGIFLSFAGAAIG FCSGFFVGMNDILLSSFTNPAAQILEPAFQKSPTMTIYFNMANALLQIFIITWVTIKFIE PRFPVNEEHFKENASTEIGDLEKKGVKYASISFLLFVAFIVFLAIGPNAFLKDENGSLIS VNSPLMGGLIFFMSVAFLIPGFVYGKITKKIKSDKDAVKLIATSLSEMGGYILIVFVAAQ FLNLFTKSNLGIIMAIKGANLIKAAGFKGLPLIITYVILVAFINLFIGSASAKWAILSPI FIPMFMLLGYDPALTQMAYRIGDSSTNMISPLFPYVPLLLAVANKYDKNFGLGTLMANML PYSFITLIGSLLLFTAFFIFNIPFGI >gi|296155282|gb|ADVK01000010.1| GENE 42 41997 - 44210 3176 737 aa, chain + ## HITS:1 COG:FN0592 KEGG:ns NR:ns ## COG: FN0592 COG0210 # Protein_GI_number: 19703927 # Func_class: L Replication, recombination and repair # Function: 
Superfamily I DNA and RNA helicases # Organism: Fusobacterium nucleatum # 3 737 1 735 735 1273 97.0 0 MNLNLLEKLNDKQKEAASQIDGSILILAGAGAGKTRTITYRIAHMIENVGISPYSILAVT FTNKAAKEMRERVEDLVGDIAKVCTISTFHSFGMRLLRMYAKEAGYNSNFTIYDTDDQKK IVKAILKGQHISLNGVKLTERDIVSMISKIKEEIKTLDEYSIMNKQIVEVYDKYNRALLE SNAMDFSDILLNTYKLLQKPEILEKVQNKYKYIMIDEYQDTNNLQYKIIDLIARKSSNLC VVGDENQSIYGFRGANILNILNFENNYNNAKIVKLEENYRSTTTILDAANELIKNNKSSK DKKLWTQNGKGDLIKVLACDNGRDEVSRIIEFIRENHQNGVAYRDMTILYRTNAQSRIFE EGLLRYSIPHKVFGGISFYSRAEIKDIIAYLSIIVNPQDELNLQRIINVPKRKVGEKGIE KIITYARENNLNLLEALSHIKEISGLTVVGKEKLLEMYDIIKELKDLSYTETTSYIVQTL IDKIKYIDYIKDNYDDAEARIENIDEFKNSILELENVVGELRLNEYLENVSLISATDDLE EKSDYVKLMTIHNSKGLEFPIVFLVGFENEIFPGTRAMFDEKEMEEERRLCYVALTRAEK KLYLSHATIRFVYGQDRLSTPSVFLKEIPEKLLDIDIKKERLYFVDDYSDEIKTYGNNKK FEKKKTEINTKNTIKLDDNAKKVIDNLGFKIGDKVKHKKFGLGVIKSIDAKKIYVQYVDG TKEMAVILADKLLTKFE >gi|296155282|gb|ADVK01000010.1| GENE 43 44221 - 45054 1183 277 aa, chain + ## HITS:1 COG:FN0593 KEGG:ns NR:ns ## COG: FN0593 COG0774 # Protein_GI_number: 19703928 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-3-O-acyl-N-acetylglucosamine deacetylase # Organism: Fusobacterium nucleatum # 1 277 7 283 283 525 99.0 1e-149 MKRKTLKNIVEYDGIGLHKGEIIKMKLIPAKSGGIIFRMVNMPEGKNEILLDYRNTFDLT RGTNLKNEHGAMVFTIEHFLSALYVLGITDLVIELNGNELPICDGSAIKFLDLFQESGIV ELDEYVEEIIVKEPIFLSKGDKHVIALPYPDGYKLTYAIRFEHTFLKSQLAEFEITEENY RKEIASARTFGFDYEVEYLKQNNLALGGTLENAIVIKKDGVLNPDGLRFDDEFVRHKMLD IIGDLKILNRPIRAHIIAIKAGHLIDIEFAKILDNIK >gi|296155282|gb|ADVK01000010.1| GENE 44 45076 - 45501 660 141 aa, chain + ## HITS:1 COG:FN0594 KEGG:ns NR:ns ## COG: FN0594 COG0764 # Protein_GI_number: 19703929 # Func_class: I Lipid transport and metabolism # Function: 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases # Organism: Fusobacterium nucleatum # 1 140 1 140 141 254 90.0 2e-68 MLDILEIMKRIPHRYPFLLVDRILEMDRENQTIKGKKNVTMNEEFFNGHFPGHPVMPGVL VIEGMAQCLGVLVMESALGKVPYFAAIENTKFRNPVRPGDTLIYDVKVDKVKRNFVKASG KTYVDDAIVAEASFTFVIADA 
>gi|296155282|gb|ADVK01000010.1| GENE 45 45519 - 46292 1139 257 aa, chain + ## HITS:1 COG:FN0595 KEGG:ns NR:ns ## COG: FN0595 COG1043 # Protein_GI_number: 19703930 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Acyl-[acyl carrier protein]--UDP-N-acetylglucosamine O-acyltransferase # Organism: Fusobacterium nucleatum # 1 257 1 257 257 490 99.0 1e-138 MVDIHSTAIIEDGAIIEDGVKIGPYCIVGKDVIIKKGTVLQSHVVVEGITEIGENNTIYS FVSIGKDNQDLKYKGEPTKTIIGNNNSIREFVTIHRGTDDRWETRIGNGNLIMAYVHVAH DVIIGDDCIFSNNVTLAGHVVIDSHAIIGGLTPVHQFTRIGSYSMIGGASAVSQDVCPFV LAAGNTVVLRGLNIVGLRRRGFSDEEISNLKKAYRILFRQGLQLKDALEELEKDFSEDKN VKYLVDFIKSSDRGIAR >gi|296155282|gb|ADVK01000010.1| GENE 46 46292 - 47095 933 267 aa, chain + ## HITS:1 COG:FN0596 KEGG:ns NR:ns ## COG: FN0596 COG3494 # Protein_GI_number: 19703931 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 267 1 267 267 480 99.0 1e-135 MEKIGLIVGNGKFPLYFIEEAKNSNISVYPIGLFPSVDEEIKKIDNYTEFNIGHIGEIIK YLLLRDINKIVMLGKVEKKLIFENLILDKYGEKIMEIVPDNKDETLLFAIIGFIKLSGIK VLPQSYLMKKFIFETKCYTEKKPDVDDEKTISIGIEAARLLSRVDVGQTVVCRDRAVIAV EGIEGTDETLKRAGQYSDKDNILIKMSRPQQDMRVDVPVIGLNTIETAIKNGFKGIVAQA KKMIFLNQKECIELANKNNIFIVGKKI >gi|296155282|gb|ADVK01000010.1| GENE 47 47105 - 48175 1357 356 aa, chain + ## HITS:1 COG:FN0597 KEGG:ns NR:ns ## COG: FN0597 COG0763 # Protein_GI_number: 19703932 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipid A disaccharide synthetase # Organism: Fusobacterium nucleatum # 1 356 1 356 356 639 100.0 0 MKFFVSTGEASGDLHLSYLVKSVKARYKDVDFVGVAGEKSQKEGVEILQDINELAIMGFT EVLKKYKFLKQKAYEYLQYIKDNQIKNVILVDYGGFNVKFLELLKNEIKDIKIFYYIPPK VWIWGEKRVEKLRFADYIMVIFPWEVDFYKKHNINAIYFGNPFTDFYKKVERTGNKILLL PGSRRQEIKAMLPVFEEIINNLKDDKFILKLNSNQDLKYTENFKKYNNIEIIIDKKLKDI VSDCKLSVATSGTITLELALLGLPSIVVYKTTFINYLIGKYILKIGYISLPNLVLNDEIF PELIQKDCEAKNIEKHMKKVLENLPEIEEKIENMRKKVEGKAVVESYADFLVKEGK >gi|296155282|gb|ADVK01000010.1| GENE 48 48172 - 49923 202 583 aa, chain + ## 
PROTEIN SUPPORTED ## NR: gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein [Acinetobacter baumannii AYE] # 359 557 21 216 311 82 27 1e-14 MKILKFKNKSLNIFLGYSYRYKWQMIAVIILSIIASLMSAAPAWLSKKFVDDVLIGQNKK MFMWIIGGIFAATVIKVISSYYSQIASNFVTETIKRKIKIDIFSHLEKLPISYFKKNKLG DTLSKLTNDTTSLGRIGFIVFDMFKEFLTVLVLTARMFQVDYILALVSLVLLPLVIRVVR KYTKKIRKYGRERQDTTGKVTAFTQETLSGIFVIKAFNNTDFAIDKYKDLTKEEFEQAYK TTKVKAKVSPINEVITTFMVLLVVLYGGYQILVAKRITSGDLISFVTALGLMHQPLKRLI SKNNDLQDSLPSADRVVEIFDEKIETDVFGEAVEFNEKIQDIKFENVNYKYDDSNEYVLK NINLDVKAGEIVAFVGKSGSGKTTLVNLLARFFNTDEGSITVNGVNIKNIHLDTYRDKFA IVPQETFLFGGTIKENISFGKEVTEEEIISAAKMANAYNFIQEDLPNKFETEVGERGALL SGGQKQRIAIARALIKNPEIMILDEATSALDSESEKLVQEALDSLMEGRTTFVIAHRLST IVRADKIVVMENGEIKEMGTHSELIAMNGIYKNLYDIQFNENI >gi|296155282|gb|ADVK01000010.1| GENE 49 50341 - 50841 730 166 aa, chain - ## HITS:1 COG:no KEGG:FN0600 NR:ns ## KEGG: FN0600 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 166 1 166 166 251 98.0 9e-66 MKLTLQQAIFTISNLTKKQRRLLDYIRDNYVVPLKVNGKEVFEQVQADEMLKNLSELDLV NQDIVALKDGINVANSENFIENKSLFALLEEVRLKRAVLYDLEYLLKRESTRVENGVGVV QYGILNRNELMEKFNKLENEVNSLSEKIDSVNSKTEIEVKLLSSID >gi|296155282|gb|ADVK01000010.1| GENE 50 50921 - 51391 681 156 aa, chain - ## HITS:1 COG:FN0601 KEGG:ns NR:ns ## COG: FN0601 COG2849 # Protein_GI_number: 19703936 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 16 156 1 141 141 238 98.0 3e-63 MKNKIFILAFSLLLSVSAFSNPVEVRKKDLRVVEKIYYLKDSEVPFTGKVSEGRDRLYYL NGKQDGKWISFYKNGNIKSIVNWKDGKLNGKYVIYENNGRKSTETIYKDGKENGYYYLYN SNGTYRTKGAYSMGKPVGEWEYYDKDGKLTNKVIAE >gi|296155282|gb|ADVK01000010.1| GENE 51 51518 - 52228 1270 236 aa, chain + ## HITS:1 COG:no KEGG:FN0602 NR:ns ## KEGG: FN0602 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 236 1 236 236 441 99.0 1e-122 MKIRSVETALRADVSQNVPNGVDALGIFDNLVQPIFPFPLENLSIILSFSEMEGPTMYQV 
RVNAPNDDLISKGDFGVLPDQFGYGRKVVNLGGILITERGKYSVDIFEIGADNKLKFLQT KRLFNADYPPQREISDAEKEAILADEKLIRMVKTEFKPFEFANDESVKPIKLQISLDNSV PVEEGYIAFPEDDTIEIKGKKFDLTGMRRHVEWMFGRPIPRAEEEIPNEEKKEENK >gi|296155282|gb|ADVK01000010.1| GENE 52 52320 - 53264 826 314 aa, chain - ## HITS:1 COG:FN0603 KEGG:ns NR:ns ## COG: FN0603 COG0583 # Protein_GI_number: 19703938 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 314 1 314 314 550 99.0 1e-156 MFKYIKYIYEIYENQSFSKAAKKLYISQPALSSIIKKAEDELQLPIFDRSTNPISLTEAG EYYIAAAKEIIEIENKIKHKFKELRTSIEDSFSIGGSTFFCTHVLPNLVDAFTDLYSNYS IKVIEANADELSKCLKSDIVDLIIDVEMLDLNIFNSIKWASENIILAVPSQFHINKKLQE YKLSFEQIKTGEYLKEKYKSVNLKYFKDEPFIFLKNGNDMYSRGLRMCKKANYSPNIIMY MDQLLTSYYVSKLGNGISFIRDTITKYEDPTDKLVFYKIDDIEATRNIMLYYKKNIKLSE VTKKFISFITNNIL >gi|296155282|gb|ADVK01000010.1| GENE 53 53439 - 54647 1619 402 aa, chain + ## HITS:1 COG:FN0604 KEGG:ns NR:ns ## COG: FN0604 COG1301 # Protein_GI_number: 19703939 # Func_class: C Energy production and conversion # Function: Na+/H+-dicarboxylate symporters # Organism: Fusobacterium nucleatum # 1 402 1 402 402 611 100.0 1e-175 MKKLALWKQIIIAILLGILVGVFMPNIVPKFKFLGDIFLRLLKMLIAPLVLFTLISGVCK MGDIKQLRTVGMRIVIFYLASSTVSAILGVLIALFTQPGKGVIDLLGTEVGKDVSYNFIE NLIGWVPINIFEALATGNTLQIIFFALLMGVVLLSLGDSVSVVIKIVDQAADAMMKLTEI VMKFAPIGIFALIADLTISLSGNMLKQVLNMILTVYIVLLFIIFIVYPIAIKLFTKESGL KFLKNVTPAMIVAASTTSSAATLPVSLRCANEELGVPENIFGFTLPLGNTCNMNGLAVGL GVISVFASNIYGYPITFVSLLQFVFMGLVLSVGCAGVKGAGIVMSTVLLQTFGMPLTLIP ILAAIWPITDIGYTTANITGDLVGTVVVANSLNSLDKEIFRK >gi|296155282|gb|ADVK01000010.1| GENE 54 54675 - 55886 1593 403 aa, chain + ## HITS:1 COG:FN0605 KEGG:ns NR:ns ## COG: FN0605 COG0436 # Protein_GI_number: 19703940 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Fusobacterium nucleatum # 1 403 1 403 403 766 99.0 0 MSMVEEKFKKLGVENAPGQESLQKGVKLDLKGEKILGEKIDFSHGDVDAHKPLPNSLELF IEGFNKGGIQAYTEYKGNKEIREYIAGKLSEYIKMNIPADNLIITPGTQGALFLATGSLI 
TRGTKVSIVEPDYFANRKLVEFFEGEIVPIELDYFNVNEKKAGLNLKQLEEAFKSGVELF LFSNPNNPTGVIYSDEEITEIARLVNKYNVTVIADELYSRQIFDNREFYHLITKNINQDK LITIIGPSKTESLSGFRLGIAYGSPTIIKRMEKLQAITTLRTAGYNQAVLKSWFSEPDGF MKDRIEKHQAIRDNLIEKFKTIDGIKIRKTEAGSYIFPTLPELEVNLGDFVKILRIYANV IVTPGTEFGKSFINSIRLNFSQDEKKAAEGVDRILEMIRRYKK >gi|296155282|gb|ADVK01000010.1| GENE 55 55883 - 56308 649 141 aa, chain + ## HITS:1 COG:FN0606 KEGG:ns NR:ns ## COG: FN0606 COG0251 # Protein_GI_number: 19703941 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation initiation inhibitor, yjgF family # Organism: Fusobacterium nucleatum # 1 141 1 141 141 249 99.0 1e-66 MKENNKVIPQGKYIPAKRCGNLVFTAGMTPRNNGVLIMEGKIDNNEPLEKYIIAVEQATE NALKAIKNILSKDEVIVDILSLNVYVNSENNFKKHSKLADFASEYLFKELGEIGIASRTS VGVISLPGNAPVEIQIIAAIE >gi|296155282|gb|ADVK01000010.1| GENE 56 56433 - 57014 875 193 aa, chain + ## HITS:1 COG:FN0607 KEGG:ns NR:ns ## COG: FN0607 COG1713 # Protein_GI_number: 19703942 # Func_class: H Coenzyme transport and metabolism # Function: Predicted HD superfamily hydrolase involved in NAD metabolism # Organism: Fusobacterium nucleatum # 1 179 1 179 193 296 98.0 2e-80 MKYNFKELKEIVKSKMSLKRFVHTLGVVEMSEKLAKIYNADIEKCKVAALLHDICKEMDM EYIKNICKNNFMNELSEEDLENNEILHGFAGAYYVKNELEINDKEILSAIKYHTVGAENM TSVEKIVYIADAIEYGRNYPSVVKIREETFKNLDRGILMEIEHKEKYLESIGKKSHPNTD ELKKELLKKEEDL >gi|296155282|gb|ADVK01000010.1| GENE 57 57011 - 59113 1236 700 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15894003|ref|NP_347352.1| fused ribonuclease/ribosomal protein S1 [Clostridium acetobutylicum ATCC 824] # 8 692 5 697 730 480 38 1e-134 MNLEKDLEKIKELLKDVKYLTLDQITSFLEWSPKKKKDNKIIILSWIDSGDLIMDKKHRL SLPENSGYVKGVFRIIKNKFAFVDKEDSEEKEGIFIPKEEFNNALDGDTVLVEITEKKKS DKGAEGRVVKIIEHRKNTVVGILEKSKNFAFVIPTGSFGKDIYIPNSKIANADNKDLVAV EITFWGDDDRKPEGKIIKILGSSTNSKNMIDALIYREGLSEKFSNEAMQQTKEIIKEEID YSNRKNLTNLSIITIDGADAKDLDDAVYVEKLENGNYKLIVAIADVSHYVKKETVLDLEA RHRGNSVYLVDRVLPMFPKEISNGICSLNEKEEKLTFSCEMEIDLKGDVVNYEVYKSVIK 
SVHRMTYKDVNAILDGDKDLINEYSDIYEMLKQMLELSKILRIKKFTRGSIDFELPELKV ILDEDNNKVEKVLLKDRGEGEKIIEDFMIAANETVAERIYWLELASIYRTHEKPDREKIV TLNEILSKFGYKIPNFDNLHPKQFQEIIERSKDKETSMLVHKTILRALKQARYTVEDIGH FGLASSHYTHFTSPIRRYSDLMVHRVLFSTIDNSIKPFKEADLDEIAQHISKTERVAMKA EDESVRIKLVEYMQKRVGETLNVMVTGFAQKKIFFETDEHIECSWDITTAINYYAFDEEN YCMRDTDSDKVFYLGDKVDVVLKKADLLTLEIAVVPLDDF >gi|296155282|gb|ADVK01000010.1| GENE 58 59130 - 59576 680 148 aa, chain + ## HITS:1 COG:FN0609 KEGG:ns NR:ns ## COG: FN0609 COG0691 # Protein_GI_number: 19703944 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: tmRNA-binding protein # Organism: Fusobacterium nucleatum # 1 148 1 148 148 238 98.0 3e-63 MIIANNKKAFFDYFIEEKYEAGIELKGSEVKSIKAGKVSIKEAFVRIINDEIFIMGMSVV PWEFGSVYNPEERRVRKLLLHRKEIKKIHEKVKIKGYTIVPLDVHLSKGYVKIQIAIAKG KKTYDKRESIAKKDQERNLKREFKSNNR >gi|296155282|gb|ADVK01000010.1| GENE 59 59645 - 63151 4341 1168 aa, chain + ## HITS:1 COG:no KEGG:FN0610 NR:ns ## KEGG: FN0610 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 14 1168 1 1155 1155 2047 98.0 0 MKRKFIFLLILSAMIINNSAYSENATADSDEVVIDLNDNTMTAERGVVVTNGNMKGLFYR FQRDPVTGEITFTNNALMNISQPTGSIKIETEGGKVSQKDEKGEFYNSFAYINVAKMTGA EAPNDKIYFGSPYIKYEDEKIYAKDAWVTTDFNIVNFQKEPQKAGYHIFSSDVIVEPDKQ ITLKNSDLFIGEKDVMPFTFPWFRANIRSGSTVPLFITIQSDDDYGAATSMGFLYGNRRD KFRGGFAPKFADKMGILVGRWENWYRFDNVGETRLNIDDWLIYAKEKSEPKDTNELPDYE KRHKRYKVELTHDYDGENGSFHFISQNSTRSMVGNLKELMDKYDNNNVYSSLGLDRFKFD KNIGFYNLDANLFNLGEAKDLSFTGKMSLVSDKKTYGLLVYDKIDDISYGSTIDHDLYTN LSLTKDNDKFKLNARYDYLYDMDPGSTASDLMSRNERIGADFLLKENGLNISYDKRRGDD YRNFSFWEEDINTSARKRNILGVDFSYTPTTVAKYEFNNFENIKASLGNYKVGNYTFTPS VSYNFLDRKLDTAKDTYRTTVLGSNRLAEFNRFENIIYNNTLERRADLNLSNDNETYRVG FGKTTSEIWSREGLFDGTYRKYENKSKFYEVQLGRQNLPLGSVGTFGIDGTFRQDKFDGS SDTTNLINLKLGNDLYLYKTENLDVTNKFKAEIQKYSFSGNKNNEEVRLITKSDYIKFDN SLVFDGKSTATTYDIGYKSSKNPYGVKNKNGEQFTTGLGIKFNEDTNLSLKYIDDKRFTS KTKSGKNVNDLFMKQYSVNFETKKYDLGFSNTDIDFVGDDFSTTTDFREDINEHRIRAGY 
KFDNSKLSLSYAEGKDKLKTDDGRYLDRKNRMYSVAYNIYGDVEQDFIGAFKTYRYGNNR IEDDIRNTDVYSFSYAYRDKRFEQEELMRYATLEYEKPKDQITNDEIEQIRAILDRKNSF HNQFELTRIQDETFRIGNYKKTLSAYVNFEKNKKRYSQTGNLKDSLSNFSGGLTVSYNRL GVGYTFTQKASWKNSGGSYKWSKDTKEHELSVYAKIGKPSQGWKIKTYAMFYDNKNDPTS SKNRKRSVDSIGVEIGKEMGYYEWAISYENRYKTSSRDYEWRVGVHFTLLTFPNNSLFGI GAKNRGGTTSTKPDGYLLDRPSQLKNSY >gi|296155282|gb|ADVK01000010.1| GENE 60 63177 - 65090 2789 637 aa, chain + ## HITS:1 COG:FN0611 KEGG:ns NR:ns ## COG: FN0611 COG0441 # Protein_GI_number: 19703946 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Threonyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 18 637 1 620 620 1252 99.0 0 MLVKYNGENKEYNNNINMFEIAKGISNSLAKKSVGAKVDGKNVDMSYILDHDAEVEFIDI DSPEGEDIVRHSTAHLMAQAVLRLYPDTKVTIGPVIENGFYYDFDPVEQFTEEDLEKIEA EMKRIVKENIKLEKYVLPRDEAIEYFRDVDKNKYKVEVVEGIPQGEQVSFYKQGDFTDLC RGTHVPSTGYLKAFKLRTVAGAYWRGNSKNKMLQRIYGYSFSNEERLKHHLKLMEEAEKR DHRKLGKELELFFLSEYGPGFPFFLPKGMIVRNVLIDLWRREHEKAGYQQLETPIMLNKE LWEISGHWFNYRENMYTSEIDELEFAIKPMNCPGGVLAFKHQLHSYKDLPARLAELGRVH RHEFSGALHGLMRVRSFTQDDSHIFMTPDQVQDEIIGVVNLIDKFYSKLFGFEYEIELST KPEKAIGSQEIWDMAEAALAGALDKLGRKYKINPGDGAFYGPKLDFKIKDAIGRMWQCGT IQLDFNLPERFDVTYIGEDGEKHRPVMLHRVIYGSIERFIGILIEHYAGAFPMWLAPVQV KVLTLNDECIPYAKEIMNKLEELGIRAELDDRNETIGYKIREANGKYKIPMQLIIGKNEV ENKEVNIRRFGSKDQFSKSLDEFYTYVVDEATIKFDK >gi|296155282|gb|ADVK01000010.1| GENE 61 65103 - 65621 899 172 aa, chain + ## HITS:1 COG:no KEGG:FN0612 NR:ns ## KEGG: FN0612 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 172 1 166 166 288 100.0 9e-77 MKKILLMSLLCLAIIACGKKEKAKQEIAETTNVTQEQNVEVPNPFVEVKTLDEASKIAGF TLEVPVTYEDYKKQVIQAIENDMIEVIYLEEESGYEGLRIRKAKGTDDISGDYNEYKNVE TMKVGDYDVTEKGDEGNIFIVTWTDGTYSYAIDTDRAELSKEDVANLISNIK >gi|296155282|gb|ADVK01000010.1| GENE 62 65759 - 66190 564 143 aa, chain + ## HITS:1 COG:no KEGG:FN0613 NR:ns ## KEGG: FN0613 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 143 1 143 143 219 99.0 4e-56 
MKKILLILLICLATIVNGVPNPFIKVNTMDEAFKMTGFTLETPATYKNYKKKVINVIKNK MIEVVYLKESNTEGLFIRKSKGTYKTNKDIKTVKIGDYDVREKTKEENISLATWTDGTYS YVINPNGTKLNAEDMAELILSIK >gi|296155282|gb|ADVK01000010.1| GENE 63 66338 - 68053 188 571 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 337 556 2 226 245 77 26 4e-13 MKQKSNFTFLLSYAKNEKYKLYLSAFLSVCSSILMVVPYILIYNIILELLKTDLDYNRIK HLAIYTAILIVVRLILFILSGVFSHVAAFNILYNIRMQTVKHLGNINLGYFRERNIGEIK KAINEDVEKLENFLAHQIPDLAAAITTPAVLLIFLFFLEWRIAIFLIIPIILAILAQIAM FKDYGKRLDNYNSLLQRLTSTITQYIKGMNVFKAFNLTAHSFKKYIDVNNEYTENWHSMT DDFKNPYGIFLAVVDSALIFVIPSGGYLYLTDKINISTLLIFLLLSYTFLTSLKTLMQFA GTFSFVLAGANNVRSMIEFPIQNDGKNLKDINFKEDISFNDVTFSYDKNDVLKNINLILR PNTITALVGPSGSGKTTIAYLLGRFWDIQKGSIKIGDIDIKDIDVNYLLSNISYVFQDIF MLTDTIFENIKMGLDKRKEEIYQAAKDAEIHEFIMSLPNGYDTIIGDGYIKLSGGEKQRI SIARCLLKNSPIVVLDEITAYSDIENEAKIQSAIRNLLKDKTAIIIAHRLYTIKDVDNIV VLNEGEIVESGKHQDLITKENGLYKHLWEVK >gi|296155282|gb|ADVK01000010.1| GENE 64 68057 - 69781 2231 574 aa, chain + ## HITS:1 COG:FN0615 KEGG:ns NR:ns ## COG: FN0615 COG1132 # Protein_GI_number: 19703950 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase and permease components # Organism: Fusobacterium nucleatum # 1 574 1 574 574 985 99.0 0 MLNNLKILLDKDYTPVKKATYYQLLDILFNMIIYTILFLTIYSLIEKSFTMNKIYWYSGL LLIALIFKSHFGGCGMVKMQKTGSTASKNLRIAMGDHVKKLNLGYFNSHNLGYLINILTM DITDFEQAITHNIPDLLKVLVLSVYLLLITFFINFKLAIIQIIVVLLTIPILKVGGEKLE KIGVEKKTVSAKLISTIIEYISGIEVFKSFGVIGDKFERLEKGFRDLKKYSIKLELIAVP YVLLFQVIIDLLFPILLLLAVRFFMNGELEAKMLVGFIVLSLTLTNVIRNFSVSYSITRY LFLSVAKISDTLNYPTISYKDEDFNFSSYDISFEDVDFSYTEDRKVLKDINFTAKNNEIT ALVGKSGSGKSTIMSLIARFWDTTKGSIKIGGKDIKEVNPDSLLKNISMVFQDVYLINDT IYENIRIGNLNASEEEIMNAAKIANCHDFISKLPKGYDTYIGEEGSTLSGGEKQRISIAR ALLKNSPIILLDEATASLDADSEHEIKMAINELIKDKTVIIIAHRLNTIKDANKIIVMDD GKIIESGNHEKLMNDKGTYYSMFTAMEKAKEFSI >gi|296155282|gb|ADVK01000010.1| GENE 65 70040 - 71584 1776 514 aa, chain - ## HITS:1 COG:no KEGG:FN0616 NR:ns ## KEGG: FN0616 
# Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 19 512 1 494 495 912 98.0 0 MLFCLFGSMLFAAQKTTKMVKNDYDLKFNPDKYISKEIEINGQKIKYRAYENIIYIKNPI DKDYQNMNIYIPEEYFNNLSIGSYNSNNAPIFFPNTVGGYMPGKADTVGLGRDGKANSLT YALSKGYVVAAPGARGRTLTDDKGNYIGKAPAAIVDLKAAVRYLYLNDEVMPGDANKIIS NGTSAGGALSALLGASGNSQDYLPYLKEIGAAETRDDIFAVSAYCPITNLENADSAYEWM YNGVNSYSRMEFTRNTSAQEYNDRSLTRSTVQGNLTNDEINISNKLKTLFPIYLNSLKLT DDGGNLLTLDKSGNGSFKTYLSIIIRNSANRALREGKDISQFKKAFTIENNKVVAVNLDV YTHIGDRMKSPPAFDSLDASSGENNLFGDKKSDSKHFTKFSFDINNKAAIDYFRNGKFND KNNKISVPKMADKNIIKMMNPMYYIDSNTSTKYWRIRHGAVDKDTSLAIPAILALKLKNS GKVVDFASPWGQGHGGDYDLEELFNWIDGVVKNN >gi|296155282|gb|ADVK01000010.1| GENE 66 71621 - 72715 1261 364 aa, chain - ## HITS:1 COG:FN0617 KEGG:ns NR:ns ## COG: FN0617 COG0592 # Protein_GI_number: 19703952 # Func_class: L Replication, recombination and repair # Function: DNA polymerase sliding clamp subunit (PCNA homolog) # Organism: Fusobacterium nucleatum # 1 364 1 364 364 632 99.0 0 MKFSINRQKTIEIIGEYSNILKDNPVKPSLAGLFIQAKNNQVVFKGANTEIELIRYANCE IESEGQVLIKPALLLEYIKLLEGENINFEKKDGYLIVNNAEFSILDDNTYPELTEIIPIV IASENTVKFTMSLEKVKFLTNSSTSTDTLFNSIKMIFKDNVLELVSTDSFRLIYMKKELN NMINKDVLVPGDSIAVLYKIFKDLDEEFSLAASDDKLIVTWKDAYFTCKLLSLSFPDFRP LINNTNHDKRFEFNRDELNLALKKVISVTKNSSDSKNVATFNFKGNQLLISGVSANAKIN QKVNMIKTGEDLKLGMNCKYIKDFIDNVDKNIIIDATNSSSMLRFMEEGNENYIYLIMPV NIRV >gi|296155282|gb|ADVK01000010.1| GENE 67 72730 - 73758 1473 342 aa, chain - ## HITS:1 COG:FN0618 KEGG:ns NR:ns ## COG: FN0618 COG0687 # Protein_GI_number: 19703953 # Func_class: E Amino acid transport and metabolism # Function: Spermidine/putrescine-binding periplasmic protein # Organism: Fusobacterium nucleatum # 1 342 1 342 342 648 100.0 0 MKKIFLLFLVTIILVSCGDSKDENTLYVYSWADYIPQFVYSDFEKETGIRVIEDIYSSNE EMYTKIKAGGEGYDIIMPSSDYYEIMMKEDMLAKLDKSQLENVKNIDDSYMAKLREFDPE NDYGVPYMRGITCIAVNKKFVKDYPRDYTIYNREDLAGRMTLLDDMREVFVPALALNGYK QDADSTEAMEKAKSTILNWKKNIAKFDAESYGKGFANGDFWVVQGYPDNIYRELSEEDRE 
NVDFIVPPGEQGYSSIDSFVVLKDSKHLENAMKFINYIHRPDVYAKISDFIEIPSINTEA DKLITKKPLYDVEKTKNAQLLIDIGDKLNIQNKYWQEILIAN >gi|296155282|gb|ADVK01000010.1| GENE 68 73829 - 74698 1026 289 aa, chain - ## HITS:1 COG:FN0619 KEGG:ns NR:ns ## COG: FN0619 COG0668 # Protein_GI_number: 19703954 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Small-conductance mechanosensitive channel # Organism: Fusobacterium nucleatum # 9 289 1 281 281 517 99.0 1e-146 MNSTFYEKMLENLLVNLEHYLPMLAGKLVAFLVICFIWPKLTKFLVKSFEKAMSLRNNDP LLISFLKSLIKTIMYIILAFILVGILGVRATSLVTILGTAGVAVGLALQGSLSNLASGIL ILFFKQVSKGDFVSSLDKNIEGTVQSIHILYTIIQQPNGPVIIVPNSQIANASIINYSKN PFRRLDLVYSASYDDPVDKVISVLHQVIENEPRIIKNNPSMPITISLTKQNASSLDYMFR AWVRKEDYVDTMLDCNINVKKFFDKNGIEIPYNKLDLYMKNNLDIDNKQ >gi|296155282|gb|ADVK01000010.1| GENE 69 74824 - 76128 1616 434 aa, chain - ## HITS:1 COG:FN0621 KEGG:ns NR:ns ## COG: FN0621 COG0427 # Protein_GI_number: 19703956 # Func_class: C Energy production and conversion # Function: Acetyl-CoA hydrolase # Organism: Fusobacterium nucleatum # 1 434 1 434 434 857 100.0 0 MKNWKESYKSKICTPDEAIQKIKGAKRISFGHICSESTVLTEALVRNKKLFKKLEIAHLL SVGKSEYAKEENSEYFRHNALFIGPKTREAANSSYGDYTPTFFFETAKLFQKDEELALDA MLLQVSTPDEHGYCSYGLSCDYTKSATENAKIVIAQINKFVPRTLGNCFVHIDNIDYIIE EDTPIPEVQPPVVGEIERKIGGFCASLVRDGDTLQLGIGAIPVAVLNFLKDKKDLGIHSE MISDGIVDLINLGVITNKKKNLNPNKAIATFLMGSKKLYDYANDNPAIELHPVDYVNNPI IIAQNDNMVSINSAIQVDLMGQVNAEYIDSKQFSGPGGQVDFVRGATMSSGGKSIIALPS TTGKGTISRIVFTFDEGVPVTTSRNDVDYVITEYGIAHLRGKTLRERAKLLIEIAHPDFR EELRKKALEKFGEL >gi|296155282|gb|ADVK01000010.1| GENE 70 76347 - 77000 874 217 aa, chain + ## HITS:1 COG:FN0622 KEGG:ns NR:ns ## COG: FN0622 COG1059 # Protein_GI_number: 19703957 # Func_class: L Replication, recombination and repair # Function: Thermostable 8-oxoguanine DNA glycosylase # Organism: Fusobacterium nucleatum # 1 217 1 217 217 384 99.0 1e-107 MKKNEYFNEIEKIYKEMSSHFKERLKEFKNIWENGTNKDIHLELSFCILTPQSKALNAWQ AITNLKKDDLIYNGKAEELVEFLNIVRFKNNKSKYLVELREKMMKDGKIITKDFFNTLPT 
VVEKREWIVKNIKGMSYKEASHFLRNVGFGENIAILDRHILRNLVKLEVIDELPKTLTPK LYLEIEEKMRDYCEFVKIPMDEMDLLLWYKEAGVIFK >gi|296155282|gb|ADVK01000010.1| GENE 71 77021 - 77977 1113 318 aa, chain + ## HITS:1 COG:FN0623 KEGG:ns NR:ns ## COG: FN0623 COG0679 # Protein_GI_number: 19703958 # Func_class: R General function prediction only # Function: Predicted permeases # Organism: Fusobacterium nucleatum # 1 318 1 318 318 503 98.0 1e-142 MGDMENFLLAFNVVFPIFLIMMLGVILKRKNMVDEKSLNVMNSLIFRLFMPTLLFFNIYN MGDLSTLSFDNLKLLAYAFISILIVLLLAWLIYMPNVKDRKKLSVLIQGVYRGNFVLFGL AIADSLYGKESLGTVSLLTAIVIPTFNVIAVILLEYYSGNEVNKIKLIKQVFKNPLIIAT LTAIVFLVLKINIPKPVYKAIGDISKIATPLAFLVLGAGLKFGNILKNLKYLISVNILRL IGNPLITVGLGKLLGFQGIELVALLSMSACPTAVVSYTMAKEMNADGDLAGEIVATTSML SIFTIFCWVLMLKNLEWI >gi|296155282|gb|ADVK01000010.1| GENE 72 78014 - 79336 1715 440 aa, chain - ## HITS:1 COG:FN0624 KEGG:ns NR:ns ## COG: FN0624 COG1757 # Protein_GI_number: 19703959 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 1 440 5 444 444 705 99.0 0 MKSELKEKKYGAFSFLPLIVFLALYIGSGIFFTLIGAEGAFKKFPRHVALLAGIIVALLM NRGLKLEKKIDIFSENAGNPGVILIGLIYLLAGGFQGAAKAMGGVESVVNLGLTFIPSIF LVPGVFLISCFISTAIGTSMGTVAAMAPIAIGVAQAANLNVPLTAAAVIGGAYFGDNLSM ISDTTISAAKGVGSEMKDKFKMNFFIALPAAIFAAIMYGIMGGNGSITGEYNYHIIRVLP YIVVLITALIGFNVSGVLVLGIAMTGVIGLLEGNITFLDWIGAIGEGMSDMFSITIVAIL ISGLIGLVKYYGGIEWLVNSIISKIKSRKNAEYGISLISGLLSAALVNNTIAIIITAPIA KEIGQKYNIVPKRLASLIDIFACAFIALTPYDGGMLMITALVDVSPLEVLKYSFYIFALI VTTCITIQFGLLRTKEEKNN >gi|296155282|gb|ADVK01000010.1| GENE 73 79338 - 80534 1527 398 aa, chain - ## HITS:1 COG:FN0625 KEGG:ns NR:ns ## COG: FN0625 COG1168 # Protein_GI_number: 19703960 # Func_class: E Amino acid transport and metabolism # Function: Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities # Organism: Fusobacterium nucleatum # 1 398 1 398 398 741 97.0 0 MEKEKFLKEYLVERKGTNSLKWDALDKRFGNPDLISMWVADMEFKTPKEMVEALKERVEH 
GVFGYSYVSDDYYNAVIKWHKEKHNYEIKKEWLRFSTGVVTAIYWFINIFTKVNDSVLIL TPVYYPFHNAVKDNNRKLITCDLKNTNGYFTIDYEEVEKKIVENNVKLFIQCSPHNPAGR VWKEEELSKILEICKKHNVLVISDEIHQDIVMKGYKHIPSAIVESGKYADNLITISAASK TFNLAGLIHSNIIISNDKLRKKYDEEIKKINQTECNTLGMLATQVGYEKGEYWLENIKEL IEGNFNYLKSELNKNIPEIIITNLEGTYLVFLDLRKIIPIDKVKEFIQDKCNLAIDFGEW FGKNFKGFIRMNLATDPQIVKKVVENIISEYKKLRSDR >gi|296155282|gb|ADVK01000010.1| GENE 74 80554 - 81081 610 175 aa, chain - ## HITS:1 COG:FN0626 KEGG:ns NR:ns ## COG: FN0626 COG4283 # Protein_GI_number: 19703961 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 175 11 185 185 295 94.0 3e-80 MKEYTSKVELTSTIKASYQKYIDEFENISEDLKDKRFEEVDRTPAENLAYQVGWTTLLLK WERDEKAGLEVYTPSENFKWNNLTGLYKWFNKEYSYLSLAELKSILNKNISDIYKMIDEM SEDKLFKPHQRKWVDNSNKTAVWEVCKFIHINTVAPFGTFRTKIRKWKKLLLQKN >gi|296155282|gb|ADVK01000010.1| GENE 75 81174 - 82202 1213 342 aa, chain - ## HITS:1 COG:FN0627 KEGG:ns NR:ns ## COG: FN0627 COG2222 # Protein_GI_number: 19703962 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted phosphosugar isomerases # Organism: Fusobacterium nucleatum # 7 342 1 336 336 634 99.0 0 MSKYKEMLKFNEDEYRKSAELIIEAYPIAKKVADEISVEGYENIFFSAVGGSLAPMMAIG EIAKQISKKPIFIEQAAELLTRGHKSLSHNSILITLSKSGDTKETVAMAKYAKENGIRVI SLTKELNSPLALNSNYVIPMRHENGVEYEYMLLFWLFFRLMENNGDFSEYEKFAEQLKKL PENLLEAKYKFEPIAKEIGKKYYKEPYMIWIGSGETWGETYLFSMCLLEEMQWIKTKSVT SSEFFHGTLELIEKDTCVFLIKSSGKTRILDDRAEKFLKNYTEKLTVIDTQDFKLEGIDE KYRWIIAPTIASTILVDRLAFHFEDNTKHSLDIRRYYRQFNY >gi|296155282|gb|ADVK01000010.1| GENE 76 82218 - 83276 1102 352 aa, chain - ## HITS:1 COG:FN0628 KEGG:ns NR:ns ## COG: FN0628 COG0449 # Protein_GI_number: 19703963 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains # Organism: Fusobacterium nucleatum # 1 352 1 352 352 559 99.0 1e-159 MKETMLTNILEEEKIFKNIISNFETKNYLLIKELIKIELKNILILATGSSMNAALITKYF 
ISDILNINIEIKEPFNYYNYEKINENIDLVIAISQSGKSASTISALKYVKKCKNIPSIAI TSNNMSIIAKESNMILDLGIGIEQVGFVTKGFSATVLNLFLLAIILAKEKKLISANQKEV YLKELNIIIENIPQVILKTENFIKEKKEIFLNAKRFIGIGYGACFGLVKEFETKFTETIR LPSQGFELEAYMHGPYLEANKEHIIFYFDNKGKLSQRLFLLKNYMMPYIKQSFVIALDNG DINLNLNLNEHLATLLLIIPIQIMSYRIAEIKEIDLNIKIFEDFDKILKSKI >gi|296155282|gb|ADVK01000010.1| GENE 77 83286 - 84110 1202 274 aa, chain - ## HITS:1 COG:FN0629 KEGG:ns NR:ns ## COG: FN0629 COG3716 # Protein_GI_number: 19703964 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID # Organism: Fusobacterium nucleatum # 1 274 6 279 279 498 100.0 1e-141 MKIFNNIENKEMNKELDRVFWRSFQMEFAWNYERQMNLGYVYAMIPVLKKIYANDKEGLK KALKRHLEFFNMTPHIVTLMLGISSAMEKENSESENFDENSINNIKTALMGPLSGIGDSF FWGTLRLLATGIGTALSLQGNILGPILFLLIFNIPHIFIRYIFTKLGYKLGIEFLNKLEK NGIMEKLTFGASILGLTVIGAMIARMIEISTPLILGSENNPIAIQGILDDIMPGILQLGI FGIVYYLLGKKVKPLTILLGMAIVGILGSFIGIF >gi|296155282|gb|ADVK01000010.1| GENE 78 84097 - 84876 1229 259 aa, chain - ## HITS:1 COG:FN0630 KEGG:ns NR:ns ## COG: FN0630 COG3715 # Protein_GI_number: 19703965 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC # Organism: Fusobacterium nucleatum # 1 259 1 259 259 388 99.0 1e-108 MLQALLLGLIAFVAQSEFALGTSLISRPIVTGLFTGLVMGDIKAGLIMGATLELAFIGSF SIGGSIPPDVVTGGILGVAFSIASKTGIETVLLLALPIATFTLILKNVYLGLLIPMLSHK ADIYAEEGNTKGIERMQILSGLGLSFMLAAIVFFSYLLGSNVISSILNAIPDFIQRGLAV ATGIIPALGFAMLAKLLINKTVIPYLFLGFAIAIYSQIPLTGIAIMGAILSVIIVNITNG IELRYKTKNQSEVENDEDF >gi|296155282|gb|ADVK01000010.1| GENE 79 84887 - 85348 554 153 aa, chain - ## HITS:1 COG:FN0631 KEGG:ns NR:ns ## COG: FN0631 COG3444 # Protein_GI_number: 19703966 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB # Organism: Fusobacterium nucleatum # 1 153 1 153 153 233 97.0 
8e-62 MILLLRVDHRLLHGQVVFSWIQNLKADCILIANDSVATDELRKSTLKLAKPQEIKLVIKN INDTIDSINSGITDKYKLLIIVESIEDAYKITKETKQIKQINLGGIKPRENSKNISKTIN LLENEELMLKDLIKNGIEIEIRQLATDNKILYK >gi|296155282|gb|ADVK01000010.1| GENE 80 85361 - 85786 529 141 aa, chain - ## HITS:1 COG:FN0632 KEGG:ns NR:ns ## COG: FN0632 COG2893 # Protein_GI_number: 19703967 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, mannose/fructose-specific component IIA # Organism: Fusobacterium nucleatum # 1 141 4 144 144 218 99.0 2e-57 MYQFIIATHGLFAEGIKNSIEIILGKFENLSTLSCYTDSNFNLKKEIDEILKKYNNKEVI VITDIFGGSVNNLFMEEIPLNKNIHLITGLNLPLVLNLLGEQENYLIPEELIQNSMEISS DAVKYCNLELIKTSKNEDEDF >gi|296155282|gb|ADVK01000010.1| GENE 81 85811 - 85906 115 31 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNSITHIFLEEKLYLTLKAKNLTSFFLCVIF >gi|296155282|gb|ADVK01000010.1| GENE 82 85997 - 87241 1076 414 aa, chain + ## HITS:1 COG:no KEGG:FN0633 NR:ns ## KEGG: FN0633 # Name: not_defined # Def: replication protein # Organism: F.nucleatum # Pathway: not_defined # 1 414 1 414 414 589 98.0 1e-167 MKKIIDLIEEPSVFDIKIEYTKKFSKNDIYFKQFLIKKIIKTNEKIFYLTEKEAKKILIF PHGENFDIFLKKFCSKRLIIKYKKSEEEYYELILNIISSVLKNKNTYVLKVSEDFYKIFN SEKNDFKFYKLNIFLGFSNIITRKLFNLIKNIYNELSIEISLDNLRRYLNLEESYERFFD FEKKILIPSLKEIENFTSYKILYSKIKNSISTNARVKAIRFNIIQNSDDKSENDISILYK LVKPFAQNLFTLQKFISYQANFYSYQYLKKNIEYSLLHGKNNLDSFLVEAIKYDWVNTKF KEKLQEYSKKYSLIFKLSQKIIIIEEFRKVILKSIEKNELDKELSLILNFMKISARIFEN NIYKDNNLLKNKVYLTFYKNLQEVNECIFEDEKTIILAEFNQNCSNSNLAIFKK >gi|296155282|gb|ADVK01000010.1| GENE 83 87309 - 89126 2895 605 aa, chain - ## HITS:1 COG:FN0634 KEGG:ns NR:ns ## COG: FN0634 COG1217 # Protein_GI_number: 19703969 # Func_class: T Signal transduction mechanisms # Function: Predicted membrane GTPase involved in stress response # Organism: Fusobacterium nucleatum # 1 605 1 605 605 1181 99.0 0 MKIKNIAIIAHVDHGKTTLVDCLLRQGGVFKTHELEKVEERVMDSDDIERERGITIFSKN ASVKYKDYKINIVDTPGHADFGGEVQRIMKMVDSVLLLVDAFEGPMPQTKYVLKKALEQG 
HRPIVVVNKVDKPNARPEDVLYMVYDLFIELNANEYQLEFPVIYASGKTGFARKELTDEN MDMQPLFETILEHVQDPDGDVTKPTQFLITNIAYDNYVGKLAVGRIHNGTLKRNQDVMLI KRDGKQVRGKVSVLYGYEGLKRVEIEEAEAGDIVCVAGIDDIDIGETLADINEPIALPLI DIDEPTLAMTFMVNDSPFVGKEGKFVTSRHIWDRLQKEIQTNVSMRVEATDSPDSFIVKG RGELQLSILLENMRREGFEVQVSKPRVLFKEKDGKKLEPIELALIDVDDSFTGTVIEKMG VRKAEMVSMVPGQDGYTRLEFKVPARGLIGFRNEFLTDTKGTGILNHSFFDYEEYKGDIP TRNKGVLIATEPGVTVPYALNNLQDRGTLFLDPGIPVYEGMIVGEHNRENDLVVNVCKTK KLTNMRAAGSDDAVKLATPRKFTLEQALDYIAEDELVEVTPTNIRLRKKILKEGDRRKNW SATNK >gi|296155282|gb|ADVK01000010.1| GENE 84 89149 - 90012 1195 287 aa, chain - ## HITS:1 COG:FN0635 KEGG:ns NR:ns ## COG: FN0635 COG0130 # Protein_GI_number: 19703970 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Pseudouridine synthase # Organism: Fusobacterium nucleatum # 1 287 1 287 287 475 94.0 1e-134 MEGIIVVNKPKGITSFDVIRKLKKILKTKKIGHTGTLDPLATGVMLVCVGKATKLASDLE AKDKVYIADFDIGYATDTYDIEGKKIAENTIEISKENLEQSIKKFIGNIKQIPPMYSAIK IDGNKLYHLARKGIEVERPERDVTIEYINLLDFKDNKAKIETKVSKGCYIRSLIYDIGQD LGTYATMITLQRKQVGDYSLENSYSLEQIEEMTLNDDFKFLKTIEEIFSYDKYSLQTEKE LTLYKNGNTVKIKENLENKKYRIYFQDEFIGLANIENNNLLKGYKYY >gi|296155282|gb|ADVK01000010.1| GENE 85 90269 - 90628 418 119 aa, chain + ## HITS:1 COG:no KEGG:FN0636 NR:ns ## KEGG: FN0636 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 119 1 119 119 228 94.0 6e-59 MEINRLNTSPIDDKYWEVLEEYSYETSKGLIVVPKGFKTDYASVPKIFRNIINTYGKHGR AAVVHDWLYSSQCKIDVTRAEADKIFLEIMVEWNVKKYKRILMYVLVRIFGESHFRKGD >gi|296155282|gb|ADVK01000010.1| GENE 86 90685 - 91125 524 146 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296327540|ref|ZP_06870086.1| ## NR: gi|296327540|ref|ZP_06870086.1| cell division protein FtsL [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] cell division protein FtsL [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 146 1 146 146 236 100.0 5e-61 MRKNFILTILMFLFINILSIAVENPTNISDSTGLVDPNFQEALKDYKPNLENIDKMFNYI EKNIKEKGRAVFYSKLEKEKNEVIVVDENNNIIYTKKIPEKLAEQTPYFEVKQIYQLKNG KTLGYSEMNTEILGKKVLYKNGKILK >gi|296155282|gb|ADVK01000010.1| GENE 87 91142 - 91501 389 119 aa, chain - ## HITS:1 COG:no KEGG:FN0638 NR:ns ## KEGG: FN0638 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 119 1 119 119 172 99.0 3e-42 MNKEILDLVEKILTFLKVEDYNKLKNILNIIEKDYPNYYKFFEKFKDRNLIEKISDVFGS PTFGGGPLILLGKKLEQEEKQKEVVLKKGIFKNEIKEILKNYFNPDEEKTFLEFLLEKL >gi|296155282|gb|ADVK01000010.1| GENE 88 91519 - 91623 57 34 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MINNVQESNRAKLPPLPFNRNIFHIRQYKFRKNN >gi|296155282|gb|ADVK01000010.1| GENE 89 91626 - 91955 382 109 aa, chain - ## HITS:1 COG:FN0639 KEGG:ns NR:ns ## COG: FN0639 COG4997 # Protein_GI_number: 19703974 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 98 1 98 98 117 93.0 7e-27 MKKEIIYNKLIRDNILEIISNNNQKSSYHIATDEEYKNKLLEKLQEEICEFITDKNEEEL ADILEVIEHIITAFNFNKKRILEIREKKAKKNGKFNKKIILEKVFNMEG >gi|296155282|gb|ADVK01000010.1| GENE 90 91967 - 92848 922 293 aa, chain - ## HITS:1 COG:FN0640 KEGG:ns NR:ns ## COG: FN0640 COG1266 # Protein_GI_number: 19703975 # Func_class: R General function prediction only # Function: Predicted metal-dependent membrane protease # Organism: Fusobacterium nucleatum # 1 293 1 293 293 432 100.0 1e-121 MTNKFQSYVDNIEEKNKFKLLLLPILVVTFIIVLNQFLILILIPIFNDSLKEILSFTGTS NLVDEAFCLFLSIFLITKISKLKAEQLGFSKDNIASSYLKGALFGILQISTVFFIIFALK AIDVYYVGNINVLLLIKVFIIFIFQGLFEEILFRGYLMPMFSKVIGIKFTIILLSFLFTC IHLINPNLDIIGLANVFLAGVTFSLIYYYTGNLWIVGAMHTLWNFILGFIVGSQISGIVT YHSVFFSIPVENKDLISGGVFGFEASIVTTIVELAISLFVIYLIKKEKNKINF >gi|296155282|gb|ADVK01000010.1| GENE 91 92871 - 93506 736 211 aa, chain - ## HITS:1 COG:no KEGG:FN0641 NR:ns ## KEGG: FN0641 # Name: not_defined # Def: methyltransferase (EC:2.1.1.-) # Organism: F.nucleatum # Pathway: 
Histidine metabolism [PATH:fnu00340]; Tyrosine metabolism [PATH:fnu00350]; Polycyclic aromatic hydrocarbon degradation [PATH:fnu00624]; Microbial metabolism in diverse environments [PATH:fnu01120] # 1 211 1 211 211 319 96.0 6e-86 MKKFIINNYLEDIRKKIPAYDLMLEIIFNSILKVKTDISQIKNILAIGGQSFEAKNLSKI YNNSKITIVEPSEIMLNIVKNECEDLKNLEYICDKFESYKNDKNFQLCLCLLVLQFVENP KSFLEKIYNSLDKNSLFIISIFSNKQLAYWKEFALSKGAKKEQVEKTFNNQSEVMNILSP DYTETLLKEIGFLKVEKICEILSVDMWIAEK >gi|296155282|gb|ADVK01000010.1| GENE 92 93519 - 93989 475 156 aa, chain - ## HITS:1 COG:no KEGG:FN0642 NR:ns ## KEGG: FN0642 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 156 1 156 156 282 97.0 3e-75 MLFVYSSEIKYWEGYLKEAGSNAFTLSPLKWISTTNAVGLFVRLGRGGGTFAHKDIVFKF ASWLSAEFELYIIKDYQRLKDDENSKLSLNWNLNREISKINYKIHTDAIKTYLLGNLTKE QLSYKYVSEADMLNVALFNKRTKEWREENPKLKVNI >gi|296155282|gb|ADVK01000010.1| GENE 93 93962 - 94399 650 145 aa, chain - ## HITS:1 COG:FN0643 KEGG:ns NR:ns ## COG: FN0643 COG3708 # Protein_GI_number: 19703978 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 145 1 145 145 272 97.0 1e-73 MPYRLKAVTIRTNNSEEGIRKIAELWEDVLTGKLPLLSDGIVPISQYSNYESNEKGDYDI SIVGVEHNFFEDIEKEVEKGLYKKYEAVDENGSVELCTKKAWENVWNDTHSGILKRAFTI DFESSVSAEFSKDGKAHCYLYIAVK >gi|296155282|gb|ADVK01000010.1| GENE 94 94415 - 95872 1865 485 aa, chain - ## HITS:1 COG:FN0644_1 KEGG:ns NR:ns ## COG: FN0644_1 COG0007 # Protein_GI_number: 19703979 # Func_class: H Coenzyme transport and metabolism # Function: Uroporphyrinogen-III methylase # Organism: Fusobacterium nucleatum # 1 251 1 251 251 469 95.0 1e-132 MKKGKAYIIGAGPGDFELLTLKSKRIIENADCIVYDRLISDDILKLAKKDAELIYLGKGN TEGGLIQDEINETLVKKCLEGKKVARVKGGDPFVFGRGGEEIEALFKNEIEFEVIPGITS SISVPAYAGIPVTHRGLARSFHIFTGHTMENGKWHNFENIAKLEGTLIFLMGVKNLDLIV NDLIKYGKDSKTPVAIIEKGATKNQRVTVGNLENILELVEKNKITPPAITIIGEVVNLRE SFKWFGTENLAKKILVTRDKKQAVEMSENISKRGGIPVELPFIEIENLNIDLKDLEKYKA 
ILFNSPNGVKAFFENIKDVRCLANIKIGAVGVKTKEILEKYKIVPDFVPNEYLVDKLAEE AVKYTNENDNILIVTSDISPCDTDKYNSLYKRNYEKIVAYNTKKLKVDREKVLETLKDID IITFLSSSTVEAFYENLDGDFFILGDKKIASIGPMTSETIKRLGMKVDYEAEKYTADGLL DVIFK >gi|296155282|gb|ADVK01000010.1| GENE 95 95896 - 96792 1339 298 aa, chain - ## HITS:1 COG:FN0645 KEGG:ns NR:ns ## COG: FN0645 COG0181 # Protein_GI_number: 19703980 # Func_class: H Coenzyme transport and metabolism # Function: Porphobilinogen deaminase # Organism: Fusobacterium nucleatum # 1 298 1 298 298 556 95.0 1e-158 MKKNIIIGSRGSILALAQANLVKDRLQGNYPNLSFEIKEIVTSGDKDLKSNWENSDISLK SFFTKEIEQKLLDGEIDIAVHSMKDMPAISPKGLICGAIPDREDPRDVLVSKNGFLVTLP QGAKVGTSSLRRAMNLKAVRPDFEIKHLRGNIHTRLKKLETEDYDAIVLAAAGLKRTGLA DKITEYLNGEVFPPAPAQGVLYIQCRENDEEIKKFLKSIHNETVAKIVEIEREFSKIFDG GCHTPMGCYSQINGDKIKFTAVYSDEGKQIRAVVEDDLAKGKEIAHMVAEEIKNKIVK >gi|296155282|gb|ADVK01000010.1| GENE 96 96828 - 97817 1149 329 aa, chain - ## HITS:1 COG:FN0646 KEGG:ns NR:ns ## COG: FN0646 COG0373 # Protein_GI_number: 19703981 # Func_class: H Coenzyme transport and metabolism # Function: Glutamyl-tRNA reductase # Organism: Fusobacterium nucleatum # 1 329 1 329 329 571 98.0 1e-163 MLDLENIIVIGVSHENLSLLERENFMRTRPKYIIERLYTEKKINAYINLSTCLRTEFYIE LNSNIKAKEIKNLFSVDMIVKSGVEAIEYLFKVGCGFYSVIKGEDQILAQVKGAYAEALE NEHSSKFLNIIFNKSIELGKKFRTKSMIAHNALSLEAISLKFIKSKFPNIEDKNIFILGI GELAQDILTLLTKEQLKNIYIANRTYHKAEQIKKEFDIVNIIDYREKYPKMIEADVIISA TSAPHIVVEYDKFVPQMKENKDYLFIDLAVPRDVDERLANFKNIEIYNLDDIWEVYHLNS MNRDKLLEDYSYLIDEQMEKLIKTLNYYE >gi|296155282|gb|ADVK01000010.1| GENE 97 97918 - 99048 1054 376 aa, chain - ## HITS:1 COG:FN0647 KEGG:ns NR:ns ## COG: FN0647 COG3629 # Protein_GI_number: 19703982 # Func_class: T Signal transduction mechanisms # Function: DNA-binding transcriptional activator of the SARP family # Organism: Fusobacterium nucleatum # 1 376 1 376 376 603 100.0 1e-172 MLDIKFLGKIKIEYDGVDITDKFGAKTKALLSLLILNKDKPLNREKIILYLWPDSSEDSG KFNLRFNLWQLKNIIGLDENGNKFLHTGRSHCGINVNYKYNCDIIDIKTFNLKENVTIKK 
LEELRKKFSGEFFEGFYFKNCNDFNESIILERGYFEEQKIKILLKLVSLYEIEQNFEKCS EILKELINIEPYDEEIALRILEIYEKNGKRSSAILFYEDFKKKFMTFLGIQPSEELEKKY LEIKSKDISKEKIDNKNKVTFKNKNELLLETHCVGEIEYYWTNNLLDKILENINISNYLN EKEIKDLGYININLFTDALLLIPPKVRIINILLKLLEKLTTEYNLIVKIIQIEKIDYISK IFLEEIERREFITIKE >gi|296155282|gb|ADVK01000010.1| GENE 98 99060 - 100589 1874 509 aa, chain - ## HITS:1 COG:FN0648 KEGG:ns NR:ns ## COG: FN0648 COG1574 # Protein_GI_number: 19703983 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolase with the TIM-barrel fold # Organism: Fusobacterium nucleatum # 1 509 1 509 509 993 99.0 0 MYADIIIKNAKCITVNNKKVFEWLAIKDEKIIAIDNGEKYNSLINKTTIILDAKGKSVLP GFIDSHFHLVQTAMNEEGINLHNVSTFEEIGKKIKKANLKSKESIFGVRLEKENLQEKKF PDRKVLDQFSDNIPIWINNLDYQVSILNTYGLLYYKIPFRIDGIELDEKGAPTGIFRGKA NATLRTNILNSYPNKNREENIRKLIPKLLKVGITTVNAMEGGYMYSDKDANFIYEHIADF PIDIVLFYQCLDLDKVREKKLKRIGGSLYIDGTMGARTAALTFEYNDKPGEMGRLIFSQK ELNEFVEECYINKLQLSLYTIGDRAIEIALNAHEYALEKTGIKGLRHRMEHVELASLSQI KRAKKLGIIFSMNPTYERYWGGSNKMYSERLGKKYVETNKFREIIDGGLTLCGGSDSDVC DYNPFVGIHSAVNHPVKKHRISLYEAIRIYTINGAYAIFEENNKGSLEIGKLADIIILDT DIFEIDKESIDKIKVLCTIKSGKILYNAI >gi|296155282|gb|ADVK01000010.1| GENE 99 100617 - 102245 2256 542 aa, chain - ## HITS:1 COG:FN0649 KEGG:ns NR:ns ## COG: FN0649 COG1574 # Protein_GI_number: 19703984 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolase with the TIM-barrel fold # Organism: Fusobacterium nucleatum # 1 542 1 542 542 1077 99.0 0 MLDKLFINGEIYSMKKEGEKFQSLGVKDGKIVFLGTNDEAKNVSSKELIDLKGKMMIPGM ADAHLHLYAYCQNLTFVDLSKVHDINEMVSLMKEKVKNIKKGDWIKGVNFDQSKWKENRF PTLQEMDSISKDNPVIIKRCCLHAVVANSKALEMAGIGKNYQAGSGGIVELDKDGMPNGI LREQSTKVFDDILPDPLKDIEVQKKIMQDVLNDMSSKGITTIHTYAAKIWQYNEDISIYK NFEKEEKLPLRVTVCIDELFEPEILTEEKLNNPYRKVQLGAYKIFSDGSMGSRSAALKAP YTDDPENSGFMLFTQEELNNKILTAYEHGLQPAIHAIGDRALDMTLAAIEYTLKTTKEKG MTDEEQKKRLPFRIIHVQMIDDNLLERMKKLPLVLDIQPIFLCTDLHWIEDRIGKERLKG SFALKTMENAGLIQTGGSDCPVETYEPLKGIYAAVTRQDMEGYPTEGFLPEERLSIYEAL 
CMYTKNVHYATGQESVLGTLEIGKFADLTVLEKDLFKIDETEIKDVKVEQTYVAGNCVFM IK >gi|296155282|gb|ADVK01000010.1| GENE 100 102245 - 103669 1617 474 aa, chain - ## HITS:1 COG:FN0650 KEGG:ns NR:ns ## COG: FN0650 COG1757 # Protein_GI_number: 19703985 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 1 474 1 474 474 805 99.0 0 MQNQGNVKQPSLWLCLSIVLFLIVSFLLQLIIKGEPDVHMTLFFASVFASAMLIIFNKTK FSLIEEGIIHGCKIATISMMILMFIGVMIPAWIAAGTIPTLIYYGLKLISPSIFLVTATL TCAIATLCTGTSWGTAATFGVALMGIGGGLGISPGMTAAAVICGAIFGDKMSPISDTVNL SAGTCEVNIFDNIKSVATATIPGFILTIVVFIFLDLKFNSGEIHSQAVDNMLAILSNNFN LTPIHALVSLIPMILVLILALKKINALATIVVSAIVAMFIAILLQKYSLIDMMSYMNYGF KIDTGNFDVDKLLNRGGLQSMMWTVSIGYLGLSYGGILEKTGVLNTLLNSMQTITKNSRN LILSHIVTGFLTIMLSASPYVSILIPGRMFIKGYEKLGIKKSVASRTCEHSGICLDPLLP WSLGAVYFSGVLGVKTMDYAIYCVLLYVVPLIATFYAITGIFVWKENSKEGVND >gi|296155282|gb|ADVK01000010.1| GENE 101 103892 - 104758 232 288 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|161507907|ref|YP_001577871.1| ribosomal protein large subunit [Lactobacillus helveticus DPC 4571] # 74 277 82 278 285 94 33 4e-18 MKKYIVEHEYDGYEIGTYLKETKGYSSRGLRNLEIYLNGKRIKNNAKKIKKLNRIVVIEK EKSTGIKAMDIPIDIAYEDENLLIVNKEPYIIVHPTQKKVDKTLANAVVNYFEKTLGKTL VPRFYNRLDMNTSGLIIIAKNAYTQAFLQDKTEVKKTYKVIASGIIEKDDFFIKIPIGKV GDDLRRIELSEENGGKSAKTHIKVLEKNYEKNITFLEARLYTGRTHQIRAHLSLIGHSLV GDELYGGDMKLAKRQMLHAYKLEFQNPKTLENLKIEIDIPVDMKEVLK >gi|296155282|gb|ADVK01000010.1| GENE 102 104940 - 105947 1799 335 aa, chain + ## HITS:1 COG:FN0652 KEGG:ns NR:ns ## COG: FN0652 COG0057 # Protein_GI_number: 19703987 # Func_class: G Carbohydrate transport and metabolism # Function: Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase # Organism: Fusobacterium nucleatum # 1 335 1 335 335 627 99.0 1e-179 MAVKVAINGFGRIGRLALRVMSKNKDFDVVAINDLTDAKTLAHLFKYDSAQGRFDGTIEV TDDGFVVNGDSIKVFAKANPEELPWKDLGVDVVLECTGFFTSKEKAEAHIKAGAKKVVIS APATGDLKTVVYNVNDNILDGSETVISGASCTTNCLAPMAKVLNDKFGIVEGLMTTIHAY 
TNDQNTLDAPHKKGDLRRARAAAENIVPNTTGAAKAIGLVIPELKGKLDGAAQRVPVITG SITELVTVLGKEVTVEEINAAMKAASNESFGYTEEPLVSSDIIGINFGSLFDATQTKVLT VDGKQLVKTVAWYDNEMSYTSQLIRTLKKFVEISK >gi|296155282|gb|ADVK01000010.1| GENE 103 106026 - 106664 787 212 aa, chain + ## HITS:1 COG:no KEGG:FN0653 NR:ns ## KEGG: FN0653 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 212 1 212 212 392 99.0 1e-108 MITSKNLSSLPDIQKLKQICKSISALEIIMELEWLMRYYSYNLSWDVDEEVFEMRSGCGE NMLILFSKHGSVISGINDEYFDWKNDSPKIENLTKGLPKQFDNFIYNEPIKTRKNTFCIW RTVVDSEWQTGETIEPDGSEDILYPLDGDPEKYVEFCEDYYDKKVPLDIVEKIYQGEPIT LEMIYKLNDEIEDKDIEIIKNELEEIKYPNTL >gi|296155282|gb|ADVK01000010.1| GENE 104 106685 - 107881 1840 398 aa, chain + ## HITS:1 COG:FN0654 KEGG:ns NR:ns ## COG: FN0654 COG0126 # Protein_GI_number: 19703989 # Func_class: G Carbohydrate transport and metabolism # Function: 3-phosphoglycerate kinase # Organism: Fusobacterium nucleatum # 1 398 1 398 398 717 100.0 0 MKKIITDLDLNNKKVLMRVDFNVPMKDGKITDENRIVQALPTIKYVLEHNAKLILFSHLG KVKIEEDKATKSLKAVAEKLSELLGKNVTFIPETRGEKLESAINNLKSGEVLMFENTRFE DLDGKKESKNDSELGKYWASLGDVFVNDAFGTAHRAHASNVGIAENIGNGNSAVGFLVEK ELKFIGEAVNNPKRPLIAILGGAKVSDKIGVIENLLTKADKILIGGAMMFTFLKAEGKNI GTSLVEDDKLDLAKDLLAKSNGKIVLPVDTVIASEFKNDIEFSTVDVDNIPNNKMGLDIG EKTVTLFDSYIKTAKTVVWNGPMGVFEMSNFAKGTIGVCESIANLTDAVTIIGGGDSAAA AISLGYADKFTHISTGGGASLEFLEGKVLPGVEAISNK >gi|296155282|gb|ADVK01000010.1| GENE 105 107929 - 108288 572 119 aa, chain + ## HITS:1 COG:no KEGG:FN0655 NR:ns ## KEGG: FN0655 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 119 1 112 112 172 100.0 3e-42 MKNLKSLMAISFAVLSLGSFAADKVYEATAEAKGYNEEGVPIVLTVKAIKKDGKVVVTDI VAKHQETDKIGAVAIEKLIEEVKTKQNYNKLDSVAGATSTSAGFRRAIRNAVKDIEKQN >gi|296155282|gb|ADVK01000010.1| GENE 106 108303 - 108683 704 126 aa, chain + ## HITS:1 COG:no KEGG:FN0656 NR:ns ## KEGG: FN0656 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 126 1 126 126 217 100.0 1e-55 
MNFKDFGIREWLVIAFIILGLAAFAFEDIFKPKIYQAEGTGIGYNDDITLKVSAYKKKDK TIRVTNIEVEHADTDEIGGVAVQKLVDDIKAKQKLADIDFVAGATFTSEGFKEALDIAID DIRNQE >gi|296155282|gb|ADVK01000010.1| GENE 107 108844 - 109365 289 173 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|50365462|ref|YP_053887.1| acetyltransferase of 30S ribosomal protein L7 [Mesoplasma florum L1] # 3 172 2 170 170 115 39 9e-25 MEKIILVKPNLFYADEILKYKKEFLKDESIINGAAGLDRFSIVEDWLEELEKRSNKDTVP EGLVPSSTYLGIREKDNYIVGMIDIRHCLNDFLLQAGGHIGCGVRKSERKKDYAKQMIKL ALEKCRKLKIEKVLITCNDDNIASERSIISCGGKLEDIRTVDGKNYKRFWIEL >gi|296155282|gb|ADVK01000010.1| GENE 108 109418 - 110203 1149 261 aa, chain - ## HITS:1 COG:FN0658 KEGG:ns NR:ns ## COG: FN0658 COG1464 # Protein_GI_number: 19703993 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, periplasmic component/surface antigen # Organism: Fusobacterium nucleatum # 1 261 1 261 261 469 99.0 1e-132 MKFTKLFGTVGAFLLLSAGALAGTLKVGATPVPHAEILELIKPDLKKQGVDLKIVEFTDY VTPNLALSDKEIDANFFQHKPYLDKFIEERKLNLVSLGNVHVEPLGLYSKKIKSINDLKK GDTIAIPSDPSNGGRALILLHNKGVITLKDPKNLFATEFDIVKNPKKLKFKPTEVAQLPR ILPDVTAAIINGNYALQANLSPAKDSLILEGKESPYANILVVRKGGEKKEDIQKLLKALR SEKVKKYINEKYSDGSVVPAF >gi|296155282|gb|ADVK01000010.1| GENE 109 110218 - 110919 1008 233 aa, chain - ## HITS:1 COG:FN0659 KEGG:ns NR:ns ## COG: FN0659 COG2011 # Protein_GI_number: 19703994 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, permease component # Organism: Fusobacterium nucleatum # 1 233 1 233 233 355 99.0 6e-98 MEISSLIEPLFENFENPIVSMLAVSTVETIYMVFLSTIFSLLLGFPIGVLLVITKEDGIY EMKKFNAILGVIINALRSFPFIILMILLFPLSRFVVGSTIGATAAVVPLSIGAAPFVARI VEGALLEVDYGLIEASQSMGASNSTIIFKVMLPECYSTLVHGIIVTIISLIGYSAMAGTI GAGGLGDLAIRFGYLRFKLDIMIYAIIIIIILVQVIQSVGNYIVYRRQKKLGK >gi|296155282|gb|ADVK01000010.1| GENE 110 110909 - 111916 1117 335 aa, chain - ## HITS:1 COG:FN0660 KEGG:ns NR:ns ## COG: FN0660 COG1135 # Protein_GI_number: 19703995 # Func_class: P Inorganic ion transport and 
metabolism # Function: ABC-type metal ion transport system, ATPase component # Organism: Fusobacterium nucleatum # 1 335 1 335 335 612 99.0 1e-175 MITLENVNKIYSNNLHAVKDVNLKVNEGDIFGIIGLSGAGKSSLIRLINRLEEPTSGKIF INGENILNLNKTELLERRKKIGMIFQHFNLLSSRTVEENVAFALEIANWNKKDIGKRVSE LLEIVGLSDKAKYYPSQLSGGQKQRVSIARALANNPDILLSDEATSALDPKTTKSILELI KEIQQKFSLTVLMITHQMEVVKEICNKVAIMSDGRIVEQGGVHHIFAEPKNEITKELISY VHQQTDTELNYLHHKGKKIIKVKFLGTSTQEPIISKVIKEYGIDISVLGGTIDKLATMNI GHLYLELDGDLAAQTKAIEFMQTMDVIVEVIYNGD >gi|296155282|gb|ADVK01000010.1| GENE 111 112059 - 112166 152 35 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MGQSLRFSGWLFILSNLFLILEQVYNNLKRLSTII >gi|296155282|gb|ADVK01000010.1| GENE 112 112304 - 112711 454 135 aa, chain + ## HITS:1 COG:no KEGG:FN0661 NR:ns ## KEGG: FN0661 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 2 135 1 134 134 245 99.0 5e-64 MLHKENRFIAEIEGIFDRKILEEANNISFICTSLSIGSDRQLGKVLLETYIYTLSELEFL PKNIILFNEAVLLTLPISDCCLYLKRLQDRGVEILVCDTSASYYDIVKKMQVGKLIGMKS IIEKQLGATKLIHIS >gi|296155282|gb|ADVK01000010.1| GENE 113 112730 - 113686 1307 318 aa, chain + ## HITS:1 COG:FN0662 KEGG:ns NR:ns ## COG: FN0662 COG0010 # Protein_GI_number: 19703997 # Func_class: E Amino acid transport and metabolism # Function: Arginase/agmatinase/formimionoglutamate hydrolase, arginase family # Organism: Fusobacterium nucleatum # 1 318 1 318 318 655 100.0 0 MEWNGRVDGYDEDILRIHQVIQVKNLDELMENDYTGKKVCFVSYNSNEGIRRNNGRLGAA DGWKHLKTALSNFPIFDTDIKFYDLKDPVDVKAGKLEEAQQELAEVVAKLKSKDYFVVCM GGGHDIAYGTYNGILSYAKTQTKDPKVGIISFDAHFDMREYNKGANSGTMFYQIADDCKR EGIKFDYNVIGIQRFSNTKRLFDRAKSFGVTYYLAEDILKLSDLNIKPILERNDYIHLTI CTDVFHITCAPGVSAPQTFGIWPNQAIGLLNTIAKTKKNLTLEVAEISPRYDYDDRTSRL IANLIYQVILKHFDCEIN >gi|296155282|gb|ADVK01000010.1| GENE 114 113828 - 114295 530 155 aa, chain + ## HITS:1 COG:no KEGG:FN0663 NR:ns ## KEGG: FN0663 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 13 155 1 143 143 198 99.0 6e-50 
MISRIKKNFAELMESLTIKKDEILKQNKIINLESLLSFYEENKKIFLEKKEVLLTTINEY IPSINLKIDVDLSFIEKLDITCINELIEKVGKFYEDNCIEPVENDIREKVVEKFKKIIKF SKILYINFIDTISNYSLNYFSKKTERAPPIKFLFD >gi|296155282|gb|ADVK01000010.1| GENE 115 114336 - 115484 1917 382 aa, chain + ## HITS:1 COG:FN0664 KEGG:ns NR:ns ## COG: FN0664 COG2070 # Protein_GI_number: 19703999 # Func_class: R General function prediction only # Function: Dioxygenases related to 2-nitropropane dioxygenase # Organism: Fusobacterium nucleatum # 1 382 1 382 382 770 100.0 0 MKELKGIKIGKYYIEKPIVQGGMGVGVSWDQLAGTVSKNGGLGTISGICTGYYDNLKYCK KVVNGRPIGADALNSREAMIELFKNARKICGDKPLACNILHALNDYSKIVEYALEAGANI IVTGAGLPLELPKLVESYPDVAIVPIVSSGRALKIICKKWQAAGRLPDAVIVEGPKSGGH QGVKADDLFIPEHQLESIVPEVKEERDKWGDFPIIAAGGIWDNDDIQKIMALGADAVQLG TRFIGTYECDASEEFKNILINAKKEDIVIVKSPVGYPGRAVKTNLIKNLTHDTPTIKCYS NCVTPCNLGEEARKVGFCIANCLSDSYNGKVDTGLFFSGENGYRIEKLVSVEDLINELMT STKNTILAKVNSENIIENTVNF >gi|296155282|gb|ADVK01000010.1| GENE 116 115633 - 116124 660 163 aa, chain + ## HITS:1 COG:no KEGG:FN0665 NR:ns ## KEGG: FN0665 # Name: not_defined # Def: N-acetylmuramoyl-L-alanine amidase (EC:3.5.1.28) # Organism: F.nucleatum # Pathway: not_defined # 11 163 1 153 153 289 96.0 3e-77 MKKLLLVTLFLLSSLSALAIRYVVDTKDGYANLRERADSKSKVIKKLKNNHEMVFWHEKG EWFCVGAEPDDKYSDMTDGYIHRSQIKLHPKTYTISSKDGYANVRNEAAANSHSIAELKN GTLVTKFEEKGEWWGIEFDSEDGTPFDYGYVHKSQLKKYVEKM >gi|296155282|gb|ADVK01000010.1| GENE 117 116156 - 116794 764 212 aa, chain + ## HITS:1 COG:no KEGG:FN0666 NR:ns ## KEGG: FN0666 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 212 1 205 205 300 98.0 2e-80 MKKLLILVLTLFSLEGFSANYEVVKNPSVKISKQDIQKNNKSIEEAIKEEYAWNSESDLL VSRRNEAIDEYKNFEKTSYFLEKPYFEAINEVGTMFEKNTITEIKYNSPTEVEVYITENG KFLADIAENCKVEVDKKFKSKMGYLPEDFRENVKNKEEVRKVYEEYRDLMKKELLSKRKE IESAEEGSLEVFYTVEKKNNKWTVIERHARVN >gi|296155282|gb|ADVK01000010.1| GENE 118 116801 - 118138 1281 445 aa, chain - ## HITS:1 COG:FN0667 KEGG:ns NR:ns ## COG: FN0667 COG0534 # Protein_GI_number: 19704002 # Func_class: V 
Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 416 6 421 426 660 100.0 0 MKKIYDMTKGKIWTIILSFSLPLLGASLIQQLYNTADMIFVGNFVGKEATGAVGASSLLF TCIIGLFTGVSIGVGVAVSQKIGSKNLETASKVSHTAITFGIIGGIILTLVGFFSAEFLL TLMNTPKEIMYESVIYLKIYFLSMLPMILYNIGSGIIRSTGNSKTPFYILIIGGLTNVLA NYIFIVVFKMGVSGVAIATTLSQTLTAVIVLTYLFKNKTAIKFKSSELKINFSLLKQILY FGLPAGIQSMLITFSNIIVQYYINGYGGDAVAAYATYFKLENFIWMPIVAIGQASMTFSG QNVGANNYKRVKKGALVAIFLSGGLSIVIATIILTFSHTFMRIFIKNEEIIYLGSQIALT TFPFYWLYSILEVLGSSLRGMGYSIVSMYITTICLCGVRISLLYLISKFNFDFKSVAYVY PMTWFFTASIFIIAFLKIINKKDYK >gi|296155282|gb|ADVK01000010.1| GENE 119 118560 - 119486 1459 308 aa, chain + ## HITS:1 COG:FN0668 KEGG:ns NR:ns ## COG: FN0668 COG0803 # Protein_GI_number: 19704003 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, periplasmic component/surface adhesin # Organism: Fusobacterium nucleatum # 1 308 5 312 312 556 100.0 1e-158 MKKIFKLLTVMMISLFIVACGEKKEEVGTSNEIQKIKVTTTLNYYQNLIEEIGGDKVEVI GLMKEGEDPHLYVATAGDVEKLQNADLVVYGGLHLEGKMTDIFANLSNKYILNLGDQLDK SLLHKEDENTYDPHVWFNTKFWAIQAKSVADKLSEILPENKDYFENNLQVYLKSLDEATE YIQAKINEIPEESRYLITAHDAFAYFAEQFGLQVKAIQGVSTDSEIGTKQIEDLANFIVE HNIKAIFVESSVNHKSIEALQEAVKAKGGNVEIGGELYSDSMGDKENNTETYIKTIKANA DTISNALK >gi|296155282|gb|ADVK01000010.1| GENE 120 119510 - 120193 258 227 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|225084369|ref|YP_002657150.1| ribosomal protein S16 [gamma proteobacterium NOR51-B] # 3 211 8 222 309 103 28 3e-21 MNAIEIRNLTVAYGENIALENLNLDVEVGSLMALVGPNGAGKSTLIKTILKFLKQITGEI KINGKTLAYVPQRNSVDWDFPTTLFDVVEMGCYGRVGLFKRVNKEEKQKVLKAIEQVGML DFKDRQISELSGGQQQRTFIARALVQEADIYLMDEPFQGVDSTTEKSIVNILKKLKSEGK TLIIVHHDLQTVPTYFESVTFINKTVIASGKVKEIYTQENIDKTYRK >gi|296155282|gb|ADVK01000010.1| GENE 121 120281 - 121198 1042 305 aa, chain + ## HITS:1 COG:FN0670 KEGG:ns NR:ns ## COG: FN0670 COG1108 # Protein_GI_number: 19704005 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Mn2+/Zn2+ transport 
systems, permease components # Organism: Fusobacterium nucleatum # 1 305 1 305 305 438 100.0 1e-123 MAEILKLFFNSYTFKVVTLGCILLGVVSAIIGTFAVLKKESLLGDGVSHASLAGICLAFL ITGKKELYILLLGALIIGLICIFLIHYTQLKSKVKFDSAIALMLSTFFGLGLVLLTYLKK IPGAKKAGLNRFIFGQASTLVVKDIYLIIVVGLILIFLVLLFWKEIKISIFQADYAKTLG INSSKINFLVSTMIVINVIIGIQIAGVILMTAMLVIPPVAAKQWSKKLSVVTLLSAIIGG ISGAMGSIISTFDVALPTGPLIILVSGIFALISFLFSKKGIIARNYRIYIRNKKLKLEEN KGDKE >gi|296155282|gb|ADVK01000010.1| GENE 122 121195 - 122037 1015 280 aa, chain + ## HITS:1 COG:FN0671 KEGG:ns NR:ns ## COG: FN0671 COG1108 # Protein_GI_number: 19704006 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Mn2+/Zn2+ transport systems, permease components # Organism: Fusobacterium nucleatum # 1 280 1 280 280 340 100.0 2e-93 MSAGLTIQLIAILISVACSLLGVFLVLRSMSMLTDAISHTVLLGIVLSFFIIHKLDSPLL IVGATLTGLLTVYLVELITDTNLVKEDAAIGIVLSILFSIAVVLISKYTANIHLDIDAVL LGEIAFAPFHTKEIFGFKIANGIVNGVSILILNLLVITIFFKEIKISIFDRALALTLGLF PEVFHYLLMSLVSVTAVVSFDVVGATLMISFMIGPATIAYMISKSLRMMLIYSALIGAIS SILGYHLAVLLDVSISGSIAVVIGIVFFIVLFGKRVKKII >gi|296155282|gb|ADVK01000010.1| GENE 123 122225 - 123541 1749 438 aa, chain + ## HITS:1 COG:FN0672 KEGG:ns NR:ns ## COG: FN0672 COG1373 # Protein_GI_number: 19704007 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Fusobacterium nucleatum # 1 438 1 438 438 777 98.0 0 MLKKESELKNRSHYLEKLIEFKDTDFVKIITGIRRCGKSSLMKLMIKHLLENGVEKEQII QINFESIEFKKMTVENLYNYVKSNLPKDKKAYLFFDEIQKILEWQDAINSFRVDFECDIY ITGSNAFLLSSEYATYLAGRSVEIKVFPLSFIEFVDFHGYKIIEKKNLVGRITRKIENEN GETYEIKELFEAYMTFGGMPSLTEVPLELDKALTILDGIYSSVVIRDILEREKQRGRRQV TDSSLLRKIIMFLADNIGNNTSINSISNILLNEKLIETKPAVQTVQSYVSTLLEAYIFYE IKRFDIKGKEYLKTLGKYYIIDIGLRNYLLGFRNRDIGHIIENIVYFELLRRGYDVAIGK IGENEIDFIATNINTKIYIQVTENMTNSTIRERELAPFYKIQDNFEKIVITNDESYLGIQ DGIKIIRLVDFLLDENIL >gi|296155282|gb|ADVK01000010.1| GENE 124 123565 - 124029 650 154 aa, chain + ## HITS:1 COG:FN0673 KEGG:ns NR:ns ## COG: FN0673 COG2606 # Protein_GI_number: 19704008 
# Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 154 1 154 154 265 96.0 3e-71 MSIEAVRKHLEKYGLENKIREFKESTATVEEAAKVNSCEPARIAKSLSFIINDIPTIIVV AGDAKINNQKFKAKFKTKAKMIVGNDVENLIGHPVGGVCPFGIKDNVKVYLDESMKRFET MLPACGTANSAIELTLEELEKASNYIEWIDVCQI >gi|296155282|gb|ADVK01000010.1| GENE 125 124054 - 124902 696 282 aa, chain + ## HITS:1 COG:FN0674 KEGG:ns NR:ns ## COG: FN0674 COG0697 # Protein_GI_number: 19704009 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 282 1 282 282 385 99.0 1e-107 MQENHKYNIYMFIATIFFGMTYVLTKICLKYSTEFHIISFRFLIAFVVSLIFLQKKIFPV KRTEFLYSLLLGALLFLVFIAMTIGVKYTTATNASFLISLSVIFIPFFSWFFNKEKPKKR IFVVLIIALIGIILLTLDKNLEFHMGDILCLICATLFTFYVIMTEKIVKNNNPTALGVLQ FGWVFLFSFLVQYPIESFVIPKNKFFWLSMVLLGIFCTAFGYIAQTIAQKNLSSTVVGFI LSLEPVFSGIFGYFFLNEYLSFQQYIGAFLLLISVIYVSVKN >gi|296155282|gb|ADVK01000010.1| GENE 126 125001 - 126620 1595 539 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|167855908|ref|ZP_02478658.1| 50S ribosomal protein L28 [Haemophilus parasuis 29755] # 2 539 3 547 547 619 59 1e-176 MAKIINFNDEARKKLEIGVNTLADAVKVTLGPRGRNVVLEKSYGAPLITNDGVTIAKEIE LEDPFENMGAALVKEVAIKSNDVAGDGTTTATILAQAIVKEGLKMLSAGANPIFLKKGIE LAAKEAVEVLKDKAKKIESNEEISQVASISAGDEEIGKLIAQAMAKVGETGVITVEEAKS LETTLEIVEGMQFDKGYVSPYMVTDSERMTAELDNPLILLTDKKISSMKELLPLLEQTVQ MSKPVLIVADDIEGEALTTLVINKLRGTLNVVAVKAPAFGDRRKAILEDIAILTGGVVIS EEKGMKLEEATIEQLGKAKTVKVTKDLTVIVDGGGQQKDISARVNSIKAQIEETTSDYDK EKLQERLAKLSGGVAVIKVGAATEVEMKDKKLRIEDALNATRAAVEEGIVAGGGTILLDI IESMKDFNETDEIAMGIEIVKRALEAPIKQIAENCGLNGGVVLEKVRMSPKGFGFDAKNE KYVNMIESGIIDPAKVTRAAIQNSTSVASLLLTTEVVIANKKEEEKAPMGAGGMMPGMM >gi|296155282|gb|ADVK01000010.1| GENE 127 126638 - 126910 506 90 aa, chain - ## HITS:1 COG:FN0676 KEGG:ns NR:ns ## COG: FN0676 COG0234 # Protein_GI_number: 19704011 # Func_class: O 
Posttranslational modification, protein turnover, chaperones # Function: Co-chaperonin GroES (HSP10) # Organism: Fusobacterium nucleatum # 1 90 1 90 90 142 100.0 2e-34 MNIKPIGERVLLKPIKKEEKTKSGILLSSKSSNTDTQNQAEVIALGKGEKLEGIKVGDKV IFNKFSGNEIEDGDVKYLIVNAEDILAIIG >gi|296155282|gb|ADVK01000010.1| GENE 128 127082 - 128704 1904 540 aa, chain + ## HITS:1 COG:no KEGG:Lebu_0003 NR:ns ## KEGG: Lebu_0003 # Name: not_defined # Def: hypothetical protein # Organism: L.buccalis # Pathway: not_defined # 2 539 3 544 545 569 58.0 1e-160 MKKLAIGVDDFKEIIKENFYYIDKTKFIEDILEDGSKVKLLNRPRRFGKTINMTTLKYFF DIKNAEENRKLFNNLYIEKSKYIEEQGKYPVIFLSLKEIKGKTWEEMLEQIKNYTSSIYN NFEYIREVLNESELKNFDTIWLKKERADYSNSIKNLTNFLYKYYKKETILLIDEYDVPLI EAYLNNYYSDAISFFKIFLGGALKTNQYLKMGVMTGIIRVIKAGIFSDLNNLSVYTILDN DYDEAFGLTEKEVEQALKDYTILEELNDVKFWYDGYKIGNKEVYNPWSIINFLKNKELKG FWIKTSGNQLIKKVLEDATSDVNEGLLKLFNGEDVEEVVTGTSDLSNLLNYRDVWELLVF SGYLTIKEKIDRRNYILKIPNQEIREFFKDEFIDLYFGESKLKKILNALKENNIEEFERI FQNILLNSVSTWDTSKEAFYHGLSFGMLSYLDGEYYVTSNFESGYGRYDIIVEPRNKNKR GFIIECKIVKDEKDLEKISKEAIEQIKNKKYDTKLKERGIKEITLLGLAFCGKRMKVNYE >gi|296155282|gb|ADVK01000010.1| GENE 129 128949 - 129185 152 78 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327582|ref|ZP_06870128.1| ## NR: gi|296327582|ref|ZP_06870128.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 78 1 78 78 109 100.0 5e-23 MSSNYIDPISGKSLSIEEVLEENKTDEDLQNIIYLYYYHKILKIGYLEEIEKNNLVFQNY QKFLENLIKLLQRENYQK >gi|296155282|gb|ADVK01000010.1| GENE 130 129234 - 129986 737 250 aa, chain + ## HITS:1 COG:no KEGG:SNSL254_A2953 NR:ns ## KEGG: SNSL254_A2953 # Name: not_defined # Def: hypothetical protein # Organism: S.enterica_Newport # Pathway: not_defined # 2 239 126 360 371 145 38.0 2e-33 MADCLVGKYEFYFNDGSSGNITKKLSMKNYHKKIYHTGIFDNFDREIPIINLFKLHGSVS WKYINDKNNKPREIKVEYFEDNYTKYPENLIEKVSNEEIDREKKEIENNKDLKEKIKNVK NELFEKFALIFPEKNKFKNTLYQEFYYQNLRQLSYELEKQNSILIVFGFSFGDEHIAEIV KRACNNPTLNIYIFCYSLNTENEILNNLKLEEFPNNIKTILPEDNGKIDFNIFLKKLFEV NSKIESDNNE >gi|296155282|gb|ADVK01000010.1| GENE 131 129979 - 131106 1139 375 aa, chain + ## HITS:1 COG:no KEGG:TERTU_1467 NR:ns ## KEGG: TERTU_1467 # Name: not_defined # Def: hypothetical protein # Organism: T.turnerae # Pathway: not_defined # 7 373 4 370 595 236 39.0 2e-60 MNSNFEVGIIWAVNGNQVIIKMNSITSDFTYFYNGEEYEGIRNGGYLSITRGHIDIVCQI ESEEIKDNYQKESKNNYKENERYEKFIYARCIGFFENEKFNFGIKYSPIIYNTVKMISSN TIKAILSPKKEKNEIEFVIGKDLQYGLKIDLPWNKIFNTHIGIFGNTGSGKSNTLTSLYK ILLNHEKLNLENSKFVFIDFNGEYTGDKVLCRNKIIYELNTRQDNKGDKFPIKSSSFWDE EVLSILFHATEKTQKSFLKNLVKGRNKYISNKNSLENYLKKIYIRCLETLNPKKEFVEIL IRTLKLIDYDDIFLKKLQWHSQNQKFYIGKIYYNQGKVDESLYKHLEDRIKVENTDQFTE LEIRANLDLIRGLFT >gi|296155282|gb|ADVK01000010.1| GENE 132 131273 - 131389 69 38 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MFPILLAKEYFKEHKKQNEKEEKKTFHLIIDEAHNILS >gi|296155282|gb|ADVK01000010.1| GENE 133 131524 - 131787 318 87 aa, chain + ## HITS:1 COG:no KEGG:Tola_2268 NR:ns ## KEGG: Tola_2268 # Name: not_defined # Def: hypothetical protein # Organism: T.auensis # Pathway: not_defined # 1 86 510 594 594 101 55.0 8e-21 MSQFHNFFIHRLINEKDLRLIDNVISTLDRSSKQLIPVLPQGACIVTGTAFEFPKIIQVD KIENREERPNSDDIDLEELWEKNEEIK >gi|296155282|gb|ADVK01000010.1| GENE 134 131809 - 132909 738 366 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327587|ref|ZP_06870133.1| ## NR: gi|296327587|ref|ZP_06870133.1| ABC 
superfamily ATP binding cassette transporter permease subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] ABC superfamily ATP binding cassette transporter permease subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 366 1 366 366 465 100.0 1e-129 MEELLNYLKEFLFKIFKNKWKALIIYIIILLTISFKGFFSEGLNTLNSNNILSFYNLPKE IVGIKYIEFWQFLFGVLLIIGMTIYPKIFLKKYLILKIIFEIVFTLFNTIIIGLIMGKFF KNNNFIRFLIIHPETFYIVWLLFFSAVLEIKDIKESYYWLEIYWLKFIRKLKEISNYIKI SKVRLKKKIKKMRFNELLCYLIIVIYYTLLITIFFLLLFYNNIYLFNKILVMLILLIMLK TFYIYKLKIFILFKSMGISVLVLIPLYSFFNITNKGPEDKRIYLYLEKEQGIIDVLVYQK EEDYVMQNAKIIKTYSNKKILILDTRSFYILNKEEIYNKEIKVENFDNIEKGIVMDNEEA KIKLEL >gi|296155282|gb|ADVK01000010.1| GENE 135 133122 - 133724 832 200 aa, chain + ## HITS:1 COG:FN0678 KEGG:ns NR:ns ## COG: FN0678 COG2815 # Protein_GI_number: 19704013 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 200 1 200 200 333 100.0 1e-91 MKKFRKENEVDDLDLDNLEFEEEVIEIEEKKDDKKKLAKIILNIVLILAIIKLGSNIFQR YYFNEFYYKAPNLLGLNIDEAKKTISHSALNIREMGEVYSDLPYGTVALQEPSEGTIVKR ARNIKVWVSKESPSVFLDDLVGMNYIEASSLLNKNGMKVGEVKRIKSDLPINQVIATSPK SGEPISRGQKFDFLISNGLE >gi|296155282|gb|ADVK01000010.1| GENE 136 133749 - 134615 841 288 aa, chain + ## HITS:1 COG:FN0679 KEGG:ns NR:ns ## COG: FN0679 COG1162 # Protein_GI_number: 19704014 # Func_class: R General function prediction only # Function: Predicted GTPases # Organism: Fusobacterium nucleatum # 1 285 1 285 285 521 99.0 1e-148 MINKIQGFYYVESNGEVFECKLRGILKKTNNKYNCVVGDRVEISEDNSIVEIFKRDNLLI RPIVANVDYLAIQFAAKHPNIDFERINLLLLTAFYYKIKPIVIVNKIDYLTEEELVELKE KLSFLERISVPMFLISCYQNIGLEEVENFLKDKITVIGGPSGVGKSSFINFLQSERILKT GEISERLQRGKHTTRDSNMIKMKAGGYIIDTPGFSSIEVPNIENREELISLFPEFLNIDS CKFLNCSHIHEPGCNVKKQVEENKISKERYDFYKKTLEILLERWNRYD >gi|296155282|gb|ADVK01000010.1| GENE 137 134608 - 135255 1132 215 aa, chain + ## HITS:1 COG:FN0680 KEGG:ns NR:ns ## COG: FN0680 COG0036 # Protein_GI_number: 19704015 # Func_class: G Carbohydrate transport and 
metabolism # Function: Pentose-5-phosphate-3-epimerase # Organism: Fusobacterium nucleatum # 1 215 1 215 215 405 100.0 1e-113 MTNGIKIAPSILSSDFSKLGEEVAAIDKAGADYVHIDVMDGQFVPNLTFGPPVIKCIRKC TGLIFDVHLMIDKPERYIEDFVKAGADIVVVHAESTIHLHRVIQQIKSFGIKAGISLNPS TPEEVLKYVINDIDMVLVMSVNPGFGGQKFIPAVVEKIKAIKKMRVDIDIEVDGGITDET IKVCVDAGANIFVAGSYVFSGNYKERIDLLKLKAK >gi|296155282|gb|ADVK01000010.1| GENE 138 135268 - 135945 943 225 aa, chain + ## HITS:1 COG:FN0681 KEGG:ns NR:ns ## COG: FN0681 COG1846 # Protein_GI_number: 19704016 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Fusobacterium nucleatum # 1 225 1 225 225 394 100.0 1e-110 MSVNIQRVNDVLEEYYKLFYKTEDMALKRGIKALTHTELHIIESIGENTQLTMNELADKI GITMGTATVAISKLSDKGYIDRARSTTDRRKVFVSLTKKGVDALTYHNNYHKMIMASITE SIPDKDLEQFVKTFEVILESLRNKTDYFKPMTITDFKEGTKVSVVEIKGTPIVQNYFLNH NIENFTLLKVLKSNDKSQFKIEKEDGEILTLDILDAKNLIGVKAD >gi|296155282|gb|ADVK01000010.1| GENE 139 135947 - 137572 1702 541 aa, chain + ## HITS:1 COG:FN0682 KEGG:ns NR:ns ## COG: FN0682 COG1293 # Protein_GI_number: 19704017 # Func_class: K Transcription # Function: Predicted RNA-binding protein homologous to eukaryotic snRNP # Organism: Fusobacterium nucleatum # 1 541 1 541 541 860 99.0 0 MLYIDGISLSKIKEELKKGLEGKRINRIFKNNEYTISIHFGKIELLLSCIPSLAICYITK SKEQPILDIASSIISNLRKNLMNAMLTDIEQLGFDRILAFHFSRINELGEIKKYKIYFEC IGKLSNVIFTDEENKVLDTLKKFHISENFDRTLFLGETYIRPKFDKKILPIDIKEDNFNR IIKNNLLLSSEIEGVGKFLNNIKSFDDFKNILNSDVKAKIYFRDKKIKLATVLDLNFKDY DEVKEFSSYDEMINFYIDYEHTTTSFMLLKNRLESLLEKKLKKLNKTLSLIKKDIEDSKT MDSIKEEGDILASVLYNVKRGMNSIKAYDFYNNKEIEIELNPLISPNENLDRIYKRYNKV KRGLTNAIRREKEIKEEINYVETTLLFIENSADVASLREIEEELIKLNYIKSLYNKKKTK LKKEVKYGLIEGENYLILYGRNNLENDNLTFKISAKDDYWFHVKDIPSSHIILKTSKLTD ELIVKSAQVSAYFSKANLGEKVTVDYTLRKNVSKPNGAKPGFVIYVSQKSIVVEKVELEK I >gi|296155282|gb|ADVK01000010.1| GENE 140 137624 - 137869 438 81 aa, chain - ## HITS:1 COG:no KEGG:FN0683 NR:ns ## KEGG: FN0683 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 81 1 
81 81 154 100.0 9e-37 MHDGCSGKFDDGMQVLAKLRMMGFSKQDMPFPMTFTCKECGEEITMTTFEYECPHCSMIY AVTPCHAFDVENILTAGKAKK >gi|296155282|gb|ADVK01000010.1| GENE 141 138051 - 139751 2750 566 aa, chain + ## HITS:1 COG:FN0684 KEGG:ns NR:ns ## COG: FN0684 COG1151 # Protein_GI_number: 19704019 # Func_class: C Energy production and conversion # Function: 6Fe-6S prismane cluster-containing protein # Organism: Fusobacterium nucleatum # 1 566 1 566 566 1142 99.0 0 MDKMFCYQCQETAKGTGCTSIGVCGKTSETSGLQDLLLYTEKGVAAYSTVFRKNGKAKEL LEGKVNRYLINSLFITITNANFDDNAILDEIKAGLKLREELKALATDEEKKEAEKYGADL VNWYYESNEDLIKFSENQSVVGVLRTENEDVRSLRELIMYGLKGLAAYAEHAFNLEKTSE EIFAFVEEALLGTMDDSLNAEQLVALTIKTGEYGVKVMALLDEANTSALGTPEITKVKIG AGKRPGILISGHDLWDLKQLLEQSKDSGIDIYTHSEMLPGHGYPELKKYSHFYGNYGNAW WDQRKDFTNFNGPIIFTTNCIVPPVKNATYKDRVFTTNATGYPGWKRIKVNADGTKDFSE VIELAKTCQAPVEVESGEITVGFAHNQVLSLADKVVENIKSGAIKRFVVMSGCDGRMSQR HYYTEFAENLPKDTIILTSGCAKYKYNKLNLGDINGIPRVLDAGQCNDSYSWAVVALKLK EVFGLNDINELPIVFNIAWYEQKAVIVLLALLYLGIKNIHVGPTLPGFLSPNVAKVLVEN FGIAGITTVEEDLKKFGLYEGSGLDN >gi|296155282|gb|ADVK01000010.1| GENE 142 139827 - 141281 1794 484 aa, chain - ## HITS:1 COG:FN0685 KEGG:ns NR:ns ## COG: FN0685 COG4145 # Protein_GI_number: 19704020 # Func_class: H Coenzyme transport and metabolism # Function: Na+/panthothenate symporter # Organism: Fusobacterium nucleatum # 1 484 1 484 484 784 100.0 0 MNKILIIIPIIIYLVTMLLIAYKVNNIKNSSKSFTNEYYLGSRSMGGFVLAMTIVATYVG ASSFIGGPGIAYNLGLGWVLLACIQVPTAFFTLGVLGKKLSIISRKLNAITIFDVLKARY NNSFLNVLASIMLIVFFISAIVAQFIGGARLFEAVTGLSYLTGLIIFSSVVIIYTTFGGF RAVTLTDAIQAVVMFTATTVLFVVILKNGNGMENIMMKIKDIDPNLLRPDSGGNIAKPFI MSFWILVGIGILGLPATTIRCMAFKDTKAMHNAMIIGTSLVGILVLGMHLVGVMGRAVIP DLQEVDKIIPILALKNLYPILAGVFIGGPLAAAMSTVDSLLIISSSTLIKDLYVTYLDKN TSESKIKKISMCTSFLIGILVFILSIKPISLIAWVNLFALGGQEIVFFCPLILGLYWKGA NATGAIASIFAGIATYLTLEILKTKIFALHNIVPGLVVAIIVFIVASYFGKKSDEKTIKI FFEY >gi|296155282|gb|ADVK01000010.1| GENE 143 141294 - 141587 269 97 aa, chain - ## HITS:1 COG:no KEGG:FN0686 NR:ns ## KEGG: FN0686 # Name: not_defined # Def: integral 
membrane protein # Organism: F.nucleatum # Pathway: not_defined # 1 97 8 104 104 152 100.0 4e-36 MDKDSKKYNISKQINKEVLITIVLYLIYFIWWYYFAYEYSSDNVEEYKYILGLPEWFFYS CVVGLILINILVYICIKFFFKDIDFDKYNEGNKSNQK >gi|296155282|gb|ADVK01000010.1| GENE 144 141723 - 143126 1088 467 aa, chain + ## HITS:1 COG:no KEGG:FN0687 NR:ns ## KEGG: FN0687 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 467 1 467 467 790 100.0 0 MYIAITGKGKSKVVQFCEQHRIAKTNKKKTVVIKTIGNYEALLEKNPNIIFELKEEAKRL TEEKKKNTSKNTLFRFGHSLVHSFWSEIGLSKILGESLSKTLFSLVVYRLGSSYSTYLEN RKTPFLNLESVSHSDFYETLLELEKKEKDLIECFNKFFEKKVKKEKKIAYYYLSSYKYNS YWKVLYGLSGLDIQEENEELNLNMALFFDSYGIPISYQLSMKEKFLEKKLKEIKKSLKIS KLILVSTQENKMKNRSFISPILFENLDFEIQREVLKERKWKVIEKDMKTDEVFEKNKIIN IDDNLKLYVYWTKKRAFKDYMEKNGRNGYICLMTDEELIEPHEIPNIFQHIWNIEDKFKI TDVKFSEKHLHGHFILCYICLCIIRYFQYLLGSDGKAFVPMIYANKAISNPMIFMEKKGN ELFLNPIHLTNSYLKLSKILGLGEFSQEMSVEKFEKNSGLKINNILL >gi|296155282|gb|ADVK01000010.1| GENE 145 143177 - 143689 847 170 aa, chain + ## HITS:1 COG:no KEGG:FN0688 NR:ns ## KEGG: FN0688 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 18 170 1 153 153 267 99.0 1e-70 MKKIFSYIVLSFALIMLVACGKPDSQKAFEKGFKETMADINKKMNEDDNEVTKMMAKILE KATYTVNRVEENGNISELDVTIKAVNLTKYLTEFMVSLKPLVESNMGEEVFTKATVNYFS DLSKKDLDYTETNVKVHMEKIEGEWKVINTDDILVGIFGGLKEFVRSPLN >gi|296155282|gb|ADVK01000010.1| GENE 146 143767 - 144279 847 170 aa, chain + ## HITS:1 COG:no KEGG:FN0689 NR:ns ## KEGG: FN0689 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 18 170 1 153 153 265 99.0 5e-70 MKKILRYLVLSLVVLMLVACGKPDSQKAFEERFKEFNSVLTKQMEGADEGSKKMAEIISK ATYTVNKVEEKGDNSELNVTIKAVNLGKYVNEYITAATEKYGVNVSADKQEEFNKFSVDY FTNVLNDKNIEYVDTEVNVQMQKSEEGWIITNPNDIVSATLGGAGNLIGL >gi|296155282|gb|ADVK01000010.1| GENE 147 144302 - 144811 621 169 aa, chain + ## HITS:1 COG:no KEGG:FN0690 NR:ns ## KEGG: FN0690 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: 
not_defined # 16 169 1 154 154 256 96.0 3e-67 MKKVFCYIILTCMFLMLVACGKPSSQKTFENFFKEFQNDLIKREEGSVEPFKTILKISEK TTYKINKVEENGDNAQLDVTIKAVNLGKYTDELSAHLEATASAELTEEEINQKAVDYFTE LLKNEKKLEFIETNIQVQMKKISGKWNILNTDDIYTAIAGNPVTVEIVP Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:16:08 2011 Seq name: gi|296155275|gb|ADVK01000011.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00012, whole genome shotgun sequence Length of sequence - 10806 bp Number of predicted genes - 6, with homology - 6 Number of transcription units - 2, operones - 1 average op.length - 5.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 151 - 210 14.3 1 1 Tu 1 . + CDS 311 - 1519 1145 ## COG1373 Predicted ATPase (AAA+ superfamily) + Term 1532 - 1568 0.4 - Term 1512 - 1563 9.6 2 2 Op 1 1/1.000 - CDS 1567 - 4968 4101 ## COG0587 DNA polymerase III, alpha subunit 3 2 Op 2 . - CDS 5000 - 5482 486 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 4 2 Op 3 . - CDS 5506 - 7290 2132 ## FN1385 hypothetical protein 5 2 Op 4 1/1.000 - CDS 7300 - 9978 2657 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family 6 2 Op 5 . 
- CDS 10002 - 10715 657 ## COG2220 Predicted Zn-dependent hydrolases of the beta-lactamase fold - Prom 10745 - 10804 7.8 Predicted protein(s) >gi|296155275|gb|ADVK01000011.1| GENE 1 311 - 1519 1145 402 aa, chain + ## HITS:1 COG:FN1382 KEGG:ns NR:ns ## COG: FN1382 COG1373 # Protein_GI_number: 19704717 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Fusobacterium nucleatum # 1 402 1 402 402 696 99.0 0 MTKRELYIEKIKPFIDKDIIKVLTGIRRSGKSVMLKLIMEELKQNGIDEKQFININFENL INRELTTADKLHKYILKRASEIKNKCYIFLDEIQEVKDWEKYINSLRVNEEYEFDIYITG SNAKLLSGELSTYLAGRYVEFVIYPFSFKEFLDTLKPIQSNVSTKEAFQKYIKFGGMPFL YNLAFEEEASLQYLKDIYSSIILKDITQKNKIRDTDLLEKVIDYLIMNIGNNFSATSISK FFKSENRKVSVETILNYIKATEEAFLIYKVSRDDLIGKKILNINEKYYIADHGIREAILE SNQRDINQIFENIIYLELLRKGYNIRVGKVDNLEVDFVCTKRNEKIYIQVAYLLASPETI EREFSSLEKINDNYPKYVISMDEFDMSRNGIRHINIIAFLLN >gi|296155275|gb|ADVK01000011.1| GENE 2 1567 - 4968 4101 1133 aa, chain - ## HITS:1 COG:FN1383 KEGG:ns NR:ns ## COG: FN1383 COG0587 # Protein_GI_number: 19704718 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, alpha subunit # Organism: Fusobacterium nucleatum # 1 1133 1 1133 1133 2072 98.0 0 MENNFVHLNLHTEYSLSEGVNSIDSFLVKAKELGMTSLAVTDYANMFCAIEFYQKAKKMG IKPIIGLELPITNRDEQNIFSLTLLAKNYNGYKNLVKLASELYKKNENRELKLNKEILKE HSQDLIALSSSMNGEIGKAILTNSSDEKVNKIIDEYIEIFSKENFYLEIQANELSETKII NDKFYDLAKFHDLELVATNNVYYVDRDGYELHDIIICIQSGLKVKEKNRKRAISKELYLK SKDEMKRFLGEKFEKAIENANYIASLCNIEITFGNLQFPYYEVPSEYSGMDEYLKTICHT NIKKLYKEDLTKDILDRLEYELSVIIKMGYSGYFIVVWDFISYAKRRGIPVGPGRGSAAG SLVAYCLGITMIDPIRYNLLFERFLNPERISMPDIDIDICRERRDELIDYVVHKYGRDRV AHIITFGRMKARAAIRDIGRVLDIDLKKIDKLAKLISPFQTLEKTLKENVEVAKLYTTDI ELQKVIDLAIRIENTVRHVSTHAAGILITKEDLDRTVPIYLDEKEGVIATQYQMKELEEL GLLKIDFLGLKNLSNIQRTIDYIKKYKNIDVDLYKIPLDDKKVFQMLSLGDSTGVFQLES AGIRKIMKRLKPDKFEDIVALLALYRPGPLQSGMVDDFINRKNGKEKIEYPHKNLEIILK ETYGVILYQEQVMKIASYMADYSLGEADLLRRAMGKKNFAIMRENREKFIERAVHNGYTE EKSEEIFELIDKFAGYGFNKSHSVAYAMISYWTAYFKVHYPAYYYAAVMTSEISETGDIA 
YYFNDAKEHEVRVYSPNINSPSAYFEVKNDGITYSLAAIKNFGLTMAKKIVEDVKLNGKY TTLEEFVFRNKKNGMNKRALEALILSGALDEIKGNRKEKFLSIDKVLDYSSKAPKTDEIQ QMNLFGEAAKTIDKFNLAISEDFTLDEKLNKEKEFLGFYLSSHPLDKYKDILTTFSIKKL SEFDLEGNQVIKTFGTIINLKKIITKKEEQMAMFNLSCYDRTLSCIAFPRVYERFISELI EKKTVYIEGKIQIDNYRGESRSKLLVDKLVELDKIYEYPAKKLFILIEPEDSYRYSRLKD LINFNKGKTQIIFAIKNKNEKKLQTMNKGIKLSKEFFESLVELMGIDKIKIEM >gi|296155275|gb|ADVK01000011.1| GENE 3 5000 - 5482 486 160 aa, chain - ## HITS:1 COG:FN1384 KEGG:ns NR:ns ## COG: FN1384 COG0454 # Protein_GI_number: 19704719 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 39 160 1 122 122 231 97.0 4e-61 MLAKFIDTMRRIAVNIKEFKGNKKEFLLLLLLADEKEEMIDKYIEKGIMYLLDDNGIKGE CVVTDEGDGILEIKNIVIEPDCQRKGYGKALIDFIVKKYRGQYSILQVGTGDSPMIISFY EKCGFVRSHSIKNFFIDNYDKPIFECSVQLVDMIYLQKKL >gi|296155275|gb|ADVK01000011.1| GENE 4 5506 - 7290 2132 594 aa, chain - ## HITS:1 COG:no KEGG:FN1385 NR:ns ## KEGG: FN1385 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 17 594 1 578 578 940 99.0 0 MNELEYILSKKYDQEILLKLFQKYFVNWIADGYIGKELNIFEISTIGEKTDKEVLLKLFV EFYGNEENFKKIFETLSEEVKEIFKVVVWEEKFPIKKEDLKKYLDTYTDNFEKEVFIPKN EYLFFDLEEFDKDMNTAFSIKYDIARYIRNFIDNKPKDYYLHSDNSSSNLAFKLYRDNNE NEFINNMNFYLDFYNSGENPISSSGKILKDFKRNMQKHCGITEYYNDVKGLEFLKTETLC LIFTLLEKKYRISSYFNNKNIKNIIDDFMTTETFDKEESYNYTNLFLNFLKGTRNIWENP EKISEAVKSLLGLLKEMQKDDVVSIDNIVKAFIYRDKDVELIAFKDVKDYIYINEANGER AKILEYKQYEDYIIEPFVKSYIFLLGIFGVFEIFYEKPFFKKGLYLKNNYLSKYDGLKYI KLTNLGRYILGHTDKYKLPKIYEKAEVQIDDKRQFVTVVGEAPAKMMFFEKIGTKVKENM FKLTYDSFIKGIKNYDELIERIERFKENIDNKELSQNWEEFFENLEKKFNSVRIEDDYTI LKLENNKELIQTVIKDSRFKNLSLKAEEYHLLVKKENLKEVIKIFSEYGYYIVE >gi|296155275|gb|ADVK01000011.1| GENE 5 7300 - 9978 2657 892 aa, chain - ## HITS:1 COG:FN1386 KEGG:ns NR:ns ## COG: FN1386 COG0553 # Protein_GI_number: 19704721 # Func_class: K Transcription; L Replication, recombination and repair # Function: 
Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 1 892 1 892 892 1478 99.0 0 MLVEGEDNLYLALYDSEKNLISSYSNLNQNDINNYIENLENEKEFFISWEEKESSDYLKL DKTLLEYLLEKDNFVNSNFETIAKKEIENLSLLIRDNNEIEDRLDIYVEINDNLLIKNNI VGNYIYSQGIFYKIDIEEDSQFPLIDLFQKIDKYELESYCTLILKNYKNIDLKYEDYETI ISDEKTAIPQIIIEKIAFDNSLYLKINSIISTMDYEFFTKNQIEAVLTVNELEKKLEISK INLENLSSDMFEIVKVLTKLQKSIGLKSSYYIDNENFIILNEELAKEFVKKELLQLTGKY SIIGTDRLRKYNIKAVKPKISGKFSYNLDYFEGEVEVEIEGEKFSIQQLLNNYKKDEYIV LSDGTNALINREYIEKLQRVFKEEDGNKIKVSFFDMPIVQDLIDEKSIENDFMGSKDFFE GINKLAEENIDYPKLKATLRDYQKYGYKWLKYLTDNNLGACLADDMGLGKTLQAITLISK MHEEKKKKSMVIMPKSLIYNWENEIKRFSPKLKVGVYYGINRDFSSLKKVDVILTTYGTI RNDIESLLKQKIDLLVLDESQNIKNINSQTTKAVLLLNAKKRVALSGTPIENNLLELYSL FRFLNPEMFDSVQKFTNDYIVPIQKYSDTSTIEELRKKIYPFLLRRIKKEVLADLPDKIE KLVYVDMNDEHRRFYEERRKYYYSLLEKNTSSQGNFDKFFVLQAINELRHIVSSPELESK KIISSKKEVLIENVIEAIENNHKVLVFVNYLSSIESICDSLKENKIKYLKMTGQTKDRQN LVDKFQNDSRYKVFVMTLKTGGVGLNLVSADTIFIYDPWWNTTVENQAIDRAYRLGQDKT VFAYKMIMRNTIEEKILKLQEIKNKLLDDLISEDNLSTKNLSKSDIEFILGS >gi|296155275|gb|ADVK01000011.1| GENE 6 10002 - 10715 657 237 aa, chain - ## HITS:1 COG:FN1387 KEGG:ns NR:ns ## COG: FN1387 COG2220 # Protein_GI_number: 19704722 # Func_class: R General function prediction only # Function: Predicted Zn-dependent hydrolases of the beta-lactamase fold # Organism: Fusobacterium nucleatum # 1 237 1 237 237 406 99.0 1e-113 MVYYIYHSAFVIEVEKSILIFDFYKFPSNKKKEKEDFFNRFIKRIDKKVYIFSTHSHSDH FNKEILTWLEMNENIKYILSDDIRIYKHKNFYFTKEGDSFELDNLKISTFGSTDLGSSFY VNTENKNIFHSGDLHFWHWEDDTPEEEKTMYDAYMVQLEKIKKLDRIDIAFVPVDPRLGV NTLEGVELFYKILKPKIIIPMHFSDDYSQMKNFIEKFKYNDDVKIIEIKDSMEKVLE Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:16:24 2011 Seq name: gi|296155254|gb|ADVK01000012.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00013, whole genome shotgun sequence Length of sequence - 19688 bp Number of predicted genes - 22, with homology - 20 Number of transcription units - 8, operones - 5 average op.length - 3.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 1 - 145 90 ## COG1381 Recombinational DNA repair protein (RecF pathway) 2 1 Op 2 . - CDS 160 - 738 605 ## FN1493 hypothetical protein 3 1 Op 3 . - CDS 738 - 1601 986 ## COG1792 Cell shape-determining protein - Prom 1661 - 1720 8.7 4 2 Op 1 . - CDS 1765 - 2940 1114 ## COG0477 Permeases of the major facilitator superfamily - Prom 2979 - 3038 4.5 5 2 Op 2 . - CDS 3086 - 3199 149 ## - Prom 3330 - 3389 9.5 + Prom 3075 - 3134 11.6 6 3 Tu 1 . + CDS 3364 - 4263 979 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily + Term 4275 - 4330 5.2 + Prom 4357 - 4416 9.5 7 4 Tu 1 . + CDS 4448 - 5926 2331 ## COG5295 Autotransporter adhesin + Term 5939 - 5988 5.5 - Term 5927 - 5976 1.7 8 5 Op 1 17/0.000 - CDS 5983 - 6714 376 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 9 5 Op 2 44/0.000 - CDS 6715 - 7497 256 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 10 5 Op 3 49/0.000 - CDS 7494 - 8297 1063 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 11 5 Op 4 38/0.000 - CDS 8290 - 9219 737 ## COG0601 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 12 5 Op 5 . - CDS 9234 - 10838 2546 ## COG0747 ABC-type dipeptide transport system, periplasmic component - Prom 10878 - 10937 10.9 13 6 Tu 1 . - CDS 11013 - 11201 351 ## + Prom 10961 - 11020 11.3 14 7 Op 1 6/0.000 + CDS 11245 - 11706 809 ## COG0054 Riboflavin synthase beta-chain 15 7 Op 2 16/0.000 + CDS 11708 - 12817 1383 ## COG1985 Pyrimidine reductase, riboflavin biosynthesis + Prom 12825 - 12884 3.4 16 7 Op 3 15/0.000 + CDS 12910 - 13665 943 ## COG0307 Riboflavin synthase alpha chain 17 7 Op 4 . 
+ CDS 13675 - 14874 1719 ## COG0108 3,4-dihydroxy-2-butanone 4-phosphate synthase + Term 14900 - 14947 12.0 + Prom 14935 - 14994 9.6 18 8 Op 1 1/1.000 + CDS 15030 - 16106 1333 ## COG2849 Uncharacterized protein conserved in bacteria 19 8 Op 2 1/1.000 + CDS 16078 - 16293 191 ## COG2849 Uncharacterized protein conserved in bacteria 20 8 Op 3 1/1.000 + CDS 16318 - 17958 1564 ## COG2849 Uncharacterized protein conserved in bacteria 21 8 Op 4 1/1.000 + CDS 17976 - 19490 1332 ## COG2849 Uncharacterized protein conserved in bacteria 22 8 Op 5 . + CDS 19378 - 19632 240 ## COG2849 Uncharacterized protein conserved in bacteria + Term 19640 - 19679 2.1 Predicted protein(s) >gi|296155254|gb|ADVK01000012.1| GENE 1 1 - 145 90 48 aa, chain - ## HITS:1 COG:FN1492 KEGG:ns NR:ns ## COG: FN1492 COG1381 # Protein_GI_number: 19704824 # Func_class: L Replication, recombination and repair # Function: Recombinational DNA repair protein (RecF pathway) # Organism: Fusobacterium nucleatum # 1 48 1 48 233 84 100.0 4e-17 MIFLRGKGIIIAKKDIEEADRYITIFMEDYGKVSTVIKGIRKSKKRDK >gi|296155254|gb|ADVK01000012.1| GENE 2 160 - 738 605 192 aa, chain - ## HITS:1 COG:no KEGG:FN1493 NR:ns ## KEGG: FN1493 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 192 1 192 192 288 99.0 8e-77 MKKFIILLFILVQGLIFSATKTLSDIKTVKFDVVEKTTVKSKKKEISYKIDFELPNKIKK EVTAPELNKGEIYLYDYTENKKVVYLPLFNEVKENKIVDDENRIIKAINKIIEEEKKNKN FSQNYYAKKPQSLNIDEQVSINILSYIEIDGYVFPEVVEIKDGGTKVGDVKISNLKINPI LDNKTFTEIPKK >gi|296155254|gb|ADVK01000012.1| GENE 3 738 - 1601 986 287 aa, chain - ## HITS:1 COG:FN1496 KEGG:ns NR:ns ## COG: FN1496 COG1792 # Protein_GI_number: 19704828 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell shape-determining protein # Organism: Fusobacterium nucleatum # 78 287 1 210 210 359 99.0 4e-99 MKKESKIKILLPILAVIIVTVLIFNRLLFKLKAQIDKAVLPIQSKVYNVANRAIGIKDII FSYENFITENENLKKENMELKIQKVRNEKIYEENERLLKLLEMKENNIYKGDLKFARVSF 
SDINNLNNKIFIDLGTEDNIKIDMITVYGDYLVGKIVAVHDNYSEVELITNPNCIISAKT MGNVLGIARGSDEEDGLLYFQPSIVEDNLKEGDEIITSGISDIYPEGIKIGKIEQIDEKE NYGYKRVTLKPGFESKDLRELIVISRENAVNRPIVKEEVKELKGDEK >gi|296155254|gb|ADVK01000012.1| GENE 4 1765 - 2940 1114 391 aa, chain - ## HITS:1 COG:FN1497 KEGG:ns NR:ns ## COG: FN1497 COG0477 # Protein_GI_number: 19704829 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Fusobacterium nucleatum # 18 391 1 374 374 559 100.0 1e-159 MKKSIIGLLWGESTLKVMAILYDSVITAFLLQLGLKNTQIGLLWSVVLLTQMLFDYPTGS FADRYGRLKIFTIGMVLTGSAIVMIAYSVSISMLYISAILMGIGESQISGTLFPWFVNSL DKVDNLQEKEEYILKSNGQVQYSTNIIGILTGFVISFLNLDYKFILILAGTFQAINGIFI YFSFQDNKSTETNLIKIGKKSFQIFLKDYKLWIYTLTMTIHYSFYSVHLFIWQPRANLLG VIGSKLTGINSVYLSCLVISGLIIKYKKEIKNYLYVLYVILIPISLIIIYQSQNLILYIL GTILLGISNGMVAPQIMSTVHYFIPDEVRSSVISLLSSLSSIFLIFLQVIIGKILDIKGN YYLEILCVLFGIIYIVCIILILKWLKENRNR >gi|296155254|gb|ADVK01000012.1| GENE 5 3086 - 3199 149 37 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MDYTTGREFHPAPKIIFNIYFSHINILFFCEFVNNYF >gi|296155254|gb|ADVK01000012.1| GENE 6 3364 - 4263 979 299 aa, chain + ## HITS:1 COG:FN1498 KEGG:ns NR:ns ## COG: FN1498 COG0697 # Protein_GI_number: 19704830 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 285 1 285 299 466 98.0 1e-131 MDNHIKGALLVCLAATMWGFDGIALTPRLFSLHVPFVVFILHLLPLILMSILFGKEEVKN IKKLQKNDLFFFFCVALFGGCLGTLCIVKALFLVNFKHLTVVTLLQKLQPIFAIILARLL LKEKLKRAYLFWGFLALLGGYLLTFEFHLPEFVSSDNLLPASLYSLLAAFSFGSATVFGK RILKSASFRTALYLRYLMTSCIMFVIVTFTSGFGDFLVATAGNWLIFVIIALTTGSGAIL LYYFGLRYITAKVATMCELCFPISSVVFDYLINGNVLSPVQIASAILMIISIIKISKLN >gi|296155254|gb|ADVK01000012.1| GENE 7 4448 - 5926 2331 492 aa, chain + ## HITS:1 
COG:FN1499 KEGG:ns NR:ns ## COG: FN1499 COG5295 # Protein_GI_number: 19704831 # Func_class: U Intracellular trafficking, secretion, and vesicular transport; W Extracellular structures # Function: Autotransporter adhesin # Organism: Fusobacterium nucleatum # 1 492 1 479 479 717 91.0 0 MKKFVSLKLIVFSFILVAGSVSFSAPVFQAGTGTDSTVAGVNNEANGEKSSAFGYENKAK EKLSSAFGYKNIANGIEGSAFGISNLAKGQYSSAFGFRNVANKRHSSAFGSGNEANGEQS SAFGFKNTVSGFNSSAFGSQYEVTGNFSGAFGMGEFNGQYQYKNEGNNSYMIGNKNKIAS GSNDNFILGNNVHIGGGINNSVALGNNSTVSASNTVSVGSSTLKRKIVNVGDGAISANSS DAVTGRQLYSGNGIDTAAWQNKLNVTRKNDYKDANDIDVNKWKAKLGVGSGGGGGAPVDA YTKSEADNKFANKTDLNDYTKKDDYKDANGIDVDKWKAKLGTGAGTADIENLRNEVNEKI DDVKDEVRTVGSLSAALAGLHPMQYDPKAPVQVMAALGHYRDKQSVAVGASYYFNDRFMM STGIALSGEKKTKTMANVGFTLKLGKGSGVTYDETPQYVVQNEVKRLTVENQELKERVRN LEEKLNMLLKNK >gi|296155254|gb|ADVK01000012.1| GENE 8 5983 - 6714 376 243 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 11 217 30 242 329 149 36 1e-35 MKAVELINITKKYGEQEVLNSFSLDIEKGKCLAIMGESGSGKSTIAKIIIGLEKQNSGTV KIFGKERDIKTTFKDIEFLFQDSYNALNPSMTVEDLIYEPLQFLVNTDKSRKKEIVLELL EQVELSSELLTRKRDELSGGQLQRVCLARALSTKPQIIIFDESLSGLDPLVQDKILDLLY KIQKQYQLTYIFISHDFRLCYFLADRIILIDSGNITEDFKELDKEIIPKTEIGKILLKDI SKL >gi|296155254|gb|ADVK01000012.1| GENE 9 6715 - 7497 256 260 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 12 228 31 242 329 103 31 1e-21 MNILEIKNLSLKISDEKILKNINFELKEKEIISIIGKSGSGKTMLSKMIMGLKNKNMQIE GEILFKDNNIFDFSEEDLRKYRGEGIGYITQNPLNVFLPFQKIKTTFLETYLSHKNISKS EVIELAKKNLKQVNLENAEEILNKYPFELSGGMLQRVMVAIIIGLDAKIIIADEVTSALD SYNRYEMIKIFKELNKMGKSIILITHDYYLMKSISERCLVMENGEVIEKFNPKLKAELIK ENSKFGAKLLETTIYKRKGS >gi|296155254|gb|ADVK01000012.1| GENE 10 7494 - 8297 1063 267 aa, chain - ## HITS:1 COG:FN1502 KEGG:ns NR:ns ## COG: FN1502 COG1173 # Protein_GI_number: 19704834 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # 
Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 267 5 271 271 444 98.0 1e-125 MAKNIKFYFAIFLLFFWIVLAIVAPMVAPYDPQYVDLSLKLLSPNKTYLLGTDALGRDIL SRIIYGARLSISISLSIQIILLLISVPIGLFIGWRQGKEESFFDWLTMIFSTFPSFLLAM VLVGMLGARISNMIISVVAVEWIYYARILKNSVISQKQNEYVKYAILKGMPTRYILKKHI FPFVYGPILTASLMNIGNIILMISSFSFLGIGVQPNISEWGNMIHDSRTFFRNHPNLMLY PGIMILLAVGSFRFIASQIEEKFRGIK >gi|296155254|gb|ADVK01000012.1| GENE 11 8290 - 9219 737 309 aa, chain - ## HITS:1 COG:FN1503 KEGG:ns NR:ns ## COG: FN1503 COG0601 # Protein_GI_number: 19704835 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 309 4 312 312 531 98.0 1e-151 MKKKIFDIISALLVISILAFIFIQLSPGDPAENYLRASHLPITDELLKQKREELGLNSPL IIQYLKWLKNVLLGNFGYSFLRKEPAIYLTFKSLYATFQLTIFSTFLIILISLPIGILSA IKTGTWIDKIVISTTTIFVSMPVFWLGFSLILLFSVKLNWLPVSGRGGFLNFVLPSITLS VPFIGQYIEFIKKSILENIQNNLLENAVLRGLKKRYIISNYLLKGAWIPILSGFSFTFVS ILTGSILVEEIFSWPGIGFLFTKAIQAGDVPLIQACIMVFGMLFIIATHFMNDILKYLDP RIKGGKNNG >gi|296155254|gb|ADVK01000012.1| GENE 12 9234 - 10838 2546 534 aa, chain - ## HITS:1 COG:FN1504 KEGG:ns NR:ns ## COG: FN1504 COG0747 # Protein_GI_number: 19704836 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 13 534 1 522 522 1019 98.0 0 MKKLIKLFLMLTLSLLFLVACGEKGKEETKSEDTQKKTLTISWNQDVGFLNPHAYLPDQF ITQGMVYEGLVNYGENGEILPSLAESWEISEDGKTYTFHLRKGVKFSDGSDFNANNVKKN FDSIFLNKERHSWFGLTGHISSYRAVDKNTFEFVLDEAYTPTLYDLAMIRPIRFLADAGF PDDGDTYKGIKASIGTGPWILKEHKKDEYAIFEKNPNYWGEKPILDEVVIKIIPDAETRA LQFEAGELDMIYGNGLISYDTFKSYQEDSKYKTAISEPMSTRLLMFNTTTGPLSDINLRY ALTYATDKKAISDGILNGIEKPADTIFAPNMPHSKQDLKPFEYNLDKAKEYIEKAGYKMG KEFYEKDGKVLTLVFPYIATKTLDKQIAEYIQGQWKKIGVNVEIKALEEKNFWEETDDLK 
YNVMLNYSWGAPWDPHAYINAMATVAENGNPDYEAQLGLPMKKELDAKIHQVLLEANPEK VEQLYKEILTTLHEQAVYVPLTYQSLIAVYKDNLTGVRFMPQEYELPLSYIDKK >gi|296155254|gb|ADVK01000012.1| GENE 13 11013 - 11201 351 62 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLKQSLSNMSIIKKSNSSPIQTLLSVLEFHQIKANAFVDCTTGREFHPALKTLFYIFDYL HH >gi|296155254|gb|ADVK01000012.1| GENE 14 11245 - 11706 809 153 aa, chain + ## HITS:1 COG:FN1505 KEGG:ns NR:ns ## COG: FN1505 COG0054 # Protein_GI_number: 19704837 # Func_class: H Coenzyme transport and metabolism # Function: Riboflavin synthase beta-chain # Organism: Fusobacterium nucleatum # 1 153 5 157 157 283 100.0 1e-76 MKVFEGKFNGKGTKIAIVAARFNEFITSKLIGGAEDILKRHEVQDDDINLFWVPGAFEIP LIAKKLAQSKKYDAVITLGAVIKGSTPHFDYVCAEVSKGVAHVSLESEVPVIFGVLTTNS IEEAIERAGTKAGNKGADAAMTAIEMINLIKGI >gi|296155254|gb|ADVK01000012.1| GENE 15 11708 - 12817 1383 369 aa, chain + ## HITS:1 COG:FN1506_2 KEGG:ns NR:ns ## COG: FN1506_2 COG1985 # Protein_GI_number: 19704838 # Func_class: H Coenzyme transport and metabolism # Function: Pyrimidine reductase, riboflavin biosynthesis # Organism: Fusobacterium nucleatum # 147 369 1 223 223 422 100.0 1e-118 MDKNSDEKYMARAIELAKRGTGGVNPNPLVGAVIVKDGKIIGEGWHKKFGGPHAEVWALN EAGENAKGATVYVTLEPCSHQGKTPPCAKRIIEAGIKRCVVACIDPNPLVAGKGMKIIEN AGIEVELGVLEKEAKEINKIFFKYIENKIPYLFLKCGITLDGKIATRNGKSKWITNEIAR EKVQFLRTKFMAIMVGINTVLKDNPSLDSRLDEKKFGIEKRNPFRVVIDPNLESPIEGKF LNFNDGKAIIVTSNDNKGLEKLEKYKNLGTIFIFLEGKIFKIQDILKELGKLGIDSVLLE GGSGLISTAFKENIVDAGEIFIAPKIIGDSSAIPFINGFNFDSMEDVFKLPNPKFNIYGD NISIEFENL >gi|296155254|gb|ADVK01000012.1| GENE 16 12910 - 13665 943 251 aa, chain + ## HITS:1 COG:FN1507 KEGG:ns NR:ns ## COG: FN1507 COG0307 # Protein_GI_number: 19704839 # Func_class: H Coenzyme transport and metabolism # Function: Riboflavin synthase alpha chain # Organism: Fusobacterium nucleatum # 1 251 1 251 251 462 99.0 1e-130 MSFPDLQRILDFLSLRNLLSNELFLIISEVRWYMFTGLVEEKGSVISLNNGDKSIKLKIK ANKVLENVKLGDSIATNGVCLTVTEFSKDYFVADCMFETISRSNLKRLKAGDEVNLEKSI 
TLATPLGGHLVTGDVDCEGEIVSITQEGVAKIYEIKISRKYMRYIVEKGRATIDGASLTV ISLTDDTFSVSLIPHTQEKIILGSKKVGDIVNIETDLVGKYIERFVYFDKLEQKENKKSK ITREFLLENGF >gi|296155254|gb|ADVK01000012.1| GENE 17 13675 - 14874 1719 399 aa, chain + ## HITS:1 COG:FN1508_1 KEGG:ns NR:ns ## COG: FN1508_1 COG0108 # Protein_GI_number: 19704840 # Func_class: H Coenzyme transport and metabolism # Function: 3,4-dihydroxy-2-butanone 4-phosphate synthase # Organism: Fusobacterium nucleatum # 1 203 1 203 203 392 100.0 1e-109 MIYKIEDVLEDIKNGIPLIIVDDENRENEGDLFVAAEKATYESINLMATFARGLTCTPMT SEYAIRLGLDPMTARNTDAKCTAFTVSVDAKEGTTTGISIADRLTTIKKLANKNSVPSDF TKPGHIFPLIAKDKGVLEREGHTEATVDLCKICGLTPVSVICEILKDDGTMARMDDLEIF AKEHNLKIITIADLIKYRKKTEQLMKIDVVANMPTDSGTFKIVGFDNPIDGKEHIALVKG DVKGKEAVTVRIHSECFTGDILGSLRCDCGSQLKTAMRRIDKLGEGIILYLRQEGRGIGL LNKLRAYNLQEEGMDTLDANLHLGFGADMRDYAVAAQMLKALGVKSIKLLTNNPLKINGL EEYGIPVVKREEIEIEANKINKVYLKTKKERMGHLLKIK >gi|296155254|gb|ADVK01000012.1| GENE 18 15030 - 16106 1333 358 aa, chain + ## HITS:1 COG:FN1512 KEGG:ns NR:ns ## COG: FN1512 COG2849 # Protein_GI_number: 19704844 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 358 39 396 396 609 95.0 1e-174 MVPMRKIFIILILMLSIFSIVNAHPFKTEKELYDYYAEIDKKINEELKNNPEKILKDRKN SLKPLYLDVFGADKVLGDNSYLFGFDKNGKIMSVMKRAVLDGPSMIARIYYPNGNLKEVY LDDDDFVTGIVRTYYESEKKHEEIPYYKGKKEGLRKIYFENGNLSNEVYYVDDLREGKTI DYYNDGKVFRLKNYKDNIGNGEFTEYYRNGQIKVKGNYKGGLREGEFKFYSESNKYLGSV FYKNKEIIKNTLSKEDMKDLSASFEFADMALFLRSVTRDIVGVTTDIYPNGKPKLYMPYS VNGELHGKYLEFYESGKILSETTYENGLRQGKSIIYLENGKIIGETNYIDGKKEGKSF >gi|296155254|gb|ADVK01000012.1| GENE 19 16078 - 16293 191 71 aa, chain + ## HITS:1 COG:FN1514 KEGG:ns NR:ns ## COG: FN1514 COG2849 # Protein_GI_number: 19704846 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 10 71 442 503 503 73 70.0 1e-13 MVKKKVKVSKLLQKRSFINGKAEGEFIEYYENGVIKEKAYFINNKLEKEHLFYDKNGNLT KTEIYKNGIKQ 
>gi|296155254|gb|ADVK01000012.1| GENE 20 16318 - 17958 1564 546 aa, chain + ## HITS:1 COG:FN1514 KEGG:ns NR:ns ## COG: FN1514 COG2849 # Protein_GI_number: 19704846 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 44 546 1 503 503 805 99.0 0 MKKIFIILFLMLSIFTIINANPFKTEKELNNFFTEIDKKIKEELKKNYREEYSKRKISPN NEYSFEIEDDGTSFFTRNIDIKPKTEITQYFNKKGELYMISSLTSETDKELYALYRKYDK NGNLFIYTYAIDGKNTDKGYYSDGKLAYILELKILKGQAPIPNGKYIEYYKNGQIKVQGN HKEGKRDGEFRAFLRNGKSAGSIFYKNGKIIKSTLVNSMKDNASFSILTDINYNLNSHEI ITDEFPNKLLKQYFIFNKNGLLDGESRQYYEEGDIKSISPFKNNVADGVFISYYQNGNIK EKHIYKNGNKEGEGIFYYENGKLEEKYFMKNGKLDGEAINYFEDGKIRNKSIFKDGVLLE EEVYQDNEIIKNTFKNEEIVQQDIYSKNKVLKAKKFFLENGKMKIISYYENGNKEEEVFV INELLDGEALVYYPSGKLKEKDFFKNGKREGEAIIYYENENVKQKSLFKNDKREGDLFMY YPSGKLCQTEKFINGKEDGEVIEYYESGVIKEKAYFINGKQEKEHFFYDEKGNLIKTDIY KNGVKQ >gi|296155254|gb|ADVK01000012.1| GENE 21 17976 - 19490 1332 504 aa, chain + ## HITS:1 COG:FN1515 KEGG:ns NR:ns ## COG: FN1515 COG2849 # Protein_GI_number: 19704847 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 502 1 502 555 773 95.0 0 MRKNFFILIFLFFIFSILNANPLKNETELQKFRNKVDKVIKEELKNDYKKEYLKRKDNLK KIENNGAIGFEDEDFIFQFEDNALTLASKKLKSIPNTAITQEFDKTGNFFRIFSITSIDE NFLLYRYFDKNSNLVIDVYGTNGKVIQKGYYNNKQLAYIIEGNILKNLDSIPNGKYTEYY KNGQIKIQGHTKEGKRDGEFKTFLKNGKSAGSVIYKDGKIIKSTLIKTMKDNASFSLITD INYILDTSHTIKKVDFENGLLRTYFIFNKNGLLDGNSIEYYEEGNIESIVPYKNNVVEGL VITYYENGNIKEEVNYKNDNMNGEAKSYDENGKLNGRTIFKDDIKLEEEVHKENEILKNT FKNGEVVKQDICSPNGTLKERRVLNGNEMEYSTFYPNGNVKQKIFAKDKIIIKEQLYARN GNIMSNSFFSNGKPVTEVFEYYPDGKIHKKISSSNNMLDGDYLEYYPNGKLKNKAFFKND KQEGEYTAYYESSAIMQKVPYKKW >gi|296155254|gb|ADVK01000012.1| GENE 22 19378 - 19632 240 84 aa, chain + ## HITS:1 COG:FN1515 KEGG:ns NR:ns ## COG: FN1515 COG2849 # Protein_GI_number: 19704847 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum 
# 22 84 495 555 555 84 82.0 5e-17 MENLKIKLFLRMINKKGNILPIMRVVPSCKKFLIKNGEAIAYYENGNIEQKAYFINGKQE KEHLYYDEKGNLTKTEIYKNGIKQ Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:16:49 2011 Seq name: gi|296155219|gb|ADVK01000013.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00014, whole genome shotgun sequence Length of sequence - 32062 bp Number of predicted genes - 37, with homology - 32 Number of transcription units - 17, operones - 9 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 19 - 561 841 ## FN0212 hypothetical protein 2 1 Op 2 1/1.000 - CDS 588 - 1094 571 ## COG1778 Low specificity phosphatase (HAD superfamily) 3 1 Op 3 1/1.000 - CDS 1104 - 1676 795 ## COG0817 Holliday junction resolvasome, endonuclease subunit 4 1 Op 4 1/1.000 - CDS 1676 - 2428 293 ## PROTEIN SUPPORTED gi|163764775|ref|ZP_02171829.1| ribosomal protein L16 - Prom 2464 - 2523 6.3 5 1 Op 5 . - CDS 2548 - 3300 239 ## PROTEIN SUPPORTED gi|163797523|ref|ZP_02191474.1| 50S ribosomal protein L9 - Prom 3331 - 3390 17.0 + Prom 3305 - 3364 11.1 6 2 Op 1 1/1.000 + CDS 3488 - 4141 396 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases 7 2 Op 2 . + CDS 4138 - 5007 1193 ## COG2071 Predicted glutamine amidotransferases 8 3 Op 1 9/0.000 - CDS 5030 - 5752 777 ## COG3279 Response regulator of the LytR/AlgR family 9 3 Op 2 . - CDS 5745 - 7421 1913 ## COG3275 Putative regulator of cell autolysis - Prom 7613 - 7672 11.7 + Prom 7577 - 7636 11.4 10 4 Tu 1 . + CDS 7695 - 9119 2257 ## COG1966 Carbon starvation protein, predicted membrane protein + Term 9156 - 9217 10.1 + Prom 9189 - 9248 7.9 11 5 Tu 1 . + CDS 9298 - 10644 1507 ## COG2211 Na+/melibiose symporter and related transporters 12 6 Tu 1 . - CDS 10661 - 10777 165 ## - Prom 10974 - 11033 9.5 + Prom 10655 - 10714 10.4 13 7 Tu 1 . 
+ CDS 10846 - 12024 940 ## COG0658 Predicted membrane metal-binding protein + Prom 12047 - 12106 8.2 14 8 Op 1 . + CDS 12180 - 12260 122 ## 15 8 Op 2 . + CDS 12244 - 12342 170 ## 16 8 Op 3 . + CDS 12386 - 14377 2953 ## COG0556 Helicase subunit of the DNA excision repair complex 17 8 Op 4 . + CDS 14414 - 15325 1166 ## Selsp_0070 hypothetical protein - Term 15324 - 15379 9.2 18 9 Op 1 . - CDS 15410 - 16306 741 ## FN0230 hypothetical protein 19 9 Op 2 . - CDS 16327 - 16596 400 ## BBR47_50570 hypothetical protein 20 9 Op 3 . - CDS 16589 - 16705 162 ## gi|262065947|ref|ZP_06025559.1| conserved hypothetical protein - Prom 16858 - 16917 4.2 21 10 Op 1 . - CDS 17007 - 17156 210 ## COG4859 Uncharacterized protein conserved in bacteria 22 10 Op 2 . - CDS 17174 - 18130 1036 ## FN0233 hypothetical protein 23 10 Op 3 . - CDS 18166 - 18642 518 ## FN0234 hypothetical protein - Prom 18668 - 18727 8.1 - Term 18713 - 18770 2.9 24 11 Op 1 17/0.000 - CDS 18784 - 19512 241 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 25 11 Op 2 21/0.000 - CDS 19522 - 20526 1530 ## COG0715 ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 26 11 Op 3 1/1.000 - CDS 20513 - 21109 704 ## COG0600 ABC-type nitrate/sulfonate/bicarbonate transport system, permease component - Term 21299 - 21337 2.1 27 12 Tu 1 . - CDS 21489 - 22421 1321 ## COG4874 Uncharacterized protein conserved in bacteria containing a pentein-type domain - Prom 22513 - 22572 13.6 + Prom 22385 - 22444 12.0 28 13 Tu 1 . 
+ CDS 22546 - 22695 234 ## + Term 22719 - 22761 3.2 + Prom 22865 - 22924 10.3 29 14 Op 1 16/0.000 + CDS 23039 - 23866 998 ## COG0207 Thymidylate synthase 30 14 Op 2 1/1.000 + CDS 23866 - 24363 579 ## COG0262 Dihydrofolate reductase 31 14 Op 3 1/1.000 + CDS 24371 - 25729 1656 ## COG0569 K+ transport systems, NAD-binding component 32 14 Op 4 1/1.000 + CDS 25742 - 27097 1797 ## COG0617 tRNA nucleotidyltransferase/poly(A) polymerase + Term 27098 - 27129 1.1 + Prom 27138 - 27197 8.2 33 15 Op 1 15/0.000 + CDS 27231 - 27428 443 ## COG2608 Copper chaperone 34 15 Op 2 . + CDS 27434 - 29743 3199 ## COG2217 Cation transport ATPase 35 15 Op 3 1/1.000 + CDS 29811 - 30641 1060 ## COG2849 Uncharacterized protein conserved in bacteria + Term 30653 - 30696 10.3 36 16 Tu 1 . + CDS 30705 - 31631 935 ## COG2849 Uncharacterized protein conserved in bacteria + Prom 31685 - 31744 12.0 37 17 Tu 1 . + CDS 31931 - 32060 196 ## Predicted protein(s) >gi|296155219|gb|ADVK01000013.1| GENE 1 19 - 561 841 180 aa, chain - ## HITS:1 COG:no KEGG:FN0212 NR:ns ## KEGG: FN0212 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 180 1 180 180 292 98.0 4e-78 MLNFEKINKMIDLIEESQIMEGLTFNEFAMEFYSEVKLVPLSRYLKTNNRVKRMPKIMNM RKAGELLLFTKTDDETLSFLKRKGYNEIPSLDYKTIMLLRKLDPIDNWKKVLAFFNGDKT VEEINLSTRPILFPQEIKKLEEYIKDELSLNDDEFEKFMSISSVAIKNKEVMKAIKKLSR >gi|296155219|gb|ADVK01000013.1| GENE 2 588 - 1094 571 168 aa, chain - ## HITS:1 COG:FN0213 KEGG:ns NR:ns ## COG: FN0213 COG1778 # Protein_GI_number: 19703558 # Func_class: R General function prediction only # Function: Low specificity phosphatase (HAD superfamily) # Organism: Fusobacterium nucleatum # 1 168 1 168 168 295 98.0 3e-80 MENIKILVLDVDGTLTNGKIYVNDKDNSFKAFNVKDGFALVNWLKLGGEVAILTGKKSNI VERRAEELGIKYVIQGSKNKTQDLKNLLDRLNITFENTAYMGDDLNDLGVMKNVGLTACP KDSVQEVLEISNFISSKNGGDGAVREFLEYIMKNNGMWKKILEKYSNE >gi|296155219|gb|ADVK01000013.1| GENE 3 1104 - 1676 795 190 aa, chain - ## HITS:1 COG:FN0214 KEGG:ns NR:ns ## COG: FN0214 
COG0817 # Protein_GI_number: 19703559 # Func_class: L Replication, recombination and repair # Function: Holliday junction resolvasome, endonuclease subunit # Organism: Fusobacterium nucleatum # 1 190 1 190 190 352 100.0 2e-97 MRVIGIDPGTAIVGYGIIDYDKNKYSIVDYGVVLTSKDLSTEERLEIVYDEIDKILKKYK PEFMAIEDLFYFKNNKTVISVAQARGVILLAGKQNNIAMTSYTPLQVKIGITGYGKAEKK QIQQMVQKFLGLSEIPKPDDAADALAICITHINSLGSKLSFGRTNNLNKIVVPSGTNKIS LEEYKNLLKK >gi|296155219|gb|ADVK01000013.1| GENE 4 1676 - 2428 293 250 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764775|ref|ZP_02171829.1| ribosomal protein L16 [Bacillus selenitireducens MLS10] # 22 250 11 232 236 117 33 9e-26 MPNILEMIDKFLNLKFAGELTVEVVCFRLLLAIILGGIVGYEREKNNRPAGFRTHILVCF GAAIVSMVQDQLRLNILDLARTEGTAVASVIKTDLGRLGAQVISGVGFLGAGSIMKEKGE TIGGLTTAAGIWATACVGLGIGWGFYNIAIVAILFMIIIMVSLKRLESKFVRKSRLLKFE VKFFDTDDFANGLIEAYEIFRQKSIKISEIDKYQDEGIVTFTVSIKGRNNISDVVVSLSS IKNVEYVKDV >gi|296155219|gb|ADVK01000013.1| GENE 5 2548 - 3300 239 250 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163797523|ref|ZP_02191474.1| 50S ribosomal protein L9 [alpha proteobacterium BAL199] # 3 243 7 250 259 96 28 2e-19 MKIFIVGGSSGIGLSLAKRYASLGNEVAICGTNEEKLKKIEECNKNIKIYKVDVRNKEEL KSAIDDFSKGNLDLIINSAGIYTNNRTTKLTDKEAYAMIDINLTGVLNTFEAVRDVMFKN NRGHIAIISSAAGLLDYPKASVYARTKMTIMGVCETYRSFFRNYNINITTIVPGYIATDK LKSLSEEDITKKPTVLSEEESTNIIIKAIEEKKEKIIYPLSMKILISVIRKLPKKLLTYI LMKQANWGKK >gi|296155219|gb|ADVK01000013.1| GENE 6 3488 - 4141 396 217 aa, chain + ## HITS:1 COG:FN0217 KEGG:ns NR:ns ## COG: FN0217 COG0664 # Protein_GI_number: 19703562 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Fusobacterium nucleatum # 1 217 1 217 217 345 99.0 3e-95 MISKEDIKHLEKIFPFWLDISQNDRAKIILSSRVLSLKENSIFFNSHELDGLLFLKSGKL RFFLSSLDARELPLYYLNNMEVEFFENFTDKTISTILDIAFIVEKNSEILLIPYSVLNLF RNKYSIMEKFLHNLTREKFSKSLLSLQNILLIPLKDRLLNFLYGLNRTEISLTHEEIAKN LGSSREVISRNLKVLEKKNFLKINRKKIIILDRSGVL 
>gi|296155219|gb|ADVK01000013.1| GENE 7 4138 - 5007 1193 289 aa, chain + ## HITS:1 COG:FN0218 KEGG:ns NR:ns ## COG: FN0218 COG2071 # Protein_GI_number: 19703563 # Func_class: R General function prediction only # Function: Predicted glutamine amidotransferases # Organism: Fusobacterium nucleatum # 1 289 1 289 289 537 100.0 1e-152 MKKPIIGISASMIYEEKDELFLGDKYSCVAYSYVDAVYKSWGIPVTLPILKDVSAIREQV KLLDGLILSGGRDVDPHFYGEEPLEKLEAIFPERDVHEMALIRAAIDLKKPILAICRGMQ ILNVTYGGTLYQDISYAPGEHIKHCQVGSSYQATHSINIDKNSILFKMADKSEIERVNSF HHQALKQVAKGLKVVATAPDGIIEAVERENEDEVFVIGVQFHPEMMFDKSIFARAIFKRF INICIDSRPAEVLLKDEVHHTEEHKEKNIDEKIKEIEDEEKKEFFKGDL >gi|296155219|gb|ADVK01000013.1| GENE 8 5030 - 5752 777 240 aa, chain - ## HITS:1 COG:FN0219 KEGG:ns NR:ns ## COG: FN0219 COG3279 # Protein_GI_number: 19703564 # Func_class: K Transcription; T Signal transduction mechanisms # Function: Response regulator of the LytR/AlgR family # Organism: Fusobacterium nucleatum # 1 240 1 240 240 406 100.0 1e-113 MINCIIVEDELPAREELKYFVNEEKEIKLIAEFDNPLDTLNFLENNTADVIFLDINMPDM NGISLGKIITKMYPDMKIVFITAYKDYAVDAFEIKAFDYLLKPYSESRIKSLLKSLVNIK TELTSSVKNTNLKKITVNIDERLYVISLNDIDYIEASEKETLIFSNQKKYVSKIKISKWE KMLKEDNFYRCHRSFIVNLDKITEIEQWFNSSWIIKIKNYATAIPVSRNNIKELKELFSV >gi|296155219|gb|ADVK01000013.1| GENE 9 5745 - 7421 1913 558 aa, chain - ## HITS:1 COG:FN0220 KEGG:ns NR:ns ## COG: FN0220 COG3275 # Protein_GI_number: 19703565 # Func_class: T Signal transduction mechanisms # Function: Putative regulator of cell autolysis # Organism: Fusobacterium nucleatum # 18 558 1 541 541 957 99.0 0 MNIQFISHLISNIGCSAMIAFFFIKIDRANIIIKSKAKTKKDIVALSFFFSLLSISGTYI GLNFNGAILNTRNVGVIAGGILGGPYVSIITGLVAGIHRAFVNLGRETAIPCAISTIIGG FLTAYVHRFIKNKDRIFFGFFLACTIENLSMGLILILLKDKILAQNIVTSFYIPMVLMNS IGASVLILIVEDIIQKSEIVAGNQAKLALEIANKTLPYFRETENLSEVCKIIAENLGAKA TVITDKKDIIAGFSFDKAEITRTAIKSNNTRKVLKTGEVMLVIKEDDEIIEDFLDISPHI KSCIILPLKEKNDVNGTLKIFFDTAEKITEKNRYLMIGLSHLISTQMEISKVENLISLLK YSELKALQSQINPHFLFNVLNTMTSLIRTNPEKAREVTIDLSNYLRYNLDNNVKSVELIK 
ELNQIDNYIKIEKARFGDKLNIVYDVDESLYNFQIPSLIIQPLVENSIKHGILKKRENGC VKVIVKKIDKDIEVIIEDDGVGIEQTVIDNLDKQIQENIGLKNVHQRLKLLYGEGLNIKK LEQGTRINFRILGGVKYD >gi|296155219|gb|ADVK01000013.1| GENE 10 7695 - 9119 2257 474 aa, chain + ## HITS:1 COG:FN0221 KEGG:ns NR:ns ## COG: FN0221 COG1966 # Protein_GI_number: 19703566 # Func_class: T Signal transduction mechanisms # Function: Carbon starvation protein, predicted membrane protein # Organism: Fusobacterium nucleatum # 1 474 1 474 474 829 100.0 0 MYSFIGSIIALVLGYFIYGKFVEGVFGIDTSRETPAKRLADGVDYMEMSWPKAFLIQFLN IAGTGPIFGAVAGALWGPAAFIWIVFGCIFAGSVHDFLIGMMSLRKDGASVSEIVGENLG MTAKQIMRIFSVILLLLVGVVFIMSPAQILTNITGVNYTVWLGVIIIYYLCATVLPIDTI IGKIYPIFGLSLLVMAFGIGGGLIINNANIPEIAFVNMNPAGRSVFPYLCITIACGAISG FHATQSPMMARCLKTEKEGRRVFYGAMISEGIIALIWAAAAMSFFGGIPQLAEAGTAAVV VNKISVGILGKAGGVLALLGVVACPITSGDTAFRSARLTIADSLKYKQGPVVNRFVVAIP LFVVGIVLCFTPFDVIWRYFGWANQTLATIALWAAVKYLANRGKNYWIALIPAMFMTVVV TSYILAAPEGFVRFFGDKDIKVIEHIAIAVGCVVSLGCTAAFFMSNKKTDLITE >gi|296155219|gb|ADVK01000013.1| GENE 11 9298 - 10644 1507 448 aa, chain + ## HITS:1 COG:FN0222 KEGG:ns NR:ns ## COG: FN0222 COG2211 # Protein_GI_number: 19703567 # Func_class: G Carbohydrate transport and metabolism # Function: Na+/melibiose symporter and related transporters # Organism: Fusobacterium nucleatum # 1 448 1 448 448 726 99.0 0 MKKLTTKVQVLYALGVSYAIVDQIFAQWILYFYLPSANSGLKPFMAPVLVSIALAVSRFV DMITDPLVGFMSDKYNSKYGRRIPFVAVGTIPLILVTIAFFYPPTSNERASFYYLMIVGS LFFTFYTIVGAPYNALIPEIGRTPEERLNLSTWQSVFRLSYTAVAIILPGILIKMIGGDN VLFGIRGMIIFLCVIVFIGLAVTVFTIRERDYSTGEVSNVSFKDTIGIIIKNKNFILYLF GMMFFFIGFNNLRAIMNYYVEDIMGYGKREITLVSAVLFGAAAICFYPTNKLSKKYGYRK IMLCCLAMLIVTTSMLFFLGKVFPVNFGFVLFGLIGIPLAGAAFIFPPAMLSEISTQISE DSGARIEGISFGIQGFFMKTSFLISIVILPIILVMGNDVSVISAIASRVSKVEKAGIYLA SLSSVFFFIISFIFYYKYSDSKKAIIKK >gi|296155219|gb|ADVK01000013.1| GENE 12 10661 - 10777 165 38 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSIIVERAFWSSRNTNGYQAVVYKNKIYYILILRKTVN >gi|296155219|gb|ADVK01000013.1| GENE 13 10846 - 12024 940 392 aa, chain + ## HITS:1 
COG:FN0223 KEGG:ns NR:ns ## COG: FN0223 COG0658 # Protein_GI_number: 19703568 # Func_class: R General function prediction only # Function: Predicted membrane metal-binding protein # Organism: Fusobacterium nucleatum # 15 392 1 378 378 548 96.0 1e-156 MKKLFLLMILFIVLMLRFSLSVRVTEIFQKEVYRMNLSLEDGKIKILKINNKYPLKNIYG KLGYKENGKYEGYFLVKSIKEYENIYFVELEDVKSTKIEDNFLEKYLQTLFNRAEKDYSY GAKNINRAILLGDNTRIRKDLKDKIRYIGLSHIFAMSGLHIALVIAIFYFIFKKTIKNKR LIEILLLTSITLYYFSVKESPSFTRAYIMAVVYLLGKLFYEKVDLGKTLFISAVVSIFIN PTVIFSISFQLSYGAMIAIAYIFPYVRKINYKKFKILDYILFIITIQIFLIPITVYYFNS VQFLSVISNLLLLSLASFYITVNYIALFLENFYLSFLLKPIIEILYKILIYLIDFFAELP YLSVEYENKNLIYIYIVFLVIIVVYKNIKQRD >gi|296155219|gb|ADVK01000013.1| GENE 14 12180 - 12260 122 26 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNAFILPLSKVEKTYNFGGDFYGHLC >gi|296155219|gb|ADVK01000013.1| GENE 15 12244 - 12342 170 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MVIYVETINFNGGAILLLVLIVLLFWWKHRRK >gi|296155219|gb|ADVK01000013.1| GENE 16 12386 - 14377 2953 663 aa, chain + ## HITS:1 COG:FN0224 KEGG:ns NR:ns ## COG: FN0224 COG0556 # Protein_GI_number: 19703569 # Func_class: L Replication, recombination and repair # Function: Helicase subunit of the DNA excision repair complex # Organism: Fusobacterium nucleatum # 1 653 1 653 663 1167 98.0 0 MENNLFKIHSDYKPTGDQPTAIDSIVKNIENGVKDQVLLGVTGSGKTFTIANVIERVQRP SLIIAPNKTLAAQLYSEYKKFFPENAVEYFVSYYDYYQPEAYIKTTDTYIEKDSSVNDEI DKLRNAATAALIHRRDVIIVASVSSIYGLGSPDTYRRMTIPIDKQTGISRKELMKRLIAL RYDRNDVAFERGQFRIKGDVIDIYPSYMSNGYRLEYWGDDLEEISEINTLTGQKVKKNLE RIVIYPATQYLTADDDKDRIIQEIKDDLKVEVKKFEDDKKLLEAQRLRQRTEYDLEMITE IGYCKGIENYSRYLSGKKPGETPDTLFEYFPKDFLLFIDESHITVPQVRGMYNGDRARKE SLVENGFRLKAALDNRPLRFEEFREKSNQTVFISATPGDFEVEVSDNHIAEQLIRPTGIV DPEIEIRPTKNQVDDLLDEIRKRVAKKERVLVTTLTKKIAEELTEYYIELGVKVKYMHSD IDTLERIEIIRALRKGEIDVIIGINLLREGLDIPEVSLVAIMEADKEGFLRSRRSLVQTI GRAARNVEGRVILYADIMTDSMKEAITETERRRKIQKEYNAYHHIDPKSIVKEIAEDLIN LDYGIEEKKFENNKKVFRNKADIEKEITKLEKKIKKLVEELDFEQAIVLRDEMLKLKELL LEF >gi|296155219|gb|ADVK01000013.1| GENE 17 14414 - 15325 
1166 303 aa, chain + ## HITS:1 COG:no KEGG:Selsp_0070 NR:ns ## KEGG: Selsp_0070 # Name: not_defined # Def: hypothetical protein # Organism: S.sputigena # Pathway: not_defined # 12 301 13 302 311 161 32.0 4e-38 MSKVTKSIEKEFQKFLNEKYKEGMSEEERENLITEFMCQYNFKENKFILNEKTAETSDNF LELAEEANDEKLALKYARKALKLDKDNLDAEKLIAELVSVNQIDMLKRFEKILKHGDEIM LKNGYMNKECIGSFWSILETRPYIRVKHQYMNILKECGMLSFAISECEDMIKLCENDNLG VRHILMALYAFTENEEKALTLYKKYSGYEETQILVSLSILYYRKNNLKKSLSYLKKLEKI NKDTKKFFKDVIYDKIENYIEKMSDFGYKPYSGEELFVFFDNNSYLFTTGGYFEWAYEEL KNK >gi|296155219|gb|ADVK01000013.1| GENE 18 15410 - 16306 741 298 aa, chain - ## HITS:1 COG:no KEGG:FN0230 NR:ns ## KEGG: FN0230 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 228 1 228 232 364 98.0 2e-99 MLKYLKSSFYLFIFYFFFNFSSNLLATEIKSQEKLYGITIDDSWYDDEKIENIIEGIKNL PVKPVVRIVMSKDIKPKDYVSLFRKVHKAAYIMAQPIDSFEMNTYKNVESYRKRFEDSYK YLKDYVDIWEIGNEVNGEEWIKESPKFTVKKIYSAYKLIKSKNGITALTPYYFPPEENKI SMENWLEKYIPKDMKSGLDYVFISYYEDDNDNFQPKWKDVFTNLEKIFPNSKLGIGECGN TSQNATKQSKIKMINHYYSMTKYTTNYVGGYFWWYWVQDCIPYKNNEVWLELSNNMKN >gi|296155219|gb|ADVK01000013.1| GENE 19 16327 - 16596 400 89 aa, chain - ## HITS:1 COG:no KEGG:BBR47_50570 NR:ns ## KEGG: BBR47_50570 # Name: not_defined # Def: hypothetical protein # Organism: B.brevis # Pathway: not_defined # 2 87 154 240 253 84 47.0 1e-15 MYDWQKDNPKKNYYNDYFNKFFEESYKKYPEIQTSSGNFIYWEIPETHHKIAMFKTGFGD GYYMSLWGLNEKDEVCEVVIPFINPELID >gi|296155219|gb|ADVK01000013.1| GENE 20 16589 - 16705 162 38 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|262065947|ref|ZP_06025559.1| ## NR: gi|262065947|ref|ZP_06025559.1| conserved hypothetical protein [Fusobacterium periodonticum ATCC 33693] conserved hypothetical protein [Fusobacterium periodonticum ATCC 33693] # 1 37 221 257 346 73 86.0 4e-12 MPKGYTIDDSHILNGFCVDAGLASFCDASIVEEYTKFV >gi|296155219|gb|ADVK01000013.1| GENE 21 17007 - 17156 210 49 aa, chain - ## HITS:1 COG:FN0232 KEGG:ns NR:ns ## COG: FN0232 COG4859 # Protein_GI_number: 19703577 # Func_class: S 
Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 18 49 1 32 65 60 87.0 9e-10 MKKYIENAGSCIITKSLMNGKTKLRWLFREEPINNINTGWIAFGDKDND >gi|296155219|gb|ADVK01000013.1| GENE 22 17174 - 18130 1036 318 aa, chain - ## HITS:1 COG:no KEGG:FN0233 NR:ns ## KEGG: FN0233 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 318 1 318 318 574 99.0 1e-162 MANFKFLETEYQLKKLKPKYNNFWYAGKIKGYWCIVTTNFYEKLCSITIGAHKEDTHKSL IEILNKEIGLKKVKISTEDATVTISYKIPFFTSSNRKKFDEIIETVISNLKRNDFLTGGF LDGTNDSTLSIVEVGQKYFYLTDSEYKKKSEDLELKREENINKKENFILGILGVIGVALL GILAYVLAGIAGYYVWAIPAFLTAMASTVYKHLAGKISIISSFVIFILLAISLFIATFLE YTWRLYRFYKEEYIVTFGEVLKEVPQIILEVPDVKSAFTKDILINGGILILGFIITFISA YKSEDRFAKIKKIDDNKM >gi|296155219|gb|ADVK01000013.1| GENE 23 18166 - 18642 518 158 aa, chain - ## HITS:1 COG:no KEGG:FN0234 NR:ns ## KEGG: FN0234 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 12 158 1 147 147 234 97.0 6e-61 MKKIIIFYLILLGLTFVACGKEKNIGEFIDKEKISSEYKIEKEDENSIEFSDKDEYNPPI FRIFSIEKILKIDYNNPHKLDKMEEYYLSHDWKTISKDEQTLIVSYKEDDTQNYEYNIHT FDNSKTELIIAVSVGASRKLSETELVNILKEAKSFIKK >gi|296155219|gb|ADVK01000013.1| GENE 24 18784 - 19512 241 242 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 205 1 216 223 97 27 1e-19 MKNILDIKNLSYSFGNNPILKDINIHVNENEMVAIVGSSGVGKSTLFNLIAGVLKKQAGK ITINGSEDYIGKVAYMLQKDLLFEHKTIIDNVILPLIIAKIDKKEALEEGNKILKQFNLD KYANKYPQQLSGGMRQRVALIRTYMFKKNIFLLDEAFSALDAITKKELHKWYLDLKKEFN LTTLLITHDIEEAVFLSDRIYILGNKPGEIIGEIKIEINPNEDIDVQRLFYKKEILNIMN IE >gi|296155219|gb|ADVK01000013.1| GENE 25 19522 - 20526 1530 334 aa, chain - ## HITS:1 COG:FN0236 KEGG:ns NR:ns ## COG: FN0236 COG0715 # Protein_GI_number: 19703581 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components # 
Organism: Fusobacterium nucleatum # 1 334 1 334 334 614 99.0 1e-176 MKKIKYFLSVIFAIFMLVACGEKKEETKTEAPVELKKVDFLLDWVPNTNHTGLYVAKEKG YFSEEGIDLDIKQPANESTSDLVINNKAPMGVYFQDYMSSKLAKGAPITAIAAIIENNTS GIITNKKLNINSPKELAGHKYGTWDIPIELGMLQFIIEKDSGDFSKVELVPNTDDNSITP LSNGAFDAAPVYYAWDKIMGDSLGIETNFFYYKDYAPELNFYSPVIIANNDYLKDNKEEA TKILRAIKKGYQYAMEHPEEAAEILIKYAPELENKKAMIIESQKYLATQYASDKDKWGYI DPTRWNAFYNWLNEKGLTKNPIPENTGFSNDYLE >gi|296155219|gb|ADVK01000013.1| GENE 26 20513 - 21109 704 198 aa, chain - ## HITS:1 COG:FN0237 KEGG:ns NR:ns ## COG: FN0237 COG0600 # Protein_GI_number: 19703582 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type nitrate/sulfonate/bicarbonate transport system, permease component # Organism: Fusobacterium nucleatum # 1 198 44 241 241 338 99.0 5e-93 MLEALIGLALGIIIASLLAIIMDSFETINRIVYPLLIFTQTIPTIALAPILVLWLGYDMT PKIVLIVINTTFPIVISILDGFRHCDKDAIQLLKLMNASRWQILYHVKIPTALTYFYAGL RVSVSYAFISAVVSEWLGGFEGLGVFMIRAKKAFDYDTMFAIIILVSAISLISMELVKRS EKKFIKWKYLEEEENEKD >gi|296155219|gb|ADVK01000013.1| GENE 27 21489 - 22421 1321 310 aa, chain - ## HITS:1 COG:FN0238 KEGG:ns NR:ns ## COG: FN0238 COG4874 # Protein_GI_number: 19703583 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria containing a pentein-type domain # Organism: Fusobacterium nucleatum # 1 310 1 310 310 548 94.0 1e-156 MEASMKKNITNKILMVRPVSFAFNEETAVNNHYQKLDNKPAQEIQNNALIEFDNMVEKLK KIGIDVRVMQDTKEPHTPDSIFPNNWFSTHYSNTVVLYPMFAENRRLERTDNLYDYFDKA DDLNIVDYSNLEKENIFLEGTGALVLDRKNKKAYCSLSERANEKLLDIFCEDAGYKKIAF HSNQTVDGKRKPIYHTNVMMAMGENYAILCADSIDNLKERENVIRELKNDNKEIVYISEY QVEHFLGNTIELINNENVNICVMSATAYSVLTDEQKNIIEKYDVIVPVDVHTIERYGGGS ARCMIAELFI >gi|296155219|gb|ADVK01000013.1| GENE 28 22546 - 22695 234 49 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLKKLFLSVFSLILITYAGIETLKMSKVVKNEVSKQIVNSSVESKIFLR >gi|296155219|gb|ADVK01000013.1| GENE 29 23039 - 23866 998 275 aa, chain + ## HITS:1 COG:FN0240 KEGG:ns NR:ns ## COG: FN0240 COG0207 # Protein_GI_number: 19703585 # Func_class: F 
Nucleotide transport and metabolism # Function: Thymidylate synthase # Organism: Fusobacterium nucleatum # 1 275 1 275 275 559 99.0 1e-159 MKARFDKIYKDIVDTIAEKGIWSEGNVRTKYADGTAAHYKSYIGYQFRLDNSDDEAHLIT SRFAPSKAPIRELYWIWILQSNNVDVLNELGCKFWNEWKMQDGTIGKAYGYQIAQETFGQ KSQLHYVINELKKNPNSRRIMTEIWIPNELSEMALTPCVHLTQWSVIGNKLYLEVRQRSC DVALGLVANVFQYSVLHKLVALECGLEPAEIIWNIHNMHIYDRHYDKLIEQVNRETFEPA KIKINNFKSIFDFKPDDIEIIDYKYGEKVSYEVAI >gi|296155219|gb|ADVK01000013.1| GENE 30 23866 - 24363 579 165 aa, chain + ## HITS:1 COG:FN0241 KEGG:ns NR:ns ## COG: FN0241 COG0262 # Protein_GI_number: 19703586 # Func_class: H Coenzyme transport and metabolism # Function: Dihydrofolate reductase # Organism: Fusobacterium nucleatum # 1 163 1 163 164 294 99.0 5e-80 MEKKYYKNLKMIVCVGKDNLIGDRTPDENSNGMLWHIKEELMYFKSKTVGNTVLFGGTTA KYVPIELMKKNREVIILHRNMDVPKLIEDLTLENKTIFIAGGYSIYKYFLDNFEIDEIFF SKIKDSVEVKKAVEPLYLPNIEDYGYKMVDKKDYEEFTAYIYKIG >gi|296155219|gb|ADVK01000013.1| GENE 31 24371 - 25729 1656 452 aa, chain + ## HITS:1 COG:FN0242 KEGG:ns NR:ns ## COG: FN0242 COG0569 # Protein_GI_number: 19703587 # Func_class: P Inorganic ion transport and metabolism # Function: K+ transport systems, NAD-binding component # Organism: Fusobacterium nucleatum # 1 452 1 452 452 733 99.0 0 MKIVIVGAGKVGELLCRDLSLEGNDIILIEQDIKILEKILANNDIMGFVGSGVSYDVQME AEVPKADVFIAVTEKDEINIISSVIAKKLGAKYTIARVRSTDYSSQLNFMTESLGIDLVI NPELEAAKDIKQNIDFPEALNVENFLNGRLKLVEFHVDEDSILNNVSLFDFKQKFFPNLL VCIIKRGEEVIIPSGNSFIKGNDRIYITGSNSEIIKFQDILGKDRRKIKSAFIIGAGIIS HYLAQELLKDKIAVKIVEINPEKANKFSESLPEATVINADGSNEDVLKEENFQNYDSCIS ITGIDEINMFISIYAKKIGIKKIITKLNKLSFVDILGENSFQSIITPKKIIADNIVRVVR SIANKKKNLIENFYRLENNTVEAIEILVNSDSKINNIPLKDLKIKKNLIIAYIVRNNVAI FPKGTDVIKEGDRVIIITTESFFDDINNIIEE >gi|296155219|gb|ADVK01000013.1| GENE 32 25742 - 27097 1797 451 aa, chain + ## HITS:1 COG:FN0243 KEGG:ns NR:ns ## COG: FN0243 COG0617 # Protein_GI_number: 19703588 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA nucleotidyltransferase/poly(A) polymerase # 
Organism: Fusobacterium nucleatum # 1 451 1 451 451 789 97.0 0 MNKISINNFSDVEIGILNKLNEYGKGYIVGGAIRDILLGLKPKDVDFTTNLPYETLKKLF SEYNPKETGKSFGVLRIRIDDIDYEIAKFREDNYEEKDGLKIIPEGKKVSFVDDIKNDLT RRDFTINAMAYNEVEGIVDLYNGQKDIENKVINFIGNAEQRIIEDPLRVLRAFRFMSRLD FSLSENTIEAIKKQKNLLKNIPEEKITMEFSKLLLGENIRNTLTLMKDTGVLELIIPEFK ATYDFEQCNPHHNLDLFNHIISVVSKVPADLELKYSALLHDIAKPVVQTFDEKGIAHYKT HEIVGADMARDILTRLKLPVKLIETVTEIIKKHMVLYKDVTDKKFNKLLSEIGYDNLWRL IEHCIADNESKNNEVVSTENDLHERLKRAVEKQMQITVNDLAVNGKDLIELGFIGSEIGK IKEELLDKYLSEEVQNEKEEMLEYVKEKYKK >gi|296155219|gb|ADVK01000013.1| GENE 33 27231 - 27428 443 65 aa, chain + ## HITS:1 COG:FN0244 KEGG:ns NR:ns ## COG: FN0244 COG2608 # Protein_GI_number: 19703589 # Func_class: P Inorganic ion transport and metabolism # Function: Copper chaperone # Organism: Fusobacterium nucleatum # 10 65 1 56 56 85 100.0 2e-17 MKLNLKIDGMGCEHCIKSVKEALEEIKGIKVLDVKIGSAEVEAENDSILNEIKEKLDDAG YDLVR >gi|296155219|gb|ADVK01000013.1| GENE 34 27434 - 29743 3199 769 aa, chain + ## HITS:1 COG:FN0245 KEGG:ns NR:ns ## COG: FN0245 COG2217 # Protein_GI_number: 19703590 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Fusobacterium nucleatum # 1 769 1 769 769 1374 99.0 0 MENNINLRTEIGNNQENENQKLELKIDGISCQACVAKIERKLSKTNGVGKALVNISNNMA DIEYNEKEIKASEIMKIIEKLGYTPKRREDLKDKEEALRAEKKLKLELTKSKIVIVLSFI LMYISMSHMFGLPLPNILNPEMNIVNYVLTQLILAITVMIIGKRFYKVGFRQLYMLSPNM DSLVAVGTSSAFIYSLYISYKIFAGNNIHLIHSLYYESAAMIIAFVMLGKYLETLSKGKA SAAIKKLVNFQAKKASIIRNGEIIEIDIEEVSKGDTVFIKPGEKIPVDGVIIEGHSTIDE AMITGESIPVEKTENDKVYSGSINKDGALKVTVNATEGETLISKIAKLVEDAQMTKAPIA RLADKVSLIFVPTVIFIAVFAALLWWFLIKYNVISVSQNQFEFVLTIFISILIIACPCSL GLATPTAIMVGTGKGAELGILIKSGEALEKLNQIDTIVFDKTGTLTEGMPKVIDIVSLGN IDKDEILKISASMEVSSEHPLGKAIYDEAKEKNINLYDVKNFLAISGRGVIGEIEDKKYL LGNKKLLLDNNIKDLHEEEIHKYELQGKTTIFLADEEKLIAFITLADVVRNESIELIKKL KKENIKTYMLTGDNERTARVIAEKLGIDDVIAEVSPEDKYKKVKELQEQGKKVAMVGDGI NDSPALAQADVGIAIGSGTDIAIESADIVLMGKDIKIILTAIRLSRATIKNIKENLFWAF 
FYNTCGIPIAGGLLYLFTGHLLNPMIAGLAMGMSSVSVVSNALRLKRFK >gi|296155219|gb|ADVK01000013.1| GENE 35 29811 - 30641 1060 276 aa, chain + ## HITS:1 COG:FN0247 KEGG:ns NR:ns ## COG: FN0247 COG2849 # Protein_GI_number: 19703592 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 14 276 1 263 263 446 99.0 1e-125 MKKVLSALLLIIAMVLSACGGVKYEYKDGVMYGDGKEATGTFEFKAGKYKVKANFVDGLV DGLLEKYYPDGSIMVKDTYVNGENTKEEIYYKNGQLMGTFSDDEDLKFYYDDGKLIMTYN DKIGETLIYHENGNPLMTTNNKETAIYNENNEMLFKVENNKLVDIGATLKNLENGSFEFV KDNKVIAKIDANGEIINYLYSTGETMLKVNEATGITEFFFKNGNTFMKQEGNKSVVNYKD GKTLYELEGDVWKFYNQEGEEIISNFELITDIKKVD >gi|296155219|gb|ADVK01000013.1| GENE 36 30705 - 31631 935 308 aa, chain + ## HITS:1 COG:FN0248 KEGG:ns NR:ns ## COG: FN0248 COG2849 # Protein_GI_number: 19703593 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 308 1 308 308 524 100.0 1e-148 MKKGIILLALIFGACVNLENIGGNSGGEVKEIKTTNTNISTSKNYEKRNGVLYVDNVLAN GKQEYKEKNGVIIKGNYREGLPDGVQEKYYPSGKIYGKINIINNKTEGTETNYYENGKTL SQLDYTQGKLISGKIYYENGDLLSKIEGKKMTIFYSSGKKLFTMDKSDIAVYHENGKEVF SNSDDGIKINGEAAEKSILDMFSKNNLLKTAFYLLTSSTVQAEYKNGKPSIQLQGTTAVM YYESGKILLELSPSLDGTVNSKIYYENGQLMQVEDRNKSGRSVKVYDKAGNLISDTTYSK EHEIRQIF >gi|296155219|gb|ADVK01000013.1| GENE 37 31931 - 32060 196 43 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKAKILLCSMLILGSLSYASETDSVAQEVMNEVKNIEAEYQAL Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:17:47 2011 Seq name: gi|296155195|gb|ADVK01000014.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00015, whole genome shotgun sequence Length of sequence - 19354 bp Number of predicted genes - 23, with homology - 23 Number of transcription units - 5, operones - 3 average op.length - 7.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 6/0.000 - CDS 49 - 1599 2487 ## COG3051 Citrate lyase, alpha subunit 2 1 Op 2 6/0.000 - CDS 1602 - 2492 1443 ## COG2301 Citrate lyase beta subunit 3 1 Op 3 1/1.000 - CDS 2501 - 2785 536 ## COG3052 Citrate lyase, gamma subunit 4 1 Op 4 1/1.000 - CDS 2796 - 3635 872 ## COG1767 Triphosphoribosyl-dephospho-CoA synthetase 5 1 Op 5 1/1.000 - CDS 3622 - 4968 2085 ## COG5016 Pyruvate/oxaloacetate carboxyltransferase - Prom 5006 - 5065 7.9 - Term 5003 - 5042 5.3 6 2 Tu 1 . - CDS 5067 - 6431 1795 ## COG3493 Na+/citrate symporter - Prom 6650 - 6709 12.4 + Prom 6402 - 6461 10.4 7 3 Op 1 . + CDS 6602 - 7399 1018 ## FN1374 transcriptional regulator 8 3 Op 2 1/1.000 + CDS 7411 - 7899 624 ## COG2606 Uncharacterized conserved protein 9 3 Op 3 1/1.000 + CDS 7916 - 8875 1207 ## COG0564 Pseudouridylate synthases, 23S RNA-specific 10 3 Op 4 1/1.000 + CDS 8913 - 9542 946 ## COG0164 Ribonuclease HII 11 3 Op 5 1/1.000 + CDS 9548 - 9907 382 ## COG0792 Predicted endonuclease distantly related to archaeal Holliday junction resolvase 12 3 Op 6 1/1.000 + CDS 9897 - 10118 199 ## COG3478 Predicted nucleic-acid-binding protein containing a Zn-ribbon domain 13 3 Op 7 . + CDS 10093 - 10740 666 ## COG1040 Predicted amidophosphoribosyltransferases 14 3 Op 8 . + CDS 10754 - 11359 626 ## FN1367 methyl-accepting chemotaxis protein 15 3 Op 9 1/1.000 + CDS 11379 - 12134 1337 ## COG0149 Triosephosphate isomerase 16 3 Op 10 1/1.000 + CDS 12158 - 13252 1481 ## COG0012 Predicted GTPase, probable translation factor + Prom 13257 - 13316 7.5 17 3 Op 11 . 
+ CDS 13336 - 13515 314 ## PROTEIN SUPPORTED gi|19704699|ref|NP_604261.1| 50S ribosomal protein L32P + Term 13528 - 13579 14.1 - Term 13515 - 13568 12.3 18 4 Op 1 1/1.000 - CDS 13594 - 14328 517 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 19 4 Op 2 4/0.000 - CDS 14328 - 15032 201 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 20 4 Op 3 49/0.000 - CDS 15029 - 15796 665 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 21 4 Op 4 38/0.000 - CDS 15796 - 16713 694 ## COG0601 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 22 4 Op 5 . - CDS 16723 - 18210 1597 ## COG0747 ABC-type dipeptide transport system, periplasmic component - Prom 18310 - 18369 11.3 23 5 Tu 1 . - CDS 18376 - 19134 878 ## FN1358 hypothetical protein - Prom 19175 - 19234 9.3 Predicted protein(s) >gi|296155195|gb|ADVK01000014.1| GENE 1 49 - 1599 2487 516 aa, chain - ## HITS:1 COG:FN1380 KEGG:ns NR:ns ## COG: FN1380 COG3051 # Protein_GI_number: 19704715 # Func_class: C Energy production and conversion # Function: Citrate lyase, alpha subunit # Organism: Fusobacterium nucleatum # 1 516 1 516 516 969 99.0 0 MKFIKNAVGREIPEYLEGIGELVPFKGVDAIKPTKNKAGAKLRMRIQDEPKMVASIEEAI KKSGLKDGMTISFHHHMRNGDTVVNEVLDIIAKMGIKDLTLAPSSLSSCHGPVIDHIKSG VVTGIQSSGLREPLGDEISKGILKKPVIIRSHGGRARAVEDGELHIDIAFIAAPSCDEMG NMNGRTGKSACGSMGYAIVDAQYADYVIAITDNLVPFPNLPASIDQTLVDSVVVVDSIGD PKKIVSGAIRDSDNPRDLLIAKNAVDVIINSGYFKDGFVYQTGTGGASLSVTKLLKEEMI KQNIKASLGLGGITSQLVSLHEEGLMDALFDTQSFDLDAVRSIAENPKHYEISASFYANP NTPGPAVNNLTFVMLSALEIDKDFNVNVMTKSDGTINQAVGGHQDTAAGARISVILAPLM RARIPIIVDKVTTVCTPGEAVDVICTDYGIVVNPRRKDLIETLTKAGVELKTIEEMKEMA EQLTGKPDPVEFTDEIVGVVEYRDGSIIDVIKKVKE >gi|296155195|gb|ADVK01000014.1| GENE 2 1602 - 2492 1443 296 aa, chain - ## HITS:1 COG:FN1379 KEGG:ns NR:ns ## COG: FN1379 COG2301 # Protein_GI_number: 19704714 # Func_class: G Carbohydrate transport and metabolism # Function: 
Citrate lyase beta subunit # Organism: Fusobacterium nucleatum # 1 296 1 296 296 563 99.0 1e-160 MAIRDRLRRTMMFLPGNNPSMITDAYIYGPDSVMIDLEDATSVNQKDAARFLVSEALKTI DYKTTETVVRVNGLDTPFGADDIRAVVKAGVNVVRLPKTDTPDEIIAVDKLITEVEKEIG REGETLLMAAIESATGIMNVKEIALASKRLMGIALGAEDYVTNLKTSRSKHGWELYYARE AIVLAARNAGIYCFDTVYSDVNNLDGFRQEVQFIKDLGFDGKSCIHPKQVRIVHEIYTPT QKEIEKSIRIINGAKEAEAKGSGVISVDGKMVDNPIIMRAQRVLELAKASGIYKED >gi|296155195|gb|ADVK01000014.1| GENE 3 2501 - 2785 536 94 aa, chain - ## HITS:1 COG:FN1378 KEGG:ns NR:ns ## COG: FN1378 COG3052 # Protein_GI_number: 19704713 # Func_class: C Energy production and conversion # Function: Citrate lyase, gamma subunit # Organism: Fusobacterium nucleatum # 1 94 1 94 94 156 98.0 1e-38 MVLKTVGVAGTLESSDALITVEPANQGGIVIDISSSVKRQFGRQIEETVLNTIKELGVEN ANVKVVDKGALNYALIARTKAAVYRAAESNDYKF >gi|296155195|gb|ADVK01000014.1| GENE 4 2796 - 3635 872 279 aa, chain - ## HITS:1 COG:FN1377 KEGG:ns NR:ns ## COG: FN1377 COG1767 # Protein_GI_number: 19704712 # Func_class: H Coenzyme transport and metabolism # Function: Triphosphoribosyl-dephospho-CoA synthetase # Organism: Fusobacterium nucleatum # 1 279 1 279 279 474 93.0 1e-134 MQMNSKEIAKIATKALLYEVSISPKAGLVSRLSNGSHKDMDFYTFIDSVLSLSSYFSECF IYGQKNNFYSPNFFKNLRDLGKKAEKEMYQATNGVNTHKGTIFSMGILISVLASYFKETD EIDLKILSEKIKNMCFPLLNELENTNDFSTYGEKAFKNYHLTGARGLALSGYDIVLLDGI NKLKEFTKSLDFETSCILLLFYYISILDDTNIVNRTNFETLKEIQILCKNLYEENVKSLS KEKIRNEMSKLNDIFIEKNISAGGSADLLILTIFIHFIK >gi|296155195|gb|ADVK01000014.1| GENE 5 3622 - 4968 2085 448 aa, chain - ## HITS:1 COG:FN1376 KEGG:ns NR:ns ## COG: FN1376 COG5016 # Protein_GI_number: 19704711 # Func_class: C Energy production and conversion # Function: Pyruvate/oxaloacetate carboxyltransferase # Organism: Fusobacterium nucleatum # 1 448 1 448 448 884 99.0 0 MNKIKIMETCLRDGHQSLMATRLTTAEMLPIIEKLDSVGYHSLEMWGGATFDAALRFLNE DPWERLREIKKRVKNTKLQMLLRGQNLLGYRNYADDIVERFVKKSIQNGIDIVRIFDALN DVRNLQTACKATKKYGGHAQLAMSYTISPVHTVEYYKNLALEMQEIGADSIAIKDMSGIL 
LPEVAYELVRELKSVLRVPVEVHTHATAGLASMTYIKAVEAGADIIDTAISPLSGGTSQP ATESIVRAFQGTERETGFDLELLKEIAEYFKPIRAKYLQEGILNPQALMTEPSIVEYQLP GGMLSNFLSQLKMQKAEHKYEDVLREIPRVRKDLGYPPLVTPLSQMVGTQAIFNILTGQR YKLIPNEIKNYVRGLYGKSPVPISDEIKKTIIFNEEVFTGRPADKLAPEYDKLVEETRNF ARSEEDVLSYALFPQVAKDFLIKKYANE >gi|296155195|gb|ADVK01000014.1| GENE 6 5067 - 6431 1795 454 aa, chain - ## HITS:1 COG:FN1375 KEGG:ns NR:ns ## COG: FN1375 COG3493 # Protein_GI_number: 19704710 # Func_class: C Energy production and conversion # Function: Na+/citrate symporter # Organism: Fusobacterium nucleatum # 1 454 1 454 454 813 100.0 0 MAKKNFKELFDLRESKWGGISLPMFLCALIVVAIVVYIPFGLDKEGNPASFLRPNFLIMF SALAVFGLLFGEIGDRIPIWNDFIGGGTILVFFMAAVFGTYNLVPENFMKAVKIFYGKQP VNFLEMFIPALIVGSVLTVDRKTLIKSISGYIPLIIIGVLGASAGGVLVGLAFGKSPIDV MMNYVLPIMGGGTGAGAVPMSEIWSSKTGRPAAEWFGFAISILSIANVFAILCGALLKKL GEMKPSLTGNGELIIDNSKEAIRDKEIDVKPELTDTTAAFILTGVLFMVAHILGELWSKL PIEFELHRLVFLILLTMFLNIANLVPDNIKAGAKRMQTFFSKHTIWILMASVGFTTDVKE IAKAAAPSNILIALAIVLGAAGLIMLVARIMKFYPVEAAITAGLCMANRGGAGDVAVLGA ADRMDLMSFAQISSRIGGAMMLVLGSLMFSAFAS >gi|296155195|gb|ADVK01000014.1| GENE 7 6602 - 7399 1018 265 aa, chain + ## HITS:1 COG:no KEGG:FN1374 NR:ns ## KEGG: FN1374 # Name: not_defined # Def: transcriptional regulator # Organism: F.nucleatum # Pathway: not_defined # 1 265 3 267 267 387 99.0 1e-106 MGKIAFLVSGEKMFKKIKKYIDTEDVILVETTISNALEEAKNLIDEGVKVILTKLAIKIK IEDKVEIPVLSIENNISDYIELLKEIDIKNNRIAFVDYIEAPESLVDLSKIISNDIVFKT FTSEEECEAIVKELKNKSYSILIGSVLTKKYANKYDLKSYEVEISKDSYSMYIEIAEQII KFTDLKKSKAKVLKSIEIMIDNYLKNEEKMEKNILDKVTMNDVEKDRLIEGLKRNGFSLS NTAKDLGMSRTTLWRKLKKFNIIIE >gi|296155195|gb|ADVK01000014.1| GENE 8 7411 - 7899 624 162 aa, chain + ## HITS:1 COG:FN1373 KEGG:ns NR:ns ## COG: FN1373 COG2606 # Protein_GI_number: 19704708 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 162 1 162 162 271 97.0 3e-73 MKKTNAIRELEIHKIEHIVREYEVDEEHLDALSVALKTNKDITRVFKTLVLLNEKREMVI 
ACIPGMEKLDLKKLAKLSGHKKLEMLPMKDLFSMTGYVRGGCSPIGIKKRHSIFIHESAL DNKTILVSGGLRGLQIEIEPQKLIDYLKIIVGDIIEDVNIEF >gi|296155195|gb|ADVK01000014.1| GENE 9 7916 - 8875 1207 319 aa, chain + ## HITS:1 COG:FN1372 KEGG:ns NR:ns ## COG: FN1372 COG0564 # Protein_GI_number: 19704707 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Pseudouridylate synthases, 23S RNA-specific # Organism: Fusobacterium nucleatum # 1 319 1 319 319 567 100.0 1e-161 MKRVIMENIKEKFEFEVNSEYEGMRLDKYLSEQIEEATRSYLEKLIDNNFVKVNSKIINK NGRKLKLGEKIEVLIPEEENIDIEPENIPLNIVYENDDFILINKSYGMVVHPAYGNYTGT LVNALLYYTNNLSSVNGNIRPGIIHRLDKDTSGLILVAKNNYAHAKLASMFIDKTIHKTY LCIVKGNFSEENLSGRIENLIGRDSKDRKKMTIVKENGKIAISNYKVVEQVEGYSLVEVA IETGRTHQIRVHMKSINHVILGDSVYGTEDKNVKRQMLHAYKLEFLNPLDNKKYIFKGKL FDDFIEVAKRLKFNIEKYI >gi|296155195|gb|ADVK01000014.1| GENE 10 8913 - 9542 946 209 aa, chain + ## HITS:1 COG:FN1371 KEGG:ns NR:ns ## COG: FN1371 COG0164 # Protein_GI_number: 19704706 # Func_class: L Replication, recombination and repair # Function: Ribonuclease HII # Organism: Fusobacterium nucleatum # 1 209 7 215 215 358 99.0 3e-99 MDNPLYFYDLEYKNVIGVDEAGRGPLAGPVVAAAVILKEYTEELDEINDSKKLTEKKREK LYDIIMKNFDVAVGISTVEEIDKLNILNADFLAMRRALKDLKSLKNEKEYTVLVDGNLKI KEYIGKQLPIVKGDAKSLSIAAASIIAKVTRDRLMKDLANIYPDYSFEKHKGYGTKTHIE AIKDKGAIEGVHRKVFLRKILETEEEKTK >gi|296155195|gb|ADVK01000014.1| GENE 11 9548 - 9907 382 119 aa, chain + ## HITS:1 COG:FN1370 KEGG:ns NR:ns ## COG: FN1370 COG0792 # Protein_GI_number: 19704705 # Func_class: L Replication, recombination and repair # Function: Predicted endonuclease distantly related to archaeal Holliday junction resolvase # Organism: Fusobacterium nucleatum # 1 119 1 119 119 192 98.0 1e-49 MNTREIGNKYEDKSVEILVKEDYKILERNYQNKFGEIDIIAKKNKEIIFIEVKYRKTNKF GYGYEAVDRRKIMKILKLANYYIQSKKYQDYKIRFDCMSYLGDELDWIKNIVWGDEVGF >gi|296155195|gb|ADVK01000014.1| GENE 12 9897 - 10118 199 73 aa, chain + ## HITS:1 COG:FN1369 KEGG:ns NR:ns ## COG: FN1369 COG3478 # Protein_GI_number: 19704704 # Func_class: R General function 
prediction only # Function: Predicted nucleic-acid-binding protein containing a Zn-ribbon domain # Organism: Fusobacterium nucleatum # 1 73 3 75 75 117 98.0 4e-27 MAFSCPKCRCRNYEEKSIILPEKKKNFIKIELNTYYAKTCLNCGYTEFYSAKIVDDETAK EKCKTDAEVEGSY >gi|296155195|gb|ADVK01000014.1| GENE 13 10093 - 10740 666 215 aa, chain + ## HITS:1 COG:FN1368 KEGG:ns NR:ns ## COG: FN1368 COG1040 # Protein_GI_number: 19704703 # Func_class: R General function prediction only # Function: Predicted amidophosphoribosyltransferases # Organism: Fusobacterium nucleatum # 12 215 1 204 204 340 99.0 1e-93 MLKLKEAIRESLRFLFFDNACSCCHSKLDREGYICSKCLEKLKKEAFLKNKDEFYYLFIY EKAIRQIISDYKLRNRKDLARDIAFLIKKPIFQLIEREKIDIIIPVPISEEREIERGFNQ IEYLLECLDIKYKKIERIKNTKHMYTLKDNEKREKNVEKAFKNSLNLENKNVLIVDDIIT SGATINSISEELRRDNENINIKAFSIAVARHFIKE >gi|296155195|gb|ADVK01000014.1| GENE 14 10754 - 11359 626 201 aa, chain + ## HITS:1 COG:no KEGG:FN1367 NR:ns ## KEGG: FN1367 # Name: not_defined # Def: methyl-accepting chemotaxis protein # Organism: F.nucleatum # Pathway: not_defined # 1 201 1 201 201 323 99.0 2e-87 MEVYIDNQKTNFGRRSKDLEKILKAISKKLEKHEKVIQNIYINGSNIQDSIILDIDMDRP NIMEVETKSYTDLILDSLTLSKEYIETFFEVKKDFQQLIENNEKISGIEIEETDSFLNWF SDLLFFLVENYAFAFRSLQATIQTFREELVTLAELKERKDYVAYVSVLDYCVSDILENFK VNIDYYYKSILEEVEQRQIVF >gi|296155195|gb|ADVK01000014.1| GENE 15 11379 - 12134 1337 251 aa, chain + ## HITS:1 COG:FN1366 KEGG:ns NR:ns ## COG: FN1366 COG0149 # Protein_GI_number: 19704701 # Func_class: G Carbohydrate transport and metabolism # Function: Triosephosphate isomerase # Organism: Fusobacterium nucleatum # 1 251 1 251 251 466 100.0 1e-131 MRRLVIAGNWKMYKNNKEAVETLTQLKDLTRDVKNVDIVIGAPFTCLSDAVKIVEGSNVK IAAENVYPKIEGAYTGEVSPKMLKDIGVTYVILGHSERREYFKESDEFINQKVKAVLEIG MKPILCIGEKLEDREGGKTLEVLAKQIKEGLVDLSKEDAEKTIVAYEPVWAIGTGKTATP EMAQETHKEIRNVLAEMFGKDVADKMIIQYGGSMKPENAKDLLSQEDIDGGLVGGASLKA DSFFEIIKAGN >gi|296155195|gb|ADVK01000014.1| GENE 16 12158 - 13252 1481 364 aa, chain + ## HITS:1 COG:FN1365 KEGG:ns NR:ns ## COG: FN1365 COG0012 # 
Protein_GI_number: 19704700 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted GTPase, probable translation factor # Organism: Fusobacterium nucleatum # 1 364 1 364 364 701 99.0 0 MIGIGIVGLPNVGKSTLFNAITKAGAAEAANYPFCTIEPNVGVVTVPDERLNELAKIINP QKIVPATVEFIDIAGLVKGASKGEGRGNKFLSNIRSTSAICQVVRCFDDDNITHVDGSVD PLRDIDVINTELIFADIETIDKAIEKHEKLARNKIKESVELMSVLPKVKKHLEEFKLLKT LDLTDDEKQILKNYQLLTLKPMIFAANVAEDDLATGNKYVDLVREFTRSIGSEVVVVSAK VESELQEMDEESKQEFLEALGVKEAGLNRLIRAGFKLLGLQTYFTAGPKEVRAWTIRIGD TAPKAAGEIHTDFEKGFIRAKVVSYDNFIKNSGWKMSQENGVLRLEGKDYIVQDGDLMEF LFNV >gi|296155195|gb|ADVK01000014.1| GENE 17 13336 - 13515 314 59 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704699|ref|NP_604261.1| 50S ribosomal protein L32P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 59 1 59 59 125 100 2e-28 MAVPKKKTSKAKKNMRRSHHALTAIGLVTCEKCGAPKRAHRVCLECGDYKGTQVLETAE >gi|296155195|gb|ADVK01000014.1| GENE 18 13594 - 14328 517 244 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 2 238 11 265 329 203 39 6e-52 MLLTVENLTKIYIKKKILNNVSFSVKKGEIFGILGKSGAGKSTIGKILLQLLKPTMGTIL FEGKPLSEIPRKDIQAIFQDPYSALNPSLKIGEILEEPLIANGIFQKEKRRKKVEETLIK VGLLESDYEKYPEELSGGQQQRVCIAGAIILSPKLIICDEPIASLDLAIQVQILDLIHKI NQEEGISFIFITHNLPAIYRIADRILLLYHGEVQEIQEVEDFFHNPRSEYGKKFLQTLNL IKNF >gi|296155195|gb|ADVK01000014.1| GENE 19 14328 - 15032 201 234 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 211 1 219 223 82 28 3e-15 MMKILKIKNLNLKIHEKEILKNISFEIEEGEIIGLIGESGSGKTIFTKYILGILPTAAHF TQETFEVVPKIGAIFQNAFTSLNPTVKIGKQLQHLYISHYGNKKDWKEKIESLLEEVGLD KKKNFLDKYPYELSGGEQQRIVIMGALIGEPNFLIADEVTTALDVETKIEIIKFFKKLQK KLKISILFITHDLSILKDFADKIYVMYHGEIIDENHPYRKQLFQLSQDIWRRKQ >gi|296155195|gb|ADVK01000014.1| GENE 20 15029 - 15796 665 255 aa, chain - ## HITS:1 COG:FN1361 KEGG:ns NR:ns ## COG: FN1361 COG1173 # 
Protein_GI_number: 19704696 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 255 1 255 255 415 98.0 1e-116 MKKYLYIIIILVGIIFCIAFYQNPYKISENFTLLKPSFQHILGTDNLGRDIFSRLLLGTF YSIFIAFSAILLAGIIGSLLGAVAGYFEGYIDELFLFISEIFMSIPVILITLGIIVLLNN GFHSIILALFVLYMPRTLNYVRGLVKREKHKNYIKIAKIYGVNHFRIIIRHIAPNIILPI LVNFSTNFAGAILTEASLGYLGFGIQPPYPTLGNMLNESQSYFLLAPWFTILPGLMILFL VYKINQISKKYQEKK >gi|296155195|gb|ADVK01000014.1| GENE 21 15796 - 16713 694 305 aa, chain - ## HITS:1 COG:FN1360 KEGG:ns NR:ns ## COG: FN1360 COG0601 # Protein_GI_number: 19704695 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 305 1 305 305 499 98.0 1e-141 MYYIKKIFRMLLSIFSIGTFSFLLLELIPGDPETTILGIEASAKDLENLREQLGLNLSFG TRYWNWLCGVFQGDLGISFKYKEPVFNLILERLPLTISIAFISIFIVFVMSIPLSFFLHN TKNKKIKKIGESILSIFISIPSFWLGIIFMYLFGIILRWASTGYNNTWQSLILPCLVISI PKIGWISMHLYSNLYKELREDYIKYLYSNGMKKIYLNFYILKNAFLPIIPLTGMLLLELI TGVVIIEQIFSIPGIGRLLVQSVLMRDIPLIQGLIFYTSTFVVLLNFIIDILYSLLDPRI QVGEQ >gi|296155195|gb|ADVK01000014.1| GENE 22 16723 - 18210 1597 495 aa, chain - ## HITS:1 COG:FN1359 KEGG:ns NR:ns ## COG: FN1359 COG0747 # Protein_GI_number: 19704694 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 495 1 495 495 911 98.0 0 MKRKLFFEKVLVSILLTFILIACQKEENKEESIRTVSTVDIDSLNPYQVVSSASDQILLN VFEGLIMPGVDGTVVPALAESYEISEDGRTYTFSIRKGVKFHNGNDMDIKDVEFSLNYMS GKLGNNPTEALFENIEKIEILDDSHIAIYLSKPDSSFIYYMKEAIVPDENKDHLEDIAIG TGPYKIAEYQKEQKLVLSKNEEYWGEKAKISTVTILISPNSETNFLKLLSGEINFLTNID PKRIPELDKYQILSSPSNLCLILSLNPKEKPFDDIEVRKAINLAIDKNKVIQLAMNGKGT PIYTNMSPVMSKFLWNAPEEQANLEKAKQILEEKKLLPMEFTLKVPNSSKFYLDTAQSIR 
EQLKDIGITVNLEMIEWATWLSDVYTNKKYVASLAGLSGKMEPDAILRRYTSTYPKNFTN FNNARYDVLVEEAKRTSNEEKQIENYKEAQKILSEEQAAIFLMDPNIIIATEKGIEGFEF YPLPYLNFAKLYFKK >gi|296155195|gb|ADVK01000014.1| GENE 23 18376 - 19134 878 252 aa, chain - ## HITS:1 COG:no KEGG:FN1358 NR:ns ## KEGG: FN1358 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 20 252 1 233 233 400 99.0 1e-110 MININEATNNQKLKELLKKMYEDNNPRLENEVLEEIIMKVNFLSYINSNQNNTETDFENI NFNVLTTDDNKIYLPAFTDLEELAKWSIPSNMDTITLNFDNYVEIILENENIEGLVINPF GDLYILSKEWLRELKDMKKDRLKVNEVRIEANSKILISEPKHLPTMMIETIKNCCDNLGN IKKAWLLEMITEKDKSWLLILDFEGDKNYIFSKISQAARNYLGNMYLDMLPYEDDFARNS VQNHKAFYTKNK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:18:07 2011 Seq name: gi|296155174|gb|ADVK01000015.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00016, whole genome shotgun sequence Length of sequence - 18390 bp Number of predicted genes - 21, with homology - 19 Number of transcription units - 8, operones - 5 average op.length - 3.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 6/0.000 - CDS 1 - 202 158 ## COG0726 Predicted xylanase/chitin deacetylase 2 1 Op 2 3/0.000 - CDS 202 - 1347 1227 ## COG0438 Glycosyltransferase 3 1 Op 3 1/1.000 - CDS 1368 - 2153 755 ## COG3475 LPS biosynthesis protein 4 1 Op 4 . - CDS 2166 - 3305 1313 ## COG0859 ADP-heptose:LPS heptosyltransferase - Prom 3329 - 3388 9.9 - Term 3371 - 3407 -0.1 5 2 Op 1 . - CDS 3409 - 3543 146 ## gi|296327693|ref|ZP_06870234.1| conserved hypothetical protein 6 2 Op 2 . - CDS 3561 - 4103 691 ## COG4283 Uncharacterized conserved protein 7 2 Op 3 . - CDS 4073 - 4195 95 ## 8 2 Op 4 1/1.000 - CDS 4258 - 5202 625 ## COG2378 Predicted transcriptional regulator - Prom 5380 - 5439 9.6 9 3 Tu 1 . - CDS 5447 - 6793 1818 ## COG2252 Permeases - Prom 6841 - 6900 13.6 + Prom 6831 - 6890 9.4 10 4 Op 1 5/0.000 + CDS 7000 - 8322 1881 ## COG0672 High-affinity Fe2+/Pb2+ permease 11 4 Op 2 . 
+ CDS 8381 - 9034 1012 ## COG3470 Uncharacterized protein probably involved in high-affinity Fe2+ transport + Term 9062 - 9093 1.0 - Term 9089 - 9131 6.1 12 5 Op 1 . - CDS 9159 - 9362 463 ## 13 5 Op 2 1/1.000 - CDS 9429 - 9944 837 ## COG0778 Nitroreductase 14 5 Op 3 1/1.000 - CDS 9995 - 10822 1139 ## COG0647 Predicted sugar phosphatases of the HAD superfamily 15 5 Op 4 11/0.000 - CDS 10850 - 12136 721 ## PROTEIN SUPPORTED gi|90020581|ref|YP_526408.1| ribosomal protein L16 16 5 Op 5 11/0.000 - CDS 12151 - 12621 464 ## COG3090 TRAP-type C4-dicarboxylate transport system, small permease component - Prom 12657 - 12716 5.0 - Term 12637 - 12683 4.1 17 5 Op 6 . - CDS 12722 - 13765 299 ## PROTEIN SUPPORTED gi|149199369|ref|ZP_01876406.1| Ribosomal protein L22 + Prom 13974 - 14033 14.8 18 6 Tu 1 . + CDS 14063 - 14656 626 ## PROTEIN SUPPORTED gi|148988990|ref|ZP_01820390.1| hypothetical protein CGSSp6BS73_02415 19 7 Op 1 40/0.000 - CDS 14793 - 16097 1298 ## COG0642 Signal transduction histidine kinase 20 7 Op 2 . - CDS 16100 - 16801 842 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain - Prom 16959 - 17018 11.8 + Prom 16822 - 16881 13.8 21 8 Tu 1 . 
+ CDS 17023 - 18388 1087 ## COG1807 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family Predicted protein(s) >gi|296155174|gb|ADVK01000015.1| GENE 1 1 - 202 158 67 aa, chain - ## HITS:1 COG:FN1244 KEGG:ns NR:ns ## COG: FN1244 COG0726 # Protein_GI_number: 19704579 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted xylanase/chitin deacetylase # Organism: Fusobacterium nucleatum # 1 67 1 67 250 109 100.0 1e-24 MIKIFEYFFKKEIPVLMYHRLINNKDEIGKNTIYLNVDEFEKQLKYLKDNNYITITFKDL YKIPKKE >gi|296155174|gb|ADVK01000015.1| GENE 2 202 - 1347 1227 381 aa, chain - ## HITS:1 COG:FN1245 KEGG:ns NR:ns ## COG: FN1245 COG0438 # Protein_GI_number: 19704580 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Fusobacterium nucleatum # 1 381 1 381 381 608 99.0 1e-174 MKKILFKSGSTMMGGLEKVQIEYINFLLEQKKYQVKIVIENDNGKDNALEKYINSKIIYL KDFNYILKIRKLRENRKKSLWSRIKYNFAISKEKKYADEKFLQIYKEYKPDIVIDFDSSL TKIIDKLDLSKNLVWIHSSIENWKKKKSKINRFVDRISKYDKIVCICKEMKEDLIKLKSS LRNKVDFLYNPIDFDKIKKLSEENFYEEDKKILENKYLLSIARLDCIPKDFETLFKAYEI AKKDGYDGKLYIIGDGPDKEKVEKLKEDNVYKDEIILLGRKENPYNWLKKADKLILSSKY EGFAMVTLEGLCLGKNVIASNCKTGPKEILADNRGKLFKVGDYLTLAKYIVLEDNQKDLK FNLEEFERNKIFEKFLEILED >gi|296155174|gb|ADVK01000015.1| GENE 3 1368 - 2153 755 261 aa, chain - ## HITS:1 COG:FN1246 KEGG:ns NR:ns ## COG: FN1246 COG3475 # Protein_GI_number: 19704581 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: LPS biosynthesis protein # Organism: Fusobacterium nucleatum # 1 261 1 261 261 447 97.0 1e-125 MYNNSELRKIQQKKLEILIDIAKFCNENKIRYWLDSGTLLGAVRHGGFIPWDDDIDIIIM QEDAKFLKENYKSENFEIINTNEEGINFYKVISKKEQVQVGDSIAELDIDIFLVTYYPDS LALKFWNSFFHLKKNRIDRFSFSLFFTNILINLKRKLEKTKLFNYKNIEKKISYILDEAK KKNKAMSNIAYTPDCGFYLIIWREDEIFPLKKMKFEGIEFNIPNNYDTYLKKMYYSYMDL PPKEKRVPDHYQDKELKLIIK >gi|296155174|gb|ADVK01000015.1| GENE 4 2166 - 3305 1313 379 aa, chain - ## HITS:1 COG:FN1247 KEGG:ns NR:ns ## COG: FN1247 COG0859 # Protein_GI_number: 19704582 # 
Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose:LPS heptosyltransferase # Organism: Fusobacterium nucleatum # 1 379 1 379 379 666 98.0 0 MKNLIKKLNRIFQDYMREKRLKIGKYIWDRKDRVKILEGNSFLEDNGIKSILFLRYDGKI GDMVVNSLMFREIKKVYPNIKIGVVARGAAMDIIKDNPNIDKIYEYHKDRKKIKDLALKI KEEKYDLLIDFSEMLRVNQMMLINLCGARFNMGLGRKEWKLFDLSIESDKDFKWTEHITN RYLAYLVKLGLKKENIDISYDIYLKDEKKYEFFFNEIKENKKLILNPYGASKHKSFTIET LENIITCLKDKDIAIILVYFGDKYKELEFLEKKYNNIYMPQKIESILDTAILIKKSDYVI SPDTSIVHIASALNKKMITVYPPNGGKYGVDHLVWAPKSEYSRVIFCKDKTGTYDEIDIN TFNFDEIKEEILKLINNSD >gi|296155174|gb|ADVK01000015.1| GENE 5 3409 - 3543 146 44 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296327693|ref|ZP_06870234.1| ## NR: gi|296327693|ref|ZP_06870234.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 44 1 44 44 68 100.0 1e-10 MHFVSEPKMKAKKSIIGSAFMIDSYIYEELKLKSTGARENKKNI >gi|296155174|gb|ADVK01000015.1| GENE 6 3561 - 4103 691 180 aa, chain - ## HITS:1 COG:FN1248 KEGG:ns NR:ns ## COG: FN1248 COG4283 # Protein_GI_number: 19704583 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 180 25 204 204 319 100.0 2e-87 MPRPKTKEELMIAAKDNYEKLNTLISKMSAEELNTPFDFSMDEKKKEAHWKRDKNLRDIL IHLYEWHQLILNWVYSNQNGEEKPFIPKPYNWKTYGDMNVEFWKKHQNTSLKDATKMFHK SHKDVLELAERFTNEEMFSKDVYKWVGGSVLGSYFVSATSSHYDWAMKKLKAHQKNCKLK >gi|296155174|gb|ADVK01000015.1| GENE 7 4073 - 4195 95 40 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNTKYYQCRTMLIEEKVDFYGTNSNGRRKILCQDQKQKRS >gi|296155174|gb|ADVK01000015.1| GENE 8 4258 - 5202 625 314 aa, chain - ## HITS:1 COG:FN1249 KEGG:ns NR:ns ## COG: FN1249 COG2378 # Protein_GI_number: 19704584 # Func_class: K Transcription # Function: Predicted transcriptional regulator # Organism: Fusobacterium nucleatum # 1 314 1 314 314 567 100.0 1e-161 MSNYCCIINIGDDKIMKDNRLFRILYYILEKGKVRANQLADRFEVSVRTIYRDIDSLSSA 
GIPIYTTQGKGGGIEIAKDYVLSKSLLSENEKKQIMSALQVLDNTTKQYENDLLTKLSAL FKIKSTNWIEVDFNNWQNNQMYEKTFNDIKSSILNKYTISFSYFNSNEKETSRSVKPVRL LSKNQDWYLYALCLLRNDFRYFKLSRIKNLEIHTEKFEDNFDDIILKKEMSHNNTIHIKV KFEHKVAFRVYDEINNEIIEDAKGNLYTEMEIPNDYNLYSYILSFGDGAEVLEPKEIRMQ IKKMISKMAEKYII >gi|296155174|gb|ADVK01000015.1| GENE 9 5447 - 6793 1818 448 aa, chain - ## HITS:1 COG:FN1250 KEGG:ns NR:ns ## COG: FN1250 COG2252 # Protein_GI_number: 19704585 # Func_class: R General function prediction only # Function: Permeases # Organism: Fusobacterium nucleatum # 1 448 1 448 448 714 100.0 0 MQEALRKFFNFEEYETNFKKEIIAGTTNFLTMAYILGVNTIILSSAGMDFNSVFLATAIS SAIACFVMGLVANAPLGLAPGMGSNSFFTFIVVKLYGYSYQEALAMVFVSGTLFLLLSAT GIRDKIINSIPENLKQSIGAGTGFFIALIGLVKAGIAVSHPATLITLGNFKNPTVLLAVF GLLLTIVLMSRKIDAAVFFGLLITAIVGIVLGKFGIEGMPKFSNEIIKVNTSLNHFGDFF YGIKSLIAKPKSIFLIFTFFFVDFFDTAGTLVAITNKISSKTGKNYKMKKMLFSDAVGTV VGAVLGTSTVTTLTESTSGVAAGGRTGLTAITTGIWFLIASIFTPLVAIASPIEIGGMFF EPVIAPSLICVGILMATQLSSIDWHDFTAASAGFVTIMIMIVGYSIPDGIAAGFIVYVFS KLFTKNVKDISPSVWAMFVLFVLHFALK >gi|296155174|gb|ADVK01000015.1| GENE 10 7000 - 8322 1881 440 aa, chain + ## HITS:1 COG:FN1251 KEGG:ns NR:ns ## COG: FN1251 COG0672 # Protein_GI_number: 19704586 # Func_class: P Inorganic ion transport and metabolism # Function: High-affinity Fe2+/Pb2+ permease # Organism: Fusobacterium nucleatum # 8 440 1 433 433 797 99.0 0 MKKYFKSLFAFIFVFSLFISFSSTNVEAAQKKKYDTWQDVAKDMNIEFQAAKKFIEEGNN DEAYNAMNRAYFGYYEVQGFEKNVMVNIAAKRVNEIEATFRRIKHTLKGNIEGNVAELDK EIDTLAMKVYKDAMVLDGVASIDDPDDLGMRVFGNEQTVTGSETAVKFKSFGASFGLLLR EGLEAILVVVAIIAYLVKTGNQKLCKQVYIGMGFGVICSFLLAFLIDWLLGGVGQELMEG ITMFLAVAVLFWVSNWILSRSEEQAWSRYIKSQVQKSIDQNSGRALIFSAFLAVVREGAE LVLFYKAILTGGQTNKLYAFYGFLVGAVVLVVIYLIFRYTTVRLPLKPFFMFTSILLFLL CISFMGKGVVELTEAGVISGSTVIPAMNGYQNTWLNIYDRAETLIPQIMLVIASSWMLLN NYFKEKKAKKEAETLAKENK >gi|296155174|gb|ADVK01000015.1| GENE 11 8381 - 9034 1012 217 aa, chain + ## HITS:1 COG:FN1252 KEGG:ns NR:ns ## COG: FN1252 COG3470 # Protein_GI_number: 19704587 # Func_class: P Inorganic ion transport and 
metabolism # Function: Uncharacterized protein probably involved in high-affinity Fe2+ transport # Organism: Fusobacterium nucleatum # 1 217 1 228 228 351 94.0 5e-97 MKNLKFLLGALLVLGLVACGEKKEEEKPAEQPAATTEVPAEKPGESGFAEVPIAETVVGP YQVAAVYFQAVDMIPEGKQPSAAESDMHLEADIHLLPEAAKKFGFGDGEDIWPAYLTVNY KVMSEDGKTELTSGTFMPMNADDGAHYGINIKKGLIPIGKYKLQLEIKAPTDYLLHVDSE TGVPAAKDGGVAAAEEFFKTQTVEFDWTYTGEQLQNK >gi|296155174|gb|ADVK01000015.1| GENE 12 9159 - 9362 463 67 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKISLVILVVAGILVGCTHTEKTATGGAVAGAAVGALLGNDARSTAVGAAIGGALGAGA GELTKNK >gi|296155174|gb|ADVK01000015.1| GENE 13 9429 - 9944 837 171 aa, chain - ## HITS:1 COG:FN1254 KEGG:ns NR:ns ## COG: FN1254 COG0778 # Protein_GI_number: 19704589 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Fusobacterium nucleatum # 1 171 1 171 171 346 100.0 1e-95 MDLLKLMGDRYSCRRYSAEDVKEEDILKLLEAAKIAPTAHNEQPQRIYVVKSEEGKAKLM KDFKFDFKAPCYLVCGYNEEEAWKNPLDNNKDSGEVDISIIMTHMMLMAEELGLGTCWIG YFDPIAVKKNLEIPDNIKVIGILSLGYHREDDRPAKLHTIYRNNEDLVKFL >gi|296155174|gb|ADVK01000015.1| GENE 14 9995 - 10822 1139 275 aa, chain - ## HITS:1 COG:FN1255 KEGG:ns NR:ns ## COG: FN1255 COG0647 # Protein_GI_number: 19704590 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted sugar phosphatases of the HAD superfamily # Organism: Fusobacterium nucleatum # 1 275 1 275 275 506 99.0 1e-143 MTPFILNLGGLMEKLENIKCYLLDMDGTIYLGNELINGAKEFLEKLKEKKIRYIFLTNNS SKNKNRYVEKLNKLGIEAHREDIFSSGEATTIYLNKKKKGAKIFLLGTKDLEDEFEKAGF ELVKERNKNIDFVVLGFDTTLTYEKLWIACEYIANGIEYIATHPDFNCPLENGKFMPDAG AMIAFIKASTGKEPTVIGKPNSHIIDAIIEKYDLKKSELAMVGDRLYTDIRTGIDNGLTS ILVMSGETDKKMLEKTIYKPDYIFDSVKELKEKIE >gi|296155174|gb|ADVK01000015.1| GENE 15 10850 - 12136 721 428 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020581|ref|YP_526408.1| ribosomal protein L16 [Saccharophagus degradans 2-40] # 1 424 3 426 435 282 36 1e-75 MEALYPIIILFVLFFLNIPIAFALMGSALFYFIFLNTTMSMDMVIQQFVTSVESFPYLAV 
PFFIMVGSVMNYSGISEELMNMAEVLAGHMKGGLAQVNCLLSAMMGGISGSANADAAMES KILVPEMIKKGFSKEFSAAVTAASSAVSPVIPPGTNLILYALIANVPVGDMFLAGYTPGI LMTAAMMVTVYIISKKRGYEPSRERMARPVEIIKQTIKSIWALAIPFGIIMGMRIGVFTP TEAGGVAVFFCFLVGFFIYKKLKLHHIPIILMETVKSTGAVMIIIASAKVFGYYMTLERI PQFITNSLMNFTDNKFVLLMVINLLLLFVGMFIEGGAALVILAPLLVPAVKALGVDPLHF GVIFIVNIMIGGLTPPFGSMMFTVCSIVGVRLEGFIKEVWPFILALLVVLFLVTYSESIA LFIPNLLR >gi|296155174|gb|ADVK01000015.1| GENE 16 12151 - 12621 464 156 aa, chain - ## HITS:1 COG:FN1257 KEGG:ns NR:ns ## COG: FN1257 COG3090 # Protein_GI_number: 19704592 # Func_class: G Carbohydrate transport and metabolism # Function: TRAP-type C4-dicarboxylate transport system, small permease component # Organism: Fusobacterium nucleatum # 10 156 1 147 147 193 99.0 9e-50 MKDFLKKFELYIGSIFVSITIIVVIINVFTRYFLKFTYFWSEEVAVGCFVWTIFLGTAAA YREKGLIGVEAIIVLLPEKIRNIVEFLTYILLTVLSGLMCVFSFTYVMSSSKITAALELS YGYINFSIVISFALMTLYSIIFTIESLKKAFLHKTN >gi|296155174|gb|ADVK01000015.1| GENE 17 12722 - 13765 299 347 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149199369|ref|ZP_01876406.1| Ribosomal protein L22 [Lentisphaera araneosa HTCC2155] # 1 320 1 329 346 119 25 1e-26 MKKVLSLIFLSLLTLALVACGGKKEEATKESGDAKQEARVIKVTTKFVDDEQTAKSLVKV VEAINQRSNGTLELQLFTSGTLPIGKDGMEQVANGSDWILVDGVNFLGDYVPDYNAVTGP MLYQSFEEYLRMVKTPLVQDLNAQALEKGIKVLSLDWLFGFRNIEAKKPIKTPEDMKGLK LRVPTSQLYTFTIEAMGGNPVAMPYPDTYAALQQGVIDGLEGSILSFYGTKQYENVKEYS LTRHLLGVSAVCISKKCWDSLTDEQRTIIQEEFDKGALDNLTETEKLEDEYAQKLKDNGV TFHEVDAEAFNKAVAPVYDKFPKWTSGIYDKIMENLTQIREDIKNGK >gi|296155174|gb|ADVK01000015.1| GENE 18 14063 - 14656 626 197 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148988990|ref|ZP_01820390.1| hypothetical protein CGSSp6BS73_02415 [Streptococcus pneumoniae SP6-BS73] # 6 195 3 192 192 245 57 1e-64 MNREQKLKRIIEKIKNDEENKKYTEQGIDPLFSAPKEARIVIVGQAPGIKAQENRLYWKD KSGDKLRVWTGINEKTFYSSNLLAIIPMDFYYPGKGKSGDLPPRKDFGDKWHHKILELLP NVELFILVGKYAQEFYLKGKIKDNLTNTVKAYKEYLPKFFPIVHPSPLNIGWLKKNLWFE EEVVPTLKDMVTKIMKN >gi|296155174|gb|ADVK01000015.1| GENE 19 14793 - 16097 1298 434 aa, chain - ## 
HITS:1 COG:FN1260 KEGG:ns NR:ns ## COG: FN1260 COG0642 # Protein_GI_number: 19704595 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Fusobacterium nucleatum # 8 434 1 427 427 695 99.0 0 MKKISKELLKTYYWVIVLFAIFSIFIIVNFSVYLWKENQNDIKVIEEFIEYQMEELKNRE DLDYLSKEWVLKKILDKAPKIRDVYLEIFYNDKKYAKSPYLPDKAHNFLDYYSVTKVYQI EGFDDIKVKITRRNVRDRLLILNAFTSFVFFLLFCLYVIIRIQKKFFDKFKNSLDNLKIF TQDYNLDSEIRIHNEENFIEFSILQKSFKNMLLRLKEQSQLQIDFVNNASHELKTPIFVI KGYVDMLNDWGKDDKEVLDESLIVLKKEIQNMQELTEKLLFLAKSRNLIVEKTNINLDNV LKEVIDNLSFAYPKQKINYNSSEIFIDSDIGLLKLLFKNLIENAIKYGNDNPINIELKKE KKIKVIIEDFGIGISKEAIPHIFERFYREDEARNREIKSYGLGLSIVKEIVALLNIDIQI ESQINKGTKITLLF >gi|296155174|gb|ADVK01000015.1| GENE 20 16100 - 16801 842 233 aa, chain - ## HITS:1 COG:FN1261 KEGG:ns NR:ns ## COG: FN1261 COG0745 # Protein_GI_number: 19704596 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Fusobacterium nucleatum # 1 233 1 233 233 432 100.0 1e-121 MLLFSWVRKMNKILIIEDDKNIQRLLTLELRHKNYSVDSAYDGEQGIEMFSKNSYDLVLL DLMLPKKSGKEVCQELRKLAETPIIIITAKDSVLDKVELLDLGANDYICKPFAMEELLAR IRVATRNKENSNNKQFYLENEIKMDILAKKVFLNEVEVSLTKTEFLILEYFMKNKGLSCS REKIIIGVWGYDFDGEEKIVDVYINSLRKKIDPKSYYIHTIRGFGYIFQYKED >gi|296155174|gb|ADVK01000015.1| GENE 21 17023 - 18388 1087 455 aa, chain + ## HITS:1 COG:FN1262 KEGG:ns NR:ns ## COG: FN1262 COG1807 # Protein_GI_number: 19704597 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family # Organism: Fusobacterium nucleatum # 1 455 1 455 519 748 97.0 0 MFSTKRKDIFVLTILSFFALLCLIWIREIDSSEAKNLISAREILQNSNWWTPTLNGHFYF ENPPLPVWITAFVMMVTHSHSEVILRLPNMLCCIFTVLFLYRSMIKIKKDRLFAFLCSFV LLSTFMFIKLGAENTWDIYTYSFAFCAAIAFYVYIKYGEKKNLYRMGILLTLSFLSKGPV GFYSLFVPFFLAHYIIFPREIFRRKTLSVIFSIFVSIAVASIWGISMYLNHGNYFLDVIK 
DEVLSWRTKHIHSFIFYTDFVVYMGSWLFFSIYVLFKIPKEKESKIFYLWTVLVLIFISL IEMKKKKYGLPLYLTSSITIGQLCIYYFRKPYLELKKREKTLLIIQQCFLIIVVIGSLIF LTYFGFLKKEISFGLFFLYAILHLLFLFLFAVGYTEISYAKRVIIFTGLTMLLVNFSSSW ILVNKYMQNNLLRFRIPISQEVLGNPAPIYSQSFD Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:18:32 2011 Seq name: gi|296155147|gb|ADVK01000016.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00017, whole genome shotgun sequence Length of sequence - 25660 bp Number of predicted genes - 37, with homology - 26 Number of transcription units - 14, operones - 6 average op.length - 4.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 2 - 704 1027 ## COG1454 Alcohol dehydrogenase, class IV 2 1 Op 2 . - CDS 727 - 2076 1856 ## COG2610 H+/gluconate symporter and related permeases 3 1 Op 3 . - CDS 2134 - 2214 67 ## 4 1 Op 4 4/0.000 - CDS 2233 - 3423 1872 ## COG4948 L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 5 1 Op 5 . - CDS 3438 - 4334 970 ## COG0329 Dihydrodipicolinate synthase/N-acetylneuraminate lyase 6 1 Op 6 . - CDS 4357 - 5097 881 ## COG1414 Transcriptional regulator - Prom 5146 - 5205 9.0 + Prom 5175 - 5234 13.9 7 2 Tu 1 . + CDS 5265 - 6848 1516 ## COG3263 NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain 8 3 Op 1 . - CDS 7069 - 7626 862 ## FN1560 hypothetical protein 9 3 Op 2 1/0.000 - CDS 7631 - 8359 627 ## COG1496 Uncharacterized conserved protein 10 3 Op 3 . - CDS 8359 - 9363 1269 ## COG2876 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 11 3 Op 4 . - CDS 9376 - 9639 381 ## FN1563 hypothetical protein - Prom 9668 - 9727 14.4 + Prom 9556 - 9615 9.2 12 4 Op 1 . + CDS 9666 - 9851 589 ## - TRNA 9745 - 9821 95.0 # Asp GTC 0 0 - TRNA 9830 - 9905 94.0 # Val TAC 0 0 13 4 Op 2 . + CDS 9879 - 10019 274 ## + Term 10047 - 10075 0.7 - TRNA 9917 - 9992 87.4 # Phe GAA 0 0 - TRNA 10006 - 10089 64.5 # Ser TGA 0 0 14 5 Op 1 . 
- CDS 10068 - 10151 241 ## - TRNA 10091 - 10165 66.8 # Glu TTC 0 0 15 5 Op 2 . - CDS 10144 - 10302 392 ## - TRNA 10182 - 10259 96.0 # Met CAT 0 0 + Prom 10083 - 10142 2.4 16 6 Tu 1 . + CDS 10267 - 10374 310 ## - TRNA 10268 - 10344 90.7 # Arg TCT 0 0 - TRNA 10353 - 10428 94.1 # Lys TTT 0 0 17 7 Op 1 . - CDS 10391 - 10498 275 ## - TRNA 10438 - 10513 93.2 # Gly TCC 0 0 - TRNA 10537 - 10613 81.5 # Met CAT 0 0 18 7 Op 2 . - CDS 10575 - 10757 698 ## - TRNA 10621 - 10708 72.2 # Leu TAA 0 0 19 7 Op 3 1/0.000 - CDS 10759 - 11091 360 ## COG2827 Predicted endonuclease containing a URI domain 20 7 Op 4 1/0.000 - CDS 11101 - 11970 784 ## COG0470 ATPase involved in DNA replication 21 7 Op 5 1/0.000 - CDS 11973 - 13001 1591 ## COG1077 Actin-like ATPase involved in cell morphogenesis 22 7 Op 6 8/0.000 - CDS 13003 - 13392 475 ## COG1939 Uncharacterized protein conserved in bacteria 23 7 Op 7 1/0.000 - CDS 13380 - 14801 2048 ## COG0215 Cysteinyl-tRNA synthetase 24 7 Op 8 1/0.000 - CDS 14833 - 15528 820 ## COG1211 4-diphosphocytidyl-2-methyl-D-erithritol synthase 25 7 Op 9 . - CDS 15504 - 17840 3128 ## COG1193 Mismatch repair ATPase (MutS family) - Prom 18082 - 18141 12.5 + Prom 17737 - 17796 8.0 26 8 Tu 1 . + CDS 17930 - 18001 96 ## - Term 18121 - 18166 3.0 27 9 Op 1 . - CDS 18196 - 19644 1939 ## FN1582 hypothetical protein 28 9 Op 2 . - CDS 19648 - 19980 229 ## FN1583 hypothetical protein 29 9 Op 3 1/0.000 - CDS 19973 - 20779 981 ## COG2367 Beta-lactamase class A 30 9 Op 4 . - CDS 20822 - 21475 822 ## COG5505 Predicted integral membrane protein - Term 21476 - 21508 -0.7 31 9 Op 5 2/0.000 - CDS 21534 - 22001 554 ## COG5505 Predicted integral membrane protein 32 9 Op 6 . - CDS 22022 - 23113 1883 ## COG4948 L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily - Term 23132 - 23181 7.1 33 10 Tu 1 . - CDS 23280 - 23342 68 ## - Prom 23508 - 23567 9.5 + Prom 23298 - 23357 9.8 34 11 Tu 1 . 
+ CDS 23444 - 23911 575 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases + Term 23919 - 23965 7.6 - Term 23907 - 23952 7.4 35 12 Tu 1 . - CDS 24080 - 24193 167 ## - Prom 24299 - 24358 10.1 + Prom 24281 - 24340 13.3 36 13 Tu 1 . + CDS 24450 - 25109 889 ## COG2932 Predicted transcriptional regulator + Term 25119 - 25159 2.0 - Term 25100 - 25154 10.1 37 14 Tu 1 . - CDS 25168 - 25659 676 ## FN1590 lipoprotein Predicted protein(s) >gi|296155147|gb|ADVK01000016.1| GENE 1 2 - 704 1027 234 aa, chain - ## HITS:1 COG:ECs3659 KEGG:ns NR:ns ## COG: ECs3659 COG1454 # Protein_GI_number: 15832913 # Func_class: C Energy production and conversion # Function: Alcohol dehydrogenase, class IV # Organism: Escherichia coli O157:H7 # 2 231 4 233 383 266 57.0 3e-71 MNRYVLNETSYFGAGCRTELATEVKNKGYKKALLVSDKILASCGVLDKVKDVLNKAKIPY DEFLEIKQNPTIKNCKDGLEALNKSGADFIIAVGGGSVIDTAKAIGIVKNNPSFADIKSL EGVANTTKKSVPVIALPTTCGTAAEVTINYVITVEEENRKIVCVDPKDIPVVAIVDAELM QSMPPKTIASTGMDALTHAIEGYITKGAHVISDMFEVQAIELIAKHLRGAVKDK >gi|296155147|gb|ADVK01000016.1| GENE 2 727 - 2076 1856 449 aa, chain - ## HITS:1 COG:BH0805 KEGG:ns NR:ns ## COG: BH0805 COG2610 # Protein_GI_number: 15613368 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism # Function: H+/gluconate symporter and related permeases # Organism: Bacillus halodurans # 6 414 8 422 456 180 31.0 5e-45 MSPFLLLVCLLISIVIVVILISKFKFNAALALVLGALLMGVLAKMNILEVVKGINSGFGG MMTGIGFPIGFGIILGQLLSDSGGANVIADKIVKIFPKSKAVYAIAFAGFILSIPVFFDV TFVVLIPIAIATMQKVNKSIPYMVGSISIGAGIAHSVIPPTPNPLAAAEIFNFDLGIILG TGLIIGIIVLIISLYIYNKILDRGIWNKERDETGFGIDIAEQVKLEKYPSLIEALIPIFL PIITILLNTVYSVLFPEKKSTILEFFGTKSISMLLGTLAAYFIAVKYIGNQKTSVSATKS LESAGIVFLITGAGGSFAEIIKLSGVNDAIVSLVTNLGGNLILILVLSWGLGVIFRQITG SGTVAGITSMTIMSSVAPTIAIHPVFIALACLSGGMFGATVNDSGFWIVSNMSGFTLSGG AKTYTLGEAIASVISLIVIIISGLITVII >gi|296155147|gb|ADVK01000016.1| GENE 3 2134 - 2214 67 26 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MRILPSLKNPTNQVQLVILKLYHNKK 
>gi|296155147|gb|ADVK01000016.1| GENE 4 2233 - 3423 1872 396 aa, chain - ## HITS:1 COG:STM2291 KEGG:ns NR:ns ## COG: STM2291 COG4948 # Protein_GI_number: 16765618 # Func_class: M Cell wall/membrane/envelope biogenesis; R General function prediction only # Function: L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily # Organism: Salmonella typhimurium LT2 # 4 385 6 396 405 457 56.0 1e-128 MEKTKQRIKAIRAYVVEGGGGDYHDQGEEHWIVKQIATPMSIYPDYKMTRTSFGINALKT LVVEVEAEDGTVGFAISTGGYPAAWLVMNHLDRFIVGSDVTDIEKMWDQMYRATLYYGRK GIVMNAISAIDLALWDLLGKLRKEPVYRLLGGKVRDELEFYATGPRPDLAKEMGFIGGKI PLVYGPADGEEGLRKNIELAREMREKVGPDFLLMWDCWMALDLPYAKKLMEESSKLGFKW IEECFNPDDYWSYEDLKKLAPNNIMVTGGEHEATRYGFRMLIEKCDLDILQPDVGWCGGM TELIKIANLAEAHGKLVIPHGSGVYSHHFVVTRVNSPFTECLMMAPKADKVLPQYYPLLK DEPIPVDGKLKLGDKPGFGVELNKENLVEVKKSQQL >gi|296155147|gb|ADVK01000016.1| GENE 5 3438 - 4334 970 298 aa, chain - ## HITS:1 COG:yjhH KEGG:ns NR:ns ## COG: yjhH COG0329 # Protein_GI_number: 16132119 # Func_class: E Amino acid transport and metabolism; M Cell wall/membrane/envelope biogenesis # Function: Dihydrodipicolinate synthase/N-acetylneuraminate lyase # Organism: Escherichia coli K12 # 1 292 19 311 319 195 34.0 8e-50 MNNLKGVFPPVITVFNKDGRIDIQSAKKHMDYLISKGVDGLAYFGTTGEFFSMSLKEKKE YIDEILKYNNKRTKILVGVGSTNKDEVIDFIKYLEKKDIAGILLINPYFSVYDETEVEAY YNCIAQNTNLKIIIYNFPQLTGLNFSVPLVEKLVKNNTNIVGIKDTIIDQTHLIDMLNIK KIKEDFIVYCAFESQSLGALVSGIEGFINATANFIPEATVGLWKAYKEKDFEQCCMYYRK MCQAMEVYKLSTPLLLACKKAVYENILGYDGYEKLPALPLDNQKVEKLRFILKTLKIN >gi|296155147|gb|ADVK01000016.1| GENE 6 4357 - 5097 881 246 aa, chain - ## HITS:1 COG:BH3725 KEGG:ns NR:ns ## COG: BH3725 COG1414 # Protein_GI_number: 15616287 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Bacillus halodurans # 3 223 2 224 251 138 34.0 1e-32 MSVKSAERVLEIFNLLAENPMGLTCKEISQKLGYAASSTFELLKTLEENNYLLVNENKKY FLGMMLIRLGNIVSENLDLKKIIKPHLLEIMNTFLETTFLGMITENNIIYIDKVQSNQTV STNANIGSIKPAYCTGLGKIILANMKEDKLNQLLNTIKFEKYTKNTIIDKGKLKEELKKY 
KKKGYAIDNEEIEEGLWCLAVPIYDSSFEVKMAISISGPKERMLSKKEKIKEFMLKKSKE ISKLLV >gi|296155147|gb|ADVK01000016.1| GENE 7 5265 - 6848 1516 527 aa, chain + ## HITS:1 COG:FN1559 KEGG:ns NR:ns ## COG: FN1559 COG3263 # Protein_GI_number: 19704891 # Func_class: P Inorganic ion transport and metabolism # Function: NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain # Organism: Fusobacterium nucleatum # 1 527 1 527 527 829 90.0 0 MNNILFFSSVVIILSIFMYRYLSKFGVPMLLVFISLGMIFGVNGIFKIDYENYELSRDIC SFALIYIIFFGGFGTNLSMARGIIKKSLILSSLGVIFTSLLTGIFSHYVLKIDWYTSFLI GSVLGSTDAASVFSILRSHKLNLKENTASLLEIESGSNDPFAYVLTISFLTLSKGELNLP ILLFKQVCFGLLVGYIFAKISILSIKKIHNIDSGMSMALIMASMLLSYSISEFVGGNGYI TIYLLGVLIGNVRFNKKSEIVSFFNGITSIMQILIFFLLGLLVNPLEALKYTVPAILIMI AMTLFIRPFVVCSLISPLKSSRGQKLLVSWAGLRGAASVVFAILVVVVNKKIGMIVFNIA FIVVLLSIAIQGSLLPFFSRKFNMIDEEENVLKTFNDYSDTEDVDFITAEIGEAHKWVGK QVKNLEFMPSVLLVLIIRNGQNIIPNGDTVIEKGDRVVLCGSSFVDRDTRINLYENVVDK NSKYKNKSIRELDRNTLVVLIKRDGVAMIPGGNTTILENDILVLLDR >gi|296155147|gb|ADVK01000016.1| GENE 8 7069 - 7626 862 185 aa, chain - ## HITS:1 COG:no KEGG:FN1560 NR:ns ## KEGG: FN1560 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 16 183 1 166 167 221 86.0 1e-56 MTKKFLAVSFLSLMLVACGGGNGSGTATGSSGSGTLELSQRDKELANGNPNIAAEILVQK AILQEAKNEKLTEQEQYNLDLAKQEVEVNFYLQKKFDKDFSNVSAVSEEEAKKYYDEHKS EIGNTPFEKIKDAIVNEIVYQKQTEIVHKYYNDLAEKYKINDILNKEYPQEAANTDNTKT EEKNK >gi|296155147|gb|ADVK01000016.1| GENE 9 7631 - 8359 627 242 aa, chain - ## HITS:1 COG:FN1561 KEGG:ns NR:ns ## COG: FN1561 COG1496 # Protein_GI_number: 19704893 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 242 1 242 242 370 89.0 1e-102 MNYIDNDIKDFDNYIEFTTFNKFNIKILFTKKNYGNVLEKSREEIKKDFSLQNKIMVSSH QTHSDNVVLIGDNADKTYFENTDGILTSNKNVAIFTKYADCLAIFIYDEESKIFGAVHSG WKGTYQEIVKRAIEKINPKNLSTINILFGIGISCENYKVGVEFYEEFRNKFPKEIVEKAF SIKDDDFYFNNQLFNYYLLKDYGVKEDKIFLNNRCTFKENFHSFRRDKELSGRNGAIMFM EV >gi|296155147|gb|ADVK01000016.1| 
GENE 10 8359 - 9363 1269 334 aa, chain - ## HITS:1 COG:FN1562 KEGG:ns NR:ns ## COG: FN1562 COG2876 # Protein_GI_number: 19704894 # Func_class: E Amino acid transport and metabolism # Function: 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase # Organism: Fusobacterium nucleatum # 1 334 1 334 334 590 87.0 1e-168 MYIRLKNKNFSKKVNDFLEKNKIKYFFVLDGENIKYAILYIPNDFQKENFKEIEDLIEIT EIKSAYKFVSREFKKSNTIIDIRGHLIGGDNFMFMAGPCSVENKEMLSNIAKEVKKGGAV VLRGGAYKPRTSPYDFQGLGEIALKYLREVADKNDMLVVTEAMDVENLDLVCTYSDIIQI GARNMQNFSLLKKLGKVNKPILLKRGLSATINEFLLSAEYIIAHGNREVILCERGIRTFE TMTRNTLDINAIALIKELSHLPIIIDASHGTGKRSLVEPVTLAGIFAGANGAMVEVHENP DCALSDGPQSLDFKLFEKLARNIKKSLIFRKELD >gi|296155147|gb|ADVK01000016.1| GENE 11 9376 - 9639 381 87 aa, chain - ## HITS:1 COG:no KEGG:FN1563 NR:ns ## KEGG: FN1563 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 87 1 87 87 148 94.0 8e-35 MYNEIDLHNLDFKLALNIFKRKYNEALKRKDKREILIIHGYGANKLGHIPILATNLRIFL SKNKDKLSYRLSINPGVTYVTPISRLD >gi|296155147|gb|ADVK01000016.1| GENE 12 9666 - 9851 589 61 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSNIIAQFLIFFIKNKKTSSKLVDFLMAGVTRLELATSCVTGRRSNQLSYTPNKNGGHNR T >gi|296155147|gb|ADVK01000016.1| GENE 13 9879 - 10019 274 46 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLSQLSYATVLKNGAQRRNRTTDTGIFSPLLYRLSYLGIIFLMAEG >gi|296155147|gb|ADVK01000016.1| GENE 14 10068 - 10151 241 27 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVRTSDFHSGNRGSIPLRGTMEGYPNW >gi|296155147|gb|ADVK01000016.1| GENE 15 10144 - 10302 392 52 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MGRGFESLQIRHYMWIHSSVWSEHSAHNRVVAGSSPAGSTIFFVKMPRSFSG >gi|296155147|gb|ADVK01000016.1| GENE 16 10267 - 10374 310 35 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MAYLEGFEPPTHALEGRCSIQLSYRYILNGASYKI >gi|296155147|gb|ADVK01000016.1| GENE 17 10391 - 10498 275 35 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVERQPSKLNVASSNLVSRSNYLCVISSVGRAHDF >gi|296155147|gb|ADVK01000016.1| GENE 18 10575 - 10757 698 60 aa, chain - ## HITS:0 COG:no KEGG:no NR:no 
MYDIIMDVLRGARKYKCLDGGIGRRTGLKILWYLVPCRFDSGSRHHLYRGVEQSGSSSGS >gi|296155147|gb|ADVK01000016.1| GENE 19 10759 - 11091 360 110 aa, chain - ## HITS:1 COG:FN1575 KEGG:ns NR:ns ## COG: FN1575 COG2827 # Protein_GI_number: 19704896 # Func_class: L Replication, recombination and repair # Function: Predicted endonuclease containing a URI domain # Organism: Fusobacterium nucleatum # 1 100 1 100 100 136 99.0 9e-33 MAYYLYMLRCEDGSIYTGVAKDYLKRYEEHLSAKGAKYTKSHKVVKIERVFLCDSRSIAC SLESEIKKYIKKKKENIISKPDSFIKDIENVRKIKIKKFFKKSKKKFDIM >gi|296155147|gb|ADVK01000016.1| GENE 20 11101 - 11970 784 289 aa, chain - ## HITS:1 COG:FN1576 KEGG:ns NR:ns ## COG: FN1576 COG0470 # Protein_GI_number: 19704897 # Func_class: L Replication, recombination and repair # Function: ATPase involved in DNA replication # Organism: Fusobacterium nucleatum # 1 289 1 289 289 448 98.0 1e-126 MLDEFLKNELSFNRESGTYLFYGDDLEKNYNIALEFSAELFSKNVENENERNKIIDKTLR NLYSDLMVVDTLNIDIVRDIIKKSYTSSHEGGAKVFILKNIQDIRKESANAMLKLIEEPT KDNFFILISKRLNILSTIKSRSIIYRIRKSTPEELGVDKYVYNFFLGFSNDIEKYKEKEI DLMLEKSYNSIGGVLKEYEKEKNIEVKIDLYKCLRNFVQESSNLKKYEKIKFAEDIYLNS SKENVNLIVEYLINLVKRDKNLKEKLEYKKMLRYPINLKLLLISMIMSI >gi|296155147|gb|ADVK01000016.1| GENE 21 11973 - 13001 1591 342 aa, chain - ## HITS:1 COG:FN1577 KEGG:ns NR:ns ## COG: FN1577 COG1077 # Protein_GI_number: 19704898 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Actin-like ATPase involved in cell morphogenesis # Organism: Fusobacterium nucleatum # 1 342 1 342 342 621 100.0 1e-178 MGLFNFRANRSIGIDLGTANTLVYSKKHKKIVLNEPSVVAVERETKRVLAVGNEAKEMLG KTPDTIVAVRPLSEGVIADYDITEAMIKYFIKKIFGSYSFFMPEIMICVPIDVTGVEKRA VLEAAISAGAKKAYLIEEARAAALGSGMDIAVPEGNMIIDIGGGSTDVAIISLGGTVVSK TIRVAGNNFDSDIIKYVKKTYNLLIGDRTAEEIKMKIGTALPLEEEETMEVKGRDLLMGL PKVVTITSEEVREAIKDSLDQILQCIRTVLEKTPPELASDIVDKGMIMTGGGSLIRNFPE MLTKYTNLKVTLADNPLESVVIGAGLALDQIDYLRKIEKAER >gi|296155147|gb|ADVK01000016.1| GENE 22 13003 - 13392 475 129 aa, chain - ## HITS:1 COG:FN1578 KEGG:ns NR:ns ## COG: FN1578 COG1939 # 
Protein_GI_number: 19704899 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 112 1 112 129 206 100.0 8e-54 MDNVDFSKDIRDYSGLELAFLGDAIWELEIRKYYLQFGYNIPTLNKYVKAKVNAKYQSLI YKKIINDLDEEFKVIGKRAKNSNIKTFPRSCTVMEYKEATALEAIIGAMYLLKKEEEIKK IINIVIKGE >gi|296155147|gb|ADVK01000016.1| GENE 23 13380 - 14801 2048 473 aa, chain - ## HITS:1 COG:FN1579 KEGG:ns NR:ns ## COG: FN1579 COG0215 # Protein_GI_number: 19704900 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Cysteinyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 473 1 473 473 932 99.0 0 MIKIYNTLTGHLDEFKPLKENEVSMYVCGPTVYNYIHIGNARPAIFFDTVRRYLEYRGYK VNYVQNFTDVDDKMINKANIENVSIKEIAERYIKAYFEDTSKINLKEEGMIRPKATENIN EMIEIIQSLVDKGYAYESNGDVYFEVKKYRDGYGELSKQNIEDLESGARIDVNEIKRDAL DFALWKASKPNEPSWDSPWGKGRPGWHIECSAMSRKYLGDSFDIHGGGLDLIFPHHENEM AQSKCGCGGTFARYWMHNGYININGEKMSKSSGSFVLLRDILKYFEGRVIRLFVLGSHYR KPMEFSDTELNQTKSSLERIENTLKRIKELDRENIKGIDDCQELLATKKEMEAKFIEAMN EDFNTAQALGHIFELVKAVNKTLDEANISKKGLEVIDEVYSYLVMIIQDVLGVQLKLEVE VNNISADLIELILELRRNAREEKNWALSDKIRDRLLELGIKIKDGKDKTTWTM >gi|296155147|gb|ADVK01000016.1| GENE 24 14833 - 15528 820 231 aa, chain - ## HITS:1 COG:FN1580 KEGG:ns NR:ns ## COG: FN1580 COG1211 # Protein_GI_number: 19704901 # Func_class: I Lipid transport and metabolism # Function: 4-diphosphocytidyl-2-methyl-D-erithritol synthase # Organism: Fusobacterium nucleatum # 1 231 1 231 231 389 99.0 1e-108 MYSSNSEIKKKVTFILAAAGQGKRMNLNSPKQFLDYRGEPLFYSSLKLAFENKNINDIII ITNKENLNFMVKYCQNKNLFSKVKYIVEGGSERQYSIYNAIKKIKDTDIVIIQDAARPFL KDKYIEESLKILNDDCDGAIIGVKCKDTIKIIDKNGIVLETPNRDNLIMVHTPQTFKFEI LKKAHQMAEEKNILATDDASLVEMISGKIKIIYGDYDNIKITVQEDLKFLK >gi|296155147|gb|ADVK01000016.1| GENE 25 15504 - 17840 3128 778 aa, chain - ## HITS:1 COG:FN1581 KEGG:ns NR:ns ## COG: FN1581 COG1193 # Protein_GI_number: 19704902 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS family) # Organism: 
Fusobacterium nucleatum # 1 778 1 778 778 1349 99.0 0 MNKHSFNVLEFDKLKELILANIVIDDNREVIENLEPYKDLSALNNELKTVKDFMDLLSFD GGFEAIGLRNINSLMEKIKLIGTYLEVEELWNINVNLRTVRIFKSRLDELGKYKQLREMI GNIPNLRVIEDVINKTINPEKEIKDDASLDLRDIRLHKKTLNMNIKRKFEELFEEPSLSN AFQEKIITERDGRMVTPVKYDFKGLIKGIEHDRSSSGQTVFIEPLSIVSLNNKMRELETK EKEEIRKILLRIAELLRNNKDDILIIGEKVMYLDILNAKSIYAVENRCEIPTVSNKEILS LEKARHPFIDKDKVTPLTFEIGKDYDILLITGPNTGGKTVALKTAGLLTLMALSGIPIPA SENSKIGFFEGVFADIGDEQSIEQSLSSFSAHLKNVKEILEAVTKNSLVLLDELGSGTDP IEGAAFAMAVIDYLNEKKCKSFITTHYSQVKAYGYNEEGIETASMEFNTDTLSPTYRLLV GIPGESNALTIAQRMGLPESIISKAREYISEDNKKVEKMIENIKTKSQELDEMRERFARL QEEARLDRERAKQETLIIEKQKNEIIKSAYEEAEKMMNEMRAKASALVEKIQHEEKNKED AKQIQKNLNMLSTALREEKNKTVEVVKKIKTKVNFKVGDRVFVKSINQFANILKINTSKE SAMVQSGILKLEVPFDEIKIVEEKKEKVYNVNNHKKTPVRSEIDLRGKMVDEAVYELETY LDRATLNGYTEVYVIHGKGTGALREGILKYLKACKYVKEYRIGGHGEGGLGCTVVTLK >gi|296155147|gb|ADVK01000016.1| GENE 26 17930 - 18001 96 23 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSIIVEGAFWSSRNTNGYQAVKI >gi|296155147|gb|ADVK01000016.1| GENE 27 18196 - 19644 1939 482 aa, chain - ## HITS:1 COG:no KEGG:FN1582 NR:ns ## KEGG: FN1582 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 27 482 1 456 456 885 99.0 0 MNSKTLTKVILVLIVIAIGFYLIKRAMTPKKIQKEAVFLGIEGYGDLTKGENLDHSLISK FKFNFYIDGQQKTLSVNNGKEVKEGVYTFDIQNQLQEGYIYDIVIDKDTVESVKLLDNDR KAMLSGKVNNIEQDKFIEIGEEKIDLTKNTGIYKITWKAGNSLVEKVGIDTLKDKTVKVT LDKDGKAKNVYLTFVSEKYMSPVIAIPGEKTLKNFLTTALQPVGTTLYIYGGSWDWQDEG SSLQATTIGIPQSWIDFYQYQNADYTYREKDENEETKNPSSSYYPYGKWNQYCYAGADCS GYVGWVIYNTLNKENGKDGYVMGATKMAKTFAENGWGTWTQDVKIPTNHDGSDFKVGDIF SMNGHVWISFGTCDDGSIVIAHSTPSNSINGQPGGGIQISAIGPSENCEAYQLAKMYMEK YYPDWCRRYKVVLKKSEDYIKFKKDSAAGKFSWNLENGILTDPDDYANKKPAEILKDIFQ EK >gi|296155147|gb|ADVK01000016.1| GENE 28 19648 - 19980 229 110 aa, chain - ## HITS:1 COG:no KEGG:FN1583 NR:ns ## KEGG: FN1583 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 110 8 117 117 192 99.0 3e-48 
MNKQTIYITGEARTTIDNAITKMFGTFYIAFEIILSTDEIIDVDCNATLRLTRDFVNRLF LNHNIIKDEEMLKKEVTTRYFGSSSKAILTAYHDALQHYKKVKNDFKEKR >gi|296155147|gb|ADVK01000016.1| GENE 29 19973 - 20779 981 268 aa, chain - ## HITS:1 COG:FN1584 KEGG:ns NR:ns ## COG: FN1584 COG2367 # Protein_GI_number: 19704905 # Func_class: V Defense mechanisms # Function: Beta-lactamase class A # Organism: Fusobacterium nucleatum # 5 268 1 264 264 464 96.0 1e-130 MENVMEKYTEWKKEIKKIILQVEGKVCINFYDLGKNNGFSINGSEKVLSASMIKLLILAE LMKKVSENNFSLSDTITITNFMKTEGDGVLKELNAGHHFTLKELATLMIIVSDNQATNIL IDFLGMENINFLGKELGLRETFLERRMMDAEASKNGYDNYTSADDLSLLLKLIYQEKLIN KETSQLMLDILLRQQQGERLQRYLPPDIKIAHKCGDLDNLENDGGIIWLGNKIYILVVLT SGMSNLQCRQTIGKISKFVYDKMEESLE >gi|296155147|gb|ADVK01000016.1| GENE 30 20822 - 21475 822 217 aa, chain - ## HITS:1 COG:FN1585 KEGG:ns NR:ns ## COG: FN1585 COG5505 # Protein_GI_number: 19704906 # Func_class: S Function unknown # Function: Predicted integral membrane protein # Organism: Fusobacterium nucleatum # 1 217 177 393 393 363 99.0 1e-100 MVRYSSKWDNATKADTSKLQAVADAAAKEVEKEKKTASAADWIFLIGISLMVSAVSQMVG AHLQNAFASVGLEVFDKGTMTTVFVTILGLVCALTPLGKLPAVEELSTVYLYAVVSLLAS TASVVDLLTAPMWIVYGLFILAIHVALMFVLSKMFHWDLCMVSTASLANIGGSASAPIVA SAYNPSYAGIGVLMGVLGAAVGNFFGIGIGQILKMLS >gi|296155147|gb|ADVK01000016.1| GENE 31 21534 - 22001 554 155 aa, chain - ## HITS:1 COG:FN1585 KEGG:ns NR:ns ## COG: FN1585 COG5505 # Protein_GI_number: 19704906 # Func_class: S Function unknown # Function: Predicted integral membrane protein # Organism: Fusobacterium nucleatum # 1 147 1 147 393 259 97.0 1e-69 MVITNGFTYIAFLMCLAGCLLLLEKYSKWRIFNVVPALVFIYILNMFFCTMGLFDSEACS KAYSVLKNNLLYAMIFVMLLRCDFRKLAKLGERMVAIFLACSFTLFIGFIVGYPIFKSFL GTDVWGAVAALYASWVGGSANMAAMQGFTSRCRSI >gi|296155147|gb|ADVK01000016.1| GENE 32 22022 - 23113 1883 363 aa, chain - ## HITS:1 COG:FN1586 KEGG:ns NR:ns ## COG: FN1586 COG4948 # Protein_GI_number: 19704907 # Func_class: M Cell wall/membrane/envelope biogenesis; R General function prediction only # Function: L-alanine-DL-glutamate epimerase and 
related enzymes of enolase superfamily # Organism: Fusobacterium nucleatum # 1 363 13 375 375 695 99.0 0 MKITEIKLGIISVPLRVPFKTALRSVNSVEDVIVEIHTDTGNVGYGEAPPTGVITGDTTG AIIGALKDHIIKTLIGRDVDDFENLMKDLNSCIVKNTSAKAATDIALWDLYGQLHKIPVY KLLGGSRNKIITDITISVNPPEEMARDAINAIKRGYDTLKVKVGIDPTLDVARLSAIREA IGKDYRIRIDANQAWTPKQAIKLLNQMQDKGLDIELVEQPVKAHDFEGLAYVTKYSNVPV LADESVFSPEDAFKILQMKAADLINIKLMKCSGIYNALKIISMAEIVGVECMIGCMLETK VSVNAAVHLACAKQIITKIDLDGPVLCSEDPIIGGAVFNEKEITVSDDFGLGIKGINGIK YID >gi|296155147|gb|ADVK01000016.1| GENE 33 23280 - 23342 68 20 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFIFSYKNKDIVNNNKGIDK >gi|296155147|gb|ADVK01000016.1| GENE 34 23444 - 23911 575 155 aa, chain + ## HITS:1 COG:FN1587 KEGG:ns NR:ns ## COG: FN1587 COG0454 # Protein_GI_number: 19704908 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 19 155 1 137 137 228 100.0 4e-60 MNILIREATEIDYPAINKMLLKLQNYHSENVPTIYKKLDIFFTFDEYLEILKDKNIYFLL ATLDNEAIGLIWLSFNEKLSKYEYLRKQIWIEGIYVKTKYRRKGIAQKLVNEAINKAKFL NAQSIELMIWDFNETSKKFFENYFKIRSLILTKEL >gi|296155147|gb|ADVK01000016.1| GENE 35 24080 - 24193 167 37 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MHFLEFKRKFSLLNEEEREFVYKLTLEDAINFLKTIY >gi|296155147|gb|ADVK01000016.1| GENE 36 24450 - 25109 889 219 aa, chain + ## HITS:1 COG:FN1589 KEGG:ns NR:ns ## COG: FN1589 COG2932 # Protein_GI_number: 19704910 # Func_class: K Transcription # Function: Predicted transcriptional regulator # Organism: Fusobacterium nucleatum # 1 219 1 219 219 400 100.0 1e-112 MSFGKTLKRIRLKHKDSLRGLAKKIDLHFTFIDKVEKGTAPISKNFIENVVAVYPEESEV LKKEYLKETLPDIFQKEEAIKIVSNSEVLNLPVYGKASAGRGYLNMDTPDYYMPILKGNF SKRSFFVEITGNSMEPTLEDGEFALVDPDNTSYSKNKIYVVTYNDEGYIKRLEMKDKLRV ITLKSDNPDYDDIDIPEEMQEYFQINGRVVEVISKKKLL >gi|296155147|gb|ADVK01000016.1| GENE 37 25168 - 25659 676 163 aa, chain - ## HITS:1 COG:no KEGG:FN1590 NR:ns ## KEGG: FN1590 # Name: not_defined # Def: lipoprotein # Organism: F.nucleatum 
# Pathway: not_defined # 1 163 252 414 414 309 90.0 3e-83 TNDAQTEPLLKQIAANGGYFIEADLPSPTMGYPGALGVEFTDDEKGNWPKILEKVEKAVI AIGGSGRMGTWAYSYNFAGVEGLTDLAVKSIESGDRDFTLGKLLASLDVATPGAKWNGSI MKDNNGVDVPNAFFIYQDTYVFGKGYMGITSVEIPEKYTKIGN Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:20:03 2011 Seq name: gi|296155083|gb|ADVK01000017.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00019, whole genome shotgun sequence Length of sequence - 83389 bp Number of predicted genes - 66, with homology - 64 Number of transcription units - 24, operones - 14 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 2 - 2009 2702 ## COG1404 Subtilisin-like serine proteases - Prom 2059 - 2118 8.8 + Prom 2070 - 2129 14.5 2 2 Tu 1 . + CDS 2222 - 3082 1033 ## COG0384 Predicted epimerase, PhzC/PhzF homolog 3 3 Op 1 18/0.000 - CDS 3209 - 3922 283 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 4 3 Op 2 19/0.000 - CDS 3922 - 4704 265 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 5 3 Op 3 24/0.000 - CDS 4694 - 5674 1393 ## COG4177 ABC-type branched-chain amino acid transport system, permease component 6 3 Op 4 20/0.000 - CDS 5675 - 6562 1228 ## COG0559 Branched-chain amino acid ABC-type transport system, permease components - Prom 6662 - 6721 6.7 - Term 6691 - 6732 -0.8 7 3 Op 5 1/0.875 - CDS 6778 - 7929 1708 ## COG0683 ABC-type branched-chain amino acid transport systems, periplasmic component - Prom 7964 - 8023 8.8 - Term 7938 - 7985 -0.3 8 4 Op 1 1/0.875 - CDS 8088 - 8870 1352 ## COG4221 Short-chain alcohol dehydrogenase of unknown specificity 9 4 Op 2 . - CDS 8888 - 11329 3888 ## COG0457 FOG: TPR repeat - Prom 11355 - 11414 10.8 + Prom 11280 - 11339 8.9 10 5 Op 1 35/0.000 + CDS 11470 - 11667 118 ## COG1132 ABC-type multidrug transport system, ATPase and permease components 11 5 Op 2 . 
+ CDS 11615 - 12280 885 ## COG1132 ABC-type multidrug transport system, ATPase and permease components + Term 12388 - 12426 -0.4 - Term 12272 - 12322 4.2 12 6 Tu 1 . - CDS 12357 - 12614 427 ## PROTEIN SUPPORTED gi|19704769|ref|NP_604331.1| 50S ribosomal protein L28P - Prom 12648 - 12707 10.5 + Prom 12650 - 12709 12.8 13 7 Op 1 10/0.000 + CDS 12945 - 13682 954 ## COG1349 Transcriptional regulators of sugar metabolism + Prom 13691 - 13750 3.4 14 7 Op 2 19/0.000 + CDS 13770 - 14699 1189 ## COG1105 Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) 15 7 Op 3 . + CDS 14689 - 16560 2461 ## COG1299 Phosphotransferase system, fructose-specific IIC component + Term 16579 - 16612 1.1 + Prom 16570 - 16629 8.2 16 8 Op 1 . + CDS 16652 - 18190 2446 ## COG0519 GMP synthase, PP-ATPase domain/subunit + Prom 18274 - 18333 10.2 17 8 Op 2 . + CDS 18354 - 19283 421 ## gi|296327753|ref|ZP_06870292.1| hypothetical protein HMPREF0397_0485 + Term 19376 - 19416 6.3 - Term 19364 - 19404 3.1 18 9 Tu 1 . - CDS 19418 - 20116 854 ## CA_C2453 CBS domain-containing protein - Prom 20149 - 20208 7.3 - Term 20459 - 20504 6.1 19 10 Tu 1 . - CDS 20544 - 21569 827 ## PROTEIN SUPPORTED gi|167855185|ref|ZP_02477956.1| 50S ribosomal protein L31 - Prom 21633 - 21692 19.3 - Term 21662 - 21697 4.1 20 11 Tu 1 . - CDS 21753 - 28121 7835 ## Lebu_0671 autotransporter beta-domain protein - Prom 28200 - 28259 10.4 - Term 28273 - 28321 9.1 21 12 Op 1 . - CDS 28357 - 39717 15196 ## FN1449 hypothetical protein - Prom 39743 - 39802 10.8 - Term 39827 - 39871 11.1 22 12 Op 2 . - CDS 39893 - 41311 1926 ## COG2985 Predicted permease - Prom 41341 - 41400 14.7 - Term 41473 - 41511 0.2 23 13 Op 1 35/0.000 - CDS 41517 - 42599 1595 ## COG0206 Cell division GTPase 24 13 Op 2 . - CDS 42622 - 43965 1679 ## COG0849 Actin-like ATPase involved in cell division 25 13 Op 3 . 
- CDS 43962 - 44672 657 ## FN1453 hypothetical protein 26 13 Op 4 6/0.000 - CDS 44685 - 45548 1360 ## COG1181 D-alanine-D-alanine ligase and related ATP-grasp enzymes 27 13 Op 5 11/0.000 - CDS 45563 - 46408 1049 ## COG0812 UDP-N-acetylmuramate dehydrogenase 28 13 Op 6 26/0.000 - CDS 46405 - 47787 1765 ## COG0773 UDP-N-acetylmuramate-alanine ligase 29 13 Op 7 4/0.125 - CDS 47792 - 48856 1440 ## COG0707 UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 30 13 Op 8 28/0.000 - CDS 48866 - 50164 1778 ## COG0771 UDP-N-acetylmuramoylalanine-D-glutamate ligase 31 13 Op 9 . - CDS 50164 - 51249 1456 ## COG0472 UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase - Prom 51288 - 51347 5.2 + Prom 51287 - 51346 6.0 32 14 Tu 1 . + CDS 51368 - 51541 214 ## FN1460 putative cytoplasmic protein 33 15 Op 1 . - CDS 51492 - 53321 2184 ## COG0770 UDP-N-acetylmuramyl pentapeptide synthase - Prom 53347 - 53406 12.1 34 15 Op 2 . - CDS 53425 - 54834 1061 ## COG1167 Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs - Prom 54875 - 54934 12.3 + Prom 54776 - 54835 8.9 35 16 Tu 1 . + CDS 54941 - 55783 1549 ## COG0214 Pyridoxine biosynthesis enzyme + Term 55791 - 55847 7.2 - Term 55779 - 55834 14.6 36 17 Op 1 . - CDS 55843 - 57594 2638 ## COG1154 Deoxyxylulose-5-phosphate synthase 37 17 Op 2 . - CDS 57619 - 57714 145 ## - Prom 57828 - 57887 7.9 38 18 Op 1 . + CDS 58155 - 58799 167 ## PROTEIN SUPPORTED gi|42631297|ref|ZP_00156835.1| COG0697: Permeases of the drug/metabolite transporter (DMT) superfamily 39 18 Op 2 . + CDS 58800 - 59084 332 ## FN1466 hypothetical protein + Term 59103 - 59142 -1.0 40 18 Op 3 . + CDS 59157 - 59699 500 ## FN1467 hypothetical protein - Term 59558 - 59607 -1.0 41 19 Op 1 1/0.875 - CDS 59680 - 60273 665 ## COG1853 Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 42 19 Op 2 . 
- CDS 60275 - 61597 1106 ## COG0534 Na+-driven multidrug efflux pump - Prom 61646 - 61705 7.0 + Prom 61668 - 61727 10.4 43 20 Op 1 1/0.875 + CDS 61776 - 62924 1521 ## COG3055 Uncharacterized protein conserved in bacteria 44 20 Op 2 2/0.125 + CDS 62942 - 63943 1155 ## COG1609 Transcriptional regulators 45 20 Op 3 9/0.000 + CDS 63963 - 64946 378 ## PROTEIN SUPPORTED gi|149199369|ref|ZP_01876406.1| Ribosomal protein L22 46 20 Op 4 1/0.875 + CDS 64970 - 66823 698 ## PROTEIN SUPPORTED gi|126646729|ref|ZP_01719239.1| Ribosomal protein L16 47 20 Op 5 4/0.125 + CDS 66826 - 67701 373 ## PROTEIN SUPPORTED gi|163762640|ref|ZP_02169704.1| ribosomal protein L33 48 20 Op 6 3/0.125 + CDS 67723 - 68595 1268 ## COG0329 Dihydrodipicolinate synthase/N-acetylneuraminate lyase 49 20 Op 7 1/0.875 + CDS 68610 - 69284 981 ## COG3010 Putative N-acetylmannosamine-6-phosphate epimerase + Prom 69291 - 69350 8.8 50 21 Tu 1 . + CDS 69443 - 70330 867 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily - Term 70313 - 70367 11.1 51 22 Op 1 . - CDS 70391 - 70726 641 ## FN1478 hypothetical protein 52 22 Op 2 . - CDS 70766 - 71353 593 ## FN1479 hypothetical protein 53 22 Op 3 2/0.125 - CDS 71377 - 72708 1591 ## COG2239 Mg/Co/Ni transporter MgtE (contains CBS domain) 54 22 Op 4 1/0.875 - CDS 72742 - 73863 1559 ## COG0343 Queuine/archaeosine tRNA-ribosyltransferase 55 22 Op 5 9/0.000 - CDS 73878 - 76055 2655 ## COG0317 Guanosine polyphosphate pyrophosphohydrolases/synthetases 56 22 Op 6 1/0.875 - CDS 76074 - 76586 901 ## COG0503 Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins 57 22 Op 7 1/0.875 - CDS 76601 - 77398 806 ## COG0457 FOG: TPR repeat 58 22 Op 8 1/0.875 - CDS 77414 - 77998 719 ## COG2928 Uncharacterized conserved protein - Prom 78019 - 78078 3.9 59 22 Op 9 . - CDS 78082 - 79362 1603 ## COG1253 Hemolysins and related proteins containing CBS domains - Prom 79423 - 79482 6.1 + Prom 79202 - 79261 6.8 60 23 Tu 1 . 
+ CDS 79461 - 79559 71 ## 61 24 Op 1 1/0.875 - CDS 79622 - 80083 663 ## COG4492 ACT domain-containing protein 62 24 Op 2 1/0.875 - CDS 80099 - 80950 1099 ## COG0190 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase 63 24 Op 3 1/0.875 - CDS 80944 - 81861 1042 ## COG0223 Methionyl-tRNA formyltransferase 64 24 Op 4 1/0.875 - CDS 81898 - 82347 615 ## COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains 65 24 Op 5 1/0.875 - CDS 82366 - 82839 645 ## COG1762 Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type) 66 24 Op 6 . - CDS 82833 - 83387 492 ## COG1381 Recombinational DNA repair protein (RecF pathway) Predicted protein(s) >gi|296155083|gb|ADVK01000017.1| GENE 1 2 - 2009 2702 669 aa, chain - ## HITS:1 COG:FN1426_1 KEGG:ns NR:ns ## COG: FN1426_1 COG1404 # Protein_GI_number: 19704758 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Subtilisin-like serine proteases # Organism: Fusobacterium nucleatum # 67 620 1 528 528 368 41.0 1e-101 MRKKILKSKVMLIVLASILFVSCGGGGGGGGGGSSNLPLNPGTPSVPSNPSTPAAPEDSY PKMTNPLDSQKGNMSALKANLYAAQVGSGVNIPKDTREEDGKVAGTNVGLLEGEGVKVAV LDADFQDAVRATAKDKTGHNVVTPRRTMTLTNLYPGIEILPRKNSEITNASSIVNTTLEH GEEVLEVIKDMDTSISPPENKIGLIVGSFGQDYHNNKTNENVYGAVLPNKETYDAALAKF GNQSVKIFNQSFGSDKAYDDSSLAQYRNEGDAMPLYFNKVVGAVGFPEKQIPYYRNLVQN EGGLFIWSAGNDGNKNSSMDAGLPYFDHRLESGWISVVGIADEENGKYYIRGSGNTPQTS KLSSAGFEARYWAISALMYGAEHYSTHKYGVGSSYAAPRVTQAAALVYEKFPWMTNEQIR QTLFTTTDRTELTEDPDNLSEAKLRNITKYPDSTYGWGMLNTERALKGPGAFMDISYYGD TSTFKANVIGTSYFDNNIYGDGGLQKSGLGTLHLTGNNSFSGGSVVKQGTLEIHQVHASP ITVESGGKLIFNPKAIVGYDIDSFKLIENIDPQRITDSGIKIRNYGTVAFKGTTAIIGGD YVGYNGSTTEVGFLSKVRVLGDIKYQPNTTVRILSNDYVTTQGSSNTVMEGKSVQGNIAN VETNGMRNA >gi|296155083|gb|ADVK01000017.1| GENE 2 2222 - 3082 1033 286 aa, chain + ## HITS:1 COG:FN1427 KEGG:ns NR:ns ## COG: FN1427 COG0384 # Protein_GI_number: 19704759 # Func_class: R General function prediction only # Function: Predicted 
epimerase, PhzC/PhzF homolog # Organism: Fusobacterium nucleatum # 1 286 8 293 293 532 98.0 1e-151 MKIFVCDAFSSKIFKGNQAGVVILEENENYPRETLMKNIAAELKHSETAFVKKIDNKKFK IRYFTPTEEVELCGHATISVFSVLRELNIVSVGKYIAETLAGNLEIIIDKDFIWMDMASP KIEYIFNLDEIKEIYSAFNLDVNQAPKNLIPKVVNTGLSDIIIPIENKETLDSFVMNKEK VIEISKKYKVVGAHLFTFDKMKKVTAFCRNIAPLVGIDEECATGTSNGALTHYLKDYNII SIQDINIFIQGESMGKTSTILSRYKEDKKTIQIGGNAVISFECKLY >gi|296155083|gb|ADVK01000017.1| GENE 3 3209 - 3922 283 237 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 3 232 1 237 245 113 28 3e-24 MAMLEVKDLQVFYDNIQALKGISLEINEGEVVSIIGANGAGKTTTLQTISGLITPKSGSI IFEGKNLLKEKAHNICKLGIAQVPEGRRIFSKLAVKDNLKLGQFTVKDSAEKKEEDRANF YKVFPRMSERKNQLAGTLSGGEQQMLAMGRALMSRPKLLILDEPSMGLSPLFVKEIFEVI KQLKEKGTTILLVEQNAKMALSISDRAYVIETGEIVLEGKAEDLLYNDRVKKAYLGG >gi|296155083|gb|ADVK01000017.1| GENE 4 3922 - 4704 265 260 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 7 248 1 231 245 106 26 4e-22 MENRKPLLVAKDISISFGALKAVNNFNLEINSGELIGLIGPNGAGKTTVFNILTGVYNAS SGKYILDGENVIKTSTSALVKKGLARTFQNIRLFKYLSVLDNVVAAYNFRMKYGILPGML RLPSYWKEEKEAKEKAMALLKIFDLDKYANMHAGNLPYGEQRKLEIARAMATEPKILLLD EPAAGMNPKETEDLMNTIKLIRDKFGIAVLLIEHDMKLVLGICERLVVLNYGKILASGKP NEVINNPQVIEAYLGKEEDE >gi|296155083|gb|ADVK01000017.1| GENE 5 4694 - 5674 1393 326 aa, chain - ## HITS:1 COG:FN1430 KEGG:ns NR:ns ## COG: FN1430 COG4177 # Protein_GI_number: 19704762 # Func_class: E Amino acid transport and metabolism # Function: ABC-type branched-chain amino acid transport system, permease component # Organism: Fusobacterium nucleatum # 42 326 1 285 285 439 99.0 1e-123 MDKNKKLSYITTYVLLTVLYFILFSLISSGFISRYQVGILILILINVILAASLNVTVGCL GQITLGHAGFMSIGAYTAALLTKSGLLSGYPGYVVALIIGGLVAGIIGFIIGIPALRLTG DYLAIITLAFGEIIRVLIEYFKFTGGAQGLTGIPRVNNFTLIYFITIFSVIFMYSVMTSR HGRAVLAIREDEIASGASGINTTYYKTFAFVLSAIFAGIAGGIYAHNLGILGAKQFDYNY SINILVMVVLGGMGSFTGSILSAIVLTILPEVLRSFAEYRMIVYPLILIIMMLFRPKGLL 
GREEFQISKIILYFTKKLKRGEVNGK >gi|296155083|gb|ADVK01000017.1| GENE 6 5675 - 6562 1228 295 aa, chain - ## HITS:1 COG:FN1431 KEGG:ns NR:ns ## COG: FN1431 COG0559 # Protein_GI_number: 19704763 # Func_class: E Amino acid transport and metabolism # Function: Branched-chain amino acid ABC-type transport system, permease components # Organism: Fusobacterium nucleatum # 1 295 14 308 308 493 99.0 1e-139 MEFLLQIINGLQIGSIYALVSLGYTMVYGIAQLINFAHGDIIMIGAYTSLFSIPLFTSMG LPIWTTVIPAIIICAVIGCLAKRIAYRPLRNSPRISNLITAIGVSLFIENVFMKVFTPNT RSFPKIFNQAPITFGNGINISFGAAITILTTVILSVGLQLFMKKTKYGKAMIATSQDYAA SELVGINVDRTIQLTFAIGSGLAAVGSVLYVSAYPQIQPLMGSMLGIKAFIAAVLGGIGI LPGAVLGGFILGIVESLTRAYLSSQLADAFVFSILIIVLLFKPTGILGKNVKEKV >gi|296155083|gb|ADVK01000017.1| GENE 7 6778 - 7929 1708 383 aa, chain - ## HITS:1 COG:FN1432 KEGG:ns NR:ns ## COG: FN1432 COG0683 # Protein_GI_number: 19704764 # Func_class: E Amino acid transport and metabolism # Function: ABC-type branched-chain amino acid transport systems, periplasmic component # Organism: Fusobacterium nucleatum # 1 383 1 383 383 697 100.0 0 MKKKLLTTLLGASLLLVACGGEKTEEKSTKEDETIKIGAMGPLTGAVAIYGISATNGLKL AVDEINANGGILGKQVELNLLDEKGDSTEAVNAYNKLVDWGMVVLIGDVTSKPSVAVAEV AAQDGIPMITPTGTQLNITEAGSNIFRVCFTDPYQGVVLAKFAKDKLGAKTVAIMSNNSS DYSDGVANAFVTEAEKQGIQVVAREGYSDGDKDFKAQLTKIAQQNPDILFIPDYYEQDGL IAIQAREVGLKSVIVGSDGWDGVVKTVDPSSYAAIEDVYFANHYSTKDSNEKIQNFIKNY KEKYNDEPSAFSALSYDTAYLLKAAIEKAGTTDKEAVTKAIKEIQFEGITGQLTFDENNN PVKSITIIKIVNGDYTFDSVVTK >gi|296155083|gb|ADVK01000017.1| GENE 8 8088 - 8870 1352 260 aa, chain - ## HITS:1 COG:FN1433 KEGG:ns NR:ns ## COG: FN1433 COG4221 # Protein_GI_number: 19704765 # Func_class: R General function prediction only # Function: Short-chain alcohol dehydrogenase of unknown specificity # Organism: Fusobacterium nucleatum # 1 260 1 260 260 483 99.0 1e-136 MEENRVKGKIAFISGASSGIGKATAEKLAQMGVNLILCARRENILNELKENLEKQYGIKV KNLVFDVRNYNDVLKNINSLDDEWKKIDILVNNAGLAVGLEKFYEYNMEDVDKMVDTNIK GFVYIANTIIPLMLATDKVCTIVNIGSVAGEIAYPNGSIYCATKFAVRAISDSMRSELID 
KKIKVTNIKPGLVDTGFSLVRFRGDKEKADNVYKGIDPLYAEDIADTVAYVVNLPEKVQI TDLSITPLHQANAIHIYKKK >gi|296155083|gb|ADVK01000017.1| GENE 9 8888 - 11329 3888 813 aa, chain - ## HITS:1 COG:FN1434 KEGG:ns NR:ns ## COG: FN1434 COG0457 # Protein_GI_number: 19704766 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 157 813 1 657 657 1122 99.0 0 MKDELLKKIEDLYDLDKHQEIIDMIETLPAEQLNNELIGQLGRAYNNVQNYEKAIEILKS IELEEGNTMRWNYRIGYSYYYLDDYENAEKNFLKAHEINPENEEIKNYLLNIYIELSKQI INKDDSQEAQDKALEYALKSKEYITTDDDRIQCDSYLAWFYDKIGSYDLAEELLKSVISS GRDDIWVNSEYGYCLGELNRLEESLEHYLRAKELGRNDGWIHSQIGWTYRLLGKYEEALE ADFKAQEIGQNDAWINVEIGICYKELEKYEEAIKYYLVANKINEGKNIWLLSDMAWVYGV MENYDEELKYLEEVKKLGRDDEWIYAEYGKVYYKLEQYEKALEFFAKAQKLGQNDAWINV QIARCYKALDKNEVALKAYLKAEKFESDDIWLLSEIAWLYDGIGKYKEGLKYLKRIEKLG RDDCWFNTEYGFCLMRMQKYDKAIEKYKHALELKEELNEEIYLNCQIGFCYRLLEKYEEA LKCHLKAQELGRNDDWINIEIGLCYKELEKYEKALEHYLIAYEQNKEDAWLLSDIGWIYN ELEKYEDGLQFLQKSQELGREDSWIYAEIGQCLGRLGKYEEGIEKLKKGLEILDEDKTNE NTQERIFINSEIGWLYGKIENSDPNEALHYLYAARDLGRDDQWLNAEIGWELGYNDKGKD EEAIKYFERSIELGRDDEWVWARVANIYFDLERYEDALKAYNRAYELEGAYKEGKDSLYI CSIGRTLRRLGKYEEAVEKLLESRRLSLEEGDGVDLEDLELAHCYAVLGDKDKAEEHMKL SLDALGTYAESDEYLKKQFDEIKEMINVLSKPS >gi|296155083|gb|ADVK01000017.1| GENE 10 11470 - 11667 118 65 aa, chain + ## HITS:1 COG:FN1435 KEGG:ns NR:ns ## COG: FN1435 COG1132 # Protein_GI_number: 19704767 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase and permease components # Organism: Fusobacterium nucleatum # 1 65 13 77 77 95 100.0 3e-20 MVYSTSIKMIAILIFACAITLTLFKSLPYILKVQKKLDRMVLVLRERFIGSKIIRAFDNS KKRKR >gi|296155083|gb|ADVK01000017.1| GENE 11 11615 - 12280 885 221 aa, chain + ## HITS:1 COG:FN1436 KEGG:ns NR:ns ## COG: FN1436 COG1132 # Protein_GI_number: 19704768 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase and permease components # Organism: Fusobacterium nucleatum # 9 221 1 174 174 265 80.0 6e-71 
MDQKLLEPLTILKKERDKFNDIAQDYTDNYITINKKFDLLSSMAFSLMSVIITLIIFFGA RKVLNNTLEIGSITAIVEYSLTTIAALIMSSMVLVQMPKAVVSIDRIEEVLAVTSENGMI SQGQQQLLTIAHTILPNPKVMILDEATSNIDTKTEKDIQAVISQLMKGRTSFVIAHRLST IRNADLILVMKDGDIVEQGNHDELMKFDEIYANLYNTQFNQ >gi|296155083|gb|ADVK01000017.1| GENE 12 12357 - 12614 427 85 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704769|ref|NP_604331.1| 50S ribosomal protein L28P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 85 1 85 85 169 100 6e-41 MQRCEITGTGLISGNQISHSHRLTRRVWKPNLQVTTLVVNGSPIKVKVCARTLKTLKGAS EVEVMRILKANIATLSERLLKHLNK >gi|296155083|gb|ADVK01000017.1| GENE 13 12945 - 13682 954 245 aa, chain + ## HITS:1 COG:FN1439 KEGG:ns NR:ns ## COG: FN1439 COG1349 # Protein_GI_number: 19704771 # Func_class: K Transcription; G Carbohydrate transport and metabolism # Function: Transcriptional regulators of sugar metabolism # Organism: Fusobacterium nucleatum # 1 245 1 245 245 392 99.0 1e-109 MLFEDRISLILKLIEQNGSIENSKIIKDLKISEATLRRDLDYLEKEGKIKRVRGGAILKK VARKEIAIKEKNFNKDSKKKIAKLAAQFISNGDYIYLDAGTTTYEIIDYMKGKDIKVVTN GIIHLEKLIANDIETYLIGGRIKKSTLAIVGVKALRDLSEFRFDKAFIGINGINENGYST HDIEEALIKKQAIDNSNKAFILADSSKFDIVYFANVAKLEEATIITDKKEINKNIIKHTQ IINTY >gi|296155083|gb|ADVK01000017.1| GENE 14 13770 - 14699 1189 309 aa, chain + ## HITS:1 COG:FN1440 KEGG:ns NR:ns ## COG: FN1440 COG1105 # Protein_GI_number: 19704772 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) # Organism: Fusobacterium nucleatum # 1 309 6 314 314 560 98.0 1e-159 MIYSVTLNPSIDFIVRVKDFQLGETNRAYEDNFFAGGKGIMVSKLLKNVKTDCVNLGFLG GFTGTFIEQNLKKLNILSDFVTVNENTRVNVKLKTETETEINCQGPKISENEKEEFLNKI RKIKSDDFVILSGSVPSNLGNDFYITIIEILNKNGVNFTLDSSGETFSKSLKYKPFLIKP NKDELKEYAKREFKNNQEIIDYVRENLVDKAEHVIISLGGEGALYIDKNFSFFAQPLRVK ENVVNTVGAGDSVVAGFVSCMLKHNEVEEAFRFAVACGTATSFSEDIGELNFIEEIYNRL VIEKENYGN >gi|296155083|gb|ADVK01000017.1| GENE 15 14689 - 16560 2461 623 aa, chain + ## HITS:1 COG:FN1441_3 KEGG:ns NR:ns ## COG: FN1441_3 COG1299 # 
Protein_GI_number: 19704773 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, fructose-specific IIC component # Organism: Fusobacterium nucleatum # 296 623 1 328 328 577 99.0 1e-164 MEIKDLLKKDLMIMDLKASTKMEAIDEMVAKLKEKNIISDEAVFKDLILKREERSSTGLG EGIAMPHAKTSVVNTPSVLFARSNKGIDYDALDDEPVYIFFMIAASEGAHDLHIETLAKL SKMLLNDDFTQGLKTCGSPDEVYALVDKYSEKTQEAPKEEVKETQNTNKKRILAVTACPT GIAHTYMAEAALKEAGEKLGVDVKVETNGADGIKNNLTTNDINDAVGIIVAADKKVETTR FNNRKVIVTSTADAIKNAETLIKKVLNNEAPVFKAETNDKSEEDTQENDSIGRIIYKSIM SGVSNMLPFVIGGGILLALSFIVERFMGQNQLFKLLFDVGAGAFHFLIPVLAGFIAMSIA DKPGFMPGAVAGYMASQGAGFLGGLIGGFIAGYSVILLKKMTKNMSKQFDGMKSMVIYPI FSLLITGVLMYFIIGPIFTKINLIVANWLNNMGTANAVVLGAILGGMMSVDMGGPINKAA YAFSIGVFTDTGNGAFMAAVMAGGMVPPLAIALAMTLFKNNFDEKEKQSTISNFILGLSF ITEGAIPFAAKEPVKVIGSCIVGAAIAGGLTQFWSVSAPAPHGGIFVIPAMPSVHSAIFF VVSIAIGAVISGVIFGVLRGKKK >gi|296155083|gb|ADVK01000017.1| GENE 16 16652 - 18190 2446 512 aa, chain + ## HITS:1 COG:FN1444_2 KEGG:ns NR:ns ## COG: FN1444_2 COG0519 # Protein_GI_number: 19704776 # Func_class: F Nucleotide transport and metabolism # Function: GMP synthase, PP-ATPase domain/subunit # Organism: Fusobacterium nucleatum # 195 512 1 318 318 633 99.0 0 MKKGGIVILDFGSQYNQLIARRVREMGVYAEVVPFHEDVDKILAREPKGIILSGGPASVY AEGAPSLDIKLFQNNIPILGLCYGMQLITHLHGGKVARADKQEFGKAELELDDKNHILYK NIPNKTTVWMSHGDHVTEMAPDFKIIAHTDSSIAAIENSDKNIYAFQYHPEVTHSQHGFD MLKNFIFGIAKAEKNWSMENYIETTVKQIKERVGNKQVILGLSGGVDSSVAAALINKAIG RQLTCIFVDTGLLRKDEAKQVMEVYAKNFDMNIKCINAEERFLTKLAGVTDPETKRKIIG KEFVEVFNEEAKKIEGAEFLAQGTIYPDVIESVSVKGPSVTIKSHHNVGGLPEDLKFELL EPLRELFKDEVRKVGRELGIPDYMVDRHPFPGPGLGIRILGEVTKEKADILRKADAIFIE ELRKADLYNKVSQAFVVLLPVKSVGVMGDERTYEYTAVLRSANTIDFMTATWSHLPYEFL EKVSNRILNEVKGINRLTYDISSKPPATIEWE >gi|296155083|gb|ADVK01000017.1| GENE 17 18354 - 19283 421 309 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327753|ref|ZP_06870292.1| ## NR: gi|296327753|ref|ZP_06870292.1| hypothetical protein HMPREF0397_0485 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] hypothetical protein HMPREF0397_0485 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 309 1 309 309 528 100.0 1e-148 MEKVFFEYNKNSVTLYEALKNATPPSDMVEIFNKFQKQIYSYTDFEIKNFKKLQNFAEHK QLVNFLCDLLKRVCPHYENKEFYYILVVLTYSQEQKAIEYLKLLTEFIITNQLNLCHILL TIFNEINEPYLLSNKEKLEQYFSELYIQSDYFEFVKRVGLDTSKIEFKKIGGEGSFSFDL TNMEGIQYEFQLDNETDKKKFCKLSISVFTPKGMKNCFDVSLSSECFLLYHHHKDYYVSS KEEIKLLNKETNQEFYIENSTNLLELKNIVRQIESILQIKFVSNIVHSYFTKPLKGKKEL QKWWEDNNF >gi|296155083|gb|ADVK01000017.1| GENE 18 19418 - 20116 854 232 aa, chain - ## HITS:1 COG:no KEGG:CA_C2453 NR:ns ## KEGG: CA_C2453 # Name: not_defined # Def: CBS domain-containing protein # Organism: C.acetobutylicum # Pathway: not_defined # 2 229 10 240 243 110 29.0 4e-23 MEDSISIFRELCNRFEDLVRTKDKIEDAEGVFHHFSIQKENKKFKQDIEVIRKIRNLISH GECKIDGKVAIKINENIIKKFKEIINFLENPPLVTSRYITKMFVVDLEEKLETLIKVMNE KKISHVPVLDKDKKLIGVFSENTIFSKLSDDEIIEIGKEYKVKDYEKYIKIENHSSEYFD FIKRNEELSVAQTLFNKSIKSDKKLVMIFITENGKKSEKILGILTPWDLLDM >gi|296155083|gb|ADVK01000017.1| GENE 19 20544 - 21569 827 341 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|167855185|ref|ZP_02477956.1| 50S ribosomal protein L31 [Haemophilus parasuis 29755] # 4 333 4 336 339 323 47 2e-87 MVEIEDSIYKEIKSILEQARNKVYKVANSTMVQAYWNIGRVIVEKQGGNNKAEYGAALIK NLSKKMTKEFGKGFTVANLKNMRQFYLIFQKSYALRSELTWTHYRLLMRVENENARNFYI EECIKSNWSTRQLERQITTLFYERLLSSKDKEKVSKEIYKLEPQIKKVEDIIKDPYVLEF LGLPENTNFLEKNLEQALIDHLQKFLLELGRGFSFVARQKRITFDGRHFYIDLVFYNYLL KCFVLIDLKVGDLTHQNLGQMQMYVHYFEEEMMNEGDNPPIGIVLCADKSDSIVKYTLSK NETQVFASKYKAYLPSEEELLSEIKREYNMLKQEEELGKDE >gi|296155083|gb|ADVK01000017.1| GENE 20 21753 - 28121 7835 2122 aa, chain - ## HITS:1 COG:no KEGG:Lebu_0671 NR:ns ## KEGG: Lebu_0671 # Name: not_defined # Def: autotransporter beta-domain protein # Organism: L.buccalis # Pathway: not_defined # 523 2122 20 1550 1550 1033 43.0 0 MRNDLYKVERDLRSIAKRYNSVKYSLGLAILFLMLGGDAFSQEINDGNVTTNTIPTREQI TSSKENLKNSVNNLQSKIDDARAENEKGLAGLKLKLIQLMEQGNQVVKSPWASWQFGINY 
MYEQWGSSYKGRGDKPSYEGIFTRSDDIFQRYIHTASTHYSSLLTGNDSTQASANLNGKN SSHFGTTDLQLIEEPIVEVEVSAGVKPRQPKKMEPLTVRVNHNLSFTVPKLPDFEIPDEK VITGKTTPSSGDEIPALGASALYSKNITPRVVTEPQAKDDTIAPIQNTDISNGKIDITFD GKIKTVGANKEIDEENSTMKYSTQGVNFVGKTRGVTDFTYSNHTHNSQKFKYKNYSSIIN YVMGHNDFSINNTTITIGGRNYAHLRSAISIEGDDFSTGASAPNKITTLTIGENTKIFQK TDGSTILFNTLNGKYGNLVNKGTIEVTGKKGSAIDFAYPDNPPENLHPIVENAITGKIIG GGHLPTTEEDYLKIKRGEFVGDGGNTGLVREIAGGRTKSGYHFIQNGTFELRGYRQFGFF TSNNEGTRIYEINKPIKLLGKDGYGMTFYNIGKNFYTEGTIVNKGSYEIMVPRGETGYNS NNVKKSIFRIHMSGEENTALYFIQARNNYKVDSHKNFNIDSVDIVSENAKNNTLVRLERV DNFNLGGEGQYHNLLMKKEKGWKSRLITAHDSTNIKIHEKMEMGVFNSDNVTGITIQNSK RTDSNLTNRGKLVMTGNTSIKSVKDMKPEDLTEAGKGMRGFLANNKATLTNHGDFLFYGG GYKGYAEYYGDDPNDSTKVKFFSKYLERGSYGMNAKYGGKIVSTGVAYIHVKDKKSVGLF ASESKDNISPEITISNAKVIAEDGAINAVANKSGIINFKDNNVLFTKKNALTFLTGYANG VADGKFNIQGNLKAEIEKDGTAFYYTLPNSGNFDFVSWYNANFNHSAGKKLTLNMREGGR VLLLANGKVNLTSLPSMDFSSGALSSLAGKLEITGSRNYIPYSLIESNLKVDRDVNLDSN TDSYNKMQITQSSIVNEKTIAGTKAKQVAIAQENANVKEADKVSLTNKGTIQLTGKKSTG IYGKRGVLLNDSTGKISVGEESAAMYLLEDNKATTMGGKVSNLGDISLGKGSIGIYYSDK DKDGNVFTGSNSNTVGGAYNLKNILSSSENVIGMYLNSDNMAATNNKKYINESTGLIQLL GEHSIGMFAEGNGNYSTENKGRIVLGNASSLINANIGIFAKNEKTFIKNSGSITGGKNTV GLYGYQITTTNTSKIIVGDSGIGVYSGKGNLNLAGDLKIGDKEAKGVYLVGNSAQNVVYK FSNITLGNGSFGLVNIGKNKTITSTVSQVTLNNRNMFMYSEDDLGSITNHTALRSNGDQN YGIYSSGTVTNYGSIDFRNGKGNVGLYGTNKDRVVKNVGNIYVGASQPRENYSIGMAGGY YNTDTNTLVNTGNIENTGNIEVHGERGIGMYATGYGSKAVNRGHIKLIGARSIGMYLDQH ATGENYGTIEATADAIGAVGAVAMNGAIFKNYGQIKLLPAAGSIGTYSGKDSITEDNATL GGQTGTVEAGTKHYSRTASNETEKTVVDETGKPTVKIETKKNPTRVEINGKVVPPTKVDT NSVTNPNYLSVNPGEKSIEDKFTTNRKNNGRIGSIGMYVDTSGVNYTNPIQNLNLLSPKT KVDLIFGIEAAEYTNAKVIQIGEKILKPYNDAMAQATGKKWQIYSASLTWIATAKMNVDG MENAIMAKIPYTNFAKKSDSYNYPFTDGLEQRYGVEALGSRENQVFQKLNGIGKNEPILL AQAFDEMKGKQYANVQQRVQATGNILDKEFKYLKKEWRTASKNSNKVKTFNTKGEYKTNT AGVVDYKNTAYGVVYLHESEDIKLGKGIGWYTGIVNNTFKFKDIGNSKEEQLQAKIGLFK SVPFDDNNSLNWTISGDVFAGYNKMYRKYLVVNEIFNAKSKYYTYGVGVKNEIGKEFRMS EDFSIRPYGALKVEYGRTSKIKEKTGEIKLEVKSNDYFSVKPEIGTELAYKHYFETKSLT 
ASVGLAYENELGKVANGKNKARVAGTTADWYNLRGEKEDRKGNVKTDFNIGLDNQRIGVN ANVGYDTKGHNVRSGVGLRVIF >gi|296155083|gb|ADVK01000017.1| GENE 21 28357 - 39717 15196 3786 aa, chain - ## HITS:1 COG:no KEGG:FN1449 NR:ns ## KEGG: FN1449 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 613 3786 32 3165 3165 4420 90.0 0 MGNNLYKVEKDLRSIAKRYRSVKYSLGLAILFLMLGVGAFSEEVNLENSQVATREELKTS VGNVQTKLNILRNENKEKIKNLRLELIQLMEQGDQVVKSPWSSWQFGMNYFYENWRSTYK GSGDKSEKYVFNGIYTRGNWKVRNAMDMAENQRVGGRPLTPGNDSLNSWKNASNSSGGGV TIDKDTSIGSSTNGNKSWGLVDLRDLREPTNEVEILAHISPKEVTKQVISLNITEPTVQT LEAPDVNPQVNEPLPAPVIELPEVETVTINPLSINAPSAPTAPSAPTINIAITAPSAPVP PTINVVPASPSTPSAPTINIGIQPPSITPLSITTPDSVDAPNVVSPQIKPVDFTVDPGGD SNVYSSEKHNSHQGWKATWDKTINVTELTNRNYVSLTRDVVDSTIPDDQVINVTKPNNRA MVVDEVRKNGIKVDFKGTINLQRSQNVGIDLQGTHQGSISTPLIANIYNSGKIIGNAKYG STANKEQIAFGFNNADGSNNTTMTNMVNKGEITLNAPSSAGIQLKPEDPHDWQPGAPNYW NAAPLQIGNKTPNKIKGRVLMKADNQKDINLNGSGSFGMVTVFNEGVPKRLFAPSYVANY ENLRSERTYDGERVLPGGEIGRSALSDSKYTSGVYNTGNININGDSSIGVGLLQEIQEVK LAGNINIGTKAVAQETDISNLPNAGSKTYDKNKVEEAVGVFAGVPTMPVKPGEKDTLINS SGARNTATSLVGTETVEINGNISLGEHATQSIGALVGDTEIDLNKGQLTENGVNKGSNVD RKLKRSGDITAKSNSVINVGGKKNYGFVVSNSAHSSKFGTTVDSLEYNVDKTHHGKGINE GIINIIGDESVGFAMIKGGNSKSSGTISVKDNAENSIGFYGKEDSFTNSGTIEVTSAKKK NKAVVLDGTNASNKINFTNTGNIYVNTSDNNNTNLDGNGNIGIYAQGNYKVEHNSGIVKA GKDVIAFYVKDSTGEVNINAPIELANSSKGTTIGIYSDGNAKVKFGTGSKLKIGEKAVGL YSADPTKFNNTFKIESGKTLDVELGKNSTFGLLNGNNTVTNSPLLSKYLNNNTSDKINIV SFGEGASLFYATSKAKAILDEDYKVTNGDAISTAVLVANNGANVEIASGKKLETNTNAGL IAINGTVGFTSVAKNNGTLISTRIDKGIGIYTSAANGENSGTITMQNKNAVGILGSKDSN LKNTGKIELEAVSSAGVYAEDSNMINSGTTSEIIVNKETSVGIYAKETSISSVSKNVKNE GKIEIKADGDGKSAGIYSKKEGGAKLTIENTGNIEVAQKASAGIYAKNESTQANTQSEVT NSGLVKMSAENSIGIMGEKSKITNTGTGTKGIEIVEKKSAGILVTNESAVINSGRISLSN SSISASSDGLVGISVDGSSTGENDASGEIKVDAAYSTGMLSSGGGITNAGKIALEKKESV GMYATNANVTNSGATPKGIFIKDEKSVGIYSKINSSSIANKTVTNSGTIDIAGTSKTGSA GIYSIIESGATKKLSVVNNGNITINQKKSVGIYAKNESSHANTESDVTNSGKIEVKNEGS AGVLAEKYKVTNTGSGTNGIIVSAKKSAGIIGKLGSEIINSGIIKTEIATPTVATDGVVG 
ISLNNSKATNTSGGVITLDTDYSTGMYGEANSQLTNEGNITGTNKEYIVGMAGDSSTVTN KNIITLNGKKATGIFGKNSSTLLNETTGKITTKEEESVGMYSSSSLKATNKGTIITEKKT SAGMLGDKANIENDSSITTKEEMSAGMYVKNGTSKATNKGTVTTEKKTSAGILAEIDEAN GGTVSGLNETTGTITVSEETSAGMLGKVKSAVTASTAKLSLTNKKDININTKNSAGIMVV NESTAVGKENVLAENTGIINLTSSSATNEKNIGILANKATGINTGNINVNSKESIGMLGQ NASSITNNKTITLSGEKGIGMLSKDTSSIADNNDTINVNGKESLGMLGEDSGTVKNNKTI SVTAEKGVGIFVRDNGVGKGSGTGENTSTGTITLENKEAVGIFAKNNGTSDSAKNSGTIN LGKADGSTIKESLIGMFAQAEAGKKANVKNTKDINVNTKKSVGIYAKNDASNITDVDLEN TGDININSKESAGVYAPKANISKVGTITLKNSIDSNGSSAVYVSKGGKVANTAGAKINLG TVNQNRVAYYVNGKDSALAGADIGKITGYGVGVYLQGTSGDKATLDKNTSKLDYTLQGTG NGIIGLLLKGETDIQSYTKGIKVGNTVAATSPSDKAKYAIGIYADAQGTAGTPYNITTPI TAGKSGVGIFADKDSNINYTGNMEIGDGTTAGTGIFITKKIGATGGKVTLGTNTIKLKGT KGVVAIASEGTTFNGGNATIELVGSNIQGVGVYAKKGSTVNIDHWTFNNNGNSAEEVRSE EGRVYINANKNLKPKMVLTHVINGETSIATGKTVTSVNDGSITAKENIGLMAEGIKNHSM TWQEGNFEAVNHGIIDFSAAEKSTAMFINSARAKNDGTIKVGKNSIGIYGFYNKDTRKYD GASTNPDPNKLELETTSNSKMSLGDASTGMYLTNAEKIENKGGQITSESGATKNVGIYAV NGQDTKVSANNKILNMTTATNINLGNGSVGLYSKGQSNTVRNTVTNTGDITVGDKITGSP SVAMYAENTNLTTNSKITVGKDGIAFYGKNSDITAKGSANFSNKGVLAYLENSKFVSHLG NLGSTQNTMLYLKNSIAQLDGAGTKVDMDVADGYTGAYIEGNSTLTGVKTIKLGQDSTGL FLKDANFVSNAESITGTKDKARGILATNSNLTNNSKIFLSGAESIGIYSNANNTKSVVNN GELTIAGKKTLGVFLKGSQSFENKANINIADSANSLEPTIGIYTAEGSSNIKHTSGTIDV GQKSIGIYSTTNSNVEMNGGKIHVKDQGVGIYKQNGKATIKGELDIDTHIATTKDSEPTG VYAVNGTEINDQASKISIGAKSYGFILNNTDPNKTNIYTNTDAGTVSLGNDSVFLYSNGK ANIINNRTINANGASHLIAFYIKNGGNFTNNGTIDFSTGKGNIGIYAPGGKATNKGRVYV GKTDDIDPMTGKVYSDVSKIVYGIGMAADNGGYIKNDGEIRIYNNKSIGMYGKGIGTTVE NTGKIYLDGSRATATDKIQSMTGVYVDDGAKFINRGEIRTTDSYAGRDGKVNENVTGLVG VAVMNGSTLENHGKILIDADNSYGVIIRGKRDSKGNVERYAVIKNYGEIKVRGKGTWGIS WKDVSQAYIDELQKQINDKISSDPEGQALRAATGTNKDYEGVTITVKNGKPTFLRNGVPI SDSEVEQIGKLIGKESNLGLSDIGFYVDTLGRTKPIDIDGATPPINSQLIIGTEYSEKTN KKQWFVKGDVIKPFLDQIQGRNFKLTSIAGSLTWIATPVLDNYGQITGVAMAKLPYTSFV NKTDNAWNFADGLEQRYDMNALDSVEKRIFNKLNSIGKNEQTLLTQAYDEMMGHQYANVQ QRVQATGIVLDKEFSHLRNSWSNPSKDSNKVKTFGMKGEYKTDTAGVIDYKYNAYGVAYV 
HENEDIKLGKGTGWYTGIVHNTFKFKDIGNSKEKQLQAKVGLFKSVPFDENNSLNWTISG DIFIGHNKLERKFLVVDEIFHAKSKYYTYGIGIKNEIGKEFRLSEDFSIRPYGALKVEYG RVSKIKEKSGEMKLEVKENDYLSIRPEIGTELAYRHYFGTKTLRTSVGVAYENELGRVAN GKNKARVAGTTADWFNIRGEKEDRKGNVKVDLNVGIDNQRLGVTGNVGYDTKGHNVRGGV GLRVIF >gi|296155083|gb|ADVK01000017.1| GENE 22 39893 - 41311 1926 472 aa, chain - ## HITS:1 COG:FN1450 KEGG:ns NR:ns ## COG: FN1450 COG2985 # Protein_GI_number: 19704782 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Fusobacterium nucleatum # 5 472 1 468 468 828 99.0 0 MHFDVVGFIFNSLVLLFFTMTLGNLFGNVKFRKFNFGITGTLFIGLFVGYFLTKYAVTIT EGSKYFSKAQNVLKGSIIDNSIMNLSLLIFIVGTGLLAAKDMKYAITKFGKQFVVIAIFI PFVGAVASYGFSQIFSKMSPYQITGTYTGALTSSAGLAAATESSEAESRHLASEFQDLSE GTKTKILAIINNAKERDAKLKNEDIPEKMIVENTPTLSAEDTEVYVTEAKAGVGVGHSIG YPFGVLFLILGINFIPKIFRFDVEKEKEKYFAQKKIDLSSDKDAGKNTIPEVKMDFVGFS IAVFLGYFLGSIKISMGPLGTFSLGSIGGAIIVALILGFIGKIGPITFRMDSVVLGKMRT YFLSIFLAGTGLNYGFRVVEAVTGDGIMIAIVSALVAILSVLFGFLLGHYVFHINWTLLS GAITGGMTSAPGLGAAIDALDCDEPAVSYGATQPLATLCMVIFSIIIHKLPI >gi|296155083|gb|ADVK01000017.1| GENE 23 41517 - 42599 1595 360 aa, chain - ## HITS:1 COG:FN1451 KEGG:ns NR:ns ## COG: FN1451 COG0206 # Protein_GI_number: 19704783 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Cell division GTPase # Organism: Fusobacterium nucleatum # 1 360 1 360 360 585 100.0 1e-167 MTDAIKDLVKIKVIGVGGGGGNAINDMLYSGVTGVEYIAANTDKQDLEKSLADVKLQIGE KLTKGQGAGASPETGRLAAEEDIEKIQELLKGTDMLFITAGMGGGTGTGAAPVIAKAAKE LDVLTVAVVTKPFNFEGERRKNNAESGIELLRQNVDSLVIIPNDKLFDLPDKSITLQNAF KEANNILRIGIKAVVDLVLGQGFINLDFADIKSVLKDSDIAVLGFGDGEGENRAMKAAEK ALQSPLLEKSIQGADKILINLMTSQDVGLSESQTVTDVIRQAAGKKIEDVMFGVTIVPEF TDRIEITIIANNFKEGVDSNTDSPIRMDSAKPAEPLKETERKKEPEEEIDIPPWMRNNKR >gi|296155083|gb|ADVK01000017.1| GENE 24 42622 - 43965 1679 447 aa, chain - ## HITS:1 COG:FN1452 KEGG:ns NR:ns ## COG: FN1452 COG0849 # Protein_GI_number: 19704784 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: 
Actin-like ATPase involved in cell division # Organism: Fusobacterium nucleatum # 1 447 1 447 447 799 100.0 0 MKDDTIRKVALDIGNNRIKLLVGEMSSDFQRLAVTNYVKTRSNGISKSLIENPEALAIAL KEAVSKAESVESPITRLSLALGGSGIHSATVNVKTSFPEKEIEKTDMDNLLRQAKRQIFG GREGQYRILYKEVYNKKIDISSGIVKEPIGMVGKELQADIHLVYVDDNYVQKFIQVVNKI GIDIDRIYLNSYASAKGTLDDETKKMGVAHVDIGYGSTSIIILKYGKVLYAKTKPVGEMH YISDLSIILKIPKEGAEEILNKLKNKQIEADNTIRYGAKKVTLREIKDIILARTGDIIDF ITTTIDESGFNGHLTKGIVLSGGAVEIDGVSEQIASRSGYLTRRMLPIPLKGLKDAFYSD AVAIGIFLEDMEREYRAYLEESRQPAPTVKEKKEIIKEETTSNNKINDKKMDRKEEIDSF LEDVEEIEPKKEDGKIKSFFKWFGELF >gi|296155083|gb|ADVK01000017.1| GENE 25 43962 - 44672 657 236 aa, chain - ## HITS:1 COG:no KEGG:FN1453 NR:ns ## KEGG: FN1453 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 46 236 1 191 191 291 98.0 2e-77 MGIRLLFLSGIIYLIYMLPQNFFRLDYFNIDKVNITDNSKMLQNELTKLAEKLYNKSNIY IDSNEIKEYIEKDIRVESAKVEKNSLGEITIDVKEKDLVYYAVIGKNIYLTDKEGKIFAY LNEKEVQGVPFIIANSEEEIQEISRFLNEISDLAIFKKISQIYKVNDKEFVIILTDGVKI KTNRITDNNDEINKEKENKRYLIAEQLYFNMSKERKIDYIDLRFNDYIIKYLGDSR >gi|296155083|gb|ADVK01000017.1| GENE 26 44685 - 45548 1360 287 aa, chain - ## HITS:1 COG:FN1454 KEGG:ns NR:ns ## COG: FN1454 COG1181 # Protein_GI_number: 19704786 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: D-alanine-D-alanine ligase and related ATP-grasp enzymes # Organism: Fusobacterium nucleatum # 1 287 1 287 287 527 99.0 1e-150 MRIAVFMGGTSSEKEISLKSGEAVLESLQKQGYDAYGVILDERNQVSAFVDNDYDLAYLV LHGGNGENGKIQAVLDILGKKYTGSGVLASAITMDKDKTKQIAQSVGIKTPKSYRTVEEI ERFPVIIKPVDEGSSKGLFLCNNKEEAEEAVKKLAKPIIEDYIIGEELTVGVLNGEALGV LKIIPQADVLYDYDSKYAKGGSVHEFPAKIENKSYKEAMKIAEKIHSEFGMKGISRSDFI LSEGELYFLEVNSSPGMTKTSLIPDLATLKGYSFDDVVRITVETFLE >gi|296155083|gb|ADVK01000017.1| GENE 27 45563 - 46408 1049 281 aa, chain - ## HITS:1 COG:FN1455 KEGG:ns NR:ns ## COG: FN1455 COG0812 # Protein_GI_number: 19704787 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramate dehydrogenase # Organism: 
Fusobacterium nucleatum # 1 281 1 281 281 509 99.0 1e-144 MKIFTNQEMKNYSNMRVGGKAKKLIILETKEEIIDVYNDKENTDIFILGNGTNILFTDEY MDKIFVCTKKLNKIEDLGKNLVKVETGANLKDLTDFMKDKNYTGIESLFGIPGSIGGLVY MNGGAFGTEIFDKIVSIEVFDEKHQIREIKKEDLKVAYRKTEIQDKNWLVLSATFKFDNG FDAARVKEIKELRESKHPLDKPSLGSTFKNPEGDFAARLISECGLKGTIIGNAQIAEKHP NFVLNLGNATFKDIIDILTLVKKSVLEKFGIKLEEEIIIVR >gi|296155083|gb|ADVK01000017.1| GENE 28 46405 - 47787 1765 460 aa, chain - ## HITS:1 COG:FN1456 KEGG:ns NR:ns ## COG: FN1456 COG0773 # Protein_GI_number: 19704788 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramate-alanine ligase # Organism: Fusobacterium nucleatum # 1 460 9 468 468 877 99.0 0 MEKIYFIGINGIGMSGLAKIMKCKGYDVKGADICTNYVTEELLSMGIVVYNEHDEENVKG ADYVIASTAIKESNPELSYAKNNGIEILKRGELLAKLLNRETGIAVAGTHGKTTTSSMLS AVMLSKDPTIVVGGILPEIKSNAKPGKSEYFIAEADESDNSFLFMNPKYAVITNIDADHL DVHGNLDNIKKSFIKFICHTQKEAIICLDCENLKEVVTRLPEEKTVTTYSIKDESANVFA KNIKIEDRKTIFELYINKELIGEFSLNIPGEHNIQNSLPVIYLAFKFGVSKEEIQEALNK FKGSKRRYDVLFDKELENGYGNKTKRVRIVDDYAHHPTEIKATLKAIKSIDTSRLVAIFQ PHRYSRVHFLLDEFKDAFVNVDKVILLPIYAAGEKNEFNISSEELKEHINHNNVEVMNEW KDIKRYVSRVKKDSTYIFMGAGDISTLAHEIAEELEGMSE >gi|296155083|gb|ADVK01000017.1| GENE 29 47792 - 48856 1440 354 aa, chain - ## HITS:1 COG:FN1457 KEGG:ns NR:ns ## COG: FN1457 COG0707 # Protein_GI_number: 19704789 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase # Organism: Fusobacterium nucleatum # 1 354 4 357 357 668 99.0 0 MKKVMLTTGGTGGHIYPALAVADRLKIKGIEAVFVGSTERMEKDLVPESGHKFIGVDISV PRGLKNIRKYLKAIRTAYKVIKEEKPDAIIGFGNYISVPVIIAGILLRKKIYLQEQNVNI GSANKMFYKIAKMTFLAFDKTYDDIPIKSQSRFKVTGNPLRKEIDGLKYATEREKLGIKP SEKVLLITGGSLGAQEINNIVMKYWEKFCADKNLRIFWATGNNFEQLKKVRKTKKENDRI EPYFNDMLNVMAAADLVVCRAGALTISEIIELEKPAIIIPYGSIKVGQYENAKVLTDYDA AYVFTRDELDESMKKVFEIIRNDEKLKKMRIRLKPLKKPNAAEEIIASLDIWRN >gi|296155083|gb|ADVK01000017.1| GENE 30 48866 - 50164 1778 432 aa, chain - ## HITS:1 COG:FN1458 KEGG:ns NR:ns ## COG: FN1458 COG0771 # 
Protein_GI_number: 19704790 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramoylalanine-D-glutamate ligase # Organism: Fusobacterium nucleatum # 1 432 23 454 454 749 99.0 0 MKKAMIYGLGISGTGAKELLEKEGYEIIVVDDKKAMTSEEALNHLEGLEFFIKSPGIPYN DFVKEVQKRGIKILDEIEIAYNYMIEKGLKTKIIAITGTNGKSTTTAKISDMLNHAGYKA TYAGNIGRSLSEVLLKEKDLDFISLELSSFQLENIENFKPCISMIINIGPDHIERYKSFD EYYNIKFNITKNQTEDLYFIENIDDEEIEKRAKQIKAKRISVSKFKKADIFVENDKIYHD KDDIIDVDKLSLKGIHNLENTLFMVATAEILKLDREKLKGFLMIATPLEHRTELFFNYGK LKFINDSKATNVDSTKFAIQANKNSILICGGYDKGVDLAPLAEMIKENIKEVYLIGVIAD KIENELKKVGYEDNKIHKLVNIENSLQDMKKRFTKDSDEVILLSPATSSYDQFNSFEHRG KVFKELVLKIFG >gi|296155083|gb|ADVK01000017.1| GENE 31 50164 - 51249 1456 361 aa, chain - ## HITS:1 COG:FN1459 KEGG:ns NR:ns ## COG: FN1459 COG0472 # Protein_GI_number: 19704791 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase # Organism: Fusobacterium nucleatum # 1 361 1 361 361 597 100.0 1e-171 MLYFLAEHLAKLEFLKSIYLRAFLGFVISFCIVLFAGRPFIKYLKIKKFGEEIRDDGPSS HFSKKGTPTMGGVLIIAAVLLTSIFINDLTNSLILLVLLSTIMFAAIGFIDDYRKFTVSK KGLAGKKKLLFQGAIGLIIWAYIYFIGLTGRPMVDLSLINPISAYPYYIGAIGLFFLIQI VLMGTSNAVNITDGLDGLAIMPMIICSTILGVIAYFTGHTELSSHLHLFYTVGSGELSVF LSAVTGAGLGFLWYNCYPAQIFMGDTGSLTLGGILGVIAIILKQELMLPIMGFIFVLEAL SVILQVGSFKLRGKRIFKMAPIHHHFELMGIPESKVTMRFWIGTLIFGIIALGAIKMRGI L >gi|296155083|gb|ADVK01000017.1| GENE 32 51368 - 51541 214 57 aa, chain + ## HITS:1 COG:no KEGG:FN1460 NR:ns ## KEGG: FN1460 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 57 1 57 57 74 100.0 1e-12 MSRVNFGMFEANLLASLPNLQRILNFYSLRNLASNELFLLIFMIVLLYPPKQSPLML >gi|296155083|gb|ADVK01000017.1| GENE 33 51492 - 53321 2184 609 aa, chain - ## HITS:1 COG:FN1461_2 KEGG:ns NR:ns ## COG: FN1461_2 COG0770 # Protein_GI_number: 19704793 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl 
pentapeptide synthase # Organism: Fusobacterium nucleatum # 194 609 1 416 416 723 99.0 0 MKKAIFLDRDGTINVEKDYIYKSEDLVFEEGTIEALKTFKNLGYILIVVSNQSGIARGYF TEKDLNIFNNNMNEILKKNGVEITEFYCCPHHPDGIGGYKKVCKCRKPNNKMIEDAITKY NIDREKSYMIGDKTSDIGAGLKSNLKTVLVKTGYGLKDMEKIDKNETLICENLKDFSEIL KREKLNELIFEEFSKKVQIKNVVMDSRKVTEGSLFFAINNGNSYVKDVLDKGASLVIADN IDVKDERIIKVSDTIATMQDLAIKYRKKLDIQVVGITGSNGKTTTKDIVYSLLSVKAKTL KTEGNYNNHIGLPYTLLNVTDEEKFVVLEMGMSSLGEIRRLGEISSPNYAIITNIGDSHI EFLKTRDNVFKAKTELLEFVIKENTFVCGDDEYLAKLDVNKIGFNNSNTYKIESYKFSDK GSKFVLDGKEYEMSLLGKHNISNTAIAIELAKKIGLTDEEIENGLKEIKISNMRFQEIKI GEDIYINDAYNASPTSMKAAIDTLNEIYNDKYKVAILGDMLELGDNEIDYHIDVLNYLLD KKIKLIYLYGERMKKAYNMFMKSKSEEYRFWYYPTKEGIVESLKNIKMEKVILLKASRGT ALEDIIKLS >gi|296155083|gb|ADVK01000017.1| GENE 34 53425 - 54834 1061 469 aa, chain - ## HITS:1 COG:FN1462 KEGG:ns NR:ns ## COG: FN1462 COG1167 # Protein_GI_number: 19704794 # Func_class: K Transcription; E Amino acid transport and metabolism # Function: Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs # Organism: Fusobacterium nucleatum # 1 469 1 469 469 813 98.0 0 MIILNLVNKSKVPLYIQIYTEIKELIQNKVLKANEKLPSKKDFIDYYNISQNTIQNALYL LLEEGYIFSIERKGYFVSDIENLIVHNVKIENKRQYKEKKKIHYDFSYSGVDSKSLAKTI FKRITKDVYDEENEDLLFQGHIQGDFLLRKSICEYLSQSRGFKAEAEQLIISSGTEYLFY IIFKLFNNKIYGLENPCHKMFKELFLTNDVSFKAISLDESGIMIEDLKKYNVNIAYVTPS HQFPTGTIMSISRRTELLNWANETPNCYIVEDDYDSEFKYTGRPIPALKASDINDKVIYL GSFSKSISPAIRVSYLVLPKALLNIYQKKLPYFICPVPTLNQKILYRFIKDGYFVKHINK MRTLYKKKREYLVNTIKMYSSEILGKEISIQGADAGLHLVIKLNKKINEKMFLKECLENS LQLYSLEEYYLEKIYNETSSFLLGYANLANKEIDEGILLLLQILKKYYI >gi|296155083|gb|ADVK01000017.1| GENE 35 54941 - 55783 1549 280 aa, chain + ## HITS:1 COG:FN1463 KEGG:ns NR:ns ## COG: FN1463 COG0214 # Protein_GI_number: 19704795 # Func_class: H Coenzyme transport and metabolism # Function: Pyridoxine biosynthesis enzyme # Organism: Fusobacterium nucleatum # 1 280 1 280 280 509 100.0 1e-144 
MDTRFNGGVIMDVTTKEQAIIAEEAGAVAVMALERIPADIRAAGGVSRMSDPKLIKEIMS AVKIPVMAKVRIGHFVEAEILQAIGIDFIDESEVLSPADSVHHVNKRDFSTPFVCGARNL GEALRRISEGAKMIRTKGEAGTGDVVQAVSHMRQIIKEINLVKALREDELYVMAKDLQVP YDLVKYVHDNGRLPVPNFSAGGVATPADAALMRRLGADGVFVGSGIFKSGDPKKRAKAIV EAVKNYNNPEIIAKVSEDLGEAMVGINENEIKIIMAERGV >gi|296155083|gb|ADVK01000017.1| GENE 36 55843 - 57594 2638 583 aa, chain - ## HITS:1 COG:FN1464 KEGG:ns NR:ns ## COG: FN1464 COG1154 # Protein_GI_number: 19704796 # Func_class: H Coenzyme transport and metabolism; I Lipid transport and metabolism # Function: Deoxyxylulose-5-phosphate synthase # Organism: Fusobacterium nucleatum # 1 583 1 583 583 1124 98.0 0 MYLEKINSPEDVKKLNIEEMKVLAQEIREAIIKRDAIHGGHFGPNLGIVEATIALHYVFE SPKDKFVFDVSHQTYPHKILTGRREAFTDEAHYDDVTGYSNQNESEHDHFILGHTSTSIS LALGLAKARDVKGETGNVVAIVGDGSLSGGEALEGLDFAGGELKSNFIIVVNDNEMSIAE NHGGLYKNLKLLRETEGKAECNLFKAIGLEYIFIKDGNNLEELIETLKKVKDINHPIVVH IYTQKGKGYKPAEEDKESWHYTMPFNIENGKPLNEDIEDYCNITKEYLIKKMKEDKTVVT ITAGTPGDFGFSKKERDELGSQFVDVGIAEQTAVAMSSGMASKGAKPVFTVVSSFIQRTY DQLSQDLCINNNPATIVVSYGGAIGMTDVTHLGWFDIAMMSNIPNLVYLAPTTKEEHLAM LEWSIGQQEHPVAIRIPGGKVVSNGKKVTKNFSKLNSYEVNQKGEKVAIIGLGTFYQLGE KAAKLYEEKTGVKVTVINPMYITGVDEKLLEELKKDHSIVITLEDGVLDGGFGEKIARFY GNSDMKVLNYGLKKEFLDRYNIGKVLTKNRLKANLIVEDLLKF >gi|296155083|gb|ADVK01000017.1| GENE 37 57619 - 57714 145 31 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKQRKQIEETLEKLNYKIARYEVAVETGKLT >gi|296155083|gb|ADVK01000017.1| GENE 38 58155 - 58799 167 214 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|42631297|ref|ZP_00156835.1| COG0697: Permeases of the drug/metabolite transporter (DMT) superfamily [Haemophilus influenzae R2866] # 1 203 71 277 290 68 26 8e-11 SAFAFQTYGLKYTTVSKQSFLTSLYIVIIPLLDFIFFKNKIKKNVIALFFLILVGLLFIS FKNFKNFEFYLNFGDILTILCALGFALNILLFSKIKKFNINIINITIIQMLTIGILAFIF QIIFEKKVINFSMNFSLIYLILICTMLNFTIQNISQKYVPAHIMGLILSLEAIFGTVFAI IFLNETISQNFIIGTFLIAIAVILIQYFENKRGD >gi|296155083|gb|ADVK01000017.1| GENE 39 58800 - 59084 332 94 aa, chain + ## HITS:1 COG:no KEGG:FN1466 NR:ns ## KEGG: FN1466 # Name: 
not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 94 1 94 94 169 96.0 4e-41 MFPNILDNAIVYFYTQKGDYGSVFYNNGKIECIHYLAICSYDNKEYYLFHCNNKFEVIAD YLFDSIEECKDMASKCKKDIVWVEQLENYSRIVR >gi|296155083|gb|ADVK01000017.1| GENE 40 59157 - 59699 500 180 aa, chain + ## HITS:1 COG:no KEGG:FN1467 NR:ns ## KEGG: FN1467 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 180 5 184 184 298 98.0 1e-79 MLKIRNKFFTFNEDEKEHREYLSIWNKETEVHFFGTINKDYEKNLEEKLVWLEKNRLPII DKFIQEDNTFNSINQMIENKELEINNQKILNKITEEDFKNSFYAELINFEFLDDTITIDL YMGTKPDYLMGHFINVKITSNFNISVNHAYILPSLINQAIQRNFRKPWKKDDKILIFPYV >gi|296155083|gb|ADVK01000017.1| GENE 41 59680 - 60273 665 197 aa, chain - ## HITS:1 COG:FN1468 KEGG:ns NR:ns ## COG: FN1468 COG1853 # Protein_GI_number: 19704800 # Func_class: R General function prediction only # Function: Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family # Organism: Fusobacterium nucleatum # 1 186 1 186 197 328 99.0 4e-90 MKKRNLKGSVVLNPVPVVLITCKNSEGKDNVFTVAWVGTVCSKPPILSISIRPERLSYDY IKETMEFIINLPSKKQTKEVDFCGVRSGRQIDKIKECGFTLQEGKKVKSSYIKECPINIE CKVKDIIKLGSHDMFIAEVLCSHIDEDLFDEKDKIHFEKANLISYSHGEYFSLSKEAIGK FGYSVMKKKKKAKHKGK >gi|296155083|gb|ADVK01000017.1| GENE 42 60275 - 61597 1106 440 aa, chain - ## HITS:1 COG:FN1469 KEGG:ns NR:ns ## COG: FN1469 COG0534 # Protein_GI_number: 19704801 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 13 440 13 440 440 626 97.0 1e-179 MFKKKIFKTIFKYAIPNVISMWIFTLYTMIDGVFISRFVGSTALAGVNLVLPLINFIFSI SIMIGVGSSTLMAIKFGENKYDEGNKIFTLSTFLNLFLGIFISAIILLNIDRVINILGAN KSQEVYRYVKEYLTIIVFFSVFYMSGYAFEIYIKIDGKPSYPAICVLVGGLTNLVLDYVF VVIFHYGVTGAAIATGISQVASCTMLLLYITLNAKYVKFIKLNKINFEKISKILKTGFSE FLTEISSGILILIYNLVILKKIGVLGVSIFGTVSYITSFITMTMIGFSQGIQPVISYYLG KKNDKNLKEILKISIIFLGLLGIFCFIFISLFSEYIGKIFFREQDMILYVKRVLRIYSLS YLIIGVNIFVSAYFTAIKKVIYSALITFPRGILFNSILLLILPNIFGNKVIWIVSFLSEV 
LTIFICIYLLKKIKRKGILN >gi|296155083|gb|ADVK01000017.1| GENE 43 61776 - 62924 1521 382 aa, chain + ## HITS:1 COG:FN1470 KEGG:ns NR:ns ## COG: FN1470 COG3055 # Protein_GI_number: 19704802 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 11 382 1 372 372 705 99.0 0 MIKKIYCLLLLVFLSSFGYASQKTLSIENNRLVWDYAGSLPAQKGFDKNIGTAGLLQGII GNYIVVGGGANFPEALEKGGKKVTHKDLYLLKDVNGKLKTIEQIQLDYPIAYGASVSVKE ENAIYYLGGSPDSEHMRDVLKVTLKNGKLKTEIYAKLPLGFENGVAQYKDGKIYYGVGKI ENSEGKNVNSNKFYVFDLKTKETKELAEFPGEARQQTVGQILNNKFYVFSGGSNISYIDG YAYDFKTNTWKKVADVVVDNEKILLLGANSIKISENKMLVIGGFDYKLWNEANYHLSNLK DEKLKAYKASYFGAEPQSYNWNRKILIFDVTKNSWKSIGEVPFDAPCGAALLLMNNNIYS INGEIKPGVRTERMYKAYIISK >gi|296155083|gb|ADVK01000017.1| GENE 44 62942 - 63943 1155 333 aa, chain + ## HITS:1 COG:FN1471 KEGG:ns NR:ns ## COG: FN1471 COG1609 # Protein_GI_number: 19704803 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Fusobacterium nucleatum # 1 333 1 333 333 561 99.0 1e-160 MITQKELAARLGLSRTTIARAINNSPNIKPETKEKILKLVKELGYEKNYVGSLLASKKKI VYSFIVESRNSYYTEQIKLGIKGAKKEYKHYNLEIIEIVTNINKPMEQVLKLKKLLNSDK QIDGIIIIPLDKTKILELINPYLERIKFITVSVFLSKKIAYVGTDYQKCGRLAAELLTKT LNSNDKVLVIDNGDDNVSSKYYLNGFLDRANNDKMNIVGPIKKNGVEESLQFLKTILKKE NISSIFINRYAQDILLELSDNILKRQKNITTGIGNRIRKLIMERKILATVADDVYSTGYK ACQLMVDMLYKEFGKSVKKIILEPQILLMENLK >gi|296155083|gb|ADVK01000017.1| GENE 45 63963 - 64946 378 327 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|149199369|ref|ZP_01876406.1| Ribosomal protein L22 [Lentisphaera araneosa HTCC2155] # 22 307 45 329 346 150 30 3e-35 MKKTSLLKAVILFGVMTTSAFAAKYNLKMGMTAGTSQNEYKAAEVFAKELKKRSNGEIEL KLYPNAQLGKDDLAMMQQLEGGALDFTFAETGRFSTFFPEAEVFTLPYMIKDFNHMKKAV NTKFGKDLFKKVHDKKGMTVLAQAYNGTRQTTSNKAIKSLADMKGMKLRVPGAAANLAYA KYTEAAPTPMAFSEVYLALQTNAVDGQENPLSTIKAQKFYEVQKYLAMTNHILNDQLYLV SNITMEELPENLQKVVKESAEVAAEYHTKLFMDEEKSLKDFFKSKGVTITEPNLADFKKA MKPFYDEYIKKNGKVGENAIKAIEAVR >gi|296155083|gb|ADVK01000017.1| GENE 46 64970 - 66823 698 617 aa, 
chain + ## PROTEIN SUPPORTED ## NR: gi|126646729|ref|ZP_01719239.1| Ribosomal protein L16 [Algoriphagus sp. PR1] # 191 613 4 428 431 273 34 2e-72 MKVFNKLEEWLGGSLFIGMFVILVMQIFSRQIFNSPLIWSEELSRLIFVYVGLLGVSMGI RSQQHIMIDFLYAKFPKSMQKIIFTIIQILILACLIFFLYFGYDLFIKKEEIEIVSLGIS MKWMYLALPLITLLMLVRFYQAYSENYAQNKVYIKPIFILALMIILVLIAFIKPELFKIL KLSNYFDLGEMTIYYVLIAWLVMIFFGVPVGWSLLVACILYFALTRWKVVYFAADKLVYS LDSFSLLSVPFFILTGILMNGAGITERIFNFAKAMLGHYTGGMGHVNVAASLIFSGMSGS AIADAGGLGQLEIKAMRDEGYDDDICGGLTAASCIIGPLVPPSISMIIYGVIANQSIAKL FLAGFVPGFLTTIALMIMNYFVCKKRGYKKTAKASPKERWIAFKKSFWALLTPILIIGGI FSGIFTPTEAAVIATFYSIILGGFIYKELTVKSFFKHCVEAVAISGVTVLMIMTVTFFGD IIAREQVAMRVAEIFIKYATSPMMVLVMINLLLLFLGMFIDALALQFLVLPMLIPIAEQV GIDLVFFGVMTTLNMMIGILTPPMGMALFVVAQVGKMSVSTVAKGVLPFLLPIFITLVII TIFPQIILFLPNLIVGG >gi|296155083|gb|ADVK01000017.1| GENE 47 66826 - 67701 373 291 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163762640|ref|ZP_02169704.1| ribosomal protein L33 [Bacillus selenitireducens MLS10] # 6 288 8 317 323 148 30 1e-34 MNILAIDIGGTMIKYGLVSFDGKILSTDKIKTEASKGLNNILNKIDNIFKRYKENNPVGI AVSGTGQINGMIGKVIGGNPIIPNWIGTNLVKILEEKYNLPIVLENDVNCVALGEKWIGA GKDLSNFICLTIGTGIGGGIILNNQLFRGENFVAGEFGHILIKKGEFEQFASTTALIRLV KERTGKTLNGKEIFDLEKKEIVEYQEVISEWIENLAEGLSSIIYCFNPANIILGGGVIEQ GEPLINRIKNSLFKKIGPQFKEKLNITQAKLGNNAGMIGASYLLLEKINKR >gi|296155083|gb|ADVK01000017.1| GENE 48 67723 - 68595 1268 290 aa, chain + ## HITS:1 COG:FN1475 KEGG:ns NR:ns ## COG: FN1475 COG0329 # Protein_GI_number: 19704807 # Func_class: E Amino acid transport and metabolism; M Cell wall/membrane/envelope biogenesis # Function: Dihydrodipicolinate synthase/N-acetylneuraminate lyase # Organism: Fusobacterium nucleatum # 1 290 1 290 290 541 98.0 1e-154 MKGIYSALMVPYNEDGSINEKGLREIIRYNIDKMKVDGLYVGGSTGENFMISTEEKKRVF EIAIDEAKDSVNLIAQVGSINLNEAVELGKYVTKLGYKCLSAVTPFYYKFDFSEIKDYYE TIVRETGNYMIIYSIPFLTGVNMSLSQFGELFENEKIIGVKFTAGDFYLLERVRKAFPDK LIFAGFDEMLLPATVLGIDGAIGSTYNINGIRAKQIFELAKNSKITEALEIQHTTNDLIE GILSNGLYQTIKEILKLEGVDAGYCRKPMKKISQEQVEFAKELHKKFLKN 
>gi|296155083|gb|ADVK01000017.1| GENE 49 68610 - 69284 981 224 aa, chain + ## HITS:1 COG:FN1476 KEGG:ns NR:ns ## COG: FN1476 COG3010 # Protein_GI_number: 19704808 # Func_class: G Carbohydrate transport and metabolism # Function: Putative N-acetylmannosamine-6-phosphate epimerase # Organism: Fusobacterium nucleatum # 1 224 1 224 224 358 98.0 3e-99 MNKILESIRGKLIVSCQALEDEPLHSSFIMGRMAYAAYSGGAAGIRANTVDDIKEIKKNV SLPIIGIIKKVYNNSDVYITPTIKEVEDLINEGVQIIAIDATKRERPDRKDLKDFIAEIK EKYPSQLFMADISSVDEALYAEKIGFDIVGTTLVGYTDYTKNYKALEELEKVVKVVKIPV IAEGNIDTPLKAKKALEIGAFAVVVGGAITRPQQITKKFVDEMK >gi|296155083|gb|ADVK01000017.1| GENE 50 69443 - 70330 867 295 aa, chain + ## HITS:1 COG:HI0687 KEGG:ns NR:ns ## COG: HI0687 COG0697 # Protein_GI_number: 16272629 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Haemophilus influenzae # 1 294 1 298 304 313 68.0 2e-85 MRKDYFTSFFYIILMGFGFPIMRFMSIHFETVNNNAVRFLSGGFLFILICIFKFREELKK ILLEPKIILKLLLLGIFMSGNMYFFINGLKYTSALAGSIFGILAMPLAIIMAAIFFKDER SKIKQKKFYIGSIFAFIGSLLFVLYGNKVGESLKFLKGTLFLGTAIFIQSIQNLLVKNVA KKLHAIVISASTATLSGIIYLILSIHTGKIIQLKEVGEGMLIGLSLAGIYGMLTGMLMAF YIVQKQGVIVFNIIQLLIPVSTAIVGYFTLGETINFYQGIGAIIVIFGCIIALKI >gi|296155083|gb|ADVK01000017.1| GENE 51 70391 - 70726 641 111 aa, chain - ## HITS:1 COG:no KEGG:FN1478 NR:ns ## KEGG: FN1478 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 111 1 111 111 102 93.0 5e-21 MKKGIFAMFILVASMAMVACTNANATNEGAAGEKDAFKALEKRREYYKEQDKERAKMKAE MQTSTMSEETPMMAPEEDTKAQEKAMKEAAEAKEKADKEALKILEKKRKTN >gi|296155083|gb|ADVK01000017.1| GENE 52 70766 - 71353 593 195 aa, chain - ## HITS:1 COG:no KEGG:FN1479 NR:ns ## KEGG: FN1479 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 195 1 188 188 307 96.0 2e-82 MRKNLEILDKIYNLRYKSGKVHLFYSINKLVGRFGNVVSLDKIYVSKDYLSYLSEKLFQD 
KNRLISFFGGNNKYVRLSLVHEFMQDFGRDIAQDIKDDFLELKQYNSSIFKATKERMLVL KENENEDITDEDIVLIQSYLSNWKKLQDKIKHFIPEEFYSQKINYFYTSLLSYIKFLEKL NPDYESGIKYLQAIN >gi|296155083|gb|ADVK01000017.1| GENE 53 71377 - 72708 1591 443 aa, chain - ## HITS:1 COG:FN1480 KEGG:ns NR:ns ## COG: FN1480 COG2239 # Protein_GI_number: 19704812 # Func_class: P Inorganic ion transport and metabolism # Function: Mg/Co/Ni transporter MgtE (contains CBS domain) # Organism: Fusobacterium nucleatum # 1 443 7 449 449 760 99.0 0 MEEIIQLLEQNRLAELKEILIKENPIDVADVFEEFPKEKDLIIFKLLPKDFSSEVFSYLS PEKQQEVIENITDEEIKFIMEDMYLDDTVDFIEEMPANIVDKILKNTSHDKRKLINQMLK YPENSAGSVMTVEYISFKDNYTVKQAIDYYRKIAIDKEETDICFVTDSKKKLVGIISLKT LILSNDDSYIKDEMDTNFVSVLTKDDQEETAALFRKYDLTTMPVVDHEDRLVGVITVDDI VDVIDQENTEDIQKMAAMNPSDEEYLKESVMSLAKHRIIWLLVLMISATFTGLVIKKYEE VLQSAVYLAVFIPMLMDTGGNAGSQSATLVIRGIALEEIEFSDIFKVIWKELRVSVLVGF ILSGINFLRIYYFTKSGFETSLVVAISMFLTIIMAKVIGGVLPLIAKSLKIDPAIMASPL ITTIVDTAALIIYFQLSVIFLHI >gi|296155083|gb|ADVK01000017.1| GENE 54 72742 - 73863 1559 373 aa, chain - ## HITS:1 COG:FN1481 KEGG:ns NR:ns ## COG: FN1481 COG0343 # Protein_GI_number: 19704813 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Queuine/archaeosine tRNA-ribosyltransferase # Organism: Fusobacterium nucleatum # 1 373 1 373 373 761 100.0 0 MKLPVTYKVEDKDGKARAGVITTLHGEIETPVFMPVGTQATVKTMSKEELLDIGSEIILG NTYHLYLRPNDELIARLGGLHKFMNWDRPILTDSGGFQVFSLGSLRKIKEEGVYFSSHID GSKHFISPEKSIQIQNNLGSDIVMLFDECPPGLSTREYIIPSIERTTRWAKRCVEAHQKK DIQGLFAIVQGGIYEDLRQKSLDELSEMDENFSGYAIGGLAVGEPREDMYRILDYIVEKC PEEKPRYLMGVGEPVDMLNAVESGIDMMDCVQPTRLARHGTVFTKDGRLVIKSERYKEDT KPLDEECDCYVCKNYSRAYIRHLIKVQEVLGLRLTSYHNLYFLIKLMKDAREAIKEKRFK EFKEKFIQRYEGK >gi|296155083|gb|ADVK01000017.1| GENE 55 73878 - 76055 2655 725 aa, chain - ## HITS:1 COG:FN1482 KEGG:ns NR:ns ## COG: FN1482 COG0317 # Protein_GI_number: 19704814 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Guanosine polyphosphate pyrophosphohydrolases/synthetases # Organism: Fusobacterium nucleatum # 
1 725 1 725 725 1355 100.0 0 MNNYWEQLLDKAKANHLNLDFDKIKLALGFAEESHQGQYRKSGDDYIIHPVEVAKILMDM KMDTDTIVAGLLHDVVEDTLIPIADIKYNFGDTVATLVDGVTKLKTLPNGTKNQAENIRK MILAMAENIRVILIKLADRLHNMRTLKFMKPEKQQSISKETLDIYAPLAHRLGMAKIKSE LEDIAFSYLHHDEFLEIKRLVDNTKEERKDYIENFIRTIIRTLSDLDIKAEVKGRFKHFY SIYKKMYQKGKEFDDIYDLMGVRVIVEDKATCYHVLGIVHSQYTPVPGRFKDYIAVPKSN NYQSIHTTIVGPLGKFIEIQIRTKDMDDIAEEGIAAHWNYKENKKSSKDDNIYGWLRHII EFQNESDSTEDFIEGVTGDIDRGTVFTFSPKGDIIELPVGATALDFAFMVHTQVGCKCVG AKVNGRMVTIDHKLKSGDKVEIITSKNSKGPSIDWLDIVVTHGAKGKIRKFLKDENKETV TKIGKDNLEKEASKLGMTLKELENDLTLKKHMEKNNIPNLEEFYFYIGEKRSRLDVLITK IKTSLEKERAASTLTIEEVLKKKEEKKKEGKNDFGIVIDGINNTLIRFAKCCTPLPGDDI GGFVTKLTGITVHRKDCPNFHAMVEKDPSREILVKWDENLIETKMNKYNFTFTVVLNDRP NILMEIVNLIANHKINITSVNSYEVKKDGDRIVKVKISIEIKAKTEYDYLINNILKLKDV ISVER >gi|296155083|gb|ADVK01000017.1| GENE 56 76074 - 76586 901 170 aa, chain - ## HITS:1 COG:FN1483 KEGG:ns NR:ns ## COG: FN1483 COG0503 # Protein_GI_number: 19704815 # Func_class: F Nucleotide transport and metabolism # Function: Adenine/guanine phosphoribosyltransferases and related PRPP-binding proteins # Organism: Fusobacterium nucleatum # 1 170 1 170 170 333 100.0 6e-92 MDLKNYVASIENYPKEGIIFRDITPLMNDGEAYKYATEKIVEFAKDHHIDIVVGPEARGF IFGCPVSYALGVGFVPVRKPGKLPREVIEYAYDLEYGSNKLCLHKDSIKPGQKVLVVDDL LATGGTVEATIKLVEELGGVVAGLAFLIELVDLKGRERLDKYPMITLMQY >gi|296155083|gb|ADVK01000017.1| GENE 57 76601 - 77398 806 265 aa, chain - ## HITS:1 COG:FN1484 KEGG:ns NR:ns ## COG: FN1484 COG0457 # Protein_GI_number: 19704816 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 21 265 1 245 245 318 98.0 6e-87 MTVNKKIVIVLGLSILISGCLSANKEKNYNFIKGLNKYQKNDKVSALENYKKAYEIDKNN VVLLNEIAYLYVDLGNYEEAENYYKKALEVKPNDENSLKNLLQLLYLQNKRTEMKKYIPM VIDRNSFVYNLNNFRVAILENEEAEVEKSLLKISSNDRFLEEYNESFYIDLASVGGLSNN TIKYSNTIFEKAYKKYSNKNKDIVKIYANFLIDIKEYRKAEDILMKYIVNNEDNLDEHVL LKKLYTKENNKQKLENLKKILRNKI >gi|296155083|gb|ADVK01000017.1| GENE 58 77414 - 77998 719 194 aa, chain - ## HITS:1 COG:FN1485 KEGG:ns 
NR:ns ## COG: FN1485 COG2928 # Protein_GI_number: 19704817 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 194 30 223 223 319 98.0 2e-87 MAFRIINNTAIIKVLKKLVYFGFGEKADAFYIQILVYIVAALIILFSITLLGYMTKLVFF SKIIKKATDILERIPIIKTVYSAVKQITEIAYSDNGESVYKKVVAVEFPRKGIYAIGFLT ADKNTSLKEFLEDKEIVNVFVPTAPNPTSGFLLCVPREDIHPLNMTVEWAFKLIVSGGYI TEELVKEKEEKITE >gi|296155083|gb|ADVK01000017.1| GENE 59 78082 - 79362 1603 426 aa, chain - ## HITS:1 COG:FN1486 KEGG:ns NR:ns ## COG: FN1486 COG1253 # Protein_GI_number: 19704818 # Func_class: R General function prediction only # Function: Hemolysins and related proteins containing CBS domains # Organism: Fusobacterium nucleatum # 1 426 1 426 426 717 99.0 0 MDTYLNVLILVVLILLSGFFSAAETALSAYRSNYLEKLDEEKHPKKYAVMKKWLKEPNAM LTGIVICNNIVNILASSIATVVIVNYFGNKGSSVALATAIMTILILIFGEITPKLMARNN SAKIAEKVSVVIYVLSIILTPAVSCLIFISRLVGRILGVDMTSPQLMITEEDIISFVNVG NAEGIIEEDEKEMIHSIVTLGETSAKEVMTPRTSMLAFEATKTINEVWDEIIDNGFSRIP IYEETIDNIVGILYVKDLMEHIKNNELNLPIKQFIRSAYFVPETKSIIEILKEFRGLKVH IAIVLDEYGGVVGLVTIEDLIEEIVGEIRDEYDDEEESFFKKIADNEYEVDAMTDIETIN KDLELELPISEDYESLGGLIVTTTGKICEVGDEVQIDNIYLKVLEVDKMRVSKVFIRILE KDKEEE >gi|296155083|gb|ADVK01000017.1| GENE 60 79461 - 79559 71 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSRANLTVFEADLSASLANFLETLSNLLFRVF >gi|296155083|gb|ADVK01000017.1| GENE 61 79622 - 80083 663 153 aa, chain - ## HITS:1 COG:FN1487 KEGG:ns NR:ns ## COG: FN1487 COG4492 # Protein_GI_number: 19704819 # Func_class: R General function prediction only # Function: ACT domain-containing protein # Organism: Fusobacterium nucleatum # 1 153 1 153 153 241 100.0 4e-64 MAFKKKDTENKEFYIVDKRILPKSIQNVIKVNDLILKTKMSKYSAIKKVGISRSTYYKYK DFIKPFYEGGEDRIYSLHLSLKDRVGILSDVLDVIAREKISILTVVQNMAVDGVAKSTIL IKLSESMQKKVDKIISKIGKVEGIADIRITGSN >gi|296155083|gb|ADVK01000017.1| GENE 62 80099 - 80950 1099 283 aa, chain - ## HITS:1 COG:FN1488 KEGG:ns NR:ns ## COG: FN1488 COG0190 # Protein_GI_number: 19704820 # Func_class: H Coenzyme 
transport and metabolism # Function: 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase # Organism: Fusobacterium nucleatum # 1 283 1 283 283 499 99.0 1e-141 MLMDGKELARDIKVKIKTEIDNIKKIHNINPMVATILVGDDPASQVYLNSQVKSYQDLGI GVQKYFFSEEISEAYLLNLIDKLNKDTEVDGIMINLPLPPQINATKVLNSIKLIKDVDGF KAENLGLLFQNNEGFTSPSTPAGIMALIEKYNIDLEGKDVVVVGSSNIVGKPIAALILNS RGTVTICNIYTKNLAEKTKNADILISAVGKAKLITEDMVKEGAVVIDVGINRVNGKLEGD VDFENVQKKASHITPVPGGVGALTVAMLLSNILKSFKANRGII >gi|296155083|gb|ADVK01000017.1| GENE 63 80944 - 81861 1042 305 aa, chain - ## HITS:1 COG:FN1489 KEGG:ns NR:ns ## COG: FN1489 COG0223 # Protein_GI_number: 19704821 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionyl-tRNA formyltransferase # Organism: Fusobacterium nucleatum # 1 305 13 317 317 573 98.0 1e-163 MGTPTFAVPSLEKIYKEHEIISVFTKVDKPNARGKKINYSPIKEFALANDLKIYQPENFK DSTLIEEIRNMQADLIVVVAYGKILPKEIIDIPKYGVINLHSSLLPRFRGAAPINAAIIN GDTKSGVSIMYVEEELDAGDVILQEETEISDEDTFLSLHDRLKDMGADLLLKAIELIKKG EVKAQKQDKKLVTFVKPFRKEDCKIDWTKTSREIFNFIRGMNPIPTAFSNLNGTIIKIYE TKINDKVYNNATCGEVVEYLKGKGIVVKTSDGSLIISSAKPENKKQMSGVDLINGKFLKI GEKLC >gi|296155083|gb|ADVK01000017.1| GENE 64 81898 - 82347 615 149 aa, chain - ## HITS:1 COG:FN1490 KEGG:ns NR:ns ## COG: FN1490 COG1327 # Protein_GI_number: 19704822 # Func_class: K Transcription # Function: Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains # Organism: Fusobacterium nucleatum # 1 149 1 149 149 234 100.0 5e-62 MKCPFCSSEDTKVVDSRTTIDGSTKRRRECNNCLKRFSTYERFEESPIYVVKKDNRRVKY DREKLLRGLTFATAKRNVSREELDKIITDIERSLQNSLISEISSKDLGEKVLEKLRELDQ VAYVRFASVYKEFDDIKSFIEIVEEIKKD >gi|296155083|gb|ADVK01000017.1| GENE 65 82366 - 82839 645 157 aa, chain - ## HITS:1 COG:FN1491 KEGG:ns NR:ns ## COG: FN1491 COG1762 # Protein_GI_number: 19704823 # Func_class: G Carbohydrate transport and metabolism; T Signal transduction mechanisms # Function: Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type) # Organism: 
Fusobacterium nucleatum # 1 157 6 162 162 281 99.0 2e-76 MVNSIKITDYITEDLIDLDLKSKNREGILIELSELLEKSPNIRGEEKDIYKALVDREKLG STGIGKGVAIPHAKTESATGLTVAFGISKEGIDFNSLDEEEVHLFFVFASPNKDSQIYLK VLARISRLIREEEFRENLFNCKTPKEVIDCIREKEEN >gi|296155083|gb|ADVK01000017.1| GENE 66 82833 - 83387 492 184 aa, chain - ## HITS:1 COG:FN1492 KEGG:ns NR:ns ## COG: FN1492 COG1381 # Protein_GI_number: 19704824 # Func_class: L Replication, recombination and repair # Function: Recombinational DNA repair protein (RecF pathway) # Organism: Fusobacterium nucleatum # 1 184 50 233 233 298 100.0 4e-81 AVDILSLTDFQFYKKNDSLIISNFSTVKDYIGIKSDIDKINIAFYIFSILNQILVENGRN RKIYEVLEKTLDYLNTSNDERKNYLLILFFLNTLIKEEGISLEDNSNMEELQVVVETQRK VEIDDNVKRILQYLFEDNLKVVINDEKYKINYIRKAILVLENYINFHLDTNINAQKILWG ALLW Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:22:16 2011 Seq name: gi|296155056|gb|ADVK01000018.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00021, whole genome shotgun sequence Length of sequence - 21780 bp Number of predicted genes - 26, with homology - 26 Number of transcription units - 9, operones - 6 average op.length - 3.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 3 - 206 218 ## FN1219 hypothetical protein + Term 380 - 435 1.2 + Prom 391 - 450 13.3 2 2 Tu 1 . + CDS 477 - 1397 645 ## PROTEIN SUPPORTED gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 + Term 1407 - 1446 6.3 + Prom 1444 - 1503 8.8 3 3 Tu 1 . + CDS 1529 - 2395 1142 ## COG3878 Uncharacterized protein conserved in bacteria + Term 2495 - 2533 0.2 - Term 2652 - 2701 6.8 4 4 Op 1 1/1.000 - CDS 2715 - 3104 714 ## COG4922 Uncharacterized protein conserved in bacteria 5 4 Op 2 1/1.000 - CDS 3119 - 3646 829 ## COG0778 Nitroreductase 6 4 Op 3 1/1.000 - CDS 3675 - 4562 1268 ## COG2877 3-deoxy-D-manno-octulosonic acid (KDO) 8-phosphate synthase 7 4 Op 4 1/1.000 - CDS 4501 - 5958 1886 ## COG0769 UDP-N-acetylmuramyl tripeptide synthase 8 4 Op 5 . 
- CDS 5958 - 6638 819 ## COG0692 Uracil DNA glycosylase 9 4 Op 6 . - CDS 6651 - 7244 719 ## BCB4264_A2363 SMI1 / KNR4 family - Prom 7280 - 7339 8.5 - Term 7324 - 7368 6.5 10 5 Op 1 . - CDS 7395 - 7835 371 ## FN1229 hypothetical protein 11 5 Op 2 1/1.000 - CDS 7819 - 8562 1016 ## COG2849 Uncharacterized protein conserved in bacteria 12 5 Op 3 1/1.000 - CDS 8578 - 10041 2210 ## COG0516 IMP dehydrogenase/GMP reductase - Prom 10074 - 10133 9.3 13 5 Op 4 . - CDS 10135 - 10476 466 ## COG1733 Predicted transcriptional regulators - Prom 10522 - 10581 10.8 + Prom 10317 - 10376 10.6 14 6 Op 1 . + CDS 10586 - 11116 586 ## COG2249 Putative NADPH-quinone reductase (modulator of drug activity B) + Prom 11192 - 11251 7.5 15 6 Op 2 . + CDS 11274 - 11981 727 ## COG2340 Uncharacterized protein with SCP/PR1 domains + Prom 12020 - 12079 9.4 16 7 Op 1 1/1.000 + CDS 12100 - 12858 759 ## COG0666 FOG: Ankyrin repeat 17 7 Op 2 . + CDS 12864 - 13139 368 ## COG0666 FOG: Ankyrin repeat + Term 13141 - 13212 14.2 - Term 13136 - 13192 5.1 18 8 Op 1 3/0.333 - CDS 13253 - 14221 758 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily 19 8 Op 2 1/1.000 - CDS 14194 - 15798 1689 ## COG0510 Predicted choline kinase involved in LPS biosynthesis - Prom 15865 - 15924 10.8 20 9 Op 1 . - CDS 15935 - 17128 746 ## COG3307 Lipid A core - O-antigen ligase and related enzymes 21 9 Op 2 . - CDS 17144 - 17788 754 ## FN1239 hypothetical protein 22 9 Op 3 . - CDS 17798 - 18520 747 ## FN1240 lipopolysaccharide core biosynthesis protein RfaY 23 9 Op 4 1/1.000 - CDS 18522 - 19253 622 ## COG3774 Mannosyltransferase OCH1 and related enzymes 24 9 Op 5 3/0.333 - CDS 19268 - 20350 1191 ## COG0726 Predicted xylanase/chitin deacetylase 25 9 Op 6 3/0.333 - CDS 20365 - 21225 952 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 26 9 Op 7 . 
- CDS 21230 - 21778 659 ## COG0726 Predicted xylanase/chitin deacetylase Predicted protein(s) >gi|296155056|gb|ADVK01000018.1| GENE 1 3 - 206 218 67 aa, chain + ## HITS:1 COG:no KEGG:FN1219 NR:ns ## KEGG: FN1219 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 67 85 151 151 111 100.0 9e-24 WANGEIFEYFLHHKKEKEIKLITDIHSLNEKELQFIKDLDNFLNNKGKVLKFFNVHNGKY QNLKEIL >gi|296155056|gb|ADVK01000018.1| GENE 2 477 - 1397 645 306 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148988856|ref|ZP_01820271.1| 50S ribosomal protein L9 [Streptococcus pneumoniae SP6-BS73] # 2 301 3 300 308 253 46 1e-66 MLVNSVIDLIGNTPLVKINNIDTFGNEIYVKLEGSNPGRSTKDRIALKMIEEAEKEGLID KDTVIIEATSGNTGIGLAMICAVKNYKLKIVMPDTMSIERIQLMRAYGTEVILTDGSLGM KACLEKLEELKKNEKKYFVPNQFTNVNNPKAHYETTAEEILKDLNNKVDVFICGTGTGGS FSGTAKKLKEKLPNIKTFPVEPASSPLLSKGYIGPHKIQGMGMSIGGIPAVYDGNLADDI LVCEDDDAFEIMRELSFKEGILGGISTGATFKAALDYSKENADKGLKIVVLSTDSGEKYL SNICDL >gi|296155056|gb|ADVK01000018.1| GENE 3 1529 - 2395 1142 288 aa, chain + ## HITS:1 COG:FN1221 KEGG:ns NR:ns ## COG: FN1221 COG3878 # Protein_GI_number: 19704556 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 288 1 288 288 520 98.0 1e-147 MDFKKIITEILDNLKKNEITISTKFNNNSEIVDKSKIGGKPYLPKDFTWPYYQELPLSFL AQINLEEVSSLDKDKLLPDKGMLYFFYELETQEWGYSPQDKGCAKVFYFENTTNFTLINF PKDMEDYYKIPEFKVNFKSNISLPSYEDFDNLNEEKNILEKYKTYENFKEFENKLFDEYS DICDEYMESLKNYTKLLGYPDIIQDSMEEECAAVTRGFNMGGIGYPKKYKEEIKKASKDW ILLFQMDTVESNDYELMFGDCGYLYFWIKKEDLANKNFENIWLILQCC >gi|296155056|gb|ADVK01000018.1| GENE 4 2715 - 3104 714 129 aa, chain - ## HITS:1 COG:FN1222 KEGG:ns NR:ns ## COG: FN1222 COG4922 # Protein_GI_number: 19704557 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 129 1 129 129 247 100.0 5e-66 MNNQLEQNKINAIAFYKTMFDGEPEKAIKLYVGDEYRQHNPMVADGKAGIIEYFTRMKKE 
YPIKEVKFVRAIAQGDLVAMHTHQIWGEPDNKEYVTMDFFRFDENNKIVEHWDSIQEVIK NTKSGRTMY >gi|296155056|gb|ADVK01000018.1| GENE 5 3119 - 3646 829 175 aa, chain - ## HITS:1 COG:FN1223 KEGG:ns NR:ns ## COG: FN1223 COG0778 # Protein_GI_number: 19704558 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Fusobacterium nucleatum # 1 175 1 175 175 362 100.0 1e-100 MDEVLKVIRERRSIRKFRSDMLPKEIIDKVIESGLYAASGKGQQSPIIISITNKELRDKL SKMNCKIGGWKEDFDPFYNAPVVLVVLAPKDWPTYIYDGSLVIGNMMLAAHSLNIGSCWI HRAKQEFESEEGKEILKSLGINGEYEGIGHCVLGYIDGNYLNTPARKENRVFYID >gi|296155056|gb|ADVK01000018.1| GENE 6 3675 - 4562 1268 295 aa, chain - ## HITS:1 COG:FN1224 KEGG:ns NR:ns ## COG: FN1224 COG2877 # Protein_GI_number: 19704559 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 3-deoxy-D-manno-octulosonic acid (KDO) 8-phosphate synthase # Organism: Fusobacterium nucleatum # 10 295 1 286 286 578 99.0 1e-165 MIKKLQEEKLSEERWWKMLINDINKVKVGNIVFGGKKRFVLIAGPCVMESQELMDEVAGG IKEICDRLGIEYIFKASFDKANRSSIYSYRGPGLEEGMKMLTKIKEKFNVPVITDVHEAW QCKEVAKVADILQIPAFLCRQTDLLIAAAETGKAVNIKKGQFLAPWDMKNIVVKMEESRN KNIMLCERGSTFGYNNMVVDMRSLLEMRKFNYPVIFDVTHSVQKPGGLGTATSGDREYVY PLLRAGLAIGVDAIFAEVHPNPAEAKSDGPNMLYLKDLEEILKIAIEIDKIVKGV >gi|296155056|gb|ADVK01000018.1| GENE 7 4501 - 5958 1886 485 aa, chain - ## HITS:1 COG:FN1225 KEGG:ns NR:ns ## COG: FN1225 COG0769 # Protein_GI_number: 19704560 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylmuramyl tripeptide synthase # Organism: Fusobacterium nucleatum # 1 485 1 485 485 911 99.0 0 MNIFSGIEYKVLKDVNLDRKYDGIEYDSRKIKENYIFVAFEGANVDGHNYIDSAVKNGAT CIIVSKKVEMKHNVSYILIDDIRHKLGYIASNFYEWPQRKLKIIGVTGTNGKTSSTYMIE KLMGDIPITRIGTIEYKIGDEVFEAVNTTPESLDLIKIFDKTLKKKIEYVIMEVSSHSLE IGRVEVLDFDYALFTNLTQDHLDYHLTMENYFQAKRKLFLKLKDINNSVINVDDEYGKRL YDEFIVDNPEIISYGIENGDLEGDYSDDGYIDVKYKNQIEKVKFLLLGDFNLYNTLGAIG IALKIGISMEEILKRVSNIKAAPGRFEALDCGQDYKVIVDYAHTPDALVNVIVAARNIKN GSRIITIFGCGGDRDRTKRPIMAKVAEDLSDVVILTSDNPRTESPEQIFDDVKKGFIKSD 
DYFFEPDREKAIKLAINMAEKNDIILITGKGHETYHIIGTKKWHFDDKEIARREIVRRKM VENVN >gi|296155056|gb|ADVK01000018.1| GENE 8 5958 - 6638 819 226 aa, chain - ## HITS:1 COG:FN1226 KEGG:ns NR:ns ## COG: FN1226 COG0692 # Protein_GI_number: 19704561 # Func_class: L Replication, recombination and repair # Function: Uracil DNA glycosylase # Organism: Fusobacterium nucleatum # 1 226 1 226 226 416 99.0 1e-116 MSKINNDWKEILEEEFEKEYFVKLKETLEEEYKNYTVYPPKRDILNAFFLTPYSEVKVVL LGQDPYHQRGQAHGLAFSVNYGIKTPPSLVNMYKELQDDLGLYIPNNGFLEKWSKQGVLL LNTTLTVRDSEANSHSKIGWQTFTDNVIKSLNEREKPVIFILWGNNAKSKEKFIDTNKHY ILKGVHPSPLSANKGFFGCKHFSEANRILKNLGEKEIDWQIENKEI >gi|296155056|gb|ADVK01000018.1| GENE 9 6651 - 7244 719 197 aa, chain - ## HITS:1 COG:no KEGG:BCB4264_A2363 NR:ns ## KEGG: BCB4264_A2363 # Name: not_defined # Def: SMI1 / KNR4 family # Organism: B.cereus_B4264 # Pathway: not_defined # 1 190 20 209 216 139 40.0 5e-32 MGGYIRETKIEAPAKEEEILEIEKKLGYSLPEDFRDILLNYSSHFEYYWTSDRESDNRII ELPNNLKSIFGTNLHWGLDLLLDFEDGRKNWIDICFLDYNDEYDKVWYNKLPFYDCGNGD YIAIELEKENYGKIVYLSHDGGDNHGYYLADNFKDLLDNWSKVGAVLEWELFFTEGKGID PECENAKLLRKYIFSKI >gi|296155056|gb|ADVK01000018.1| GENE 10 7395 - 7835 371 146 aa, chain - ## HITS:1 COG:no KEGG:FN1229 NR:ns ## KEGG: FN1229 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 146 1 146 146 233 100.0 2e-60 MKNGNSMLISRVKQVYLYIFSNFNEEWNSEVKKILSKEEFLIFSEMKNYDKVHSYSLYQK IKSNNILSSKKIYLKLALLHDSGKGKVGLFRRIKKVLIGDKILEKHPEIAFEKLKKINFE LAELCLQHHNKDVDEKMKIFQELDDK >gi|296155056|gb|ADVK01000018.1| GENE 11 7819 - 8562 1016 247 aa, chain - ## HITS:1 COG:FN1230 KEGG:ns NR:ns ## COG: FN1230 COG2849 # Protein_GI_number: 19704565 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 86 247 1 162 162 284 100.0 9e-77 MNKFTKFFILAGVLFNFSLLSAEIKEVESLDQISNEIVGGKTEKKATKEKSVETAKNTED VKDIPEESATRTVDKNSIVDIYERKMKDKIAYKEGSNIPFTGVFGVVIDDKIESYEEYKN GLLDGETAYFAKGKQVKLLSEMYTKGKLNGQQKSYYENGKLKSIVYYSNDKINGIESYDR 
SGNLLHKSIFQGGTGDWKFYWSNGKVSEEGKYKAWRKDGVWKKYREDGSLDTVIKYDNGR LLSEKWQ >gi|296155056|gb|ADVK01000018.1| GENE 12 8578 - 10041 2210 487 aa, chain - ## HITS:1 COG:FN1231_3 KEGG:ns NR:ns ## COG: FN1231_3 COG0516 # Protein_GI_number: 19704566 # Func_class: F Nucleotide transport and metabolism # Function: IMP dehydrogenase/GMP reductase # Organism: Fusobacterium nucleatum # 203 487 1 285 285 529 99.0 1e-150 MNGKIVKEGITFDDVLLIPAKSDVLPNEVSLKTRLTKKITLNLPILSAAMDTVTESDLAI ALARQGGMGFIHKNMSIEEQAAEVDRVKRSESGMITNPITLNKDSRVYQAEELMSRYKIS GLPVIEDDGKLIGIITNRDIKYRKDLDQPVGDIMTSKGLITAPVGTTLEQAKEILLANRI EKLPITDQNGYLKGLITIKDIDNIIQYPNACKDELGKLRCGAAVGVAPDTIERVSALVKA GVDIITVDSAHGHSQGVINMIKEIKKNFPDLDIVGGNIVTAEAAKELIEAGVAAVKVGIG PGSICTTRVVAGVGVPQLTAVNDVYEYCKDKNIGVIADGGIKLSGDIVKALAAGGDCVML GGLLAGTKEAPGEEIILEGRRFKIYVGMGSIVAMKRGSKDRYFQAGEVDNSKLVPEGIEG RIAYKGSVKDVVFQLAGGIKAGMGYCGTKTIKDLQINGRFVKITGAGLIESHPHDITITK EAPNYSK >gi|296155056|gb|ADVK01000018.1| GENE 13 10135 - 10476 466 113 aa, chain - ## HITS:1 COG:FN1232 KEGG:ns NR:ns ## COG: FN1232 COG1733 # Protein_GI_number: 19704567 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 113 1 113 113 211 100.0 3e-55 MENKSCVESNVKLEDTGFGYTLSLIGGKYKMIIIYKLYENSPFMRYNELKRSIGNISFKT LTSTLKELEEDNIIIRKEYPQIPPRVEYSLSKKGRTLIPILNMMCDWGEKNSI >gi|296155056|gb|ADVK01000018.1| GENE 14 10586 - 11116 586 176 aa, chain + ## HITS:1 COG:FN1233 KEGG:ns NR:ns ## COG: FN1233 COG2249 # Protein_GI_number: 19704568 # Func_class: R General function prediction only # Function: Putative NADPH-quinone reductase (modulator of drug activity B) # Organism: Fusobacterium nucleatum # 1 176 5 180 180 335 97.0 3e-92 MKTLIVVAHPNLKDSKVNESWLKETEKYPDKFTIHNLYEAYPNEVIDNIEKEQKLIEEHD SLILQFPIYWFNCPPFMKKWLDDVFTDGWAYGKNGNNLENRNIGLAVTAGISENNYSEDG KYKHSLKEILLPFEMTFDYCSANYKGFHAFYSAEFEATDKRIKDSIPQYINFLNSI >gi|296155056|gb|ADVK01000018.1| GENE 15 11274 - 11981 727 235 aa, chain + ## HITS:1 COG:FN1234 KEGG:ns NR:ns ## COG: FN1234 COG2340 # 
Protein_GI_number: 19704569 # Func_class: S Function unknown # Function: Uncharacterized protein with SCP/PR1 domains # Organism: Fusobacterium nucleatum # 18 235 1 218 218 372 95.0 1e-103 MKKFFKIFMLFTFIFNTLTTYSIDIKKKYSDKYMIDLHTWLPNTFEELKTINEDELYRMA VEKYHYHQDKSNFYSQKEFKQIEKFINVEKLNQYFVERLNKERAKLGLSSNVRIDNTLIK AAKIRSNELAVAKRISHKRPNKTEYWTVFEKVDKNLVDKYSFENILKVSISNEAQMISEK FIANYFFDSWKESPEHWKFMVDPELKKIGVNFSFGSSDDTNFLVQINYGVLLGMR >gi|296155056|gb|ADVK01000018.1| GENE 16 12100 - 12858 759 252 aa, chain + ## HITS:1 COG:FN1235 KEGG:ns NR:ns ## COG: FN1235 COG0666 # Protein_GI_number: 19704570 # Func_class: R General function prediction only # Function: FOG: Ankyrin repeat # Organism: Fusobacterium nucleatum # 1 245 1 245 477 405 97.0 1e-113 MELQDLKDIYKSKTAEKRYDYYRTLPLDEKDLFGDTLLDIALNFADVEALKILLERNVDV NKINNHGNRPLHNLILLSEYKNLDDVLTCAELLLDNGASVLRKNDIGETPVLSAVRRNYF EILELFIKRNLKLDLKDSYGNGPLHIAAYSCNPNNEEKKKKIFKLLLDAGIENDIKNDNG DTPIDILSNQKTDPILLSILKGKYDFDNPNNITNLTSAMSLYSAILSNNYDVIKAHLEME IDIQKKIIILMV >gi|296155056|gb|ADVK01000018.1| GENE 17 12864 - 13139 368 91 aa, chain + ## HITS:1 COG:FN1235 KEGG:ns NR:ns ## COG: FN1235 COG0666 # Protein_GI_number: 19704570 # Func_class: R General function prediction only # Function: FOG: Ankyrin repeat # Organism: Fusobacterium nucleatum # 16 91 402 477 477 107 98.0 6e-24 MGVACHILDLKSVELLLKNGADVGSIDKNGYTVLMYTALNTENGVALEIAKMLHDFGDVK ISYINNDEQSAMDIAVENNNENLVNWLLTKI >gi|296155056|gb|ADVK01000018.1| GENE 18 13253 - 14221 758 322 aa, chain - ## HITS:1 COG:FN1236 KEGG:ns NR:ns ## COG: FN1236 COG0697 # Protein_GI_number: 19704571 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 322 1 322 322 444 99.0 1e-125 MDKSYRYGVIAGIFSGITWALYTIINNLITKNTIFNSYIEKMFIPVLVIVFLHDFFSSIW LFFYLWRKKKIFELKKTIKSKNIFLIFLGALFGGPIGMSGYLLGIKYMGASYTASFSSTY 
LIVGTILSVIFLKEKINLKMIIAVLISMVGIFILNFQINEMDSNKISILGIFFLMLCIFG WALEGLIASYILKYKNTDIEPSIAIFIRQLTSTIFYSFLIIPYIGAYNLVFIVLKSNIVL YIALISVIGSLSFFLWYYSMSIIGVARGISLNVSYIIWTIIFEIILFNAKFQLNFIVASI LFIVSVILIAMSPEEKYIKELE >gi|296155056|gb|ADVK01000018.1| GENE 19 14194 - 15798 1689 534 aa, chain - ## HITS:1 COG:FN1237_2 KEGG:ns NR:ns ## COG: FN1237_2 COG0510 # Protein_GI_number: 19704572 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted choline kinase involved in LPS biosynthesis # Organism: Fusobacterium nucleatum # 267 534 1 268 268 471 98.0 1e-132 MNAIIIAAGMGTRLNPLTLSTPKPLIKIFGKPMIEKNIEYLLQEGIEEIVIVTGYMKDKF EYLRDKYKEVKLIYNPKYKEYNNIYSFYLARDFLKDSYILDGDIYLTRNIFKKEIDESKY FSKKINMFNNEWQLLLNNDGKIRKVEIGGSENYIMSGISFFTNKDCQKLKKIVEIYVKDE IKLKKYYWDHIIKENIHEFDIGIEKIQDNIIYEIDNLEELVELDKSYKDMLPINSFKNEI QKLKEILISNLKIDLKDIGNIQFIGGMTNKNYLVEINSKKYVLRKPGEGTESIINRYNEK NNLKLVSKINIDSNLYFFDEKSGIKLSEYINNSEMLTPSNAKYNLEEVAFILKKLHNSQI IFPNIFDPFKEMKRYEELINKEDGKFYEGYFELKKEVFKLKEVLKSFNIELVSCHNDTVP ENFLKKGNNLFLIDWEYSGLNDPIWDLAAFSIESNLSDDEEKELLDYYFENSINSTIKIR MEIHKICQDFLWSIWTIFKEMNGVSFGEYGIKRLVSAQKRLEDLKLWINLTDME >gi|296155056|gb|ADVK01000018.1| GENE 20 15935 - 17128 746 397 aa, chain - ## HITS:1 COG:FN1238 KEGG:ns NR:ns ## COG: FN1238 COG3307 # Protein_GI_number: 19704573 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Lipid A core - O-antigen ligase and related enzymes # Organism: Fusobacterium nucleatum # 33 397 1 365 365 595 99.0 1e-170 MFYKDKKEILNFLGEWTTYAYIFSAFFNSKINMKIGYLLLIVSFFYICFNRNIIKLVNKK IYGMLLLILVLGSIWNYISADMIGMSKFLNINTRFFYGLAMFPFLLNIKKNRFSILIFLA VNLLSAMYLYNESYVYHLLDDLGRIRAILLIGWIYTLIYTFEKISEDFKKYIFLLSASIL PFIALGKSGSRAGALSLFLVIFLYLIFKVFKEKKSIKFVSVITIIILLTGLFLPKEYKEK LKTSFQTTENISNEDRIVMWKAGIEIFKENPIFGIGSYKKGIYPHVQKYVEENVNDEQLR GEFINRDRFAKLHNMYIDFFVQNGILGLLYLVFLFVLIPFEFFKSEKNKESIAAFFSMIF YCSYGLTWSLWSSLGISQALFHTFLIWMLVNLKRKNL >gi|296155056|gb|ADVK01000018.1| GENE 21 17144 - 17788 754 214 aa, chain - ## HITS:1 COG:no KEGG:FN1239 NR:ns ## KEGG: FN1239 # 
Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 214 1 214 214 358 97.0 1e-97 MNWQFFDKYLEENGNFGECISNHKNKILVYKENVEGTNFYIKKYIPYGKRKIRMAFGIYD DRAIHYEKVVKYLEKLDLPYVKLEYKKIKRISFFDRVSIIVTKDCGLTFENFVNDFEKNK DLITKFYDIFIILVKNKIYPIDYNTGGVLIDINGKLRLTDFDDYRIRSFLTNSLKKRLIR NLKRIYLEEKRTEECKKNLKNQIKRVVKELNWKV >gi|296155056|gb|ADVK01000018.1| GENE 22 17798 - 18520 747 240 aa, chain - ## HITS:1 COG:no KEGG:FN1240 NR:ns ## KEGG: FN1240 # Name: not_defined # Def: lipopolysaccharide core biosynthesis protein RfaY # Organism: F.nucleatum # Pathway: Lipopolysaccharide biosynthesis [PATH:fnu00540]; Metabolic pathways [PATH:fnu01100] # 1 240 1 240 240 400 98.0 1e-110 MAELNLKKYKELNIYYYEKEFLDLALKVIDGNYSTCQILKDTKRNYVSVIEIDGKKYVYK EPRNEFRIPQRQFTTFLKKGEALTTLVNINKLINIGFKEFVKPLVAVNKRHYGFIVSSFF IMEFVEGEDNRKNLDMIVEKMKEIHKLGYYHGDFNPGNFLVENKQIHILDTQGKKMFFGN YRAHYDMITMKYDSYDEMIYPYKKNLFYYLAYSMKRFKRLAFIEKIKYFKKKLRDKGWKI >gi|296155056|gb|ADVK01000018.1| GENE 23 18522 - 19253 622 243 aa, chain - ## HITS:1 COG:FN1241 KEGG:ns NR:ns ## COG: FN1241 COG3774 # Protein_GI_number: 19704576 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Mannosyltransferase OCH1 and related enzymes # Organism: Fusobacterium nucleatum # 1 243 1 243 243 436 100.0 1e-122 MIEKKIHYVWFGNAKSEKVLKCIESWKKNLPDYEIIEWNEKNFNIEEELKSNKFFRECYN RKLWAFVSDYVRVKVLYNYGGIYLDTDMEIIKDITPLLDADMFLGYENEDTMSFGIVGVI PKHKVFKKMYEFYQNEIWKSPLHIVTSILTEILEKEYHGKYRENNINIYPREYFYPFNHD EEFTKDCITGNTYAIHWWGKSWKKNPKVYFLKYKHLPWWKKYPKHVAKLINYYFKNLFNF RKE >gi|296155056|gb|ADVK01000018.1| GENE 24 19268 - 20350 1191 360 aa, chain - ## HITS:1 COG:FN1242 KEGG:ns NR:ns ## COG: FN1242 COG0726 # Protein_GI_number: 19704577 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted xylanase/chitin deacetylase # Organism: Fusobacterium nucleatum # 1 360 1 360 360 595 98.0 1e-170 MNILILYKNIGDKDIIKDLKNNNVYFLNQKEYSYKKIKELKNKKDIQIIVCIGRNSFLLN IYSYFLNIPVVYTDNMKNIEDIETLLQNKLAYKIRRDLPVLMYHRVIDTKNEIGFYDTYV 
TKENFEKQMKYLSENNYISLTFKDIQNGEYKKRFDKNKKYVIITFDDGYKDNLKNALPIL KKYNMKIVLFLITSESYNKWDTDVENREKEKKFNLMSKEEVKELIASNLVEIGGHTTKHL DMPNVDLKKIEEDLKVSNKILEEITGYTPISFAYPWGRSTKDVREIVKKEGYKFAVSTED GPACFSDDLFEIVRVGVYSDDSIEKFALKISGKYPFIREKRNEMKAFRNKIRKFFRIKTK >gi|296155056|gb|ADVK01000018.1| GENE 25 20365 - 21225 952 286 aa, chain - ## HITS:1 COG:FN1243 KEGG:ns NR:ns ## COG: FN1243 COG0463 # Protein_GI_number: 19704578 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Fusobacterium nucleatum # 1 286 1 286 286 498 99.0 1e-141 MKDKITVIVTLYNRLEYARNMILALQQQTKQIDELIFADDGSSEKLMEYIEDLLVDCNFK IKHVYQDDIGFRLARSRNNGAREASGDYLIFLDQDVIFDNDFIESIYNSRKKKRMIFSEA LGSSLEEKNKIQELINTQKFDYKEIYDLVDNTKKVEQDQIVNKEKFYNFLYKLKLRSRGA KIVGLIFSLFKEDFININGLDEKYIGYGYEDDDFGNRFFKYGGETFAFKMKRYPIHMYHK AASPNGSPNEDYYRQRKIEISKKNYRCEYGYDKTFGEDKYKVIEIK >gi|296155056|gb|ADVK01000018.1| GENE 26 21230 - 21778 659 182 aa, chain - ## HITS:1 COG:FN1244 KEGG:ns NR:ns ## COG: FN1244 COG0726 # Protein_GI_number: 19704579 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted xylanase/chitin deacetylase # Organism: Fusobacterium nucleatum # 1 182 69 250 250 314 100.0 5e-86 KNKKYIILTFDDGYKDNYNLLFPLLKKYNMKAVIYMVSDEKYNIWDVEASGEKRFDLMSK NEMLEMYKSGLVEFGGHTLHHPKLDTLTEKEQRYEIEENKIYLEKTLGEKLYSFAYPYGI FNETSKKIVKELGFNYGIATDSGKFYIEDDLYQVRRIGIFSDITMSKFKRRVKGNYNLKY TR Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:22:39 2011 Seq name: gi|296155041|gb|ADVK01000019.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00023, whole genome shotgun sequence Length of sequence - 12899 bp Number of predicted genes - 15, with homology - 14 Number of transcription units - 7, operones - 3 average op.length - 3.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 25 - 810 723 ## FN0484 lipase (EC:3.1.1.3) - Prom 847 - 906 4.8 + Prom 801 - 860 10.0 2 2 Tu 1 . 
+ CDS 916 - 1044 68 ## 3 3 Op 1 1/1.000 - CDS 996 - 1619 1092 ## COG0035 Uracil phosphoribosyltransferase 4 3 Op 2 . - CDS 1677 - 1922 424 ## PROTEIN SUPPORTED gi|19703817|ref|NP_603379.1| 50S ribosomal protein L31P - Prom 1953 - 2012 10.7 - Term 1995 - 2029 -0.8 5 4 Op 1 . - CDS 2041 - 2439 520 ## FN0481 hypothetical protein 6 4 Op 2 . - CDS 2457 - 2762 492 ## FN0480 hypothetical protein 7 4 Op 3 1/1.000 - CDS 2764 - 3213 484 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 8 4 Op 4 1/1.000 - CDS 3269 - 4333 1395 ## COG0821 Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 9 4 Op 5 1/1.000 - CDS 4379 - 5497 1359 ## COG0739 Membrane proteins related to metalloendopeptidases 10 4 Op 6 1/1.000 - CDS 5525 - 6766 1351 ## COG1158 Transcription termination factor 11 4 Op 7 1/1.000 - CDS 6789 - 8096 512 ## PROTEIN SUPPORTED gi|229879795|ref|ZP_04499292.1| SSU ribosomal protein S12P methylthiotransferase - Prom 8141 - 8200 13.1 12 5 Op 1 2/0.000 - CDS 8239 - 11253 2789 ## COG0841 Cation/multidrug efflux pump - Prom 11285 - 11344 5.3 13 5 Op 2 . - CDS 11355 - 11924 717 ## COG1309 Transcriptional regulator - Prom 11962 - 12021 8.9 14 6 Tu 1 . + CDS 12101 - 12604 995 ## COG0716 Flavodoxins + Term 12621 - 12677 9.9 - Term 12611 - 12663 8.2 15 7 Tu 1 . 
- CDS 12669 - 12899 306 ## COG5295 Autotransporter adhesin Predicted protein(s) >gi|296155041|gb|ADVK01000019.1| GENE 1 25 - 810 723 261 aa, chain - ## HITS:1 COG:no KEGG:FN0484 NR:ns ## KEGG: FN0484 # Name: not_defined # Def: lipase (EC:3.1.1.3) # Organism: F.nucleatum # Pathway: Glycerolipid metabolism [PATH:fnu00561]; Metabolic pathways [PATH:fnu01100] # 22 261 1 240 240 464 99.0 1e-129 MKKFFKILFFIILLSILTLWLVKIFLLTHKYQVKYYNEDKIEKDIVITFNGIYGYEKQLR FIDEKLAEDGYSVVNIQYPTVDDKIVEMTDKYIVPTIDEQVKKLNEINLERKAQNLPELK INFVVHSMGSCLIRYYLKEHKLNSLGKVVLISPPSHGSQLADNPIADLLWYFIGPAVADM KTDENSFVNQLGNPTYPCCVLIGDKSNNFLYSILIKGEDDGMVPLATAKLEGSPLKTIEN TTHTSILEKQETVDEILKFLK >gi|296155041|gb|ADVK01000019.1| GENE 2 916 - 1044 68 42 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSLANLQRILNFYPLGNLASNELFLLSYFVPNILSPQSPSPG >gi|296155041|gb|ADVK01000019.1| GENE 3 996 - 1619 1092 207 aa, chain - ## HITS:1 COG:FN0483 KEGG:ns NR:ns ## COG: FN0483 COG0035 # Protein_GI_number: 19703818 # Func_class: F Nucleotide transport and metabolism # Function: Uracil phosphoribosyltransferase # Organism: Fusobacterium nucleatum # 1 207 8 214 214 407 100.0 1e-114 MSVIEINHPLIEHKMTILRSVDTDTKSFRENLNEIAKLMTYEATKNLKLETTEVTTPLMK TQAYTLQDKVALVPILRAGLGMVDGILDLIPTAKVGHIGVYRNEETLEPVYYYCKLPTDI ASRKVILVDPMLATGGSAVYAIDYLKEQGVTDIIFMCLVAAPDGIARLLNKHPDVPIYTA KIDQGLNENGYIYPGLGDCGDRIFGTK >gi|296155041|gb|ADVK01000019.1| GENE 4 1677 - 1922 424 81 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19703817|ref|NP_603379.1| 50S ribosomal protein L31P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 81 1 81 81 167 100 3e-41 MKKGIHPEFDLVVFEDMAGNQFLTRSTKIPKETTTFEGKEYPVIKVAVSSKSHPFYTGEQ RFVDTAGRVDKFNKKFNLGKK >gi|296155041|gb|ADVK01000019.1| GENE 5 2041 - 2439 520 132 aa, chain - ## HITS:1 COG:no KEGG:FN0481 NR:ns ## KEGG: FN0481 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 132 1 132 132 213 100.0 3e-54 MKRFFCVLFCLVSLFTFASEENGLGIVDDADLRAAGVKAENIKKAKELVKQVASNYELRL LERKQLELQINKYILDNPEKYLKQIDEMFDKIGEIEATIMKERLRSQIQMKKYITAEQYM KAKEIAIKRLSK >gi|296155041|gb|ADVK01000019.1| GENE 6 2457 - 2762 492 101 aa, chain - ## HITS:1 COG:no KEGG:FN0480 NR:ns ## KEGG: FN0480 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 101 1 101 101 175 100.0 6e-43 MSPKEKVRANIYKALLEEEKRKNKRMSIFSVGLFFVGIVTMSTYNSLVKSVPNPDISSNN VVASGQVREALMSSIYDDSSVIGTKTTQLNPDELFIYNTQI >gi|296155041|gb|ADVK01000019.1| GENE 7 2764 - 3213 484 149 aa, chain - ## HITS:1 COG:FN0479 KEGG:ns NR:ns ## COG: FN0479 COG1595 # Protein_GI_number: 19703814 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Fusobacterium nucleatum # 1 149 1 149 149 231 100.0 2e-61 MDFDNIYEEYFDRVYYKVLSVVKNDDDAEDICQETFISVYKNLSKFREESNIYTWIYRIA INKTYDFFKKRKLEFEINDDVLSLPEDVNFDTKVILEEKLKLISEKEKEIVVLKDIYGYK LKEIAEMKKMNLSTVKSVYYKALKDMGGN >gi|296155041|gb|ADVK01000019.1| GENE 8 3269 - 4333 1395 354 aa, chain - ## HITS:1 COG:FN0478 KEGG:ns NR:ns ## COG: FN0478 COG0821 # Protein_GI_number: 19703813 # Func_class: I Lipid transport and metabolism # Function: Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis # Organism: Fusobacterium nucleatum # 1 354 1 354 354 629 100.0 1e-180 MERNTRVVKVANLKIGGNNPIIIQSMTNTNSADVEATVKQINNLEKVGCQLVRMTINNIK AAEAIKEIKKKVSLPLVADIHFDYRLALLAIKNGIDKLRINPGNIGSDENVKKVVEAAKE KNIPIRIGVNSGSIEKEILEKYGKPCVDALVESAMYHIRLLEKFDFFDIIVSLKSSNVKM MVEAYRKISSLVDYPLHLGVTEAGTKFQGTVKSAIGIGALLVDGIGDTLRVSLTENPVEE 
IKVAKEILKVLDLSDEGVEIISCPTCGRTEIDLIGLAKQVEEEFRTKKNKFKIAVMGCVV NGPGEAREADYGIAAGRGIGILFKKGEIIKKVSESNLLEELKKMISEDLENKKD >gi|296155041|gb|ADVK01000019.1| GENE 9 4379 - 5497 1359 372 aa, chain - ## HITS:1 COG:FN0477 KEGG:ns NR:ns ## COG: FN0477 COG0739 # Protein_GI_number: 19703812 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane proteins related to metalloendopeptidases # Organism: Fusobacterium nucleatum # 52 372 1 321 321 504 99.0 1e-143 MKRIVRKTMGYTLILAIVVFSFRLYMISSKEVFDNALFTDYFQVDEAENGGLELTTSNFT TFEKEYNFVKEEVVEKKEEKPPVQQKRAEKITYKVQKKDTVQSIAKKFGVKPETIMINNQ TAMDNKLKVGEVLTFPSIDGLYYKLQKNEMLAKVAKKYGVKVVDIVDYNNINPKKLKAGT TLFLKGVTLKKYKEVEQRLIAAQQAKEEQKKEKAQKGKGKKGGGSAPPPDTGGGDDGGAP ASYSGEGFAFPVRYAGITSPFGNRYHPVLKRYILHTGVDLVAKYVPLRASKAGVVTFAGN MSGYGKIIIIKHDNGYETRYAHLSVISTNVGEHVNKGDLIGKTGNSGRTTGAHLHFEIRH NGVPKNPMKYLQ >gi|296155041|gb|ADVK01000019.1| GENE 10 5525 - 6766 1351 413 aa, chain - ## HITS:1 COG:FN0476 KEGG:ns NR:ns ## COG: FN0476 COG1158 # Protein_GI_number: 19703811 # Func_class: K Transcription # Function: Transcription termination factor # Organism: Fusobacterium nucleatum # 1 413 1 413 413 771 99.0 0 MDILNKLLLKDLQEIAKVMEIEIGVGHKKDELKKLISNSLEENNTELAYGTLDTAPEGFG FLKETTLGKNIYMSASQIKRFKLRRGDQVLGEVRKPIGEEKNYAIRRVLKANDNDLAALE SRVPYEELIPTYPTEQFVLGIEQGNISGRILDLISPIGKGQRALIIAPPKAGKTTFISSI ANALIEGQKDSEVWILLIDERPEEVTDIKENVEGATVFASTFDDDPKNHIKVTEEIIEKA KMKVEDGENVVILLDSLTRLARAYNIVMPSSGKLLSGGIDPTALYHPKNFFGAARNIKDG GSLTIIATILVDTGSKMDEVIYEEFKSTGNCDIYLDRQLAEFRIFPAIDITKSGTRKEEL LLNKNQIDDIWNLRRLLNDYDNKVSATSALIKAIKTTRNNDELLAQLPKVLYK >gi|296155041|gb|ADVK01000019.1| GENE 11 6789 - 8096 512 435 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229879795|ref|ZP_04499292.1| SSU ribosomal protein S12P methylthiotransferase [Slackia heliotrinireducens DSM 20476] # 1 435 18 444 446 201 28 2e-51 MKKASIITYGCQMNVNESAKIKKIFQNLGYDVTEEIDNADAVFLNTCTVREGAATQIFGK LGELKALKEKRGTIIGVTGCFAQEQGEELVKKFPIIDIVMGNQNIGRIPQAIEKIENNES 
THEVYTDNEDELPPRLDAEFGSDQTASISITYGCNNFCTFCIVPYVRGRERSVPLEEIVK DVEQYVKKGAKEIVLLGQNVNSYGKDFKNGDNFAKLLDEICKVEGDYIVRFVSPHPRDFT DDVIEVIAKNEKISKCLHLPLQSGSSQILKKMRRGYTKEKYLALVDKIKSKIPGVALTAD IIVGFPGETEEDFLDTIDVVQKVSFDNSYMFMYSIRKGTKAATMDNQIEESVKKERLQRL MEVQNKCSFYESSKYKGRIVKVLVEGPSKKNKEVLSGRTSTNKIVLFRGNLALKGQFVNV KINECKTWTLYGEIV >gi|296155041|gb|ADVK01000019.1| GENE 12 8239 - 11253 2789 1004 aa, chain - ## HITS:1 COG:FN0474 KEGG:ns NR:ns ## COG: FN0474 COG0841 # Protein_GI_number: 19703809 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 1004 9 1012 1012 1729 99.0 0 MKLIKIAIKRTIVTTMISISLLILGIFAMKSMRTELLPDIEYPVVKIITHWSGASAADVE KQITNKIERILPNVEGIENISSESSYENSSISVEFNYGVNIQDKVTEIQREVFQIKNDLP NSAKNPIIKKTEVGAGAITLFLTFVSPDKKALFSYLENYVKPNLETISGVAEVSILGGTK KQLQIQIEPAKLASYNLTPMDIYQLIRKSSMVIPLGSLMNGREEYVIQALGELESVEEYE NILLHSNGDTLRLKDIANVVLTEEDPLNLGFNRGKPATTIAISKSSDGSTIEINEKIMKA IKNMEETMPSNITYFKIFDSSESIKKSINTVGKSALQGLILASLFLWIFFKNKKMTLIVS FAFPLAISTTFILMKGIHSTFNLISLMGLAIGVGMLTDNSVVVIDNIYNHIQEEKNSIEA AFIGTNQVFSSVLASTMTSIIVFLPIIFTKGIFKEMFQDMVWAIIFSNVAALLVSVTFIP MLASKFMKKNIIQTEGKYFSKIQRKYQRFLSISLKHKKKTVLISLLFAFLIFGIGGKFVK FGFLTKQDYGYYSVIAEFQNGSDFEKIQELRNEIETIIKKEPHTKSYFSIIQKRNGTISV NVDVGFKEKRKESIFEIVKKVRKEVEKIPDIRTTFFYEYAKGKPKKDIEFQIVGTDLETI QILARQIYKDVLKIKGVTDVSSTLDSGGKKLEIIFKRDKIQSLNMSIKQIEETISYYLLG GDRANTITIKSGNEEIEVLVRLSKDNRKSIKQLENLKIKVTDNSFINLSEIADIRKVENQ LSIDKINRFYSVSIYVNDGGIGTQKIQQELVKIFSEKNKDSSIQYRWGGDAEKMQRAMKE LMLTFLIAIFLIYALLASQFESFLFPFLVMGSIPFSLVGVIIGFLITQHTLDAVAMVGIV LLIGIVVNNAIVLLDFIQQKEKESKNKKEAIEKACNLRLRPILLTSLTTIVGMIPLSLGI GDGSEVYQGLGISIIFGMSFSTLLTLIFVPTTYYMLTSIFSKKL >gi|296155041|gb|ADVK01000019.1| GENE 13 11355 - 11924 717 189 aa, chain - ## HITS:1 COG:FN0473 KEGG:ns NR:ns ## COG: FN0473 COG1309 # Protein_GI_number: 19703808 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 189 1 189 189 292 100.0 2e-79 
MPQILKEEIKNRIYKAASKIFYEKGFLKTKMKDISEEAKIPVGLVYTYYKNKEELFDEIV NPIYYYLNLAIEKEEKEEGSALERFKATGEEYVLKLLNQHKSLVILMDKAQGTKHENAKQ VFIKILENHIKRQVQKKGIIIEEEILIHILASNFTESLLEIARHYENPDWAKKILNLVTK CYYEGVNSI >gi|296155041|gb|ADVK01000019.1| GENE 14 12101 - 12604 995 167 aa, chain + ## HITS:1 COG:FN0472 KEGG:ns NR:ns ## COG: FN0472 COG0716 # Protein_GI_number: 19703807 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 167 1 167 167 281 100.0 3e-76 MKTVGIFFGTTGGKTQEVVDIIAAQLGDAQVFDVANGVAEMEVFDNIIMASPTYGMGELQ DDWASVIDEVADMDFSGKVVAFVGVGDAAIFGGNYVEAMKHFYDAVQPKGAKIVGFTSTD GYDFEASEAVIDGDKFMGLAIDASFDTDEITSKVEDWLENKVKDELL >gi|296155041|gb|ADVK01000019.1| GENE 15 12669 - 12899 306 76 aa, chain - ## HITS:1 COG:FN0471 KEGG:ns NR:ns ## COG: FN0471 COG5295 # Protein_GI_number: 19703806 # Func_class: U Intracellular trafficking, secretion, and vesicular transport; W Extracellular structures # Function: Autotransporter adhesin # Organism: Fusobacterium nucleatum # 1 76 265 340 340 134 97.0 4e-32 VGFTLKLGKGSGVTYNETPQYVVQNEVKRLTVENQELKSKINNQDDRIKAQDNKINEQDE KIKNLEEKLNKLLKTK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:23:14 2011 Seq name: gi|296154976|gb|ADVK01000020.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00024, whole genome shotgun sequence Length of sequence - 71755 bp Number of predicted genes - 66, with homology - 64 Number of transcription units - 23, operones - 16 average op.length - 3.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 1/0.500 + CDS 2 - 895 1196 ## COG0596 Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) 2 1 Op 2 . + CDS 885 - 1895 1205 ## COG0252 L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D 3 1 Op 3 . + CDS 1911 - 2642 873 ## FN0750 hypothetical protein 4 1 Op 4 . + CDS 2660 - 3904 1519 ## FN0749 hypothetical protein + Term 3917 - 3961 7.8 + Prom 3934 - 3993 13.0 5 2 Op 1 . 
+ CDS 4016 - 5365 1554 ## FN0748 hypothetical protein 6 2 Op 2 . + CDS 5362 - 5877 634 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes 7 3 Op 1 . - CDS 6085 - 7281 1522 ## COG1168 Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities 8 3 Op 2 . - CDS 7325 - 8812 660 ## PROTEIN SUPPORTED gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 - Prom 8919 - 8978 11.7 + Prom 8835 - 8894 10.8 9 4 Tu 1 . + CDS 8980 - 9528 714 ## COG1396 Predicted transcriptional regulators - Term 9634 - 9681 10.6 10 5 Op 1 7/0.000 - CDS 9736 - 10224 696 ## COG0319 Predicted metal-dependent hydrolase 11 5 Op 2 . - CDS 10241 - 12313 2347 ## COG1480 Predicted membrane-associated HD superfamily hydrolase 12 5 Op 3 1/0.500 - CDS 12331 - 14799 2275 ## COG1199 Rad3-related DNA helicases 13 5 Op 4 . - CDS 14815 - 15285 575 ## COG4807 Uncharacterized protein conserved in bacteria - Prom 15341 - 15400 10.6 + Prom 15375 - 15434 8.5 14 6 Op 1 1/0.500 + CDS 15501 - 16466 1528 ## COG3643 Glutamate formiminotransferase + Term 16483 - 16524 2.6 15 6 Op 2 1/0.500 + CDS 16543 - 17784 1804 ## COG1228 Imidazolonepropionase and related amidohydrolases 16 6 Op 3 . + CDS 17802 - 18440 1062 ## COG3404 Methenyl tetrahydrofolate cyclohydrolase + Term 18459 - 18511 5.9 17 7 Tu 1 . - CDS 18506 - 19561 1253 ## COG2849 Uncharacterized protein conserved in bacteria - Prom 19661 - 19720 7.2 + Prom 19567 - 19626 10.3 18 8 Op 1 . + CDS 19713 - 20042 374 ## FN0737 hypothetical protein 19 8 Op 2 . + CDS 20044 - 20799 947 ## COG0500 SAM-dependent methyltransferases + Term 20806 - 20854 1.6 - Term 20789 - 20847 3.1 20 9 Tu 1 . - CDS 20888 - 22696 2627 ## COG5295 Autotransporter adhesin - Prom 22762 - 22821 14.2 21 10 Op 1 1/0.500 - CDS 22892 - 24598 1896 ## COG1032 Fe-S oxidoreductase 22 10 Op 2 1/0.500 - CDS 24605 - 25843 1928 ## COG2195 Di- and tripeptidases - Prom 25876 - 25935 10.9 - Term 25858 - 25909 2.0 23 11 Op 1 . 
- CDS 25947 - 27137 1171 ## COG1323 Predicted nucleotidyltransferase 24 11 Op 2 . - CDS 27154 - 27687 812 ## FN0731 hypothetical protein 25 11 Op 3 1/0.500 - CDS 27716 - 28330 511 ## COG0588 Phosphoglycerate mutase 1 26 11 Op 4 . - CDS 28367 - 29053 1127 ## COG0588 Phosphoglycerate mutase 1 - Prom 29084 - 29143 14.6 + Prom 29101 - 29160 18.0 27 12 Op 1 . + CDS 29184 - 29903 656 ## COG3177 Uncharacterized conserved protein 28 12 Op 2 . + CDS 29926 - 30681 1352 ## FN0728 hypothetical protein 29 12 Op 3 . + CDS 30683 - 30745 94 ## 30 12 Op 4 . + CDS 30766 - 30858 168 ## 31 12 Op 5 . + CDS 30900 - 31196 286 ## FN0726 hypothetical protein + Prom 31200 - 31259 5.8 32 13 Op 1 1/0.500 + CDS 31281 - 31985 1020 ## COG1179 Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 33 13 Op 2 . + CDS 31999 - 32502 713 ## COG0716 Flavodoxins + Term 32522 - 32563 7.1 - Term 32670 - 32725 12.2 34 14 Op 1 . - CDS 32741 - 34597 1552 ## FN0723 hypothetical protein 35 14 Op 2 . - CDS 34601 - 37657 2683 ## COG2319 FOG: WD40 repeat - Prom 37840 - 37899 13.0 + Prom 37677 - 37736 13.2 36 15 Tu 1 . + CDS 37826 - 38545 681 ## FN0721 hypothetical protein + Term 38553 - 38592 4.5 - Term 38541 - 38580 4.5 37 16 Op 1 4/0.000 - CDS 38589 - 39152 900 ## COG0231 Translation elongation factor P (EF-P)/translation initiation factor 5A (eIF-5A) 38 16 Op 2 . - CDS 39152 - 40204 918 ## COG4394 Uncharacterized protein conserved in bacteria 39 16 Op 3 . - CDS 40204 - 40512 415 ## FN0718 hypothetical protein 40 16 Op 4 . - CDS 40535 - 41215 921 ## COG1187 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 41 16 Op 5 . - CDS 41215 - 42159 1072 ## FN0716 phophatidylinositol-4-phosphate 5-kinase (EC:2.7.1.68) 42 16 Op 6 . 
- CDS 42152 - 43024 1169 ## FN0715 hypothetical protein - Prom 43055 - 43114 6.9 43 17 Tu 1 1/0.500 - CDS 43122 - 44066 1038 ## COG1902 NADH:flavin oxidoreductases, Old Yellow Enzyme family - Prom 44194 - 44253 4.1 - Term 44125 - 44162 -0.4 44 18 Op 1 7/0.000 - CDS 44265 - 44795 626 ## COG2059 Chromate transport protein ChrA 45 18 Op 2 1/0.500 - CDS 44792 - 45352 675 ## COG2059 Chromate transport protein ChrA 46 18 Op 3 . - CDS 45345 - 46559 1686 ## COG0452 Phosphopantothenoylcysteine synthetase/decarboxylase 47 18 Op 4 . - CDS 46576 - 47253 862 ## FN0710 hypothetical protein 48 18 Op 5 1/0.500 - CDS 47319 - 48791 557 ## PROTEIN SUPPORTED gi|163803542|ref|ZP_02197411.1| 30S ribosomal protein S20 49 18 Op 6 1/0.500 - CDS 48782 - 49459 837 ## COG1354 Uncharacterized conserved protein 50 18 Op 7 1/0.500 - CDS 49431 - 50390 410 ## PROTEIN SUPPORTED gi|163762565|ref|ZP_02169630.1| ribosomal protein S2 51 18 Op 8 1/0.500 - CDS 50405 - 51304 791 ## COG1481 Uncharacterized protein conserved in bacteria 52 18 Op 9 . - CDS 51323 - 54058 3544 ## COG0749 DNA polymerase I - 3'-5' exonuclease and polymerase domains - Prom 54219 - 54278 13.4 + Prom 54209 - 54268 18.1 53 19 Op 1 . + CDS 54297 - 55370 1160 ## CDR20291_0944 hypothetical protein 54 19 Op 2 . + CDS 55401 - 56573 1574 ## COG1473 Metal-dependent amidase/aminoacylase/carboxypeptidase + Term 56587 - 56635 3.1 - Term 56354 - 56401 7.1 55 20 Op 1 1/0.500 - CDS 56634 - 58187 2102 ## COG0500 SAM-dependent methyltransferases 56 20 Op 2 31/0.000 - CDS 58209 - 59162 1146 ## COG0341 Preprotein translocase subunit SecF 57 20 Op 3 1/0.500 - CDS 59162 - 60397 1798 ## COG0342 Preprotein translocase subunit SecD 58 20 Op 4 9/0.000 - CDS 60421 - 60837 544 ## COG0816 Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) 59 20 Op 5 . - CDS 60854 - 63457 3582 ## COG0013 Alanyl-tRNA synthetase 60 20 Op 6 . 
- CDS 63532 - 63699 119 ## gi|296327900|ref|ZP_06870436.1| conserved hypothetical protein 61 20 Op 7 . - CDS 63660 - 63833 93 ## FN0696 hypothetical protein 62 20 Op 8 . - CDS 63895 - 64620 246 ## PROTEIN SUPPORTED gi|119503196|ref|ZP_01625280.1| Ribosomal protein S16 - Prom 64652 - 64711 2.1 63 21 Op 1 . - CDS 64721 - 67426 3315 ## FN0694 S-layer protein 64 21 Op 2 1/0.500 - CDS 67419 - 70052 3517 ## COG0249 Mismatch repair ATPase (MutS family) - Prom 70093 - 70152 15.1 65 22 Tu 1 . - CDS 70154 - 71083 481 ## PROTEIN SUPPORTED gi|145632364|ref|ZP_01788099.1| ribosomal protein L11 methyltransferase - Prom 71140 - 71199 8.4 + Prom 71058 - 71117 9.9 66 23 Tu 1 . + CDS 71183 - 71728 585 ## FN0691 hypothetical protein Predicted protein(s) >gi|296154976|gb|ADVK01000020.1| GENE 1 2 - 895 1196 297 aa, chain + ## HITS:1 COG:FN0752 KEGG:ns NR:ns ## COG: FN0752 COG0596 # Protein_GI_number: 19704087 # Func_class: R General function prediction only # Function: Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) # Organism: Fusobacterium nucleatum # 1 297 23 319 319 599 99.0 1e-171 VHSIYVEECGNPNGEPIIFLHGGPGAGCGKKARRFFDPKYYHIILFDQRGCGKSIPFLEL KENNIFYSVEDMEKIRLHIGIDKWTIFAGSYGSTLGLTYAIHYPEKVKRMVLQGIFLANE DDVKWYFQKGISEIYPAEFKIFKDFIPIDEQDNLLEAYHKRFFSDNIKVRNEAIKIWSRF ELRTMESEFTWPSEEEVQDYEISLALIEAHYFYNKMFWNDSEYILNRAEIIKDIPIQIAH GRFDLNTRVISAYKLSEKLNNCELVIVEGVGHSPFTEKMSKVLIKFLEDIKEIDNGK >gi|296154976|gb|ADVK01000020.1| GENE 2 885 - 1895 1205 336 aa, chain + ## HITS:1 COG:FN0751 KEGG:ns NR:ns ## COG: FN0751 COG0252 # Protein_GI_number: 19704086 # Func_class: E Amino acid transport and metabolism; J Translation, ribosomal structure and biogenesis # Function: L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D # Organism: Fusobacterium nucleatum # 1 336 1 336 336 649 100.0 0 MENKVLIINTGGTIGMVGKPLRPAYNWSEITKGYSMLEKFPTDYYQFEKLIDSSDVTTDF WINLVEVIEKNYDKYLGFVILHGTDTMAYTGSMLSFLLKNLAKPVVLTGAQAPMVNPRSD GLQNLINSIYIAGHKLFDTPLIPEVCICFRDSLLRANRSKKTDSNNYYGFSSPNYNPLAE 
IATEIKVISDRILKIPNEKFYVEKNIDANVLLLELFPGLNSKYISDFIESNKNIKALILK TYGSGNTPTSEDFIETLKSISKKGIPILDITQCISGSVKMPLYESTDKLSKLGIINGSDI TSEAGLTKMMYLLGKNLSLQEIKNAFTSSICGEQTV >gi|296154976|gb|ADVK01000020.1| GENE 3 1911 - 2642 873 243 aa, chain + ## HITS:1 COG:no KEGG:FN0750 NR:ns ## KEGG: FN0750 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 243 1 243 243 366 99.0 1e-100 MKRWIFLILAIIIISIFSIIKSCQRSKREVINVYTDREIEIFVDKLAKKYERIDKETEIR LNSLNSIEDYDIIFINEDSKIKNNSKKKYEVKDFFEDNLVIIGRRKIDNLSQMLTSSIAI PNYKTNIGKTGLDILAKVEGFQEMAKKIQYKDDVMSALESVDLYEVDYAFVTSKILPLAK NSEICYIFPANIGRSKILYKAYINLESEKNPKKFYDFIEEDLADKSPIPIKPNPEKNKVI KTN >gi|296154976|gb|ADVK01000020.1| GENE 4 2660 - 3904 1519 414 aa, chain + ## HITS:1 COG:no KEGG:FN0749 NR:ns ## KEGG: FN0749 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 414 1 405 405 643 98.0 0 MKRVLIFFCMLLLTSNSFAEEKLVTENVTENIQQPVEQKTQKIVVDVKSVYDSLNIKNKI DYSIFQKAYLGYVQISNKNPGVLIIIDYSKPSNEERFYVLDLDKKKLVYSTRVAHSKNSG LEIPLEFSDDPNSYQSSLGFFVTLGEYNGAYGYSLRLKGLEENINANAEDRAIVIHGGDI VEDEYIKKFGFAGRSLGCPVLPHSLTREIIDFIKHGRVLFIYGNDEEYVDDSTYLSKLAP VFEGSPKNIVEIEKTIEIQKVSPTPTVTVVTTTTAPITVVTDNKKDNNNRETVIAEANIT KIFEIIKQEYKYTNNVDNSKVAYTKLLKDVIQEKLNKITEIKETTENKKINDENMDNKEK AVEQTITDNTPETKKEDETVVIQQNENVQEDKKEERKHPEEVTKKSLGLENKLK >gi|296154976|gb|ADVK01000020.1| GENE 5 4016 - 5365 1554 449 aa, chain + ## HITS:1 COG:no KEGG:FN0748 NR:ns ## KEGG: FN0748 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 20 449 1 430 430 712 99.0 0 MDKITQRMKQIEILTVALSMLILIFFINYVINEHENIFIGMYKIITSPAVLVTDFIQVGG IGAAFLNAILIFSFNFFLVKSFKVKITGITIAAFFTVFGFSFFGKNILNILPFYLGGILY SIYTSTDFSEHIVPIAFSSALAPFVSSIAFYGDISYETSYINAILIGVLIGFIVVPLARS LYDFHEGYDLYNLGFTAGILGSVIIAVLKLYHFEITPQFLLSTEYDTPLKILCSAAFISL IIIGFYINDNSFSGYFSLIKDDGYKSDFTQKYGYGLTFINMGIMGFISMGFVIITGQTFN GPVLAALFTVVGFSANGKTIFNTIPILIGILLASLGSKGNIFTLAISGLFGTALAPISGI 
FGPIAGIIAGWLHLAVVQNVGLVHGGLNLYNNGFSAGIVAGFLLPIFNMITDNNNQRKMN IQRKHMNFLKNVQANIKKRIHKKEDEEKE >gi|296154976|gb|ADVK01000020.1| GENE 6 5362 - 5877 634 171 aa, chain + ## HITS:1 COG:FN0747 KEGG:ns NR:ns ## COG: FN0747 COG0494 # Protein_GI_number: 19704082 # Func_class: L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases including oxidative damage repair enzymes # Organism: Fusobacterium nucleatum # 1 171 1 171 171 314 99.0 6e-86 MKILDTPNLKFLKVGVDTDPLNNHNLEYLEKQNAIAALILNHSGDKVLFVNQYRAGVHNY IYEVPAGLIENDEKPIVALEREVREETGYKREDYDILYDSNTGFLVSPGYTTEKIYIYII KLKSDDIVPLDLDLDETENLYTRWIDIKDAGKLTLDMKTIFSLHIYSNLIK >gi|296154976|gb|ADVK01000020.1| GENE 7 6085 - 7281 1522 398 aa, chain - ## HITS:1 COG:CAC2970 KEGG:ns NR:ns ## COG: CAC2970 COG1168 # Protein_GI_number: 15896223 # Func_class: E Amino acid transport and metabolism # Function: Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities # Organism: Clostridium acetobutylicum # 5 390 3 382 384 320 41.0 4e-87 MKKIDLNKTYERRGTDSVKWDHMDFLDNRATKNTLPLWLSDMEFKVADEILETLKNRIEH GFLGHSMAGDEYLNAVCNWYKNKFNWIIDKNSIFYSPGILPAIGFVISALTNLEDGVIIQ PPVFYPFSELIKGQGRTVINNNLINENGYYKIDFSDLREKASNPKNKLLILCSPHNPVGR VWTKEELEEIVKICQETNTYILSDEIHCDLTRKNIQHFPIKTITNYKNIIVAVSPSKTFN VAGLPIASIIIDDEKLREIWLKETKNKYYIRFAPPLDMVLSIAAYNKCGYWLEQVLDYIE DNFNFMEKYLKENLPKINYKKPEGTYLAWVNFGAYVDYKELLDVLITKYDLLIESGHVFG KPGDGYFRISVACPRVYLEEGLKRIVVAIKELTNMNKK >gi|296154976|gb|ADVK01000020.1| GENE 8 7325 - 8812 660 495 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 [Haemophilus influenzae 22.4-21] # 1 455 1 446 456 258 35 5e-68 MSISNIIISIDNFVWGISLIILSLGSGLLFSIALKFPQIRLFKEMLKCSFKNNKADSGLT PFQALSIAIGGRVGTGNITGTASAILFGGPGAVFWMCIMAIICSASAYVESALAQVWKEK INNEYAGGPSFYMKKGLRFKSAGIFYAVLSVIFLIIFAGVQTNAFSSVAAASFNISPLII AIAYTFLLVIVIFGGAKSIAKVADKVVPIMSISYITVAILIILINITKVPEMFSLIIKSA ISIDAVFGGIVGASISWGVRRGVFSNEAGLGTGAWVAGCADISHPAKAGLSQTFSVFISL 
LICIATSLMILVTDSYNVIGTDGNYIVQNIIGDYDIYVTHSIDSVVQGFGTIFITIAMFL FTFTTIMAYSLYLSRINYFFFSGHTDRKNVKIVSTLINIVTTILAFLGPLVSSSTIWSMA SAMCGLISLVNLFSLILLYKPAIITLKDYEKQLKIGLDPVFIPEKYEIENAELWNKIIEE DYTTELDNYKKAFND >gi|296154976|gb|ADVK01000020.1| GENE 9 8980 - 9528 714 182 aa, chain + ## HITS:1 COG:BH2909 KEGG:ns NR:ns ## COG: BH2909 COG1396 # Protein_GI_number: 15615472 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Bacillus halodurans # 8 179 10 182 189 124 38.0 9e-29 MDRLNILVSENIKRIRQEKNLSLGDLAKLSDVSKSMLAQIERGEGNPTLSTLWKIANGMQ VSFNTLIAQPKLPYKVTKLAEIEPILDMNGGLKNYSLFSDIENNFSVYQIEVGKEISWIS EAHLRGTAEFVIVIQGTLEIKLEEKTFILKKGENLWFKADVPHSYCNLDEGTTIFHNILY NK >gi|296154976|gb|ADVK01000020.1| GENE 10 9736 - 10224 696 162 aa, chain - ## HITS:1 COG:FN0746 KEGG:ns NR:ns ## COG: FN0746 COG0319 # Protein_GI_number: 19704081 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolase # Organism: Fusobacterium nucleatum # 1 162 1 162 162 268 99.0 5e-72 MELIVDFSSDLQTEKYNMFIDTLYENNHLENYIKKVLNLEKIESDRPLYLSLLLTDNENI QVINREYRDKDAPTDVISFAYHETEDFNIGPYDTLGDIIISLERVEEQASEYNHSFEREF YYVLTHGILHILGYDHIEEEDKKLMREREEAILSSFGYTRDK >gi|296154976|gb|ADVK01000020.1| GENE 11 10241 - 12313 2347 690 aa, chain - ## HITS:1 COG:FN0745 KEGG:ns NR:ns ## COG: FN0745 COG1480 # Protein_GI_number: 19704080 # Func_class: R General function prediction only # Function: Predicted membrane-associated HD superfamily hydrolase # Organism: Fusobacterium nucleatum # 62 690 1 629 629 1103 99.0 0 MKKFTIFGFKFLFDIKKKDSSDEERYSDSYFLKEKVFYLILALFLITISSKIPILFRNNN YMTGDVVKSDIYSPKTIVFRDKIGKDKLIQDMIDRLDKDYIYSSEAADIYIEEFDNFHKE IIAIKKGNLKSFDYSGFERKTGKVMPERIINKLLEEDEEKIDETFSKLTTQLENAYKAGI YKEKNSIRINEPAKTDIEALEPFEREIINNFLIPNYIYDEAKTRNTINEKVSQINDQYIE IKAGTLIAKTGEVLTDRKIDILDRLGIYNYKMSIFIIALNLIFLLVISSIFNVVTTKFYS KEILEKNKYRAIMLLTIATLLVFRIVPSSMIYLLPLDTMLLLLLFIVKPRFSVFLTMIVI SYMLPITDYDLKYFTIQSIAIFATGFLSKNISTRSSVIAIGIQLAILKILLYLILSFFSV 
EESYGVALNTIKIFISGLFSGMFAIALLPYFERTFNILTVFKLMELADLSHPLLRKLSIE APGTFQHSMMVATLSENAVIEIGGDPTFTRVACYYHDIGKTKRPQYYVENQTDGKNLHND ISPFMSKMIILAHTREGAEMGKKYKIPKEIRDIMFEHQGTTLLAYFYNKAKEIDPNIQEE EFRYSGPKPQTKESAVILLADSIEAAVRSLDVKDPVKIEQMVRKIVDSKIRDNQLSDANI TFREVEIIVNSFLKTFGAIYHERIKYPGQK >gi|296154976|gb|ADVK01000020.1| GENE 12 12331 - 14799 2275 822 aa, chain - ## HITS:1 COG:FN0743 KEGG:ns NR:ns ## COG: FN0743 COG1199 # Protein_GI_number: 19704078 # Func_class: K Transcription; L Replication, recombination and repair # Function: Rad3-related DNA helicases # Organism: Fusobacterium nucleatum # 82 822 1 741 741 1345 99.0 0 MVMDIKDRFSEESLQTIKEYLQENNNKSMIFKATFDDNELIQEPFFLSLYKKKNFEETLT KVGKNEVVIRTTKPNQLYPSDMELELSEELYNRRNIAYCLLSSDLDDFYFVQDIDRIFLE KINIENYFSKDGILAKEIKGFEYRQEQEEMAHYIQDAINEDRKIIIEAGTGTGKTLAYLI PAIKWAVVNKKKVIIATNTINLQEQLLLKDIPLAKSIIKEDFSYVLVKGRSNYLCKRLFN ELSIGRSIDIEIFSMEAREQIECILKWGNKTKTGDKAELPFEVYPDVWELIQSTTELCLG KKCPYRKECFYMKTRMEKMEADILISNHHVFFADLNVRAETDFDSEYLILPRYDMVIFDE AHNIESVARSYFSVEVSKISFTRLLNRIYQKKNKRKKEKSALIRVEDTIDEKDLEDSQQY AYLLNILKEEISILQNIGDEYFDEIRKIYETNTEAPIRKSLNNFEMTKSRFLETLRDKKD IFQSKLADFLTLMMSFNNVIDEEKEKNPEVINFNNHLKMFKAYIDSFKFINSFEDDNYIY WLDINSKRTNVLLTATPLNIAEKLSTVLFDNLDRLVFASATIVANGNFDYFKKSLGLDEE DCIECIIKSPFNYDEQMSVYIPTDIQDSENINAFVTDASKFILNILLKTDGKAFILFTSY TMLNQIYYSISKKLMNKGFEVFLHGDKPRSQLIKEFKEAENPILFGTTSFWEGVDVQGEN LSNVIITKLPFLVPTDPVVSAISKKIEEDGGNSFTDFQLPEAIIKFKQGVGRLIRKKTDS GNIFILDNRILKKRYGSLFINALPSQKNIKILEKDDIIEEIE >gi|296154976|gb|ADVK01000020.1| GENE 13 14815 - 15285 575 156 aa, chain - ## HITS:1 COG:FN0742 KEGG:ns NR:ns ## COG: FN0742 COG4807 # Protein_GI_number: 19704077 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 156 1 156 156 272 99.0 2e-73 MTNNDFLRRLRYALNIKDNVMVQIFKKGNVLLTREDVIDYLKKDIDEGFKKLSNNDLIAF LDGLITQKRGKKEDGTPVSQVKVTKNNLNNILLRKLRIALAFKSYDMIKIFKLGGIEISE GELSALFRSEDHKNYKECGDKYIRVFLKGLTEYYRN >gi|296154976|gb|ADVK01000020.1| GENE 14 15501 - 16466 1528 321 
aa, chain + ## HITS:1 COG:FN0741 KEGG:ns NR:ns ## COG: FN0741 COG3643 # Protein_GI_number: 19704076 # Func_class: E Amino acid transport and metabolism # Function: Glutamate formiminotransferase # Organism: Fusobacterium nucleatum # 1 321 1 321 321 634 99.0 0 MAKIVECIPNYSEGKDLAKIERIVTPYKNNPKIKLLSVEPDANYNRTVVTVLGDPQEVKK AVIESIGIATKEIDMNVHKGEHKRMGATDVVPFLPIQEMTTEECNEISKEVGKAVWEKFQ LPVFLYESTATAPNRVSLPDIRKGEYEGMAEKLKQPEWAPDFGERAPHPTAGVTAIGCRM PLIAFNINLATTNMDIPKEIAKAIRFSSGGFRFIQAGPAEILDKGFVQVTMNIKDYTKNP IYRIMETVKMEAKRWGVKVTGCEIIGATPFASLTDSLKYYLACDGIKDDVDAMSMDKVVE LMIKYLGLTDFDVKKVLEANI >gi|296154976|gb|ADVK01000020.1| GENE 15 16543 - 17784 1804 413 aa, chain + ## HITS:1 COG:FN0740 KEGG:ns NR:ns ## COG: FN0740 COG1228 # Protein_GI_number: 19704075 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Imidazolonepropionase and related amidohydrolases # Organism: Fusobacterium nucleatum # 1 413 1 413 413 786 99.0 0 MQADLVLYNIGQLVTSRELDKTKKMDNIEVIENNGYIVIEKDKIVAIGSGEVPKEYLTPA TEMVDLSGKLVTPGLIDSHTHLVHGGSRENEFAMKIAGVPYLEILEKGGGILSTLKSTRN ASEQELIEKTLKSLRHMLELGVTTVEAKSGYGLNLEDELKQLEVTKILGYLQPVTLVSTF MAAHATPPEYKDNKEGYVQEVIKMLPIVKERNLAEFCDIFCEDKVFSVDESRRILTAAKE LGYKLKIHADEIVSLGGVELAAELGATSAEHLMKITDSGINALANSNVIADLLPATSFNL MEHYAPARKMIEAGIQIALSTDYNPGSCPSENLQFVMQIGAAHLKMTPKEVFKSVTINAA KAIDKQDTIGSIEVGKKADITVFDAPSMAYFLYHFGINHTDSVYKNGKLVFKK >gi|296154976|gb|ADVK01000020.1| GENE 16 17802 - 18440 1062 212 aa, chain + ## HITS:1 COG:FN0739 KEGG:ns NR:ns ## COG: FN0739 COG3404 # Protein_GI_number: 19704074 # Func_class: E Amino acid transport and metabolism # Function: Methenyl tetrahydrofolate cyclohydrolase # Organism: Fusobacterium nucleatum # 1 212 1 212 212 358 100.0 4e-99 MKLVELDVLKFLDIVDSNSPAPGGGSVSALASSLGASLARMVAHLSFGKKNYEALADDVK AKFVANFDELLKIKNELNDLIDKDSEAYNTVMAAYKLPKETDEEKAARSAEIQKSLKYAI QTPYDIVVLSGKAISLLGEILANGNQNAITDIGVGTMLLMVGLEGGILNVKVNLSSIKDT AYVEKITKEIYDIKATAEKEKERIMGIVNAAL >gi|296154976|gb|ADVK01000020.1| GENE 17 18506 - 19561 1253 351 aa, 
chain - ## HITS:1 COG:FN0738_1 KEGG:ns NR:ns ## COG: FN0738_1 COG2849 # Protein_GI_number: 19704073 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 149 1 149 149 265 100.0 1e-70 MKKNFIICTFIIFLLASFTVFAEREVDLEKLEYDDKSKLVYLEGENEPFTGIAKDYYKDK SLKIEFPYKNGKMEGRGKEYYPSGKFKSDAFFVDGLLQGKSIGYYENGNLEYEENYKDGE LDGLIKDYFESGKIKAEINYKNGELDGPAKEYYENGQVYIQESYKNGELDGESLNFDENG NLKSKEVYKNGELVESSNKNIGEVSIPSKKSNTESKLKYYIGISAFLTVIIGLIVYTIFK MLTAFPKTNHLSDEQRSRIFKILMKYDEGKDGLFSAYRMNGVGTGYYRVRSMMVDNEKVY IYAKMFSILYIPTPITLGYLLCYNKDKILASFSNTTFKEAKKEIEETILHL >gi|296154976|gb|ADVK01000020.1| GENE 18 19713 - 20042 374 109 aa, chain + ## HITS:1 COG:no KEGG:FN0737 NR:ns ## KEGG: FN0737 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 109 1 109 109 195 97.0 4e-49 MPHLKIRGIEKNLIVENSKEIIDKLTKIIGCDRNCFTIEHQNTEYIFDGKIVDGYTFVEL YWFARDEKIKKEVADFLTKFIKKINNNKDCCIIFFTLTGDNYCDNGEFF >gi|296154976|gb|ADVK01000020.1| GENE 19 20044 - 20799 947 251 aa, chain + ## HITS:1 COG:FN0736 KEGG:ns NR:ns ## COG: FN0736 COG0500 # Protein_GI_number: 19704071 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 1 251 1 251 251 475 96.0 1e-134 MDKKLIANWLEEEKNAHIQGWDFSHIHGRYEEENDLPWDYKNIIKQYLKPEYKLLDIDTG GGEFLLTLEHPFKNTSVTENYPPNIEFCKKNLVPLGINLYETDGASLLPFKDNEFDIIIN KHGNFNISELFRILKTGGIFITQQVGAENDKELIELLLPKTELSFPDLYLKNISKKFKKT GFKILQEQEAFCPIKFYDIGALVWYAHIIEWEFPNFSVNNCIENLFKAQEILEKQGVVEG KIHRFLIVAQK >gi|296154976|gb|ADVK01000020.1| GENE 20 20888 - 22696 2627 602 aa, chain - ## HITS:1 COG:FN0735 KEGG:ns NR:ns ## COG: FN0735 COG5295 # Protein_GI_number: 19704070 # Func_class: U Intracellular trafficking, secretion, and vesicular transport; W Extracellular structures # Function: Autotransporter adhesin # Organism: Fusobacterium nucleatum # 1 602 16 617 617 903 
100.0 0 MKKSVSLKLIIFSFLLVSGSITYSATPTIEAGTGTDSTEAGIDNEASKEKSSAFGYKNIA DGEESSAFGFSNTASGKRSSAFGHSNKASGERSSAIGYNNEVTGLESSVLGNNYKVTGNH SGAFGIGTYDEVNRVYKRINGGGNSYMIGNDNNIAKGANDNFILGNGVDIGVDSNGRKIE GSVVLGHGSSVSESEVVSVGTPDGGRRIVNVREARITATSNDATTGKQLYQVAKATSTDI DVAAWKAKLGVGSGGVDLTVYSKRDASNLTATDISAWRTKLGVGAGGGGGIVNTATGTGS TGLGIDNTVTGDYSTAIGYKNNVSGNKSGAFGDPNVVTGNGSYAFGNDNTINGNNNFVLG NNVTIGSGIQNSVALGNNSTVSSSNEVSVGSATQKRKITNVADGDVSATSTDAVTGRQLY KAMQNSGATGIENLRNEVNEKIDDVKDEVNHVGSLSAALAGLHPMQYDPKAPTQVMAALG HYRNKQSVAVGLSYYFNDRFMVSAGVAIGGERRVKSMANVGFTVKLGKGSGVEYNETPQY VVQNEVKRLTVENQNLKSQVNNQGKENQELKAEINSLNTKNKEQDEKIKNLEEKLNRLLK NK >gi|296154976|gb|ADVK01000020.1| GENE 21 22892 - 24598 1896 568 aa, chain - ## HITS:1 COG:FN0734 KEGG:ns NR:ns ## COG: FN0734 COG1032 # Protein_GI_number: 19704069 # Func_class: C Energy production and conversion # Function: Fe-S oxidoreductase # Organism: Fusobacterium nucleatum # 1 568 1 568 568 1165 99.0 0 MKFLPTTKEEMKNLGWDSIDVLLISGDTYLDTSYNGSVLVGKWLVEHGFKVGIIAQPEVD IPDDITRLGEPNLFFAVSGGCVDSMVANYTATKKRRQQDDFTPGGVNNKRPDRAVLVYSN MIRRFFKGTTKKIVISGIESSLRRITHYDYWTNKLRKPILFDAKADILSYGMGEMSMLQL ANALKNGEDWKNIKGLCYLSKEMKEDYLSLPSHSECLADKDNFIEAFHTFYLNCDPITAK GLCQKCDDRYLIQNPPSERYSEEIMDKIYSMEFARDVHPYYKKMGAVRALDTIKYSVTTH RGCYGECNFCAIAIHQGRTIMSRSQSSIVEEVKNIAGTPKFHGNISDVGGPTANMYGLEC KKKLKLGACPDRRCLYPKKCPHLQVNHNNQVELLKKLKKIPNIKKIFIASGIRYDMILDD NKCGQMYLKEIIKDHISGQMKIAPEHTEDKILGLMGKDGKSCLNEFKNQFYKINNELGKK QFLTYYLIAAHPGCKDKDMMDLKRYASQELRVNPEQVQIFTPTPSTYSTLMYYTEKDPFT NQKLFVEKDNGKKQKQKDIVTEKRKNRK >gi|296154976|gb|ADVK01000020.1| GENE 22 24605 - 25843 1928 412 aa, chain - ## HITS:1 COG:FN0733 KEGG:ns NR:ns ## COG: FN0733 COG2195 # Protein_GI_number: 19704068 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Fusobacterium nucleatum # 1 412 1 412 412 763 99.0 0 MDSKKYSTLKERFLRYVKFNTRSDDASETIPSTPSQMEFAKMLKKELEELGLSNIFINKA CFVNATLPSNMDKKVATVGFIAHMDTADFNAEGISPQIVENYDGKDIVLNKEQNIVLKVE 
EFPNLKNYISKTLITTDGTTLLGADDKSGIVEIIEAVKYLKEHPEINHGDIKIAFGPDEE IGRGADYFDVKEFAADYAYTMDGGPVGELEYESFNAAQAKFKIKGVSVHPGTAKGKMINA SLIASEIIEMFPKDEVPEKTEGYEGFYFLDEMKSNCEEGEVVYIIRDHDKSKFLAKKEFV KELVEKVNKKYGREVVKLELKDEYYNMGEIIKDHMYVVDIAKQAMENLGIKPLIKAIRGG TDGSKISFMGLPTPNIFAGGENFHGKYEFVALESMEKATDVIVEIVKLNAER >gi|296154976|gb|ADVK01000020.1| GENE 23 25947 - 27137 1171 396 aa, chain - ## HITS:1 COG:FN0732 KEGG:ns NR:ns ## COG: FN0732 COG1323 # Protein_GI_number: 19704067 # Func_class: R General function prediction only # Function: Predicted nucleotidyltransferase # Organism: Fusobacterium nucleatum # 1 396 1 396 396 696 100.0 0 MFENVVGLIVEYNPFHNGHLHHIQEIDRLFEDNIKIAVMSGDYVQRGEPSLINKLEKTKI ALSQGIDIVIELPTFYSTQSAEIFAKGSVNLLNKLSCSHIVFGSESNDLDKLKRIATISL TKEFELSLREFLAEGFSYPTAFSKVLFDEKLGSNDILALEYLRAIKTINSKIEAYCIKRE KTGYYDDEKDNFSSATYIRKILLDSNEKKENKLNKIKNLVPEFSYKILEENFGVFSCLSD FYDLIKYNIIKNCSELKNIQDLEIGLENRLYKYSLENLKFEDFFNEVLTKRITISRLQRI LLHSLFNLTENITEKVKNEVPFVKILGFSTKGQKYLNYLKKSKNYNERKILTSNRNLKEI LNEEEAKLFNFNELCSQIYRIKSSYINIGYPIIKKD >gi|296154976|gb|ADVK01000020.1| GENE 24 27154 - 27687 812 177 aa, chain - ## HITS:1 COG:no KEGG:FN0731 NR:ns ## KEGG: FN0731 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 177 1 177 177 282 100.0 4e-75 MKKIILYLFLLGTSLSFGATNDLPDNVEKKIRSAVSTFSGSEKRENYAWYKDSYLEMVER LDKSGIPETDKQMIIKRLEAMYGGNYPKQLARVNDEINDYKGLVNRIREEQNAVQQKTEA ENQKSKEEIKSILSSSSIPKVDLDKIEQNAKAEYPNDYTLQKAYIKGAIKTYNDLKK >gi|296154976|gb|ADVK01000020.1| GENE 25 27716 - 28330 511 204 aa, chain - ## HITS:1 COG:FN0730 KEGG:ns NR:ns ## COG: FN0730 COG0588 # Protein_GI_number: 19704065 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoglycerate mutase 1 # Organism: Fusobacterium nucleatum # 1 204 1 204 204 317 100.0 8e-87 MKKIIIFFLLIIAITGNSEIIKEKKILRKETMELILVCHEKIQSDFENIDLSPSGIEAVK QLAEKMKKNYSFDIAYTSNLKIANRTLNYILEEMNELEIPINKSETLNTITRKDLEGKNV FESLKSYWKSDISKNLKEGKNVLIVTDEDTIRILIKYLLDMSDRDIQDVYIPIDNTFYFE VDKNLEVISAGFPIQKVLERIDEF 
>gi|296154976|gb|ADVK01000020.1| GENE 26 28367 - 29053 1127 228 aa, chain - ## HITS:1 COG:FN0729 KEGG:ns NR:ns ## COG: FN0729 COG0588 # Protein_GI_number: 19704064 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoglycerate mutase 1 # Organism: Fusobacterium nucleatum # 1 228 1 228 228 442 100.0 1e-124 MKLVLIRHGESAWNLENRFTGWKDVDLSPKGMEEAKSAGKILKEMNLVFDVAYTSYLKRA IKTLNIVLEEMDELYIPVYKSWRLNERHYGALQGLNKAETAKKYGDEQVHIWRRSFDVAP PSIDKNSEYYPKSDRRYADLADSDIPLGESLKDTIARVLPYWHSDISKSLQEEKNVIVAA HGNSLRALIKYLLNISNEDILNLNLVTGKPMVFEIDKDLKVLSSPELF >gi|296154976|gb|ADVK01000020.1| GENE 27 29184 - 29903 656 239 aa, chain + ## HITS:1 COG:pli0008 KEGG:ns NR:ns ## COG: pli0008 COG3177 # Protein_GI_number: 18450294 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Listeria innocua # 26 209 17 210 254 71 29.0 2e-12 MNKILKTLLEEKEIKLKGSLYHLTQVNFSYNSNHIEGSKLTEEQTQYIYETNSFISDKEK IISIDDINETVNHFKCFDYILENINILDENLIKALHKILKNNTSDSQKDWFKVGDYKLRA NFIGNTKTTSPSNVLKEIKKLLDEYNSKIKITFDDIVDFHYKFEAIHPFQDGNGRVGRLI MFKECLRNDIVPFIIDEEHKLFYYRGLKNYKEDKAYLIETYLSAQDKYIKLLNELKIKF >gi|296154976|gb|ADVK01000020.1| GENE 28 29926 - 30681 1352 251 aa, chain + ## HITS:1 COG:no KEGG:FN0728 NR:ns ## KEGG: FN0728 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 41 251 1 211 211 344 81.0 2e-93 MQPTKEWLEKWEKVKDKLQPSSNLEDYFTLKEIAGKELDVLDIGPCSIPTGEFLVRDPLV YLIFRTQMPYFQKIPTGEFRTELAVIKASDGDCDRYAAARLKFNDNKIAYYEEAMIGQEE LNDIQKGDYFGFSVDAGLASFCDKKLHDLFCDFAEKWVEENPDGNAYDDYFAKFFKESYE NNPKYQRDAGDWINWTIPGTDYHLPMFQSGFGDGTYPTYIAYDKDGNVCQLIVELIDIEL AYSEIDEEDEE >gi|296154976|gb|ADVK01000020.1| GENE 29 30683 - 30745 94 20 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MLDFIHVLMMCINFKKIIHI >gi|296154976|gb|ADVK01000020.1| GENE 30 30766 - 30858 168 30 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MYKLPVENLDLNRIDIFKQLVKAKESLDIF >gi|296154976|gb|ADVK01000020.1| GENE 31 30900 - 31196 286 98 aa, chain + ## HITS:1 COG:no KEGG:FN0726 NR:ns ## KEGG: 
FN0726 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 95 1 95 99 119 86.0 5e-26 MKEAKEFSEIENIITTYDELYKEMVLKDKSNPNAKDLLELLFFEFYTKNEYIRTKLNISR QTVTSYLKQLEEVGILSSKKIRKEILYKNISLFKIAEN >gi|296154976|gb|ADVK01000020.1| GENE 32 31281 - 31985 1020 234 aa, chain + ## HITS:1 COG:FN0725 KEGG:ns NR:ns ## COG: FN0725 COG1179 # Protein_GI_number: 19704060 # Func_class: H Coenzyme transport and metabolism # Function: Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 # Organism: Fusobacterium nucleatum # 1 234 1 234 234 416 98.0 1e-116 MFLQRTELLISSNNVEKLKNSNVIVFGLGGVGGATVEALVRAGIGNLSIVDFDTIDKTNL NRQIITTQSVIGKPKVDVAKDRILSINPNINLTIYNEKFLKEKIDLFFKDKKYDYIVDAI DLVTAKLDLIEFATNSKIPIISCMGTGNKLNPAQFKVVDINKTSVCPLAKIIRKELKKRR INKLKVVYSDETPRKPLNLDGDREKAKNVGSISFVPPVAGMLLASEVIKDICEL >gi|296154976|gb|ADVK01000020.1| GENE 33 31999 - 32502 713 167 aa, chain + ## HITS:1 COG:FN0724 KEGG:ns NR:ns ## COG: FN0724 COG0716 # Protein_GI_number: 19704059 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 167 1 167 167 291 99.0 4e-79 MKTIGIFYATLTKTTVGIVDEIEFFLKKDDFKTFNVKNGVKEIENFENLILVTPTYQVGE AHAAWMNNLKKLEEIDFTGKVVGLVGLGNQFAFGESFCGGIRYLYDIVVKKGGKVVGFTS TDGYHYEETSIIEDGKFIGLALDEENQPNLTPKRIGDWITEIKKEFK >gi|296154976|gb|ADVK01000020.1| GENE 34 32741 - 34597 1552 618 aa, chain - ## HITS:1 COG:no KEGG:FN0723 NR:ns ## KEGG: FN0723 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 618 1 618 618 1050 94.0 0 MKKISNIIKHYFNKNLWIIYILGLVLSLIGSFQVYHGRYDNILKEISVISVSVLKLFLFV PIEGFIKQNPLAYELAIWIAPLTTLLATFSIFNKLYTAIKLKLTHFYKEHIIVMGCNDYS LSFMKNYISLKNKKKILCVLPERSQEKDIETLNRLGVITCTIDYMPGLNDENMRISSEYN FASVDTIICFEDEPKNYGYLKLISELITKRRKKKEKIVNVYVNTVNKYIRNIVQHKMDEI KIFDIKYFNIYDLIAYNLINLKNFKLYETSGLKKDWIKIKKEKGENFSLDDFSNLIGTPN ILLIGFKNCGKSLFELVVNQTLVNAKENMKITIVDRKISNIIEEYKATIRELKKVANIEL 
IDGDINHISTQNKIKENHKKDPFTAVLFSTKNCTESLIFMDLLGEEIFKNVNTAVFCENI WENKPLIESIILKYPNITVFGELIDVLNFESITNESLEIKAKEFNAYYNKITENIMDSPE QDISVEEQWTSLSNIKKDSSRNQCMHQNVKEVLLGKIAQMEGLSSVEELLNRWKAMIDSV SVKEQINIIEKNPAMSYMSALEHKRWSNFYYMRNFVYSEKKDEVNGTHNSLIDDWDEFLN SKKREEVIYDFISVLSVK >gi|296154976|gb|ADVK01000020.1| GENE 35 34601 - 37657 2683 1018 aa, chain - ## HITS:1 COG:FN0722_2 KEGG:ns NR:ns ## COG: FN0722_2 COG2319 # Protein_GI_number: 19704057 # Func_class: R General function prediction only # Function: FOG: WD40 repeat # Organism: Fusobacterium nucleatum # 262 1018 1 757 757 1302 98.0 0 MVEIQNTTKKKYKYDAFISYRHIEPDLTIAEILHDMIEKFNIPKHLRTVSNDGSLIDDKH VFRVFRDREELSTKDLSTMIEEAIANSENLIVICSKRTSFSPWCRKEVQLFKRIHGANNI IPVLIEGEPDESFIDELKNLKATFINSENVEEEKNIELLAADIRPDEVKSPSFKGYEILQ NSKDPKLDELTKKSLDILKKSEIYRIVASMLNVNYGDLKLRHQERYLKRIIYTSVAASIA MLIFVVSVTTLYLKSVASERKANEQSSLMTLNMANEANLQGNRILGVLIAQEAMKNVSPK MEKYNKLEAQYENILNNSLITLPFSNQFILPTESETASFGISSDSKWLISSSSFNNAIIW DLDNGGIKKTLTFESPVVSIALSPDSKKSYVGTANNKIFEVNMEDYEIKNVFGNSTLPAV AMRISKNNKYLFALRNALILDVFDIQNQKKLYSFTFPIDNMITGFAENPQTNNFFILRKD NTITEYDINTGEVLIVHASTTNPNKNFFRKMTITDNGTLFYSDIQENTESFIMKNLQSGQ INCASNIRNFSSNIVVNKDATLLYVSSLNNFITRFDLSNLKPDEEINVPQRVTYLDTKRT QYNESIKNMILSPDENTLAVVLNNMAVGAFSGIKNITSDSSPEFILNEKSTHKSSVDIIK FTPDSKKIVTSANDYTIRVMDTESTMGKSQLLNGKIVASSRDKNSILILSGDKISKYNFD TNKEEFIAILDPKFLKVFQQFSITNNVSLVALSDSTEGASASVFDVKQDKKIYTTKSHTV KAGILPFISGLQFSNDGNFLFTLGPDNQLFIHNAKTGEFLFSLEDKENGEATSFIMSNDD NFVAINYITKKSTIFSLREKKVVQKINGEVMAVDSENNNIKAIYGQVDSKLFVTKPNSKV LYYADNKIKTATGTLKYNTINMSYDGKYYISSIPKNNTIITDLVTGEPVRTFYNSNNDFF ISLPVISKDNKKVAYSESKDKIIIMNMYSTEELSKMATKILKGRQLNELELNSIGRRE >gi|296154976|gb|ADVK01000020.1| GENE 36 37826 - 38545 681 239 aa, chain + ## HITS:1 COG:no KEGG:FN0721 NR:ns ## KEGG: FN0721 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 239 1 239 239 422 99.0 1e-117 MKRKLFFILSLFLISSIYVFGENFPQKAKTVNDFIPKGWKEILTTNGDLNKDKLEDTVIV IEKEDKKNIKKNDVLGPDYLNLNPRILLVLFKQKDGTYILASKNDKGFIQSENDEENPAL 
MDTLNGINIKNNILRINFSYFLSAGSWWTSTNVYIFRFQNNVFELIGYESNAYMRNTGEE EGTSINFSTNKAKITTGGNIFEEKENNPKDEWRYLKFEKKYILDEMTESTLDEILDIVY >gi|296154976|gb|ADVK01000020.1| GENE 37 38589 - 39152 900 187 aa, chain - ## HITS:1 COG:FN0720 KEGG:ns NR:ns ## COG: FN0720 COG0231 # Protein_GI_number: 19704055 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factor P (EF-P)/translation initiation factor 5A (eIF-5A) # Organism: Fusobacterium nucleatum # 1 187 1 187 187 349 100.0 2e-96 MKIAQELRAGSTIKIGNDPFVVLKAEYNKSGRNAAVVKFKMKNLISGNISDAVYKADDKM DDIKLDKVKAIYSYQNGDSYIFSNPETWEEIELKGEDLGDALNYLEEEMPLDVVYYESTA VAVELPTFVEREVTYTEPGLRGDTSGKVMKPARINTGFEVQVPLFVEQGEWIKIDTRTNE YVERVKK >gi|296154976|gb|ADVK01000020.1| GENE 38 39152 - 40204 918 350 aa, chain - ## HITS:1 COG:FN0719 KEGG:ns NR:ns ## COG: FN0719 COG4394 # Protein_GI_number: 19704054 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 350 1 350 350 566 99.0 1e-161 MKINSIDIFCEVIDNYGDVGVAYRLAREFKRIYPKKELRFIINQTEEINLIKKSNDMEII TYKDISKIENSADLIIESFACEIPKEYMDKALKNSKLIINLEYFSAEDWVDDFHLQESLL GGNLKKYFFIPGLSKKSGGILLDNEFLERKKKVEENKEYYLEKFGINEKYDLIASVFSYE KNFDSFIKELKKLDKKILLLILSEKTQKNFIKYFDNNNNYDKIKFVKLPFFTYDKYEELL ALCDFNLVRGEDSFVRALLLGKPFLWHIYPQDKNTHIKKLESFLDKYCPNNKELKETFIN YNINRDDFSYFFKNFKEIEEHNKKYANHLIKNCNLIEKLIKFIENIGGKN >gi|296154976|gb|ADVK01000020.1| GENE 39 40204 - 40512 415 102 aa, chain - ## HITS:1 COG:no KEGG:FN0718 NR:ns ## KEGG: FN0718 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 16 102 16 102 102 81 98.0 1e-14 MKKKILCLVILILLITACTPSFGVGAGARGRRSSVSAGTGISTGTKTKKINDNKMKKKSK TTAKKPKKVVRQNTIKNNKTEATGNTNKTGTTVKRVKQERQE >gi|296154976|gb|ADVK01000020.1| GENE 40 40535 - 41215 921 226 aa, chain - ## HITS:1 COG:FN0717 KEGG:ns NR:ns ## COG: FN0717 COG1187 # Protein_GI_number: 19704052 # Func_class: J Translation, ribosomal structure and biogenesis # Function: 16S rRNA 
uridine-516 pseudouridylate synthase and related pseudouridylate synthases # Organism: Fusobacterium nucleatum # 1 226 1 226 226 399 99.0 1e-111 MRLDKFLVECGIGSRKEVKKLISNGEITVNGMSYILAKDNIDENSDTIEYNGERLEYKEF RYYIMNKKAGYITATEDFKEDTVMDLLPEWVIKKDLAPVGRLDKDTEGLLLFTNDGKLNH KLLSPKNHIDKVYYVEIEKNILDEDILKLEQGVDIGNYITQPAKVEKISDNKIYLTIKEG KFHQVKKMLEAVNNKVCYLRRESFGKLTLNDLTLGEVKEVSLEDII >gi|296154976|gb|ADVK01000020.1| GENE 41 41215 - 42159 1072 314 aa, chain - ## HITS:1 COG:no KEGG:FN0716 NR:ns ## KEGG: FN0716 # Name: not_defined # Def: phophatidylinositol-4-phosphate 5-kinase (EC:2.7.1.68) # Organism: F.nucleatum # Pathway: not_defined # 1 314 1 314 314 537 99.0 1e-151 MNKDLKKFILFLIGSIIVAFAISYSYSAYQTCQHNKKINEVKNAFDFGGTNKKAEDVKKN ENSQEAWQNQMLEILELLGYSKVDTRPFYKRIYDKLTGKKVYNYIDKSGHETEAGSFEFC NEVLYENLNIKNETVMVEVKDNKIIEKFFDGDKELMQIELIANDDYSSYDQKIYSYITKK TITVKDVLNKDTYLNTKNGIIEYEDGKTIEFTHKNSEMNGPAIETFPNGDKIEFNFVNDK RIGEAEKFYKNGDREIFTYGENNKKTGSSTYYFANGDVEETIYVDGVLQGPAKYMYKDGV VEHYKYKDGKRIED >gi|296154976|gb|ADVK01000020.1| GENE 42 42152 - 43024 1169 290 aa, chain - ## HITS:1 COG:no KEGG:FN0715 NR:ns ## KEGG: FN0715 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 290 1 290 290 497 99.0 1e-139 MATINYIAVVKQLESGKFLISFPDFEGITTTAETEESIQDVASGVIKAKLAELKKANIEA PEAKKITEVSKELKDGEFTTYVAVKESFDFKSTMTNLKDKETVKETAKEMTNKVNDFVNN VPEGKENLFAMGGGVLSILNTLFFAVVTVKLPFFGNYSIGFFKGLSEVSDFSKEAKNSQF ILIFSGILFLAMAGFLTYSAFIKNKNFLKYSIFGNIALLVIFYIVLYVKLPGGEASKYIS VSYFKILLYIISLGLAYLTYRALDKKDKEEVEENAKPLGTILEKEEDRGE >gi|296154976|gb|ADVK01000020.1| GENE 43 43122 - 44066 1038 314 aa, chain - ## HITS:1 COG:FN0714 KEGG:ns NR:ns ## COG: FN0714 COG1902 # Protein_GI_number: 19704049 # Func_class: C Energy production and conversion # Function: NADH:flavin oxidoreductases, Old Yellow Enzyme family # Organism: Fusobacterium nucleatum # 1 314 1 314 314 606 99.0 1e-173 MEKINIFTDFKIKNIHIKNRIVLPPMVRFSLIGDDSYVTQDLIEWYGMIAKSGVGLIIVE 
ATAVEESGKLRENQIGIWNDSFIEGLTKVANEIHKYDVPCMIQIHHAGFKEKISEVAEEE LDRILKLFEEAFVRAKKCGFDGIEIHGAHTYLISQLNSKLWNKRKDKYGERLYFSKKLIE NTKYLFDDNFILGYRMGGNEPELEDGIENAKILESYGLDILHVSSGVPNPKYKRQVKIKN FPKDFPLDWIIYMGTEIKKHVKIPVIGVSKIKKESQASWLVENNLLDFVAVGKAMISQDK WMEKARKDFVLRKK >gi|296154976|gb|ADVK01000020.1| GENE 44 44265 - 44795 626 176 aa, chain - ## HITS:1 COG:FN0713 KEGG:ns NR:ns ## COG: FN0713 COG2059 # Protein_GI_number: 19704048 # Func_class: P Inorganic ion transport and metabolism # Function: Chromate transport protein ChrA # Organism: Fusobacterium nucleatum # 1 176 1 176 176 249 98.0 1e-66 MTYFDLFFVFFKVGLFSFGGGYAILPLMQHEVVDVNKWISFKDFMDIVAISQITPGPISI NLATHVGYRIDGTLGSTIATTSVVLPSIIIVSIIVIFLKRFNKLPVVQRIFKSLRVTIVG LILAAGIALFVKENFIDYKSYIIFTSVLIGGLIFKIGSITLIILSGLAGAILYYII >gi|296154976|gb|ADVK01000020.1| GENE 45 44792 - 45352 675 186 aa, chain - ## HITS:1 COG:FN0712 KEGG:ns NR:ns ## COG: FN0712 COG2059 # Protein_GI_number: 19704047 # Func_class: P Inorganic ion transport and metabolism # Function: Chromate transport protein ChrA # Organism: Fusobacterium nucleatum # 1 186 1 186 186 315 100.0 4e-86 MNRNRIIEIFILFFKIGAFTIGGGYAMLSLIEDEIVNKKKWLDHEEFLDGMAIAQSTPGV LAVNISLITGYKIAGFMGMFAGMLGAVLPSFFIVLFLSQILLAYGNHPLVVAIFNGVKPA ITALILISVYRIGKSANINRYNFLIPLIVAILIRYFGVSPIIVIIATMILGNIFYILKEK KEDDRK >gi|296154976|gb|ADVK01000020.1| GENE 46 45345 - 46559 1686 404 aa, chain - ## HITS:1 COG:FN0711 KEGG:ns NR:ns ## COG: FN0711 COG0452 # Protein_GI_number: 19704046 # Func_class: H Coenzyme transport and metabolism # Function: Phosphopantothenoylcysteine synthetase/decarboxylase # Organism: Fusobacterium nucleatum # 1 404 1 404 404 688 94.0 0 MKNILLGVTGGIAAFKSASIVSLLKKKGYNVKVVMTKNATNIIGPLTLETLSRNRIYVDM WDTNPHYEVEHISLADWADIVLIAPATYNIIGKVANGIADDMLTTIISAVSVRKPVFFAL AMNVNMYENPILKENIDKLKSYGYRFIEAEEGLLACNYVAKGRMSEPEDIIAEIERYNIF SKIENYDTVLKGKKILITSGRTKENIDPIRYLSNNSSGKMGYCLAQAAIDLGAEVTLISG PTNLEIPKGLKNFISVESALEMYKKVDEYFGDTDIFIACAAVADYRPKEYKKEKIKKSDS 
DLILELVRNPDILFEMGKKKDNQLLVGFAAETNDIKENALKKLEKKNLDIIVANNASTMG TDSNTIEVIKRDKSSVEIKQKNKIELAYDILLEVILELKKDKNE >gi|296154976|gb|ADVK01000020.1| GENE 47 46576 - 47253 862 225 aa, chain - ## HITS:1 COG:no KEGG:FN0710 NR:ns ## KEGG: FN0710 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 225 1 225 225 321 96.0 1e-86 MGLGDFLFKEKEEKYLKQIEDLQNKLKKKEEEILQLKYDLEVVTQERDNRISGKQLEIFE RNLKQNVESSKKYKDLLISYRINPEKIQYKYKVELKNFYSGKKFQEILNIFNEKNILFVD YLKEEDFNDIPRETKNFDEAKQRFLDFKSERFDWEIATFINRGEKISKIYSKSKKLVTIF SDLYLEYMDDIANFDFMSLKSYGFKTPQIEEFIQKRDEYYKEYRI >gi|296154976|gb|ADVK01000020.1| GENE 48 47319 - 48791 557 490 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803542|ref|ZP_02197411.1| 30S ribosomal protein S20 [Vibrio campbellii AND4] # 1 420 4 434 520 219 31 4e-56 KMLKKSIHTMIITMISRVLGLFRGTLVAYFFGASILTDAYYSAFKISNFFRQLLGEGALG NTFIPLYHKKKKEEGEERSREYIFSVLNITFLFSFLVSILMIIFSSYIIDFIVVGFSDEL KIVASRLLKIMSFYFLFISLSGMMGSILNNFGYFAIPASTSIFFNLSIIFSAIWLTKYFD IDALAYGVLIGGILQFLVVFFPFFRLLKTYSLKIDFKDVYLKLLGIKLIPMLIGVFARQI NTIVDQFFASFLVAGSITALENASRIYLLPVGVFGVTISNVLFPTISKAAANNDKEGTNK GIISALNFLNFLTIPSLFVLTFFSKDVIRLIFSYGKFNEEAVRITAECLFYYSLGLLFYV GVQLVSKAYYAMGDNKRPAKFSIIAIIMNIVLNYLFIKNFQHKGLAMATSISSGVNFFLL LFIYVKNYVKLDLKNLIFTSIKICVSSIIATGAAYYINNVILKLIVFSAVFLLQWVYPIY KYRERIFYKK >gi|296154976|gb|ADVK01000020.1| GENE 49 48782 - 49459 837 225 aa, chain - ## HITS:1 COG:FN0708 KEGG:ns NR:ns ## COG: FN0708 COG1354 # Protein_GI_number: 19704043 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 225 1 225 225 344 100.0 8e-95 MEEVVVKLNNFEGPFDLLLNLIEKNKMKISDINISQLIDEYLEVLRVSKRENIEIKSDFI IIASELIEIKTLNLLNLDKDKEKETNLRRRLEEHKLFKEVVPKVAKLEKEFNISYSRGES KRVIKKIAKDYDLTSLTTDDIFEVYKKYFDSVDISEVMELNLMKQYDIKEVMDNILMKVY FKKWPIDDLFLEAENKLHLIYIFLAILELYKDAKINIDNGEITKC >gi|296154976|gb|ADVK01000020.1| GENE 50 49431 - 50390 410 319 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163762565|ref|ZP_02169630.1| ribosomal protein 
S2 [Bacillus selenitireducens MLS10] # 19 305 20 311 317 162 34 5e-39 MIVVNDILTTDIDFNDTYVAIGNFDGVHYGHKKLIKEAIKAARENGKQAVVFTFANHPME ILFPEKKFDYINTNEEKLYLLESLGVDVVIMQKVDKDFLEYTPLEFVRILKNKLKVKEIF IGFNFSFGKGGVGKAEDLEYLAEVHNIKVTELPPVTLNGELVSSSVIRKKIANSDFEGAI KFLDHPMLVIGEVIHGKKIARELGFPTTNIKMDNRLYPPFGIYGAFLQVGNKNSQVLYGV VNVGYNPTLKQEISLEVHILDFNKEVYGEKVYVQIVKFMRKEKKFSSIDELKATIQADVD RWKLFKREMKYGRSCSKTQ >gi|296154976|gb|ADVK01000020.1| GENE 51 50405 - 51304 791 299 aa, chain - ## HITS:1 COG:FN0706 KEGG:ns NR:ns ## COG: FN0706 COG1481 # Protein_GI_number: 19704041 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 299 1 299 299 491 100.0 1e-139 MSYSSNVKQEITKKIPATNLECLAEISSIFENKSMSLKDGVEIKMENSILAKRVYSLIKN TSSLKFGIKYSVTKKFTEHKVYTITLYKQKGLKEFLDSFKFSYLDIIQNDEIFRGYIRGF FLSCGYIKDPKKEYSLDFFVDNEELAGKIYNILFSKKKKIFKTNKKNKILVYLRNSEDIM DILVLMDALQHFFEYEETTIIKNLKNKTIREMNWEVANETKTLNTGNYQIKMIKYIGEKI GLNSLSPVLEEAAFLRLNNPEDSLQSLADMINISKSGIRNRFRRIEEIYNSLLEEEKNS >gi|296154976|gb|ADVK01000020.1| GENE 52 51323 - 54058 3544 911 aa, chain - ## HITS:1 COG:FN0705_2 KEGG:ns NR:ns ## COG: FN0705_2 COG0749 # Protein_GI_number: 19704040 # Func_class: L Replication, recombination and repair # Function: DNA polymerase I - 3'-5' exonuclease and polymerase domains # Organism: Fusobacterium nucleatum # 411 911 1 501 501 907 100.0 0 MKKAVLLDVSAIMYRAYFANMNFRTKNEPTGAVYGFINTLLSIINEFKPDYMAAAFDVKR SSLKRTEIYSDYKSNRQSAPEDLIKQIPRIEEALEAFNINRYKIEGYEADDVLGSLAKKL AKQDIEVIIVTGDKDLSQLVEKNITVALLGKGTEGEKFGTLKTSDDVVNYLGVVPEKIPD LFGLIGDKSDGIPGVTKIGEKKALSIFSQYDSLEKIYENIDNLKSIDGIGPSLIKNLINE KDIAFMSRELAKIFTNLDITVEEDGLQYGMDRRKLYSLCKVLEFKMFIKKLGLEEKPQNP TLFSIENLEEKKESPKIVEEENIEFTKEINLDLSNRELLIIDNENSLNEQKEYLTNYKKI ASIYYEGLGIILSTEDKDFYFPLNHGGLLAKNIDRNLVVKFISELDIKFISYNFKALLNL GISFKSMYMDMMIAYHLISSQTKIDPIIPITEYSKLEPKDFKTAFGKVNVELITAQDFSK YLSDISIGILAIYDELNYLLKKEDLYKILMENEMPLIPVLSLMERKGIEIDVQYFKNYSL ELEKELLKVEKAIYEEAGEEFNINSPKQLGDILFVKLNLPSGKKTKTGYSTDVMVLEDLE 
SYGYNIARLLLDYRKLNKLKTTYVDTLPLLVDENSRIHTSFNQIGTATGRLSSSEPNLQN IPVKTDDGIKIREGFIAGEGKVLMSIDYSQVELRVLTSMSKDENLIEAYREEKDLHDLTA RRIFNLPDSETVSREQRTIAKIINFSIIYGKTPFGLAKELKIPVKDASEYIKKYFEQYPR VTTFEKEVIEFGEEHGYVKTLFGRKRYISGIDSKNKTIKSQAERMAVNTVIQGTAAEVLK KVMVKVYDILKDKEDIALLLQVHDELIFEVEKNSVEKYSGILADIMKNTVQLEDVKLNIN INIGKNWAEAK >gi|296154976|gb|ADVK01000020.1| GENE 53 54297 - 55370 1160 357 aa, chain + ## HITS:1 COG:no KEGG:CDR20291_0944 NR:ns ## KEGG: CDR20291_0944 # Name: not_defined # Def: hypothetical protein # Organism: C.difficile_R20291 # Pathway: not_defined # 1 357 1 395 395 330 49.0 8e-89 MTTILALVIVLVAMSLGDIVSIKSKAWIPSVSVTALIFLFGFWWIFPKNVISTAGILGIM VLLLTVGLIFFDKQTAVAGGPPLSGGIVAAIIIKEAATALGNDKLAILAILIYVVQGFVG YPLTSILLKREGNRLLKDFKNNKENVAEKILNKESERKTLIPNLPEKYNTIYVILLKLAF VAYLADCFTKFINNTIFQNTVLSPFVTCLIFGIIAAELGLVDRQALNKSNSFGWMITVLM AFIYEGLNKATPEMLLEVLLPLLEIIIIGVAGLLIFAFIVGKILKESPYMSLCITLNALY GFPPNYILTNEVIKSLASNNDESEYLTNKILPKMLIGGFTSVTIASVLIAGIFSKFL >gi|296154976|gb|ADVK01000020.1| GENE 54 55401 - 56573 1574 390 aa, chain + ## HITS:1 COG:FN0703 KEGG:ns NR:ns ## COG: FN0703 COG1473 # Protein_GI_number: 19704038 # Func_class: R General function prediction only # Function: Metal-dependent amidase/aminoacylase/carboxypeptidase # Organism: Fusobacterium nucleatum # 139 330 15 206 218 363 97.0 1e-100 MNVKNITKKYKDYIIEKKRYFHMNPEPSFNEYNTSKVVQEELKKIGIPFEVFAKTGIIAT IKGQNSGKTVLLRADMDALEVCKKNNVSYKSQKEGLMHACGHDGHMAMLLGAAHVLNEIK NDISGEIKLLFQPAEETAQGAKAIIEESKIIDSIDTAFAIHLWQGVPVGKISLESGARMA AADLFSIKVKGKSGHGSMPHETIDAVVVASAIVMNLQHLVSRNTNPLDTLVVTVGKLTAG TRHNIIAGEALLEGTIRSFSDEVWKKVPEQIERVVKNTAAAYDAEVEINLVRATPPLVND QDISNILKTSAEKLYGEEVVTKYAKTSGGEDFAYFTQVVPGALAFVGIRNDKKGINSPHH NETFDMDEEALEMGANLYAQFAIDFLNSKK >gi|296154976|gb|ADVK01000020.1| GENE 55 56634 - 58187 2102 517 aa, chain - ## HITS:1 COG:FN0701_1 KEGG:ns NR:ns ## COG: FN0701_1 COG0500 # Protein_GI_number: 19704036 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: 
SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 1 248 1 248 248 469 100.0 1e-132 MANKETIENVKIIQKSYDETPYKSKTFYYTQPGRQQMVLKLLGFKTPDLEKARVLEIGCS FGGNIIPFALENPKAEVVGIDLSNVQIEEGNRIIEFLNLENIRLIHQNVLEFDEKLGKFD YIICHGVFSWVNEEVQRGILNVIKNHLSENGSAILSYNTYPGWKNLEVARDVMLFRDEML KNRGEQINESNAVKYGRGAIEFLSQFSILNEKVKAGINGITEKDDYYILHEYFEENNKPL YLYDFNKMLLEYGLIHVVDSDLMKTFPNISNEIEEKLSAECGNDNIAKEQYYDFLLDRQF RISIVTHEANKKKINISKDVRITDLKEIDIRGKYQKNKDGFHTIENNEIKDEEISLILDI LSENYPNTLTIDELEKKVREKNKLENNNVYANAVYLMYGKLVEAYSRKLTVKKEEKIKLN SKYKDYLNYFITNPNPVIALASYEGTVNYDNFNPMMLFIMTLFDGTRTDEDIFNLLLEKE KTGEVVITFEESSSKEEVIKNNIEICRNFIEINFLNK >gi|296154976|gb|ADVK01000020.1| GENE 56 58209 - 59162 1146 317 aa, chain - ## HITS:1 COG:FN0700 KEGG:ns NR:ns ## COG: FN0700 COG0341 # Protein_GI_number: 19704035 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecF # Organism: Fusobacterium nucleatum # 1 317 1 317 317 560 99.0 1e-159 MKTNLHVIKNIKIYLSISLVLVIFSIVIFFTKGLNYGIDFSGGNLFQLKYNGTTVTLNQI NENLDKLAKELPQINSNSRKVQISDDGTIIVRVPEISENDKGKVLNNLKELGSYTLDKED KVGASIGDDLKKSAIYSLGIGAILIVIYITMRFEFSFAIGGILSLLHDIIIAVGFIALMG YEVDTPFIAAILTILGYSINDTIVIYDRIRENLKRKHKGWILEQCMDESINQTAIRSLNT SVTTLFSVIAILVFGGASLKTFIMTLLIGILAGTYSSIFVATPVVYLLNKRKGNNMEDMF KDDEENNDGKRVEKILV >gi|296154976|gb|ADVK01000020.1| GENE 57 59162 - 60397 1798 411 aa, chain - ## HITS:1 COG:FN0699 KEGG:ns NR:ns ## COG: FN0699 COG0342 # Protein_GI_number: 19704034 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecD # Organism: Fusobacterium nucleatum # 1 411 1 411 411 712 100.0 0 MNKKLFLRLLIVIAIFAVALYYSLAKPIKLGLDLKGGAYVVLEAVEDENSNVKIDNDAMN RLIEVLNRRVNGIGVAESTIQKAGDNRVIVELPGLQNTEEAINLIGKTALMEFKLMNEDG SLGETLLTGSALQKAEVSYDNLGRPQISFNMTPEGAQVFAKITRENIGRQLAITLDGVVQ TAPKINTEISGGSGVITGNYTVEEAKGTAALLNAGALPIKAEIAETRTVGATLGDESIAQ SKNAGMVAIVLIWVFMIIFYRLPGIIADLAIIIFGFITFACLNFIDATLTLPGIAGFILS 
LGMAVDANVIIFERIKEELRFGNSIRNSIDSGFNKGFIAIFDSNLTTLIITTILFVFGTG PIKGFAVTLALGTLASMFTAITVTKVLLLTFVNMFGFRSPKLFGVTVEEAK >gi|296154976|gb|ADVK01000020.1| GENE 58 60421 - 60837 544 138 aa, chain - ## HITS:1 COG:FN0698 KEGG:ns NR:ns ## COG: FN0698 COG0816 # Protein_GI_number: 19704033 # Func_class: L Replication, recombination and repair # Function: Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) # Organism: Fusobacterium nucleatum # 1 138 1 138 138 203 100.0 6e-53 MKRYIALDIGDVRIGVARSDIMGIIASPLETINRKKVKSVKRIAEICKENDTNLVVVGIP KSLDGEEKRQAEKVREYIEKLKKEIENLEIIEVDERFSTVIADNILKELNKNGAIEKRKV VDKVAASIILQTYLDMKK >gi|296154976|gb|ADVK01000020.1| GENE 59 60854 - 63457 3582 867 aa, chain - ## HITS:1 COG:FN0697 KEGG:ns NR:ns ## COG: FN0697 COG0013 # Protein_GI_number: 19704032 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Alanyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 867 1 867 867 1627 97.0 0 MLTGNEIREKFIEFFMQKQHKHFESASLIPDDPTLLLTVAGMVPFKPYFLGQKEAPCSRV TTYQKCIRTNDLENVGRTARHHTFFEMLGNFSFGDYFKKEAIKWSWEFVTEVLKINKDKL WVTVFTTDDEAEKIWVEECNFPKERIVRMGESENWWSAGPTGSCGPCSEIHVDLGVQYGG DENSKIGDEGTDNRFIEIWNLVFTEWNRMEDGSLEPLPKKNIDTGAGLERIAAVVQGKTN NFETDLLFPILEEAGKITGSQYGKNPETNFSLKVITDHARAVTFLVNDGVIPSNEGRGYI LRRILRRAVRHGRLLGYTDLFMYKMVDKVVEKFEVAYPDLRKNLENIRKIVKIEEEKFSN TLDQGIQLVNQEIDNLLINGKNKLDGEISFKLYDTYGFPYELTEEIAEERGITVLREEFE AKMEEQKEKARSAREVVMEKGQDSFIEEFYDKYGVTEFTGYEKTEDEGKLLSLREAKDRK YLLIFDKTPFYGESGGQIGDQGKIYSDNFEAKVLDVQKQKDIFIHTVKFEKGIPEENKTY KLEVDVIKRLDTAKNHTATHLLHKALREVVGTHVQQAGSLVDSEKLRFDFSHYEALTEEQ LSKIEDIVNKKIREGIEVVVSHHSIEEAKKLGAMMLFGDKYGDVVRVVDVHGFSTELCGG THIDNIGKIGLFKITSEGGIAAGVRRIEAKTGYGAYLVEKEEADILKNIEKKLKATNSNL VEKVEKNLETLKDTEKELEALKQKLALFETKAAISGMEEIGGVKVLIAAFKDKSTEDLRT MIDTIKDNNEKAIVVLASTQDKLAFAVGVTKTLTDKIKAGDLVKKLAEITGGKGGGRPDF AQAGGKDEGKLLDAFKEVREIIGAKLV >gi|296154976|gb|ADVK01000020.1| GENE 60 63532 - 63699 119 55 aa, chain - ## HITS:1 COG:no KEGG:no 
NR:gi|296327900|ref|ZP_06870436.1| ## NR: gi|296327900|ref|ZP_06870436.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 55 1 55 55 80 100.0 3e-14 MEKRRVFFTIFSITNATGSSYAYGSGFSLPKEYQKKWFQLISTLRKRFEEYQKKK >gi|296154976|gb|ADVK01000020.1| GENE 61 63660 - 63833 93 57 aa, chain - ## HITS:1 COG:no KEGG:FN0696 NR:ns ## KEGG: FN0696 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 57 1 50 50 81 98.0 1e-14 MLSSSTSLFAESSNPYYKAEENVYFLMGDFVNEWGFTNFNPIDEKWKKEESSLQFFL >gi|296154976|gb|ADVK01000020.1| GENE 62 63895 - 64620 246 241 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|119503196|ref|ZP_01625280.1| Ribosomal protein S16 [marine gamma proteobacterium HTCC2080] # 3 223 4 223 305 99 30 5e-20 MISLSADSLVKTYKKRKVVDKVSLEVNKGEIVGLLGPNGAGKTTTFYMITGIVKPDSGQV MCADQDITNLPMYKRADMGIGYLAQEPSVFRNLTVEENIEVVLEMKNISKKMQRETVDRL LEEFKLTHVRDSLGYSLSGGERRRIEIARTIANNPSFILLDEPFAGVDPIAVEDIQNIIR HLKKRDLGILITDHNVRETLSITDKSYIMAKGKVLIEGAPREIANNPEARRIYLGEKFKL D >gi|296154976|gb|ADVK01000020.1| GENE 63 64721 - 67426 3315 901 aa, chain - ## HITS:1 COG:no KEGG:FN0694 NR:ns ## KEGG: FN0694 # Name: not_defined # Def: S-layer protein # Organism: F.nucleatum # Pathway: not_defined # 259 901 1 643 643 1055 98.0 0 MSKKKIIYIAMGVIAVVLGYFNYFGSDKEVGDIKKIVETINAVYESDDYHVEAEKEIDYL DEKESKFEKAKAKIQGMLLSGDNVFLDKDRNLTLDTNILGISPNGWEIKASELKYDKTTQ ELISTKPMYAKNEEKGIEVLANKFKTTISMDNITLEDGVVIKNKLFSILADKANYNNSSK IITLEGNIRLSNGIGEVGDINTLKDVKDIPNNSINKNDKEMSGTFSKVYFNLDERNLYAT DGFDLKYDEVGLRGKNIVLNETTQSLKVTDDVKFTYQDYVFDVSYIEKEPNSDIINIYGK IKGGNPVYSVLADKGEYNVNDKKIRIFGNVDITSTKGEKLVLDNVVYSSATKEADMYGNK IKYTSPENNLEAEYIHYNTVTKEVTTNKPFDSWNQKGEGLTGTSISYNLGTKDFYSKENI TVKNKDYGLTTKNVTYKEETGILTAPEPYVIKSNSGDSTVNGNSITYNKKTGELLSPGEI IIDNKGTIIKGHDLVYNNISGLGKVEGPIPFENKADKMSGIAKEIIIKKGDYVDLVGPIK AKRDTTNMEFANARYLYKDGLVHVNTPVKFNDPVSSMVGSVSSATYNPKDSILRGTNFNM 
EEPDRSAKAQNIVLYNKDKRRLELVGNAYLSSGKDNISGPKIVYYLDTKDAETPTNSVIN YDQYTIKSTYAKVNRESGAVFAKKADVKSVDGNEFSANEAKGNTNDVVHFTGNVKGKSKQ KEGDVFFTGDKADLYMSKVNDKYQAKKVIVDTKSTFTQLNRKIDSNYLELDLIKKEVYAR KNPVLTIDDGPKGSTLVKADDVTGYIDKELIKLNKNVYVKNINEKKEETVLTADRGTVTK EMADVYDKVKIVTKESVTTANEGHYDMVNRKIRAKGNVHVDYTGDKSTSTIFNDMTSTKK K >gi|296154976|gb|ADVK01000020.1| GENE 64 67419 - 70052 3517 877 aa, chain - ## HITS:1 COG:FN0693 KEGG:ns NR:ns ## COG: FN0693 COG0249 # Protein_GI_number: 19704028 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS family) # Organism: Fusobacterium nucleatum # 1 877 20 896 896 1534 99.0 0 MSADTPLMQQYKKIKEEYQNEILMFRLGDFYEMFFEDAKIASKELGLTLTKRNREKGQDV PLAGVPYHSVASYIAKLVEKGYSVAICDQVEDPKSATGIVKREVTRVITPGTIIDVDFLD KNNNNYIACIKINTTENIVAIAYADITTGEFSVFEIKGKNFFEKALAEMNKIQASEILLD EKTYSEYIEILKEKISFLGVKFTEVPNVRKAESYLTSYFDIMSIEVFSLKSKDLAISTSA NLLYYTDELQKGNELPFSKIEYKNIDNIMELNISTQNNLNLVPKRNEEAKGTLLGVLDNC VTSVGSRELKKIIKNPFLDIEKIKQRQFYVDYFYNDVLLRENIREYLKDIYDVERIAGKI IYGTENGKDLLSLKESIRKSLETYKVLKEHQEIKDILDIDVKILLDIYNKIELIINIEAP FSVREGGIIKDGYNSELDKLRKISKLGKDFILEIEQRERERTGIKGLKIKYNKVFGYFIE VTKANEHLVPEDYIRKQTLVNSERYIVPDLKEYEEKVITAKSKIEALEYELFKQLTSEIK GHIDSLYKLANRIANLDIVSNFAHIATKNSYVKPEIGDGDILEIKGGRHPIVESLIPSGT YVKNDIILDDKNNLIILTGPNMSGKSTYMKQIALNIIMAHIGSYVAADCAKIPIVDKIFT RVGASDDLLTGQSTFMLEMTEVASILNNATNKSFIVLDEIGRGTSTYDGISIATAITEYI HNNIGAKTIFATHYHELTELEKELERAINFRVEVKEDGKNVVFLREIVKGGADKSYGIEV ARLSGVPKEVLNRSNKILKKLETRKNLIENKIKAEQMILFGNGFEEENEEEETEILSENE SKVLELLKNMDLNSLSPLESLLKLNELKKILIGGTDE >gi|296154976|gb|ADVK01000020.1| GENE 65 70154 - 71083 481 309 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|145632364|ref|ZP_01788099.1| ribosomal protein L11 methyltransferase [Haemophilus influenzae 3655] # 1 307 34 342 353 189 35 3e-47 KIMKKIYIAPIAGVTDYTFRGILEDFKPDLIFTEMVSVNALSVLNDKTISKILKLRAGNA VQIFGEDIEKIKASVKYIENLGVKHINLNCGCPMKKIVNCGYGAALVKEPEKIKKILSEI KSVLNNDTKLSVKIRIGYKEPENYIQIGKIAEELGCDHITVHGRTREQLYSGKADWKYIK 
EVKDNISIPVIGNGDIFTGEDALEKIFYSNVDGVMLARGIFGNPWLIRDIREILEYGEIK TPTTKADKINMAIEHLKRIRVDNDNKFVFDVRKHISWYLKGIENCAEAKRKINIISDYDE IIKILEEML >gi|296154976|gb|ADVK01000020.1| GENE 66 71183 - 71728 585 181 aa, chain + ## HITS:1 COG:no KEGG:FN0691 NR:ns ## KEGG: FN0691 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 181 1 181 181 278 99.0 5e-74 MKNKLFNIIIFSFLLNSVNIFSLDSDIKEIEPIESIHSDQNASSFADDNIKTFSENSFEE NYSPNKSKISNIVSTNKDLKTNKKEENNGEDKFEEITDRTNRITALGSAMGAVDLSKTPA DKFRVGAGVGHSAKNQAVAVGIGYAPTERLRLNTKISTTTNSTKSSRSNGISIGVSYDLD W Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:25:30 2011 Seq name: gi|296154863|gb|ADVK01000021.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00025, whole genome shotgun sequence Length of sequence - 111986 bp Number of predicted genes - 119, with homology - 112 Number of transcription units - 47, operones - 26 average op.length - 3.8 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 1 - 96 168 ## + Term 111 - 158 3.4 + Prom 112 - 171 8.4 2 2 Tu 1 . + CDS 203 - 634 339 ## Smon_0058 hypothetical protein + Term 642 - 677 -0.6 - Term 566 - 609 -0.8 3 3 Op 1 12/0.000 - CDS 807 - 1571 1487 ## COG0024 Methionine aminopeptidase 4 3 Op 2 1/0.600 - CDS 1581 - 2207 1058 ## COG0563 Adenylate kinase and related kinases 5 3 Op 3 . - CDS 2285 - 3214 1317 ## COG0451 Nucleoside-diphosphate-sugar epimerases - Prom 3282 - 3341 13.0 + Prom 3204 - 3263 13.4 6 4 Op 1 . + CDS 3407 - 5026 2496 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains + Prom 5028 - 5087 6.4 7 4 Op 2 . + CDS 5107 - 5325 359 ## FN1302 hypothetical protein + Term 5344 - 5393 10.1 - Term 5325 - 5386 19.1 8 5 Op 1 1/0.600 - CDS 5394 - 5972 731 ## COG2096 Uncharacterized conserved protein 9 5 Op 2 . 
- CDS 5987 - 6451 711 ## COG0629 Single-stranded DNA-binding protein - Prom 6480 - 6539 15.2 + Prom 6520 - 6579 14.4 10 6 Op 1 3/0.000 + CDS 6610 - 7278 930 ## COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain + Prom 7304 - 7363 13.2 11 6 Op 2 2/0.100 + CDS 7412 - 9112 1908 ## COG0500 SAM-dependent methyltransferases 12 6 Op 3 4/0.000 + CDS 9115 - 9714 483 ## COG0558 Phosphatidylglycerophosphate synthase 13 6 Op 4 . + CDS 9716 - 10519 591 ## COG4589 Predicted CDP-diglyceride synthetase/phosphatidate cytidylyltransferase 14 6 Op 5 . + CDS 10594 - 10788 450 ## FN1309 hypothetical protein + Term 10802 - 10841 8.6 15 7 Tu 1 . + CDS 10853 - 10936 65 ## + Term 10981 - 11038 -0.6 - Term 10790 - 10829 8.6 16 8 Tu 1 . - CDS 10966 - 11622 739 ## COG1802 Transcriptional regulators - Prom 11663 - 11722 11.2 + Prom 11642 - 11701 15.6 17 9 Op 1 . + CDS 11725 - 12726 1167 ## COG0473 Isocitrate/isopropylmalate dehydrogenase 18 9 Op 2 . + CDS 12759 - 14069 1865 ## COG2851 H+/citrate symporter + Term 14077 - 14117 4.1 - Term 14063 - 14104 8.1 19 10 Op 1 11/0.000 - CDS 14111 - 14902 876 ## COG0810 Periplasmic protein TonB, links inner and outer membranes 20 10 Op 2 30/0.000 - CDS 14911 - 15300 539 ## COG0848 Biopolymer transport protein 21 10 Op 3 . - CDS 15303 - 15911 850 ## COG0811 Biopolymer transport proteins - Prom 16031 - 16090 11.1 + Prom 15974 - 16033 12.8 22 11 Tu 1 . + CDS 16066 - 17490 1755 ## COG4166 ABC-type oligopeptide transport system, periplasmic component + Term 17685 - 17731 4.2 23 12 Tu 1 . - CDS 17467 - 17643 174 ## FN1314 hypothetical protein - Prom 17738 - 17797 2.4 - Term 17703 - 17733 -0.5 24 13 Op 1 . 
- CDS 17882 - 18427 870 ## FN1315 hypothetical protein 25 13 Op 2 1/0.600 - CDS 18461 - 19237 925 ## COG0327 Uncharacterized conserved protein 26 13 Op 3 1/0.600 - CDS 19234 - 20058 1193 ## COG0568 DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 27 13 Op 4 31/0.000 - CDS 20074 - 21330 1837 ## COG0568 DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 28 13 Op 5 1/0.600 - CDS 21359 - 23170 1625 ## COG0358 DNA primase (bacterial type) - Term 23185 - 23227 5.1 29 14 Op 1 1/0.600 - CDS 23234 - 24925 2225 ## COG0760 Parvulin-like peptidyl-prolyl isomerase 30 14 Op 2 1/0.600 - CDS 24954 - 26339 1894 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains - Prom 26367 - 26426 3.9 31 15 Op 1 1/0.600 - CDS 26428 - 27447 1299 ## COG0750 Predicted membrane-associated Zn-dependent proteases 1 32 15 Op 2 1/0.600 - CDS 27448 - 28125 912 ## COG0125 Thymidylate kinase 33 15 Op 3 15/0.000 - CDS 28113 - 29276 1577 ## COG0743 1-deoxy-D-xylulose 5-phosphate reductoisomerase 34 15 Op 4 32/0.000 - CDS 29294 - 30178 1031 ## COG0575 CDP-diglyceride synthetase 35 15 Op 5 1/0.600 - CDS 30171 - 30863 969 ## COG0020 Undecaprenyl pyrophosphate synthase - Prom 30924 - 30983 12.6 - Term 30948 - 30984 0.1 36 16 Op 1 22/0.000 - CDS 30995 - 31888 1193 ## COG0142 Geranylgeranyl pyrophosphate synthase 37 16 Op 2 1/0.600 - CDS 31890 - 32102 346 ## COG1722 Exonuclease VII small subunit - Prom 32124 - 32183 7.9 38 17 Op 1 1/0.600 - CDS 32315 - 32863 347 ## PROTEIN SUPPORTED gi|163764797|ref|ZP_02171850.1| ribosomal protein L29 39 17 Op 2 1/0.600 - CDS 32876 - 33907 1235 ## COG0809 S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) 40 17 Op 3 32/0.000 - CDS 33888 - 35039 1420 ## COG2890 Methylase of polypeptide chain release factors 41 17 Op 4 . - CDS 35039 - 36112 1551 ## COG0216 Protein chain release factor A 42 17 Op 5 . 
- CDS 36137 - 36553 571 ## FN1333 hypothetical protein 43 17 Op 6 1/0.600 - CDS 36576 - 37604 1238 ## COG0860 N-acetylmuramoyl-L-alanine amidase - Prom 37633 - 37692 4.3 44 17 Op 7 . - CDS 37700 - 37984 320 ## COG1862 Preprotein translocase subunit YajC - Prom 38063 - 38122 11.9 + Prom 38061 - 38120 13.8 45 18 Tu 1 . + CDS 38302 - 38391 66 ## + Term 38560 - 38610 1.2 - Term 38351 - 38397 3.6 46 19 Tu 1 . - CDS 38406 - 38582 113 ## COG0582 Integrase - Prom 38611 - 38670 6.7 + Prom 38970 - 39029 8.2 47 20 Op 1 . + CDS 39053 - 39412 88 ## gi|296327950|ref|ZP_06870485.1| conserved hypothetical protein 48 20 Op 2 . + CDS 39406 - 39705 203 ## gi|296327951|ref|ZP_06870486.1| conserved hypothetical protein 49 20 Op 3 . + CDS 39731 - 40153 297 ## gi|296327952|ref|ZP_06870487.1| conserved hypothetical protein + Term 40330 - 40370 3.1 + Prom 40196 - 40255 3.9 50 21 Tu 1 . + CDS 40376 - 40852 413 ## gi|296327953|ref|ZP_06870488.1| conserved hypothetical protein + Term 40875 - 40918 3.5 51 22 Tu 1 . - CDS 40940 - 41032 59 ## - Prom 41118 - 41177 12.2 + Prom 40887 - 40946 12.1 52 23 Op 1 . + CDS 41177 - 41443 472 ## bpr_IV094 addiction module antitoxin 53 23 Op 2 . 
+ CDS 41443 - 41709 215 ## COG4115 Uncharacterized protein conserved in bacteria + Term 41770 - 41819 6.1 - Term 41758 - 41805 9.5 54 24 Op 1 44/0.000 - CDS 41819 - 42793 815 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 55 24 Op 2 44/0.000 - CDS 42786 - 43793 633 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 56 24 Op 3 49/0.000 - CDS 43812 - 44681 1297 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 57 24 Op 4 38/0.000 - CDS 44691 - 45617 283 ## PROTEIN SUPPORTED gi|167855436|ref|ZP_02478201.1| 30S ribosomal protein S21 - Prom 45637 - 45696 3.0 - Term 45631 - 45681 8.5 58 24 Op 5 1/0.600 - CDS 45701 - 47236 2256 ## COG0747 ABC-type dipeptide transport system, periplasmic component - Prom 47282 - 47341 9.8 - Term 47413 - 47454 9.2 59 25 Op 1 1/0.600 - CDS 47473 - 48333 876 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily 60 25 Op 2 1/0.600 - CDS 48372 - 49112 916 ## COG3713 Outer membrane protein V 61 25 Op 3 1/0.600 - CDS 49115 - 50911 2401 ## COG0438 Glycosyltransferase 62 25 Op 4 2/0.100 - CDS 50922 - 51815 1073 ## COG1032 Fe-S oxidoreductase - Prom 51849 - 51908 8.8 63 26 Tu 1 . - CDS 52055 - 52858 1150 ## COG0561 Predicted hydrolases of the HAD superfamily - Prom 52888 - 52947 9.5 + Prom 52968 - 53027 8.3 64 27 Tu 1 . + CDS 53053 - 53454 497 ## COG5015 Uncharacterized conserved protein + Term 53467 - 53511 5.2 - Term 53308 - 53360 -0.6 65 28 Tu 1 . - CDS 53509 - 53934 668 ## FN0389 hypothetical protein - Prom 53997 - 54056 14.5 + Prom 54007 - 54066 13.8 66 29 Tu 1 . + CDS 54124 - 54921 1117 ## COG3315 O-Methyltransferase involved in polyketide biosynthesis + Term 54925 - 54964 -0.7 - Term 54903 - 54946 10.5 67 30 Op 1 . - CDS 54955 - 61599 8685 ## FN0387 hypothetical protein 68 30 Op 2 . - CDS 61669 - 62622 854 ## COG2342 Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase 69 30 Op 3 . 
- CDS 62632 - 63111 493 ## FN0385 hypothetical protein - Prom 63251 - 63310 6.1 - Term 63232 - 63285 1.7 70 31 Op 1 . - CDS 63370 - 63582 162 ## FN0385 hypothetical protein 71 31 Op 2 4/0.000 - CDS 63583 - 65901 1920 ## COG4267 Predicted membrane protein 72 31 Op 3 . - CDS 65903 - 67324 1484 ## COG0438 Glycosyltransferase 73 31 Op 4 . - CDS 67328 - 68254 643 ## FN0382 hypothetical protein 74 31 Op 5 . - CDS 68247 - 70103 1678 ## COG4878 Uncharacterized protein conserved in bacteria 75 31 Op 6 . - CDS 70075 - 70962 755 ## FN0379 hypothetical protein - Prom 70982 - 71041 5.2 + Prom 71019 - 71078 17.9 76 32 Tu 1 . + CDS 71313 - 72287 1195 ## COG1087 UDP-glucose 4-epimerase + Term 72297 - 72345 -1.0 - Term 72277 - 72340 14.1 77 33 Op 1 17/0.000 - CDS 72343 - 73989 1694 ## COG1178 ABC-type Fe3+ transport system, permease component 78 33 Op 2 7/0.000 - CDS 73979 - 75094 1500 ## COG3842 ABC-type spermidine/putrescine transport systems, ATPase components 79 33 Op 3 1/0.600 - CDS 75108 - 76175 269 ## PROTEIN SUPPORTED gi|167854980|ref|ZP_02477755.1| 50S ribosomal protein L13 - Prom 76202 - 76261 6.0 80 33 Op 4 . - CDS 76271 - 78805 2758 ## COG0608 Single-stranded DNA-specific exonuclease - Prom 78845 - 78904 11.0 - Term 78867 - 78903 3.5 81 34 Tu 1 . - CDS 78929 - 79696 814 ## FN0371 hypothetical protein - Prom 79791 - 79850 10.7 + Prom 79755 - 79814 10.8 82 35 Op 1 1/0.600 + CDS 79907 - 80821 1075 ## COG0681 Signal peptidase I 83 35 Op 2 1/0.600 + CDS 80842 - 81273 187 ## PROTEIN SUPPORTED gi|228002792|ref|ZP_04049785.1| (SSU ribosomal protein S18P)-alanine acetyltransferase 84 35 Op 3 1/0.600 + CDS 81266 - 82699 1973 ## COG0015 Adenylosuccinate lyase + Prom 82701 - 82760 9.9 85 36 Op 1 1/0.600 + CDS 82836 - 83354 420 ## COG4769 Predicted membrane protein 86 36 Op 2 . + CDS 83371 - 84729 2231 ## COG1109 Phosphomannomutase + Prom 84736 - 84795 7.1 87 37 Op 1 . 
+ CDS 84821 - 85195 317 ## FN0365 ATP synthase protein I, sodium ion specific 88 37 Op 2 40/0.000 + CDS 85231 - 85980 895 ## COG0356 F0F1-type ATP synthase, subunit a 89 37 Op 3 37/0.000 + CDS 86034 - 86303 561 ## COG0636 F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K 90 37 Op 4 38/0.000 + CDS 86344 - 86835 736 ## COG0711 F0F1-type ATP synthase, subunit b 91 37 Op 5 41/0.000 + CDS 86832 - 87365 723 ## COG0712 F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) 92 37 Op 6 42/0.000 + CDS 87382 - 88884 2519 ## COG0056 F0F1-type ATP synthase, alpha subunit 93 37 Op 7 . + CDS 88895 - 89743 1036 ## COG0224 F0F1-type ATP synthase, gamma subunit 94 38 Tu 1 . - CDS 89821 - 89895 97 ## - Prom 89988 - 90047 11.6 + Prom 89908 - 89967 6.3 95 39 Op 1 42/0.000 + CDS 90064 - 91452 2068 ## COG0055 F0F1-type ATP synthase, beta subunit 96 39 Op 2 1/0.600 + CDS 91462 - 91866 598 ## COG0355 F0F1-type ATP synthase, epsilon subunit (mitochondrial delta subunit) 97 39 Op 3 . + CDS 91886 - 92257 479 ## COG0346 Lactoylglutathione lyase and related lyases + Term 92268 - 92300 -0.4 98 39 Op 4 . + CDS 92301 - 92438 102 ## + Term 92446 - 92479 1.5 99 39 Op 5 . + CDS 92511 - 93662 2052 ## COG0192 S-adenosylmethionine synthetase + Term 93670 - 93710 4.2 - Term 93674 - 93718 -0.9 100 40 Tu 1 . - CDS 93722 - 94597 949 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily - Prom 94704 - 94763 79.6 + TRNA 94687 - 94763 72.9 # Arg CCT 0 0 - Term 94672 - 94731 19.5 101 41 Tu 1 . - CDS 94768 - 96147 1657 ## COG1757 Na+/H+ antiporter - Prom 96177 - 96236 11.2 - Term 96315 - 96370 -0.3 102 42 Op 1 . - CDS 96431 - 96874 613 ## FN0351 hypothetical protein 103 42 Op 2 . - CDS 96898 - 97341 547 ## FN0350 hypothetical protein - Prom 97371 - 97430 7.3 104 42 Op 3 . 
- CDS 97433 - 97888 798 ## COG1490 D-Tyr-tRNAtyr deacylase - Prom 97980 - 98039 11.3 + Prom 97924 - 97983 7.9 105 43 Op 1 1/0.600 + CDS 98016 - 99521 1898 ## COG1488 Nicotinic acid phosphoribosyltransferase 106 43 Op 2 1/0.600 + CDS 99514 - 100485 1000 ## COG0688 Phosphatidylserine decarboxylase 107 43 Op 3 1/0.600 + CDS 100466 - 100864 389 ## COG5341 Uncharacterized protein conserved in bacteria 108 43 Op 4 1/0.600 + CDS 100887 - 101978 1260 ## COG0628 Predicted permease 109 43 Op 5 . + CDS 101978 - 103759 2168 ## COG0116 Predicted N6-adenine-specific DNA methylase 110 43 Op 6 1/0.600 + CDS 103813 - 104316 640 ## COG0652 Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family 111 43 Op 7 1/0.600 + CDS 104357 - 105685 1696 ## COG2056 Predicted permease + Term 105735 - 105776 6.5 + Prom 105687 - 105746 3.9 112 44 Tu 1 . + CDS 105790 - 107100 1695 ## COG3314 Uncharacterized protein conserved in bacteria + Term 107192 - 107228 0.3 113 45 Op 1 . - CDS 107180 - 107464 286 ## FN0339 hypothetical protein 114 45 Op 2 . - CDS 107534 - 107653 231 ## 115 45 Op 3 . - CDS 107671 - 108123 593 ## COG3086 Positive regulator of sigma E activity - Prom 108174 - 108233 16.9 + Prom 108174 - 108233 10.1 116 46 Tu 1 . + CDS 108282 - 108602 303 ## FN0337 hypothetical protein + Prom 108675 - 108734 18.1 117 47 Op 1 . + CDS 108815 - 109990 1828 ## FN0336 hypothetical protein 118 47 Op 2 1/0.600 + CDS 110033 - 110581 170 ## PROTEIN SUPPORTED gi|163756109|ref|ZP_02163225.1| 30S ribosomal protein S1 + Term 110602 - 110656 -0.8 + Prom 110593 - 110652 3.1 119 47 Op 3 . 
+ CDS 110672 - 111919 1722 ## COG1448 Aspartate/tyrosine/aromatic aminotransferase Predicted protein(s) >gi|296154863|gb|ADVK01000021.1| GENE 1 1 - 96 168 31 aa, chain + ## HITS:0 COG:no KEGG:no NR:no SQVNAQGAENKELKERVQKLEEKLDKLLKTK >gi|296154863|gb|ADVK01000021.1| GENE 2 203 - 634 339 143 aa, chain + ## HITS:1 COG:no KEGG:Smon_0058 NR:ns ## KEGG: Smon_0058 # Name: not_defined # Def: hypothetical protein # Organism: S.moniliformis # Pathway: not_defined # 2 141 7 146 407 148 49.0 8e-35 MIKEILSLVNLSHILNFINTFLTQQHLEYIKASINKFLLCRNIKASFIKYTCTECGHYHT IPITCKSRLCPSCGFKYSATWTQKMINDILNIPHRYILFTIPKELRAFFCYDRTLLSKFA KAVNEVMKYQFHNIHKKIARKFM >gi|296154863|gb|ADVK01000021.1| GENE 3 807 - 1571 1487 254 aa, chain - ## HITS:1 COG:FN1297 KEGG:ns NR:ns ## COG: FN1297 COG0024 # Protein_GI_number: 19704632 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionine aminopeptidase # Organism: Fusobacterium nucleatum # 1 254 1 254 254 489 99.0 1e-138 MRLIKTLDEIKGIKKANQIIAKIYADIIPPYLKAGITTREIDKIIDDYIRSCGARPACIG VEGFYGPFPAATCISVNEEVVHGIPGDRVIKDGDIVSLDIVTELNGYYGDSAKTFAIGEI DEESRKLLEVTEKSREIGIEAAVVGNRLGDLGHAVQAYVEKNGFSVVRDFAGHGVGLDLH EEPMIPNYGRKGRGLKIENGMVLAIEPMVNVGTYKVAIMPDGWTVVTRDGKRSAHFEHSV AIIDGKAVILSELD >gi|296154863|gb|ADVK01000021.1| GENE 4 1581 - 2207 1058 208 aa, chain - ## HITS:1 COG:FN1298 KEGG:ns NR:ns ## COG: FN1298 COG0563 # Protein_GI_number: 19704633 # Func_class: F Nucleotide transport and metabolism # Function: Adenylate kinase and related kinases # Organism: Fusobacterium nucleatum # 1 208 4 211 211 375 96.0 1e-104 MNLVLFGAPGAGKGTQAKFIVDKYGIPQISTGDILRVAVANKTKLGLEAKKFMDAGQLVP DEVVNGLVAERLAEKDCEKGFIMDGFPRTVVQAKALDEILTKLGKQIEKVIALNVPDKDI IERITGRRTSKITGKIYHIKFNPPVDEKPEDLVQRADDTEEVVVKRLETYHNQTAPVLDY YKAQNKVTEIDGTKKLEDITQDIFKILG >gi|296154863|gb|ADVK01000021.1| GENE 5 2285 - 3214 1317 309 aa, chain - ## HITS:1 COG:FN1299 KEGG:ns NR:ns ## COG: FN1299 COG0451 # Protein_GI_number: 19704634 # Func_class: M Cell wall/membrane/envelope biogenesis; G 
Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Fusobacterium nucleatum # 1 309 1 309 309 551 99.0 1e-157 MKKILIMGGNQFVGKEIAKNFLEKDYTIYVLNRGTRKNIEGVFFLKVDRDNLIEMENILK DIEVDIIVDVSAYTEEQVDILHKVMKNGFKQYILISSASVYNNIECTPVNEGCQTGENLI WGDYAKNKYLAEKKTIENSNLYNFKYTIFRAFYIYGIGNNLDRENYFFSRIKYNLPIFIP SKNNIIQFGYVEDLALAIESSIENSDFYNQIFNISGDEYVTMSEFAEICGKVMAKKAVIK YVNTEENKIKARDWFPFREVNLFGDISKLENTGFRNTYSLIQGLEKTYKYNDENDLITKP ILNKLEIEN >gi|296154863|gb|ADVK01000021.1| GENE 6 3407 - 5026 2496 539 aa, chain + ## HITS:1 COG:FN1301 KEGG:ns NR:ns ## COG: FN1301 COG0488 # Protein_GI_number: 19704636 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Fusobacterium nucleatum # 1 539 1 539 539 1050 100.0 0 MIATASLGMRFSGRKLFEDVNLKFTPGNCYGVIGANGAGKSTFVKILSGELEATEGEVIF DKNKRMSVLKQDHFQYEDEEVLNVVLMGNKKLWDIMVEKNAIYAKTDFTDEDGIRAAELE GEFAELNGWEAETEAETLLMGLKIGADLHHKLMKELTEPEKVKVLLAQALFGEPDVLLLD EPTNGLDVKAISWLENFIMGLENSTVIVVSHDRHFLNKVCTHITDIDYGKIKMYVGNYDF WYESNELMKTLINNKNKKLEQKRQELQEFIARFSANASKSKQATSRKKQLEKLQLEDMQM SNRKYPFVEFKPEREAGNNLLKVENLSKTIDGIKVLDNVSFTIETGDKVVFLAKNDLVKT TLLSILAGEIEADSGTYTWGVTTSQAYMPRDNSQYFNNTDVNLIDWLRPYSPDEHEAFIR GFLGRMLFSGDETLKKVSVLSGGEKVRCMLSKLMLSGANVLLFDNPSDHLDLESITSLNK ALIKFKGTILFGAHDHEFIQTVANRIIEITPKGLVDKVTTYDEYLEDETIQARLDEMYS >gi|296154863|gb|ADVK01000021.1| GENE 7 5107 - 5325 359 72 aa, chain + ## HITS:1 COG:no KEGG:FN1302 NR:ns ## KEGG: FN1302 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 72 1 72 72 128 100.0 6e-29 MKELLQKLAWKKCHIATVNHKFKDATVLEVTDGCLLIETSEKEKVIINLQFVRLVVEAKE GALAPVFVPHDL >gi|296154863|gb|ADVK01000021.1| GENE 8 5394 - 5972 731 192 aa, chain - ## HITS:1 COG:FN1303 KEGG:ns NR:ns ## COG: FN1303 COG2096 # Protein_GI_number: 19704638 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 192 1 192 192 358 
100.0 2e-99 MEDKKYVNITKVYTKRGDKGETDLLGGSAARKDSLKVESYGCVDEASSFIGVARYYCKNK IIKERLKVIQNKLLVLGGFLASDERGKEMMKDQIKEDDIKLLEEYIDEYNQKLPPLKHFI LPGDEEVATHFHVARTVVRRAERRIVSLKAQEPDLNPLIQKYVNRLSDLMFVLARYSEEV ENKKWKSANLNI >gi|296154863|gb|ADVK01000021.1| GENE 9 5987 - 6451 711 154 aa, chain - ## HITS:1 COG:FN1304 KEGG:ns NR:ns ## COG: FN1304 COG0629 # Protein_GI_number: 19704639 # Func_class: L Replication, recombination and repair # Function: Single-stranded DNA-binding protein # Organism: Fusobacterium nucleatum # 1 154 1 154 154 261 100.0 3e-70 MNLVVLNGRLTRDPELKFGQSGKAYSRFSIAVDRPFQSSSDKNSQTADFINCVAFGKTAE FIGEYFRKGRKILLNGRLQMSQYESEGKKITTYVVIADSVEFGEAKTSSGTTETSSYGRS ESKSTNNTIETPGFDENSSDDMGAPTEIDDEFPF >gi|296154863|gb|ADVK01000021.1| GENE 10 6610 - 7278 930 222 aa, chain + ## HITS:1 COG:FN1305 KEGG:ns NR:ns ## COG: FN1305 COG1917 # Protein_GI_number: 19704640 # Func_class: S Function unknown # Function: Uncharacterized conserved protein, contains double-stranded beta-helix domain # Organism: Fusobacterium nucleatum # 112 222 1 111 111 213 100.0 2e-55 MVKIEVAKAICFNQLINSKETEVVSMRILNQSNSYISLFSLAKNEEITAEAMLGNRYYYC FNGSGEVSIENNKKHISNGDFLEVLAHNNYSIKSSDTLKLIEIGEKIGDEAMENQTLKML ESASAFNLADCVEYKEGQIVSKNLVAKSNLVITIMSFWKGETLDPHKAPGDALVTVLDGE GKYIVDGKTFIVKKGESTVLPANIPHAVEAVENFKMMLALVK >gi|296154863|gb|ADVK01000021.1| GENE 11 7412 - 9112 1908 566 aa, chain + ## HITS:1 COG:FN1306_2 KEGG:ns NR:ns ## COG: FN1306_2 COG0500 # Protein_GI_number: 19704641 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 265 566 1 302 302 569 99.0 1e-162 MENFYFNTFDGNKIFYRIWNFEKNKKTLIIIHRGHEHSERLSELTQNEKFLKYNIFAYDL RGHGYTEVKSSPNAMDYVRDLDSFIKHLKNEYQIKEEDIFIVANSIGGVILSAYVHDFAP NIAGMALLAPAFEIKLYIPFAKQLVTLLTKIKKDAKVMSYVKAKVLTHDIEEQNKYNSDK LINKEINAKLLIDLADMGKRLVEDSMAIELPTIIFSAEKDYVVKNSAEKRFFLNLSSKKR EFIELENFYHGIIFEKEREKVYKMLDDFIQDVFKNQNTSLDVSPREFSRKEYERIGLEEY 
PLSEKIFYSIQKFSMKTFGFLSKGMSLGLKYGFDSGISLDYIYKNQADGKLLLGKFIDRF YLNQIGWAGVRERKKNLLTLIEEKINNLGEENVKILDVAGGTGNYLFDIKEKYPNVQILI NEFKKSNIEVGEEVIKKNNWENISFVNYDCFNKETYKKINYTPNIVIISGVFELFEDNNM LENTISGVAEILDKNGTVIYTGQPWHPQLKQIALVLNSHKGHGKSWLMRRRSEKELDSLF ENYNLKKEKMLIDNDGIFTVSLAELR >gi|296154863|gb|ADVK01000021.1| GENE 12 9115 - 9714 483 199 aa, chain + ## HITS:1 COG:FN1307 KEGG:ns NR:ns ## COG: FN1307 COG0558 # Protein_GI_number: 19704642 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylglycerophosphate synthase # Organism: Fusobacterium nucleatum # 1 199 1 199 199 310 100.0 1e-84 MDISIYKLKTKFQNLLMPICEKLVKLKITPNQITVTTVLLNIIFAGIIYKFSNYNFLYLT IPVFLFLRMALNALDGMIANKFNQKTKIGVFYNEVGDVVSDTVFFYIFLRVIGINEVYNL LFIFLSALSEYIGVVAVMVDNKRHYEGPMGKSDRAFLISLLAIIYFFIGNQYFDYILILS IILLIFTIYNRVKSSLRGE >gi|296154863|gb|ADVK01000021.1| GENE 13 9716 - 10519 591 267 aa, chain + ## HITS:1 COG:FN1308 KEGG:ns NR:ns ## COG: FN1308 COG4589 # Protein_GI_number: 19704643 # Func_class: R General function prediction only # Function: Predicted CDP-diglyceride synthetase/phosphatidate cytidylyltransferase # Organism: Fusobacterium nucleatum # 55 267 1 213 213 288 100.0 9e-78 MLLLMLVVDILALIILFFIRNKISDKKFANIKQRIFTWFIIIILFYLGSRDRIYMLILFG LISILSFREFLQFAYIKYDNELKISSFIVNLAFYIGIYLKNFYILLILFILICLRYYKRG FIIFAFFITSYLIGSICYIDDLNFIINYLILIELNDVFQYISGNIFGERKITPNISPNKT VEGLIGGIILTTLTATLLKYFANIDFQVKFIPYICLIGFIGDIFISSLKRKVNLKDSGNL LLGHGGILDRVDSLIFTAPIILLIFKL >gi|296154863|gb|ADVK01000021.1| GENE 14 10594 - 10788 450 64 aa, chain + ## HITS:1 COG:no KEGG:FN1309 NR:ns ## KEGG: FN1309 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 64 1 64 64 109 100.0 3e-23 MVTGDMNIMEAVEKYPVIVEVLQRNGLGCVGCMIASGETLAEGIEAHGLDTKAILDEINS LIKE >gi|296154863|gb|ADVK01000021.1| GENE 15 10853 - 10936 65 27 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MYMLLFVQVKLYTIIKLLFYKIEFMKS >gi|296154863|gb|ADVK01000021.1| GENE 16 10966 - 11622 739 218 aa, chain - ## HITS:1 COG:AGl2035 KEGG:ns NR:ns 
## COG: AGl2035 COG1802 # Protein_GI_number: 15891135 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Agrobacterium tumefaciens strain C58 (Cereon) # 8 200 22 216 238 73 23.0 3e-13 MVKNRKNISDNVYEAIREHLLSQELNFGDKIVELDYCQKLNVSRTPLREAIKQLEIEGII ERAPNGRIKVMSMDEKRIDEIFQIRIALEDIIFNNLSKNNEFIPKLEDNLKLTEFQIKSE NWNEARKLFSEFNKILYSHSGLEFTIKILKYYNFILEKLRYNSLESNTRIVEAYQEHLIL LEYFKNNDIENAKIFNKEHLLRSKESIMTFFEKKYKKF >gi|296154863|gb|ADVK01000021.1| GENE 17 11725 - 12726 1167 333 aa, chain + ## HITS:1 COG:XF2596 KEGG:ns NR:ns ## COG: XF2596 COG0473 # Protein_GI_number: 15839185 # Func_class: C Energy production and conversion; E Amino acid transport and metabolism # Function: Isocitrate/isopropylmalate dehydrogenase # Organism: Xylella fastidiosa 9a5c # 2 329 3 331 335 340 50.0 2e-93 MKKITLIPGDGIGTEISKSLVRIFKSAKVPIEFEIENAGLTVYEQTGELIPESLYKSIEK NKVAIKGPITTPIGEGFKSINVSLRKKYDLYSNIRPVKTISGINTKYENVNMVIFRENTE GLYIGEEKFENQEKTSAIAIKRITKKGSIRIIKEAFEYAKKNNFNKVTVVHKANILKITD GMFLETAREISKQYKDIELEEMIVDNMCMQLVTNPQKFQVIVTMNFYGDFLSDLAAGLVG GLGVAPGANIGDDIAIFEAVHGSAPDIAGQNKANPTALILSSIEMLKYLNLNEYAEKIEK AIFKVLALPNFKTYDLSGNIGTKEFTDKIIEFL >gi|296154863|gb|ADVK01000021.1| GENE 18 12759 - 14069 1865 436 aa, chain + ## HITS:1 COG:BH0745 KEGG:ns NR:ns ## COG: BH0745 COG2851 # Protein_GI_number: 15613308 # Func_class: C Energy production and conversion # Function: H+/citrate symporter # Organism: Bacillus halodurans # 3 434 2 440 442 296 40.0 7e-80 MNLAIVGFIMLFLVVFLLFKEKSIPMILFITIPVIAAFIAGFSIEEVDGFIKDGIKTVSN MAILFIFSVTFFGIMSDAGMFDILVNKLVKKAGKNVVLIAIVTAIIAIFSHLDGATVTTV LVTIPALLPLYKKMNIRPQLLLLITGCGMGVMNLLPWGGPVVRAAAVLNMDVNDLWHLLI PIQIMGLIATFALAVVMAIREVKYYGAGQVNDLILNVNSSDEVDNSVENLKRSKLVLFNL LLTFAVLVVLLFNKFPNYFVFMFGCSIALLVNYPNIKEQKARIKAHTSAALDVSAVMLAA GIFVGVLGKSGMLEAMTVPLLKIIPSFIAKYLQLVMGVLALPLGTIVGTDSYFYGIMPLA MEVGQNYAIEPLNMAIAMLIGKNISLLVSPMVPATFLAIGLTNTELKDHLKYSLIPLWIL SLIMLIFAVIIGMVKL >gi|296154863|gb|ADVK01000021.1| GENE 19 14111 - 14902 876 263 aa, chain - ## HITS:1 COG:FN1310 
KEGG:ns NR:ns ## COG: FN1310 COG0810 # Protein_GI_number: 19704645 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protein TonB, links inner and outer membranes # Organism: Fusobacterium nucleatum # 22 263 1 242 242 320 100.0 2e-87 MKKYILISLVLHLIILFGFGVMQTAQLGKDEPKNQVVPIAFVAKQTSENPGGKVLDTEER EKQSPEPPKPKVEKKPEEKKPEEKKVEKKPEKKEIESNIPSKDAKPVEKQPETTTSENTQ STESADKVESTPSDNNSSSGGGNTSSTGSGDEGFGSNFISDGDGSYIALSSKGINYQIIN EVEPDYPSQAESIGYSKQVKVTVKFLVGLKGNVEKAEITQSHKDLGFDAEVMKAIKKWKF KPIYHNGKNIKVYFVKTFVFDPQ >gi|296154863|gb|ADVK01000021.1| GENE 20 14911 - 15300 539 129 aa, chain - ## HITS:1 COG:FN1311 KEGG:ns NR:ns ## COG: FN1311 COG0848 # Protein_GI_number: 19704646 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport protein # Organism: Fusobacterium nucleatum # 30 129 1 100 100 167 100.0 4e-42 MSKYKKKRESAKLDLTPLIDVVFLLIIFFMVTTTFNNFGSVQIDLPSSTIQKTDKNKSIE IIIDKDGNYHISEDGKITQVQFADLDAYLKSVKEATVSADKNLKYQTIMDVITKIKENGV DNLGLTFYE >gi|296154863|gb|ADVK01000021.1| GENE 21 15303 - 15911 850 202 aa, chain - ## HITS:1 COG:FN1312 KEGG:ns NR:ns ## COG: FN1312 COG0811 # Protein_GI_number: 19704647 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport proteins # Organism: Fusobacterium nucleatum # 1 202 1 202 202 374 99.0 1e-103 MLHYLEVGGPILWVLVIISIGAFAVVLERIVFFARNEKNVGSNFKDEILLLVASKKLDEA IALCDTKKSCVASAVRKFLQKAPKGIDVQDYEFILKEVTNQEISPYERRLNLLASVMSIS PMLGLLGTVTGMIRAFTNISKYGAGDAAVVADGIAEALLTTAAGLMIAIPVIVVYNYLNR RLEKMENEIDDVITNIINIFRR >gi|296154863|gb|ADVK01000021.1| GENE 22 16066 - 17490 1755 474 aa, chain + ## HITS:1 COG:FN1313 KEGG:ns NR:ns ## COG: FN1313 COG4166 # Protein_GI_number: 19704648 # Func_class: E Amino acid transport and metabolism # Function: ABC-type oligopeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 474 1 474 474 845 98.0 0 MKIFKFLSIFILSCLLFACGEEKLEETEKVEQIFYTVMPKQEYKLNSQSYSGNERALLTQ 
LFEGLTELKTEGVRLISVLSIEHSDNYKEWTFTLRDDLKWSDGEKITADTYLDSWLDSLE NSKSDEIYRMFVIKGAEDFYNKKVDKSSVGLKIQDNKFIVSLNAPIKNFDEWVSNPIFYP IRKENINLSLDKKIVNGAFKVSSFNDDEIILERNENYWDSINTKLKEIKISLIEDAIMAY EMFPRNEIDYFGEPFYPMPFDRLNQVNTLPEKLVFPTTRYWYISIPNENKENFFDKTEIR KLIYAVSDPEFMGKVILENDSPAIFSHPHPSSDILNKAKEDFEKIKENSNFNFSETPYIA YFENNNLLEKKLLLSTVKEWIGQFKISIRVTSNSDSAITFKIEKYLVGTNNINDLYYYIN YKYGTNIKSDEEFLDNLPVIPLLQEYDTVLSHSNVRGLNLTPSGDIYLKYINMQ >gi|296154863|gb|ADVK01000021.1| GENE 23 17467 - 17643 174 58 aa, chain - ## HITS:1 COG:no KEGG:FN1314 NR:ns ## KEGG: FN1314 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 58 1 58 60 83 89.0 3e-15 MLALFQGKSLEVISDYNSRKMINKYKTRLIECLKLDNIKTVTKKFCNSFFNLLHIYIF >gi|296154863|gb|ADVK01000021.1| GENE 24 17882 - 18427 870 181 aa, chain - ## HITS:1 COG:no KEGG:FN1315 NR:ns ## KEGG: FN1315 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 5 181 1 177 177 283 99.0 3e-75 MFLFLLSSVFSFATINDNLNLLKDEEKSEINEKIEEIQNVKGLTIFVNTLAEDEGFAISD PERAMILNLKKGDKEVYKVELSFSKDIDVEDYQEDINTTLNDSAELLERKEYGKYILTVL DGAGSVLQEVNIEALNQMTMTKEQENNSTPIMVAAFVIIILFIVYKMYAAYKDKSNQEED D >gi|296154863|gb|ADVK01000021.1| GENE 25 18461 - 19237 925 258 aa, chain - ## HITS:1 COG:FN1316 KEGG:ns NR:ns ## COG: FN1316 COG0327 # Protein_GI_number: 19704651 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 258 1 258 258 416 100.0 1e-116 MKARDIINILEKKFPKINAEEWDNIGLLIGDYDKEVKKIQFSLDATLESIENAISEKVDM LITHHPIIFKAIKDITEQNILGKKIRNLIKNDINVYSIHTNLDSSIEGLNDYVLKKIGIS EYKILDFDEEKNCGIGRIFKLNEEKNLKKFIEELKLKLKILNLRVISNDLNKKIKKVALI NGSAMNYWKKAKKEKVDLFITGDVSYHDALDALENGLSVIDFGHYESEYFFYEILIEELK DNNLEFLVFNREPIFKFY >gi|296154863|gb|ADVK01000021.1| GENE 26 19234 - 20058 1193 274 aa, chain - ## HITS:1 COG:FN1317 KEGG:ns NR:ns ## COG: FN1317 COG0568 # Protein_GI_number: 19704652 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, sigma subunit 
(sigma70/sigma32) # Organism: Fusobacterium nucleatum # 1 274 1 270 270 394 98.0 1e-109 MKLLSLEKYLLKNSLDDEEFQKLVFEISEKLELEALSEDRKLTDEEIDYEYIDFLIAETL ESLKDDVCTCEVDCGVEDCCGTRVEKNLKKVYEIALYMLRDGISYEDLTQEGIIGLIKAH ELFEDDKDFKLYKDYYIAKEMFNYINNYANYRKSAFKDYAEHEIHKDSHLKVSLKDRDKS EELKKLEKENKEKHIEEMKHLEKRAETLFDYLNLKYRLSEREIAVLVLYYGLDGHKKKTF SQISEITKIDDDNLDKILKGAMFKLSNVDEKVEL >gi|296154863|gb|ADVK01000021.1| GENE 27 20074 - 21330 1837 418 aa, chain - ## HITS:1 COG:FN1318 KEGG:ns NR:ns ## COG: FN1318 COG0568 # Protein_GI_number: 19704653 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) # Organism: Fusobacterium nucleatum # 88 418 1 331 331 518 99.0 1e-147 MKELIKNEKARALIKKAVEEGIITYEEINEELGDDFPAENIEQLINEMLEQGIKIVDEEQ LDELGEDELREKELEDDYTDNEEHDDLLEDDSEDKIDENEDENDETEEHFTEFDDEFNPE YIEDVSEDELSNEKLLNLGNSAKVDEPIKMYLREIGQVPLLTHEEEIDYAKKAYEGDEEA SKKLIESNLRLVVSIAKKHTNRGLKLLDLIQEGNIGLMKAVEKFEYTKGYKFSTYATWWI RQAITRAIADQGRTIRIPVHMIETINKIKKESRIYLQETGKDASPEILAERLGMEVDKIK AIQEMNQEPISLETPVGSEEDSELGDFVEDQKTTSPYEATNRAILREELDAVLKTLSPRE EKVLRYRYGLDDSSPKTLEEVGKIFNVTRERIRQIEVKALRKLRHPSRKKKLEDFKVD >gi|296154863|gb|ADVK01000021.1| GENE 28 21359 - 23170 1625 603 aa, chain - ## HITS:1 COG:FN1319 KEGG:ns NR:ns ## COG: FN1319 COG0358 # Protein_GI_number: 19704654 # Func_class: L Replication, recombination and repair # Function: DNA primase (bacterial type) # Organism: Fusobacterium nucleatum # 1 603 1 603 603 1047 99.0 0 MYFKQEDIDKLLDNLRIEEVVGEFIELKKVGSSYRGLCPFHADTNPSFFVTPEKKICKCF VCGSGGNAINFYSKIKNISYTEAIRELSQKYRINIKEYKNTNTNENYEKFYQIMEDSHNF FMEKIFSQDSRGALEYLSNRGLDTNLIKEHQLGYAPPKWSELYELLNAKGYSDEDLLALG LIKKSEEGRIYDAFRNRIIFPIFSPSGRIIAFGGRTLEKDTSVPKYINSPDTPIFKKGKN AYGIERAVNIKNKNYSILMEGYMDVLSANIYGFDTSIAPLGTALTEEQAQLIKRYSSNIL LSFDMDKAGISATERASFILKSQGFNIRVLQFEESKDPDEFLKKNGREAFLKVVENSLEI FDFLYNLYSSEYDLNNNIIAKQNFVERFKDFFSNVDNDLEKEMYLKKLSEKIDISIDVLR KTLVEQNKKHVTRKDYLDENQEKIEKKEFKQANNLEMAIVKMLLRKPEYYNFFKEEKLES 
DIANRIFKFFNQKIKENLFFDSNIIMKEFKKYVEESNDFSQYEKNNELARIIMDYILIPN KIEEERENIELFKSYLRVKLKLRDKTKDDIAKKIEFGKLKKEIAKAKSVEEFIKVYNSFK YLF >gi|296154863|gb|ADVK01000021.1| GENE 29 23234 - 24925 2225 563 aa, chain - ## HITS:1 COG:FN1320 KEGG:ns NR:ns ## COG: FN1320 COG0760 # Protein_GI_number: 19704655 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Parvulin-like peptidyl-prolyl isomerase # Organism: Fusobacterium nucleatum # 208 563 1 356 356 595 100.0 1e-170 MSIRKFRKKMKPFIIILTVVFILSLAYGGYESYRTSRANKKAQEAMLLNKDYIQKIEIER AKQDLSRTYSDRVDKELVDILAFNDVIDKNLTLHIAKDLKVKVPSSEVNKQYEELESSMG DKEQFRRMLQVRGLTKDSLKNQIEENLLMQKTREEFAKNINPNDEEIDAYMALYSIPSDK REDAVNLYKTEKGTEAFREALLKARKEMKIKDLAPEYENLLEKTAYEEEGFTITNLDLAR ATANVMLGQKISKEDAEKQAKEMISRQIKMAKIAKEKGVKVNENLDSISQFQDYYVGLAE KVRDEVKPTDDELMKFFNANREKYSIPATADAKLVFISVKSAKEDDDLAKEKAEKLLSEL TPENFTEKGKTLGNNQDIIYQDLGTFGTKAMVKEFEEALKDVPSNTVVNKVIKTKFGYHV ALVKENNNNRQWGVEHILIVPYPSEKTVTEKLEKLNKIKADIEAGTLALNDKIDEDAIQS FDAKGITPDGIIPDFVYSPEIAKAVFSTELNKVGIINPNKATIIIFQKTKEVKAENANFD KSKEQVKSDYVNKKVAEYMSKLF >gi|296154863|gb|ADVK01000021.1| GENE 30 24954 - 26339 1894 461 aa, chain - ## HITS:1 COG:FN1321 KEGG:ns NR:ns ## COG: FN1321 COG2204 # Protein_GI_number: 19704656 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Fusobacterium nucleatum # 1 461 9 469 469 795 100.0 0 MKNAILAISEKKEILKQIRKELAEKYEVITFNNLLDAIDMVRESDFDLILLDNALEGISV GEAKKKLTSIGKDFITVALVDEINEAETKELEKFGIFAYLLKPIKIEDLDAIILPSLNGL ELIKENKRLEEKLSILEEDTDIIGQSAKIKDVRNLIEKIADSDLPVLIVGETGTGKDIIA KEIHRKSDRNKGKYAQVSCALYPGELIERELFGYERGAFLGANASKKGLLEEIDGGTIYI EDIAKMDIKVQSRFLKAIEYGEFKRVGGTKVRKSNVRFLVGTDIDLKQETEKGKFRKDLY HRLTALTIEVPPLRERKEDIPVLANYFLNKIVRILHKETPVISGEAMKFLMEYYYPGNIM ELKNLIERMALLSKDKILDVEQLPLEIKTKSDIVENKTVVGVGPLKEILEQEIYSLEEVE RVVIAIALQKTRWNKQETSKILGIGRTTLYEKIRKYGLDTK >gi|296154863|gb|ADVK01000021.1| GENE 31 26428 - 27447 1299 339 aa, chain - ## HITS:1 
COG:FN1322 KEGG:ns NR:ns ## COG: FN1322 COG0750 # Protein_GI_number: 19704657 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted membrane-associated Zn-dependent proteases 1 # Organism: Fusobacterium nucleatum # 1 339 1 339 339 608 98.0 1e-174 MTFLIAVVMLGLIIFVHELGHFLTAKLFKMPVSEFSIGMGPQVFSVDTKKTTYSFRAIPI GGYVNIEGMEVGSEVENGFSSKPAYQRFIVLFAGVFMNFLMAFILLFVTAKISGRIEYDT NAIIGGLVKGGANEQILKVDDKILELDGKKINIWTDISKVTKELQDKEEITALVERNGKE ENLTLKLTKDEENNRVVLGISPKYKKIDLSTTESLDFAKNSFNSILTDTVKGFFILFSGK VSLKEVSGPVGIFKVVGEVSKFGWISIASLCVVLSINIGVLNLLPIPALDGGRIIFVLLE LVGIKVNKKWEEKLHKGGMILLLFFILMISVNDVWKLFN >gi|296154863|gb|ADVK01000021.1| GENE 32 27448 - 28125 912 225 aa, chain - ## HITS:1 COG:FN1323 KEGG:ns NR:ns ## COG: FN1323 COG0125 # Protein_GI_number: 19704658 # Func_class: F Nucleotide transport and metabolism # Function: Thymidylate kinase # Organism: Fusobacterium nucleatum # 1 225 1 225 225 392 99.0 1e-109 MGKIIVIEGTDSSGKETQTKLLYERVKKIYDKTIKISFPNYDSPACEPVKMYLAGAFGTD ATKVNPYPVSTMYAIDRYASFKQDWEKKYIDDYVIITDRYVTSNMIHQASKIKNNEEKDE YLKWLVDLEYNKNKIPEPDIVIFLKMPIDKAKELMENRKNKIDGSERKDIHEVNEDYLKR SYENATAISKKYNWCEIECVENNKIKSIEKINDEIFSKIKEIIGG >gi|296154863|gb|ADVK01000021.1| GENE 33 28113 - 29276 1577 387 aa, chain - ## HITS:1 COG:FN1324 KEGG:ns NR:ns ## COG: FN1324 COG0743 # Protein_GI_number: 19704659 # Func_class: I Lipid transport and metabolism # Function: 1-deoxy-D-xylulose 5-phosphate reductoisomerase # Organism: Fusobacterium nucleatum # 1 387 4 390 390 714 99.0 0 MKKILILGSTGSIGTNALELIRNNREQYQVVGISGNKNIDLLKKQIEEFKPISIYVGSEQ DAIYLKKEYSFIKEVYFGENGLAELSKNSDYDIILTAVSGAIGIDATVEAIKRGKRIALA NKETMVSAGVYINKLLKEYPKAEIVPVDSEHSALFQSLQGFKKENLKKLIITASGGTFRG KDLAYLENVTVEQALKHPNWSMGKKITIDSSTLVNKGLEVIEAHELFNVDYDDIEVVIHP QSIIHSMVEYVDGSIIAQMGVVNMKTPILYAFTYPEKEYNSSINFLDLIKNNNLTFEEAD RKVFKGIDLAYRAGRTGDTMPTVFNASNEIAVELFMKKQIKFLDIYRIIEEAMNSHQVLT LNTDNALNVIKEVDREIRKKVREQWEK >gi|296154863|gb|ADVK01000021.1| GENE 34 29294 - 30178 1031 294 aa, chain - ## HITS:1 COG:FN1325 KEGG:ns 
NR:ns ## COG: FN1325 COG0575 # Protein_GI_number: 19704660 # Func_class: I Lipid transport and metabolism # Function: CDP-diglyceride synthetase # Organism: Fusobacterium nucleatum # 1 294 1 294 294 461 99.0 1e-130 MFKWNRVLVALIGVPLLLFIYTGESFFKINLYGLPMLIFTNLVIGTGTYEFYKMIKILGK EVYDKFGIIVAIIIPNLVYLENRRNYFEYNLIAVVLIIATIFMLTYRIFKNQIRGTLEKV SYTLLGIVYVSVFFSQIIILYFLGAMYPLILQVLVWISDTSAGIVGVAIGRKFFKNGFTE ISPKKSVEGALGSVVFTGLTFMLIVVMYIEKIKGATIGEIFLSFIIGAVISVVAQIGDLI ESLFKRECGVKDSGTILMGHGGILDRFDSMILVLPFVTTVLYFFRLYVSYQYGI >gi|296154863|gb|ADVK01000021.1| GENE 35 30171 - 30863 969 230 aa, chain - ## HITS:1 COG:FN1326 KEGG:ns NR:ns ## COG: FN1326 COG0020 # Protein_GI_number: 19704661 # Func_class: I Lipid transport and metabolism # Function: Undecaprenyl pyrophosphate synthase # Organism: Fusobacterium nucleatum # 1 230 1 230 230 426 99.0 1e-119 MEKNIPKHIAIIMDGNGRWAKKRGLARSFGHMEGAKTLRKALEYLTEIGVKYLTVYAFST ENWNRPQEEVSTLMKLFLKYIKNERKNMMKNKIRFFVSGRKNNVPEKLQKEIEKLEEETK NNDKITLNIAFNYGSRAEIVDAVNRIIKDGKENITEEDFSKYLYNDFPDPDLVIRTSGEM RISNFLLWQIAYSELYITDVLWPDFDEKEIDKAIESYNQRERRFGGVKNV >gi|296154863|gb|ADVK01000021.1| GENE 36 30995 - 31888 1193 297 aa, chain - ## HITS:1 COG:FN1327 KEGG:ns NR:ns ## COG: FN1327 COG0142 # Protein_GI_number: 19704662 # Func_class: H Coenzyme transport and metabolism # Function: Geranylgeranyl pyrophosphate synthase # Organism: Fusobacterium nucleatum # 1 297 1 297 297 527 98.0 1e-150 MNDFQVYLKEKTDFFETELKKELKELSYPETIAKGMEYAILNGGKRLRPFLLFATLELLN QNIEKGVKSAIALEMIHSYSLVHDDLPALDNDDYRRGKLTTHKVFGEAEGILIGDSLLTY AFYVLSQKNLELLSSKQIVNIISKTSEYAGIDGMIGGQMIDIQSENKKIDLETLKYIHSH KTGKLIKLPIEIACIIANLEKDKREVLEEYADLIGLAFQVKDDILDVEGTFEDLGKPVGS DVDLHKATYPSILGMEESKKILNNTVEKAKEIIKNKFGEEKGKVLISLADFIKDRKK >gi|296154863|gb|ADVK01000021.1| GENE 37 31890 - 32102 346 70 aa, chain - ## HITS:1 COG:FN1328 KEGG:ns NR:ns ## COG: FN1328 COG1722 # Protein_GI_number: 19704663 # Func_class: L Replication, recombination and repair # Function: Exonuclease VII small subunit # Organism: Fusobacterium 
nucleatum # 1 70 1 70 70 81 100.0 4e-16 MGKNTFEENLENLDKIIESLESGELSLDDAIKEYENAMKLIKSSSKILNEAEGKLLKVIE KNGEIDIEEI >gi|296154863|gb|ADVK01000021.1| GENE 38 32315 - 32863 347 182 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764797|ref|ZP_02171850.1| ribosomal protein L29 [Bacillus selenitireducens MLS10] # 1 180 13 192 199 138 38 1e-31 MRIIAGEAKNRIIKTRKGFDTRPTLESVKESLFSIIAPYIEGSIFLDLFSGSGSISLEAI SRGAKRAVMIEKDGEALKYIIENIDNLGFSDRCRAYKNDVIRAIEILGRKNEKFDIIFMD PPYQDNVTKKVLKAIDKANILAEDGLIICEHHLLEDLEDNIASFRKTDERKYNKKILTFY TK >gi|296154863|gb|ADVK01000021.1| GENE 39 32876 - 33907 1235 343 aa, chain - ## HITS:1 COG:FN1330 KEGG:ns NR:ns ## COG: FN1330 COG0809 # Protein_GI_number: 19704665 # Func_class: J Translation, ribosomal structure and biogenesis # Function: S-adenosylmethionine:tRNA-ribosyltransferase-isomerase (queuine synthetase) # Organism: Fusobacterium nucleatum # 1 343 9 351 351 641 99.0 0 MSTYLSDYDYFLPEELIGQKPREPRDSAKLMLIDRKNGSVEHKNFYNIIDYLQKGDVLVR NATKVIPARIFGHKDTGGVLEILLIKRITLDTWECLLKPAKKLKLGQKLYIGENKELIAE LLEIKEDGNRILKFYYEGSFEEILDKLGSMPLPPYITRKLENKDRYQTVYAQRGESVAAP TAGLHFTEELLNKILDKGVEIVDIFLEVGLGTFRPVQTVNVLEHKMHEESFEISEKVAKI INEAKAEGRRIISVGTTATRALESSVDENGKLIAQKKDTGIFIYPGYKFKIVDALITNFH LPKSTLLMLVSAFYDREKMLEIYNLAVKEKYHFFSFGDSMFIY >gi|296154863|gb|ADVK01000021.1| GENE 40 33888 - 35039 1420 383 aa, chain - ## HITS:1 COG:FN1331 KEGG:ns NR:ns ## COG: FN1331 COG2890 # Protein_GI_number: 19704666 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methylase of polypeptide chain release factors # Organism: Fusobacterium nucleatum # 30 383 1 354 354 588 99.0 1e-168 MNLVEILKFTEEYLKKYSFSKPRLEAEKLVSYVLNLDRIALYIHYERELSEDEKTSIKQY LKKMVEENKTFDELKGEKKDFKEENLDIFNKSVEYLKKNGVSNPLLDTEYIFSDVLKVNK NTLKYSMSREIKEEDKNKIREMLVLRAKKRKPLQYILGEWEFYGLPFKMSEGVLIPRADT EILVEQCIQLMREVEEPNILDIGSGSGAISIAVANELKSSSVTGIDINEKAIKLAIENKI LNKIENVNFIESNLFGKLDKDFKYDLIVSNPPYISKEEYETLMPEVKNYEPQNALTDLGD GLHFYKEISKLAGEYLKDTGYLAFEIGYNQAKDVSKILQDNNFAILSIVKDYGGNDRVII AKKAIKAENFEEIEEEEDVNLSE 
>gi|296154863|gb|ADVK01000021.1| GENE 41 35039 - 36112 1551 357 aa, chain - ## HITS:1 COG:FN1332 KEGG:ns NR:ns ## COG: FN1332 COG0216 # Protein_GI_number: 19704667 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Protein chain release factor A # Organism: Fusobacterium nucleatum # 1 357 9 365 365 592 99.0 1e-169 MFDKLEEVVARYDELNKMLVSPEVLADSKKMIECNKAINEITEIVEKYKEYKKYVDDIEF IKESFKTEKDSDMKEMLNEELKEAEEKLPKLEEELKILLLPKDKNDDKNVIVEIRGGAGG DEAALFAADLFRMYSRYAERKKWKIEIIEKQDGELGGIKEIAFTIIGLGAYSRLKFESGV HRVQRVPKTEASGRIHTSTATVAVLPEVEDIQEVTVDPKDLKIDTYRSGGAGGQHVNMTD SAVRITHLPTGIVVQCQDERSQLKNREKAMKHLLTKLYEMEQEKQRSEVESERRLQVGTG DRAEKIRTYNFPDGRITDHRIKLTVHQLEAFLDGDIDEMIDALITFHQAELLSASEQ >gi|296154863|gb|ADVK01000021.1| GENE 42 36137 - 36553 571 138 aa, chain - ## HITS:1 COG:no KEGG:FN1333 NR:ns ## KEGG: FN1333 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 138 1 138 138 207 97.0 9e-53 MNLKKINLTIVLAGIVVLLVIITLLMPSREKIKEIEVKKVEVKKEEMVEVTVYGVAKDSD SPSKYTLTLKQASTSDLLRTAVEDMVKKYSSNSELINIYFSDDRVYYEFNNKDLSEAFLN ALQMTTQEITGMEEISLL >gi|296154863|gb|ADVK01000021.1| GENE 43 36576 - 37604 1238 342 aa, chain - ## HITS:1 COG:FN1334 KEGG:ns NR:ns ## COG: FN1334 COG0860 # Protein_GI_number: 19704669 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: N-acetylmuramoyl-L-alanine amidase # Organism: Fusobacterium nucleatum # 5 342 1 338 338 607 99.0 1e-174 MKRKLLSIFFFFLISALSFSAKVNDVKFSANKFIINLNASDGECLVSADEESRLIYIEIQ NLDSNSFEKFSRNLELDIRGSNLFEDVIIDKSKDSVSLTLQVAPKVSYIMDATNNKIELN LQRTSKNKHLIVIDPGHGGKDPGAARGSVVEKKIVLAVGTYLRDELSKDFNVIMTRDSDF FVVLSERPKIGNKNKAALFVSVHANAAENKSANGVEVFYFSKKSSPYAERIANFENSIGE KYGDSSDKIIQISGELAYKKNQENSIRLAKKVVENIADRLEMRNGGVHGANFAVLRGFNG TGILIELGFVSNSYDAEILVDPSSQQKMAEEIAKSIREYLTR >gi|296154863|gb|ADVK01000021.1| GENE 44 37700 - 37984 320 94 aa, chain - ## HITS:1 COG:FN1335 KEGG:ns NR:ns ## COG: FN1335 COG1862 # Protein_GI_number: 19704670 # Func_class: U Intracellular trafficking, secretion, 
and vesicular transport # Function: Preprotein translocase subunit YajC # Organism: Fusobacterium nucleatum # 1 94 1 94 94 137 100.0 4e-33 MQELFAKYGGTGAIIVLWIAIFYFLIIRPNKKKQQQQQNLLNSLKEGTEVITIGGIKGTI AFVGEDYVEIRVDKGVKLTFRKSAIANVINNNQQ >gi|296154863|gb|ADVK01000021.1| GENE 45 38302 - 38391 66 29 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MYSHTFFVDNLLIVNLVFDNKNPSNFFGY >gi|296154863|gb|ADVK01000021.1| GENE 46 38406 - 38582 113 58 aa, chain - ## HITS:1 COG:FN0402 KEGG:ns NR:ns ## COG: FN0402 COG0582 # Protein_GI_number: 19703744 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Fusobacterium nucleatum # 1 58 1 58 58 79 89.0 2e-15 MKYHTLHDTRHTFATLLVNAKVNKEVIIKIIGHKRYKTTLDIYVHKNYDDMKRAINQI >gi|296154863|gb|ADVK01000021.1| GENE 47 39053 - 39412 88 119 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327950|ref|ZP_06870485.1| ## NR: gi|296327950|ref|ZP_06870485.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 119 1 119 119 135 100.0 9e-31 MSFNILFLKLFSIILIFLAFNFTHKTVFVQNYFFITSTNYNYIEEFPTQKKILSFLNRTL FIICASLILSYIFDYKIIYLGALVSTFLIIWPPIVYLNLLKFPFTTKKLLYYFVIFYTC >gi|296154863|gb|ADVK01000021.1| GENE 48 39406 - 39705 203 99 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327951|ref|ZP_06870486.1| ## NR: gi|296327951|ref|ZP_06870486.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 99 1 99 99 132 100.0 6e-30 MLNTYLLVYFSDNFLKKSLQGENIYILDNSAVALIINLFFLSFPTILDKIISKLYYSNYN MSINIFEEEVRLTIEKISFIQYYLLSLYKFEILKFSKKK >gi|296154863|gb|ADVK01000021.1| GENE 49 39731 - 40153 297 140 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327952|ref|ZP_06870487.1| ## NR: gi|296327952|ref|ZP_06870487.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 140 1 140 140 228 100.0 2e-58 MLCIENLNRGGKIYRKLEKIYCYFFYERAIKKDISVGISQIKISNIANILRQSPIMFKKN LFKPSFSINVMTLIIKKIFNEYNDCSDYFNGDIFSYIASNYNGNYEIIDVKIYSAVLRSL MKNKTLKYKKINYDEEFYVY >gi|296154863|gb|ADVK01000021.1| GENE 50 40376 - 40852 413 158 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296327953|ref|ZP_06870488.1| ## NR: gi|296327953|ref|ZP_06870488.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 158 1 158 158 301 100.0 1e-80 MEINGIYFAKGEFYQIIRDIGGVWNDSKERPIVCLLKIDDTDIYWAIPMGNLNHRNEKAK ERLDFYLNIEESDIRSCFYHIGKTTTDTIFFISDVIPIKEIYIDREYLGFNNIHYVIKNK KLISELERKLKRILYFEDSRPNYFRQHITDLKNKLLSE >gi|296154863|gb|ADVK01000021.1| GENE 51 40940 - 41032 59 30 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSTNYQQKTVRIQKSPQLFSSSGFFVEIKL >gi|296154863|gb|ADVK01000021.1| GENE 52 41177 - 41443 472 88 aa, chain + ## HITS:1 COG:no KEGG:bpr_IV094 NR:ns ## KEGG: bpr_IV094 # Name: not_defined # Def: addiction module antitoxin # Organism: B.proteoclasticus # Pathway: not_defined # 1 85 13 97 98 97 58.0 2e-19 MSMKLINIRMDEDLKKEMEIVCNDLGINITTAFTIFAKKLTREKRIPFSVSIDPFYSNEN IKALQNSIDEVKDGKVIMKTIEELEAME >gi|296154863|gb|ADVK01000021.1| GENE 53 41443 - 41709 215 88 aa, chain + ## HITS:1 COG:SA2195 KEGG:ns NR:ns ## COG: SA2195 COG4115 # Protein_GI_number: 15927985 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Staphylococcus aureus N315 # 1 85 4 88 88 99 61.0 2e-21 MKISFSIQAWEEYLYFQSQDKKTLKKINELIKDIERNGVLNGIGKPEKLTNNLTGLYSRR INDKDRLVYKLENDFIVILQCKGHYNDN >gi|296154863|gb|ADVK01000021.1| GENE 54 41819 - 42793 815 324 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 5 312 11 324 329 318 52 7e-86 
MNKVLLEVKNLKKYFQTPKGQLHAVDNVNFAIEEGKTLGVVGESGCGKSTTGRTILRLLE ATDGEIIFEGKNIREYSKAEMKKIREEMQIIFQDPFASLNPRMTVSEIIAEPLIIHKKCK NKEELSNRVKELMDTVGLSQRLVNTYPHELDGGRRQRIGIARALALNPKFIVCDEPVSAL DVSIQAQVLNLMKDLQEKLSLTYMFITHDLSVVKYFSNDIAVMYLGELVEKAPSKDLFKN PIHPYTKALLSAIPTINIRKKMERIKLEGEITSPINPGVGCRFAKRCIYAEEICFKESPK LEKVGEAHFFACHRAKELGFVDKK >gi|296154863|gb|ADVK01000021.1| GENE 55 42786 - 43793 633 335 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 23 321 35 328 329 248 42 9e-65 MENKNLLEIRDLEIQYVKDDETVHAVNGISVDIAEGETLGLVGETGAGKTTTALGIMRLI TGPTGKIKSGAIKFNGKSILEIPEEEMRKIRGNDISMIFQDPMTSLNPVMTVGEQIAEVI EIHEHIGKEEAMNKAAEMLELVGIPGARKNDFPHQFSGGMKQRVVIAIALACNPKLLIAD EPTTALDVTIQAQVLDLMTDLKNKFKTSMLLITHDLGVVAQVCDKVAIMYAGEIVEYGTL EDVFENPKHPYTLGLFGSIPSLDEEKTRLVPIKGLMPDPTNLPSGCKFNPRCPHAVELCS QRAPVVTEVSKGHKVQCLIAEGLVKFKENWEEENE >gi|296154863|gb|ADVK01000021.1| GENE 56 43812 - 44681 1297 289 aa, chain - ## HITS:1 COG:FN0398 KEGG:ns NR:ns ## COG: FN0398 COG1173 # Protein_GI_number: 19703740 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 289 1 289 289 498 100.0 1e-141 MEKTKNKKQSQWAEVFRMLRKNRMAMLGLIILIILVLLALFADVIANYDTIVIKQNLAER LMPPNGKHWLGTDEFGRDIFARLIHGARVSLKVGILAISISVVVGGILGAISGYFGGVID NVIMRVVDIFLAVPSILLAIAIVSALGPSMLNLMISISVSYVPNFARIVRASVLSIRDQE FIEAAKAIGASNSRIIMKHIIPNSLAPVIVQGTLGVAGAILSTAGLSFIGLGIQPPAPEW GSMLSGGRQYLRYAWWVTTFPGVAIMITILSLNLLGDGLRDALDPRLKQ >gi|296154863|gb|ADVK01000021.1| GENE 57 44691 - 45617 283 308 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|167855436|ref|ZP_02478201.1| 30S ribosomal protein S21 [Haemophilus parasuis 29755] # 62 307 40 316 320 113 28 4e-24 MYKYILKRLVLLIPVMLGVTLLVFTIMYLTPGDPAQLILGESAPKEAVAALREKMGLNDP FFMQYFRFVKNALVGDFGRSYTTGREVFAEIFARFPNTVVLAVLGVLISILIGIPVGIIS 
ATKQYSITDSFSMILALLGVSMPVFWLGLMLILLFSVKLGIFPSGGFDGFSSVILPSIAL GVGSAAIVTRMTRSSMLEVIRQDYIRTARAKGVAEKVVINKHALKNALIPIITVVGLQFG GLLGGAVLTESVFSWPGVGRLMVDAIRQKDTPTVLASVVFLAVVYSVVNLLVDLLYAFVD PRIKSQYK >gi|296154863|gb|ADVK01000021.1| GENE 58 45701 - 47236 2256 511 aa, chain - ## HITS:1 COG:FN0396 KEGG:ns NR:ns ## COG: FN0396 COG0747 # Protein_GI_number: 19703738 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 511 1 511 511 1026 99.0 0 MKKKFWLLMAMVLSVLFLVACGGDPDKKSDAGAGGKRDTLVIGNGADAKSLDPHASNDNP SSNVRVQIFDTLMDLDDDGIPQPMLAESWERPDDKTIIFHLRKGVKFHNGDEMKASDVKF SLERALKSPEVSHILAGINGVEVLDDYTVKVTTEKPMAAILNNLSHTTIAILSEKATTEA GDKFGQNPIGTGPYKFVSWQSGDRITLEAFPDYWRGETPVKNIVFRNIVEETNRTIGLET GELDIAYDIKGLDKNKLKEDERFTLLEGPQVSITYLGFNLRKAPYDNLKVREAISYAIDQ KPIIDTVFLGAGEPANSIIGPNIWGYYDVEKYTQDIEKAKALLAEAGYPNGFKAKIWVND NPVRRDTAVILQDQLKQIGIDLTIETVEWGAFLDGTARGDHEMYLLGWGTVTRDPDYGMF ELISSSTMGAAGNRSFYSNPEVDKLLEAGKTELDPEKRKDIYKQIQEIVRRDIPMYMIVY PLQNVITKKDVKNFKLDAAQTYRLYGVSIEE >gi|296154863|gb|ADVK01000021.1| GENE 59 47473 - 48333 876 286 aa, chain - ## HITS:1 COG:FN0395 KEGG:ns NR:ns ## COG: FN0395 COG0697 # Protein_GI_number: 19703737 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 286 1 286 286 461 100.0 1e-130 MVVSDKTKGIFWMLVSVLGFTFMGIAVKYLPRIPTYEKVFFRNSVSFITSAYILYRKKES IKVAKQNIPFVFGRSFFGFVGMVANFYALENLTMAEANMLNKLSPVFVTICACIFLKEKV DKKQVIGIILMLIAVVFVIKPSFSPEVIPSLVGLFSAILAGFSYTIIRYLYGKVKAEINV FYFSLLSVICTFPLMMLNFIKPNLFETFMLIVGIGVSAAMGQFGLTYAYTFAPASEVSIY NYVIIITSMLMDYILFSTIPDLFSFIGGFIIMATAIYLYLHNKKKE >gi|296154863|gb|ADVK01000021.1| GENE 60 48372 - 49112 916 246 aa, chain - ## HITS:1 COG:FN0394 KEGG:ns NR:ns ## COG: FN0394 COG3713 # Protein_GI_number: 19703736 # Func_class: M Cell 
wall/membrane/envelope biogenesis # Function: Outer membrane protein V # Organism: Fusobacterium nucleatum # 1 246 1 246 246 446 99.0 1e-125 MKKYLLIILILFSTVVMANDDFKASVTVGYGTNDSVYKGKEYYRIPIFINTSYKNLYLEG TEIGAKFIDTDRFDTSVFLELQDGHYIKPSKMESGYRTIKKRKFQQTFGLKADIRIDEIS KNLILSPYFSAGNRGTQTGASLSYLYMPAENIIISPSISTKYLSKKYTDYYFGVDRDELG GNITNEYNPDGAFEFGAGLYGEYYFTKHISALAYLNMSRYSSEVRKSPITEDRIITNVGA GLKYTF >gi|296154863|gb|ADVK01000021.1| GENE 61 49115 - 50911 2401 598 aa, chain - ## HITS:1 COG:FN0393_1 KEGG:ns NR:ns ## COG: FN0393_1 COG0438 # Protein_GI_number: 19703735 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Fusobacterium nucleatum # 1 350 1 350 350 635 98.0 0 MNILMALSQLEITGAEVYATTIADELIERGNKVYIVSDTLTTPTKAEYIKLEFNKRSLLK RIEHIKFLYNFLKEKDIQIVHAHSRASSWSCQVACKLAGIPLITTTHGRQPIHFSRKLIK AFGDYSIAVCENIKKHMVNDIGFSENKVSVILNPVNYKKLDLEKKVNDKKVISIVGRLSG PKGDVAYDLLEILSQDELLSKYKVRLIGGKELPERFLKFKEKYIEFIGYVPNIQEKIFES DIVIGAGRVAFESLLNKTALIAVGETEYIGFVNKGNLDKSLASNFGDIGSMKYPKIEKEI LLNDIKKALNLSENEKEELKDIIFKETNLQNIVDKIEKKYFELYVNKKKYDIPVIMYHRV IDNPENEGVHGTYIYENIFREHMKYLKDKNYTVITFKDLDKIGWRNRFEKDKKYVFITFD DGYKDNYDLAFPILKEFGFKATIFLMGSSTYNEWDVKASGEKEFPLMSVEMIKEMQDYGI EFGAHTFNHPKLNTLSNEEIEHQIVDVKKPLEEKIGKKIITFAYPYGILNDYAKEMIKKA GYTFALATDSGSVCLSNDLYQIRRIAIFPNTNLFSFKRKVAGNYNFIKIKREEKNRSK >gi|296154863|gb|ADVK01000021.1| GENE 62 50922 - 51815 1073 297 aa, chain - ## HITS:1 COG:FN0392 KEGG:ns NR:ns ## COG: FN0392 COG1032 # Protein_GI_number: 19703734 # Func_class: C Energy production and conversion # Function: Fe-S oxidoreductase # Organism: Fusobacterium nucleatum # 1 297 1 297 297 550 98.0 1e-156 MYDLYDFPLYRPPSEAYSLIIQITLGCSHNRCSFCSMYKDKKFIIKPIEDIKSDIDAFRA LYKNRAVEKIFLADGDALIVPTDILIQVLDYIKEVFPECKRVSIYGTAIAIHQKSVEDLK KLYEKGLTLVYLGVESGDDDALKFIKKGIKAEKIVELSKKIMSTGIDLSITLIAGLLGKY QDNKMHAINTAKIITDISPKYVSILNLRLYEGTELYNLMQEGKYDYMEGIEVLKEMKLVL SSMDTSKITRPIIFRANHASNYLNLKGNLPDDIVRMIKEIDYAIENEAINVNNYRFL >gi|296154863|gb|ADVK01000021.1| GENE 63 52055 - 52858 1150 
267 aa, chain - ## HITS:1 COG:FN0391 KEGG:ns NR:ns ## COG: FN0391 COG0561 # Protein_GI_number: 19703733 # Func_class: R General function prediction only # Function: Predicted hydrolases of the HAD superfamily # Organism: Fusobacterium nucleatum # 1 267 1 267 267 491 98.0 1e-139 MRYKLVVCDMDGTLLTSNHKISDHTADVIKKIEDNGIKFMIATGRPYLDARYYRDTLKLK SFLITSNGARAHDEDNNPIVIENIPKEFVKRLLAYNVGKDIHRNIYLNDDWIIEYEIEGL VEFHKESGYRFNIDNLNKYENEEAAKVFFLGKDEDIENLEKNMEKEFQNDLSITISSPFC LEFMKKGVNKAETLKKVLKLLNIEPEEVIAFGDSMNDYEMLSLVGKPFIMGNANQRLIDA LPNVEVVGNNNEDGIGKKLIEIFNIIL >gi|296154863|gb|ADVK01000021.1| GENE 64 53053 - 53454 497 133 aa, chain + ## HITS:1 COG:FN0390 KEGG:ns NR:ns ## COG: FN0390 COG5015 # Protein_GI_number: 19703732 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 133 1 133 133 229 100.0 8e-61 MIDFSKFLKENLNGIFTTVENGKPKSRAFQFLFSDGKKVYFCTENNKAVYKQIKENPNVS FCTHKTDFSYVLSISGKATFVNDINLKTRALNEYPMLKEIFKTSDNPVFELFYVDVEEVD TFDFVNGSKKEKI >gi|296154863|gb|ADVK01000021.1| GENE 65 53509 - 53934 668 141 aa, chain - ## HITS:1 COG:no KEGG:FN0389 NR:ns ## KEGG: FN0389 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 9 141 1 133 133 208 100.0 5e-53 MKKILFTMMILFGISAFSSNSYETDLVGRMKVLEEKLQTKIDSGVSVDMSNIITELSEGW ENELNTVYSLLMEKLPKEEKIKLENEQKEWLKNRNIKAKKEAKQVEGGTLQPVLLEGSVR AQNKERAIELAKRYDKLVNKN >gi|296154863|gb|ADVK01000021.1| GENE 66 54124 - 54921 1117 265 aa, chain + ## HITS:1 COG:FN0388 KEGG:ns NR:ns ## COG: FN0388 COG3315 # Protein_GI_number: 19703730 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: O-Methyltransferase involved in polyketide biosynthesis # Organism: Fusobacterium nucleatum # 1 265 5 269 269 491 100.0 1e-139 MKIKLDGVAETLLITLNARAKDYKSPKSVLHDKKSFEIASQLDYDFKKFDTAWASYYGVL ARAYIMDEEVKKFIEKYPDCTIVSIGCGLDTRFERVDNGKITWYNLDLPEVMENRKLLFK ENDRVKNISKSVFENDWTKEVVTDGKELLIVSEGVLMFFNEDEVKKILEILVNNFDKFEL 
HLDLLYKGTIKMSAKHDTLKKMNDVKFKWGVKDGSEIVKLEPKLKQIGLINFTKKMGKIL PLSKKIFIPIFWLMNNRLGIYTYNK >gi|296154863|gb|ADVK01000021.1| GENE 67 54955 - 61599 8685 2214 aa, chain - ## HITS:1 COG:no KEGG:FN0387 NR:ns ## KEGG: FN0387 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 491 2214 1 1724 1724 2911 100.0 0 MKNNLEQIERHLRSIAKRYKSVKYSLSLVIVFLMVGINVFSEELMTQEAVSRESIQNSVG NLQTKINDLKSQNDKQLEGLRLELIQLMEQGNQVVKSPWSSWQFGINYMYSKWNGAYKGR GDKKNWPKDLREYFTDPLKRFVKNDSVTSSTYGLTTLNILYEPPAEITVSAGIRPKNVNK KESKFEPKEPAGALPTFEPRLIGTPGKPVAPATPSVNVFDTPDIPANGQSFAQRPIIGKR TVDEYDNQISPRFNNAVVAQNYDEYTPEPDTPNGNIDVNMGETTTWGGGKIRLKSTVPEG RPFENGATETKDVNGAPRTMPKDPFISKPSGTYYLNPGKVETRDVWDIEGKKKDTVNAFI SDSRDHDTTINGNYTVKNLGSGPTTKLFFSYNPSGVGGKKYDNKASWGWSDGTEQAVNRV AKFTGNLTLKGVEDPNNKTALVGLEHQLWAKSKTSTHKDDKDEWNNSNSNSTLLNTGNIT LASGNYLVGMMIDVEYSNDKNHKNHKTINKGTININSKNSVGIDFAKYELGLLQTDVSLG NINVNGSNNYGFRMAKLFDGETKTYQGADNGSNLTVDGNKYYDNTTITGEGGKIVVGGKE NVGVSISKGVSADKSANPISNIKSLNIEVVGENVVGFLRNKDFSINNTGNIVLDDTTVQS LTFGNGAKNSTLVRSDNGTIDIKKNLNATKGSTGNSFSQATSGGRVVNHSILKSTLSHFT GMISYGRGNLRSSTAINKGTIELTGDSDSNIGMAALNSGEIRNENGTIKVLGKGKNKAAI YTDQTSSATLNGGNYFISGESSSGIYNQGSTILGDNNKIFASNGAVGIYSSGGTIDGSHA ANGVDIEVDDGDSEEKGLAVYGENGTDIKLQNSKINVKKGIAGVAAFGANTKINLNGATL KYSGSGYAAYATGGGKIDLSNSRIELRGRATGFEMAGSGTSPITLNSNTRIHVYSNDVTV MNIKDMASLNYSNLNSTAFNSYLQGATVHAENGATDYKLAAVDGIGAYNINSSLDKKLAV NPANKNTNDYIFTRLLSVQRARMNLKSGNNVRAVLSSSELAAIKEKTVVGLAMNSSKNAV SNTETAINLENNTTVTADRTDAGDGAVGLFVNYGTVNVASGATINVEKENNVVNEKAVGI YAVNGTEVDNKGTINVGGKNSIGIFGIAYRTDNAGNKKIDEFGSNAVGQGKVNIKNQGTL SLNGEGASGILVKNNKGTAASLGEHKALNTGTINMSGNKAIGMFAEKGYLKNEGTINITG SQQGIGMYGHLGSILENGTNGKINVADSNDENKLNIGMFTDDINTKIMNAGKISVGKNSY GIYGKNITTTATSKIKTGDNGVGIFSSTKDSNTTLDLAPGSEITVGNNNAVGVFSLGNKA AHITSFSKMNIGNGSFGYVIRSKGSTLNSNYAGETELKQDGTYIYSTDEDGTITNKTKLK SSGNKNYGIYASGNVKNLADINFATGYGNVGLYSTSNNSNKGITNGEAGSSGIKPKITVG ASKIRDTNGNLLKEKDRLYSIGMAAGYSWTEEDLKKPEAQRPKQFIGRIVNYGTISVTGD DGIGMYAVGRGSRAINHGLIDLSGKNSIGMYLDQGAIGENYGTIRTAPNNTKDGIIGVVA 
LNGSVIKNYGTISISGAGNTGIYKAQGGNNEGKKPEVSNGATDISSKATADTSKKLGTSQ ILSPPGATNAVIKEKGKVLTPDYVDRPTPNAPNVRAGSTILNLKQLQKFNGNSRARASEL GMYIDTSGIKFTKPIEGLEKLTNLKRINLIFGVEATKYSNNTAIQIGSNILEPYNKVIEK LSRSGSGKKWILNSSSLTWMATATQNPSNGTLGNVYMKKIPYTTFAVKGDTDTYNFLDGL EQRYGVEGLNSREKELFNKLNDIGKGEAQLFVQAVDEMKGHQYANTQQRVYATGQILDTE FNYLREEWATASKDSNKIKAFGARGEYKTNTAGVIDYKNYAYGVAYIHENESVKLGKDIG WYTGFVHNTFRFEDIGKSKEEMLLGKIGMFKSIPFDDDNSLNWTVSGNVFVGRNKMHRKF LIVDEIFNAKSKYYAYGIGVKNEIGKEFRLSEDFSIRPYGALKLEYGRISKIKEKTGEIR LEVKSNDYVSIKPEIGTELKYKYLFTNRKTLTVGLGVAYENELGKVANPKNKARVAYTAA DWYNLRGEKEDRRGNIKTDLTIGLENTRFGATANVGYDTKGHNVRTGLGLRVIF >gi|296154863|gb|ADVK01000021.1| GENE 68 61669 - 62622 854 317 aa, chain - ## HITS:1 COG:FN0386 KEGG:ns NR:ns ## COG: FN0386 COG2342 # Protein_GI_number: 19703728 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase # Organism: Fusobacterium nucleatum # 66 308 1 243 254 445 99.0 1e-125 MKKYLILCCYFILSSFIFSQEIYRDRMRDFIRELRKNTSRDKIFITQNGNALYFRDGKID EEFFSVTDGTTQESLFYGDELKFNTLTSPKLKKELLDMLIPIRQAGKVVLTINYGKGEKA KKYVESESKKIDLVAELLPSFEAKEIYQPMEGFNQNNIYSLKDAKNFLCLLNPEKFKTLE QYKSSLENVDFDILLIEPSINGEFFSREQIESLKKKSSGARRLVIAYFSIGEAENYRHYW KKSWNKKHPDWIAEENSNWNGNYRVKYWSSEWKSIIKDYQKKLDEIGVDGYLLDTVDTYY YFEDKDEAKQKAKTKKK >gi|296154863|gb|ADVK01000021.1| GENE 69 62632 - 63111 493 159 aa, chain - ## HITS:1 COG:no KEGG:FN0385 NR:ns ## KEGG: FN0385 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 159 148 306 306 279 99.0 2e-74 MGKGGTYIEFETDINKVFGRKKHGYSLLASLKGTSNIGYGVQFSNVLEYEYLNYNQYKNA HKTKWETVLRWTYEINENIAFSPEVTFKVEKYNNSKENYLIESSAGPYVLFTKNINDDLR IYGKVGVPVFRKDESKAEGYRYSKSQTSAYGKIGFEYIF >gi|296154863|gb|ADVK01000021.1| GENE 70 63370 - 63582 162 70 aa, chain - ## HITS:1 COG:no KEGG:FN0385 NR:ns ## KEGG: FN0385 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 11 70 1 60 306 115 98.0 4e-25 
MKKIIKILFYLFCISTLSFAEEDIENTRDRGIDKMNFYIPVSKQNKYSNFAEFDLRKDKN IYKWTMADGY >gi|296154863|gb|ADVK01000021.1| GENE 71 63583 - 65901 1920 772 aa, chain - ## HITS:1 COG:FN0384 KEGG:ns NR:ns ## COG: FN0384 COG4267 # Protein_GI_number: 19703726 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 772 1 772 772 1372 100.0 0 MAGIGFELKKLFSAEEELPFANLRAIIFSIIVSVGPWLITATSLNIIIWISNQIELARPK QLIFMSSIFYCFIFSQILTCIFQYIITRYVSDCVFKKKISKIRGAYFGSIKLVAILAFFI SFIFIKNGDLSIPYKASFVFLFVFMSLSWISMIFISLLKKYRFLIFSFFFGNFISMALGF YFLKYPVTFFEEEPIFWMLLSYGIGIFINFILTSSYILRAFKGKSENNFEFLTYLKGYFS LVLIGFFYSVGVWGHVFMNWIVGDSYRIAGVFQVSPLYEVAIFYCYCISIPSIVYFAIFL ETKFLPVYKEYYKKICKTGTYSEIENSLSKMKQTLYQEILYGMELQFLISLTCVLLANAV FTYFDMDIYLLDLFRVSVFSTYCATFVSILITLYLYFDLRIHGICIAFFLLFSNFFFTYI FGRLGRQYTGVGFFIASFLTFGIAIFVFPKVFRNLNYSTMFWQNFEYKVGGNFVKNITKL FNKKVYLGIILLFLLLFGGCASYYSKNGFNKNTKHNWHTMGVYGKDGLDSEGYAANGFNQ QGFNRKRMNQSTKTAYDFNGFDYKGIHKETKKAYDERGFNAKSYNVFTNSLYDKDGFNHE GIHKVTKKPYNENGWDVYGINEKTKTEYDENGWDINGINKRSFNRDGWNIETKSKYDYAG FDFEGIHKDTKKTYDERGFDVNLNNVFTNSPYDKNGFNYEGIHKVTGKEYDENGWNYYGL HEKTKTYYNPQGYNVDGLDKDGYEKGKRPPGLEDEWMDKNGFSKKGIYIKGY >gi|296154863|gb|ADVK01000021.1| GENE 72 65903 - 67324 1484 473 aa, chain - ## HITS:1 COG:FN0383 KEGG:ns NR:ns ## COG: FN0383 COG0438 # Protein_GI_number: 19703725 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Fusobacterium nucleatum # 1 473 1 473 473 883 99.0 0 MATICFVCEGAYPYVVGGVSAWVHELITSNPQHDFKILCIIPDEKFAKLKYQIPKNVIEI KNILMDSSLNFSYSSFIKAGLQENEEKKDSVKELIHFQIDGNADEKLNIIEKLFSKEMGS PLEIILSQEYWDVLLKQYKDYHEKGNFSIYYWTYRNIILNLLKIGQEDIPKADIYHCVTT GYAGFIGCLVAHRKMGKFLLTEHGIYPREREEEILGANWIDADFKNIWIDYFYYLSKLAY QYSDKIISLFEYTRSIQIHYGADEKKSKVIPNGVDEQKYGDIVRKKREGFHIGSVLRVVP IKDIKMMIKGFKIAEKNMPDATLWLIGPTDEDEEYYEECLELVKNLELEEKVIFTGRANV LEYYSFLDLLILTSISEGQPLSILEGLASGIPFIATDVGNCREILLEKKDIGEAGLIIPP TSYTDLAEAFLKLYKNTEKLNEFSENGKKIIKKYYTKESFIGQYRKLYEELGD >gi|296154863|gb|ADVK01000021.1| GENE 73 
67328 - 68254 643 308 aa, chain - ## HITS:1 COG:no KEGG:FN0382 NR:ns ## KEGG: FN0382 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 14 308 1 295 295 483 99.0 1e-135 MNKNSKLDILIIILFVIEIFLVKYFYQTEELLHFRSLFWGFHGLVTLIFLGISYIFLHED IGYYDVFLMLFPLIGISLLLLERIFQKWKVSDAVIDELLSSGEQEEKKEQKFVPEEFEIM SYYDLLSSDNIDEKKTFLFSFQPKEIDLKIKILKKALLDKNIDVIHYAATELNKIETELQ NKISELEKKGDREELYKTYKTYINSGLLYDSILEFYLRKAAELLGSLDNNSINKEEELLS LCKLGNEKEEYEDILKKRVKRNEEKEDIQDYCYFLYQENRFEDLLKILKKYKSKDIEIPY CFQQYVKE >gi|296154863|gb|ADVK01000021.1| GENE 74 68247 - 70103 1678 618 aa, chain - ## HITS:1 COG:FN0380 KEGG:ns NR:ns ## COG: FN0380 COG4878 # Protein_GI_number: 19703722 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 303 1 303 303 563 100.0 1e-160 MKFYKKEGTNKFIYFFIAIFIIVLILQINRVIDKGGFFSLEQSFSFDKKSAKNTTFTFDK PQKMLVFYNKKSSQSKDILKNLEEAFIFNKVNYTLADIGDIVSITGYDTFIFATDSFIGL QKSTFEGVKQATSNGKNLIFLNTSEYNPFNSISGIQKTGKVIEKSSEIHFTHKLFPGLDQ HSPSLEMVVHPSFKMDLDADCKVLAWSKESTPLLWEKKYGKGRILYTNASFFADKITRGL MNQWVSYGNDWYITPFLNAKLMHIDDFPAPIPRTINKVIQDEYQMSTRDFYKQIWWKDML EIAKQRNLIYSGFIIIDYNDAVNKEDMKEISQITLEDLDIEGRELFLHGGEIGIHGYNHN PLVFDGDIDFPALSYHPWRSEEDMAAGMNQLLKYVKKMFGKKIKLYVYVPPSNILKEEGK AALVKNYPDLNTISSIFYGDERGSYASEIGRDKTIPKLFNFPRFSSGFYYDKDDMWSLFN AIAIYGYWAHFVHPDDVISNDRGKDKTWNELKKEFDKLIGEVEANHPYLEPIRASELTQR YINIEDLKIQSEKRDNKIYVGMENYREPFYMTIRIRNNSIKNISSGTFKEIYDTEDSKIY LLQVETPDLIITLGEENE >gi|296154863|gb|ADVK01000021.1| GENE 75 70075 - 70962 755 295 aa, chain - ## HITS:1 COG:no KEGG:FN0379 NR:ns ## KEGG: FN0379 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 295 1 295 295 480 98.0 1e-134 MKTIFRCLLESIVYTILVLLIFYFLPYTKKEFLILNLHPLEVVVAFMSLRYGTYLGILSS FIAIFGYIFAYLHSGNDMILFLLKFQYYKFFLMFLFTAMVLGKFKLNYKNREEDLKKGFE HLENNFQEEKEKNQQLLDINISLKNQIIRSGGSIVSFHNLKNGLLQLKKEELYEKVLEIF RQLLACEVCSIYTLVDNKLIRRFEMGKSKMEKEILLNSEAGKRFLEVSKKNISLNFPFDI 
AGKQPIFIGPLYNEKNITGFLEIESFSYTTGEKYNFELFKILMEEINEILQKRGD >gi|296154863|gb|ADVK01000021.1| GENE 76 71313 - 72287 1195 324 aa, chain + ## HITS:1 COG:FN0378 KEGG:ns NR:ns ## COG: FN0378 COG1087 # Protein_GI_number: 19703720 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-glucose 4-epimerase # Organism: Fusobacterium nucleatum # 1 324 1 324 324 652 100.0 0 MQTILVTGGAGYIGSHAVVELLDNNYNVVVIDTLENGFKEFVDKRAKFYQGNVQDYELMS RIFQENKIEAVMHFAGYIRVPESVDDPNKYYLNNTYTTMCLIQSMVKHNIKNIIFSSTAA VYGEITEDNPIDEKHSTIPINPYGASKLMSERIIRDCAKAYGLNYSIFRYFNVAGAHEKY PIGQKGAGVTSLITLTLQAAKDSNRILEVFGDDFPTKDGTGIRDYIHVVDLVKAHVLSLK LLFKNESNIFNLGNGNGFSVLETVEAARKVTNKEIICKIAARRKGDPACVIASSEKAKKI LGWKAQYTNVEKIIETGWHFVEKQ >gi|296154863|gb|ADVK01000021.1| GENE 77 72343 - 73989 1694 548 aa, chain - ## HITS:1 COG:FN0377 KEGG:ns NR:ns ## COG: FN0377 COG1178 # Protein_GI_number: 19703719 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+ transport system, permease component # Organism: Fusobacterium nucleatum # 1 548 3 550 550 872 99.0 0 MLSKKKDIWIVISLCVLAFYIIFMIYPLGILFKNAVIENNGNFTFAYFSKFLSKNYYFST IFNSFKVSLAATALTLIIGTPLAYFYNMYKIKGKTFLQITIILCSMSAPFIGAYSWILLL GRNGLITNAIKNLTGFNVPSIYGFGGILLVLCLQLYPLVFLYVSGALRNIDNSLLEASEN MGCTGTKRFFKIIIPLCIPTILAAALMVFMRAFADFGTPLFIGEGYRTFPVEIYNQFMNE TGSDKNFASAVSIIAIIITSLIFLLQRYINGKYKFTMNALHPIEAKEVKGIKSVLIHLYC YLVVFISYAPQLYVIYTSFQNTSGKLFTKGYSLKSYTEAFSKVGNAIQNTFFIGGLALIL IIVISILIAYLVVRRNNFVNRTIDTLSMVPYVIPGSVVGIALVSAFNKKPFVLVGTFLIM VISLIIRRNAYTIRSSVAILQQIPISIEEAAISLGASRMKSFFKITTPMMINGIISGALL SWITIITELSSSIILYNYKTITLTLQIYVYVSRGSYGIAAAMSTILTMMTVVSLLIFMKV SKNKNVMM >gi|296154863|gb|ADVK01000021.1| GENE 78 73979 - 75094 1500 371 aa, chain - ## HITS:1 COG:FN0376 KEGG:ns NR:ns ## COG: FN0376 COG3842 # Protein_GI_number: 19703718 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport systems, ATPase components # Organism: Fusobacterium nucleatum # 1 371 1 371 371 695 100.0 0 
MSVNIKIENAQKRYGDNIIIENLSLDIKQGEFFTLLGPSGCGKTTLLRMIAGFNSIEKGN FYFNEKRINDLDPAKRNIGMVFQNYAIFPHLTVEQNVEFGLKNRKVSKEEMKIEIDKFLK LMQIDEYKDRMPERLSGGQQQRVALARALVIKPDVLLMDEPLSNLDAKLRVEMRTAIKEI QNSIGITTVYVTHDQEEAMAVSDRIAVMKDGEIQHLGQPKDIYQRPANLFVATFIGKTNV LKGNLNGSILKVAGKYEVSLNNIKDKNIKGNVVISIRPEEFVIDENQTKDGMRAFIDSSV FLGLNTHYFAHLESGEKIEIVQESKIDSIIPKGAEVYLKVKQDKINVFTEDGSRNILEGV NNDAIGVAYVK >gi|296154863|gb|ADVK01000021.1| GENE 79 75108 - 76175 269 355 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|167854980|ref|ZP_02477755.1| 50S ribosomal protein L13 [Haemophilus parasuis 29755] # 1 302 3 287 346 108 27 2e-22 KEKMKKFIKFLLMSMGMIFMLVACGGDKEKTEATPETQGSNELVIYSPNADDEVNKIIPA FEEATGIKVILQSMGSGDVLARISAEKENPQADINWGAISMGVLATTPDLWESYTSENEK NVPDAYKNTTGFFTNYKLDGSAALLVNKDVFKKLGLDPEKFNGYKDLLWPELKGKIAMGD PTASSSAIAELTNMLLVMGEKPYDEKAWEFIEKFIGQLDGTILSSSSQIYKATADGEYAV GVTYENPAVTLLQDGATNLKLVYPEEGSVWLPGAAAIVKNAPHMENAKKFIDFLISDEGQ KVVAETSTRPVNTSIKNTSEFIRPFDEIKVAYEDIPYCAEHRKEWQERWTNILTK >gi|296154863|gb|ADVK01000021.1| GENE 80 76271 - 78805 2758 844 aa, chain - ## HITS:1 COG:FN0374 KEGG:ns NR:ns ## COG: FN0374 COG0608 # Protein_GI_number: 19703716 # Func_class: L Replication, recombination and repair # Function: Single-stranded DNA-specific exonuclease # Organism: Fusobacterium nucleatum # 1 844 1 844 844 1516 99.0 0 MKKNTKWILENKTNYEKIFEDKREKKLDFIIEDLIENRNLSLDTNFDFNPFDLKDMDIAT QRIFEAIKNNQKIYIYGDYDVDGITSVSLLYLAFSELGANVNYYIPLRDEGYGLNKEAIQ ILKNENADLVISVDCGINSIEEINFANELNLDFIITDHHEITGGIPKALAVINPKREENI YSFKYLAGVGTAFMLIYALYTQMNKLNDLEKYLDIVAIGTVADIVPLVSDNRKFVKKGLE TLKNTKWIGIKQLLRKIFPDNWDTKEYFAYDIGYIIAPIFNAAGRLEDAKQAVSLFVEED GFKCLSIIDKLLENNTERKDIQKKILEMSIAEIEKKQLYNKNLILVANKAFHHGVIGIVA SKILDKYYKPTIIMEIKESEGVATASCRSIGDLNIVECLNSVSDILVKYGGHSGAAGFTI KIENIEEFYVRIDRYVEENFDKDLFIKKLKIEKILAPYKVNYEFLKELEILEPYGAKNHT PVFAFRNCQYENLRFTKNSTEHLMLDIKKDGYYFKNCIFFGGGDYYDIISSSKEIDIAFK LKLETFKDRYMCKLQLEDVKNSIENSKFEDDYLELNGKDISFPIETVVYPKRLDIEEPLN LIFNDYGVAITKDRTIIENIDNNLAKILTILKNKFYYEFSVKIKKKYLKTENINLHLEID 
TLKNETLKSFPLKDALIFKEIKNLLIRDFEYNSIQKKVLASIFKDKKKTLVVMDRKRGAT TIIDTIKCYCKYRNLSFSINNEKEKADFYIFENFTEMEEINSLITNNILVISNEDMEVSG FNKIIDDYTIPNNIKCVDYHDISILKRNNSLYYPFLTDEEKNNILELIDKNKDVFSTREI IVHF >gi|296154863|gb|ADVK01000021.1| GENE 81 78929 - 79696 814 255 aa, chain - ## HITS:1 COG:no KEGG:FN0371 NR:ns ## KEGG: FN0371 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 255 1 255 255 411 100.0 1e-113 MKKRFLVLSFLLFVICSFNLLAISFSEKANKVEEFIPKGWKTLIIKKGDLNKDKIDDIVL VIEKNDPKNIKKSESTYEASVVHNFNPRIILVLFKDKDSQYNLVAKNEDGFIVSEGRSYE EGFEKLASPNNDKLSDSIAIKNNTLHIYTYFEATRSSNSTEYIFRYQNNRFELIGLEVNN NGASGGYLESSNYSFNFSTKKLKKYVSREDMVNEEKAKEEKTEKDIDVENKYILDTMTEN TLEEILTEYIYKYYN >gi|296154863|gb|ADVK01000021.1| GENE 82 79907 - 80821 1075 304 aa, chain + ## HITS:1 COG:FN0370 KEGG:ns NR:ns ## COG: FN0370 COG0681 # Protein_GI_number: 19703712 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Signal peptidase I # Organism: Fusobacterium nucleatum # 1 282 1 282 286 527 100.0 1e-149 MKTIFYGVFYFFLTVFFIYIFVKEKDLAKKFDAHRENFVNKIIEKYNIKNENNIKYFKKS LYYIETLGTALILVVIIQRFYIGNFKIPTGSMIPTIEVGDRVFADMVSYKFTTPKRNSII VFKEPIQDKVLYTKRAMGLPGERIKIEEDVLYINGEKTDFRRYSNLGIGDKEWKIPQKND KLQIIPAGNYNEAYKSVSFDIAEVQKKLKNNSSLIYELMPNLKFVVNGEETGPILDFIHD KDILDKLMRGETIEITLKDNYYLALGDNTDNSFDSRYWGFVKESRIRGRALVRFWPLNRM GLVK >gi|296154863|gb|ADVK01000021.1| GENE 83 80842 - 81273 187 143 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|228002792|ref|ZP_04049785.1| (SSU ribosomal protein S18P)-alanine acetyltransferase [Anaerococcus prevotii DSM 20548] # 1 137 1 141 146 76 34 5e-13 MIKKLEINDTDYIDQIFNLEKEIFKNSAFNRTYLDTLIKGDNSFIYVYLIDSKVCGYLIV LDSIDIYEILAIATIEECRNKDIAQELLNKIKTKDIFLEVRESNQPAINFYKKNKFKEIS IRKNYYSKPNENAIIMKLEVNNE >gi|296154863|gb|ADVK01000021.1| GENE 84 81266 - 82699 1973 477 aa, chain + ## HITS:1 COG:FN0368 KEGG:ns NR:ns ## COG: FN0368 COG0015 # Protein_GI_number: 19703710 # Func_class: F Nucleotide transport and metabolism # Function: 
Adenylosuccinate lyase # Organism: Fusobacterium nucleatum # 1 477 1 477 477 918 99.0 0 MSNEIYSNPLCERYSSKEMMHNFSPDKKFSTWRKLWVALAESEKELGLDISQEQIDEMKK NIYNIDYELAAKKEKEFRHDVMAHVHTFGTQAPLAMPIIHLGATSAFVGDNTDLIQIKDG LQIIKTKIINVMSNLSKFALDNKSIATLGFTHFQAAQLTTVGKRATLWLQSLMLDLEELE FREKTLRFRGVKGTTGTQASFKDLFNGYFSKVEELDILVSKKMGFDKRFAVTGQTYDRKV DSEIMNLLANIAQSAHKFTNDLRLLQHLKEVEEPFEKTQIGSSAMAYKRNPMRSERISSL AKFVIALQQSTAMVASTQWFERTLDDSANKRLSLPQAFLAVDAILIIWNNIMEGLVVYDK IIEKHIMSELPFMATEYIIMECVKTGGDRQELHERIRVHSMEAGKQVKVEGKDNDLIDRI VNDNYFKLDKAKLLSILEPKNFIGFAPEQTEKFINTEIKPILEKYKALIGMDSELKV >gi|296154863|gb|ADVK01000021.1| GENE 85 82836 - 83354 420 172 aa, chain + ## HITS:1 COG:FN0367 KEGG:ns NR:ns ## COG: FN0367 COG4769 # Protein_GI_number: 19703709 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 172 1 172 172 222 100.0 2e-58 MIKKEHREEIYLIALVLLGLYLSLIENIIPKPFPWMKIGLSNISVLIAFEKFNSKMALQT ILLRVFIQALMLGTLFTPNFIISFSAGLISTLFMIFLYNFRKYLSLLSISCISAFTHNLL QLIVVYFLLFRNISLNSKSIIIFIVIFLGLGVIMGLITGIITTKINLKRNKI >gi|296154863|gb|ADVK01000021.1| GENE 86 83371 - 84729 2231 452 aa, chain + ## HITS:1 COG:FN0366 KEGG:ns NR:ns ## COG: FN0366 COG1109 # Protein_GI_number: 19703708 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphomannomutase # Organism: Fusobacterium nucleatum # 1 452 1 452 452 842 100.0 0 MGRYFGTDGIRGEANRELTVDKALRLGYALGYYLKNNNPNEEKIKVIMGSDTRISGYMLR SALTAGLTSMGIYIDFVGVIPTPGVAYITKQKKAKAGIMISASHNPAKDNGIKIFNLEGY KLSDEIENQIEDYMDNLDKILANPLAGDKVGKFKYAEDEYFQYKNYLTQCVKGNFKDIKI VLDTANGAAYRAAKDVFLDLRAELVVINDAPNGRNINVKCGSTHPDILSKVVVGYEADLG LAYDGDADRLIAVDKFGNVIDGDKIIGILALGMKNKGTLKNNKVVTTVMSNIGFEKYLKE NSIELLRANVGDRYVLEKMLAEDVVIGGEQSGHIILKDYATTGDGVLSSLKLVEVIRDTG KDLHELVSSIKDAPQTLINVKVDNIKKNTWDKNEIIMSFINEANKKYKDEVRILVRKSGT EPLIRVMTEGDDKQLVHKLAEDIAHLIEKELN >gi|296154863|gb|ADVK01000021.1| GENE 87 84821 - 85195 317 124 aa, chain + ## HITS:1 COG:no KEGG:FN0365 NR:ns ## KEGG: FN0365 # Name: not_defined # Def: ATP synthase 
protein I, sodium ion specific # Organism: F.nucleatum # Pathway: not_defined # 20 124 1 105 105 146 96.0 3e-34 MEEIKKLFKITIIVTIVTCLLGLIFQNKYLLFGISGGCAISVIALCMLSLDSKAIAYSKD VKIAKRIAYIGYAKRYFLHLLFLGTLLYFTNDFQLFLSGFIGTLNVKLSIYFMNILRKIK SLLK >gi|296154863|gb|ADVK01000021.1| GENE 88 85231 - 85980 895 249 aa, chain + ## HITS:1 COG:FN0364 KEGG:ns NR:ns ## COG: FN0364 COG0356 # Protein_GI_number: 19703706 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit a # Organism: Fusobacterium nucleatum # 32 249 1 218 218 345 99.0 4e-95 MILGPIEFTSGPLVSGPDIIFSIFGIPISSTVVTTWFVLLFFYLFFKLGTRNLQLIPGKF QSVLEGIYEFLDGTIGQILGVWKKKYYTFFASLFLFIFLSNIITFFPIPWFSIKNGIFTI YPAFRAPTADLNTTIGLALIVTTLFISINIKNNGVSGYLKGFADPTPVMLPLNVVGEFAK PLNISMRLFGNMFAGMVIMGLIYMAVPYFVPAALHLYFDLFAGLVQSFVFVTLSMVYVQG SIGDTEYTN >gi|296154863|gb|ADVK01000021.1| GENE 89 86034 - 86303 561 89 aa, chain + ## HITS:1 COG:FN0363 KEGG:ns NR:ns ## COG: FN0363 COG0636 # Protein_GI_number: 19703705 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K # Organism: Fusobacterium nucleatum # 1 89 1 89 89 126 100.0 9e-30 MDLLTAKTIVLGCSAVGAGLAMIAGLGPGIGEGYAAGKAVESVARQPEARGSIISTMILG QAVAESTGIYSLVIALILLYANPFLSKLG >gi|296154863|gb|ADVK01000021.1| GENE 90 86344 - 86835 736 163 aa, chain + ## HITS:1 COG:FN0362 KEGG:ns NR:ns ## COG: FN0362 COG0711 # Protein_GI_number: 19703704 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit b # Organism: Fusobacterium nucleatum # 1 163 1 163 163 210 100.0 7e-55 MPIISIDYTFFWQIINFFLLLFIVKKYFKEPISKIINERKQKIEAELVEATKNKKEAEQL LKDAEAQINASRKEATEIVKAAQRKAEEEAHNLIREARENRENILKTTELEITKIKNDAK EELGREVKNLAAELAEKIIKEKVDDAQEISLIDKFIAEVGEDK >gi|296154863|gb|ADVK01000021.1| GENE 91 86832 - 87365 723 177 aa, chain + ## HITS:1 COG:FN0361 KEGG:ns NR:ns ## COG: FN0361 COG0712 # Protein_GI_number: 19703703 # Func_class: C Energy production and conversion # Function: 
F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) # Organism: Fusobacterium nucleatum # 1 173 1 173 174 244 98.0 8e-65 MIKSQVGRRYSKAIFEIAEEKKQVKEIYEMLNSAMVLYRTNKEFKHFILNPLIDNEQKKS VLNEIFGKDNSENLNILLYILDKGRMSCIKYIVAEYLKIYYRKNKILDVKATFTKELTDE QKKKLIDKLSQKTGKEINLEIKIDKDILGGGIIKIGDKIIDGSIRRELDNWRKKLRS >gi|296154863|gb|ADVK01000021.1| GENE 92 87382 - 88884 2519 500 aa, chain + ## HITS:1 COG:FN0360 KEGG:ns NR:ns ## COG: FN0360 COG0056 # Protein_GI_number: 19703702 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, alpha subunit # Organism: Fusobacterium nucleatum # 1 500 1 500 500 956 99.0 0 MNIRPEEVSSIIKKEIDNYKKSLEIKTSGTVLEVGDGIARIFGLSNVMSGELLEFPHGVM GMALNLEEDNVGAVILGNASLIKEGDEVRATGKVVSVPAGEDLLGRVINALGDPIDGKGE IHVDKYMPIERKASGIIARQPVSEPLQTGIKSIDGMVPIGRGQRELIIGDRQTGKTAIAI DTIINQKGQDVKCIYVAIGQKRSTVAQIYKKLSDLGCMDYTIIVAATASEAAPLQYMAPY SGVAIGEYFMEKGEHVLIIYDDLSKHAVAYREMSLLLRRPPGREAYPGDVFYLHSRLLER AAKLSDELGGGSITALPIIETQAGDVSAYIPTNVISITDGQIFLESQLFNSGFRPAINAG ISVSRVGGAAQIKAMKQVASKVKLELAQYTELLTFAQFGSDLDKATKAQLERGHRIMEIL KQPQYHPFAVERQVVSFYIVINGHLDDIEVSKVRRLEKELLDYLKANTNILTEIADKKAL DKDLEEKLKESIANFKKSFN >gi|296154863|gb|ADVK01000021.1| GENE 93 88895 - 89743 1036 282 aa, chain + ## HITS:1 COG:FN0359 KEGG:ns NR:ns ## COG: FN0359 COG0224 # Protein_GI_number: 19703701 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, gamma subunit # Organism: Fusobacterium nucleatum # 1 282 1 282 282 488 98.0 1e-138 MPGMKEIKSRIKSVQSTRQITNAMEIVSTTKFKRYSKLVTESRPYEESMRKILGNIASRV KNEGHPLFDGRKEVKSIAIIITTSDRGLCGSFNSSTLKELEKLVEKNKNKNITIIPFGRK AIDFITKRNYEFSESFSKISPDEMNKIAGEISEEVVEKYNNHIYDEVYVIYNKFISALRY DLTCERIIPIARPEVELNSEYIFEPSTEYILSALLPRFINLQIYQAILNNTASEHSARKN SMSSATDNADEMIKTLNIKYNRNRQSAITQEITEIVGGASAL >gi|296154863|gb|ADVK01000021.1| GENE 94 89821 - 89895 97 24 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MILEALNELVHLELLTDSEFAANS >gi|296154863|gb|ADVK01000021.1| GENE 95 90064 - 91452 2068 462 aa, chain + ## 
HITS:1 COG:FN0358 KEGG:ns NR:ns ## COG: FN0358 COG0055 # Protein_GI_number: 19703700 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, beta subunit # Organism: Fusobacterium nucleatum # 1 462 1 462 462 875 99.0 0 MNKGTITQIISAVVDVAFKDELPAIYNALKVKLEDKELVLEVEQHLGNNVVRTVAMDSTD GLKRGMEVIDTGKPITIPVGKAVLGRILNVLGEPVDNQGPLNAETFLPIHREAPEFDDLE TETEIFETGIKVIDLLAPYIKGGKIGLFGGAGVGKTVLIMELINNIAKGHGGISVFAGVG ERTREGRDLYGEMTESGVITKTALVYGQMNEPPGARLRVALTGLTVAENFRDKDGQDVLL FIDNIFRFTQAGSEVSALLGRIPSAVGYQPNLATEMGALQERITSTKSGSITSVQAVYVP ADDLTDPAPATTFSHLDATTVLSRNIASLGIYPAVDPLDSTSKALSEDVVGKEHYEVARK VQEVLQRYKELQDIIAILGMDELSDEDKLTVSRARKIERFFSQPFSVAEQFTGMEGKYVP VKETIRGFREILEGKHDDIPEQAFLYVGTIEEAVAKSKDLAK >gi|296154863|gb|ADVK01000021.1| GENE 96 91462 - 91866 598 134 aa, chain + ## HITS:1 COG:FN0357 KEGG:ns NR:ns ## COG: FN0357 COG0355 # Protein_GI_number: 19703699 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, epsilon subunit (mitochondrial delta subunit) # Organism: Fusobacterium nucleatum # 1 134 1 134 134 223 99.0 9e-59 MPSFDVSVVTQVKKILEQEAGYLRLRTSEGDIGILPNHAPFVAELSMGKMEIESPNKDRR DIYFLSGGFLEISDNQATVIADEVFPIEKIDVESEQALVENFKKELEKVSTEEEKRKLQK KIKISLAKIDAKNN >gi|296154863|gb|ADVK01000021.1| GENE 97 91886 - 92257 479 123 aa, chain + ## HITS:1 COG:FN0356 KEGG:ns NR:ns ## COG: FN0356 COG0346 # Protein_GI_number: 19703698 # Func_class: E Amino acid transport and metabolism # Function: Lactoylglutathione lyase and related lyases # Organism: Fusobacterium nucleatum # 1 123 1 123 123 231 99.0 3e-61 MKFHFLHENFNVLDLEKSIKFYDEALGLKVVREKFAEDGSYKIVYLGDGITHFQLELTWL ADRTEKYDLGDEEFHLAFEVDNYDEAFKKHTEMGCVVFINEKMGIYFITDPDGYWLEILP PKK >gi|296154863|gb|ADVK01000021.1| GENE 98 92301 - 92438 102 45 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MTKFIKNVKIILEINHQERLRDRPVEISATYVKCVVLIPDRWKRL >gi|296154863|gb|ADVK01000021.1| GENE 99 92511 - 93662 2052 383 aa, chain + ## HITS:1 COG:FN0355 KEGG:ns NR:ns ## COG: FN0355 COG0192 # Protein_GI_number: 19703697 
# Func_class: H Coenzyme transport and metabolism # Function: S-adenosylmethionine synthetase # Organism: Fusobacterium nucleatum # 1 383 1 383 383 764 99.0 0 MKKFTYFTSEFVSPGHPDKISDQISDAILDACLKDDPNSRVACEVFCTTGLVVVGGEITT STYIDVQDIVRKKIDEIGYRPGMGFDSNCGTLSCIHAQSPDIAMGVDIGGAGDQGIMFGG AVRETEELMPLALVLSREILVKLTNMMKSNEIEWARPDQKSQVTLAYDENGKVDHVDSIV VSVQHDEDTTHDEIEKIVIEKVVKPVLEKYNLSSDNIKYYINPTGRFVIGGPHGDTGVTG RKIIVDTYGGYFRHGGGAFSGKDPSKVDRSAAYAARWVAKNIVAAELADKCEIQLSYAIG VPKPVSVKVDTFGTSKVDEDKISEAVSKVFDLSPRGIEKALELREGNFKYQDLAAFGHIG RTDIDTPWERLNKVDELKKAIEL >gi|296154863|gb|ADVK01000021.1| GENE 100 93722 - 94597 949 291 aa, chain - ## HITS:1 COG:FN0354 KEGG:ns NR:ns ## COG: FN0354 COG0697 # Protein_GI_number: 19703696 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 68 291 1 224 224 331 97.0 1e-90 MDLKEIYTKLTAKKCAFIAIFFWATAFVLTKVVLKEVDVTTLGVLRYFFASIIVIFILIK KKIPIPELKDIPAFIFAGFSGYAGYIALFNIATLLSSPSTLSVINALAPAITAIVAYFIF NERIKLIGWLSMGISFCGILILTLWDGVLTVNKGILYMLAGCLLLSLYNISQRYLTKKYS SFDVSMYSMLIGGILLVIYSPSSITNMFSISFTSLILIVYMSIFPSIISYFFWTKAFELA KHTTEVTSFMFVTPVLATLMGIIILGDIPKLSTLIGGVVIILGMILFNKTK >gi|296154863|gb|ADVK01000021.1| GENE 101 94768 - 96147 1657 459 aa, chain - ## HITS:1 COG:FN0352 KEGG:ns NR:ns ## COG: FN0352 COG1757 # Protein_GI_number: 19703695 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 1 459 1 459 459 684 96.0 0 MFDLVQHRKPEKLEAFIMIVIVFLMLGYPMLGIPNMVPHIPVLITIIFLLLYGVINKVKF SLLQESMIQSVSTSMGAVFLFFFIGILVSILMMSGAIPTLMFLGLNVISTKVFYLSSFLI TAIIGMAIGSSLTTVATLGVALMGMSNAFDLSPAITAGAIVSGSFFGDKMSPLSDTTGIA ASIVGVDLFDHIKNMMYTTVPAFIISSIAFGLLSPWNKVGNISTVEQFKIDILSTGLVNN LSLICFLILIIFSLFKVPAIMTIIYTSIVGLIISIVNNNYSLQEISTFLFGGFSKADLPQ NIAPLLNRGGINSMFFTLTIVILALSLGGLLFGLGIIPTLLDSMAHFLVSPSRATICVVL 
TALGVNYIVGEQYLSILLAGKTFKPIYDKLGLHSKNLSRTLEDAGTVVNPLVPWGVCGVF ITSVLGVSTLIYLPFAFFCYLCVILTIISGFTGISISKK >gi|296154863|gb|ADVK01000021.1| GENE 102 96431 - 96874 613 147 aa, chain - ## HITS:1 COG:no KEGG:FN0351 NR:ns ## KEGG: FN0351 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 4 147 1 144 144 245 99.0 4e-64 MKFMYLYFGVVIIFVIGYQIFMFTRANKRKKEMLEWLEKNPKAAKVYIKTNSSLLASMFT PSSIRLIAIDDDYPMTSFTEGFKQGFYLAPGKHKITSSFEKTRPGFFYKTVTTKYDSTTQ EVEAEAEKTYIYSFDKKNEQYTFTEMN >gi|296154863|gb|ADVK01000021.1| GENE 103 96898 - 97341 547 147 aa, chain - ## HITS:1 COG:no KEGG:FN0350 NR:ns ## KEGG: FN0350 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 9 147 1 139 139 252 98.0 3e-66 MEKVYKSRVYRVIFNFLCGTVLSIFVFFIAHIWLSSLTSIIIATVIFLFYIWLVIWGNFI TIIVNDKELIVKNGKKEDKYEFSKYYFRARTVSSSGDTECSLYAIDENGNETHIDCELIG IGQFRQLINDLKLDVGVNKINTIKKDK >gi|296154863|gb|ADVK01000021.1| GENE 104 97433 - 97888 798 151 aa, chain - ## HITS:1 COG:FN0349 KEGG:ns NR:ns ## COG: FN0349 COG1490 # Protein_GI_number: 19703692 # Func_class: J Translation, ribosomal structure and biogenesis # Function: D-Tyr-tRNAtyr deacylase # Organism: Fusobacterium nucleatum # 1 151 4 154 154 263 98.0 8e-71 MRTVIQRVKYAKVSVDGKVLGEIDKGLLVLLGITHEDSIKEVKWLVNKTKNLRIFEDEEE KMNLSLEDVKGKALIISQFTLYGNSIKGNRPSFIDAAKPDLAKDLYLKFIEELKSFDIET QEGEFGADMKVELLNDGPVTIIIDTKDANIK >gi|296154863|gb|ADVK01000021.1| GENE 105 98016 - 99521 1898 501 aa, chain + ## HITS:1 COG:FN0348 KEGG:ns NR:ns ## COG: FN0348 COG1488 # Protein_GI_number: 19703691 # Func_class: H Coenzyme transport and metabolism # Function: Nicotinic acid phosphoribosyltransferase # Organism: Fusobacterium nucleatum # 1 501 1 501 501 957 98.0 0 MNNDFILTEFARVINSDRYQYTESDIFLMENMQNKVAVFDMFFRKTEDGGFAVVSGVQEV IHLIEVLNTTSEEEKKKYFSKILEEEHLINFLSKMKFTGNLYAIQDGEIVYPNEPVITIK APLIQAKILETPILNIMNMNMAISTKASMVTRAAEPIKVLAFGSRRAHGFDSAVEGNKAA IIGGCYGHSNLVTEYKYGIPSNGTMSHSYIQAFGVGAEAEKEAFVTFIKHRRQRKNNSLI 
LLVDTYDTIHIGIENAIKAFKECGVNDSYEGIYGVRLDSGDLAYQSKKCRKRFDEEGFTK AKITLTNSLDEQLIRSLREQGACVDMYGVGDAIAVSKSYPCFGGVYKIVELDEEPLIKIS GDVIKISNPGFKEVYRIFDKEGFAYADLISLVKNDSDKEKLLNNDELMIRDEKYEFKNSI LKKDEYTYKKLTKIYIKDGIIDRTSYEDLFDIMKSQKHYFDSLAKVSPERKRLENPHSYK VDLSSDLINLKYGLINKIKNV >gi|296154863|gb|ADVK01000021.1| GENE 106 99514 - 100485 1000 323 aa, chain + ## HITS:1 COG:FN0347 KEGG:ns NR:ns ## COG: FN0347 COG0688 # Protein_GI_number: 19703690 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine decarboxylase # Organism: Fusobacterium nucleatum # 24 323 1 300 300 537 99.0 1e-152 MSKKKILIFLVIVCFIIYNKESEMDFEKIQYIERKTGEIKTEKVMGEGALKFLYYNPFGK LALHTVVKRKFLSDWYGRKMSKPESKEKIKSFVEEMEIDMSEYKRPIEDYTSFNDFFYRE LKDGARKIDYNENVIVSPADGKILAYQNIKEVDKFFVKGSKFTLEEFFNDKELAKKYEDG TFVIVRLAPADYHRFHFPVDGEISEIKKILGYYYSVSTHAIKTNFRIFCENKREYAILKT EKFGDIAMFDIGATMVGGIVQTYKTNSSVKKGEEKGYFLFGGSTCILVFEKNKVVIDKDI IENTQNKIETRIYMGEKFGNEKN >gi|296154863|gb|ADVK01000021.1| GENE 107 100466 - 100864 389 132 aa, chain + ## HITS:1 COG:FN0346 KEGG:ns NR:ns ## COG: FN0346 COG5341 # Protein_GI_number: 19703689 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 3 132 1 130 130 236 99.0 8e-63 MEMKKTKYFKIGDLVIYVFLIIFFSILILKIGSFKDVKGAKAEIWVDGELKYVYPLQKEE KNIFVETNLGGCNVQFKDNMVRVTTSNSPLKIAVKQGFIKSPGEVIIGIPDRLVVKVIGD SEDDSGIDFVAR >gi|296154863|gb|ADVK01000021.1| GENE 108 100887 - 101978 1260 363 aa, chain + ## HITS:1 COG:FN0345 KEGG:ns NR:ns ## COG: FN0345 COG0628 # Protein_GI_number: 19703688 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Fusobacterium nucleatum # 33 363 1 331 331 454 100.0 1e-127 MNLKSTLKIIGIILIFVILQSYFINPDNLVNIMDKWKDYFMTIIMSIFIAILLEPIVKYL KKKSKINDILAISLSIVFIIVIFIILSLIVIPEIISSIKTLNDIYPTISEKVIITGKNIA NYLAKKNIYTINMDEIDNYFTNFVSNNTTNITKLISSILGSLIDWTIGFTNLFLAFVLAF LILLDKKHLIKTLENMIKIIFGVKNTPYIMKKLSLSKDIFLGYVSGKIIVSTIVGLCVYI 
VLLITGTPYAALSAILLGIGNMIPYVGSIIGGIIAFFLILLVAPIKTIILLVAITISQLV DGFIVGPKIIGNKVGLNTFWIIVSMIIFGNLFGIVGMFLGAPIMSILKLFYVDLLKKAEQ GGE >gi|296154863|gb|ADVK01000021.1| GENE 109 101978 - 103759 2168 593 aa, chain + ## HITS:1 COG:FN0344 KEGG:ns NR:ns ## COG: FN0344 COG0116 # Protein_GI_number: 19703687 # Func_class: L Replication, recombination and repair # Function: Predicted N6-adenine-specific DNA methylase # Organism: Fusobacterium nucleatum # 1 378 1 378 379 681 98.0 0 MIFIASTTMGLESIVKDECLALGFKNIKVFDGRVEFEGDFKDLIKANIYLRCSDRVFIKM AEFKALTYEELFQNIKSINWQDFIDEDGEFPISWVSSVKSKLYSKSDIQRISKKAIVEKL KEKYKREIFLENGALYSIKIQCHKDIFIVMLDSSGESLTKRGYRAQKRVAPIKETLAAAL VYLSKWKADEVLLDPMCGTGTIAIEAAMIARNIASGANRNFASEKWSIIEKNLWTDIRDE AFSNEDLSKELKIYASDIDERSIEIAKENSEKAGVEDDIIFEVKDFKNIESPAKYGAMIV NPPYGERLMGDEDIEELYRDFGNFCKKKLAKWSYYIITSYEDFEKAFDKKATKNRKLYNG GIKCYYYQYFGDRKNGYKIKIEDFIKYAKEVCLQNLFLANNIKVDLKNQDNLYEVERIEK EVISAYENIYLSLDEEFLLNLYKENKKAFKQLEDTIEKMKKDTNLKDEYIKTKIKKREKL KGNSGAEVVEKFFKYKIKELKKIKGDLLQKLKKLLDKEEKLNLDLSNAIQEVEQLEIIDK LQPIRAEFRNLSIQLDRYQKELEETENKLLKKWYYEIYGTTNKEILLKAYNSQ >gi|296154863|gb|ADVK01000021.1| GENE 110 103813 - 104316 640 167 aa, chain + ## HITS:1 COG:FN0342 KEGG:ns NR:ns ## COG: FN0342 COG0652 # Protein_GI_number: 19703685 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family # Organism: Fusobacterium nucleatum # 1 167 1 167 167 296 100.0 1e-80 MSLQAIIKTNKGEINLNLFSDVAPVTVLNFITLAKTGYYNGLKFHRVIEDFMVQGGDPTG TGAGGPGYQFGDEFKEGVVFNKKGLLAMANAGPNTNGSQFFITHVPTEWLNYKHTIFGEV VSQKDQDVVDSIKQGDTMNEVTIIGDTDRLIEDNKEFYTQLKNFLKI >gi|296154863|gb|ADVK01000021.1| GENE 111 104357 - 105685 1696 442 aa, chain + ## HITS:1 COG:FN0341 KEGG:ns NR:ns ## COG: FN0341 COG2056 # Protein_GI_number: 19703684 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Fusobacterium nucleatum # 1 442 1 442 442 672 99.0 0 
MILLNPVVISVIVMTVLCLLKLNVLLALFISALVAGVTAGMPINDIMVGLITGMGGNGET ALSYILLGTLAVAINSTGVTSIVSKKIASVVNGKKKVLLLVIAFFACFSQNLIPVHIAFI PILIPPLLKLMNDLKLDRRAMACSLTFGLKTPYIALPVGFGAIFHSIIAGEMTNNGMTVA QGDVWKSTWILGLFMIIGLLLAIFVSYNKDREYKDLPLIGIEEVQADKMEAKHWLALLAA SAAFIIQALAATEVIKIDKGSLPLGAVVALLVMLIFGVIKWKNLDEFINGGVGLMGLIAF IMLVAAGYGNIIRQTGAINELVDSIHGLIGGSKAIGVSIMLLIGLLITMGIGTSFGTIPV VAAIYIPLCIKLGVSVSGSIVVLAAAAALGDAGSPASDSTLGPTSGLNADGQHDHIMDTC VPTFTHYNILLLIGGFIGGMFL >gi|296154863|gb|ADVK01000021.1| GENE 112 105790 - 107100 1695 436 aa, chain + ## HITS:1 COG:FN0340 KEGG:ns NR:ns ## COG: FN0340 COG3314 # Protein_GI_number: 19703683 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 436 1 436 436 746 98.0 0 MENKKYPTSVIIKFLACSLIGIFLFFVPITLNGKSTIPLDHIVNFVLKIPYFKEMYGTLV IIIGVFLPFYKKTWNKNTTSIVFSLLKILALPFLFMILFNNGPEFLMNKDVIPFIWNKIV IPVTTIVPVGSIFLSLIISYGLMEFVGVFMRPIMKPIWKTPGRSAIDAVASFVGSYSLAL LITNRVYKEGKYTNKEAVIIATGFSTVSATFMVIVAKTLDLMDNWNLYFWLTIIVTFLVT AITARIYPIRNKSDAYFENQKGDIEKDIPKDKFKVAFNEGMEVCANSGSILENVIINLKD GIILAFNIGPSLMAIGTLGIVLANHTPIFDWIGYLVYPFTLISGFEEPLLTAKALALGIT EMFLPAVLVTKLSFEVKMLVAITCVSEVLFFSASIPCMMATDIPISFKDYLIIWFERVVL SILVSIPLIYLVKVLM >gi|296154863|gb|ADVK01000021.1| GENE 113 107180 - 107464 286 94 aa, chain - ## HITS:1 COG:no KEGG:FN0339 NR:ns ## KEGG: FN0339 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 94 64 157 157 147 98.0 1e-34 MLVENAIDCDSEFGVFYITTEQIKEAYDVLFPLTKEDLLEKYNFSEMLEDEVYPIVEDDD EKEFFDYIYSYLLEIKEFYKKNIEKELAILFYIS >gi|296154863|gb|ADVK01000021.1| GENE 114 107534 - 107653 231 39 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MGMYGSYISLEKKELEKLLNGNKDFDDIESVETLDIDKS >gi|296154863|gb|ADVK01000021.1| GENE 115 107671 - 108123 593 150 aa, chain - ## HITS:1 COG:FN0338 KEGG:ns NR:ns ## COG: FN0338 COG3086 # Protein_GI_number: 19703681 # Func_class: T Signal transduction mechanisms # Function: Positive regulator of sigma E activity # Organism: 
Fusobacterium nucleatum # 37 150 1 114 114 217 100.0 6e-57 MVNKGIVTKINGDTVAVKLYKSSSCSHCSCCSETNKMGSNFEFKINQKVELGDLVTLEIS EKDVVKAAFIAYIFPPIFMILGYIVADQLGFSEMQSIFGSFLGLGVGFIFLALYDRFFAK KTIDEEIKVVAVEKYDPNACTNLAESCEDF >gi|296154863|gb|ADVK01000021.1| GENE 116 108282 - 108602 303 106 aa, chain + ## HITS:1 COG:no KEGG:FN0337 NR:ns ## KEGG: FN0337 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 106 7 112 112 151 100.0 9e-36 MKRFQNATMEYNASKNDLVIRDVNTNGIFFAIEFFENTKQIKRVFTLYPVSIEIQKYNIL ELKFSVQNENNNQSLLTLILELNQLVADKRSVINISNEDLNNITLN >gi|296154863|gb|ADVK01000021.1| GENE 117 108815 - 109990 1828 391 aa, chain + ## HITS:1 COG:no KEGG:FN0336 NR:ns ## KEGG: FN0336 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 152 391 1 240 240 477 100.0 1e-133 MLMIAMSFNINAAQTASFAETINNENIEVIATYDNELPQEIKKIYKPKHTGEGVSYFDYI FIKARVSNLREKPDPDSQIVGKYTYDSKLKLLEKIKYQGNLWYLAQDQNGVKGYIAASQT EKRNFRFQMALDKIHDLEDFINKSLNEGATLMSVNTYAPNPSNINPKRQKDKYGTSLDQN LLGISQKGEQIIIPDRSVVKIIENRGDKALVKALSIPETLEVSKAKLSTFPSIKKGFRKV IAIDIENQNFMVFEKSKQTNEWELISYVYTKTGIDSELGYETPKGFFTVPVVKYVMPYTD ETGQKAGTAKFAIRFCGGGYLHGTPINVQEEVNKEFFLRQKEFTLGTYTGTRKCVRTSEG HAKFLFDWLVGSANKDSNDQRLSEDAYFIVF >gi|296154863|gb|ADVK01000021.1| GENE 118 110033 - 110581 170 182 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163756109|ref|ZP_02163225.1| 30S ribosomal protein S1 [Kordia algicida OT-1] # 77 176 244 343 347 70 36 5e-11 MKKKLTAVLLLALLVTACSSTKTTKKTNVGVDSTNKYAVEDTEANKKPLEDIIVFNEEGV TIRREGNNLILSMPELILFDFDKYAVKDGIKPSLSTLAKALGENKDIHIKIDGYTDFIGT EAYNLDLSVKRARAIKDFLISKGAIGSNISIEGYGEQNPADTNQTAAGRSRNRRVEFIIS RG >gi|296154863|gb|ADVK01000021.1| GENE 119 110672 - 111919 1722 415 aa, chain + ## HITS:1 COG:FN0334 KEGG:ns NR:ns ## COG: FN0334 COG1448 # Protein_GI_number: 19703677 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Fusobacterium nucleatum # 1 415 1 415 415 785 99.0 0 
MLAKRYTGKKLVDNIFATSKKAKQAIVKFGKENVINATIGSLYNEDEKLAVYDVVESVYR NLPPEDLYAYATNVIGEDDYLEEVIKAVFFDDYKEALKELHIASIATTGGTGAISNTVKN YMDTGNKVLLPNWMWGTYKNIVIENGGKIETYQLFNENGDFNFEDFKNKVLELAKIQKNV VLILNEPSHNPTGFRMTYEEWVNLLDFFKSIKDTNIIVIRDVAYFEYDDRGEEETKKLRK LLIGMPKNILFMYAFSLSKSLSIYGMRIGAQIAVSSDEEVIQEFKDALPFSCRTTWSNIP KGGMKLFATIMKNPELKANFLKEKQGYMNLLNERANIFLKEAKEENLKVLPYKSGFFVTV PVGETVDKIIEDLESKNIFVIKFDTGIRIGLCSVPKRKIKGLAKKIKDSINKFKN Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:28:21 2011 Seq name: gi|296154796|gb|ADVK01000022.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00026, whole genome shotgun sequence Length of sequence - 62812 bp Number of predicted genes - 68, with homology - 63 Number of transcription units - 18, operones - 15 average op.length - 4.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 48 - 314 368 ## COG2026 Cytotoxic translational repressor of toxin-antitoxin stability system 2 1 Op 2 . - CDS 322 - 540 455 ## FN0210 CopG family transcriptional regulator 3 1 Op 3 . - CDS 604 - 771 158 ## gi|296328023|ref|ZP_06870557.1| conserved hypothetical protein - Prom 948 - 1007 9.7 4 2 Op 1 . - CDS 1031 - 1660 717 ## gi|296328024|ref|ZP_06870558.1| hypothetical protein HMPREF0397_0751 5 2 Op 2 . - CDS 1672 - 1818 93 ## 6 2 Op 3 . 
- CDS 1850 - 2443 788 ## COG3291 FOG: PKD repeat 7 2 Op 4 2/0.000 - CDS 2454 - 3602 1740 ## COG1775 Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 8 2 Op 5 4/0.000 - CDS 3624 - 4952 2011 ## COG1775 Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 9 2 Op 6 1/1.000 - CDS 5015 - 5812 1440 ## COG1924 Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) 10 2 Op 7 1/1.000 - CDS 5842 - 7101 1436 ## COG0786 Na+/glutamate symporter - Prom 7182 - 7241 9.8 - Term 7160 - 7231 8.5 11 3 Op 1 3/0.000 - CDS 7244 - 8998 2728 ## COG4799 Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) 12 3 Op 2 21/0.000 - CDS 9019 - 9822 1327 ## COG2057 Acyl CoA:acetate/3-ketoacid CoA transferase, beta subunit 13 3 Op 3 1/1.000 - CDS 9825 - 10790 1434 ## COG1788 Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit - Prom 10813 - 10872 9.2 - Term 10808 - 10851 6.2 14 4 Op 1 9/0.000 - CDS 10901 - 12028 1578 ## COG1883 Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit 15 4 Op 2 . - CDS 12043 - 12447 710 ## COG0511 Biotin carboxyl carrier protein 16 4 Op 3 . 
- CDS 12485 - 12805 406 ## FN0199 hypothetical protein 17 4 Op 4 1/1.000 - CDS 12827 - 14833 1742 ## COG3711 Transcriptional antiterminator - Prom 14941 - 15000 11.1 - Term 15038 - 15083 5.1 18 5 Op 1 2/0.000 - CDS 15106 - 15906 992 ## COG0500 SAM-dependent methyltransferases 19 5 Op 2 1/1.000 - CDS 15903 - 16253 482 ## COG1695 Predicted transcriptional regulators 20 5 Op 3 13/0.000 - CDS 16243 - 17910 225 ## PROTEIN SUPPORTED gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P 21 5 Op 4 49/0.000 - CDS 17907 - 18731 717 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 22 5 Op 5 38/0.000 - CDS 18728 - 19684 685 ## COG0601 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 23 5 Op 6 1/1.000 - CDS 19681 - 21228 2315 ## COG0747 ABC-type dipeptide transport system, periplasmic component - Prom 21250 - 21309 9.4 - Term 21321 - 21363 8.2 24 6 Tu 1 . - CDS 21385 - 22818 1549 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen - Prom 22840 - 22899 7.8 - Term 22884 - 22926 8.2 25 7 Op 1 7/0.000 - CDS 22937 - 24595 1580 ## COG2972 Predicted signal transduction protein with a C-terminal ATPase domain 26 7 Op 2 3/0.000 - CDS 24582 - 25367 824 ## COG4753 Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 27 7 Op 3 3/0.000 - CDS 25398 - 26282 1096 ## COG0229 Conserved domain frequently associated with peptide methionine sulfoxide reductase 28 7 Op 4 13/0.000 - CDS 26351 - 26980 878 ## COG0526 Thiol-disulfide isomerase and thioredoxins - Prom 27001 - 27060 6.7 29 7 Op 5 . - CDS 27064 - 27717 854 ## COG0785 Cytochrome c biogenesis protein - Prom 27941 - 28000 15.7 + Prom 27934 - 27993 13.6 30 8 Op 1 . + CDS 28038 - 29354 1279 ## COG3593 Predicted ATP-dependent endonuclease of the OLD family 31 8 Op 2 . 
+ CDS 29375 - 29965 404 ## FN0184 hypothetical protein + Term 30044 - 30096 4.2 + Prom 29971 - 30030 7.7 32 9 Op 1 6/0.000 + CDS 30113 - 31543 2350 ## COG0579 Predicted dehydrogenase 33 9 Op 2 4/0.000 + CDS 31554 - 32819 2186 ## COG0446 Uncharacterized NAD(FAD)-dependent dehydrogenases 34 9 Op 3 . + CDS 32819 - 33163 487 ## COG3862 Uncharacterized protein with conserved CXXC pairs + Term 33167 - 33227 15.6 + Prom 33210 - 33269 10.4 35 10 Op 1 . + CDS 33295 - 33771 543 ## FN0180 TPR repeat-containing protein 36 10 Op 2 1/1.000 + CDS 33776 - 35230 1933 ## COG0666 FOG: Ankyrin repeat 37 10 Op 3 . + CDS 35247 - 36227 1504 ## COG0666 FOG: Ankyrin repeat + Term 36351 - 36387 -1.0 38 11 Op 1 22/0.000 - CDS 36311 - 36610 393 ## COG0851 Septum formation topological specificity factor 39 11 Op 2 22/0.000 - CDS 36616 - 37410 1046 ## COG2894 Septum formation inhibitor-activating ATPase 40 11 Op 3 1/1.000 - CDS 37412 - 38062 815 ## COG0850 Septum formation inhibitor - Prom 38123 - 38182 6.6 41 12 Tu 1 . - CDS 38184 - 39140 1644 ## COG2070 Dioxygenases related to 2-nitropropane dioxygenase - Prom 39168 - 39227 8.6 - Term 39273 - 39307 -0.9 42 13 Op 1 . - CDS 39343 - 40830 1834 ## FN0173 hypothetical protein 43 13 Op 2 . - CDS 40842 - 41567 290 ## PROTEIN SUPPORTED gi|227512216|ref|ZP_03942265.1| ribosomal protein S4e - Prom 41801 - 41860 11.6 + Prom 41599 - 41658 3.0 44 14 Tu 1 . + CDS 41690 - 41803 89 ## 45 15 Op 1 . - CDS 42043 - 43365 1870 ## COG1160 Predicted GTPases 46 15 Op 2 . - CDS 43428 - 43835 524 ## FN0169 coproporphyrinogen III oxidase 47 15 Op 3 . - CDS 43851 - 45473 1319 ## FN0289 hypothetical protein 48 15 Op 4 . - CDS 45490 - 45603 181 ## 49 15 Op 5 . - CDS 45623 - 45982 379 ## gi|296328067|ref|ZP_06870601.1| hypothetical protein HMPREF0397_0794 50 15 Op 6 . - CDS 45996 - 46280 502 ## FN0165 hypothetical protein 51 15 Op 7 . - CDS 46332 - 46511 159 ## - Prom 46572 - 46631 9.0 - Term 46586 - 46629 6.3 52 16 Op 1 . 
- CDS 46639 - 47136 595 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 53 16 Op 2 . - CDS 47140 - 47253 145 ## 54 16 Op 3 1/1.000 - CDS 47265 - 48170 1223 ## COG3023 Negative regulator of beta-lactamase expression 55 16 Op 4 1/1.000 - CDS 48229 - 49191 1295 ## COG0646 Methionine synthase I (cobalamin-dependent), methyltransferase domain 56 16 Op 5 . - CDS 49212 - 50552 1638 ## COG0534 Na+-driven multidrug efflux pump - Prom 50580 - 50639 8.6 - Term 50612 - 50662 7.8 57 17 Op 1 7/0.000 - CDS 50668 - 51717 669 ## PROTEIN SUPPORTED gi|163764769|ref|ZP_02171823.1| ribosomal protein L18 58 17 Op 2 1/1.000 - CDS 51710 - 53068 1780 ## COG1066 Predicted ATP-dependent serine protease 59 17 Op 3 . - CDS 53072 - 53563 301 ## PROTEIN SUPPORTED gi|163764798|ref|ZP_02171851.1| ribosomal protein S19 60 17 Op 4 . - CDS 53584 - 54825 1241 ## FN0155 hypothetical protein 61 17 Op 5 1/1.000 - CDS 54850 - 56226 1652 ## COG1530 Ribonucleases G and E 62 17 Op 6 3/0.000 - CDS 56201 - 57247 971 ## COG1243 Histone acetyltransferase 63 17 Op 7 1/1.000 - CDS 57234 - 57938 760 ## COG0571 dsRNA-specific ribonuclease 64 17 Op 8 27/0.000 - CDS 57954 - 59195 2014 ## COG0304 3-oxoacyl-(acyl-carrier-protein) synthase - Prom 59215 - 59274 8.2 - Term 59222 - 59259 3.1 65 18 Op 1 6/0.000 - CDS 59286 - 59513 471 ## COG0236 Acyl carrier protein 66 18 Op 2 14/0.000 - CDS 59581 - 60480 1530 ## COG0331 (acyl-carrier-protein) S-malonyltransferase 67 18 Op 3 16/0.000 - CDS 60520 - 61506 1325 ## COG0332 3-oxoacyl-[acyl-carrier-protein] synthase III 68 18 Op 4 . 
- CDS 61506 - 62504 1276 ## COG0416 Fatty acid/phospholipid biosynthesis enzyme - Prom 62594 - 62653 10.2 Predicted protein(s) >gi|296154796|gb|ADVK01000022.1| GENE 1 48 - 314 368 88 aa, chain - ## HITS:1 COG:FN0211 KEGG:ns NR:ns ## COG: FN0211 COG2026 # Protein_GI_number: 19703556 # Func_class: J Translation, ribosomal structure and biogenesis; D Cell cycle control, cell division, chromosome partitioning # Function: Cytotoxic translational repressor of toxin-antitoxin stability system # Organism: Fusobacterium nucleatum # 1 88 1 88 88 151 96.0 3e-37 MKYDVEYSKTAMNTIKKLDSSTSKLIRTWIEKNLINTENPRIKGKALTGDLKGLWRYRVG DYRILADIQDDKIVILILDIGHRSKIYL >gi|296154796|gb|ADVK01000022.1| GENE 2 322 - 540 455 72 aa, chain - ## HITS:1 COG:no KEGG:FN0210 NR:ns ## KEGG: FN0210 # Name: not_defined # Def: CopG family transcriptional regulator # Organism: F.nucleatum # Pathway: not_defined # 1 72 1 72 72 111 97.0 1e-23 MGTTATLRLDETEKAIIQDYASSKGMTMSEFVKRVVLDYIEDEYDLKVYKEYLKEKENGS LKTYSHKEVWGE >gi|296154796|gb|ADVK01000022.1| GENE 3 604 - 771 158 55 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328023|ref|ZP_06870557.1| ## NR: gi|296328023|ref|ZP_06870557.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 55 11 65 65 73 100.0 6e-12 MYQNFCELQKELEKIFNRKVDLIKKETFDYKFRSDNVREYKEKIKEEILESVLYI >gi|296154796|gb|ADVK01000022.1| GENE 4 1031 - 1660 717 209 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328024|ref|ZP_06870558.1| ## NR: gi|296328024|ref|ZP_06870558.1| hypothetical protein HMPREF0397_0751 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_0751 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 209 1 209 209 367 100.0 1e-100 MDIKFLVPTILSIISICFTIIGIKRLSLFQNWQVYFEENKEKIRQQEKITDNIFNKLNSR AGLIPYFNIILDDSKIKEENGKIYLGISLINIGKESATNVGICFTEERKEIIVEGYDKNP DSYIVYNYLDKYYAMVGDKISFSIVTDRKEKIMNVFLRFKIQYYDLIGNRYEQEFRFAFY DDFNGKNKTYYSLNNISDLPKIVEENKYE >gi|296154796|gb|ADVK01000022.1| GENE 5 1672 - 1818 93 48 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MIINFIFGTFIFDVYLNYVINKIKEYFVNVENDIKEEFLKKIYKNIIY >gi|296154796|gb|ADVK01000022.1| GENE 6 1850 - 2443 788 197 aa, chain - ## HITS:1 COG:MA4285_2 KEGG:ns NR:ns ## COG: MA4285_2 COG3291 # Protein_GI_number: 20093074 # Func_class: R General function prediction only # Function: FOG: PKD repeat # Organism: Methanosarcina acetivorans str.C2A # 36 168 659 795 1325 72 37.0 6e-13 MDQNIWEYDDFIFKGDELKGMTQKGKDKVKVEGKTDLVIPELTPDGLPLKKIADNAFYRR GLTSVIIPSTVESIGYDAFGVCKLKEVKLPEALVNIEGFAFYRNKLTKVEFGSKVKRLEP SSFAMNELSELNLPETVEYIGASAFYKNSLETVSFPKSVTKIDMYAFRKNNIHKVEVANS VDLHKFAFETFTAVERF >gi|296154796|gb|ADVK01000022.1| GENE 7 2454 - 3602 1740 382 aa, chain - ## HITS:1 COG:FN0208 KEGG:ns NR:ns ## COG: FN0208 COG1775 # Protein_GI_number: 19703553 # Func_class: E Amino acid transport and metabolism # Function: Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB # Organism: Fusobacterium nucleatum # 1 382 1 382 382 749 100.0 0 MAEIKELLEQFKYYAENPRKQLDKYLAEGKKAVGIFPYYAPEEIVYAGGMVPFGVWGGQG PIEKAKDYFPTFYYSLALRCLEMALDGTLDGLSASMVTTLDDTLRPFSQNYKVSAGRKIP MVFLNHGQHRKEEFGKKYNAKIFNNAKEELEKICDVKITDENLKKAFKVYNENREEKRKF IKLAAKHPQSIKASDRSNVLKSSYFMLKDEHTDLLRQLNQKLEALPEEQWDGVRVVTSGI ITDNPGLLEVFDNYKVCVVADDVAHESRALKVDIDLSIEDPMLALADQFARMDEDPLLYD PDIIKRPKYVLDLVKENNADGCLLFMMNFNDTEEMEYPSLKQAFDEAKVPLIKMGYDQQM VDFGQVKTQLETFNELVQLSRF >gi|296154796|gb|ADVK01000022.1| GENE 8 3624 - 4952 2011 442 aa, chain - ## HITS:1 COG:FN0207 KEGG:ns NR:ns ## COG: FN0207 COG1775 # Protein_GI_number: 19703552 # Func_class: E Amino acid transport and metabolism # Function: Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, 
BcrC/BadD/HgdB # Organism: Fusobacterium nucleatum # 1 442 1 442 442 912 99.0 0 MAGKMEKLPNKKPRPIEGHKPAAAILRGVVDKVYANAWEAKKRGELVGWSSSKFPIELAK AFDLNVVYPENHAASTAAKKDGLRLCQAAEDMGYDNDICGYARISLAYAAGEPTDSRRMP QPDFLLCCNNICNMMTKWYENIARIHNIPLIMIDIPFSNTVDTPEEKVDYLIGQFDHAIK QLEELTGKKFDEKKFEDACARANRTAAAWLKSCKYMGYKPSPLSGFDLFNHMADIVAARC DEEAAMGFELLADEFEQSIKEGTSTWEYPEEHRILFEGIPCWPGLKPLFEPLKDNGVNVT AVVYAPAFGFRYNNVREMAAAYCKAPCSVCIETGVEWRETMAKENGISGALVNYNRSCKP WSGAMPEIERRWKEDLGIPVVHFDGDQADERNFSTEQYNTRVQGLVEIMQERKEEKLAKG EEVYTNFENTKETDWSKETIKH >gi|296154796|gb|ADVK01000022.1| GENE 9 5015 - 5812 1440 265 aa, chain - ## HITS:1 COG:FN0206 KEGG:ns NR:ns ## COG: FN0206 COG1924 # Protein_GI_number: 19703551 # Func_class: I Lipid transport and metabolism # Function: Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) # Organism: Fusobacterium nucleatum # 1 265 1 265 265 490 100.0 1e-138 MSNVFTMGIDVGSTASKCVILKDGKEIVAKSVISVGTGTSGPARAMKEALEQIGLSSVNE LQGAVATGYGRNSLAEVPAQMSELSCHAKGAYFLFPNVHSIIDIGGQDSKALKIGDNGML ENFVMNDKCAAGTGRFLDVIAKVLEVDLEDLEKLDEKSTVDVAISSTCTVFAESEVISQL AKGTKIEDIVKGIHTAIASRVGSLAKRIGIKDDVVMTGGVALNKGMVRALERNLGFKLHT NEYCQLNGAIGAALFAYQKYTMTHQ >gi|296154796|gb|ADVK01000022.1| GENE 10 5842 - 7101 1436 419 aa, chain - ## HITS:1 COG:FN0205 KEGG:ns NR:ns ## COG: FN0205 COG0786 # Protein_GI_number: 19703550 # Func_class: E Amino acid transport and metabolism # Function: Na+/glutamate symporter # Organism: Fusobacterium nucleatum # 1 419 1 419 419 702 100.0 0 MEEINVLKFDMFTTLMLAVLAIYFGDFMRKIFPILKKYCLPASVVGGTVFALISLLLFKM GIVQLDFDYKAINQLFYCIFFAASGAAASMALLKKGGKLVAIFAVLAAILAACQNGMALV VGKFMNIDPLISMMTGSIPMTGGHGNAASFAPIAVDAGAPAAIEVAIAAATFGLISGCML GGPFGNFLVKRFKLEGSTSNEQAMGEIDAEGESGNLLVDKPNIIQAVFLMCIAIGIGKII ELGLKSIQENTGWKVALPIHVCCMFAGIVIRLIYDRKEGNHDVLYESIDIVGEFSLALFV SMSIITMKLWQLSGLGLALVVLLIAQLVLIVTFCYFLTFRLLGKNYDAAVMAVGHMGFGL GAVPVSMTTMQAVCKKYRYSKLAFFVVPVIGGFISNLTNAVIITKFLDLAKDLHTIWIS >gi|296154796|gb|ADVK01000022.1| GENE 11 7244 - 8998 2728 584 aa, chain - ## HITS:1 
COG:FN0204 KEGG:ns NR:ns ## COG: FN0204 COG4799 # Protein_GI_number: 19703549 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) # Organism: Fusobacterium nucleatum # 1 584 1 584 584 1155 100.0 0 MNYSMPKYFQNMPQVGKALANIDEANENAVKEIEAAIADSVAAMQDSGTSDEKIHDKGQM TALERVAELVDEGTWYPLNTLYNPENFETGTGIVKGLGRIGGKWAVVVASDNKKIVGAWV PGQADNLLRASDTAKCLGIPLVYVLNCSGVKLDEQEKVYANRRGGGTPFFRNAELQQLGI PVIVGIYGTNPAGGGYHSISPTILIAHKDANMAVGGAGIVGGMNPKGYIDMEGAIQIAEA TMAAKKVEVPGTIHVHYDKTGFFREVYDDEIGVIDGIKKYMDYLPAYDLEFFRVDEPTEP ALDPNDLYSIIPMNQKKIYNIYDVIGRLFDNSEFSEYKKGYGPEVVTGIAKVDGLLVGVI ANAQGLLMNYPEYREKAVGIGGKLYRQGLIKMSEFVTLCSRDRLPIVWLQDTSGIDVGNP AEEAELLGLGQSLIYSIENSHVPQIEITIRKGSAAAHYVLGGPQGNNTNAFSLGTAATEV YVMNGETAASAMYSRRLAKDYKAGKDLQPTIDKMNQLINEYTAKSRPAYCAKTGMVDEIV PLYDLRGYISAFANAVYQNPKSICAFHQMILPRAIREFETYTKK >gi|296154796|gb|ADVK01000022.1| GENE 12 9019 - 9822 1327 267 aa, chain - ## HITS:1 COG:FN0203 KEGG:ns NR:ns ## COG: FN0203 COG2057 # Protein_GI_number: 19703548 # Func_class: I Lipid transport and metabolism # Function: Acyl CoA:acetate/3-ketoacid CoA transferase, beta subunit # Organism: Fusobacterium nucleatum # 1 267 1 267 267 536 100.0 1e-152 MAKNYKNYTNKEMQAITIAKEIKDGQIVIVGTGLPLIGATVAKNKFAPNCKLIVESGLMD CSPIEVPRSVGDLRLMGHCAVQWPNVRFIGFETNEYLNGNDRMIAFIGGAQINPYGDLNS TIIGDDYIKPKTRFTGSGGANGIATYSNTVIMMQHEKRRFIDKIDYITSVGWAGGPGGRE KLGLPGNRGPLAVVTDKGILRFDEKTKRMYLAGYYPGVTIEDIVENTGFEIDTSRAVQLE APSEEIIKMIREDIDPGQAFIKVPVEE >gi|296154796|gb|ADVK01000022.1| GENE 13 9825 - 10790 1434 321 aa, chain - ## HITS:1 COG:FN0202 KEGG:ns NR:ns ## COG: FN0202 COG1788 # Protein_GI_number: 19703547 # Func_class: I Lipid transport and metabolism # Function: Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit # Organism: Fusobacterium nucleatum # 1 321 1 321 321 661 100.0 0 MSKIMSLHDAIAKYVESGDSLCFGGFTTNRKPYAAVYEIIRQGQTDFIGYSGPAGGDWDM LIGCGRIKAFINCYIANSGYTNVCRRFRDAVEKKHNLLLEDYSQDVIMLMLHASSLGLPY 
LPVKLMEGSDLEYKWGISAEIRKTIPKLPDKKLERIPNPFKEGEEVIAVPVPRLDTAIIS VQKASINGTCSIEGDEFHDIDIAIAAKKVIVIAEEIVTEEEIRRDPSKNSIPQFCVDAVV HVPYGTHPSQLYNYYDYDADFYKMYDKVTKTDEDFEQFIKEWVIDVKDHEGYLAKLGLPR VSKLKVVPGFQYAAKLVKDGE >gi|296154796|gb|ADVK01000022.1| GENE 14 10901 - 12028 1578 375 aa, chain - ## HITS:1 COG:FN0201 KEGG:ns NR:ns ## COG: FN0201 COG1883 # Protein_GI_number: 19703546 # Func_class: C Energy production and conversion # Function: Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit # Organism: Fusobacterium nucleatum # 1 375 1 375 375 573 100.0 1e-163 MSFFSVLAELLEASGFAALTWQNIAMILVSFVLFYLAIVKKFEPLLLLPISFGMFLVNLP LAGLMDEGGVINIMSYGVKSNLFPCLVFMGVGAMTDFSPLIANPISLILGAAAQLGIYVA FIFATQIGFTPAEAAAIGIIGGADGPTSIYIANNLAPHLLAPIAVAAYSYMALIPLIQPP IMKALTTKKERAVKMGQLRKVSKTEKIVFPIAVVLFTSLLLPSVAPLLGLLMLGNLFKES GVVQRLSDTAQNAMINIITIMLGLSVGAKADGATFLDISTLKIIAMGLAAFCFSTAGGVF LGKLLYIITGGKINPLIGSAGVSAVPMAARVSQTVGAKENPTNFLLMHAMGPNVAGVIGS AVAAGFFMMIFKGTM >gi|296154796|gb|ADVK01000022.1| GENE 15 12043 - 12447 710 134 aa, chain - ## HITS:1 COG:FN0200 KEGG:ns NR:ns ## COG: FN0200 COG0511 # Protein_GI_number: 19703545 # Func_class: I Lipid transport and metabolism # Function: Biotin carboxyl carrier protein # Organism: Fusobacterium nucleatum # 1 134 1 134 134 171 100.0 5e-43 MKYVVTVNGKKFEVEVEKVGGAGKSLSRQPVERRETVVKSEPVVETKVAAAPVEAAPAAT ATTGGTTITSPMPGSILDVKVNVGDKVKFGQTLAILEAMKMENDIPATADGEVAEIRVKK GDVVETDSVLIVLK >gi|296154796|gb|ADVK01000022.1| GENE 16 12485 - 12805 406 106 aa, chain - ## HITS:1 COG:no KEGG:FN0199 NR:ns ## KEGG: FN0199 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 106 1 100 100 142 100.0 4e-33 MWTSNTMTFGESLITFLIGFSIVFFALISLALFIIISSKVINLLVKEEAPEQKPVVSNVN ANTATKAVVKEDNQEEAERLAVIISAISEEMREPVERFTIVSITEI >gi|296154796|gb|ADVK01000022.1| GENE 17 12827 - 14833 1742 668 aa, chain - ## HITS:1 COG:FN0198 KEGG:ns NR:ns ## COG: FN0198 COG3711 # Protein_GI_number: 19703543 # Func_class: K Transcription # Function: 
Transcriptional antiterminator # Organism: Fusobacterium nucleatum # 9 668 1 660 660 969 99.0 0 MLKKQHFELLKLIENEKKLPKIAELLTLTERSIRYKIDEINEELGTKKIEIKKREFFSSL TDEDMTKLVENVEGENYIYNQKERQELIILYTLMKKDNFLLKEVAEKLGTSKSTIRNDLK NLKKILLDYNIKLLQDDKLKYYFDYLEDDYRYFIATYLYKYVSFDEKYDKIFFDDISYFR KVIYKEIKDEYITEINSVSKKMKKAELDYMDETLNILAILMVISQKREKKNSNLKIENIK ILEKRKEYLQLKKIFTDFSNTNLMFFTDYLFRITRDEKDIFVKFKNWLDIIVAVNKIVRT FEIKIKVDLKNIDIFLDEIFYYIKPLIFRTKKRIKLKNSILKDVKKLYPSIFNFLKKNFY FLEEVINEKVSDEEIAYLVPFFHKALQNNNKINKKGILVTTYKENIALFLKENIEAEFLI DIDKILTLKSFEQIVKDLENYDYILTTFSVEKDFVKEIKRTKIIELNPILTEKDIKKLEE AGLTKNKKIKMTALLKVILENSSDVNVKKLINSLDETFPEKIYNDVDKNKFLLGNFLKQE DIFKTNLNSFEEILIKIFKLSFLQKNDINGIMNKASNNNFYSYLGEKIGIIFHKLNTKNS QDKVLIAINEKEICINGKKISTIILINSNCEIKYKAIIYNFVKLFFQNKKFSFNNNRLDI YDYLISNI >gi|296154796|gb|ADVK01000022.1| GENE 18 15106 - 15906 992 266 aa, chain - ## HITS:1 COG:FN0197 KEGG:ns NR:ns ## COG: FN0197 COG0500 # Protein_GI_number: 19703542 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 1 266 1 266 266 456 96.0 1e-128 MNIEEFEKIMQREDREEKQEDLEKIWDSKSEWFFKRTEKKQENFSNRLIFKMIKEKNLLN ENSKVLDIGCGTGRHLLEFSKFTSYVIGTDISSKMLDYAKEKLKNVREAKFVHGNWMENF TKEKEFDLVFASMTPAITTIEHIKRMCLISKKYCLMERFVYNRDPIREEIEGILGRKLNK LHSNQKEYTYGVWNIVWNLGYFPEIYYDKYVYEAEKTVVDYMEQIICTDEEKNKIIEFLK GKEKNGKIMSKEYVIKAIILWDVNIF >gi|296154796|gb|ADVK01000022.1| GENE 19 15903 - 16253 482 116 aa, chain - ## HITS:1 COG:FN0196 KEGG:ns NR:ns ## COG: FN0196 COG1695 # Protein_GI_number: 19703541 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 116 1 116 116 218 100.0 3e-57 MDINERSKFKHLTAFILVLLAEKNYSPREVKKSLLENFPGFTRDMSTVYRCLSSLEKEGL VTVDWYLPDGGAAKKIYSLTEKGWEALYEWKKDIVIRKRNFEVFLKKIENLIEGEK >gi|296154796|gb|ADVK01000022.1| GENE 20 16243 - 17910 225 555 aa, chain - ## PROTEIN SUPPORTED ## NR: 
gi|229849245|ref|ZP_04469311.1| LSU ribosomal protein L17P [Thermanaerovibrio acidaminovorans DSM 6589] # 332 550 144 356 398 91 29 1e-17 MTEILKVENLTITYYSHSGSSQIGVKNVNFSLNEGEIYGIVGESGSGKSTILLAIMGLLY ERAKIEGKIYFKGKEIQNLNKKEMKEVCFKEIGLIFQNQMESLNPSLTIEKQILEILRKK FSDKNELQKKLDEVLEMVGLELKNKKKYPHELSGGMRQRVFIAMGICLNPPLLLIDEPTT ALEEESKKNILNLLKDIKRKYNTTMLIISHDFEVIEYLTEKVLILLRGNLIEKGRTEKIL EEPKHPYTFALLQSSTFLNPWKDLWGIKEEEDEKYPCPFYKRCTQKVEACLNYIPCLKSD IEEGVACYKNGIEKLLVLKNIRKLFKTSEQKVEAVKNCSLRVRQGEIVALLGKSGSGKTT LLRIIAGLLSKDAGEICFFQEKIEKNNLIAREKCLQIIEQDPFSSMNPSLTVEEIIAEPT VILHKKNLALCKEIVVENLKKVGLETNYNFLKKQANELSGGQRQKVAIARALSMDPKLLL ADEISSMLDESGKLNIMRLLKQLQHNIGFSVLLVTHDITLAKKVADYIYYMDSGCIIEEG SVRKIFCKKRGNDGY >gi|296154796|gb|ADVK01000022.1| GENE 21 17907 - 18731 717 274 aa, chain - ## HITS:1 COG:FN0194 KEGG:ns NR:ns ## COG: FN0194 COG1173 # Protein_GI_number: 19703539 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 274 1 274 274 486 99.0 1e-137 MRIKEINLKLSYVFLFFILILAILPYFSFLKSGTIPSGTALLPPSKEHWFGTDDLGIDIF SEICYGAKNTIILSCSSALFAAIGGSVIGMVAGYFGGIFDEILLGIIDFFISVPDLLLMV VLGTFLGPSLRNIVFSIVLVSWIMPAKITRSQILRMKQENYIKIAKIYGAGFIHLFLWHF WKPFFSIIMISVIKLMNRAILAEAALSYLGLGDPLSKSWGMIITRAMDFPNIYLTEFWKW WLVYPVTFMVLMVLSIAIVGQKIERKIGGVYRTQ >gi|296154796|gb|ADVK01000022.1| GENE 22 18728 - 19684 685 318 aa, chain - ## HITS:1 COG:FN0193 KEGG:ns NR:ns ## COG: FN0193 COG0601 # Protein_GI_number: 19703538 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 318 1 318 318 554 98.0 1e-158 MKKKYIVLFFILLTIHFILPRIMKADPFVFLSSDGTEVASYTEEEILKYKQYYGLDMPLW RQYLNYLLGIFTGNLGYSIYFKEKVTTLIFSRLVWTAGIVIFSLCISSVFGLFLGSFSAW NCQRKIDTILYQGMVIISEIPSFLTANMILMFFIIKWRILPTAGGITPFIKIEFSWNFIL 
DIIKHAILPSLTLTFLRLPDFYFVSRSAMLQQIQKKYVETAQAKSLGDIYILMRHCLPNA INPIMTRFLLSIQTMFNATLIVENVFKYPGIGKLIRDAVFYRDYLLLQGIFLVITIFILT ISFLGENFYQTIEKRKEL >gi|296154796|gb|ADVK01000022.1| GENE 23 19681 - 21228 2315 515 aa, chain - ## HITS:1 COG:FN0192 KEGG:ns NR:ns ## COG: FN0192 COG0747 # Protein_GI_number: 19703537 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 515 1 515 515 1003 98.0 0 MRNKKLGIIGIFVLCLLFLVACNKEKEKMAKPEEKEVVLRLEGGDFGYPNPFRHQNRGPG FFKMELIYDSLLEKDENGLIPWLAKEWEVSEDGKTFTFTLVDGAKWHDGKDLTAEDVAFT VEYFKKHPPVRGGLMLNGEYLMDTVSVDGNKVIIHTVDYTLVALEKIGSMRIIPKHIWEK VDDPEKFSGEGDIVGSGPYKLVAYNSEQGSYKMEKNNEFWGLEAAATRIEWIPVSDRVLA FQNGEIDITVIPVDLLKNFENNKEFKIVKNFGLHNYRLYFNFDKVPSLQDKDVRQAIAYA IDRKELIDKLERGSGLEGSQGYLPPTHSMYNKNLPAYSYNVEKAKELIKGKTFEVELLVG NSPKEVKMAELIKIRLEEIGISVKVVSIDSKARDSKVRDKDYQMAIMKSGGMGADPDMLR EIYSSKSKKGELAGYHNEVLDKLLSQQSTEKDVEKRKEFIYKIQEVLAEEIPMLLLYGEV ENTVYRPEKYDYWTTRYDHTKLDHPKLSYVIRPKK >gi|296154796|gb|ADVK01000022.1| GENE 24 21385 - 22818 1549 477 aa, chain - ## HITS:1 COG:FN0191 KEGG:ns NR:ns ## COG: FN0191 COG2865 # Protein_GI_number: 19703536 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Fusobacterium nucleatum # 1 477 1 477 477 873 99.0 0 MTIEEMKNLIQNGEKIDVEFKESKEALTKDIFDSVCSFNNRNGGHIFLGVSDKKEIIGVN KNKIDKIIKEFTTSINNPQKIYPPLYLVPLVVEIDEKIIVYIRVPKGYQVCRHNGKIWDR SYEGDINITDNSELVYKMYARKQSSYFVNKVYPNLKIDFLDINIIEKVRKMAILRNQNHT WKNLNDEELLRSANLILIDPETNKEGITLAAILLFGKDNSIMSVLPQHKTDAIFRVENKD RYDDRDVIITNLIDSYDRLISFGQKHLNDLFVLDGIINVNARDRILREIISNTLAHRDYS SGFPAKMIIDDEKIIIENSNLAHGIGELDLQKFEPFPKNPPISKVFREIGLADELGSGMR NTYKYTQLYSGMKPIFEEGNIFRTIIPLKEIATEKVGGENVPRNVPRNVPRNVPQKSIEE MKEIMKEIIKKNNRVSRKYLAEILGVSEKTITRYIKEIPNLSYIGKGKNGYWKLDEE >gi|296154796|gb|ADVK01000022.1| GENE 25 22937 - 24595 1580 552 aa, chain - ## HITS:1 
COG:FN0190 KEGG:ns NR:ns ## COG: FN0190 COG2972 # Protein_GI_number: 19703535 # Func_class: T Signal transduction mechanisms # Function: Predicted signal transduction protein with a C-terminal ATPase domain # Organism: Fusobacterium nucleatum # 1 552 1 552 552 971 100.0 0 MKMNNKPLNIKIGFYFLITNLVLVLLLGSIFYFSSSSLLIQKEISAKTEAIEKSGNYIEL YMNKLTTLSQVISHDKGVYDYLKNKDESEKNRILNIIDNTLSTDPYIKSIILIRKDGAVI SNEKNVNMEVSSDMMKEEWYVNSLMNPMPVLNPLRKQNFSLDGMDDWVISVSREIADTNG ENLGVLLIDVKYQALHEYLQNQETGKNSDIVILDEDNRIVYYKEIPYDTSQEKYLKNLKN IEEGYNRKENTVTVKYPIKNTHWTLIEISYMQEIDSLKNHFFEMIVISCLASLLITVLIN ISVLRRITKPIKELEQHMNNFNNDLSKINLKGDVSIEILSLQNHFNEMIDKIKYLREYEI NALYSQINPHFLYNTLDTIIWMAEFQDTEKVISITKALSNFFRISLSNGKEKIPLKEEIN HIKEYLYIQKQRYEDKLEYKISIQEELENIEVPKIILQPFVENAIYHGIKNLDTTGIISI YSQIVENKIELIIEDNGIGFEAAKKQALMKMGGVGIKNVNKRIQYYYGKEYGAKIDSSFK TGARIIITLPYK >gi|296154796|gb|ADVK01000022.1| GENE 26 24582 - 25367 824 261 aa, chain - ## HITS:1 COG:FN0189 KEGG:ns NR:ns ## COG: FN0189 COG4753 # Protein_GI_number: 19703534 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain # Organism: Fusobacterium nucleatum # 1 261 1 261 261 412 99.0 1e-115 MYKLMIADDEPLIRRGIKQLIDLSSLQIGEIYEASTGEEALKVFEEFKPEIVLMDINMPK IDGLSVAKKIKSINLDTKIAIITGYNYFDYAQTAIKIGVEDYILKPISKSDVSEIIVKLV SSLQKERKDKEIEKVLEKITTVDTQDNIAKNNYKELIQNIIEESYTDSQFTLSVLSEKLD LSSGYLSIMFKKNFGIPFQDYLLQKRMEKAKLLLLTTELKNYEIAEQVGFEDVNYFITKF KKYYQITPKQYREMVLKNENE >gi|296154796|gb|ADVK01000022.1| GENE 27 25398 - 26282 1096 294 aa, chain - ## HITS:1 COG:FN0188_2 KEGG:ns NR:ns ## COG: FN0188_2 COG0229 # Protein_GI_number: 19703533 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Conserved domain frequently associated with peptide methionine sulfoxide reductase # Organism: Fusobacterium nucleatum # 147 294 1 148 148 303 100.0 3e-82 MEKIYGVIDVTSGYANGKTKNPKYQDLHNSGHAETVHVKYDASKVNLSTLLKYYFKIIDP 
TSVNKQGNDRGSQYRTGIYYVNQNDKSVIQDEIKEQQKKYSQKIVVEVLPLKEYYLAEEY HQDYLKKNPNGYCHIDLSKADDIIVDEKKYPKLSEKELRMKLNSKQYEVTQNGDTERAFQ NDYWDFFDKGIYVDITTGEPLFSSTDKYASQCGWPSFVKPIVPEVVTYHNDTSFNMIRTE VRSRSGKAHLGHVFDDGPRDRGGKRYCINSAAIQFIPYAEMEAKGYGYLLPLVK >gi|296154796|gb|ADVK01000022.1| GENE 28 26351 - 26980 878 209 aa, chain - ## HITS:1 COG:FN0187 KEGG:ns NR:ns ## COG: FN0187 COG0526 # Protein_GI_number: 19703532 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Fusobacterium nucleatum # 1 209 1 209 209 323 100.0 1e-88 MKGLKKLFLGIMMLLMGAVAFGAEMDLSKVTLKDLNGMSYSFGKDGKPTYVKFWASWCPI CLSGLEDIDNLSKEKKDFEVVTVVSPGLVGEKKTEDFKKWYKSLEYKNIKVLLDEKGELS KMLNVRVYPTSVVVNKAGKAEKVLPGHLEKAEIKKLFSSKMNDAHMMKDDKMMMNDKGMK DTMMKDDKMMNDKHMMKDDKMNMEKKTSM >gi|296154796|gb|ADVK01000022.1| GENE 29 27064 - 27717 854 217 aa, chain - ## HITS:1 COG:FN0186 KEGG:ns NR:ns ## COG: FN0186 COG0785 # Protein_GI_number: 19703531 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Cytochrome c biogenesis protein # Organism: Fusobacterium nucleatum # 1 217 1 217 217 299 100.0 3e-81 MLNTELFIGAVYVAGLLSFFSPCIFPLLPVYIGMLSTSGKKSIIKTLVFVIGLSTSFVLL GFGAGSIGSFLISKTFRIISGVIVIIFGIIQMEIVKIPFLERTKLVDIKGKEDDSIWGAF LLGFTFSLGWTPCVGPILASILFISSGGGNPYYGALMMFIYVLGLATPFVILSLSSKYVL TKVSAIKKHLGIMKKIGGLLIIIMGILLLTDKLSIFL >gi|296154796|gb|ADVK01000022.1| GENE 30 28038 - 29354 1279 438 aa, chain + ## HITS:1 COG:FN0185 KEGG:ns NR:ns ## COG: FN0185 COG3593 # Protein_GI_number: 19703530 # Func_class: L Replication, recombination and repair # Function: Predicted ATP-dependent endonuclease of the OLD family # Organism: Fusobacterium nucleatum # 39 438 1 400 400 734 100.0 0 MQLLKVQIKNWQTFTDVSLECKDFLVFIGASSTGKSSFMKALLYFFQARNLHDGDIRNPN LPLEIIITLKGEKGHIFQLRILNNPYQSIRYFIKNHISKPEKNNRNWEEIEEKDFKKQVS EISIFYVPAFMKASYLDYLVEKLFQNENLKKYYKHYKRFKNSINKKMSFGYYRHIFIEFL 
QEIIIKEKSHAFWKNSILLWEEPEFYLNPQQERACYDALLQNTKLGLMAVVSTNSSRFIE LENYQSLCIFRRIKEDVEIYQYNGNLFSGDEVTVFNMNYWINPDRSEIFFAKKVILVEGQ TDKIVLSYLAKNLGIFNYDYSIVECGSKSSIPQFIRLLNAFHIPYVVVYDKDNHYWRNET ELENSTLKNKMIQNLIWKKLGEWVEFENDIEEEIYDESRDKKNYKNKPFYALETVINSNY IVPKRLEEKVRKIFEEIK >gi|296154796|gb|ADVK01000022.1| GENE 31 29375 - 29965 404 196 aa, chain + ## HITS:1 COG:no KEGG:FN0184 NR:ns ## KEGG: FN0184 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 54 196 1 143 143 210 100.0 3e-53 MAIQINLEKYGHKKKGYLNFSWTSLFFNFLVPLFRLDIKWVFIFISPYLFLYIMTFGVID SSQFFILTLPHLSIFPTFILLLDYYHSINHLNTSVFVMIILIPILRVIFSFIYNDFYTKN LLKEGYLPPEDDEYSNAILKGNRFLEYTKEDLLDKEKMERYRLIIEEYEQERKKDLINFV LIVILLILLIILLVYI >gi|296154796|gb|ADVK01000022.1| GENE 32 30113 - 31543 2350 476 aa, chain + ## HITS:1 COG:FN0183 KEGG:ns NR:ns ## COG: FN0183 COG0579 # Protein_GI_number: 19703528 # Func_class: R General function prediction only # Function: Predicted dehydrogenase # Organism: Fusobacterium nucleatum # 1 476 23 498 498 890 94.0 0 MFDVVVIGAGIMGAAVSRELSKYELKILLLDKENDVSCGTTKANSAIVHAGYDAKEGSLM AKYNVLGNAMYEKLCEEVDAPFRRVGSYVLAFSEKEKEHLEMLYQRGLNNGVPEMEIIDA AEIQKREPHVSKEAVAALYAGTAGITGPWELATKLIENAMENGVELKLNSEVTNIKKEND IFKIKLKNGEVIETKKLINAAGVYADFINNMLSSKKFKITPRIGEYYLLDKVQGYLTDSV IFQCPTEMGKGILVSKTVHGNIIVGPTASDVENKDDVGNTQEGLDTVRKFATKSIKDVNF RDNIRNFAGLRAEADTGDFILGEAEDVKGFFNMAGTKSPGLTSSPAMAVDLAKIVVESFD GVKEKENFIKNRKMIHFINLSPEEKAEVIKKDPRYGRIICRCENITEGEIVDAIHRKCGG RTLNGVKRRVRPGAGRCQGGFCGPRVQEILARELGEDLEEIVMEQKDSYILTGKTK >gi|296154796|gb|ADVK01000022.1| GENE 33 31554 - 32819 2186 421 aa, chain + ## HITS:1 COG:FN0182 KEGG:ns NR:ns ## COG: FN0182 COG0446 # Protein_GI_number: 19703527 # Func_class: R General function prediction only # Function: Uncharacterized NAD(FAD)-dependent dehydrogenases # Organism: Fusobacterium nucleatum # 1 421 1 421 421 749 96.0 0 MNMKYDLVIVGGGPAGLAAAVEAKKNGIDNILVIERAKELGGILQQCIHNGFGLHEFKEE LTGPEYAQRFMDQLFELNIEYKLDTMVLGISENKIVQAINSVDGYMIIEAKSIILTMGCR 
ERTRGAIAIPGDRPAGIFTAGAAQRYINMEGYMVGKRVVILGSGDIGLIMARRLTLEGAK VLAVAELMPFSGGLMRNIVQCLNDYDIPLYLSHTVVDIIGKDRVEKVIIAKVDENKKAIP GTEIEYECDTLLLSVGLIPENDISRATGIKIDPRTNGPIVNELMETSIPGIFASGNVVHV HDLVDFVSIESRKAGKSAAKYIKGEVTDGAYVEVQTGNGIGYTVPQKFRIENIEKNLELS MRVRQIYKNVKIVVKSNDFVIHSVKKNHMAPGEMEKITLSKTVLGKIDANKIVVEVVEED K >gi|296154796|gb|ADVK01000022.1| GENE 34 32819 - 33163 487 114 aa, chain + ## HITS:1 COG:FN0181 KEGG:ns NR:ns ## COG: FN0181 COG3862 # Protein_GI_number: 19703526 # Func_class: S Function unknown # Function: Uncharacterized protein with conserved CXXC pairs # Organism: Fusobacterium nucleatum # 1 114 1 114 114 195 97.0 2e-50 MEKEMICIVCPVGCHISVNTETYEVKGNACPRGAVYGKEELTAPKRVVTSTVKIKNALDK RCPVKTEKSIPKELNFKLMDELKNIELTAPVKRGDIVIKNVFNTGVDVVVTKDM >gi|296154796|gb|ADVK01000022.1| GENE 35 33295 - 33771 543 158 aa, chain + ## HITS:1 COG:no KEGG:FN0180 NR:ns ## KEGG: FN0180 # Name: not_defined # Def: TPR repeat-containing protein # Organism: F.nucleatum # Pathway: not_defined # 1 158 1 158 158 228 94.0 7e-59 MLRDNNNFLEKKDFFEQGILALHFNRPLEAIKYLLLLEEEKNSAVPFNIALCYLKAQKYE TVLFYLEKALAEIRRNRSIEISKDNYPELLAFEEENDAYTKPMLYLTPLQFPDLAREQIL RLMVDILFILEKKEEMYKTINSLKNKNYKNVKDKIKRS >gi|296154796|gb|ADVK01000022.1| GENE 36 33776 - 35230 1933 484 aa, chain + ## HITS:1 COG:FN0179 KEGG:ns NR:ns ## COG: FN0179 COG0666 # Protein_GI_number: 19703524 # Func_class: R General function prediction only # Function: FOG: Ankyrin repeat # Organism: Fusobacterium nucleatum # 18 484 1 467 467 820 98.0 0 MSLYDLNDLYWKLRDNPMSREEVLEYYRNADIEEKDSAQSSLLHIAAEHGDSLAIEVLLN RGMDANIENSEAEKPLHRLAEGTRHINNGEEIAKCAELLLDAKASVLRKDRFGRTPVILA AKNAYYEILKVFIDRGLKLSLKDSEGNSALHIACQYFSDYDKENADRYFKTIKYLLEAGL DPNEKNNDDETATDMAIKRCNKKIIALLLGNYDEENPNELLIQTGGLSLHRAIEKKDYEA VNALIKLGTDVNAFSEEEDTLFREMTPLGIAFYMFDEYSVKALLEAGADVNLKTTEENTA LGEILGYMKDNYFSFNKIPLIEELLKLLLDNGLKINDTVDKKGNTAFIKACKSIDENNLS NGKTLAAVVAKFLLKENCDVNSTNLYGQTALMFLCASRDIESQDLQIQVLEAGADVGTMD KNGDTPLIYAAKNRNANSGKEMAELLFDFGDPKLEHVNNDGKTALEIATDLNNEEFVKFL LTKM 
>gi|296154796|gb|ADVK01000022.1| GENE 37 35247 - 36227 1504 326 aa, chain + ## HITS:1 COG:FN0178 KEGG:ns NR:ns ## COG: FN0178 COG0666 # Protein_GI_number: 19703523 # Func_class: R General function prediction only # Function: FOG: Ankyrin repeat # Organism: Fusobacterium nucleatum # 1 326 1 326 326 541 99.0 1e-154 MDNSMIFLNACKNGQKGVVEAFIKKGGLDFNKRDSLGNTALFYACMKGSKDIVKLLLSNG ADGSLANNNSMIPLHAVSKSGNKEIISLLLDKGSDINTTDKEGRTPLIYTLMENRTEAAK LLLEKGADTQIKDNDGHKAIDYATANGLRDIITLLLKNENNDNKNNSGNTPLHQACYNNQ SEVVRELLKQDGIELNIVNDNGNTPLIIAAIESNLLIVQLLLKAGADAKQRLLNGNTALH FAAENGNQYIGKALLEAGAEIDGQNEMGETALLIAAMEGYNDFVKLLVENGANVNIVDNS QNSPLFYASEKGYTEIVEILLLAGAK >gi|296154796|gb|ADVK01000022.1| GENE 38 36311 - 36610 393 99 aa, chain - ## HITS:1 COG:FN0177 KEGG:ns NR:ns ## COG: FN0177 COG0851 # Protein_GI_number: 19703522 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Septum formation topological specificity factor # Organism: Fusobacterium nucleatum # 1 99 1 99 99 161 100.0 2e-40 MGFFSNFFKKENSKDDAKNRLKLVLIQDRAMLPSGVLENMKDDILKVLSKYVEIEKSKLN IEMCPYEDDPRKIALVANIPILKSSTREVTKTTKQQKRK >gi|296154796|gb|ADVK01000022.1| GENE 39 36616 - 37410 1046 264 aa, chain - ## HITS:1 COG:FN0176 KEGG:ns NR:ns ## COG: FN0176 COG2894 # Protein_GI_number: 19703521 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Septum formation inhibitor-activating ATPase # Organism: Fusobacterium nucleatum # 1 264 1 264 264 467 99.0 1e-132 MGARIIVVTSGKGGVGKTTTTANIGAGLADKGHKVLLIDTDIGLRNLDVVMGLENRIVYD LVDVIEERCRISQAFIKDKRCPNLVLLPAAQIRDKNDVTPEQMKSLIDSLKASFDYILVD CPAGIEQGFKNAIVAADEAVVVTTPEVSATRDADRIIGLLEASGIKEPRLVINRLRIDMV KDKNMLSVEDILDILGIKLLGVVPDDETVVISTNKGEPLVYKGDSLAAKAFKNIANRIEG VDVPLLNLDVKMSLLDKIKFVFKR >gi|296154796|gb|ADVK01000022.1| GENE 40 37412 - 38062 815 216 aa, chain - ## HITS:1 COG:FN0175 KEGG:ns NR:ns ## COG: FN0175 COG0850 # Protein_GI_number: 19703520 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: 
Septum formation inhibitor # Organism: Fusobacterium nucleatum # 1 216 1 216 216 388 100.0 1e-108 MSNHVIIKGKNDRLVIALDPNIDFLDICDILKTKILEAKDFIGNSRMAIEFSGRALTNEE ENKLIGIITDNSDIVVSYIFSKRAESEEENIDLNSLNPLIEEGKTHFFRGTLRSGSKIES DGNVVVLGDVNPSSMIRARGNVIVLGHLNGTVCAGLGGDDRAFIAAIYFNPIQITIGMKT ITDIQDEILDSTRVDKKSKFKVASIKNQEIVVEELI >gi|296154796|gb|ADVK01000022.1| GENE 41 38184 - 39140 1644 318 aa, chain - ## HITS:1 COG:FN0174 KEGG:ns NR:ns ## COG: FN0174 COG2070 # Protein_GI_number: 19703519 # Func_class: R General function prediction only # Function: Dioxygenases related to 2-nitropropane dioxygenase # Organism: Fusobacterium nucleatum # 1 318 1 318 318 549 100.0 1e-156 MKNNKICEILGIKYPIFQGAMAWVSGGELAGAVSKDGGLGIIAGGGMEPELLRENIRKAK AITTNPFGVNLMLLRPDVEDQMNVCIEEGVKVITTGAGNPGAFMEKLKAANIKVIPVIPT VKLAERMEKIGADAVIVEGMESGGHVGTLTTMALLPQVVNAVNIPVIAAGGIASGKQFLA ALAMGAEGIQCGTIFLTAKECLIHQNYKNIILKAKDRSTTVTGTSTGHPVRVIENKLAKE MIELERSGAPKEEIEKLGTGSLRLAVIDGDVERGSFMSGQVAAMVNDERTTKEILEFLMN DLKLETEVLKRRLENWDI >gi|296154796|gb|ADVK01000022.1| GENE 42 39343 - 40830 1834 495 aa, chain - ## HITS:1 COG:no KEGG:FN0173 NR:ns ## KEGG: FN0173 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 35 495 1 461 461 743 98.0 0 MKKFIGILLLFIVIVVTILYFARDFLLKEYLERKMSKANNAPVTIGSVDLDYFEGYVTLK DIKIMSNLHKDEIFISIDELKSYYDIDFRKKVITFDDTEIDGIAFFKDSDYENSDREAVV TFENKVTEAEEKTKRDKVLMELKTLYLNKIEENHLNLDEILSRDLVNKDNISELEKIKQS IKNIKESNEKNLNISDVVGEISNIGKSTKKLGKNLKLDNLSKTEEDIRENLTLEESLDRV VRNFLDRNKLVLFDLDSYINIYLNLVYEQKIYNLSLKYRDILDEIRLRKEEDSKLSDGDI WELFFNSISVTSNVYGISFNGEVKNFSTRLFKNKGNTEFKLFGEKGNTIGEFKGFINFNT ELTESTLNIPEADLKDLGNNLLKGGEGVLFQSLSTNGYHLSINGSIHLKNMKLDIDKVIE SMKIEDEVIKEIIAPLLRQLNTGEIYYSYDTDTRILTIKTNVVEVFDNILNGENSSLKTK IRERIKSDFLKRIVG >gi|296154796|gb|ADVK01000022.1| GENE 43 40842 - 41567 290 241 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|227512216|ref|ZP_03942265.1| ribosomal protein S4e [Lactobacillus buchneri ATCC 11577] # 37 240 55 260 264 116 32 3e-25 
MNENLEKIENYIKLAEKTDTVIYSNQFFPVSQLNNLKYSGIKFSFKGLNEDCEKKLLAVY PECFTEDYLYFPVKYFKIIKKSKFISLEHKHYLGNILSLGIKREILGDLIVKNDECYGII LENMFDFLKENLLRINSSPVEIIEIDEKEVPQNEFKELNIRLSSLRLDSLVSELTNLSRA LSVNYIDLGNVQVNYEIQREKSYKVSIGDTIIIKKYGKFKIEKENGLTKKDKLKLIVRKY I >gi|296154796|gb|ADVK01000022.1| GENE 44 41690 - 41803 89 37 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSLSLWEITLLFRGGLFFFLGGEKKILLSITFRRDPI >gi|296154796|gb|ADVK01000022.1| GENE 45 42043 - 43365 1870 440 aa, chain - ## HITS:1 COG:FN0170 KEGG:ns NR:ns ## COG: FN0170 COG1160 # Protein_GI_number: 19703515 # Func_class: R General function prediction only # Function: Predicted GTPases # Organism: Fusobacterium nucleatum # 1 440 1 440 440 848 98.0 0 MKPIVAIVGRPNVGKSTLFNNLVGDKIAIVDDLPGVTRDRLYRDTEWSGSEFVIVDTGGL EPRNNDFLMAKIKEQAEVAMNEADVILFVVDGKSGLNPLDEEIAYILRKKNKPVILCVNK IDNFFEQQDDVYDFYGLGFEYLVPISGEHKVNLGDMLDIVVDIIGKMDFPEEDEEVLKLA VIGKPNAGKSSLVNKLSGEERTIVSDIAGTTRDAIDTLIEYKDNKYMIIDTAGIRRKSKV EESLEYYSVLRALKAIKRADVCILMLDAKEGLTEQDKRIAGIAHEELKPIIIVMNKWDLV ENKNNATMKKMKEELYAELPFLSYAPIEFISALTGQRTTNLLEISDRIYEEYTKRISTGL LNTILKDAVLMNNPPTRKGRVIKINYATQVSVAPPRFVLFCNYPELIHFSYARYIENKFR EAFGFDGSPIMISFENKSSD >gi|296154796|gb|ADVK01000022.1| GENE 46 43428 - 43835 524 135 aa, chain - ## HITS:1 COG:no KEGG:FN0169 NR:ns ## KEGG: FN0169 # Name: not_defined # Def: coproporphyrinogen III oxidase # Organism: F.nucleatum # Pathway: not_defined # 1 135 1 135 135 219 97.0 2e-56 MTHDFNILGEEEKIYIDDNLILYIIETLEWVDTFYFLKTREKRKGLNYYGNTYFEGDNVK KLKRILFYWKELFGEARDKFEIASDFEIIKTYDLDKDKYIKQKLNKNEVIEDLEKLISLC ERAERENKIIEHWGI >gi|296154796|gb|ADVK01000022.1| GENE 47 43851 - 45473 1319 540 aa, chain - ## HITS:1 COG:no KEGG:FN0289 NR:ns ## KEGG: FN0289 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 288 539 51 306 308 140 39.0 2e-31 MKKKEIKLNHFNIYAVIAIIVLMYFSIKCIQFYFELEMPKGILETIIEIKNIFILNEKFE LDKLLENLLFLLLVFIPPLLEYLFVKKIYKICNYFLSEEKIIISDEHFYYTRKLAIINFE KFEINLNEIKRITKIPMKIPNRFSTNIPALATLWYFKEQKRILIKDKNGKEYKIWNIPVR 
KFSPSTYYGTPKDNADLYIKELREHLKLEEENIEDSQKTESLNVEMKKLIYSHPDLSEKK ESFFILFFFQIFLIFIFLVVFSEGITVFYEGGIEILFFIVMGIACILITYFIIKAMKNAI IYFFPYEEYEIIEDRLYYKKKLKLFSKSFIMEKFDVSLKDIESISSLAPKISYMGLKSLD DFKPCKRIHIRLKNGKGYEVCNFSKNPYNYDFFRNTNKVLEMEFKEIFNNIKSFIENGEI KYNFEKQLEEIKSNYNLEKSERYNFILNKIIEKEKLYLYKNEEKFIVNAEEIAIKNLEIF KTMNFEEMDFYEFYVDYLSKKENQDKKVLVGFNGADGKEVTMSKLKDDINKIRDSKSTFI >gi|296154796|gb|ADVK01000022.1| GENE 48 45490 - 45603 181 37 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MEIEIKEKIDTSLPSENNVDISSLGKLLGIEILKQIL >gi|296154796|gb|ADVK01000022.1| GENE 49 45623 - 45982 379 119 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328067|ref|ZP_06870601.1| ## NR: gi|296328067|ref|ZP_06870601.1| hypothetical protein HMPREF0397_0794 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_0794 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 119 1 119 119 110 100.0 4e-23 MELIILTNISIILSLCLILIFINKLEEEKELSLKTIIVTIIIILFIVNCAYYLAEHKSSL LFHFNIFIIVAYIILIITGLFLAISKSKTSYLKYILFGILFLIVPVYAIMMMAVGAMPI >gi|296154796|gb|ADVK01000022.1| GENE 50 45996 - 46280 502 94 aa, chain - ## HITS:1 COG:no KEGG:FN0165 NR:ns ## KEGG: FN0165 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 20 94 1 75 75 119 97.0 6e-26 MLEEKLLKKIKTINENFINLGFDLEEDLIELVTQREDIKDRIENTKYKKMTFSKDEEANS YILNLEDCQISFDIIEGEDEQGPWFEVECNIIFF >gi|296154796|gb|ADVK01000022.1| GENE 51 46332 - 46511 159 59 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MELVILIVVGSVISIIVNLFLLIFKKKSIYFKYILLSLLSLVVGSFIFMMIALQALPNG >gi|296154796|gb|ADVK01000022.1| GENE 52 46639 - 47136 595 165 aa, chain - ## HITS:1 COG:SP0950 KEGG:ns NR:ns ## COG: SP0950 COG0454 # Protein_GI_number: 15900828 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Streptococcus pneumoniae TIGR4 # 19 144 23 146 166 63 29.0 1e-10 MDVIKLISVSNNELKNEALNIYLKNDYYFNKMSDNPPSISNVEKDIEAIPNGVQKNQKNY 
RLISFNDEILGVVDYLTDYPEKNTILIGFFIIKNDKQKQGLGTKIFGYLEKSFKNKKFLK IRIGVLVDNEIGLSFWKKQNFKEIERKFLKFEKSEKEVIVMEKEI >gi|296154796|gb|ADVK01000022.1| GENE 53 47140 - 47253 145 37 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MTEKEIELSMDKALKKLPFEIKKIKFVLAVIKFFKRK >gi|296154796|gb|ADVK01000022.1| GENE 54 47265 - 48170 1223 301 aa, chain - ## HITS:1 COG:FN0164 KEGG:ns NR:ns ## COG: FN0164 COG3023 # Protein_GI_number: 19703509 # Func_class: V Defense mechanisms # Function: Negative regulator of beta-lactamase expression # Organism: Fusobacterium nucleatum # 14 301 1 288 288 562 100.0 1e-160 MKKILAFISLLFLMVACSSTEEVKSTGNRNINRGSNISRSQTTIRNIGKFQVDSNSYVAT GKNERIQFIIVHYTATDNAGAIKELTSSRVSSHFLVLDEDDNKIYSLVPLEQRAWHAGAS AFRGRTNINDTSVGIEIVSEGIAKEFVPDPNPYHPYDHYVDYKPIQIEKTAQIIKYVAEK YNIPARNILAHSDIAPSRKKDPGAKFPWKELYDKYNIGAWYDEADKQEFMDEEKFNATSI REIKDELRKYGYEINRFDEWDKESKDVVYAFQLHFNPKNPTGEMDLETFAILKALNKKYP E >gi|296154796|gb|ADVK01000022.1| GENE 55 48229 - 49191 1295 320 aa, chain - ## HITS:1 COG:FN0163 KEGG:ns NR:ns ## COG: FN0163 COG0646 # Protein_GI_number: 19703508 # Func_class: E Amino acid transport and metabolism # Function: Methionine synthase I (cobalamin-dependent), methyltransferase domain # Organism: Fusobacterium nucleatum # 12 320 1 309 309 596 99.0 1e-170 MKKMMFELEKELEERILVLDGAMGTVLQKYELTLEDFNGAKGCYEILNETRPDIIFEVHK KYIEAGADIIETNSFNCNAISLKDYHLEDKVYDLAKKSAEIARDAVKKSGKKVYVFGSVG PTNKGLSFSVGDVPFKRAVSFDEMKEVIKVQVAGLIDGGVDGILLETIFDGLTAKAALLA IEEVFEEKNIKLPVSISATVNKQGKLSTGQSIESLMVDLDRDFVISFGFNCSFGAKNLVP LVIKIKELTTKFVSLHANAGLPNQNGEYEETAQKMRDDLLPLIENQAINILGGCCGTDYE HIKLIAELVKGQKPRILPKR >gi|296154796|gb|ADVK01000022.1| GENE 56 49212 - 50552 1638 446 aa, chain - ## HITS:1 COG:FN0162 KEGG:ns NR:ns ## COG: FN0162 COG0534 # Protein_GI_number: 19703507 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 446 1 446 446 771 99.0 0 MNSIGNVGKKTLISLTIPIFLELLLVTVVGNIDTIMLGYYSDEAVGTIGGITQLLNIQNV 
IFSFINMATSILTAQFLGAKDYKRVKQVISVSLVLNILLGVVLGGIYLFFWRGLLQKMNL PEELVGIGKYYFQLVGGLCIFQGIILSCGAILKSHGRATETLIINVGVNILNILGNALFI FGWLGMPVLGPTGVGISTVISRGIGCVVAFYMMCKYCNFTFRKKYIKPFPFKIVKNILSI GLPTAGENLSWNMGQLMIVAMVNTMGTTIIASRTYLMLISSFIMTLSIALGQGTAIQIGH LVGAGEIKEVYNKCLKSVKIAFIFAFVATSIVCLLRKPIMNIFTTNPDILKASLKIFPLM IILEMGRVFNIVIINSLHAAGDIKFPMFMGIIFVFTIAVLFSYIFGISLGWGLVGIWIAN AMDEWIRGIAMYFRWKSKKWQNKSFV >gi|296154796|gb|ADVK01000022.1| GENE 57 50668 - 51717 669 349 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764769|ref|ZP_02171823.1| ribosomal protein L18 [Bacillus selenitireducens MLS10] # 9 349 16 358 360 262 37 4e-69 MTKQDLMDIIVKVAPGSPLREGVDYILDAGIGALIIIGYDDEVEKVRDGGFLIDCDYTPE RIFELSKMDGAIILNDDCSKILYANVHVQPDNSYSTTESGTRHRTAERAAKHLKREVVAI SERKKNVTLYKGNLKYRLKNFDELNIEVGQVLKTLESYRHVLNRSLDSLTILELDDLVTV LDVANTLQRFEMVRRISEEITRYLLELGSRGRLVNMQVSELIWDLDEEEESFLKDYIDDE TDTDSVRRYLRSLSDSELLEVENVVVALGYSKSSSVFDNKIAAKGYRVLEKISKLTKKDV EKIVNTYKDISEIQEVTDEDFSAIKISKFKIKALRAGINRLKFTIEMQR >gi|296154796|gb|ADVK01000022.1| GENE 58 51710 - 53068 1780 452 aa, chain - ## HITS:1 COG:FN0157 KEGG:ns NR:ns ## COG: FN0157 COG1066 # Protein_GI_number: 19703502 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Predicted ATP-dependent serine protease # Organism: Fusobacterium nucleatum # 1 452 1 452 452 790 98.0 0 MVKGTVYYCSECGYKSVKWAGKCPQCGAWSSFEEVDELPKDVKKSKNSDIKVYEFKDVEY TSEDRYKTKYEEFDRLLGGGLLKGEVVLVTGNPGIGKSTLLLQVANSYKDYGDVLYISGE ESPAQIKNRGERLKISGEGIYIMAEMDILNIYEYVVSKKPKVVVVDSIQTLYNSSMDSIS GTPTQIRECTLKIIELAKKYNISFFIVGHITKDGKVAGPKLLEHMVDAVFNFEGDEGLYY RILRSIKNRFGSTNEIAVFSMEENGMREIKNSSEYFLSEREEKNIGSMVVPILEGTKVFL LEVQSLITDSGIGIPKRVVQGYDRNRIQILTAIAEKKLYVPLGMKDLFVNVPGGLAIEDP AADLAVLMSILSVHKGFAISQKIAAIGELGLRGEIRKVFFLERRLKELEKLGFTGVYVPE SNRKEIEKKKYKLKIIYLKNLDELLERMNKND >gi|296154796|gb|ADVK01000022.1| GENE 59 53072 - 53563 301 163 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764798|ref|ZP_02171851.1| ribosomal protein S19 [Bacillus selenitireducens MLS10] # 2 150 4 151 164 
120 42 2e-26 MKIGVYAGSFDPITKGHQDIIERALKIVDKLIVVVMNNPTKNYWFNLDERKNLISKIFEG SDSIKVDEHAGLLVDFMAKNSCNILIKGLRDVKDFSEEMTYSFANKKLSNGEVDTIFIPT SEKYTYVSSTFVKELAFYNQSLTGYVDDKVIADILNRAKEYRG >gi|296154796|gb|ADVK01000022.1| GENE 60 53584 - 54825 1241 413 aa, chain - ## HITS:1 COG:no KEGG:FN0155 NR:ns ## KEGG: FN0155 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 413 1 413 413 602 98.0 1e-170 MIKKIIFYLVFSVISLAQQIELKSIEKTIIANGQNYTTTLSQSYDEKNKKLEVLYIEKGD YPFGTKEIIQFDTEGKKELSKEKFKYNISTGNWNKDYKSVTTYEKNKKIEETYMAEENRW TGYMKYESENTDNSETFIIYNFKNKKWNPFSKTYTLLNENKKNNIIETYNWDINKQKWDL ESKSVYTYNQEGKLEETVDYKKENNWIADQKLKYYTDDKGNQVCFNLFFQNGKWIEQDKT ISEIDKVNNKKVAITQQLNKETKQLENTRRFIQTYKKDMLEQEVQYFWDKDKKEWYKKYE QNLSYDENKNLIRKQAIYDDDSGVQFTYKFDKNGNNIEILLEHLNSQTKLWGAHEKIEYL YDLSIMKDKVLDRANINDENEISVNLILEKKYYLYDGKKWILTEKSRYLYDKK >gi|296154796|gb|ADVK01000022.1| GENE 61 54850 - 56226 1652 458 aa, chain - ## HITS:1 COG:FN0154 KEGG:ns NR:ns ## COG: FN0154 COG1530 # Protein_GI_number: 19703499 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribonucleases G and E # Organism: Fusobacterium nucleatum # 1 458 1 458 458 697 98.0 0 MKKYLILSKSVYETKLALLEDNKLDEIYIERNNQKEITGNIYKGKVVDILNNSEIIFVDI GLDKNALLSFENKKNIPKFNIDDKLIVQTETEPRDEKGAKLTLDYSINGENLVLLPKSKN LSISKKIKDIEEVNRLKNIFLGIDNGLILKTNSEGKSEESLLEEYKKLKNIENRINRDFE KINIGLLYDVNSILKKAESLLDDTIEEFIIDDKNIFEEIKVLLEETGKKDLIKKLRKYFK GEEIFEYYNINSQIERALDRKVYLDSGAYIIIEKTEALISIDVNTGQNTGNKTSQELIFQ TNLEATKEIARQIKLRNLAGIIIIDFIDMKKISDRKRVLEEFKKYLNKDRIEINSLEYTN LGLIQFTRKRQGKELAFYYKEKCQYCEGTGYFLSKDRIILNLLEDLNAQIKSQDIKKIVI RTKKDIIKELNKYIDNNKIEYIEDNNFYKIGYKMELYN >gi|296154796|gb|ADVK01000022.1| GENE 62 56201 - 57247 971 348 aa, chain - ## HITS:1 COG:FN0153 KEGG:ns NR:ns ## COG: FN0153 COG1243 # Protein_GI_number: 19703498 # Func_class: K Transcription; B Chromatin structure and dynamics # Function: Histone acetyltransferase # Organism: Fusobacterium nucleatum # 1 348 1 348 348 645 96.0 0 
MKHYNIPVFISHFGCPNACVFCNQKKINGRETDVSLDDLKNIIDSYLKTLPKNSIKQVAF FGGTFTGISINLQKEYLEVVKKYIDNNDVEGVRISTRPECIDDEILTQLKKYGVKTIELG IQSLDDKVLKATGRNYTYDIVKKSCDLIKSYGFELGVQLMIGLPKSDFKSDLQSAMKSLD LNPDIARIYPTLVIKGTELEFMYKKNLYQSLSIEEAVERTVPIYSLLELKNINVIRVGLQ PAEDLTADGVIISGPFHPAFRDLIENKIYFNFLSKFYDKDRKLDIEVNERNVSKIVGQKA INKKTFYPNFKILINNNLSLDELIINSKKYTRKEILEGEFNEKISDFI >gi|296154796|gb|ADVK01000022.1| GENE 63 57234 - 57938 760 234 aa, chain - ## HITS:1 COG:FN0152 KEGG:ns NR:ns ## COG: FN0152 COG0571 # Protein_GI_number: 19703497 # Func_class: K Transcription # Function: dsRNA-specific ribonuclease # Organism: Fusobacterium nucleatum # 1 234 1 234 234 408 98.0 1e-114 MKNLLDLEHKLNYYFNDRNLLKNALLHKSLGNERKEYKNQNNERLELLGDAVLDLIVAEY LYKNYKNASEGTIAKLKAMIVSEPILAKISRQIGVGKFLMLSRGEVMSGGRNRESILADS FEAILGAVYIDSNLDEARVFALSHIKQYIDHIEENEDILDFKSILQEYVQKEFKTVPTYE LVAERGPDHMKEFEIQVIVGNYKEKAVAKNKKKAEQLSAKALCIKLGVKYNEAL >gi|296154796|gb|ADVK01000022.1| GENE 64 57954 - 59195 2014 413 aa, chain - ## HITS:1 COG:FN0151 KEGG:ns NR:ns ## COG: FN0151 COG0304 # Protein_GI_number: 19703496 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: 3-oxoacyl-(acyl-carrier-protein) synthase # Organism: Fusobacterium nucleatum # 1 413 1 413 413 770 99.0 0 MKRVVVTGLGLISSLGIGLEESWKKLIDGETGIDLITSYDTTDQPVRIAGEVKGFEPTDY GIEKKEVKKLARNTQFALVATKMALEDANFKIDETNADDVGVLVSAGVGGIEIMEEQYKN MLEKGPKRISPFTIPAMIENMAAGNIAIYYGAKGPNKSIVTACASGTHSIGDGFDLIRHG RAKAMIVGGTEASVTKFCINSFANMKALSTRNETPKTASRPFSKDRDGFVMGEGAGILIL EELESALTRGAKIYAEMVGYGETCDANHITAPIETGEGATKAMRAALKDANIPLEDVTYI NAHGTSTPTNDVVETRAIKALFGDKAKDLYISSTKGATGHGLGAAGGIEGVIIAKAIADG IIPPTINLHETEEECDLNYVPNKAIKTDVKVAMSNSLGFGGHNSVIVMKKFEK >gi|296154796|gb|ADVK01000022.1| GENE 65 59286 - 59513 471 75 aa, chain - ## HITS:1 COG:FN0150 KEGG:ns NR:ns ## COG: FN0150 COG0236 # Protein_GI_number: 19703495 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism # Function: Acyl 
carrier protein # Organism: Fusobacterium nucleatum # 1 75 1 75 75 106 100.0 9e-24 MLDKVKEIIVEQLGVDADQIKPESNFVDDLGADSLDTVELIMSFEEEFGVEIPDTEAEKI KTVQDVINYIEANKK >gi|296154796|gb|ADVK01000022.1| GENE 66 59581 - 60480 1530 299 aa, chain - ## HITS:1 COG:FN0149 KEGG:ns NR:ns ## COG: FN0149 COG0331 # Protein_GI_number: 19703494 # Func_class: I Lipid transport and metabolism # Function: (acyl-carrier-protein) S-malonyltransferase # Organism: Fusobacterium nucleatum # 1 299 1 299 299 530 98.0 1e-150 MGKVAFVYPGQGTQYIGMGKELYENNNKAKELFDKIFNSLDIDLKNVMFEGPEDLLKRTD YTQPAIVSLSLVLTELLKEKGIEPDYVAGHSVGEFAAFGGANYLSIEDAVKLVAARGRIM REVAEKVNGSMAAVLGMDVEKIKEVLKSVDGVVEAVNFNEPNQTVIAGEKEAVEKACVAL KEAGAKRALPLAVSGPFHSSLMKEAGEQLKEEAKKYTFNVGKIKIIANTTAEPLETDSEV KDEIYRQSFGPVKWVDTINKLKSLGVTKIYEIGPGKVLSGLIKKIDKEIEVENIELIEA >gi|296154796|gb|ADVK01000022.1| GENE 67 60520 - 61506 1325 328 aa, chain - ## HITS:1 COG:FN0148 KEGG:ns NR:ns ## COG: FN0148 COG0332 # Protein_GI_number: 19703493 # Func_class: I Lipid transport and metabolism # Function: 3-oxoacyl-[acyl-carrier-protein] synthase III # Organism: Fusobacterium nucleatum # 1 328 1 328 328 633 99.0 0 MQSIGIKGMGYYVPENVFTNFDFEKIIDTSDEWIRTRTGIIERRFASKDQATSDLATEAS LKAIKNAKISKEDVDMIILATTTADYIAQGAACIVQNKLGLKKIPCFDLNAACTGFIYGL EVAYSLVKSGLYKNILVIGAETLSRIIDMQNRNTCVLFGDGAAAAIVGEVEKGYGFLGFS IGAEGEDNMILKVPAGGSKKPNNEETIKNRENFVIMKGQDVFKFAVSTLPKVTSDALEKA KLKVNDLSMVFPHQANLRIIESAAKRMKFPLEKFYMNLSRYGNTSSASVGIALGEAIEKG LVKKGDNIALTGFGGGLTYGSTIIKWAY >gi|296154796|gb|ADVK01000022.1| GENE 68 61506 - 62504 1276 332 aa, chain - ## HITS:1 COG:FN0147 KEGG:ns NR:ns ## COG: FN0147 COG0416 # Protein_GI_number: 19703492 # Func_class: I Lipid transport and metabolism # Function: Fatty acid/phospholipid biosynthesis enzyme # Organism: Fusobacterium nucleatum # 1 332 1 332 332 577 99.0 1e-164 MKIALDAMSGDFAPISTVKGAIEALEEIEGLQVILVGKEGIIKEELKKYKYDTNRIEIKN ADEVIVMTDDPVKAVREKKDSSMNVCIDLVKEKLAQASVSCGNTGALLASSQLKLKRIKG VLRPAIAVLFPNKKDSGTLFLDLGANSDSKPEFLNQFATMGSKYMEIFSGKKNPKVALLN 
IGEEETKGNELTRETYTLLKENKDIDFYGNIESTKIMEGDVDVVVTDGYTGNILLKTSEG IGKFIFHIVKESIMESWISKLGALLVKGAMKKVKKKTEASEYGGAIFLGLSELSLKAHGN SDSRAIKNALKVASKFIELNFIEELRKTMEVE Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:29:58 2011 Seq name: gi|296154758|gb|ADVK01000023.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00028, whole genome shotgun sequence Length of sequence - 34454 bp Number of predicted genes - 41, with homology - 37 Number of transcription units - 11, operones - 10 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 126 - 181 9.6 1 1 Op 1 . - CDS 182 - 592 573 ## PROTEIN SUPPORTED gi|19703772|ref|NP_603334.1| 50S ribosomal protein L19 2 1 Op 2 . - CDS 621 - 773 163 ## FN0429 hypothetical protein - Prom 804 - 863 5.8 3 2 Op 1 1/0.000 - CDS 869 - 1645 908 ## COG1387 Histidinol phosphatase and related hydrolases of the PHP family 4 2 Op 2 9/0.000 - CDS 1642 - 2256 990 ## COG0461 Orotate phosphoribosyltransferase 5 2 Op 3 . - CDS 2249 - 2962 1126 ## COG0284 Orotidine-5'-phosphate decarboxylase 6 2 Op 4 . - CDS 2971 - 3651 802 ## FN0425 putative cytoplasmic protein 7 2 Op 5 13/0.000 - CDS 3648 - 4562 1366 ## COG0167 Dihydroorotate dehydrogenase 8 2 Op 6 3/0.000 - CDS 4574 - 5353 1268 ## COG0543 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases - Prom 5405 - 5464 3.9 9 2 Op 7 24/0.000 - CDS 5478 - 8654 4598 ## COG0458 Carbamoylphosphate synthase large subunit (split gene in MJ) 10 2 Op 8 7/0.000 - CDS 8669 - 9745 1621 ## COG0505 Carbamoylphosphate synthase small subunit 11 2 Op 9 15/0.000 - CDS 9763 - 11040 1657 ## COG0044 Dihydroorotase and related cyclic amidohydrolases 12 2 Op 10 8/0.000 - CDS 11054 - 11944 1179 ## COG0540 Aspartate carbamoyltransferase, catalytic chain - Prom 11964 - 12023 2.1 13 2 Op 11 . - CDS 12031 - 12552 679 ## COG2065 Pyrimidine operon attenuation protein/uracil phosphoribosyltransferase - Prom 12770 - 12829 10.4 + Prom 12807 - 12866 10.5 14 3 Tu 1 . 
+ CDS 12896 - 13531 686 ## FN0413 hypothetical protein - Term 13535 - 13581 9.2 15 4 Op 1 1/0.000 - CDS 13596 - 14189 699 ## COG0353 Recombinational DNA repair protein (RecF pathway) 16 4 Op 2 1/0.000 - CDS 14202 - 14498 496 ## COG2926 Uncharacterized protein conserved in bacteria 17 4 Op 3 5/0.000 - CDS 14519 - 15487 1481 ## COG0205 6-phosphofructokinase 18 4 Op 4 10/0.000 - CDS 15520 - 16461 1761 ## COG0825 Acetyl-CoA carboxylase alpha subunit 19 4 Op 5 . - CDS 16473 - 17387 1270 ## COG0777 Acetyl-CoA carboxylase beta subunit - Prom 17524 - 17583 13.4 + Prom 17526 - 17585 18.3 20 5 Op 1 . + CDS 17610 - 18134 901 ## FN0407 hypothetical protein + Term 18149 - 18201 9.1 21 5 Op 2 1/0.000 + CDS 18214 - 19278 1459 ## COG0787 Alanine racemase 22 5 Op 3 . + CDS 19346 - 20323 1175 ## COG0180 Tryptophanyl-tRNA synthetase + Term 20333 - 20367 6.2 - Term 20323 - 20353 3.6 23 6 Op 1 1/0.000 - CDS 20360 - 21892 2593 ## COG0008 Glutamyl- and glutaminyl-tRNA synthetases - Prom 21912 - 21971 2.3 24 6 Op 2 . - CDS 21977 - 22903 1245 ## COG1186 Protein chain release factor B - Prom 22923 - 22982 2.0 25 7 Op 1 . - CDS 23004 - 23078 73 ## 26 7 Op 2 1/0.000 - CDS 23098 - 23466 547 ## COG0736 Phosphopantetheinyl transferase (holo-ACP synthase) 27 7 Op 3 . - CDS 23463 - 24221 1057 ## COG0084 Mg-dependent DNase - Prom 24247 - 24306 8.4 - Term 24255 - 24316 8.5 28 8 Op 1 . - CDS 24389 - 24694 346 ## gi|296328114|ref|ZP_06870647.1| 50S ribosomal protein L4 - Prom 24750 - 24809 3.0 29 8 Op 2 . - CDS 24815 - 24886 83 ## - Prom 25074 - 25133 11.1 30 9 Op 1 . - CDS 25143 - 25970 840 ## COG0596 Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) 31 9 Op 2 . - CDS 25990 - 26067 177 ## - Term 26072 - 26121 9.1 32 10 Op 1 1/0.000 - CDS 26132 - 26836 1036 ## COG1359 Uncharacterized conserved protein - Prom 26858 - 26917 8.9 33 10 Op 2 36/0.000 - CDS 26958 - 27629 303 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 34 10 Op 3 . 
- CDS 27638 - 28843 1618 ## COG0577 ABC-type antimicrobial peptide transport system, permease component 35 10 Op 4 . - CDS 28847 - 29284 599 ## FN1350 integral membrane protein - Prom 29346 - 29405 8.2 36 11 Op 1 . - CDS 29438 - 29860 683 ## COG4939 Major membrane immunogen, membrane-anchored lipoprotein 37 11 Op 2 . - CDS 29857 - 29976 172 ## 38 11 Op 3 36/0.000 - CDS 29994 - 30668 287 ## PROTEIN SUPPORTED gi|225084369|ref|YP_002657150.1| ribosomal protein S16 39 11 Op 4 10/0.000 - CDS 30672 - 31874 1668 ## COG0577 ABC-type antimicrobial peptide transport system, permease component 40 11 Op 5 4/0.000 - CDS 31884 - 33164 1660 ## COG0577 ABC-type antimicrobial peptide transport system, permease component 41 11 Op 6 . - CDS 33167 - 34453 1182 ## COG4393 Predicted membrane protein Predicted protein(s) >gi|296154758|gb|ADVK01000023.1| GENE 1 182 - 592 573 136 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19703772|ref|NP_603334.1| 50S ribosomal protein L19 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 21 136 1 116 116 225 100 3e-58 MVKNSIFRFNKLNEYLEEEEMKEKLIELVEKEYLRSDIPQFKAGDTIGVYYKVKEGNKER VQLFEGVVIRVNGGGVAKTFTVRKVTAGIGVERIIPVNSPNIDRIEVLKVGRVRRSKLYY LRGLSAKKARIKEIVK >gi|296154758|gb|ADVK01000023.1| GENE 2 621 - 773 163 50 aa, chain - ## HITS:1 COG:no KEGG:FN0429 NR:ns ## KEGG: FN0429 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 50 58 107 107 75 98.0 7e-13 MRLSKMEDTESLNKFIFDFKEKYINKKLNKEFYQRLIPKMDNIELLKNLY >gi|296154758|gb|ADVK01000023.1| GENE 3 869 - 1645 908 258 aa, chain - ## HITS:1 COG:FN0428 KEGG:ns NR:ns ## COG: FN0428 COG1387 # Protein_GI_number: 19703770 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Histidinol phosphatase and related hydrolases of the PHP family # Organism: Fusobacterium nucleatum # 1 258 1 258 258 421 100.0 1e-118 MIFDQHVHSSFSFDSNEDLENYINVSNENDIITTEHLDFENPIINYNDSLIDYLKYIGQI 
KNLNKKYPNKFFSGIEIGYTQKTEKRVEDFLRNKNFNLKLLSIHQNGIYDFMCINKKLIQ LENLVKEYFELMIQALESSIKFNVLAHFEYGLRIIDISVVEFDNLASTFLNKIVELIVKK EIAFEVNTKSMYKYKKENLYSYMIEKYIKKGGRLFTLGSDAHNIKEYAYKFDEAKKFLLS KNIKEIILFKDKAIMKKL >gi|296154758|gb|ADVK01000023.1| GENE 4 1642 - 2256 990 204 aa, chain - ## HITS:1 COG:FN0427 KEGG:ns NR:ns ## COG: FN0427 COG0461 # Protein_GI_number: 19703769 # Func_class: F Nucleotide transport and metabolism # Function: Orotate phosphoribosyltransferase # Organism: Fusobacterium nucleatum # 1 204 8 211 211 395 99.0 1e-110 MNREIINALLDIKAVELRVDKENWFTWASGIKSPIYCDNRLTMSYPKIRKQIAEGFVKKI KELYPNVDYIVGTATAGIPHAAWISDIMDLPMLYVRGSAKDHGKTNQIEGKFEKGKKVVV IEDLISTGKSSVLAAQALQEEGLEVLGVIAIFSYNLNKAKEKFDEAKIPFSTLTNYDVLL ELAKETGLIGDKENQILIDWRNNL >gi|296154758|gb|ADVK01000023.1| GENE 5 2249 - 2962 1126 237 aa, chain - ## HITS:1 COG:FN0426 KEGG:ns NR:ns ## COG: FN0426 COG0284 # Protein_GI_number: 19703768 # Func_class: F Nucleotide transport and metabolism # Function: Orotidine-5'-phosphate decarboxylase # Organism: Fusobacterium nucleatum # 1 237 1 237 237 424 98.0 1e-118 MKKEVIIALDFPTLEKTLEFLDKFKEEKLFVKVGMELYLQNGPIVIDEIKKRGHKIFLDL KLHDIPNTVYSAAKGLAKFNIDILTVHAAGGSEMLKGAKRAMTEAGVNTKVIAITQLTST SEEDMRKEQNIQTSIEESVLNYARLAKESGVDGVVSSVLETKKIREQSGEDFIIINPGIR LAEDSKGDQKRVATPIDANRDGASYIVVGRSITRNENPEERYRLIKNMFEMGDKYVE >gi|296154758|gb|ADVK01000023.1| GENE 6 2971 - 3651 802 226 aa, chain - ## HITS:1 COG:no KEGG:FN0425 NR:ns ## KEGG: FN0425 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 220 1 220 221 345 99.0 9e-94 MKNIFKDYLDIFEKYPKDEYLTKEERKERYKLLQEYEKRNYQDEISIDEFQNFTSLYIDK IDISSQFIGKFLKVLKNNINNGGTFALKFLIGDKDENDYYLKFFKLLYDELGDRINLINK LLEKEPDYLPAIKQKYKILSNYVDFSIHEMPWGILLDKPISEKDAKIEALADLDNFSKLS KKLGKDNEKYIEECRIYYNAWFDFLDNKDKYKSYEEYLEKNNILRK >gi|296154758|gb|ADVK01000023.1| GENE 7 3648 - 4562 1366 304 aa, chain - ## HITS:1 COG:FN0424 KEGG:ns NR:ns ## COG: FN0424 COG0167 # Protein_GI_number: 19703766 # Func_class: F Nucleotide transport 
and metabolism # Function: Dihydroorotate dehydrogenase # Organism: Fusobacterium nucleatum # 1 304 1 304 304 570 99.0 1e-162 MSERLRIQIPGLDLKNPIMPASGCFAFGIEYAELYDISKLGAIMIKAATKEARFGNPTPR VAETSSGMLNAIGLQNPGVDEIISNQLKKLEKYDVPIIANVAGSDIEDYVYVADKISKSP NVKALELNISCPNVKHGGIQFGTDPNVARNLTEKVKAVSSVPVYVKLSPNVTDIVAMAKA VETGGADGLTMINTLVGIVLDRKTGKPIIANTTGGLSGPAIKPVAIRMVYQVAQAVNIPI IGMGGVMDEWDVIDFISAGASAVAVGTANFTDPFVCPKIIDSLELALDKLGVNHILDLKG RAFR >gi|296154758|gb|ADVK01000023.1| GENE 8 4574 - 5353 1268 259 aa, chain - ## HITS:1 COG:FN0423 KEGG:ns NR:ns ## COG: FN0423 COG0543 # Protein_GI_number: 19703765 # Func_class: H Coenzyme transport and metabolism; C Energy production and conversion # Function: 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases # Organism: Fusobacterium nucleatum # 1 259 1 259 259 484 96.0 1e-137 MKIEDCTVEENVQIAKDTYKMKIKGNFVKECRTPGQFVNIRIGDGREHVLRRPISISEID RGENLVTIIYRTVGEGTKFMANIKKGNEIDVMGPLGRGYDVLSLTKEQTALLVGGGIGVP PLYELAKQFNQRGIKTITILGFNSKDEVFYEDEFKKFGETYVSTVDGSVGTKGFVTDVIK KLQEENKLVFNKYYSCGPVPMLKALVNTVGEDGYISLENRMACGIGACYACVCKKKKKDD YTRVCYDGPVYLASDVEIE >gi|296154758|gb|ADVK01000023.1| GENE 9 5478 - 8654 4598 1058 aa, chain - ## HITS:1 COG:FN0422 KEGG:ns NR:ns ## COG: FN0422 COG0458 # Protein_GI_number: 19703764 # Func_class: E Amino acid transport and metabolism; F Nucleotide transport and metabolism # Function: Carbamoylphosphate synthase large subunit (split gene in MJ) # Organism: Fusobacterium nucleatum # 1 1058 6 1063 1063 2081 99.0 0 MPKRKDIKTILVIGSGPIIIGQAAEFDYAGTQACLSLREEGYEVILVNSNPATIMTDKEI ADKVYIEPLTVEFLSKIIRKEKPDALLPTLGGQVALNLAVSLHESGILDECGVEILGTKL TSIKQAEDRELFRDLMNELNEPVPDSAIVHTLEEAENFVKEIDYPVIVRPAFTMGGTGGG ICYNEEDLHEIVPNGLNYSPVHQCLLEKSIAGYKEIEYEVMRDSNDTAIVVCNMENIDPV GIHTGDSIVVAPSQTLTDREHHMLRDVSLKIIRALKIEGGCNVQIALDPNSFKYYIIEVN PRVSRSSALASKATGYPIAKIAAKIAVGMTLDEIINPVTKSSYACFEPAIDYVVTKIPRF PFDKFGDGDRYLGTQMKATGEVMAIGRTLEESLLKAIRSLEYGVHHLGLPNGEEFSLEKI IKRIKLAGDERLFFIGEALRRDVSIEEIHEYTKIDLFFLNKMKNIIDLEHLLKDNKGNIE 
LLRKVKTFGFSDRVIAHRWEMTETEITELRHKHNIRPVYKMVDTCAAEFDSNTPYFYSTY EFENESTRSDKEKIVVLGSGPIRIGQGIEFDYATVHAIMAIKKLGYEAIVINNNPETVST DFSISDKLYFEPLTQEDVMEILDLEKPLGVVVQFGGQTAINLADKLVKNGIQILGSSLDS IDTAEDRDRFEKLLIGLKIPQPLGKTAFDVETALKNANEIGYPVLVRPSYVLGGRAMEIV YNDEDLTKYMEKAVHINPDHPVLIDRYLIGKEIEVDAISDGENTFIPGIMEHIERAGVHS GDSISIYPPQSLSEKEIETLIDYTKKLASGLEVKGLINIQYVVSKGEIYVLEVNPRASRT VPFLSKVTGVPVANIAMQCILGKKLKDLGFTKDIADIGNFVSVKVPVFSFQKLKNVDTTL GPEMKSTGEVIGTDVNLQKALYKGLTAAGIKIKDYGRVLFTIDDKNKEAALNLAKGFSDV GFSILTTEGTGIYFEEYGLKVKKVGKIDNSDYSVLDAIQNGDVDIVINTTTKGKSSEKDG FRIRRKATEYGVICFTSLDTANALLRVIESMSFRVQTL >gi|296154758|gb|ADVK01000023.1| GENE 10 8669 - 9745 1621 358 aa, chain - ## HITS:1 COG:FN0421 KEGG:ns NR:ns ## COG: FN0421 COG0505 # Protein_GI_number: 19703763 # Func_class: E Amino acid transport and metabolism; F Nucleotide transport and metabolism # Function: Carbamoylphosphate synthase small subunit # Organism: Fusobacterium nucleatum # 1 358 1 358 358 724 99.0 0 MYNRQLILEDGTVYKGYAFGADVENVGEVVFNTSMTGYQEILSDPSYNGQIVTLTYPLIG NYGINRDDFESMKPCIKGMVVKEVCTTPSNFRSEKTLDEALKDFGIPGIYGIDTRALTRK LRSKGVVKGCLVSIDRNVDEVVAELKKTVLPTNQIEQVSSKSISPALGRGRRVVLVDLGM KIGIVRELVSRGCDVIVVPYNTTAEEVLRLEPDGVMLTNGPGDPEDAKESIEMIKGIINK VTIFGICMGHQLVSLACGAKTYKLKFGHRGGNHPVKNILTGRVDITSQNHGYAVDIDSLN DTDLELTHIAINDRSCEGVRHKKYPVFTVQFHPEAAAGPHDTSYLFDEFIKNIDNNMK >gi|296154758|gb|ADVK01000023.1| GENE 11 9763 - 11040 1657 425 aa, chain - ## HITS:1 COG:FN0420 KEGG:ns NR:ns ## COG: FN0420 COG0044 # Protein_GI_number: 19703762 # Func_class: F Nucleotide transport and metabolism # Function: Dihydroorotase and related cyclic amidohydrolases # Organism: Fusobacterium nucleatum # 1 425 1 425 425 829 99.0 0 MLLKNCKILKNTKFEKVDILIRDNKIEKISENIDITDENIIDIKNRFVTAGFIDVHVHWR EPGFSKKETVYTASRAAARGGFTTVMTMPNLNPVPDSVETLNKQLEIIKKDSVIRAIPYG AITKEEYGRELSDMEAIASNVFAFTDDGRGVQSANVMYEAMLMGAKLNKAIVAHCEDNSL IRGGAMHEGKRSAELGIKGIPSICESTQIVRDVLLAEAANCHYHVCHISAKESVRAVREG KKNGIKVTCEVTPHHLLSCDEDIKEDNGMWKMNPPLRSREDRNALIVGILDGTIDIIATD 
HAPHTMEEKIRGIEKSSFGIVGSETAFAQLYTKFVKTDIFSLEMLVKLMSENVAKIFDLP YGKLEENSFADIVVIDLEKEITINSNNFLSKGKNTPYINEKINGIPVLTISNGKIAYIDK EEINL >gi|296154758|gb|ADVK01000023.1| GENE 12 11054 - 11944 1179 296 aa, chain - ## HITS:1 COG:FN0419 KEGG:ns NR:ns ## COG: FN0419 COG0540 # Protein_GI_number: 19703761 # Func_class: F Nucleotide transport and metabolism # Function: Aspartate carbamoyltransferase, catalytic chain # Organism: Fusobacterium nucleatum # 1 296 9 304 304 541 98.0 1e-154 MKNLLSMEDLTNEEILSLVKRALELKKGAENKKRNDLFVANLFFENSTRTKKSFEVAEKK LNLNVVDFEVSTSSVQKGETLYDTCKTLEMIGINMLVIRHSENEYYKQLENLKIPIINGG DGSGEHPSQCLLDIMTIYETYGKFDGLNIIIVGDIKNSRVARSNKKALTRLGAKVTFVAP EIWKDESLGEFVNFDDVIDKVDICMLLRVQHERHTDSKEKTEFSKENYHKNFGLTEERYK RLKEGAIIMHPAPVNRDVEIADSLVESEKSRIFEQMKNGMFMRQAILEYIIEKNKL >gi|296154758|gb|ADVK01000023.1| GENE 13 12031 - 12552 679 173 aa, chain - ## HITS:1 COG:FN0418 KEGG:ns NR:ns ## COG: FN0418 COG2065 # Protein_GI_number: 19703760 # Func_class: F Nucleotide transport and metabolism # Function: Pyrimidine operon attenuation protein/uracil phosphoribosyltransferase # Organism: Fusobacterium nucleatum # 1 173 5 177 177 292 100.0 2e-79 MKILLDEDGIRRSITRISYEIIERNKTVDNIVLVGIKSRGDILAERIKQKLLEVENIDAP LETIDITYYRDDIDRKNFDLDIKDTEFKTNLTGKVVVIVDDVLYTGRTIRAGLDAILSKS RPAKIQLACLIDRGHRELPIRADFIGKNIPTSHSENIEVYLKELDGKEEVVIL >gi|296154758|gb|ADVK01000023.1| GENE 14 12896 - 13531 686 211 aa, chain + ## HITS:1 COG:no KEGG:FN0413 NR:ns ## KEGG: FN0413 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 16 211 1 196 196 353 100.0 3e-96 MLRELKRELEKALNIMKPIWQRQNNTDDNSTNFIYNCNSLDEIIKVSKSKNLDLNYVLHR WYNFKTSDACEKIFEFYGAIKEKNPTHHDIDFYIANEPFDLKLTVYPYALKNTGIYYDLT KTNEKNKLIEWFYLNQSQQNRKHMKNRIFIVCDGSNNMKLKSDFSKICKEVKLWIEPYLN KEKTPSFNTLIIKDIDDKEYTVKSDIILITD >gi|296154758|gb|ADVK01000023.1| GENE 15 13596 - 14189 699 197 aa, chain - ## HITS:1 COG:FN0412 KEGG:ns NR:ns ## COG: FN0412 COG0353 # Protein_GI_number: 19703754 # Func_class: L Replication, recombination and 
repair # Function: Recombinational DNA repair protein (RecF pathway) # Organism: Fusobacterium nucleatum # 1 197 1 197 197 380 97.0 1e-106 MPTKSLERLILEFNKLPGVGQKSATRYAFHILNQSEEDVKNFAEALLAVKKNVKKCHICG NYCESDTCHICSDNARDHRIICVVEESKDIMILERTTKYRGVYHVLNGRLDPLNGITPNE LNIKSLLERIAKDDIEEIILATNPNIEGETTAMYLAKLMKNFGIKITKLASGIPMGGNLE FSDTATISRALDDRIEI >gi|296154758|gb|ADVK01000023.1| GENE 16 14202 - 14498 496 98 aa, chain - ## HITS:1 COG:FN0411 KEGG:ns NR:ns ## COG: FN0411 COG2926 # Protein_GI_number: 19703753 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 98 1 98 98 111 100.0 4e-25 MDSVLELVRKERRKNQIKREIEDNDRKIRDNRKRVELLLNLKEYLKVNMSYEEIIDIIEN MQSDYEDRVDDYIIKNAELGKERREISKTIKDFKKSVS >gi|296154758|gb|ADVK01000023.1| GENE 17 14519 - 15487 1481 322 aa, chain - ## HITS:1 COG:FN0410 KEGG:ns NR:ns ## COG: FN0410 COG0205 # Protein_GI_number: 19703752 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphofructokinase # Organism: Fusobacterium nucleatum # 1 322 8 329 329 614 100.0 1e-176 MEKKLAILTSGGDAPGMNAAIRATAKIAEYYGFEVYGIRRGYLGMLNDEIFPMTGRFVSG IIDKGGTVLLTARSEEFKEARFREIAANNLKKKGINYLVVIGGDGSYRGANLLYKEHGIK VVGIPGTIDNDICGTDFTLGFDTCLNTILDAMSKIRDTATSHERTILIQVMGRRAGDLAL HACIAGGGDGIMIPEMDNPIEMLALQLKERRKNGKLHDIVLVAEGVGNVLDIEEKLKGHI NSEIRSVVLGHIQRGGTPSGFDRVLASRMAAKAVEVLNKGEAGVMVGIEKNEMVTHPLEE ACSVDKRKSIEKDYELALLLSK >gi|296154758|gb|ADVK01000023.1| GENE 18 15520 - 16461 1761 313 aa, chain - ## HITS:1 COG:FN0409 KEGG:ns NR:ns ## COG: FN0409 COG0825 # Protein_GI_number: 19703751 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA carboxylase alpha subunit # Organism: Fusobacterium nucleatum # 1 313 1 313 313 581 99.0 1e-166 MQFEFQIEELEHKIEELKKFAEEKEVDLSDEIAKLKDQRDIALKVLYDDLTDYQRVMVSR HPERPYTLDYINYITTDFIELHGDRLFRDDPAIVGGLCKIDGKNFMIIGHQKGRTMQEKV FRNFGMANPEGYRKALRLYEMAERFKLPILTFIDTPGAYPGLEAEKHGQGEAIARNLMEM SGIKTPIVSVVIGEGGSGGALGLGVADKVFMLENSVYSVISPEGCAAILYKDPNRVEEAA 
NNLKLSSQSLLKIGLIDDIIDEALGGAHRGPEDTAFNLKNVVLEAVNELEKLPVDELVEK RYEKFRQMGVFNR >gi|296154758|gb|ADVK01000023.1| GENE 19 16473 - 17387 1270 304 aa, chain - ## HITS:1 COG:FN0408 KEGG:ns NR:ns ## COG: FN0408 COG0777 # Protein_GI_number: 19703750 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA carboxylase beta subunit # Organism: Fusobacterium nucleatum # 1 304 1 304 304 580 100.0 1e-165 MSILKNLAKNLGLTNITQPKKKYATVGEKKSDEEKERAKYKVKNIDNLKEDEVTKCPSCG VLSHKSEIRANMKMCSNCNHYFNMSARERIELLIDEGTFKEEDVTLTSANPINFPEYVEK IEKAQHDSGMNEGVISGLGEINGLKVSIACMDFNFMGGSMGSVVGEKITAALERAIEHKI PAVVVAISGGARMQEGLTSLMQMAKTSAAVKKMRLAGLPFISVPVNPTTGGVTASFAMLG DIIISEPKARIGFAGPRVIEQTIRQKLPENFQKSEFLQECGMVDVIAKREDLKATIFKVL NDII >gi|296154758|gb|ADVK01000023.1| GENE 20 17610 - 18134 901 174 aa, chain + ## HITS:1 COG:no KEGG:FN0407 NR:ns ## KEGG: FN0407 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 174 1 174 174 287 100.0 1e-76 MNKNKIILFLISLFVFTACSSIDEYLPSFLTESSTPAAIQEAVASRVNPEKEIYALASAQ ISKSGSIIAQSRANKQASETLNKEIRKEVEALYRGNLDEMDAFSKSVISPVFSDLVTYST DLAMKKVTQKGAWEDSEKIYTLLAVDRNEVTTAADKVFKKFVEDASKNLGNAAK >gi|296154758|gb|ADVK01000023.1| GENE 21 18214 - 19278 1459 354 aa, chain + ## HITS:1 COG:FN0406 KEGG:ns NR:ns ## COG: FN0406 COG0787 # Protein_GI_number: 19703748 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Alanine racemase # Organism: Fusobacterium nucleatum # 1 354 1 354 354 662 99.0 0 MRTWIEIDKENLKYNVLKLKEIANNKEVLGVVKANAYGLGSIEIAKILKEVEVKLFGVAN LEEAIELQEAGIKDKILILGASFEDELVEATKRGVHVAISSMEQLQFLVSKNLNPNIHLK FDTGMTRLGFEVDDAEKVIEYCKNNNLNLVGIFSHLSDSDGNTIETKNFTLEQIEKFKKI ANSLNLEYIHISNSAGITNFHNDILGNLVRLGIGMYSFTGNKKTPYLKNIFTIKSKILFI KKVKKDSFVSYGRHYTLPADSTYAVLPIGYADGLKKYLSKGGYVLINNHRCEIIGNICMD MTMIRVPKEIENSIKIGDEVTVINADILDNLNIPELCVWEFMTGIGRRVKRIIV >gi|296154758|gb|ADVK01000023.1| GENE 22 19346 - 20323 1175 325 aa, chain + ## HITS:1 COG:FN0405 KEGG:ns NR:ns ## COG: FN0405 COG0180 # Protein_GI_number: 19703747 # Func_class: J 
Translation, ribosomal structure and biogenesis # Function: Tryptophanyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 325 1 325 325 626 99.0 1e-179 MKRSLSGIQPSGILHIGNYFGAMKQFVDLQDSYDGFYFIADYHSLTSLTKAETLKENTYN IVLDYLAVGLDPSKSTIFLQSNVPEHTELTWLLSNITPVGLLERGHSYKDKIAKGIPSNT GLLTYPVLMAADILIYDSDVVPVGKDQKQHLEMTRDIAMKFNQQYGVEFFKLPEPLILDD SAIVPGTDGQKMSKSYNNTINMFATKKKLKEQVMSIVTDSTPLEEPKNPDNNITKIYALF NNIDKQNELKDKFLAGNFGYGHAKTELLNSILEYFGTAREKREELEKNMDYVKDVLNEGS KKARTIAIEKIKKAKEIVGLVGNIY >gi|296154758|gb|ADVK01000023.1| GENE 23 20360 - 21892 2593 510 aa, chain - ## HITS:1 COG:FN1340 KEGG:ns NR:ns ## COG: FN1340 COG0008 # Protein_GI_number: 19704675 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Glutamyl- and glutaminyl-tRNA synthetases # Organism: Fusobacterium nucleatum # 1 510 7 516 516 1009 99.0 0 MCVDCKKRVRTRVAPSPTGDPHVGTAYIALFNIAFAHVNDGDFILRIEDTDRNRYTEGSE QMIFDALKWLDLDYSEGPDVGGDYGPYRQSERFDLYGKYAKELVEKGGAYYCFCDHERLE NLRERQKAMGLPPGYDGHCRSLSKEEIEEKLKAGVPYVIRLKMPYEGETVIHDRLRGDVV FENSKIDDQVLLKADGYPTYHLANIVDDHLMGITHVIRAEEWIPSTPKHIQLYKAFGWEA PEFIHMPLLRNDDRSKISKRKNPVSLIWYKEEGYLKEGLVNFLGLMGYSYGDGQEIFTLQ EFKDNFNIDKVTLGGPVFDLVKLGWVNNQHMKMKDLGELTRLTIPFFVNEGYLTNENVSE KEFETLKKVVGIEREGAKTLKELAKNSKFFFVDEFSLPELREDMDKKERKSVERLLNSLK DEIGLKSIKLFIEKLEKWNGNEFTAEQAKDLLHSLLDDLQEGPGKIFMPIRAVLTGESKG ADLYNVLYVIGKERALKRIENIVKKYNIGI >gi|296154758|gb|ADVK01000023.1| GENE 24 21977 - 22903 1245 308 aa, chain - ## HITS:1 COG:FN1341 KEGG:ns NR:ns ## COG: FN1341 COG1186 # Protein_GI_number: 19704676 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Protein chain release factor B # Organism: Fusobacterium nucleatum # 1 308 1 308 308 566 100.0 1e-161 MNFEKNIVSRYEKLATEIDDEEVLIDFVESGETSFESELSEKHKNLKTDIEEFEINLLLD GEYDMNNAIVTIHSGAGGTEACDWADMLYRMYLRWCNLKGYKVSELDFMEGDSVGVKSVT FLVEGINAYGYLKSEKGIHRLVRISPFDANKKRHTSFASVEVVPEVDENVEVEINPVDIR IDTYRASGAGGQHVNMTDSAVRITHFPSGIVVTCQKERSQLSNRETAMKMLKSKLLELEL 
KKKEEEMKKIQGEQSDIGWGNQIRSYVFQPYALVKDHRTNTEIGNVKAVMDGDIDGFINS YLRWIKNN >gi|296154758|gb|ADVK01000023.1| GENE 25 23004 - 23078 73 24 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MDILEIKREFLEMKEKTENIRRSL >gi|296154758|gb|ADVK01000023.1| GENE 26 23098 - 23466 547 122 aa, chain - ## HITS:1 COG:FN1342 KEGG:ns NR:ns ## COG: FN1342 COG0736 # Protein_GI_number: 19704677 # Func_class: I Lipid transport and metabolism # Function: Phosphopantetheinyl transferase (holo-ACP synthase) # Organism: Fusobacterium nucleatum # 1 122 1 122 122 199 99.0 1e-51 MIIGIGNDIIEIERIEKVISKEGFKNKIYTQKELENIQKRGNRTETYAGIFSAKEAISKA IGTGVREFSLTDLEILNDDLGKPYVVVSEKLDKILRNKKENYQIEISISHSRKYATAMAI IL >gi|296154758|gb|ADVK01000023.1| GENE 27 23463 - 24221 1057 252 aa, chain - ## HITS:1 COG:FN1343 KEGG:ns NR:ns ## COG: FN1343 COG0084 # Protein_GI_number: 19704678 # Func_class: L Replication, recombination and repair # Function: Mg-dependent DNase # Organism: Fusobacterium nucleatum # 1 252 7 258 258 481 100.0 1e-136 MKIIDSHVHLNLEQFDNDREEVFKRIEEKLDFVVNIGFDLESSEKSVEYANKYPFIYAVI GFHPDEIEGYSDEAEKKLEELAKNPKVLAIGEIGLDYHWMTRPKEEQWDIFRKQLELARR VNKPVVIHTREAMEDTVNILNEFPDITGILHCYPGSVETAKRMIDRFYLGIGGVLTFKNA KKLVEVVKEIPIEKLVIETDCPYMAPTPYRGQRNEPIYTEEVVKKMAELKNMSYEDVVRI TNKNTRKVFKML >gi|296154758|gb|ADVK01000023.1| GENE 28 24389 - 24694 346 101 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328114|ref|ZP_06870647.1| ## NR: gi|296328114|ref|ZP_06870647.1| 50S ribosomal protein L4 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] 50S ribosomal protein L4 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 101 1 101 101 159 100.0 6e-38 MAVFETLIVEPVAENFKKTIKEVKEQEGGTVSEENNVQVNDDNYDLIVLDKFYDEVINKG NEAYLYNFLSSELAIIGNTLYARNAYKFKKKEYQKYFGEKS >gi|296154758|gb|ADVK01000023.1| GENE 29 24815 - 24886 83 23 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MDKEAKDDNKEIKIITKQKMTQR >gi|296154758|gb|ADVK01000023.1| GENE 30 25143 - 25970 840 275 aa, chain - ## HITS:1 COG:FN1345 KEGG:ns NR:ns ## COG: FN1345 COG0596 # Protein_GI_number: 19704680 # Func_class: R General function prediction only # Function: Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) # Organism: Fusobacterium nucleatum # 1 275 1 275 275 508 99.0 1e-144 MFFNYYNEKVYYEEFGEDKPILIIHGLACNIELMKGCIEPIFKKVNGYKRIYIDLLGMGK SNNCSLEYASSDKILDMLLSFIKEKIDKEFLLIGESYGGYLSRGIVSKCYKNVKGLMLLC SMIIPDDSKRVLPVGNLKFHDKEFLEKLDKNRREFFLEYMIIANEKMYKRFEKEVISGIE QANNDFIEKLRENYSFTFNIDEEIKMTNFSKPTLFIAGRQDNIVGYHDLYNLLEDYPRAT FVILDIAGHNLQIEQEELFNSLFLNWLERIEKYSI >gi|296154758|gb|ADVK01000023.1| GENE 31 25990 - 26067 177 25 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNIAEKYIEELMEHDLDFISKDTIY >gi|296154758|gb|ADVK01000023.1| GENE 32 26132 - 26836 1036 234 aa, chain - ## HITS:1 COG:FN1347 KEGG:ns NR:ns ## COG: FN1347 COG1359 # Protein_GI_number: 19704682 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 17 234 1 218 218 383 99.0 1e-106 MFKKLLLVLGILTSVSMYAVPILNVYDFEVKKDKETSYKSVTEDYVNKTMGVEQGVLGLF AATDERDKTTSYIVEIYNDYLAFSNHTKNQASKDFKAVIPQIAEGNLNSAEIDVQIAKDK KIEQNDNTFAVYTVIDVKPENDKEFAEIIKNIVETTFNEEGTLLVYLGTDRRNSNKWCLF EVYKDIDSYLNHRSAKYFKDYITQTKDMIAGKKRAELQVLKIENKGGLDYKKLY >gi|296154758|gb|ADVK01000023.1| GENE 33 26958 - 27629 303 223 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 219 1 219 245 121 34 7e-27 MLEIKNISKAYNRQGKDFFAVKDVDLNILDGDFVHIIGRSGSGKSTLLNIVAGLLSADKG TLLLDGTNYLELNDEEKSKFRNKNIGFIPQSPALLGYLNILENIRLPYDMYEKDGDSEGK 
ARYFLNELGLEHLAKSYPKELSGGELRRIIIARALMTDPKILIADEPTSDLDIEATKEVM DLLKKINEKGTTILIVTHELDTLKYGKKVYTMSEGVLTEGKNL >gi|296154758|gb|ADVK01000023.1| GENE 34 27638 - 28843 1618 401 aa, chain - ## HITS:1 COG:FN1349 KEGG:ns NR:ns ## COG: FN1349 COG0577 # Protein_GI_number: 19704684 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Fusobacterium nucleatum # 1 401 1 401 401 720 99.0 0 MSKRIDANSLAMENIRQRKTRSTCMILLVALFSLIVYMGSMFSLSLSRGLESLSDRLGAD VIVVPAGYKAEIESVLLKGEPSTFYLPADTIEKLEKFGEIEKMTPQIYVATLSASCCSYP VQIIGIDIDTDFLIYPWITHNIDKELKDGEAIVGSHVVGEKGETVHFFNEELKIVGRLKQ TGIGFDATVFVNQNTAKQLAKASERITANKVAEEDVISSVMIKAKPGVDSVKLASKISKE LSKEGVFAMFSKKFVNSISSNLKVLSTSILVLVGAIWILSIVVLSISFTAVFNERKKEMA VLRVLGASKKMLREIILKEAVILSLWGAGIGSFLGVILSIIQLPLLASKFSMPFLSPSLF QYIGIFILSFVLGVFIGPFSTVRVVKKLTDKDSYLSLREEM >gi|296154758|gb|ADVK01000023.1| GENE 35 28847 - 29284 599 145 aa, chain - ## HITS:1 COG:no KEGG:FN1350 NR:ns ## KEGG: FN1350 # Name: not_defined # Def: integral membrane protein # Organism: F.nucleatum # Pathway: not_defined # 1 145 1 145 145 238 99.0 5e-62 MKKNILEKLALILSVILFLVPKYIAPVCGPKEDGSHMACYFSGNMVMKLAGAIFIITLLM IIFSKVKIVKMLGSVAVIVISAYVYLIPHGMSGLHNEMGKPFGVCKMDTMFCHVHHTFEI ATGIAVVIGILMVFSLISTFLKKED >gi|296154758|gb|ADVK01000023.1| GENE 36 29438 - 29860 683 140 aa, chain - ## HITS:1 COG:FN1351 KEGG:ns NR:ns ## COG: FN1351 COG4939 # Protein_GI_number: 19704686 # Func_class: S Function unknown # Function: Major membrane immunogen, membrane-anchored lipoprotein # Organism: Fusobacterium nucleatum # 1 140 1 140 140 240 100.0 5e-64 MKKYLLVGMIVALSLLTACGKKDFSKMTFNDGEYQGHYESDDKDHKDSADVTITIQDGKI VACTAEFRDGKGNIKGDDYAKDAGEDKYMKAQIAVQGFSTYADKLVEVQDPNEVDAVSGA TVSNKEFKEAVWNALEKAKK >gi|296154758|gb|ADVK01000023.1| GENE 37 29857 - 29976 172 39 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MPAIIISRAPRGSFNNNGRRSSLNLYLILFSVTVDRRAK >gi|296154758|gb|ADVK01000023.1| GENE 38 29994 - 30668 287 224 aa, chain - ## PROTEIN SUPPORTED ## NR: 
gi|225084369|ref|YP_002657150.1| ribosomal protein S16 [gamma proteobacterium NOR51-B] # 8 218 9 219 309 115 31 5e-25 MDNREVLLEVKNVSKIYGDLHALKEVNFQVRKGEWVAIMGSSGSGKSTIMNIIGCMDKPS IGEVILDGQDITKESQNSLTKIRREKIGLIFQQFHLIPYLTALENVMVAQYYHSIPDEEE ALQALERVGLKDRAKHLPSQLSGGEQQRVCIARALINSPEIILADEPTGNLDEVNEKIVI EILKQLHKEGSTIVVVTHDLEVGDVAERKIILEYGKIIDIIDQK >gi|296154758|gb|ADVK01000023.1| GENE 39 30672 - 31874 1668 400 aa, chain - ## HITS:1 COG:FN1353 KEGG:ns NR:ns ## COG: FN1353 COG0577 # Protein_GI_number: 19704688 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Fusobacterium nucleatum # 1 400 1 400 400 678 99.0 0 MTKRQMYIKLVVSSLIRRKARMIVALLAVAIGATIMSGLVTIYYDIPRQLGKEFRSYGAN FVVLPSGNEKITDTEFDKIKNEMSTQKIVGMAPYRYETTKINQQPYILTGTDMIEVKKNS PFWYIEGEWSTNDDENNVMIGKEISKKLNLQVGETFIIEGPKAGAKVVASKQSDSAEESK KKDLNSDFYSKKLKVKGIITTGGAEESFIFLPISLLNEILEDDTKIDSIECSIEADSKQL DNLANKLKTVDENITARPIKRVTQSQDIVLGKLQALVLLVNIVVLILTMISVSTTMMAVV AERRKEIGLKKALGAYDSEIRKEFLGEGSALGFIGGLLGVGLGFVFAQEVSLSVFGRAIE FQWLFAPITIIVSMIITTLACLYPVKKAMEIEPALVLKGE >gi|296154758|gb|ADVK01000023.1| GENE 40 31884 - 33164 1660 426 aa, chain - ## HITS:1 COG:FN1354 KEGG:ns NR:ns ## COG: FN1354 COG0577 # Protein_GI_number: 19704689 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, permease component # Organism: Fusobacterium nucleatum # 1 426 3 428 428 793 98.0 0 MFWRMVRGTLFRQKSKMLMIAFTVALGVSLATAMMNVMLGVGDKVNKELKTYGANITVMH KDASILDDLYGLSGEGVSNKFLLESEVPKIKQIFWGFAIVDFAPYLERTGEVEGVSNKVK IYGTWFEKHLVMPTGEEVDAGIKNLKTWWEIKGEWLKDDDLDGVMIGSLIAGKYNIKIGD TINVKGTNETKKLTVRGIINSGGDDDEAIYTVLKTTQDLFGLEDKITMIEVSALTTPDND LAKKAAQDPNSLTISEYETWYCTAYVSSISYQIQEVLTDSVAKPNRQVAESEGTILNKTE LLMLLICILSSFASALGISNLITASVIERSQEIGLIKAIGGTNRRIILLILTEIVLTGIF GGIFGYIAGIGFTQIIGKTVFSSYIEPAVIVVPIDIALVFAVTIIGSIPAIRYLLTLKPT EVLHGR >gi|296154758|gb|ADVK01000023.1| GENE 41 33167 - 34453 1182 428 aa, chain - ## HITS:1 COG:FN1355 KEGG:ns NR:ns ## COG: 
FN1355 COG4393 # Protein_GI_number: 19704690 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 131 428 1 298 298 580 100.0 1e-165 MLKFYIDVINYLAIFAFLLGIITALLIKYKNLYLNIVVGLISLVGLACSITMTVFKQLYP QKMVKISLQYNRWALAIGMGFMLIALLFQFLKTLKEKGRSKLCILSIISISFSMIAIWFL AFTIIPQVYAMTKEFVAFGEDSFGTQSLLRVGGFLLGLLTIFLIALSVQKVYFRLKSSLA KIFALLIYLIASFDFFLRGVSALARLRFLKSSNSLVFNVMVFEDKSTAYIVILFTIVACI FSLLLFKDSRKVIGTFKNNALLRLEKARLKNNKHWLSSLAFFSILSVFSITVVHSHITKP VALTPPQPYQEEGNMIVIPLTDVEDGHLHRFSYIATGGNNVRFIVVKKPKGGSYGLGLDA CDICGVAGYFERNDEIVCKRCDVVMNKSTIGFKGGCNPVPFEYEIKNKKIYIDKATLEKE KDRFPVGD Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:30:43 2011 Seq name: gi|296154743|gb|ADVK01000024.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00029, whole genome shotgun sequence Length of sequence - 13433 bp Number of predicted genes - 15, with homology - 14 Number of transcription units - 6, operones - 4 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 3 - 456 495 ## COG2831 Hemolysin activation/secretion protein + Prom 566 - 625 11.0 2 2 Op 1 8/0.000 + CDS 718 - 2463 1844 ## COG4988 ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components 3 2 Op 2 . + CDS 2453 - 4135 1718 ## COG1132 ABC-type multidrug transport system, ATPase and permease components + Term 4220 - 4272 1.8 4 3 Tu 1 . - CDS 4191 - 4307 246 ## - Prom 4329 - 4388 14.0 + Prom 4302 - 4361 8.1 5 4 Op 1 2/0.000 + CDS 4390 - 4899 647 ## COG0716 Flavodoxins 6 4 Op 2 . + CDS 4962 - 5429 593 ## COG1309 Transcriptional regulator + Term 5430 - 5484 5.5 - Term 5424 - 5465 3.1 7 5 Op 1 . - CDS 5483 - 6388 1267 ## COG0657 Esterase/lipase - Prom 6424 - 6483 4.2 - Term 6397 - 6440 -0.4 8 5 Op 2 . - CDS 6485 - 7876 1221 ## COG0471 Di- and tricarboxylate transporters 9 5 Op 3 . - CDS 7891 - 8154 425 ## Acfer_1067 HI0933 family protein 10 5 Op 4 . 
- CDS 8206 - 8886 784 ## COG1878 Predicted metal-dependent hydrolase 11 5 Op 5 . - CDS 8902 - 10005 1268 ## COG1263 Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 12 5 Op 6 . - CDS 10049 - 10906 1261 ## COG2513 PEP phosphonomutase and related enzymes - Prom 11045 - 11104 8.3 - Term 10979 - 11027 8.1 13 6 Op 1 . - CDS 11111 - 12061 1313 ## COG0119 Isopropylmalate/homocitrate/citramalate synthases 14 6 Op 2 1/0.500 - CDS 12076 - 12894 1278 ## COG1082 Sugar phosphate isomerases/epimerases 15 6 Op 3 . - CDS 12909 - 13433 721 ## COG0169 Shikimate 5-dehydrogenase Predicted protein(s) >gi|296154743|gb|ADVK01000024.1| GENE 1 3 - 456 495 151 aa, chain - ## HITS:1 COG:FN0293 KEGG:ns NR:ns ## COG: FN0293 COG2831 # Protein_GI_number: 19703638 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Hemolysin activation/secretion protein # Organism: Fusobacterium nucleatum # 19 151 1 133 178 195 90.0 3e-50 MFRKIFLLSLLIFSLSYAVDEIDIEKRRQKQQDFDNLIKSQDFSVPKNIGDNEQKNLILN VNSIDLEGNTIFEDFQIDAILRRYVGKAKDIYALINELENKYIEKGYVTTKVGLNTEKSD FENGNISLFVLEGKIDKVFYDDKENKFKTFI >gi|296154743|gb|ADVK01000024.1| GENE 2 718 - 2463 1844 581 aa, chain + ## HITS:1 COG:FN1819 KEGG:ns NR:ns ## COG: FN1819 COG4988 # Protein_GI_number: 19705124 # Func_class: C Energy production and conversion; O Posttranslational modification, protein turnover, chaperones # Function: ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components # Organism: Fusobacterium nucleatum # 1 581 1 581 581 1013 98.0 0 MIDKRLYNFSGNIKKYISITTFLSCVKLIANIFFYFIFAFLLVSLINKDFSFSYSYIIIS ILIIVFVRQFSTIKVAHMLGNLVVDVKRNLRKLIFEKTLKLGLAYSQLFKTQELIHLSVD NVEQLEVYFGGFLTQFYYCIVSSFILFFSIAYFNLKIAFILLGFSLAIPLSLYIILNKVK KVQKKYFAKYMNVGTLFLDSLQGLTTLKIYGTDEKREEEIAKMSEEFRIETMKVLKMQLL SIAVINWIIYAGTILAIVTSVKLFLNGSLGLFPMLFIFMLAPEFFIPMRTLTSLFHVAMT GVSAAENIISFVDSPERNNNGNKEFKNENEIKVSKLNFSYPDGTQSLKDIDMSFKKGNLT 
AVVGHSGCGKSTLVSVLSGELKSKENEIFVDDIDIQNIKIEDKIKNILKITHDSHIFFGT VRENLAMANENLSDETMIEVLKKVKLWDIFSKNKGLDTSLESQGKNLSGGQAQRVALARA LLYDASVYIFDEATSNIDIESEEIILNIIYSLSKEKTVIYISHRLPAIKNADCIYVMDKG KVIESGKHDELYAKKELYYNMYKHQEELETYLTKRGENNEK >gi|296154743|gb|ADVK01000024.1| GENE 3 2453 - 4135 1718 560 aa, chain + ## HITS:1 COG:FN1820 KEGG:ns NR:ns ## COG: FN1820 COG1132 # Protein_GI_number: 19705125 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase and permease components # Organism: Fusobacterium nucleatum # 1 560 4 563 563 1030 98.0 0 MKNRSTFNIVSNLLKLLDSLWKFMTIAVSTGVIGFIFSFCITLFGAYAFLSIIPATKDSL KYVFGGGYSTQTYFYAMIFCGFFRAILHYLEQFTNHYIAFHILAEIRVKLFKIMRKLAPA KMENKNQGNLISMITSDIELLEVFYAHTISPVLIAFFTSIFLFLYFFQLNYIYALYMLFT QFIVGIVVPYIAHKRSAKSGIEVRSKLGKLNDEFLDKLKGIREIIQYSQGKKILKKIDEI TSSLGENQKDLRNKASEVQMLVDSAIIILSIAQLLLSLFLISKDLVSIEATILAGVLQVG SFAPYINLAALGNILAQTFASGERVLNLMDEKPAVSDDISISNDDITENDDIIIDNISYS YENTNNKIFKDFSLKIKKGQLTGIMGPSGCGKSTLLKLIMRFWDVDSGKIVLDRKDIKAI PLKNLYQKFNYMTQSTSLFIGNIRDNLLVAKADATDEEIYTALKKASFYDYVMSLPDKLD SIVEEGGKNFSGGERQRIGLARAFLANREFFLLDEPTSNLDILNEAIILKSLADEAKDKT VILVSHRESTLSICNNIFKI >gi|296154743|gb|ADVK01000024.1| GENE 4 4191 - 4307 246 38 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSRTNLGVFEANLLTILEAINELVHLELLTDTEFVANS >gi|296154743|gb|ADVK01000024.1| GENE 5 4390 - 4899 647 169 aa, chain + ## HITS:1 COG:FN1822 KEGG:ns NR:ns ## COG: FN1822 COG0716 # Protein_GI_number: 19705127 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 169 1 169 169 303 98.0 1e-82 MKTLIIYSSETGNTKMVCEKAFEYINGEKVIIPIKEKNSINLDEFDNIVVGTWIDKANAN AEARKFINTLSNKKIFFIGTLAASLESEHAKKCFNNLTKLCSKKNNFVDGVLTRGKVSKD LQEKFTKFPLNIIHKFVPNMKEIILEADCHPNESDFLLIKGFIDKNFNY >gi|296154743|gb|ADVK01000024.1| GENE 6 4962 - 5429 593 155 aa, chain + ## HITS:1 COG:FN1823 KEGG:ns NR:ns ## COG: FN1823 COG1309 # Protein_GI_number: 19705128 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: 
Fusobacterium nucleatum # 1 155 1 155 156 228 99.0 2e-60 MARKGAYTKEMILEAAIKLFKKEGSDAITAKNIAKELNCSVAPIYSVYLSLNDLKKDLAF EIEKNILEEKNIHPLLSKMLAKLEVNDTDEQFLSKLEEIKRNILNKDSKMSIFSQFSEFI SLIYQTKKTKFSKLKILEIIAKHKKYITEFRNSKS >gi|296154743|gb|ADVK01000024.1| GENE 7 5483 - 6388 1267 301 aa, chain - ## HITS:1 COG:MT0230 KEGG:ns NR:ns ## COG: MT0230 COG0657 # Protein_GI_number: 15839600 # Func_class: I Lipid transport and metabolism # Function: Esterase/lipase # Organism: Mycobacterium tuberculosis CDC1551 # 55 278 157 368 403 118 35.0 1e-26 MFGAIYESQSLDVKLTRPQVSYTQNITYSQPLGKMNEVVKLEMDIIKPVTQNKLPVVLFV TGGGFVGSLKSNYLQQRLEIAEAGYVVASIEYRKIPNGVFPEPLEDVKSAIRFLRANADK FGIDKNKIAVMGSSAGGYLVAMAGTTNGYKQFDKGDNLNQNSDVQAVIDIYGLSDLATVG EDFSKEIQEIHKSSAAPEALWLNGVVLSNEINSVDNMPEKVKAANPMTYITKDTPPFLLL HGDKDILVSPSQTEKLHKALVTKGIDSTRYIVKGAAHGGEYWVQPEVMKVIINFLNKNLK K >gi|296154743|gb|ADVK01000024.1| GENE 8 6485 - 7876 1221 463 aa, chain - ## HITS:1 COG:MTH788 KEGG:ns NR:ns ## COG: MTH788 COG0471 # Protein_GI_number: 15678812 # Func_class: P Inorganic ion transport and metabolism # Function: Di- and tricarboxylate transporters # Organism: Methanothermobacter thermautotrophicus # 253 456 232 441 443 60 23.0 7e-09 MQNIWKYIVSEKVNKIWGIKMGLSIMFFAIILIWKPFNLTFQQAVIVANTILVIIWWSTG IINKIPASLFLLVIFYIFSGASIKMILSFSLSETFLMIIVTYLFSQGIANSGLIEKILQP LLIKLVHTPCQCLIAIVGIFYLTMYIIPQPLARLIIVSSVLFHFLQQINLPERTKKVLMY GVFVASAVVNMSTKDADIIMSNIAANFSEMPISNRTWAYYMFIPTLITCCLLGILFIYIF HKALIGIPLKNVEKENKISPFSAQQKLAIGIIIMTVLLWTTNGIHGINNTLITIISTIIL FAIKILHKEDWKSIDITTLIFLTAAFSIGNIIKFCGAADKVFGQLQAIFPTKFSLLYIYV MILTTMLLHMILGSNTTTLSVVIPGLMILCSQVVKSPIIVFISVISVSFHAILPFHSISL MIGVSNNYYPAKYITKLGLPVTLLVYLVVIGIYIPYWNIVGLL >gi|296154743|gb|ADVK01000024.1| GENE 9 7891 - 8154 425 87 aa, chain - ## HITS:1 COG:no KEGG:Acfer_1067 NR:ns ## KEGG: Acfer_1067 # Name: not_defined # Def: HI0933 family protein # Organism: A.fermentans # Pathway: not_defined # 1 87 333 418 418 80 49.0 2e-14 MGTYLEVESGSYFDIPLGALQSINIENLYGAGRIVSSDEVAFAAIRVMGTCFATGHAAGV 
AAAYQALNGNVDREKIREELKRQNALV >gi|296154743|gb|ADVK01000024.1| GENE 10 8206 - 8886 784 226 aa, chain - ## HITS:1 COG:PAE0036 KEGG:ns NR:ns ## COG: PAE0036 COG1878 # Protein_GI_number: 18311668 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolase # Organism: Pyrobaculum aerophilum # 9 221 2 205 223 87 31.0 1e-17 MIKLEGIKFIDISYIVKNKMPADPALKLPTLEFFSNEGVNGQLHNLEVISYCPHTGTHMD APFHIDTNGDSIEKLDPTLLIGPAVVVSLDYSDRRPCVITADIIKKWEKSNIEIQKGDAV LLNTGHSKYWESGKEEYIEKGYVCLSTDLAKYFVDKGVRFVGLESISVDGPETGTEAHKI LLGNNVYIVENLTNLDKIESKRCITMGTFPAVKGASGVWIRLLALV >gi|296154743|gb|ADVK01000024.1| GENE 11 8902 - 10005 1268 367 aa, chain - ## HITS:1 COG:ZascF_2 KEGG:ns NR:ns ## COG: ZascF_2 COG1263 # Protein_GI_number: 15803232 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific # Organism: Escherichia coli O157:H7 EDL933 # 16 361 13 364 392 180 33.0 4e-45 MNDNIVKKKLGIFGKIMNVIQECMGPVIPMIMAGGLIKVLVIILVAFNIISDNSDTCVIL SAIGDAPYYFLPFEVAVTAAIYFKIPILLSIATVGIMFSPKFLELFASEVFFVGIPVINV TYSYSVLPVILLVWIMNLLYNWIKKIMQDSLFNFLGNTVVLLISSLLAVLVIGPIGNLIA QAIRIFILQIQSMSQVAAEVFFAIIFPFLNMLGMQWPFILDAITHVGENGYESLIIISLL CVNISQGGACLASSFRIKDNKKKSECLGMGLTAIISGASEPATYSNFGYKRPMIAALIGS GVAGLYAGIIGIKAFSFVPPAFATVLIFMNSKYSINIIHAIITGFIALVVSFLVSYILGI EAQKKSN >gi|296154743|gb|ADVK01000024.1| GENE 12 10049 - 10906 1261 285 aa, chain - ## HITS:1 COG:all1863 KEGG:ns NR:ns ## COG: all1863 COG2513 # Protein_GI_number: 17229355 # Func_class: G Carbohydrate transport and metabolism # Function: PEP phosphonomutase and related enzymes # Organism: Nostoc sp. 
PCC 7120 # 1 281 1 285 287 202 38.0 8e-52 MIKTKKLRDLINGNKVVVAPCAYDALSARAIEFMGFELAATTGFGMHGTMLGVPDNGLLT FTEMVRMCRNMASSINIPMMADAEGGYGNAINTYRTIKEFENSGLAGLFIEDQELPPNCP YIKGTKLIRVEEMIGKIKAALEARKDKNFVIVARTDAPFEEAIERLNAYYEAGVDMVKPM PRSRKELEDYPKYLKAPIHLGFTYGKETTLGLTATDCGKMGYKIVTFPFSELMASTTAIL RILKEIKEKGTDESFYQEMIKFEEYLKIVNIDLYNELDRKYMLDI >gi|296154743|gb|ADVK01000024.1| GENE 13 11111 - 12061 1313 316 aa, chain - ## HITS:1 COG:BH1134 KEGG:ns NR:ns ## COG: BH1134 COG0119 # Protein_GI_number: 15613697 # Func_class: E Amino acid transport and metabolism # Function: Isopropylmalate/homocitrate/citramalate synthases # Organism: Bacillus halodurans # 1 296 1 301 303 228 40.0 1e-59 MNFGKKVILCEVGPRDGLQNECTILTVDQKVELINDITDAGYKVIEVGSFMSPKAVPQMA TTDEVMKKIKRKDGVEYRVLIANLKGVERAIACDCKKVKLNVSASRAHNLANLNRTPEET VAGFQACVDLAKANNIIVSGSISMPFGSPWERYIPIQDVKSIVEAYLKAGVNQISLSDAS GMAVPSQINSMCKEMIKSYPEVSWILHLHNTRGVAMANIIAGLDTGIDCFDTSFGGLGGC PFVPGAAGNIASEDVIHMLSEMGVETGIDLDKMIEVAKKVQKFVGHDTDSYILKAGKSSE LIRELPKGQGKNETQK >gi|296154743|gb|ADVK01000024.1| GENE 14 12076 - 12894 1278 272 aa, chain - ## HITS:1 COG:PA0238 KEGG:ns NR:ns ## COG: PA0238 COG1082 # Protein_GI_number: 15595435 # Func_class: G Carbohydrate transport and metabolism # Function: Sugar phosphate isomerases/epimerases # Organism: Pseudomonas aeruginosa # 6 240 7 242 271 118 32.0 1e-26 MKKEYSLAHLTAISTTPLELAKIAANCGYDYVSIRQIYMGVTGEVPVDLAENKKMYQEIK DLFKDTGLRLLDIELAKIFDGVDLKKYESAFETGKSLGAKHVLSSIWTDNREYAIEKFAD LCDLAKKYDLTVDLEAVPIAGVKSFAEVADILRIIKRDNAGLMMDTHHFNRANDSVELLK TFPKEWFHYAQICDVPPAPTTREEMIRIMRESRDYLGEGTIDVATILNTMPIVPYSIELP NSKKVEEYGYEGHARKCLETAKKYCDTFVIGR >gi|296154743|gb|ADVK01000024.1| GENE 15 12909 - 13433 721 174 aa, chain - ## HITS:1 COG:lin0493 KEGG:ns NR:ns ## COG: lin0493 COG0169 # Protein_GI_number: 16799568 # Func_class: E Amino acid transport and metabolism # Function: Shikimate 5-dehydrogenase # Organism: Listeria innocua # 15 172 131 291 291 119 37.0 3e-27 VIASLDELNIPFRKQQVVLVGVGGAGRAIAIQLAYEGVGELCIKELNKDLANEVKETINK 
YISNVKVKILPDDEKSLKEELKDACLLVNATPLGMKGRENLCVISGPEVLHKDLFVYDIV YDPRETLLMKYAKEAGCKTTNGINMMIWQGAIAFKIWFDVDMPQDYVRQELFEK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:31:20 2011 Seq name: gi|296154652|gb|ADVK01000025.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00030, whole genome shotgun sequence Length of sequence - 96898 bp Number of predicted genes - 94, with homology - 87 Number of transcription units - 40, operones - 25 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 396 - 440 10.2 1 1 Op 1 1/0.929 - CDS 485 - 1279 850 ## COG0796 Glutamate racemase 2 1 Op 2 1/0.929 - CDS 1353 - 2030 753 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family 3 1 Op 3 1/0.929 - CDS 1990 - 2502 378 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family - Prom 2535 - 2594 2.8 4 2 Op 1 1/0.929 - CDS 2596 - 2958 245 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family 5 2 Op 2 1/0.929 - CDS 2945 - 3136 241 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family - Prom 3186 - 3245 5.2 6 3 Tu 1 1/0.929 - CDS 3251 - 3532 204 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family - Prom 3605 - 3664 8.5 - Term 3821 - 3856 1.3 7 4 Op 1 . - CDS 3873 - 4913 1720 ## COG1494 Fructose-1,6-bisphosphatase/sedoheptulose 1,7-bisphosphatase and related proteins 8 4 Op 2 . - CDS 4910 - 5224 239 ## FN1158 hypothetical protein 9 4 Op 3 4/0.000 - CDS 5217 - 5741 921 ## COG0242 N-formylmethionyl-tRNA deformylase 10 4 Op 4 1/0.929 - CDS 5751 - 8051 2618 ## COG1198 Primosomal protein N' (replication factor Y) - superfamily II helicase 11 4 Op 5 1/0.929 - CDS 8067 - 10070 2370 ## COG0768 Cell division protein FtsI/penicillin-binding protein 2 - Prom 10154 - 10213 5.2 12 5 Op 1 . - CDS 10222 - 11472 1017 ## COG1295 Predicted membrane protein 13 5 Op 2 . - CDS 11488 - 11817 459 ## FN1153 hypothetical protein 14 5 Op 3 . 
- CDS 11837 - 13027 1444 ## COG0436 Aspartate/tyrosine/aromatic aminotransferase - Prom 13160 - 13219 9.9 + Prom 13056 - 13115 8.2 15 6 Tu 1 . + CDS 13185 - 14531 1588 ## COG0534 Na+-driven multidrug efflux pump 16 7 Tu 1 . - CDS 14541 - 14675 84 ## - Prom 14796 - 14855 8.0 + Prom 14682 - 14741 9.5 17 8 Op 1 . + CDS 14791 - 17502 2422 ## FN1150 hypothetical protein 18 8 Op 2 . + CDS 17495 - 20665 2975 ## COG1074 ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) + Term 20673 - 20707 6.2 - Term 20661 - 20695 6.2 19 9 Tu 1 . - CDS 20708 - 21880 1551 ## COG1301 Na+/H+-dicarboxylate symporters - Prom 21919 - 21978 10.3 + Prom 21979 - 22038 12.4 20 10 Op 1 1/0.929 + CDS 22065 - 23348 1922 ## COG3681 Uncharacterized conserved protein 21 10 Op 2 . + CDS 23371 - 23862 618 ## COG2849 Uncharacterized protein conserved in bacteria 22 11 Tu 1 . - CDS 23927 - 24076 65 ## gi|296328162|ref|ZP_06870693.1| conserved hypothetical protein - Prom 24198 - 24257 9.5 + Prom 24176 - 24235 21.1 23 12 Op 1 . + CDS 24304 - 25983 2381 ## COG1164 Oligoendopeptidase F 24 12 Op 2 . + CDS 26062 - 26832 1066 ## FN1144 hypothetical protein + Term 26845 - 26911 7.2 - Term 26832 - 26899 13.2 25 13 Op 1 1/0.929 - CDS 26900 - 27724 1131 ## COG0363 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase 26 13 Op 2 . - CDS 27742 - 28656 939 ## COG1242 Predicted Fe-S oxidoreductase - Prom 28832 - 28891 20.7 + Prom 28664 - 28723 11.7 27 14 Tu 1 . + CDS 28971 - 29741 1136 ## COG2116 Formate/nitrite family of transporters + Term 29750 - 29803 14.2 - Term 29738 - 29790 4.1 28 15 Op 1 2/0.000 - CDS 29791 - 31014 1344 ## COG3581 Uncharacterized protein conserved in bacteria 29 15 Op 2 1/0.929 - CDS 31016 - 33943 3133 ## COG1924 Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) - Prom 34071 - 34130 8.4 30 16 Tu 1 . 
- CDS 34284 - 34706 808 ## COG3576 Predicted flavin-nucleotide-binding protein structurally related to pyridoxine 5'-phosphate oxidase - Prom 34744 - 34803 10.6 31 17 Op 1 9/0.000 - CDS 34847 - 36415 1517 ## COG3639 ABC-type phosphate/phosphonate transport system, permease component 32 17 Op 2 15/0.000 - CDS 36384 - 37127 240 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) - Term 37149 - 37182 2.3 33 17 Op 3 1/0.929 - CDS 37205 - 38080 1376 ## COG3221 ABC-type phosphate/phosphonate transport system, periplasmic component - Prom 38161 - 38220 11.0 34 18 Tu 1 . - CDS 38222 - 38677 499 ## COG2731 Beta-galactosidase, beta subunit - Prom 38712 - 38771 10.4 + Prom 38651 - 38710 11.3 35 19 Tu 1 . + CDS 38805 - 39965 1836 ## COG1820 N-acetylglucosamine-6-phosphate deacetylase + Term 39972 - 40017 1.2 - Term 39959 - 40004 5.0 36 20 Op 1 . - CDS 40013 - 40594 725 ## COG1057 Nicotinic acid mononucleotide adenylyltransferase 37 20 Op 2 . - CDS 40591 - 41364 739 ## FN1131 hypothetical protein 38 20 Op 3 1/0.929 - CDS 41370 - 42374 1154 ## COG1663 Tetraacyldisaccharide-1-P 4'-kinase 39 20 Op 4 . - CDS 42402 - 45983 4570 ## COG1196 Chromosome segregation ATPases - Prom 46019 - 46078 14.1 + Prom 46008 - 46067 10.7 40 21 Op 1 1/0.929 + CDS 46102 - 48084 2448 ## COG1506 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 41 21 Op 2 1/0.929 + CDS 48110 - 49930 2224 ## COG4907 Predicted membrane protein 42 21 Op 3 1/0.929 + CDS 49911 - 50417 591 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 43 21 Op 4 2/0.000 + CDS 50426 - 50977 864 ## COG1704 Uncharacterized conserved protein 44 21 Op 5 . + CDS 50995 - 52179 173 ## PROTEIN SUPPORTED gi|163756109|ref|ZP_02163225.1| 30S ribosomal protein S1 + Term 52186 - 52245 13.7 - Term 52174 - 52231 13.3 45 22 Tu 1 . - CDS 52235 - 52780 807 ## COG0526 Thiol-disulfide isomerase and thioredoxins - Prom 53008 - 53067 15.8 + Prom 52842 - 52901 8.9 46 23 Op 1 . 
+ CDS 52943 - 55429 3300 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) + Prom 55483 - 55542 8.4 47 23 Op 2 . + CDS 55596 - 57116 2038 ## COG4868 Uncharacterized protein conserved in bacteria + Term 57348 - 57403 7.1 - Term 57343 - 57386 7.6 48 24 Op 1 . - CDS 57478 - 59844 1908 ## COG0641 Arylsulfatase regulator (Fe-S oxidoreductase) - Prom 59896 - 59955 3.7 49 24 Op 2 . - CDS 59957 - 60319 362 ## gi|296328189|ref|ZP_06870720.1| PTS family fructose porter, IIA/HPr component - Prom 60544 - 60603 16.1 + Prom 60260 - 60319 4.9 50 25 Tu 1 . + CDS 60406 - 60492 59 ## + Term 60690 - 60734 1.1 - Term 61309 - 61350 2.2 51 26 Tu 1 . - CDS 61380 - 62963 1987 ## COG1866 Phosphoenolpyruvate carboxykinase (ATP) - Prom 63147 - 63206 16.1 52 27 Op 1 14/0.000 - CDS 63227 - 63511 492 ## PROTEIN SUPPORTED gi|19704454|ref|NP_604016.1| 50S ribosomal protein L27 53 27 Op 2 14/0.000 - CDS 63512 - 63841 530 ## PROTEIN SUPPORTED gi|237742036|ref|ZP_04572517.1| 50S ribosomal protein L27 54 27 Op 3 . - CDS 63844 - 64158 516 ## PROTEIN SUPPORTED gi|19704452|ref|NP_604014.1| 50S ribosomal protein L21P 55 27 Op 4 . - CDS 64210 - 64290 60 ## 56 27 Op 5 1/0.929 - CDS 64268 - 64513 306 ## COG3340 Peptidase E - Prom 64538 - 64597 2.4 57 28 Op 1 . - CDS 64626 - 65417 865 ## COG2215 ABC-type uncharacterized transport system, permease component 58 28 Op 2 . - CDS 65438 - 65542 139 ## 59 28 Op 3 . 
- CDS 65539 - 66129 439 ## COG3683 ABC-type uncharacterized transport system, periplasmic component - Prom 66265 - 66324 11.2 + Prom 66070 - 66129 13.7 60 29 Op 1 49/0.000 + CDS 66280 - 67218 888 ## COG0601 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 61 29 Op 2 5/0.000 + CDS 67218 - 68030 806 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 62 29 Op 3 5/0.000 + CDS 68068 - 69630 2226 ## COG0747 ABC-type dipeptide transport system, periplasmic component 63 29 Op 4 44/0.000 + CDS 69643 - 70422 246 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 64 29 Op 5 . + CDS 70440 - 71201 193 ## PROTEIN SUPPORTED gi|225088774|ref|YP_002660041.1| ribosomal protein S16 + Term 71211 - 71258 11.3 - Term 71245 - 71305 -0.7 65 30 Op 1 9/0.000 - CDS 71347 - 71835 494 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes 66 30 Op 2 1/0.929 - CDS 71858 - 72325 559 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 67 30 Op 3 . - CDS 72352 - 73578 1998 ## COG1760 L-serine deaminase 68 30 Op 4 . - CDS 73607 - 74104 738 ## FN1105 hypothetical protein 69 30 Op 5 1/0.929 - CDS 74108 - 74692 862 ## COG0632 Holliday junction resolvasome, DNA-binding subunit 70 30 Op 6 1/0.929 - CDS 74704 - 77586 3967 ## COG0178 Excinuclease ATPase subunit - Term 77601 - 77647 6.1 71 31 Op 1 1/0.929 - CDS 77654 - 78193 619 ## COG1859 RNA:NAD 2'-phosphotransferase 72 31 Op 2 1/0.929 - CDS 78186 - 79532 1482 ## COG1373 Predicted ATPase (AAA+ superfamily) - Prom 79580 - 79639 8.5 - Term 79571 - 79631 6.0 73 32 Op 1 . - CDS 79703 - 79969 344 ## COG2026 Cytotoxic translational repressor of toxin-antitoxin stability system 74 32 Op 2 . - CDS 79951 - 80181 305 ## FN1099 hypothetical protein - Prom 80206 - 80265 8.9 75 33 Op 1 . - CDS 80328 - 80465 243 ## 76 33 Op 2 . - CDS 80519 - 82306 2321 ## FN1097 hypothetical protein 77 33 Op 3 . 
- CDS 82315 - 83559 1366 ## FN1096 hypothetical protein 78 33 Op 4 . - CDS 83552 - 83902 410 ## FN1095 hypothetical protein - Prom 84015 - 84074 7.7 79 34 Tu 1 . - CDS 84120 - 84215 60 ## - Prom 84321 - 84380 12.9 + Prom 84230 - 84289 16.3 80 35 Tu 1 . + CDS 84407 - 85489 1005 ## COG0463 Glycosyltransferases involved in cell wall biogenesis + Term 85558 - 85613 6.0 81 36 Op 1 1/0.929 - CDS 85503 - 85931 434 ## COG1959 Predicted transcriptional regulator 82 36 Op 2 1/0.929 - CDS 85965 - 86873 1165 ## COG3872 Predicted metal-dependent enzyme 83 36 Op 3 1/0.929 - CDS 86888 - 88411 1573 ## COG2208 Serine phosphatase RsbU, regulator of sigma subunit 84 36 Op 4 5/0.000 - CDS 88476 - 90245 1836 ## COG0322 Nuclease subunit of the excinuclease complex 85 36 Op 5 . - CDS 90232 - 91104 870 ## COG1660 Predicted P-loop-containing kinase 86 36 Op 6 . - CDS 91131 - 92480 2001 ## COG0446 Uncharacterized NAD(FAD)-dependent dehydrogenases - Prom 92509 - 92568 9.4 - Term 92556 - 92596 5.2 87 37 Op 1 . - CDS 92607 - 93188 536 ## FN1087 hypothetical protein 88 37 Op 2 1/0.929 - CDS 93194 - 94519 1499 ## COG1106 Predicted ATPases - Prom 94556 - 94615 18.8 - Term 94605 - 94636 1.0 89 38 Op 1 . - CDS 94664 - 95212 893 ## COG0693 Putative intracellular protease/amidase 90 38 Op 2 . - CDS 95228 - 95476 397 ## FN1084 hypothetical protein - Prom 95500 - 95559 5.4 91 39 Op 1 . - CDS 95587 - 96183 734 ## COG2431 Predicted membrane protein 92 39 Op 2 . - CDS 96180 - 96455 242 ## FN1082 hypothetical protein 93 39 Op 3 . - CDS 96534 - 96620 74 ## - Prom 96816 - 96875 8.0 + Prom 96430 - 96489 11.5 94 40 Tu 1 . 
+ CDS 96662 - 96871 292 ## FN1081 hypothetical protein Predicted protein(s) >gi|296154652|gb|ADVK01000025.1| GENE 1 485 - 1279 850 264 aa, chain - ## HITS:1 COG:FN1161 KEGG:ns NR:ns ## COG: FN1161 COG0796 # Protein_GI_number: 19704496 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glutamate racemase # Organism: Fusobacterium nucleatum # 1 264 1 264 264 507 100.0 1e-143 MAEKTQRIGIFDSGLGGTTVLKELINSLPNEDYIYYGDNGNFPYGSGKTKNELQKLTERI LDFFVKNNCKLVIVACNTASTAAIDYLREKFSLPIIGIIEAGVKIASKNTKNKNIAVIST KFTAESHGYKNKAKMLDSELNVKEIACIEFAQMIETGWDTFDNRKELLNKYLSEIPKNAD TLVLGCTHYPLIREDIEKNIKIKVVDPAVEIVERTIQTLTSLNLLNDKKERGRIIFFVTG ETYHFKPTAEKFLGKEIEIYRIPK >gi|296154652|gb|ADVK01000025.1| GENE 2 1353 - 2030 753 225 aa, chain - ## HITS:1 COG:FN1160 KEGG:ns NR:ns ## COG: FN1160 COG0553 # Protein_GI_number: 19704495 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 10 225 874 1089 1089 369 96.0 1e-102 MDIYLIMIIFIKINETSAQNINVNTNKIEVLAMLTKLRQICIDPRSLYEDVSSSSSKINA YIELIEKSIENNQKILLFSSFTTVLDLVAQECDNLSIPYFMLTGETNKVKRKEMVENFQN EAVPLFLISLKAGGTGLNLTKASVVIHLDPWWNISVQNQATDRAHRIGQEDTVQLFNLIT KNTIEEKILNLQSKKKELSDIFVENSKGSFSSLTKEQLLDLFKLE >gi|296154652|gb|ADVK01000025.1| GENE 3 1990 - 2502 378 170 aa, chain - ## HITS:1 COG:FN1160 KEGG:ns NR:ns ## COG: FN1160 COG0553 # Protein_GI_number: 19704495 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 1 170 644 813 1089 304 98.0 5e-83 MLKLRAMNLGGILADDMGLGKTLQVITYLESVKRERASLIVTPASLILNWENEFNKFNSS VLTLSIYGDRKNREGLLSNLKNEVVITSYDYLKRDMDLYENIDFDTIILDEAQYIKNHKT KVAQAVKKINSKFKLVLTGTPLENSLAEIWSIFDFLMNGYLFNYDYFYKN >gi|296154652|gb|ADVK01000025.1| GENE 4 2596 - 2958 245 120 aa, chain - ## HITS:1 COG:FN1160 KEGG:ns NR:ns ## COG: FN1160 COG0553 # Protein_GI_number: 19704495 # Func_class: K Transcription; L Replication, 
recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 6 120 497 611 1089 189 98.0 1e-48 MLIYNLSQSIKNFNKTRKINYSVGVKVTSNFLELDFSSTDLDKGEIIDVLNQYRAKKKYY RLKKSEIILMDQEQLEFLDDFIKDFNIKDNDLKKGNIKIPNFRAYQLNVLQNKYMDIEKN >gi|296154652|gb|ADVK01000025.1| GENE 5 2945 - 3136 241 63 aa, chain - ## HITS:1 COG:FN1160 KEGG:ns NR:ns ## COG: FN1160 COG0553 # Protein_GI_number: 19704495 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 1 63 433 495 1089 104 95.0 5e-23 MNLIQIVEIFKILNPDIIDNLKNYINFHQKIYEINQTMIIVKKSEIDKFIEDIIPLLHTY ADI >gi|296154652|gb|ADVK01000025.1| GENE 6 3251 - 3532 204 93 aa, chain - ## HITS:1 COG:FN1160 KEGG:ns NR:ns ## COG: FN1160 COG0553 # Protein_GI_number: 19704495 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Fusobacterium nucleatum # 1 90 217 306 1089 155 96.0 3e-38 MHYDAFTDFSKKQLDFILKHPDNIIQFKKNMINLDENNMDDFYNTYFDSPSAEIIFQEED FKTELVIEKNDKDYEIYLKNKFYNPKEKGKSPR >gi|296154652|gb|ADVK01000025.1| GENE 7 3873 - 4913 1720 346 aa, chain - ## HITS:1 COG:FN1159 KEGG:ns NR:ns ## COG: FN1159 COG1494 # Protein_GI_number: 19704494 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose-1,6-bisphosphatase/sedoheptulose 1,7-bisphosphatase and related proteins # Organism: Fusobacterium nucleatum # 1 346 1 346 346 623 99.0 1e-178 MKRELALEFARVTEAAALAAHKWVGRGKKESADQAGVDAMRTMLNRLAIDGEIVIGEGEI DEAPMLYIGEKVGQIYNEEEKDSVTYVDPVDIAVDPVEGTRMTAQGQPNAITVLAVGKKG SFLKAPDMYMEKLIVGPEAKGKIDLSKPLEDNIHAVAKALKKELKDLMIVILDKPRHKEL IKDLQAMGVKVYALPDGDVAGSILTCMIDSDVDMLYGIGGAPEGVISAAVIRALGGDMQA RLKLRSEVKGASLENDKISKFEKLRCEEQELKVGEILKLEDLAKDDEIIFSATGITGGDL LEGVKRKGNIARTQTLVVRGLSKTVRYINSIHNLDFKDEKITHLVK >gi|296154652|gb|ADVK01000025.1| GENE 8 4910 - 5224 239 104 aa, chain - ## HITS:1 COG:no KEGG:FN1158 NR:ns ## 
KEGG: FN1158 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 91 1 91 104 84 98.0 1e-15 MNKKLLWLIVIIILLLSSFNVVPQIKHSLSKKKSTKEEIIAVNKKIEEIKEDIEKYDKKI TSLDDEFEKERVARNMFQMVKENEVIYKYVEKKNKKIEKTEEEK >gi|296154652|gb|ADVK01000025.1| GENE 9 5217 - 5741 921 174 aa, chain - ## HITS:1 COG:FN1157 KEGG:ns NR:ns ## COG: FN1157 COG0242 # Protein_GI_number: 19704492 # Func_class: J Translation, ribosomal structure and biogenesis # Function: N-formylmethionyl-tRNA deformylase # Organism: Fusobacterium nucleatum # 1 174 1 174 174 302 99.0 2e-82 MVYEIKKYGEDVLKQIAKEVELSEINDEFRQFLDDMVETMYETDGVGLAAPQIGVSKRIF VCDDGNGVLRKVINPIIVPLTEETQEFEEGCLSVPGIYKKVERPKRVLLKYLNEYGKEVE EIAENFLAVVVQHENDHLDGILFIEKISPMAKRLIAKKLANIKKETKRIKEENE >gi|296154652|gb|ADVK01000025.1| GENE 10 5751 - 8051 2618 766 aa, chain - ## HITS:1 COG:FN1156 KEGG:ns NR:ns ## COG: FN1156 COG1198 # Protein_GI_number: 19704491 # Func_class: L Replication, recombination and repair # Function: Primosomal protein N' (replication factor Y) - superfamily II helicase # Organism: Fusobacterium nucleatum # 1 766 1 766 766 1289 99.0 0 MQYFDIYIDSMKGIYTYSDKNDEFEVGENVIVPFRNIKKAGFIIRKNLKESFEFKVLNIS SKVKNSLKLSNEQIKLIEWMVDYYLTSYDSVIKAMIPKKIKISYSNIYTINLNKLNILNR FLDNGIIKYMISLTKISYSTAKTKFKKSVVDSLIDKKILYKDENNIYVNIGNFSKLKKEN KEIFEYFYKKTIVKKEKLEEKFKKIVIKELEENEILKIEAHINEKKEYISDNTEKVFENK SLLNKKQLAIKENIENSDKKYFLLKGVTGSGKTEIYIELIKKAFFEGYGSIFLVPEISLT PQMVERFQSEFKNNIAILHSSLSDIERAKEWESIYIGEKKIVLGVRSAIFSPVKNLKYII LDEEHEATYKQDSSPRYNAKYVAIKRCLDEGTKLVLGSATPSIESYYYAKTSIYELLSLD DRYGNAEMPNIQVVDMKQEDDLFFSKALLEEIKNTLLKNEQVILLLNRKGYSTYIQCKDC GYVEECDNCSIKMSYYKSVNKYKCNYCGKQIHYTGKCTKCGSTNLIHSGKGIERIEEELK KYFDVPMIKVDSELSRNKDYFSKIYKDFSDKKYSILIGTQIIAKGLHFPNVTLVGVINSD IILNFPDFRSGEKTFQLLTQVSGRAGRGNKKGKVIIQTYEPENNVIKDSKEENYDLFYEK EISSRKIFSYPPFSKILNIGFSSEDEERLLDISKKFYDEIKSQDIELYGPTPSMVYKVQK RFRMNIFVKGSKKKIDKFKLFLKKKLNEFNDTKVRIVVDIDPINMI >gi|296154652|gb|ADVK01000025.1| GENE 11 8067 - 10070 2370 667 aa, chain - ## HITS:1 
COG:FN1155 KEGG:ns NR:ns ## COG: FN1155 COG0768 # Protein_GI_number: 19704490 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell division protein FtsI/penicillin-binding protein 2 # Organism: Fusobacterium nucleatum # 1 667 45 711 711 1229 99.0 0 MFAKRSSIMLLIILFFLIVYGLRLLGIQYLQKSKYVALMNEQLLSINKEIGQRGLIYDNK GKKLAFNNRKYTISINPSLLNDEKIHDEIVKDIVAIRDSKIVKLEDNILEKLSKLASEGN RYKRLVKDIDDEQKEQIDELLAGIERTKVKGSPKYRTVLQFEKSIERKYYKQDEYEKLIG MVRFTEESRDERLGISGLEKQYQNYLVEKRRNIPKLYGLNKKNILALSKETLFSDLNGKN LYLTIDADLNFILNDEIKAQFKNTNAYEAYGLIMDPNNGKILAVAAFSKDKNLLRNNIFQ SQYEPGSIFKPLIVAAAMNEKFINENTAFNVGDGKIKRFKKTIRESSRSTRGIITAREVV MKSSNVGMVLISDYFTNALFEDYLKAFGLYDKTGVDFPNELKPYTSSYKNWDGLKKNNMA FGQGVVITPIQMITAFSAVVNGGTLYKPYIVEKITDSEGTVIMRNTPTAVRKVISEEVSE KMRSILEDTVEKGTGKRAYIEGYAVGGKTGTAQLSAGKSGYIRNEYLSSFIGFFPADKPK YIVMAMFMRPQADIQANKFGGVVAAPVVGNIIRRIIKEEEGFAKNVETINVSSSKDENGE SIKSNLDAISYEDVMPDLEGMSPQEVLAVFKETNIDIEVIGTGLVEEQRPAAGDSLKNVK KVKILLK >gi|296154652|gb|ADVK01000025.1| GENE 12 10222 - 11472 1017 416 aa, chain - ## HITS:1 COG:FN1154 KEGG:ns NR:ns ## COG: FN1154 COG1295 # Protein_GI_number: 19704489 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 21 416 1 396 396 643 96.0 0 MKNLFENFGSKKFNTESFKLMLKRAYEKYQRANSSFWVTSLSFYTILAIVPILAILVSLS SWFGAEDYVINQIKDIAPLKGGTLELLTDFSNNLLMNARSGVLAGVGFLFLGWTFIQMFS LIEESFNEIWHIKKSRSLIRKISDYISFFIFLPLLFITLNGLTLLLLSKIKETIFLYYIV KNILPLISMTIFFMALYLVMPNTMVKIIPAFIASVIVSIAFLLFQYIFILLQFLLIGYDT VYGGFSVIFIFLIWVRICWFIVILGVHITYLIQNANFDINIENDNINISFNSKLYITFKV LEEIIKRYLNNQSPPNMSDLRKVTTSSPFLIENVLDDLIRGGYVLSSRDYSEKVFCIAKN IEEVSLKEIYDFIANTGEEIYILQDGKITDNIEKIIIDKDYSRTLKSLGGESAEEN >gi|296154652|gb|ADVK01000025.1| GENE 13 11488 - 11817 459 109 aa, chain - ## HITS:1 COG:no KEGG:FN1153 NR:ns ## KEGG: FN1153 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 12 109 12 109 109 155 98.0 6e-37 MKKVLVLLALLSMTCGATEILSEYYVMEKVLPLLTEAESYVVNGQEVKAIKVDNKVLKAL 
STTDDPFYYYNSVKEKKMVRLGDYILTPVTFSSIDSASSSYFNNNFIKK >gi|296154652|gb|ADVK01000025.1| GENE 14 11837 - 13027 1444 396 aa, chain - ## HITS:1 COG:FN1152 KEGG:ns NR:ns ## COG: FN1152 COG0436 # Protein_GI_number: 19704487 # Func_class: E Amino acid transport and metabolism # Function: Aspartate/tyrosine/aromatic aminotransferase # Organism: Fusobacterium nucleatum # 1 396 1 396 396 767 99.0 0 MRISDRVKNMKYSAVRKLAPLAAEAEKKGIKVYRLNIGQPNIETPKLFFEGLKNIPDHVI RYADSRGISILLEQVIEVYARDGHILKKEDIIVTEGGSEALTFAMLAICNPNDEVLIPEP FYSNYKSFLDIAGAKIIPIPTDIKNDFALPKKEEIQKLITSKTKAILYSNPCNPTGKVYT EEEVKLLADLAAENDLFVIADEPYREFIYDDNDKHYSLLDIEKAKENVIIIDSVSKHYSA CGARVGFLISKNKDFMTYIMKLCQARLAAPTVEQYAVASLMKAPKEYFKEIKEIYKRRRD IIVNSLNKIEGVTCSTPKGAIYAFAKLPVESSEDFCKWLLTEFVYDNSTVMLAPGEGFYE TEGLGKNEVRFSFCVGENDIEKAMRVLEEALKVYKK >gi|296154652|gb|ADVK01000025.1| GENE 15 13185 - 14531 1588 448 aa, chain + ## HITS:1 COG:FN1151 KEGG:ns NR:ns ## COG: FN1151 COG0534 # Protein_GI_number: 19704486 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 448 1 448 448 724 99.0 0 MNIKEFLFEDKKLIKRIFKIAIPSLFDLLAQTLISTSDMIMVSSIGASAISSVGVGSAAF NAIIPALIAVAIGTTAILSRAYGAKNREEGQKALMQSYFIAIPIGIILMLSFFFFAKPII EIVGNAKDLNLEDAIIYQKTTAIGFVFLSIGITTFYAFRALGKNKIPMIGNTMVLIVNII FNYLFIYILKWGVFGAALATSIARGSVVIMCIYLIFVNKRQWISLNIKKMKFDYFIAKRI IKVGIPAAIEQLALRFGMLIFEIMVISLGNLNYAAHKIALTAESFSFNLGFAVSLAATAL VGQELGKNSPKNALKNGYICTIIGLIIMSTMGLLFFIAPNLLISLFTDDPQVVSLSTMAL RLVSICQPFLAISMILSGALRGAGATRSVLFITFFGIFLVRIPITYVFLYIFNMGLAGAW IVMTIDLIFRSSLCFYTFKKGKWKYLKV >gi|296154652|gb|ADVK01000025.1| GENE 16 14541 - 14675 84 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSRANLAVSERSEFSEFAANVNFLSLRNLASNELFFIIFYENSQ >gi|296154652|gb|ADVK01000025.1| GENE 17 14791 - 17502 2422 903 aa, chain + ## HITS:1 COG:no KEGG:FN1150 NR:ns ## KEGG: FN1150 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 903 1 903 903 1361 95.0 0 
MNKLIFNYLPYNNQNQNELYSKIDKIINEDSSNDTLVVVESGMAQKHYFAYVNKSKLLVK NNIIGFEDFLDRIFLSNKKVLGDIKRFFLFYSCLKEDTKNKLNINNYFECIEIADDFFEF FSYIKNKDMLKFLNLSKWQEEKFEIFFEIKEEMDKFLDENSYIPSDWLYSLENLDLSFIK KYKKIVFYDIVDFPHNFLEIINSIQSVCEVEILLQMENKDFDMENLKLNKVSLPDKKIDV KLSKYTNDLELHTMIRTNKYDGYFSTDLNKEDRYSIFIKSNKFYLNDTKFYQVIETYLNL LNGIDYKNKKYIDIFLVKENIFKNAFMSFYGLDIEDYRCFEKIISNDYRYISLELLNTDY YSYYLEDNENLKIKLKLIFETLNDIEKIKDINSLNELLCNKFFSSKTDIDFFIEEKFDTL YDKIYEILGLLNSNENIDFFKNFNKFFKSSLGKNIFTLFFNYLNRIVIYSIQKNKNKKNE LKDLDLIKYSVKNIENPAIIYTDSQTLPKIKINNNLFTEQQKIKLGLKTNEDEILTQKYR FFQNLLSLNKIDIYSLVDKDNNIDFSAFVYEFINKYTALENNMDNLKQYFKAIYSKEKTE NFSKDKTFFRAYSKDKSDFKDNILKIGAYDYIELKKNETFFFLDKICGIESSDEIVADIG ISAKVLGNILHKTLEEIFRENWKNILQSSENLLISTDIIEKYLKRSIWKEELKIEIFMEN YINEVLMPRLVSNIEKFFKVLYEELKGEKILRIEAEKKSKTQDKAYLKYDEIEVFLNGRA DLLIETSKARYIVDFKTGGYKREQLEFYAIMFYGSDNSLPVYSTAYNFWDEQEAKNFKFE KHLIDKLPEKDSQFKELLIEFFKNKYYVLPKKSALKESEYDFNEYYRYKNIIPLDKMTGD IDE >gi|296154652|gb|ADVK01000025.1| GENE 18 17495 - 20665 2975 1056 aa, chain + ## HITS:1 COG:FN1149 KEGG:ns NR:ns ## COG: FN1149 COG1074 # Protein_GI_number: 19704484 # Func_class: L Replication, recombination and repair # Function: ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) # Organism: Fusobacterium nucleatum # 1 1056 1 1056 1056 1598 96.0 0 MNKIKNLVLKASAGTGKTYRLSLEYIIALCKKGDIEPIDYKNILVMTFTRKATAEIKEGI LNKLSEFMEIYEISKNSELSVIETISDNKLIDNKKRNNYLNLIESIKNIEPKLDIDNNFL ENLSKVNKEIIKNKEKLKIYTIDAFFNIIFKNIVTNLMKIKSYTMLDEEDNFSYYKKVLE NIFNNEKLFNDFKNFFTENSEKNIDNYISIIQRLISSRWKYILSLNDNPKPTKKEKFSIT KSSIEILREIFSYIENDCKKDLDDVLKNDYKKYIGKTEETQKEFLFKDFKLLFKSGTTGL IYNGNKLKKASDAEHKEYINARHEELREILSKEIFNEVLIPYEEKIFELSSEIYNLYDSF KIRDKKFTFSDIAIYTYMAIFNKNNALRDENGLTDIFFETLDMNIEAIFIDEFQDTSILQ WKILYEFTKKAKTVICVGDEKQSIYGWRDGEKRLFENLETILEAKEDSLDKSYRSDRNIV SYCNQFFKAIEKIEDWKFPTSEVNSKNDGYVKAICIKDLQDKIEDEEQKKELNINTVLLQ ELKNFEPYDNVAIIARTNAELSEIANLLEDEKIPYILNNEKNISEYSGIFECFELLKYLV YENELALFNFISSPLSNFGTNEIEILLKNKKEVFSYINFSQDNNFINSLDKKIIRFLEKI 
VFLKKNYKKFTVQDLIFEIIKKFQFIDYFNKDNEVKNIYDFYLLINYYSSILELLNDYKE NKLSLSDTNSETKGVELVTIHKSKGLEFKTTFVIKNSKKSKTDDIDFLFEMNDKYDKTVF SLFCKKGYKPILKTCFEERIENYDKKIKEEEINNFYVALTRPKNNLIVIYEDRLFEENPL NESNIDDFFNCELGKISLDEKKSKTEDIIEKNLENDLYNSQSYFSSSIYENEEEIKNIEV NESKFLLETEEKRMIGILVHYFFENLKYGTEEEVEFSKTLCYKKYLSYFGEEKLNKIFSK ENIEMFLTKDKEIFSKKWDYIYSEYILYDYEEKKEYRIDRLMIKDNGNGTGEIYIVDFKT GGKNQNQLDTYRDVLKKNFEELQNYNIKTKFLEFDI >gi|296154652|gb|ADVK01000025.1| GENE 19 20708 - 21880 1551 390 aa, chain - ## HITS:1 COG:FN1148 KEGG:ns NR:ns ## COG: FN1148 COG1301 # Protein_GI_number: 19704483 # Func_class: C Energy production and conversion # Function: Na+/H+-dicarboxylate symporters # Organism: Fusobacterium nucleatum # 1 390 1 390 390 605 97.0 1e-173 MEKEKKGDTLIIKLVLGVIAGIIIGLVSNEQVISVILPIKFFLGELIFFVVPFIIIGFIA PAITQLKSNASKMLLTMLGLSYLSSVGAALFSATAGYVLIPKLNIVSSVEGLKELPEILF KVQIPPAISVMGALVLALLMGLAVVWTNSKRTEELLNEFNNIMLTIVNKIIIPILPIFIA ATFATLAYEGSITKQLPVFLKVIVIVLIGHYIWITILYTIAGIISGKNPWSLLKHYGPAY MTAVGTMSSAATLPVSLKCVKKSGVLDEEITNFAIPLGATTHLCGSVLTETFFVMVVSKI LYGDVPPVGTMILFIVLLGIFAVGAPGVPGGTVLASLGLIISVLGFDETGTALMITIFAL QDSFGTACNITGDGALALILNGIFRKKQEN >gi|296154652|gb|ADVK01000025.1| GENE 20 22065 - 23348 1922 427 aa, chain + ## HITS:1 COG:FN1147 KEGG:ns NR:ns ## COG: FN1147 COG3681 # Protein_GI_number: 19704482 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 17 427 1 411 411 776 99.0 0 METKIEKVLKILEEEIVAAEGCTEPIALSYAAAKARRILGTVPNKVDVFLSGNIIKNVKS VTIPNSEGMIGIEPAIAMGLIAGDDKKELMVISDTTHEQVQEVRDFLDKKLIKTHVYPGD IKLYIRLEISNGENNVLLEIKHTHTNITRILKNDKVLLSQICNDGDFNSSLTDRKVLSVK YIYDLAKTIDIDLIKPIFQKVIRYNSAIADEGLKGKYGVNIGKMILDNIEKGIYGNDVRN KAASYASAGSDARMSGCALPVMTTSGSGNQGMTASLPVIKFAAEKNLSEEELIRGLFVSH LITIHVKTNVGRLSAYCGAICAASGVAAALTFLHGGSFEMVCDAITNILGNLSGVICDGA KASCAMKISSGIYSAFDATMLALNKDVLKSGDGIVGVDIEETIRNVGELAQCGMKGTDET ILGIMTK >gi|296154652|gb|ADVK01000025.1| GENE 21 23371 - 23862 618 163 aa, chain + ## HITS:1 COG:FN1146 KEGG:ns NR:ns ## COG: FN1146 
COG2849 # Protein_GI_number: 19704481 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 36 163 1 128 128 226 99.0 1e-59 MRKKFKMLLLAIGVLTLFSACSSYYTREEEYARNMMLVLSKTTSSTTSFEKRWKTTNKAA IVDTFKNGEKDGEFRRYYLNGNLLMRGYCKAGKVDGIWEDYYPNGKVLMSGYMKGNKEIG NWKYYNESGQLLGEVPYDQIPKTIKDVREKNVDEFWKDIKSGK >gi|296154652|gb|ADVK01000025.1| GENE 22 23927 - 24076 65 49 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328162|ref|ZP_06870693.1| ## NR: gi|296328162|ref|ZP_06870693.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 49 1 49 49 88 100.0 1e-16 MFTYKKYPNTNILACWGNLTKKGNPRHSLYASIDANLEVFNIENYIENL >gi|296154652|gb|ADVK01000025.1| GENE 23 24304 - 25983 2381 559 aa, chain + ## HITS:1 COG:FN1145 KEGG:ns NR:ns ## COG: FN1145 COG1164 # Protein_GI_number: 19704480 # Func_class: E Amino acid transport and metabolism # Function: Oligoendopeptidase F # Organism: Fusobacterium nucleatum # 1 559 1 559 559 1034 99.0 0 MKFKDMPYIRPDMEKVKKIFKEIKEKIENANSANEQIKIIEEFADFKKDLYTTIEIAYVR LSIDTTDEFYENENNFFDENKPIIETLNTEVSRVIYNSKFRDELEKRFGKHYFKLLECKL VLNEKAIPFMQKENALSTKYDKIIANSKIIFRGKEYTVSQMPPLLQNPDREFRKEAYQAR AKFFEDHQEEFDNIYDEMVKVRTEMAHVLGYENYIDLQYKLLNRTDYNHKDVAKYREKVL KTLTPLAIKIREKQAERLGIKDFKYFDEACDFRDGNSNPNGDVDFIVKNAQKMYSELSSE TGKFFDFMIENELMDLVAKPKKRVGGYCISFDKYKSPFIFSNFNGTKGDIDVITHEAGHA FQCYMSQYQLLPEYIWPTYDAAEIHSMSMEFLTWPWMELFFGKNANKFKYSALKGALTFI PYGVTIDHFQHYVYENPNATPEERRKKYHELELMYVPDLEYDNDFYNSGAFWFSQGHVFW APFYYIDYTLAQVCAFQYLLKYLDNKEETLKEYITLCKAGGSESFFKLLEIGKLKNPMNT DILEEITPKLEELLNSIKI >gi|296154652|gb|ADVK01000025.1| GENE 24 26062 - 26832 1066 256 aa, chain + ## HITS:1 COG:no KEGG:FN1144 NR:ns ## KEGG: FN1144 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 256 1 249 249 400 98.0 1e-110 MRNSLKKLLFVVLTVFSVIFIGACGKSKIDKKEVIEKFIAASESMKSGDMLVNMKMTQNQ 
NGNKNNMEMTMDVSLILEPIAMKMAMAIPSQNLKMNSFIKDNTMYIQNPVDNQWVKQSLT DEIAGQFKGYMTNSDATYDAMRNNIDKVDIDEKDGNYIISISKDSEFLKEAMKKQLANTN TAAGQIGNDVKIENIAVKYIVDKNTYLLSSSVVSFDFEMQGMKISMEMDAKMSNINNVTD IVVPEEALNAKEIPHQ >gi|296154652|gb|ADVK01000025.1| GENE 25 26900 - 27724 1131 274 aa, chain - ## HITS:1 COG:FN1143 KEGG:ns NR:ns ## COG: FN1143 COG0363 # Protein_GI_number: 19704478 # Func_class: G Carbohydrate transport and metabolism # Function: 6-phosphogluconolactonase/Glucosamine-6-phosphate isomerase/deaminase # Organism: Fusobacterium nucleatum # 1 274 1 274 274 538 99.0 1e-153 MRFIVTGNKRAADWGAVYIVKKIKEFNPSPEKKFVLGLPTGSTPLQMYKRLVQFNKEGII SFKNVITFNMDEYVGLPKTHPQSYHYYMYNNFFNHIDIDKENVNILNGMAKNYKEECRKY EEKILEVGGIDLFLGGVGVDGHIAFNEPGSSFKSRTREKQLTEDTIIVNSRFFNNDITKV PQSALTVGVSTIMDAKEVLIMVEGNNKARALHMGIEEGINHMWTISALQLHEKAIIVADE DACAELKVATYKYYKDIEKKNYNIDKLIENLYKK >gi|296154652|gb|ADVK01000025.1| GENE 26 27742 - 28656 939 304 aa, chain - ## HITS:1 COG:FN1142 KEGG:ns NR:ns ## COG: FN1142 COG1242 # Protein_GI_number: 19704477 # Func_class: R General function prediction only # Function: Predicted Fe-S oxidoreductase # Organism: Fusobacterium nucleatum # 1 304 1 304 304 572 98.0 1e-163 MIRKIYTLNDFLKEKFNEKIYKVSLDGGFTCPNRDGKFSKGGCIFCSENGSGDFTSGKLK SIHQQIDEQIELVSKKYKGDKYIAYFQNFTNTYADINYLRKIYEEALSHKNIVGLAIATR PDCLEDDILKLLDELNKKTFLWIELGLQTINDKVAKYFNRAYETKIYEEASQKLNKLNIK FVTHIIIGLPKEKKDDYLKTAIFSQNYGTWGLKLHLMYVVKNTPLEKLYQSGNLKVHTKE EYIEKIVNILENISPEIVIHRMTGDGDRETLVAPLWSIKKIDVLNSIHKELKKRNTYQGR LYIK >gi|296154652|gb|ADVK01000025.1| GENE 27 28971 - 29741 1136 256 aa, chain + ## HITS:1 COG:FN1141 KEGG:ns NR:ns ## COG: FN1141 COG2116 # Protein_GI_number: 19704476 # Func_class: P Inorganic ion transport and metabolism # Function: Formate/nitrite family of transporters # Organism: Fusobacterium nucleatum # 1 256 1 256 256 446 98.0 1e-125 MADGHKTPSELVDYMIKTGIDKATKPLFKLMLLGIFGGAFIALGGAGNIISGSTLVKTDP GLAKFVGACVFPVGLIMVVILGSELFTSNCLLTVAYTNKKITFTQLIRNIVTVYLFNYVG 
SFIVAYITVKGGSFNADSLNYLQDIATHKVHSTAYALFIKGILCNVLVCGAVLLSYTAKD TIGKLFGAWFPIMLFVLIGYDHSIANMFYLTAAKLVDSSFEVSLILYNLFYVTLGNFAGG MVIGLPLYFCYYKKQD >gi|296154652|gb|ADVK01000025.1| GENE 28 29791 - 31014 1344 407 aa, chain - ## HITS:1 COG:FN1140 KEGG:ns NR:ns ## COG: FN1140 COG3581 # Protein_GI_number: 19704475 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 407 1 407 407 805 99.0 0 MNKNCKVLIPMMMDIHFDLIAGVLKNEGYDVEVLKTDHRGVIEEGLKSVHNDMCYPALLV IGQFIDALKSGKYDINNVALLITQTGGGCRASNYIYLLRKALEINGFHQVKVWSLNFEGL DKKNEFTLSFSAYFNLFYSILYGDLLMSIYHQSVAYEKNSGDSKGVLTYWKDKLISEIGT KTFKKLKENYKKIIENFLTIPKNSDKKKIRVGIVGEIYMKYSPLGNNHLTDYLEKEGVEV VNTGLLDFLLFNLYDTIFDRKIYGRKGLKYYFVKYIVRYIEKKQKEMIEVIKRYKAFIPP SPFSKVIEMTKGYLGHGVKMGEGWLLTAEMLEFIEIGVKNIVCAQPFGCLPNHIIAKGMI RKIKDNHPEANIIAVDYDPGASSVNQENRIHLMLENARMMASQGVKK >gi|296154652|gb|ADVK01000025.1| GENE 29 31016 - 33943 3133 975 aa, chain - ## HITS:1 COG:FN1139_1 KEGG:ns NR:ns ## COG: FN1139_1 COG1924 # Protein_GI_number: 19704474 # Func_class: I Lipid transport and metabolism # Function: Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) # Organism: Fusobacterium nucleatum # 1 640 1 640 640 1274 99.0 0 MHYKIGIDVGSTTLKTVILNEKDEIIEKSYQRHFSKVREMTLNHFKSLQELLKGKKFKLA ITGSAGLGISKDYGIPFVQEVFSTAGAVKKCYPQTDIVIELGGEDAKILFLQGSIEERMN GTCAGGTGAFIDQMATLLDMEVSELDKISFAHERIYPIASRCGVFAKTDVQPLLNQGAKK ADIAASIYQAVVEQTITGLAQGRPIKGTVLFLGGPLYFLKGLQERFVEVLKLSKEEAIFP ELAPYFVALGSAYFADTVDEEFEYDEVVQLLSQKKEKKVEHLEKPLFTSEEEYEAFLKRH KKISVPTRDITTYSGKAYLGLDSGSTTIKVVLLDEEENILYRYYSSSKGNPVSLFLEQLK KIRELCAERIEIVSSAVTGYGEELMQVAFGVDIGIVETIAHYTAAKHFNPDVDFIIDIGG QDIKCFHIKDGAIDSIVLNEACSSGCGSFLETFAKSLGYSMQDFAKKAIFSKSPAELGSR CTVFMNSSVKQAQKDGAETEDISAGLARSVVKNAIFKVIRARDVNELGKHIVVQGGTFLN DAVLRSFEQEIGREVLRPEISELMGAYGAALYGKKIQKEKSKLLNLTELENFQHISSSGM CKLCTNHCQLTINTFTNGQKFISGNKCERGAGKKLQSDLPNMVAYKNQLFNAIPLKGGGR ARIGLPRVLNIYEMLPFWAELFRSLNCDVVLSSVSNRKIYMKGQNTIPSDTVCYPAKLVH 
GHIIDLLEKDLDAIFYPCMSYTFDEGISDNCYNCPIVAYYPELIQANIPDITKTHFLYPH LGIENHSLFAERMYEEFKNIIPKLSKKEMEKATENAFKTYYEYRENIRQEGTRILKFAME NNHPVIILASRPYHIDPEINHGLDRLLNSLQFVVVTEDALYPVEGKLTIKILNQWGFHAR MYNAAKYVSQHKNMELVHLVSFGCGIDAITTDEIHDILRSNNKLYTQLKIDEVNNLGAAK IRLKSLQATMKEREM >gi|296154652|gb|ADVK01000025.1| GENE 30 34284 - 34706 808 140 aa, chain - ## HITS:1 COG:FN1138 KEGG:ns NR:ns ## COG: FN1138 COG3576 # Protein_GI_number: 19704473 # Func_class: R General function prediction only # Function: Predicted flavin-nucleotide-binding protein structurally related to pyridoxine 5'-phosphate oxidase # Organism: Fusobacterium nucleatum # 1 140 4 143 143 273 100.0 9e-74 MAKLTDAIKDLILNPVKEGAWTAQLGWIATVREDGAPNIGPKRSCRIYDDATLIWNENTA GEIMKDIERGSKVAVAFANWDKLDGYRFVGTAEVHKEGKYYDEVVEWAKGKMGAPKAAVV FHIEEVYTLKSGPNAGTRID >gi|296154652|gb|ADVK01000025.1| GENE 31 34847 - 36415 1517 522 aa, chain - ## HITS:1 COG:FN1137 KEGG:ns NR:ns ## COG: FN1137 COG3639 # Protein_GI_number: 19704472 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type phosphate/phosphonate transport system, permease component # Organism: Fusobacterium nucleatum # 1 522 1 522 522 728 99.0 0 MTLEKFIKLHKLKTFFKILTIVIVLLLFFFTLNLDFQDYIDGFTRLKGLVVSMMRIDTED KKIVLFKMFETIVTAFASSFIGVVLAVLCSPFLATNISNKYLARFLTICFSIFRTIPALV MAAILVSLIGIGSFTGFISLLIITFFSATKLLKEYLEEINQAKIQSFRTFGFSKFTFLKS CIYPFSKPYIISLFFLTLESSIRGASVLGMVGAGGIGEELWKNLSFLRYDKVSFIILILL IFIFLTDSLSWFFRKKDSLIKITTYQGYKKNKVISKLITCLILILLVYSLNILYEDTNKI SLPIFFERLLVFLKKLTYLDFSYTPKVLVALWQSFLVAFFATFFAAPTAIVISYFASSVT SNKKIAFIIKIFINFIRTFPPVIVAILFFSGFGPGLISGFFALYIYTTGVITKVYVDVLE SIETDYGLYGRSLGLKNFYTYLRLWLPSTYTNFVSIFLYRFESNMKNSSVLGMVGAGGIG QLLMNHIAFRNWEKVWVLLIFLIITIILIENLSEYIRNKVNS >gi|296154652|gb|ADVK01000025.1| GENE 32 36384 - 37127 240 247 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 226 1 219 223 97 27 3e-19 
METIIEVKNLVKNYGDKQILKNISFNINKGEIISIIGESGAGKSTLMRCLNGLEGINSGS IKFYDTDITKLKEKEKNSIKKRMAYVFQDLNIIDNMYVIENVLVPFLNRKNFIQVLFNQF SKQEYERALYCLEKVGISKLAYTKAKYLSGGEKQRVAIARSLAPNVDLILADEPISSLDE KNSAQIMEIFKRINIKKNKTIILNLHNVEIAKKFSDKILALKNGEIFFYKKSAEVNEDDI RKVYQTS >gi|296154652|gb|ADVK01000025.1| GENE 33 37205 - 38080 1376 291 aa, chain - ## HITS:1 COG:FN1135 KEGG:ns NR:ns ## COG: FN1135 COG3221 # Protein_GI_number: 19704470 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type phosphate/phosphonate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 10 291 1 282 282 514 98.0 1e-146 MKRVWKLLTLVSLIFLLISCGKKKEEKPLIMGLSPIANSEKLIEDTAPLHKMLGDEIGRP VEGFIATNYIGVVEALGTGTIDFALIPPFAYILANKKNGTEALLTSINKHDEPGYYSVLL VRTDSRIEKVEDLKGKKVAFVDPSSTSGYIFPAVILMDHGINVEQDITYQFAGGHDKALQ LLINGDVDVIGTYESAITKFAKEFPEVTEKVKILQKSDLIPGITLVVSSKVDDATKQKIK AAFLKVTATKEGQELTLKLFGIKGFEEANVDNYKLIEDKLNKMGIDIEKIK >gi|296154652|gb|ADVK01000025.1| GENE 34 38222 - 38677 499 151 aa, chain - ## HITS:1 COG:FN1134 KEGG:ns NR:ns ## COG: FN1134 COG2731 # Protein_GI_number: 19704469 # Func_class: G Carbohydrate transport and metabolism # Function: Beta-galactosidase, beta subunit # Organism: Fusobacterium nucleatum # 1 151 5 155 155 268 99.0 2e-72 MIYAKLKNIKTYKGINKNLDKAIDFIIEKKYLNASFGKNIIEGDTIYFNCPEKPVTRENT DLELEYHKKYIDIHIVLEGEENIVYTPFEDCIETQSYNIEGDYGLVKGKAQVEFIMNPKN FLLFFPEEPHLALLKVDTPKEIKKIIFKVEI >gi|296154652|gb|ADVK01000025.1| GENE 35 38805 - 39965 1836 386 aa, chain + ## HITS:1 COG:FN1133 KEGG:ns NR:ns ## COG: FN1133 COG1820 # Protein_GI_number: 19704468 # Func_class: G Carbohydrate transport and metabolism # Function: N-acetylglucosamine-6-phosphate deacetylase # Organism: Fusobacterium nucleatum # 1 386 1 386 386 722 100.0 0 MKKILLKNAKLVLENKLINGSILIFKNKIEKIFTDNDNLSEFIFDEVIDLKGKYLGPAFI DVHTHGADGADAMDGNEEALRKISSYLVKEGTANFLATTLTSTKEILKDVLEVVANLQDK DIEGANIFGVHMEGPYFAIEYKGAQNDKYMKPAGIKELEEYLSVKDGLVKLFSISPHNQE NLEAIKFLADRGVVASVGHSGASYEAVMKAVDYGLSHATHTYNGMKGFTHREPGVVGAVF 
NSDNIMAEIIFDKVHVHPEAVRTLIKIKGVDKVVCITDSMSATGLAEGQYKLGELDVNVK DGQARLVSNNALAGSVLRMDIAFKNLIELGYSITDAFKMTSTNAAKEFKLNTGILKEGKD ADLVVLDKDYKVCMTMVKGKIKFTNL >gi|296154652|gb|ADVK01000025.1| GENE 36 40013 - 40594 725 193 aa, chain - ## HITS:1 COG:FN1132 KEGG:ns NR:ns ## COG: FN1132 COG1057 # Protein_GI_number: 19704467 # Func_class: H Coenzyme transport and metabolism # Function: Nicotinic acid mononucleotide adenylyltransferase # Organism: Fusobacterium nucleatum # 1 193 1 193 193 306 97.0 1e-83 MKIAIYGGSFNPMHIGHEKIVDYVLKNLDMDKIIIIPVGIPSHRENNLEQSDTRLKICRE IFKNNKKVEVSDIEIKSEGKSYTYDTLLKLIEIYGKDNDFFEIIGEDSLKNLKTWRNYKE LLNLCKFIVFRRKDDKNIEIDNEFLNNKNIIILENEYYDISSTEIRNKVKNNEDISGLVN KKVKKLIEKEYID >gi|296154652|gb|ADVK01000025.1| GENE 37 40591 - 41364 739 257 aa, chain - ## HITS:1 COG:no KEGG:FN1131 NR:ns ## KEGG: FN1131 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 257 1 257 257 361 94.0 2e-98 MKKDSKVEFLREKNLEKAIELIKEKGKFTILSEYSTFFDMRTYFKVNEDGDIFQKSYNPI TLLYLFCDDKKNLAEYLFKYSYPEEKQNIKKIDRASNLDIETLKKNVMKTLVNSHLDFSK IFAKELFLRDKKSFFEVMYSFSLMGNPKNLKLFFVYALEEIFSQINYDENIFYIIIAYLT KFRDDYSTYMEVDENNLNFDSSNYSDDKKIYINIFEKVLEKYNLKNKNRFKITLYKYFEK DFTLNQDLKNILMEKMI >gi|296154652|gb|ADVK01000025.1| GENE 38 41370 - 42374 1154 334 aa, chain - ## HITS:1 COG:FN1130 KEGG:ns NR:ns ## COG: FN1130 COG1663 # Protein_GI_number: 19704465 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Tetraacyldisaccharide-1-P 4'-kinase # Organism: Fusobacterium nucleatum # 10 334 1 325 325 628 98.0 1e-180 MRLLSYIYLLITTIRNFLYDEKILPIRKVPGVEVICIGNVSVGGTGKTPAVHFFVKKLLA RGRKVAVVSRGYRGKRKRDPLLVSDGMVIFATPQESGDESYLHAINLKVPVIVGADRYKA CMFAKKHFDIDTIVLDDGFQHRKLYRDRDVVLIDATNPFGGGYVLPRGLLREDFKRAVKR ASEFIITKSDLVNERELKRIKNYFIKKFHKEVSVAKHGISKLCDLKGNMKPLFWVKAKKL MIFSGLANPLNFEKTVISLAPAYIERLDFKDHHNFKTKDIALIRKKAEKMDADYILTTEK DLVKLPDNLNISNLYVLKIEFTMLEDNTLKNMEG >gi|296154652|gb|ADVK01000025.1| GENE 39 42402 - 45983 4570 1193 aa, chain - ## HITS:1 COG:FN1129 KEGG:ns NR:ns ## 
COG: FN1129 COG1196 # Protein_GI_number: 19704464 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Chromosome segregation ATPases # Organism: Fusobacterium nucleatum # 1 1193 1 1193 1193 1701 99.0 0 MLIILELGEDMYLKAVEINGFKSFGDKVYIDFNRGITSIVGPNGSGKSNILDAVLWVLGE QSYKNIRAKESQDVIFSGGKEKKPATKAEVSLIIDNADRYLDLDNDTVKITRRIHISGEN EYLINDTKSRLKEIGTLFLDTGIGKTAYSVIGQGKVERIINSSPKEIKSIIEEAAGIKKL QANRIEAQKNLANIEINLDKVEFILNETRENKNKIEKQAELAQKYIDLRDEKSSLAKGIY ITELEQKEKNLSENENIKEKYQTECFELQEKLNKTLERLNTIDLEKEEVKKEKLLIDSRN KELRNIISEKEKEKAVTSERLDNVKKEKLVKEEYILHLDNKIEKKLEEVTESKNKKDEIS KNIVEMAAANKEFENKIFNLENIKVEKSDLIENRAKKVRDLELEKQLASNEIENNEKKLK SSQDEVENFKQELEEANKKLLANNEEKDLVHSQLEARKEELTKTEERNEFLVNQLSEISK SINKLSQDIREFEYQEKTSSGKLEALVRMDENNEGFFKGVKEVLNSGISGIDGVLISLIN FDEKYEKAVEAAIPGNLQDIIVEDKEVAKKCIAFLTEKKLGRASFLALDTIKPNRREFRA NINGVLGLAADLITADKKYQKVIDFIFGGLLIVENIDIATDILNKNLFSGNIVTLTGELV SSRGRITGGENQKSTINQIFERKKEIKTLEEKVTDLKSKITEGSKKREDLSIKLENYENE VDKIDSLEDSIRKDIDLLKKDFESLSEKSEKLSKDIRSISFNIEDAEKYKTSYQDRINSS FSTIEETEKHIASLKKDIEADENLLKQTISEIDSLNKQFSDTRILFLNNQSTIEQLEKDI HSKEIENVELQEEKEKNSKIVIELSHNIEELETLEEELQSQIEEHTKIYNSENRDIETLN EREQNLSNEERELSKDKSKLETDSLHANDRFEKIVEVIEKIKVDILNINEKLNELVEITA QVIEVEKLKSSKDRLRSLENKINNFGDVNLLAINEFKELKERYDYLARERDDVVKSRKQV MDLIQEIDERIHEDFHTTYQNINENFNKMCDETIRNTEGRLNIINPEDFENCGIEIFVKF KNKKKQPLSLLSGGEKSMVAIAFIMAIFMYKPSPFTFLDEIEAALDEKNTKNLLGKLRDF TDKSQFILITHNKETMKESNSIFGVTMNKEIGISKIVSPDKITKILSENKENN >gi|296154652|gb|ADVK01000025.1| GENE 40 46102 - 48084 2448 660 aa, chain + ## HITS:1 COG:FN1128 KEGG:ns NR:ns ## COG: FN1128 COG1506 # Protein_GI_number: 19704463 # Func_class: E Amino acid transport and metabolism # Function: Dipeptidyl aminopeptidases/acylaminoacyl-peptidases # Organism: Fusobacterium nucleatum # 1 660 1 660 660 1241 97.0 0 MENLKLDSFLEYKFLSNLDFNPDGSNLAFSISEADLEKNSYKHFIYNLNTKNKEIKKLTH SGKEKNSLWLNNNTILFSADRDKDIEEKKKLGETWAIFYALDIKNGGEAYEYMRLPIDVI EIKIIDENNFILTADFDNNSLNLNDLKGEEREKAIKQIEENKDYEVLDEIPFWSNGNGFR 
NKKRNRLYHFDKLNNKLTPISDEYTNIEDFNIKENKVVFIGRTYKDKQTLTAGLYTYDIK NNKLETIIPDNLYDISYANFIEDKIICALSDMKEYGINENHKIYLIDSNKNINLLYENDT WLACTVGSDCRLGGGKSFKVIGNKLYFLSTIADSVHLSSLDTNGKVEVLSSENGSIDFFD IANNEIYYVGMRNYSLQEIYKLENNSSIKLSSFNEKINKKYKISKPEVFDFITNGDTTKG FVIYPIDYDKNKTYPAILDIHGGPKTVYGDVYYHEMQVWANMGYFVIFTNPHGSDGYGNK FADIRGKYGTIDYEDLMNFTDYVLEKYPIDKSRVGVTGGSYGGYMTNWIIGHTDRFKCAA SQRSISNWISKFGTTDIGYYFNADQNQATPWINYDKLWWHSPLKYADKVKTPTLFIHSEE DYRCWLAEGLQMFTALKYHGIEARLCMFRGENHELSRSGKPKHRIRRLTEITNWFEKYLK >gi|296154652|gb|ADVK01000025.1| GENE 41 48110 - 49930 2224 606 aa, chain + ## HITS:1 COG:FN1127 KEGG:ns NR:ns ## COG: FN1127 COG4907 # Protein_GI_number: 19704462 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 567 1 567 606 1037 99.0 0 MKKNILRIFLFLIISIISFSASFSISGLDVEAKLQKDGSMLVSEAVTYDIDEINGIYFDI DAKGYGGITSLQVFEDEGHYENNVISYREVDPVNYEVTENDGVYRIKLYSKNYNNVRTFK FVYTLPEAIKVYNDVAQLNRKMVGQDWQQGISTVKVTIELPVSNDYDNSKVLVFGHGPLT GEVDKIENTVVYRLDDYYPGDFLEAHILMEPVIFSEFNVSNVIHKNMKQELLDMEARLAD EANEERDNALRREENIQKLSDNAKTIFGVEASIWAVLMYYIHVVFKRKNKSKNNDIKYLR DLPDDSSPALVGGVMTKSVNNNEILATIVDLIRKKVLTLETSDKKTIITLTGSTGLLSAQ EKTIIDIYINDFGDGRSLDLKSIGFFHKVPMKTAGKFEKWSSYIINEMNRKGLIYEHIGC GATLIFILLSVIFAFGGLLQTALTNNTLFMFGIPLGVVLFFSAGTAKYPSKKLAETISKW QAFKNFLSDYSQLEEAKITSIHLWEQYFVYAIALGVSDKVVKAYRKALDMGIIKDSDGIS NITYSPIFNNNFSRSFNNLNGMVSKTNSRASSAIASSRRSSSSGGGGGFSSGSSGGGGSR GGGGGF >gi|296154652|gb|ADVK01000025.1| GENE 42 49911 - 50417 591 168 aa, chain + ## HITS:1 COG:FN1126 KEGG:ns NR:ns ## COG: FN1126 COG0454 # Protein_GI_number: 19704461 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 14 168 1 155 155 263 99.0 1e-70 MEAEASKIYGDNYMKSDYDIIEIKKDNINLIEDLWEKNRIFHQNKTSNFSYQYLNLNFNE RMNNIFNSKNIKYYKISAIINKSNIVGYCLSIIQGSSGELCTLFIDEQHRNNGLGHLLVD KHLDWLKDNKCESIFVNVLVENESTISFYESLGFKQNIINMEIPLKKI >gi|296154652|gb|ADVK01000025.1| GENE 43 50426 - 
50977 864 183 aa, chain + ## HITS:1 COG:FN1125 KEGG:ns NR:ns ## COG: FN1125 COG1704 # Protein_GI_number: 19704460 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 18 183 18 183 183 301 99.0 6e-82 MIALGVIIGIIVILAGIAIGYKNKFVVLDNRVKNSWSQIDVQMQNRFSLVPNLVETVKGY AKHEKETFEGIANAKTRYMSATTPEEKMEANNQLSGFLGRLFAISEAYPELKANTSFENL QAQLVEVENKIRFARQFYNDTVTEYNQTIQMFPGSLFAGFFNYRNAELFKANEMAREEVQ VKF >gi|296154652|gb|ADVK01000025.1| GENE 44 50995 - 52179 173 394 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163756109|ref|ZP_02163225.1| 30S ribosomal protein S1 [Kordia algicida OT-1] # 254 389 213 344 347 71 31 2e-11 MKKEFLISIIVAILLVFGVKTYYDKKTTNNTQVSTEKEVSNENEMLVPGYALGEIPAITA PEMPDLSVTENPDAKITLDMTKKISSVPGISVTPVKVENSNIVGGDYSMQIGKNGDGQFT DKDKTVQTDGKGAGQYTDENVTIQRNEDGSGQYTNKITGVTLQVDAQGEGQYLDEKNKIS FQIGADGTGVYKDENNDTTITIGENNSTYTKGNITIENNGDGSGTYNDKDKELLIENDGK GKAIITLKGKTIEVEAKPLEKPEKFPKLKMVPPVPSIEANSLLITLDSGILFDVDKYDVR PEAEEVLKNLVIVLKEADIKAFEIDGHTDSDASDEHNQVLSENRANAVKNFLTSQGIMAE ITIKGYGESRPIASNDTPEGKQKNRRVEIVIPTI >gi|296154652|gb|ADVK01000025.1| GENE 45 52235 - 52780 807 181 aa, chain - ## HITS:1 COG:FN1123 KEGG:ns NR:ns ## COG: FN1123 COG0526 # Protein_GI_number: 19704458 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Fusobacterium nucleatum # 25 181 1 157 157 292 100.0 3e-79 MKKKLLVIMLFILSLTSFAIPLNNMDKDGNVTLPNIELVDQYGKKHNLEDYKGKVVMINF WVSWCSDCKGEMPKVVELYKEYGENKKDLIILGVATPISKEYPNNKDKIDKKALLKYIAD NKYVFPSLFDETGKTYAEYEIEEYPSTFIIDRNGHLKVYIKGAISKEELKQNIESVLNPK K >gi|296154652|gb|ADVK01000025.1| GENE 46 52943 - 55429 3300 828 aa, chain + ## HITS:1 COG:FN1122_1 KEGG:ns NR:ns ## COG: FN1122_1 COG1022 # Protein_GI_number: 19704457 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Fusobacterium nucleatum # 1 600 1 600 600 1057 99.0 0 
MQIVTDKNKVALYYKDIAVTYKEFILNTKKIKQFTKIKEFTNNMIYMENRPELLYSFFAI WDSRATCVCIDASSTAVELTYYIDNSDVVKIFTTKMQVEKVKGALSILSKQIEIIVVDEI NLSEIKIDENSSENLVINSPEKEDTALILYTSGTTGKPKGVMLTFDNILANVDSLDVYKM YEETDVTIALLPLHHILPLLGTGVMPLLYSATIVFLEDISSVALIDAMKKYKVTMMIGVP KLWEVMHKKIMDTINSKAITRFIFKLAKKVNSLSFSKLIFKKVSEGFGGHIKFFVSGGSK LNPQITKDFYTLGIKICEGYGMTETSPIISYTPKNNIVFDSAGKVIKDVEVKIADDNEIL VKGRNVMKGYYKNPEATAEIIDKDGWLHTGDLGKLVNDYLYITGRKKEMIVLSNGKNINP IEIETKISSMTNLISEIVVTEYNSILTAVIHPDLEKVKEEKVDNIYENLKWEVVDKYNQK TPDYKKILDVKIINEDFPKTKIGKIKRFMIADMLDGKIEKQERKPEPDFEEYNKIKKYLV DIKGKDVYFDSHIEIDLGMDSLDMVEFQYFLDLNYGIKEENLISKYPTLLELANYIKDNR NQEKIGNLDWKEIVNKDTTAKLPSSSFIAIIFKFLSFIFFKTFFRVKVKGKEKIEKDKPT IFVANHQSFLDGFLFNYSVPLKVLKKTYFLATVIHFKSSIMKFMADSSNVVLVDMNKDIA EVMQILAKLLKENKNIAIYPEGLRTRDGKMNKFKKSFAILAKELDIDVQPYVIDGAYDLF PAGKKFPRPGKIRVEFLDKIKVEKLTYDEIVNEAYSVIKNKIESDTEN >gi|296154652|gb|ADVK01000025.1| GENE 47 55596 - 57116 2038 506 aa, chain + ## HITS:1 COG:FN1121 KEGG:ns NR:ns ## COG: FN1121 COG4868 # Protein_GI_number: 19704456 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 506 1 506 506 993 100.0 0 MKIGFNHEKYLEEQSKYILERVNNHDKLYIEFGGKLLADLHAKRVLPGFDENAKIKVLNK LKDKIEVIICVYAGDIERNKIRGDFGITYDMDVFRLIDDLRENDLKVNSVVITRYEDRPS TALFITRLERRGIKVYRHFATKGYPSDVDTIVSDEGYGKNAYIETTKPIVVVTAPGPGSG KLATCLSQLYHEYKRGKDVGYSKFETFPVWNVPLKHPLNIAYEAATVDLNDVNMIDPFHL EEYGEIAVNYNRDIEAFPLLKRIIEKITGKKSIYQSPTDMGVNRVGFGITDDEVVRKASE QEIIRRYFKTGCDYKKGNTDLETFKRSEFIMHSLGLKEEDRKVVTFARKKLELLNQEKDD KNDKQKTLSAIAFEMPDGKIITGKKSSLMDAPSAAILNSLKYLSNFDDELLLISPTILEP IIKLKEKTLKNRYIPLDCEEILIALSITAATNPMAEVALSKLSQLEGVQAHSTHILGRND EQYLRKLGIDVTSDQVFPTENLYYNQ >gi|296154652|gb|ADVK01000025.1| GENE 48 57478 - 59844 1908 788 aa, chain - ## HITS:1 COG:YPO3046 KEGG:ns NR:ns ## COG: YPO3046 COG0641 # Protein_GI_number: 16123223 # Func_class: R General function prediction only # Function: Arylsulfatase regulator (Fe-S oxidoreductase) # Organism: Yersinia pestis # 108 423 11 327 391 
113 27.0 2e-24 MKNLLLNAKLIKLADECLLVLSQTQNKWFMTNITNKTVLKMMDGRHSSNEILTANPFLTE RKLDKLIATLRELNMLCINSCECNNKTHNNIKCHKNYPTHVVINLTDQCNLNCLYCYVDS SPTRSNYMKPEIAIKIASELINMNSDNEEILTVVFHGGEPTLNLNAIRAFCEYIIPYRDR FDLCIQTNGTNISDEFIKLVKKYSINIGLSIDGYKKLHDATRIDIKGKGSFDSLEKGIKV LKEEGISFGILTVLNRYNYKYTREIIDYFSSIGAKNCAFLRLTEIGRENSHPELLITGEE IFESFCNIIDWLIEYNNNNEIPFEERTISKMVKIISGGKRDYMCMRKPCGAGRDTIGIDT QGGVYPCDDMVGISKFYMGNLMESNLKSIIDNTNVLDIIDTSNNKLNMECSDCSWKSLCS EVCSAQYYSSPHETVLDEPECQFHQKLIPELLKRYFKDPTVFKLLCRDLRGYEKKEFYFN ITYTCNSHCIFCAADHDISPINNVITLDMMKDLIAFHQIQAGDIVVLNGGEPTTNPEFIN IISLFSEKKISTVAYSNGRNLSNYNYCRKLIEAGLNKISIPIFGFNSETHDYCTGVKNSF EETIAGIQNIIELRKKLCSNIIIEIKLLYIKYLLDINPEIIRWLINEFPSIDIISVNSLI VSDTVMLRKEELIPNFETWSDSVNKTLIVAKECGITDKIELNDTPYCLINDRNYDFLEDY IVRDGSYLADIKTSTYIDYENLNGQKNIPIIIEDDNICVECVMYKKCKFFNRTYGNPNVA IESLKILS >gi|296154652|gb|ADVK01000025.1| GENE 49 59957 - 60319 362 120 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328189|ref|ZP_06870720.1| ## NR: gi|296328189|ref|ZP_06870720.1| PTS family fructose porter, IIA/HPr component [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] PTS family fructose porter, IIA/HPr component [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 120 1 120 120 172 100.0 1e-41 MDKFNRKKSVELLSKLKEKDIIDVKSDSKLVNDDSSSKKGWMKEEKPWINYPGNSQTKIL IAIGIKNISATQLKELTKLGATIENGEKLIFDVDQSEKVQKILGEAFNLEDLSNSIKKKK >gi|296154652|gb|ADVK01000025.1| GENE 50 60406 - 60492 59 28 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MTTGCRFFIEIIHFNHIIDTYNIKVISL >gi|296154652|gb|ADVK01000025.1| GENE 51 61380 - 62963 1987 527 aa, chain - ## HITS:1 COG:FN1120 KEGG:ns NR:ns ## COG: FN1120 COG1866 # Protein_GI_number: 19704455 # Func_class: C Energy production and conversion # Function: Phosphoenolpyruvate carboxykinase (ATP) # Organism: Fusobacterium nucleatum # 1 527 1 527 527 1087 99.0 0 MKMYGLEKLGINNVTAAHYNLSPAQLVEKALANNEGILSDTGAFVISTGKYTGRAPDDKF FVDTPEVHKYIDWSRNQPIEKEKFDAIFGKLVAYLQNREIFIFDGRAGANPEYTRRFRVI NELASQNLFIHQLLIRTDEEYNENNDIDFTIISAPNFHCVPEIDGVNSEAAIIINFEKKI AIICATKYSGEIKKSVFSIMNYIMPHENILPMHCSANMDPVTHETAIFFGLSGTGKTTLS ADPNRKLIGDDEHGWCDKGIFNFEGGCYAKCINLKEESEPEIYRAIKFGSLVENVVVDPI TRKIQYEDASITPNTRVGYPIDYIPNAELSGVGGIPKVVIFLTADSFGVLPPISRLSQEA AMYHFVTGFTAKLAGTELGVKEPVPTFSTCFGEPFMPMDPSVYAEMLGERLKKHNTKVYL INTGWSGGAYGTGKRINLKYTRAMVTAVLNGYFDNAEYKHDDIFNLDILQSCPGVPSEIM NPIDTWQDRDKYIVAAKKLANLFYNNFKEKYPNMPENITNAGPKYND >gi|296154652|gb|ADVK01000025.1| GENE 52 63227 - 63511 492 94 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704454|ref|NP_604016.1| 50S ribosomal protein L27 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 94 1 94 94 194 100 2e-48 MQFLFNIQLFAHKKGQGSVKNGRDSNPKYLGVKKYDGEVVKAGNIIVRQRGTKYHAGNNM GIGKDHTLFALIDGYVKFERLGKNKKQISIYSEK >gi|296154652|gb|ADVK01000025.1| GENE 53 63512 - 63841 530 109 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|237742036|ref|ZP_04572517.1| 50S ribosomal protein L27 [Fusobacterium sp. 
4_1_13] # 1 109 1 109 109 208 95 7e-53 MTKVEIFRKNGSIVGYKASGHSGYSEQGSDIICSAITTSLQMTLAGIQEVLKLKPKFKIN DGFLDVDLRDISQNKFTEINILTESMALFLRELAKQYPKYIRLVEKEEK >gi|296154652|gb|ADVK01000025.1| GENE 54 63844 - 64158 516 104 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704452|ref|NP_604014.1| 50S ribosomal protein L21P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 104 1 104 104 203 100 3e-51 MYAVIKTGGKQYKVTEGDVLKVEKLNAEVNTTVELTEVLLVAGGDNAVKVGKPLVEGAKV VVEVLSQGKGPKVINFKYKPKKASHRKRGHRQLFTEVKVTSIIA >gi|296154652|gb|ADVK01000025.1| GENE 55 64210 - 64290 60 26 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MINLKLNKQYLLTNLNFYDNINRINA >gi|296154652|gb|ADVK01000025.1| GENE 56 64268 - 64513 306 81 aa, chain - ## HITS:1 COG:FN1116 KEGG:ns NR:ns ## COG: FN1116 COG3340 # Protein_GI_number: 19704451 # Func_class: E Amino acid transport and metabolism # Function: Peptidase E # Organism: Fusobacterium nucleatum # 1 81 114 203 203 124 85.0 3e-29 MLYIGESAGAIITSKDIEYNDLMDDKTIAKDLKDYSGLNLVDFYMVPHLNEFPFEESSKQ IVKKYKDKLNIIVKDDKFEIK >gi|296154652|gb|ADVK01000025.1| GENE 57 64626 - 65417 865 263 aa, chain - ## HITS:1 COG:FN1115 KEGG:ns NR:ns ## COG: FN1115 COG2215 # Protein_GI_number: 19704450 # Func_class: R General function prediction only # Function: ABC-type uncharacterized transport system, permease component # Organism: Fusobacterium nucleatum # 20 263 1 244 244 372 99.0 1e-103 MKRIIKYIVGIIAIALIYLLISNFNLIMYKIAIYQQEIVEKISKLIEKENEKIIYTMLFF TFLYGIVHSFGPGHGKTLVLTYSVKEKLNFPKLLLVSFLIAYLQGVSAYILVKFIINLSD KASMMLFYDLDNRTRLIASILIILIGLYNIYSVLRNKCCEHHHETKVKNILGFSIVLGLC PCPGVMTVLLFLESFGLSENLFLFTLSMSTGIFLVILFFGILANTFKKTLVEDENFKLHK ILSLVGASLMILFGIFQILILGE >gi|296154652|gb|ADVK01000025.1| GENE 58 65438 - 65542 139 34 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKDIIGIIIKVVGVFLLAFFGTFIIMKLILMAKS >gi|296154652|gb|ADVK01000025.1| GENE 59 65539 - 66129 439 196 aa, chain - ## HITS:1 COG:FN1114 KEGG:ns NR:ns ## COG: FN1114 COG3683 # Protein_GI_number: 19704449 # Func_class: R General function prediction only 
# Function: ABC-type uncharacterized transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 196 1 196 196 288 98.0 4e-78 MYYKKVKIKLEREKEMKFIYNIILFFIISIYTYSHPHVFFDTNIEVKIENQKLQGIELQL NLDELNTRLNKRILKPDKEMNVEQENIVFLKHLFKHIRVKYNNKTYKEDDIIFEQAKLED DSLEIYFFIPIDEKITKNSKLKIALYDTKYYYNYDYEKSSLKIDKNMRAKIKFFTNDKIK FYFNLVSPDEYEVSFE >gi|296154652|gb|ADVK01000025.1| GENE 60 66280 - 67218 888 312 aa, chain + ## HITS:1 COG:FN1113 KEGG:ns NR:ns ## COG: FN1113 COG0601 # Protein_GI_number: 19704448 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 312 1 312 312 541 99.0 1e-154 MIKYIIKRIFYLIPILIGVTFLTFLMLYLAPSDPISMKYTSMATVGDSKYIEEKKEEMGL NDSFLKQYVRWSKNVLSGDFGISTKYNVPVKDEIAKRLPKTLALTGTSVLITIFLAFPFG IISAQYKNKLVDYIIRFFSFTGISIPSFWLGLMLMYFFSVKFKLLPIIGSKGIKSLILPS ITLSVWLVAVYIRRIRACILEEINKNYVVALKSKGISYSKIMFFHILPNSLLTIVTMFGM SIGSILGGTTIIETIFEYRGLGKMAADAITNRDYFLMQGYVIWTAIIYVVINLLVDILYK YLNPRIKIGDES >gi|296154652|gb|ADVK01000025.1| GENE 61 67218 - 68030 806 270 aa, chain + ## HITS:1 COG:FN1112 KEGG:ns NR:ns ## COG: FN1112 COG1173 # Protein_GI_number: 19704447 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 270 1 270 270 423 99.0 1e-118 MKNKKIDYKFFIILTLAILIIFITIFANYLAPYNPDYQNYEAISQAPNSIYLLGTDYVGR DILSRILYGGRYSLLIALLVTFLIAFIGIVIGLISGYLGGVVDIIIMRIVDMIMSFPYIV FVIAVVTIFGGGLKNLILAMTLISWTNYARVTRAMVISLKNNDFINQAKLSGASNIRIMY KYLAPNVLPYLIVLTTQDIANNLLTLSSLSLLGIGVQPPTAEWGLMLSEGKKYIQTAPWI LFFPGIAIFICVIVFNLLGDSLRDILDPKE >gi|296154652|gb|ADVK01000025.1| GENE 62 68068 - 69630 2226 520 aa, chain + ## HITS:1 COG:FN1111 KEGG:ns NR:ns ## COG: FN1111 COG0747 # Protein_GI_number: 19704446 # Func_class: E Amino acid transport and metabolism # 
Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 520 19 538 538 973 96.0 0 MKKKVLLGIFLALISVGILMSCGTEKEKEAVLTEAQANGGHMNIALYWFGETLDPAENWD GWTLTRAAVGETLVTVDENLQLVGQLADSWENIDETTWKFHIRQGVTFQNGNPLTPEAVK SSIERTVKINERGENALKLASIDVDGEYVVIKTKEPYGAFLANISDPMFIIVDTSVDTSK FKETPICTGPYMVTSFKPATSFETVAYENYWGGKPALDSVTVFDIEDDNTRALSLQSGDV DMAQGIRAGDIALFTDNKDYIVKSTTGTRIEFLVMNIEKAPFNDKNLRLAINSAVDYDTI AKVVGGGAVPARAPFPASAPYGYSELNKQTFDLEKTKTLLAEAGYKDTNNDGYVDKDGKN LELNIYGTAGGNTRANSTVAELLESQLKTAGIKANIKIAENLDEIKKNGEFDLLFQNWQT VSTGDSQWFLDNAFKTGGSGNYGKYSNKQLDDLINKLATTFDVKERQKITKETSQIIIDE GYGTYLISQANVNVSNNKVENMGNFPIDYYFLTVNTKIKK >gi|296154652|gb|ADVK01000025.1| GENE 63 69643 - 70422 246 259 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 232 1 221 223 99 30 6e-20 MKPLLEIKNLNINYKNSIKAVKDVNFTLEDNQIISIVGESGSGKSTLIRAILKLLPTGGE IESGNIFFLGKDILSLNKNELNKLRGKDIGMIFQDPNSTMDPIKIIEKQFIEYILEHNNI PKKEAIELAKEYLLKLNLTDVDRVLKSYPFELSGGMKQRVVIAMTMAQSPRLLLADEPTS ALDVTVQAQVIKELKRIRENFKTAIILVTHNMGVASYISDKIAVMKNGEIIEFGDKEQII KNPQREYTKSLLNAIINLK >gi|296154652|gb|ADVK01000025.1| GENE 64 70440 - 71201 193 253 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|225088774|ref|YP_002660041.1| ribosomal protein S16 [gamma proteobacterium NOR5-3] # 6 231 12 230 312 79 25 9e-14 MKEDLLIIENISKTFKVDKNKELKALKNVNIRLKKGECIGIVGESGCGKSTLARIVVGIE KKTSGKIIFDNKEIEGISETKDIQMIFQNPLSSFNPRMKIVDYMWEPLRNYFKLSKKDSI PLIKKSLIDVSLDENALEKYPHEFSGGQLQRITIARAIIIKPKLIVCDEITSALDVSVQK QILELLKKLQKDLSLSYLFIGHDLAVLQDISQKIVVMYMGEIIEELNSIDLKSKAKHPYT KLLLNSIFEVDKI >gi|296154652|gb|ADVK01000025.1| GENE 65 71347 - 71835 494 162 aa, chain - ## HITS:1 COG:FN1108 KEGG:ns NR:ns ## COG: FN1108 COG0494 # Protein_GI_number: 19704443 # Func_class: L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases including oxidative damage repair enzymes # 
Organism: Fusobacterium nucleatum # 1 162 1 162 162 288 96.0 4e-78 MEKFAVPCVAAIIEKIVNNEKYILIQTRQKEDGAETNGMLEVPAGKIREYENIFGALRRE VKEETGLIITKILGEDRQVSNLIDGNEVISYTPYCVTQNLSGVYSIILNTFLCEAEGELL SETNESQNIHWMKIEDLKKILKNNPEKIFLLHINALQKYFKK >gi|296154652|gb|ADVK01000025.1| GENE 66 71858 - 72325 559 155 aa, chain - ## HITS:1 COG:FN1107 KEGG:ns NR:ns ## COG: FN1107 COG0454 # Protein_GI_number: 19704442 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 48 155 1 108 108 208 100.0 4e-54 MKIELMKITLKDTKEIWEMQVKAFKELLDKYQDFETNPASEPISNIEMRLKQNFTFFYFI CVDNKKVGAIRIIDYKEKYKNKRISPIFILPEYRNKGIAQSAIKICEEIHGNNNWELSTI LQEKGNCYLYEKLGYHSTGKTQVINDRLTLIFYEK >gi|296154652|gb|ADVK01000025.1| GENE 67 72352 - 73578 1998 408 aa, chain - ## HITS:1 COG:FN1106 KEGG:ns NR:ns ## COG: FN1106 COG1760 # Protein_GI_number: 19704441 # Func_class: E Amino acid transport and metabolism # Function: L-serine deaminase # Organism: Fusobacterium nucleatum # 1 408 1 408 408 835 100.0 0 MDTLKELFKIGAGPSSSHTIGPERATKRVKEKFPNADSYIVELWGSLAATGKGHYTDKII IETFKPIPVEIIWKPEFVHELHTNGMKFIALDKDKKQIGEWIVFSVGGGTIRDYDELMDK SPKKEVYPLNSMKEIVKWCKDNKKQLWQYVEECEGPSIWQHLRYIDQAMTDAVKRGLEKS GDVPGPFKYPKRAREMYEKALSKRASLIFTNKVFAYALAVSEENASMGQVVTAPTCGASG VIPGVLRGMKEEYELVEKHILRGLAIAGLIGNLVKYNATISGAEGGCQAEVGTACSMAAA MATYFMGGNTDQIEYAAESAMEHHLGMTCDPVGGYVIIPCIERNAICAVRAINTATYCMS TDGKHTISFDEVIKTMKETGKDMCSAYKETSDGGLAKYYDRILVDSKE >gi|296154652|gb|ADVK01000025.1| GENE 68 73607 - 74104 738 165 aa, chain - ## HITS:1 COG:no KEGG:FN1105 NR:ns ## KEGG: FN1105 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 165 1 165 165 303 97.0 2e-81 MRGLEEIYLKGFGYDKYLGIASQDELEKLEELYKNIVISDEFVNKIKTINKKVSVLVSVE TWCPFARVFLTTLRKINEINHIFDLSLITYGRGVSELAGYLKINEDDFVVPTAVFLDNNF SKLRVFNGFPEKYHKEDTLDTIDATRNYLKGKSVNDILEDILNIF >gi|296154652|gb|ADVK01000025.1| GENE 69 74108 - 74692 862 194 aa, chain - ## HITS:1 
COG:FN1104 KEGG:ns NR:ns ## COG: FN1104 COG0632 # Protein_GI_number: 19704439 # Func_class: L Replication, recombination and repair # Function: Holliday junction resolvasome, DNA-binding subunit # Organism: Fusobacterium nucleatum # 1 194 1 194 194 326 100.0 2e-89 MFEYLYGTVEYKKMDYIAIDINGVGYRVYFPLREYEKIEVGNKYKLYIYNHIKEDTYKLI GFLDERDRKIFELLLKINGIGSSLALAVLSNFSYNKIIEIISKNDYTTLRQVPKLGEKKA QIIILDLKGKLKNLTYTEEETVSMDMLEDLVLALEGLGYNKKEIDKTLEKIDLNKFSSLE DAIKGILKNMRIGD >gi|296154652|gb|ADVK01000025.1| GENE 70 74704 - 77586 3967 960 aa, chain - ## HITS:1 COG:FN1103 KEGG:ns NR:ns ## COG: FN1103 COG0178 # Protein_GI_number: 19704438 # Func_class: L Replication, recombination and repair # Function: Excinuclease ATPase subunit # Organism: Fusobacterium nucleatum # 1 960 1 960 960 1866 99.0 0 MYYKNYIERNSEIKLMIDKITIKGARQHNLKNIDIELPKNEFIVITGVSGSGKSSLAFDT IYSEGQRRYVESLSAYARQFIGQMNKPEVDSIEGLSPAISIEQKTTNRNPRSTVGTITEV YDYLRLLFAHIGTAHCPICHTAVEKQSVDEIVESIMTKFDDGSKIILLAPVVKDKKGTHK NIFLNLFKKGFVRARVNGEVLYLEDEIELDKNKKHNIEVVVDRLVLKKDDKDFESRLTQS IETAIDLSNGKLIINDGKNDYLYSENYSCPNHEDVSIPELNPRLFSFNAPYGACPECKGL GKKLEVDENKLIENPELSIEDGGMYIPGAMARKGYSWEIFKAMAKVAKIDLTKPVKDLTQ KELDIIFYGYDEKFRFDYTGGEFDFHGYKEYEGAVKNLERRYYETFSDAQKEEIENKYMV ERICKVCNGKRLKDEVLAVTVNGKNIMEICDMSIKNSLDFFMNMNLTEKQEKIAKEILKE IRERLTFMTNVGLDYLTLSRETKTLSGGESQRIRLATQIGSGLTGVLYVLDEPSIGLHQK DNDKLLATLNRLKELGNTLIVVEHDEDTMMQADKILDIGPGAGEFGGDIVAFGSPKEIMK NKDSITGKFLSGKEAIDIPKKRRKWDKSIKLYGAKGNNLKNIDVEFPLGVMTVVTGVSGS GKSTLINSTLYPILFNKLNKGKLYPLEYKKIEGLNALEKVINIDQTPIGRTPRSNPATYT KLFDDIRDIFAETQDAKLHGFKKGRFSFNVKGGRCEACQGAGILKIEMNFLPDVYVECEV CKGKRYNKETLDVYYKGKNIYDVLEMSVLEAYEFFKNIPALERRLKVLIDVGLDYIKLGQ PATTLSGGEAQRIKLATELSKMSKGNTVYILDEPTTGLHFQDIKKLLEVLNRLLEKGNTV IIIEHNLDVIKTADHIIDIGVDGGENGGTVVATGTPEEIAKSKKSYTGKYIAKILKNKTK >gi|296154652|gb|ADVK01000025.1| GENE 71 77654 - 78193 619 179 aa, chain - ## HITS:1 COG:FN1102 KEGG:ns NR:ns ## COG: FN1102 COG1859 # Protein_GI_number: 19704437 # Func_class: J Translation, ribosomal structure and biogenesis # 
Function: RNA:NAD 2'-phosphotransferase # Organism: Fusobacterium nucleatum # 1 179 1 179 179 310 96.0 8e-85 MDNDVKLGRFISLILRHKPETIDLKLDKNGWADTKELIEKISKSGREIDFKTLERIVNEN NKKRYSFNEDKTKIRAVQGHSIKVNLELKEVVPPAILYHGTAFKNLESIKKEGIKKMSRQ HVHLSADIETAKNVATRHSGKYIILEIDTEAMLKENYKFYLSENKVWLTDFVPSKFIKF >gi|296154652|gb|ADVK01000025.1| GENE 72 78186 - 79532 1482 448 aa, chain - ## HITS:1 COG:FN1101 KEGG:ns NR:ns ## COG: FN1101 COG1373 # Protein_GI_number: 19704436 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Fusobacterium nucleatum # 1 448 23 470 470 855 99.0 0 MERFILNDLIKWKNSKYRKPLILKGVRQVGKTWILKEFGNICYENIAYFNFDENPEYRQF FQTTKDINRILQNLILISGYKIVPEKTLIIFDEVQDAPEVINSLKYFYENTPEYHITCAG SLLGISLAKPSSFPVGKVDFLNIYPMSFSEFLLANGDENLKLFLDNLNNIENIPDAFFNP LYEKLKMYYVTGGMPEAVYMWTQERDIELVRKTLNNILEAYERDFAKHPNIYEFPKISMI WKSIPSQLSKENKKFIYKVVKEGARAREYEDALQWLVNANLVTKVFKCSAPRMPLSSYDD ISAFKIYLVDVGLLTRLSQLSPNTFGEGNRLFTEFKGALTENYILQGLISQFEVPPRYWA ENNYEVYFIIQNENNIIPIEVKAETNIKSKSLQKFKEKFKDDIKLRVRFSFENLKLDNDL LNIPLFMVDYAEKIINIALNKKQGENNG >gi|296154652|gb|ADVK01000025.1| GENE 73 79703 - 79969 344 88 aa, chain - ## HITS:1 COG:FN1100 KEGG:ns NR:ns ## COG: FN1100 COG2026 # Protein_GI_number: 19704435 # Func_class: J Translation, ribosomal structure and biogenesis; D Cell cycle control, cell division, chromosome partitioning # Function: Cytotoxic translational repressor of toxin-antitoxin stability system # Organism: Fusobacterium nucleatum # 1 88 1 88 88 146 96.0 7e-36 MGYRILIPDNVNKKILKFDRNTRKLLYDYINKNLKDTDDPRLHGKALTGNLKGLWRYRIM DYRLIVDIQDEQLIIVAVDFNHRRKIYL >gi|296154652|gb|ADVK01000025.1| GENE 74 79951 - 80181 305 76 aa, chain - ## HITS:1 COG:no KEGG:FN1099 NR:ns ## KEGG: FN1099 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 75 1 75 75 97 92.0 2e-19 MSVISIRFNEEEEEILKNYVKSKEINLSQYIKNIIFEKIEEEYDLKSVQEYLKAKSEGTL NLIPFEEATKEWDIEY >gi|296154652|gb|ADVK01000025.1| GENE 75 80328 - 80465 243 45 aa, chain 
- ## HITS:0 COG:no KEGG:no NR:no MNIINPDPLGFRKMEKERVINELEKAIQKAKVEGRIEDVEKLKKN >gi|296154652|gb|ADVK01000025.1| GENE 76 80519 - 82306 2321 595 aa, chain - ## HITS:1 COG:no KEGG:FN1097 NR:ns ## KEGG: FN1097 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 595 1 595 595 956 99.0 0 MEKNFKNLFDERKKLSKELDKTYIDMIENINNRVKELDDLEKVLQEDFLKACAMDVAGET VRTFFDTGDYNITADQLYDRIVNFSYEDKNDPLSNTLEVNKKNIYNLENSKESQENLKKE IGEPEKLFKKRIVIDENGKERKEYEDKDMIQKGKNKYRKDYDEIGNAKKIDVDHTQALAQ AKAHVKYLKEGGEKAIKEFYNSEDNFQMLGRTANQCKGDAKVFLKDENGNYVKDKDGNKI DITYEATPEQMTEAIIEKLEGGKNRDPKTTQKLKDEGILDENGKVKPGVKRKIKEGIKKS QDKEGEVTYKNIDKTKVAKDAGKETVKQAGKMISGQLIYYLVPPMIYEIKENIKNKNNED STLENLKNSAEKIIEYASSKIGKILANIFPNGIKKFLKNFFDIIIEIVKGALKKILKIAK QLLITVVDAIKILFDSSKSFLEKMDAILQLVAGIFVNIAIGILGEYIQKQFMIPEMFWFP LEMIIRIILSNFIMLLLKKLDLFGVNRKIKIEKVRMIFEEEREKTDLELQEKLDNVNYNN QQIFEELNLELKGIQKSIQENNMFNISIKNDVKRMFEIFGKELKMNKELRTFLGN >gi|296154652|gb|ADVK01000025.1| GENE 77 82315 - 83559 1366 414 aa, chain - ## HITS:1 COG:no KEGG:FN1096 NR:ns ## KEGG: FN1096 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 414 1 414 414 605 92.0 1e-171 MSKFYTLVHYIEKLRKEIEDIQKSDDEIKKEQEDKTTEVGIVIQSAFNIFGSKDISSEIY QGNNDEDEIKNIYDICDKNYVKLANEIIYVNVKREELWKSIRDIQEYLSNLGILTENKTD YSEFTLENIEIPKLDKEINFLNNKLEFLERDARKNDIEYGKYEGFGTTLWNRLASPLESV KAERKLEAINIAQKKLYVTLRFYFDYYKDSLKLQKPYLEECIKTAKLYRESLNNLLDTID KKILPFLEISNLFLKSLIIKELVKENKLNENMSLKDIESPKSIGNLKDTKYNIYYTFFEN LFHLYTSIETAFKENILTNLMFFKNDYLEGDELQDIESKIKDSAKTYPEFGTFDWIISID TSEEEYIYSAKDIKFLNYSKSIKEAEEKIKIINEKIADVKKNLDEIEEISNELE >gi|296154652|gb|ADVK01000025.1| GENE 78 83552 - 83902 410 116 aa, chain - ## HITS:1 COG:no KEGG:FN1095 NR:ns ## KEGG: FN1095 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 116 4 120 120 147 82.0 1e-34 MDKTFIFSIVGIVFLIVVIFFFIKNRKKSDDNSMEEFIITEEDNKSSQKKEPEINITATD FKVVGNKDKYTISFKGDKIQFFVKDNEIIGFLDVEKNSKIYYYEKAKVKEGEDKNV 
>gi|296154652|gb|ADVK01000025.1| GENE 79 84120 - 84215 60 31 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MYNNSKNLKNKNIFLIMTKYVVIILYNKNIK >gi|296154652|gb|ADVK01000025.1| GENE 80 84407 - 85489 1005 360 aa, chain + ## HITS:1 COG:FN1094 KEGG:ns NR:ns ## COG: FN1094 COG0463 # Protein_GI_number: 19704429 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Fusobacterium nucleatum # 1 360 1 360 360 666 97.0 0 MKKTLILVPALNPPKQLIDYVKSLLDNGLNDILLVDDGSKEEFKEIFDTIEKFLEGNIKV FRHAKNLGKGRALKNAFNYFLTLSNLAEYSGVVTADSDGQHRVEDVIKVAKEVEENPNKL ILGCRDFNLEQVPPKSKFGNKITNGAFKLFYGKNISDTQTGLRGFPTAIIKDFLDIAGER FEYETKMLIYCFQKEIEIKEVVIETIYFNDNSETHFNPILDSIKIYKVTLSPFLKYIASA TSSFILDILSFKWILAILIALGNIEGAGIITISTIVARIISSSFNFYLNKKFVFKYEKNT KKSLLKYYSLCTVQMLLSAILVTVVWKHTRYAETSIKIVVDSILFLLSYFIQQRWVFKRK >gi|296154652|gb|ADVK01000025.1| GENE 81 85503 - 85931 434 142 aa, chain - ## HITS:1 COG:FN1093 KEGG:ns NR:ns ## COG: FN1093 COG1959 # Protein_GI_number: 19704428 # Func_class: K Transcription # Function: Predicted transcriptional regulator # Organism: Fusobacterium nucleatum # 1 142 1 142 142 244 99.0 4e-65 MKLKNEIEYVFRILNYLSLQDKNRIVTSAEIAENENIPHLFSIRVLKKMEKKGLLKIFKG ANGGYKLNKEPKDITLRDAVETIEDEIIIKDKSCVVGQTSCSIIFEALEKVENNFLNNLE KVNFQELTCPTHVPLKIEDEIK >gi|296154652|gb|ADVK01000025.1| GENE 82 85965 - 86873 1165 302 aa, chain - ## HITS:1 COG:FN1092 KEGG:ns NR:ns ## COG: FN1092 COG3872 # Protein_GI_number: 19704427 # Func_class: R General function prediction only # Function: Predicted metal-dependent enzyme # Organism: Fusobacterium nucleatum # 1 302 3 304 304 569 99.0 1e-162 MSTNKPSIGGQAVIEGVMMRGTECLATAVRRPSGEIVYKKTKIIGKNSNFAKKPFIRGVL MLFESLVIGVKELTFSANQAGEDDEKLSDKEAVFTTIFSLALGIGIFIVLPSIVGSFFFP TNRIYANLTEAILRLIIFIGYIWGISFSKEVGRVFEYHGAEHKSIYTYENGLELTPENAK KFTTLHPRCGTSFLFIVMFIAIIVFSVIDFMLPIPTNLLTKFLLKVVVRIVLMPVIASLA YELQKYSSCHLNNPLIKLISLPGLALQKITTREPDLDELEVAIVAIKASLGEEVNNATEI FE >gi|296154652|gb|ADVK01000025.1| GENE 83 86888 - 88411 
1573 507 aa, chain - ## HITS:1 COG:FN1091 KEGG:ns NR:ns ## COG: FN1091 COG2208 # Protein_GI_number: 19704426 # Func_class: T Signal transduction mechanisms; K Transcription # Function: Serine phosphatase RsbU, regulator of sigma subunit # Organism: Fusobacterium nucleatum # 61 507 1 447 447 780 99.0 0 MITAFYMILAFLIYIFFTYIYIKRLVNQYINEELKIISGLKNKEKLDKLPDNIKTEYMET LEKIIKQENELNNSIDEIKEYRKELDVTYSTLVSKSTQLEYTNSLLEKRVRNLSNLNHIS RVALSMFNIDKIVDTLADAYFVLTATTRISIYLWEGEKLVNKKIKGSIDFTESFSYPMNL LEKFTNEDYTKIYSDLSRKITILNDEKVIITPLKVKERQLGVILLVQNKDQILEINSEMI SALGIQASIAIDNAINYAELLEKERISQELELASSIQKQILPKGFERIKGMDIATHFSPA KEVGGDYYDLSLKNNNLSVTIADVSGKGVPAAFLMALSRSMLKTINYVSNYTPAEELDLF NKIVYPDITEDMFITVMNAEYNLDTSLFTYSSAGHNPLVIYKKENDTVELYGTKGVAIGF IEDYNYKENSFELKNGDIIVFYTDGIIECENKNRKLFGTQRLLDVVYKNKTLSAKELKEK ILEAIKNFREDYEQTDDITFVILKSVK >gi|296154652|gb|ADVK01000025.1| GENE 84 88476 - 90245 1836 589 aa, chain - ## HITS:1 COG:FN1090 KEGG:ns NR:ns ## COG: FN1090 COG0322 # Protein_GI_number: 19704425 # Func_class: L Replication, recombination and repair # Function: Nuclease subunit of the excinuclease complex # Organism: Fusobacterium nucleatum # 1 589 1 589 589 1046 100.0 0 MDIGKIDIPESSGVYLMKKNNKVIYVGKAKNLKNRVSSYFNRVHESEKTNELVKNIEDIE FFLTNTEIDALLLENNLIKKYSPKYNILLKDEKTYPFIKISKEDFSSIKIVRTTKALDIK SGEYFGPYPYGAWRLKNILMKLFKIRDCNRDMKKTSPRPCLKYYMKSCTGPCVYKDIKEE YNKDVENLKQVLKGNTSKLINELTALMNKASQDMDFEKSIIYREQIKELKSIASSQIIQY ERELDEDIFVFKTILDKAFICVLNMRDGKILGKSSTSIDLKNKITDNIYEAIFMSYYSKH ILPKSLVLDAEYENELSVVVKALTIEDSKKKEFHFPKIKSRRKELLDMAYKNLERDIESY FSKKDTIEKGIKDLHDILGLKRFPRKIECFDISNIQGKDAVASMSVSIEGRAARKEYRKF KIRCKDTPDDFSMMREVIERRYSKLPDIEFPDVILIDGGLGQINSAGEVLKRLGKIHLSE LLSLAERNEEIYKYGESIPYVLSKDMEALKIFQRVRDEAHRFGITYHRKIRSKRIISSEL DKIDGIGEVRRRKLLTKFGSISAIKKASIEELKEIIPEKVALEIKNKIR >gi|296154652|gb|ADVK01000025.1| GENE 85 90232 - 91104 870 290 aa, chain - ## HITS:1 COG:FN1089 KEGG:ns NR:ns ## COG: FN1089 COG1660 # Protein_GI_number: 19704424 # Func_class: R General function prediction only # Function: Predicted 
P-loop-containing kinase # Organism: Fusobacterium nucleatum # 1 290 1 290 290 537 99.0 1e-153 MKTKHVIIVTGLSGAGKTTALNILEDMSYYTIDNLPLGLEKSLLDTEIEKLAVGIDIRTF KNTKDFFTFINYIKESGVKMDIIFIEAHEAIILGRYTLSRRAHPLKEVTLLRSILKEKKI LFPIREIADLVIDTTEIKTVELEKRFKKFIFAKDGENTDININIHIQSFGYKYGIPTDSD LMFDVRFIPNPYYIEKLKELNGFDEEVKEYVLSQKESKEFYFKLLPLLEFLIPQYIKEGK KHLTISIGCSGGQHRSVTFVNKLAEDLKNSKVLEYINVYVSHREKELGHW >gi|296154652|gb|ADVK01000025.1| GENE 86 91131 - 92480 2001 449 aa, chain - ## HITS:1 COG:FN1088 KEGG:ns NR:ns ## COG: FN1088 COG0446 # Protein_GI_number: 19704423 # Func_class: R General function prediction only # Function: Uncharacterized NAD(FAD)-dependent dehydrogenases # Organism: Fusobacterium nucleatum # 15 449 1 435 435 791 99.0 0 MNKKIIIVGGVAAGMSAASKAKRIDKSLDITVYEMTDAISWGACGLPYYVGDFYPNASLM VAKTYEEFQKEGINVKIKHKVENIDFKNKKVFVRNLNENKVFEDNYDKLVIATGASSTSP KDIKNLDAEGVYHLKTFNEGLEVKKEMMKKENENIIIIGAGYIGIEIAEAALKLGKNVRI FQHSARILNKTFDKEITDLLENHIREHEKISLHLNESPVEVRTFEDKVIGLKTDKKEYVA NLIIVATGVKPNTEFLKDTGIELFKNGAIIINRFGETNIPNVYAAGDCATVYHSVLEKNV YIALATTANKLGRLIGENLTGTNKLFIGTLGSAGIKVLEFEAARTGITEQEAKDNNINYK TIFVDGEDHSAYYPGGEDVYIKLIYNADTKILLGAQLAGKRGAALRTDSLAVAIQNKMTV QELANMDFLYAPPFATTWDIMNVAGNVAK >gi|296154652|gb|ADVK01000025.1| GENE 87 92607 - 93188 536 193 aa, chain - ## HITS:1 COG:no KEGG:FN1087 NR:ns ## KEGG: FN1087 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 192 1 192 192 324 98.0 1e-87 MARISYLKGLVICHGKSEKLICDFIKSNLRIQIEIDSDKKGKKSIQITSIMKFLSGEKYK NIVSFKNKFDDIEPIKDRKKLPNYFKVFIIMDTDDCNENQKKSFKDKSMFKKHWLYDYIV PIYNDNNLEEVLVDAGIKFQKNGNERKTEYPKVFPMNGISDIEGIKKFGKDLKNSKKTNM EKFINFCLTLIEK >gi|296154652|gb|ADVK01000025.1| GENE 88 93194 - 94519 1499 441 aa, chain - ## HITS:1 COG:FN1086 KEGG:ns NR:ns ## COG: FN1086 COG1106 # Protein_GI_number: 19704421 # Func_class: R General function prediction only # Function: Predicted ATPases # Organism: Fusobacterium nucleatum # 1 441 1 441 441 715 99.0 0 
MFTYIKLKNYKSLIELEVDLTKKENTPKKLISIYGENGIGKSNFVDSFYTLKRIISTRTI NEKVRILTEKQKDLQSNDFDKALYFFGQLGSIIKNGFFSDSTDIITESKTINSKDNMVIE VGFKIKSKSGIYRIETDDTDIISEKLDFTLNKNKVNFFEITKKEKNLNESVFIDNEYKKE ILSIIEKYWGKHSLLSLIAYEIEDKKENYVKKRIFNGIFEVINFFLSLSILSRNKMEVFK DIEKEKLFYGTLSINEEKKLTNIENVINTFFIALFSDIKQAYYKKKIDNDKINYILCFKK NIYNKLIDIEYNRESTGTKNILKILPYLISAAKGKTVIIDEIDNGIHDLLMLKILENLSE DLKGQLIITTHNTLLLEEEFIKDSIYIFKVDENANKQLISLKKFEGRIHPNLSIRKRYLK GLYGGIPFPMDIDFNELIESM >gi|296154652|gb|ADVK01000025.1| GENE 89 94664 - 95212 893 182 aa, chain - ## HITS:1 COG:FN1085 KEGG:ns NR:ns ## COG: FN1085 COG0693 # Protein_GI_number: 19704420 # Func_class: R General function prediction only # Function: Putative intracellular protease/amidase # Organism: Fusobacterium nucleatum # 1 182 1 182 182 349 100.0 2e-96 MKTYVFLADGFEILETFAPVDVLKRCGAEVVTVSTEKDLFVASSQKNIVKADAMLSDIDY KTADLIIIPGGYPGYVNLRENKEVVDIVKYFLDNDKYVASICGGPTIFSYNNLANGTKLT GHSSTKEELSKNHIYVDVPTHIDRKIITGVGAGQAINFAFKIAEQFFDKEKIEEVKKGME II >gi|296154652|gb|ADVK01000025.1| GENE 90 95228 - 95476 397 82 aa, chain - ## HITS:1 COG:no KEGG:FN1084 NR:ns ## KEGG: FN1084 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 82 4 85 85 144 100.0 1e-33 MFETWAETLYDETFNDMFDALVAEYKNGEITVEQLKINLAEQQQILLNAFTEGEVKSTYC NAMVDAHQYVLALINNGKIIRE >gi|296154652|gb|ADVK01000025.1| GENE 91 95587 - 96183 734 198 aa, chain - ## HITS:1 COG:FN1083 KEGG:ns NR:ns ## COG: FN1083 COG2431 # Protein_GI_number: 19704418 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 198 1 198 198 278 100.0 5e-75 MIIVSCAVIVGILLGYFTKSYINFDISLLIQFGLYLLLFFIGIDIGKNDNILNDLKKLNK KVLFLPFITIIASLAGGAVASILLSLSMGESVAISAGMGWYSFSAIELSKVSVELGGIAF LSNIFRELLAIFLIPIIAKKIGSFESVSVAGATAMDSVLPIINKSNPAEISIISFYTGLV ISIVVPILIPILVNIFSL >gi|296154652|gb|ADVK01000025.1| GENE 92 96180 - 96455 242 91 aa, chain - ## HITS:1 COG:no KEGG:FN1082 NR:ns ## KEGG: FN1082 # Name: not_defined # Def: hypothetical protein # Organism: 
F.nucleatum # Pathway: not_defined # 1 91 1 91 91 120 100.0 2e-26 MLDIFIYVCIILFGVFLVRKKLFPERLLKKVSLLQSLSLYLLLGAMGYKIGSDDRLISNL HILGGKALVISVFAIVFSIIFVKFFYWRDKK >gi|296154652|gb|ADVK01000025.1| GENE 93 96534 - 96620 74 28 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSIDKNILYKKVYYYNFILIIKKQLKLK >gi|296154652|gb|ADVK01000025.1| GENE 94 96662 - 96871 292 69 aa, chain + ## HITS:1 COG:no KEGG:FN1081 NR:ns ## KEGG: FN1081 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 69 1 69 69 74 100.0 2e-12 MENLDKENLTEKIKNLELKVKEIDLKIEEVKKEMKMLENNKENLTDLLDLYTRQLEYGKK DFKLRGSEK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:33:13 2011 Seq name: gi|296154631|gb|ADVK01000026.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00031, whole genome shotgun sequence Length of sequence - 19419 bp Number of predicted genes - 20, with homology - 20 Number of transcription units - 7, operones - 5 average op.length - 3.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 22 - 1716 2246 ## COG1132 ABC-type multidrug transport system, ATPase and permease components - Prom 1743 - 1802 14.2 - Term 1774 - 1820 9.1 2 1 Op 2 . - CDS 1853 - 2287 702 ## COG0783 DNA-binding ferritin-like protein (oxidative damage protectant) - Prom 2431 - 2490 19.2 - Term 2488 - 2522 3.0 3 2 Tu 1 . - CDS 2550 - 3095 876 ## COG2849 Uncharacterized protein conserved in bacteria - Prom 3138 - 3197 12.0 + Prom 3136 - 3195 17.5 4 3 Tu 1 . + CDS 3293 - 3544 359 ## COG4545 Glutaredoxin-related protein + Term 3778 - 3833 -0.8 5 4 Op 1 . - CDS 3651 - 4544 728 ## FN1076 hypothetical protein 6 4 Op 2 1/1.000 - CDS 4559 - 5206 794 ## COG0596 Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) - Prom 5231 - 5290 14.3 - Term 5266 - 5309 2.3 7 5 Op 1 . - CDS 5320 - 6321 737 ## PROTEIN SUPPORTED gi|163762490|ref|ZP_02169555.1| ribosomal protein L28 8 5 Op 2 . 
- CDS 6339 - 6800 551 ## FN1073 hypothetical protein 9 5 Op 3 1/1.000 - CDS 6847 - 7947 1363 ## COG1161 Predicted GTPases 10 5 Op 4 5/0.000 - CDS 7944 - 8816 878 ## COG4974 Site-specific recombinase XerD 11 5 Op 5 6/0.000 - CDS 8823 - 10127 1813 ## COG1206 NAD(FAD)-utilizing enzyme possibly involved in translation - Term 10134 - 10167 2.4 12 5 Op 6 13/0.000 - CDS 10175 - 12436 3204 ## COG0550 Topoisomerase IA 13 5 Op 7 5/0.000 - CDS 12500 - 13354 950 ## COG0758 Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 14 5 Op 8 1/1.000 - CDS 13378 - 14130 1022 ## COG0457 FOG: TPR repeat 15 5 Op 9 . - CDS 14111 - 15325 1499 ## COG1570 Exonuclease VII, large subunit - Prom 15374 - 15433 11.8 - Term 15380 - 15432 10.4 16 6 Op 1 . - CDS 15440 - 15886 793 ## FN1065 hypothetical protein 17 6 Op 2 . - CDS 15888 - 16649 1130 ## FN1064 hypothetical protein 18 6 Op 3 . - CDS 16668 - 17852 1840 ## COG1473 Metal-dependent amidase/aminoacylase/carboxypeptidase - Prom 17944 - 18003 12.0 + Prom 17887 - 17946 15.7 19 7 Op 1 . + CDS 18041 - 18823 405 ## PROTEIN SUPPORTED gi|163802692|ref|ZP_02196583.1| 30S ribosomal protein S21 20 7 Op 2 . 
+ CDS 18857 - 19411 311 ## FN1061 hypothetical protein Predicted protein(s) >gi|296154631|gb|ADVK01000026.1| GENE 1 22 - 1716 2246 564 aa, chain - ## HITS:1 COG:FN1080 KEGG:ns NR:ns ## COG: FN1080 COG1132 # Protein_GI_number: 19704415 # Func_class: V Defense mechanisms # Function: ABC-type multidrug transport system, ATPase and permease components # Organism: Fusobacterium nucleatum # 1 564 1 564 564 1033 99.0 0 MLKKFISYYKPHKKMFFLDLLAAFFISICDLFYPVLTRTILYDFIPHKKLKVIFLFLIIL FFIYIIKMLLNYFVGFYGHVVGVKIQADMRRDLFKHIQNMPISYFDKNQTGDIMSRIIND LVDISELAHHGPEDVFISGVLVIGSFIYLVNLNAILTCIVFFFIPILALLTILLRNRMMK AFAETRVTVGAINANLSNAISGIRVSKSFNNSKYEFNKFELGNIRYIIARKAAYLWLAVF QGGIYYIIDMLYLVMLLSGSIFTYYNKITVIDFVTYMLFVNMLITPVKRLINSVEQFQNG MSGFRRFFEIINIPEEEEGKLEVGKLKGDIVFDNVTFRYEENENVFDNFSLKIKAGTNVA LVGESGVGKSTICHLIPRFYEILGGKITIDNIDIREMSLSSLRKNIGIVSQDVFLFTGTI KENIAYGKLDATDEEIYRAAKYANIHDYIMTLEKGYDTQVGERGIRLSGGQKQRISIARV FLANPPILILDEATSALDSITERNIQKSLDELSEGRTTLVVAHRLTTIRKADVIIVITKD GIAEMGNHEELMNMKGIYYKLNQA >gi|296154631|gb|ADVK01000026.1| GENE 2 1853 - 2287 702 144 aa, chain - ## HITS:1 COG:FN1079 KEGG:ns NR:ns ## COG: FN1079 COG0783 # Protein_GI_number: 19704414 # Func_class: P Inorganic ion transport and metabolism # Function: DNA-binding ferritin-like protein (oxidative damage protectant) # Organism: Fusobacterium nucleatum # 1 144 1 144 144 253 100.0 1e-67 MKNKENLNKYLSNLGILITKTHNLHWNVVGARFKAIHEYTESLYDYYFEKFDEVAEAFKM KGEFPLVKVADYLKHATVKELEAKDFTIPEVVTSIKEDIELMLADARKIREVANEEDDFL VANMMEDQIEYFVKQLWFISAMAK >gi|296154631|gb|ADVK01000026.1| GENE 3 2550 - 3095 876 181 aa, chain - ## HITS:1 COG:FN1078 KEGG:ns NR:ns ## COG: FN1078 COG2849 # Protein_GI_number: 19704413 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 181 1 181 181 279 100.0 2e-75 MNNQYNKDGKKEGLWVKIYDNGVVQEERNYVNGVREGVYKSYYMNGEIEIIKNYKNGNLH GKYQTFYSDGKLNSEYNLVDGRKVGEYKEFYPNGILKRETVYVNDGTTSKNIKYFPNGKI 
KLEVNFVDGHMEGPYKEYHSNEKLFKECSYNKKGKLEGKYREYDVEGNLLKETTYENGVE I >gi|296154631|gb|ADVK01000026.1| GENE 4 3293 - 3544 359 83 aa, chain + ## HITS:1 COG:FN1077 KEGG:ns NR:ns ## COG: FN1077 COG4545 # Protein_GI_number: 19704412 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Glutaredoxin-related protein # Organism: Fusobacterium nucleatum # 1 83 1 83 83 154 98.0 3e-38 MPKMYGSMLCPDCVKAKEYFEKINYKYDFVNITESMANLKEFLHLRDTRKEFDEVRSFKY VGIPAILTDDNKIIIGDDVFKIK >gi|296154631|gb|ADVK01000026.1| GENE 5 3651 - 4544 728 297 aa, chain - ## HITS:1 COG:no KEGG:FN1076 NR:ns ## KEGG: FN1076 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 297 1 297 297 350 99.0 3e-95 MTKFKDRMKFILRIILYISIIFFIPIIITALYIVHNDGINAFQRGISTFLLIILCTLLLL IICSQLKYGLIFGLNKLKFPENLVKNNVLKNTLKIFLRLLLYTFIIFLSALIIIILIDFS DKDLNIFPLEATQFLIMIAFIGILFMIHNDIKKLYNFSSDKIKLFEVLTQKIIEKVKKIK EKILTIKLDFFKNTLEKTKKILNSVSDKVESLVDLLENKINYPEKSEFKGRCDFSNLRNE IVNFAKIIYQILAYLTVFSVICILIPITIALVYQLLIYLITLLMTIWKVILAIISQL >gi|296154631|gb|ADVK01000026.1| GENE 6 4559 - 5206 794 215 aa, chain - ## HITS:1 COG:FN1075 KEGG:ns NR:ns ## COG: FN1075 COG0596 # Protein_GI_number: 19704410 # Func_class: R General function prediction only # Function: Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) # Organism: Fusobacterium nucleatum # 1 215 1 215 215 387 100.0 1e-108 MNYRIALIHGFFRNYKDMEDLENNLMNMGYTVDNLNFPLTFPPIERAIDILKEYLLSLKE KGINKQNEIVLIGFGFGGVLIRETLKLEEVKGIVDKIILLSSPINDSTLHRRLKRTFPFM DLIFKPLAIYAKTRRDRRRFDKDIEVGLIIGRESSGFFGKWLGEYNDGYIEMKDVNFPDA KDKILIPITHNELNKRIGTARYINNFIAKGKFRLE >gi|296154631|gb|ADVK01000026.1| GENE 7 5320 - 6321 737 333 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163762490|ref|ZP_02169555.1| ribosomal protein L28 [Bacillus selenitireducens MLS10] # 8 332 3 320 336 288 42 2e-77 MGLFDKLFKKKETEKIEEQVDKEKEIVEKEENQKVNISQRLTKSKEGFFSKLKNIFTSKS KVDDSIYEELEDLLLQSDVGLNMTTNLINQLEKEVKSNKVDNTEEVYEILKRLMSEFLLS 
QDSKIYLKDNKINVILIVGVNGVGKTTTIGKLALKYKNLGKKVLLGAGDTFRAAAVEQLE EWAKRADVDIIKGREGADPASVVYDTLSRAEATKADIVIIDTAGRLHNKANLMRELEKIN NIIKKKIGEQEYESLLVIDGTTGQNGLNQAKEFNSVTDLTGFIVTKLDGTAKGGIVFSVS EELKKPIKFIGLGEKIGDLIEFNAKDFVEAIFN >gi|296154631|gb|ADVK01000026.1| GENE 8 6339 - 6800 551 153 aa, chain - ## HITS:1 COG:no KEGG:FN1073 NR:ns ## KEGG: FN1073 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 153 16 168 168 288 100.0 5e-77 MERFSTFSFLTLIPIVALMIFVFISLFRAKNEEVDLPKILLKDIKTMRMAIDDYYKATGT FPDLVLANSDEKLEKIYYEKDGEKIYFKDYLRQSSLPKTPAFRDLDESNKIYLVENFRKV TNDGGWNYNIKTGEIHANLPYNFFEQGIDWENY >gi|296154631|gb|ADVK01000026.1| GENE 9 6847 - 7947 1363 366 aa, chain - ## HITS:1 COG:FN1072 KEGG:ns NR:ns ## COG: FN1072 COG1161 # Protein_GI_number: 19704407 # Func_class: R General function prediction only # Function: Predicted GTPases # Organism: Fusobacterium nucleatum # 1 366 1 366 366 721 99.0 0 MTKKCVGCGVELQNTDKNLQGYTPKPINTKENMYCQRCFQLKHYGKYSVNKMTREDYKKE VGKLLDDVKLVIAVFDIIDFEGSFDVEILDILREKDSIVIVNKLDLIPDEKHPSEVANWV KNRLAEESIVPLDIAIVSTKNGYGVNGVFRKIKHFYPDGVNAMVIGVTNVGKSSVINRLL GKKIATVSKYPGTTIKNTLNMIPFTNIGLYDTPGLIPEGRASDLVCDNCAQKIIPSGEIS RKTFKAKHNRMIMIGNLVKFKILNNDEIKPIFSIYAAKGVQFHETTIEKSKELELGNFFT IPCDCCKEEYNKHKKINKTLTINTGEELVFKGLAWVSVKRGPLNIQVTLPEEIEISVRKA FINPRR >gi|296154631|gb|ADVK01000026.1| GENE 10 7944 - 8816 878 290 aa, chain - ## HITS:1 COG:FN1071 KEGG:ns NR:ns ## COG: FN1071 COG4974 # Protein_GI_number: 19704406 # Func_class: L Replication, recombination and repair # Function: Site-specific recombinase XerD # Organism: Fusobacterium nucleatum # 1 290 1 290 290 400 90.0 1e-111 MTEKINIEKSIKNFIYYLEFEENKKNNTVISIRKDLNNFLEYLNKKNLVTLDKLDELVIK EYLAELKAIDLSNSTHNRRLSSIKKFYKYLINNNLKEKGKEILIEGIKNDEKKVEYLNPN EIELLREEMKEESFNVLRDRLMFELLYSSGMTVAELLSLGELNFNLEKREVYLLKNKISK VLYFSQTCKEVYLKFLIAKKEKFKEEDNPNIIFINNSNMRLTDRSVRRLINKYSEKANLQ KEVSPYTLRHSFCLYMLRNGMSKEYLAKLLDLKSIGLLDIYENLCKKEIL >gi|296154631|gb|ADVK01000026.1| GENE 11 8823 - 10127 1813 434 aa, chain 
- ## HITS:1 COG:FN1070 KEGG:ns NR:ns ## COG: FN1070 COG1206 # Protein_GI_number: 19704405 # Func_class: J Translation, ribosomal structure and biogenesis # Function: NAD(FAD)-utilizing enzyme possibly involved in translation # Organism: Fusobacterium nucleatum # 1 434 1 434 434 756 97.0 0 MEKEVIVVGAGLAGSEAAYQLAKRGIKVKLYEMKAKKKTPAHSKDYFSELVCSNSLGSDS LENASGLMKEELRILGSLLINIADKNRVPAGQALAVDRDGFSEEVTKILKNTENIEIIEE EFTEIPNDKIVIIASGPLTSDKLFEKISEITGEESLYFYDAAAPIVTFESIDMNKAYFQS RYGKGDGEYINCPMNKEEYYNFYNELIKAERAELKNFEKEKLFDACMPIEKIAMSGEKTM TFGPLKPKGLINPRTEKMDYAVVQLRQDDKEGKLYNIVGFQTNLKFGEQKRVFSMIPGLE NAEFIRYGVMHRNTFINSTKLLDKTLRLKNRDNIYFAGQITGGEGYVTAIATGMYVAMNV ANRLENKKEFILEDISEIGAIVNYITEEKKKFQPMGANFGIIRSLDENIRDKKEKYRKLS ERAIKYLKKSIKGV >gi|296154631|gb|ADVK01000026.1| GENE 12 10175 - 12436 3204 753 aa, chain - ## HITS:1 COG:FN1069_1 KEGG:ns NR:ns ## COG: FN1069_1 COG0550 # Protein_GI_number: 19704404 # Func_class: L Replication, recombination and repair # Function: Topoisomerase IA # Organism: Fusobacterium nucleatum # 1 681 4 684 684 1162 99.0 0 MPKKSEKNKLVIVESPAKAKTIEKILGSSYKVISSYGHIIDLPKTKIGVDVNNDFKPSYN TIKGKGEVIKQLKEASKKADKIYLASDPDREGESIAWHIANTLKLDHNEKNRIEFHEITE KAIKDAVKNPRKINIARVNSQQARRILDRLVGYEISPFLWKLISPNTSAGRVQSVALKII CELEDKIKAFVPEKYWDVKGIFDTKYSLNLYKIDNKKIDKLKDEKLLDRVKKDLKKKYEV ISSKVSKKTKNPPLPLKTSTLQQLASSYLGFSASKTMMVAQKLYEGISIKGEHKGLITYM RTDSTRISEEAKEMARNYITKNFGKEYLGSATPKAKKESKNVQDAHEGIRPTDINYTPEN IMEFLDKDQFKLYNLIWQRFLISQLAAMKYEQFEYILEKDKIEYRGTINKIIFDGYYKVF KEDEDLPIGDFPEIKEGDKFTLDKLDIKEDYTKPPARLTESSLVKTLEAEGIGRPSTYAS IIDTLKKREYVELQNKSFVPTEIGYEVKTQLDKFFPNIMNIKFTAKLEDELDEVDSGDKN WIDLLKVFYTELQKYEEKCKVVVEEELEKLVESDVIAKNGKPMIMKIGRFGRYLASQDTE SKENISLKGIDISLEDIKKGKIFVKKQIEELGKKKEGQKTDIILDNGSRLLLKYGRFGAY LESENYKEDNIRKTIPKEIKTKIEKNTIKKENDILYLKDIFDKIEKEEAEILKKAGKCEK CGRPFKISNGRWGKFLACTGYPECKNIKKIEKK >gi|296154631|gb|ADVK01000026.1| GENE 13 12500 - 13354 950 284 aa, chain - ## HITS:1 COG:FN1068 KEGG:ns NR:ns ## COG: FN1068 COG0758 # Protein_GI_number: 19704403 # Func_class: L 
Replication, recombination and repair; U Intracellular trafficking, secretion, and vesicular transport # Function: Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake # Organism: Fusobacterium nucleatum # 1 284 5 288 288 529 99.0 1e-150 MNYNFITINDDIYPECLKEISNPPLKLYYKGNLDLLKEERLISVVGTRNPSSYGKLCCEY MVKKMTGANITIVSGFAKGIDSIAHKTSLLTEGKTIAVIASGLDIVYPASNLSLYREIEE KGLILSEYEAGVKPFKSNFPQRNRIIAGLSKGTIVVESKDRGGSLITADLALEFNRDVYA VPGDVFSEYSKGCNNLIRDSRAKSLSNINELLKDYSWEIEEKNDSNKYTKNQILILNSLS SEKNLDNILIETKIEQTEILAELMTLEIMGVIKSIAGGRYKKIL >gi|296154631|gb|ADVK01000026.1| GENE 14 13378 - 14130 1022 250 aa, chain - ## HITS:1 COG:FN1067 KEGG:ns NR:ns ## COG: FN1067 COG0457 # Protein_GI_number: 19704402 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 14 250 1 237 237 423 99.0 1e-118 MKKILISLFLIISVIGLSESDRETGITAERNETPVIQNTTDDGGETVENPEQQKTTAGFY EYRPEILIQLDEQMKSAGRGSVGQLNARYEQELNAYLEMYSYDSDRIFYLANEYMLLNNY HRANKIFLKDNKDIKNVFGAATTYRFMGQNENAIQKYTQAISMNPGFAESYLGRGLANRN LDNYDSAVNDLQTYISRTGAHDGYVALADVYFKMGKNKEAYSIASQGLAKYKDSRILRTL ANNIYKNKID >gi|296154631|gb|ADVK01000026.1| GENE 15 14111 - 15325 1499 404 aa, chain - ## HITS:1 COG:FN1066 KEGG:ns NR:ns ## COG: FN1066 COG1570 # Protein_GI_number: 19704401 # Func_class: L Replication, recombination and repair # Function: Exonuclease VII, large subunit # Organism: Fusobacterium nucleatum # 1 404 1 404 404 725 100.0 0 MEKIYSVSEFNRMVKSYIDDIDDFQEFFIEGEISNITYYKSGHLYFSIKDSKSQIKCVAF NYKLKRIPEDLKEGDLVKLFGDVGFYEARGDFQVLARYIEKQNALGSLYAKLEKVKEKLT GLGYFNEEHKKDLPKFPKNIGVVTALTGAALQDIIKTTRKRFNSINIYIYPAKVQGIGAE QEIIKGIETLNKIKEIDFIIAGRGGGSIEDLWAFNEEEVAMAFFNSKKPIISAVGHEIDF LLSDLTADKRAATPTQAIELSVPEKESLLEDLKAREIYITKLLKSYVDSMKRELLLRIEN YYLKNFPNTVNSLRESIVEKEIQLKEAMESFIEQKRNIFENKIDKISVLNPINTLKRGYT VSQVKNKRIDVLDDIEINDEMMTILKDGKVISVVKEKIYEKNIN >gi|296154631|gb|ADVK01000026.1| GENE 16 15440 - 15886 793 148 aa, chain - ## HITS:1 COG:no KEGG:FN1065 NR:ns ## KEGG: FN1065 # Name: not_defined 
# Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 18 148 1 131 131 174 99.0 1e-42 MPLTFNLLILIITGILTLIGNFVGFKVSPVEAIPGVLILIIIAFIGILLSKIIPMKIPSV AYIVTLATILTIPGMPMSELISNYTAKVNFLALCTPILAYAGIYTGKNLDTLKRTGWKIF ILALFVMLGTYLGSAIIAQVILKMLGQI >gi|296154631|gb|ADVK01000026.1| GENE 17 15888 - 16649 1130 253 aa, chain - ## HITS:1 COG:no KEGG:FN1064 NR:ns ## KEGG: FN1064 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 253 1 253 253 424 99.0 1e-117 MKNIKLHFIALCLVVISEYIGIVKFNLGKGVIALFPMLYAMVFGVLTKFLKITNEKDMED AGSLVSVTLLLLMAKYGTTIGPSFPKLVSASPALILQEFGNIGTVLLGVPIALFLGLKRE TIGATHSIAREPNIAIIADKYGLDSPEGEGVLGVYIVGTVFGTIFIGLLASFLAAYTPLH PYSLAMASGVGSASMMTASVGALSTLYPDMADTIAAFGASSNLLSGLDGVYMSIWIALPL TEYLYKKFNKEGK >gi|296154631|gb|ADVK01000026.1| GENE 18 16668 - 17852 1840 394 aa, chain - ## HITS:1 COG:FN1063 KEGG:ns NR:ns ## COG: FN1063 COG1473 # Protein_GI_number: 19704398 # Func_class: R General function prediction only # Function: Metal-dependent amidase/aminoacylase/carboxypeptidase # Organism: Fusobacterium nucleatum # 1 394 1 394 394 788 99.0 0 MNVLEEVKKIEQKIIQWRRDLHKIPELNLYLPKTTKYIEEKLKKMGIEYKTLVNGNAIVG LIKGNSEGKTIGLRADMDALPIEEETGLEFSSTHKGCMHACGHDGHTAMLLGAAKILNEN RDKFKGNVKLLFQPGEEYPGGALPMIEEGAMENPKIDVVIGLHEGVIDERVGKGKIAYKD GCMMASMDRFLIKVKGKGCHGAYPQMGVDPIVIASEIILSLQKISSREINTNEPIIVSVC RINGGFSQNIIPDMVELEGTVRATNNETRKFIANRIEEIVKGITSANRGSYEIEYNFKYP AVINDKEFNKFFLESAKKIIGEENIFELPTPVMGGEDMAYFLEKAPGTFFFLSNPKVYPD GKVYPHHSPKFDVDENYFHIGAALFVQTVLDYLK >gi|296154631|gb|ADVK01000026.1| GENE 19 18041 - 18823 405 260 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163802692|ref|ZP_02196583.1| 30S ribosomal protein S21 [Vibrio campbellii AND4] # 2 253 10 261 271 160 32 6e-39 MKFSFQEKGTGETIVLIHSYLWDSEMWREQIDLLSKKYHCLAIDLPSHGKENFDLKKGYS LSDLAKDVVDFIDEKGIKEFHYIGLSVGGMLAPYLYELKKDSMKSIIMMDSYSGEEGIEK HTLYFKLLDLIEEYQTIPQPMAIQIANMFFAKNNCNVENRNYFNFLNRLQNFDKTNIKNI VTLGRAIFGRDDKLDVIPKISCPLYFIVGNEDEPRPPKESLEMSKLNKNSKYIVVENAGH 
ISNLDNAKFVNKIFSEIFKL >gi|296154631|gb|ADVK01000026.1| GENE 20 18857 - 19411 311 184 aa, chain + ## HITS:1 COG:no KEGG:FN1061 NR:ns ## KEGG: FN1061 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 184 1 184 184 269 99.0 4e-71 MLHNINLLGFLLITVSFLFGIKLPDWDFKLGLRHRNILTHSPFITIIFIALYETKTSYFF KYFIVGFSTATAIHILFDLFPRKWYGGALLKIPFNNISCSEETTKIFFTITVLISTFLGI FYMTEIQEYYFVLFYAILTFIKKRKYENSFIKPAFIFAFLYLFLGSFKFEVISKIIRGVI SKFI Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:33:55 2011 Seq name: gi|296154556|gb|ADVK01000027.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00032, whole genome shotgun sequence Length of sequence - 70516 bp Number of predicted genes - 78, with homology - 73 Number of transcription units - 30, operones - 18 average op.length - 3.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 26 - 106 75 ## - Prom 127 - 186 5.0 2 2 Tu 1 . - CDS 212 - 721 684 ## COG2963 Transposase and inactivated derivatives - Prom 894 - 953 3.9 3 3 Op 1 . - CDS 955 - 1017 101 ## 4 3 Op 2 . - CDS 1060 - 1707 677 ## COG1272 Predicted membrane protein, hemolysin III homolog - Prom 1908 - 1967 13.1 + Prom 1824 - 1883 11.8 5 4 Tu 1 . + CDS 1934 - 2113 353 ## FN1884 hypothetical protein + Term 2134 - 2186 8.1 + Prom 2158 - 2217 12.6 6 5 Tu 1 . + CDS 2244 - 2633 556 ## COG0824 Predicted thioesterase + Term 2663 - 2706 6.9 - Term 2651 - 2694 3.1 7 6 Tu 1 . - CDS 2702 - 3280 776 ## COG0778 Nitroreductase - Prom 3323 - 3382 10.9 - Term 3352 - 3388 2.4 8 7 Op 1 . - CDS 3406 - 3678 416 ## PROTEIN SUPPORTED gi|19705184|ref|NP_602679.1| SSU ribosomal protein S20P - Prom 3714 - 3773 7.4 9 7 Op 2 . 
- CDS 3827 - 4126 373 ## FN1878 hypothetical protein - Prom 4211 - 4270 14.8 + Prom 4162 - 4221 9.4 10 8 Tu 1 1/1.000 + CDS 4346 - 5674 1573 ## COG2252 Permeases + Prom 5734 - 5793 7.2 11 9 Op 1 1/1.000 + CDS 5819 - 6421 770 ## COG0693 Putative intracellular protease/amidase 12 9 Op 2 1/1.000 + CDS 6483 - 6977 175 ## PROTEIN SUPPORTED gi|90022209|ref|YP_528036.1| ribosomal protein S2 13 9 Op 3 1/1.000 + CDS 6981 - 7430 802 ## COG0698 Ribose 5-phosphate isomerase RpiB 14 9 Op 4 . + CDS 7443 - 7781 459 ## COG0537 Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases 15 9 Op 5 . + CDS 7798 - 8094 382 ## FN1872 hypothetical protein + Term 8171 - 8218 4.2 + Prom 8207 - 8266 11.7 16 10 Tu 1 . + CDS 8290 - 8586 442 ## FN1871 hypothetical protein + Term 8605 - 8652 5.1 + Prom 8626 - 8685 14.0 17 11 Op 1 . + CDS 8710 - 10020 1561 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 18 11 Op 2 . + CDS 10042 - 11094 1147 ## gi|296328268|ref|ZP_06870797.1| conserved hypothetical protein 19 11 Op 3 . + CDS 11110 - 11835 497 ## FN1870 hypothetical protein 20 11 Op 4 . + CDS 11855 - 12574 571 ## FN1870 hypothetical protein + Term 12583 - 12622 1.3 + Prom 13082 - 13141 6.6 21 12 Op 1 . + CDS 13185 - 13571 682 ## FN1869 hypothetical protein 22 12 Op 2 . + CDS 13594 - 14409 1340 ## COG3246 Uncharacterized conserved protein 23 12 Op 3 . + CDS 14427 - 15464 1727 ## FN1867 Zn-dependent alcohol dehydrogenase and related dehydrogenase + Prom 15467 - 15526 3.2 24 13 Op 1 . + CDS 15584 - 16861 1728 ## COG1509 Lysine 2,3-aminomutase 25 13 Op 2 . + CDS 16865 - 17881 1330 ## FN1865 hypothetical protein 26 13 Op 3 . + CDS 17883 - 19346 1723 ## COG1193 Mismatch repair ATPase (MutS family) 27 13 Op 4 . 
+ CDS 19346 - 20902 2307 ## FN1863 L-beta-lysine 5,6-aminomutase alpha subunit (EC:5.4.3.3) 28 13 Op 5 1/1.000 + CDS 20902 - 21693 1480 ## COG5012 Predicted cobalamin binding protein 29 13 Op 6 1/1.000 + CDS 21716 - 22339 716 ## COG1279 Lysine efflux permease + Prom 22342 - 22401 5.1 30 14 Tu 1 . + CDS 22475 - 24124 2081 ## COG1757 Na+/H+ antiporter + Term 24146 - 24198 3.6 + Prom 24148 - 24207 9.6 31 15 Op 1 . + CDS 24229 - 24315 113 ## 32 15 Op 2 . + CDS 24375 - 25481 1366 ## FN1859 major outer membrane protein - Term 25286 - 25328 -0.6 33 16 Tu 1 . - CDS 25478 - 25552 143 ## - Prom 25716 - 25775 8.0 34 17 Tu 1 . + CDS 25530 - 25661 122 ## + Prom 25669 - 25728 5.3 35 18 Op 1 1/1.000 + CDS 25749 - 27125 2103 ## COG2031 Short chain fatty acids transporter 36 18 Op 2 21/0.000 + CDS 27204 - 27857 1176 ## COG1788 Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit 37 18 Op 3 . + CDS 27875 - 28528 1160 ## COG2057 Acyl CoA:acetate/3-ketoacid CoA transferase, beta subunit + Term 28535 - 28603 21.1 - Term 28437 - 28487 1.3 38 19 Op 1 . - CDS 28565 - 30022 1411 ## COG4865 Glutamate mutase epsilon subunit 39 19 Op 2 . - CDS 30038 - 31426 1884 ## FN1854 methylaspartate mutase (EC:5.4.99.1) 40 19 Op 3 . - CDS 31444 - 31854 565 ## COG2185 Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) - Prom 31880 - 31939 14.7 - Term 31901 - 31953 9.1 41 20 Tu 1 . - CDS 31975 - 32379 612 ## FN1852 hypothetical protein - Prom 32415 - 32474 13.7 42 21 Op 1 1/1.000 - CDS 32489 - 33793 321 ## PROTEIN SUPPORTED gi|162456259|ref|YP_001618626.1| putative ribosomal protein 43 21 Op 2 1/1.000 - CDS 33838 - 34767 1198 ## COG0332 3-oxoacyl-[acyl-carrier-protein] synthase III 44 21 Op 3 2/0.250 - CDS 34764 - 36038 1228 ## COG1541 Coenzyme F390 synthetase - Prom 36059 - 36118 5.2 45 22 Op 1 3/0.250 - CDS 36217 - 37035 864 ## COG0491 Zn-dependent hydrolases, including glyoxylases 46 22 Op 2 . 
- CDS 36998 - 37981 1092 ## COG0451 Nucleoside-diphosphate-sugar epimerases 47 22 Op 3 . - CDS 37981 - 38607 627 ## FN1846 hypothetical protein 48 22 Op 4 . - CDS 38597 - 39763 750 ## FN1845 ceramide glucosyltransferase (EC:2.4.1.80) 49 22 Op 5 . - CDS 39760 - 40533 768 ## COG0300 Short-chain dehydrogenases of various substrate specificities - Prom 40569 - 40628 12.3 50 23 Op 1 . - CDS 40929 - 42380 1561 ## COG3291 FOG: PKD repeat 51 23 Op 2 9/0.250 - CDS 42416 - 42820 721 ## COG3412 Uncharacterized protein conserved in bacteria 52 23 Op 3 10/0.000 - CDS 42830 - 43438 965 ## COG2376 Dihydroxyacetone kinase 53 23 Op 4 1/1.000 - CDS 43474 - 44460 1681 ## COG2376 Dihydroxyacetone kinase - Prom 44500 - 44559 11.7 - Term 44517 - 44568 7.3 54 24 Op 1 18/0.000 - CDS 44580 - 46073 2422 ## COG0554 Glycerol kinase 55 24 Op 2 . - CDS 46092 - 46823 1341 ## COG0580 Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) - Prom 47017 - 47076 11.3 + Prom 47056 - 47115 15.4 56 25 Op 1 1/1.000 + CDS 47140 - 47670 586 ## COG1852 Uncharacterized conserved protein 57 25 Op 2 . + CDS 47681 - 50491 3613 ## COG0457 FOG: TPR repeat 58 25 Op 3 . + CDS 50513 - 50890 550 ## FN1835 hypothetical protein 59 25 Op 4 30/0.000 + CDS 50918 - 51529 615 ## COG0811 Biopolymer transport proteins 60 25 Op 5 11/0.000 + CDS 51540 - 51983 547 ## COG0848 Biopolymer transport protein 61 25 Op 6 1/1.000 + CDS 51989 - 52780 980 ## COG0810 Periplasmic protein TonB, links inner and outer membranes 62 25 Op 7 1/1.000 + CDS 52800 - 54191 1631 ## COG2204 Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 63 25 Op 8 . + CDS 54195 - 55649 1730 ## COG2812 DNA polymerase III, gamma/tau subunits 64 25 Op 9 . 
+ CDS 55664 - 56494 706 ## FN1829 hypothetical protein 65 25 Op 10 16/0.000 + CDS 56511 - 56960 729 ## PROTEIN SUPPORTED gi|19705133|ref|NP_602628.1| 50S ribosomal protein L9P 66 25 Op 11 1/1.000 + CDS 56972 - 58312 2090 ## COG0305 Replicative DNA helicase 67 25 Op 12 . + CDS 58334 - 59557 1579 ## COG0826 Collagenase and related proteases - Term 59472 - 59535 -0.8 68 26 Tu 1 . - CDS 59730 - 60185 743 ## FN1825 hypothetical protein - Prom 60249 - 60308 8.1 + Prom 60226 - 60285 11.0 69 27 Op 1 . + CDS 60315 - 61931 1985 ## COG1227 Inorganic pyrophosphatase/exopolyphosphatase 70 27 Op 2 . + CDS 61954 - 62169 253 ## gi|296328317|ref|ZP_06870846.1| conserved hypothetical protein + Term 62192 - 62229 4.1 71 28 Op 1 . - CDS 62198 - 62773 589 ## gi|296328318|ref|ZP_06870847.1| conserved hypothetical protein 72 28 Op 2 . - CDS 62816 - 63703 765 ## COG0583 Transcriptional regulator - Prom 63788 - 63847 18.5 + Prom 63745 - 63804 10.7 73 29 Op 1 3/0.250 + CDS 63992 - 65404 279 ## PROTEIN SUPPORTED gi|148544941|ref|YP_001272311.1| 50S ribosomal protein L29P 74 29 Op 2 . + CDS 65423 - 67225 2807 ## COG0028 Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] + Prom 67227 - 67286 5.4 75 30 Op 1 11/0.000 + CDS 67320 - 67823 529 ## COG3090 TRAP-type C4-dicarboxylate transport system, small permease component 76 30 Op 2 9/0.250 + CDS 67836 - 69128 700 ## PROTEIN SUPPORTED gi|149195935|ref|ZP_01872991.1| Ribosomal protein L16 77 30 Op 3 1/1.000 + CDS 69143 - 70165 312 ## PROTEIN SUPPORTED gi|149199369|ref|ZP_01876406.1| Ribosomal protein L22 78 30 Op 4 . 
+ CDS 70186 - 70516 467 ## COG0169 Shikimate 5-dehydrogenase Predicted protein(s) >gi|296154556|gb|ADVK01000027.1| GENE 1 26 - 106 75 26 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MRRIKILLKKLEKFTMRIKEDMVIAE >gi|296154556|gb|ADVK01000027.1| GENE 2 212 - 721 684 169 aa, chain - ## HITS:1 COG:FN1887 KEGG:ns NR:ns ## COG: FN1887 COG2963 # Protein_GI_number: 19705192 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 169 1 169 169 233 95.0 1e-61 MSKLTRENKIEIFERRKNGETISSLAKAFDVHKSNIEYLIALIKKHGFDILRKDKNRLYS KDFKLQIINRILVNHESINSVAIDIGLVSASILHNWLSKFKENEYNVVEKKKGRKPKFMT KPKKNDKVLSEKEKIKLLEDEIIYLKAENEYLKKLRALVQERELEEKKK >gi|296154556|gb|ADVK01000027.1| GENE 3 955 - 1017 101 20 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKILLALMLTILFVACGGS >gi|296154556|gb|ADVK01000027.1| GENE 4 1060 - 1707 677 215 aa, chain - ## HITS:1 COG:FN1885 KEGG:ns NR:ns ## COG: FN1885 COG1272 # Protein_GI_number: 19705190 # Func_class: R General function prediction only # Function: Predicted membrane protein, hemolysin III homolog # Organism: Fusobacterium nucleatum # 1 215 1 215 215 343 100.0 2e-94 MRLNRRLTFSEELGNTITHGVMAAATLVLLPIGSLWGYFHGGYASATGISIFIMSLLLMF LSSTLYHAMNHNSKHKAIFRILDHIFIYVAIAGSYTPVALVIVGGWKGILIVVLQWVIVL VGILYKSLATRAMPKLSLTLYLVMGWTAIFFFPTLVRKANTVFLVLVILGGVMYSIGAYF FVHDYKKYYHMIWHLFINIAAILHLIGIGFFLYRK >gi|296154556|gb|ADVK01000027.1| GENE 5 1934 - 2113 353 59 aa, chain + ## HITS:1 COG:no KEGG:FN1884 NR:ns ## KEGG: FN1884 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 59 1 59 59 75 100.0 9e-13 MASCENMKKGEVYKCQCCDFEIEVKKACDCGTNDKCETHDETHECCEFICCGKPLVKVK >gi|296154556|gb|ADVK01000027.1| GENE 6 2244 - 2633 556 129 aa, chain + ## HITS:1 COG:FN1881 KEGG:ns NR:ns ## COG: FN1881 COG0824 # Protein_GI_number: 19705186 # Func_class: R General function prediction only # Function: Predicted thioesterase # Organism: Fusobacterium nucleatum # 1 129 1 129 129 
228 99.0 3e-60 MFKFNYTIKQEDLNYGDHVGNERALLFFQWARESFLRQNNLSESNIGDGSGFIQVEATVQ YKKQLFLDQKIEVRITKIEIKGLKIIFEHEIYNGQDLVITGTATVLAYNYEEQKVKKVPA NFKELVKKY >gi|296154556|gb|ADVK01000027.1| GENE 7 2702 - 3280 776 192 aa, chain - ## HITS:1 COG:FN1880 KEGG:ns NR:ns ## COG: FN1880 COG0778 # Protein_GI_number: 19705185 # Func_class: C Energy production and conversion # Function: Nitroreductase # Organism: Fusobacterium nucleatum # 1 192 1 192 192 345 98.0 2e-95 MIIENIKHARSHRRFTEKKINEEEILEMLEGARFSASTKNAQILRYSYTIDDENCKKLFS AVSLGGLLKNEDKATLEERARGFILISAKKDVKTPESRLYFDVGIASQNIILIADELGYG ACIVISYNKKAFEEILGLPEEYDSKAVIILGESKDIVKLVDSKDEEDTKYFVENGIHHVP KLKLGDLILGKK >gi|296154556|gb|ADVK01000027.1| GENE 8 3406 - 3678 416 90 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19705184|ref|NP_602679.1| SSU ribosomal protein S20P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 90 1 90 90 164 100 9e-40 MANSKSAKKRILVAERNRVRNQAVKTRVKTMAKKVLSTIEVKDVEAAKVALSVAYKEFDK AVSKGILKKNTASRKKARLAAKVNSLVSSL >gi|296154556|gb|ADVK01000027.1| GENE 9 3827 - 4126 373 99 aa, chain - ## HITS:1 COG:no KEGG:FN1878 NR:ns ## KEGG: FN1878 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 99 1 99 99 157 100.0 1e-37 MKPEVRDAINNINRFIQETKFIDVSSNLKVEENVVARNLQGKESDVVAEVMENLELIFKE ISKVHNKGQADEYTERYYYLSDKIYDDIDKFKADFFITK >gi|296154556|gb|ADVK01000027.1| GENE 10 4346 - 5674 1573 442 aa, chain + ## HITS:1 COG:FN1877 KEGG:ns NR:ns ## COG: FN1877 COG2252 # Protein_GI_number: 19705182 # Func_class: R General function prediction only # Function: Permeases # Organism: Fusobacterium nucleatum # 1 427 1 427 442 695 100.0 0 MKSNFETSLSKSLEKRFKFLDNGTNLKTELIAGLTTFATMSYVLATIPNMLEGAGLNRAS ILTALIIFIIICSIAMALYTNRPFALAPGLSSVAIIGTALPQMNMPVEVAFGLVFLSGLI FVIISFVGIREIVVKAIPASVKISISAGIGLYISLIGLKMAGVVVANPKNNTLNLGDMTT AKSILFVIGFLLILVLEARKIKGSLILAILIVTIIGIPMGVTKVPTNLINIPTGISDISF KIDILGALKPEYFPWIFTFFVPDFFGTMGIILGIANRAGWLDKDGNMQDIDRCFKVDSLS TVAGSFFCMPVMTTYLESASGVEDGGRTGMTALFTSFLFALTLLFTPIALMVPGVATAPV 
LTIIGFQMLSSMKSVNYNDKTESLPAFIAVAMTIFTFNIATGLSLSVLSYIILKVFSGKA KEIPKVMYGLALVLLYYLYTLI >gi|296154556|gb|ADVK01000027.1| GENE 11 5819 - 6421 770 200 aa, chain + ## HITS:1 COG:FN1876 KEGG:ns NR:ns ## COG: FN1876 COG0693 # Protein_GI_number: 19705181 # Func_class: R General function prediction only # Function: Putative intracellular protease/amidase # Organism: Fusobacterium nucleatum # 1 200 1 200 200 357 99.0 7e-99 MKKIAIFLFEGAELFEIASFTDVFGWNNVVGLKEFRDIKVETISYKESIKCTWGGELRAE KIITEDNVENFYEYDALVIPGGFGKANFFKDNDNEIFKKLIKYFSENNKVIVAICSAVIN LLETTYIRDKKVTTYLLDNKRYFNQLKNYNIIPVEEEIVIDNNLLTCSGPGNALELSFRV LEKLTSKENVKIIQNNMYLK >gi|296154556|gb|ADVK01000027.1| GENE 12 6483 - 6977 175 164 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|90022209|ref|YP_528036.1| ribosomal protein S2 [Saccharophagus degradans 2-40] # 1 145 6 149 151 72 29 8e-12 MKIGENKVVTLEYKVYDADTKELLEDTAELGPYFYIQGMGQFLPKIEAALDGKSKGHKLK IEIPMDEAYGDYDEELVEELTKADFTDFDDIYEGMEFVVELEDGTEMIAVITEIDGDKVY TDSNHPFSGRNLLFEVEVADVREATDEELDHGHVHFHGFEDDKE >gi|296154556|gb|ADVK01000027.1| GENE 13 6981 - 7430 802 149 aa, chain + ## HITS:1 COG:FN1874 KEGG:ns NR:ns ## COG: FN1874 COG0698 # Protein_GI_number: 19705179 # Func_class: G Carbohydrate transport and metabolism # Function: Ribose 5-phosphate isomerase RpiB # Organism: Fusobacterium nucleatum # 1 149 1 149 149 281 99.0 3e-76 MKIALGADHGGYELKEKIKQHLAKKEGIEVIDFGTNSTESVDYPKYGHLVAKSVVDKEVD FGILVCGTGIGISIAANKIKGIRAANCTNTTMAKLTREHNNANILALGARIVGDVLALDI VDEFLSVSFEGGRHQKRVDQIEVEECNLF >gi|296154556|gb|ADVK01000027.1| GENE 14 7443 - 7781 459 112 aa, chain + ## HITS:1 COG:FN1873 KEGG:ns NR:ns ## COG: FN1873 COG0537 # Protein_GI_number: 19705178 # Func_class: F Nucleotide transport and metabolism; G Carbohydrate transport and metabolism; R General function prediction only # Function: Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases # Organism: Fusobacterium nucleatum # 1 112 1 112 112 211 100.0 4e-55 MATLFTKIINKEIPANIVYEDDDVIAFKDIAPVAPVHVLVVPKKEIPTINDITDEDTLLI 
GKVYRVIGKLAKEFGIDKDGYRVVSNCNEHGGQTVFHIHFHLIGGEKLGTMV >gi|296154556|gb|ADVK01000027.1| GENE 15 7798 - 8094 382 98 aa, chain + ## HITS:1 COG:no KEGG:FN1872 NR:ns ## KEGG: FN1872 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 15 98 1 84 84 109 98.0 3e-23 MKKILILMMIAFSLVVYGEKFPYTSESDRKKILKELKEVSDRLEKNGNEKDAQLVMKKID EIMKITAELERRNADGDKKAEEELDKWSKDIEELDIKF >gi|296154556|gb|ADVK01000027.1| GENE 16 8290 - 8586 442 98 aa, chain + ## HITS:1 COG:no KEGG:FN1871 NR:ns ## KEGG: FN1871 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 15 98 1 84 84 113 98.0 3e-24 MKKILILIVMVFGLVACGEKFPYTSKSNKEKMLKEFEVAAEKAEKTNDEKDIQVAFEKMG EIIKIAEELDKRSSNGDKKAKEELDKWDEVIKELNIQF >gi|296154556|gb|ADVK01000027.1| GENE 17 8710 - 10020 1561 436 aa, chain + ## HITS:1 COG:MA2121 KEGG:ns NR:ns ## COG: MA2121 COG2865 # Protein_GI_number: 20090964 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Methanosarcina acetivorans str.C2A # 1 333 11 343 458 130 29.0 4e-30 MELKENINVEFKKKFTTELKKEVVAFANTNEGTIYIGIDDFGNIIGIKNVDEVLNQVVLS IRNSIKPDITMYCNSKIERIENKDVIIIQVQRGALRPYYIADKGLKPSGVYVRQGSSSVP ASEESIRKMIKETDGDSYEKLRSLNQDLTCDYTKKIFEKNALLFGLSQKKTLGLIGEDDL YTNLALLLSDQCNHTLKVAVFEGIEKNIFKDRKEFKGSLLKQVTEAYEFINLLNKTEATF EGLTRKDERDYPTEAIREALLNAVVHREYSFSGSTLVNIYEDRIEFISLGGIVSGLSLDS IMLGVSQSRNEKLANIFYRLHLIEAYGTGIKKIFSSYEKIGLKPTIKTEIGAFQVVLPNI HYVKKIENDNTEIKSIYKDILDFIEKKGGTTRKEIEEYINLSQTRVITLLKEMLELNLIK KEKDKIDRRAYKYYRK >gi|296154556|gb|ADVK01000027.1| GENE 18 10042 - 11094 1147 350 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328268|ref|ZP_06870797.1| ## NR: gi|296328268|ref|ZP_06870797.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 350 1 350 350 556 100.0 1e-157 MKPKNIKVKDNEINISFFDKSEEIIKDEIVKDFDKKSEEKKDNTDKKIDALTNNKVKEKA LSSQKIFRQNINKSYKNSQNKIKNKIEEYNLYLVFAASILTIMTLGWAIFYSFINNEYQN NRSKIFSMPKNFFKINLYENIMIWLFILIPPLLYVSYKKVKGIKIPFYFFNVFMEVLFTI VNMALWSDKLDSFFSKYSLYNKKPELFMYIFLLFFTIFFIVINLIVLKERKDKEINYSAC LGYLYSNSKNLKKKNREGWKNSIFIILYVGILGLAYYGRAKPYECVVVEGQEINSKKIKV IIGEHEGIYAIATVTDDGKVIIFEKDSVELKKIEDIKKVKKMYFREEKFE >gi|296154556|gb|ADVK01000027.1| GENE 19 11110 - 11835 497 241 aa, chain + ## HITS:1 COG:no KEGG:FN1870 NR:ns ## KEGG: FN1870 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 241 6 242 242 330 90.0 2e-89 MYYHIVKKDGLKYKDLDIKEVDKIIKKEKLNFNNSRIYSSVQDLEHTFKIFEDEESRNLP QILDRNEIRISDILSCKQKKKYLKNCSYDFFLKIFNSSFKPRGKVFLNKTFIIWIMLLST ILNRYLFYILYKYYYSFTSYKLFFGFNIKPLWYITLIYILVPLTLILLIWYRDKYFKENY LIIFIVLMVIINGVIAYTIGDIIENIVKNFGGLVAIIIGLIFTQLFYNLFRRLSYNKYKD F >gi|296154556|gb|ADVK01000027.1| GENE 20 11855 - 12574 571 239 aa, chain + ## HITS:1 COG:no KEGG:FN1870 NR:ns ## KEGG: FN1870 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 227 6 242 242 100 35.0 5e-20 MYYHLIMEDGLKYKDLSIEELEKIIDEKKLSYNNSRVYSSERKLNEVYENNKEELTEEGK DISFENVKIEKLISIAERKNDLQNSSYKFFLDIKNPMEKINWKNFLSKTYIIWAMVIMYS IKLLLIHNYLFTGEMEYKDFELIEKFLNNSIFLIILLVWRKDKYYKKEYVVACILPNVLI NIINFIIFKYIDATLTIIVITDILRAILIQLVYNYIRERSYRDYKELDTINIPMNKKLF >gi|296154556|gb|ADVK01000027.1| GENE 21 13185 - 13571 682 128 aa, chain + ## HITS:1 COG:no KEGG:FN1869 NR:ns ## KEGG: FN1869 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 128 1 128 128 254 100.0 1e-66 MKSLIRLRMSSHDAHYGGNLVDGARMLQLFGDVATELLIQLDGDEGLFKAYDSVEFMAPV FAGDYIEAEGEIVNVGNSSRKMVFEARKVIVPRPDISDSAADVLAEPIVVCRATGTCVTP KDKQRGKK >gi|296154556|gb|ADVK01000027.1| GENE 22 13594 - 14409 1340 271 aa, chain + ## HITS:1 COG:FN1868 KEGG:ns NR:ns ## COG: FN1868 COG3246 # Protein_GI_number: 19705173 # Func_class: S Function unknown # 
Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 271 2 272 272 543 100.0 1e-154 MEKLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDK ERFRKCIEAIREKCPDVIIQPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIF VNTENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQKQGFIQKPMHFDFVLGVQMSAS ARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGILAKSNG ELVERVVRLAKELGREIATPDEARQILSLKK >gi|296154556|gb|ADVK01000027.1| GENE 23 14427 - 15464 1727 345 aa, chain + ## HITS:1 COG:no KEGG:FN1867 NR:ns ## KEGG: FN1867 # Name: not_defined # Def: Zn-dependent alcohol dehydrogenase and related dehydrogenase # Organism: F.nucleatum # Pathway: not_defined # 1 345 1 345 345 627 99.0 1e-178 MKKGCKYGTHRVIEPAGVLPQPAKKISNDMEIFSNEILIDVIALNIDSASFTQIEEEAGH DVEKVKAKIKEIVAERGKMQNPVTGSGGMLIGTVEKIGDDLVGKTDLKVGDKIATLVSLS LTPLRIDEIINIKPEIDRVEIKGKAILFESGIYAVLPKDMPENLALAALDVAGAPAQVAK LVKPCQSVAILGSAGKSGMLCAYEAVKRVGPTGKVIGVVRNDKEKALLQRVSDKVKIVIA DATKPMDVLHAVLEANDGKEVDVAINCVNVPNTEMSTILPVKEFGIAYFFSMATGFSKAA LGAEGVGKDITMIVGNGYTVDHAAITLEELRESAVLREIFNEIYL >gi|296154556|gb|ADVK01000027.1| GENE 24 15584 - 16861 1728 425 aa, chain + ## HITS:1 COG:FN1866 KEGG:ns NR:ns ## COG: FN1866 COG1509 # Protein_GI_number: 19705171 # Func_class: E Amino acid transport and metabolism # Function: Lysine 2,3-aminomutase # Organism: Fusobacterium nucleatum # 1 425 1 425 425 868 100.0 0 MNTVNTRKKFFPNVTDEEWNDWTWQVKNRLESVEDLKKYVDLSEEETEGVVRTLETLRMA ITPYYFSLIDLNSDRCPIRKQAIPTIQEIHQSDADLLDPLHEDEDSPVPGLTHRYPDRVL LLITDMCSMYCRHCTRRRFAGSSDDAMPMDRIDKAIEYIAKTPQVRDVLLSGGDALLVSD KKLESIIQKLRAIPHVEIIRIGSRTPVVLPQRITPELCNMLKKYHPIWLNTHFNHPQEVT PEAKKACEMLADAGVPLGNQTVLLRGINDSVPVMKRLVHDLVMMRVRPYYIYQCDLSMGL EHFRTPVSKGIEIIEGLRGHTSGYAVPTFVVDAPGGGGKTPVMPQYVISQSPHRVVLRNF EGVITTYTEPENYTHEPCYDEEKFEKMYEISGVYMLDEGLKMSLEPSHLARHERNKKRAE AEGKK >gi|296154556|gb|ADVK01000027.1| GENE 25 16865 - 17881 1330 338 aa, chain + ## HITS:1 COG:no KEGG:FN1865 NR:ns ## KEGG: FN1865 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: 
not_defined # 19 338 1 320 320 575 100.0 1e-162 MLDTYKFIEKYKRISIIGMEKNVGKTTLLNKLIADIGKNKKLGLTSIGRDGEDIDVVTNT DKPRIYVREGSIIATGRDCLNKCDITKEILYVTDFTTPMGSIVIVRALSDGYVDIAGPSY NKQVKIVVELMEKFGSEISIVDGALGRKSTAISDVSEATILSTGAALSLDMPKVIDETKK TVYFLRLDEIDNDIKEKIKDFKEEKAVLFYKNGEVAVLEVNNSIDLSNILKEYLKKDLEY FYIRGAITPKIIEAFVNSRGNYEKITLLAEDGTKFFLNSSLLNKAKLSGIEFKVLNKINL LFVTINPHSPLGVDFNKEEFKTRLQKEISVPVINVLGD >gi|296154556|gb|ADVK01000027.1| GENE 26 17883 - 19346 1723 487 aa, chain + ## HITS:1 COG:FN1864 KEGG:ns NR:ns ## COG: FN1864 COG1193 # Protein_GI_number: 19705169 # Func_class: L Replication, recombination and repair # Function: Mismatch repair ATPase (MutS family) # Organism: Fusobacterium nucleatum # 1 487 1 487 487 805 99.0 0 MKFIDENSLNRLNFKELLSRIDMYSGYGKNKLNNLSNFLVGEENKLEEEFERMEKIYNLI SDDKREMLKLEMILYKFDNIRKTVENAINDIVLDTVDLFELKVQLMAMIELNLLLNENRD VFSDFILEDMGELFGALDPNNEKIATFYIYESYSVILKEIRRQKKEVENKLFNETDYETI QKLKNERLSILVDEEREEFKIRRNLTALVKKFSSIFLNNTEKIGNLDFVLGKVRFAKEYN GIRPVVSRKKEIVLEDAINLEVKEVLEAKNKKYTPISIKLNVGTTMITGANMGGKSVALK TIAENVLLFQMGFFVFAKYASIPLLDFIFFVSDDMQDISKGLSTFGAEIIKLKEINSYVK SGTGLIVFDEFARGTNPKEGQKFVRALAKYLNEKSSISIITTHFDSVVEKNMKHYQVVGL KNLDFESLKNRLKANNSLELIQDNMDFTLEESIETEVPKDALNIAKLIGLDDEISEMIYK EYEWEEQ >gi|296154556|gb|ADVK01000027.1| GENE 27 19346 - 20902 2307 518 aa, chain + ## HITS:1 COG:no KEGG:FN1863 NR:ns ## KEGG: FN1863 # Name: not_defined # Def: L-beta-lysine 5,6-aminomutase alpha subunit (EC:5.4.3.3) # Organism: F.nucleatum # Pathway: Lysine degradation [PATH:fnu00310] # 1 518 1 518 518 1039 100.0 0 MGKLDLDWGLVKEARESAKKIAADAQVFIDAHSTVTVERTICRLLGIDGVDEFGVPLPNV IVDFIKDNGNISLGVAKYIGNAMIETKLQPQEIAEKVAKKELDITKMQWHDDFDIQLALK DITHSTVERIKANRKAREDYLEQFGGDKKGPYIYVIVATGNIYEDVTQAVAAARQGADVV AVIRTTGQSLLDFVPFGATTEGFGGTMATQENFRIMRKALDDVGVELGRYIRLCNYCSGL CMPEIAAMGALERLDMMLNDALYGILFRDINMKRTLVDQFFSRIINGFAGVIINTGEDNY LTTADAIEEAHTVLASQFINEQFALVAGLPEEQMGLGHAFEMEPGTENGFLLELAQAQMA REIFPKAPLKYMPPTKFMTGNIFKGHIQDALFNIVTITTGQKVHLLGMLTEAIHTPFMSD 
RALSIENARYIFNNLKDFGNDIEFKKGGIMNTRAQEVLKKAAELLKTIETMGIFKTIEKG VFGGVRRPIDGGKGLAGVFEKDNTYFNPFIPLMLGGDR >gi|296154556|gb|ADVK01000027.1| GENE 28 20902 - 21693 1480 263 aa, chain + ## HITS:1 COG:FN1862 KEGG:ns NR:ns ## COG: FN1862 COG5012 # Protein_GI_number: 19705167 # Func_class: R General function prediction only # Function: Predicted cobalamin binding protein # Organism: Fusobacterium nucleatum # 1 263 1 263 263 511 100.0 1e-145 MSSGLYSTEKRDFDTTLDLTQIRPYGDTMNDGKVQMSFTLPVACNEKGIEAALQLARKMG FVNPAVAFSEALDKEFSFYVVYGATSFSVDYTAIKVQALEIDTMDMHECEKYIEENFGRE VVMVGASTGTDAHTVGIDAIMNMKGYAGHYGLERYKGVRAYNLGSQVPNEEFIKKAIELK ADALLVSQTVTQKDVHIENLTNLVELLEAEGLRDKIILIAGGARITNDLAKELGYDAGFG PGKYADDVATFILKEMVQRGMNK >gi|296154556|gb|ADVK01000027.1| GENE 29 21716 - 22339 716 207 aa, chain + ## HITS:1 COG:FN1861 KEGG:ns NR:ns ## COG: FN1861 COG1279 # Protein_GI_number: 19705166 # Func_class: R General function prediction only # Function: Lysine efflux permease # Organism: Fusobacterium nucleatum # 1 207 1 207 207 345 100.0 4e-95 MEKYLQGFLMGLAYVAPIGVQNLFVINSAITQKRSKALLIALIVIFFDVTLAFACFFGIG LLIDKLEWLKLIILLVGSLVIIYIGQGLLRSKSELKKNDNMDIPLLKAITSACVVTWFNP QAIIDGTMMLGAFRATLSSEAGIYFILGVTSASFCWFMGLSIFISLFSHKFNDKVLKVIN IVCGLVIIFYGVKLLLNFYKMFIHYIY >gi|296154556|gb|ADVK01000027.1| GENE 30 22475 - 24124 2081 549 aa, chain + ## HITS:1 COG:FN1860 KEGG:ns NR:ns ## COG: FN1860 COG1757 # Protein_GI_number: 19705165 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 25 549 1 525 525 930 99.0 0 MNNKIFLISHCINKVKKLYYGGFGMFKRLWYGLQNFGSISLPGKVGVVIGVLIILGILYG IVTSKKFRDAFLKLSPVIILAELMMDEFDALLAAPIATIYASFVAMLLTKQKFNGIVDHA IDNVKEIQVALFILMAAYAMAEAFMSTGVGASLILIALKVGITAKTVAVVGAIVTSILSI ATGTSWGTFAACAPIFLWLNHIVGGNILLTTAAIAGGACFGDNIGLISDTTIVSSGIQKV EVVRRIRHQGVWSALVLLSGIIVFAIAGFTMDLPSTVGDPAEAINSIPADVWTALAEKRE SAVKLLEQVRSGVPLYMAIPLIIVLALAFMGTQTFICLFAGLFFAYIFGMMAGTVTSTMD YLDMMMGGFASAGSWVIVMMMWVAAFGGIMKSMNAFEPISKLLSRISGSVRQLMFYNGLL 
CVFGNATLADEMAQIVTIGPIIKEMVEDNVEGSEEDLYTLRLRNATFSDAMGVFGSQLIP WHVYIAFYMGIATIVYPLHEFVAIDIIKYNFIAMVAVASILILTLTGLDRFIPLFKLPSE PAVKLKKQK >gi|296154556|gb|ADVK01000027.1| GENE 31 24229 - 24315 113 28 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNKKTKNIFTNIKKCSILSSMGDVYMGL >gi|296154556|gb|ADVK01000027.1| GENE 32 24375 - 25481 1366 368 aa, chain + ## HITS:1 COG:no KEGG:FN1859 NR:ns ## KEGG: FN1859 # Name: not_defined # Def: major outer membrane protein # Organism: F.nucleatum # Pathway: not_defined # 1 368 1 368 368 706 99.0 0 MKKLALVLGSLLVVGSVASAKEVMPAPTPAPEKVVEYVEKPVIVYRDREVAPAWRPNGSV DVQYRWYGNVENRTPKKEDPASPWLGDNVNAGRLQTLTKVNFTEKQTLEIRTRNYHTLMN PKDSQAADDQVRVRHFYKFGKLGSSKIDVTSRLEYKQNNGDAGRKQAEASVLFDFADYIY SNNFFKADKFGFRLGYQHKWAGHNSGVVGQPFNKGTQDNYFINFESEYTLPWGFSAELNA YNYYNVHNKKFATYNKGDKKSQFYGEIEAYLYQHTPLYKTNNVELSFDFEGGYDPYTWHQ YKVVSAKDSNKYEVYMLPTLQVSYKPTDFVKLYAAAGAEYRNWAVTAESKAKNWRWQPTA WAGMKVTF >gi|296154556|gb|ADVK01000027.1| GENE 33 25478 - 25552 143 24 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MRLLTVFILLINNFLLILTLAKLD >gi|296154556|gb|ADVK01000027.1| GENE 34 25530 - 25661 122 43 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MKTVSSLTVFYFYKSFKFLNIYFINKLYKLCFKTYISFTKILY >gi|296154556|gb|ADVK01000027.1| GENE 35 25749 - 27125 2103 458 aa, chain + ## HITS:1 COG:FN1858 KEGG:ns NR:ns ## COG: FN1858 COG2031 # Protein_GI_number: 19705163 # Func_class: I Lipid transport and metabolism # Function: Short chain fatty acids transporter # Organism: Fusobacterium nucleatum # 1 458 1 458 458 827 99.0 0 MESVKERKGIFKRFTSMCVRVMERWLPDPFIFCALLTFIVFIGAIFLTKATPLQVVGFWA DGFWSLLSFSMQMALVLVTGHTLASSRLFKKMLSTFASGIKGPKQAILIVSIVSGIACAL NWGFGLVIGALFAKEIAKKVKGVDYRLLIASAYTGFLVWHGGLSGSIPLQLASGGEGLAK QTAGVVTEAIPTSQTLFSPMNIFIIVGLLIIVPLLNMAMFPSKDEIVEVDPKLLVKPEEV VMDTSKMTPAERVENSSAVSILLSIMGFVYIGYYLKTKGFALNLNLVNFIFLFLGILLHG TPRRYLNALAEATKGAGGILLQFPFYAGIMGIMVGADADGMSLAKLMSNFFVSISTEKTF PVFSFISAGVVNFFVPSGGGQWAVQAPIVMPAGQAIGVSAAKSAMAIAWGDAWTNMIQPF WALPALGIAGLGAKDIMGYCLIVTVVSGIFICTGFLLF 
>gi|296154556|gb|ADVK01000027.1| GENE 36 27204 - 27857 1176 217 aa, chain + ## HITS:1 COG:FN1857 KEGG:ns NR:ns ## COG: FN1857 COG1788 # Protein_GI_number: 19705162 # Func_class: I Lipid transport and metabolism # Function: Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit # Organism: Fusobacterium nucleatum # 1 217 1 217 217 394 100.0 1e-110 MKQKIVSMEEAISHVKDGMTVHIGGFIACGTPESIITALIEKGVKDLTIVANDTGLIDKG IGRLVVNNQVKKVIASHIGTNPETGRRMQSGEMEVELVPQGTLAERVRAAGYGLGGILTP TGLGTIVQEGKQIINVDGKDYLLEKPIKADVALIFGTKVDELGNVICEKTTKNFNPLMAT AADVVIVEALEIVPAGSLSPEHLDISRIFIDYIVKSK >gi|296154556|gb|ADVK01000027.1| GENE 37 27875 - 28528 1160 217 aa, chain + ## HITS:1 COG:FN1856 KEGG:ns NR:ns ## COG: FN1856 COG2057 # Protein_GI_number: 19705161 # Func_class: I Lipid transport and metabolism # Function: Acyl CoA:acetate/3-ketoacid CoA transferase, beta subunit # Organism: Fusobacterium nucleatum # 1 217 1 217 217 405 100.0 1e-113 MEMDKNLVREVIAKRVAQEFHDGYVVNLGIGLPTLVANYVGDMDVIFQSENGCIGVGPAP EKGKEDPYLVNAGAGFITAAKGAMFFDSAYSFGIIRGGHVDATVLGALEVDEKGNLANWM IPGKKVPGMGGAMDLVVGAKKVIVAMEHTSNGAIKILKECKLPLTAVGVVDLIITEKAVF EVTDKGLVLKEITPYSSLEDIKATTAADFIIADDLKK >gi|296154556|gb|ADVK01000027.1| GENE 38 28565 - 30022 1411 485 aa, chain - ## HITS:1 COG:FN1855 KEGG:ns NR:ns ## COG: FN1855 COG4865 # Protein_GI_number: 19705160 # Func_class: E Amino acid transport and metabolism # Function: Glutamate mutase epsilon subunit # Organism: Fusobacterium nucleatum # 1 485 1 485 485 916 100.0 0 MPITFKKIDKEDFLEIRKNFLENYKNLDDFDLNTAIRFHKSLPDHKNFQKKIEQSVQDNK IMTQAHSKETLLEDLIKNLNTFYRVGQADFLSIIIDSHTRENHYDNAKVILEDSIKSNKS LLNGFPLINYGTKLARKIINDVEVPLQIKHGSPDARLLVEVALLSGFSAFDGGGISHNIP FSKSISLKDSLENWKYVDRLVGIYEENGIKINREIFSPLTATLVPPAISNSIQILETLLA VEQGVKNISIGVAQYGNITQDIASLLALKEHIQFYLDTFSFKDINISTVFNQWIGGFPEE ELKAYSLISYSTTIALFSKTNRIFVKNIDEYAKNSLGNTMINSLLLTKTILDIGNNQKIN NYEEIIFEKEQIKKETAQIIAKIFSRCDGDLRKAIIEAFEYGVLDVPFAPSKYNLGKMMP ARDSEGMIRYLDIGNLPFCPLIEEFHNKKIKERSMKENREINFQMTIDDIFAMSQGKLIN KKSRE 
>gi|296154556|gb|ADVK01000027.1| GENE 39 30038 - 31426 1884 462 aa, chain - ## HITS:1 COG:no KEGG:FN1854 NR:ns ## KEGG: FN1854 # Name: not_defined # Def: methylaspartate mutase (EC:5.4.99.1) # Organism: F.nucleatum # Pathway: Metabolic pathways [PATH:fnu01100] # 1 462 1 462 462 855 99.0 0 MSTRIYLTIDFGSTYTKLTAIDLNKKEIVATSRAMTTVKTDVLIGFNEAFEILENDLKNN LKSYEIIKKVACSSAAGGLKIIAIGLVPELTTEAAKKAALSSGARVIKTYAFNLTDKDIE EISELPHDMLLLTGGTNGGNREYILNNAKILAKNNIEKPIVVAGNEEVSEQIAKIFKEHN IEFYTTENVMPVVNKINVIPVKEVIREVFMRNIVKAKGMENVQKIIGDIIMPTPTAVMKA AEIFSKDDNNSIVIDIGGATTDVHSIGKGLPKTNDIQLKGMEEPYSKRTVEGDLGMRYSS LALYEAASLNKIREYLGSKDSKINIRENFKFRQENTDFVAETEDDIVFDEMMAMLCTEIA MNRHVGTLESIFSPMGTLFVQNGKDLTDVKYVIGTGGILNNSRNPRKILDLTLFNEDNPL LLKPKYPKFLVDKTYIMSAMGLLANDYPDIAYQIMKTYLVEI >gi|296154556|gb|ADVK01000027.1| GENE 40 31444 - 31854 565 136 aa, chain - ## HITS:1 COG:FN1853 KEGG:ns NR:ns ## COG: FN1853 COG2185 # Protein_GI_number: 19705158 # Func_class: I Lipid transport and metabolism # Function: Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) # Organism: Fusobacterium nucleatum # 1 136 1 136 136 260 100.0 4e-70 MAKKKIVIGVIGSDCHTVGNKIIHNKLEESGFDVVNIGALSPQIDFINAALETNSDAIIV SSIYGYGELDCQGIREKCNEYGLKDILLYIGGNIGSSNEKWENTEKRFKEMGFDRIYKPG TPIEETIIDLKKDFKI >gi|296154556|gb|ADVK01000027.1| GENE 41 31975 - 32379 612 134 aa, chain - ## HITS:1 COG:no KEGG:FN1852 NR:ns ## KEGG: FN1852 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 9 134 1 126 126 234 100.0 1e-60 MKKFLFALMLLGAVSAFAGQYPDGMYRGVYVSGQETQVEVQYELKNDVITSIKYRTLFYK GHDWLKEDEYVAKNGGYLKLLERITNKKIQDVLPTMYNSEEIEKGGATVRESKVRSALQY GLNVGPLKLAKKTK >gi|296154556|gb|ADVK01000027.1| GENE 42 32489 - 33793 321 434 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|162456259|ref|YP_001618626.1| putative ribosomal protein [Sorangium cellulosum 'So ce 56'] # 237 434 1 204 207 128 40 1e-28 MLREDGRNFNEERKIKITKNVNIYAEGSVLIEVGNTKVICTASVSDKVPSFLRGTGKGWV 
TAEYSMLPRATNERNPREASKGKLTGRTVEIQRLIGRALRASIDLEKLGERLITIDCDVI QADGGTRTTSITGGYIALALAIKKLLQEEILEEDPLISNVAAISVGKINSNLMVDLKYSE DFAAEVDMNVIMNKKDEFIEVQGTGEESTFTRAELNQLLDLAENSIKRLIELQDRIINQE DLKIFLATANKHKIEEISDIFSGIENIEILSIKDGIEIPEVIEDGDTFEANSKKKAVEIS KFLNMITIADDSGLCVDALNGEPGVYSARYSGTGDDLKNNEKLINNLQGIENRNAKFVSV ITLAKPNGEVYSFRGEIQGKIVDTPRGNTGFGYDPHFYVEEYQKTLAELPELKNKISHRA KALEKLKKELKNIL >gi|296154556|gb|ADVK01000027.1| GENE 43 33838 - 34767 1198 309 aa, chain - ## HITS:1 COG:FN1850 KEGG:ns NR:ns ## COG: FN1850 COG0332 # Protein_GI_number: 19705155 # Func_class: I Lipid transport and metabolism # Function: 3-oxoacyl-[acyl-carrier-protein] synthase III # Organism: Fusobacterium nucleatum # 1 309 1 309 309 568 93.0 1e-162 MRKIQFIGYGVELPKNTVNFKEQIRYRISGDEKQISLAVSACQKALKNANITIDDIDCIV SASAVGIQPIPCMAALIHEKIAKGTSIPALDINTTCTSFITALDTMSYLLETGRYKRVLI VSCDVASRALNPKQKESFQLFSDGAVAFIVEKTDKEIGVIDAMQKTWSEGAHSTEIRGGL SNFHPKNYTESTKEEFMFDMNGKTILSLCMKKVPKMMEEFLENNNMKISDIDMLIPHQAS VAMPLIMEKLGVPKGKYINEVKEFGNMVSASVPMTLAHGLEKQKIKNGDIILLIGTAAGL TTNIMLIKI >gi|296154556|gb|ADVK01000027.1| GENE 44 34764 - 36038 1228 424 aa, chain - ## HITS:1 COG:FN1849 KEGG:ns NR:ns ## COG: FN1849 COG1541 # Protein_GI_number: 19705154 # Func_class: H Coenzyme transport and metabolism # Function: Coenzyme F390 synthetase # Organism: Fusobacterium nucleatum # 1 424 1 424 424 726 92.0 0 MKKIFKIILTFIKVRYFSKWTSRDKLLKYQKKQVEKHLKFLKENSPYFKTHKITEDFTMN KAFMMDNFDELNTLGVKKNEAMEIALNSEKTRNFNQKYKNISVGLSSGTSGHRGMFITTP EEQGIWAGTILAKMLPKNNIFGHRIAFFLRADNDLYKTINSFLISLEYFDTFKDIDEHIE RLNKYQPTMMVAPPSLLLILSKKIEEGELKVSPKRVISVAEILEKPDEEYIKKQFKLNII HQIYQATEGFLACTCEYGHLHLNEDLIKFEKKYIDEKRFYPIITDFSRTSQPFVNYYLND ILVESTEPCECGSVLQRIEKIEGRSDDIFKFINKSDKEVIVFPDFIRRTILFVENIREYQ VFQTSNNLLEVAILNITEEQKELIRKEFNKLFTSLEIENIKIKFIDYKIDKTKKLKRIVR KVGE >gi|296154556|gb|ADVK01000027.1| GENE 45 36217 - 37035 864 272 aa, chain - ## HITS:1 COG:FN1848 KEGG:ns NR:ns ## COG: FN1848 COG0491 # Protein_GI_number: 19705153 # Func_class: R General function prediction only # 
Function: Zn-dependent hydrolases, including glyoxylases # Organism: Fusobacterium nucleatum # 8 270 1 263 263 469 93.0 1e-132 MLNTAEKMIEKVDYFACGYCTNDLKRVFKGFDKTIVNFYAGVFLIKHKKLGYILYDTGYS MDILKNNLKYFLYRFANPITLKKEDMIDYQLKEKGIDKEDIKYIIISHLHPDHIGGLKFF PNSYLILTKTCYNNFKLKKDSLLIFNELLPSNFEDRLILIDDYKDNSLFPYKNSFDLFSD LSMLIVEVNGHTKGQACLFLPDNNLFIAADVCWGTEFLPFTDKMKWLPRKIQNNFEDYKK GSELLKKLIENNISVIVSHDKKEKIFNILENQ >gi|296154556|gb|ADVK01000027.1| GENE 46 36998 - 37981 1092 327 aa, chain - ## HITS:1 COG:FN1847 KEGG:ns NR:ns ## COG: FN1847 COG0451 # Protein_GI_number: 19705152 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Fusobacterium nucleatum # 1 327 2 328 328 619 98.0 1e-177 MKVLLTGATGFLGKYVIDELKNNSYQVVAFGRNEKIGHTLIDENVEFYKGDIDNLDDLFK ASQDCSAVIHAAALSTVWGKWKDFYNVNVLGTKNVVQVCEEKNLKLVFVSSPSIYAGAKD QLDVKEDEAPKENDLNYYIKSKIMAENIIKSSKLNYMIIRPRGLFGVGDTSIIPRLLELN KKIGIPLFVDGKQKVDITCVENVAYALRLALENNQYSRQIYNITNDEPIEFKKILTLFFN EIGTEGKYLKWNYNLIFLLVSFLEIFYKFFRIKKEPPITKYTLYLMRYSQTLNIDKAKKE LGYYPKMSILEGVKKYVEHSRKNDRES >gi|296154556|gb|ADVK01000027.1| GENE 47 37981 - 38607 627 208 aa, chain - ## HITS:1 COG:no KEGG:FN1846 NR:ns ## KEGG: FN1846 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 208 1 208 208 374 99.0 1e-102 MKFKEYLKKLESLDISKTLLKEDKIVFVISGSSNLKTAALEPDRFEILNIFKDFGYKIIN SNFPYNEDFEHSEFEDINILKASLSNIIYYPHTLFNKRFEKEILRHLEPIKSLKDVIIIS LSSGLNVWKKFMDLTNYDNENIKIFALGPVGKGYGKLKNTIVFKGIFDIYSWLLDFHKAD KIVNCGHLGYFKDRKVKEIIQELMKGVK >gi|296154556|gb|ADVK01000027.1| GENE 48 38597 - 39763 750 388 aa, chain - ## HITS:1 COG:no KEGG:FN1845 NR:ns ## KEGG: FN1845 # Name: not_defined # Def: ceramide glucosyltransferase (EC:2.4.1.80) # Organism: F.nucleatum # Pathway: Metabolic pathways [PATH:fnu01100] # 22 388 14 380 380 612 99.0 1e-173 MTILFNTLLTLTIILLILKLIFSFIYFQKINSLEKSKIDESKYTIVQPILSGDPRLEEDL 
TANLKNTTDMEFIWLVDKSDKIAIQTAEKILKNKNYSNRIEIYYLDDVPQEVNPKIFKLE QVVDKIKTEYTIILDDDSVIDRKRLDELSIYEKDKTEWIATGIPFNYNIRGFYSKLISAF INSNSIFSYFSLSFLKENKTINGMFYILRTDILKKYSAFENIKYWLCDDLALATYLLSKD VKIIQSTIFCNVRNTVPSFKRYILLMKRWLLFSNVYMKNAFSIKFLFIILLPTLLPTILL FLSFYLGINYLVLTLNLFIGKVALFYITRIFIYEPFRISSSQTKELLYELLSEFLLPFML IYTLLTPPVILWRNKKIRVKDGKIHYEI >gi|296154556|gb|ADVK01000027.1| GENE 49 39760 - 40533 768 257 aa, chain - ## HITS:1 COG:FN1844 KEGG:ns NR:ns ## COG: FN1844 COG0300 # Protein_GI_number: 19705149 # Func_class: R General function prediction only # Function: Short-chain dehydrogenases of various substrate specificities # Organism: Fusobacterium nucleatum # 1 257 1 257 257 459 100.0 1e-129 MEKILITGASSGIGEELTRNLANKSKKLFLLARSLDKLNLLKKELEEKFSSLECVCIKYD LTDINNLENIVENCDVDLVINCAGFGKITDFSKLSDKEDLDTINVNFISPLILTKKFSEK FLQKGQGTILNICSTAALYQHPYMAVYSSAKSALLHYSLALDEELSHKNKNVRVLSVCPG PTASNFFEKDIQEKFGSSQKFMMSSEDVAKRIIKVIENKKRFSIIGFRNKLSIFLINLLP ISLQLKLVGLILKKVIK >gi|296154556|gb|ADVK01000027.1| GENE 50 40929 - 42380 1561 483 aa, chain - ## HITS:1 COG:MA4289 KEGG:ns NR:ns ## COG: MA4289 COG3291 # Protein_GI_number: 20093078 # Func_class: R General function prediction only # Function: FOG: PKD repeat # Organism: Methanosarcina acetivorans str.C2A # 181 322 672 812 1734 85 33.0 2e-16 MEEKDIKLYKDIFYYINEDGKTLTVVGFKNSTKVAEVPDIIEGMKVTHMTAEFPTYRCTS LKEITISSGITLERRLFALNHSIEKVFLKSGVTLCGQVFEYSDLKEIKMEKGVKINDIPT LFRVERNKKYYGNFLFNSYPISKNPNAPRIWKECFEENLKRSDLLDKEGYEFQDMQNSII FAGCESLTEIEIPEGIEILPNGTFYACTSLCRVILPKTLKVIGTLCFSSCSNLVNIVIPE GVVAIGEDAFGGCTNLETVVLPSTLTHIVGNPFTKCKSLKRIVVPEGMLGLDGSEELADE IFRYVSTFTVGYTSFNKKDFYSKKTWKQIFKSKLVSLITTKKISEKELKEKEDLAIKKLK EQGEFDKIKIYIDEFMEYFLENLSSSNYKKKDISLLKKKIITYIDSLQKISKNNYSNEEM ILKEVKELVEYINEFNKKFQAIETEDREEICEIIDKCTILAGYPYPNPQIQSDFDITYEW RKW >gi|296154556|gb|ADVK01000027.1| GENE 51 42416 - 42820 721 134 aa, chain - ## HITS:1 COG:FN1842 KEGG:ns NR:ns ## COG: FN1842 COG3412 # Protein_GI_number: 19705147 # Func_class: S Function unknown # Function: Uncharacterized 
protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 134 3 136 136 226 97.0 7e-60 MVGFVVVSHSKELAEAAIHLANEMKRYDFPLINGSGTDGNFLGSNPLIIKEAILKAKTDK GALIFVDIGSSVLNTQVAIDFLADEGVDIENIKIADAPLVEGLIAGVAVNDEKADIESVL DELKELKTFSKLTY >gi|296154556|gb|ADVK01000027.1| GENE 52 42830 - 43438 965 202 aa, chain - ## HITS:1 COG:FN1841 KEGG:ns NR:ns ## COG: FN1841 COG2376 # Protein_GI_number: 19705146 # Func_class: G Carbohydrate transport and metabolism # Function: Dihydroxyacetone kinase # Organism: Fusobacterium nucleatum # 1 202 1 202 202 365 99.0 1e-101 MYLEIIKKISDEIIKNEEYLTELDREIGDGDHGVNLARGFTEIKNQLNNFKDLPVSDIFT KMGMILLTKVGGASGAIYGTAFMSAGTYLKGKTEFNNQILLETLNSMIEGIQKRGKAVLN EKTMLDTIMPTYNFLEKSFNEGKNLKDIKNEVIEVAKNSMEATKDIIATKGRASYLGERS VGHIDPGAMSSYLMIKTVCENI >gi|296154556|gb|ADVK01000027.1| GENE 53 43474 - 44460 1681 328 aa, chain - ## HITS:1 COG:FN1840 KEGG:ns NR:ns ## COG: FN1840 COG2376 # Protein_GI_number: 19705145 # Func_class: G Carbohydrate transport and metabolism # Function: Dihydroxyacetone kinase # Organism: Fusobacterium nucleatum # 1 328 5 332 332 602 99.0 1e-172 MKKLINDKNNIVEEVVQGMIKAFPDKVSRVENEPIIIRKNKKVNKVALISGGGSGHEPAH AGYVGYGMLDAAVCGEIFTSPGADKVYNAIKAVDTGKGVLLIIKNYSGDVMNFEMAGEMA QAEGITVKQVVVDDDIAVENSTYTVGRRGIAGTIFVHKILGAAAEKGYDLDKLVELGNKV VKNLKTMGMSLKPCTVFTTGKESFEIADDEVEIGLGIHGEPGTHREKMTTANEFTKKLFE KIYAESNVQNGDRFAVLVNGLGETTLIELFIINNHLQDLLKDKRIEVAKTLVGNYMTSLD MGGFSISLLKLDKEMEELLNAEEDTIAF >gi|296154556|gb|ADVK01000027.1| GENE 54 44580 - 46073 2422 497 aa, chain - ## HITS:1 COG:FN1839 KEGG:ns NR:ns ## COG: FN1839 COG0554 # Protein_GI_number: 19705144 # Func_class: C Energy production and conversion # Function: Glycerol kinase # Organism: Fusobacterium nucleatum # 1 497 1 497 497 981 99.0 0 MKYIVALDQGTTSSRAILFDESQNIIGVAQKEFTQIYPNEGWVEHDPMEIWSSQSGVLSE VIARAGISQHDIIALGITNQRETTIVWDKNTGKPVYNAIVWQCRRTAKICDELKEIEGFS DYVKDNTGLLVDAYFSGTKIKWILDNIEGARERAEKGELLFGTVDTWLIWKLTNGKIHAT DYTNASRTMLYNIKELKWDEKILETLNIPKSMLPEVKDSSGTFGYANLGGKGGHRIPIAG 
VAGDQQSALFGQACFEEGESKNTYGTGCFLLMNTGEKFVKSNNGLITTIAIGLNGKVQYA LEGSVFVGGASVQWLRDELKLISDSKDTEYFARKVKDSAGVYVVPAFVGLGAPYWDMYAR GAILGLTRGANKNHIIRATLESIAYQTKDVLKAMEEDSGIKLNGLKVDGGAAANNFLMEF QADILGESVKRPTVLETTALGAAYLAGLAVGFWENKNEIKQKWVLDKEFTPNMSKEERDK KYAGWLKAVERTKKWEE >gi|296154556|gb|ADVK01000027.1| GENE 55 46092 - 46823 1341 243 aa, chain - ## HITS:1 COG:FN1838 KEGG:ns NR:ns ## COG: FN1838 COG0580 # Protein_GI_number: 19705143 # Func_class: G Carbohydrate transport and metabolism # Function: Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) # Organism: Fusobacterium nucleatum # 1 243 12 254 254 387 99.0 1e-108 MSNMSLYIGEFVGTTLLLLLGNGVNMTCSLKHSYGKGGGWIVTTFGWGFAVMIPAYVTGW VSGAHMNPALTIALAVTGKFSRDLVFGYIIAQMLGGILGATLAYLTYKAQMDAEPEPAVK LGVFSTGPSIDAPIWNIVTEIIGTALLLIGVLAIGYGEVGIQPGNGAFFVGLLIVIIGMA TGGATGYAINPARDLGPRIAHAILPIKGKGDSNWKYSWIPVVGPIIGGILGAVIFDAFLS AVL >gi|296154556|gb|ADVK01000027.1| GENE 56 47140 - 47670 586 176 aa, chain + ## HITS:1 COG:FN1837 KEGG:ns NR:ns ## COG: FN1837 COG1852 # Protein_GI_number: 19705142 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 176 1 176 176 299 100.0 2e-81 MKKFYISLLKSLLYIAFIVTTKFKKQKLNDYFSRKFLEINNDYVLKKIKKKLNGKILILL PHCIQLYDCEYKITSDINNCRACGKCVVYDFLDIQNKYQDVEIKIATGGTLARKYVKETK PDLIIAVACKRDLISGIKDAEPFLTYGVFNKIKGEPCINTTVVMDDIYEILDEINL >gi|296154556|gb|ADVK01000027.1| GENE 57 47681 - 50491 3613 936 aa, chain + ## HITS:1 COG:FN1836 KEGG:ns NR:ns ## COG: FN1836 COG0457 # Protein_GI_number: 19705141 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 1 936 1 936 936 1392 97.0 0 MRKYLIISLLASYSIVFAGESEDFKAVDNLYKERNFKAALVESEKFLQKYPESKHQKSMR DKVAKIYFLEKNYKKSEEIFKKLFTMEEKQSQKDEYASYLARANALLNKPDAAMFYVKEI KDKKVFQKTFFAVAQNFLAKENNEAAQKAYKEIIDNKYENYKESMMGLGIVYYNLKDYDK AIYWLSEFSKEMPKENKEMVSYLKASALYRKGNTDEAISRFEELANVEPSTEYSRKAALY LIEIYSNRKDEGKVTFYLNKIRGTKEYNTAMTMIGDLYVTKENYNKALDYYGQSNDKNNP 
KLIYGEAYSLYKNGKYEEALKKFQSLKNSDYYNQSIYHIFAINYKLKNFDEIIRDREIIR KVVVSQVDTDNIIRIIANSAYQVGNYKLAKDYYGRLFAVSPDKENLFRVILLDSQMLDME DLRIRFNQYNELYSNDTEFKKDVYLYTGDAYYKAGDPERAEQIYKTYLNQYTNTEVISSL MSTLLEQKKYDEMQQYLSLAQDESSVNYLKGIAAMGLGKYDEAESEFQKVLASGDQSLST KVYLNRVRNYFLAERYNEAVQAGEQYLSKLSPDKDKVIYSEMLDKIGLSYFRLGKYDQAR SYYSKIASMKGYEAYGKFQIADSYYNEKNYEKAASLYKEVYNQFGETFYGEQAYYKYIMT LSLTGNTEAFEREKNNFMKVYPNSNLRGTIANLTTNMYIESGDTDKAIESLNNSSSNTDD VTVKETNTTKIITLKLQKKDYKDIEKYIAELPTEEERAYYSAQYYAAKKDPKVVKEYEVL LQYDKYKAYASKGLGDYYFDKKDLAKAKKYYGTYATVNKNPDEYILYRLGQANEKENNLK MALTDYKAVYSKNDKLANDAMLRAAEIYDKQENVAEAEKLFTKLYAVKNNKDLKAYSIEK LIYYKLVKNNTKEAKKLYDELKKLDAKRAQKFKAYF >gi|296154556|gb|ADVK01000027.1| GENE 58 50513 - 50890 550 125 aa, chain + ## HITS:1 COG:no KEGG:FN1835 NR:ns ## KEGG: FN1835 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 5 125 1 121 121 164 97.0 2e-39 MSKKLLALFLVLGLMAYAEDTDTVVNDTATTESSAEENVATTELTKQVTSENNQQLDVKE IDTEDLILQNQNLESSSVNITGENLKENGDKVKVNQENSATLEEELSRGVEKKGFFRRVL DKLFG >gi|296154556|gb|ADVK01000027.1| GENE 59 50918 - 51529 615 203 aa, chain + ## HITS:1 COG:FN1834 KEGG:ns NR:ns ## COG: FN1834 COG0811 # Protein_GI_number: 19705139 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport proteins # Organism: Fusobacterium nucleatum # 17 203 1 187 187 339 100.0 2e-93 MQILKAGGILMYFILLMGIIGLYAVLERFSYFLIKEKNNFSKLPSDVRQLINEGKIKEAI VALNSNKSSTSVVLKEILIYGYKENKETLSALEEKGKEKAIERIKSLERNMWLLSLAANA SPLLGLLGTVTGMIKAFNSIALNGTGDAGVLAKGISEALYTTAGGLIVAIPCMIFYNYFN KKIDLIVSDIEKTCTELLNHFRE >gi|296154556|gb|ADVK01000027.1| GENE 60 51540 - 51983 547 147 aa, chain + ## HITS:1 COG:FN1833 KEGG:ns NR:ns ## COG: FN1833 COG0848 # Protein_GI_number: 19705138 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Biopolymer transport protein # Organism: Fusobacterium nucleatum # 34 147 1 114 114 181 98.0 5e-46 
MKLERIKRRSGGTLVLEITPLIDVVFLLLIFFMLATSFDERSAFKIDLPKSTAAKTKSTL KEVQVLVDKDKNVYLRYTDNSGKSQNEKLDLTSFVSVVSEKLNNSENKDVIISADKNIDY GFIVEIMSLLKESGASAINIDTAIKSR >gi|296154556|gb|ADVK01000027.1| GENE 61 51989 - 52780 980 263 aa, chain + ## HITS:1 COG:FN1832 KEGG:ns NR:ns ## COG: FN1832 COG0810 # Protein_GI_number: 19705137 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protein TonB, links inner and outer membranes # Organism: Fusobacterium nucleatum # 25 263 1 234 234 350 96.0 2e-96 MKKNDYICLFLSIVINIAIVFVLAMFLTDETVETDEIKIGLVAVESDASTRFKGEKNVDA KKQNLEADSIEKKEENKPQEEKPEKEKPEKKVEEDKKAEKTVQVEDKPKTTPKKEKPSLA DLKKQISDSQPKTSNGGFSPSADPDGEEVVDRVLQNVTYSNGLVSGSKMGNSEDGLLVDW NDSNRAPEFPQSARASGKHGKIKIKLKVDKAGNVLSYVIVEGSGVPEIDASVERVVGSWR VKLLKKGKPVNGTFYLNYNFNFK >gi|296154556|gb|ADVK01000027.1| GENE 62 52800 - 54191 1631 463 aa, chain + ## HITS:1 COG:FN1831 KEGG:ns NR:ns ## COG: FN1831 COG2204 # Protein_GI_number: 19705136 # Func_class: T Signal transduction mechanisms # Function: Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains # Organism: Fusobacterium nucleatum # 1 463 2 464 464 810 99.0 0 MLLGFRLDNDLKLEFENNFENDLVFVENIMSFMEAIKSRKYEAIVIDERNSKEEALINLI VKVTEIQKKAVIIILGETSNWRIIAGSIKAGAYDYILKPELPRTIVRIVEKSVKDYKGLV ERVDKTKSTGEKLIGRSKLMIDLYKVIGKVANNSAPVLVTGERGTGKTSVAKAIHQFSNV YDKPLISINCNSYRANLLERKLFGYEKGSFEGAAFSQYGELEKAEGGILHLANIESLSLD MQSKILFLLEENKFLRLGGMEPINAFVRIIASTSVNLEELIGKGLFIDELYRKLKVLEID IPNLKERKEDIPFIIDHYMAECNQEMNKNIKGVTKVALKKIMRYDWPGNVNELKNAIKYA VAMCRGSSILIEDLPPNVVGEKILNGKEESKTVSIENLVKNEINQLRSKNKKRDYYFEII SKIEKEIIKQVLEITNGKKVETAEILGITRNTLRTKMNYYDLE >gi|296154556|gb|ADVK01000027.1| GENE 63 54195 - 55649 1730 484 aa, chain + ## HITS:1 COG:FN1830 KEGG:ns NR:ns ## COG: FN1830 COG2812 # Protein_GI_number: 19705135 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, gamma/tau subunits # Organism: Fusobacterium nucleatum # 1 484 1 484 484 868 100.0 0 
MHITLYRKYRPSSFSEVSGENEIVKSLKLSLKNKSMAHAYLFSGPRGVGKTTIARLIAKG VNCLNLKETGEPCNECKNCKAINEGRFSDLIEIDAASNRSIDEIRSLKEKINYQPVEGLK KVYIIDEAHMLTKEAFNALLKTLEEPPAHVMFILATTELEKILPTIISRCQRYDFKPLDL EEMKLGLEHILKEENLSMTDDVYPVIYENSSGSMRDSISILERLIVTANGEEIDLKIAED TLGITPSSRIKIFLNKLLNENEYDIINELENLANESFDIELFFKDLAKYCKNSIVKKEID IDKGLKIISTIYDVIGKFKFEDDKKLVGYVIVAEILSNTKQTVVKVVNTTQTNVNPPTSS TEEKPKDKEKVNIKLTISDVKNNWNSILAEANNKRFSYRAFLMGANPVRIEENNLYINYD KKYSFAKDLMETPEYSQEFTKIVKSFFNEDDLEIKYEVVGQKKEEENKNSEFFQKIENYF KGEN >gi|296154556|gb|ADVK01000027.1| GENE 64 55664 - 56494 706 276 aa, chain + ## HITS:1 COG:no KEGG:FN1829 NR:ns ## KEGG: FN1829 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 276 1 279 279 221 65.0 3e-56 MAIVSILMSVGTIIMYFFLSLFIPFLTYLIPKYKITKVNLYKKKYSLAINILVALILFCI SPEYLLIYLIFPYAMEFMFYLFNKIAKRMQVFNRIVLMSIVPSILISFYLYFNMDRINYI ATNLPRMTNIVEQVGIENISVLQESMVLVSSYYIFGAFFVVLLANFFLFLTLIPSTYKLW KISCYWIIPYMLILWAHKYNISVNILFENNILEIIKWIYVLYGIKVIYNLTDRIGVKSNI LKHGVSMLLGLSYPMVAFIIGALASFEFIEVKEIRM >gi|296154556|gb|ADVK01000027.1| GENE 65 56511 - 56960 729 149 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19705133|ref|NP_602628.1| 50S ribosomal protein L9P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 149 1 149 149 285 100 5e-76 MAKIQVILLEDVAGQGRKGEIVTVSDGYAHNFLLKGKKGVLATPEELQKIENRKKKEAKK QEEERNKSLELKKLLESKVLDIPVKAGENGKLFGAITSKEIASQIKEELGLDIDKKKIEA NIKNLGPDEVHIKLFTDVKAVIKVNVIAK >gi|296154556|gb|ADVK01000027.1| GENE 66 56972 - 58312 2090 446 aa, chain + ## HITS:1 COG:FN1827 KEGG:ns NR:ns ## COG: FN1827 COG0305 # Protein_GI_number: 19705132 # Func_class: L Replication, recombination and repair # Function: Replicative DNA helicase # Organism: Fusobacterium nucleatum # 1 446 1 446 446 805 100.0 0 MEFENLKKIPHSLEAERALIGGIFYNQDLFDEIRDIVNAGDFYKIEHSSIYAAIEKVYSE NKGIDGILIEEEIKKSNSKNKDEILEILSDILDEITSSYNLLEYANLIKEKAMLRRLGNV GAEITQLAYNDVRVAEEIIDEAEAKVLNLSKNILKNNIVDMKTAGVAEIMRMERVSENRG KTLGIPTGFIDLDKMTSGLNNSDLIILAARPAMGKTAFALNLALNAGKEKKNVLVFSLEM PVQQLYQRLLSIESGISQNKLKNAYLEEQDWEKLTIATGILSNSNIFVADLPHTNVLEIR SYARKMKSQKQLDLIIIDYLQLINGSGRGGSEFNRQQEISDISRALKGLARELDVPVIAL SQLSRAVESRVDKRPMLSDLRESGAIEQDADIVAFLYREEYYIPDTENKGITELIIGKHR NGATGTVKLNFLSEFTKFTNYTDQVK >gi|296154556|gb|ADVK01000027.1| GENE 67 58334 - 59557 1579 407 aa, chain + ## HITS:1 COG:FN1826 KEGG:ns NR:ns ## COG: FN1826 COG0826 # Protein_GI_number: 19705131 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Collagenase and related proteases # Organism: Fusobacterium nucleatum # 1 407 4 410 410 802 98.0 0 MKKAELLAPAGNMEKFKMALHYGADAVFMGGKMFNLRAGSNNFSDEELEEAVAYAHERGK RVYITLNIIPHNDELEALPEYVKFLEKVGVDGVIVADLGVFQVVKENSNLNISISTQASN TNWRSVKMWKDMGAKRVVLAREISLENIKEIREKVPDIELEVFIHGAMCMAISGRCLLSN YMTGRDANRGDCAQACRWKYSLVEETRPDETMPVYEDEHGTYIFNSKDLCTIEMIDKILD TGVDSLKIEGRMKGIYYVSNCVKVYKDALNSYYSGNYEYNPEWRNELESISNRSYTEGFY HGKAGVESLNYNNRNSYSQTHKLVAKIEKKLSDNEYLVAIRNKLFVGQQVQIVSPEIKVR DFIMPEMILLDKMGRETESVESANPNSFVKIKTDTPMNELDMLRIVL >gi|296154556|gb|ADVK01000027.1| GENE 68 59730 - 60185 743 151 aa, chain - ## HITS:1 COG:no KEGG:FN1825 NR:ns ## KEGG: FN1825 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 151 1 145 145 243 98.0 
2e-63 MKKIIVMIGLSLSLVACGVVSAAGSVVGGTVKAVGSVTGAVIKTTGKLIGSVIGGSDSEV KVKDTKYKFSGVELEIDQYSAVVTGKLSHNGSTKKNLRLSIPCFDKDGNKVGDAIATIDE LEKSKKWEFRAVLNEPNVASCKIKDAYITVD >gi|296154556|gb|ADVK01000027.1| GENE 69 60315 - 61931 1985 538 aa, chain + ## HITS:1 COG:FN1824 KEGG:ns NR:ns ## COG: FN1824 COG1227 # Protein_GI_number: 19705129 # Func_class: C Energy production and conversion # Function: Inorganic pyrophosphatase/exopolyphosphatase # Organism: Fusobacterium nucleatum # 1 538 1 538 538 947 99.0 0 MEEILVFGHKNPDTDSICSSITMANLREKQGSKATPCRLGEINKETKFVLDKVGIKTPKL LKTVSAQIIDLNYVEKSTVSTEDSIKEALDLMTKENFSSLPVIDKDGYFKTMLSISDIAN TYLEIDYSDLFSKYSTTYENLKEALDGEIISGVYPKGEITSNLKEVSELESMKKGDIIIT TSLTDGIDKSIQAGAKVVIVCCKKEDFISPRVTSECAIMLVRHSLVKSISLISQSISVGG ILNTEKVLFNFNKEDFLNEIRGIMKDANQTNFPVLENDGKVYGTIRTKHLIDFHRKKVIM VDHNEFSQSVEGIQDAQILEVVDHHKFANFQTNEATKIRTEPVGCTSTIVYGLYKEAKIE PDEKTALLMLSAILSDTLLFKSPTCTLRDVEVAKELGKLAKIKDIEKYGMEMLVAGTSMS KENMKEIINQDKKVFPVGDIEIAVAQINTVQIQELADRKEEIKKEVEHEIGKYGYSLFIF VVTDIINSNSLLFVYGKEIDLVQNAFKKDVVDNEVLLENVVSRKKQIIPFLMTAAQNI >gi|296154556|gb|ADVK01000027.1| GENE 70 61954 - 62169 253 71 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328317|ref|ZP_06870846.1| ## NR: gi|296328317|ref|ZP_06870846.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 71 1 71 71 86 100.0 7e-16 MKYWKEEQILLKKLIEKYCEIEDRNKLIKILEMKDRIPYKYFINEFSKLKIASKMTDEEL EEYQKKIMVNI >gi|296154556|gb|ADVK01000027.1| GENE 71 62198 - 62773 589 191 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328318|ref|ZP_06870847.1| ## NR: gi|296328318|ref|ZP_06870847.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 191 1 191 191 347 100.0 2e-94 MFNFFKKDSNSLLYDTYKPFLSDEKYIWLNAEGWYKLIHYLFDDKIKDEFNLIPIKNGCW ADAYNDTRRRVISLFHLNTSFATFKWGWNFEYIPHYTSKITWCRTDKSIYTHTFELSPKF INHKEENYTTFGKFEFKYKNNSEGFQKFVCDHLKVWDTLHEAIVEYYNATSTYEKMLKRL KENQKGGSLIF >gi|296154556|gb|ADVK01000027.1| GENE 72 62816 - 63703 765 295 aa, chain - ## HITS:1 COG:lin0491 KEGG:ns NR:ns ## COG: lin0491 COG0583 # Protein_GI_number: 16799566 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Listeria innocua # 1 270 1 269 297 160 33.0 3e-39 MTLKSLEYFKKLAELQHFTKAANELHISQPSLSYAISELEKDLEIPLFDRKDKKITLNYY GEVFLSYVNQSLSILDEGINFVKHLSNPKSRNISMGYIQSLSSSIIPPFIEEFYKNPLNK NIQFSFTQKDNNELIKIFLDRKLDIIFCVDAVKGAISQAIGEQELCFVVSKEHPLSKKEN IKLKDLENENFLLINSGTNLRKTIDKNFKKLKFNPKVTLELGQCSNILIYVEKNLGISIV PKVDIIGHPLNEKLQIINVKNLDLKRTIYVNWFPRKNENLTLKYIENFIEDFFLK >gi|296154556|gb|ADVK01000027.1| GENE 73 63992 - 65404 279 470 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148544941|ref|YP_001272311.1| 50S ribosomal protein L29P [Lactobacillus reuteri DSM 20016] # 62 413 90 435 477 112 25 7e-24 MAKELTEAQVKELDELMEKAAKAAKIIETYDQEKVDRLIRAVAWSVANKKTFLELVDMGI AESGLGDPISRQNKRFKIRGVLRDALRAKSVGIIEELPEKGIVKYAKPVGIIGALIPTTN PDLTPAGQAVYGIKARDVIIFSPHPRSKKTTFETVRLMREALEREGAPADILQCITNPSV AMTEELMKRVDLVIATGGRPMVKAAYSSGTPAYGSGAGNATMIYDETTNIDEATYNTMLS KTSDYGSGCSADGNIIIYDKIYDDVLKGLEKHGGYLANAKEREMLKKVMWDEEGHRLSNT VAIAPQKLAEAAGFTIPADRKFIVVEGDGIGKHYPFSGEKLTTLLATHKYSGEFENALDV MRAIYEVGGKGHSVGIYSFDEDHINRLALAAPVSRIMVRQPQSKANSGAATNGMPMTSSM GCGTWGGNIVSENICLKHYMNTTWVSKPIPRDMPSDEELFGEFYDPEMEK >gi|296154556|gb|ADVK01000027.1| GENE 74 65423 - 67225 2807 600 aa, chain + ## HITS:1 COG:PAB0888 KEGG:ns NR:ns ## COG: PAB0888 COG0028 # Protein_GI_number: 14521546 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] # Organism: 
Pyrococcus abyssi # 6 558 4 539 562 331 36.0 2e-90 MKKLLSEQLVDYLERRDTKYIFGLCGHTVIAMLDALEKSKNLKYISVRHEQIASTAADGY ARATGRASVVLCHLGPGLTNATTGVANASLDSIPMVVIAGDVPSYYYGKHPHQEVNMHAD ASQYEIYKPFVKRVWRIDRAEMFPEILDKAFRLAESGRPGPVLISVPMDMFSREVDTRFF ERTYLDSHETVKPSLDEEAAKKIVKALVEAKNPVIHVGGGIILARASEELKDFVEFLEIP VSRTLMGQGALSDKHPLMMGMTGFWGTYFINGKTSQADVILGLGTRFAEADSSSWYNGIT FDADKTKFMQIDIEPSEIGRNYPVSIGAEADLKSALKVLLKVAKELYPNGKKHPEIIKAI EKYKKEFKDSNKTIEEDSRYPMTPQRILKDVREVLPEDAIICTDVGWNKNGVGQQFDITQ PGTIMHPGGLATMGFGSAALLGVKLAKPDKKVITLIGDGGFGTNPSVLATAKEYNIPVVW VVMNNYAFGTIAGLEGAHYKHNFGTVFRIDNKPYNPEWSEVAKAYGIKAKKIQSADEFKE AFREAINSNEPYLLDVPMENIPVPTEGIWNINDIYTPKENVKDGVLMSGEAIRSKHVSTK >gi|296154556|gb|ADVK01000027.1| GENE 75 67320 - 67823 529 167 aa, chain + ## HITS:1 COG:SMb20443 KEGG:ns NR:ns ## COG: SMb20443 COG3090 # Protein_GI_number: 16264173 # Func_class: G Carbohydrate transport and metabolism # Function: TRAP-type C4-dicarboxylate transport system, small permease component # Organism: Sinorhizobium meliloti # 9 161 32 183 190 58 28.0 4e-09 MLKALDDYLEETILLILLVLMTCIMGIQIVSRYVFQNSLTWSEELVRYMFVWSAFLGIPF CIKHGLSIKVDQFRNLFPIPLQKALMYIDKIIIFLLFLVLFIYSFSVVKASYLSGQTSPA MQLPIWMVQISVTVSSLLSMIRSIQNFLNLVRGRIKLEQKDGVLYQK >gi|296154556|gb|ADVK01000027.1| GENE 76 67836 - 69128 700 430 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|149195935|ref|ZP_01872991.1| Ribosomal protein L16 [Lentisphaera araneosa HTCC2155] # 5 428 7 424 432 274 33 1e-72 MTIGVLFLIFIILIVVGIPIGMVLAISGILPNMFDSMFPANVPYIIRSMINGVDSFPILA VPMFILSGNIMAKGKISEKLFNFFAYFIGNLTAGLPIATIVTCLFYGAISGSGPATTAAV GAMTIPILVGRGYDKTFCTSLVAVAGGLGVIIPPSIPFIFYGQSSGASVGNLFIAGVFPG ILIGACLMLYSWYYCKKNGEDKESLANYTKEMRKNGFLKLFLNSFWALLSPVIILGSIYS GIASPTEAAVISVFYSLIVAGFIYRTISFKDIKETLVESVKTYSSILFIIAAAIGFARIL TYYEAPEIIAEAISSTVSSKIGFLIIVNIILLFVGMIMDTTPAILILTSIFLPTAQAYGI NPIHFGVIMVVNLAIGFVTPPLGVNLFVASSISGVPIEKIVHKAVPFIVSFIIALLIITF IPSVSLVLIK >gi|296154556|gb|ADVK01000027.1| GENE 77 69143 - 70165 312 340 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|149199369|ref|ZP_01876406.1| 
Ribosomal protein L22 [Lentisphaera araneosa HTCC2155] # 60 333 73 344 346 124 29 1e-27 MKNSIKSLLIGMMVILMLFVSCSKDGKTDSGAKIYTVYLACDSPEDTVTYILLDKFASLM EEKSNGRIKARRYANAKLGGDIEIIEALQHGNITFVVQNTAPQVNFVSELGVFDLPMAFP NIEVARKVLDGPLLDKLKEYHVKQNIKLYGYADQGFREMTSNKKIEKIEDLKGVKIRTMS NPNHIEFWKAVGANPTPMNFGELYIGLQQGVVVAQENPIEATVAAKLYEQQKYVVMTNHV IHALSLIGSPAVIDKIPEDLQKIIDESAKEAIEYARKIADERVEGRLKIVRNSGTEIIEF NQQLYDDMKAAGQPLYDSISNKIGKDLVDLLTEEVKKTSN >gi|296154556|gb|ADVK01000027.1| GENE 78 70186 - 70516 467 110 aa, chain + ## HITS:1 COG:lin2338 KEGG:ns NR:ns ## COG: lin2338 COG0169 # Protein_GI_number: 16801401 # Func_class: E Amino acid transport and metabolism # Function: Shikimate 5-dehydrogenase # Organism: Listeria innocua # 1 110 1 108 289 110 49.0 6e-25 MSNRIQGTTGLIGLIGDPLKHSRSPHMHNSAFDKLGLDYVYLCFEVPKGELKSGIEALKT FSAKGSNITFPHKQDVLKYLDDISEDAKIIGSVNTIKIDSKTKKITGYNT Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:36:08 2011 Seq name: gi|296154553|gb|ADVK01000028.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00033, whole genome shotgun sequence Length of sequence - 1683 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 48 - 107 9.8 1 1 Op 1 2/0.000 + CDS 148 - 888 489 ## COG3666 Transposase and inactivated derivatives 2 1 Op 2 . 
+ CDS 932 - 1630 873 ## COG3666 Transposase and inactivated derivatives Predicted protein(s) >gi|296154553|gb|ADVK01000028.1| GENE 1 148 - 888 489 246 aa, chain + ## HITS:1 COG:FN1511 KEGG:ns NR:ns ## COG: FN1511 COG3666 # Protein_GI_number: 19704843 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 245 1 245 493 423 100.0 1e-118 MIKPINNNKYFKFFQPKLFYINNDIDNDDPVRLLSAILEEMDFSNLLQVFPNKTKVHPVN MFAVIIYAYSQGKYSTRDIEFLCRDSQRTQYLLNSLNVPSYSTISRFLSKASDIIYELFC QFVEKLFKLSEIPTETIYIDGTKIEAYANKYSFVWKKSTLKYKEKLEENILELIDEFNKY FNKEKELDNIFDIFSYLKKLKIQKIYGRGKRKSKEQLFLEKAQSYVEKFNKYTNYLEILG ERNSFF >gi|296154553|gb|ADVK01000028.1| GENE 2 932 - 1630 873 232 aa, chain + ## HITS:1 COG:FN0372 KEGG:ns NR:ns ## COG: FN0372 COG3666 # Protein_GI_number: 19703714 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 232 262 493 493 399 99.0 1e-111 MRNGQLKPGYNLQIGVISEYIASYEIFHNPADTKTLIPFLEKIRSQNIEIKNVVADAGYE SFPNYEYLEKNNYVSYIKPIYYEKSKTRKYQKNLNRVENLEYDEKENRLFRKDGLELEFQ YYGEDGKTIYFKNPETEKIIKYNNEFRRLSKKSKDNIESDLGKQLRMNRSIQVEGAFAVL KEDMKLRKLKVRGKNSTKREIGLFCIAYNFNKYLAKLSRKKQGVVLHPLKTA Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:36:20 2011 Seq name: gi|296154509|gb|ADVK01000029.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00034, whole genome shotgun sequence Length of sequence - 45443 bp Number of predicted genes - 43, with homology - 42 Number of transcription units - 18, operones - 10 average op.length - 3.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 2 - 5554 7057 ## COG3210 Large exoproteins involved in heme utilization or adhesion 2 1 Op 2 . + CDS 5554 - 6036 528 ## FN2115 hypothetical protein + Term 6059 - 6095 5.9 + Prom 6197 - 6256 6.5 3 2 Op 1 . + CDS 6288 - 7160 941 ## gi|296328333|ref|ZP_06870860.1| hypothetical protein HMPREF0397_1053 4 2 Op 2 . 
+ CDS 7157 - 7657 573 ## FN2115 hypothetical protein + Term 7680 - 7716 4.3 + Prom 7670 - 7729 3.9 5 3 Op 1 . + CDS 7882 - 8799 1149 ## gi|296328335|ref|ZP_06870862.1| conserved hypothetical protein 6 3 Op 2 . + CDS 8796 - 9293 696 ## FN2115 hypothetical protein + Term 9312 - 9349 6.6 + Prom 9296 - 9355 4.6 7 4 Op 1 . + CDS 9375 - 9653 333 ## gi|296328337|ref|ZP_06870864.1| conserved hypothetical protein 8 4 Op 2 . + CDS 9650 - 9820 173 ## gi|296328338|ref|ZP_06870865.1| conserved hypothetical protein 9 4 Op 3 . + CDS 9858 - 10142 238 ## FN2115 hypothetical protein + Term 10165 - 10201 5.9 + Prom 10155 - 10214 5.7 10 5 Tu 1 . + CDS 10409 - 11077 866 ## gi|296328340|ref|ZP_06870867.1| hypothetical protein HMPREF0397_1060 + Term 11098 - 11134 5.9 + Prom 11319 - 11378 7.3 11 6 Op 1 . + CDS 11504 - 11818 293 ## FN2115 hypothetical protein 12 6 Op 2 . + CDS 11848 - 12330 511 ## FN2115 hypothetical protein 13 6 Op 3 . + CDS 12361 - 13020 912 ## Sterm_1092 hypothetical protein 14 6 Op 4 . + CDS 13039 - 14781 1851 ## COG2831 Hemolysin activation/secretion protein 15 6 Op 5 . + CDS 14793 - 16613 2137 ## COG2849 Uncharacterized protein conserved in bacteria + Term 16617 - 16650 4.0 - Term 16605 - 16638 4.0 16 7 Op 1 4/0.000 - CDS 16641 - 17630 1690 ## COG1087 UDP-glucose 4-epimerase 17 7 Op 2 4/0.000 - CDS 17630 - 19159 1765 ## COG4468 Galactose-1-phosphate uridyltransferase 18 7 Op 3 1/1.000 - CDS 19159 - 20328 1672 ## COG0153 Galactokinase - Prom 20380 - 20439 13.8 - Term 20388 - 20439 13.2 19 8 Tu 1 1/1.000 - CDS 20464 - 21981 1693 ## COG1288 Predicted membrane protein - Prom 22224 - 22283 15.0 - Term 22209 - 22265 2.8 20 9 Op 1 . - CDS 22293 - 23777 2313 ## COG3333 Uncharacterized protein conserved in bacteria 21 9 Op 2 . 
- CDS 23798 - 24241 354 ## FN2104 hypothetical protein - Prom 24285 - 24344 7.5 - Term 24267 - 24318 7.2 22 10 Op 1 1/1.000 - CDS 24349 - 25326 1473 ## COG3181 Uncharacterized protein conserved in bacteria 23 10 Op 2 2/0.000 - CDS 25406 - 27301 2309 ## COG0488 ATPase components of ABC transporters with duplicated ATPase domains 24 10 Op 3 2/0.000 - CDS 27327 - 28232 959 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily 25 10 Op 4 1/1.000 - CDS 28242 - 29735 1624 ## COG1404 Subtilisin-like serine proteases - Prom 29775 - 29834 7.8 - Term 29802 - 29848 5.8 26 11 Tu 1 . - CDS 29856 - 30233 332 ## COG2832 Uncharacterized protein conserved in bacteria - Prom 30375 - 30434 9.7 + Prom 30200 - 30259 9.2 27 12 Tu 1 . + CDS 30382 - 31155 1288 ## COG0489 ATPases involved in chromosome partitioning + Term 31301 - 31352 2.4 + Prom 31782 - 31841 6.7 28 13 Tu 1 . + CDS 32079 - 32261 128 ## FN2096 hypothetical protein + Prom 32366 - 32425 6.6 29 14 Op 1 24/0.000 + CDS 32448 - 33692 1461 ## COG2804 Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 30 14 Op 2 10/0.000 + CDS 33689 - 34729 611 ## COG1459 Type II secretory pathway, component PulF + Prom 34844 - 34903 5.2 31 14 Op 3 . + CDS 34934 - 35410 782 ## COG2165 Type II secretory pathway, pseudopilin PulG 32 14 Op 4 . + CDS 35395 - 35892 297 ## FN2092 integral membrane protein 33 14 Op 5 . + CDS 35904 - 35981 190 ## 34 14 Op 6 . + CDS 36014 - 36427 297 ## FN2091 hypothetical protein 35 14 Op 7 . + CDS 36429 - 36986 432 ## FN2090 hypothetical protein 36 14 Op 8 . + CDS 36955 - 37512 590 ## FN2089 hypothetical protein 37 14 Op 9 . + CDS 37505 - 38674 817 ## FN2088 hypothetical protein 38 14 Op 10 . + CDS 38683 - 39483 637 ## FN2087 hypothetical protein + Prom 39486 - 39545 2.5 39 15 Tu 1 1/1.000 + CDS 39617 - 41164 1723 ## COG1450 Type II secretory pathway, component PulD + Term 41280 - 41332 -0.0 + Prom 41376 - 41435 10.4 40 16 Tu 1 . 
+ CDS 41625 - 42110 804 ## COG3212 Predicted membrane protein 41 17 Tu 1 . - CDS 42193 - 42912 592 ## COG3619 Predicted membrane protein - Prom 42939 - 42998 12.0 + Prom 42937 - 42996 9.1 42 18 Op 1 . + CDS 43028 - 43618 762 ## FN2083 hypothetical protein + Prom 43626 - 43685 14.5 43 18 Op 2 . + CDS 43779 - 45413 2483 ## COG2759 Formyltetrahydrofolate synthetase Predicted protein(s) >gi|296154509|gb|ADVK01000029.1| GENE 1 2 - 5554 7057 1850 aa, chain + ## HITS:1 COG:NMB1214 KEGG:ns NR:ns ## COG: NMB1214 COG3210 # Protein_GI_number: 15677087 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Neisseria meningitidis MC58 # 627 1199 1144 1711 2273 74 23.0 1e-12 VIRSYGKVKLNEKDVSNVEGYILSDGITKEQAQNWKVEEKFKEESKEDKNKKDLNEKKEQ TTGTLLNIGRLDNTKGIIASLSQTTVNVGKIANNDGKIVSKGAVELTTPNEYEYKGLVEG DYSTTLNAKKIIINNNIDRKNTLSLISKEELTLGQSIRARILSIATQADLRNTKDISATN LLSITAKNIENSGNLYSNENTYLEAKNGDLINKDGGSIKADKQVYIKVENGRVVNGTAKY LDGAYRREDGVLVDNRQTSKEKPSIIAGKTETIIKAKDFINTSLIGKAGQGITYIELTGQ GINASIGDNIAKIEGQKVSVEGGTGLRNIGAVISGTEITRVTSKNGKVLNESTIVSGSTE NIRNIGKIEGNGIVYVEGKEIENIAGNIGGAGGTLLKSTKGNIEDKTITLIDNRRGVTET YTEMREVKPKRKWWRRRDEYFERKEDEKPEPKKYEQVQVYKYWDIVNKTKTVSGVIGNGK DTILDSAKDLVLESSDIRAKDNIVLNAKNHILMLSTVNTEYKFKTETSSKRKWGRKKTTT KTWTEDNTYANPVELTSGGYILINYREKGKPADNKGVFAQGVNFNAKKGIIAKSDGNIYI QGVKDKLNSTYDSHTTRSFIGIKYKRTSDYISDNREKYKHSQLYGESGITLDSQGRLRVQ GIDIQTIGPVYLKGVKGVEILSGNEVSSRYEVHTSKKLKISGDKNGIFRGVEQSKNTKEI DTIKSIGSIINSKGSMVTIEGDKVVSIGSKIGAAGDINLIGKNGVIIKDGENFAKIKEET EKMRTGMFMSWSLKNLSASAGVEAVYNKTNEGKTIVTPEKNTFVTNKNIYITSSEGNVLL QGDFGAKENIGITAEKGKIYIKDSKSEILTDSKSVNARMALAFGINLSGIKDTLKSYRNS YKALKEIGNLGRVISFARDMAKGKSLLESLDGKEDTINAMNNLFAGPSSGGVTAGLDLTG SINAAKSTGKYLQNITTNIRAGKDITFKSKEFETEGSFIRAENNLSIDASKILIQASADK YATNSKNMGANFGVTLMGVEGVSAGLNYGQMNSKGTLYNNAQIQAGNKLIVKADNMTIRG GKLKGKHTDVDVKENLLIESLQDSEKMKQIGTNIGYSLKYGKDKDGNPDNRNNGNLGLSY GEKKKLWVKEQSGIIGTESVKVKVGGKLSLIGSIIANIDDKGNLALSYGKLEVKDLDSYD 
KQININGSVELNQRSKDNDNKELVLKENKEKDKKDNDNNDNSNNNLKKSKNSNDDKGEDV EEKSNRVDETYGIGIEGSDKRKITRATIGKGVINGKEEVEGVNRDIGKSDEIIKDINVKK VEVQYKSERNSWGDFGKIMASNAGIIGDFLDDFNEKALGKSRPDYEIKFRNKVYESISKF ESKLAPINDIVSIFPTGEYDGGILEQIVKLVRKDKTPIIEIAIRKNEDGTPSINLEEKRK LSEVGVDENGKRQKTVQVFVNGIRERRSDAVRNAILKSMSPENLEKYKRGETVKIALIYN QTRGLVADGLECVVGKVFDGSLSSLYGATGVSRGAAIAFASGDKSINYDTGTYSQGNIVT VGAFNKLKNNNIKLGNEETSFLLRMYGSPTKKSTMVAFEEPLGIKVVGSAANMTDFVAHS KKSLGIFGETKLVNLANVEKVKNNKIQNILGYVPVLKVFTKETGAGLSLPQLSDENKEKD DYKYSIVTGIEKIEQIQQNYSKILPKDMRLSDKGIESLINNSHGTYTYESAAIAENIGKK LEEYKVASPDRKEELEKEIKDLYIQDQVNKINLMIYGPPILNDSPFLLDEVSEYNKGEFG KKYDYGNYQDKGLSKNKELNISPQSYVKKKNSTIIDINEYLRNLRKGVDR >gi|296154509|gb|ADVK01000029.1| GENE 2 5554 - 6036 528 160 aa, chain + ## HITS:1 COG:no KEGG:FN2115 NR:ns ## KEGG: FN2115 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 160 1 151 151 108 44.0 9e-23 MMKRMIILIIMGLTLSSCQLFTEAIKDNINRVGMERERKEARKKDAYAAVGNPEYETGVE LAIQDIKKRPVNKKVEFGETTLLIPENTRLNPKHGNIVDEKTGYGIAILFEIKDYCTKVF YRKKVRNDKYILLYYNNEDKNLNVIGQKIIKANSLTNTCK >gi|296154509|gb|ADVK01000029.1| GENE 3 6288 - 7160 941 290 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328333|ref|ZP_06870860.1| ## NR: gi|296328333|ref|ZP_06870860.1| hypothetical protein HMPREF0397_1053 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_1053 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 290 1 290 290 514 100.0 1e-144 MGAFNKLKNNNIKLGNGEDSFLLRMYGSPTKKSTMVAFEEPLGIKVVGSAANMTDFVAHS KKSLGIFGETKLVSLANVDKVKNYKIQNILGFLPALKVFTKETGAGLFLPQLSNENKEKD DYKYSIVVTKEIKDQIQENYSKVLNPNEMLSNKGIESLISNSHGTYTYESAAIAENIGKK LEEYKVASPDRKEELKKEIKDLYIQDQVNKINLMIYGPPILNDSPFLLDEVSEYNKGAFG KKYGYGNYQDKGLPKNKEMNLKPKLYTRKGNPTTADINEYLKNLREGVGL >gi|296154509|gb|ADVK01000029.1| GENE 4 7157 - 7657 573 166 aa, chain + ## HITS:1 COG:no KEGG:FN2115 NR:ns ## KEGG: FN2115 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 11 166 1 151 151 103 42.0 4e-21 MMLKRVVMLIVLGLMFSSCQEIKDFGELMGIKDAKNRYEREREKKELSKKDAPGAIAVDE YKEEVKGVLQNIMKRPVNKKVQFEGTTLLIPENTRLNLKHGNIVDEKTGYGIAIMFKVKE YCTEVFYRKKVKDNKYILLYYNNMDKDLNAIAQKIIKANGLTNTCK >gi|296154509|gb|ADVK01000029.1| GENE 5 7882 - 8799 1149 305 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328335|ref|ZP_06870862.1| ## NR: gi|296328335|ref|ZP_06870862.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 305 28 332 332 573 100.0 1e-162 MYSQGNEVGLGAFNWLQDKGIKLGYKDKNKFLVGMFGSPVMRDLIAGFGDSLNFKFRGSA INFRDFIGNDDKMYGIFGETKLINYTELGKTNIGKENNSLIEKIKRGPIAKALGRKQEMP GLFVSRLFIADKDDNDFKYSYDITVDDIRRIKENYRNAKDGKKDLTDNNIVNTIRYPHGV YTYIDPEKAEEKYNLAVMYRKASPVEKKVIEERMKAINKEVHDYRLRLAIEGPPILIDSP YLTKAVEDYRKDEFRKEYGYGNYQDKGLPKNEKMNNSPQPYTRKETPPTLGIEEYLKELR EGVGL >gi|296154509|gb|ADVK01000029.1| GENE 6 8796 - 9293 696 165 aa, chain + ## HITS:1 COG:no KEGG:FN2115 NR:ns ## KEGG: FN2115 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 11 165 1 151 151 94 38.0 2e-18 MMLKKGIVLIILGLIFSSCDLIYYGKIAVYENKYRAESESETREATKKDGPGAISKDEYK EDVERVIQDIMKRPVNKKVEYEGTTLLIPENTRLNLKHRNIVDEKTGYGIAIMFNLDDGC TPRVFYTKKIRRDLYIFLFYNEGDKELDVIAEKIIKANGFSKNCK >gi|296154509|gb|ADVK01000029.1| GENE 7 9375 - 9653 333 92 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328337|ref|ZP_06870864.1| ## NR: gi|296328337|ref|ZP_06870864.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 92 1 92 92 157 100.0 2e-37 MDKIYKEVHDYRLKLAIEGPPILIDSPYLTKAVEDYRKEEFRKEYSYGNYQDKGLPKNKE KNMSPQPYTRKETPTTLGIGEYLKGLREGVGL >gi|296154509|gb|ADVK01000029.1| GENE 8 9650 - 9820 173 56 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328338|ref|ZP_06870865.1| ## NR: gi|296328338|ref|ZP_06870865.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 56 1 56 56 99 100.0 9e-20 MMLKRIVMLIVLGLIFSSCDFIHYGKIAVYENKLRLEMERERRIKKKRWTSSNCCG >gi|296154509|gb|ADVK01000029.1| GENE 9 9858 - 10142 238 94 aa, chain + ## HITS:1 COG:no KEGG:FN2115 NR:ns ## KEGG: FN2115 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 94 55 151 151 69 42.0 3e-11 MKRPVNKRVEFEGTTLLIPENTRLNLKHGNIVDEKTGYGIAIMFKKETYCAKVFYTKKVR NDLYIVLFYNYKNKNLDTIGQKIIKANGFTNTCK >gi|296154509|gb|ADVK01000029.1| GENE 10 10409 - 11077 866 222 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328340|ref|ZP_06870867.1| ## NR: gi|296328340|ref|ZP_06870867.1| hypothetical protein HMPREF0397_1060 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_1060 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 222 1 222 222 371 100.0 1e-101 MMLKNSFKFIVLIILVFIINACSSNSNSFWGFKPHFSTGTYIDAYAIIENRKINRMGIPK KDIDKMNDIINDKYGIRFIDDERIAPKDYNENYRIKFYNDFKMTVNGKEYIMSKEKIRYS AYDYDLELPIKITHTNYNEYILDIGEIEIIDTDGKIIRPRTKIPPILFKKTIFRRFINDI TGSDYDVYYRGWAEDYPKDPSTLKKMYNNLEKKFGKFKNIKK >gi|296154509|gb|ADVK01000029.1| GENE 11 11504 - 11818 293 104 aa, chain + ## HITS:1 COG:no KEGG:FN2115 NR:ns ## KEGG: FN2115 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 12 104 51 151 151 68 40.0 9e-11 MDKYRVGVEEAIEDVMKRPVNKKVQFEGATFIIPENTRINPKHGNLVDEKTGYGIFISFS INPHCISKKINNREYGFFFDKHDTNINKIAKEIMRINGFKDTCK >gi|296154509|gb|ADVK01000029.1| GENE 12 11848 - 12330 511 160 aa, chain + ## HITS:1 COG:no KEGG:FN2115 NR:ns ## KEGG: FN2115 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 9 160 1 151 151 67 32.0 1e-10 MKKIILLLMMVLMLSSCYYLDVMIYNMETRYIVNQAAKKDGEAAYFVDEYTEGVKAAIKD VTKRPLTQKVKYGELELILPENTKIKKISDNIVDKKTGYGLQIVFNKSGYCTNPGISYMG YYSKKAENYIYELIYNKNIEGLEEIAQKIIKANGFTKGCK >gi|296154509|gb|ADVK01000029.1| GENE 13 12361 - 13020 912 219 aa, chain + ## HITS:1 COG:no KEGG:Sterm_1092 NR:ns ## KEGG: Sterm_1092 # Name: 
not_defined # Def: hypothetical protein # Organism: S.termitidis # Pathway: not_defined # 6 219 4 210 210 141 45.0 2e-32 MKKNLFMILGLLTLGTSVFAKDNIVEFKTGFSPAPRYDVTPSKKAKFSYEIGAEYRYLVT ENTELGAGIAYQSHGKLKGFTDVEDKNLKVEVEPFKLYDSVPLYATVKYNFRNSSDITPY VKADLGYSFNVNGDNKTKYKTYSKATGAMLDSGTLKDLKVKNGMYYSVGTGVTYKGFVVG VAYQVNTAKIKGTRYDGTKDNGSANFRRFTINLGYQFTF >gi|296154509|gb|ADVK01000029.1| GENE 14 13039 - 14781 1851 580 aa, chain + ## HITS:1 COG:PA0040 KEGG:ns NR:ns ## COG: PA0040 COG2831 # Protein_GI_number: 15595238 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Hemolysin activation/secretion protein # Organism: Pseudomonas aeruginosa # 92 580 91 562 562 176 25.0 1e-43 MLRVKKTLLIFSLMLAINNLSFSEGKILKNELPKNEREFFEGDRLIDLQDEIKESETNID TSELPKFFVKTILLKTPTITIKPLVNPKKIDKILAEYKNKEINILDLRALVKKLNEEYAN EGYITTRVYLEPNQNLQSGEIKLIALEGKIEEIILDKDTAKDKRRAFFAFTNEKDKILNI SHIDNGIDNLNRVESNNSKINIVPGTKQGYSKVIIESEKEKPIRLVLNYEDTQKNKQKYK ATLEYDNLFGINDNIYFSYRGDVRKLTKDHKDDYTEAYSAGYSFPFKSWTLRFSFNKSRE KSLILGNTTNYTLLTKSNQYGVNTTKLLYRDADTKVNLTMGLDIKREKTYVAARRLETQD RNITVASVGVNGLFKPFKGIMSYNLSYSKGIKGFRSKEDNPFNAGTMSTPSIGAADNRFE FGKVNLNLSYYKPFYFKNQGVIFRTMFNAQYTNDSLFSIEKYSIGDFSTVKGFPSTVSGD IGYNTKVELSYIISNNEGKMGQFLYKVRPYIEADLGKVRNNYNEKGEKGGKIATMSSYSV GIRYYGEKITLDTGISKIDSGRSLMKTDSHRGFATVSVIF >gi|296154509|gb|ADVK01000029.1| GENE 15 14793 - 16613 2137 606 aa, chain + ## HITS:1 COG:FN1514 KEGG:ns NR:ns ## COG: FN1514 COG2849 # Protein_GI_number: 19704846 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 178 606 47 503 503 137 30.0 5e-32 MKKIIMFLLTLIFLIGCGTSEQDKGIEIDNLQEIDGKIYKYGEEKPYSGKIIIKNENDKV VMIETAKDGKIEGEVKTYYESGKLKEKYNIKDEKIDETYKWYAEDGSIEIENKIAISTKD KKAFTGTYTETHINGNVSKIEKFVDGKRDGETLYYYDDGQIKERIPYIKGLKEGDYFYLN KKGEVIGKGVFLNDKREGQWLIYDEEDKTLIEKIYSNGLEEGVYKVYFENGKIRAEGTYK DGKLVGLYRAYYPDGSLETEVNYIDGKREGPYKINYENGKVKETGNYKDDKLVGDVSVMS TNGEKIADLHYTQDGKKTGKWIYLYPNGKVQQEFVYENDKPIGTYKKYYENGAISEEGQY 
KNGLLDGEVKLYYDNGKIASKVNFIRNSKEGEAVSYYENGKEREKGTFKHNRYEDGVNIY YENGDIAVKQIFKNGKLNGSYKEFYEGNKSKIEATYVNGKEEGDYIVYYENNQKQVISKF KNGQPEGEWIYYYSNGKEKKKMNFLNGMKDGKEIEYYENGNKKLESEFKNNKETGLWTVY FESGKISTTFSYLDGLLNGPVTIYDEKGIKIVDGNYKEGREDGKWLFYDEKGKIKKEENY ILGKKQ >gi|296154509|gb|ADVK01000029.1| GENE 16 16641 - 17630 1690 329 aa, chain - ## HITS:1 COG:FN2109 KEGG:ns NR:ns ## COG: FN2109 COG1087 # Protein_GI_number: 19705399 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-glucose 4-epimerase # Organism: Fusobacterium nucleatum # 1 329 1 329 329 684 99.0 0 MSILVCGGAGYIGSHVVKYLLEKNEDVVVVDSLITGHIDAVDEKAHLELGDLKDEEFLNR VFEKYQIDGVIDFAAFSLVGESVGEPLKYFENNFYGTLCLLKVMKNNNVDKIVFSSTAAT YGEAENMPILETDRTEPTNPYGESKLAVEKMFKWCANAYGLKYTALRYFNVAGAYPSGEI GEAHTCETHLIPLILQVALGQREKISIYGDDYPTPDGTCIRDYIHVMDLADAHYLALNRL RNGGDSQIFNLGNGEGFSVKEVIEVTRKVTGHPIPAEVSPRRAGDPARLIASSKKAIDEL KWKPKYDKLEQIIETAWNWHKNHPNGYED >gi|296154509|gb|ADVK01000029.1| GENE 17 17630 - 19159 1765 509 aa, chain - ## HITS:1 COG:FN2108 KEGG:ns NR:ns ## COG: FN2108 COG4468 # Protein_GI_number: 19705398 # Func_class: G Carbohydrate transport and metabolism # Function: Galactose-1-phosphate uridyltransferase # Organism: Fusobacterium nucleatum # 1 509 1 509 509 1021 99.0 0 MEICSLINRLIKYALKNSLITEDDVMFVRNELMALLHLKDWQDVKENCQIPEYPQEILDK ICDYAVEQKIIEDGVTDRDIFDTEIMGKFTAFPREIIETFKELSQQNIKLATDFFYNFSK KTNYIRTERIEKNLYWKSPTEYGDLEITINLSKPEKDPKEIERQKNMPQVNYPKCLLCYE NVGFTGTLTHPARQNHRVIPLTLDNERWYFQYSPYVYYNEHAIIFCSEHREMKINRDTFS RTLDFINQFPHYFIGSNADLPIVGGSILSHDHYQGGNHEFPMAKSEIEKEVIFDKYPNIK AGIVKWPMSVLRLKSLNRKDLVDLADKTLKAWREYSDEEVGVFAYTNSTPHNTITPIARK RGDYFEIDLVLRNNRTDEANPLGIFHPHSEHHNIKKENIGLIEVMGLAVLPGRLKFEMRK IAEYLKDEDFEKKISDDKDTAKHLTWLKAFINKYSNLKTLSVDEILENILNVEIGLTFSR VLEDAGVFKRDEKGKNAFLKFINHIGGRF >gi|296154509|gb|ADVK01000029.1| GENE 18 19159 - 20328 1672 389 aa, chain - ## HITS:1 COG:FN2107 KEGG:ns NR:ns ## COG: FN2107 COG0153 # Protein_GI_number: 19705397 # Func_class: G Carbohydrate transport and metabolism # Function: 
Galactokinase # Organism: Fusobacterium nucleatum # 1 389 1 389 389 735 99.0 0 MLENLIKDFKEIFKYSGEVERFFSPGRVNLIGEHTDYNGGFVFPCALDFGTYAVVKKRED KTFKMYSKNFENLGIIEFNLDNLIYDKKDDWANYPKGVIKTFLDRNYKIDSGFDVLFFGN IPNGAGLSSSASIEVLTAVILKDLFKLDVDIIEMVKMCQVAENKFIGVNSGIMDQFAVGM GKKDNAILLDCNTLKYEYVPVKLMNMSIVIANTNKKRGLADSKYNERRTSCEEAVKVLNN NGVNIKYLGELTVAEFEKVKHYITDEEQLKRATHAVTENERAKIAVEFLKKDDIAEFGKL MNKSHTSLRDDYEVTGLELDSLVEAAWEEKGTVGSRMTGAGFGGCTVSIVENDYVDSFIK NVGKKYKEKTGLEASFYIANIGDGAGKVK >gi|296154509|gb|ADVK01000029.1| GENE 19 20464 - 21981 1693 505 aa, chain - ## HITS:1 COG:FN2106 KEGG:ns NR:ns ## COG: FN2106 COG1288 # Protein_GI_number: 19705396 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 505 14 518 518 880 100.0 0 MSTNKKKKRGFPSAFTVLAIILVLAAALTYIVPSGQFSRLTYDDSTNEFVITDHDNNVTT EPATQEVLDRLQIQLSLDKFTEGVIKKPIAIPGTYQRIEQRPQGFLDIIKAPVTGSMDTV DIMLFVLVLGGIIGIINKIGAFDAGMAALSKRTKGKEFLLVTLVFLLTTLGGTTFGLAEE TIAFYPILMPIFLLSGFDVLTCIAAIYMGSSIGTMFSTVNPFATVIASNAAGISFTEGLT FRIVALVLASVITLVYMYWYAQKVKKDKTKSYVYVDEEEIHKRFLGEYDSNSEKEFTWRR KLCLLIFALAFPVLIWGVSLGGWWFEEMTALFLGVAIVIMFLSGLSEKEAINTFISGSAD LVGVVLTVGLARSINIVMDNGFISDTLLYYSTEFIAGMSKGVFAVAQLIIFSFLGFFIPS SSGLAVLSMPIMAPLADTVGLSREVVINAYNWGQGWMSFITPTGLILVTLEMAGTTFDKW LKYILPLMGIMGVFSALMLIINTML >gi|296154509|gb|ADVK01000029.1| GENE 20 22293 - 23777 2313 494 aa, chain - ## HITS:1 COG:FN2105 KEGG:ns NR:ns ## COG: FN2105 COG3333 # Protein_GI_number: 19705395 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 494 1 494 494 775 100.0 0 MSDVLFGYAAALTPINLVAAVISVTIGITIGALPGLSAAMGVALLIPITFGMDPSTGLIT LAGVYCGAIFGGSISAILIRTPGTPAAAATAIDGYELTKQGKAGTALGTAIIASFIGGIL SAIPLYLFAPRLAKLALLFGPAEYFWLSIFGLTIIAGASTKSIVKGLISGALGLMLSTVG MDPMLGNPRFTFGVPALLSGIPFTAALIGLFSMSQVLMLAEKKIKQAGNMVEFDNKVLLS KKQILEILPTSLRSTVIGSIIGILPGAGASIAAFLGYNEAKRFSKKKELFGHGSIEGIAG SEAANNAVTGGSLIPTFTLGIPGESVTAVLLGGLMIQGLQPGPDLFTVHGKITYTFFAGF 
VIVNIFMLILGLFGSKLFARVSRISDSYLIPLIFSLSVIGSYAIHNQMSDVWVMFVFGII GYFVQKFELNSASIVLALILGPIGESGLRRSLILNHNNYSILFQSTVSKVLLLLTLFSLF SPIIMSKLKKRSKE >gi|296154509|gb|ADVK01000029.1| GENE 21 23798 - 24241 354 147 aa, chain - ## HITS:1 COG:no KEGG:FN2104 NR:ns ## KEGG: FN2104 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 147 1 147 147 204 100.0 1e-51 MRKYDKFLTIGLFILEVFYFLLIKQLPPKAARYPYFVLGLMIFLTLLLAINTFLIKPKNT EENKEEDQFKGNLYGQFFLVIALSAIYVILIDIIGFFVTTAVYLFVTMLALKSNIKWSII VSILFPIFLYLIFVSFLKVPVPRGFLL >gi|296154509|gb|ADVK01000029.1| GENE 22 24349 - 25326 1473 325 aa, chain - ## HITS:1 COG:FN2103 KEGG:ns NR:ns ## COG: FN2103 COG3181 # Protein_GI_number: 19705393 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 18 325 1 308 308 589 99.0 1e-168 MKKNFLAVLTLLLSLLLVACGGEKKTETNPETYPDKPVNVIIAYKAGGGTDVGARILMAE AQKNFPQTFVIVNKPGADGEIGYTELAKATPDGYTIGFINLPTFVSLPHERQTKYKIDDV EPIMNHVYDPGVLVVKADSQFQTLAEFVDYAKAHPEELTISNNGTGASNHIGAAHFAKEA GIQVTHVPFGGSTDMISALRGGHVNATVAKISEVASLVKSGELKLLASFTENRLEGFEDV PTLTESGYPVIFGSARAIVAPKGTPKEIIQKLHDVFKTALESPDNIEKSKNASLPLKYMS PEELAQYIKDQEKYIIETVPTLGIN >gi|296154509|gb|ADVK01000029.1| GENE 23 25406 - 27301 2309 631 aa, chain - ## HITS:1 COG:FN2102 KEGG:ns NR:ns ## COG: FN2102 COG0488 # Protein_GI_number: 19705392 # Func_class: R General function prediction only # Function: ATPase components of ABC transporters with duplicated ATPase domains # Organism: Fusobacterium nucleatum # 1 631 1 631 631 1061 100.0 0 MAILQVNDIYMGFSGETLFKEISFSVDEKDKIGIIGVNGAGKTTLIKLLLGLENSEINPA TNERGTISKKSNLKVGYLAQNTQLNKENTVFNELMTVFNNLLEDYNRMQEINFLLTLDLD NFDKLMEELGEISERYERHEGYSIEYKIKQILNGLNIPENLWTMKIGNLSGGQNSRVALA KILLEEPDLLVLDEPTNHLDLTSIEWLEKILKDYNKSIILISHDVYFLDNVVNRVFEIEG KRLKDYKGNYTDFLIQKEAYLSGEVKAYEKEQEKIKKMEEFIRRYKAGVKSKQARGREKI LNRMEKMENPVVTTQKIKLKFDIKAQSVDLVLDIKNLSKTFEDKLLFKDLNLKVYRGERI GLIGKNGTGKSTLLKIINNLEKASSGEFKIGERVSIGYYDQNHQGLGLNNNIIEELMYYF 
TLSEEEARNICGAFLFREDDIYKKISSLSGGEKARVAFMKLMLEKPNFLILDEPTNHLDI YSREILMDALEDYPGTILVVSHDRNFLDTVVNKIYELKTDGVETFDGDYEAYKQERDNIK VKNEEAVKSYEEQKKAKNRLTSLEKKLVRLEEEIQKIEEQKEEVNKKYLLAGEKNNVDEL MSLQEELDNLDNKILEKYQEWEDVEKELKNL >gi|296154509|gb|ADVK01000029.1| GENE 24 27327 - 28232 959 301 aa, chain - ## HITS:1 COG:FN2101 KEGG:ns NR:ns ## COG: FN2101 COG0697 # Protein_GI_number: 19705391 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 301 1 301 301 455 97.0 1e-128 MKKDANFGMLSTFIGGTLWGINGVMGSFLFLYKNITTNWLIPYRLVLAGLLLLGYLYYKT GSKIFDILKNPKDLLQILLFGFIGMLGTQYTYFSAIQYSNAAIATVLTYFGPTLVLIFMC LKERRKPLKYEIIAILLSSFGVFLLATHGDVTSLQISFKALVWGMLSALSVVFYTVQPEK LLKKYGPPIVVAWGMIIGGIFITFVTKPWNIDVIFDFTAFFVFLLIVFFGTIVAFILYLT GVNIIGPTKASIIACIEPVAATICAILFLGISFGFLDFIGFICIISTIFIVAYFDKKVKK K >gi|296154509|gb|ADVK01000029.1| GENE 25 28242 - 29735 1624 497 aa, chain - ## HITS:1 COG:FN2100 KEGG:ns NR:ns ## COG: FN2100 COG1404 # Protein_GI_number: 19705390 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Subtilisin-like serine proteases # Organism: Fusobacterium nucleatum # 82 497 1 416 416 776 99.0 0 MKVRLAKEDKNSNYKVSLSDISKEKDFIKILEDYNIQYKKTEYFKDFFMYKLIDINSKFI MILQEKASNYIKYIEPVSIYSLPIQIDDIEGEVPVIYPEENKNYVTLGVIDNGIAHIKYL APWIKRVHTRFLKKDTSATHGTFVSGIALYGDKLENREMVKNEGFYLLDATVLSATTIEE DDLLQNITLAIKENHKKVKIWNLSLSVKLAIEEDTFSDFGVVLDHLQKTYGVLICKSAGN GGNFMKKLPKGKLYHGSDSLLSVVVGAINNERYASNYSRIGLGPRGTIKPDVASYGGELL LGDNGEMIMKGVKSFSRNGNIASSSGTSFATARISSLATIIYQNICKDFKDFSDFNSTLL KALIIHSAKNTDRNLSIEEIGYGIPATSTEILSYFKNENVKIFNGVMEKNQEIDLDASFF EYKKDIKVKITLVYDTEFDYLQNGEYIKSDIKIKDISENGRNLTRKFEGILARNNKIELY SDSDIKKNYTLIVEKLN >gi|296154509|gb|ADVK01000029.1| GENE 26 29856 - 30233 332 125 aa, chain - ## HITS:1 COG:FN2099 KEGG:ns NR:ns ## COG: FN2099 COG2832 # Protein_GI_number: 19705389 # Func_class: S 
Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 125 1 125 125 206 94.0 8e-54 MKNLKKKIYIFVGILAVGLGIIGLFLPVMPTVPFLLVALFCFERSSKKYHEMILNNKYFG KILRDYYEGKGLTTSVKIKAILFLTCGIAFSFYKVQHLHLRIMLAVIWLGVTIHIILLKT KPKRK >gi|296154509|gb|ADVK01000029.1| GENE 27 30382 - 31155 1288 257 aa, chain + ## HITS:1 COG:FN2098 KEGG:ns NR:ns ## COG: FN2098 COG0489 # Protein_GI_number: 19705388 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: ATPases involved in chromosome partitioning # Organism: Fusobacterium nucleatum # 1 257 1 257 257 474 99.0 1e-134 MIQKEAPKVKDDKNIKNVIAVMSGKGGVGKSTVTTLLAKELRKKGYSVGVMDADITGPSI PRLMGVSEQKMTTDGKNMYPVVTEDGIEIVSINLMIDENEPVVWRGPVIAGAVMQFWNEV VWGNLDYLLIDMPPGTGDVPLTVMKSFNIKGLIMVSVPQDMVSMIVTKAIKMARKLNMNI IGLIENMSYITCDCCNNKIYLTDENDTQTFLKENDVELLGELPMTKQIAKLTKGESEYPE ETFSKIADRVMEKVKEL >gi|296154509|gb|ADVK01000029.1| GENE 28 32079 - 32261 128 60 aa, chain + ## HITS:1 COG:no KEGG:FN2096 NR:ns ## KEGG: FN2096 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 18 60 23 65 65 71 100.0 1e-11 MKKVILILLFLLIYIQIFSLQSKKNLVKIDIIGKSGIKSYYVNFSNEQNLDSFEIYDTLD >gi|296154509|gb|ADVK01000029.1| GENE 29 32448 - 33692 1461 414 aa, chain + ## HITS:1 COG:FN2095 KEGG:ns NR:ns ## COG: FN2095 COG2804 # Protein_GI_number: 19705385 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB # Organism: Fusobacterium nucleatum # 1 414 1 414 414 738 97.0 0 MGNSSEKIEKYFKKTINSTMNNNKNLYIEDIEELFIRENVNSNKGIFSILLEAIKFSASD IHIEALTDKIRIRYRINGILKEVAEIDKSFLSSIVSKLKILSSLDIVEKRKPQDGRFSLK YKGREIDFRTSIMPTMNGEKIVIRILDKFNYNFTLEDLYLSEENKKIFYKAINQNTGIIL VNGPTGSGKSSTLYSILKYKNREEVNISTVEDPIEYQIEGINQVQCRNEIGLGFATILRA LLRQDPDILMVGEIRDKETAEIAVKASLTGHLVFSTLHSNDSLGCINRLVNLGIDSYLLS LVLQMVVSQRLVRKLCPHCKKEDENYKEKLKSLNLSEEKYKDIKFYISDGCEKCMGTGYI 
GRIPVFEIIYFDDILKDILAQKKEIKQNFKTLLDDAMDKIKEGLTSLDEIMRQL >gi|296154509|gb|ADVK01000029.1| GENE 30 33689 - 34729 611 346 aa, chain + ## HITS:1 COG:FN2094 KEGG:ns NR:ns ## COG: FN2094 COG1459 # Protein_GI_number: 19705384 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Type II secretory pathway, component PulF # Organism: Fusobacterium nucleatum # 1 346 1 346 346 511 96.0 1e-145 MKNKKEKILFFTNELAIMLKSGLTFTTAIEIILREEKEKNFKEVLKKIHKNLIAGKSIFE SFKNFDKIFGNTYLYMLKIGEVSGSIAERLEDISKSLEFDLANQKKLGGILVYPIVVISL TLIIVTFLLTFILPNFITIFEENQVELPLITRILLFISRNFHYILLFIIALILIIFFTNM YINNNKFKRIQKDKWLLNMKLFGELRKLSLSSDLYHSFSILLSVGIGIIESVDILYINNN NYYIKENLLEVKKSLLAGNNIATALKKLNLYDERFSILITAGEESGYLSENLLQISKILK NDFEYKLKKLISLLEPLVVVFLGLIVAFVVVAIYLPILSIGDVFSQ >gi|296154509|gb|ADVK01000029.1| GENE 31 34934 - 35410 782 158 aa, chain + ## HITS:1 COG:FN2093 KEGG:ns NR:ns ## COG: FN2093 COG2165 # Protein_GI_number: 19705383 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Type II secretory pathway, pseudopilin PulG # Organism: Fusobacterium nucleatum # 8 158 1 151 151 283 98.0 9e-77 MKNRGFSLIEIVVAVAIMGILSGIVGLQLRSYIAKSKDTKAVATLNTLRVAAQLYQVDNE ETLIDTASLTTYDEQKVKDALKKLEPYLDNNAKAIIEKPEMAIGGSRAAQNGDIKYGGKV RITFKDPNGNSSDGYYMWLEPEGTTGGFDIKGNKWIEF >gi|296154509|gb|ADVK01000029.1| GENE 32 35395 - 35892 297 165 aa, chain + ## HITS:1 COG:no KEGG:FN2092 NR:ns ## KEGG: FN2092 # Name: not_defined # Def: integral membrane protein # Organism: F.nucleatum # Pathway: not_defined # 1 165 1 165 165 193 96.0 2e-48 MDRILIILLYILLFLVMYIDINKKYIPNILNFSILVLSIFICGIDKVDIFFIGASCYTLP ILIFYGYISDILKKEVFGFGDIKLIIPLGGLLYLGEINIFLQIYIFYLLVFSLATLYIII YIVISYCRNKPVKIKGVELAFAPYICLAFIIIYNFNLMEKIIEKF >gi|296154509|gb|ADVK01000029.1| GENE 33 35904 - 35981 190 25 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MTSYNVERAFWSSRDIIGCQVITTL >gi|296154509|gb|ADVK01000029.1| GENE 34 36014 - 36427 297 137 aa, chain + ## HITS:1 COG:no KEGG:FN2091 
NR:ns ## KEGG: FN2091 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 137 1 128 128 172 88.0 4e-42 MNKNKAFSLMEVIVSVFILILVLIPSIKLNIQQIKTYSKIKNADSELHFFTSLNNYLKSE NIINSHLEFNDYSDFITRFNNFGNSFQNLKNKNFKLIIDMEKTEIDFSNRKENASLIKVE YRGDKKIYKNTLLKFEE >gi|296154509|gb|ADVK01000029.1| GENE 35 36429 - 36986 432 185 aa, chain + ## HITS:1 COG:no KEGG:FN2090 NR:ns ## KEGG: FN2090 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 185 5 189 189 303 97.0 2e-81 MYINKNKALSFIEIIVAVTILLAVSFASLITFYSINKSFLVMNNTYKREKEIMTFRDLVI SHIKWNESLEIRVTNISKSNPINSLGDLFLKPDEKEGNLLVLKIKNYDEIEKKVEKYYRC FLFYEDKASLSYFDDSNIHSLVNIFNGTVILENCTGKFLVENNILKVYLKDKDKEYEEIL YYEQK >gi|296154509|gb|ADVK01000029.1| GENE 36 36955 - 37512 590 185 aa, chain + ## HITS:1 COG:no KEGG:FN2089 NR:ns ## KEGG: FN2089 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 185 1 179 179 305 98.0 8e-82 MKKYCTMNKNKAFIFLQVIIISFLFISLTLFVQILLNSRFNLYKTDIKTQENFQDYGFLD EIIKQEFKNIEEKINNREIKDVTEYIALSENGEKIFLIAEYNKRISFGGYRLLEDEKGKN YYNYLKDKIKGMYKPRVNVHFVKNIKIADKNYSLFATMEYEIGSSREPDTLHNGILTRMW IKENV >gi|296154509|gb|ADVK01000029.1| GENE 37 37505 - 38674 817 389 aa, chain + ## HITS:1 COG:no KEGG:FN2088 NR:ns ## KEGG: FN2088 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 389 1 389 389 580 97.0 1e-164 MFKKILPLSHIDDIEKRVRNTVLNLENKFFSIFKIQIENIINEEDRKEKIEDRLEVIFPR YNSDDFVLRYEILKKDRKKENIVVYLLDLALLNDYIIDDMKDYGFVSIIPSFFVCREKKN INHYFNFDISETMLVVTEYMNNNILDISTFKLSKSSFDNDEEVDIEDKYSIANSYLVNIE DDIEIIFTGNKINFDELDLMNKNYSYFEVESLDFTKYLNFLPDDIKNKYSLYYVNTKYLY TLLIISIITVLSTIILYHNIHKSEEKLEQLEAESSSLEDEINEARNEMEEIEKQHKDLLE YIEKEEYKDFKISSLLEELSYLCPNGVKISSIEYDENKIFNIEGSTGKIDNVVKFLENIT NSKNFKLYNYDYILRKENEIEFKLEIKYF >gi|296154509|gb|ADVK01000029.1| GENE 38 38683 - 39483 637 266 aa, chain + ## HITS:1 COG:no KEGG:FN2087 NR:ns ## KEGG: FN2087 # Name: not_defined # Def: hypothetical 
protein # Organism: F.nucleatum # Pathway: not_defined # 41 239 1 199 226 260 96.0 5e-68 MDLDLKNKNLKIIVLIFIYLLVFYFLIFKNILRLSEIKELVEQEDIKIGKLNYEKNVVLK ALDIKRKDFEKDREKIENQEEKSDKDEFDNIPNLFNYIEEKISKNNIIFQSFGRSRKDGD KLKVAMSFNGSERNIKNFFREIENENYDINFSSSYLKISVNNNLLEVKTTLSASVLDKTE KIDSLIDNNINSKDIFKRTSKNPEGEEVSYSYMRIGDKTYYRVSKKKEIESKTTTENKED KKSKTKKSSKKEEDKESVDNKKEKVE >gi|296154509|gb|ADVK01000029.1| GENE 39 39617 - 41164 1723 515 aa, chain + ## HITS:1 COG:FN2086 KEGG:ns NR:ns ## COG: FN2086 COG1450 # Protein_GI_number: 19705376 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Type II secretory pathway, component PulD # Organism: Fusobacterium nucleatum # 114 515 1 402 402 676 98.0 0 MKKFTLIIFLIFNTFIFPAALNRDIDIIDMPLHEVLAVLSKETGKNLICSKEAKDIVIDT YFNKGEDVNSVLQFLAESYDLSMKKENNTIIFMLQSEKNTKKAKIIGKITSNNMALKNAK IELKDLNKVVYTDSSGNFIIDNIPKDVYVCKISKKGYEERGEIIDTVKSINVLNVDLKEN QNNYESKDISDDLSNSNFYEIDGKFYYTKTFSLFNVSPDEVSRILNETFGENIKVSTLSK VNKLVVSAERDILENTISIIEDIDKNPKQVKITSQILDISNNLFEELGFDWVYKQNVESQ ERNSLTAMILGKAGLNGVGSTVNIVRQFHNKSDVLSTGINLLEATNDLVVSSVPTLMIAS GEEGEFKVTEEVVVGIKTHREDKKDRYSEPVFKEAGLIMKVKPFIKDNDYIILEISLELS DFKFKKNVLNLKDINSGTYNSEGGSKVGRGLTTKVRVKNGDTILLGGLKKSIQQNIESKI PILGDIPIISFFFKNTTKKNENSDMYIKLKVEIDE >gi|296154509|gb|ADVK01000029.1| GENE 40 41625 - 42110 804 161 aa, chain + ## HITS:1 COG:FN2085 KEGG:ns NR:ns ## COG: FN2085 COG3212 # Protein_GI_number: 19705375 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 161 1 161 161 246 100.0 1e-65 MKKILIVAAIILGSIGFSINVLAELNEQQAKDIIKKEVPNGQITKFKLDKENGKMVYEIK VMDGNIEKEYEIDAETGAILKMEQEQKGNKNANSVNNPKISSDKAKEIALKNSKNGKFKE IELKHKNGVLVYDVEIAEGFMDREFLIDANTGEILRDKKDF >gi|296154509|gb|ADVK01000029.1| GENE 41 42193 - 42912 592 239 aa, chain - ## HITS:1 COG:FN2084 KEGG:ns NR:ns ## COG: FN2084 COG3619 # Protein_GI_number: 19705374 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium 
nucleatum # 1 239 1 239 239 406 99.0 1e-113 MDKQKFKEFFNNKEEFAPNERLWLFCMLMLIAGFWGGFTYSLRGRVFVNAQTGNLVFLSL GIASWDIALIKNALATFLAYFCGIITAEFISKEINKISFLIWERILLFFSLIVTICLGFI PETAPFEFTNFSIAFTAAMQFNTFEKAHGMGMATPFCTNHVKQASANFVRFLKTRDSNKL RISLSHLSMILSFITGATLSIFLGRFFLGKAIWFSSFFIVITLYFFSKSLRTYKTKKLK >gi|296154509|gb|ADVK01000029.1| GENE 42 43028 - 43618 762 196 aa, chain + ## HITS:1 COG:no KEGG:FN2083 NR:ns ## KEGG: FN2083 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 196 1 196 196 359 100.0 3e-98 MKAKIKNIDLMKVKDNENTYYGFSQEWYKDVWQQKAGCGATVASSIINYYNQIDNFKEIG ILDALKVMEELWFYLLPTEHGLNSIKLFYDGIKNYYNNKKVIIDYINVDIKDKPSLDEII NFIGKELLKDRPIAFLNLCNGEENNLDKWHWVLVVEMFEENGEYFLNIIDDKEIMKINLS LWYRTITNDGGFITFK >gi|296154509|gb|ADVK01000029.1| GENE 43 43779 - 45413 2483 544 aa, chain + ## HITS:1 COG:FN2082 KEGG:ns NR:ns ## COG: FN2082 COG2759 # Protein_GI_number: 19705372 # Func_class: F Nucleotide transport and metabolism # Function: Formyltetrahydrofolate synthetase # Organism: Fusobacterium nucleatum # 1 544 1 544 544 1031 100.0 0 MTDIQIAQAAKKENIVEIAKKLGLTEDDIEQYGKYKAKVNLDVLQKNKRPNGKLILVTAI TPTPAGEGKSTVTIGLTQALNKMGKLSAAAIREPSLGPVFGMKGGAAGGGYAQVVPMEDI NLHFTGDMHAIGIAHNLISACIDNHINSGNALGIDVTKITWKRVVDMNDRALRNIVIGLG GKANGYPRQDSFQITVGSEIMAILCLSNSITELKEKIKNIVIGTSVTGKLIKVGDFHIEG AVAALLKDAIKPNLVQTLENTPVFIHGGPFANIAHGCNSILATKMALKLTDYVVTEAGFA ADLGAEKFIDIKCRLGGLKPDCAVIVATVRALEHHGKGDLKAGLENLDKHIDNIKNKYKL PLVVAINKFITDTDEQINMIEKFCNERGAEVSLCEVWAKGGEGGIDLAEKVLKAIDNNKT EFDYFYDINLTIKEKIEKICKEIYGADGVIFAPATKKVFDVIEAEGLNKLPVCMSKTQKS ISDNPALLGKPTGFKVTINDLRLAVGAGFVIAMAGDIIDMPGLPKKPSAEVIDIDENGVI SGLF Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:38:30 2011 Seq name: gi|296154448|gb|ADVK01000030.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00035, whole genome shotgun sequence Length of sequence - 59219 bp Number of predicted genes - 60, with homology - 57 Number of transcription units - 20, operones - 12 average op.length - 4.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) + Prom 110 - 169 11.9 1 1 Op 1 2/0.000 + CDS 202 - 825 896 ## COG0491 Zn-dependent hydrolases, including glyoxylases 2 1 Op 2 1/1.000 + CDS 848 - 1771 368 ## PROTEIN SUPPORTED gi|148988049|ref|ZP_01819512.1| 30S ribosomal protein S9 3 1 Op 3 1/1.000 + CDS 1795 - 2742 466 ## PROTEIN SUPPORTED gi|116517028|ref|YP_816079.1| glucokinase + Prom 2814 - 2873 9.8 4 2 Op 1 16/0.000 + CDS 2939 - 3964 1609 ## COG1879 ABC-type sugar transport system, periplasmic component + Term 3990 - 4022 2.5 + Prom 3971 - 4030 5.7 5 2 Op 2 10/0.000 + CDS 4054 - 5556 187 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 6 2 Op 3 . + CDS 5579 - 6598 1467 ## COG4211 ABC-type glucose/galactose transport system, permease component + Term 6627 - 6665 2.1 - Term 6474 - 6519 1.1 7 3 Op 1 21/0.000 - CDS 6642 - 7187 396 ## COG0477 Permeases of the major facilitator superfamily - Prom 7209 - 7268 9.4 8 3 Op 2 1/1.000 - CDS 7317 - 7868 231 ## COG0477 Permeases of the major facilitator superfamily 9 3 Op 3 1/1.000 - CDS 7877 - 8833 1597 ## COG0039 Malate/lactate dehydrogenases - Prom 8866 - 8925 12.0 - Term 8933 - 8986 12.1 10 4 Op 1 1/1.000 - CDS 9004 - 12570 4916 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit - Prom 12595 - 12654 6.7 11 4 Op 2 21/0.000 - CDS 12662 - 13858 1839 ## COG0282 Acetate kinase 12 4 Op 3 . - CDS 13909 - 14913 1492 ## COG0280 Phosphotransacetylase 13 4 Op 4 . - CDS 14919 - 15026 66 ## - Prom 15267 - 15326 11.2 14 5 Tu 1 . 
+ CDS 15780 - 15986 120 ## FN1174 hypothetical protein - Term 18069 - 18117 -0.8 15 6 Op 1 13/0.000 - CDS 18284 - 18562 335 ## COG1343 Uncharacterized protein predicted to be involved in DNA repair 16 6 Op 2 12/0.000 - CDS 18567 - 19559 824 ## COG1518 Uncharacterized protein predicted to be involved in DNA repair 17 6 Op 3 6/0.000 - CDS 19571 - 20065 577 ## COG1468 RecB family exonuclease 18 6 Op 4 . - CDS 20108 - 22546 2607 ## COG1203 Predicted helicases - Prom 22572 - 22631 7.3 19 7 Op 1 . - CDS 22635 - 23735 909 ## FN1180 hypothetical protein 20 7 Op 2 . - CDS 23748 - 24650 1324 ## COG1857 Uncharacterized protein predicted to be involved in DNA repair 21 7 Op 3 . - CDS 24663 - 26219 1560 ## FN1182 hypothetical protein 22 7 Op 4 . - CDS 26209 - 26961 952 ## FN1183 putative cytoplasmic protein - Prom 26987 - 27046 9.0 23 8 Tu 1 . - CDS 27073 - 27606 201 ## COG4823 Abortive infection bacteriophage resistance protein - Prom 27671 - 27730 2.8 24 9 Tu 1 . - CDS 28060 - 28818 996 ## COG0846 NAD-dependent protein deacetylases, SIR2 family - Prom 28877 - 28936 13.0 + Prom 28874 - 28933 12.1 25 10 Tu 1 . + CDS 28991 - 29092 119 ## + Term 29129 - 29198 12.3 + Prom 29118 - 29177 11.2 26 11 Op 1 3/0.000 + CDS 29204 - 30382 1734 ## COG1473 Metal-dependent amidase/aminoacylase/carboxypeptidase 27 11 Op 2 . + CDS 30396 - 31148 1066 ## COG0834 ABC-type amino acid transport/signal transduction systems, periplasmic component/domain + Term 31161 - 31199 7.2 + Prom 31152 - 31211 9.9 28 12 Op 1 . + CDS 31395 - 31892 645 ## FN1188 hypothetical protein 29 12 Op 2 . + CDS 31902 - 32285 432 ## FN1189 hypothetical protein 30 12 Op 3 . + CDS 32292 - 34499 2822 ## COG2217 Cation transport ATPase 31 12 Op 4 . + CDS 34512 - 35264 729 ## FN1191 hypothetical protein 32 12 Op 5 . + CDS 35294 - 35566 575 ## FN1192 hypothetical protein + Term 35601 - 35638 5.1 + Prom 35587 - 35646 4.5 33 13 Op 1 . + CDS 35698 - 35958 295 ## FN1193 hypothetical protein 34 13 Op 2 . 
+ CDS 35988 - 36065 60 ## + Term 36191 - 36242 -0.5 35 14 Op 1 . - CDS 36191 - 36571 313 ## FN1195 hypothetical protein 36 14 Op 2 . - CDS 36579 - 36824 267 ## FN1195 hypothetical protein - Prom 36930 - 36989 7.8 - Term 36888 - 36946 -0.2 37 15 Op 1 . - CDS 36994 - 37614 613 ## FN1196 hypothetical protein 38 15 Op 2 . - CDS 37634 - 38275 687 ## FN1197 hypothetical protein 39 15 Op 3 . - CDS 38278 - 39540 1319 ## COG1106 Predicted ATPases - Prom 39567 - 39626 11.6 + Prom 39622 - 39681 8.1 40 16 Tu 1 . + CDS 39710 - 40525 844 ## COG0389 Nucleotidyltransferase/DNA polymerase involved in DNA repair 41 17 Tu 1 . - CDS 40589 - 41395 902 ## FN1200 hypothetical protein - Prom 41418 - 41477 15.1 - Term 41465 - 41498 1.4 42 18 Op 1 . - CDS 41509 - 41877 374 ## FN1201 hypothetical protein 43 18 Op 2 1/1.000 - CDS 41884 - 42660 1203 ## COG0171 NAD synthase 44 18 Op 3 1/1.000 - CDS 42672 - 43541 1307 ## COG1161 Predicted GTPases 45 18 Op 4 1/1.000 - CDS 43522 - 44229 804 ## COG0313 Predicted methyltransferases 46 18 Op 5 1/1.000 - CDS 44240 - 45559 1842 ## COG0793 Periplasmic protease 47 18 Op 6 1/1.000 - CDS 45574 - 46350 1013 ## COG1189 Predicted rRNA methylase 48 18 Op 7 1/1.000 - CDS 46352 - 47176 990 ## COG3481 Predicted HD-superfamily hydrolase 49 18 Op 8 1/1.000 - CDS 47160 - 48962 1903 ## COG1154 Deoxyxylulose-5-phosphate synthase 50 18 Op 9 1/1.000 - CDS 48976 - 49275 229 ## PROTEIN SUPPORTED gi|212638657|ref|YP_002315177.1| Predicted RNA-binding protein containing KH domain, possibly ribosomal protein 51 18 Op 10 1/1.000 - CDS 49301 - 51127 2620 ## COG0595 Predicted hydrolase of the metallo-beta-lactamase superfamily 52 18 Op 11 . - CDS 51144 - 53117 2783 ## COG0768 Cell division protein FtsI/penicillin-binding protein 2 53 18 Op 12 . - CDS 53107 - 53529 157 ## FN1212 hypothetical protein 54 18 Op 13 . 
- CDS 53540 - 54523 1209 ## FN1213 hypothetical protein 55 18 Op 14 5/0.000 - CDS 54527 - 55834 865 ## PROTEIN SUPPORTED gi|16079597|ref|NP_390421.1| hypothetical protein BSU25430 56 18 Op 15 1/1.000 - CDS 55821 - 56528 889 ## COG1385 Uncharacterized protein conserved in bacteria 57 18 Op 16 1/1.000 - CDS 56532 - 56963 469 ## COG1959 Predicted transcriptional regulator 58 18 Op 17 1/1.000 - CDS 56960 - 57958 1129 ## COG2255 Holliday junction resolvasome, helicase subunit - Prom 58024 - 58083 9.9 59 19 Tu 1 . - CDS 58173 - 58775 702 ## COG4399 Uncharacterized protein conserved in bacteria - Prom 58899 - 58958 11.5 + Prom 58897 - 58956 12.4 60 20 Tu 1 . + CDS 58977 - 59217 334 ## FN1219 hypothetical protein Predicted protein(s) >gi|296154448|gb|ADVK01000030.1| GENE 1 202 - 825 896 207 aa, chain + ## HITS:1 COG:FN1162 KEGG:ns NR:ns ## COG: FN1162 COG0491 # Protein_GI_number: 19704497 # Func_class: R General function prediction only # Function: Zn-dependent hydrolases, including glyoxylases # Organism: Fusobacterium nucleatum # 1 207 1 207 207 397 97.0 1e-110 MRVKCFHLGAYGTNCFLAYDENNIAYFFDCGGRNLEKVYEFISEHNLDLKYIVLTHGHGD HIEGLNDLASHYPEAKVYIGEEDKDFLYNSELSLSDAIFGEFFKFKGEIHTVKEGDMVGD FKVIDTPGHTIGSKSFYYENNKILISGDTLFRRSYGRYDLPTGSLEMLCHSLKKLSNLPD ETVVYNGHTDNTTIGEEKRFLERVGIL >gi|296154448|gb|ADVK01000030.1| GENE 2 848 - 1771 368 307 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|148988049|ref|ZP_01819512.1| 30S ribosomal protein S9 [Streptococcus pneumoniae SP6-BS73] # 4 293 1 295 306 146 34 3e-34 MEKIYDIIIVGAGPAGLTAGIYAGRGNLSTLILEKEGIGSMIMTHQIDNYPGSHIGASGK EIYDTMKKQALDFGCEIKPATVLGFDPYDEIKVVKTDAGNFKTKYIIIATGLGKIGAKKV KGENKFLGAGVSYCATCDGAFTKGRVVSLVGKGDEIIEEALFLTRYAKEVNIFLTSDDLD CNEELKEAILSKENVKITKKVKLLEIKGEEFVTELELEIAGNKETISTDFVFLYLGTKNN IELYGEFVNLSEAGYIITDETMKTRTDKMYAIGDIREKDVRQVATATNDGVIATTFILKE ILKSKKK >gi|296154448|gb|ADVK01000030.1| GENE 3 1795 - 2742 466 315 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|116517028|ref|YP_816079.1| glucokinase [Streptococcus pneumoniae D39] # 5 315 
6 318 319 184 33 1e-45 MKHYIGIDLGGTNTKIGVVDSEGNLINSKIIKTHSHQNVDKTLERIWETAKKLILEKEIP LFSVVGIGIGIPGPVKNQSTVGFFANFDWEKNLNLKEKMEKLSGIETRIENDANIIAQGE AIFGAAKGKKSSITIAIGTGIGGGIFYNGNLISGMSGVGGEIGHMKVVKDGKTCGCGQNG CFEAYASASSLVKEAKERLKLNEDNLLFKEINGDLDELEAKNIFDAARKGDQFSKDLIEY ESDYLALGIGNLLNIINPECIVISGGISLAGDEILLPIKEKLKKYTMPPALENLEIKIGT LGNEAGVKGAVALFI >gi|296154448|gb|ADVK01000030.1| GENE 4 2939 - 3964 1609 341 aa, chain + ## HITS:1 COG:FN1165 KEGG:ns NR:ns ## COG: FN1165 COG1879 # Protein_GI_number: 19704500 # Func_class: G Carbohydrate transport and metabolism # Function: ABC-type sugar transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 341 1 341 341 601 100.0 1e-172 MKKFGMLLGSIILASALVACGEKKEEAKTDAPATEKLSIGLTAYKFDDNFIALFRKAFEA EAAAKADIVAVTAIDSQNSVATEKEQIEAVLEKGVKAFAINLVDASAADGIINLLKEKDV PVVFYNRKPSDEAIASYDKLFYVGIDPNAQGIAQGELIEKLWKENPDLDLNKDGVIQYVM LTGEPGHPDAVARTKYSISTLNDHGVKTEELHQDTAMWDTATAKDKMDAWLSGPNGSKIE VVICNNDGMALGAVESMKAAGKVLPTFGVDALPEALVKIEAGEMAGTVLNDAKGQASATF NMVVNLAQGKEATEGTDLKLDNKIILIPSIGIDKSNVADFK >gi|296154448|gb|ADVK01000030.1| GENE 5 4054 - 5556 187 500 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 273 478 17 217 245 76 26 3e-13 MENLKYVLEMENISKEFPGVKALDDVQLKLKPGTVHALMGENGAGKSTLMKCLFGIYEKN SGKILLDGVEVNFKSTKEALENGVSMVHQELNQVLQRNVLDNIWLGRYPMKGFFVDEKKM YNDTINIFKDLDIKVDPRKKVADLPIAERQMIEIAKAVSYKSKVIVMDEPTSSLTEKEVD HLFRIIKKLKESGVGIIYISHKMEEIKMISDEITILRDGKWISTNDVSKISTEQIISMMV GRDLTERFPKKDNKAKEMILEVKNLTALNQPSIQDVSFELYKGEILGIAGLVGSKRTEIV ETIFGMRPKKHGEIILNGKTVKNKSPEDAIKNGFALVTEERRSTGIFSMLDVAFNSVISN LDRYKNKFRLLKNKDIEKDTKWIVDSMRVKTPSYSTKIGSLSGGNQQKVIIGRWLLTEPE VLMLDEPTRGIDVLAKYEIYQLMIDLAKKDKGIIMISSEMPELLGVTDRILVMSNGRVAG IVKTSETNQEEIMELSAKYL >gi|296154448|gb|ADVK01000030.1| GENE 6 5579 - 6598 1467 339 aa, chain + ## HITS:1 COG:FN1167 KEGG:ns NR:ns ## COG: FN1167 COG4211 # Protein_GI_number: 19704502 # Func_class: G Carbohydrate transport and metabolism # Function: 
ABC-type glucose/galactose transport system, permease component # Organism: Fusobacterium nucleatum # 1 339 1 339 339 548 99.0 1e-156 MIARTNEGKIDYKKIIIESGLYLVLFCMLIAIIIKEPTFLSLRNFKNILTQSSVRTIIAL GVAGLIVTQGTDLSAGRQVGLSAVISGTLLQSMTNVNKAFPKLGEFSIFTTILIVVVVGV VIASINGVVVATLNVHPFIATMGTMTIVYGINSLYYDKAGAAPISGFVEKYSKFAQGYIK IGSYTIPYLIIYAAIATLIMWTLWNKTKFGKNVFAVGGNPEAAKVSGVNVVLTLMGIYAL SGAYYAFGGFLEAGRIGSATNNLGFMYEMDAIAACVIGGVSFYGGVGRISGVITGVIILT IINYGLTYTGVSPYWQYIIKGIIIVTAVAFDSIKYAKKK >gi|296154448|gb|ADVK01000030.1| GENE 7 6642 - 7187 396 181 aa, chain - ## HITS:1 COG:L183932 KEGG:ns NR:ns ## COG: L183932 COG0477 # Protein_GI_number: 15673152 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Lactococcus lactis # 71 159 1 89 124 77 51.0 9e-15 MFLNFFIANNDEIINPGILIKKYQISEKLFGFSVACYGLGSVVAGIFIYYNKKFKLLKKL KLLFILNSSLMCLGGFLSIMLFKYNHYIYFTVFIFFQFLIGMITTFVNVPLISSFQKNVE VEYQSRFFSILSFFSNGLGPLGVLYAGYLSSYIGADVTYIIDNVAIIIIVFLVFKNIERD C >gi|296154448|gb|ADVK01000030.1| GENE 8 7317 - 7868 231 183 aa, chain - ## HITS:1 COG:FN1168 KEGG:ns NR:ns ## COG: FN1168 COG0477 # Protein_GI_number: 19704503 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; P Inorganic ion transport and metabolism; R General function prediction only # Function: Permeases of the major facilitator superfamily # Organism: Fusobacterium nucleatum # 5 183 1 179 302 226 88.0 2e-59 MGVKVQSKESNIKLLLLGRSVSLFGNTIYLIVLPLYILNITQNLKFTGIFFAAVNLPTTI ISIFIGTIIEKFNKKNIILISDFLTSILYFILFLYFKNSNSLIFLFLISLIINIISKFFE IASKVLFSEINTPETLEKYNGLQSFLENTVMIIGPVIGTYLFATFSFNFVLIIISLAYFL SFL >gi|296154448|gb|ADVK01000030.1| GENE 9 7877 - 8833 1597 318 aa, chain - ## HITS:1 COG:FN1169 KEGG:ns NR:ns ## COG: FN1169 COG0039 # Protein_GI_number: 19704504 # Func_class: C Energy production and conversion # Function: 
Malate/lactate dehydrogenases # Organism: Fusobacterium nucleatum # 1 318 1 318 318 644 99.0 0 MLQTRKVGIVGIGHVGSHCALSMLLQGVCDEMVLMDIIPEKAKAHAIDCMDTISFLPHRA IIRDGGIQELSKMDVIVISVGSLTKNEQRLEELKGSLEAIKSFVPDVVKAGFNGIFVTIT NPVDIVTYFVRELSGFPKNRVIGTGTGLDSARLKRILSEVTNIDSQVIQAYMLGEHGDTQ VANFSSATIQGVPFLDYMKTHPEQFKGVELSVLEKQVVRTAWDIISGKNCTEFGIGCTCS NLVKAIFHNERRVLPCSAYLDGEYGHSGFYTGVPAIIGSNGVEEILELPLDERERKGFED ACAVMKKYIEVGKSYKIV >gi|296154448|gb|ADVK01000030.1| GENE 10 9004 - 12570 4916 1188 aa, chain - ## HITS:1 COG:FN1170_1 KEGG:ns NR:ns ## COG: FN1170_1 COG0674 # Protein_GI_number: 19704505 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Fusobacterium nucleatum # 1 410 1 410 410 830 99.0 0 MAKKMQTMDGNQAAAYASYAFTEVAGIYPITPSSPMAEYTDEWAAKGMKNIFGVPVKLVE MQSEGGAAGTVHGSLQAGALTTTYTASQGLLLKIPNMYKIAGELLPGVIHVSARSLSAQA LSIFGDHQDIYAARQTGFAMLATNSVQEVMDLAGVAHLSALKSRVPFLHFFDGFRTSHEI QKVEVMDYEDLKKLVDWKALEGFRKRALNPEHPVTRGTAQNDDIYFQAREVQNKFYDAVP DIVADYMKEISKITGREYKPFNYYGAPDAERVIIAMGSVCEAAQEVIDYLVEKGEKVGLI SVHLFRPFSAKYFFDVLPKTAKRISVLDRTKEPGSLGEPLLLDIKALFYNKENAPLIVGG RYGLSSKDTTPAQIKAVFDNLAKDTPKDAFTVGIVDDVTHTSLEVGPAMALADPTTKACL FYGLGADGTVGANKNSIKIIGDKTDLYAQGYFAYDSKKSGGVTRSHLRFGKKPIRSTYLV SRPTFVACSVPAYLNQYDMTSGLKEGGKFLLNCVWTKEEALENIPNNVKRDLAKNKARLF IINATALAQEIGLGQRTNTIMQAAFFKLAEIIPFEEAQQYMKDYAKKSYAKKGDEIVQLN YNAIDRGANDIIEIEVDPAWANLEVTPLNEPKETTGCAYCQPDTEFVKKIVRPVNAIKGY DLPVSAFLGYEDGTFENGTSAFEKRGVAVDVPIWNVDKCIQCNQCSYVCPHAAIRPFLIN EDELKAAPNSFATKKATGKGLDELVYRIQVSALDCVGCGSCANVCPANALDMRPIAESLE THEDINTNYLYNNVEYRSDLMPLDTVKGSQFSQPLFEFHGACPGCGETPYIKLITQLYGD RMMVANATGCSSIYSGSAPATPYTTNKNGEGPSWGSSLFEDNAEYGFGMHVGVEALRFRI QHTMEENMDKVDEDIATLFKDWIANRQYSVRTREVRDILVPKLEALGTDFAKEILDLKQY LVKKSQWIIGGDGWAYDIGYGGLDHVLASNEDINVLVMDTEVYSNTGGQASKATPTGSVA KFAAAGKPVKKKDLAAIAMSYGHIYVAQVSMGANQQQFIKAVKEAEAHQGPSIIIAYSPC INHGIKKGMSKSQTEMKLATECGYWPIFRYNPSLEKIGKNPLQIDSKEPKWEKYEEYLSG 
EVRYQTLAKSNPEEAKDLFEKNKKDAQKRWRQYKRMAALDYSEEKEAE >gi|296154448|gb|ADVK01000030.1| GENE 11 12662 - 13858 1839 398 aa, chain - ## HITS:1 COG:FN1171 KEGG:ns NR:ns ## COG: FN1171 COG0282 # Protein_GI_number: 19704506 # Func_class: C Energy production and conversion # Function: Acetate kinase # Organism: Fusobacterium nucleatum # 1 398 1 398 398 789 99.0 0 MKILVINCGSSSLKYQLINPETEEVFAKGLCERIGIDGSKLEYEVVTKDFEKKLETPMPS HKEALELVISHLTDKEIGVIASVDEVDAIGHRVVHGGEEFAQSVLINDAVLKAIEANNDL APLHNPANLMGIRTCMELMPGKKNVAVFDTAFHQTMKPEAFMYPLPYEDYKELKVRKYGF HGTSHLYVSGIMREIMGNPEHSKIIVCHLGNGASITAVKDGKSVDTSMGLTPLQGLMMGT RCGDIDPAAVLFVKNKRGLTDAQMDDRMNKKSGILGLFGKSSDCRDLENAVVEGDERAIL AESVSMHRLRSYIGAYAAIMGGVDAICFTGGIGENSSMTREKALEGLEFLGVELDKEINS VRKKGNVKLSKDSSKVLIYKIPTNEELVIARDTFRLAK >gi|296154448|gb|ADVK01000030.1| GENE 12 13909 - 14913 1492 334 aa, chain - ## HITS:1 COG:FN1172 KEGG:ns NR:ns ## COG: FN1172 COG0280 # Protein_GI_number: 19704507 # Func_class: C Energy production and conversion # Function: Phosphotransacetylase # Organism: Fusobacterium nucleatum # 1 334 4 337 337 631 99.0 0 MSFLGQVRKKALQANRRIVLPESSDERIIRAASQILKEGLAQVVLVGNQEAIMHSAKAYE VSLSGAKIVDPYNFERLNDYINKLVELRAKKGMTPEEAKKILLNDPTFFGAMLVRMGDAD GMVSGSASPTANVLRAAIQVIGTQPGVKTVSSVFIMELSQFKDLFGSILVFGDCSVIPFP TSEQLADIATSAAETAIKIAGINPRVALMTFSTKGSAKHECVDRVIEAGHILRERKVQFR FDAELQADAALVKSIGEIKAPLSDVSGNANVLIFPTLSAGNIGYKLVQRLAGANAYGPII QGLNAPVNDLSRGCSVEDIVVLTAITSAQACIDC >gi|296154448|gb|ADVK01000030.1| GENE 13 14919 - 15026 66 35 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNFTIKLLHLIVIGVNIYLINLKDKFKVLSTKNRE >gi|296154448|gb|ADVK01000030.1| GENE 14 15780 - 15986 120 68 aa, chain + ## HITS:1 COG:no KEGG:FN1174 NR:ns ## KEGG: FN1174 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 68 1 68 68 97 98.0 2e-19 MTGIKYDSKDNMNPLNFYRAVKENRLPLSEKNINDFTNIKLKVSPKLINLLQESSIFYNF SPKKRNTN >gi|296154448|gb|ADVK01000030.1| GENE 15 18284 - 18562 335 92 aa, chain - ## HITS:1 COG:FN1176 KEGG:ns NR:ns ## COG: FN1176 
COG1343 # Protein_GI_number: 19704511 # Func_class: L Replication, recombination and repair # Function: Uncharacterized protein predicted to be involved in DNA repair # Organism: Fusobacterium nucleatum # 1 92 15 106 106 169 100.0 1e-42 MYVVVVYDISLDEKGSRNWRKIFGICKRYLHHIQNSVFEGELSEVDIQRLKYEVSKYIRD DLDSFIIFKSRNERWMEKEMLGLQEDKTDNFL >gi|296154448|gb|ADVK01000030.1| GENE 16 18567 - 19559 824 330 aa, chain - ## HITS:1 COG:FN1177 KEGG:ns NR:ns ## COG: FN1177 COG1518 # Protein_GI_number: 19704512 # Func_class: L Replication, recombination and repair # Function: Uncharacterized protein predicted to be involved in DNA repair # Organism: Fusobacterium nucleatum # 1 330 9 338 338 597 99.0 1e-170 MKRSFFLYSNGTLKRKDNTITFINEKDEKRDIPIEMVDDFYVMSEMNFNTKFINYISQFG IPIHFFNYYTFYTGSFYPREMNISGQLLVKQVEHYTNEQKRVEIAREFIEGASFNIYRNL RYYNGRGKDLKLYMDQIEELRKRLKEVNNVEELMGYEGNIRKIYYEAWNIIVNQEIDFEK RVKNPPDNMINSLISFVNTLFYTKVLGEIYKTQLNPTVSYLHQPSTRRFSLSLDISEVFK PLIVDRLIFSLLNKNQITEKSFVKDFEYLRLKEDASKLIVQEFEDRLKQIITHKDLNRKI SYQYLVRLECYKLIKHLLGEKKYKSFQMWW >gi|296154448|gb|ADVK01000030.1| GENE 17 19571 - 20065 577 164 aa, chain - ## HITS:1 COG:FN1178 KEGG:ns NR:ns ## COG: FN1178 COG1468 # Protein_GI_number: 19704513 # Func_class: L Replication, recombination and repair # Function: RecB family exonuclease # Organism: Fusobacterium nucleatum # 1 164 1 164 164 269 96.0 2e-72 MDKNITGMMVYYYEVCKRKLWYFVNEIQLEKNNSNVILGKLLEENTYTRDEKKINIDGVI NIDFIRSKKVLHEIKKSNSIEPASLLQVQYYLYYLEKKGLIGLKGILDYPLLKQTVEVNL TDEDRENLDNIIIGIKEILRKESPPALEKKGICKKCAYFDLCFV >gi|296154448|gb|ADVK01000030.1| GENE 18 20108 - 22546 2607 812 aa, chain - ## HITS:1 COG:FN1179 KEGG:ns NR:ns ## COG: FN1179 COG1203 # Protein_GI_number: 19704514 # Func_class: R General function prediction only # Function: Predicted helicases # Organism: Fusobacterium nucleatum # 1 812 1 812 812 1333 96.0 0 MLDEELLKKYKAKSDKSIFEHNQDLKKQKDVLINLGYLNDKEKIELLSYAIDYHDSGKIN LKFQRRIEENIKRINGEKYNSKYLKFDKENEIEHNILSAFLINSQDFNSEKEYLAVLYSV 
IFHHRYSDAISTINNQIFPNIEEILEDNLPNGVVTYERDIPLELNFQDLNTVENIKLLGL LMKCDHSASGGYEIEYPNDFLEDALNNLLNEFKEKDKSADWNDMQKFCKENSDKNIIAIA DTGMGKTEGGFLWGGNNKIFFVLPLRTAINAMYKRVKLFVPKNKILEEKVGLLHSNSLEY YLNNKKELVIDDKDEKEMDILEYNKRGKHLSLPVTICTPDQIFNFILKYKGYESKLATLS YSKIILDEMQMYDASLLAAVIFGITKIMEMGGKIAIVTATFPPIIEYFLNKYLIKNNKNV IKDLDKPQEVFEESIFIKKKFTNNKKIRHNIVLIDDEIGIEQILWQFRKNKKENKKSNKI LVICNTIKKAQEIYLKLKEENDLKDKINMLHSNFIREDRESKEKDILDFGKTEFDGEGIW ISTSLVEASLDIDFDYLFTELQDLNSLFQRFGRCNRKGKKSVDEANCFIYLKIEDKYLKE KGSKYGFIDKDIYENSKKGLENYCEVVSKNEIENSKDYNELFKNYSKQINEGDKIKLIEE NLSFENLKESNFVDEFEKAYEKYQRILNSDENSQDALKLRDIQSVTVIPYNIYEKNEENI KELVKRIEDKNLGLEERQKAKTKLLKKTLSIQYYQLSKYISEILKGKADANKYKSESINK FEKITVMEADYDKELGFRAKDFKDGIPTYEFI >gi|296154448|gb|ADVK01000030.1| GENE 19 22635 - 23735 909 366 aa, chain - ## HITS:1 COG:no KEGG:FN1180 NR:ns ## KEGG: FN1180 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 124 366 1 243 243 330 95.0 8e-89 MEALRIILKQSSANYRKAGTVDNKMTYPLPIPSTVIGALHNICGYTEYHSMDISIQGKFT SLSRRVYTDYCFLNSALDDRGNLVKVVDPDTFSGAFVKVASAKKSQGNSFKDRITIQVHN EELLQEYCNLKEKSKEIEELKNSEYKKKLEEFKLLKKEITDKKKKEDKKSETFKQLSEEE KKIKLEEEKYKEDFKNFEYESYTKPYSYFQNLVTSLKNYEVLNNIFLILHIKADKQTLKD IEENIYNLQSLGRSEDFVEVVECKMVELQEFLRNIRVSKFSMYLKNEDVSDKKIIPLAVD QDHQAGGTKYYLDKNYKLEKNRRIFKKVPVVYSNFIGAKNSSENVKLDYLEILSQDKKQE ILVNFL >gi|296154448|gb|ADVK01000030.1| GENE 20 23748 - 24650 1324 300 aa, chain - ## HITS:1 COG:FN1181 KEGG:ns NR:ns ## COG: FN1181 COG1857 # Protein_GI_number: 19704516 # Func_class: L Replication, recombination and repair # Function: Uncharacterized protein predicted to be involved in DNA repair # Organism: Fusobacterium nucleatum # 1 300 1 300 300 530 98.0 1e-150 MKKNALTITVVANMTSNYSEGLGNISSVQKIYRDRNVYAIRSRESLKNAIMVQSGMYEDL ETEANGATQKKVDENLNATNCRALEGGYMNTKENTYVRNSSFYLTDAISTESFINETRFH NNLYLATNYANTHTDKNGKSLNVQKDASEVGLMPYQYEYEKSLKVYSLTIDLEKIGKDPN FPDKEANNNEKFERVKSLLEAVENLSLIVKGNLDNAEPVFAIGGLSLRKTHYFENVVRVE 
QGALILGEALKEKKEDGFSCALLKGDIFTNEVEIIKELQPISMREFFKSLIEDVKNYYGA >gi|296154448|gb|ADVK01000030.1| GENE 21 24663 - 26219 1560 518 aa, chain - ## HITS:1 COG:no KEGG:FN1182 NR:ns ## KEGG: FN1182 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 518 1 517 517 748 90.0 0 MKYDIDKNKYGFDTAVSASDWKYSAAVTGLIYYFKELEKKYEIKKITIDEITDSYLLYNK EDINEENYLDFIERFYSEEALAHKKIENQLNHTKEFTPEIIKSIKENMSANTVLKEVFSK VKFDGTNKDEVLKLLDKERDYIIKKSFENKDNLYANYCQVSNGKSKLFTSSEKSPCRLKG YYFDPNRKSKATGYNFASSSVDYFDDEIFDFIPFAFTGSPFETIFLNDNLDLEILENMNY KLREYFFEEKEKEIEKIKNFKQEKAIKEKKNEETEGNQNSVPLKKLFLNILQKKVDYIKY GMEIIYKNRDKEYFETWYLRNESIKVFKEIEDFSKLDIRIKITDKYYFNLLDEVFSAILN LSLLTNSILYLLKDRESFIKIDATRENLSKLFKYNYAINELIKVNQIIRNGGKEMDENLK KSIKACSIAVVKKFIKENSLNKLASYRQKLLSSVVAKNHKRILDVLTQLSVYSGVYFSFA FDYIENQTQNEDIIHYFILELDQSRLESKKNKENEDKE >gi|296154448|gb|ADVK01000030.1| GENE 22 26209 - 26961 952 250 aa, chain - ## HITS:1 COG:no KEGG:FN1183 NR:ns ## KEGG: FN1183 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 250 1 250 250 434 94.0 1e-120 MRFILSFELDTVRLPIEIRRTVISFFKKSLTEAHNSKYYPEFFTGTQIKDYSFSVIFPLD KYFGEEIYLKKPEMKVVVSCSEKNNIGFLLVNVFLSQRNKKFPLPKDTHMILKDVRIIEE KIIRGEEAIFQTTIGGGVVVREHNKEENKDIYYSVGNERFEEVLNWLMKERFKRLGYPED IFKNFSCELLDGRKIVVKHFDLKFPVTTGRFKVKAPKILLEEIYRTGMGSRLSQGFGLLE YLGGEIKDEV >gi|296154448|gb|ADVK01000030.1| GENE 23 27073 - 27606 201 177 aa, chain - ## HITS:1 COG:FN1184 KEGG:ns NR:ns ## COG: FN1184 COG4823 # Protein_GI_number: 19704519 # Func_class: V Defense mechanisms # Function: Abortive infection bacteriophage resistance protein # Organism: Fusobacterium nucleatum # 1 168 1 168 299 281 94.0 3e-76 MSCSESHLSYEEQLNKFISRGMLVKNKAKALERLKHISYYKIKQFSTFFMDNSGNYKQNT SFEAVIQNFYFDKNLRMEFLKCSEKIELSIKNKIAYLLGVKYGAFGYLNFSSWCDRSRPK QEIQNEELKFKKKIQKKMKLFSDNSIIKDFVINNPTETYLSIWRLSEVYLWRSIISF >gi|296154448|gb|ADVK01000030.1| GENE 24 28060 - 28818 996 252 aa, chain - ## HITS:1 COG:FN1185 KEGG:ns NR:ns ## COG: FN1185 
COG0846 # Protein_GI_number: 19704520 # Func_class: K Transcription # Function: NAD-dependent protein deacetylases, SIR2 family # Organism: Fusobacterium nucleatum # 1 252 1 252 252 493 99.0 1e-139 MDSKRDEKILELVKILKNTKYLVFFGGAGTSTDSGVKDFRGKDGLYKTLYKDKYRPEEVL SSDFFYSHRDIFMEYVEKELNIKGLKPNKGHMALVELEKIGILKAVITQNIDDLHQVSGN KNVLELHGSLKRWYCLSCGKTADRNFSCECGGVVRPDVTLYGENLNQSVVNEAIYQLEQA DTLIVAGTSLTVYPAAYYLRYFRGKNLIIINDMDTQYDGEASLVIKDNFSYVMDRVVEEL KKIQFGKTLKNI >gi|296154448|gb|ADVK01000030.1| GENE 25 28991 - 29092 119 33 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MFLSKTRIRKNIFTIKLNNLSNCMLRTVYAFVF >gi|296154448|gb|ADVK01000030.1| GENE 26 29204 - 30382 1734 392 aa, chain + ## HITS:1 COG:FN1186 KEGG:ns NR:ns ## COG: FN1186 COG1473 # Protein_GI_number: 19704521 # Func_class: R General function prediction only # Function: Metal-dependent amidase/aminoacylase/carboxypeptidase # Organism: Fusobacterium nucleatum # 34 392 1 359 359 704 99.0 0 MKSRKEILSGLFDKYRNELTNLNEYLYNNPELGLQEYKACAAHTDILKKYGFKVEKGFAN FETAYKASYKKQNGPRIAILAEYDALPKIGHGCGHNAYGVTSIAGGILIKELIQKLDLQG EILVIGTPAEETNGAKVDMAKLGIFNDIDVAMSVHPSGEAHIRSGKSHAMEALQFTFKGK TAHAAASPHEGINALDGVLNLFNSINALRQQILSSARIHGIISNGGEAANIIPDLAIANF YVRAETLEYLKELVEKVKNCAKGAALASDTELEITNYETSFANLVTNKKLMKLYEKNLRT LGVTDIRDKEGLGSTDMGDVSHCCPTIHPYFPLTTRHLVGHTIEFATATIQEEAYKGMKE ACLAMALSCLDIFEKPEILKEIKEEFYQTFKK >gi|296154448|gb|ADVK01000030.1| GENE 27 30396 - 31148 1066 250 aa, chain + ## HITS:1 COG:FN1187 KEGG:ns NR:ns ## COG: FN1187 COG0834 # Protein_GI_number: 19704522 # Func_class: E Amino acid transport and metabolism; T Signal transduction mechanisms # Function: ABC-type amino acid transport/signal transduction systems, periplasmic component/domain # Organism: Fusobacterium nucleatum # 12 250 1 239 239 427 99.0 1e-119 MKKIITILMLIMSTLSFAAKKLYVGTNAEFKPYEYLENNKMVGFDIELMELLGKELGYEI KWQDMSFDGLLPALQMKKIDAVIAGMSATPEREKAVSFSIPYIFFEGGHSVIVNSKSTFK KKEELKEKTIGVQLGTIQEQFAKDNGSVPKLYNNFTEALLDLQNQKIDAVIIAEVSANEY 
LKTMKGVKKIDTIKDKLPNASIAFRKADSKLAKEFSDTILKLKDSPGYAKLVKKYFPEHY DNFIANQKKK >gi|296154448|gb|ADVK01000030.1| GENE 28 31395 - 31892 645 165 aa, chain + ## HITS:1 COG:no KEGG:FN1188 NR:ns ## KEGG: FN1188 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 165 1 165 165 275 98.0 4e-73 MKDKTLLPNFYGIFEVKSLTKNRIRIEINKLKNNKEEIDELKENLKKISVIKNFKIIQSL GSLTVEFDDSQIDAQFMIGIILKLLNLDEELLKDRKGRIKNTFSNLGKLADITVYNKTKG LFDAKTLAGTMLLIYGIKKFKREMFLPSGATLIWWAYRLLSKKGV >gi|296154448|gb|ADVK01000030.1| GENE 29 31902 - 32285 432 127 aa, chain + ## HITS:1 COG:no KEGG:FN1189 NR:ns ## KEGG: FN1189 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 12 127 1 116 116 200 100.0 2e-50 MFKNILKQTYLMFNKVKVVHSIPGRIRLLIPSLDKFPEQMKKHEHYITTIIKLKNGIKSV EFSYLTSKILIEYDKMKLKEQDIVDWLNKIWKIIVDNEDVYQGMSVDDVEKNVKRFFEML KSELEGR >gi|296154448|gb|ADVK01000030.1| GENE 30 32292 - 34499 2822 735 aa, chain + ## HITS:1 COG:FN1190 KEGG:ns NR:ns ## COG: FN1190 COG2217 # Protein_GI_number: 19704525 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Fusobacterium nucleatum # 1 735 1 735 735 1284 99.0 0 MKNDNLLACEIVHRLRGRIRIKSKAFKYIGNPLKSEIEKQLLQVRYIENVEISLVTGTIL IYFEDVSLSDQNLISLIQNTLNSHIFEICKNEKVEKSSKYIIERKLQEESPKEIMKKIVT TAGLLGYNLFFKSKSTVALTGIRRFLNYNTLSTLALAMPVLKNGINSLIKNKRPNADTLS SSAIISSILLGKESAALTIMFLEEVSELLTVYTMEKTRGAIKDMLSVGENYVWKEISEDN VKRVPIEEIQKDDIIVVQTGEKISVDGKIIRGEALIDQSSITGEYMPIKKSEGEEVYAGT IIKNGNISIIAEKVGDDRTVSRIIKLVEDANSNKADIQNYADTFSAQLIPLNFILAGIVY ASTRSITKAMSMLVIDYSCGIRLSTAVAFSAAINTAAKNGILVKGSNFIEELSKSETVIF DKTGTITEGKPKVQSIEVFDNNMSENEMIGLAGAAEEQSSHPLATAIMSEIKDRGIEIPK HNKIKTVVSRGVETKIGKGKEAKIIRVGSKKYMLENNIDLTLATEAERGIISRSEIGLYV AQDEKIIGLIGVSDPPRENIKKAINRLRNYGVDDIVLLTGDLRQQAETIASRMSIDRYES ELLPEDKAKNILKFQSKGSNVIMIGDGVNDAPALSYANVGVALGSTRTDVAMEAADITIT QDNPLLVPGVIGLSKNTVKTIKENFAMVIGLNTFALVLGATGILAPIYASVLHNSTTILV VMNSLKLLKYDIKTN >gi|296154448|gb|ADVK01000030.1| GENE 31 34512 - 
35264 729 250 aa, chain + ## HITS:1 COG:no KEGG:FN1191 NR:ns ## KEGG: FN1191 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 250 1 243 243 422 94.0 1e-117 MKKLTITMLHILPNRVRLKLSAPIKDIKSFYSNIKNNLKNLEMKYNRQLKTVTLNFSPDE IFLQEIIYRTAISFSIENGLLPVKLIEENPYKSISPLSMYALASILVSSLNGLINKKDTK LQNSMNIFSMGLTVGSVLEHAYGEVKKRGMFDIEILPAMYLLKSFFTGQKLSSVLIMWLT TFGRHLTVSHNMTKLVKVFRMKTEKGYQYTATIVDDNSIQNFSDFIHHVFFRKHSDYCQF NEKYVTLSKN >gi|296154448|gb|ADVK01000030.1| GENE 32 35294 - 35566 575 90 aa, chain + ## HITS:1 COG:no KEGG:FN1192 NR:ns ## KEGG: FN1192 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 90 1 90 90 128 100.0 1e-28 MFGNGVTKNHLVGAAVGVGVAAVAFYLYKKNQAKVDEFLRKQGINIKTSSCSSLENLDIE GLTEMKEHIEDLIAEKSAAESIEEVIVEAE >gi|296154448|gb|ADVK01000030.1| GENE 33 35698 - 35958 295 86 aa, chain + ## HITS:1 COG:no KEGG:FN1193 NR:ns ## KEGG: FN1193 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 86 4 89 89 155 98.0 7e-37 MKDIDVVYKGQTLTLTRFWGNNKLCLWIKNSNQISIPKMEFVGGYPNEYCIFLENLSLEE LKEIKAINGETLNLEEIINERKNLKD >gi|296154448|gb|ADVK01000030.1| GENE 34 35988 - 36065 60 25 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MERIIMHYDMNAFYFYKFFSIFKLF >gi|296154448|gb|ADVK01000030.1| GENE 35 36191 - 36571 313 126 aa, chain - ## HITS:1 COG:no KEGG:FN1195 NR:ns ## KEGG: FN1195 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 126 86 211 211 223 96.0 2e-57 MIEFLYQLQNGINESTDLGKVEDAYIKILLQLDKKITGNLAKKFYKGIRCGIIHQSQTKE DTAITYELESIIERNGGYYLCNPLTLLKKLEGKYKDYWKNVSEAKYNEEDGKKLVTKYKQ ILSHIS >gi|296154448|gb|ADVK01000030.1| GENE 36 36579 - 36824 267 81 aa, chain - ## HITS:1 COG:no KEGG:FN1195 NR:ns ## KEGG: FN1195 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 77 1 77 211 137 97.0 1e-31 MFKNDRTNFSKDTTTEEAKIILKEFETSLDKDHSNSVKLYQKICKKMRERVTQNYFKPIK QFSNFIASDPNKESYGYSYGG 
>gi|296154448|gb|ADVK01000030.1| GENE 37 36994 - 37614 613 206 aa, chain - ## HITS:1 COG:no KEGG:FN1196 NR:ns ## KEGG: FN1196 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 167 11 177 177 289 99.0 4e-77 MFNNLKILVNHLKEIGWIIDIFRFQYNGIRCIVLIKLFLENERKKNPYALAKIQFIKEND INDCIEIEADLYNLYFNEVRDFREFFNIEYSSHIGDIIKEFKEYFSKNIPIKPNDNINQQ ERILLYQSLNHEEEEDKIYCYRLIRLGIKNGVQQKRSIYRDNKARLLRPSLYEKVKDDKT ITFAFTDEECIEKNDEEILNNFYNNK >gi|296154448|gb|ADVK01000030.1| GENE 38 37634 - 38275 687 213 aa, chain - ## HITS:1 COG:no KEGG:FN1197 NR:ns ## KEGG: FN1197 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 213 1 213 213 315 90.0 8e-85 MKKTNRLSKKRNERKKILLKSGAYLIITDTEKTEKNYFEGIKNIIPDNLKNDLQIKIYSN KALSKIIDFAAEERNKDERFRDIWLVFDRDEVKNFDELIEEAKESKMNVGWSNPCFEIWL MSYFQLPKNINVSQKCCETFEKIFKENTSKKYKKSEEKIYNILCENGDENRAIERAREKY YQVRKEYSQPSKMIRCTTVYKLVEELKKKIDGE >gi|296154448|gb|ADVK01000030.1| GENE 39 38278 - 39540 1319 420 aa, chain - ## HITS:1 COG:FN1198 KEGG:ns NR:ns ## COG: FN1198 COG1106 # Protein_GI_number: 19704533 # Func_class: R General function prediction only # Function: Predicted ATPases # Organism: Fusobacterium nucleatum # 1 420 1 420 420 730 98.0 0 MLLQFYFSNYRSFEGEGILDMRASGSNELSSHIRNTLNEKVLPVTAIYGANASGKSSVFE AFQFMALCVLESLSFSDDNKKNPYKLKVDSFKFSESREKPSEFEINYIDKKGKKELYYNY GFKIDNSGILEEYLASNTKTGVKRNEDYTYIFKRERNQKLYLDSSIEKFRENLEISLKEK TLLVSLGAKLNIDEFIRVRTWFINTEVINFSNSLYGAFLENILPNNIIESEEVRKNLVSF INSFDDSIIDIEVEKISAIDENDKDNYRVFTIHKSDKGTSTARISMNEESSGTKKMFSLY QTLLDVLEKGGVFFADELDIKLHPLLMRNILLTFTDKEKNSNNAQLIFTTHNTIYMDMDL LRRDEIWFVEKDNGVSNLYSLDDITNEKGEKIRKDSNYEKHYLLGNYGAIPNLKNLLGRE >gi|296154448|gb|ADVK01000030.1| GENE 40 39710 - 40525 844 271 aa, chain + ## HITS:1 COG:FN1199 KEGG:ns NR:ns ## COG: FN1199 COG0389 # Protein_GI_number: 19704534 # Func_class: L Replication, recombination and repair # Function: Nucleotidyltransferase/DNA polymerase involved in DNA repair # Organism: Fusobacterium nucleatum # 
100 271 179 350 350 304 97.0 1e-82 MERIIMHYDMDAFYASIEINRNPKLKNKPLVVGENIVTTASYEARKYGIHSAMKVSDAKL LCSKLIVIPADKAEYIRISKEIHNLILKITNKVDFIEREGVGKKFFEILKNDKIFYVKDI FKYSLNYLVKKYGKSRGENLYCSVRGIDFDEVEYQREIHSIGNEETFLIPLQNNSEIIRE FNSLFEYTFERLLKNNIFTQSITIKMRYTSFKTYTKSKKLKFSTRSKDFLYNEMFELINS FEKEDEVRLLGVYFGDIKKSNLVQLALNKNL >gi|296154448|gb|ADVK01000030.1| GENE 41 40589 - 41395 902 268 aa, chain - ## HITS:1 COG:no KEGG:FN1200 NR:ns ## KEGG: FN1200 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 268 1 259 259 486 100.0 1e-136 MKKSLILLSMLALSVTAFAAKPTLPKSYTVRYTHNFGRIDGFVQIPKGGQFNTTSERRPT FDELNIKNINYPELFVGAKWDNFGIYYGIKYKSFKGDTTLNEDLKTHDLQLRKGDKISSK HLYAFHNLGFSYDFKLNSKFTVTPKIEFSVFQFSYKFSSSGSTTVSNDERKFNAGGIRVG GEANYQFTDDFGLRFDVMTHIPHDSIKSSLDTSLTGSYNLYRNGNTEVNVLAGIGYDSFK YRDTQKDMQNFMDSKTKPVYKLGLELKF >gi|296154448|gb|ADVK01000030.1| GENE 42 41509 - 41877 374 122 aa, chain - ## HITS:1 COG:no KEGG:FN1201 NR:ns ## KEGG: FN1201 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 122 3 124 124 162 100.0 3e-39 MKNQLKEDLNDFLKEKEELREVIGKIGGNNNPQSKLITTLFVGIVLVIFVTGIILKQLSP TTTLLLLILILSFKIIWMQQQTQKSMHFQFWILNSIEIRINELDKKQKKLEKMIEELEKK KE >gi|296154448|gb|ADVK01000030.1| GENE 43 41884 - 42660 1203 258 aa, chain - ## HITS:1 COG:FN1202 KEGG:ns NR:ns ## COG: FN1202 COG0171 # Protein_GI_number: 19704537 # Func_class: H Coenzyme transport and metabolism # Function: NAD synthase # Organism: Fusobacterium nucleatum # 1 258 1 258 258 479 100.0 1e-135 MNKLDLNLKEVHNELVEFLRENFKKAGFSKAVLGLSGGIDSALVAYLLRDALGKENVLAI MMPYKSSNPDSLNHAKLVVEDLKINSKTIEITDMIDAYFKNEKEATSLRMGNKMARERMS ILFDYSSKENALVVGTSNKTEIYLGYSTQFGDAACALNPIGDLYKTNIWDLSRYLKIPNE LIEKKPSADLWEGQTDEQEMGLTYKEADQVMYRLLEENKTVEEVLAEGFNKDLVDNIVRR MNRSEYKRRMPLIAKIKR >gi|296154448|gb|ADVK01000030.1| GENE 44 42672 - 43541 1307 289 aa, chain - ## HITS:1 COG:FN1203 KEGG:ns NR:ns ## COG: FN1203 COG1161 # Protein_GI_number: 19704538 # Func_class: R General function 
prediction only # Function: Predicted GTPases # Organism: Fusobacterium nucleatum # 1 289 1 289 289 495 98.0 1e-140 MSMTQINWYPGHMKKTKDLIEENLKLIDVALEIVDARIPLSSKNPNIASLSKNKKRIIVL NKSDLVSKQELDKWKKYFKEQDFADEVVEMSAETGYNVKKLYEAIEYVSKERKEKLLKKG LKKVSTRIIVLGIPNVGKSRLINRIVGKNSAAVGNKPGFTRGKQWVRIKEGIELLDTPGI LWPKFESETVGMNLAITGAIRDEILPIEDVACSLLRKMLEQGRWESLKERYKLLEEDRDD KVLENILSKIALRMAMLNKGGELNVLQAAYTLLRDYRVAKLGKFGLDEI >gi|296154448|gb|ADVK01000030.1| GENE 45 43522 - 44229 804 235 aa, chain - ## HITS:1 COG:FN1204 KEGG:ns NR:ns ## COG: FN1204 COG0313 # Protein_GI_number: 19704539 # Func_class: R General function prediction only # Function: Predicted methyltransferases # Organism: Fusobacterium nucleatum # 1 235 1 235 235 432 100.0 1e-121 MLYIVATPIGNLEDMTFRAIRTLKEVDYIFAEDTRVTKKLLDHYEIKSTVYRYDEHTKQH QIANIINLLKEEKNIALVTDAGTPCISDPGYEVVDEAHKNNIKVVAIPGTSALTASASIA GISMRRFCFEGFLPKKKGRQTLLKQLAEEKERTIVIYESPFRIEKTLKDIETFMGKRDVV IVREITKIYEEVLRGSTSELIEKLEKNPIKGEIVLLIEPQQKEQKGGNKYVNDTD >gi|296154448|gb|ADVK01000030.1| GENE 46 44240 - 45559 1842 439 aa, chain - ## HITS:1 COG:FN1205 KEGG:ns NR:ns ## COG: FN1205 COG0793 # Protein_GI_number: 19704540 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic protease # Organism: Fusobacterium nucleatum # 13 439 1 427 427 744 100.0 0 MKITLKKAAAILMIAISSLSFTEDDRTGFLSNMRELKEISDIMDVIQDSYVENANAQKYK EEKNKNSARKNTGVTKKSLMQGALRGMMESLDDPHSVYFTKEEMRSFQEDIKGKYVGVGM VIQKKVGEPLTVVSPIEDGPAYKVGIKPKDKVIEIDGESTYNLTSEEASKRLKGKANTIV KVKVFREVNKMTKVFELKRETIELKYVKSKMLDDGIGYLRLTQFGDNVYPDMKKALEDLQ AKGMKGLIFDLRSNPGGELGQSIKIASMFIEKGKIVSTRQKKGEESVYTREGKYFGNFPM VVLINGGSASASEIVSGALKDHKRATLIGEKTFGKGSVQTLLPLPDGDGIKITIAKYYTP NGISIDGTGIEPDTKIEDKDYYLISDGAITNIDENQQKENKKEIIKEVKGEKVAKEVDTH KDIQLEAAIKAIKGMTNKK >gi|296154448|gb|ADVK01000030.1| GENE 47 45574 - 46350 1013 258 aa, chain - ## HITS:1 COG:FN1206 KEGG:ns NR:ns ## COG: FN1206 COG1189 # Protein_GI_number: 19704541 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted rRNA 
methylase # Organism: Fusobacterium nucleatum # 1 258 9 266 266 421 99.0 1e-118 MRLDEYLVENEYFENLEIAKRQIMVGNVIVNERKIDKPGEIILLDKVKSIRIKEKDSPYV SRGGLKLEKAIKVFDLDFKDKIILDIGASTGGFTDCSLQNGAKFVYAVDVGTNQLDWKLR NDCRVKSIENKHINNLEKSDLKDDIDIIVMDISFISIKKVLYKIKELSKENGFAIFLIKP QFEAKRNEIEKGIVDDFNVHKRVINEVVEEAKIHQLFLENLTVSPIKGTKGNIEYLAKFS KKNNFFSNKEIVDKLFNN >gi|296154448|gb|ADVK01000030.1| GENE 48 46352 - 47176 990 274 aa, chain - ## HITS:1 COG:FN1207 KEGG:ns NR:ns ## COG: FN1207 COG3481 # Protein_GI_number: 19704542 # Func_class: R General function prediction only # Function: Predicted HD-superfamily hydrolase # Organism: Fusobacterium nucleatum # 1 274 1 274 274 486 100.0 1e-137 MVEKNSKSKKFIDCLLNFQDVKDLELCDDQGVKVSTHTYDVLNISINKIKEKYIELEEAI KNVDFFAITVGIIMHDISKSSIKRNEENLSHSQMMIQNPEYIISEVYEVLEFIEKQVGYR LIKDVKENIAHIVQSHHGKWGKVQPETEEANIVYIADMESAKYHRINPIQANDILKYSVK GLGLTEIEKKLNCTAAVIKDRIRRAKRELNLKTFAELLEVYKEKGRVPIGDKFFVLRSEE TKKLKKFVDKQGFYNLFMKNPLMEYMIDDKIFEK >gi|296154448|gb|ADVK01000030.1| GENE 49 47160 - 48962 1903 600 aa, chain - ## HITS:1 COG:FN1208 KEGG:ns NR:ns ## COG: FN1208 COG1154 # Protein_GI_number: 19704543 # Func_class: H Coenzyme transport and metabolism; I Lipid transport and metabolism # Function: Deoxyxylulose-5-phosphate synthase # Organism: Fusobacterium nucleatum # 1 600 1 600 600 1174 100.0 0 MNEELTQRCEEIRKNLIEVVSKNGGHLGSNLGVVELTVCLDEIFDFKEDIVLFDVGHQAY IYKILTDRAERFDSIRTRKGLSPFLDPNESSYDHFISGHAGTALPAAVGFAIANPDKKVI VVVGDASISNGHSLEALNYIGYKKLENILIIVNDNEMSIGENVGFISKFLKKVISSGKYQ NFREDVKSFINRIKADRVKRTLERLERSIKGYVTPFYALESLGFRFFNVFEGNNIEKLLP MLKKIKDLKGPTILLVKTEKGKGYCFAEEDKEKFHGIAPFNIETGNTYKSLVSYSEVFGN KILELGKEDENIYTLSAAMIKGTGLHKFSEEFPERCIDTGIAEGFTVTLAAGLAKSGKKP YVCIYSTFIQRAISQLIHDVSIQNLPVRFIIDRSGIVGEDGKTHNGIYDLSFFLSIQNFT VLCPTTAKELGQALEISKNFNLGPLVIRIPRDSIFDIENEEPLEIGRWKVIKKGSKNLFI ATGTMLKIILEIYDKLQNRGIYCTIISAASVKPLDENYLLNYIKEYDNIFVLEENYVKNS FGTAILEFFNDNGIQKPLHRIALKSAIIPHGKREELLKEERLKGESLIERIEELIYGRKK >gi|296154448|gb|ADVK01000030.1| GENE 50 48976 - 49275 229 99 aa, chain - 
## PROTEIN SUPPORTED ## NR: gi|212638657|ref|YP_002315177.1| Predicted RNA-binding protein containing KH domain, possibly ribosomal protein [Anoxybacillus flavithermus WK1] # 1 94 2 95 97 92 44 4e-18 MNSKKRAFLKKKAHNLEPIVRIGKDGLNQNIIQSILDAIASRELIKVKILQNCEEEKTII YSKLMDIKDFEVVGMIGRTIIIFKENKKNPTISLEWKNI >gi|296154448|gb|ADVK01000030.1| GENE 51 49301 - 51127 2620 608 aa, chain - ## HITS:1 COG:FN1210 KEGG:ns NR:ns ## COG: FN1210 COG0595 # Protein_GI_number: 19704545 # Func_class: R General function prediction only # Function: Predicted hydrolase of the metallo-beta-lactamase superfamily # Organism: Fusobacterium nucleatum # 1 608 1 608 608 1135 100.0 0 MSIKDDVQHLKTKKTKKKVEKNITEKTKKQKEEVKKNEVVQNNNKKVKTSKKDLDKMYVI PLGGLEEVGKNCTIIQYKDEIIIVDAGAIFPDENLPGIDLVIPDYTFLENNKSKVKGLFV THGHEDHIGGIPYLYEKIEKDTVIYAGKLTNALIKSKFENFGVKKALPKMVEVGSRSKVS VGKYFTVEFVKVTHSIADSYCLSVKTPAGHVFLTGDFKIDLTPVDNEKVDFMRLSELGEE GVDLMLSDSTNSEVEGFTPSERSVGDAFRQEFQKATGRIVIAVFASHVHRIQQIIDTAAH FKRKIAIDGRSLLKVFEIAPSVGRLTIPKNLLIPISSVDKYDDDEIVILCTGTQGEPLAA LSRIAKNMHKHIALREGDTVIISSTPIPGNEKAVSTNINNILKYDVDLVFKKIAGIHVSG HGSKEEQKLMLNLINPKHFMPVHGEYRMLKAHMRSAIETGVPKDKILLTQNGDKVEVTKE YAKINGKVNSGEILVDGLGVGDIGSKVIKDRQQLSEDGIVIVAYSIDKETGKIVSGPEMS TKGFVYYKDSEDTIKEAQDLLSKKINKNETYLGRDWADLKGHVRDLLSRFFYEKLKRNPI ILPMLLEI >gi|296154448|gb|ADVK01000030.1| GENE 52 51144 - 53117 2783 657 aa, chain - ## HITS:1 COG:FN1211 KEGG:ns NR:ns ## COG: FN1211 COG0768 # Protein_GI_number: 19704546 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Cell division protein FtsI/penicillin-binding protein 2 # Organism: Fusobacterium nucleatum # 1 630 1 630 657 1161 100.0 0 MKLNRYKNNDVILGDKRNARELIFKIIVFLCFLILFLRLLYLQVLQGNEFSYLAERNQYK LVKIDSPRGKIFDSKNRLVVTNGTGYRLIYSLGREENEEYIKEIAKLTDKTEEIVRKRIK YGEIFPYTKDNVLFEDLDEEKAHKLIEIVNNYPYLEVQVYSKRKYLYDTVASHTIGYVKK ISEKEYEALKEEGYTPRDMVGKLGIEKTYDDLLRGRNGFKYIEVNALNKIEREVEKVKSP IVGKNLYMGINMELQQYMEEEFEKDGRSGSFIALNPKTGEIITIVSYPTYSLNTFSSQIS PEEWDAISNDPRKILTNKTIAGEYPPGSTFKMISAIAFLKSGIDPKLKYNDYTGYYQVGN 
WKWRAWKRGGHGATDMKKSLVESANTYYYKFSDQIGYAPIVKTARDFGLGNVSGIDVPGE KKGIIPDPDWKKKRTKTVWYRGDTILLSIGQGFTLVTPIQLAKAYTFLANKGWAYEPHVI SKIEDLQTGKMEIVSSKKTVLEDYPESYYDIINDALIATVDQNNGTTRIMKNPYVKVAAK SGSAQNPHSKLTHAWVAGYFPADKDPEVVFVCLLEGAGGGGVMAGGMAKRFLDKYLEVEK GIEPVQYTPHIEPKTTNSTIQTSGNQENEDSGEGMGEERENEERETGETTTSEGEQN >gi|296154448|gb|ADVK01000030.1| GENE 53 53107 - 53529 157 140 aa, chain - ## HITS:1 COG:no KEGG:FN1212 NR:ns ## KEGG: FN1212 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 140 1 140 140 171 100.0 1e-41 MTVIIISLILIFIGNSLPLDGVFIGITLPFITFIIGKRRSLFFIFLAWLLFSLQTDKYSF NLLVMSLFGFLNFLLFCYVEYDKKSIFYLIPMDIGFYLLIVYRSFYIGIDVKYLIINIIS FFILNYFYASRKNKRKIDEA >gi|296154448|gb|ADVK01000030.1| GENE 54 53540 - 54523 1209 327 aa, chain - ## HITS:1 COG:no KEGG:FN1213 NR:ns ## KEGG: FN1213 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 327 1 327 327 541 100.0 1e-152 MATKKKKKKKGRAPVLVIVLTIILSILLYFNFRGNNIKLSKDERVLIIGKQNLYAVYEDK LAVKIPFELYIDSDETVEDLVDSQNYENVLEKINSVVPEKLTRYTVIKSGEIKLDVENAR NIPETNIGDRRYILTSSVYAMFKDLYHEKNAVDELNENILVDVLNANGIGGYARKTGELI KSTLGMKYNAANYETTQDQSYVVLNDISKEKAAEILDKLPEKYFKIKNKSSIPTLANIVV IIGSEKQINFKIDIYANQEKLKEASEKLRKAGYGSISSQPEKEETEQSIIEYNKEDYFIA LRIAKILGISDMVENSDLENKIGITIK >gi|296154448|gb|ADVK01000030.1| GENE 55 54527 - 55834 865 435 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|16079597|ref|NP_390421.1| hypothetical protein BSU25430 [Bacillus subtilis subsp. subtilis str. 
168] # 7 421 4 423 451 337 41 7e-92 MSFSKKVAFHTLGCKVNQYETESIKNQLIKRGYEEVPFEDKSDIYIINSCTVTSIADRKT RNMLRRAKKINPKAKVIVTGCYAQTNSREILEIEDVDFVIDNKNKSNIVNFVGAIEDISF EREKNGNIFQEKEYQEYEFATLREMTRAYVKIQDGCNHFCSYCKIPFARGKSRSRKKENI LKEIEKLVEDGFKEIILIGIDLSAYGEDFEEKDNFESLLEDILRIKDLKRVRIGSVYPDK ITDRFIELFKNKKLMPHLHISLQSCDDTVLKNMRRNYGSSLIKKSLLKLKSKVKDMEFTA DVIVGFPKEDEIMFQNTYDVIKEIEFSGLHIFQYSDREGTIASNMDGKIDAKTKKQRADR LDSLKQEMIVDSRKKYLEKSLEVLVEEEKNGEYFGYSQNYLRVKFRSDKKDLVNNLINVK VKCVENDILIAEKEM >gi|296154448|gb|ADVK01000030.1| GENE 56 55821 - 56528 889 235 aa, chain - ## HITS:1 COG:FN1215 KEGG:ns NR:ns ## COG: FN1215 COG1385 # Protein_GI_number: 19704550 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 235 1 235 235 378 99.0 1e-105 MLSVVVTEVYDDFILVIDAGDINHIKNAFRKVKGDKIRAVDGANEYLCEIEKIEDKEIKL KILEKIEDKFSLDIEVDAGISILKGDKMDLTIQKLTELGINKIIPISVKRCVVKLDKKKD RWDTISKEALKQCQGVVPTVIDEIKKINKLNFKDYDLILVPYENEKEIFIKEILRDLKIK PSKILYIIGAEGGFEKEEIDYLKENGAKIISLGKRILRAETAAIVTGGVIINEFL >gi|296154448|gb|ADVK01000030.1| GENE 57 56532 - 56963 469 143 aa, chain - ## HITS:1 COG:FN1216 KEGG:ns NR:ns ## COG: FN1216 COG1959 # Protein_GI_number: 19704551 # Func_class: K Transcription # Function: Predicted transcriptional regulator # Organism: Fusobacterium nucleatum # 1 143 1 143 143 263 100.0 1e-70 MKVNTKVRYGLKALAYIAENSTDKKLVRIKEISEDQDISVQYLEQILFKLKNENIIEGKR GPTGGYKLAIEPKEIDLYMIYRILDDEEKVIDCNEMGEGKTHSCSEEGCGDTCIWSKLDN AMTKILSETSLQDFINNGKRIQE >gi|296154448|gb|ADVK01000030.1| GENE 58 56960 - 57958 1129 332 aa, chain - ## HITS:1 COG:FN1217 KEGG:ns NR:ns ## COG: FN1217 COG2255 # Protein_GI_number: 19704552 # Func_class: L Replication, recombination and repair # Function: Holliday junction resolvasome, helicase subunit # Organism: Fusobacterium nucleatum # 1 332 1 332 332 617 100.0 1e-176 MERIISELEMPNEIEIQKSLRPKSFDEYIGQENLKEKMSISIKAAQKRNMVVDHILLYGP PGLGKTTLAGVIANEMKANLKITSGPILEKAGDLAAILTSLEENDILFIDEIHRLNSTVE 
EILYPAMEDGELDIIIGKGPSAKSIRIELPPFTLIGATTRAGLLSAPLRDRFGVSHKMEY YNENEIKSIIIRGAKILGVKINEDGAIEISKRSRGTPRIANRLLKRVRDYCEIKGNGTID KLSAKNALDMLGVDSNGLDDLDRNIINSIIENYDGGPVGIETLSLLLGEDRRTLEEVYEP YLVKIGFLKRTNRGRVVTSKAYQHFKKVEVKI >gi|296154448|gb|ADVK01000030.1| GENE 59 58173 - 58775 702 200 aa, chain - ## HITS:1 COG:FN1218 KEGG:ns NR:ns ## COG: FN1218 COG4399 # Protein_GI_number: 19704553 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 200 1 200 200 326 100.0 2e-89 MKQLLVMILISGAIGWITNWVAIKMLFRPHREINFGLFKIQGLIPKRRAEIGTGIAKIVQ NELISVKDVISNIDREEFSKRLNKLIDEVLNKNLKRKVKEKFPLLQVFFTDKVAKDIGNA IKGIIMENKEKIFEIFSNYAEENIDFEIIISDKISNFSLDKLEEIITLLAKKELKHIEVI GAVLGMIIGAVQYLITLIVI >gi|296154448|gb|ADVK01000030.1| GENE 60 58977 - 59217 334 80 aa, chain + ## HITS:1 COG:no KEGG:FN1219 NR:ns ## KEGG: FN1219 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 80 1 80 151 135 100.0 6e-31 MSVLYIKILSDYYHHIVGDLEENRKNFLEKFYSYLLEKDEYGYAPIFEGELERIDYLLKQ ISLEAKGMSLDEFFKLMSWY Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:39:53 2011 Seq name: gi|296154446|gb|ADVK01000031.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00036, whole genome shotgun sequence Length of sequence - 1859 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 3 - 1836 2173 ## COG5651 PPE-repeat proteins Predicted protein(s) >gi|296154446|gb|ADVK01000031.1| GENE 1 3 - 1836 2173 611 aa, chain - ## HITS:1 COG:FN1381 KEGG:ns NR:ns ## COG: FN1381 COG5651 # Protein_GI_number: 19704716 # Func_class: N Cell motility # Function: PPE-repeat proteins # Organism: Fusobacterium nucleatum # 74 611 6 526 1176 437 52.0 1e-122 MKKIFLKVLLFVFLFSISLYGASPQDITLEKGETKEYEVKAVNNEANPAVYAKDLILKKG ATLKVNGNIDNVNKTIVGIFVKRQGFKGGNLEIQKGAKLEVNVEVQNYTLSSKKIPLEGV RTFSLNSSNKLKIDEGATLTVKAHSNKALIPEDNHNSHSHDDEDDLFHQMWHRGIGINLG TSLEVKGNLNVESTGTGIKNYKEPSDERSLVFDKGSKTFIKAGLIGIDANTEKQRVIFKE GSYSQIIGGAYGLISKNATFEKGSKVKAMGKYGLGNFKYSGTGDKYTFQAGSEIEILANQ AALYAINIKEGSTKLIAKARNVFMQVDTRVNGNENNLATLTLEKGSILEGNIDKSWKAKI LMEKGTKIFSDDRIDTNLDLRGELFVGPREAYENIKSKNYKEEIEKMDFVAGASVILMGK LKNQSLKSLNAKTPYENLPTDKYYTVRFNDYTYNNKIKVNFDNSKIHLRIGGADKTGATN DKIFFSRNTTINGKGEVVFHKRNSSQVTKNTVFKILEEEGLKNINGIQGYYLEKLPLKIP DVSFGQLVFTTKVQKLNGRYVVSLVFKGIEADNFLLAKGETKTLNKKKEGSLEDAILSAK EVTIEEKSTLN Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:39:54 2011 Seq name: gi|296154444|gb|ADVK01000032.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00037, whole genome shotgun sequence Length of sequence - 5329 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 3 - 5328 7453 ## FN0254 hypothetical protein Predicted protein(s) >gi|296154444|gb|ADVK01000032.1| GENE 1 3 - 5328 7453 1775 aa, chain - ## HITS:1 COG:no KEGG:FN0254 NR:ns ## KEGG: FN0254 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 300 1775 1 1476 1677 2340 99.0 0 KGDKAEKYPFEGIFTRDTNLFNRYMPTNSENYSLLPKSSNPRSATSNSRANLNGYGIAST QPVPEPSVETEVNAGINPKSINKVPLNIAAKVANSPILPEAVKFAPIDPKIVIPADPDLP EAPKFSVVLGADCNEGCNSNSSTPRQNTKGGFLSSGDNASKQNIDVILHYTWRNNSGAEK SYAFKMYKESNTSALPAPPDGGNTYYFNSYNFGGTKEFASGVVNSQGTDKNHQHFFIGGS RFWEIDNRSSGTFEFPNGKTLNLGGILTLGLVSQENGTELKNSGTMTDIEEKDEKWIKEM PYDAGKDYLTIKGPTEDYKVKRSGEGYVGYKVGIAQVQENGPSQFDGNKQKMTNSSTGKI NFFGERSIGMYVYLPGNTTHAIMRNEGEINLRGSESYGMKIAAKSADRAEMVNTSTGKIT LGKNGKDSASNSIAMALTADSTVTNGVSLKRGNARNEGTITVKDVQNSLGAYVNVNSSIT NTATGKININSTIAELTAAEKAANKKQAYNMGMRADTVSGAEIINNGEITIDGAHAIGML ANGSKLVNTKTISSTNVKNGIGIVGMNGATVENSGTIKVVGTGNTNNIGILINGSTGTVG TPAGATPPSIEVSGDNSTGALVTGAGSSLTMKGNVTVSGNSITGIVANGTTVKLEGDATV KVDNNGAEAGEINKKGSYGIVVKGSAGKFEGKDTTVNTKVTTDKSIGLYSEGELIVKKAN VTATDGAINFFAKDGKININGGGTTITGQKSLLFYTSGSGKVTLGGAMTATIKGGTTPST RGTAFYYVSPIRYGAFNTGAIQNYFNNTFGNGTSTLNHLTLNMEQGSRLFVASNVGMNLS DTSATNLMSGITNAPTITGSNYKTFMLYLSKLTINQAVDLNDSNSNYAQLEIANSSIENA NNMTGTQNRQVAMAQENGNDTSGAGYESKEVTLTNTATGVINLTGEETTGIYAKRGRIVN AGKISVGKKSTGIYLVEDDRSPAAAAVGATATNDASGVITVGENSTGIYYKVKNDNADGK GTNTGVSGGISNDGRIESTAKDVIAMSFDSPYSSKTVKNEHTGVIDLQGQNSTGIFATGA GTYTAINEGKIKLASSTNVNTPNIGMYTDKSTITLESDGTIEGGDKTVGIYGYGVNLKTN AITKAGTGGTGVYSKGGNITINGGTLSVGENGAEGSNDAVGVYYVGAGGTITNNAANVNI GNSAYGFVVQNENGAAVTLKTSTPNVTLKNDAVYAYSNNRAGSVTNTSVLNSTGNGNYGI YSAGTVTNNGNINFGTGIGNVGIYSILGGTATNNARITVGASDAATEKFGIGMAAGYKSS DSGNIINSSTGVINVTGKNSIGMYATGPSSTATNKGTINLSGENSVGMYLDNGATGINEG TITTVGSPKGAKGVVLSNNSKLINRAGAKININSAEGFGIYRVNSEETNITVANYGDITV GGGATTSGEFDPTGGKPLEKTAGGVSLKSPKGTNDINITVNGKPVPNVEKVTNPIGHRGD ALISSLGMYVDTLRGTNPINGLSKLNVKKAELLYGVEAAENSNSKYFEVSGKILKPYQDA VKNAQGIKWSHNSAAFTWMALPTVDSNGVPLKVGMAKIPYTAFAGKEPMPVEVSDTYNFL 
DGLEQRYDKNALDSREKLLFNKLNGIGNNESVLFYQAVDEMMGHQYANVQQRIQSTGIIL DKEFKYLRDEWRTASKDSNKIKAFGTRGEYKTDTAGVIDYKNHAYGVAYVHENEDIKLGR GIGWYTGIVHNTFKFKDIGKSKEHMLQGKVGLFKS Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:40:25 2011 Seq name: gi|296154438|gb|ADVK01000033.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00041, whole genome shotgun sequence Length of sequence - 10967 bp Number of predicted genes - 5, with homology - 5 Number of transcription units - 4, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 114 - 162 10.0 1 1 Tu 1 . - CDS 225 - 1412 1615 ## COG1301 Na+/H+-dicarboxylate symporters - Prom 1497 - 1556 14.2 + Prom 1422 - 1481 15.5 2 2 Tu 1 . + CDS 1583 - 2929 1484 ## COG0166 Glucose-6-phosphate isomerase + Term 2942 - 2991 -0.6 - Term 2926 - 2983 9.2 3 3 Op 1 . - CDS 2998 - 3561 613 ## FN2055 hypothetical protein 4 3 Op 2 . - CDS 3530 - 3850 388 ## FN2057 hypothetical protein - Prom 3883 - 3942 7.6 - Term 3892 - 3942 9.8 5 4 Tu 1 . 
- CDS 3955 - 10965 9989 ## FN2058 hypothetical protein Predicted protein(s) >gi|296154438|gb|ADVK01000033.1| GENE 1 225 - 1412 1615 395 aa, chain - ## HITS:1 COG:FN2053 KEGG:ns NR:ns ## COG: FN2053 COG1301 # Protein_GI_number: 19705343 # Func_class: C Energy production and conversion # Function: Na+/H+-dicarboxylate symporters # Organism: Fusobacterium nucleatum # 1 395 1 395 395 652 99.0 0 MGAKKLGLVPRLIIAIIVGILIGQFLPLWIVRIFKTFSTFFGLFLSFFIPLMIVGYVVSG IAKLTEGAGKLLGFTAIVSYVSTIVAGTFSYMVAANLYPKLISGISSRINFEGKDVAPYF TIPLKPPIDVTAAIVFAFMMGITISVMRSQNKGETTFKLFGEYEEIITKVLAGFVIPLLP FHILGIFSEMAYSGIVFKVLGVFLAIYLCIFAMHYVYMLVMFSIAGGVSKRNPFTLIKNQ IPAYFTAVGTQSSAATIPVNIQCGLKNGTSPEIVDFVVPLCATIHLSGSMITLTSCIMGV LLLNGMPHSLGMMFPFLCMLGIAMVAAPGAPGGAVMSALPFLFLIGIDAQGPLGSLLIAL YITQDSFGTAINVSGDNAIAIYVDEFYKKYIKKAA >gi|296154438|gb|ADVK01000033.1| GENE 2 1583 - 2929 1484 448 aa, chain + ## HITS:1 COG:FN2054 KEGG:ns NR:ns ## COG: FN2054 COG0166 # Protein_GI_number: 19705344 # Func_class: G Carbohydrate transport and metabolism # Function: Glucose-6-phosphate isomerase # Organism: Fusobacterium nucleatum # 1 448 1 448 448 856 99.0 0 MKKINLDYSKVFNFISQEELNQIKVSIDKVAEKLHNKSGAGNNFLGWLDLPINYDKEEFS RIKKASEKIKADSDVLIVIGIGGSYLGARAVIECLSHSFFNSLNKEKRNAPEIYFAGQNI SGRYLKDLIEIIGDRDFSVNIISKSGTTTEPAIAFRVFKELLENKYGEKAKDRIYVTTDK NKGALKKLADEKGYEKFVIPDDVGGRFSVLTAVGLLPIAVTGINIDALMNGAQIAREDYS KDFADNDCYKYAAIRNILYKKNYNIEILANYEPRFHYISEWWKQLYGESEGKDKKGIFPA SVDLTTDLHSMGQYIQDGRRNLMETILNVENSDKDIVIKKEVEDLDGLNYLEGKGLSFVN NKAFEGTLLAHIDGGVPNLVINIPEVTAFNIGYLIYFFEKACAISGYLLEVNPFDQPGVE SYKKNMFALLGKKGYEELSKELNERLKK >gi|296154438|gb|ADVK01000033.1| GENE 3 2998 - 3561 613 187 aa, chain - ## HITS:1 COG:no KEGG:FN2055 NR:ns ## KEGG: FN2055 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 67 187 90 211 218 194 96.0 2e-48 MQEKIVVTLSDFFSEYQYLLKELNENDYSKFKKVLSEEANLSNLGTTLKFLTKILYEKYN KKVVVLIDEYEENLQEILLTSVNNDTKKGNEAFYHDLIMGMGLYLEGEYITKSNIKSGLG 
RYDFLIEPKNKSKRAFIMEFKSTDSVEKLEEISKEALKQIEDKKYDISLKQNGIKEITHI GIAFYGK >gi|296154438|gb|ADVK01000033.1| GENE 4 3530 - 3850 388 106 aa, chain - ## HITS:1 COG:no KEGG:FN2057 NR:ns ## KEGG: FN2057 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 106 7 112 112 185 97.0 4e-46 MKRIGIGLSDFKELIEENYYYFDKTKFIDEVVKDGAKVKLFTRPRRFGKTLNMSMLKCFF DIKEADKNRKLFKGLYIEKTESFKEQGQYPLIFLSLNARKNSCYPF >gi|296154438|gb|ADVK01000033.1| GENE 5 3955 - 10965 9989 2336 aa, chain - ## HITS:1 COG:no KEGG:FN2058 NR:ns ## KEGG: FN2058 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 544 2336 1 1794 1794 2905 98.0 0 VNPVANGVPTREEIATSRENLRNSVGSLQSKIDEARAENSKSLAGLRLELIQLVEQGDQV VKSPWMSWQFGINYMYSKWNGTYKGRGDKAEKYPFEGIFTRSTNVFGRTATARTADQKAA LASIIATNGGFNPNGNGLNYGLVNRALVPEDPMTIEVSAGIRPKNIQKSAITLSVPPVDI PQPRPSATPGVPNTPSAPNINIPSFLPVAPDVKEPQLPPPPVYNLVLGADCNEDCNSSNS TPRQTTKDGWIGSPEQKDSQNVKLLLHYTWPYNKGAESSLAFKMYKQAFYANIPAGEYNF NSYNYGNGKEFADNIVDSQGTERNHQYFFIGGSRFWEIDNYGGIGQRFEIKSGTTLNLAG ILTLALVSQENGGILANAGTITDVKEKDDQYIKEFTDREINGPSGKMNIKKSKDGYVGYK VGLAQVEENSDGGISAGHVEPNSHQYQELINEGTIDFRGNNSMGIYVYQPRNVQNTYTWY STYAKVINAITGKIYLSGKESYGMRIAAKSEGQAEMTNKGLIELRNSPLSGGKDKADKSV AMIMKADDTVKHKVSIAENKINNEGTIKISGTVENSIGMYLDIASNMTNKKEIEISSKAS DGNLNVGMRADQVQDKYTDSNYDTSVINAGTGKISLIGENSIGMIAANTSKWGSAKAKNL GEISITAGKRNIGMLSANEASVINETTGKINIGTSPNSVGMASIIQGSKVSKAENKGKIN VTGANSTGVYNTGEFLMNTATAEINVAGRQSIGLYAKGNTPQTKTELKAGTVTASNSGVG LYADKSTVRLGGGTDPLNLIANNGGLLFYNYESNNKNNVANGVFNLAGNVKATVNNGGYA FYLKGTTINPDGTMTNLANFFNSMFDDKNSTGKLDVTLNSGGTLMVLDRPNGGNIRLSNV ANTSSIVSSLGNRVNINSASTKYKVYAVYRGGLQIDQNVNLDNDSNSTNPDAFYKVDFRS SNMTLDSGKSMIGSKQGQVALFQGNYAETGDVGTVDKVKITNNGKISLSGNSGTKTTTAM AGDFVTLTNNKDIEVTGDRAVGMFGAGGSKVLNNTNGTITVGKEGVALYGINKLGTSTLG DQTISLTNKGTLKGINGKTKAFGIFAENTLAVNKSTLINSGTIDFSNSQESIGIHSVDST VSNTGKINMGLKGVAINAKNSDITSSGDITLAGNGIAFNLGGSFTGRTLNFSSKVTLNGD GNAIFNLKDMTFSSSGSSLTENLNIASNGKSFSYFSMDNSVLTYDKNKTFAGNKVTLVSA 
KNSTVDWKANLVLSGEESVAFYLNGRRAVANPELTIAAGKTITLLGNKSVGAYGVNGARI INNGNITIGTNGAALYSTGATGTINNTGTLTLGKNSTGIYMKDGTGVNNSGEIVSTQEGA KGLVVSSNTAALYSNGGKIKLTGTGSIGIHAEGAAHSIISSADVEVGDTTGTNQSVAIHL KDGGEVRVLSHTSVKAGNNSIGIYGSTILATIENDAKVEVGDGGIGIYAKNGNVNLESGS KMTIGKTLGTNKEAVGVYYVGNGGTVNNNLSSFNIGKGSIGIVDAGTGASTINNNLTSVV LPGDAVYTYTSNTNSNVIGNTKITSSGNGNYGYYVAGNLTNNAAASINFANGTGNVGIYS AYRAGGTGIARNAATITVGKTDLENELYSIGMAAGYTDSKDTSKNRVGHVINTGTINVGF DNSIGMYASGVGSIAENDGTINITGKKAIGMYLENGATGINTVNGKITIESSAEGAIAAY STGKNTIFKNYGTITLKAPSSKGIVTANDGQGTNLGAGNIDVQNGSAEATKNIEGTAGGD KRFGDKTLSVPKGGRTKSTITDADGKVLNPAEIDVTKVGNLSVTDPGQVAKNMKALQGHN DFSSVSRIGMYVDTSGVNYTNPIQGLEHLRGLKKADLIIGAEAAEYTNAKTISIGPNILK RYNEALAKTGLEKYDIISGSLTWAAATNGLTGTGAIKSVILTKVDYKEYAKKSTTPYNFL DGLEQRYDKNALDSREKRLFNRINSIGKNEAILLTQAFDEMLGQQYANVQQRVQATGDIL DKEFDYLRDEWRTASKDSNKIKTFGTNGEYKTSTAGIKDYKYHAYGVAYVHESEDIKLGR GTGWYTGIVHNTFKFKDIGKSKEQMLQAKVGLLKSVPFDDNNSLNWTISGDIFVGRNKMH RKFLVVDEIFNAKSKYYTYGIGIKNEIGKEFRLSESFILRPYAALKLEYGRVSKIREKSG EVRLEVKHNDYFSVRPEIGAELGFKHYFGMKALKTTLGVAYENELGRVANAKNKARVGYT SAGWYNLRGEKEDRRGNVKFDLNVGIDNTRVGVTANVGYDTKGENLRGGLGLRVIF Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:41:04 2011 Seq name: gi|296154437|gb|ADVK01000034.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00042, whole genome shotgun sequence Length of sequence - 1620 bp Number of predicted genes - 0 Number of transcription units - 0, operones - 0 average op.length - 0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - SSU_RRNA 27 - 1528 100.0 # FJ982958 [D:1..1502] # 16S ribosomal RNA # uncultured bacterium # Bacteria; environmental samples. Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:41:12 2011 Seq name: gi|296154411|gb|ADVK01000035.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00043, whole genome shotgun sequence Length of sequence - 27089 bp Number of predicted genes - 27, with homology - 25 Number of transcription units - 10, operones - 7 average op.length - 3.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 3 - 897 1271 ## COG5295 Autotransporter adhesin 2 1 Op 2 . - CDS 894 - 992 83 ## - Prom 1031 - 1090 10.4 - Term 1041 - 1090 11.1 3 2 Tu 1 . - CDS 1119 - 2462 1876 ## COG2978 Putative p-aminobenzoyl-glutamate transporter 4 3 Op 1 1/1.000 - CDS 2864 - 3472 780 ## COG3142 Uncharacterized protein involved in copper resistance 5 3 Op 2 23/0.000 - CDS 3486 - 4178 824 ## COG1346 Putative effector of murein hydrolase 6 3 Op 3 1/1.000 - CDS 4178 - 4534 360 ## COG1380 Putative effector of murein hydrolase LrgA - Prom 4555 - 4614 5.3 - Term 4568 - 4600 2.5 7 4 Op 1 . - CDS 4616 - 6097 1846 ## COG1190 Lysyl-tRNA synthetase (class II) 8 4 Op 2 . - CDS 6120 - 7373 1710 ## FN0465 hypothetical protein 9 4 Op 3 . - CDS 7391 - 7723 179 ## FN0464 hypothetical protein 10 4 Op 4 1/1.000 - CDS 7805 - 8272 331 ## COG1576 Uncharacterized conserved protein 11 4 Op 5 1/1.000 - CDS 8283 - 10196 2318 ## COG0323 DNA mismatch repair enzyme (predicted ATPase) - Prom 10223 - 10282 11.8 - Term 10264 - 10303 4.6 12 5 Op 1 1/1.000 - CDS 10330 - 10896 858 ## PROTEIN SUPPORTED gi|34763431|ref|ZP_00144379.1| PROBABLE SIGMA(54) MODULATION PROTEIN; SSU ribosomal protein S30P 13 5 Op 2 . - CDS 10947 - 11915 1175 ## COG0113 Delta-aminolevulinic acid dehydratase 14 5 Op 3 . - CDS 11928 - 12815 938 ## COG2849 Uncharacterized protein conserved in bacteria 15 5 Op 4 . - CDS 12832 - 13659 973 ## FN0458 hypothetical protein 16 5 Op 5 . 
- CDS 13674 - 14114 493 ## FN0457 hypothetical protein 17 5 Op 6 1/1.000 - CDS 14134 - 15540 2045 ## COG1306 Uncharacterized conserved protein - Prom 15570 - 15629 6.1 18 6 Op 1 1/1.000 - CDS 15663 - 16202 908 ## COG1592 Rubrerythrin - Prom 16298 - 16357 15.8 - Term 16335 - 16371 -0.8 19 6 Op 2 2/0.000 - CDS 16453 - 17928 2330 ## COG1012 NAD-dependent aldehyde dehydrogenases - Prom 17995 - 18054 12.7 - Term 18011 - 18073 10.9 20 7 Op 1 1/1.000 - CDS 18094 - 19848 2205 ## COG0006 Xaa-Pro aminopeptidase 21 7 Op 2 . - CDS 19873 - 21696 2597 ## COG0449 Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains - Prom 21893 - 21952 5.6 22 8 Tu 1 . - CDS 21984 - 22154 408 ## - Prom 22399 - 22458 9.8 - TRNA 21989 - 22075 76.7 # Leu CAA 0 0 - TRNA 22084 - 22160 75.9 # Arg ACG 0 0 - TRNA 22224 - 22300 75.0 # Arg TCG 0 0 23 9 Op 1 7/0.000 + CDS 22520 - 23293 1362 ## COG1540 Uncharacterized proteins, homologs of lactam utilization protein B 24 9 Op 2 1/1.000 + CDS 23308 - 24495 1677 ## COG1914 Mn2+ and Fe2+ transporters of the NRAMP family 25 9 Op 3 21/0.000 + CDS 24509 - 25258 1019 ## COG2049 Allophanate hydrolase subunit 1 26 9 Op 4 . + CDS 25251 - 26261 1445 ## COG1984 Allophanate hydrolase subunit 2 + Term 26269 - 26313 3.1 - Term 26255 - 26299 3.1 27 10 Tu 1 . 
- CDS 26321 - 27028 906 ## COG0813 Purine-nucleoside phosphorylase Predicted protein(s) >gi|296154411|gb|ADVK01000035.1| GENE 1 3 - 897 1271 298 aa, chain - ## HITS:1 COG:FN0735 KEGG:ns NR:ns ## COG: FN0735 COG5295 # Protein_GI_number: 19704070 # Func_class: U Intracellular trafficking, secretion, and vesicular transport; W Extracellular structures # Function: Autotransporter adhesin # Organism: Fusobacterium nucleatum # 145 296 311 464 617 166 75.0 4e-41 MKKYVSLKLIVFSFLLVASAAYSAPAFQAGTESDSTVAGGANTATGVASSAFGFKNKANG MGSSAFGYNNKVTGKFSSTVGTLTYTSGDESATFGVGAWDKSLNGGKGDFVYKNEGKYSF SIGRYNNIAAGTENNHILGNNSEISASNSVAIGTKNKISGNFSTAVGYNNNVSGNHSGAF GDPNLVTGNGSYAFGNDNTIKGDNNFVFGNNVTIEAGIQNSVALGNGSTVSSSNEVSVGS KGKERKITNVADGEISATSTDAVNGRQLYKAMQNSTNIENLRSEVYEKIDNVKDEVRE >gi|296154411|gb|ADVK01000035.1| GENE 2 894 - 992 83 32 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFIFVAIFLFLIQYILINYVFKNKILFREEFL >gi|296154411|gb|ADVK01000035.1| GENE 3 1119 - 2462 1876 447 aa, chain - ## HITS:1 COG:FN0470 KEGG:ns NR:ns ## COG: FN0470 COG2978 # Protein_GI_number: 19703805 # Func_class: H Coenzyme transport and metabolism # Function: Putative p-aminobenzoyl-glutamate transporter # Organism: Fusobacterium nucleatum # 1 447 66 512 512 792 100.0 0 MTLSIKSLLNAEGIRYIFSSMVKNFTGFAPLGTVLVALIGIGVAEGSGLMGATMKKVVTA TPKRLLTSIVVLAGVMSNIASDAGYVVLIPLGAVIFLSFGRHPIAGLAAAFAGVSGGFSA NLLLSTTDPLLSGITTEAAKLLNPDYFVNPASNYYFMCASTILITVMGTFITEKIIEPRL GEYKGEVLVDHNELTAQEKKALRCAGVSVIIFCIVMGILTIPANAILRVDGTLKQWTHDG LVPALMMFFLIPGIVYGKVAGTIKNDKDVAKMMGSSLATMGGYLALAFAASQFVAFFSYT NLGTYVAVKGADFLQSIGLTGLPLIIIFVLVAAFINLFMGSASAKWAIMAPIFVPMLMRL GYTPEFTQLAYRIGDSSTNIITPLMTYFAMIVAFMQKYDKESGMGTLISVMLPYSVAFLI GWTIFLMIWFITGLPIGVEGAIHLAGM >gi|296154411|gb|ADVK01000035.1| GENE 4 2864 - 3472 780 202 aa, chain - ## HITS:1 COG:FN0469 KEGG:ns NR:ns ## COG: FN0469 COG3142 # Protein_GI_number: 19703804 # Func_class: P Inorganic ion transport and metabolism # Function: Uncharacterized protein involved in copper resistance # Organism: Fusobacterium 
nucleatum # 1 202 1 202 202 369 99.0 1e-102 MIKEACVESFEKALEAQNNGANRIELCENLSVGGTTPSYGTVKICLEKLNISIFPMVRAR GGNFVYSKDEIEIMKEDIKIFKKLGVKGVVLGCLTSDNKIDLELTKELVDLAYPMEVTFH KAIDEILNPLDYIDGLVNIGIKRILTSGGKATALEGKDLINEMIKKSNGRLKIVVAGKVT KENLNDLSNLISADEFHGKLIV >gi|296154411|gb|ADVK01000035.1| GENE 5 3486 - 4178 824 230 aa, chain - ## HITS:1 COG:FN0468 KEGG:ns NR:ns ## COG: FN0468 COG1346 # Protein_GI_number: 19703803 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Putative effector of murein hydrolase # Organism: Fusobacterium nucleatum # 1 230 1 230 230 370 99.0 1e-103 MKDTIVGNLFFGLIISYFSFEIGRWIFKKTQSPICNPFLIGTSIVIFILKFFDISTDDYY KGAGMILFLLGPATVALAVPLYKKWDLFKKFFVPVMTGAVVGSFVGIISVIILGKLFGME EQLIFSLMPKSITTPFGIEISSMLGGIPAITVVSIMLTGITGNVTAPLISKIFRVKHSVA VGIGIGVSSHAVGTSKAMEIGEIEGSMSALSIVFAGLLTLIWAPLLKFLV >gi|296154411|gb|ADVK01000035.1| GENE 6 4178 - 4534 360 118 aa, chain - ## HITS:1 COG:FN0467 KEGG:ns NR:ns ## COG: FN0467 COG1380 # Protein_GI_number: 19703802 # Func_class: R General function prediction only # Function: Putative effector of murein hydrolase LrgA # Organism: Fusobacterium nucleatum # 1 118 1 118 118 142 100.0 1e-34 MINEFMLIFVINYVGILISSILHFPLPGTITALLLLFLLLQLKVLKLEKIENAANFLLLN MTLFFMPPTVKIIDSYHLLEKDLFKIIVIIVVSTFITMGITGKVVQVMIDYREKKGLK >gi|296154411|gb|ADVK01000035.1| GENE 7 4616 - 6097 1846 493 aa, chain - ## HITS:1 COG:FN0466 KEGG:ns NR:ns ## COG: FN0466 COG1190 # Protein_GI_number: 19703801 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Lysyl-tRNA synthetase (class II) # Organism: Fusobacterium nucleatum # 1 493 1 493 493 971 99.0 0 MEKYFDRLEKEPLIAERWKKIEELESYGIKPFGKKYDKQIMIGDILKHKPEENLKFKTAG RIMSLRGKGKVYFAHIEDQSGKIQIYIKKDELGEEQFDHIVKMLNVGDIIGLEGELFITH TEELTLRVKSIALLTKNVRSLPEKYHGLTDVEIRYRKRYVDLIMNPEVRETFIKRTKIIK AIRKYLDDRGFLEVETPIMHPILGGAAAKPFITHHNTLNIDLFLRIAPELYLKKLIVGGF ERVYDLNRNFRNEGISTRHNPEFTMVELYQAHADFNDMMDLCEGIISSVCQEVNGTTDIE YDGVQLSLKNFNRVHMVDMIKEVTGVDFWKEMTFEEAKKLAKEHHVEVADHMNSVGHIIN 
EFFEQKCEEKVIQPTFVYGHPVEISPLAKRNEDNPNFTDRFELFINKREYANAFTELNDP ADQRGRFEAQVEEAMRGNEEATPEIDESFVEALEYGLPPTGGMGIGIDRLVMLLTGAPSI RDVILFPQMKPRD >gi|296154411|gb|ADVK01000035.1| GENE 8 6120 - 7373 1710 417 aa, chain - ## HITS:1 COG:no KEGG:FN0465 NR:ns ## KEGG: FN0465 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 417 1 410 410 703 99.0 0 MLKKLSILVVAGILVGCANIDNLAGKRSGGPIREIGGEPQIPGETKIEDNKVDDGKVISK EANNENIKEYLSIVKGNLKGSIKKVEDNVENNYTVGLGETLIFPLDNERAIKLTASPKNT NAKINLSNGKVSFRSVYQGQYILSTYVDGSLNRKISVAVIAKYDFSERELYDVIMQDFES KNKDLENAVTLYKMMFPAGRYAKEVNYLFLKYAYDIKNTSLMNEALAGVKTDFSSYSDSE KATILRVAKLTNKDIFVPSETYNTSNPELKSALQEYIGNKGSLDKNDKVFIEKTKKEAPE TANEVVRDKLKAVIDGTAPVKVGSSSASKSENKGESYYDKAMKNLNSNPRVAIENFKKSL STEKIQDKKPEIYYNIASSYAKLGNKVEVTKYLRLLKQEFPNSEWAKKSEALTKLVK >gi|296154411|gb|ADVK01000035.1| GENE 9 7391 - 7723 179 110 aa, chain - ## HITS:1 COG:no KEGG:FN0464 NR:ns ## KEGG: FN0464 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 110 1 110 110 138 100.0 9e-32 MVGEIEGELKEDIENINFFLKNSQNEYSIKLENKELSLIRNSENKLNINLKFNGEKSNFF YEIDNFKQKFLVLGEKYSYNNKNKSFQFSYILFDENHNEINKIKIAIEYI >gi|296154411|gb|ADVK01000035.1| GENE 10 7805 - 8272 331 155 aa, chain - ## HITS:1 COG:FN0463 KEGG:ns NR:ns ## COG: FN0463 COG1576 # Protein_GI_number: 19703798 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 155 1 155 155 238 96.0 3e-63 MNINIICIGKIKDKYINDGIAEFSKRMTSFVSLNIIELKEYNKEDSINISIEKESSEIMK QISKSNSYNILLDLDGKEITSENMSKYIENLKNIGISSINFIIGGSNGVSKNVKNSVDMK LKFSHFTFPHQLMRLILLEQVYRWFAISNNIKYHK >gi|296154411|gb|ADVK01000035.1| GENE 11 8283 - 10196 2318 637 aa, chain - ## HITS:1 COG:FN0462 KEGG:ns NR:ns ## COG: FN0462 COG0323 # Protein_GI_number: 19703797 # Func_class: L Replication, recombination and repair # Function: DNA mismatch repair enzyme (predicted ATPase) # Organism: Fusobacterium nucleatum # 1 637 7 643 643 1134 99.0 0 
MSRIRILDESVSNAIAAGEVVENPTSMIKELIENSLDAGSKEIKLEVWNGGRDISISDSG CGMSKEDLLLSIERHATSKIFTKEDLFNIRTYGFRGEALSSIASVSKMILSSRTEDMQNG TQMNVLGGKVTNLKDIQKNIGTQIEIKDLFYNTPARKKFLRKENTEYLNIKDIFLREALA NPNVKFILNIEGKESIKTSGNGIENAILEIFGKNYLKNFSKFSLGYLGNANLFKANRDSI FVFINGRSVKSKIVEEAVIAAYHTKLMKGKYPTALIFLEVEPSEIDVNVHPSKKVVKFAN QNAIFDLIKGEIENFFTDDEDFISPYIEAENEVEENTKNNFLDINDFKDDMQDFSQLSVV GKEVYSKKDYNNIKVEKESFTDINKKINTFGSAGTTTESIINLNEIKEDSKNIENFDNSR EINDKVKDKYIFNQEDTGRGKIFDDFTSLKNIDFKVIGQVFDTFILVERNGLLEIYDQHI IHERILYEKLKQEYYNHSMSKQNLLVPIRFELDPREKQLALENIEIFSSFGFDIDDFDKN EILLRSIPTMNLRDSYENIFREILDNISKNKDVDIRENIIVSMSCKGAIKANHKLTIEEM YSMVAKLHEVGEYTCPHGRPIIVKMSLLDLEKLFKRK >gi|296154411|gb|ADVK01000035.1| GENE 12 10330 - 10896 858 188 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|34763431|ref|ZP_00144379.1| PROBABLE SIGMA(54) MODULATION PROTEIN; SSU ribosomal protein S30P [Fusobacterium nucleatum subsp. vincentii ATCC 49256] # 8 188 1 181 181 335 96 2e-91 MIKGSGSMKLSIHGRKITLTDAIRKYAEEKISKVEKFNDSIIKIDATLAASKLKTGNAHV TEILAYLSGSTLKATATETDLYASIDKAVDIMESLLKKHKEKRSRAKVQDDTRKKSYSFD YIVEPEEKLSDEKKLVRVYLPLKPMEISEAILQLEYLNRVFFAFTNTTTGKMAVVYKRKD GDYGVIED >gi|296154411|gb|ADVK01000035.1| GENE 13 10947 - 11915 1175 322 aa, chain - ## HITS:1 COG:FN0460 KEGG:ns NR:ns ## COG: FN0460 COG0113 # Protein_GI_number: 19703795 # Func_class: H Coenzyme transport and metabolism # Function: Delta-aminolevulinic acid dehydratase # Organism: Fusobacterium nucleatum # 1 322 1 322 322 635 99.0 0 MFTRTRRLRRNFLTRELVKNISIEKSSLIYPLFVCDGENIKSEIESMPQQFRYSLDRLNE ELDDLLKLGINNILLFGIPNHKDELGSQAYDENGIVQRTVRQIRRDYQDKFLIVTDVCMC EYTSHGHCGILHNHDVDNDETLEYIAKIALSHAKAGADIIAPSDMMDGRIGKIREVLDKN NFKDIPIMSYSVKYSSAYYGPFRDAADSAPSFGDRKTYQMDFRSYNNFYREVEADTQEGA DFIMVKPAMAYLDVIKSVSKISNLPIVAYNVSGEYSMVKAAAKNNWIDEQKIVMENMYAI KRAGADIIITYHAKDIAKWLSN >gi|296154411|gb|ADVK01000035.1| GENE 14 11928 - 12815 938 295 aa, chain - ## HITS:1 COG:FN0519 KEGG:ns NR:ns ## COG: FN0519 COG2849 # Protein_GI_number: 19703854 # Func_class: S Function unknown # Function: 
Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 41 295 45 343 343 72 27.0 8e-13 MKKKLIVLLMFLLSCIAVYSNDINSQSDSNSFSSENFLKKLTSLTSKNPEKTEKFANYLK NEMKGKKEVSYFIKIDKAEKKITVLAENGEILFDEVVSEEVINSFPTYQTKIKEVEEKGL VKTYVEASYMVKTVRKPNFKNEKVEEPEKKEKTELSKLEDNIKLLLKSYDVLNSSIASIY EARDKMVTVQRYRNKTMTITGEEDGQKIKIVYNFDNSFVGGAMKIFVDNVLISQSKIKNL LPDGEIKLFNASGKISGMATAKEGKLDGVAKLFDENGNTIEEVIYRNNKIVKRIK >gi|296154411|gb|ADVK01000035.1| GENE 15 12832 - 13659 973 275 aa, chain - ## HITS:1 COG:no KEGG:FN0458 NR:ns ## KEGG: FN0458 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 275 1 275 275 386 89.0 1e-106 MRKSFLFIFLIFLVNSIFSYSSIKAATDEELQKVFDKNKEIIVVYRNSIKDNIPKKYIEN IIPKEEFNILNDNHIKITIKYTQKNKSDILAEIYTPNEDLVVKTEIKLRKKILFNEIEKL VQEIEDNEASNQSDILNNKFSESFEENVKSFVSYSYYDDGSLNSKTEYDFDKKNITMLTY GDGKILSKTIAKYKGSIQDENMDIDFYENLTKTFIKMKVKKVESGQEVRTFYPSGKLESV GVYKGNILNGEYKEYDESGNLIKEILYKDGIEAKK >gi|296154411|gb|ADVK01000035.1| GENE 16 13674 - 14114 493 146 aa, chain - ## HITS:1 COG:no KEGG:FN0457 NR:ns ## KEGG: FN0457 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 6 146 1 141 141 180 87.0 2e-44 MDIKFLFYAFLCIIAFILVIRFGNKKSILIKEKKFYHKLLRLKLSTRILTIILAWIFSTT WIVIAFKYNYKGISILGMGPFVIICCGVVNIPIWEYLRKKIIKSSYSDSIKEWLNFFNTL FTAYLVFIEFFVIGSSIILARFLHWI >gi|296154411|gb|ADVK01000035.1| GENE 17 14134 - 15540 2045 468 aa, chain - ## HITS:1 COG:FN0456 KEGG:ns NR:ns ## COG: FN0456 COG1306 # Protein_GI_number: 19703791 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 89 468 1 380 380 702 97.0 0 MKFMKKLSLFIMFLAIGAFFSKNSYSKEKSSNSYYEYTTEKIKIYSDENKKENIGTLIKG TRVNVYDTKEIIKKSKDKKGNEIEKKTVMKKITYKDVHKTKVAWMEDGYLVPTLNEAVDQ RFKNLDFTEKEKKEYTDNKRVKVRGLYVSAHSVALKGRLDELIELAKKNNINAFIIDVKG DYGELTFPMSDEINKYIKSANKNPIIKDIEPVIKKLKDNGIYAIARIVSFKDTIYAKENP DKIIVYKDGGKAFTNSDGLVWVSAYDKNLWEYNVTVAKEAAKAGFNEIQFDYVRFPASNG 
GKLDKVLNYRNTDNLTKSEAIQKYLHYAKEELEPYQVYVSADIYGQVGSSSDDMALGQFW EAVSSEVDYVSPMMYPSHYGKGVYGLAVPDANPYKTIYQSTKDSINRNNNIDSPAIIRPW IQAFTAAWVKGHINYGPNEIKEQVKAMKDLGVDEYILWSPTNRYEKFF >gi|296154411|gb|ADVK01000035.1| GENE 18 15663 - 16202 908 179 aa, chain - ## HITS:1 COG:FN0455 KEGG:ns NR:ns ## COG: FN0455 COG1592 # Protein_GI_number: 19703790 # Func_class: C Energy production and conversion # Function: Rubrerythrin # Organism: Fusobacterium nucleatum # 1 179 1 179 179 318 98.0 3e-87 MDLKGSKTEKNLMAAFAGESQARNKYNFYAKIAKEEGYEQIAELFDITAGNEKEHAKLWF KALHGDTIPDTLTNLADAAAGENYEWTDMYTKFAEEAKEEGFLKLAKQFEMVAKIEKEHE ERYRKLLENIKNGTVFHSEEKVAWECMECGHLHYGNDAPGKCPVCGADKAKFKRRAVNY >gi|296154411|gb|ADVK01000035.1| GENE 19 16453 - 17928 2330 491 aa, chain - ## HITS:1 COG:FN0454 KEGG:ns NR:ns ## COG: FN0454 COG1012 # Protein_GI_number: 19703789 # Func_class: C Energy production and conversion # Function: NAD-dependent aldehyde dehydrogenases # Organism: Fusobacterium nucleatum # 1 491 1 491 491 984 100.0 0 MENILKKSYKMFINGEWIDSSNGVMVKSYAPYNNELLSEFPDASESDIDFAIKSAKEAFK TWRKTTVKERARILNKIADIIDENKDLLATVETMDNGKPIRETKLVDIPLAATHFRYFAG CILADEGQATVLDEKFLSLILREPIGVVGQIIPWNFPFLMAAWKLAPALAAGDTVVLKPS STTTLSLLVLMELIQDVIPKGVVNLVTGKGSTAGEFLKNHPDLDKLAFTGSTAVGRDIAL AAAEKLIPATLELGGKSANIILDDADIEKALEGAQLGILFNQGQVCCAGSRIFVQEGIYD EFISKLVKKFENIKIGNPLDPTTIMGSQIDARQVKTILDYVEIAKQEGGVILTGGVKYTE NGCDKGNFVRPTLITNVNNGCRVSQEEIFGPVAVVIKFKTDDEVIAQANDSEYGLGGAVF TKNINRALKLAREIQTGRVWINTYNQIPEHAPFGGYKKSGIGRETHKVILEHYTQMKNIL IDLEEGTSGLY >gi|296154411|gb|ADVK01000035.1| GENE 20 18094 - 19848 2205 584 aa, chain - ## HITS:1 COG:FN0453 KEGG:ns NR:ns ## COG: FN0453 COG0006 # Protein_GI_number: 19703788 # Func_class: E Amino acid transport and metabolism # Function: Xaa-Pro aminopeptidase # Organism: Fusobacterium nucleatum # 1 584 1 584 584 1072 98.0 0 MEINKRIEKARKVMKKYKVDAYIVTSSDYHQSEYIDDYFKGREYLSGFTGSAGILVIFKD EACLWTDGRYHIQAEKQLKDSEVKLFKQGNLGVPTYQEYIISKLAENSKIGIDAKILLSS DITEILSKKKYKMVDFDLLAEVWDKRKKLPNGKIFILEDKYTGKTYKEKVKEIRATLKEK 
GANYNIISSLDDIAWIYNFRGCDIIHNPVALSFTIISEKKSILYINEKKLDKKAQKYFKD NKVEIKEYFEFFKDIKKIKGNILVDFNKISYAIYEAINKNTLINSMNPSTYLKAHKNKTE IANTKKIHIQDGVAIVKFMYWLKNNYKKENITEFSAEQEINSLRKEIEGYLDLSFHTISA FGKNAAMMHYSAPEKKSAKIGDGVYLLDSGGTYLKGTTDITRTFFLGKVGKQEKIDNTLV LKGMLALSRAKFLFGATGTNLDILARQFLWNVGIDYKCGTGHGVGHILNVHEGPHGIRFQ YNPQRLEVGMIVTNEPGAYIEGSHGIRIENELLVKEAYETEYGKFLEFETITYAPIDLDG IVKTLLTKEEKQQLNEYHSEVYKKLSPYLNKKEKEFLKEYTKSI >gi|296154411|gb|ADVK01000035.1| GENE 21 19873 - 21696 2597 607 aa, chain - ## HITS:1 COG:FN0452 KEGG:ns NR:ns ## COG: FN0452 COG0449 # Protein_GI_number: 19703787 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains # Organism: Fusobacterium nucleatum # 1 607 1 607 607 1123 99.0 0 MCGIIGYSGSKANAVEVLLEGLEKVEYRGYDSAGIAFVTDSGIQIEKKEGKLENLKNHMK NFEVLSCTGIGHTRWATHGIPTDRNAHPHYSESKDVALIHNGIIENYVEIKKELLEQGVK FSSDTDTEVVAQLFSKLYDGDLYSTLKKVLKRIRGTYAFAIIHKDFPDKMICCRNHSPLI VGLGEHQNFIASDVSAILKYTRDIIYLEDGDVVLVTKDNVTVYDKDEKEVKREVKKVEWN FEQASKGGYAHFMIKEIEEQPEIIGKTLNVYTDKEKNVKFDEQLEGINFHDIDRIYVVAC GTAYYAGLQGQYFMKKLLGIDVFTDIASEFRYNDPVITNKTLAIFVSQSGETIDTLMSMK YAKEKGARTLAISNVLGSTITREADNVIYTLAGPEISVASTKAYSSQVLVMYLLSLYMGA KLGKIEEKDYQKYISDISLLKENIVKLISEKEKIHNIAKKIKDIKNGFYLGRGIDEKVAR EGSLKMKEINYIHTEALPAGELKHGSIALIEKGVLVVAISTNLEMDEKVVSNIKEVKARG AYVIGACKEGSLVPEVVDDVIQVKDSGELLTPVLTVVGLQYLAYYTSLEKGYDVDKPRNL AKSVTVE >gi|296154411|gb|ADVK01000035.1| GENE 22 21984 - 22154 408 56 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MAQLDRASDYGSEGCGFDSCQVRQIKSLSGGIGRRTRLKIWNSSECAGSSPASGTI >gi|296154411|gb|ADVK01000035.1| GENE 23 22520 - 23293 1362 257 aa, chain + ## HITS:1 COG:FN0439 KEGG:ns NR:ns ## COG: FN0439 COG1540 # Protein_GI_number: 19703777 # Func_class: R General function prediction only # Function: Uncharacterized proteins, homologs of lactam utilization protein B # Organism: Fusobacterium nucleatum # 1 257 1 257 257 504 97.0 1e-143 
MKFFVDLNSDIGEGYGAYKLGMDEEITKCVTSVNCACGWHAGDPLIMDKTVKIAKENNVA VGAHPGYPDLLGFGRRKMVVTPNEARTYMLYQLGALNAFAKANGTKLQHMKLHGAFYNMA AVEKDLADAVLDGIEQFDKDIIVMTLSGSYMAKEGKRRGLKVAEEVFADRGYNPDGTLVN RNLPGAFVKEPDEAIARVIKMIKTKKVTAINGEEIDIAADSICVHGDNPKAIEFVDRIRK SLIADGIEVKSLYEFIK >gi|296154411|gb|ADVK01000035.1| GENE 24 23308 - 24495 1677 395 aa, chain + ## HITS:1 COG:FN0438 KEGG:ns NR:ns ## COG: FN0438 COG1914 # Protein_GI_number: 19703776 # Func_class: P Inorganic ion transport and metabolism # Function: Mn2+ and Fe2+ transporters of the NRAMP family # Organism: Fusobacterium nucleatum # 1 395 1 395 395 606 98.0 1e-173 MEKKNNLSVLLGAAFLMATSAIGPGFMTQTAVFTKDMGATFAFVILASVLMSFVAQLNVW RVLAVSKMRGQDIANNVLPGLGYFITFLVCLGGLAFNIGNVGGAALGFQVLFDLDLKIAA LVSGALGVIIFSFKSASKLMDKLTQILGAMMILLIGYVAFSTNPPVGTAIKETFIPSSVN LIAIITLIGGTVGGYIMFSGGHRLIDAGIVGEENLPQVNKSAILGMGVATIVRVFLFLAV LGVVSLGNQLDAGNPAADAFKIAAGTVGYKIFGLVFLAAALTSIVGAAYTSVSFLKTLFK VVKDYENLFIIGFIVVSTLILIFLGKPVKLLVLAGSLNGLILPITLAITLIASKKEGIVG KYKHSNILFLLGWVVVLVTAYIGVQSLSKLAELFA >gi|296154411|gb|ADVK01000035.1| GENE 25 24509 - 25258 1019 249 aa, chain + ## HITS:1 COG:FN0437 KEGG:ns NR:ns ## COG: FN0437 COG2049 # Protein_GI_number: 19703775 # Func_class: E Amino acid transport and metabolism # Function: Allophanate hydrolase subunit 1 # Organism: Fusobacterium nucleatum # 1 249 14 262 262 463 97.0 1e-130 MENSVKFLFSGDSALVIEFGNEISVDINKKIRKMMDNIKKENIDGIVELVPTYCSLLINY DVLKVDYQSLVEKLKTLLNDDNETIEDEEVTLIEIPTLYNDEFGPDLSYVAEYNKLSKEE VIKIHTGTDYLVYMLGFMPGFTYLGGMSEKIATPRLESPRLQIYSGSVGIAGKQTGMYPS MSPGGWRIIGRTPLKLYNPDSETPVYISSGDYIRYVSISEEEYNNILKKVENNEYKLNIR KVKRGELNA >gi|296154411|gb|ADVK01000035.1| GENE 26 25251 - 26261 1445 336 aa, chain + ## HITS:1 COG:FN0436 KEGG:ns NR:ns ## COG: FN0436 COG1984 # Protein_GI_number: 19703774 # Func_class: E Amino acid transport and metabolism # Function: Allophanate hydrolase subunit 2 # Organism: Fusobacterium nucleatum # 1 336 1 336 336 626 96.0 1e-179 MPNIKVHKPGLCTTVQDIGRIGYQQFGIPVSGVMDEFAFTVANYLVESDKNNAVLEIPFL 
GPTLEFDFDVTIAITGADIQPKINNQDIKMWQSINVKKGDTLSFGGLKTGIRTYLAFSAE INVPIVMGSKSTLLKSKLGGFDGRQLKMGDIINFKNVKVLSKKNILDKKYLPVYSHNQNI RIVLGPQDNYFDENSIKTLLENKYQVTKDADRMGMRLLGEVIKHKDKADIISDAAVFGSI QVPGNGQPIILLADRQTTGGYTKIATVIKADLPKLAQMVPNDTIEFSLINIEEAQKEYKE FYKILDEIKESFVVKPKVYTEKQIYVIKKLFGNRKK >gi|296154411|gb|ADVK01000035.1| GENE 27 26321 - 27028 906 235 aa, chain - ## HITS:1 COG:FN0435 KEGG:ns NR:ns ## COG: FN0435 COG0813 # Protein_GI_number: 19703773 # Func_class: F Nucleotide transport and metabolism # Function: Purine-nucleoside phosphorylase # Organism: Fusobacterium nucleatum # 1 235 7 241 241 441 99.0 1e-124 MSIHIGAKLEDIAETILLPGDPKRAKWIAENYLENAFCYTDIRGMLGFTGTYKGKMISIQ GTGMGIPSISIYITELMKDYGVKNLIRVGSAGSYQEDIKVRDVVIAMSTSTDSNINNRKF NGANFSPTANFELFSMALKVAEEKNIKIKAGNVLTSDEFYSDNSDYYKKWADFGVLAVEM ETAGLYTLAAKYKAKALSILTISDSLVSPEITSAEEREKTFSEMIELALETAIRI Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:41:51 2011 Seq name: gi|296154377|gb|ADVK01000036.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00044, whole genome shotgun sequence Length of sequence - 43405 bp Number of predicted genes - 33, with homology - 33 Number of transcription units - 15, operones - 9 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 2/0.000 + CDS 564 - 2474 2976 ## COG1960 Acyl-CoA dehydrogenases 2 1 Op 2 1/1.000 + CDS 2499 - 3716 1796 ## COG0426 Uncharacterized flavoproteins + Term 3734 - 3777 1.2 + Prom 4020 - 4079 17.7 3 2 Tu 1 . + CDS 4108 - 5565 2159 ## COG1757 Na+/H+ antiporter + Term 5580 - 5636 3.2 - Term 5567 - 5623 6.2 4 3 Op 1 1/1.000 - CDS 5643 - 9209 4649 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit 5 3 Op 2 1/1.000 - CDS 9284 - 10621 1508 ## COG1757 Na+/H+ antiporter - Prom 10666 - 10725 6.4 6 4 Tu 1 . - CDS 10768 - 11955 1836 ## COG0626 Cystathionine beta-lyases/cystathionine gamma-synthases + Prom 11952 - 12011 8.5 7 5 Tu 1 . 
+ CDS 12070 - 13497 1453 ## COG1167 Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs + Term 13522 - 13574 13.2 8 6 Op 1 1/1.000 - CDS 13580 - 14248 819 ## COG0235 Ribulose-5-phosphate 4-epimerase and related epimerases and aldolases 9 6 Op 2 . - CDS 14257 - 15627 852 ## COG1167 Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs - Prom 15811 - 15870 9.2 + Prom 15703 - 15762 15.1 10 7 Op 1 1/1.000 + CDS 15805 - 16962 1375 ## COG1979 Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family 11 7 Op 2 1/1.000 + CDS 16975 - 18270 1645 ## COG1757 Na+/H+ antiporter 12 7 Op 3 4/0.000 + CDS 18300 - 19349 1243 ## COG0182 Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family 13 7 Op 4 . + CDS 19364 - 20518 1268 ## COG4857 Predicted kinase + Term 20530 - 20587 10.2 - Term 20515 - 20575 13.7 14 8 Tu 1 . 
- CDS 20576 - 21790 1529 ## COG1171 Threonine dehydratase - Prom 21830 - 21889 11.9 - Term 21989 - 22038 -0.1 15 9 Tu 1 1/1.000 - CDS 22095 - 22610 731 ## COG1288 Predicted membrane protein - Prom 22636 - 22695 6.8 16 10 Op 1 1/1.000 - CDS 22779 - 23303 663 ## COG1288 Predicted membrane protein 17 10 Op 2 1/1.000 - CDS 23321 - 23512 239 ## COG1288 Predicted membrane protein - Prom 23574 - 23633 12.2 - Term 23588 - 23631 0.7 18 11 Op 1 1/1.000 - CDS 23667 - 25121 1872 ## COG2195 Di- and tripeptidases - Prom 25155 - 25214 5.5 19 11 Op 2 1/1.000 - CDS 25220 - 26101 1307 ## COG3643 Glutamate formiminotransferase 20 11 Op 3 1/1.000 - CDS 26119 - 27654 2382 ## COG2986 Histidine ammonia-lyase 21 11 Op 4 1/1.000 - CDS 27675 - 28229 804 ## COG3404 Methenyl tetrahydrofolate cyclohydrolase 22 11 Op 5 1/1.000 - CDS 28245 - 29465 1387 ## COG1228 Imidazolonepropionase and related amidohydrolases 23 11 Op 6 1/1.000 - CDS 29478 - 30788 1988 ## COG2056 Predicted permease 24 11 Op 7 1/1.000 - CDS 30807 - 32105 976 ## COG3314 Uncharacterized protein conserved in bacteria - Prom 32134 - 32193 6.3 - Term 32127 - 32174 11.4 25 12 Tu 1 1/1.000 - CDS 32195 - 33949 2974 ## COG2987 Urocanate hydratase - Prom 34015 - 34074 11.4 - Term 34071 - 34125 12.5 26 13 Op 1 . - CDS 34137 - 37193 2826 ## COG3899 Predicted ATPase 27 13 Op 2 . - CDS 37221 - 38177 815 ## FN1399 putative cytoplasmic protein - Prom 38250 - 38309 8.2 - Term 38235 - 38273 5.1 28 14 Op 1 2/0.000 - CDS 38312 - 39724 792 ## PROTEIN SUPPORTED gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 29 14 Op 2 . - CDS 39782 - 40696 1206 ## COG2066 Glutaminase - Prom 40827 - 40886 13.3 + Prom 40832 - 40891 10.9 30 15 Op 1 . + CDS 41052 - 41351 337 ## FN1395 hypothetical protein 31 15 Op 2 8/0.000 + CDS 41416 - 41727 331 ## COG2739 Uncharacterized protein conserved in bacteria 32 15 Op 3 23/0.000 + CDS 41738 - 43072 2134 ## COG0541 Signal recognition particle GTPase 33 15 Op 4 . 
+ CDS 43123 - 43386 436 ## PROTEIN SUPPORTED gi|19704724|ref|NP_604286.1| SSU ribosomal protein S16P Predicted protein(s) >gi|296154377|gb|ADVK01000036.1| GENE 1 564 - 2474 2976 636 aa, chain + ## HITS:1 COG:FN1424_1 KEGG:ns NR:ns ## COG: FN1424_1 COG1960 # Protein_GI_number: 19704756 # Func_class: I Lipid transport and metabolism # Function: Acyl-CoA dehydrogenases # Organism: Fusobacterium nucleatum # 1 377 1 377 377 731 100.0 0 MLFKTTEEHEALRMQVREFVETEVKPIAAILDKENKFPHEAIKKFGQMGFMGLPYPKEYG GAGKDILSYAIAVEELSRVDGGTGVILSAHVSLGSYPIFAFGTEEQKKKYLTPLAKGEKL GAFGLTEPNAGSDAGGTETTAVKEGDYYILNGEKIFITNADVAETYVVFAVTTPDIGTKG ISAFIVEKGWEGFTFGDHYDKLGIRSSSTCQLLFNNVKVPKENLLGKEGDGFKIAMSTLD GGRIGIAAQALGIAQGAFEHALEYAKEREQFGKPIAFQQAISFKLADMATKLRTARFLIY SAAELKEHHEPYGMESAMAKQYASDIALEVVNDAVQIFGGSGYLKGMEVERAYRDAKITT IYEGTNEIQRVVIAAHLIGKAPKSDAIAVAKKKKGPVTGPRKNIIFKDGSTKEKVAALVA ALKADGYDFTVGIPLDTPIGKSERVVSAGKGIGDKKNMKLIENLAKQAGASVGCSRPVAE TLQYLPLDRYVGMSGQKFVGNLYIACGISGALQHLKGIKDATTIVAINTNANAPIFKNAD YGIVGDVAEILPLLTKELDNGEAKKDAPPMKKMKRVIPKVVYSPHVYVCSGCGHEYNPEI GDEDSDIKPGTRFKDLPEDWTCPDCGDPKSGYIDAK >gi|296154377|gb|ADVK01000036.1| GENE 2 2499 - 3716 1796 405 aa, chain + ## HITS:1 COG:FN1423 KEGG:ns NR:ns ## COG: FN1423 COG0426 # Protein_GI_number: 19704755 # Func_class: C Energy production and conversion # Function: Uncharacterized flavoproteins # Organism: Fusobacterium nucleatum # 1 405 1 405 405 830 100.0 0 MHNVRKITEDLYWIGANDRRLALFENIHPIPEGVSYNSYMLLDKETVVFDTVDWSVTRQY IENIEYLLNGRELDYLVVHHMEPDHCGSIEELALRYPKMKIISSEKGFMFMRQFGYKSIN GHQLIEAKEGDKFKFGKHEIVFLEAPMVHWPEVLVSFDTTNGALFSADAFGSFKSLDGRL FNDEVNWDRDWLDEGRRYLTNIVGKYGPHIQHLLKKAGPIVDKIKFICPLHGVVWRNDFG YLIDKYDKWSRYEPEEKGILIAYASMYGNTENAVEILAAKLAEKGITNIKMFDVSNTHVS YLISNVFKYSHIVIASPTYNLGIYPVIHNFVMDMKALNLQNRTVAIVENGSWARKSGDLL QEFFETQIKDITVLNERVGLTSSANNVNLDEMDTLVDALVESLNK >gi|296154377|gb|ADVK01000036.1| GENE 3 4108 - 5565 2159 485 aa, chain + ## HITS:1 COG:FN1422 KEGG:ns NR:ns ## COG: FN1422 COG1757 # Protein_GI_number: 19704754 # Func_class: C Energy 
production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 13 485 1 473 473 832 99.0 0 MTLARKPKIWEALVPIVGMALIIVYSMLVLKIDPHIPIVISTILAGFMALKVGCTWSEIS TGMIESVYRAVEALIIVMIVGMLIGSWVLAGSVPTMIYYGLELISPRFFLPTGCILCAIV SVATGSAWTSGGTIGVALMGIGTGLGINPALTAGMVISGAYFGDKISPLSDSTNVAAAAA ETDLYLHVRSMMYTTVPSFIIALLLYLIIGLRYQASSIDLENIKLIKNALSSSFLISPWL LIPPIVVLVTAIKRIPAIPSLLLATTVGGIFAMFFQNVAFVDVLNVLQNGYVGNTNVPIV DKLLTRGGVNGMLWTISLIIFALCFGGILEKAKFTEVILERIVKHIHSVGSLVATTIATG ILCDFVLTDQYLANIIPGRMYYKVYDDMGLERYYLSRTLEDGGSLWSPMFPWNGCGAYQS ATLGVSSFSYFPYAFLSLINPIVSIFMAYMGIAVFRKKIKEKGIEVIEVSNISELDKIRE KDKSD >gi|296154377|gb|ADVK01000036.1| GENE 4 5643 - 9209 4649 1188 aa, chain - ## HITS:1 COG:FN1421_1 KEGG:ns NR:ns ## COG: FN1421_1 COG0674 # Protein_GI_number: 19704753 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Fusobacterium nucleatum # 1 410 3 412 412 833 100.0 0 MKRVMQTMDGNQAAAYASYAFTEVAGIYPITPSSPMAEYVDEWAAKGMKNIFDVPVKLVE MQSEGGAAGTVHGSLEAGALTTTYTASQGLLLKIPNMYKIAGELLPGVIHVSARSLSVQA LSIFGDHQDVYATRQTGFTIMASGSVQEVMDMGTVAHLTAIKSRVPVLHFFDGFRTSHEI QKIELMDYDVCKKLVDYDEIQKFRDRALNPEHPVTRGTAQNDDIYFQTREAQNKFYDAVP DIAAYYMEEISKETGREYKPFKYRGAADADRVIVAMASVCQTAEETVDYLVEKGEKVGLV TVHLYRPFSEKYFFNVLPKTVKKIAVLERSKEQGAPGEPLLLDVKSIFYDKENAPIIVGG RYGLSSKDTTPAQIKAVFDNLLQDKPKTNFTIGIVDDVTFTSLEIGERLNVADPSTKACL FFGLGADGTVGANKNSIKIIGDKTDLYAQGYFAYDSKKSGGVTRSHLRFGKKPIRSTYLV SSPSFVACSVPAYLKQYDMTSGLKKGGKFLLNCVWDKDEVLEHIPDNIKYDLAKSESKFY IINATKLAHEIGLGQRTNTIMQSAFFKLAEIIPYEEAQKYMKEYALKSYGRKGDDVVQLN YKAIDVGASGLVEIPVDPNWANLKVEAIQKIDKNNDTSNCKTELLTSFVKDIVEPINAIK GNDLPVSAFLGREDGTFENGTAAFEKRGVAVNVPIWNLDKCIECNQCAYVCPHAAIRAFL ITDEEKAASPVEFATKKANGKGLEDLTYRIQVTPLDCTGCGSCANVCPAKALDMNPIAVA LENHEDEKAAYIYSNVTYKTDKMPTSTVKGSQFSQPLFEFNGACPGCGETPYLKVISQMF GDRMMVANASGCSSVYSGSAPSTPYTKNCCGEGPTWASSLFEDNAEYGFGMHVGVEALRD RIQHIMEVSMDKITPALQGLFREWIENRNYAAKTREISPKILNALEGNNETYAKDIIGLK 
QYLIKKSQWIVGGDGWAYDIGYGGLDHVLASKEDINVIVMDTEVYSNTGGQSSKATPTAA VAKFAAAGKPLKKKDLAAICMSYGHIYVAQVSMGANQQQFLKAIQEAESYNGPSIIIAYS PCINHGIKKGMSKSQTEMKLATECGYWPIFRYNPLLEKDGKNPLQLDSKEPKWELYQDYL MGETRYMTLKKTNPDEANDLFEKNMFDAQRRWRQYKRLASLDYSDEKR >gi|296154377|gb|ADVK01000036.1| GENE 5 9284 - 10621 1508 445 aa, chain - ## HITS:1 COG:FN1420 KEGG:ns NR:ns ## COG: FN1420 COG1757 # Protein_GI_number: 19704752 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 1 445 1 445 445 753 99.0 0 METKASFKGLIPFLVFILLYLGTGIFLHIAGVELAFYQLPGPVAAFAGIVVAFIIFNGTI QEKFNTFLEGCGHPDIITMCIIYLLAGAFAVVSKAMGGVDSTVNLGITYIPPHYIAVGLF IIGAFISTATGTSVGSIVALGPIAVGLGEKSGVPMALILAAVMGGAMFGDNLSVISDTTI AATKTQGVEMKDKFRINLYIALPAAILTIILLFFFARPDVIPEAVTHEYNLVKVFPYIFV LVMALAGVNVFVVLTSGILLSGIIGFIYGDFTLLGYGKEIYNGFTNMTEIFVLSLLTGGM AQMVTRQGGIQWVIDTVQKFIVGKKSAKVGVGLLVSLADIAVANNTVAIIITGGISKKIS EKNNVDLRESAAILDIFSCIFQGLIPYGAQMLILLGFAGDKVAPTQLIPLLWYQLLLGIF TLIYIFVPQISKKALSILDKNIEKK >gi|296154377|gb|ADVK01000036.1| GENE 6 10768 - 11955 1836 395 aa, chain - ## HITS:1 COG:FN1419 KEGG:ns NR:ns ## COG: FN1419 COG0626 # Protein_GI_number: 19704751 # Func_class: E Amino acid transport and metabolism # Function: Cystathionine beta-lyases/cystathionine gamma-synthases # Organism: Fusobacterium nucleatum # 1 395 1 395 395 776 99.0 0 MEMKKSGLGTTAIHAGTLKNLYGTLAMPIYQTSTFIFDSAEQGGRRFALEEAGYIYTRLG NPTTTVLENKIAALEEGEAGIAMSSGMGAISSTLWTVLKAGDHVVTDKTLYGCTFALMNH GLTRFGVEVTFVDTSNLEEVKNAMKKNTRVVYLETPANPNLKIVDLEALSKIAHTNPNTL VIVDNTFATPYMQKPLKLGVDIVVHSATKYLNGHGDVIAGLVVTRQELADQIRFVGLKDM TGAVLGPQEAYYIIRGLKTFEIRMERHCKNARTIVDFLNKHPKVEKVYYPGLETHPGYEI AKKQMKDFGAMISFELKGGFEAGKTLLNNLKLCSLAVSLGDTETLIQHPASMTHSPYTKE EREAAGITDGLVRLSVGLENVEDIIADLEQGLEKI >gi|296154377|gb|ADVK01000036.1| GENE 7 12070 - 13497 1453 475 aa, chain + ## HITS:1 COG:FN1418 KEGG:ns NR:ns ## COG: FN1418 COG1167 # Protein_GI_number: 19704750 # Func_class: K Transcription; E Amino acid transport and metabolism # Function: 
Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs # Organism: Fusobacterium nucleatum # 1 475 1 475 475 898 100.0 0 MRKKLIRNSDVTISTQLYEMLRQDMLGNKWKENDKFYSVRQISIKYKVNLNTVLKVIRML EEEGYLYSIKGKGCFVKKGYNLDIEKRMTPILNTFRFGQNSKDMEINFSNGGPPKEYFPI KEYKEIVNEVLSNETESRYLMAYQNIQGLESLRETLAEFIGKYGIRREKDDIIICSGTQI ALELISTAFGISPKKTVLLSDPTYQNAVHILKSYCNIENIDMKDDGWDMQEFEELLKRKK IDFVYMMTNFQNPTGISWSFQKKKKMIELSLKYDFYIIEDECFSDFFYNSRECPKSLKAL DKYERVFFIKTFSKVVMPALGLTMLIPPKKYIDSFSLNKYFIDTTTSGINQKFLEIFIKR GLLEKHLEKLRLNLKMKMEYMLSELQKIKHLEIIHIPKGGFFIWVNLANYINSEKFYYKC RLRGLSVLPGFIFYSSTEETTSKIRISIVSSKIEEMKKGLEIIQDVLNNCDFKLN >gi|296154377|gb|ADVK01000036.1| GENE 8 13580 - 14248 819 222 aa, chain - ## HITS:1 COG:FN1417 KEGG:ns NR:ns ## COG: FN1417 COG0235 # Protein_GI_number: 19704749 # Func_class: G Carbohydrate transport and metabolism # Function: Ribulose-5-phosphate 4-epimerase and related epimerases and aldolases # Organism: Fusobacterium nucleatum # 1 222 1 222 222 431 99.0 1e-121 MLLENERKEIIVYGKKMITDGLTRGTGGNISICDAEQKLMAITPSGIDYFKLTPEDIVII DVETGKIVDGNRVPSSESDMHRIFYKYRKGVFSVVHTHSKYATAISCTDIEGLPAINYLL AVAGTDVPCAEYATYGTIKLAKNAFKAMEDKKAVLLSNHGMIAIGKNLTEAYNIAENVEF CSELFCISKSIGSPKILSKEEMLNMIERFKDYGKRIEEHEEI >gi|296154377|gb|ADVK01000036.1| GENE 9 14257 - 15627 852 456 aa, chain - ## HITS:1 COG:FN1416 KEGG:ns NR:ns ## COG: FN1416 COG1167 # Protein_GI_number: 19704748 # Func_class: K Transcription; E Amino acid transport and metabolism # Function: Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs # Organism: Fusobacterium nucleatum # 1 456 1 456 456 788 99.0 0 MKYEEVIQDIKVKIIQGYWKVDDKLPSLRQLARKYETSSNTIAFAFRILRDEGYIFSIPA VGYFIKKRKDFQISKQVKTILKSYYDAENKNDNLINFTSINLLSKYINNNFISYLFETQK KNFNKNKTLENKSSLVSCISNLLEEDEIFTLDENMIITSSSQLSIEMIIRLFSNKNKLTI ALSDPSHYSVINILEKLVNIRGVHLLDDGWDFKDFENILSLEKIDLVYVTPNFHDPSGIC 
WSEDKKMYLLELAEKFDFYIIEEDNYSSLSYSQKYLSFKSFERIGKERIFYIRDFSTLLG SFLGLTCVIVPPKLKDKFLMEKIAFSIIPSQVQQNMLETFINNGYFFFFLNKLKNILNNR LNYLVNELKNIAELKIMHQPEGGFFIWLKLNRQIDEDIFYELCKNNGLLILPGYIFYEDN RNNAKFRISFASTSLYEIEIGIKKIKKIISYLKEIN >gi|296154377|gb|ADVK01000036.1| GENE 10 15805 - 16962 1375 385 aa, chain + ## HITS:1 COG:FN1415 KEGG:ns NR:ns ## COG: FN1415 COG1979 # Protein_GI_number: 19704747 # Func_class: C Energy production and conversion # Function: Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family # Organism: Fusobacterium nucleatum # 1 385 1 385 385 769 100.0 0 MDNFNYKNDTKIIFGKDNYSEIGKNIKIFSKKTPKILLHYEADGELIKKLGIYEKVISSL KEFDIEFIELGGVVPNPRLSLVYEGIKICKEENITFILAVGGASVIDSAKAISLGAVDNG DVWDFFTAKRIPQDTLGIGVVLTIPGAGSEMSESSIITDENKKQKAVCDTEVNFPKFAIL NPEVCYTIPDRLMAAGIVDILSHLMERYFTKSIDTALSDSLIEATMKIVIKYGPLLMKDR KNYNYCSQIMWAATMAHNGMIACGRVADWASHRIEHEISGIYDLTHGIGMAIIFPAWMKY TKNIRPQIFEKFFKEVFNTVNIDEGINKLEEFFKSLGINLKLSDYGITEEYFSLMAEKAL GNSETLGRFMQLNKQDIINILNLAK >gi|296154377|gb|ADVK01000036.1| GENE 11 16975 - 18270 1645 431 aa, chain + ## HITS:1 COG:FN1414 KEGG:ns NR:ns ## COG: FN1414 COG1757 # Protein_GI_number: 19704746 # Func_class: C Energy production and conversion # Function: Na+/H+ antiporter # Organism: Fusobacterium nucleatum # 13 431 1 419 419 640 99.0 0 MKKKGSFLGLVPLLIFLLIYASTGIFTDKIDNMPLLVAFTITVAIALCFNNPNKEKISFE QKVEIFCKGASDSTLLLLVIIFLLAGAFYSVADAMGAVKSMVNLGLTLLPLKMLLPGLFI IGCILSFAMGTSMGTVSALTPIAVGIANETGISLPLICGVVVGGAMFGDNLSFISDTTIA ATRTQEVEMKDKFKVNFLIVLPAVILNIIVLSFIGGEGVQGAVYEYSLLNLVPYVSIIVL ALIGINVIIVLTIGVVLGLIIGLINNSFVFIEIFSVVQRGMGWMENMAIIALVVGGVVAI MEYLGGIDYLLENLTTKIKSKKGAEFGMSILVSLLCLATTNNTVSIITAGPLAKDIADKF SVDRRRVAGLLDIFSSAFQGLMPYAGQILVAAAMAQISPVSIVPYSWYSMFMIVMGILAI ITGIPKLKEKN >gi|296154377|gb|ADVK01000036.1| GENE 12 18300 - 19349 1243 349 aa, chain + ## HITS:1 COG:FN1413 KEGG:ns NR:ns ## COG: FN1413 COG0182 # Protein_GI_number: 19704745 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted translation initiation 
factor 2B subunit, eIF-2B alpha/beta/delta family # Organism: Fusobacterium nucleatum # 1 349 1 349 349 674 99.0 0 MQRMDEGLAFLLQYENVAWYQDGKVKILDRRVYPREIKFVICTNYLEVRQAIADMVTQSA GPYTAAGMGMALAAYQSKNLSKEEQIKFLEKASYDISTARPTTINRMKLVTESCLEVAKQ AITENKNPVEVIFQRTYDSLERRYRRMSLVAKNLVSLFPNKGKVLTQCFGETIVGCMGRE IKNQNKDIEFFCPETRPYLQGARLTASVLKEQGFKTTVITDNMVAWTILQKNIDLFTSAA DTICMDGYIVNKVGTLQIAILCKHFGIPYFVTGIPDIDKLKKDIVIEERNPNEVLEFNNL KNTLNGVEAYYPAFDITLPYLINGVVTDKKIFSPYNLDEYFKDEVEQYY >gi|296154377|gb|ADVK01000036.1| GENE 13 19364 - 20518 1268 384 aa, chain + ## HITS:1 COG:FN1412 KEGG:ns NR:ns ## COG: FN1412 COG4857 # Protein_GI_number: 19704744 # Func_class: R General function prediction only # Function: Predicted kinase # Organism: Fusobacterium nucleatum # 1 384 1 384 384 712 99.0 0 MKYQEHFLLDCDEVISYVKEKNLFQGNANLTVKEIGDGNINYIFKVENKIDGKSIILKQA DKLLRSSGRPLDLTRSKIEANILRIENNLAPHFVPEIYFYDEIMCVLAMEDISEYKNLRT ELIAGKIFPSFVDNISEFLSRTLLLTTDLFMNKFEKKKNVKEFTNPELCDISECLVFTEP YDNNRNRNIITAGNEEFVENTLYKNEDLHFTILKLREKFMNYSQSLIHGDLHSGSIFINE KGIKIIDPEFSFYGPMAYDIGNVIGNLYFPLYRAKFFMEDSKKKEEFINWLEKCILDIPI LFSEKCKLLWEKYSNDKLLKNKKFRDYYIENIVKDSLAYAGTEMIRRTVGDAKVLELTSL ENSEKKLQLEKELISKAVSMIMKN >gi|296154377|gb|ADVK01000036.1| GENE 14 20576 - 21790 1529 404 aa, chain - ## HITS:1 COG:FN1411 KEGG:ns NR:ns ## COG: FN1411 COG1171 # Protein_GI_number: 19704743 # Func_class: E Amino acid transport and metabolism # Function: Threonine dehydratase # Organism: Fusobacterium nucleatum # 1 404 1 404 404 654 99.0 0 MAKLEAFIKAKEKLSKVLLETQLIYSPIFSKESGNEVFIKPENLQKTGSFKIRGAYNKIS NLTDAEKKRGVIASSAGNHAQGVAYGAKESGIKAIIVMPKSTPLIKVESTKQYGAEVILH GDVYDDAFKKAKELEEKEGYVFVHPFNDEDVLDGQGTIALEILEELPETDIILVPIGGGG LISGIACAAKILKPEIKIIGVEPEGAASAYEAIKENKVVELKEANTIADGTAVKKIGDLN FEYIKKYVDKIITVSDYELMEAFLLLVEKHKIIAENSGILSIAATKKLKEKNKKVVSVIS GGNIDVLMISSMINKGLIRRDRIFNFTVSIPDKPGELAKVVDLIAEQGANVIKLEHNQFK NLSRFKDIELQITVETNGSEHVQNLTQAFEEKGYEIVRIKSKIN >gi|296154377|gb|ADVK01000036.1| GENE 15 22095 - 22610 731 171 aa, chain - ## HITS:1 COG:FN1410 KEGG:ns 
NR:ns ## COG: FN1410 COG1288 # Protein_GI_number: 19704742 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 18 171 10 163 163 258 100.0 3e-69 MDLNLVKCVMNFYKVVLINMLFPCIMIGLANSVVIILKDAFILDTIIHGLASLLNGLPAS IAAIGMFIVQDLFNIVVPSGSGQAAITMPIMAPLADMVGITKQTAVLAFQLGDAFTNVMA PTGGEILAALAMCGTVPFKTWMKYLAPLFVIWWLVSFVFLTIATQIQYGPF >gi|296154377|gb|ADVK01000036.1| GENE 16 22779 - 23303 663 174 aa, chain - ## HITS:1 COG:FN1409 KEGG:ns NR:ns ## COG: FN1409 COG1288 # Protein_GI_number: 19704741 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 172 71 242 247 314 100.0 7e-86 MSVTLGIQNAAYIISFLLIIGGMFAILNATGAINTGMANVVRSMKGRELLMIPVCMIVFG CGSAFCANFEEFLAFVPLVLACCYAMGFDSLTAVGIIFCAAASGYAGAITNAFTTGVAQS IAGLPMFSGMGLRIPLFITLITVSIIYVMYHAHKVKKNPESSSVYQNDLEQKNI >gi|296154377|gb|ADVK01000036.1| GENE 17 23321 - 23512 239 63 aa, chain - ## HITS:1 COG:FN1409 KEGG:ns NR:ns ## COG: FN1409 COG1288 # Protein_GI_number: 19704741 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 61 1 61 247 117 98.0 7e-27 MLKTEEIKQSKALNPMLLLVCMILIIAILSYIIPAGVYDRVMDEKLGKELVDPNSFHYVE KIL >gi|296154377|gb|ADVK01000036.1| GENE 18 23667 - 25121 1872 484 aa, chain - ## HITS:1 COG:FN1408 KEGG:ns NR:ns ## COG: FN1408 COG2195 # Protein_GI_number: 19704740 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Fusobacterium nucleatum # 1 484 6 489 489 924 99.0 0 MRKLENLKPERVFYYFEELSKIPRESGNEKAVSDYLIKTAKKLDLESYQDENLNVIIKKT ATKGYENSKGIILQGHLDMVCEKELESKHNFKTDALNLIIEDGYLRADATTLGADNGIAV AMSLAILEDNNLEHPQIEFLGTVEEETTMKGALRLKPNLLTGKYLINVDSEAEGLLTAGS AGGRTVVIDFEEEKATFSKDKYAFFMLGVKNLIGGHSGMEIDKGRMNANKILTELLVEVK KNFEIKLCSFKGGTKENAIPRVAMVEVAVMKKDLKNFRIKLIEIMKNIIDKYILIEKNME LEMFPTDEHSKCFSDDFFNRFLAFMSEAPTGLNTWLQTYPDIVESSNNIAILRTFDNKIH IEISLRSAEPTIMDKLTTIFKNLSKRYKANFQVTKGYPEWRFKKDSHLRNTALKLYNNLF 
GKNMEVTVMHAGLECGVIGVNYPDLDIISIGPNIYDVHTPKEKMDIKSVERTYLYLTELL KTLK >gi|296154377|gb|ADVK01000036.1| GENE 19 25220 - 26101 1307 293 aa, chain - ## HITS:1 COG:FN1407 KEGG:ns NR:ns ## COG: FN1407 COG3643 # Protein_GI_number: 19704739 # Func_class: E Amino acid transport and metabolism # Function: Glutamate formiminotransferase # Organism: Fusobacterium nucleatum # 1 293 1 293 293 568 100.0 1e-162 MNKILMAEVNISEGTNLELIEKVKKSFIDEKNIEIVDIDSNVDHNRTVFTYKGEPSAVLN ATKKLAKCAVDLIDMRNHKGSHPRMGAVDVVPFIPVKNITTEEAVEIAKEFGKYLGEQGV PVYFYEDAQEKEYRKTLPSIRKGQYEALEEKMKDPKWAPDEGPKEFNPKSGGTVTGARFP LVAFNINLDTYNLEIGKKIVKAVRSATGGYSCIRAIALELEEKKQIQVSMNMINYEKTPI HRVFETIKSEANRYNVNIVDTELVGPVPIYALRDVLDFYLRIADSFSLDQIYF >gi|296154377|gb|ADVK01000036.1| GENE 20 26119 - 27654 2382 511 aa, chain - ## HITS:1 COG:FN1406 KEGG:ns NR:ns ## COG: FN1406 COG2986 # Protein_GI_number: 19704738 # Func_class: E Amino acid transport and metabolism # Function: Histidine ammonia-lyase # Organism: Fusobacterium nucleatum # 1 511 1 511 511 975 96.0 0 MQKIIEINGSNLTIEDVVAVARYGAKVKLDEKQKDKILESRKYVEEALSNKMPIYGINTG FGKFENVPISEEELELLQKNLIYSDACGVGEAFDTEVVRTMMLLRANAISKGFSGVMIET IECLLNMLNAGVHPIVRSKGSVGSSGDLCPLAHMVLPMMGEGEAEYKGEILSGKEAMKRA GVSTITLKAKEGLALINGTQAMMGNAVLAVYDTENLLKQADIVAALTVDALGGIVDAFDE RIHLIRPHKGQIDSAENLRNLLKDSKRTTRQGEKRMQDAYSLRCTPQVHGASRLAFDYVK QTVETEINSVTDNPLIFPGENGACISGGNFHGQPIAIAMDTLGILVSEIANISERRIEKL VNPALSHGLPAFLVKNGGINDGFMIPQYVAAALVSENKVLAHPASVDSIPTSANQEDHVS MGTIGARKARTIVDHAQHVVSIELLCAAQAADFWDSKNLGVGTKEAYRTLREKVDFMEND VIFYPLMDKSFEIIKSAILLANVEKIIGLLK >gi|296154377|gb|ADVK01000036.1| GENE 21 27675 - 28229 804 184 aa, chain - ## HITS:1 COG:FN1405 KEGG:ns NR:ns ## COG: FN1405 COG3404 # Protein_GI_number: 19704737 # Func_class: E Amino acid transport and metabolism # Function: Methenyl tetrahydrofolate cyclohydrolase # Organism: Fusobacterium nucleatum # 1 184 6 189 189 315 98.0 4e-86 MNYKEIFDLILDENDFTVGGGSSSAIVGAMACGLMGMVANLSKGKDYGYSDKEYDDIIKE LNETKANFLQGAVDDNKVYMLIVNAYKLPKSSEEEKEIRRKAIQNAGVEAAKVPLSNALL 
NKKVNEIGKKLLEKSNPACITDLQAGVDLSHIGINMGKSNVKANLPLIKDEKIVKNFEEE IKNL >gi|296154377|gb|ADVK01000036.1| GENE 22 28245 - 29465 1387 406 aa, chain - ## HITS:1 COG:FN1404 KEGG:ns NR:ns ## COG: FN1404 COG1228 # Protein_GI_number: 19704736 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism # Function: Imidazolonepropionase and related amidohydrolases # Organism: Fusobacterium nucleatum # 1 406 1 406 406 781 99.0 0 MSYILLKNCRELLTIEENAKDLIGLKNNTSLLIENERIKKIGTYEDLKKEISSNNFQEID CSDKVVMPGYVDCHTHLIFGESRVDEYVASFTMTKNEIKNKIVRTGIEASIFSTRNATDE ELINSSLIKLNRMLKHGTTTVEIKSGYGIDMETEIRLLKLINILKEKSPQTILSTYLGAH YFDTKMGKEKYIDFMINEVMPVIKKENLAQFCDVWCDEGYYNAEDCYKILKAGLENDMLP TLHTECYSAIGGAKVAAELKAANVGHLNYISSEDIKLLKEANVVGVLIPSTDFSVKHKKP FVPKPMLDEGMTIAIATNLNPGNWVEDMNISMILACRNHKMTENEAIRATTLGGAKALKI EKDYGSLEVSKFADIQIRNSDSYKNVVYKFGVNEIVHVIKNGKIIF >gi|296154377|gb|ADVK01000036.1| GENE 23 29478 - 30788 1988 436 aa, chain - ## HITS:1 COG:FN1403 KEGG:ns NR:ns ## COG: FN1403 COG2056 # Protein_GI_number: 19704735 # Func_class: R General function prediction only # Function: Predicted permease # Organism: Fusobacterium nucleatum # 1 436 1 436 436 724 100.0 0 MVLLNPIVISVLVLTVLCLFKLPVLAALLLSALTAGLAGGFNLTETMSAFIGGMGGNANT ALSYILLGALAYTINKTGAADILAKKISKLVKGNKFVLALIIILVSIASGTIIPVHIAFI PILIPPLLAMMNQMKMDRRMLAICFGFGLKAPYITIPVAYGAIFQGIIKDSVNDAGLSIG LDIVWKTTWIAGLAMLFGLICGLIYYSKNREYRIDEKNHEDFSNDSEEIIIDPKHWLTLG AGIIALIVQLITGLLPLGAIAALIFLVLVRVVKWKEIQEILEGGIHLMGFIAFVMLIASG YATVIRATGAVDHLVESAFNMLGGSKLAGSSIMILLGLLITMGIGTSFGTVPVIAAIYVP LSIKLGFSPAAIVFMIAVAAALGDAGSPASDTTLGPTAGLNADGQHDHIWDTCVPQFICY DIPLMIAGIICPLFMN >gi|296154377|gb|ADVK01000036.1| GENE 24 30807 - 32105 976 432 aa, chain - ## HITS:1 COG:FN1402 KEGG:ns NR:ns ## COG: FN1402 COG3314 # Protein_GI_number: 19704734 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 385 1 385 394 627 99.0 1e-179 MNKKIIVTKFLLYSFIGIFMFFISIKIGDRNTIPIDHLVKFILKIPHLQIIYGITIIFVG 
TIFPLIKKTWNKNKLNFIMSILNIIGLIFTIMSIFKFGPEIITQESMGPYVLFKVVIPVI LIVPIGSIFLAFLVSYGLMEGIGTLMEPIMRPIFKTPGRSAIDAVASFVGSYSLALLVTN GVYRENKYTTKEAAIIATGFSTVSATFMIITLNTLNLMEYWNLYFWVCLIVTFIATAITA RIYPLSKMPDIYFNKNLKVENENLENKKSNIFREAWNTAISNFLKSDSVLDNTISNLKSG IKLALNIGATIMSVGVISLLLAQYTKIFDILGYLFYPLTLFFKTSDPFLIAKSATITIAD MYVPAIISTGASIDVRFIIAVLCISEILFFSASIPCILATDIPIKVRDIIIIWFERVVIS LILIVPIVKFIF >gi|296154377|gb|ADVK01000036.1| GENE 25 32195 - 33949 2974 584 aa, chain - ## HITS:1 COG:FN1401 KEGG:ns NR:ns ## COG: FN1401 COG2987 # Protein_GI_number: 19704733 # Func_class: E Amino acid transport and metabolism # Function: Urocanate hydratase # Organism: Fusobacterium nucleatum # 1 584 1 584 584 1215 100.0 0 MNGEKGLYFRENPEILYEVKAQGGGLKDKETLRCKGWRQETILRMLEFNMENAEIPELLV IYGGNGKCARNWESYWAIVDSLKNLEDDETLVVQSGMPVAIFKTHKEAPVVVMATTNIMQ ATWERFYDLQDKNLTIFAQYTAAPWEYIGTQGVIEGTFETLSAIAMKKFNDDLTGKIYLT AGAGGMGANQTWAMKMHGGVCIVVDVNEKILKKRIEKDYLDIIAPTLEEAIKIAKENAKA KKPISIGVVGNAADMYEKILASDFRPDICSEMCPCHDPISYLPSGYTAEEADELRIKDRD KYLHLARETMKRQLAAMVALKADGVEVFEYGTSIRKECMDAGFPREEAMKIKGFVAEYIR PLFCEGRGPFRWTCLSRDPEDLKVSEEIALEICKGDKLVERWINLARKNLPIEGMPARIC YMGFGERKRFGLAINQAIKDGRIKGTVAFSRDNLDSGSIVNPTFESENMPDGGDYISDWP YLNALLDCAGGCDLIAIQQNYSMGEAVHTGVTMIADGTEEADRRLAACLTTDSGIGVIRH AQAGYKAAKDVANGKGKFTTDSIKVPLWWQPAEKVTFGPKGKYR >gi|296154377|gb|ADVK01000036.1| GENE 26 34137 - 37193 2826 1018 aa, chain - ## HITS:1 COG:FN1400_2 KEGG:ns NR:ns ## COG: FN1400_2 COG3899 # Protein_GI_number: 19704732 # Func_class: R General function prediction only # Function: Predicted ATPase # Organism: Fusobacterium nucleatum # 247 1018 1 772 772 1283 100.0 0 MTKIICKLFGSPKIYEDKKEIFLPSGKLTAFFYYLLLKKVVSRDEVAGMFWASSNEQNAK ISLRNALHKIRKSFKEDIILSPNKSILTLNKDLDIDIDAEKFQKDPLNNFSLYNGDFLKG FYVKEAIDFDYWVLEINTFYKELFIKTAEKKIEEDFLQSRFENLETSITSLLTADNFNDK AYLYLMKFYKQKGRYDKIINEYKNIQKLMEEELGIDPPDEIKNIYKEALKCIEKNKEINI KKNTMDIYCRDFELDSIQLNLDNFQKKFSNKSILITGESGIGKTILKKEILNRNSGKFKI FETACFSMEKDFSYLPWTNIIRDMENELLKFNLKRPHLWDNILKNLFFDGANNIQPSIEI 
LENKENFNIDLIYNSIYSALDILSKEKKIIIVFEDIQWADQLSIKLLINLILHIHSNVLF ILTKTNSIDTVTDRLFLTLKDLNKILLIDLKPFSKRDIALIIKKNFSQKNISNEEIDEIF EKSKGNPFFLKEYIELFKKNKKNNEITSKLHNVLQEKFLNLTENEMNILKIISVFYGDVN LDTLLKLINLKAFEALKFLNLLIEKNIIEEKKKDSKVIITFTYSAYKDYIFNEMNDSSKQ IINMEIAKTLEEELSNLNNIATYNKLKYHYQKANVNIKTLKYDVYILNYYLNFNHEIFPN LDDYDLSKQVKLFIGNDKANKWMNEIKKELMLVKKSKMNSLDIQEIKKIELIFMYCKGRY LIREGSYTDGINLMNRVISIAKDLKEEKIQISAHKQMAIYAIQINNYQIMLKHIIEGIKI ARKEKTVDVGIFYRLYGVYYLIRDEFKTAESLFKKSIELFLEFERIGDKNSISIAANYNY IGEIRNSEGNFEEAMELFNKAIKLCENYEASCLSTFYINAGKTSYLIGNFQDMKKFFLLA EKIIKNFDSYWKNSVLNAFLALDAFLENDNLKVIHYLKCAISEGKIINNPRDIGMVYFVE AIIIYSIEIKNIKKYEDIKKILEENSNFYYYKAIKYLDSTRDKSEIEYLKNFLNINKI >gi|296154377|gb|ADVK01000036.1| GENE 27 37221 - 38177 815 318 aa, chain - ## HITS:1 COG:no KEGG:FN1399 NR:ns ## KEGG: FN1399 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 318 1 318 318 492 99.0 1e-137 MENMYILKSKNSIIFNDGDINEVVFNFKEYEDILNNLSTEKYNFFKMIHEKYNIKNEEEI KNKFLYIFHFILIKNICNYILDKYTSKKINFLYFNKNIKNEKFKLSDELSLDDVLKNIII SLINSEEYLSQNLNIDFKKFDINEIISDKIEDKGINFYFYYDSIKKQDLKSKIEKDLLEL GYIDKNKKNTDNRYTLSIYIDDEQLEKIGIDNYQDYLLNWISIGYLKMLIKIHDFLINYY NLTLEKGLKIDDVMLVLIDIFDTEVKEFPQGLKKSIEVGKETSGKCFFINKIIQPVSLTP ELTLLLQGKDAYNVVPRI >gi|296154377|gb|ADVK01000036.1| GENE 28 38312 - 39724 792 470 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|145629959|ref|ZP_01785741.1| 50S ribosomal protein L21 [Haemophilus influenzae 22.4-21] # 7 448 6 445 456 309 38 2e-83 MDFLNYIIGQINTVLWSYVLIALLLLSGLFYTLRTGFAQGRLLGDMVALITGKLSSLKDG EKKIAGQVTGFQAFCIAVASHVGTGNLAGVAIAVVVGGPGALFWMWIIALLGAATSLIEN TLAQTYKVKDGKGGFRGGPSYYMEKALGQKTLGYIFSIIVIVTFAFVFNTVQANTIAQAF ETTFNLGGVVSGVILAALTALIIFGGLHRIANVVGFLVPIMAIGYVVVAVIVLALNIHHI PRLFMSIIEAAFGLKQAVGGAIGVAMLQGIKRGLYSNEAGMGSAPNAAATSNVSHPVKQG LLQAFGVFVDTILICSATGFIVLLYPDFATTAKEGIQVTQDALAYSIGNWGKDFITLCIF LFAFSSLVGNYYYGEANLEFLTKRKSSMLIFRVLTVACVFLGSVAKLAFVWNIADVSMGI MALMNIIVIAILSPKAVAIIRDYIKQRKEGKNPVFKAKDIPGLENTECWD >gi|296154377|gb|ADVK01000036.1| GENE 29 
39782 - 40696 1206 304 aa, chain - ## HITS:1 COG:FN1397 KEGG:ns NR:ns ## COG: FN1397 COG2066 # Protein_GI_number: 19704729 # Func_class: E Amino acid transport and metabolism # Function: Glutaminase # Organism: Fusobacterium nucleatum # 1 304 1 304 304 578 100.0 1e-165 MKELLKELVEKNRKFTADGNVANYIPELDKADKNALGIYVTTLDGQEFFAGDYNTKFTIQ SISKIISLMLAILDNGEEYVFSKVGMEPSGDPFNSIRKLETSSRKKPYNPMINAGAIAVA SMIKGKDDREKFSRLLNFAKLITEDDSLDLNYKIYIGESDTGFRNYSMAYFLKGEGIIEG NVNEALTVYFKQCSIEGTAKTISTLGKFLANDGVLSNGERILTTRMAKIIKTLMVTCGMY DSSGEFAVRVGIPSKSGVGGGICSVVPGKMGIGVYGPSLDKKGNSLAGGHLLEDLSAELS LNIF >gi|296154377|gb|ADVK01000036.1| GENE 30 41052 - 41351 337 99 aa, chain + ## HITS:1 COG:no KEGG:FN1395 NR:ns ## KEGG: FN1395 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 99 1 99 99 123 100.0 3e-27 MNKENKNFDIISFLFNNESFINSLLENLKKELMEVIFSENLSLFKKSIFIQGVFTYANLI LSNNTSMSDEEKSKIMQEIVEISNLLTENSMEDIKRYTN >gi|296154377|gb|ADVK01000036.1| GENE 31 41416 - 41727 331 103 aa, chain + ## HITS:1 COG:FN1394 KEGG:ns NR:ns ## COG: FN1394 COG2739 # Protein_GI_number: 19704726 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 103 1 103 103 144 100.0 6e-35 MILDEFVEIANLLEIYSSLLSEKQKEYLEDHFENDLSLSEIAKNNNVSRQAIYDNIKRGV ALLYDYEDKLKFYQMKKNIRKELVDLKEDFTKENLEKIIENLL >gi|296154377|gb|ADVK01000036.1| GENE 32 41738 - 43072 2134 444 aa, chain + ## HITS:1 COG:FN1393 KEGG:ns NR:ns ## COG: FN1393 COG0541 # Protein_GI_number: 19704725 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Signal recognition particle GTPase # Organism: Fusobacterium nucleatum # 1 444 1 444 444 779 100.0 0 MLENLGNRFQDIFKKIRGHGKLSETNIKDALREVKMSLLEADVNYKVVKDFTNRISEKAI GTEVIRGVNPAQQFIKLVNDELVELLGGTSSKLTKGLRNPTIIMLAGLQGAGKTTFAAKL AKFLKKQNEKLLLVGVDVYRPAAIKQLQVLGQQIGVDVYSEEDNKDVVGIATRAIEKAKE INATYMIVDTAGRLHVDETLMDELKELKKAIKPQEILLVVDAMIGQDAVNLAESFNNALS 
VDGVILTKLDGDTRGGAALSIKAVVGKPIKFIGVGEKLNDIEIFHPDRLVSRILGMGDVV SLVEKAQEVIDENEAKSLEEKIKSQKFDLNDFLKQLQTIKRLGSLGGILKLIPGMPKIDD LAPAEKEMKKVEAIIQSMTKEERKKPDILKASRKIRIAKGSGTDVSDVNKLLKQFDQMKS MMKMFSSGKMPNIGAMGKGRRFPF >gi|296154377|gb|ADVK01000036.1| GENE 33 43123 - 43386 436 87 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704724|ref|NP_604286.1| SSU ribosomal protein S16P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 87 1 87 87 172 100 3e-42 MLKLRLTRLGDKKRPSYRIVAMEALSKRDGGAIAYLGNYFPLEDSKVVLKEEEILNYLKN GAQPTRTVKSILVKAGLWAKFEESKKK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:42:23 2011 Seq name: gi|296154314|gb|ADVK01000037.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00045, whole genome shotgun sequence Length of sequence - 66515 bp Number of predicted genes - 64, with homology - 60 Number of transcription units - 27, operones - 17 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 18 - 335 379 ## Smon_1286 hypothetical protein - Prom 435 - 494 4.9 - Term 365 - 411 1.4 2 1 Op 2 . - CDS 528 - 833 409 ## Smon_1287 putative phage-associated protein - Prom 861 - 920 8.6 3 2 Tu 1 . - CDS 1076 - 1744 659 ## COG0732 Restriction endonuclease S subunits - Prom 1969 - 2028 8.2 - Term 1750 - 1796 1.4 4 3 Op 1 . - CDS 2037 - 3641 1278 ## MARTH_orf198 hypothetical protein 5 3 Op 2 . - CDS 3638 - 4669 840 ## MARTH_orf197 hypothetical protein 6 3 Op 3 . - CDS 4644 - 5504 1250 ## COG4096 Type I site-specific restriction-modification system, R (restriction) subunit and related helicases - Prom 5536 - 5595 11.4 7 4 Op 1 1/0.750 - CDS 5630 - 6007 691 ## COG0251 Putative translation initiation inhibitor, yjgF family - Prom 6036 - 6095 6.4 8 4 Op 2 . 
- CDS 6098 - 8926 2786 ## COG1061 DNA or RNA helicases of superfamily II - Prom 8985 - 9044 11.8 + Prom 8967 - 9026 14.9 9 5 Op 1 1/0.750 + CDS 9049 - 10635 2070 ## COG0513 Superfamily II DNA and RNA helicases 10 5 Op 2 1/0.750 + CDS 10645 - 11577 951 ## COG1559 Predicted periplasmic solute-binding protein 11 5 Op 3 14/0.000 + CDS 11586 - 12932 1152 ## COG0037 Predicted ATPase of the PP-loop superfamily implicated in cell cycle control 12 5 Op 4 1/0.750 + CDS 12929 - 15070 1317 ## PROTEIN SUPPORTED gi|157803230|ref|YP_001491779.1| 50S ribosomal protein L9 + Term 15116 - 15172 -0.3 + Prom 15083 - 15142 4.3 13 6 Op 1 1/0.750 + CDS 15184 - 15441 422 ## PROTEIN SUPPORTED gi|237743036|ref|ZP_04573517.1| SSU ribosomal protein S15P + Term 15456 - 15490 2.0 14 6 Op 2 . + CDS 15511 - 16620 1341 ## COG5438 Predicted multitransmembrane protein + Prom 16652 - 16711 5.9 15 7 Op 1 . + CDS 16747 - 16995 111 ## FN1981 transposase 16 7 Op 2 . + CDS 17005 - 17304 303 ## Smon_0558 hypothetical protein + Term 17525 - 17560 -0.6 17 8 Tu 1 . - CDS 17358 - 17579 74 ## - Prom 17610 - 17669 9.8 + Prom 17939 - 17998 9.7 18 9 Tu 1 . + CDS 18067 - 18633 1117 ## COG0450 Peroxiredoxin - Term 18661 - 18718 4.1 19 10 Tu 1 . - CDS 18719 - 18835 70 ## - Prom 18868 - 18927 6.3 + Prom 18687 - 18746 8.6 20 11 Tu 1 . + CDS 18935 - 20572 2341 ## COG0492 Thioredoxin reductase + Term 20607 - 20664 19.0 - Term 20595 - 20651 16.1 21 12 Tu 1 . - CDS 20672 - 22090 1734 ## COG4452 Inner membrane protein involved in colicin E2 resistance - Prom 22128 - 22187 14.4 + Prom 21840 - 21899 11.1 22 13 Op 1 . + CDS 22076 - 22150 57 ## 23 13 Op 2 . + CDS 22204 - 23376 1306 ## FN1986 hypothetical protein + Term 23386 - 23425 4.1 - Term 23369 - 23414 0.2 24 14 Tu 1 . - CDS 23417 - 24052 590 ## COG1802 Transcriptional regulators - Prom 24079 - 24138 11.8 + Prom 24234 - 24293 14.7 25 15 Tu 1 . + CDS 24346 - 25728 2160 ## COG3033 Tryptophanase + Term 25771 - 25823 8.2 + Prom 25742 - 25801 5.9 26 16 Tu 1 . 
+ CDS 25859 - 27175 1763 ## COG0733 Na+-dependent transporters of the SNF family + Term 27244 - 27289 1.6 + Prom 27200 - 27259 13.2 27 17 Op 1 1/0.750 + CDS 27378 - 28040 651 ## COG0484 DnaJ-class molecular chaperone with C-terminal Zn finger domain 28 17 Op 2 11/0.000 + CDS 28051 - 29391 1757 ## COG1207 N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 29 17 Op 3 1/0.750 + CDS 29393 - 30343 1347 ## COG0462 Phosphoribosylpyrophosphate synthetase 30 17 Op 4 . + CDS 30345 - 30998 683 ## COG0009 Putative translation factor (SUA5) 31 17 Op 5 . + CDS 31005 - 31439 442 ## FN1994 hypothetical protein + Prom 31472 - 31531 9.6 32 18 Op 1 . + CDS 31674 - 32387 676 ## FN1995 hypothetical protein 33 18 Op 2 1/0.750 + CDS 32384 - 33091 919 ## COG1738 Uncharacterized conserved protein + Term 33115 - 33173 3.2 + Prom 33123 - 33182 7.3 34 19 Op 1 8/0.000 + CDS 33223 - 33507 227 ## COG1396 Predicted transcriptional regulators 35 19 Op 2 . + CDS 33494 - 34639 1460 ## COG3550 Uncharacterized protein related to capsule biosynthesis enzymes + Term 34643 - 34682 3.6 - Term 34629 - 34671 8.9 36 20 Op 1 2/0.000 - CDS 34677 - 35237 915 ## COG4929 Uncharacterized membrane-anchored protein 37 20 Op 2 . - CDS 35230 - 37035 1794 ## COG4984 Predicted membrane protein - Prom 37067 - 37126 8.6 + Prom 37089 - 37148 7.3 38 21 Op 1 7/0.000 + CDS 37174 - 37722 177 ## PROTEIN SUPPORTED gi|163764517|ref|ZP_02171573.1| ribosomal protein L32 39 21 Op 2 15/0.000 + CDS 37732 - 38526 990 ## COG1122 ABC-type cobalt transport system, ATPase component 40 21 Op 3 34/0.000 + CDS 38510 - 39325 203 ## PROTEIN SUPPORTED gi|119503196|ref|ZP_01625280.1| Ribosomal protein S16 + Prom 39406 - 39465 9.8 41 21 Op 4 . + CDS 39508 - 40311 386 ## COG0619 ABC-type cobalt transport system, permease component CbiQ and related transporters + Term 40377 - 40413 -0.4 42 22 Tu 1 . 
- CDS 40256 - 40807 719 ## COG0386 Glutathione peroxidase - Prom 41023 - 41082 16.1 + Prom 41026 - 41085 10.6 43 23 Op 1 33/0.000 + CDS 41159 - 42301 1819 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component + Prom 42307 - 42366 3.1 44 23 Op 2 35/0.000 + CDS 42394 - 43431 1356 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 45 23 Op 3 . + CDS 43433 - 44197 245 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 46 23 Op 4 . + CDS 44209 - 45612 1421 ## COG0534 Na+-driven multidrug efflux pump 47 23 Op 5 . + CDS 45662 - 46264 407 ## COG0500 SAM-dependent methyltransferases 48 24 Op 1 12/0.000 - CDS 47436 - 48158 361 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 49 24 Op 2 2/0.000 - CDS 48158 - 49699 2060 ## COG1732 Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 50 24 Op 3 1/0.750 - CDS 49659 - 50141 544 ## COG1846 Transcriptional regulators - Prom 50188 - 50247 14.1 - Term 50221 - 50261 6.1 51 25 Op 1 . - CDS 50418 - 53081 3623 ## COG0525 Valyl-tRNA synthetase 52 25 Op 2 . - CDS 53111 - 53218 156 ## - Prom 53326 - 53385 3.4 53 26 Op 1 . 
- CDS 53496 - 53678 327 ## gi|256846926|ref|ZP_05552380.1| conserved hypothetical protein - Prom 53699 - 53758 2.7 54 26 Op 2 4/0.000 - CDS 53760 - 54344 729 ## COG0218 Predicted GTPase 55 26 Op 3 18/0.000 - CDS 54360 - 56666 3088 ## COG0466 ATP-dependent Lon protease, bacterial type 56 26 Op 4 24/0.000 - CDS 56676 - 57947 1630 ## COG1219 ATP-dependent protease Clp, ATPase subunit 57 26 Op 5 29/0.000 - CDS 57959 - 58540 966 ## COG0740 Protease subunit of ATP-dependent Clp proteases - Prom 58581 - 58640 2.9 - Term 58552 - 58610 11.1 58 27 Op 1 1/0.750 - CDS 58644 - 59933 1953 ## COG0544 FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) 59 27 Op 2 1/0.750 - CDS 59949 - 61622 1841 ## COG0608 Single-stranded DNA-specific exonuclease 60 27 Op 3 32/0.000 - CDS 61622 - 61984 570 ## COG0858 Ribosome-binding factor A 61 27 Op 4 15/0.000 - CDS 62001 - 64214 3198 ## COG0532 Translation initiation factor 2 (IF-2; GTPase) 62 27 Op 5 22/0.000 - CDS 64241 - 64771 856 ## PROTEIN SUPPORTED gi|237742963|ref|ZP_04573444.1| ribosomal protein L7Ae 63 27 Op 6 32/0.000 - CDS 64764 - 65837 637 ## PROTEIN SUPPORTED gi|17988250|ref|NP_540884.1| transcription elongation factor NusA 64 27 Op 7 . 
- CDS 65864 - 66334 528 ## COG0779 Uncharacterized protein conserved in bacteria - Prom 66406 - 66465 15.2 Predicted protein(s) >gi|296154314|gb|ADVK01000037.1| GENE 1 18 - 335 379 105 aa, chain - ## HITS:1 COG:no KEGG:Smon_1286 NR:ns ## KEGG: Smon_1286 # Name: not_defined # Def: hypothetical protein # Organism: S.moniliformis # Pathway: not_defined # 1 105 9 111 125 84 40.0 2e-15 MVEFFDIKTNTTKIKSRPVLILSSTRNNDYTILPISTITNKVNMDIDYDIKIDPVSYPKL KLKQVSYVRTHKRTFIHQASIDRSNIIGDLKKDYEELFLKLLMML >gi|296154314|gb|ADVK01000037.1| GENE 2 528 - 833 409 101 aa, chain - ## HITS:1 COG:no KEGG:Smon_1287 NR:ns ## KEGG: Smon_1287 # Name: not_defined # Def: putative phage-associated protein # Organism: S.moniliformis # Pathway: not_defined # 1 101 6 105 169 78 42.0 1e-13 MEKILNVAEYIFKEYYRISGEYIDEMKLQKLLYFSQRESLAILNKPMFSENFEGWKYGPV SREVRTYFTPKDGITTNTEDIKSENKYIINNIILEYGSLAS >gi|296154314|gb|ADVK01000037.1| GENE 3 1076 - 1744 659 222 aa, chain - ## HITS:1 COG:Cj1051c_2 KEGG:ns NR:ns ## COG: Cj1051c_2 COG0732 # Protein_GI_number: 15792378 # Func_class: V Defense mechanisms # Function: Restriction endonuclease S subunits # Organism: Campylobacter jejuni # 28 190 292 453 454 107 42.0 1e-23 MKGYGKAREYSNCIIYGRAAKEFTKGDFISMKNVSENSFEIIEKNFEKFKDLQKGYTQFI ENDILFAKIIPCMKNRKTTIITNLKEKIGYSSTEFHILRSTKIINNKLLYNFLKQKRFRE DARCNMTGSVGFRRVPTEFMKNYPFPLPPPPLEEQQEIVRILDEVLENENKVKKLLELEE KMDILEKSILHKAFKGELGTQNINNEPAMGLLKFYKNLPFSL >gi|296154314|gb|ADVK01000037.1| GENE 4 2037 - 3641 1278 534 aa, chain - ## HITS:1 COG:no KEGG:MARTH_orf198 NR:ns ## KEGG: MARTH_orf198 # Name: not_defined # Def: hypothetical protein # Organism: M.arthritidis # Pathway: not_defined # 1 534 1 534 535 582 62.0 1e-164 MIIKSIYIGNSNEAFIEDSFGKDFNIIYSDDNNKGKTIVIQSIMYCLGNSPVFPASFPFG NYYHILTIESNGKEFDVCRKKNNFIVKYKSEIFVFDNISEFKRYWDKNITKLPVIIKDGR KKLTDLELLLQLFFVGQDKKMTHDIVSSGWLKKADFYNMINSMCGIKNTFDNDDDVESLK VRNNQLKEEKKSLLKKNKILKSNRGAVEILSSSNSRIALEEKLKETEKMRDIISSLTSDR NNAIKRKTKNELVLKELRSLNRTMKTGKILCMNCGSTHISYESSDSEFNFDISTIDMRTN 
ILKSIEEKIGIYDEEIGRISKEIQHQQNKLNTLLETEDISIEELLLAKLDLDGTQEDDDK IEEISKEIDNINNRIKQLVEQNETMLKESKENLENIVLEMNQFNKYIDPSISDIYEDIFT SKYKTYSGSEGTQFYLSKMYAFAKVLKHDYPIIVDSFRAEDLSTDREKRVISKFSELPNQ LILTTTLKEEEHLKYETLQNIHSIDYSGHKNYHILQKKYVDKFIEKLNSMMIRL >gi|296154314|gb|ADVK01000037.1| GENE 5 3638 - 4669 840 343 aa, chain - ## HITS:1 COG:no KEGG:MARTH_orf197 NR:ns ## KEGG: MARTH_orf197 # Name: not_defined # Def: hypothetical protein # Organism: M.arthritidis # Pathway: not_defined # 1 342 1 342 342 434 65.0 1e-120 MSYVVKSSEKLRPTASDSETKALLYLMNFRDDSDEIYYFVVDFFNDLTGMSRLADKMWDV QSKGAKCSSPKAVGKELVTLFKNYISDFKFDYYILFLGGISSSARKNSTLTHFDIENIQK NALKSIKEGLIKECKNKTYIDNNDINDEKISKFLRKVFFVIDDKNKSEYVKKIIKLNPAI IPEEETLVAIFNEIRDVQSSKKNIGVVEGKIIKTPDEAIMFGRHLSITEIKLLVLNRILT KKVVGAELPPSFSEIYNKYPEEKRRSLLEDCQLDFSKALFDTTSQDDYWKLFEEVYRIAV NNPKYDVNEIFNNLDNELKKKCQHFNVLSLKYFIAKIKDGIKI >gi|296154314|gb|ADVK01000037.1| GENE 6 4644 - 5504 1250 286 aa, chain - ## HITS:1 COG:hsdR KEGG:ns NR:ns ## COG: hsdR COG4096 # Protein_GI_number: 16132171 # Func_class: V Defense mechanisms # Function: Type I site-specific restriction-modification system, R (restriction) subunit and related helicases # Organism: Escherichia coli K12 # 3 254 23 324 1188 99 29.0 8e-21 MYSNFNFLQNDWQGLAKIGEMAEYMLYKNPSTAIMKLRQFGEELVKLMLKAENFTYDKNV LPVDRILILKRAGLIPADIDNILTSLRKKGNDAIHNYYRDEKKAETFLSLAVKLGAWFQE VYGTDYLFQSESVEYKKPENIDYEKEYQKLVERTDEIEKELENIKTVPHLASREDRKKLI SKKKEIEFTEEETRLIIDKQLIDAGWEVDTKVLNYKLNKTLPEKKRNIAIAEWPCIKENG RKGFADYALFLGEKAKETFNIVKLEQLPNLDGICGGDKYVLCCEII >gi|296154314|gb|ADVK01000037.1| GENE 7 5630 - 6007 691 125 aa, chain - ## HITS:1 COG:FN1973 KEGG:ns NR:ns ## COG: FN1973 COG0251 # Protein_GI_number: 19705269 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation initiation inhibitor, yjgF family # Organism: Fusobacterium nucleatum # 1 125 4 128 128 220 98.0 5e-58 MKKVINTTNAPAALGPYSQAIEANGVLYVSGQIPFVPATMTLVSEDVEEQTKQSLENIGA ILKEAGYDFKDVVSATVYIKDMNDFTKINEVYDKYLGEVKPARACVEVARLPKDVKVEIG 
VIAVK >gi|296154314|gb|ADVK01000037.1| GENE 8 6098 - 8926 2786 942 aa, chain - ## HITS:1 COG:FN1974_2 KEGG:ns NR:ns ## COG: FN1974_2 COG1061 # Protein_GI_number: 19705270 # Func_class: K Transcription; L Replication, recombination and repair # Function: DNA or RNA helicases of superfamily II # Organism: Fusobacterium nucleatum # 182 942 1 761 761 1284 97.0 0 MENILLEALKTSSIDFNIDSDEKYQYELIANGEEKIVTRLRKYFEDCDEFIISVAFITMG GISLFLEELKNLENKGIKGKILTGDYLTFTEPKALKKLLSYKNIDLKVATNRKHHTKAYF FRKENIWTLIVGSSNLTQGALTVNFEWNIKVNSLENGKIVKSVLETFNREFDNLKTLTEE DIENYQKKYEQLKKLIEVNNQNLDLDEIKPNSMQVQALKNLEETRKENDRALLISATGTG KTYLSAFDVKQTKAKKILFVAHRKVILERSKNSYQRILKNKKMEIFDSNFQINNKDEVVF AMVQTLNKEKNLNIFPKDYFDYIIIDEVHHGGAKTYQSIFEYFKPKFLLGITATPERTDD FNIYQLFNYNVAYEIRLQDAMKEELLCPFHYFGISDIVIDGESIDEKTSIKNLTSDERVK HILEKSKYYSYSGERLYCLIFVSKVEEAKILVEKFLEQGVKAIALSSENSDNEREEAIRK LEQGEIEYIVSVDIFNEGVDIPCVNQVILLRPTTSAIVYIQQLGRGLRKHKNKAYTVVLD FIGNYEKNFLIPIAISQNNSYDKDFMKRFLMNATDFLAGESSISFDEISKERIFENINKT NFSNRKLIEEDFKLLEKQLGRIPYLYDFYEKNMLSPTVILKYKKDYDEVLKNIAPKYRTG NLNNIEKKFLIFLSTFFTPAKRIHEMLILKEALIKQKLNIIETEKILKDKYSLNNQENSI KNAFEHLSKEIFITLSTTKSFESVLYKKDEEYYLDENFKNSYKNNSYFKILIDDLIKYNL AFAEKNYNNFVKASIKLFGEYTKQEAFWYLNLNFNNGFQVSGYTPFENERKLLIFITMDN LSERADYSNEFYDSQTFSWFSKSSRYLRKDNKLTIEGKIAENFYEINVFVKKNNGENFYY LGDVEKVLSAKEIKDSQGKSMVKYIFKLKKDVKKELLDYFNM >gi|296154314|gb|ADVK01000037.1| GENE 9 9049 - 10635 2070 528 aa, chain + ## HITS:1 COG:FN1975 KEGG:ns NR:ns ## COG: FN1975 COG0513 # Protein_GI_number: 19705271 # Func_class: L Replication, recombination and repair; K Transcription; J Translation, ribosomal structure and biogenesis # Function: Superfamily II DNA and RNA helicases # Organism: Fusobacterium nucleatum # 1 528 1 528 528 946 99.0 0 MEQLEKLKEFRELGLSEKVLKVLSKKGYESPTPIQRLTIPALLKNDKDIIGQAQTGTGKT AAFSLPIIENFENLEHIQAIVLTPTRELALQVAEEMNSLSTSKKMKVIPVYGGQSIDIQR KLIKTGVDVVVGTPGRVIDLIERKLLKLNSLKYFILDEADEMLNMGFVEDIEKILTFTNE DKRMLFFSATMPDEIMKVAKNHMKEYEVLAVKSRELTTDLTEQIYFEVNERDKFEALCRI 
IDLTKEFYGIIFCRTKTDVNEIVGRLNDRGYDAEGLHGDIGQNYREVTLKRFKTKKINIL VATDVAARGIDINDLSHVINYAIPQEVESYVHRIGRTGRAGKEGTAITFITPQEYRRLLQ IQKAVKKEIKKEKLPDVKDVIQAKKFRIIDDIGQILIDNDYDKFKKLAKDLLKMEDAENI VASLLKLSYSDVLDESNYNEISPVKMEDTGKTRLFIAMGRKDGMTPKKLVEFIVKKSKIK QSYIKNAEVYEGFSFVSVPFKEAEIIVEAFAKNRKGKKPLIEKAKSKK >gi|296154314|gb|ADVK01000037.1| GENE 10 10645 - 11577 951 310 aa, chain + ## HITS:1 COG:FN1976 KEGG:ns NR:ns ## COG: FN1976 COG1559 # Protein_GI_number: 19705272 # Func_class: R General function prediction only # Function: Predicted periplasmic solute-binding protein # Organism: Fusobacterium nucleatum # 17 310 17 310 310 519 98.0 1e-147 MKKLFAIIFIIILILAGTTVYQLIKKDKYNLVLEIDKDKPLKESLSALPISNNPFFKLYL KFRNDGKNIKAGNYELRGKFNMIELVSMLESGKSKVFKFTIIEGNTVKNVVDKLVANEKG SRENFEKAFKEIDFPYPTPDNNFEGYLYPETYFIPESYDEKAILNVFLKEFLKKFPVENY PDKDEFYQKLIMASILEREAAVESEKPLMASVFYNRIAKNMTLSADSTVNFVFNYEKKRI YYKDLEVDSPYNTYKNKGLPPGPICNPTVSSVNAAYNPADTEYLFFVTKGGGEHFFSKTY KEHLDFQKNK >gi|296154314|gb|ADVK01000037.1| GENE 11 11586 - 12932 1152 448 aa, chain + ## HITS:1 COG:FN1977 KEGG:ns NR:ns ## COG: FN1977 COG0037 # Protein_GI_number: 19705273 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Predicted ATPase of the PP-loop superfamily implicated in cell cycle control # Organism: Fusobacterium nucleatum # 1 448 1 448 448 686 99.0 0 MELFREILKLNKKYNLIENSDTIVVGFSGGPDSVFLVEMLKKLQYFFNFKIYLVHINHLL RGEDADSDENFSFEYAKKNNLEIFIKRIPVKEIAKEVGKTLEEVGREERYKFFSEIYEKV GANKIATAHNKDDQIETFLFRLIRGTSLQGLEGIKIKNNNIIRPISEIYKKDILEYLNKN EIQYKIDKTNFENEFTRNSIRLNLIPFIEERYNIKFKDKIFSLIEEIRENNQNNSLNLSN YTDSENRIILEKIKFLSDFDKKNLLSLFLNQKNIEVNRNKIDEINSLIKSNGTKKIDLDK SYRIVKDYTHLYIEEKKENFTIVNRVVKLKIPSEQIFDDFKISVNIVKNLDIPKKKNQYL LDALYNDIIEVRYRKEGDRIFLDEKHSKKIKEVFIDQKIPKDMRDRLPIFLYNNKIFWIY NVKKAYIPKINKNESKLIKVLITVEEVK >gi|296154314|gb|ADVK01000037.1| GENE 12 12929 - 15070 1317 713 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|157803230|ref|YP_001491779.1| 50S ribosomal protein L9 [Rickettsia canadensis str. 
McKiel] # 217 710 104 596 636 511 52 1e-144 MNDNQFENEDLKNKDDSDVHEKQENKENEKEEPKQEKSQEQDSHNENKNDSEDKKSSDED KKQDDKYNPFNNKRDDEKRRVVGKAVKVNFNFKGLLMLIFIITLAVVVPSIMDENKNQQI VDISYSDFIKNIENKKIGVVEEKDGYVYGYKASEVKYLETKSNSIKSKLGFDGKNEVQGL KARLITNRLGEDSNLMAVINNNNAIIQSVEPPEPSLFLSIVLAFLPYIIMIGFLVFMLNR MNRGGGGGPQIFNMGKSRAKENGENISNVTFADVAGIDEAKQELKEVVDFLKEPEKFRKI GAKIPKGVLLLGEPGTGKTLLAKAVAGEAKVPFFSMSGSEFVEMFVGVGASRVRDLFSKA RKNAPCIVFIDEIDAVGRKRGTGQGGGNDEREQTLNQLLVEMDGFGTDETIIVLAATNRA DVLDKALRRPGRFDRQVIVDMPDIKGREEILKVHAKGKKFASDVDFKIIAKKTSGMAGAD LANILNEGAILAAREGRTEITMADLEEASEKVQMGPEKRSKVVSETDKKIVAYHESGHAI VNFVVGGEDKVHKITMIPRGQAGGYTLSLPAEQKLVYSKKYFMDEIAIFFGGRAAEEIIF GKDNITSGASNDIQVATSFAQQMVTKLGMSEKFGPILLDGTREGDMFQSKYYSEQTGKEI DDEIRSIINERYQKALSILNENRDKLEEVTRILLEKETIMGDEFEAIMRGENV >gi|296154314|gb|ADVK01000037.1| GENE 13 15184 - 15441 422 85 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|237743036|ref|ZP_04573517.1| SSU ribosomal protein S15P [Fusobacterium sp. 7_1] # 1 85 1 85 85 167 100 2e-40 MKTKAEIIKEFGKSEADTGSTEVQIALLTEKINHLTEHLRVHKKDFHSRLGLLKMVGQRK RLLAYLTKKDLEGYRALIAKLGIRK >gi|296154314|gb|ADVK01000037.1| GENE 14 15511 - 16620 1341 369 aa, chain + ## HITS:1 COG:FN1980 KEGG:ns NR:ns ## COG: FN1980 COG5438 # Protein_GI_number: 19705276 # Func_class: S Function unknown # Function: Predicted multitransmembrane protein # Organism: Fusobacterium nucleatum # 1 369 1 369 369 623 99.0 1e-178 MKKILVFVIFLVCSTITFSDKVITNKNETKEEYLSGKIIALVSEENSDEDSIAKLQKFNV KLLDGIDKGEVVEIDLPIYMNDEYNINAKVGDRVVVYKTFDNYGNDEMQLQYYISDVDKR IELYVMGIGFVALVLLIARKNGLKSLFALTVTIAFIIKIFIPAIYNGYSPILFAIITAIF SSLVTIYFTVGMNKKFIVALLGVTGGVLIAGILSYIFTYRMRLNGFLDSDLLACAYLFKN IKMKEIIPAGVIIGSLGAVMDVAVSIASSINELHEIDPNISKKSMFKSAINIGNDIIGTM INTLILAYIASAIFTLLLIYMQANEYPLIRILNFQDIAVEIMRSICGSIGILIAVPLTAY IGTLIYKKK >gi|296154314|gb|ADVK01000037.1| GENE 15 16747 - 16995 111 82 aa, chain + ## HITS:1 COG:no KEGG:FN1981 NR:ns ## KEGG: FN1981 # Name: not_defined # Def: transposase # Organism: F.nucleatum # Pathway: not_defined # 1 77 1 77 122 138 94.0 
6e-32 MKYQFHNIHKKIARNFKVPKSSPNYFTNSDIIHYGLITAIHTFDRDLKWNPHIYALVSLG GFTKNFTFKKLDYFHVPSIAEQ >gi|296154314|gb|ADVK01000037.1| GENE 16 17005 - 17304 303 99 aa, chain + ## HITS:1 COG:no KEGG:Smon_0558 NR:ns ## KEGG: Smon_0558 # Name: not_defined # Def: hypothetical protein # Organism: S.moniliformis # Pathway: not_defined # 1 98 218 317 407 94 54.0 2e-18 MLNIVQNGNYPNLKIKNLAKKAVSKLYKEDKRLFFNVGSGDVSSPKGIVKYLGRYLTRAP IAEYKITYYDNEKVTFFFNDLANDKKKTYVTMDIDKFVQ >gi|296154314|gb|ADVK01000037.1| GENE 17 17358 - 17579 74 73 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVIRELLEINFMYGSSIPIRTYINFLYIHHNFTIRAHKWMYIKYINRLFYIKRIFRFRKY FFVFFNFTEILRP >gi|296154314|gb|ADVK01000037.1| GENE 18 18067 - 18633 1117 188 aa, chain + ## HITS:1 COG:FN1983 KEGG:ns NR:ns ## COG: FN1983 COG0450 # Protein_GI_number: 19705279 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peroxiredoxin # Organism: Fusobacterium nucleatum # 1 188 1 188 188 376 100.0 1e-104 MSLIGRKVPEFKATAFKKGEKDFVTVTDKDLLGKWSVFVFYPADFTFVCPTELEDLQDNY EAFKKEGAEVYSVSCDTAFVHKAWADHSERIKKVTYPMVADPTGFLARAFEVMIEEEGLA LRGSFVINPEGKIVAYEVHDNGIGREAKELLRKLQGAKFVAEHGEVCPAKWQPGSETLKP SLDLIGEL >gi|296154314|gb|ADVK01000037.1| GENE 19 18719 - 18835 70 38 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNKTTVMFIIVERVFRSSRNTNGYQVVALTNLFILLDF >gi|296154314|gb|ADVK01000037.1| GENE 20 18935 - 20572 2341 545 aa, chain + ## HITS:1 COG:FN1984_1 KEGG:ns NR:ns ## COG: FN1984_1 COG0492 # Protein_GI_number: 19705280 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Thioredoxin reductase # Organism: Fusobacterium nucleatum # 1 332 1 332 332 592 99.0 1e-169 MERIYDMIVIGGGPAGLSAGIYGGRAKLDVLIIEKENKGGQISLTSEVVNYPGILEISGS EFMTQTKKQAQGFGVNFVQEEVLDMDFTEKIKTIKTNNAEYKTLSVVIATGAAPRKLGFP GEQEFTGRGVAYCATCDGEFFTGMDIFVIGAGFAAAEEAMFLTKYGKSVTIIAREPDFTC AKLIGDKVKAHPKITVKFNTELTELTGDVKPTAAKFKNNVTGEITEYKAKVGETFGVFVF VGYAPSSQIFKGHIEIDKAGFIPTNEDLMTNVDGVFAVGDIRPKRLRQVVTAVADGAIAA 
TSIEKYVHDLREELGIKKEEKEEEKTTSVTTEKEHFLDNELRQQLVAVVDRFENPVEIVV FKDPNNEESVNIENAVKDIASISPEKLKFSSYNEGENKELETKVKVTRTPTIAILDKDGN YTGLKYSSLPSGHELNSFILGLYNVAGPGQKVATESLEKIEKINKPVNIKIGISLSCTKC PKTVQATQRIATLNKNVEMEMINIFTFQDFKNRYDIMSVPAIIVDDQHIYFGEKTVEDML EIINK >gi|296154314|gb|ADVK01000037.1| GENE 21 20672 - 22090 1734 472 aa, chain - ## HITS:1 COG:FN1985 KEGG:ns NR:ns ## COG: FN1985 COG4452 # Protein_GI_number: 19705281 # Func_class: V Defense mechanisms # Function: Inner membrane protein involved in colicin E2 resistance # Organism: Fusobacterium nucleatum # 19 472 1 454 454 873 100.0 0 MENNQQIPPYVRRTVSPVMKKIAFLFIFVLVLLIPLELIKNLIDDRGRLYNQTITNIGNE WGKSQKIIAPVITISYTDTGINKKDSVNNTKTVAVVPVERKFAILPEELNATIEMKDEVR QRGIYNATVYNANIKLKGYFSSKDFPEDKKVQGCVSIGLTDTKALIKINKFKIGDMEQDL EAMSGTMATPLITNGISGQLGPEHNSIMDKEKIPFEIDIDFRGSRDISILPLGKKNHFEI KSNWKSPSFSGVLPTERTIDENGFLAKWEVSNLIRNYPQIIDVNEDKYSDFYDEYYDYSS ENYSTYGNYSNGNTVVKVALFDSVTSYTQIYRACYYGILFIGMSLVVVYIFEVVSKKAAH YVQYGVVGFSLVIFYLLLLSLSEHIGFEWAYLISSLAIVIPNSMYITSMTSNKKFGIGMF IFLSGIYAILFSILRMEQYALLTGSLLILAVLYAVMYLTKKADVFQTLEEKE >gi|296154314|gb|ADVK01000037.1| GENE 22 22076 - 22150 57 24 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MIIFHTSPFKFILFTIIFIFLSIA >gi|296154314|gb|ADVK01000037.1| GENE 23 22204 - 23376 1306 390 aa, chain + ## HITS:1 COG:no KEGG:FN1986 NR:ns ## KEGG: FN1986 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 55 390 1 336 336 581 100.0 1e-164 MKKFFLTILFCILSIFTFANDWEFGSEGEHIIPLKGSNMSIKKEKITLKLMPDGMLVNVK FTFDNPTAENKIIGFVTPESGSGGEGEGKGNRKPEPLNIKSFKTTVNGKEVKSNVELLSK LLSKGVLDNNVIKEYTEKEKNFYNYVYYFNADFKQGENVVEHNYFYTGSYGVYERDFSYV VTTISKWKNKTVEDFEIEVQPGNYFVKLPYSFWKNNNKINWEIVGKGKMVTIAPTKKPND EDANGIEKFGVIYLKLDNGFVRYKAKNFSPTDDFYMVRMDNILGFGYEFPEGKVQGYKFK DKYFDIAARGNSYGDTGFIKEYQQDLNDKDLDIIRNYPYALAGYDFDKKDLKDYFSQFIW YSPVSKNVKIDSDSNNIIKAVDEINTKRKK >gi|296154314|gb|ADVK01000037.1| GENE 24 23417 - 24052 590 211 aa, chain - ## HITS:1 COG:FN1987 KEGG:ns NR:ns ## COG: FN1987 COG1802 # 
Protein_GI_number: 19705283 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Fusobacterium nucleatum # 3 211 1 216 216 339 96.0 2e-93 MKVVKDLLSEQIYKILKNDIINSKINFGEVLVNKNLQERFEVSSTPIREDGIIEEITRSG AKLIDFDPNFACEVNQLIMTITLGVIEYSLENKENRKEIVDNLNKYIKLQQNNVSTDLYY EYDYHFHKTFFDYSNNKLLKDLFKKYNLINEILVKAYHKGAFSLEIRMACLEDHENIIKS IEENNISKTLDSVKKHYLRADRIFKNKLKIN >gi|296154314|gb|ADVK01000037.1| GENE 25 24346 - 25728 2160 460 aa, chain + ## HITS:1 COG:FN1988 KEGG:ns NR:ns ## COG: FN1988 COG3033 # Protein_GI_number: 19705284 # Func_class: E Amino acid transport and metabolism # Function: Tryptophanase # Organism: Fusobacterium nucleatum # 1 460 1 460 460 940 99.0 0 MRFEDYPAEPFRIKSVETVKMIDKAAREEVIKKAGYNTFLINSEDVYIDLLTDSGTNAMS DKQWGGLMQGDEAYAGSRNFFHLEETVKEIFGFKHIVPTHQGRGAENILSQIAIKPGQYV PGNMYFTTTRYHQERNGGIFKDIIRDEAHDATLNVPFKGDIDLNKLQKLIDEVGAENIAY VCLAVTVNLAGGQPVSMKNMKAVRELTKKHGIKVFYDATRCVENAYFIKEQEEGYQDKTI KEIVHEMFSYADGCTMSGKKDCLVNIGGFLCMNDEDLFLAAKEIVVVYEGMPSYGGLAGR DMEAMAIGLRESLQYEYIRHRILQVRYLGEKLKEAGVPILEPVGGHAVFLDARRFCPHIP QEEFPAQALAAAIYVECGVRTMERGIISAGRDIKTGENHKPKLETVRVTIPRRVYTYKHM DVVAEGIIKLYKHKEDIKPLEFVYEPKQLRFFTARFGIKK >gi|296154314|gb|ADVK01000037.1| GENE 26 25859 - 27175 1763 438 aa, chain + ## HITS:1 COG:FN1989 KEGG:ns NR:ns ## COG: FN1989 COG0733 # Protein_GI_number: 19705285 # Func_class: R General function prediction only # Function: Na+-dependent transporters of the SNF family # Organism: Fusobacterium nucleatum # 1 438 1 438 438 756 99.0 0 MDNSERKFQSKIGFILTCVGSAVGMANIWAFPYRVGKYGGAVFLLIYFMFIALFSYVGLS AEYLIGRRAETGTLGSYEYAWKDVGKGKLGYGLAYIPLLGSMSIAIGYAVIAAWVLRTFG AAVTGKILEVDTAQFFGEAVTGNFVIMPWHIAVIVLTLLTLFAGAKSIEKTNKIMMPAFF VLFFILAVRVAFLPGAIEGYKYLFVPDWSYLSNVETWINAMGQAFFSLSITGSGMIVCGA YLDKKEDIINGALQTGVFDTIAAMIAAFVVIPASFAFGYPASAGPSLMFMTIPEVFKQMP FGQLLAILFFVSVVFAAVSSLQNMFEVVGESIQTRFKMTRKSVIVLLGIIALVIGIFIEP ENKVGPWMDIVTIYIIPFGAVLGAISWYWILKKESYMEELNQGSKVTRSEIYHNVGKYVY VPLVLVVFVLGVIYHGIG >gi|296154314|gb|ADVK01000037.1| GENE 27 27378 - 28040 651 220 
aa, chain + ## HITS:1 COG:FN1990 KEGG:ns NR:ns ## COG: FN1990 COG0484 # Protein_GI_number: 19705286 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: DnaJ-class molecular chaperone with C-terminal Zn finger domain # Organism: Fusobacterium nucleatum # 46 220 1 175 175 255 99.0 4e-68 MEVMLVPLLILFFILVAGFGIENTFKILPPLIILGLLIYFLGWIAVKYFWIILPIWFVSK LLSNKNRGNGSTYSRTYRRTDDDFFNSYRSNGSSGSTNRTYGGTFNSREEAEEFFRTFFG GGFGQGTTGTSGSTYGGYSSQNSGSSYQRNTSSTYTTDKSKYYSILGVSRGASQDEIKKA YRKLAKEHHPDRFVNSSDSEKKYHENKMKEINDAYENLTK >gi|296154314|gb|ADVK01000037.1| GENE 28 28051 - 29391 1757 446 aa, chain + ## HITS:1 COG:FN1991 KEGG:ns NR:ns ## COG: FN1991 COG1207 # Protein_GI_number: 19705287 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) # Organism: Fusobacterium nucleatum # 1 446 1 446 446 798 99.0 0 MKSIIMAAGKGTRMKSDLPKVVHLAHGKPMIVRIIDALNALDVEENILILGHKREKVLEV LGNDVSYVVQEEQLGTGHAVKQAVPKIKDYDGDVLIINGDIPLIRKQTLIDFYNLYKNEN ADGIILSAIFENPFSYGRVIKDGNKVLRIVEEKETNEEQKKVKEINAGVYIFKAQALVKA LEKINNNNEKGEYYITDVIEILSNDKKVISYSLEDSMEIQGVNSKVELALVSKVLRERKN TALMEDGVILIDPATTYIDDEVKIGRDTTIYPNVTLQGNTEIGENSEILSGTRIIDSKIY DNVRIESSVIEESIVENGVTIGPYAHLRPKSHLKENVHIGNFVETKKSTLEKGVKAGHLT YLGDAHIGEKTNIGAGTITCNYDGKNKFKTEIGKDVFIGSDTMLVAPVNIGDNSLIGAGS VITKDVPSDSLSVERSKQIIKEGWKK >gi|296154314|gb|ADVK01000037.1| GENE 29 29393 - 30343 1347 316 aa, chain + ## HITS:1 COG:FN1992 KEGG:ns NR:ns ## COG: FN1992 COG0462 # Protein_GI_number: 19705288 # Func_class: F Nucleotide transport and metabolism; E Amino acid transport and metabolism # Function: Phosphoribosylpyrophosphate synthetase # Organism: Fusobacterium nucleatum # 1 316 1 316 316 595 100.0 1e-170 MINFNNVKIFSGNSNLELAKKIAEKAGLQLGKAEIQRFKDGEVYIEIEETVRGRDVFVVQ STSEPVNENLMELLIFVDALKRASAKTINVIIPYYGYARQDRKSKPREPITSKLVANLLT TAGVNRVVAMDLHADQIQGFFDIPLDHMQALPLMARYFKEKGFKGDEVVVVSPDVGGVKR 
ARKLAEKLDCKIAIIDKRRPKPNMSEVMNLIGEVEGKIAIFIDDMIDTAGTITNGADAIA QRGAKEVYACCTHAVFSDPAIERLEKSVLKEIVITDSIALPERKKIDKIKILSVDSVFAN AIDRITNNQSVSELFN >gi|296154314|gb|ADVK01000037.1| GENE 30 30345 - 30998 683 217 aa, chain + ## HITS:1 COG:FN1993 KEGG:ns NR:ns ## COG: FN1993 COG0009 # Protein_GI_number: 19705289 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Putative translation factor (SUA5) # Organism: Fusobacterium nucleatum # 1 217 1 217 217 351 99.0 7e-97 MEKYLKIDKISDISDDKWTLLSEEIKKGSLIIYPTDTVYGLGAIVTNEQSINNIYLAKSR SFSSPLIALLSSVDKVEEVAYVSDKNRELLKKLSKAFWPGALTVILKSKEHIPSIMVSGG DTIGVRIPNLDLAIKIIDLAGGILATTSANISGEATPKSYDELSEAIKSKVDILIDSGKC KLGEASTIIDLTSDVPKILRKGAISIEEIEKIIGRVG >gi|296154314|gb|ADVK01000037.1| GENE 31 31005 - 31439 442 144 aa, chain + ## HITS:1 COG:no KEGG:FN1994 NR:ns ## KEGG: FN1994 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 144 1 144 144 207 100.0 1e-52 MKATRVNPGTLSPMEMNNMSSMMGMMNSIQKIGKGKRKYTVKLDKTDKKLLVRFINEAKK QFSDTASNSQYAGVYNFLTYITDIASKKESTEIKMSYEEQDFIKRMLQDSVRGMEKMQFF WYQFIRKFTVRMLTKQYRELLKKF >gi|296154314|gb|ADVK01000037.1| GENE 32 31674 - 32387 676 237 aa, chain + ## HITS:1 COG:no KEGG:FN1995 NR:ns ## KEGG: FN1995 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 237 1 237 237 390 93.0 1e-107 MGIRYSKVEGKFQREIVLLKSFPCAYGKCSFCNYIEDNSNNEEEINRVNLEVLNEITGEF EILEVINSGSVFEIPKKTLEKIREVVYEKDIKVLYFEIFYSYLSRLNELIDYFNEKKKVE IRFRTGIESFDNDFRRKVYNKNIFLDEKKIKELSEKIYSVCLLIATQGQTKEMIKKDIEL GLKYFKAVTINVFVDNGTAVKRDIELVKWFIQDMKHLFYNDRIEILIDNKDLGVFEQ >gi|296154314|gb|ADVK01000037.1| GENE 33 32384 - 33091 919 235 aa, chain + ## HITS:1 COG:FN1996 KEGG:ns NR:ns ## COG: FN1996 COG1738 # Protein_GI_number: 19705292 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 235 1 235 235 412 97.0 1e-115 MMHNIFFWFLMLVINFSCILFAYRKFGKIGLYIWVPISTILANVQVVILVNLFGLEATLG 
NILYAGGFLITDILSENYGKKAANTAVKIGFFSLIATTLIMQCAIHFKPLDIPEGLAIFE SVKGIFSLLPRLAIASLIAYLISQFHDVWLYHKIREFFPEKKFIWLRNNGSTMLSQLIDN IVFTTIAFYGVYPVDVMFNIFLSTYIIKFIVAICDTPFVYIADKMFRDKKIPEDI >gi|296154314|gb|ADVK01000037.1| GENE 34 33223 - 33507 227 94 aa, chain + ## HITS:1 COG:FN1997 KEGG:ns NR:ns ## COG: FN1997 COG1396 # Protein_GI_number: 19705293 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 94 13 106 106 141 98.0 3e-34 MKLNFLNIKTPKEIQLEIAKNIRKRRKELKLTQEEFSKKSGVSFGSIKRFENTGEISLFS LIKIAIILECEDEFLNLFQQKQYNSIEEIINEQD >gi|296154314|gb|ADVK01000037.1| GENE 35 33494 - 34639 1460 381 aa, chain + ## HITS:1 COG:FN2000 KEGG:ns NR:ns ## COG: FN2000 COG3550 # Protein_GI_number: 19705296 # Func_class: R General function prediction only # Function: Uncharacterized protein related to capsule biosynthesis enzymes # Organism: Fusobacterium nucleatum # 241 381 1 141 145 262 97.0 9e-70 MNKIKSLQVFYNEKKVGTLALMKNNIVAFEYDNEWLNNGFSISPYSLPLKKQVFIPKIEP FDGLYGVFSDSLPDGWGRLLVDRMLNSQNINPREINPISRLAIVGETGMGALSYNPVYNL LEDKDYQEDYDSLALSCKKILNTKYSDDLDNLFRLGGSSGGARPKILTKINNEDWIIKFP SSLDDENIGKLEYLYSLCAKKCKINMPETKLFPSKISSGYFGVKRFDRKKSSTDAIRKIH MISVSGLLETSHRIPNLDYNDLMQLTLNLTKSFEDVEKLFRLMCFNVFSHNRDDHSKNFS FIYNEESKKWELSPAYDLTYSYSINGEHATTVNGNGVNPDLKDILKVAENIGLDKKKAEN IASEIKKIVKKDLEIFLSNNQ >gi|296154314|gb|ADVK01000037.1| GENE 36 34677 - 35237 915 186 aa, chain - ## HITS:1 COG:FN2001 KEGG:ns NR:ns ## COG: FN2001 COG4929 # Protein_GI_number: 19705297 # Func_class: S Function unknown # Function: Uncharacterized membrane-anchored protein # Organism: Fusobacterium nucleatum # 1 186 1 186 186 292 100.0 2e-79 MSNSVKKILLIINIIILFVITGFSAMKEENYKKLDSYFYLELAPVDPRSILQGDYMTLNY DITDKVSDFIYNNRTYIYDGENENEVEEIRELRKLADAKKAYIAVRLDENKVAKFVKVTK EKTDEKDLLFIAYKTDGFDVNINANSYLFQEGTGNKYQDARYSKVVLVGDKLRLVDLRDK DFKEIK >gi|296154314|gb|ADVK01000037.1| GENE 37 35230 - 37035 1794 601 aa, chain - ## HITS:1 COG:FN2002 KEGG:ns NR:ns ## COG: FN2002 COG4984 # Protein_GI_number: 
19705298 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 32 601 1 570 570 754 98.0 0 MFEKIKRFFLLFSVIFLIAGVTSFTAYNWENMSNIEKLAVPSVLIIVGLVAYLFLKKEIY KNLAIFFSSFMIGTLFAVYGQVYQTGADVWILFRNWAIFLIIPMVVTGYYSLMILFSIVT AISTGFYLDLYLSGDIVPFLSSLIFGIILIVYPFLQKSFKFKFNNIFYNIMIGIFYICFM VSGSIAINANDYGFIAIILYLAFVGVVYLVAYGQLKKITVKVLSITALGVFGVAFIMKMI KNIFFADITVYILLSLLVIIGTIAGVVKSVSKLENENIKKFTNLVVGFLKILAFFLLIAL VFSLLSSMGLEEGALIVVSIILIIFSYFAARMLKLEKDKLEIVAFIAGLICLGGYLRFYL EMKSLTVLLIVTIIYDVFWFTMPTRALDLLLLPLHYFLLGDFLIEKLEYVDYYYIIIFVA LIIEGYFIYNKKLLSNEKIKRILCGNEFTLLVMSTVFYYTMGAATFLIAEVIDLPSYAGY YNVVLVVFTAIIGLFIIFKEIKNPTLKIVLSLMWIALNYFAYSETLGLAVTLLLMLIYAF RESKWGLAVSTLATVYVIFAYYISFYKTLLDKSIALNISGGLLLVAYLVLKYGFKGVEDN E >gi|296154314|gb|ADVK01000037.1| GENE 38 37174 - 37722 177 182 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163764517|ref|ZP_02171573.1| ribosomal protein L32 [Bacillus selenitireducens MLS10] # 2 180 4 182 190 72 28 5e-12 MKIKNMLYAAMFAAIVAVLGLMPPIPLPFIPVPITLQTMGVMLAGSFLGKRLGFVSMLLI VVIVLLGVPILSGGRGGMAILAGPTGGFFIVWPFAAFLIGFLVEKSWKNINIAKYIVANI IGGIVLVYLVGAIYLSYITKMLIDKAFLATMAFIPGDILKAIVVSVLCYKLKEISPINEV VR >gi|296154314|gb|ADVK01000037.1| GENE 39 37732 - 38526 990 264 aa, chain + ## HITS:1 COG:FN2004 KEGG:ns NR:ns ## COG: FN2004 COG1122 # Protein_GI_number: 19705300 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type cobalt transport system, ATPase component # Organism: Fusobacterium nucleatum # 1 264 1 264 264 451 98.0 1e-127 MIQVENLSFSYQNNKVFKNLSFSIKKGEYLCIIGKNGSGKSTLAKLLAGLIFQQEGSIKI SGYDTKNQKDLLEIRKLVGIIFQNPENQIINTTVFDEVVFGLENLATPRENIKEIAENSL KSVALLEYKDRLTYQLSGGEKQRLAIASVLAMGTEILIFDEAISMLDPVGKKEVLKLMKE LNSQGKTIIHITHNRNDILEASEVMVLSNREIKYQDNPYKIFEDDEFNPFLIKIKNILEK NNIRVDDKNINMEDLVRLVYENIS >gi|296154314|gb|ADVK01000037.1| GENE 40 38510 - 39325 203 271 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|119503196|ref|ZP_01625280.1| Ribosomal protein S16 [marine gamma proteobacterium HTCC2080] # 1 234 1 247 
305 82 26 5e-15 MKISLKNVGYEYPTFENNKNGIYDVSLEIDSHKRIAIVGHTGSGKSTLLKLIKGLLKKQT GEINIGGKIEDIGYIFQYPEHQIFETTIFKDISYGLKRLKLNEKEVLERVEKVLELVGLD KDYLHHSTLNLSGGEKRRVALAGVLIMQPQLLLLDEATVGLDLNGKEQLFKILLDWQKEE NKSFLFITHDMNDVSEYAEEVIVMDKGKLLYHTNPSELFEKYSDELESLGLELPECISFF NKLNQKLKNPIKISGDIKEESILKVIEEKIK >gi|296154314|gb|ADVK01000037.1| GENE 41 39508 - 40311 386 267 aa, chain + ## HITS:1 COG:FN2006 KEGG:ns NR:ns ## COG: FN2006 COG0619 # Protein_GI_number: 19705302 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type cobalt transport system, permease component CbiQ and related transporters # Organism: Fusobacterium nucleatum # 1 248 1 248 266 372 96.0 1e-103 MNIILGEYIKKESILHQLDPRTKIIGSFSLILSFLFINTFIGYIITGILAFTLILLSKIP LKEFLKSLKYLLYILIFSSIFHILSHQDGKLLLQLGKFSIYDSGLFSALKIIFRIVFLLI FSSLLILTTKPLDIALGLETLLSPLKKIGLPIQDFSLMISITLRFIPTILQEANTIKMAQ QARGESFEGKNPFKKLYQYSLILLPLLVSVIQKVENLTLAMEARAFHCGLERTNFHKLEF TKKDYTAGIFIFLTIFLFLFLILLHLL >gi|296154314|gb|ADVK01000037.1| GENE 42 40256 - 40807 719 183 aa, chain - ## HITS:1 COG:FN2007 KEGG:ns NR:ns ## COG: FN2007 COG0386 # Protein_GI_number: 19705303 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Glutathione peroxidase # Organism: Fusobacterium nucleatum # 1 183 17 199 199 321 92.0 6e-88 MKIYDFTVKNRKGEDISLENYKGKVLLIVNTATRCGFTPQYDELENLYEKYNKEGFEVLD FPCNQFGNQAPESNEEIHNFCQLNYKVKFDQFAKVEVNGENAIPLFKYLKEEKGFAGFDP KHKLTSILTEMLSKNDPDFAKKSNIKWNFTKFLVDKSGNIVARFEPTTSAEELEKEIKKL LEI >gi|296154314|gb|ADVK01000037.1| GENE 43 41159 - 42301 1819 380 aa, chain + ## HITS:1 COG:YPO1343 KEGG:ns NR:ns ## COG: YPO1343 COG0614 # Protein_GI_number: 16121623 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Yersinia pestis # 39 378 37 376 378 276 40.0 4e-74 MRLKTILKSIFIVLFFSFLFISCGEKKEAEEVVNDTAEKITFVDLAGREVTLDKPADRIF LGFYEESYLAVAKDFSKVVSISKAEWADFFTGQYKAYEEQMPSIKDMIDTGSIYKASFSM 
ETLLNSRPEVAILAPFQYETLAENIKKLEDSGIKVVVIDYNSQTLEKHMQSTRILGMITG NQERAEKLALNYEKAIKEVEERVAKVDPKKRVYIELGNLGPKEIGNSYGDYLWGSLAKVA GGNNIAEGKVESYGPLDPEFILSSNPEMILLAGSRWSNDAGDRVLIGFNVNPEETWVRIK PYLERAGWDKLDAVKNGQVFAVDHGGLRSIYDYVYVQYIAKSLYPDLFQDIDPVKNLEEF YGEYLPVKPNGTFMTQYQVK >gi|296154314|gb|ADVK01000037.1| GENE 44 42394 - 43431 1356 345 aa, chain + ## HITS:1 COG:MA2149 KEGG:ns NR:ns ## COG: MA2149 COG0609 # Protein_GI_number: 20090992 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Methanosarcina acetivorans str.C2A # 11 343 17 350 355 240 44.0 3e-63 MEHKLTGEEMYRKINQKRRLTSLFTLIAILIALLFDLFIGSSGMSLKDIIVVLWQGPGIK SIETSIIWNIRLPMTLICLTVGASLGLAGTQMQTILANPLASPYTLGVSSAAGFGAAIAF ISGFPFKNMPWVNAPFMAFMMTLTGTMAIYFLGKVKGMRAQSMVLFGIVTHFFFQALLSL VQFRSTPEVAGQIVYWMFGSLLKATWVGVFASGFIFILCALLLSRYAWKLTALSAGEERA KSLGIDTDRVRLHVFLISSLLTAGAVAFVGTIGFIGLVAPHFARYFAGEDQRYLAPMASL FGVLLITFASILAKLIIPGIIIPIGIVTSLVGVPFLVFLIIRKGV >gi|296154314|gb|ADVK01000037.1| GENE 45 43433 - 44197 245 254 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 4 220 2 226 245 99 28 6e-20 MLQLEIKNIAVNYGKKEVIKDVSCIFNGGDVISLIGPNGTGKTTILKAIAKLISHHGEIK IIQDSEVKTFRESITYVPQMSVNIVNLTVFEMVLLGRVKDLTWKVEKVHLDAVAQILEEL HLTPLSYTKFSSLSGGQKQMVIMAQALVSAPKVLLLDEPTSALDLKHQLQVMEIARNYTK KTNAITILVLHDIALATRYSDQLLLLHEGYSMKQGTVQEVIRPELLEKVYEVKLDVSRSE KGYITVTPISTINE >gi|296154314|gb|ADVK01000037.1| GENE 46 44209 - 45612 1421 467 aa, chain + ## HITS:1 COG:TM0815 KEGG:ns NR:ns ## COG: TM0815 COG0534 # Protein_GI_number: 15643578 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Thermotoga maritima # 13 467 5 454 464 137 25.0 6e-32 MISLIQFFSSSVLLKQEKTEQEIPDKKTINKEYWKVAFPAAIEGVLLNLMLLADLIMVGM LGIEKAAAVGIVSQPKMILQMIVSAAGVAITAIVARRKGEGDEEGLNSCIKQSLLLLGLL YFLFVCLSFIFSKNIVSFAGANEDYIEYASIYFQYIALSVFFKALCVVLSSAQIGVGNTK 
IVLISGMIGNALNVLLNYLLIFGKYGFPEMGIQGAAIATVIGNAVIFVILLYSVTKGDYG INILKKGSYHFTKKVLAPLQEIGTNSFLEHIFERIGLFIFARMIASLGTVAMGTHHYCIL LWDLYYYFGVGMSSASASFTGRKLGEKRKDLAILYMRAAQYSGLWISIFVGIIFYLLKNS IFSLMISDERVILLGSSVMMIISLLIIPQTQAQVTAGVLRGAGDNRFIAIYSLFISAILR PCLAYIFAFVWKLGLVGIWIAFFSDEFLKMLLAQYRIQKGVWLQKKI >gi|296154314|gb|ADVK01000037.1| GENE 47 45662 - 46264 407 200 aa, chain + ## HITS:1 COG:PA4836 KEGG:ns NR:ns ## COG: PA4836 COG0500 # Protein_GI_number: 15600029 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Pseudomonas aeruginosa # 32 163 32 163 263 68 25.0 8e-12 MSKSLLFDLKVLEFKLKESLKLYEETKIEENFELLKANIDELCSFIIKKENHLSFFQTTE NENIRTYVVSIRDLSTKILGIIEKEEARKILEDANSCFQYGEELKLTVKQEIHDYKMTSQ DHVLFVGSGSMPITAFTIAKETGAEITCVDIDKKALDLSKKVAIKLGFPNIIFENELHPK SWTQDWRCSSIIPLFLLYIS >gi|296154314|gb|ADVK01000037.1| GENE 48 47436 - 48158 361 240 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 235 1 239 245 143 34 2e-33 MIEFKNISKSYGNQEVIKDFNLTIECGTFLTIIGSSGSGKTTILKMINGLIKADKGEVLI NDKNIQDEDLIELRRKIGYVIQGNILFPHLTVFDNIAYVLNLKKYDKKEIEKIVNEKMDM LNLSRDLKDRLPDELSGGQQQRVGIARALAASPDIILMDEPFGAVDAITRYQLQKDLKEL HKKTEATIVFITHDITEALKLGTKVLVLDKGEIQQYDIPKNICSNPKNEFVKQLLKMAEM >gi|296154314|gb|ADVK01000037.1| GENE 49 48158 - 49699 2060 513 aa, chain - ## HITS:1 COG:FN2009_2 KEGG:ns NR:ns ## COG: FN2009_2 COG1732 # Protein_GI_number: 19705305 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) # Organism: Fusobacterium nucleatum # 207 513 1 307 307 582 98.0 1e-166 MINQLIKLLTEDFKFFLNLTIEHILISLLAISIASVLGIILGIIISEYRKFSGLILGAVN ILYTIPSIALLGFFITITGVGNTTALIALIIYALLPIIRSTYTGIITINPLIIEASEGMG STKLQQLFKVKIPLALPVLMSGIRNMVTMTIALAGIASFVGAGGLGVAIYRGITTNNSAM 
TFLGSLLIAILALVFDFILGLIEKRLTNHKRVKYKINLKVIILGLFIVIFGAYFSLNSKK DKTINIATKPMTEGYILGQMLTELIEQDTDLKVNITNGVGGGTSNIHPAIVKGEFDLYPE YTGTSWEAVLKKEASYDESKFDELQKEYKEKYNLEYVNLYGFNNTYGLAVNKDIAKKYNL KTYSDLAKVSNNLIFGAEYDFFEREDGYKELQKVYNMNFKKQIDMDIGLKYQAMKDKKID VMVIFTTDGQLAISDVVVLEDDKKMYPSYRAGTVIRSEILSEYPELKPVLEKLNNILDDK TMADLNYQVESEGKKPEDVAREYLQEKGLLEAR >gi|296154314|gb|ADVK01000037.1| GENE 50 49659 - 50141 544 160 aa, chain - ## HITS:1 COG:FN2010 KEGG:ns NR:ns ## COG: FN2010 COG1846 # Protein_GI_number: 19705306 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Fusobacterium nucleatum # 1 160 1 160 160 255 99.0 2e-68 MQRLGGFLITKLKQLHSRSLAQCISEQGIDAFSGEQGKILFVLWQKDKVTQKELASETGL AKNTITVMLEKMEKNNLIRRITDENDKRKSLVILTDHAKSLKKCSDKISDEMTKKMYEGF SEEEIDKFEEYLHRIIKNFEEKRKVISDDKSIDKIIDRRF >gi|296154314|gb|ADVK01000037.1| GENE 51 50418 - 53081 3623 887 aa, chain - ## HITS:1 COG:FN2011 KEGG:ns NR:ns ## COG: FN2011 COG0525 # Protein_GI_number: 19705307 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Valyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 885 1 885 887 1779 99.0 0 MNELDKNYSPNEIEEKWYKIWEDSKYFAASLSSEKENYSIVIPPPNVTGILHMGHVLNNS IQDTLIRYNRMTGKNTLWMPGCDHAGIATQNKVERKLAEDGLKKEDIGREKFLEMTWDWK EKYGGIITKQLRKLGASLDWDRERFTMDEGLSYAVRKIFNDLYHDGLIYQGEYMVNWCPS CGTALADDEVDHEEKDGHLWQIKYPVKDSDEYIIIATSRPETMLADVAVAVHPEDERYKH LIGKTLILPLVNREIPVIADEYVDKEFGTGALKITPAHDPNDYNLGKKYNLPIINMLTPD GKIVEDYPKYAGLDRFEARKKIVEDLKAQDLFIKTEHLHHAVGQCYRCQTVIEPRVSPQW FVKMKPLAEKALEVVRNGEVKILPKRMEKIYYNWLENIRDWCISRQIWWGHRIPAWYGPD RHVFVAMDEAEAKEQAKKHYGHDVELSQEEDVLDTWFSSALWPFSTMGWPEKTKELDLFY PTNTLVTGADIIFFWVARMIMFGMYELKKIPFKNVFFHGIVRDEIGRKMSKSLGNSPDPL DLIKEYGVDAIRFSMIYNTSQGQDVHFSTDLLGMGRNFANKIWNAARFVIMNLEGFDVKS VDKTKLDYELVDKWIISRLNETAKDVKDCLEKFELDNAAKAVYEFLRGDFCDWYVEIAKI RLYNDDEDKKISKLTAQYMLWTILEQGLRLLHPFMPFITEEIWQKIKVDGDTIMLQQYPV ADDSLIDVKIEKSFEYIKEVVSSLRNIRAEKGISPAKPAKVVVSTSNSEELETLEKNELF IKKLANLEELTCGTGLEAPSQSSLRVAGNSSVYMILTGLLNNEAEIKKINEQLAKLEKEL 
EPVNRKLSDEKFTSKAPQHIIDRELRIQKEYQDKIKKLKESLKSFVK >gi|296154314|gb|ADVK01000037.1| GENE 52 53111 - 53218 156 35 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MIYQVANRIEELDPEKKGTDVGIYEKKIIEYEYYE >gi|296154314|gb|ADVK01000037.1| GENE 53 53496 - 53678 327 60 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|256846926|ref|ZP_05552380.1| ## NR: gi|256846926|ref|ZP_05552380.1| conserved hypothetical protein [Fusobacterium sp. 3_1_36A2] conserved hypothetical protein [Fusobacterium sp. 3_1_27] conserved hypothetical protein [Fusobacterium sp. 3_1_36A2] conserved hypothetical protein [Fusobacterium sp. 3_1_27] # 5 60 43 98 324 81 89.0 2e-14 MLLVFSYSYSAIMPETDWKKKDLKGKVKTMTETTYKYEDNEVKKKKTIFNENGYIIEELH >gi|296154314|gb|ADVK01000037.1| GENE 54 53760 - 54344 729 194 aa, chain - ## HITS:1 COG:FN2013 KEGG:ns NR:ns ## COG: FN2013 COG0218 # Protein_GI_number: 19705309 # Func_class: R General function prediction only # Function: Predicted GTPase # Organism: Fusobacterium nucleatum # 1 194 1 194 194 352 99.0 3e-97 MKIKKADFVKSAVYEKDYPEQLDKMEFAFVGRSNVGKSSLINSLTSRLKLARTSKTPGRT QLINYFLINDEFYIVDLPGYGFAKVPKEMKKQWGQTMERYIASKRKKLVFVLLDIRRVPS DEDIEMLEWLEYNEMDYKIIFTKIDKLSNNERAKQLKAIKTRLIFDNEDVFFHSSLTNKG RDEILNFMEEKLNN >gi|296154314|gb|ADVK01000037.1| GENE 55 54360 - 56666 3088 768 aa, chain - ## HITS:1 COG:FN2014 KEGG:ns NR:ns ## COG: FN2014 COG0466 # Protein_GI_number: 19705310 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ATP-dependent Lon protease, bacterial type # Organism: Fusobacterium nucleatum # 1 768 1 768 768 1415 99.0 0 MLKAPFLPIRDLVIFPNVVTPIYVGRANSIATLEKAIANKTKLVLGLQKDASQENPTFDG DIYEVGVIANIVQIIRMPNNNIKVLVEAEDRVKIKNIEKEENEYVTTYTVIKETLKDSKE TEAIYRKVFTRFEKYVSMIGKFSSELILNLKKIEDYSNGLDIMASNLNISSEKKQEILEI SNVRDRGYRILDEIVAEMEIASLEKTIDDKVKNKMNEAQRAYYLKEKISVMKEELGDFSQ DDDVIEIVDRLKNTELPKEVREKLEAEVKKLTKMQPFSAESSVIRNYIEAVLDLPWNSET NDVLDLKKASQILERDHYGLKDAKEKVLDYLAVKKLNPSMNGVILCLSGPPGIGKTSLVK SIAESMGRKFVRVSLGGVRDEAEIRGHRRTYVGSMPGKIMKAMKEAGTNNPVMLLDEIDK 
MSNDFKGDPASAMLEVLDPEQNKNFEDHYIDMPFDLSKVFFVATANDLRNVSVPLRDRMD ILQLSSYTEFEKLHIAQKFLLKQAQKENGLANIDIKIPDKVMFKLIDEYTREAGVRNLKR EIITICRKLAREVVEKDTKKFNLKPTDLEKYLGKAKFRPEKSRKATGKIGVVNGLAWTAV GGVTLDVQGVDTPGKGEVTLTGTLGNVMKESASVAMTYVKANLKKYPPKDKDFFKDRTIH LHFPEGATPKDGPSAGITITTAIVSVLTNKKVRQDIAMTGEITITGDVLAIGGVREKVIG AHRAGIKEVILPEDNRVDTDEIPDELKSTMKIHFAKTYDDVSKLVFVK >gi|296154314|gb|ADVK01000037.1| GENE 56 56676 - 57947 1630 423 aa, chain - ## HITS:1 COG:FN2015 KEGG:ns NR:ns ## COG: FN2015 COG1219 # Protein_GI_number: 19705311 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ATP-dependent protease Clp, ATPase subunit # Organism: Fusobacterium nucleatum # 1 423 1 423 423 800 98.0 0 MSKKMDRCSFCGRTEREVTQLFQGPGDVFICDSCVESCHSLLRDDMYTLAREYENSRDGK SPNNKNYKGQIELLKPVEIKAKLDEYVVGQDEAKKVLSVAVYNHYKRILNNGQDDDGVEL QKSNVLLVGPTGSGKTLLAQTLAKILNVPFAIADATTLTEAGYVGDDVENVLVRLIQACN YDIPNAERGIIYIDEFDKIARKSENVSITRDVSGEGVQQALLKIIEGTKSQVPPEGGRKH PNQELIEIDTKNILFIVGGAFEGLEKIIKARTNKKVIGFGAEVQKQDNMGTEGEFFKKVL PEDLMKQGIIPELVGRLPVITTLDNLDEQTLINILTKPKNAIVKQYQKLCKLEGVKLEFT QEALTEIAKRALKRKMGARGLRAIIEHTMLDIMFELPSNNNIKEITITKETIDNYKKAEI EYK >gi|296154314|gb|ADVK01000037.1| GENE 57 57959 - 58540 966 193 aa, chain - ## HITS:1 COG:FN2016 KEGG:ns NR:ns ## COG: FN2016 COG0740 # Protein_GI_number: 19705312 # Func_class: O Posttranslational modification, protein turnover, chaperones; U Intracellular trafficking, secretion, and vesicular transport # Function: Protease subunit of ATP-dependent Clp proteases # Organism: Fusobacterium nucleatum # 1 193 1 193 193 371 100.0 1e-103 MYNPTVIDNNGKSERAYDIYSRLLKDRIIFVGTAIDETVANSIIAQLLYLEAEDPEKDII MYINSPGGSVTDGMAIYDTMNYIKPDVQTVCVGQAASMGAFLLAAGAKGKRFALENSRIM IHQPLISGGLKGQATDISIHANELLKIKDRLAELLAKNTGKTKEQILRDTERDNYLSSEE AVNYGLIDSVFRR >gi|296154314|gb|ADVK01000037.1| GENE 58 58644 - 59933 1953 429 aa, chain - ## HITS:1 COG:FN2017 KEGG:ns NR:ns ## COG: FN2017 COG0544 # Protein_GI_number: 19705313 # Func_class: O Posttranslational modification, protein 
turnover, chaperones # Function: FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) # Organism: Fusobacterium nucleatum # 1 429 1 429 429 720 99.0 0 MNYEVKKLEKSAVEVKLYLTAEEVKPIVDKVLAHVGEHAEVAGFRKGHAPKEVLMTNYKD HIESDVANDAINANFPEIVDKEKLEPVSYVRLKEINLKDDLNLTFDIDVYPQFELGNYKG LEAEKKSFEMTDDLLKEELEIMVRNHAKLEEVEDAGYKAQLDDTVDLAFEGFMDGAPFPG GKAESHLLKLGSKSFIDNFEEQLVGYTKGQEGEITVKFPAEYHAAELAGKPAQFKVKINA IKKLRQPELNDDFAKELGYASLDELKAKTKEETTKRENDRIENEYVSALLDKLMETTTID VPVSMVQAEIQNRLKELEYQLSMQGFKMDDYLKMMGGNVETFAAQLAPAAEKKVKIDLIL DRIAKDNNFEATDEELNQRMEEIAKMYGMDVPALEEELKKNKNLENFKASVKYDIVMKKA IDEVVKNAK >gi|296154314|gb|ADVK01000037.1| GENE 59 59949 - 61622 1841 557 aa, chain - ## HITS:1 COG:FN2018 KEGG:ns NR:ns ## COG: FN2018 COG0608 # Protein_GI_number: 19705314 # Func_class: L Replication, recombination and repair # Function: Single-stranded DNA-specific exonuclease # Organism: Fusobacterium nucleatum # 2 557 1 556 556 1000 99.0 0 MVCMVKEKSTDELVKELLEKRDHESKNQIEKFINPDYSDFRNPFDFENMEAIVNKIISAR ENKEKIFIYGDYDVDGISGTAFLTKFFNEIGISADCYIPSRKETDYGVSKKSIDYFHKRH GKLVITVDTGYNTIEDVRYAKDLGIEVIVTDHHKTVKEKFDDEILYLNPKLSKNYKFQYL SGAGVAFKLAQGICMSLDLDMEIIYKYLDIVMIGTIADVVPMIDENRLIIKKGLKIIKNT KIKGLSYLLNYLRLNKKTLTTTDVSYYISPLINSLGRVGISRMGADFFLKDDDFDLYNII EEMKEQNKQRRALEKNIFDDAMRKIKNLKIPLDKLSIIFLSSPKWHPGVIGVVSSRLAIK FNIPVVLVAIEGDYGKASCRSVGDISIFNLLSDVRNFLERYGGHDLAAGFVIHKENINKV KKYFIKAIPKMKLEHNKSKKDYEKNFDFELPLEDLGDKTFEFMEKMGPFGSNNPHPLFFD RNLKLDDIKKFGVDFRHFNGIIYKDKVNYNAVGFELAEEISPDYMNKTYNIVYYPEKIIL NDEEVTQIILKSIKENK >gi|296154314|gb|ADVK01000037.1| GENE 60 61622 - 61984 570 120 aa, chain - ## HITS:1 COG:FN2019 KEGG:ns NR:ns ## COG: FN2019 COG0858 # Protein_GI_number: 19705315 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribosome-binding factor A # Organism: Fusobacterium nucleatum # 1 120 1 120 120 187 100.0 5e-48 MKKQRLEGIGKEIMRVISKVLLEEVKNPKIKGLVSVTEVDVTEDLKFADTYFSILPPLKS DEKKYDHEEILEALNEIKGFLRKRVAEEVDIRYTPEIRVKLDNSMENAIKITKLLNDLKV >gi|296154314|gb|ADVK01000037.1| GENE 61 
62001 - 64214 3198 737 aa, chain - ## HITS:1 COG:FN2020 KEGG:ns NR:ns ## COG: FN2020 COG0532 # Protein_GI_number: 19705316 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation initiation factor 2 (IF-2; GTPase) # Organism: Fusobacterium nucleatum # 1 737 1 737 737 1245 99.0 0 MKVRVHELAKKYEIKNKEFLEILKKDIGITVTSHLSNLDEDQVNKIDDYFAKMNMLKVET VEPVKVHKEKKEEKPIRKIMDEDENDEGEGYSQKNNKKTKFQQTKNKKNNNITFEEDGNS HKNKSKKKKGRRTDFVLKTVEATPDVVEEDGIKIIKFRGELTLGDFAEKLGVNSAEIIKK LFLKGQMLTINSPITLSMAEDLAADYDVLIEEEQEVELDFGEKFDLEIEDKVADLKERPP VITIMGHVDHGKTSLLDAIRTTNVVGGEAGGITQKIGAYQVERDGKRITFIDTPGHEAFT DMRARGAQVTDIAILVVAADDGVMPQTVEAISHAKVAKVPIIVAVNKIDKPEANPMKVKQ ELMEHGLVSAEWGGDVEFVEVSAKQKINLDGLLDTILITAEILELKGNNKKRAKGVVLES RLDPKIGPIADILVQEGTLKIGDVIVAGEVQGKVKALLNDKGERVNNATVSQPVEVIGFN NVPDAGDTMYVIQNEQHAKRIVEEVRKERKIQETTKKTISLESLSDQFKHEDLKELNLIL RADSKGSVDALRDSLLKLSNDEVAVSIIQAASGAITESDVKLAEAAGAIIIGYNVRPTTK ALKEAEVSKVEIRTSGIIYHITEDIEKALAGMLEPEYREEYLGRIEIKKVFKVSKVGNIA GCIVIDGKVKNDSNIRILRDNVVIYEGKLASLKRFKDDAKEVVAGQECGLGVENFNDIKD GDVVEAFEMVEVKRTLK >gi|296154314|gb|ADVK01000037.1| GENE 62 64241 - 64771 856 176 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|237742963|ref|ZP_04573444.1| ribosomal protein L7Ae [Fusobacterium sp. 
4_1_13] # 1 176 1 176 176 334 94 9e-91 MSSTHIPERTCIICRAKNGKSKLFRLAKVKEAFYEFDKEQKKQSRAVYVCKSLNCLGKLA KHNKVKLDSQDLMSMLNIINKANKNYLNILNSMKNSGELVFGINLLFENIEHIHFIVMAQ DISKKNEEKVLRRINELKIPYVVVGTMEELGKVFNKEEITVIGIKDKKMARGLIEE >gi|296154314|gb|ADVK01000037.1| GENE 63 64764 - 65837 637 357 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|17988250|ref|NP_540884.1| transcription elongation factor NusA [Brucella melitensis 16M] # 10 355 11 354 537 249 39 2e-65 MKAKDSKNFLEALDELEREKGISKESVLEAIELALLAAYKKNYGEDENVEVIVDRENGDI KVFASKIIVNADDLLDPNKEISLEDAKKIKKRVKVGDTLKFEVNCEDFRRNAVQNGKQIV IQKVREAEREHIFNKFKEREDSIVTGIIRRIDNRKNIFIEIDGIELILPPAEQSISDVYR VGERIKVYILSVEKTNKFPKILISRKNEGLLKKLFEIEIPEITSGIIEIKSVAREAGSRA KVAVYSEVPNIDTVGACIGQRGARIKNIVDELNGERIDIVEWKPVIEEFVSAVLSPAVVS NVTILEDGTARVLVEPSQLSLAIGKNGQNARLAARLTGMRVDIKVIDSEALKEEENE >gi|296154314|gb|ADVK01000037.1| GENE 64 65864 - 66334 528 156 aa, chain - ## HITS:1 COG:FN2023 KEGG:ns NR:ns ## COG: FN2023 COG0779 # Protein_GI_number: 19705319 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 156 1 156 156 259 99.0 1e-69 MEENNQIVEKITRIVNPFIEEMNLSLVDVEYVQDGGYWYVRIFIENLNGDLNIEDCSKLS SKIEDKIEELIEHKFFLEVSSPGLERPLKKLEDYTRFIGEKITLHLKHKLDDKKQFKTII KEVNGDNIIFLMDKKEVEIKFNEIRKANILFEFNDF Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:43:31 2011 Seq name: gi|296154303|gb|ADVK01000038.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00046, whole genome shotgun sequence Length of sequence - 14504 bp Number of predicted genes - 11, with homology - 10 Number of transcription units - 7, operones - 2 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 31 - 294 148 ## COG2801 Transposase and inactivated derivatives + Term 391 - 441 -0.0 - Term 229 - 285 12.2 2 2 Op 1 . - CDS 342 - 518 293 ## FN1890 hypothetical protein - Prom 541 - 600 5.2 3 2 Op 2 . - CDS 617 - 706 179 ## - Prom 726 - 785 10.9 4 3 Tu 1 . 
- CDS 806 - 1528 1109 ## COG0584 Glycerophosphoryl diester phosphodiesterase - Prom 1556 - 1615 7.5 - Term 1596 - 1651 8.0 5 4 Tu 1 . - CDS 1709 - 8815 9559 ## FN1893 hypothetical protein - Prom 8948 - 9007 11.7 - Term 8985 - 9027 6.1 6 5 Tu 1 . - CDS 9029 - 9736 703 ## COG2992 Uncharacterized FlgJ-related protein - Prom 9779 - 9838 10.2 7 6 Op 1 . - CDS 9840 - 10169 407 ## FN1895 hypothetical protein 8 6 Op 2 11/0.000 - CDS 10185 - 11270 1324 ## COG1172 Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 9 6 Op 3 21/0.000 - CDS 11263 - 12282 1455 ## COG1172 Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 10 6 Op 4 . - CDS 12284 - 13867 209 ## PROTEIN SUPPORTED gi|90020817|ref|YP_526644.1| ribosomal protein S16 - Prom 13935 - 13994 14.1 - Term 13938 - 13982 1.1 11 7 Tu 1 . - CDS 14012 - 14503 783 ## FN1899 lipoprotein Predicted protein(s) >gi|296154303|gb|ADVK01000038.1| GENE 1 31 - 294 148 87 aa, chain + ## HITS:1 COG:FN1447 KEGG:ns NR:ns ## COG: FN1447 COG2801 # Protein_GI_number: 19704779 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 86 92 206 207 131 73.0 2e-31 MLNLAFKENENYENLIFHSDQGWQYQHYSYQERLKEKKITQSISRKGNSLDNGLIEDYIY YYNNKRIKEKLKGLTPASYRSQSLLVG >gi|296154303|gb|ADVK01000038.1| GENE 2 342 - 518 293 58 aa, chain - ## HITS:1 COG:no KEGG:FN1890 NR:ns ## KEGG: FN1890 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 58 1 58 58 63 100.0 4e-09 MTKKIENFIDNVIEEKKEQFKTMMGKEHKVENMIKDLKTLNLSNEKLEEVIKVARKYV >gi|296154303|gb|ADVK01000038.1| GENE 3 617 - 706 179 29 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MFKIRNKQFSTYMADVHICLGNIYFYSLT >gi|296154303|gb|ADVK01000038.1| GENE 4 806 - 1528 1109 240 aa, chain - ## HITS:1 COG:FN1891 KEGG:ns NR:ns ## COG: FN1891 COG0584 # Protein_GI_number: 19705196 # Func_class: C Energy production and conversion # Function: 
Glycerophosphoryl diester phosphodiesterase # Organism: Fusobacterium nucleatum # 1 240 22 261 261 442 99.0 1e-124 MKIFAHRGASGYAPENTLTAIKKAIEMKADGIEIDIQLTKDGKIVVIHDWKVDRTTTGRG FVYELDFDYIRSLDAGQWYTKDFVGEVVPTLEEVLDVLPNDMMLNIEIKDTARKHSNIEE KMLEVLKKYPEKFENIIVSSFHHDKIKRLQELEPKLKLALLTDSEFIEIEKYLSTNGLKS YSYHPEINLISKKDVEILHKNDVKVFVWTVNKEEDLDYLVTLGVDGVITNYPDIMKELLM >gi|296154303|gb|ADVK01000038.1| GENE 5 1709 - 8815 9559 2368 aa, chain - ## HITS:1 COG:no KEGG:FN1893 NR:ns ## KEGG: FN1893 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1008 2368 1 1361 1361 2259 99.0 0 MKNNLSKVEKDLRSIARKYKTVRYSIGLAVLFLMLGINAFSEETVTREIIQNSVGNLQAK IETLKVENEKALEGLRLELVQLMEQGNQVVKSPWSSWQFGLNYMYSRWGGAYKGRGDKKE KYPYEGIFARSNDLFLRSISPDSDFYEKYTAASKERLKNSATTSDRKRQGLRGSNYGLEN TLNQQEPIVQIELGASVRPREIVKSPVNVTAPRITVNPVTPFSKPSAPSEPTAPTIDIKG FDPAAPDVEAPDLPVAPTFNIQLGSYRNYMTQNTLGQTSGGRFSGDGKSYDTSENKTVRD TDLGTIPTVIYAWANGSGIGNFDSALLKAYFDYTNKNRGNGGGTLTVEGNLTIDSINPLT DRQKENETNAGRPHNAQPFLVGGARIATLDNARGGATIRNKATVNMIGPLVVGYEIQNDN AGSGKREVINEGTLTDDKEKDLEEIGGLKKGQVGGGKDTPSDTLELRRSQNLGNDKITVT RTRDILNNDGTMKEKGGYTGYKIGMILTQEFDDPDPGNNYYRLINDGRISFMGRNSIGIQ VYAKPDSAPNTIIDVINKENSNGTGEITLGGIESYGLKLSSRILREANGRKSVFENRGKI NIKGGDGSENSLSSGMAVLEDTTMMGNNFAIRAYKGMVVNKGTINVSGGKGNTGMILKVD AEDDITNDTNGIINVSGTANIGMRVDKGAVPTGASGTPEAINNGTINVSGSATKAKDGNI GMVAHKQAKAINNKHITFASGTKYGTGLLAKGAGAKIENKGGNAKITGSGLERTIGMSAL QGTTGLNSGTIDLSGNRVTGVYNEGTFNMTGGLLKASGNQSISLYSKGSSTTTNITGGTI AAGDKAVGLYADGSRIGISNPTKLEAHNGGLMFYNYASNDPTNPSGRFNLTGNVEGDIKS GGTAFYFKGAASNTASFLNQMFNKGQASSTGKLKLKLEDGATLFVLDSPGGAPIKLSTVR TLGTALGDKVDISRTTSNKYKAYTVYKGSLEIDEAVNLDNETDNFYKMDFTASNVTINEN ISVRGSKASKVIIAQANYNGATNSSTIKVTNKGKIDYSGNKSTALATDFGQVTNETSGTI RMSGDNSIGLYGAANSIVTNKGTIEMGKAGVGIWGANNLSNKYANRNINILNSGTIRGIS GKEGVFGIYAKNSHAGATSNISHSGNIDLSQAKKSTGIFMTKGTLNSSGNISVNEGSVGV NAEDSTVNVNGGTHTIGANSIGFNLKGNSSLLANSGNISITGKGSVAYLFEGVNLTSGTN FKDNLTLTATNGYTYINLTNSTLNYKNQKTINNDETIFVNSKNSTVNLLEGNDISSTKNK 
VVGVYSEGGVVSNAGKMTLMGDGSSALYSKGAATVNGAPGKITIGANGSGIYVVNAGSTG SNYGEITIGAGSVGMRAENGKIKNNSTGKISSTAEKATGMSQSGNENLENEGTITLTGNQ SVGMHSESVTAAGHQMINKGIVTVGHSATATSPSIGMYAANTDKTTIVNNGKVIAGNKST GIYGGNITLNNNSETSAGNGGIGVYSKGGTVDIKENAKISVGDTLGDKQEGVGVYLAGNN QTLNSDTDNLTIGKGSFGYVMTGQGNTVRTGKAGTTRMINLTHSSIFMYSADRTGTAVNY NNLRSTGDLNYGIYASGRVDNYGTIDFSQGIGNIGAYSYTKGATTTPNSIRNYGTINVSK SDLQTNPDDRKYGIGMAAGYSEESPAGSGRKVTRGIGSIENHGLIRVTTPDSIGMYATGK GSRIYNGPTGRIELSGRKRNIGIFAENGAEVVNEGTITTVGSGNVGQIGIGITSGATLIN RGNIHVNAARGYGLFVAGGIVKNYGNITVAGGAQKTKEVSASDTSKALGDEGLDRVGIKS PAGASKGTITSNGKVKKPTIVQAIPNRKPSEIPKSSIGMYLDTSGINYTKPINNVGALAG LKQGDLIVGTEAADYTNSKYIQLGQDIIKPYNEMIRKSGIEKWSIYSASLTWMASITQLP DYTIRNAYLVKIPYTVFAGDKNTTRDTYNFTDGLEQRYGVEGLNSREKELFKKLNKIGNN ERILLQQAFDEMMGHQYANVQQRIYATGQILDKEFDYLRNEWKTASKDSNKIKIFGTKGE YKTDTAGVIDYKNEAYGMAYVHENEDIKLGKGIGWYTGIVDNTFKFKDIGKSKEEQIQAK VGLLKSIPFDDNNSLNWTISGDIFVGYNKMHRKYLVVNEIFNAKSKYYTYGIGIKNKISK DFRLSEDFSLVPYGSLNLEYGRVNKIKEKVGEIRLEVKENYYVSVNPEIGAELTYKHLLA SRKTFRMGLGIAYENELGKVANGKNKARVAYTNADWFNIRGEKEDRKGNIKFDLNIGLDN QRVGVTANAGYDTKGHNVRGGLGLRVIF >gi|296154303|gb|ADVK01000038.1| GENE 6 9029 - 9736 703 235 aa, chain - ## HITS:1 COG:FN1894 KEGG:ns NR:ns ## COG: FN1894 COG2992 # Protein_GI_number: 19705199 # Func_class: R General function prediction only # Function: Uncharacterized FlgJ-related protein # Organism: Fusobacterium nucleatum # 33 235 1 203 203 336 99.0 2e-92 MKKYLLAVIFLCLSFLSYSNSTEVLDQDMNTGVITQAKDFAKVKGKSKKKIFIDTLIPTI EKIRVKVEADKQYVISLIEKEILTEEEKLFLNEMFTKYKVKSKSKNDLVHKMVVPPTSFI LGQASLESGWGSSKLAKEGNNLFAIRSTLKDKERTVYLGPNQFYKKYESMEESVEDYIMT LSRHSSYSNLRKAINDGEETIVLVKHLGNYSEVKNIYEQRLTQIITKNNLVKYDD >gi|296154303|gb|ADVK01000038.1| GENE 7 9840 - 10169 407 109 aa, chain - ## HITS:1 COG:no KEGG:FN1895 NR:ns ## KEGG: FN1895 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 109 1 109 109 168 100.0 5e-41 MKNTLKVTIIVLILVVISIILFITGKRHDILIENNSSTGIKYSINGEPYKILDTGKKAEG MTKGIGNVIFIKTNDNKVIEKDLPSDDINIFINEIVNNSENWYKEKDGN 
>gi|296154303|gb|ADVK01000038.1| GENE 8 10185 - 11270 1324 361 aa, chain - ## HITS:1 COG:FN1896 KEGG:ns NR:ns ## COG: FN1896 COG1172 # Protein_GI_number: 19705201 # Func_class: G Carbohydrate transport and metabolism # Function: Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components # Organism: Fusobacterium nucleatum # 22 361 1 340 340 558 100.0 1e-159 MDNNKIKNFLLNNSVPILILIMVAVMFPLSGLSGDYLIREMIQRISRNLFLIMSLLIPIV AGMGLNFGIVLGAMGGQLALILITNWHIMGLQGIFLAMILSIPFSILLGYIGGVILNRAK GKEMITSMILGYFINGAYQLVVLYSMGKIFPVKDKTLLLSSGRGIKNTVDLTEVAKSIDN AIPLRIFRYDIPVLTILFIIALCFFVIWFRKTKLGQDMRAVGQDMEVSKSAGIEVNKVRI YAIVISTVLAGIGQVIYLQNLGTINTYNSHEQIGMFSVAALLIGGASVARATIPNAISGV ILFHTMFVVAPRAGKELMGSAQIGEYFRVFISYGIIALVLIIYEWRRKKEKEREREKAIG F >gi|296154303|gb|ADVK01000038.1| GENE 9 11263 - 12282 1455 339 aa, chain - ## HITS:1 COG:FN1897 KEGG:ns NR:ns ## COG: FN1897 COG1172 # Protein_GI_number: 19705202 # Func_class: G Carbohydrate transport and metabolism # Function: Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components # Organism: Fusobacterium nucleatum # 1 339 1 339 339 558 100.0 1e-159 MLKKFGLPRLIILIFLISTYIIAPFVGIPISTALSDTIIRFGMNAILVLSLMPMIESGAG LNFGMPLGIEAGLLGSLLSIELGFSGFVGFVLAIILAIVFAFVFGWAYGVILNRVKGGEM MIATYIGFSSVAFMCIMWIVLPFRKADMIWAYGGSGLRTTISVESYWKGVLNNIFGKISQ AIPVGEIIFFLLLAFLMWLFFRTKSGLSMSAVGKNEKFAQATGINADKSRKQSVIISTVI AAIGIIVYQQSFGFIQLYLAPFNMAFPAIAAILIGGASVNRVTIWHVMIGTFLFQGILTM TPTVVNALIKTDMSETIRIIVSNGMILYALTRKEGGSRG >gi|296154303|gb|ADVK01000038.1| GENE 10 12284 - 13867 209 527 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020817|ref|YP_526644.1| ribosomal protein S16 [Saccharophagus degradans 2-40] # 261 518 2 236 318 85 27 3e-16 MSDTILKIENLSKSFGDNTVLKDINLELKEGEILGLVGENGAGKSTLMKIIFGMEVIRET GGYNGKIFFDGKEVNFLSPFDALNAGIGMVHQEFSLIPGFKVSENIVLNRESTKNNLMTH FFGEGISKIDQKDNTKRAQEAILKLGVNLTGQEQINEMAVAYKQFTEIAREIERESTKLL VLDEPTAVLTEDEAQILLETMKKLANKGITIIFITHRLNEIMTVSDKVTVLRDGQLINTV 
PTKSTNVNEITEWMIGRKVSTSSEGKNTDNSNVENLMEIKDLWVDMPGEMLKGLDLDIKK GEILGLGGMAGQGKIAVANGVMGLFKTKGNIKYKGEDLVLNKPTYPLEKGIFFVSEDRKG VGLLLDESIERNIAFPAMEIKGLFLKKYLGLINVIDDKAVTDNAKKYIEKLEIKSMSEKQ KVAELSGGNQQKVCVAKAFTMEPDLLFVSEPTRGIDVGAKQLVLETLKEYNRERNTTIVV TSSEIEELRNICDRIAIINEGKVAGILSATASILDFGKLMSGIKGGE >gi|296154303|gb|ADVK01000038.1| GENE 11 14012 - 14503 783 163 aa, chain - ## HITS:1 COG:no KEGG:FN1899 NR:ns ## KEGG: FN1899 # Name: not_defined # Def: lipoprotein # Organism: F.nucleatum # Pathway: not_defined # 1 163 254 416 416 325 98.0 5e-88 TNDAQTEPLLKQIAAHGGYFIEADLPSPTMGYPGALGIEFTDDEKGNWPKILEKVEKAVV DAGGSGRMGTWAFSYNFSGIEGLTDLAIKSIEAGDRDFTLDKVLASLDTATPGSKWNGSL MKNNNGVDIPNSFFVYQDTYVFGKGYMGITSVEVPEKYGKIGN Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:44:56 2011 Seq name: gi|296154178|gb|ADVK01000039.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00049, whole genome shotgun sequence Length of sequence - 140606 bp Number of predicted genes - 126, with homology - 123 Number of transcription units - 44, operones - 33 average op.length - 3.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 1/1.000 + CDS 6 - 800 1124 ## COG0561 Predicted hydrolases of the HAD superfamily 2 1 Op 2 . + CDS 858 - 1691 968 ## COG0037 Predicted ATPase of the PP-loop superfamily implicated in cell cycle control 3 1 Op 3 . + CDS 1700 - 1849 173 ## 4 1 Op 4 . + CDS 1860 - 3170 1809 ## COG1022 Long-chain acyl-CoA synthetases (AMP-forming) + Prom 3272 - 3331 10.5 5 2 Op 1 1/1.000 + CDS 3377 - 4357 1153 ## COG0204 1-acyl-sn-glycerol-3-phosphate acyltransferase 6 2 Op 2 . + CDS 4380 - 5066 801 ## COG0670 Integral membrane protein, interacts with FtsH 7 2 Op 3 . + CDS 5085 - 5867 1133 ## FN0865 hypothetical protein + Term 5896 - 5942 7.2 + Prom 5928 - 5987 11.9 8 3 Op 1 . + CDS 6100 - 6510 472 ## FN0863 hypothetical protein 9 3 Op 2 . + CDS 6537 - 6890 539 ## FN0862 hypothetical protein 10 3 Op 3 . + CDS 6935 - 7609 921 ## FN0860 hypothetical protein 11 4 Tu 1 . 
+ CDS 7686 - 8195 585 ## FN0859 hypothetical protein + Term 8220 - 8246 -1.0 + Prom 8231 - 8290 11.7 12 5 Op 1 7/0.000 + CDS 8408 - 9928 1549 ## COG1640 4-alpha-glucanotransferase 13 5 Op 2 4/0.000 + CDS 9993 - 12362 3251 ## COG0058 Glucan phosphorylase 14 5 Op 3 6/0.000 + CDS 12400 - 14226 2274 ## COG0296 1,4-alpha-glucan branching enzyme 15 5 Op 4 7/0.000 + CDS 14259 - 15407 1499 ## COG0448 ADP-glucose pyrophosphorylase 16 5 Op 5 17/0.000 + CDS 15428 - 16591 1353 ## COG0448 ADP-glucose pyrophosphorylase 17 5 Op 6 . + CDS 16608 - 17993 1606 ## COG0297 Glycogen synthase 18 6 Op 1 3/0.000 - CDS 18042 - 18641 778 ## COG1073 Hydrolases of the alpha/beta superfamily 19 6 Op 2 . - CDS 18677 - 19345 605 ## COG0500 SAM-dependent methyltransferases 20 6 Op 3 . - CDS 19362 - 19952 512 ## FN0850 putative cytoplasmic protein 21 6 Op 4 . - CDS 19953 - 21086 1148 ## COG0156 7-keto-8-aminopelargonate synthetase and related enzymes - Prom 21113 - 21172 5.6 22 7 Op 1 . - CDS 21182 - 21796 466 ## FN0848 hypothetical protein 23 7 Op 2 1/1.000 - CDS 21813 - 23612 2001 ## COG0457 FOG: TPR repeat - Prom 23638 - 23697 11.1 - Term 23650 - 23702 1.3 24 8 Op 1 . - CDS 23709 - 24662 1219 ## COG2849 Uncharacterized protein conserved in bacteria 25 8 Op 2 . - CDS 24728 - 25339 531 ## FN0845 hypothetical protein - Prom 25490 - 25549 12.3 + Prom 25437 - 25496 9.5 26 9 Op 1 . + CDS 25532 - 25732 268 ## FN0843 hypothetical protein 27 9 Op 2 . + CDS 25707 - 25820 173 ## gi|254303859|ref|ZP_04971217.1| hypothetical protein FNP_1519 28 9 Op 3 . + CDS 25864 - 26097 284 ## FN0842 hypothetical protein + Term 26207 - 26260 15.3 - Term 26294 - 26351 0.1 29 10 Op 1 . - CDS 26594 - 27580 1025 ## COG0582 Integrase 30 10 Op 2 . - CDS 27592 - 28539 843 ## Lebu_0718 hypothetical protein - Prom 28612 - 28671 6.3 - Term 28661 - 28697 1.7 31 11 Op 1 . - CDS 28752 - 29048 477 ## FN0836 hypothetical protein 32 11 Op 2 . - CDS 29065 - 29736 921 ## FN0835 hypothetical protein - Prom 29804 - 29863 5.5 33 12 Op 1 . 
- CDS 29865 - 31385 1828 ## FN0834 hypothetical protein 34 12 Op 2 . - CDS 31386 - 33005 2012 ## FN0833 hypothetical protein - Prom 33027 - 33086 16.7 35 13 Op 1 . - CDS 33089 - 33553 632 ## FN0832 hypothetical protein 36 13 Op 2 . - CDS 33577 - 34125 514 ## Exig_0313 hypothetical protein 37 13 Op 3 . - CDS 34139 - 34723 601 ## Msm_1749 hypothetical protein 38 13 Op 4 . - CDS 34739 - 36754 2378 ## COG4930 Predicted ATP-dependent Lon-type protease 39 13 Op 5 . - CDS 36763 - 39276 2956 ## TepRe1_0534 hypothetical protein - Prom 39301 - 39360 8.6 - Term 39312 - 39357 4.1 40 14 Op 1 . - CDS 39362 - 41377 2251 ## COG1479 Uncharacterized conserved protein 41 14 Op 2 . - CDS 41428 - 43236 1911 ## Mpet_1790 hypothetical protein - Prom 43271 - 43330 9.8 + Prom 43211 - 43270 10.4 42 15 Tu 1 . + CDS 43300 - 43419 98 ## + Term 43524 - 43563 1.7 43 16 Op 1 . - CDS 43447 - 44901 1256 ## gi|296328620|ref|ZP_06871137.1| possible AbiZ 44 16 Op 2 . - CDS 44908 - 48555 4627 ## COG1002 Type II restriction enzyme, methylase subunits 45 16 Op 3 . - CDS 48580 - 49911 1479 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen - Prom 49931 - 49990 7.1 - Term 49949 - 49991 7.7 46 17 Tu 1 . - CDS 49998 - 51092 1027 ## BF1979 putative DNA repair ATPase - Prom 51135 - 51194 6.3 - Term 51120 - 51172 0.3 47 18 Op 1 . - CDS 51299 - 52873 1475 ## Cag_1611 putative DNA repair ATPase 48 18 Op 2 . - CDS 52887 - 53054 158 ## gi|296328625|ref|ZP_06871142.1| conserved hypothetical protein 49 18 Op 3 . - CDS 53047 - 54159 1424 ## Pnuc_1118 SMC domain-containing protein - Prom 54205 - 54264 18.0 - Term 54259 - 54297 5.4 50 19 Tu 1 . - CDS 54304 - 58056 4932 ## TepRe1_0530 hypothetical protein - Prom 58085 - 58144 8.7 + Prom 58073 - 58132 10.2 51 20 Op 1 . + CDS 58211 - 58657 545 ## gi|296328628|ref|ZP_06871145.1| conserved hypothetical protein + Prom 58790 - 58849 7.0 52 20 Op 2 . 
+ CDS 58886 - 59179 445 ## FN0829 hypothetical protein - Term 59097 - 59134 1.1 53 21 Op 1 36/0.000 - CDS 59284 - 60510 346 ## PROTEIN SUPPORTED gi|163788031|ref|ZP_02182477.1| 50S ribosomal protein L9 54 21 Op 2 24/0.000 - CDS 60507 - 61169 342 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 55 21 Op 3 . - CDS 61198 - 62340 1289 ## COG0845 Membrane-fusion protein 56 21 Op 4 . - CDS 62353 - 63666 1479 ## FN0825 putative cytoplasmic protein 57 21 Op 5 . - CDS 63670 - 64521 802 ## FN0824 DeoR family transcriptional regulator 58 21 Op 6 1/1.000 - CDS 64540 - 66342 528 ## PROTEIN SUPPORTED gi|149914878|ref|ZP_01903407.1| 30S ribosomal protein S2 59 21 Op 7 . - CDS 66363 - 66881 615 ## COG0703 Shikimate kinase - Prom 66912 - 66971 10.9 - Term 66924 - 66971 3.3 60 22 Op 1 . - CDS 66984 - 67871 1105 ## FN0821 hypothetical protein - Prom 67909 - 67968 9.2 61 22 Op 2 1/1.000 - CDS 67970 - 69349 454 ## PROTEIN SUPPORTED gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1 - Prom 69384 - 69443 12.1 62 23 Tu 1 . - CDS 69464 - 71461 2612 ## COG0457 FOG: TPR repeat - Prom 71607 - 71666 11.8 + Prom 71503 - 71562 14.3 63 24 Tu 1 . + CDS 71727 - 72002 494 ## COG0776 Bacterial nucleoid DNA-binding protein + Term 72007 - 72063 6.2 - Term 71883 - 71923 0.1 64 25 Op 1 35/0.000 - CDS 72042 - 72821 187 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 65 25 Op 2 33/0.000 - CDS 72821 - 73879 1258 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 66 25 Op 3 2/0.125 - CDS 73891 - 75411 2046 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component 67 25 Op 4 . - CDS 75438 - 75866 699 ## COG1720 Uncharacterized conserved protein - Prom 76010 - 76069 10.8 - Term 76038 - 76078 2.2 68 26 Op 1 . - CDS 76102 - 77055 930 ## FN0811 hypothetical protein 69 26 Op 2 . 
- CDS 77070 - 78092 1495 ## COG2008 Threonine aldolase 70 26 Op 3 11/0.000 - CDS 78135 - 81056 3243 ## COG0610 Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 71 26 Op 4 27/0.000 - CDS 81046 - 82413 1270 ## COG0732 Restriction endonuclease S subunits 72 26 Op 5 . - CDS 82406 - 84583 2635 ## COG0286 Type I restriction-modification system methyltransferase subunit 73 26 Op 6 1/1.000 - CDS 84597 - 85049 510 ## COG0219 Predicted rRNA methylase (SpoU class) 74 26 Op 7 . - CDS 85049 - 85669 693 ## COG0406 Fructose-2,6-bisphosphatase - Prom 85718 - 85777 10.1 + Prom 85650 - 85709 11.5 75 27 Op 1 1/1.000 + CDS 85780 - 86517 878 ## COG1212 CMP-2-keto-3-deoxyoctulosonic acid synthetase 76 27 Op 2 1/1.000 + CDS 86537 - 88087 1860 ## COG2385 Sporulation protein and related proteins 77 27 Op 3 1/1.000 + CDS 88099 - 88842 733 ## COG4912 Predicted DNA alkylation repair enzyme + Term 88876 - 88922 1.4 + Prom 88909 - 88968 13.0 78 28 Op 1 2/0.125 + CDS 89065 - 89721 691 ## COG0785 Cytochrome c biogenesis protein 79 28 Op 2 1/1.000 + CDS 89740 - 91224 2113 ## COG0225 Peptide methionine sulfoxide reductase + Term 91419 - 91453 3.1 + Prom 91454 - 91513 11.3 80 29 Op 1 34/0.000 + CDS 91538 - 92248 860 ## COG0765 ABC-type amino acid transport system, permease component 81 29 Op 2 16/0.000 + CDS 92241 - 92969 608 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 82 29 Op 3 1/1.000 + CDS 92999 - 93727 1114 ## COG0834 ABC-type amino acid transport/signal transduction systems, periplasmic component/domain + Term 93742 - 93781 6.1 + Prom 93733 - 93792 9.9 83 30 Tu 1 . + CDS 93848 - 95785 2008 ## COG1523 Type II secretory pathway, pullulanase PulA and related glycosidases - Term 95784 - 95830 10.4 84 31 Op 1 . - CDS 95851 - 97788 2123 ## COG3855 Uncharacterized protein conserved in bacteria 85 31 Op 2 . 
- CDS 97843 - 97977 68 ## - Prom 97997 - 98056 4.2 86 32 Op 1 1/1.000 - CDS 98060 - 100615 3467 ## COG0574 Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 87 32 Op 2 . - CDS 100627 - 101250 686 ## COG0517 FOG: CBS domain + Prom 101293 - 101352 9.6 88 33 Tu 1 . + CDS 101420 - 101833 581 ## FN0794 hypothetical protein + Term 101858 - 101898 0.7 - Term 101834 - 101899 6.5 89 34 Tu 1 1/1.000 - CDS 101912 - 103111 1811 ## COG0786 Na+/glutamate symporter - Prom 103141 - 103200 14.6 - Term 103271 - 103322 10.1 90 35 Op 1 6/0.000 - CDS 103339 - 105360 3305 ## COG2987 Urocanate hydratase 91 35 Op 2 . - CDS 105377 - 106912 2520 ## COG2986 Histidine ammonia-lyase - Prom 107061 - 107120 9.9 + Prom 107140 - 107199 20.6 92 36 Op 1 1/1.000 + CDS 107260 - 108378 1030 ## COG1940 Transcriptional regulator/sugar kinase 93 36 Op 2 . + CDS 108394 - 109236 1080 ## COG1284 Uncharacterized conserved protein 94 36 Op 3 . + CDS 109313 - 109732 345 ## FN0788 hypothetical protein + Term 109735 - 109792 0.3 - Term 109791 - 109836 6.6 95 37 Op 1 29/0.000 - CDS 109852 - 111027 2059 ## COG2025 Electron transfer flavoprotein, alpha subunit 96 37 Op 2 2/0.125 - CDS 111053 - 111841 1195 ## COG2086 Electron transfer flavoprotein, beta subunit 97 37 Op 3 . - CDS 111862 - 113007 1928 ## COG1960 Acyl-CoA dehydrogenases - Prom 113042 - 113101 8.3 + Prom 113129 - 113188 9.2 98 38 Op 1 1/1.000 + CDS 113211 - 113942 635 ## COG4123 Predicted O-methyltransferase 99 38 Op 2 12/0.000 + CDS 113954 - 114712 760 ## COG2966 Uncharacterized conserved protein 100 38 Op 3 1/1.000 + CDS 114733 - 115224 483 ## COG3610 Uncharacterized conserved protein 101 38 Op 4 1/1.000 + CDS 115243 - 116127 923 ## COG0523 Putative GTPases (G3E family) 102 38 Op 5 1/1.000 + CDS 116136 - 117362 1194 ## COG0500 SAM-dependent methyltransferases 103 38 Op 6 . 
+ CDS 117391 - 119193 2416 ## COG0481 Membrane GTPase LepA + Term 119205 - 119236 1.1 - Term 119192 - 119223 1.1 104 39 Op 1 1/1.000 - CDS 119230 - 120213 1444 ## COG2502 Asparagine synthetase A - Prom 120237 - 120296 13.2 105 39 Op 2 1/1.000 - CDS 120300 - 121589 1712 ## COG1362 Aspartyl aminopeptidase - Prom 121657 - 121716 10.0 - Term 121656 - 121705 5.1 106 40 Op 1 1/1.000 - CDS 121719 - 122534 1136 ## COG2849 Uncharacterized protein conserved in bacteria 107 40 Op 2 1/1.000 - CDS 122602 - 122883 495 ## COG3077 DNA-damage-inducible protein J 108 40 Op 3 1/1.000 - CDS 122954 - 123463 621 ## COG0716 Flavodoxins 109 40 Op 4 1/1.000 - CDS 123460 - 124695 1401 ## COG0635 Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 110 40 Op 5 35/0.000 - CDS 124709 - 125485 212 ## PROTEIN SUPPORTED gi|90020817|ref|YP_526644.1| ribosomal protein S16 111 40 Op 6 2/0.125 - CDS 125485 - 126453 977 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 112 40 Op 7 4/0.000 - CDS 126468 - 128630 2899 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 113 40 Op 8 1/1.000 - CDS 128700 - 129590 904 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component - Prom 129645 - 129704 14.1 - Term 129772 - 129805 2.4 114 41 Op 1 1/1.000 - CDS 129819 - 130247 588 ## COG1970 Large-conductance mechanosensitive channel 115 41 Op 2 . - CDS 130277 - 131362 1415 ## COG0482 Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain 116 41 Op 3 . - CDS 131378 - 132001 737 ## FN0764 amino acid transporter LysE - Prom 132134 - 132193 10.6 + Prom 132024 - 132083 14.6 117 42 Tu 1 . + CDS 132120 - 132476 193 ## FN0762 hypothetical protein + Term 132485 - 132531 1.1 118 43 Tu 1 . - CDS 132532 - 133302 998 ## COG1521 Putative transcriptional regulator, homolog of Bvg accessory factor - Prom 133410 - 133469 10.4 + Prom 133284 - 133343 15.9 119 44 Op 1 . 
+ CDS 133552 - 134364 627 ## FN0760 hypothetical protein 120 44 Op 2 1/1.000 + CDS 134366 - 134944 781 ## COG0424 Nucleotide-binding protein implicated in inhibition of septum formation 121 44 Op 3 1/1.000 + CDS 134962 - 136008 1404 ## COG1077 Actin-like ATPase involved in cell morphogenesis 122 44 Op 4 12/0.000 + CDS 136021 - 136566 559 ## COG1386 Predicted transcriptional regulator containing the HTH domain 123 44 Op 5 1/1.000 + CDS 136556 - 137260 783 ## COG1187 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases 124 44 Op 6 31/0.000 + CDS 137269 - 137559 359 ## COG0721 Asp-tRNAAsn/Glu-tRNAGln amidotransferase C subunit 125 44 Op 7 21/0.000 + CDS 137576 - 139039 443 ## PROTEIN SUPPORTED gi|163737840|ref|ZP_02145257.1| 30S ribosomal protein S4 126 44 Op 8 . + CDS 139053 - 140498 1935 ## COG0064 Asp-tRNAAsn/Glu-tRNAGln amidotransferase B subunit (PET112 homolog) Predicted protein(s) >gi|296154178|gb|ADVK01000039.1| GENE 1 6 - 800 1124 264 aa, chain + ## HITS:1 COG:FN0869 KEGG:ns NR:ns ## COG: FN0869 COG0561 # Protein_GI_number: 19704204 # Func_class: R General function prediction only # Function: Predicted hydrolases of the HAD superfamily # Organism: Fusobacterium nucleatum # 1 264 7 270 270 474 99.0 1e-133 MKLVVSDLDGTLLNDDSEVSNETIEMIKKLKENGIEFAIATGRSFNSANKIRKEIGLEIY LICNNGANIYNKNGKMIKNNIMPADLIRKVINFLTENSIGYFAFDGSGINFYVPNDMEID AELLNEHIPHYIKNLEDINNLPALEKILIIEEDTERIYEIKDLVHKNFDNELEIVISADD CLDLNIKGCSKRGGVEYISQELKINPKEIMAFGDSGNDYKMLKFVGHPVAMKDSFMSKRD FENKTDFTNDESGVAKYLQKYFNL >gi|296154178|gb|ADVK01000039.1| GENE 2 858 - 1691 968 277 aa, chain + ## HITS:1 COG:FN0868 KEGG:ns NR:ns ## COG: FN0868 COG0037 # Protein_GI_number: 19704203 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Predicted ATPase of the PP-loop superfamily implicated in cell cycle control # Organism: Fusobacterium nucleatum # 1 277 1 277 277 527 100.0 1e-150 MENLIANDGVGEIAFLNKKEKIEESLRTTYRKKIWKNFIKAIKDFDLIKDGDKIAVGVSG 
GKDSLLLCKLFQELKKDKSKNFEVKFISMNPGFEAIDIDKFKENLIEMGIDCELFDANVW QIAFEEAPNNPCFLCAKMRRGVLYKKVEELGFNKLALGHHFDDIVETTMINMFFAGTVKT MLPKVPSTSGKMDIIRPLAYVREKDIINFMKYNDIQAMSCGCSIESGKVDSKRKEIKFLL QELEMKNPNIKQSIFNAMKNINLDYVLGYTSGNKTKE >gi|296154178|gb|ADVK01000039.1| GENE 3 1700 - 1849 173 49 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNFYLKLLIKILERSMTAKDSEILKKLKSGYDLSNEEKKELEDLIDNLI >gi|296154178|gb|ADVK01000039.1| GENE 4 1860 - 3170 1809 436 aa, chain + ## HITS:1 COG:FN0867_1 KEGG:ns NR:ns ## COG: FN0867_1 COG1022 # Protein_GI_number: 19704202 # Func_class: I Lipid transport and metabolism # Function: Long-chain acyl-CoA synthetases (AMP-forming) # Organism: Fusobacterium nucleatum # 1 435 1 435 606 816 98.0 0 MSIKYLYDRKKIAVTYGEEKYSYADVIKYVNYYSEFLDISKGDRVALMMENRPESIFSFF SIWAKKGIALSLDAGYTVEQLAFVLNDSKPKYIFVSNKVKEVVEKANEQVGNIVKIMVVD EITLPIDYVIKQEEYENDSNEDLAVIVYTSGTTGNPKGVMITYENIKTNMEGVRAVDLVT ETDVILAMLPYHHIMPLCFTLILPMYMGVPIVLLTEISSASLLKALQENRVTVILGVPRV WEMLDKAIMTKINQSSLAKFMFKMALKINSMSIRKTLFSKVHKQFGGNIRLMVSGGAKID KNILEDFRTMGFCAIQGYGMTETAPIITFNVPGRERSDSAGEVIPNVEVKIADDGEILVK GKNVMKGYYNNEQATKEAFDKDGWFHTGDLGKMNGKYLIIIGRKKEMIVLPNGKNIDPND IEAEIIKNTDLIKKLL >gi|296154178|gb|ADVK01000039.1| GENE 5 3377 - 4357 1153 326 aa, chain + ## HITS:1 COG:FN0867_2 KEGG:ns NR:ns ## COG: FN0867_2 COG0204 # Protein_GI_number: 19704202 # Func_class: I Lipid transport and metabolism # Function: 1-acyl-sn-glycerol-3-phosphate acyltransferase # Organism: Fusobacterium nucleatum # 101 326 1 226 226 404 99.0 1e-112 MLKDLIEEKTENTDKKEAKKIIEVPAEMKEKFDIINKYMDERYQKAIDLDSHIELDLGFD SLDIVEFMNFLNSTFEITIVEQDFVENKTISAIIKLINDKAGKLVEKIDKNENLKKIIES DSNVKLPKDARYAKVLKFILSLMFRYYFKYKYRGKENLGEGAGIIVGNHQSYLDAFMLNN AFTYKELGGNYYIATALHFKSNFMKYLAGHGNIILVDANRNLKNTLQAAAKVLKSGKKLL IFPEGARTRDGQLQEFKKTFAILSKELNVPIYPFVLKGAYEAFPYNKKFPKRNNISVQFL EKIEPNHKTVEELVEETKNNIAKNYY >gi|296154178|gb|ADVK01000039.1| GENE 6 4380 - 5066 801 228 aa, chain + ## HITS:1 COG:FN0866 KEGG:ns NR:ns ## COG: FN0866 COG0670 # Protein_GI_number: 19704201 # 
Func_class: R General function prediction only # Function: Integral membrane protein, interacts with FtsH # Organism: Fusobacterium nucleatum # 5 228 1 224 224 307 100.0 9e-84 MYYNMNDIDIRSSNNFLRKVFLYMILGIAISFGTGAYLLYFNQGLLSTLFNYYQFLVIAE LAMVFSISLFINKMSSSLARILFFAYSLVNGITLTVIGLIYAPQVIFYAFMITLTIFIVT AIYGYTTQEDLSSYRRFFIIALISLIILSIINVFMRVGMLEWVITIAGVVIFTGLIAYDV NRIKFISYQLADGDNETIEKMGIIGALNLYLDFINLFIYILRIFGRKK >gi|296154178|gb|ADVK01000039.1| GENE 7 5085 - 5867 1133 260 aa, chain + ## HITS:1 COG:no KEGG:FN0865 NR:ns ## KEGG: FN0865 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 20 260 1 241 241 427 99.0 1e-118 MKKTLLLMFSILCVNTFSYVERNDQVGNRGLELIRESNINQNMGLSKESGSTQIIDVYGG NGKFAKTKGFMIGTTSNFVDYPNITAGVTVAYDKYKYKPDNNDYWGRDYDVNTYFSYKLN KNLFTVGFGYSQARQVAKRGYIGNLEYGRFLTGNTYLYAGVEGENRIYKGEGSENLRFAN YKLGVLRQDTWKKLKFLNGVEVNMDNRKYDIEDRGRGNLTFLSRVSYYIYDDLLFDVQYR GTKNSKFYDSVVGVGFTHYF >gi|296154178|gb|ADVK01000039.1| GENE 8 6100 - 6510 472 136 aa, chain + ## HITS:1 COG:no KEGG:FN0863 NR:ns ## KEGG: FN0863 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 79 136 1 58 58 99 100.0 5e-20 MKLKSTSCLLIILAFIIFIGTNEDSYAKGKKKKAKMQIEMEEYGDVGLITDTPKFKADTE KEFKWANDKTKEEILQRAMEKMNREEKDIKIFFINGVTIKDIDFSTYEMKYIPWGEMEHK SKNGKIVKFYYFEKRM >gi|296154178|gb|ADVK01000039.1| GENE 9 6537 - 6890 539 117 aa, chain + ## HITS:1 COG:no KEGG:FN0862 NR:ns ## KEGG: FN0862 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 81 1 81 81 131 87.0 9e-30 MNRIVYTIGGFILMGISEIFKYYHLSRIFGKTGDLIGVVGLIAAVYGLWPIIKIFLALIG IRDNDSSSSNGTNWLEKLRKNAEKAKIREEKEKKQMEEIERKGRKLNEEIRKERKKN >gi|296154178|gb|ADVK01000039.1| GENE 10 6935 - 7609 921 224 aa, chain + ## HITS:1 COG:no KEGG:FN0860 NR:ns ## KEGG: FN0860 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 140 224 1 85 85 163 100.0 4e-39 MKKILLNLLLLFFLISCGGIKKDPEVVKYIIDKVNTDIEMLIPRGEIDTSRSNALMLREM 
QLAKNIKVEVTDVLSNKEYIELYAKKLEELEAKSPGSYSPLSPEEIKERSKDPNIVYYKY AITADIVNLDEDLKRLHPKMKNFNTILDTKAELFTILSDENISNLKYETVEGFGYFRYNT ETKQGDYTNTVFGISKNFQGYLEIGPFLEARGHEKYNFNTPLEE >gi|296154178|gb|ADVK01000039.1| GENE 11 7686 - 8195 585 169 aa, chain + ## HITS:1 COG:no KEGG:FN0859 NR:ns ## KEGG: FN0859 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 19 169 19 169 169 269 100.0 3e-71 MKKLLIYLLGILGVLLLVSCGEGKVDESRIYHIDFPFYDRDGSTKQITVELYVVRPDGKD YPDGLKNFIKNQVKNYKEPLAKALYLQEDLFYFVPLSVYDVDVQEEINKYLEDNGYSLEI SKGIDSLDISKYRPTRENEQDLYIQLIELGKKDNYFQDRYFLYTYVKNK >gi|296154178|gb|ADVK01000039.1| GENE 12 8408 - 9928 1549 506 aa, chain + ## HITS:1 COG:FN0858 KEGG:ns NR:ns ## COG: FN0858 COG1640 # Protein_GI_number: 19704193 # Func_class: G Carbohydrate transport and metabolism # Function: 4-alpha-glucanotransferase # Organism: Fusobacterium nucleatum # 1 506 1 506 506 936 99.0 0 MKRECGVLLAISSLPSSYGIGDFGKEAYRFIDFLVSSGQSLWQILPLCPVEYGNSPYQSP STFAGNFLYLDLENLVNNEYLTQKDIDVLKQEVSYIDYEYIKSQKESLLRKASQAFFYKK KEEKDFKKFQKENKFWLEDYALFLTLNKKFKGKMWNTWQKEYKFREKKFIEEAKKTYQEE YLYESFIQYYFQKQWKQLKNYANKKGIKIIGDLPIYVATHSADTWQNPKLFCFDKHLKIK LVAGCPPDYFSKTGQLWGNVLYNWKEMKKNDYSWWINRIKHSFLLYDILRLDHFRGFASY WAIRYGEKTAINGKWEKGPRYQFFKKLESKITNMDVVAEDLGRLTADVFKLLEQTKYPNM KVLEFGLAEWDNMYHPRNYLENSVAYTGTHDNMPIVEWYENLNQEEKNICDENLKNFLKD YDTNIWEPIQWRAIEALYASKSNRVIVPLQDILGLGRDSRMNTPSTVGNNWAWRIYWNYR YKDLENKLYNLAKKYRRISKGEDNEN >gi|296154178|gb|ADVK01000039.1| GENE 13 9993 - 12362 3251 789 aa, chain + ## HITS:1 COG:FN0857 KEGG:ns NR:ns ## COG: FN0857 COG0058 # Protein_GI_number: 19704192 # Func_class: G Carbohydrate transport and metabolism # Function: Glucan phosphorylase # Organism: Fusobacterium nucleatum # 1 789 1 789 789 1528 99.0 0 MKFDKEEWKEKLEERLLEKFSVSLKDASPFEVYRALGETVMSFIAKDWYETKQEYSKTKQ AFYLSSEFLMGRALGNNLINLGIDKEIQKFFKELGIDYNQIEDEEEDAALGNGGLGRLAA CFMDSLATLNLPGQGYSIRYRNGIFNQYLRDGYQVEKPETWLKYGDVWSVMRPEDEVIVN FGHTSVRALPYDMPIIGYGTNNINTLRLWEAHSIVDLDLGVFNQQDYLHATQDKTLAEDI 
SRVLYPNDSTDDGKKLRLRQQYFFVSASLQDIIKKFKKVHGREFSKIPEFIAIQLNDTHP VIAIPELMRILVDVEGVLWEDAWEIVKKTFSYTNHTILAEALEKWWIGLYQEVVPRIFQI TEGIHNQFKNELANLYPNDINKQNRMQIIQGNMIHMAWLAIYGSHKVNGVAELHTEILKE HELRDWYELYPEKFLNKTNGITQRRWLLKSNSQLASYITELIGDAWIKDLSELKKLGQFM NDEKVLNKIWDIKIEKKKELVEYLRETQGIDINPKSIFDVQVKRMHEYKRQLLNIFQVYD LYQQLKQNPNMDFTPTTYIYGAKAAPGYKIAKGIIRLINDIAQIINADGDVKEKLKVVFV ENYRVTVAEKIFPAADISEQISTAGKEASGTGNMKFMLNGAITLGTLDGANVEIVKEAGE ENEYIFGMRVKDIDELRKKGYDPRFPYNNITGLKQVVDALIDGKLSDLGSGIYREIHSLL MERGDQYFVLEDFEDYRRKQREINRDYKDKISWAKKMLKNIANAGKFSSDRTILEYANEI WNIKETKIK >gi|296154178|gb|ADVK01000039.1| GENE 14 12400 - 14226 2274 608 aa, chain + ## HITS:1 COG:FN0856 KEGG:ns NR:ns ## COG: FN0856 COG0296 # Protein_GI_number: 19704191 # Func_class: G Carbohydrate transport and metabolism # Function: 1,4-alpha-glucan branching enzyme # Organism: Fusobacterium nucleatum # 1 608 4 611 611 1186 100.0 0 MSGQMEQYLFHRGEFRQAYEYFGAHPTRSSTIFRIWAPSAKSVAVVGDFNDWRAREEDYC HKLTNEGIWEVEIKKIKKGNLYKYQIETSWGEKILKSDPYAFYSELRPQTASIVNGKPKF RWADKRWLNNREIGYAKPINIYEVHLGSWKKKEDGTYYNYKEIAELLVEYMLEMNYTHIE IMPIIEYPFDGSWGYQGTGYYSVTSRYGTPDDFMYFVNYFHKNNLGVILDWVPGHFCKDS HGLYRFDGSACYEYEDPSLGENEWGSANFNVSRNEVRSFLLSNLYFWIKEFHIDGIRMDA VSNMLYYKDGLSENKHSVEFLQYLNQSLHEEYPDVMLIAEDSSAWPLVTKYQADGGLGFD FKWNMGWMNDTLKYMEQDPFFRKSHHGKLTFSFMYAFSENFILPLSHDEIVHGKNSILNK MPGYYEDKLAHVKNLYSYQMAHPGKKLNFMGNEFVQGLEWRYYEQLEWQLLKDNKGSQDI QKYVKALNKLYLEEEALWYDGQDGFEWIEHENINENMLIFLRKTPNMEDFIIAVFNFSGK DHEIYPLGVPLEDGEYEVILDSNEKKFGGSYQGRKRKYKSIKKSWNYREQYIEIKIAKNS AVFLKYKK >gi|296154178|gb|ADVK01000039.1| GENE 15 14259 - 15407 1499 382 aa, chain + ## HITS:1 COG:FN0855 KEGG:ns NR:ns ## COG: FN0855 COG0448 # Protein_GI_number: 19704190 # Func_class: G Carbohydrate transport and metabolism # Function: ADP-glucose pyrophosphorylase # Organism: Fusobacterium nucleatum # 1 382 3 384 384 750 100.0 0 MKKKRIIAMILAGGQGTRLKELTEDLAKPAVAFGGKYRIIDFTLTNCSNSGIDTVGVLTQ YEPRILNNHIGRGSPWDLDRMDGGVTVLQPHTRKNDEKGWYKGTANAIYQNIKFIEEYDP 
EYVLILSGDHIYKMNYDKMLQFHIQKDADATIGVFKVPLVDAPSFGIMNTKDDMSIYEFE EKPKEPKSDLASMGIYIFNWKLLKKYLDEDEKDPNSSNDFGKNIIPNMLNDGKKMFAYPF KGYWRDVGTIQSFWDAHMDLLSEDNELDLFDKSWRVNTRQGIYTPSYFTKESKIKNTLID KGCIVEGEIEHSVIFSGVKIGKNSKIIDSIIMADTEIGDNVTIQKAIIANDVKIVDNIVI GDGKKIAVVGEKKIIDSQSLVK >gi|296154178|gb|ADVK01000039.1| GENE 16 15428 - 16591 1353 387 aa, chain + ## HITS:1 COG:FN0854 KEGG:ns NR:ns ## COG: FN0854 COG0448 # Protein_GI_number: 19704189 # Func_class: G Carbohydrate transport and metabolism # Function: ADP-glucose pyrophosphorylase # Organism: Fusobacterium nucleatum # 1 387 1 387 387 734 100.0 0 MIRSYMAIIYLGDSKKNISPLTKVRALASIPVGGSYRIIDFSLSNVVNAGIRNVGLFCGN EELNSLTDHIGMGAEWDLARKKDGIFIFKRMLDDNFSLNQARISKNMEYFFRSTQQNIVV LNAHMVYNLDVSDLIEKHEASGKEITMVYKKVKKANEHFNHCSSVKIDENNRVIGIGQNL FFKEEENISLDAFVLSKELMLKLLVDSIQEGKYNVLSELIARNLPSLNINAYEFKGYLQC INSTREYFNFNMNLLNQKIRDDVFGIKSGRKIFTKVKDTPPTLFKETADVENSLVSNGCI IEGTVKNSILSRGAIIEKDVVLEECVILQDCHIKKGAHLKNVIVDKNNIIHENEKLSASK EYPLVIEKSMKWDTKQYQDLMDYIRNK >gi|296154178|gb|ADVK01000039.1| GENE 17 16608 - 17993 1606 461 aa, chain + ## HITS:1 COG:FN0853 KEGG:ns NR:ns ## COG: FN0853 COG0297 # Protein_GI_number: 19704188 # Func_class: G Carbohydrate transport and metabolism # Function: Glycogen synthase # Organism: Fusobacterium nucleatum # 1 461 1 461 461 927 99.0 0 MKVLFATGEAFPFVKTGGLGDVSYSLPKTLKQKENVDIRVILPKYSKISNELLKDARHLG HKEIWVAHHNEYVGIEEVELEGVIYYFVDNERYFKRPNVYGEFDDCERFLFFCKAVVETM DITKFKPDIIHCNDWQSALIPIYLKERGIYDVKTIFTIHNLRFQGFFFNNVIEDLLEIDR AKYFQEDGLKYYDMISFLKGGVVYSDYITTVSDSYAEEIKTQELGEGIHGLFQKYDYKLS GIVNGIDKISYPLSKKPHKILKADLQKKLGLDVEEDTPLVVIITRLDRQKGLDYIVEKFD EMMSLGIQFILLGTGEKRYEHFFAYQEYLHKGQVCSYIGFNQELSTEIYAGADIFLMPSV FEPCGLSQMIAMRYGCIPVVRETGGLKDTVKPYNEYTGEGDGFGFKQANADDMIKTLKYA IKMYHRPNVWQEIIKNAKKRDNSWDKPAKRYKELYQRLIEG >gi|296154178|gb|ADVK01000039.1| GENE 18 18042 - 18641 778 199 aa, chain - ## HITS:1 COG:FN0852 KEGG:ns NR:ns ## COG: FN0852 COG1073 # Protein_GI_number: 19704187 # Func_class: R General function prediction only # Function: 
Hydrolases of the alpha/beta superfamily # Organism: Fusobacterium nucleatum # 1 198 1 198 199 320 90.0 1e-87 MKNVVIYIHGKYGTAEEAEYYRKFFNETDIIGFEYTSEYPWDFQKEFSNFIDNIYTKYKK ISIIANSIGAYFTMLSLTNKNIEKAFFISPIVDMEKLIIDMMVSENITEEELYKKKEIKT SFGETISWDYLTFVRKNPIEWNVPTYILYGEKDNLTSYETILNFTNKSKANLTIMKGGEH WFHTAEQMEFLNNWIKNLT >gi|296154178|gb|ADVK01000039.1| GENE 19 18677 - 19345 605 222 aa, chain - ## HITS:1 COG:FN0851 KEGG:ns NR:ns ## COG: FN0851 COG0500 # Protein_GI_number: 19704186 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 1 222 1 222 222 380 98.0 1e-106 MNFNKHYNVYEKYSLAQKQVAKNLLDYMGKSNIFNTQINSIFEIGCGTGIFTKEYRKCFL HSDLILNDIFDVREFIKDIDYNIFIQENIEELDIPKSDLVVSSSVFQWIKDKDSLIRNIA ENTDNLCFSSYVSGNLIEIKNHFDISLDYLNIEEFKKILKKYFSSVKYYSETIKLDFEDP MAVLKHLKYTGVTGFQKTSISKIKTFKNNILTYEVAYFICKK >gi|296154178|gb|ADVK01000039.1| GENE 20 19362 - 19952 512 196 aa, chain - ## HITS:1 COG:no KEGG:FN0850 NR:ns ## KEGG: FN0850 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 196 1 196 196 313 98.0 3e-84 MSKIYFFNGWGMDENLLIPIKNSTDYDIEVINFPYDIDKDFIDKDDSFIGYSFGVYYLNK FLSENKDLKYKKAIGINGLPQTIGKFGINEKMFNITLDTLNEENLEKFLINMDIDDSFCK SNKSFDEIKNELQFFKNNYRIIDNHIDFYYIGKNDRIIPANRLEKYCQNHSLAYKLLEYG HYPFSYFKDFKDILDI >gi|296154178|gb|ADVK01000039.1| GENE 21 19953 - 21086 1148 377 aa, chain - ## HITS:1 COG:FN0849 KEGG:ns NR:ns ## COG: FN0849 COG0156 # Protein_GI_number: 19704184 # Func_class: H Coenzyme transport and metabolism # Function: 7-keto-8-aminopelargonate synthetase and related enzymes # Organism: Fusobacterium nucleatum # 1 377 5 381 381 665 99.0 0 MQKEKIIQELQELKNDNRFRTVKTNDKSLYNFSSNDYLSLAHDKDLLQKFYQNYNFDNYK LSSSSSRLIDGSYLTVMRLEKKVEEIYGKPCLVFNSGFDANSSVIETFFDKKSLIITDRL NHASIYEGCINSRAKILRYKHLDVSALEKLLKKYSENYNDILVVTETVYSMDGDCAEIKQ ICDLKEKYNFNLMVDEAHSYGAYGYGIAYNEKLVNKIDFLVIPLGKAGASVGAYVICDEI 
YKNYLINKSKKFIYSTALPPVNNLWNLFVLENLVNFQDRIEKFQELVTFSLNTLKKLNLK TKSTSHIISIIIGDNLNAVNLSNNLKELGYLAYAIKEPTVPKDTARLRISLTADMKKEDI EAFFKTLKAEMKKIGVI >gi|296154178|gb|ADVK01000039.1| GENE 22 21182 - 21796 466 204 aa, chain - ## HITS:1 COG:no KEGG:FN0848 NR:ns ## KEGG: FN0848 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 204 1 204 204 323 100.0 3e-87 MEKEKLIEEILEKEWSYFSKLNNIGGRADCQDNREDFIIMRKSQWETFNEETLISYLDDL NSKNNPLFQKYGQMMKYNSPQEYEKIKDILENPNKNKITLVEKIMSIYIEWEEEFFKKYP IFSSMGRPLYSTEDDNIETSIETYLRGELLSYSEKTLELYLKYIIEMKEKNINLAIKNMD NLASMQGFKNSDEVEEYYKNLQKN >gi|296154178|gb|ADVK01000039.1| GENE 23 21813 - 23612 2001 599 aa, chain - ## HITS:1 COG:FN0847 KEGG:ns NR:ns ## COG: FN0847 COG0457 # Protein_GI_number: 19704182 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 1 599 1 599 599 1111 100.0 0 MNLDGLNKQREKYQIEGNILKEIEILKEILVETEKEYGSESDEYIKALNELGGTLKYVGY YDEAENNLKKSLEFIKKKYGDNNLAYATSLLNLTEVYRFAQKFNLLEENYKKIVKIYQDN SADNSFSYAGLCNNFGLYYQNIGDMKSAYDLHLKSLDILKHYDSEEYLLEYAVTLSNLFN PSYQLGMKEKAVEYLNKAIDIFEKNVGIEHPLYSASLNNMAIYYYNERELNKAIEFFERA AEISKKTMGVDSDNYKNILSNIDFIKKEVVKSGDNIKVQDTKKDNIINSSDLKNIKGLEL SKRYFYDIVLPEFEKSLENILPLCAFGLVGEGSECYGYDDELSQDHDFGPSVCIWLRKDD YLKYKDRINKVLKNLPKTYLGFRELKESEWGYNRRGLLNIEDFYFKFIGSANPPQTINDW QKIPETALATVTNGEIFLDNLGEFTKIREQLLNYYPEVIRQNKIATRLMNISQHGQYNYV RCLRRNDLVSANQCLYLFVDEVIHLVFLLNKRYKIFYKWANRALLNLKILGNEIHKLLQD MVFTQNKIPYVKKICKVLADELRNQKLTDCESEFLGDLGVDIQKNIDDEFFKNYSPWLD >gi|296154178|gb|ADVK01000039.1| GENE 24 23709 - 24662 1219 317 aa, chain - ## HITS:1 COG:FN0846 KEGG:ns NR:ns ## COG: FN0846 COG2849 # Protein_GI_number: 19704181 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 14 317 1 304 304 539 99.0 1e-153 MKKYLLGAFLIIAMNLFGAKLSDVKGLEKLKNYNEIKDVQVEKIVNYDVTKSVSKKIFLP EDNKFNGVLVKNENNDIVEITFYKNGVSDGVSYTYYLNGDLKSVSTYRKGMIEGPQVLYR 
PNGKMESEQVFENNSLVSEKYYDKNGKITKEYHFNKLRNGILKKYYKDTEKMSSTSTVIV NREKVDGVEKMSFVLDGETKVFRKDGTLMAILQYKDGSLQDLTQKFYYPNGKIQYYVVVA GDEIKDFKVKDRIITYYENGKVNQDCNQQNDGSWVCKNYDKDGKFLDEEIRGAEPISNGD TKFWENILNPVLEVLAN >gi|296154178|gb|ADVK01000039.1| GENE 25 24728 - 25339 531 203 aa, chain - ## HITS:1 COG:no KEGG:FN0845 NR:ns ## KEGG: FN0845 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 136 201 1 66 69 98 90.0 2e-19 MEWILLYGSCLFFIFCIKIFDFDLKNFLDRYMDKILTFFVIIITFLSIFQKKFEIDVTYQ TIILLIILLILVYRTPLLSFFLENFDEISIGNTKLKIKSIQSTIQNLANDGILDKNMKGI EEIRKENKENEDETLLYGKFLLNFSKIDSLLNSIETVKFRNQIVHNGIEEKYEKEELHQI FIDFINIEEKIISELKNLKPFYF >gi|296154178|gb|ADVK01000039.1| GENE 26 25532 - 25732 268 66 aa, chain + ## HITS:1 COG:no KEGG:FN0843 NR:ns ## KEGG: FN0843 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 66 1 66 66 95 87.0 7e-19 MKKIKYFLNGLSAIIKYAVSDTNSYLTQTSSYFIKDDNDALSKDWKIIGNDLRGTIDKYD KEFKTR >gi|296154178|gb|ADVK01000039.1| GENE 27 25707 - 25820 173 37 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|254303859|ref|ZP_04971217.1| ## NR: gi|254303859|ref|ZP_04971217.1| hypothetical protein FNP_1519 [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] hypothetical protein FNP_1519 [Fusobacterium nucleatum subsp. 
polymorphum ATCC 10953] # 1 36 1 36 129 66 86.0 5e-10 MTKNSKQDSDNYKNNNIVRIGITTIHSGPLPDPETLY >gi|296154178|gb|ADVK01000039.1| GENE 28 25864 - 26097 284 77 aa, chain + ## HITS:1 COG:no KEGG:FN0842 NR:ns ## KEGG: FN0842 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 77 1 77 77 109 96.0 4e-23 MAKKQSEHRQYLEREQVIGETKLRLRGQLIGGCAIVVLIILGFILILNDKNVAGASAVII ALIGIIYSISYGKNKDK >gi|296154178|gb|ADVK01000039.1| GENE 29 26594 - 27580 1025 328 aa, chain - ## HITS:1 COG:FN0837 KEGG:ns NR:ns ## COG: FN0837 COG0582 # Protein_GI_number: 19704172 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Fusobacterium nucleatum # 1 328 1 328 328 533 98.0 1e-151 MEIKKIDEKDLVISQRKKRNRDSKKTIFEIYKSEKTVKDYMFHLKDFLHFVYDGENDFSI SEVIPLMQDIEKEDVEAYIVHLFEDRKLKKTSVNTILSALKSLYKELESNGLKNPVKYIK LFKVNRNIENVLKVSIDDIRKIIGLYKIDSEKKYRNITILYTLFYTGMRSKELLTLQFKH YLKREDEYFFKLIQTKSGKDVYKPIHKSLVKKLEEYKKYLISMYSLDLKDLDEKYIFSTS VLDNTPLSYRSLNAIIQDMGKLIGKDISPHNIRHAIATELSLSGADILEIRDFLGHSDTK VTEVYINARSILEKKVLEKLPEINLDEE >gi|296154178|gb|ADVK01000039.1| GENE 30 27592 - 28539 843 315 aa, chain - ## HITS:1 COG:no KEGG:Lebu_0718 NR:ns ## KEGG: Lebu_0718 # Name: not_defined # Def: hypothetical protein # Organism: L.buccalis # Pathway: not_defined # 1 312 129 439 439 276 51.0 6e-73 MIADAICYPTDGNKNFFWNVPNKPVKTLATGPAYLGDNENSFTYIWGQPVYLYPTQTTDS YNENRVGYYMDKIKELGDSSPRAIVYNFSDFINFVIDGHHKACASALLGESLRCLLIIPG VFTKYYNVKEDKNKIYLAFSSTDISNVDIPERYSSLVKFEIPAPRSKEIIIKDGIVNKRN WEKKYLDSVKKYLTQKEYGRIVDILINDKIEITDDLIEYCLIHFDIKSQTKMEKIIYKLK LLNIEKAQDIALKYAKNSLKYEINKNLREFIYKILVSIKNNNEVEQIFVDYYTYYSENKE DPVLEIINSYWEGLK >gi|296154178|gb|ADVK01000039.1| GENE 31 28752 - 29048 477 98 aa, chain - ## HITS:1 COG:no KEGG:FN0836 NR:ns ## KEGG: FN0836 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 98 1 98 98 169 95.0 3e-41 MLNHIVMWKIKEDVEDKEKVKLNIKNGLEGLFGKIEELREIKVERFMETTSTHDVALFVK VDNEDTLKKYATNPLHVEVIKNYIKPFVYDRVCIDFFE 
>gi|296154178|gb|ADVK01000039.1| GENE 32 29065 - 29736 921 223 aa, chain - ## HITS:1 COG:no KEGG:FN0835 NR:ns ## KEGG: FN0835 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 26 223 1 198 198 355 92.0 7e-97 MTTWNDVFSANLGKMMAIQIACGEFVVKNRNWNVDFDKGIIAFGNDEYPLQFLGSEANSS NTWLWAWENINGFDDKIISLARSIKEKGEKINLEPLTDAEIDITDELNGHNLSIVACGLA DKNYCYYRGPHSGGAIFVAFNGVDEKVFSSVNTRKFIDITMRSIQQFSLNHKLFIESFLD WNKNKYKWQENIIIADFGKDGELRIEFEKVKDNFRIKNINLNS >gi|296154178|gb|ADVK01000039.1| GENE 33 29865 - 31385 1828 506 aa, chain - ## HITS:1 COG:no KEGG:FN0834 NR:ns ## KEGG: FN0834 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 504 1 505 511 726 79.0 0 MKKIGFIILLTLSFLLLTNCNKNENKNPKIKFSDDTYKLFEEFTENKKDIIKKLKTLNKD EANKLYEKYVENNNDILGQIEEATTEFLDSIYYGSAEEQFTEKDWNDTNKILNKYDLELW DIGEGMVTIRELPHLYYDVFKDYVTDDYKEYLKIWAKDDEELYQADAGLSISFEELGDRI ARWENFLNKYPNSTLKPKVTALLNSYREDYLLGMENTPTIDGGYDNVPITIYEEAKKEYD RFMKKYPNSPTVELIKYFIENYKNDNIHELIKCKIIEKFEKDPFVDVLSENLGKMMAIER NYEKYILKDKEWEVDIDNGYIYSDKEKYPIQIIGSSILNKDEGSTWVWAWKNSDVFNEDL LSFAYNVHGLGMDLKLCTFINSEFKITDEINENILSATACGISGENLAFDNIYYSVNSMV FYYTIKDLPNEVFSSVNLSEFVDITSSCTDKYTLNEKLFVESFLKWNKTKYKWQGNSIIA NFEKDGKLEIEFEKIGDKYKIKKLLL >gi|296154178|gb|ADVK01000039.1| GENE 34 31386 - 33005 2012 539 aa, chain - ## HITS:1 COG:no KEGG:FN0833 NR:ns ## KEGG: FN0833 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 16 539 16 539 539 813 98.0 0 MKKILIIIFTIAIFLTGGIFGYKKIVADEREKKIIQMFNKDILDNFVENKKSVIERLKTS NPEEADKIYNYYLKISQLIIENINTEHLDFLNNIYNEDSEYYFTERDWKTANKFLNNYDL EIFDLAETEVKIIEVPNYYYNIFKNYVTDDYKEYLKITSKENEEPYYTDGSILVPYDKIA DRLLTWENFLKKYPNSDLAEIANEKCNIYRRIYILGSDNAPTREGGWENNELFYIPENNL KEFNRFIEKYPDSPTVELVKFYLENYKNKDVDTMLNEKIDKEFYLGGIENREKGNLFSKE SNDLLDEFKKNKEEVINKLKTLSKEEANEIYEEYSVDNDKILEKINEIDVEMLDNAFYKD EDIEKEKLEKQNKFLDSYGLEVIQIEDGFTLTEKKKFYYNLFKNFVTNDYREFLKLYSED IDYIEYSNFFDKYVEIIADRIVAWEKFLEKYPDSKLKGKAQNIYYTYRAGYIIRLTSSET 
KESLMNGKANEAVKEFNRFIRKYPNSPTSDIIKYYLENYKEEDINTLISKKINKNYGGE >gi|296154178|gb|ADVK01000039.1| GENE 35 33089 - 33553 632 154 aa, chain - ## HITS:1 COG:no KEGG:FN0832 NR:ns ## KEGG: FN0832 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 154 1 154 154 289 97.0 2e-77 MGLMDLVKKAFLGATDEENRKNKAKMRAIFNESVPNGDDYKLIYCHSEDTTNAVVVKVTK HNNFIVGYKEGEVVVIPVNPDLLDYGKAIIFNRKNESRTEASFGFCKVSNPETTLYFVPI TYEPALGGKGKYSVAVTQSSAEVSEFKKFFKKGL >gi|296154178|gb|ADVK01000039.1| GENE 36 33577 - 34125 514 182 aa, chain - ## HITS:1 COG:no KEGG:Exig_0313 NR:ns ## KEGG: Exig_0313 # Name: not_defined # Def: hypothetical protein # Organism: E.sibiricum # Pathway: not_defined # 1 182 4 186 188 135 41.0 7e-31 MKDRFKELIKKIKSDEFYNNRGLANEVPFYIFDYNPKYELEIRDFVKNKLLASLEDDDRL KAVEIDIFELLLESMRNDNILEMAFEIEEKKGTKFLYEKLKKSFNTEIIMKYISEKTKDK NFLILTGVGKIFPIVRTHTILNNLQNIFDHTKVLLFFPGEYTSTDLRLFGFEDNNYYRAF KI >gi|296154178|gb|ADVK01000039.1| GENE 37 34139 - 34723 601 194 aa, chain - ## HITS:1 COG:no KEGG:Msm_1749 NR:ns ## KEGG: Msm_1749 # Name: not_defined # Def: hypothetical protein # Organism: M.smithii # Pathway: not_defined # 1 183 1 184 197 92 33.0 1e-17 MEYRAITAENFYLIEMRNTCKFILENKVEEDLKKMLKVNNILETVSESNFSKKFNTINKR LKFLTDNLKKQIVNTDLTSARFINLYSILCNERFILEFLEEVVKEKYDNYDYSIKESDFL SYLATKSEQSEIINNWTAEGKRKMLVKIKNFLTEGGFLEKNKDSYKIIKPIVDLAVIDEI KENGNKKILKIMFY >gi|296154178|gb|ADVK01000039.1| GENE 38 34739 - 36754 2378 671 aa, chain - ## HITS:1 COG:STM4491 KEGG:ns NR:ns ## COG: STM4491 COG4930 # Protein_GI_number: 16767735 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Predicted ATP-dependent Lon-type protease # Organism: Salmonella typhimurium LT2 # 2 670 23 693 694 791 55.0 0 MDINEIGSKVFEGKIVRKDLVSKIKGGANVPVYVLEYLLGMYCNQTDEESIEEGMSKVKK ILAENYVRPDEAEKVKSKIKEIGVYNVIDKVTVILNEKKDRYEGHLSNLGVSNIEIHKSY IKDYEKLLSGGIWCILTLSYQYDEYNATDSPFKLNKLKPIQIASLDMNEVYEARKHFTKD EWIGFILRSSGMEFENFDKDAIWHLLARMMPLVENNYNLCELGPRGTGKSYIYKEISPNS 
ILLSGGQTTVANLFYNMSKRQVGLVGYWDVVAFDEIAGIKFKDKDGIQIMKDFMASGSFA RGKEEKNANASMVFIGNINQSVDVLVKTAHLLVDFPPEMNNDSAFFDRMHCYIPGWDIPK LSPKSFTKEYGLIVDYMAEIFRELRKTSYGDALDRYFSLGRDLNQRDTIAVRKTVSALIK LVYPDGIFIKEEVEEILIRALEYRRRIKEQLKKMAGMEFFATNFSYIDKESGEEKYVGLK EQGGSKLIPEGPLKAGALYTIAMSTNSVKGLYKIESQISAGKGKITVSDNKYRKTFENAF NYLKINSKRISGAINISEKEFYLSVADEKNVGNTETITLGGFIAMCSISLNRQLMPQTVI LGEMALSGSINAVSDLASTLQIAREAGAKKALIPILNAVDMSTLPPDILMDIQPIFYQDP IDATQKALGLM >gi|296154178|gb|ADVK01000039.1| GENE 39 36763 - 39276 2956 837 aa, chain - ## HITS:1 COG:no KEGG:TepRe1_0534 NR:ns ## KEGG: TepRe1_0534 # Name: not_defined # Def: hypothetical protein # Organism: Tepidanaerobacter_Re1 # Pathway: not_defined # 1 791 1 794 849 359 33.0 3e-97 MEIDKIKEMLEYRFSLTTELPQKRHIIFWYDSKKEFKDLIDELNLTDVKIIKLTKSVDKK GEAIYTNIFKTKYTLEVIDTESNYLIYSEYPRAIDSENYLLDIEKYSEFFEADKSAMIVE ELKLDRTNYRFGEIIREYSSFFANKERREKLIKLIENPESLDEEKFKLSILTTISGAKTV DILEILKNIILNRNKLEDIEKWMNLEFLFSEIKKKFDIEITSFEQFLKILMVTHFYFELG KKPHTNLENYFKGRKNELYIFTDSLLQNKQSSEIIRAEFYELAKDLNIKDRIDELELDYS IKGTAFEYFDKVIIKDIIEVFNSEIIDYDKYKKYIEIRLDNSLWREEYQHFYNVLLAVND FFRIKDSLIIEDREELREIFKDYTKNYFLIDKLYRDFYYSYDKIKNSELAPLFDTLKSKI DKFYEIDYLEKLLALWSSKVYEREKLSQQKDFYKNNIVKTDVRTVVIISDALRYEVGYEI SQKLRKEANIKEIKIEAMLTDLPSRTFLGMANLLPCKKERDIDLVSAKVLIDGIDSQGTE NREKILKTSCEESSAISFDNFYKMNRAKQEEFIKGKKVIYIYHDSIDAIGDKGKTENNTF NACKDAVENIVSLSKLLSSLGVVNIYITSDHGFLYEKKEVEEYNKLELKNTKYKSIGKRY AIYEKEVEEKGCVTLKLDSLYGVFPEKNQRIKASGSGLQFVHGGASPQEMIIPLINYKSG ANSKKISKVQVRIRESVAKITSNLTKFSIYQIEAVSIKDKFIERDVSVALYDGDVRVSDE KKLKLNSIEENTIHDFRLTLSGEHKKVTLKVIDIESGDILDSKEYDVSIGIASDFDF >gi|296154178|gb|ADVK01000039.1| GENE 40 39362 - 41377 2251 671 aa, chain - ## HITS:1 COG:Z5943m_1 KEGG:ns NR:ns ## COG: Z5943m_1 COG1479 # Protein_GI_number: 15804980 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Escherichia coli O157:H7 EDL933 # 1 560 1 561 592 323 36.0 8e-88 MKASEKKIKDLFSEAKTFFAIPVYQRDYNWQEKHCKQLFEDILNVGKDIDITSHFIGSIV 
YIHEGVYGIGEKEFYVIDGQQRMITITLLHIALYHRLKESKEEYADEIYELYLVNKFSKR DIKLKLLPPEENLNILNKILEENWEELEDYQDRNIVKNYKFFKEIISNYSNEEIEYLLAG LDKIIYVDIALEKDKDDPQKIFESLNSTGLDLSQGDLIRNYILMDLEREKQNLVYKNLWL PIENNCKISLGNEIKNYVSDFIRDYLTLKNGKIPSKPKVFEEFKEFYNKNNDEQLEDIKN FSEEYSHIIKPNTEKDKEIRKELENLKVLDQTVINTFLIGILRDYRENKIVKNEILEILK LLQSYIWRRFITEKPSNALNKIFQGMYLRISKDQKYYKSLEESLLNQDFPTDDELKEALK TKNVYKDKEKLRYVFKELENYNHNELIDFENEKITIEHIFPQKPNKSWKEKYSDYELEEM KTFKDTISNLTLTGSNANLGNKSFLEKRNDDIHGYKNSKLYLNKYLSKLNEWNLSAMEGR FEELFKNIVKVWKRPENSEDKDIEKVTFVLKGAASSGTGKLLAYEKFEILKGSIIVKGNK GNENVEKRNKRIIEELLENNLVEKDGNKYILKENYKVSSPSAAASLILGRNANGWKEWKT FDGKLLNEFRK >gi|296154178|gb|ADVK01000039.1| GENE 41 41428 - 43236 1911 602 aa, chain - ## HITS:1 COG:no KEGG:Mpet_1790 NR:ns ## KEGG: Mpet_1790 # Name: not_defined # Def: hypothetical protein # Organism: M.petrolearius # Pathway: not_defined # 1 570 1 567 603 415 45.0 1e-114 MKLKQLKLKNFRGYKEENYVEFENLTAFVGKNDVGKSTILEALEIFFNNKTVQCEREDLS VNHKDEDENIEISCVFSDVDIPIILDSNFETNLKDEYLLNKDGFLEIKKVFKCSIAKPKA NSYIVCCYPSEENCKDLLLLKSTELKRRAENLDIPKENYNASINASIRRAIFNNFSDLNL VETDLAVDKEDSKKIFNKLEEYFPMYALFQSDRASSDSDKEIVDPMQIAISQAIKGLEVE INKIKEEVKNKTLEIANKTLEKLKEMNSTLADSLIPEFKAEPKFDSLFKLSINSDDGIAI NKRGSGVRRLILLNFFRAEAERQLKENSKKNNIIYAFEEPETSQHPNHQIMLIESFLKLS QKENCQIILTTHTPALAGMLPLESLRLVKKEEGKTEIFSTTEETYKEITDMLGILPEPIP KTSKGILLVEGVDDILFFYHLNKLLKEAKEIKEMFSENGINVISTGGCDNLKYWVTKKLV QQFNLPWAIFLDSDKKNKNDISKNTKFVDKISSEGILAFCTRKREIENYLHSELLKNEFK NLTSIGDYDDIKKLTRKDIFKKEWEKMTFKYLRERETYIDDKDNTEHFELTEVVKKILNM IK >gi|296154178|gb|ADVK01000039.1| GENE 42 43300 - 43419 98 39 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSLGVSLNFINPDSVLLNKALKIINLSFSFIFLPSCTNA >gi|296154178|gb|ADVK01000039.1| GENE 43 43447 - 44901 1256 484 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328620|ref|ZP_06871137.1| ## NR: gi|296328620|ref|ZP_06871137.1| possible AbiZ [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] possible AbiZ [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 484 1 484 484 791 100.0 0 MENIKVKIWDVEPSELLLKIKEMLEKQKINVTRESEEEYDIVISSDNDYKNKSAIFIQGS LNPKCSNEILAYLSIIADKLNIIICIKEENLTNKNLGFKVKENYVGTYNPEMYRMPIFIE DKKTEIYRRLSIITIFKINISPTHSTENIDFECSLEELENILEINYNIWKDLYQDGDDIN YLEVYQNLKNKIFNFFTYFTNIEDITDEILYSQIYYNITNSNEYFENKRKYLINIEAPIN KDKNFIKIIEFYKYYQEQKFDNISHAIEYYILIYRRPEMAGNYLNNTNLYCYYPAFYYLK EQFNLKTSNTQIDDEINILKKEMSVIEETIPKRKNEILNYMEAISKGLDIWKNGFLDDVE NWKIKINKWQKESENKINALEKAYNEKLKLEAPEKLWNEKAKNYGKAYVIWLLITIVLGI GIVFFTAYTIKVFYFNNASNNNLENEVFKYFPKTFLFLGILSLALYILRVVIKIVLSNTF TIRI >gi|296154178|gb|ADVK01000039.1| GENE 44 44908 - 48555 4627 1215 aa, chain - ## HITS:1 COG:MA2372 KEGG:ns NR:ns ## COG: MA2372 COG1002 # Protein_GI_number: 20091204 # Func_class: V Defense mechanisms # Function: Type II restriction enzyme, methylase subunits # Organism: Methanosarcina acetivorans str.C2A # 1 1214 1 1160 1161 753 38.0 0 MNKSSLKIFAIEARKELMEKMRTRLEILGITKNGIEKAKVIGKEVEVKGTLYPKESYDSL IRKYKQVGYEELIEESAYTWFNRLTALAFMEANGYIEEKMIFNNGVKNEPAIIDNYYEFE FFKNLDNNLQKELHNLRDENTPNSIEKLYSILMEEKCEDLSNIMPFMFKKKGTYSDILFP TGLLMENSLLVRLRKEIGEEAPIELIGWLYQYYNSEKREVVYNGSMKKSKINKEYIAPAT QLFTPDWIVKYMVENSLGKLALESTGINENLKNNWKYYIDSEKEENSEKIKIEDIKILDP AMGSGHILVYAFDLLFEMYENLGWSTKETVLSILRNNLYGLEIDERAGQLASFALMMKAR EKFSRLFSVLKREEDFKLNTLILEESNSLSERIRNKIKANNLNNLNKIIQEFEDAKEYGS ILKLESIDKEILEKEFNILKESFNNNEQETLIFNEDEMIIDINEELELIVSLIKQHIIMI NKFDITVTNPPYMGNSRMNGILKEYIDKNYSDVKSDLFAVFFIKCCEITKRKGYLGFMSP FVWMFIKSYEELRKIFINGKTISSLVQLEYSGFDDATVPICTFVLQNSYTDKKGEYIKLS DFKGAKNQPIKTLEAVKNPNCNWRFQAKQKDFEKIPGSPIAYWVSDKIREIFEKNQKLGE VGEAKQGLATADNNRFLRLWNEVNYNKIGYSMSNSQEALESKKKWFPYNKGGEFRKWYGN QEYLVNWENDGYEIKNFYDEKGKLRSRPQNTEYYFKESISWTDITSSGNSFRYYPKGFLY DVTGMSYFIDESKQKNLLGILNNKLIYIITKILNPTLHLQIGDLIKVPYFTIKNEKFNIL VQQNIDISKEEWDSREISWDFEKLSLIDGKDLKTTYENYCNHWRDNFVQLHKNEEELNRF FIEIYDLQDEMDEKVAFEDITILKKEATIIQIDNSVPKNFSSESEKYLYDRGVSLEFNKD ELIKQFLSYAVGCIMGRYSTNKSGLIIANSDDILELSENKFIVKGTDGEIRQEVERKFLP DEFGIIPITDEKDFSNDIVEKIKEFIKFVYGEDNLKDNLNFIAEALGNKDNKPAEEIIRA 
YFIKDFYADHLQRYQKRPIYWLMNSGKKNAFSCLFYMHRYEPLTVARVRADYLIPYQEML ENKRKFIERQLSDDDISAKEKKNIEKQLKELDTLLKELREYANEVKHIAEQKIPLDLDDG VNVNYEKLGTILKKR >gi|296154178|gb|ADVK01000039.1| GENE 45 48580 - 49911 1479 443 aa, chain - ## HITS:1 COG:MA2370 KEGG:ns NR:ns ## COG: MA2370 COG2865 # Protein_GI_number: 20091202 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Methanosarcina acetivorans str.C2A # 6 437 11 442 458 109 27.0 1e-23 MKKYIESEKLELKEKYTDIICKEIVSFLNGNGGTILIGVRDDGTVVGVDKIDETLRKISD IITTQIEPNPQDEISSELKFEEGKTIIILNINKGRKHIYCQKKYGFSSHGCTIRIGTTCK EMTIEQIKIRYEKKFIDTEYMLKKRASLADLSFRELKIYYSEKAYHLDEKSFETNLNLRN EDGEYNLLAELLSDRNNIPFIFVKFQGRNKASISERNDYGYGCLLTTYQKIKNRLEAENI CISDTTTRPRKDIYLFDYDCVNEAILNAFVHNDWTITEPQISMFHDRLEILSHGGLPSGM TRKQFFDGISKPRNVTLMRVFLNMGLTEHTGHGVPTIINRYGEEVFEIGNNYICCTIPFD EKVINQKNEKNVGLNVGLNVGLNKTEKKVIEFLIENPSFTSDNLAEKIGVTKRTIERTLK KLQEKKMIERIGSKRDGNWIVIK >gi|296154178|gb|ADVK01000039.1| GENE 46 49998 - 51092 1027 364 aa, chain - ## HITS:1 COG:no KEGG:BF1979 NR:ns ## KEGG: BF1979 # Name: not_defined # Def: putative DNA repair ATPase # Organism: B.fragilis # Pathway: not_defined # 2 362 579 939 945 214 41.0 4e-54 MKSLNENVKSKILEVKNYLQNDNFIFKVCEKTKLLEEEIVNSQNNIKKYEEKMGNMEKQL NLVELLEQEKQKKVLIEEKEKEIILLKNDLTNKKILEKYQELLALYENKILEHLKFKNIS EDIELVVKLKFNTNSFKEKFSEKISKKLVLEKQFGENIFIGNEFKFTKDCHLENIKNIYD KLINNKEEIKINQSYSLEEVLEGLFKDYFSIEYDLVQNGDSLLEMSPGKRGIVLFQLFLQ LSNSDTPILIDQPEDNLDNRTIYQELNTFIKNRKLKRQIILVSHNANLVVSTDSENVIVA NQEIKKGNNYEYNEKYKFEYINGALEETFNNGKKFHEKGIREHVCEILEGGEKAFKIREK KYGF >gi|296154178|gb|ADVK01000039.1| GENE 47 51299 - 52873 1475 524 aa, chain - ## HITS:1 COG:no KEGG:Cag_1611 NR:ns ## KEGG: Cag_1611 # Name: not_defined # Def: putative DNA repair ATPase # Organism: C.chlorochromatii # Pathway: not_defined # 2 509 4 505 966 238 33.0 4e-61 MYTEYKEGSKWIKCDFHIHTPCSVLNNQFGDNFEEYIKKMLRKALESDTRIIAITDYYSI 
DGYKKLKEEYLERESKLKELGFLDEEILKIRQILFLANIEFRLDILVNGAKVNFHIILSD KIKISDIEENFLKRIEFPFQGTEKRTLTRSNIESLGKKLKLEQNNLRGNEYQIGIGQLAV DSSQVLNLLENSDIFKNKYLVVIPSDEDLSNIRWDGQDHNIRKILIQQAHCLFSSNKSTI SWGLGEKSENKEEYVKEFFSLKPCIHGSDAHCYEKLFRPDKNRYCWVKSIPTFEGIRQML FEPKERVYIGETFPSEKQPYNIIKRVKFVDSKNEFQSEWIYLNEGLNSIIGGKSSGKSLL LYYIAKTIISKKIDNLKEDIGKNINFLGYNFENQIGREFNFVVEWADGVEINLKNKEIKR KITYIPQMYINYIAENKNNKNELNNILLGILNENKEFKDNIENINKKINQKSIEINEEIS IFFRNRTKLIELENEKIDIGDLEGIEKNIDRLEKESQEIGYISI >gi|296154178|gb|ADVK01000039.1| GENE 48 52887 - 53054 158 55 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328625|ref|ZP_06871142.1| ## NR: gi|296328625|ref|ZP_06871142.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 55 1 55 55 69 100.0 9e-11 MSKKKEFIAIISEGEKVLVLSAFPFFLIEYFGEEFFIKYQKSKTLKITKTFINIF >gi|296154178|gb|ADVK01000039.1| GENE 49 53047 - 54159 1424 370 aa, chain - ## HITS:1 COG:no KEGG:Pnuc_1118 NR:ns ## KEGG: Pnuc_1118 # Name: not_defined # Def: SMC domain-containing protein # Organism: Polynucleobacter # Pathway: not_defined # 1 368 1 366 368 280 44.0 1e-73 MLIEFKVEGFKNFEKELVFDLSKTRNYNFNENAIKDGIVKTGLIYGINGSGKSNLGLAIF DIILHLTDKEKTINLYDYYLNLTNSNTMAKFYYKFKFGNDILEYEYQKDKPQNLVSEVVK INNKLIAEYDYLTNKFELNLEGTESLNKNLNGNNISFIKYINNNINPEPSTKQFKIKEII STFLKFVDNMLLFSSLDGNFYQGFKKGGGSISEEIINRGKLKDFENFLRVTGIDYTLIEK EVGKEKRIYCKFKSGEVDFFEIASRGTKSLTLFYAWLIRLNDVSFVFIDEFDAFYHVNLA KIVVEELLKLNVQAILTTHDTTIMTNDLLRPDCYFVLSEGKIKSLPDLTEKELRQAHNLE KMYRAGAFNE >gi|296154178|gb|ADVK01000039.1| GENE 50 54304 - 58056 4932 1250 aa, chain - ## HITS:1 COG:no KEGG:TepRe1_0530 NR:ns ## KEGG: TepRe1_0530 # Name: not_defined # Def: hypothetical protein # Organism: Tepidanaerobacter_Re1 # Pathway: not_defined # 4 1102 1 1092 1188 557 34.0 1e-156 MENLTIKNILQKDIERKINGVVKADSNEKDTVITELNEYVVTEEIRERLTKFFDKYVESI NFPTEDMGVWISGFFGSGKSHFLKMIGHILENNTYDGKKVVDFFKEKIDDAILMGNIEKA 
AEIPTDVILFNIDNVSDQDTYQNKDSIAVAFLKKFNEYLGFTRDDIEIAEFERKLWEDGK LEEFKKVFEEESGKTWKDANRNLDFHSDDFLDVIEKLKIMSRESAERWLERDVVRSISAE SFRDILENYLKMKGPKHRIVFLVDEIGQYIGDNSKLMLNLQTLVETLGVKFKGRVWVGVT SQQDLSSILNNSEHRKNDFSKIQDRFKTMLALSSGNIDEVIKKRLLIKKKIEGEDLEKIF DKKRVEIENLIHFEKTMTLPLYDDNKDFSETYPFVAYQFNLLQKVFEKVRNMGHSGQHMS RGERSLLSSFQEAGIKVKDKNIGILVPFNYFYESIEQFLEDNVRRPFIHARNEKGIDDFG LEVLKLLFLLKGINGIEPTLNNLTSFMVNSMDCDRIELEKKIKKALEKLEKEVLIQKDGE NYYFLTNEEQDINREIEREDIDLKKIDEKIDSYIFKEIFTKNSILMEETGNKYNFTRTID ETVYSKSGEDLAITIFTERADDYDNVAIVGTRAESDLILRLAKDDETYRNEIKLFLKVES YIRNKQKDNERETIIRILEIKQRENKIRDRRIKNELERLIVEAEVFVYGQKQDIKTKDAS KKIEESLKALANHRFHKAKLVKKPYDEAEIRNILSYVYDTEKNGVLFAIKKDVESNINSE AIKEVLERITLLEKRGDTPITLKNISEYYLRTPYGWGQLTINGLVGELWKYKLIDLQESK VFVTDENIATNLLIKLQNKNLEKIVISLREEIDPELIRKVNNLLKEIKTIKEDTGEVTLD SPKEDLLEILKRKIGIAKSYKIECEHNSYPGKKELNDWIDLLDEIILSKSNAEKTLKNFL EMENELSKEYDKVDRVFDFFTSSKKERYDKAIEKINKIEEYKDYIGSLKETDAYKTIEEI KLDKNIYERIREFDDLISELDKAKDELIEIEKTSLKEKVEKYKKEFSEKLKDNPEIIKKL KEKLNEFLEKEVNNKDNSNDMAIFMKSKKLENIVNNFEEEYKNSAKKEIEKLENYLNEVA EDKTDIDELRKSIKSTYSNYKDEIAKSDIKNVSVTIAKAIKDKEDFNAEINGKAKKKERI KLRKISINSKSNIESEEQVNEYISAIEKDIEKLKNEMLEAIKNNKIVDIG >gi|296154178|gb|ADVK01000039.1| GENE 51 58211 - 58657 545 148 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328628|ref|ZP_06871145.1| ## NR: gi|296328628|ref|ZP_06871145.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 148 1 148 148 228 100.0 2e-58 MFFNQKNQVCWETCIEDDTVENDVDLEASIKDDTEKNMSDFIKRPVEDNKNNNKNYITSF WSPILIDDLSLALLYSQQFPVSSSVPKTTYNTAYIVKPKKKIAEKKIDNDNNINTVESLL SKLEKIKKERSKYLYPLKPYKRYTLYKK >gi|296154178|gb|ADVK01000039.1| GENE 52 58886 - 59179 445 97 aa, chain + ## HITS:1 COG:no KEGG:FN0829 NR:ns ## KEGG: FN0829 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 19 97 19 97 97 80 98.0 2e-14 MKKFIIFLLLILSICIFGQEKKDYEPYLIRNGEKLLSYNDDAEEDEEYDMDEYDSEEEDP YYWETYIEDDTVENDMDLYEDPIEDDTEKNMSDFRKK >gi|296154178|gb|ADVK01000039.1| GENE 53 59284 - 60510 346 408 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163788031|ref|ZP_02182477.1| 50S ribosomal protein L9 [Flavobacteriales bacterium ALC-1] # 7 408 9 413 413 137 29 2e-31 MSFFDILKGSLATLKANKLRTLLTMLGIIIGISSVIAMWAIGNGGRDSILGDLKKVGYGK FTVTIDYKNENFKYSNYFTMQTVDMLKASHKFKAVAANIEDRFRLIKDKKPYFSFGTVST EDFEKISPVTMMSGRNFLPFEYNSNERVITIDNISAKKMFGDIKSALGQSVEISRDRKKA GHSYKIIGVFKSPYESFGKLFGEGENFPVLFRMPYKAYAVSFNQDPDVFETLIVEAKNGN EISEAMLEAKNILEFNKNAKNLYVTNAVSNDIESFDKILSTLSLFVTLAASISLLVGGIG VMNIMLVTVVERTKEIGIRKALGAKNRDILKQFLFESIILTVFGGLVGIFIGILFGLLTG VVVGIKPIFSMVSIIVSLSISVVVGVIFGVSPARRAAKLNPIDALRTE >gi|296154178|gb|ADVK01000039.1| GENE 54 60507 - 61169 342 220 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 2 220 1 218 245 136 35 6e-31 MIITVDKINKTYKNGSLELQVLKNISFKVDKGEFLAIMGSSGSGKSTMMNILGCLDNQYE GRYILDGIDISKSTENELSEIRNKKIGFIFQSFNLLPRLTALENVELPLVYSSIPKEERH KRANELLEMVGLKDRTHHRPNELSGGQRQRVAIARALVNNPSIILADEPTGNLDSKSEEE IIEILQKLNKMGKTIVIVTHEPSIGEIAERKIIFKDGEII >gi|296154178|gb|ADVK01000039.1| GENE 55 61198 - 62340 1289 380 aa, chain - ## HITS:1 COG:FN0826 KEGG:ns NR:ns ## COG: FN0826 COG0845 # Protein_GI_number: 19704161 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Fusobacterium nucleatum # 43 380 1 338 338 554 99.0 1e-157 
MKNIFKGKLKFIILLILIVLGLIYYFTHRNKKEKIYINDYSYMEVKKTDEIGTLNLNGYI KANNPIGIFVDKKLKVKEVFIKNGDFVKKGQVLMTFDDDETNKLNRNIEKERINLQKIQR DLKTTRELQKLGGASKNDVKNLEDNARISQLSIDEYTEVLNKTATEVRSPVDGVVSNLKA QENYLVDTDSSLLEIIDSSDLRIIVEIPEYNSQAVKIGQSVKVRQDISDDDKVYDGEITK ISRLSTTSSLTSENVLEADVKTKEVIPNLVPGFKIKAVLQLKADEKNIIIPKIALQSENG KYFVFTIDIKNTIKRKEVTVKNIVGDNIIVTSGLNVGEILIITPDNRLSDGLILTEGGNP NSSGEETLSIPADEAEVVEN >gi|296154178|gb|ADVK01000039.1| GENE 56 62353 - 63666 1479 437 aa, chain - ## HITS:1 COG:no KEGG:FN0825 NR:ns ## KEGG: FN0825 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 28 437 1 410 410 613 99.0 1e-174 MYERKNMKKILLFFLILTNLSLLAEETLSIDEALDRVGNDRGSYEFKKFQNSQESTNIKI KNNKLGDFNGVTLSSGYNISENNFDNRPRKYDRTFQNKATYGPFFVNYNYIQSDRSYVSF GIEKNLKDVFYSKYNSNLKINNYQQELNKISYDKNIQTKKINLVGLYQDILNTRNELEYR KKAYEHYRVDLDKFKKSYELGASPKINLESAELEAEDSKIQIDILETKLKSLYDIGKTDY NIDFENYTLLDFIENNESIDSILNNYMRDEVEELRLNLSMAEERKSYSNYDRYMPDLYLG YERVDRNLRGDRYYRDQDLFTIKFSKKLFSTDSEYKLNELEVENLKNDLNEKIRVVNAEK IKLKSEYNELLKLASIGDKKSNIAYKKYQIKEKEYELNKSSYLDVIDEYNKYLSQEIETK KAKNALNAFVYKIKIKR >gi|296154178|gb|ADVK01000039.1| GENE 57 63670 - 64521 802 283 aa, chain - ## HITS:1 COG:no KEGG:FN0824 NR:ns ## KEGG: FN0824 # Name: not_defined # Def: DeoR family transcriptional regulator # Organism: F.nucleatum # Pathway: not_defined # 1 283 1 283 283 461 99.0 1e-128 MSKKIKVTLPQNIYEIIKNDIDDFNMTSNHFMNYIFLNLNGKYKNFKGNPTIAGQSKEKS SIQFNLNKASNLIYYDVLRENNAQNESEFMRSLLIRYATNPKNKRELFIFKESVERLNLA IKDKKNVYITFNDNRKVKVSPYHIGSSDLEIANYIFCYDYSEEKYKNYKLNYLKQVYTTS EIGKWEDKEYINDVIKNFDPFLSKGKVIKIRLSEKGKKLFKAIKINRPKLISENRDIFEF EASEEQIKRYFTYFLDDATVVEPIELKEWFIEKYENALKNLKK >gi|296154178|gb|ADVK01000039.1| GENE 58 64540 - 66342 528 600 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|149914878|ref|ZP_01903407.1| 30S ribosomal protein S2 [Roseobacter sp. 
AzwK-3b] # 183 595 24 423 425 207 34 2e-52 MINGNTSGLKEYILENLDKLYSTKIEKGKIINQEIVDYISEISNKINREINVAIDRNGNI IDISIGDSSTVNLPVIPVYDKKLSGVRIIHTHPGGNPHLSSVDISALIKLKLDCIVSIGV SEEGVTGYEVAICSIVNDELTYDRTLLENLDDFDYLEEIKEVEENLRKRNITEDDKEYAL LIGIDKEEYLDELEELASACDVKVVGRFFQKRSKPDPLFLIGSGKIQELALTRQVRKVNL LIFDEELSGLQLKMIEEVTGCKVIDRTTLILEIFARRARTREAKLQVELAQLKYRSNRLI GFGVTMSRLGGGVGTKGPGEKKLEIDRRVIKKTIAYLNNELENIKKIRNTQRSKREDSGM PRVSLVGYTNVGKSTLRNVLVDMYQNDKTLKKEEVLSQDMLFATLDTTTRTIELKDKRIV SLTDTVGFIQKLPHDLVESFKSTLEEVIFSDLIIHVADISSKNVIEQINAVEDVLEELNC LDKTKILLLNKIDNVTKDNSFPLMEKKIEEIKAKYSNYQILIISAKNRFNIDELMELIKK NLIVKTYNCKLLIPYVNTEIAARVHRNTIVKSESFVDEGIILEVVMNEKEYNKFKDFIFN >gi|296154178|gb|ADVK01000039.1| GENE 59 66363 - 66881 615 172 aa, chain - ## HITS:1 COG:FN0822 KEGG:ns NR:ns ## COG: FN0822 COG0703 # Protein_GI_number: 19704157 # Func_class: E Amino acid transport and metabolism # Function: Shikimate kinase # Organism: Fusobacterium nucleatum # 1 172 1 172 172 297 100.0 5e-81 MKDNIALIGFMGSGKTTVGKLLAKTMDMKFVDIDKVIEAHEKKSINDIFHEKGQIYFRDL EREIILQESLKNDCVIATGGGSILDNENIKRLKETSFIVFLNATIECLYLRLKDNTTRPI LNDVEDKRKLIEELLEKRKFLYQISADYIIDINEHTNIYETVDKIKEIYIIS >gi|296154178|gb|ADVK01000039.1| GENE 60 66984 - 67871 1105 295 aa, chain - ## HITS:1 COG:no KEGG:FN0821 NR:ns ## KEGG: FN0821 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 8 295 1 288 288 444 99.0 1e-123 MKKVLLVLTALLLISCVNLNEPKSQKVTNITKNSGTKNNVVNTQKDNKKKNMSAVTNNAK VIKTRNLLKEAEAIPEDTYANKVKKYKAYESLTAYNPRYKDKLSSRINDLSNKIEKTYNF TISGADLVFQDILDNVLYDDIDNKIFTFSIDNPDVTLQIEMSSINYNKPVVNVKTIPKEY SEKYTNKDGNEILNVVKYYENETTETAGLTFVVEYKLVSNLTGEILVSDRKSIEKNYNES WKTYYISSFRIDKKKQIPSDEKEKHVPVKEEIYQAAFQEMFDTINKDINNLASIK >gi|296154178|gb|ADVK01000039.1| GENE 61 67970 - 69349 454 459 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163788782|ref|ZP_02183227.1| 30S ribosomal protein S1 [Flavobacteriales bacterium ALC-1] # 5 454 4 444 458 179 28 6e-44 MEKIYDLLVIGWGKAGKTLSAKLGAKGKKVAIIEENPKMYGGTCINVGCLPTKSLVHSAK 
ILAEVKKYGIDGDYSFKNNFFKEAMKKKEEMTIKLRNKNFGLLDTNENVDIYNGRASFVS NNEVKITSSDNKEIVLKADKIVINTGSVSRTLNTEGIDNKNVMTSEGILELKELPKKLLI IGAGYIGLEFASYFSNFGSEVSVFQFDDAFLAREDEDETKIIKEILENKGVKFFFNTSVK KFEDLGDSVKAICMKDGQEFSEEFNKVLVAVGRKPNTDNLGLENTSIQLGKFGEILVDDY LKTNAPNIWAVGDVKGGAQFTYVSLDDFRIVFPQILGENNRRKLSDRVLIPTSTFIDPPY SRVGINEKEAQRLGINYTKKFALTNTIPKAHVINEIDGFTKILINENDEIIGASICHYES HEMINLLALAINQKIKSKVLKDFIYTHPIFIESLNDILG >gi|296154178|gb|ADVK01000039.1| GENE 62 69464 - 71461 2612 665 aa, chain - ## HITS:1 COG:FN0819 KEGG:ns NR:ns ## COG: FN0819 COG0457 # Protein_GI_number: 19704154 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 1 665 1 665 665 1137 98.0 0 MKNNLIELLNILHKEAKHQEIIDKIEALPNEEKTPEIIGILARAYNNIDNYEKAVELLKS TEEYGKDTNVWNYRIGYSYYYLDNYLEAEKYFLKAVELNPTDSDSHLFLCWIYQELTDKE KDNSEKIIEYLNKSIEYVNIYSKLEPEESIKDELIFAEERLGWAYDRLNNFIEGEKHLRS AIELGDNDKWVYSQLGYTLRHQDRYEEALENYMKSVELGRNDTWIYSEIAWTYFSLEKFS EALEYINKAKELSPVEIDLSLVSRTSSILIALGKHTEAIKLLEDIVNRDEYKNDIGILSD LAFAYDDLEDYKNGLIYLKRANELGRDDIWINTEFAYAYYYLGEYEKSYDYLIIVKNLGR DDLTLKLMFANTLSKMEKYEEAIEYYLELLENDKYKNDAILNCQIGWNYGELEKPKEALK YLFKAEKLGKDDRMINIDIGINLAKTGEIQDGINRLKRALTMEEGITLNDKIFLNSEIAY WYGELRDVENALNYLYITKDLGRDDAWINSQIGWNLLEEDLKEALKYLNKAKDLGKDNIW INRQFGFAYSKLGEYEKAISSFKKARELGANDSWLLYQLGLALKEYGNIEEAINIFKEEI EITDYQGFGDLQLAWCYALIDEKEKAKEYFKNVDMYLSSSLEKDEDLKKDYNTVNELINS NIYFN >gi|296154178|gb|ADVK01000039.1| GENE 63 71727 - 72002 494 91 aa, chain + ## HITS:1 COG:FN0818 KEGG:ns NR:ns ## COG: FN0818 COG0776 # Protein_GI_number: 19704153 # Func_class: L Replication, recombination and repair # Function: Bacterial nucleoid DNA-binding protein # Organism: Fusobacterium nucleatum # 1 91 1 91 91 148 100.0 2e-36 MTKKEFAKLLFEKGVFTTRTEAEKKVDIIFDSIEKILLDGENLSIINWGKLEIVERAPRL GRNPKTGEEVKIGKRKSVKFRPGKAFLEKLN >gi|296154178|gb|ADVK01000039.1| GENE 64 72042 - 72821 187 259 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress 
protein Ctc) [Campylobacter concisus 13826] # 1 213 4 218 223 76 27 6e-13 MLEIKDLNFSYNKHKKSIFENLSVNFAKGFNVILGPNGAGKSTLLKAIFGLLKYQGEICY DGINLSKINFNKKTELISYLPQMDLNISPLTVLEMVLLGRLPELNQKISDEDIKAVTEIL KVLNIEDLITRSFSELSGGQKKMVFVAQTLVRNPKLILLDEPTNSLDLQKQLELCQFLQN FIELKKVDIVTVLHDINLAIRYADYIVILSNDGSLYDSGEAKKVISEKMLREVYGVSGDV IFDDDKKPVVSVKKSIRDN >gi|296154178|gb|ADVK01000039.1| GENE 65 72821 - 73879 1258 352 aa, chain - ## HITS:1 COG:MA2149 KEGG:ns NR:ns ## COG: MA2149 COG0609 # Protein_GI_number: 20090992 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Methanosarcina acetivorans str.C2A # 13 350 14 352 355 250 46.0 3e-66 MEVKKNIDILNNKIIYKNLVKKKVIVINIIIILLLLLFLLNISIGSTSISIKEILKAIFI NEGEGNNILIIRKIRLPMSLMAIVVGFSLGIGGCEIQTILKNPIASPYTLGITSAASFGA ALALVLNNSILNLPDTLAVTGNAFFFTFLISTMIYLFSSQRGIGKTAIILFGIALNFLFT SFTMILQYVADEDKLQNLIFWNFGSLLKTTWTKFLIVLIVLIICIIFLYKNFWKLTAMTL GDAKAESIGVNPHKLRKQIILIVSLLSAVAVSFVGTIGFIGLIAPHIARLIVGEDQRFFM PLSALLGGFILSLSFFISKVVIPGVILPIGLVTSVIGIPFFISMIFGKRRSM >gi|296154178|gb|ADVK01000039.1| GENE 66 73891 - 75411 2046 506 aa, chain - ## HITS:1 COG:MA2148 KEGG:ns NR:ns ## COG: MA2148 COG0614 # Protein_GI_number: 20090991 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Methanosarcina acetivorans str.C2A # 151 493 42 386 389 141 28.0 2e-33 MKNLKKLFFILNFILVFIFSQTLYGENKNLVTGTKTGTAVGYHGKITVLTELKDGKFINI SVKNHSETKDIGDIAILKIPNEIVKKQSLDVDSVAGATVTSKAIVEAVANSLEKMGVDPV KYTYKSDVNKNNNLNAKLDLKKLPKKKAIKETVIITDAKGRKVEIGLPISTYAISTMDVI DYIIPLKGKEAFNMLVGSGQDGGHGLNKYAKLYTPVVGNYMEHTAQISEHASPFDLEMLL AVQPDVLIVNSAMAAHKYALEIEDQLNEAGIKIILIDVPGKDSEKSVQQAMKILGDVFQE KEKANEVINFIDKQYLLMNSKNLKTRKDKPTVYYEKSGYSEVFGPTATGKSGWGIIINMA GGKNIADELLADKPVSKGGGNTIDPEFVLKSNPDFIILSGVNDGWLDSSKEKKKCKFDIV NRNGWKDLKAIKNKKLYEFAHSTSRSIYGFYPALKMATIFYPEEFKEVNPEAILNEFFDK FMILDSSITTWMYNLEDCDKTKINKK >gi|296154178|gb|ADVK01000039.1| 
GENE 67 75438 - 75866 699 142 aa, chain - ## HITS:1 COG:AF0241 KEGG:ns NR:ns ## COG: AF0241 COG1720 # Protein_GI_number: 11497857 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Archaeoglobus fulgidus # 1 129 1 129 139 145 55.0 3e-35 MILKKIGVVHSVYENKDNVPRQGKYSDEKSVIEIFPEYVDALDGVEFLKNIIVLYWGDRA DRSVLKSVPPFATLEKGVFSTRSPNRPNPIAICVCKILSIEENKITVMGLDALNNSPVLD IKVFIPRVDTDEDYKNASETTK >gi|296154178|gb|ADVK01000039.1| GENE 68 76102 - 77055 930 317 aa, chain - ## HITS:1 COG:no KEGG:FN0811 NR:ns ## KEGG: FN0811 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 317 1 317 317 543 99.0 1e-153 MSVSFYIKNKKKFFGYQKVMKVREVIDLFKEYKLSFYNIDFHVNDPDGEKFYNTSIESWQ ENHNSILFGVEGKSARGFEFSYNSRKNSYVIREYTPASENDWIVALEFMKVLAEKLNSKI VSEQGDTFTFKTINTFDYKSDIETGIKVISDILNKENEKAFNVDNIYGIKRVVSFNKEII ERIVNSSDEIKEFSNFCEDIQYINAYSAKQSFVEDRATKEKWGYYVLTENLRTVLPYKPS VEFFSMDYIKNEEVAFWKIFFCAYKVDENGEEGIDKIGESIYDDFIKKLPTDKYKFIDAS YIVVEPLNRDEILEILK >gi|296154178|gb|ADVK01000039.1| GENE 69 77070 - 78092 1495 340 aa, chain - ## HITS:1 COG:FN0810 KEGG:ns NR:ns ## COG: FN0810 COG2008 # Protein_GI_number: 19704145 # Func_class: E Amino acid transport and metabolism # Function: Threonine aldolase # Organism: Fusobacterium nucleatum # 1 340 1 340 340 654 98.0 0 MISFKNDYSEGACQEVLEALIKTNYEQTVGYGEDEYCEEAKNLIKENINYPNADIYFLVG GTQTNTTVISHSLRPYEAVIACKTGHISIHETGAIEATGHKIIEVDGIDGKLTPDLILNE LRKHEDHHMVKPKMVYISNTTEIGTVYTKKELENISKVCKDNNLYLYLDGARLASALTSE KCDINLEDYPRYCDVFYIGGTKCGLLFGEAVVIINDEIKKEFNFSVKQKGGLFAKGRLLG IQFATLFKDDLYYRIGVHSNKMALKIKNAFIEKGIKLATDSYTNQVFVDLSQEQIKKLEK DVIFSVEFFGIGESQSSRFVTSWATKEEDVDKLVELIKSL >gi|296154178|gb|ADVK01000039.1| GENE 70 78135 - 81056 3243 973 aa, chain - ## HITS:1 COG:PA2732 KEGG:ns NR:ns ## COG: PA2732 COG0610 # Protein_GI_number: 15597928 # Func_class: V Defense mechanisms # Function: Type I site-specific restriction-modification system, R (restriction) subunit and related helicases # Organism: Pseudomonas 
aeruginosa # 38 966 53 1006 1146 770 44.0 0 MGIRDTKMETELEKNIIDYLEEKQGYRYIKANEMKLAFNRKYALDEVRLLEFIEKSQPRV FKELSLDIESKKESFFKQLDLCIRKDGIISVLKNGINHYPATSTISLFYHALDGNRESSY KEFEQNIFFVTNQLTFSERNKGLELDIAIFINGLPIITMELKSRASSTGWTYKDAEEQYK NDRKPIEPLFSFKRCMAHFAVDENFITFTTKLDDKNTVFMPFNKGTESGGSGNPINPSGT MTDYLWKDFLDKKVLTSLIKDFSYVDNNKVLVFPRYHQYRVVTKLVEDVIKNGVGKNYLI QHSAGSGKSNSITWLAYRLVEVAKKVQNGDISKFEKIFNSVIVVTDRLNLDKQIDDNIRK FIDVRSVVGHASSSTDLKNFLVNEKKIIITTIQKFPYLLEKIGTELKGKNFAIIIDEAHS SQSGRAAASLNMAVSATIDINDDNDFEIEDKLNELIEARKMPENASFFAFTATPKAKTLQ MFGNVFDLYSMKQAIEEGFILDVLKNYTYYENFYKIKKSVEDNPIFDKKKAQKKIRKYVE GQEFPIREKAEVMVEHFLHNTATKIAGQAKAMVVTQSILSAIEYYHCINDMLKKSNTGFE AVIAFSGEKEYNGKTVTEASLNGFPDNETAIKFKEAKYKFLIVADKYQTGYDEPLLHTMY VDKVLNDVKAVQTLSRLNRCTKNKIDTCIIDFANKPEHIQEAFQPYYKETRLDGEVDPNK LFNLLSILDSKYVYEKDEVDELVELFLKNSSRDKIDNIIDEAVERYNALSEEDQVEFKSG VKSFIRTYNFLASILAVGQIEWEKKVIFFEHLIHRLPTPTEDLVKGILETIDLESYRLEK KNTIDIILEDKDGAIEGAGIGAGKKSEAELTPLENIVSTFNNVFGNIEWQDKDNVIRQIK ELPEMVMKNDKFKNALINSDLENIKREYSIALRDVFKNIMKDNMELFGQWTNNSDFKKWI DNAIFEEIMTLRN >gi|296154178|gb|ADVK01000039.1| GENE 71 81046 - 82413 1270 455 aa, chain - ## HITS:1 COG:jhp0726 KEGG:ns NR:ns ## COG: jhp0726 COG0732 # Protein_GI_number: 15611793 # Func_class: V Defense mechanisms # Function: Restriction endonuclease S subunits # Organism: Helicobacter pylori J99 # 120 405 125 427 454 99 25.0 2e-20 MNNYDSYKETDIPWLGEIPSHWETKKIGKIFDIRKEKNSPVKTKEVLSLSSMYGVSLYSE RKEKGGNKPKENLEAYNLCYPGDILVNSMNIVAGSVGISNYFGAISPVYYSLQNLSEKKY SKYYLEYLFRNYNFQRSLVGLGKGIQMSETEDGRLFTVRMRISWDTLKSQEFPTPPIEEQ IQIANYLDWKINEIDRLILIEKEQIKELENLKQKYIDEIYQNIKTKNFISLSKIGTFFKG GGFSRENLSDSDYGAILYGDIYTKYNYFFEECISKIDENAYFNSKCIDGNVVLFTGSGET KEDIGKNVAYVGTKKIALGGDIIALKPNKNFSPKFIAYFSNTSNIKAFKHMKSTGDIIVH ITLGAIKSIKIPFISIEEQKDIVKKIDEYILNLKNLIALIEDKIKYFLSLKQSLIAEVVT GKIDVRNITIPQYEKVETIFETDNIDEMEVQNYGD >gi|296154178|gb|ADVK01000039.1| GENE 72 82406 - 84583 2635 725 aa, chain - ## HITS:1 COG:PA2735 KEGG:ns NR:ns ## COG: PA2735 COG0286 # Protein_GI_number: 15597931 # Func_class: V 
Defense mechanisms # Function: Type I restriction-modification system methyltransferase subunit # Organism: Pseudomonas aeruginosa # 13 513 14 518 792 513 52.0 1e-145 MSSVYDVVDYNKLVSFIWSVADDCLRDVYVRGKYRDVILPMTIIARFDAIIDAEKTNILQ TKEWAESSGWDIHKTLDTSIDLPFYNISKFRLKDLKSETNSQNLKKNFEEYLDGFSNNIK EILEKFEFNNQLTKMTNAGILGSVIEKFTSSDLNLSPYDEKNSYGIVVKKGLDNHAMGTL FEEIIRKFNEENNEEAGEHFTPRDVVELMADIAVVPVMNKIKNGTYSIYDGACGTFGMAT IAEERLQTLAKKNNKNVSIHLIGQEVNPETYAISKADLLIRGGDTVSNNVFYGSTLSDDK TSGEHFDFMLSNPPYGKTWKTDLAVLGVGSDKDLKKNIIDKRFVTSYKEQEDFRMLPDVS DGQLLFLLNNISKMKDTELGSRIIEVHNGSALFTGDAGNGASNARRYMIEEDLIEAIIQL PENMFYNTGITTYIWILSNRKEERRKGKIQLINASELKTPLRKNLGKKNCEFSKENRKII LDTYLNFKENEISKIFSNEEFAYYKVTVDRPLRQAIICNDEKIKEIEKELENIGFNSKIN KANLEGTFVKNSATVVKELEKTDNILAYLELLKDMKKDDKYLDFEEFEKLFNKKLKKYGL KAASLSKFISTGLMTNMIVRDENASIQKDTKGNIVVDPELRDTEIVPFTYKGGIEEFIKK EVLPYHNDAFVDETKTQIGYEINFTKYFYKAKELENVETIVARIKELEKESDGMMKNILE GLYDE >gi|296154178|gb|ADVK01000039.1| GENE 73 84597 - 85049 510 150 aa, chain - ## HITS:1 COG:FN0809 KEGG:ns NR:ns ## COG: FN0809 COG0219 # Protein_GI_number: 19704144 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted rRNA methylase (SpoU class) # Organism: Fusobacterium nucleatum # 1 150 1 150 150 308 99.0 2e-84 MNIVLYQPEIPYNTGNIGRSCVLTNTTFHLIKPLGFSLDEKQVKRAGMDYWHLVDLKIWE SFEEFLEANKGIRLFYATTKTKQRYSDVKYEENDYIMFGPESRGIPEDILNKNPERCITI PMIPMGRSLNLSNSAVIILYEAYRQLGFNF >gi|296154178|gb|ADVK01000039.1| GENE 74 85049 - 85669 693 206 aa, chain - ## HITS:1 COG:FN0808 KEGG:ns NR:ns ## COG: FN0808 COG0406 # Protein_GI_number: 19704143 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose-2,6-bisphosphatase # Organism: Fusobacterium nucleatum # 1 206 1 206 206 372 100.0 1e-103 MEIYFVRHGQTIWNVEKRFQGLSDSPLTELGIIQAKLLGEKLKDIKFDKFYSTSLKRAND TANYIKGNRKQEVEIFDDFVEISMGDMEGIKQEDFKKLYPEQVKNFFFNQLEYNPSSFKG ESFIEVRERVTKGLEKFIKLNKNYERVLVVSHGATLKTLLHYISGKDISTLSDEAIPKNT SYTIVKYENGKFEITDFSNTTHLEEK >gi|296154178|gb|ADVK01000039.1| GENE 75 85780 - 86517 
878 245 aa, chain + ## HITS:1 COG:FN0807 KEGG:ns NR:ns ## COG: FN0807 COG1212 # Protein_GI_number: 19704142 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: CMP-2-keto-3-deoxyoctulosonic acid synthetase # Organism: Fusobacterium nucleatum # 1 245 1 245 245 448 99.0 1e-126 MKFLGIIPARYSSTRLEGKPLKMIEGHTMIEWVYKRAKKSNLDSLIVATDDERIYNEVIN FGGQAIMTSKNHTNGTSRIAEVCEKMTEYDTIINIQGDEPLIEYEMINSLIETFKENKDL KMATLKHKLLDKEEIKNPNNVKIVCDKNDYAIYFSRSVIPYPRKNGNISYFKHIGIYGYK RDFVIEYSKMLATPLEEIESLEQLRVLENGYKIKVLETTHSLIGVDTQENLEQVINYIKE NNIKI >gi|296154178|gb|ADVK01000039.1| GENE 76 86537 - 88087 1860 516 aa, chain + ## HITS:1 COG:FN0806 KEGG:ns NR:ns ## COG: FN0806 COG2385 # Protein_GI_number: 19704141 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Sporulation protein and related proteins # Organism: Fusobacterium nucleatum # 184 516 1 333 333 625 99.0 1e-179 MKKQLSLIFLSVLVLISCTNEPAKKVKTIKPNGDYKTGTTITTERGKREKIALENTVFKK LGLPLPYNTFGAAIPYLVPVNDNHKESFGVFEEYNEDKALKYFKNLSSRGHGDNSPYWRW KTSIKKSDLYSKAGSRLIAIYKNNPRNVLTLVNGEWQQAPIRSVGTVQDIIVAARGESGI ITHMLVITSNGKYLIAKEFNIRKLLATNNALYGSKGEEGAYNSKPITPNVTSLPSAYLAL EDEGRYINIYGAGFGHGVGMSQFAAGTLTKNGENYKNVLKRYYTNVKLSTVESVLGKDKE IKVGITTNGSLEHGRLTIFSSENKVQIYNDDFDIAVGENERVDVRNTSGATTITLENGKT FKTKNPLNFNAKGEYLTLSPVRKGHTSSPRYRGVITIIPRGSNLRVINTLDIEKYLLQVV PSEMPKSFGVEALKVQAVAARTYAVSDILKGKYAQDGFHIKDTVESQVYNNQVENEEATR AIEETEGEIMTYDNMPIDAKYFSTSSGFTSHASNVW >gi|296154178|gb|ADVK01000039.1| GENE 77 88099 - 88842 733 247 aa, chain + ## HITS:1 COG:FN0805 KEGG:ns NR:ns ## COG: FN0805 COG4912 # Protein_GI_number: 19704140 # Func_class: L Replication, recombination and repair # Function: Predicted DNA alkylation repair enzyme # Organism: Fusobacterium nucleatum # 1 247 5 251 251 407 99.0 1e-113 MEIENLTFKTEKEYKEFLDYLFSIRDIEYRDFNTKIVVPVDCEIIGIRTPILRDIAKKIA KTSSENFLNLFEKLFTKKKVKYYEEKVLYGFLIGYSKIDFQERLKRIDFFINIIDNWAVC DIVDSSFKFINKNKEDFYTYLTSKLSATNPWEQRFIFVMLLAYCVEEKYLKDIFKICEKI 
KSDEYYVKMAKAWLLSVCYVKYKNETYKFLEKTKLDAWTVNKSIQKIRESLRVTKEEKEK ILVLKRK >gi|296154178|gb|ADVK01000039.1| GENE 78 89065 - 89721 691 218 aa, chain + ## HITS:1 COG:FN0804 KEGG:ns NR:ns ## COG: FN0804 COG0785 # Protein_GI_number: 19704139 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Cytochrome c biogenesis protein # Organism: Fusobacterium nucleatum # 1 218 1 218 218 306 98.0 2e-83 MFTQEIAYSTAYLAGVASFFSPCIFPIIPVYISILSNGEKKSLSKTFAFVLGLSVTYIVL GFGAGVIGDLFLNNNVRIIGGIIVVILGLFQMEILKLKFLEKTKIMNYEGENQSIFSTFL LGLTFSLGWTPCVGPILASILILASSSGDTTNSVMLMFIYLLGMATPFVIFSLASKALFK KMSFIKKYLPTIKKVGGFLIIIMGLLLIFNKLNIFLTV >gi|296154178|gb|ADVK01000039.1| GENE 79 89740 - 91224 2113 494 aa, chain + ## HITS:1 COG:FN0803_2 KEGG:ns NR:ns ## COG: FN0803_2 COG0225 # Protein_GI_number: 19704138 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peptide methionine sulfoxide reductase # Organism: Fusobacterium nucleatum # 183 347 1 165 165 321 100.0 2e-87 MKKFILPLILMFIVGVFVFAKMLSNNLKKETEKEKDLLENIQLIDMNGNDYTFSRDKNIY IKFWASWCPTCLAGLEELDRLAGETNNFEVVTVVFPGINGEKNPAKFKEWYDTLDYKNIK VLYDTDGKLLQIFKIKALPTSAIIYKDLIIDNIIVGHISNGQIKNYYEEKGENITMENNT KNIKEIYLAGGCFWGVEEYFARIDGVIDSVSGYANGSFDNPSYENVCNNSGHAETVHITY DSTKVSLDTLLKYYFRIIDPTSVNKQGNDKGVQYRTGIYYQNEEDKQIALNAIKEEQKKY SKPIVVEIEKLKRFDKAEEYHQDYLKKNPNGYCHINLNKASEAIIDEKKYQKPSDEVLKE KLSTLEYQITQEAATERAFTHEYYKNQEDGIYVDITTGEPLFSSKDKYDAGCGWPSFTKP IATEVVNYKKDSSHGMNRVEVRSRAGEAHLGHVFEDGPRDRGGLRYCINGASLRFIPYDK MDEEGYGEFKKYVK >gi|296154178|gb|ADVK01000039.1| GENE 80 91538 - 92248 860 236 aa, chain + ## HITS:1 COG:FN0802 KEGG:ns NR:ns ## COG: FN0802 COG0765 # Protein_GI_number: 19704137 # Func_class: E Amino acid transport and metabolism # Function: ABC-type amino acid transport system, permease component # Organism: Fusobacterium nucleatum # 1 236 1 236 236 375 100.0 1e-104 MEYLEILKDTFLTDDRYMYIVNGVIFSIGITLFSAILGIILGLLLAIMKLSHFYPFKRIK VLENFNPLSKIAYIYIDVIRGTPVVVQLMILANLIFVGILRETPILIIGGIAFGLNSGAY 
VAEIIRAGIEGLDKGQMEAGRALGLSYSQAMRKIIVPQAVKNILPALVSEFITLLKETSI IGFIGGIDLLRSANIITSQTYRGVEPLLAVGIIYLILTSIFTAFMRKVERGLKVSD >gi|296154178|gb|ADVK01000039.1| GENE 81 92241 - 92969 608 242 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 239 1 242 245 238 50 9e-62 MINVENLSKNFGNLKVLKNISTTINKGEIISIIGPSGSGKSTFLRCINKLEEPTKGHIYI DGMDLMDKNTDINKIRERVGMVFQHFNLFPNMTVLENLTLSPIMVKKESKEEAEKYASYL LEKVGLSDKANSYPSQLSGGQKQRIAIARALAMKPEVILFDEPTSALDPEMIKEVLDVMR DLAKEGMTMIIVTHEMGFAKNVGNRILFMDNGEIIEDCSPKDFFENPTNERIKDFLNKVL NK >gi|296154178|gb|ADVK01000039.1| GENE 82 92999 - 93727 1114 242 aa, chain + ## HITS:1 COG:FN0800 KEGG:ns NR:ns ## COG: FN0800 COG0834 # Protein_GI_number: 19704135 # Func_class: E Amino acid transport and metabolism; T Signal transduction mechanisms # Function: ABC-type amino acid transport/signal transduction systems, periplasmic component/domain # Organism: Fusobacterium nucleatum # 13 242 1 230 230 392 98.0 1e-109 MKKFVKLMLMSLLSVVISISVFAKSNVVYVGTNAEFAPFEYLDKNKIVGFDIDLLDAISK ETGLEFRIQDMAFDGLLPALQTKKVDMVIAGMTATPERQKTVAFSKPYFKAKQVVITKGE NKSLKSFKDLSGKKVGVMLGFTGDAVVSEIKGVKIERFNAAYAAILALSQNKIDAVVLDS EPAKKYTANNKQFVIANIPAEEEDYAIAFRKNDKELINKVNVALDKIKANGEYDKILKKY FK >gi|296154178|gb|ADVK01000039.1| GENE 83 93848 - 95785 2008 645 aa, chain + ## HITS:1 COG:FN0799 KEGG:ns NR:ns ## COG: FN0799 COG1523 # Protein_GI_number: 19704134 # Func_class: G Carbohydrate transport and metabolism # Function: Type II secretory pathway, pullulanase PulA and related glycosidases # Organism: Fusobacterium nucleatum # 1 645 1 645 645 1299 99.0 0 MYYNFNHYINLGANLEKDGCSFAIYAKNVNSLSLNIFHSSEDTVPYEKHILSPSEHKLGD IWSIFLENIKEGTLYNWEINGMAILDPYALAYTGNKTIENKKSIVLARVGTETKHILIPK KNMIIYESHIGLFTKSPSSNTLNGATYSAFEEKIPYLKNLGINVVEFLPIFEWDDFTGNL DRESFFLKNVWGYNPINFFALTKKYSSSKDENSANEINEFKKLIFSLHKNGIEVILDVVY NHTAEGGTGGKVYNFKAMGENIFYTKDRENYFTNFSGCGNTLNCNHKVVKDMIIQSLLYW YLEVGVDGFRFDLAPVLGRDSNNQWARHSLLHELIEHPILSHAKLIAESWDLGGYFVGAM 
PSGWCEWNGAYRDTVRQFIRGDFGQVPELIKRIFGSVDIFHANKNGYQSSINFICCHDGF TMWDLVSYNLKHNLLNGENNQDGENNNHSYNHGEEGFTENSHIISLRKQQIKNMILILYI SQGIPMLLMGDEMGRTQLGNNNAYCQDNPTTWVDWDRKKDFEDVFLFTKNMISLRKSYSI FKKETPLIEGEEVILHGIKLYQPDLSFHSLSIAFQLKDIKSNTDFYIAFNSYTEQLCFEL PILENKSWYILTDTSKVDTCNFQETKCQDTHCCVLPKSSVILISK >gi|296154178|gb|ADVK01000039.1| GENE 84 95851 - 97788 2123 645 aa, chain - ## HITS:1 COG:FN0798 KEGG:ns NR:ns ## COG: FN0798 COG3855 # Protein_GI_number: 19704133 # Func_class: G Carbohydrate transport and metabolism # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 645 1 645 645 1262 99.0 0 MNTEIKYLELLSKTFKNIAETSTEIINLQAIMNLPKGTEHFMTDIHGEYEAFNHVLRNGS GTIRNKIEEAYGNKLTENEKKELASIIYYPKEKVELMQNKDNFNIDRWMITIIYRLIEVC KVVCSKYTRSKVRKAMTKDFEYILQELLYEKKELANKKEYFDSIVDTIISIDRGKEFIIA ICNLIQRLNIDHLHIVGDIYDRGPFPHLIMDTLAEYSNLDIQWGNHDILWIGAALGNKAC IANVIRICCRYNNNDILEEAYGINLLPFATFAMKYYGDDPCKRFRAKEGVDSDLIAQMHK AMSIIQFKVEGLYSERNPELEMSSRESLKHINYEKGTINLNGVEYPLNDTNFPTVNPENP LELLEEEAELLDKLQVSFLGSEKLQKHMQLLFAKGGMYLKYNSNLLFHACIPMEPNGEFS ELFVEDGYYKGKALMDKIDNIVRQAYYDRKNVEVNKKHRDFIWYLWAGRLSPLFGKDVMK TFERYFIDDKTTHKEIKNPYHKLINDEKVCDKIFEEFGLNPRTSHIINGHIPVKVKEGES PVKANGKLLIIDGGFSRAYQSTTGIAGYTLTYNSYGMKLASHLKFISKEAAIKDGTDMIS SHIIVETKSKRMKVKDTDIGKSIQTQINDLKKLLKAYRIGLIKSN >gi|296154178|gb|ADVK01000039.1| GENE 85 97843 - 97977 68 44 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSQANLSMFEANLLASWLNLQRILDFLSLRNLASNELFFTTFTI >gi|296154178|gb|ADVK01000039.1| GENE 86 98060 - 100615 3467 851 aa, chain - ## HITS:1 COG:FN0796 KEGG:ns NR:ns ## COG: FN0796 COG0574 # Protein_GI_number: 19704131 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoenolpyruvate synthase/pyruvate phosphate dikinase # Organism: Fusobacterium nucleatum # 1 851 1 851 851 1665 99.0 0 MKQVYEFKDGGKEMVALLGGKGANLAEMAKIDLPIPKGIIISTTACNEYFKNDKKLSTVL EEEILTNIRVLEYETGKKFQSTKPLLVSVRSGAPVSMPGMMDTILNLGFNDYVAEKMLEI TKDEKFVYSSYLRFVQMFSEIAKGINRKKFMHLKATDYKAQIIESKNIYREECGEMFPEN 
YKAQILIAVKSIFDSWNNDRAILYRKLHNIDNNMGTAVVIQEMVFGNFNDKSGTGVLFTR NPSTGEDKIFGEVLLNAQGEDIVAGIRTPDNIELLKTSMPNIYNELAETVKRLEKHNRDM QDVEFTIEDSKLYILQTRNGKRTAEASLKIAMDLVKEGIITKEEAILKVEPASINKLLNG DFEEKYLKGATLLTKGLAASSGVAVGRIMFDAKRVKIREKTILVREETSPEDLQGMALAQ GIVTLKGGATSHGAVVARGMGKCCVTGCSEIKIDEINKTMTVGKYTLKEGDFISVSGHTG EIYLGKIPLKENSFSDELKEFVSWASEIKRMGVRMNADTPEDVEQGKAFGAKGIGLCRTE HMFFKKDKIWTIREFILSDRGEEKEKALRKLHNLQKEDFLNIFKILEGDEANIRLLDPPV HEFLPKTLEDKKKMAEILSISVENIEKRIYRLKDENPMLGHRGCRLGVSYPELYRIQARA IIEAAYECAKKGIKVHPEIMIPFIMEAKELAYIRAEIEEEVENFFKEVGVTVEYKLGTMI EIPRACLLADEIAEYADFFSFGTNDLTQMSMGLSRDDSVKFLDDYREKGIWEGEPFYSID TKAVTKLVEIGVKNGKTTKPNLTIGICGEHGGDPKSIEFFEGQKFDYVSCSPFRVPTAIL AAAQSYLKLKK >gi|296154178|gb|ADVK01000039.1| GENE 87 100627 - 101250 686 207 aa, chain - ## HITS:1 COG:FN0795 KEGG:ns NR:ns ## COG: FN0795 COG0517 # Protein_GI_number: 19704130 # Func_class: R General function prediction only # Function: FOG: CBS domain # Organism: Fusobacterium nucleatum # 10 207 1 198 198 350 100.0 1e-96 MNINNGSDRMDLTERQRKILMMLREKSLLSGDEIAQNLNVTKSALRTDFSILTRLKLITA KQNKGYIYNKCTIKRVRDCMSPQNSISVKTSVYDAIIHLFNFDLGTLIVVENEKLVGIIS RKDLLKAALNGKNIERIPVSMIMTRMPNIVHCFEDDNIMEAIEKLIKHEIDSLPVLRKEK GKLSLVGRFTKTNVTKLFYQELKNKSI >gi|296154178|gb|ADVK01000039.1| GENE 88 101420 - 101833 581 137 aa, chain + ## HITS:1 COG:no KEGG:FN0794 NR:ns ## KEGG: FN0794 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 137 1 137 137 232 100.0 4e-60 MKIIINVKGLSRKKVIHQEEVELINKISTTKDLITELVKINVEKFNKKIDDKDILSIMTN EYIAEAARGGKVGDEVHGDKKSNLEKALDTAYLAFEDALYCIFINDEQSEKLDDSLNLKD GDILTFIRLTMLAGRMW >gi|296154178|gb|ADVK01000039.1| GENE 89 101912 - 103111 1811 399 aa, chain - ## HITS:1 COG:FN0793 KEGG:ns NR:ns ## COG: FN0793 COG0786 # Protein_GI_number: 19704128 # Func_class: E Amino acid transport and metabolism # Function: Na+/glutamate symporter # Organism: Fusobacterium nucleatum # 1 399 1 399 399 686 100.0 0 MFEYQLNMAETVGFAIILLLLGRWIKKKVSFFEKFFIPAPVIGGTLFSIILLIGHQTETF 
TFTFNDDIKNLLMIAFFTTVGFSASLKILKKGGVGVALFLLAAVVLVILQDIVGPVLAKA LGINPLLGLAAGSIPLTGGHGTSGAFGPYLEDLGATGATVVAVASATYGLIAGCLIGGPI GRRLMIKNNLKPTENKAGVDDSILGSTTEVTEERLFSAVVYIGIAMGIGATITLILGNHG IKFPAYLMGMVVAAIIRNILDFNQKQLPFNEIGIVGNISLSLFLSMALMSMKLWQLIDLA VPLIVILLVQTLLMAFFAYFITFNIMGRDYDAAVMSTGHCGFGLGATPNAMANMETFTTA NGPSVKAFFIIPIVGSLFIDFINAGVIQTFATWIVNNFM >gi|296154178|gb|ADVK01000039.1| GENE 90 103339 - 105360 3305 673 aa, chain - ## HITS:1 COG:FN0792 KEGG:ns NR:ns ## COG: FN0792 COG2987 # Protein_GI_number: 19704127 # Func_class: E Amino acid transport and metabolism # Function: Urocanate hydratase # Organism: Fusobacterium nucleatum # 1 673 1 673 673 1413 99.0 0 MLNNKNIYDAMTIKLTAEDIPMEIPKLDPSIRRAPKRIVKLSDHDIELALRNALRYIPEE FHEMLAPEFLKELEDRGRIYGYRFRPEGNIYGRPIDEYKGKCTEAKAMQVMIDNNLDFDI ALYPYELVTYGETGQVCQNWMQYRLIKKYLENMTQDQTLVVASGHPTGLFRSNPYAPRAI ITNGLMVGLFDNYDDWARGAAIGVANYGQMTAGGWMYIGPQGIVHGTYSTILNAGRLFCG VPADGDLRGKLFITSGLGGMSGAQGKACEIAKGVAIVAEVDLSRINTRLEQGWVNVIANT PEEAFKIAEEKMASKTPYAIAYHGNIVEILEYAIEHNKHIDLLSDQTSCHAVYDGGYCPV GTSFEERTKLLGTDRAKFRELVNEGLKRHYKAIKTLHNRGVYFFDYGNSFLKSIYDVGVT EISKNGKDDKEGFIFPSYVEDILGPELFDYGYGPFRWVCLSRKKEDLLKTDKAALELVDP NRRYQDRDNYVWIQDADKNGLVVGTQARIFYQDAMSRTRIALKFNEMVRNGEIGPVMLGR DHHDVSGTDSPFRETSNIKDGSNIMADMATQCFAGNAARGMTMIALHNGGGVGIGKSING GFGMVLDGSKRVDEILWQAMPWDVMGGVARRAWARNPHSIETVVEYNLDNKGTDHITLPY IVNDELVKKVLKK >gi|296154178|gb|ADVK01000039.1| GENE 91 105377 - 106912 2520 511 aa, chain - ## HITS:1 COG:FN0791 KEGG:ns NR:ns ## COG: FN0791 COG2986 # Protein_GI_number: 19704126 # Func_class: E Amino acid transport and metabolism # Function: Histidine ammonia-lyase # Organism: Fusobacterium nucleatum # 1 511 6 516 516 966 99.0 0 MELVLGSKNITLEDLINVTRKGYKVSISEEAYEKIDKARALVDKYVEEGKVSYGITTGFG KFAEVSISKEQTGQLQKNIVMSHSCNVGNPLPIDIAKGIVLLRAVNLAKGYSGARRIVIE KLVELLNKDVTPWIPEKGSVGSSGDLSPLAHMSLVLIGLGKAYYKGELLEAKDALAKADI EPIPALSSKEGLALTNGTQALTSTGAHVLYDAINLSKHLDIAASLTMEGLHGIIDAYDPR IGEVRGHLGQINTAKNMRNILAGSKNVTKQGVERVQDSYVLRCIPQIHGASKDTLEYVKQ 
KVELELNAVTDNPIIFVDTDEVISGGNFHGQPMALPFDFLGIALSEMANVSERRIEKMVN PAINHGLPAFLVEKGGLNSGFMIVQYSAASLVSENKVLAHPASVDSIPTSANQEDHVSMG SVAAKKSKDIFENVRKVIGMELITACQAIDLKGAKDKLSPATKVAYDEVRKIISYVSEDR PMYIDIHAAEDLIKTNKIVENVEKAIGKLEF >gi|296154178|gb|ADVK01000039.1| GENE 92 107260 - 108378 1030 372 aa, chain + ## HITS:1 COG:FN0790 KEGG:ns NR:ns ## COG: FN0790 COG1940 # Protein_GI_number: 19704125 # Func_class: K Transcription; G Carbohydrate transport and metabolism # Function: Transcriptional regulator/sugar kinase # Organism: Fusobacterium nucleatum # 1 372 16 387 387 669 99.0 0 MYQKEIKQSNENIVFHSIYFTENSFSIPDLTKITNMTFPTIKRVLNEFLKKDIIKEWTLS TGGVGRRAVKYKYNPDFCYSIGVSIDEEKIRFIVINTIGKILQSKEIETTDENFLIFFER NLKYFIEEIDPKYLAKVIGVGISIPGIYNKENHFLEFNNIDRYESSIIKKLEENIKLPIW VENEANMSILAEAIIGKHKDLADFTVISINNKVTCSTFYKFGNKSEDYFFKASRVHHMIV DYENKKKVGDCISFKILKDKILEAFPNIKNLDKFFSNKKYKESKTGKKILDEYLTYMGII LKNLLFTYNPKKLIISGELSQYGNYLLDDILNIVYEKNHIFYRGRETISFSNFKGSSSII GAALFPIVDNLM >gi|296154178|gb|ADVK01000039.1| GENE 93 108394 - 109236 1080 280 aa, chain + ## HITS:1 COG:FN0789 KEGG:ns NR:ns ## COG: FN0789 COG1284 # Protein_GI_number: 19704124 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 280 1 280 280 423 100.0 1e-118 MSNKYFQILKEYSIVTLACIVMAFNINYFFLANKLAEGGIAGVSLILHYLTNIDIGYLYF ILNIPLIILAYIFIGKDFLIKTLFATLVLTIFLKLFGNFRGPIDDILMAAIFGGGINGIA IGIIFYAGGSSGGTDIIAKIINKRFGIAIGKILLTIDFIILSMVAFIFGKVIFMYTLISL LVSAKMIDIIQEGIYSAKGVTIITNKAEELRKKIMEDTGRGLTLINAKGAYTQKEIGMLY CVVGKYQLIKVKNIVKEIDPEAFMIVNQVHEVIGKGFLGQ >gi|296154178|gb|ADVK01000039.1| GENE 94 109313 - 109732 345 139 aa, chain + ## HITS:1 COG:no KEGG:FN0788 NR:ns ## KEGG: FN0788 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 139 1 139 139 251 100.0 7e-66 MNNNEFINKYTSGKCLSFLDFQVVAKKYGIYFEKINNDIIVCYDGNGDPKVAAFKFYKNF FPETTLTPLNFDLITNISNFHLRFLKDKINEISQKYGLPPFYKQSISIKENAISLLNALK TRYAIHREDIEFIKYILDL >gi|296154178|gb|ADVK01000039.1| GENE 95 109852 - 
111027 2059 391 aa, chain - ## HITS:1 COG:FN0785 KEGG:ns NR:ns ## COG: FN0785 COG2025 # Protein_GI_number: 19704120 # Func_class: C Energy production and conversion # Function: Electron transfer flavoprotein, alpha subunit # Organism: Fusobacterium nucleatum # 1 391 1 391 391 720 100.0 0 MNLNDYKGILVFAEQRDGILQNVGLELLGKATELAYEINKQIALKDAGDELADFAGKKEA AIKSVDVAAATLEEDDQELKNKVADIKKNNPDAAKVTALLVGHNIKNLAQDLIKAGADKV LVVDKPELEVYDTEAYAQVLTATINDQKPEIVLFGATTLGRDLAPRVSSRIATGLTADCT KLELLKDKERQLGMTRPAFGGNLMATIVSPDHRPQMATVRPGVMKKLPKSDDRTGEVVEF PVTLDTSKMKVKLLKVVKEGGNKVDISEAKILVSGGRGVGSKQNFELLEDLATEIGGIVS SSRAQVDAGNMPHDRQVGQTGKTVRPEVYFACGISGAIQHVAGMEESEFIIAINKDRFAP IFSVADLGIVGDLHKILPILTEEIKKYKATK >gi|296154178|gb|ADVK01000039.1| GENE 96 111053 - 111841 1195 262 aa, chain - ## HITS:1 COG:FN0784 KEGG:ns NR:ns ## COG: FN0784 COG2086 # Protein_GI_number: 19704119 # Func_class: C Energy production and conversion # Function: Electron transfer flavoprotein, beta subunit # Organism: Fusobacterium nucleatum # 1 262 1 262 262 485 100.0 1e-137 MRIVVCIKQVPDTTEVKIDPVKGTIIRDGVPSIMNPDDKGGLEEALKLKDLYGAEVIVIT MGPPQAEAILREAYAMGADRAILITDRKFGGADTLATSNTIAAAIRKIEDIDLIVAGRQA IDGDTAQVGPQIAEHLGLPQVSYVKEMEYKEDSKSFVIKRATEDGYFLLELPTPGLVTVL SEANQPRYMNVGAIVDVFERPIETWTFDDIEIDPAKIGLAGSPTKVNKSFTKGVKEPGVL HEVDAKEAANIILEKLKEKFII >gi|296154178|gb|ADVK01000039.1| GENE 97 111862 - 113007 1928 381 aa, chain - ## HITS:1 COG:FN0783 KEGG:ns NR:ns ## COG: FN0783 COG1960 # Protein_GI_number: 19704118 # Func_class: I Lipid transport and metabolism # Function: Acyl-CoA dehydrogenases # Organism: Fusobacterium nucleatum # 1 381 1 381 381 736 100.0 0 MEFNVPKTHELFRQMIREFVEKEVKPIAAEVDENERFPVETVEKMAKIGIMGIPIPKQYG GAGGDNLMYAMAVEELSKACATTGVIVSAHTSLGTWPILKFGNEKQKQKYLPKMASGEWI GAFGLTEPNAGTDAAGQQTMAVQDPETGEWILNGAKIFITNAGYAHVYVVFAMTDKSKGL KGISAFIVEANTPGFSIGKKEMKLGIRGSATCELIFENCRIPKENLLGDKGKGFKIAMMT LDGGRIGIASQALGIAAGALDEAINYAKERKQFGRSLAQFQNTQFQIANLDVKVEAARLL VYKAAWRESNNLPYSLDAARAKLFAAETAMEVTTKAVQIFGGYGYTREYPVERMMRDAKI TEIYEGTSEVQRMVIAANIIK 
>gi|296154178|gb|ADVK01000039.1| GENE 98 113211 - 113942 635 243 aa, chain + ## HITS:1 COG:FN0782 KEGG:ns NR:ns ## COG: FN0782 COG4123 # Protein_GI_number: 19704117 # Func_class: R General function prediction only # Function: Predicted O-methyltransferase # Organism: Fusobacterium nucleatum # 1 243 1 243 243 387 100.0 1e-107 MNTNLESIIPLLNKNLKIIQRSDYFNFSIDSLLISEFVNIKKNIKKILDLGTGNAAIPLF LSKKTSAKIYGIEIQEISYNLALRNININNLNEQIYIIYDNMKNYLKYFDIGSFDIVISN PPFFKINENINFLNNLDQLSIARHEIEINLEELTKIASELVKDRGYFYLVHRADRLSEII NNLQKYNFEAKKIKFCYTTEYKNAKIVLIEAIKNGKSGLTILPPLIINKENGEYTDEVLR MFE >gi|296154178|gb|ADVK01000039.1| GENE 99 113954 - 114712 760 252 aa, chain + ## HITS:1 COG:FN0781 KEGG:ns NR:ns ## COG: FN0781 COG2966 # Protein_GI_number: 19704116 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 252 5 256 256 443 100.0 1e-124 MQYDNLVMKVLSTANTIGKILLTSGAETYRVEKAISTVCRRFDLKAETFVTMTCVLTSAK KRDGETITEVNRIYTVSNNLDKIDRIHKILLNIHKYELEDLEKEVKKIQIQTVYKKNTLL ISYFFSAAFFALLFGGKFNDFLVAGFGGIIIFYMSKYANKLKLNNFFINTLGGFLITILS ILATKVGLVSTPSYSAIGTLMLLVPGLALTNAIRDLINGDLIAGTSRTVEAALVGSALAI GTGFALFAMSYF >gi|296154178|gb|ADVK01000039.1| GENE 100 114733 - 115224 483 163 aa, chain + ## HITS:1 COG:FN0780 KEGG:ns NR:ns ## COG: FN0780 COG3610 # Protein_GI_number: 19704115 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 163 1 163 163 239 100.0 1e-63 MNYLEVFTAFFATFFFGIIFSLTGKKLIYSSFAGGLGWYTHLLFFKELSYSKTASFVISA VVITIFSEIIGRLEKTTVTSTLIPALIPLVPGGGIYYTMSFFVENRFSEAFDKGRETIFL TVALSVGIFLVSTFSQILDRTIKYTKVLKKYRKFKEYKRKHKI >gi|296154178|gb|ADVK01000039.1| GENE 101 115243 - 116127 923 294 aa, chain + ## HITS:1 COG:FN0779 KEGG:ns NR:ns ## COG: FN0779 COG0523 # Protein_GI_number: 19704114 # Func_class: R General function prediction only # Function: Putative GTPases (G3E family) # Organism: Fusobacterium nucleatum # 1 294 1 294 294 520 99.0 1e-147 
MRILLVSGFLGAGKTTFIKELPKNINLEFVVLENEYADIGIDRDFLDEKNLNVWEMSEGC ICCSMKGDFKSSIKRIYSEINPEYLIIEPTGVGMLSSIIENIREINNNDIEILSPLTLID ITSFNEYLKTFNDFFYDNLKNTGKVILTKLESFNSFDVESIKSEILKTNSNLEIITTDYR YFPKEWFGEILNKNIDNKIIDKSFSLKTHINLRTFSKENVNLKTMDELGLLLNRLVNGDF GKIYRAKGIVKIDGYWGKFNLVYKNFEMEPITDAKGTKIVIIGNNLDIKNLKNI >gi|296154178|gb|ADVK01000039.1| GENE 102 116136 - 117362 1194 408 aa, chain + ## HITS:1 COG:FN0778 KEGG:ns NR:ns ## COG: FN0778 COG0500 # Protein_GI_number: 19704113 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 1 408 1 412 412 631 98.0 0 MKKEDILSELIENIKDDKLIKIVFSDKQDGDFNKIIIKPLSLKFAKNIQIESFKDNKAFH KNIELNNIEKIKNILKEYVENFKQILLQIESLNISFMKKKETFIKKENNNNLIKNSNEHN KKKQYILNEGDKIDFLIELGLMSVEGKILKSSYNKFKQINKYLEFINDVIVELKTKKLIN NHINILDFGCGKSYLTFALYYYLKNYRKDLSFSIVGLDLKKDVIEFCNKLAQKLSYENLE FLNGNIKDYDRAKEVDLVFSLHACNNATDYSLEKALSLNAKAILAVPCCHHEFFEKIQKN KDSKFYDTLKIIADNGIVLDKFASLATDSFRSLTLELCGYKTKMIEFIDMEHTPKNILIK AIKSRSSNLKEKLKEYNSLKKFLGIQPLLEELTKKYFLIDTNTEIPYN >gi|296154178|gb|ADVK01000039.1| GENE 103 117391 - 119193 2416 600 aa, chain + ## HITS:1 COG:FN0777 KEGG:ns NR:ns ## COG: FN0777 COG0481 # Protein_GI_number: 19704112 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane GTPase LepA # Organism: Fusobacterium nucleatum # 1 600 5 604 604 1154 100.0 0 MLQKNKRNFSIIAHIDHGKSTIADRLLEYTGTVSERDMKEQILDSMDLEREKGITIKAQA VTLFYKAKNGEEYELNLIDTPGHVDFIYEVSRSLAACEGALLVVDAAQGVEAQTLANVYL AIENNLEILPIINKIDLPAAEPEKVKREIEDIIGLPADDAVLASAKNGIGIEDILEAIVH RIPAPNYDEDAPLKALIFDSYFDDYRGVITYVKVLDGNIKKGDKIKIWSTEKELEVLEAG IFSPTMKSTDILSTGSVGYIITGVKTIHDTRVGDTITSVKNPALFPLAGFKPAQSMVFAG VYPLFTDDYEELREALEKLQLNDASLTFVPETSIALGFGFRCGFLGLLHMEIIVERLRRE YNIDLISTTPSVEYKVSIDNQEEKVIDNPCEFPDPGRGKITIQEPYIRGKVIVPKEYVGN VMELCQEKRGIFISMDYLDETRSMLSYELPLAEIVIDFYDKLKSRTKGYASFEYELSEYK ISNLVKVDILVSGKPVDAFSFIAHNDNAFHRGKAICQKLSEVIPRQQFEIPIQAALGSKI 
IARETIKAYRKNVIAKCYGGDITRKKKLLEKQKEGKKRMKSIGNVEIPQEAFVSVLKLND >gi|296154178|gb|ADVK01000039.1| GENE 104 119230 - 120213 1444 327 aa, chain - ## HITS:1 COG:FN0776 KEGG:ns NR:ns ## COG: FN0776 COG2502 # Protein_GI_number: 19704111 # Func_class: E Amino acid transport and metabolism # Function: Asparagine synthetase A # Organism: Fusobacterium nucleatum # 1 327 1 327 327 654 100.0 0 MAYISSLDILETEIAIKKVKDFFESHLSKELDLLRVSAPLFVIPESGLNDNLNGTERPVS FDTKSGERVEIVHSLAKWKRMALYRYNIENDKGIYTDMNAIRRDEDTDFIHSYYVDQWDW EKIISKEDRNEEYLKDVVRKIYSVFKKTEEYITTEYPKLTKKLPEEITFITAQELENKYP NLTPKNREHAAAKEYGAIFLMKIGGKLSSGEKHDGRAPDYDDWDLNGDIIFNYPLLGIGL ELSSMGIRVDEKSLDEQLKIANCEDRRSLPYHQMILNKVLPYTIGGGIGQSRICMFFLDK LHIGEVQASIWSQEVHEICRQMNIKLL >gi|296154178|gb|ADVK01000039.1| GENE 105 120300 - 121589 1712 429 aa, chain - ## HITS:1 COG:FN0775 KEGG:ns NR:ns ## COG: FN0775 COG1362 # Protein_GI_number: 19704110 # Func_class: E Amino acid transport and metabolism # Function: Aspartyl aminopeptidase # Organism: Fusobacterium nucleatum # 1 429 1 429 429 816 99.0 0 MEKLKLAKHLINFIDESPSNYFACINTKNILNDKGFIELFETEEWKLKKGGKYFVTINDS GIIAFTIGSEKISKSGYKIAASHTDSPGFLIKPSPEINRKGFNILNTEVYGGPILSTWFD RPLSFSGRVFVESDNALKPKKYFIKYDKDLFIIPSLCIHQNRGVNDGMAINAQKDTLPLV TITDENEKFSLKKLLAKQLKVKEDKILSYDLNLYSREKGCLLGAKGEFISVGRLDNLAAL HAGLMSLVNNKDKRNTCVVVGYDNEEIGSNSIQGADSPTLKNILERISNAMKLSFEEHQQ ALANSFVISNDAAHSIHPNYLEKSDPTNEPKINAGPVIKMAANKSYITDGYSKSVIEKIA KDFKIPIQTFVNRSDVRGGSTIGPIQQSQIRILGIDIGSPLLSMHSVRELGGVDDHYNLY RLISEFFKI >gi|296154178|gb|ADVK01000039.1| GENE 106 121719 - 122534 1136 271 aa, chain - ## HITS:1 COG:FN0774 KEGG:ns NR:ns ## COG: FN0774 COG2849 # Protein_GI_number: 19704109 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 24 271 1 248 248 430 100.0 1e-120 MKKILFLLLAFCSFVAFAAGELNMDEVNKYVSEKLNRDKEITITYKLNKANNTLEGYSEE GKLIVVNSLKDEPDIVQMAGMKTKVSEKNGKLNPVSEIYLANGQLVVRNTYKFNRDTNIF ATDAVIAYVNGEIPYSADLKAFLDGIDRIQIENFENNKLALYTTYEINHKTQQIIIKNGL 
SAKATITKAVLSINGLNGTMETYYENGKVNQKVAIKNGLFNGKVEKFSDKSGKLVGTGIM KDGLPDGEFIEYDEAGKVISKAKYKDGKEVK >gi|296154178|gb|ADVK01000039.1| GENE 107 122602 - 122883 495 93 aa, chain - ## HITS:1 COG:FN0773 KEGG:ns NR:ns ## COG: FN0773 COG3077 # Protein_GI_number: 19704108 # Func_class: L Replication, recombination and repair # Function: DNA-damage-inducible protein J # Organism: Fusobacterium nucleatum # 1 93 1 93 93 134 96.0 4e-32 MAILTIKVDDNVLEEAKRIFDEIGMDTDRAINTFLKKCISENAIPFDLTISNNENWTKML EKNVKKEDARKVEKNKTNLDEDIEELVYEGLDV >gi|296154178|gb|ADVK01000039.1| GENE 108 122954 - 123463 621 169 aa, chain - ## HITS:1 COG:FN0772 KEGG:ns NR:ns ## COG: FN0772 COG0716 # Protein_GI_number: 19704107 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 169 1 169 169 327 99.0 5e-90 MKTLIVYSTISGNTKSVCERIYGALNAEKEIINVKDIKNLQVNNYDNFIIGFWCDKGTMD KDSIDFLKILNNKNIYFVGTLGADPDSGHWNDVFENAKKLCSENNNFKDGLLIWGRISQE MQDMMKNFPASHPHAVTPERLARWEAASTHPDENDFKKAEEFFSNLLNN >gi|296154178|gb|ADVK01000039.1| GENE 109 123460 - 124695 1401 411 aa, chain - ## HITS:1 COG:FN0771 KEGG:ns NR:ns ## COG: FN0771 COG0635 # Protein_GI_number: 19704106 # Func_class: H Coenzyme transport and metabolism # Function: Coproporphyrinogen III oxidase and related Fe-S oxidoreductases # Organism: Fusobacterium nucleatum # 1 411 1 411 411 769 99.0 0 MFKIRYKSHHDVGNIISKFTENLKASKNDFLDLLNRENKNKQLGIYFHTPYCDKICSFCN MNRKQLDNDLEEYTKYLCEEIKKYGAYEFCKTSEIDVVFFGGGTPTIFKKEQLERILKTL NENFKFAKDYEMTFETTLHNLSFEKLKIMEENGVNRISVGIQTFSNRGRKLLNRTYDKDY VVERLKEIKKRFSGLVCIDIIYNYANQTDEEVLQDADLLAEVGADSASFYSLMIHDGSNI SKEREKDKSVYIYNLARDEKLHNLFYNRCIEKGYKLLELTKITNGRDAYKYIRNNNGLRN LLPIGAGAGGHIQDIGAYNMNQQMSFYSKTTEISHNLSMISGLMQFDKFDLEVIKKYCNE ESYKIIYRKLKEFEKEGYIKIENNFAIYQLKGIFWGNSLVADIIEEIGRHL >gi|296154178|gb|ADVK01000039.1| GENE 110 124709 - 125485 212 258 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|90020817|ref|YP_526644.1| ribosomal protein S16 [Saccharophagus degradans 2-40] # 2 243 10 244 318 86 27 7e-16 
MAIINIEKLNYSYGKKEVLKELSLNIDENKLTGIIGPNGCGKSTLAKNIIRYINGKFEYF KIMDIDIRQLSHKKIAQLISYIPQKSTIISNISVFDYVLLGRFPLLKNSWDNYSEKDYEI VENNINLLNIKELRGRNVETLSGGELQKALLARALAQEAKILLLDEPTSALDLNNAVEFM KILKNISIKKEISVIIIIHDLNLASLFCDSLIILKDGKFIKKGNPKEVINEENIKSIYNL DCKVCYNENDKPYIIPKT >gi|296154178|gb|ADVK01000039.1| GENE 111 125485 - 126453 977 322 aa, chain - ## HITS:1 COG:FN0769 KEGG:ns NR:ns ## COG: FN0769 COG0609 # Protein_GI_number: 19704104 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Fusobacterium nucleatum # 1 322 1 322 322 503 100.0 1e-142 MKKIFFLISLMITFIVIALSLSIGSVFIPIKSLLFLSPMDEYMKMIIFDLRLPRILMAFL VGMLLASSGNIVQIIFQNPLADPYIIGIASSATFGAVIAYLLKLPEFSYGMVAFICCMIS TLLIFKISKRGNKIEVNTLLIVGITLSAFLAGFTSFAIYMIGEDSFKITMWLMGYLGNAS WSQIIFLIIPLVFSSAYFYAKRNELDILMLGDEQAHSLGVNIAKLKFHLLIVSSFVVAYS VAFTGMIGFVGLIVPHIMRSIIGPLNARLIPFVLIYGGIFLLICDTFGRIILAPVEIPIG VITSILGAPFFLYLALKRSRRK >gi|296154178|gb|ADVK01000039.1| GENE 112 126468 - 128630 2899 720 aa, chain - ## HITS:1 COG:FN0768 KEGG:ns NR:ns ## COG: FN0768 COG1629 # Protein_GI_number: 19704103 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Fusobacterium nucleatum # 6 720 1 715 715 1283 98.0 0 MKKYLMGLSIFIFCASAYGEVINLGEKNIYSETGFEKNLRNSTTSPYIITSKDIETKGYT SVSEILDSVPGVNVQEGLRPAVDVRGQGFQKAKATVQLLVDGVPANMLDTSHMNVPIDVV NINEIERIEVIPGGGAVLYGSGTSGGVINIITKKYKGNNNVRGGVGYQLASFRNNKFDVS AGTSVGDFDFDINYSKNRKYGYRDYDFTNSDYFSGRINYNINKTSNIAFKYSGYRDKYTY PNFLDQKELDENRRQSGIDKEAKENNRIKKDEFTLTYNTKIGDKNDLNILGFYQKTDIPS ESIEDYTSEYKGMLAGQTAKLRKALRDPRLSARARAAMQNRLNALLAELGSTNNVDLKKF SQFKDTKKAIKIKDKFTYDSVGSNVIVGLGYTNNDMLRVSKTDLVGKRTMADTKLDLSKK TFEVFALNTFKVNRFEFIQGLRFENSKYDGTRKNNDVALDIKKSKDDWAGSLAVNYLYSD TGNAYVKYERAFTSPAPGQLVDKIEIAPRVYTYKVNNLKSESTNLFEVGWNDYLFGSLLS ADVFYSETKDEIATIFDVGGHGFGFKNTNIGKTKRYGFDLSAEQKFEKFTFKEAYSFIET KILKDNSNSFEGKHIADVPKHKLVFSVDYDITSKFTVGADYEYRAAAFIDNANKYGKDKA 
KSVFNLRANYKITNSLNVYAGINNIFGAKYYNSVRGNSRGEKFYDPTPKINYYAGFKYKF >gi|296154178|gb|ADVK01000039.1| GENE 113 128700 - 129590 904 296 aa, chain - ## HITS:1 COG:FN0767 KEGG:ns NR:ns ## COG: FN0767 COG0614 # Protein_GI_number: 19704102 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 22 296 1 275 275 466 99.0 1e-131 MKKIITFICFIFFTISSFAIKVENNQILDDYGNKIEAKEYKKIIVTDPGVIEILFKIGGE KSIVAIAKTSRSKIYPSDKVDKLVSIGNVSNLNLEKVVEYKPDLIVVSSMMLRNVEAIKK MGYKVIVSNASNLNGILDTISVTGIISGKKDEAEKLRKECSLKLEKFEKENTKKASKLKG AILFSTSPMSAFSENSIPGDVLKHLGVINIAENVPGQRPILSPEYILKENPDFLAGAMSL DDPQQIIEASNVIPKIKAGKNKNIFILDSSVILRSSYRIFDEMEVLKEKLNKIENK >gi|296154178|gb|ADVK01000039.1| GENE 114 129819 - 130247 588 142 aa, chain - ## HITS:1 COG:FN0766 KEGG:ns NR:ns ## COG: FN0766 COG1970 # Protein_GI_number: 19704101 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Large-conductance mechanosensitive channel # Organism: Fusobacterium nucleatum # 1 142 1 142 142 237 98.0 5e-63 MVLGGLMKLFDEFKTFVMRGNVVDLAVGVIIGAAFGKIVTSLVNDIFMPIIGMIIGNIDF SSLVIKLGEPVEGAEQAAIRYGMFIQEIVNFLIIALCVFVAIKVINKLQKKKEEASAPAP GPTKEEVLLTEIRDALNKIAEK >gi|296154178|gb|ADVK01000039.1| GENE 115 130277 - 131362 1415 361 aa, chain - ## HITS:1 COG:FN0765 KEGG:ns NR:ns ## COG: FN0765 COG0482 # Protein_GI_number: 19704100 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain # Organism: Fusobacterium nucleatum # 1 361 2 362 362 687 98.0 0 METKSIAPEFKKYLKFDSNNSNIRIGVAMSGGVDSSTVAYLLKQQGYDIFGVTMKTFKDE DSDAKKVCDDLGIEHYILDVRDEFKEKVVDYFVNEYMNGRTPNPCMVCNRHIKFGKMLDF ILSKGASFMATGHYTKLKNGLLSVGDDSNKDQVYFLSQIEKNKLSKIIFPVGDLEKTKLR ELAEQLGVRVYSKKDSQEICFVDDGKLKQFLIENTKGKAEKPGNIVDKNGNILGKHKGFS FYTIGQRKGLGISSEEPLYVLAFDRKTNNIIVGQNEDLFRDELIATRLNLFSVSSLEGLD 
NLECFAKTRSRDILHKCLLKKDGDNFQVKFIDNKVRAITPGQGIVFYNNDGNVIAGGFIE K >gi|296154178|gb|ADVK01000039.1| GENE 116 131378 - 132001 737 207 aa, chain - ## HITS:1 COG:no KEGG:FN0764 NR:ns ## KEGG: FN0764 # Name: not_defined # Def: amino acid transporter LysE # Organism: F.nucleatum # Pathway: not_defined # 58 207 1 150 150 202 99.0 1e-50 MDTTILKGVLTGLILSLPFGPVGVYCMELTIVEGRWKGYITALGMVTMDMVYSTVALLFL SSVKEYVVKYERYLSLFIGIFLMIVSSKKLLKKIELKELSVDFKSMLQNYLTGVGFAIVN ISTILVIATVFAFLRILDDVTTLSSLETIIGVGLGGSGLWFFTTYIISHFRRLFGKEKLI KIIKFANGIIFILALFVVIYSAKQIIN >gi|296154178|gb|ADVK01000039.1| GENE 117 132120 - 132476 193 118 aa, chain + ## HITS:1 COG:no KEGG:FN0762 NR:ns ## KEGG: FN0762 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 118 1 118 118 146 100.0 2e-34 MSELEIKIIKFLLSSAVYSENAIMKNLGIDKDTLDKSFKILEENGYLESYENFIKREKLN EESDCCKTKKNKICSSCSSSSCSSCSSNSSCCDNNIFSDLTDFSKIKVITMKAVDNFS >gi|296154178|gb|ADVK01000039.1| GENE 118 132532 - 133302 998 256 aa, chain - ## HITS:1 COG:FN0761 KEGG:ns NR:ns ## COG: FN0761 COG1521 # Protein_GI_number: 19704096 # Func_class: K Transcription # Function: Putative transcriptional regulator, homolog of Bvg accessory factor # Organism: Fusobacterium nucleatum # 1 256 1 256 256 464 100.0 1e-131 MIIGIDIGNTHIVTGIYDNNGELISTFRIATNDKMTEDEYFSYFNNITKYNEISIKKVDA ILISSVVPNIIITFQFFARKYFKVEATIVDLEKKLPFTFAKGINYTGFGADRIIDITEAM QKYPDKNLVIFDFGTATTYDVLKKGVYIGGGILPGIDMSINALYGNTAKLPRVKFTTPSS VLGTDTMKQIQAAIFFGYAGQIKHIIKKINEELNEEIFVLATGGLGKILSAEIDEIDEYD ANLSLKGLYTLYKLNK >gi|296154178|gb|ADVK01000039.1| GENE 119 133552 - 134364 627 270 aa, chain + ## HITS:1 COG:no KEGG:FN0760 NR:ns ## KEGG: FN0760 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 270 1 270 270 391 100.0 1e-107 MKRIKFVYIYFLFLFYLIGGYFVNLPFINRGIYEKIYKYLGIMLIPTLLFFILYGFVFLI RDKKLRFFWELRLYYMFIFFIVAVYLYILFSSGVYFANVKNFVFDGEFLRTLINKSLFEY NIGYLPTYILYELMNISLKFNQYPFYYFYYFLIAFEAFLIILTVFNPMRKSIKKSNARRK 
KERQRARIEAELMEQIKIKEDLERKEALKIQKHKKMEEDAIKKKADNFEKMKKSKRASRK KGNEKPMEDKIKKQMNGIVLQKTVTINRED >gi|296154178|gb|ADVK01000039.1| GENE 120 134366 - 134944 781 192 aa, chain + ## HITS:1 COG:FN0759 KEGG:ns NR:ns ## COG: FN0759 COG0424 # Protein_GI_number: 19704094 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Nucleotide-binding protein implicated in inhibition of septum formation # Organism: Fusobacterium nucleatum # 1 192 1 192 192 345 100.0 4e-95 MILASNSQRRQEILKDAGFNFKVITSNIEEISDKKNITERILDIAEKKLEQIAKNNINEF VLAADTVVELDGKILGKPKNREEAFRFLKSLSGKVHRVITAYVFKNISKNILIREVVVSE VKFFDLDDDTINWYLDTDEPFDKAGAYGIQGYGRILVEKINGDYYSIMGFPISNFLENLR KIGYKISLIDKI >gi|296154178|gb|ADVK01000039.1| GENE 121 134962 - 136008 1404 348 aa, chain + ## HITS:1 COG:FN0758 KEGG:ns NR:ns ## COG: FN0758 COG1077 # Protein_GI_number: 19704093 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Actin-like ATPase involved in cell morphogenesis # Organism: Fusobacterium nucleatum # 1 348 6 353 353 630 99.0 0 MKKFMGKLLGIFSDDLGIDLGTSNTLICMKNKGIILREPSVVAISTKTKEIFEVGEKAKH MIGRTPSTYETIRPLRNGVIADYEVTEKMLRSFYKRIKSGTLLNKPRVIICVPAGITQVE KRAVMEVTREAGAREAYLIEEPMAAAIGVGINIFEPEGSMVVDIGGGTSELAVVSLGGVV KKSSFRVAGDRFDTAIVDYVRQKHNLLIGEKSAEDIKIKIGTVSPEEEDMEIEVSGKYVL NGLPKDITLTSSELIDTLSTLVQEIIEEIRVVFEKTPPELAADIKKRGIYISGGGALLRG IDKKIAAGLNLKVTISEDPLNAVINGIGVLLNNFSLYSKVLVSTETEY >gi|296154178|gb|ADVK01000039.1| GENE 122 136021 - 136566 559 181 aa, chain + ## HITS:1 COG:FN0757 KEGG:ns NR:ns ## COG: FN0757 COG1386 # Protein_GI_number: 19704092 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing the HTH domain # Organism: Fusobacterium nucleatum # 1 181 1 181 181 301 100.0 6e-82 MSIKNQVESIIFLGGDENKIKDLAKFFKISIEDMLKILLELKDDRKDTGINLEVDSEIVY LSTNPLYGEVINNYFEQETKPKKLSSASIETLSIIAYKQPITKSEIESIRGVSVDRIVSN LEERKFVRNCGKQESGRKANLYEVTDKFLSYLGIKSITELPDYDLLKEKIKNMENITTNE D >gi|296154178|gb|ADVK01000039.1| GENE 123 136556 - 137260 783 234 aa, 
chain + ## HITS:1 COG:FN0756 KEGG:ns NR:ns ## COG: FN0756 COG1187 # Protein_GI_number: 19704091 # Func_class: J Translation, ribosomal structure and biogenesis # Function: 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases # Organism: Fusobacterium nucleatum # 1 234 1 234 234 405 100.0 1e-113 MRINKFLSTLGIASRRAIDKYIEEGRITVNGNTATTGMDINENDNIFIDGKKIKTKIDEE KVYFMLNKPLEVLSSSSDDRGRKTVVDLIKTDKRIFPIGRLDYMTSGLILLTNDGELFNR VVHPKSEIYKKYYIKVFGEIKKEEIDELKKGVLLDDGKTLPAKISGIKYDKNKTSMYISI REGRNRQIRRMIEKFEYKVLMLRREKIGELSLGDLPEGKYRELTKQEVEYLYSI >gi|296154178|gb|ADVK01000039.1| GENE 124 137269 - 137559 359 96 aa, chain + ## HITS:1 COG:FN0755 KEGG:ns NR:ns ## COG: FN0755 COG0721 # Protein_GI_number: 19704090 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Asp-tRNAAsn/Glu-tRNAGln amidotransferase C subunit # Organism: Fusobacterium nucleatum # 1 96 1 96 96 134 100.0 5e-32 MALTREEVLKIAKLSKLSFEDKEIEKFQVELNDILKYIDMLNEVDTSKVEPLVYINESVN NFREKEEKPSLEIEKVLFNAPESAENAIVVPKVIGE >gi|296154178|gb|ADVK01000039.1| GENE 125 137576 - 139039 443 487 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163737840|ref|ZP_02145257.1| 30S ribosomal protein S4 [Phaeobacter gallaeciensis BS107] # 23 486 21 463 468 175 30 1e-42 MVYNNLYELTAKELRDKFLSNELSAEEIVNSFYERIEKVEDKIKSFVSLRKDKALDEARK LDEKRKNGEKLGRLAGIPIAIKDNILMEGQKSTSCSKILENYIGIYDATVVKKLKEEDAI IIGITNMDEFAMGSTTKTSFHHKTSNPWDLNRVPGGSSGGAAASVAAQEVPISLGSDTGG SVRQPASFCGVVGFKPTYGRVSRYGLMAFASSLDQIGTLAKTVEDIAICMNVIAGVDDYD ATVSKKEVPDYTEFLNKDIKGLKIGLPKEYFIEGLNPEIKNVVDNSVKALKELGAEVVEI SLPHTKYAVPTYYVLAPAEASSNLARFDGIRYGYRAKDYTDLESLYVKTRSEGFGAEVKR RIMIGTYVLSAGFYDAYFKKAQKVRTLIKQDFENVLNEVDVILTPVAPSVAFKLSDTKTP IELYLEDIFTISANLAGVPAISLPGGLVDNLPVGVQFMGKPFDEEILIKIADALEKKIGR LNLPKLD >gi|296154178|gb|ADVK01000039.1| GENE 126 139053 - 140498 1935 481 aa, chain + ## HITS:1 COG:FN0753 KEGG:ns NR:ns ## COG: FN0753 COG0064 # Protein_GI_number: 19704088 # Func_class: J Translation, ribosomal structure and biogenesis # Function: 
Asp-tRNAAsn/Glu-tRNAGln amidotransferase B subunit (PET112 homolog) # Organism: Fusobacterium nucleatum # 1 481 1 481 481 878 99.0 0 MIKEWESVIGLEVHLQLKTGTKVWCGCKSDYDESGINLHTCPICLGHPGALPKLNKKVVD YAIKAALALNCQINNESGFDRKNYFYPDAPKNYQITQFEKSYAEKGYLEFKLNSGRQVKI GITKIQIEEDTAKAVHGKNESYLNFNRASIPLIEIISEPDMRNSEEAYEYLNTLKNIIKY TKVSDVSMETGSLRCDANISVMEKGSKVFGTRVEVKNLNSFKAVARAIDYEIGRQIELIQ NGGKVDQETRLWDEENQITRVMRSKEEAMDYRYFNEPDLLKLVISDEEIEEIKKDMPETR LAKIERFKNNYSLDEKDAFILTEEVELSDYFEEVVKYSNNAKLSSNWILTEVLRILKHKN IDIEKFTISSENLAKIIKLIDKNTISSKIAKEVFEIALDDSRDPEIIVKEKGLVQLSDTS EIEKMVDEVLANNQKMVDDYKSADEGRKPRVLKGIVGQVMKISKGKANPEIVNDLIMEKL K Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:48:53 2011 Seq name: gi|296154170|gb|ADVK01000040.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00050, whole genome shotgun sequence Length of sequence - 12133 bp Number of predicted genes - 7, with homology - 7 Number of transcription units - 4, operones - 1 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 3 - 6216 8643 ## FN1554 hypothetical protein - Prom 6352 - 6411 13.2 - Term 6603 - 6638 5.3 2 2 Op 1 30/0.000 - CDS 6665 - 7849 1535 ## PROTEIN SUPPORTED gi|119502908|ref|ZP_01624993.1| Ribosomal protein S19 - Prom 7876 - 7935 11.2 3 2 Op 2 51/0.000 - CDS 7937 - 10018 3017 ## COG0480 Translation elongation factors (GTPases) 4 2 Op 3 56/0.000 - CDS 10064 - 10534 778 ## PROTEIN SUPPORTED gi|19704889|ref|NP_602384.1| 30S ribosomal protein S7 5 2 Op 4 . - CDS 10562 - 10930 627 ## PROTEIN SUPPORTED gi|19704890|ref|NP_602385.1| 30S ribosomal protein S12 - Prom 10969 - 11028 12.9 + Prom 10946 - 11005 8.5 6 3 Tu 1 . + CDS 11117 - 11644 572 ## gi|296328710|ref|ZP_06871226.1| conserved hypothetical protein + Term 11656 - 11693 1.3 - Term 11643 - 11680 5.1 7 4 Tu 1 . 
- CDS 11689 - 12132 940 ## COG1454 Alcohol dehydrogenase, class IV Predicted protein(s) >gi|296154170|gb|ADVK01000040.1| GENE 1 3 - 6216 8643 2071 aa, chain - ## HITS:1 COG:no KEGG:FN1554 NR:ns ## KEGG: FN1554 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 780 2071 1 1292 1582 2017 97.0 0 MSNNLYKVEKNLRSIAKRYKSIKYSVGLAILFLMLGVGAFSEEVNDTQLNNVPTREEIAS SRENLKNSVGSLQSKINQARAENSKGLEGLRLELIQLMEQGDQVVKSPWMSWQFGANYMY SKWNGTYKGKGDKAEKYPFEGMFTRSTNLFERAVSPLSEKYKELATSTNPYSASSNARNG LGSGYGLASTTPRQEPLVAINIEASIRPKDVTRSAVSAPTVGVGAPRLDTLNVPSSEPLS VTPPDPKAPEKTVSIVQPNASPFTGFFFDADYNAINFSAYNNGKKNADADLSDVDLYSGL EYNSWANGNTNPQGAAKTGYKKGDTHTEVTNLNTVDVNDGNHNVQKVGRTTNIIYRRGNT NGTPVNLSNLNIYVRGYYDGTLSGGKGTTGGKMDGWIDSGRGAEGGSADPDPLNPTRGTI GLHTLLNAKVSKVTANLYGRAGFLTSETWRSGTVTMDRTTVNVYNDQNSVFYIMPAAYGT IAAYLSAGGSHDNFYIGGLKGSTNVNLYGTGNSVYLSTGISGARHIENKGIINSDGASNI VYSNIGYTPDWSKTWYQDRGGAAGNRPLQNGYTKNIMRSVIKLGTDGGQVNLYGDENVGL FFGSKMGGADPKSWEKGHRDAEFKNANYLRKASFIGIYQGEIELHAKIGEKTSSTGKDQV NGVGNIHENTPNGKKYDPKFVEGAVGIYSESGQREGIDPIRDLGVPTKANGGSGHYTHIG SLNNDKVHNLQVGKVDVKFGSKSKNGFMFISKLGTVMDIAKPGTAKDYIGTLSEEITDGM NGENTTEEDASTGTTIAYAEGTWDQGKHQLGSTAAEKAKNDNKAAGSTANKLQGLGSEIN IFTPKVTLASKEGIAYMGDNKGIINVGTTANKVKTTAVNHKSIIGFARDEGTVTINGNVE AKDAKAQTNKWQNIAGLAVKTKTGTKGGTVTINGDVDIHGMAGFADGTGSKVDLKGTGNK VSTGTDGALAAKNGGVVNFGGGTITHKNNGTVVGKNDHESSVPFYADNNSKINFEGDGTN PSRTTIEMADGVLMSEEATAYNGLNDGKAKYNGMKNVTVKLIGDNVILKTSIGKDITWTG ASGLVASLKSDMKLGGLDLGTHKYKVYYLNGTFKIDTNIDLDDSKIAFHNVGLSNEMVTI NTGREIKSATGKGLAVASNKNAITNASSGYINKGNINITGGSLASGTIGLNVSYGTVRNE KNINVANGIGVYGINGSKLVNETSGKINIGTQGVGMAGFASAGARKNYGTDKLNSSSPFF NTEKLFEIENNGTIQANGDKSIGLYGETNDLYGRGLTSSNGSITNNGKLILTGNKAVGIV SKRATVKLNGTGSSDIVLGKEGIGVYAENSLVNLNSNYGIEVKDKGTGVYVDKDSEIITP GRTVELKYTGSNTGTGVGLFYEGKTAAIMTNGTNVKLIDTVGTTGGLVGLYTNNGGILTN NGNISGDKGYGIITDGTEINNAGNITFNNPVTSKNASVGIYTKSSDKITNSLIGKIKLGK NSVGIYGKAVENSGEIEVGDGGTGIYSGGGNVNLNSTGKINVGRDKAVAIYAKGTNQNIT AHSGSTINLGNTSFGIINEGTNNKITSNIANINNLGNDTVYIYSTDTRGRVTNNTNLKST 
GYLNYGIYSAGTVENKGNIDFSSGYGNVGIYSIKGGNANNTGNITVGKTMEISTPTASDP TKTTTYYAIGMAAGYTPESGTGYTGNITNTGTINVNGDGGSIGMYGTERGTRVINNGTIN LRANNSVGMYLDNGAYGENNGTIQTIGTGLKKVTGVVVKNGSRFKNNGTVKLDAESAIGL LTKGGPGGANPGIIENYGTLDIKGVGAQRTKESKGTDPLEKEMGGVAIKTPKNSSTSQIT VDGKVVKPTVVETSAKEYQDMSLSTIGMYINTSGTKFTRPITGLSALKQLKRADLIIGVE AAQNTTSKTIQVGQKILEPYNKTIKDNPQIEKWNIYSGSLTWMANIAQNQTDGTIQNAYL AKIPYTHWAGNESTPVDKKDAYNFLDGLEQRYGVEEIGTRENQVFQKLNGIGKNEQILFF QAVDEMMGHQYANVQQRVQTTGIILDKEFKY >gi|296154170|gb|ADVK01000040.1| GENE 2 6665 - 7849 1535 394 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|119502908|ref|ZP_01624993.1| Ribosomal protein S19 [marine gamma proteobacterium HTCC2080] # 1 392 1 405 407 595 72 1e-170 MAKEKYERSKPHVNIGTIGHVDHGKTTTTAAISKVLSDKGWASKVDFDQIDAAPEEKERG ITINTAHIEYETEKRHYAHVDCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHI LLSRQVGVPYIVVYLNKSDMVEDEELLELVEMEVRELLTEYGFPGDDIPVIRGSSLGALN GEEKWVEKILELMEAVDNYIPTPERAVDQPFLMPIEDVFTITGRGTVVTGRVERGVIKVG EEIEIVGIKPTTKTTCTGVEMFRKLLDQGQAGDNIGVLLRGTKKEEVERGQVLAKPGSIH PHTNFKGEVYVLTKDEGGRHTPFFTGYRPQFYFRTTDITGAVTLPDGVEMVMPGDNITMT VELIHPIAMEQGLRFAIREGGRTVASGVVSEITK >gi|296154170|gb|ADVK01000040.1| GENE 3 7937 - 10018 3017 693 aa, chain - ## HITS:1 COG:FN1556 KEGG:ns NR:ns ## COG: FN1556 COG0480 # Protein_GI_number: 19704888 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factors (GTPases) # Organism: Fusobacterium nucleatum # 1 693 1 693 693 1358 99.0 0 MARKISLDMTRNVGIMAHIDAGKTTTTERILFYTGVERKLKEVHEGQATMDWMEQEQERG ITITSAATTCFWKGHRINIIDTPGHVDFTVEVERSLRVLDGAVAVFSAVDGVQPQSETVW RQADKYKVPRLAFFNKMDRIGANFDMCVSDIKEKLGSNPVPIQIPIGAEDQFEGVVDLIE MKEIVWPVDSDQGQHFDVKDIRAELKEKAEETRQYMLESIVETDDALMEKFFGGEEITKE EIIKGLRKATIDNTIVPVVCGTAFKNKGIQALLDAIVNYMPAPTDVAMVEGRDPKNPDVL IDREMSDDAPFASLAFKVMTDPFVGRLTFFRVYSGFVEKGATVLNSTKGKKERMGRILQM HANNREEIEHVYCGDIAAAVGLKDTATGDTLCAENAPIVLEQMEFPEPVISVAVEPKTKN DQEKMGIALSKLAEEDPTFKVRTDEETGQTIISGMGELHLEIIVDRMKREFKVESNVGKP QVAYRETITQSCDQEVKYAKQSGGRGQYGHVKIILEPNPGKEFEFVNKITGGVIPREYIP 
AVEKGCKEALESGVIAGYPLVDVKVTLYDGSYHEVDSSEMAFKIAGSMALKQAATKAKPV ILEPVFKVEVTTPEEYMGDIIGDLNSRRGMVSGMIDRNGAKIITAKVPLSEMFGYATDLR SKSQGRATYSWEFSEYLQVPASIQKQIQEERGK >gi|296154170|gb|ADVK01000040.1| GENE 4 10064 - 10534 778 156 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704889|ref|NP_602384.1| 30S ribosomal protein S7 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 156 1 156 156 304 100 2e-82 MSRRRAAVKRDVLPDSRYSDKVVTKVINSIMLDGKKSIAEGIFYSAMDLIKEKTGQEGYD IFKQALENIKPQIEVRSRRIGGATYQVPVEVKADRQQTLAIRWLTTYTRARKEYGMIEKL AAELIAAANNEGATIKKKEDTYKMAEANRAFAHYRV >gi|296154170|gb|ADVK01000040.1| GENE 5 10562 - 10930 627 122 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704890|ref|NP_602385.1| 30S ribosomal protein S12 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 122 1 122 122 246 100 8e-65 MPTLSQLVKKGRQTLTEKKKSPALQGNPQRRGVCIRVYTTTPKKPNSALRKVARVKLTNG IEVTCYIPGEGHNLQEHSIVLVRGGRTKDLPGVRYKIIRGALDTAGVAKRKQGRSKYGAK NA >gi|296154170|gb|ADVK01000040.1| GENE 6 11117 - 11644 572 175 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328710|ref|ZP_06871226.1| ## NR: gi|296328710|ref|ZP_06871226.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 175 1 175 175 278 100.0 8e-74 MDLKEIKNIFESSKYFSKILIEDDFEISGLINLWNENDVDISIEFNPDYTDNIDFYKISL RLIEEKLNWINKNRKLICKTFIEEENMFYGLNEDIEKQLSKKEKAKIGNLEFSAPLTEEE FSNSLYITYINFYVEDEKNINCNFDLDSEPDYLFGHLANIEIDENNDILMSGING >gi|296154170|gb|ADVK01000040.1| GENE 7 11689 - 12132 940 147 aa, chain - ## HITS:1 COG:ECs3659 KEGG:ns NR:ns ## COG: ECs3659 COG1454 # Protein_GI_number: 15832913 # Func_class: C Energy production and conversion # Function: Alcohol dehydrogenase, class IV # Organism: Escherichia coli O157:H7 # 5 147 241 383 383 167 60.0 5e-42 IVDMEGMSIGQYVAGMGFSNVGLGIVHSMAHPLGGVYDIAHGVANALLLPIVMEYNMPVC IDKYGNIAKAMGVDITNMSKEEAAKAAIDAVRQLAIDVNIPQTLRELNIPKEGLPRLAKD ALADVCTGGNPREVTYEDILKLYEIAY Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:49:36 2011 Seq name: gi|296154149|gb|ADVK01000041.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00051, whole genome shotgun sequence Length of sequence - 19236 bp Number of predicted genes - 21, with homology - 20 Number of transcription units - 10, operones - 5 average op.length - 3.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 21/0.000 - CDS 16 - 675 1025 ## COG2057 Acyl CoA:acetate/3-ketoacid CoA transferase, beta subunit 2 1 Op 2 3/0.000 - CDS 675 - 1394 1063 ## COG1788 Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit 3 1 Op 3 1/1.000 - CDS 1457 - 2251 1106 ## COG1024 Enoyl-CoA hydratase/carnithine racemase - Prom 2350 - 2409 11.0 - Term 2303 - 2343 3.0 4 2 Op 1 1/1.000 - CDS 2438 - 3328 1121 ## COG1159 GTPase 5 2 Op 2 1/1.000 - CDS 3332 - 4090 642 ## COG0582 Integrase 6 2 Op 3 17/0.000 - CDS 4092 - 5753 2263 ## COG0497 ATPase involved in DNA repair 7 2 Op 4 1/1.000 - CDS 5747 - 6550 953 ## COG0061 Predicted sugar kinase 8 2 Op 5 1/1.000 - CDS 6567 - 7793 1754 ## COG4942 Membrane-bound metallopeptidase 9 2 Op 6 . - CDS 7768 - 8715 1245 ## COG2177 Cell division protein - Prom 8752 - 8811 11.2 - Term 8756 - 8823 11.9 10 3 Op 1 . 
- CDS 8847 - 9236 610 ## FN0264 hypothetical protein - Prom 9275 - 9334 10.3 11 3 Op 2 . - CDS 9339 - 10022 961 ## COG0760 Parvulin-like peptidyl-prolyl isomerase - Prom 10252 - 10311 9.2 12 4 Op 1 11/0.000 + CDS 10341 - 12572 3552 ## COG1882 Pyruvate-formate lyase + Term 12584 - 12614 3.6 13 4 Op 2 1/1.000 + CDS 12638 - 13369 677 ## COG1180 Pyruvate-formate lyase-activating enzyme + Prom 13409 - 13468 14.2 14 5 Op 1 2/0.000 + CDS 13512 - 13889 527 ## COG0640 Predicted transcriptional regulators 15 5 Op 2 15/0.000 + CDS 13892 - 14113 318 ## COG2608 Copper chaperone 16 5 Op 3 . + CDS 14123 - 15997 2361 ## COG2217 Cation transport ATPase + Term 16029 - 16074 6.1 17 6 Tu 1 . - CDS 16073 - 17584 1681 ## COG1288 Predicted membrane protein - Prom 17619 - 17678 9.9 + Prom 17626 - 17685 8.8 18 7 Tu 1 . + CDS 17780 - 17881 73 ## 19 8 Tu 1 . - CDS 17870 - 18313 299 ## COG3177 Uncharacterized conserved protein - Prom 18441 - 18500 7.0 20 9 Tu 1 . - CDS 18525 - 18668 126 ## gi|296328731|ref|ZP_06871246.1| fic family protein - Prom 18695 - 18754 9.3 - Term 18783 - 18826 10.3 21 10 Tu 1 . 
- CDS 18882 - 19235 568 ## FN0254 hypothetical protein Predicted protein(s) >gi|296154149|gb|ADVK01000041.1| GENE 1 16 - 675 1025 219 aa, chain - ## HITS:1 COG:FN0273 KEGG:ns NR:ns ## COG: FN0273 COG2057 # Protein_GI_number: 19703618 # Func_class: I Lipid transport and metabolism # Function: Acyl CoA:acetate/3-ketoacid CoA transferase, beta subunit # Organism: Fusobacterium nucleatum # 1 219 1 219 219 434 100.0 1e-122 MAELTGRELVASRCAKFFKDGDFVNLGIGLPLMCVNYLPEGIDLWLEAEIGIVGSGPSPK WGEEDIDIIDAGGMPASIIKGGSVCPHTTSFGFIRGGHIDITVLGTLQVDQEGNLANWTI PGKLVPGMGGAMDLCAGVKRIIIATEHCEKSGNSKILKKCTLPLTGAKCVTDIVTERCYF EVTDKGLVLKELAPGYTVEDIKACTEADFILADEIGVMQ >gi|296154149|gb|ADVK01000041.1| GENE 2 675 - 1394 1063 239 aa, chain - ## HITS:1 COG:FN0272 KEGG:ns NR:ns ## COG: FN0272 COG1788 # Protein_GI_number: 19703617 # Func_class: I Lipid transport and metabolism # Function: Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit # Organism: Fusobacterium nucleatum # 1 239 1 239 239 424 99.0 1e-119 MLSKVVTKEEALSKFCDGQTIMFSDWHGEFAADDLIDGVLEKGVKDIKAIAVSAGMPDLG VGKLIEAKRVKSLITTHIAFNPCAKEQMFAGELDVEFSPQGTFSERIRCGGFGLGGCLTP TGLGTEVEEGKQKFNINGKEYLLELPLRADIALIKATKADTAGNLYFRMTSGAIADSMAF AADTVIVEVEELVELGELGPEEIHVPAPIVDMVYVRTGQKRHTCPLWQRLRAKAEGRDK >gi|296154149|gb|ADVK01000041.1| GENE 3 1457 - 2251 1106 264 aa, chain - ## HITS:1 COG:FN0271 KEGG:ns NR:ns ## COG: FN0271 COG1024 # Protein_GI_number: 19703616 # Func_class: I Lipid transport and metabolism # Function: Enoyl-CoA hydratase/carnithine racemase # Organism: Fusobacterium nucleatum # 1 264 1 264 264 500 100.0 1e-141 MKLEKLIYTVENGIAVVTMNYMKNLNAIDEQMADELMYVVDTAEKDPNVKVMVLKGAEKA FSAGGDIGYFYQLIQAGGEVNMDGLIGKVGTVADGMKKMSKIVITSVCGAAAGAGVSLAL GGDFIICSDNAKFILAFVNLGLVPDTGGTYLLSKAIGVPRTMELAATGRPVSAEEAKELG FVYKVVPVEELNDFTMKFAQKIAAGPLISYKNIKKQIYDANFADYKKWLDETEIPTQREC AATMDFQEGCKAFMEKRKAVFKGE >gi|296154149|gb|ADVK01000041.1| GENE 4 2438 - 3328 1121 296 aa, chain - ## HITS:1 COG:FN0270 KEGG:ns NR:ns ## COG: FN0270 COG1159 # Protein_GI_number: 19703615 # 
Func_class: R General function prediction only # Function: GTPase # Organism: Fusobacterium nucleatum # 1 296 1 296 296 508 98.0 1e-144 MKAGFIAVVGRPNVGKSTLINKLVSEKVAIVSDKAGTTRDNIKGILNFKDNQYIFIDTPG IHKPQHLLGEYMTNIAVKILKDVDIILFLIDASKPIGTGDMFVMNRINENSKKPRILLVN KVDLITDEQKEEKIKEIEEKLGKFDKIIFASGMYSFGISQLLEALDPYLEDGVKYYPDDM YTDMSTYRIITEIVREKILLKTRDEIPHSVAIEIINVERKEGKKDKFDINIYVERDSQKG IIIGKNGKMLKEIGVEARKEIEELLGEKIYLGLWVKVKDNWRKKKPFLKELGYVEE >gi|296154149|gb|ADVK01000041.1| GENE 5 3332 - 4090 642 252 aa, chain - ## HITS:1 COG:FN0269 KEGG:ns NR:ns ## COG: FN0269 COG0582 # Protein_GI_number: 19703614 # Func_class: L Replication, recombination and repair # Function: Integrase # Organism: Fusobacterium nucleatum # 12 252 1 241 241 333 93.0 2e-91 MNILEKYIENLVVKKNLLQTTVDAYKFDINEYFEFLKGKNIDILDTDEKIFNEYFSDVEK NYKKNTFSRKYSTIRGLYKFLLKNRYIDKIFEYKLSVTKPDDEITEKKNNIVFKKKEYQD FINSLSDNFNEMRLMLISKMIVEYKINLVNIFEIQIKDLLKYDFQKIIIVRNNKIISYDI DKEMKEELENYYKKYAFEKRFLFGVYSKSTFISDLKRYNLDFKTLKNCMQEDEKDLIENI RKIYFEIGIGDK >gi|296154149|gb|ADVK01000041.1| GENE 6 4092 - 5753 2263 553 aa, chain - ## HITS:1 COG:FN0268 KEGG:ns NR:ns ## COG: FN0268 COG0497 # Protein_GI_number: 19703613 # Func_class: L Replication, recombination and repair # Function: ATPase involved in DNA repair # Organism: Fusobacterium nucleatum # 1 553 6 558 558 819 96.0 0 MLRELKIENLAIIDELDIEFDKGFIVLTGETGAGKSIILSGINLLIGEKASVDMIRDGEE NLVAQGVFDVDEEQKKALEAMGIDTDGDEIIIRRSYSRSGKARAFINNVRISLADLKEIA STLVDIVGQHSHQMLLNKNNHIKLLDSFLNKDEKDLKENLASLLSQYREIDSKIENIERE KKETLEKKEFYEYQLEEIEKLKLKDGEDELLEVEYKRVFNAEKIREKVYESLEYLKDDDD SALSLITNSIRNIEYLGKYDKRYTELAKRMENAYYELEDCANEIEDISKGIDVTENDLDK IASRMNTLKRIKEKYKRTLPELIEYREDLKEKLSDIDSGDFKTKELKKELNKIKDEYDKI AEKLTNSRKEIAVKIENELLNELKFLNMEDAKLKVQINKLEKMTSEGYDDVEFFISTNVG QDLKPLNKIASGGEVSRVMLALKVIFSKVDNIPILIFDEIDTGIGGETVRKIALKLKEIG DNTQIISITHSPVIASKASQQFYIEKYVENFKTISRVKKLSAEERIKEIGRMLVGEKINN DVLEIANKMLNEV >gi|296154149|gb|ADVK01000041.1| GENE 7 5747 - 6550 953 267 aa, chain - ## HITS:1 COG:FN0267 KEGG:ns NR:ns ## COG: 
FN0267 COG0061 # Protein_GI_number: 19703612 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted sugar kinase # Organism: Fusobacterium nucleatum # 1 267 1 267 267 458 98.0 1e-129 MIKLSIIYNEDKEDAIKIYKELLKYLKSKKEFEVLDDKNISQAEYIVVIGGDGTLLRGFK KIKDKKVKIIAINSGTLGYLTEIRKDGYKKIFENILKGKINIEERYFFTVKIGKKKYNAL NEVFLTKDNIKRNIVSSEIYVDDKFLGKFKGDGVIIATPTGSTAYSLSAGGPIVTPELKL FLITPIAPHNLNTRPIILSGDVKIVLTLAGPSEFGIVNVDGHTHNKINLEDEVEISYSKE SLKIVLPDDRNYYNVLREKLKWGENLC >gi|296154149|gb|ADVK01000041.1| GENE 8 6567 - 7793 1754 408 aa, chain - ## HITS:1 COG:FN0266 KEGG:ns NR:ns ## COG: FN0266 COG4942 # Protein_GI_number: 19703611 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Membrane-bound metallopeptidase # Organism: Fusobacterium nucleatum # 6 408 1 403 403 517 100.0 1e-146 MSLKMMKIKTILVFFLLSASIYPASKSVKDMNKRLKNIDKEIEKKNTRIKAIDTETSKLE KMIKELEEEIKKLEHERKEIEDEITVVKKNIDYSRKNLEISEVEHNRKESEFVAKIIAWD KYSKIHRKEIDEKVLLTKNYREMLHGDLQRMGHIEKVTGSIKEVKEKIEAEKRKLDRLEA ELRENLRKSDIKKEEQKKLKEKLQVEKKGHQSSIEKLKKEKQRISKEIERIIRENARRAA EKAAREKAAKEAAKNKGKGSKRSGGTKVTTTTVDMPKISNPEAYKRIGKTIKPLNGQIVV YFGQKKAGVVESNGIEIKGKLGNPVVASKAGTVIYADKFQGLGKVVMIDYGGGIIGVYGN LLAIKVNLNSKVSSGQTIGVLGLSSDKEPNLYYELRANLRPIDPIPTF >gi|296154149|gb|ADVK01000041.1| GENE 9 7768 - 8715 1245 315 aa, chain - ## HITS:1 COG:FN0265 KEGG:ns NR:ns ## COG: FN0265 COG2177 # Protein_GI_number: 19703610 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Cell division protein # Organism: Fusobacterium nucleatum # 8 286 1 279 308 501 100.0 1e-142 MLEGVSVMYKLFGYGLKGIPYINRLKRRVFYAVVITVVALNIFISFSLNLKSLTNEKIFN SFIVADLQNNLNQDKKNEIEKYILGINGVRSVRFMDKFESFKNLQNELNISIPESSNPLT DSLVISVKDPTLLGQIQETIESREEVKEVYKDESYLKQSKEQGFITSIAQIGSGVFSFFI ALITIIIFNFGVAIEFLNNANTGLDYAENIRKSKIRNLLSFTMSTVIGTLIFFNIYVLFR KHVSHANFNSSMLSLKEIILWHFGAIVILNLLVWLIPANVGRIEYAEEEDEDYDEFDDEF YEEDGDYDEFEDDED >gi|296154149|gb|ADVK01000041.1| GENE 10 8847 - 9236 610 129 aa, chain - ## HITS:1 COG:no 
KEGG:FN0264 NR:ns ## KEGG: FN0264 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 129 1 129 129 124 100.0 1e-27 MKKFLLLAVLAVSASAFAANDAASLVGELQALDAEYQNLANQEEARFNEERAQADAARQA LAQNEQVYNELSQRAQRLQAEANTRFYKSQYQDLASKYEDALKKLESEMEQQKAIISDFE KIQALRAGN >gi|296154149|gb|ADVK01000041.1| GENE 11 9339 - 10022 961 227 aa, chain - ## HITS:1 COG:FN0263 KEGG:ns NR:ns ## COG: FN0263 COG0760 # Protein_GI_number: 19703608 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Parvulin-like peptidyl-prolyl isomerase # Organism: Fusobacterium nucleatum # 1 227 5 231 231 315 99.0 4e-86 MEEDKILHGILLKKAKEAQYTNYEIEQLNLQSESLFIRYFLEREAAKIVENTNIEENVLK KIYEENQNLYKFPKKVKIDTIFVKDLEKAEEILKEINLKNFSSLKEKNDEKGQEAKAVTD DFLFVTEIHPALVEEILKEEKKNVILKKAIPVQEGFHIVYLKDIEDERQAIFDEARETIL ADVKRNIFGQVYNQLIEDIANETVKPEEPVKGEKNKENKSTKATSKE >gi|296154149|gb|ADVK01000041.1| GENE 12 10341 - 12572 3552 743 aa, chain + ## HITS:1 COG:FN0262 KEGG:ns NR:ns ## COG: FN0262 COG1882 # Protein_GI_number: 19703607 # Func_class: C Energy production and conversion # Function: Pyruvate-formate lyase # Organism: Fusobacterium nucleatum # 1 743 1 743 743 1520 100.0 0 MEAWRGFKSGEWQNSINVSDFIKHNYTEYLGDESFLEGPTENTKKLWDILSGMLKIEREK GIYDAETKTPSKIDAYGAGYISKDLETIVGLQTDAPLKRAIFPNGGLRMVESSLEAFGYK LDPTTKEIYEKYRKSHNAGVFSAYTPAIKAARHTGVITGLPDAYGRGRIIGDYRRVALYG VDRLIEERKREFDAYDPEEMTEDVIRDREEMFEQLEALKALKRMAAAYGFDIGRPAETAQ EAIQWTYFGYLGAIKDQNGAAMSLGKTAGFLDVYIERDLKEGRITEKQAQEFIDHFIMKL RIVRFLRTPEYDQLFSGDPVWVTESIGGMNNDGKSWVTKNAYRYLNTLYNLGTAPEPNLT ILWSERLPENWKKFCSKVSIDTSSLQYENDDIMRPQFGEDYGIACCVSPMAIGKQMQFFG ARANLPKALLYAINGGKDELKKDQVTPAGQFERITSDYLDFDEVWEKYDKMLTWLASTYI KALNIIHYMHDKYSYEALEMALHSLDIKRTEACGIAGLSIVADSLAAIKYGKVRVIRDEA GDAVDYVVEQPYVPFGNNDDRTDELAVKVVRTFMNKIRSHKMYRDAEPTQSVLTITSNVV YGKKTGNTPDGRRAGAPFGPGANPMHGRDTKGAVASLASVAKLPFEDANDGISYTFAITP ETLGKTDDEKKNNLVGLLDGYFKQTGHHLNVNVFGRELLEDAMEHPENYPQLTIRVSGYA VNFIKLTREQQLDVINRTISNKM >gi|296154149|gb|ADVK01000041.1| GENE 13 
12638 - 13369 677 243 aa, chain + ## HITS:1 COG:FN0261 KEGG:ns NR:ns ## COG: FN0261 COG1180 # Protein_GI_number: 19703606 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Pyruvate-formate lyase-activating enzyme # Organism: Fusobacterium nucleatum # 1 243 1 243 243 494 100.0 1e-140 MQGYINSFESFGTKDGPGIRFVVFMQGCPLRCLYCHNVDTWELKDKNYIYTPEEVLAELN KVKAFLSGGITISGGEPLLQSSFVLEVFKLCKENGIHTALDTSGYIFNEQAKKVLEYTDL VLLDIKHIDKDMYKKLTSVDLEPTLNFIKYLQEINKPTWIRYVLVPGYTDDIKDLNDWAK FVSQFDIVKRVDILPFHQMAIYKWEKTNREYKLKDTPTPNKEQIQRAEEIFRKYNLPLYK ERS >gi|296154149|gb|ADVK01000041.1| GENE 14 13512 - 13889 527 125 aa, chain + ## HITS:1 COG:FN0260 KEGG:ns NR:ns ## COG: FN0260 COG0640 # Protein_GI_number: 19703605 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 125 1 125 125 230 100.0 4e-61 MKTTKTVNSCDCDSVNQELVDKVKKKFPEDEILGDLSDFFKVIGDGTRIRILWALDVSEM CVCDIANVLNMTKSAVSHQLRALRDADLVKFRKSGKEVLYSLSDNHVKEIFEQGLIHIQE DKGDE >gi|296154149|gb|ADVK01000041.1| GENE 15 13892 - 14113 318 73 aa, chain + ## HITS:1 COG:FN0259 KEGG:ns NR:ns ## COG: FN0259 COG2608 # Protein_GI_number: 19703604 # Func_class: P Inorganic ion transport and metabolism # Function: Copper chaperone # Organism: Fusobacterium nucleatum # 1 73 1 73 73 107 100.0 4e-24 MKKVFKLEGLNCAHCAAKIEEKVGKLEGVKSVVINFMTTKMTLESIDDNIEEIIENVKKL INEVEPDVNMVKA >gi|296154149|gb|ADVK01000041.1| GENE 16 14123 - 15997 2361 624 aa, chain + ## HITS:1 COG:FN0258 KEGG:ns NR:ns ## COG: FN0258 COG2217 # Protein_GI_number: 19703603 # Func_class: P Inorganic ion transport and metabolism # Function: Cation transport ATPase # Organism: Fusobacterium nucleatum # 18 620 12 614 614 1030 99.0 0 MIGWCKMKKKKEVIIIISAILFAVALFVRMTQTLQLILMLVAYILLGKDTVLKAVKNVEK GDFFDENFLMTIATLGAIIIGEYPEAVAVMLFYEVGELFQGYAINKSRKSIADMMNIKPE YANVIRDNKSEKVDPDEVQINEIIEIKPGERVPLDAIIIKGESTLDTSALTGESLPVEVR EGATILSGCININALIIAKVTKEYFDSTVNKVLDLVENAAAKKSTSERLITRFAKIYTPI 
VISLAVFLAILPPIISGEYNFRVWIFRALSFLVVSCPCAFVISVPLSFFSGIGAASRAGV LIKGGNYLETLSKVDTVVFDKTGTLTKGVFNVQKVVVVDKNIQEDEFISLVTMAESGSNH PISKSIQKYYNKEIDKSSINSIKEISGKGIEAIVNNMKILVGNKKLVSVPNDLIIDDIGT ILYVEIANKFTGYIVISDEIKKDAEKAIKSLKDIGIKKSIMLTGDVEKVAKKVGEDLRLD EIYSNLLPQDKVSKFEEIIKNKNSKGNVVFVGDGINDAPVLARADVGIAMGAMGSDAAIE AADVVIMTDEPSKIVTAIKSSKKTMKIAMQNIILAFGVKAIALILSALGIADMWMAVFAD TGVTILAVLNSFRALKIENQQAII >gi|296154149|gb|ADVK01000041.1| GENE 17 16073 - 17584 1681 503 aa, chain - ## HITS:1 COG:FN0257 KEGG:ns NR:ns ## COG: FN0257 COG1288 # Protein_GI_number: 19703602 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 503 1 503 503 816 99.0 0 MKRKNFEFPTAYTVLFLILILVTVLTHVIPAGKYNRLFYQENTNEFVVETYGNGNINLEA TQKNLDKLDIKIDVNKFIDGTIKKPMAIPNTYVKLDGKAQGLEELISAPISGIAESIDII IFVLVLGGIVGIVNKTGTFNIAMKAISQKTKGKEFSLVIISFIFFAAGGTIFGFWEETIP FYSILIPLFLINNFDPLVPMATIFLGSAIGCMFSTVNPFSTIIASNAAGISFNEGLKFRF VALIIFSILSLLYIYKYIKKVKKDSTNSFVIEEQEEIREKFLKDYNQETNVKFDWRKKII LFLFIFQFVIMIWGVSLLGWWFQETAAMFFGVAIIIMLLSGLSEKEAVNGFISGASEVVG VTLIIGLARAINIIMENGMISDTLLFYSSNVVAEMGKGLFSIVMLLIFAFLGIFIPSTSG LAVLSMPILAPLADTVGLSRAIVVDAFTWGQGIILFITPTGLIFVVLQIVGIPYNKWLKF VMPLILIITILIIIILYVRSVFF >gi|296154149|gb|ADVK01000041.1| GENE 18 17780 - 17881 73 33 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MVNSSRIPIFSKRFNILLTVRKLKLVLETISDA >gi|296154149|gb|ADVK01000041.1| GENE 19 17870 - 18313 299 147 aa, chain - ## HITS:1 COG:SMa2105 KEGG:ns NR:ns ## COG: SMa2105 COG3177 # Protein_GI_number: 16263601 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Sinorhizobium meliloti # 1 144 200 343 406 109 37.0 2e-24 MSDLEKFIHNNDIKILHLLKIAILHYQFETIHPFSDGNGRVGRLMIPLYLLDKKILNKPC FYILDYFEKNRTEYYNSLTRVRENNDMISWIKFFLKGVIITAQIAKKKFQKVVMTVKNYE EKVSTLSGNWGNTLKVLQSFYDNPLSI >gi|296154149|gb|ADVK01000041.1| GENE 20 18525 - 18668 126 47 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328731|ref|ZP_06871246.1| ## NR: gi|296328731|ref|ZP_06871246.1| fic family protein 
[Fusobacterium nucleatum subsp. nucleatum ATCC 23726] fic family protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 47 1 47 47 77 100.0 3e-13 MIYLDKFKSGNKIKQELYYSFVPNFINDFWKWDNSDINILLESPKLY >gi|296154149|gb|ADVK01000041.1| GENE 21 18882 - 19235 568 117 aa, chain - ## HITS:1 COG:no KEGG:FN0254 NR:ns ## KEGG: FN0254 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 117 1561 1677 1677 216 99.0 4e-55 GEVKLEVKQNQYFSVKPEIGAELGFKHYFGMKALKTTLGVAYENELGRVANGKNKARVAD TSADWFNIRGEKEDRKGNVKFDLNVGIDNTRVGVTANAGYDTKGHNLRGGLGLRVIF Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:50:15 2011 Seq name: gi|296154068|gb|ADVK01000042.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00052, whole genome shotgun sequence Length of sequence - 83513 bp Number of predicted genes - 86, with homology - 79 Number of transcription units - 36, operones - 17 average op.length - 3.9 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 3 - 456 443 ## COG2831 Hemolysin activation/secretion protein - Prom 510 - 569 12.3 - Term 579 - 625 2.2 2 2 Op 1 1/0.833 - CDS 825 - 1376 779 ## COG0302 GTP cyclohydrolase I 3 2 Op 2 19/0.000 - CDS 1388 - 3448 2772 ## COG0751 Glycyl-tRNA synthetase, beta subunit - Prom 3486 - 3545 11.4 - Term 3574 - 3630 -0.3 4 3 Op 1 1/0.833 - CDS 3712 - 4584 1172 ## COG0752 Glycyl-tRNA synthetase, alpha subunit 5 3 Op 2 16/0.000 - CDS 4588 - 5031 466 ## COG0597 Lipoprotein signal peptidase 6 3 Op 3 1/0.833 - CDS 5055 - 7856 4428 ## COG0060 Isoleucyl-tRNA synthetase 7 3 Op 4 1/0.833 - CDS 7853 - 10270 2467 ## COG0642 Signal transduction histidine kinase 8 3 Op 5 . - CDS 10289 - 12457 1708 ## PROTEIN SUPPORTED gi|51894064|ref|YP_076755.1| ribosomal protein S1-like protein - Prom 12490 - 12549 8.7 9 4 Tu 1 . - CDS 12551 - 12910 331 ## FN0064 putative cytoplasmic protein - Prom 13015 - 13074 7.5 - Term 12997 - 13056 7.1 10 5 Op 1 . 
- CDS 13079 - 13429 722 ## FN0063 hypothetical protein - Term 13442 - 13497 6.1 11 5 Op 2 . - CDS 13499 - 13789 595 ## FN0062 putative cytoplasmic protein 12 5 Op 3 . - CDS 13815 - 13895 83 ## 13 5 Op 4 1/0.833 - CDS 13885 - 15375 2076 ## COG2317 Zn-dependent carboxypeptidase - Prom 15409 - 15468 17.3 - Term 15439 - 15492 8.1 14 6 Tu 1 1/0.833 - CDS 15514 - 16785 1828 ## COG1686 D-alanyl-D-alanine carboxypeptidase - Term 16793 - 16848 13.2 15 7 Op 1 20/0.000 - CDS 16855 - 17232 711 ## COG0822 NifU homolog involved in Fe-S cluster formation 16 7 Op 2 1/0.833 - CDS 17294 - 18487 1678 ## COG1104 Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes - Prom 18513 - 18572 5.0 17 8 Op 1 . - CDS 18574 - 19218 857 ## COG0177 Predicted EndoIII-related endonuclease 18 8 Op 2 . - CDS 19223 - 19702 414 ## FN0056 acetyltransferase (EC:2.3.1.-) 19 8 Op 3 1/0.833 - CDS 19730 - 20233 273 ## PROTEIN SUPPORTED gi|228000081|ref|ZP_04047083.1| acetyltransferase, ribosomal protein N-acetylase - Prom 20265 - 20324 9.8 - Term 20296 - 20342 7.5 20 9 Tu 1 . - CDS 20344 - 21552 880 ## PROTEIN SUPPORTED gi|163739624|ref|ZP_02147033.1| 50S ribosomal protein L32 - Prom 21590 - 21649 5.6 21 10 Op 1 . - CDS 21786 - 23060 1892 ## COG1114 Branched-chain amino acid permeases - Prom 23110 - 23169 7.7 22 10 Op 2 . - CDS 23189 - 23551 400 ## COG1393 Arsenate reductase and related proteins, glutaredoxin family - Prom 23583 - 23642 11.1 - Term 23611 - 23661 8.5 23 11 Op 1 . - CDS 23681 - 25402 2927 ## COG1053 Succinate dehydrogenase/fumarate reductase, flavoprotein subunit - Prom 25436 - 25495 7.5 - Term 25486 - 25535 6.8 24 11 Op 2 . - CDS 25561 - 26085 851 ## FN0049 hypothetical protein - Prom 26280 - 26339 12.3 + Prom 26237 - 26296 10.0 25 12 Tu 1 . + CDS 26407 - 27165 1102 ## COG0647 Predicted sugar phosphatases of the HAD superfamily + Term 27170 - 27220 3.0 - Term 27159 - 27206 4.6 26 13 Tu 1 . 
- CDS 27212 - 27973 1054 ## COG0708 Exonuclease III - Prom 28027 - 28086 7.2 27 14 Op 1 4/0.000 - CDS 28215 - 28658 552 ## COG0757 3-dehydroquinate dehydratase II 28 14 Op 2 1/0.833 - CDS 28639 - 29442 958 ## COG0169 Shikimate 5-dehydrogenase 29 14 Op 3 1/0.833 - CDS 29439 - 29696 238 ## COG1605 Chorismate mutase 30 14 Op 4 . - CDS 29662 - 30207 534 ## COG2849 Uncharacterized protein conserved in bacteria - Prom 30374 - 30433 11.9 + Prom 30241 - 30300 17.8 31 15 Op 1 1/0.833 + CDS 30321 - 31574 997 ## COG0772 Bacterial cell division membrane protein 32 15 Op 2 1/0.833 + CDS 31643 - 31831 381 ## COG4224 Uncharacterized protein conserved in bacteria 33 15 Op 3 1/0.833 + CDS 31841 - 33226 2040 ## COG0017 Aspartyl/asparaginyl-tRNA synthetases 34 15 Op 4 . + CDS 33238 - 33789 759 ## COG1658 Small primase-like proteins (Toprim domain) + Term 33791 - 33841 11.2 + Prom 33812 - 33871 6.5 35 16 Tu 1 . + CDS 33966 - 34253 486 ## FN0038 hypothetical protein + Term 34281 - 34320 4.5 + Prom 34313 - 34372 7.4 36 17 Op 1 . + CDS 34401 - 34853 492 ## FN0037 hypothetical protein 37 17 Op 2 . + CDS 34853 - 35485 747 ## COG2323 Predicted membrane protein 38 17 Op 3 . + CDS 35549 - 35641 79 ## + Prom 35675 - 35734 11.2 39 18 Tu 1 . + CDS 35773 - 36447 764 ## FN0034 hypothetical protein + Term 36455 - 36507 12.6 + Prom 36480 - 36539 6.0 40 19 Tu 1 . + CDS 36590 - 41752 5878 ## FN0033 hypothetical protein + Prom 41923 - 41982 17.3 41 20 Op 1 . + CDS 42004 - 42075 77 ## 42 20 Op 2 . + CDS 42121 - 43110 747 ## COG4927 Predicted choloylglycine hydrolase 43 20 Op 3 . + CDS 43158 - 44000 841 ## FN0031 hypothetical protein 44 20 Op 4 . + CDS 43975 - 44058 148 ## 45 20 Op 5 . + CDS 44033 - 44155 124 ## - Term 44177 - 44227 1.4 46 21 Tu 1 . - CDS 44253 - 44729 691 ## COG3467 Predicted flavin-nucleotide-binding protein - Prom 44752 - 44811 9.5 + Prom 44780 - 44839 8.2 47 22 Tu 1 . 
+ CDS 44872 - 45303 648 ## COG0716 Flavodoxins + Term 45307 - 45342 6.0 - Term 45295 - 45330 6.0 48 23 Op 1 1/0.833 - CDS 45345 - 45812 821 ## COG2849 Uncharacterized protein conserved in bacteria 49 23 Op 2 1/0.833 - CDS 45851 - 46351 592 ## COG2849 Uncharacterized protein conserved in bacteria 50 23 Op 3 1/0.833 - CDS 46360 - 47085 1117 ## COG2849 Uncharacterized protein conserved in bacteria - Prom 47110 - 47169 15.3 - Term 47309 - 47358 4.0 51 24 Tu 1 . - CDS 47398 - 48897 2062 ## COG1288 Predicted membrane protein - Prom 48927 - 48986 13.7 - Term 48958 - 49003 5.1 52 25 Op 1 1/0.833 - CDS 49037 - 49318 602 ## COG2088 Uncharacterized protein, involved in the regulation of septum location 53 25 Op 2 1/0.833 - CDS 49334 - 50206 1046 ## COG1947 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 54 25 Op 3 3/0.000 - CDS 50199 - 50498 356 ## COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) - Prom 50530 - 50589 5.6 55 25 Op 4 . - CDS 50614 - 53559 3088 ## COG1197 Transcription-repair coupling factor (superfamily II helicase) - Prom 53580 - 53639 13.9 + Prom 53448 - 53507 14.3 56 26 Tu 1 . + CDS 53695 - 54180 676 ## FN0018 hypothetical protein + Term 54230 - 54280 4.2 - Term 54175 - 54224 8.1 57 27 Op 1 . - CDS 54231 - 55493 1409 ## COG3177 Uncharacterized conserved protein 58 27 Op 2 . - CDS 55572 - 55661 61 ## 59 27 Op 3 . - CDS 55621 - 56493 816 ## FN0016 hypothetical protein - Prom 56521 - 56580 11.1 + Prom 56594 - 56653 10.0 60 28 Tu 1 . + CDS 56673 - 57221 706 ## FN0015 hypothetical protein + Term 57243 - 57295 11.2 - Term 57082 - 57118 -0.6 61 29 Tu 1 . - CDS 57218 - 57907 764 ## COG1811 Uncharacterized membrane protein, possible Na+ channel or pump - Prom 57959 - 58018 8.7 + Prom 57956 - 58015 8.7 62 30 Tu 1 . 
+ CDS 58061 - 58432 407 ## COG0563 Adenylate kinase and related kinases - Term 58424 - 58455 -0.6 63 31 Op 1 1/0.833 - CDS 58459 - 58968 641 ## COG1827 Predicted small molecule binding protein (contains 3H domain) 64 31 Op 2 13/0.000 - CDS 58961 - 59821 506 ## PROTEIN SUPPORTED gi|163755345|ref|ZP_02162465.1| 30S ribosomal protein S6 65 31 Op 3 10/0.000 - CDS 59784 - 61091 1632 ## COG0029 Aspartate oxidase 66 31 Op 4 1/0.833 - CDS 61093 - 61989 1092 ## COG0379 Quinolinate synthase - Prom 62054 - 62113 10.2 - Term 62119 - 62149 -0.5 67 32 Op 1 11/0.000 - CDS 62175 - 64061 2511 ## COG0445 NAD/FAD-utilizing enzyme apparently involved in cell division - Prom 64093 - 64152 7.3 68 32 Op 2 4/0.000 - CDS 64282 - 65649 1825 ## COG0486 Predicted GTPase 69 32 Op 3 16/0.000 - CDS 65655 - 66398 1172 ## COG1847 Predicted RNA-binding protein 70 32 Op 4 18/0.000 - CDS 66400 - 67017 736 ## COG0706 Preprotein translocase subunit YidC 71 32 Op 5 16/0.000 - CDS 67014 - 67262 97 ## COG0759 Uncharacterized conserved protein 72 32 Op 6 . - CDS 67271 - 67651 377 ## COG0594 RNase P protein component 73 32 Op 7 . - CDS 67658 - 67792 224 ## PROTEIN SUPPORTED gi|197735492|ref|YP_002164270.1| hypothetical protein FNP_0004 + Prom 68126 - 68185 13.8 74 33 Tu 1 . + CDS 68217 - 68321 69 ## + Prom 68341 - 68400 14.0 75 34 Op 1 . + CDS 68426 - 70339 2242 ## FN0001 chromosomal replication initiator protein DnaA 76 34 Op 2 9/0.000 + CDS 70389 - 70604 325 ## COG2501 Uncharacterized conserved protein + Term 70606 - 70660 -0.8 77 34 Op 3 . + CDS 70679 - 71728 693 ## COG1195 Recombinational DNA repair ATPase (RecF pathway) 78 34 Op 4 . 
+ CDS 71706 - 71978 246 ## FN2127 hypothetical protein 79 34 Op 5 24/0.000 + CDS 71991 - 73898 2840 ## COG0187 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit + Prom 73965 - 74024 8.9 80 34 Op 6 1/0.833 + CDS 74044 - 76479 3319 ## COG0188 Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 81 34 Op 7 1/0.833 + CDS 76492 - 76953 445 ## COG0622 Predicted phosphoesterase 82 34 Op 8 40/0.000 + CDS 76980 - 77996 1454 ## COG0016 Phenylalanyl-tRNA synthetase alpha subunit 83 34 Op 9 3/0.000 + CDS 78010 - 80406 3350 ## COG0072 Phenylalanyl-tRNA synthetase beta subunit 84 34 Op 10 1/0.833 + CDS 80437 - 81168 1167 ## COG2849 Uncharacterized protein conserved in bacteria + Term 81189 - 81249 10.5 + Prom 81193 - 81252 5.5 85 35 Tu 1 . + CDS 81274 - 81843 835 ## COG2849 Uncharacterized protein conserved in bacteria + Term 81858 - 81895 3.2 + Prom 81883 - 81942 11.2 86 36 Tu 1 . + CDS 82035 - 83511 1992 ## COG3210 Large exoproteins involved in heme utilization or adhesion Predicted protein(s) >gi|296154068|gb|ADVK01000042.1| GENE 1 3 - 456 443 151 aa, chain - ## HITS:1 COG:FN0293 KEGG:ns NR:ns ## COG: FN0293 COG2831 # Protein_GI_number: 19703638 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Hemolysin activation/secretion protein # Organism: Fusobacterium nucleatum # 19 151 1 133 178 181 82.0 5e-46 MLRKIILIFSLMFSFVYAVDEIDIEKRRQEQQNFDDLIRNQNFDVQKGLENEGQKNLILN VSSIDLDGNTVFEDFQIDIILRKYIGDNKDIYALISELENKYIEKGYVTTKVGLNTEKSD FENGKISLFVLEGKIDKVFYDGKENKFKTFI >gi|296154068|gb|ADVK01000042.1| GENE 2 825 - 1376 779 183 aa, chain - ## HITS:1 COG:FN0071 KEGG:ns NR:ns ## COG: FN0071 COG0302 # Protein_GI_number: 19703423 # Func_class: H Coenzyme transport and metabolism # Function: GTP cyclohydrolase I # Organism: Fusobacterium nucleatum # 1 183 5 187 187 341 98.0 5e-94 MDSKRIENAFFEVIEALGDVEYKEELKDTPKRIADSYKEIFYGIDIDPKEVLTKTFEVNS NELIMEKNMDFYSMCEHHFLPFFGTVCIAYIPNKKVFGFGDILKLIEILSRRPQLQERLT 
EEIAKYIYEILNCQGVYVVVEAKHLCVTMRGQKKENTKILTTSAKGVFETDSNKKLEVLT LLK >gi|296154068|gb|ADVK01000042.1| GENE 3 1388 - 3448 2772 686 aa, chain - ## HITS:1 COG:FN0070 KEGG:ns NR:ns ## COG: FN0070 COG0751 # Protein_GI_number: 19703422 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Glycyl-tRNA synthetase, beta subunit # Organism: Fusobacterium nucleatum # 1 686 1 686 686 1216 99.0 0 MELLFEIGMEEIPARFLNQALEDLKSNFEKKLKNNRIKFDGVRTYGTPRRLVLVADEVAE MQEDLDELSIGPSRERAYKDGALTKAGEGFLKAHRIEEEQIEIIKNDKGEYIAFKRFLKG KPTEAILPEILKALVLEETFQKSMRWSTKTIRFARPIEWFLALYGGKVIDFEIEGIKSSN KSKGHRFFGKEFEVSSVENYLKKIRENNVIIDISERRKMIEEMINNSLLEDEKADVDKAL LDEVTNLVEHPFAIVGTFSEEFLEVPQEVLIISMKVHQRYFPILNKKGKLLPKFIVIRNG IDFSENVKKGNEKVLSARLADARFFYYEDLKIPLDNNVEKLKTVVFQKDLGTIYNKVKRC EKIAEFLVGKLKYNYMKEDILRTVKLAKADLVSNMINEKEFTKLQGFMGENYALKAGEEI GVALGIKEHYYPRFQGDLLPSGIEGIIAGISDRIDTLVGCFGVGVIPTGSKDPFALRRTA LGIVNIIINANLDISLKDLVKVSLDALEEDKVLKTDRAKVETDVLDFLKQRIINVFTDMK YRKDAILAVLDKDSDNITTALEIVKVITEKLSKDKMQALLQVVKRVSNIMKGNKDVTIKE KLFKTDIEKTLYAESKKVESEVEQAIKEKEYADYFEKLFTLVPTIDKYFETVIVMDEDKN VRDNRIGQLTYIMNLFEKIAYLNKLD >gi|296154068|gb|ADVK01000042.1| GENE 4 3712 - 4584 1172 290 aa, chain - ## HITS:1 COG:FN0069 KEGG:ns NR:ns ## COG: FN0069 COG0752 # Protein_GI_number: 19703421 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Glycyl-tRNA synthetase, alpha subunit # Organism: Fusobacterium nucleatum # 1 290 1 290 290 599 100.0 1e-171 MTFQEIIFSLQQFWSSKGCIIGNPYDIEKGAGTFNPNTFLMSLGPEPWNVAYVEPSRRPK DGRYGDNPNRVYQHHQFQVIMKPSPTNIQELYLESLRVLGIEPEKHDIRFVEDDWESPTL GAWGLGWEVWLDGMEITQFTYFQQVGGLELEVIPVEITYGLERLALYIQNKENVYDLEWT KGVKYGDMRYQFEFENSKYSFELATLDKHFKWFDEYEEEAKKILDQGLVLPAYDYVLKCS HAFNVLDSRGAISTTERMGYILRVRNLARRCAEVFVENRKALGYPLLNKK >gi|296154068|gb|ADVK01000042.1| GENE 5 4588 - 5031 466 147 aa, chain - ## HITS:1 COG:FN0068 KEGG:ns NR:ns ## COG: FN0068 COG0597 # Protein_GI_number: 19703420 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, 
and vesicular transport # Function: Lipoprotein signal peptidase # Organism: Fusobacterium nucleatum # 9 147 27 165 165 231 99.0 4e-61 MFLILLIIDQYSKFIVDSTLSVGETVPVIDGFFNLTYVQNRGVAFGLFQGKIDIVSILAI IAIGLILFYFCKNFKKISFLERIAYTMIFSGAIGNMIDRLFRAYVVDMLDFRGIWSFIFN FADVWINIGVVLIIVEHIFFNRKKRVK >gi|296154068|gb|ADVK01000042.1| GENE 6 5055 - 7856 4428 933 aa, chain - ## HITS:1 COG:FN0067 KEGG:ns NR:ns ## COG: FN0067 COG0060 # Protein_GI_number: 19703419 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Isoleucyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 933 1 933 933 1910 100.0 0 MSDKEYTSTLHLPKTDFQMKANLPNKEPKYIKKWTEEKIYEKGLEKNKNGETFILHDGPP YANGNTHIGHALNKILKDIILKYKTFRGFRSPYVPGWDTHGLPIELQVVKEVGVGKAREM SALEIRKLCEKYAKKWVGIQKEQFIRLGVLGDWDNPYLTLDPRFEAKQLELFGEIYVNGY IFKGLKPVYWSPATETALAEAEIEYYDHVSPSIYVRMQANKDLLDKIGFNEDVYVLIWTT TPWTLPANVAICLNENFDYGLYKTEKGNLILAKDLAESAFKDIGIENFELIKEFKGKDLE YTTYKHPFLERTGLIILGDHVTADAGTGAVHTAPGHGQDDYVVGLAYKLPVISPIDHRGC LTEEAGDLFKGLVYSEANKAIIKHLTETGHILKMQEISHSYPHDWRSKTPVIFRATEQWF IRMEGGDLREKTLKVIDKINFIPSWGKNRIGSMMETRPDWCISRQRVWGVPIPIFYNDET NEEIFHKEILDRICDLVREHGSNIWVEKSPEELIGEELLVKYNLKGLKLRKETNIMDVWF DSGSSHRGVLEVWEGLHRPCDLYLEGSDQHRGWFHTSLLTSVASTGDSPYKSVLTHGFVN DGEGKKMSKSLGNTVSPEDVIKVYGADILRLWCGSVDYRDDVRISDNIVKQMSEAYRRIR NTARYILGNSYDFNPKTDKVAYKDMLEIDKWALNKLEVLKRSVTESYDKYEFYNLFQGIH YFAAIDMSAFYLDIIKDRLYTEKKDSVARRAAQTVMYEVLMTLTKMVAPILSFTAEEIWE SLPAETRESESIFLADWYVNNDEYLKPELDEKWQQIIKLRKEVNKKLEKARQGENKIIGN SLDAKVSLYTEDNTLKEFIKENLELLKIVFIVSDLEVVDSADGNYTDAEEIEKLKIKIAH ADGEKCERCWKYDELGTDSEHPTLCPRCAAVLK >gi|296154068|gb|ADVK01000042.1| GENE 7 7853 - 10270 2467 805 aa, chain - ## HITS:1 COG:FN0066 KEGG:ns NR:ns ## COG: FN0066 COG0642 # Protein_GI_number: 19703418 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Fusobacterium nucleatum # 69 805 1 737 737 1253 99.0 0 MFIKKDSLLLRIISYNGIAIVIVASIMASLFGIMIFNELNMRLLDKSRERTLLVNKAYLF 
FIDRSREQLYDASNDAVNLVLVDSNDKLIQNRLASAVRNQLSTESYGLYGKSFIQIVSPN RIILGESGDRDIKYDLYKSNNIIPSKEFLEYGKAEYVSTKDALYVRIVQPYRLYKSTERN YIVLTFPLNNYSLSEIKEYAYLTKDDKVFILSKGGYLYGELSLDKVDDFFKNFKFNKVGR ELSDNKYYFSEKKIGDDYYYLGMLALKNNNSDDYVGDIGVAISKNDFVVIKYMLATIILV VGILAVVISTALCARIFAKLLSPLNALADKTEKIGVNNEKDERGIDFKEENIFEIRSISN SLKFMTERIEENEKLLKQNNNKLNTNLNRLVAVEKLLMGIDLVGSLSEGVNEVLRALTSE VGLGYSRAIYLEYNEEKDELSVKNYAINPHILANTEKYTEGINGFTFQINNIGEMMPLLN IKYEPGGIFWESMESGKIIYHNDKGFKYTYGNKLFKTLGLKNFMILPIADEDIKIGCILV DYFGKDNLISEEEVEANNLLLMNLLIRIKNAMTEESKLMKERYLTMSKVSNKFIKNNKKL INYIETFIENLINNGYNNKDIDKIKRYLRDEKKKNIVIKDSLDSSKNNFEVFNFEKLVEK IVKNSQKILKKYGVNTSLFIDFSGNMYGDKKKIYQMFIQILRNSINAILTRNKLDKKINI VVVGDKNHRIILEIIDNGVGMTQEEVKAVMRPYSDTKGNSIMGTGLITIYKIVKEHNGLM TISSELDVGTKIRIIFNEYREETNQ >gi|296154068|gb|ADVK01000042.1| GENE 8 10289 - 12457 1708 722 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|51894064|ref|YP_076755.1| ribosomal protein S1-like protein [Symbiobacterium thermophilum IAM 14863] # 2 719 4 720 764 662 49 0.0 MEKIYKIVAEELKIPVDKVENTIKLLDDGATIPFVARYRKEITGNLDEVQIGDILQKVEY LRNLEERKEEVIRLIEEQGKLTEELRNSIIEAKILQEVEDIYFPYRKKKKTKADIAKERG LEPLAEKFYTANNLEEIQSLAKDFITEEVPTIEDAIEGAMLIIAQNISEKAEYRERIREI YLKYSIIESKASKKAAELDEKKVYTDYYEYVEKVEKMPSHRILALNRGEKEDILTVHLRL EDSDRERIENMILREFPKNDLASTYKEIIKDSLDRLIVPSIEREVRNALTERAEIESIAV FKDNLKNLLLQAPLKEKNILALDPGYRTGCKVAVIDKYGFYRENTVFFLVEAMHNPRQIQ DAREKFLKLVKKYDIDIVSIGNGTASRETETFVANIIREEKLNVKYLIVNEAGASVYSAS KIAAEEFPDLDVTVRGAISIGRRIQDPLAELVKIDPKSIGVGMYQHDVNQSKLDESLDNV ISHVVNNVGANINTASWALLSHISGIKKTVAKNIVNYRKENGNFKNRKQILKVKGVGPKA YEQMAGFLVIPEGENILDNTVIHPESYGIAEAILGKIGFDLEKYNNELDVARERLKSFDY KKFAKENEFGLETVKDVYEALLKDRRDPRDDFEKPLLKSDILNIDNLEVGMELEGTVRNV VKFGAFIDIGLKNDALLHISEISDKYIDDPSKVLSVGQIIKVKIKDVDKDRGRVGLTRKG QN >gi|296154068|gb|ADVK01000042.1| GENE 9 12551 - 12910 331 119 aa, chain - ## HITS:1 COG:no KEGG:FN0064 NR:ns ## KEGG: FN0064 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 3 119 1 117 117 219 98.0 4e-56 
MELTKLLVKDLMNGKFELFSNYIYKTKEYLIKVPKGFVTDYASIPKLLRIMVLPYGKHSD ASVVHDWLYSSNCNLEISREKADKIFLEVLKEEKVNFFLRTLMYIAVRKFGGSRFRNGV >gi|296154068|gb|ADVK01000042.1| GENE 10 13079 - 13429 722 116 aa, chain - ## HITS:1 COG:no KEGG:FN0063 NR:ns ## KEGG: FN0063 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 116 1 110 110 83 98.0 2e-15 MKKMLMMVALILGVSAMAEDATTDAMATMKKEAKKVEEKMVDVKEDVKVETKKAEEKAVD VKKDAEEKVEAAKDDVKKSTNNVKKNVKKAGKKVKKIETKVEEKVEAVTDKVEAKN >gi|296154068|gb|ADVK01000042.1| GENE 11 13499 - 13789 595 96 aa, chain - ## HITS:1 COG:no KEGG:FN0062 NR:ns ## KEGG: FN0062 # Name: not_defined # Def: putative cytoplasmic protein # Organism: F.nucleatum # Pathway: not_defined # 1 96 1 96 96 97 100.0 1e-19 MGAFDDLTGKAEELVGAVTDKAKELKDETVAKAEELKDKTVEKAEELKNKVVDKAKELKE GAEGKASELKDKAAEKAEELKDKITDGADSLINKLK >gi|296154068|gb|ADVK01000042.1| GENE 12 13815 - 13895 83 26 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKYRLTFKKFYCTIIVIKKYKCLGIE >gi|296154068|gb|ADVK01000042.1| GENE 13 13885 - 15375 2076 496 aa, chain - ## HITS:1 COG:FN0061 KEGG:ns NR:ns ## COG: FN0061 COG2317 # Protein_GI_number: 19703413 # Func_class: E Amino acid transport and metabolism # Function: Zn-dependent carboxypeptidase # Organism: Fusobacterium nucleatum # 1 496 1 496 496 880 99.0 0 MKEKFRELVKKKNRIYANLELLHWDLETKTPVKSKPYLSDLVAELSMKEYELFTSDEFVN LVETLNKEKENLSEIEKKEIELSMEDIEKMKKIPADEYEAYAKLTSINQGIWEEAKSKKD FSIVKANLEKIFNYNKKFAEYRRKNEKNLYDVLLNDYEKGMDTQKLDIFFNELKKEIVPF LRKIQEKKKSLKEEDKINVPVDEDIQFKFAKYLADYVGFDFEKGVVETSEHPFTLNLNKN DVRLTTNNKRNIPFSTVFSIIHEAGHGIYEQQTGDELIDTLLGTGGTMGLHESQSRFMEN IVGRNEAFWKPLYKKAQDFYSFLKDITFEEFSKQINQIEPSLIRVEADELTYSLHIMVRY EIEKMIFSGEVSIDDLPKIWNQKMVEYLGIEPKNDSEGLMQDVHWYCGLVGYFPSYAIGN AYASQIYNTMKKDFDVDKALENQDMKKITDWLGEKIHKYGRLKDTPEIIKKVTGEELNPK YYIDYLKEKYKKIYEI >gi|296154068|gb|ADVK01000042.1| GENE 14 15514 - 16785 1828 423 aa, chain - ## HITS:1 COG:FN0060 KEGG:ns NR:ns ## COG: FN0060 COG1686 # Protein_GI_number: 19703412 # Func_class: M Cell 
wall/membrane/envelope biogenesis # Function: D-alanyl-D-alanine carboxypeptidase # Organism: Fusobacterium nucleatum # 56 423 1 368 368 623 99.0 1e-178 MYMKFKKIILVMSILGILFTNSYSNEVREVQLIDDFSAELLGETKTQDPEVPKPAMQQET QEITEDKNVDEEQIENSKDRVSENKEENKEKEETVKNENSHKENEVKKDEIKETIEKARE KNNSVKEVKEDKDLAIEEPENPEKDKQKYEMIKYYSADGVEWELPDNFRAVVVGDTKGNV IFAKDADTMYPLASVTKMMSLMVTFDEINAGKISLNDSVRISKNPLKYGGSGIPLKAGQM FVLEDLIKASAVYSANNATYAIAEYVGNGSVFSFVAKMNKKLKEYGLQNEIKYHTPAGLP TRVTKQPMDEGTARGIYKLSIEALKYKKYIEIAGIKSTKIHNGKISIRNRNHLIGENGVY GIKTGFHKEAKYNIAVASKFEGIDVIIVVMGGETYKTRDGIVLSVLDILNSNYTIKNGLI KRK >gi|296154068|gb|ADVK01000042.1| GENE 15 16855 - 17232 711 125 aa, chain - ## HITS:1 COG:FN0059 KEGG:ns NR:ns ## COG: FN0059 COG0822 # Protein_GI_number: 19703411 # Func_class: C Energy production and conversion # Function: NifU homolog involved in Fe-S cluster formation # Organism: Fusobacterium nucleatum # 1 125 4 128 128 226 100.0 5e-60 MQYTEKVMQHFMNPHNVGVIENPDGYGKVGNPSCGDIMEIFIKVDNNIISDVKFRTFGCA SAIASSSISTDMIIGKTVEEALKLTNKQVVDELGGLPAVKMHCSVLAEEAIKMAIEDYIS KRDGK >gi|296154068|gb|ADVK01000042.1| GENE 16 17294 - 18487 1678 397 aa, chain - ## HITS:1 COG:FN0058 KEGG:ns NR:ns ## COG: FN0058 COG1104 # Protein_GI_number: 19703410 # Func_class: E Amino acid transport and metabolism # Function: Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes # Organism: Fusobacterium nucleatum # 1 397 1 397 397 752 99.0 0 MKVYLDNNATTKVDEEVVKAMLPYFSDYYGNPFSLHLFGAETGKAVTEARQTIADILKAK PNEIIFTASGSEADNLAIRGIAKAYKHRGKHIITSTIEHPAVKNTFMDLIEDGFEVTMVP VDENGVMIIDEFKKVLREDTILVSVMHANNEVGSFQPIEEIAKITKERKIILHVDAVQTM GKVEIYPERMGIDLLSFSGHKFHAPKGIGVLYKRDGVRLARIITGGNQEGKRRPGTSNVP YIVGLAKALEISVANMKEEWNREETLRNYFEDEVSKRIPEIKINGKEARRLPGTSSITFK YLEGESMLLNLSLKGIAVSSGSACSSDSLQPSHVLLAMGIPAEFAHGTLRFSLSKYTTKE EIDYTIESLVEIIGKLRELSPLWKTFKDNKLTNEASF >gi|296154068|gb|ADVK01000042.1| GENE 17 18574 - 19218 857 214 aa, chain - ## HITS:1 COG:FN0057 KEGG:ns NR:ns ## COG: FN0057 COG0177 # Protein_GI_number: 19703409 # Func_class: 
L Replication, recombination and repair # Function: Predicted EndoIII-related endonuclease # Organism: Fusobacterium nucleatum # 14 214 1 201 201 398 99.0 1e-111 MTKKEKVKKILVELEKKFGTPKCALDFKTPFELLVAVILSAQCTDKRVNIVTEEMFKHVN TPEQFANMELEEIENYIKSTGFFRNKAKNIKKCSEQLLEKYNGEIPQDMDKLTELAGVGR KTANVVRGEVWGLADGITVDTHVKRLTNLIGLVDSEDPVKIELELMKIVPKKSWIVFSHY LILHGRATCIARRPRCLECEISKYCNYGIKKLSE >gi|296154068|gb|ADVK01000042.1| GENE 18 19223 - 19702 414 159 aa, chain - ## HITS:1 COG:no KEGG:FN0056 NR:ns ## KEGG: FN0056 # Name: not_defined # Def: acetyltransferase (EC:2.3.1.-) # Organism: F.nucleatum # Pathway: Tyrosine metabolism [PATH:fnu00350]; Benzoate degradation [PATH:fnu00362]; Naphthalene degradation [PATH:fnu00626]; Aminobenzoate degradation [PATH:fnu00627]; Limonene and pinene degradation [PATH:fnu00903]; Microbial metabolism in diverse environments [PATH:fnu01120] # 1 159 1 159 159 231 99.0 7e-60 MSKFKIRNMREDDIEIIYKNLHFDFVNKYFKNRKQQQKIHKNHNEWYKTHISSFDYSIYI FEDEENNFVALTSYEILANVAKVNIYLNKDYRNKKYSQEILSESINKFLIDYKNIKYLQA YILEENIASKKIFENLGFIYNNEKEICNDGLEYLVYKRI >gi|296154068|gb|ADVK01000042.1| GENE 19 19730 - 20233 273 167 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|228000081|ref|ZP_04047083.1| acetyltransferase, ribosomal protein N-acetylase [Brachyspira murdochii DSM 12563] # 5 167 4 166 166 109 33 4e-23 MEIEIREVEVEDYKELLDFMRKVKGETNFLLGYPDEIKLSYEDEKEHIKKVKSSETSNHF VAMKEDKMIGCTSFNGNTARKMKHYGTIGISVLKEYWGRGVATVLLEKLISWAKEKGIKK INLDVFENNKRAIELYEKFGFKLEGCIEDGIFDGENYINLLVYGLKI >gi|296154068|gb|ADVK01000042.1| GENE 20 20344 - 21552 880 402 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163739624|ref|ZP_02147033.1| 50S ribosomal protein L32 [Phaeobacter gallaeciensis BS107] # 7 395 12 410 418 343 44 2e-93 MANVYDVLKERGYLKQLTHEEEIREILGKEKVTFYIGFDPTADSLHVGHFIAMMFMAHMQ QHGHRPIALAGGGTGMIGDPSGRSDMRTMMTVEMIDHNVECIKKQMQKFIDFSEDKAILA NNADWLRNLNYIEFLRDVGEHFSVNRMLAAECYKSRMENGLSFLEFNYMIMQAYDFYILN HKYNCTMQLGGDDQWSNMIAGVELLRRKDRKPAYAMTCTLLTNSEGKKMGKTAKGALWLD 
PEKTTPYEFYQYWRNIDDQDVEKCLALLTFLPMDEVRRLGALKDAEINEAKKVLAYEVTK IIHGEEEATKAKEATEALFGSGNNLDNAPKIEVTDEDFSKELLDVLVDRKIIKTKSEGRR LIEQNGMSLNDEKIKDVKFTLNDNTLGLLKLGKKKFYNIVKK >gi|296154068|gb|ADVK01000042.1| GENE 21 21786 - 23060 1892 424 aa, chain - ## HITS:1 COG:FN0053 KEGG:ns NR:ns ## COG: FN0053 COG1114 # Protein_GI_number: 19703405 # Func_class: E Amino acid transport and metabolism # Function: Branched-chain amino acid permeases # Organism: Fusobacterium nucleatum # 1 424 1 424 424 670 99.0 0 MYNMIDVITAGFALFAMLFGAGNLIFPPMLGYELGSSWGVASFAFILTGVGIPLMGIIAS ANAGKSLDSFSDKVSPLFAKFYGIALILSIGPLLALPRTGATAYEVTFYHAGFTTSIWKY LYLGIYFLLALLFSLKSSKVVDRVGKILTPILLIVLFIILVKGVFFNDLPIAERIYELPF KKGFTEGYQTMDALAAIVFSTVILNAIRGKVELTPKQEFSYLLKVGLIAAAGLAIVYAGL SYIGASFGGLDLVAGAEKTDLLVKISINLLGKIGYLILAICVAGACLTTSIGLIVTVAEY FSNLIKVSYEKLVVITTIIGFLFAIFGVNKIVIISVPVLVFLYPISIALIILNFFRIKNA NVFKGVVLVSGLIGLYEGISVTGITMPEILSNIYNSLPLVNLGLPWLVPALIVGICCYFI KDEK >gi|296154068|gb|ADVK01000042.1| GENE 22 23189 - 23551 400 120 aa, chain - ## HITS:1 COG:FN0052 KEGG:ns NR:ns ## COG: FN0052 COG1393 # Protein_GI_number: 19703404 # Func_class: P Inorganic ion transport and metabolism # Function: Arsenate reductase and related proteins, glutaredoxin family # Organism: Fusobacterium nucleatum # 1 120 1 120 120 175 100.0 2e-44 MKDIVFFCYPKCSTCQKAKKWLQENSVEFTERDIVKDNPTEAELKKFYKKSKKELKKFFN TSGILYREMELKDKLPTMTEEEMLKLLATDGKLVKRPMIVTKDVILNGFKEEEWKKLLKK >gi|296154068|gb|ADVK01000042.1| GENE 23 23681 - 25402 2927 573 aa, chain - ## HITS:1 COG:FN0050_2 KEGG:ns NR:ns ## COG: FN0050_2 COG1053 # Protein_GI_number: 19703402 # Func_class: C Energy production and conversion # Function: Succinate dehydrogenase/fumarate reductase, flavoprotein subunit # Organism: Fusobacterium nucleatum # 90 573 1 484 484 797 99.0 0 MRKNFFGKLFGLTLLMFTLLFSGASAEVYEGTGLGYDKNGIILDVEITNNKIVDIKVKRA KESDFATPAIQEIAKKVIATQSLDVDGISGASLTSEGTKEAIEEAVSKSGVTLTAVAVQN TKAVELPKEADVVVIGAGGAGLTSAIAAHEKGAKVILIEKTELLGGNTNYATAGLNAAGT 
KIQEKLGEKDSPELFYEDTMKGGKNKNNKELVKVLANNSSAIVDWLIERGADLSELTSTG GQSAKRTHRPTGGSAVGPNIISALSKTAENEKIDIRKGTKAIALVKGKNRIVGVKVKEAD GKEYTIKAKAVIVATGGFGANAKMVEKYNPKLKGFGSTNSPAIVGDGIVMVEKVGGALVD MNEIQTHPTVVYKKTNMITEAVRGEGAILVNKDGKRFIDELETRDVVSKAILSQNGKSAF LIFDEGIRTKLKAADGYVKKGFAVEGTLEEIAAKIGTDAKTLEATLNKYNEAVKNKVDSE FNKKNLPKELTGTKYYAIEISPAVHHTMGGVRINTNAEVLGKNGRPIKGLYAAGEVTGGI HGANRIGGNAVADITIFGKIAGENAATYSKSVK >gi|296154068|gb|ADVK01000042.1| GENE 24 25561 - 26085 851 174 aa, chain - ## HITS:1 COG:no KEGG:FN0049 NR:ns ## KEGG: FN0049 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 13 174 1 162 162 166 99.0 3e-40 MKKKLFGVLLFSMVLSSLAYAEADADKMRERMEQREKEQQQGVVENSSSVEKSNENVAGT TSTNLSLEEEREAYAALERARARIEKEEQEKLKAQQEMAEAQNQMEAQTETMAEQTANEN PNQNQVFVEESTPRMTPEEEKEAYEALERVRARILKEDEERAELLKAAAEQQVQ >gi|296154068|gb|ADVK01000042.1| GENE 25 26407 - 27165 1102 252 aa, chain + ## HITS:1 COG:FN0048 KEGG:ns NR:ns ## COG: FN0048 COG0647 # Protein_GI_number: 19703400 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted sugar phosphatases of the HAD superfamily # Organism: Fusobacterium nucleatum # 1 252 1 252 252 457 99.0 1e-128 MKTYIIDLDGTMYSGGTNIDGAREFIDYLHSKNLPYIFLTNNATRTKKLAKEHMLNLGFK DIKEEDFFTSAMATAQYIAKNYTEKKCFMLGESGLEEALKECNLELVQENANFVVVGLDR NATYKKYSEALHHILKGAKFIATNPDRLLANNETFDIGNGAVIDMLEYASGVEAVKIGKP YQTILNILLEEKKLKKEDIILLGDNLETDIKLGYDAKIETIMVCSGVHTEKDIARLKVYP TRVVKNLRELIK >gi|296154068|gb|ADVK01000042.1| GENE 26 27212 - 27973 1054 253 aa, chain - ## HITS:1 COG:FN0047 KEGG:ns NR:ns ## COG: FN0047 COG0708 # Protein_GI_number: 19703399 # Func_class: L Replication, recombination and repair # Function: Exonuclease III # Organism: Fusobacterium nucleatum # 1 253 1 253 253 508 100.0 1e-144 MKLISWNVNGIRAAIKKGFLDYFNEQNADIFCLQETKLSAGQLDLELKGYHQYWNYAEKK GYSGTAIFTKQEPLSVSYGLGIEEHDKEGRVITLEFEKFYMVTVYTPNSKDELLRLDYRM VWEDEFRKYLKNLEKKKPVVVCGDLNVAHKEIDLKNPKTNRRNAGFTDEERGKFTELLDS 
GFIDTFRYFYPNLEQVYSWWSYRGRARENNAGWRIDYFVVSKGLEKSLVDAEIHSQIEGS DHCPVVLFLEFNK >gi|296154068|gb|ADVK01000042.1| GENE 27 28215 - 28658 552 147 aa, chain - ## HITS:1 COG:FN0046 KEGG:ns NR:ns ## COG: FN0046 COG0757 # Protein_GI_number: 19703398 # Func_class: E Amino acid transport and metabolism # Function: 3-dehydroquinate dehydratase II # Organism: Fusobacterium nucleatum # 1 147 1 147 147 283 100.0 5e-77 MKIMVINGPNLNMLGIREKNIYGTFSYDDLCKYIKDYPEYKDKNIEFEFLQSNVEGEIVN FIQEAYSKKYDGIILNAGGYTHTSVAIHDAIKAVSIPTVEVHISNIHAREDFRKVCVTSP ACIGQITGLGKLGYILAVVYLIEYYNK >gi|296154068|gb|ADVK01000042.1| GENE 28 28639 - 29442 958 267 aa, chain - ## HITS:1 COG:FN0045 KEGG:ns NR:ns ## COG: FN0045 COG0169 # Protein_GI_number: 19703397 # Func_class: E Amino acid transport and metabolism # Function: Shikimate 5-dehydrogenase # Organism: Fusobacterium nucleatum # 19 267 1 249 249 409 99.0 1e-114 MRKFGLLGKKLSHSLSPLLHKTFFEDIGLKDEYKLYEVDETEIDNFKNYMLENSIEGVNI TVPYKKIFLDKLNFISDEAKDIGAINLLYIKDNKFYGDNTDYYGFKYTLTKNDIDVKNKK IAIIGKGGASASVNKVLKDMGAKDITFYFRRDKLSKIEFPENMEGDIIINTTPVGMYPNI HDNLVNEEILKNFKIAIDLIYNPLETEFLKIARKNGLKTINGMDMLIEQALKTDEILYDI LLSTQLRKKIRKKMKKKVEEFYENNGD >gi|296154068|gb|ADVK01000042.1| GENE 29 29439 - 29696 238 85 aa, chain - ## HITS:1 COG:FN0044 KEGG:ns NR:ns ## COG: FN0044 COG1605 # Protein_GI_number: 19703396 # Func_class: E Amino acid transport and metabolism # Function: Chorismate mutase # Organism: Fusobacterium nucleatum # 1 85 2 86 86 103 100.0 8e-23 MIELELMRKKIDEIDDKLLVLFKERLEVSKKIGLLKKKNKIEIFDPQREQEIIDSCTKNI SEDERIYIEKFLRNLMDISKEVQSK >gi|296154068|gb|ADVK01000042.1| GENE 30 29662 - 30207 534 181 aa, chain - ## HITS:1 COG:FN0043 KEGG:ns NR:ns ## COG: FN0043 COG2849 # Protein_GI_number: 19703395 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 24 181 1 158 158 284 99.0 5e-77 MKIKKIFLFLLLLVCFELFAVSKLPKNLFNSDKINILKKGILNGPINVYYPNGKIQAKQF FINNRKAGIWQYYYESGKLKAEIVYNIMSNDEEGIIKTYDEKGVIISEGRIVNDNMAGVW 
NYYDEKGRKNYTYDFVKGIITTYDEKGKIIFQVTERDLANRFREIQQEISDDRVRANEEK N >gi|296154068|gb|ADVK01000042.1| GENE 31 30321 - 31574 997 417 aa, chain + ## HITS:1 COG:FN0042 KEGG:ns NR:ns ## COG: FN0042 COG0772 # Protein_GI_number: 19703394 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Bacterial cell division membrane protein # Organism: Fusobacterium nucleatum # 1 417 1 417 417 678 100.0 0 MKKNLVIDDDVVENKTLYKKVNDIKKERENEKEKGLIGRRKNAIISFFLILMIIGLINFL SSISRFDNARILDKIVKQSAILGVSLLIFFTTSRSKFGNIIYKIISKPSFRIFVLLSSLI IFLIIAYVPSESLFPTINGGKGWVHIGPVSIQVPEIFKIPFIMVLANILSRGKDDKKKIE YMQNLVSVLFYTAIFAITITICLQDMGTAIHYFMIASFMIFLSDIPNKLIFPAFFGILAS IPILLYIFLHTLSGYKQHRIKVFLDGILHSNYDREEAYQIYQSLIAFGTGGVLGKGFGNG VQKYNYIPEVETDFAIATYAEETGFIGMILVLFLFFSLFFLIMGVANKSKNYFSKYLVGG IAGYFITQVIINIGVAIGLIPVFGIPLPFISSGGSSLLAISIAMGLVIYVNNTQTLK >gi|296154068|gb|ADVK01000042.1| GENE 32 31643 - 31831 381 62 aa, chain + ## HITS:1 COG:FN0041 KEGG:ns NR:ns ## COG: FN0041 COG4224 # Protein_GI_number: 19703393 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 62 1 62 62 73 96.0 1e-13 MEMKDIIAKVNYYAKLSKERKLTEEEVKDREIYRKMYLDQFKAQVKEHLDNIEIVDEKDF KN >gi|296154068|gb|ADVK01000042.1| GENE 33 31841 - 33226 2040 461 aa, chain + ## HITS:1 COG:FN0040 KEGG:ns NR:ns ## COG: FN0040 COG0017 # Protein_GI_number: 19703392 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Aspartyl/asparaginyl-tRNA synthetases # Organism: Fusobacterium nucleatum # 1 461 1 461 461 877 96.0 0 MITVKDIFRHGKDYLDKEIELFGWVRKIRDQKKFGFIELNDGSFFKGVQIVFEEGLENFD EISRLSIASTIKVKGVLVKSQGSGQDLEVKADKIEIFQKADLEYPLQNKRHTFEYLRTKA HLRARTNTFSAVFRVRSVLAYALHKFFQENNFVYVHTPIITGSDAEGAGEMFRITTLDLN KVPKKENGEVDFSKDFFGKSTNLTVSGQLNGETYCAAFRNIYTFGPTFRAEYSNTARHAS EFWMIEPEIAFGDLNANMELAEAMVKYIIKYVMDNCPEEMEFFNSFIEKGLFDKLNNVLN NDFGRVTYTEAIEILEKSGKKFEFPVKWGIDLQSEHERYLAEEYFKKPVFVTDYPKDIKA FYMKLNEDNKTVRAMDLLAPGIGEIIGGSQREDNYELLTKRMKELGLKEEDYEFYLDLRR 
FGSFPHSGYGLGFERMMMYLTGMQNIRDVIPFPRTPNNAEF >gi|296154068|gb|ADVK01000042.1| GENE 34 33238 - 33789 759 183 aa, chain + ## HITS:1 COG:FN0039 KEGG:ns NR:ns ## COG: FN0039 COG1658 # Protein_GI_number: 19703391 # Func_class: L Replication, recombination and repair # Function: Small primase-like proteins (Toprim domain) # Organism: Fusobacterium nucleatum # 1 183 1 183 183 323 98.0 9e-89 MKKKIKEVIVVEGKDDISAVKNAVDAEVFQVNGHAVRKNKSIEILKLAYENKGLIILTDP DYAGEEIRKYLCKHFPNAKNAYISRVSGTKDGDIGVENASPEDIITALEKARFSLDNSEN IFNLDLMIDYNLIGKDNSADLRALLGAELGIGYSNGKQFMAKLNRYGISLEEFKKAYEKI NMK >gi|296154068|gb|ADVK01000042.1| GENE 35 33966 - 34253 486 95 aa, chain + ## HITS:1 COG:no KEGG:FN0038 NR:ns ## KEGG: FN0038 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 95 6 100 100 121 97.0 9e-27 MNQVEKKELMGKFAKKLENAIKREASVIKEMENDKALIKYLEGLKASGAAFDNTVYESYD AWIETIKKQIKKSESTLKNIEFKKVELEAIQKYIA >gi|296154068|gb|ADVK01000042.1| GENE 36 34401 - 34853 492 150 aa, chain + ## HITS:1 COG:no KEGG:FN0037 NR:ns ## KEGG: FN0037 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 150 1 150 150 261 96.0 9e-69 MRFYSYNYLLEQIAKFDWWGTTLIIFLIICLIFTMYKYYQRQKETKFRELAIILGLGIVV MISIKISQYRVTQVNDNKYRQAIHFIEVVAEDLKTDKENIYINTSASIDGTLVRIGSLYY RVISGDNGENYLLEKIDLNNPKIEIIEVKK >gi|296154068|gb|ADVK01000042.1| GENE 37 34853 - 35485 747 210 aa, chain + ## HITS:1 COG:FN0036 KEGG:ns NR:ns ## COG: FN0036 COG2323 # Protein_GI_number: 19703388 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 210 1 210 210 366 97.0 1e-101 MELSYLDIAIKLTMGLLSLVLVINISGKGNLAPSSAMDQVLNYVLGGIVGGVIYNPGISI LQYFIILMIWTMIVLILKWLKTNSILFKSILDGQPVIIIKKGILDVEACRRAGLTANDIA FKLRTNGVYSVKKVKRAVLEQNGQLIIVLQDEENPKYPIITDGTIQTNILEAIDKDMDWL QEQLKEMGYENISDVFLAEYNNGKITVITY >gi|296154068|gb|ADVK01000042.1| GENE 38 35549 - 35641 79 30 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MIISNTAEHRDFLADSKLGQKLRKLMSWIK 
>gi|296154068|gb|ADVK01000042.1| GENE 39 35773 - 36447 764 224 aa, chain + ## HITS:1 COG:no KEGG:FN0034 NR:ns ## KEGG: FN0034 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 9 224 1 216 216 323 97.0 4e-87 MKKFSLKFMLFVILSIFSSLTYADGAFSKYYNGRFYFSIDVPVEKYENGGGETSKLDFIK NSKSIPTKEFFSAYEAGNSDGLTIKDKSENITILAYGTNFLNTEETNGLEDIENIKESFK KDNLDYNEFIKKYYNGKLSKNINPLKYNYNKVLFINGNNITYNTIEKDFYVVSYIENNKI HYKKVIYNKDKNSYAIFEASYLEKDKKIMDSIVNEMVKSFKIIK >gi|296154068|gb|ADVK01000042.1| GENE 40 36590 - 41752 5878 1720 aa, chain + ## HITS:1 COG:no KEGG:FN0033 NR:ns ## KEGG: FN0033 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 114 1720 1 1607 1607 2783 96.0 0 MLSFYKKSLEKDVEKFVEKIKKDSKKLDKENQKFIEDIFLTEKDGTYYSYAAYLKNAIKQ GLSSKKDIKFNDIFPKNIYPAIELLIGKKFFKIFLDISKNTIKYPFSRGYYRRMVRSANY YNHIDFLFELLEDLVDLNFLNLDIVTALKGEYDHDGIYGLGSPYLTAYEIDNGNKELIDL IKEALGSQKSKIELNYSIMQAIFISSNKELVELTGKLLLAAKLQEGVRQQICETMDSGLQ ENFEYMFKIIYDNNLIRFSSVKRALGTWTGLTRDENIDINKFGKKELEIINKLIANPKYE DELLKSDDNVEVYLGLWNKATRDIKYSIEAMEKLLKSSKYHTKLLISYYLHTIEDMYYQR EIAKKVIKEYGKDNKNIVEILACYLHYAIGYIYASSLKNDIKTGKITSKMFFKDKKEALE FFDILENALSLMKGKEKVFDPCIFPWNVKSIDTGEISTALLFITILYPDEVLKNKVMKYI KEIDTWNRGHYLEALFEKPSNKEQKDFVITMLSDRTGAGITAYEIAKDNNLIKEYPREIE DLLRLKSGETRKNLIDLLMCQDKKALLTSIDNLVSAKNENKRLAGLDILNLANSKQKPLY DKKEVKNLVAKISSPTDAEKILIENLSDKKKKESENTLSKLYNTEYKLDLPYEIKEVDKS SKAIKKNKKGEYIIENTFNIKKIFSKSTDELFKLVKKLSELYVKNENYEYMSFYNKEYVL LRDRFNPEKNLDNIPYNERQKLAYYPLEDIWREFYKKEIKDFSTLWQLYTLLLKDYNSNL NENNAKEYQDFYKKILGIDITELRTKLKKANLKYVFTESYYNDTGFVLEIISMLYKEYCK ENEAYLFEIGKLFTSYVLENFELKDIVEQREKYNKEIYYSVNIYNTNSGLYYLFAKTIDY LEFYNNEKSFIESFVLRYNLDEKINKYISENLKDYEIQGVTKDLGLRNYAIANNLKITEK DLIYKYVLELDNEAEKEIDVDAFSELDNYMNNYRNILAKKEDKRLTTLNQFMLNEALKII YNEGRKIVDYVVQNELKRGDSPTIYSKSIYRIKRIEGIDYLVQILQALGKETLDRSYYYW GGYDSKKSVLSHLLKVCYPTEKDNSKELAKKLKGTNIIEQRLIEVAMYSSQWLEIIESYL GWKGLVSGCYYFQAHMSDVDNNKEGLIAKYTPISIDDLRDGAFDIDWFKSAYKELGEKKF 
EMLYDSAKYISDGAKHSRARMFADAVLGNLKLKETEKKIEDKRNKDLVASYSLIPLLKDK QKDALHRYQFLQKFLKESKKFGAQRRASEAKAVNISLENLSRNMGYSDVTRLIWNMETAL INEMKEYFKPKKLDDVDVYIKIDDLGQSEIIYEKAGKELKSLPTKLKKDKYIEAIKEVHK NLKEQYRRSRKMLEEAMEDGTEFYGYEIENLMTNPVIAPILKSLVFKMGNDLGYYVDKKL KSAKKKSVAIKDDSLLKIAHCFDLFESGEWSNYQKDIFDRELKQPFKQVFRELYVKTVDE KGRDKSLRYAGHQVQPAKTVALLKTRRWIIDGQEGLEKVYYKKNIIAKIFALADWFSPAD IEAPTLEEIQFFDRKTFKPILIDEVPDLVFTEVMRDIDLVVSVAHIGDVDPEASHSTIEM RKAIVEFNCKLFKLKNVTFTENHALIKGERAEYSIHLGSGLIHQKAGSAINVLPVHSQHR GRVFLPFIDDDPKTAEIMAKVLLFAQDDKIKDVFILEQIK >gi|296154068|gb|ADVK01000042.1| GENE 41 42004 - 42075 77 23 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MAKIAPMPVGAIFYINSQSLKLV >gi|296154068|gb|ADVK01000042.1| GENE 42 42121 - 43110 747 329 aa, chain + ## HITS:1 COG:FN0032 KEGG:ns NR:ns ## COG: FN0032 COG4927 # Protein_GI_number: 19703384 # Func_class: R General function prediction only # Function: Predicted choloylglycine hydrolase # Organism: Fusobacterium nucleatum # 1 329 1 329 329 615 99.0 1e-176 MYHSRWKNSHREAGFTYGDRLYKNNIIINFKSYLTKEKINYVEQVFDIYNEYYPEIIEEI KGFAEGQHTDFKIVFAFLVTMYVFTYDNYCSMLALTNKNCLVFARNSDFLVDIKKVSDST FYKLNSNFSFIGNTTAMIQMEDGINEKGLACGLTFVYPTVKNYGFNAGFLIRYILEKCET TEQAVDFLNKVPIGSSQNIIIIDRFRNLAVAELNSSHKTIRINEANVVYRTNHFIEQTML KYKYLGDDDVFSHLRYKTLNSQNYTEFNLSGIFELLKGKNGFICQYDKTKKFDTIWSSVF DIKNKVIYRCEGNPKQKKFIIDKRLKFSF >gi|296154068|gb|ADVK01000042.1| GENE 43 43158 - 44000 841 280 aa, chain + ## HITS:1 COG:no KEGG:FN0031 NR:ns ## KEGG: FN0031 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 280 1 271 271 471 98.0 1e-131 MKKIFLLLSVFFISNSLFAATDDKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKM GQHIAELSDGTKVYFKIFANSKGGIINPSGATYTINYFRYQSPRICVDWSEPEDTVLPTF DDFIIDKNYIAWEYDIFEEVTRESMPKKLSPEVDIYVFTKIYSPSEFKDIDYDIEKPKAN LPKIDSDYNIPNNEDKTFQNYIKQIINLDKTYKDTNDAKKQKKILKEYDKIAKIIWSEYP KYNIAQGSYDNVDLKALNLKSLDRGIIITKIEDDLNDRKR >gi|296154068|gb|ADVK01000042.1| GENE 44 43975 - 44058 148 27 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MTLMTEKDNTRDFKGLLKKCDIDYDSL 
>gi|296154068|gb|ADVK01000042.1| GENE 45 44033 - 44155 124 40 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MILTTTAYKFVTTTTFDKEFKKLDKSVQKIIAKYIKNSQF >gi|296154068|gb|ADVK01000042.1| GENE 46 44253 - 44729 691 158 aa, chain - ## HITS:1 COG:FN0030 KEGG:ns NR:ns ## COG: FN0030 COG3467 # Protein_GI_number: 19703382 # Func_class: R General function prediction only # Function: Predicted flavin-nucleotide-binding protein # Organism: Fusobacterium nucleatum # 1 158 1 158 158 313 100.0 1e-85 MRRKDREILDEVKIDKFIRNCDCCRIGFYDKENDEVYIVPLNFGYSNIDNKRVFYFHGAK EGRKIDLISKTNKVTFEMDSNHELIVGKMACNYSERYQCVMGTGLISFVEDKEEKAMALN EIMFQNMGKKDWDFPEPMLNGVAVFKIEVTSLSCKEHL >gi|296154068|gb|ADVK01000042.1| GENE 47 44872 - 45303 648 143 aa, chain + ## HITS:1 COG:FN0029 KEGG:ns NR:ns ## COG: FN0029 COG0716 # Protein_GI_number: 19703381 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 143 1 143 143 270 100.0 6e-73 MNKINIVYYTFTGNTLRMVKAFEKGLQEANVPFKSYSIVELKDDNEVFDCEILALASPAN QTEEIEKNYFQPFMERNAKKFKDRKIYLFGTFGWGSGKFMSNWIKQVEELGAKIVELPMA CKGSPNSETKEKLANMAKKIATM >gi|296154068|gb|ADVK01000042.1| GENE 48 45345 - 45812 821 155 aa, chain - ## HITS:1 COG:FN0026 KEGG:ns NR:ns ## COG: FN0026 COG2849 # Protein_GI_number: 19703378 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 155 1 155 155 254 100.0 6e-68 MSSVLAFSARVIKSTEAVVKNNVIYVGQEKSPYTGVIETYNEKGVLEAKADIKNGQMDGS SKIYYPSGKLQSEATFKNNVQVGVQKDYTEDGKLKLELPYKNGKLDGVVKSYYPNGKIEI EEPYKNGERDGVAKAYDENGKVVQQATFKNGKQIK >gi|296154068|gb|ADVK01000042.1| GENE 49 45851 - 46351 592 166 aa, chain - ## HITS:1 COG:FN0025 KEGG:ns NR:ns ## COG: FN0025 COG2849 # Protein_GI_number: 19703377 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 166 1 166 166 271 93.0 4e-73 MKKLLLGLFLISSALTFSARVVKSTETVLKDNLIYVGEETTPYTGVIETYNEQGILVVKD 
EFKNGLRDGSSKKYFLNGGKVSLESTFSNGIQVGVEKRYYESGELLSERSYKNGKMDGIG KSYYQNGQVEMEEPYKNGERDGVIKVYDENGKVVRQATFKNGKQVK >gi|296154068|gb|ADVK01000042.1| GENE 50 46360 - 47085 1117 241 aa, chain - ## HITS:1 COG:FN0024 KEGG:ns NR:ns ## COG: FN0024 COG2849 # Protein_GI_number: 19703376 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 14 241 1 228 228 363 96.0 1e-100 MKKLLAGLLLVGSVLSFGAAQRVPLEKLVGNGDGELLYLEGEQKPYSGEVERKYPNGKLL GLATLKDGKLEGKAYVYYENGKVKREETYVNGKANGPAKSYHENGQVEYETNFKNSQREG IEKAYSKTGILVSEVPFKNDNATGLVKLYNEQTGKLEYETNVVNGVRNGLSKKFYPNGKL LSEVVFKNDKEEGIMKAYYENGKLQGEATYKNGQLDGVVKMYDETGKVVDQATFKNGKQV K >gi|296154068|gb|ADVK01000042.1| GENE 51 47398 - 48897 2062 499 aa, chain - ## HITS:1 COG:FN0023 KEGG:ns NR:ns ## COG: FN0023 COG1288 # Protein_GI_number: 19703375 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 499 1 499 499 862 99.0 0 MKKIKMPDTFVIIFFVVIFASLLTYIVPVGKFEMQEVTYVTNTGAEKTRNVPVPGSFSYE LDDKGNELKKGIKIFEPGGEVGVTNYVFEGLASGDKWGTAVGIVAFLLVVGGAFGIILKT GAVESGIYSMISKSKGSELVLIPVIFILFSLGGAVFGMGEEAIPFAMLIIPIVIDMGYDS ITGILITYISTQIGFATSWMNPFSVAVAQGVSGIPVLSGAGFRIFMWLFFTAFGVIYTIF YARKVKRNPESSIAYKTDGYFRDNFKSEEQGNREFKLGHKLIILVLILGMAWVVYGVVKE GYYLPEIATQFVIMGLIAGIIGVVFKLNNMSVNDIATSFRKGAEDMVGAALVIGMAKGIV LILGGTSADTPTILNTILNYVASALSNMSAAFCAWVMYIFQSVFNFFVVSGSGQAALTMP IMAPLSDLVGVTRQVAVLAFQLGDGFTNMIVPTSGILMAVLGIAKIEWGVWAKYQIKFQL ILFALGSCFIFFAVFTNFS >gi|296154068|gb|ADVK01000042.1| GENE 52 49037 - 49318 602 93 aa, chain - ## HITS:1 COG:FN0022 KEGG:ns NR:ns ## COG: FN0022 COG2088 # Protein_GI_number: 19703374 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Uncharacterized protein, involved in the regulation of septum location # Organism: Fusobacterium nucleatum # 1 93 1 93 93 170 97.0 5e-43 MKVTNVKIKKVDGDKFDRLRAYVDVTLDDCLVIHGLKLMQGEQGMFVAMPSRKMRNEEFK DIVHPICPELRNDITKVVQEKYFALDQEQEAAV >gi|296154068|gb|ADVK01000042.1| GENE 53 49334 - 
50206 1046 290 aa, chain - ## HITS:1 COG:FN0021 KEGG:ns NR:ns ## COG: FN0021 COG1947 # Protein_GI_number: 19703373 # Func_class: I Lipid transport and metabolism # Function: 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase # Organism: Fusobacterium nucleatum # 1 290 5 294 294 495 99.0 1e-140 MNKYKIFSNAKINIGLNVFQKESDGYHNIDSIMAPIDLSDEMDVTFYSDLGDLKIECSDK SIPTDERNILYKTYKIFFEESKKEKEKIDIILKKNIPSEAGLGGGSSNAGFFLKLLNKHY GNVYNEKELEKLAMRVGSDVPFFIKNKIARVGGKGNRVDLVENNLKDSIILIKPLDFGVS TKEAYESFDNLKEVKYADFDKIIKNLKEGNRIALESNIENSLEQGILETDTNIKMLKMTL NSVVSGKKFFMSGSGSTYYTFVTELEKSQIETRLKTFVDNVKIIICKTIN >gi|296154068|gb|ADVK01000042.1| GENE 54 50199 - 50498 356 99 aa, chain - ## HITS:1 COG:FN0020 KEGG:ns NR:ns ## COG: FN0020 COG1188 # Protein_GI_number: 19703372 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) # Organism: Fusobacterium nucleatum # 1 99 1 99 99 158 100.0 2e-39 MRLDKFLKVSRIIKRRPIAKLVVDGGKVKLDGKVVKAAAEVKVGQILEIEYYNKYFKFEI LQVPLGNVSKDKTSDLVKLIETKGLDIEINLDKDEDFFE >gi|296154068|gb|ADVK01000042.1| GENE 55 50614 - 53559 3088 981 aa, chain - ## HITS:1 COG:FN0019 KEGG:ns NR:ns ## COG: FN0019 COG1197 # Protein_GI_number: 19703371 # Func_class: L Replication, recombination and repair; K Transcription # Function: Transcription-repair coupling factor (superfamily II helicase) # Organism: Fusobacterium nucleatum # 1 981 1 981 981 1596 96.0 0 MEKKFRGEIPFWLKNKKNNLVYICSSNRNIDDYFFVLKDFYKGKILRIKKENEVGELKKY NYDLLELINSNEKFIILISLDYFLEDYYSEANSIFIEKGKNINIKDIEEKLVDAGFEKKY MVAQRKEYSIRGDILDIFNINQDNPVRIEFFGNEVDRITYFDINSQLSIEKKDSIELYID NNKNKRDFLSLMSINKNKIEYYYENNEILQAKIKRFINENLDREKEILSKIAELSKIGIQ IEIQKFSEEELKQFEVIDRVKKLSENTKITIYSEEATRYKEIFKDYSVKFEKYPLFEGYK TDDKLILTDREIKGIRVKRERVEKKALRYKAVDEIKEQDYVIHENFGVGIFLGLENIEGQ DYLKIKYADEDKLFVPVDSINKIEKFINISDVIPEIYKLGRKGFKRKKDKLSEDIEIFAK EIIKIQAKRNLGNGFKFSKDTVMQEEFEETFPFTETPAQLKAIEDVKRDMESGKVMDRLI 
CGDVGFGKTEVAIRATFKAVMDGKQVILLVPTTVLAEQHYERFSERFKNYPVHIEILSRV QSKKEQVESLKRIENGSADLVIGTHRLLSDDIRFKDIGLLIIDEEQKFGVKAKEKLKKIK GNLDVLTLTATPIPRTLNLSLLGIRDLSVIDTSPEGRQKIHTEYIDNNKNFIKEIILSEI SREGQVFYIFNSVKRMESKVKEIRELLPEYIKVSYIHGQMLPRDIKKNIQDFENGNVDVL VATTIIENGIDIENANTMIIEGVEKLGLSQVYQLRGRIGRSTKKSYCYMLMNENKTKNAK KREESIREFDNLTGIDLAMEDSKIRGVGEILGEKQHGAVETFGYNLYMKMLNEEILKLKG EAEEKLDEVDVELNFPRFLPDSYIEKNEKVKIYKRALALKNLDELKNLYNELEDRFGKIK SEAKGFFDFIKIRIIARDLGITTIKQDKENKDRILINFNEKKINVDKIIYLLSNKKIMYS KFTRTIGYNGDIFEFFKLYSS >gi|296154068|gb|ADVK01000042.1| GENE 56 53695 - 54180 676 161 aa, chain + ## HITS:1 COG:no KEGG:FN0018 NR:ns ## KEGG: FN0018 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 6 161 1 156 156 289 98.0 3e-77 MKKIFLIFLSLVCISCSQLDIYPSEEERAILTTIGVSRLLNEDEKKTLASSNFVDFVSIH KFFDVNSKNSAFMKEMYNDIISGPCLTDKTIKLINMHYEKIIVADNLDLTQRAIEKMGRT IEGRATLAKCRFVFFNYDSEERVKELSKEYGFKYVFPKLNK >gi|296154068|gb|ADVK01000042.1| GENE 57 54231 - 55493 1409 420 aa, chain - ## HITS:1 COG:FN0017 KEGG:ns NR:ns ## COG: FN0017 COG3177 # Protein_GI_number: 19703369 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 420 1 415 415 572 88.0 1e-163 MSNKYENLIKLYYKKQNIENEYIKRIENPATFITNLKINPIKRGNKILDKEYNLFYLNLI EHTLLEEIITKNSSQINLISNELPQIAIKDIIIKILSNELYKTNKIEGIEIVKSEIHSSL KDNKKFNKKSNKLDGIIKKYKDIMEKNFKDTQHIDSLSSFRKIYDEMFEDFEKSGNYKLD GKYFRKDTVKVINGLGNTIHIGINGEEAIEKNMENLIQFMSRKDIPFLVKASISHFFFEY IHPFYDGNGRFGRYLLSLYLARKLDILTAFSVSYSISKNLDDYYKSFVEVEDVNNYGEIT FFVENILKTIKNGQEEIIKLLNDSIMKLNHSREIFEEITKDLSKKEKIILYIYLQNYLFN DFEKISNIELTFVIKNFNKQNITQQTINKYTQELENKGYLVKIKQRPLTYTLSDKITDKL >gi|296154068|gb|ADVK01000042.1| GENE 58 55572 - 55661 61 29 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MNIINYLISKEKCDKLVECIRIIDRELRK >gi|296154068|gb|ADVK01000042.1| GENE 59 55621 - 56493 816 290 aa, chain - ## HITS:1 COG:no KEGG:FN0016 NR:ns ## KEGG: FN0016 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum 
# Pathway: not_defined # 1 218 1 218 218 374 98.0 1e-102 MKNNLFTFATSELSQDAFICWCLNWINYPNEILYPMAKDIFSNLLKEEKNLENKEIEIRK QYKKIDVLVILKNSKKVYIIEDKTNTFENNQIIRYKEAIKNEIDTIKTVYFKTGFWFSDD DSVLADIKINREDFLGILNKYRGKNQILDDYCEYFERVTESEEKEKNYLISEEELTQKKY WELNIARSIITQYQFMRYIFSKRYIRSGRSIGGGVYTQYDILKDKVFPDTKNENLSEDKR TYSVFWRIDSKDKIGPYISINFYTEHSKGDIKPRCREDEYNKLFNIKRKM >gi|296154068|gb|ADVK01000042.1| GENE 60 56673 - 57221 706 182 aa, chain + ## HITS:1 COG:no KEGG:FN0015 NR:ns ## KEGG: FN0015 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 182 1 182 182 299 98.0 4e-80 MGMDLCYYGVKEEEIPKILDGNFEEDFSELEPSYTTRIFSAKDFYYLYTGGKELEEEDFQ GKNERDLFVEAFLGEATVSFAPGDIYSYCSCKEKVKEISEFLNKINIKDYFKKIGTIEEF SSENFKGEKFTYLGVKTRRRFYPSMEKEYIFDIEATTNEFNELKEFYNKLANEDLALYIY IF >gi|296154068|gb|ADVK01000042.1| GENE 61 57218 - 57907 764 229 aa, chain - ## HITS:1 COG:FN0013 KEGG:ns NR:ns ## COG: FN0013 COG1811 # Protein_GI_number: 19703365 # Func_class: R General function prediction only # Function: Uncharacterized membrane protein, possible Na+ channel or pump # Organism: Fusobacterium nucleatum # 1 105 1 105 113 164 99.0 1e-40 MGLITNFLSIIMGGILGLTIGKKFNEDIKNIIVDCAGIFIIVIGIKSALVAQKDIMILIY LIIGAVIGQLINIDKRIKNFSQFLEDKFVKEKNSLNNEKSFAKGFSTATILYCVGAMSIL GSINSGLTNDNTILNIKAILDGVISIVLTSIYGVGVIFSAVSVTIYQGIFYLFASQIKDY LNPQAISELNAVGGVLVLAIGINMTFKKDIKTANMLPAIFIPLLVSIFS >gi|296154068|gb|ADVK01000042.1| GENE 62 58061 - 58432 407 123 aa, chain + ## HITS:1 COG:FN0012 KEGG:ns NR:ns ## COG: FN0012 COG0563 # Protein_GI_number: 19703364 # Func_class: F Nucleotide transport and metabolism # Function: Adenylate kinase and related kinases # Organism: Fusobacterium nucleatum # 1 121 1 121 165 184 95.0 4e-47 MKIHIIGCSGTGKTYLAKKLSNKYNIPRYDLDNIYWDNSSEKYGIKMEIEKRDKLLQNIL EKDSWIIEGIYYKWLEQSFKDADIIYILDLPKYIYKFRIIKRFIKRKLKLETSKKEIDKI LEI >gi|296154068|gb|ADVK01000042.1| GENE 63 58459 - 58968 641 169 aa, chain - ## HITS:1 COG:FN0011 KEGG:ns NR:ns ## COG: FN0011 COG1827 # Protein_GI_number: 
19703363 # Func_class: R General function prediction only # Function: Predicted small molecule binding protein (contains 3H domain) # Organism: Fusobacterium nucleatum # 1 169 1 169 169 250 99.0 8e-67 MIEREEREKKILEILRNSETLVSGTYLAEFFDVSRQVIVQDIAILKAKNIDIISTNRGYR LLSKGIKKIIKVKHDDSEIRNELNAIVDLGASVEDVFVIHKTYGEIRVKLDIKSRRDVDL LVENINSKLSKPLKNLTDNYHYHTIIAENENIFKEVEDKLKKLGILLEE >gi|296154068|gb|ADVK01000042.1| GENE 64 58961 - 59821 506 286 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163755345|ref|ZP_02162465.1| 30S ribosomal protein S6 [Kordia algicida OT-1] # 15 279 15 283 286 199 41 4e-50 MNLRKIDKFQMDDSIRMALKEDITSEDISTNAIYKKDRLAEISLYSKEEGILAGLDVFKR VFELLDNSVEFTEYKKDGDKILNKDLILKIKANVKTILSAERTALNYLQRMSGIATYTQK MVEALDDKNILLLDTRKTTPNMRIFEKYSVRVGGGYNHRYNLSDAIMLKDNHIDAAGSIT EAIKLAKEYSPFIKKIEIEVEDLKGVEEAVKAGADIIMLDNMDIETTKKAIKIINKQAII ECSGNIDITNINRFKGLEIDYVSSGAITHSAKILDLSLKNLRYVDD >gi|296154068|gb|ADVK01000042.1| GENE 65 59784 - 61091 1632 435 aa, chain - ## HITS:1 COG:FN0009 KEGG:ns NR:ns ## COG: FN0009 COG0029 # Protein_GI_number: 19703361 # Func_class: H Coenzyme transport and metabolism # Function: Aspartate oxidase # Organism: Fusobacterium nucleatum # 1 435 1 435 435 790 98.0 0 MKVENCDVVIIGSGVAGLICALTLDKNFRIILVTKKKLKDSNSYLAQGGISVCRGKEDRE EYIEDTLIAGHYKNDREAVEILVDESEEAIKTLIENGVKFTGDEKGLFYTREGGHRKFRI LYCEDQTGKYIMESLIERILERDNIKIIEDCEFLDIIEKENNCLGILAKKEEIFAIKSKF TVLATGGLGGIYKNTTNFSHIKGDGVAVAIRHNIKLKDISYIQIHPTTFYTKENERKFLI SESVRGEGAILLNQKLERFTDELKPRDKVTKAILEEMKKDKSEYEWLDFSTIKLDIKERF PNIYNNLMKKGINPLKDKVPIVPAQHYTMGGIKVDMNSKTSMKNLYAIGEVACTGVHGQN RLASNSLLESVVFAKRASQSIIDENNISVYNNITDDIFKNIVDKVIVSDEKENKNIIEKR IKEDEFEKNRQISNG >gi|296154068|gb|ADVK01000042.1| GENE 66 61093 - 61989 1092 298 aa, chain - ## HITS:1 COG:FN0008 KEGG:ns NR:ns ## COG: FN0008 COG0379 # Protein_GI_number: 19703360 # Func_class: H Coenzyme transport and metabolism # Function: Quinolinate synthase # Organism: Fusobacterium nucleatum # 1 298 1 298 298 517 94.0 1e-146 
MKDRIKKLQKEKDVAILAHYYVDGEVQEIANYVGDSFYLAKTATKLKNKTIIMAGVYFMG ESIKILNPEKIVHMVDIYADCPMAHMITIKKIKEMREKYDDLAVVCYINSTAEIKAYCDI CITSSNAVKIVSKLKEKNIFIVPDGNLASYIAKQIKDKNIILNEGYCCVHNLVHLENVIK LKNEYPNAKVLAHPECKEEILNLANYIGSTSGIIEEVLKGGDEFIIVTERGIQHKIYEKA PNKKLHFADTLICKSMKKNTLEKIEKILSDGGDELEVDDEIAKKALIPLERMLELAGD >gi|296154068|gb|ADVK01000042.1| GENE 67 62175 - 64061 2511 628 aa, chain - ## HITS:1 COG:FN0007 KEGG:ns NR:ns ## COG: FN0007 COG0445 # Protein_GI_number: 19703359 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: NAD/FAD-utilizing enzyme apparently involved in cell division # Organism: Fusobacterium nucleatum # 1 628 1 628 628 1181 98.0 0 MDKDYDVIVVGAGHAGVEAALASARLGNKTALITLYLDTISMMSCNPSIGGPGKSNLVTE IDVLGGEMGRHIDEFNLQLKDLNTSKGPAARITRGQADKYKYRRKMREKLEKTENISLIQ DCVEEILVEDIKDSQNSNYIKEVTGIKTRLGIIYNAKVIVLATGTFLKGKIVIGDVSYSA GRQGETSAEKLSDSLRELGIKIERYQTATPPRLDKKTIDFSQLEELKGEEHPRYFSLFTK KEKNNTVPTWLTYTSDKTIEVIKEMMKFSPIVSGMVNTHGPRHCPSIDRKVLNFPDKEKH QIFLEMESENSDEIYVNGLTTAMPAFVQEKILKTIKGLENAKIMRHGYAVEYDYAPASQL YPSLENKRISGLFFAGQINGTSGYEEAAAQGFIAGVNAAKKIKGEKPVIIDRSEAYIGVL IDDLIHKKTPEPYRVLPSRAEYRLTLRYDNAFMRLFDKIKEVGIVDKDKIEFLEKSINDV YMEINNLKNISVSMNEANKFLENLDIEERFVKGVKASEILKIKDVRYDDLKVFLNLNDYE DFVKNQIETMIKYEIFIERENKQIEKFKKLEHMYIPENINYDEIKGISNIARAGLDEVRP LSIGEATRISGVTSNDITLIIAYMNMKL >gi|296154068|gb|ADVK01000042.1| GENE 68 64282 - 65649 1825 455 aa, chain - ## HITS:1 COG:FN0006 KEGG:ns NR:ns ## COG: FN0006 COG0486 # Protein_GI_number: 19703358 # Func_class: R General function prediction only # Function: Predicted GTPase # Organism: Fusobacterium nucleatum # 1 455 1 455 455 775 99.0 0 MLLDTIAAISTPRGEGGISIVRMSGQDSLNILEKIFKPKNKKVSELKNYSINYGHIIDNE HIVDEVLVSIMKAPNTYTREDIIEINCHGGYLVTEKVLEVVLKNGARIAEIGEFTKRAFL NGRIDLTQAEAVIDVIHGKTEKSLSLSLNQLRGDLRDKIATIKKSVLDLAAHINVVLDYP EEGIDDPVPENLVENLKKASAEIKDLVSSYDKGKIIKDGIKTAIIGKPNVGKSSILNSLL REDRAIVTHIPGTTRDIIEEVININGIPLLLVDTAGIRNTDDIVENIGVEKSKELINSAD LILYVIDTSREIDEEDFRIYDIINTDKVIGILNKIDIKKEINLSKFPKIEKWIEISALSK 
LGIDNLENEIYKYIMNENIEDSSQKLVITNVRHKSALEKTNEALLNIIETIDMRLPMDLM AVDIKDALDSLSEVTGEISSEDLLDHIFSNFCVGK >gi|296154068|gb|ADVK01000042.1| GENE 69 65655 - 66398 1172 247 aa, chain - ## HITS:1 COG:FN0005 KEGG:ns NR:ns ## COG: FN0005 COG1847 # Protein_GI_number: 19703357 # Func_class: R General function prediction only # Function: Predicted RNA-binding protein # Organism: Fusobacterium nucleatum # 85 247 1 163 163 241 100.0 8e-64 MGIEKTIEIKAIDKEKALKRALNILGVELTENEAVEIVEKVKPRKKFFGLFGTEPGIYKV SIKTKEKESKEKKVSTQKSEKVEKMEKKERVETSHKNNIENTELEKEITEKVSFFVEKMK LDIKFKLKRIKERVYVVEFFGKDNALVIGQKGKTLNSFEYLLNSMIKNCRIEIDVERFKE KRNETLRILAKRMAEKVSKYGKTVRLNAMPPRERKIIHEVVNKYPDLDTYSEGRDPKRYI VIKKKRG >gi|296154068|gb|ADVK01000042.1| GENE 70 66400 - 67017 736 205 aa, chain - ## HITS:1 COG:FN0004 KEGG:ns NR:ns ## COG: FN0004 COG0706 # Protein_GI_number: 19703356 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit YidC # Organism: Fusobacterium nucleatum # 1 205 1 205 205 334 100.0 7e-92 MSYIYNLLKQFLAFLLNTTDKYVGNFGISIIIVTILIKIILLPLTLKQDKSMKEMKKLQP ELEKIKQKYANDKQMLNIKTMELYREHKVNPLGGCLPILVQLPILFALFGVLRSGIIPAD SSFLWMRLADPDPFYVLPVLNGAVSFLQQKLMGTSDNAQMKNMMYVFPIMMIVISYRMPS GLQLYWLTSSLIAVIQQYFIMKKGA >gi|296154068|gb|ADVK01000042.1| GENE 71 67014 - 67262 97 82 aa, chain - ## HITS:1 COG:FN0003 KEGG:ns NR:ns ## COG: FN0003 COG0759 # Protein_GI_number: 19703355 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 82 1 82 82 155 97.0 1e-38 MKKILILLIRFYQKFISPMFPAKCRFYPTCSQYTLEAIKDHGTIKGTYLGIRRILKCHPF HEGGYDPVPKRKNKNSEGKREE >gi|296154068|gb|ADVK01000042.1| GENE 72 67271 - 67651 377 126 aa, chain - ## HITS:1 COG:FN0002 KEGG:ns NR:ns ## COG: FN0002 COG0594 # Protein_GI_number: 19703354 # Func_class: J Translation, ribosomal structure and biogenesis # Function: RNase P protein component # Organism: Fusobacterium nucleatum # 16 126 1 111 111 166 99.0 1e-41 
MKNTGFLPRVVFGDMMNTLKKNGEFQNIYNLGNKYFGNYSLIFFNKNKLEYSRFGFVASK KVGKAFCRNRIKRLFREYIRLNINKINDNYDIIIVAKKKFGENIEDLKYKDIEKDLNRVF KNSKII >gi|296154068|gb|ADVK01000042.1| GENE 73 67658 - 67792 224 44 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|197735492|ref|YP_002164270.1| hypothetical protein FNP_0004 [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] # 1 44 1 44 44 90 100 2e-17 MKRTFQPNQRKRKKDHGFRARMSTKNGRKVLKRRRVRGRAKLSA >gi|296154068|gb|ADVK01000042.1| GENE 74 68217 - 68321 69 34 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MNIHLNFTLISIFILKYKNVISVTFIKKKFIFYY >gi|296154068|gb|ADVK01000042.1| GENE 75 68426 - 70339 2242 637 aa, chain + ## HITS:1 COG:no KEGG:FN0001 NR:ns ## KEGG: FN0001 # Name: not_defined # Def: chromosomal replication initiator protein DnaA # Organism: F.nucleatum # Pathway: not_defined # 1 637 1 637 637 995 99.0 0 MKKEKNIEDKDKELVEIIETENFEVSKTGSLADDLMQFENINDIKIENQEVPDIEVQEIY IRETGNYLNLQENFINIPIEMIYFPFFTPQKQNKRINFKYTFEDLGVTMYSTLIPKDKKD KVFQPSIFEEKIYTFLISMYQEKTNKKIDDSEVAIEFEISDFIVNFLGNKMNRAYYAKVE QALKNLKNTIYQFEISNHTKFGKNKFEDSSFQLLNYQKLKVGKKTFYRVILNKNIVNKIK SKRYIKYNTKNLLEIMIKDPIASRIYKYISKIRYKNNKGEINVRTLAAIIPLKIEQRVER IVKNGVKEYYLNRMKPVLTRILKAFEVLLELKYLLSFEEIYKKDENTYYIAYVFNKERDG DCHVSEFVKKTDKNIVKENLDGVEELIDVDADIEYQDNIEYLINKAKENPKISVKWNAWV DKKIQKILNENGEEMLKRVLNILIHMDKNIEIGLPNYISGILKNIGGKGSKKVKNTNMTI FENVSKGKGLKNKSQIKQARKKGMERISNFKEIMSENNFLENKSETKTEKLLLEEKISNS DLGMNRTIDNVGEKTYNIGEKNLDEILSHFDKKIRNEIEEKALEKIKKEIDNSNIDVILN VKKFSKTMYYKMIGATIMEILKSEYKEMLEDTNKNDK >gi|296154068|gb|ADVK01000042.1| GENE 76 70389 - 70604 325 71 aa, chain + ## HITS:1 COG:FN2129 KEGG:ns NR:ns ## COG: FN2129 COG2501 # Protein_GI_number: 19705419 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 71 1 71 71 112 100.0 2e-25 MKNIEKVKISTEFIKLDQFLKWLAVVDSGSDAKQVILDGKVKVNDEVETRRGRKIYPEYK VEIFDKIYVVE >gi|296154068|gb|ADVK01000042.1| GENE 77 70679 - 71728 693 349 aa, chain + ## HITS:1 COG:FN2128 KEGG:ns NR:ns ## COG: FN2128 
COG1195 # Protein_GI_number: 19705418 # Func_class: L Replication, recombination and repair # Function: Recombinational DNA repair ATPase (RecF pathway) # Organism: Fusobacterium nucleatum # 1 349 21 369 369 549 99.0 1e-156 MSDKINVFYGKNAQGKTSLLEAIYYSSTGISFKTKKTSEMIKYNFDEFISSISYQDYIAN NKISVRFKNIAGAKKEFFFNKKRISQTDFYGKINIIAYIPEDIILINGSPKHRRDFFDIE ISQIDKEYLTNLKNYDKLLKIRNKYLKENKRNSEEFAIYEKEFIKYASYIIFTRIEYVKS LSIILNLQYRKLFNIAQELNLRYETSLDKTAKVTIEMIQESLKREISQKKYQEDRYKFSL VGPHKDDYKFLLNGHEAKISASQGEKKSIIFSLKLSEIEIIKKNRKENPVVIIDDITSYF DEDRRKSILDFFNKRDIQVLISSTDKLDIEAKNFYVEKGIIKDESNINK >gi|296154068|gb|ADVK01000042.1| GENE 78 71706 - 71978 246 90 aa, chain + ## HITS:1 COG:no KEGG:FN2127 NR:ns ## KEGG: FN2127 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 90 1 90 90 127 100.0 1e-28 MKVISISEIATFKISNDDRIKLMILKEKWKELFLELSQNSSIIDFKENTIYIKGYNSAVK HYIFTNKIKLIGQILENLEIKFEIEDIKIK >gi|296154068|gb|ADVK01000042.1| GENE 79 71991 - 73898 2840 635 aa, chain + ## HITS:1 COG:FN2126 KEGG:ns NR:ns ## COG: FN2126 COG0187 # Protein_GI_number: 19705416 # Func_class: L Replication, recombination and repair # Function: Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit # Organism: Fusobacterium nucleatum # 1 635 5 639 639 1226 99.0 0 MSYEAQNITVLEGLEAVRKRPGMYIGTTSERGLHHLVWEIVDNSVDEALAGYCDKIEVKI LPENIIEVMDNGRGIPTDIHPKYGKSAMEIVLTILHAGGKFENDNYKVSGGLHGVGVSVV NALSEWLEVEVRKNGVVHYQKYHRGKPEENVKVIGSCDESEHGTIVRFKADGEIFETLIY NYFTLSNRLKELAYLNKGLTIILSDLRKEEKKEETYKFDGGILDFLNEIVKEDTTIIEKP FYVSSEQDNVGVDVTFTYTTSQNEVIYSFVNNINTHEGGTHVQGFRTALTKVINDVGKAQ GLLKDKDGKLMGNDIREGVVGIVSTKIPQPQFEGQTKGKLGNSEVSGIVNTIVSSSLKIF LEDNPNITKIIIEKILNSKKAREAAQKARELVLRKSVLEVGSLPGKLADCTSKKAEECEI FIVEGDSAGGSAKQGRDRYNQAILPLRGKIINVEKAGLHKSLESSEIRAMVTAFGTSIGE TFDIAKLRYGKIILMTDADVDGAHIRTLILTFLYRYMKDLITEGNIYIACPPLYKVSSGK QIIYAYNDLELKNILGQMNQDNKKYTIQRYKGLGEMNPEQLWETTMNPDGRLLLKVSIDN AREADMLFDKLMGDKVEPRREFIEEHAEYVKNIDI >gi|296154068|gb|ADVK01000042.1| GENE 80 74044 - 76479 3319 
811 aa, chain + ## HITS:1 COG:FN2125 KEGG:ns NR:ns ## COG: FN2125 COG0188 # Protein_GI_number: 19705415 # Func_class: L Replication, recombination and repair # Function: Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit # Organism: Fusobacterium nucleatum # 1 811 1 811 811 1443 100.0 0 MSNVDNRYIEEELKESYLDYSMSVIVSRALPDVRDGLKPVHRRILFAMNEMGMTNDKPFK KSARIVGEVLGKYHPHGDSAVYGTMVRMAQDFNYRYLLVEGHGNFGSIDGDSAAAMRYTE ARMEKITAELLEDIDKDTIDWRKNFDDSLDEPTVLPAKLPNLLLNGAIGIAVGMATNIPP HNLGELVDGILAIIDNKDIEILELMNYIKGPDFPTGAIIDGRVGIIDAYKTGRGKIKVRG KVDIEELKNGKSNIIVSEIPYQLNKANLIEKIANLVKEKKITEISDLRDESNREGIRILI EVKKGEEPELVLNKLYKYTDLQTTFGVIMLSLVNNVPRVLNLKEMLNEYIKHRFDVITRR TAFDLDKAEKRAHILKGYQIALENIDRIIELIRASSDGTVAREQLIEKYAFTDIQARSIL DMKLQRLTGLEREKIDTEFKEIETLIKELREVLEDNNKIYDIMKKELLELKEKYGDKRRT KIEEERMEILPEDLIKDEEIIITYTNKGYVKRIEASKYKAQRRGGKGVSALNTIEDDYAE KITSASTLDTIMVFTDRGKVYNIRAYEIPDLSRQSRGRLLSNIINLSEGEKVRDTIVIKE FSPEKEVVFITKKGLIKKTSLGEFKNINNSGLIAIKTREDDDLIFVGLIEDVNKEEILIA THDGYCTRFLTDTIRATGRSTQGVKAITLREGDAVVSAMLIKNPETDILTITENGYGKRT SLDEYPQYNRGGKGVINLKASEKTGKVVSVLEVTEDEELMCITSNGIVIRTSISEISRIG RATQGVRIMKVADDEKVAAITKIKKEEELED >gi|296154068|gb|ADVK01000042.1| GENE 81 76492 - 76953 445 153 aa, chain + ## HITS:1 COG:FN2124 KEGG:ns NR:ns ## COG: FN2124 COG0622 # Protein_GI_number: 19705414 # Func_class: R General function prediction only # Function: Predicted phosphoesterase # Organism: Fusobacterium nucleatum # 1 153 1 153 153 285 98.0 2e-77 MKKILVLSDSHSYFDKVLKIFEKEKPDIVIGAGDGIKDIEELSYVYPKAKYYMVKGNCDF FDRSHNEENLFEIDGIKVFLTHGHLYDVKRTLNSIKEIGKKLNVSLIVFGHTHKPYIEKY GNMTLFNPGATEDGRYGIIILEQGNIELFHKQL >gi|296154068|gb|ADVK01000042.1| GENE 82 76980 - 77996 1454 338 aa, chain + ## HITS:1 COG:FN2123 KEGG:ns NR:ns ## COG: FN2123 COG0016 # Protein_GI_number: 19705413 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Phenylalanyl-tRNA synthetase alpha subunit # Organism: Fusobacterium nucleatum # 1 338 1 338 338 677 99.0 0 
MKEEILKVKEEIQNYIKESKTLQRLEEIRVNYMGKKGIFTELSKKMKDLSVEERPKIGQI INEVKEKINSLLDERNKALKEKELNERLESEIIDISLPGTKYNYGTIHPINETMELMKNI FSKMGFDIVDGPEIETVEYNFDALNIPKTHPSRDLTDTFYLNDSIVLRTQTSPVQIRYML EHGTPFRMICPGKVYRPDYDISHTPMFHQMEGLVVGKDISFADLKGILTHFVKEVFGDRK VRFRPHFFPFTEPSAEMDVECMICHGDGCRLCKESGWIEIMGCGMVDPEVLKYVGLNPDE VNGFAFGVGIERVTMLRHGIGDLRAFFENDMRFLKQFK >gi|296154068|gb|ADVK01000042.1| GENE 83 78010 - 80406 3350 798 aa, chain + ## HITS:1 COG:FN2122_2 KEGG:ns NR:ns ## COG: FN2122_2 COG0072 # Protein_GI_number: 19705412 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Phenylalanyl-tRNA synthetase beta subunit # Organism: Fusobacterium nucleatum # 146 798 1 653 653 1196 98.0 0 MLISLNWLKQYVDIKESIDEIANALTMIGQEVEAIDIQGKDLDNVVIGQIVEFDKHPNSD RLTLLKVNVGGEEPLQIICGAKNHKLNDKVVVAKIGAVLPGNFKIKKSKIRDVESYGMLC SEAELGFAKESEGIIILPEDAPIGTEYREYMNLNDVIFELEITPNRPDCLSHIGIAREVA AYYNRKVKYPMIEITETIESINTMVKVDIEDKDRCKRYMGRVIKNVKVQESPDWLKSRIR AMGLNPINNIVDITNFVMFEYNQPMHAFDLDKLEGNITIRAAKKNEKITTLDGVDRVLKN GELVIADDEKAIAIAGVIGGQNTQIDNETKNIFVEVAYFTPENIRKTSRELGIFTDSAYR NERGMDVENLNVVMARAVSLIAEVTGGDVLSEVIDKYVEKPQRAEISLNLEKLNKFIGKN LTYDEVGKILTHLDIELKPLGEGTMLLIPPSYRADLTRPADIYEEVIRMYGFDNIEAKIP VMSIESGEENINFKMPRIVRGILKELGLNEVINYSFIPKFTKELFNFGDEVIEIKNPLSE DMAIMRPTLLYSLITNIKDNINRNQTDLKLFEISKTFKNLGAEKDELAIEDLKIGIILAG REDKNLWNQSKTDYNFYDLKGYLEFLFERLNITKYSLTRLKNNNFHPGASAEIKIGEDII GVFGELHPNLVNYFGIKREKLFFAELNLTKLLKYIKIKVNYESISKYPEVLRDLAITLDR DILVGDMIKEIKKKVALIEKIDIFDVYSGDKIDKDKKSVAMSIILRDKNRTLTDGDIDTA MNTILELIKDKYNGEIRK >gi|296154068|gb|ADVK01000042.1| GENE 84 80437 - 81168 1167 243 aa, chain + ## HITS:1 COG:FN2121 KEGG:ns NR:ns ## COG: FN2121 COG2849 # Protein_GI_number: 19705411 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 14 243 1 230 230 389 99.0 1e-108 MKKLLAGLFLLGSMLAFAAGEQRVPIEKVELNQQTSLVYLQGQQIPFTGIVEKKYASGKL EAALEFKDGKLNGKTLVYNENGKMKTEENYVNGALDGVSKSFYANGSVEFETTFRNNVKE 
GVEKHYSPSGRVETEVLFKNNVANGIAKQYNAEGKLEYETMIVNGKREGLSKKHYPSGKL LSEVTFRNDKEEGMMKGYFENGKLQLEIPYKNGQVDGLVKRYDENGQVVEQATFKNGQEI KAK >gi|296154068|gb|ADVK01000042.1| GENE 85 81274 - 81843 835 189 aa, chain + ## HITS:1 COG:FN2120 KEGG:ns NR:ns ## COG: FN2120 COG2849 # Protein_GI_number: 19705410 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 16 189 16 189 189 285 99.0 3e-77 MKKLLLGLFLLASALSFAAGRVLRDKEAEFKDGVVYVKGENKAYTGTLEGYNERGVLETR KEFKNGKMDGSSKIFYPNGKVLSEATFKDGNQVGVQKDYYEDGKIKAELPYKNDVIEGTM KEYYPNGKLKSNISMKNGKRDGLEKIYYENGKLKYELNYKNGEPYGNMKLYDETGNLVGE GPYFEVTTK >gi|296154068|gb|ADVK01000042.1| GENE 86 82035 - 83511 1992 492 aa, chain + ## HITS:1 COG:FN1817 KEGG:ns NR:ns ## COG: FN1817 COG3210 # Protein_GI_number: 19705122 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 14 482 2 441 2806 157 31.0 4e-38 MTDKKILFVKGYFRNKLLRKTITFIMLLLFSFNIFAEVVPDPASIGTRATKTASGVDQLD IATPNKNGTSYNSLKELQVSEQGLILNNNKDIVVNTKTAGYVARNRNLDNSAAANLIITE VTGKNRTNINGTVEVAGKRADLVMANRNGIYVNGGNFLNTDRVTLTTGSLQMKNGDLVGI DVSQGQIGIGGKGLDALGLTELELLGKTIDIAGIIKTSKETRLLVSAGGQTYEYKTKEVK SKGETYEGIAIDGKAVGSMYAGKIDIISNDKGAGVNTKGDLVSVDDIVLTANGDITTVKV DSGKDLKYKTTQKVKMNGKTTVAKKVKIKAKETEINAKVVTGYLEKALGKKSLDIESEKT NITAKIEAQGKIRINSNQIQNSGEIFATEKVNIAGNKLNNNNGEIRSNQKIEINTKEASN VKGYILSDGLTKEDVKKEENKNTKNTEEVKNKEKGINITGDLDNTEGVIRGREISLGNLT GNNKGKIDSIGA Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:52:12 2011 Seq name: gi|296154045|gb|ADVK01000043.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00053, whole genome shotgun sequence Length of sequence - 28328 bp Number of predicted genes - 23, with homology - 22 Number of transcription units - 13, operones - 5 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 67 - 1083 1346 ## COG1052 Lactate dehydrogenase and related dehydrogenases - Prom 1263 - 1322 21.6 2 2 Tu 1 . + CDS 1461 - 2738 1981 ## COG0334 Glutamate dehydrogenase/leucine dehydrogenase + Term 2789 - 2824 5.1 - Term 2814 - 2862 3.0 3 3 Op 1 1/0.800 - CDS 2871 - 3737 1053 ## COG0682 Prolipoprotein diacylglyceryltransferase 4 3 Op 2 1/0.800 - CDS 3764 - 4546 702 ## COG2035 Predicted membrane protein 5 3 Op 3 . - CDS 4551 - 5630 1100 ## COG0787 Alanine racemase - Prom 5663 - 5722 11.5 6 4 Op 1 . - CDS 5766 - 5915 229 ## 7 4 Op 2 . - CDS 5935 - 6864 979 ## FN0493 hypothetical protein - Prom 6886 - 6945 16.5 + Prom 6991 - 7050 13.9 8 5 Op 1 10/0.000 + CDS 7151 - 7870 280 ## PROTEIN SUPPORTED gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 9 5 Op 2 . + CDS 7929 - 9137 1993 ## COG0183 Acetyl-CoA acetyltransferase + Term 9177 - 9221 -0.9 + Prom 9165 - 9224 9.7 10 6 Tu 1 . + CDS 9311 - 9784 379 ## FN0665 N-acetylmuramoyl-L-alanine amidase (EC:3.5.1.28) + Term 9812 - 9850 3.7 - Term 9963 - 10011 7.0 11 7 Op 1 . - CDS 10042 - 14682 6057 ## FN0498 hypothetical protein 12 7 Op 2 . - CDS 14707 - 16956 2788 ## COG1629 Outer membrane receptor proteins, mostly Fe transport - Prom 17002 - 17061 7.7 + Prom 16962 - 17021 10.6 13 8 Tu 1 . + CDS 17146 - 18096 1011 ## COG2342 Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase - Term 18066 - 18111 9.1 14 9 Tu 1 . - CDS 18121 - 20472 3436 ## COG1982 Arginine/lysine/ornithine decarboxylases - Prom 20516 - 20575 15.0 - Term 20554 - 20596 4.8 15 10 Op 1 1/0.800 - CDS 20597 - 21181 837 ## COG0279 Phosphoheptose isomerase 16 10 Op 2 4/0.000 - CDS 21198 - 22106 991 ## COG0583 Transcriptional regulator 17 10 Op 3 4/0.000 - CDS 22182 - 22928 788 ## COG0531 Amino acid transporters 18 10 Op 4 1/0.800 - CDS 22783 - 23562 642 ## COG0531 Amino acid transporters 19 10 Op 5 1/0.800 - CDS 23579 - 24310 1194 ## COG2071 Predicted glutamine amidotransferases 20 10 Op 6 . 
- CDS 24337 - 26046 2199 ## COG0018 Arginyl-tRNA synthetase - Prom 26266 - 26325 10.9 + Prom 25923 - 25982 5.4 21 11 Tu 1 . + CDS 26015 - 26215 152 ## FN0507 hypothetical protein 22 12 Tu 1 . - CDS 26342 - 27187 1295 ## COG4667 Predicted esterase of the alpha-beta hydrolase superfamily - Prom 27250 - 27309 14.4 + Prom 27214 - 27273 13.6 23 13 Tu 1 . + CDS 27312 - 28319 1662 ## COG1052 Lactate dehydrogenase and related dehydrogenases Predicted protein(s) >gi|296154045|gb|ADVK01000043.1| GENE 1 67 - 1083 1346 338 aa, chain - ## HITS:1 COG:FN0487 KEGG:ns NR:ns ## COG: FN0487 COG1052 # Protein_GI_number: 19703822 # Func_class: C Energy production and conversion; H Coenzyme transport and metabolism; R General function prediction only # Function: Lactate dehydrogenase and related dehydrogenases # Organism: Fusobacterium nucleatum # 1 338 1 338 338 667 99.0 0 MKVLFYGVREVEIPLFHELNKKEGFGYELELIPDYLNSKETAEKAKGFECVVLRGNCFAT KEVLDMYKEYGVKYLLTRTVGTNHIDVKYAKELGFKLAYVPFYSPNAIAELAVSLAMSLL RHLPYTAEKFKNRNFTVDAQMFSKEVRNCTVGVIGLGRIGFTAAKLFKGLGANVIGYDMF PKTGVEDIVTQVPMDELIKKSDIITLHAPFIKENGKIVTKEFLNNMKENSILINTARGEL MDLEAVVEALESGHLAAAGIDTIEGEVNYFFKNFSDKQAEFRADYPLYNRLLDLYPRVLV TPHVGSYTDEAASNMIETSFENLKEYLDTGACKNDIKA >gi|296154045|gb|ADVK01000043.1| GENE 2 1461 - 2738 1981 425 aa, chain + ## HITS:1 COG:FN0488 KEGG:ns NR:ns ## COG: FN0488 COG0334 # Protein_GI_number: 19703823 # Func_class: E Amino acid transport and metabolism # Function: Glutamate dehydrogenase/leucine dehydrogenase # Organism: Fusobacterium nucleatum # 1 425 15 439 439 809 100.0 0 MSKETLNPLESGQKQVKKACDALGLDPAVYELLKEPQRIIEITIPVKMDDGSIKTFKGYR AAHNDAVGPFKGGIRFHQNVNSDEVKALSLWMSIKCQVTGIPYGGGKGGITVDPSELSQR ELEQLSRGWVRGMWKYLGEKVDVPAPDVNTNGQIMAWMQDEYNKLSGEQTIGVFTGKPLS YGGSQGRNEATGFGVAVTMREAFKALGKNLKGATVAVQGFGNVGKFTVKNIMKLGGKVVA VAEFEKGKGAYAIYKDSGFTFEELEAAKAAGSLTKVAGAKELSMDEFWALNVEAIAPCAL ENAITNHEAELIKAGIVCEGANGPITPEADEVLYKKGIVVTPDVLTNAGGVTVSYFEWVQ NIYGYYWTEKEVEEKEERAMVDAFTPIWALKKEFDGKGQPISFRQATYMKSIKRIAEAMK IRGWY 
>gi|296154045|gb|ADVK01000043.1| GENE 3 2871 - 3737 1053 288 aa, chain - ## HITS:1 COG:FN0489 KEGG:ns NR:ns ## COG: FN0489 COG0682 # Protein_GI_number: 19703824 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Prolipoprotein diacylglyceryltransferase # Organism: Fusobacterium nucleatum # 1 288 1 288 288 508 99.0 1e-144 MNPVFLKIGPIELHYYGLMYAIAFFVGISLGKKIAKERNFDLDLVENYAFVAIISGLIGG RLYYILFNLPYYLQNPFEILAVWHGGMAIHGGILGGIAGTLIFAKIKKINPLILGDFAAG PFILGQAIGRIGNFMNGEVHGVPTFTPFSVIFNVKPKFYEWYTYYQSLSISDKANYPDLV PWGVVFPTSSPAGSEFPNLALHPAMLYELILNLIGFFIIWFILRKKENKASGYMWWWYII IYSINRIIVSFFRVEDLMFFNFRAPHVISIILIAVSIFFLKKDNKKIF >gi|296154045|gb|ADVK01000043.1| GENE 4 3764 - 4546 702 260 aa, chain - ## HITS:1 COG:FN0490 KEGG:ns NR:ns ## COG: FN0490 COG2035 # Protein_GI_number: 19703825 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 260 1 260 260 398 100.0 1e-111 MILLFFKSIIIGVANIIPGVSGGTLAVMLNVYDPITEKIGNFFLVDRKTKVSYFFYLLVV FVGAATGIFLFANIIKYSITNYPRITVTVFTLLILPSIPYIVKGLDYKKKKNILAFCYGA IIMIFFILLGLKYGDKTTGAVTIQLAEGICFTKGYLIKLFFCGVVAAGAMIIPGISGSLL LIMLGEYYNVVYLISSLTLSLKERSFTIFIPLLMLALGIGGGLVAISKAINYLLKNHREF TLFFIEGIITFSIIQMWLSI >gi|296154045|gb|ADVK01000043.1| GENE 5 4551 - 5630 1100 359 aa, chain - ## HITS:1 COG:FN0491 KEGG:ns NR:ns ## COG: FN0491 COG0787 # Protein_GI_number: 19703826 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Alanine racemase # Organism: Fusobacterium nucleatum # 1 359 1 359 359 666 99.0 0 MNTSFYVSLDKNALYHNIEYLREYKQKELLPVIKANAYGHNYLLIAKALYDFNIKTWAAA RFSEAITITEYMMNNFSINDFKILVFESLDDYYSFLEKYPQICPTINSIKDLKNALANNI PIDRLSLKIDFGFGRNGIKAEEVDELKNLIKFNSLKFLGIFSHLFSSTYTDGLEVIRKFT EVVNKLGKDNFEMVHLQNAAGIYNYNVEVVTHIRTGMLTYGLQEAGFYDHDLKPVFTGLI GYVDSVRYVNELDYVAYEDLTSISPKTKKIAKIKIGYGDGFPKANNKTTCLIKKKEYVIS QVTMDNTFIEVDDRVNVGDKVHLYHRPNEMKTKTGLSMLEALIALSPLRVKRIFEGEEN >gi|296154045|gb|ADVK01000043.1| GENE 6 5766 - 5915 229 49 aa, chain - ## HITS:0 COG:no KEGG:no NR:no 
MSISVDNLIKILPAIIVILLGIWLLKKFFNILIAVIFIAVVIFVVSKYI >gi|296154045|gb|ADVK01000043.1| GENE 7 5935 - 6864 979 309 aa, chain - ## HITS:1 COG:no KEGG:FN0493 NR:ns ## KEGG: FN0493 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 309 1 309 309 518 99.0 1e-145 MKDIRLLELYDRLLKNEDIDIKKYAEENKVNIRTAERDIKTIRNFLAKKNKTELIHNSKK KKYQLTYAEDSINLTKSEILAVSKILLASRAFLKDEISLIVDKIVKQCSSEDDLKSIQNL LKNEKFHYVELQHNKSFIKHIWDFGDAIKNKKKLEISYKKMDGKIVKRVVNPVGLMFSEF YFYLLAHIENIDKEKHFNNKDDEYPTIYRIDRIEDFKILNEKFTPTLYTNRFQEGKFRKQ VQFMTGGKLRKIKFIYKTNSIEALLDKIPTAKILEKNKDTYLISAQVFGNGIDRWILSQG NAIEVIEDN >gi|296154045|gb|ADVK01000043.1| GENE 8 7151 - 7870 280 239 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163739489|ref|ZP_02146899.1| 50S ribosomal protein L17 [Phaeobacter gallaeciensis BS107] # 7 235 4 238 242 112 33 3e-24 MNRLEGKVAVVTGSARGIGRAIVEKLAAHGAKMVISCDMGETSYEQGNVVHKILNVTDRE AIKAFVDEVEKEYGKIDILVNNAGITKDGLLMRMTEDQWDAVINVNLKGVFNMTQAVSRS MLKARKGSIITLSSVVGLHGNAGQTNYAATKGGVVAMSKTWAKEFGGRNVRANCVAPGFI QTPMTDVLSEDTIKGMLDATPLGRLGQVEDIANTVLFLASDESSFITGEVISVSGGLML >gi|296154045|gb|ADVK01000043.1| GENE 9 7929 - 9137 1993 402 aa, chain + ## HITS:1 COG:FN0495 KEGG:ns NR:ns ## COG: FN0495 COG0183 # Protein_GI_number: 19703830 # Func_class: I Lipid transport and metabolism # Function: Acetyl-CoA acetyltransferase # Organism: Fusobacterium nucleatum # 1 402 1 402 402 754 100.0 0 MSKVYVVAAKRSAIGSFLGTLSPLKPGDLGAQIVKNILEETKVDPANIDEVIVGNVLSAG QAQGVGRQVAIRAGIPYEVPAYSVNIICGSGMKSVITAFSNIKAGEADLVIAGGTESMSG AGFILPGAIRGGHKMADLTMKDHMILDALTDAYHNIHMGITAENIAERYGITREEQDAFA LDSQLKAIAAVDSGRFKDEIAPVVIPNKKGDIIFDTDEYPNRKTDAEKLAKLKPAFKKDG SVTAGNASGLNDGASFLMLASEEAVKKYNLKPLVEIVATGTGGVDPLVMGMGPVPAIRKA FNKTDLKLKDMELIELNEAFAAQSLGVIKELCKEHGVTPEWIKERTNVNGGAIALGHPVG ASGNRITVTLIYEMKKRGVEYGLASLCIGGGMGTALILKNVK >gi|296154045|gb|ADVK01000043.1| GENE 10 9311 - 9784 379 157 aa, chain + ## HITS:1 COG:no KEGG:FN0665 NR:ns ## KEGG: FN0665 # Name: not_defined # Def: N-acetylmuramoyl-L-alanine amidase 
(EC:3.5.1.28) # Organism: F.nucleatum # Pathway: not_defined # 23 155 12 148 153 80 39.0 3e-14 MKKLILVVMFLILCNLGFSKTSFEVSFNDGTSVNLREKASSNSKILAKLEIFDGGEVIKK EGDWYYIKYRTESEKILYGYIHESQGFLVETYVVSSKDGYANIRWEPSSNGKIAGTEKNG TILEVYDEKGEWLHITYGDSPHFPVAYVHKSQVKKEE >gi|296154045|gb|ADVK01000043.1| GENE 11 10042 - 14682 6057 1546 aa, chain - ## HITS:1 COG:no KEGG:FN0498 NR:ns ## KEGG: FN0498 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 964 1546 1 583 583 959 93.0 0 MKIKRLIMILLIVAATVSYADERTDTIKQIDKIIDNAITKAIESDKTNPYLKQSRIKSIK EQILSAIKKDLDEPGLTRLDIPEILPKIDKEIKKIGDGIGPKNNYRIIQESKEVIQPVEQ TAYEGFLRVVDTQNRMNGVKSNYWEKGNVKAQRTHNGVDLVPIAHVVYENAASDLREEPN SNSKIRRSSHTIADLAEKGIGAFTKEDYSELAYKNSNQFYEKRFYFGAGNTVKDIIFLNK EKFSSEVENTKKNKNERYYIEGEYKLVTTNATEDERSKSFGAEKDVNPLDITMKEYRTRI EGKSKAEISAFLKEKMAQKNIKNVIQEGEDLYTVDSKGRKWKVDWKLEPVSVESGSKTEY KDTVFTTINYYSPFDDKSTTDNRGKLLYTKDGSIYAQDKNKYTNDVNLKLTETETKVETK IKKMLKLTKNKYGPFEGISKADFDAKRSEYTDPTKYDYYYDDDDERYYVSKITRISKEFE TTDTKSIEEFKKYAQNLVEENINKEIKENVKVKKDVTKSLNDFIATAKERVEKGEAPRNQ FDQYFYDKKHLSKEDFENKWVKPFKNPEYIKAKENYERELAETTAKRDEAEKAKDKYTKI TDTLFDKITQNVGRGVISYTDFYEDTITKTLISEAKLEQLINSKPELNTEEKKNLVREYY KEWKNFENSSKIFYTLNSQVESGIAKKYGFWDNPAATPEQRKYVGLNGLIKDLDLTRSIA GKNIEFRGIGRIEGTVDLGEGKNTLKIAEQMTGQYGTNITLGAYAKLKNIDTVEVGGSLT LDNAQASISGRTSLRMDIDATKKNSEGHYYQHALKDSDPNIRFIKYGTTNMDSRNDFMIE LLTSKITENEAIIDMGRKIDYTWHDIKTGKDYDMTIPFVSDSIAHQLINNKKLSKNGTSL LELKTREELRRLNSDENAVYRSIRNANKLGILSPTLTTTNKKTTFNTVDQEKEAKKKKDL LTYLKTKTDEELVNDLSQFNLSETEKKEALELVKKLKNSDDFKSIMNKEKELKTKLDEIN KLEKDSNYQRLHFQEIIGKIESDFNGLSNKVYSLYPNAKDLEKQFEIDLKREDPYTLVRK AVLLADKIPELKTKLDSIKNELKTKLNDTRELLKKDLATIKELKAKYPNSKFGDIEKTIE TILSGDNLERMVLSSRDYNKNSDLADLVSDFKKLASDISLQLKEKEDIEKALKENDASIE QPRYADYHRLKSKIFYTMREEEVLSELKNMLNQLSDRNIYSKLNKISKNEISTYTTIPFE VTHALTDKKNIARGGFISNRTVQDNFKGNIYTAYGLYETTANSNTKYGILFGGANTKHNE VYQRSLTTVATESEIKGVSAYVGGYFNKPVINNLNWITGIGTQYGRYKVKREMRNNYQDL HSEGKVNTASLNTYSGFIINYPIQEDVFVQLKGLLAYTMIKQSKINESGDLPLDINGKTY 
HYVDGEAGISFNKIFYGEDLKSSISAGAYGILGLSGYKNANMEAKINGSSSSFGIKGDRV KRDAIKIHLDYNVQTDVGYTYGLEGTYITNSKENNVKIGIKGGYVF >gi|296154045|gb|ADVK01000043.1| GENE 12 14707 - 16956 2788 749 aa, chain - ## HITS:1 COG:FN0499 KEGG:ns NR:ns ## COG: FN0499 COG1629 # Protein_GI_number: 19703834 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Fusobacterium nucleatum # 7 749 1 743 743 1291 93.0 0 MKKLLVLLTILSSIIAYAEDTIELNQTTVKGSKTSDYTAPPKEQKNTFVITQERIREKNY KNVEDILRDAPGVVVQNTAFGPRIDMRGSGEKSLSRVKVLVDGVSINPTEETMASLPINA IPVESIKKIEIIPGGGATLYGSGSVGGVVSISTNSNVTKDNFFMDLNYGSYDNRNFGFAG GYNFNKNLYVNYGFSYLNSEDYREHEEKENKIYLLGFDYKINAKNRFRFQTRFSDIKQDS SNQIPVEELKNNRRKAGLNMDIDTKDRSYTFDYEYRPTQNITLSTSLYKQEQDRDINTES IDDIKIIASNRRFSHITQEKIFYDVKSEMQAKFEEDKKGLKVKAKFDYNLVGDNVSETIL GFDYQTSTNKRNSLVQSETLKNYYDSSIGGFRNLDSADRSPIINKVDMKMTKKSEGIYIF NKWGLSNWLDVTLGGRMERTKYNGYRENGPNVMPYVTPEKKRIDTDEKLTNYAGELGLLF KYNDTGRIYTRYERGFVTPFGNQLTDKIHDTELKNKQAGIIVPPSVNVASKYVANNLKSE KTDTFEIGFRDYIWGSSISTSFFLTDTTDEITLISSGVTNPAVNRWKYRNIGKTRRLGIE FEAEQNIGKFRFNQSLTLVRTKVLIANDEARIAKGDQVPMVPRLKATLGVKYNFTDRLSG YLNYVYLAKQESRELRENPDISKDDVVVKHTIGGHGTLEAGLSYKPDNYSDIKIGAKNIL SNKYNLRETSLEALPAPKRNYYLQLNVRF >gi|296154045|gb|ADVK01000043.1| GENE 13 17146 - 18096 1011 316 aa, chain + ## HITS:1 COG:FN0500 KEGG:ns NR:ns ## COG: FN0500 COG2342 # Protein_GI_number: 19703835 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase # Organism: Fusobacterium nucleatum # 128 316 1 189 189 314 93.0 1e-85 MINIKKYILLLLFSISCFSFSAVNNIYKERMRDFIKELRNNTNKEKIIITQNGSELYFKK GKIDNKFFAITNGTTQESLYYGDELRFNVPTPKGLKNELLELTVPIRKKGKPVFVINYGK GKKKREFLKKEDLKTKFVSELLPSFNADKLYETIKDYNDEDINSLNEVKNFLCFLNSENF SDIDEYYQALKNTNYDLLLIEVSYKNTFFTKEQIEDLKIKNNGGKRIVIAYLSIGEAEDY RFYWNKKWNKKKPNWIIKESENWEGNYIVKYWSPEWKDIIKEYQKKLDEIGVDGYLLDTI DTYRYFEENYKGTVIN >gi|296154045|gb|ADVK01000043.1| GENE 14 18121 - 20472 
3436 783 aa, chain - ## HITS:1 COG:FN0501_1 KEGG:ns NR:ns ## COG: FN0501_1 COG1982 # Protein_GI_number: 19703836 # Func_class: E Amino acid transport and metabolism # Function: Arginine/lysine/ornithine decarboxylases # Organism: Fusobacterium nucleatum # 1 503 1 503 503 1002 100.0 0 MSKLDQNKTPLFTVLKDEYVRRNILPFHVPGHKRGKGVDEEFYNFMGEAPFSIDVTIFKM VDGLHHPKSCIKESQELVADAYGVKHSFFAVNGTSGAIQAMIMSVVKAGEKILVPRNVHK SVSAGIILSGSEPVYMNPEIDENLGIALGVKPQTVENMLKQDPDIAAVLIINPTYYGVAT DIKKIADIVHSYDIPLIVDEAHGPHLHFHDELPVSAVDAGADICTQSTHKILGAMTQMSL IHVNSDRVNVEKVKQILSLLHTTSPSYPLMASLDCARRQIATQGQELLTRTIELAKYFRR EANRIPGIYCFGEELVGKDGFFAFDPTKITISAKELGLKGGELESLLVDDYNIQMELSDY YNTLGLITIGDTEESVNKLLDALRDISRRFFGKGKTLEKNIIKLPETPELVLMPREAFYS EKNKVPFKESVGKISGEMIMAYPPGIPIIIAGERISQDIIDHIEELKEADLHIQGMEDPE LETINVIEEEDAIYLYTEKMKNVLIGVQTNLGVNKTGTEFGPDDLIQAYPDTFDEMELIS VERQKEDFNDKKLKFKNTVLDTCEKIAKRVNEAVIDGYRPILVGGDHSISLGSVSGVSLE KEIGVLWISAHGDMNTPESTLTGNIHGMPLALLQGLGDRELVNCFYEGAKLDSRNIVIFG AREIEVEERKIIEKTGVKIVYYDDILRKGIDNVLDEVKDYLKIDNLHISIDMNVFDPEIA PGVSVPVRRGMSYDEMFKSLKFAFKNYSVTSADITEFNPLNDINGKTAELVNGIVQYMMN PDY >gi|296154045|gb|ADVK01000043.1| GENE 15 20597 - 21181 837 194 aa, chain - ## HITS:1 COG:FN0502 KEGG:ns NR:ns ## COG: FN0502 COG0279 # Protein_GI_number: 19703837 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoheptose isomerase # Organism: Fusobacterium nucleatum # 1 194 1 194 194 336 100.0 1e-92 MNLISSYKTEFELLRKFIEEEEERKETEKVAQKLANIFTKGKKVLICGNGGSNCDAMHFI EEFTGRFRKERRALPAISISDPSHITCVANDYGFEYIFSKGVEAYGQEGDMFIGISTSGN SPNVIKAVEQAKAQGLVTVGLLGKDGGKLKGMCDYEFIIPGKTSDRVQEIHMMILHIIIE GVERIMFPENYVEE >gi|296154045|gb|ADVK01000043.1| GENE 16 21198 - 22106 991 302 aa, chain - ## HITS:1 COG:FN0503 KEGG:ns NR:ns ## COG: FN0503 COG0583 # Protein_GI_number: 19703838 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 302 1 302 302 485 100.0 1e-137 MSLGVIILDLHYLEIFYEVAKAKSFTKAAEKLFINQSAVSIQVKKFEDILKVKLFDRSSK 
KIKLTYIGETLYKMAEDIFEKVKRAEKEISRVIEVDRARIAIGASSIIAEPLLPSLMKDF SSAHEEIEYNVTISNKEHLLKLLKEGELDVIIIDSQHITDSNLEIVSIEKGPYVLISSQA YLNVEDIEKDPIITRNTIPNNNKAIEVIEDRYGINFSTKINVVGNLEVIKGMVREGVGNV ILPYYAVYKDIKKGDFKVISKVDEVKDGYELIITKDKKDLSQITKFINIVKNHKIVMEST RH >gi|296154045|gb|ADVK01000043.1| GENE 17 22182 - 22928 788 248 aa, chain - ## HITS:1 COG:FN0504 KEGG:ns NR:ns ## COG: FN0504 COG0531 # Protein_GI_number: 19703839 # Func_class: E Amino acid transport and metabolism # Function: Amino acid transporters # Organism: Fusobacterium nucleatum # 20 248 225 453 453 313 97.0 2e-85 MLLLVLKVWQVVLLTWKNLKKNLPRAIPLAIAIIAAIYFGIVFVSMYIDPVAMVTSKDPV VLASVFKNQLLQKIIVIGALMSMFGINVAASFHTPRVFEAMAKEKQVPDFFAKRTKSGLP LTSFILTAVIAVVIPLAFNYNMAGIIIISSISRFIQFIVVPVAVIVFFYGKSKEEILNAN KNFIIDVIIPIIALLLTILLLVKFNWTQQFSTKLDDGTVTVNLKAVVSMVIGYLILPIVL RIYMRTKK >gi|296154045|gb|ADVK01000043.1| GENE 18 22783 - 23562 642 259 aa, chain - ## HITS:1 COG:FN0504 KEGG:ns NR:ns ## COG: FN0504 COG0531 # Protein_GI_number: 19703839 # Func_class: E Amino acid transport and metabolism # Function: Amino acid transporters # Organism: Fusobacterium nucleatum # 7 232 1 226 453 358 100.0 5e-99 MENNQKMKFWSIVLLTINSIIGTGIFLSPGAVAKLVGDKAATIYLAAAVFAAVLAVTFAA ASKYVVKSGAAYAYSKAAFGDEVGSYVGITRVVSASIAWGVMATGVVKTTLSIFGQDSSN VKNVTIGFITLMFVLLIINLIGTKLLTLISNISTIGKIGALGITIIAGIFILVFSDGANL QGLTLLKDAEGNNLIPEFTTSVFVTALVGAFYAFTGFESVASGSADMEKPEKKSSKSNST SYCNNCCYIFWNSICFNVY >gi|296154045|gb|ADVK01000043.1| GENE 19 23579 - 24310 1194 243 aa, chain - ## HITS:1 COG:FN0505 KEGG:ns NR:ns ## COG: FN0505 COG2071 # Protein_GI_number: 19703840 # Func_class: R General function prediction only # Function: Predicted glutamine amidotransferases # Organism: Fusobacterium nucleatum # 1 243 1 243 243 476 99.0 1e-134 MERKPIIGISSSVIVDESGSFAGYKRAYVNKDYVDAVVRAGGVPLIIPFTTNKEVIISQA QLIDGLILSGGHDVSPYNYGQEPSQKLGETFPERDTYEMILLEESKKRNIPILGICRGSQ LINVAAGGTLYQDLSLIPGNILKHNQVNKPTLKTHIIKIGENSIISSVFGKETMVNSFHH QAIDKVADDFKVVARANDGVVEAIEHKTYKFLVAVQWHPEMLAVECEKARELFAKFVEEA KNK 
>gi|296154045|gb|ADVK01000043.1| GENE 20 24337 - 26046 2199 569 aa, chain - ## HITS:1 COG:FN0506 KEGG:ns NR:ns ## COG: FN0506 COG0018 # Protein_GI_number: 19703841 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Arginyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 569 1 569 569 1087 99.0 0 MKITSRELTDIFQKHVENLFPNKELKPVEITVATNENFGDYQCNFAMINSKIIGDNPRKI AEEVKNNFPYGDVIEKLEVAGPGFINIFLSDKYISNSIKKIGEDYDFSFLNRKGKVIIDF SSPNIAKRMHIGHLRSTIIGESVSRIYRFLGYDVVADNHIGDWGTQFGKLIVGYRNWLDK KAYKKNAIEELERVYVKFSDEAEKDPSLEDLARAELKKVQDGEEENTKLWKEFITESLKE YNKLYKRLDVHFDTYYGESFYNDMMADVVKELVDKKIAVDDDGAKVVFFDEKDNLFPCIV QKKDGAYLYSTSDIATVKFRKNTYDVNRMIYLTDARQQDHFKQFFKITDMLGWNIEKYHI WFGIIRFADGILSTRKGNVIKLEELLDEAHSRAYDVVNEKNPNLSEGEKQNIAEVVGVSS VKYADLSQNKQSDIIFEWDKMLSFEGNTAPYLLYTYARIQSILRKVAELNIDLNENIEIK TENKIEKSLATYLLAFPISVLKAGETFKPNLIADYLYELSKKLNSFYNNCPILNQDIETL KSRALLIKKTGEVLKEGLELLGIPILNKM >gi|296154045|gb|ADVK01000043.1| GENE 21 26015 - 26215 152 66 aa, chain + ## HITS:1 COG:no KEGG:FN0507 NR:ns ## KEGG: FN0507 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 66 1 66 66 108 100.0 8e-23 MSVNSLLVIFINILPLFLNKKTDLSPHYGFSRQTQQVVVSYLKYTVSVITIDIRDFSDLL KTAPTP >gi|296154045|gb|ADVK01000043.1| GENE 22 26342 - 27187 1295 281 aa, chain - ## HITS:1 COG:FN0508 KEGG:ns NR:ns ## COG: FN0508 COG4667 # Protein_GI_number: 19703843 # Func_class: R General function prediction only # Function: Predicted esterase of the alpha-beta hydrolase superfamily # Organism: Fusobacterium nucleatum # 1 281 1 281 281 514 99.0 1e-146 MKVGLVLEGGGMRALFTAGVLDALLDVKELNIDGIVGVSAGALFGVNYVSEQKERAIRYN KKYARDKRYMGFYSWITTGNAVNEDFAFYEIPFKLDVFDQEKFKESKIDFYVVMTNIENG QAEYVLIKDVFEQMEYLRATSALPFASKIIEINGKKYLDGGISDSIPIDYCESLGYDKII LILTRPENNYKDDKLNFLYKLVYRKYPNLVERLINMGKDYEIVLKKIKDLENKNKIFVIR PPKVLKIGRLEKNEDKIQNVYDIGLSAGKKEINNLFEYLNK >gi|296154045|gb|ADVK01000043.1| GENE 23 27312 - 28319 1662 335 aa, chain + ## HITS:1 COG:FN0511 KEGG:ns NR:ns ## COG: FN0511 
COG1052 # Protein_GI_number: 19703846 # Func_class: C Energy production and conversion; H Coenzyme transport and metabolism; R General function prediction only # Function: Lactate dehydrogenase and related dehydrogenases # Organism: Fusobacterium nucleatum # 1 335 1 335 335 613 99.0 1e-175 MQKTKIIFFDIKDYDKEFFKKYGADYNFEMTFLKVRLTEETANLTKGYDVVCGFANDNIN KETIDIMAENGIKLLAMRCAGFNNVSLKDVNEKFKVVRVPAYSPHAIAEYTVGLILAVNR KINKAYVRTREGNFSINGLMGIDLYEKTAGIIGTGKIGQILIKILRGFDMKVIAYDLFPN QKVADELGFEYVSLDELYANSDIISLNCPLTKDTKYMINRRSMLKMKDGVILVNTGRGML IDSADLVEALKDKKIGAVALDVYEEEENYFFEDKSTQVIEDDILGRLLSFYNVLITSHQA YFTKEAVGAITVTTLNNIKDFVEGRPLVNEVPQNQ Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:52:50 2011 Seq name: gi|296154041|gb|ADVK01000044.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00054, whole genome shotgun sequence Length of sequence - 1937 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 1, operones - 1 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 112 - 144 -0.9 1 1 Op 1 1/0.000 - CDS 166 - 480 555 ## COG0526 Thiol-disulfide isomerase and thioredoxins - Prom 524 - 583 14.8 - Term 566 - 613 5.9 2 1 Op 2 . - CDS 623 - 1741 1236 ## COG1454 Alcohol dehydrogenase, class IV 3 1 Op 3 . 
- CDS 1761 - 1937 221 ## FN0091 phosphoserine phosphatase (EC:3.1.3.3) Predicted protein(s) >gi|296154041|gb|ADVK01000044.1| GENE 1 166 - 480 555 104 aa, chain - ## HITS:1 COG:FN0093 KEGG:ns NR:ns ## COG: FN0093 COG0526 # Protein_GI_number: 19703445 # Func_class: O Posttranslational modification, protein turnover, chaperones; C Energy production and conversion # Function: Thiol-disulfide isomerase and thioredoxins # Organism: Fusobacterium nucleatum # 1 104 1 103 103 181 99.0 3e-46 MAIVKGTKENFDAEVLKASGVVVVVDFGANWCGPCKSLVPILDEIVEEDPSKKIVKVDID EQEELAAQYKIMSVPTLLVFRNGEIIDKSIGLIQKHEVKALFAK >gi|296154041|gb|ADVK01000044.1| GENE 2 623 - 1741 1236 372 aa, chain - ## HITS:1 COG:FN0092 KEGG:ns NR:ns ## COG: FN0092 COG1454 # Protein_GI_number: 19703444 # Func_class: C Energy production and conversion # Function: Alcohol dehydrogenase, class IV # Organism: Fusobacterium nucleatum # 1 372 1 372 372 706 99.0 0 MKEFRLQPKILFGEDSLDYLKTLKCKKVMIVTDEVMTQLRLTDFVTNSLSSTTEINIFDK VEPNPSVATIENGLKDFFSFEPECVIALGGGSSIDACKGILYFAYKLYKKLNINKKKIFF IAIPTTSGTGSEVTSYSVVTNGEHKIALANDLMLPDVALLSTKFLGALPAKVVADTGMDV LTHALEAYVSTNANPFSSSLAIKSIKMIFENLVTHYNDRKIEGPKRNVQFASCMAGIAFD NSSLGINHSIAHTVGAKFHIAHGRANAIIMPYVIEVNTEANKKYYEVSRELGLPANTIEE GKYSLLSFVRILKEKLGIEKCLKDYGVDFETFKREIPSILSDIKKDICTIYNPNKLSDEE YIRLLLKIYFGE >gi|296154041|gb|ADVK01000044.1| GENE 3 1761 - 1937 221 58 aa, chain - ## HITS:1 COG:no KEGG:FN0091 NR:ns ## KEGG: FN0091 # Name: not_defined # Def: phosphoserine phosphatase (EC:3.1.3.3) # Organism: F.nucleatum # Pathway: Glycine, serine and threonine metabolism [PATH:fnu00260]; Methane metabolism [PATH:fnu00680]; Metabolic pathways [PATH:fnu01100]; Microbial metabolism in diverse environments [PATH:fnu01120] # 2 58 310 366 366 108 100.0 8e-23 AILVAGDAVGDESMLTKFEDTEVLLIMKREGKLDDVAKDSRALIQKRNAQTGLLDPKN Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:53:14 2011 Seq name: gi|296153962|gb|ADVK01000045.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00055, whole genome shotgun sequence Length of sequence - 72884 bp Number of predicted genes - 79, with homology - 77 Number of transcription units - 27, operones - 23 average op.length - 3.3 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 1/1.000 - CDS 2 - 863 1019 ## COG0607 Rhodanese-related sulfurtransferase 2 1 Op 2 . - CDS 860 - 2854 2414 ## COG0337 3-dehydroquinate synthetase - Prom 2891 - 2950 5.5 - Term 2916 - 2951 3.1 3 2 Op 1 . - CDS 2961 - 3782 969 ## FN0872 hypothetical protein 4 2 Op 2 1/1.000 - CDS 3802 - 5457 2074 ## COG0616 Periplasmic serine proteases (ClpP class) 5 2 Op 3 1/1.000 - CDS 5471 - 5986 611 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes 6 2 Op 4 1/1.000 - CDS 5988 - 6758 918 ## COG0566 rRNA methylases - Prom 6784 - 6843 10.1 7 3 Op 1 1/1.000 - CDS 6893 - 8605 1816 ## COG0405 Gamma-glutamyltransferase 8 3 Op 2 1/1.000 - CDS 8602 - 9981 1804 ## COG0591 Na+/proline symporter - Prom 10089 - 10148 12.2 - Term 10078 - 10146 9.5 9 4 Tu 1 . - CDS 10186 - 10893 874 ## COG1802 Transcriptional regulators - Prom 10932 - 10991 10.1 10 5 Op 1 5/0.000 - CDS 11261 - 12046 512 ## COG4587 ABC-type uncharacterized transport system, permease component 11 5 Op 2 2/0.400 - CDS 12062 - 13078 269 ## PROTEIN SUPPORTED gi|169795303|ref|YP_001713096.1| ABC transporter ATP-binding protein 12 5 Op 3 1/1.000 - CDS 13059 - 13847 625 ## COG3694 ABC-type uncharacterized transport system, permease component - Prom 13935 - 13994 12.1 13 6 Op 1 35/0.000 - CDS 14047 - 14802 243 ## PROTEIN SUPPORTED gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 14 6 Op 2 33/0.000 - CDS 14804 - 15841 1219 ## COG0609 ABC-type Fe3+-siderophore transport system, permease component 15 6 Op 3 . - CDS 15855 - 16715 1095 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component - Prom 16891 - 16950 12.3 + Prom 16687 - 16746 20.8 16 7 Tu 1 . 
+ CDS 16918 - 18900 2589 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 17 8 Op 1 1/1.000 - CDS 19009 - 20811 2219 ## COG1164 Oligoendopeptidase F 18 8 Op 2 1/1.000 - CDS 20826 - 22049 1908 ## COG2233 Xanthine/uracil permeases 19 8 Op 3 1/1.000 - CDS 22110 - 22493 748 ## COG5496 Predicted thioesterase 20 8 Op 4 . - CDS 22558 - 23187 614 ## COG1564 Thiamine pyrophosphokinase 21 8 Op 5 . - CDS 23187 - 24026 1053 ## FN0891 DNAse I homologous protein DHP2 precursor (EC:3.1.21.-) - Prom 24058 - 24117 12.1 + Prom 24033 - 24092 12.4 22 9 Op 1 1/1.000 + CDS 24208 - 24939 1034 ## COG0560 Phosphoserine phosphatase 23 9 Op 2 . + CDS 24974 - 25396 500 ## COG1959 Predicted transcriptional regulator 24 9 Op 3 . + CDS 25427 - 25963 453 ## FN0894 hypothetical protein + Term 25975 - 26019 -0.6 - Term 25954 - 25983 -0.2 25 10 Op 1 . - CDS 26009 - 26281 334 ## FN0895 hypothetical protein 26 10 Op 2 . - CDS 26278 - 26472 223 ## FN0895 hypothetical protein 27 10 Op 3 . - CDS 26496 - 26879 514 ## FN0896 hypothetical protein - Prom 27016 - 27075 12.9 + Prom 26904 - 26963 8.6 28 11 Op 1 1/1.000 + CDS 26983 - 27651 270 ## PROTEIN SUPPORTED gi|241889384|ref|ZP_04776685.1| 30S ribosomal protein S8 29 11 Op 2 . + CDS 27726 - 29597 1763 ## COG1533 DNA repair photolyase + Term 29684 - 29735 2.3 + Prom 29668 - 29727 3.0 30 12 Op 1 . + CDS 29781 - 30497 825 ## COG1028 Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 31 12 Op 2 1/1.000 + CDS 30504 - 31286 1030 ## COG1235 Metal-dependent hydrolases of the beta-lactamase superfamily I 32 12 Op 3 1/1.000 + CDS 31296 - 31883 752 ## COG1573 Uracil-DNA glycosylase 33 12 Op 4 1/1.000 + CDS 31873 - 32418 889 ## COG0212 5-formyltetrahydrofolate cyclo-ligase 34 12 Op 5 1/1.000 + CDS 32440 - 33411 1375 ## COG0794 Predicted sugar phosphate isomerase involved in capsule formation 35 12 Op 6 . 
+ CDS 33421 - 35004 1701 ## COG2509 Uncharacterized FAD-dependent dehydrogenases + Term 35015 - 35062 1.1 - Term 35003 - 35048 8.3 36 13 Op 1 . - CDS 35054 - 35377 379 ## FN0905 hypothetical protein 37 13 Op 2 1/1.000 - CDS 35396 - 36403 1442 ## COG0240 Glycerol-3-phosphate dehydrogenase - Prom 36433 - 36492 7.4 38 14 Op 1 1/1.000 - CDS 36597 - 37268 785 ## COG4123 Predicted O-methyltransferase 39 14 Op 2 1/1.000 - CDS 37270 - 38208 1082 ## COG1774 Uncharacterized homolog of PSP1 40 14 Op 3 1/1.000 - CDS 38229 - 38927 912 ## COG2003 DNA repair proteins 41 14 Op 4 2/0.400 - CDS 38937 - 40001 1363 ## COG2038 NaMN:DMB phosphoribosyltransferase 42 14 Op 5 6/0.000 - CDS 40016 - 40591 595 ## COG0406 Fructose-2,6-bisphosphatase 43 14 Op 6 8/0.000 - CDS 40596 - 41432 867 ## COG0368 Cobalamin-5-phosphate synthase 44 14 Op 7 . - CDS 41445 - 42008 715 ## COG2087 Adenosyl cobinamide kinase/adenosyl cobinamide phosphate guanylyltransferase 45 14 Op 8 . - CDS 42070 - 42801 756 ## FN0914 hypothetical protein 46 14 Op 9 1/1.000 - CDS 42816 - 43310 651 ## COG2190 Phosphotransferase system IIA components 47 14 Op 10 . - CDS 43338 - 43787 669 ## COG3187 Heat shock protein - Prom 43818 - 43877 16.1 + Prom 43799 - 43858 15.0 48 15 Op 1 . + CDS 43888 - 44904 941 ## FN0917 hypothetical protein 49 15 Op 2 . + CDS 44930 - 45712 967 ## FN1144 hypothetical protein 50 16 Tu 1 . - CDS 45811 - 46737 1375 ## COG0501 Zn-dependent protease with chaperone function - Prom 46781 - 46840 6.2 51 17 Op 1 . - CDS 46909 - 47679 786 ## FN0921 hypothetical protein - Prom 47701 - 47760 2.9 52 17 Op 2 1/1.000 - CDS 47767 - 48705 1038 ## COG2334 Putative homoserine kinase type II (protein kinase fold) 53 17 Op 3 . - CDS 48717 - 50156 1290 ## COG1502 Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes - Prom 50227 - 50286 13.5 + Prom 49912 - 49971 15.8 54 18 Op 1 . + CDS 50074 - 50268 106 ## 55 18 Op 2 . + CDS 50314 - 51111 893 ## FN0924 hypothetical protein 56 18 Op 3 .
+ CDS 51150 - 52028 755 ## FN0925 hypothetical protein 57 18 Op 4 . + CDS 52041 - 52820 1045 ## COG2357 Uncharacterized protein conserved in bacteria + Term 52831 - 52870 3.2 - Term 52817 - 52858 6.1 58 19 Tu 1 . - CDS 52872 - 53600 556 ## FN0927 hypothetical protein - Prom 53642 - 53701 4.2 59 20 Op 1 12/0.000 - CDS 53703 - 54347 175 ## PROTEIN SUPPORTED gi|238855674|ref|ZP_04645973.1| ribosomal protein ala-acetyltransferase 60 20 Op 2 1/1.000 - CDS 54328 - 54789 671 ## COG0802 Predicted ATPase or kinase 61 20 Op 3 1/1.000 - CDS 54803 - 55267 767 ## COG2870 ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase 62 20 Op 4 . - CDS 55254 - 55871 732 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes - Prom 55898 - 55957 14.3 + Prom 55986 - 56045 10.3 63 21 Op 1 . + CDS 56067 - 56552 582 ## FN0932 hypothetical protein 64 21 Op 2 5/0.000 + CDS 56562 - 57824 1314 ## COG0128 5-enolpyruvylshikimate-3-phosphate synthase 65 21 Op 3 . + CDS 57808 - 58881 1492 ## COG0082 Chorismate synthase - Term 58813 - 58864 3.5 66 22 Op 1 . - CDS 58876 - 59970 956 ## FN0935 hypothetical protein 67 22 Op 2 . - CDS 59967 - 60425 447 ## FN0936 CapC protein 68 22 Op 3 . - CDS 60415 - 61563 1161 ## FN0937 gamma-polyglutamic acid synthetase (EC:6.3.2.-) - Prom 61593 - 61652 9.4 69 23 Op 1 . - CDS 61655 - 62791 1647 ## FN0938 hypothetical protein 70 23 Op 2 . - CDS 62803 - 62949 206 ## 71 23 Op 3 . - CDS 62966 - 64237 1488 ## COG1593 TRAP-type C4-dicarboxylate transport system, large permease component - Term 64250 - 64289 7.0 72 24 Op 1 . - CDS 64293 - 64826 715 ## FN0940 hypothetical protein 73 24 Op 2 . - CDS 64845 - 66584 2450 ## COG0405 Gamma-glutamyltransferase - Prom 66623 - 66682 11.8 + Prom 66560 - 66619 10.0 74 25 Op 1 40/0.000 + CDS 66790 - 68178 1249 ## COG0642 Signal transduction histidine kinase 75 25 Op 2 . 
+ CDS 68168 - 68782 663 ## COG0745 Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain 76 26 Op 1 . - CDS 68806 - 70173 1543 ## COG0534 Na+-driven multidrug efflux pump - Prom 70205 - 70264 10.1 77 26 Op 2 . - CDS 70343 - 70987 231 ## COG0671 Membrane-associated phospholipid phosphatase - Prom 71038 - 71097 11.8 + Prom 70960 - 71019 14.8 78 27 Op 1 . + CDS 71041 - 71691 651 ## COG1451 Predicted metal-dependent hydrolase + Term 71696 - 71737 4.3 + Prom 71724 - 71783 14.6 79 27 Op 2 . + CDS 71825 - 72637 1116 ## COG5266 ABC-type Co2+ transport system, periplasmic component + Term 72749 - 72802 1.8 Predicted protein(s) >gi|296153962|gb|ADVK01000045.1| GENE 1 2 - 863 1019 287 aa, chain - ## HITS:1 COG:FN0870 KEGG:ns NR:ns ## COG: FN0870 COG0607 # Protein_GI_number: 19704205 # Func_class: P Inorganic ion transport and metabolism # Function: Rhodanese-related sulfurtransferase # Organism: Fusobacterium nucleatum # 48 287 1 240 240 425 100.0 1e-119 MIDVIDNISAYFDNDLINIIYKDLKMNGFSDEEIEKILKDKNRDLPMMEVNIFQLNRYKL GSIGFTSRELENLKIDFVEEKLLSNDYNGDKPTNKIVYLKVLFDKGSKKILGCQIANEKN IEARLNAIKSIIEKGGDLKDLVKYKVNPTDNEWNPDILNILALNVMAKSEKISTDIEAKD IENLLKNKEFLLDVREDYEYQNGHIKGAVNLPLREILSQKDTLPKDRDIYVYCRSAHRSA DAVNFLKSLGFDKVHNIEGGFIDISFNEYHKDKGNLENSIVTNYNFD >gi|296153962|gb|ADVK01000045.1| GENE 2 860 - 2854 2414 664 aa, chain - ## HITS:1 COG:FN0871_1 KEGG:ns NR:ns ## COG: FN0871_1 COG0337 # Protein_GI_number: 19704206 # Func_class: E Amino acid transport and metabolism # Function: 3-dehydroquinate synthetase # Organism: Fusobacterium nucleatum # 1 350 1 350 350 624 97.0 1e-178 MKKIFDDIYVGSNIISKLNDYTEDFDKILIFSNETIADLYFEKFKSTLNEKDKVFYFAIK DGEEYKNIESILPVYDFMLENNFSRKSLIISLGGGVICDMGGYISATYMRGIEFIQVPTS LLAQVDASVGGKVAINHPKCKNMIGSFKNPYRVIIDVGFLKTLPKREFKSGMGELLKHSF LTKDKSYLEYIENNVEKIKNLDNEVLENIVEQSIRIKKHYVDIDPFEKGERAFLNLGHTY AHTLESFFDYKAYTHGEAVAKGVIFDLELSLLREKIDTKYLERARNIFKLFDIDTDLIYL PSDKFIPLMRKDKKNSFNKIITILLDGEGHLSKTEVQEDEIVKIIDKYKNNFLRTSIDIG 
TNSCRLLIAEIEKDNENITFKKEIYKDLEIVKLGEDVNKNKFLKEEAIERTLKCLKKYRE IIDNYSIEDKNIICFATSATRDSTNRDYFIKKVFDETKIKINCISGDKEAYINFKGVISS FDKDFKENILVFDIGGGSTEFTLGNIQGIEKKISLNIGSVRITEKFFLNNKIYNYSEENR IKSKEWVKENLKELEDFKKLNFTLIGVAGTTTTQVSVREKMEVYDGEKIHLSNLTSKEIN DNLSLFIKNINKQEIKGLDPKRKDVIIGGTIILKEILDYFGKDFIIVSENDNLMGAILEG VENK >gi|296153962|gb|ADVK01000045.1| GENE 3 2961 - 3782 969 273 aa, chain - ## HITS:1 COG:no KEGG:FN0872 NR:ns ## KEGG: FN0872 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 13 273 1 257 257 490 98.0 1e-137 MKNRTYNFFIIILLFSLQTFLYAEINYVNKKGMTIETRYNVPTGYKRVNVEKRSFAEFLR NQKLKPYGEKALYYNGKEKPNSGIYDSVLDVEIGKQDLHQCADAIMLLRAEYLYSKKEYN KINFHFTSGFEAKYSKWIEGYRISVQGKGSYVKKANPSNTYKDFKNYMNIVFSYCGTLSL EKEMKLQSLDKMKIGDIFIKGGSPGHAVIIVDMAENDKGEKIFMLAQSYMPAQQTQILIN PNNKELGVWYSLKGKAELITPEWDFSINQLRSF >gi|296153962|gb|ADVK01000045.1| GENE 4 3802 - 5457 2074 551 aa, chain - ## HITS:1 COG:FN0873 KEGG:ns NR:ns ## COG: FN0873 COG0616 # Protein_GI_number: 19704208 # Func_class: O Posttranslational modification, protein turnover, chaperones; U Intracellular trafficking, secretion, and vesicular transport # Function: Periplasmic serine proteases (ClpP class) # Organism: Fusobacterium nucleatum # 58 551 1 494 494 858 99.0 0 MFVLSALLQAVIISIVIIIVLLIPIFFILGKLKNKDKVSLKGVKTVVFNLNELVEDYMIS TVSINKTLSHEAVLKALENLVNDKKIEKIIIDVDEVDLSRVHIEELKEIFEKLSVNKEII AIGTTFDEYSYQVALLANKIYMLNTKQSCLYFRGYEYKEPYFKNILANLGITVNTLHIGD YKVAGESFSNDKMSEEKKESLINIKETLFQNFINLVKEKRKVDITNEIFSGDLIFANSEK AIQLGLIDGLSTYEEIGIDYNEDTVDFGEYVSAYKRKKNKSKNTIAIINLEGEIDTRESK ESIINYDNVVEKLDELEDIKNLKGLVLRINSPGGSALESEKIYQKLKKLDIPIYISMGDL CASGGYYIATVGKRLFANPVTLTGSIGVVILYPEFTETINKLKVNMEGFSKGKGFDIFDV SSKLSEESKEKIIYSMNEVYSEFKEHVMEARNISEEDLEKIAGGRVWLGSQAKENGLVDE LGSLNDCINSLVKDLELKDFKLTYIRGRKSIMEVVSAMKPQFIKSDIIEKIEMLKSYSNK ILYYDESLENL >gi|296153962|gb|ADVK01000045.1| GENE 5 5471 - 5986 611 171 aa, chain - ## HITS:1 COG:FN0874 KEGG:ns NR:ns ## COG: FN0874 COG0494 # Protein_GI_number: 19704209 # Func_class: 
L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases including oxidative damage repair enzymes # Organism: Fusobacterium nucleatum # 1 171 1 171 171 306 100.0 8e-84 MKFTHISKKQVFKNDVITVFEEKLSLPNNNIVTWTFTGKKEVVAIIAEMDGEIIFVKQYR PAIKKELLEIPAGLVEKGEDIVEAAKREFEEEIGYRANKLEKICTYYNSAGVNAGQYHLF YASDLEKTHQHLDENEFLEIVTIPINEINIFSFEDSKTIIALSYLNMKNKK >gi|296153962|gb|ADVK01000045.1| GENE 6 5988 - 6758 918 256 aa, chain - ## HITS:1 COG:FN0875 KEGG:ns NR:ns ## COG: FN0875 COG0566 # Protein_GI_number: 19704210 # Func_class: J Translation, ribosomal structure and biogenesis # Function: rRNA methylases # Organism: Fusobacterium nucleatum # 1 256 6 261 261 405 100.0 1e-113 MEIIESKENKLIKSLKKLKQKKYRDSENKFLAEGYKFLDYNYSPEMIIVREDIFQSNFYF EKINKFSCKEIVVTRKIFEELSSQENSQGIIILYNKKTNDLKFLSNNLVILDDVSDPGNL GTIIRICDATNFKDIILTKGTVDAYNEKVIRATMGSILNVNLYYLEKSEIINFLKENNYS IISTYLDKTAILYNKIELKEKNAVIFGNEGNGISDDFINITDYKTIIPILSNTESLNVAV ATGIILYKFREIEGAF >gi|296153962|gb|ADVK01000045.1| GENE 7 6893 - 8605 1816 570 aa, chain - ## HITS:1 COG:FN0876 KEGG:ns NR:ns ## COG: FN0876 COG0405 # Protein_GI_number: 19704211 # Func_class: E Amino acid transport and metabolism # Function: Gamma-glutamyltransferase # Organism: Fusobacterium nucleatum # 1 570 1 570 570 1100 99.0 0 MKSNKKIYMWPSAWEKPVVKGTYGAVSSNSCYATQAGLEILNKGGNAFDAAVAVSLVLSV VEEHHSGIGGGCFTLFYSVKDNETFALDGRGTAPKKATKDLFLKDGEVQDEWKDLGGQSV LVPGLLKTMDVLLKKYGTMELKEVVLPALKYAKEGFEIGYTYSLTMHDDSVQRKMNLSNE FKKLYLKEDKSFYKFGDIYRNEKIAKLLELISKNGIDIFYNGEIAEKIVNIVNKNGGCFI AEDLKKYMPKIRKVEVSTYRGYEIKSFSPPSSGATLIEMLNIIENKNIKDMGHNSAETIH ILAEAMKLGFSDRNVVLADPDYAKINDDKLISKEYAKERYSLVSSEAKEYTSGNLSNIKE YNGNTSHFSIIDRYGNVVSQTQTIRDWFGSGIIVDDYGFVLNNGMSDFSALAGAITSQGL TYGDLNAIEGGKIPLSSMSPTIVFKDKKPFMSIGAAGGPRIITGILQGIINAIDYDMLPE QLVNMPFINCLTKNQGLEVEYGISKDTLNILEKKGHIIKEIPVYQAMSTMLNSVMNIDGV LYAASTKRVDGCGGVLFENGHIALEGIIQR >gi|296153962|gb|ADVK01000045.1| GENE 8 8602 - 9981 1804 459 aa, chain - ## HITS:1 COG:FN0877 KEGG:ns NR:ns ## COG: 
FN0877 COG0591 # Protein_GI_number: 19704212 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Na+/proline symporter # Organism: Fusobacterium nucleatum # 4 459 1 456 456 794 99.0 0 MLIMTWLITFIVVVGIGVYAGTKIKSSNQWSGGDRSLGVLSLGCVFAAWQIGGMAIVGAA QNGYNLGISGAWYSIAGSFYFIVLAFFAKVIREKMPGESVPKYLQIRFDSKTSKLYSYAW IIYGFFYIPIQLKTVSSIISLVVPELNGILIMLIGVTVAVLYTGFSGMKGASAVGRVVCI GIYILLIVFTIITLQKFDGYRELMQKLPQEYSKMNNMPIQRIIAWIFGGCISTAVMQSVL QPLLAAKDPETARKGSILGYLISAPICFFTALCGILSKVSGADLGDGTTAFAYAIKTFSS PVFAGIIFAFSTMIIAATMATMMLATGTIITNIYKTQINPEADDNKILKMSKVITFIFAY LTLIPAFLIPSSSLTNLFLRLQHIAAAPVSFSILLGLTWKKVTKEGAFFSMLSGMIVGII WMVLGFSDKIEAIYPVVLVTYFIGIIVSLLTNKKGEELA >gi|296153962|gb|ADVK01000045.1| GENE 9 10186 - 10893 874 235 aa, chain - ## HITS:1 COG:FN0878 KEGG:ns NR:ns ## COG: FN0878 COG1802 # Protein_GI_number: 19704213 # Func_class: K Transcription # Function: Transcriptional regulators # Organism: Fusobacterium nucleatum # 1 235 1 235 235 409 100.0 1e-114 MEENKDTLLGKFIKTLSYKEQAYDLIKDAILFNKFRIGAIYSQESICNELGISRTPVREA LIELQKEGYITILRGRGIEVTPVTEEDAKDILEVRIFYEKNNAFLAAKRIKDEDIKLLKE CIEKLESNLSTFDSQLLYRIDHQFHRLVAKATQNNWMYKETELILDNYLRFENKSVYNNS IDGQLVFKEHLAIFNAIKDKDSEKAKKMMEKHLVNSYYRTLKKIWNTEKEELEIK >gi|296153962|gb|ADVK01000045.1| GENE 10 11261 - 12046 512 261 aa, chain - ## HITS:1 COG:FN0879 KEGG:ns NR:ns ## COG: FN0879 COG4587 # Protein_GI_number: 19704214 # Func_class: R General function prediction only # Function: ABC-type uncharacterized transport system, permease component # Organism: Fusobacterium nucleatum # 1 261 1 261 261 366 100.0 1e-101 MKKYFKIFKISLISYLEYRVNFVLSFLFSLVPFSVSVLLWVAVAKHSEFIKVKEVVSYYF VILIVKNITTTNSIIRFSDDIRLGELNKYLLKPYNYCFYNLMADLPERIVFIVMNFIPLI LIYAFLHSYINLDLSLIKVFFFIIFLILGYLINFFIDFLIGLYSFYFSKVSSLYTSIKVL RNLSAGNIFPLLMLPAKIFLTLQFLPFMYTSYVPTMLLLEKTSFDLILKNLFISITWLSI LCLFSAMLWKRGMKRYSAYGG >gi|296153962|gb|ADVK01000045.1| GENE 11 12062 - 13078 269 338 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|169795303|ref|YP_001713096.1| 
ABC transporter ATP-binding protein [Acinetobacter baumannii AYE] # 46 338 19 304 311 108 25 1e-22 MSVQEVNNEYVILTENLCKDYIYYKKEAGLKGSLKNLFHREKLIKKAVQNLSIKIPKGAI VGLIGLNGAGKTTTLKMLTGIIMPTSGKVDVLGYFPFDKKKEYLRRIAMVMGNKSQLWWD LPALDTFELNKIIYEIDDSEYKNTLNSMVEIMGVEKQLNVQVRRLSLGERMKMELIASLI HKPDIVFLDEPTIGLDVITQYNIRNFLKEYCVKYGSTILLTSHNFNDIVTLCDSIILINN GEMIYSDTFKNFQKQFFNQKYFVLKLKEPNVDEFISKLHLENDIKIEKIDSNSIKIATDN NKSLDILKNISGNFIQELSDINIENISMDDVIRKIYQK >gi|296153962|gb|ADVK01000045.1| GENE 12 13059 - 13847 625 262 aa, chain - ## HITS:1 COG:FN0881 KEGG:ns NR:ns ## COG: FN0881 COG3694 # Protein_GI_number: 19704216 # Func_class: R General function prediction only # Function: ABC-type uncharacterized transport system, permease component # Organism: Fusobacterium nucleatum # 1 262 1 262 262 400 99.0 1e-111 MKRYFQIMKAYLRGSLMYQMEYKFNFLVGGTFELIWMVMYIIFINVAFLHTKDINGWNKY QMLMLTFQGGLMDSVFTFAVVPGLKRLPELINKGTLDFLLLKPVNKKFNISFNEFDIPQI KNIFINIFGIIYCIKKLQIVLTPTKILIYILLSINGFLMIYSIMFMLMSLAFWFMRMDIV MGIGSELITVGNKPMSIYPNIIQKILIFIIPLFVCFNFPILYIVKGLNLYFIIYSFIATA ICFMVLNFIFKRGLRRYVSAGS >gi|296153962|gb|ADVK01000045.1| GENE 13 14047 - 14802 243 251 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163803615|ref|ZP_02197481.1| 50S ribosomal protein L34 [Vibrio campbellii AND4] # 1 223 1 227 245 98 26 1e-19 MIEVKKLNFSIENKKIIQDISIKVNQGQFIGVIGANGSGKSTLLKNVYRFLKYDSGNIKL KNVDLYSYSSKSLAKEMAVLAQKQNMNFDFSVEEIVEMGRYAYKHSIFDSEENKNSKFIE DALNAVGMYKMKDRSFLSLSGGEMQRVLIARALAQNTEILILDEPTNHLDIKYQIQIMEL VKETRKTILAVIHDINIASSYCNYIYALKDGKICFEGSPEEIFTKEKIKNIFDVEADVLI HPKNKKPLIVF >gi|296153962|gb|ADVK01000045.1| GENE 14 14804 - 15841 1219 345 aa, chain - ## HITS:1 COG:FN0884 KEGG:ns NR:ns ## COG: FN0884 COG0609 # Protein_GI_number: 19704219 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-siderophore transport system, permease component # Organism: Fusobacterium nucleatum # 1 345 1 345 345 536 100.0 1e-152 MKKHIKNYKSLSLLLFIILILVSTFAITVGSVSLKNLDVWKIIVNKSFNHNIFTITWEES 
SEIIVWTLRAPRIVTAILAGASLSFVGILMQALTKNPLASPYILGISSGASTGAVLVILI FSGSYIFISVGAFILGTLTALLVFYFANSNGFSSTKLVLVGAAISAIFSGLTSLIIAVTP NERAIRGALFWMSGSLAGSTWQYIPFLFISLLIVFILVYPKYDELNILVTGDENATSLGV DVKKIRFLIMITSTFLTGIVVANTGIIGFVGLVIPHITRGLVGGNHKKVIPIAILLGAVF LVLTDTLTRTVVSSQEIPIGVITSVLGAPFFLSMLRRKSYRFGGE >gi|296153962|gb|ADVK01000045.1| GENE 15 15855 - 16715 1095 286 aa, chain - ## HITS:1 COG:FN0885 KEGG:ns NR:ns ## COG: FN0885 COG0614 # Protein_GI_number: 19704220 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 286 1 286 286 494 99.0 1e-140 MKKILFSFLLLSSLCFAKVPQRAVSAAHFSTEILLSIGAEKQMVGTAYPDNEILPSLKEK YDKIPVLSMKNPTKEQFYAVKPDFLTGWDSTVLDKNLGPIKELEKNGVQVYIMKSLHSSD INLVFEDILTYGKIFNLENNAKKVVGKMKSDLAAVQKQLPKNKVKVFTYDSGDKAPFVVG GDSIGNTIITLAGGDNIFKDIKKAWADGNWEKVLVENPDIIVIIDYGDQSAESKIKFLKE KSPIKDLKAVKNNKFVVIELADITAGVRNVDAIKKLAKAFHNITIK >gi|296153962|gb|ADVK01000045.1| GENE 16 16918 - 18900 2589 660 aa, chain + ## HITS:1 COG:FN0886 KEGG:ns NR:ns ## COG: FN0886 COG1629 # Protein_GI_number: 19704221 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Fusobacterium nucleatum # 1 660 1 660 660 1199 99.0 0 MFKKITLLSLILATAVYANDEADIKLNESVITAQNFKTTVRNTASNVTIVTAKDIEEKGA QNLVDALRMVPGIMVKNYYGNIAFDIGGYSSVHAERNSIITYDGVRISAKEATNIPISAI ERVEVIPNGGGILYGDGANGGVINILSKSIYGKDNNKKISGNVRTEYGSRGSYKYGFSTI AKATDKLTFKVDYSRDRYRSERDSDENGKVVSRSQEVSIDIKYKFDNADLTVKYTRNEKH RADGGDLEEADYYKNRKMVTWVARDFTRSNDWYVNYRQNIGENTELLTYIDLYDTRENDD ISKTLDRDYSRKTAKLQLKHKYLNSHYFIIGTDYMKEKLKGLGYNGEYTGKNTEKTDYGI FAMNELKFGKFTFAQGLRYNKAEYDFYWRNKYPVPRNIRGEHGEQEYKNYAANLELRYDY SDTGMVYGKWSRDFRTPIAREMYYTLEGSKLKAQTQNTFEIGAKDYIAGTYISLSTFYKK TNGEIYYQGTPNKESTRPGAVVFPYYNMGDTRRLGIELLTEQYVKNFTFTESISYLNHKI VDSDFESRKNKEIPMVPNWKLAFGVGYKFNNKLNVNADIVYYGKFYDSDDPENVRPKDRG NYATVSLSANYKFENGFAINARVNNLFDKKYEDYVGYWDGTRQYSPAAGKYYSIGASYTF 
>gi|296153962|gb|ADVK01000045.1| GENE 17 19009 - 20811 2219 600 aa, chain - ## HITS:1 COG:FN0887 KEGG:ns NR:ns ## COG: FN0887 COG1164 # Protein_GI_number: 19704222 # Func_class: E Amino acid transport and metabolism # Function: Oligoendopeptidase F # Organism: Fusobacterium nucleatum # 1 600 1 600 600 1083 100.0 0 MKDRKSIDKKYKWNLNDIYENYDIWESDLEKFEKLTKEVPKFKGEIKKSPKKFVELEILM EKIAKLLDRLYLYPYMLKDLDSTDEITSIKMQEIEMIYSKFATETAWISPEMLEIPEETM NEWIKKYPELQERKFGLSEMYRLRKHVLSEDKEQLLSHFAQFMGSSSDIYAELSISDIKW NTVKFSTGEEIPVSNGVYSKILATNRNQEDRKLAFEALYKSYENSKNTFAAIYRAIVQRN VASCNARNYESSLDRALENKNIPREVYFSLVESTQENTAPLRRYVELRKKALKLKEYHYY DNSINIVDYNKIFKYDDAKEMVLKSVEPLGEDYQQKMKRAISEGWLDVFETKNKRSGAYS INIYDVHPYMLLNYQETMDDVFTLAHELGHTLHSMLSSEAQPYSTADYTIFVAEVASTFN ERLILDYMLKNSNDSLEKIALLEQALGNIVGTYYIQTLFATYEYEAHKLIEEYKAITPDI LSEIMFNLFKKYFGDMVTIDELQKIIWARIPHFYNSPFYVYQYATSFASSAKLYEDLKAN LENREKYLTLLKSGGNNHPMEQLKLAGVDLTKKESFDAVAKEFDRLLDILENELKKINLI >gi|296153962|gb|ADVK01000045.1| GENE 18 20826 - 22049 1908 407 aa, chain - ## HITS:1 COG:FN0888 KEGG:ns NR:ns ## COG: FN0888 COG2233 # Protein_GI_number: 19704223 # Func_class: F Nucleotide transport and metabolism # Function: Xanthine/uracil permeases # Organism: Fusobacterium nucleatum # 1 407 1 407 407 650 100.0 0 MKILGLKTKIILGMQHVLAMFGATVLVPFLTGLNPSIALICAGVGTLMFHSVTKGIVPVF LGSSFAFIGATALVLREQGIAILKGGVISAGLVYVMMSFIVLKFGVERIKSFFPPVVVGP IIMVIGLRLSPVALSMAGYSNNTFDRDSLIIALIVVISMIFISILKKSFFRLVPILISVA IGYIVAYFMGDVDLSKIHEASWIGLPEGAWDTITTVPKFTFSGVVALAPIALVVFIEHIG DITTNGAVVGKDFFKNPGVHRTLLGDGIATMAAGLLGGPANTTYGENTGVLAVTKVYNPA ILRIAACFAIVLGLIGKFGVILQTIPQPVMGGVSIILFGMIAAVGVRTIVEAQLDFTHSR NLMIAALIFVLGIAIGDITIWGTISISGLALAALVGIVLNKILPEDK >gi|296153962|gb|ADVK01000045.1| GENE 19 22110 - 22493 748 127 aa, chain - ## HITS:1 COG:FN0889 KEGG:ns NR:ns ## COG: FN0889 COG5496 # Protein_GI_number: 19704224 # Func_class: R General function prediction only # Function: Predicted thioesterase # Organism: Fusobacterium nucleatum # 1 127 1 127 127 218 99.0 2e-57 
MLEVGMKLEVEKLVTDNDTASKAASGAVEVLATPFMIAWMEEASLHLAQKGLENGLTTVG TEVNIKHLKGTLVGKTVKIVSILREIDRKKLVFDVEALEDGVVVGTGTHTRFIIDPVKFY EKLKNAK >gi|296153962|gb|ADVK01000045.1| GENE 20 22558 - 23187 614 209 aa, chain - ## HITS:1 COG:FN0890 KEGG:ns NR:ns ## COG: FN0890 COG1564 # Protein_GI_number: 19704225 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine pyrophosphokinase # Organism: Fusobacterium nucleatum # 1 209 1 209 209 348 100.0 3e-96 MKIAYLFLNGELRGDKNFYLDFIKNHKGDIYCADGGANFCYELTLIPKEIYGDLDSIKDE VKEFYQEKKVKFIKFKIEKDYTDSELLLNEIQNKYDVIYCIAGLGGSIDHELTNINLLAK YSNLIFISEKEKIFKIDSDSKFNDMINTKISFVIFSDQVKGLTLKGFKYSIENLDIKKGE ARCISNIIVENKANLLIKSGSLLCVIKEN >gi|296153962|gb|ADVK01000045.1| GENE 21 23187 - 24026 1053 279 aa, chain - ## HITS:1 COG:no KEGG:FN0891 NR:ns ## KEGG: FN0891 # Name: not_defined # Def: DNAse I homologous protein DHP2 precursor (EC:3.1.21.-) # Organism: F.nucleatum # Pathway: not_defined # 1 279 1 279 279 520 100.0 1e-146 MKKRKLLLLTIFALFFCLSILSSADEAYIASFNILRLGAAKKDMPQTAKILQGFDIVGLV EVINRDGVEELVDELNKASDEKWDYHISPFGVGSSKYKEYFAYVYKKDKVKFIKSEGFYK NGKSSLLREPYGATFQIENFDFTFVLVHTIYGNNESQRKAENFKMVDVYDYFQDRDKKEN DIFIAGDFNLYALDESFKPLYKHSDKITYAIDPAIKTTIGTKGRANSYDNFFFSQKYSQE FTGSSGALDFSGDNPKQMREIISDHIPVFIVVETSKDDD >gi|296153962|gb|ADVK01000045.1| GENE 22 24208 - 24939 1034 243 aa, chain + ## HITS:1 COG:FN0892 KEGG:ns NR:ns ## COG: FN0892 COG0560 # Protein_GI_number: 19704227 # Func_class: E Amino acid transport and metabolism # Function: Phosphoserine phosphatase # Organism: Fusobacterium nucleatum # 1 243 5 247 247 467 99.0 1e-131 MIAAFFDIDGTIYRNALLIEHFKKLVKYELFDDIQYRLKVEEAYNLWDTRKGDYDDYLLD LTQLYVVAIKGLPVKYNDFISNQVLLLKGNRVYTYTREMIEWHKKMGHKVFFISGSPSFL VSRMAKKMGVDDFCGSIYEIDEETQTFSGKILKPMWDSAHKQEAIENFIKKYNIDLSKSY AYGDTNGDFSMLSLVGNPRAINPSKELITRVKNDENLKSKTQIIIERKNVIYKLNSDVEL IEF >gi|296153962|gb|ADVK01000045.1| GENE 23 24974 - 25396 500 140 aa, chain + ## HITS:1 COG:FN0893 KEGG:ns NR:ns ## COG: FN0893 COG1959 # Protein_GI_number: 19704228 # Func_class: K 
Transcription # Function: Predicted transcriptional regulator # Organism: Fusobacterium nucleatum # 1 140 1 140 140 239 100.0 8e-64 MKIKNEVRYALQIIYYLTLNRDKDIISSNEISAEENIPRLFCLRIIKKLEKAGVVKIFRG AKGGYVLTRDPKRLTFRDIIEIIDDDIVLQPCIDSSTICSTRGADCSIRHALKKIQDDLL DDFDKINFYDLVENNASLQI >gi|296153962|gb|ADVK01000045.1| GENE 24 25427 - 25963 453 178 aa, chain + ## HITS:1 COG:no KEGG:FN0894 NR:ns ## KEGG: FN0894 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 178 1 178 178 245 85.0 5e-64 MDIQFENKYYADDKMGKEYVNKILAKNSRGIILLYILIFLSILINWILRGNLSEIYWMIA CIVILLLFNFYNQKYLFRLLKRTDRSIHNDQSYPTLVQFGNNIFMQEGKFSMELDYSKIV KIYYLKYSYVLMFTNSNGIMVKYDSFTKGNFEDFKEFIKENCKKAKIIVKNKSYIFGL >gi|296153962|gb|ADVK01000045.1| GENE 25 26009 - 26281 334 90 aa, chain - ## HITS:1 COG:no KEGG:FN0895 NR:ns ## KEGG: FN0895 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 2 90 66 154 154 103 98.0 2e-21 MSDTPDFEIIFQELNEVKRDDFVEEIVPEIKSKFEEENDTLEDKTYKEEFYIENNERKKE LKSKMENYIKEVIFDEKKAMKDEENKNLLI >gi|296153962|gb|ADVK01000045.1| GENE 26 26278 - 26472 223 64 aa, chain - ## HITS:1 COG:no KEGG:FN0895 NR:ns ## KEGG: FN0895 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 64 1 64 154 98 100.0 8e-20 MDKTLKEKIIDSTFKGIDKIIENTYKNHPDEKSYSGCRIQEGYDDYLKIVFKKRKIEYHK DDFF >gi|296153962|gb|ADVK01000045.1| GENE 27 26496 - 26879 514 127 aa, chain - ## HITS:1 COG:no KEGG:FN0896 NR:ns ## KEGG: FN0896 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 127 1 127 127 229 100.0 4e-59 MKKEEKVVEVLREEGYKVSEEDVLIAQLIPSFLGGFFTIIPKQVFLAYNNKEFFVIAATL MKGDPDKNKIKHYSLDEVNVQMKNGLLLNGKLILTHSDGKRENYRAMKFAFGSLASNNFK KALQKFQ >gi|296153962|gb|ADVK01000045.1| GENE 28 26983 - 27651 270 222 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|241889384|ref|ZP_04776685.1| 30S ribosomal protein S8 [Gemella haemolysans ATCC 10379] # 16 222 14 216 216 108 33 8e-23 
MKELKDFIKNKKIDFKKLEKFGFELIDNSYYYHTFLLKNQFKMSVKINLDNSIFTEIIDT ETNEPYILYLLEMKRSGYSEKVYKAYNEVLEKIQKECFEDQIFKASYTKQIIAYVKNKYG DELEFLWEKSPKNAVVRRKSSKKWYAVILTVSKRKLNLDSDEIIEVINLHNTVEEIKKLI DNKRYFPAYHMNKKYWCTICLDGTVELEEIYKLIDISYELAK >gi|296153962|gb|ADVK01000045.1| GENE 29 27726 - 29597 1763 623 aa, chain + ## HITS:1 COG:FN0898_2 KEGG:ns NR:ns ## COG: FN0898_2 COG1533 # Protein_GI_number: 19704233 # Func_class: L Replication, recombination and repair # Function: DNA repair photolyase # Organism: Fusobacterium nucleatum # 294 623 1 330 330 549 96.0 1e-156 MLYIVTALYIEAKPLISLFNLKKDNTFTKFQVFSNENIKLIISGTGKIKSATALTYLISN KDIKENDYIINIGFIASSNNNSQLGDIVYISKIQNAYSDTTFYPEMIYKHNFLEGSLITF DKIIEKKIENVEYIDMEAYGFFQTASIFFKKDKIFLLKIVSDILKENVEDRILIDFKDDN LFNKSYKKIYDFLLKFINIPDNNKNNFNNNEQDLIKKVLENLKLSDTMTYEFFNILKYLK IKYGNIDILKKYENIEVNSKVQGKKIFEEIKEFSKLNNKVEIERKTSNNKNSNLFNNRFS HIYVEKKILNNKNTLEILSKFKDVKIIEIDNYKEVFSSNNQDFHLQKLGQKLILASNKPN MIYEGAVVCESFENDNFYYTSSIINCVYDCEYCYLQGVYSSGNIVIFIDIEKVFEEVEEL YNKLRTLYLCVSYDTDLLAIESICGFSEKWYYFIEDKKDLKIELRTKSGNIDKFLNLKPL DNFIIAFTLSPENLALKNEKYTASFKNRVKAIKELQEKGWKVRICIDPLIYSDNFEKNYS QMIEYLFNEIDKEKVTDVSIGVFRISKEYLKKMRNQKQNSEILYYPFECIDGVYTYSDKT KSYMINFVKEQFLKYININKIYM >gi|296153962|gb|ADVK01000045.1| GENE 30 29781 - 30497 825 238 aa, chain + ## HITS:1 COG:mlr2833 KEGG:ns NR:ns ## COG: mlr2833 COG1028 # Protein_GI_number: 13472508 # Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) # Organism: Mesorhizobium loti # 3 184 8 186 271 95 31.0 8e-20 MKIALVTGATSGIGYEISKRLLKMNYTVYGIGRNFIKNNENIFEEYENFIPVTCDLSKLD DLEKTLHSLKKIKFNLIVNSAGIGYFGLHEEMNISKIKNMIAVNLQAPLVISQYFLKTLK ENKGMIINISSVTANKESPLASVYSATKAGLSQFSKSLFEEVRKNDVKVITVYPDMTKTN FYDNNTYFECDDDEKAYIKMEDIGNTIEFILNQSENIVFTDITIKPQRHKIKKVKRKE >gi|296153962|gb|ADVK01000045.1| GENE 31 30504 - 31286 1030 260 aa, chain + ## HITS:1 
COG:FN0900 KEGG:ns NR:ns ## COG: FN0900 COG1235 # Protein_GI_number: 19704235 # Func_class: R General function prediction only # Function: Metal-dependent hydrolases of the beta-lactamase superfamily I # Organism: Fusobacterium nucleatum # 1 260 1 260 260 494 99.0 1e-140 MKISILGSGSAGNSTFVEIEDYKLLVDTGFSCKKTEEKLEKIGKKLSDISAILITHEHSD HINGAGVIARKYDIPIYITPESYRAGAAKLGQIDKSLLNFIDGDFILNDKVKVSPFDVMH DAERTIGFKLETQLNRKIAISTDIGYITNIVREYFKDVDAMVIESNYDFNTLMNCAYPWN LKERVKSRNGHLSNNECAKFIKEMYTDKLKKVFLAHVSKDSNNISLIKETLEDEFIGMIR KPNCEITTQDNVTKLFDIDE >gi|296153962|gb|ADVK01000045.1| GENE 32 31296 - 31883 752 195 aa, chain + ## HITS:1 COG:FN0901 KEGG:ns NR:ns ## COG: FN0901 COG1573 # Protein_GI_number: 19704236 # Func_class: L Replication, recombination and repair # Function: Uracil-DNA glycosylase # Organism: Fusobacterium nucleatum # 1 195 1 195 195 346 100.0 2e-95 MDEISELWEDLKFEAGSIGNELLPKDRQEVYIGMGDRNADILFVGNDPKLYLAEDYKVES KSSGAFLIRLLDVVEYLPETYYITTLSKREIKIKNFNEEERKKLIDLLFMQILLISPKIV VFLGKEVAQLIENKEIDFDNERGQFKKWRGDIETYLTYDVETVIKARNDSGKKAAIALNF LNDMKNIKERLNNDE >gi|296153962|gb|ADVK01000045.1| GENE 33 31873 - 32418 889 181 aa, chain + ## HITS:1 COG:FN0902 KEGG:ns NR:ns ## COG: FN0902 COG0212 # Protein_GI_number: 19704237 # Func_class: H Coenzyme transport and metabolism # Function: 5-formyltetrahydrofolate cyclo-ligase # Organism: Fusobacterium nucleatum # 1 181 1 181 181 295 98.0 2e-80 MMNKKEARTLIKERRMNLSKEYIDVASDKIFEKLLQNEDFKNAKTVMSYMDFKNEVKTDR INTFIKNSGKILVLPKVVDKETMIVIEDKNQYIVSPFGNKEPDGEEYKGSIDVIITPGIA FDRDKNRVGFGRGYYDRFFVKQPNAKKIAIAFEKQIIDEGIETDKYDKKVDILITEDDII K >gi|296153962|gb|ADVK01000045.1| GENE 34 32440 - 33411 1375 323 aa, chain + ## HITS:1 COG:FN0903_1 KEGG:ns NR:ns ## COG: FN0903_1 COG0794 # Protein_GI_number: 19704238 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted sugar phosphate isomerase involved in capsule formation # Organism: Fusobacterium nucleatum # 1 206 1 206 206 382 98.0 1e-106 
MLDTEIIEIAKNIYDTEIKSLELRMNKLSENFVKVVRKIYDCKGKVVVTGIGKTGIIGKK ISATFASTGTTSIFMNSTEGLHGDLGIINQEDIVLAISNSGESDEILAIMPAIKNIGAYI IAMTGNINSRLAKASDLYINTHVEEEGCPINLAPMSSTTNALVMGDALAGCLMKLRNFSP QNFAMYHPGGSLGRKLLTRVGNLMKTGEALALCKADTSMEDIVILMSEKKLGVVCVMNDE NNILVGIITEGDIRRALSHKEEFFKLKAKDIMTTKYTKVDKGEMATQALSIMEDRPHQIN VLPVFDNDKFVGVIRIHDLLKVR >gi|296153962|gb|ADVK01000045.1| GENE 35 33421 - 35004 1701 527 aa, chain + ## HITS:1 COG:FN0904 KEGG:ns NR:ns ## COG: FN0904 COG2509 # Protein_GI_number: 19704239 # Func_class: R General function prediction only # Function: Uncharacterized FAD-dependent dehydrogenases # Organism: Fusobacterium nucleatum # 1 527 1 527 527 969 96.0 0 MKVNISNIIVSINKNQEKEIYRELEKNGISRDNIENLKYLKKSIDSRKKNDIKFIYTLEI SLRKNINLEKYSKLSLVKEDVYEKRMPLYPKREVAVVGTGPAGLFSALGLAELGYIPIVF ERGEEVDKRNITTDNFIKTNILNPNSNIQFGEGGAGTYSDGKLNTRIKSEYIEKVFKEFI ECGAQEEIFWNYKPHIGTDVLRIVIKNLREKIKSLGGKFYFNSLVEDIEVKNNEIKALKI LEVDTQKRYTYDIDKVIFAIGHSSRDTYKMLHSKGVAMENKPFAIGVRIEHLRKDIDKMQ YGEAVSNPLLEAATYNMAFNNKKETRGIFSFCMCPGGEIVNASSELGASLVNGMSYSTRS GKFSNSAIVVGISEKDYGDQIFSGMYLQEKLEKKNYEIVGTYGAIYQNVIDFMKHKKTTF EIESSYKMKLFSYDINNFFPDYITRNLQSAFENWSKNNLFISERVNLIGPETRTSAPIKI LRDLKGESISIKGLFPIGEGAGYAGGIMSAAVDGIKIVDLAFSKKIV >gi|296153962|gb|ADVK01000045.1| GENE 36 35054 - 35377 379 107 aa, chain - ## HITS:1 COG:no KEGG:FN0905 NR:ns ## KEGG: FN0905 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 107 1 101 101 156 98.0 2e-37 MKKIILMLVSVLIINACTSTKNAPFNEVEASLNQKYGALSNEYYKMLENPIVEKDRRSIL NKFESFRTEVRDLKKNRKNPSSNETRVLNSFIDKSSTNIQYLNDLAE >gi|296153962|gb|ADVK01000045.1| GENE 37 35396 - 36403 1442 335 aa, chain - ## HITS:1 COG:FN0906 KEGG:ns NR:ns ## COG: FN0906 COG0240 # Protein_GI_number: 19704241 # Func_class: C Energy production and conversion # Function: Glycerol-3-phosphate dehydrogenase # Organism: Fusobacterium nucleatum # 1 335 1 335 335 613 99.0 1e-175 MVKISVIGSGGWGIALAILLHKNGHNLTIWSFDKKEAEELKMNRQNKTKLPNILLPEDIK 
VTNNLKEAVDNKDILVLAVPSKAIRSVSKSLKDIIKDNQTIVNVAKGLEEDTLKTMTDII EEELKEKNPQVAVLSGPSHAEEVGKGIPTTCVVSAHNKELTLYLQNIFMNPSFRVYTSPD MIGVEIGGALKNVIALAAGIADGLNYGDNTKAALITRGIKEISSLGVAMGGEQSTFYGLT GLGDLIVTCASMHSRNRRAGILLGQGKTLDEAIKEVNMVVEGIYSAKSALMAAKKYNVEI PIIEQVNAVLFENKNAAEAVNELMIRDKKLEIQSW >gi|296153962|gb|ADVK01000045.1| GENE 38 36597 - 37268 785 223 aa, chain - ## HITS:1 COG:FN0907 KEGG:ns NR:ns ## COG: FN0907 COG4123 # Protein_GI_number: 19704242 # Func_class: R General function prediction only # Function: Predicted O-methyltransferase # Organism: Fusobacterium nucleatum # 1 223 1 223 223 373 99.0 1e-103 MLKDDEIIEKLDEKFEIIQKVGGYKYGEDTILLFKLFQASLNKKNIKLLDIGTGNGILPI LLSDNEFLSELIGIDIQKENIERANKALKLNRIEKNIQFECMDVKEYKKSNYFDVIISNP PYMDDNGKKINENEHKAISRHEIKLSLSELISNAKRLLKPIGLLYFIHRTHRLVEIIKTL DKNNFSVKKIIFIYSAQNNKSTMMFVEAIKGKQVKLEIENYYI >gi|296153962|gb|ADVK01000045.1| GENE 39 37270 - 38208 1082 312 aa, chain - ## HITS:1 COG:FN0908 KEGG:ns NR:ns ## COG: FN0908 COG1774 # Protein_GI_number: 19704243 # Func_class: S Function unknown # Function: Uncharacterized homolog of PSP1 # Organism: Fusobacterium nucleatum # 1 312 1 312 312 574 100.0 1e-164 MENNIIDVTDINTEVISIDPEKLHTVLIVTFETTKKRYYFEVLGNEIFKKNDKVIVETIR GTELGIASNNPIQMKEKDLVLPIKPVLKLASEREIEIYNLQRKEADDAFIACKEKIQKHQ LEMKLVACEYTFDKSKLIFYFTANGRIDFRELVKDLAILFKTRIELRQIGVRDEARILGN IGPCGKELCCKTFINKFDSVSVKMARDQGLVINPTKISGVCGRLLCCINYEYTQYEEALK NFPAVNQIVKTEIGEGKVVSISPLNNFLYVDVRDKGISRFDIKDIKFNRKEASILKNMKT QEEIENKILEKE >gi|296153962|gb|ADVK01000045.1| GENE 40 38229 - 38927 912 232 aa, chain - ## HITS:1 COG:FN0909 KEGG:ns NR:ns ## COG: FN0909 COG2003 # Protein_GI_number: 19704244 # Func_class: L Replication, recombination and repair # Function: DNA repair proteins # Organism: Fusobacterium nucleatum # 1 232 1 232 232 399 99.0 1e-111 MSEKDNQGHRERIREKFLNNGIDGFAEYEILELLLTYCIPRKDTKPIAKELLNKFKSLDN VFKASFDKLSAIDGLGKNSITFLKLLGELPSIIYKDELKNKKLIDKETLKISNKDILLKY LRNKIGYEEIEKFYVIYLSSSNEVIEFEENSVGTLDRSSVYPREIYKKVINLNAKSIILA 
HNHPSDNITPSKSDIELTNEIAKGLKNFGALLIEHIIITKNSYFSFLEEGLI >gi|296153962|gb|ADVK01000045.1| GENE 41 38937 - 40001 1363 354 aa, chain - ## HITS:1 COG:FN0910 KEGG:ns NR:ns ## COG: FN0910 COG2038 # Protein_GI_number: 19704245 # Func_class: H Coenzyme transport and metabolism # Function: NaMN:DMB phosphoribosyltransferase # Organism: Fusobacterium nucleatum # 1 354 1 354 354 669 98.0 0 MKDKNSLFDLINKIESIDNVSIKKAQTELDRKMKPKDSLGVLEDICKKAAGIYGYPLKKL EKKCHIVVAADNGIIEEGVSSCPIEYTSIVSEAMLNQIACIGIFTKTLGIDLNVVDIGMK NDIKRNYPNLIHKKIRRGTYNFYKERAMSINECLEAIFTGIDIINEKSKDYDMFSNGEMG IANTTTSSALLYSVTRQNIDDIVGRGGGLSDEGLNKKKKVIIEACERYNTFEMDAVEMLA SVGGFDLACMLGIYIGAALNKKLILVDGFISSVAALLACSLNKNIQDYLLFTHKSEEPGV NIILDYLKEKTFLNMNMRLGEGTGAVLACPIIDCAIEMINTMKSPEEVYNLFNK >gi|296153962|gb|ADVK01000045.1| GENE 42 40016 - 40591 595 191 aa, chain - ## HITS:1 COG:FN0911 KEGG:ns NR:ns ## COG: FN0911 COG0406 # Protein_GI_number: 19704246 # Func_class: G Carbohydrate transport and metabolism # Function: Fructose-2,6-bisphosphatase # Organism: Fusobacterium nucleatum # 1 191 1 191 191 364 100.0 1e-101 MGKLILIRHGQTEMNAQSLYFGKLNPPLNDLGISQAYQAREKLLNIDYDNIYSSPLERAK QTAEICNYLDKDIVYDSNLEEINFGIFEGLTFKEISEKYPVEVKKMKEDWKEYNYVTGES PKEMLQRAVSFLEILDYTKNNLIVAHWGIINSIISYFISGNLDSYWKFKIQNASIVIFEG NFEFSYLTKLD >gi|296153962|gb|ADVK01000045.1| GENE 43 40596 - 41432 867 278 aa, chain - ## HITS:1 COG:FN0912 KEGG:ns NR:ns ## COG: FN0912 COG0368 # Protein_GI_number: 19704247 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin-5-phosphate synthase # Organism: Fusobacterium nucleatum # 1 278 1 278 278 432 99.0 1e-121 MKGFLLLLSFMTRIPMPKTDYDEEKLGKSMKYFPVVGIIVGFILLFFCIIFNFILKNISY SAVLPLMIIVVILTDLITTGALHLDGLADTFDGIFSYRSKHKMLEIMKDSRLGSNGALAL ILYFLLKFILLFSLTIESREAAVYAIITYPVVSRFCSVVSCASSPYARGSGMGKTFVDNT KTCGLIVATVITLLYTIGMLFMPFILFTNYSLPMQFMIRSVLIIVIIVGLLALFAFAFSK LIERKIGGITGDTLGALLEISSLVYIFLFLVIPTFFIG >gi|296153962|gb|ADVK01000045.1| GENE 44 41445 - 42008 715 187 aa, chain - ## HITS:1 COG:FN0913 KEGG:ns NR:ns ## COG: FN0913 
COG2087 # Protein_GI_number: 19704248 # Func_class: H Coenzyme transport and metabolism # Function: Adenosyl cobinamide kinase/adenosyl cobinamide phosphate guanylyltransferase # Organism: Fusobacterium nucleatum # 1 187 1 187 187 358 99.0 2e-99 MGKIIFFTGGSRSGKSKFAEEYIYENQYKNKIYFATAIAFDKEMQDRIEMHVKRRDNTWK TVEGYKNLVSLVKNDIDNVDVILFDCVTNFVSNYMLMDSEIDWDNVDLSVVHGIEDKIEE ETINFLEFVKSKKCDCVFVTNEIGSGLVPDYPLGRYFRDICGRINQLIAKNSDEAYLAVS GIKLKIK >gi|296153962|gb|ADVK01000045.1| GENE 45 42070 - 42801 756 243 aa, chain - ## HITS:1 COG:no KEGG:FN0914 NR:ns ## KEGG: FN0914 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 243 1 243 243 426 99.0 1e-118 MKLLKSVFTLILYFIFSLSLFAEVKFSNDFNLDIKKKFSDLEIKTMYRDLSLNDKVSFSC FNNAIHGLEKIEDLEIFDSSNDNLLVMVDYTKPSTEERLFIIDLRKKQLLISSLVAHGRG TGDLYATNFSNKNNSYSTSSGFYLTGNIYNGKNGESLELYGLEKGKNDNARKRTIVIHSA YYANKSFAEKYGRLGRSKGCLVLPTDLNTKIINLIFGGVVLYVHTNFDENKEYDFSKLLS KSF >gi|296153962|gb|ADVK01000045.1| GENE 46 42816 - 43310 651 164 aa, chain - ## HITS:1 COG:FN0915 KEGG:ns NR:ns ## COG: FN0915 COG2190 # Protein_GI_number: 19704250 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system IIA components # Organism: Fusobacterium nucleatum # 1 164 1 164 164 288 99.0 4e-78 MGLFDIFKKKEKTIVTIYSPINGKVIELKEVPDEAFAQKMVGDGCAIEPDKGIICSPIDG QLMNIFPTNHAIIFETIDGLEMIVHFGIDTVKLDGKGFQKLREPGPIKIGDEIVKYNLDE IKDGVPSTRSPIIINNMEKVEKIEILSLGKVVKISEPIMKVTLK >gi|296153962|gb|ADVK01000045.1| GENE 47 43338 - 43787 669 149 aa, chain - ## HITS:1 COG:FN0916 KEGG:ns NR:ns ## COG: FN0916 COG3187 # Protein_GI_number: 19704251 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Heat shock protein # Organism: Fusobacterium nucleatum # 1 149 1 149 149 248 100.0 3e-66 MKKFFILGITAVALTACTDVNVPFMSSAKTESSTSSSTPVFANLKEQLNGREFVIVTEGY NKKTSIGFQGDRVYGFSGINRYFGNYQISGGKFVFDDFGLTQMAGSEEEMTKELQFLDLL RKNKSIKLSGDTLTLISTEGIELVFKKTK >gi|296153962|gb|ADVK01000045.1| GENE 48 43888 - 44904 941 
338 aa, chain + ## HITS:1 COG:no KEGG:FN0917 NR:ns ## KEGG: FN0917 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: Purine metabolism [PATH:fnu00230]; Pyrimidine metabolism [PATH:fnu00240]; Metabolic pathways [PATH:fnu01100]; DNA replication [PATH:fnu03030]; Mismatch repair [PATH:fnu03430]; Homologous recombination [PATH:fnu03440] # 11 332 1 322 322 508 99.0 1e-142 MFYFLYGNSPMIEFETEKITDEILEKYPNIIPKFFDCSLKEENEFLSALQINSIFKTVDF LVLKRSENLKSSGIQKLFKSIKNYSLDEKNIIIIYNVPIQYGKVVSDYELTKVSIKLIEE LATFKDCTMIKESKATLNYVKQNLNITEKDAKNFIELLGDDYYHIKNETNKVANFLEGQP YSFEKIKNLISIDKDYNMKDLIENFLKTKNFSDILSFLEKNKDSYLGFIYMLTDELINLL KLTSLIKSGKISKNMNYNIFKELYNDFSDLFIGKNFKAQHPYTIFLKLNSFENFSEEFLE KRLKELLEIEYKVKSGERDIDIETEVFLGKFFNSKIFK >gi|296153962|gb|ADVK01000045.1| GENE 49 44930 - 45712 967 260 aa, chain + ## HITS:1 COG:no KEGG:FN1144 NR:ns ## KEGG: FN1144 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 9 255 2 246 249 188 46.0 2e-46 MIKILRKSIFIILAIFSITFIVTCGKKDNLDKKEIIEKFIEASENTKSMDGTIDIKTEME LNGNKVVENISMDSSLILDPFMMKVEMQVPNHPKILGYLDEEFIYFLNPQNNKWYKQNMN DLFGKRFKTYITNTDPFYDVIKDNIDKIDIEEKNGNYIISISKESQIFKKATEKQISSIN AIANSSLENETKIENMSAVYTVDKNTYLTTSSSMSYDLKIPGNGKMTIDVVCKMRNINKI ENITIPEEAKNATTINNIKK >gi|296153962|gb|ADVK01000045.1| GENE 50 45811 - 46737 1375 308 aa, chain - ## HITS:1 COG:FN0920 KEGG:ns NR:ns ## COG: FN0920 COG0501 # Protein_GI_number: 19704255 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Zn-dependent protease with chaperone function # Organism: Fusobacterium nucleatum # 1 304 1 304 309 553 96.0 1e-157 MKGLAELKNKIVKAPHLNIFKIGTWVTMGLFATFLLVYIFVGDEMLNYFPLLILFAFATP FISLMMSKASVKRAYNIRMIGDGEARTEKEKLVVDTITLLSQKLGLQRLPEIGVYPSNDV NAFATGASKNSAMVAVSQGLLNNMNETEIIGVLAHEMSHVVNGDMLTSSILEGFVSAFGV IATLPFLMGENNNRGRRAASSMATYYMVRNVANIFGKIVSSAYSRRREYGADKLAAEITD PSYMKSALLRLQEISEGRISLQNSDREFASFKITNNFSMGNIFGNLFASHPSLAKRIAAI ERMENKEF >gi|296153962|gb|ADVK01000045.1| GENE 51 46909 - 
47679 786 256 aa, chain - ## HITS:1 COG:no KEGG:FN0921 NR:ns ## KEGG: FN0921 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 7 252 1 246 309 397 98.0 1e-109 MYIINIMWLVGPVALFTVLFILLISNSLIKSKKVFFAFFVLILVYAILGVACYYFYEVFL SNQYISLLISLSCIGLGAFINLIIVYLEIAKLTKKDSKTELLILQHDIEKNIQVEDKWFN ILFSYTADKWVVSDINADLFSKLDEGNFEEDSAEIIEMNKEIKIINGNYKFLKRNLRRKY FFLKNLKSINNLEDKEEVKNLITKKELEFSFSKETDNTIELIKTLSNELLNLLRIEENDK TKNMEENLARTIWSFI >gi|296153962|gb|ADVK01000045.1| GENE 52 47767 - 48705 1038 312 aa, chain - ## HITS:1 COG:FN0922 KEGG:ns NR:ns ## COG: FN0922 COG2334 # Protein_GI_number: 19704257 # Func_class: R General function prediction only # Function: Putative homoserine kinase type II (protein kinase fold) # Organism: Fusobacterium nucleatum # 1 312 1 312 312 478 98.0 1e-135 MGVFINLFQDEIDFIEEKYKIKILEIKNIDNGILNSNFYVEAKNKKYILRIYEANRTIDE EKQELILLDKIANFIPVSKAIRNIDNEYISILNNKKFALFEYVDGNSITKIDTYIIREIA MNLGKLHSFSKEISFEKYNRKTRIDFNFYYNEIKKSKIDFKFKKELLNLADEINSYDFST LPSGIIHGDIFSDNVLLDEYNNIKVIFDFNESYYAPFIFDIAIVINFWIRIKDFDFFDEN NFIRDFLNYYSKYRKITKEELKLLDIACKKTALTFIFLRVYKEKIENSYQKAISIEEKSY LDLIKLIDEYEK >gi|296153962|gb|ADVK01000045.1| GENE 53 48717 - 50156 1290 479 aa, chain - ## HITS:1 COG:FN0923 KEGG:ns NR:ns ## COG: FN0923 COG1502 # Protein_GI_number: 19704258 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes # Organism: Fusobacterium nucleatum # 1 479 1 479 479 872 99.0 0 MQDISKILLTLVQLFLQYVWVANLFFIIVIIMIEKKNPLYTILWIFLLTLVPYVGFFIYL FFGLTFKKKRVANKIYKIKKLKSRKDVSKSDNEELKRWKGLITYLEMSTDNHISSNNDIQ VYFTGEDFFPELKKEIANAKKFINMEYFIFQFDGIGKEIADLLIEKAKEGVEVNLIIDGV NLANFRLKLYFKNTGVNLHLFFRTYIPIFNIRLNYRNHRKVTIIDNRVAFVGGMNIGDEY LGKGKIGYWRDTSVKIYGDIVSSFEKEFYFSLSIVKNEFLKDEKFSNEISLKYEEDEGIY MQLISSGPNYEFPAIRDNYIKLIQEARKSVFIQTPYFVPDDLLLDTLKSAVLSGIDVKIM IPNKADHPFIYWVNQYYVWELLRLGANIYRYENGFIHSKTILVDEEVVSVGTCNFDYRSF
YLNFEINLNIYNKEVANSFKVQYYKDITISKKLTFADFKKRSIFTKVKESVFRLLSPIM >gi|296153962|gb|ADVK01000045.1| GENE 54 50074 - 50268 106 64 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MIKNKFATQTYCKKSCTNVNKILEISCIAIPQLIFYIWYIIAFFFNFSKKLKKYFKIIDK NLYI >gi|296153962|gb|ADVK01000045.1| GENE 55 50314 - 51111 893 265 aa, chain + ## HITS:1 COG:no KEGG:FN0924 NR:ns ## KEGG: FN0924 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 57 265 1 209 209 360 99.0 4e-98 MSDNNMKFNLFIGENFNELISLPTNRIIIRNLLSVADRDVVVLNNSLSLPELVQKLMDKI LYGRKEIVEIISNIFSMENKPDLTFYNSIFDSNIFSSIISTNYDYTAEENFLNLIKISTP FNVSNDESGRIAFYKVYGDYKDRDKVVISTQDVKRIKMLAFYNEFWEKLRAEFNKRPTIL FTVNLEDKVFLDVLDFIIAKTDRLQPIYLYTGDEIDKLLTDKDIISFINKYSIEIIKGEN KEFIANVKEKFFGEKKSGDVQQNYA >gi|296153962|gb|ADVK01000045.1| GENE 56 51150 - 52028 755 292 aa, chain + ## HITS:1 COG:no KEGG:FN0925 NR:ns ## KEGG: FN0925 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 292 1 292 292 446 100.0 1e-124 MQDDKEDKVEEIKEEVKEEIFIEETIENKEINLTLKENDFFETASEIKFNSMFLDYFPIK YRNFSKMFVPLKITSLGVTNVDFGFTVLDNVSVKILEFSKFKLIEFRKKEFRIAIDSEDD LFEYEIFKNIKNPKLRYIFEFFTNLFHGANIKFNFSEDKYELDFHNHIEHFKFITLNEFL TQYEKLVTDLRLYKYKNLSSAENSFYELDLLDKCNNLVESSSWVNAKIKCDSNVNVGDTL TINRLHKIKFDNFPYDVEELITTHPLTKGEIKFGVINLNRKAVKIKLNKVYK >gi|296153962|gb|ADVK01000045.1| GENE 57 52041 - 52820 1045 259 aa, chain + ## HITS:1 COG:FN0926 KEGG:ns NR:ns ## COG: FN0926 COG2357 # Protein_GI_number: 19704261 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 259 1 259 259 471 99.0 1e-133 MDKLIKEEFFKEFSIDEDYFLSTGLVWNELENIYEDYIELVPLLEKEAEYVVSKLIDVPS VHSVRRRVKKPTHLIEKIIRKGKKYQERNISVLNYKEIVTDLIGIRVLHLFKDDWQNIHH EILNLWDIKETPQVNIRRGDYNLSQFKETIKDINCDVIVREHGYRSVHYLVGIDITKTLN ISVEIQVRTVFEEAWSEIDHIMRYPYDVDNPIITEYLGIFNRIVGSADEMGTFLKKVKEN FGTVRDTDEVQRELDIKFK >gi|296153962|gb|ADVK01000045.1| GENE 58 52872 - 53600 556 242 aa, chain - ## HITS:1 COG:no 
KEGG:FN0927 NR:ns ## KEGG: FN0927 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 37 242 9 219 219 80 33.0 5e-14 MFLKIFFIDYCVLYEKIKKQDYDEEKKNSKWYKKVYKYCLNNWENLLSYQKYTLILVFLV FLIGLTVAALTKDILYYFIGIFLNYFIFSIAITYFFSIKNQEKMLPVYKREYQQRMQVVS EILKIHSIDYKDEVKINFLIEKLREKRNEKPLLKILKKFFFSSVPLTILYIIREKILNII ENGNLVFVFFIYFITVFVILLIKIIYDNTYGIYVNKYKKYYDYLIDDLKQILIFNNKFKE EN >gi|296153962|gb|ADVK01000045.1| GENE 59 53703 - 54347 175 214 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|238855674|ref|ZP_04645973.1| ribosomal protein ala-acetyltransferase [Lactobacillus jensenii 269-3] # 43 211 1 180 380 72 28 8e-12 MLILGIDTSTKICTCSIFDSENGVIAETSLSVKKNHSNIVMPIIDNLFKISDLTINDIDK IAVAIGPGSFTGVRIALGIAKGLAMALNKPLIAINELDILEAIASGNENEIIPLIDARKE RVYYKYQNKYVDDYLINLTSNFDKNKKYIFVGDGAIKYKNILKDNLGENAIILPMYNSFP RASILCELAINKEEANIYTLEPEYISKSRAEKHF >gi|296153962|gb|ADVK01000045.1| GENE 60 54328 - 54789 671 153 aa, chain - ## HITS:1 COG:FN0929 KEGG:ns NR:ns ## COG: FN0929 COG0802 # Protein_GI_number: 19704264 # Func_class: R General function prediction only # Function: Predicted ATPase or kinase # Organism: Fusobacterium nucleatum # 1 153 1 153 153 253 100.0 7e-68 MEKVLTFNQIDELAKKLANYVEENTVIALIGELGTGKTTFTKTFAKEFGVKENLKSPTFN YVLEYLSGRMPLYHFDVYRLCNSEEIYEIGYEDYINNGGVALIEWANIILEDLPKEYIRI EFKYTTKEDERLVDIRYIGNKEKEAKFNADFGN >gi|296153962|gb|ADVK01000045.1| GENE 61 54803 - 55267 767 154 aa, chain - ## HITS:1 COG:FN0930 KEGG:ns NR:ns ## COG: FN0930 COG2870 # Protein_GI_number: 19704265 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase # Organism: Fusobacterium nucleatum # 1 154 7 160 160 280 98.0 8e-76 MNIDRKLASKLVEEAKRSGKKVVFTNGCFDILHAGHVTYLNEAKRQGDILIVGVNSDKSV KKLKGETRPINSENDRAFVLDGLKAVDYTVIFDEDTPEELIAYLKPSIHVKGGDYKKEDL PETKIVESYGGEVIILNFVEGKSTTNIIEKINKK >gi|296153962|gb|ADVK01000045.1| GENE 62 55254 - 55871 732 205 aa, chain - ## HITS:1 COG:FN0931 KEGG:ns NR:ns ## COG: FN0931 COG0494 # 
Protein_GI_number: 19704266 # Func_class: L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases including oxidative damage repair enzymes # Organism: Fusobacterium nucleatum # 1 205 1 205 205 356 97.0 2e-98 MNKNRILLRDRYFESAVMICIANIDGKDCFILEKRAKNIRQAGEISFPGGKKDKKDKNFR ETAIRETLEELQIKRKAITNISKFGILVAATGVIIECYLCKLNIKSLDEINYNKDEVERL LVVPIAFFIKNKAIKGEVEISNIAKFDIKKYNFPERYEKDWKIPSRYVYIYMYENEPIWG MTAEIICDFIKTLKEDGKVGFYEYR >gi|296153962|gb|ADVK01000045.1| GENE 63 56067 - 56552 582 161 aa, chain + ## HITS:1 COG:no KEGG:FN0932 NR:ns ## KEGG: FN0932 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 161 7 167 167 231 98.0 6e-60 MLFYEDLVRKIEEEKIENIEKIEKFKLNGTKSLVGYGIAIPLILIGLFEVYSYTIYHKWY LLLIGVIFLGIGLKQFKTILTYSYVIDTETKNLKFGKLNLKFDNIQSGTLKEMKLGKRVV TVIDMITNDRKQIVIPLFMAKQIRFILLIKEILAERFSIKK >gi|296153962|gb|ADVK01000045.1| GENE 64 56562 - 57824 1314 420 aa, chain + ## HITS:1 COG:FN0933 KEGG:ns NR:ns ## COG: FN0933 COG0128 # Protein_GI_number: 19704268 # Func_class: E Amino acid transport and metabolism # Function: 5-enolpyruvylshikimate-3-phosphate synthase # Organism: Fusobacterium nucleatum # 1 420 4 424 424 719 96.0 0 MNKKIIKADKLVGEVTPPPSKSVLHRYIIASSLAKGISKIENISYSDDIIATIEAMKKLG ANIEKKDNYLLIDGSKTFDKEYLNNDSEIDCNESGSTLRFLFPLSIVKENKILFKGKGKL FKRPLSPYFENFDKYQIKYSYINENEILLDGVLKNGEYEIDGNISSQFITGLLFSLPLLN GNSKIVIKGKLESSSYIDITLDCLNKFGINIINNSYKEFIIEGNQTYKSGNYQVEADYSQ VAFFLVANSIGSNIKINGLNVNSLQGDKKIIDFISEIDNWTKNEKLILDGSETPDIIPIL SLKACISKKEIEIVNIARLRIKESDRLSATVQELSKLDFDLIEKEDSILINSRKNFIYNS KEIVSLSSHSDHRIAMTVAIASTCYEGEIILDNLDCVKKSYPNFWEVFLSLGGKIYEYLG >gi|296153962|gb|ADVK01000045.1| GENE 65 57808 - 58881 1492 357 aa, chain + ## HITS:1 COG:FN0934 KEGG:ns NR:ns ## COG: FN0934 COG0082 # Protein_GI_number: 19704269 # Func_class: E Amino acid transport and metabolism # Function: Chorismate synthase # Organism: Fusobacterium nucleatum # 1 357 1 357 357 685 99.0 0 
MNTWGNKIRLSIFGESHGEAIGIVIDGLEAGTKLNLENINKFIERRKAGKSSFTTSRKEK DEYKILSGYKDGYTTGAPLCVIFENTNTISKDYENLKDLLRPNHADYPAGIKFKGFNDVR GGGHFSGRMTLPLTFAGAIAIDILGEKGIKIFSHIKRILDIKDKSFLDFKEKDLEKFENL KESSLPFIENDLEDKTKELLEKIKLSGNSVGGEIECSCFNLPIGLGNPFFDSLESKISHL AFSVPAIKGISFGIGFDFANILGSEANDLYYLDNNEIKTRTNNNGGILGGLSTGMPLVFS VVVKPTSSISLEQKTVNIKEMKEDILKINGRHDTCIVQRVLPVIEAIMALAILDEIL >gi|296153962|gb|ADVK01000045.1| GENE 66 58876 - 59970 956 364 aa, chain - ## HITS:1 COG:no KEGG:FN0935 NR:ns ## KEGG: FN0935 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 43 364 1 322 322 564 98.0 1e-159 MKLIINKKYKDFFLLVLALIFLFINYYLKPKKIKIENRYYSEMVIAAQKDKLLQEEILKE KINRGLEIDKSLDKNEIGLIGLEWSGITTTLGDIEAKRTSTNPDFAALLVKLFKEAGLKK GDIVAANFSSSFPALNLAFISAADTLGLKAIIITSIGSSTYGGNIEDFTYLDMENYLYSK NLIENRTIAYSLGGAGDIGKEFNKDIIEKIKNRINGYGLKFFYEKEFDKNLENRYEFYKN ESQNNIKAFINIGGNLLSLGENADIINNQKILLDESSPLKTGLVGKFLKDDIPVFYLLNI KSIALYYNLEYDPDKFSKIGTSSIYYDSSKNFWNYIIVVIFGLFILVHLIFFRLKKNKFI EISL >gi|296153962|gb|ADVK01000045.1| GENE 67 59967 - 60425 447 152 aa, chain - ## HITS:1 COG:no KEGG:FN0936 NR:ns ## KEGG: FN0936 # Name: not_defined # Def: CapC protein # Organism: F.nucleatum # Pathway: not_defined # 1 152 1 152 152 189 100.0 2e-47 MINEIMVLGVILSIVFYEITEISPGGLIVPAYFALYLDNPTKIILTIFISIITYLLLKVL SNYTIIYGRRRFTVCIILSFLIKTLLKYFNIYILNENEIYFFNIAIVGIIIPGILAQEVD RNGVIKTLSSLIILSVFIKSLIEIFFMVGANV >gi|296153962|gb|ADVK01000045.1| GENE 68 60415 - 61563 1161 382 aa, chain - ## HITS:1 COG:no KEGG:FN0937 NR:ns ## KEGG: FN0937 # Name: not_defined # Def: gamma-polyglutamic acid synthetase (EC:6.3.2.-) # Organism: F.nucleatum # Pathway: not_defined # 1 382 1 382 382 704 100.0 0 MEIIVVILSLFYILYLFFEKINLDKNRKNLKYIIHINGIRGKSTVSRLIDAGLRAGGYKV FTKTTGTSPRIIDTNAKEFEINRQGKANIREQISVITWASKEKAEVLILECMAVKPELQY VCENKILKSDIVAITNVREDHLDEMGDSLDKIADSLSNTIPKKATFFTADKNYFNFFKNR CEDKNTRAFLSKNIKNEYWEIDFPNNIALAMDICKYLNVDEKIALEGMRTYHKDPGSLKV LTYLNKKNFRIFFVNTLAANDPDSTEIILDRVCIKTYWNNERYLLVNNRADRLSRLKQFV 
NFTIKFENRFDKILISGENKNLFYKYLLKNRIDKNRIIILSDEKYFENIEDDSLIFAVGN ICRLGKKLVDYFEERGEIIDDK >gi|296153962|gb|ADVK01000045.1| GENE 69 61655 - 62791 1647 378 aa, chain - ## HITS:1 COG:no KEGG:FN0938 NR:ns ## KEGG: FN0938 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 117 378 1 262 262 525 99.0 1e-147 MKGNKITGIIMIFLSLVITFIAGKEFLKTRELEPIVKSDGVTSMQKLSDYLPALKGTRGD SDIYILQGKEPGGSVLILGGTHPNEPAAFLTTVLLVENLKVDKGTVYIIPRANGSAMSHN DPQEASPQRFTIKTPYGERWFRFGSRATNPLDQWPDPDVYIHAASGQKLSGNETRNLNRA YPGRADGTYTEKVAYAITELIKKNDINMEIDLHEASPEYPVINAIVAHERAMPISSQVVM NMEFEDIQIGLEPSPPSLHGLTHRELGDYTNTYAVLMETANASQGRLRGKTDENLVLTGK DPTYVKAQKIGRLFVPYDENGHPIEERVGRHLTGVVQHIIVMGENEPDKEIIIEGLPSYE DLQKNGVGTYLKEVKEDK >gi|296153962|gb|ADVK01000045.1| GENE 70 62803 - 62949 206 48 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MMMYIYLTIVLYVLIMVILNLLEEKSIAKQLNAALVIIPLILRILFIK >gi|296153962|gb|ADVK01000045.1| GENE 71 62966 - 64237 1488 423 aa, chain - ## HITS:1 COG:FN0939 KEGG:ns NR:ns ## COG: FN0939 COG1593 # Protein_GI_number: 19704274 # Func_class: G Carbohydrate transport and metabolism # Function: TRAP-type C4-dicarboxylate transport system, large permease component # Organism: Fusobacterium nucleatum # 1 423 1 423 423 591 99.0 1e-169 MSIELILFLAMIAVFAIACFVFKLPVSLAMVLSSITGTLIAGEGIPIRHLVEGMFGYLDT ILVIATAMIFMKVIQEIGTLNALSATIIKKFHKIPWLLLIFLMFVSMFPGMITGSSTASV LTAGSIVAPILLLIGIPVTETATIIALGGLCGMIAPPVNIPAMIIGGGIDMPYVGFTIPL LLLTIPVAIFSVLFLGLKYVKKIDYDKIKNEIDFSAMEQYGIKLYLPVILAIFLMIMDKV LPNIFGLGMPLIFIISAFVGLFCGKKINFFKVSKNAINESLPVMGILMGVGMFIQIITLT GVRGYIVVNSLSLPQSLVYIAMAITIPLFGAVSSFGASSVLGVPFLMVFLAKNQIITGSA ISFIASLGDLMPPTALAGIFAAQIVGMKDYTPVLKKSIVPAIVIIIYSILMIIFSKELAV IIY >gi|296153962|gb|ADVK01000045.1| GENE 72 64293 - 64826 715 177 aa, chain - ## HITS:1 COG:no KEGG:FN0940 NR:ns ## KEGG: FN0940 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 177 1 177 177 266 97.0 2e-70 MKKIFIALLFLLSAVSVFAAKFEKPVTLTSVGQSADVQMVKALLKKAGIEAKFDKSLTAE 
GIKDEKTIILAIGGSSKGLGAAGIKAEDELARAEKLIKAAKAKKIKIIGMHIGGEARRGE LSDKFVKVAAPYCDYLIVVDEGNKDGIFTKMSSEKKILIDTIPKITNAVDPLKKAFE >gi|296153962|gb|ADVK01000045.1| GENE 73 64845 - 66584 2450 579 aa, chain - ## HITS:1 COG:FN0941 KEGG:ns NR:ns ## COG: FN0941 COG0405 # Protein_GI_number: 19704276 # Func_class: E Amino acid transport and metabolism # Function: Gamma-glutamyltransferase # Organism: Fusobacterium nucleatum # 1 579 1 579 579 1082 99.0 0 MKRKFILVGIIAIAFSVISYGKVSSNSVKNSNVENWKPYDANGEMIRTDRGATGKIGVVS TSKVEASRIGADILKKGGNAIDAAVAAGFALGVVEPNSSGLGGGGFMLIRIAKTGETVFI DFRERAPQKASPEMWTVDANGKVVGNKKVEGGKAAAIPGEVAGLLYALEKYGTMSREQVI RPAVNLAKNGYYVTPTLSNDMKSEFDKLEKYPESAKIYLNKEGLPYEVGDKFTNKDLAKT LEIIIKKGKDGFYKGEVAQAIVKTLNKYDGLYTMEDLANYKPLIRKPVKGTYRGYEIISS PSPSSGGAIVIEILNILENFNVSELDVNSPEYLHLFSEAYKLAYADRAKYMGDTDYTPVP MKGFVSKKYAKEISKDIDMKVSHESKAHDPWQYESEDTTHYSIADKEGNMVAITKTVNGL FGNSVVVDGYGFVMNNEMDDFVVGAGHPNSVAPGKTPLSSMSPTIVLKDGKPFMVLGSPG ATKIISTVSQVISRVIDHKMSIQDAIDTPRLWDNTSNVINIESRISDETVKKLEAMGHKV NKTSDWDRGMGSVQGVLYKNDGTLEGGADPRRDGKALGL >gi|296153962|gb|ADVK01000045.1| GENE 74 66790 - 68178 1249 462 aa, chain + ## HITS:1 COG:FN0942 KEGG:ns NR:ns ## COG: FN0942 COG0642 # Protein_GI_number: 19704277 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Fusobacterium nucleatum # 1 462 1 462 462 728 99.0 0 MLIKRITYKIHKIELFLGIFMVFISIILPNFILYSNLAIYEYIETSIDLWDKEQLLYAVF ITVFQNISRLFPIFFSVFLIADSVEIFVNKKSNFIFKIIFSLIVIQILYFIVYKIHFDMS YYFGKVAILQMLYLILHLHYQFQQITLLKRSFILFLVFVGIQWLDITRYFSILDYKTTGE LLFDIKNIAFLMKAEHMLDLIGILFFILFFTFAILLSIIFFNQEKRQKMYIKETEVAKTL SDLKLKEIENRYFKEIQYLVHDLKTPLFSIGTLIEILDMQEESEQKKIYYKKIEKSLERC NIMVSEILRDKNKNSISTEKVFNFILSYLSGHECIKYINYQNYCKERKIKVNKITFSRAI TNLIINSYEAFLGKNGKIDLIIKDYKKVILIKIEDNGKGMTDEEIEKAFEIGYSTKKSSG VGLNFIKTVMDEHKCELKILNKRNSGLGAYIVMKGENIENEK >gi|296153962|gb|ADVK01000045.1| GENE 75 68168 - 68782 663 204 aa, chain + ## HITS:1 COG:FN0943 KEGG:ns NR:ns ## COG: FN0943 COG0745 # Protein_GI_number: 19704278 # Func_class: 
T Signal transduction mechanisms; K Transcription # Function: Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain # Organism: Fusobacterium nucleatum # 1 204 1 204 204 363 99.0 1e-100 MKNKILIIDDSKEILFAISEFFRIKNWDVFTALSMEEALKIVSTKKLDIIIIDYHMPYIN GVLGVKLIRQIDETVPIIALTVEGLENIAKDFFEAGANDFSIKPIKVLDLYSRVNVHLQK SRNNESNNDKEYHKGISEATISIIEEKMKTYKEYIMIEEISQITGLSNQTVNKYMNHLVK LGYVDLKVVYGKIGRPRNEYLWIK >gi|296153962|gb|ADVK01000045.1| GENE 76 68806 - 70173 1543 455 aa, chain - ## HITS:1 COG:FN0944 KEGG:ns NR:ns ## COG: FN0944 COG0534 # Protein_GI_number: 19704279 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 455 1 455 455 766 99.0 0 MGEERKELNPLGYKLIGKLLKSLAIPAIIANLVNALYNVVDQIFIGQRIGYLGNAATNIA FPITTMCLAIGLTLGIGGASNFSLELGKGYPEKSKHTAGTAASTLIIIGIMLCIVVRIFL EPLMLDFGATDKILEYSMEYTGITSFGIPFLLFSIGVNPLVRADGNAKYSMIAIVTGAIL NTILDPLFMFVYNWGIAGAAWATVISQIVSALLLLIYFPRFKSVKFSLNDFIPQLHYLKR IISLGFASFIYQFSNMIVLVTTNNLLKIYGKNSIYGSDIPIAVFGIVMKINVIFIAIVLG LVQGAQPIFGFNYGAKNYHRVRETMRLLLKVTFSIATILFIIFQVFPKQIISLFGEGDKL YFEFATKYMRIFLAFISLNSIQISIATFFPSIGKAIKGAIVSLTKQLIVLFPLLLTLPRF FGVEGVIYATPLTDLIAFTVAIIFLTNELKHMPKK >gi|296153962|gb|ADVK01000045.1| GENE 77 70343 - 70987 231 214 aa, chain - ## HITS:1 COG:FN0945 KEGG:ns NR:ns ## COG: FN0945 COG0671 # Protein_GI_number: 19704280 # Func_class: I Lipid transport and metabolism # Function: Membrane-associated phospholipid phosphatase # Organism: Fusobacterium nucleatum # 16 214 1 199 199 285 98.0 6e-77 MLKLIKLIVGGENFKVDKLLKLKIKYTIFISIFFVIFYKGAEFYTYTVNNVPSYFMEWER NIPFLPIFMLPYMTSALFFLVTIFLEKNESSLKLLMKRAIFLTVVSTFIFVIFPMKFYFP KPEIDNQIFKFPFYLLGKLDSSFNQCPSLHVSFAFLSVVVYYREIKLKFLKSFLCIWGFL LAVSILFVYQHHFIDFVGGTLMFLITCIIFPRKF >gi|296153962|gb|ADVK01000045.1| GENE 78 71041 - 71691 651 216 aa, chain + ## HITS:1 COG:FN0946 KEGG:ns NR:ns ## COG: FN0946 COG1451 # Protein_GI_number: 19704281 # Func_class: R General function prediction only # Function: 
Predicted metal-dependent hydrolase # Organism: Fusobacterium nucleatum # 1 216 14 229 229 345 99.0 3e-95 MEYTVTKKKIKNFIIRIYPDLRIAVSVPLLASNKDIENFVQSKKEWIETTLEKIKIVKEN KNNLKENVIKILGKEIEKKVIKSDLERIRLTDTSIYIYSKDTGNAGIEKKLLEWKIEKLK AILDEYLANYTKLLNRNINYYQIKKLSSAWGIYHKKENYISFNFDLIEKDIECIEYVVLH ELCHIFYMNHQKDFWSLVEKYMSDYKIRRKKLKNLS >gi|296153962|gb|ADVK01000045.1| GENE 79 71825 - 72637 1116 270 aa, chain + ## HITS:1 COG:FN0947 KEGG:ns NR:ns ## COG: FN0947 COG5266 # Protein_GI_number: 19704282 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Co2+ transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 270 1 270 270 525 100.0 1e-149 MLSKKLLIGALVATMSVSSFAHFQMIYTADSDISGKSSVPFELIFTHPSDGVKAHSMDIG KDEKGTINPVVEFFSVHNGEKTDLKAGLKASKFGPTSKQVTSYKFNLDKSSGLKGGGDWG LVFVPAPYYEASEEIYIQQITKVLVNKDNLATDWNKKLANGYPEIIPLSNPITWKGEIFR GQVVDKAGKPVANAEIEIEYLNSNIKNSKFVGELQKEKTATVIYADANGYFSFIPVHKGY WGFAALGAGGEMKHNGKELSQDAVLWIEAK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:54:49 2011 Seq name: gi|296153960|gb|ADVK01000046.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00056, whole genome shotgun sequence Length of sequence - 6033 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 2 - 6032 8000 ## FN2047 hypothetical protein Predicted protein(s) >gi|296153960|gb|ADVK01000046.1| GENE 1 2 - 6032 8000 2010 aa, chain - ## HITS:1 COG:no KEGG:FN2047 NR:ns ## KEGG: FN2047 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 671 2010 1 1340 1630 2196 99.0 0 RGDKAEKYPFEGIFTRSNDLFLRNISPDSDVYEKYTSSISETSANSATTSTIKKRGGSLG YGIASTLPTQEPIVQIELGASVRPKKITKSPITVTPPSITVNAVTPLSTPTDPSAPTPPK IDIQKFDPVAPDPITVSLPTPPTFNIKLGSYRNYMTQDTLGDINGGRHTGAGKSINTSGN TSIGDSELGSPTIIYAWQNGGGVGNFDSALLKAYFDYTRMSGPGGGTLTVTGNITIDSVN TLNDAEKAAEGDLANKTGRYWNAQPFLVGGSRVATLDNANGGATILNKATVNMVGPLVVG YEIQNDGGGAYAGTKKREVINAKGGKLTDDAEKDMLKIGGLKKGKIGGGYDVKSDSIELI RPENLGNDKITVTRTRDIVDSKGNVLVKGGYTGYKIGMILTHEYDDPNPGINYYRLINDG EISFMGRNSIGIQVYAPPANNPNTVIHVINGESGDGAKTITLGGIQSYGLKLSSRILKDK AGQKSVFENRGTINISGGDGNEESLSSGIAVLEDAGMTGNKAIRAHKDLVINKGKINVSG GQGNSGMVLKVKANDDITNNDDGTITVTGNKNIGMRVDLGTTKTDDSGTFTPKAINKGKI TINNGLRNIGMVANNSDGNHKAIAENNSGASITFKNNSTKAIGMFSQDGGEIVNVGDIKG EGNSLKETIGMAIQPALSGSGHNNTASNGINKGDIDLSGEKVTGVYNQGKFINEKNITTS GGGSISLYAKGQSSKTEIKSGTITAKNKALGLFADGKATIELGGAGDNALSLKADGLGTL LFYNYTKSGSTYTADGKFKLVNNVDGELTNGATAFYFRDTTPGKTAGTTADKLNAMFGGS TSGKILKLKLDENSTLFVLDNTSPNTTPVKLSSVALDKINEYLGQYVKVSPNSSKNFKAY KATKATLSIDDNVDLDNHTGVQIDKYYRVDFINSSVTVAAGKKMYGTDAGKLKQAIAQAN FDGGNVGDVKVTNNGTIDYSKKAATAVVVDFGQATNNGLIKMDAANGDKQNSIGLFGASS SILTNSSTGEIQLGTKGVGIWGANKIDSSLASWSKNINIVNAGKITGLAGKEGIFGIYAN NSEAGAISTITHSGTIDLSQVAKSIGILMTKGTLTSSGNVSVQDGSVGISAKDSNVTING GTHTIGKKSAGLKLTGNTSRLMANSGNISITGIGSAAYLFDNVNLTSGTNFKDNLALTAT NGYTYINISNNSILNYKNTRTINNDESIFVNAKNSTINLLSGTTVTSTKKKITGVYSENS AISNAGTLTLTGDGSSALYGKAGSDLINSGKITIGKNGSGIYSINSTGKNNGEITIGEGS VGMRAENAIIENETTGKITSTGTSVLGMSQSGGTQNIINKGAITLTGDKSIALHSEGINS PNHKVINTGNITVGDSSNILSPSIGVYSANATNSTVENSGKVVAGTKSTAIYAGNVDLTG TSETTAGDGGIGVYSKEGTVKISENSKINVGATLGKGQEGVGVYLAGNNQTLNSDTDKLS IKKGSFGYVMTGQGNTVRTGKVGTTGIVSLSNDSVFMYSGDRTGTIRNYNNLKSTGNENY GIYALGSVENRGNIDFSQGVGNVGAYSYVEGATTIPNAIKNYGTIKVSKSDINDPDNRKY 
GIGMAAGYSEETPKGSGNFVTRGLGHIENHGTIKVTDPDSIGMYATGSGSKILNAGRIEL SGPKRNIGIFAENGAEVINTGTITTVGSGNVGQIGIAIRKGAILDNRGTINIDASKGYGL LIAGGIVKNYGEANITVGSGATKIKEVSAADTSKEIEDLRGNKVKIHSPAGAANGVITEN GVVRKPKIVHVQAIPNRKPNDIPTSSVGMYMDTSGINYTRPINNIGALRGLTQSDIIIGV EATKYTTVKTIQLGQDIIEPYNEMIRKSGIEKFNIYSGSLTWMASITQLPDYTIRNAYLR KIPYTVWAGKMATPIDKNDTYNFTDGLEQRYGVEGIGTRENQVFQKLNSIGNNEEILLYQ AFDEMMGHQYANVQQRVQTTGVILDKELDY Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:55:16 2011 Seq name: gi|296153957|gb|ADVK01000047.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00058, whole genome shotgun sequence Length of sequence - 1950 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 1 - 50 3.0 1 1 Op 1 1/0.000 - CDS 58 - 420 387 ## COG1694 Predicted pyrophosphatase 2 1 Op 2 . - CDS 430 - 1818 1808 ## COG0006 Xaa-Pro aminopeptidase - Prom 1884 - 1943 9.6 Predicted protein(s) >gi|296153957|gb|ADVK01000047.1| GENE 1 58 - 420 387 120 aa, chain - ## HITS:1 COG:FN1948 KEGG:ns NR:ns ## COG: FN1948 COG1694 # Protein_GI_number: 19705250 # Func_class: R General function prediction only # Function: Predicted pyrophosphatase # Organism: Fusobacterium nucleatum # 1 119 1 119 119 176 100.0 1e-44 MESAQQLLLKKLSNESSINEIQSYIKEIMKMRGFNKEKSSDKILLLVEEVGELAKAIRKN ERKLGIDKTKEYNYSSIESEIADVFIVLLSICDILNIDLFKAFLDKEEENIKRTWSINKN >gi|296153957|gb|ADVK01000047.1| GENE 2 430 - 1818 1808 462 aa, chain - ## HITS:1 COG:FN1949 KEGG:ns NR:ns ## COG: FN1949 COG0006 # Protein_GI_number: 19705251 # Func_class: E Amino acid transport and metabolism # Function: Xaa-Pro aminopeptidase # Organism: Fusobacterium nucleatum # 1 462 1 462 462 874 99.0 0 MLSKEVYVNRRKKLKENFKDGLILIMGNNFSPLDCEDNTYPFIQDATFRYYFGIEHNGLI GIIDIDENEEIIFGNDYTMSDIIWMGKQKFLKELAVEVGIEKFLEKEELKKYLENRKNIR FTNQYRADNIMYLSSILNINPFEFDKNISFDLVKAIIKQRNIKDKIEIEEIEKAVNITKE 
MHLSAMRNIKAGMKEYELVAEVEKQPRKYNAYYSFQTILSKNGQILHNHKHLNTLKDGEL VLLDCGALSEGGYCGDMTTTFPVSGKFTERQKTIHNIVRDMFDRAKDLARVGITYKEVHL EACKILAANMKKLGLMKGEVEDIVSSGAHALFMPHGLGHMMGMTVHDMENFGEINVGYDE GEKKSTQFGLSSLRLAKKLEIGNVFTIEPGIYFIPELFEKWKNEKLHEEFLNYDEIEKYI DFGGIRMERDILIQEDGISRILGDKFPRTADEIEEYMKEYRK Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:55:31 2011 Seq name: gi|296153914|gb|ADVK01000048.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00060, whole genome shotgun sequence Length of sequence - 51663 bp Number of predicted genes - 42, with homology - 42 Number of transcription units - 13, operones - 9 average op.length - 4.2 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 150 - 344 229 ## FN1516 hypothetical protein 2 1 Op 2 1/0.750 - CDS 362 - 2941 4006 ## COG0495 Leucyl-tRNA synthetase 3 1 Op 3 1/0.750 - CDS 2949 - 3563 629 ## COG1595 DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog 4 1 Op 4 1/0.750 - CDS 3574 - 4278 348 ## PROTEIN SUPPORTED gi|163764761|ref|ZP_02171815.1| ribosomal protein S11 5 1 Op 5 . - CDS 4288 - 5559 1982 ## COG0766 UDP-N-acetylglucosamine enolpyruvyl transferase - Prom 5588 - 5647 13.1 6 2 Op 1 4/0.000 - CDS 5737 - 7680 2152 ## COG1629 Outer membrane receptor proteins, mostly Fe transport 7 2 Op 2 . - CDS 7711 - 8568 1122 ## COG0614 ABC-type Fe3+-hydroxamate transport system, periplasmic component - Prom 8702 - 8761 12.8 + Prom 8789 - 8848 11.6 8 3 Op 1 49/0.000 + CDS 8992 - 9930 1041 ## COG0601 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 9 3 Op 2 5/0.000 + CDS 9931 - 10761 1356 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 10 3 Op 3 5/0.000 + CDS 10811 - 12391 2394 ## COG0747 ABC-type dipeptide transport system, periplasmic component 11 3 Op 4 44/0.000 + CDS 12403 - 13188 463 ## PROTEIN SUPPORTED gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 12 3 Op 5 . 
+ CDS 13181 - 14155 1455 ## COG4608 ABC-type oligopeptide transport system, ATPase component - Term 14228 - 14271 8.0 13 4 Op 1 . - CDS 14304 - 24689 14255 ## FN1526 hypothetical protein 14 4 Op 2 . - CDS 24702 - 25091 573 ## FN1527 hypothetical protein 15 4 Op 3 . - CDS 25107 - 25499 631 ## FN1528 hypothetical protein 16 4 Op 4 . - CDS 25512 - 25883 623 ## FN1529 hypothetical protein - Prom 25907 - 25966 11.6 - Term 26116 - 26160 5.1 17 5 Tu 1 . - CDS 26167 - 26730 507 ## COG2963 Transposase and inactivated derivatives - Prom 26771 - 26830 5.7 18 6 Op 1 23/0.000 - CDS 26832 - 27539 1017 ## COG1346 Putative effector of murein hydrolase 19 6 Op 2 1/0.750 - CDS 27532 - 27915 399 ## COG1380 Putative effector of murein hydrolase LrgA 20 6 Op 3 29/0.000 - CDS 27946 - 28917 1622 ## COG2025 Electron transfer flavoprotein, alpha subunit 21 6 Op 4 2/0.000 - CDS 28929 - 29708 1202 ## COG2086 Electron transfer flavoprotein, beta subunit 22 6 Op 5 1/0.750 - CDS 29720 - 30856 1831 ## COG1960 Acyl-CoA dehydrogenases 23 6 Op 6 1/0.750 - CDS 30887 - 32314 2032 ## COG0277 FAD/FMN-containing dehydrogenases - Prom 32478 - 32537 13.8 24 7 Op 1 4/0.000 - CDS 32542 - 33708 1266 ## COG0003 Oxyanion-translocating ATPase 25 7 Op 2 . - CDS 33705 - 34895 980 ## COG0003 Oxyanion-translocating ATPase - Prom 35032 - 35091 10.9 + Prom 34985 - 35044 17.7 26 8 Op 1 13/0.000 + CDS 35108 - 35659 783 ## COG1556 Uncharacterized conserved protein 27 8 Op 2 1/0.750 + CDS 35702 - 37861 2829 ## COG1139 Uncharacterized conserved protein containing a ferredoxin-like domain 28 8 Op 3 2/0.000 + CDS 37862 - 38842 756 ## COG0142 Geranylgeranyl pyrophosphate synthase 29 8 Op 4 1/0.750 + CDS 38839 - 39759 877 ## COG1575 1,4-dihydroxy-2-naphthoate octaprenyltransferase 30 8 Op 5 1/0.750 + CDS 39761 - 40456 282 ## PROTEIN SUPPORTED gi|163754278|ref|ZP_02161401.1| 30S ribosomal protein S15 31 8 Op 6 12/0.000 + CDS 40482 - 41777 2168 ## COG0644 Dehydrogenases (flavoproteins) 32 8 Op 7 . 
+ CDS 41780 - 42064 345 ## COG2440 Ferredoxin-like protein + Term 42073 - 42124 2.1 - Term 42061 - 42110 5.5 33 9 Tu 1 . - CDS 42126 - 44192 2684 ## COG0480 Translation elongation factors (GTPases) - Prom 44239 - 44298 13.0 - Term 44332 - 44381 7.1 34 10 Tu 1 . - CDS 44398 - 45867 2035 ## COG1263 Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific - Prom 45984 - 46043 13.3 + Prom 45877 - 45936 14.4 35 11 Op 1 26/0.000 + CDS 46101 - 46517 562 ## COG1585 Membrane protein implicated in regulation of membrane protease activity 36 11 Op 2 . + CDS 46536 - 47420 1421 ## COG0330 Membrane protease subunits, stomatin/prohibitin homologs - Term 47406 - 47446 -0.5 37 12 Op 1 . - CDS 47484 - 48227 946 ## FN1550 hypothetical protein 38 12 Op 2 . - CDS 48251 - 48403 214 ## FN1551 hypothetical protein 39 12 Op 3 . - CDS 48416 - 48784 423 ## FN1551 hypothetical protein 40 12 Op 4 . - CDS 48798 - 49742 1039 ## FN1552 hypothetical protein 41 12 Op 5 . - CDS 49747 - 51135 1356 ## COG1106 Predicted ATPases - Prom 51156 - 51215 14.3 - Term 51207 - 51270 18.4 42 13 Tu 1 . 
- CDS 51316 - 51663 562 ## FN1554 hypothetical protein Predicted protein(s) >gi|296153914|gb|ADVK01000048.1| GENE 1 150 - 344 229 64 aa, chain - ## HITS:1 COG:no KEGG:FN1516 NR:ns ## KEGG: FN1516 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 64 1 64 64 93 98.0 3e-18 MNEKLKAFFKEILIILCLVVGVNLFAFIAIKFGFLNSEYSMAGCTFIGVGAYLVYFFTKL KGKK >gi|296153914|gb|ADVK01000048.1| GENE 2 362 - 2941 4006 859 aa, chain - ## HITS:1 COG:FN1517 KEGG:ns NR:ns ## COG: FN1517 COG0495 # Protein_GI_number: 19704849 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Leucyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 859 1 859 859 1722 99.0 0 MRDYEFKEIEKKWQERWSKDNIFKTENQVEGKENYYVLSMLPYPSGKLHVGHARNYTIGD VISRYKRMKGYNVLQPMGWDSFGLPAENAAIQNGTHPAIWTKSNIENMRRQLKLMGFSYD WEREIASYTPEYYKWNQWLFKRMYEKGLIYKKKSLVNWCPDCQTVLANEQVEDGMCWRHS KTHVIQKELEQWFFKITDYADELLEGHEEIKDGWPEKVLTMQKNWIGKSFGTELKLKVVE TGEDLPIFTTRIDTIYGVSYAVVAPEHPIVEKILKDNPSIKDKVTEMKNTDIIERGAEGR EKNGIDSGWHIENPVNKEIVPLWIADYVLMNYGTGAVMGVPAHDERDFVFAGKYNLPVKQ VITSKKSDEKVQLPYIEEGVMINSGEFNGLSSKDALVKIAEYVEEKGYGKRTYKYRLKDW GISRQRYWGTPIPALYCEKCGEVLEKDENLPVLLPDDIEFSGNGNPLETSNKFKEATCPC CGGKARRDTDTMDTFVDSSWYFLRYCDPKNLNLPFSKEIVDKWTPVDQYIGGVEHAVMHL LYARFFHKVLRDLGLLSSNEPFKRLLTQGMVLGPSYYSEKENKYLLQKDAIIKGEKAYSQ SGEELQVKVEKMSKSKNNGVDPEEILDKYGADTTRLFIMFAAPPEKELEWNENGLAGAYR FLTRVWRLVFENSELVKNANDKIDYNKLSKEDKTLLIKLNQTIKKVTDAIENNYHFNTAI AANMELINEVQTYVSSSMNSEQAAKILGYTLKKIIIMLSPFVPHFCDEIWEELGEKGYLF NEKWPEYDEKMLSSDETTIAVQVNGKVRGSFEIAKDSEQALVEKTALELPNVAKHLESMN VVKIIVIPNRIVNIVVKPQ >gi|296153914|gb|ADVK01000048.1| GENE 3 2949 - 3563 629 204 aa, chain - ## HITS:1 COG:FN1518 KEGG:ns NR:ns ## COG: FN1518 COG1595 # Protein_GI_number: 19704850 # Func_class: K Transcription # Function: DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog # Organism: Fusobacterium nucleatum # 1 204 1 204 204 324 99.0 8e-89 MEDINAILKKAQSGDNEAIDLILKEYSKLLSFNAQKYYLVGAEKEDLVQEGILGLLKAIK 
FYDETKSSFSSFAFLCIRREMISAIRKANTQKHMVLNEALKTNAILEDSAYFDDEGHNIN NYKSPESNPEEVYLLKEEIEEFKKFSENNFSKFEKEVLTYLIRGYSYREIATILSKNLKS IDNTIQRIRKKSEEWIKEEENIKR >gi|296153914|gb|ADVK01000048.1| GENE 4 3574 - 4278 348 234 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163764761|ref|ZP_02171815.1| ribosomal protein S11 [Bacillus selenitireducens MLS10] # 4 234 9 245 255 138 33 6e-32 MERIIGINPVTEALLNKEKNIEKLELYNGLKGETVQKLKDLASKRNIKIFYTGKKIENSQ GVAVYISNYDYYKDFDEAYEELTGKDKSLVLVLDEIQDPRNFGAIIRSAEVFKVDLIIIP ERNSVRINETVVKTSTGAIEYVDISKVTNLSDTINKLKKLDYWVYGAAGEANISYNEEDY SNKVVLVLGNEGSGIRKKVREHCDKLIKIPMYGQINSLNVSVASGILLSRIVNR >gi|296153914|gb|ADVK01000048.1| GENE 5 4288 - 5559 1982 423 aa, chain - ## HITS:1 COG:FN1520 KEGG:ns NR:ns ## COG: FN1520 COG0766 # Protein_GI_number: 19704852 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetylglucosamine enolpyruvyl transferase # Organism: Fusobacterium nucleatum # 1 423 1 423 423 789 99.0 0 MVEAFKIIGGKKIAGELKVDGSKNSTLPIMIATLVEKGTYVLRNVPDLRDIRTLVALLES LGLEVEKLDANSYKIINNGLSGAEASYDLVKKMRASFLVMGGMLAIEKRGKVALPGGCAI GARPVDLHLKGFEALGAKINIEHGYVEATTENGLIGGNIVLDFPSVGATENIIMAAVKAK GKTVLENAAKEPEIEDLCNFLIKMGAKISGVGTSRLEIDGVDKLTACEYSIIPDRIVAGT YIIASILFDGSIKVSGIVPEHLSSFLLKLEEMGTKFKIEGDKLEVLTKLSDLKPAKVTTM PHPGFPTDLQSPMMTLMCLVNGTSEIKETIFENRFMHVPELNRMGAKIEIDSSTAKITGV ENFSSAEVMASDLRAGASLILAALKANGESLVNRIYHVDRGYENFEEKFKALGANIERIK TEA >gi|296153914|gb|ADVK01000048.1| GENE 6 5737 - 7680 2152 647 aa, chain - ## HITS:1 COG:FN1971 KEGG:ns NR:ns ## COG: FN1971 COG1629 # Protein_GI_number: 19705267 # Func_class: P Inorganic ion transport and metabolism # Function: Outer membrane receptor proteins, mostly Fe transport # Organism: Fusobacterium nucleatum # 1 647 1 657 657 525 43.0 1e-148 MKKRLIILAILSISVSAFAMKEEAPVQKLNETTITTPERFGTKIRNVAKNIQIVTKKDME EKGAKNLFEALRGVPGVVLRRDGGGHIDLRGSGENDKRNMIFLIDGIPYSGLSIFDINSI SMEEIERIEIIQSGSAVLYGDGTIGGVVNLVTKPITTEKYSNSIGMEYGSWETAKLNVNV GTKLTDNLAVSASYSGEQTEEYRDRSIDFKDKKDRRESIWLKSKYNLKDGEIELKYNHLK 
NNDYVTGLLSEKEFKESPKKAGTTNALLKAKSDLLNLSFNKKLNNKFEVFLQGGYYTDET KYYEIGPGYKDYDKEGNVSYFIRPQIKYYYMKDSYIILGGDTKKETATNKLFPNSPKTIR KKESIYLLNSNKIGKIEITEGYRMEKVDLQGRKKSKNFKEDAIEIGLNYLYSDTGNLYFN YTKGFRVPTLGEMNSWVGDMKSHKNHTFELGLRDVYENTSINTSIFTLYSKDEIFYDSIV ANPTPKNPNRKGANRNFESEVRRIGGQVALEHSIGKLSLREKISYIVPKIMGGYYKGKDF PGVPKLTATLGLTYNFENGLKLNIDGYYQGKTYAGTDFLNKYGKHNSYTVVDANISYNFE NGLELYGGVKNLFDKTYATAFFPRATGELRYDPDNGRSFYTGFKYTF >gi|296153914|gb|ADVK01000048.1| GENE 7 7711 - 8568 1122 285 aa, chain - ## HITS:1 COG:FN0885 KEGG:ns NR:ns ## COG: FN0885 COG0614 # Protein_GI_number: 19704220 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+-hydroxamate transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 283 1 283 286 291 55.0 7e-79 MKKIFLLIFLIENLVFAKIPQRAVSISHFTTEILLLIGAENQMVGTAYLDTPILPELKDR FEKIPILSDKFPTKELFYSVRPDFVTGWEGLVKSRNLGPKKELEENGVQVYIFKSLNDER IEVLYSDILELGKIFKLEKNARELVEKIKKELSEIEKKVPKQKKKVLVCDPGDDQPFVLG GKGIGNYIIELAGAENVTAEINKAWGYSTWEKIIVSNPEYILIPDYEGRDYEEKVKYLKN ESPIKDLKAIKENKIIKIDLAGISPGVRISTEAKKIAEKLHGIKF >gi|296153914|gb|ADVK01000048.1| GENE 8 8992 - 9930 1041 312 aa, chain + ## HITS:1 COG:FN1521 KEGG:ns NR:ns ## COG: FN1521 COG0601 # Protein_GI_number: 19704853 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 312 1 312 312 536 99.0 1e-152 MVKNNFINRILQILVVLFGISFFTFSLTYLSPGDPAEIMLTECGNIPTPELVEQTRAELG LDKPFAEQYFRWASNVAHGEFGKSYSLRVPVVSKIKTAFMPTLKLSLLSLTFMIIISLPL GILAALKVNKWQDYFVRAISFTGLSIPSFWLGLIFLSVFGVMLRWVTVSGGKADFKSMIL PAFTLGFAMSAKYIRQVRHTVLEELNKDYVIGAKMRGIKESTILIKHVLPNALIPLITLL GLSLGSLLGGTAVIEIIYNFPGMGNLAIKAISFRDYPLVQAYVLLIALIYLVINLIVDFS YKLLDKRIEGAN >gi|296153914|gb|ADVK01000048.1| GENE 9 9931 - 10761 1356 276 aa, chain + ## HITS:1 COG:FN1522 KEGG:ns NR:ns ## COG: FN1522 COG1173 # Protein_GI_number: 19704854 # Func_class: E Amino acid transport 
and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 276 1 276 276 476 100.0 1e-134 MKATKFIKGHKQLIFFLVMAIIIVLIAIFAKQIAPKDPLNAVMDKPLHSPDKVNLLGTDI LGRDILSRIIYGTRYSLFMTLVLVGTVFTLGTILGLLAGYFGGIVDTLIMRLADMMVSFP GIILAIAIAGLLGPSMTNAIIAISAVTWPKYARLSRSMVLKIKKELYVEAARLTGSKDKD ILFKYILPNMVTLMLVTAISDIGALMLEISALSFLGFGAQPPIPEWGAMLNEGRTYLAKA PWLMLYPGMAIVIVVVVFNMLGDNIKDLIDIKEEDF >gi|296153914|gb|ADVK01000048.1| GENE 10 10811 - 12391 2394 526 aa, chain + ## HITS:1 COG:FN1523 KEGG:ns NR:ns ## COG: FN1523 COG0747 # Protein_GI_number: 19704855 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 526 1 526 526 1034 100.0 0 MKFFTKKSFAFLMAILMMFTLVACGGDKKEETPATGTTNANGELVVGVTSFADTLEPTEQ YFSWVITRYGVGENLVRFDENGELQALLAEEWKVSDDKLTWEFKIRDGVKFSNGNPLTAE AVKSSLERTFRKSKRADGFFKPTSIVADGQTLKISTEKPVAILPQCLADPLFLIIDTSDN VEEYTTNAPICTGPYVFKEFVPTEYAVVERNEDYWGGKAGLAKVTFKCINDQSTRALSLK TGEIGVAYNLKIENKADFEGQDDINIQELKSLRSTYAFMNQHGALGDLALRQALLRALDK KAYTENLLGGAATPGKAPIPPTLDFGFDKLVDDNAYNPESAKEILAKAGYKDVDGDGFVE KPDGSKLDLNFVIYTSREELKVYAQAAQANLKDVGIKVTLKTVSYETLLDMRDSGNFDLL IWNVLAANTGDPEKYLYENWDSRSASNQAGYKNEKVDELLDKLNVEFDPEKRKDLAIEIQ QLIMNDAATVFFGYETTFLYSNKKVQNVKMFPMDYYWLTKDVTVSE >gi|296153914|gb|ADVK01000048.1| GENE 11 12403 - 13188 463 261 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|149915877|ref|ZP_01904401.1| 50S ribosomal protein L17 [Roseobacter sp. 
AzwK-3b] # 1 248 8 258 563 182 37 1e-92 MLEIKDLTIQYGEKNAVVENFSLTMQKGEIISIVGESGSGKSTVLRSIIGGLLGQGKITS GDIIFNGKSLLNLSNNEWRELRGTVISMISQDCGATLNPIRKIGSQYIEYINAHTNLNKT EAEKKALFMLEKVRLPEVKNIMNSYPYELSGGMKQRVGIAMALTFEPELVLADEPTSALD VTTQAQIVKQMMELRDEFHTGIIIVTHNMGVAAYMADKIIVMQNGVVVDSGTREEVINNP KSDYTKKLLKAIPEMDGERFV >gi|296153914|gb|ADVK01000048.1| GENE 12 13181 - 14155 1455 324 aa, chain + ## HITS:1 COG:FN1525 KEGG:ns NR:ns ## COG: FN1525 COG4608 # Protein_GI_number: 19704857 # Func_class: E Amino acid transport and metabolism # Function: ABC-type oligopeptide transport system, ATPase component # Organism: Fusobacterium nucleatum # 1 324 19 342 342 642 99.0 0 MSKNNELILEARNVTKQFKVSKNNTLTACDNINLSMYKGKTLGIVGESGCGKSTFLRMLM NLEKISSGEIFYKGRDISKFSKDEIWESRQHIQMVYQDPGASFNPRMKVVDILTEPLINY DRLKKEDKEKKAIELLEMVDLPADFIHKYPQNMSGGQKQRIGIARALSLEPEILVCDEAT SALDVSIQKNIIELLVKLQKERDLCIVFICHDIALVQSFAHEIAVMYLGNVLEVLPGERL KDSAYHPYTKALLSSLFSINMNFSEKIASIEGDVPSPISLPSGCVFQGRCKFVKDKCKGQ KPTLENINTKHGVACYFTKEINNL >gi|296153914|gb|ADVK01000048.1| GENE 13 14304 - 24689 14255 3461 aa, chain - ## HITS:1 COG:no KEGG:FN1526 NR:ns ## KEGG: FN1526 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1328 3461 1 2143 2143 3208 96.0 0 MKDYNKVESCLKSFLKNNKRLSYSMALLITFLINGGFSYADEAVQVPLRTEIKTRIEKEQ ENISQMLKEADESMKDIELKIKKLTQRGEFWVKPLEKSYQGFIFANWGNYSKNKNKTESN FNGPEYSASYGRNMGYGQFSNGKYYGEYGIVKNPLEFVDKIDFGANITPKAVIEKTIIEK TVIKKDITAPSVTPPTVEVGEITLTAPEEVAIDEMTPPNEPNPSVTTPSTVPALQGITVA AVSEVNVTPSTPEVAAAPTINAPGVQPPATPAGFTPRLITPPEAPEDIVITPPGEVQANL TGSGANPTTYYYWWNGNKGAISQISLTAGTFNIESGNTINVNGYQAKAFTGTTPQGTKPS DGDYVINQQFFHTLLNVAYSEYSNGVVINNNRAGFTLINLETEGQVSGNLDNRVGTYIDS AKRDKLRSYQMYSGIKGQDNTELLFINKGTVNLNADNTTYLFTTSHNDGNLRTNYLDNEG TISAKGKNSIIIKHSPDTNQAKAWIYSNNGIMKAEGKGSIVYGAAYSYLGSGRAAFVNDG TIEVSGEEAIGVILPRKTGNSELANGHLVWLKKPISLKGKKSMGFVAQNTNISNEKNLVK FNIDAEKAIGILQDVEGAGTAKTAGKIVIEGNSSGSMGIYAKQGILELIKPHADSGETES SIELKAGSNNIGIFAKGASTVNFNGITKITGGERQKIALATEGSTINLKNTVTAGDKITS 
NFVKDAVPLYATGTGSKINVVTPDSLKFYLGGNSTAAYAKDGGIINMNRTNLPADPSIYI KGENGKGIGLFAKDGGVINAQKHYIKVENGSVALSSVGANATNKSNIDFTGGKLEYTGNG YAVYSDGTGTVNLSDAELNLYGSSTAFDVDFNATTLPTILNANTRIHANSDDVIAFNLKN ASGLTTVGGIETSIKSKIETKLGLGSGSLNNLFTGSTSTKYKVAAVDGGEITVGNLDKSG TKDDTDQAKRDGYQYFNRFLAQRLKATATGSTIKAVLNSNFANSNFNGQVVGFEMNSSKN ATSVNETAINLVDSKIIADRTDAGTGAIGAFINYGEVNIAATSKIEVEKENNVVNKQAVG VYAVNGSKVDNKGTIDVGGDQSVGILGMAYREGSSNNPIVNEFGGKSGEGTVNITNEKDI KMSGKDAIGIYAMNNNPDKTVTSHLVINKGTVEVGDSAEKTAVGIYAKGVDVKPESGKIK IGKKAVGIYAEDSNVGEVNKDLGTIDFNGDDGVGIYLKGSGSNLLGNKVTLTQSKDSKNK VGILADRGTSSIIKTEVAVGALNNVIAYYSKGNHEFNVQSNVTLNENSIGISGEGDLQYG DGTNPYIMKLGKGSTGLFGTKNIVLKDKTNIELNGENSVGAYASGASGVITSEGKIKFLK EKSIGLYGANGATVKDKTTMDFSNANAKNNIGVYLAGAKWDIDRALTFDSAHEKGNIYLF AQGGSTATLKNGFDITPLTAPTGNNRTIGMYLDTAVKGGATTADNTVDMSDGNAKISVTK KAIGIYAKNVDNSKNNIINKIKVLSDGQGTVGVFTDGNLKLSGNGGLIEAQNAGIGLYGN KGTVTVEGTHKVEVTSAGTGMYLTKGSHLSGGKLELENKTAGTSAAGIYYEGTNNEVDHN TDIEVTAGENLLAIYANGLKLNNNKEILIKKGKNNVAAYITGNSTFKNKGKIQLGHPTQN DFESGIGIYVVDGEAINESGKTIDIYDFENTASGGSLSVGMLAKAGAGKTAKVTNKGTIN ANGEVIGMVVEDNSEGLNDTGAEIVAKDKEDINAKAIGAYVKGANAKFENKGKISAENIA LALQGTGANKILNSGTLNLTKTGAIGVYAKDSVVDFNIAPTVAGANKTVALYASGTTKIK SQITSASGKAHIGVYAEGDAEFLSGSKVTVGNGDGNDYGIGVYTKSGYNKTVNTDIQLGG EKTIGFYLGATGGTGSTVTHTGTIDVGSGIGTYIPEHSKFIAQNTTFNVGDNGTAVYLKG GEVDLGKTGTANINFNGTNGRAIYQDGGTITTGTGLHIQGSGSFLTLKNANSSINSLVEV GASGIGINGIYDMAGKDYTLTLESPNGHIKLGGNKATGIAAVAKSTVGPNKVNVINKGTI ETTSGEKTTGIYGKGANIENATGAKINIGAKGVGIYTTNDNSLENTTLNNAGEINLTGDE ATGLVAVKAKTNQDFIVGKISGTKDKLVGAYFKDSQAVTKVKDFNISLGTNAKGLVFDGG KDFTVTSSSTNKVKIGNTTGNSRGIGIAALGVNGNISKTDVVVGKGSLGLYAKDKKLTFD LASGKLESSDANRSSILAYADGNTSEVALNGGGTLKVGANGIALGTKGGKVSANATTTVE VDGVKGLGAYVENGGSIDNNFDIKVKSAEGIGMYAKGGALTSVAKVSEIKGNKSIGYVFE NITSAITMPNSVQLTDINATGQVGVVAQGTGNGLTVAGVSVVGSGNTGVYSSTGKAVINN GTLTVGDSTGKSSIGIYSKGGAVTSTGSATIGKNSIAIYGKDTAATLNGNLTIGEKGIGL YVDNTATSKGDTAVNGNITVGANGAIGIQTTNSKVNLTGDLSVASGDSKGIFSMGAGNVE TTGNITVGSNSVGIYKNGSGEVKTAAGKTLTVADSAYGIFSKGAKLINNMNVTVGVDAIG 
AYVDGNDLTSTGTVTVGDKGVGLFVKGTGKTLTSTGNITVGSNNSVGLYAGDNANIAQSG NITVANNNGIGVYSKGNGNVSTIGAITVGKDSIGVYKDGKGTMNINASSPIQTMTIAEKG YGLYYKGNSRADSIINSNMNMTLGKEAVGIYAKNTTVNHVGDITVGETNIGSSGFTTPSD NKNSIGIFGDNSNINFKGNMLVDKPLSVGIYGSNGGSITVKSGSTITVKNGATGIMTGSK VESITLESGSTLNVDGKVDTSVYTNATKSNISFGIAAYSGAINNQGTINVTNGATGIYLA GTASLVNQGTITVDAISKQIGRPDTKASAELGGIKVTDKGEVTINNKVINGGTVNIKGDL NMAGMGLDVSTGKTIVDARSISGVAEVLPNFSKGNSEQKVTIKDVFRTGAVGAFSGDVKS KSVSWIAKISKEPGSSTTTKDITMVRIPYNSLISGERYKNLASGLEDIRSKIGKDSSSPI FKSLDNISSHRDFARAIANIRGDIYSNVQERMKTVENAFDKSYNELLSSYNKTRNVNKFG IIYTGGEHKDSTLGVSGYKYKSTGVLYLNDREAFTYGGKYGWSAGIVGSNFEFNGDTNKG SKERVVSGKLGLHYQAPLNKEDDNAKLKWLTRGEVTVNNHRTNRYSQVGKDTYQNKASFY STELSWKNIISYDYDINTNWMVKPYTGIDMSYGHIFNIKEKNEGLPLEVKGKDYFVITPN VGVETKYVLPLGVTHQVFAKADTEFSYDVAKLYHGVNQAKMKNASSGYYDLSKPERRRAR VAVGAELGLEKENAYGVTFRAEYQGYKKSQLNYGVRLNYKF >gi|296153914|gb|ADVK01000048.1| GENE 14 24702 - 25091 573 129 aa, chain - ## HITS:1 COG:no KEGG:FN1527 NR:ns ## KEGG: FN1527 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 129 1 129 129 139 99.0 5e-32 MKKILLLLSSLFLFACTNIDTGVDESKEAQISRLLKEADKKKEKTVEVEKKLVTDNGEEV IEEEATVQNKKSHKGMTRGEIMEYEMTRVSDEMNALQADVQQYQEKKAQLKAYQEKLQKL EELNNAGIK >gi|296153914|gb|ADVK01000048.1| GENE 15 25107 - 25499 631 130 aa, chain - ## HITS:1 COG:no KEGG:FN1528 NR:ns ## KEGG: FN1528 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 13 130 1 118 118 85 98.0 5e-16 MKIKEILLLSLSMTSLIACTNSANNITNNEDRDAMQILEEKREYYEELDKKKEKEAKGYQ VEMTAEEAETEKEKLANMTEDEKLMYKVDKAKEKIDNMLDIAKKARQEELDVEETKNQVE EALKNITEEK >gi|296153914|gb|ADVK01000048.1| GENE 16 25512 - 25883 623 123 aa, chain - ## HITS:1 COG:no KEGG:FN1529 NR:ns ## KEGG: FN1529 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 123 1 123 123 172 100.0 4e-42 MKKVILTLFVLLSIGIFANDEIISELKGLNAEYENLVKEEEARFQKEKELSERAAAQNVK LAELKASIEEKLLAAPEERKTKFFKDTFDGLVKDYSKYLSQINEKIAENTEIVSNFEKIQ KIR 
>gi|296153914|gb|ADVK01000048.1| GENE 17 26167 - 26730 507 187 aa, chain - ## HITS:1 COG:FN1887 KEGG:ns NR:ns ## COG: FN1887 COG2963 # Protein_GI_number: 19705192 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 104 1 104 169 179 100.0 2e-45 MSKLTRENKIEIYERRKNGETFSSLAKAFDVHKSNIEYLIALIKKHGFDILRKDKNRLYS KDFKLQIINRILVNHESINSVAIDIGFPAPSILHNWLSKFKENENYEDLIFHSDQGWQYQ HYSYQERLKEKKISQSMSRKGNSLDNGLMECFFGLLKSEMFYEQEKNYKTLEECIDTRKL DKFNLTY >gi|296153914|gb|ADVK01000048.1| GENE 18 26832 - 27539 1017 235 aa, chain - ## HITS:1 COG:FN1531 KEGG:ns NR:ns ## COG: FN1531 COG1346 # Protein_GI_number: 19704863 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Putative effector of murein hydrolase # Organism: Fusobacterium nucleatum # 1 235 10 244 244 382 100.0 1e-106 MSDIINKILFSPFFGIVLSLIAYEIGKYFFSKTKSIFCNPLLIGIILTIVFLMVLNIPFE AYDKGGSIIKIFISPVESVIIGVALYEQFQILKRNWFPILVSTVLGSTFSIIILYILGKV FGLPDDIFHATLPKSVTTAIALDIASKFGWQESLIPMMTVSTGIIGAVIAPLVTKFMKSP VAKGLAMGTASHAVGTAKAIEMGEVEGAMSGLALSLSAISTSFMIPLLLNTILKL >gi|296153914|gb|ADVK01000048.1| GENE 19 27532 - 27915 399 127 aa, chain - ## HITS:1 COG:FN1532 KEGG:ns NR:ns ## COG: FN1532 COG1380 # Protein_GI_number: 19704864 # Func_class: R General function prediction only # Function: Putative effector of murein hydrolase LrgA # Organism: Fusobacterium nucleatum # 1 127 1 127 127 192 99.0 1e-49 MGQWIIILALALVGQFVSDLISFPIPKTIIASIILFLLLEFKVVKADYFKEALAGCKKHL AFLFLPVGVGIMTQLNSGPAMVYVKVLLIMIISTILIMLVTGVLADLIIGIQEKILGGKD KKEGKNE >gi|296153914|gb|ADVK01000048.1| GENE 20 27946 - 28917 1622 323 aa, chain - ## HITS:1 COG:FN1533 KEGG:ns NR:ns ## COG: FN1533 COG2025 # Protein_GI_number: 19704865 # Func_class: C Energy production and conversion # Function: Electron transfer flavoprotein, alpha subunit # Organism: Fusobacterium nucleatum # 1 323 1 323 323 578 99.0 1e-165 MGKNIMVYIETTDNSPINVSLEALTLAKKISKENNEQVMAVLIGENLDEAAKKCFDYGAD 
EVLYVEENKKELEAVGNALIDVKEKYNPSVIFLGSTINGKDLANIVASKMKTPAFVDAVN VKYENGKYLMTLPMYSGNILKEVTFDGDKTLVIAVRSGACKKEVCENAKSGEVKKEKAAD KNLFTKIAEIVQEISETVNLEEAEVIVAGGRGMGSKENFELVKQLADVCGGVVGATRPAT EDEWIPRSHQVGQSGKIVAPKLYIACGISGATQHVSGIMGSNYIVAINKDEDAPIFEVAD VGIVGNVMDIIPIMIEEIKKIKS >gi|296153914|gb|ADVK01000048.1| GENE 21 28929 - 29708 1202 259 aa, chain - ## HITS:1 COG:FN1534 KEGG:ns NR:ns ## COG: FN1534 COG2086 # Protein_GI_number: 19704866 # Func_class: C Energy production and conversion # Function: Electron transfer flavoprotein, beta subunit # Organism: Fusobacterium nucleatum # 1 259 1 259 259 442 99.0 1e-124 MEILVCIKQVADDSVEVFMNEKTGKPALEGVEKVVNAFDTYALEMAVRLKETKGDITVTT LSLGGEDAKNGLKNCLAVGADEAFHIKDENYQEKDAVIIAQALFKGIQKIEEQRGKKFDI IFCGKETTDFAAGQVGIMLADELNYGVVTNLVDIDTEGEKVIAKKETETGYEKVEVASPC LVTVNKPNYEPRYPTIKSKMAARKKEIAEVSTEVANESAIKEVKLFSPPKRQAGVKIKTG TAEELVAQAIQKMLEAKVF >gi|296153914|gb|ADVK01000048.1| GENE 22 29720 - 30856 1831 378 aa, chain - ## HITS:1 COG:FN1535 KEGG:ns NR:ns ## COG: FN1535 COG1960 # Protein_GI_number: 19704867 # Func_class: I Lipid transport and metabolism # Function: Acyl-CoA dehydrogenases # Organism: Fusobacterium nucleatum # 1 378 1 378 378 732 100.0 0 MAYLISEEAQDLLKDVKKFCDNEVREQCKEYDKSGEWPKEIYDKAIEQGYQALEVPEKYG GPGLKRVDIAALIEEMAIADAGFATTISASGLAMKPVFISGSEEQKQRMCDLVLDGGFGA FCLTEPGAGSDASAGRTTAVKDGDEYVLNGRKCFITNGAVASFYCITAITDKEKGLKGIS MFFVEKGTKGLSTGNHEDKMGIRTSNTCDVVLEDCRIPASALVGKEGEGFAIAMKTLDQA RSWIGCIAVGIAQRGIQEAIAYGKERIQFGKPIIKNQALQFKIADMEIKTETARQMVAHA LTKMDLGLPYGKESAIAKCYAGDIAMEVSSEAIQMFGGYGYSREYPVEKLIRDSKIFQIF EGTNEIQRIVIANNVIGR >gi|296153914|gb|ADVK01000048.1| GENE 23 30887 - 32314 2032 475 aa, chain - ## HITS:1 COG:FN1536 KEGG:ns NR:ns ## COG: FN1536 COG0277 # Protein_GI_number: 19704868 # Func_class: C Energy production and conversion # Function: FAD/FMN-containing dehydrogenases # Organism: Fusobacterium nucleatum # 1 475 1 475 475 934 100.0 0 MGNHIYNKVSEELVEKFKKIVPGKVYTKDEINKDFFHDEMPIYGEGEPEVVIDVTTTEAI 
SEIMKLCYENNIPVIPRGAGTGLTGAAVAITGGVMLNMTKMNKILAYDYENFVVRVEPGV LLNELAEDALKQGLLYPPDPGEKFATLGGNVSTNAGGMRAVKYGTTRDYVRAMTVVLPTG EIIKLGATVSKTSTGYSLLNLMIGSEGTLGVITELTLKLIPAPKETISLIIPYENLDECI ATVPKFFMNHLQPQALEFMEREIVLASERYIGKSVFPKTLEGVDIGAYLLVTFDGDNMEA LEEITERASEVVLEAGALDVLVADTPAKKKDAWAARSSFLEAIEAETKLLDECDVVVPVN QIAPYLHYVNEVGKKYDFTVKSFGHAGDGNLHIYACSNDMEIGEFKRQVEEFLTDIYSKA SELGGLISGEHGIGYGKMEYLANFSGEVNMRLMRGIKEVFDPKMILNPNKVCYKA >gi|296153914|gb|ADVK01000048.1| GENE 24 32542 - 33708 1266 388 aa, chain - ## HITS:1 COG:FN1537 KEGG:ns NR:ns ## COG: FN1537 COG0003 # Protein_GI_number: 19704869 # Func_class: P Inorganic ion transport and metabolism # Function: Oxyanion-translocating ATPase # Organism: Fusobacterium nucleatum # 1 388 1 388 388 720 98.0 0 MRILIYTGKGGVGKTSIAAATALFLANSGEKVILMSTDQAHSLGDVLDKKLNGEICQVFQ NLDVVEIDTIEESQKVWRNLQDYLKQIISAKANNGIEIDEALLFPGLEEIFSLLKILDIY EANEYDVMVVDCAPTGQSLSMLTYSEKLNMLADTILPMVQSINSIFGSFISKKTSVPKPR DIIFEEFKNLVKRLTKLYEIFHKRDSTSIRIVTTPEQIVLEEARRNYTWLQLYNFNVDAV YMNKLYPKEAMNGYFEDWEKIQKDSIYLAEESFSEQKLFKLELQSEEIHGKEALEKIAKV LYKNINPAEIFCQTESFKIEEVNGTRILTIHLPFAKEENISVIKEKYDIIISLLNETRRF HLPDKLKTRKITDYSYKNGELKISMDYE >gi|296153914|gb|ADVK01000048.1| GENE 25 33705 - 34895 980 396 aa, chain - ## HITS:1 COG:FN1538 KEGG:ns NR:ns ## COG: FN1538 COG0003 # Protein_GI_number: 19704870 # Func_class: P Inorganic ion transport and metabolism # Function: Oxyanion-translocating ATPase # Organism: Fusobacterium nucleatum # 1 396 1 396 396 705 99.0 0 MTRIIIFTGKGGVGKSSVAAAHALSSAKSGKKTLLVSADTAHNLGDIFKIQIGSKITKIS ENLDALELDSDVVKREIFPEVKNTMLDLMGKSGIGITNLNENFSLPGFENLFSLLKIKEI YESNQYEHILVDCAPTGETLALLKLPELLAWYMEKFFPVGKKIVRILSPISKLAYKVILP SVKTMDTIELIHQKLLELQELLKNNEICSIRLICIPEKMIVEETKRNFMYLNLYKYQVDI VFINRIITDEIENPFMQKWKKIQSKYIKELEEVFFNIPVVKVPWYPKEIVGKSGLKLLCN TIENLPDLFSVKKVTQNEEYFPCEDGYLLKIQLPFVKEEELKIYHHEMDINIKINNVNRC VPLPNVLRKSHIVDTKLENGNLYVHFQLEKKKEEKS >gi|296153914|gb|ADVK01000048.1| GENE 26 35108 - 35659 783 183 aa, chain + ## HITS:1 COG:FN1539 KEGG:ns NR:ns ## COG: FN1539 
COG1556 # Protein_GI_number: 19704871 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 183 1 183 183 350 99.0 9e-97 MKSITDDLYESFKRNLESVNGNCMRTPKAELGKVITELFKEHNVDSISLFESPMLKEAGV VSSLQQNGITVHTDHIRLHAETDKGGLSEAQHGIAELGTIVQEQDNADGRMVATMSEYYI GVVKGSTIVPTYDDMFDILSGMKKIPNFVGFITGPSRTADIECVGTVGVHGPIDVSIVIV DDE >gi|296153914|gb|ADVK01000048.1| GENE 27 35702 - 37861 2829 719 aa, chain + ## HITS:1 COG:FN1540_1 KEGG:ns NR:ns ## COG: FN1540_1 COG1139 # Protein_GI_number: 19704872 # Func_class: C Energy production and conversion # Function: Uncharacterized conserved protein containing a ferredoxin-like domain # Organism: Fusobacterium nucleatum # 1 463 1 463 463 958 100.0 0 MASEDLKKEIRFALENATLGRTLGNFCKTYPARREKSYAGVDFEKTREKIAEVKSYAAEH IDEMIAEFTTNCEARGGHVYHAKSTEDAMEWIRQLVKEKGVKTIVKSKSMASEEIKMNHV LTDDGVLVQETDLGEFIIALEGNTPVHMVMPALHLNKEQVADLFTDYTKVKNNPIISEEV KTARRVMRDKFTHADMGVSGANVAVAETGTVFTMTNEGNGRMVGTLPPIHLYVFGIEKFV KSLSDARYIFKALPRNGTAQRITSYISMYTGACEVTTNKETDEKCKKDFYCVILDDPGRR EILAEPDFREIFNCIRCGACLDVCPAFALVGGHVYGSNVYTGGIGTMLTHFLVSEERAAE IQNICLQCGRCNEVCGGGLHISDMIMKLREKNMKDNPDALKKFALDAVSDRKLFHSMLRI ASVAQGMFTKGEPMIRHLPMFLSGMTKGRSFPAIAQVPLRDFFHTIKQDVKNPKGTVAIF AGCLLDFVYTDLARAVVANMNSIGYKVEMPLGQACCGCPATNMGDTENARKEAEININGM ESEKYDYVVSACPSCTHQLHLYPTFFEEGTEMHKRAKELAEKTSDFCKLFYELGGMSETG DGKPMKVTYHDSCHLKRSLKVSKEQRELLKNTKGVEFVEMNDCDNCCGFGGSYSLLYPEI SAPILEKKIQNIKESGADVVALDCPGCLMQIKGGLDARGIDDIKVKHTAEIIAEKRGLI >gi|296153914|gb|ADVK01000048.1| GENE 28 37862 - 38842 756 326 aa, chain + ## HITS:1 COG:FN1541 KEGG:ns NR:ns ## COG: FN1541 COG0142 # Protein_GI_number: 19704873 # Func_class: H Coenzyme transport and metabolism # Function: Geranylgeranyl pyrophosphate synthase # Organism: Fusobacterium nucleatum # 1 326 1 326 326 606 100.0 1e-173 MIEILLKEWIERVKYYIHLIADYSIKESQVGAVLDDVLNLSGKMLRSKLLLLCAFLGTNW KNNKEKICKLAAMVELTHLASLIHDDIVDDSPYRRGKVSIQGKYGKDAAVFAGDFLIARI FYYGAVENLNEAVSILSKTVEYMCIGEIEQDLCRYQENISEKKYFENIERKTAALFQAAS 
YIGAKEAGCSEEIIQKLELFVKNLGLMFQLKDDILDFTSNSKELGKETHKDFQNGIYTFP VIMALKNPKAREVLLPIMRKNKEEKLSLAEILQLQNYIIQFGGIEASYEKIKYLSNINRQ LIKELGQNSKITVYLEKILNDLEVEN >gi|296153914|gb|ADVK01000048.1| GENE 29 38839 - 39759 877 306 aa, chain + ## HITS:1 COG:FN1542 KEGG:ns NR:ns ## COG: FN1542 COG1575 # Protein_GI_number: 19704874 # Func_class: H Coenzyme transport and metabolism # Function: 1,4-dihydroxy-2-naphthoate octaprenyltransferase # Organism: Fusobacterium nucleatum # 1 306 1 306 306 522 98.0 1e-148 MKNSQLTTKLALQLTSPHTWIASIGPVLFGILFCKLERYPLSFWKSILLIFTCIFMQSSV NTFNDYMDYIKGNDSKKDYVEESDAVLIYNSINPKQALILGIIYLILGAVLGMIACIQSG FLPLGIGCIGGIVVLLYSGGPFPISYLPIGEIISGFVMGILIPLGVAAVSDGKLHNEIFL YALPLMIGIALIMMTNNGCDIEKDLRAKRHTLAVLLGRKNTKNLYHILIVIWLLVTSALS IYLLGMVGYITPVLILLRYRCFKFLIKSELLACKRIEQMKNIVKANIIVNIAYFIAILTK IIMEMM >gi|296153914|gb|ADVK01000048.1| GENE 30 39761 - 40456 282 231 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163754278|ref|ZP_02161401.1| 30S ribosomal protein S15 [Kordia algicida OT-1] # 13 231 1 221 221 113 32 3e-24 MNKEQKSKFVHEVFENIHTQYDRANNKISFGLQKNWKQELVDKIVEETPKNSDFLDVCCG TGDISIWLAKKREDLNIIGVDFSSAMLNEAVKKSKNLNIVWKEADALCLPFKDSSFSAVA ISFGLRNTVDYEKVLKEMIRVVKKDGFIYCLDSFVPDCLWIQPFYKMYFKYIMPLFGGGK KYYKEYVWLYESTQQFLKKKELILLYEKLGLKEIKHCSKMFGVCVMIQGKK >gi|296153914|gb|ADVK01000048.1| GENE 31 40482 - 41777 2168 431 aa, chain + ## HITS:1 COG:FN1544 KEGG:ns NR:ns ## COG: FN1544 COG0644 # Protein_GI_number: 19704876 # Func_class: C Energy production and conversion # Function: Dehydrogenases (flavoproteins) # Organism: Fusobacterium nucleatum # 1 431 1 431 431 807 100.0 0 MSEEKFDAIIVGGGLAGCSAAIVLANAGLAVLVVERGDFCGAKNMTGGRLYGHSLEKIIP NFAEEAPIERKITREKISLMSEDSSLDIGFGSKKLSSNNENASYTVLRSTFDRWLASKAE EAGAEIIPGILVDELIVEDGKVVGVSATGEELYADVVILADGVNSLLAQSLGMKKELEPH QVAVGAKEVIKLGEDVINQRFAVNNGEGVAWLSCGDPTLGGFGGGLLYTNKDSVSVGVVA TLSDIGHSDLSINQLLDRFKEHPAIAPYLEGGTSIEYSGHLVPEEGLHMVPELYRDGVLV TGDAAGFCINLGFTVRGMDFAIESGRLAAEAVIKAHEIGDFSAATLSNYKKLIDSSFIME 
DLNQYKGFPTLLGRREIFEGLPAMVDDIATKAFTVDGKQGQSLMMYVIHSVAEHTTAAKL VDFVTTVLEAF >gi|296153914|gb|ADVK01000048.1| GENE 32 41780 - 42064 345 94 aa, chain + ## HITS:1 COG:FN1545 KEGG:ns NR:ns ## COG: FN1545 COG2440 # Protein_GI_number: 19704877 # Func_class: C Energy production and conversion # Function: Ferredoxin-like protein # Organism: Fusobacterium nucleatum # 1 94 1 94 94 189 100.0 1e-48 MKKMTIEDKLGLNVFHVDEENSHIDVDKNFTDEVEIKKLLLACPAECYKYIDGKLSFSHL GCLECGTCRVLSHGKIVKSWKHPIGEVGVTFRQG >gi|296153914|gb|ADVK01000048.1| GENE 33 42126 - 44192 2684 688 aa, chain - ## HITS:1 COG:FN1546 KEGG:ns NR:ns ## COG: FN1546 COG0480 # Protein_GI_number: 19704878 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Translation elongation factors (GTPases) # Organism: Fusobacterium nucleatum # 1 688 3 690 690 1340 100.0 0 MKVFTTENIRNISLLGHRGSGKTTLVESILYVKDYIKRKGDVENGTTVSDFDKEEIRRIF SINTSLIPVEHNDVKLNFLDTPGYFDFVGEVVSALRVSASAVLVLDATAGVEVGTEKAWK LLEERKLPRIIFVNKMDKGYVNYPKLLTELKEKFGKKIAPFCIPIGEKDEFKGFVNVVDM VGRVFDGKECVDTPIPADIDVSEVRNLLFEAIAETDEVLMDKYFAGEEFTKEEIVKGLHK GVVNGDIVPVMVGSAQQNIGIHTLLNYLELYMPCPTELFSGQRIGEDPTTQQEKVVKISS ENPFSAIVFKTLVDPFIGKISFFKVNSGTIKKETEVFNPKKNKKERIAQILTMQGNKQIE LDELRAGDIGATTKLQFTQTGDTLCDKNFPVVFNKIRFPKPNIYSGVLPADKNDDEKLST AIQRVMEEDPTFVMSRNYETKQLLIGGQGEKHLYIILCKIKNKFGVHAELENVIVSYRET ILGKAEVQGKHKKQSGGAGQFGDVFIRFEHSDKDFEFIDEIKGGVVPRNYIPAVEKGLIE AKEKGVLAGYPVINFKATLYDGSYHPVDSNDLSFKLAAILAFKAGMEKAKPVLLEPFVRM EIRIPEEYMGDVMGDLNKRRGRVLGMDHTETGEQLLLAEVPEAEILKYSIDLRALTQGRG EFEYEFVRYEEVPENISKRIIEERNKDK >gi|296153914|gb|ADVK01000048.1| GENE 34 44398 - 45867 2035 489 aa, chain - ## HITS:1 COG:FN1547_1 KEGG:ns NR:ns ## COG: FN1547_1 COG1263 # Protein_GI_number: 19704879 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific # Organism: Fusobacterium nucleatum # 1 411 1 411 411 714 99.0 0 MFSYLQKIGKALMVPVAVLPAAAIMLGIGYWIDPSGWGANSQLAAFLIKAGAAILDNMPI 
LFAVGVAYGLSKDKDGAAALAGLVAFEIVTTLLSTGAVSQIMGIPQEEVHAAFGKINNQF IGILCGVISGELYNKFHKIELPRFLAFFSGKRFVPIITSVVMIVVSFILTYIWPVIYGAL VTFGTSIAKLGPVGAGIYGFFNRLLIPVGLHHAVNSVFWFNVAGINDIGRFWGSPAAAYA DLPEILQGTYHVGMYQAGFFPIMMFGLLGACVAFIQTSKPENRTKIFSIMVAAGFTSFLT GVTEPIEFAFMFVAPVLYLLHALLTGVSLFLAASFNWMAGFSFSGGFIDFFLSLKNPNAN NPFMLIVLGLVFFVIYYFVFLFVIKAFDLKTPGREESEEEKAEAVKVKTTNSELAESLVG LLGGADNIVEVDNCTTRLRLKVKDSANVKEGEIKKLVPGVLKPSKETVQVIIGPHVEFVA TELRRILGR >gi|296153914|gb|ADVK01000048.1| GENE 35 46101 - 46517 562 138 aa, chain + ## HITS:1 COG:FN1548 KEGG:ns NR:ns ## COG: FN1548 COG1585 # Protein_GI_number: 19704880 # Func_class: O Posttranslational modification, protein turnover, chaperones; U Intracellular trafficking, secretion, and vesicular transport # Function: Membrane protein implicated in regulation of membrane protease activity # Organism: Fusobacterium nucleatum # 1 138 1 138 138 212 97.0 2e-55 MGYVFWLILTIIFSIIEFMGPALVSVWFAFASAITIFVSLAFDNLKVEITFFTVISVLAI ILLRPFAKKVLSSKDNFDAEAIDTSIVIKKIVDVSKEEKVYDVSYKGSIWTALSNEIFEI GDIPMISGFKGNKIIIKK >gi|296153914|gb|ADVK01000048.1| GENE 36 46536 - 47420 1421 294 aa, chain + ## HITS:1 COG:FN1549 KEGG:ns NR:ns ## COG: FN1549 COG0330 # Protein_GI_number: 19704881 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Membrane protease subunits, stomatin/prohibitin homologs # Organism: Fusobacterium nucleatum # 1 294 1 294 294 498 99.0 1e-141 MFYIPFFILLIVLIAIVMFKAVKIVPESQVYIVEKLGKYYQSLSSGLNLINPFFDRVARI VSLKEQVVDFDPQAVITKDNATMQIDTVVYFQITDPKLYTYGVERPLSAIENLTATTLRN IIGDMTVDETLTSRDIINTKMRQELDDATDPWGIKVNRVELKSILPPNDIRVAMEKEMKA EREKRAKILEAQATRESAILVAEGEKQSAILRAEAEKEVKIKEAEGRAQAILEVQKAEAE AIKVLNEAKPTKEILALKSFATFEKVADGKSTKILIPSEIQNLAGFMQAIKEIK >gi|296153914|gb|ADVK01000048.1| GENE 37 47484 - 48227 946 247 aa, chain - ## HITS:1 COG:no KEGG:FN1550 NR:ns ## KEGG: FN1550 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 247 1 247 247 412 92.0 1e-114 
MASYDYKNGKSRIEEILNNKMEIIEKEKVPKDDNFTFDNGYYSWVSAIFVDIRESSKLFT DKDKEKVAKIIRSFTSEIIEILREDDNLREIGIRGDCVYAIYTTPKKNDIYEIAEKTFFI NTFMKMLNRLLEEKSYPTIKVGIGMSTAQELVVKAGRKNVSINSKVWIGDAVTKASNLSS LGNKDGVLPIVFSELSYTNFIKQLVENNKNKDPKSWFEEDSTYEYGVYYSADIIVENFCN WIENGMN >gi|296153914|gb|ADVK01000048.1| GENE 38 48251 - 48403 214 50 aa, chain - ## HITS:1 COG:no KEGG:FN1551 NR:ns ## KEGG: FN1551 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 50 128 177 177 86 100.0 4e-16 MNSENFLDDIIKQIYKNSMICSKKYERYNLGIKCSILGFILFIIVYGGIL >gi|296153914|gb|ADVK01000048.1| GENE 39 48416 - 48784 423 122 aa, chain - ## HITS:1 COG:no KEGG:FN1551 NR:ns ## KEGG: FN1551 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 122 1 122 177 162 99.0 3e-39 MESKNEENFFTVEIAQDTLDRIIGFINACDTKVSIILAIFGIIITIIFTGNISIIENLKN IINDSKNIKYLPIFLLLISIVIFLYGLIMLFMALYANIKASGEKSIIYFKDISLSEDYEA YK >gi|296153914|gb|ADVK01000048.1| GENE 40 48798 - 49742 1039 314 aa, chain - ## HITS:1 COG:no KEGG:FN1552 NR:ns ## KEGG: FN1552 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 314 1 314 314 477 99.0 1e-133 MREKRRFAERTRISKEDRTIKKYFLVFEGNRTEGIYFNAINELKDKIGINPLIEIISIER TYTEEGWSNPKKILEQLLKDLEEIENGKFSYKTLVDKIIEIIMEDEKFFSKISKETSSKE IIEDIKNEIESLDSIVENIEEDCEFFLNMIIKKFFLTIEEIPNILETVLKNIENKQITYS EDIDKMCLIVDRDKKSFKEEQYNYVKEECKRKNFKFYVTNPCFEFWLLLHFDEVHSINRE KLLENKRASSKVRYVESELKKYFPYNKNKYNAELLIEKIDLAIENEKRFCEDIEELKDKL GSNIGLLIKELKEK >gi|296153914|gb|ADVK01000048.1| GENE 41 49747 - 51135 1356 462 aa, chain - ## HITS:1 COG:FN1553 KEGG:ns NR:ns ## COG: FN1553 COG1106 # Protein_GI_number: 19704885 # Func_class: R General function prediction only # Function: Predicted ATPases # Organism: Fusobacterium nucleatum # 1 462 1 462 462 782 99.0 0 MLIRFNVKNFLSFAEREDGRTEEFSMLTGKVQKKKEHIYDDGKIKLLKFTAIYGANASGK SNLVKAIDFMKETIINGLPKGHTEKYCRVKSENKAKESYFEFEIKLGEKYYSYGFEIILN 
ESKFISEWLVELKSDNKEKIIFNRDIQKGKYKFGKFLEKEKELINKLRVYAEDIKDDSSV LFLSIMNKNKKNLYEDYKTISILKETYFWVKDNLDVNYPDRPISNYSYMANAENVEEVCK FISAFGTGIKNFKMVDVPVEKVINKLSKSIKDRLMSDLEKRRVEIKKEKDIEKIAFIMRS NKDFFILNIDNNQNLTCKTIHFSHEKSDIFFSLDEESDGTIRLLDLLEVLLSNKDKTYVI DELDRCLHPSLSYKFIKTFLQKAEKSNIQLIVTTHESRLMDFDLLRRDEIWFINKKSSGE SDIYSLEEYNERFDKKIDKAYLEGRYGGVPIFSTVFPIEKED >gi|296153914|gb|ADVK01000048.1| GENE 42 51316 - 51663 562 115 aa, chain - ## HITS:1 COG:no KEGG:FN1554 NR:ns ## KEGG: FN1554 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 115 1468 1582 1582 210 98.0 1e-53 VKLEVKQNQYFSVKPEIGAELGFKHYFGMKALKTTLGVAYENELGRVANGKNKARVVDTS ADWFNIRGEKEDRRGNVKFDLNVGIDNTRVGVTANVGYDTKGENLRGGLGLRVIF Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:57:07 2011 Seq name: gi|296153861|gb|ADVK01000049.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00061, whole genome shotgun sequence Length of sequence - 54468 bp Number of predicted genes - 54, with homology - 50 Number of transcription units - 26, operones - 14 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 1 - 988 1015 ## gi|296328969|ref|ZP_06871476.1| conserved hypothetical protein - Prom 1147 - 1206 11.1 + Prom 1032 - 1091 14.2 2 2 Tu 1 . + CDS 1121 - 1360 313 ## FN1796 hypothetical protein + Prom 1370 - 1429 7.7 3 3 Op 1 30/0.000 + CDS 1466 - 2596 1320 ## COG3842 ABC-type spermidine/putrescine transport systems, ATPase components 4 3 Op 2 36/0.000 + CDS 2583 - 3437 719 ## COG1176 ABC-type spermidine/putrescine transport system, permease component I 5 3 Op 3 1/0.875 + CDS 3430 - 4215 827 ## COG1177 ABC-type spermidine/putrescine transport system, permease component II 6 3 Op 4 . + CDS 4275 - 5099 1020 ## COG0652 Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family + Term 5111 - 5154 8.3 - Term 5146 - 5202 12.2 7 4 Tu 1 . 
- CDS 5231 - 6472 1585 ## COG0786 Na+/glutamate symporter - Prom 6585 - 6644 16.7 8 5 Op 1 1/0.875 + CDS 6859 - 7512 738 ## COG1309 Transcriptional regulator 9 5 Op 2 . + CDS 7570 - 9030 2159 ## COG2195 Di- and tripeptidases 10 5 Op 3 . + CDS 9068 - 10264 1749 ## FN1805 hypothetical protein + Term 10289 - 10355 2.0 + Prom 10280 - 10339 5.6 11 6 Tu 1 . + CDS 10369 - 11232 783 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily + Term 11414 - 11446 -0.2 - Term 11197 - 11262 18.0 12 7 Op 1 . - CDS 11280 - 12074 1407 ## COG5266 ABC-type Co2+ transport system, periplasmic component 13 7 Op 2 . - CDS 12123 - 12542 661 ## FN1808 hypothetical protein - Prom 12566 - 12625 10.3 14 8 Op 1 12/0.000 - CDS 12775 - 13653 1181 ## COG0803 ABC-type metal ion transport system, periplasmic component/surface adhesin 15 8 Op 2 42/0.000 - CDS 13655 - 14548 787 ## COG1108 ABC-type Mn2+/Zn2+ transport systems, permease components 16 8 Op 3 25/0.000 - CDS 14573 - 15262 233 ## PROTEIN SUPPORTED gi|225084369|ref|YP_002657150.1| ribosomal protein S16 17 8 Op 4 1/0.875 - CDS 15264 - 16172 1357 ## COG0803 ABC-type metal ion transport system, periplasmic component/surface adhesin 18 8 Op 5 . - CDS 16184 - 17092 1033 ## COG0803 ABC-type metal ion transport system, periplasmic component/surface adhesin 19 8 Op 6 . - CDS 17104 - 17592 711 ## FN1814 hypothetical protein - Prom 17711 - 17770 15.1 - Term 17950 - 17997 7.2 20 9 Op 1 2/0.125 - CDS 18038 - 18187 143 ## COG3666 Transposase and inactivated derivatives 21 9 Op 2 . - CDS 18191 - 18373 237 ## COG3666 Transposase and inactivated derivatives - Prom 18400 - 18459 6.9 22 10 Op 1 . - CDS 18551 - 19258 939 ## FN1816 hypothetical protein 23 10 Op 2 12/0.000 - CDS 19271 - 27697 10666 ## COG3210 Large exoproteins involved in heme utilization or adhesion 24 10 Op 3 1/0.875 - CDS 27712 - 29502 2027 ## COG2831 Hemolysin activation/secretion protein - Prom 29540 - 29599 14.0 - Term 29589 - 29651 6.1 25 11 Tu 1 . 
- CDS 29656 - 30507 1426 ## COG1136 ABC-type antimicrobial peptide transport system, ATPase component - Prom 30550 - 30609 7.7 + Prom 30507 - 30566 9.5 26 12 Tu 1 . + CDS 30615 - 30701 113 ## 27 13 Op 1 2/0.125 - CDS 30758 - 31453 886 ## COG0378 Ni2+-binding GTPase involved in regulation of expression and maturation of urease and hydrogenase 28 13 Op 2 . - CDS 31453 - 32631 1687 ## COG1840 ABC-type Fe3+ transport system, periplasmic component - Prom 32716 - 32775 10.6 + Prom 32702 - 32761 11.0 29 14 Tu 1 . + CDS 32786 - 33640 878 ## COG0731 Fe-S oxidoreductases + Term 33812 - 33880 26.1 + TRNA 33712 - 33787 81.3 # Thr TGT 0 0 + TRNA 33790 - 33864 66.8 # Glu TTC 0 0 + TRNA 33881 - 33965 70.5 # Tyr GTA 0 0 - Term 33869 - 33935 31.6 30 15 Tu 1 . - CDS 34009 - 35373 1329 ## COG1672 Predicted ATPase (AAA+ superfamily) - Prom 35401 - 35460 12.3 + Prom 35501 - 35560 12.6 31 16 Op 1 . + CDS 35603 - 35797 275 ## gi|296328999|ref|ZP_06871506.1| conserved hypothetical protein 32 16 Op 2 . + CDS 35824 - 35991 205 ## gi|296329000|ref|ZP_06871507.1| conserved hypothetical protein 33 16 Op 3 . + CDS 36058 - 36225 236 ## - Term 36228 - 36264 3.0 34 17 Op 1 1/0.875 - CDS 36276 - 36743 728 ## COG0716 Flavodoxins 35 17 Op 2 1/0.875 - CDS 36767 - 37945 1999 ## COG0484 DnaJ-class molecular chaperone with C-terminal Zn finger domain 36 17 Op 3 1/0.875 - CDS 37993 - 38505 733 ## COG0350 Methylated DNA-protein cysteine methyltransferase - Prom 38555 - 38614 6.3 37 18 Op 1 29/0.000 - CDS 38767 - 40590 2951 ## COG0443 Molecular chaperone 38 18 Op 2 21/0.000 - CDS 40629 - 41234 895 ## COG0576 Molecular chaperone GrpE (heat shock protein) 39 18 Op 3 . - CDS 41245 - 42267 1535 ## COG1420 Transcriptional regulator of heat shock gene - Prom 42410 - 42469 11.9 + Prom 42302 - 42361 11.8 40 19 Op 1 . + CDS 42450 - 42752 393 ## FN0111 hypothetical protein + Prom 42754 - 42813 9.5 41 19 Op 2 . + CDS 42849 - 44114 1748 ## COG0172 Seryl-tRNA synthetase 42 19 Op 3 . 
+ CDS 44098 - 44574 216 ## FN0109 hypothetical protein + Prom 44577 - 44636 3.8 43 20 Tu 1 . + CDS 44683 - 44778 77 ## - Term 44688 - 44745 -0.1 44 21 Op 1 . - CDS 44775 - 44861 56 ## 45 21 Op 2 1/0.875 - CDS 44854 - 45855 1246 ## COG1619 Uncharacterized proteins, homologs of microcin C7 resistance protein MccF 46 21 Op 3 . - CDS 45930 - 47378 2213 ## COG0591 Na+/proline symporter - Term 47489 - 47541 8.0 47 22 Op 1 . - CDS 47708 - 48130 626 ## FN0106 hypothetical protein - Prom 48194 - 48253 12.8 48 22 Op 2 . - CDS 48269 - 48883 645 ## COG4832 Uncharacterized conserved protein - Prom 48909 - 48968 11.5 + Prom 48969 - 49028 11.0 49 23 Tu 1 . + CDS 49051 - 49512 222 ## PROTEIN SUPPORTED gi|50365462|ref|YP_053887.1| acetyltransferase of 30S ribosomal protein L7 + Term 49522 - 49570 0.1 - Term 49513 - 49559 6.1 50 24 Op 1 24/0.000 - CDS 49565 - 50611 999 ## COG0208 Ribonucleotide reductase, beta subunit 51 24 Op 2 . - CDS 50592 - 52853 2750 ## COG0209 Ribonucleotide reductase, alpha subunit 52 24 Op 3 . - CDS 52859 - 53062 303 ## FN0101 glutaredoxin - Prom 53107 - 53166 6.6 + Prom 53247 - 53306 15.7 53 25 Tu 1 . + CDS 53352 - 54023 797 ## COG1018 Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 + Term 54029 - 54071 2.5 54 26 Tu 1 . - CDS 54057 - 54410 443 ## COG0221 Inorganic pyrophosphatase Predicted protein(s) >gi|296153861|gb|ADVK01000049.1| GENE 1 1 - 988 1015 329 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296328969|ref|ZP_06871476.1| ## NR: gi|296328969|ref|ZP_06871476.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 329 1 329 329 630 100.0 1e-179 MEYILVRTDAEDINSLQIKQVITLDEVQNAMNLMSVDSFYNEKKISISICSETYLDCIAI SFRDITSPYRYLNLLSGILNCKVIDIQANNFLGKEKSPSFNNKMKNITFSDFVYEKKNKL SEVYIRMFIDNILGDMEMIKNLVASHFTGNLISYEILDDNSFIFSLEMSECSESLLLLEI EKNRKYRNNFRITISCNDNNLAFLGPGYYATVVEEGRFLAKILKGKFSISPNELPYLEKP EYDFNWLVSYYDKLLQQHLYDINCEFKNGKQAYFCWSEEMYQPLPVPSTVITPLGRYSIP KLELEMQQNGFESMRNRFFIFNHFIKTAD >gi|296153861|gb|ADVK01000049.1| GENE 2 1121 - 1360 313 79 aa, chain + ## HITS:1 COG:no KEGG:FN1796 NR:ns ## KEGG: FN1796 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 79 1 79 79 101 100.0 9e-21 MERLTKEEIKKIIDELKKSGKYKEYEEMLLDDFEEHHVVYKVEADEIIAIAHKNKTIPYK LIEYYDWQQMNYLIEEEND >gi|296153861|gb|ADVK01000049.1| GENE 3 1466 - 2596 1320 376 aa, chain + ## HITS:1 COG:FN1797 KEGG:ns NR:ns ## COG: FN1797 COG3842 # Protein_GI_number: 19705102 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport systems, ATPase components # Organism: Fusobacterium nucleatum # 1 376 1 376 376 717 99.0 0 MEKKDINIVNVNKSFDGVQILKDINLKIEQGEFFSIIGPSGCGKTTLLRMIAGFISPDSG AIYLGDENMVNLPPNLRNVNTIFQKYALFPHLNVFENVAFPLRLKKVDEKTINEEVNKYL KLVGLEEHSTKKVSQLSGGQQQRISIARALINKPGVLLLDEPLSALDAKLRQNLLIELDL IHDEVGITFIFITHDQQEALSISDRIAVMNAGKVLQVGTPAEVYEAPADTFVADFLGENN FFSGKVTEIINEELAKINLEGIGEIIIELDKKVKIGDKVTISLRPEKIKLSKNEIKKTKN YMNSAAVYVDEYIYSGFQSKYYVHLKNNEKLKFKIFMQHAAFFDDNDEKAIWWDEDAYIT WDAYDGYLVEVESEKK >gi|296153861|gb|ADVK01000049.1| GENE 4 2583 - 3437 719 284 aa, chain + ## HITS:1 COG:FN1798 KEGG:ns NR:ns ## COG: FN1798 COG1176 # Protein_GI_number: 19705103 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport system, permease component I # Organism: Fusobacterium nucleatum # 1 284 1 284 284 477 99.0 1e-134 MKKNSKLGLSYSLPINIWLTVFFLIPILIILSYSFLKRGTYGGVEFKLSFETFNIFIDKV FLKILINTIYISILITIFTVLLAIPISYYIARSRHKQELLFLIIIPFWTNFLVRIYSWIA 
LLGNNGFINHFLMKLHIINEPIKMLYNVPAVVIISVYTSLPFAILPLYAVVEKFDFSLLD AARDLGATNFQAFRKVFIPNIKAGIITSTIFTLIPALGSYAVPKLVGGTNSLMLGNVIAQ HLTVTRNWPLASTISGALIVLTSIVVWLFSKYEEKENKVGENNV >gi|296153861|gb|ADVK01000049.1| GENE 5 3430 - 4215 827 261 aa, chain + ## HITS:1 COG:FN1799 KEGG:ns NR:ns ## COG: FN1799 COG1177 # Protein_GI_number: 19705104 # Func_class: E Amino acid transport and metabolism # Function: ABC-type spermidine/putrescine transport system, permease component II # Organism: Fusobacterium nucleatum # 1 261 4 264 264 412 100.0 1e-115 MSNNLNRRKTSFIIFILAMIFFYLPLIVLVIYSFNDGKGMVWNGFSLRWYKELFKHSNNI WKAFYYSIFIALISSLVSTIIGTFGAIALKWFDFKGKKYLKNMSVLPLVVPDIIIGVSLL IMFATLKFKLGIITIFIAHTTFNIPYVLFIVLSRLDEFDYSIVEAAYDLGATNRQTLTKV IIPMLLPAIISAFLMALTLSFDDFVITFFVSGPGSSTLPLRIYSMIRLGVSPVVNALSVI LIVISILLTLSTKKLQKNFIK >gi|296153861|gb|ADVK01000049.1| GENE 6 4275 - 5099 1020 274 aa, chain + ## HITS:1 COG:FN1800 KEGG:ns NR:ns ## COG: FN1800 COG0652 # Protein_GI_number: 19705105 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family # Organism: Fusobacterium nucleatum # 1 274 1 274 274 513 100.0 1e-145 MKKLFKLFSILALSLIFLVSCSSIKSTMKSVASVFKSPTKYNNVTVTFVTTQGEITFFLY PEAAPLTVANFINLAKRGFYDNTKFTRSVDNFIVQGGDPTGTGMGGPGYTIPDEFVEWLD FYQPGMLAMANAGPNTGGSQFFFTFSPADWLNGVHTVFGEVRSEGDFQRIRKLEMGDVVK EVKISENGDFILSLFKDQVEQWNSVLDREYPNLRKVAIKDPNPQDVEAYKEELERLYAKK EKNNSDFEYPITKFIRKVFNKAGGYTPKAPVISN >gi|296153861|gb|ADVK01000049.1| GENE 7 5231 - 6472 1585 413 aa, chain - ## HITS:1 COG:FN1801 KEGG:ns NR:ns ## COG: FN1801 COG0786 # Protein_GI_number: 19705106 # Func_class: E Amino acid transport and metabolism # Function: Na+/glutamate symporter # Organism: Fusobacterium nucleatum # 1 413 1 413 413 725 99.0 0 MNFETIEGVLNINLNSTMTLALAALLLIMGYSINKRLTILNKYCIPAPVVGGFIFMFLTW LGHISGTFKFNFENIFQSTFMLAFFTTVGLGASFTLLKKGGKLLIIYWLTCGIISILQNV IGMTISKTIGLEAPYALLSSAISMVGGHGAALAYGDTFAKMGYESAPLVGAAAATFGLIS 
AVLIGGPLGRRLIEKNNLKPDNTENFDQSVTEINADKGEKLLDLDIIKNVVVILLCMAVG SYISTLIGKLIKMDFPSYVGAMFVAVIVRNINEKTHTYNFSFSLVDGIGNVMLNLYLSLA LMTLKLWELSGLIGGVLLVVACQVAFMILIAYFVVFRILGSNYDAAVMCSGLCGHGLGAT PSAIVNMTAINEKYGMSRKAMMIVPIVGAFLVDIIYQPATVWFIKTFIKGFVQ >gi|296153861|gb|ADVK01000049.1| GENE 8 6859 - 7512 738 217 aa, chain + ## HITS:1 COG:FN1803 KEGG:ns NR:ns ## COG: FN1803 COG1309 # Protein_GI_number: 19705108 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 217 1 217 217 359 100.0 2e-99 MDKLNIKKRRVMMYFIEATQELILNEGIENLSIKKIADTAGYNTATIYNYFEDLEELILY SSIDYLKIYLKDLRNKINSDMKAIEMYKTIYKVFVHHSFEKPEIFHTLFFGKYSYKLEKI IKKYYEIFPDDITGQTDLTKSILIEANIHNRDIPVMKQMIKEGSILEEEAPYIMEAIVRI HQSYLENILQQREEFSLEEHKNKFFNIFNFLLKRKNK >gi|296153861|gb|ADVK01000049.1| GENE 9 7570 - 9030 2159 486 aa, chain + ## HITS:1 COG:FN1804 KEGG:ns NR:ns ## COG: FN1804 COG2195 # Protein_GI_number: 19705109 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Fusobacterium nucleatum # 1 486 1 486 486 921 99.0 0 MAYKSVEDLRKHRVFYNFLEISKIPRQTFFEKEVSDFILNWAKKLGLEVHQDEKNNLLIR KPATLGYENKKPIVIQAHIDMVCEKRPEVEHDFRKDPIKLILEGDILSTGNRTTLGADDG IGVAMAMAILEDKNLKHPPVDVILTTAEEEDMSGALNINKSWFNTNRLINIDHVVDTEII AGSCGGVGVDLKFPVEHTKKTDNYKGYQIKISGLRGGHSGEDIHKGRANANVLLANLLNL LREKINFLISDLKGGNFRLAIPREAHVTVALEEKDIDILKNIVKNFESEAKKIYEETAVD LKIEVSAENLAENLLSKNTVDKIIDAIILSPNGVSSMIGSLNVVESSCNLGEVYIKENHV YLVTEIRATFDKNRDYIYNKIALIGKYLGGELRVFSAYPSWVYKAHSNLRDTANKVYSEI FGENIKTLAVHAGLECGCFVDKIQGDMDAISIGPNAWDLHSPNERLSVSSTEKVYKFLTK VLENLD >gi|296153861|gb|ADVK01000049.1| GENE 10 9068 - 10264 1749 398 aa, chain + ## HITS:1 COG:no KEGG:FN1805 NR:ns ## KEGG: FN1805 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 11 398 1 388 388 576 99.0 1e-163 MHPAMAFFILMAFLALGEFVSVKTRAIVPSILIFLILLLVGVWGGFLPKEIIDLGGFSEA MTEVIMVVIVVNMGSSLSLDSLKKEWKTVLIGIGAIAGIAGVILPIGSIIYDWQTAVVAA 
PPIAGGFVAAFEMSKTSLAKGLPHLSTIALLLLALQEFPVYFLLPSLLRSETLRRLDLFR KGELKAVAVEEEKNKKRLIPPIPEKYMDTSTYLFLLGLVGMLAILASMLSGKFFNSFGID FKISPTIFALFFGIIAGEIGLLERKSLQKANCFGFFVVASVVGVMGGLVNSSMEEILALI IPLVVLIFLGIIGMAIGGIIIGKMLKITWQMSFAIALNCLIGFPVNFLLTNEAINVLAKT EEEKDFLSNTMVPTMLVGGFTTVTLGSVVFAGILTNFL >gi|296153861|gb|ADVK01000049.1| GENE 11 10369 - 11232 783 287 aa, chain + ## HITS:1 COG:FN1806 KEGG:ns NR:ns ## COG: FN1806 COG0697 # Protein_GI_number: 19705111 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 287 1 287 287 471 99.0 1e-133 MKNNSKAYFLIIAAVVIWSLSGLLVKAVDADPLWISLIRSLGGGIFLLPYIFKEKIYPMK NILFGGIFMAIFLLAITITTRISSSAMAISMQYTAPMYVIGYGFYKSKEIKFNKFIVFLL IFAGIIFNSITSMNGGNWWAIVSGITIGVAFVFYSYNLQKVKKGNLLGIVALINIISAVF YGVLLIFRYSSPPSSFNEIIILSISGIVISGISYALYGEGLREVSMEKAMIICLAEPVLN PIWVYLGKGEIPSMTTVIGSTLILLSAIVDITFSIKNNKKDKKTVTN >gi|296153861|gb|ADVK01000049.1| GENE 12 11280 - 12074 1407 264 aa, chain - ## HITS:1 COG:FN1807 KEGG:ns NR:ns ## COG: FN1807 COG5266 # Protein_GI_number: 19705112 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Co2+ transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 264 1 264 264 513 100.0 1e-145 MKKSLVLIGSILLAANLFAHDHFLYTSNLDASNQKEVKMKAILGHPAEGPEAEPISIATV DGKTHMPKAFFVVHDGVKTDLLSKVKVGTIKTKKGEYVALDAVYTAEDGLKGGGSWVFVM DSGNTKDSGFMFNPVEKLIITKDSAGSDYNQRVAPGHNEIVPLVNPVNAWKENVFRAKFV DKDGKPIKNARIDVDFINGKIDMVNNTWAANKEAPKTSLRVFTDDNGVFAFVPSRAGQWV IRAVASMDREKKVVHDASLVVQFE >gi|296153861|gb|ADVK01000049.1| GENE 13 12123 - 12542 661 139 aa, chain - ## HITS:1 COG:no KEGG:FN1808 NR:ns ## KEGG: FN1808 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 15 139 1 125 125 231 99.0 7e-60 MKKFLVLIIGVLMSVVVFAHAPLISIDDNGDGTVYIEGGFSNGASAEGVEIIIVKDKAYN 
GPEESFKGKEIIYKGKLDAKNSLTLPKPATEKYEVYFNAGEGHVVSKKGPALTAGEKANW DKATASFDFGEWKELMMEK >gi|296153861|gb|ADVK01000049.1| GENE 14 12775 - 13653 1181 292 aa, chain - ## HITS:1 COG:FN1809 KEGG:ns NR:ns ## COG: FN1809 COG0803 # Protein_GI_number: 19705114 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, periplasmic component/surface adhesin # Organism: Fusobacterium nucleatum # 10 292 1 283 283 520 100.0 1e-147 MKKILLFVLMLVLGSFSFAENVVITSIQPLYSLTSYLTKGTDIKVYTPFGSDISMTMSKE SIREEGFNLAVAKKAQAVVDIAKIWPEDVIYGKARMNKINIVEIDASHPYDEKMTTLFFS DYSNGKVNPYIWTGSKNLVRMVNIIGRDLIRLYPQNKAKIEKNITKFTADLLKIENEANE KLLAVGDAEVISLSENLQYFLNDMNIYTEYVDYDSVNAQNIAKLIKDKGIKVIVSDRWLK KDAIKALKEAGGEFVVINTLDIPMDKDGKMDPEAILKAFKENTDNLIEALSK >gi|296153861|gb|ADVK01000049.1| GENE 15 13655 - 14548 787 297 aa, chain - ## HITS:1 COG:FN1810 KEGG:ns NR:ns ## COG: FN1810 COG1108 # Protein_GI_number: 19705115 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Mn2+/Zn2+ transport systems, permease components # Organism: Fusobacterium nucleatum # 1 297 1 297 297 415 100.0 1e-116 MLESFRNFLMNLAEQGDIPSSFKYGFVINAMICALLIGPILGGIGTMVVTKKMAFFSEAV GHAAMTGIAIGVILGEPFSAPYVSLFTYCILFGLVINYTKNRTKMSSDTLIGVFLSISIA LGGSLLIYVSAKVNSHALESILFGSILTVNDTDIYILVVSAIVIGFVLVPYLNKMLLASF NPNLAIVRGVNVKLIEYIFIIIVTIITIASVKIIGSILVEALLLIPAAAAKNLSKSIKGF VGYSVIFALVSCLLGVYLPIHFDISIPSGGAIIMISSFIFIVTIIIKVLFKNFAEGE >gi|296153861|gb|ADVK01000049.1| GENE 16 14573 - 15262 233 229 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|225084369|ref|YP_002657150.1| ribosomal protein S16 [gamma proteobacterium NOR51-B] # 16 228 20 236 309 94 30 1e-18 MNGLEIQIKDLNLILSGNEILEDINLTVKAGEIHCLVGPNGGGKTSLLRCVLGQMPFTGS IEMKYEKDKIIGYVPQVLDFERTLPITVEDFMAMTNQIKPCFFGLSKKCKPEVDNLLKKL GVFEKKKRLLGNLSGGERQRVLLAQALFPRPNLLILDEPLTGIDKAGEDYFKEIVKELKN EGMTILWIHHNLTQVKELADTVTCIKKRMIFSGDPKEELKEDKIMRIFE >gi|296153861|gb|ADVK01000049.1| GENE 17 15264 - 16172 1357 302 aa, chain - ## HITS:1 COG:FN1812 KEGG:ns NR:ns ## COG: FN1812 
COG0803 # Protein_GI_number: 19705117 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, periplasmic component/surface adhesin # Organism: Fusobacterium nucleatum # 1 302 1 302 302 550 100.0 1e-156 MYKKILAILMIIFSFSAFAKEKLKIGVTLQPYYSFAANIVKDKAEVVPVVRLDQYDSHSY QPKPEDIKRMNTLDVLIVNGVGHDEFIFDILNAADRKKDIKVIYANKNVSLMPIAGSIRA EKVMNPHTFISITTSIQQVYNIAKELGEIDPANKEFYLKNARDYAKKLRKLKTDALNEVK HLGNIDIRVATLHGGYDYLLSEFGIDVKAVIEPSHGAQPSAADLEKVIKIIKNEKIDIIF GEKNFNNKFVDTIHKETGVEVRSLSHMTNGPYEVDSFEKFIKIDLDEVVKAIKDVAAKKG KK >gi|296153861|gb|ADVK01000049.1| GENE 18 16184 - 17092 1033 302 aa, chain - ## HITS:1 COG:FN1813 KEGG:ns NR:ns ## COG: FN1813 COG0803 # Protein_GI_number: 19705118 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type metal ion transport system, periplasmic component/surface adhesin # Organism: Fusobacterium nucleatum # 1 302 1 302 302 513 99.0 1e-145 MKRIIFIVFLIFNTFLLGQEKLKVGITLLPYYSFVANIVKDKAEVIPIVKAESFDSHTYQ PKVEDIERASKVDVVVVNGIGHDEFIYKILDAVDKNKRPVIINANKDVPLMPVAGTLNDE KIMDSHTFISITAAIQQVHNITKELVKLDPKNKDFYLANSREYVKKLRKLKTDALKEVQN VNGEDVKVATFLGGYNYLLAEFGIDVKAVLEPTHGSQISMSSLQKIIEKIKKDKIDIIFG EKNYSDEYVTIIKNETGIEVRKLEHLTTGAYTADSFEKFIKVDLDEVVSAIKYVKSKNKS KK >gi|296153861|gb|ADVK01000049.1| GENE 19 17104 - 17592 711 162 aa, chain - ## HITS:1 COG:no KEGG:FN1814 NR:ns ## KEGG: FN1814 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 162 31 192 192 302 99.0 3e-81 MAAIALKIRQRTEYKIDTKEDEIVSYEVLNNIELGIYSDIKNSLVDISQLRDEQNSLPSL DLLAEEEIPPYFKDITWEQRGAVEWTAFKHDGEDYFIGKGNGKVGTFLVKFNNENMDESD IFYMKETPSFNDIEKNFEKYEHIAKKIVPFTGNDERRKLTGE >gi|296153861|gb|ADVK01000049.1| GENE 20 18038 - 18187 143 49 aa, chain - ## HITS:1 COG:FN1815 KEGG:ns NR:ns ## COG: FN1815 COG3666 # Protein_GI_number: 19705120 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 49 63 111 111 81 93.0 4e-16 
MEKLFKLSEIPTETIYIDGTKIEAYANKYSFVWEKSTLKYKERLGLYNK >gi|296153861|gb|ADVK01000049.1| GENE 21 18191 - 18373 237 60 aa, chain - ## HITS:1 COG:FN1815 KEGG:ns NR:ns ## COG: FN1815 COG3666 # Protein_GI_number: 19705120 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 60 1 60 111 109 100.0 1e-24 MFAVIIYAYSRGIYSTRDIEYLCKGSQRAQYLLNSSNIPDYSTIARFLLKSNDIIYELFC >gi|296153861|gb|ADVK01000049.1| GENE 22 18551 - 19258 939 235 aa, chain - ## HITS:1 COG:no KEGG:FN1816 NR:ns ## KEGG: FN1816 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 235 1 233 233 350 92.0 3e-95 MEELSTKEFNEIYKKYFKEYFDKPLKKDGYYKKGTITFYKINKLMMIEVINFQRHYDRLT VNFGVSPLWCGALKGAMSIGGRINKFSKEKSHWRYQWWNIRNEEEIKNSMIEILELIQTG LYKWFTENDNEENIMESLKEAYFDKINEYMFLATAMAKFKRYDEILPYIEKVKKEYREEY TAEQRNNVEWLKKLFKEILLLEEKVREGNKAVDKYIVEREKQSLIEMGLEKLIKN >gi|296153861|gb|ADVK01000049.1| GENE 23 19271 - 27697 10666 2808 aa, chain - ## HITS:1 COG:FN1817 KEGG:ns NR:ns ## COG: FN1817 COG3210 # Protein_GI_number: 19705122 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 1 2808 1 2806 2806 4066 96.0 0 MRNKFLKKFITVIFLLIYNIEIFAANLVVDLNSNHNTKLDESANGVPIVNISTPNKNGIS VNEFSEYNIGKEGQIINNADNIGRSHLGGLINANPNLAPNQAANLVILQVNGSNRSQIEG YLEALSRERIDVILSNENGLYINNGGTINIKNFTATTGKVNLKDGDFIGIDVEKGNIVIG PKGMDGTNANYVEIIAKTLELRGNIVTNDLKVVTGSNSTTSTNNIAIDAKELGGMYANRI RIISTDKGAGVNSDAFIVSKNSKLEITADGKIKVNKVQGKGIDIKGKEYEQKDLAYSDEG ISINADKIKLSGTGTQANKQINLNGTVENSTTIYTKEGIKTKGLTNTGIVQAINKIEVEG NLTNSGEILTNKNLTAKDTISTKKLIAKDGISVGKLENSGKVISNKKLNVDGSLNNSGEI QTLDNINIKENTVNIGDILTNGTLTSKDVKNEKVISVSKDINIGKLENSGKVSTAKKLDI NGNLTNLGNIQAVENISVTNNVLNKGTILTNGFFTSKDIKNEKELSANKDISVSKLENSG NVVTNSKININGVLTNSGELKALDNITTTGNITNNGSILTNKNFVTSDLINNKKIIAKEK IDIKNLKNTGTIASGDKFTINGNLENANNIETTKLDIIGNKLTNSGSIKADNITTNVANI 
TNDGKILSFNNISFSNAQNITNRNEITALKDIEANNTNLVNSGDIASNGKVLLNNSNITN AKKIASSTIEIKDNKKFDNTGEIKGNNVTLTTTNDIDLVGKLHGAQSLTISGKNIINNGE TTGTGTTTITASNNFTNNKNLSAQTLTVTATGDVVNNKELNGGKVSITGNNIQNNDLIAA AGDLTLTATNKLENKSGKTIFAGNKLTITAKEIKNNKRAELLGNNIELTADKVRNEVGTI KAFNDITIKTDKFENIGEVKDLDKYESYYETWDGKILSESEINDWKRYYSESSRKRSNGS AGDHVRDKQREAYERISNTVVEDKYKSLLFPKYKKLMEGYLGNEGEHREKTGTAKIQDIP LKEKVKSLGETERGKVLAGGNITIEGKNAGNTSEVLNKDSIISAGNTVRINTNKLENIVS IGEKVKVKTGQESMKIKFEHTGRRPRKKVRMEVTYTRDFADDYITKKVPVLDENGKQVYE TYEVGDKTKRRKKYETVTEYVGRYAYVTGSPSVIEGKNVVINPSSIVKQEINDANGKINE GKENKVIKEERKVHTGVNKDIKKENIAPNQINVKDELKKYGNVGTDGTIYNGNNGINGQI AGSTKVIDKIIKNGKIDTDASLSSALFIKNISSDSKYIMETRLKYIDQKRFFGSDYFLNR VGYEDKWNRVKRLGDAYYENELIERSVTEKLGTRFLNGKEISAKDLMDNAVEEAKKNGLT VGQPLTKEQIAKLDKDIVWYEYQEVDGIQVLAPKVYLSQNTIKNLNTDSRSRITGLENTY VRTGNLENAGLIGGYGNTYVEAKEVNNRTLGNQFAEIRGNKTTIIAQNNINNIGAKISGN ENLNLVAVNGDIVNKSTVEKIGFNNGEFDRSKFTKIDSVGEIVSNGNMYMLTNNYTSVGA ITQAKNANINVTNDINIKSQEVSGEQKFGKDDSQYNYYGFERNIGSVVKAENLNTTASNL NITGSAVTTKRANLNVDKLNIESKVDKEDEIRKSSYKDLLKSGSKEEIIHNEENSAGSLY VKDKGTIKGDINLVGSNLVLGNDSFVGGKVTTDSRELHSSYSLEEKKKGFSGSIGSGGFS VGYGKSESKLKEKDLTNAKSNLVLGDGTTLNKGADITATNLIHGNISINNGDVKFGARKD VKDIETSSKSSGINLSVKIKSDALDRAKQGVDSFKQMKSGDILGGIASTTNTVTGVVSGL ASNQGTKLPLSAVNKNNSDDDNDDDQNNQDNTVGKDNLKAAQATNNFYANVGVSLGFNKS SSNSNSHSESGVVTTIKGKNGNSSITYNNVKNIEYIGTQAQNTKFIYNNVENITKKAVEL NNYSSSSSRSSGISAGVTIGYGDGTQTSVDAVKVSASKSKMNTNGTNFQNGRFVNVDEVH NNTKNMTLSGFNQEGGTVTGNIRNLTIESKQNTSTTKGSTKGGSLSIAPNGMPSGSANYS QTNGERKYVDNATTFLIGDGSNLKVGKVENTAGAIGTTGNGKLSIDEYVGHNLENKDETT TKGASLSLSPDSNVISGVGINYANRDLESVTKNTVIGNVEIGKSFGDEINKDLDSMTEIT KDRKFETNINIESQTIKYALNPEAFKQDLKKAKNEIEDIGNVIENTVNPPGEDSRNFFAN LRAQRWNTSFYNVTGSRMEELSRQFKAGEIDENQLKEAARDLIKGYGKDIGIDFEVVYLD EETMPKDSKGSTGSSYILDKKNRKVLIPIDVSKIGDINELLGTLTEEVSHGKDALEGRQD KKVAEDKSNNEKGLESLGRPANDYVKNRLGEDNNSKIKLSTDGIDLSNVDVGEKVGDVNT QDDRNSGYAETLPSQRDELWLNNMHFRLLKFKKGINEFKYVALGTAIRINSMLFNPEDKT IRDFTLGAFSAGAEDLSLNTYKIPKKFQYNTSAYKYGELFGHSTMSAIGILGTIIGSGMQ 
GGGTAAAPATGGLSLIGVPAGVVVQTYSAGLTATSAINSAKAGLDIKAMKSEGGSSSSAE TEIYDKKEKDEKLEKIELESREKRPTYRESEKDVYDFLDEKYKNNQNVEVQEQASFKGGE EAKHFGEKGSVRPDDTVIIKTETKNGTVISRESYEVKNYKIENYNNMTSKIAEQAKRRVD ELPKGTVQKVVIDARGQKLTAETEIKTINDIVKKSNGIINAENIFFIK >gi|296153861|gb|ADVK01000049.1| GENE 24 27712 - 29502 2027 596 aa, chain - ## HITS:1 COG:FN0131 KEGG:ns NR:ns ## COG: FN0131 COG2831 # Protein_GI_number: 19703476 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Hemolysin activation/secretion protein # Organism: Fusobacterium nucleatum # 1 565 1 565 566 980 98.0 0 MKKIVTYIFLVFNVFAFSDSFNENEDERTILKQEQRSEQERLQKEFQKREEIFNQLKSEK TDKQEVSTNEIKFHISQINLEDNERLLNEIEKENILGKYINRDLGSTDITNLITELTNRL IAKGYITSVATISENNDLSTKTLNLKIIPGKIEKIILNEDKTLDNLKKYFLVDTKAGKVL NIRDLDTTTENFNYLEANNMTMEIIPSEIQNHSIVKLKNEMKEKFTVSVLTNNYGEDRQN AIWRGGVSINIDSPLGIGDRVYFSYMTVHKKKADRSWKRTTESLKPGEIAPIGPKGYDPR KDTLPYKRDLDLYNFRYTLKFNSYTLSLGSSRTENTSSFYTPNTVYDMETVSNTFSVNLD KVLLRNQKNKLTFGIGLKRKHNQSYIEEAILSDRVLTIGDISLNGTTTFYGGLLGASLGY ERGMRALGAERDKNKGVKSPKAEFMKYTLNTNYYKPLTQKLVYRFNTTLTHSNDVLYGSE KHSIGGVGSVGGYHRTGNIQGDKAIEIENELSYRVLDSEKFGKISPYLSYSYGKVRNNKN NSKYRKGYMSGAILGLRYNMKYLDLDVAYAKPLARSNYLKPKNREIYFSATLKIKF >gi|296153861|gb|ADVK01000049.1| GENE 25 29656 - 30507 1426 283 aa, chain - ## HITS:1 COG:FN0130 KEGG:ns NR:ns ## COG: FN0130 COG1136 # Protein_GI_number: 19703475 # Func_class: V Defense mechanisms # Function: ABC-type antimicrobial peptide transport system, ATPase component # Organism: Fusobacterium nucleatum # 16 283 1 268 268 488 99.0 1e-138 MSIDNNELDEMDFDLLDILGVTEQKVESITLLPGYNKKGEKEGYEELVIKAGEIVAIVGP TGSGKSRLLADIEWGAQGDTPTKRTVLVNGELMDAKKRFSPSYKLVAQLSQNMNFVMDLT VREFIDLHAESRLVLDRESVIEKIFNQANELAGEKFTIDTPITSLSGGQSRALMISDTAI LSTSPIVLIDEIENAGIDRKKALDLLVGNNKIVLMATHDPILALMGDRRIVIKNGGINKV IESTPEEKNILGALTELDDVVQGMRNKLRYGERLELDFEIKKK >gi|296153861|gb|ADVK01000049.1| GENE 26 30615 - 30701 113 28 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSRANLGVFEADLSASLPNLQRTLIFQR >gi|296153861|gb|ADVK01000049.1| 
GENE 27 30758 - 31453 886 231 aa, chain - ## HITS:1 COG:FN0129 KEGG:ns NR:ns ## COG: FN0129 COG0378 # Protein_GI_number: 19703474 # Func_class: O Posttranslational modification, protein turnover, chaperones; K Transcription # Function: Ni2+-binding GTPase involved in regulation of expression and maturation of urease and hydrogenase # Organism: Fusobacterium nucleatum # 1 231 1 231 231 455 98.0 1e-128 MKLITVSGPPSSGKTSLIIKTVESLKAQNIKVGIVKFDCLYTDDDVLYEKAGILVKKGLS GSVCPDHFFASNIEEVVQWGKTNNLDLLITESAGLCNRCSPYLKDIKAVCVIDNLSGINT PKKIGPMLKLADIVVITKGDIVSQAEREVFASRVQTVNPKAAIIHINGLTGQGTYEFGSL IMDKNEEIDTVIERKLRFPLPSAVCSYCLGETRIGSSYQLGNIRKINFEEQ >gi|296153861|gb|ADVK01000049.1| GENE 28 31453 - 32631 1687 392 aa, chain - ## HITS:1 COG:FN0128 KEGG:ns NR:ns ## COG: FN0128 COG1840 # Protein_GI_number: 19703473 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type Fe3+ transport system, periplasmic component # Organism: Fusobacterium nucleatum # 79 392 1 314 314 613 98.0 1e-175 MYISKSMSIKSIVEKYPETIPVFTNIGFKGLDNPAVLQKLEEQNITLEKAMMIKKEDVDA FIPMLQQAIASVEREDEGVKEASLMGLLPCPVRIPLLEGFEKYLADNKDIKVKYELKAAY SGLGWIKDEVIDKNDIDKLADMFISAGFDLFFDKGLMGKFKEQGIFKDMTGIEKYNTDFD NENIHLKDPHGDYSMIGVVPAIFIVNKAALDGREVPRAWADLLKPEFAKSVSLPIADFDL FNSILIHIYKLYGFEGVKNLGRSLLSNLHPAQMVEAKEPVVTIMPYFFSKMIPEKGPKEV IWPKEGAIISPIFMLTKASKAKELDKIIKFMSGKAVGDTLANQGLFPSVHPEVKNPVNGK PMLWVGWDFIYSNDMGELIKKCEETFKEGAGE >gi|296153861|gb|ADVK01000049.1| GENE 29 32786 - 33640 878 284 aa, chain + ## HITS:1 COG:FN0127 KEGG:ns NR:ns ## COG: FN0127 COG0731 # Protein_GI_number: 19703472 # Func_class: C Energy production and conversion # Function: Fe-S oxidoreductases # Organism: Fusobacterium nucleatum # 1 284 1 284 284 491 100.0 1e-139 MYKHVFGPVPSRRLGISLGVDLVVSKSCNLNCIFCECGATKKIQLERQRFKDMDEILNEI QSVLKNIKPDYITFSGSGEPTLSLDLGNISKAIKEDLKYKGKICLITNSLLLANNQVIKE LEYIDLIVPTLNTLKQDIFEKIVRPDYRTSVDEIKKGFVNLNNSNYKGKIWIEIFILENI NDSEENFIEIANFLNLENIRYDKIQLNTIDRVGAERDLKAISFDKIFKAKKILEENELHN IEIIKSLNELDDNKKILINQELLDNMKQKRLYQEEEINKIFKKS 
>gi|296153861|gb|ADVK01000049.1| GENE 30 34009 - 35373 1329 454 aa, chain - ## HITS:1 COG:FN0123 KEGG:ns NR:ns ## COG: FN0123 COG1672 # Protein_GI_number: 19703471 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Fusobacterium nucleatum # 1 454 1 454 454 746 95.0 0 MNFINREKELETLNKEYKKDNSFVVLYGRRRVGKTTLIKEFIKGKKAFYFFADKQNENLQ IERFKNQISEYFKDEFLKKIEIKDWDTVFDYLLTKISNEKIIIVIDEFQYLCMINKDFSS IFQRIYDEKLKDKNIMIILCGSLISMMYSETLAYESPLYGRRTAQIKLQAIKFKYYSKFF RDKSTQELIELYSITGGVPKYILSLDRNKSALYNIENNIFDKNNYLYSEPKFLLQEEVND LSRYFSILNAISIGHTKMSAISSYLQINAGGLSSYISKLIDLDILEKESPITENIENTKK VLYKIKDNYLKFWFSYVYPYQSYLEIENLTYIKNKIENEFDLYVSKIYEDLARESIWENM KFPLVKVGRWWDKDTEIDIVALGEDNKIVFGECKYSKKQVGLNILNELKEKSKKVIWKND KREEYYIIFSKSGFSQDLIELAKKKNNIILKELV >gi|296153861|gb|ADVK01000049.1| GENE 31 35603 - 35797 275 64 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296328999|ref|ZP_06871506.1| ## NR: gi|296328999|ref|ZP_06871506.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 64 4 67 67 94 100.0 2e-18 MKKFKSIFLVLVIILLSLGFTSTTYARERNGNRIAHDKWNNKGRKGNAGWRFSLDDEEDY ENYE >gi|296153861|gb|ADVK01000049.1| GENE 32 35824 - 35991 205 55 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296329000|ref|ZP_06871507.1| ## NR: gi|296329000|ref|ZP_06871507.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 55 1 55 55 89 100.0 7e-17 MKKYIGAIYVVIGVIILEYLGNLFFNWNFSFLDSLLTGICGALGYLWIAIRKKSK >gi|296153861|gb|ADVK01000049.1| GENE 33 36058 - 36225 236 55 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MTKLKIIGNLIITIILVIIIDFFVSKVFKSEFSVYQNILTGITAYIGYYFATKKK >gi|296153861|gb|ADVK01000049.1| GENE 34 36276 - 36743 728 155 aa, chain - ## HITS:1 COG:FN0119 KEGG:ns NR:ns ## COG: FN0119 COG0716 # Protein_GI_number: 19703467 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 155 10 164 164 297 99.0 5e-81 MAKSLIIYYSLDGKTKKVVDVLEKLTNADVYEIELEKPYTKLTAYTIGLGHCKIGYEPPI KNEIDLSTYDKIFIGGPTWWFTYAPPINSFINKYDLSDKIIYPFGTATSNFGAYFERFNK GCKAKEIKKPLKILRSTLKNGLEEAVKLWLNEEYN >gi|296153861|gb|ADVK01000049.1| GENE 35 36767 - 37945 1999 392 aa, chain - ## HITS:1 COG:FN0118 KEGG:ns NR:ns ## COG: FN0118 COG0484 # Protein_GI_number: 19703466 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: DnaJ-class molecular chaperone with C-terminal Zn finger domain # Organism: Fusobacterium nucleatum # 1 392 1 392 392 606 99.0 1e-173 MAKRDYYEVLGIDKSASENDIKKAYRKAAMKYHPDKFANASDAEKKDAEEKFKEINEAYQ ILSDSQKKQQYDQFGHAAFEQGGAGFGGGFNAGGFDFGDIFGDIFGGGGFGGFEGFSGFG GSSRRSYVEPGNDLRYNLEITLEEAAKGVEKTIKYKRTGKCEHCHGTGGEDDKMKTCPTC NGQGTIRTQQRTILGVMQSQSVCPDCHGTGKVPEKKCKHCHGTGTAKETVEKKVNVPAGI DDGQKLKYAGLGEASQNGGPNGDLYVVIRIKSHDIFVRDGENLYCEVPISYSTAVLGGEV EIPTLNGKKMIKVPEGTESGKLLKVKGEGIKSLRGYGQGDIIVKITIETPKKLTDKQKEL LQKFEESLNEKNYEQKSSFMKKVKKFFKDIID >gi|296153861|gb|ADVK01000049.1| GENE 36 37993 - 38505 733 170 aa, chain - ## HITS:1 COG:FN0117 KEGG:ns NR:ns ## COG: FN0117 COG0350 # Protein_GI_number: 19703465 # Func_class: L Replication, recombination and repair # Function: Methylated DNA-protein cysteine methyltransferase # Organism: Fusobacterium nucleatum # 1 170 1 170 170 283 100.0 1e-76 MKNIKGISFLYNKEIGYLEIIEEKDGISEISFLGNIDIETRRNLYNIFNESPLTKKCSQQ LEEYFKGKRKEFNIELDIRGTEFQKQCWDALVKVAYGETISYSDEAKMIGKDKAVRAVGS 
ANGKNCIPIIIPCHRIVYKGGEIGGYSGGEGGNKGIEIKKYLLELEKKFK >gi|296153861|gb|ADVK01000049.1| GENE 37 38767 - 40590 2951 607 aa, chain - ## HITS:1 COG:FN0116 KEGG:ns NR:ns ## COG: FN0116 COG0443 # Protein_GI_number: 19703464 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone # Organism: Fusobacterium nucleatum # 1 607 1 607 607 1016 99.0 0 MSKIIGIDLGTTNSCVAVMEGGSATIIPNSEGARTTPSVVNIKDNGEVVVGEIAKRQAVT NPTSTVSSIKTHMGSDYKVEIFGKKYTPQEISAKTLQKLKKDAEAYLGEEVKEAVITVPA YFTDSQRQATKDAGTIAGLDVKRIINEPTAAALAYGLEKKKEEKVLVFDLGGGTFDVSVL EISDGVIEVISTAGNNHLGGDDFDNEIINWLVAEFKKENGIDLSNDKMAYQRLKDAAEKA KKELSTLMETSISLPFITMDATGPKHLEMKLTRAKFNDLTKHLVEATQGPTKTALKDASL EANQIDEILLVGGSTRIPAVQEWVENFFGKKPNKGINPDEVVAAGAAIQGGVLMGDVKDV LLLDVTPLSLGIETLGGVFTKMIEKNTTIPVKKSQVYSTAVDNQPAVTINVLQGERSRAT DNHKLGEFNLEGIPAAPRGVPQIEVTFDIDANGIVHVSAKDLGTGKENKVTISGSSNLSK EEIERMTKEAEAHAEEDKKFQELVEARNKADQLISATEKTLKENPDKVSEEDKKNIEAAI EELKKVKDGDDKSAIDSAMEKLSQASHKFAEELYKEAQAQAQAQQQAGANAGSDKKDEDV AEAEVVD >gi|296153861|gb|ADVK01000049.1| GENE 38 40629 - 41234 895 201 aa, chain - ## HITS:1 COG:FN0114 KEGG:ns NR:ns ## COG: FN0114 COG0576 # Protein_GI_number: 19703462 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Molecular chaperone GrpE (heat shock protein) # Organism: Fusobacterium nucleatum # 1 201 1 199 199 255 98.0 3e-68 MKDKEIKEEVLKEEINKEVNEKKKCECEEGKEEAHEHEHKNDEHACCGKHNHKEEIEKLK AEIEEWKNSFLRKQADFQNFTKRKEKEVDELKKFASEKIITQFLGSLDNFERAIESSSES KDFDSLLQGVEMIVRNLKDIMSSEDVEEIPTEGAFNPEYHHAVGVEASEDKKEDEIVKVL QKGYMMKGKVIRPAMVIVCKK >gi|296153861|gb|ADVK01000049.1| GENE 39 41245 - 42267 1535 340 aa, chain - ## HITS:1 COG:FN0113 KEGG:ns NR:ns ## COG: FN0113 COG1420 # Protein_GI_number: 19703461 # Func_class: K Transcription # Function: Transcriptional regulator of heat shock gene # Organism: Fusobacterium nucleatum # 1 340 12 351 351 578 100.0 1e-165 MGISEREKLVLNAIVDYYLTVGDTIGSRTLVKKYGIELSSATIRNVMADLEDMGFIEKTH 
TSSGRIPTDMGYKYYLTELLKVEKITQEEIENISNVYNRRVDELENILKKTSTLLSKLTN YAGIAVEPKPDNKKVSRVELVYIDEYLVMAIIVMDDRRVKTKNIHLAYPISKEEVEKKVD ELNAKIRNNEIAINDIEKFFTESTDIVYEYDDEDELSKYFINNLPSMLKNENIAEVTDVI EFFNERKDIRELFEKLIEQKAQENSKSNVNVILGDELGIKELEDFSFVYSIYDIGGAQGI IGVMGPKRMAYSKTMGLINHVSREVNKLINSMEKDKNKKV >gi|296153861|gb|ADVK01000049.1| GENE 40 42450 - 42752 393 100 aa, chain + ## HITS:1 COG:no KEGG:FN0111 NR:ns ## KEGG: FN0111 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 100 1 91 91 113 91.0 3e-24 MKQKERIEKMEKILYSSSKLLEELEKILNKIEKDSKNYDELIKYYYSKNWSKDKEDFEKD LFPDVESAYVLTEDGIYDTMTSSTGLAIHMLELATKMLKR >gi|296153861|gb|ADVK01000049.1| GENE 41 42849 - 44114 1748 421 aa, chain + ## HITS:1 COG:FN0110 KEGG:ns NR:ns ## COG: FN0110 COG0172 # Protein_GI_number: 19703458 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Seryl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 421 4 424 424 835 99.0 0 MLELKFMRENVEMLKEMLKNRNSNVDMDAFVELDSKRREVLSEVENLKRERNNASAEIAN LKKEKKNADHIIEKMGEVSTKIKDLDAELVEIDEKIKDIQLNIPNVYHPSTPIGPDEDYN LEIRKWGIPKKCDFEPKSHWDIGEDLGILDFERGAKLSGSRFVLYRGAAARLERAIINFM LDVHTLEEGYTEHITPFMVKAEVCEGTGQLPKFEEDMYKTTDDMYLISTSEITMTNIHRK EILEQAELPKYYTAYSPCFRREAGSYGKDVKGLIRLHQFNKVEMVKITDAESSYDELEKM VNNAETILQRLELPYRVIQLCSGDLGFSAAKTYDLEVWLPSQNKYREISSCSNCEAFQAR RMGLKYRVPNGSEFCHTLNGSGLAVGRTLVAIMENYQQEDGSFLVPKVLIPYMGGVDVIK K >gi|296153861|gb|ADVK01000049.1| GENE 42 44098 - 44574 216 158 aa, chain + ## HITS:1 COG:no KEGG:FN0109 NR:ns ## KEGG: FN0109 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 158 1 158 158 207 98.0 8e-53 MLLKSSLFILLLVNIFISNLIILSAILVVVLILNLTLNKNLKKHSKQLKVLLFFYLSTFL IQLYYGQQGKVLFKFYSFYITQEGLINFGVSFIRILNLVLMSWLINEMKLLTGRFSKYQK IIDTVIDLVPKVFVLFKKRMKAKNFTRYILKDISKRYE >gi|296153861|gb|ADVK01000049.1| GENE 43 44683 - 44778 77 31 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MCLSKVILEAINELVHLELLTDTEFAANVNF >gi|296153861|gb|ADVK01000049.1| GENE 44 44775 - 
44861 56 28 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MIKKVLKGCCKLMNGIKNSSLLAKFLNI >gi|296153861|gb|ADVK01000049.1| GENE 45 44854 - 45855 1246 333 aa, chain - ## HITS:1 COG:FN0108 KEGG:ns NR:ns ## COG: FN0108 COG1619 # Protein_GI_number: 19703456 # Func_class: V Defense mechanisms # Function: Uncharacterized proteins, homologs of microcin C7 resistance protein MccF # Organism: Fusobacterium nucleatum # 1 333 6 338 338 598 94.0 1e-171 MEKKVIGVYAPANAAHIWFEEKYLFAKKQLENIGFKIVEGNLVKDKTYQGYRTASAKERA EEMMNLVKNKDIDIMMPVIGGYNSGSLLPYLDFGEIEKSKKKFFGYSDITAIQMAILKKT DLKPIYGGSLIPTFGEYEGISSFLKNTLDNLFFKKSYSLEEPEFYSNKLLNAFTDEWKTK KREYTKNEGWKILNEGEIEGEVIVANIATLVSILASEYVPTFRNKILILEEMNATIDSEE KNLNTLKISGVFEGVKGLIFGKPEVYDDRNSNLEYIDIIKEVLGKRDYPIIYNFDCGHTI PSLMISQDSLLSLKASDNEGVKVEILKNSFIDD >gi|296153861|gb|ADVK01000049.1| GENE 46 45930 - 47378 2213 482 aa, chain - ## HITS:1 COG:FN0107 KEGG:ns NR:ns ## COG: FN0107 COG0591 # Protein_GI_number: 19703455 # Func_class: E Amino acid transport and metabolism; R General function prediction only # Function: Na+/proline symporter # Organism: Fusobacterium nucleatum # 1 482 1 482 482 826 97.0 0 MASYEIFITFGIYLIFLILIGVYFYSKTTTHESYVLGNRGVGYWVTAMSAQASDMSGWLL LGLPGAVYLSGLTEIWIVIGLATGTYLNWKFVAPALRIQTEKYNSLTIPSFISQKLNDNK GYIRTFSAIVILFFFTIYSASGLVAGGKLFDSLLGIDYKWGVLIGGGTIIIYTFLGGYLA CCWTDFFQGCLMFFAIIVVPVVAYFNGGGIDGISTAMDVKNISLNIFKYAKVLSLPVIIS GLGWGLGYFGQPHIIVRFMSIDSADELWKSRLIAMIWVFISLLGAIAVGVTGIGVFTDVS QMGGDAEKVFIFLIHKLFNPWIAGILFAAILSAIMSTISSQLLVSSNTLTEDFYKYIVKR EKSHKEMIWVGRLCVIVIFIIASFLAMNPSSKVLELVSYAWAGFGGVFSPVILFTLYKKD LHWKTVLTSMIIATITVITWKTSGLGNVIYEIVPSFVINCISIYLLEKFRVFEDKKVKVL VK >gi|296153861|gb|ADVK01000049.1| GENE 47 47708 - 48130 626 140 aa, chain - ## HITS:1 COG:no KEGG:FN0106 NR:ns ## KEGG: FN0106 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 140 1 140 140 249 100.0 4e-65 MEAKKEFLRMINECEEIALATSIHDFPNVRIVNYYYDEKNNVMYFATYTGREKISEFWKN NNISFTTIPVNRGKREHIRARGHVRESGKSILDLREEFSNKMADFAEIIDKYSKDLKVYE 
IKFSEVTVTLDSRYYEKVSL >gi|296153861|gb|ADVK01000049.1| GENE 48 48269 - 48883 645 204 aa, chain - ## HITS:1 COG:FN0105 KEGG:ns NR:ns ## COG: FN0105 COG4832 # Protein_GI_number: 19703453 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 204 1 204 204 308 96.0 4e-84 MKYEWKKEEKKIYGTKEKAEILKIPKQQFIMIDGIDNPNETNFSNKVSALYSLAYSIKIL YKNMMKEAKETKIKDFTVYPLEGIWKKLEEEKLDKSKLKYTIMIKQPDFITQKNFNDALE VVKKRKPSILYDEIYFNSLEEGKSIQILHVGSYDNEPISFGKMEELMNKLNLVRVSDYHR EIYLNNKNRTSTDKLKTILRYSIK >gi|296153861|gb|ADVK01000049.1| GENE 49 49051 - 49512 222 153 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|50365462|ref|YP_053887.1| acetyltransferase of 30S ribosomal protein L7 [Mesoplasma florum L1] # 1 153 23 167 170 90 37 2e-17 FIKYKEEALFEKAIINSSTSLEKFSSVEGWLEELKIKSSEDTVPEGLVPSTTYLGVRKID NYVVGMVDIRYCLNEFLTQVGGNIGYDVRKSERNKGYAKQILKFALEKCKDLKMKKVLIT CDEDNIASEKVILSAEAKFENIKSFEGKNKKRF >gi|296153861|gb|ADVK01000049.1| GENE 50 49565 - 50611 999 348 aa, chain - ## HITS:1 COG:FN0103 KEGG:ns NR:ns ## COG: FN0103 COG0208 # Protein_GI_number: 19703451 # Func_class: F Nucleotide transport and metabolism # Function: Ribonucleotide reductase, beta subunit # Organism: Fusobacterium nucleatum # 1 348 1 348 348 662 99.0 0 MKAAVDRKKLFNPEGDDTLNARKIIKGNSTNLFNLNNVRYQWANQLYRTMMANFWIPEKV DLTQDKNDYENLTVPEREAYDGILSFLIFLDSIQTNNIPNISDHVTAPEVNLLLAIQTFQ EAIHSQSYQYIIESILPKQSRDLIYDKWRDDKVLFERNSFIAKIYQDFIDEQSDENFAKI IIANYLLESLYFYNGFNFFYLLASRNKMVGTSDIIRLINRDELSHVVLFRSMVKEIKNDF PNFFSAETIYSMFKTAVEQEITWTEHIIGNRVLGITSQTTEAYTKWLANERLKSLGLEPL FSGFNKNPYKHLERFADTEGEGNVKSNFFEGTVTSYNMSSSIDGWEDF >gi|296153861|gb|ADVK01000049.1| GENE 51 50592 - 52853 2750 753 aa, chain - ## HITS:1 COG:FN0102 KEGG:ns NR:ns ## COG: FN0102 COG0209 # Protein_GI_number: 19703450 # Func_class: F Nucleotide transport and metabolism # Function: Ribonucleotide reductase, alpha subunit # Organism: Fusobacterium nucleatum # 1 753 3 755 755 1476 99.0 0 
MERRKVINRDNIVEDLNIEKIREKLLRACDGLEVNMVELESNIDSIYEENITTQKIQASL INTAVTMTTFEESDWAYVAGRLLMMEAEREVYHSRGFSYGDFSKTIRKMTELELYDERLL SYTEEELHQIAQLIDINRDMVYDYAGANMFVNRYLIKYDGKTYELPQETFMAISMMLALN EKEGETRVEIVKEFYNALSLRKLSLATPILANLRIPDGNLSSCFITAIDDNIESIFYNID SIARISKNGGGVGVNVSRIRAKGSMVNGYYNASGGVVPWIRIINDTAVAVNQQGRRAGAV TVALDTWHLDIETFLELQTENGDQRGKAYDIYPQVVCSNLFMKRVKNNETWTLLDPYEIR KKYGIELCELYGYEFENLYEKIENDPNIKLKKVLSAKELFKSIMKTQLETGMPYIFFKDR ANEVNHNSHMGMIGNGNLCMESFSNFKPTINFVEEEDGNTSIRKSEMGEIHTCNLISINL AELTSEELEKHVALAVRALDNTIDLTVTPLKESNKHNLLYRTIGVGAMGLADYLAREYMI YEESINEINEIFERIALYSIKASALLAKDRGAYKAFKGSKWDQGIFYGKKREWYDINSKF KDEWNEVFYLVETNGLRNGELTAIAPNTSTSLLMGSTASVTPTFSRFFIEKNQRGAIPRT VKHLKDRAWFYPEFKNVNPISYVKIMAKIGSWVTQGVSMEMVFDLNKDIKAKDIYDTLMT AWEEGCKSVYYIRTIQKNTNNISDKEECESCSG >gi|296153861|gb|ADVK01000049.1| GENE 52 52859 - 53062 303 67 aa, chain - ## HITS:1 COG:no KEGG:FN0101 NR:ns ## KEGG: FN0101 # Name: not_defined # Def: glutaredoxin # Organism: F.nucleatum # Pathway: not_defined # 1 67 1 67 67 113 98.0 2e-24 MIKVYGKENCSKCISLKGILTDRNIEFEYIEDMKTLMIVASKARIMSAPVIEYNDNVYTM EAFLKVI >gi|296153861|gb|ADVK01000049.1| GENE 53 53352 - 54023 797 223 aa, chain + ## HITS:1 COG:FN0100 KEGG:ns NR:ns ## COG: FN0100 COG1018 # Protein_GI_number: 19703448 # Func_class: C Energy production and conversion # Function: Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 # Organism: Fusobacterium nucleatum # 9 223 1 215 215 392 98.0 1e-109 MKKLYDLSLIERNDVAENTIELTFTKPSDYDFKIGQYIFLDVANKRENKITRALSIASHP DEDILRFVMRISDSDFKTRCLEMKKGDNATITQATGNFGFKFSDKEIVFLISGIGIAPII PMLMELEKINYQGKVSLFYSNRTLAKTTYHERLQNFNIKNYNYNPVFTGIQPRINIDLLK EKLNDIYNSNYYIIGTSDFIKTMKTLLEENYIDKKNYLVDNFG >gi|296153861|gb|ADVK01000049.1| GENE 54 54057 - 54410 443 117 aa, chain - ## HITS:1 COG:FN0099 KEGG:ns NR:ns ## COG: FN0099 COG0221 # Protein_GI_number: 19703447 # Func_class: C Energy production and conversion # Function: Inorganic pyrophosphatase # Organism: Fusobacterium nucleatum # 1 117 1 117 117 203 91.0 5e-53 
MLKDIEKYRFYLNKEVLVKVDRKLGEKHPNFDFIYPVNYGYIPNTLSEDGEEIDVYILGI FYPIDEFQGICKAVICRYDNNENKLIVVPKDKNYSIEQMEALLEFQERFFKHKIIIE Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:58:36 2011 Seq name: gi|296153825|gb|ADVK01000050.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00062, whole genome shotgun sequence Length of sequence - 35255 bp Number of predicted genes - 35, with homology - 35 Number of transcription units - 12, operones - 9 average op.length - 3.6 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 3 - 1368 2484 ## COG5295 Autotransporter adhesin - Prom 1415 - 1474 9.6 + Prom 1327 - 1386 9.2 2 2 Op 1 6/0.000 + CDS 1423 - 1641 143 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases + Term 1665 - 1695 -0.4 + Prom 1700 - 1759 9.6 3 2 Op 2 . + CDS 1782 - 2333 351 ## PROTEIN SUPPORTED gi|228004009|ref|ZP_04050993.1| acetyltransferase, ribosomal protein N-acetylase 4 2 Op 3 . + CDS 2349 - 3884 1728 ## FN1293 hypothetical protein 5 2 Op 4 . + CDS 3917 - 5407 1572 ## FN1292 hypothetical protein 6 2 Op 5 . + CDS 5394 - 6779 1458 ## FN1291 hypothetical protein + Prom 6797 - 6856 7.4 7 3 Op 1 . + CDS 6909 - 8405 1508 ## FN1290 hypothetical protein 8 3 Op 2 . + CDS 8402 - 9994 1505 ## FN1289 hypothetical protein 9 3 Op 3 . + CDS 10063 - 10311 269 ## PROTEIN SUPPORTED gi|15610598|ref|NP_217979.1| translation initiation factor IF-1 10 3 Op 4 . 
+ CDS 10331 - 10444 200 ## PROTEIN SUPPORTED gi|197735973|ref|YP_002164751.1| hypothetical protein FNP_0496 + Prom 10573 - 10632 8.0 11 4 Op 1 48/0.000 + CDS 10652 - 11008 591 ## PROTEIN SUPPORTED gi|19704621|ref|NP_604183.1| 30S ribosomal protein S13 12 4 Op 2 36/0.000 + CDS 11054 - 11443 659 ## PROTEIN SUPPORTED gi|19704620|ref|NP_604182.1| 30S ribosomal protein S11 13 4 Op 3 26/0.000 + CDS 11483 - 12070 979 ## PROTEIN SUPPORTED gi|19704619|ref|NP_604181.1| 30S ribosomal protein S4 14 4 Op 4 50/0.000 + CDS 12099 - 13079 1368 ## COG0202 DNA-directed RNA polymerase, alpha subunit/40 kD subunit 15 4 Op 5 . + CDS 13107 - 13457 574 ## PROTEIN SUPPORTED gi|19704617|ref|NP_604179.1| 50S ribosomal protein L17P + Term 13478 - 13524 6.8 - Term 13472 - 13505 1.0 16 5 Op 1 1/0.800 - CDS 13514 - 14476 1577 ## COG4870 Cysteine protease 17 5 Op 2 . - CDS 14495 - 16234 1848 ## COG0265 Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain - Prom 16278 - 16337 18.5 + Prom 16241 - 16300 12.7 18 6 Op 1 1/0.800 + CDS 16456 - 17436 1377 ## COG0491 Zn-dependent hydrolases, including glyoxylases 19 6 Op 2 1/0.800 + CDS 17519 - 17944 508 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases 20 6 Op 3 . + CDS 17972 - 19432 2025 ## COG2195 Di- and tripeptidases + Term 19433 - 19486 9.8 - Term 19422 - 19472 9.1 21 7 Op 1 . - CDS 19476 - 19889 527 ## FN1276 hypothetical protein 22 7 Op 2 27/0.000 - CDS 19889 - 22951 3718 ## COG0841 Cation/multidrug efflux pump 23 7 Op 3 13/0.000 - CDS 22951 - 24063 1521 ## COG0845 Membrane-fusion protein 24 7 Op 4 . - CDS 24110 - 25381 1543 ## COG1538 Outer membrane protein 25 7 Op 5 . 
- CDS 25403 - 26038 515 ## FN1272 TetR family transcriptional regulator - Prom 26113 - 26172 8.6 + Prom 26137 - 26196 12.6 26 8 Op 1 1/0.800 + CDS 26245 - 27981 2041 ## COG0616 Periplasmic serine proteases (ClpP class) 27 8 Op 2 1/0.800 + CDS 28050 - 28400 179 ## PROTEIN SUPPORTED gi|149916415|ref|ZP_01904934.1| 30S ribosomal protein S21 + Term 28440 - 28482 1.1 + Prom 28407 - 28466 5.2 28 9 Op 1 1/0.800 + CDS 28490 - 29113 573 ## COG2121 Uncharacterized protein conserved in bacteria 29 9 Op 2 2/0.000 + CDS 29133 - 31043 2611 ## COG0143 Methionyl-tRNA synthetase 30 9 Op 3 2/0.000 + CDS 31055 - 31672 881 ## COG0457 FOG: TPR repeat 31 9 Op 4 . + CDS 31684 - 32568 1174 ## COG1210 UDP-glucose pyrophosphorylase + Term 32575 - 32621 1.1 - Term 32561 - 32607 1.1 32 10 Op 1 . - CDS 32737 - 33363 1062 ## COG2885 Outer membrane protein and related peptidoglycan-associated (lipo)proteins - Prom 33441 - 33500 8.1 33 10 Op 2 . - CDS 33536 - 34009 518 ## FN1264 hypothetical protein - Prom 34048 - 34107 9.3 + Prom 34032 - 34091 11.3 34 11 Tu 1 . + CDS 34177 - 34998 1212 ## COG4822 Cobalamin biosynthesis protein CbiK, Co2+ chelatase + Term 35026 - 35086 7.3 - Term 35015 - 35069 9.5 35 12 Tu 1 . 
- CDS 35071 - 35253 163 ## COG1807 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family Predicted protein(s) >gi|296153825|gb|ADVK01000050.1| GENE 1 3 - 1368 2484 455 aa, chain - ## HITS:1 COG:FN0735 KEGG:ns NR:ns ## COG: FN0735 COG5295 # Protein_GI_number: 19704070 # Func_class: U Intracellular trafficking, secretion, and vesicular transport; W Extracellular structures # Function: Autotransporter adhesin # Organism: Fusobacterium nucleatum # 1 455 16 439 617 200 44.0 4e-51 MKKYADLKLIVFSILFVTASITYSAPPVFQAGAGADSTEAGVNNTASGEYSSAVGAGNQA SGVASSAVGANNKAKGEISSAVGYMNTASGFASSAFGDENKASGFASSAVGHTNTASGEY SSAFGANNEAKGKNSSTLGFENKADGEYSSAFGVENTASGVGSSAVGAGNQASEEYSSAF GYMNEAYGVASSAVGNANIANKKNSSALGAGNEASGVASSAVGFKNKADGENSSAFGVEN KASGVASSALGFKNNASGAASSALGYYNKASGVASSALGTGNEVNGDYSSAVGVDNAASG KSSSALGAGNVASKENSSAVGVDNAASGVGSSALGAGNEVNGDYSTAVGYKNKVSGNHSG AFGDPNVVTGNRSYAFGNDNTINGDNNFVLGNNVTIRAGIQNSVALGNGSTVSSSNEVSV GSRGKERKITNVADGEVSATSTDAVNGRQLYNAMQ >gi|296153825|gb|ADVK01000050.1| GENE 2 1423 - 1641 143 72 aa, chain + ## HITS:1 COG:FN1295 KEGG:ns NR:ns ## COG: FN1295 COG0454 # Protein_GI_number: 19704630 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 21 68 84 131 135 79 89.0 1e-15 MYQKIERLTTNINQIIKIYFLNNNENLYIRVNSSRYGVPIYEKLGFVKMEEEKERDGLKF TPMKLVLKDEMK >gi|296153825|gb|ADVK01000050.1| GENE 3 1782 - 2333 351 183 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|228004009|ref|ZP_04050993.1| acetyltransferase, ribosomal protein N-acetylase [Anaerococcus prevotii DSM 20548] # 4 180 2 176 280 139 42 2e-32 MNTEIDISNVKLETERLILRAWEITDLDNFFEYASINGVGEKAGWEHHKSKDESLEILKM FIDEKKVFAIVLKENQKAIGSIGIEECRQDLDKNLENLLGRELGYVLSKDYWNKGIMTEA VSKVIEYCFKILKLNYLIATCFNYNIASKRVLEKLNFKFYKDIIIKTKYNTVEKSTLMIL KNN >gi|296153825|gb|ADVK01000050.1| GENE 4 2349 - 3884 1728 511 aa, chain + ## HITS:1 COG:no KEGG:FN1293 NR:ns ## KEGG: FN1293 # Name: not_defined # 
Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 119 511 1 393 393 677 95.0 0 MRGNNQKNSNIMIKTCIFMSLIIFLLCSIVILCIAFSSDDTYEIEKNGERYGKSEFYKYK DKIYVLVIGSGMLEVEGVDIPTFKVFDKDKEDERENVGFDKNKIYFGNIAVSDLDTDKLY YVGNNYYSDSTNSYFCSTSPKFNEELSAGTAIIQNISHFFFKTRKPQNYFYPYKKLETNK RLKKFEEIRNFATNGEEIYYAGEKLANANINTIKKIEEGLFYFVDKENVYYKSKLLSFKN NEKLKVFHENDYNVYYLYDEESGNVYADDYLFDTANAPYKVIGIDGTHHFSLLFISKDGV YFYEPFRKKQERIGDNIFKGEIKEIYPDIFSDDENVYYLDVYEDWAKKRVNNYFSLRKRP LNGELISRNTRIHYLDKKTAWENDWEKVADIYSNTNGSIWKKGNKYYYFDIYGFGQSIHK PIYEITDKEVLDYLLNFSKLKDKDIINLPDKIRDFISEGKLIAFNGEVKMTATIHFIEDP YAYDIPKIIFISIAFLIGLYGKYRKSKFSKK >gi|296153825|gb|ADVK01000050.1| GENE 5 3917 - 5407 1572 496 aa, chain + ## HITS:1 COG:no KEGG:FN1292 NR:ns ## KEGG: FN1292 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 44 496 1 453 453 702 91.0 0 MKTNDLDDLFKKKKTSNTLFYFKIAIIIFAIFILFFIPFTISNMISSDSKSYEIKTNGEQ YGTSNFFKYQEKIYVFTLNDGMQALENVDIETFKTLNSGDYYTKNIGLDKNSVYFENIII PDLDPNKFEVIGNGYYTDGTNSYFYSPFSELDKDSSKYIYPYKKIENAKNLKAYENLELF AVDGDNVYYKGEILKNADLNTLKIIDKNNEYFADKENVYYRSKLLAIRNSGKLKIVSSEQ GDEFLYDEASGYVFIGDYTFDKEKAPYKVIGNNGTTLYNLIFIAKDGIYYYDSEKKKQLK AGDNIFIGNIEEISPNIFTDDENIYYFSAHSVRSGSRKNLGELLSRNTDICYLDKKDGWK KVKDIRESSIGSIWKKGNKYYYFNNLGIFHFTDNTIYEISDKETLDYLLAKADDETDDIK SEGLTAINTDYIRELIKNEKLIAVSGEKKMTITVKYETDIVDKIFKYSIRIFLVVYFIFI IFKNFRKSRRISNENK >gi|296153825|gb|ADVK01000050.1| GENE 6 5394 - 6779 1458 461 aa, chain + ## HITS:1 COG:no KEGG:FN1291 NR:ns ## KEGG: FN1291 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 84 461 1 378 411 632 95.0 1e-180 MRINDFDENLKFKKKRTSNFSFIIKIVLIIFAMFMLFFIFFINFSPNEKDVNSFEIETNG EKYGKSEFIKYNGKVYVLVWGNGMYTLNNVDIDTFRAISSEDFYSKVVGLDKNHVYFGNI AIPDLDPNKFYIIGNGYYSDGTNTYYCSPVSERNKDLSIISELFQITINVFSKNKKPQTY IYPYKKVETDKKLRAVKNMFSFATDGKKVYYKGEILENADLNTLKSVDGYTEYFVDKENV YYQSKLLPIKNSDKLKTVSTKQGNRVLYDEANGYVFIEDYSFNREKAPYKVLGNGGNHLY DLFFIAKDGIYFYNTQKKKQERIGDNIFSENIEELTPNVFTDDKNIYYFDTYDVWFKGKN 
TGHILTSKNTIIYYLDKKDNWEKVTDIRDGTVVGTIWKKGNDYYYFDEFYMKNTIYQIAD KETLDYLLNANNINHDNMVNLVENKKLIVVNGEEKIRATTE >gi|296153825|gb|ADVK01000050.1| GENE 7 6909 - 8405 1508 498 aa, chain + ## HITS:1 COG:no KEGG:FN1290 NR:ns ## KEGG: FN1290 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 159 498 1 340 340 583 99.0 1e-165 MRINDFDEEFNFKKKRSSNTLFIIKIVFIILAIFAILSSVLFLSKMGSSDSYEMEEKGER YGNSEFIKYQGKISVPVPSGGRYFLNGVDINSFRVLNLGDRDTRIIGLDKNHVYFGNIAI SDLDPNKLEVIGNGYYSDGTTTYFCSPLSERNENLSTSMEIFQFLIYSFSKTKKPQTYIY PYKKVETNNKLIAVKDLYYFATDGEKVYYKGEILENADLNTLKSVDGYNEYFADKENVYY KSKLLPIKNSGKLMVISAEQGDKFLYDEANGYVFIEDYSFDREKAPYKVIGNNGNHLYNL AFVNNEGIYYYDDQEKKQLKAGDNIFIGNIEEISPNIFTDDKNIYYFHAYDVWKNYKNGG SVLFSRNTEIYYLDKKDGWEKVKDIRSGIIGTIWKRGNRYYYFDNLGMFQLINNTIYEIR DKETLEYLLLNGDEIGSSHSIGKFVENGKLIAINGEKKGEIVVKYKSARITMAKYSKIFL VIIVIVSVIIKIIRGIRE >gi|296153825|gb|ADVK01000050.1| GENE 8 8402 - 9994 1505 530 aa, chain + ## HITS:1 COG:no KEGG:FN1289 NR:ns ## KEGG: FN1289 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 529 1 529 571 728 86.0 0 MKINGEDLSISKEKNSKKTLLKIIITFLLIIIIFTLYEFFFIFKIKSDYDFNQEILNNGQ KYEKSIYIKYKDKIYACVYEDAYQLDDVDIGSFKVLDSMDYSDSYVAVDKNNVYFGNISI PDLKPDKLYTVGNNYYSDGINSYFCLNTFKKNEDLANKSKIRQYIEYYFFEGEKPQEYSY PFKKVETNKSLKAVKNLSYFATDGEKVYYQGEVLENADLNTLRAVDRYKEYFADKENVYY KSKLLALSSNEKLKVVRADQEGEDYLYDGLNGNVFLEEYAFDKKYLPYQVLGEKSNHIRD LLFVNKDGIFFYNPETKEQERVRDNIFIGEIEEINPSVISDDKNIYYLHSYNVYKKKKTK YGYMDVLVSKNIGIFYLDEKKNWEKIKDIASGTTGQVWKKGSRYYYFDNLGVHQLIDDVI YEIKDNRTLEKLLDTKYISTDEIREFVRSKKIIAVKGEEVTTASVKYEESHIAEIFLIVF FATIIAITALILYLKWRNMKLEMKKIDEEIKKKNKKIEPLIRYYSDKEEK >gi|296153825|gb|ADVK01000050.1| GENE 9 10063 - 10311 269 82 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15610598|ref|NP_217979.1| translation initiation factor IF-1 [Mycobacterium tuberculosis H37Rv] # 10 81 1 73 73 108 71 6e-23 MNFCSMGGKMSKKDVIELEGTIVEALPNAMFKVELENGHTILGHISGKMRMNYIKILPGD GVTVQISPYDLSRGRIVYRKKN >gi|296153825|gb|ADVK01000050.1| 
GENE 10 10331 - 10444 200 37 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|197735973|ref|YP_002164751.1| hypothetical protein FNP_0496 [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] # 1 37 1 37 37 81 100 6e-15 MKVRVSIKPICDKCKIIKRHGKIRVICENPKHKQVQG >gi|296153825|gb|ADVK01000050.1| GENE 11 10652 - 11008 591 118 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704621|ref|NP_604183.1| 30S ribosomal protein S13 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 118 1 118 118 232 99 3e-60 MARIAGVDIPRNKRVEIALTYIYGIGKPTSQKILKEAGINFDTRVKDLTEEEINKIREII KDIKVEGDLRKEVRLSVKRLMDIKCYRGLRHKMNLPVRGQSSKTNARTVKGPKKPIRK >gi|296153825|gb|ADVK01000050.1| GENE 12 11054 - 11443 659 129 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704620|ref|NP_604182.1| 30S ribosomal protein S11 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 129 1 129 129 258 99 4e-68 MAKKTVAKIKKKSKNIPNGVAHIHSTFNNTIVAITDVDGKVVSWKSGGTSGFKGTKKGTP FAAQIAAEQAAQIAMENGMRKVEVKVKGPGSGREACIRSLQAAGLEVTKITDVTPVPHNG CRPPKRRRV >gi|296153825|gb|ADVK01000050.1| GENE 13 11483 - 12070 979 195 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704619|ref|NP_604181.1| 30S ribosomal protein S4 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 195 1 195 195 381 100 1e-105 MARNRQPVLKKCRALGIDPVILGVKKSSNRQIRPNANKKPTEYATQLREKQKAKFIYNVM EKQFRKIYEEAARKLGVTGLTLIEYLERRLENVVYRLGFAKTRRQARQIVSHGHIAVNGR RVNIASFRVKVGDIVSVIENSKNVELIKLAVEDATPPAWLELDRAAFSGKVLQNPTKDDL DFDLNESLIVEFYSR >gi|296153825|gb|ADVK01000050.1| GENE 14 12099 - 13079 1368 326 aa, chain + ## HITS:1 COG:FN1283 KEGG:ns NR:ns ## COG: FN1283 COG0202 # Protein_GI_number: 19704618 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, alpha subunit/40 kD subunit # Organism: Fusobacterium nucleatum # 1 326 17 342 342 577 99.0 1e-164 MLKIEKQAKAIKITEVKESNYKGQFIVEPLYRGYGNTLGNALRRVLLSSIPGAAIKGMRI EGVLSEFTVMDGVKEAVTEIILNVKEIVVKAESSGERRMSLSIKGPKVVKAADIVADIGL EIVNPEQVICTVTTDRALDIEFIVDTGEGFVVSEEIDKKDWPVDYIAVDAIYTPIRKVSY EIQDTMFGRMTDFDKLTLNVETDGSIEIRDALSYAVELLKLHLDPFLEIGNKMENLRDDI EEMIEEPMDIQVIDDKSHDMKIEELDLTVRSFNCLKKAGIEEVSQLASLSLNELLKIKNL GKKSLDEILEKMKDLGYDLEKNGSPE >gi|296153825|gb|ADVK01000050.1| GENE 15 13107 - 13457 574 116 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704617|ref|NP_604179.1| 50S ribosomal protein L17P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 116 1 116 116 225 100 3e-58 MNHNKSYRKLGRRADHRKAMLKNMTISLIKAERIETTVTRAKELRKFAERMITFGKKNTL ASRRNAFAFLRDEEVVAKIFNEIAPKYAERNGGYTRIIKTSVRKGDSAEMAIIELV >gi|296153825|gb|ADVK01000050.1| GENE 16 13514 - 14476 1577 320 aa, chain - ## HITS:1 COG:FN1281 KEGG:ns NR:ns ## COG: FN1281 COG4870 # Protein_GI_number: 19704616 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Cysteine protease # Organism: Fusobacterium nucleatum # 1 320 1 320 320 664 100.0 0 MLTMYQAYTKRRFSKGEKEGKIIETGWLPPLPDMRDYGSDHPKILKLAEKLGIETDKKEE KLPKSVDLREWCSPVEDQLTLGSCTANAAVGIVEYFQRRAHGIHIEGSRLFIYKATRKLM MTKGDSGAWLRSTMGALVLFGVPDEKYFPYTLDGIHINPSWDEEPDSFLYSMAKNYATLQ YFCHDPHGKKQTKNEILNSVKKYLAAGIPAMFGFYGFSSFEASNSLGCIPYPGNDEQANW GHSVVAIGYDDKKKIKNTRYGIETTGALLIRNSWGKSWGEEGYGWLPYDYILNGLAEDFW SIISMDWVDTNQFGLNKNSH >gi|296153825|gb|ADVK01000050.1| GENE 17 14495 - 16234 1848 579 aa, chain - ## HITS:1 COG:FN1280_1 KEGG:ns NR:ns ## COG: FN1280_1 COG0265 # Protein_GI_number: 19704615 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain # Organism: Fusobacterium nucleatum # 1 317 1 317 317 609 99.0 1e-174 MLKGFNSEDISYEILEKKKENIEQQALERFLEEKSEKENKHIPKEDMLFNSKDSIAMERI VGKNDLFPISYLQIGLNISKSVCRISIRDSRGVVVGYGTGFLVAPNIILTNNHVINSYEV ASNSIAEFNYQDDENFMPCPTYNFRLNPQTFFITDVKLDFTLVALNENVTNQKHLEDFGY LKMTQKEGSILPEEYVSIIQHPKGGPKSVTLRENKVSGLKENFIHYLTDTEPGSSGSPVF NDQWTLVALHHSGVPNPEIKDEWIANEGILISAIVNYLAKKYSSLKENEQAIIKEIVPDI ELPKENNITSSVDDEPLGYNPLFLGKDYEIPLPKLSKEMEKDTAKTENGNYVLDYIHFSI VMKKSRGLAYFTAVNIDGTDAVKIRRTADNWKFDPRISQNYQYGDEVYVNNDLDRGHLVR RTDPNWGKNALKANEDTFYFTNSTPQHKNLNQKTWVELEDYIFRNAVLNQFKVSVFTGPV FREDDMIYRQKYQIPAEFWKVVVMLKEDGNISATAYLQTQKNMIENLEFAYGEYKTYQVP VRNIEKLTGLDFGNLSKFDPMANIEATGIVITGPESIKF >gi|296153825|gb|ADVK01000050.1| GENE 18 16456 - 17436 1377 326 aa, chain + ## HITS:1 COG:FN1279 KEGG:ns NR:ns ## COG: FN1279 COG0491 # Protein_GI_number: 19704614 
# Func_class: R General function prediction only # Function: Zn-dependent hydrolases, including glyoxylases # Organism: Fusobacterium nucleatum # 1 326 1 326 326 650 99.0 0 MLNEIAKNIYLVEVPLPKNPLRALNCYFIKNGENILVVDSGFDHEESEKAFFEALEELGA KVGKTDMFLTHLHADHSGLALKFKNKYQGKVYCSQIDTDYINQMKHELYADRFVPTLKVM GIDPDFKFFETHPGLVYCVKGKLETTIVKDGDKIDFGDYHFEVVDLSGHTPGQMGIYDRE HKILFSGDHILNKITPNISFWEFKYEDILGIYLKNLDKVYNMEVDIIYAAHRGVIENPKL RIEELQKHYADRNAEVYSLLKEGEKFSAVQIAAKMHWDYRAKNFEEFPNNQKWFATGEAL ANLEHLRAIGKANYEFIDGIAYYRAI >gi|296153825|gb|ADVK01000050.1| GENE 19 17519 - 17944 508 141 aa, chain + ## HITS:1 COG:FN1278 KEGG:ns NR:ns ## COG: FN1278 COG0454 # Protein_GI_number: 19704613 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 12 141 1 130 130 231 100.0 2e-61 MIKEFDSSQKMMEDIIYIDSKSFKDIDVNVDELCKRIKKNKQYQLFIKYSENIPVAYLGI LYMSNLHYDGAWIDLVAVVEEHRNKGIGKELLKFAENMVKEKKGTVLTGLVRKDNVSSST MFLNSNFESSEKDFILYSKNI >gi|296153825|gb|ADVK01000050.1| GENE 20 17972 - 19432 2025 486 aa, chain + ## HITS:1 COG:FN1277 KEGG:ns NR:ns ## COG: FN1277 COG2195 # Protein_GI_number: 19704612 # Func_class: E Amino acid transport and metabolism # Function: Di- and tripeptidases # Organism: Fusobacterium nucleatum # 1 486 1 486 486 903 98.0 0 MSNKLVNLKPERVFYYFEELSKIPRESANEQAVSNFLVDTAKKLGLKVYQDKINNIVIKK PATKGYENSDGIILQGHMDMVCEKELDSNHNFKTDGIDLIVDGKFLRANKTTLGADNGIA VAMGLAILEDENIEHPEIELLVTVEEETTMRGALELEENILTGKMLINIDSEEEAWVTVG SAGGREIDITFNEEKEKFENANSDFYRLEVKNLCGGHSGAEIHKNRLNANKVMSEVISEI KKNFDMKLCDIKGGSKDNAIPRECYFDTAIDKSSSQNFMNKSKEIFENFKSKYIEQDPNI TFEISKLENKYSEIYSNDLFEKVLRILNDLPTGVNTWLKEYPDIVESSDNLAIIKIIDNK ITVILSLRSSEPSILDNLEEKIITIVKKYNASYEVSGGYPEWRFKPISRLRDTAVKTYQD LFNEKMQVTVIHAGLECGAISMHYPNLDMISIGPNIYDVHTPKERMEIASVEKYYKYLVE LLKKLK >gi|296153825|gb|ADVK01000050.1| GENE 21 19476 - 19889 527 137 aa, chain - ## HITS:1 COG:no KEGG:FN1276 NR:ns ## KEGG: FN1276 # Name: not_defined # Def: hypothetical 
protein # Organism: F.nucleatum # Pathway: not_defined # 7 137 1 131 131 229 100.0 2e-59 MNNIKNMNDTESENLKFVVLHINDTQVRLLEKFFEKIGIYYYTIEENVKRAIDKSIKHQQ TKVWPGSDALVTFPLGDKKVDEFLIKLKTFRMVLPKGLILSVAIIPLEKLIKSVYEEDIP IDEELMEELQNDKDYNI >gi|296153825|gb|ADVK01000050.1| GENE 22 19889 - 22951 3718 1020 aa, chain - ## HITS:1 COG:FN1275 KEGG:ns NR:ns ## COG: FN1275 COG0841 # Protein_GI_number: 19704610 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 1020 1 1020 1020 1783 99.0 0 MSLAGISIRRPVATTMVMVSFIFIGLLAMFSMKKELIPNIKIPVVTITTTWTGAVSENVE TQVTKKIKDSLSNVDAIDKIQTVSAYGVSNVVVNFDYGVDTDEKVTQIQREVSKIANNLP NDANTPLVRKIEVAGGNMTAIIAFNADSKTALTTFIKEQLKPRLESLPGIGQVDIFGNPD KQLQIQVNSDKLASYNLSPMELYNIVRTSVATYPIGKLSTGNKDMIIRFMGELDYIDQYK NILISSDGNTLRLKDVADIVLTTEDADNVGYLNGKESVVVLLQKSSDGDTITLNNAAFKV IEEMKPYMPAGTEYSIEMDASENINSSISNVSSSAIQGLVLATIILFIFLKSFRTTILIS LALPVAIIFTFAFLSMRGTTLNLISLMGLSIGVGMLTDNSVVVVDNIYRHITELNSPVME ASENATEEVTFSVIASALTTIVVFLPILFIPGLAREFFRDMSYAIIFSNLAAIIVAITMI PMLASRFLNRKSMKSEDGRLFKKVKTFYLKVINKAISHKGLTVLIMAVLFFFSIFVGPKL LKFEFMPKQDEGKYSLTAELQNGTDLNKAERIAKELEEIIKSDPHTQSYLMLVSTSSISV NANVGKKNTRDDSVFTIMNDIRNKTSKVLDARVSMTNQFSGGQTNKDVQFLLQGSNQDEI KKFGKQLLEKLQSYNGMVDISSTLDPGIIELRVNIDRDKIASYGISPTVVAQTISYYMLG GDKANTATLKTDTEEIDVLVRLPKDKRNDINILASLNIKVGDNKFVKLSDVATLQYAEGT SEIRKKNGIYTVTISGNDGGVGLGAIQSKIIEEFNNLNPPSTVSYSWGGQTENMQKTMSQ LSFALSISIFLIYALLASQFESFIMPIIIIGSIPLALIGVIWGLVILRQPIDIMVMIGVI LLAGVVVNNAIVLIDFIKTMRTRGYDKDYSIIYSCETRLRPILMTTMTTVFGMIPMALGL GEGSEFYRGMAITVIFGLSFSTILTLVLIPILYSIVDSFTSKLMTRIKVISRKSKKKEAK >gi|296153825|gb|ADVK01000050.1| GENE 23 22951 - 24063 1521 370 aa, chain - ## HITS:1 COG:FN1274 KEGG:ns NR:ns ## COG: FN1274 COG0845 # Protein_GI_number: 19704609 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Fusobacterium nucleatum # 1 370 1 370 370 601 98.0 1e-172 MKKILTIFLAASLLLVACGKDKENTKTKESKQETTTTEEQKVVKSVEVAAVTTREMSKLF 
ESSAVWEPLAKVDFSTDKGGTVKTIYKKNGEYVKKGEVIVKLSDAQTEADFLQAKANYQS ATSNYNIARNNYQKFKTLYDKQLISYLEFSNYQASYTSAQGNLEVAKATYMNAQNSYSKL GARAEISGVVGNLFIKEGNDIAAKEVLFTILNDKQMQSYVGITPEAISKVKLGNEINVKI DALAKEYKAKITELNPIADSTTKNFKVKLALDNPDGEIKDGMFGNVIIPVGESSVLSIED EAIVTRDLVNYVFKYEDGKAKQIEVQVGATNLPYTEISSPEIKEGDKIIVKGLFGLQNND SVEIKNGVNK >gi|296153825|gb|ADVK01000050.1| GENE 24 24110 - 25381 1543 423 aa, chain - ## HITS:1 COG:FN1273 KEGG:ns NR:ns ## COG: FN1273 COG1538 # Protein_GI_number: 19704608 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Fusobacterium nucleatum # 11 423 1 413 413 701 99.0 0 MKKFLTVFLLMTNIVLARDLTLEQAIDLSLNNSKEMRISEKNLDISKLNVSKAFKNALPS VTYTGAYTVGEHERKILTQSERNYVNKKRGYTQNIKLTQPLFTGGTITAGIKGAKAYENI ASYSFLQSKIKNRLDTIKIFSDIINAQRNLEALSYSEGILLKRYQKQEEQLKLRLITKTD VLQTEYSIEDIKAQMINIKNIIDTNMEKLYIRTGISKSEPLNLVPFDIPNNFSEKINLDS DLKQAINESISAKVAEEQVKVASATKMAAVGDLLPQVSAFASYGTGERTTFERSYKDSEW TGGVQVSWKVFSFGSDLDNYRVAKLQEEQEELKETSTKENIEINVRSAYLNVLSLEKQIA SQGKALEVAKVNFELNQEKYDAGLISTVDYLDFENTYRQARIAYNKLLLDYYYAFETYRS LLI >gi|296153825|gb|ADVK01000050.1| GENE 25 25403 - 26038 515 211 aa, chain - ## HITS:1 COG:no KEGG:FN1272 NR:ns ## KEGG: FN1272 # Name: not_defined # Def: TetR family transcriptional regulator # Organism: F.nucleatum # Pathway: not_defined # 1 211 1 211 211 305 100.0 1e-81 MNSDIDKKYLILEKAKDMIITESYSSLSISKLTSELSISKGSFYTYFPSKDKMLSEILDE YIKNIIIFKNNLLENSKNIDECLDYYINSILSLTDEELKLELVITNLKRNYEVFNEENFK KLKIIACTMIDFVKEVLNKYKNDINIDEKDIEKCSKMIFSIAEVFLIMENVDFNSDRFSF KTLNEVKKMYRSDEMKEHLEFIKKSIKKILY >gi|296153825|gb|ADVK01000050.1| GENE 26 26245 - 27981 2041 578 aa, chain + ## HITS:1 COG:FN1271 KEGG:ns NR:ns ## COG: FN1271 COG0616 # Protein_GI_number: 19704606 # Func_class: O Posttranslational modification, protein turnover, chaperones; U Intracellular trafficking, secretion, and vesicular transport # Function: Periplasmic serine proteases (ClpP class) # Organism: 
Fusobacterium nucleatum # 14 578 1 565 565 951 99.0 0 MKILHYLKRFILFVIKEILSFFIKLFLFLFVVGIIISAAIKSFEEKPTVAIKNKAYVLIN LADSYNERLLKSNLFEDDSINFYTLLQSIEAISYDDRVEGIILKLNGDSLSYAQSEELAH EISMARAANKKIIAYFENVGRKNYYLASYANEIYMPSANSTNVNIYPYFKENFYIKGLAD KFGVKFNIIHVGDYKSYMENLASNTMSKEAKEDTVRVLDKNYNNFLDVVSLNRKINREDL DKIIKDGELVAASSADLMNNNLIDKYVYWDNVISMVGGKDKIITIQEYAKNYYKEGSMVD SNNIVYVIPLEGDIVESQTEVFSGEENINVSETLEKLNIAKENNKVKAVVLRVNSPGGSA LTSDIIAKKVKELAEEKPVYVSMSSVAASGGYYISTNAHKIFVDRNTITGSIGVVSILPD FSKLITDNGVNIEKISDGEYSDLYSADSFTEKKYNKIYNSNLKVYEDFLSVVSKGRRIDK EKLKTIAEGRIWTGDEAIKIGLADEIGGLNETIYAIAEDNNMDEYGIVIAKDKLELGNIY KKYSRYIKMNAKDLVKEKIFKDYLYNKPVTYLPYDVLD >gi|296153825|gb|ADVK01000050.1| GENE 27 28050 - 28400 179 116 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|149916415|ref|ZP_01904934.1| 30S ribosomal protein S21 [Roseobacter sp. AzwK-3b] # 13 113 9 107 114 73 37 2e-12 MVRKLKGTKTAGGNQADIVKQAQVMQQQMLEVQEQLKSKEVSSSVGGGAVSVKVNGQKEL MEVKLSDEVLKDAANDKEMLEDLILTAVKDAMAKAEELAEGEMAKVTGGINIPGLF >gi|296153825|gb|ADVK01000050.1| GENE 28 28490 - 29113 573 207 aa, chain + ## HITS:1 COG:FN1269 KEGG:ns NR:ns ## COG: FN1269 COG2121 # Protein_GI_number: 19704604 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 207 3 209 209 333 100.0 2e-91 MEENKKYRILGTLLYYVLRIISSTLKIEIVNKYGIDMQRPHIYGFWHSKLFITPIFFRNV EKKLAMSSPTKDGELISVPLEKMGYILVRGSSDKNQISSTISLLKYLKKGYSIGTPLDGP KGPKEKPKKGLLYLSQKTSVPLVPVGISYSKKWILKKTWDQFEIPKPFSKVKIFLGEPIL IDDDEDLDKYTEIVKNGINSLNNQIEF >gi|296153825|gb|ADVK01000050.1| GENE 29 29133 - 31043 2611 636 aa, chain + ## HITS:1 COG:FN1268_1 KEGG:ns NR:ns ## COG: FN1268_1 COG0143 # Protein_GI_number: 19704603 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 526 1 526 526 1060 99.0 0 MKKNFFVSTPIYYVNGDPHVGSAYTTIAADVINRYNKAMGMDTHFVTGLDEHGQKVEQAA KQNGFTPQAWTDKMTPNFKNMWTALDIKYDDFIRTTEDRHKKAVKRILDIVNAKGDIYKG 
EYEGKYCVSCETFFPENQLNGSNKCPDCGKDLTVLKEESYFFKMSKYADALLKHIDEHPD FILPHSRRNEVISFIKQGLQDLSISRNTFSWGIPIDFAPGHITYVWFDALTNYITSAGFE NDENKFDKFWNNSRVVHLIGKDIIRFHAIIWPCMLLSAGIKLPDSIVAHGWWTSEGEKMS KSRGNVVNPYDEIKKYGVDAFRYYLLREANFGTDGDYSTKGIIGRLNSDLANDLGNLLNR TLGMYKKYFNGTIVSSSTAEPIDDEIKSMFNDVVKDVEKYMYLFEFSRALETIWKFISRL NKYIDETMPWTLAKNEAKKARLAAVMNILCEGLYKIAFLIAPYMPESAQKISNQLGIDKD ITNLKFDDIKEWSIFKEGHKLGEASPIFPRIEIEKEEIVETKKELKIENPVDIKEFNKIE IKVVEILDVDKVEGADKLLKFKVFDGEFERQIISGLAKFYPDFKKLISEKVLAVVNLKFT KLKGEISQGMLLTTEDKNGVSLIKIDKTVEAGAIVS >gi|296153825|gb|ADVK01000050.1| GENE 30 31055 - 31672 881 205 aa, chain + ## HITS:1 COG:FN1267 KEGG:ns NR:ns ## COG: FN1267 COG0457 # Protein_GI_number: 19704602 # Func_class: R General function prediction only # Function: FOG: TPR repeat # Organism: Fusobacterium nucleatum # 1 205 1 205 205 358 100.0 3e-99 MGIINKKDEEFFENVEYFSEIIDRINDIQENNNYSDEEMDNDLDVALWRAFVYINLWSYK GYAKAERILKKVENKGIKNPIWCYRYAVSIARLRKYEEALKYFLIGTEVDSTYPWNWLEL GRLYYKFGELDKVFECIEKGLELVPNDYEFLTLKDDVKNDRGYFYSINHYINEEVDKTED RELDYGDDEEWEKFKKETHYGEKCL >gi|296153825|gb|ADVK01000050.1| GENE 31 31684 - 32568 1174 294 aa, chain + ## HITS:1 COG:FN1266 KEGG:ns NR:ns ## COG: FN1266 COG1210 # Protein_GI_number: 19704601 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-glucose pyrophosphorylase # Organism: Fusobacterium nucleatum # 1 294 8 301 301 560 100.0 1e-159 MKKVTKAVIPAAGLGTRVLPATKALPKEMLTIVDKPSLQYIVEELVDSGITDIVIITGRN KNSIEDHFDFSYELENTLKNDNKLDLLEKVSHISNIANIYYVRQSMPLGLGHAVLKAKSF IGDEPFVIALGDDIIYNPEKPVTKQMIEKYELYGKSIIGCQEVAKEDVSKYGIAKLGHKV DETTYQMLDFLEKPSVNEAPSRTACLGRYLLSGKVFKYLEETKPGKNGEIQLTDGILAMM RDNEEVLAYNFIGKRYDIGSKFGLLRANIEFGLRNEETKEEVKEYLKKLDIEKI >gi|296153825|gb|ADVK01000050.1| GENE 32 32737 - 33363 1062 208 aa, chain - ## HITS:1 COG:FN1265 KEGG:ns NR:ns ## COG: FN1265 COG2885 # Protein_GI_number: 19704600 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein and related peptidoglycan-associated (lipo)proteins # Organism: 
Fusobacterium nucleatum # 11 208 1 202 202 303 96.0 2e-82 MKNKKIIASCMLALSLVSCTGLEAGNGAYTAGGAALGAVAGQVIGKDTKGTLIGAAVGSL LGMGWGAYRDNQERELKAKLQGTQAQVRKDGNALVINLPGGVTFASDSANITSGFYSALN GIAQSLNNYPETRIQVNGYTDNTGKDAHNQELSQRRANAVAQYLIAQGVSSNRIVANGFG SSNPIASNSTPEGRLQNRRVEIKILPAQ >gi|296153825|gb|ADVK01000050.1| GENE 33 33536 - 34009 518 157 aa, chain - ## HITS:1 COG:no KEGG:FN1264 NR:ns ## KEGG: FN1264 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 35 157 1 120 120 212 92.0 3e-54 MRKEWAFLKLLTTKGYKKVVLIPLAFCLGIFLYYLYIDFTGGEVDKTVYDDGTVRISAQS DLGSCKLPKILDALNIPIHDELKIRNYNVYLDKEENINSVEIYCSTDKDGNEIIEWYKEK LNSTNSTDNAIGIWNNFEMDVSFNKFSNLVSIVLKKQ >gi|296153825|gb|ADVK01000050.1| GENE 34 34177 - 34998 1212 273 aa, chain + ## HITS:1 COG:FN1263 KEGG:ns NR:ns ## COG: FN1263 COG4822 # Protein_GI_number: 19704598 # Func_class: H Coenzyme transport and metabolism # Function: Cobalamin biosynthesis protein CbiK, Co2+ chelatase # Organism: Fusobacterium nucleatum # 1 273 11 283 283 515 100.0 1e-146 MSKKALFMVHFGTTHNDTKELTIDKMNKKFADEFKDYDSFTAYTSRIVLKRLKDRGEVFS TPIRVLNSLADQGYEELIVQTSHVIPGIEYENLVREVNSFSNKFKSVKIGKPLLYYIDDY KKCVKALADEYVPKNKKEALVLVCHGTDSPLATSYAMIEYVFDEYGYDNVFVVCTKAYPL MDTLLKKLRKNGIEEVRLAPFMFVAGEHAKNDMAVTYKEELEKNGFKVNQVILKGLGEFD AIQNIFLDHLKSAIEKDEEDIADFKKEYSAKYL >gi|296153825|gb|ADVK01000050.1| GENE 35 35071 - 35253 163 60 aa, chain - ## HITS:1 COG:FN1262 KEGG:ns NR:ns ## COG: FN1262 COG1807 # Protein_GI_number: 19704597 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family # Organism: Fusobacterium nucleatum # 1 60 460 519 519 117 100.0 5e-27 WKLGKEIKSLNKNIPDERVIFFLGKNEPKELLKTYEIKKVYNYQKVTHDMERLYLLEKIY Prediction of potential genes in microbial genomes Time: Sat Jul 9 19:59:48 2011 Seq name: gi|296153776|gb|ADVK01000051.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00063, whole genome shotgun sequence
Length of sequence - 65628 bp
Number of predicted genes - 48, with homology - 47
Number of transcription units - 19, operones - 10 average op.length - 3.9
N Tu/Op Conserved S Start End Score pairs(N/Pv)
1 1 Tu 1 . - CDS 2 - 530 836 ## FN1899 lipoprotein
        + Prom 749 - 808 18.1
2 2 Tu 1 . + CDS 854 - 1846 1437 ## COG3641 Predicted membrane protein, putative toxin regulator
        - Term 1836 - 1887 12.1
3 3 Op 1 1/1.000 - CDS 1924 - 2577 805 ## COG0664 cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases
4 3 Op 2 1/1.000 - CDS 2587 - 3072 706 ## COG2131 Deoxycytidylate deaminase
        - Prom 3095 - 3154 7.0
        - Term 3160 - 3223 1.8
5 4 Tu 1 . - CDS 3232 - 5664 3236 ## COG0446 Uncharacterized NAD(FAD)-dependent dehydrogenases
        - Prom 5830 - 5889 11.3
        + Prom 5636 - 5695 6.6
6 5 Tu 1 . + CDS 5804 - 6163 428 ## COG1733 Predicted transcriptional regulators
        + Term 6167 - 6218 6.6
        - Term 6155 - 6204 2.4
7 6 Tu 1 . - CDS 6213 - 10718 5792 ## FN1905 168 kDa surface-layer protein precursor
        - Prom 10889 - 10948 13.0
        + Prom 10853 - 10912 8.6
8 7 Op 1 1/1.000 + CDS 10979 - 12415 2369 ## COG0260 Leucyl aminopeptidase
        + Term 12439 - 12472 -0.9
9 7 Op 2 . + CDS 12491 - 13078 676 ## COG1739 Uncharacterized conserved protein
        + Term 13088 - 13143 12.9
        - Term 13073 - 13134 16.4
10 8 Op 1 1/1.000 - CDS 13158 - 14231 1587 ## COG0584 Glycerophosphoryl diester phosphodiesterase
11 8 Op 2 . - CDS 14274 - 15272 1353 ## COG1044 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase
12 8 Op 3 . - CDS 15297 - 15770 772 ## FN1910 hypothetical protein
13 8 Op 4 . - CDS 15811 - 17904 2786 ## COG4775 Outer membrane protein/protective antigen OMA87
14 8 Op 5 . - CDS 17980 - 22452 5124 ## FN1912 hypothetical protein
        - Prom 22495 - 22554 12.8
        - Term 22543 - 22588 3.0
15 9 Op 1 1/1.000 - CDS 22617 - 24182 2227 ## COG1418 Predicted HD superfamily hydrolase
16 9 Op 2 1/1.000 - CDS 24206 - 24553 448 ## COG1366 Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor)
17 9 Op 3 . - CDS 24554 - 24964 514 ## COG3920 Signal transduction histidine kinase
18 9 Op 4 . - CDS 24969 - 25361 414 ## FN1916 hypothetical protein
19 9 Op 5 1/1.000 - CDS 25408 - 26310 1101 ## COG0324 tRNA delta(2)-isopentenylpyrophosphate transferase
20 9 Op 6 1/1.000 - CDS 26303 - 27589 1780 ## COG0536 Predicted GTPase
        - Prom 27613 - 27672 8.1
21 10 Op 1 1/1.000 - CDS 27696 - 28445 1113 ## COG0500 SAM-dependent methyltransferases
22 10 Op 2 1/1.000 - CDS 28442 - 29473 1168 ## COG0482 Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain
23 10 Op 3 . - CDS 29473 - 30177 746 ## COG0340 Biotin-(acetyl-CoA carboxylase) ligase
24 10 Op 4 . - CDS 30196 - 32169 1785 ## FN1922 hypothetical protein
25 10 Op 5 1/1.000 - CDS 32170 - 34182 1886 ## COG0338 Site-specific DNA methylase
        - Prom 34217 - 34276 10.7
        - Term 34240 - 34271 -0.7
26 11 Op 1 2/0.000 - CDS 34281 - 35558 1397 ## COG1055 Na+/H+ antiporter NhaD and related arsenite permeases
27 11 Op 2 5/0.000 - CDS 35626 - 36900 1467 ## COG1055 Na+/H+ antiporter NhaD and related arsenite permeases
28 11 Op 3 1/1.000 - CDS 36917 - 37852 1194 ## COG0517 FOG: CBS domain
        - Prom 37919 - 37978 9.6
        - Term 37924 - 37971 8.0
29 12 Op 1 1/1.000 - CDS 37980 - 40517 3171 ## COG1461 Predicted kinase related to dihydroxyacetone kinase
30 12 Op 2 1/1.000 - CDS 40529 - 41083 761 ## COG1396 Predicted transcriptional regulators
31 12 Op 3 1/1.000 - CDS 41090 - 42298 295 ## PROTEIN SUPPORTED gi|229231897|ref|ZP_04356325.1| SSU ribosomal protein S12P methylthiotransferase
32 12 Op 4 1/1.000 - CDS 42309 - 42824 649 ## COG1267 Phosphatidylglycerophosphatase A and related proteins
33 12 Op 5 1/1.000 - CDS 42824 - 44986 2446 ## COG0826 Collagenase and related proteases
34 12 Op 6 . - CDS 44983 - 45555 824 ## COG0237 Dephospho-CoA kinase
        - Prom 45587 - 45646 9.6
35 13 Op 1 . - CDS 45751 - 47637 1920 ## COG0553 Superfamily II DNA/RNA helicases, SNF2 family
36 13 Op 2 . - CDS 47627 - 48844 1519 ## CLOST_1638 hypothetical protein
37 13 Op 3 . - CDS 48845 - 51784 2634 ## Amet_2144 RNA polymerase subunit alpha
        - Prom 51903 - 51962 11.7
        - Term 52167 - 52221 -0.9
38 14 Tu 1 . - CDS 52257 - 52454 155 ##
        - Prom 52496 - 52555 6.9
39 15 Tu 1 . - CDS 52557 - 52688 152 ## gi|256846593|ref|ZP_05552050.1| predicted protein
        - Prom 52712 - 52771 3.2
40 16 Op 1 . - CDS 53579 - 54580 1160 ## COG3392 Adenine-specific DNA methylase
        - Prom 54600 - 54659 4.4
41 16 Op 2 . - CDS 54661 - 55452 720 ## FN1936 hypothetical protein
42 16 Op 3 . - CDS 55469 - 55936 326 ## FN1938 hypothetical protein
43 16 Op 4 . - CDS 56014 - 57060 1149 ## FN1939 hypothetical protein
44 16 Op 5 .
- CDS 57105 - 58475 1076 ## COG0534 Na+-driven multidrug efflux pump - Prom 58500 - 58559 5.7 + Prom 58765 - 58824 12.6 45 17 Tu 1 . + CDS 58867 - 61443 1810 ## PROTEIN SUPPORTED gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 + Term 61480 - 61527 11.1 - Term 61472 - 61511 -0.8 46 18 Tu 1 . - CDS 61520 - 62209 548 ## COG2964 Uncharacterized protein conserved in bacteria - Prom 62234 - 62293 9.4 + Prom 62265 - 62324 11.7 47 19 Op 1 . + CDS 62421 - 64058 2404 ## COG3033 Tryptophanase + Prom 64084 - 64143 12.7 48 19 Op 2 . + CDS 64182 - 65516 1844 ## COG0733 Na+-dependent transporters of the SNF family + Term 65549 - 65605 10.1 Predicted protein(s) >gi|296153776|gb|ADVK01000051.1| GENE 1 2 - 530 836 176 aa, chain - ## HITS:1 COG:no KEGG:FN1899 NR:ns ## KEGG: FN1899 # Name: not_defined # Def: lipoprotein # Organism: F.nucleatum # Pathway: not_defined # 3 176 1 174 416 335 100.0 4e-91 MKMKKILFNLLAIFMLVVAVACGKKEAPTEDANAQQEAASEVATQDYHIGIVTTSVSQSE DNFRGAEAVLKQYGSSNDEGGKITVVTVPDNFMQEQETTISQMVSLADDPKMKAVIVAEG IPGTYPAFKAIREKRPDILLFVNNTHEDPVQVSTVADVVVNSDSVARGYLIVKTAH >gi|296153776|gb|ADVK01000051.1| GENE 2 854 - 1846 1437 330 aa, chain + ## HITS:1 COG:FN1900 KEGG:ns NR:ns ## COG: FN1900 COG3641 # Protein_GI_number: 19705205 # Func_class: R General function prediction only # Function: Predicted membrane protein, putative toxin regulator # Organism: Fusobacterium nucleatum # 1 330 1 330 330 493 100.0 1e-139 MKNFFIKSLNGMAFGLFSSLIVGLILKQIGTLFNIEFLIYLGGFAQLLMGAGIGVGVAYA LESPVLILISSAITGMYGAGSINFIDGQAILKVGEPMGAYFSVIFGLLISKQLAGKTKFD IILLPMTTIIFGCLLGKFFAPYISAIITEIGVIVNKTTELRPILMGLTLSVIMGIILTLP ISSAAIGISLGLSGLAAGAALTGCCCQMIGFAIMSYDDNDLGTVFSIGFGTSMIQIPNII KNPIIWVPPIASSAILGVLSTTVFKLSSNSIASGMGTSGFVGQIASFSVNGMPYLPTMII LHFLLPAILTFIIYKVLKKKGYIKAGDLKI >gi|296153776|gb|ADVK01000051.1| GENE 3 1924 - 2577 805 217 aa, chain - ## HITS:1 COG:FN1901 KEGG:ns NR:ns ## COG: FN1901 COG0664 # Protein_GI_number: 19705206 # Func_class: T Signal transduction mechanisms # Function: cAMP-binding proteins 
- catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases # Organism: Fusobacterium nucleatum # 1 217 1 217 217 347 100.0 7e-96 MIKTLKETVVFNSIDEKTIKNILEKTKYEIKKYSPNESIAFRGDEVKGLYIILKGTLITE MLTEEGNVIKIEELVPSDVIASAFIFGKKNSFPVDLSVKDEAEILFVERKEFLKLLFSEE KILENFLNEISNKTQLLTNKIWNSFNNKTIKKKFCDYVKKNQKNNLFSIQNLGALAEFFG VERPSLSRVLSELVKDEKLERIGRNKYKILDIEFFEI >gi|296153776|gb|ADVK01000051.1| GENE 4 2587 - 3072 706 161 aa, chain - ## HITS:1 COG:FN1902 KEGG:ns NR:ns ## COG: FN1902 COG2131 # Protein_GI_number: 19705207 # Func_class: F Nucleotide transport and metabolism # Function: Deoxycytidylate deaminase # Organism: Fusobacterium nucleatum # 1 161 14 174 174 335 100.0 3e-92 MRENYINWDSYFMGIAILSSMRSKDPNTQVGACIVNEDKRIVGVGYNGLPKGCDDKEFPW QRDGEFLNTKYPYVCHAELNAILNSIKSLKDCIIYVALFPCHECTKAIIQSGIKEIVYLS DKYTDTDSNRASKKMLDSAGVKYRRFEPDIEKLEINFANIE >gi|296153776|gb|ADVK01000051.1| GENE 5 3232 - 5664 3236 810 aa, chain - ## HITS:1 COG:FN1903_1 KEGG:ns NR:ns ## COG: FN1903_1 COG0446 # Protein_GI_number: 19705208 # Func_class: R General function prediction only # Function: Uncharacterized NAD(FAD)-dependent dehydrogenases # Organism: Fusobacterium nucleatum # 1 469 1 469 469 863 99.0 0 MKKVLIVGGVAGGASTATRLRRLDENLEIVIFEKGEYVSFANCGLPYYIGDIIQNRESLL VQTPESLKARFNLDVRVNSEVVGVNGKDKKVKVKTKNGEEYEEIFDFLVLAPGAKPIFPA IKGIENKKIFTLRNINDMDKIKAEIKNHNVKKATVIGGGYVGIETAENLKHLGIDTTLIE AAPHILAPFDSEISNILEYELVNNGINLMISEKVVEFQEDGNEIIIKLESGKIVTTDMLI LSIGVSPDTKFLQNSGINLGERGHILVNEKLETNIDGIYALGDSIIVKNYITNQDVAIPL AGPANRQGRIVAGNIVGRNEKYKGSLGTAIIKIFELTGASTGLNERSLKQLNIPYEKVYL HPNNHAAYYPGATAISIKALYNKENGQILGAQAVGISGVDKFIDVIATSIKFKATIDDLT ELELAYAPPFLSAKSPANMLGFIGQNIEDNLLGQVFMEDLENYNEKETIILDVREELELI SGKLNNSINIPLSELRKRCTELPKDKEIWTYCAVGLRGYIASRFLTQKGYKVKNLAGGIK IEEKELIKMQEETFSNKENSDYNVDKEDEYLDLSGLSCPGPLVKIKEKIDKLQESKKLKV KVSDPGFYNDIQAWSKVTKNSLLSLDKKDGLTYATLQKGQASKVVVKEQENVIIEDNSNM TMVVFSGDLDKAIAAFIIANGALTMGKKVTMFFTFWGLSILKKKKLAKKSFIEKMFAMML PKNSQDLPVSKMNFFGIGAKMIRSVMKKKNIMSLEELMKKAKDSGINITACTMSMDVMGI 
SKEELIDGINYGGVGQYLGETEKSNNNLFI >gi|296153776|gb|ADVK01000051.1| GENE 6 5804 - 6163 428 119 aa, chain + ## HITS:1 COG:FN1904 KEGG:ns NR:ns ## COG: FN1904 COG1733 # Protein_GI_number: 19705209 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 119 30 148 148 227 100.0 4e-60 MDKNKKYNCFFEFTLDIVGGKWKPIILYYININSVARHSELKRFIPSINERMLTRQLREL EEDNLIERKVYPVVPPKVEYKLTEYGKSLIPILKSLVLWGKDYAKAIKFDNFKMNLPEE >gi|296153776|gb|ADVK01000051.1| GENE 7 6213 - 10718 5792 1501 aa, chain - ## HITS:1 COG:no KEGG:FN1905 NR:ns ## KEGG: FN1905 # Name: not_defined # Def: 168 kDa surface-layer protein precursor # Organism: F.nucleatum # Pathway: not_defined # 14 1501 1 1487 1487 2338 97.0 0 MFKNKYLFVLALLVATTSYSEDYTITQETKKESKEILKHEKEDSGRVGKFAESGNAIIFQ RDGSLINNGLLRGSITSIGEDTKNEKSKKGEIIVTAQGNGVSGVGYSPYSNKNDIDKKIS SVINNGYISGEANLTAGNAEAFGSIEEVYSAANGISGLALSDFGEGGIVDGPGGATRSGR RKSASLLSTNILASKTAEKNLENKVGEIKKTDPKDFYGKNNKFTLGDITNSETISGKAIL KTKEGYIRKDSEAFGGKLAISHQWRTINATSTGNGISNLSYITTVDRYTYSEDKVNGSYI KNIKNSGIISGNVEATTGSGATQTTIRASVTGNGISSAAYSNNFVKSLTEAIIDKIQNRG TISGRLKAEVGNNTARSFHEYSNAIITGSANGVSVYSRSSNGQMKNSRAVIGELTNSGSI SGNLYAKAGSGYGEVRADVKSSGNGVAIYTESSTKKETKIGSIDNKGMITGKAVIYGGKD KKAMENKTLRDKIFHVPLTNNIAGYSLPDEDVEADKKAVAEMKKYEKEMANELENKIAKY QKQIENKEKDILAKTPKKPEISETDKKDRDNTFKKYLRDEIKEQQNLIWSGINDTVVAEA KKRKAELEKLGSNATDDEKIKFLEEYKKVLEKRLEESGDPTWKNQEKKNRIAKINDILNK NKVKDIPDSPEILALKKEKAELEKELAKVMDEKKKSDHLAANIHTETGVVASGNGISIND GKEDGVVLGSLKNSGVISGETEIHHGNSQRKYSRIAHKNSGAGIAVGGNVEGKIENTGII SGTEFALLAKGKRNDAYGIDKENVTYESGFKGGVDNYGILAGRIIIGGYQSAHTGSHGEW GKIEEEYGYFETIYKDKKHYNNKGIFLVLDRNGEVRKVIGEDKETIHNGRTVKNVLDING TYEGDTSNKIINGVGKDGVVVAKENQNKNIENSIINGFKNAIKVKKNGLVKISNSTINAN GFGKSYAILGDEGANEVEITNKSIINGKIDLGAGDDRLTLSGNYRLNREVDLGTGNNTLA FGKKENGIRSTSSYSTFGASSTNTSVFNGTVNNANKVEVNENTAFSSNSRINGVNELNLA NGKTLDYYFVDKENQAFAELAKSNRNLKVKGSGKVNLVPIGSKVQIGYEKDLLGFSFGNN LLKPNSNGGTTLPSNNGRMTPPSNNPRNNTPGNRGGFFIDPRLTPFDRYSKGAFDAYIAD 
YKSTGIDFLKDYKSSQEVFKNYLYNLTNNNPVGYLKDSAIDTVLAYHKANTNSNFTKKGQ ITVSGEIFNIRNKYYENNKVVSNGASAEISYGTSDNTNIGLNIGGGQSKVKTSNENLKSE VLVLGVNAEYKVKNFSWKNEVSYGRIKPKKYSALDTYSLYSQLGYNIPLSETWSLTPKVS VFLSKVKQKEMTVNNIKVAKDDDTFVETTVGTELNKKFMFGNNGINFGLGVDYSMVKDLD DSKAHFVGGSTEFDLRNYDRKNRATAEIKFGYEHISGITATFRIRKNKDITSSGFGLGYK F >gi|296153776|gb|ADVK01000051.1| GENE 8 10979 - 12415 2369 478 aa, chain + ## HITS:1 COG:FN1906 KEGG:ns NR:ns ## COG: FN1906 COG0260 # Protein_GI_number: 19705211 # Func_class: E Amino acid transport and metabolism # Function: Leucyl aminopeptidase # Organism: Fusobacterium nucleatum # 1 478 1 478 478 907 99.0 0 MSFQCVKKYEDSYDKYVLAATSEKVVLPDYLDKESKKIAETIIKKNKFTAKASEKISMTL VNKKKVIEFIIIGLGEKKKLDAKNTRQYLFDGLKNIIGKVLFSFDNKDLDNIDILAEVVE HINYKFDKYFSKKKEEFLEVSYLTDKKVPKLIEGYELAKISNIVKDLVNEQAEVLNPKEL ADRATKLGKKFGFDVEILDEKKAQKLGMNAYLSVARAAHHRPYVIVMRYKGNAKSKYTFG LVGKGLTYDTGGLSLKPTDSMLTMRCDMGGAATMIGAMCSVAKMKLKKNVTCVVAACENS IGPNAYRPGDILTAMNGKTIEVTNTDAEGRLTLADALTYIVRKEKVNEVIDAATLTGAIM VALGEDVTGVFTNDDKMARKVIDASENWNEYFWQMPMFDLYKKNLKSSYADMQNTGVRWG GSTNAAKFLEEFIDDTKWVHLDIAGTAWASGANPYYSQKGATGQVFRTVYSYIKDNKN >gi|296153776|gb|ADVK01000051.1| GENE 9 12491 - 13078 676 195 aa, chain + ## HITS:1 COG:FN1907 KEGG:ns NR:ns ## COG: FN1907 COG1739 # Protein_GI_number: 19705212 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 195 1 195 195 339 98.0 2e-93 MEKIKTVKKECKIEFEEKKSKFIGYVKPVFSKEEAEDYIKYIKNLHSDATHNCSAYKINN NGLEFFKVDDDGEPSGTAGKPIGDIINYMEVSNLVVIATRYFGGIKLGAGGLVRNYAKTA KLAITEAEIIDFIDKIDLIFEIPYERLGEVEKLLKEYEAEVIDKSFLEKIVFKVKINKNF YDNLENYTFINLLDI >gi|296153776|gb|ADVK01000051.1| GENE 10 13158 - 14231 1587 357 aa, chain - ## HITS:1 COG:FN1908 KEGG:ns NR:ns ## COG: FN1908 COG0584 # Protein_GI_number: 19705213 # Func_class: C Energy production and conversion # Function: Glycerophosphoryl diester phosphodiesterase # Organism: Fusobacterium nucleatum # 1 357 1 357 357 708 99.0 0 
MKLKSCLVLLGILSSTALFAAHNGKIIIAHRGASGYLPEHTLESKALAFAQQADYLEQDL AMSKDGKLIVIHDHFLDGLTDVAKKFPNRKRADGRYYVIDFTWPELQTLEMTENFTTKDG KQTAVYPNRFPLWKSDFKLHTFEEEIEFIQGLEKSTGKKIGIYPEIKAPWFHHQNGKDIA KATLEVLKKYGYTKKSDMVYLQTFDYNELKRIKTELMPKMGMDLKLVQLIAYNDWHETEE KDKNGKWVNYDYDWMFKEGAMKEIAKYADGVGPGWYMLIDDKNSKVGNIVYTPMVKDIAT TKMELHPYTVRKDALPEFFTDVNQMYDALLNKAGATGVFTDFPDLGVQFLENQKNKK >gi|296153776|gb|ADVK01000051.1| GENE 11 14274 - 15272 1353 332 aa, chain - ## HITS:1 COG:FN1909 KEGG:ns NR:ns ## COG: FN1909 COG1044 # Protein_GI_number: 19705214 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase # Organism: Fusobacterium nucleatum # 1 332 1 332 332 604 100.0 1e-173 MEYRVTDIITLLNAEYKGEVVEKVSKLSPFFHSDEKSLTFAADEKFLKNLSQTKAKVIIV PDIDLPLIEGKGYIVVKDSPRVIMPKLLHFFSRNLKKIEKMREDSAKIGENVDIAPNVYI GHDVVIGNNVKIFPNVTIGEGSIIGDGTVIYSNVSIREFVEIGKNCVIQPGAVIGSDGFG FVKVNGNNTKIDQIGTVIVEDEVEIGANTTIDRGAIGDTIIKKYTKIDNLVQIAHNDIIG ENCLIISQVGIAGSTTIGNNVTLAGQVGVAGHLEIGDNTMIGAQSGVPGNVEANKILSGH PLVDHREDMKIRVAMKKLPELLKRVKALEEKK >gi|296153776|gb|ADVK01000051.1| GENE 12 15297 - 15770 772 157 aa, chain - ## HITS:1 COG:no KEGG:FN1910 NR:ns ## KEGG: FN1910 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 157 1 157 157 167 100.0 1e-40 MKKLLLVASVLLATSAFAEKVGVVDSQKAFFQFSETKKAQQSLESQAKKVENEARQKEVA LQKEYVALQAKGDKLTDAEKKAFEKKSQDFQAFLNSSQDKLNKEQMAKLKKIEDVYVKAI KKVAADGKYDYIFEAEALKVGGEDITARVLKEMEALK >gi|296153776|gb|ADVK01000051.1| GENE 13 15811 - 17904 2786 697 aa, chain - ## HITS:1 COG:FN1911 KEGG:ns NR:ns ## COG: FN1911 COG4775 # Protein_GI_number: 19705216 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Outer membrane protein/protective antigen OMA87 # Organism: Fusobacterium nucleatum # 20 697 1 678 678 1332 100.0 0 MKKILIALLFVISLTSFSTMVNLPIKSVEVVNNQQVPASLIKNTLKLKEGAKFSTEALLA DFNALKETGYFEDVILQPVSYDGGVRIVVDVVEKENVVDLLKEKGVAINTLREDTDKSIV 
LSSVKFTGNKRVTTSELLDITQLKAGEYFSRSRVEDAQRRLLATGKFSEVRPDAQVANGK MALSFEVVENPIVKSVIITGNNTIPTSTIMSELTTKPGSVQNYNNLREDRDKILGLYQAQ GYTLVNITDMSTDENGTLHISIVEGIVRRIEVKKMVTKQKGNRRTPNDDVLKTKDYVIDR EIEIQPGKIFNVKEYDATVDNLMRLGIFKNVKYEARSIPGDPEGIDLILLIDEDRTAELQ GGVAYGSETGFLGTLSLKDSNWRGKNQQFGFTFEKSNKNYTGFALDFYDPWIKDTDRVSW GWGAYRTSYGDEDSILFHEIDTIGFRTNIGKGLGKNFTLSLGTKVEYIKEKHEDGKLRQA NNGKWYYKEKNKWREIEGVDDKYWLWSIYPYISYDTRNNYLNPTSGFYGKFQVEAGHAGG YKSGNFGNATLELRTYHKGLFKNNIFAYKVVGGVATNNTKESQKFWVGGGNSLRGYDGGF FKGSQKLVATIENRTQLNDIIGLVVFADAGRAWKQNGRDPSYTRDNSRFGHNIGTTAGVG IRLNTPIGPLRFDFGWPVGNKMDDDGMKFYFNMGQSF >gi|296153776|gb|ADVK01000051.1| GENE 14 17980 - 22452 5124 1490 aa, chain - ## HITS:1 COG:no KEGG:FN1912 NR:ns ## KEGG: FN1912 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 316 1490 1 1175 1175 1932 96.0 0 MLNSIKKIPKKISIPVSIVAVIVFIITAVLLNLEKIIEKVSARFINGRVVIENVDLSFSK AIVKNITLYDDKNNVLFNSPEVIANISFKNLKKGRIDELVVNSAVVNVIRDKDGVINFTK LSKTKSEEKPKNPLNKVVASNVRVNYEDYTFPSKLERKIENINAIVTASKEKLVETADIN IEDENIQLKTFFKDESNDKITSLQGELRIDKFLLDKDLLKSLVNNKKMYFSDVNIISDLS FKTDKTLKNTKIVGNLDIISEFFRYDDIDSDIKDIKLSSKFNGRDGEVNLGLNIFEKNKE FSLAYKDEELNSVITLDRIDESILNKIIPIREKKLDFKNINIEDIKTIVHYSDDRGLSIK TTMKPNNSEFKGIELNDFNLYMTSKAGKNNLSARILTKIKDIPENITLNVENQKDNTDII LALKSPIKDNIIPDINIKAKVENQKDIIKANIDSNIVDFNMDYNKDKKLTKVYGNKFTIN YDVDKKKITNGEGRIPFEIYNTANYLDFVAKNNKIQIKELKLKDKTNKNSYFTAKGNANL DNGEFKIDYEGKAASISRKVKENDLILSFDGKGKLENKNNILSSQGQINDLSLEYIGKIE KINGTYNLKKVGKNIEANVLTKIDSIGYDKYNFKNFNLNVNYSEDQVKIKDFSNNLISLK GDYDVKNQKVNANLSVNRITNNDVAFNKAEFILENLKANVEGDIKNLQGAVDLGSTVITL PSKDFVKITGKASIKNSIVNISGINLDNNLITGKYNLKDKNLDLKVSLSEKHLEKYYGAK DLGYILYGQINVKGVAGKIKAIAKGRATNFEKKLPDLAYDIEYNAENYSDGIASINGLDI IDRQYGDLLGVKGQVNLKEKTLDIKNKHNKIDLAKLQNILSNPNIKGIINTDLTINGTIN NPIYKLDISSSEVSVKTFKINDISLNLTGDKEKASLNKLNLDVYKNLIVGNGYYDIKNKT YNLLVKSNNKIDVSKFQSFLTPYGIENAKGKIALNVEINERTEKGYINLENISLESTKAK LKLSNFSGPINFGDRRIDVGELKASLNNSPLVVDGFVDLANISKLDKEDLIRSLPYKLYF 
KMNNFNYLYPKVIKISGSTELTVTNEEVYGNLIIKDAIVYDIPNNYYRDFFSLIREQLRK RRTDVVSTKIDDKQSKTDKEKVEEMRRMLNKLMPIDFVVKTEKPILIDMDNFNIVVPEVY GKLYIDLNLNGKKGKYYITGETELKEGYFFVGTNEFQVDRALAVFNENVPLPEINPNIFF ESTIEMDDEEYHFNTAGKVNQLRYEISSKTAKVGGDLSALIVNPNADEHIYSYGDGNEIF ITFMKNLIAGQAGQVVFGSTTRYIKRKFDLTKFVIRPEVKIYNSDSSVNRLGGTTTDNRG FNPEIYNVNIKLEAKDNIYKDKLFWKANVRIIGTGKDVIKNQTMKVDSKVREYDVGLEYK VDDSKTIEIGVGTVPDKYRTDPNKDYRKPNYHIGFKFRKRYRDFSEIFSF >gi|296153776|gb|ADVK01000051.1| GENE 15 22617 - 24182 2227 521 aa, chain - ## HITS:1 COG:FN1913 KEGG:ns NR:ns ## COG: FN1913 COG1418 # Protein_GI_number: 19705218 # Func_class: R General function prediction only # Function: Predicted HD superfamily hydrolase # Organism: Fusobacterium nucleatum # 14 521 1 508 508 771 99.0 0 MDLLIFLGLGVFALALIFAVFFKKIVIDRQIEKLNDLEDEVEKAKLKAKEIVEEAERDAV SKAKEIELKAKEKAYQIKEEIEKEARNSKNEIAQKEARIIKKEEILDGKIEKIEIKSLEL EKINDELEEKRKEIDDLRVKQEEELSRVSELTKADAREILLRKVREEMTHDMAITIREFE NKLDEEKEKISQKILSTAIGKAAADYVADATVSVINLPNDEMKGRIIGREGRNIRTIEAL TGVDVIIDDTPEAVVLSCFDGVKREVARLTIEKLITDGRIHPGKIEEIVNKCKKDIEKEI VAAGEEALIELSIPTMHPEIIKTLGRLKYRTSYGQNVLTHSIEVAKIASTMAAEIGANVE LAKRGGLLHDIGKVLVNEIETSHAIVGGEFIKKFGEKQDVINAVMAHHNEVEFETVEAIL VQAADAVSASRPGARRETLTAYIKRLENLEEIANSFEGVESSYAIQAGRELRIVINPDKV SDDEATLMSREVAKKIEDTMQYPGQIKVTILRETRAVEYAK >gi|296153776|gb|ADVK01000051.1| GENE 16 24206 - 24553 448 115 aa, chain - ## HITS:1 COG:FN1914 KEGG:ns NR:ns ## COG: FN1914 COG1366 # Protein_GI_number: 19705219 # Func_class: T Signal transduction mechanisms # Function: Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) # Organism: Fusobacterium nucleatum # 1 115 1 115 115 186 100.0 1e-47 MENNFEILERVKDDIQIIEINGELDAFVAPKLKETFNRLIEKDNNKYIVDFKGLIHINSL AMGILRGKLQVVREMGGDIKIVNLNKHIQTIFETIGLDEIFEIYKNEEEALKNFK >gi|296153776|gb|ADVK01000051.1| GENE 17 24554 - 24964 514 136 aa, chain - ## HITS:1 COG:FN1915 KEGG:ns NR:ns ## COG: FN1915 COG3920 # Protein_GI_number: 19705220 # Func_class: T Signal transduction mechanisms # Function: Signal transduction 
histidine kinase # Organism: Fusobacterium nucleatum # 28 136 1 109 109 185 100.0 2e-47 MENFEKEINRVKIFIPSFLSSLSTVRAMVRVYLREHHISELDEIQILSVVDELTTNAVEH AYSYDKGEIEIVLNFYKKTIFLTVEDFGKGYDESLDSKEDGGFGLSIARKLVDVFKIEKK TKGTVFKVEKRIKEAV >gi|296153776|gb|ADVK01000051.1| GENE 18 24969 - 25361 414 130 aa, chain - ## HITS:1 COG:no KEGG:FN1916 NR:ns ## KEGG: FN1916 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 35 130 1 96 96 151 98.0 6e-36 MKKIILLITMLFLLISCSNNNYIKTGFSQNEKQELVLFKDRIKNNLSENNLAYIKENTKD SYRNKYILEKLQNIDFAKLNIFVSEPSYTNEYPSSLLALNMNEDTYYFELFFIFDNQNKK WLIFDLKERG >gi|296153776|gb|ADVK01000051.1| GENE 19 25408 - 26310 1101 300 aa, chain - ## HITS:1 COG:FN1917 KEGG:ns NR:ns ## COG: FN1917 COG0324 # Protein_GI_number: 19705222 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA delta(2)-isopentenylpyrophosphate transferase # Organism: Fusobacterium nucleatum # 1 300 4 303 303 514 99.0 1e-146 MNKAIVIAGPTGVGKTKISIDLAKKLNAEIISSDSAQVYRGLNIGTAKIREEEKEGIKHH LIDIVEPVLKYSVGNFEKDVNKILNQNSEKNFLLVGGTGLYLNSVTNGLSILPEADKKTR EYLTTLNNQALLELALKYDEEATKEIHPNNRVRLERVVEVFLLTGQKFSELSKKNIKNNN FKFLKIALERNRENLYDRINKRVDIMFAQGLVDEVKNLYKIYGDKLYSLNIIGYNEIIDY INAKISLDEAVYQIKLNSRHYAKRQFTWFKADKEYQWFNLDGISEQEIVKTIYTLFNIKA >gi|296153776|gb|ADVK01000051.1| GENE 20 26303 - 27589 1780 428 aa, chain - ## HITS:1 COG:FN1918 KEGG:ns NR:ns ## COG: FN1918 COG0536 # Protein_GI_number: 19705223 # Func_class: R General function prediction only # Function: Predicted GTPase # Organism: Fusobacterium nucleatum # 1 428 1 428 428 724 99.0 0 MFIDEVIITVKAGNGGDGSAAFRREKFVQFGGPDGGDGGKGGDVIFVADSNINTLIDFKF KKLFKAQNGENGQKKQMYGKKGEDLIIKVPVGTQVRDFTTGKLILDMSVNGEQRVLLKGG KGGYGNVHFKNSIRKAPKIAEKGGEGAEIKVKLELKLLADVALVGYPSVGKSSFINKVSA ANSKVGSYHFTTLEPKLGVVRLEEGKSFVIADIPGLIEGAHEGVGLGDKFLKHIERCKMI YHIVDVAEIEGRDCIEDFEKINHELKKFSEKLAGKKQIVIANKMDLIWDMEKFEKFKSYL AEKGIEIYPVSVLLNEGLKEILYKTYDMLSRIEREPLEEETDITKLLKELKIEKEDFEIT 
RDEEDAIVVGGRIVDDVLAKYVIGMDDESLVTFLHMMRSLGMEEALQEFGVQDGDTVKIA DVEFEYFE >gi|296153776|gb|ADVK01000051.1| GENE 21 27696 - 28445 1113 249 aa, chain - ## HITS:1 COG:FN1919 KEGG:ns NR:ns ## COG: FN1919 COG0500 # Protein_GI_number: 19705224 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: SAM-dependent methyltransferases # Organism: Fusobacterium nucleatum # 1 249 1 249 249 484 95.0 1e-137 MSYQNINASIIDKWIKEEDWEWGKAISHEEYIKALNGDWNVKLTPVKFVPHEWFGDLKGK KLLGLASGGGQQIPVFTALGAECTVLDYSDAQLENEKTVAERENYKVNIIKADMSNSLPF EDESFDIIFHPVSNCYIENVELVFKECYRILKKGGILLCGLSTEINYLVDENEEKIVFSM PFNPLKNKEHREFLEKFDGGYQFSHTLSEQLGGQLKAGFILTNIEDDTNGAGRLHEMNIS TYIMTRAVK >gi|296153776|gb|ADVK01000051.1| GENE 22 28442 - 29473 1168 343 aa, chain - ## HITS:1 COG:FN1920 KEGG:ns NR:ns ## COG: FN1920 COG0482 # Protein_GI_number: 19705225 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain # Organism: Fusobacterium nucleatum # 1 343 1 343 343 605 92.0 1e-173 MKKVVIGMSGGVDSSVSAYLLKEQGYEVIGVTLNQHLEENLKDIEDAKKVCDKLGIIHEV VNIRKDFENIVIKYFLDGYSSGKTPSPCVICDDEIKFKILFDIADKYKAEYVATGHYTSV EYSEVFSKYLLKSVHSIIKDQSYMLYRISPDKLERLIFPLKPYSKQGIREIALKIGLEVH DKKDSQGVCFAKEGYKEFLKENLKDEIVRGNYVDKDGNILGQHEGYQLYTIGQRRGLGIN FSKPVFITEIKAQTNEIVLGEFSELFTDKVELINCKFSVEYKKIEKLELLARPRFSSTGF YGKLMKDKEKIYFKYNEENAHNSKGQHIVFFYDGFVVGGGEIK >gi|296153776|gb|ADVK01000051.1| GENE 23 29473 - 30177 746 234 aa, chain - ## HITS:1 COG:FN1921 KEGG:ns NR:ns ## COG: FN1921 COG0340 # Protein_GI_number: 19705226 # Func_class: H Coenzyme transport and metabolism # Function: Biotin-(acetyl-CoA carboxylase) ligase # Organism: Fusobacterium nucleatum # 1 234 1 234 234 402 93.0 1e-112 MKFLKFDEIDSTNNYMKENISSFENYDIVAAKVQTSGRGRRGNVWLSPEGMALFSFLLKP EKTLSIIKATKLPLLAGISTLSALKKMKDGAYSFKWTNDVFLNSKKLCGILIERVKDNFV VGIGINVANKIPEDIKNIAISMESDYDIEKIILKVVEEFSVYYKRFSEGKWQEIIEEINS 
YNFLKNKKIRVHIGDKVFEGIARNIVEDGRIEIEMDREIKLFSVGEIKIEKDYY >gi|296153776|gb|ADVK01000051.1| GENE 24 30196 - 32169 1785 657 aa, chain - ## HITS:1 COG:no KEGG:FN1922 NR:ns ## KEGG: FN1922 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 58 657 1 600 600 934 92.0 0 MKELKYKSFCWVIGTTSFRTAKLNLKIEQQLRLLDEFHKSINTWEWSNSTQEKYYDFMKS KQFVSGDASRKDKDAREKTSGLVDIGLIAENRLLTEVGKKLLEITNKEEISKNNIFNIED DSFIYLKQLLKTSINVDGKFIKPYIVLIYCLEKLEYLTYDEFTYFIPLITDKKSLNKIIE NIRKYRKKELKKEDVIYKKLISMENYQEAEKALLANKVTEELICSAGMNRKSKTYDKIFF QLYQLLKAIFIDKKNQNYLKTFEIIGSITGKSSIHWKNLIFKTNKKESVKKDKEKSIDEN CPFKVSKKEDEFKKVFFKYLHTFKAMATLEDYFDLNRRYLALTETFIFEDSTIKLDIFPK YYFRESVLNLIDEAFIEEKNLKVDIELSEISNSLKVNLNRIYSNLSKDLNIKITNPNEAN KYIQDERYKRFNELIKNKFTDEVLLNLLEYFEKREDKEIEKLVTDEATIPTIFEYILGII WHKVSEFEGNILEYMKLSLEANLLPRTHASGGYADIIYEYLGNKKYPKHSLLIEATLSDG SNQRKMEMEPVSRHLGDYRIKSKNSNDYSLFITTLLEQNIITDFRFRKIMPYEKNGEVIE GMKIIPIDTNFLKEIIKNKITYKELYSDFEEHYQKEPDERNWYKNMIEKINKKYKEI >gi|296153776|gb|ADVK01000051.1| GENE 25 32170 - 34182 1886 670 aa, chain - ## HITS:1 COG:FN1923_2 KEGG:ns NR:ns ## COG: FN1923_2 COG0338 # Protein_GI_number: 19705228 # Func_class: L Replication, recombination and repair # Function: Site-specific DNA methylase # Organism: Fusobacterium nucleatum # 367 670 1 304 304 514 99.0 1e-145 MENLKLFLDDEDKKKNKVEKLIEELKLKRFFINNRRYLGNKYSLTNFIKKIVEENCKSID VVADVFSGTGSVSEIFKDKELITNDLLYCNYISNYAWFSNEDYSEEKIINIIYEYNEIET SENNYVRENFADTFFSADDCSKIGYIREDIEKKYNNKEINYKEYSILITILLYGMDRIAN TVGHYDAYRKKIEFDKELYLGIMLPDKNLNKNNKCFNEDATNLVKKIKCDLLYLDPPYNS RQYSDAYHLLENIARWEKPKVFGIARKMDRKAIKSSYCTIEATQKFRELIENTNARYILL SYNNMSEKGDGRSNAKILDKDILEILEKKGKVKVFEEEYKMFSTGKSNVKDNKERLFLCE VEENRSVSSPFNYTGGKYKLLEQLQKLFKEEEIFLDIFTGGANVGINSKSSKIIFNDVNA KIISLMEYIKNIDVEELLKKIDNIIVDYGLSNTMLYGYEYYSCNSSEGLANCNKEAFLKL REDYNKKLNKGIEDYNLLYVLIVFSFNNQVRFNSKGEFNLPVGKRDFNSKMRNKLILFSK KLKEKDIEFYSKDFRDIDINKIPKNTFVYCDPPYLITTAGYNENGMWTDKEEKDLLNFLK ELDKKGLKFALSNVLESKNKENKILKDWILENNFYCNYLKKDYSNSNYQRKEKDSVSVEV LVTNYNTEEM 
>gi|296153776|gb|ADVK01000051.1| GENE 26 34281 - 35558 1397 425 aa, chain - ## HITS:1 COG:FN1924 KEGG:ns NR:ns ## COG: FN1924 COG1055 # Protein_GI_number: 19705229 # Func_class: P Inorganic ion transport and metabolism # Function: Na+/H+ antiporter NhaD and related arsenite permeases # Organism: Fusobacterium nucleatum # 1 425 1 425 425 682 100.0 0 MLLVLGILIFIVVFYCIITEKIPSAYATMLGALAMAFLGIVNEEEILETIHSRLEILLLL IGMMIIVSLISETGVFQWFAIKVVKIVRGDPLKLLILLSLVTATCSAFLDNVTTILLMAP VSILLAKQLKLDPFPFVMTEVLASDIGGMATLIGDPTQLIIGSEGKLNFNEFLFNTAPMT VIALIILLTVVYFTNIRKMKVSNELKARIMELESERILKDKKLLKQSMIILTAVIIGFVL NNFVNKGLSVISLSGGIFLAFLTEREPKKIFGGVEWDTLFFFIGLFIMIKGIENLGIIKF IGDKIIEISTGNFKVASISIMWLSSIFTSIFGNVANAATFAKIIKTIIPDFQNIANTKVF WWALSYGSCLGGSITMIGSATNVVAVSASAKAGCKIDFMKFFKFGSKIAILNLIAATVYM YLRYL >gi|296153776|gb|ADVK01000051.1| GENE 27 35626 - 36900 1467 424 aa, chain - ## HITS:1 COG:FN1925 KEGG:ns NR:ns ## COG: FN1925 COG1055 # Protein_GI_number: 19705230 # Func_class: P Inorganic ion transport and metabolism # Function: Na+/H+ antiporter NhaD and related arsenite permeases # Organism: Fusobacterium nucleatum # 1 424 1 424 424 705 100.0 0 MLYLGILIFIVVFYCIITEKVPSSWATMAGGLLMTLIGITSQEQVLETIYTRLEILFLLV GMMMIVLLISETGVFQWFAIKVAQLVRGEPFKLIILLSIVTAVCSAFLDNVTTILLMAPV SILLAKQLKLNPFPFVITEVMSANIGGLATLIGDPTQLIIGAEGKLTFNEFLLNTAPVAI LSMISLLATVYFMYAKDMKVSNELKAKIMELDSSRSLKDIKLLKQSIVIFSLVIIGFILN NFVDKGLAMIALSGAVCLSLLAKKNPKEMFEGVEWETLFFFIGLFMMIKGIENLDIIKFI GDKMIHLTEGHFGGAVFSTMWISAVFTSVIGNVANAATFSKIINIMTPSFSGVAGIKALW WALSFGSCLGGNLSLLGSATNVVAVGAADKAGCKIKFVQFLKFGGIIAIENLIIASIYIY FRYL >gi|296153776|gb|ADVK01000051.1| GENE 28 36917 - 37852 1194 311 aa, chain - ## HITS:1 COG:FN1926_2 KEGG:ns NR:ns ## COG: FN1926_2 COG0517 # Protein_GI_number: 19705231 # Func_class: R General function prediction only # Function: FOG: CBS domain # Organism: Fusobacterium nucleatum # 145 311 1 167 167 297 100.0 2e-80 MKFSSYLNPDYIFPCLEVESKEEIIRTIVDKVAEDNKMVSEQKYEIIKNILKREEEISTC 
IGSGIFLPHTRMIDFSDFIIAVATVKNKLEAEIGGTNQTDDIKVVFLIISDVLKNKNLLK AMSAISKIALKNPEIIEKIKMATHEKQILELLSANDIEIEHKIIAEDVLSPEIKPAREND TLEEIAKRLILEQKSALPVLTEDGVLLGEITERELIGFGMPEHLALMSDLNFLTVGEPFE EYLLNESTMTIKDIYRKDIRHLIIDKETPIMEICFKMVYKGMHRLYVVNPKNNKYLGIIN RSDIIKKVLHI >gi|296153776|gb|ADVK01000051.1| GENE 29 37980 - 40517 3171 845 aa, chain - ## HITS:1 COG:FN1927_1 KEGG:ns NR:ns ## COG: FN1927_1 COG1461 # Protein_GI_number: 19705232 # Func_class: R General function prediction only # Function: Predicted kinase related to dihydroxyacetone kinase # Organism: Fusobacterium nucleatum # 1 560 1 560 560 1022 99.0 0 MKIEIKILNPVRLTKLFIAASRWLSKYADVLNDLNVYPVPDGDTGTNMSMTLQSVENALI GLQSEPNMEELVDIISEAVLLGARGNSGTILSQIIQGFLDAVRDKEEIDIETAAKAFVSA KERAYKAVSQPVEGTILTVIRKVSEAAMAYDGPKDDFIPFLVNLKNAAADAVEDTPNLLP KLKEAGVVDAGGKGIFYVLEGFEKSVTDPEMLKDLARIANSQVNRKQKLEYINKNEIKFK YCTEFIIESGSFDLDEYKEKIGKLGDSMVVAQTRKKTKTHIHTNNPGQALEIAGSLGDLN NIKIENIEIQHSHVLVKEEELNKVDIRGIVKETVPEEPKLLFNEKNIENSVAIYAVVDNK NIADLFLKDGASATLIGGQTKNPSVSDIEEGLKKIKAKTIYILPNNKNIIASAKLAAKRD NRDIIVIDTKTMLEGHYFTKNKKMNLQTLLRQLKFNNSIEITKAVRDTKVNDIEIKVGDN IALVNGTLTEKAEKVEDLIKKIYEKYTNDNTLAVTLVRGKTATEEGNEIIKSKNFKKFYE YDGEQDNYSYYIYLEQRDPSLSKIAILTDSASDLTPDMIEGLDVTVIPIRLKIGENNYKD GVNLSKKEFWHKLMTEKVVPKTAQPSPAEFRDYYEELFNKGYEKIISIHISSKMSGTQQV AKVAREMLKREKDIIIVDSKSVTFGQAYQVLEAAKMIKSGVKLDDILTRLYEIADKMKVY FAVSDLTYLEKGGRIGRASSVIGNLLKLRPVLKLEDGEVCLETKTFGERGAISYMEKIIK NEGKNSIYLYTAWGGTNQELQNTDVLKRTADTMRKIEYKGRFEIGATIGSHSGPVFGIGI ISKIR >gi|296153776|gb|ADVK01000051.1| GENE 30 40529 - 41083 761 184 aa, chain - ## HITS:1 COG:FN1928 KEGG:ns NR:ns ## COG: FN1928 COG1396 # Protein_GI_number: 19705233 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: Fusobacterium nucleatum # 1 184 1 184 184 320 100.0 1e-87 MTIGEKLKKSRNDKGMSLRELATKVELSASFLSQIEQGKASPSIENLKKIAHTLDVRVAY LIEDEEDDIRNIEYIKKENIRYIESLDSNIKMGILLSNNREKNMEPIIYEIGVDGESGRD FYSHGSSEEFIYILEGELEVYVANKKYKLSKGDSLYFKSSLKHRFKNASKKEVKALWVVS PPTF >gi|296153776|gb|ADVK01000051.1| 
GENE 31 41090 - 42298 295 402 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229231897|ref|ZP_04356325.1| SSU ribosomal protein S12P methylthiotransferase [Cryptobacterium curtum DSM 15641] # 245 399 746 904 904 118 37 1e-25 MKAGIFLVGTELLNGATIDTNSIYIAEELNKYGIEIEFKMTVRDVMCEITKALTYAKKNV DLVILTGGLGPTDDDITKEAMAKFLKKKLVVDEKEKKELLKKYKAYKNPNKTNFKEVEKP EGAVSFKNDVGMAPAVYIDGMVAFPGFPNELKNMFPKFLKYYVKENNLKSQIYIKDIITY GIGESVLETTVKDLFTEGDIFYEFLVKDYGTLIRLQTKIENKKNVAKIVKKLYNRISEFI IGEDDDRIENTIYECLNLGEKPLTISTAESCTGGMVASKLIEVPGISENFIESIVSYSNE AKIKRLKVKKETLEKYGAVSEEVAREMLAGLKTDVGISTTGIAGPGGGTKDKPVGLVYIG IKVKNEVKVFKRELKGDRNKIRQRAMMHALYNLLKILSKKVR >gi|296153776|gb|ADVK01000051.1| GENE 32 42309 - 42824 649 171 aa, chain - ## HITS:1 COG:FN1930 KEGG:ns NR:ns ## COG: FN1930 COG1267 # Protein_GI_number: 19705235 # Func_class: I Lipid transport and metabolism # Function: Phosphatidylglycerophosphatase A and related proteins # Organism: Fusobacterium nucleatum # 1 171 1 171 171 283 98.0 9e-77 MGNHNHNHKLIKNLGTCFGLGEMSFMPGTFGTLGGIPIFLALTYIKKFFLNVMVYNSFYL VFLVTFFAISVYVADICEKEIFKKEDPQAVVIDEVLGFLTTLFLINPVGIKAILIAMLLA FIIFRILDITKIGPIYKSQSFGNGVGVVLDDFLAGIIGNFILVFIWTKFFY >gi|296153776|gb|ADVK01000051.1| GENE 33 42824 - 44986 2446 720 aa, chain - ## HITS:1 COG:FN1931 KEGG:ns NR:ns ## COG: FN1931 COG0826 # Protein_GI_number: 19705236 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Collagenase and related proteases # Organism: Fusobacterium nucleatum # 1 720 1 720 720 1208 95.0 0 MKIVAPAGNMERFYSAISATADEIYLGLKGFGARRNAENFTVEELKKAIDYAHLRGSRIF LTLNTIMTNREIELLYPTLKDLYNYGLDAIIVQDIGYADYLHKNFPSIEIHGSTQMTVAN YYEINYLKELGFKRIVLPRELSFEEIKEIREHTDIELEVFVSGSLCISFSGNCYMSSFIG GRSGNRGMCAQPCRKEYKTSCGEKSYFLSPKDQLYGLDEIKKLQEIGIESIKVEGRMKDV SYVYETVSYFRSLINGIDKEENTYKLFNRGYSKGYFYNNGKTIMNRDYSYNMGEKIGEVV GKSIRLDEDIVSGDGVTFVSKDYKNLGGTYINKIAYKNEKLVLNFPERTKYIFRNYNKRL NDEISKKIKSTDKKLEINFDFTGKLDENLILKTYLEDENGNRILNLEEISETLTQKAQKR AISEEDIKEKLTEIGDSEFTVKDIKIDIDENIFIPLSELKNLKRNAVEKFREKILSYFRR 
DLDSELKENNQEYFKLEIEKDEPKDLEIKVIVSNEEQKNFLENIKDEYNIKKVYYRTYDI AKQSMLSQHNLDNKLASNLYELLENKNSTVMLNWNMNIVNSYTINVLEKIEKLESFIISP EINFSKIRELGKTRLKKALLIYSKLKGMTIDVDIADNKNEVITNKENDKFNIIKNEYGTE IFLDKPLNIINIMKDIKKLNVDIVVLEFTTETIEDIKKVLKQLKTRKGEYREYNYKRGVY >gi|296153776|gb|ADVK01000051.1| GENE 34 44983 - 45555 824 190 aa, chain - ## HITS:1 COG:FN1932 KEGG:ns NR:ns ## COG: FN1932 COG0237 # Protein_GI_number: 19705237 # Func_class: H Coenzyme transport and metabolism # Function: Dephospho-CoA kinase # Organism: Fusobacterium nucleatum # 1 190 4 193 193 271 90.0 4e-73 MIIGLTGGIASGKSTVSKYLAEKGFKVYDADKIAKDISEKKSVQEEIISTFGNKILDKNR NVDKKKLKEIVFENKEKLKQLNAIIHPKVIHFYKELKGKNTSEIIIFDVPLLFESGIDKF CDKILVIISDYEIQLNRIVERDKIDRDLAEKIIKSQLSNEERIKKADVVIENNSNLEDLF EKVERFCEMI >gi|296153776|gb|ADVK01000051.1| GENE 35 45751 - 47637 1920 628 aa, chain - ## HITS:1 COG:CPn0835 KEGG:ns NR:ns ## COG: CPn0835 COG0553 # Protein_GI_number: 15618744 # Func_class: K Transcription; L Replication, recombination and repair # Function: Superfamily II DNA/RNA helicases, SNF2 family # Organism: Chlamydophila pneumoniae CWL029 # 255 625 835 1187 1215 74 25.0 7e-13 MDYDFTLQLEDNNLVLISSSSLSNFLNNKVYKLKLKKYVSSGKTEENKLFFKKEIGYIKY KKIIEIIEDYSLKNNIDFYISKELEEYIYKREMYINERSRVGLGIKKQSEEVLEKYQKYK TIIDEQMSRKLREKQAWDSFFMFTMKKSANFSVPGSGKTSSVYGVFSFLSSQGLVDKIVM IGPKNSFISWKDEFYNCFNDKRDLKLFNIQDYSSTRSKKNALLYEAAHKKNLLLFNYESL DSILEEVKTIIDDKTLLVFDEVHKVKNPNGKRAKNSLEISYNAKYTIALTGTPIPNSYLD IKNLLNILYHEEYNDFFGFKDVQLQNTSEYDIEDINKKIQPFFCRTTKKQLEVPEANPDI IILCKVSDKENQIFNILLLKYAKNRLALIIRLLQLESNPKMLLKAISENQEDFSDILDTT LDPGFIDYIDYSQDIIDLINSIDRTKKFNFCIQQVEQLNSEGKSVIVWCIFVDSIRQLAF QLEKKGISVGVIYGSTSEEERRDILNKFKEKEIDVLITNPHTLAESVSLHSVCHDAIYYE YSYNLVHLLQSKDRIHRLGLKEGQYTQYYFLHSIFLTRDGFEYSLDQKIYQRLLEKERIM LEAIDKDILESLGSIEDDIEIIFKDLKF >gi|296153776|gb|ADVK01000051.1| GENE 36 47627 - 48844 1519 405 aa, chain - ## HITS:1 COG:no KEGG:CLOST_1638 NR:ns ## KEGG: CLOST_1638 # Name: not_defined # Def: hypothetical protein # Organism: C.sticklandii # Pathway: 
not_defined # 1 402 2 403 405 432 60.0 1e-119 MNLLENGLLEKFIFKTENKKRLVIGNIPKDYPIYKIRLDKLFYNDQNDRISTWISEYKMK NNIDKIDYSDKESYNKIIHEFITNSNYDAMKRTQQNIEAIGQIEAGVVLADGRIIDGNRR FTCLRNIQEKNQKEQYFEAVILDYSIEHNEKQIKMLELVLQHGVDEKVGYSPIERLVGIY HDIVETKLLTIAEYAKSVNDTGKNIEIEVEKAKLLAEFLEFINAKKQFYLARHLELADPL KELYVMLKKINDEETQDDLKNAVFSQFICKPAGDLTRYIRKIKDIASNPKHLKDYLDEQS PVVEKVCDKLEGFEQLTNKEIGELGEDKGIKEEFSRITEKYVSRVAGDITRNAPLKSMEK ACYALDEIDLKILHKLKDEQIKDLKEKITELENITQKIKEEISGL >gi|296153776|gb|ADVK01000051.1| GENE 37 48845 - 51784 2634 979 aa, chain - ## HITS:1 COG:no KEGG:Amet_2144 NR:ns ## KEGG: Amet_2144 # Name: not_defined # Def: RNA polymerase subunit alpha # Organism: A.metalliredigens # Pathway: not_defined # 6 979 9 912 912 623 41.0 1e-176 MKKDTSIQILHLSKRSYNVLKKFNIITIQDLLTISKEDIKNFRGIGEKSIQEILSKIEIL KDYNFDDIDISEVNVENLSKNKYFLNKYGIKYQDIFIEDLGISEKSIDFLKSLNIECYSE LLTKTEKEFESIKYSEEIIQKEIKSIRKKPNIETAINLEKDIPIDFLGLSVRAKNCLKSA NIEYYSQLISKTEEELMTIKHMGVTTLKELQRLKFLIFFYFGFPVVDTENKESKEEKISN ESINFITKIAKILDCNTEKLILNISNCYFSLVPNRDSIKEIDYITKNILSLLWEDEYGKE KWIKYIIKEISKNIYGVKEDALWENISEFFKDKKLYKKTIEYLCETNIIKNLYDDRFIVI YKSIKEEVHNYLKESEAKIILSRISGKTLEEIGEALNVTRERIRQIETRGFKKLYLEKFR EDFFLDIYLKYDVNREAFFSVLKEEETYNYLSLRYRDELNQVKDFRKPLQDILEDENIPV IIRRNFEKFIYKNYITFDKERILFSKASFTDYLIKHFANEELSYNEFKEIYYMFLNDLGY EEEESLKILDRGYENRIRDNMNVLWKSGRKFRYYNILGYDFKELLETLNLNQYENEEYST LKFFRMYPDLMEIYDIHDEYELHNLLKKICTEDKYPQIKFNRMPSIEFGEADREQQVKEL LSLLSPISKQDFVKEYEDFYGVDSKTFAANYLFCIEKYFCNGVYDIKFEEYDDIILTDVK EILFEELYTVQEIKEKFKRTFPDYKKELLNPIMLKKLGYKISGGYIIKNQYNSASDYFFQ FLQKNEIIKLDNISSRIKNLPMFVLQLYKLKEEYEIIEFLPNCFINFSKLKNLGITKQYL KEYCLNVLDFVGRNKYFTLFSLKKEGLYHELDELGFEDYFYISILIEDRERISYKRIGKN KLMYSSNEGASFESFLEYIVYQQEKLYIEIYELNDLLKEKYNLQFNTYDLISSIKSTSMY YDPISKTVFADYEIYYEVI >gi|296153776|gb|ADVK01000051.1| GENE 38 52257 - 52454 155 65 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MAENLFYIGFFIIFLLSFFIILEKFVNEDEFNKKLLKLETIVIIFVFGLILSRELSNITM LIIFL >gi|296153776|gb|ADVK01000051.1| GENE 39 52557 - 52688 152 43 aa, chain - ## HITS:1 COG:no 
KEGG:no NR:gi|256846593|ref|ZP_05552050.1| ## NR: gi|256846593|ref|ZP_05552050.1| predicted protein [Fusobacterium sp. 3_1_36A2] predicted protein [Fusobacterium sp. 3_1_36A2] # 1 43 36 78 78 70 93.0 4e-11 MIIGGFLLVYIGREVGGYGAIIGSITIAIGSVIWNKQKNKDKN >gi|296153776|gb|ADVK01000051.1| GENE 40 53579 - 54580 1160 333 aa, chain - ## HITS:1 COG:FN1935 KEGG:ns NR:ns ## COG: FN1935 COG3392 # Protein_GI_number: 19705240 # Func_class: L Replication, recombination and repair # Function: Adenine-specific DNA methylase # Organism: Fusobacterium nucleatum # 1 333 1 333 333 604 99.0 1e-173 MNYIGSKLSLKSFIKDTILEISKIDNENKVFADLFAGTGVIGSEFKKMGYKVIANDIQHY SYILNKHFIENNSPINIELLEHLNSIEGKEGFIYNNYCIGSGSERNYFSDFNGKKCDAIR QELEKLYRDKVIDEHQYNYFLASLINSIDKHANTASVYGAFLKSLKKSAVKDFELELLPI IDGNKDGKAYNEDINILIKKIKGDILYLDPPYNARQYSANYHLLETISRYDDPIIKGKTG LRDYSNQKSKFCSKSQVNQVFEELISDADFKYIFLSYNDEGLMSLETIKEIMGKYGKYQC FTTDYKRFRADKEENRNHKKSSTIEYLHCLIKK >gi|296153776|gb|ADVK01000051.1| GENE 41 54661 - 55452 720 263 aa, chain - ## HITS:1 COG:no KEGG:FN1936 NR:ns ## KEGG: FN1936 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 69 263 2 196 196 347 99.0 4e-94 MSNSTDKILQELEEERVRRTMLIKENLQKAYDELEKENFPVTKRIKFIADLGACKKIAYH YELICKDWEEGKKLNIESSFDRHGSEGIEFLFKQLSKIEDEKIRIFTVFLLAEVLSKLRH KEFYSSFCNQLILILKSLLNTNDEFLRRKIIIAFAWVGTSKEIDILTQLMLNDSDALCRA WSATSLMQMSFHRVDKEIICKKTKNIFVQAIEKEKDLYTCGIIIEAVQILFGKRWISSSA VENIELEKIEKARKAAVRFLNKY >gi|296153776|gb|ADVK01000051.1| GENE 42 55469 - 55936 326 155 aa, chain - ## HITS:1 COG:no KEGG:FN1938 NR:ns ## KEGG: FN1938 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 155 1 155 155 286 100.0 1e-76 MYFIAVILFLLGFLSISLGRISSDNVKNINALLSDDRNIQETKGSLQVIEIRSTRHSFEC DCELIFINHNGKEFSYKETYSGFNSKASFLWKCENKGKVPITVIYNKRLPSKHFVKELKP LEVNKNSRIGYIIIGILFMLLGIFIIAVNFKLKIK >gi|296153776|gb|ADVK01000051.1| GENE 43 56014 - 57060 1149 348 aa, chain - ## HITS:1 COG:no KEGG:FN1939 NR:ns ## KEGG: FN1939 # Name: 
not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 26 348 1 323 323 490 99.0 1e-137 MIMEELMKKYLILFLSIFIFTSCTRMAINSANNKAARNEFYKGILELDKSVRKNIDNREL FESYENIFNRGRDYYNSTNETRELFLMEKLYLDLPDNIKQKLSGINVDINKHRKNGERIA DDLFENTQSMGENTYREKIKKYKQYKKVLTYNPNEKNKVENELKKLDKKIEKTYSYRING NDSTLNSEIENKFSQEIKNNLFRYSSANPDVRLEINVDMVYFYPEDVDMKSFPKQYTENY KDSNGKDKTNIVSYSENIFKKTTSMGVRLNYRLVSNLTGEIIFNGSKNFDKKYEEKWKTY FIISDKIFNRNRLPKDENEKSVPSKKKIVEDITKEILSTIDADFHKLP >gi|296153776|gb|ADVK01000051.1| GENE 44 57105 - 58475 1076 456 aa, chain - ## HITS:1 COG:FN1940 KEGG:ns NR:ns ## COG: FN1940 COG0534 # Protein_GI_number: 19705245 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 456 1 456 456 779 99.0 0 MKINWKIFREIIYLAIPAVGEMTLYMMIWIFDTMMIGKYGGELAVSSVGLSTEIIYSFFN IIIAVGVSTALTSLISRAIGSKDYKKAETIANAGIKIAVVLAFIFFSLLFFVPGKILNLA GATKEMLPLATRYAKISSFSFFLLTISSTTNGVFRGVKDTKTSLYVAGSINIVNLFLDYV LIFGNLGFPEWGITGAAVATVAGNFMGILLQWSRLKKLPFKISLFSYVSKKDIWEIIRFA VPSGLQEANFSLSRLLGLTFILSLGTTAFAANQIGIAIEAISTMPGWGVAIACTALVGHS IGENKANKSQEYTLYSIIIASIFMGVLAFFFFFIPKTLISFFINKQEIDVIRIGAMCLQV AAFEQIPIAFVTVLGSYFKGIGNPKIPFYVSFFTNWFLRIPVAFYLISILKLPVHIFWII TTFQWLLESIVLYYLYRKNINTILKNTSVSNIIDKI >gi|296153776|gb|ADVK01000051.1| GENE 45 58867 - 61443 1810 858 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163764771|ref|ZP_02171825.1| ribosomal protein S8 [Bacillus selenitireducens MLS10] # 6 857 5 811 815 701 44 0.0 MMNPNQFTENTISAINLAVDISKGNMQQSIKPEALALGLLMQNNGLIPRVIEKMGLNLQY IISELEKEMNNYPKVEVKVSNENISLDQKTNSILNRAEKIMNEMEDSFLSVEHIFKAMIE EMPIFKILGISLEKYMEVLMNIRGNRKVDNQNPEATYEVLEKYAKDLVELAREGKIDPII GRDSEIRRAIQIISRRTKNDPILIGEPGVGKTAIVEGLAQRILNGDVPESLKNKKIFSLD MGALVAGAKYKGEFEERMKGVLKEVEESNGNIILFIDEIHTIVGAGKGEGSLDAGNMLKP MLARGELRVIGATTIDEYRKYIEKDPALERRFQTILVNEPNVDDTISILRGLKDKFETYH GVRITDTGIVEAATLSQRYISDRKLPDKAIDLIDEAAAMIRTEIDSMPEELDQLTRKALQ LEIEIKALEKETDDASKERLKVIEKELAELNEEKKVLTSKWELEKEDISKIKNIKREIEN 
VKLEMEKAEREYDLTKLSELKYGKLATLEKELQEQQNKVDKDGKENSLLKQEVTADEIAD IVSRWTGIPVSKLTETKKEKMLHLEDHIKERVKGQDEAVKSVADTMLRSVAGLKDPNRPM GSFIFLGPTGVGKTYLAKTLAYNLFDSEDNVVRIDMSEYMDKFSVTRLIGAPPGYVGYEE GGQLTEAIRTKPYSVILFDEIEKAHPDVFNVLLQVLDDGRLTDGQGRIVDFKNTLIIMTS NIGSHFILEDPNLSEDTREKVADELKARFKPEFLNRIDEIITFKALDLPAIKEIVKLSLK DLENKLKPKHITLEFSDKMVDYLANNAYDPHYGARPLRRYIQREIETSLAKKILANEIHE KSNVLIDLDNNHIVFKEV >gi|296153776|gb|ADVK01000051.1| GENE 46 61520 - 62209 548 229 aa, chain - ## HITS:1 COG:FN1942 KEGG:ns NR:ns ## COG: FN1942 COG2964 # Protein_GI_number: 19705247 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 229 1 229 229 398 100.0 1e-111 MKNELLNQYKILVNFLGKTLGPSFEIVLHEIKSEEVKMIAIANGEISNRTLENSISSETL DILKNKSYHNEESMVNHTVLLKNGKKVRSSSMFIKENQKIIGMLCINFDDSKFHDINCQI LRIIHPDMFVKNYLSDISYNILVDENKDQSNEENSTDNIEAFMEKIFQDVNLKFNYPLER LTKQEREKIVKALYEKGIFNLKDAINFVAKKLSCSPTTVYRYVGKIEKG >gi|296153776|gb|ADVK01000051.1| GENE 47 62421 - 64058 2404 545 aa, chain + ## HITS:1 COG:FN1943 KEGG:ns NR:ns ## COG: FN1943 COG3033 # Protein_GI_number: 19705248 # Func_class: E Amino acid transport and metabolism # Function: Tryptophanase # Organism: Fusobacterium nucleatum # 1 545 1 545 545 1130 100.0 0 MKKYLLDVPVPRSFSYVKRNIPEVTVEQRERALKATHYNEFAFPAGMLTVDMLSDSGTTA MTDQQWSAMFLGDESYGRNKGYYVLLDAMRDCFERGDNQKKIINLVRTDCQDIEKMMNEM YLCEYEGGLFNGGAAQLERPNAFLMPQGRAAESILFEIVRKILAVREPGKVFTIPSNGHF DTTEGNIKQMGSVPRNLYNKELLYEVPEGGRYEKNPFKGDMDINKLQQLIDAVGIENIPM IYTTVTNNTICGQAVSMKSIRETAKIAHKYEIPFMLDAARWAENCYFIKMNEEGYRDKSI AEIAKEMFSYCDGFTASLKKDGHANMGGILAFRDKGYFWKKFSDFNEDGTVKTDVGILLK VKQISSYGNDSYGSMSGRDIMALAAGLYECCNFNYLQERVEQCNYLAEGFYKAGVKGVVL PAGGHGVYINMDEFFDGKRGHETFAGEGFSLELIRRYGIRVSELGDYSMEYDLKTPEQQA EVANVVRFAINRSVYSQEHLDYVIAAVKALYEDRESIPNMRIVSGHNLPMRHFHAFLEPY PNEEK >gi|296153776|gb|ADVK01000051.1| GENE 48 64182 - 65516 1844 444 aa, chain + ## HITS:1 COG:FN1944 KEGG:ns NR:ns ## COG: FN1944 COG0733 # Protein_GI_number: 19705249 # Func_class: R General 
function prediction only # Function: Na+-dependent transporters of the SNF family # Organism: Fusobacterium nucleatum # 1 444 16 459 459 763 99.0 0 MSAIEKRDGFTTKWGFILACIGSAVGMGNIWRFPVLVSELGGMTFLIPYFIFVILIGSTG VIEEFALGRAAGAGPVGAFGMCTEMRGNRSIGEKIGIIPILGSLSLAIGYSCVMGWVFKY AWMSINGSMYAMQSNMEVIGSTFGQTASAWGANFWIIIALIASFIIMSMGVSGGIEKANK IMMPILFVLFVLLGIYIVFQPGSSNGYKYIFTVNFEGLLNPKIWIFAFGQAFFSLSVAGH GSVIYGSYLSKSEDIPNSARNVALFDTLAALLAAFVIIPAMAVGGAELSSGGPGLMFIYL VNIMNNMAGGRIIEVIFYLCILFAGVSSIINLYEAPVAFLQEKFKANRVTATAIIHIIGC IVAICIQGIVSQWMDVVSIYICPLGALLAAIMFFWVAGKEFAEEAVNMGATKKIGSWFYP AGKYIYCLLALVALIAGALLGGIG Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:01:34 2011 Seq name: gi|296153767|gb|ADVK01000052.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00064, whole genome shotgun sequence Length of sequence - 13098 bp Number of predicted genes - 9, with homology - 8 Number of transcription units - 3, operones - 3 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 337 - 399 64 ## 2 1 Op 2 . - CDS 424 - 963 410 ## gi|296329107|ref|ZP_06871611.1| hypothetical protein HMPREF0397_1804 3 1 Op 3 . - CDS 938 - 1441 402 ## gi|294784562|ref|ZP_06749851.1| conserved hypothetical protein 4 1 Op 4 . - CDS 1452 - 1667 232 ## gi|260495778|ref|ZP_05815899.1| hemolysin 5 1 Op 5 . - CDS 1651 - 2481 1248 ## gi|296329110|ref|ZP_06871614.1| hypothetical protein HMPREF0397_1807 - Term 2503 - 2544 2.1 6 2 Op 1 . - CDS 2561 - 3466 695 ## FN0289 hypothetical protein 7 2 Op 2 . - CDS 3463 - 4359 1128 ## COG3210 Large exoproteins involved in heme utilization or adhesion - Prom 4387 - 4446 2.2 8 3 Op 1 . - CDS 4509 - 5015 619 ## FN1599 hypothetical protein 9 3 Op 2 . 
- CDS 5042 - 12946 9950 ## COG3210 Large exoproteins involved in heme utilization or adhesion - Prom 12976 - 13035 4.8 Predicted protein(s) >gi|296153767|gb|ADVK01000052.1| GENE 1 337 - 399 64 20 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MTDTRYNNLKIMNDKNYTKK >gi|296153767|gb|ADVK01000052.1| GENE 2 424 - 963 410 179 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329107|ref|ZP_06871611.1| ## NR: gi|296329107|ref|ZP_06871611.1| hypothetical protein HMPREF0397_1804 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_1804 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 179 1 179 179 210 100.0 3e-53 MKVVTKKIKNTDEYSIIRYSISGMIVKTIIFIILYIIYTYLGIYNFGEILSSKIIIKYII RSLPFFIFLYSEVILTSSKELLLIKENNLILKKYILFFCYYSKVIKLEDIRKIYYEKVAF KDFPVLVFPTDLLKKIKFRVKESEFEDKIYAFGYKMSEYESYKIIEEVEDVIKVKKFYI >gi|296153767|gb|ADVK01000052.1| GENE 3 938 - 1441 402 167 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|294784562|ref|ZP_06749851.1| ## NR: gi|294784562|ref|ZP_06749851.1| conserved hypothetical protein [Fusobacterium sp. 3_1_27] conserved hypothetical protein [Fusobacterium sp. 3_1_27] # 1 167 12 178 178 279 99.0 6e-74 MKTVTKRKDFICIVYDEELLKKNITSIQFNLLLCLILPIHISAMYNRNWIYVYIVFCIFF IMDYNNQIPFFEIGSKVKITMYNDRIEIMRRKRNALHLYEEIKNIEYQTKRFGNSDYYSL KITKTNKKVYKYRLGLIEEEVLEIYNIIKENYEEWRIKKYESCNKEN >gi|296153767|gb|ADVK01000052.1| GENE 4 1452 - 1667 232 71 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|260495778|ref|ZP_05815899.1| ## NR: gi|260495778|ref|ZP_05815899.1| hemolysin [Fusobacterium sp. 3_1_33] hemolysin [Fusobacterium sp. 3_1_33] # 3 71 311 379 379 112 91.0 1e-23 MVKKINDGDDGKYIKEQLEKINPNYKNEVNLETIEFITNYLIENKDKLYKKTEFDKAYDR LKKLRAEGYYD >gi|296153767|gb|ADVK01000052.1| GENE 5 1651 - 2481 1248 276 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329110|ref|ZP_06871614.1| ## NR: gi|296329110|ref|ZP_06871614.1| hypothetical protein HMPREF0397_1807 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_1807 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 276 1 276 276 528 100.0 1e-148 MITRYPRENGYQGVIPEVLLTDEAHSFTVDSKDKETGAKRREKIYFSINDIADPNLAFSR LFGHEKAHMNTYDEGKDGEETSIHTREKIGSENKNKVFTEEEKADYLNNLRNKYKDQKSI EQQFAEAKHVPEKDKEHWAVLISTNTAIAMGLRVNEGSSIAFIVDEKSNGDMYFAQTADV GIGLGTPAIGVGISGAYFSDVNKPEDLNGWAGTLGGSFSVVGIDLHTNFGKDKKFFAGIR LNVSKGMEMDKILILNTLKEKDILVEIIAMLYGKKN >gi|296153767|gb|ADVK01000052.1| GENE 6 2561 - 3466 695 301 aa, chain - ## HITS:1 COG:no KEGG:FN0289 NR:ns ## KEGG: FN0289 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 301 1 308 308 251 54.0 3e-65 MRKIVNKNSTYFYSYAGLIPLIFFEIILLIFLFIFSREIVPAIAILIPNYFFIRAIIIVL KYFISIEECYCIEDKFYYKKILWNRWILREFEIPIYEIKKIQDNGKLILKTELQAGVGYL LYYFNPYERICIETIFGRKYNIWNYKKRPSYWSIDNLYEDDREFLDSLFSIRAMVADRRK EILFNQKIENLMERYNFPLDERYNYILNKIVDEEKLFLFKKDNNFIINGDSEAIKDLEIF KNMNFEEIDFYIFYVNYLSKKEYEGKKVLVGYNGVDGKEVTMSKFKEDINEIRDSRSTFK N >gi|296153767|gb|ADVK01000052.1| GENE 7 3463 - 4359 1128 298 aa, chain - ## HITS:1 COG:FN0290 KEGG:ns NR:ns ## COG: FN0290 COG3210 # Protein_GI_number: 19703635 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 1 293 412 717 727 291 57.0 9e-79 MNKLLNDALRAKGYKGPDIKMVLTDVEDPNGPYYTDTLTNTVVFDRKKLASANRDQILNA LGHEFGHYSKEDNKTGNQTIANYSGEKLEDRTKGMVSKEATEDTLAAIRNNKNVITGEEG KKLAESIPMDRREYYEAITFEGRIMATVIGADAGMGVIYYKDPKTGEEQFGRIENIAVKY GVSTAIEGGGTIKSYSKPNTPIQDFAGYYGGIRGTVGYGLNVNYELSVSNNEGKHGGGVG VGTGISFAGFFGKRKVIVLENLSEAEKEVIRTLVRYKGEIVDTYDFYYNLKKIKELMK >gi|296153767|gb|ADVK01000052.1| GENE 8 4509 - 5015 619 168 aa, chain - ## HITS:1 COG:no KEGG:FN1599 NR:ns ## KEGG: FN1599 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 168 1 168 168 258 85.0 5e-68 MITLEDFKNNNLKINWKVIDIGCLGSEIFKNELSYDDIINFSLEEFDEKNKLILRIVASD RDEYQEMGYLVQELANMEKSEYKLAFEKWKLVYIKKNFPKLNKNIIQGLIELNDLWVKLD 
FPEDSPYILQGVKNNISPQEYYTEENYIYLYNRHLKWIRDKSDYLNGK >gi|296153767|gb|ADVK01000052.1| GENE 9 5042 - 12946 9950 2634 aa, chain - ## HITS:1 COG:FN0291 KEGG:ns NR:ns ## COG: FN0291 COG3210 # Protein_GI_number: 19703636 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 1 1775 1 1877 1881 964 46.0 0 MKGSLKRVIAIFMLFLHIVSLADGIVPDNGVSKNLQVDKAANGVPLVNIEAPDNNGTSHN VYKDYNVDGRGAILNNSKDLTNSQLGGLIYGNPNLQNSKEASTIINEVSGVNRSRIEGYQ EIVGKKANYILANPNGIYINGAGFINTGNVTLTTGSGNNLLNPEKGTIEVAGKGLDLRNI NKAELVARVAELSAPVYGGEEVNLKLGSQGKANKPEYVLDARALGSIYAGRINIIVNEDG VGVKTQAPMYATKGDVVISSKGKVYLKDTQAKGNINIFSTETEIGEKLISENKINIENKK LLNKGKIIANKDVAVKGNVENNKLIFTNKDLNVEGDLKNTANIQTKNDIKINGKNIENTG LIVADKKININSDNINNTNKLVAKDTLDINNKILTNSGKIYSGNKTKIANQKINNLGDIT SSGEIDINSTDIENNNILANGDISINTKELKSKGKIYSDKNVSLISNKIENNELTAKNLK IVTDKLNNNTKIATTANIDITAKNLVNKGMIYSTGKNDLKVTDLRNSGNILSVGNMNINQ NKNLINSGKIQSNEDIVINSENIENNELIGKNINITTNSLKNNSKIVAKANNFITTKDLV NIGLLYSTGKNDLKVTDLRNSGNILSVGNMNISQNKNLINSGKIQSNEDMVINSKDIKNN KELIAKNINIETDKLENTDKLIATNNMIVNSEILKNSSLVQALKMTLIGDTITNNGNVLG VNDITIKNKNLKNNGSLVNNGRIQANNTLELYLKDIVNNSKVFSKDKINIESKNLNNKNE IVAVGKIVVNSDSLENDNTKGVIFSKDELNITSNKIDLTRNIGAGKLLKLTTNKLERPDS YITGSDLDITINGDYTNNKELIGKNLKLTAYNLENNSIMASAGNTELKGNNSFKNNENSL LYGRELVKLEGRNFINKGEVSSFGDLNMNFTGDITNLKTIEVVGNGIIKANNYINKGYLT GNHSYKWVEGSKSNINKNNLPKELIEKANRDVENNKHGKFRGWDEPEANIEWVKEAESNY KSNKAYLKIGGNLTFNITNKLLNQEANILAGKNITINAGELNNTREGKDVDITITFARKY HYRYHHHGHNKTGHGYFRADEEYKQTLYSDKPTQIIAGGNININAKKIGNGEYQDYKSGY INDIKKVEKASNIKDIDINKNIDTETTSNVRKDGSSKIDDTFEITNNSIVKKIKEDSAVG IEDYIEIPKNDNGMFIVNKKGANPKFSYLIETNPKMIDKGFYLSSDYFFSRIKFNPDKNI RLLGDAFYENRLITRAVLEGTGKRFLYSNDVNEERKKLFDNAVAAQKDLNLSLGIALSKE QINNLKSDILWYVEEVVNGEKVLVPKLYLSKNTLKSIVEEQGNIIKAGGNFVINNASIVD NSGKIIAKNNVVINSKNIYQNAAYSDTGIYATNIGLVAKENIENIGGNIVATNNVSIYSE NGDIKNSKKLSIHENDYHNVYTDVRGSGDIVGNKISIVANNVENLGADIKAQDKIQIGAK 
KNLVIGNLEAIDKKVKNGGKDFVLDEKKTNVGSNLRAKDISLTSLENIGITGSNVVADNK MNIGAKGDISIISGKDSILHEEKHSKSKGFGRSQSSTDIAYATHNVASNIIGDKVNITSE KNISLLGSNVQANTEGQIKANGNITQAGVKDINYSYHKTTKKGFMGLTSKSVTDENYAEK AILSATLAGDKGLTYDSKNNLILSGVKVVSSGSINLKGKNVEINPLETKSYNKHEEVKKG FSGSFSPKGISVSYGKDKLESKTDILNQTASQIVSNKDINIEATDKVKAKSVDIYAKNDV NISGDNGVEISTANNSYDNTTKQSSSRIGASVGINSAIVNTVENVRDIKKLTDFSGNSYD ILNNASKVVGAIKDGAAATNSLINYKYAGIDSTGAETLKNSPNIFKANISYNKSESKSSV HNETVEKSSLVSGRNMNIKSKNGSITISGTDVKVGNDLDLSAKKDITIKASEENFTSSNS SSQMGIGLSADLSKGKIADLSISKAGTKGRGNGTNYINSTVNVGGKLKTNSENLTLSGAN VEADKLDIKVKNLVIESKQDKSERKDSSYGGSFSIDLANPSNFSANINGSKGNGEKEWVN KQTTLIARNGGKVDTDSLTNIGAVIGSENEKEKLKVSANKVIVKDLEDKNKYENIGGGIT IGTDVPNTSIKHDKVDKEQINRASAINTDFEISGKKTSTEDLGFNTDIDKAQEKTKDEEK HLDAELHTDLIGEDKRNEIKYAFKKLGSLHEILDQKKFKESMEGVLLDKFKDEHQKEFNL IKDENLSLEDKQKLAQNLVERYLRENGYQGVIPEVLLTDEAHSFTVDSKDKETGAKRREK IYFSINDIADPNLAFSRLFGHEKAHMNTYDEGKKGEETAIHTREKIGSENKNKVFTEKEK ADYLNNLRNKYRDQKSIEQQFAEAKHVPEKDKENFIYEIAFEILENNPYIIEETLALIDS ALLLNLSEEQVRARAEKNKKELDKLFGNNDKDEIDNLNDIKNYKSIIISGKKNIEGEKEG KEEQVTNVESASISSPSPLPDPNKDDKDKKENEVNSLERTGSATKEDPYHAFNDIIDNYA DEAHKFEINNGELYQVEGSLNGVDGRFEWIIEDGKVTHRMFVPNGKITGIPIKP Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:02:31 2011 Seq name: gi|296153738|gb|ADVK01000053.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00065, whole genome shotgun sequence Length of sequence - 29561 bp Number of predicted genes - 30, with homology - 28 Number of transcription units - 5, operones - 5 average op.length - 6.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . - CDS 3 - 912 1067 ## FN0091 phosphoserine phosphatase (EC:3.1.3.3) 2 1 Op 2 2/0.000 - CDS 936 - 1385 565 ## COG4766 Ethanolamine utilization protein 3 1 Op 3 . - CDS 1396 - 2478 2066 ## COG3192 Ethanolamine utilization protein 4 1 Op 4 . - CDS 2478 - 2813 404 ## FN0088 hypothetical protein 5 1 Op 5 . - CDS 2826 - 3074 429 ## COG4576 Carbon dioxide concentrating mechanism/carboxysome shell protein 6 1 Op 6 . 
- CDS 3076 - 3663 615 ## FN0086 hypothetical protein 7 1 Op 7 . - CDS 3663 - 4430 821 ## COG4812 Ethanolamine utilization cobalamin adenosyltransferase 8 1 Op 8 . - CDS 4492 - 4611 72 ## - Prom 4660 - 4719 10.0 9 2 Op 1 2/0.000 - CDS 4728 - 6170 470 ## PROTEIN SUPPORTED gi|148544941|ref|YP_001272311.1| 50S ribosomal protein L29P - Prom 6209 - 6268 5.2 - Term 6236 - 6273 1.0 10 2 Op 2 5/0.000 - CDS 6278 - 6562 546 ## COG4577 Carbon dioxide concentrating mechanism/carboxysome shell protein 11 2 Op 3 4/0.000 - CDS 6597 - 7004 759 ## COG4577 Carbon dioxide concentrating mechanism/carboxysome shell protein 12 2 Op 4 . - CDS 7015 - 7668 982 ## COG4816 Ethanolamine utilization protein 13 2 Op 5 . - CDS 7670 - 7747 59 ## 14 2 Op 6 8/0.000 - CDS 7774 - 8661 1578 ## COG4302 Ethanolamine ammonia-lyase, small subunit 15 2 Op 7 5/0.000 - CDS 8673 - 9968 1871 ## COG4303 Ethanolamine ammonia-lyase, large subunit - Prom 10005 - 10064 6.4 16 2 Op 8 2/0.000 - CDS 10068 - 11498 1618 ## COG4819 Ethanolamine utilization protein, possible chaperonin protecting lyase from inhibition - Prom 11518 - 11577 4.3 - Term 11521 - 11568 9.2 17 3 Op 1 3/0.000 - CDS 11611 - 12999 1290 ## COG3920 Signal transduction histidine kinase 18 3 Op 2 1/0.000 - CDS 12999 - 13577 729 ## COG3707 Response regulator with putative antiterminator output domain 19 3 Op 3 4/0.000 - CDS 13567 - 14004 480 ## COG4917 Ethanolamine utilization protein 20 3 Op 4 1/0.000 - CDS 14007 - 14348 560 ## COG4810 Ethanolamine utilization protein 21 3 Op 5 5/0.000 - CDS 14350 - 15177 1085 ## COG0294 Dihydropteroate synthase and related enzymes 22 3 Op 6 . - CDS 15164 - 15988 628 ## PROTEIN SUPPORTED gi|148994682|ref|ZP_01823786.1| 50S ribosomal protein L13 - Prom 16139 - 16198 11.8 - Term 16179 - 16237 7.1 23 4 Op 1 . 
- CDS 16360 - 16512 116 ## gi|296329136|ref|ZP_06871639.1| hypothetical protein HMPREF0397_1832 24 4 Op 2 2/0.000 - CDS 16509 - 17279 1033 ## COG3210 Large exoproteins involved in heme utilization or adhesion 25 4 Op 3 2/0.000 - CDS 17231 - 17533 319 ## COG3210 Large exoproteins involved in heme utilization or adhesion - Prom 17590 - 17649 7.9 - Term 17629 - 17677 1.7 26 4 Op 4 . - CDS 17704 - 19857 2903 ## COG3210 Large exoproteins involved in heme utilization or adhesion - Prom 19890 - 19949 5.4 27 5 Op 1 . - CDS 19980 - 21062 1033 ## gi|296329140|ref|ZP_06871643.1| conserved hypothetical protein 28 5 Op 2 2/0.000 - CDS 21089 - 23776 3190 ## COG3210 Large exoproteins involved in heme utilization or adhesion 29 5 Op 3 12/0.000 - CDS 23745 - 29408 6549 ## COG3210 Large exoproteins involved in heme utilization or adhesion 30 5 Op 4 . - CDS 29420 - 29560 94 ## COG2831 Hemolysin activation/secretion protein Predicted protein(s) >gi|296153738|gb|ADVK01000053.1| GENE 1 3 - 912 1067 303 aa, chain - ## HITS:1 COG:no KEGG:FN0091 NR:ns ## KEGG: FN0091 # Name: not_defined # Def: phosphoserine phosphatase (EC:3.1.3.3) # Organism: F.nucleatum # Pathway: Glycine, serine and threonine metabolism [PATH:fnu00260]; Methane metabolism [PATH:fnu00680]; Metabolic pathways [PATH:fnu01100]; Microbial metabolism in diverse environments [PATH:fnu01120] # 1 303 1 303 366 566 99.0 1e-160 MSIENSCVRLDEGRWNPKNREVLEKLIEKYRDTNSYAVFDWDNTSIQGDSQLNLFIYQIE NLIYKLNPQKFNEVIRKNVPTNNFKERYKNLDGEILNVTKLANDIYKDYIFLYENYISDK KLSLKEIRNTEEFKDFRAKMHYLHNALPGNFSAELACLWEFYLLSGMTKDEVKSLAKEAT DTKLGEAIGDVIVESSRVLTGEAGMVREIYDNGLRIRPEMANLYHELKRNGIDVYIISAS MQELIEVFATDKSYGYNLDVENIYAMKLKSTTDNILIDEYNYDIPFTQREGKSETINKFI KSK >gi|296153738|gb|ADVK01000053.1| GENE 2 936 - 1385 565 149 aa, chain - ## HITS:1 COG:FN0090 KEGG:ns NR:ns ## COG: FN0090 COG4766 # Protein_GI_number: 19703442 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein # Organism: Fusobacterium nucleatum # 1 149 1 149 
149 275 100.0 2e-74 MDKELLEELIRKVIKEELGKAEQSESEYKQMDKSGIGVVKLNQMKKRVKMDTGNPKDQVT TTDLFTLQESPRLGAGLMEMKETTFPWTLTYDELDYIIEGRLEILIDGRKVVGEAGDVIL IPKNSKIEFSVPNYAKFLYFVYPANWSEL >gi|296153738|gb|ADVK01000053.1| GENE 3 1396 - 2478 2066 360 aa, chain - ## HITS:1 COG:FN0089 KEGG:ns NR:ns ## COG: FN0089 COG3192 # Protein_GI_number: 19703441 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein # Organism: Fusobacterium nucleatum # 1 360 1 360 360 538 98.0 1e-153 MGINEIIIYIMVFFMAVGALDKCIGNKFGYGQKFEEGIMAMGALALSMVGIVSLAPVLAN ILRPVVGPVYAALGADPAMFATTLLANDMGGYPLAMSLAQDPMVGKFAGLILGSMMGATV VFTIPVALGIIEKEDRPYLAKGVLAGMVAIPFGCFVGGLVADFPVMTVLRNLVPIIIFAV LIIIGLWFIPEKMTTGFTYFGTGVVVVITIGLAAAIIENLTGFVVIPGMAPIDEGMGIIW SIAIVLAGAFPLVHFITKVFKKPLEKVGEKLGMNEIGAAGLVASLANNIPMFGMMKDMDP NGKVMNVAFAVCAAFVFGDHLGFTGGVDKAMIAPMIAGKLAGGILAIIIAKILFTTKKAK >gi|296153738|gb|ADVK01000053.1| GENE 4 2478 - 2813 404 111 aa, chain - ## HITS:1 COG:no KEGG:FN0088 NR:ns ## KEGG: FN0088 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 111 1 111 111 181 96.0 1e-44 MSKRFITLKNVEEQIGSGKIYLDKKAILSSSLQDYIREHNIQVVYGEETCSVKPSIEDCA CLKEEVNNTNSKADNFAEVARMVVKILKNDYGIQDEKKIMQVIKIIKEVLK >gi|296153738|gb|ADVK01000053.1| GENE 5 2826 - 3074 429 82 aa, chain - ## HITS:1 COG:FN0087 KEGG:ns NR:ns ## COG: FN0087 COG4576 # Protein_GI_number: 19703439 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; C Energy production and conversion # Function: Carbon dioxide concentrating mechanism/carboxysome shell protein # Organism: Fusobacterium nucleatum # 1 82 1 82 82 137 98.0 5e-33 MLIGEVIGNVWATKKYDGLDGLKFLIVKTEDNKRMVAFDSVGAGIGEKVIISTGSSARNI LNMRDVPVDAAIIGIIDGMDEE >gi|296153738|gb|ADVK01000053.1| GENE 6 3076 - 3663 615 195 aa, chain - ## HITS:1 COG:no KEGG:FN0086 NR:ns ## KEGG: FN0086 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 195 1 195 195 237 95.0 2e-61 
MNNFDENYIIELVKKELSRYLTDQGIEIKKEVYFLGDDHEIKEQLSQKFNFSENAKILIV SQLSLKNLYNISNAIYENEYEEKIIKFLLENKEIVIIKEGIEYSKYENIPLAVQKKYKEY LEKIKSYGIKVENKEFYINSLTKKEEIYGKKLLDLNKLKELEAKGIRRIIVENSIVTSSA EEYAKEKNIEIIKRR >gi|296153738|gb|ADVK01000053.1| GENE 7 3663 - 4430 821 255 aa, chain - ## HITS:1 COG:FN0085 KEGG:ns NR:ns ## COG: FN0085 COG4812 # Protein_GI_number: 19703437 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization cobalamin adenosyltransferase # Organism: Fusobacterium nucleatum # 1 255 1 255 255 404 98.0 1e-113 MVLSEDILKIKYRKEPFDVFEIEKGTLLTPSAKQFLNEKGIELLIKEEKPLVSTKNEEDN VEAEEKIFYEKPKYVGKNGEYYFEKPEYMTVVDGNVLISKNSKLIALRGKIETFLAEVLL TGKEIELTSNNDKLIRDIETIIKFIQNIMVAEKLNKILENQTFFDSKSIKDIKEIIENPK EYFKKGHLLEISLNSDLTIHKLNRLRFLARELEIQAIDYFVEDYKVSRKDLLEAFNILSD VIYIIILKVDNGEYR >gi|296153738|gb|ADVK01000053.1| GENE 8 4492 - 4611 72 39 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSYNVERAFVELEKHYRLSSNRYIKLKILKYFISRKTFI >gi|296153738|gb|ADVK01000053.1| GENE 9 4728 - 6170 470 480 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148544941|ref|YP_001272311.1| 50S ribosomal protein L29P [Lactobacillus reuteri DSM 20016] # 11 431 43 448 477 185 29 3e-46 MDRDLLSIQQVRDLVKSAKIAQKLYSTFTQEQIDRVVYAIVQEMKNHYVDLAKKANEETG FGKWEDKVIKNKFANEFVYDYIKNMKTVGILNETDTVTEVGVPMGIVAALTPSTNPTSTA IYKTLISLKAGNAVIVSPHPNAKSCVIDTVKLMQKAAVAAGAPEGLIGVIEIPTIEGTNE LMKSKDTSIILATGGEAMVRAAYSSGRPAIGVGPGNGPAYIEKTANVKEAVRKIIESKTF DNGVICASEQSVIVEPCNKEAVMDEFRRQGGFFLSKEESEKLGRFILRPNGTMNPQIVGK DAQTLAKLAGLNIPLNVKVLLSEQNTVSKTNPYSREKLTTILAFYVEENAEKACERAIEL LENEGEGHTLIIHSEDKDIIREFALRKPVSRMLINVGGSLGGVGATTNLAPAFTLGCGAV GGSSTSDNITPMNLINIRRVAVGVRELSDFKKDTNTSSNLDVINSEVEDMIRRIIAEYRK >gi|296153738|gb|ADVK01000053.1| GENE 10 6278 - 6562 546 94 aa, chain - ## HITS:1 COG:FN0083 KEGG:ns NR:ns ## COG: FN0083 COG4577 # Protein_GI_number: 19703435 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; C Energy production and conversion # Function: Carbon dioxide concentrating mechanism/carboxysome 
shell protein # Organism: Fusobacterium nucleatum # 1 94 1 94 94 123 100.0 9e-29 MSTLNALGMIETKGLVAAVEAADAMVKAANVTLVGKELVGGGLVTVMVRGDVGAVKAATD AGAAAADRVGELISVHVIPRPHSEVELILPKSNN >gi|296153738|gb|ADVK01000053.1| GENE 11 6597 - 7004 759 135 aa, chain - ## HITS:1 COG:FN0082 KEGG:ns NR:ns ## COG: FN0082 COG4577 # Protein_GI_number: 19703434 # Func_class: Q Secondary metabolites biosynthesis, transport and catabolism; C Energy production and conversion # Function: Carbon dioxide concentrating mechanism/carboxysome shell protein # Organism: Fusobacterium nucleatum # 1 93 1 93 148 137 96.0 5e-33 MKALGLIETKGMVGAIVAADIALKTAQVELINKECVKGGLVCIEFEGDVAAVKASVEAAV TAIKDMGIYVGSHVIPRPDDSVEKIIKRKNETSKEEVIEEKVEKIKKETKDIEEEIEEIN EILKVSKNKKQKNKK >gi|296153738|gb|ADVK01000053.1| GENE 12 7015 - 7668 982 217 aa, chain - ## HITS:1 COG:FN0081 KEGG:ns NR:ns ## COG: FN0081 COG4816 # Protein_GI_number: 19703433 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein # Organism: Fusobacterium nucleatum # 1 217 1 217 217 374 100.0 1e-104 MINDPLRASVLSVKLIPNVDAKMAEELNLPNGYRSIGIITADSDDVTYTALDEATKMAEV VIVYAKSFYGGAANANTKLAGEVIGIMAGPNPAEVKSGLNAAIDFIENGGCFYSANEDDT IPYYAHCVSRTGSYLSKTAGVEEGAALAYLIAPPLEAMYALDAALKAADVTLAAFFGPPS ETNFGGGLLTGSQSACKSACDAFAEAVKFVAQNPKKI >gi|296153738|gb|ADVK01000053.1| GENE 13 7670 - 7747 59 25 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKSILNFLSLRNLAGNELFFETNGG >gi|296153738|gb|ADVK01000053.1| GENE 14 7774 - 8661 1578 295 aa, chain - ## HITS:1 COG:FN0080 KEGG:ns NR:ns ## COG: FN0080 COG4302 # Protein_GI_number: 19703432 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine ammonia-lyase, small subunit # Organism: Fusobacterium nucleatum # 1 295 1 295 295 545 100.0 1e-155 MVSELELKEIIGKVLKEMAVEGKTEGQAVTETKKTSESHIEDGIIDDITKEDLREIVELK NATNKEEFLKYKRKTPARLGISRAGSRYTTHTMLRLRADHAAAQDAVLSSVNEDFLKANN LFIVKSRCEDKDQYITRPDLGRRLDEESVKTLKEKCVQNPTVQVFVADGLSSTAIEANIE DCLPALLNGLKSYGISVGTPFFAKLARVGLADDVSEVLGAEVTCVLIGERPGLATAESMS 
AYITYKGYVGIPEAKRTVVSNIHVKGTPAAEAGAHIAHIIKKVLDAKASGQDLKL >gi|296153738|gb|ADVK01000053.1| GENE 15 8673 - 9968 1871 431 aa, chain - ## HITS:1 COG:FN0079 KEGG:ns NR:ns ## COG: FN0079 COG4303 # Protein_GI_number: 19703431 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine ammonia-lyase, large subunit # Organism: Fusobacterium nucleatum # 2 431 26 455 455 874 99.0 0 MKKKSGDTLAGIAASSSKERVAAKVVLSKITLKDLKENPAVPYEEDEVTRIIIDDLNLQI YDEIKDWTVSDLREWLLSEEATPSKINWIRRGLTSEMIAAVAKLMSNMDLIVAAKKIEVH AHCNTTIGGYDTLAVRLQPNHTTDDPDGIMISTLEGLSYGIGDAVIGLNPVDDSVDSVMA VMERFHKIKTEYQIPTQTCVLAHVTTQMEAIKRGAKVDMIFQSIAGSEKGNEAFGITGAM IEEARQLGLKYGTAAGPNIMYFETGQGSELSSDAHNGADQVTMEARCYGFAKRFQPFLVN TVVGFIGPEYLYDSKQVIRAGLEDHFMGKLHGLPMGVDVCYTNHMKAEQSDVEILATLLT AAGCNYFMGVPAGDDIMLNYQTTGFHDNQSLRELFGKQPIKEFKEWLVKYGFMTEDGKLT EKAGDPSVFLK >gi|296153738|gb|ADVK01000053.1| GENE 16 10068 - 11498 1618 476 aa, chain - ## HITS:1 COG:FN0078 KEGG:ns NR:ns ## COG: FN0078 COG4819 # Protein_GI_number: 19703430 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein, possible chaperonin protecting lyase from inhibition # Organism: Fusobacterium nucleatum # 1 476 1 476 476 857 99.0 0 MREEINSVGIDIGTSTTQVVFSKIVLENMSSGARVPQIKIVSKDVVYRSQIYFTPLVSQT EIDAQAVKKIVEEEYRKAGMTPSSISTGAVIITGETARKSNANEVLNVLSGMAGDFVVAT AGPDLESIIAGKGSGAMSFSEKRNTAIFNLDIGGGTTNISYFDKGKVLDTTCLDIGGRLI KLNRSTMTVEYISDKFIKLIANLGLNIKVGVKVEKSEIVKLCKEIADILLQAVYFKAKTS NYELLITYKDFHNKDNRLEYVSFSGGVADLIYDDYNGDEFKYGDIGIILGKEIKKVFDVA GVKYVKVGETIGATVVGAGNYTTEISGSTITYTDVDILPIKNIPVIKMNKEDEENLFEFK ERLEQKLDWFRNNEGRQDVAVGVVGENNMKYKKIVGIAESISQVFKSVSRIIVVVESDIG KVLGQCLMLNTGGKVQIICVDNIKVNDGDYIDIGKPLGMGSVLPVVVKTLVLKNYR >gi|296153738|gb|ADVK01000053.1| GENE 17 11611 - 12999 1290 462 aa, chain - ## HITS:1 COG:FN0077 KEGG:ns NR:ns ## COG: FN0077 COG3920 # Protein_GI_number: 19703429 # Func_class: T Signal transduction mechanisms # Function: Signal transduction histidine kinase # Organism: Fusobacterium nucleatum # 1 462 1 
462 462 799 99.0 0 MLKLLCKICATLTPSDIDIVEQMSNVATILSNILDMDVFLDCPTKKEDEAMVVFHARPEK NSLYSKDISGEIAYRFNEPAVFRTFETGLPSRNYKAVTQEKANVLQNILPIFNSLDEVIC VIIIEYSEQQREFFEKEYNKKAAGILMGQIDSLKDRVTEYINDGIIIFNKNGYATYANKV AKILYEKLGVPSIVGQSFENLYFERAKYSAIIEAPNKYKQKEVRIFDFILNVQCLVSKIN EDVKRVTLIIKDITEEKKYEEELKIKTVFIKEIHHRVKNNLQTVASLLRIQKRRVKNAET KKILDETINRILSIAITHEILSATGMDTISIKHILEILCQNYFKNNVDKSKKIEFNINGD EFSISSDKATSVALVVNEIVQNATEHAFIGRDSGKVIIKILKGEKFSKIKISDNGVGMEV NRETNSMGLLIISSLVKDKLKGNLEIRSKKDKGTTIEFDFKN >gi|296153738|gb|ADVK01000053.1| GENE 18 12999 - 13577 729 192 aa, chain - ## HITS:1 COG:FN0076 KEGG:ns NR:ns ## COG: FN0076 COG3707 # Protein_GI_number: 19703428 # Func_class: T Signal transduction mechanisms # Function: Response regulator with putative antiterminator output domain # Organism: Fusobacterium nucleatum # 1 192 1 192 192 317 100.0 8e-87 MSLRVVVVEDETLTRIDLIEILKENGYDVVGEAADGIEAVEVCKKLQPDIVLLDIKIPYI SGLKVANILKEEGFKGCVIILTAYNIAEYIQEASNTIVMGYILKPIDEVIFLERLNLIYK NYKLYDDLRVEVEDTKKKLEERKVIERAKGIVMAKYTLSEEEAYKKMRDLSMQKRITMFK LSEIIILTGGFE >gi|296153738|gb|ADVK01000053.1| GENE 19 13567 - 14004 480 145 aa, chain - ## HITS:1 COG:FN0075 KEGG:ns NR:ns ## COG: FN0075 COG4917 # Protein_GI_number: 19703427 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein # Organism: Fusobacterium nucleatum # 1 145 1 145 145 251 100.0 3e-67 MKKIMLIGRTSCGKTTLTQKLMNEEVKYKKTQAVTYKSKIIDTPGEYVENKMYYKSLLVL SADAKIIVLVQSAIDGATLFPPKFSTMFPKKEVIGLVTKIDLADADIERSKRFLVEAGAT EVFTIGLNDEKGLEAIKKRLVMNES >gi|296153738|gb|ADVK01000053.1| GENE 20 14007 - 14348 560 113 aa, chain - ## HITS:1 COG:FN0074 KEGG:ns NR:ns ## COG: FN0074 COG4810 # Protein_GI_number: 19703426 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein # Organism: Fusobacterium nucleatum # 1 113 10 122 122 194 100.0 3e-50 MEKQRTIQEYVPGKQVTLAHLIANPDKDMCVKLGLDEEKTNAIGILTITPGEAAIISADI AIKSGSIELGFLDRFSGTLLLTGDFASVESSLKAVLEFLKETLKFYICEITRS >gi|296153738|gb|ADVK01000053.1| 
GENE 21 14350 - 15177 1085 275 aa, chain - ## HITS:1 COG:FN0073 KEGG:ns NR:ns ## COG: FN0073 COG0294 # Protein_GI_number: 19703425 # Func_class: H Coenzyme transport and metabolism # Function: Dihydropteroate synthase and related enzymes # Organism: Fusobacterium nucleatum # 1 275 3 277 277 503 99.0 1e-142 MKKISCGNKEIILGERTLVMGILNVTPDSFSDGGKYNNLDSAIKQAEKLILDGADIIDVG GESTRPGHVQITSEEEISRVVLIIEKISKNLNTIISIDTYKYDVAEEAIKAGANIINDIW GLQYDNGEMAELVKKSKLPIIAMHNQNDEIYSKDIMLSLREFFEKTYKIADKYGIDRDKI ILDPGLGFGKNVEQNIEVLSRLNELKDMGSILLGASKKRFIGKLLNDLPFDERVEGTVAT TVIGIEKGVDIVRVHNVLENKRACLVADGIYRKRG >gi|296153738|gb|ADVK01000053.1| GENE 22 15164 - 15988 628 274 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|148994682|ref|ZP_01823786.1| 50S ribosomal protein L13 [Streptococcus pneumoniae SP9-BS68] # 1 270 1 269 278 246 44 1e-64 MDKIYIRDLEFIGYHGLFEEEKKLGQKFFVSLELTTNLREAGLNDDITKTTHYGEVSETV KKIFFQKKYDLIETLAEDIAREVLLNYPLISELKLEIKKPWAPVGIALKDVSVEITRKWN EVYISLGTNMGKKKKNMEKAIKEVANIKDTFIIKESTIIETEPFGYKEQDDFLNSCIGVK TLLTPREILKELLSIEKKMGRERKIKWGPRIIDLDIIFYGKEVIEEDDLVVPHPYMEYRE FVLKPLEEIIPNFVHPLLSKRISTLRKELENEKN >gi|296153738|gb|ADVK01000053.1| GENE 23 16360 - 16512 116 50 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329136|ref|ZP_06871639.1| ## NR: gi|296329136|ref|ZP_06871639.1| hypothetical protein HMPREF0397_1832 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_1832 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 50 1 50 50 79 100.0 7e-14 MTFKKYLYMIRKYLKNTNKTWEICDKFYGNLRYEMSDIPYKRLHTRSMIL >gi|296153738|gb|ADVK01000053.1| GENE 24 16509 - 17279 1033 256 aa, chain - ## HITS:1 COG:FN1817 KEGG:ns NR:ns ## COG: FN1817 COG3210 # Protein_GI_number: 19705122 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 1 138 2342 2476 2806 146 66.0 4e-35 MPKDSKGSTGSAYAYIVDEKNKKVLILIDVNKIGDTKELLGTLAEEISYGEDGLAGRQDL DVAKDTTNDEEGLESLGRPINDYIKNKLEDNSSAIQLSTDGIDLTNANVGKKVGDVLLKS EIDAAGGYEAVMVTRIDEEMRKVAKPYLSYVEKIADITSRFGYVGTMRLIAESITGKNQN LTQEKVIKIYKEKPINYIEKVPDKNSPSGYKTNNVRYYDNIRVVTNENDTEFITVVEDKN NQASKNIDKKKYEKIK >gi|296153738|gb|ADVK01000053.1| GENE 25 17231 - 17533 319 100 aa, chain - ## HITS:1 COG:FN1817 KEGG:ns NR:ns ## COG: FN1817 COG3210 # Protein_GI_number: 19705122 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 1 91 2257 2347 2806 109 63.0 2e-24 MGNVVENTINPKGEDKRNIFANLRAQRGGTTFYNVIGSRAEALNEALKNGTINEEQFKEE VRKVIKGYGKDIGVDFEVVYLDKKLCQKIPKEVQVQLMLI >gi|296153738|gb|ADVK01000053.1| GENE 26 17704 - 19857 2903 717 aa, chain - ## HITS:1 COG:FN1817 KEGG:ns NR:ns ## COG: FN1817 COG3210 # Protein_GI_number: 19705122 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 1 600 1837 2455 2806 526 60.0 1e-149 MKSGDMFGGIASATNTATGIVSGLASNQGTRLPLSAVKADNTVGKDNVKLAEANNNFYAN MGVNLGFNKSTSKSSSHSESAVVTTIKGKDKDSSITYNNVKNIEYVGTQAKDTKFIYNNV DNITKKAVELNNSYSSDSKSSGVSAGVNVGYGRKVLTDNASVSVSASKSKMNSNGTSYQN GLFVNVDEEHNNTKNMTLSGFNQVGGKVTGNIENLTIESKQNTSTTTGSTKGGSIGFAPN GIPTSISANYSQTNGERKYVDDPTTFIIGEGSNLKVGKVENTAAAIGTSGNGKLSIDEYV GHNLENKDETTTKGGSLSLSQSSIPISGVGVNYANKDLESVTKNTVIGNVEIGKSSGDEI 
NKDLDTMTEVTKDEDTKTNVFVESQTIRYAVNPESFKQDLEKAKNEIHDIYHAVDSTVNP QGKESRNVLQQLAETRQAKTIYNVVDSRLEIAENQEDIAKAFEGVSEDLGYKVKVIYTDP SNSPQLIGTDKDGNTYIKDGTAYVDKATGVNYILINTKSPANRTKAGVIGTIAEEQSHVI GKIEGRQKVVPDGSEKGLESLGKPTNNYFKNQYSKNDKAIGLKSDGKDYSNVDFGENVGD NSLALEVGIIVETGVATGTGIIAGGVSAPVVAGIAVVGGIVYYAIMSEERKEELKREISE IYEDEKSIQKEGMAYDIGDKVEKRWEIIGEWKYYNREGKLEKIETYENSEIIKVEEF >gi|296153738|gb|ADVK01000053.1| GENE 27 19980 - 21062 1033 360 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329140|ref|ZP_06871643.1| ## NR: gi|296329140|ref|ZP_06871643.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 360 1 360 360 598 100.0 1e-169 MKKFKLKPIYHPKGSYLNYIVEIWVDGVNISQFYEDNKLRIDVGYIFHIYNFFDNYLEEI MKEQILPYEDVEGKTIFETIDNIKEKYFYWLKDDYENDETDEEIEQIISISEPFYEWQRA HRWLSSGPFLCIPDTIFRRVGDKIEISWDTTWDLTYQQRKYENKNIKFISTKGVTYIDAN EFYLEIKKFLKKIDDISKIQNEKFHIIEKTGKLIYAKDPYNNIEFKEEKEFLQDLEKIGY RFFTIYELVLITEKDKKIVPIILKYLSKIEDENIKTHLVYFLAVKNYKEASEKLIKEFYK AKTDEYRIALSKALSIIYNKNVLNELLEIVKNKEFKNVNFPIISTLNKYKDKRVEKFLKE >gi|296153738|gb|ADVK01000053.1| GENE 28 21089 - 23776 3190 895 aa, chain - ## HITS:1 COG:FN1817 KEGG:ns NR:ns ## COG: FN1817 COG3210 # Protein_GI_number: 19705122 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 35 596 1892 2458 2806 504 60.0 1e-142 MLLIKEQGAVNKNNSDDKNDSKDKNKNNQNKKDNTVGKDNVKLAEANNNFYANMGVNLGF NKSSSKTSSHSETAVVTTIQGKDKDSSITYNNVKNIEYVGTQAKDTKFIYNNVDNITKKA VELNNSYSSDSKSSGVSAGVNVGYGRKVLTDNASISVSASKSNMNSNGTSYQNGLFVNVD EEHNNTKNMTLSGFNQVGGKVTGNIENLTIESKQNTSTTTGSTKGGSIGFAPNGMPTSIS ANYSQTNGERKYVDDATTFIIGDGSNLKIGKVDNTASAIGTSGNGKLSIDEYVGHNLENK DKTTTKGGSVSLSQSSIPISGVGVNYANKDLESVTKNTVIGNVEIGKSSGDEINKDLDTM TEVTKDEDTKTNVFVESQTIRYALNPEAFKQDLEKAKNEIHDIYHAVDSTVNPQGKESRN VLQQLAETRQAKTIYNVVDSRLQIAENQEDIAKAFEGVSEDLGYKVKVIYTDPSNSPQLI 
GVDKNGNKYIKNGTAYVDKDTGIGYILINTKSPANKTKAGVIGTIAEEQSHIIGKIEGRQ KKVPDGSEKGLESLGRPTNDYFKDKYSKNDKAIGIVSDGKDYSNVDFGENVGDNNTLTLV KEGTFFLETGTKTGIGMAAAGISVPAVVGIVAVGGVVYYFTMPEEKKEQLKEEISEIYED VKKSFGSILDKESSKNLVADVIKFKLKEKGIEADVKIDKDGNIMYTMTPLPEEKRVFKLP PFPLPEERIFRLPPMEPPKQETQKERWEREERARREEQEFFNELQKPNGIPLQQEKPGTI YTIDNNESKKFDGEYKPSPKHDPVSGHNGASKSNIPDLKTGQELLEKSYGSSKTKQRYVW YRGKLVKFQPGNDGTWHDYPVINAGEEVPIDVLRQMRDDGIITNSEYKNLIKGKK >gi|296153738|gb|ADVK01000053.1| GENE 29 23745 - 29408 6549 1887 aa, chain - ## HITS:1 COG:FN0291 KEGG:ns NR:ns ## COG: FN0291 COG3210 # Protein_GI_number: 19703636 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Large exoproteins involved in heme utilization or adhesion # Organism: Fusobacterium nucleatum # 302 1765 404 1860 1881 1536 76.0 0 MKRSLKKLIATFMLFLHIISLADGIVPDNAASRNLQVDKAANGVPLVNIEAPDNNGTSHN VYKDYNVDGRGAILNNSKDLTNSQLGGLIYGNPNLQNSKEASTIINEVSGVNRSRIEGYQ EIAGKKANYILANPNGIYINGAGFINTGNVTLTTGSGNNLLNPEKGTIEIAGKGLDLRNI NKAELVARVAELSAPIYGGEEVNLKLGSQGKANKPEYVLDARALGSIYAGRINIIVNEDG VGVKTQAPMYAEKGDVVISSKGKVYLKDTQAKGNIKITSTETEIANKLLAENLIDIKGKT TNSGQIQANNNITISGNVDSSNLISTNKDITISGNLTNSGEVSTKNLTTNNLDNKGNITV INNVNSELITNNGKLLVGDTINSQNLTNTSTVQGKTLDIKSKVNSSGKIISDNISTKDIS NSGNISSKSITTQELTNSGEIISNNLSSNNTNNSKNIFVNGNLKIVNNLNNSGIIEGLEL NTNSIDNTGNITIKNKLTSQNLNNKKNTANINAGFLDVQNKISSVGNIKAITLKTNNLDN SGNILTNSLTTTENINKGSIIAKNISSQNLVNSGSVISNNITVAKNITNTNSIFANEKIS ADKISNSNKLVAKNTEITKLTNDGNIVVKENLKAKDITNSNTIKVGENLNTDKLQNSKTL IAKNINIEKSLNNINGKITSLNANINTSDIKNNNGIIQAIKNINIKTSNDLSLDGKYTAN DSLNINAKSLENNGNLENDGKINLNLTGNLINNKRISSSGNLDITAKEVLNNGKDSAIGS ETNLSITANSLKNEGNLLFGVRTDNKLKTTGNITNKGVIGSLGKLSIEAKDILNDKHIAS DNDLTINTDSITNKGLLYSTGNMKVDFKENFLNDKAEIYSSGDIIFTGKEGTFINRVGDI ESEKNIKIEAKDIKNLAELRGGHRVVGSVPGNQSNIDMSKLDIKKYNKLSSDIVNDFFKQ YIIIPRLQKSGIDIDNKRVEIKADEQGGSFFVLKKDDEKFHTYGAWDWEGYKSKAGVYLA SADKIESNYTSEMSTIKAGGNITLKATNDIENLESNILANGNVDITAKNLINKNFNIAVK RKITLMRDIEYHGAAMHNLKWGKTYDHMGQKIYNHNEDKGNVIIEDEVTSYVGTGKNAKI 
SAGGNIKIEANKVGNGLEIKENVSIKSKNQEINSTKVDKNGVNLDSINIDKKNTNVDEIV LNKKNLEPKEEIDTKDYINLPKNDKGLFRINNNIDNKPGFSYLVETNINFIDKSRFFGSE YFFKRIGFNPDRNIRLLGDSFYETKLINKAILEGTGRRFLSGYKSDKEQMQALYDNAVSE QADLNLSVGIALSKEQIAKLKKDIIWYVEEEVQGQKVLVPKVYLTRNTLSKLKDKNASIE AGQELTIIAKDIQNTGNLSANNITITTDNLTNKTILGANKASIDGNTVNITAKNSVDNIG ADIKAKENLTVTAKDISNLSTKRTNGYGLDTVTTGENLASIQAKNVTLDAGNNIQNSGAK IKADEKVDIKTKNVKIDTIEESRYFHDGNSNNYTTIDNKSNIASNIEAKNINIEAKKDID IKGSNVVAKNEANIKADGDVNIVSATDSRFYAHKETSKGKFGKSSSEENIAYATRNVASN IIGDKVNITSGKNVNIFGSNVGAKDTGNISAKGTITEAAAKEINYSYHKKTKTGFMGLSG KSSAEKIRQELNAESNLYVKNQGIIGGDIKVIGSNLVLGNNSIINGKLTTDSNQLHNSYS YEESKKGFSGSIGGGGFSVGYGKSESGLKEKSVINAKSNLVLGDGTVLNKGADITATNLT HGKITVNNGDVKFGARKDTKDVETYSKSSGVNLSVRIKSQALDRVQQGFDSFNQLKSGDI FGGVASATNTATGIVSGLASNQGTRCC >gi|296153738|gb|ADVK01000053.1| GENE 30 29420 - 29560 94 46 aa, chain - ## HITS:1 COG:FN0292 KEGG:ns NR:ns ## COG: FN0292 COG2831 # Protein_GI_number: 19703637 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Hemolysin activation/secretion protein # Organism: Fusobacterium nucleatum # 1 46 305 350 350 86 95.0 1e-17 LQGVAIGIKGYFKGFEGSFTLAKPIDKPRYFKNNRPVAYTTITYRF Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:03:21 2011 Seq name: gi|296153718|gb|ADVK01000054.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00068, whole genome shotgun sequence Length of sequence - 19892 bp Number of predicted genes - 19, with homology - 17 Number of transcription units - 7, operones - 4 average op.length - 4.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 111 - 149 5.6 1 1 Op 1 1/0.000 - CDS 159 - 1973 1877 ## COG1835 Predicted acyltransferases - Prom 2024 - 2083 7.8 - Term 2081 - 2114 1.4 2 1 Op 2 1/0.000 - CDS 2139 - 4154 3419 ## COG3808 Inorganic pyrophosphatase 3 1 Op 3 . - CDS 4231 - 5250 1202 ## COG1477 Membrane-associated lipoprotein involved in thiamine biosynthesis 4 1 Op 4 . 
- CDS 5231 - 5455 424 ## FN2032 DNA-directed RNA polymerase omega chain (EC:2.7.7.6) 5 1 Op 5 . - CDS 5457 - 6014 755 ## COG0194 Guanylate kinase - Prom 6067 - 6126 5.7 + Prom 5806 - 5865 5.2 6 2 Tu 1 . + CDS 6105 - 6170 61 ## 7 3 Op 1 1/0.000 - CDS 6219 - 7097 1006 ## COG1561 Uncharacterized stress-induced protein - Prom 7122 - 7181 5.8 8 3 Op 2 58/0.000 - CDS 7303 - 11262 5265 ## COG0086 DNA-directed RNA polymerase, beta' subunit/160 kD subunit 9 3 Op 3 28/0.000 - CDS 11298 - 14852 843 ## PROTEIN SUPPORTED gi|163796927|ref|ZP_02190884.1| 30S ribosomal protein S12 - Prom 14881 - 14940 8.2 - Term 15179 - 15216 2.2 10 4 Op 1 47/0.000 - CDS 15250 - 15618 584 ## PROTEIN SUPPORTED gi|19705328|ref|NP_602823.1| 50S ribosomal protein L12P (L7/L12) 11 4 Op 2 43/0.000 - CDS 15665 - 16177 825 ## PROTEIN SUPPORTED gi|19705329|ref|NP_602824.1| 50S ribosomal protein L10P - Prom 16198 - 16257 1.9 - Term 16199 - 16232 1.4 12 4 Op 3 55/0.000 - CDS 16331 - 17038 1180 ## PROTEIN SUPPORTED gi|19705330|ref|NP_602825.1| 50S ribosomal protein L1 13 4 Op 4 45/0.000 - CDS 17099 - 17524 703 ## PROTEIN SUPPORTED gi|19705331|ref|NP_602826.1| 50S ribosomal protein L11P 14 4 Op 5 46/0.000 - CDS 17559 - 18140 917 ## COG0250 Transcription antiterminator 15 4 Op 6 . - CDS 18137 - 18313 242 ## COG0690 Preprotein translocase subunit SecE + Prom 18036 - 18095 7.2 16 5 Tu 1 . + CDS 18335 - 18400 215 ## - TRNA 18336 - 18411 87.4 # Trp CCA 0 0 17 6 Op 1 1/0.000 - CDS 18438 - 18590 266 ## PROTEIN SUPPORTED gi|19705334|ref|NP_602829.1| 50S ribosomal protein L33P - Prom 18615 - 18674 4.7 - Term 18690 - 18729 -0.5 18 6 Op 2 3/0.000 - CDS 18855 - 19283 449 ## COG0735 Fe2+/Zn2+ uptake regulation proteins - Prom 19331 - 19390 14.2 19 7 Tu 1 . 
- CDS 19392 - 19841 615 ## COG0456 Acetyltransferases Predicted protein(s) >gi|296153718|gb|ADVK01000054.1| GENE 1 159 - 1973 1877 604 aa, chain - ## HITS:1 COG:FN2029 KEGG:ns NR:ns ## COG: FN2029 COG1835 # Protein_GI_number: 19705320 # Func_class: I Lipid transport and metabolism # Function: Predicted acyltransferases # Organism: Fusobacterium nucleatum # 1 604 1 604 604 991 100.0 0 MNELKKRSIGIDIIKAISLISVIIYHLYEYKGTYIGVVLFFVISGYLITEVLYERDDSYF KFIKRRYTKIFPTLIVVLTSSCLAFYYFYKFLSVKLIFNSLSSLFGLSNIYQIYSGMSYF ERSGDLFPLLHTWSLSIEIQFYVIFPFLIYLFKKLKLNIKVIATIIIILCLISGGIMFYK EYMNYDISSIYYGTDTRVFSILIGSAFYFLFKNKKLDAKKANILSYIFLAVIVLIVLSVD YLSKSNYYGFLYLISVLGAFITVTSLKTGFLDFKSPITKPLSKLGEHSYVYYLWQYPIMV YSLEYFKWSDIDYNYTVVLQIIILIILSEISYKFLIESRQGSIILRRIFLVLYVAILAFL PISKESNSEEVQNRANEIDNNLVINDFNNTAKKEYDPLRTDNVDYIASKIVEKIALFKEQ QDKKIEKKVDEAVDKKEEKDEEKLDNTIEAEDYTFIGDSVMKMGEPYIKEIFKDASIDAK VSRQFTDLPKVLESLKNSKKLKNIVVIHLGTNGVINKESFESSMKLLEGKTVYLMNTVVP KSWEKSVNKSLEEWSEDYENITIIDWYKYAKGEKKLFYKDATHPKPEGAKKYAEFILESI KEKK >gi|296153718|gb|ADVK01000054.1| GENE 2 2139 - 4154 3419 671 aa, chain - ## HITS:1 COG:FN2030 KEGG:ns NR:ns ## COG: FN2030 COG3808 # Protein_GI_number: 19705321 # Func_class: C Energy production and conversion # Function: Inorganic pyrophosphatase # Organism: Fusobacterium nucleatum # 1 671 1 671 671 1031 99.0 0 MDLLTQVMYIGIAVGIISLLAAFYYAKKVEHYQINIPKVEEITAAIREGAMAFLAAEYKI LIVFVIVVAVALGIFISVPTAGAFILGAITSAIAGNAGMRIATKANGRTAIAAKEGGLAK ALNVAFSGGAVMGLTVVGLGMFMLSLILLVSRTVGISVNDVTGFGMGASSIALFARVGGG IYTKAADVGADLVGKVEAGIPEDDPRNPATIADNVGDNVGDVAGMGADLFESYVGSIIAT ITLAFLLPVDDATPYVAAPLLISAFGIISSIIATLTVKTDDGSKVHAKLEMGTRIAGILT IIASFGIIKYLGLNMGIFYAIVAGLVAGLVIAYFTGVYTDTGRRAVNRVSDAAGTGAATA IIEGLAIGMESTVAPLIVIAIAIIISFKTGGLYGISIAAVGMLATTGMVVAVDAYGPVAD NAGGIAEMSELPPEVRETTDKLDAVGNSTAAVGKGFAIGSAALTALSLFAAYKEAVDKLT SEALVIDVTDPEVIAGLFIGGMLTFLFSALTMTAVGKAAIEMVEEVRRQFREFPGIMDRT QKPDYKRCVEISTHSSLKQMILPGVLAIIVPVIIGLWSVKALGGLLAGALVTGVLMAIMM ANAGGAWDNGKKQIEGGYKGDKKGSDRHKAAVVGDTVGDPFKDTSGPSLNILIKLMSIVS LVLVPLFVKFM 
>gi|296153718|gb|ADVK01000054.1| GENE 3 4231 - 5250 1202 339 aa, chain - ## HITS:1 COG:FN2031 KEGG:ns NR:ns ## COG: FN2031 COG1477 # Protein_GI_number: 19705322 # Func_class: H Coenzyme transport and metabolism # Function: Membrane-associated lipoprotein involved in thiamine biosynthesis # Organism: Fusobacterium nucleatum # 20 339 1 320 320 587 99.0 1e-167 MKKKKNRFIVFILLILSIFLISCGKELEKIEESKFLFGTYIKIIVYSDNKEKAMDSIEKA FNEIQRIDEKYNSKSEGSLIYKLNNSSNKTIKLDEEGVKLFEGVKKAYELSGHKYDITIA PLLELWGFTDETIELPDLKLPTKEEIEFTKTFIGFDKVKISDDGTLTMGSPVREIDTGSF LKGYAISRAKEVLKNEGIKSAFITSISSIDLIGTKPDNKPWKIGLQNPENPADMIGIVPL KDRAMGVSGDYQTYVEIDGEMYHHILDKDTGYPVKDKKMVVVLCDNAFDADLLSTTFFLM PIDEVISYVDGRKDLDVLIVDKDMNIITSKNFKYEEIKK >gi|296153718|gb|ADVK01000054.1| GENE 4 5231 - 5455 424 74 aa, chain - ## HITS:1 COG:no KEGG:FN2032 NR:ns ## KEGG: FN2032 # Name: not_defined # Def: DNA-directed RNA polymerase omega chain (EC:2.7.7.6) # Organism: F.nucleatum # Pathway: Purine metabolism [PATH:fnu00230]; Pyrimidine metabolism [PATH:fnu00240]; Metabolic pathways [PATH:fnu01100]; RNA polymerase [PATH:fnu03020] # 11 74 1 64 64 89 98.0 4e-17 MKKDITYDELLTKIPNKYILTIVGGERARERAKERMERGGEPLPLTKYDKKDTEMKKVFK EILAGKVGYEKEEE >gi|296153718|gb|ADVK01000054.1| GENE 5 5457 - 6014 755 185 aa, chain - ## HITS:1 COG:FN2033 KEGG:ns NR:ns ## COG: FN2033 COG0194 # Protein_GI_number: 19705324 # Func_class: F Nucleotide transport and metabolism # Function: Guanylate kinase # Organism: Fusobacterium nucleatum # 1 185 1 185 185 326 100.0 1e-89 MSLGALYVVSGPSGAGKSTVCKLVRERLGINLSISATSRKPRNGEQEGVDYFFITAEEFE RKIKNDDFLEYANVHGNYYGTLKSEVEERLKRGEKVLLEIDVQGGVQVKNKFPEANLIFF KTANKEELEKRLRGRNTDSEEVIQARLKNSLKELEYESKYDRVIINNEIEQACNDLISII ENGVK >gi|296153718|gb|ADVK01000054.1| GENE 6 6105 - 6170 61 21 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MSRANLAVSEQSEFSEFAANS >gi|296153718|gb|ADVK01000054.1| GENE 7 6219 - 7097 1006 292 aa, chain - ## HITS:1 COG:FN2034 KEGG:ns NR:ns ## COG: FN2034 COG1561 # Protein_GI_number: 19705325 # Func_class: S Function 
unknown # Function: Uncharacterized stress-induced protein # Organism: Fusobacterium nucleatum # 1 292 1 292 292 427 99.0 1e-119 MRSMTGYSKLNYEDENYVINMEIKSVNNKNLATKIKLPYNLNLLESFIRAEIASLISRGS IDFRIEFENKNESLKNLKYDEKLAKSCMDILNKIEEDFNDKFSNKLDFLVRNFGVISQKD LDTDEEKYKEIIDLKLKELLQNFIKTKVEEGNRLRVFFKEQLSILKSKLEQVKKLRPQVV ENYKQRLLNNINSIKADINFNEEDILKEVLLFSDRVDITEEISRLESHFKQLEHEFETDE ISQGKKIEFIFQEVFREFNTMGVKSNMYEISKLVVESKNELEKMREQIMNIE >gi|296153718|gb|ADVK01000054.1| GENE 8 7303 - 11262 5265 1319 aa, chain - ## HITS:1 COG:FN2035 KEGG:ns NR:ns ## COG: FN2035 COG0086 # Protein_GI_number: 19705326 # Func_class: K Transcription # Function: DNA-directed RNA polymerase, beta' subunit/160 kD subunit # Organism: Fusobacterium nucleatum # 1 1319 1 1319 1319 2553 100.0 0 MGIRNFEKIRIKLASPEKILEWSHGEVTKPETINYRTLNPERDGLFCEIIFGPTKDWECS CGKYKRMRYKGLVCEKCGVEVTRAKVRRERMGHITLASPVSHIWYSKGSPNKMSLIIGIS SKELESVLYFARYIVTSSQESTVEVGKILTEKEYKLLKQLYGNKFEAYMGADGILKLLTA IDLEKLRDELENELAEANSAQKRKKLVKRLKIVRDFIASGNRPEWMILTNVPVIPAELRP MVQLDGGRFATSDLNDLYRRVINRNNRLKKLLEIRAPEIVVKNEKRMLQEAVDALIDNGR RGKPVVAQNNRELKSLSDMLKGKQGRFRQNLLGKRVDYSARSVIVVGPSLKMNQCGIPKK MALELYKPFIMRELVRRELANNIKMAKKLVEESDDKVWAVIEDVIADHPVLLNRAPTLHR LSIQAFQPVLIEGKAIRLHPLVCSAFNADFDGDQMAVHLTLSPESMMEAKLLMFAPNNII SPSSGEPIAVPSQDMVMGCFYMTKERPGEKGEGKLFSNIEQVITAYQNDKVGTHALIKVR MNGELIETTPGRVLFNEILPEIDRNYHKTYGKKEIKSLIKSLYEAHGFTETAELINRVKN FGYHYGTFAGVSVGIEDLEVPPKKKSLLNQADKEVAQIDKDYKSGKIINEERYRKTIEVW SRTTEAVTDAMMKNLDEFNPVYMMATSGARGNVSQMRQLAGMRGNMADTQGRTIEVPIKA NFREGLTVLEFFMSSHGARKGLADTALRTADSGYLTRRLVDISHEVIVNEEDCHTHEGIE VEALVGANGKIIEKLSERINGRVLAEDLVHKGKKIAKRNTMIHKDLLDKIEELGIKKVKI RSPLTCALEKGVCQKCYGMDLSNYNEILLGEAVGVVAAQSIGEPGTQLTMRTFHTGGVAG AATVVNSKKAENDGEVSFRDIKTIEINGEDVVVSQGGKIIIADNEHEVDSGSVIRVTEGQ KVKEGDVLVTFDPYHIPIISSHDGKVQYRHFTPKNIRDEKYDVHEYLVVRSVDSVDSEPR VHILDKKNEKLATYNIPYGAYMMVRDGAKVKKGDIIAKIIKLGEGTKDITGGLPRVQELF EARNPKGKATLAEIDGRIEILTTKKKQMRVVNVRSLENPEEFKEYLIPMGERLVVTDGLK IKAGDKITEGAISPYDVLNIKGLVAAEQFILESVQQVYRDQGVTVNDKHIEIIVKQMFRK 
VRIIDSGASLFLEDEVIEKRVVDLENKKLEEQGKALIKYEPVIQGITKAAVNTGSFISAA SFQETTKVLSNAAIEGKVDYLEGLKENVILGKKIPAGTGFNKYKSIKVRYNTDDKPEEE >gi|296153718|gb|ADVK01000054.1| GENE 9 11298 - 14852 843 1184 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163796927|ref|ZP_02190884.1| 30S ribosomal protein S12 [alpha proteobacterium BAL199] # 888 1142 1085 1391 1392 329 55 1e-89 MQKLIERLDFGKIKARGEMPHFLEFQLNSYEDFLQTNMSPNKREEKGFELAFKEIFPIES SNGDVRLEYIGYELHEAEAPLNDELECKKRGKTYSNSLKVRLRLINKKMGNEIQESLVYF GEVPKMTDRATFIINGAERVVVSQLHRSPGVSFSKEVNTQTGKDLFSGKIIPYKGTWLEF ETDKNDFLSVKIDRKKKVLATVFLKAVDFFKDNNEIRDYFLEVKELKLKALYKKYSKEPE ELLNVLKQELDGSIVKEDILDEETGEFIAEAEAFINEEVINKLIENKVDKISYWYVGPES KLVANTLMNDTTLTEDEAVVEVFKKLRPGDQVTVDSARSLIRQMFFNPQRYDLEPVGRYK MNKRLKLDVPEEQISLTKEDVLGTIKYVIELNNGEQNVHTDDIDNLSNRRIRGVGELLLM QIKTGLAKMNKMVREKMTTQDIETVTPQSLLNTRPLNALIQDFFGSGQLSQFMDQSNPLA ELTHKRRISALGPGGLSRERAGFEVRDVHDSHYGRICPIETPEGPNIGLIGSLATYAKIN KYGFIETPYVKVENGVALVDDVRYLAADEEDGLFIAQADTKLDKNNKLQGLVVCRYGHEI VEIEPERVNYMDVSPKQVVSVSAGLIPFLEHDDANRALMGSNMQRQAVPLLKSEAPFIGT GLERKVAVDSGAVVTTKVSGKVTYVDGKKIIIEDKDKKEHIYRLLNYERSNQSMCLHQTP LVDLGDKVKTGDIIADGPATKLGDLSLGRNILMGFMPWEGYNYEDAILISDRLRKDDVFT SIHIEEYEIDARTTKLGDEEITREIPNVSENALRNLDENGVIMIGSEVGPGDILVGKTAP KGETEPPAEEKLLRAIFGEKARDVRDTSLTMPHGSKGVVVDILELSRENGDELKAGVNKS IRVLVAEKRKITVGDKMSGRHGNKGVVSRVLPAEDMPFLEDGTHLDVVLNPLGVPSRMNI GQVLEVHLGMAMRTLNGGTCIATPVFDGATEEQVKDYLEKQGYPRTGKVTLYDGRTGEKF DNKVTVGIMYMLKLHHLVEDKMHARAIGPYSLVTQQPLGGKAQFGGQRLGEMEVWALEAY GASNILQEMLTVKSDDITGRTKTYEAIIKGEAMPESDLPESFKVLLKEFQALALDIELCD EEDNVINVDEEIGIEETPTEYSPQYEIEMTGLHEIDEDAEDFEE >gi|296153718|gb|ADVK01000054.1| GENE 10 15250 - 15618 584 122 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19705328|ref|NP_602823.1| 50S ribosomal protein L12P (L7/L12) [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 122 1 122 122 229 100 1e-59 MAFNKEQFIADLEAMTVLELKELVSALEEHFGVTAAAPVAVAAAGGATEAAEEKTEFDVV LKSAGGNKIAVIKEVRAITGLGLKEAKDLVDNGGVIKEAAPKDEANAIKEKLTAAGAEVE VK >gi|296153718|gb|ADVK01000054.1| GENE 11 15665 - 16177 825 170 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19705329|ref|NP_602824.1| 50S ribosomal protein L10P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 170 1 170 170 322 100 1e-87 MATQVKKELVAELVEKIKKAQSVVFVDYQGIKVNEETLLRKQMRENGAEYLVAKNRLFKI ALKESGVEDSFDEILEGSTAFAFGYNDPVAPAKAVFDLAKAKAKAKLDVFKIKGGYLTGK KVSVKEVEELAKLPSREQLLSMLLNSMLGPIRKLAYATVAIADKKEGSAE >gi|296153718|gb|ADVK01000054.1| GENE 12 16331 - 17038 1180 235 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19705330|ref|NP_602825.1| 50S ribosomal protein L1 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 235 1 235 235 459 99 1e-129 MAKHRGKKYLEVAKLVEIGKLYDIREALELVQKTKTAKFTETVEVALRLGVDPRHADQQI RGTVVLPHGTGKTVKILAITSGENIEKALAAGADYAGAEEYINQIQQGWLDFDLVIATPD MMPKIGRLGKILGTKGLMPNPKSGTVTPDIAAAVSEFKKGKLAFRVDKLGSIHAPIGKVD FDLDKIEENFKAFMDQIIRLKPASSKGQYLRTVAVSLTMGPGVKMDPAIVGKIVG >gi|296153718|gb|ADVK01000054.1| GENE 13 17099 - 17524 703 141 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19705331|ref|NP_602826.1| 50S ribosomal protein L11P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 141 1 141 141 275 100 2e-73 MAKEVIQIIKLQLPAGKANPAPPVGPALGQHGVNIMEFCKAFNAKTQDKAGWIIPVEISV YSDRSFTFILKTPPASDLLKKAAGITSGAKNSKKEVAGKITTAKLKELAETKMPDLNASS VETAMKIIAGSARSMGIKIED >gi|296153718|gb|ADVK01000054.1| GENE 14 17559 - 18140 917 193 aa, chain - ## HITS:1 COG:FN2041 KEGG:ns NR:ns ## COG: FN2041 COG0250 # Protein_GI_number: 19705332 # Func_class: K Transcription # Function: Transcription antiterminator # Organism: Fusobacterium nucleatum # 1 193 1 193 193 344 100.0 6e-95 MSIENVRKWFMIHTYSGYEKKVKTDLEQKMETLGFKEVVTNILVPEEELTEIVRGKPKKV YRKLFPAYVMLEMEATREENENGISYKVDPRVWYEVRNTNGVTGFVGVGSDPIPMEEEEV KNIFNIIGVKTPKENVKIDFTEGDYVKILKGSFKDQEGQVAEIDHEHGRVKVMVDIFGRM TPVEIEVDGVLKV >gi|296153718|gb|ADVK01000054.1| GENE 15 18137 - 18313 242 58 aa, chain - ## HITS:1 COG:FN2042 KEGG:ns NR:ns ## COG: FN2042 COG0690 # Protein_GI_number: 19705333 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecE # Organism: Fusobacterium nucleatum # 1 58 1 58 58 91 100.0 3e-19 MNLFQKVKMEYSKVEWPSRTEVIHSTLWVVTMTVLVSIYLGIFDILAVRALNFLEALI >gi|296153718|gb|ADVK01000054.1| GENE 16 18335 - 18400 215 21 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MAGQEGLEPPTLGFGDRCSTN >gi|296153718|gb|ADVK01000054.1| GENE 17 18438 - 18590 266 50 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19705334|ref|NP_602829.1| 50S ribosomal protein L33P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 50 1 50 50 107 100 8e-23 MRVQVILECTETKLRHYTTTKNKKTHPERLEMMKYNPVLKKHTLYKETKK >gi|296153718|gb|ADVK01000054.1| GENE 18 18855 - 19283 449 142 aa, chain - ## HITS:1 COG:FN2045 KEGG:ns NR:ns ## COG: FN2045 COG0735 # Protein_GI_number: 19705335 # Func_class: P Inorganic ion transport and metabolism # Function: Fe2+/Zn2+ uptake regulation proteins # Organism: Fusobacterium nucleatum # 1 142 1 142 142 269 100.0 1e-72 MELQLHTGDIGNYLKEHNIKPSYQRMKIFQYLLDNHNHPTVDTIYKALCTEIPTLSKTTV YNTLNLFVEKKLVYVIVIEENETRYDLLTHTHGHFKCTCCGALFDVELNIDYSKSQELLG CDIEEKHIYFKGICKNCKGKQN >gi|296153718|gb|ADVK01000054.1| GENE 19 19392 - 19841 615 149 aa, chain - ## HITS:1 COG:FN2046 KEGG:ns NR:ns ## COG: FN2046 COG0456 # Protein_GI_number: 19705336 # Func_class: R General function prediction only # Function: Acetyltransferases # Organism: Fusobacterium nucleatum # 1 149 1 149 149 257 99.0 6e-69 MELVYIKDPDFEVMMKIVEIEQEAFEGNGNVDLWIIKALIRYGLVFVIKENDKIVCIVEY MQVFNKKSLFLYGISTLKEYRHKGYGNYILNETEKILKNLSYEEIELTVAPENDIAINFY KKHGYIQEKLLKDEYGKGIHRYVMRKKLF Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:06:15 2011 Seq name: gi|296153520|gb|ADVK01000055.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00069, whole genome shotgun sequence Length of sequence - 188717 bp Number of predicted genes - 210, with homology - 196 Number of transcription units - 62, operones - 42 average op.length - 4.5 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 3 - 525 791 ## FN1590 lipoprotein - Prom 580 - 639 11.5 - Term 620 - 669 2.5 2 2 Op 1 12/0.000 - CDS 681 - 1838 1975 ## COG2878 Predicted NADH:ubiquinone oxidoreductase, subunit RnfB 3 2 Op 2 3/0.000 - CDS 1865 - 2449 858 ## COG4657 Predicted NADH:ubiquinone oxidoreductase, subunit RnfA 4 2 Op 3 13/0.000 - CDS 2446 - 3063 926 ## COG4660 Predicted NADH:ubiquinone oxidoreductase, subunit RnfE 5 2 Op 4 12/0.000 - CDS 3063 - 3596 1018 ## COG4659 Predicted NADH:ubiquinone oxidoreductase, subunit RnfG 6 2 Op 5 12/0.000 - CDS 3586 - 4530 1369 ## COG4658 Predicted NADH:ubiquinone oxidoreductase, subunit RnfD 7 2 Op 6 1/0.857 - CDS 4556 - 5863 1744 ## COG4656 Predicted NADH:ubiquinone oxidoreductase, subunit RnfC 8 2 Op 7 . - CDS 5938 - 6513 856 ## COG0193 Peptidyl-tRNA hydrolase - Prom 6542 - 6601 12.8 9 3 Op 1 . - CDS 6682 - 6861 255 ## FN0289 hypothetical protein - Prom 6895 - 6954 1.8 10 3 Op 2 . - CDS 6977 - 7426 507 ## FN1599 hypothetical protein - Prom 7621 - 7680 11.7 11 4 Op 1 1/0.857 - CDS 7712 - 8455 888 ## COG0101 Pseudouridylate synthase 12 4 Op 2 1/0.857 - CDS 8475 - 9551 1360 ## COG2404 Predicted phosphohydrolase (DHH superfamily) - Prom 9578 - 9637 5.4 13 5 Op 1 1/0.857 - CDS 9666 - 10136 597 ## COG1683 Uncharacterized conserved protein 14 5 Op 2 . - CDS 10120 - 12312 2580 ## COG5324 Uncharacterized conserved protein 15 5 Op 3 . 
- CDS 12309 - 12488 91 ## FN1604 hypothetical protein - Prom 12585 - 12644 7.8 16 6 Op 1 1/0.857 - CDS 12650 - 13927 2220 ## COG0104 Adenylosuccinate synthase 17 6 Op 2 1/0.857 - CDS 13937 - 15859 2018 ## COG1519 3-deoxy-D-manno-octulosonic-acid transferase 18 6 Op 3 1/0.857 - CDS 15888 - 16544 894 ## COG0283 Cytidylate kinase 19 6 Op 4 1/0.857 - CDS 16560 - 17498 1568 ## PROTEIN SUPPORTED gi|19704929|ref|NP_602424.1| ribosomal protein L11 methyltransferase 20 6 Op 5 1/0.857 - CDS 17473 - 18264 1163 ## COG1692 Uncharacterized protein conserved in bacteria - Prom 18284 - 18343 10.8 - Term 18351 - 18381 -0.5 21 7 Op 1 1/0.857 - CDS 18391 - 19248 1120 ## COG1281 Disulfide bond chaperones of the HSP33 family 22 7 Op 2 1/0.857 - CDS 19248 - 19778 581 ## COG1555 DNA uptake protein and related DNA-binding proteins 23 7 Op 3 1/0.857 - CDS 19775 - 21184 1705 ## COG0635 Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 24 7 Op 4 1/0.857 - CDS 21187 - 22137 1124 ## COG2805 Tfp pilus assembly protein, pilus retraction ATPase PilT - Prom 22276 - 22335 4.0 - Term 22210 - 22252 -1.0 25 8 Tu 1 . - CDS 22347 - 23846 1892 ## COG0606 Predicted ATPase with chaperone activity - Prom 23898 - 23957 6.9 + Prom 23817 - 23876 9.7 26 9 Op 1 . + CDS 24019 - 24570 730 ## COG1971 Predicted membrane protein 27 9 Op 2 . + CDS 24649 - 24774 262 ## gi|256028477|ref|ZP_05442311.1| hypothetical protein PrD11_10879 + Term 24795 - 24839 1.0 - Term 24710 - 24750 0.0 28 10 Op 1 . - CDS 24822 - 25283 710 ## COG0781 Transcription termination factor 29 10 Op 2 . - CDS 25288 - 25512 330 ## FN1617 prolipoprotein diacylglyceryltransferase 30 10 Op 3 . - CDS 25512 - 26105 579 ## FN1618 hypothetical protein 31 10 Op 4 . 
- CDS 26122 - 26490 758 ## COG1302 Uncharacterized protein conserved in bacteria - Prom 26596 - 26655 10.1 + Prom 26550 - 26609 11.4 32 11 Op 1 38/0.000 + CDS 26792 - 27535 1247 ## PROTEIN SUPPORTED gi|19704941|ref|NP_602436.1| 30S ribosomal protein S2 33 11 Op 2 24/0.000 + CDS 27578 - 28471 519 ## PROTEIN SUPPORTED gi|42631241|ref|ZP_00156779.1| COG0264: Translation elongation factor Ts + Term 28477 - 28515 4.2 34 11 Op 3 33/0.000 + CDS 28536 - 29255 1170 ## COG0528 Uridylate kinase 35 11 Op 4 . + CDS 29290 - 29862 1013 ## COG0233 Ribosome recycling factor + Term 29871 - 29933 5.7 - Term 29859 - 29916 6.7 36 12 Op 1 53/0.000 - CDS 29926 - 31206 863 ## PROTEIN SUPPORTED gi|163796899|ref|ZP_02190856.1| 30S ribosomal protein S11 37 12 Op 2 48/0.000 - CDS 31231 - 31710 812 ## PROTEIN SUPPORTED gi|19704946|ref|NP_602441.1| 50S ribosomal protein L15P 38 12 Op 3 50/0.000 - CDS 31710 - 31895 300 ## PROTEIN SUPPORTED gi|19704947|ref|NP_602442.1| 50S ribosomal protein L30P 39 12 Op 4 56/0.000 - CDS 31908 - 32402 807 ## PROTEIN SUPPORTED gi|19704948|ref|NP_602443.1| SSU ribosomal protein S5P 40 12 Op 5 46/0.000 - CDS 32426 - 32794 591 ## PROTEIN SUPPORTED gi|19704949|ref|NP_602444.1| 50S ribosomal protein L18P 41 12 Op 6 55/0.000 - CDS 32821 - 33354 915 ## PROTEIN SUPPORTED gi|19704950|ref|NP_602445.1| 50S ribosomal protein L6 42 12 Op 7 50/0.000 - CDS 33379 - 33777 662 ## PROTEIN SUPPORTED gi|19704951|ref|NP_602446.1| SSU ribosomal protein S8P 43 12 Op 8 50/0.000 - CDS 33806 - 34093 479 ## PROTEIN SUPPORTED gi|197736518|ref|YP_002165296.1| ribosomal protein S14 44 12 Op 9 48/0.000 - CDS 34114 - 34665 922 ## PROTEIN SUPPORTED gi|19704953|ref|NP_602448.1| 50S ribosomal protein L5 45 12 Op 10 57/0.000 - CDS 34684 - 35025 567 ## PROTEIN SUPPORTED gi|197736520|ref|YP_002165298.1| ribosomal protein L24 46 12 Op 11 50/0.000 - CDS 35050 - 35418 600 ## PROTEIN SUPPORTED gi|19704956|ref|NP_602451.1| 50S ribosomal protein L14P 47 12 Op 12 50/0.000 - CDS 35447 - 35698 411 ## PROTEIN 
SUPPORTED gi|19704957|ref|NP_602452.1| SSU ribosomal protein S17P 48 12 Op 13 50/0.000 - CDS 35734 - 35916 291 ## PROTEIN SUPPORTED gi|34764030|ref|ZP_00144916.1| LSU ribosomal protein L29P 49 12 Op 14 50/0.000 - CDS 35916 - 36347 746 ## PROTEIN SUPPORTED gi|19704959|ref|NP_602454.1| 50S ribosomal protein L16 50 12 Op 15 61/0.000 - CDS 36350 - 37009 1111 ## PROTEIN SUPPORTED gi|19704960|ref|NP_602455.1| SSU ribosomal protein S3P 51 12 Op 16 59/0.000 - CDS 37028 - 37363 530 ## PROTEIN SUPPORTED gi|19704961|ref|NP_602456.1| 50S ribosomal protein L22P 52 12 Op 17 60/0.000 - CDS 37398 - 37673 492 ## PROTEIN SUPPORTED gi|19704962|ref|NP_602457.1| SSU ribosomal protein S19P 53 12 Op 18 61/0.000 - CDS 37698 - 38528 1448 ## PROTEIN SUPPORTED gi|19704963|ref|NP_602458.1| 50S ribosomal protein L2 54 12 Op 19 61/0.000 - CDS 38585 - 38872 471 ## PROTEIN SUPPORTED gi|19704964|ref|NP_602459.1| 50S ribosomal protein L23P 55 12 Op 20 58/0.000 - CDS 38872 - 39501 1045 ## PROTEIN SUPPORTED gi|19704965|ref|NP_602460.1| 50S ribosomal protein L4 56 12 Op 21 40/0.000 - CDS 39521 - 40156 1075 ## PROTEIN SUPPORTED gi|19704966|ref|NP_602461.1| 50S ribosomal protein L3P - Prom 40217 - 40276 3.2 - Term 40182 - 40230 1.2 57 12 Op 22 . - CDS 40304 - 40615 508 ## PROTEIN SUPPORTED gi|19704967|ref|NP_602462.1| SSU ribosomal protein S10P - Prom 40818 - 40877 13.6 + Prom 40687 - 40746 13.3 58 13 Tu 1 . 
+ CDS 40861 - 41307 525 ## FN1647 hypothetical protein + Term 41336 - 41397 13.4 59 14 Op 1 44/0.000 - CDS 41402 - 42172 213 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 60 14 Op 2 44/0.000 - CDS 42182 - 42970 214 ## PROTEIN SUPPORTED gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) 61 14 Op 3 49/0.000 - CDS 42973 - 43734 891 ## COG1173 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 62 14 Op 4 38/0.000 - CDS 43731 - 44642 362 ## COG0601 ABC-type dipeptide/oligopeptide/nickel transport systems, permease components - Prom 44821 - 44880 5.3 63 14 Op 5 1/0.857 - CDS 44902 - 46401 2070 ## COG0747 ABC-type dipeptide transport system, periplasmic component - Prom 46434 - 46493 11.5 64 15 Tu 1 . - CDS 46513 - 47850 1100 ## COG0534 Na+-driven multidrug efflux pump - Prom 47922 - 47981 13.0 - Term 47963 - 48013 2.2 65 16 Tu 1 . - CDS 48024 - 49658 2193 ## FN1654 hypothetical protein - Prom 49685 - 49744 8.1 + Prom 49709 - 49768 11.2 66 17 Tu 1 . + CDS 49800 - 51329 2010 ## COG2461 Uncharacterized conserved protein + Term 51537 - 51600 1.4 - Term 51537 - 51572 -0.6 67 18 Tu 1 . - CDS 51584 - 52360 1257 ## COG0501 Zn-dependent protease with chaperone function - Prom 52477 - 52536 6.5 - Term 52513 - 52547 1.1 68 19 Op 1 11/0.000 - CDS 52565 - 52783 352 ## PROTEIN SUPPORTED gi|19704977|ref|NP_602472.1| SSU ribosomal protein S18P 69 19 Op 2 1/0.857 - CDS 52828 - 53145 525 ## PROTEIN SUPPORTED gi|19704978|ref|NP_602473.1| SSU ribosomal protein S6P 70 19 Op 3 . - CDS 53211 - 54914 2460 ## COG0442 Prolyl-tRNA synthetase 71 19 Op 4 1/0.857 - CDS 54984 - 57053 2263 ## COG1200 RecG-like helicase - Term 57066 - 57102 5.9 72 19 Op 5 1/0.857 - CDS 57105 - 57854 1430 ## COG0217 Uncharacterized conserved protein - Prom 57881 - 57940 7.8 73 20 Op 1 . 
- CDS 58034 - 59488 1416 ## COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 74 20 Op 2 . - CDS 59506 - 60912 1359 ## FN1663 hypothetical protein 75 20 Op 3 . - CDS 60878 - 60979 81 ## 76 21 Op 1 . + CDS 61307 - 61648 281 ## FN1664 hypothetical protein 77 21 Op 2 . + CDS 61705 - 62076 404 ## COG3093 Plasmid maintenance system antidote protein 78 21 Op 3 . + CDS 62086 - 62439 180 ## FN1666 hypothetical protein + Term 62446 - 62491 3.3 - Term 62432 - 62479 1.2 79 22 Tu 1 . - CDS 62485 - 63687 1598 ## COG1088 dTDP-D-glucose 4,6-dehydratase - Prom 63722 - 63781 11.5 80 23 Op 1 . - CDS 63790 - 65187 1197 ## AZL_019960 alginate O-acetyltransferase 81 23 Op 2 . - CDS 65206 - 66585 986 ## COG1696 Predicted membrane protein involved in D-alanine export - Prom 66654 - 66713 10.1 82 24 Op 1 . - CDS 66734 - 67807 513 ## gi|296329245|ref|ZP_06871746.1| conserved hypothetical protein 83 24 Op 2 . - CDS 67812 - 69332 1062 ## COG0728 Uncharacterized membrane protein, putative virulence factor 84 24 Op 3 . 
- CDS 69346 - 70470 1672 ## COG0037 Predicted ATPase of the PP-loop superfamily implicated in cell cycle control 85 24 Op 4 1/0.857 - CDS 70467 - 71669 1260 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 86 24 Op 5 1/0.857 - CDS 71662 - 73188 1662 ## COG2089 Sialic acid synthase 87 24 Op 6 3/0.000 - CDS 73190 - 74137 1159 ## COG3980 Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase 88 24 Op 7 1/0.857 - CDS 74139 - 75920 2047 ## COG1861 Spore coat polysaccharide biosynthesis protein F, CMP-KDO synthetase homolog - Prom 75946 - 76005 7.9 89 25 Op 1 2/0.000 - CDS 76026 - 76823 875 ## COG1028 Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 90 25 Op 2 1/0.857 - CDS 76823 - 77695 1016 ## COG0667 Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 91 25 Op 3 . - CDS 77714 - 78724 1243 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases 92 25 Op 4 . - CDS 78718 - 80130 1045 ## mru_1879 sialyltransferase 93 25 Op 5 . - CDS 80096 - 81205 734 ## gi|296329256|ref|ZP_06871757.1| membrane protein 94 25 Op 6 . - CDS 81238 - 82317 1401 ## COG0673 Predicted dehydrogenases and related proteins 95 25 Op 7 . - CDS 82325 - 83536 1258 ## COG0438 Glycosyltransferase 96 25 Op 8 . - CDS 83549 - 84211 814 ## COG0223 Methionyl-tRNA formyltransferase 97 25 Op 9 . - CDS 84232 - 85185 1107 ## COG1044 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 98 25 Op 10 . - CDS 85172 - 85906 423 ## PROTEIN SUPPORTED gi|227410568|ref|ZP_03893770.1| acetyltransferase, ribosomal protein N-acetylase 99 25 Op 11 . - CDS 85909 - 87117 1613 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 100 25 Op 12 . 
- CDS 87098 - 88396 1882 ## COG0677 UDP-N-acetyl-D-mannosaminuronate dehydrogenase 101 25 Op 13 5/0.000 - CDS 88411 - 89622 1496 ## COG0399 Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis 102 25 Op 14 2/0.000 - CDS 89641 - 90228 885 ## COG2148 Sugar transferases involved in lipopolysaccharide synthesis 103 25 Op 15 . - CDS 90231 - 92042 1962 ## COG1086 Predicted nucleoside-diphosphate sugar epimerases 104 25 Op 16 . - CDS 92054 - 93046 1287 ## FN1697 hypothetical protein 105 25 Op 17 9/0.000 - CDS 93036 - 93959 1322 ## COG1091 dTDP-4-dehydrorhamnose reductase 106 25 Op 18 . - CDS 93969 - 94217 300 ## COG1898 dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes 107 26 Op 1 . - CDS 94611 - 95165 587 ## FN1700 hypothetical protein 108 26 Op 2 . - CDS 95155 - 96504 1151 ## FN1701 ABC transporter ATP-binding protein 109 26 Op 3 1/0.857 - CDS 96530 - 97345 874 ## COG1968 Uncharacterized bacitracin resistance protein 110 26 Op 4 . - CDS 97360 - 98358 1417 ## COG0451 Nucleoside-diphosphate-sugar epimerases - Prom 98380 - 98439 12.7 + Prom 98389 - 98448 10.4 111 27 Op 1 . + CDS 98551 - 100833 2364 ## COG0729 Outer membrane protein 112 27 Op 2 3/0.000 + CDS 100849 - 101613 750 ## COG0730 Predicted permeases 113 27 Op 3 1/0.857 + CDS 101597 - 102604 325 ## PROTEIN SUPPORTED gi|15900011|ref|NP_344615.1| aldose 1-epimerase + Prom 102606 - 102665 8.8 114 28 Op 1 1/0.857 + CDS 102716 - 104815 1716 ## PROTEIN SUPPORTED gi|62291006|ref|YP_222799.1| polynucleotide phosphorylase/polyadenylase 115 28 Op 2 1/0.857 + CDS 104831 - 105394 302 ## PROTEIN SUPPORTED gi|229231897|ref|ZP_04356325.1| SSU ribosomal protein S12P methylthiotransferase 116 28 Op 3 1/0.857 + CDS 105408 - 105683 295 ## COG0762 Predicted integral membrane protein + Term 105691 - 105725 4.5 117 29 Op 1 . + CDS 105741 - 106685 1301 ## COG0275 Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis 118 29 Op 2 . 
+ CDS 106687 - 106944 321 ## FN1712 hypothetical protein 119 29 Op 3 . + CDS 106946 - 108577 1858 ## COG2265 SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase 120 29 Op 4 . + CDS 108596 - 108877 327 ## SZO_12680 membrane protein + Term 108903 - 108936 1.4 + Prom 108884 - 108943 4.7 121 30 Tu 1 . + CDS 108964 - 109761 893 ## SDEG_1349 hypothetical protein 122 31 Tu 1 . - CDS 109866 - 111041 771 ## COG3547 Transposase and inactivated derivatives - Prom 111213 - 111272 12.5 + Prom 111204 - 111263 11.7 123 32 Op 1 . + CDS 111317 - 111442 79 ## gi|296329286|ref|ZP_06871787.1| conserved hypothetical protein 124 32 Op 2 . + CDS 111495 - 111974 343 ## SEQ_0755 conjugative transposon conserved hypothetical protein + Term 111982 - 112021 7.3 + Prom 112064 - 112123 5.7 125 33 Op 1 2/0.000 + CDS 112144 - 113211 955 ## COG0464 ATPases of the AAA+ class 126 33 Op 2 . + CDS 113204 - 115438 2396 ## COG1404 Subtilisin-like serine proteases 127 33 Op 3 . + CDS 115476 - 116432 907 ## Sbal195_2015 hypothetical protein 128 33 Op 4 . + CDS 116422 - 116967 289 ## gi|296329291|ref|ZP_06871792.1| conserved hypothetical protein + Term 117053 - 117100 6.2 129 34 Op 1 . - CDS 116977 - 117306 278 ## SZO_12770 relaxase/mobilisation protein 130 34 Op 2 . - CDS 117385 - 118308 1537 ## COG3843 Type IV secretory pathway, VirD2 components (relaxase) 131 34 Op 3 . - CDS 118311 - 118667 175 ## CD1870 putative conjugative transposon mobilization protein 132 34 Op 4 . - CDS 118712 - 118840 239 ## - Prom 118914 - 118973 8.3 + Prom 118884 - 118943 7.6 133 35 Tu 1 . + CDS 118965 - 119537 572 ## COG1309 Transcriptional regulator + Term 119554 - 119588 0.5 + Prom 119554 - 119613 8.2 134 36 Op 1 35/0.000 + CDS 119641 - 121380 184 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 135 36 Op 2 . + CDS 121373 - 123082 190 ## PROTEIN SUPPORTED gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 136 36 Op 3 . 
+ CDS 123058 - 123294 202 ## COG1122 ABC-type cobalt transport system, ATPase component + Prom 123469 - 123528 1.9 137 37 Op 1 . + CDS 123553 - 125532 1283 ## FN1714 hypothetical protein 138 37 Op 2 . + CDS 125616 - 126908 1287 ## COG1373 Predicted ATPase (AAA+ superfamily) + Term 126959 - 126997 -0.9 + Prom 126965 - 127024 9.3 139 38 Tu 1 . + CDS 127051 - 127620 578 ## FN1716 hypothetical protein + Prom 127622 - 127681 9.7 140 39 Op 1 1/0.857 + CDS 127725 - 129815 2739 ## COG0272 NAD-dependent DNA ligase (contains BRCT domain type II) 141 39 Op 2 . + CDS 129874 - 132483 3789 ## COG0653 Preprotein translocase subunit SecA (ATPase, RNA helicase) 142 39 Op 3 . + CDS 132496 - 133242 919 ## FN1719 hypothetical protein + Term 133258 - 133312 8.0 - Term 133246 - 133300 10.1 143 40 Op 1 24/0.000 - CDS 133307 - 134005 760 ## COG0357 Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division 144 40 Op 2 1/0.857 - CDS 134007 - 135908 2385 ## COG0445 NAD/FAD-utilizing enzyme apparently involved in cell division 145 40 Op 3 17/0.000 - CDS 135917 - 136573 864 ## COG0569 K+ transport systems, NAD-binding component 146 40 Op 4 1/0.857 - CDS 136588 - 137934 1352 ## COG0168 Trk-type K+ transport systems, membrane components 147 40 Op 5 1/0.857 - CDS 137949 - 139301 1544 ## COG0534 Na+-driven multidrug efflux pump 148 40 Op 6 1/0.857 - CDS 139325 - 140890 1829 ## COG0038 Chloride channel protein EriC - Prom 140920 - 140979 9.2 149 41 Op 1 1/0.857 - CDS 140984 - 141628 730 ## COG2039 Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase) 150 41 Op 2 9/0.000 - CDS 141600 - 142349 600 ## COG0115 Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 151 41 Op 3 35/0.000 - CDS 142343 - 143695 1575 ## COG0147 Anthranilate/para-aminobenzoate synthases component I 152 41 Op 4 1/0.857 - CDS 143679 - 144290 771 ## COG0512 Anthranilate/para-aminobenzoate synthases component II - Prom 144325 - 144384 7.5 153 42 Tu 1 1/0.857 
- CDS 144485 - 145276 1043 ## COG0253 Diaminopimelate epimerase - Prom 145304 - 145363 10.5 - Term 145333 - 145381 7.1 154 43 Op 1 16/0.000 - CDS 145434 - 146069 743 ## COG1394 Archaeal/vacuolar-type H+-ATPase subunit D 155 43 Op 2 16/0.000 - CDS 146080 - 147456 2183 ## COG1156 Archaeal/vacuolar-type H+-ATPase subunit B 156 43 Op 3 12/0.000 - CDS 147449 - 149218 2520 ## COG1155 Archaeal/vacuolar-type H+-ATPase subunit A 157 43 Op 4 13/0.000 - CDS 149236 - 149544 455 ## COG1436 Archaeal/vacuolar-type H+-ATPase subunit F 158 43 Op 5 11/0.000 - CDS 149537 - 150538 1339 ## COG1527 Archaeal/vacuolar-type H+-ATPase subunit C 159 43 Op 6 11/0.000 - CDS 150548 - 151099 640 ## COG1390 Archaeal/vacuolar-type H+-ATPase subunit E 160 43 Op 7 . - CDS 151115 - 151597 891 ## COG0636 F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K 161 43 Op 8 . - CDS 151641 - 151748 56 ## 162 43 Op 9 . - CDS 151758 - 153674 1918 ## COG1269 Archaeal/vacuolar-type H+-ATPase subunit I 163 43 Op 10 . - CDS 153661 - 153987 504 ## FN1742 V-type sodium ATP synthase subunit G (EC:3.6.3.15) - Prom 154046 - 154105 13.8 - Term 154180 - 154222 -0.4 164 44 Op 1 2/0.000 - CDS 154229 - 155035 862 ## COG0789 Predicted transcriptional regulators - Prom 155164 - 155223 10.3 165 44 Op 2 1/0.857 - CDS 155236 - 156117 711 ## COG0697 Permeases of the drug/metabolite transporter (DMT) superfamily 166 44 Op 3 . 
- CDS 156150 - 157340 1795 ## COG0626 Cystathionine beta-lyases/cystathionine gamma-synthases - Prom 157415 - 157474 12.6 + Prom 157508 - 157567 14.7 167 45 Op 1 1/0.857 + CDS 157588 - 158847 1598 ## COG0626 Cystathionine beta-lyases/cystathionine gamma-synthases 168 45 Op 2 1/0.857 + CDS 158915 - 160429 648 ## PROTEIN SUPPORTED gi|145634045|ref|ZP_01789756.1| 50S ribosomal protein L21 169 45 Op 3 1/0.857 + CDS 160505 - 160726 220 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit 170 45 Op 4 23/0.000 + CDS 160774 - 160872 153 ## COG0674 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit + Prom 160878 - 160937 5.5 171 45 Op 5 1/0.857 + CDS 161058 - 161246 181 ## COG1013 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit 172 45 Op 6 . + CDS 161314 - 161610 332 ## COG1013 Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit + Term 161636 - 161684 4.2 - Term 161624 - 161672 4.2 173 46 Op 1 . - CDS 161711 - 161818 110 ## 174 46 Op 2 3/0.000 - CDS 161788 - 162408 803 ## COG0352 Thiamine monophosphate synthase 175 46 Op 3 5/0.000 - CDS 162411 - 163541 1166 ## COG1060 Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 176 46 Op 4 5/0.000 - CDS 163541 - 164314 1135 ## COG2022 Uncharacterized enzyme of thiazole biosynthesis 177 46 Op 5 5/0.000 - CDS 164315 - 164935 825 ## COG0476 Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 178 46 Op 6 1/0.857 - CDS 164939 - 165145 289 ## COG2104 Sulfur transfer protein involved in thiamine biosynthesis 179 46 Op 7 8/0.000 - CDS 165146 - 166447 2228 ## COG0422 Thiamine biosynthesis protein ThiC 180 46 Op 8 11/0.000 - CDS 166464 - 167084 770 ## COG0352 Thiamine monophosphate synthase 181 46 Op 9 . 
- CDS 167094 - 167927 1055 ## COG0351 Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase - Prom 167952 - 168011 9.5 - Term 167945 - 167986 1.6 182 47 Op 1 . - CDS 168052 - 168147 129 ## 183 47 Op 2 . - CDS 168189 - 168266 87 ## + Prom 168086 - 168145 16.3 184 48 Tu 1 . + CDS 168234 - 168407 813 ## - TRNA 168235 - 168309 72.4 # Gln TTG 0 0 - TRNA 168316 - 168392 86.7 # Pro TGG 0 0 185 49 Op 1 1/0.857 - CDS 168456 - 169202 768 ## COG3022 Uncharacterized protein conserved in bacteria 186 49 Op 2 1/0.857 - CDS 169206 - 169760 637 ## COG3758 Uncharacterized protein conserved in bacteria - Prom 169805 - 169864 12.2 - Term 169840 - 169874 -0.1 187 50 Op 1 2/0.000 - CDS 169891 - 171195 2252 ## COG0148 Enolase 188 50 Op 2 . - CDS 171221 - 172639 1970 ## COG0469 Pyruvate kinase - Prom 172668 - 172727 12.2 - Term 172733 - 172770 3.4 189 51 Op 1 . - CDS 172777 - 172845 76 ## - TRNA 172784 - 172860 91.8 # Met CAT 0 0 190 51 Op 2 . - CDS 172838 - 173020 622 ## - TRNA 172865 - 172941 89.3 # Ala TGC 0 0 - TRNA 172960 - 173035 91.2 # Gly GCC 0 0 191 51 Op 3 . - CDS 173026 - 173130 425 ## - TRNA 173053 - 173136 68.7 # Leu TAG 0 0 192 52 Tu 1 . + CDS 173137 - 173235 332 ## - TRNA 173152 - 173227 81.3 # Thr TGT 0 0 193 53 Tu 1 . - CDS 173190 - 173495 714 ## - Prom 173644 - 173703 8.7 - TRNA 173236 - 173312 95.0 # Asp GTC 0 0 - TRNA 173323 - 173398 97.8 # Val TAC 0 0 - TRNA 173403 - 173477 66.8 # Glu TTC 0 0 - TRNA 173485 - 173560 92.5 # Lys CTT 0 0 + Prom 173351 - 173410 2.4 194 54 Op 1 . + CDS 173554 - 173652 325 ## - TRNA 173568 - 173643 93.2 # Gly TCC 0 0 - TRNA 173695 - 173768 68.6 # Cys GCA 0 0 195 54 Op 2 . + CDS 173733 - 174008 1119 ## - TRNA 173784 - 173859 87.4 # Phe GAA 0 0 - TRNA 173878 - 173954 93.9 # Asp GTC 0 0 - TRNA 173965 - 174040 97.8 # Val TAC 0 0 196 55 Tu 1 . 
- CDS 174119 - 174862 1068 ## FN1780 hypothetical protein - Term 174879 - 174910 2.5 197 56 Op 1 1/0.857 - CDS 174930 - 177413 4036 ## PROTEIN SUPPORTED gi|34762725|ref|ZP_00143715.1| LytB protein; SSU ribosomal protein S1P 198 56 Op 2 . - CDS 177419 - 177688 430 ## COG1925 Phosphotransferase system, HPr-related proteins - Prom 177892 - 177951 10.3 + Prom 177756 - 177815 15.1 199 57 Tu 1 . + CDS 177874 - 178698 1218 ## COG4820 Ethanolamine utilization protein, possible chaperonin + Term 178708 - 178749 1.3 - Term 178868 - 178911 3.2 200 58 Op 1 . - CDS 178930 - 179385 651 ## FN1784 hypothetical protein 201 58 Op 2 . - CDS 179407 - 179880 418 ## FN1785 hypothetical protein - Prom 179966 - 180025 8.9 + Prom 179850 - 179909 12.0 202 59 Op 1 . + CDS 180021 - 181001 1268 ## COG2870 ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase 203 59 Op 2 2/0.000 + CDS 180995 - 181489 807 ## COG0245 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 204 59 Op 3 1/0.857 + CDS 181505 - 182857 1372 ## COG0534 Na+-driven multidrug efflux pump 205 59 Op 4 1/0.857 + CDS 182869 - 183390 674 ## COG2109 ATP:corrinoid adenosyltransferase 206 59 Op 5 . + CDS 183399 - 184157 895 ## COG0494 NTP pyrophosphohydrolases including oxidative damage repair enzymes + Term 184160 - 184199 5.3 - Term 184139 - 184193 9.1 207 60 Tu 1 . - CDS 184207 - 184602 763 ## FN1792 hypothetical protein - Prom 184734 - 184793 6.9 208 61 Op 1 25/0.000 - CDS 184842 - 186569 2484 ## COG1080 Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) - Term 186592 - 186623 0.1 209 61 Op 2 . - CDS 186636 - 186899 384 ## COG1925 Phosphotransferase system, HPr-related proteins - Prom 186946 - 187005 6.4 210 62 Tu 1 . 
- CDS 187047 - 188636 1754 ## FN1795 hypothetical protein - Prom 188657 - 188716 4.1 Predicted protein(s) >gi|296153520|gb|ADVK01000055.1| GENE 1 3 - 525 791 174 aa, chain - ## HITS:1 COG:no KEGG:FN1590 NR:ns ## KEGG: FN1590 # Name: not_defined # Def: lipoprotein # Organism: F.nucleatum # Pathway: not_defined # 3 174 1 172 414 330 98.0 2e-89 MKMKKILFSLLTIFMLVIAVACGKKEAPTEDANTQQGTTSEATQDYHIGVVTISVSQAED NFRGAEAVAKKYGLSSEGGKITVVTIPDNFMQEQETTISQMVSLADDPKMKAIVVAEGVP GTYPAFKAIREKRPDILLFVNNNHEDPVQVSTVADVVVNSDSIARGYLIVKTAH >gi|296153520|gb|ADVK01000055.1| GENE 2 681 - 1838 1975 385 aa, chain - ## HITS:1 COG:FN1591 KEGG:ns NR:ns ## COG: FN1591 COG2878 # Protein_GI_number: 19704912 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfB # Organism: Fusobacterium nucleatum # 1 385 1 385 385 599 100.0 1e-171 MEAIMMPVAVLGITGVLMGLFLAYASKKFEVEVDPKVEAILAILPGANCGACGFPGCAGY ASGVALEGAKMTLCAPGGPKVIEKIGEIMGVAVEIPVKKKPAKKPVEKKEAPKAQTGEPI SASQEFIEKNKRMLMKFKEAFDAGDKEGFEKLENLAKMAKKDELLKYYEEIKAGKIVPDG SAPVAAGTTNANAISASKEFVEKNKRMLMKFKEAFDAGDKEGFEKLENLAKMAKKDELLK FYEEIKAGKIVPDPATMTDAVAAVKAEVISATKEFVEKNKRMLMKFKEAFDAGDKEGFEK LENLAKMAKKDELLKFYEEIKAGKTVPDPATMTDTPAAKQEAPKVEDTKKQEASYCSVLG DGLCVPEQNEKVKENLHQEIDKEVK >gi|296153520|gb|ADVK01000055.1| GENE 3 1865 - 2449 858 194 aa, chain - ## HITS:1 COG:FN1592 KEGG:ns NR:ns ## COG: FN1592 COG4657 # Protein_GI_number: 19704913 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfA # Organism: Fusobacterium nucleatum # 1 194 1 194 194 290 100.0 8e-79 MSIGGLFSIIITSIFINNIIFAKFLGCCPFMGVSKKVDSSLGMGMAVTFVITIASGVTWI VYRKILEPLGLGYLQTIAFILIIASLVQFVEMAIKKTSPSLYKALGVFLPLITTNCAVLG VAIINIQEGYNFIETIVNGFGVAVGFSLALLLLAGIRERLEYANIPKNFKGVPIAFITAG LLAMAFMGFSGMQI >gi|296153520|gb|ADVK01000055.1| GENE 4 2446 - 3063 926 205 aa, chain - ## HITS:1 COG:FN1593 KEGG:ns NR:ns ## COG: FN1593 COG4660 # Protein_GI_number: 19704914 # Func_class: C Energy production 
and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfE # Organism: Fusobacterium nucleatum # 1 191 1 191 205 312 100.0 3e-85 MKKLGVLTAGIFKENPVFVLMLGLCPTLGVTSSAINGFSMGLAVIAVLACSNGLISLFKK FIPDEVRIPAFIMIIATLVTVVDMVMNAYTPDLYKVLGLFIPLIVVNCIVLGRAESFASK NGVIDSILDGIGSGIGFTLSLTFLGSVREILGNGSVFGISLVPANFTPALIFILAPGGFI TIGIIMACINMKKERDAKKKKVTKK >gi|296153520|gb|ADVK01000055.1| GENE 5 3063 - 3596 1018 177 aa, chain - ## HITS:1 COG:FN1594 KEGG:ns NR:ns ## COG: FN1594 COG4659 # Protein_GI_number: 19704915 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfG # Organism: Fusobacterium nucleatum # 1 177 1 177 177 326 99.0 2e-89 MENRYIHFGIVLGLIAAISAGLLGGVNDFTSKVIAENTLKIVNEARKEVLPEATSFKEDE AKEADGMQYIPGFNDAGEVVGYVASVTEAGYGGDINFVVGIDKDAKVTGLNVVTSSETPG LGAKINGKEWQEHWIGKDSTYEFNKSVDAFAGATISPSAVYRGVIRALNTYQNEVSK >gi|296153520|gb|ADVK01000055.1| GENE 6 3586 - 4530 1369 314 aa, chain - ## HITS:1 COG:FN1595 KEGG:ns NR:ns ## COG: FN1595 COG4658 # Protein_GI_number: 19704916 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfD # Organism: Fusobacterium nucleatum # 1 314 1 314 314 573 99.0 1e-163 MSTILKTGPAPHIRTAETVESVMYDVVIALIPAFAMAIYTFGVRALILTAVSVLTCILTE YLCQKALKRDIEAFDGSAILTGILFSFVVPAIMPLQYVVVGNIVAITLGKMVYGGLGHNI FNPALVGRAFVQASWPVAITTFAFDGKAGATVLDAMKRGIPLSDALLENTNQYIDAFLGQ MGGCLGETSSLALLIGGAYLIYKKHIDWKVPAVMIGTVFVLTWAMGADPLMQIFSGGLFL GAFFMATDMVTSPTTSKGRVVFALGLGVLISLIRMKGGYPEGTAYAILIMNGVVPLIDRY IRPKKFGGVSKNGK >gi|296153520|gb|ADVK01000055.1| GENE 7 4556 - 5863 1744 435 aa, chain - ## HITS:1 COG:FN1596 KEGG:ns NR:ns ## COG: FN1596 COG4656 # Protein_GI_number: 19704917 # Func_class: C Energy production and conversion # Function: Predicted NADH:ubiquinone oxidoreductase, subunit RnfC # Organism: Fusobacterium nucleatum # 1 435 7 441 441 838 99.0 0 MKFFGFRGGVHPPENKIQTEHLPIEKLESPDEIFVPLLQHIGAPLNPLVNVGDRVLKGQK 
IADAEGLAVPVHSPVSGAVTKIESRVFPLTGKVMTIFIENDKKEEWAELSKIENWEEADK KALLDIIREKGIVGIGGATFPTHVKLNPPPNTQLDSLILNGAECEPYLNSDNRLMLENPK SIVEGIKIIKKILNVPNVYVGIEDNKPEAIESMRKATEGTGINIVPLKTKYPQGGEKQLI KSILDRQVPSGQLPSAVGVVVQNTGTAAAIYEAVVNGKPLIEKVVTVSGKAIKNPKNVKV AIGTPFSYILDNCGINREEMARLVMGGPMMGLAQMTEDATVIKGTSGLLALTNEEMRPYK TKACISCSKCVSACPMGLAPLMFDRLAAAKEYEEMAAHNLMDCIECGSCAYICPANRPLA ESIKTGKAKLRAKKK >gi|296153520|gb|ADVK01000055.1| GENE 8 5938 - 6513 856 191 aa, chain - ## HITS:1 COG:FN1597 KEGG:ns NR:ns ## COG: FN1597 COG0193 # Protein_GI_number: 19704918 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Peptidyl-tRNA hydrolase # Organism: Fusobacterium nucleatum # 1 191 1 191 191 327 96.0 8e-90 MKVVIGLGNPGKKYEKTRHNIGFIAVDNLRKKFNIIDEREKFQALVSEKNIDGEKVIFFK PQTFMNLSGNSVIEIVNFYKLDPKKDIIVIYDDMDLPFGDIRIREKGSSGGHNGIKSIIS HIGEEFIRIKCGIGAKEKDAVEHVLGEFNQTEQKDLDEILENINNCVIEILSVQNLDRIM QKYNKKKEILK >gi|296153520|gb|ADVK01000055.1| GENE 9 6682 - 6861 255 59 aa, chain - ## HITS:1 COG:no KEGG:FN0289 NR:ns ## KEGG: FN0289 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 59 250 308 308 89 83.0 5e-17 MNFEEIDFYIFYVNYLSKKEDEDEKVLVGYNGVDGKEVSMSKLKEDINQIRDSRSAFKD >gi|296153520|gb|ADVK01000055.1| GENE 10 6977 - 7426 507 149 aa, chain - ## HITS:1 COG:no KEGG:FN1599 NR:ns ## KEGG: FN1599 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 149 1 168 168 230 79.0 2e-59 MITLEDFKNNNLKINWKVIDIGCLGSEIFKNKLILRIIASDRDEYQETGYLVKELANMEK SEYKIAFEKWELVYVKKNFPKLNKNIIQGLIELNDLWFRLDFPEDSPYIFQGVKNNISPQ EYYTEENYIYLYNRHLNWIRDKSNYLNGK >gi|296153520|gb|ADVK01000055.1| GENE 11 7712 - 8455 888 247 aa, chain - ## HITS:1 COG:FN1600 KEGG:ns NR:ns ## COG: FN1600 COG0101 # Protein_GI_number: 19704921 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Pseudouridylate synthase # Organism: Fusobacterium nucleatum # 1 247 1 247 247 449 99.0 1e-126 
MLMGRKNIKIEFRYDGSSYYGFQRQPNKITVQGEIEKVLRIVTKEEINLISAGRTDRGVH ANHQVSNFYTSSNIPIEKYKYLLTRALPNDIDILSVEEVDENFNARHNAKMREYIYIISW EKNPFEARYCKFVKEKIVAEKLEKIFSDFIGVHDFKNFRLSDCVSKVTVREIYQIEVKYF GENKIKIYIKGSAFLKSQVRIMVGTALEIYYGRLLENHIRLMLNDFTKEYKKNLVEAEGL YLNKIEY >gi|296153520|gb|ADVK01000055.1| GENE 12 8475 - 9551 1360 358 aa, chain - ## HITS:1 COG:FN1601 KEGG:ns NR:ns ## COG: FN1601 COG2404 # Protein_GI_number: 19704922 # Func_class: R General function prediction only # Function: Predicted phosphohydrolase (DHH superfamily) # Organism: Fusobacterium nucleatum # 1 358 1 358 358 699 99.0 0 MADILCDTRLKSEEAPKVIILTHGDADGLVSAMIVKAFEELQNKNKTFLIMSSMDVTLEQ TDKTFDYICKYTSLGSKDRVYILDRPIPSVEWLKMKYLAYTNVINIDHHLTNNPTIYKDE CCCDDIYFYWDDKLSAAYLTLEWFKPLIEKGENYKKMYEKLEPLAEATSCWDIFTWKNLG NSPKELLLKKRALSINSAEKILGAGAFYNFITKKLNSKNYTEEVFDYFMLLDEAYNMKID NLYDFAKRVISDFDYKGHKLGIIYGIDGDYQSIIGDKILDDKKLDYEIVAFLNVYGTVSF RSKNNIDVSEIAKKLGVLVGYSGGGHKHASGCRICDKDEMKKKMMEIFEHSMNKIKVL >gi|296153520|gb|ADVK01000055.1| GENE 13 9666 - 10136 597 156 aa, chain - ## HITS:1 COG:FN1602 KEGG:ns NR:ns ## COG: FN1602 COG1683 # Protein_GI_number: 19704923 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 156 1 156 156 267 94.0 5e-72 MKKKIKVLISACLLGDNVKYSGGNNLTLELVILLEKYNVDIVKVCPECFAGLSIPRVPSE IKEDKVYSKDGRDITEEFLVGAEKISKIAKEKKVDFAILKERSPSCGSSYIYDGSFSGKV IQGQGLTVRKLNEENIVIFSEENLEEIEKYLQVLNK >gi|296153520|gb|ADVK01000055.1| GENE 14 10120 - 12312 2580 730 aa, chain - ## HITS:1 COG:FN1603_3 KEGG:ns NR:ns ## COG: FN1603_3 COG5324 # Protein_GI_number: 19704924 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 411 677 1 267 273 497 98.0 1e-140 MRTLLLLRGAQASGKSTWVTENNLEPYTLNADKIRLNIANPILHEDGSIEISQKNNKLTW ELLYKYLEMRMENGDFTIIDATHSDIKLMNKYRDLANIYKYTIYYLEFDTPLEECLKRNK ERVGYKYVPEKVIERTWETIKNNEKLPSVLKKINSIDEIINFYTADVNEYKKVIIIGDIH SCAEPLKEVLKDFSEENLYIFVGDYFDRGIQAVETFKIMLDLLEKPNVILIEGNHENDSV 
KKFINNEEKYTKSFDETTLQPLLKEFELEYIKTGLKKIYKRLRQCFTFEFRGKKFLCTHG GLPLVPKLALVSAKEMIKGVGRYETEIGEVYSENYKKGLCQDFIQVHGHRGINDGEYSYC LEGRVEFGEELKVLTIDNDGNIEKSGIKNDVYNRGLIITTRDNSEKIKKFQTENELINEM IASSFINVKECDYNLISLNFNRDAFNRKKWNDLTIKARGLFVDRDSGEVKIRSYNKFFNY GERNINLRYLYKYATYPIRVFKKYNGFLGLASVINGDVVLTSKSVTSGKYKDIFQSIWDK VESEVKELLKQTMIENNCTVVFEVVSPEYDPHIIKYDKEHLYLLDFIENKLDLDTHNIDL EFSEKLMKKVKFSSTILTKKEELRRLENYDELYNFLHEKTMSLEEFEGYVLCDNSGLMFK FKLPYYNLWKTRRAWLERYRTALLKGKRIEIKDIEKDENRHFKKFLLKLGKDKLQGLSII DVKELYEKEN >gi|296153520|gb|ADVK01000055.1| GENE 15 12309 - 12488 91 59 aa, chain - ## HITS:1 COG:no KEGG:FN1604 NR:ns ## KEGG: FN1604 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 6 59 1 54 54 72 98.0 6e-12 MFEADLSASLLKLQRTLDFLSLRNLASSELFFTILSDCNELFLLHLLQQPLFYIKKGKL >gi|296153520|gb|ADVK01000055.1| GENE 16 12650 - 13927 2220 425 aa, chain - ## HITS:1 COG:FN1605 KEGG:ns NR:ns ## COG: FN1605 COG0104 # Protein_GI_number: 19704926 # Func_class: F Nucleotide transport and metabolism # Function: Adenylosuccinate synthase # Organism: Fusobacterium nucleatum # 1 425 1 425 425 833 99.0 0 MAGYVVVGTQWGDEGKGKIIDVLSEKADYVVRFQGGNNAGHTVVVDGEKFILQLLPSGVL QAGTCVIGPGVVIDPKVFLDEIDRIEKRGAKTDHVIISDRAHVVMPYHIEMDKIRESVED RIKIGTTKKGIGPCYADKISRDGIRMSDLLDLKQFEEKLRYNLKEKNEIFTKIYGLEPLD FDTIFEEYKGYAEKIKHRIVDTIPIVNKALDENKLVLFEGAQAMMLDINYGTYPYVTSSS PTLGGVTTGAGISPRKIDKGIGVMKAYTTRVGEGPFVTEIKGEFGDKIRGIGGEYGAVTG RPRRCGWLDLVVGRYATEINGLTDIVMTKIDVLSGLGKLKICTAYEIDGKIHEYVPADTK SLDRAIPIYEELDGWDEDITQIKKYEDLPVNCRKYLERVQEILGCPISVVSVGPDRSQNI HIREI >gi|296153520|gb|ADVK01000055.1| GENE 17 13937 - 15859 2018 640 aa, chain - ## HITS:1 COG:FN1606_1 KEGG:ns NR:ns ## COG: FN1606_1 COG1519 # Protein_GI_number: 19704927 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: 3-deoxy-D-manno-octulosonic-acid transferase # Organism: Fusobacterium nucleatum # 1 426 1 426 426 691 96.0 0 MYNLLRKIALTLYRPFMKEKMKTFINKRLSQDFSDLKDEEYIWIHCSSVGEVNLSEDLVK 
KFYSISRKNILISVFTDTGYETAVKKYSDKKKIKVIYFPVDDKKKINEILNKIKLKLLAL VETELWPNLINETKKKSLRIIVVNGRISDRSYPRYKKLKFLLKSMLQKINFFYMQSEIDK ERIINLGAIKEKVENVGNLKFSISLEKYSDIEKEEYRKFLNIGDRKVFVAGSTRTGEDEV ILDVFKRLKNYVLIIVPRHLDRLAKIENLIKENNLTYVKYSELENNISTGKENIILVDKM GVLRKLYSISDIAFVGGTLVNIGGHNLLEPLFYRKTVIFGKYTQNVVDIAKEILRRKIGF QVENVEEFAKAIETIENEKNSDEEINSFFEENRLIALNIVKKENLIMNNIKEEAKDLWKH FFHSEKSNYNIYMYKLLDYPEYIMYDNDVMKEKKSKWNEYFGNSNPIAVEIGTGSGNFMY QLAEKNPNKNFIGLELRFKRLVLATQKCKKRNLKNVAFLRKRGEELEDFLAENEISEMYI NFPDPWEGTEKNRIIQERLFKTLDKIMKKDGMLYFKTDHDIYYSDVLELVKTLDNYEVIY HTSDLHNSEKAENNIKTEFEQLFLHKHNKNINYIEIKKIV >gi|296153520|gb|ADVK01000055.1| GENE 18 15888 - 16544 894 218 aa, chain - ## HITS:1 COG:FN1607 KEGG:ns NR:ns ## COG: FN1607 COG0283 # Protein_GI_number: 19704928 # Func_class: F Nucleotide transport and metabolism # Function: Cytidylate kinase # Organism: Fusobacterium nucleatum # 1 218 1 218 218 324 98.0 9e-89 MDNIIVAIDGPAGSGKSTIAKLIAKKFNFTYIDTGAMYRMITLYLLENNIDFDDLKEIEK ALKNINLDMQEDKFYLNGIDVSTKIREKRINENVSKVASIKIVRDNLVNLQRKISNNKNV ILDGRDIGTVVFPNAKVKIFLVAAAEERARRRYNEFLEKKVEITYDEVLKSLKERDYIDS TRKESPLKKADEAIELDTTNLTIEDVINFISKKIEKVK >gi|296153520|gb|ADVK01000055.1| GENE 19 16560 - 17498 1568 312 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704929|ref|NP_602424.1| ribosomal protein L11 methyltransferase [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 312 1 312 312 608 99 1e-173 MKMKVLEAKIIYESDDLEKYKKIISDIFYSFGVTGLKIEEPILNKDPLNFYKDEKQFLIS ENSVSAYFPLNIYSEKRKKVLEETFAEKFSEDEDIVYNLDFYEYDEEDYQNSWKKYLFVE KVSEKFVVKPTWREYEKQDNELVIELDPGRAFGTGSHPTTSLLLKLMEEQDFSNKSVIDI GTGSGILMIAGKILGAGEVYGTDIDEFSMEVAKENLILNNISLNDVKLLKGNLLEVIENK KFDIVVCNILADILVKLLDEIKYILKENSIVLFSGIIEDKLNEVISKAEEVGLEVVEVKT DKEWRAVYFKRK >gi|296153520|gb|ADVK01000055.1| GENE 20 17473 - 18264 1163 263 aa, chain - ## HITS:1 COG:FN1609 KEGG:ns NR:ns ## COG: FN1609 COG1692 # Protein_GI_number: 19704930 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 263 1 263 263 518 100.0 1e-147 MKVLVVGDIVGRPGRNTLQVFLEKYKDNYDFIIVNGENSAAGFGITIKIADEFLSWGVDV ISGGNHSWDKKEIYEYMNNSDRILRPANYPEGVSGKGYTILEDKKGNKIALISLQGRVFM SAVDCPFRTAKKLIDEISKITKNIIVDFHAEATSEKIALGKYLDGDISLFYGTHTHVQTA DERILNNGTGYISDVGMTGSQNGVIGTNLETIINKFLTSLPQKFEVAEGDEQLCGIEVEI DEKTGKCQKIERIKWSENEGFRS >gi|296153520|gb|ADVK01000055.1| GENE 21 18391 - 19248 1120 285 aa, chain - ## HITS:1 COG:FN1610 KEGG:ns NR:ns ## COG: FN1610 COG1281 # Protein_GI_number: 19704931 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Disulfide bond chaperones of the HSP33 family # Organism: Fusobacterium nucleatum # 1 285 1 285 285 521 98.0 1e-148 MGRLIRGVSKNARFFVADTTDVIQEALDIHKYDEYSMKIFGKFCTLASLMGATLKGEDKL TIRTDTDGYIKNIVVTSDANGNIKGYLVNTTDENFDGLGKGTMRIIKDMGLKEPYVAISN IDYSNLPNDISAFFYNSEQIPSVISLAVECTNDGKILCAGAFMVQLLPNADEDFITKLER KAEAIRPMNELMKGGMSLERIINLLYDDMDTADDSLVEEYEILEEKEIKYSCDCNSERFQ KGIMTLGKEELKHIFEEEKEIEAECQFCGKKYKFTEKDFEDILKK >gi|296153520|gb|ADVK01000055.1| GENE 22 19248 - 19778 581 176 aa, chain - ## HITS:1 COG:FN1611 KEGG:ns NR:ns ## COG: FN1611 COG1555 # Protein_GI_number: 19704932 # Func_class: L Replication, recombination and repair # Function: DNA uptake protein and related DNA-binding proteins # Organism: Fusobacterium nucleatum # 18 176 1 159 159 246 100.0 1e-65 
MKKIMLLLGIFSLFSLNMYSAPDFSNNDYKIIMSSQNMKDEKEELMDINKVSEQEMLARK VSKSYVSKIIEYREITGGFDKLEDMKRIKGIGNATYQKLSKVFKVASAPNKKMLNINSAD DMTLKYYGLSKKEIKRIQKYLDKNDRITDNIEFKKIVNKKTYERLKDLINYDGGKR >gi|296153520|gb|ADVK01000055.1| GENE 23 19775 - 21184 1705 469 aa, chain - ## HITS:1 COG:FN1612 KEGG:ns NR:ns ## COG: FN1612 COG0635 # Protein_GI_number: 19704933 # Func_class: H Coenzyme transport and metabolism # Function: Coproporphyrinogen III oxidase and related Fe-S oxidoreductases # Organism: Fusobacterium nucleatum # 1 469 1 469 469 820 97.0 0 MLIETNIEVNLRSIEEFTRVIVSELLEDKINFEILKEDNLIKIKVKSEKLNKNTEFSYID LGNKIEDQVLTMCKISLLKLLDKKYDWGSLMGVRPTKVLRRLLINGCDYEEARKILKDFY LVTDEKINLMETVVKKELELLDKEHINLYIGIPFCPTKCKYCSFASYEINGGVGRFYNDF VEALLKEIQIVGDFLKTYSKKVSSIYFGGGTPSTLTEEDLERVLKKLLENIDMSDVKEFT FEAGREDSLNAKKLEIMKKYSVDRISLNPQSFNLETLKRVNRRFNRENFDLIFKEAKKLE FIINMDLIIGLPEETTEEILDTLSQLKDYDIDNLTIHCLAFKRASKLFKESQERNVIDRA LIEKHIQKIVEEKAMKPYYMYRQKNIIEWGENIGYSKEGRESIFNIEMIEENQNTMALGG GGISKIVIEERNGIDYIERYVNPKDPALYIRELDKRCKEKIEMFKKEKI >gi|296153520|gb|ADVK01000055.1| GENE 24 21187 - 22137 1124 316 aa, chain - ## HITS:1 COG:FN1613 KEGG:ns NR:ns ## COG: FN1613 COG2805 # Protein_GI_number: 19704934 # Func_class: N Cell motility; U Intracellular trafficking, secretion, and vesicular transport # Function: Tfp pilus assembly protein, pilus retraction ATPase PilT # Organism: Fusobacterium nucleatum # 1 316 1 316 316 508 96.0 1e-144 MNIEKIFDYARENNISDIHLLEGEKIYFRKDGEIIEYDSNIIVSKDELLEICNGKIEEDF AYTDSKNQRYRVNSFLTRGKLALVIRIINKEPIKLKGKFINKLIDEKILSLKDGLVLVTG ITGSGKSTTLANIIEKFNENKNLKILTIEDPIEYIFENKKSLIIQRELGKDIESFEKALK SSLRQDPDVIILGEIRDEESLYSALKLAETGHLVFSTLHTMNTVESVNRLISMVRSEKKD FIREQLASVLRFILSQELHREKKTVSIFEVLNNTKAVANLILNNKLNQIPTLIESGIENF MITKEKYLKNIETESD >gi|296153520|gb|ADVK01000055.1| GENE 25 22347 - 23846 1892 499 aa, chain - ## HITS:1 COG:FN1614 KEGG:ns NR:ns ## COG: FN1614 COG0606 # Protein_GI_number: 19704935 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Predicted 
ATPase with chaperone activity # Organism: Fusobacterium nucleatum # 1 497 1 497 497 905 98.0 0 MKKKIFTSSYLGLESYLVEVEVDVSRGLPMFSIVGMGDTAILESKFRVKAALKNSNYEIV PQKIVVNLSPAGIKKEGAQFDLPIALGIILEMKLLKDKRDIFKDYLFVGELSLDGEVKGV SGTINSVILAKEKGFKGIVVPYENRNEASLIDGVDIVAVKDISDVINFIENEVRLEFEKI NLVKTEEDILDFSDVKGQYFAKRAMEISAAGGHNILLIGSPGSGKSMLAKRMIGILPEMT ESEIVESTKIYSVAGELSEKNPIISKRPVRIPHHSTTLAAMVGGGKKALPGEISLASNGI LILDEMSEFKHSVLEALRQPLEDGYVSITRAMYRVEFKTNFLLVGTSNPCPCGNLYEGNC KCSATEVERYTKKLSGPILDRIDLVIQMKRLSEEELVNDKKEESSADIRKRVIKAREIQI KRYGEAKTNSRMSQKELKKYCIIKEEDKRFLISALENLQISARVYDKILKIARTIADLAG EKEINRKYLLEAISFKKKM >gi|296153520|gb|ADVK01000055.1| GENE 26 24019 - 24570 730 183 aa, chain + ## HITS:1 COG:FN1615 KEGG:ns NR:ns ## COG: FN1615 COG1971 # Protein_GI_number: 19704936 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 183 1 183 183 258 98.0 6e-69 MSTISVLITALALSMDAMSLSIYQGIASTESQKKQNFLKIVLTFGIFQFAMALVGSLSGI LFIHYISLYSKYVSFAIFLFLGLMMLKEALKKEEMEYDEKYLDFKTLIIMGIATSLDALL VGLTFSILPFYQTFLYTIEIGVVTAIIAGLGFILGDKFGNILGQKSHFLGAALLIFLSIN ILL >gi|296153520|gb|ADVK01000055.1| GENE 27 24649 - 24774 262 41 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|256028477|ref|ZP_05442311.1| ## NR: gi|256028477|ref|ZP_05442311.1| hypothetical protein PrD11_10879 [Fusobacterium sp. D11] hypothetical protein HMPREF0397_1885 [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] hypothetical protein HMPREF0397_1885 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 41 1 41 41 64 100.0 3e-09 MDAYRNYNREEFNRILNGKITRKLIITATILAGLGILNKTL >gi|296153520|gb|ADVK01000055.1| GENE 28 24822 - 25283 710 153 aa, chain - ## HITS:1 COG:FN1616 KEGG:ns NR:ns ## COG: FN1616 COG0781 # Protein_GI_number: 19704937 # Func_class: K Transcription # Function: Transcription termination factor # Organism: Fusobacterium nucleatum # 1 153 1 153 153 239 100.0 2e-63 MNKNFEEQEKKAKGGVRLAREEVFKLVFGVEATESASEELKQAFDIYLQNSEELIGTLNE NQLEFLKSSIDGIAKNYDNIKDIIKKNTQNWAYERIGVVERALLIVATYEFIFKNAPIEV IANEIIELAKEYGNEKSYEFVNGILANIEKSKK >gi|296153520|gb|ADVK01000055.1| GENE 29 25288 - 25512 330 74 aa, chain - ## HITS:1 COG:no KEGG:FN1617 NR:ns ## KEGG: FN1617 # Name: not_defined # Def: prolipoprotein diacylglyceryltransferase # Organism: F.nucleatum # Pathway: not_defined # 1 74 1 74 74 124 100.0 2e-27 MPDNILEVLLEKIINNWRKVYGAILGFIIGLTVINYGILKAIIVFVFAFVGYKLGDSSFT QGIKRIVLKRLKED >gi|296153520|gb|ADVK01000055.1| GENE 30 25512 - 26105 579 197 aa, chain - ## HITS:1 COG:no KEGG:FN1618 NR:ns ## KEGG: FN1618 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 19 197 1 179 179 293 98.0 4e-78 MFKKIIFFFAWVGIFLISLISLNYVLLPGQVFYDNTYTANITTFQYKMVILVLASLYIFI CLYKFFSLFERKKDYERKTENGTLKITRATINNYVTDLLRKDPDITGIKTTSELKGNKFL IYVKCELLAKINIADKIAQLQNLIKRDLGENVGVEVNKVVVNISKLEIREGVRETETFKE TSDIPNDENIEDVEVSD >gi|296153520|gb|ADVK01000055.1| GENE 31 26122 - 26490 758 122 aa, chain - ## HITS:1 COG:FN1619 KEGG:ns NR:ns ## COG: FN1619 COG1302 # Protein_GI_number: 19704940 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 122 1 122 122 175 100.0 2e-44 MSELGNIRIADDVVKTIAAKAAADVEGVYKLAGGVVDEVSKMLGKKRPTNGVKVEVGEVE CSIEVYLIIKYGYKIAEVAEEVQKAILEAVSSLSGLKVVEVNVYVQNVKMEDIEETTEEF ED >gi|296153520|gb|ADVK01000055.1| GENE 32 26792 - 27535 1247 247 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|19704941|ref|NP_602436.1| 30S ribosomal protein S2 [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 247 1 247 247 484 100 1e-136 MSVVSMKQLLEAGVHFGHQAKRWNPKMAKYIFTERNGIHVIDLHKSLKKIEEAYEEMRKI AEDGGKVLFVGTKKQAQEAIKEQAERSGMYYVNSRWLGGMLTNFSTIKKRIERMKELEKL DAEGILDTDYTKKEAAEFRKELSKLSKNLSGIRDMEKVPDAIYVVDVKMEELPVKEAHLL GIPVFAMIDTNVDPDLITYPIPANDDAIRSVKLITSVIANAIVEGNQGIENVEPQSEEVN VEEGSAE >gi|296153520|gb|ADVK01000055.1| GENE 33 27578 - 28471 519 297 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|42631241|ref|ZP_00156779.1| COG0264: Translation elongation factor Ts [Haemophilus influenzae R2866] # 1 297 1 281 283 204 43 2e-51 MATVTAALVKELRERTGAGMLDCKKALESHDGDIEKSIDYLREKGIAKAVKKAGRIAAEG LIFDEATPDHKKAVILEFNSETDFVAKNEEFKEFGRKLVKIALERNVHQLEELNEAQVEG DKKVSEALTDLIAKIGENMSLRRLAVVVAKDGFVQTYSHLGGKLGVIVEMSGEPTEANLE KAKNIAMHVAAMDPKYLSEEEVTASDLEHEKEIARKQLEEEGKPANIIEKILTGKMHKFY EENCLVDQIYVRAENKETVKQYAGDIKVLSFERFKVGDGIEKKEEDFAAEVAAQING >gi|296153520|gb|ADVK01000055.1| GENE 34 28536 - 29255 1170 239 aa, chain + ## HITS:1 COG:FN1622 KEGG:ns NR:ns ## COG: FN1622 COG0528 # Protein_GI_number: 19704943 # Func_class: F Nucleotide transport and metabolism # Function: Uridylate kinase # Organism: Fusobacterium nucleatum # 1 239 1 239 239 452 100.0 1e-127 MESPFYKKILLKLSGEALMGDQEFGISSDVIASYAKQIKEIVDLGVEVSIVIGGGNIFRG LSGAAQGVDRVTGDHMGMLATVINSLALQNSIEKLGVPTRVQTAIEMPKVAEPFIKRRAQ RHLEKGRVVIFGAGTGNPYFTTDTAAALRAIEMETDVVIKATKVDGIYDKDPVKYPDAKK YQTVTYNEVLAKDLKVMDATAISLCRENKLPIIVFNSLDEGNLKKVVMGEHIGTTVVAD >gi|296153520|gb|ADVK01000055.1| GENE 35 29290 - 29862 1013 190 aa, chain + ## HITS:1 COG:FN1623 KEGG:ns NR:ns ## COG: FN1623 COG0233 # Protein_GI_number: 19704944 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Ribosome recycling factor # Organism: Fusobacterium nucleatum # 1 190 1 190 190 298 100.0 4e-81 MSIASDKLVKECEEKMVKTIEAVKEKFTAIRAGRANVSMLDGIKVENYGSEVPLNQIGTV SAPEARLLVIDPWDKTLISKIEKAILAANIGMTPNNDGRVIRLVLPELTADRRKEYVKLA KNEAENGKIAIRNIRKDINNHLKKLEKDKENPISEDELKKEETNVQTLTDKYVKEIDDLL AKKEKEITTI >gi|296153520|gb|ADVK01000055.1| GENE 36 29926 - 31206 863 426 
aa, chain - ## PROTEIN SUPPORTED ## NR: gi|163796899|ref|ZP_02190856.1| 30S ribosomal protein S11 [alpha proteobacterium BAL199] # 16 426 19 437 447 337 40 3e-91 MTLMEKFNSRLSSIVKIPELRERIIFTLLMFLVARVGTLIPAPGVDVDRLSSMASQSDVL SYINMFSGGAFTRISIFSLGIIPYINASIVVSLLVSIIPQLEEIQKEGESGRNRITQWTR YLTIALAIIQGTGVCLWLQSVGLIYNPGISFFVRTITTLTAGTVFLMWVGEQISIKGIGN GVSLIIFLNVISRAPSSVIQTIQTMQGNKFLIPLLVLVAFLGTVTIAGIVLFQLGQRKIP IHYVGKGFSSKGGIGEKSFIPLRLNTAGVMPVIFASVFMLIPGVIVNALPSTLSIKTTLS IIFGQNHPVYMILYALVIMFFSFFYTALVFDPEKVAENLKQGGGTIPGIRPGEETVEYLE GVASRITWGGGIFLAIISILPYVIFTSMGLPVYFGGTGIIIVVGVALDTIQQIDAHLVMR DYKGFI >gi|296153520|gb|ADVK01000055.1| GENE 37 31231 - 31710 812 159 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704946|ref|NP_602441.1| 50S ribosomal protein L15P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 159 1 159 159 317 99 2e-85 MKLNELSPSVPKKNRKRIGRGNSSGWGKTAGKGSNGQNSRAGGGVKPYFEGGQMPIYRRV PKRGFSNAIFKKEYTVISLAFLNENFEDGEEVSLETLFNKCLIKKGRDGVKVLGNGELNK KLTVKVHKISKSAKAAVEAKGGTVELVEVKGFERAETNK >gi|296153520|gb|ADVK01000055.1| GENE 38 31710 - 31895 300 61 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704947|ref|NP_602442.1| 50S ribosomal protein L30P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 61 1 61 61 120 100 6e-26 MARLRIELVKSIIGRKPNHIATVKSLGLKKMHDVVEHNETPELKGKLAQVSYLLKVEEVQ A >gi|296153520|gb|ADVK01000055.1| GENE 39 31908 - 32402 807 164 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704948|ref|NP_602443.1| SSU ribosomal protein S5P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 164 1 164 164 315 99 9e-85 MLNREDNQYQEKLLKISRVSKTTKGGRTISFSVLAAVGDGEGKIGLGLGKANGVPDAIRK AIAAAKKNIVKISLKNNTIPHEIAGKWGATTLWMAPAYEGTGVIAGSASREILELVGVHD ILTKIKGSRNKHNVARATVEALKLLRTAEQIAALRGLEVKDILS >gi|296153520|gb|ADVK01000055.1| GENE 40 32426 - 32794 591 122 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704949|ref|NP_602444.1| 50S ribosomal protein L18P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 122 1 122 122 232 99 1e-59 MFKKVDRKASRQKKQMSIRNKISGTPERPRLSVFRSNTNIFAQLIDDVNGVTLVSASTID KALKGSIANGGNIEAAKAVGKAIAERAKEKGIDAIVFDRSGYKYTGRVAALADAAREAGL SF >gi|296153520|gb|ADVK01000055.1| GENE 41 32821 - 33354 915 177 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704950|ref|NP_602445.1| 50S ribosomal protein L6 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 177 1 177 177 357 100 3e-97 MSRVGKKPIAVPSGVDFSVKDNVVTVKGPKGTLTKEFNNNITIKLEDGHITVERPNDEPF MRAIHGTTRALINNMVKGVHEGYRKTLTLVGVGYRAATKGKGLEISLGYSHPVIIDEIPG ITFSVEKNTTIHIDGVEKELVGQVAANIRAKRPPEPYKGKGVKYADEHIRRKEGKKS >gi|296153520|gb|ADVK01000055.1| GENE 42 33379 - 33777 662 132 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704951|ref|NP_602446.1| SSU ribosomal protein S8P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 132 1 132 132 259 100 6e-68 MYLTDPIADMLTRVRNANAVMHEKVDIPHSKMKERIAEILKEQGYISNFKIVTDEGNKKN IRVYLKYAGKERVIKGLKRISKPGRRVYSSVDDMPRVLSGLGIAIVSTSKGIVTDKVARA EKVGGEVLAFVW >gi|296153520|gb|ADVK01000055.1| GENE 43 33806 - 34093 479 95 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|197736518|ref|YP_002165296.1| ribosomal protein S14 [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] # 1 95 1 95 95 189 100 1e-46 MAKKSMIARDVKRAKLVDKYAEKRAELKKRIAAGDMEAMFELNKLPKDSSAVRKRNRCQL DGRPRGYMREFGISRVKFRQLAGAGLIPGVKKSSW >gi|296153520|gb|ADVK01000055.1| GENE 44 34114 - 34665 922 183 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704953|ref|NP_602448.1| 50S ribosomal protein L5 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 183 1 183 183 359 99 4e-98 MDKYTSRYHKFYDEVVVPKLMKELEIKNIMECPKLEKIIVNMGVGEATQNSKLIDAAMAD LTIITGQKPLLRKAKKSEAGFKLREGMPIGAKVTLRKERMYDFLDRLVNVVLPRVRDFEG VPSNSFDGRGNYSVGLRDQLVFPEIDFDKVEKLLGMSITMVSSAKTDEEGRALLKAFGMP FKK >gi|296153520|gb|ADVK01000055.1| GENE 45 34684 - 35025 567 113 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|197736520|ref|YP_002165298.1| ribosomal protein L24 [Fusobacterium nucleatum subsp. 
polymorphum ATCC 10953] # 1 113 1 113 113 223 100 6e-57 MAKPKIKFVPDSLHVKTGDIVYVISGKDKKKTGKVLKVFPKKGKIIVEGINIVTKHLKPS QVNPQGGVVQKEAAIFSSKVMLFDEKTKSPTRVGYEVRDGKKVRISKKSGEII >gi|296153520|gb|ADVK01000055.1| GENE 46 35050 - 35418 600 122 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704956|ref|NP_602451.1| 50S ribosomal protein L14P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 122 1 122 122 235 100 9e-61 MVQQQTILNVADNSGAKKLMVIRVLGGSRKRFGKIGDIVVASVKEAIPGGNVKKGDIVKA VIVRTRKETRRDDGSYIKFDDNAGVVINNNNEPRATRIFGPVARELRARNFMKILSLAIE VI >gi|296153520|gb|ADVK01000055.1| GENE 47 35447 - 35698 411 83 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704957|ref|NP_602452.1| SSU ribosomal protein S17P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 83 1 83 83 162 98 8e-39 MRNERKVREGIVVSDKMQKTIVVAIETMILHPIYKKRVKRTTKFKAHDEENVAQVGDKVK IMETRRLSKDKNWRLVEIIEKAR >gi|296153520|gb|ADVK01000055.1| GENE 48 35734 - 35916 291 60 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|34764030|ref|ZP_00144916.1| LSU ribosomal protein L29P [Fusobacterium nucleatum subsp. vincentii ATCC 49256] # 1 60 1 60 60 116 100 6e-25 MRAKEIREMTSEDLVVKCKELKEELFNLKFQLSLGQLTNTAKIREVRREIARINTILNER >gi|296153520|gb|ADVK01000055.1| GENE 49 35916 - 36347 746 143 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704959|ref|NP_602454.1| 50S ribosomal protein L16 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 143 1 143 143 291 100 1e-77 MLMPKRTKHRKMFRGRMKGAAHKGNFVAFGDYGLQALEPSWITNRQIESCRVAINRTFKR EGKTYIRIFPDKPITARPAGVRMGKGKGNVEGWVSVVRPGRILFEVSGVTEEKAAAALRK AAMKLPIRCKVVKREEKENGGEN >gi|296153520|gb|ADVK01000055.1| GENE 50 36350 - 37009 1111 219 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704960|ref|NP_602455.1| SSU ribosomal protein S3P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 219 1 219 219 432 99 1e-120 MGQKVDPRGLRLGITRAWDSNWYADKKEYVKYFHEDVQIKEFIKKNYFHTGISKVRIERT SPSQVVVHIHTGKAGLIIGRKGAEIDALRAKLEKLTGKKVTVKVQEIKDLNGDAVLVAES IAAQIEKRIAYKKAMTQAISRSMKSPEVKGIKVMISGRLNGAEIARSEWAVEGKVPLHTL RADIDYAVATAHTTYGALGIKVWIFHGEVLPSKKEGGEA >gi|296153520|gb|ADVK01000055.1| GENE 51 37028 - 37363 530 111 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704961|ref|NP_602456.1| 50S ribosomal protein L22P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 111 1 111 111 208 99 1e-52 MEAKAITRFVRLSPRKARLVADLVRGKSALEALDILEFTNKKAARVIKKTLSSAIANATN NFKMDEDKLVVSTIMVNQGPVLKRVMPRAMGRADIIRKPTAHITVAVSDEQ >gi|296153520|gb|ADVK01000055.1| GENE 52 37398 - 37673 492 91 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704962|ref|NP_602457.1| SSU ribosomal protein S19P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 91 1 91 91 194 100 3e-48 MARSLKKGPFCDHHLMAKVEEAVASGNNKAVIKTWSRRSTIFPNFIGLTFGVYNGKKHIP VHVTEQMVGHKLGEFAPTRTYHGHGVDKKKK >gi|296153520|gb|ADVK01000055.1| GENE 53 37698 - 38528 1448 276 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704963|ref|NP_602458.1| 50S ribosomal protein L2 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 276 1 276 276 562 100 1e-159 MAIRKMKPITNGTRHMSRLVNDELDKVRPEKSLTVPLKSAYGRDNYGHRTCRDRQKGHKR LYRIIDFKRNKLDVPARVATIEYDPNRSANIALLFYVDGEKRYILAPKGLKKGDIVSAGS KAEIKPGNALKLKDMPVGVQIHNIELQRGKGGQLVRSAGTAARLVAKEGTYCHVELPSGE LRLIHGECMATVGEVGNSEHNLVNIGKAGRARHMGKRPHVRGAVMNPVDHPHGGGEGKNS VGRKSPLTPWGKPALGIKTRGRKTSDKFIVRRRNEK >gi|296153520|gb|ADVK01000055.1| GENE 54 38585 - 38872 471 95 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704964|ref|NP_602459.1| 50S ribosomal protein L23P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 95 1 95 95 186 97 8e-46 MNVYDIIKKPVVTEKTELLRKEYNKYTFEVHPKANKIEIKKAIEKIFNVKVEDVATINKK PITKRHGMRLYKTQAKKKAIVKLAKENTITYSKEV >gi|296153520|gb|ADVK01000055.1| GENE 55 38872 - 39501 1045 209 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704965|ref|NP_602460.1| 50S ribosomal protein L4 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 209 1 209 209 407 100 1e-112 MAVLNIYNLAGEQTGTVEVNDAVFGIEPNKVVLHEVLTAELAAARQGTASTKTRAMVRGG GRKPFKQKGTGRARQGTIRAPHMVGGGVTFGPHPRSYEKKVNKKVRNLALRSALSAKVAA GNVLVLDYDGIETPKTKVIVNLVNKVDAKQKQLFVVGDLIKDYNLYLSARNLENAVILQP NEIGVYWLLKQEKVILTKEALATVEEVLG >gi|296153520|gb|ADVK01000055.1| GENE 56 39521 - 40156 1075 211 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704966|ref|NP_602461.1| 50S ribosomal protein L3P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 211 1 211 211 418 99 1e-116 MSGILGKKIGMTQIFEDGKFVPVTVVEAGPNFVLQKKTEEKDGYVALQLGFDEKKEKNTT KPLMGIFNKAGVKPQRFVKELEVESVDGYELGQEIKVDVLTEVGYVDITGTSKGKGTSGV MKKHGFAGNRASHGVSRNHRLGGSIGMSSWPGKVLKGKKMAGQHGNATVTVQNLKVVKVD AEHNLLLIKGAVPGAKNSYLVIKPAVKKVIG >gi|296153520|gb|ADVK01000055.1| GENE 57 40304 - 40615 508 103 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704967|ref|NP_602462.1| SSU ribosomal protein S10P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 103 1 103 103 200 100 4e-50 MASNKLRIYLKAYDHTLLDESAKRIAESAKKSGAIVAGPMPLPTKIRKYTVLRSVHVNKD SREQFEMRVHRRMIELVNSTDKAISSLTSVHLPAGVGIEIKQV >gi|296153520|gb|ADVK01000055.1| GENE 58 40861 - 41307 525 148 aa, chain + ## HITS:1 COG:no KEGG:FN1647 NR:ns ## KEGG: FN1647 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 148 1 139 139 244 100.0 6e-64 MKKIFLSILMLFAFVACSSTQFVHDAKPITKDEKTVLIQYFPTEFEIDLEKTLENNFWKV SVVSNKDTSSPSLKSNFVITCESLYADYLGTYQGIIKFSDLRTGKRIAVYKFKVSTKSAI IENIIKTMDSIPGASSPASSITVTKPVK >gi|296153520|gb|ADVK01000055.1| GENE 59 41402 - 42172 213 256 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 220 1 221 223 86 28 7e-16 MEEVLSIKNLNKTFESGFKLKDINFDIKEGEVVSLIGESGSGKTSISKIIVALLKAKGQI LFKGMDILENPKKINGKIQMIFQSPYSSLNPKYKIKDIILEGVIYQKVLKEEENIDEYLL NILNEVGLDKEVLNKYPHELSGGQRQRVGIARAVAVKPDLIIADEILTALDALTQIQILE LFQKLKENKKISYLFISHDINVVKKISDRLLIIKDGEIIESGSKEKIFSKPEKEYTKKLI EISGINLLINKNNEIG >gi|296153520|gb|ADVK01000055.1| GENE 60 42182 - 42970 214 262 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|157164682|ref|YP_001467345.1| 50S ribosomal protein L25 (general stress protein Ctc) [Campylobacter concisus 13826] # 1 236 1 221 223 87 27 5e-16 MKNILELKDFSVSLKNNNCKILNNINIEIKEKEFLGIVGESGSGKTTLLNSIISFLDEKK FILDGKIILFENIEIYKMTEEQRKEICHKNISMILQDSINSLNPYEKIKKQLLETYLFHS KEKVSNDFAIKEIEKLLLDVGFEDTNRILNSYPNELSGGMRQRIAIVLVLCTDIKIFLAD EPTTSLDVVNQFRFIELLKKISKEKGLTLIYVSHDIKVLSKVCERIIVLKDGNIVEENST AQILKEPKNDYTKLLIKAATAD >gi|296153520|gb|ADVK01000055.1| GENE 61 42973 - 43734 891 253 aa, chain - ## HITS:1 COG:FN1650 KEGG:ns NR:ns ## COG: FN1650 COG1173 # Protein_GI_number: 19704971 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 16 253 1 238 238 
408 99.0 1e-114 MTKKLIFINIFLVIILLIFSSKLNADINLDSVFLGFSKENFFGTDDLGRDVFSLIIIGGF RTLEVVVIATSLSFFVGNFLGMIAGYFEGNIGTIIKSTVDLMMVVPTLIVALIITSIFGI TPVTAGISLGIFGIGNYMNQSEALTKAEKNKDYILASKLLGVPWYVVLFRRIFVNILARL LVNLGNTASGVILQYSALTFIGLGSDYTKPDWGAMLYQYRIYLVRKPSLIIIPTLCILWV SLSFNLIFDKREN >gi|296153520|gb|ADVK01000055.1| GENE 62 43731 - 44642 362 303 aa, chain - ## HITS:1 COG:FN1651 KEGG:ns NR:ns ## COG: FN1651 COG0601 # Protein_GI_number: 19704972 # Func_class: E Amino acid transport and metabolism; P Inorganic ion transport and metabolism # Function: ABC-type dipeptide/oligopeptide/nickel transport systems, permease components # Organism: Fusobacterium nucleatum # 1 303 5 307 307 462 97.0 1e-130 MKFILKWLGIMFILSVITFLIVRFIPVSPVDMLLQHYNLPLTEENRKLLTSYYKLDRSLF KQYIVWIKDFLKGNWGISFITKLPVKEEMLRRLPYSLIIGLGSLFLSIILSFFLGYLAAI KEKGFFDKMTRTISILTLSIPSFIIAIFIIYYFGVKTQLIKFFIGGKFYGILFSIIILVL YQVGNLSRIVRDTFVEMKEETFVKFYLIRGFNINYVLLRHCYKPALYSLFSASISKFSSV VGGSAVVEFSFAIPGISYFLISSIVNRDYNVIQAYIFLICIYMFFVHLIFDFLLSFLREK GNK >gi|296153520|gb|ADVK01000055.1| GENE 63 44902 - 46401 2070 499 aa, chain - ## HITS:1 COG:FN1652 KEGG:ns NR:ns ## COG: FN1652 COG0747 # Protein_GI_number: 19704973 # Func_class: E Amino acid transport and metabolism # Function: ABC-type dipeptide transport system, periplasmic component # Organism: Fusobacterium nucleatum # 1 499 1 499 499 940 96.0 0 MRKIIKFLLCSLLLVIFFVACGEKKEEKVVTEDKPIVIGQTFVVGAIEPTVGGTPWSLTT HGLSETVFSVDRDGNLVSRYVEDVERTDKLNWVLKLKKGVKFSDGTEVNAEALAWAMNTV MEENPLSNATAGKVKFEKVDDYTVNVTVEREIQNLKSLLTEWTNIIFKKTDNGYIFTGPY VIKNLEPEVSLTLEPNQYYENSEKRGEVIIKAISDMASMKLAYESGELDMAFGITPEIAG ELKDEGKIVETIDAGYQYFGVLNTEAGIMSDKSVREAINLGLDREDYIKALKGGRVANGL FAQYFSFAGDVKLEYNLEKANSILEEDGWKLNKDGLREKDGKILSVNILTYNSRPDLKII MQVMLSQLKKMGIEAKTSIVDNIDVEAKKKEFDVILYAQHTAPTGEPTYFLNQFFRTNGS KNMMSYSSKEVDELLDKMGTLPFGDELIKTAKQIQEVIYKDLPVLYLVDPEWNVALSERL KDYKPYCGDYYIVNSELYK >gi|296153520|gb|ADVK01000055.1| GENE 64 46513 - 47850 1100 445 aa, chain - ## HITS:1 COG:FN1653 KEGG:ns NR:ns ## COG: FN1653 COG0534 # 
Protein_GI_number: 19704974 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 445 1 445 445 686 91.0 0 MDVSLKNNNLTKGKIWKVMLKFVLPIFLGTLFQSLYNTIDAIIVGKFAGKEAVAAIESVL NFQRLPITFFVGLSSGATIIISQYFGANKKEDVSKASHTAILFAMLGGLLLSILSCVFSP FFIKLIKVPEEIFYQAQIYTIICFIGIVASMTYNIGSGILRALGDSKTPFYILIVSNILN IVLDLILVIVFNLGVIGVGMATLISEIVSAILIFIMLIKTNLDCKIYINKLYFYKKYIKE IFRLGLPIGIQSVLYPISNTIIQSSINTFGVNSIAAWAISGKLDFLIWTVSDAFSIAVST FVAQNYGAKKHQRARDGIKVALSMSMVAIFIISFTLYFYNKPLAYFLIDDKEVVDLTSEV IHLIAPLYFIYVIGDVLSGSIRGTGNTLHPMVINIFGICICRILWVFLIVPLNPTFFMVL YGFIVSWIITALMYIVYVIYKRKSF >gi|296153520|gb|ADVK01000055.1| GENE 65 48024 - 49658 2193 544 aa, chain - ## HITS:1 COG:no KEGG:FN1654 NR:ns ## KEGG: FN1654 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 544 28 571 571 938 95.0 0 MKKGIGVGIEDFRKIIREGCYYFDKTNYIEELLKDKTVIKLFTRPRRFGKTLNMSTLKYF FDIKNAEENRKLFKDLYIEKSEYFKEQGQYPVIFITLKDFKKNTWEEMNFEIKELLRNLY DEFNFIRDTLSISDLREFDKIWLKEEDANYDSSLLNLTKYLYNYYKKEVVLLIDEYDSPL ITANQRGYYKDSINFFRNFLSLALKTNSYLKMGVLTGIVQVAKEGIFSGLNNVITYNILG NDFETFFGLSEEEVENSLKYFELEYEIEEIKKWYDGYKFGNSEVYNPWSIINYLRTKELQ AYWVNTSDNALIYDNLKNSTVDVFNNLQTLFEGKEIKKEISPFFTFEELSKFDGIWQLMV YNGYLKISEKISNDEYMIKIPNYEIQTFFKKGFIDKFLVSGNYFNPMMDALLDGDIEEFE RRLQNIFLVNTSFYDLKGEKVYHSLFLGMLIWLRDKYEVKSNGERGHGRYDAMLIPLDKV KPAYIFEFKVSKTIKGLNAKAEEALEQIKEKQYDVGLKDLGITKIYRIGIAFKGKNVKVK YEIV >gi|296153520|gb|ADVK01000055.1| GENE 66 49800 - 51329 2010 509 aa, chain + ## HITS:1 COG:FN1655 KEGG:ns NR:ns ## COG: FN1655 COG2461 # Protein_GI_number: 19704976 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 4 509 1 512 512 898 94.0 0 METMSSHLPKLDEEKLKFVIELKEKYNAGKISLADARKQLKERVKTLKPYEIAYAEQKLT PFVEDECIKENIQNMMLLFDEVMDTSRPTELPPDHPIMCYFRENDDMRELLKEVENLIQF PVIKNQWYELYDKLDLWWKLHLPRKQNQLYSLLEKKGFTRPTTTMWVLDDFVRDELKENR KMLDDGNEEEFIASQTSVAADIIDLIQKEETVLYPTSLAMITPEEFEDMKSGDREIGFTF 
GKLETTSEPKKPIIQENSNISEQGNLAKDLAQLLGKYGFNSENSQSSELDVAMGKMTLEQ INLVFKHLPVDITYVDENEIVKFYSDTAHRIFPRSKNVIGRDVKNCHPRKSVHIVEEIIE KFRSDEQDFAEFWINKPGLFIYISYSAVKDENGKFRGILEMMQDCTKIRSLEGSQTLLNW ESTNSTNKTTEEKVEEIKNIDGDTYLKDLIKVYPKLKDDMIKISDNFKLLQTPLSAVILP TVTLKKASERGGVELNTLIEKIKEIIKTY >gi|296153520|gb|ADVK01000055.1| GENE 67 51584 - 52360 1257 258 aa, chain - ## HITS:1 COG:RSc0153 KEGG:ns NR:ns ## COG: RSc0153 COG0501 # Protein_GI_number: 17544872 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Zn-dependent protease with chaperone function # Organism: Ralstonia solanacearum # 14 256 38 274 314 125 34.0 1e-28 MLFVSLIFISCATAPLTGRKQLKFVSDESIVQSSVAQYNQMIAQLKANNLLANNTAQGKR VAQIGRKVTGAVEQYLRENGMADKLQYLNWEFNLINTKDINAFALPGGKIAFYSGILPVL QTDGAIAFVMGHEIGHVIGGHHAEGASGQSLAGFLMLGKKAIDGMVGGAVISDELAQQGL SLGLLKFNRTQEYEADKYGMIFMAMAGYNPEEAIKAQERMMKLSGSQNVEILSTHPSSQN RIEELKRFLPEAMKYYKK >gi|296153520|gb|ADVK01000055.1| GENE 68 52565 - 52783 352 72 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704977|ref|NP_602472.1| SSU ribosomal protein S18P [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] # 1 72 1 72 72 140 100 5e-32 MAEFRRRRAKLRVKAEEIDYKNVELLKRFVSDKGKINPSRLTGANAKLQRKIAKAVKRAR NIALIPYTRTEK >gi|296153520|gb|ADVK01000055.1| GENE 69 52828 - 53145 525 105 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|19704978|ref|NP_602473.1| SSU ribosomal protein S6P [Fusobacterium nucleatum subsp. 
nucleatum ATCC 25586] # 1 105 1 105 105 206 99 5e-52 MGKNQREEVNAMRKYEIMYIINPTVLEEGREELVNQVNALLTSNGATIAKTEKWGERKLA YPIDKKKSGFYVLTTFEIDGTKLAEVESKLNIMESVMRYIVVKQD >gi|296153520|gb|ADVK01000055.1| GENE 70 53211 - 54914 2460 567 aa, chain - ## HITS:1 COG:FN1658 KEGG:ns NR:ns ## COG: FN1658 COG0442 # Protein_GI_number: 19704979 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Prolyl-tRNA synthetase # Organism: Fusobacterium nucleatum # 1 567 1 567 567 1114 99.0 0 MRFSKAYIKTLKETPKEAEIASHKLMLRAGMIKKLASGIYAYLPLGYRTIRKIENIIREE MDRAGALELLMPVVQPAELWQESGRWDVMGPEMLRLKDRHERDFVLSPTQEEMITAIIRS DISSYKSLPINLYHIQTKFRDERRPRFGLMRGREFTMKDAYSFHTSQESLDEEFLNMRDT YTRIFTRCGLKFRPVDADSGNIGGSGSQEFQVLAESGEDEIIYSDGSEYAANIEKAVSEL INPPKEELKEVELVHTPDCPTIESLAKYLDIPLERTVKALTYKDMGTDEIYMVLIRGDFE VNEVKLKNILNAVEVEMATDEEIEKIGLKKGYIGPYKLPAKIKIVADLSVPEVSNHIVGS HQKDYHYKNVNYDRDYTADIVTDIRKVRVGDNCITGGKLHSARGIECGQIFKLGDKYSKA MNATYLDEKGKTQFMLMGCYGIGVTRTMAASIEQNNDENGIIWPVSIAPYIVDVIPANIK NEVQVSLAEKIYNELQEEKIDVMLDDRDEKPGFKFKDADLIGFPFKVVVGKRADEGIVEL KIRRTGETLEVSQNEVIAKIKELMRIY >gi|296153520|gb|ADVK01000055.1| GENE 71 54984 - 57053 2263 689 aa, chain - ## HITS:1 COG:FN1660 KEGG:ns NR:ns ## COG: FN1660 COG1200 # Protein_GI_number: 19704981 # Func_class: L Replication, recombination and repair; K Transcription # Function: RecG-like helicase # Organism: Fusobacterium nucleatum # 1 689 1 689 689 1237 99.0 0 MIESYRNIYSKLEDIPTKYITAKQLSNLKSLGINTVYDLIYYFPRAYDDRTNIKKIGELK FNEYVVLKATVMSAVNLTVRSGKKIVKAMVTDGTGIMEILWFGMPYIKKSLKIGEEYLFI GQTKKSAVFQLINPEYKLFSGQQKVSENEILPIYSSNKNITQNSLRKLVEKFLVNFLNYF EENIPKKLIKEYKIMERKSAIKNIHYPVSMKEIEEAKRRFAIEELLILELGILKNRFIIE NSNSKNYEVEGKKEKVREFLSQLTFNLTNAQKKVIKEIYDEISNGKIVNRLIQGDVGSGK TVVAMVMLIYMAENGYQGALMAPTEILANQHYLGIKERLEQIGLRVELLTSSIKGKKKNE ILDGIANGDIDIVIGTHSLIEDDVIFKKLGLIVIDEQHRFGVNQRNKLREKGFLGNLLVM SATPIPRSLALSIYGDLDLSIIDELPPGRTPIKTKWIANDEDLEKMYNFIYKKVNDGNQA YFVAPLIETSDKMALKSVDKVSEEIERKFSNKKIGIIHGKMKAKEKDEVMLKFKNKEYDI 
LIATTVIEVGIDVPASTIMTIYNAERFGLSALHQLRGRVGRGSKQSYCFLISNSTTENSK QRLSIMEETEDGFRIAEEDLKLRNSGEIFGLRQSGFSDLKFIDIIYDVKTIKLVRDECIK YLKEHKGEIENIYLKYDIEQKFSDIQAGN >gi|296153520|gb|ADVK01000055.1| GENE 72 57105 - 57854 1430 249 aa, chain - ## HITS:1 COG:FN1661 KEGG:ns NR:ns ## COG: FN1661 COG0217 # Protein_GI_number: 19704982 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 249 1 249 249 455 100.0 1e-128 MSGHSKWNNIQHRKGAQDKKRAKLFTKFGRELTIAAKEGGGDPNFNPRLRLAIEKAKAGN MPKDILERAIKKGSGELEGVDFTEMRYEGYGPAGTAFIVEAVTDNKNRTASEMRMTFTRK DGNLGADGAVSWMFKKKGVITVKAEGIDADEFMMAALEAGAEDVTEDDGYFEVTTEYTEF QTVLENLKNAGYQYEEAEISMIPENTVEITDLETAKKVMALYDALEDLDDSQNVYSNFDI PDEILEQLD >gi|296153520|gb|ADVK01000055.1| GENE 73 58034 - 59488 1416 484 aa, chain - ## HITS:1 COG:FN1662 KEGG:ns NR:ns ## COG: FN1662 COG2865 # Protein_GI_number: 19704983 # Func_class: K Transcription # Function: Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen # Organism: Fusobacterium nucleatum # 1 484 1 484 484 902 99.0 0 MSLNNNEIGNIIEYLKQNNSETTWIEVKENKIEVDKLGETISAIANSCLLDDKDYGYIII GIKDKTWEILGTSNRLSEYHLKGGQEVKLYISTMLLPRINFEFIDDYIIDNKKISIIKIP SATHTPISYKNEIYLRIGSNNKLEKEYPEQLRILWNKTSGFNYEETIVREKLTVEEVLNF LDYRTYFKMKGWDENKTSQQIIDEMELDGFIVKEENYKFYKVKALGALLLAHNLKDFNLE RKAVRIVIYNGITKSSIREQLEGKKGYAIGFENIIKVLKKYLPQEQRPINGRMEVFEFYP QAIIRELIANAIIHQDLSMTGSEIKIEVYDNRVEITNPGKSVIDLWRFYDCNKSRNEKLA YYMRQLKLCEELGSGIDRIVEISELNNLFTPKFIITNEYVNAKLFREKKFEQMVEEDKIN ILYYHCCFRYSIEDYMTNSSLRNRFLLDNSKPSTDRISNLIKKAKNMKLIKVGPNKSYIP FWAE >gi|296153520|gb|ADVK01000055.1| GENE 74 59506 - 60912 1359 468 aa, chain - ## HITS:1 COG:no KEGG:FN1663 NR:ns ## KEGG: FN1663 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 13 468 1 456 456 748 100.0 0 MKKKELEYFINNMLINKEDVLLSVRDYIEYCKETKEENWSEKKREIIIKILFNFYNTIKD FDFPVTNSKNWYYEYFWNRDGISLELMYCDELTLDDKGEIDSTSSSNSIIIAEEKCLYLS 
VEEYAKVYDVKPTTVRQWIRRGKIRNAKKIGRDWLISELADKPQKGYTDVSYFINYLSNE ILEKYPYLEKYERLSISKSNLENDKYEILLSSKKEKYPYERMYLNTIEREKLELMLISEN EVYVDEPFFIMYIPEKRNKYCIKGGDIMLENKIETYEKSIKKILKNDLKIECDNYLENED DFLIWNSNIYLKKRIFDDKGDYIDKKLLEIIGAKIIPANMDFNDETSFYSPLDYCDSVSG DMYFSYKAIGDDEGIKEEIVKELEMEEEEAYETSVLYVENVEVKESENLNTFLQAFDIVR KGLPVQYCKLAIFLLEWQKESKKVKVFLENGWKIRNIDSSSVVMYKKI >gi|296153520|gb|ADVK01000055.1| GENE 75 60878 - 60979 81 33 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MTFYCITYILLVYRYNTKKEDIYEKKRTRVFYK >gi|296153520|gb|ADVK01000055.1| GENE 76 61307 - 61648 281 113 aa, chain + ## HITS:1 COG:no KEGG:FN1664 NR:ns ## KEGG: FN1664 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 85 1 85 111 144 100.0 8e-34 MEIRYKNKRIRDICENEKKAIKKYNKIIAEKLIFSIEFLKNSKSLKDVADYNNFRLHELK YERKGQFAIDLGKTTGYRLIIEPVTAIVIEEPDLGKFILIQTNYRTCYCQQRK >gi|296153520|gb|ADVK01000055.1| GENE 77 61705 - 62076 404 123 aa, chain + ## HITS:1 COG:FN1665 KEGG:ns NR:ns ## COG: FN1665 COG3093 # Protein_GI_number: 19704986 # Func_class: R General function prediction only # Function: Plasmid maintenance system antidote protein # Organism: Fusobacterium nucleatum # 25 123 1 99 99 159 100.0 1e-39 MNKLVFKSKDNEMIFHPGYLIKNIMDEEGKDIKGMVQLLGLTEKEITALINAEISITDDM IDRIVKNYGTSKELWKNFQNKYDLKMKELEENPMIFNFERENEISSDIANNILNSVSERL IIA >gi|296153520|gb|ADVK01000055.1| GENE 78 62086 - 62439 180 117 aa, chain + ## HITS:1 COG:no KEGG:FN1666 NR:ns ## KEGG: FN1666 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 117 1 117 117 175 99.0 4e-43 MEIKKYILKKFDYDVNVSNKKFYTPNETIRQKLGIDVKFLEDRKNMELAFKIDMIDNDNI NILKLKVEYILTLNNEALDINKSFIKKILSKFYPIFSKLVLNFYNSIGLNSIQLPEF >gi|296153520|gb|ADVK01000055.1| GENE 79 62485 - 63687 1598 400 aa, chain - ## HITS:1 COG:FN1667 KEGG:ns NR:ns ## COG: FN1667 COG1088 # Protein_GI_number: 19704988 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-D-glucose 4,6-dehydratase # Organism: Fusobacterium nucleatum # 3 
400 2 399 399 746 98.0 0 MKKTYLVTGAAGFIGANFLKYILKKHKDIKVIVVDSLTYAGNLGTIKEELKDSRVKFEKV DIRDRKEIERVFSENKVDYVVNFAAESHVDRSIENPQIFLETNILGTQNLLDNAKKAWTV SKDENGYPIYREDIKYLQVSTDEVYGSLSKDYDEPIELVIDDEDVKKVVKNRKNLKTYGN NFFTEESPVDPRSPYSASKTGADHIVIAYGETYKLPINITRCSNNYGPYHFPEKLIPLMI KNILEGKKLPVYGKGGNVRDWLYVEDHCKGIDLVLREAKSGEIYNIGGFNEEKNINIVKL VIDVLKEEITNNDEYKKVLKTDISNINYDLITYVQDRLGHDMRYAINPSRIAKDLGWYPE TDFETGIRKTVKWYLENQDWVDEVVSGDYQKYYERMYGNR >gi|296153520|gb|ADVK01000055.1| GENE 80 63790 - 65187 1197 465 aa, chain - ## HITS:1 COG:no KEGG:AZL_019960 NR:ns ## KEGG: AZL_019960 # Name: not_defined # Def: alginate O-acetyltransferase # Organism: Azospirillum_B510 # Pathway: not_defined # 51 221 73 253 412 70 24.0 2e-10 MKQLKKIFIVFFMILLFLPLIFFNWEDDYVSLIDNRVLKKFPNKENLAGGDVTDYIQSYI NDRIGGRERIINLYTELNDKVFNLLVHPTYTYGKDGYIFFKMKRNIEYEEYHRKFAETIK KIQTYCEEREVPFYFVFSPEKKYVYSEYLPKGVNYNREWVDRFIEDLKELGVNFIDNSDF LKEKAKEEFVFNKKYDAGHWNDLGAFLGMNNVYEKIHQKNPNLLILSENYYDKLSKVEES LPVSYFKINEEVPNWELKVNYENITEQYINEVKRNNLFGYFGYYKNLSQEAENSLKLLIF HGSYLNSRWKFVIPTVKEYIGIHNYQNILDIPYYFNIFKPDLVIFEVAEYTITDHYFSAK KMSQLDFPPFLDEYKKGQAKMIPDIDILNYDIKFNIEEGNKITKLTLSGLPGNAKYVYLS IKGETYDFMKIDDSIYELSLLKEVLEENQKYEIFYMTDKGEIYRF >gi|296153520|gb|ADVK01000055.1| GENE 81 65206 - 66585 986 459 aa, chain - ## HITS:1 COG:CAC1564 KEGG:ns NR:ns ## COG: CAC1564 COG1696 # Protein_GI_number: 15894842 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted membrane protein involved in D-alanine export # Organism: Clostridium acetobutylicum # 1 459 1 473 473 248 36.0 1e-65 MLFTTNIFLFLFFLFCILGYFTLDYFNRIKLNNLYLVLASLFFYAWAGIDVALYFIIFII YVYLASHLLENSENDQQRKIRLISVLISLVGLLVYIKYFNFFILNFNFIFKLELTQKNII VPLGISFITFEAISYILDVYWKKAKATTLLDIALFLSFFPKVVSGPIVLWRDFSSQINNR KVSLDLLYNGVERIMIGFAKKTIIADTLGLTVSNIMENLEYGIDNVTAIGGMLCYTLQLY YDFSGYSDIAIGISNCFGFEVKENFNFPYISSSITEFWRRWHISLGTWFREYLYIPLGEN KKGNIYLNLFIVFLITGIWHGVRWNYIIWGGIHGFFIVLECYLNKNTVWYNKINLIYKRI FTFLIVSFSWIIFMLPEWSMVKKYYAFMFTITQRKLDFTYSYYFNSKVITLIVIGFLGAV 
LCKEIKNEKIKMVVYPILFILAIIFMINSTYSPFLYFQF >gi|296153520|gb|ADVK01000055.1| GENE 82 66734 - 67807 513 357 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329245|ref|ZP_06871746.1| ## NR: gi|296329245|ref|ZP_06871746.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 357 9 365 365 567 100.0 1e-160 MYRVENIKGIDYAKAFAILGVLILHLYLPNLYDEKVLLRLWTEVGVPIFMIVTGHNYILS YYKSKENWLSRNNLYRKLKRIIIPYIYILIFEIILVFIKSDFVSYDFSKYRNIKSLFYLI LIKGGIGPGSYYSPVLIQIVLIYFPLLLVFNKFLNKLIKNEYKNVISLLVIFIIEAMFEV IINYMGSIYNKNFIDNFYRMNALRYTPFLQLGIILYNHKNQILKNFKKILPLSIGGGVYT YLTHYKDCTFPPFYYWKQVATPIMFHALFFIYIALKYFNKSNENFFEKVIITIGKSTYHI FLVQMVYFGMLRIQPYDKGFYYLMHILICIGAGIIFYYAEPKITKKLELFIKRRVVV >gi|296153520|gb|ADVK01000055.1| GENE 83 67812 - 69332 1062 506 aa, chain - ## HITS:1 COG:CAC3047 KEGG:ns NR:ns ## COG: CAC3047 COG0728 # Protein_GI_number: 15896298 # Func_class: R General function prediction only # Function: Uncharacterized membrane protein, putative virulence factor # Organism: Clostridium acetobutylicum # 3 452 11 460 520 192 34.0 1e-48 MKKIVFSIMIISVISKIFGFGRELFFSYYFGASYVTDAYLVSTTIPLVIFSLVGVGINSA FIPIFTSISENKSKERAFTFTSRLLLSLFIICTLSYFIILVFTSPIVKIFASGFSGDILK LTVEYTRISALIIYFVIVINIFTALLQVNNKFYIASIIGIPFNIAYMIGIYIAYLKGNTY LPIVTVIAYLVQAFMLFYPVKKLGYKFKYNLGLKDKYLKQMLIIALPAIIGGSLEQVNYL IDKTVASRIGIKGGITLLNYSSKLNLAISGILISSLLVVFFPRISKLVAKNDRITLKNEI LNTISFTMIVSIPISILILILRYEIISFLFQRGNFNKSNTIITAKCLLCYNIAFSFIGLR EILSRIFYALKDTKTPVINSVIGVILNIFLNLTLSKYLGLPGIALATTISIIFTVILLFF TLYKKYKVLYIKEIIVTFLKVILASIVVGFIVYNTKNILINYPLILNLMFSSLVGIIIYI LIIIFMRIEIVDDFVKQIKRNVIRRF >gi|296153520|gb|ADVK01000055.1| GENE 84 69346 - 70470 1672 374 aa, chain - ## HITS:1 COG:FN1679 KEGG:ns NR:ns ## COG: FN1679 COG0037 # Protein_GI_number: 19705000 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: Predicted ATPase of the PP-loop superfamily implicated in cell cycle control # Organism: 
Fusobacterium nucleatum # 1 373 1 373 373 644 81.0 0 MKYCKKCLQPDTRPGIKFDDEGICYACLYEEEKKKIDWETRERELKEIAEWAKKNAKNSY HCVIGVSGGKDSTFQAIYAKEKLGLNVLLVNGEPDQMTEIGRKNIETLINKGFDIIKLRP NPKIVKKLVRESFIKYGNPQKPTEYPLWASAYIIADKFDIPLIIQGENAALTLGVVNTGL GVDGNALNVNEGNTLAGCNASDWVDDEITLDELYMYQFPDKKRLIDKGIKAIYLQYYTKE WSQVYNADFSVARGLLGRSTEDLHDLGRYRRYTSLDSNLHIVSQMLKYYKFGFGFATDEA CYDIREGRLTREEAIWLVNEYDGKCGQQYIDEFCEYIGITNDEFLEVLDKFVNKDLFEKK DGKWVPKFKIGENI >gi|296153520|gb|ADVK01000055.1| GENE 85 70467 - 71669 1260 400 aa, chain - ## HITS:1 COG:MJ1066 KEGG:ns NR:ns ## COG: MJ1066 COG0399 # Protein_GI_number: 15669255 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Methanococcus jannaschii # 4 393 7 380 386 298 39.0 2e-80 MNRIPYGKQFIEKKDIEAVVEALKSEFMTQGPKIQEFEETVAKYHNCKYAVAFCNGTAAL HGAYYALGLKENDEFITTPITFAASGNGGLYLGGIPKFVDIDKNNYNIDTIKIKDAITPK TKVITPVSFAGFPVDLKRIKEIVNETGYDIKILHDAAHAIGAICDSRNIVDYADATILSF HPVKHVTTGEGGMVLTNNKEVYKKLCLFRTHGITKNQEELIEKQGDWYYEMQELGYNYRI TDLQCALGIVQMSKLDHSLYQRNKIAQFYDENLKDVEWLTLPLNYFSKEWLKDSEYESLQ KKPNNLNSYHLYPILLKNKEDRKDFFDYMRENNIFVQVHYIPLHLMPFYKEKYGFKKGDF PNAEDFYSKEVSIPMYPILTQEELDYIVSTIKKFKKGDQL >gi|296153520|gb|ADVK01000055.1| GENE 86 71662 - 73188 1662 508 aa, chain - ## HITS:1 COG:CAC2187 KEGG:ns NR:ns ## COG: CAC2187 COG2089 # Protein_GI_number: 15895456 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sialic acid synthase # Organism: Clostridium acetobutylicum # 170 503 17 350 350 382 55.0 1e-106 MINFENILNSTLEEQKKIRIWRNKKEVRKYMFNNKIISEEEHLKWLESLKRITNKEVFYI IYNNVKIGIASIDKISEEIYEFGYYLNDLVPKGKGIGKKVFYYFYLFLFHNYKKLNEIRC EVLNNNLASLNLLFFLGMKKIEERQISVNEQIFNSISLSITREEWNMNNKVFIVAEISAN HGRDINIVKETIKVAKECGADAVKIQTYTPDTLTLNCNNEYFQIKDGTIWDGKILYDLYK EAYTPWEWHKEIFDYAKELNICLFSTPFDKTAVNLLESLGNPIYKIASFEINDIPLIEYA ASKKKPVIISTGVATEEEIKDAIEACKKVGNYDITLLQCTSQYPAKLEDANLVMIEDLAK 
RFSVKSGLSDHTMGHLVATTAVAMGAKVVEKHFILNRSIGGPDSSFSMLPSEFKEMVDNI RNVEKMIGKISYEISEKKEASLKFKRSLFVSKDIKKGEVITENNIKSVRPSNGISPKFYY EIIGKKVNRDLEFGTPLLFEYIEEESNE >gi|296153520|gb|ADVK01000055.1| GENE 87 73190 - 74137 1159 315 aa, chain - ## HITS:1 COG:CAC2186 KEGG:ns NR:ns ## COG: CAC2186 COG3980 # Protein_GI_number: 15895455 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase # Organism: Clostridium acetobutylicum # 5 313 2 337 339 72 27.0 1e-12 MVKKRVIIFTEGGQDIGFGHLTRCSALYDEIDKRGIEVVLVIYGKGIENLLGKKKYILVD WKNLEFLRSFLNKTDYVIIDSYLTTTEVYDFCSKNTARCLYVDDTNRVNYPKGIILNPSL SENIKYNTKNEVLQGKDYIILRKEFTEEKIPNFEKEIDVLITLGGTDIRNLIPKLLDILR AINDKLKIVVVTGKISEDIKKIETDYIKVFSSIEALEMRNLILKSKFVICGCGQSIYEML ALKASFLPILIIDNQEKNKEFLLKNKLIEDILDYNDEKEKFYNIILKQIFLERNNNIILD VKGSKRIIDFFLEEK >gi|296153520|gb|ADVK01000055.1| GENE 88 74139 - 75920 2047 593 aa, chain - ## HITS:1 COG:FN1686 KEGG:ns NR:ns ## COG: FN1686 COG1861 # Protein_GI_number: 19705007 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Spore coat polysaccharide biosynthesis protein F, CMP-KDO synthetase homolog # Organism: Fusobacterium nucleatum # 333 591 1 259 259 377 75.0 1e-104 MYNILEVANSHGGDISYLYTLIEKYSIFDKKENFGIKFQPFKYDLMAVEDYSYYSLYKDL FFTKKEWEDIIQKAFITKDIWLDIFDEYGLEILEDNLDKIEGIKFQTSILDNLTIFRKLK KINTFNLKVILNIAGRSLDEIDSIINKYSLLKFKEIYLEVGFQGYPTSIEDCGISKIAIL KERYNNIKIVFADHTDSQTDGAIILPLLVGFNCCDIIEKHIMLDRENTKYDFYSSLTFEQ YKKFIDEQKKYFNLKKEKFINEKEREYLKKTLQIPILNKDKKAGELIDIENDFEFKRNDS YGLNIVELKKCVSEKYILNSDRKKGETIRKEDIRKANIAVIIACRLKSTRLKRKALLKIG SISSVEMCIKNILKFKDVNSVVLATSTTEEDSELKNYTYNKNVIFHQGDPDDVIQRYLDI IEKKNFDVIIRVTGDCPYLSSDIAETILNSHFEKGAEYSNGVGAAVGTNLEVINALSLKE VKKYFPKADYSEYMTWYFQNNPEEFKLNYVDLPEKWKRDYRLTLDYQEDLDLFNKIEKYF VEEKIEYSIDKLYEYLDDHPDIAKINSSITQKYQTDTKLIETLNKYTKINKGN >gi|296153520|gb|ADVK01000055.1| GENE 89 76026 - 76823 875 265 aa, chain - ## HITS:1 COG:FN1687 KEGG:ns NR:ns ## COG: FN1687 COG1028 # Protein_GI_number: 19705008 
# Func_class: I Lipid transport and metabolism; Q Secondary metabolites biosynthesis, transport and catabolism; R General function prediction only # Function: Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) # Organism: Fusobacterium nucleatum # 1 265 6 270 270 424 78.0 1e-118 MRNLFNLKGKVILITGGSGYLGKAMSHALAEYGATLVLASRNSKKNKELCTELTNLYNNN NASLELDLESKEDVISKIKKIIAEYGKIDVLINNSYYGASGKFHKMSYENWNKGIQGSID TVFLCTKAVIDEMRNNKKGKIINISSMYGINAPNVQELYDGDLCEKYYNPINYGVGKAGI IQFTKYIAAVYGKEGIICNSISPGPFPNEEVQKNKTFLERLINKVPLKRVGQPEDLKGAI VFLCSDSSNYINGHNLVIDGGWTIW >gi|296153520|gb|ADVK01000055.1| GENE 90 76823 - 77695 1016 290 aa, chain - ## HITS:1 COG:FN1688 KEGG:ns NR:ns ## COG: FN1688 COG0667 # Protein_GI_number: 19705009 # Func_class: C Energy production and conversion # Function: Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) # Organism: Fusobacterium nucleatum # 1 290 1 290 290 358 73.0 9e-99 MKLCLGTVQFGLNYGIEKKKVETNEIDKILTTALKNGISMLDTAQNYGDSEKIIGKFKKK DNFKIISKISSTSLNDTNNLEQLKKILKNSLDNLGISSLDGLLLHKPEDLKNIDFLNNLN ILKKEGYFLNLGVSIYLPDEANLALEIESIKYIQIPYNILDTRLDKINFFEKAKKKNKII FARSIFLQGVLLKKHSSYPNSLEGLKKFDNLIEDEIKKVGCSKLSFLLNFARSNKNIDYL VVGIESLENLEEIIEAYNNTELQAYNYNKLRDNFINIPEKILNPSLWEEK >gi|296153520|gb|ADVK01000055.1| GENE 91 77714 - 78724 1243 336 aa, chain - ## HITS:1 COG:FN1689 KEGG:ns NR:ns ## COG: FN1689 COG1086 # Protein_GI_number: 19705010 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Fusobacterium nucleatum # 1 336 1 336 336 600 88.0 1e-171 MLDNKVILVTGGTGSFGNKFIERILKKYNPQKIIIYSRDEFKQDLMKKNFIAKYGIEKAQ KLRFFIGDVRDKERLYRAFNGVDYVIHAAAMKQVPACEYNPFEAIKTNINGAENIVEAAI DRKVKKVIALSTDKAVNPINLYGGTKLVSDKLFISANAYSGESGTIFSVVRYGNVAGSRG SVIPFFKQLLEQDEKELPITDLRMTRFWMVLDDAVNLVLKALEESKGGETFVFKNPSFLI TELAKALNPNGTIKEVGIREGEKIHEVMITRDDANYTYDYGDYYVIYPNFEWWSKEKIKS GGMLIPKDWDYNSGTNNVWLDADELRKRIAKLDIKY 
>gi|296153520|gb|ADVK01000055.1| GENE 92 78718 - 80130 1045 470 aa, chain - ## HITS:1 COG:no KEGG:mru_1879 NR:ns ## KEGG: mru_1879 # Name: not_defined # Def: sialyltransferase # Organism: M.ruminantium # Pathway: not_defined # 9 436 4 466 508 158 31.0 6e-37 MKENYSQKTTDEILKIIINYENEENLFSKKIKGVYFYKLIRVGLYNKILNVILGTKNAQD SMRNGTIVFNFLKKYIVNLFNKDKKSFDVLLFDTGRVFSYEGQKESIYMYDIIKKLKNKE LSFRIVYPWITPNENRVFEAPPTFKYFLQYIFLTIKLKLNKKLNFQKFDIDETRFIENCN KELCNLLGINPNLSLFDKSEIDREIEKFKIQYNYYFSYFKKKNIKEIYIICAYGKEGVVA AAKDLNIKVIEIQHGTITKYHLAYHYPTNQKIPYFPKYFYSFGKYWEEIVAFPKGTKLKV YGFPYLQNQLIKYKDVEEKKDQILFISQGHIGEKLLYKAIEFALKNRNKNIVYRLHPGEL RYSIKNYMSILQKYTLNNFILEDCKEELYKLMKESEYIFAVSSTTVYEALALGKDVGIIN LPSYEEVIDLIRQSYVDFYEEKDINEIYISHFKNTDYNKFFNMEMEEGLC >gi|296153520|gb|ADVK01000055.1| GENE 93 80096 - 81205 734 369 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329256|ref|ZP_06871757.1| ## NR: gi|296329256|ref|ZP_06871757.1| membrane protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] membrane protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 369 12 380 380 524 100.0 1e-147 MYITFSFVSSYDLKIRRQILLILAVLAFFIVLFYYKNFFKKNKKDIYTLGISLLLLLSLI FNIELHNFAYTSVNIIVLLACMLITKEKYEKVINVIKKNLIIFNYINCFLLNFFHVTTSD AKKIFGYTIMRRRTDWVNLSIVSVWALLLLIYSIWSVKENHKRTMYDYINIFTSIVLIFL SGKVNVFIAIIVMILIVTYKKFSKKNNLLILFIEIFFINTSWLFLSFKKIIGKFIDIPFI LTGRDRLWQDYYTYIFKNPFKIIVGFNFRQEEVSYINHPHNQYLMIFYILGIFGITLYFL LFYKVLKNTNKNNKLLFNLNIGILILMTGDDYFVLTVMPLHYLIIMCGLYFKKKEVEQYE RKLLPKNHR >gi|296153520|gb|ADVK01000055.1| GENE 94 81238 - 82317 1401 359 aa, chain - ## HITS:1 COG:TM0585 KEGG:ns NR:ns ## COG: TM0585 COG0673 # Protein_GI_number: 15643351 # Func_class: R General function prediction only # Function: Predicted dehydrogenases and related proteins # Organism: Thermotoga maritima # 4 358 5 355 360 287 40.0 3e-77 MYNVAIIGCGRISHKIAEGIAKNNDRMNLFVLCDPIEEKMFETEKTYNKKMECQVVLSKY KDYKEILNQNKIDIAIISTESGYHEEIGLYFLENGVNVIIEKPLAMSIEGAQKLVDMAKK NDLKLAASHQNRFNYPIQLLKKAIKQNRLGKIFNGMARILWTRDDNYYLQAPWRGTWALD GGTLMNQCIHNIDLINWMMDDEIDTVYAQTSNYIRNIEAEDYGVIVIRYKSGKIATIEGS AIVYPKNLEETLTITGEKGTVVIGGMAVNKINTWRVEGDNEEEYLSIDCGDPNSVYGYGH EALYKDFLDALDENRDPLVDGEAGLEAIKIILAAYKSQKTGLPVKFDEFKEFSTLDMEK >gi|296153520|gb|ADVK01000055.1| GENE 95 82325 - 83536 1258 403 aa, chain - ## HITS:1 COG:PA3147 KEGG:ns NR:ns ## COG: PA3147 COG0438 # Protein_GI_number: 15598343 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferase # Organism: Pseudomonas aeruginosa # 3 393 2 403 413 137 26.0 4e-32 MKSIWFINEHDAPPEFSKTRRRYDMCKYLFRLRKYKLHIICGAFLHGTQNKYAYAKNQKK NVNINGVDVHILKGVKYGSNVKRIFSMLVFMLRLIFFKFPKNDKPDLIYASSPHLFAALG SLILAKKNKAKYILEIRDLWPETWVQMNIIKKNGLIHKFFLKIEKYLYEKADKIIFLGEN FSYILSLGIDKNKLHSVNNGVDLEEFDKDIESPIKLNLEKFNITYTGAIGPANNLDTLLD LAKLIDNNSIIFNIVGFGPLKEHLKNRVEKEGILNIRFYEPINKIFVPSLLRSSEVLIIL LLDIELYKAGISPNKLFEYFASSKPILFFGNTVSDYVADANSGISVPAGDITRLKDACLR LYNMSVEEREQLGRNGRNYVEKNFDWKILANKVDEIIENVLKG >gi|296153520|gb|ADVK01000055.1| GENE 96 83549 - 84211 814 220 aa, chain - ## HITS:1 COG:aq_2131 KEGG:ns NR:ns ## COG: aq_2131 COG0223 
# Protein_GI_number: 15607077 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Methionyl-tRNA formyltransferase # Organism: Aquifex aeolicus # 81 182 74 178 303 61 34.0 1e-09 MLIGVLTYNIPHRKTYDTLCLLKARGYKDVIVFATPLHYKKQFKPLYEHRPQLINQISSI KDFCNNLDYKYELIDNFLDIDLKENTKILVCGAGILPDNFIKKYKVINSHPGYIPEVRGL DSLKWAIILEKKIGVTTHLIGDEVDAGYIIEQKEVPIYENDTFHALSQRVYETEICMLVD AIEKSDRNLIYKNGKNTEVHTRMSKEIEKDLLKKFEELKI >gi|296153520|gb|ADVK01000055.1| GENE 97 84232 - 85185 1107 317 aa, chain - ## HITS:1 COG:VC2250 KEGG:ns NR:ns ## COG: VC2250 COG1044 # Protein_GI_number: 15642248 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase # Organism: Vibrio cholerae # 113 315 113 340 351 139 35.0 8e-33 MTMNNRLFLFHSSNININYDFYIYGASTINNPRNNTVIFLKKNSAELLKKLKYIKDSILI ILENMDAESLKEDNIVIYSNNPRLEYAKLLTKILEENKKEECKLTFKEGYYYGENCYFGN NVIIEPFVTIGSNVTIEDNTIIKSGARIGSNIKIGKRCYIKENCVIGGEGFGIEKDKEGK TYRIPHIGGVEIGDNVEIGALTTVCRGTIENTIIEDYVKIDDHVYVAHNVFIGKGSLIVG GTLIGGSTKVGKNCWISPNTAIKNGLKIGNDVTLGMAARVLDDVKDKQILTNEKADTLEN IKKFVRIKERLLENNKI >gi|296153520|gb|ADVK01000055.1| GENE 98 85172 - 85906 423 244 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|227410568|ref|ZP_03893770.1| acetyltransferase, ribosomal protein N-acetylase [Eggerthella lenta DSM 2243] # 36 234 1 197 386 167 42 3e-40 MEFTYEAYKNMILNLKNNKYKIVGYNNDFEGKTVILRHDVDFSLEKALDIAKLENQLEVF STFFILLSTDFYNINSIKSLEILKNIIRLGGKIGLHFDEKKYLIDEKNDYIKYISYELEI ISTILEFPIDIVSMHRPSQNFLEMNLEIPNVTNSYQKKFFTEFKYVSDSRMHWRENIEEI ISSKKYQNLHILTHPFWYNFNKEEREYKIKKFLQKSVFERYNALNENIKDFEQIFSKEVL NDYE >gi|296153520|gb|ADVK01000055.1| GENE 99 85909 - 87117 1613 402 aa, chain - ## HITS:1 COG:TM0668 KEGG:ns NR:ns ## COG: TM0668 COG0399 # Protein_GI_number: 15643433 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Thermotoga maritima # 10 398 2 373 377 313 40.0 4e-85 
MIIGGYKMKVSLLNLKRQYSYLKKDIETTISEILEGGAYINGPQTKKFEKRMEEYLGVKH AIGVGNGTDALVIALEALGIGKGDEVITSPFTFFATAEAISVVGAIPVFVDVKLEDFNID ENKIEKAITPKTKAIIPVHIFGTPANMDRINEIAKKNNLYVIEDACQAIGAKYKDKMVGT LSDIACFSFFPTKNLGTYGDGGLITTNDDSFATICRALKAHGSGENGEIAYNSLNNIKEE VKVDNQVDDTVYNPKKYYNYLIGHNSRLDELHAGILNVKLNYLDEWNKKRNDIAKYYNEK LDDKKYKKMQLREDNYNVYHMYIIQTENRNELTKKLDEAGIAYGVYYPVPLHLQKVYKNL GYKEGDLPNAEYLSKRTLAIPVDPELTEEEKEYIVNILKSEE >gi|296153520|gb|ADVK01000055.1| GENE 100 87098 - 88396 1882 432 aa, chain - ## HITS:1 COG:PM1003 KEGG:ns NR:ns ## COG: PM1003 COG0677 # Protein_GI_number: 15602868 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: UDP-N-acetyl-D-mannosaminuronate dehydrogenase # Organism: Pasteurella multocida # 2 423 3 420 424 429 51.0 1e-120 MIENIKICVVGLGYVGLPLAIAFAEKDFNVIGFDLNQEKIDKYLQGIDPTNEVGNEKIRS IKNLEFTSDEKKISEASFIIVAVPTPVLENKSPDFRPLIGASTIIGKNMKKGSIVVYEST VYPGATEEVCLPILEKESGMKCGVDFKIGYSPERVNPADKVNTLTKIKKITSGIDKESSD IIAEVYGSIIEVGIHKASSIKVAEAAKVIENSQRDINIAFINELAMIFDRIGIDTLEVLE AAGTKWNFLPYRPGLVGGHCIGVDPYYLADKANELGYHAQVILAGRRINDSMAKFVAEKT IKKLINANIRVKGADILVMGLTFKENCPDLRNSKVNDIILELREYGVSVHVVDPMADKIE AKKEYNIDLEDPKDIKNMDAIIIAVGHKEYRDMDIKDLHKYYNEVYSKPLLIDVKSIFNK EEAEKEYDYWRL >gi|296153520|gb|ADVK01000055.1| GENE 101 88411 - 89622 1496 403 aa, chain - ## HITS:1 COG:SP1837 KEGG:ns NR:ns ## COG: SP1837 COG0399 # Protein_GI_number: 15901666 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis # Organism: Streptococcus pneumoniae TIGR4 # 1 400 1 396 408 424 52.0 1e-118 MEKRKITFSPPDITDREIAEVVDTLKSGWITTGPKTKKFEEEIAKYCEVKRAVCLNSATA AMELALRLFDIGEGDEVITSAYTYTASASVIYHCGAKIVLADSKNGEFNIDSKQIEKLIT PRTKAIIPVDIGGFPADYSEILEIVEKKKNIFNPKKGTYQEKLGRILVLADSAHSFGSNY KGKKIGSIADITSFSFHAIKNLTTAEGGALTWNLPNNFDNEQIYKELMLLALHGQNKDAL AKLKAGAWKYDIVMPGYKCNMTDIMASIGLVQLQRYDNEILKKKQELVSYYEKYLSDLTD KMELPIFKDDDKESCRHLYMIRLKNQDEEKRNEVIAKLGENDIATNVHFQPLPLLTAYKN 
LGFKIEDYPNAYNQYKNEISLPLHDFLSEDDVKYICEYIKKFI >gi|296153520|gb|ADVK01000055.1| GENE 102 89641 - 90228 885 195 aa, chain - ## HITS:1 COG:FN1695 KEGG:ns NR:ns ## COG: FN1695 COG2148 # Protein_GI_number: 19705016 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Sugar transferases involved in lipopolysaccharide synthesis # Organism: Fusobacterium nucleatum # 2 194 7 201 230 164 48.0 8e-41 MLKRIFDIILSLFGLIILLPFMLITAIFIKLDSKGPVFFKQLRVTKNGREFKIFKYRTMR VGSDKYSQITVGKDDRITKIGSFLRKYKLDEIPQLINVLIGDMSLVGPRPEVPKYVALYT DEQKEILKVRAGITDYASIEFSNENDLLASEEDPEKAYIEKVMPRKIELNKKYLSEISIL TDIRIILLTIKKILK >gi|296153520|gb|ADVK01000055.1| GENE 103 90231 - 92042 1962 603 aa, chain - ## HITS:1 COG:FN1696 KEGG:ns NR:ns ## COG: FN1696 COG1086 # Protein_GI_number: 19705017 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Predicted nucleoside-diphosphate sugar epimerases # Organism: Fusobacterium nucleatum # 1 603 5 607 607 1082 97.0 0 MNSIRKLVKFLIDIFLLNISLAISIFLKYDQIQITDRNINFFIYYNLSFCILYFILKIYN NSWRFSGISEYVALITLSVSTTILSYILRVFLKLSTKSSLYFETFIIFTFLLIVSRFLMF LTRMKGIIKKDLNQENVLIYGAGESGVLLVKESRINPNFPYKIVGFLDDNINKIGGKVYG LRVFGGLDKVQEVVEKKDISKIIISMPSVSQNKVSNILKEINKIERLSVKILPNVDNLIE EGNLTTQLRNIKLEDLLGRDEIKINTKEVFEFIQDKVIFVTGGGGSIGSELINQIAKYNP KKIINIEINENASYLMELELKRKYPYLDYKTEIASVRDFDKLDMLFNKYKPDILFHAAAH KHVPLMENNPEEAIKNNIFGTRNIAECCLKYKLESVVLISTDKAVNPTNVMGATKRVCEM IFQKYSEKSSDTKFIAVRFGNVLGSNGSVIPIFSKLIEERKNLTLTHKDIIRYFMTIPEA AQLVIEAATIGKGGEILILDMGEPVKIYDLAKNMIKLSGSTVGIDIVGLRPGEKLFEELL YDVNSSEKTSNNKIFITNMENEKVKVDIDDYYTVLKDLIKENDIIGMRKTLANIIGTFKG RVE >gi|296153520|gb|ADVK01000055.1| GENE 104 92054 - 93046 1287 330 aa, chain - ## HITS:1 COG:no KEGG:FN1697 NR:ns ## KEGG: FN1697 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 330 1 328 328 478 91.0 1e-133 MQNNLMKIKDKFQEEEENEINIYELINIFIKNIKLFVIVSILGVIATCLYIGKRIIFDKN NVLSISYTLNYAELESYLGEKVYYPKKSPNEILLEDKYLEKFFENEELKEYYEKNVKENR 
DNINTKRQFLTDNKILENIPKRENSEIKDSPIIPNSYKITVKINKGYDKDGNASYKILEA YLNILKDYYNKNIFEFINNRKKYLEGAIPILKKELIDNTIVGEVVITDMMNNENNYLKYF FPIKVSNIDSYYPEYVKLESEYQAIKTLFGLGLNKIENFVKYDSSIIVEKEKSGNIIKLG MGIFLSLCLGVLIVFIKEFIEGYKKNKKDL >gi|296153520|gb|ADVK01000055.1| GENE 105 93036 - 93959 1322 307 aa, chain - ## HITS:1 COG:FN1698 KEGG:ns NR:ns ## COG: FN1698 COG1091 # Protein_GI_number: 19705019 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose reductase # Organism: Fusobacterium nucleatum # 1 298 1 298 298 556 98.0 1e-158 MKLIFGANGKLGTDFKELFDSIGEKYIATDKDEVDITNGDFLRAYIKTMHQNYKIDTIIN CAAYNDVDKAETEKELCYKANAEAPANLAMVASEIGATYITYSTDFVFNGMTTNYLYNES TGYTEEDEAHPLSAYAKAKYEGELLVSQIIENPENTSKIYIVRTSWVFGEGGMNFVEKII ELSKEKDELKVVDDQVSSPTYSKDLAYFSWELIKKGCESGVYHLTNDGIVSKYEEAQYIL EKISWKGNLIRAKSEDFNLLAERPKFSKLSCKKIKEKLGVSIPNWKDAIDRYFKENNKIN RGVNNAK >gi|296153520|gb|ADVK01000055.1| GENE 106 93969 - 94217 300 82 aa, chain - ## HITS:1 COG:FN1699 KEGG:ns NR:ns ## COG: FN1699 COG1898 # Protein_GI_number: 19705020 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes # Organism: Fusobacterium nucleatum # 1 82 1 82 82 129 100.0 2e-30 MLFIPKSFAHGFLTLEDNTEIFYKCDNFYNLQNEAGIIWNDRDLNIDWNFKKYNINENEL IISEKDKRNISFKEYKKINNIE >gi|296153520|gb|ADVK01000055.1| GENE 107 94611 - 95165 587 184 aa, chain - ## HITS:1 COG:no KEGG:FN1700 NR:ns ## KEGG: FN1700 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 184 1 184 184 291 98.0 9e-78 MKNDNEELLNLAEFKEKISEVSKSNKENFIDSDIEIINFDEVKKAYCKKRSNIAYPPKSN DGLYVTNDCIVFLEFKNANTKDIDQYELAKKNYDSICMLSDILGISIREIRGKTKYILVY SSEKNSIIDFGQALLKKNSPPNNELESKKKNCFGQKKFLNYLYSNVEALNDKEFKEKYLG KEII >gi|296153520|gb|ADVK01000055.1| GENE 108 95155 - 96504 1151 449 aa, chain - ## HITS:1 COG:no KEGG:FN1701 NR:ns ## KEGG: FN1701 # Name: not_defined # Def: ABC transporter ATP-binding protein # Organism: F.nucleatum # Pathway: not_defined # 1 449 1 449 449 
717 99.0 0 MKLSIENVGILSKIDLEINGITVIAGENDTGKSTISKSLYAMFNGCYRLDEKVINAKIDS IRNNIDNFLEKNFSLHLGKRYIKKDKILEILSKNKKSIDEEAIKNLLLEIIKGTGKRNDL VNKLLNTIKNIERRDLNISIEELKNRINDIKLINNIKIINRLLQENFNREFSNQINNINT NEVSKIVLTIKDMDTNVEIKEGHLSFKKTENFGKIYTRAVYIDDPLILDKESERNPLFST AIEENNYFHSDNLVYDLQRKSELDLIEEIQINEKIRNIFSKINKKGIGKLSFKRDVFKST ITYKPNKDSKILDIRNVSAGLKSFLIIKTLLENGILEEKGVIILDEPEIHLHPEWQVLFA ELIVLIQKEFNMHILLTTHSPYFLYAVELYSKKYEINRKCKFYFSEKKDEKVNLIDVTDD IGIVFKSLSTPFFKLEDMELKMEEENCEK >gi|296153520|gb|ADVK01000055.1| GENE 109 96530 - 97345 874 271 aa, chain - ## HITS:1 COG:FN1702 KEGG:ns NR:ns ## COG: FN1702 COG1968 # Protein_GI_number: 19705023 # Func_class: V Defense mechanisms # Function: Uncharacterized bacitracin resistance protein # Organism: Fusobacterium nucleatum # 6 271 1 266 266 411 98.0 1e-115 MNAIILVVILAIVEGITEFLPISSTGHMILVNKLIGGEYLSPTFRNSFLIIIQLGAIFSV VVYFWKDISPFVKTKKEFVLRFRLWLKIIVGVLPAMVIGLLLDDIIDKYFLDNILIVAIT LIAYGVIFIGIEVVYKLKNIKPKVKRFAGLKYRTAFLIGFFQCLAMIPGTSRSGATIIGA LLLGLSRPLAAEFSFYLAIPTMFGATALKLFKNGLAFTEMEWSYLALGSAIAFVVAYIVI KWFMDFIKKRSFASFGLYRIILGIIVIVLLY >gi|296153520|gb|ADVK01000055.1| GENE 110 97360 - 98358 1417 332 aa, chain - ## HITS:1 COG:FN1703 KEGG:ns NR:ns ## COG: FN1703 COG0451 # Protein_GI_number: 19705024 # Func_class: M Cell wall/membrane/envelope biogenesis; G Carbohydrate transport and metabolism # Function: Nucleoside-diphosphate-sugar epimerases # Organism: Fusobacterium nucleatum # 1 332 1 332 332 640 99.0 0 MIIVTGGAGMIGSAFVWKLNEMGIKDILIVDKLRKEDKWLNIRKREYYDWIDKDNLKEWL ACKENADNIEAVIHMGACSATTETDADFLMDNNFGYTKFLWNFCAEKNIKYIYASSAATY GMGELGYNDDVSPEELQKLMPLNKYGYSKKFFDDWAFKQKNQPKQWNGLKFFNVYGPQEY HKGRMASMVFHTYNQYKENGYVKLFKSYKEGFKDGEQLRDFVYVKDVVDIMYFMLVNDVK SGIYNIGTGKARSFMDLSMATMRAASHNDNLDKNEVVKLIEMPEDLQGRYQYFTEAKINK LREIGYTKEMHSLEEGVKDYVQNYLAKEDSYL >gi|296153520|gb|ADVK01000055.1| GENE 111 98551 - 100833 2364 760 aa, chain + ## HITS:1 COG:FN1704_2 KEGG:ns NR:ns ## COG: FN1704_2 COG0729 # Protein_GI_number: 19705025 # Func_class: M Cell 
wall/membrane/envelope biogenesis # Function: Outer membrane protein # Organism: Fusobacterium nucleatum # 376 760 1 385 385 682 98.0 0 MKKIFFLLYIFLNFGFAYSENIELKSREDVEIENLESQIKVLEDKIQTIKKLKSAKDNKN LKVALVLSGGGVKGYAHLGVLRVLERENIKIDYITGTSIGAFIGTLYSIGYTVDEIEKFL DDVNVSNFLETITDNTNLSLEKKESLKKYSVHLSFDNELNFSFPKGLRGTGEAYLLLKGL LGKYEHMDNFDNFPIPLRIIATNLNTGETKAFSKGDVAKILIASMSIPSIFEPMKIDGEI YVDGLVSRNLPVEEAYEMGADIVVASDIGAPVVEKDDYNILSVMNQANTIQASNITKISR EKASILISPDVKNISALDSSKKEELMKLGKVAAEKQIDKIKLLAKADNKKKKEKFVTNSD AKIIINKIEYNDKFDKNTVIVLNDIFKGLLNNPISKKDIDKKIIDVYSSKYMDKVYYTVD NGVLYLDGEKAHSNRIGVGANYQTGYGTTFNIGTDLFFNGKFGNNINLNFKFGDYLGADL GTLTYYGVKNRFGILTNIGYNESPFFLYKNRRKFAKFMNREAYLNIGIFTQPTNNSMISY GVLSKFSSLKQDTGDSLSQNLEYSENQTKTYIRLKYDNLDSISNPMKGIKADFIYNFASS FGKSKSNLYGPAYSIKGYIPINPKLSFVYGLNSASLRGDRIRADQRIRLGGTYTNINNNE FEFYGFNYQEKQVKDLISLTLGFKHKIIYSLYFNTKFNIATFTENNSFGNNNSRLWKNYS KGMGISISYDSPIGPIEFSISSDLRHKRPIGSISIGYKLD >gi|296153520|gb|ADVK01000055.1| GENE 112 100849 - 101613 750 254 aa, chain + ## HITS:1 COG:FN1706 KEGG:ns NR:ns ## COG: FN1706 COG0730 # Protein_GI_number: 19705027 # Func_class: R General function prediction only # Function: Predicted permeases # Organism: Fusobacterium nucleatum # 1 254 1 254 254 398 99.0 1e-111 MEFDIVKFLILAVFCFIAAVVDAISGGGGLISLPAYFAVGFPPHMALGTNKLSAFLSTFA SAFKFWKAKKVNVEIVSKLFVFSLAGAVLGVKTAVSIDTKYFKPISFGILILVFLYALKN KSLGENNYYKGTTPKTIVLGKIMAFCLGFYDGFLGPGTAAFLMFCLIKIFKLDFSSASGN TKILNLSSNFASLVVFAFLGKLNWAYGISIAIVMTFGAIIGSRLAILKGNKFIKPVFLVV TIVLILKMSVEIFF >gi|296153520|gb|ADVK01000055.1| GENE 113 101597 - 102604 325 335 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|15900011|ref|NP_344615.1| aldose 1-epimerase [Streptococcus pneumoniae TIGR4] # 2 303 7 315 345 129 29 7e-29 MRYFFNSYGGKLEKIKVYTLENEFLKVELLNLGAIIKKIELKNKNGDVKNVVLGYENIEK YRENPAYLGAIIGRTAGRIKDGILKIDDIKYQLDKNNNGNTLHGGKKSISHRFWNVGKIE NGLCFSIKSFRLDNGYPANIEIKVSYILNNNELLIKYFATTDNLTYLNLTNHSYFNLNGN PNNSIYEDILKIDSDYLIGIDKNSIPCKTINLDNNIFDFRESKKLEEFFKADDEQKTLAN 
NGIDHPYIFNKEIGKLEIKNLESGIKISVETDNPAVVIYTANYLQDIGFKKHSAICFETQ EAPSLYRDEDINIYPTFIDKNTNYGKYTKFIFEIN >gi|296153520|gb|ADVK01000055.1| GENE 114 102716 - 104815 1716 699 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|62291006|ref|YP_222799.1| polynucleotide phosphorylase/polyadenylase [Brucella abortus bv. 1 str. 9-941] # 1 699 1 694 714 665 50 0.0 MFDEKIMELELVGRTLKVSTGKISRQSSGAIVIQYGDTVLLSTANRSKEARKGADFFPLT VDYIEKFYSSGKFPGGFNKREGRPSTNATLVARLIDRPIRPMFPDGFNYDVHIVNTVLSY DEINIPDYLGVIGSSLALMISDIPFLGPVASVIVGYKNGEFILNPSPKELEESELDLIVA GTKEAVNMVEAGAKELDEETMLKAIMFAHENIKKICEFQEEFSKLYGKENIEFEKPEVLT LVKDFIDTNGYERLQQAVLTTGKKNREDAVDSLEEELMEKFIQKNYPDVPEEELPEDIIL EFKTYYHDLMKKLVREAILYHKHRVDGRTTTEIRPLDAQINVLPIPHGSALFTRGETQSL AITTLGTKEDEQLIDDLEKEYYKKFYLHYNFPPYSVGEVGRMGSPGRRELGHGSLAERAL SYVIPTEEEFPYTIRVVSEITESNGSSSQASICGGSLSLMSAGVPIKEHVAGIAMGLIKE GEEFTVLTDIMGLEDHLGDMDFKVAGTKSGITALQMDIKITGITKEIMRIALNQAHQARI QILELMNNTISKPAELKSNVPRIQQITIPKDKIAVLIGPGGKNIKGIIEQTGATVDITDD GLVSVFAKDAETLEKTLKLVDSYVREVEYNEVYEGRVVSIMKFGAFMEILPGKEGLLHIS EISPERVEKVEDVLSVGDVFKVRVISMEGGKISLSKKKV >gi|296153520|gb|ADVK01000055.1| GENE 115 104831 - 105394 302 187 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|229231897|ref|ZP_04356325.1| SSU ribosomal protein S12P methylthiotransferase [Cryptobacterium curtum DSM 15641] # 5 180 484 665 904 120 35 3e-26 MNLPNRLTMIRFILAIPFIIFLQYSDSSKYGLIFRLISLVIFVIASLTDFFDGYIARKYN LITDFGKIMDPLADKILVISALVIFVQLEYIPGWMSIIVLAREFLISGIRILAAAKGEII AAGNLGKYKTTSQMLVVIVALAIGPIGFYISDYFFTVAEVLMLIPVILTIWSGWEYTFKA KHYFTEQ >gi|296153520|gb|ADVK01000055.1| GENE 116 105408 - 105683 295 91 aa, chain + ## HITS:1 COG:FN1710 KEGG:ns NR:ns ## COG: FN1710 COG0762 # Protein_GI_number: 19705031 # Func_class: S Function unknown # Function: Predicted integral membrane protein # Organism: Fusobacterium nucleatum # 1 91 1 91 91 115 100.0 2e-26 MSLLAYSLITIIQKLFWLVDILILIRVLLSWIPMNNNFTELIYNLTEPMLKPFKDFLNKY INLPIDFSPLLFLLCLEAVERILIRFIIVIF >gi|296153520|gb|ADVK01000055.1| GENE 117 105741 - 106685 1301 314 aa, chain + ## HITS:1 
COG:FN1711 KEGG:ns NR:ns ## COG: FN1711 COG0275 # Protein_GI_number: 19705032 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis # Organism: Fusobacterium nucleatum # 1 314 1 314 314 577 98.0 1e-165 MEKIGNDYHIPVLYYETLDNLVINPDGTYIDCTLGGGSHSEGILERLSDKGLLISIDQDT NAIVYSKKRLEKFGSKWKVFKGNFENIDTIAYMAGVDKVDGILMDIGVSSKQLDDPERGF SYRYDVKLDMRMNTDQKISAYDVVNTYSEEQLSKIIFEYGEERHARKIAKLIVEERKSSP IEKTSDLIALIKRAYPERASKHPAKKTFQAIRIEVNRELEVLENAMSKATELLKVGGRLA IITFHSLEDRIVKNKFKDLATACKCPKDIPICVCGGVKKFEVITKKPIIPIDDELKNNNR AHSSKLRILERILD >gi|296153520|gb|ADVK01000055.1| GENE 118 106687 - 106944 321 85 aa, chain + ## HITS:1 COG:no KEGG:FN1712 NR:ns ## KEGG: FN1712 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 85 4 88 88 110 100.0 2e-23 MRYITIVFAFCILGIWLFNVKTLREVTSLEKELKIANENLEELEKELDKKIMFYDAKLDL DKIRREMEAKGMKVSDEVIYFEIEE >gi|296153520|gb|ADVK01000055.1| GENE 119 106946 - 108577 1858 543 aa, chain + ## HITS:1 COG:FN1713 KEGG:ns NR:ns ## COG: FN1713 COG2265 # Protein_GI_number: 19705034 # Func_class: J Translation, ribosomal structure and biogenesis # Function: SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase # Organism: Fusobacterium nucleatum # 1 450 13 462 464 725 94.0 0 MLKVSDIIQIKIDKIVFGGEGLGYYNGFAVFVPMSIPEDELEIEIISVKKTYARGLIKNI IKASPERIDSHKFTFEDFYGCDFAMLKYESQLKYKRLMVEEVMRKIAGLSDIEISDVLAS EDIYNYRNKIIEPFSIYNNKIITGFFKRKSHEVFEVDENILNSKLGNRIIKELKEILNKN KISIYDENTHRGLLRNIMIRTNSNNEAMVVLIINFNKITENIKKLLFNLRENIEEIKSIY ISLNSKKTNTVIGEKNILIYGEKSIKENINGIEFHISPTSFFQINVKQAKKLYDIAISFF DDIDDKYIVDAYSGTGTIGMIIAKKAKKVYAIEIVKSASEDGEKTAKENGIKNIEFINGA VEKELVKLINNNKKIDIIIFDPPRKGLEASIIDKVAELNLKEVVYISCNPSTFARDIKLF SEKGYVLKKLQAVDMFPQTSHIETVALLSKLDVDKHINVEIKLDELDLTSAESKATYAQI KEYILEKFDLKVSTLYIAQIKKKCGIVLREHYNKSKKEKQVIPQCTPDKEEAIMDALRHF KMI >gi|296153520|gb|ADVK01000055.1| GENE 120 108596 - 108877 327 93 aa, chain + ## HITS:1 
COG:no KEGG:SZO_12680 NR:ns ## KEGG: SZO_12680 # Name: not_defined # Def: membrane protein # Organism: S.equi_zooepidemicus # Pathway: not_defined # 1 93 1 93 93 97 89.0 1e-19 MRLIIKIILFPISLLLSILTAFLTFLLGVGTALLYLLMMFCIFGAIASFLQKEVTIGIEA LIIGFLVSPYGIPMVGAAIIAFLKGINEAIKSI >gi|296153520|gb|ADVK01000055.1| GENE 121 108964 - 109761 893 265 aa, chain + ## HITS:1 COG:no KEGG:SDEG_1349 NR:ns ## KEGG: SDEG_1349 # Name: not_defined # Def: hypothetical protein # Organism: S.dysgalactiae # Pathway: not_defined # 1 265 1 259 259 435 92.0 1e-121 MDFDYFYNREAERFNFLKVPEILVDGEEFKGLSAEAIILYSMLLKRTGMSFKNNWIDKEG RVFIYFAVEEIMKRRNISKPTAIKTLDELDSKKGIGLIERVRLGLGKPNIIYVKDFMSVF QVKENDFQKLKNFTSEVKDVDLRSKENELQEVKNVDRNYIENNKSKYSKRKYSKREYSFG VNGLGTFQNVFLAAEDISDLQIIMNSQLDNYYIERLSAYIKSTGKTYKDHKATILSWFYK DQGRGKEVKTSNISTWEEYDKGDQL >gi|296153520|gb|ADVK01000055.1| GENE 122 109866 - 111041 771 391 aa, chain - ## HITS:1 COG:FN1676 KEGG:ns NR:ns ## COG: FN1676 COG3547 # Protein_GI_number: 19704997 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 391 1 391 391 658 96.0 0 MFLLGIDIAKLNHVASCIDSSTNEVIFSNFKFKNDFKGFSALLNKIKSFDAKNLIIGLES TSHYGENLINFLFIHDFKVALINPLQTSHLRKANIRDAKNDNLDSLNIAKSLLFTKLNFV SKKNIECFSLKKLTRFRSNLIKQRSKAKIQLTSLLDLLFPELQYLFKSKIHSKAIYFLLK KYPSTEEIAALKDDEISNLLYASSKGHFKKEKSIELKSLAKTTVGIKDTSISLHLIQLIE LIELYTKQIKDIEIKITDIVNNLDITLLSVPGISIIACAIILGETNNFENFSSSKKLLAF AGLDPKIRQSGNFNVSSCRMSKKGSPYLRYALIFTAWNIVRHSEKFNKYYSLKRSQGKTH YNALGHVAHKLVRILFTLIKKNISYQEEKLE >gi|296153520|gb|ADVK01000055.1| GENE 123 111317 - 111442 79 41 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296329286|ref|ZP_06871787.1| ## NR: gi|296329286|ref|ZP_06871787.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 41 1 41 41 66 100.0 7e-10 MDYKTIRDKIEDMVNDNHKDFVKAIISMEKSLTMKGFRQPL >gi|296153520|gb|ADVK01000055.1| GENE 124 111495 - 111974 343 159 aa, chain + ## HITS:1 COG:no KEGG:SEQ_0755 NR:ns ## KEGG: SEQ_0755 # Name: not_defined # Def: conjugative transposon conserved hypothetical protein # Organism: S.equi_equi # Pathway: not_defined # 1 159 61 219 219 229 92.0 3e-59 MIEDLREQGQIKDLPYAQEEKDNLINIVGNIVGEVDVVERENKNGEAFKVTNFSVVSKDD EGNKVYHNCSAYGEKSDIPKDFKQGDFVKLFGQIRTSIDDNGKEHSNIRILSSKLLKAKE QMKGKEEKKDSVLGAIKKYQAEEKEKPKEKKEATKEAER >gi|296153520|gb|ADVK01000055.1| GENE 125 112144 - 113211 955 355 aa, chain + ## HITS:1 COG:VNG2062G KEGG:ns NR:ns ## COG: VNG2062G COG0464 # Protein_GI_number: 15790910 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: ATPases of the AAA+ class # Organism: Halobacterium sp. NRC-1 # 66 236 84 265 369 76 29.0 7e-14 MKKQNVLNLIKYHVERNENSFRNEAIAIARYFDSIGDYQLSEYIMGLISESNLYSPQSSD FESEFLKQVETRNLGALNLPLEITEDIKGIINAVNHNVGINKFLFEGLPGSGKTEAAKNV ARLLDRSLFRVDFENLIDSKLGQTNKNIIKVFKEINMIPNANKIVVLFDEIDVIALDRIN SNDIREMGRVTSTILRELDRLTDLNKKIVLIATTNLYSNFDKALVRRFDAVINFNRYSNE DLIEVAEYYFSSFIKSFKGTSKDTRLFKKILKTAPKLPYPGELKNIIKTSLAFSDVGSEY DYLKRLYNSLIGNLDQKGINQLHEEGFTVREIEKLKGESKSAVSRKLKKEEVDSE >gi|296153520|gb|ADVK01000055.1| GENE 126 113204 - 115438 2396 744 aa, chain + ## HITS:1 COG:FN2100 KEGG:ns NR:ns ## COG: FN2100 COG1404 # Protein_GI_number: 19705390 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Subtilisin-like serine proteases # Organism: Fusobacterium nucleatum # 271 647 25 406 416 67 25.0 8e-11 MNRVLELKGKRFVQARRNGNGGGIAMNGKVEVTTEHLLRLKCQLNQIKKFWDKETKPFEG ILVSVYYNKIVAKSNRISGLFKGTDSNEAIVGAKFNRDKSKHIITYFLDEYDLTKSIELI TDTSNILSQKFVGKISKNIFEDKNIVNPTVFKDFSVSMSIFKQVIADVSYIDSFEVELPT IDVKQSIITLYDVRKDTKLLFERLGIDILSSRILDNQTVYLDENQVEILFEKAPYLVSMA TVDLSSLSPDDFINEYQTKMVTIPSPTIEPTIGVIDTLFDESVYFSDWVEYHDMVSEDIP KNPKDYNHGTAVSSIIVDGARLNPWLDDGCGRFKVRHFGVAVGAEFSSFSIIKKIKSIVV 
NNSNIKVWNISLGSNQEINDNFISAEAATLDKIQFDNNVIFVVAGTNKPSADIVKIGSPA DSVNSIVVNSVSKNGLSTKYARKGLVLSFFAKPDVSYYGGSEEEYIRVCEPLGAASVAGT SYAAPWIARKLSYLIDVLGLNREIAKAMIIDAARGWDEKPTPEEVALYGHGVVPIHINDI IQTKDDEIKFVVSDISEKWNTYNYDFPVPLKDNKYPYIAKATMCYFPLCDRLQGVDYTNT ELNLHFGRIGDDKKIKDIKGDKQNTEDPLEGERNYLLEGEARKQFRKWDNVKYIAESATK RMIPKDSYRNKNWGMEIKTNNRLAPQDGVGVRFGVVVTLKEMNGVNRIDEFIRSCTLNGW LVNVIDVVNRVDIHQKVNEEIEFE >gi|296153520|gb|ADVK01000055.1| GENE 127 115476 - 116432 907 318 aa, chain + ## HITS:1 COG:no KEGG:Sbal195_2015 NR:ns ## KEGG: Sbal195_2015 # Name: not_defined # Def: hypothetical protein # Organism: S.baltica_OS195 # Pathway: not_defined # 67 294 46 272 299 117 34.0 7e-25 MIFTEEQLRKYAKPLSETEENQCKNAIRMVVDALKTIGFNERTSIQKMYSETPSFEVMMK SNDDYEVKIFLQGSYANNTNVRQHSDVDIAVVQIDQFRPKYRVGVSKTNYGFISAISKSK TFKDIVQSALENKFGDDVERKNKSIKIHGNSYRKDADSVPALRYRDYSYDYRFDPENYVG GILIKADDGTEVINYPEQHIKNGIDKNKRTNLYFKKMVRIAKEMRYQMQDEGYEFAQKTS SFAIECLLYNVPDEIFTKYDCYKYIFDDIVEFLYDNKHNINSFVEVNGIKKLCYDSADRE KVYKGFIDELKGFYSYEI >gi|296153520|gb|ADVK01000055.1| GENE 128 116422 - 116967 289 181 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296329291|ref|ZP_06871792.1| ## NR: gi|296329291|ref|ZP_06871792.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 181 1 181 181 291 100.0 1e-77 MKSKMESLIKLSGIIIILIFIILYKFKKPQSGLDYWDLILESAGYSVIAIAIYERYLWKY NPFVKIPRLKKKYTGVLSYNYKGESGEKAVEIEIRQSFLYTDVKLKSDEISSKTITSNLV EENGGFVLYYTYITNPLSRYSEKNPIQIGTCKLQIDKIDSIKGSYWTNRKTIGDLTLKSE E >gi|296153520|gb|ADVK01000055.1| GENE 129 116977 - 117306 278 109 aa, chain - ## HITS:1 COG:no KEGG:SZO_12770 NR:ns ## KEGG: SZO_12770 # Name: not_defined # Def: relaxase/mobilisation protein # Organism: S.equi_zooepidemicus # Pathway: not_defined # 1 109 335 443 443 157 89.0 2e-37 MQELSATMEQIHTVKKYRSYYKEYKANPSDKTFFEEYKSQIKLYETALSELKKSYSKLPN SKDILDKLDKLQEKKNTLMQEYSSSKSDMDELYKIRKNYGIYMGKEIER >gi|296153520|gb|ADVK01000055.1| GENE 130 117385 - 118308 1537 307 aa, chain - ## HITS:1 COG:SP1056_1 KEGG:ns NR:ns ## COG: SP1056_1 COG3843 # Protein_GI_number: 15900926 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Type IV secretory pathway, VirD2 components (relaxase) # Organism: Streptococcus pneumoniae TIGR4 # 1 249 1 263 402 73 30.0 5e-13 MAITKIHPIKSTLHLAINYIVNGDKTDEQLLVSTHKCHESTAHTQFLRTREEAGTKGTVL ARHLIQSFLPGEATPEMAHRIGLELCKKILKDEYEFVLSTHIDKGHIHNHIIFNNVNMVT GKCYQSNKKSYHKIRYQSDKLCKENNLSVINEHYESYKKKYKTNGKSWYENEHAKRGTSW KSRLQFDIDRMVKQSKDWDEFLKKMTDLGYEIKYSKHIAFKPKDKPRFTRSKTIGEDYTE ERLKERIAEISSIKTPSVKKRIGNVIDMNSNVKVKESKGYEYWATKHNLNTMAESVIFIR EHSIKSV >gi|296153520|gb|ADVK01000055.1| GENE 131 118311 - 118667 175 118 aa, chain - ## HITS:1 COG:no KEGG:CD1870 NR:ns ## KEGG: CD1870 # Name: not_defined # Def: putative conjugative transposon mobilization protein # Organism: C.difficile # Pathway: not_defined # 1 113 1 113 116 179 89.0 3e-44 MANRIRNERLEMKLTKEEKALFEEKRKLAKCRNMSHFICKCVLEKEIYQVDLEPFRELQG LLSNATNNINQIAKRVNQTGIIYKDDIGDMKKEIEHFSKELWQIHSLLLKRTAETEGE >gi|296153520|gb|ADVK01000055.1| GENE 132 118712 - 118840 239 42 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MLNACSITHTFVVSTVSVRTLTEKLMKQPFGICFFFYQFLAC >gi|296153520|gb|ADVK01000055.1| GENE 133 118965 - 119537 572 190 aa, chain + ## HITS:1 COG:FN0473 
KEGG:ns NR:ns ## COG: FN0473 COG1309 # Protein_GI_number: 19703808 # Func_class: K Transcription # Function: Transcriptional regulator # Organism: Fusobacterium nucleatum # 1 190 1 189 189 161 50.0 7e-40 MAQVLKEEVRNRILEAAEKVFYKKDYRGAKLTEIAKEANIPVALIYTYFKNKEVLFDAVV SSVYINFESAFNEEESLEKGSASERFDEVGENYIHELLKERKKLIILMDKSSGTKHTEAK QKLISQMQIHIEVSLKRQSEKEYDPMLAHILASNFTEGLLEIARHYQSEKWAKDMLKLIA RCYYKGVESL >gi|296153520|gb|ADVK01000055.1| GENE 134 119641 - 121380 184 579 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 340 549 34 249 329 75 26 2e-12 MLEKELRKKVIGKNGLSNSLLALKIVFDLIPQILLVYLISSLITNNISEDNLKHIFLGIF ISFALKGVFYYFATKVAHEKAYEKLTELRLDIIGHLKKLSLGFFKEHNTGELTNIVQHDV EQVEVYLAHGLPEIMSVTLLPTIIFVAMIFVDWRLALGMIAGVPLMYLVKVLSQKTMDKN FSIYFNHENKMREELMEYVKNISVIKAFAKEEEISERTLKTAREYIYWVKKSMGAITIPM GLIDIFMEIGVVIVMILGSIFLYYGNITTPNFILAIILSSAFTASISKTATLQHFSIVFR EALKAIGKVLTVPLPNKKTEQGLEFGNIEFKDVNFAYGKDCFELKNINLTFKKNSLNAFV GASGCGKSTVSNLLMGFWDADEGQILINGKDIKEYSQENISMLIGSVQQEVILFDLSIFE NIVIGKLNATKEEVIEAAKKARCHDFISALPNGYETRIGEMGVKLSGGEKQRISIARMIL KNAPILILDEAMAAVDSENERLIGEAIDDLSKDKTIITIAHHLNTIRDSDQIIVMDKGVV LDAGSHEELMKRCDFYKDMVEAQNKVDRWNLKEVVTENV >gi|296153520|gb|ADVK01000055.1| GENE 135 121373 - 123082 190 569 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163765018|ref|ZP_02172066.1| ribosomal protein L9 [Bacillus selenitireducens MLS10] # 347 545 38 242 329 77 29 3e-13 MFREMLKLLTKTGKRDLIISSVFFALYGLSSIAMIVIVFSILFQIFDGTSLDMLYKYFIA IGLLVVFKGICNMVADMKKHSAGFDIVQQIRERMIIKLKKFSLGFYTNERLGEINTILHK DVDNMSLVVGHMWSRMFGDFFIGAVVFVGLANIDIKLALIMAVSVPIALIFLYLTIKQSE KIENQNNSALLDMVSLFVEYVRGIPVLKSFSNNKSLDNELMNKTKKFGETSKAASRFKAK QLSIFGFLLDIGYLVLLITGVILVIKGSLDVLNFIIFAVISKEFYKPFASMEQHYMYYVS VVDSYERLSRILYADVIPDKVDGIVPKDNDIAFENIGFSYEKDEFKMENLSFDIDEKTMT ALVGESGSGKTTITNLLLRFYDVQQGKITLGGVDIRDIPYDELLDRISIVMQNVQLFDNT IEENIRVSKKGATKEEIIEAAKKAKIHDFIMSLPKGYETDIGENGGILSGGQRQRISIAR AFLKDAPILILDEMTSNVDPINESLIQDAITELAKNRTVLVVAHHLKTIQKADQILVFQK 
GNLLQKGKHGELLDKNGYYTKLWKAQYEV >gi|296153520|gb|ADVK01000055.1| GENE 136 123058 - 123294 202 78 aa, chain + ## HITS:1 COG:MA4341 KEGG:ns NR:ns ## COG: MA4341 COG1122 # Protein_GI_number: 20093129 # Func_class: P Inorganic ion transport and metabolism # Function: ABC-type cobalt transport system, ATPase component # Organism: Methanosarcina acetivorans str.C2A # 10 61 1 52 274 62 57.0 3e-10 MESSVRGVVMILLKDVSYKWEDGRTALKNINLEIKKGEFVLISGKSGSDKSTLGSVMNGL IPHYCKGKLQGEAFASKI >gi|296153520|gb|ADVK01000055.1| GENE 137 123553 - 125532 1283 659 aa, chain + ## HITS:1 COG:no KEGG:FN1714 NR:ns ## KEGG: FN1714 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 334 659 1 326 326 556 99.0 1e-156 MSLFDIFNKTNRKIIPLSLYLGYEQDDEDTEKNDKDMTFANSNDFPEELFDKIIIMTKRL HEMGVYEEKEIVRFIIPKILKALEAGEILSGVSISSFDTTGVVFGENYAGWLYKSETKKG NEILYKVPYQLIFYHSKKEDNYFICKTKNGLELRQNKVSITNSDDKEHIIKNTKSINLTK MGMKELKDSFSILFMYIKMNILLKMKKLGYLDFIEKIAEFFCSSMEKDEFISYAKQYFAL SIDKEKNPLAYVLLPFKDFLSELLIRNNCSTSFFTSYADIIKWIKERTKIKIDIQKEYHD HIEFIKDVSNILISKEYYLYFLIKEDFFHKTSLLIGKQKGIFENNSNFSYEITSLSDDLI EHYIEDISRERKIRELVKRFFQIINIVDIQSMMMNIRGNRVLDYALVKNYYLSVVYSDED MNIYNVLGKLLSKYAMPDLIKNTTAFDATLDNLYFLEFESVNPVDRAILYADLIRQNKKL LYSFEVGDYFYLGIIEDNIEIKDNFIKYVNNLLEMKEIENYVIYDSKNKEMFVNLLSGID LLVESEFSYTQKDIREKFYKENSENISIEDADKYLSRIVLEKSEIKICVSNYYEDEEELE FTIKAENRKYFTNQDLIFQIQRYLAEKIDFTKIFSSKICISGLRLSIDRDKYYLFWNCV >gi|296153520|gb|ADVK01000055.1| GENE 138 125616 - 126908 1287 430 aa, chain + ## HITS:1 COG:FN1715 KEGG:ns NR:ns ## COG: FN1715 COG1373 # Protein_GI_number: 19705036 # Func_class: R General function prediction only # Function: Predicted ATPase (AAA+ superfamily) # Organism: Fusobacterium nucleatum # 1 430 1 430 430 692 93.0 0 MIIERKQYLNELIKKKDNGRIKIITGIRRCGKSYLLFKLYKDYLLGNGIKKEQIIEMALD EIDNIKYRNPFELNDYIKNKITKNKKYYIFIDEIQFSKSVKNPYINNSDEEITFVDTLLG LMKNENLDIYVTGSNSKMLSKDILTQFRDRGDEIHVYPLSFAELYNTYKDKNLAWRDYVV FGGMPYILSLENFEEKSTYLKNLFEETYIKDIIERNKIQNSSDILDILLNFISSAIGSLT 
NPLKLSNRFLSENKISISHNTISKYLSYFEESYIVYSAKRYDIKRAKYFTTPLKYYFADI GLRNARLNFRQVEETHIMENIIYNDLIRRGYNVDVGVVEYTQTKENKNRKIQLEVDFVVN RGNIRYYIQSALTLESDEKREQELNSLKRIDDSFKKIVIVKDDIIPRYDEQGIYYIGVKD FLLTDNIFEN >gi|296153520|gb|ADVK01000055.1| GENE 139 127051 - 127620 578 189 aa, chain + ## HITS:1 COG:no KEGG:FN1716 NR:ns ## KEGG: FN1716 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 189 1 189 189 327 93.0 1e-88 MKNFKIILILISLLFFTACSSVKTIPKYEKKENVTWKEVEPPIIVLDLEPGDIIVKEKTL NPIGMFGHVAVMKNDRIIVDYPKLGNKSYTIDVDYWLEKGRDILVLRYKDMNDEFKKKLV KNMEKYFGKNYKISSDRMNTDGFYCSQYVWYIYYITAQEMGFELDLDSDGGNFVLPYDFI NSPYLEIVN >gi|296153520|gb|ADVK01000055.1| GENE 140 127725 - 129815 2739 696 aa, chain + ## HITS:1 COG:FN1717 KEGG:ns NR:ns ## COG: FN1717 COG0272 # Protein_GI_number: 19705038 # Func_class: L Replication, recombination and repair # Function: NAD-dependent DNA ligase (contains BRCT domain type II) # Organism: Fusobacterium nucleatum # 1 696 1 696 696 1205 97.0 0 MEIKKRIEELKNNQTGLTFYSSQELNDLEKIVKLREDLNKYRDSYYNDNKSLISDYEFDI LLKELESLEEKYPQYKEISSPTTSVGASLKENKFKKVEHLHQMLSLANSYNIGEIVEFIE RIKKKIPKEQELKYCLEVKLDGLSISLTYRQGKLVRAVTRGDGFIGEDVTENILEIASIV KTLPQAIDMEIRGEVVLPLASFEKLNNERLEKGEELFANPRNAASGTLRQLDSKIVKDRG LDAYFYFLVEADKLGLKSHSESIKFLESMGIKTTGIFELLETSKDIEKRIDYWEKERENL PYETDGLVIKVDEINLWDEIGYTSKTPRWAIAYKFPAHQVSTVLNDVTWQVGRTGKLTPV AELEEVELSGSKVKRASLHNISEIQRKDIRIGDRVFIEKAAEIIPQVVKAIKEERTGNEK IIEEPVCCPVCNHKLEREEGLVDIKCINEECPAKVQGEIEYFVSRDALNIMGLGSKIVEK FIDLGYIKTVVDIFDLKNHREALENIDKMGKRSIENLLNSIEESKNRDYDKVIYALGIPF IGKVASKILAKASKNIDKLMTMTFEELTSIEGIGEIAANEIIAFFTKEKNQKIIQGLKEK GLKFEIKESEASVQNVNPNFVGKNFLFTGTLKHFTREQIKEEIEKLGGKNLSSVSKNLDY LIVGEKAGSKLKKVQEIPTIKILTEEEFIELKDKFD >gi|296153520|gb|ADVK01000055.1| GENE 141 129874 - 132483 3789 869 aa, chain + ## HITS:1 COG:FN1718 KEGG:ns NR:ns ## COG: FN1718 COG0653 # Protein_GI_number: 19705039 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecA 
(ATPase, RNA helicase) # Organism: Fusobacterium nucleatum # 1 869 1 869 869 1629 98.0 0 MIGGLLKKIFGTKNDREIKALTKEVEKINALESEYEKLSDEDLKNKTNIFKERLKNGETL DDILVEAFATVREASKRVLGLRHYDVQLIGGMVLHQGKITEMKTGEGKTLVATCPVYLNA LAGHGIHVITVNDYLAKRDRDQMSRLYGFLGLSSGVILNGLPTDQRKKSYNSDITYGTNS EFGFDYLRDNMVSSLDQKVQRELNFCIVDEVDSILIDEARTPLIISGAAEDKIKWYQISF QVVSMLNRSYETEKIKNIKEKKAMNIPDEKWGDYEVDEKSRVIVFTEKGVKRVEEILKID NLYAPEYVELTHFLNQALKAKELFKRDRDYLVRDNGEVVIIDEFTGRAMEGRRYSDGLHQ AIEAKEGVKIASENQTLATITLQNYFRMYKKLSGMTGTAETEATEFMHTYGLEVVVIPTN LPVIRKDDADLVYKTKKEKINSIIDRIQGLYEKGQPVLIGTISIKSSEELSELLKKRKIS HNVLNAKYHAKEAEIVAQAGRYKAVTIATNMAGRGTDIMLGGNPEFMALAEVGSRDDEKF SEVFSKYQEQCAIEKEQVLALGGLFILGTERHESRRIDNQLRGRSGRQGDPGESEFYLSL EDDLMRLFGSERVMIWMDRLKLPEGEPITHRMINSAIEKAQKKIEARNFGIRKNLLEFDD VMNKQRTAIYESRNEALAIDNLKDRILGMLQRNITEKVYEKFAPEMREDWDIDGLNEYLK DFYVYEERDDKAYLRSTKEEYIERIYNALVEQYNNKEAELGSDLMRKLEKHILFDVVDNR WRGHLKSLDALRESIYLRAYGQRDPVTEYKLISSQIFEEMIATIQEQATSFLFKVVVNTE PVKDEKNEIEADGLCPCGSGKPYEKCCGR >gi|296153520|gb|ADVK01000055.1| GENE 142 132496 - 133242 919 248 aa, chain + ## HITS:1 COG:no KEGG:FN1719 NR:ns ## KEGG: FN1719 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 10 248 1 239 239 435 99.0 1e-121 MKKLLFLLVMTFTLISCSSTTVTKKGLVEKYSLNKESAHNWETTMSKVMVAEATNPDWYG EENPLVNFRKQGKISEKEYYFLDYLGKTPANEISDDDFDRFTKILTSYVKKLPRKFIIEV TNIKDPKGLVDYMVKQATSPQLDNPSKYIKEVVADKEEWAQIEAFSKQSDLTSKDVKKLR KLLADFVKRSNFYNEQVWYQVEVSDRMIQLANLAKKTEKTKLELNNVNARALYLAYPQFL SKVDRWGR >gi|296153520|gb|ADVK01000055.1| GENE 143 133307 - 134005 760 232 aa, chain - ## HITS:1 COG:FN1722 KEGG:ns NR:ns ## COG: FN1722 COG0357 # Protein_GI_number: 19705043 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division # Organism: Fusobacterium nucleatum # 1 232 1 232 232 378 99.0 1e-105 MKDYFKEGLEKIKVSYDENKMEKSLKYLEILLDYNSHTNLTAIREEKAIIEKHFLDSLLL 
QNLLKDEDKTLIDIGTGAGFPGMILAIFNEDKKFTLLDSVRKKTDFLELVKNELALNNVE VINGRAEEIIKDRREKYDVGLCRGVSNLSVILEYEIPFLKVNGRFLPQKMVGTDEIKNSS NALKILNSKILKEYEFKLPFSNEDRLVIEILKTKKTDEKYPRKIGIPLKKPL >gi|296153520|gb|ADVK01000055.1| GENE 144 134007 - 135908 2385 633 aa, chain - ## HITS:1 COG:FN1723 KEGG:ns NR:ns ## COG: FN1723 COG0445 # Protein_GI_number: 19705044 # Func_class: D Cell cycle control, cell division, chromosome partitioning # Function: NAD/FAD-utilizing enzyme apparently involved in cell division # Organism: Fusobacterium nucleatum # 1 633 1 633 633 1191 99.0 0 MQEFDIIVVGGGHAGCEAALASARMGMKTAIFTISLDTIGVMSCNPSLGGPAKSHLAREI DALGGEMGRNIDKTFIQIRVLNTRKGPAVRSLRAQADKMAYANEMKKTLEHTDNLSVIQG MVSELVVEEENGKKIIKGIKIREGLEYRAKAVIIATGTFLRGLIHIGEINFSAGRMGELS SEELPLSLEKIGLKLERFKTGTPTRIDGRTIDYSVLEEQPGDKSQVLKFSNRTKDEDALS RRQISCYIAHTNEKVHEIIRNAKERSPLFNGTIQGLGPRYCPSIEDKIFRYPDKNQHHLF LEREGYETNEIYLGGLSSSLPVDVQEEMLKNIKGFENAKIMRYAYAIEYDYVPPEEIKYT LESRTVENLFLAGQINGTSGYEEAGAQGLMAGINAVRKLRNEEPVILDRADSYIGTLIDD LVSKGTNEPYRMFTARSEYRLYLREDNADLRLTKLGYELGLVPEEEYQRVEKKKKDVKII TEILAKTNVGPSNPRVNEILLKRRENPIKDGSTLLELLRRPEVSFEDIKYISEEIRGIDL QGYDHDTTYQVEITVKYEGYINRALKMIEKHKSMENKKIPVDIDYDDLKTIPKEAKDKLK RIKPINIGQASRISGVSPADIQAILIYLKMRGN >gi|296153520|gb|ADVK01000055.1| GENE 145 135917 - 136573 864 218 aa, chain - ## HITS:1 COG:FN1724 KEGG:ns NR:ns ## COG: FN1724 COG0569 # Protein_GI_number: 19705045 # Func_class: P Inorganic ion transport and metabolism # Function: K+ transport systems, NAD-binding component # Organism: Fusobacterium nucleatum # 1 218 1 218 218 379 100.0 1e-105 MKQYLVIGLGRFGASVAKTLYSAGEIVLGVDTNEELVQDKIDNNILKNAIIGDASDEKLL KNIGAENFDVAFICMGDIEASVMIALNLKELGIKTIIAKAINKKHGKVLTKVGATEIVYP EEHMGKRIAELTMNTDIIEHLKFTDNFVLVEVKAPSIFWNNSLIKLDVRNKYNINIVGIK KAKGEFLPNPNANVIIEEGDILVIITDKKTVESFNKLI >gi|296153520|gb|ADVK01000055.1| GENE 146 136588 - 137934 1352 448 aa, chain - ## HITS:1 COG:FN1725 KEGG:ns NR:ns ## COG: FN1725 COG0168 # Protein_GI_number: 19705046 # Func_class: P Inorganic ion transport and metabolism # 
Function: Trk-type K+ transport systems, membrane components # Organism: Fusobacterium nucleatum # 1 448 1 448 448 690 99.0 0 MRKLSLLKKWDSLSPYRKLIFGFLVAIFIGVILLKMPFSLRENQNITVLDSLFTIVSAIC VTGLSVVDISQVFTSTGQLIILFFIQLGGLGVMTVSIIVFLLVGKKMSFETRELLKEERN SNSNGGITNFIKNLLLTVSLIEILGASILAYGFSRYYPLKKSIFYGLFHSVSAFCNAGFS LFTNNLEDFRYDKLISLTVSFLIILGGIGFVTVNSLFIIKKKKLKNLSLTSKFALLITFF LLTFGTMLFLVFEYNNSSTLKGMNFVDKIINSFFQSVTLRTAGFNTVPLENIRPATVFIS YIFMFIGASPGSTGGGIKTTTFGILIFYAFGVLKRKEYVEVFKRRIDWELINKALAIVII SLFYIIVVITILLSIESFSADKVIYEVISAFSTTGLSMGITASLGIISKFLIIITMFIGR LGPMTVALAFTNNKKSLVKYPKEDILIG >gi|296153520|gb|ADVK01000055.1| GENE 147 137949 - 139301 1544 450 aa, chain - ## HITS:1 COG:FN1726 KEGG:ns NR:ns ## COG: FN1726 COG0534 # Protein_GI_number: 19705047 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 450 8 457 457 764 99.0 0 METESITKLLIKFSIPAIVGMFVNALYNVVDRIYIGNIKDIGHLGITGIGVVFPVVILIF AFSLLIGIGSAAAVSLKLGMKDREEAERFLGVAVFLSFIVSVVLMIIIYFNMDKIIYLIG GSNETFIYAKDYLFYINLGVPAAILGLVLNSVIRSDGSPKIAMGTLLVGAITNIVLDPIF IFGFGMGVKGAAIATIISQYVSMLWTIYYFTSNKSKIKLVKKDIKFNFYKAKEICLLGSS AFAIQLGFSLVTYILNTVLKKYGGDTSIGAMAIVQSFMTFMAMPIFGINQGIQPILGYNY GAEKYKRVKEALYKGIFAATIICIIGYTSVRLFSDTLIKIFTTKPELQEITKYGLKAYTM VFPIVGFQIVSSIYFQAVGKPRMSFFISLSRQIIVMIPCLIILPIFFGLNGIWYAAPTAD SIATLITYILVRKEIKKLDKLEEMLELRNI >gi|296153520|gb|ADVK01000055.1| GENE 148 139325 - 140890 1829 521 aa, chain - ## HITS:1 COG:FN1727 KEGG:ns NR:ns ## COG: FN1727 COG0038 # Protein_GI_number: 19705048 # Func_class: P Inorganic ion transport and metabolism # Function: Chloride channel protein EriC # Organism: Fusobacterium nucleatum # 1 521 1 521 521 954 99.0 0 MNSAKDTVEKLYKGNGKLYFACLLVGLITGTIVSCYRWALEEIGIFRKLYFSDINLNNPI SLLKMWLIFIAVGLIVNYLFKKFPKTSGSGIPQVKGLILGRINYNNWFFELLAKFFAGVL GIGAGLSLGREGPSVQLGSYVGYGTSKLLKTDTVERNYLLTSGSSAGLSGAFGAPLAGVM FSIEEIHKYLSGKLLICAFVASIGADFVGRRFFGVQTSFNIPIEYPLNINPYFQFVLYIV FGVIIAFFGKLFTVTLVKCQDIFNGVKLAREIKVSFIMTISFILCFVLPEVTGGGHNLVE 
SLIHGKIVIYTLIIIFVIKLLFTAISYSTGFAGGIFLPMLVLGAIIGKIFGETVDIFAQT GADFTVHWIVLGMAAYFVAVVRAPITGVILILEMTGSFHLLLALTTVAVVSFYVTELLGQ QPVYEILYDRMKKDDNVVDEENQGKITIELAVMAESLLDGKAISEIIWPEEVLIIAIIRN GVEKIPKGRTVMMAGDILVLLLPEKIVPEVKEKLMKHTSVE >gi|296153520|gb|ADVK01000055.1| GENE 149 140984 - 141628 730 214 aa, chain - ## HITS:1 COG:FN1728 KEGG:ns NR:ns ## COG: FN1728 COG2039 # Protein_GI_number: 19705049 # Func_class: O Posttranslational modification, protein turnover, chaperones # Function: Pyrrolidone-carboxylate peptidase (N-terminal pyroglutamyl peptidase) # Organism: Fusobacterium nucleatum # 1 214 1 214 214 384 97.0 1e-107 MKKILVTGFDPFGGEKINPALEVIKLLPKKIGDNEIKILEIPTVYKKSIEKIDKEIKNYN PDYILSIGQAGGRTDISIERVAINIDDFRIKDNEGNQPIDEKIYLDGDNAYFSTLPIKAI QSEITKNGIPASISNTAGTFVCNHVFYGVRYLIEKKYKGKKSGFIHIPYLPEQIIGKADT PSMSLDNILKGITIAIEIIFSVENDIKKLGGSIC >gi|296153520|gb|ADVK01000055.1| GENE 150 141600 - 142349 600 249 aa, chain - ## HITS:1 COG:FN1729 KEGG:ns NR:ns ## COG: FN1729 COG0115 # Protein_GI_number: 19705050 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase # Organism: Fusobacterium nucleatum # 1 249 1 249 249 426 96.0 1e-119 MLIELDDGYSFGLGLFETILLYKGKPVFLDEHLARINKSIVDLGLNIDKLEKDEVFQYLN NNKNTLEYEVLKIVLSEKNRLFLKREYTYTEKDYQRAFSLNISEVIRNESSIFTFHKTLN YGDNILEKRKSKKMGYDEPIFLNSKNQITEGATSNIFVVVGDKIYTPKLSCGLLNGIVRQ YIISNYDVIESEIDLEFLNNADEIFLTNSLFGIMPVNNLEKKVFKSQKISKEILNKYRRD YEKNSCYRF >gi|296153520|gb|ADVK01000055.1| GENE 151 142343 - 143695 1575 450 aa, chain - ## HITS:1 COG:FN1730 KEGG:ns NR:ns ## COG: FN1730 COG0147 # Protein_GI_number: 19705051 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Anthranilate/para-aminobenzoate synthases component I # Organism: Fusobacterium nucleatum # 1 450 1 453 453 777 98.0 0 MQIEVKKLEKYIDIYDIFRVLMSQDNFKENKLSFLDSSLKNKYGKYSIIGINSYLELKEK 
DNKFYINDKLSDENFEEYLDRFLKENKQENKYDLPLISGGIAYFSYDYGRKFENIKTRHK KDVDIPEAVIRFYKTYIIEDIEKQEIYISYQDKKDFDNLINILENTKIEEENLIKNNNLA NFKSNFEKDEYLKAIKNTIDYIIEGDIYIMNLTQRLMIESKKAPLQVFSYLRKFNPAPFG AYLDFDNFEVVSASPERFIKMKDRLIETRPIKGTRKRGATAEEDLALKNELANSEKDKSE LLMIVDLERNDLNRICELKSVVVDELFEVETYSTVFHLVSTIRGKLKEEYSFVDLIKATF PGGSITGAPKIRAMEIIDELENSRRDLYTGSIGYISFNGDCDLNIVIRTAIHKEGKYYLG VGGGITCESELDFEYEETLQKAKAILEAIC >gi|296153520|gb|ADVK01000055.1| GENE 152 143679 - 144290 771 203 aa, chain - ## HITS:1 COG:FN1731 KEGG:ns NR:ns ## COG: FN1731 COG0512 # Protein_GI_number: 19705052 # Func_class: E Amino acid transport and metabolism; H Coenzyme transport and metabolism # Function: Anthranilate/para-aminobenzoate synthases component II # Organism: Fusobacterium nucleatum # 1 203 1 203 203 399 99.0 1e-111 MFLMIDNYDSFVYNLVSYFLEENIEMKIIRNDLVDLKYIENLIEKGKLEGIIISPGPKSP KDCGLCNEIVKQFYKKVAIFGVCLGHQIIGHVFGAEVKKGKSPVHGKVHKIRNSGENIFK SLPKEFNVTRYHSLVVEKEKLLNDFNIEAETDDGVLMALSNKNYPLHSVQFHPEAVLTEY GHEMIRNFLDLAKEWRDKNANRS >gi|296153520|gb|ADVK01000055.1| GENE 153 144485 - 145276 1043 263 aa, chain - ## HITS:1 COG:FN1732 KEGG:ns NR:ns ## COG: FN1732 COG0253 # Protein_GI_number: 19705053 # Func_class: E Amino acid transport and metabolism # Function: Diaminopimelate epimerase # Organism: Fusobacterium nucleatum # 1 263 1 263 265 470 92.0 1e-132 MDGKVQVLDFIKINPAGNITILIDNFDIYDKNIPKLSEEIMKETNLYAEQVGFIKEKHLQ MMGGEFCGNASRSFASLLAFRDKDFSEQKNYSITCSGESEVLDVDVRTDGAKNKFLAKIK MPKFISLEEISIDEYKLGLVRFSGISHFIFNIKENKETSFENIIDLVKKYLSNEDYSAFG IMFFDKDNLSMKPYVYVKELESGIYENSCASGTTALGYYLKKYKNLDRAKVVQPNGWLEY IIENDEMYIDGPVEIVAEGKVYI >gi|296153520|gb|ADVK01000055.1| GENE 154 145434 - 146069 743 211 aa, chain - ## HITS:1 COG:FN1733 KEGG:ns NR:ns ## COG: FN1733 COG1394 # Protein_GI_number: 19705054 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit D # Organism: Fusobacterium nucleatum # 1 211 1 211 211 319 98.0 3e-87 MAKLKVNPTRMALSELKKRLVTAKRGHKLLKDKQDELMRQFINLIKENKKLRVEVEKELS 
DSFKSFLLASATMSPLFLESAISFPKEKIAVEMNLKNIMSVNVPEMKFVKDEMEGSIFPY GFVQTSAELDDTVIKLQKVLDNLLSLAEIEKSCQLMADEIEKTRRRVNALEYSTIPNLEE TVKDIRMKLDENERATITRLMKVKQMLQKDA >gi|296153520|gb|ADVK01000055.1| GENE 155 146080 - 147456 2183 458 aa, chain - ## HITS:1 COG:FN1734 KEGG:ns NR:ns ## COG: FN1734 COG1156 # Protein_GI_number: 19705055 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit B # Organism: Fusobacterium nucleatum # 1 458 1 458 458 887 98.0 0 MLKEYKSVQEVVGPLMIVEGVEGIKYEELVEIQTQTGEKRRGRVLEIDGDRAMIQLFEGS AGINLKDTTVRFLGKPLELGVSEDMIGRIFDGLGNPIDKGPKIIPEKRVDINGSPINPVS RDYPSEFIQTGISTIDGLNTLVRGQKLPIFSGSGLPHNNVAAQIARQAKVLGDDAKFAVV FAAMGITFEEAQFFIDDFTKTGAIDRAVLFINLANDPAIERISTPRMALTCAEYLAFEKG MHVLVILTDLTNYAEALREVSAARKEVPGRRGYPGYLYTDLSQIYERAGKIKGKPGSITQ IPILTMPEDDITHPIPDLTGYITEGQIILSRELYKSGIQPPIFVIPSLSRLKDKGIGKGK TREDHADTMNQIYAGYASGREARELAVILGDSALSDADKAFAKFAEDFDKEYVNQGYETS RSIQETLDLGWKLLKVIPRAELKRIRTEYLDKYLADKD >gi|296153520|gb|ADVK01000055.1| GENE 156 147449 - 149218 2520 589 aa, chain - ## HITS:1 COG:SPy0154 KEGG:ns NR:ns ## COG: SPy0154 COG1155 # Protein_GI_number: 15674362 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit A # Organism: Streptococcus pyogenes M1 GAS # 1 585 1 590 591 748 62.0 0 MKEGRIIKVSGPLVVAEGMEEANVYDVVEVSENKLIGEIIEMRGDKASIQVYEETTGIGP GDVVVTTGSPLSIELGPGMLEQMFDGIQRPLLKIQEAVGDFLLKGVSVPALDREKKWQFT PTMQVGEEVEPGKVIGTVQETEIVLHKIMVPNGVYGKIIDIKEGEFTVDKTICSIETENG VRELNMIQKWPVRKGRPYLRKLNPEKPLITGQRIIDTFFAVTKGGTAAIPGPFGSGKTVI QHQLAKWADAEVVVYVGCGERGNEMTDVLMEFPEIIDPKTGQSLMKRTVLIANTSNMPVA AREASIYTGITIAEYFRDMGYSVALMADSTSRWAEALREMSGRLEEMPGDEGYPAYLASR IAEFYERAGLVECLGNGEEGALTVIGAVSPPGGDISEPVSQSTLRIAKVFWGLDYALSYR RHFPAINWLNSYSLYQAKMDKYKKEEIDTNFPKFRIEAMALLQEEAKLQEIVRLVGRDSL SELDQLKLEITKSLREDFLQQNAFHEVDTYCSLPKQFKMLKLILSFYDEAQRGIKEGVYL DEILALPAREKITRAKNISEEELDSFDKIEEEIKEVISKLIAEGGKPNA >gi|296153520|gb|ADVK01000055.1| GENE 157 149236 - 149544 455 102 aa, chain - ## HITS:1 COG:FN1737 
KEGG:ns NR:ns ## COG: FN1737 COG1436 # Protein_GI_number: 19705058 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit F # Organism: Fusobacterium nucleatum # 1 102 4 105 105 171 100.0 3e-43 MYKIAVIGDKDSVLAFKILGVDVFITLDAQEARKTIDRIAKENYGIIFVTEQLAKDIPET IKRYNSEIIPAVILIPSNKGSLNIGLTNIDKNVEKAIGSKIM >gi|296153520|gb|ADVK01000055.1| GENE 158 149537 - 150538 1339 333 aa, chain - ## HITS:1 COG:FN1738 KEGG:ns NR:ns ## COG: FN1738 COG1527 # Protein_GI_number: 19705059 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit C # Organism: Fusobacterium nucleatum # 1 333 2 334 334 566 99.0 1e-161 MDRELFIQPSVRIRNFEKKLLTKIQFERLYEADNLQDSIRHLNETVYSEDLAKIDREENF ELALLNSLNKTYSEILSISPIKELVDVLTYKFAFHNIKVVVKEKILQENFEHIYSKVHYE DLDNLRKQFETEKGEKETWYEDTVIQAYKIFEKTKDPEKIEVFIDKKYFEKVLEISKTFE LDLIEEYFRTMIDFINIRTFIRCKRQEQEIVVLKEALIQGGYIDTDDILTYFYKEIQDLV NAHKNSKIGKSLFLGLKGYNETGRLLLFEKHMENYLTNLLKERVKRMPYGPEIIFAYVHA KEIEIKNLRIALVGRANGLSADFIKERLRETYV >gi|296153520|gb|ADVK01000055.1| GENE 159 150548 - 151099 640 183 aa, chain - ## HITS:1 COG:FN1739 KEGG:ns NR:ns ## COG: FN1739 COG1390 # Protein_GI_number: 19705060 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit E # Organism: Fusobacterium nucleatum # 1 183 1 183 183 258 98.0 5e-69 MSSLDNLVAEILQQAEKEANRILAKVKAENLEFTENENKKIQKEIENIQWKINEEAISLK ERIISNANLKSRDMVLQAKEELVDKVLKMTLERLKNLDSDSYLDFVENALKTLNISKNAE IILTKKMKDVLGKEIFGYKVSDDIVESGCNIKDGNVIYNNEFSSLLEFNKEDLEREILKK IFG >gi|296153520|gb|ADVK01000055.1| GENE 160 151115 - 151597 891 160 aa, chain - ## HITS:1 COG:FN1740 KEGG:ns NR:ns ## COG: FN1740 COG0636 # Protein_GI_number: 19705061 # Func_class: C Energy production and conversion # Function: F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K # Organism: Fusobacterium nucleatum # 1 160 1 160 160 236 100.0 2e-62 MENIMTMFWQNGGVVFGVLGAVIAVLLSGIGSAKGVGIAGQAAAGLIIDEPEKFGKAMVL 
QLLPGTQGLYGFVIGLLIMFRLTSQMTMPEGLYLLMAGLPVGLVGLKSALYQGQVAVAGI NILAKNEAHQTKGIVLAVMVETYAVLAFVMSLLLLNQVQF >gi|296153520|gb|ADVK01000055.1| GENE 161 151641 - 151748 56 35 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MATCQPLVFQELQNALSILMDVTVAYLNKYNSLIF >gi|296153520|gb|ADVK01000055.1| GENE 162 151758 - 153674 1918 638 aa, chain - ## HITS:1 COG:FN1741 KEGG:ns NR:ns ## COG: FN1741 COG1269 # Protein_GI_number: 19705062 # Func_class: C Energy production and conversion # Function: Archaeal/vacuolar-type H+-ATPase subunit I # Organism: Fusobacterium nucleatum # 1 638 1 638 638 1133 99.0 0 MAIVKMKKFKLFALEKDRKPLLKELQKFDYVHFIKTSNEENEDLKEVQIPENINLLKEKS QKVKWMINYLLKLFPKEAKEEISNNLTEHLLFVQIEQQADKYDFNKDYETLDRISKEIET NKEEIINLEIRKKEIDSWRNIKEPIENLKAFKTAKILLGTVPKRSFEILKDSIRNFDKTY VEEISQDSTMVNLMVLGSKLEEKELRNQLKIHSFTELNFDFKGTFEEEFEKIKLREEEIK KANNKLKNTAEKLLKILSKLEVQNNYLDNLLLRENIVSNFKKTDTVDIVEGYIPADMEYE FKKLITRISSRNNYLEISDVDKDNPEVPILLKNSGVTGLFESITQMYALPRYNEIDPTPI LSIFYWVFFGMMVADFAYGLILCLASGIALMVGNFNETTRKFLKFFFALSFSTMIWGLLY GSAFGDLIKLPTQVLDSSKDFMSILILSIIFGAAHLIMGLAIKAYVLIKNGHFMDAVYDV FLWYLTLTSLILLILAGRFEFSEFTKNILIVCAVVGMLGIVAFGARDAETLMGRIGGGIY SLYGVTSYIGDFVSYLRLMALGLAGGFIAGAINIIVRMLVSGGIFGIILGIVIFAFGQVF NIFLSVLSAYVHTSRLMYVEFFSKFYEGGGKAFKKFRV >gi|296153520|gb|ADVK01000055.1| GENE 163 153661 - 153987 504 108 aa, chain - ## HITS:1 COG:no KEGG:FN1742 NR:ns ## KEGG: FN1742 # Name: not_defined # Def: V-type sodium ATP synthase subunit G (EC:3.6.3.15) # Organism: F.nucleatum # Pathway: Oxidative phosphorylation [PATH:fnu00190]; Methane metabolism [PATH:fnu00680]; Metabolic pathways [PATH:fnu01100] # 1 108 1 108 108 114 100.0 1e-24 MATDAILKVKDAELKAKEILEKAYKDVSILKEEVKEKVKKSYEEAIKNAEKEAEELKLKY KNEGEAIAMPIFESAQRKVSSIKDIDEDKLKSVVDMIVERIVNSNGNS >gi|296153520|gb|ADVK01000055.1| GENE 164 154229 - 155035 862 268 aa, chain - ## HITS:1 COG:FN1743 KEGG:ns NR:ns ## COG: FN1743 COG0789 # Protein_GI_number: 19705064 # Func_class: K Transcription # Function: Predicted transcriptional regulators # Organism: 
Fusobacterium nucleatum # 1 268 1 268 268 465 100.0 1e-131 MKELYSIGEVSEIMGVSVQTLRYYSNINLILPKYVNPSTGYRYYSVDQFHFIDRIKYLQK LGLCLKEIKEILSENNISTLIKYLDKYKNYIEEEINKLKDTIDSIEWYKNYFTYIGENVD DHSYILHFEKRYVVAVKILENEPKEDFHIRLNELRNNEKYKNLKYMRQFLYIADYDALIE GNLKPYYLGMFIKESPNILSENIIEIPAGNYLCFKARILSETWNPYFAKLFFHGKEKPTI VLANEYENNLHEYLSSVYELQILILEKN >gi|296153520|gb|ADVK01000055.1| GENE 165 155236 - 156117 711 293 aa, chain - ## HITS:1 COG:FN1744 KEGG:ns NR:ns ## COG: FN1744 COG0697 # Protein_GI_number: 19705065 # Func_class: G Carbohydrate transport and metabolism; E Amino acid transport and metabolism; R General function prediction only # Function: Permeases of the drug/metabolite transporter (DMT) superfamily # Organism: Fusobacterium nucleatum # 1 293 1 293 293 442 100.0 1e-124 MKKISYLFIIAAAILWGSIGLFSKIAGNKGFTPIDICFIRSLFSVAILGIFFSIKDRNIF KLESVADMKYFIGTGIISFSLYNWSYIAAIKETSMGVAAILLYTAPSIIMILSVFLFHEK ITRIKILVIIITFIGCMMVTGIFEGDNIISWKGFLYGVLSGIGYGLYSIFGKYALKKYSS VTVVFYTFLMSTFLFSVIGKPAIVISKINKSHSWIFIISFALFSAVIPYIFYTKGLSKIE ASKASIIANIEPVIAAVIGVCIFSEKINFLKILGIILVLGAVCIINMTDKLEN >gi|296153520|gb|ADVK01000055.1| GENE 166 156150 - 157340 1795 396 aa, chain - ## HITS:1 COG:FN1745 KEGG:ns NR:ns ## COG: FN1745 COG0626 # Protein_GI_number: 19705066 # Func_class: E Amino acid transport and metabolism # Function: Cystathionine beta-lyases/cystathionine gamma-synthases # Organism: Fusobacterium nucleatum # 1 396 1 396 396 752 99.0 0 MEKEYAEATELLYKGRKVKGMDFNKPEAFPLFTTTAFTMNSLTEVKKAYAEKYTYIRTCN PNRDALADMVTFLEAGEKSLIFSSGMGAITTTLMTILKPGDHVVCNRNIYGETFDVFTKL MPKFGISADLVDFDDIENCKKAIKAETKLIYSEVFANPTLNIADIPTLADIAHKNGALLM IDNTFSTPIAIKPIKFGADIVINSLTKFMNGHSDAIAGSITSTTEIIDAIHPVRMLCGTP GDPHAAHAMMKSFATMDLRLKKQMSNAAKLAAALEENKYVSKVNHPSLKSFSQHELALKL FTSNDTMSGMMSFIVPEDFEKIDKFMLRLNFAHYAMTLGGVRTTLVHPVTSSHSHMPDEA RRAMGITPGLFRLSVGIEDVDDLIEDFNQALKVFGE >gi|296153520|gb|ADVK01000055.1| GENE 167 157588 - 158847 1598 419 aa, chain + ## HITS:1 COG:FN1746 KEGG:ns NR:ns ## COG: FN1746 COG0626 # Protein_GI_number: 19705067 # 
Func_class: E Amino acid transport and metabolism # Function: Cystathionine beta-lyases/cystathionine gamma-synthases # Organism: Fusobacterium nucleatum # 1 419 1 419 419 834 99.0 0 MAYFNIRFIILILLHGGRIMFKEFDETLSFETNILEAGAYFRLSTSNPEALPIHLTTAHN VEDLEDLQKRYDEKGFCYNRNRNPNRTALIELMNYVEGGEDSIGCSSGMAAISSSIIAHT KTGDHILSDKTLYGETLEIFTKILQKYGVETTFVDFTNIEEVKKNIKSNTVILYTETVSN PLIGVPNLKVLANIAHSNNAIFIVDNTFMTGALVQPLKFGADIVVNSLTKFANGHSDVVC GAATGKSELIKKIYELQVLLGTQSDPFSSWLTVRGMRTLELRIKKQCENASALALELEKS PYILKVNHPSLVNNPYHNLAHEQFGDYYGGMLSIELPEDLKKINKFMRTLKLAHYAMTLG GYRTSFAYPVMSSHSDMTRDERLAIGITDGLLRISVGIENIKDLINDFKNALEVAYGNK >gi|296153520|gb|ADVK01000055.1| GENE 168 158915 - 160429 648 504 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|145634045|ref|ZP_01789756.1| 50S ribosomal protein L21 [Haemophilus influenzae PittAA] # 2 462 3 443 456 254 33 3e-66 MLSNILNGINDFLWGKPFTYFVLFIGLYFTVRSGFFSIFHFKHILKNTFGSMFSKEANSK KAGAVTPFEAVCVAIGGCVGCGNIGGVASAIAVGGPGAVFWMWIWAFFGMTVKCVETTLG CYYRSKDETGRYFGGSTYFMEKGISREMGFTKFGIGLAIAFGIGFIAQFLGGSQAYTISE VLNQSFGFNMIAVTVAYSLVLFYVIWKGTPRVAAFASKAVPFMCTLFIIGGVALIVANYQ NVPHVVAMIFHDAFTGTAAVGGFVGSTVSQAISIGVARSINSNEAGQGSSPLIHGSANTI HPFRQGIWGSFEVFMDTIVVCSITALAVLCTGAWEEGYTGATLTIKAFEKVFGQFGSIYI GIMCALFGLTTTAGWYTYYIAVMRHGLRYKPILADKIELLFKFIFPLPNIIIVSSIVLTG NGPDLFWTIVNITLVAPVFTNLLGLFILRDKYFKLFKDYKARYMGIGEVDPNFFVFYEDN PEIKKAEDAVREKIKAIRDSVYQS >gi|296153520|gb|ADVK01000055.1| GENE 169 160505 - 160726 220 73 aa, chain + ## HITS:1 COG:FN1748 KEGG:ns NR:ns ## COG: FN1748 COG0674 # Protein_GI_number: 19705069 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Fusobacterium nucleatum # 1 73 3 75 75 132 100.0 1e-31 MKKVMQTMDGNQASAYAFTKVAGIYPITPSSPMAEYVDEWAAKGMKNIFDVPVKLVEMQS EGELSMVHWKLVL >gi|296153520|gb|ADVK01000055.1| GENE 170 160774 - 160872 153 32 aa, chain + ## HITS:1 COG:FN1421_1 KEGG:ns NR:ns ## COG: FN1421_1 COG0674 # Protein_GI_number: 19704753 # Func_class: 
C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit # Organism: Fusobacterium nucleatum # 1 32 99 130 412 63 93.0 7e-11 MYKIAGELLPGVIHVSAHSLPVQALSIFGDHQ >gi|296153520|gb|ADVK01000055.1| GENE 171 161058 - 161246 181 62 aa, chain + ## HITS:1 COG:FN1750 KEGG:ns NR:ns ## COG: FN1750 COG1013 # Protein_GI_number: 19705071 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit # Organism: Fusobacterium nucleatum # 1 62 1 62 62 107 100.0 6e-24 MDTEVYSNTGGQSSKARPTATVGKPLKKKDLAAICMSYGHIYVAQVSMGANQQQFLKAYT RS >gi|296153520|gb|ADVK01000055.1| GENE 172 161314 - 161610 332 98 aa, chain + ## HITS:1 COG:FN1421_3 KEGG:ns NR:ns ## COG: FN1421_3 COG1013 # Protein_GI_number: 19704753 # Func_class: C Energy production and conversion # Function: Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit # Organism: Fusobacterium nucleatum # 1 98 280 377 377 199 95.0 9e-52 MSKSQTEMKLATECGYWPIFRYNPLLEKEGKNPLQLDSKEPKWELYQDYLMGETRYMTLK KINPNETNDLFEKNMFDAQRRWRQYKRLASLDYSDEKR >gi|296153520|gb|ADVK01000055.1| GENE 173 161711 - 161818 110 35 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MYDVKLDEVLRFVYQIMLIKKFKLIINIILKIFWE >gi|296153520|gb|ADVK01000055.1| GENE 174 161788 - 162408 803 206 aa, chain - ## HITS:1 COG:FN1752 KEGG:ns NR:ns ## COG: FN1752 COG0352 # Protein_GI_number: 19705073 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine monophosphate synthase # Organism: Fusobacterium nucleatum # 1 206 1 206 206 365 99.0 1e-101 MIENKIKLNIITNRKLCENENLEKQIEKIFSAYKRKIILEDFEIVALTLREKDLDKNEYL KLVEKIYPICQKYRIDLILHQNYDLNLDEKYNIGGIHLSYEIFKSLNKNIREELIKKYKK IGVSIHSIDEAKEVEMLGATYIVAGHIFETDCKKDLEPRGLKFIQELSSTLTIPIFVIGG INQENSNLVINSGAFGVCMMSSLMRY >gi|296153520|gb|ADVK01000055.1| GENE 175 162411 - 163541 1166 376 aa, chain - ## HITS:1 COG:FN1753 KEGG:ns NR:ns 
## COG: FN1753 COG1060 # Protein_GI_number: 19705074 # Func_class: H Coenzyme transport and metabolism; R General function prediction only # Function: Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes # Organism: Fusobacterium nucleatum # 1 376 1 376 376 702 98.0 0 MELENINSNIMDKVISEMNTYDYNSFSNEDVEEALNKDYLSVKDFQALLSSKAINYLEEM AEKAKEFKERYFGNSVYIFTPLYISNYCDNYCVYCGFNSHNKIKRAKLDFEQIEQELKEI AKSGLQEILILTGESERYSNIEYIGEACKLARKYFSNVGIEIYPVNVEDYKYLNSCGADY VTIFQETYNNEKYKKLHLEGHKKVFSYRFNSQERALMGGMRGVAFGALLGLDDFRKDAFS TGYHAYLLQKKYPYAEISISCPRLRPVINNLKVEKEIVTERELFQIICAYRLFLPFANIT ISTRENSIFRDNIIKIAATKISAGVDTGIGAHTECSTKKGDEQFEIADKRTAAQIFEKVK SEDLQPVMNDYIYLKD >gi|296153520|gb|ADVK01000055.1| GENE 176 163541 - 164314 1135 257 aa, chain - ## HITS:1 COG:FN1754 KEGG:ns NR:ns ## COG: FN1754 COG2022 # Protein_GI_number: 19705075 # Func_class: H Coenzyme transport and metabolism # Function: Uncharacterized enzyme of thiazole biosynthesis # Organism: Fusobacterium nucleatum # 1 257 1 257 257 480 99.0 1e-136 MKDSFKLGNKEFNSRFILGSGKYSNELINSAINYAEAEIVTVAMRRAVSGVQENILDYIP KNITLLPNTSGARNAEEAVKIARLARECIQGDFIKIEVIKDSKYLLPDNYETIKATEILA KEGFIVMPYMYPDLNVARALRDAGASCIMPLAAPIGSNKGLITKEFIQILIDEIDLPIIV DAGIGKPSQACEAIEMGVTAIMANTAIATANDIPRMARAFKYAIQAGREAYLAKLGRVLE KGASASSPLTGFLNGVE >gi|296153520|gb|ADVK01000055.1| GENE 177 164315 - 164935 825 206 aa, chain - ## HITS:1 COG:FN1755 KEGG:ns NR:ns ## COG: FN1755 COG0476 # Protein_GI_number: 19705076 # Func_class: H Coenzyme transport and metabolism # Function: Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 # Organism: Fusobacterium nucleatum # 1 161 1 161 165 270 97.0 1e-72 MELKEEDLLKRNVKGISKKLKKTRVCILGLGGLGSNVVVLLARSGIGSLKLVDFDIVEAS NLNRQQYRISHIGLKKTEAMKSIIREINPFVEVDISDIKVDRENIYSIVGDIELVVEAFD RAETKAMTLEALLTNTNKIVVSASGMAGLGSANEIVTRKIKDNFYLIGDNYSDYEEYSGI MSTRVMICAAHQANMVLRLILEEKGE >gi|296153520|gb|ADVK01000055.1| GENE 178 164939 - 165145 289 68 aa, chain - ## HITS:1 COG:FN1756 KEGG:ns NR:ns ## COG: 
FN1756 COG2104 # Protein_GI_number: 19705077 # Func_class: H Coenzyme transport and metabolism # Function: Sulfur transfer protein involved in thiamine biosynthesis # Organism: Fusobacterium nucleatum # 5 68 1 64 64 100 100.0 5e-22 MGVEMAEINGKYEEINDVNLLDYLIKNKYRVDRIVVDYNGDIVKKSDFEKINIKNTDKIE IVCFVGGG >gi|296153520|gb|ADVK01000055.1| GENE 179 165146 - 166447 2228 433 aa, chain - ## HITS:1 COG:FN1757 KEGG:ns NR:ns ## COG: FN1757 COG0422 # Protein_GI_number: 19705078 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine biosynthesis protein ThiC # Organism: Fusobacterium nucleatum # 1 433 1 433 433 815 97.0 0 MYKTQMEAAKKGILTKEMKGIAESESMDEKVLMERVASGEIAIPANKKHSSLLAKGVGTG LSTKINVNLGISKDCPNVDKELEKVKVAIDMKADAIMDLSSFGKTEEFRKKLIAMSTAMV GTVPIYDAIGFYDKELKDIKAEEFLDVVRKHAEDGVDFVTIHAGLNREAVNLFKRNERIT NIVSRGGSLMYAWMELNNTENPFYENFDKLLDICEEYDMTISLGDALRPGCLNDATDACQ IKELITLGELTKRAWERNVQIIIEGPGHMAIDEIEANVKLEKKLCHNAPFYVLGPLVTDI APGYDHITSAIGGAIAAAAGVDFLCYVTPAEHLRLPDLDDMKEGIIASRIAAHAADISKK VPKAIDWDNRMAKYRADIDWEGMFTEAIDEEKARRYRKESTPENEDTCTMCGKMCSMRTM KKVMSGEDVNILK >gi|296153520|gb|ADVK01000055.1| GENE 180 166464 - 167084 770 206 aa, chain - ## HITS:1 COG:FN1758 KEGG:ns NR:ns ## COG: FN1758 COG0352 # Protein_GI_number: 19705079 # Func_class: H Coenzyme transport and metabolism # Function: Thiamine monophosphate synthase # Organism: Fusobacterium nucleatum # 1 206 1 206 206 374 100.0 1e-104 MDLKDCKIYLVTDEKACNGKDFYKCIEESIKGGVKIVQLREKNISTKDFYKKALKVKEIC KNYEVLFIINDRLDITQAVEADGVHLGQSDMPIEKAREILKDKFLIGATARNIEEAEKAQ LLGADYIGSGAIFGTSTKDNAKRLEMEDLKKIVNSVKIPVFAIGGININNVWMLKNIGLQ GVCSVSGILSEKDCKKAVENILKNFN >gi|296153520|gb|ADVK01000055.1| GENE 181 167094 - 167927 1055 277 aa, chain - ## HITS:1 COG:FN1759 KEGG:ns NR:ns ## COG: FN1759 COG0351 # Protein_GI_number: 19705080 # Func_class: H Coenzyme transport and metabolism # Function: Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase # Organism: Fusobacterium nucleatum # 1 277 13 289 289 465 99.0 1e-131 
MKNVLSIAGSDCSAGAGIQADLKTFVANGVYGMTVITSLTAQNPQKVKMLEDVSIEMLEK QTEAIFDVMEVSAVKIGMLNSKENGEVIYEKLLKYKAKNIVLDPVMIATSGNSLIKDETK DFLVNKLFKIVDIITPNLDETKEIVKIILNNENIENIDSIEKMKNYGKVIADFTKKWVLI KGGHLSNSAVDILMNKDEIYILEGEKISSNNTHGTGCSLSSAIASNLAKSYSMLDSVKKA KNFVLFSIKNSVDFGEIAGTVNQMGEIYKNIDIEKLY >gi|296153520|gb|ADVK01000055.1| GENE 182 168052 - 168147 129 31 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MCAIIANIEKIKLNNKKTSYKGVNKLTEKRM >gi|296153520|gb|ADVK01000055.1| GENE 183 168189 - 168266 87 25 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MRWFESSHPSHIKNSTRKKFLVFYL >gi|296153520|gb|ADVK01000055.1| GENE 184 168234 - 168407 813 57 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MAGVAGFEPTHNGVKVRCLTAWRHPKKMVGIARFELATPCSQGRCATGLRYIPLQMT >gi|296153520|gb|ADVK01000055.1| GENE 185 168456 - 169202 768 248 aa, chain - ## HITS:1 COG:FN1762 KEGG:ns NR:ns ## COG: FN1762 COG3022 # Protein_GI_number: 19705081 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 248 1 248 248 360 98.0 1e-99 MKIIFSPSKEMREENIFENKKIEFTESKFKDKTNILIKILSKKSINEIENIMKLKGELLN NTYKDIQNYDKLKYIPAISMYYGVSFKELELEDYSEKSLKYLKNNLLILSALYGALLAFD LLKKYRLDMTMSITDKGLYNFWKKDVNDYISNILNKDEILLNLASSEFSKLIDNKKISMI NIDFKEEKDGTYKSISIYSKKARGKFLNYLVKNQVSNLEEITKIELDGYNINKDLSDEKN FIFTRKNS >gi|296153520|gb|ADVK01000055.1| GENE 186 169206 - 169760 637 184 aa, chain - ## HITS:1 COG:FN1763 KEGG:ns NR:ns ## COG: FN1763 COG3758 # Protein_GI_number: 19705082 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 184 1 184 184 357 100.0 6e-99 MNKVIKKEDWKVSVWAGGTTNEIFIYPENSSYADRIFKARISVATTNNGEKSLFTSLPGV ERYISKLSGDMKLQHTDHYDVEMEDYQIDRFRGDWETYSWGKFRDFNLMLKGIRGDLYYR QIRSKCRLHLEKDSTVVFLYVIDGKINVNGTDLETEDFYITDDNILDVFGNNPKIYYGFI KEWD >gi|296153520|gb|ADVK01000055.1| GENE 187 169891 - 171195 2252 434 aa, chain - ## HITS:1 COG:FN1764 KEGG:ns NR:ns ## COG: FN1764 COG0148 # Protein_GI_number: 19705083 # Func_class: G 
Carbohydrate transport and metabolism # Function: Enolase # Organism: Fusobacterium nucleatum # 1 434 1 434 434 796 99.0 0 MTGIVEVIGREILDSRGNPTVEVDVILECGAKGRAAVPSGASTGSHEAVELRDEDKGRYL GKGVLKAVNNVNTEIREALLGMDALNQVEIDKIMLELDGTPNKGRLGANAILGVSLAVAK AAAEALGQPLYKYLGGVNSKELPLPMMNILNGGAHADSAVDLQEFMIQPVGAKSFREAMQ MGAEVFHHLGKILKANGDSTNVGNEGGYAPSKIQGTEGALALISEAVKAAGYELGKDITF ALDAASSEFCKEVNGKYEYHFKREGGVVRTTDEMIKWYEELINKYPIVSIEDGLGEDDWD GWVKLTKAIGDRVQIVGDDLFVTNTERLKKGIELGAGNSILIKLNQIGSLTETLDAIEMA KRAGYTAVVSHRSGETEDATIADVAVATNAGQIKTGSTSRTDRMAKYNQLLRIEEELGSV AQYNGRDVFYNIKK >gi|296153520|gb|ADVK01000055.1| GENE 188 171221 - 172639 1970 472 aa, chain - ## HITS:1 COG:FN1765 KEGG:ns NR:ns ## COG: FN1765 COG0469 # Protein_GI_number: 19705084 # Func_class: G Carbohydrate transport and metabolism # Function: Pyruvate kinase # Organism: Fusobacterium nucleatum # 1 472 4 475 475 874 99.0 0 MKKTKIVCTIGPVTESVETLKELLNRGMNVMRLNFSHGDYEEHGMRMKNFRQAMSETGIR GGILLDTKGPEIRTMTLKDGKDVSIKAGQKFTFTTDQSVVGDNERVAVTYENFAKDLKVG NMVLVDDGLLELDVVEIKGNEVICIARNNGDLGQKKGINLPNVSVNLPALSEKDIEDLKF GCQNNVDFVAASFIRKADDVRQVRKVLRENGGERIQIISKIESQEGLDNFDEILAESDGI MVARGDLGVEIPVEEVPCAQKMMIRKCNRVGKTVITATQMLDSMIKNPRPTRAEANDVAN AILDGTDAVMLSGETAKGKYPLAAVEVMNKIAKKVDATIPPFYIEGVINKNDITSAVAEG SADISERLNAKLIVVGTESGRAARDMRRYFPRAHILAITNNEKTANQLVLSRGIISYVDA SPKTLEEFFVVAESVAKKLNLVKNNDIIIATCGESVFIQGTTNSIKVIQVKA >gi|296153520|gb|ADVK01000055.1| GENE 189 172777 - 172845 76 22 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MARAYGSYPYGRWFESIPRHHY >gi|296153520|gb|ADVK01000055.1| GENE 190 172838 - 173020 622 60 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MVEHNLAKVRVASSSLVSRSRIIYYIWGYSSVWESDALALRRSAVRSRLSPPLCLGSSVG >gi|296153520|gb|ADVK01000055.1| GENE 191 173026 - 173130 425 34 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MAELADALDLGSSVPDVRVQVSLSAPRSLKILRE >gi|296153520|gb|ADVK01000055.1| GENE 192 173137 - 173235 332 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MYSIYWCRLSESNQSPTDYKSVALPDELKRHI >gi|296153520|gb|ADVK01000055.1| GENE 193 173190 - 173495 
714 101 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MTHQKFAPFVQWLGHQIFTLETGVQFPYGVPLWSHSSVGRAPALQAGGHRFKSYCDHHIG NGGVAQLVRAPACHAGGREFEPRHSRHYICRFSSSGRATDL >gi|296153520|gb|ADVK01000055.1| GENE 194 173554 - 173652 325 32 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MTHIFGAGNEVRTRDIQLGRLTLYQLSYSRVA >gi|296153520|gb|ADVK01000055.1| GENE 195 173733 - 174008 1119 91 aa, chain + ## HITS:0 COG:no KEGG:no NR:no MQTSALPLGDVAIYYNKWCPEAESNHRHGDFQSPALPTELSGHFLYREVAGVTRLELATS CVTGRRSNQLSYTPISYMVVAIGLEPMTPCL >gi|296153520|gb|ADVK01000055.1| GENE 196 174119 - 174862 1068 247 aa, chain - ## HITS:1 COG:no KEGG:FN1780 NR:ns ## KEGG: FN1780 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 247 1 247 247 473 99.0 1e-132 MSTSAFTGFVLLNEAKFDRDKFLKDLKEDWKITLNLGEENKEKDMLVGNIDDIMVAVALM PAPIPDNEAVESAKTNYRWKDAIKVAEEHKAHIIVSLLGEPNLVDGAKLYSKIISALTKQ ENCTGINVLGTVLNPDMYRDFTEYYVENDMFPVENMIFIGLYASEDEKINAYTYGMEAFG KKEMEIIASSQSPEGVYYFLQGIADYVITSDVILQDGETIGFSAEQKIPISQSKGIAVNG TTLKLAY >gi|296153520|gb|ADVK01000055.1| GENE 197 174930 - 177413 4036 827 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|34762725|ref|ZP_00143715.1| LytB protein; SSU ribosomal protein S1P [Fusobacterium nucleatum subsp. 
vincentii ATCC 49256] # 1 827 1 827 827 1559 95 0.0 MEIIRAKHMGFCFGVLEAINVCNSLVEEKGRKYILGMLVHNKQVVEDMERKGFKLVTEDE LLNDMDELKEDDIVVIRAHGTSKSVHEKLKERKVKVFDATCIFVNKIRQEIEIANENGYS ILFMGDKNHPEVKGVISFADDIQIFESFEEAKKLKIDLDKTYLLSTQTTLNKKKFEEIKK YFKENYKNVVIFDKICGATAVRQKAVEDLAVKVEVMIIVGDTKSSNTKKLYEISKKLNDN SYLVENEEQLDLSIFRGKEVVGITAGASTPEETIMNIEKKVRGIYKMSNVNENQNEFSLM LEEFLPNQEKRVEGVIESMDQNFSYLDVPGERTVVRVRTDELKDYKVGDTVEVLITGLSE EEDDQEYITASRRKIEVEKNWEKIEDSFKNKTILDAKVTKKIKGGYLVEAFLYPGFLPNS LSEISDSEEKVNGKKIQVIVKDIKMDPKDKKNRKITYSVKDIRLAEQEKEFAGLAVGQIV DCVVTEVLDFGLAVDINTLKGFIHISEVSWKRLDKLSDNYKVGDKIKAVVVSLDEAKRNV KLSIKKLEEDPWATVANEFKVDDEVEGIVTKVLPYGAFVEIKPGVEGLVHISDFSWTKKK VNVADYVKEGEKIKVRITDLHPEDRKLKLGIKQLVANPWETAEKDFAIDTVIKGKVVEVK PFGIFVEIADGIDAFVHSSDYNWVGEEIPKFEIGNEVELKITELDLNNKKIKGSLKALRK SPWEHAMEEYKVGTTVEKKIKTVADFGLFIELIKGIDGFIPTQFASKEFIKNIRDKFSEG DVVKAQVVEVNKETQKIKLSIKKIEIEEEKREEREQIEKYSTSSSEE >gi|296153520|gb|ADVK01000055.1| GENE 198 177419 - 177688 430 89 aa, chain - ## HITS:1 COG:FN1782 KEGG:ns NR:ns ## COG: FN1782 COG1925 # Protein_GI_number: 19705087 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, HPr-related proteins # Organism: Fusobacterium nucleatum # 1 89 1 89 89 148 100.0 3e-36 MKSVKVHIKNKKGLHARPSSLFVQLVTKYDSDITVKSEDETVNGKSIMGLMLLAAEEGRE LELIADGPDEDAMLEDLVDLIEVKRFNEE >gi|296153520|gb|ADVK01000055.1| GENE 199 177874 - 178698 1218 274 aa, chain + ## HITS:1 COG:FN1783 KEGG:ns NR:ns ## COG: FN1783 COG4820 # Protein_GI_number: 19705088 # Func_class: E Amino acid transport and metabolism # Function: Ethanolamine utilization protein, possible chaperonin # Organism: Fusobacterium nucleatum # 1 274 1 274 274 484 98.0 1e-137 MNLDKVNKYIKDFEKTIKKPKINFDKSKFYVGVDLGTANIVITILDKDGKPVAGATQRSR VVRDGIVVDFMEAIEIVRKLKENLEKKLGIEITEGYTAIPPGVEQGSVRAIVNVIESAGI DVLKVVDEPTAASYVLGITDGVVVDLGGGTTGISILEKGKVIFVADEPTGGTHMTLVLAG SYGIDFETAEDIKTDKKKEKEIFVQITPVLQKMASIVKKYIKDYKVKDVFLVGGACSFDG SESIFERELGLNIYKPYMPVYITPLGIALAGMKS >gi|296153520|gb|ADVK01000055.1| 
GENE 200 178930 - 179385 651 151 aa, chain - ## HITS:1 COG:no KEGG:FN1784 NR:ns ## KEGG: FN1784 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 151 1 151 151 254 100.0 9e-67 MKKIILGLFLLLVVSSYAVPSFVNSKRAEERGYKVVQDTEGTLSIQKVDDESATTISYWY GVKDPDVAELNKILKEDATRDLKSKGSLKMGKAYVEKYTDGKNFMYTLVFKNAKPADVLT SVAYYTKKEIPKSELNKYVDKLLAESEKYIK >gi|296153520|gb|ADVK01000055.1| GENE 201 179407 - 179880 418 157 aa, chain - ## HITS:1 COG:no KEGG:FN1785 NR:ns ## KEGG: FN1785 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 157 1 157 157 223 91.0 2e-57 MKKILLVLFLVLGVLSFSAPSFIDISKIEKNGYKISGEKENRLLLGKTLEKGDRIENIFI SYDFVEENPNFPNSKHQFLKETSPEGLEFTNSFETKRAIIGKYTEINNSYFYTFVSKKNK VKNCYVSVYYATDKNFSKNELEEVCNKFLDEGESFLK >gi|296153520|gb|ADVK01000055.1| GENE 202 180021 - 181001 1268 326 aa, chain + ## HITS:1 COG:FN1786 KEGG:ns NR:ns ## COG: FN1786 COG2870 # Protein_GI_number: 19705091 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase # Organism: Fusobacterium nucleatum # 1 319 3 321 323 569 98.0 1e-162 MISKLIENFKNIKIAVIGDLMLDEYIMGKVDRISPEAPVPVVKVTEEKFVLGGAANVINN LAALGANVYCGGLVGKDKNAEKLINAFPKNVDCNLILKVENRPTIVKKRVIAGHQQLLRL DWEEEFYINEDEENIIIENLKNHIKNLNAVILSDYNKGLLTKSLSQKIINLCRENNVIVT VDPKPKNISNFMGASSITPNKKEAYAAVEANPSENIDIVGEKLKEKYNLDTVLVTRSEEG MTLYDKEIHNIPTYAKEVYDVTGAGDTVISVFTLAKAAGATWEEAAKIANAAGGIVVGKI GTSTVSEKELISTYNSIYNTGDVCKC >gi|296153520|gb|ADVK01000055.1| GENE 203 180995 - 181489 807 164 aa, chain + ## HITS:1 COG:FN1788 KEGG:ns NR:ns ## COG: FN1788 COG0245 # Protein_GI_number: 19705093 # Func_class: I Lipid transport and metabolism # Function: 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase # Organism: Fusobacterium nucleatum # 1 157 1 157 160 278 98.0 4e-75 MLRIGNGYDVHKLVEGRKLMLGGVEVPHTKGVLGHSDGDVLLHAITDAIIGALGLGDIGL HFPDNDENLKDIDSAILLKKINNIMKEKNYKIVNLDSIIVIQKPKLRPYIDSIRNNIAKI 
LEIDSELINVKAKTEEKLGFTGDETGVKSYCVVLLEKGIELSIT >gi|296153520|gb|ADVK01000055.1| GENE 204 181505 - 182857 1372 450 aa, chain + ## HITS:1 COG:FN1789 KEGG:ns NR:ns ## COG: FN1789 COG0534 # Protein_GI_number: 19705094 # Func_class: V Defense mechanisms # Function: Na+-driven multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 450 10 459 459 749 99.0 0 MLDKSSFRKTVFAFLLPMAIQNLINVAVSSTDVIMLGRYSEVALSASSLASQIQFILILL FFGIGSGATVLTAQYWGKKDTKSIEKILAIGIKIAFGLSLLFFIFAFFFSRNAMRLFTND EATILEGIKYLKIVSFSYLTTSISIVYLVTMRSVERVTVSTITYATSFVSNFIINYLLIF GNFGFPKMGIEGAAIGTLVARLIELGIVFYYNSKNHHFVSIKWKYIKSLDPILKKDFLKY SAPTMMNELLWSGGTAAGIAILGRLGNSIVAANSITSVVRQLAMVFAFGLANTAAIMVGK EIGKKDFHTAEIYAKKLLLYSFLSSLLGVALLLILKPFIIKKFALNAEVEDYLNLTLNIL FYYIPLQSISAVLIVGVFRAGGDTKFALIADVLPLWCGSVLISAFAAFYLNLPTKLIYFL IMSDEIIKQPLIIWRYRSKKWINNVTRELN >gi|296153520|gb|ADVK01000055.1| GENE 205 182869 - 183390 674 173 aa, chain + ## HITS:1 COG:FN1790 KEGG:ns NR:ns ## COG: FN1790 COG2109 # Protein_GI_number: 19705095 # Func_class: H Coenzyme transport and metabolism # Function: ATP:corrinoid adenosyltransferase # Organism: Fusobacterium nucleatum # 1 173 1 173 173 307 99.0 6e-84 MEKGYTQIYTGNGKGKTTAALGLITRAVGSNLKIFLCQFLKGRDYGELYTLKKFETVTHE RYGRGVFIRSKEYVTDEDKKLMREGYESLKKALLSEKYDIVIADEILGTLRYDLISIEEI KFLIENKPETTEFVLTGRNAPDELIEMADLVTEMREVKHYFQKGVMARKGIEK >gi|296153520|gb|ADVK01000055.1| GENE 206 183399 - 184157 895 252 aa, chain + ## HITS:1 COG:FN1791_1 KEGG:ns NR:ns ## COG: FN1791_1 COG0494 # Protein_GI_number: 19705096 # Func_class: L Replication, recombination and repair; R General function prediction only # Function: NTP pyrophosphohydrolases including oxidative damage repair enzymes # Organism: Fusobacterium nucleatum # 1 158 1 158 158 293 96.0 2e-79 MITTLCYLEKDNKYLMLHRTKRENDINKNKWLGVGGKLEKSETPEQCLFREVKEETGLTL IDYIHRGIVIFNFNDDEPLYMYLYTSKNFLGEVQECSEGDLKWIDKSEIYNLNLWEGDKI FLDLLNKDTPFFYLILDYENDNLISSDLKFKENNFTCFEVFVPENYVKDIVKALSRYGLL KEGTYTDVYALIDVEGHWTTLEGAKAFIGEVGKESIEREKLMKFRVKKEFTDLAYYLIKK AHPYEVPVINIF 
>gi|296153520|gb|ADVK01000055.1| GENE 207 184207 - 184602 763 131 aa, chain - ## HITS:1 COG:no KEGG:FN1792 NR:ns ## KEGG: FN1792 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 11 131 1 121 121 119 100.0 3e-26 MKKFAMLALAMSLFLVACGEKKEEEKPAEQAAVEATATEAPATETTEAAAEAKTFSLKTE DGKEFTLVVAADGSTATLTDAEGKATELKNAETASGERYADEAGNEVAMKGAEGILTLGD LKEVPVTVEAK >gi|296153520|gb|ADVK01000055.1| GENE 208 184842 - 186569 2484 575 aa, chain - ## HITS:1 COG:FN1793 KEGG:ns NR:ns ## COG: FN1793 COG1080 # Protein_GI_number: 19705098 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) # Organism: Fusobacterium nucleatum # 1 575 5 579 579 1057 100.0 0 MKQNNLIKGIPASPGIAIGKAFLYKENKLEIDEKSNLSKEEEIERLVKGREIAKKQLEEI KENTLKKLGKDKADIFEGHITLLEDEELFSEIDSKISQKKCTAEFALNEAIDEYATMLAN LEDTYFKERAGDLRDIGKRWLYGVMNEQIVDLSKLEPETIIVAKELNPSDTAQINLDNVL AFVTEIGGKTAHSSIMARSLELPAVVGVGAVLDELEDNQILIVDALKGEVIVSPDVETLQ IYKEKREKFLKEKEELKALKDKEAISKDGIKVDVWGNIGSPNDVKGIISNGGFGVGLYRT EFLFMEKDSFPTEDEQFEAYKIVAEELKGYPVTIRTMDIGGDKSLPYMELPKEENPFLGW RAIRVCLDREEILRTQFKALLRASKYGKIKIMLPMIMDIVEVRKAKAIFEECKKELQEKG IEFDKNIMLGIMVETPAVAFRAKYFAKECDFFSIGTNDLTQYTLAVDRGNEKIANLYDTY NPSVLQAIKMLIDGAHDGGIKISMCGEFAGDENAIAILFGMGLDAFSMSGISIPRVKRII MKLEKKECQNLVERVLSLSTASEIKEEIKKFMEKI >gi|296153520|gb|ADVK01000055.1| GENE 209 186636 - 186899 384 87 aa, chain - ## HITS:1 COG:FN1794 KEGG:ns NR:ns ## COG: FN1794 COG1925 # Protein_GI_number: 19705099 # Func_class: G Carbohydrate transport and metabolism # Function: Phosphotransferase system, HPr-related proteins # Organism: Fusobacterium nucleatum # 1 87 1 87 87 136 100.0 1e-32 MTSKTVEIVNETGLHTRPGNEFVSLAKTFSSQISVENEAGVKVNGTSLLKLLSLGIKKGS KITVYADGEDENEAVDKLSSLLENLKD >gi|296153520|gb|ADVK01000055.1| GENE 210 187047 - 188636 1754 529 aa, chain - ## HITS:1 COG:no KEGG:FN1795 NR:ns ## KEGG: FN1795 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: 
not_defined # 251 529 1 279 279 518 98.0 1e-145 MYNEISITYLEEALKSDSTLHFPKEEYLELCQLDRKVPLDVSQTKVFGKDIFIGYHRHII CYSFGNNLKDFPIFGDYLIEEIVEGEKILFTNCYDPGVGIIVKIEYNEKNAVHNPAWFEN CFGEIKSIDIGANGVCRYAFMGMVLDRAGQKMFYYHAEVLIGNERYTFEFYYADRVQAQQ INFMLGNIKAVLPKVEVVNEEVKSNFIHNIFKKFKKISDKLGIRKKDLSLNKIRKFIGSN EKSETLTINELYNFIEAVSKFNSQQISRIDEGYKQYFGLLLSGVLSVINFGKLEPLQSSS DYEYDEDIIKKMKHSLYESWEIYDTKSVFETISWLLNEGHSKKYESLKYTTLDEAIQEKY KKIKKDIETENYSDDVYTTHGFRNKEHYIETLLKDEIKELQHNWDFILAFRGLNVKNIRA WDIGRAAYLVWECYFFDYLKKFEAEQLIDTLAEVAAKEFSNFTEFASSYALGRIFWYFSI SKKNNINKEMTEIVYELLEAFEILFSSNDGLWAVNQWWNIDESNNKTCR Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:10:55 2011 Seq name: gi|296153482|gb|ADVK01000056.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00070, whole genome shotgun sequence Length of sequence - 41500 bp Number of predicted genes - 37, with homology - 37 Number of transcription units - 13, operones - 7 average op.length - 4.4 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 80 - 547 743 ## COG0662 Mannose-6-phosphate isomerase - Prom 676 - 735 8.9 - Term 749 - 791 8.2 2 2 Op 1 1/1.000 - CDS 813 - 1838 682 ## PROTEIN SUPPORTED gi|229879751|ref|ZP_04499249.1| (SSU ribosomal protein S18P)-alanine acetyltransferase 3 2 Op 2 14/0.000 - CDS 1835 - 2383 605 ## COG2137 Uncharacterized protein conserved in bacteria 4 2 Op 3 1/1.000 - CDS 2376 - 3512 2177 ## COG0468 RecA/RadA recombinase 5 2 Op 4 . - CDS 3518 - 4525 1319 ## COG0859 ADP-heptose:LPS heptosyltransferase 6 2 Op 5 . - CDS 4527 - 5123 555 ## FN0545 lipopolysaccharide core biosynthesis protein RfaY 7 2 Op 6 11/0.000 - CDS 5116 - 6144 1032 ## COG0859 ADP-heptose:LPS heptosyltransferase 8 2 Op 7 3/0.000 - CDS 6141 - 7160 1186 ## COG0859 ADP-heptose:LPS heptosyltransferase 9 2 Op 8 3/0.000 - CDS 7150 - 7929 933 ## COG0463 Glycosyltransferases involved in cell wall biogenesis 10 2 Op 9 . 
- CDS 7947 - 9041 1252 ## COG0726 Predicted xylanase/chitin deacetylase - Prom 9165 - 9224 10.7 + Prom 8997 - 9056 10.5 11 3 Op 1 1/1.000 + CDS 9269 - 10573 1763 ## COG0001 Glutamate-1-semialdehyde aminotransferase 12 3 Op 2 . + CDS 10563 - 11015 409 ## COG1648 Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) + Term 11093 - 11128 -0.3 13 4 Tu 1 . - CDS 11192 - 11416 302 ## COG1314 Preprotein translocase subunit SecG - Prom 11447 - 11506 14.9 + Prom 11417 - 11476 9.7 14 5 Op 1 1/1.000 + CDS 11560 - 12144 575 ## COG0344 Predicted membrane protein 15 5 Op 2 . + CDS 12162 - 13307 1345 ## COG0592 DNA polymerase sliding clamp subunit (PCNA homolog) - Term 13288 - 13320 4.9 16 6 Op 1 . - CDS 13334 - 13912 983 ## COG1611 Predicted Rossmann fold nucleotide-binding protein 17 6 Op 2 . - CDS 13977 - 14513 343 ## FN0534 hypothetical protein - Prom 14538 - 14597 9.9 + Prom 14565 - 14624 10.0 18 7 Op 1 1/1.000 + CDS 14672 - 15712 904 ## COG2855 Predicted membrane protein 19 7 Op 2 . + CDS 15734 - 16789 903 ## COG3180 Putative ammonia monooxygenase 20 7 Op 3 . + CDS 16795 - 18057 582 ## gi|296329382|ref|ZP_06871882.1| conserved hypothetical protein + Term 18106 - 18146 -0.0 - Term 18090 - 18138 8.1 21 8 Tu 1 . - CDS 18160 - 18360 430 ## COG1278 Cold shock proteins - Prom 18385 - 18444 6.3 + Prom 18540 - 18599 18.6 22 9 Op 1 1/1.000 + CDS 18782 - 19903 1447 ## COG2872 Predicted metal-dependent hydrolases related to alanyl-tRNA synthetase HxxxH domain 23 9 Op 2 1/1.000 + CDS 19903 - 20979 1062 ## COG0820 Predicted Fe-S-cluster redox enzyme 24 9 Op 3 1/1.000 + CDS 20983 - 23235 3033 ## COG0744 Membrane carboxypeptidase (penicillin-binding protein) 25 9 Op 4 1/1.000 + CDS 23237 - 25996 2736 ## COG0210 Superfamily I DNA and RNA helicases 26 9 Op 5 28/0.000 + CDS 25993 - 27159 1251 ## COG0420 DNA repair exonuclease 27 9 Op 6 1/1.000 + CDS 27149 - 29914 3225 ## COG0419 ATPase involved in DNA repair 28 9 Op 7 . 
+ CDS 29923 - 30591 773 ## COG1636 Uncharacterized protein conserved in bacteria 29 9 Op 8 . + CDS 30584 - 31201 502 ## FN0520 hypothetical protein 30 9 Op 9 1/1.000 + CDS 31210 - 32241 1138 ## COG2849 Uncharacterized protein conserved in bacteria 31 9 Op 10 1/1.000 + CDS 32263 - 33252 1131 ## COG2849 Uncharacterized protein conserved in bacteria + Prom 33257 - 33316 11.7 32 10 Op 1 13/0.000 + CDS 33471 - 34793 1568 ## COG1538 Outer membrane protein 33 10 Op 2 27/0.000 + CDS 34803 - 35876 1294 ## COG0845 Membrane-fusion protein 34 10 Op 3 . + CDS 35873 - 38941 3510 ## COG0841 Cation/multidrug efflux pump + Term 38955 - 38997 7.9 - Term 38943 - 38983 5.0 35 11 Tu 1 . - CDS 38997 - 39437 635 ## FN0514 hypothetical protein - Prom 39503 - 39562 12.7 + Prom 39509 - 39568 12.1 36 12 Tu 1 1/1.000 + CDS 39684 - 40112 779 ## COG0716 Flavodoxins + Term 40134 - 40188 5.1 + Prom 40171 - 40230 10.6 37 13 Tu 1 . + CDS 40283 - 41494 1752 ## COG0426 Uncharacterized flavoproteins Predicted protein(s) >gi|296153482|gb|ADVK01000056.1| GENE 1 80 - 547 743 155 aa, chain - ## HITS:1 COG:FN0550 KEGG:ns NR:ns ## COG: FN0550 COG0662 # Protein_GI_number: 19703885 # Func_class: G Carbohydrate transport and metabolism # Function: Mannose-6-phosphate isomerase # Organism: Fusobacterium nucleatum # 15 155 1 141 141 264 99.0 4e-71 MKKTLFSILLIGICMTGAANAKEKNPILLKQVYKKEELITLDKQNVAGGNGTLHGKFAFT RDMATEDEAIKEIGWMTLNKGESIGVHPHKNNEDTYIIVSGEGVFTDGSGKETIVKAGDV TIARPNQSHGLRNEKDEPLVFLDIIAQNHALKAEK >gi|296153482|gb|ADVK01000056.1| GENE 2 813 - 1838 682 341 aa, chain - ## PROTEIN SUPPORTED ## NR: gi|229879751|ref|ZP_04499249.1| (SSU ribosomal protein S18P)-alanine acetyltransferase [Slackia heliotrinireducens DSM 20476] # 2 316 439 763 781 267 43 9e-71 MIILGIESSCDETSIAVVKDGKEILSNNISSQIEIHKEYGGVVPEIASRQHIKNIATVLE ESLEEAKITLDDVDYIAVTYAPGLIGALLVGVSFAKGLSYAKNIPIIPVHHIKGHMYANF LEHDVELPCISLVVSGGHTNIIYIDENHNFINIGETLDDAVGESCDKVARVLGLGYPGGP VIDKMYYKGDRDFLKITKPKVSRFDFSFSGIKTAIINFDNNMKMKNQEYKKEDLAASFLG 
TVVDILCDKTLDAAVEKNVKTIMLAGGVAANSLLRSQLTEKAAEKGIKVIYPSMKLCTDN AAMIAEAAYYKLKNAKNEKDCFAGLDLNGVASLMVSDEKVM >gi|296153482|gb|ADVK01000056.1| GENE 3 1835 - 2383 605 182 aa, chain - ## HITS:1 COG:FN0548 KEGG:ns NR:ns ## COG: FN0548 COG2137 # Protein_GI_number: 19703883 # Func_class: R General function prediction only # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 53 182 1 130 130 187 98.0 7e-48 MIKGNKLLLDNDKIIYLTKEMFSKFDLKDKISLDDETFYSLIYFRIKLSAYTMLAKRDYF KKELKNKLIEKIGFADIVEDVVEDFEEKGYLDDYEKAKSYAAQHSNYGTKKLSFILYQMG VDKEIVSEILEDEKDNQIEKIKQLWIKLGNKEHKKKIESILRKGFLYGDIKKAISSLEEE EE >gi|296153482|gb|ADVK01000056.1| GENE 4 2376 - 3512 2177 378 aa, chain - ## HITS:1 COG:FN0547 KEGG:ns NR:ns ## COG: FN0547 COG0468 # Protein_GI_number: 19703882 # Func_class: L Replication, recombination and repair # Function: RecA/RadA recombinase # Organism: Fusobacterium nucleatum # 8 378 11 381 381 592 99.0 1e-169 MAAKKDKSVPDSKITDKEGKEKAVKDAMAAITKGFGSGLIMKLGEKSSMNVESIPTGSIN LDIALGIGGVPKGRIIEIYGAESSGKTTLALHVIAEAQKQGGTVAFIDAEHALDPVYAKA LGVDIDELLISQPDYGEQALEIADTLVRSGAIDLIVIDSVAALVPKAEIDGEMSDQQMGL QARLMSKGLRKLTGNLNKYKTTMIFINQIREKIGVTYGPTTTTTGGKALKFYSSVRMEVK KMGTVKQGDDPIGSEVIVKVTKNKVAPPFKEAAFEILYGKGISKVGEIIDAAVAKDVIVK AGSWFSFRDQSIGQGKEKVRAELETNPELLAQVEKDLKEAIAKGPVDKKKKKSKKEVSSD DTDDENLEIDDAIEENND >gi|296153482|gb|ADVK01000056.1| GENE 5 3518 - 4525 1319 335 aa, chain - ## HITS:1 COG:FN0546 KEGG:ns NR:ns ## COG: FN0546 COG0859 # Protein_GI_number: 19703881 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose:LPS heptosyltransferase # Organism: Fusobacterium nucleatum # 1 335 1 335 335 609 97.0 1e-174 MENHKRILVIRLSSIGDIILTTAVLRAFKEKYPNYIIDFLVIDKFKDAISLSPYVDNLLI YNKKKNDGLFNLIKFSKELSKNNYDYVFDLHSKFRSKIITFVLSKFYGAKAYTYKKRAFW KSILVNLKLIKYKVDNTIIKNYFSAFKDFDLEYQGEKLNFSFEPNLKEKFKEYKDYIAFA VGASKETKKWTVEGFGKLAKKLYETYGKKVILVGGKEDCERCDTIEKISENSVINLAGKL TLKETGALLSQTRFLLTNDSGPFHIARGVGCKTFVIFGPTSPGMFDFGENDVLVYNKIDC 
SPCSLHGDKVCPKKHFKCMKELSYEKVFKIIENKE >gi|296153482|gb|ADVK01000056.1| GENE 6 4527 - 5123 555 198 aa, chain - ## HITS:1 COG:no KEGG:FN0545 NR:ns ## KEGG: FN0545 # Name: not_defined # Def: lipopolysaccharide core biosynthesis protein RfaY # Organism: F.nucleatum # Pathway: not_defined # 1 198 1 198 198 327 96.0 2e-88 MDKADISKIIKVLKDDHRSYVCVFEIDEDNKKYVYKEPREKNKRKWQKFLNFFRGSESKR EYYQMEKINSLRLKTAKPIFYDKNYLIYEYVEGNKPTIDDIDLVVKELQKIHSMGYLHGD SHIDNFLISPNREIYIIDSKFQKNKYGKFGQIFEMMYLEDSVGIKIDYDKKSFYYKGAML LRKYLTFFSKLKNIIRGK >gi|296153482|gb|ADVK01000056.1| GENE 7 5116 - 6144 1032 342 aa, chain - ## HITS:1 COG:FN0544 KEGG:ns NR:ns ## COG: FN0544 COG0859 # Protein_GI_number: 19703879 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose:LPS heptosyltransferase # Organism: Fusobacterium nucleatum # 1 342 1 342 342 643 99.0 0 MNILIIHTAFIGDIVLSTALVSKVKEKYPDSDIYYLTTPLGKEILKNNPKIKEIIVYDKR GKDKGFGAFISFVRKIRKLKIDVCLTPHRYLRSSILSLLSGAKIRVGYDIASLSFVFNKK IKYDKTKHEVEKLLSFIDDNTKRYELEMYPNEKDKIKIDTLIKNLSENKKIILIAPGSKW FTKKWPEEYFRTLIQNLVKRADLLIVITGGKEEKEIELNLDSKVLDLRGEISLLELAELT KRAILVVSNDSAPIHITSVFPNTRIVGIFGPTVKEFGFFPWSQNSKVFEIDNLYCRPCAI HGGNSCPEKHFRCMREITPDLIENEIYNYIASINTKKVKADG >gi|296153482|gb|ADVK01000056.1| GENE 8 6141 - 7160 1186 339 aa, chain - ## HITS:1 COG:FN0543 KEGG:ns NR:ns ## COG: FN0543 COG0859 # Protein_GI_number: 19703878 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: ADP-heptose:LPS heptosyltransferase # Organism: Fusobacterium nucleatum # 1 339 7 345 345 620 100.0 1e-177 MEIKRILVSRTDKIGDLVLSIPSFFMLKKMYPNAELVVIVRKYNVDIVKNLPYIDRIVII DEYSKAELLEKIAYFKADVFIALYNDSYIASLARASKAKIKIGPISKLSSFFTYNKGVLQ KRSLSLKNEGQYNLDLVTKLDRKRFAILYELNTKLILTDENKKVADVYFKENSIEGKTLV VNPFIGGSAKNITDEQYISILKKIKEKMPDLNIIITSYTTDEERTEKLCKDIGKDKIFAF SNGASILNTASIIDRADVYFGASTGPTHIAGALGKKIVAIYPHKKTQSPTRWGVLGNSNV RYIIPDENNPNEDYKNPYFDNFTKDMEDKVVKAILEALK >gi|296153482|gb|ADVK01000056.1| GENE 9 7150 - 7929 933 259 aa, chain - ## HITS:1 COG:FN0542 KEGG:ns NR:ns ## COG: FN0542 
COG0463 # Protein_GI_number: 19703877 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Glycosyltransferases involved in cell wall biogenesis # Organism: Fusobacterium nucleatum # 1 259 5 263 263 485 100.0 1e-137 MTLTVAMITLNEEKNLERTLKSVQDFADEIVIVDSGSTDKTEEIAKKFGAKFVYQQWLGY GPQRNRAIELSTSDWILNIDADEEISPELANKIKGIKENSRYKVYKINFMSVCFNKKIKH GGWSNTYRIRLFRKNAGSYNENSVHEEFVTNQEIVKLHKYIYHHSYSDLADYFKKFNKYT TLGAIEYYKKGKKARLISIVLSPLYKFIRMYIIRLGFLDGLEGFLLATTSSLYTMVKYYK LREIYKNGTYIEGEGNNGN >gi|296153482|gb|ADVK01000056.1| GENE 10 7947 - 9041 1252 364 aa, chain - ## HITS:1 COG:FN0541 KEGG:ns NR:ns ## COG: FN0541 COG0726 # Protein_GI_number: 19703876 # Func_class: G Carbohydrate transport and metabolism # Function: Predicted xylanase/chitin deacetylase # Organism: Fusobacterium nucleatum # 14 364 1 351 351 614 99.0 1e-176 MVYILIIITILFILLIAFNKSAVPVFLYHQVNPISSNVNPEIFEEHLKIIKKYNMETIKI SEYYNKNINKNSILLTFDDGYYDNFKYVFPLLKKYNMKATIFLNTLYIMDKRENEPEIKD NNTVNLEAMKKYIENGKATINQYMSWEEIKEMYNSGLIDFQAHSHKHMAMFKDIKIEGLT KKDKMEAPELYLYGELENNFPIFAKRGEYSGKAKIIKKEFFRTFKKFYEENIENKIADKN EILKKCQEFIDKNSEYFFDESEAEYKKRIEEDYLENKKLIEKNIGNQVKFFCWPWGHRSK ETIKILKELGVVGFISTKKGNNSMKPNWDMIRRIELRKYTPKKFKINLLVARNLILGKIY GWIS >gi|296153482|gb|ADVK01000056.1| GENE 11 9269 - 10573 1763 434 aa, chain + ## HITS:1 COG:FN0540 KEGG:ns NR:ns ## COG: FN0540 COG0001 # Protein_GI_number: 19703875 # Func_class: H Coenzyme transport and metabolism # Function: Glutamate-1-semialdehyde aminotransferase # Organism: Fusobacterium nucleatum # 1 434 1 434 434 847 98.0 0 MVFKNSIDLYKKAVELIPGGVNSPVRAFKSVNREAPIFIKKGEGAKIWDEDDNEYIDYIC SWGPLILGHNHPKVIEEVKKIIENGSSYGLPTKYEVDLAELIVDIVPSIEKVRLTTSGTE ATMSAVRLARAYTQRNKILKFEGCYHGHSDALLVKSGSGLLTEGYQDSNGITDGVLKDTL TLPFGDIEKVKEILKNKDVACVIVEPIPANMGLIEAHKEFLQGLRKVTEETGTILIFDEV ISGFRLALGGAQEFFGITPDLTTLGKIIGGGYPVGAFGGKKEIMDLVAPVGRVYHAGTLS GNPIASKAGFATISYLKENPNIYKELEEKTNYLIDNIEILAKKYSVNVCVNSMGSLFTIF FVDIDKVENLEDSLKSNTENFSIYFNTMLENGIVVPPSQFEAHFLSMAHTKKELNRTLEV IEIAFKKIGEKSGK 
>gi|296153482|gb|ADVK01000056.1| GENE 12 10563 - 11015 409 150 aa, chain + ## HITS:1 COG:FN0539 KEGG:ns NR:ns ## COG: FN0539 COG1648 # Protein_GI_number: 19703874 # Func_class: H Coenzyme transport and metabolism # Function: Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) # Organism: Fusobacterium nucleatum # 1 150 1 150 152 204 91.0 5e-53 MANKFFPVSIDLNNKNILIIGAGKIALRKTETLLNYNCNITVITKDILEEKFLELEKSNK IKIFKNQEFEEKFLENIFLVVVATDNETLNKKISQLCMSKNILVNNVTSKDDMNVRFASI YEKNDIQIAISANANPKKVVEIKNKIKDIF >gi|296153482|gb|ADVK01000056.1| GENE 13 11192 - 11416 302 74 aa, chain - ## HITS:1 COG:FN0538 KEGG:ns NR:ns ## COG: FN0538 COG1314 # Protein_GI_number: 19703873 # Func_class: U Intracellular trafficking, secretion, and vesicular transport # Function: Preprotein translocase subunit SecG # Organism: Fusobacterium nucleatum # 1 74 1 74 74 112 100.0 1e-25 MSTLLNVLLFLSAFILIVLVLIQPDRSHGMTASMGLGASNTIFGINKDGGPLARATEVVA TLFIICSLLLYLTR >gi|296153482|gb|ADVK01000056.1| GENE 14 11560 - 12144 575 194 aa, chain + ## HITS:1 COG:FN0537 KEGG:ns NR:ns ## COG: FN0537 COG0344 # Protein_GI_number: 19703872 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 194 1 194 194 299 99.0 2e-81 MAFFCFIVLTYFIGAIPSGVWIGKAFKGVDVRDYGSKNSGATNSYRVLGAKLGVAVLIMD VLKGFIPLYIASKFNLVYNDLVILGLVAILAHTFSCFISFKGGKGVATSLGVFLFLIPVI TLILLAIFILVAYFTKYVSLASITAAFLLPIFTFFTHRDSYLFALSVIIAVFVIYRHKTN ISRLLSGTENKFKF >gi|296153482|gb|ADVK01000056.1| GENE 15 12162 - 13307 1345 381 aa, chain + ## HITS:1 COG:FN0536 KEGG:ns NR:ns ## COG: FN0536 COG0592 # Protein_GI_number: 19703871 # Func_class: L Replication, recombination and repair # Function: DNA polymerase sliding clamp subunit (PCNA homolog) # Organism: Fusobacterium nucleatum # 1 381 1 381 381 622 100.0 1e-178 MHIKVNRQNFLSAIRIVEKSVKENKIKPILSCIYAKVKGNKIYFTGTNLDTTIKTSIDVN EVIREGEVAFYYSIIDEYLKEIKDEFVVLRVENGNILFIETEDSTTEYDVFSAEDYPNTF ENIVLNENNFKFEMPSQELVNIFEKVLFSADTPDNIAMNCIRIESILKHLHFVSTNTYRL 
TFLKKNIDKDISDFSVSVPADTISSIIKIIKGLDNEVIKIYKEDAHLYFQYKDTMIITKL IELRFPNYAEILSNISYDKKLHMNNDKLTNLLKRILIFSRSNSESKYSSTYEFKHNEENK NKMAISALNEIARINEELDVNFEGEDLKISLNSKYLLEFIQNIPKEKELVLEFMYSNSAV KVYEKDNDEYIYILMPLALRE >gi|296153482|gb|ADVK01000056.1| GENE 16 13334 - 13912 983 192 aa, chain - ## HITS:1 COG:FN0535 KEGG:ns NR:ns ## COG: FN0535 COG1611 # Protein_GI_number: 19703870 # Func_class: R General function prediction only # Function: Predicted Rossmann fold nucleotide-binding protein # Organism: Fusobacterium nucleatum # 1 192 1 192 192 397 100.0 1e-111 MRKKNVTVYCGASFGVDEKYQEVTRKLGEWIGKNNYNLVYGGGRSGLMGLIADSVLENGG KVTGIITHFLSEREIAHEGITKLIKVDTMSERKKKMADLADIFIALPGGPGTLEEITEVV SWAVLALHPCPCIFFNFDNYYNHIRDFYDLMVAKGYMKKEARDKILFTSSFKEIGNFIIK YEPPKAREYHGE >gi|296153482|gb|ADVK01000056.1| GENE 17 13977 - 14513 343 178 aa, chain - ## HITS:1 COG:no KEGG:FN0534 NR:ns ## KEGG: FN0534 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 37 178 1 142 142 216 99.0 5e-55 MKKIKFTLVFLPLFLGVLIYLLYRSKNLYYYNFIHFLNINGYVLLARETAILYRKLFPTW VIYSLPDGLWLFSTGAAFLIARKKYLLHFFWFLFIYLFMVGIEYIQKFYGGHGTPIGTFD KTDIIAYTYAYIIINIIALILRKFDNKYKYKDKTSKEVMQNIRYTLIFSVLGLLPNMF >gi|296153482|gb|ADVK01000056.1| GENE 18 14672 - 15712 904 346 aa, chain + ## HITS:1 COG:FN0533 KEGG:ns NR:ns ## COG: FN0533 COG2855 # Protein_GI_number: 19703868 # Func_class: S Function unknown # Function: Predicted membrane protein # Organism: Fusobacterium nucleatum # 1 346 1 346 346 526 97.0 1e-149 MNNKLYGIILCFLLALPAWKLGKFFPLVGGPVFGIIIGIVIAILLKNRAKFDSGINFASK KVLQYAVILLGFGLNLQTIISVGSSSLPIILSTISTSLIIAYILAKLINIPTKIATLIGV GSSICGGSAIAATAPVIDAHDDEIAQAISVIFLFNVIAALIFPTLGDILNFSNKGFALFA GTAVNDTSSVTATASAWDSMHNTGTQVLDSATIVKLTRTLAIIPITLFLAVYNSKKNSNA NNFSLKKIFPMFIVYFILASIITTVCNYFIEVGVITENISITINNVFSFFKHLSKFFIIM AMVAIGLNTNIKKLILSGAKPLTLGFCCWFAISLVSIGLQKILGIF >gi|296153482|gb|ADVK01000056.1| GENE 19 15734 - 16789 903 351 aa, chain + ## HITS:1 COG:FN0532 KEGG:ns NR:ns ## COG: FN0532 COG3180 # Protein_GI_number: 
19703867 # Func_class: R General function prediction only # Function: Putative ammonia monooxygenase # Organism: Fusobacterium nucleatum # 1 351 1 351 351 519 99.0 1e-147 MDIINLIVTLIIAILGGYLADKKKVPAAYMLGALFLVAIFNVLFNRAFLPNYFKFVTQIA TGTFIGSKFRSEDIKMLKKVVIPGMTMVVLMIAFSFILSYLMSTFLGIDNLTSFFATAPG GIMDISLIAYDFKANTSQVALLQLIRLISVISFVPFFAKKCYERNNKKNTSFEREIKNEI KEEEKVENKNEKSFLFTVIVGIIGGIIGYFSHLPAGTMSCSMALVAYFNVKTHKAYMPLT LRKIIQSFGGALIGAKVTLSDVIALKNLIFPIILIIIGFCLMNILVGFFLYKTTKFSLST ALLSASPGGMSDISLMAEDLGANGPQVASMQFLRAIFIVGVYPIIIKILFS >gi|296153482|gb|ADVK01000056.1| GENE 20 16795 - 18057 582 420 aa, chain + ## HITS:1 COG:no KEGG:no NR:gi|296329382|ref|ZP_06871882.1| ## NR: gi|296329382|ref|ZP_06871882.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] # 1 420 1 420 420 499 100.0 1e-139 MKLKVLKKNFIAKENIMNITGKILILIITTMIIKNYNLIEDLKNGVFNILSNYNEIKLLK NFSIIHYRNDKKSLFNKKLEIFREAQKILGISLIILYILIINIFMIFPNIIYTNYINTFY NSSIFFIILYISISFYKLFFKYTVSVYFITLLTFFGISLIIFSVIQNIILTFSLMMTSIL LFYIITYLLFSIDFFKKINICLKISVIFMIVINILLYYLISLLKNYYWIFFIISICIMIL LSLAPFMIYIIMYNKNNRENLYENFLKKFFGKEITQFFILLIICIGIMTFSLSKTYLYFK NFIPFFTLEEYIKENNSNIFKSIINYLTIKRKLNDILFLVSIVSGTYFLIGEILVYFKIK KYKEKAQEIYEDIIMNERYYYEQMKKCVYYGEEEYRNLFFNNDKIREIIKQNEKYYKDNN >gi|296153482|gb|ADVK01000056.1| GENE 21 18160 - 18360 430 66 aa, chain - ## HITS:1 COG:FN0528 KEGG:ns NR:ns ## COG: FN0528 COG1278 # Protein_GI_number: 19703863 # Func_class: K Transcription # Function: Cold shock proteins # Organism: Fusobacterium nucleatum # 1 66 6 71 71 102 100.0 2e-22 MKGTVKWFNKEKGFGFITGEDGKDVFAHFSQIQKEGFKELFEGQEVEFEITEGQKGPQAS NIVVIK >gi|296153482|gb|ADVK01000056.1| GENE 22 18782 - 19903 1447 373 aa, chain + ## HITS:1 COG:FN0527 KEGG:ns NR:ns ## COG: FN0527 COG2872 # Protein_GI_number: 19703862 # Func_class: R General function prediction only # Function: Predicted metal-dependent hydrolases related to alanyl-tRNA synthetase HxxxH 
domain # Organism: Fusobacterium nucleatum # 1 373 1 373 373 610 99.0 1e-174 MENIKVNIKKISDKTYEILTSPFYVDGKGGQLGDRGTIFDANIVEVKENLVILDKDLEDG EYSYFINEKRREDIRQQHTAQHIFSAEAYNNFGLNTVGFRMAEEYTTVDLDQKDISKEII DKLEELVNKDIKADIVIEEEIYTNEEAHKIENLRKAIKDKIKGDIRFIKIGNIDICACAG FHVSRTSEIEIFKLINYENVKGNYTRFYFLAGERAKADYNKKHDIIKKLTNVFSCKDDEI LEMLDKSLAEKAKITTELKSLSIKYAELMVKDFENTFIEYKEHKILIYNEDENLANILVK FVNLDKFLLLSGYDKSFSLNSNIYDCKAIILNITKSFPTIKGGGGKNKGNIKLDKTYSRN ELIELIKKGIDEQ >gi|296153482|gb|ADVK01000056.1| GENE 23 19903 - 20979 1062 358 aa, chain + ## HITS:1 COG:FN0526 KEGG:ns NR:ns ## COG: FN0526 COG0820 # Protein_GI_number: 19703861 # Func_class: R General function prediction only # Function: Predicted Fe-S-cluster redox enzyme # Organism: Fusobacterium nucleatum # 1 358 1 358 358 659 99.0 0 MKNEKINILNLTQEELTELLVSLGLKKFYGKEVFIWLHKKITRSFDEMTNLSLKDREILK EKTYIPFFNLLKYQVSKIDKTEKFLFELEDGGTIETVLLRHKDSKNKEIRNTLCVSSQVG CPVKCSFCATGQSGYMRNLSVSEILNQIYTVERRLRKKGENLNNLVFMGMGEPLLNIDNL SKALSIISNENGINISKRKITISTSGVVSGIEKILLDKIPIELAISLHSAINEKRDKIIP INKNFPLEDLSAVLIEYQKQTKRRITFEYILIDNFNISETDANALADFIHQFDHVVNLIP YNEVEGAEHTRPSVKKINKFYNYLKNVRKVNVTLRQEKGSDIDGACGQLRQRNKKGDN >gi|296153482|gb|ADVK01000056.1| GENE 24 20983 - 23235 3033 750 aa, chain + ## HITS:1 COG:FN0525 KEGG:ns NR:ns ## COG: FN0525 COG0744 # Protein_GI_number: 19703860 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane carboxypeptidase (penicillin-binding protein) # Organism: Fusobacterium nucleatum # 20 750 1 731 731 1345 99.0 0 MKKILILLLKLIGALFIIGVIGVFAIIIKYRLELPNIQSMVEDYKPQMATIIYDKNNNVV DTLSVESREIAKLENISPYVKEAFLSIEDKQFYSHHGLNFRGITRAIVTTFLKGRPTQGG SSITQQLAKNAFLTPERTFSRKVKEAILTYQIERTYTKDEILERYLNEIYFGSGSYGIRN AAEQYFKKDVKDLNIAEAALLAGIPNRPTKYDPNRNLENALYRQKIILKEMYTDGRITKE QYDEALAYKFELENEENIKNVPKNTSIIYNKRTKNTYKNPELTTIVEDYLAEIYDEEQIY TSGLKIYTTIDLEYQKVAKETFNSYPYFKNKEINGAMITLDPFTGGIVSIVGGKNFKAGN FDRATMARRQLGSSFKPFVYLEALQNGFDPYSVVVNDFVAFGKWAPKNFDGRYTYNSTLV NSLNLSLNVPAVKLLDAVTVEAFKEAIGDNVKLTSEVKDLTAALGSVDSTPVNVAANFSI 
FVNGGYIVKPNIIREIRDNQDILIYVAEIEKTKAFDSVDVSVITAMLKTVVSNGTASKAR VVDKTGKPIQQGGKTGTTNEHRTAWFVGITPEYVTACYIGRDDNKPMYGKATGGSAVAPM WAKYYQTLINKGLYTPGKFEFLENYLETGDLVKQNIDIYSGLLDGPNSKEFTVRKGRLQV ESAGKYKNGIASVFGLDGNVTDGAGIDMSEGMIIDTEIEEGTVTEGGTGEGTTQTLNTNT TTSTEGNIPPVQNNTSNKDEDSLTDRLLGD >gi|296153482|gb|ADVK01000056.1| GENE 25 23237 - 25996 2736 919 aa, chain + ## HITS:1 COG:FN0524 KEGG:ns NR:ns ## COG: FN0524 COG0210 # Protein_GI_number: 19703859 # Func_class: L Replication, recombination and repair # Function: Superfamily I DNA and RNA helicases # Organism: Fusobacterium nucleatum # 1 919 1 919 919 1519 97.0 0 MSNLNKKQFEAVETVNGPVVIIAGPGTGKTKTLVERTVNILINKEVEAKKIMITTFTNKA AKELELRINERLEELNKNIDISDIYLGTMHSIWTRLIQENITYSNFFDNFELMSGDYEQH FFIYSRLKEYKKLEDYQKFFDNLSYNENKYRSDWQKSSFLRNKINDLNENAIDIESVQTS DIYINFIKSAYKLYEKQLFENNIVDFSYLQVEFLNMLLNNADFLEKINNNFDYIMVDEYQ DSNKIQEKILLFISKNKKNICVVGDEDQSIYRFRGASAENILNFSKHFNNDECKCIVLEE NYRSVNDIVEFNNNWINAINWYGNRFEKNIVSMRLDNILNKSVFHISGTTSDENIRNTVT FIKKLKQSNKITNYNQIAVLFSHFKDRSAKKLEDALKKENIEVYSPRTKVFFEMYEVKLT FGVILACFKKYFPEEALDIYLLECLDLARIEIRKDNEFLTWIKDKIENISEYSFNSLNEI FYELLNFSYYKNVLKEESPIEARANHNLAILSKIFRNFQKYVHSKKISVEDDFSIIKYFF TKYLEILKQSRVDEIFSEEDYPNDCIPFLTIHQSKGLEFPVVIVFSLYSKPNVSGDLSRQ TSIDRLINSNSKISEIDKEYFDFYRKFYVAFSRAKNLLVLSCYEKGVSENFKPFFYSVRG VNSLQFDINQINLDEVSKKDERRILSYTTDIALYRYCPMKYFLVREKEYSTFDKKVFNLG IITHKAIEHINKSFLQKQEIFSDNYIEDLVKNIYKFQNIDLDNNVERIINIVKKYIKDEK DNFKYIKKVEASEFRVEDNYILYGQIDLILEDENEIKIIDFKTGKYNEIEFFSNYRQQLS LYKLLLQKKYDKKIKTYLYYLEEDEPKKEILIDDEDLKEDLENINKTVQDILDKKFPKIP YNQNICGLCEFKNYCWGIQ >gi|296153482|gb|ADVK01000056.1| GENE 26 25993 - 27159 1251 388 aa, chain + ## HITS:1 COG:FN0523 KEGG:ns NR:ns ## COG: FN0523 COG0420 # Protein_GI_number: 19703858 # Func_class: L Replication, recombination and repair # Function: DNA repair exonuclease # Organism: Fusobacterium nucleatum # 1 284 1 284 291 504 96.0 1e-143 MKIVHCSDLHLGKRFSGNKDYVKKRYMDFFNAFSTFIDRVEEIKPDVCLIAGDIFDKKEI NPDILSKTEYLFKRLRDNVKKDIIAIEGNHDNSRILEESWLEYLQEQNILKVFYYNKDFE 
GKNYLKIDDINFYPVGYPGFMIDEALTKLSEKLNPQEKNIVIVHTGISGSIDTLPGLVST SILDLFKDKAIYIAGGHIHSFTTYPKEKPYFFVSGSLEFSNVQNEKSDKKGFILFDTDTL NYEFIELEHRKRIKKDFSYTNFSNLENEFENFVKELNLTGEEILVISVSLNNNDYINIEN LENIAEKNGALKTHILIKNILNIGTSEENNYDLSIDELEKNLINTWNISEIEKFSKSFAR LKELFSNDDRDSFLELFDKTLEVNEDDN >gi|296153482|gb|ADVK01000056.1| GENE 27 27149 - 29914 3225 921 aa, chain + ## HITS:1 COG:FN0522 KEGG:ns NR:ns ## COG: FN0522 COG0419 # Protein_GI_number: 19703857 # Func_class: L Replication, recombination and repair # Function: ATPase involved in DNA repair # Organism: Fusobacterium nucleatum # 1 921 1 921 921 1016 91.0 0 MIIKKVQLENYRSHSNITVEFTKGVNLILGKNGRGKTSILEAISTVMFNTKDRSGKETGK SYIKFGEKSSKVDINFIANDGREYNLKTEFFKTKPKKQTLKDMIGSEYDGDIQEKLEELC GIKKGFEETYENIVIAKQNEFINIFKAKPKDREEIFNKIFNTQIYKEMYDSFLKEAVDKY KEKVKDLDKEITFLKENMEDKEQITNFLKEEEIIKNTLNKDFSKTTETINKLSNEIKDYE TTEIELNNLIKNIKDEENKIKKYLSLLKENIVEAKQAKKSKIIVKETEKSYLEYLNIEKR LKDLRENLDNLLDEQKLNTQYQNNIEKLELSNKNLKVDISNLEENISKNSEKKENLESEI SELKIKEENLDLKLKKYINLLDKLEKLENFKDKKSEDKLKKMTEIDFLKKELVSKNDLFK IIDIEKLGKKLSDFQELEKELKLLEEQKIIFETEIKTLKKSSKELSDKICPFLNEKCQNL EDKEAEDYFSSKISIKTEELENLKKSIEEKTQILVEKVVFEDRKKQYFELEKSIKDLELS SKNEEINLKEIELDVKNLDMDINKLIENQEFQNSQMLREKKKELEVELRNLNLDEKRENL KNLLENLEIEKEKILKNQNSIESNLKKIDEFSKKIKVDTNKNIESIKSEIKTFENKLDDL KNPYNEYIKNNVLAEDLENLLLKVDKNIKELYSLRTDKNLLKEKVSILEEKIKNIKIDEL KEKYDIIKEELNEINKKLGSSQEKIDNYKKILEKISSQEEKEKKLLIEFKKLENKFNKAS LIRNEVGQMGRAISKYMLSGISNIASVNFNKITGRTERIEWSNEEKDKYAVYLVGQERKI AFEQLSGGEQVSVAIAIRGTMTEYFTNSKFMILDEPTNNLDTERKKLLAEYMGEILNNLE QSIIVTHDDTFREMAEKIIEL >gi|296153482|gb|ADVK01000056.1| GENE 28 29923 - 30591 773 222 aa, chain + ## HITS:1 COG:FN0521 KEGG:ns NR:ns ## COG: FN0521 COG1636 # Protein_GI_number: 19703856 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 222 1 222 222 349 92.0 3e-96 MKVNYDLKMEEILKEVSESGKKKRLLIHSCCGPCSSSVLEYLKDYFKIDIYYYNPNITFD YEYWARMAEQKEMLKKLDYDMNVIEGVYNPKKDFFEKIKGLENEKEGGQRCYSCYDIRIG 
ETAKKAKKEGYDFFSTVLSISPMKNVNYINEIGERYSKEYDIPFLFADFKKKNRYLRSVQ ISKELNMYRQEYCGCVFSKVEKEQRDREKALKEKQEGEIKND >gi|296153482|gb|ADVK01000056.1| GENE 29 30584 - 31201 502 205 aa, chain + ## HITS:1 COG:no KEGG:FN0520 NR:ns ## KEGG: FN0520 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 205 1 205 205 347 96.0 2e-94 MTSFSEKTVRGISLLFLVIFGFLTYKNYYYSPLIVLTIMMYFSTKGVQMFENRIFLSTRA IFWILFSTLLFLRIYFNESSHLDMKNTKTLLTISLISICIGTWVGDFFAKYIYIRIKFCI NRFFSTSNKGTYRIVKMENTQQNYLKSLGKKMGIMFYHITLDVNGEEKKFLLEKELFEKL QGKSEININIKKGCLGICYGVGMQE >gi|296153482|gb|ADVK01000056.1| GENE 30 31210 - 32241 1138 343 aa, chain + ## HITS:1 COG:FN0519 KEGG:ns NR:ns ## COG: FN0519 COG2849 # Protein_GI_number: 19703854 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 14 343 14 343 343 500 94.0 1e-141 MKKFLIILTFIFCSLLGFADNVNVGVRIPLGEQKVELDFNVKLLLGMIKAENNPKYKKLL DYIDENLAKKGEVKYSANISLKRASSEVFSENGELLYEEKLPKEFMNFVNYSLSAADDKA KVKKFIKGSYEKPAYMIISKNNGKPKIFIERSMEPDEHKIKTTTEVTLKRELTEAEKKEL LSLKNDKLITKYKSYIDSEISKAYTDNNLTMVQEFKNLTETSIVYIKNNESIKKEIKYTD NTLSNGTMKSYKNDKLMEEIIFENSMSNLQKSYHDNGNLAFEIPMKNGAINGEVKIYYEN GKIKESVSVKNGKREGVAREYSETGKVIKEVLYKDDKEIKKIK >gi|296153482|gb|ADVK01000056.1| GENE 31 32263 - 33252 1131 329 aa, chain + ## HITS:1 COG:FN0518 KEGG:ns NR:ns ## COG: FN0518 COG2849 # Protein_GI_number: 19703853 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 30 329 1 300 300 420 92.0 1e-117 MKKFLIILTFLFISIFSYSADIDFDIRVMMGLTKIKENDNPKYKKFLNYIDENLVKKNEV KYSHKLNMDKKAVEFFSEKGEILLTENLPKEFLDIVDNSIRVAVNKEEIKKTIKNIYEDP YTHVSISKYKENLILFTEENMVNRGKIKNTISVVLKRELTDNEKNELIYLKENNNDEFSK KYKTYIESETTKTYINDKLELFQEIKGLTDTIILYRKNEVSKEVLEYTDSSRLNSVAKEY RNDRLLKETFVKNKKIVLEKEYYANGKLAREIPLKDGLINGEAKDYYENGKIRSTTNFVN GDIDGVVKEYSQAGKVVKETLYKNGKKVK >gi|296153482|gb|ADVK01000056.1| GENE 32 33471 - 34793 1568 440 aa, chain + ## HITS:1 
COG:FN0517 KEGG:ns NR:ns ## COG: FN0517 COG1538 # Protein_GI_number: 19703852 # Func_class: M Cell wall/membrane/envelope biogenesis; U Intracellular trafficking, secretion, and vesicular transport # Function: Outer membrane protein # Organism: Fusobacterium nucleatum # 1 440 10 449 449 748 97.0 0 MMEGLKEKEISTENLRREKEGMLSLDECIDLALKNNSQIKLKEIETKIAKIDKNISFGNF LPRISAMYSISELDRYVSTTIPAPDVTLGILGGITLPSLPATLTSRMVDKDFKNYALTAQ LPIFVPATWFLYSAREKGENISLYTKDLTKKMIKLKVISEYYYILALESEKRVLESEYEY AKNLNKNANLALEAESILKWQKEQTELLIKQKENAIKNNKRDLQIAKMNLMNDLGLDLNA DFRFVIPEDTVYKLPPLEDVVYDALINSELIKISNNVVAISKDKIKIAMSNFLPQISLGG GVIGTGLSFLNPQNILFGAVTGFLSLFNGFKNVNEYKKAKLQSEAAYIQREDVIMNTIIS AVNSYNNVQKSIEDKELADMNYNIAEQKFKQKKLEKEVGNITDVDLLNEITELEKAASLK EKADYKYSVSVEALKMLIEK >gi|296153482|gb|ADVK01000056.1| GENE 33 34803 - 35876 1294 357 aa, chain + ## HITS:1 COG:FN0516 KEGG:ns NR:ns ## COG: FN0516 COG0845 # Protein_GI_number: 19703851 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Membrane-fusion protein # Organism: Fusobacterium nucleatum # 1 357 1 357 357 604 99.0 1e-173 MKIKYILLFLIIFIFTACKKDAEEEIIRPVKIQEINSIQDENFNIDFPAQISPTQKTVLA FKYAGKIKSINFESGDFVKKGQVIATMDNKDYMVNLEAFSKKYEAAKAVAQNAEVQFSRA EKLYKGGALAKKDYDNALMQKNVAISTFKEASAGLENARNTLNDTKIIAPYDGYIDKKIV EVGTVVPEGGPVISFISNEITDISINASLKDVENIRNAKNIVFKDNSSEKVYPLEIKNIA QNPDSINLTYPVIFTFSNLGKDEKFLSGQTGTVTISVKNNGNQEILIPLDALFEDNGSNV YLFKNGVAVKTAVEIGELRETDKISIIKGLKSGDKVIVAGVRKLVDGEKVKLLGGTK >gi|296153482|gb|ADVK01000056.1| GENE 34 35873 - 38941 3510 1022 aa, chain + ## HITS:1 COG:FN0515 KEGG:ns NR:ns ## COG: FN0515 COG0841 # Protein_GI_number: 19703850 # Func_class: V Defense mechanisms # Function: Cation/multidrug efflux pump # Organism: Fusobacterium nucleatum # 1 1022 1 1022 1022 1908 99.0 0 MKIIEYSIKNRIVIIFATLILTVAGIFAYFKLGKLEDPEFKVKEAIVITLYPGASPESVE QEVTDKIEIALRKIPNADVDSISKAGYSEVHIKIDESTPSEEVDQQWDIVRKKITDVRAS LPLGALPAVILDDYGDVYGMFFAITSEGFSRDELYDYVKDIRKELEKTSGVAKTTLFGNR DAVIEVLVDKNKLANLGINEKMIVLAFTSQNIPAYANSVLHGDRNIRFDIDQSFESIEDI 
ENLVIYSTPPVLNIQKPTTVLLKDIAEVRRTEVNPYTTKMRYNGKEAIGLMLSPVTGTNV VETGKEINKKIELLKQDLPYGIEIEKVYYQPELVSSAINQFIINLIESVVVVVGILLITM GIKSGLIIGSGLILSILGTLIAMLGMKIDLQRVSLGAFIVAMGMLVDNSIVVVDGVLDSL DSGNNKYASLTKPTEKTAIPLLGATFIAIIAFLPMYLMPTTAGEYIKSLFWVVAISLGLS WIISLTQTTVFCDIYLSENNLKSGNGRGKLLHDKFSALLEKILIYKKLSVLILLGAFFLS MLLFIKVPFSFFPDSDKKGFVINLWNPEGTDIECTNKISEAVENEILKQNGIISVASAIG GSPSRYYIATIPELPNTALSQLIITVKNFKDIDKISKIVKELVDNNFPDTRVEIRKYANG MPTRYPIQLRIVGDDPNILREYSKKFGNILRNIEGAENVQTDWKEKQLVIKPELDKVKER ESLVTALDIATSLNRSINGIKIGTFKDGEENIPVLFKEKNDSREFNINNLGQVPVWGLGL KSIPFRELIKKENLIWENPIIIRKDGFRAIQVQADVKSRYRVEDVRKRFSKAIKESKIEL PKGYKLEWSGEYYEQEKNTEEIISYVPLQLIIMFMTCVLLFGNLRDPIIIFGVLPLSFIG ILPGLFITGRTFGFMAIIGTISLSGMMIKSAIVLIDEIRYEIYTLKKEPFKAIIDSSASR IRAVSLAAGTTVLGMIPLMFDPLFSDMAITIVFGLTITTMLILFVVPLLYSIFYKINKPK EN >gi|296153482|gb|ADVK01000056.1| GENE 35 38997 - 39437 635 146 aa, chain - ## HITS:1 COG:no KEGG:FN0514 NR:ns ## KEGG: FN0514 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 14 146 1 133 133 235 98.0 4e-61 MEEFQTEQKSNFFMGVICLLAGALVTCVLYFGVSRLGIFSFWVSAIGITISLMGYNYFVK GSGNFGFILGSILNAIGIIYGEFLDTCAIVAKSYNMSMSDLMFNTDLLREVLITGSFWIY PAIGIAVMLVVGFQNRKSDFSDKDNE >gi|296153482|gb|ADVK01000056.1| GENE 36 39684 - 40112 779 142 aa, chain + ## HITS:1 COG:FN0513 KEGG:ns NR:ns ## COG: FN0513 COG0716 # Protein_GI_number: 19703848 # Func_class: C Energy production and conversion # Function: Flavodoxins # Organism: Fusobacterium nucleatum # 1 142 1 142 142 251 100.0 2e-67 MNKISLVYYSATGNTEQMAKAIEEGIVEAGGKVTVYKANEMNKEDILSSDVIVMGSSATG AEVIDENDMLPFMEEAGDKFKGKKVYIFGSYGWGGGEYADNWKAQLEGFGANIVAMPILA NENPNDDELAQLKEIGKKLVTI >gi|296153482|gb|ADVK01000056.1| GENE 37 40283 - 41494 1752 403 aa, chain + ## HITS:1 COG:FN0512 KEGG:ns NR:ns ## COG: FN0512 COG0426 # Protein_GI_number: 19703847 # Func_class: C Energy production and conversion # Function: Uncharacterized flavoproteins # Organism: Fusobacterium nucleatum # 1 403 1 403 403 823 100.0 0 
MYCCTKINDDIIWIGINDRKTERFENYIPLDNGVTYNSYLIMDEKICIVDGVEEGENGNF LSKIEAMIGNSPVDYIIVNHVEPDHSGSIKNMLKIYPELKVVGNAKTIMMLKLLGLDLPD ERVVIVKEKDILDLGKHKLTFYLMPMVHWPESMATYDITDKVLFSNDAFGSFGALDGAIF DDEVNTDFFTDEMRRYYSNIVGKFGAPVNAILKKLSSVEISCICPSHGLIWRKYIKEIIE RYQKWANMEPTKEGVVIVYGSMYGNTAEMAEILGRELGNRGIKDVIIYDSSKTDHSYIFS TIWKYKGLMLGSCAHNNDIYPKMEPLLHKLENYGLKNRYLGIFGNMMWSGGGVKRIKEFA NTLTGLEQVGEPIEIKGHVTSAERDRLIELANLMADKLIANRK Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:11:34 2011 Seq name: gi|296153478|gb|ADVK01000057.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00071, whole genome shotgun sequence Length of sequence - 1261 bp Number of predicted genes - 3, with homology - 3 Number of transcription units - 1, operones - 1 average op.length - 3.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 10 - 264 283 ## FN2052 hypothetical protein 2 1 Op 2 . + CDS 278 - 817 957 ## FN2051 hypothetical protein 3 1 Op 3 . + CDS 851 - 1258 588 ## FN2050 hypothetical protein Predicted protein(s) >gi|296153478|gb|ADVK01000057.1| GENE 1 10 - 264 283 84 aa, chain + ## HITS:1 COG:no KEGG:FN2052 NR:ns ## KEGG: FN2052 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 84 36 119 119 91 100.0 9e-18 MERKEEFRQEKETLEKEVQELKERQLGREELYAKLKEDSKIRWHRDKYKKLLKRFDEYYN KLEQKIADKEQQIVELTKLLEVLN >gi|296153478|gb|ADVK01000057.1| GENE 2 278 - 817 957 179 aa, chain + ## HITS:1 COG:no KEGG:FN2051 NR:ns ## KEGG: FN2051 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 179 1 179 179 207 99.0 1e-52 MKKFIKSILFLCALSSLAYAEEAAPDVTSGTTMSAEEQKEAMDILDRMREKIEKEEAEKA KLIAEAKELGMSPSEVASMDNVEEMLEAKRAAEAKPKTEAEKLELTRKKALNKLDFYERV VRSVAREENEVSDYYGVMGEEKQRSTVYLGTAEAAAEQQVEQNAAPAEIQPETPEEAAK >gi|296153478|gb|ADVK01000057.1| GENE 3 851 - 1258 588 135 aa, chain + ## HITS:1 COG:no KEGG:FN2050 NR:ns ## KEGG: FN2050 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # 
Pathway: not_defined # 10 135 1 126 126 138 97.0 7e-32 MKNKILFGTMLALLLVGSVSFADDDADKKRLLEEYDKMQAEKAKEAERMAKENPQAIEVA GENGEVVVTEGEEVAMTPKKSEKDMTESERMDVEVQRIKKRMLEINDKIENYNKTNEMID NLEKNVGELERKVNY Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:11:44 2011 Seq name: gi|296153477|gb|ADVK01000058.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00072, whole genome shotgun sequence Length of sequence - 3174 bp Number of predicted genes - 3, with homology - 0 Number of transcription units - 2, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 22 - 61 2.1 1 1 Op 1 . - CDS 153 - 296 267 ## 2 1 Op 2 . - CDS 257 - 379 295 ## - Prom 508 - 567 71.3 + LSU_RRNA 498 - 2860 93.0 # FJ410389 [D:301..3086] # 23S ribosomal RNA # Fusobacterium necrophorum # Bacteria; Fusobacteria; Fusobacteriales; Fusobacteriaceae; Fusobacterium. + 5S_RRNA 3052 - 3167 100.0 # AE009951 [D:1076861..1076976] # 5S Ribosomal RNA # Fusobacterium nucleatum subsp. nucleatum ATCC 25586 # Bacteria; Fusobacteria; Fusobacteriales; Fusobacteriaceae; Fusobacterium. 3 2 Tu 1 . - CDS 3084 - 3173 126 ## Predicted protein(s) >gi|296153477|gb|ADVK01000058.1| GENE 1 153 - 296 267 47 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MGIVFTFFSSRYLDVSVHAVPSFVLRLHLNRLLHSEILDSYVRLQLI >gi|296153477|gb|ADVK01000058.1| GENE 2 257 - 379 295 40 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MSLYPLNTTAVSLTYLRFRLDPVRSPLLWVSFLLSFPRVT >gi|296153477|gb|ADVK01000058.1| GENE 3 3084 - 3173 126 29 aa, chain - ## HITS:0 COG:no KEGG:no NR:no KLLGKSILSQAASSQVPSAYMGLTSRFGM Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:11:59 2011 Seq name: gi|296153471|gb|ADVK01000059.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00074, whole genome shotgun sequence Length of sequence - 3351 bp Number of predicted genes - 7, with homology - 3 Number of transcription units - 3, operones - 1 average op.length - 5.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 1 - 117 184 ## + Term 128 - 168 7.4 - Term 113 - 157 1.5 2 2 Op 1 . - CDS 180 - 332 202 ## 3 2 Op 2 . - CDS 334 - 486 115 ## 4 2 Op 3 . - CDS 555 - 737 217 ## gi|296329408|ref|ZP_06871905.1| conserved hypothetical protein 5 2 Op 4 . - CDS 766 - 831 137 ## 6 2 Op 5 . - CDS 837 - 1295 570 ## COG0454 Histone acetyltransferase HPA2 and related acetyltransferases - Prom 1492 - 1551 14.8 + Prom 1424 - 1483 16.5 7 3 Tu 1 . + CDS 1522 - 3349 2919 ## COG5295 Autotransporter adhesin Predicted protein(s) >gi|296153471|gb|ADVK01000059.1| GENE 1 1 - 117 184 38 aa, chain + ## HITS:0 COG:no KEGG:no NR:no AKLNNQESKINAQGAENKDLKERVKNLEEKLNKLLKTK >gi|296153471|gb|ADVK01000059.1| GENE 2 180 - 332 202 50 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MIIEYYRKNRELKKVYEIRENGKIYDKENILQKYTEVVEDILETLSELES >gi|296153471|gb|ADVK01000059.1| GENE 3 334 - 486 115 50 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MAMKKIIFIVCIFLLSINIFAKTNVEKQVEKIREEFVKINSEKNYKVEMN >gi|296153471|gb|ADVK01000059.1| GENE 4 555 - 737 217 60 aa, chain - ## HITS:1 COG:no KEGG:no NR:gi|296329408|ref|ZP_06871905.1| ## NR: gi|296329408|ref|ZP_06871905.1| conserved hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] conserved hypothetical protein [Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726] # 1 60 18 77 77 101 100.0 2e-20 MLKNYPLEAYKGINGVSLYMKQLNNKTVQDEIFSYLSTDNTLFMMFMDQDYPDSEDFYKK >gi|296153471|gb|ADVK01000059.1| GENE 5 766 - 831 137 21 aa, chain - ## HITS:0 COG:no KEGG:no NR:no MKKIVALLLLILSIYAFRAKK >gi|296153471|gb|ADVK01000059.1| GENE 6 837 - 1295 570 152 aa, chain - ## HITS:1 COG:FN1295 KEGG:ns NR:ns ## COG: FN1295 COG0454 # Protein_GI_number: 19704630 # Func_class: K Transcription; R General function prediction only # Function: Histone acetyltransferase HPA2 and related acetyltransferases # Organism: Fusobacterium nucleatum # 16 146 1 131 135 205 85.0 3e-53 MIEVKQLFNNEKDKALLFAKRVYIESKDESYSEQGIETFCNFVNNKEITKSFKVYGAFED NILKGIIATDRRKRHISLFFVDKVSQGKGIGKKLMSIVIDDNENSFITVNSSRYAVPIYE KIGFIKTEEEKEQDGLKFTPMKLVLKDEVMEE >gi|296153471|gb|ADVK01000059.1| GENE 7 1522 - 3349 2919 609 aa, chain + ## HITS:1 COG:FN0735 KEGG:ns NR:ns ## COG: FN0735 COG5295 # Protein_GI_number: 19704070 # Func_class: U Intracellular trafficking, secretion, and vesicular transport; W Extracellular structures # Function: Autotransporter adhesin # Organism: Fusobacterium nucleatum # 194 609 27 439 617 169 42.0 2e-41 MKKYANLKLIVFSLLLVAGNITYSAAPVFQAGAGANSTEAGVGNVASGESSSAVGKNNKA SKYDSSALGAANTASGEYSSAVGYGNKASELHSSAVGADNTAEGNYSSAVGAKNEATKNG SSAFGNKNKASELLSSAFGANNKANGEASSAVGYGNKAEGKLSSTFGANNKAKGELSSAF GYMNEANGEDSSAFGITNITSGVHSSAFGTDNEATKNDSSAFGYQNKADGEFSSAFGANN IASKEYSSAFGYQNKAEGKLSSTFGANNEAEGELSSAVGYKNNATGENSSAFGYQNKARR KNSSAFGNDNIATGDYSSAVGYKNDATEESSSAFGNENKARGKNSSAVGYKNNATEESSS AFGTQNAASGKNSSAFGAGNGAIGENSSAFGNDNTATGKSSSAVGADNEASEDYSSAFGN ENKASGKNSSAFGTNNEANGLASSAFGTNNIATGDYSSAVGNANIADGKNSSAFGADNEV KGEGSSAFGYKNKVTGNHSGAFGDPNIVTGNRSYVFGNDNTVTGGGSYAFGNDNTINGNN NFVLGNNVTIDAAIQNSVALGNGSTVSSSNEVSVGSKGKERKITNVADGEISATSTDAVN GRQLYNAMQ Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:21 2011 Seq name: gi|296153469|gb|ADVK01000060.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00076, whole genome shotgun sequence Length of sequence - 584 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . + CDS 2 - 556 694 ## COG2963 Transposase and inactivated derivatives Predicted protein(s) >gi|296153469|gb|ADVK01000060.1| GENE 1 2 - 556 694 184 aa, chain + ## HITS:1 COG:FN0485 KEGG:ns NR:ns ## COG: FN0485 COG2963 # Protein_GI_number: 19703820 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 16 184 1 169 169 221 99.0 4e-58 NCTQNLGHKIGGAVFLSKLTREEKIEIFERRKMGETISSLAKAFNIHESNIKYLIALIEK YGNNILRKGKNRAYSKEFKLQAINRILINHESINSVALDIGLVSASILHNWLSKFKENEY NVVEKKKGRKPKSMTKPKKNDKELSEKEKIKKLEEENLYLKAENEYLKKLRALVQERELK EKKK Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:22 2011 Seq name: gi|296153467|gb|ADVK01000061.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00077, whole genome shotgun sequence Length of sequence - 1957 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 8 - 1957 2443 ## COG5651 PPE-repeat proteins Predicted protein(s) >gi|296153467|gb|ADVK01000061.1| GENE 1 8 - 1957 2443 649 aa, chain - ## HITS:1 COG:FN1381 KEGG:ns NR:ns ## COG: FN1381 COG5651 # Protein_GI_number: 19704716 # Func_class: N Cell motility # Function: PPE-repeat proteins # Organism: Fusobacterium nucleatum # 1 649 528 1176 1176 1052 98.0 0 EGDTNGLFAKNKVTVEEKANLNIETENKIALKLGENLETFGNINIKNARTGILADNTTSV LKFENGSKTIVNAKDVAVNAQKGTSEFKGGSNVELTSNKYALVAKKAVFEKGSTVKLNAE YGIGTLSETDSSDVTFNSQSEVNINSSNSGIYNVNVGGSGKLFIKAPNVLKQVRTVNQSL KFESGSVLEGNIEKSWNANLILDKGSKMFVNNKIEANMDIKGDLFVGTRNSYEKEESKNS MQTLSTMSTFSSSDKYYTVHYNKDSNGHKTKVNLDNANIHLRINGEQSESNDKIVFSKDT EITGKGEITLHPENVSKVKRNMTYSLLEEEGKDVGGNKVYNLEKTSLTLKTVEFGPLVYG RKDKKVNGKYIVTLEDTGELSQAAKNTLTNSRDSYKYNQEELKAIGDKIFESHNLELKNN FWLLVSKENLKNKDSVIKLNQNTKYAGYDYTFNTGISLGFFAGQSTGRYKNIGQGIYLKK DLKPFYLGTVYKHTKSKDKNNDKKLHSNDFSFVVGYNKDISEKTFLDGNIKLTRGYISDY KYTAENDLNTRNEKTNYWNGEINAKLGYKFKYGDAFLKAGLDKDLKGNQKVIWNENIDEE IKYDDLSKNIGIGMEYKVKQHSFNIELSRKYSKHYRANTKISFGYSYKF Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:23 2011 Seq name: gi|296153465|gb|ADVK01000062.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00078, whole genome shotgun sequence Length of sequence - 1769 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 3 - 1746 2330 ## COG5651 PPE-repeat proteins Predicted protein(s) >gi|296153465|gb|ADVK01000062.1| GENE 1 3 - 1746 2330 581 aa, chain - ## HITS:1 COG:FN1381 KEGG:ns NR:ns ## COG: FN1381 COG5651 # Protein_GI_number: 19704716 # Func_class: N Cell motility # Function: PPE-repeat proteins # Organism: Fusobacterium nucleatum # 56 581 1 526 1176 906 99.0 0 MRKIFWKVTLLSCLISLFAYGGWNELGEGEKVEKTLDENGKVTTVKKLILGKNSNLKVKG KNTKGILVKGSYFQSKVNIGEGATLDIDIESSDKDSSGISFAEKRDFIIEKNGKLNVKNL FNGTINSGDILRQFHSSKEGKGILVRGVSVAKKFTTEGNTRIESSGAGIGFDKTTSNGQG EIEFKKGSKTFVKGGIAGVFARYNNVTFEEGSDTTLIGGAYGLMANKGTIKKGAKVTLMG NYGTGKFEVKEHDRLTFESGSELNILANNSALYGIRLVGKTKLIAKGYNVLRQIDTYADG EHSLNTVTFEKGSILDGNIDRSWNSQFLLEEGTKLFVPEKIQANLEIKGDLYVGPRSAYE GKQKTYKETVEKSSDVDTVVGATAIFMGKQAQRARKGEIKYGKTYDDFSANEYYTLKYNE NSYNYKSTLNLNNGKIHLRLGTPDRLGRINDKIIFSKNTVINGKGELVFHKRNSSQVTKN TVFKILEEEGLKNINGTQGYYLEKLPLKIPDVSFGQLVFTTKVQKLNGRYVVSLVFKGIE ADNFLLAKGETKTLNKKKEGSLEDAILSAKEVTIEEKSTLN Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:24 2011 Seq name: gi|296153462|gb|ADVK01000063.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00081, whole genome shotgun sequence Length of sequence - 682 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 . + CDS 5 - 253 349 ## FN2049 hypothetical protein 2 1 Op 2 . 
+ CDS 223 - 681 162 ## PROTEIN SUPPORTED gi|163756109|ref|ZP_02163225.1| 30S ribosomal protein S1 Predicted protein(s) >gi|296153462|gb|ADVK01000063.1| GENE 1 5 - 253 349 82 aa, chain + ## HITS:1 COG:no KEGG:FN2049 NR:ns ## KEGG: FN2049 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 82 1 82 82 133 100.0 2e-30 MKKLAILALGVLSLVACTDQKVVNYNTARLDIVEDYLRNHKYVKPSENLDKLIEDGKIEY AEEYVSLEKEAKQWEREKTQQQ >gi|296153462|gb|ADVK01000063.1| GENE 2 223 - 681 162 153 aa, chain + ## PROTEIN SUPPORTED ## NR: gi|163756109|ref|ZP_02163225.1| 30S ribosomal protein S1 [Kordia algicida OT-1] # 69 152 244 327 347 67 38 3e-12 MGKRKNSTTIMVMLFLLIFSLPALAVQALTTTQMRENSIRINALELKNVDILNSEAPKEM TIVLDERSLNFDFDKSNVKPQYYDLLNNIKEFVEQNNYEITIVGHTDSIGSNAYNFKLSR RRAESVKAKLLEFGLSEDRIVGIEAMGEEQPIA Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:27 2011 Seq name: gi|296153460|gb|ADVK01000064.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00082, whole genome shotgun sequence Length of sequence - 1164 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . 
- CDS 9 - 1163 1635 ## COG4625 Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain Predicted protein(s) >gi|296153460|gb|ADVK01000064.1| GENE 1 9 - 1163 1635 384 aa, chain - ## HITS:1 COG:FN1950_2 KEGG:ns NR:ns ## COG: FN1950_2 COG4625 # Protein_GI_number: 19705252 # Func_class: S Function unknown # Function: Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain # Organism: Fusobacterium nucleatum # 1 384 49 432 432 694 99.0 0 VEVQDGKVVATLSRQNPVEYIGENAEASTKNVAENVENVFQDLDKKVMSGTATKEELAMG AIVQNMTTMGFTSATEMMSGEIYASAQALTFSQAQNINRDLSNRLAGLDNFKNSNKDSEV WFSAIGSGGKLKREGYASADTRVTGGQFGIDTKYKGTTTLGVAMNYSYAKANFNRYAGES KSDMVGVSFYAKQDLPYGFYTAGRLGLSNISSKVERELLTSTGETVTGKIKHHDKMLSAY VEIGKKFGWFTPFIGYSQDYLRRGSFNESEASWGVKADRKNYRATNFLVGARAEYVGDKY KLQAYVTQAINTDKRDLSYEGRFTGSAARQKFYGVKQSKNTTWIGFGAFREISPVFGVYG NVDFRVEDKKWADSVISTGLQYRF Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:27 2011 Seq name: gi|296153458|gb|ADVK01000065.1| Fusobacterium nucleatum subsp. nucleatum ATCC 23726 contig00083, whole genome shotgun sequence Length of sequence - 521 bp Number of predicted genes - 1, with homology - 1 Number of transcription units - 1, operones - 0 average op.length - 0.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Tu 1 . - CDS 41 - 310 426 ## COG2388 Predicted acetyltransferase - Prom 445 - 504 12.5 Predicted protein(s) >gi|296153458|gb|ADVK01000065.1| GENE 1 41 - 310 426 89 aa, chain - ## HITS:1 COG:FN1391 KEGG:ns NR:ns ## COG: FN1391 COG2388 # Protein_GI_number: 19704723 # Func_class: R General function prediction only # Function: Predicted acetyltransferase # Organism: Fusobacterium nucleatum # 1 89 1 89 89 157 97.0 4e-39 MDIIHSEGKGFYIYDENKEILARLEYKKNDNILTFDHTVVSDKLKGQGIAQKLLDKAVDY ARKNNFKVHPVCSYVVKKFETGNYDDIKI Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:28 2011 Seq name: gi|296153455|gb|ADVK01000066.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00085, whole genome shotgun sequence Length of sequence - 1111 bp Number of predicted genes - 2, with homology - 2 Number of transcription units - 1, operones - 1 average op.length - 2.0 N Tu/Op Conserved S Start End Score pairs(N/Pv) 1 1 Op 1 2/0.000 + CDS 1 - 321 104 ## COG3464 Transposase and inactivated derivatives 2 1 Op 2 . + CDS 296 - 1036 442 ## COG3464 Transposase and inactivated derivatives Predicted protein(s) >gi|296153455|gb|ADVK01000066.1| GENE 1 1 - 321 104 106 aa, chain + ## HITS:1 COG:FN0275 KEGG:ns NR:ns ## COG: FN0275 COG3464 # Protein_GI_number: 19703620 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 1 105 85 189 317 185 98.0 1e-47 LTVQRYICKDCKKTFSPSTNIVSDNSSISNNLKYAIALELQKNISLTSIAKRYNISIPSV QRIMDNCYSDFKVNKKHLPEAICIDEFKSVKNIDGAMSFVFVDYQK >gi|296153455|gb|ADVK01000066.1| GENE 2 296 - 1036 442 246 aa, chain + ## HITS:1 COG:FN0599 KEGG:ns NR:ns ## COG: FN0599 COG3464 # Protein_GI_number: 19703934 # Func_class: L Replication, recombination and repair # Function: Transposase and inactivated derivatives # Organism: Fusobacterium nucleatum # 7 246 189 428 428 419 98.0 1e-117 MFLLTIKSKSIIDIVEDRRLHSLTEYFSRFSLEARNNVKYICMDMYTPYISLVNSIFPNA KIVLDKFHIVNLVNRAFNQTRISIMNSIQDDSLKRKFKLFWKSLLKYYPDLCQINYYCQS FKRKLSSKDKVDYLLEKSPELEANFNIYQDIIQAIRHNNFKRFESVVKKYLSTKEKISKK MMIVLKTLKKHMNYIENMFESNITNGVIEGLNNKIKSIKRTAFGYSNFSNFKKRILIQAG IISISA Prediction of potential genes in microbial genomes Time: Sat Jul 9 20:12:34 2011 Seq name: gi|296153436|gb|ADVK01000067.1| Fusobacterium nucleatum subsp. 
nucleatum ATCC 23726 contig00094, whole genome shotgun sequence Length of sequence - 18552 bp Number of predicted genes - 17, with homology - 17 Number of transcription units - 3, operones - 3 average op.length - 5.7 N Tu/Op Conserved S Start End Score pairs(N/Pv) - Term 232 - 285 11.1 1 1 Op 1 1/0.000 - CDS 291 - 1910 1912 ## COG1283 Na+/phosphate symporter 2 1 Op 2 1/0.000 - CDS 1966 - 2838 928 ## COG4866 Uncharacterized conserved protein 3 1 Op 3 1/0.000 - CDS 2851 - 4209 2047 ## COG0624 Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 4 1 Op 4 . - CDS 4238 - 5020 869 ## COG2853 Surface lipoprotein 5 1 Op 5 . - CDS 5010 - 6293 1587 ## FN0280 hypothetical protein 6 1 Op 6 1/0.000 - CDS 6306 - 10655 5433 ## COG2176 DNA polymerase III, alpha subunit (gram-positive type) 7 1 Op 7 2/0.000 - CDS 10657 - 11220 707 ## COG4752 Uncharacterized protein conserved in bacteria 8 1 Op 8 30/0.000 - CDS 11230 - 11946 962 ## COG0336 tRNA-(guanine-N1)-methyltransferase 9 1 Op 9 12/0.000 - CDS 11943 - 12464 808 ## COG0806 RimM protein, required for 16S rRNA processing 10 1 Op 10 . - CDS 12473 - 12712 310 ## COG1837 Predicted RNA-binding protein (contains KH domain) 11 1 Op 11 . - CDS 12724 - 12966 387 ## FN0286 hypothetical protein 12 1 Op 12 1/0.000 - CDS 12975 - 13769 906 ## COG0030 Dimethyladenosine transferase (rRNA methylation) 13 1 Op 13 . - CDS 13779 - 14306 871 ## COG0634 Hypoxanthine-guanine phosphoribosyltransferase - Prom 14533 - 14592 20.0 + Prom 14453 - 14512 12.8 14 2 Op 1 . + CDS 14628 - 15347 788 ## CJA_1210 hypothetical protein 15 2 Op 2 . + CDS 15348 - 16319 969 ## CJA_1209 hypothetical protein + Term 16365 - 16414 10.2 + Prom 16656 - 16715 10.8 16 3 Op 1 12/0.000 + CDS 16783 - 17595 1350 ## COG3959 Transketolase, N-terminal subunit 17 3 Op 2 . 
+ CDS 17620 - 18549 1410 ## COG3958 Transketolase, C-terminal subunit Predicted protein(s) >gi|296153436|gb|ADVK01000067.1| GENE 1 291 - 1910 1912 539 aa, chain - ## HITS:1 COG:FN0276 KEGG:ns NR:ns ## COG: FN0276 COG1283 # Protein_GI_number: 19703621 # Func_class: P Inorganic ion transport and metabolism # Function: Na+/phosphate symporter # Organism: Fusobacterium nucleatum # 15 539 1 525 525 951 99.0 0 MYLKIILQLIGGLGLFLYGMEHMSTSMQKIAGPKLKKILASLTNNRIFGILVGIIITALV QSSSVSTVMTIGFVNASLLTLKQALGVILGANIGTTITGWLLVLDIGKYGLPIVGLASIL YMFMKKEKARTNLSAIIGVGLIFFGLQLMSQALSPLKDMSEFIDMFKMFEVDSYFGLLKV TAVGAIITALIQSSAATIGITIALATQGLIDYQAAVALVLGENVGTTITALLASLGAKPN AKRAAYAHTLINLIGVIWVTSIFRIYLHFLNHFVDPINHMGAAIAAAHTIFNISNVILLI PFVGLLDKFLLFVVKDTGEDEVRVTKLASLKMTLPSVIIEQTKIEVNSMVDIIEDAFLKL EESLKEKDKIAKYNDSIIEAEDKLDLYEKEIYDSNFSLLSKSLSKELIEDTRMNLLACDE YETIGDYQNRIANRLFMLYENSIDLDEVRAKMAFKLHSLAVELFNDISRAIRTNEKELYP IGMKKYQALKTYYKEVKREHFSRAENIPARLNTGYLDIINYYKRIADHIYNIIEYVMKI >gi|296153436|gb|ADVK01000067.1| GENE 2 1966 - 2838 928 290 aa, chain - ## HITS:1 COG:FN0277 KEGG:ns NR:ns ## COG: FN0277 COG4866 # Protein_GI_number: 19703622 # Func_class: S Function unknown # Function: Uncharacterized conserved protein # Organism: Fusobacterium nucleatum # 1 290 1 290 290 487 99.0 1e-137 MWQKLTIESKSSIEEYTKNRFEICDLSFSNLLLWSIGENTEYEIENDVLTVRSVYMGDVY YYMPIPKNDTPKNIEKMKEKIREILKENVAIHYFTEYWYEKLKDDFNLQEKRDYEDYIYS YESLSTLKGRHYAKKKNRVSNFKKSYQFSYESISKDNINEVVAFQKKWYEIHSESSEEIL KNENEGILNLLKNYEKLDLKGGFLKVNNQVIAYSLGEALTDKMILVHTEKALIDYIGSYQ AINMIYLQKEWQGYELVNREDDFGDEGLREAKMSYKPLYLQKKYSIEKNI >gi|296153436|gb|ADVK01000067.1| GENE 3 2851 - 4209 2047 452 aa, chain - ## HITS:1 COG:FN0278 KEGG:ns NR:ns ## COG: FN0278 COG0624 # Protein_GI_number: 19703623 # Func_class: E Amino acid transport and metabolism # Function: Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases # Organism: Fusobacterium nucleatum # 1 452 1 452 452 879 99.0 0 
MDLKEKVLDYKDEVVKEIQNAVRVKSVKEAPLPGMPFGEGPAKALDHFMDLAKKLGFKAE KFDNYAMHIDMGEGKETLGILAHVDVVPEGDNWTYPPYSGTIADGKIFGRGTLDDKGPAI ISLFAMKAIADSGVKLNKKIRMILGADEESGSACLKYYFGELKMPYPDIAFTPDSSFPVT YAEKGSVRVKIKKKFNTLQDLVIKGGNAFNSVPNEANGVIPVDMLGEVRNKNKVEFEREG NTYKVFSAGIPAHGAYPSKGYNAVSALFEVLKDIEVKNEELKGLVTFFDKFVKMETDGKS FGVKCTDRETGELTLNLGKINLENNELEIWIDMRIPVKVKNEQIIETIKENTEDYGYEFL LHSNTQPLYVAKDSFLVSTLMNIYKELTGDNDAEPVAIGGGTYAKYAKNAVAFGALLPDQ EDRMHQRDEYLEISKIDKLLQIYVEAIYRLAK >gi|296153436|gb|ADVK01000067.1| GENE 4 4238 - 5020 869 260 aa, chain - ## HITS:1 COG:FN0279 KEGG:ns NR:ns ## COG: FN0279 COG2853 # Protein_GI_number: 19703624 # Func_class: M Cell wall/membrane/envelope biogenesis # Function: Surface lipoprotein # Organism: Fusobacterium nucleatum # 1 260 1 260 260 473 98.0 1e-133 MKTKNLLLLSVLSLTLISCSNTNEVNKSDTNYSEASNVVYANPGESNFMADEPDPWEPFN RRMYYFNYQIERLIITPIVNTYKFITPDFVEDRISNFFKNAKVLNTMANSAFQFKGRKSM KALGRFTINTVLGLGGLFDVSSKMGMPKPYEDFGLTLAYYGVGRGPYLVLPILGPTYLRD AFGMGVDSVVAGKSDIYKRMYLFDSALVNYTDASLFVLKGIDLRKNINFHYHQTNSPFEY EYVRYLYSKYRGIQEATSKQ >gi|296153436|gb|ADVK01000067.1| GENE 5 5010 - 6293 1587 427 aa, chain - ## HITS:1 COG:no KEGG:FN0280 NR:ns ## KEGG: FN0280 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 427 1 427 427 789 100.0 0 MKKLLNKIVLFLILSLTAFSYNFPIDDPYSATIIGSATMMTPGVSENIPLKVYKIQIKDK KETPDVFWYASKFKFSFSKQKNKKAPLIFVLAGTGSDYSTTRVKFMQRIFHDAGYHTIAI SSQMSQQFMISASSNSVPGLLLEDNKDIYKAMKLAYNKIKDQVDVTDFYIMGYSLGGTNA AVLSYIDEKEKAFNFKRVFMVNPPVELYDSAVKLDKYLDDYTGGKTKGIEKLLNTTLARV KGGLTNEYANIGAETIYEIIKGDILSEAEKKAYIGLAFRLTSNDLNFISDFITKSHVYTK NPEKVNKFTNMKEYLKAVNFATFEDYVNKVGFPYYKKYNKDFTIEDLKKEASLRVIEDYL RTSPKIAAVTNADELILNEKDIDYLKDVFKDRLVIYPKGGHCGNMFYKENVDVMLKFVNE GVLKYEN >gi|296153436|gb|ADVK01000067.1| GENE 6 6306 - 10655 5433 1449 aa, chain - ## HITS:1 COG:FN0281 KEGG:ns NR:ns ## COG: FN0281 COG2176 # Protein_GI_number: 19703626 # Func_class: L Replication, recombination and repair # Function: DNA polymerase III, alpha subunit 
(gram-positive type) # Organism: Fusobacterium nucleatum # 1 1449 6 1454 1454 2788 99.0 0 MNNRQIIIEPNMEVFSKLGVKSIEIKTILLNTRIKRITFNCTVSSMNCIDDIDIIYKDVL SKFGRELEIEFITDNKNLSLKDEEIKTIAIRAIERLKTKNTTSKSFLCFYKLHVKDNYIV IELNDENTKFMLEEVKISSKIENILDEYGVKDYKIIFSVGDFSKEILNVEEKIKLDIEKH QDTINAEREKVVKTNSTSETQVYKAKNDFKRGSKTREIKGETISIKDFYDLYDGETCIVE GEIFSMEDMTLKSGKILRTIRVTDGESSLTSKIFLDEDDKLDIHEGMFLKLSGKLQLDTY AGNEKTLMINAINILEKENTKKEDTAEEKMVELHAHTKMSEMVGVTDVEDIIKRAKEYGH KAIAITDYSVVHSYPAAFKTAKKFSTDEEKMKAIFGCEMYMIDDEAPMVTNPKDKKIDDE EFVVFDIETTGLNSHTNEIIEIGAVKIKAGRIVDRYSQLINPGRPIPYHITEITSITDEQ VANEPKIDEVIGKFVDFIGDAVLVAHNAPFDMGFIKRDIKKYLNIDYQCSVIDTLQMARD LFPDLKKYGLGDLNKTLGLALEKHHRAVDDSQATANMFIIFLDKYKEKGLEYMKDINVGF EVNVKKQSLKNIMVLVKTQDGLKNMYRLVSEAHIKYFGNKKARIPKSVLTENREGLIIGS SLTAHFMNIGELADLYLRHDLEKLEEAAKFYDYIELLPKSTYNELIEKDGTGALGSYEEV EKMNKYFYDLGKRLGILVTASSNVHYLDENEDIIRSILLYGSGTVYNSRQYSINNGFYFR TTDEMLQEFSYLGDEKAKEVVITNTNKIADMIESGIKPIPEGFYPPKMENAEEIVRTMTY EKAYRIYGDPLPNIVSARLERELNAIINNGFSVLYLSAQKLVKKSLDNGYLVGSRGSVGS SLVAFMMGITEVNALYPHYICDNSECKYSEFIEREGVGIDLPDKICPKCGAKLRKDGYSI PFEVFMGFKGDKVPDIDLNFSGEYQSEIHRYCEELFGKENVFKAGTISTLAEKNAEAYVR KYFEDNNLNAVRAEIIRLGRLCQGAKKTTGQHPGGMVIVPQGNSIYEFCPVQRPANDETS ESTTTHYDYHVMDEQLVKLDILGHDDPTTIKLLQEYTNIEIKDIPLADKDTLKIFSSTES LGVTPEQIGTEIGTYGIPEFGTGFVRQMLIDTRPTTFAELVRISGLSHGTNVWLNNAQEF VRNGQATLSQIITVRDDIMNYLIDQGLDNSDSFKIMEFVRKGKPKKEPENWEKYSAMMKE KNVPDWYIESCRRIEYMFPKGHAVAYVMMAMRIAYFKVHQPLAFYAAFLSRKADDFDMEV MSRGILAKQKLEELSKEPKLDPKKKNEQAICEIVVELEARGLKLLPVDIYLSDGKKFTIE DGKIRIPLIGINGLGGAVIDAIVKEREEGKFISVEDLKRRTKIQQPVVDKLKNIGAISSL SETNQISLF >gi|296153436|gb|ADVK01000067.1| GENE 7 10657 - 11220 707 187 aa, chain - ## HITS:1 COG:FN0282 KEGG:ns NR:ns ## COG: FN0282 COG4752 # Protein_GI_number: 19703627 # Func_class: S Function unknown # Function: Uncharacterized protein conserved in bacteria # Organism: Fusobacterium nucleatum # 1 187 1 187 187 377 100.0 1e-105 MRNKVYLSLVHYPVYNRNKDIVCTSVTNFDIHDISRSCGTYEIKGYRLVVPVDAQKKLTE 
RIIGYWQDGTGGQYNKDREQAFRVTDVTESIEAVVEEIEKIEGQKPLIITTSARIFNNSI SYKNLSKQIFEDDKPYLLLFGTGWGLTDEVMAMSDYILEPIRANSKYNHLSVRAAVAIIL DRLFGEN >gi|296153436|gb|ADVK01000067.1| GENE 8 11230 - 11946 962 238 aa, chain - ## HITS:1 COG:FN0283 KEGG:ns NR:ns ## COG: FN0283 COG0336 # Protein_GI_number: 19703628 # Func_class: J Translation, ribosomal structure and biogenesis # Function: tRNA-(guanine-N1)-methyltransferase # Organism: Fusobacterium nucleatum # 1 238 1 238 238 450 99.0 1e-127 MKINILTLFPKMFDGFLSESIIARAIKFGAVEVNIIDIRDYCFDKHKQADDMPFGGGNGM VMKPEPLFLALENVSGKVIYTSPQGKIFNQEIAKELVKEEELTIIAGHYEGVDERVVENK VDMELSIGDFVLTGGEIPAMAISDTIIRLLPDVIKKESYENDSFYNGLLDYPHYTRPAEY KDLKVPEVLLSGNHKKIDEWRLKESLRRTYLRRRELIENRELTKLEKKLLDEIKKEEV >gi|296153436|gb|ADVK01000067.1| GENE 9 11943 - 12464 808 173 aa, chain - ## HITS:1 COG:FN0284 KEGG:ns NR:ns ## COG: FN0284 COG0806 # Protein_GI_number: 19703629 # Func_class: J Translation, ribosomal structure and biogenesis # Function: RimM protein, required for 16S rRNA processing # Organism: Fusobacterium nucleatum # 1 173 1 173 173 280 100.0 7e-76 MELLIAGKVLGSHNLKGEVKVISDLDNIEVLVGNKVILELADSQQKLLTIKKIEHLVANK WIFSFEEIKNKQDTIEIRNANIKVRRDIVGIGEDEYLVSDMIGFKVYDVKGDEYLGEITE IMDTAAHDIYVIESEEFETMIPDVDVFIKNIDFENRKMLVDTIEGMKESKVKK >gi|296153436|gb|ADVK01000067.1| GENE 10 12473 - 12712 310 79 aa, chain - ## HITS:1 COG:FN0285 KEGG:ns NR:ns ## COG: FN0285 COG1837 # Protein_GI_number: 19703630 # Func_class: R General function prediction only # Function: Predicted RNA-binding protein (contains KH domain) # Organism: Fusobacterium nucleatum # 1 79 1 79 79 122 98.0 1e-28 MENLESLLNFIIKELVETKDKVNVTYEVLDSNVTFKVSVAKGEMGKIIGKNGLTANAIRG VMQAAGVKDKLNVNVEFLD >gi|296153436|gb|ADVK01000067.1| GENE 11 12724 - 12966 387 80 aa, chain - ## HITS:1 COG:no KEGG:FN0286 NR:ns ## KEGG: FN0286 # Name: not_defined # Def: hypothetical protein # Organism: F.nucleatum # Pathway: not_defined # 1 80 1 80 80 138 98.0 9e-32 MKKSYEFLIQSKREDIDFINKIVEAYEGAGVVRTLDPIKGIISVISTDDFKDFMRDVLVD LGKKWVDLKIIEEGAWKGTL 
>gi|296153436|gb|ADVK01000067.1| GENE 12 12975 - 13769 906 264 aa, chain - ## HITS:1 COG:FN0287 KEGG:ns NR:ns ## COG: FN0287 COG0030 # Protein_GI_number: 19703632 # Func_class: J Translation, ribosomal structure and biogenesis # Function: Dimethyladenosine transferase (rRNA methylation) # Organism: Fusobacterium nucleatum # 1 264 1 264 264 438 95.0 1e-123 MEFKHKKKYGQNFLNNKDEILNKIIEVSNISDNDEILEIGPGQGALTSLLVERVKKVTCV EIDKDLENTLRKKFSSKENYTLVMGDVLEVDFKKYINQGTKVVANIPYYITSPIINKIIE NKDLIDEAYIMVQKEVGERICAKSGKERGILTLAVEYYGESEYLFTIPREFFNPIPNVDS AFISIKFYKDDRYKNKISEDLFFKYVKAAFSNKRKNIVNNLATLGYSKDKIKEILNQIEV PENERAENISIDKFIELINIFESR >gi|296153436|gb|ADVK01000067.1| GENE 13 13779 - 14306 871 175 aa, chain - ## HITS:1 COG:FN0288 KEGG:ns NR:ns ## COG: FN0288 COG0634 # Protein_GI_number: 19703633 # Func_class: F Nucleotide transport and metabolism # Function: Hypoxanthine-guanine phosphoribosyltransferase # Organism: Fusobacterium nucleatum # 1 175 1 175 175 306 97.0 2e-83 MKYRIENLIDKEAVEKRIKELARQIEKDYVGEEIYCVGLLKGSVIFLSDLVKELNIPVII DFMSVSSYGSETVSSGDVKILKDTDLDLRGKHVLIVEDIIDTGLTLEYVIKYFKEGKGVK SLRTCTLLNKPERRKVDVKVDYIGFDVPDKFVIGYGLDYDQKYRNLPYIAVVIPE >gi|296153436|gb|ADVK01000067.1| GENE 14 14628 - 15347 788 239 aa, chain + ## HITS:1 COG:no KEGG:CJA_1210 NR:ns ## KEGG: CJA_1210 # Name: not_defined # Def: hypothetical protein # Organism: C.japonicus # Pathway: not_defined # 1 231 1 230 241 120 30.0 5e-26 MSIFSRVNEKIEKYPEGVLFSCDDFFEKNISRTAVLKSLERIEKTGKIKRVEKGFYYLPI KSIFGEIPISSNEYIKKFLYKDKEKIGYISGNNLYNRYGLTTQLSNLIEITTNTRKNKKY IGNICIKFIETKVPIKRESIKYLEILDILKNLNKISDGNITENYLKMKKNIEIFDLEDSL ELLKFSEEYYPLFVSAILGSILEEKRFKKVQILKEKIAKKKTFFKLGIKLKNTIEWRIK >gi|296153436|gb|ADVK01000067.1| GENE 15 15348 - 16319 969 323 aa, chain + ## HITS:1 COG:no KEGG:CJA_1209 NR:ns ## KEGG: CJA_1209 # Name: not_defined # Def: hypothetical protein # Organism: C.japonicus # Pathway: not_defined # 1 303 1 304 319 185 36.0 3e-45 MKLHLDSEFKNLITIVAKDLGILEFYVEKDYWITYILNKLSKSSYKNEVVFKGGTSLSKV 
FDDLINRFSEDIDLQLIEQDLTSSGKKKKLKEIEKAIIDSEVLNVVNEDEITKKNGKMRR TVYEYPVNCSVSGIGQASDKLLLEIVSYSSSIPRKMHKIETYIAKYLKKIGEKDIISKFE LEEFEINVLSIKRTFLEKLFAIIEASLKESNIEELKNKIRHLYDIYMIYTKKEDIISTFL SEKSSFDICNLIYKENIHIDISEKKYCDSFLFTDFEKINEIKKTYENEFSKLVFKELPNF EDLKVTLKKFLNFIKEWEEKYKN >gi|296153436|gb|ADVK01000067.1| GENE 16 16783 - 17595 1350 270 aa, chain + ## HITS:1 COG:FN0294 KEGG:ns NR:ns ## COG: FN0294 COG3959 # Protein_GI_number: 19703639 # Func_class: G Carbohydrate transport and metabolism # Function: Transketolase, N-terminal subunit # Organism: Fusobacterium nucleatum # 1 270 1 270 270 539 100.0 1e-153 MKDISFLKEKAKEIRKSIVSMIAEAKSGHPGGSLSATDILTALYFSEMNVDPTNPKMEGR DRFVLSKGHAAPAIYATLAEKGYFSKDELMTLRKFGSRLQGHPDMKKLSGIEISTGSLGQ GLSVANGMALNAKIFDENYRTYVVLGDGEIQEGQIWEAAMTAAHYKLDNLCAFLDSNNLQ IDGNVTEIKGVEPLDKKWEAFGWNVIKIDGHDFEQILSALEKARECKGKPTMIIAKTIKG KGVSFMENVCGFHGVAPTVEELERALAELA >gi|296153436|gb|ADVK01000067.1| GENE 17 17620 - 18549 1410 309 aa, chain + ## HITS:1 COG:FN0295 KEGG:ns NR:ns ## COG: FN0295 COG3958 # Protein_GI_number: 19703640 # Func_class: G Carbohydrate transport and metabolism # Function: Transketolase, C-terminal subunit # Organism: Fusobacterium nucleatum # 1 309 1 309 309 587 99.0 1e-167 MSKKSTRQAYGEALVELGRINNDIVVLDADLSKSTKTDLFKKEFPKRHLNIGIAEADLMG TAAGFATCGKIPFASTFAMFAAGRAFEQIRNTIAYPKLNVKIAPTHAGISVGEDGGSHQS IEDIALMRAIPEMVVLCSCDAVETKKMVFAAAEYNGPVYLRLGRLDVETVLDDNYDFQIG IANTLRDGSDVTIVSTGLLTQEALKAAEELAKENISVRVINCGTIKPLDGETILKAAQET KFIITAEEHSVIGGLGSAVSEFLSETHPTLVKKLGVYDKFGQSGKGAEMLEKYELTAAKL ISMVKENLK