>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 28.8 bits (64), Expect = 0.023 Identities = 32/120 (26%), Positives = 51/120 (42%), Gaps = 18/120 (15%) Query: 157 IAEELGISRAQFD-----QFLRMMQGGAQFGGGYQQQSGGGNWQQAQRGPTLEDACNVLG 211 EEL R FD F+ + QQQ G G QQAQ T ++A Sbjct: 310 TLEEL---RDSFDGYINNAFVNQIHLNFVMPPQAQQQQGQGQQQQAQ--ATAQEAVAAAA 364 Query: 212 VKPTDDATTIKRAYRKLMS-EHHPDKLVAKGLPPEMMEMAKQKAQEIQ-QAYELIKQQKG 269 V+ + + I + Y+ L+ + H G+ M ++A Q+ ++ + Q KQQ+G Sbjct: 365 VRLLNGSDQIAQLYKDLVKLQRH------AGIRKAMEKLAAQQEEDAKNQGKGDCKQQQG 418
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.007 Identities = 15/60 (25%), Positives = 29/60 (48%), Gaps = 2/60 (3%) Query: 119 EVTEILVKVGDKV-EAEQSLITVEGDKASMEVPAPFAGTVKEIKVN-VGDKVSTGSLIMV 176 E+ + L + D + L E + + + AP + V+++KV+ G V+T +MV Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Score = 32.1 bits (73), Expect = 0.008 Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 2/63 (3%) Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGIVKEIKVSVGDKTQTGALIMIFDSADGAADAAP 85 + V +T G S E+ + IVKEI V G+ + G +++ + AD Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 86 AQA 88 Q+ Sbjct: 139 TQS 141 Score = 31.0 bits (70), Expect = 0.015 Identities = 17/106 (16%), Positives = 38/106 (35%), Gaps = 4/106 (3%) Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGVVKELKVNVGDKVKTGSLIMIFEVEGAAPAAAP 289 + VA +T G S E+ +VKE+ V G+ V+ G + + ++ A Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDV--LLKLTALGAEADT 136 Query: 290 AKQEAAAPAPAAKAEAPAAKAEGKSEFAENDAYVHATPLIRRLARE 335 K +++ + + + + P + ++ E Sbjct: 137 LKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEE 182
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 507 bits (1308), Expect = 0.0 Identities = 292/296 (98%), Positives = 293/296 (98%) Query: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELMTEMKPSFMVWSAGYGPSSEMLARIAPGR 120 DTINYRLWVSEPPLPDSVIDVGLRTEPNLEL+TEMKPSFMVWSAGYGPS EMLARIAPGR Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFNFSDGKHPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 GFNFSDGK PLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 DNSKDMDALMATPLWQAMPFVRTGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296 DNSKDMDALMATPLWQAMPFVR GRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 70.1 bits (171), Expect = 2e-16 Identities = 41/176 (23%), Positives = 73/176 (41%), Gaps = 3/176 (1%) Query: 6 LSYSHFNNDLSATMSNGTYVDGSTNSDAWGFGLKTGYDFKLGDAGYVTPYGSISGLFQSG 65 L S ND S+G V G + G L+ G F D ++ P ++ G Sbjct: 736 LRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGG 795 Query: 66 DDYQLSNDMKVDGQPYDSMRYELGVDAGYTFTYSEDQALTPYFKLAYVYDDSNNDNDVNG 125 Y+ +N ++V + S+ LG++ G + + + PY K + + + + V+ Sbjct: 796 GAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVL-QEFDGAGTVHT 854 Query: 126 DSIDNGTEGSAVRV--GLGTQFSFTKNFSAYTDANYLGGGDVDQDWSANVGVKYTW 179 + I + TE R GLG + + S Y Y G + W+ + G +Y+W Sbjct: 855 NGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 30.0 bits (67), Expect = 0.015 Identities = 19/69 (27%), Positives = 30/69 (43%) Query: 254 DIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYF 313 + EL + +L + QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKKI 322 L+L E++I Sbjct: 526 DLNLVERRI 534
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.4 bits (68), Expect = 0.027 Identities = 31/101 (30%), Positives = 42/101 (41%), Gaps = 9/101 (8%) Query: 10 PVGNGGPVITT-----PPIAGESGGMSTGSAVTDVSGAAEEMAEQAAADLFGALPEPSGL 64 P + G VI T P +G GG G + ++ S A A+ + A L E + Sbjct: 13 PYDDKGQVIITLLNGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAAR 72 Query: 65 VKAAVAAAQAAAAA---AGISDMAGAVQDAAASLAAGAPGA 102 KAA A AQA A A A + V +A A+ P A Sbjct: 73 AKAA-AEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSA 112
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 29.4 bits (66), Expect = 0.016 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 10/69 (14%) Query: 187 FISGTGFATDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 245 +G ++D+R L A + D A+LAL + R+ K++ + Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323 Query: 246 DVIVLGGGM 254 DVIV G+ Sbjct: 324 DVIVFTAGI 332
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.5 bits (126), Expect = 1e-09 Identities = 73/356 (20%), Positives = 126/356 (35%), Gaps = 36/356 (10%) Query: 5 ILSLALGTFGLGMAEFGIMSVLTELAHNVGISIPAAGH---MISYYALVVVVGAPIIALF 61 + ++AL G+G+ IM VL L ++ S H +++ YAL+ AP++ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 121 S R+ + +LL +A + A+ + +L IGR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMVSGMTVANLLGIP-LGTYLSQECWRYTFLLIAVFNIAVMASVYFWVPDI 180 G A G +S ++ P LG + F A N + F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 181 RDEAKGNLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 228 + LR + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 229 SGFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCG 285 F A T + L G+ + M++G ++ R R + ++L F Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 286 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 339 I + G+ LQ +L + E G G +A +L S VG Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.4 bits (92), Expect = 7e-05 Identities = 34/199 (17%), Positives = 71/199 (35%), Gaps = 14/199 (7%) Query: 671 QQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDDLPHSEETVALDNWRQVHEQCLALH 730 + + Q + Q R Q L+ +E + E + +V + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTAL--------QASVFDDQQAFLAALMDEQTLTQL 782 Q T Q Q +L K +A+ T L + V + ++L+ +Q + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA-- 250 Query: 783 EQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQEL-AQTHQKLREN 841 K + Q + V + +Q +Q + L+ + + Q + KLR+ Sbjct: 251 ---KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQT 307 Query: 842 TTSQGEIRQQLKQDADNRQ 860 T + G + +L ++ + +Q Sbjct: 308 TDNIGLLTLELAKNEERQQ 326 Score = 39.4 bits (92), Expect = 7e-05 Identities = 25/204 (12%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 487 EARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEG 546 EA ++ Q + Q ++E + E + +E + L Sbjct: 133 EADTLKTQSSLLQARLEQ---TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT- 188 Query: 547 AALRGQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQPWLDAQD 606 + ++ Q Q + E R + + + + DD L Q Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248 Query: 607 -------EHERQL-RLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLP 658 E E + +++ + Q+ +I+ +++ + Q L Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF------KNEILD 302 Query: 659 QEDEEESWLATRQQEAQSWQQRQN 682 + + + E ++RQ Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQ 326 Score = 32.5 bits (74), Expect = 0.009 Identities = 16/150 (10%), Positives = 42/150 (28%), Gaps = 5/150 (3%) Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTA----LQASVFDDQQAFLAALMDEQTLTQLEQLK 786 + Q + A + Q + L D+ F +E+ L +K Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS-EEEVLRLTSLIK 192 Query: 787 QNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQG 846 + + Q + A+ + L L + ++ Sbjct: 193 
EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 847 EIRQQLKQDADNRQQQQTLMQQIAQMTQQV 876 + +Q + + + + Q+ Q+ ++ Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEI 282
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 30.8 bits (69), Expect = 0.009 Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 4/70 (5%) Query: 149 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGSSKSDAVRDIYIGTLDAFP 208 K+ ++ I ++Y + + + I T S+ D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTR--SAGKD-IVSVKINIDKAKK 190 Query: 209 AQNFPPADYI 218 N P DYI Sbjct: 191 ILNLPECDYI 200
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.3 bits (237), Expect = 7e-25 Identities = 33/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIKMQGLSLDPTSHRVMAGEEP 152 E + L D + G Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 34.1 bits (78), Expect = 0.001 Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%) Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380 LV N + H P+G I ++ + VE+ G Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306 Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422 +G GL V+ + E+++ + GK +IP Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 68.7 bits (168), Expect = 1e-14 Identities = 38/183 (20%), Positives = 88/183 (48%), Gaps = 5/183 (2%) Query: 433 IQIVEERTIGPTLGMQNIEQGLEACLAGLLVSILFMII-FYKKFGLIATSALIANLILIV 491 ++I ++GP + + + + + LA +V + ++ + F +F L A AL+ +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 492 GIMSLLPGATLSMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAIDEGYRGA 549 G+ ++L + +A ++ +++ V++ +R++E L ++ ++ Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 550 FSSIFDANITTLIKVIILYAVGTGAIKGFAITTGIGVATSMFTAIVGTRAIVNLLYGGKR 609 S +TTL+ ++ + G I+GF GV T ++++ + IV L G R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV-LFIGLDR 312 Query: 610 VKK 612 K+ Sbjct: 313 NKE 315
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 348 bits (895), Expect = e-122 Identities = 104/309 (33%), Positives = 179/309 (57%), Gaps = 12/309 (3%) Query: 17 YDFMRWDYWAFGISGLLLIAAIVIMGVRGFNWGLDFTGGTVIEITLEEPAEIDVMRDALQ 76 +DF RW + FG + +++IA++++ V G N+G+DF GGT I ++ V R AL+ Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73 Query: 77 KAGFEEPMLQNFGS------SHDIMVRMPPAEGETGGQVLGSQVLKVINE------STNQ 124 + ++ H M+R+ E G + G+Q +++N+ + + Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133 Query: 125 NAAVKRIEFVGPSVGADLAQTGAMALMAALLSILVYVGFRFEWRLAAGVVIALTHDVIIT 184 + E VGP V +L T +L+AA + I+ Y+ RFEW+ A G V+AL HDV++T Sbjct: 134 ALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLT 193 Query: 185 LGILSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQT 244 +G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +T Sbjct: 194 VGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 245 LHRTLITSGTTLMVILMLYLFGGPVLEGFSLTMLIGVSIGTASSIYVASALALKLGMKRE 304 L RT++T TTL+ ++ + ++GG V+ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRN 313 Query: 305 HMLQQKVEK 313 + +K Sbjct: 314 KEKKDPSDK 322
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 527 bits (1358), Expect = 0.0 Identities = 257/294 (87%), Positives = 273/294 (92%) Query: 1 MKKTLLAAGAVLALSSSFTVNAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 MKKTLLAAGAV+ALS++F AAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 61 YEAFAKKDWFDFYGYADAPVFFGGNSDAKGIWNHGSPLFMEIEPRFSIDKLTNTDLSFGP 120 YEAFAKKDWFDFYGY DAPVFFGGNS AKGIWN GSPLFMEIEPRFSIDKLTNTDLSFGP Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120 Query: 121 FKEWYFANNYIYDMGRNKDGRQSTWYMGLGTDIDTGLPMSLSMNVYAKYQWQNYGAANEN 180 FKEWYFANNYIYDMGRN QSTWYMGLGTDIDTGLPMSLS+NVYAKYQWQNYGA+NEN Sbjct: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 Query: 181 EWDGYRFKIKYFVPITDLWGGQLSYIGFTNFDWGSDLGDDSGNAINGIKTRTNNSIASSH 240 EWDGYRFK+KYFVP+TDLWGG LSYIGFTNFDWGSDLGDD+ +NG RT+NSIASSH Sbjct: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240 Query: 241 ILALNYDHWHYSVVARYWHDGGQWNDDAELNFGNGNFNVRSTGWGGYLVVGYNF 294 ILALNY HWHYS+VARY+H+GGQW DDA+LNFG+G F+VRSTGWGGY VVGYNF Sbjct: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.0 bits (91), Expect = 3e-05 Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%) Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121 +F +P++ + F RR LL + V A M W+ + ++A + Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109 Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181 + V A+ D+ +ER + GM+ L + ++ A Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167 Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238 AL + + L PE + P+ + + + A L+ + ++ +G Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227 Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298 A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++ Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285 Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358 A GY LL+ + + V GG+G A A+L ++ L+ Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341 Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403 AL+++ + VGP+ + A +T+ + + AA+ L L + R Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.5 bits (58), Expect = 0.030 Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 3 KKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 36 KK+LF ++ GCA+ T+ PT P++ Sbjct: 7 KKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.043 Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%) Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119 E P+ E + ++G+ A + +Y RL D +++ G Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167 Query: 120 PTGSGKTLLAETL 132 +G+GK L+A L Sbjct: 168 ESGTGKELVARAL 180
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 35.0 bits (80), Expect = 0.001 Identities = 34/133 (25%), Positives = 69/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDVPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (294), Expect = 3e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1366 bits (3536), Expect = 0.0 Identities = 800/1033 (77%), Positives = 913/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300 + EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWLNRMFEKSTHHYTDSVGGILRSTGR 540 SVLVALILTPALCAT+LKP++ H E K GFFGW N F+ S 
+HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600 YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660 EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSTPFS 900 M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWS P S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020 +EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.0 bits (104), Expect = 7e-07 Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%) Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQGYDQALADAQQANAAVTA 159 + Y A +L + + Q+ Q +++ ++ L +Q + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218 + + + +P+S ++ + V TEG +V + T + V + D + V Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372 Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268 + D + KV I D I+ + G + ++++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429 Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300 + S + I L GM V A ++ G Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455 Score = 32.9 bits (75), Expect = 0.002 Identities = 26/127 (20%), Positives = 50/127 (39%), Gaps = 10/127 (7%) Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107 ++I G+ T + R E++P + I+ + KEG + G L ++ +A Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134 Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQGYDQALADAQQANAAVTAAKAAVETA 167 D K Q++ A+L RYQ L + I + + V+ + T+ Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILSRS--IELNKLPELKLPDEPYFQNVSEEEVLRLTS 189 Query: 168 RINLAYT 174 I ++ Sbjct: 190 LIKEQFS 196
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 222 bits (567), Expect = 5e-76 Identities = 215/215 (100%), Positives = 215/215 (100%) Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.017 Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%) Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRIKEE 87 N RA L + + L L+ + A L++ ++ E Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143 +LR ++ + + +A V E L +T ++ L +A+ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324 Query: 144 LQNAQ 148 Q + Sbjct: 325 QQASV 329
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 38.9 bits (90), Expect = 7e-05 Identities = 40/251 (15%), Positives = 77/251 (30%), Gaps = 31/251 (12%) Query: 402 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 457 P E +Q + + + P+ AR + A + A T Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037 Query: 458 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 506 V ++ E AT Q +E V A + + A E ++T Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097 Query: 507 LAVKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 556 K A E+ +V+ PK + + E +N ++++ Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157 Query: 557 SQRHLNNRGAQQKLAEALS-MLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 615 Q N ++ A+ S ++ E T V N V P A + + + Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217 Query: 616 IIADNNIQTLR 626 + + +++R Sbjct: 1218 KPKNRHRRSVR 1228
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 27.8 bits (62), Expect = 0.048 Identities = 11/43 (25%), Positives = 18/43 (41%) Query: 38 GQLAAVAIVTCDGNVYSAGDSDYRFALESISKVCTLALALEDV 80 G++ + + G +A +D RF + S KV L V Sbjct: 38 GRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 74.7 bits (183), Expect = 5e-18 Identities = 47/212 (22%), Positives = 80/212 (37%), Gaps = 7/212 (3%) Query: 16 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPNDVERMNN----MGFT--GVLIDLDS 69 K ITG + GIG A L QG H+ A P +E++ + D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 70 PESVNRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 129 +++ + + + N AG G + ++S + E FS N G + + Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 130 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 189 M+ G IV S + AYA+SK A ++ L +EL I+ +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 190 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 221 G T ++ ++ G F G Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.014 Identities = 12/20 (60%), Positives = 13/20 (65%) Query: 41 LVGESGSGKSTLLAILAGLD 60 L G G GKSTL+ L GLD Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620
>PF09025#YopR Core Length = 143 Score = 28.1 bits (62), Expect = 0.020 Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 8/61 (13%) Query: 126 EAVLIGQLECKSMVRMCAPLGSR--------LPLHASGAGKALLYPLAEEELMSIILQTG 177 + + +LE K+M+R PLG + L G L LA EL +I G Sbjct: 68 QGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQVLIPLNG 127 Query: 178 L 178 + Sbjct: 128 M 128
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 135 bits (341), Expect = 2e-39 Identities = 60/171 (35%), Positives = 91/171 (53%), Gaps = 16/171 (9%) Query: 11 QRYTWCL------AGICYSSLAILPSFLSY-----AESYFNPAFLLENGTSVADLSRFER 59 QR T CL + L + +F + AE YFNP FL ++ +VADLSRFE Sbjct: 10 QRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFEN 69 Query: 60 GNHQPAGVYRVDLWRNDEFIGSQDIVFESTTENTGDKSGGLMPCFNQVLLERIGLNSSAF 119 G P G YRVD++ N+ ++ ++D+ F NTGD G++PC + L +GLN+++ Sbjct: 70 GQELPPGTYRVDIYLNNGYMATRDVTF-----NTGDSEQGIVPCLTRAQLASMGLNTASV 124 Query: 120 PELAQQQNNKCINLLKAVPDATINFDFAAMRLNITIPQIALLSSAHGVMTP 170 + ++ C+ L + DAT D RLN+TIPQ + + A G + P Sbjct: 125 SGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPP 175
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.7 bits (212), Expect = 1e-21 Identities = 35/117 (29%), Positives = 62/117 (52%) Query: 2 KLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWD 61 +L+ +D+ L + L+ AG+ V + N + GD DL++ D+++PD N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 IVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118 ++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.3 bits (84), Expect = 2e-04 Identities = 26/189 (13%), Positives = 61/189 (32%), Gaps = 13/189 (6%) Query: 254 QAQTVNSGSLQSVKLPA-GLSSQILLQRPDIMEAEHALM-----AANANIGAARAAFFPS 307 + +SG + +K + +I+++ + + L+ A A+ ++ Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS----- 141 Query: 308 ISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYE 367 SL + + + + P F + L + + ++Q Sbjct: 142 -SLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200 Query: 368 QKIQNAFKEVADALALRQSLDDQISAQQRYLASLQITLQRAWALYQHGAVSYLEVLDAER 427 QK Q + A R ++ +I+ + + L +L A++ VL+ E Sbjct: 201 QKYQ-KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259 Query: 428 SLFATRQTL 436 L Sbjct: 260 KYVEAVNEL 268
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 56.4 bits (136), Expect = 2e-15 Identities = 18/50 (36%), Positives = 27/50 (54%) Query: 1 MLTKYALVAVIVLCLTVLGFTLLAGDSLCEFTVKERNIEFRAVLAYEPKK 50 + + V+++CLT+L FT L SLCE ++ E A +AYE K Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 274 bits (703), Expect = 1e-96 Identities = 105/183 (57%), Positives = 130/183 (71%), Gaps = 1/183 (0%) Query: 1 MKTTHTSLPFAGHTLHFVEFDPANFCEQDLLWLPHYAQLQHAGRKRKTEHLAGRIAAVYA 60 M T+H LPFAGH LH V+FD ++F E DLLWLPH+ +L+ AGRKRK EHLAGRIAAV+A Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 61 LREYGYKCVPAIGELRQPVWPAEVYGSISHCGATALAVVSRQPIGVDIEEIFSAQTATEL 120 LRE G + VP +G+ RQP+WP ++GSISHC TALAV+SRQ IG+DIE+I S TATEL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 121 TDNIITPAEHERLADCGLAFSLALTLAFSAKESAFKA-SEIQTDAGFLDYQIISWNKQQV 179 +II E + L L F LALTLAFSAKES +KA S+ T GF ++ S + Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 Query: 180 IIH 182 +H Sbjct: 181 SLH 183
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 82/394 (20%), Positives = 145/394 (36%), Gaps = 44/394 (11%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83 + V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+ Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74 Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATSALVGR 141 V+L + G ++ + P L +Y+ + G + G A A + + Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126 Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201 + + G V P++GGL+ GG + + AA L L LP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 202 PPQPLEHPLK----SLLAGFRFLLASPLLGGLLTMA----------SAVLVLYPALADNW 247 + PL+ + LA FR+ ++ L+ + +A+ V++ D + Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241 Query: 248 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 303 A IG AA L + A+ +G +A ++L + ++ + Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 304 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 363 M +V LA G ML Q E G++ G A +G L Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 364 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 397 + A + + +G+ + L LL L LRR Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 28.8 bits (64), Expect = 0.014 Identities = 18/98 (18%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 30 QGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISG--YMYNEKGENLSNTLIEK 87 + + + F+ YLGK+ ++ + G ++ + N+ G ++ N Sbjct: 21 ESVKPVSDFEL----NNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLN----- 71 Query: 88 EIYAPAGREMWQRMEQSHWLLDGKKDAPVIVYVFSDPF 125 Y+ + W+ E + ++G D + V F PF Sbjct: 72 RGYSEE-KGEWKEAEGKAYFVNGSTDGYLKVSFFG-PF 107
>PF03944#delta endotoxin Length = 633 Score = 27.3 bits (60), Expect = 0.009 Identities = 12/43 (27%), Positives = 24/43 (55%), Gaps = 3/43 (6%) Query: 21 IAPLDTQDIDLQINSSVEKQFG---DAIRTTILDVLARYNVRG 60 I+P+ ++ Q + + ++FG D++R + ARY +RG Sbjct: 496 ISPIHATQVNNQTRTFISEKFGNQGDSLRFEQNNTTARYTLRG 538
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 38.6 bits (90), Expect = 1e-05 Identities = 14/67 (20%), Positives = 33/67 (49%), Gaps = 2/67 (2%) Query: 155 NPFTNGHRYLIQQAAAQCDWLHLFLVKEDSSR--FPYEDRLDLVLKGTADIPRLTVHRGS 212 +P T GH +I++ D +++ +++ + + F ++RL+ + K A +P V Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69 Query: 213 EYIISRA 219 ++ A Sbjct: 70 GLTVNYA 76
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 34.6 bits (79), Expect = 9e-05 Identities = 43/179 (24%), Positives = 74/179 (41%), Gaps = 26/179 (14%) Query: 14 SLLFTAPVYAADEGSGEIHFKGEVIEAPCEIHQDDIDKEVELGQVTTSHINQS-HHSDAV 72 ++L + V+AAD + FKG++I C + + EV G + ++ QS + Sbjct: 15 AVLMSQHVHAADN----LTFKGKLIIPACTVQ----NAEVNWGDIEIQNLVQSGGNQKDF 66 Query: 73 AVDLLLVNCDLENSSNGSGGKISKVAVTFDSSAKTTGADPILNNTSTGEATGVGVRLMNK 132 VD+ NC + + VT S+ TG ++ NTST G+ + L N Sbjct: 67 TVDM---NCPYS---------LGTMKVTITSNG-QTGNSILVPNTSTASGDGLLIYLYNS 113 Query: 133 DQSNI----VLGTATPDIDLAPTSSEQTLNFFAWMEQIDQATPVTPGAVTANATYVLDY 187 + S I LG+ + T+ + + +A + + G +A AT V Y Sbjct: 114 NNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 538 bits (1388), Expect = 0.0 Identities = 215/743 (28%), Positives = 342/743 (46%), Gaps = 57/743 (7%) Query: 3 FNFDQANQQLNISIPQAWLAWHSENWTPPSTWKEGVAGVLMDYNLFASSYRPQDGSSSTN 62 D Q+LN++IPQA+++ + + PP W G+ L++YN +S + + G +S Sbjct: 147 AQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHY 206 Query: 63 LNAYGTAGINTGAWRLRSDYQLNHTDSDDNHEQSG--EISRTYLFRPLPQLGSKLTLGET 120 +G+N GAWRLR + ++ SD + + T+L R + L S+LTLG+ Sbjct: 207 AYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDG 266 Query: 121 DFSPNIFDGFSYTGAALASDDRMLPWELRGYAPQISGIAQTNATVTISQSGRVIYQKKVP 180 +IFDG ++ GA LASDD MLP RG+AP I GIA+ A VTI Q+G IY VP Sbjct: 267 YTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVP 326 Query: 181 PGPFIIDDLNQ-SVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRYKLAAGQPRPS 239 PGPF I+D+ G L V + E DG F V +S P L R+G RY + AG+ R Sbjct: 327 PGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG 386 Query: 240 MSHQTENETFFSNEVSWGMLSNTSLYSGLLLSGDDYHSAAMGIGQNMLWLGALSFDVTWA 299 + Q E FF + + G+ + ++Y G L+ D Y + GIG+NM LGALS D+T A Sbjct: 387 -NAQQEKPRFFQSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGALGALSVDMTQA 444 Query: 300 SSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYLDHKYND----- 354 +S G S RF Y+K ++ + + I L YR+S + ++A+ + N Sbjct: 445 NSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIET 504 Query: 355 ---------------SDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWWNADASTTANI 399 + A +++ + L+V Q + LY + HQT+W Sbjct: 505 QDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL-GRTSTLYLSGSHQTYWGTSNVDE-QF 562 Query: 400 TAGFNVDIGDWRDISISTSFNTTHYE-DKDRDNQIYLSISLPFGNGGR-----------V 447 AG N + DI+ + S++ T K RD + L++++PF + R Sbjct: 563 QAGLNT---AFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASA 619 Query: 448 GYDMQNSSHS-TTHRMSWNDTLDERN--SWGMSAGL-QSDRPDNGAQVSGNYQHLSSAGE 503 Y M + + T+ TL E N S+ + G ++G+ + G Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679 Query: 504 WDISGTYAANDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGVADIPVQGNLD 563 +I ++ ++D 
+ SG A G + N+ ++V G D V+ Sbjct: 680 ANIGYSH-SDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKAPGAKDAKVENQTG 736 Query: 564 Y-TNHFGIAVVPLISSYQPSTVAVNMNDLPDGVTVAENVIKETWIEGAIGYKSLASRSGK 622 T+ G AV+P + Y+ + VA++ N L D V + V GAI +R G Sbjct: 737 VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGI 796 Query: 623 DVNVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAENQKFTVVWG--DSQH 680 + + + + + P GA + +S S G+V + G +LSG+ K V WG ++ H Sbjct: 797 KLLMTLT-HNNKPLPFGAMV-TSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAH 854 Query: 681 CSLH--LPEH-MEDTANRLILPC 700 C + LP + +L C Sbjct: 855 CVANYQLPPESQQQLLTQLSAEC 877
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.3 bits (237), Expect = 7e-25 Identities = 37/133 (27%), Positives = 60/133 (45%), Gaps = 1/133 (0%) Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61 +L+ +D+ AIR L AL G V A DL++ D+ +PD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 EFIRDLRQWSP-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHS 120 + + +++ P +PV+V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 ATAAPDPLVKFSG 133 + G Sbjct: 124 RRPSKLEDDSQDG 136
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 28.1 bits (62), Expect = 0.031 Identities = 29/116 (25%), Positives = 48/116 (41%), Gaps = 4/116 (3%) Query: 96 DDVHHQDNAQETKELAGGQEENAQADAHEDCQDCEVSVATLRFTQRLL-HIFTYAAGDRK 154 +H +D E K LAG + E AQA A D V + R L F A R Sbjct: 221 SSIHARD--AEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRV 278 Query: 155 YLHHATREQRKHITALEMDQENSYVQNLLLAIRGMAEPTTLDNAALLRLTDAIKAE 210 E++K +TA E + N ++ + +++ + NA + R+ +A + Sbjct: 279 GAGKIREEKQKQVTASE-TRINRINADITQIQKAISQVSNNRNAGIARVHEAEENL 333
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 61.8 bits (150), Expect = 4e-17 Identities = 20/48 (41%), Positives = 34/48 (70%) Query: 23 QKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFVDYESRK 70 + +++ ++++CLT+++ +TRK LCE+R R G EVA F+ YES K Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>PF05272#Virulence-associated E family protein Length = 892 Score = 53.9 bits (129), Expect = 1e-09 Identities = 30/82 (36%), Positives = 48/82 (58%), Gaps = 2/82 (2%) Query: 4 SELSDLLWAQVDRVAPHLLPNGKIEGHEWVAGNVNGDKGNSLKVNLIGKKKWADFAEGDG 63 + L+D L + + P LP G + GHE+ G++ G KG+S KVN + KW DF+ G+ Sbjct: 12 TSLADALLTRAKDLLPEWLPGGVLVGHEYECGSLAGGKGDSCKVN-VTTGKWCDFSTGES 70 Query: 64 G-DMLDLWMACRGINLHQAMQE 84 G D+LDL+ G+ + +A + Sbjct: 71 GRDLLDLYAEIHGLKVSKAAAQ 92
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 29.6 bits (66), Expect = 3e-04 Identities = 12/33 (36%), Positives = 19/33 (57%) Query: 3 VKIQTIPELLIQTRGNMTEVSRMLNCNRATVRK 35 V+ + ++ TRGN T + M+ NR T+RK Sbjct: 58 VEQPLLDMVMQYTRGNQTRAALMMGINRGTLRK 90
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 26.8 bits (59), Expect = 0.014 Identities = 8/31 (25%), Positives = 15/31 (48%) Query: 71 LVDYYVFGMTFMTLARKHGCSDGYIGKKLQK 101 + DY V G + + K+ ++GY L + Sbjct: 51 MKDYLVGGHSRKEVCEKYQMNNGYFSTTLGR 81
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.4 bits (92), Expect = 6e-05 Identities = 14/194 (7%), Positives = 43/194 (22%), Gaps = 12/194 (6%) Query: 29 LNGAASDAERSSARMQRFMERQTQAARQTMQAASSAATAASAHAQTVEKNARAHERMARE 88 L ++A+ + Q + + Q S + + E Sbjct: 127 LTALGAEADTLKTQSSL---LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183 Query: 89 VEQTRLRVDALNQKMREEQAQARALAEAQDKAAAAFYRQIDSVKQAGAGLQELQRIQQQI 148 V + + + ++ Q + + +I+ + + Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD---F 240 Query: 149 RQARNSGGVGQQDYLALISEITAKTRALTQAE------EQATRQKAAFIRQLKEQATRQN 202 + + + L ++ L + E + + + + Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 Query: 203 LSSSELLRARAAQL 216 L L Sbjct: 301 LDKLRQTTDNIGLL 314 Score = 38.7 bits (90), Expect = 1e-04 Identities = 25/224 (11%), Positives = 63/224 (28%), Gaps = 30/224 (13%) Query: 545 NYQEQQKRRNAENAALNRMNETEAARHQREIARINAMQYADQAVRDAAIQRENERYEKAI 604 + + + A L + + E+ ++ ++ D+ + E R I Sbjct: 133 EADTLKTQSSLLQARLEQ-TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLI 191 Query: 605 KKNTRATRNDEATRLLLQYSQQQAQVEGQIAAARQSAGIATERMTEAHKQLLALQQRISD 664 K+ + Q+ Q E + R R+ + R+ D Sbjct: 192 KEQ------------FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239 Query: 665 LDGKKLTADEKSVLARKNELIQALTLLDVKQQELQKQTALNDLRKKTVQLTSQLADKERA 724 L + I +L+ + + ++ L + + Q+ S++ + Sbjct: 240 F--SSL---------LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE 288 Query: 725 LREQHNLDIATAGMGDKQRQRYQAQLRIRQEYRQQLQQLENDSR 768 + + + Q I +L + E + Sbjct: 289 YQ-LVTQLFKN----EILDKLRQTTDNIGL-LTLELAKNEERQQ 326 Score = 32.5 bits (74), Expect = 0.010 Identities = 31/238 (13%), Positives = 69/238 (28%), Gaps = 42/238 (17%) Query: 402 DPVNAAKALDNALHFLNATQLEQIRVLGEQGRSSDAARIAMSALAEETGKRTSDIDNNLN 461 + A L +LEQ R RS + ++ L +E + + L Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILS-RSIELNKLPELKLPDEPYFQNVSEEEVLR 186 Query: 462 ALGSTLQTLSDWWKQFWDAAMNIGREDSLDAQIDALQEKIQRAKKYPWTNASTQVEYDQQ 521 + S W Q + +N+ D A+ + +I R + ++ Sbjct: 187 
LTSLIKEQFSTWQNQKYQKELNL---DKKRAERLTVLARINRYEN--------LSRVEKS 235 Query: 522 RLNDLQEKKRRKDLQDAKAQAERNYQEQQKRRNAENAALNRMNETEAARHQREIARINAM 581 RL+D L +A A+ EQ+ + L +++ + Sbjct: 236 RLDDFSS------LLHKQAIAKHAVLEQENKYVEAVNELRVY-----------KSQLEQI 278 Query: 582 QYADQAVRDAAIQRENERYEKAIKKNTRATRNDEATRLLLQYSQQQAQVEGQIAAARQ 639 + + ++ + + K + T + ++A + Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTT-------------DNIGLLTLELAKNEE 323 Score = 31.0 bits (70), Expect = 0.030 Identities = 26/185 (14%), Positives = 57/185 (30%), Gaps = 13/185 (7%) Query: 13 IDAAEFKNEIPRIKNLLNGAASDAERSSARMQRFMERQTQAARQTMQAASSAATAASAHA 72 + E + +E R+ ++ Q + A Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216 Query: 73 QTVEKNARAHERMAREVEQTRLRVDALNQKMREEQAQARALAEAQDKAAAAFYRQIDSVK 132 TV +E ++R VE++RL + +QA A+ Q+ ++ Sbjct: 217 LTVLARINRYENLSR-VEKSRL---DDFSSLLHKQAIAKHAVLEQENKYVEAVNEL---- 268 Query: 133 QAGAGLQELQRIQQQIRQARNSGGVGQQDYLALISEITAKTRA-LTQAEEQATRQKAAFI 191 +L++I+ +I A+ + Q + I + +T + + K Sbjct: 269 --RVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE--LAKNEER 324 Query: 192 RQLKE 196 +Q Sbjct: 325 QQASV 329
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 134 bits (338), Expect = 2e-42 Identities = 63/200 (31%), Positives = 101/200 (50%), Gaps = 30/200 (15%) Query: 1 MRKLYAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNAPGSDDLNGINVKYRYEFT 60 M+K+ A + + A LA + + A+ ST++ GY + + + G N+KYRYE Sbjct: 1 MKKI-ACLSALAAVLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEED 56 Query: 61 DT-LGLVTSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGLSVRVNEWFSAYAMAGV 119 ++ LG++ SF+Y T S T D +N+++ + AG + R+N+W S Y + GV Sbjct: 57 NSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGV 108 Query: 120 AYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDNRHSNTSLAWGAGVQFNPTESVAIDLAYE 179 Y + T T+ HD S+ ++GAG+QFNP E+VA+D +YE Sbjct: 109 GYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSYE 151 Query: 180 GSGSGDWRTDGFIVGVGYKF 199 S +I GVGY+F Sbjct: 152 QSRIRSVDVGTWIAGVGYRF 171
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 42.1 bits (99), Expect = 1e-06 Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%) Query: 16 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 65 + L A A P + I MD ASG+ L ADE+ Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65 Query: 66 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 125 S K++ V + A +L + + +P V D ++V +L Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119 Query: 126 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 181 I S N A L V G + + +++G T ++T PG Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174 Query: 182 --STARDMA------LLGKAL 194 +T MA L + L Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 29.9 bits (67), Expect = 0.010 Identities = 21/54 (38%), Positives = 27/54 (50%), Gaps = 9/54 (16%) Query: 2 RRVFWLVAAALLLAGCAGEKGIVEKEGYQLDTRHQAQAAYPRIKVLVIHYTADD 55 R+V LV ALL AG A I K+G +LD Y ++ L HY +DD Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 73.3 bits (180), Expect = 8e-17 Identities = 70/363 (19%), Positives = 123/363 (33%), Gaps = 65/363 (17%) Query: 1 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------SGRNEAMGKLLEKMGAEFVPTD 51 MK LVTGA +G + + L + G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVRNFIHISSPSLYFDYHHHRNIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFT 164 +++ ++ SS S+Y + D + +A +K A+E + + S T Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174 Query: 165 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 222 LR +++GP + + + + M SI + + G D TY ++ A+ Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234 Query: 223 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 268 RVYNI N L +Q L D L I+ + +P D Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294 Query: 269 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVITLDEGIEKTAAW 328 + T D E +G+ P T+ +G++ W Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327 Query: 329 LRD 331 RD Sbjct: 328 YRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.6 bits (134), Expect = 2e-10 Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54 + LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 SWPDNLPALLQD--IDTVYFLVH------SMGEGGDFIAQERQLALNVRDALREVPVKQL 106 + + + L + V+ H S+ + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (296), Expect = 2e-38 Identities = 33/88 (37%), Positives = 57/88 (64%), Gaps = 1/88 (1%) Query: 2 TKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGSFSLHYRAPRTG 61 K +LI ++A + + + K AV + ++S LA+GE++++ GFG+F + RA R G Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61 Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRDR 89 RNP+TG++++++ VP FK GK L+D Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDA 89
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 71.6 bits (175), Expect = 6e-17 Identities = 43/176 (24%), Positives = 70/176 (39%), Gaps = 23/176 (13%) Query: 12 TFDPQQSAQIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQ 71 DP ++ ++ DMQN + +D S + ANI+ G+ +++ Sbjct: 25 VPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQLGIPVVY-- 76 Query: 72 NGWDEQYVEAGGPGSPNFHKSNALKTMRKQPQLQGKLLAKGSWDYQLVDELVPQPGDIVL 131 PGS N L G L G ++ +++ EL P+ D+VL Sbjct: 77 ---------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELAPEDDDLVL 121 Query: 132 PKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEHFGVVLEDA 187 K RYS F T L ++R G L+ TGI ++ T + F + + DA Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 8/165 (4%) Query: 10 GKRSRAVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAV 69 K + ++ IL AL FSQ G T L +IA+ AGV++ + ++F K L+ + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 70 LRQILDIWLAPLKAFREDF--APLAAIKEYIRLKLEVSRDYPQASRLFCM-----EMLAG 122 ++ F PL+ ++E + LE + + L + E + Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122 Query: 123 APLLMDELTGDLKALIDEKSALIAGWVKSGKL-APIDPQHLIFMI 166 ++ D + +++ L A + + ++ Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 27.1 bits (60), Expect = 0.004 Identities = 11/38 (28%), Positives = 14/38 (36%) Query: 14 RNLIVAWLGCFLTGAAFSLVMPFLPLYVEQLGVTGHSA 51 R LIV L L+MP LP + L + Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVT 42
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 53.3 bits (128), Expect = 8e-10 Identities = 61/356 (17%), Positives = 121/356 (33%), Gaps = 13/356 (3%) Query: 14 FLLIDNMLVVLGFFVVFPLIS--IRFVDQMGWAAVMVGIALGLRQFIQQGLGIFGGAIAD 71 +L L +G ++ P++ +R + GI L L +Q GA++D Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 72 RFGAKPMIVTGMLMRAAGFATMGIAHEPWLLWFSCLLSGLGGTLFDPPRSALVVKLIRPQ 131 RFG +P+++ + A +A M A W+L+ +++G+ G A + + Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDGD 127 Query: 132 QRGRFFSLLMMQDSASAVIGALLGSWLLQYDFRLVCATGAVLFVLCAAFNAWLLPAWKLS 191 +R R F + V G +LG + + A L L +LLP Sbjct: 128 ERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 192 TVRTPVREGMTRVMRDKRFVTYVLTLAGYYMLAVQVMLMLPIMV--------NDVAGAPS 243 R P+R + R+ + +A + + L+ + + + Sbjct: 188 E-RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDAT 246 Query: 244 AVKWMYAIEACLSLTLLYPIARWSEKHFRLEHRLMAGLLIMSLSMMPVGMVSGLQQLFTL 303 + A L I LM G++ + + + F + Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306 Query: 304 ICLFYIGSIIAEPARETLSASLADARARGSYMGFSRLGLAIGGAIGYIGGGWLFDL 359 + L G I PA + + + D +G G ++ +G + ++ Sbjct: 307 MVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 39.2 bits (91), Expect = 3e-05 Identities = 16/49 (32%), Positives = 28/49 (57%) Query: 354 TLTNGALEASNVDLSKELVNMIVAQRNYKSNAQTIKTQDQILNTRVNLR 402 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + +N+R Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 36.9 bits (85), Expect = 1e-04 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%) Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 349 bits (897), Expect = e-126 Identities = 232/232 (100%), Positives = 232/232 (100%) Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 427 bits (1100), Expect = e-152 Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%) Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63 F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72 Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123 ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131 Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183 T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191 Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239 L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250 Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299 +N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308 Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359 GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+ Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367 Query: 360 AKL 362 A+L Sbjct: 368 AEL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 503 bits (1297), Expect = 0.0 Identities = 308/313 (98%), Positives = 309/313 (98%) Query: 1 MISDSKLLASAAWDAQSLNELKAKASEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 MISDSKLLASAAWDAQSLNELKAKA EDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VVRYQNQTLSQLVQKAVPRNYDDSLPGDSRAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 VVRYQNQ LSQLVQKAVPRNYDDSLPGDS+AFLAQLSLPAQLASQQSGVPHHLILAQAAL Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGQVTEITTTEYENGEAKKVKAKFRVYSSYL 240 ESGWGQRQIRRENGEPSYNLFGVKASGNWKG VTEITTTEYENGEAKKVKAKFRVYSSYL Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQVLQDAGYATDPHYARKLTNMIQQMKSISDK 300 EALSDYVGLLTRNPRYAAVTTAASAEQGAQ LQDAGYATDPHYARKLTNMIQQMKSISDK Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 Query: 301 VSKTYSMNIDNLF 313 VSKTYSMNIDNLF Sbjct: 301 VSKTYSMNIDNLF 313
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.7 bits (61), Expect = 0.005 Identities = 16/48 (33%), Positives = 25/48 (52%) Query: 10 KIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPD 57 ++ G Q + QEE+ N F+E L ++ L +E+F TEI D Sbjct: 380 QLTGSQRALSQEEIQNKIDFMEFLAQNNAKLDNLSEKEKEKFRTEIKD 427
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 31.5 bits (71), Expect = 0.002 Identities = 23/124 (18%), Positives = 38/124 (30%), Gaps = 7/124 (5%) Query: 25 EPAPVEEVKPAPEQPAEPQQPVPTVPSVPTIPQQPGPIEHEDRTAPPAPHIRHYDWNGAM 84 +P V V PA +P + QP P P +P P P + Sbjct: 43 QPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEP------PKEAPVVIEKPKPKP 96 Query: 85 QPMVSKMLGADGVTAGSVLLVDSVNNRTNGSLNAAEATETLRNALANNGKFTLVSA-QQL 143 +P + V V+S + A T + A + ++ S + L Sbjct: 97 KPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRAL 156 Query: 144 SMAK 147 S + Sbjct: 157 SRNQ 160 Score = 31.5 bits (71), Expect = 0.002 Identities = 15/39 (38%), Positives = 18/39 (46%) Query: 23 QREPAPVEEVKPAPEQPAEPQQPVPTVPSVPTIPQQPGP 61 Q P PV E +P PE EP + P V P +P P Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 100
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 96.3 bits (239), Expect = 2e-22 Identities = 142/655 (21%), Positives = 226/655 (34%), Gaps = 93/655 (14%) Query: 137 DVDITTHGDNAHAIAARQGTVSFNQGEIYTTGPDAAIAKIYNGGTVTLKNTSAVAHQGSG 196 D + + +V Q + AAI + G VT+ S A G+ Sbjct: 285 PGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIR-VGRGARVTVSGGSLSAPHGNV 343 Query: 197 IVLESSIN--GQEATVDILSGSSLRSANEILYHKNETSNVTITDSEVSSAADVFINNIKG 254 I + Q A + I + + + L ++ V +T ++ AD + + Sbjct: 344 IETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLT---LTGGADAQGDIVAT 400 Query: 255 HLTVDATNSKITGSANISTDDN------THTYLSLS-DNSTWDIKADSTVSNLTV--DNS 305 L S G +++ T SLS DN+TW + +S V L + D S Sbjct: 401 ELPSIPGTS--IGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLASDGS 458 Query: 306 TVYISRADGRDVEPTRLTITENYVGNNGVLHLRTELDDDNSATDKVVINGNTSGTTRVKV 365 + A+ + +T N + +G+ + D +DK+V+ + SG R+ V Sbjct: 459 VDFQQPAEAGRFK----VLTVNTLAGSGLFRMNVFAD--LGLSDKLVVMQDASGQHRLWV 512 Query: 366 TNAGGSGAYTLNGIEIISVEGESNGEFI---KDSRIFAGAYEYSLTRGNTEATNKNWYLT 422 N+G S + N + ++ S F KD ++ G Y Y L N W L Sbjct: 513 RNSG-SEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLA----ANGNGQWSLV 567 Query: 423 NFQAT-------SGGETNSGGSSAPTVAPTPVLRPEAGSYVANLAAANTLFVMRLNDRAG 475 +A G AP P A AA NT V A Sbjct: 568 GAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGV----GLAS 623 Query: 476 ETRYIDPVTEQERSSRLWLRQIGGHNAWRDSNGQLRTTSHRY-------VS--QLGGDLL 526 Y + +R L L G AW Q + +R V+ +LG D Sbjct: 624 TLWYAESNALSKRLGELRLNPDAG-GAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHA 682 Query: 527 TGGFTDSDSWRLGVMAGYARDYNLTHSSVSDYRSKGSVRGYSAGLYATWFADDISKKGAY 586 W LG +AGY R + G G YAT+ AD G Y Sbjct: 683 VAVAGGR--WHLGGLAGYTR----GDRGFTGDGG-GHTDSVHVGGYATYIADS----GFY 731 Query: 587 IDSWAQYSWFKN----------SVKGDELAYESYSAKGATVSLEAGYGFALNKSFGLEAA 636 +D+ + S +N +VKG Y G SLEAG F Sbjct: 732 LDATLRASRLENDFKVAGSDGYAVKGK------YRTHGVGASLEAGRRFTHADG------ 779 Query: 637 KYTWIFQPQAQAIWMGVDHNAHTEANGSRIENDANNNIQTRLGFRTFIRTQEKNSGPHGD 696 W +PQA+ A+ ANG R+ ++ +++ RLG R + G Sbjct: 780 
---WFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAG----GR 832 Query: 697 DFEPFVEMNWIHNSK-DFAVSMNGVKVEQDGVSNLGEIKLGVNGNLNPAASVWGN 750 +P+++ + + V NG+ + E+ LG+ L S++ + Sbjct: 833 QVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYAS 887
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 49.7 bits (118), Expect = 7e-10 Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 1/114 (0%) Query: 34 YHLSNGMESKSVDTRSIYRELGATLSYNMRLGNGMEIEPCLKAAVRKEFVDDNRVKVNSD 93 Y +NG+ + S+ LG + + L G +++P +KA+V +EF V N Sbjct: 798 YRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGI 857 Query: 94 GNFVNDLSGRRGIYQAGIKASFSSTLSGHFGVGYSHGAGVESPWNAVAGVNWSF 147 + +L G R G+ A+ S + YS G + PW AG +S+ Sbjct: 858 AH-RTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 554 bits (1429), Expect = 0.0 Identities = 353/356 (99%), Positives = 354/356 (99%) Query: 1 MTRPIQASLDLQALKQNLSIVRQAAPHARVWSVVKANAYGHGIERIWSALGATDGFALLN 60 MTRPIQASLDLQALKQNLSIVRQAA HARVWSVVKANAYGHGIERIWSA+GATDGFALLN Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60 Query: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120 Query: 121 YLKVNSGMNRLGFQSDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180 YLKVNSGMNRLGFQ DRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180 Query: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240 Query: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300 Query: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 29.5 bits (66), Expect = 0.011 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 1/41 (2%) Query: 168 TEGTLWGGNLAMLISLIGTPWMPKIENGILVLEDINVHPFR 208 T G++ GG L L G W P +L+ + PFR Sbjct: 106 TSGSIAGGKL-NLAFAQGEAWTPGDNPDVLIFPGGEMTPFR 145
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 29.7 bits (66), Expect = 0.011 Identities = 19/63 (30%), Positives = 28/63 (44%), Gaps = 7/63 (11%) Query: 193 LMSTHHPLHANAIADSIIQVEPDGRVTQGLPTEQLTTNKLAAL------YRVSADQIHHH 246 + H L A+ I D I++V D G +L +LA L Y V +I+ H Sbjct: 119 MAVMHFSLTADRIDDDILKVIVDSMNHHGDARSKL-REELAELTAELKIYSVIQAEINKH 177 Query: 247 LSA 249 LS+ Sbjct: 178 LSS 180
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 39.5 bits (92), Expect = 1e-05 Identities = 65/301 (21%), Positives = 102/301 (33%), Gaps = 48/301 (15%) Query: 2 PITRRTFAQALASTLLLQSLPSFSQTVNRFASQSLPEAQNI--TRIVSAG-APADLLL-L 57 I+RR A+A + L + A I RIV+ P +LLL L Sbjct: 6 LISRRRLLTAMALSPL-------------LWQMNTAHAAAIDPNRIVALEWLPVELLLAL 52 Query: 58 AVAPEKMVGFSSFDFARQALI--PLPEHIRQLPRLGRLAGRASTLSLEGLMALHPDLVVD 115 + P G + R + PLP+ + + G + +LE L + P +V Sbjct: 53 GIVP---YGVADTINYRLWVSEPPLPDSVIDV-------GLRTEPNLELLTEMKPSFMVW 102 Query: 116 CGNTDETLISQARQVSEQTQIPWLLLN-----GKLAQSAEQLTTLGKTLGEEHRAAEQAN 170 + AR P N LA + + LT + L + A Sbjct: 103 SAGYGPSPEMLARIA------PGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLA 156 Query: 171 LASHFVGEAQA-FATSPAANLRFYAARGPRGLETGLQGSLHTEAAELLGLHNVAQ-IADR 228 F+ + F A L PR + SL E + G+ N Q + Sbjct: 157 QYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNF 216 Query: 229 HGLTQVSMENLLRWQ-PDIILVQEAVTADF--IRRDPLWQGVKAVAEQRILFLSGLPFGW 285 G T VS++ L ++ D++ + D + PLWQ + V R +P W Sbjct: 217 WGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGR---FQRVPAVW 273 Query: 286 L 286 Sbjct: 274 F 274
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 31.2 bits (70), Expect = 4e-04 Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%) Query: 41 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97 P+PA G GS E + EA W +P A V +V KV Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 249 bits (638), Expect = 4e-86 Identities = 233/239 (97%), Positives = 233/239 (97%), Gaps = 4/239 (1%) Query: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 60 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIE----KPKPKPKPVKKVQEQPKRDVKPVESR 116 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIE KPKPKPKPVKKVQEQPKRDVKPVESR Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120 Query: 117 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 176 PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180 Query: 177 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 235 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 87.0 bits (215), Expect = 1e-22 Identities = 62/239 (25%), Positives = 105/239 (43%), Gaps = 23/239 (9%) Query: 13 RIILVTGASDGIGREAAMTYARYGATVILLGRNEEKLRQVASHINEETGRQPQWFILDLL 72 +I +TGA+ GIG A T A GA + + N EKL +V S + E R + F D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADV- 66 Query: 73 TCTSENCQQLAQRIVVNYPRLDGVLHNAGLLGDVCPMSEQNPQVWQDVMQVNVNATFMLT 132 S ++ RI +D +++ AG+L + + + W+ VN F + Sbjct: 67 -RDSAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 133 QALLPLLLKSDAGSLVFTSSSVGRQGRANWGAYAASKFATEGMMQVLADEYQQR-LRVNC 191 +++ ++ +GS+V S+ R + AYA+SK A + L E + +R N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 192 INPGGTRTAMRASAFPTEDPQ------------------KLKTPADIMPLYLWLMGDDS 232 ++PG T T M+ S + E+ KL P+DI L+L+ + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 26.4 bits (58), Expect = 0.005 Identities = 9/43 (20%), Positives = 16/43 (37%) Query: 12 GLFYGYDFQNGLSVSLEYAFEWQDHDEGDSDKFHYAGVGVNYS 54 G F GY + + Y + + +G + Y GV + Sbjct: 59 GAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLT 101
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 316 bits (810), Expect = e-104 Identities = 122/422 (28%), Positives = 188/422 (44%), Gaps = 44/422 (10%) Query: 130 NGFNFLRWLESEPQDSHNEHVVINGQNFLMEITPVYLQDENDQH----VLTGAVVMLRST 185 N F+ L ++ +V++ QN M + D LT + ++ Sbjct: 61 NAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 186 IRMGRQLQNVAAQDVSAFSQIVAVSPKMKHVVEQAQKLAMLSAPLLITGDTGTGKDLFAY 245 + ++ + D +V S M+ + +L L+ITG++GTGK+L A Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 246 ACHQASPRAGKPYLALNCASIPEDAVESELFGH-------APEGKKGFFEQANGGSVLLD 298 A H R P++A+N A+IP D +ESELFGH A G FEQA GG++ LD Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 299 EIGEMSPRMQAKLLRFLNDGTFRRVGEDHEVHVDVRVICATQKNLVELVQKGVFREDLYY 358 EIG+M Q +LLR L G + VG + DVR++ AT K+L + + +G+FREDLYY Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 359 RLNVLTLNLPPLRDCPQDIMPLTELFVARFADEQGVPRPKLAADLNTVLTRYAWPGNVRQ 418 RLNV+ L LPPLRD +DI L FV + E G+ + + ++ + WPGNVR+ Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRE 357 Query: 419 LKNAIYRALTQLDGYELRPQDILLPDYDAATVAVGEDAM--------------------- 457 L+N + R + + I + E A Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 458 --------EGSLDEITSRFERSVLTQ-LYRNYPSTRKLAKRLGVSHTAIANKLREYGLSQ 508 G D + + E ++ L + K A LG++ + K+RE G+S Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSV 477 Query: 509 KK 510 + Sbjct: 478 YR 479
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 25.7 bits (56), Expect = 0.013 Identities = 9/18 (50%), Positives = 9/18 (50%) Query: 1 MSQQPQQPQQPQQPQQPQ 18 M Q QQ Q Q QQ Q Sbjct: 336 MPPQAQQQQGQGQQQQAQ 353 Score = 24.1 bits (52), Expect = 0.047 Identities = 8/17 (47%), Positives = 8/17 (47%) Query: 3 QQPQQPQQPQQPQQPQQ 19 P Q QQ Q Q QQ Sbjct: 335 VMPPQAQQQQGQGQQQQ 351
>adhesinb#Adhesin B signature. Length = 310 Score = 331 bits (849), Expect = e-116 Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%) Query: 9 MLLGCLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68 +G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72 Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121 P D+K+ A LI NG+NLE WF + ++ VS GV + + Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132 Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181 +GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++ Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192 Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241 +P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++ Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252 Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297 F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+ Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 113 bits (283), Expect = 2e-29 Identities = 84/397 (21%), Positives = 171/397 (43%), Gaps = 14/397 (3%) Query: 27 MAVLDGAIANVALPTIATDLHATPASSIWVVNAYQIAIVISLLSFSFLGDMFGYRRIYKC 86 +VL+ + NV+LP IA D + PAS+ WV A+ + I + L D G +R+ Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84 Query: 87 GLVVFLLSSLFCALSDS-LQMLTLARVIQGFGGAALMSVNTALIRLIYPQRFLGRGMGIN 145 G+++ S+ + S +L +AR IQG G AA ++ ++ P+ G+ G+ Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 146 SFIVAVSSAAGPTIAAAILSIASWKWLFLINVPLGIIALLLAMRFLPPNGSRASKPRFDL 205 IVA+ GP I I W +L LI + + II + M+ L K FD+ Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRI--KGHFDI 201 Query: 206 PSAVMNALTFGLLITALSGFAQGQSLTLIAAELVVMVVVGIFFIRRQLSLPVPLLPVDLL 265 ++ + I F S++ L+V V+ + F++ + P + L Sbjct: 202 KGIIL----MSVGIVFFMLFTTSYSISF----LIVSVLSFLIFVKHIRKVTDPFVDPGLG 253 Query: 266 RIPLFSLSICTSVCSFCAQMLAMVSLPFYLQTVLGRSEVETG-LLLTPWPLATMVMAPLA 324 + F + + F + +P+ ++ V S E G +++ P ++ ++ + Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313 Query: 325 GYLIERVHAGLLGALGLFIMATGLFSLVLLPASPADINIIWPMILCGAGFGLFQSPNNHT 384 G L++R + +G+ ++ + L + + + ++ G ++ + Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF-MTIIIVFVLGGLSFTKTVISTI 372 Query: 385 IITSAPRERSGGASGMLGTARLLGQSSGAALVALMLN 421 + +S ++ +G +L L + +G A+V +L+ Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 31.2 bits (71), Expect = 0.002 Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 74 SNKAELTAIIARETGKPRWEAATEVTAMINKIAISIKAYHVRTGEQRSEMPDGAASLRHR 133 +NK +L A +A T + ++A V A+ + ++ + GE+ + G +R R Sbjct: 2 ANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK-----GEKVQLIGFGNFEVRER 56 Query: 134 P 134 Sbjct: 57 A 57
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (301), Expect = 3e-39 Identities = 34/89 (38%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.008 Identities = 9/22 (40%), Positives = 13/22 (59%) Query: 28 ILHLVGPNGAGKSTLLAQMAGM 49 + L G G GKSTL+ + G+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 297 bits (761), Expect = 3e-93 Identities = 116/453 (25%), Positives = 200/453 (44%), Gaps = 73/453 (16%) Query: 363 RAIGHRIGAGPVKVIHDISEMNRIEPGDVLVTDMTDPDWEPIMKK-ASAIVTNRGGRTCH 421 R +GH IG ++ + E ++ D+T D + K+ T+ GGRT H Sbjct: 137 RVLGHLIGVE----TGSLATIA--EETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSH 190 Query: 422 AAIIARELGIPAVVGCGDATERMKDGENVTVSCAEG---------DTGYVYAELLEFSVK 472 +AI++R L IPAVVG + TE+++ G+ V V EG + + F + Sbjct: 191 SAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQ 250 Query: 473 SSSVETMPDLP--------LKVMMNVGNPDRAFDFACLPSEGVGLARLEFII-NRMIGVH 523 + P +++ N+G P EG+GL R EF+ +R Sbjct: 251 KQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDRD---- 306 Query: 524 PRALLEFDDQEPQLQNEIREMMKGFDSPREFYVGRLTEGIATLGAAFYPKRVIVRLSDFK 583 + +E Q + +E+++ K V++R D Sbjct: 307 -----QLPTEEEQFE-AYKEVVQ----------------------RMDGKPVVIRTLDIG 338 Query: 584 SNEYANLVGGERYEPDEENPMLGFRGAGRYVSDSFRDCFALECEAVKRVRNDMGLTNVEI 643 ++ + + P E NP LGFR + +D F + A+ R N+++ Sbjct: 339 GDKELSYL----QLPKELNPFLGFRAIRLCL--EKQDIFRTQLRALLRAS---TYGNLKV 389 Query: 644 MIPFVRTVD---QAKAVVEELARQGLKRG---ENGLKIIMMCEIPSNALLAEQFLEYFDG 697 M P + T++ QAKA+++E + L G + +++ +M EIPS A+ A F + D Sbjct: 390 MFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDF 449 Query: 698 FSIGSNDMTQLALGLDRDSGVVSELFDERNDAVKALLSMAIRAAKKQGKYVGICGQGPSD 757 FSIG+ND+ Q + DR + VS L+ + A+ L+ M I+AA +GK+VG+CG+ D Sbjct: 450 FSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD 509 Query: 758 HEDFAAWLMEEGIDSLSLNPDTVVQTWLSLAEL 790 L+ G+D S++ +++ L +L Sbjct: 510 E-VAIPLLLGLGLDEFSMSATSILPARSQLLKL 541
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 60.6 bits (147), Expect = 5e-17 Identities = 19/46 (41%), Positives = 32/46 (69%) Query: 4 QKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEP 49 + +++ ++++CLT+++ +TRK LCE+R R G EVA F AYE Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 53.3 bits (128), Expect = 7e-10 Identities = 40/192 (20%), Positives = 83/192 (43%), Gaps = 8/192 (4%) Query: 36 LSDIAQSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95 L DIA F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154 F+ S F++L+++R A F ++ + R P R +A LI + A+ +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PLGRIVGQYFGWRMTFFAIGIGALVTLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214 +G ++ Y W I + ++T+ L+KLL + LMS+ Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209 Query: 215 YLLTVVVVTAHY 226 ++ ++ T Y Sbjct: 210 GIVFFMLFTTSY 221
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 29.4 bits (66), Expect = 0.015 Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 22 GQGKVADYIPALATVDGSRLGI-AICTVDGQLFQAGDAQERFSIQSISKVL 71 + + I + R+G+ + G+ A A ERF + S KV+ Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.9 bits (64), Expect = 0.016 Identities = 12/32 (37%), Positives = 19/32 (59%), Gaps = 1/32 (3%) Query: 151 AYAASKAALDNMTRSFARKLAPE-VKVNSIAP 181 AYA+SKAA T+ +LA ++ N ++P Sbjct: 156 AYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 1e-14 Identities = 27/131 (20%), Positives = 60/131 (45%), Gaps = 3/131 (2%) Query: 3 TIVFVEDDAEVGSLIAAYLAKHDMQVTVEPRGDQAEETILRENPDLVLLDIMLPGKDGMT 62 TI+ +DDA + +++ L++ V + I + DLV+ D+++P ++ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 ICRDLRAKWSG-PIVLLTSLDSDMNHILALEMGACDYILKTTPPAVLLARLR--LHLRQN 119 + ++ P++++++ ++ M I A E GA DY+ K L+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 120 EQATLTKGLQE 130 + L Q+ Sbjct: 125 RPSKLEDDSQD 135
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.1 bits (83), Expect = 2e-05 Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 1/61 (1%) Query: 78 RHTVEHSVYVHPDHQGKGLGRKLLSRLIDEARDCGKHVMVAGIESQNQASLHLHQSLGFV 137 +E + V D++ KG+G LL + I+ A++ ++ + N ++ H + F+ Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 138 V 138 + Sbjct: 148 I 148
>PF06291#Lambda prophage Bor protein Length = 102 Score = 31.9 bits (72), Expect = 2e-04 Identities = 17/51 (33%), Positives = 25/51 (49%), Gaps = 6/51 (11%) Query: 1 MKKVAAFVALSLLMAGC------VSNDKIAVTPEQLQHHRFVLESVNGKPV 45 MKK+ AL++L+ GC V N AVTP++ H F + + K Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKT 56
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 30.8 bits (69), Expect = 1e-04 Identities = 12/31 (38%), Positives = 12/31 (38%) Query: 10 PVPEPIPGDPVPVPDPIPRPQPMPDPPPDEE 40 P P P PG P P P P PP E Sbjct: 575 PKPAPQPGPQPPQPPQPQPEAPAPQPPAGRE 605 Score = 26.6 bits (58), Expect = 0.005 Identities = 11/23 (47%), Positives = 11/23 (47%) Query: 19 PVPVPDPIPRPQPMPDPPPDEEP 41 P P P P P PQP P P E Sbjct: 573 PAPKPAPQPGPQPPQPPQPQPEA 595
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 72.9 bits (179), Expect = 6e-18 Identities = 28/137 (20%), Positives = 54/137 (39%), Gaps = 14/137 (10%) Query: 16 RQAAVVRAPIDGIVANRSAHT-GSWVEGGTSLVSLVPVSE-LWVDANYKENQIAGMKPGM 73 +QA+V+RAP+ V HT G V +L+ +VP + L V A + I + G Sbjct: 325 QQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQ 384 Query: 74 KAEIRADILKGEVFH---GHIESLSPATGASFSLIPIENATGNFTKIVQRVPVRIAFDDA 130 A I+ + + G +++++ + G ++ + Sbjct: 385 NAIIKVEAFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN 437 Query: 131 KELKQLLRPGLSVTVSV 147 K + L G++VT + Sbjct: 438 KNIP--LSSGMAVTAEI 452
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 64.5 bits (157), Expect = 2e-14 Identities = 40/212 (18%), Positives = 79/212 (37%), Gaps = 16/212 (7%) Query: 11 VVAIGILLTGVVFFIW----RVSKGRFIQTTDDAYIGGNITTVASKVSGYISAIEVRDNQ 66 +VA I+ V+ FI +V G + + + I V++ + Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIV--ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116 Query: 67 SVKKGDIILRLDDRDYRANVARLEAKIKSSKANLEGIQATITMQQ-----SIIQSASETW 121 SV+KGD++L+L A+ + ++ + ++ Q + + + Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176 Query: 122 QAVKHEEQKRLRD--TERYEKLAQSAAISQQIIDNARFDYQQVAAKERKAANDFLVEKQR 179 Q V EE RL E++ + +D R + V A+ + N VEK R Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236 Query: 180 LAVLSAQEEN---VRASIEEVQAALTQALLDL 208 L S+ + ++ E + +A+ +L Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNEL 268
>PilS_PF08805#PilS N terminal Length = 185 Score = 29.5 bits (66), Expect = 0.007 Identities = 12/46 (26%), Positives = 18/46 (39%) Query: 29 AASNCWSNHVGIIIGHNGEDFLVAESRVPLSTITTLSRFIKRSSNQ 74 +A N W V I + F V E+ VP + ++ SS Sbjct: 110 SAKNPWGGSVTITTSSDKYSFNVVEANVPQKNCMAMVNALRSSSAI 155
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.2 bits (159), Expect = 9e-14 Identities = 35/188 (18%), Positives = 72/188 (38%), Gaps = 23/188 (12%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAAKASLAAHKPLSAPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179 +AE R +K + + +G S E R + + + Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMP-----------------LVGRSAAMQEIYRVLARLMQ 158 Query: 180 LSSPALLI 187 ++ Sbjct: 159 TDLTLMIT 166
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.009 Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 11/93 (11%) Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIEE 105 L +SSP A P + G + ++ PGGGDD GE +++ Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435 Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135 R+ + L+ R L + + S P L Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468
>PF05844#YopD protein Length = 295 Score = 33.1 bits (75), Expect = 0.001 Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103 ++LL +L+R+ K+R++G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.029 Identities = 15/40 (37%), Positives = 16/40 (40%), Gaps = 10/40 (25%) Query: 18 PGVKALTDISFDCYAGQVHALMGENGAGKSTLLKILSGNY 57 PG K FD L G G GKSTL+ L G Sbjct: 591 PGCK------FDY----SVVLEGTGGIGKSTLINTLVGLD 620
>SECA#SecA protein signature. Length = 901 Score = 60.3 bits (146), Expect = 9e-13 Identities = 27/70 (38%), Positives = 31/70 (44%), Gaps = 5/70 (7%) Query: 155 RVEKMSPEAFEESVDAIRLAALDLH---AYWMAHPQEKAVQQPI--KAEEKPGRNDPCPC 209 +V+ PE EE R+ A L A E K GRNDPCPC Sbjct: 828 KVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPC 887 Query: 210 GSGKKFKQCC 219 GSGKK+KQC Sbjct: 888 GSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 5e-18 Identities = 23/113 (20%), Positives = 47/113 (41%), Gaps = 2/113 (1%) Query: 4 VLLVDDHELVRAGIRRILEDIKGIKVVGEASCGEDAVKWCRTNAVDVVLMDMSMPGIGGL 63 +L+ DD +R + + L G V ++ +W D+V+ D+ MP Sbjct: 6 ILVADDDAAIRTVLNQALSR-AGYDVRITSN-AATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 64 EATRKIARSTADVKIIMLTVHTENPLPAKVMQAGAAGYLSKGAAPQEVVSAIR 116 + +I ++ D+ +++++ K + GA YL K E++ I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>FLAGELLIN#Flagellin signature. Length = 507 Score = 234 bits (599), Expect = 9e-73 Identities = 260/551 (47%), Positives = 311/551 (56%), Gaps = 47/551 (8%) Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61 AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRIRELTVQASTGTNSDSDLDSIQDEIKSRLD 121 TQA+RNANDGIS+AQTTEGAL+EINNNLQR+REL+VQA+ GTNSDSDL SIQDEI+ RL+ Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVNVLAKDGSMKIQVGANDGQTITIDLKKIDSDTLGLNGFNVNGGGAV 181 EIDRVS QTQFNGV VL++D MKIQVGANDG+TITIDL+KID +LGL+GFNVNG Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 A---NTAASKADLVAANATVVGNKYTVSAGYDAAKASDLLAGVSDGDTVQATINNGFGTA 238 ++ K V NKY V A V D V A N T Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAA-NGQLTTD 239 Query: 239 ASATNYKYDSASKSYSFDTTTASAADVQKYLTPGVGDTAKGTITIDGSAQDVQISSDGKI 298 + N D + S T + A GDT +GK+ Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299 Query: 299 TASNGDKLYIDTTGRLTKNGSGASLTEASLSTLAANNTKATTIDIGGTSISFTGNSTTPD 358 + T NG +LT A ++ AAN AT S T D Sbjct: 300 ST--------------TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFD 345 Query: 359 TITYSVTGAKVDQAAFDKAVSTSGNNVDFTTAGYSVNGTTGAVTKGVDSVYVDNNEALTT 418 T + + D A + S V+ + G + Sbjct: 346 DKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLA---------------- 389 Query: 419 SDTVDFYLQDDGSVTNGSGKAVYKDADGKLTTDAETKAATTADPLKALDEAISSIDKFRS 478 + DA +TA+PL ++D A+S +D RS Sbjct: 390 -------------GKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRS 436 Query: 479 SLGAVQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEVSNMSKAQIIQQAGNSVLAKAN 538 SLGA+QNR DSA+TNL NT TNL+ A+SRI+DADYATEVSNMSKAQI+QQAG SVLA+AN Sbjct: 437 SLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQAN 496 Query: 539 QVPQQVLSLLQ 549 QVPQ VLSLL+ Sbjct: 497 QVPQNVLSLLR 507
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 27.9 bits (62), Expect = 0.024 Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 4/52 (7%) Query: 105 PCFAWLADRFGRRRVYITGALIGTLSAFPFFMALEAQSIFWIVFFSIMLANI 156 P L+DRFGRR V + + A W+++ ++A I Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATA----PFLWVLYIGRIVAGI 108
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.1 bits (62), Expect = 0.002 Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 3/34 (8%) Query: 19 ALRLRFE---DKLTIRAIAQRLGLSHSTIHTLFQ 49 ALRL + ++ IA+ G++ I+ F+ Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFK 53
>FLAGELLIN#Flagellin signature. Length = 507 Score = 25.0 bits (54), Expect = 0.029 Identities = 14/77 (18%), Positives = 31/77 (40%), Gaps = 9/77 (11%) Query: 2 KSMDKISTGIAYGTSAGSAGYWFL--------QWLDQVSPSQWAAIGVLGSLVLGFLTYL 53 +++++S+G+ ++ A + + L Q S + I + G L + Sbjct: 26 SAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIA-QTTEGALNEI 84 Query: 54 TNLYFKIREDKRKAARG 70 N ++RE +A G Sbjct: 85 NNNLQRVRELSVQATNG 101
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 64.1 bits (156), Expect = 5e-18 Identities = 19/46 (41%), Positives = 32/46 (69%) Query: 23 QKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEP 68 + +++ ++++CLT+++ +TRK LCE+R R G EVA F AYE Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.4 bits (97), Expect = 7e-06 Identities = 9/74 (12%), Positives = 33/74 (44%), Gaps = 1/74 (1%) Query: 16 LRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRMLFGQSSEKKRHKLENQIRQAEKRLS 75 + +Q+++ + ++ Y+ ++E++++++ + + K L+ ++RQ + Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIG 312 Query: 76 ELENRLNTARNLLE 89 L L + Sbjct: 313 LLTLELAKNEERQQ 326
>PF01206#SirA family protein Length = 76 Score = 57.5 bits (139), Expect = 4e-15 Identities = 15/72 (20%), Positives = 35/72 (48%), Gaps = 1/72 (1%) Query: 4 KKLDVVTQVCPFPLIEAKAALAEMASGDELVIEFDCTQATEAIPQWAAEEGHAITDYQQI 63 + LD CP P+++AK LA M +G+ L + + + ++ + GH + + ++ Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65 Query: 64 GDAAWSITVQKA 75 + +++A Sbjct: 66 DG-TYHFRLKRA 76
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.6 bits (64), Expect = 0.025 Identities = 25/126 (19%), Positives = 51/126 (40%), Gaps = 16/126 (12%) Query: 95 LVDSALAHRIPRIIFTSSTSVYGDAQG---TVKETT--PRNPVTNSGRVLEELEDWLHNL 149 +++ ++I +++ SS+SVYG + + ++ P + + + E + +L Sbjct: 109 ILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL 168 Query: 150 PGTSVDILRLAGLVGP-GRHPGRFF-------AGKTAP---DGEHGVNLVHLEDVIGAIT 198 G LR + GP GR F GK+ G+ + +++D+ AI Sbjct: 169 YGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228 Query: 199 LLLQAP 204 L Sbjct: 229 RLQDVI 234
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 46.7 bits (111), Expect = 5e-08 Identities = 33/175 (18%), Positives = 64/175 (36%), Gaps = 35/175 (20%) Query: 1 MNILLFGKTGQVGWELQRALAPLGN-LIALDVHSTDY--------------------CGD 39 M L+ G G +G+ + + L G+ ++ +D + Y D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 40 FSNPEGVAETVKKIRPDVIVNAAAHTAVDKAESEP------NFAQLLNATCVEAIAKAAN 93 ++ EG+ + + + + AV + P N LN + + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLN---ILEGCRHNK 117 Query: 94 EVGAWVIHYSTDYVFPGNGDTPWLETDATA-PLNVYGETKLAGEKALQEHCAKHL 147 + +++ S+ V+ N P+ D+ P+++Y TK A E L H HL Sbjct: 118 -IQH-LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE--LMAHTYSHL 168
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 184 bits (470), Expect = 7e-58 Identities = 89/360 (24%), Positives = 149/360 (41%), Gaps = 48/360 (13%) Query: 1 MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLT--YAGNL-ESLADVSDSERYAFEHA 57 MK LVTG AGFIG V + ++ VV +D L Y +L ++ ++ + F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 58 DICDAVAMSRIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSA 117 D+ D M+ +FA + V V S+ P A+ ++N+ G +LE R+ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116 Query: 118 LNDEKKKSFRFHHISTDEVYGDLPHPDEANNNEALPLFTETTAYAPSSPYSASKASSDHL 177 K + S+ VYG N +P T+ + P S Y+A+K +++ + Sbjct: 117 ------KIQHLLYASSSSVYGL---------NRKMPFSTDDSVDHPVSLYAATKKANELM 161 Query: 178 VRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGNQIRDWLYVE 237 + YGLP YGP+ P+ + LEGK++ +Y G RD+ Y++ Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYID 221 Query: 238 D-------------HARALYTVVTEGKA-----GETYNIGGHNEKKNIDVVLTICDLLDE 279 D HA +TV T A YNIG + + +D + + D L Sbjct: 222 DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG- 280 Query: 280 IVPKEKSYREQITYVADRPGHDRRYAIDADKISRELGWKPQETFESGIRKTVEWYLANTN 339 + +K+ +PG + D + +G+ P+ T + G++ V WY Sbjct: 281 -IEAKKNMLPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 104 bits (262), Expect = 1e-27 Identities = 76/353 (21%), Positives = 122/353 (34%), Gaps = 42/353 (11%) Query: 6 LITGVTGQDGSYLAEFLLEKGYEVHGIKRRASSFNTERVDHIYQDPH--------TCNPK 57 L+TG G G ++++ LLE G++V GI + N Y D P Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQPG 53 Query: 58 FHLHYGDLSDTSNLTRILREVQPDEVYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAI 117 F H DL+D +T + + V+ V S E+P AD + G L +LE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 118 RFLGLEKKTRFYQASTSELYGLVQEIPQKETTPF-YPRSPYAVAKLYAYWITVNYRESYG 176 R ++ AS+S +YGL +++P +P S YA K + Y YG Sbjct: 114 RHNKIQ---HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 177 MYACNGILFNHESPRRGETFVTRKITRAIANIAQGLESCLYLGNMDSLRDWGHAKDYVKM 236 + A F P K T+A+ G +Y RD+ + D + Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDDIAEA 226 Query: 237 QWMMLQQEQPEDFVIATGVQYSVRQFVEMAAAQLGIKLRFEGTGVEEKGIVVSVTGHDAP 296 + D +G + E + DA Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIG------NSSPVELMDYIQAL-EDAL 279 Query: 297 GVKPGDVIIAVDPRY--FRPAEVETLLGDPTKAHEKLGWKPEITLREMVSEMV 347 G++ +P +V D +E +G+ PE T+++ V V Sbjct: 280 GIE-------AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFV 325
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 35.5 bits (82), Expect = 2e-05 Identities = 30/127 (23%), Positives = 52/127 (40%), Gaps = 20/127 (15%) Query: 18 RSAEECKIALSSV--AETRASLPFISNELAT------LISQRGLESALSQPLARILEQVQ 69 +AE K + S + + LA ++ + AL +PL I+ V Sbjct: 213 ATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVM 272 Query: 70 LALDNAQEKPDV--------IYLTGGSARSPLIKKALAEQLPGIPIAGGDD-FGSVTAGL 120 +AL+ Q P++ + LTGG A + + L E+ GIP+ +D V G Sbjct: 273 VALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAEDPLTCVARGG 329 Query: 121 ARWAEVV 127 + E++ Sbjct: 330 GKALEMI 336
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.1 bits (120), Expect = 3e-09 Identities = 32/129 (24%), Positives = 57/129 (44%), Gaps = 20/129 (15%) Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGGDEANAQAQGILERAAKRAGFKDVVF 190 M+ H I+Q + ++ P+ + E A + +A+ AG ++V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140 Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGYRI 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.0 bits (104), Expect = 5e-07 Identities = 33/167 (19%), Positives = 64/167 (38%), Gaps = 11/167 (6%) Query: 61 ALAQTQGQLAKDKATLANARRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEAS 120 + +L K+ L ++ AK +L + L + T + Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILS----AKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315 Query: 121 --VASAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSGDTTGIVVITQTHPIDLVFTLPE 177 +A + + S I APV +V LK G +++ +T +V++ + +++ + Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALVQN 374 Query: 178 SDIATVVQAQKAGKPLMVEAWDRTNSKKL-SEGTLLSLDNQIDATTG 223 DI + Q A + VEA+ T L + ++LD D G Sbjct: 375 KDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419 Score = 43.7 bits (103), Expect = 8e-07 Identities = 21/122 (17%), Positives = 47/122 (38%), Gaps = 13/122 (10%) Query: 15 GTITAA-NTVTVRSRVDGQLMALHFQEGQQVKAGDLLAEIDPSQFKVALAQTQGQLAKDK 73 G +T + + ++ + + + +EG+ V+ GD+L ++ + K + Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADTLKTQ 140 Query: 74 ATLANARRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVASAQLQLDWSRI 133 ++L AR + RYQ L+++ EL+ L E + L + Sbjct: 141 SSLLQARLEQTRYQILSRS-----IELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195 Query: 134 TA 135 + Sbjct: 196 ST 197
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 907 bits (2345), Expect = 0.0 Identities = 287/1035 (27%), Positives = 502/1035 (48%), Gaps = 40/1035 (3%) Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLTPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236 A+R+ L L ++ DV + N + G AL I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRAKLPELQETIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355 T +I+AKL ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530 +LV+L LTP +C +LK + GF Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 
VVLLGTIALNI----SIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 582 ++ +A + +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 583 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRDERS---ETAQQIIDRLRVKLAKEPGAN 637 + +V V GF+ G N+GM F++LKP +ER+ +A+ +I R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 638 LFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 692 + + I G + ++ L D + + R +L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 693 QQDNGAEMNLVYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 752 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 753 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 812 ++K++V + G+ +P S F + + I G S D Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 813 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 872 A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 873 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 932 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 933 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLEITIVGGLVMSQLL 992 EA A +R RPI+MT+LA + G LPL +S G GS + + I ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 993 TLYTTPVVYLFFDRL 1007 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 78.3 bits (193), Expect = 1e-16 Identities = 77/446 (17%), Positives = 162/446 (36%), Gaps = 26/446 (5%) Query: 588 VDNVTGFTGGS-RVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGANLFLMAVQDI 646 +DN+ + S S + +T + + Q+ ++L++ P + Q I Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127 Query: 647 RVGGRQSNASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQQDNGAE-- 699 V S+ +SD+ 
++ ++ L+ L + DV GA+ Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183 Query: 700 MNLVYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 755 M + D D + + + + + + T P Q + R+ Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243 Query: 756 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 813 + +N++G + L A+ N + G AA +L D Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302 Query: 814 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 870 + AI + +L P ++ + T Q +++ V + AI V++V+ + ++ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 871 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 930 L +P +G L F + + + G++L IG++ +AI++V+ Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 931 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLEITIVGGLVMSQ 990 L P+EA ++ ++ + +P+ GG + + ITIV + +S Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 991 LLTLYTTPVVYLFFDRLRLRFSRKPK 1016 L+ L TP + + + K Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK 508
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 125 bits (315), Expect = 1e-33 Identities = 97/429 (22%), Positives = 187/429 (43%), Gaps = 23/429 (5%) Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMLMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79 F L+ ++N +LP +A + P V +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138 I++ GS+ + + LL +AR +QG G A + + V + +P+E A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLM-LMPNYTMQTRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP+ I + LM L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHARNNNRALFSLKL 257 G +L++VG+ L + + V V++ ++++ H R L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 + F +G+ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372 +V+R G VL +G++ +++ F+T + L W+ + V L G+ S + Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367 Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHISVDSGTTQTVF 432 ++T+ L A +G SLL+ LS G+ I G LL + + Q+ + Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427 Query: 433 MYTWLSMAF 441 +Y+ L + F Sbjct: 428 LYSNLLLLF 436
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.3 bits (71), Expect = 0.009 Identities = 27/93 (29%), Positives = 34/93 (36%), Gaps = 27/93 (29%) Query: 173 LATLLAALATFLLA-------------RGLLAPVKRLVDGTHKLAAGDFTTRVTPTSEDE 219 LATL+AA A L+A V+ V H LA + P S + Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSFER 133 Query: 220 L-----------GKLAQDFNQLASTLEKNQQMR 241 L G L N+LA E+ QQMR Sbjct: 134 LYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLAYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDVPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 130 PQRELQQQDAESPLII 145 + + D++ + + Sbjct: 124 RRPSKLEDDSQDGMPL 139
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.2 bits (81), Expect = 5e-04 Identities = 53/268 (19%), Positives = 89/268 (33%), Gaps = 17/268 (6%) Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGAVLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLGY-ADISPTNIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILL 206 P + G SP + P A + L + FL K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLVTAAICYGF 288 R G ++ L+LG++ Y Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYIL 293 Score = 33.6 bits (77), Expect = 0.001 Identities = 32/153 (20%), Positives = 53/153 (34%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVTAAICYGFFIYGSADEYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYQEPVN 372 + V D R G ++ C GFG + G LGG+M F+ P Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGG--FSPHAP-- 162 Query: 373 GLTFNWSGMWTFGAVMIAIIAVLFMIFFRESDN 405 + A + + + ES Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186 Score = 28.6 bits (64), Expect = 0.048 Identities = 22/114 (19%), Positives = 43/114 (37%), Gaps = 4/114 (3%) Query: 7 LSFMMFVEWFIWGAWFVPLWLWL----SKSGFSAGEIGWSYACTAIAAILSPILVGSITD 62 ++ +M V + + VP LW+ + + A IG S A I L+ ++ Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271 Query: 63 RFFSAQKVLAVLMFAGAVLMYFAAQQTTFAGFFPLLLAYSLTYMPTIALTNSIA 116 ++ L + M A A T FP+++ 
+ + AL ++ Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 26.4 bits (58), Expect = 0.029 Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 6 KMLLGVLLLVTSAAWAAPATAGSTNTSGISKYE-LSSFIADF 46 ++L G LLL++S +WA ++K E L + DF Sbjct: 11 RVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 713 bits (1843), Expect = 0.0 Identities = 239/843 (28%), Positives = 389/843 (46%), Gaps = 35/843 (4%) Query: 2 LRMTPLASAI---VALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRL--DDNQPLPGQY 56 R+ + A +AE F+ F+ Q VA++ + + PG Y Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD--PQAVADLSRFENGQELPPGTY 78 Query: 57 DIDIYVNKQWRGKYEIIVKDNPQET----CLSREVIKRLGIN-----SDNFASGKQCLTF 107 +DIY+N + ++ E CL+R + +G+N N + C+ Sbjct: 79 RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPL 138 Query: 108 EQLVQGGSYSWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYVSQYYSDY 167 ++ + D+G RL+ ++PQA++ GY+PPE W+ GINA +Y S Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198 Query: 168 KASGNNKSTYVRFNSGLNLLEWQLHSDASFSKTNNNPGV-----WKSNTLYLERGFAQFL 222 + GN+ Y+ SGLN+ W+L + ++S +++ W+ +LER Sbjct: 199 RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLR 258 Query: 223 GTLRVGDMYTSSDIFDSVRFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGF 282 L +GD YT DIFD + F G +L D MLP+S++ F P + GIA+ A VTI+QNG+ Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318 Query: 283 VVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDF 342 +Y VPPGPF I D+ AG DL V++KEADGS + VPY++VP + + G ++Y Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378 Query: 343 AAGRSHIEGASKQSD-FVQAGYQYGFNNLLTLYGGTMVANNYYAFTLGTGWNT-RIGAIS 400 AG A ++ F Q+ +G T+YGGT +A+ Y AF G G N +GA+S Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438 Query: 401 VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNK 460 VD T+++S + DGQS + YNK ++++ T L +RYS+ Y F D ++ Sbjct: 439 VDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMN 498 Query: 461 DNYRRDENDIYDI----ADYYQNDFGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSG 516 ++ + + DYY + ++ ++Q L ++ LS + YWG S Sbjct: 499 GYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSN 557 Query: 517 SSKDYQLSYSNNWRRISYTLAASQAYGENHHE-EKRFNIFISIPCD--WGDDVTTPRRQI 573 + +Q + + I++TL+ S 
++ + ++IP D + R Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617 Query: 574 YMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGN---ETTAGANLTWNAPV 630 S S + D G +N G+ GT+ + +Y V + G+ +T A L + Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677 Query: 631 ATVNGSYSQSSTYRQTGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYR 690 N YS S +Q VSGG++A + GV L L++T ++ APG KDA V Q Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737 Query: 691 TTNRNGVVVYDGMTPYRENHLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKP 750 T+ G V T YREN + LD + +L P RGA+V F + Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA-RVGI 796 Query: 751 WFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEIPPSVNVAIDKQQGLSCT 810 + L + +PL FG V + G+V Q+++ + V V +++ C Sbjct: 797 KLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856 Query: 811 ITF 813 + Sbjct: 857 ANY 859
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.3 bits (63), Expect = 0.019 Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%) Query: 152 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 182 L + ++ + W++L ++ + R L++ Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 29.3 bits (66), Expect = 0.017 Identities = 32/127 (25%), Positives = 53/127 (41%), Gaps = 5/127 (3%) Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179 G EA+ ++ + +G + E+ K EI A E+ V GR +G R Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249 Query: 180 HIDWQAIGE-IRQRLNIPVIANGEIWDWQSAQQCMAISGCDAVMIGRGALNIPNLSRVVK 238 ++ I E +++ L V A + + IS V+ G GAL + NL R++ Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307 Query: 239 YNEPRMP 245 E +P Sbjct: 308 MEETGIP 314
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.0 bits (72), Expect = 0.006 Identities = 21/74 (28%), Positives = 28/74 (37%), Gaps = 17/74 (22%) Query: 24 PGVKALDNVNLKVRPHSIHALMGENGAGKSTLLKCLFGIYQKDSGTILFQGKEIDFHSAK 83 PG K D + L G G GKSTL+ L G+ F D + K Sbjct: 591 PGCKF-DYSVV---------LEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGK 633 Query: 84 EALENGISMVHQEL 97 ++ E +V EL Sbjct: 634 DSYEQIAGIVAYEL 647
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 30.5 bits (69), Expect = 0.008 Identities = 18/77 (23%), Positives = 27/77 (35%), Gaps = 8/77 (10%) Query: 4 IRDVARQAGVSVATVSRVLNNS------TLVSADTREAVMKAVSELDYRPNANAQALATQ 57 I + + +S V +LN + +S+D R+ A D R A + Sbjct: 249 ISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLALNVFAYR 308 Query: 58 VSDTIG--VVVMDVSDA 72 V TIG M D Sbjct: 309 VKKTIGSYAAAMGGVDV 325
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.8 bits (119), Expect = 8e-09 Identities = 74/381 (19%), Positives = 118/381 (30%), Gaps = 24/381 (6%) Query: 21 LIVAFLTGIAGALQTPTLSIFLTDEVHA--RPAMVGFFFTGSAVIGILVSQFLAGRSDKR 78 L L + L P L L D VH+ A G A++ + L SD+ Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70 Query: 79 GDRKSLIVFCCLLGVLACTLFAWNRNYFVLLFVGVFLSSFGSTANPQMFALAREHADKTG 138 G R+ +++ + + A +VL ++G ++ G T A A G Sbjct: 71 G-RRPVLLVSLAGAAVDYAIMATAPFLWVL-YIGRIVA--GITGATGAVAGAYIADITDG 126 Query: 139 REAVMFSSFLRAQVSLAWVIGPPLAYALAMGFSFTVMYLSAAVAFIVCGVMVWLFLPSMQ 198 E F+ A V GP L L GFS + +AA + + LP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESH 185 Query: 199 K-------ELPLATGTVEAPRRNRRDTLLLFVICTLMWGSNSLYIINMPLFIINELHLPE 251 K L R L + +M + +F + H Sbjct: 186 KGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245 Query: 252 KLAGVMMGTAAGLEIPT-MLIAGYFAKRLGKRFLMRVAAVGGVCFYAGMLMA-HSPAILL 309 G+ + L +I G A RLG+R + + + Y + A Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305 Query: 310 GLQLLNAIFIGILGGIGMLYFQDLMPGQAGSATTLYTNTSRVGWIIAGSVAG--IVAEIW 367 + LL GGIGM Q ++ Q S S+ G + I+ Sbjct: 306 IMVLL------ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359 Query: 368 NYHAVFWFAMVMIIATLFWVM 388 W I +++ Sbjct: 360 AASITTWNGWAWIAGAALYLL 380 Score = 40.2 bits (94), Expect = 1e-05 Identities = 18/101 (17%), Positives = 35/101 (34%) Query: 19 AFLIVAFLTGIAGALQTPTLSIFLTDEVHARPAMVGFFFTGSAVIGILVSQFLAGRSDKR 78 A + V F+ + G + IF D H +G ++ L + G R Sbjct: 214 ALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAAR 273 Query: 79 GDRKSLIVFCCLLGVLACTLFAWNRNYFVLLFVGVFLSSFG 119 + ++ + L A+ ++ + V L+S G Sbjct: 274 LGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG 314
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 61.8 bits (150), Expect = 1e-12 Identities = 79/408 (19%), Positives = 152/408 (37%), Gaps = 59/408 (14%) Query: 11 IVFILGLLAMLMPLSIDMYLPALPVISAQFGVSAGSTQMTLSTYILGFALGQLIYGPMAD 70 I+ L +L+ L+ + +LP I+ F ST + ++L F++G +YG ++D Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 71 SFGRKPVVLGGTLVFAAAAVACALAQTIDQLIVM-RFFHGLAAAAASVVINALMRDIYPK 129 G K ++L G ++ +V + + L++M RF G AAA ++ ++ PK Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 130 EEFSRMMSFVMLVTTIALLMAPIVGGWVLVWLSWHYIFWILALAAILASAMIFFLIKETL 189 E + + + + + P +GG + ++ W Y+ I + I + FL+K Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITII----TVPFLMKLLK 190 Query: 190 PPERR-QPFHIRTTIGNFAA---------------------------------------- 208 R F I+ I Sbjct: 191 KEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDP 250 Query: 209 -LFRHKRVLSYMLASGFSFAGMFSFLSAGPFVYIEINHVAPENFGYYFAL-NIVFLFVMT 266 L ++ + +L G F + F+S P++ +++ ++ G + + + Sbjct: 251 GLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG 310 Query: 267 IFNSRFVRRIGALNMFRSG---LWIQFIMAAWMVISAPLGLGFWSLVVGVAAFVGCVSMV 323 V R G L + G L + F+ A++++ + + ++ V G Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW----FMTIIIVFVLGGLSFTK 366 Query: 324 SSNAMAVILDEFPHMAGTASSLAGTFRF---GIG-AIVGALLSLATFN 367 + + V AG SL F G G AIVG LLS+ + Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.025 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 6/70 (8%) Query: 503 LHVSTPASEYSQGQ-DLF---NPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNL 558 L V+ E + + LF QR H V+ +T+ + K L N NG+Y YN Sbjct: 926 LQVADKTGEPNHNELTLFDASKAQRDHLNVSLV-GNTVDLGAWKYKLR-NVNGRYDLYNP 983 Query: 559 RGERVKDEKP 568 E+ Sbjct: 984 EVEKRNQTVD 993
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 204 bits (520), Expect = 4e-57 Identities = 139/526 (26%), Positives = 220/526 (41%), Gaps = 59/526 (11%) Query: 350 APAMLVGKVVVSEGASFRTHGAVDTSKADVSLENSAWTIIADITTTNQNTRLNLANLAMS 409 P +G + V+ + R GA + +S++N+ W + + N+ L ++ Sbjct: 405 IPGTSIGPLDVALASQARWTGATRAVDS-LSIDNATWVMTDNS---------NVGALRLA 454 Query: 410 GANVIMMAEPVTRSSVTASAENFITLTTNTLSGNGNFYMRTDMANHQSDQLNVTGQATGD 469 + +P A A F LT NTL+G+G F M SD+L V A+G Sbjct: 455 SDGSVDFQQP-------AEAGRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQ 507 Query: 470 FKIFVTDTGASPAAGDSLTLVTT-GGGDAAFTLGNAGGVVDIGTYEYTLLDNGNHSWSLA 528 +++V ++G+ PA+ ++L LV T G A FTL N G VDIGTY Y L NGN WSL Sbjct: 508 HRLWVRNSGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLV 567 Query: 529 ENRAQITPSTTDVLNMAAAQPL-----------------------------------VFD 553 +A P QP ++ Sbjct: 568 GAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWY 627 Query: 554 AELDTVRERLGSVKGVSYDTVMWSSAINTRNNVTTDAGAGFEQTLTGLTLGIDSRFSREE 613 AE + + +RLG ++ W R + AG F+Q + G LG D + Sbjct: 628 AESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAG 687 Query: 614 SSTIRGLFFGYSHSDIGFDRGGKGNIDSYTLGAYAGWEHQNGAYVDGVVKVDRFANTIHG 673 G GY+ D GF G G+ DS +G YA + +G Y+D ++ R N Sbjct: 688 GRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLENDFKV 747 Query: 674 KMSNGATAFGDYNSNGAGAHVESGFRW-VDGLWSVRPYLAFTGFTTDGQDYTLSNGMR-- 730 S+G G Y ++G GA +E+G R+ W + P F G Y +NG+R Sbjct: 748 AGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVR 807 Query: 731 ADVGNTRILRAEAGTAVSYHMDLQNGTTLEPWLKAAVRQEYADSNQVKVNDDGKFNNDVA 790 + G++ + R G V ++L G ++P++KA+V QE+ + V N ++ Sbjct: 808 DEGGSSVLGR--LGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELR 864 Query: 791 GTSGVYQAGIRSSFTPTLSGHLSVSYGNGAGVESPWNTQAGVVWTF 836 GT G+ ++ S + S Y G + PW AG +++ Sbjct: 865 GTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.9 bits (153), Expect = 7e-14 Identities = 22/114 (19%), Positives = 46/114 (40%), Gaps = 2/114 (1%) Query: 9 VMIVDDHPLMRRGVRQLLELDPGFEVVAEAGEGASAIDLANRLDIDVILLDLNMKGMSGL 68 +++ DD +R + Q L G++V A+ D D+++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIRT 122 D L +++ +++++ + + GA YL K D L+ I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 28.6 bits (64), Expect = 0.037 Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 3/61 (4%) Query: 97 PVMVDVDRDTLMVT-PEAIESAIT-PRTKAIIP-VHYAGAPADIDAIRAIGERYGIAVIE 153 + +V R +++V P I I R + +P V + A + +R I E G+ +++ Sbjct: 249 NMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQ 308 Query: 154 D 154 Sbjct: 309 R 309
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 114 bits (288), Expect = 4e-30 Identities = 73/361 (20%), Positives = 136/361 (37%), Gaps = 60/361 (16%) Query: 317 RVLILGVNGFIGNHLTERLLREDHYEVYGLDIGSD--------AISRFLNHPHFHFVEGD 368 + L+ G GFIG H+++RLL H +V G+D +D A L P F F + D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 369 ISIHSEWIE--YHVKKCDVVLPLVAIATPIEYT-RNPLRVFELDFEENLRIIRYCVKYR- 424 ++ E + + + V + Y+ NP + + L I+ C + Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118 Query: 425 KRIIFPSTSEVYGMCSDKYFDEDHSNLIVGPVNKPRWIYSVSKQLLDRVIWAYGEKEGLQ 484 + +++ S+S VYG+ F D V+ P +Y+ +K+ + + Y GL Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDD------SVDHPVSLYAATKKANELMAHTYSHLYGLP 172 Query: 485 FTLFLPFNWMGPRLDNLNAARIGSSRAITQLILNLVEGSPIKLIDGGKQKRCFTDIRDGI 544 T F GP A+ + ++EG I + + GK KR FT I D Sbjct: 173 ATGLRFFTVYGPWGR--------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 545 EALYRIIEN---------------AGNRCDGEIINIGNPENEASIEELGEMLLASFEKHP 589 EA+ R+ + A + + NIGN + + + L + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283 Query: 590 LRHHFPPFAGFRVVESSCYYGKGYQDVEHRKPSIRNAHRCLDWEPKIDMQETIDETLDFF 649 ++ P G DV + + + + P+ +++ + ++++ Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328 Query: 650 L 650 Sbjct: 329 R 329
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 31.9 bits (72), Expect = 2e-04 Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 1/38 (2%) Query: 34 KHIVLWLGLALACLGLAMMLWLLVL-QNVPVSPRHWCG 70 + I+ +L + L C LAM+ W + L N PVS Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITP 52
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 34.8 bits (80), Expect = 6e-05 Identities = 14/79 (17%), Positives = 32/79 (40%), Gaps = 12/79 (15%) Query: 1 MIEWQDLHHSELSVSQLYALLQLRCAVFV--------VEQNCPYQDIDGDDLTGDNRHIL 52 M+E D++H+ LS ++ L LR F + D + + ++ Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56 Query: 53 GWKNDELVAYARILKSDDD 71 G K++ ++ R +++ Sbjct: 57 GIKDNTVICSLRFIETKYP 75
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 28.3 bits (63), Expect = 0.039 Identities = 21/63 (33%), Positives = 31/63 (49%), Gaps = 6/63 (9%) Query: 204 VRNIVGSLMEV-GAHNQPESWIAELLAAKDRTLAAATAKAEGLYLVAVDYPDRYDLPKPP 262 +NI GS + V + PES + E AA LAA +K++ V VDY + ++ KP Sbjct: 496 TKNIPGSHVIVKNIMDIPESTLLE--AAN---LAAYYSKSQNSSNVPVDYTEVKNVKKPN 550 Query: 263 MGP 265 Sbjct: 551 GAK 553
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 80/366 (21%), Positives = 132/366 (36%), Gaps = 42/366 (11%) Query: 16 SLFRISFAVFLTYMTVGLPLPVIPLFVHHDLGYGNTM--VGIAVGIQFLATVLTRGYAGR 73 L I V L + +GL +PV+P + + + GI + + L G Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 74 LADQYGAKRSALQGMLACGLAGGAL--LLAAILPVSAPFKFALLVIGRLILGFGESQLLT 131 L+D++G + +L LAG A+ + A P +L IGR++ G +T Sbjct: 66 LSDRFGRRP-----VLLVSLAGAAVDYAIMATAPF-----LWVLYIGRIVAG------IT 109 Query: 132 GALTWGLG-----IVGPKHSGKVMSWNGMAIYGALAVGAPLGLL---IHSHYGF---AAL 180 GA G I + + + G LG L H F AAL Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAAL 169 Query: 181 AITT--MVLPLLAWACNGTVRKVPALAGERPSLWSVV----GLIWKPGLGLALQGVGFAV 234 LL + G R + A + + + + +Q VG Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229 Query: 235 IGTFVSLYFASKGW--AMAGFTLTAFGGAFVVMRVM-FGWMPDRFGGVKVAIVSLLVETV 291 +V W G +L AFG + + M G + R G + ++ ++ + Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289 Query: 292 GLLLLWQAPGAWVALAGAALTGAGCSLIFPALGVEVVKRVPSQVRGTALGGYAAFQDIAL 351 G +LL A W+A L +G + PAL + ++V + +G G AA + Sbjct: 290 GYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT- 347 Query: 352 GVSGPL 357 + GPL Sbjct: 348 SIVGPL 353 Score = 31.7 bits (72), Expect = 0.005 Identities = 35/142 (24%), Positives = 49/142 (34%), Gaps = 8/142 (5%) Query: 252 GFTLTAFGGAFVVMRVMFGWMPDRFGGVKVAIVSLLVETVGLLLLWQAPGAWVALAG--- 308 G L + + G + DRFG V +VSL V ++ AP WV G Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 309 AALTGAGCSLIFPALGVEVVKRVPSQVRGTALGGYAAFQDIALGVSGPLAGMLATTFGYS 368 A +TGA G + R G +A + V+GP+ G L F Sbjct: 106 AGITGA----TGAVAGAYIADITDGDERARHFGFMSACFGFGM-VAGPVLGGLMGGFSPH 160 Query: 369 SVFLAGAISAVLGIIVTILSFR 390 + F A A L + Sbjct: 161 APFFAAAALNGLNFLTGCFLLP 182
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 32.7 bits (74), Expect = 4e-04 Identities = 49/182 (26%), Positives = 75/182 (41%), Gaps = 19/182 (10%) Query: 1 MKKKRTLFFISSL-MLLGSGTTIAGDNLHFTGNLISKSCTPVINGSQLAEVHFPAIAASD 59 MKK R L L +L S A DNL F G LI +CT Q AEV++ I + Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACT-----VQNAEVNWGDIEIQN 55 Query: 60 LMNLGQSERVPLVFQLKDCHSSTLFNVKVTLTGTEDSALPGFLAFDSSSSASGAGIGIET 119 L+ G +++ + +S V +T G ++ + ++S+ASG G+ I Sbjct: 56 LVQSGGNQK-DFTVDMNCPYSLGTMKVTITSNGQTGNS----ILVPNTSTASGDGLLIYL 110 Query: 120 AAGTSVPINNTTGVTLPLNQGN---NSLNFNTWLQAKSG-----RDVTSGDFSATVTATF 171 + I N + + G + L AK G + + +G FSAT T Sbjct: 111 YNSNNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVA 170 Query: 172 EY 173 Y Sbjct: 171 SY 172
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 42.8 bits (100), Expect = 8e-08 Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 21/171 (12%) Query: 1 MKRISL---ILLWGFCSMALSNVSFHGYLVQPPNCTISNAQTIEITFQDVLIDDINGSNY 57 M R+SL +LL +A ++ G + PP CTI+N Q I + F ++ + ++ S Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPP-CTINNGQNIVVDFGNINPEHVDNSRG 59 Query: 58 EQTVPYSITCDTAVRDPLMEMTLSWSGTPSDFDNAAVSSNITGLGIQLKQ---------- 107 E T SI+C +++T + G N +++NIT GI L Q Sbjct: 60 EVTKNISISCPYKSGSLWIKVTGNTMGVGQ---NNVLATNITHFGIALYQGKGMSTPLTL 116 Query: 108 ---AGQSFTINTPLVVNETDLPVLTAVPVKKSGVILPEADFEAWATLQVDY 155 +G + + L + T+VP + IL DF A++ + Y Sbjct: 117 GNGSGNGYRVTAGLDTARSTF-TFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 46.9 bits (111), Expect = 3e-08 Identities = 28/123 (22%), Positives = 42/123 (34%), Gaps = 1/123 (0%) Query: 68 VHRVNHAPANAQEHEAARPSPQHQYLPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPVQQ 127 VH+V PA AQ +P P P V+ PE + P P+ A V + Sbjct: 37 VHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIP-EPPKEAPVVIE 95 Query: 128 PAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDK 187 +P Q +PV S P + PA P ++ ++P + Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155 Query: 188 PKR 190 R Sbjct: 156 GPR 158 Score = 29.6 bits (66), Expect = 0.017 Identities = 21/79 (26%), Positives = 27/79 (34%), Gaps = 1/79 (1%) Query: 125 VQQPAYQPQPEQPLQ-QPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAP 183 V Q P P QP+ V+P PQ V P+P + EP+ P E Sbjct: 37 VHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEK 96 Query: 184 VMDKPKRKEAVIIMNVAAH 202 KPK K + Sbjct: 97 PKPKPKPKPKPVKKVEQPK 115
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 748 bits (1933), Expect = 0.0 Identities = 276/571 (48%), Positives = 386/571 (67%), Gaps = 2/571 (0%) Query: 1 MISGILASPGIAFGKALLLKEDEIVIDRKKISADQVDQEVERFLSGRAKASAQLETIKTK 60 I+GI AS G+A KA + E + I++ I V E+E+ + K+ +L IK + Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 61 AGETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAAHEVIEGQASALEELDD 120 + G +K IF H+++L+D EL I I+++ M A+ A EV + S E +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 121 EYLKERAADVRDIGKRLLRNILGLKIIDLSAIQDEVILVAADLTPSETAQLNLKKVLGFI 180 EY+KERAAD+RD+ KR+L +++G++ L+ I +E +++A DLTPS+TAQLN + V GF Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 181 TDAGGRTSHTSIMARSLELPAIVGTGSVTSQVKNDDYLILDAVNNQVYVNPTNEVIDKMR 240 TD GGRTSH++IM+RSLE+PA+VGT VT ++++ D +I+D + V VNPT E + Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 241 AVQEQVASEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFL 300 + +K E AKL P+ T DG VE+ ANIGT +DV+G NG EG+GLYRTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 301 FMDRDALPTEEEQFAAYKAVAEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAI 360 +MDRD LPTEEEQF AYK V + + V++RT+DIGGDKEL Y+ PKE NPFLG+RAI Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 361 RIAMDRKEILRDQLRAILRASAFGKLRIMFPMIISVEEVRALRKEIEIYKQELRDEGKAF 420 R+ +++++I R QLRA+LRAS +G L++MFPMI ++EE+R + ++ K +L EG Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 421 DESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS 480 +SIE+G+MVE P+ A A AKEVDFFSIGTNDL QYT+A DR N+ +S+LYQP P+ Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 481 VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540 +L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + + Sbjct: 482 
ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 541 NFEDAKVLAEQALAQPTTDELMTLVNKFIEE 571 + E+ K A++AL T +E+ LV K + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 7e-04 Identities = 11/33 (33%), Positives = 16/33 (48%) Query: 30 MVALLGPSGSGKTTLLRIIAGLEHQTSGHIRFH 62 V L G G GK+TL+ + GL+ + H Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 153 bits (387), Expect = 1e-47 Identities = 96/255 (37%), Positives = 136/255 (53%), Gaps = 4/255 (1%) Query: 4 LTGKTALITGALQGIGEGIARTFARHGANLILLDISPE-IEKLADELCGRGHRCTAVVAD 62 + GK A ITGA QGIGE +ART A GA++ +D +PE +EK+ L A AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 63 VRDPASVAAAIKRAKEKEGRIDILVNNAGVCRLGSFLDMSDEDRDFHIDINIKGVWNVTK 122 VRD A++ R + + G IDILVN AGV R G +SDE+ + +N GV+N ++ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 123 AVLPEMIARKDGRIVMMSSVTGDMVADPGETAYALTKAAIVGLTKSLAVEYAQSGIRVNA 182 +V M+ R+ G IV + S V AYA +KAA V TK L +E A+ IR N Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAG-VPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 183 ICPGYVRTPMAESIARQSNPEDP--ESVLTEMAKAIPLCRLADPLEVGELAAFLASDESS 240 + PG T M S+ N + + L IPL +LA P ++ + FL S ++ Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 241 YLTGTQNVIDGGSTL 255 ++T +DGG+TL Sbjct: 245 HITMHNLCVDGGATL 259
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.5 bits (71), Expect = 6e-04 Identities = 15/102 (14%), Positives = 38/102 (37%), Gaps = 4/102 (3%) Query: 24 LRPWNDPEMDIERKMNHDVSLFLVAEVNGEVVG--TVMGGYDGHRGSAYYLGVHPEFRGR 81 + + D +MD+ + FL + +G + ++G + V ++R + Sbjct: 47 FKQYEDDDMDVSYVEEEGKAAFL-YYLENNCIGRIKIRSNWNG-YALIEDIAVAKDYRKK 104 Query: 82 GIANALLNRLEKKLIARGCPKIQINVPEDNDMVLGMYERLGY 123 G+ ALL++ + + + + N Y + + Sbjct: 105 GVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 32.1 bits (72), Expect = 0.002 Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 13/104 (12%) Query: 41 QGYYAGVRQGVQDAAKDSSVQVQLIETNAQGDISKESTFVDTLVARNVDAIILSAVSENG 100 QGY G+ QG++ ++ Q I Q +S+ T +D L D++I Sbjct: 70 QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDAL-----DSVI-------- 116 Query: 101 SSRTVRRASEAGIPVICYNTCINQKGVDKYVSAYLVGDPLEFGK 144 +SR ++ A EA VI ++ + K + L +PL GK Sbjct: 117 ASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGK 160
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 34.7 bits (79), Expect = 0.002 Identities = 26/66 (39%), Positives = 34/66 (51%), Gaps = 13/66 (19%) Query: 99 MTGFTLRPDRAALEIASRVYNGNATPRH--FLWW-ANPAVKGGEGHQSVFPPDVTAVFDH 155 + G DR+A+ S+V GNATP +LW A PAV Q +F T VFD+ Sbjct: 218 LNGNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAV------QWLF----TLVFDY 267 Query: 156 GKRAVS 161 G+R V Sbjct: 268 GERGVD 273
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 472 bits (1217), Expect = e-167 Identities = 164/480 (34%), Positives = 248/480 (51%), Gaps = 42/480 (8%) Query: 6 AHLLLVDDDLGLLKLLGLRLTSEGYSVVTAESGAEGLRVLNREKVDLVISDLRMDEMDGM 65 A +L+ DDD + +L L+ GY V + A R + DLV++D+ M + + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 66 QLFAEIQKVQPGMPVIILTAHGSIPDAVAATQQGVFSFLTKPVDKDALYQAIDDALE--- 122 L I+K +P +PV++++A + A+ A+++G + +L KP D L I AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 123 --QSAPATDERWREAIVTRSPLMLRLLEQARLVAQSDVSVLINGQSGTGKEIFAQAIHNA 180 S D + +V RS M + + Q+D++++I G+SGTGKE+ A+A+H+ Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 181 SPRNSKPFIAINCGALPEQLLESELFGHARGAFTGAVSNREGLFQAAEGGTLFLDEIGDM 240 R + PF+AIN A+P L+ESELFGH +GAFTGA + G F+ AEGGTLFLDEIGDM Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 241 PAPLQVKLLRVLQERKVRPLGSNRDIDINVRIISATHRDLSKAMARGEFREDLYYRLNVV 300 P Q +LLRVLQ+ + +G I +VRI++AT++DL +++ +G FREDLYYRLNVV Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 301 SLKIPALAERTEDIPLLANHLLRQAAERHKPFVRAFSTDAMKRLMTASWPGNVRQLVNVI 360 L++P L +R EDIP L H ++QA + V+ F +A++ + WPGNVR+L N++ Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 361 EQCVALTSSPVISDALVEQALEGENTALPT------------------------------ 390 + AL VI+ ++E L E P Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 391 ------FVEARNHFELNYLRKLLQITKGNVTHAARMAGRNRTEFYKLLSRHELDANDFKE 444 + E + L T+GN AA + G NR K + + Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSR 482
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 50.6 bits (121), Expect = 2e-08 Identities = 51/267 (19%), Positives = 95/267 (35%), Gaps = 52/267 (19%) Query: 552 MESEREKLLRMEQELHHRVIGQNEAVDAVSNAIRRSRAGLADPNRPIGSFLFLGPTGVGK 611 R L + + ++G++ A+ + + R L + + + G +G GK Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR----LMQTDLTL---MITGESGTGK 173 Query: 612 TELCKALANFMFDSDEAMVRIDMSEFMEKHSVSRLVGAPPGYVGYEEGGYLTEAVRRRPY 671 + +AL ++ + V I+M+ S L G+E+G + T A R Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESEL-------FGHEKGAF-TGAQTRSTG 225 Query: 672 SV-------ILLDEVEKAHPDVFNILLQVLDDG---RLTDGQGRTVDFRNTVVIMTSNLG 721 + LDE+ D LL+VL G + D R ++ +N Sbjct: 226 RFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATN-- 280 Query: 722 SDLIQERFGELDYAHMKELVLGVVSHNFRPEFINRIDEVVVFHP-LGE--QHIASIAQIQ 778 DL Q + FR + R++ V + P L + + I + + Sbjct: 281 KDLKQS----------------INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHF 324 Query: 779 LKRLYKRLEERGYEIHISDEALKLLSE 805 +++ K E EAL+L+ Sbjct: 325 VQQAEK---EGLDVKRFDQEALELMKA 348 Score = 40.6 bits (95), Expect = 3e-05 Identities = 46/225 (20%), Positives = 76/225 (33%), Gaps = 43/225 (19%) Query: 112 VLAALESRGTLADILKATGATTANITQAIEQMRGGES-------VNDQGAEDQRQALKKY 164 +L ++ +L + T AI+ G + +AL + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNT--FMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 165 TIDLTERAEQG-KLDPVIGRDEEIRRTIQVLQRRTKNN-PVLI-GEPGVGKTAIVEGLAQ 221 ++ + P++GR ++ +VL R + + ++I GE G GK + L Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182 Query: 222 R----------IINGEVPEGLKGRRVLALDMGALV-AGAKYRGEFEERLKGVLNDLAKQE 270 I +P L + + GA A + G FE+ G Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT-------- 234 Query: 271 GNVILFIDELHTMVGAGKADGAMDAGNMLKPALARGELHCVGATT 315 LF+DE+ M MDA L L +GE VG T Sbjct: 235 ----LFLDEIGDM--------PMDAQTRLLRVLQQGEYTTVGGRT 267
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 82.4 bits (203), Expect = 8e-21 Identities = 64/256 (25%), Positives = 116/256 (45%), Gaps = 5/256 (1%) Query: 3 QVAVVIGGGQTLGAFLCHGLAAEGYRVAVVDIQSDKAANVAQEINAEYGEGMAYGFGADA 62 ++A + G Q +G + LA++G +A VD +K V + AE A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEQSVLALSRGVDEIFGRVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122 ++ ++ ++ G +D+LV AG+ + I +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSRMFQSLLPQYATKLGIKPDQVEQYYIDKVPLKRGCDYQDVLNMLLFYASPKA 242 G+ + + + IK +E + +PLK+ D+ + +LF S +A Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQA 243 Query: 243 SYCTGQSINVTGGQVM 258 + T ++ V GG + Sbjct: 244 GHITMHNLCVDGGATL 259
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 28.7 bits (64), Expect = 0.014 Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 17/105 (16%) Query: 1 MKPRQRQAAILEYLQKQGKCSVEEL-----AQYFDTTGTTIRKDLVILEHAGTVIRTYGG 55 M QR I E + + +EL ++ T T+ +D+ E + T G Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIK--ELHLVKVPTNNG 58 Query: 56 ---VVLNKEESDPPIDHKTLINTHKKELIAEAAVSFIHDGDSIIL 97 L ++ P+ K + +A V I+L Sbjct: 59 SYKYSLPADQRFNPLS-------KLKRSLMDAFVKIDSASHLIVL 96
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 373 bits (959), Expect = e-127 Identities = 125/388 (32%), Positives = 194/388 (50%), Gaps = 33/388 (8%) Query: 149 IAALAAGALS----------NALLIEQLESQNMLPGDATPFEAVKQTQMIGLSPGMTQLK 198 I A GA +I + ++ ++ ++G S M ++ Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150 Query: 199 KEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLVYLNCAALPESVAESELFG 258 + + + +DL ++I+GE+GTGKELVA+A+H+ R P V +N AA+P + ESELFG Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210 Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318 H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270 Query: 319 VDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRL 378 DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ L +F +Q Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329 Query: 379 RQGLSRVVLSAGARNLLQHYSFPGNVRELEHAIHRAVVLARATRNGDEVIL-----EAQH 433 ++GL A L++ + +PGNVRELE+ + R L E+I E Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389 Query: 434 FAFPEVTLPPPEAAAVLVVKQNLR-----------------EATEAFQRETIRQALAQNH 476 + + V++N+R + I AL Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449 Query: 477 HNWAACARMLETDVANLHRLAKRLGLKD 504 N A +L + L + + LG+ Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGVSV 477
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 53.7 bits (129), Expect = 3e-14 Identities = 17/50 (34%), Positives = 28/50 (56%) Query: 1 MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK 50 + + ++++C T+L FT + SLCE+ R+ E A +AYES K Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>PF07675#Cleaved Adhesin Length = 1358 Score = 30.4 bits (68), Expect = 0.021 Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 12/92 (13%) Query: 206 ILGQTYLPRKFKTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIE 260 ++ +P+ T +P PQN + A+ ++VAI+++G L G + G++ Sbjct: 240 VMPYRAMPKT--NTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATV 297 Query: 261 HGNK-----KTYARTASEFGYLPLEHTLAVAE 287 + K Y + YLP+ + E Sbjct: 298 NMTKQITENGNYDVVITRSNYLPVIKQIQAGE 329
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.001 Identities = 27/137 (19%), Positives = 55/137 (40%), Gaps = 11/137 (8%) Query: 69 LGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 127 +G+ V G +SD +G +++ F ++ S + F + LI R + G G ++ Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123 Query: 128 GHTLLAEFSPRRHRGILLGAFSVVWT----VGYVLASIAGHHFISENPEAWRWLLASAAL 183 ++A + P+ +RG G + VG + + H+ W +LL + Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177 Query: 184 PALLITLLRWGTPESPR 200 + + L + R Sbjct: 178 TIITVPFLMKLLKKEVR 194
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 28.2 bits (63), Expect = 0.040 Identities = 34/154 (22%), Positives = 60/154 (38%), Gaps = 16/154 (10%) Query: 34 ALAI--IIVGLIIARMISNAVNRLMISRKIDATVADFLSALVRYGIIAFTLIAALGRVGV 91 AL I I II ++++ + RL +K ++ RY +A ++ G V Sbjct: 76 ALGIMPYITASIILQLLTVVIPRLEALKKEGQAGTAKITQYTRYLTVALAILQGTGL--V 133 Query: 92 QTASVIAVLGAAGLAVGLALQGSLSN-------LAAGVLLVMFRPFRAGEYVDLGGVAGT 144 TA + G + + S+ + AG +VM+ GE + G+ G Sbjct: 134 ATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMW----LGELITDRGI-GN 188 Query: 145 VLSVQIFSTTMRTADGKIIVIPNGKIIAGNIINF 178 +S+ +F + T + I +AG I F Sbjct: 189 GMSILMFISIAATFPSALWAIKKQGTLAGGWIEF 222
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.025 Identities = 13/41 (31%), Positives = 16/41 (39%), Gaps = 12/41 (29%) Query: 12 PGAATDCLCDISLQLKQGEWLALTGDNGAGKSTLLRVMAGL 52 PG D + L G G GKSTL+ + GL Sbjct: 591 PGCKFDYS------------VVLEGTGGIGKSTLINTLVGL 619
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.028 Identities = 36/239 (15%), Positives = 76/239 (31%), Gaps = 18/239 (7%) Query: 158 SHMQLYIGAALSAILVLFTLTLPHIPVAKQQANQSWTTLLGLDAFALFKNKRMAIFFIFS 217 H + AAL+ + L L + + L+ A F+ R Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPES---HKGERRPLRREALNPLASFRWARGMTVVAAL 215 Query: 218 MLLGAELQITNMFGNTFLHSFDKDPMFASSFIVQHASIIMSISQISETLF-ILTIPFFLS 276 M + +Q+ F +D + + I ++ I +L + + Sbjct: 216 MAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI---GISLAAFGILHSLAQAMITGPVAA 272 Query: 277 RYGIKNVMMISIVAWILRFALFAYGDPTPFGTVLLVLSMIVYGCAFDFFNISGSVFVEKE 336 R G + +M+ ++A + L A+ + ++V + + + ++ Sbjct: 273 RLGERRALMLGMIADGTGYILLAF-----ATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327 Query: 337 VSPAIRASAQGMFLMMTNGFGCILGGIVSGKVVEMYTQNGITDWQ-TVWLIFAGYSVVL 394 V + QG +T+ L IV + IT W W+ A ++ Sbjct: 328 VDEERQGQLQGSLAALTS-----LTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 68.0 bits (166), Expect = 1e-13 Identities = 73/434 (16%), Positives = 142/434 (32%), Gaps = 32/434 (7%) Query: 232 NSRVDAYRNEQLLGSFYLNSGSQFIDTSSFPPGSYSVALKVYENNQLTRTELVPFTKTGG 291 ++V +N + + + G I+ S + + + E + T+ VP++ Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367 Query: 292 LT-DGNAQWFLQAGKTTSQVS-DDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGN 349 L +G+ ++ + AG+ S + ++ +Q + L + +Y G AD AF G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 350 NWTADLGRVG--NLAISASVFRNDDGGKGDMQQANWS-NPGWPTLGF------YRTNSDG 400 ++G +G ++ ++ + D + D Q + N G YR ++ G Sbjct: 428 GK--NMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSG 485 Query: 401 ---DACTTDSRESYNALSCYESISATVSLNFVGWNMMLGYTCTQNNTDDSLRWDKQQSFE 457 A TT SR + + + + V F + + + + + + + Sbjct: 486 YFNFADTTYSRMNGYNIETQDGV-IQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLY 544 Query: 458 NNYLRQTTAQSISETVQLSASRAIVMRDWILSTSVGVFHRNDNGGDNDDNGLYLSFS--L 515 + QT + + Q A D ++ ++ + D L L+ + Sbjct: 545 LSGSHQTYWGTSNVDEQFQAGLNTAFED--INWTLSYSLTKNAWQKGRDQMLALNVNIPF 602 Query: 516 SDTPTMDSNNNSHSTNVPTDYRYSEQDGDQTSWQLSHTFYNDSFSHKEL--GVTVGGLNT 573 S DS + + + + T D+ + G GG Sbjct: 603 SHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGN 662 Query: 574 DTINSAVNGRWDGQYGNVYATVSDSYDRKNHDHLSAFTGTYSSTLAVSRYGVNLGASGTD 633 + G YGN S S + L S + GV LG D Sbjct: 663 SGSTGYATLNYRGGYGNANIGYSHS---DDIKQLYY---GVSGGVLAHANGVTLGQPLND 716 Query: 634 DLLGAVLVDVKGFS 647 VLV G Sbjct: 717 ---TVVLVKAPGAK 727 Score = 31.0 bits (70), Expect = 0.023 Identities = 40/222 (18%), Positives = 68/222 (30%), Gaps = 35/222 (15%) Query: 246 SFYLNSGSQFIDTSSF------PPGSYSVALKVYENNQLTRTELVPFTKTGGLTDGNAQW 299 F + D S F PPG+Y V + + NN T V F Sbjct: 52 RFLADDPQAVADLSRFENGQELPPGTYRVDIYL--NNGYMATRDVTFNTGDSEQG----- 104 Query: 300 FLQAGKTTSQVSDDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGNNWTADLGRVG 359 + T +Q++ +G+ L A A S D+G+ Sbjct: 105 
-IVPCLTRAQLA-------SMGLNTASVSGMNLLADDACVPLTSMIH-DATAQLDVGQQR 155 Query: 360 -NLAISASVFRNDDGGKGDMQQANWSNPGWPTLGFYRTNSDGDACTTDSRESYNALSCYE 418 NL I + N +G + W L Y + + +R N+ Y Sbjct: 156 LNLTIPQAFMSNRA--RGYIPPELWDPGINAGLLNYNFS----GNSVQNRIGGNSHYAY- 208 Query: 419 SISATVSLNFVGW----NMMLGYTCTQNNTDDSLRWDKQQSF 456 ++ LN W N Y + +++ +W ++ Sbjct: 209 -LNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 52.7 bits (126), Expect = 8e-10 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.1 bits (62), Expect = 0.045 Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%) Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62 + GAA GIG+A+A L G+ ++ D P V S A + F + Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66 Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104 A + D+++ AGV R + F+VN+ V N + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 105 VAKNCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146 V+K + + + +NP ++AA KA K G+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 169 bits (430), Expect = 4e-57 Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%) Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74 + ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+ Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69 Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131 S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128 Query: 132 FTTPANGFTVKDLYEAILELF 152 K + + ILEL Sbjct: 129 LIICRTHDDTKVVQKKILELL 149
>TETREPRESSOR#Tetracycline repressor protein signature. Length = 218 Score = 28.7 bits (64), Expect = 0.004 Identities = 12/58 (20%), Positives = 25/58 (43%), Gaps = 5/58 (8%) Query: 1 METRLNLLCEAGVIDKDVCKGMMQVVN-----VLEKECHLPVRSEQGTMAMTHMASAL 53 +ET+L + E G +D + V + VLE++ H +++ ++ L Sbjct: 113 VETQLRFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPAAPDENLPPLL 170
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.7 bits (100), Expect = 3e-06 Identities = 39/199 (19%), Positives = 70/199 (35%), Gaps = 19/199 (9%) Query: 143 DLAGNATDQANGVQPAPGTTSAENTQQDVSL-----------------PPISSTPTQGQT 185 DL ++ N T+ N Q DV PP +TP++ Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038 Query: 186 PVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRDTAKT 245 VA + +Q + T+ Q + S + T ++G+ +++T T Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 246 QTAERPSTTRPVRQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPA 305 +T E + + + + E + V ++ P + + +P A A P Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158 Query: 306 PKETATTAPVQTASPAQTT 324 +T TTA T PA+ T Sbjct: 1159 QSQTNTTA--DTEQPAKET 1175 Score = 41.2 bits (96), Expect = 9e-06 Identities = 41/203 (20%), Positives = 68/203 (33%), Gaps = 10/203 (4%) Query: 126 APSTTSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAENTQQDVSLPPISST-PTQGQ 184 P+ +D + + ++A D+A PAP T S + S T Q Sbjct: 999 TPNNIQADVPSVPSNNEEIA--RVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056 Query: 185 TPVATDGQQRVEVQGDLNNALTQPQN----QQQLNNVAVNSTLPTEPATVAPVRNGNASR 240 T Q R + +N Q Q +T E ATV + A Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE--KEEKAKV 1114 Query: 241 DTAKTQTAERPSTTRPVRQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAA 300 +T KTQ + ++ +Q+ E +PQA E P + A T+ PA Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQS-ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173 Query: 301 TSTPAPKETATTAPVQTASPAQT 323 ++ ++ T + + Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVV 1196
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 32.1 bits (73), Expect = 8e-04 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 32 FYDSDQEIEKRTGADVGWVFDLEGEEGFRD----------REEKVINELTEKQGIVLATG 81 FYD + KR + GW+ + G+R E + I +L E+ IV+A+G Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193 Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112 GG V + +GV E I+K LA Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 287 bits (736), Expect = 1e-93 Identities = 80/301 (26%), Positives = 132/301 (43%), Gaps = 18/301 (5%) Query: 117 LENRSITLQYADAGELAKAGEKLLSAKGSMTVDKRTNRLLLRDNKTALSALEQWVAQMDL 176 L + +I D + +A SA+ + D N +++RD+ + ++ + +D Sbjct: 219 LSDATIQQVTVDNQRIPQAAT-RASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDK 277 Query: 177 PVGQVELSAHIVTINEKSLRELGVKWTLADAQHAGGVGQVTTLGSDLSVATATTHVGFNI 236 P ++E++ IV IN L ELGV W + + T G ++A+ G Sbjct: 278 PSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASN----GALG 333 Query: 237 GRINGRLLDL---ELSALEQKQQLDIIASPRLLASHLQPASIKQESEIPYQVSSGESGAT 293 ++ R LD ++ LE + +++ P LL A I SE Y +G+ A Sbjct: 334 SLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA- 391 Query: 294 SVEFKEAVLG--MEVTPTVLQKG---RIRLKLHISQNVPGQVLQQADGEVLAIDKQEIET 348 E K G + +TP VL +G I L LHI +G + I + ++T Sbjct: 392 --ELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG-IPTISRTVVDT 448 Query: 349 QVEVKSGETLALGGIFTRKNKSGQDSVPLLGDIPWFGQLFRHDGKEDERRELVVFITPRL 408 V G++L +GGI+ + VPLLGDIP+ G LFR + R + I PR+ Sbjct: 449 VARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRI 508 Query: 409 V 409 + Sbjct: 509 I 509 Score = 36.8 bits (85), Expect = 2e-04 Identities = 18/95 (18%), Positives = 33/95 (34%), Gaps = 4/95 (4%) Query: 1 MKQWIAALLLMLIPGVQAA----KPQKVTLMVDDVPVAQVLQALAEQEKLNLVVSPDVSG 56 K+ + LL+L A P + + +L +VVS ++ Sbjct: 9 FKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIND 68 Query: 57 TVSLHLTDVPWKQALQTVVKSAGLITRQEGNILSV 91 VS + LQ + L+ +GN+L + Sbjct: 69 KVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYI 103
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 29.0 bits (65), Expect = 0.031 Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 18/98 (18%) Query: 226 ENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRNAHPNQSLKNTL 283 E + RG GP +L + ++ + + + L T + N Q A N LK L Sbjct: 63 EAITLRERGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLKALQNARLKAPL 118 Query: 284 AVHL------------PKRLVERLQQLGQIPDVSLKQL 309 ++L P R++ QQL + +V L Sbjct: 119 DIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 32.0 bits (73), Expect = 0.006 Identities = 25/194 (12%), Positives = 57/194 (29%), Gaps = 40/194 (20%) Query: 12 TGLLLLLALAFVLFYEAINGFHDTANAVATVIY------TRAMRSQLAVVMAAVFNFLGV 65 L++AL+ +L + F + + ++A+ + V+ F Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89 Query: 66 LLGGLSVAYAIVHML-------------------PTDLLLNMGSSHGLAMVFSMLLAAII 106 LL ++ H++ P + + S L +L ++ Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVL 149 Query: 107 WNLGTWYFGLPASSSHTLIGAIIGIGLTNALMTGTSVVDALNIPKVLSIFGSLIVSPIVG 166 ++ W + ++ + T + T + + I L+V VG Sbjct: 150 LSILIWIIIKG------NLVTLLQLP-TCGIECITPL--------LGQILRQLMVICTVG 194 Query: 167 LVFAGGLIFLLRRY 180 V + Y Sbjct: 195 FVVISIADYAFEYY 208
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 24.3 bits (52), Expect = 0.035 Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 6/45 (13%) Query: 21 VGNLTPARASVNGT----TRTSDQDFE--SVYAHCQSENASELTG 59 +GNL +A++NGT TR S E + + + +++ASE G Sbjct: 78 MGNLLIQQANINGTIGYHTRFSGHGHEEHAPFDNHAADSASEEKG 122
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 53.3 bits (128), Expect = 4e-10 Identities = 41/218 (18%), Positives = 70/218 (32%), Gaps = 33/218 (15%) Query: 97 LQAELNSAKGSLAKALSTASNARITFNRQASLLKTNYVSR-QDYDT-ARTQLNEAEANVT 154 + + A L S + K Y Q + +L + N+ Sbjct: 257 QENKYVEAVNELRVYKSQLEQIE----SEILSAKEEYQLVTQLFKNEILDKLRQTTDNIG 312 Query: 155 VAKAAVEQATINLQYANVTSPITGVSGKSSV-TVGALVTANQADSLVTVQRLDPIYVDLT 213 + + + Q + + +P++ + V T G +VT + +V V D + V Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTAL 371 Query: 214 QSVQDFLRMKEEVASGQIKQVQGSTPVQLNLE--NGKRY-SQTGTLK--FSDPTVDETTG 268 +D I + + +E RY G +K D D+ G Sbjct: 372 VQNKD------------IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419 Query: 269 SVT--LRAI------FPNPNGDLLPGMYVTALVDEGSR 298 V + +I N N L GM VTA + G R Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457 Score = 33.3 bits (76), Expect = 0.001 Identities = 22/139 (15%), Positives = 48/139 (34%), Gaps = 24/139 (17%) Query: 53 PGRTVPY-EVAEIRPQVGGIIIKRNFI-EGDKVNQGDSLYQIDPAPLQAELNSAKGSLAK 110 G+ EI+P I+ K + EG+ V +GD L ++ +A+ + SL + Sbjct: 87 NGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145 Query: 111 ALSTASNAR-------------ITFNRQASL--------LKTNYVSRQDYDTARTQLNEA 149 A + + + + L+ + ++ + T + Q + Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205 Query: 150 EANVTVAKAAVEQATINLQ 168 E N+ +A + Sbjct: 206 ELNLDKKRAERLTVLARIN 224
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 67.5 bits (165), Expect = 8e-20 Identities = 18/50 (36%), Positives = 33/50 (66%) Query: 1 MPQKYRLLSLIVICFTLLFFTWMIRDSLCELHIKQESYELAAFLAYKLKE 50 +P+ + ++++C TLL FT++ R SLCE+ + E+AAF+AY+ + Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 111 bits (280), Expect = 1e-31 Identities = 40/122 (32%), Positives = 61/122 (50%), Gaps = 11/122 (9%) Query: 97 LNMPNNVTFDSSSAPLKPAGANTLTGVAMVLKEY--PKTAVNVIGYTDSTGGHDLNMRLS 154 + ++V F+ + A LKP G L + L +V V+GYTD G N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 155 QQRADSVASALITQGVDASRIRTQGLGPANPIASNSTAEGK---------AQNRRVEITL 205 ++RA SV LI++G+ A +I +G+G +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 206 SP 207 Sbjct: 335 KG 336
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.9 bits (80), Expect = 5e-05 Identities = 16/52 (30%), Positives = 22/52 (42%), Gaps = 5/52 (9%) Query: 76 VAPKAVRRGIGKALMQYV-----QQRYPHLMLEVYQKNQPAIDFYRAQGFHI 122 VA ++G+G AL+ + + LMLE N A FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.8 bits (62), Expect = 0.039 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 13/90 (14%) Query: 119 SMYNEFGDSTTTLTDPLWHASVSSLGWRVDSRLGDLRPWAQISYNQQFGENIWKAQSGLS 178 S+ + D+ + H S + + + R G++ P ++SY F + Sbjct: 228 SVAVQQQDAKLV-EENYSHNSQTEVAATLAYRFGNVTP--RVSYAHGFKGSF-------- 276 Query: 179 RMTATNQNGNWLDVTVGADMLLNQNIAAYA 208 ATN N ++ V VGA+ ++ +A Sbjct: 277 --DATNYNNDYDQVVVGAEYDFSKRTSALV 304
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.9 bits (101), Expect = 2e-06 Identities = 47/275 (17%), Positives = 94/275 (34%), Gaps = 32/275 (11%) Query: 44 PVSQVAFSFGLL----SLGLAISSSVAGKLQERFGVKRVTVASGILLGLGFFLTAHSNNL 99 + V +G+L +L + V G L +RFG + V + S + + + A + L Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96 Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFAIGSYGLGSLGFK 152 +L++ AG+ AG + + F + LG Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152 Query: 153 FIDTQLLETVGLEKTFVIWGAIALVMIVFGATLMKDAPKQEVKTSNGVVEKDYTLAESMR 212 L+ F A+ + + G L+ ++ K E + R Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207 Query: 213 --KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQSLAHLDVVSAANAVTVISIAN-LS 265 ++AV F+ + L+VI + H D + ++ I + L+ Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSLA 262 Query: 266 GRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300 ++ G ++ ++ R + +G + G L FA Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Score = 36.0 bits (83), Expect = 2e-04 Identities = 37/155 (23%), Positives = 64/155 (41%), Gaps = 9/155 (5%) Query: 241 AQSLAHLDVVSAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300 AH ++ A A+ + A + G L SD+ R V+ + + V A + A Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGAL-----SDRFGRRPVLLVSLAGAAVDYAIMATA 93 Query: 301 PLNAVTFFAAIACVAFNFGGTITVFPSLVSEFFGLNNLAKNYGVIYLGFGIGSIFGSIIA 360 P V + I VA G T V + +++ + A+++G + FG G + G ++ Sbjct: 94 PFLWVLYIGRI--VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151 Query: 361 SLFGGF--YVTFYVIFALLILSLALSTTIRQPEQK 393 L GGF + F+ AL L+ + K Sbjct: 152 GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 31.9 bits (72), Expect = 0.008 Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 5/44 (11%) Query: 94 QGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIR 137 G ++ + ID + W + VL+VAW+ + +R Sbjct: 442 TGGELPFWQQQSFIDQLLAAGRW-----LLVLVVAWILWRKAVR 480
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 31.6 bits (71), Expect = 0.003 Identities = 43/165 (26%), Positives = 65/165 (39%), Gaps = 32/165 (19%) Query: 93 DFFVEHGLLASVNIDGPTLIALRQQPKILRQIERLPWLRFELV----EHIRLPKDSTFAS 148 DF++ H +++ G T A P+ + WL E V EHI ++ Sbjct: 157 DFWLLHDSNGILHLLGKTAAARLSDPQAASHTAQ--WLVEESVTPAGEHI------YYSY 208 Query: 149 MCEFGPLWLDDFGTGMANFSA---LSEVRYDYIKIARELFVMLRQSPEGRTLFSQLLHLM 205 + E G + + SA LS+V+Y A +L++ +P + LF L+ Sbjct: 209 LAENGDNVDLNGNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAVQWLF----TLV 264 Query: 206 NRYC-RGVIVEGVETPEEWRDVQNSPAFAAQGWFLSRPAPMETLN 249 Y RGV D Q PAF AQ +L+R P N Sbjct: 265 FDYGERGV------------DPQVPPAFTAQNSWLARQDPFSLYN 297
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.5 bits (79), Expect = 7e-04 Identities = 85/405 (20%), Positives = 137/405 (33%), Gaps = 72/405 (17%) Query: 79 IGSAVFGHFGDRVGRKATLVASLLTMGISTVVIGLLPGYATIGIFAPLLLALARFGQGLG 138 IG+AV+G D++G K LL GI G + G+ F+ LL +ARF QG G Sbjct: 64 IGTAVYGKLSDQLGIK-----RLLLFGIIINCFGSVIGFVGHSFFS--LLIMARFIQGAG 116 Query: 139 LGGEWGGAALLATENAPPRKR----ALYGSFPQLGAPIGFFFANGTFLLLSW-------- 186 ++ P R L GS +G +G + W Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 187 -----LLTDEQFMSWGWRV--PF-IFSAVLVIIG-------------LYVRVSLHESPVF 225 + + + R+ F I +L+ +G ++ VS+ +F Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236 Query: 226 EKVAKAKKQVKIPLGTLLTKHVRVTVLGTFIMLATYTLFYIMTVYSMTFSTAAAPVGLGL 285 K + + G + VL I+ T F M Y M + +G Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG- 295 Query: 286 PRNEVLWMLMMAVIGFGVMVPVAGLLADAFGRRKSMVIITTLIIL-FALFAFNPLLGSGN 344 + +++ M+VI FG + G+L D G + I T + + F +F S Sbjct: 296 --SVIIFPGTMSVIIFG---YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF 350 Query: 345 PILVFAFLLLGLSLMGL---TFGPMGALLPELFPTEVRYTGASFS-YNVSSILGASVAPY 400 ++ F+L GLS T L E GA S N +S L Sbjct: 351 MTIIIVFVLGGLSFTKTVISTIVSSS-----LKQQEA---GAGMSLLNFTSFLSEGTGIA 402 Query: 401 IAAWL-------------QANYGLGAVGLYLAAMAGLTLIALLLT 432 I L + + L +G+ +I+ L+T Sbjct: 403 IVGGLLSIPLLDQRLLPMEVDQSTYLYSNLLLLFSGIIVISWLVT 447 Score = 29.1 bits (65), Expect = 0.041 Identities = 13/73 (17%), Positives = 29/73 (39%), Gaps = 2/73 (2%) Query: 283 LGLPRNEVLWMLMMAVIGFGVMVPVAGLLADAFGRRKSMVIITTLIILFALFAFNPLLGS 342 P W+ ++ F + V G L+D G ++ ++ + ++ F + S Sbjct: 44 FNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF--VGHS 101 Query: 343 GNPILVFAFLLLG 355 +L+ A + G Sbjct: 102 FFSLLIMARFIQG 114
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 38.5 bits (89), Expect = 1e-05 Identities = 31/105 (29%), Positives = 46/105 (43%), Gaps = 17/105 (16%) Query: 134 TRKIPWNTLLERVDIIPTSMVATMAAAESGWGTSKLARNN----NNLFGMKC---MKGRC 186 + L + +P ++ AA ESGWG ++ R N NLFG+K KG Sbjct: 154 AQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPV 213 Query: 187 T---------NAPGKVKG-YSQFSSVKESVSAYVTNLNTHPAYSS 221 T KVK + +SS E++S YV L +P Y++ Sbjct: 214 TEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAA 258
>SECA#SecA protein signature. Length = 901 Score = 41.8 bits (98), Expect = 1e-05 Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 18/129 (13%) Query: 233 NLSMLALRAGAQRFHAQPLSANDTLKNKLLAALPFKPTGAQARVVAEIEHDM-ALDVPMM 291 LS L+ F A+ L + L+N + A A R ++ M DV ++ Sbjct: 37 KLSDEELKGKTAEFRAR-LEKGEVLENLIPEAF------AVVREASKRVFGMRHFDVQLL 89 Query: 292 ---RLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFA 342 L + + G GKTL A L A L A+ GK V ++ + LA++ A N R F Sbjct: 90 GGMVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFE 148 Query: 343 PLGIEVGWL 351 LG+ VG Sbjct: 149 FLGLTVGIN 157
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.5 bits (113), Expect = 5e-08 Identities = 81/375 (21%), Positives = 135/375 (36%), Gaps = 41/375 (10%) Query: 20 FSAGLLGIGQNGLLVVLPVLVIQTNLSLSV---WAALLMLGSMLFLPSSPWWGKQISRTG 76 + L +G ++ VLP L+ S V + LL L +++ +P G R G Sbjct: 12 STVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG 71 Query: 77 SKPVVLWALGGYGISFTLLGLGSVLMATSAITTAVGLGILIIARIAYGLTVSAMVPACQV 136 +PV+L +L G + + ++ L +L I RI G+T + A Sbjct: 72 RRPVLLVSLAGAAVDYAIMATAPFLW------------VLYIGRIVAGITGATGAVAGAY 119 Query: 137 WALQRAGEGNRMAALATISSGLSCGRLFGPLCAAAMLAIHPLAPLGLLMAAPVLALLMLL 196 A R +S+ G + GP+ M P AP A L L Sbjct: 120 IA-DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178 Query: 197 RL------PGTPPQPTPECKSVSLKRDCLPYLLCAILLAAAVSMMQLGLSPAL------T 244 L P ++ R + A L+A M +G PA Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 245 RQFATDTTAISQQVAWLLGLSAVAALIAQFGVLRPQRLTPVALLLSAGVLMSGGLAIMLS 304 +F D T I +A L ++A + G + + AL+L +G + + + Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMI-TGPVAARLGERRALMLGMIADGTGYILLAFA 297 Query: 305 EQLWLFYPGCAVLSFGAALATPAYQLLLNDKLADGAGAGWLATSHTLGYGLCALLVPLVS 364 + W+ +P +L+ G + PA Q +L+ + D G L L L S Sbjct: 298 TRGWMAFPIMVLLASG-GIGMPALQAMLS-RQVDEERQGQLQ----------GSLAALTS 345 Query: 365 KTGVAIALIMAALFA 379 T + L+ A++A Sbjct: 346 LTSIVGPLLFTAIYA 360
>PF04183#IucA / IucC family Length = 580 Score = 339 bits (872), Expect = e-111 Identities = 104/480 (21%), Positives = 178/480 (37%), Gaps = 46/480 (9%) Query: 58 ELLIPLDEQKSLHFRVAYFSPTQHHRF-----AFPARLVTASGSYPVDFTTLSRLIIDKL 112 E + + Q + + P RF + + A D L++ ++ +L Sbjct: 24 EQVFHAESQGDDRYCIN--LPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQL 81 Query: 113 RHQLFLPVPLCETFHQRVLESHVHTQQAIDARHDWAALREKALNFGEAEQALLTGHAFHP 172 + L + Q + + + Q + AR +A LN + Q LL+GH Sbjct: 82 KQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNA-DRLQCLLSGHPKFV 140 Query: 173 APKSHEPFNRREAERYLPDMAPHFPLRWFSVDKTQIAGES-LHLNLQQRLTRFAAENAPQ 231 K + + ERY P+ A F L W +V + + +++ Q LT A PQ Sbjct: 141 FNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLT---AAMDPQ 197 Query: 232 LLNELS--------DNQWLF-PLHPWQGEYLLQQGWCQALVAKGLIKDLGEAGTSWLPTT 282 S D+ WL P+HPWQ + + + A+G + LGE G WL Sbjct: 198 EFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF-AEGRMVSLGEFGDQWLAQQ 256 Query: 283 SSRSLYCATSRD--MIKFSLSVRLTNSIRTLSVKEVKRGMRLARLAQ----TDGWQMLQ- 335 S R+L A+ R IK L++ T+ R + + + G +R Q TD + Sbjct: 257 SLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLVQSG 316 Query: 336 ---VRFPTFRVMQEDGWAGLLDLNGNIMQESLFALRENLLVDQPKSQTNVLVSLTQAAPD 392 + P + +G+A L + REN ++ VL++ + Sbjct: 317 AVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDE 376 Query: 393 GGDSLLVSAVKRLSDRLGITVQQAAHAWVDAYCQQVLKPLFTAEADYGLVLLAHQQNILV 452 L + + DR G+ A W+ + V+ PL+ YG+ L+AH QNI + Sbjct: 377 NNQPLAGAYI----DRSGLD----AETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITL 428 Query: 453 QMLGDLPVGFIYRDCQGSAFMPHATDWLDSIGEAQAENIFTHEQLLRYFPYYLLVNSTFA 512 M +P + +D QG M + + E + L++ Sbjct: 429 AMKEGVPQRVLLKDFQGD--MRLVKEEFPEMDSLPQE----VRDVTSRLSADYLIHDLQT 482
>PF04183#IucA / IucC family Length = 580 Score = 816 bits (2109), Expect = 0.0 Identities = 565/580 (97%), Positives = 571/580 (98%) Query: 1 MNHKDWDFVNRRLVAKMLSEMEYEQVFHAESQGDDHYCINLPGAQWRFIAERGIWGWLWI 60 MNHKDWD VNRRLVAKMLSE+EYEQVFHAESQGDD YCINLPGAQWRFIAERGIWGWLWI Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIWGWLWI 60 Query: 61 DAQTLRCTDEPVLAQTLLMQLKPVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD 120 DAQTLRC DEPVLAQTLLMQLK VLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD Sbjct: 61 DAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASD 120 Query: 121 LINLDADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYTNTFRLHWLAVKREHMIWRC 180 LINL+ADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEY NTFRLHWLAVKREHMIWRC Sbjct: 121 LINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRC 180 Query: 181 DNDLDIQQLLTAAMDPQEFTRFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEG 240 DN++DI QLLTAAMDPQEF RFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEG Sbjct: 181 DNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEG 240 Query: 241 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR 300 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR Sbjct: 241 RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASR 300 Query: 301 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK 360 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK Sbjct: 301 WLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLK 360 Query: 361 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI Sbjct: 361 PDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALI 420 Query: 421 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEAFPEMDSLPQEVRDVTSRLSADYLIHDL 480 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKE FPEMDSLPQEVRDVTSRLSADYLIHDL Sbjct: 421 AHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDL 480 Query: 481 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMNKHPQMAERFALFSLFRPQIIR 540 QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYM KHPQM+ERFALFSLFRPQIIR Sbjct: 481 
QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIR 540 Query: 541 VVLNPVKLTWPDLDGGSRMLPNYLENLQNPLWLVTQEYES 580 VVLNPVKLTWPDLDGGSRMLPNYLE+LQNPLWLVTQEYES Sbjct: 541 VVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWLVTQEYES 580
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 82.8 bits (204), Expect = 4e-20 Identities = 48/214 (22%), Positives = 74/214 (34%), Gaps = 52/214 (24%) Query: 31 NRKLVATMLSLAVAGTVNA---ANIDISNVWARDYLDLAQNKGIFQPGATDVTITLKNGD 87 N+K ++L VA + A + +V + + D A+NKG F GAT+V + KN Sbjct: 3 NKKFKLNFIALTVAYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNNK 62 Query: 88 KF--SFHN-LSIPDFSGAAAS-GAATAIGGSYSVTVAH-----------------NKKNP 126 + N + + DFS AT I Y V V H N N Sbjct: 63 DLGTALPNGIPMIDFSVVDVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGNA 122 Query: 127 QAAETQVYAQSSYKVVDRRNSN-------------------DFEIQRLNKFVVETVGATP 167 +A ++ Y V++ D+ + RL+KFV E Sbjct: 123 KAHRDVSSEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFVTEVAPIEA 182 Query: 168 AETNPTTYSDALERYGIVTSDGSKKIIGFRAGSG 201 + + +D +K R GSG Sbjct: 183 STAS---------SDAGTYNDQNKYPAFVRLGSG 207
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.9 bits (145), Expect = 6e-12 Identities = 41/184 (22%), Positives = 81/184 (44%), Gaps = 1/184 (0%) Query: 5 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 64 R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 65 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 123 +SD++G + ++L G+ I +++ V S ++LI A +QG G + + Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 124 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCAGVTFSMARWM 183 + A L+ + + + P IGG++ +W L ++ V F M Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190 Query: 184 PETR 187 E R Sbjct: 191 KEVR 194
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.1 bits (99), Expect = 3e-06 Identities = 63/383 (16%), Positives = 112/383 (29%), Gaps = 34/383 (8%) Query: 51 AEMGYVFSAFAWLYTLCQIPGGWFLDRVGSRVTYFIAIFGWSVVTLFQGFATGLMSLIGL 110 A G + + +A + C G DR G R +++ G +V A L L Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 111 RAITGIFEAPAFPTNNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQEMLSWH 170 R + GI A + ERA GF ++ G+ P+L + S H Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160 Query: 171 WVFIVTGGIGIIWSLIWFKVYQPPRLTKGISKAELDYIRDGGGLVDGDAPVKKEARQPLT 230 F + + L + + P+++EA PL Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGER-------------------RPLRREALNPLA 201 Query: 231 AKDWKLVFHRKLIGVYLGQFAVASALWFFLTWFPNYLTQEKGITALKAGFMTTVPFLAAF 290 + W + + F + + + A G Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAA---FGI 257 Query: 291 VGVLLSGWVADLLVRKGFSLGFARKTPIICGLLISTC--IMGANYTNDPMMIMCLMALAF 348 + L + + + + ++ G++ I+ A T M ++ LA Sbjct: 258 LHSLAQAMITGPVAAR-----LGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312 Query: 349 FGNGFASITWSLVSSLAPMRLIGLTGGVFNFAGGLGGITVPLVVGYL-AQGYGFAPALVY 407 G G ++ +++S G G L I PL+ + A + Sbjct: 313 GGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAW 371 Query: 408 ISAVALIGALSYILLVGDVKRVG 430 I+ AL L G G Sbjct: 372 IAGAALYLLCLPALRRGLWSGAG 394
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 873 bits (2258), Expect = 0.0 Identities = 547/548 (99%), Positives = 547/548 (99%) Query: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 Query: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 Query: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 Query: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD 240 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD 240 Query: 241 NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIVAIGYKSQPVLVQPGQT 300 NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGI AIGYKSQPVLVQPGQT Sbjct: 241 NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT 300 Query: 301 GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII 360 GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII Sbjct: 301 GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII 360 Query: 361 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL 420 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL Sbjct: 361 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL 420 Query: 421 GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK 480 GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK Sbjct: 421 GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK 480 Query: 481 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL 540 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL Sbjct: 
481 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL 540 Query: 541 HSREKKKS 548 HSREKKKS Sbjct: 541 HSREKKKS 548
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 54.1 bits (130), Expect = 3e-10 Identities = 60/247 (24%), Positives = 94/247 (38%), Gaps = 7/247 (2%) Query: 2 SRFLICSFALVLLYPAGIDMYLVGLPRIAADLNASEAQLHIAFSVYLAGMAAAML----F 57 +R LI + V L GI + + LP + DL S + + + LA A Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPV 62 Query: 58 AGKVADRSGRKPVAIPGAALFIIASVFCSLAETSTLFLAGRFLQGLGAGCCYVVAFAILR 117 G ++DR GR+PV + A + + A + GR + G+ G VA A + Sbjct: 63 LGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIA 121 Query: 118 DTLDDRRRAKVLSLLNGITCIIPVLAPVLGHLIMLKFPWQSLFWAMAMMGIAVLMLSLFI 177 D D RA+ ++ V PVLG L M F + F+A A + + F+ Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFL 180 Query: 178 LKETRPASPAASDKPRENSESLLNRFFLSRVVITTLSVSVILTFVNTSPVLLMEIMGFER 237 L E+ + N + VV ++V I+ V P L I G +R Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240 Query: 238 GEYATIM 244 + Sbjct: 241 FHWDATT 247
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.4 bits (97), Expect = 8e-06 Identities = 9/74 (12%), Positives = 33/74 (44%), Gaps = 1/74 (1%) Query: 16 LRKQQSRLRQYACQVAGYEQEIERLKAQLDRLRRMLFGQSSEKKRHKLENQIRQAEKRLS 75 + +Q+++ + ++ Y+ ++E++++++ + + K L+ ++RQ + Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD-KLRQTTDNIG 312 Query: 76 ELENRLNTARNLLE 89 L L + Sbjct: 313 LLTLELAKNEERQQ 326
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 182 bits (464), Expect = 5e-57 Identities = 84/353 (23%), Positives = 145/353 (41%), Gaps = 44/353 (12%) Query: 3 KILITGGAGFIGSALVRYIINETSDAVVVVDKLT--YAGNL-MSLAPVAQSERFAFEKVD 59 K L+TG AGFIG + + ++ E VV +D L Y +L + + F F K+D Sbjct: 2 KYLVTGAAGFIGFHVSKRLL-EAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 60 ICDRAELARVFTEHQPDCVMHLAAESHVDRSIDGPAAFIETNIVGTYTLLEAARAYWNTL 119 + DR + +F + V V S++ P A+ ++N+ G +LE R Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN---- 116 Query: 120 TEDKKSAFRFHHISTDEVYGDLHSTDDFFTETTPYAPSSPYSASKASSDHLVRAWLRTYG 179 + S+ VYG L+ F T+ + P S Y+A+K +++ + + YG Sbjct: 117 -----KIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 180 LPTLITNCSNNYGPYHFPEKLIPLMILNALAGKPLPVYGNGQQIRDWLYVEDHARALYCV 239 LP YGP+ P+ + L GK + VY G+ RD+ Y++D A A+ + Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230 Query: 240 ------------------ATTGKVGETYNIGGHNERKNLDVVETICELLEELASNKPHGV 281 A + YNIG + + +D ++ + + L Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDAL------GIEAK 284 Query: 282 AHYRDLITFVADRPGHDLRYAIDASKIARELGWLPQETFESGMRKTVQWYLAN 334 + L +PG L + D + +G+ P+ T + G++ V WY Sbjct: 285 KNMLPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 3e-05 Identities = 26/126 (20%), Positives = 45/126 (35%), Gaps = 10/126 (7%) Query: 104 FAQSRFRAPWYAPDASGRFYAQWIEN---AVRGTFDHQCLILRAA-SGDIRGYVSLRELN 159 + + RF P++ ++E A + I R + GY + ++ Sbjct: 37 YTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIA 96 Query: 160 -ATDARIGLLAGRGAGAELMQTALNWAYARGKTTLRVATQMGNTAALKRYIQSGANVEST 218 A D R +G G L+ A+ WA L + TQ N +A Y + + + Sbjct: 97 VAKDYR-----KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151 Query: 219 AYWLYR 224 LY Sbjct: 152 DTMLYS 157
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 201 bits (512), Expect = 4e-69 Identities = 171/171 (100%), Positives = 171/171 (100%) Query: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60 Query: 61 DSLKKVEKASLTNLTPELKASMDELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNE 120 DSLKKVEKASLTNLTPELKASMDELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNE Sbjct: 61 DSLKKVEKASLTNLTPELKASMDELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNE 120 Query: 121 AAHEGVTPAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAPSPSSSDKP 171 AAHEGVTPAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAPSPSSSDKP Sbjct: 121 AAHEGVTPAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAPSPSSSDKP 171
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.4 bits (66), Expect = 0.023 Identities = 38/172 (22%), Positives = 67/172 (38%), Gaps = 17/172 (9%) Query: 160 TAGIASFEPHVFVGAVLPFLVGFA-LGNLDPELREFFSKAVQTLIPF-FAFALGNTID-L 216 I +F +L FLV + L N+ L + V L F A G +I+ L Sbjct: 334 QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTL 393 Query: 217 TVIAQTGLLGILLGVAVIIVTGIPLIIADKLIGGGDGTAGIAASSSAGAAV--ATPVLIA 274 T+ +G+L+ A+++V + ++ + A + S A+ VL A Sbjct: 394 TMFGMVLAIGLLVDDAIVVVENVERVMMED--KLPPKEATEKSMSQIQGALVGIAMVLSA 451 Query: 275 EMVPA----------FKPMAPAATSLVATAVIVTSILVPILTSIWSRKVKAR 316 +P ++ + S +A +V+V IL P L + + V A Sbjct: 452 VFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503
>PF06580#Sensor histidine kinase Length = 349 Score = 29.4 bits (66), Expect = 0.027 Identities = 28/186 (15%), Positives = 63/186 (33%), Gaps = 33/186 (17%) Query: 277 IETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEV-LDNAAFEAEQM--GKSLT 333 I + + M+ L + R + + + L E+ + ++ + + L Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQV----SLADELTVVDSYLQLASIQFEDRLQ 241 Query: 334 VNF--PPGPWPLYGNPNALESALENIVRNAL--RYSHTKIEVGFAVDKDGITITVDDDGP 389 P + P +++ +EN +++ + KI + D +T+ V++ G Sbjct: 242 FENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301 Query: 390 GVSPEDREQIFRPFYRTDEARDRESGGTGLGLAIVETAIQQHRGW---VKAEDSPLGGLR 446 +E TG GL V +Q G +K + G + Sbjct: 302 LALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ-GKVN 342 Query: 447 LVIWLP 452 ++ +P Sbjct: 343 AMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.2 bits (229), Expect = 9e-24 Identities = 35/117 (29%), Positives = 62/117 (52%), Gaps = 2/117 (1%) Query: 3 KILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLL-DDSIDLLLLDVMMPKKNGID 61 IL+ DDD + ++L + L G++V + + + DL++ DV+MP +N D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 TLKALRQTH-QTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRR 117 L +++ PV++++A+ + + + E GA DYLPKPF+ EL+ I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.2 bits (68), Expect = 0.017 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%) Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81 T +++ G +G GK +AR K N PF+ + Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.0 bits (98), Expect = 2e-06 Identities = 32/155 (20%), Positives = 64/155 (41%), Gaps = 5/155 (3%) Query: 114 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSR 173 + +QAD+ P+ E+ ++ P + +AE + Q+S+ Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049 Query: 174 TTEQSWQQQT-RTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 232 T E++ Q T T+Q V + + + A+TQ + T ++ ++ A V + Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109 Query: 233 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 267 A T + ++ + Q + EQ+ETV+ Q Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 525 bits (1355), Expect = 0.0 Identities = 183/468 (39%), Positives = 253/468 (54%), Gaps = 35/468 (7%) Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67 ILV DDD + T+L L GY+V + ++ + DLV+ DV M + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 68 LKEIKALNPAIPVLIMTAYSSVETAVEALKTGAQDYLIKPLDFDNLQATLEKALAHTHSI 127 L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 128 DAETPAVTASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187 ++ + +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185 Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247 R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+ Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245 Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307 Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV + Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305 Query: 308 EVPSLRQRREDIPLLAGHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367 +P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364 Query: 368 AVVLLTGEYISERELPLAIASTPIPLGQSQDIQP-------------------------- 401 L + I+ + + S + Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424 Query: 402 --------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441 L E+E +ILAAL T GN+ +AA LG+ R TL K+ Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 30.7 bits (69), Expect = 0.001 Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%) Query: 78 IDPDVCGCGVGRMLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126 + D GVG L+ A+ A E L + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.004 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 90.7 bits (225), Expect = 2e-23 Identities = 40/121 (33%), Positives = 59/121 (48%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVTTARMAEQSLEDGHYSLVVLDLGLPDEDGLH 61 IL+ +DD + L A GY + A + + G LVV D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLARIRQKKYTLPVLILTARDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNN 121 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 Q 122 + Sbjct: 125 R 125
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 40/182 (21%), Positives = 80/182 (43%), Gaps = 34/182 (18%) Query: 181 ARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDV-ILPSYDELSTML--DQRQQTLL 237 + +M+ S+S+L++ S N + V L +++ ++ SY +L+++ D+ Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 238 LPESAADITVQGDATLLRMLLRNLVENAHRY----SPQGSNIMIKLQEDGGAV-MAVEDE 292 + + D+ V ML++ LVEN ++ PQG I++K +D G V + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 293 GPGIDESKCGELSKAFVRMDSRYGGIGLGLSIV-SRITQLHHGQFFLQNRQETSGTRAWV 351 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 352 RL 353 + Sbjct: 346 LI 347
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.9 bits (101), Expect = 2e-06 Identities = 57/290 (19%), Positives = 105/290 (36%), Gaps = 55/290 (18%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGE 144 G L D++GR+ +L +++ ++ + P +W +L I ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKYWRS 260 PFF A L + L K E+ P SF+ W Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFR------WAR 207 Query: 261 LLTCIGLVIATNVTYYML----LTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVM 315 +T + ++A ++ + H+ G+ + ++ L + Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT 267 Query: 316 GLLSDRFGRRPFVLLG----SVALFVLA--------IPAFILINSNVIGL 353 G ++ R G R ++LG +LA P +L+ S IG+ Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317 Score = 41.0 bits (96), Expect = 8e-06 Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP +L+ L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFMGVMASTLPAMFPTHIR---YSALAAAFNISVLVAG 401 + + + +++ G ++A I V + + + R + ++A F +VAG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQNLMMPAYYLMVVAVIGLITG-VTMKETANR 444 P L + S + P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFSPH--APFFAAAALNGLNFLTGCFLLPESHKG 187
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.019 Identities = 12/22 (54%), Positives = 13/22 (59%) Query: 32 MVALLGPSGSGKSTLLRHLSGL 53 V L G G GKSTL+ L GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 26.7 bits (59), Expect = 0.020 Identities = 5/33 (15%), Positives = 13/33 (39%), Gaps = 1/33 (3%) Query: 17 ELVEKR-QRFATILSIIMLAVYIGFILLIAFAP 48 EL+E R +++ ++ + +L Sbjct: 47 ELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 29.9 bits (67), Expect = 0.006 Identities = 6/21 (28%), Positives = 11/21 (52%) Query: 179 FGNLDDPNSEISQLLRQKPTY 199 GNL++P ++ L+ P Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 71.8 bits (176), Expect = 5e-16 Identities = 51/363 (14%), Positives = 112/363 (30%), Gaps = 78/363 (21%) Query: 8 APRSKFPALLVVALALVALVFVIW-RVDS-APSTNDAYASADTIDVVPEVSGRIVELAVT 65 + R + A ++ ++A + + +V+ A + S + ++ P + + E+ V Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 66 DNQAVKQGDLLFRIDPRPYEANLAKAEAS-----LAALDKQIMLTQRSVDAQQFGADSVN 120 + ++V++GD+L ++ EA+ K ++S L QI+ ++ Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 121 ATVEKARAAAKQATDTL------------------------------RRTEPLLKEGFVS 150 + +L R V Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 151 AEDVDRARTAQRAAEADLNAVLLQAQSAASAVSGVDALVAQRAAVEADIALTKLH----- 205 +D + +AVL Q AV+ + +Q +E++I K Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 206 -------------------------------LEMTTVRAPFDGRIISLKT-SVGQFASAM 233 + + +RAP ++ LK + G + Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 234 RPIFTLIDTRHWYVI-ANFRETDLKNIRSGTPATIRLMSDSGKTF---EGKVDSIGYGVL 289 + ++ + A + D+ I G A I++ + + GKV +I + Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413 Query: 290 PDD 292 D Sbjct: 414 EDQ 416
>PERTACTIN#Pertactin signature. Length = 922 Score = 27.0 bits (59), Expect = 0.048 Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%) Query: 119 GGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSG----GAQSRPQQSAPAAP 164 G APAGG + GG GG + + G + QS AP Sbjct: 261 GDAPAGGAVPGGAVPGGAVPGGFGPLLDGWYGVDVSDSTVDLAQSIVEAP 310
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 35.1 bits (81), Expect = 4e-04 Identities = 21/85 (24%), Positives = 37/85 (43%), Gaps = 20/85 (23%) Query: 26 CDVLVANGKIIAVASNIPSDIVPNCT--------VVDLSGQILCPGFIDQHVHLIGGGGE 77 D+ + +G+I A+ D+ P T V+ G+I+ G +D H+H I Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI----- 140 Query: 78 AGPTTRTPEVALSRLTEAGVTSVVG 102 + E AL +G+T ++G Sbjct: 141 ---CPQQIEEALM----SGLTCMLG 158
>PF06580#Sensor histidine kinase Length = 349 Score = 31.4 bits (71), Expect = 0.008 Identities = 10/49 (20%), Positives = 25/49 (51%) Query: 230 LVPLIPAIIMISTTIANIWLVKDTPAWEVVNFIGSSPIAMFIAMVVAFV 278 + +I ++ I +W V +T W ++ FI + P+A + + ++ + Sbjct: 73 MGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSII 121
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 28.1 bits (62), Expect = 0.047 Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 1/79 (1%) Query: 211 SQNLGYYLSGTTADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVS 270 S+N G ++ +A+ N FT PA V V L ++G ++ + + + Sbjct: 133 SENAGKEITIGSAN-PNVTFTEKTGDQPASTVKVTLDQDGVAKLSSVQIKNVYAIDTTYN 191 Query: 271 LGLTANYARTGGQVTAGNV 289 + TG VT G V Sbjct: 192 SNVNFYDVTTGATVTTGAV 210
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 33.5 bits (76), Expect = 4e-04 Identities = 30/158 (18%), Positives = 49/158 (31%), Gaps = 9/158 (5%) Query: 3 WCKRGYVLAAMLALASATIQAADVTITVNGKVVAKPCTVSTTNATVDLGDLYSFSLMSAG 62 W R + A LA + +TI + VT VN + + + + G Sbjct: 258 WMGRLQYVGAYLAPSYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTH------IG 311 Query: 63 AASAWHDVALELTNCPVG--TSRVTASFSGAADSTGYYKNQGTAQNIQLELQDDSGNTLN 120 W L + P G + S + Q ++QN + N+ Sbjct: 312 TLDLWQSAGLNIIAPPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQ 371 Query: 121 TGATKTVQVDDSSQSAHFPLQVRALTVNGGATQGTIQA 158 + QV D + V +N A GTI+ Sbjct: 372 KTEIQPTQVIDGPFAGGKNTVVNINRINTNA-DGTIRV 408
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 1086 bits (2810), Expect = 0.0 Identities = 866/878 (98%), Positives = 871/878 (99%) Query: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLVVACAFAAQAPLSSADLYFNLRFLADDPQA 60 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRL VACAFAAQAPLSSA+LYFN RFLADDPQA Sbjct: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60 Query: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN Sbjct: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120 Query: 121 TASVAGMNLLADDACVPLTTMVQDATAHLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 TASV+GMNLLADDACVPLT+M+ DATA LDVGQQRLNLTIPQAFMSNRARGYIPPELWDP Sbjct: 121 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 Query: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSK 240 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSK Sbjct: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSK 240 Query: 241 NKWQHIITWIERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300 NKWQHI TW+ERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV Sbjct: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300 Query: 301 IHGIARGTAQVTIKQNGYGIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360 IHGIARGTAQVTIKQNGY IYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV Sbjct: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360 Query: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY Sbjct: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420 Query: 421 RAFNFGIGKNMEALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480 RAFNFGIGKNM ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR Sbjct: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480 Query: 481 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT Sbjct: 481 
YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540 Query: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 Query: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 Query: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL Sbjct: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720 Query: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP Sbjct: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 Query: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA Sbjct: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840 Query: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR Sbjct: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 26.7 bits (59), Expect = 0.013 Identities = 7/45 (15%), Positives = 16/45 (35%), Gaps = 1/45 (2%) Query: 4 KRYPEEFKTEAVKQVVDR-GYSVASVATRLDITTHSLYAWIKKYG 47 R E + + + + A L + ++L I++ G Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.2 bits (73), Expect = 6e-04 Identities = 15/48 (31%), Positives = 18/48 (37%) Query: 97 PAIRGKGLAKKLALMAMEQAREMGFKRCYLETTAFLKEAIALYEHLGF 144 R KG+ L A+E A+E F LET A Y F Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 30.3 bits (68), Expect = 0.009 Identities = 16/59 (27%), Positives = 21/59 (35%), Gaps = 7/59 (11%) Query: 228 FSIYTQAGYALAGVGVELDAIASVVIGGTLLSGGVGTVLGTLFGVAIQGLIQTYINFDG 286 FSIY GVG L + I + G G V GVAI ++ + Sbjct: 454 FSIY-------GGVGAGLGSYTYAKIDNKDVKGYTGMVASGALGVAINAAEGVCVDLEA 505
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 32.2 bits (73), Expect = 0.003 Identities = 15/59 (25%), Positives = 28/59 (47%), Gaps = 4/59 (6%) Query: 79 TGGIDLSVGAV----MAIAGATTAAMTVAGFSLPIVLLSALGTGILAGLWNGILVAILK 133 TG ID S+ + +++ +AA T + P+ L TGI++G+ A+ + Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFE 419
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 165 bits (418), Expect = 2e-53 Identities = 80/207 (38%), Positives = 115/207 (55%), Gaps = 5/207 (2%) Query: 3 TPTFDTIEAQASYGIGLQVGQQLSESGLEGLLPEALVAGIADALEGKHPAVPVDVVHRAL 62 + T + + SY IG +G+ G++ + P+ L G+ D + G + + + L Sbjct: 24 ATSLTTDKDKLSYSIGADLGKNFKNQGID-INPDVLAKGMQDGMSGAQLILTEEQMKDVL 82 Query: 63 REIHERADAVRRQRFQAMAAE----GVKYLEENAKKEGVNSTESGLQFRVINQGEGAIPA 118 + + A R F A E G +L N K G+ SGLQ+++I+ G GA P Sbjct: 83 SKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGAKPG 142 Query: 119 RTDRVRVHYTGKLIDGTVFDSSVARGEPAEFPVNGVIPGWIEALTLMPVGSKWELTIPQE 178 ++D V V YTG LIDGTVFDS+ G+PA F V+ VIPGW EAL LMP GS WE+ +P + Sbjct: 143 KSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPAD 202 Query: 179 LAYGERGAGASIPPFSTLVFEVELLEI 205 LAYG R G I P TL+F++ L+ + Sbjct: 203 LAYGPRSVGGPIGPNETLIFKIHLISV 229
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 75.0 bits (184), Expect = 8e-19 Identities = 39/207 (18%), Positives = 77/207 (37%), Gaps = 20/207 (9%) Query: 12 KEKLLLCAVNEFAEYGYEGARVDNIVKAAGCSKQTVYHHFGNKENLFIEVLEYTWNDIRQ 71 ++ +L A+ F++ G + I KAAG ++ +Y HF +K +LF E+ E + ++I + Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 72 K--EKALDFSDLPPQKAIEKIID-FTWDYYIAN-PWFLKIV-HSENQSKGVH-YAKSQRL 125 E F P E +I ++I+ H + ++QR Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132 Query: 126 LEINHAHLQLMESLLDEGKKYNIFKPGIDPLQVNINIAALGGYYLINQHTLGLVYHISMV 185 L + +E L + + + + + GY GL+ + Sbjct: 133 LCL--ESYDRIEQTLKHCIEAKMLPADLMTRRA---AIIMRGYI------SGLMENWLFA 181 Query: 186 --SPQALEARRKVIKETILSWLLVDPS 210 S + R + +L L+ P+ Sbjct: 182 PQSFDLKKEARDYV-AILLEMYLLCPT 207
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 70.1 bits (171), Expect = 2e-16 Identities = 67/263 (25%), Positives = 115/263 (43%), Gaps = 22/263 (8%) Query: 7 VAVITGATRGIGKGCAQELARGGFNLLINDRPDADSVEKLHITQQECLAEGVEVICFPAD 66 +A ITGA +GIG+ A+ LA G ++ D + EKL AE FPAD Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDY----NPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VGDLSLHEEMLDAAQNQWGRLDCLLNNAGISVKKRGDLLDLEPDSFDQNIAINTRAPFFL 126 V D + +E+ + + G +D L+N AG+ + G + L + ++ ++N+ F Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 127 AQAFSKRLLAQPKPEAELPHRSIIFVSSINAIMLAMNRGEYTIAKTAVSAAARLFAARLC 186 +++ SK ++ + SI+ V S A + + Y +K A + L Sbjct: 124 SRSVSKYMMDRRSG-------SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 187 NEQIGVYEVRPGLIKTDM--TIPATAYYDELIAKGL-------VPWGRWGYPADIASTVR 237 I V PG +TDM ++ A E + KG +P + P+DIA V Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236 Query: 238 AMAEGKLIYTCGQAVAIDGGLSM 260 + G+ + + +DGG ++ Sbjct: 237 FLVSGQAGHITMHNLCVDGGATL 259
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.8 bits (62), Expect = 0.034 Identities = 6/19 (31%), Positives = 7/19 (36%), Gaps = 2/19 (10%) Query: 105 FNGDVQI--ELTGYWTWEQ 121 F G + L W EQ Sbjct: 62 FKGQEDLGNGLKAIWQVEQ 80
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 486 bits (1251), Expect = e-179 Identities = 255/262 (97%), Positives = 255/262 (97%) Query: 1 MAYRSAPLYEDVIWRTHLQPQDAGLAQAVRAMIAKHREHLLEFIRLDEPAPLNAMTLAQW 60 MAYRSAPLYEDVIWRTHLQPQD LAQAVRA IAKHREHLLEFIRLDEPAPLNAMTLAQW Sbjct: 1 MAYRSAPLYEDVIWRTHLQPQDPTLAQAVRATIAKHREHLLEFIRLDEPAPLNAMTLAQW 60 Query: 61 SSPNALSSLLAVYSDHIYRNQPTMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDV 120 SSPN LSSLLAVYSDHIYRNQP MIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDV Sbjct: 61 SSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDV 120 Query: 121 SPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQQRMETLISQALVPVVQALEATGEING 180 SPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQ RMETLISQALVPVVQALEATGEING Sbjct: 121 SPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQHRMETLISQALVPVVQALEATGEING 180 Query: 181 KLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240 KLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV Sbjct: 181 KLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240 Query: 241 RRTCCQRYRLPDVQQCGYCTLK 262 RRTCCQRYRLPDVQQCG CTLK Sbjct: 241 RRTCCQRYRLPDVQQCGDCTLK 262
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 54.6 bits (131), Expect = 4e-12 Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 1/80 (1%) Query: 62 DEATLFNIAVDPDYQRQGLGRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFN 121 A + +IAV DY+++G+G ALL I+ ++ L LE + N +A Y F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 122 EATIRRNYYPTTDG-REDAI 140 + Y E AI Sbjct: 148 IGAVDTMLYSNFPTANEIAI 167
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 214 bits (546), Expect = 4e-64 Identities = 107/460 (23%), Positives = 207/460 (45%), Gaps = 44/460 (9%) Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSNQHAKSDWMEMEKQRGISIT 71 K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSV----DKGTTRTDNTLLERQRGITIQ 57 Query: 72 TSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131 T + F + + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 132 LRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETY 191 P + F+NK+D++ D + +++ +L K + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159 Query: 192 LYQSGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVKGASNEFDKELFLAGEI 251 LY + E + + +DL ++ L + + F + Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN------DDLLEKYMSGKSLEALELEQEESIRFHNCSL 213 Query: 252 TPVFFGTALGNFGVDHMLNGLVEWAPAPMPRQTDTRTVEASEDKFTGFVFKIQANMDPKH 311 PV+ G+A N G+D+++ + + + + G VFKI+ K Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSS---------THRGQSELCGKVFKIE--YSEK- 261 Query: 312 RDRVAFMRVVSGKYEKGMKLRQVRTAKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHG 371 R R+A++R+ SG +R K + I++ T + G+ +++AY G+I+ L N Sbjct: 262 RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNEF 320 Query: 372 TIQIGDTFTQGEMMKFTGIPNFA-PELFRRIRLKDPLKQKQLLKGLVQLSEEG-AVQVFR 429 +++ +++ P L + P +++ LL L+++S+ ++ + Sbjct: 321 -LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379 Query: 430 PISNNDLIVGAVGVLQFDVVVARLKSEYNVEAVYESVNVA 469 + +++I+ +G +Q +V A L+ +Y+VE + V Sbjct: 380 DSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 29.5 bits (66), Expect = 0.022 Identities = 21/71 (29%), Positives = 33/71 (46%), Gaps = 2/71 (2%) Query: 123 QIECIDEIAKLAGTGEMVAEVTERAMRGELDFTASLRSRVATLK-GADANILQQVRENLP 181 Q+ E AK A V + TE A+ L L+ R A + GA+ + Q++RE Sbjct: 482 QLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEV-MSQRIREMSD 540 Query: 182 LMPGLTQLVLK 192 P + LV++ Sbjct: 541 NDPRVVALVIR 551
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 36.3 bits (84), Expect = 7e-05 Identities = 22/152 (14%), Positives = 54/152 (35%), Gaps = 35/152 (23%) Query: 71 GKFYPLHTGHIYLIQRACSQVDELHIIMGFDDTRDRALFEDSAMSQQPTVPDRLRWLLQT 130 G F P+ GH+ +I+R C D++++ A+ + +V +RL + + Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYV----------AVLRNPNKQPMFSVQERLEQIAKA 56 Query: 131 FKYQKNIRIHAFNEEGMEPYPHGWDVWSNGIKKFMAEKGI---------QPDLIYTSEEA 181 + N ++ D + + ++ D + A Sbjct: 57 IAHLPNAQV---------------DSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMA 101 Query: 182 DAPQYMEHLGIETVLVDPKRTFMSISGAQIRE 213 + + + +ETV + + +S + ++E Sbjct: 102 NTNKTLAS-DLETVFLTTSTEYSFLSSSLVKE 132
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 29.2 bits (65), Expect = 0.014 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%) Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189 P +V GIA G V T+ GL W ++ N D + +W Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.8 bits (215), Expect = 6e-22 Identities = 33/139 (23%), Positives = 60/139 (43%) Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFAVEVFERGLPVLDKARQQVPDVMILDVGLPDI 60 M T+ + +D+ I L L + G+ V + + D+++ DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120 + F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RVKKFSTPSPVIRIGHFEL 139 K+ + L Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139
>PF06580#Sensor histidine kinase Length = 349 Score = 32.9 bits (75), Expect = 0.003 Identities = 47/207 (22%), Positives = 80/207 (38%), Gaps = 51/207 (24%) Query: 298 LTQNARMQAL---------VETL--LRQARLENRQEVVLTAVDVAALFR---RVSEARTV 343 + Q A++ AL L +R LE+ + ++ L R R S AR V Sbjct: 157 MAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQV 216 Query: 344 QLAE--KNITLHVM--------PTEVNVAAEPALLDQALGNLL-----DNA----IDFTP 384 LA+ + ++ + PA++D + +L +N I P Sbjct: 217 SLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLP 276 Query: 385 ESGCITLSAEVDQEHVTLKVLDTGSGIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSE 444 + G I L D VTL+V +TGS N ++S+G GL V E Sbjct: 277 QGGKILLKGTKDNGTVTLEVENTGSLALK----------------NTKESTGTGLQNVRE 320 Query: 445 -VARLFNGEVTLR-NVQEGGVLASLRL 469 + L+ E ++ + ++G V A + + Sbjct: 321 RLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 4e-20 Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%) Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60 M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119 N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 120 RT 121 Sbjct: 121 EP 122
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 29.8 bits (67), Expect = 0.024 Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%) Query: 245 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 304 W+ L L+ G +A +LR R+ + + P++ G+I Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272 Query: 305 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDCLRQHPQQHISINLE 363 AR+ +T + S +PL Q +S + ++ R D +R+ H + LE Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328 Query: 364 STVLTSEKIPQLLREMI 380 T L P ++R MI Sbjct: 329 QTAL----FPPMMRHMI 341
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 63.0 bits (153), Expect = 2e-13 Identities = 61/285 (21%), Positives = 102/285 (35%), Gaps = 35/285 (12%) Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99 H P RIV+ LLA+ VAD + R W E L Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75 Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151 I EP+ E + P ++ SA G S + L+ IAP N+ D Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131 Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209 + LT++ ++ + A +AQ++ + + K + + ++ Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191 Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269 P S ++L++ G NA Q + + + LAA + + L + Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243 Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQVLERL 314 KD DA+ A PL +P V+ + + F SAM + L Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVL 288
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 444 bits (1142), Expect = e-161 Identities = 146/299 (48%), Positives = 195/299 (65%), Gaps = 18/299 (6%) Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60 MAIP +Q Y +P + D+PQNKV W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120 L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223 FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281 LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 353 bits (907), Expect = e-126 Identities = 106/258 (41%), Positives = 147/258 (56%), Gaps = 20/258 (7%) Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49 GK ++TGA +GIG A A GA + D +A E +P Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 50 VMDVADAGQVAQVCQRLLAETERLDVLINAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109 DV D+ + ++ R+ E +D+L+N AG+LR G LS E+W+ TF+VN G FN Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTARIGMSAYGASKAALKSLALSVGLELAGSGVRC 169 + +R G+IVTV S+ A R M+AY +SKAA +GLELA +RC Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 170 NVVSLGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229 N+VS GST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 230 ASHITLQDIVVDGGSTLG 247 A HIT+ ++ VDGG+TLG Sbjct: 243 AGHITMHNLCVDGGATLG 260
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 47.2 bits (112), Expect = 3e-08 Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340 P+ H D+ + I L +D+ Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.012 Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%) Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352 PR E + +LG P + + + K HV + Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589 Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378 + G F L G G GKST + GL Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.3 bits (65), Expect = 0.047 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 62.5 bits (152), Expect = 6e-13 Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%) Query: 83 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 142 Q + + +A+ +LA E + + + + L + Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260 Query: 143 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 197 N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+ Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320 Query: 198 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 255 E Q S + AP + V G V+ T+ + + V A V +++ Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380 Query: 256 QPGRKVLLYTDGRPDKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 309 G+ ++ + P Y G++ ++ A D R LV+ + I + Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432 Query: 310 ----DADDALRQGMPVTVQ 324 + + L GM VT + Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 72.7 bits (178), Expect = 7e-18 Identities = 33/214 (15%), Positives = 77/214 (35%), Gaps = 17/214 (7%) Query: 9 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 67 + ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 68 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSKFISREQL 127 IGE E + P + +RE+++ + + + + + F E + Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEII-FHKCEFV 120 Query: 128 SPTAAYHLVHEQVISPLHSHLTRLIAAWTGCDANDTRMILHTHALIGEILAFRLGKETIL 187 A + + + + + +A L T + + G Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKH--CIEAKMLPADLMTRRAAIIMRGYISG----- 173 Query: 188 LRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 221 L W + + + ++ ++L+ Sbjct: 174 LMENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205
>SECA#SecA protein signature. Length = 901 Score = 30.6 bits (69), Expect = 0.014 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSVAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 38.7 bits (90), Expect = 3e-05 Identities = 28/155 (18%), Positives = 61/155 (39%), Gaps = 5/155 (3%) Query: 48 QAGIDWVPTSMTAYLAGGMFLQWLLGPLSDRIGRRPVMLAGVVWFIVTCLAILLAQNIEQ 107 A +WV T+ + G + G LSD++G + ++L G++ + + + Sbjct: 48 PASTNWVNTAFMLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104 Query: 108 FTLL-RFLQGISFCFIGAVGYAAIQESFEEAVCIKITALMANVALIAPLLGPLVGAAWIH 166 ++ RF+QG A+ + + K L+ ++ + +GP +G H Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164 Query: 167 VLPWEGMFVLFAALAAISFFGLQRAMPETAMRIGE 201 + W +L + I+ L + + + G Sbjct: 165 YIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGH 198
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.1 bits (75), Expect = 0.002 Identities = 24/61 (39%), Positives = 29/61 (47%), Gaps = 1/61 (1%) Query: 302 DSAWVAGVSVVLWGLGASLGFPLTISAASDTGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361 D +W+AG +VVLW SL LT DT PD R ++A YL F P L Sbjct: 270 DWSWLAGCTVVLWPDCDSLREKLTRQELKDT-PDPLAREKLLAAKPYLPFDKQPGQKAML 328 Query: 362 G 362 G Sbjct: 329 G 329
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 49.6 bits (118), Expect = 6e-10 Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFSGIDELLLEAFS 61 + + R+ I+ L G+ + + +IA AGV G++ ++F +L E + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW- 63 Query: 62 SFTEIMSRQYQAFFSDVSDAQGA 84 E+ + Sbjct: 64 ---ELSESNIGELELEYQAKFPG 83
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.2 bits (68), Expect = 0.020 Identities = 20/106 (18%), Positives = 34/106 (32%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSTFSFGMGNAAGLLFAGIML-GFMRANHPTFG-YIPQ--GALSMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 IVFMAGVGLSAGSGINNGLGAIGGQM--LIAGLIVSLVPVVICFLF 493 + G G+ AG + +G A + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.010 Identities = 9/18 (50%), Positives = 12/18 (66%) Query: 31 LVLLGPSGAGKSSLLRVL 48 +VL G G GKS+L+ L Sbjct: 599 VVLEGTGGIGKSTLINTL 616
>FLAGELLIN#Flagellin signature. Length = 507 Score = 45.0 bits (106), Expect = 2e-07 Identities = 40/226 (17%), Positives = 79/226 (34%), Gaps = 9/226 (3%) Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66 ++ Q N+ +S + + E++S+G R+ + DD + A + +Q + Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67 Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126 E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127 Query: 127 TTDGNGRYIFAGYKTETAPFSEANGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186 T NG + + +G E+I + +G G + + Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181 Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKEIAAAALDKT 232 + T A + + A DK Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 68.2 bits (166), Expect = 2e-13 Identities = 49/261 (18%), Positives = 87/261 (33%), Gaps = 26/261 (9%) Query: 551 VAPAPKAATATPAAPAQPGLLSRFFGALKALFSGGEEAKPTEQP-TPKAEAKPERQQDRR 609 T P + S E A+ E P P A A P Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSET---- 1036 Query: 610 KPRQSNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663 + N ++++++ D E +NR A++ + + + Q EV T Sbjct: 1037 ----TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092 Query: 664 EKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723 ++ +TT+ ++ E+ + + + Q+ K + + QE + + + R Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151 Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783 +N K Q+ P E ++ E V E+ T V P A Q Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211 Query: 784 NNADNRDNGGMPRRSRRSPRH 804 N+ + RRS RS H Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232 Score = 61.2 bits (148), Expect = 2e-11 Identities = 48/288 (16%), Positives = 88/288 (30%), Gaps = 36/288 (12%) Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAATVVAPAPKAATATPAAPAQPGLL 571 P E+ + DVP P+ E A AP P A ATP+ + Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038 Query: 572 SRFFGALKALFSGGEEAKPTEQPTPKAEAKPERQQDRRKPRQSNRRDRNERRDTRSER-- 629 A E +K + K E Q+ + + + ++ Sbjct: 1039 ------TVA-----ENSKQESKTVEKNEQDATE-----TTAQNREVAKEAKSNVKANTQT 1082 Query: 630 TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQ 689 E + + E + + ++TA + + TEK + + + + + + Q Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142 Query: 690 AQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAPV 744 A+ + +N++E Q + +P + + Q V +V V P Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENPE 1200 Query: 745 VEETAAAEPIVQEAPA------PRTELVKVPLPVVAQTAPEQQEENNA 786 A +P V + R + VP V T A Sbjct: 1201 
NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 140 bits (355), Expect = 2e-38 Identities = 60/206 (29%), Positives = 100/206 (48%), Gaps = 1/206 (0%) Query: 258 GKAFYYQPVLCTVQAKSTLTAEEEQDRLRQAIDFTLLDLMTLTAKAEASGLDDIAAIFSG 317 KAF + ++ S E ++L A++ + +L + + EAS D A IF+ Sbjct: 17 AKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAA 76 Query: 318 HHTLLGDPELLAAASELLQHEHCTAEYAWQQVLKELSQQYQQLDDEYLQARYIDVDDLLH 377 H +L DPEL+ +++E AEYA ++V ++ +D+EY++ R D+ D+ Sbjct: 77 HLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAADIRDVSK 136 Query: 378 RTLVHLT-QTKEELPQFNSPTILLAENIYPSTVLQLDPAVVKGICLSAGSPVSHSALIAR 436 R L HL L T+++AE++ PS QL+ VKG G SHSA+++R Sbjct: 137 RVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSR 196 Query: 437 ELGIGWICQQGEKLYAIQPEETLTLD 462 L I + E IQ + + +D Sbjct: 197 SLEIPAVVGTKEVTEKIQHGDMVIVD 222
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 28.5 bits (63), Expect = 0.020 Identities = 10/47 (21%), Positives = 26/47 (55%) Query: 138 VESLRQSSEQNLSVPVALEAASSIAESAAQSTITMQARKGRASYLGE 184 E++ + ++N + +EA ++A +A + + A+ G+A+ G+ Sbjct: 293 REAVDRWIQENPNAAETVEAVFNVAAAAKVAKLAKAAKPGKAAVSGD 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 244 bits (625), Expect = 7e-76 Identities = 91/363 (25%), Positives = 155/363 (42%), Gaps = 33/363 (9%) Query: 308 QMRQLMTSQLGKVSHTFAHMPQDDPQTRRLIHFGRQAARSSFPVLLCGEEGVGKALLSQA 367 + S+L S + + + + ++ +++ GE G GK L+++A Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 368 IHNESERAAGPYIAVNCELYGDAALAEEFIG---GDRTDNENGRLSRLELAHGGTLFLEK 424 +H+ +R GP++A+N + E G G T + R E A GGTLFL++ Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 425 IEYLAVELQSALLQVIKQGVITRLDARRLIPIDVKVIATTTADLAMLVEQNRFSRQLYYA 484 I + ++ Q+ LL+V++QG T + R I DV+++A T DL + Q F LYY Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 485 LHAFEITIPPLRMRRGSIPALVNNKLRSLEKRFSTRLKIDDDALARLVSCAWPGNDFELY 544 L+ + +PPLR R IP LV + ++ EK + D +AL + + WPGN EL Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359 Query: 545 SVIENLALSSDNGRIRVSDLPEHLFTEQATDDVSATRLSTS------------------- 585 +++ L I + L +E + + Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419 Query: 586 -----------LSFAEVEKEAIINAAQVTGGRIQEMSALLGIGRTTLWRKMKQHGIDAGQ 634 AE+E I+ A T G + + LLG+ R TL +K+++ G+ + Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYR 479 Query: 635 FKR 637 R Sbjct: 480 SSR 482
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 212 bits (540), Expect = 3e-59 Identities = 247/980 (25%), Positives = 402/980 (41%), Gaps = 117/980 (11%) Query: 14 RLAELKIRSPSIQLIKFGAIGLNAIIFSPLLIAADTGSQYGTNITINDGDRI---TGDTA 70 + A L+ + ++ L GA ++ I Q+G +I +D + +G T Sbjct: 10 KAAPLRRTTLAMALGALGAAPAAHADWNNQSIVKTGERQHGIHIQGSDPGGVRTASGTTI 69 Query: 71 DPSGN-LYGVMTPAGNTPGNINLGNDVTVN---VNDASGYAKGIIIQGKNSSLTANRLTV 126 SG G++ N + N + ++D + K L A+ T+ Sbjct: 70 KVSGRQAQGILLE--NPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATL 127 Query: 127 DVVGQT---SAIGINLIGDYTHADLGTGSTIKSNDDGIIIGHSSTLTATQFTIENSNGIG 183 VG T I + + G+ A + ST++ G+ I + +T + I + G+ Sbjct: 128 ANVGDTWDDDGIALYVAGEQAQASIAD-STLQGAG-GVQIERGANVTVQRSAIVD-GGLH 184 Query: 184 LTINDYGTSVDLGSGSKIKTDGS-TGVYIGGLNGNNANGAARFTATDLTID---VQGYSA 239 + DL + D + T V G + A++LT+D + G A Sbjct: 185 IGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPA----AVSVLGASELTLDGGHITGGRA 240 Query: 240 MGINVQKNSVVDLGTNSTIKTNGDNAHGLWSFGQVSANAL-------TVDVTGAAANGVE 292 G+ + +VV L +TI+ A G G V A+ GV+ Sbjct: 241 AGVAAMQGAVVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVD 299 Query: 293 VRGGTTTIGADSHISSAQGGGLVTSSSDATINFSG---TAAQRNSIFSGGSYGASAQTAT 349 V G + + A S + + + G + A + SG +A N I +GG+ + Q A Sbjct: 300 VSGSSVEL-AQSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAP 358 Query: 350 AVINMQNTDITVDRNGSLALGLWALSGGRITGDSLAITGAAGARGIYAMTNSQIDLTSDL 409 I +Q G+ A G L L +TG A A+G T + + Sbjct: 359 LSITLQA--------GAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVATELPSIPGTSI 410 Query: 410 VIDMSTPDQMAIATQHDDGYAASRINASGRMLINGSVLSKGGLINLDMHPGSVWTGSSLS 469 P +A+A+ + WTG++ Sbjct: 411 -----GPLDVALAS------------------------------------QARWTGAT-- 427 Query: 470 DNVNGGKLDVAMNNSVWNVTSNSNLDTLAL-SHSTVDFASHGSTAGTFTTLNVENLSGNS 528 V+ +D N+ W +T NSN+ L L S +VDF + AG F L V L+G+ Sbjct: 428 RAVDSLSID----NATWVMTDNSNVGALRLASDGSVDFQQ-PAEAGRFKVLTVNTLAGSG 482 Query: 529 TFIMRADVVGEGNGVNNRGDLLNISGSSAGNHVLAIRNQGSEATTGNEVLTVVKTTDGAA 588 F M D L + ++G H L +RN GSE + N +L V AA Sbjct: 483 
LFRMNV------FADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAA 536 Query: 589 SFSASS---QVELGGYLYDVRKNG-TNWELYASGTVPEPTPNPEPTPAPAQPPIVNPD-P 643 +F+ ++ +V++G Y Y + NG W L + P P P P+P P P QPP P+ P Sbjct: 537 TFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAP 596 Query: 644 TPEPAPTPKPTTTADAGGNYLNVGYL--LNYVENRTLMQRMGDLRNQSKDGNIWLRSYG- 700 P+P + + A+A N VG L Y E+ L +R+G+LR G W R + Sbjct: 597 APQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQ 656 Query: 701 -GSLDSFASGKLSGFDMGYSGIQFGGDKRLSDVM-PLYVGLYIDSTHASPDYSG-GDGTA 757 LD+ A + FD +G + G D ++ ++G T ++G G G Sbjct: 657 RQQLDNRAGRR---FDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHT 713 Query: 758 RSDYMGMYASYMAQNGFYSDLVIKASRQKNSFHVLDSQNNGVNANGTANGMSISLEAGQR 817 S ++G YA+Y+A +GFY D ++ASR +N F V S V +G+ SLEAG+R Sbjct: 714 DSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRR 773 Query: 818 FNLSPTGYGFYIEPQTQLTYSHQNEMAMKASNGLNIHLNHYESLLGRASMILGYDIT-AG 876 F + G+++EPQ +L A +A+NGL + S+LGR + +G I AG Sbjct: 774 FTHAD---GWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAG 830 Query: 877 NSQLNVYVKTGAIREFSGDTEYLLNDSREKYSFKGNGWNNGVGVSAQYNKQHTFYLEADY 936 Q+ Y+K ++EF G N + +G G+G++A + H+ Y +Y Sbjct: 831 GRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEY 890 Query: 937 TQGNLFDQK-QVNGGYRFSF 955 ++G + GYR+S+ Sbjct: 891 SKGPKLAMPWTFHAGYRYSW 910
>INTIMIN#Intimin signature. Length = 939 Score = 255 bits (652), Expect = 4e-78 Identities = 120/378 (31%), Positives = 197/378 (52%), Gaps = 21/378 (5%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+ Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237 Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLQDENLQRAGFGAEAWG 198 ++ L + Q+G D+ +N+G GQR+ ++GYN F D + R G G E W Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297 Query: 199 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 +Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+GD Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P + Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376 Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L + Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476 Query: 377 RSRYGIRQLIWQGDTQILS-----LTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVV 431 +S+YG+ +++W D+ + S G+Q SA+ + I+P + +G SN ++++ Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531 Query: 432 EDNQGQRVSSNEITLTLV 449 D G SSN + LT+ Sbjct: 532 YDRNGN--SSNNVLLTIT 547
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.7 bits (181), Expect = 2e-17 Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 53.3 bits (128), Expect = 1e-09 Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%) Query: 424 PESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483 P +RE+L+ + + S + +LT +++ + S +F ++ + Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243 Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534 Q+ P + VP L+Q E N +KH Q ++++ +++ V L V Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585 ++ G +N S G+ +R+R Q L G + +++ E G + IP Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 32.5 bits (74), Expect = 0.004 Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%) Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314 I+S + L+ + I A A L K + + FFG F S ++ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370 LG T L+ + L+ +LFL LP+ G F+ L +G+T Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583 Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATGTAAALGFIS 411 + + ++T +K E + E + A + F+S Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 4e-24 Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG V++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 32.7 bits (74), Expect = 0.003 Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 2/72 (2%) Query: 214 NGMEVSVAAQNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273 N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++ Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293 Query: 274 AIKDWVNAYNSL 285 +KD VNA L Sbjct: 294 MLKDQVNALKGL 305
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.017 Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%) Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218 R L R + + + A L + P R R M ++L + Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 6e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 509 bits (1312), Expect = 0.0 Identities = 239/388 (61%), Positives = 282/388 (72%), Gaps = 33/388 (8%) Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYVR 60 MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY+R Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58 Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120 +GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117 Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180 V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+ Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177 Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNDQVIYGN 223 D+ NGDGFG STTY+ GF GA Y SDRTN+QV G Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237 Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277 A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295 Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLIEYIDVGATYYFNKNMSTFVDYKIN 333 AQYQFDFGLRP+V++L SKGKDL D+DL++Y DVGATYYFNKN ST+VDYKIN Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 334 LIDKSD-FTKASGVATDDIVAVGLVYQF 360 L+D D F K +G++TDDIVA+G+VYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 114 bits (286), Expect = 8e-37 Identities = 101/103 (98%), Positives = 101/103 (98%) Query: 2 SAIQGIEGVISQLQATAMSARAQESLSQPTISFAGQLHAALDRISDTQTVARTQAEKFTL 61 SAIQGIEGVISQLQATAMSARAQESL QPTISFAGQLHAALDRISDTQT ARTQAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 375 bits (965), Expect = e-131 Identities = 231/259 (89%), Positives = 246/259 (94%) Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVTGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62 +TA Q K LEWLNRLRANP+IPLIV GSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64 Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELHLRLAQQGLPKGGAVGFELLDQEKFGISQ 122 IV+QLTQMNIPYRF+ SGAIEVPADKVHEL LRLAQQGLPKGGAVGFELLDQEKFGISQ Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124 Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182 FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184 Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242 LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244 Query: 243 QQRIEAILSPIVGNGNIHA 261 Q+RIEAILSPIVGNGN+HA Sbjct: 245 QRRIEAILSPIVGNGNVHA 263
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 246 bits (629), Expect = 1e-80 Identities = 159/199 (79%), Positives = 174/199 (87%), Gaps = 5/199 (2%) Query: 1 MDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSGSGYPGGVPGALSNQPAPANNAPIS 60 +DFA+KEQTEE Y PNGD S A LRSRQLN SEQ G+GYPGGVPGALSNQPAP N API+ Sbjct: 269 LDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPAPPNEAPIA 328 Query: 61 TPPTNQNNRQQ--QASTTSNS---GPRSTQRNETSNYEVDRTIRHTKMNVGDVQRLSVAV 115 TPPTNQ N Q Q ST++NS GPRSTQRNETSNYEVDRTIRHTKMNVGD++RLSVAV Sbjct: 329 TPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKMNVGDIERLSVAV 388 Query: 116 VVNYKTLPDGKPLPLSNEQMKQIEALTREAMGFSEKRGDSLNVVNSPFNSSDESGGALPF 175 VVNYKTL DGKPLPL+ +QMKQIE LTREAMGFS+KRGD+LNVVNSPF++ D +GG LPF Sbjct: 389 VVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPFSAVDNTGGELPF 448 Query: 176 WQQQVFIDQLLAAGRWLLV 194 WQQQ FIDQLLAAGRWLLV Sbjct: 449 WQQQSFIDQLLAAGRWLLV 467
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 137 bits (347), Expect = 1e-42 Identities = 45/140 (32%), Positives = 82/140 (58%), Gaps = 1/140 (0%) Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60 +S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120 + + DY R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRA 140 I+ EHPQ IA IL +L Sbjct: 131 FIQQEHPQTIALILSYLDPQ 150
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 200 bits (511), Expect = 2e-66 Identities = 69/182 (37%), Positives = 109/182 (59%), Gaps = 1/182 (0%) Query: 1 MFDERLRHDVMLRIATFGGVQPAALAELTEVLNGLLDGQ-NLKRSKMGGVRTAAEIINLM 59 ++ +V RIA P + E+ VL L + + GGV EIIN+ Sbjct: 158 SLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMA 217 Query: 60 KTQQEEAVITAVREFDGELAQKIIDEMFLFENLVDVDDRSIQRLLQEVDSESLLIALKGA 119 + E+ +I ++ E D ELA++I +MF+FE++V +DDRSIQR+L+E+D + L ALK Sbjct: 218 DRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDRSIQRVLREIDGQELAKALKSV 277 Query: 120 EQPLREKFLRNMSQRAADILRDDLANRGPVRLSQVENEQKAILLIVRRLAETGEMVIGSG 179 + P++EK +NMS+RAA +L++D+ GP R VE Q+ I+ ++R+L E GE+VI G Sbjct: 278 DIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQQKIVSLIRKLEEQGEIVISRG 337 Query: 180 ED 181 + Sbjct: 338 GE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 373 bits (958), Expect = e-135 Identities = 223/228 (97%), Positives = 226/228 (99%) Query: 1 MSDNLPWKTWTPDDLAPPPAEFVPMVESEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 MSDNLPWKTWTPDDLAPP AEFVP+VE EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKAQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 AEGRQQGH+QGYQEGLAQGLEQGLAEAK+QQAPIHARMQQLVSEFQTTLDALDSVIASRL Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 152 bits (384), Expect = 5e-51 Identities = 112/113 (99%), Positives = 113/113 (100%) Query: 2 AEEQLKMLIDYQNEYRNNLNSDMSAGMTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKV 61 AEEQLKMLIDYQNEYRNNLNSDMSAG+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKV Sbjct: 35 AEEQLKMLIDYQNEYRNNLNSDMSAGITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKV 94 Query: 62 DIALNSWREKKQRLQAWQTLQERQSTAALLAENRLDQKKMDEFAQRAAMRKPE 114 DIALNSWREKKQRLQAWQTLQERQSTAALLAENRLDQKKMDEFAQRAAMRKPE Sbjct: 95 DIALNSWREKKQRLQAWQTLQERQSTAALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 468 bits (1205), Expect = e-168 Identities = 363/375 (96%), Positives = 368/375 (98%) Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLTLLSEALAGETTTDKAAPQLLVATDKPTTK 60 MIRLAPLITADVDTTTLPGGKASDAAQDFL LLSEALAGETTTDKAAPQLLVATDKPTTK Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60 Query: 61 GEPLVSDILADAQQADLLIPVDETLPVINDEQSTSTPLTTAQTMTLAAVADKNTTKDEKA 120 GEPL+SDI++DAQQA+LLIPVDET PVINDEQSTSTPLTTAQTM LAAVADKNTTKDEKA Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120 Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180 Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTADASPLITPHQTQPLPTVAAPVLSVPLGSHEW 240 GTPAQPLTPLVAEAQSKAEVISTPSPVTA ASPLITPHQTQPLPTVAAPVLS PLGSHEW Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240 Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMISPHQHVRAALEAA 300 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQM+SPHQHVRAALEAA Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300 Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 Query: 361 LQGRVTGNSGVDIFA 375 LQGRVTGNSGVDIFA Sbjct: 361 LQGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 379 bits (975), Expect = e-134 Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S + E +S I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVEFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297 + + L ++ +++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTLNGQYALRIEHLI 321 Q G + + A +I I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 210 bits (537), Expect = 6e-74 Identities = 125/137 (91%), Positives = 133/137 (97%) Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSEKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60 MSDMNNP+D+N GA+DDLWA+AL+EQK+T+ KSAADAVFQQ GGGDVSG +QDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 333 bits (855), Expect = e-119 Identities = 243/245 (99%), Positives = 244/245 (99%) Query: 1 MRRLFSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 MRRL SVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVINKIYVDAYQPFSEEK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVI+KIYVDAYQPFSEEK Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.1 bits (164), Expect = 1e-18 Identities = 22/78 (28%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63 + ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 203 bits (517), Expect = 4e-67 Identities = 254/261 (97%), Positives = 257/261 (98%) Query: 1 MMQETSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 M+Q TS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPGSHL 120 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP SHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGSEPLNSNAFLAPTKAGSLIF 180 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIG EPLNSNAFLA TKAGSLIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEIFNLLADIISELPLI 261 EHLFSEIFNLLADIISELPLI Sbjct: 241 EHLFSEIFNLLADIISELPLI 261
>INTIMIN#Intimin signature. Length = 939 Score = 28.1 bits (62), Expect = 0.015 Identities = 20/92 (21%), Positives = 31/92 (33%) Query: 38 GTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKSTY 97 + AITY K K K S ++ F + KT AK + KS Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLV 732 Query: 98 TDTYAQENVTIDMEKVDFKALQGISGINVSAE 129 + + V + +V+F I N+ Sbjct: 733 SARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.8 bits (176), Expect = 1e-16 Identities = 41/178 (23%), Positives = 76/178 (42%), Gaps = 14/178 (7%) Query: 2 IKVLIVDDEPLARENL-RIFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRI 60 +L+ DD+ R L + + D+ I NA + D++ D+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITS---NAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIDEARLEKTLARLR 116 + +++ + + RP + + ++A + AIKA E+ A+DYL KP D L + R Sbjct: 61 NAFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 117 QERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVT--SHEGKE 172 E ++ L ++Q + + G S + +A + + +T S GKE Sbjct: 120 AEPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174
>PF06580#Sensor histidine kinase Length = 349 Score = 220 bits (562), Expect = 4e-69 Identities = 63/216 (29%), Positives = 115/216 (53%), Gaps = 3/216 (1%) Query: 343 LGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402 L G + + + +M ++++ L AQ+NPHF+FNALN I+A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 403 SQLVQYLSTFFRKNLKR-PSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQ 461 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ I + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 462 QLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGL-YQPVTNASGL 520 Q+P +Q +VEN IKHG +QL G++ + ++ + LE+E+ L + ++G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 521 GMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLP 556 G+ V +RL+ +G + I ++ + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 31.4 bits (71), Expect = 0.001 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%) Query: 27 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 79 T EHL F+ HL + ++I G TGFY++ + S + D A++ Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113 Query: 80 AGESKI 85 ++KI Sbjct: 114 ENQNKI 119
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 119 bits (300), Expect = 3e-31 Identities = 98/408 (24%), Positives = 168/408 (41%), Gaps = 25/408 (6%) Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78 + I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+ Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137 ++G RL L + S++ + + +LI R +QG A L ++ R P Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197 E R A L V + GP +GG I W +L+ +PM I+ L L +E Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192 Query: 198 TETSPVKMNLPRLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257 K + ++++ VG + ML F +S I +VSV+S + V Sbjct: 193 VRI---KGHFDIKGIILMSVGIVFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241 Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQKTMGYNAIWAGLAYAPI 317 +P +D L K+ F IG++ + +G + ++P +++ + G Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 318 GIMPLLISPLIG-----RYGNKIDMRVLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQ 372 G M ++I IG R G + + VTF +V + S T F II+ Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL----SVSFLTASFLLETTSWFMTIIIVF 357 Query: 373 FFQGFAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420 G + ++TI S L + S+ NF LS G ++ Sbjct: 358 VLGGLSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 77.2 bits (190), Expect = 1e-17 Identities = 63/419 (15%), Positives = 125/419 (29%), Gaps = 96/419 (22%) Query: 8 KKQSNRKKYFSLLVIVLFIAFSGAYAYWSMELEDMISTDDAYVT-GNADPISAQVSGSVT 66 + +R+ I+ F+ + + ++E + + + G + I + V Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVK 108 Query: 67 VVNHKDTNYVRQGDILVSLDKTDATIALNKA----------------------------- 97 + K+ VR+GD+L+ L A K Sbjct: 109 EIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPEL 168 Query: 98 -----------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQ 131 K + Q + L + AE + + Y+ Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYEN 228 Query: 132 SLEDYNRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKAN 177 R+ L + I+K + S + + I + K Sbjct: 229 LSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE 288 Query: 178 KALVMN-------TPLNR-QPQVVEAADATKEAWLVLKRTDIRSPVTGYIAQRSVQ-VGE 228 LV L + + + + + IR+PV+ + Q V G Sbjct: 289 YQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGG 348 Query: 229 TVSSGQSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINM 287 V++ ++LM +VP + V A + + + +GQ+ I + F G + Sbjct: 349 VVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLV 402 Query: 288 GTGNAFSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDTKD 342 G + + +V V +S++ L PL G+++TA I T Sbjct: 403 GK---VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.1 bits (117), Expect = 3e-09 Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%) Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63 ++ DD + L + ++ + + + + D+V+ DV +P N + Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123 L ++K + ++++SA+N + AI+A++ G + Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101 Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148 PF L + + L ++ Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 2e-17 Identities = 30/105 (28%), Positives = 51/105 (48%) Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019 +IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064 L ++++ LP+ ++A K G L KP L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 25.1 bits (54), Expect = 0.038 Identities = 10/27 (37%), Positives = 13/27 (48%) Query: 40 RAMAAELALWARGRHTQFDPTPPPPPV 66 R +A E + R P PPPPP+ Sbjct: 348 RTLAYEGDGYRRAPVNNMMPPPPPPPM 374
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.016 Identities = 14/88 (15%), Positives = 27/88 (30%), Gaps = 5/88 (5%) Query: 17 PEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMST-EIARLKR 75 PE + + + R SL + Q W QNQ+ E L+ E + Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW----QNQKYQKELNLDKKRAERLTVLA 221 Query: 76 QLAERDEELAILPKGRDILREAPEMKYV 103 ++ + + D + + Sbjct: 222 RINRYENLSRVEKSRLDDFSSLLHKQAI 249
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 27.1 bits (60), Expect = 0.024 Identities = 14/66 (21%), Positives = 27/66 (40%), Gaps = 1/66 (1%) Query: 75 IGGGDVKLLTVLSLAIDEHELANFLVAMTFCGALVVLAGLLFFRKSIRENGVPYAVPISL 134 +G GD KLL L + L L+ + GA + + +L +P+ +++ Sbjct: 211 MGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNHHQS-KPIPFGPYLAI 269 Query: 135 AFLLTY 140 A + Sbjct: 270 AGWIAL 275
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 28.9 bits (64), Expect = 0.022 Identities = 41/160 (25%), Positives = 67/160 (41%), Gaps = 21/160 (13%) Query: 208 VKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTS 267 V+++I+GN+ P C IN G I V+FG IN + V +I+ C S Sbjct: 21 VQINIRGNVYIP-PCTINNGQNIVVDFGNINPEHVDNSRG------EVTKNISISCPYKS 73 Query: 268 KIKNSLQMRIDGTTGVVDQYNLVARRRSSDNVPDVGIRIENLGGGVANIPFQNG------ 321 SL +++ G T V Q N++A N+ GI + G + NG Sbjct: 74 ---GSLWIKVTGNTMGVGQNNVLA-----TNITHFGIALYQGKGMSTPLTLGNGSGNGYR 125 Query: 322 ILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVM 361 + + T + P G L G F+ TA+++++ Sbjct: 126 VTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMI 165
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.029 Identities = 41/266 (15%), Positives = 83/266 (31%), Gaps = 15/266 (5%) Query: 277 QQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTADSVASPAQASVSDLTGDQPAAQPV 336 Q G E + T+ E + Q V S QP A+P Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146 Query: 337 PVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELL 396 + P + + N +A+ + + S + V + +++N Sbjct: 1147 RENDPTVNIKEPQSQTNTTAD--TEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204 Query: 397 KSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPR---SA 453 + P + +R ++ + E A D+ A L + Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVE----PATTSSNDRSTVALCDLTSTNTNAV 1260 Query: 454 LGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMT 513 L D A A ++ L G + Q S+L G + ++ + + ++ Sbjct: 1261 LSDARAKA---QFVALNVGKAVSQHI---SQLEMNNEGQYNVWVSNTSMNKNYSSSQYRR 1314 Query: 514 TNNPTLQTTPTDDQFTNNGGRVDAVY 539 ++ + QT DQ +N ++ V+ Sbjct: 1315 FSSKSTQTQLGWDQTISNNVQLGGVF 1340
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 27.6 bits (61), Expect = 0.036 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.014 Identities = 8/22 (36%), Positives = 13/22 (59%) Query: 4 VLITGATGLVGGHLLRMLINEP 25 L+TGA G +G H+ + L+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 53.3 bits (128), Expect = 5e-10 Identities = 28/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%) Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61 SR V I+ F+ I + + S I P + ++ ++ Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110 Query: 62 NVHDNQLVKKGQILFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114 V + + V+KG +L + Q +L +A+ + YQ+L++ E + L Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168 Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154 + + VL + Q + Q + +L+L++ Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211 Score = 51.4 bits (123), Expect = 2e-09 Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%) Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150 E R + ++ + ++EE + + +L +L + + Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323 Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208 + +VIRAP V L V+T G +T T + +V ++ V A ++ + + G Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383 Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231 A I P L G V ++ Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 142 bits (360), Expect = 8e-45 Identities = 65/142 (45%), Positives = 84/142 (59%), Gaps = 2/142 (1%) Query: 4 TLPFLILYACLSALLFFWDAKHGLLPDRFTCPLLWSGLLFYQVCHPDGLADALWGAIVGY 63 TL L+L L AL F D LLPD+ T PLLW GLLF + L DA+ GA+ GY Sbjct: 134 TLAALLLTWVLVALTFI-DLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGY 192 Query: 64 GTFAVIYWGYRILRHKEGLGYGDVKFLAALGAWHSWAFLPRLVFLAASFACGAVVIGLLM 123 +YW +++L KEG+GYGD K LAALGAW W LP +V L +S + IGL++ Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALP-IVLLLSSLVGAFMGIGLIL 251 Query: 124 RGKESLKNPLPFGPFLAAAGFV 145 P+PFGP+LA AG++ Sbjct: 252 LRNHHQSKPIPFGPYLAIAGWI 273
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 38.3 bits (89), Expect = 3e-06 Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLQSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+Q+ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKILELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 611 bits (1578), Expect = 0.0 Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRVGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 + R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E +K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.022 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 219 SK 220 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 133 bits (337), Expect = 2e-40 Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 9/226 (3%) Query: 28 AAKPATTADSKASFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87 A A A S D K +Y++GA LG K + GI ++ D L G+QD Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66 Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146 + + L++++++ L F+ + + A+ K A +N+AKG + + G+ + Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126 Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206 GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186 Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251 + + G ++ +P +LAYG V G I PN TL+F + L+ VK A Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 30.3 bits (68), Expect = 0.021 Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%) Query: 230 TAIDPFKGLLLG---LFFISVGMSLNLGVLYTHL-LWVVISVVVLVAVKILVLYLLARLY 285 A+ P L + L+FIS + L +++ + W +++ V+ ++ L Sbjct: 318 AAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA-- 375 Query: 286 GVRSSERMQ 294 S +M+ Sbjct: 376 QYTSMAKMR 384
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 31.9 bits (72), Expect = 0.001 Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 16/135 (11%) Query: 12 YAHPESQDSVANRVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 69 Y P + D N+V P + + +HD+ ++ D F L + + Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 70 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRNVITTGEPESA------Y 119 P+ + P DR L F GPG N +G Y +IT PE + Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124 Query: 120 RYDALNRYPMSDVLR 134 RY A R + +++R Sbjct: 125 RYSAFKRTNLLEMMR 139
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.7 bits (74), Expect = 0.005 Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%) Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563 + D + ++ E + + ++ R+ +R R + L E Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331 Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602 +LE++ + +L A+ + EE+ SE QS + +L A Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391 Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634 + + + LEE L A E+L + L E Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422 Score = 32.0 bits (72), Expect = 0.008 Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%) Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572 + + ++ + E A A + D ++ + +++ Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179 Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632 L A+ A E + + E + TA + + ++ + ++ LE + Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239 Query: 633 EGQSN 637 ++ Sbjct: 240 FSTAD 244
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 31.8 bits (72), Expect = 0.002 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116 P+ + + E ++ KG SRK++ ++ + + GTF Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.2 bits (86), Expect = 1e-05 Identities = 21/92 (22%), Positives = 33/92 (35%), Gaps = 16/92 (17%) Query: 55 VACIDGDVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 108 + ++ + +G + I + + D + D R K GV +AL+ + IE C Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125 Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKFGFEIEG 140 L I N A Y K F I Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 31.6 bits (71), Expect = 0.007 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%) Query: 275 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331 R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131 Query: 332 YAYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 132 GRY---------PTFSYQDWQS 144
>PF04619#Dr-family adhesin Length = 160 Score = 28.4 bits (63), Expect = 0.017 Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 4/60 (6%) Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84 +G ++ D + G+ FL+ D+N ++ W + D GSW Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 39.3 bits (91), Expect = 2e-05 Identities = 41/160 (25%), Positives = 68/160 (42%), Gaps = 14/160 (8%) Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYSAKLKASGIKCGYASGWQ 193 G L++ P L YNKD L P PPKTW+++ +LKA G + + Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251 + +A G F +N +D D ++ K + +++ + D Y Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236 Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291 + F G+ AMT + +NI + +K NYGV ++P Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.4 bits (66), Expect = 0.018 Identities = 10/34 (29%), Positives = 19/34 (55%) Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58 Q + ++ +++ T+ + G SG GK +AR L Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 49.9 bits (119), Expect = 5e-09 Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%) Query: 200 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G+ +V G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 367 +P +H + L + I+ ++ + I FF + ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.045 Identities = 9/26 (34%), Positives = 14/26 (53%) Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62 V L G G+GKS+L++ + G Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 84.5 bits (209), Expect = 4e-20 Identities = 71/408 (17%), Positives = 139/408 (34%), Gaps = 81/408 (19%) Query: 6 RHLAWWVVGALAVAAVVAWWLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64 R +A++++G L +A +++ G +GR + I + I+VK Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113 Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91 EG+ VR+G+VL K+ L++ R + + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132 Q Q+ + L+++++E + +N+ + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 133 AKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190 R SL + AI+ + + A L K+Q+ ++ I +A+ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236 QT T + S ++AP +V Q +V G V+ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295 ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410 Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341 +E D+RL L+F V I L + + +G+ A ++ Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.9 bits (88), Expect = 5e-05 Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 13/208 (6%) Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 85 + ++ + + L+ P + +DL S V G + + A + + + RR Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 86 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 145 V+++ + +++ A +L IGR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAVMG----VLCIFWIIKSLPSLPGE 201 + +V LG +G F AAA + + F + +S Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 202 PSHQKQNTFRLLQRLGVMAGMIAIFMSF 229 + N + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 39.7 bits (93), Expect = 3e-05 Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%) Query: 22 AVSRGDAVADYIIDNVSILDLINAGEISGPIVIKGRYIAGVG-AEYADT---------PA 71 V+R D +I N ILD + G + I +K IA +G A D P Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117 Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116 + I G G +D+H+H + P E A L GLT ++ Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 0.001 Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ C Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90 Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 +G SL +M + F Q G + + + ++ P+ RG G Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212 +G + A+Y+ + + + P + I+ L Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.6 bits (95), Expect = 8e-06 Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%) Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 86 RH + IWL F+ N ++P+I + + + T F +T+ + V G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSNARYFMGIGLIATGIMNILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 143 +SD+ + + G+I +++ S F L ++ F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAASALHYGWRAGMMIAGCMAIVVGIFLC 202 A Y + RG + L + +G + P + A + W ++ M ++ + Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184 Query: 203 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 262 + L +I G L I+ + Y VL Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 365 GS +F G + + GIL+ G L+++ + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351 Query: 366 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395 G++ + ++ AGA + ++L Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>PF06580#Sensor histidine kinase Length = 349 Score = 39.8 bits (93), Expect = 2e-05 Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%) Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424 LR ++L + + ++L L++ + + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 425 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 478 +KH + L+G + + + L +E+ GS + + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 479 G---TLHISCLHG-TRVSVSLP 496 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.4 bits (149), Expect = 2e-13 Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117 Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169 A+ R L + + + + + A L + T+ Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 33.3 bits (76), Expect = 2e-04 Identities = 17/63 (26%), Positives = 26/63 (41%) Query: 57 ATEFGVLLSAFSLSYGFSQLPSGILLDRFGPRIVLGAGLIFWSLMQALTGMVNSFSHFIL 116 +G+LL+ ++L G L DRFG R VL L ++ A+ + Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101 Query: 117 MRI 119 RI Sbjct: 102 GRI 104
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.4 bits (71), Expect = 0.005 Identities = 45/302 (14%), Positives = 105/302 (34%), Gaps = 40/302 (13%) Query: 7 LGIGEAPFMPAGVKSITDWYAQKERGTALGIFNSSTVIGQAIAPP--ALVLMQLAWGWRT 64 G G A F + + + ++ RG A G+ S +G+ + P ++ + W + Sbjct: 113 QGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL 172 Query: 65 MFVIIGVAGILVGICWYAWYRNRAQ----------------FVL--TDEERTYLSASVKP 106 + +I + + + F+L T ++L SV Sbjct: 173 LIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 107 -----RPQLQFSEWL---ALFKHRTTWGMILGFSGVNYTGWLYIAWLPGYLQAEQGFSLA 158 + + ++ L K+ +L + T +++ +P ++ S A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 159 KTGWVAAIP-FLAAAVGMWVNGIVVDRLAKKGYDLAKTRKTAIVCGLMMSA--LGTLLVV 215 + G V P ++ + ++ GI+VDR + +S L ++ Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRG--------PLYVLNIGVTFLSVSFLTASFLL 344 Query: 216 QSSSPAQAVAFISMALFCVHFAGTSAWGLVQVMVSETKVASIAGIQNFGSFVFASFAPIV 275 +++S + + + + F T +V + + + + + NF SF+ + Sbjct: 345 ETTSWFMTIIIVFVLGG-LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403 Query: 276 TG 277 G Sbjct: 404 VG 405
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.4 bits (120), Expect = 6e-10 Identities = 29/142 (20%), Positives = 62/142 (43%), Gaps = 3/142 (2%) Query: 1 MGVRAQQKEKTRRSLVEAAFSQLSAERSFASLSLREVAREAGIAPTSFYRHFRDVDELGL 60 Q+ ++TR+ +++ A +L +++ +S SL E+A+ AG+ + Y HF+D +L Sbjct: 2 ARKTKQEAQETRQHILDVA-LRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 TMVDESGLMLRQLMRQ-ARQRIAKGGSVIRTSVSTFMEFIGNNPNAFRLL-LRERSGTSA 118 + + S + +L + + SV+R + +E L+ + Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 119 AFRAAVAREIQHFIAELADYLE 140 A V + ++ E D +E Sbjct: 121 GEMAVVQQAQRNLCLESYDRIE 142
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.8 bits (67), Expect = 0.043 Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 10/63 (15%) Query: 22 DTSPDTLVVTAIRFEQPRSTVLAPTTVVTRQDIDRWQSTSVNDVLRRLPGV-DITQNGGS 80 +S L+V + P +T + DI + +++V D L RL GV D+ G Sbjct: 131 KSSSSYLMVAGFVSDNPGTT---------QDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181 Query: 81 GQL 83 + Sbjct: 182 YAM 184
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (289), Expect = 5e-33 Identities = 79/272 (29%), Positives = 128/272 (47%), Gaps = 27/272 (9%) Query: 7 LQDKIIIVTGGASGIGLAIVEELLAQGANVQMVDIHG-------GDGQYEGHKGYQFWPT 59 ++ KI +TG A GIG A+ L +QGA++ VD + + E F P Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-PA 64 Query: 60 DISSTKEVNHTVAEIIQRFGRIDGLVNNAGVNFPRLLVDEKAPAGQYELNEAAFEKMVNI 119 D+ + ++ A I + G ID LVN AGV P L+ + L++ +E ++ Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLI---------HSLSDEEWEATFSV 115 Query: 120 NQKGVFLMSQAVARQMVKQHDGVIVNVSSESGLEGSEGQSCYAATKAALNSFTRSWSKEL 179 N GVF S++V++ M+ + G IV V S + YA++KAA FT+ EL Sbjct: 116 NSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175 Query: 180 GKHGIRVVGIAPGILEKTGLRTPEYEEALAWTRNITVEQLREGYT---KNAIPIGRAGRL 236 ++ IR ++PG E + W EQ+ +G K IP+ + + Sbjct: 176 AEYNIRCNIVSPGSTETDMQWS-------LWADENGAEQVIKGSLETFKTGIPLKKLAKP 228 Query: 237 AEVADFVCYLLSERASYITGVTTNIAGGKTRG 268 +++AD V +L+S +A +IT + GG T G Sbjct: 229 SDIADAVLFLVSGQAGHITMHNLCVDGGATLG 260
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.033 Identities = 6/21 (28%), Positives = 12/21 (57%) Query: 24 QAQIARELGIYRTTISRLLKR 44 Q + A LG+ R T+ + ++ Sbjct: 452 QIKAADLLGLNRNTLRKKIRE 472
>PF06580#Sensor histidine kinase Length = 349 Score = 41.0 bits (96), Expect = 8e-06 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%) Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499 L+EN ++ + GG+I + +G + EV + G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310 Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535 G GL V+++++ L G I + + G V IP Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.4 bits (175), Expect = 2e-16 Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%) Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDAPIDLILLDIYMQKEN 63 +L+ DDDA + + + +++ G+ S I + DL++ D+ M EN Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61 Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112 DLLP + AR V+V+S+ T + G DYL KPF + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 13/35 (37%), Positives = 18/35 (51%) Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKR 66 VV G G GKSTL+ + GL+ + IG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 755 bits (1951), Expect = 0.0 Identities = 395/396 (99%), Positives = 395/396 (99%) Query: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 Query: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 Query: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 Query: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE Sbjct: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 Query: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSTGINAASPNKE 300 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLS GINAASPNKE Sbjct: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 Query: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP Sbjct: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 Query: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK Sbjct: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 31.1 bits (70), Expect = 0.011 Identities = 22/124 (17%), Positives = 43/124 (34%), Gaps = 21/124 (16%) Query: 128 GDEWQLALSDGETGKNYLSDAFKFGGEQKLQLKETTAQPEGERANLRVITQNRQALSDIT 187 ++WQ+ T DA L+L T + L+ + A+ ++ Sbjct: 367 NNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPV---SDAIVNMD 423 Query: 188 AILPDGNKVMMSSLRQFSGTQPLYTLDGDGTLTNNQSGVKYRPNNQ--------IGFYQS 239 ++ D K+ M+S GD N Q+ + + N++ Y S Sbjct: 424 VLITDEAKIAMAS----------EEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 240 ITAD 243 + +D Sbjct: 474 LVSD 477
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.0 bits (83), Expect = 3e-04 Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 3/87 (3%) Query: 279 VIGVMLSIFQQFVGINVVLYYAPEVFKTLGASTDIALLQTIIVGVINLTFTVLAIMT--- 335 +I ++ ++ VGI +++ P + + L S D+ I++ + L A + Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 336 VDKFGRKPLQIIGALGMAIGMFSLGTA 362 D+FGR+P+ ++ G A+ + TA Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATA 93