>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 141 bits (357), Expect = 3e-39 Identities = 84/388 (21%), Positives = 152/388 (39%), Gaps = 86/388 (22%) Query: 5 IGIDLGTTNSCVAIMDGTQARVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + Q VL PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPYKIIGADNGDAWLDVKGQKMAPPQISAE 118 P N + AI+ +K +A ++ + Sbjct: 64 GRTPGN-IAAIR---------------------------------PMKDGVIADFFVTEK 89 Query: 119 VLKK-MKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAAL 177 +L+ +K+ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 90 MLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAI 149 Query: 178 AYGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDTR 235 GL + G+ V D+GGGT ++++I ++ V + +GG+ FD Sbjct: 150 GAGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEA 197 Query: 236 LINYLVDEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADAT 291 +INY+ + G + AE+ K E+ SA + ++ + Sbjct: 198 IINYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEG 244 Query: 292 GPKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIND--VILVGGQTRMPM 348 P+ + + LE+L E + + + VAL+ SDI++ ++L GG + Sbjct: 245 VPRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRN 302 Query: 349 VQKKVAEFFGKEPRKDVNPDEAVAIGAA 376 + + + E G +P VA G Sbjct: 303 LDRLLMEETGIPVVVAEDPLTCVARGGG 330
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 832 bits (2150), Expect = 0.0 Identities = 439/873 (50%), Positives = 583/873 (66%), Gaps = 27/873 (3%) Query: 14 VAKPVLTPLALAIALAPA------PGWAENYFNPAFLSDDPSAVADLSTFSR-NAQAAGM 66 + K L + + +A A AE YFNP FL+DDP AVADLS F G Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77 Query: 67 YRVDVYLNNTFLATRDIAFQAVKTTGKSAPTDDSGLRACLTPEMLKNMGVNTGAFPLLAK 126 YRVD+YLNN ++ATRD+ F + G+ CLT L +MG+NT + + Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGD--------SEQGIVPCLTRAQLASMGLNTASVSGMNL 129 Query: 127 AAAGSCPDLASAIPAARTRFDFAQQRLDISIPQAAMVASARGYIPPQYWDEGINALLFNY 186 A +C L S I A + D QQRL+++IPQA M ARGYIPP+ WD GINA L NY Sbjct: 130 LADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNY 189 Query: 187 TFTGANSQDRSPGGSAENSYFLGLNSGLNLGAWRLRDYSTWNANSGDQNSDS--DWQHIS 244 F+G + Q+R G S + +L L SGLN+GAWRLRD +TW+ NS D +S S WQHI+ Sbjct: 190 NFSGNSVQNRIGGNS--HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHIN 247 Query: 245 THLERDVVFLQGELTAGDSYTPSALFDSLPFRGLQLASDDNMLPDSMKGFAPTIHGIARS 304 T LERD++ L+ LT GD YT +FD + FRG QLASDDNMLPDS +GFAP IHGIAR Sbjct: 248 TWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARG 307 Query: 305 NAQVTIRQNGYIINQRYVPPGAFTINDLYPTAASGDLTVEVKESDGSINRYNVPYSAVPI 364 AQVTI+QNGY I VPPG FTIND+Y SGDL V +KE+DGS + VPYS+VP+ Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367 Query: 365 LQREGRLKYAATVAEYRSDSSQKEKVKFSQATLIWGLPHGFTLYGGTQLSSHYHALAIGS 424 LQREG +Y+ T EYRS ++Q+EK +F Q+TL+ GLP G+T+YGGTQL+ Y A G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 425 GANLGDWGAVSLDVTQATSTLADNNTYQGQSLRFLYAKSLAQSGTNLQLMGYRYSTSGFY 484 G N+G GA+S+D+TQA STL D++ + GQS+RFLY KSL +SGTN+QL+GYRYSTSG++ Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 485 TLDDTTWKRMSGYDDDNRTDSDKSRPEWADYYNLYYTRRGKVQLDINQQLGGLGSLFITG 544 DTT+ RM+GY+ + + + +P++ DYYNL Y +RGK+QL + QQLG +L+++G Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547 
Query: 545 SQQSYWHTDEKDSLLQVGYSDTLAGIAWSVSYNNNKSAGDAERDQIFALNISVPLSQWLQ 604 S Q+YW T D Q G + I W++SY+ K+A RDQ+ ALN+++P S WL+ Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLR 607 Query: 605 HDDEVTHHHNVYATFSTSTDKQHNVTQNAGLSGTLLDENNLSYNIQQGYQNHGIGESGA- 663 D + + A++S S D +T AG+ GTLL++NNLSY++Q GY G G SG+ Sbjct: 608 SDS-KSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGST 666 Query: 664 --ASLEYDGAKGNANIGYNVSDNGDYQQVNYGLSGGLVAHAHGVTLSQPLGNTNILIAAP 721 A+L Y G GNANIGY S + D +Q+ YG+SGG++AHA+GVTL QPL +T +L+ AP Sbjct: 667 GYATLNYRGGYGNANIGY--SHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAP 724 Query: 722 GAANVGVVDQPGIHTDARGYAVVPYATTYRQNRMALDVNAMADDVDIDDAVTRVVPTEGA 781 GA + V +Q G+ TD RGYAV+PYAT YR+NR+ALD N +AD+VD+D+AV VVPT GA Sbjct: 725 GAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGA 784 Query: 782 LVLARFKARVGVRALVTLNHNGKPVPFGATVTVNDRHAEAIVDEAGEVYLSGLSAQGVLH 841 +V A FKARVG++ L+TL HN KP+PFGA VT + IV + G+VYLSG+ G + Sbjct: 785 IVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQ 844 Query: 842 VRWGNLPDQQCVASYHL--SSSRQILSRQHAEC 872 V+WG + CVA+Y L S +Q+L++ AEC Sbjct: 845 VKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.1 bits (169), Expect = 7e-16 Identities = 29/141 (20%), Positives = 48/141 (34%), Gaps = 2/141 (1%) Query: 1 MDSITTLIVEDEPMLAEILVDTIKLFPQFSIVGIADKLESAKKQIRLYQPQLILLDNFLP 60 M T L+ +D+ + +L L V I + + I L++ D +P Sbjct: 1 MTGATILVADDDAAIRTVLNQA--LSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 DGKGIDLIRHTISTNYTGRIIFITADNHMDTISDALRMGVFDYLIKPVHYQRLQHTLERF 120 D DL+ ++ ++A N T A G +DYL KP L + R Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 TRYRSSLRSSEQANQTHVDAL 141 S + + L Sbjct: 119 LAEPKRRPSKLEDDSQDGMPL 139
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.2 bits (68), Expect = 0.017 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 13/81 (16%) Query: 91 DATYITVGNEKGQRLYHVNPDEIGKYMEGGDSDDALYNAKSYVSVRKGSLGSSLRGKSPI 150 + + G EK Q L V +E+ KY E G + GS+G + Sbjct: 238 NGAALYYGTEKEQWLREVKVEELRKYYEEG-------------HFKAGSMGPKVLAAIRF 284 Query: 151 QDSTGKVIGIVSVGYTLEQLE 171 + G+ I + +E LE Sbjct: 285 IEWGGERAIIAHLEKAVEALE 305
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.007 Identities = 16/58 (27%), Positives = 28/58 (48%) Query: 507 ASAPAAAAPAGAGTPVTAPLAGNIWKVIATEGQTVAEGDVLLILEAMKMETEIRAAQA 564 A+A +G + + ++I EG++V +GDVLL L A+ E + Q+ Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS 141 Score = 29.8 bits (67), Expect = 0.034 Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 10/56 (17%) Query: 532 KVIATEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGAPLMTL 587 V G+ G EI+ + V+ I VK G++V G L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 40.2 bits (94), Expect = 3e-06 Identities = 21/102 (20%), Positives = 42/102 (41%), Gaps = 4/102 (3%) Query: 158 NPFTLGHRYLVEQAAAACDWLHLFVLKEDAS--FFSYTDRWALIEQGIAGIDNVTLHPGS 215 +P T GH ++E+ D +++ VL+ FS +R I + IA + N + Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69 Query: 216 AYMISRATFPGYFLKEKGV--VDDCHCQIDLQLFREHLAPAL 255 ++ A +G+ + D ++ + + LA L Sbjct: 70 GLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDL 111
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.017 Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 3/79 (3%) Query: 4 RRQPLIPGWLIPGLCAAALMITVSLAAFLALWLNAPSGAWSTLWRDSYLWHVVRFSFWQA 63 R GWL + L + + +W A + W L +++ Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL---AFINTKPVAFTLPL 116 Query: 64 FLSAVLSVVPAVFLARALY 82 LS + +VV F+ LY Sbjct: 117 ALSIIFNVVVVTFMWSLLY 135
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.2 bits (81), Expect = 7e-04 Identities = 40/281 (14%), Positives = 85/281 (30%), Gaps = 32/281 (11%) Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVSVGDKTETGALIMIFDSADGAADAAP 85 + V +T G S E+ + +VKEI V G+ G +++ + AD Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 86 AKA--------EEKKEAAPAAAPAAAAAKDVHVPDIGSDEVEVTEVMVKVG------DTV 131 ++ + + + + + + V EV+ T Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198 Query: 132 EAEQSLITVEGDKASMEVPAPFAGTVKEIKVNTGDKVSTGSLIMVFEVAGEAGAAAPAKA 191 + ++ + DK E A + ++ +K + A + Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK-QAIAKHAVLEQ 257 Query: 192 EAAPAAAAPAAATGVKDVNVPDIGGDEV-------------EVTEVMVKVGDKVAA-EQS 237 E A + + E+ + + + D + Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317 Query: 238 LITVEGDKASMEVPAPFAGTVKEIKIST-GDKVKTGSLIMV 277 L E + + + AP + V+++K+ T G V T +MV Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Score = 29.8 bits (67), Expect = 0.040 Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 4/85 (4%) Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGTVKEIKISTGDKVKTGSLIMVFEVEGAAPAAAP 289 + VA +T G S E+ VKEI + G+ V+ G ++ ++ A Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVL--LKLTALGAEADT 136 Query: 290 AKQEAAAPAPAAKAEKPAAPAAKAE 314 K +++ + + + E Sbjct: 137 LKTQSSLLQARLEQTRYQILSRSIE 161
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 774 bits (2000), Expect = 0.0 Identities = 267/879 (30%), Positives = 449/879 (51%), Gaps = 34/879 (3%) Query: 2 LFQRSLLCLTIG----AALPFSVSAANSAAEKTVVESDEAVEFNEQFLLNSS-ANIDISR 56 L+QR+ CL I A + A + A S + FN +FL + A D+SR Sbjct: 8 LYQRNTQCLHIRKHRLAGFFVRLFVACAFAA-QAPLSSAELYFNPRFLADDPQAVADLSR 66 Query: 57 YAYGNPVLAGTYRVKVNLNNALKSTSEITFNEN-GTPRASACLTPLLLTQAGVDPAAMRD 115 + G + GTYRV + LNN +T ++TFN CLT L G++ A++ Sbjct: 67 FENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSG 126 Query: 116 DVEVDDNTTCLDIKKYYPGATANYDSGKQAMDLNFPQIYILKRPAGYVDPSLWEDGVPAA 175 + D+ C+ + ATA D G+Q ++L PQ ++ R GY+ P LW+ G+ A Sbjct: 127 MNLLADDA-CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAG 185 Query: 176 IVSYDMNAWHSEGN-GTTSDTAYVELRYGLNMGPWRLRSRGSLNWNKDTGS-----EYNN 229 +++Y+ + + G S AY+ L+ GLN+G WRLR + ++N S ++ + Sbjct: 186 LLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQH 245 Query: 230 QDVYLQRDITALKAQMVIGDSYTRGDAFDSFSLSGIRMYNDDRMLPMGSSNYAPVIRGVA 289 + +L+RDI L++++ +GD YT+GD FD + G ++ +DD MLP +APVI G+A Sbjct: 246 INTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIA 305 Query: 290 NSNAKVTVMQSGNKIYETTVPPGAFEINDLSTTGYGNDLLVTVEEADGSKRSFTVPFSSV 349 A+VT+ Q+G IY +TVPPG F IND+ G DL VT++EADGS + FTVP+SSV Sbjct: 306 RGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSV 365 Query: 350 TQMLRPGATRWDIGVGEL-NDDSLHDKPQVGYAQFYYGLNNTFTGYIGAQYTDMNFYAGL 408 + R G TR+ I GE + ++ +KP+ + +GL +T Y G Q D + A Sbjct: 366 PLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFN 424 Query: 409 LGLAMNTG-IGAFAFDVTQSHASIDDLGTLSGQNYRLTYSKMIEATNTSFNVAAYRFSTE 467 G+ N G +GA + D+TQ+++++ D GQ+ R Y+K + + T+ + YR+ST Sbjct: 425 FGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTS 484 Query: 468 DYLSLNDAASLQDSVKH---QQYAQQSYRSGDELYDDYQRTKNQVQISINQPLNQGETTW 524 Y + D + + + Q Q + Y+ + ++Q+++ Q L + Sbjct: 485 GYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR----T 540 Query: 525 
GSVYVSGTWQDYWNDAGSTANYSVGYNNSFAYGSYSVSLQRAYDQNGSK-DDSVYLSFSI 583 ++Y+SG+ Q YW + + G N +F ++++S + D + L+ +I Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 Query: 584 PLSMFSHNGERSG-GFSNINMGLRSDMKGGTNVNSTASGNT-KDSDISYSVSA-TSSSGN 640 P S + + +S ++ + + D+ G + G +D+++SYSV + G+ Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 Query: 641 YGNLNQVSGFGSLNSSYGPLGLSASFGDDNSQQYSASYSGGMVLHSGGVAFTPGSIGETD 700 + + + YG + S DD +Q SGG++ H+ GV + +T Sbjct: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDI-KQLYYGVSGGVLAHANGVTLGQ-PLNDT- 717 Query: 701 AVALVKASGAQGAGL-GYSSSEIGSSGYGILPYMSAYRENRVSLDISTLENDVEIKSTST 759 V LVKA GA+ A + + GY +LPY + YRENRV+LD +TL ++V++ + Sbjct: 718 -VVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVA 776 Query: 760 VTVPRSGAVVLVNFETDEGRSLILELLRSDKGFIPLGADVLNEKNETVGTVGQAGQAYVR 819 VP GA+V F+ G L++ L ++K +P GA V +E +++ G V GQ Y+ Sbjct: 777 NVVPTRGAIVRAEFKARVGIKLLMTLTHNNKP-LPFGAMVTSESSQSSGIVADNGQVYLS 835 Query: 820 GVEPQGELRVVWGSGKESTCTVRYQLAETTAKAGLTPVL 858 G+ G+++V WG + + C YQL + + LT + Sbjct: 836 GMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLS 874
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 501 bits (1292), Expect = 0.0 Identities = 246/296 (83%), Positives = 268/296 (90%) Query: 1 MRELYPLTRRRLLTAMALSPLLWQMNTAQAAAIDPRRIVALEWLPVELLLALGITPYGVA 60 M L ++RRRLLTAMALSPLLWQMNTA AAAIDP RIVALEWLPVELLLALGI PYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DVPNYKLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEKLARIAPGR 120 D NY+LWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPE LARIAPGR Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFDFSDGKKPLAVARRSLVELAQTLNLEAAAEKHLAQYDRFIASQKPRFIRRGGRPLLMT 180 GF+FSDGK+PLA+AR+SL E+A LNL++AAE HLAQY+ FI S KPRF++RG RPLL+T Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVLGPNCLFQEVLDEYGIVNAWQGETNFWGSTAVSIDRLAMYKEADVICFDH 240 TLIDPRHMLV GPN LFQE+LDEYGI NAWQGETNFWGSTAVSIDRLA YK+ DV+CFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 GNNTDMNALMATPLWQAMPFVRAGRFHRVPAVWFYGATLSTMHFVRILNNVLGGKA 296 N+ DM+ALMATPLWQAMPFVRAGRF RVPAVWFYGATLS MHFVR+L+N +GGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 32.3 bits (73), Expect = 4e-04 Identities = 49/170 (28%), Positives = 70/170 (41%), Gaps = 25/170 (14%) Query: 16 MLSAVSMPL---AAESKTVNMTLTIVVNAAPPCTVTGGEVEFGNV-LTTKVDGVNYRQAV 71 ML AV M AA++ T L I P CTV EV +G++ + V ++ Sbjct: 12 MLGAVLMSQHVHAADNLTFKGKLII-----PACTVQNAEVNWGDIEIQNLVQSGGNQKDF 66 Query: 72 GYRLSCNGRVSDYLKLQIQGNAVTINGESVLQTDV---DGLGIRLQTATDGALVSPGNTQ 128 ++C + +K+ I N T N V T DGL I L + + + GN Sbjct: 67 TVDMNCPYSLGT-MKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGI---GNAV 122 Query: 129 WLSFQYS----GGSGPA-----IEAIPVKDNGVTLTGGAFNAGATLVVDY 169 L Q + G+ PA + K N +L G F+A ATLV Y Sbjct: 123 TLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 32.7 bits (74), Expect = 3e-04 Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 18/161 (11%) Query: 23 VSAADNLHFSGSLVASPCTLTMQGADIAEVDFSSLDASDFIPGGQSARKPVVFELTDCDS 82 V AADNL F G L+ CT+ AEV++ ++ + + G + + V +C Sbjct: 22 VHAADNLTFKGKLIIPACTVQN-----AEVNWGDIEIQNLVQSGGNQKDFTV--DMNCPY 74 Query: 83 ALSNGVQVIFTGTEATGMRGILAIDSYSGASGIGIGIETLSGVPVGINNES--GAVFT-- 138 +L ++V T TG IL ++ S ASG G+ I + GI N G+ T Sbjct: 75 SLGT-MKVTITSNGQTG-NSILVPNT-STASGDGLLIYLYNSNNSGIGNAVTLGSQVTPG 131 Query: 139 LVTGKN---TLSLNAWV-QRLPGEDLIPGRFSASALATFEY 175 +TG ++L A + + + L G FSA+A Y Sbjct: 132 KITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 33/177 (18%), Positives = 69/177 (38%), Gaps = 3/177 (1%) Query: 213 FWLLFMILALGVFSGMVISSSSAQIGMTQYGLLSGAL-VVSLVSIFNSIGRLFWGGLTDK 271 WL + V + MV++ S I + V + + SIG +G L+D+ Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 272 LGGYNTLVIVYLFTCVCMLLLLFFNGNTSVFYFSALGVGFAYAGILVIFPGLTSQNFGMR 331 LG L+ + C ++ + S+ + G A + + ++ Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 332 NQGLNYGFMYFGFAVGAVIAPYVTSAIAKYTGSYNTVFILTTVLLLIGVVLTLITKK 388 N+G +G + A+G + P + IA Y ++ + ++ + ++ L + KK Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH-WSYLLLIPMITIITVPFLMKLLKK 191
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 120 bits (303), Expect = 5e-32 Identities = 85/368 (23%), Positives = 148/368 (40%), Gaps = 68/368 (18%) Query: 23 GIDLGTTNSLVATVRSGQAETLPDHEGRHLLPSVVHYQQQGHTVGYAARDNAAQDTANTI 82 IDLGT N+L+ G + +E PSVV A + ++ Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVV------------AIRQDRAGSPKSV 52 Query: 83 SSV----KRMMGRSLADIQARYPHLPYRFKASVNGLPMIDTAAGLLNPVRVSADILKALA 138 ++V K+M+GR+ +I A P M D G++ V+ +L+ Sbjct: 53 AAVGHDAKQMLGRTPGNIAAIRP--------------MKD---GVIADFFVTEKMLQHFI 95 Query: 139 ARA-SESLSGELDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYGLDS 197 + S S V++ VP +R+ +++A+ AG + L+ EP AAAI GL Sbjct: 96 KQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPV 155 Query: 198 GKEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG--I 255 + V D+GGGT +++++ L+ V +GGD FD + +Y+R G I Sbjct: 156 SEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLI 210 Query: 256 ADRSDNRVQRELLDAAITAKIALSDADTVRVNVAG---WQG-----EITREQFNDLISAL 307 + + R++ E+ A + + V G +G + + + + Sbjct: 211 GEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP 263 Query: 308 VKRTLLACRRALKDAGVE-PQDVLE--VVMVGGSTRVPLVRERVGEFFGRTPLTAIDPDK 364 + + A AL+ E D+ E +V+ GG + + + E G + A DP Sbjct: 264 LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLT 323 Query: 365 VVAIGAAI 372 VA G Sbjct: 324 CVARGGGK 331
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.4 bits (65), Expect = 0.008 Identities = 19/68 (27%), Positives = 26/68 (38%), Gaps = 2/68 (2%) Query: 3 KVILGAVLFTLSGSVLSSSLQDQLAAVAQAEQQGKNEENRQRDALQAKRDQEA--QQERQ 60 LFT + S L + AA A E N+ Q A ++ +E QQ Sbjct: 185 TAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAI 244 Query: 61 RQANAAAV 68 R AN A+ Sbjct: 245 RAANTYAM 252 Score = 27.8 bits (61), Expect = 0.023 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 1/46 (2%) Query: 46 ALQAKRDQEAQQERQRQANAAAVAKQRAKAAEAERKARQAKLAAEA 91 +LQ + + + +A AA A+++A AAEA+RKA + A Sbjct: 199 SLQIRMNTLTAAKASIEAAAANKAREQA-AAEAKRKAEEQARQQAA 243
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 37.6 bits (87), Expect = 4e-05 Identities = 31/149 (20%), Positives = 53/149 (35%), Gaps = 28/149 (18%) Query: 290 NQLTSDLLDQWSKGNVHQQHAAQYGRALQAMEASKYDEARKTLQPLLSAEPNNAWYLDLA 349 N+++SD L+Q Y A ++ KY++A K Q L + ++ + Sbjct: 29 NEISSDTLEQL------------YSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGL 76 Query: 350 TDIDLGQKRANDAINRLKNARDLRVN-PVLQLNLANAYLQGGQPKAAETILNRYTFSHKD 408 + + AI+ + + P + A LQ G+ AE+ L Sbjct: 77 GACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLF-------- 128 Query: 409 DGNGWDLLAQAEAALNNRDQELAARAESY 437 LAQ A +EL+ R S Sbjct: 129 -------LAQELIADKTEFKELSTRVSSM 150
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.5 bits (74), Expect = 0.003 Identities = 9/70 (12%), Positives = 26/70 (37%), Gaps = 6/70 (8%) Query: 262 DHALPALLSGLSESWQVQELSRLWLQLVQHDAKGVLQQTLRTWFEHNCDLTQTAKALHIH 321 + + + ++ L L +++ ++ L + + A L ++ Sbjct: 409 EENMRQYFASFGDALPPSGLYDRVLAEMEYP---LILAALT---ATRGNQIKAADLLGLN 462 Query: 322 VNTLRYRLQR 331 NTLR +++ Sbjct: 463 RNTLRKKIRE 472
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 31.4 bits (71), Expect = 0.009 Identities = 24/122 (19%), Positives = 49/122 (40%), Gaps = 13/122 (10%) Query: 459 SRIAVHPARQREGIGQQLIACACMQAAQCDYLSVSFGYT-------PELWRFWQRCGFVL 511 SR V +R ++ +G + + + + +Y S GY + +R G+ Sbjct: 100 SRFFVDKSRAKDILGNEYPISSMLFLSMINY-SKDKGYDGIYTIVSHPMLTILKRSGWG- 157 Query: 512 VRMGNHREASSGCYTAMALLPLSDAG-KRLAQQEHRRLRRDADILTQWNGEAIPLAALDE 570 +R+ + + LP+ D + LA++ +R ++ L QW + + A Sbjct: 158 IRVVEQGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQW---PLRVPAAIA 214 Query: 571 QA 572 QA Sbjct: 215 QA 216
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 48.6 bits (116), Expect = 9e-09 Identities = 32/116 (27%), Positives = 51/116 (43%), Gaps = 9/116 (7%) Query: 64 VRDGIVWDFFGAVTLVRRHLDTLEQQLGCRFT-HAATSFPPGTDP---RISINVLESAGL 119 ++DG++ DFF +++ + + R + P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 120 EVSHVLDEPTAVA---DLLALDNAG--VVDIGGGTTGIAIVKQGKVTYSADEATGG 170 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 8e-04 Identities = 12/33 (36%), Positives = 16/33 (48%) Query: 30 MVALLGPSGSGKTTLLRIIAGLEHQSSGHIRFH 62 V L G G GK+TL+ + GL+ S H Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 748 bits (1934), Expect = 0.0 Identities = 276/571 (48%), Positives = 388/571 (67%), Gaps = 2/571 (0%) Query: 1 MISGILASPGIAFGKALLLKEDEIVIDRKKISADKVDQEVERFLSGRAKASAQLEAIKTK 60 I+GI AS G+A KA + E + I++ I V E+E+ + K+ +L AIK + Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 61 AGETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAAHEVIEGQATALEELDD 120 + G +K IF H+++L+D EL I I+++ M A+ A EV + + E +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 121 EYLKERAADVRDIGKRLLRNILGLAIIDLSAIQEEVILVAADLTPSETAQLNLQKVLGFI 180 EY+KERAAD+RD+ KR+L +++G+ L+ I EE +++A DLTPS+TAQLN Q V GF Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 181 TDAGGRTSHTSIMARSLELPAIVGTGSVTAQVKNGDYLILDAVNNQVYVNPTNDVIEQLR 240 TD GGRTSH++IM+RSLE+PA+VGT VT ++++GD +I+D + V VNPT + ++ Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 241 AVQEQVATEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFL 300 + +K E AKL P+ T DG VE+ ANIGT +DV+G NG EG+GLYRTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 301 FMDRDALPTEEEQFAAYKAVAEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAV 360 +MDRD LPTEEEQF AYK V + + V++RT+DIGGDKEL Y+ PKE NPFLG+RA+ Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 361 RIAMDRKEILRDQVRAILRASAFGKLRIMFPMIISVEEVRALRKEIEIYKQELRDEGKAF 420 R+ +++++I R Q+RA+LRAS +G L++MFPMI ++EE+R + ++ K +L EG Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 421 DESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS 480 +SIE+G+MVE P+ A A AKEVDFFSIGTNDL QYT+A DR N+ +S+LYQP P+ Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 481 VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540 +L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + + Sbjct: 482 
ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 541 NFEDAKVLAEQALAQPTTDELMTLVNKFIEE 571 + E+ K A++AL T +E+ LV K + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 41.9 bits (98), Expect = 1e-06 Identities = 25/128 (19%), Positives = 39/128 (30%), Gaps = 11/128 (8%) Query: 68 VHRVNHAPGQSQEHDAPRQSPQHQYQPPYASAQPRPAAPPQPQAPMQQPVQQPVQPAPQP 127 VH+V P +Q +P P P P P+P +P P P P Sbjct: 37 VHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEP-----EPEPIPEPPKEAP 91 Query: 128 QQVQPSAPPVQPPQQQPAPPSQAPQPVAQPAPPPSAQTFQPAEPVVE-----AEPVVEEA 182 ++ P P+ +P + P+ +P A F+ P + Sbjct: 92 VVIEKPKPK-PKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 Query: 183 PVVEKPQR 190 V R Sbjct: 151 TSVASGPR 158 Score = 38.0 bits (88), Expect = 2e-05 Identities = 18/82 (21%), Positives = 21/82 (25%) Query: 96 YASAQPRPAAPPQPQAPMQQPVQQPVQPAPQPQQVQPSAPPVQPPQQQPAPPSQAPQPVA 155 Y S P Q V PQ Q P P+ +P P PV Sbjct: 34 YTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVV 93 Query: 156 QPAPPPSAQTFQPAEPVVEAEP 177 P P + VE Sbjct: 94 IEKPKPKPKPKPKPVKKVEQPK 115 Score = 35.0 bits (80), Expect = 2e-04 Identities = 26/79 (32%), Positives = 30/79 (37%), Gaps = 4/79 (5%) Query: 117 VQQPVQPAP-QPQQVQPSAPPVQPPQQQPAPPSQAPQPVAQPAPPPSAQTFQPAEPVVEA 175 Q PAP QP V AP P Q PP P+PV +P P P P E V Sbjct: 38 HQVIELPAPAQPISVTMVAPADLEPPQAVQPP---PEPVVEPEPEPEPIPEPPKEAPVVI 94 Query: 176 EPVVEEAPVVEKPQRKEAV 194 E + KP +K Sbjct: 95 EKPKPKPKPKPKPVKKVEQ 113 Score = 28.0 bits (62), Expect = 0.042 Identities = 10/80 (12%), Positives = 23/80 (28%) Query: 93 QPPYASAQPRPAAPPQPQAPMQQPVQQPVQPAPQPQQVQPSAPPVQPPQQQPAPPSQAPQ 152 +P P+P+ + V+QP + + S P + + + A Sbjct: 87 PKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAAT 146 Query: 153 PVAQPAPPPSAQTFQPAEPV 172 + + +P Sbjct: 147 SKPVTSVASGPRALSRNQPQ 166
>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature. Length = 170 Score = 29.5 bits (66), Expect = 0.010 Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 14/58 (24%) Query: 161 AILSEPFLLLCRQDHPLAHQEWVSWQDLKQ----------ASLVLQDYASGSRP-LID 207 AI + ++LL R D+P+A + SW +L SLV +D+ SG R ID Sbjct: 38 AIQANQYVLLTRDDYPVA---YCSWANLSLENEIKYLNDVTSLVAEDWTSGDRKWFID 92
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 26.6 bits (58), Expect = 0.027 Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 3/36 (8%) Query: 5 RMTPEELANLTGYSR---QTINKWVRKEGWATSPKP 37 ++TP ELA++ Y R IN ++ G +P P Sbjct: 275 KLTPNELADVNDYMRGGYTAINNYLISNGPLNNPNP 310
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.007 Identities = 40/217 (18%), Positives = 72/217 (33%), Gaps = 14/217 (6%) Query: 28 IQALLSVFLGYLAYYIVRNNFTLSTPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVM 84 I L +V L + ++ P L L S G+L + + V+ Sbjct: 8 IVILSTVALDAVGIGLI----MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 85 SSLADKASPKVFMACGLVLCAIVNVGLGFSSAFWIFAALVVFNGLFQGMGVGPSFITIAN 144 +L+D+ + + L A+ + + W+ + G+ G IA+ Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122 Query: 145 WFPRRERGRVGAFWNISHNVGGGIVA-PIVGAAFAILGNEHWQSASYIVPACVAVIFALI 203 ER R F +S G G+VA P++G ++G + + A + F Sbjct: 123 ITDGDERAR--HFGFMSACFGFGMVAGPVLG---GLMGGFSPHAPFFAAAALNGLNFLTG 177 Query: 204 VLVLGKGSPREEGLPSLEQMMPEEKVVLKTKNTAKAP 240 +L + E E + P T A Sbjct: 178 CFLLPESHKGERRPLRREALNPLASFRWARGMTVVAA 214
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 248 bits (635), Expect = 6e-80 Identities = 120/474 (25%), Positives = 191/474 (40%), Gaps = 73/474 (15%) Query: 7 SILLIDDDVDVLDAYTQMLEQAGYRVRGFTHPFEAKEWVKADWEGIVLSDVCMPGCSGID 66 +IL+ DDD + Q L +AGY VR ++ W+ A +V++DV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LMTLFHQDDDQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGNLLILIEDALRQRRS 126 L+ + LP+L+++ A+ A +KGA+D+L KP D L+ +I AL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 127 VIARRQYCQQTLQVDLIGRSEWMNQFRQRLQQLAETDIAVWFYGEHGTGRMTGARYLHQL 186 ++ + Q L+GRS M + + L +L +TD+ + GE GTG+ AR LH Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 187 GRNAKGPFVRYELT--PENAGQLETF-----------------IDQAQGGTLVLSHPEYL 227 G+ GPFV + P + + E F +QA+GGTL L + Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 228 TREQQHHLAR-LQSLEHRP----------FRLVGVGSASLVEQAAANQIAAELYYCFAMT 276 + Q L R LQ E+ R+V + L + +LYY + Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 277 QIACQSLSQRPDDIEPLFRHYLRKACLRLNHPVPEIAGELLKGIMRRAWPSNVRELANAA 336 + L R +DI L RH++++A + V E L+ + WP NVREL N Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 337 ELFAV-----------------------------------GVLPLAETVNPQLL------ 355 + E Q Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 356 LQEPTPLDRRVEEYERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 409 L DR + E E +I AL +G + A+ L + R L ++++ G+S Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 472 bits (1217), Expect = e-172 Identities = 149/320 (46%), Positives = 211/320 (65%), Gaps = 11/320 (3%) Query: 1 MKKHAIAVMMIAVFSESVYAESTLFIPDVSPESVTTSLSVGVLNGKSRELVYD-TDTGRK 59 M+ + +++ + S +A + +P+++ +S+G L+GK++E VY + GRK Sbjct: 1 MRAKLLGIVLTTPIAISSFASTET--LSFTPDNINADISLGTLSGKTKERVYLAEEGGRK 58 Query: 60 LSQLDWKIKNVATLQGDLSWKPYSFMTLDARGWTSLASGSGHMVDHDWMSSEQPG-WTDR 118 +SQLDWK N A ++G ++W +++ A GWT+L S G+MVD DWM S PG WTD Sbjct: 59 VSQLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDE 118 Query: 119 SIHPDTSANYANEYDLNVKGWLLQGDNYKAGVTAGYQETRFSWTARGGSYIYDNGR---- 174 S HPDT NYANE+DLN+KGWLL NY+ G+ AGYQE+R+S+TARGGSYIY + Sbjct: 119 SRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRD 178 Query: 175 YIGNFPHGVRGIGYSQRFEMPYIGLAGDYRINDFECNVLFKYSDWVNAHDNDEHY--MRK 232 IG+FP+G R IGY QRF+MPYIGL G YR DFE FKYS WV + DNDEHY ++ Sbjct: 179 DIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKR 238 Query: 233 LTFREKTENSRYYGASIDAGYYITSNAKIFAEFAYSKYEEGKGGTQIIDKTSGNTAYFGG 292 +T+R K ++ YY +++AGYY+T NAK++ E A+++ KG T + D + NT+ + Sbjct: 239 ITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNN-NTSDYSK 297 Query: 293 DAAGIANNNYTVTAGLQYRF 312 + AGI N N+ TAGL+Y F Sbjct: 298 NGAGIENYNFITTAGLKYTF 317
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.7 bits (98), Expect = 3e-06 Identities = 78/362 (21%), Positives = 133/362 (36%), Gaps = 30/362 (8%) Query: 14 NFSLFRIAFAAFLTYMTVGLPLPVIPLFVHHELGYSNTMV---GIAVGIQFFATVLTRGY 70 N L I L + +GL +PV+P + +L +SN + GI + + Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLLR-DLVHSNDVTAHYGILLALYALMQFACAPV 62 Query: 71 AGRLADQYGAKRSALQGMFACGLAGAAWLLAALLPVSAPVKFALLIVGRLILGFGESQLL 130 G L+D++G + + LAGAA + + +AP +L +GR++ G + Sbjct: 63 LGALSDRFGRRP-----VLLVSLAGAA--VDYAIMATAPF-LWVLYIGRIVAGITGATGA 114 Query: 131 TGTLTWGLGLVGPTRSGKVMSWNGMAIYGALAAGAPLGLL---IHSHFGFAALAGTTMVL 187 G R+ + + + AG LG L H F A A + Sbjct: 115 VAGAYIADITDGDERA-RHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLN 173 Query: 188 PLLAWAFNGTVRKVPAYTGERPSLWSVVGLIWKPGL-----------GLALQGVGFAVIG 236 L K R +L + W G+ + L G A + Sbjct: 174 FLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALW 233 Query: 237 TFISLYFVSNGWTMAGFTLTAFGGAFVLMRIL-FGWMPDRFGGVKVAVVSLLVETAGLLL 295 T G +L AFG L + + G + R G + ++ ++ + G +L Sbjct: 234 VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYIL 293 Query: 296 LWLAPTAWIALVGAALTGAGCSLIFPALGVEVVKRVPAQVRGTALGGYAAFQDISYGVTG 355 L A W+A L +G + PAL + ++V + +G G AA ++ + G Sbjct: 294 LAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-SIVG 351 Query: 356 PL 357 PL Sbjct: 352 PL 353
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 28.3 bits (63), Expect = 0.019 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 3/56 (5%) Query: 68 MVTSFT---AVHDVARFGAEVLRASPRQADLMVVAGTCFTKMAPVIQRLYDQMLEP 120 M+TSFT V + R A P Q L + F M+PVI ++Y +P Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQP 115
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 45.6 bits (108), Expect = 2e-07 Identities = 32/148 (21%), Positives = 58/148 (39%), Gaps = 16/148 (10%) Query: 185 PGAVAIVAEDSKVARAMLEKGLNAMEIPHQMHVTGKDAWERIQQLAQEAEAEGKPISEKI 244 GA +VA+D R +L + L+ ++ W I + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-------------AGDG 48 Query: 245 ALVLTDLEMPEMDGFTLTRKIKTDERLKKIPVVIHSSLSGSANEDHVRKVKADGYVAK-F 303 LV+TD+ MP+ + F L +IK + +PV++ S+ + + A Y+ K F Sbjct: 49 DLVVTDVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106 Query: 304 EINELSSVIQEVMERAAQNISGPLVSRQ 331 ++ EL +I + + S Q Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQ 134
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 30.2 bits (68), Expect = 0.002 Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 12/74 (16%) Query: 1 MIDWQDLHHSELTVPQLYALLKLRCAVFV--------VEQRCPYLDVDGDDLVGDNRHIL 52 M++ D++H+ L+ + L LR F + D + + ++ Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56 Query: 53 GWHQDELVAYARIL 66 G + ++ R + Sbjct: 57 GIKDNTVICSLRFI 70
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.2 bits (63), Expect = 0.031 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 15/75 (20%) Query: 133 AERDTQAYLKLDHDFHYVFVKYADNKYISQAHLLISARLLAIRYRLDFTAEYITSSNRGH 192 A+R+ L F VF+ + A+RY L+ Y S+ G Sbjct: 62 ADREGMTDLFASGHFERVFISPH------RL---------AVRYSLENPHAYADSNLTGF 106 Query: 193 ATILDMLKNNNVEGV 207 IL+ ++N ++ + Sbjct: 107 LNILEGCRHNKIQHL 121
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.6 bits (69), Expect = 0.012 Identities = 36/179 (20%), Positives = 68/179 (37%), Gaps = 12/179 (6%) Query: 25 ILYFFNYMDRVNIGFAALRMNESLGITPEDFANISSIFFISYLIFQIPSSIGLQKLGARK 84 IL FF+ ++ + + + + P +++ F +++ I +LG ++ Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 85 W--ISSIIIGWGAVTGLIFFAKDTQHIL-LARIFLGVFEAGFFPGMVYYLACWFPARERG 141 II +G+V G F +L +AR G A F ++ +A + P RG Sbjct: 81 LLLFGIIINCFGSVIG--FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 142 KVNSFFMLSIAVASVLAAPMSGWIIEHLNTPDYEGWRWLFAIEGIPTVFLGILTFYLLP 200 K +A+ + + G I ++ W +L I I T+ LL Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI-TIITVPFLMKLLK 190
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 1e-17 Identities = 29/104 (27%), Positives = 47/104 (45%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNAIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY + ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRQLGLTLPVVGVTANALAEEKQRCLESGMDSCLSKPVTLD 930 RI++ LPV+ ++A + E G L KP L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 8e-09 Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ P L ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139 DL + + + + S+L + Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 540 bits (1392), Expect = 0.0 Identities = 261/389 (67%), Positives = 298/389 (76%), Gaps = 17/389 (4%) Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLFGKVDGLHYFSDDKGSDGDQTYMRIG 60 MK KVL+L++PALL AGAA+AAEIYNKDGNKLDL+GKVDGLHYFSDD DGDQTYMR+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQVNDQLTGYGQWEYQIQGNQTEG-SNDSWTRVAFAGLKFADAGSFDYGRNYGVTY 119 FKGETQ+NDQLTGYGQWEY +Q N TEG +SWTR+AFAGLKF D GSFDYGRNYGV Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVTSWTDVLPEFGGDTYG-ADNFMQQRGNGYATYRNTDFFGLVDGLDFALQYQGKNGSVS 178 DV WTD+LPEFGGD+Y ADN+M R NG ATYRNTDFFGLVDGL+FALQYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 179 GEN--------TNGRSLLNQNGDGYGGSLTYAIGEGFSVGGAITTSKRTADQNNTANARL 230 ++ NG + NGDG+G S TY IG GFS G A TTS RT +Q N Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG--T 238 Query: 231 YGNGDRATVYTGGLKYDANNIYLAAQYSQTYNATRFGTSNGSNPSTSYGFANKAQNFEVV 290 GD+A +T GLKYDANNIYLA YS+T N T +G ++ G ANK QNFEV Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDK---GYDGGVANKTQNFEVT 295 Query: 291 AQYQFDFGLRPSVAYLQSKGKDISNGYGASYGDQDIVKYVDVGATYYFNKNMSTYVDYKI 350 AQYQFDFGLRP+V++L SKGKD++ + D+D+VKY DVGATYYFNKN STYVDYKI Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYN-NVNGDDKDLVKYADVGATYYFNKNFSTYVDYKI 354 Query: 351 NLLDKND-FTRDAGINTDDIVALGLVYQF 378 NLLD +D F +DAGI+TDDIVALG+VYQF Sbjct: 355 NLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 29.0 bits (65), Expect = 0.006 Identities = 10/30 (33%), Positives = 12/30 (40%) Query: 1 MNLRRKNRLWVVCAVLAGLALTTALVLYAL 30 R K WVV V LA + + AL Sbjct: 27 AAERSKKLAWVVAGVAGALATAGVVAVAAL 56
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.9 bits (166), Expect = 2e-15 Identities = 24/114 (21%), Positives = 48/114 (42%), Gaps = 2/114 (1%) Query: 9 VLIVDDHPLMRRGIRQLLELDPAFYVVAEAGDGASAIDLANRIEPDLILLDLNMKGLSGL 68 +L+ DD +R + Q L A Y V + A+ + DL++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDSASDIYALIDAGADGYLLKDSDPEVLLEAIRK 122 D L +++ +++++ ++ + GA YL K D L+ I + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 51.4 bits (123), Expect = 4e-09 Identities = 64/425 (15%), Positives = 137/425 (32%), Gaps = 65/425 (15%) Query: 22 RVIICCFLVVMLDGFDTAAIGFIAPDIRTHWQLSASELAPLFGAGLLGLTAGALLCGPLA 81 +++I ++ + + PDI + + + A +L + G + G L+ Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 82 DRFGRKRVIELCVALFGALSLLSAFS-PDIETLVLLRFLTGLGLGGAMPNTIT-MTSEYL 139 D+ G KR++ + + S++ L++ RF+ G G A P + + + Y+ Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132 Query: 140 PARRRGALVTLMFCGFTLGSAMGGIVSAQLVPLIGWHGILALGGILPLMLFFGLLFALPE 199 P RG L+ +G +G + + I W +L + I + + F + E Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 200 SPRWQVRRQLPQAVVARTVSAITGERYHDTQFFLHEVAAVAKGSIRQLFAGRQLVITLML 259 R + LF + L++ Sbjct: 193 ------------------------VRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIV 228 Query: 260 WVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGALLL---------- 309 V+ F+ + + ++ P + G ++ V + GT+ + Sbjct: 229 SVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVH 287 Query: 310 --------------------------GVLMDRLNPFRVLAVSYALGAVCIVMIGLSENG- 342 G+L+DR P VL + +V + Sbjct: 288 QLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT 347 Query: 343 LWLMALAIFGTGIGISGSQVGLNALTATLYPTQSRATGVSWSNAIGRCGAIVGSLSGGMM 402 W M + I G+S ++ ++ + ++ Q G+S N G G + Sbjct: 348 SWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407 Query: 403 MTLNF 407 +++ Sbjct: 408 LSIPL 412 Score = 43.3 bits (102), Expect = 2e-06 Identities = 40/169 (23%), Positives = 73/169 (43%), Gaps = 1/169 (0%) Query: 251 RQLVITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGALLLG 310 R I + L ++ F S+L +L+ +P + N +WV AF + ++G + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 311 VLMDRLNPFRVLAVSYALGAVCIVMIGLSENGLWLMALAIFGTGIGISGSQVGLNALTAT 370 L D+L R+L + V+ + + L+ +A F G G + + + A Sbjct: 71 
KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 371 LYPTQSRATGVSWSNAIGRCGAIVGSLSGGMMM-TLNFSFDTLFFVIAI 418 P ++R +I G VG GGM+ +++S+ L +I I Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITI 179
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (290), Expect = 2e-33 Identities = 69/253 (27%), Positives = 116/253 (45%), Gaps = 12/253 (4%) Query: 3 KVAIVTASDSGIGKACALLLAQNGFDIGITWHSDERGAQETAKKAAQFGVRAETIHLDLS 62 K+A +T + GIG+A A LA G I ++ E+ + + A+ AE D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVR 67 Query: 63 QLPEGAQAIEHLIQRLGRVDVLVNNAGAMTKSAFIDMPFTQWRQIFTVDVDGAFLCAQIA 122 + + + +G +D+LVN AG + + +W F+V+ G F ++ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARHMIKQGEGGRIINITSVHEHTPLPQASAYTAAKHALGGLTKSMALELIEYHILVNAVA 182 +++M+ + G I+ + S P +AY ++K A TK + LEL EY+I N V+ Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGAIATPM-------NDMDDSDIEPGSEP---SIPIARPGSTHEIASLVAWLCSEGASYT 232 PG+ T M + + I+ E IP+ + +IA V +L S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 233 TDQSLIVDGGFML 245 T +L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 27.5 bits (61), Expect = 0.034 Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 1/39 (2%) Query: 161 WLHDLDQHLRH-GVWLILAIVLVVGVRWWLKRRGKAEAR 198 L + +R G W++LA++ + R+ K Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVS 253
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 37.1 bits (86), Expect = 5e-05 Identities = 38/180 (21%), Positives = 67/180 (37%), Gaps = 6/180 (3%) Query: 11 FALMLAVPFAPQAVAKTAATTAASQPEIASGSAMI-VDLNTNKVIYSNHPDLVRPIASIT 69 +L+ +P A A + S+ +++ MI +DL + + + + D P+ S Sbjct: 9 ISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTF 68 Query: 70 KLMTAMVVLDARLPLDEILKVDISQTPEMKGVYSRV---RLNSEISRKNMLLLALMSSEN 126 K++ VL DE L+ I + YS V L ++ + A+ S+N Sbjct: 69 KVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDN 128 Query: 127 RAAASLAHYY--PGGYNAFIKAMNAKAKALGMTHTRFVEPTGLSIHNVSTARDLTKLLIA 184 AA L P G AF++ + L T E + +T + L Sbjct: 129 SAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRK 188
>PF06580#Sensor histidine kinase Length = 349 Score = 219 bits (559), Expect = 1e-68 Identities = 60/216 (27%), Positives = 116/216 (53%), Gaps = 3/216 (1%) Query: 343 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402 L G + + + ++ ++++ L AQ+NPHF+FNALN I+A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 403 SQLVQYLSTFFRKNLKR-PSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQ 461 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ + + + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 462 KLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAG-SSGL 520 ++P +Q +VEN IKHG +QL G + ++ ++ + L++E+ L + S+G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 521 GMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLP 556 G+ V +RL+ +G + I ++ + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 19/215 (8%) Query: 2 IKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRIS 61 +L+ DD+ R L L V +NA + D++ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQ 117 +++ + + RP + + ++A + AIKA E+ A+DYL KP + L + R Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 118 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVT--SSEGKEGFT 175 E ++ L ++Q + + G S +A + + +T S GK Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGK---- 173 Query: 176 ELTLRTLESRTPLLRCHRQFL-VNMAHLQEIRLED 209 EL R L R + F+ +NMA + +E Sbjct: 174 ELVARALHDYGK--RRNGPFVAINMAAIPRDLIES 206
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 28.5 bits (63), Expect = 0.012 Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 16/134 (11%) Query: 3 RSLIVASVLSAVFMSAGVFAADEDMGELKINGEVVGTSCTFEGANSATIELSQVGVDRLT 62 R L + +L AV MS V AAD L G+++ +CT + +A + + + L Sbjct: 5 RGLCLPVMLGAVLMSQHVHAAD----NLTFKGKLIIPACTVQ---NAEVNWGDIEIQNLV 57 Query: 63 DL--NPGDIYTGYTSPEAILKVKCSNTANPR------ISFNRSQFVDNMQITKNNATNNG 114 N D P ++ +K + T+N + + + D + I N+ N+G Sbjct: 58 QSGGNQKDFTVDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSG 117 Query: 115 AGFAVYLDGTQVKP 128 G AV L G+QV P Sbjct: 118 IGNAVTL-GSQVTP 130
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 675 bits (1742), Expect = 0.0 Identities = 239/829 (28%), Positives = 389/829 (46%), Gaps = 28/829 (3%) Query: 13 AALLISPAWAEEETFDTNFMFG-GLKGEKVSRYQIDSTKPMAGVYEMDVYVNKEWRGTYE 71 A +P + E F+ F+ +SR++ + + G Y +D+Y+N + T + Sbjct: 35 AFAAQAPLSSAELYFNPRFLADDPQAVADLSRFE-NGQELPPGTYRVDIYLNNGYMATRD 93 Query: 72 VNIQDDPDST----CISPDLIASLGIK---FTPQSTTVENECIALKTVVHGGSVSYDTAA 124 V C++ +AS+G+ + + ++ C+ L +++H + D Sbjct: 94 VTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQ 153 Query: 125 FNLYLSVPQAYVLEYEAGYASPETWDRGINAFYTSYYASEYYSHYKSGGSEKNTYANFVS 184 L L++PQA++ GY PE WD GINA +Y S + GG+ Y N S Sbjct: 154 QRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQS 213 Query: 185 GLNLLGWQLHSNANFSKSEN-----LAGKWQSNTQYLERDFPAVLGTMRLGEQYTSGDMF 239 GLN+ W+L N +S + + KWQ +LERD + + LG+ YT GD+F Sbjct: 214 GLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIF 273 Query: 240 DTVRFRGVRFWRDMQMLPHSKQNFAPVVRDVAQSNALVTVEQNGFIVYQKEVPPGPFVFE 299 D + FRG + D MLP S++ FAPV+ +A+ A VT++QNG+ +Y VPPGPF Sbjct: 274 DGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTIN 333 Query: 300 DLQLAGGGADLDVSVKEADGTVSRFIVPYSSVPNMVQPGVAKYDFAAGRSRIEGASQQTD 359 D+ AG DL V++KEADG+ F VPYSSVP + + G +Y AG R A Q+ Sbjct: 334 DIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKP 393 Query: 360 -FLQGTYQYGVNNLLTLYGGTMLASDYRSFTLGTGWNT-LIGAVSVDGTLSHSKQDNGDV 417 F Q T +G+ T+YGGT LA YR+F G G N +GA+SVD T ++S + Sbjct: 394 RFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQ 453 Query: 418 FDGESYQVAWNKYLPQSATHFSLAAYRYSSRDYRTFNDHVWANNRDNYRRDDDDIYDI-- 475 DG+S + +NK L +S T+ L YRYS+ Y F D ++ D + + Sbjct: 454 HDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKP 513 Query: 476 --ADYYENDFGRKNTFTLNINQTLPDGWGYFTASALWRDYWGRSGTGKDYQLSYSNTWQR 533 DYY + ++ L + Q L S + YWG S + +Q + ++ Sbjct: 514 KFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFED 572 Query: 534 LSFTLSATQTYDSDNRE-DKRFNIYLSIPL--TWGVKENGGNRDIHLSNSTTFDDQGYEA 590 +++TLS 
+ T ++ + D+ + ++IP R S S + D G Sbjct: 573 INWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMT 632 Query: 591 NNTSLSGTFGNRDQFNYTTNLS---QQRQEHQTTFGGSVTWNAPLATVGGSYSQSNKYHQ 647 N + GT + +Y+ +T ++ + YS S+ Q Sbjct: 633 NLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQ 692 Query: 648 VGGNIQGGLVAWADGVHLASRLNDTIAIINAPYLEGAAVQGRPYLRTNAKGYAVFEALTP 707 + + GG++A A+GV L LNDT+ ++ AP + A V+ + +RT+ +GYAV T Sbjct: 693 LYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATE 752 Query: 708 YRQNFISLDVSGSESDVALLGNRKVTVPYRGAVVVVDFETETSKPFYFLARRADGEPLTF 767 YR+N ++LD + +V L VP RGA+V +F+ + +PL F Sbjct: 753 YRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPF 811 Query: 768 GYEVEDDEGNNVGLVGQGSRVFIRTEKVPISVKVATDKQQGLFCKITFD 816 G V + + G+V +V++ + V+V +++ C + Sbjct: 812 GAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQ 860
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 26.8 bits (59), Expect = 0.019 Identities = 10/43 (23%), Positives = 17/43 (39%), Gaps = 7/43 (16%) Query: 3 SKLLPCALLLATSFAWAAPA-------TTGIDQYELKSFIADF 38 ++L LLL +S++WA L+ + DF Sbjct: 10 KRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 33/153 (21%), Positives = 52/153 (33%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVTAAIRYGFFVYGGAETYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYPQPVN 372 + V D R G ++ C GFG + G LGG+M P Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGGFSPHAP---- 162 Query: 373 GLTFNWAGMWTFGAVMIAVIALLFMIFFRESDK 405 + A + + L ES K Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186 Score = 32.9 bits (75), Expect = 0.002 Identities = 54/286 (18%), Positives = 93/286 (32%), Gaps = 17/286 (5%) Query: 29 LNKSGFSAGEIGWSYACTAIAAILSPILVGSVTDRFFSAQKVLAVLMFAGAVLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLGY-NDISPTNIPLLITAASSALLGVFAFCLPDTPPKSTGKMDIKVMLGLDALILL 206 P + G SP + P AA + L + L K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLVTAAIRYGFFVYGGAETYFTYALLFLG 306 R G ++ L+LG++ Y + ++ L Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 26.9 bits (59), Expect = 0.020 Identities = 19/75 (25%), Positives = 24/75 (32%), Gaps = 2/75 (2%) Query: 32 FNAYGNKPRCLMCLGTTALFTGVFSGVCSGAVASVSSGAAYTTALTVLGASFGLGG--IG 89 N GNK + L L SG+ S AS A T A L +G Sbjct: 222 LNGVGNKLQNLPNLDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLG 281 Query: 90 MMGICAGLYLSANGV 104 +G Y+ A Sbjct: 282 NVGKGISQYIIAQRA 296
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 83.7 bits (207), Expect = 5e-24 Identities = 29/118 (24%), Positives = 45/118 (38%), Gaps = 3/118 (2%) Query: 6 DRLLRQFSLKLNADSIAFDENRLCSFIIDNRYRI-LLTSTNSEYIMIYGFCGRPPDNNNL 64 LL FS L + FD++ C+ IIDN + + L E +++ G P + Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLE--PHKDIP 64 Query: 65 AFEFLNSNLWFAENNGPHLCYDNNSQSLLLALNFSLNESSVEKLECEIEVVIRSMENL 122 L L N GP L D S + + SV L+ E+ ++ M Sbjct: 65 QQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 8e-18 Identities = 27/140 (19%), Positives = 65/140 (46%), Gaps = 2/140 (1%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLINHGDKVLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + ++ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-RRC 128 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 129 KPQRELQQQDAESPLMIDES 148 + +L+ + ++ S Sbjct: 124 RRPSKLEDDSQDGMPLVGRS 143
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.010 Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 14/66 (21%) Query: 187 RGLLAPVKRLVEGTHRLAAGDFTTRVTPTSADEL-----------GKLAQDFNQLASTLE 235 L+A V+ V H LA + P S + L G L N+LA E Sbjct: 104 SQLMAAVRSKVMEGHSLAD---AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTE 160 Query: 236 KNQQMR 241 + QQMR Sbjct: 161 QRQQMR 166
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 124 bits (313), Expect = 3e-33 Identities = 95/450 (21%), Positives = 197/450 (43%), Gaps = 25/450 (5%) Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMVVVSYVLTVAVMLPASGWLADKIGVRNIFF 79 F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 AAIVLFTLGSLFCALSGTLNQ-LVLARVLQGVGGAMMVPVGRLTVMKIVPRAQYMAAMTF 138 I++ GS+ + + L++AR +QG G A + + V + +P+ A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQIGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAMATFMLMPNYTIETRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP + I+ L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 PGFLLLAIGMAVLTLALDGSKSMGISPWTLAGLAAGGAAAILLYLFHAKKSSGALFSLRL 257 G +L+++G+ L + L + L+++ H +K + L Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLIVSVL---------SFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRTPTFSLGLLGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 + F +G+L M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQIVNRFGYRRVLVATTLGLALVSLLFMSVALL----GWYYLLPLVLLLQGMVNSARFS 372 +V+R G VL +G+ +S+ F++ + L W+ + +V +L G+ S + Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367 Query: 373 SMNTLTLKDLPDTLASSGNSLLSMIMQLSMSIGVTIAGMLL--GMFGQQHIGIDSSATHH 430 ++T+ L A +G SLL+ LS G+ I G LL + Q+ + ++ + + Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427 Query: 431 VFMYTWLCMAVIIALPAIIFARVPNDTQQN 460 ++ L + II + ++ V +Q++ Sbjct: 428 LYSNLLLLFSGIIVISWLVTLNVYKHSQRD 457
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 875 bits (2263), Expect = 0.0 Identities = 282/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILIAAAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ ++A + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPGGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSES--WSQGKLYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ S++ +Q + D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDEVREAIDSANVRRPQGAIEDSV------HRWQIQTNDELK 236 A+R+ L+ L ++ +V + N + G + + I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGAAVRLGDVASVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G+ VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDGIRAKLPELRAMIPAAIDLQIAQDRSPTIRASLQEVEETLAISVALVILVVFLFLRS 355 T I+AKL EL+ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVISMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LVVSLTLTPMMCGWMLKSSKPRTQPRKRGVG----RLLVALQQGYGTSLKWVLNHTRLVG 530 ++V+L LTP +C +LK K G Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 
VVFLGTVALNIWLYIAIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 +++ VA + L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVNNVTGFT-GGSRVNSGMMFITLKPRGER---KETAQQIIDRLRVKLAKEPGAR 641 + +V V GF+ G N+GM F++LKP ER + +A+ +I R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDSLPALREWEPKIRKALSAL-----PQLADVNSD 696 + + I G ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMNLIYDRDTMSRLGIDVQAANSLLNNTFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDISALEKMFVINRDGKAIPLSYFAQWRPANAPLSVNHQGLSAASTIAFNLPTGTSLSQ 816 ++K++V + +G+ +P S F + + I GTS Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ATEAINRTMTQLGVPSTVRGSFSGTAQVFQQTMNSQLILIVAAIATVYIVLEILYESYVH 876 A + ++L P+ + ++G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSASVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRSGG 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + G Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPEQAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 +A A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 885 bits (2289), Expect = 0.0 Identities = 292/1036 (28%), Positives = 504/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLLMAAILLAGIIGYRFLPVAALPEVDYPTIQVVTLYPGASPDVMTSSV 72 + FI RP+ +L +++AG + LPVA P + P + V YPGA + +V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPIYSKVNPADPPIMTLAVTSNAMPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + I S + +M S+ TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAIRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------ERAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G + ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSADEYRKLII-AYQNGAPVRLGDVATVEQGAENSWLGAWANQAPAIVMNVQRQPGANI 302 ++ +E+ K+ + +G+ VRL DVA VE G EN + A N PA + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCACML---SQQSLRKQNRFSRACERMFDRVIASYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S + + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 
TLSVAFATLLLSVMLWIVIPKGFFPVQDNGIIQGTLQAPQSSSYASMAQRQRQVAERILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV + L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTTFVGVDGANPTLNSARLQINLKPLDARDDR---VQQVISRLQTAVATIPV 653 + V+S+ T G + N+ ++LKP + R+ + VI R + + I Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 VALYLQPTQDLTIDTQVSRTQYQFTLQ---ATTLDALSHWVPKL-QNALQSLPQLSEVSS 709 ++ P I + T + F L DAL+ +L A Q L V Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDRGLAAWVNVDRDSASRLGISIADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTA 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 STPGLAALETIRLTSRDGGTVSLSAIARIEQRFAPLSINHLDQFPVTTFSFNVPEGYSLG 829 ++ + + S +G V SA + + + P G S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQAILDTEKTLALPADITTQFQGSTLAFQAALGSTVWLIVAAVVAMYIVLGVLYESFI 889 DA+ + + LPA I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALIIAGSELDIIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + D+ ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPRDAIFQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIAMVGGLLVSQV 1009 G +A A +R RPILMT+LA +LG LPL +S G G+ + +GI ++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 106 bits (267), Expect = 2e-28 Identities = 78/361 (21%), Positives = 120/361 (33%), Gaps = 58/361 (16%) Query: 6 LITGVTGQDGSYLAEFLLEKGYEVHGIKRRASSFNTERVDHIYQDPH--------SCNPK 57 L+TG G G ++++ LLE G++V GI + N Y D P Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQPG 53 Query: 58 FHLHYGDLTDASNLTRILQEVQPDEVYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAI 117 F H DL D +T + + V+ V S E+P AD + G L +LE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 118 RFLGLEKKTRFYQASTSELYGLVQEIPQKETTPF-YPRSPYAVAKLYAYWITVNYRESYG 176 R ++ AS+S +YGL +++P +P S YA K + Y YG Sbjct: 114 RHNKIQ---HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 177 IYACNGILFNHESPRRGETFVTRKITRAIANIAQGLESCLYLGNMDSLRDWGHAKDYVRM 236 + A F P K T+A+ G +Y RD+ + D Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDDIAEA 226 Query: 237 QWMMLQQEQPEDFVIATGVQYSVRQFVELAAVQLGIKLRFEGEGINEKGIVVSVTGHDAP 296 + D N G+ +P Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAAS-------------IAPYRVYN--------IGNSSP 265 Query: 297 GVKPGDVIVAV--------DPRY--FRPAEVETLLGDPSKAHEKLGWKPEITLSEMVSEM 346 V+ D I A+ +P +V D +E +G+ PE T+ + V Sbjct: 266 -VELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324 Query: 347 V 347 V Sbjct: 325 V 325
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 87.9 bits (218), Expect = 6e-22 Identities = 64/344 (18%), Positives = 128/344 (37%), Gaps = 47/344 (13%) Query: 5 RIFVAGHRGMVGSAIVRQLAQRG-------------DVEL------VLRTRD----ELDL 41 + V G G +G + ++L + G DV L +L ++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 42 LDGRAVQAFFARAGIDQVYLAAAKVGGIVANNTYPADFIYENMMIESNIIHAAHLHNVNK 101 D + FA ++V+++ + + + P + N+ NI+ + + Sbjct: 62 ADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 LLFLGSSCIYPKLARQPMAESELLQGTLEPTNEPYAIAKIAGIKLCESYNRQYGRDYRSV 161 LL+ SS +Y + P + P + YA K A + +Y+ YG + Sbjct: 121 LLYASSSSVYGLNRKMPFSTD---DSVDHPVS-LYAATKKANELMAHTYSHLYGLPATGL 176 Query: 162 MPTNLYGPHDNFHPDNSHVIPALLRRFHEAAQSHAPEVVVWGSGTPMREFLHVDDMAAAS 221 +YGP PD AL + + + +V + G R+F ++DD+A A Sbjct: 177 RFFTVYGPWGR--PDM-----ALFKFTKAMLEGKSIDV--YNYGKMKRDFTYIDDIAEAI 227 Query: 222 IHVMELA----REVWQENTAPMLSH-----INVGTGVDCTIRELAQTIAKVVGYQGRVVF 272 I + ++ + E P S N+G + + Q + +G + + Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287 Query: 273 DAAKPDGTPRKLLDVTRLHQ-LGWYHEISLEAGLAGTYQWFLEN 315 +P D L++ +G+ E +++ G+ W+ + Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 176 bits (449), Expect = 1e-54 Identities = 83/359 (23%), Positives = 142/359 (39%), Gaps = 50/359 (13%) Query: 1 MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLT--YAGNLE--SLSDISESNRYNFEH 56 MK L+TG AGFIG V + +++ VV ID L Y +L+ L +++ + F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPG-FQFHK 58 Query: 57 ADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWS 116 D+ D +T +F + V V S+ P A+ ++N+ G +LE R Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-- 116 Query: 117 ALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDH 176 + S+ VYG +P T+ + P S Y+A+K +++ Sbjct: 117 -------KIQHLLYASSSSVYGLNRK---------MPFSTDDSVDHPVSLYAATKKANEL 160 Query: 177 LVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYV 236 + + YGLP YGP+ P+ + LEGK + +Y G RD+ Y+ Sbjct: 161 MAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI 220 Query: 237 EDHA----RALHMVVTEGKA--------------GETYNIGGHNEKKNLDVVFTICDLLD 278 +D A R ++ YNIG + + +D + + D L Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280 Query: 279 EIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLAN 337 +A + + +PG + D + +G+ P T + G++ V WY Sbjct: 281 ---IEA-----KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 40.9 bits (96), Expect = 3e-06 Identities = 27/160 (16%), Positives = 57/160 (35%), Gaps = 23/160 (14%) Query: 1 MNILLFGKTGQVGWELQRSLAPVGN-LIALDV-----------HSKEFC---------GD 39 M L+ G G +G+ + + L G+ ++ +D E D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 40 FSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANEIG-AW 98 ++ +G+ + + + + AV + P N T I + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 99 VVHYSTDYVFPGTGDIPWQETDATS-PLNVYGKTKLAGEK 137 +++ S+ V+ +P+ D+ P+++Y TK A E Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 73.3 bits (180), Expect = 1e-16 Identities = 62/352 (17%), Positives = 121/352 (34%), Gaps = 48/352 (13%) Query: 11 RVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLMES----HIGDI 66 + VTG GF G +S L E G V G + RL L + H D+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 67 RDFEKLRSSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVDNIKA 126 D E + A E VF + VR S E P +N+ G +++LE + I+ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-KIQH 120 Query: 127 VVNITSDKCYDNREWVWGYRENEPMGGYD-------PYSNSKGCAELVASAFRNSFFNPA 179 ++ +S V+G P D Y+ +K EL+A + + + Sbjct: 121 LLYASSSS-------VYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY---- 169 Query: 180 NYEQHGVGLASVRAGNVIGGGDWAK-DRLIPDILRSFENNQQVIIRNPYSI-RPWQHVLE 237 G+ +R V G W + D + ++ + + + N + R + ++ + Sbjct: 170 -----GLPATGLRFFTVY--GPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDD 222 Query: 238 PLSGYIVVAQRLYTEGAKFSEG-------------WNFGPRDEDAKTVEFIVDKMVTLWG 284 I + + +++ +N G + + + + G Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIG--NSSPVELMDYIQALEDALG 280 Query: 285 DDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLSRIVKWHKAW 336 +A + P + D +G+ P + + + V W++ + Sbjct: 281 IEAKKNML-PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.8 bits (69), Expect = 0.012 Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 9/82 (10%) Query: 209 GDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQ 268 G +G S + E A+ + GEL+ ++ WGR DN G+RF Q+ Sbjct: 629 GGVGLAS-----TLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQK 683 Query: 269 LGSLPQGYDHKYTYS----HLG 286 + G DH + HLG Sbjct: 684 VAGFELGADHAVAVAGGRWHLG 705
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 64.0 bits (156), Expect = 6e-14 Identities = 58/329 (17%), Positives = 115/329 (34%), Gaps = 57/329 (17%) Query: 1 MKILIMGAFGFLGSRLTSYFESR-HTVIGL---------ARKRNNEATINNIIYT----- 45 MK L+ GA GF+G ++ H V+G+ + K+ + + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 46 -TENNWIEKIL-EFEPNIIINTIACYG-RHN-EPATALIESNILMPIRVLE--------- 92 + + + + + R++ E A +SN+ + +LE Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 93 ----SISSL--DAVFINCGTSLPPNT--SLYAYTKQKANEFAAAIIDKVCG-KYIELKLE 143 S SS+ + T + SLYA TK KANE A + G L+ Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATK-KANELMAHTYSHLYGLPATGLRFF 179 Query: 144 HFYGAFDGDDKFTSMVIRRCLSNQPVKL-TSGLQQRDFLYIKDL----LTAFDCIISNVN 198 YG + D + L + + + G +RDF YI D+ + D I Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239 Query: 199 NFPKFHS-----------IEVGSGEATSIREYVETVKNITKSNSIIEFGVVKERVNELMY 247 + +G+ + +Y++ +++ + + + +++ Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM--LPLQPGDVLE 297 Query: 248 SCADIAELEK-IGWKREFSLVDALTEIIE 275 + AD L + IG+ E ++ D + + Sbjct: 298 TSADTKALYEVIGFTPETTVKDGVKNFVN 326
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 157 bits (398), Expect = 2e-47 Identities = 84/355 (23%), Positives = 151/355 (42%), Gaps = 55/355 (15%) Query: 9 LITGGCGFLGSNLASFALSQGIDLIVFDNL------SRKGATDNLHWLSSLGNFEFVHGD 62 L+TG GF+G +++ L G ++ DNL S K A L L+ F+F D Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQA--RLELLAQ-PGFQFHKID 60 Query: 63 IRNKNDVTRLITKYMPDSCFHLAGQVAMTTSIDNPCMDFEINVGGTLNLLEAVRQYNSNC 122 + ++ +T L + F ++A+ S++NP + N+ G LN+LE R Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ- 119 Query: 123 NIIYSSTNKVYGDLEQYKYNETETRYTCVDKPNGYDESTQLDFHSPYGCSKGAADQYMLD 182 +++Y+S++ VYG + ++ ++ VD P S Y +K A + Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDS----VDHP-----------VSLYAATKKANELMAHT 164 Query: 183 YARIFGLNTVVFRHSSMYG--GRQFATYDQGWVGWFCQKAVEIKNGINKPFTISGNGKQV 240 Y+ ++GL R ++YG GR + F + + G K + GK Sbjct: 165 YSHLYGLPATGLRFFTVYGPWGRPDMALFK-----FTKA---MLEG--KSIDVYNYGKMK 214 Query: 241 RDVLHAEDMI-------SLYFTALANVSKIRGNA---------FNIGGTIVNSLSLLELF 284 RD + +D+ + A + G +NIG + + + L++ Sbjct: 215 RDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNS--SPVELMDYI 272 Query: 285 KLLEDYCNIDMRFTNLPVRESDQRVFVADIKKITNAIDWSPKVSAKDGVQKMYDW 339 + LED I+ + LP++ D AD K + I ++P+ + KDGV+ +W Sbjct: 273 QALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 582 bits (1501), Expect = 0.0 Identities = 200/395 (50%), Positives = 279/395 (70%), Gaps = 5/395 (1%) Query: 4 KIMAINAGSSSLKFQLLEMPQGDMLCQGLIERIGMADAQVTIKTHNQKWQETVPVADHRD 63 KI+ IN GSSSLK+QL+E G++L +GL ERIG+ D+ +T + +K + + DH+D Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 64 AVTLLLEKLLG--YQIINSLRDIDGVGHRVAHGGEFFKDSTLVTDETLAQIERLAELAPL 121 A+ L+L+ L+ Y +I + +ID VGHRV HGGE+F S L+TD+ L I ELAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 122 HNPVNALGIHVFRQLLPDAPSVAVFDTAFHQTLDEPAYIYPLPWHYYAELGIRRYGFHGT 181 HNP N GI Q++PD P VAVFDTAFHQT+ + AY+YP+P+ YY + IR+YGFHGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 182 SHKYVSGVLAEKLGVPLSALRVICCHLGNGSSVCAIKNGRSVNTSMGFTPQSGVMMGTRS 241 SHKYVS AE L P+ +L++I CHLGNGSS+ A+KNG+S++TSMGFTP G+ MGTRS Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 242 GDIDPSILPWIAQRESKTPQQLNQLLNNESGLLGVSGVSSDYRDVEQAA-NTGNRQAKLA 300 G IDPSI+ ++ ++E+ + +++ +LN +SG+ G+SG+SSD+RD+E AA G+++A+LA Sbjct: 242 GSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLA 301 Query: 301 LTLFAERIRATIGSYIMQMGGLDALVFTGGIGENSARARSAVCHNLQFLGLAVDEEKNQR 360 L +FA R++ TIGSY MGG+D +VFT GIGEN R + L+FLG +D+EKN+ Sbjct: 302 LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKV 361 Query: 361 NA--TFIQTENALVKVAVINTNEELMIAQDVMRIA 393 I T ++ V V V+ TNEE MIA+D +I Sbjct: 362 RGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIV 396
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 26.6 bits (58), Expect = 0.047 Identities = 11/24 (45%), Positives = 14/24 (58%) Query: 93 IGLVTKADLADPQRISLVAQWLTQ 116 +G A L+DPQ S AQWL + Sbjct: 171 LGKTAAARLSDPQAASHTAQWLVE 194
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 30.3 bits (68), Expect = 0.014 Identities = 8/39 (20%), Positives = 16/39 (41%) Query: 190 SDFIDALAEKAAKLVFQYLPTAVEKGDCVATRGKMHNAS 228 SDF ++ K LV+ +L + + + G + Sbjct: 518 SDFSKVVSSKDKSLVYSFLDNLMSYLETIKNDGPIDTDK 556
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 27.6 bits (61), Expect = 0.019 Identities = 12/30 (40%), Positives = 15/30 (50%) Query: 97 PPPSVIEPEPEESEIADVVSEAPAEEAPQE 126 PP V+EPEPE I + EAP + Sbjct: 64 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 93
>PF03627#PapG Length = 336 Score = 36.9 bits (85), Expect = 1e-04 Identities = 22/93 (23%), Positives = 34/93 (36%), Gaps = 8/93 (8%) Query: 327 DDHVLDAVLPPDIP-------IPSIAEVQRALYDATKAVSGMPGEEVKQRLRTGTVVTTD 379 DD + LP D+P IP + +QR A +P K R ++ Sbjct: 152 DDIIFKVALPADLPLGDYSVTIPYTSGMQRHFASYLGARFKIPYNVAKTLPRENEMLFLF 211 Query: 380 DRNWELRYSASALRFNLSRAVAIDMESATIAAQ 412 R SA +L ++I+ + AAQ Sbjct: 212 KNIGGCRPSAQSLEIKHGD-LSINSANNHYAAQ 243
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 213 bits (543), Expect = 5e-71 Identities = 231/260 (88%), Positives = 246/260 (94%) Query: 1 MIQVTSEQWLYWLHLYFWPLLRVLALISTAPILSERAIPKRVKLGLGIMITLVIAPSLPA 60 M+QVTSEQWL WL+LYFWPLLRVLALISTAPILSER++PKRVKLGL +MIT IAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDTPLFSIAALWLAMQQILIGIALGFTMQFAFAAVRTAGEFIGLQMGLSFATFVDPGSHL 120 ND P+FS ALWLA+QQILIGIALGFTMQFAFAAVRTAGE IGLQMGLSFATFVDP SHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLAMLLFLTFNGHLWLISLLVDTFHTLPIGSNPVNSNAFMALARAGGLIF 180 NMPVLARIMDMLA+LLFLTFNGHLWLISLLVDTFHTLPIG P+NSNAF+AL +AG LIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPVITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGIMLMAALMPLIAPFC 240 LNGLMLALP+ITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGI LMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEIFNLLADIVSEMPI 260 EHLFSEIFNLLADI+SE+P+ Sbjct: 241 EHLFSEIFNLLADIISELPL 260
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.5 bits (165), Expect = 1e-18 Identities = 23/78 (29%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALITGLIISILQAATQINEMTLSFIPKIVAVFIAII 63 + ++ G +A+ + L L+ +VA I GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 328 bits (842), Expect = e-117 Identities = 224/245 (91%), Positives = 232/245 (94%) Query: 1 MRRLLFLSLAGLWLFSPAAAAQLPGLISQPLAGGGQSWSLSVQTLVFITSLTFLPAILLM 60 MRRLL ++ LWL +P A AQLPG+ SQPL GGGQSWSL VQTLVFITSLTF+PAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEQK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE+K Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALDKGAQPLCAFMLRQTREADLALFARLANSGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEAL+KGAQPL FMLRQTREADL LFARLAN+GPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 209 bits (534), Expect = 2e-73 Identities = 136/137 (99%), Positives = 136/137 (99%) Query: 1 MSDMNNPSDENTGALDDLWADALNEQKATTNKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 MSDMNNPSDENTGALDDLWADALNEQKATT KSAADAVFQQLGGGDVSGAMQDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 383 bits (984), Expect = e-135 Identities = 86/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--DTKDEPTPGIASDSDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S D E I+ I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RQFRMGLFNLLRRSPDITVGTIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 HEDQNWRDNLVRQVQHSELELVANFADIPLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297 + L ++ ++++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTVNGQYALRVEHLI 321 Q G V + A ++ I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 406 bits (1045), Expect = e-143 Identities = 193/411 (46%), Positives = 233/411 (56%), Gaps = 40/411 (9%) Query: 1 MITLPQLITTDTDMTAGLTSGKTTGSAEDFLALLAGALGADGAQGKDARITLADLQAAGG 60 MI L LIT D D T L GK + +A+DFLALL+ AL + K A L Sbjct: 1 MIRLAPLITADVDTTT-LPGGKASDAAQDFLALLSEALAGETTTDKAAPQLL-------- 51 Query: 61 KLSKELLTQHGEPGQAVKLADLLAQKAN---ATDETLTDLTQAQHLLSTLTLSLKTSALA 117 ++ + T GEP + ++D AQ+AN DET + Q + LT + + A Sbjct: 52 -VATDKPTTKGEPLISDIVSD--AQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAA 108 Query: 118 ALSKTAQHDEKTPALSDEDLASLSALFAMLPGQPVATPVAGETPAENHIALPSLLRGDMP 177 K DEK L+++ ASLSALFAMLPG V D P Sbjct: 109 VADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVT-----------------DAP 151 Query: 178 SAPQEETHTLSFSEHEKGKTEASLARASDDRATGPALTPLVVAAAATSAKVEVDSPPAPV 237 S F++ T L A D A G PL A +K EV S P+PV Sbjct: 152 STVLPTEKPTLFTK----LTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPV 207 Query: 238 THGAAMPTLSSATAQPQPLPVASAPVLSAPLGSHEWQQTFSQQVMLFTRQGQQSAQLRLH 297 T AA P ++ QP LP +APVLSAPLGSHEWQQ+ SQ + LFTRQGQQSA+LRLH Sbjct: 208 T-AAASPLITPHQTQP--LPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLH 264 Query: 298 PEELGQVHISLKLDDNQAQLQMVSPHSHVRAALEAALPMLRTQLAESGIQLGQSSISSES 357 P++LG+V ISLK+DDNQAQ+QMVSPH HVRAALEAALP+LRTQLAESGIQLGQS+IS ES Sbjct: 265 PQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGES 324 Query: 358 FAGQQQ-SSSQQQSSRAQHTDAFGAEDDIALAAPASLQAAARGNGAVDIFA 407 F+GQQQ +S QQQS R + + EDD L P SLQ GN VDIFA Sbjct: 325 FSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSGVDIFA 375
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 206 bits (526), Expect = 4e-72 Identities = 130/147 (88%), Positives = 138/147 (93%) Query: 1 MAQHGALETLKDLAEKEVDDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRSNLNTDMGNG 60 MA+HGAL TLKDLAEKEV+DAARLLGEMRRGCQQAEEQLKMLIDYQNEYR+NLN+DM G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 IASNRWINYQQFIQTLEKAIEQHRLQLTQWTQKVDLALKSWREKKQRLQAWQTLQDRQTA 120 I SNRWINYQQFIQTLEKAI QHR QL QWTQKVD+AL SWREKKQRLQAWQTLQ+RQ+ Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRMDQKKMDEFAQRAAMRKPE 147 AALLAENR+DQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 35.2 bits (81), Expect = 4e-04 Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 10/83 (12%) Query: 123 QLPFAWPLSVILMLTALAALY--YHLPALLLFIVPLWLT-ALLASVQLNQYMNIRFLLVW 179 Q P +S +++ LAALY + +P ++ +VPL + LLA+ NQ ++ F++ Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930 Query: 180 LTL------TAILIYGRFILQRW 196 LT AILI F Sbjct: 931 LTTIGLSAKNAILIVE-FAKDLM 952
>PilS_PF08805#PilS N terminal Length = 185 Score = 28.7 bits (64), Expect = 0.013 Identities = 5/34 (14%), Positives = 13/34 (38%), Gaps = 2/34 (5%) Query: 112 WTLITSI--LIIIAVAVVLAISSMNAAFRSLNIN 143 TL+ + + +I V A + ++ + Sbjct: 28 ATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSS 61
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.4 bits (63), Expect = 0.008 Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 2/60 (3%) Query: 60 WLCIDYLWVSESARSRGLGSQLMEMAEKEGLRKGCVHGLVDTFSFQ--ALPFYEKQGYIL 117 + I+ + V++ R +G+G+ L+ A + +++T A FY K +I+ Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 341 bits (877), Expect = e-117 Identities = 123/345 (35%), Positives = 177/345 (51%), Gaps = 22/345 (6%) Query: 2 AEFKDNLLGEANRFLEVLEQVSRLAPLDKPVLIIGERGTGKELIANRLHYLSSRWQGPLI 61 ++ L+G + E+ ++RL D ++I GE GTGKEL+A LH R GP + Sbjct: 133 SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFV 192 Query: 62 SLNCAALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMLVQEKLL 121 ++N AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LL Sbjct: 193 AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLL 252 Query: 122 RVIEYGELERVGGSQPLQVNVRLVCATNADLPAMVKKGTFRADLLDRLAFDVVQLPPLRE 181 RV++ GE VGG P++ +VR+V ATN DL + +G FR DL RL ++LPPLR+ Sbjct: 253 RVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRD 312 Query: 182 RQSDTMLMAEHFAIQMCRELRLPLFPGFTDRAKETLLHYAWPGNVRELKNVVERSVYRHG 241 R D + HF Q +E F A E + + WPGNVREL+N+V R + Sbjct: 313 RAEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYP 370 Query: 242 SSE--------HPLDEIVIDPFQRHPAEPPTPALPSA------------SATPDLPLNLR 281 EI P ++ A + ++ A Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430 Query: 282 EFQLQQEKALLQRSLQQAKFNQKRAADLLALTYHQFRALLKKHQL 326 + E L+ +L + NQ +AADLL L + R +++ + Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 28.6 bits (64), Expect = 0.019 Identities = 19/104 (18%), Positives = 43/104 (41%), Gaps = 5/104 (4%) Query: 40 LVEVRSNSARALAEKKQLSRRIEQATAQQTEWQEKAELA-LRKDKDDLARAALIEKQKLT 98 + + R + +L K+ +++ + + EL + + + L K++ Sbjct: 232 VEKSRLDDFSSLLHKQAIAK-HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290 Query: 99 DLIATLEQEVTLVDDTLARMKKEIGELENKLSETRARQQALMLR 142 + + E+ D L + IG L +L++ RQQA ++R Sbjct: 291 LVTQLFKNEIL---DKLRQTTDNIGLLTLELAKNEERQQASVIR 331
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 86.6 bits (214), Expect = 1e-22 Identities = 67/249 (26%), Positives = 112/249 (44%), Gaps = 24/249 (9%) Query: 7 KSVLVLGGSRGIGAAIVRRFSADGASVV-FSYAGSR----EAAEKLAAETGSTAIQTDSA 61 K + G ++GIG A+ R ++ GA + Y + ++ K A + A D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARH-AEAFPADVR 67 Query: 62 DRDAVISLV----REYGPLDILVVNAGVALFGDALEQDSDAIDRLFRINIHAPYHASVEA 117 D A+ + RE GP+DILV AGV G + + F +N ++AS Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 118 ARNMP--EGGRIIIIGSVNGDRMPVPGMAAYAASKSALQGLARGLARDFGPRGITINVVQ 175 ++ M G I+ +GS N +P MAAYA+SK+A + L + I N+V Sbjct: 128 SKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 176 PGPIDTDI--------NPEDGPMKELMHSF---MAIKRHGRPKEVAGMVAWLAGPEASFV 224 PG +TD+ N + +K + +F + +K+ +P ++A V +L +A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 225 TGAMHTIDG 233 T +DG Sbjct: 247 TMHNLCVDG 255
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 45.4 bits (107), Expect = 3e-08 Identities = 14/115 (12%), Positives = 39/115 (33%), Gaps = 5/115 (4%) Query: 6 SRTPGRPRQFDPEQAIKTAQHLFHSRGYDAVSVADLTKAFGINPPSFYAAFGSKLGLYTR 65 +T ++ + + A LF +G + S+ ++ KA G+ + Y F K L++ Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 66 VLK----RYRMTDAIPLGALLRHDRPTAKCLIDVLMEAARRYAADPDATGCLVLE 116 + + + + ++ ++E+ + + Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116
>adhesinb#Adhesin B signature. Length = 310 Score = 27.1 bits (60), Expect = 0.006 Identities = 9/48 (18%), Positives = 18/48 (37%), Gaps = 6/48 (12%) Query: 1 MQKCSLITVISLSVLMLAGCTTTYTMTTRTGEIIETQGKPEVDTATGM 48 M+KC + ++ L+ + LA C++ K V + Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQ------KSSTETGSSKLNVVATNSI 42
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 76.1 bits (187), Expect = 3e-17 Identities = 48/194 (24%), Positives = 84/194 (43%), Gaps = 3/194 (1%) Query: 8 LVWLAGLSVLGFLATDMYLPAFAAIQADLQTPAAAVSASLSLFLAGFAVAQLLWGPLSDR 67 L+WL LS L + + I D P A+ + + F+ F++ ++G LSD+ Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 68 YGRKPILLLGLSIFALGSLGMLWVESAAALLTL-RFVQAVGVCAATVIWQALVTDYYPSQ 126 G K +LL G+ I GS+ S +LL + RF+Q G A + +V Y P + Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 127 KINRIFATIMPLVGLSPALAPLLGSWILTHFSWQAIFATLFVITLLLMLPALRLKPSVKA 186 + F I +V + + P +G I + W + L + ++ +P L + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEV 193 Query: 187 RTEGQDKLTFATLL 200 R +G + L+ Sbjct: 194 RIKGHFDIKGIILM 207
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 386 bits (992), Expect = e-136 Identities = 125/350 (35%), Positives = 203/350 (58%), Gaps = 4/350 (1%) Query: 2 SEKTEQPTEKKLRDGRKEGQVVKSIEITSLFQLIALYLYFHFFTEKMILILIESITFTLQ 61 EKTEQPT KK+RD RK+GQV KS E+ S ++AL ++ + + + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62 Query: 62 LVNKPFSYALTQL-SHALIESLTSALLFLGAGVIVATVGSVFLQVGVVIASKAIGFKSEH 120 PFS AL+ + + L+E L ++A + S +Q G +I+ +AI + Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA-IASHVVQYGFLISGEAIKPDIKK 121 Query: 121 INPVSNFKQIFSLHSVVELCKSSLKVIMLSLIFAFFFYYYASTFRALPYCGLACGVLVVS 180 INP+ K+IFS+ S+VE KS LKV++LS++ T LP CG+ C ++ Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 181 SLIKWLWVGVMAFYIVVGILDYSFQYYKIRKDLKMSKDDVKQEHKDLEGDPQMKTRRREM 240 +++ L V ++V+ I DY+F+YY+ K+LKMSKD++K+E+K++EG P++K++RR+ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 241 QSEIQSGSLAQSVKQSVAVVRNPTHIAVCLGYHPTDMPIPRVLEKGSDAQANYIVNIAER 300 EIQS ++ ++VK+S VV NPTHIA+ + Y + P+P V K +DAQ + IAE Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 301 NCIPVVENVELARSLFFEVERGDKIPETLFEPVAALLRMVMK--IDYAHS 348 +P+++ + LAR+L+++ IP E A +LR + + I+ HS Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 164 bits (417), Expect = 4e-52 Identities = 55/229 (24%), Positives = 100/229 (43%), Gaps = 5/229 (2%) Query: 8 WLIALAVAFIRPLSLSLLLPLLKSGSLGSAILRNGVLMSLTFPILPIIYQQKIMMHIGKD 67 WL +R L+L P+L S+ + + G+ M +TF I P + + + Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPKRV-KLGLAMMITFAIAPSLPANDVPVF---S 67 Query: 68 YSWLGLVTGEVIIGFLIGFCAAVPFWAVDMAGFLLDTLRGATMGTIFNSTMEAETSLFGL 127 + L L +++IG +GF F AV AG ++ G + T + + Sbjct: 68 FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLAR 127 Query: 128 LFSQFLCVIFFISGGVEFILNILYESYQYLPPGRTLLFDRQFLKYIQAEWRTLYQLCISF 187 + ++F G +++++L +++ LP G L FL +A ++ + Sbjct: 128 IMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSL-IFLNGLML 186 Query: 188 SLPAIICMVLADLALGLLNRSAQQLNVFFFSMPLKSILVLLTLLISFPY 236 +LP I ++ +LALGLLNR A QL++F PL + + + P Sbjct: 187 ALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPL 235
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 72.5 bits (178), Expect = 9e-21 Identities = 30/85 (35%), Positives = 50/85 (58%) Query: 4 SELTQFVTQLLWIVLFTSMPVVLVASVVGVIVSLVQALTQIQDQTLQFMIKLLAIAITLM 63 +L + L++VL S +VA+++G++V L Q +TQ+Q+QTL F IKLL + + L Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VSYPWLSGILLNYTRQIMLRIGEHG 88 + W +LL+Y RQ++ G Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 231 bits (592), Expect = 9e-80 Identities = 79/215 (36%), Positives = 130/215 (60%), Gaps = 8/215 (3%) Query: 8 LQLIGILFLLSILPLIIVMGTSFLKLAVVFSILRNALGIQQVPPNIALYGLALVLSLFIM 67 + LI +L ++LP II GT F+K ++VF ++RNALG+QQ+P N+ L G+AL+LS+F+M Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64 Query: 68 GPTLLAVKERWHPVQVAGAPFWT-SEWDSKALAPYRQFLQKNSEEKEANYFRNLIKRTWP 126 P + + V + S+ + L YR +L K S+ + +F N + Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124 Query: 127 ED-------IKRKIKPDSLLILIPAFTVSQLTQAFRIGLLIYLPFLAIDLLISNILLAMG 179 + K +I+ S+ L+PA+ +S++ AF+IG +YLPF+ +DL++S++LLA+G Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184 Query: 180 MMMVSPMTISLPFKLLIFLLAGGWDLTLAQLVQSF 214 MMM+SP+TIS P KL++F+ GW L L+ + Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 53.9 bits (129), Expect = 2e-10 Identities = 59/291 (20%), Positives = 96/291 (32%), Gaps = 33/291 (11%) Query: 31 QYPVQQGTLFTINYHNELGRVWIAEQCWQRWCEGLIGTANRSAIDPELLYGIAEWGVAPL 90 +YP +QG ++ + WI W + A SA AE V P Sbjct: 32 EYPTRQGMWVRLSDAEKRWSAWIKPGDWLEHVSPALAGAAVSAG--------AEHLVVPW 83 Query: 91 LQASDATLCQNEPPTSCSNLPHQLALHIKWTVEEHEFHSIIFTWPTGFLRNIVGELSAER 150 L A++ P SC L VE S + P G L +I+ + Sbjct: 84 LAATERPFELPVPHLSCRRL----------CVENPVPGSAL---PEGKLLHIMSDRGGLW 130 Query: 151 QQIYPAPPVVVPVYLGWCQLTLIELESIEIGMG-VRIHCFGDIRLGFFAIQLPGGIYARV 209 + P P V L IG + G I +G + L A V Sbjct: 131 FEHLPELPAVGGGRPK----MLRWPLRFVIGSSDTQRSLLGRIGIG--DVLLIRTSRAEV 184 Query: 210 LLTEDNTMKFDELVQDIETLLASGSPMSKSDGTSSV-----ELEQIPQQVLFEIGRASLE 264 F+ + I + + + T+ L Q+P ++ F + R ++ Sbjct: 185 YCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVT 244 Query: 265 IGQLRQLKTGDVLPVGGCFAPEVTIRVNDRIIGQGELIACGNEFMVRITRW 315 + +L + +L + V I N ++G GEL+ + V I W Sbjct: 245 LAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEW 295
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 52.3 bits (125), Expect = 5e-10 Identities = 29/183 (15%), Positives = 70/183 (38%), Gaps = 15/183 (8%) Query: 23 LYRSLPEDEANQMLALLMQHHIDAEKKQEEDGVTLRVEQSQFINAVELLRLNGYPHRQFT 82 L+ +L + + ++A L Q +I + V + L G P + Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYRFA--NGSGAIEVPADKVHELRLRLAQQGLP-KGGA 109 Query: 83 TADKMFPANQLVVSPQEEQQKINFLK--EQRIEGMLSQMEGVINAKVTIALPTYDEGS-- 138 ++ + +S EQ +N+ + E + + + V +A+V +A+P + S Sbjct: 110 VGFELLDQEKFGISQFSEQ--VNYQRALEGELARTIETLGPVKSARVHLAMP---KPSLF 164 Query: 139 --NASPSSVAVFIKYSPQVNMEAFRVK-IKDLIEMSIPGLQYSKISILMQPAEFRMVADV 195 S +V + P ++ ++ + L+ ++ GL ++++ Q ++ Sbjct: 165 VREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNT 224 Query: 196 PAR 198 R Sbjct: 225 SGR 227
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 77.3 bits (190), Expect = 5e-21 Identities = 26/127 (20%), Positives = 49/127 (38%) Query: 14 LKQLLSVDPETVYASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYT 73 L ++ S E +Y+ + +Q G Y A F L + + R + L + +Y Sbjct: 28 LNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87 Query: 74 TAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNA 133 AI+ Y + ++D P + CL GE A A ++ + E+ Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRV 147 Query: 134 QKMVDTL 140 M++ + Sbjct: 148 SSMLEAI 154
>PF05844#YopD protein Length = 295 Score = 29.2 bits (65), Expect = 0.010 Identities = 29/149 (19%), Positives = 48/149 (32%), Gaps = 19/149 (12%) Query: 9 VLPAPSL-LTPSSTPSPSGEGMGTESMLLLFDDIWMKLMELAKKLRDIMRSYNVEKQRLS 67 L AP L P + E + +LL+ I K EL RD + Q+ Sbjct: 50 ELNAPRQVLDPVRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQK-- 107 Query: 68 WELQVNVLQTQMKTIDEAFRASMITAGGAMLSGVLTIGLGAVGGETGLIAGQAVGHTAGG 127 +DE + + A+++GV + VG L G+A+ Sbjct: 108 ------------AQVDEMRSGATLMIAMAVIAGVGALASAVVGSLGALKNGKAISQEK-- 153 Query: 128 VMGLGSGVAQRQSDQDKAIADLQQNGAQS 156 L + R D + L + + Sbjct: 154 --TLQKNIDGRNELIDAKMQALGKTSDED 180
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 89.6 bits (222), Expect = 2e-25 Identities = 39/154 (25%), Positives = 67/154 (43%), Gaps = 7/154 (4%) Query: 6 TLQQAHDTMRFFRRGGSLRMLL---DDDVTQPLNTVYRYAMQLMEVKEFAGAARLFQLLT 62 T + F + GG++ ML D L +Y A + ++ A ++FQ L Sbjct: 8 TQEYQLAMESFLKGGGTIAMLNEISSDT----LEQLYSLAFNQYQSGKYEDAHKVFQALC 63 Query: 63 IYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLACDNVCYA 122 + D + ++ LG C QA + AI++Y A + I P+ P+ AAEC L + A Sbjct: 64 VLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123 Query: 123 IKALKAVVRICGEVSEHQILRLRAEKMLQQLSDR 156 L + + +E + L R ML+ + + Sbjct: 124 ESGLFLAQELIADKTEFKELSTRVSSMLEAIKLK 157
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 27.3 bits (60), Expect = 0.047 Identities = 15/44 (34%), Positives = 22/44 (50%) Query: 78 SNEMDEVIAKAAKGDAKTKEEVPEDVIKYMRDNGILIDGMTIDD 121 + E I K K +E+PED +KY+ + L DG ID+ Sbjct: 368 NTEEQAKINNKIKEAIKMFKELPEDFVKYINSDKALKDGNKIDN 411
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 581 bits (1499), Expect = 0.0 Identities = 157/500 (31%), Positives = 260/500 (52%), Gaps = 15/500 (3%) Query: 11 LLFILNTAKSDELSWKGNDFTLYARQMPLAEVLHLLSENYDTAITISPLITATFSGKIPP 70 LL + + + + EL W + A+ L ++L NYD + +S I SG+ Sbjct: 17 LLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEH 76 Query: 71 GPPVDILNNLAAQYDLLTWFDGSMLYVYPASLLKHQVITFNILSTGRFIHYLRSQNILSS 130 P D L ++A+ Y+L+ ++DG++LY++ S + ++I L+ I Sbjct: 77 DNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWE- 135 Query: 131 PGCEVKEITGTKAVEVSGVPSCLTRISQLASVLDNALIKR--KDSAVSVSIYTLKYATAM 188 P + + V VSG P L + Q A+ L+ R K A+++ I+ LKYA+A Sbjct: 136 PRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASAS 195 Query: 189 DTQYQYRDQSVVVPGVVSVL-REMSKTSVPASSTTN-----GSPATQALPMFAADPRQNA 242 D YRD V PGV ++L R +S ++ + N + A ADP NA Sbjct: 196 DRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEADPSLNA 255 Query: 243 VIVRDYAANMAGYRKLITELDQRQQMIEISVKIIDVNAGDINQLGIDWGTAVSLGG---- 298 +IVRD M Y++LI LD+ IE+++ I+D+NA + +LG+DW + G Sbjct: 256 IIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQV 315 Query: 299 --KKIAFNTGLNDGGASGFSTVISDTSNFMVRLNALEKSSQAYVLSQPSVVTLNNIQAVL 356 K + + GA G + R+N LE A V+S+P+++T N QAV+ Sbjct: 316 VIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVI 375 Query: 357 DKNITFYTKLQGEKVAKLESITTGSLLRVTPRLLNDNGTQKIMLNLNIQDGQQSDTQSET 416 D + T+Y K+ G++VA+L+ IT G++LR+TPR+L +I LNL+I+DG Q S Sbjct: 376 DHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGI 435 Query: 417 DPLPEVQNSEIASQATLLAGQSLLLGGFKQGKQIHSQNKIPLLGDIPVVGHLFRNDTTQV 476 + +P + + + + A + GQSL++GG + + + +K+PLLGDIP +G LFR + Sbjct: 436 EGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELT 495 Query: 477 HSVIRLFLIKASVVNNGISH 496 +RLF+I+ +++ GI+H Sbjct: 496 RRTVRLFIIEPRIIDEGIAH 515
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.5 bits (165), Expect = 1e-13 Identities = 31/156 (19%), Positives = 57/156 (36%), Gaps = 13/156 (8%) Query: 691 ILLVDDADINRDIIGKMLVSQGQHVTIAASSNEALTLSQQQRFDLVLIDIRMPEIDGIEC 750 IL+ DD R ++ + L G V I +++ DLV+ D+ MP+ + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 751 VQLWHDEPNNLDPDCMFVALSASVATEDIHRCKKNGIHHYITKPVTLATLARYISIAAEY 810 + PD + +SA + + G + Y+ KP L L Sbjct: 66 LP----RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG-------- 113 Query: 811 QLLRNIELQEQDPSRCSALLAT-DDMVINSKIFQSL 845 + R + ++ PS+ +V S Q + Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 7e-15 Identities = 28/119 (23%), Positives = 50/119 (42%), Gaps = 2/119 (1%) Query: 1 MKEYKILLVDDHEIIINGIMNALLPWPHFKIVEHVKNGLEVYNACCAYEPDILILDLSLP 60 M IL+ DD I + AL + V N ++ A + D+++ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GINGLDIIPQLHQRWPAMNILVYTAYQQEYMTIKTLAAGANGYVLKSSSQQVLLAALQT 119 N D++P++ + P + +LV +A IK GA Y+ K L+ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.5 bits (209), Expect = 2e-21 Identities = 31/127 (24%), Positives = 56/127 (44%) Query: 2 ATIHLLDDDTAVTNACAFLLESLGYDVKCWTQGADFLAQASLYQAGVVLLDMRMPVLDGQ 61 ATI + DDD A+ L GYDV+ + A + +V+ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 GVHDALRQCGSTLAVVFLTGHGDVPMAVEQMKRGAVDFLQKPVSVKPLQAALERALTVSS 121 + +++ L V+ ++ A++ ++GA D+L KP + L + RAL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 122 AAVARRE 128 ++ E Sbjct: 124 RRPSKLE 130
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.014 Identities = 10/22 (45%), Positives = 13/22 (59%) Query: 28 ILHLVGPNGAGKSTLLARMAGL 49 + L G G GKSTL+ + GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (299), Expect = 6e-39 Identities = 34/89 (38%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 28.5 bits (63), Expect = 0.012 Identities = 17/59 (28%), Positives = 25/59 (42%) Query: 49 QGLTVGIIILTIGVMAPIASGTLPPSTLIHSFVNWKSLVAIAVGVFVSWLGGRGITLMG 107 Q + L IG + + LPPS ++ N ++ A VS LG +TL G Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDG 232
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.002 Identities = 8/37 (21%), Positives = 17/37 (45%), Gaps = 5/37 (13%) Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTIIL 35 + I+ G I+G++ W+ K ++ I+L Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 194 bits (494), Expect = 4e-66 Identities = 66/187 (35%), Positives = 93/187 (49%), Gaps = 18/187 (9%) Query: 1 MKNIILSTLVITTSVLVVNVAQADTNAFSVGYAQSKVQDFKN-IRGVNVKYRYE-DDSPV 58 MK I + + + A T+ + GYAQS Q N + G N+KYRYE D+SP+ Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 60 Query: 59 SFISSLSYLYGDSQASGSIESEGIHYHDKFEVKYGSLMVGPAYRLSDNFSLYALAGVGTV 118 I S +Y AS D + +Y + GPAYR++D S+Y + GVG Sbjct: 61 GVIGSFTYTEKSRTASSG---------DYNKNQYYGITAGPAYRINDWASIYGVVGVGYG 111 Query: 119 KATFKEHATQDGDSFSNKISSRKTGFAWGAGVQMNPLENIVVDVGYEGSNISSTKINGFN 178 K E+ T D+ GF++GAG+Q NP+EN+ +D YE S I S + + Sbjct: 112 KFQTTEYPTYKHDT-------SDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWI 164 Query: 179 VGVGYRF 185 GVGYRF Sbjct: 165 AGVGYRF 171
>cdtoxinb#Cytolethal distending toxin B signature. Length = 269 Score = 298 bits (764), Expect = e-104 Identities = 126/276 (45%), Positives = 167/276 (60%), Gaps = 16/276 (5%) Query: 1 MKKPVFFLLTMIICSYISFACANISDYKVMTWNLQGSSASTESKWNVNVRQLLSGTAGVD 60 MKK + L+ + S+ + A +++D++V TWNLQG+SA+TESKWN+NVRQL+SG VD Sbjct: 1 MKKYIISLI--VFLSFYAQA--DLTDFRVATWNLQGASATTESKWNINVRQLISGENAVD 56 Query: 61 ILMVQEAGAVPTSAVPTGRHIQPFGVGIPIDEYTWNLGTTSRQDIRYIYYSAIDVGARRV 120 IL VQEAG+ P++AV TG I GIP+ E WNL T SR YIY+SA+D RV Sbjct: 57 ILAVQEAGSPPSTAVDTGTLIP--SPGIPVRELIWNLSTNSRPQQVYIYFSAVDALGGRV 114 Query: 121 NLAIVSRQRADNVYVLRPTTVASRPVIGIGLGNDVFLTAHALASGGPDAAAIVRVTINFF 180 NLA+VS +RAD V+VL P RP++GI +GND F TAHA+A DA A+V NFF Sbjct: 115 NLALVSNRRADEVFVLSPVRQGGRPLLGIRIGNDAFFTAHAIAMRNNDAPALVEEVYNFF 174 Query: 181 RQ---PQMRHLSWFLAGDFNRSPDRLENDLMTEHLERVVAVLAPTEPTQISGGILDYGVI 237 R P + L+W + GDFNR P LE +L T + R +++P TQ S LDY V Sbjct: 175 RDSRDPVHQALNWMILGDFNREPADLEMNL-TVPVRRASEIISPAAATQTSQRTLDYAVA 233 Query: 238 VDRAPYSQR------VEALRNPQLASDHYPVAFLAR 267 + + V R Q++SDH+PV R Sbjct: 234 GNSVAFRPSPLQAGIVYGARRTQISSDHFPVGVSRR 269
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 81.4 bits (200), Expect = 2e-20 Identities = 57/170 (33%), Positives = 85/170 (50%), Gaps = 12/170 (7%) Query: 22 VYRVDSTPPDVIFRDGFSLLGYNRNFQQFISGRSCSGGSSDSRYIATTSSVNQT------ 75 VYR DS PP+ +F++GF+ G N N ++GRSC GSS+S +++T+SS T Sbjct: 41 VYRYDSRPPEDVFQNGFTAWGNNDNVLDHLTGRSCQVGSSNSAFVSTSSSRRYTEVYLEH 100 Query: 76 ---YAIARAYYSRSTFKGNLYRYQIRADNNFYSLLPS-ITYLETQGGHFN-AYEKTMMRL 130 A+ R T Y Y++RADNNFY S Y++T G + + Sbjct: 101 RMQEAVEAERAGRGTGHFIGYIYEVRADNNFYGAASSYFEYVDTYGDNAGRILAGALATY 160 Query: 131 QREYVSTLSILPENIQKAVALVYDSATGLVKDGVSTMNASYLGLSTTSNP 180 Q EY++ I PENI++ + ++ TG NA Y+ T +NP Sbjct: 161 QSEYLAHRRIPPENIRRVTRVYHNGITGETTT-TEYSNARYVSQQTRANP 209
>BORPETOXINB#Bordetella pertussis toxin B subunit signature. Length = 226 Score = 35.0 bits (80), Expect = 4e-05 Identities = 31/101 (30%), Positives = 42/101 (41%), Gaps = 7/101 (6%) Query: 30 TNAYYSDEVISELHVGQIDTSPYFCIKTVKANGSGTPVV-ACAVSKQSIWAPSFKELLDQ 88 T+ YYS+ + L T+ C V+ SG PV+ AC + + L Sbjct: 126 TDHYYSNVTATRLLS---STNSRLCAVFVR---SGQPVIGACTSPYDGKYWSMYSRLRKM 179 Query: 89 ARYFYSTGQSVRIHVQKNIWTYPLFVNTFSANALVGLSSCS 129 Y G SVR+HV K Y TF AL G+S C+ Sbjct: 180 LYLIYVAGISVRVHVSKEEQYYDYEDATFETYALTGISICN 220
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 55.8 bits (134), Expect = 9e-10 Identities = 50/259 (19%), Positives = 93/259 (35%), Gaps = 26/259 (10%) Query: 513 PSEEEYAERKRPEQPALATFAMPDVPPAPTPVEPAVSVATAKKDNVAAEQPAQPGLFSRF 572 P E+ + DVP P+ + A+ D A P P S Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSN-----NEEIARVDE-APVPPPAPATPSET 1036 Query: 573 LNALKQLFSGEETKTVETAAPKAEEKAERQQDRRKPRQNNRRDRNERRDTRDNRAGRDGG 632 S +E+KTVE A E + ++ K ++N + +T+ N + G Sbjct: 1037 TE-TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKA-----NTQTNEVAQSGS 1090 Query: 633 ESRDDNRRNRRQAQQQNAEAR---DTRQQETAEKVKTGDEQQQTPRRERSRRRNDDKRQA 689 E+++ ++ E + +T + + KV + Q +P++E+S A Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS----QVSPKQEQSETVQPQAEPA 1146 Query: 690 QQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSTVVETVDTPVVVDEPR 749 ++ +N +E Q + QP ++ N + T ST V T ++ V E Sbjct: 1147 RENDPTVNIKEPQSQTNTTAD---TEQPAKETSS-NVEQPVTESTTVNTGNSVVENPENT 1202 Query: 750 PVENVEQPVPAPRTELAKV 768 + P +E + Sbjct: 1203 TPATTQ---PTVNSESSNK 1218 Score = 38.5 bits (89), Expect = 2e-04 Identities = 51/372 (13%), Positives = 88/372 (23%), Gaps = 47/372 (12%) Query: 630 DGGESRDDNRRNRRQAQQQNAEARDTRQQETAEKVKTGDEQQQTPRRERSRRRNDDKRQA 689 D G + R + N E Q + T + Q S + Sbjct: 963 DLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDE 1022 Query: 690 QQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSTVVETVDTPVVVDEPR 749 ET E Q+ + K Q + N V + V + Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE-AKSNVKANTQ 1081 Query: 750 PVENVEQPVPAPRTELAKVDLPVVADIAPEQDDSVEPRDNTGMPRRSRRSPRHLRVSGQR 809 E + T+ + A + E+ VE +++ P+ + Sbjct: 1082 TNEVAQSGSETKETQTTETKET--ATVEKEEKAKVETE-------KTQEVPKVTSQVSPK 1132 Query: 810 RRRYRDERYPTQSPMPLTVACASPEMASGKVWIRYPIVRPQETQVVEEQREADLALPQPV 869 + + + + P V +E Q + AD P Sbjct: 1133 QEQSETVQPQAEPARE-----------------NDPTVNIKEPQS-QTNTTADTEQPAKE 1174 Query: 870 VAEPQVIAATVALEPQASVQAVENVAVEPQTVAEPQAPEVVKVETTHPEVIAAPVDEQPQ 929 Q V + + PE TT P V + ++ Sbjct: 1175 
T-------------SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221 Query: 930 LIAESDTPEAQEVIA------DAEPVAETADASITVAENVADVVVVEPEEETKAEAAVVE 983 S V D VA S ++D AV + Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQ 1281 Query: 984 HTAEETVIAPAQ 995 H ++ + Q Sbjct: 1282 HISQLEMNNEGQ 1293
>FLAGELLIN#Flagellin signature. Length = 507 Score = 41.2 bits (96), Expect = 4e-06 Identities = 30/138 (21%), Positives = 59/138 (42%) Query: 1 MRISTQMMYEQNMSGITNSQAEWMKLGEQMSTGKRVTNPSDDPIAASQAVVLSQAQAQNS 60 I+T + + + SQ+ E++S+G R+ + DD + A + + Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61 Query: 61 QYALARTFATQKVSLEESVLSQVTTAIQTAQEKIVYAGNGTLSDDDRASLATDLQGIRDQ 120 Q + E L+++ +Q +E V A NGT SD D S+ ++Q ++ Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121 Query: 121 LMNLANSTDGNGRYIFAG 138 + ++N T NG + + Sbjct: 122 IDRVSNQTQFNGVKVLSQ 139
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 664 bits (1714), Expect = 0.0 Identities = 438/553 (79%), Positives = 487/553 (88%), Gaps = 8/553 (1%) Query: 2 SSLINHAMSGLNAAQAALNTVSNNINNYNVAGYTRQTTILAQANSTLGAGGWIGNGVYVS 61 SSLIN+AMSGLNAAQAALNT SNNI++YNVAGYTRQTTI+AQANSTLGAGGW+GNGVYVS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRGAQNQSSGLTTRYEQMSKIDNLLADKSSSLSGSLQSFFTSLQTLV 121 GVQREYDAFITNQLR AQ QSSGLT RYEQMSKIDN+L+ +SSL+ +Q FFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKAEGLVNQFKTTDQYLRDQDKQVNIAIGSSVAQINNYAKQIANLND 181 SNAEDPAARQALIGK+EGLVNQFKTTDQYLRDQDKQVNIAIG+SV QINNYAKQIA+LND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRMTGVGAGASPNDLLDQRDQLVSELNKIVGVEVSVQDGGTYNLTMANGYTLVQGSTA 241 QISR+TGVGAGASPN+LLDQRDQLVSELN+IVGVEVSVQDGGTYN+TMANGY+LVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSSADPTRTTVAYVDEAAGNIEIPEKLLNTGSLGGLLTFRSQDLDQTRNTLGQL 301 RQLAAVPSSADP+RTTVAYVD AGNIEIPEKLLNTGSLGG+LTFRSQDLDQTRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFADAFNAQHTKGYDADGNKGKDFFSIGSPVVYSNSNNADKTVSLTAKVVDSTKVQAT 361 ALAFA+AFN QH G+DA+G+ G+DFF+IG P V N+ N V++ A V D++ V AT Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGD-VAIGATVTDASAVLAT 359 Query: 362 DYKIVFDGTDWQVTRTADNTTFTATKDADGKLEIDGLKVTVGTGAQKNDSFLLKPVSNAI 421 DYKI FD WQVTR A NTTFT T DA+GK+ DGL++T NDSF LKPVS+AI Sbjct: 360 DYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419 Query: 422 VDMNVKVTNEAEIAMASESKLDPDVDTGDSDNRNGQALLDLQ-NSNVVGGNKTFNDAYAT 480 V+M+V +T+EA+IAMASE D GDSDNRNGQALLDLQ NS VGG K+FNDAYA+ Sbjct: 420 VNMDVLITDEAKIAMASEE------DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 481 LVSDVGNKTSTLKTSSTTQANVVKQLYKQQQSVSGVNLDEEYGNLQRYQQYYLANAQVLQ 540 LVSD+GNKT+TLKTSS TQ NVV QL QQQS+SGVNLDEEYGNLQR+QQYYLANAQVLQ Sbjct: 474 
LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533 Query: 541 TANALFDALLNIR 553 TANA+FDAL+NIR Sbjct: 534 TANAIFDALINIR 546
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 499 bits (1285), Expect = 0.0 Identities = 263/316 (83%), Positives = 289/316 (91%), Gaps = 3/316 (0%) Query: 1 MIGDGKLLASAAWDAQSLNELKAKAGQDPAANIRPVARQVEGMFVQMMLKSMREALPKDG 60 MI D KLLASAAWDAQSLNELKAKAG+DPAANIRPVARQVEGMFVQMMLKSMR+ALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSDQTRLYTSMYDQQIAQQMTAGKGLGLADMMVKQMTGGQTMPADDAPQVPLKFSLET 120 LFSS+ TRLYTSMYDQQIAQQMTAGKGLGLA+MMVKQMT Q +P + P P+KF LET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VNSYQNQALTQLVRKAIPKTPDSSDAPLSGDSKDFLARLSLPARLASEQSGVPHHLILAQ 180 V YQNQAL+QLV+KA+P+ D S L GDSK FLA+LSLPA+LAS+QSGVPHHLILAQ Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDS---LPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQ 177 Query: 181 AALESGWGQRQILRENGEPSYNVFGVKATASWKGPVTEITTTEYENGEAKKVKAKFRVYS 240 AALESGWGQRQI RENGEPSYN+FGVKA+ +WKGPVTEITTTEYENGEAKKVKAKFRVYS Sbjct: 178 AALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYS 237 Query: 241 SYLEALSDYVALLTRNPRYAAVTTAATAEQGAVALQNAGYATDPNYARKLTSMIQQLKAM 300 SYLEALSDYV LLTRNPRYAAVTTAA+AEQGA ALQ+AGYATDP+YARKLT+MIQQ+K++ Sbjct: 238 SYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSI 297 Query: 301 SEKVSKTYSANLDNLF 316 S+KVSKTYS N+DNLF Sbjct: 298 SDKVSKTYSMNIDNLF 313
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 429 bits (1104), Expect = e-153 Identities = 153/362 (42%), Positives = 215/362 (59%), Gaps = 9/362 (2%) Query: 5 LAGIVLALVATLAHAERIRDLTSVQGVRENSLIGYGLVVGLDGTGDQTTQTPFTTQTLNN 64 A L+ A RI+D+ S+Q R+N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 14 SALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRA 73 Query: 65 MLSQLGITVPTGTNMQLKNVAAVMVTASYPPFARQGQTIDVVVSSMGNAKSLRGGTLLMT 124 ML LGIT G + KN+AAVMVTA+ PPFA G +DV VSS+G+A SLRGG L+MT Sbjct: 74 MLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMT 132 Query: 125 PLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAIIERELPTQFGAGNT 184 L G D Q+YA+AQG ++V G A +++ R+ NGAIIERELP++F Sbjct: 133 SLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVN 192 Query: 185 INLQLNDEDFTMAQQITDAINRAR----GYGSATALDARTVQVRVPSGNSSQVRFLADIQ 240 + LQL + DF+ A ++ D +N G A D++ + V+ P + R +A+I+ Sbjct: 193 LVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEIE 251 Query: 241 NMEVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQLNVNQPNTPFGGG 300 N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF G Sbjct: 252 NLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSRG 309 Query: 301 QTVVTPQTQIDLRQSGGSLQSVRSSANLNSVVRALNALGATPMDLMSILQSMQSAGCLRA 360 QT V PQT I Q G + ++ +L ++V LN++G +++ILQ ++SAG L+A Sbjct: 310 QTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQA 368 Query: 361 KL 362 +L Sbjct: 369 EL 370
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 353 bits (908), Expect = e-127 Identities = 211/232 (90%), Positives = 223/232 (96%) Query: 1 MQKYALHAYPVMALMVATLTGCAWIPAKPLVQGATTAQPIPGPVPVANGSIFQSAQPINY 60 MQK A H Y + +L+V +LTGCAWIP+ PLVQGAT+AQP+PGP PVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTSFGFDTVPRYLQGLFGNS 120 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKT+FGFDTVPRYLQGLFGN+ Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 121 RADMEASGGNSFNGKGGANASNTFSGTLTVTVDQVLANGNLHVVGEKQIAINQGTEFIRF 180 RAD+EASGGN+FNGKGGANASNTFSGTLTVTVDQVL NGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 181 SGVVNPRTISGSNSVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232 SGVVNPRTISGSN+VPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.7 bits (124), Expect = 3e-09 Identities = 52/253 (20%), Positives = 91/253 (35%), Gaps = 24/253 (9%) Query: 56 AFLATAAFIGRPFGGALFGLLADKFGRKPLMMWSIVAYSVGTGLSGLASGVIMLTLSRFI 115 A A F P GAL +D+FGR+P+++ S+ +V + A + +L + R + Sbjct: 50 ALYALMQFACAPVLGAL----SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 116 VGMGMAGEYACASTYAVESWPKHLKSKASAFLVSGFGIGNIIAAYFMPSFAEAYGWRAAF 175 G+ A + A + +++ F+ + FG G ++A + + A F Sbjct: 106 AGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPHAPF 163 Query: 176 FV-GLLPVLLVIYIRARAPESKEWEE--AKLSGPGKHSQSAWSVFSLSMKGLFNQA---- 228 F L L + PES + E + + W+ + L Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223 Query: 229 ---QFPLTLCVFIVLFSIFGANWPIFGLLPTYLAGEGFDTGVVSNLMTAAAFGTVLGN-- 283 Q P L V+F +W + LA G ++ M LG Sbjct: 224 LVGQVPAAL---WVIFGEDRFHWDA-TTIGISLAAFGI-LHSLAQAMITGPVAARLGERR 278 Query: 284 -IVWGLCADRIGL 295 ++ G+ AD G Sbjct: 279 ALMLGMIADGTGY 291
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 1e-20 Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 1/117 (0%) Query: 2 KILLIEDNQKTIEWVRQGLTEAGYMVDYACDGRDGLHLALQEHYSLIILDIMLPGLDGWQ 61 IL+ +D+ + Q L+ AGY V + L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRALRTAHQS-PVICLTARDSVEDRVKGLEAGANDYLVKPFSFAELLARVRAQLRQ 117 +L ++ A PV+ ++A+++ +K E GA DYL KPF EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 18/102 (17%), Positives = 38/102 (37%), Gaps = 15/102 (14%) Query: 348 ILLQRVLSNLLTNAIRYSDENAVIRIESAYDDNVAEIRVANPGSHPADADKLFRRFWRGD 407 +L+Q ++ N + + I + I ++ D+ + V N GS K Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------- 309 Query: 408 NARHTAGFGLGLSLVNA-IALLHGGSASYRYADEHNIFSVRL 448 G GL V + +L+G A + +++ + + Sbjct: 310 ------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 655 bits (1692), Expect = 0.0 Identities = 186/396 (46%), Positives = 253/396 (63%), Gaps = 5/396 (1%) Query: 166 LNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYQGKGVCSWDTKNIHHANN 225 LNN+ W + ++H+G +Y PA+ MKIG K+IF Y GKG+C T+ H N Sbjct: 146 LNNKNWGPVNKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRESDHIAN 205 Query: 226 LWMSTVSVHEDGKDKTLFCGIRHGVLSPYH-EKDPLLRQVGAENKAKEVLTAALYSKPEL 284 +W+S V V ++GK+ +F GIRHGV+S Y +K+ R V A NKA+E+++AALYS+PEL Sbjct: 206 MWLSKV-VDDEGKE--IFSGIRHGVISAYGLKKNSSERAVAARNKAEELVSAALYSRPEL 262 Query: 285 LNRALEGEAVSLKLVSVGLLTASNIFGKEGTMVEDQMRAWQSL-TQPGKMIHLKIRNKDG 343 L++AL G+ V LK+VS LLT +++ G E +M++DQ+ A + L ++ G+ L IRN DG Sbjct: 263 LSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKRGEPTKLLIRNSDG 322 Query: 344 DLQTVKIKPDVAAFNVGVNELALKLGFGLKASDRYNAEALHQLLGNDLRPEARPGGWVGE 403 L+ V + V FN GVNELALK+G G + D+ N E++ LLG++ GGW E Sbjct: 323 LLKEVSVNLKVVTFNFGVNELALKMGLGWRNVDKLNDESICSLLGDNFLKNGVIGGWAAE 382 Query: 404 WLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSG 463 + + P V LA QIK+I D GEPYKL+QR+ +LA+ I AVP WNCKSG Sbjct: 383 AIEKNPPCKNDVIYLANQIKEIINKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSG 442 Query: 464 KDRTGMMDSEIKREHISLHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAG 523 KDRTGM D+EIKRE I H+T S S S +++F +L+NSGN+EIQ+ NTG G Sbjct: 443 KDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGVPG 502 Query: 524 NKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLI 559 NKVMK L L LSY +R+GD IW VKG SS + Sbjct: 503 NKVMKKLPLSSLELSYSERIGDSKIWNMVKGYSSFV 538
>PF07824#Type III secretion chaperone Length = 120 Score = 165 bits (419), Expect = 1e-56 Identities = 33/114 (28%), Positives = 63/114 (55%), Gaps = 1/114 (0%) Query: 1 MESLLNRLYDALGLDAPE-DEPLLIIDDGIQVYFNESDHTLEMCCPFMPLPDDILTLQHF 59 ME L + + ALG+ + + D+ +++DD + +Y + ++ + CPF LP++I L + Sbjct: 1 MEDLADVICRALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYA 60 Query: 60 LRLNYTSAVTIGADADNTALVALYRLPQTSTEEEALTGFELFISNVKQLKEHYA 113 L LNY+ + + D + +L+A L + E+ E +IS V+ LK+ +A Sbjct: 61 LSLNYSEKICLATDDEGGSLIARLDLTGINEFEDIYVNTEYYISRVRWLKDEFA 114
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.2 bits (73), Expect = 0.004 Identities = 36/158 (22%), Positives = 63/158 (39%), Gaps = 6/158 (3%) Query: 8 VMLLLCGLLLLTLAIAVLNTLVLLWLAQA-NLPTWQVGMVSSSYFTGNLVGTLFTGYLIK 66 +++ LC L ++ ++ + L +A N P V++++ +GT G L Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 67 RIGFNRSYYLASLIFAAGCVGLGVMVGFWSWMSW-RFIAGIGCAMIWVVVESALMCSGTS 125 ++G R +I G V V F+S + RFI G G A +V + Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 126 HNRGRLLAAYMMVYYMGTFLGQLLVSKVSGELLHVLPW 163 NRG+ + MG +G + G + H + W Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPA----IGGMIAHYIHW 168
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 28.0 bits (62), Expect = 0.035 Identities = 12/51 (23%), Positives = 20/51 (39%), Gaps = 4/51 (7%) Query: 97 LKKLPP-ALRTLWLIITMVLGVVFVW---MMVRVYNSIDTVPTWYSVWTPL 143 + KLPP + + I+ +L ++F M+ D P TP Sbjct: 3 ISKLPPLSPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPA 53
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 58.2 bits (140), Expect = 3e-10 Identities = 47/292 (16%), Positives = 85/292 (29%), Gaps = 44/292 (15%) Query: 555 AAPAFSLATGGAPRPQVKEGIGPQLPRPNRVRVPTRRELASYGIKLPSQRIAEEKAREAE 614 + A P V ++ R + VP PS+ E A ++ Sbjct: 994 TTNITTPNNIQADVPSVPSN-NEEIARVDEAPVPPPAP------ATPSET-TETVAENSK 1045 Query: 615 RNQYETGVQLTDEEIDAMHQDELARQFAQSQQHRYGETYQHDTQQAEDDDTAAEAELARQ 674 + E DA R+ A+ + + +TQ E + +E + + Sbjct: 1046 QESKTVEKN----EQDATETTAQNREVAKEAK----SNVKANTQTNEVAQSGSETKETQT 1097 Query: 675 FAASQQQRYSGEQPAGAQPFSLDDLDFSPMKVLVDEGPHEPLFTPGVMPESTPVQQPVAP 734 + E+ A KV ++ P T V P+ +Q Sbjct: 1098 TETKETATVEKEEKA---------------KVETEKTQEVPKVTSQVSPKQ---EQSETV 1139 Query: 735 QPQPQYQQSQQPVAPQSQYQQPQQPVAPQPQPQYQQSQQPVAPQSQYQQPQQPVAPQPQY 794 QPQ + + P + Q A QP + S P ++ V Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE----STTVNTGNSV 1195 Query: 795 QQPQQPVAPQPQYQQPQQPTA----PQPQYQQPVAPQPQYQQPQQPVAPQPQ 842 + P P QP + P+ ++++ V P +P + Sbjct: 1196 --VENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Score = 43.5 bits (102), Expect = 7e-06 Identities = 26/214 (12%), Positives = 55/214 (25%), Gaps = 37/214 (17%) Query: 718 TPGVMPESTPVQQPVAPQ-------PQPQYQQSQQPVAPQSQYQQPQQPVAPQPQPQYQQ 770 T P + P P P P + + + + + Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054 Query: 771 SQQPVAPQSQYQ-----------------------------QPQQPVAPQPQYQQPQQPV 801 Q +Q + Q + ++ + V Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114 Query: 802 APQPQYQQPQQPTAPQPQYQQPVAPQPQYQQPQQPVAPQPQYQQPQQPTAPQDSLIHPLL 861 + + P+ + P+ +Q QPQ +P + P ++PQ T P Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQ-AEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173 Query: 862 MRNGDSRPLQRPTTPLPSLDLLTPPPSEVEPVDT 895 + + +T + + + + P P T Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207 Score = 41.2 bits (96), Expect = 3e-05 Identities = 46/303 (15%), Positives = 85/303 (28%), Gaps = 46/303 (15%) Query: 296 
RATQPEYDEYDPLLNGHSVTEPVAAAAAATAVTQTWAASADP--IMQTPPMPGAEPVVAQ 353 PE ++ + ++ ++T P A +V A PP P + Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038 Query: 354 PTVEWQP--------------VPGPQTGE------PVIAPAPEGYQPHPQYAQPQEAQSA 393 E Q E + + + ++ +E Q+ Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 394 PWQQPVPVASAPQYAATPATAAEYDS----LAPQETQPQWQAPDAEQHWQPEPT------ 443 ++ V + E ++P++ Q + P AE + +PT Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158 Query: 444 ---HQPEPIAAEPSHMPPPVIEQPVTT---------EPEPGIEETRPARPPLYYFEEVEE 491 +P+ +EQPVT E T P E + Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218 Query: 492 KRAREREQLAAWYQPIPEPVKENVPVKPTVSVAPSIPPVEAVAAAASLDAGIKSGALAAG 551 + R R + + EP + + TV++ A + A + AL G Sbjct: 1219 PKNRHRRSVRS-VPHNVEPATTSSNDRSTVALCDLT-STNTNAVLSDARAKAQFVALNVG 1276 Query: 552 AAA 554 A Sbjct: 1277 KAV 1279 Score = 40.0 bits (93), Expect = 7e-05 Identities = 21/187 (11%), Positives = 56/187 (29%), Gaps = 12/187 (6%) Query: 724 ESTPVQQPVAPQPQPQYQQSQQPVAPQSQYQQPQQPVAPQPQPQYQ-QSQQPVAPQSQYQ 782 ++ + A + Q ++ + + VA + Q+ + + + Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108 Query: 783 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPTAPQPQYQQPVAPQPQYQQPQQPVAPQPQ 842 + + V + + P+ P+ +Q + PQ + + P ++PQ Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSET-VQPQAEPARENDPTVNIKEPQSQTNTTAD 1167 Query: 843 YQQPQQPTAPQDSLIHPLLMRNGDSRPLQRPT-TPLPSLDLLTPPPSEVEPVDTFALEQM 901 +QP + T+ + + + P + P+ +P Sbjct: 1168 TEQPAKETSSN---VEQPVTESTTVNTGNSVVENPENTT------PATTQPTVNSESSNK 1218 Query: 902 ARLVEAR 908 + R Sbjct: 1219 PKNRHRR 1225 Score = 38.9 bits (90), Expect = 2e-04 Identities = 31/215 (14%), Positives = 65/215 (30%), Gaps = 25/215 (11%) Query: 658 QQAEDDDTAAEAELARQFAASQQQRYSGEQPAGAQPFSLDDLDFSPMKVLVDEGPHEPLF 717 AE+ ++ + A++ + E A+ ++ + V + E Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKS----NVKANTQTNEVAQSGSE--- 1091 Query: 718 
TPGVMPESTPVQQPVAPQPQPQYQQSQQPVAPQSQYQ-QPQQPVAPQPQPQYQQSQQPVA 776 T T V + + + + + P+ Q P+Q + QPQ + +++ Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN-D 1150 Query: 777 PQSQYQQPQQPVAPQPQYQQPQQPVAPQ-PQYQQPQQPTAPQPQYQQ-PVAPQPQYQQPQ 834 P ++PQ +QP + + Q + P P QP Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPT 1210 Query: 835 --------------QPVAPQPQYQQPQQPTAPQDS 855 + V P +P ++ S Sbjct: 1211 VNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Score = 36.6 bits (84), Expect = 0.001 Identities = 60/345 (17%), Positives = 98/345 (28%), Gaps = 37/345 (10%) Query: 365 QTGEPVIAPAPEGYQP-HPQYAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLAPQ 423 QT + P Q P E + + PVP P ATP+ E Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP----PPAPATPSETTE----TVA 1041 Query: 424 ETQPQWQAPDAEQHWQP-EPTHQPEPIAAEPSHMPPPVIEQPVTTEPEPGIEETRPARPP 482 E Q + E T Q +A E + + +ET+ Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101 Query: 483 LYYFEEVEEKRAREREQLAAWYQPIPEPVKENVP-------VKPTVSVAPSIPPVEAVAA 535 E EEK E E+ Q +P+ + P V+P A P + Sbjct: 1102 ETATVEKEEKAKVETEKT----QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157 Query: 536 AASLDAGIKSGALAAGAAAAAPAFSLATGGAPRPQVKEGIGPQLPRPNRVRVPTRRELAS 595 S A ++ + P+ P + PT +S Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ-PTVNSESS 1216 Query: 596 YGIKLPSQRI-------AEEKAREAERNQYETGVQLTDEEIDAMHQDELARQFAQSQQHR 648 K +R E + LT +A+ D A+ AQ Sbjct: 1217 NKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAK--AQFVALN 1274 Query: 649 YGETYQHDTQQAEDDDTAA------EAELARQFAASQQQRYSGEQ 687 G+ Q E ++ + + +++SQ +R+S + Sbjct: 1275 VGKAVSQHISQLEMNNEGQYNVWVSNTSMNKNYSSSQYRRFSSKS 1319 Score = 35.4 bits (81), Expect = 0.002 Identities = 17/131 (12%), Positives = 34/131 (25%), Gaps = 8/131 (6%) Query: 723 PESTPVQQPV-APQPQ-PQYQQSQQPVAPQSQYQ--QPQQPVAPQPQPQYQQSQQPVAPQ 778 PE Q V P Q+ P P + + + + P P P + Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042 Query: 779 
SQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPTAPQPQYQQPVAPQPQYQQPQQP-V 837 Q+ + Q + A + + + VA + Q Sbjct: 1043 ---NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 838 APQPQYQQPQQ 848 + + ++ Sbjct: 1100 TKETATVEKEE 1110
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.2 bits (133), Expect = 2e-10 Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVFALSQQGHQVRA---------AARRVERLEKQRLANVSCHKVDL 54 + LV GA+G+IG H+ L + GHQV + + RLE HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 HWPENLPTLLRD--VDTVYYLVH------GMGEGGDFIAHERQAALNVRDALRQTPVKQL 106 E + L + V+ H + + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 66.3 bits (162), Expect = 2e-14 Identities = 69/370 (18%), Positives = 123/370 (33%), Gaps = 71/370 (19%) Query: 1 MKVLVTGATSGLGRNAVEFLRNKGISVRA---------TGRNEAMGKLLEKMGAEFVHAD 51 MK LVTGA +G + + L G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAAGEEVINLLAQANPQTH-- 162 +++ ++ SS S+Y + D + +A +K A E L+A + Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171 Query: 163 -FTVLRPQSLFGPHDK--VFIPRLAHMMHHYGSVLLPHGGSALVDMTYYENAIHAMWLAS 219 T LR +++GP + + + + M S+ + + G D TY ++ A+ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 220 QPGCDHLPS--------------GRAYNITNGENRTLRSIVQKLIDELTIDCRIRSVPYP 265 R YNI N L +Q L D L I+ + +P Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 266 MLDMIARSMERFGKKSAKEPPLTHYGVSKLNFDFTLDTTRAQDELGYQPIVTLDEGIERT 325 D+ T DT + +G+ P T+ +G++ Sbjct: 292 PGDV----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNF 324 Query: 326 AAWLRDHGNL 335 W RD + Sbjct: 325 VNWYRDFYKV 334
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.006 Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 1/50 (2%) Query: 31 LVLLGPSGAGKSSLLRVLNLLEMPRSGTLTIAGNHFDFTKTPSDKAIREL 80 +VL G G GKS+L+ L L+ S T G D + + EL Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDF-FSDTHFDIGTGKDSYEQIAGIVAYEL 647
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.1 bits (67), Expect = 0.006 Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 31/119 (26%) Query: 81 FDAVMAG--MDITPEREKQVLFTTPYYDNSALFVGQQGKYTSVDQLKGKKVGVQNGTTHQ 138 D+V+A M + E +QV+ TP DNSAL + QL Q Sbjct: 112 LDSVIASRLMQMALEAARQVIGQTPTVDNSALI-------KQIQQL-----------LQQ 153 Query: 139 KFIMDKHPEITTVPYDSYQNAKLDLQNGRIDAVFGDTAVVTEW-LKANPKLVPVGDKVT 196 + + P++ P DLQ R+D + G T + W L+ +P L P G KV+ Sbjct: 154 EPLFSGKPQLRVHPD--------DLQ--RVDDMLGATLSLHGWRLRGDPTLHPGGCKVS 202
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 30.8 bits (69), Expect = 0.013 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 5/55 (9%) Query: 230 VLQTAKALGIPVKGHVEQLSLLGGAQLVSRYQGLSADHIEYLDEAGVAAMRDGGT 284 VL+ +P G +S+LG ++L L HI AGVAAM+ Sbjct: 202 VLRDTNVTAVPASGAPAAVSVLGASELT-----LDGGHITGGRAAGVAAMQGAVV 251
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.004 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 15 VDNAPRMQDYTLEGEEGRDM-MLLDALIQLKEKDPSLSFRR 54 ++N + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.006 Identities = 11/33 (33%), Positives = 16/33 (48%), Gaps = 6/33 (18%) Query: 50 KIDFTLTEGNRLALIGHNGSGKTTLLRVLAGAY 82 K D+++ L G G GK+TL+ L G Sbjct: 594 KFDYSVV------LEGTGGIGKSTLINTLVGLD 620
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 31.9 bits (72), Expect = 0.002 Identities = 17/87 (19%), Positives = 27/87 (31%), Gaps = 8/87 (9%) Query: 20 LTLVSSANIASGFHAGDAQTMLT---CVREALKNGVAIGAHPSFPDRDN--FGRT--AMV 72 + + IASG G T+LT V + A+ A PS ++DN G + Sbjct: 95 VEAPTGTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQI 153 Query: 73 LPPETVYAQTLYQIGALGAIVQAQGGV 99 + + V Sbjct: 154 TKYSGEGDLAIVKFSPNEQNKHIGEVV 180
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 63.7 bits (155), Expect = 4e-14 Identities = 28/138 (20%), Positives = 52/138 (37%), Gaps = 4/138 (2%) Query: 6 TLLIVEDETLLAEMHAEYIRHIPGFKQIWLAGNLAQARMMIDRFKPGLILLDNYLPDGKG 65 T+L+ +D+ + + + + G+ + N A I L++ D +PD Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 66 ITLLHELMQSRYPG-GVVFTTAASDMETVAEAVRSGAFDYLVKPIAYERLGQTLTRYQQR 124 LL + + P V+ +A + T +A GA+DYL KP L + R Sbjct: 63 FDLLPRI-KKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 125 RRMLASADSASQKQIDEM 142 + S + + Sbjct: 122 PKRRPSKLEDDSQDGMPL 139
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 38.3 bits (89), Expect = 1e-05 Identities = 14/68 (20%), Positives = 31/68 (45%), Gaps = 4/68 (5%) Query: 138 NPFTNGHRYLIQQAAAQCDWLHLFLVKEDTSRFPY---EDRLDLVLKGTTDIPRLTVHRG 194 +P T GH +I++ D +++ V + ++ P ++RL+ + K +P V Sbjct: 10 DPITFGHLDIIERGCRLFDQVYV-AVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSF 68 Query: 195 SEYIISRA 202 ++ A Sbjct: 69 EGLTVNYA 76
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 31.2 bits (70), Expect = 0.011 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Query: 2 LDTNMKTQLRAYLEKLTKPVELIATLDDS-AKSAEIKELLAEIAELSDKVTFKEDNTLPV 60 D N K + +++E + ++ LD + A +AEIK+ + + S + + + N + Sbjct: 109 FDANGKENIASFMESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNL 168 Query: 61 RKPSFLITNPGSQ 73 P PG Q Sbjct: 169 LTPVIEKVKPGEQ 181
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 28.4 bits (63), Expect = 0.018 Identities = 18/98 (18%), Positives = 41/98 (41%), Gaps = 13/98 (13%) Query: 30 QGITILKSFEAPGGMKGYLGKYQDMGVTIYLTPDGKHAISG--YMYNEKGENLSNALIEK 87 + + + FE YLGK+ ++ + G ++ + N+ G ++ N Sbjct: 21 ESVKPVSDFEL----NNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLN----- 71 Query: 88 EIYAPAGREMWQKMEKASWILDGKKDAPVVLYVFADPF 125 Y+ + W++ E ++ ++G D + + F PF Sbjct: 72 RGYSEE-KGEWKEAEGKAYFVNGSTDGYLKVSFFG-PF 107
>PF05043#Transcriptional activator Length = 493 Score = 28.8 bits (64), Expect = 0.033 Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 20/137 (14%) Query: 7 LKKFDLNLLVIFECIYQH---LSISKAAETLYITPSAVSQSLQRLRTQFNDPLFIRSGKG 63 L K L + E +++H S+ AE L T AV L +++ F D +F S G Sbjct: 5 LSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNG 64 Query: 64 I----TPTVTGINLHYHLENNLNSLE--QTINIMNQSSL----KKKFIIYSPQMLITQYA 113 I T +++H + + I K+ +I S + Y Sbjct: 65 IRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSS-----SLYR 119 Query: 114 M--KLVKYIRKDPQVEI 128 + ++ K I++ Q E+ Sbjct: 120 IISQINKVIKRQFQFEV 136
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 338 bits (868), Expect = e-120 Identities = 104/257 (40%), Positives = 147/257 (57%), Gaps = 20/257 (7%) Query: 9 KTVWVTGAGKGIGYATALAFVDAGARVIGFDRE---------------FTQENYPFATEV 53 K ++TGA +GIG A A GA + D E +P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP----- 63 Query: 54 MDVADAGQVAQVCQRVLQKTPRLDVLVNAAGILRMGATDALSVDDWQQTFAVNVGGAFNL 113 DV D+ + ++ R+ ++ +D+LVN AG+LR G +LS ++W+ TF+VN G FN Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 114 FSQTMAQFRRQQGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGCGVRCN 173 ++ G+IVTV S+ A PR M+AY +SKAA +GLELA +RCN Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 174 VVSPGSTDTDMQRTLWVSEDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDLA 233 +VSPGST+TDMQ +LW E+ +Q I+G E FK GIPL K+A+P +IA+ +LFL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 234 SHITLQDIVVDGGSTLG 250 HIT+ ++ VDGG+TLG Sbjct: 244 GHITMHNLCVDGGATLG 260
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 425 bits (1095), Expect = e-154 Identities = 148/299 (49%), Positives = 192/299 (64%), Gaps = 18/299 (6%) Query: 1 MAIPKLQSYALPTALDIPTNKVNWAFEPERAALLIHDMQDYFVSFWGRNCPMMDQVIANI 60 MAIP +Q Y +PTA D+P NKV+W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRQYCKEHHIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVEALTPDEADTV 120 L+ C + IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P++ D V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKDTGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSREEHLMALNYVAGRSGRVVMTESLL------PTPVPASKA-----------ALRALIL 223 FS E+H MAL Y AGR VMT+SLL P V + A +R I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDETDEPLD-DENLIDYGLDSVRMMGLAARWRKVHGDIDFVMLAKNPTIDAWWALLS 281 LL ET E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 58.8 bits (142), Expect = 4e-12 Identities = 46/210 (21%), Positives = 81/210 (38%), Gaps = 21/210 (10%) Query: 105 EPNAETVAAQMPDLILISATGGDSALALYDQLSAIAPTLVINYDDKS-----WQSLLTQL 159 EPN E + P ++ SA G S + L+ IAP N+ D + LT++ Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLTEM 141 Query: 160 GEITGQEKQAAARIAEFEAQLTTVKQRIALPPQPVSALVYTPAAHSANLWTPESTQGKLL 219 ++ + A +A++E + ++K R L ++ P S ++L Sbjct: 142 ADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEIL 201 Query: 220 TQVGFTLATLPQGLQTSKSQGKRHDIIQLGGENLAAGLNGESLFLFAGDNNDVAALYANP 279 + G A + + + + LAA + + L ++ D+ AL A P Sbjct: 202 DEYGIPNAW--------QGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATP 253 Query: 280 LLAHLPAVQNKRVYALGTETFRLDYYSATL 309 L +P V+ R + F Y ATL Sbjct: 254 LWQAMPFVRAGRFQRVPAVWF----YGATL 279
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.039 Identities = 69/394 (17%), Positives = 130/394 (32%), Gaps = 60/394 (15%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFIGLMVGGVLADRYERKKVIL 86 F S+++ +L V++P T IG V G L+D+ K+++L Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 87 LARGTCGIGFIGLCVNALLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQ 146 G + V ++A ++ G F +L ++ + +EN + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKENRGK 139 Query: 147 AGAITMLTVRLGSVISPMLGGILLASGGVAWNYGLAAAGTFITLLPLLTLPRLPVPPQPR 206 A + V +G + P +GG++ + W+Y L IT++ + L +L Sbjct: 140 AFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKEVRI 195 Query: 207 ENP--------------------------FLALLAAFRFLLA------------------ 222 + FL + + Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKN 255 Query: 223 CPLIGGIALLGGLVTMASAVRVLYPALAMS--WQMSAAQIGLLYAAI-PLGAAIGALTSG 279 P + G+ L GG++ A V M Q+S A+IG + + I G Sbjct: 256 IPFMIGV-LCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314 Query: 280 QLAHSVRPGLIMLVSTVG---SFLAVGLFAIMPVWIAGVICLALFGWLSAISSLLQYTLL 336 L P ++ + SFL W +I + + G LS +++ + Sbjct: 315 ILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVS 374 Query: 337 QTQTPENMLGRMNGLWTAQNVTGDAIGAALLGGL 370 + + M L + + G A++GGL Sbjct: 375 SSLKQQEAGAGM-SLLNFTSFLSEGTGIAIVGGL 407
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 52.9 bits (127), Expect = 7e-12 Identities = 13/37 (35%), Positives = 25/37 (67%) Query: 25 GAGSEVMSRIATPMIGGMITAPLLSLFIIPAAYKLMR 61 GAGS + + ++GGM++A LL++F +P + ++R Sbjct: 993 GAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 33.7 bits (77), Expect = 2e-05 Identities = 7/45 (15%), Positives = 17/45 (37%), Gaps = 1/45 (2%) Query: 4 KRYPEEFKIEAVRQVVER-GHSVSSVATHLDITTHSLYARIKKYG 47 R E + + + + A L + ++L +I++ G Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 2e-16 Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 2/122 (1%) Query: 1 MKPASVIIMDEHPIVRMSIEVLLGKNSNIQVVLKTDDSRTAIEYLRTYPVDLVILDIELP 60 M A++++ D+ +R + L + V T ++ T ++ DLV+ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GTDGFTLLKRIKSIQEHTRILFLSSKSEAFYAGRAIRAGANGFVSKRKDLNDIYNAVKMI 120 + F LL RIK + +L +S+++ A +A GA ++ K DL ++ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LS 122 L+ Sbjct: 119 LA 120
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 826 bits (2134), Expect = 0.0 Identities = 406/856 (47%), Positives = 561/856 (65%), Gaps = 19/856 (2%) Query: 21 VALSVLAALCPLTSRGESYFNPAFLSADTASVADLSRFEKGYHQPPGIYRVDIWRNDEFV 80 + ++ A S E YFNP FL+ D +VADLSRFE G PPG YRVDI+ N+ ++ Sbjct: 30 LFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89 Query: 81 ATQDIRFEAGAVGAGDKSGGLMPCFTPEWIKRLGVNTAVFPVSDKGVDTSCIHLPEKIPG 140 AT+D+ F G D G++PC T + +G+NTA + D +C+ L I Sbjct: 90 ATRDVTFNTG-----DSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144 Query: 141 AEVAFDFASMRLNISLPQASLLNSARGYIPPEEWDEGIPAALINYSFTGSR-----GTDS 195 A D RLN+++PQA + N ARGYIPPE WD GI A L+NY+F+G+ G +S Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS 204 Query: 196 DSYFLSLLSGLNYGPWRLRNNGAWNYSKGDG--YHSQRWNNIGTWVQRAIIPLKSELVMG 253 +L+L SGLN G WRLR+N W+Y+ D +W +I TW++R IIPL+S L +G Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264 Query: 254 DSNTGNDVFDSVGFRGARLYSSDNMYPDSLQGYAPTVRGIARTAAKLTIRQNGYVIYQNY 313 D T D+FD + FRGA+L S DNM PDS +G+AP + GIAR A++TI+QNGY IY + Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324 Query: 314 VSPGAFAITDLNPTSSSGDLEVTVDEKDGSQQRYTVPYSTVPLLQREGRVKYDLVAGDFR 373 V PG F I D+ +SGDL+VT+ E DGS Q +TVPYS+VPLLQREG +Y + AG++R Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384 Query: 374 SGNSQQSSPFFFQGTVIAGLPAGLTAYGGTQLADRYRAVVVGAGQNLGDWGAVSVDVTHA 433 SGN+QQ P FFQ T++ GLPAG T YGGTQLADRYRA G G+N+G GA+SVD+T A Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444 Query: 434 RSQLADDSTHQGQSLRFLYAKSLNNYGTNFQLLGYRYSTRGFYTLDDVAYRSMEGYDYEY 493 S L DDS H GQS+RFLY KSLN GTN QL+GYRYST G++ D Y M GY+ E Sbjct: 445 NSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE- 503 Query: 494 DSDGRRHKVPVAQSYHNLRYSKKGRFQVNISQNLGDYGSLYLSGSQQNYWNTADTNTWYQ 553 DG P Y+NL Y+K+G+ Q+ ++Q LG +LYLSGS Q YW T++ + +Q Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQ 563 
Query: 554 LGYASGWQDISYSLSWSWNESVGISGADRILAFNMSAPFSVLTGRRYARDTILDRTYATF 613 G + ++DI+++LS+S ++ G D++LA N++ PFS R + A++ Sbjct: 564 AGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFS--HWLRSDSKSQWRHASASY 621 Query: 614 NANRNRDGDNSWQSGVGGTLLEGRNLSYSVTQGRS----STNGYSGSASASWQATYGTLG 669 + + + +G + +GV GTLLE NLSYSV G + +G +G A+ +++ YG Sbjct: 622 SMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNAN 681 Query: 670 VGYNYDRDQHDYNWQLSGGVVGHADGITFSQPLGDTNVLIKAPGAKGVRIENQTGVKTDW 729 +GY++ D + +SGGV+ HA+G+T QPL DT VL+KAPGAK ++ENQTGV+TDW Sbjct: 682 IGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDW 741 Query: 730 RGYAVMPYATVYRYNRVALDTNTMDNHTDVENNVSSVVPTEGALVRAAFDTRIGVRAIIT 789 RGYAV+PYAT YR NRVALDTNT+ ++ D++N V++VVPT GA+VRA F R+G++ ++T Sbjct: 742 RGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMT 801 Query: 790 ARLGGRPLPFGAIVRETASGITSMVGDDGQIYLSGLPLKGELFIQWGEGKNARCIAPYAL 849 +PLPFGA+V +S + +V D+GQ+YLSG+PL G++ ++WGE +NA C+A Y L Sbjct: 802 LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQL 861 Query: 850 AENSLKQAITIVSATC 865 S +Q +T +SA C Sbjct: 862 PPESQQQLLTQLSAEC 877
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.018 Identities = 16/149 (10%), Positives = 42/149 (28%), Gaps = 8/149 (5%) Query: 299 RSQLNYSEENLKQARASLERLYTALRGTDKSAAPAGGEAFEARFVEAMNDDFNTPEAY-- 356 + ++ +L QAR R R + + P E F ++ + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 357 SVLFDMAREVN--RLKGEDMTAA-NAMASHLRKISGVLGLLEQEPDVFLQSGAQADDGEV 413 + L + A + + + + + + + D F + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249 Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRL 442 A+ L Q+ + + +++ Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQI 278
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 359 bits (923), Expect = e-127 Identities = 126/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIADAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKA---VEPYPLDVLVAESQGMIGYMLAQRLALEPDM----PPVTAVLTRIKVSAD 113 A +A + P+DV A SQG IGYM+ Q L E V ++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLEPEKFIGPVYSPEEQMSLEATYGWHMKRD-GKYLRRVVASPAPRQIIESAAIELL 172 DPAF P K +GP Y E L GW +K D G+ RRVV SP P+ +E+ I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVAGEG---EGVEAVIDKDLAAALLAEQIAADGLIILTDADAVYE 229 ++ G +VI SGGGGVPV E +GVEAVIDKDLA LAE++ AD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 HWGTPQQRAIRQASPDELAPFAKAD----GAMGPKVTAVSGYVKRCGKPAWIGALSRIDD 285 ++GT +++ +R+ +EL + + G+MGPKV A +++ G+ A I L + + Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGRAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 115 bits (291), Expect = 8e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIEKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 42.9 bits (101), Expect = 2e-06 Identities = 40/190 (21%), Positives = 76/190 (40%), Gaps = 15/190 (7%) Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGVVNKTLSLLATIVGALYG 279 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GILMQRLSLFRALLIFGILQGASNAGYWLLSITDKNMFSMGAAVFFENLCGGMGTAAFVA 339 L +L + R LL I+ + ++ + FS+ + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGPVAGWFVEAH-GWPTFYLFSVVAAVP 394 L+M K F L+ ++ A+G VGP G + + W L ++ + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GLLLLLVCRQ 404 L+ + ++ Sbjct: 182 VPFLMKLLKK 191
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 84.1 bits (208), Expect = 1e-19 Identities = 85/382 (22%), Positives = 156/382 (40%), Gaps = 26/382 (6%) Query: 17 LGTVFSLRMLGMFMVLPVLTTY--GMALQSASEALIGIAIGIYGLAQAIFQIPFGLLSDR 74 L TV L +G+ +++PVL + + A GI + +Y L Q G LSDR Sbjct: 11 LSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69 Query: 75 IGRKPLIVGGLAVFVAGSIIAALSHSIWGIILGRALQG-SGAIAAAVMALLSDLTREQNR 133 GR+P+++ LA I A + +W + +GR + G +GA A A ++D+T R Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDER 129 Query: 134 TKAMAFIGVSFGITFAIAMVLGPIVTHSLGLNALFWMIAALATLGILLTIWVVPNSTNHV 193 + F+ FG VLG ++ +A F+ AAL L L +++P S Sbjct: 130 ARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188 Query: 194 LNRESGMVKGSFSKVLAEPRLLKLNFGIMCLHILLMSTFVA-LPGQLADAGFPAAEHWKV 252 + + G+ + L+ F+ L GQ+ A + + Sbjct: 189 RRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241 Query: 253 YLATMVIAFA--------AVVPFIIYAEVKRRMKQVFLFCVGLI--VVAEIVLWGAGQHF 302 + I + ++ +I V R+ + +G+I I+L A + + Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 303 WELVIGVQLFFLAFNL--MEALLPSLISKESPAGYKGTAMGVYSTSQFLGVALGGSLGGW 360 I V L + ++A+L + +E +G+ + S + +G L ++ Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361 Query: 361 IDGTFDGQTVFLAGAVLAMVWL 382 T++G ++AGA L ++ L Sbjct: 362 SITTWNG-WAWIAGAALYLLCL 382
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 55.6 bits (134), Expect = 1e-10 Identities = 56/306 (18%), Positives = 108/306 (35%), Gaps = 17/306 (5%) Query: 19 FTSWMLDAFDFFILVFVLSDLAEWFHAS---VSDVSIAIMLTLAVRPIGALLFGRMAEKY 75 ++ LDA +++ VL L S + I + L ++ A + G +++++ Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70 Query: 76 GRRPILMLNILFFTVFELLSAWSPTFMAFLIFRVMYGVAMGGIWGVASSLAMETIPDRSR 135 GRRP+L++++ V + A +P I R++ G+ G VA + + R Sbjct: 71 GRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDER 129 Query: 136 ----GLMSGIFQAGYPCGYLFASVIFGLFYSMVGWRGMFLIGA---LPVVLLPYIWFKVP 188 G MS F G + A + G F A L Sbjct: 130 ARHFGFMSACFGFG-----MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 189 ESPVWLAARARKENTALLPVLRKQWKLCLYLVLVMAFFNFFSHGTQDLYPTFLKMQHSFD 248 R N + + L+ V L+ F + + +D Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244 Query: 249 PHLISI-IAIFYNIAAMLGGIFYGTLSERIGRKKAIMIAAFLALPVLPLWAFSSGSFTIG 307 I I +A F + ++ + G ++ R+G ++A+M+ L AF++ + Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304 Query: 308 LGAFLM 313 L+ Sbjct: 305 PIMVLL 310 Score = 33.6 bits (77), Expect = 0.001 Identities = 37/186 (19%), Positives = 77/186 (41%), Gaps = 10/186 (5%) Query: 3 TPLNWTTTQRHVAFASFTSWMLDAF-DFFILVFVLSDLAEWFHASVSDVSIAIMLTLAVR 61 W VA +++ ++V+ + FH + + I++ + Sbjct: 201 ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG-EDRFHWDATTIGISLAAFGILH 259 Query: 62 PIG-ALLFGRMAEKYGRRPILMLNILF-FTVFELLSAWSPTFMAFLIFRVMYGVAMG--G 117 + A++ G +A + G R LML ++ T + LL+ + +MAF I ++ +G Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPA 319 Query: 118 IWGVASSLAMETIPDRSRGLMSGIFQAGYPCGYLFASVIFGLFYSMVGWRG-MFLIGA-L 175 + + S E + +G ++ + G L + I+ S+ W G ++ GA L Sbjct: 320 LQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA--ASITTWNGWAWIAGAAL 377 Query: 176 PVVLLP 181 ++ LP Sbjct: 378 YLLCLP 383
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 134 bits (339), Expect = 7e-43 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 21/183 (11%) Query: 1 MKRRSSFLVFLGLLLASPLALANDQHTVSFGYAQTHLSSLKNSDSKDLRGFNFKYRYEFN 60 MK+ + +L + TV+ GYAQ+ N + GFN KYRYE + Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMN----KMGGFNLKYRYEED 56 Query: 61 ET-WGMLGSFTATRNEMENYTWKEGKLHKNGSDSVDYGSLMFGPTYRFNDYVSLYGNAGI 119 + G++GSFT T + K Y + GP YR ND+ S+YG G+ Sbjct: 57 NSPLGVIGSFTYTEKSRTASSGDYNK--------NQYYGITAGPAYRINDWASIYGVVGV 108 Query: 120 ATMKF--------NKHSKEDSFAYGAGVIFNPVKSISIDASWEASRFFAVDTNTFGVSVG 171 KF + + F+YGAG+ FNP++++++D S+E SR +VD T+ VG Sbjct: 109 GYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVG 168 Query: 172 YRF 174 YRF Sbjct: 169 YRF 171
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 765 bits (1977), Expect = 0.0 Identities = 261/880 (29%), Positives = 413/880 (46%), Gaps = 63/880 (7%) Query: 4 TINLNRKS-LALLIAIVCSGSAQG----EEYYFDPALLQGATYGQ-NIARFNE-QQTPSG 56 I +R + + + + C+ +AQ E YF+P L +++RF Q+ P G Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 57 DYLADVYVNGTLVTASTNIRFNAVKEGQQAEPCLPLSVMKAAQIKSLPETDAA----TEC 112 Y D+Y+N + A+ ++ FN Q PCL + + + + + + C Sbjct: 77 TYRVDIYLNNGYM-ATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135 Query: 113 RPLREWVPHAGWQFDSATLRLLLTIPMTELTHKPRGYISPSEWDSGALALFLRHNTNWTH 172 PL + A Q D RL LTIP ++++ RGYI P WD G A L +N + Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195 Query: 173 TENTDSHYRYQYLWSGLNMGVNLGLWQVRHQSNLRYANSNQS-GSAWRYNSVRTWVQRPV 231 +N Y + L G+N+G W++R + Y +S+ S GS ++ + TW++R + Sbjct: 196 VQN-RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254 Query: 232 ASINSILSLGDSYTDSSLFGSLSFNGVKLVTDERMRPQGKRGYAPEVRGVAASSAHVVVK 291 + S L+LGD YT +F ++F G +L +D+ M P +RG+AP + G+A +A V +K Sbjct: 255 IPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIK 314 Query: 292 QLGKVIYETNVPPGPFYIDDLYNTRYQGDLEVEVIEASGKTSRFTVPYSSVPDSVRPGNW 351 Q G IY + VPPGPF I+D+Y GDL+V + EA G T FTVPYSSVP R G+ Sbjct: 315 QNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHT 374 Query: 352 HYSLAFGRVRQYY--DIENRFFEGTFQHGVNNTITLNLGSRIAQRYQAWLAGGVWATGM- 408 YS+ G R + RFF+ T HG+ T+ G+++A RY+A+ G G Sbjct: 375 RYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGAL 434 Query: 409 GAFGLNATWSNARAEHNERQQGWRAELSYSKTFT-TGTNLVLAAYRYSTNGFRDLQDVLG 467 GA ++ T +N+ + + G Y+K+ +GTN+ L YRYST+G+ + D Sbjct: 435 GALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTY 494 Query: 468 VRREAKTGI-------------DYYSDTLHQRNRLSATVSQPLGRLGTLNLSASTADYYN 514 R DYY+ ++R +L TV+Q LGR TL LS S Y+ Sbjct: 495 SRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWG 554 Query: 515 NQSRITQLQMGYSNQWRNISYGVNIARQRTTWDYDRFYHGVNEPLDVSSRQKYTETTMSF 
574 + Q Q G + + +I++ ++ + + W + ++ Sbjct: 555 TSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG------------------RDQMLAL 596 Query: 575 NVSIPLDWGENRTSVA------MNYNQSSQSRSST---VSMTGSSGENSDLSWSVYGGYE 625 NV+IP S + +Y+ S + G+ E+++LS+SV GY Sbjct: 597 NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656 Query: 626 RYRNSNSDSSAPTTFGGNLQQNTRFGALRANYDQGDNYRQEGLGASGTLVLHPGGLTAGP 685 + NS S+ L +G Y D+ +Q G SG ++ H G+T G Sbjct: 657 GGGDGNSGSTG----YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQ 712 Query: 686 YTSDTFALIHADGAQGAIVQNGQGAVVDRFGYAILPSLSPYRINNVTLDTRKMRSDAELT 745 +DT L+ A GA+ A V+N G D GYA+LP + YR N V LDT + + +L Sbjct: 713 PLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD 772 Query: 746 GGSQQIVPYAGAIARVNFATISGKAVLISVKMPDGGIPPMGADVFNGEGTNIGMVGQSGQ 805 +VP GAI R F G +L+++ + P GA V + + G+V +GQ Sbjct: 773 NAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQ 831 Query: 806 IYARIAHPSGSLLVRWGTGANQRCRVAYQLDLHTKEPFLY 845 +Y +G + V+WG N C YQL +++ L Sbjct: 832 VYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLT 871
>PF01540#Adhesin lipoprotein Length = 475 Score = 33.9 bits (77), Expect = 0.002 Identities = 21/59 (35%), Positives = 27/59 (45%), Gaps = 9/59 (15%) Query: 94 DANGSQVDYIANVLKYDPDQYSI---------EADKKFKYSVKLSDYPTLQDAASAAVD 143 DA Q + +A LK +PD I EA K FK + DYP + SAAV+ Sbjct: 42 DAALKQANALAEELKKNPDYSKILETLNKEIAEATKSFKEAGSYGDYPAIISKLSAAVE 100
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 26.7 bits (59), Expect = 0.005 Identities = 11/28 (39%), Positives = 14/28 (50%) Query: 6 QTIPELLIQTRGNQTEVARMLSCARGTV 33 I L TRGNQ + A +L R T+ Sbjct: 439 PLILAALTATRGNQIKAADLLGLNRNTL 466
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 31.2 bits (71), Expect = 0.002 Identities = 10/49 (20%), Positives = 20/49 (40%), Gaps = 3/49 (6%) Query: 123 MTEATEL---LYSRNGMTATQKYEAIQAIFTQLTDHAKTGSRRGLRSFG 168 M +L + +T A+ A+F+ ++ + G + L FG Sbjct: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFG 49
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 25.9 bits (57), Expect = 0.027 Identities = 8/22 (36%), Positives = 13/22 (59%) Query: 22 RAAEHLGLNINQFYYIAKKLSL 43 +AA+ LGLN N ++L + Sbjct: 454 KAADLLGLNRNTLRKKIRELGV 475
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 33.0 bits (75), Expect = 5e-04 Identities = 14/62 (22%), Positives = 26/62 (41%), Gaps = 7/62 (11%) Query: 146 VGLAHVKLSNNTIPVGFGINETLSASKNNFAWGAGIGAKYAVTDNIMIDASYKYINAGKV 205 VG+ + K P S F++GAG+ ++ +N+ +D SY+ V Sbjct: 106 VGVGYGKFQTTEYPTYKH-----DTSDYGFSYGAGL--QFNPMENVALDFSYEQSRIRSV 158 Query: 206 SI 207 + Sbjct: 159 DV 160
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 69.1 bits (169), Expect = 4e-14 Identities = 98/658 (14%), Positives = 194/658 (29%), Gaps = 92/658 (13%) Query: 78 PAAERQKALAALSRPLLRNSNLVCGVSEAK-------DSSECGYVATDKEDVAVIFDENN 130 Q + L+R L + G++ A C + + D D Sbjct: 98 TGDSEQGIVPCLTRAQLASM----GLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQ 153 Query: 131 AQLSLFLNRDWLPDEERRDKRWLTPT--PEGVSAF-----IHRQTLYLSDDLHSRNMTLN 183 +L+L + + ++ R + ++ P G++A ++ +S LN Sbjct: 154 QRLNLTIPQAFMS---NRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLN 210 Query: 184 GSGALGLGDGRYLGGNWAAIWNQSEHYNNSQAWFDNLFVRQDLGNQYYLQAGRMDQRNLS 243 L +G R L N +N S+ + S+ + + +L+ R S Sbjct: 211 LQSGLNIGAWR-LRDNTTWSYNSSDSSSGSKNKWQH--------INTWLE--RDIIPLRS 259 Query: 244 SATGGDFGFSLLPLS--RFDGLRTGTTQAYVNHEVDQNATPVMVQVTRNARIDIYRGSEL 301 T GD F G + + + A + A++ I + Sbjct: 260 RLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYD 319 Query: 302 LGSQFLTPGMHTLDTHSLPPGSYPLALRVYEDGILRRTETQPFS-------KGGNRFSAQ 354 + + + PG T++ S L + + E + T P+S +G R+S Sbjct: 320 IYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS-- 377 Query: 355 TQWFIQGGLEDTGDKASHYDGETVMAAGFQTGLRKNISLTEGISLAHE----AWYSETRL 410 G + + + GL ++ G LA + + Sbjct: 378 ---ITAGEYRSGN---AQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNM 431 Query: 411 NSQHAV-LDGTLDLSAGILHGTDSTSGNTEQVTYNDGFSASLWRNHTESDACSGRHPQSV 469 + A+ +D T + L G + + YN + S T R+ S Sbjct: 432 GALGALSVDMTQ--ANSTLPDDSQHDGQSVRFLYNKSLNES----GTNIQLVGYRYSTSG 485 Query: 470 HASMTCQTSMNASLSVPVGNWYALLGYSTSRTEGRPVYRGYDDNSDKENVF--------- 520 + + T N Y + Y+ +K Sbjct: 486 YFNFADTTYSRM-------NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLG 538 Query: 521 -WRQAYIPASHRE-------SAQASATYSLNMAGMNINTHGGVWRTRNDGVNDDGLFMSV 572 Y+ SH+ Q A + +N + + D L ++V Sbjct: 539 RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNV 598 Query: 573 SVSYASQ-PPTMTGSNGYTSAGTDIHNSRNQKTQTSWNVNHVRSWQQDLYRELSVGFSGY 631 ++ ++ + SA + + N + V +L + G++G Sbjct: 599 NIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGG 658 Query: 632 
NDDSWSGSLGGRMS--GRMGELSATISNSHQRNAGSASSLTAGYSSSLALSRNGLFWG 687 D + + ++ G G + S+S L G S + NG+ G Sbjct: 659 GDGNSGSTGYATLNYRGGYGNANIGYSHS-----DDIKQLYYGVSGGVLAHANGVTLG 711
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 824 bits (2129), Expect = 0.0 Identities = 306/872 (35%), Positives = 450/872 (51%), Gaps = 52/872 (5%) Query: 4 KQPALLLFIAGVVHCANA-------HAYTFDASML-GDAAKGVDMSLFNQG-VQQPGTYR 54 K F+ V CA A F+ L D D+S F G PGTYR Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79 Query: 55 VDVMVNGKRVDTRDVVFKLEKDGQGTPFLASCLTVSQLSRYGVKTEDYPQLWKAAKPPDE 114 VD+ +N + TRDV F QG + CLT +QL+ G+ T + D Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQG---IVPCLTRAQLASMGLNTASVSGMNLL--ADDA 134 Query: 115 CADLT-AIPQAKAVLDINNQQLQLSIPQLALRPEFKGIAPEDIWDDGIPAFLMNYSARTT 173 C LT I A A LD+ Q+L L+IPQ + +G P ++WD GI A L+NY+ Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 174 QTDYKMDMERRDNSSWVQLQPGINIGAWRVRNATSWQR-----SSQLSGKWQAAYTYAER 228 ++ + +++ LQ G+NIGAWR+R+ T+W SS KWQ T+ ER Sbjct: 195 SVQNRIG--GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLER 252 Query: 229 GLYSLKSRLTLGQKTSQGEIFDSVPFTGVMLASDDNMVPYSERQFAPVVRGIARTQARVE 288 + L+SRLTLG +QG+IFD + F G LASDDNM+P S+R FAPV+ GIAR A+V Sbjct: 253 DIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 289 VKQNGYTIYNTTVAPGPFALRDLSVTDSSGDLHVTVWEADGSTQMFVVPYQTPAIALHQG 348 +KQNGY IYN+TV PGPF + D+ +SGDL VT+ EADGSTQ+F VPY + + +G Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 349 YLKYSLLAGRYRSSDSATDKAQIAQATLMYGLPWNLTAYGGIQSATHYQAASLGLGASLG 408 + +YS+ AG YRS ++ +K + Q+TL++GLP T YGG Q A Y+A + G+G ++G Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432 Query: 409 RWGSLSVDGSDTHSQRQGEAVQQGASWRLRYSNQLTATGTNFSLTRWQYASQGYNTLSDV 468 G+LSVD + +S ++ G S R Y+ L +GTN L ++Y++ GY +D Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 469 LDSYRHDGNRL-------------WSWRENLQPSSRTILMLSQSWGRHLGNLSLTGSRTD 515 S + N + + L ++Q GR L L+GS Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQT 551 Query: 516 WRNRPGHDDSYGLSWGTSIGGGSLSLNWNQNRTLWRNGAHSKENITSLWFSMSLSRWTGN 
575 + D+ + T+ + +L+++ + W+ G ++ + +L ++ S W + Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG---RDQMLALNVNIPFSHWLRS 608 Query: 576 -------NVSASWQMTSPSHGGQMQQVGGNGEAFSQ-QLDWEVRQSYRADAPPGGGNNSA 627 + SAS+ M+ +G G G L + V+ Y G+ Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 628 LHLAWNGGYGLLGGDYSYSRAMRQMGVNIAGGIVIHHHGVTLGQPLQGSVALVEAPGASG 687 L + GGYG YS+S ++Q+ ++GG++ H +GVTLGQPL +V LV+APGA Sbjct: 669 ATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKD 728 Query: 688 VPVGGWPGVKTDFRGDTTVGNLNVYQENTVSLDPSRLPDDAEVTQTDVRVVPTEGAVVEA 747 V GV+TD+RG + Y+EN V+LD + L D+ ++ VVPT GA+V A Sbjct: 729 AKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788 Query: 748 KFHTRIGARALMTLKREDGSAIPFGAQVTVNGQDGSAALVDTDSQVYLTGLADKGELTVK 807 +F R+G + LMTL + +PFGA VT + S+ +V + QVYL+G+ G++ VK Sbjct: 789 EFKARVGIKLLMTLTH-NNKPLPFGAMVT-SESSQSSGIVADNGQVYLSGMPLAGKVQVK 846 Query: 808 WGA---QQCRVNYQLPAHKGIAGLYQMSGLCR 836 WG C NYQLP L Q+S CR Sbjct: 847 WGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 27.8 bits (61), Expect = 0.008 Identities = 15/56 (26%), Positives = 26/56 (46%) Query: 1 MRIIKGFDSLLSEVKTLPDVGWLYVDKEFNLKSKMDILNKDYYLAENRDESFDMAE 56 +++ K F + L E + D+ + K KS D K A+N+ +S D A+ Sbjct: 123 IQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPAD 178
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 70.3 bits (172), Expect = 2e-15 Identities = 36/128 (28%), Positives = 58/128 (45%), Gaps = 16/128 (12%) Query: 317 QHSRVVFRGDAMFVPGQKTVSDAIRPVINKAAREIARVG---GAVTVTGHTDSQPIHSAE 373 Q + D +F + T+ + +++ +++ + G+V V G+TD I S Sbjct: 211 QTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDA 268 Query: 374 FPSNLVLSEKRAAEVAALLTSGGVPAGRVHIVGKGDTVPVADN---------GSKAGRAK 424 + N LSE+RA V L S G+PA ++ G G++ PV N A Sbjct: 269 Y--NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAP 326 Query: 425 NRRVEILV 432 +RRVEI V Sbjct: 327 DRRVEIEV 334
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.0 bits (78), Expect = 0.002 Identities = 41/202 (20%), Positives = 65/202 (32%), Gaps = 35/202 (17%) Query: 512 LVEPDADDKTTLQQAETALREWQGDAPVV----FPEVSAAVVA--AIVADWTGIP--AGR 563 +V PD + L + +++ + D PV+ A+ A D+ P Sbjct: 55 VVMPDENAFDLLPR----IKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110 Query: 564 MVKDEASQVLELPARLAQRVTGQDGALAQIGE--RIQTAR---AGLGDPRKPVGVFMLAG 618 ++ + E R ++ + +G +Q A L + M+ G Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL---MITG 167 Query: 619 PSGVGKTETALALAEAIYGGEQNLITINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEA 678 SG GK A AL + + INM+ S L G E G T A Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGA 219 Query: 679 VRRHPWSV-------VLLDEIE 693 R + LDEI Sbjct: 220 QTRSTGRFEQAEGGTLFLDEIG 241
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.009 Identities = 31/198 (15%), Positives = 64/198 (32%), Gaps = 36/198 (18%) Query: 177 QQQSQERAARAELLQYQLKELNDFNPQAGEFEQIDEEYKRLANSGQLLTTSQNALALLAD 236 + QS AR E +YQ+ + + E + DE Y + + ++L + + Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS- 196 Query: 237 GEDVNLQSQLYSAKQLVSELVGMDSKLSGILDMLEEATIQLTEASDELRHYCERLDLDPN 296 Q+Q Y Q L ++ +L A I E + Sbjct: 197 ----TWQNQKY---QKELNLDKKRAERLTVL-----ARINRYENLSRV------------ 232 Query: 297 RLFELEQRIAKQISLARKHHVSPEALPQLYQSLLEEQQQLDDQADSLETLTLAVNKHHQQ 356 + R+ SL K ++ ++LE++ + + + L + + + Sbjct: 233 ----EKSRLDDFSSLLHKQAIA-------KHAVLEQENKYVEAVNELRVYKSQLEQIESE 281 Query: 357 ALETAQALHQQRQFYAQE 374 L + Q + E Sbjct: 282 ILSAKEEYQLVTQLFKNE 299
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 27.6 bits (61), Expect = 0.018 Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 8/78 (10%) Query: 36 GSRVLESSPAQMTAAVDVSKAGISKTFTTRNQLTRNQSILMHLVDGPFKKLIGGWK---- 91 G+ VLE P+ + +D G + + LT I +++G +++ + Sbjct: 113 GNAVLEVDPSITFSIIDRLFGGTGQAAKVQRDLT---DIENSVMEGVIVRILANVRESWT 169 Query: 92 -FTPLSPEACRIEFQLDF 108 L P +IE F Sbjct: 170 QVIDLRPRLGQIETNPQF 187
>INTIMIN#Intimin signature. Length = 939 Score = 46.2 bits (109), Expect = 3e-06 Identities = 63/315 (20%), Positives = 107/315 (33%), Gaps = 38/315 (12%) Query: 2724 TPAQTNGQPLLAFAQDKAGNTGIAAGFTAPDTRVPEAPIITNVVDDVGIYTGAIANGQ-- 2781 +N + A A D+ GN+ T + V D T A A+G Sbjct: 518 VQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEA 577 Query: 2782 VTNDAQPTLNGTAQAGATVS--IYNNGALLGTTTANASGNWSFTPTGNLTEGSHAFT-AT 2838 +T A NG AQA VS I + A+L +AN +G+ T T L + Sbjct: 578 ITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVT--LKSDKPGQVVVS 635 Query: 2839 ATNANGTGSVSSAATVIVDTLAPGTPSGTLSADGGSLSGLAEANSTVTVTLT-------- 2890 A A T ++++ A + VD + AD + +A +T T+ Sbjct: 636 AKTAEMTSALNANAVIFVDQTK--ASITEIKAD--KTTAVANGQDAITYTVKVMKGDKPV 691 Query: 2891 -----------GGVTLTT-TAGSNGAWSLTLPTKQIEGQLINVTATDAAGN-ASGTLGIT 2937 G ++ +T +NG +TL + L++ +D A + + + Sbjct: 692 SNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF 751 Query: 2938 APVLPLAARDNITSLDLTSTAVTSTQNYSDYGLLLVGALGNVASVLGN------DTTQVE 2991 + I + T Y L G G N D + + Sbjct: 752 TTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ 811 Query: 2992 FTIAEGGTGDVTIDA 3006 T+ E GT +++ + Sbjct: 812 VTLKEKGTTTISVIS 826 Score = 41.6 bits (97), Expect = 7e-05 Identities = 64/272 (23%), Positives = 91/272 (33%), Gaps = 22/272 (8%) Query: 1508 TLPVTSALPDGVYTLTAIAADAAGNSSGVSNSFTFTVDTVPLQPPVVN--EILDDVAPVT 1565 LP VY +TA A D GNS SN+ T+ TV VV+ + D A T Sbjct: 513 ILPAYVQGGSNVYKVTARAYDRNGNS---SNNVLLTI-TVLSNGQVVDQVGVTDFTADKT 568 Query: 1566 GPLTDG--AFTNDRTLTINGSGENGSTVTIYDNGVAIGTALVTDGVWTFN-----TSELS 1618 DG A T T+ NG + V+ + GTA+++ N T L Sbjct: 569 SAKADGTEAITYTATVKKNGVAQANVPVSF---NIVSGTAVLSANSANTNGSGKATVTLK 625 Query: 1619 EASHALTFSATDDAGNTTAQTQPITITVDITAPPAPTIQTVADDGTRVAGLADPYA-TVE 1677 + A T+A I VD T I+ AD T VA D TV+ Sbjct: 626 SDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIK--ADKTTAVANGQDAITYTVK 683 Query: 1678 IHHADGTLVGSAVANGTGEFVVTLSPAQTDGGTLTAIAIDRAGNNGPATNFPASDSGLPA 1737 + D + V T ++ S +TD + + + G + Sbjct: 684 
VMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLT-STTPGKSLVSARVSDVAVD 742 Query: 1738 VPAITAIEDDVGSIQGNIAA--GGATDDTMPT 1767 V A +I G +PT Sbjct: 743 VKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPT 774 Score = 37.4 bits (86), Expect = 0.001 Identities = 75/370 (20%), Positives = 137/370 (37%), Gaps = 45/370 (12%) Query: 2197 IYNGSALVGTA-QVQANGSWSFT-------PSTSLGAGVWNLTATATDAAGNTSAASEIR 2248 +++ SAL Q+Q +GS S G+ V+ +TA A D GN+S + + Sbjct: 486 VWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSS-NNVLL 544 Query: 2249 SFTIDTTAPAAPVIDTVYDGTGPITGNLSSGQ--ITDEARPVISGTREAN--TAIRLYDN 2304 + T+ + V D T T + G IT A +G +AN + + Sbjct: 545 TITV-LSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSG 603 Query: 2305 GTLLAEIPADNSSSWRYTPDASLATGNHVITVIAVDAAGNASPV-SDSVNFVVDTTPPLT 2363 +L+ A+ + S + T +L + V++ A S + +++V FV T +T Sbjct: 604 TAVLSANSANTNGSGKAT--VTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASIT 661 Query: 2364 PVITSVSDDQAPGLGTIANGQN--TNDPTPTFSGTAEAGATITLYENGTVIGTTTAQ--P 2419 + A +ANGQ+ T + +T + +T + Sbjct: 662 EI--KADKTTA-----VANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDT 714 Query: 2420 DGAWSVSTSTLASGTHVITAVATDAAGNSSPNSTAFTLTVDTTAPQTPILTSVVDDVAGG 2479 +G V+ ++ G +++A +D A + F I ++ V G Sbjct: 715 NGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF-------TTLTIDDGNIEIVGTG 767 Query: 2480 VTGNLANGQITNDNRPTLNGTAEAGSV-VTIYDGNTLLGVTSANAGGAWSFTPTTGLNDG 2538 V G L + +N A G+ T N + A++G T G Sbjct: 768 VKGKLPTVWL---QYGQVNLKASGGNGKYTWRSANPAIASVDASSG------QVTLKEKG 818 Query: 2539 TRILTVTATD 2548 T ++V ++D Sbjct: 819 TTTISVISSD 828
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 4e-04 Identities = 32/224 (14%), Positives = 63/224 (28%), Gaps = 32/224 (14%) Query: 209 DVVQTEARIESARSQLAQYQANLDSAKASLMSWLGWNSLNGINNDFPAKLARSCETATPD 268 + EA +S L Q + + S D P S E Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Query: 269 DRLVPAVLAAW-AQANVARANLDYASAQ---MTPTISLEPSVQHYLNDKYPSHEVLDKTQ 324 L+ + W Q NLD A+ + I+ ++ + L Q Sbjct: 188 TSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247 Query: 325 YSTWVKVEMPLYQGGGLTARRNAASHAVDAAQSTIQRTRLDVRQKLMEARSQAMSLASAL 384 V + ++ +L +SQ + S Sbjct: 248 AIAKHAVL-------------------------EQENKYVEAVNELRVYKSQLEQIES-- 280 Query: 385 QILRRQQQLSERTRELYQQQYLDLGSRPLLDVLNAEQEVYQARF 428 +IL +++ T +L++ + LD + ++ E+ + Sbjct: 281 EILSAKEEYQLVT-QLFKNEILDKLRQTTDNIGLLTLELAKNEE 323
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 243 bits (621), Expect = 3e-78 Identities = 95/432 (21%), Positives = 176/432 (40%), Gaps = 56/432 (12%) Query: 9 ERAFSGAGRIVLICSLLFLILGI-WAWFGRLDEVSTGNGKVIPSSREQVLQSLDGGILAQ 67 E S R+V + FL++ + G+++ V+T NGK+ S R + ++ ++ I+ + Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 68 LTVREGDRVQANQIVARLDPTRLASNVGESAAKYRASLASSARLTAEVSDLPL------- 120 + V+EG+ V+ ++ +L ++ ++ + + R + L Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 121 --AFPAELNGWPDLIAAETRLYKSR-----------RAQLADTEAELRDALASVNK---- 163 P N + + T L K + L AE LA +N+ Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 164 ------ELTITQRLEKSGAASHVEVLRLQRQKSDLG---------------------LKI 196 L L A + VL + + + + Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 197 TDLRSQYYVQAREALSKANAEVDMLSAILKGREDSVTRLTVRSPVRGIVKNIQVTTIGGV 256 + + + + L + + +L+ L E+ +R+PV V+ ++V T GGV Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 257 IPPNGEMMEIVPVDDRLLIETRLSPRDIAFIHPGQRALVKITAYDYAIYGGLDGVVETIS 316 + +M IVP DD L + + +DI FI+ GQ A++K+ A+ Y YG L G V+ I+ Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409 Query: 317 PDTIQDKVKPEIFYYRVFIRTHQDYLQNKSGRRFSIVPGMIATVDIKTGEKTIVDYLIKP 376 D I+D+ + V I ++ L + + GM T +IKTG ++++ YL+ P Sbjct: 410 LDAIEDQRLGL--VFNVIISIEENCLSTG-NKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466 Query: 377 F-NRAKEALRER 387 E+LRER Sbjct: 467 LEESVTESLRER 478
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.7 bits (79), Expect = 0.002 Identities = 20/98 (20%), Positives = 32/98 (32%), Gaps = 6/98 (6%) Query: 88 ESPTKKQTQALEAQWRAVSRLEQKQQQETRQMAAARAELYRLGLSAGGGARETARIARET 147 E+P A ++ KQ+ +T + A RE A+ A+ Sbjct: 1022 EAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT------ETTAQNREVAKEAKSN 1075 Query: 148 ERYNRQLAEQERRLREVGERQRKLNAIKAKAEKTRELR 185 + N Q E + E E Q A EK + + Sbjct: 1076 VKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK 1113
>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature. Length = 239 Score = 432 bits (1112), Expect = e-158 Identities = 237/239 (99%), Positives = 237/239 (99%) Query: 2 TKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTES 61 TKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTES Sbjct: 1 TKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTES 60 Query: 62 SATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSK 121 SATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSK Sbjct: 61 SATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSK 120 Query: 122 NKDQCCNLLISKGINIAPFLQEIGEAAKNVGLPGTTKNDVFTPSGAGANPFITPLISSAN 181 NKDQCCNLLISKGINIAPFLQEIGEAAKN GLPGTTKNDVFTPSGAGANPFITPLISSAN Sbjct: 121 NKDQCCNLLISKGINIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSAN 180 Query: 182 SKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQYTP 240 SKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQ TP Sbjct: 181 SKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQNTP 239
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 24.9 bits (54), Expect = 0.035 Identities = 11/32 (34%), Positives = 16/32 (50%) Query: 15 AVLLAWLGDLSLKDASTVGGVLIGVLMLAINW 46 A W+ DLS +D + +L+GV M I Sbjct: 449 APFALWIHDLSAQDPYYILPILMGVTMFFIQK 480
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 30.1 bits (67), Expect = 0.033 Identities = 21/70 (30%), Positives = 33/70 (47%) Query: 425 ATLMAGAIQQVSAGDFSQAVKGNRLASITGNEETEIAGQLSTKVAGAMNVDVGGTLTEKI 484 ++L AG A S + G ITGN IAG+ S++ AG + + G + ++ Sbjct: 1070 SSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQM 1129 Query: 485 AALRKSVAAG 494 A R + AG Sbjct: 1130 AGERGKLIAG 1139 Score = 30.1 bits (67), Expect = 0.034 Identities = 20/58 (34%), Positives = 29/58 (50%) Query: 437 AGDFSQAVKGNRLASITGNEETEIAGQLSTKVAGAMNVDVGGTLTEKIAALRKSVAAG 494 AG S + GNR I G ++ AG ST ++GA +V + G + IA + AG Sbjct: 1090 AGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAG 1147 Score = 29.3 bits (65), Expect = 0.045 Identities = 21/70 (30%), Positives = 33/70 (47%) Query: 425 ATLMAGAIQQVSAGDFSQAVKGNRLASITGNEETEIAGQLSTKVAGAMNVDVGGTLTEKI 484 +TLMAG +A + S G S+ G + + IAG ST+ AG + G + + Sbjct: 942 STLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQT 1001 Query: 485 AALRKSVAAG 494 A ++ AG Sbjct: 1002 AEHSSTLTAG 1011
>FLAGELLIN#Flagellin signature. Length = 507 Score = 274 bits (702), Expect = 2e-88 Identities = 262/515 (50%), Positives = 310/515 (60%), Gaps = 18/515 (3%) Query: 2 AQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGL 61 AQVINTNSLSLLTQNNLNKSQS+L +AIERLSSGLRINSAKDDAAGQAIANRFT+NIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLN 121 TQASRNANDGISIAQTTEGALNEINNNLQRVREL+VQ+ N TNS SDL SIQ EI QRL Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDSLNVQKAYDV 181 EIDRVS QTQFNGVKVL+QDN + IQVGANDGETI IDL++I+ ++LGLD NV + Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 KDTAVTTKAYADNGTTLDASGLDDAAIKAAIGGTTGTAAVTSGTVKFDADNNKYFVTIGG 241 + + G D + + + T+ TV D G Sbjct: 181 TVGDLKSSFKNVTGY--DTYAVGANKYRVDVNSGAVVTDTTAPTV---PDKVYVNAANGQ 235 Query: 242 FTGADAAKNGDYEVNVATDGKVTLATSATKTTMPAGAATKTEVQELKDTPAVVSADAKNA 301 T DA N + + A +A + E + D K Sbjct: 236 LTTDDAENNTAVD---LFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292 Query: 302 LIAGGVDTADANAATLVKMSYTDKNGKTIEGGYALKAGDKYYAA------DYDEATGAIK 355 G + N + G L++ Y + +D+ T Sbjct: 293 NDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNES 352 Query: 356 AKTTSYTAADGTTKTAANQLGGVDG----KTEVVTIDGKTYNASKAAGHDFKAQPELAEA 411 AK + A + + + G + + VT+ GKT K A E A A Sbjct: 353 AKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAA 412 Query: 412 AAKTTENPLQKIDAALAQVDALRSDLGAVQNRFNSAITNLGNTVNNLSEARSRIEDSDYA 471 A K+T NPL ID+AL++VDA+RS LGA+QNRF+SAITNLGNTV NL+ ARSRIED+DYA Sbjct: 413 AKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYA 472 Query: 472 TEVSNMSRAQILQQAGTSVLAQANQVPQNVLSLLR 506 TEVSNMS+AQILQQAGTSVLAQANQVPQNVLSLLR Sbjct: 473 TEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 0.007 Identities = 47/217 (21%), Positives = 66/217 (30%), Gaps = 49/217 (22%) Query: 992 PPG----TVVAVVGRSGTGKSTLIKLLAGLYSPGSGQIRVGER-----------LIDAAS 1036 PG V + G G GKSTLI L GL +G + + Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSE 649 Query: 1037 LSDYRRQTGLVTQDVALFSGDIAENI-RYSRPDSSDTEVEIAARQAGLFETV---QHL-- 1090 ++ +RR D + RY V+ RQ ++ T Q+L Sbjct: 650 MTAFRR------ADAEAVKAFFSSRKDRYRGA--YGRYVQDHPRQVVIWCTTNKRQYLFD 701 Query: 1091 PLGFRT--PVNNGG----TDLSAGQRQLIALA--------RAHLA--QAHILLLDEATAR 1134 G R PV G L + QL A A R + I E R Sbjct: 702 ITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQELR 761 Query: 1135 -IDRSAEERLMTSLTRVTHTEKRIALIVAHRLTTARR 1170 ++ + RL LTR A A + + Sbjct: 762 LVETGVQGRLWALLTREG---APAAEGAAQKGYSVNT 795
>PF06580#Sensor histidine kinase Length = 349 Score = 32.9 bits (75), Expect = 0.003 Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 21/101 (20%) Query: 370 LLDNALKY----TPEQGIVTARLEQDGDAVTLVVEDSGPGIDDEHIHLALQPFHRLDNVG 425 L++N +K+ P+ G + + +D VTL VE++G L N Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA--------------LKNTK 308 Query: 426 NVAGAGIGLALVND-IARLHRTHPHFSRSEALGGLYVRIRF 465 G GL V + + L+ T SE G + + Sbjct: 309 E--STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.8 bits (241), Expect = 2e-25 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%) Query: 2 RLLLAEDNRELAHWLEKALVQNGFAVDCVFDGLAADHLLHSEMYALAVLDINMPGMDGLE 61 +L+A+D+ + L +AL + G+ V + + + L V D+ MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VVQRLRKRGQTLPVLLLTARSAVADRVKGLNVGADDYLPKPFELEE-LDARLRALLRRSA 120 ++ R++K LPVL+++A++ +K GA DYLPKPF+L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 GQ 122 Sbjct: 125 RP 126
>INTIMIN#Intimin signature. Length = 939 Score = 27.3 bits (60), Expect = 0.028 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 6/69 (8%) Query: 92 SVDDQVKTTTPAAESQFYTVKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPE---KI 148 D ++ T FYT+K+G+T++ +SK N + I+ NK + S K Sbjct: 48 GSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQDINLST---IWSLNKHLYSSESEMMKA 104 Query: 149 YPGQVLRIP 157 PGQ + +P Sbjct: 105 EPGQQIILP 113
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 84.3 bits (208), Expect = 1e-21 Identities = 67/257 (26%), Positives = 120/257 (46%), Gaps = 7/257 (2%) Query: 3 QVAVVIGGGQTLGAFLCRGLAEEGYRVAVVDIQSDKAANVAQEINADFGEGMAYGFGADA 62 ++A + G Q +G + R LA +G +A VD +K V + A+ A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEQSVLALSRGVDEIFGRVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122 ++ ++ ++ G +D+LV AG+ + I +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSPMFQSL-LPQYATKLGIKPDEVEQYYIDKVPLKRGCDYQDVLNMLLFYASPK 241 G+ ++ M SL + + IK +E + +PLK+ D+ + +LF S + Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242 Query: 242 ASYCTGQSINVTGGQVM 258 A + T ++ V GG + Sbjct: 243 AGHITMHNLCVDGGATL 259
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.1 bits (60), Expect = 0.044 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%) Query: 1 MKPRQRQAAILEHLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40 M QR I E + + +EL ++ T T+ +D+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 373 bits (958), Expect = e-127 Identities = 122/340 (35%), Positives = 180/340 (52%), Gaps = 21/340 (6%) Query: 183 MIGLSPAMTQLKKEIEIVAGSDLNVLIGGETGTGKELVAKAIHQGSPRAVNPLVYLNCAA 242 ++G S AM ++ + + + +DL ++I GE+GTGKELVA+A+H R P V +N AA Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198 Query: 243 LPESVAESELFGHVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYG 302 +P + ESELFGH KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG 258 Query: 303 DIQRVGDDRSLRVDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLFVPPLRERGDDVV 362 + VG +R DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ Sbjct: 259 EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIP 318 Query: 363 LLAGYFCEQCRLRLGLSRVVLSPGARRHLLNYGWPGNVRELEHAIHRAVVLARATRAGDE 422 L +F +Q + GL A + + WPGNVRELE+ + R L E Sbjct: 319 DLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITRE 377 Query: 423 VVL-----EEQHFALS---------------EDVLPAPSAESFLALPACRNLRESTENFQ 462 ++ E + E+ + A ALP + Sbjct: 378 IIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEME 437 Query: 463 REMIRQALAQNNHNWAASARALETDVANLHRLAKRLGLKD 502 +I AL N +A L + L + + LG+ Sbjct: 438 YPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSV 477
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.0 bits (59), Expect = 0.011 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%) Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRLGQWVLVHVGFAMSVINEAEARDTLD 69 I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D + Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226 Query: 70 ALQN--MFDVEPDVG 82 A+ + V+PD+ Sbjct: 227 AINQEPVPHVQPDIA 241
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 384 bits (988), Expect = e-129 Identities = 142/373 (38%), Positives = 207/373 (55%), Gaps = 39/373 (10%) Query: 350 YQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYNVLKQVEMVAQSDSTVLILG 409 E+ + R + E +L + + ++GRS AM + + + + Q+D T++I G Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 410 ETGTGKELIARAIHNLSGRSGRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 469 E+GTGKEL+ARA+H+ R V +N AA+P L+ES+LFGHE+GAFTGA + GRF Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 470 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKLIQTDVRLIAATNRDLEKMV 529 E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G I++DVR++AATN+DL++ + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 530 ADREFRNDLYYRLNVFPIQLPPLRERPEDIPLLVKAFTFKIARRMGRNIDSIPAETLRTL 589 FR DLYYRLNV P++LPPLR+R EDIP LV+ F + A + G ++ E L + Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346 Query: 590 SSMEWPGNVRELENVVERAVLLTRGNVLQLS-LPDITAVTPDTSPVATESAKEG------ 642 + WPGNVRELEN+V R L +V+ + + SP+ +A+ G Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 643 ----------------------------EDEYQLIIRVLKETNGVVAGPKGAAQRLGLKR 674 E EY LI+ L T G AA LGL R Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIK---AADLLGLNR 463 Query: 675 TTLLSRMKRLGID 687 TL +++ LG+ Sbjct: 464 NTLRKKIRELGVS 476
>adhesinb#Adhesin B signature. Length = 310 Score = 321 bits (824), Expect = e-112 Identities = 89/309 (28%), Positives = 164/309 (53%), Gaps = 14/309 (4%) Query: 4 LHRLKTLLIAGIVAILAL-------SPAYAKEKFKVITTFTVIADMAKNVAGDAAEVSSI 56 + + + L++ + + S K V+ T ++IAD+ KN+AGD + SI Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 57 TKPGAEIHEYQPTPGDIKRAQGAQLILANGLNLER----WFARFYQHLSGVPE---VVVS 109 G + HEY+P P D+K+ A LI NG+NLE WF + ++ VS Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120 Query: 110 TGVKPMGITEGPYNGKPNPHAWMSAENALIYVDNIRDALVKYDPDNAQIYKQNAERYKAK 169 GV + + GK +PHAW++ EN +IY NI L + DP N + Y++N + Y K Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180 Query: 170 IRQMADPLRAELEKIPADQRWLVTSEGAFSYLARDNDMKELYLWPINADQQGTPKQVRKV 229 + + + + IP +++ +VTSEG F Y ++ ++ Y+W IN +++GTP Q++ + Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240 Query: 230 IDTIKKHHIPAIFSESTVSDKPARQVARESGAHYGGVLYVDSLSAADGPVPTYLDLLRVT 289 ++ ++K +P++F ES+V D+P + V++++ ++ DS++ +Y +++ Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300 Query: 290 TETIVNGIN 298 E I G++ Sbjct: 301 LEKIAEGLS 309
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 30.5 bits (68), Expect = 0.007 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 8/57 (14%) Query: 201 IISDLTRKWSQAEVAGKLFMSVSSLKRKLAAEEVSFSKIYLDARMNQAIKLLRMGAG 257 ++ LT + Q + F+S SS +R ++++YL+ RM +A++ R G G Sbjct: 66 VLDHLTGRSCQVGSSNSAFVSTSSSRR--------YTEVYLEHRMQEAVEAERAGRG 114
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 42.6 bits (100), Expect = 7e-07 Identities = 33/167 (19%), Positives = 63/167 (37%), Gaps = 10/167 (5%) Query: 23 LLKGLDQEQANEVIAVLQMHNIEANKIDSGKLGYSITVAEPDFTAAVYWIKTYQLPPRPR 82 L L + ++A L NI + +G +I V + LP Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNI-PYRFANG--SGAIEVPADKVHELRLRLAQQGLPKGGA 109 Query: 83 VEIAQMFPADSLVSSPRAEKARLYSAIEQRLEQSLQTMEGVLSARVHISYDIDA---GEN 139 V + + S +E+ A+E L ++++T+ V SARVH++ + E Sbjct: 110 VGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQ 168 Query: 140 GRPPKPVHLSALAVYERGSPLAHQISDIKRFLKNSFADVDYDNISVV 186 P V ++ QIS + + ++ A + N+++V Sbjct: 169 KSPSASVTVTLEPGRALDEG---QISAVVHLVSSAVAGLPPGNVTLV 212
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 28.1 bits (62), Expect = 0.044 Identities = 12/39 (30%), Positives = 21/39 (53%) Query: 234 MSTSTLKRKLAEEGTSFSDIYLSARMNQAAKLLRIGNHN 272 +S +K++ +GT+ IY+++ KLLRI N Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLG 279
>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature. Length = 468 Score = 303 bits (777), Expect = 1e-99 Identities = 68/212 (32%), Positives = 102/212 (48%), Gaps = 17/212 (8%) Query: 340 GKPVALAGSYPKNTPDALEAHMKMLLEKECSCLAVLTSEDQMQAKQ--LPAYFRGSYTFG 397 G +A YP LE+H +ML E LAVL S ++ ++ +P YFR S T+G Sbjct: 252 GNTRTIACQYP--LQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQSGTYG 309 Query: 398 EVHTNSQKVSSASQGGAI--DQYNMQL-SCGEKRYTIPVLHVKNWPDHQPLPS--TDQLE 452 + S+ G I D Y + + G+K ++PV+HV NWPD + S T L Sbjct: 310 SITVESKMTQQVGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVTKALA 369 Query: 453 YLADRVKNSNQNGAPGRSSS-----DKHLPMIHCLGGVGRTGTMAAALVLKDNPHSNL-- 505 L D+ + +N + SS K P+IHC GVGRT + A+ + D+ +S L Sbjct: 370 SLVDQTAETKRNMYESKGSSAVGDDSKLRPVIHCRAGVGRTAQLIGAMCMNDSRNSQLSV 429 Query: 506 EQVRADFRNSRNNRMLEDASQF-VQLKAMQAQ 536 E + + R RN M++ Q V +K + Q Sbjct: 430 EDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQ 461
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 33.6 bits (77), Expect = 5e-05 Identities = 16/111 (14%), Positives = 40/111 (36%), Gaps = 7/111 (6%) Query: 4 PLTFDDNNQCLLLLDSDIFTSIEAK--DDIWLLNGMIIPLSPVCGDSIWRQIMVINGELA 61 PL FDD+ C +++D+ ++ + LL G++ P D + ++ Sbjct: 21 PLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPH----KDIPQQCLLAGALNPL 76 Query: 62 ANNEGTLAYIDAAETLLFIHAI-TDLTNIYHIISQLESFVNKQEALKNILQ 111 N L + + +I + ++ + ++ + + Q Sbjct: 77 LNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWREASQ 127
>BACINVASINC#Salmonella/Shigella invasin protein C signature. Length = 409 Score = 514 bits (1324), Expect = 0.0 Identities = 407/409 (99%), Positives = 407/409 (99%) Query: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP Sbjct: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 Query: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS Sbjct: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 Query: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ Sbjct: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 Query: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV Sbjct: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 Query: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESV 300 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESV Sbjct: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESV 300 Query: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNLVTVGGIAGASGQYAATQERSEQQISQVN 360 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKN VTVGGIAGAS QYAATQERSEQQISQVN Sbjct: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVN 360 Query: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA Sbjct: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 835 bits (2158), Expect = 0.0 Identities = 590/593 (99%), Positives = 590/593 (99%) Query: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE Sbjct: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 Query: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE Sbjct: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 Query: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAAKKLTQAQNKLQSLDPADPG 180 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA KKLTQAQNKLQSLDPADPG Sbjct: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 Query: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN Sbjct: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 Query: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF Sbjct: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 Query: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA Sbjct: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 Query: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV Sbjct: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 Query: 421 AVIAVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 AVI VVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG Sbjct: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 Query: 481 NVGSKMGLQTNALSKELVGNTLNKVALSMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 NVGSKMGLQTNALSKELVGNTLNKVAL MEVTNTAAQSAGGVAEGVFIKNASEALADFML 
Sbjct: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 Query: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA Sbjct: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 128 bits (322), Expect = 2e-40 Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 4/160 (2%) Query: 4 QNNVSEERVAEMIWDAVSEGATLKDVHGIPQDMMDGLYAHAYEFYNQGRLDEAETFFRFL 63 Q + + + G T+ ++ I D ++ LY+ A+ Y G+ ++A F+ L Sbjct: 3 QETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQAL 62 Query: 64 CIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAK 123 C+ D Y+ + +GL A Q Q+ A Y+ + + R F +C L + A+ Sbjct: 63 CVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAE 122 Query: 124 ARQCF----ELVNERTEDESLRAKALVYLEALKTAETEQH 159 A EL+ ++TE + L + LEA+K + +H Sbjct: 123 AESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEH 162
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 340 bits (874), Expect = e-118 Identities = 119/360 (33%), Positives = 204/360 (56%), Gaps = 19/360 (5%) Query: 1 MSSNKTEKPTKKRLEDSAKKGQSFKSKDLIIACLTLGGIAYLVSYGSFN-EFMGIIKIII 59 MS KTE+PT K++ D+ KKGQ KSK+++ L + A L+ + E + +I Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 60 ADNFDQSMADYSLAVFGIGLKYLIPFMLLCL---VCSALPAL----LQAGFVLATEALKP 112 +QS +S A+ + L+ F LC +AL A+ +Q GF+++ EA+KP Sbjct: 61 ---AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKP 117 Query: 113 NLSALNPVEGAKKLFSMRTVKDTVKTLLYLSSFVVAAIICWKKYKVEIFSQLNGNVVDIA 172 ++ +NP+EGAK++FS++++ + +K++L + V+ +I+ W K + + L I Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKV---VLLSILIWIIIKGNLVTLLQLPTCGIE 174 Query: 173 VIWRELLLALVLTCLACA---LIVLLLDAIAEYFLTMKDMKMDKEEVKREMKEQEGNPEV 229 I L L + C +++ + D EY+ +K++KM K+E+KRE KE EG+PE+ Sbjct: 175 CITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 230 KSKRREVHMEILSEQVKSDIENSRLIVANPTHITIGIYFKPELMPIPMISVYETNQRALA 289 KSKRR+ H EI S ++ +++ S ++VANPTHI IGI +K P+P+++ T+ + Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 290 VRAYAEKVGVPVIVDIKLARSLFKTHRRYDLVSLEEIDEVLRLLVWLE--EVENAGKDVI 347 VR AE+ GVP++ I LAR+L+ + E+I+ +L WLE +E +++ Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHSEML 354
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 188 bits (479), Expect = 2e-61 Identities = 50/248 (20%), Positives = 107/248 (43%), Gaps = 4/248 (1%) Query: 1 MLYALYFEIHHLVASAALGFARVAPIFFFLPFLNSGVLSGAPRNAIIILVALGVWPHALN 60 ML + + RV + P L+ + + + +++ + P Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 EAPPFLSVAMIPLVLQEAAVGVMLGCLLSWPFWVMHALGCIIDNQRGATLSSSIDPANGI 120 P S + L +Q+ +G+ LG + + F + G II Q G + ++ +DPA+ + Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 DTSEMANFLNMFAAVVYLQNGGLVTMVDVLNKSYQLCDPMNEC--TPSLPPLLTFINQVA 178 + +A ++M A +++L G + ++ +L ++ E + + L + + Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 179 QNALVLASPVVLVLLLSEVFLGLLSRFAPQMNAFAISLTVKSGIAVLIMLLYFS--PVLP 236 N L+LA P++ +LL + LGLL+R APQ++ F I + + + +M Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 237 DNVLRLSF 244 +++ F Sbjct: 241 EHLFSEIF 248
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 88.7 bits (220), Expect = 4e-27 Identities = 86/86 (100%), Positives = 86/86 (100%) Query: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 Query: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86 FLLSGWYGEVLLSYGRQVIFLALAKG Sbjct: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 303 bits (777), Expect = e-107 Identities = 223/224 (99%), Positives = 223/224 (99%) Query: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLSKYSDRELVQFFENAQL 120 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYL KYSDRELVQFFENAQL Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 537 bits (1384), Expect = 0.0 Identities = 302/303 (99%), Positives = 302/303 (99%) Query: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL 60 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL Sbjct: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL 60 Query: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL Sbjct: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 Query: 121 HIMSDRGGLWFEHLPELPAVAGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 HIMSDRGGLWFEHLPELPAV GGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS Sbjct: 121 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 Query: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR Sbjct: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 Query: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG Sbjct: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 Query: 301 NGE 303 NGE Sbjct: 301 NGE 303
>SSPANPROTEIN#Salmonella invasion protein InvJ signature. Length = 336 Score = 601 bits (1550), Expect = 0.0 Identities = 332/336 (98%), Positives = 334/336 (99%) Query: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL Sbjct: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 Query: 61 PVLLAAWRHGAPAKSEHHNGNVSGLHHNGKGELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 P+LLAAWRHGAPAKSEHHNGNVSGLHHNGK ELRIAEKLLKVTAEKSVGLISAEAKVDKS Sbjct: 61 PLLLAAWRHGAPAKSEHHNGNVSGLHHNGKSELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 Query: 121 AALLSPKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 AALLS KNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR Sbjct: 121 AALLSSKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 Query: 181 KEGAPLARDVAPARMAAANTGKPDDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 KEGAPLARDVAPARMAAANTGKP+DKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA Sbjct: 181 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 Query: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH Sbjct: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 Query: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA Sbjct: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 167 bits (423), Expect = 2e-56 Identities = 141/147 (95%), Positives = 143/147 (97%) Query: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRGLQAEEEAILEQIAGLKLLLDTLRAEN 60 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDR LQ EEEAI+EQIAGLKLLLDTLRAEN Sbjct: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAEN 60 Query: 61 RQLSREEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNY 120 RQLSREEIY LLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQ+KSKYWLRKEGNY Sbjct: 61 RQLSREEIYALLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQEKSKYWLRKEGNY 120 Query: 121 QRWIIRQKRHYIQREIQQEEAESEEII 147 QRWIIRQKR YIQREIQQEEAESEEII Sbjct: 121 QRWIIRQKRLYIQREIQQEEAESEEII 147
>SSPAKPROTEIN#Invasion protein B family signature. Length = 133 Score = 205 bits (523), Expect = 7e-72 Identities = 43/133 (32%), Positives = 76/133 (57%) Query: 1 MQHLDIAELVRSALEVSGCDPSLIGGIDSHSTIVLDLFALPSICISVKDDDVWIWAQLGA 60 M ++++ +LVR +L GC PS+I +DSHS I + L ++P+I I++ ++ V +WA A Sbjct: 1 MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNEQVMLWANFDA 60 Query: 61 GSMVVLQQRAYEILMTIMEGCHFARGGQLLLGEQNGELTLKALVHPDFLSDGEKFSTALN 120 S V LQ AY IL ++ ++ + L + L L+ ++ D++ DG F+ L+ Sbjct: 61 PSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAEILH 120 Query: 121 GFYNYLEVFSRSL 133 FY +E+ + L Sbjct: 121 EFYQRMEILNGVL 133
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 604 bits (1557), Expect = 0.0 Identities = 372/372 (100%), Positives = 372/372 (100%) Query: 1 MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA Sbjct: 1 MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 Query: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP Sbjct: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 Query: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS Sbjct: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 Query: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR Sbjct: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 Query: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL Sbjct: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 Query: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE Sbjct: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 Query: 361 MAEQRRTIEKLS 372 MAEQRRTIEKLS Sbjct: 361 MAEQRRTIEKLS 372
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 576 bits (1486), Expect = 0.0 Identities = 169/540 (31%), Positives = 271/540 (50%), Gaps = 57/540 (10%) Query: 4 HILLARVLACAALVLVAPGYSSE----KIPVTGSGFVAKDDSLRTFFDAMALQLKEPVIV 59 H RVL L+L + ++ E IP +VAK +SLR V+V Sbjct: 6 HSFFKRVLTGTLLLLSSYSWAQELDWLPIPYV---YVAKGESLRDLLTDFGANYDATVVV 62 Query: 60 SKMAARKKITGNFEFHDPNALLEKLSLQLGLIWYFDGQAIYIYDASEMRNAVVSLRNVSL 119 S K++G FE +P L+ ++ L+WY+DG +YI+ SE+ + ++ L+ Sbjct: 63 SD-KINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEA 121 Query: 120 NEFNNFLKRSGLYNKNYPLRGDNRKGTFYVSGPPVYVDMVVNAATMMDKQND--GIELGR 177 E L+RSG++ + R D YVSGPP Y+++V A +++Q + G Sbjct: 122 AELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGA 181 Query: 178 QKIGVMRLNNTFVGDRTYNLRDQKMVIPGIATAIERLLQGEEQPLGNIVSSEPPAMPAFS 237 I + L DRT + RD ++ PG+AT ++R+L + + P Sbjct: 182 LAIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIP------ 235 Query: 238 ANGEKGKAANYAGGMSLQEALKQNAAAGNIKIVAYPDTNSLLVKGTAEQVHFIEMLVKAL 297 Q A + +A A ++ A P N+++V+ + E++ + L+ AL Sbjct: 236 -----------------QAATRASAQA---RVEADPSLNAIIVRDSPERMPMYQRLIHAL 275 Query: 298 DVAKRHVELSLWIVDLNKSDLERLGTSWSGSI-----------TIGDKLGVSLNQSSIST 346 D +E++L IVD+N L LG W I T GD+ ++ N + S Sbjct: 276 DKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASNGALGSL 335 Query: 347 LDG---SRFIAAVNALEEKKQATVVSRPVLLTQENVPAIFDNNRTFYTKLIGERNVALEH 403 +D +A VN LE + A VVSRP LLTQEN A+ D++ T+Y K+ G+ L+ Sbjct: 336 VDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKG 395 Query: 404 VTYGTMIRVLPRFSADG---QIEMSLDIEDGNDKTPQSDTTTSVDALPEVGRTLISTIAR 460 +TYGTM+R+ PR G +I ++L IEDGN Q ++ ++ +P + RT++ T+AR Sbjct: 396 ITYGTMLRMTPRVLTQGDKSEISLNLHIEDGN----QKPNSSGIEGIPTISRTVVDTVAR 451 Query: 461 VPHGKSLLVGGYTRDANTDTVQSIPFLGKLPLIGSLFRYSSKNKSNVVRVFMIEPKEIVD 520 V HG+SL++GG RD + + +P LG +P IG+LFR S+ VR+F+IEP+ I + Sbjct: 452 VGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRIIDE 511
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 82.6 bits (204), Expect = 2e-19 Identities = 65/387 (16%), Positives = 142/387 (36%), Gaps = 48/387 (12%) Query: 16 FLDLINLFIASVAFPAMSVDLHTSISALAWVSNGYIAGLTLIVPFSAFLSRYLGARRLII 75 F ++N + +V+ P ++ D + ++ WV+ ++ ++ LS LG +RL++ Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 76 FSLILFSVAAAAAGFADSLHS-LVFWRIVQGAGGGLLIPVGQALTWQQFEPHERAGVSSV 134 F +I+ + S S L+ R +QGAG + + + R + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 135 VMMVALLAPACSPAIGGLLVETCGWRWIFFATLPVAVLTLLLAYCWLNAASTT------- 187 + + + PAIGG++ W ++ + + L Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203 Query: 188 --------------MASARLLHL-------------------PLLTDRLLRFAMIVYLCV 214 S + L P + L + + + Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263 Query: 215 PGMFIGISVVGM-----FYLQNIAQLSPAAAGS-LMLPWSIASFVAIMLTGRYFNRLGPR 268 G I +V G + ++++ QLS A GS ++ P +++ + + G +R GP Sbjct: 264 CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323 Query: 269 PLIIVGCLLQAAGILLLTNVTPATSHRVLMMIFALMGAGGSLCSSTAQSGAFLTIARRDM 328 ++ +G + L + + TS + ++I ++G G S + + ++ +++ Sbjct: 324 YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVISTIVSSSLKQQEA 382 Query: 329 PDASALWNLNRQLSFFLGATLLTLLLN 355 +L N LS G ++ LL+ Sbjct: 383 GAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 87.5 bits (217), Expect = 9e-22 Identities = 56/219 (25%), Positives = 93/219 (42%), Gaps = 31/219 (14%) Query: 1 MQIIITGGGGFLGQKLASALLNSSL------AFNELLLVDLKMPARLS--DSPRLRCLEA 52 M+ ++TG GF+G ++ LL + N+ V LK ARL P + + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ-ARLELLAQPGFQFHKI 59 Query: 53 DLT-QPGVLENVITANTSVVYHLAA-------IVSSHAEDDFDLGWKVNLDLTRQLLEAC 104 DL + G+ + + + V+ + + HA D NL +LE C Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD------SNLTGFLNILEGC 113 Query: 105 RRQPQKIRFVFSSSLAVYGG--TLPECVTDTTALTPRSSYGAQKAACELLVNDYTRKGYV 162 R + +++SS +VYG +P D+ P S Y A K A EL+ + Y+ + Sbjct: 114 RHNKIQ-HLLYASSSSVYGLNRKMPFSTDDSVD-HPVSLYAATKKANELMAHTYSHLYGL 171 Query: 163 DGLALRLPTICVRPGKPNRAASSFVSAIIREPLQGETIV 201 LR T+ G+P+ A F A+ L+G++I Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAM----LEGKSID 206
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.004 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 12/84 (14%) Query: 290 IVATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGS 349 IVATA+G++ ++G + IK ++ ++V+E + V+ G + + + Sbjct: 82 IVATANGKLTHSGRSK-------EIKP---IENSIV--KEIIVKEGESVRKGDVLLKLTA 129 Query: 350 TGTSSTRLHFEIRYKGKSVNPLRY 373 G + L + + RY Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRY 153
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 28.6 bits (64), Expect = 0.025 Identities = 9/36 (25%), Positives = 14/36 (38%), Gaps = 5/36 (13%) Query: 90 QLRANPVITRNGKRSDVMMNAKH-----QAKANGVE 120 +L P T++G ++ N ANG E Sbjct: 256 KLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGE 291
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 29.3 bits (65), Expect = 0.036 Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 9/132 (6%) Query: 211 GYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLA-----GEGN 265 P L N + A+ +E K YE+GK I+L + + ++ + + Sbjct: 147 RETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSS 206 Query: 266 KAFTSEEFTHFLEELTKQYPIVSIEDGLDESDW---DGFAYQTKVLG-DKIQLVGDDLFV 321 S++F LE K I I++ L E F+Y ++L D+F Sbjct: 207 DLLFSQKFKEKLELNNKSIDINFIKENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFE 266 Query: 322 TNTKILKEGIEK 333 K+ K G EK Sbjct: 267 YMNKLEKGGFEK 278
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 702 bits (1814), Expect = 0.0 Identities = 216/864 (25%), Positives = 369/864 (42%), Gaps = 68/864 (7%) Query: 5 ASPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVN 64 A+ S ++ FN FL + ++++F + PG Y + I +N + V Sbjct: 37 AAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATR-DVT 95 Query: 65 WVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSL-KGMDFQADLG 123 + QG C + +G+ + + + C+ S+ Q D+G Sbjct: 96 FN-TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLL--ADDACVPLTSMIHDATAQLDVG 152 Query: 124 HSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNG 183 L + +PQA+M + PP WD GI +L+YN + ++ G+ N Sbjct: 153 QQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS-HYAYLNL 211 Query: 184 TLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQ 243 G N+GAWRLR + SY+ D + + R + L ++LTLG+ Y Q Sbjct: 212 QSGLNIGAWRLRDNTTWSYNSSDSSSGSKN--KWQHINTWLERDIIPLRSRLTLGDGYTQ 269 Query: 244 SDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGP 303 D+FD N+ GA + SDD MLP RG+AP I GIAR A+V + G +Y + VP GP Sbjct: 270 GDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGP 329 Query: 304 FRIQDLNQ-SVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHH 362 F I D+ SG L VT++E +G TQ F V +SVP L R G RY + G + + Sbjct: 330 FTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQ 389 Query: 363 PITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMP 422 F + G+ GW++YGG Y+A G GK++G +GA++VD+T + + +P Sbjct: 390 QEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLP 449 Query: 423 QDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKT--YHHLN 480 D G S R Y++ +E + + GYR+S + + +D ++ Y+ Sbjct: 450 DD-----SQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIET 504 Query: 481 A-----------------GHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQS-NYN 522 +++ + +T Q + Y S S T+W + + + Sbjct: 505 QDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLYLSGSHQTYWGTSNVDEQFQ 563 Query: 523 LSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWG------------NDSIS 570 L+ F+ + + S + ++ + +D + 
+++++P+ + S S Sbjct: 564 AGLNTAFEDIN---WTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASAS 620 Query: 571 YNGT-FNGSQHRNQLGYSGH--SQNGDNWQLHVG-----QDEQGAQADGYYSHQGALTDI 622 Y+ + + N G G N ++ + G G+ +++G + Sbjct: 621 YSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNA 680 Query: 623 DLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSP 682 ++ + + + L + GG+ G L + T +LV G D V N + Sbjct: 681 NIGYSHSDD-IKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKV-ENQTG 736 Query: 683 TSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGE 742 T+ G AV+ Y + +D N L + + +V + T GAI F G Sbjct: 737 VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGI 796 Query: 743 KMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFW--DGAAQC 800 K++ + PFGA V +E Q G+VAD+G +L+G+ ++V W + A C Sbjct: 797 KLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHC 855 Query: 801 EA--SLPPTFTPELLANALLLPCK 822 A LPP +LL L C+ Sbjct: 856 VANYQLPPESQQQLL-TQLSAECR 878
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 33.5 bits (76), Expect = 2e-04 Identities = 36/144 (25%), Positives = 61/144 (42%), Gaps = 26/144 (18%) Query: 39 PPCTVGGAS---VEFGDVLTTKVGDVSQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGE 95 PPCT+ V+FG++ V + S++C + S L +++ G T + Sbjct: 32 PPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISISCPYK-SGSLWIKVTGNTMGVGQN 90 Query: 96 QVLQTSVQGLGIRIQQ-------------AGNKQLVPVGI-TDWLNFTLSGSNGPELEAV 141 VL T++ GI + Q +GN V G+ T FT + +V Sbjct: 91 NVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRVTAGLDTARSTFTFT--------SV 142 Query: 142 PVKEPTTQLAGGDFNASATLVVDY 165 P + + L GGDF +A++ + Y Sbjct: 143 PFRNGSGILNGGDFRTTASMSMIY 166
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 37.4 bits (86), Expect = 6e-06 Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 20/166 (12%) Query: 5 LILTLLITRFAC-AD-NLTFHGKLINPPACTINNGETLEVSFGSVIIDNIDGVNYLTEIP 62 L ++LL+T A AD + G + PP CTINNG+ + V FG++ +++D N E+ Sbjct: 6 LFISLLLTSVAVLADVQINIRGNVYIPP-CTINNGQNIVVDFGNINPEHVD--NSRGEVT 62 Query: 63 WTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTNVPGLGIELQQNGTVFPPGT------ 116 ++ ++ +L ++ T L TN+ GI L Q + P T Sbjct: 63 KNISISCPYKSGSLWIKVTG-NTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGSG 121 Query: 117 -------SLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDY 155 L S+ T +VP + GDF A++ + Y Sbjct: 122 NGYRVTAGLDTARSTF-TFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 632 bits (1631), Expect = 0.0 Identities = 228/856 (26%), Positives = 380/856 (44%), Gaps = 66/856 (7%) Query: 19 SQATEFNASLLDSGNLSNVDLTAFSREGYVAPGNYILDIWLNDQPVREQYPVRVVPVAGR 78 S FN L + DL+ F + PG Y +DI+LN+ + + V Sbjct: 44 SAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRD-VTFNTGDSE 102 Query: 79 DAAVICVTTDMVAMLGLKDKIIHGLKPVTGIPDGQCLELRSA--DSQVRYSAENQRLTFI 136 V C+T +A +GL + + + D C+ L S D+ + QRL Sbjct: 103 QGIVPCLTRAQLASMGLNTA---SVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLT 159 Query: 137 IPQAWMRYQDPDWVPPSRWSDGVTAGLLDYSLMANRYMPQQGETSTSYSLYGTAGFNLGA 196 IPQA+M + ++PP W G+ AGLL+Y+ N + G S L +G N+GA Sbjct: 160 IPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGA 219 Query: 197 WRLRSDYQYSRFDS-GQGASQSDFYLPQTYLFRALPALRSKLTLGQTYLSSAIFDSFRFA 255 WRLR + +S S S++ + T+L R + LRS+LTLG Y IFD F Sbjct: 220 WRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFR 279 Query: 256 GLTLTSDERMLPPSLQGYAPKISGIANSNAQVTVSQNGRILYQTRVSPGPFELPDLSQ-N 314 G L SD+ MLP S +G+AP I GIA AQVT+ QNG +Y + V PGPF + D+ Sbjct: 280 GAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAG 339 Query: 315 ISGNLDVSVRESDGSVRTWQVNTASVPFIARQGQVRYKVAAGRPLYGGTHNNSTVSPDFL 374 SG+L V+++E+DGS + + V +SVP + R+G RY + AG G N P F Sbjct: 340 NSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG---NAQQEKPRFF 396 Query: 375 LGEATWGAFNNTSLYGGLIASTGDYQSAALGIGQNMGLLGALSADVTRSDARLPHGKKQS 434 G ++YGG + Y++ GIG+NMG LGALS D+T++++ LP + Sbjct: 397 QSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHD 455 Query: 435 GYSYRINYAKTFDKTGSTLAFVGYRFSDRHFLSMPEYLQRRATDGGD------------- 481 G S R Y K+ +++G+ + VGYR+S + + + R Sbjct: 456 GQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKF 515 Query: 482 ------AWHEKQSYTVTYSQSVPVLNMSAALSVSRLNYWNAQ-SNNNYMLSFNKVFSLGD 534 A++++ +T +Q + + LS S YW + + N F Sbjct: 516 TDYYNLAYNKRGKLQLTVTQQLG-RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF---- 570 Query: 535 LQGLSASVSFARNQYTGG-GSQNQVYATISIPWGDSR-----------QVSYSVQKDNRG 582 + ++ ++S++ + G + ++IP+ 
SYS+ D G Sbjct: 571 -EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629 Query: 583 GLQQTVNYSD--FHNPDTTWNISAGHNRYDTGSN-SSFSGSVQSRLPWGQAAADATLQPG 639 + + + ++++ G+ G++ S+ ++ R +G A + Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689 Query: 640 QYRSLGLSWYGSVTATAHGAAFSQSMAGNEPRMMIDTGDVAGVPVNGNSGV-TNRFGVGV 698 + L G V A A+G Q + N+ +++ V +GV T+ G V Sbjct: 690 -IKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAV 746 Query: 699 VSAGSSYRRSDISVDVAALPEDVDVSSSVVSQVLTEGAVGYRKIDASQGAQVLGHIRLAD 758 + + YR + +++D L ++VD+ ++V + V T GA+ + A G ++L + + Sbjct: 747 LPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HN 805 Query: 759 GASPPFGALVVSGKTGRTAGMVGDDGLAYLTGLSGEDRRTLNVSW--DGRVQCRLTLPET 816 PFGA+V S +++G+V D+G YL+G+ + V W + C Sbjct: 806 NKPLPFGAMVTSES-SQSSGIVADNGQVYLSGM--PLAGKVQVKWGEEENAHCVANYQLP 862 Query: 817 VTLSRGPL---LLPCR 829 + L CR Sbjct: 863 PESQQQLLTQLSAECR 878
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 25.8 bits (56), Expect = 0.030 Identities = 8/18 (44%), Positives = 11/18 (61%) Query: 50 RFSPGDSWFVEQGTEVAW 67 RF+ D WF+E E+A Sbjct: 773 RFTHADGWFLEPQAELAV 790
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 27.6 bits (61), Expect = 0.034 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.4 bits (66), Expect = 0.009 Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 16/55 (29%) Query: 4 VLITGATGLVGGHLLRMLINTPQVSAIAAPTRRPLTDIVGV--YNP-HDPQLTDA 55 L+TGA G +G H+ + L+ +VG+ N +D L A Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGH-------------QVVGIDNLNDYYDVSLKQA 44
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 27.1 bits (60), Expect = 0.040 Identities = 14/49 (28%), Positives = 22/49 (44%), Gaps = 1/49 (2%) Query: 11 APGIDALLRRSFESDAEAKLVHDLREDGF-LTLGLVATDDEGQVVGYVA 58 P A++R F++ KL+ L + L G + T + Q G VA Sbjct: 779 VPTRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVA 827
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 73.0 bits (179), Expect = 2e-15 Identities = 69/313 (22%), Positives = 110/313 (35%), Gaps = 77/313 (24%) Query: 398 IMGHVDHGKTSLLDYI-----RSTKVASGEAG-------------GITQHIGAYHVETDN 439 ++ HVD GKT+L + + T++ S + G GIT G + +N Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67 Query: 440 GMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIQHAKAAGVPVVVAV 499 + +DTPGH F + R D +L+++A DGV QT + G+P + + Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127 Query: 500 NKIDKPEADPDRV----KNELSQYGI-----------------LPEEWG----------- 527 NKID+ D V K +LS + E+W Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187 Query: 528 ---------------GESQFV---------HVSAKAGTGIDELLDAILLQAEVLELKAVR 563 ES H SAK GID L++ I + Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245 Query: 564 KGMASGAVIESFLDKGRGPVATVLVREGTLHKGDIVL-CGFEYGRVRAMRNELGQEVLEA 622 + G V + + R +A + + G LH D V E ++ M + E+ + Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305 Query: 623 GPSIPVEILGLSG 635 + EI+ L Sbjct: 306 DKAYSGEIVILQN 318
>SECGEXPORT#Protein-export SecG membrane protein signature. Length = 110 Score = 158 bits (400), Expect = 7e-54 Identities = 107/109 (98%), Positives = 109/109 (100%) Query: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTAVLA 60 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTA+LA Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60 Query: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAQPTSDIP 109 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPA+PTSDIP Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAKPTSDIP 109
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.0 bits (78), Expect = 0.002 Identities = 22/82 (26%), Positives = 31/82 (37%), Gaps = 18/82 (21%) Query: 188 VLMVGPPGTGKTLLAKAI---AGEAKVPFFT-----ISGSDFVEMFVGV------GASRV 233 +++ G GTGK L+A+A+ PF I G GA Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222 Query: 234 RD-MFEQAKKAAPCIIFIDEID 254 FEQA+ +F+DEI Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 69.3 bits (169), Expect = 4e-15 Identities = 34/187 (18%), Positives = 65/187 (34%), Gaps = 32/187 (17%) Query: 90 GLGSGVIIDAAKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGGDDQ 137 + SGV++ K +LTN HV++ L +G ++ + Sbjct: 102 FIASGVVV--GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190 D+A+++ + ++++ + +V G P ++ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSIGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ I N Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW---GGVPNEFNGAVFINEN 269 Query: 250 MAQTLAQ 256 + L Q Sbjct: 270 VRNFLKQ 276
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 53.5 bits (128), Expect = 4e-10 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKI-------NATGGLPTIPINTKRTPHIGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.007 Identities = 16/58 (27%), Positives = 28/58 (48%) Query: 507 ASAPAAAAPAGAGTPVTAPLAGNIWKVIATEGQTVAEGDVLLILEAMKMETEIRAAQA 564 A+A +G + + ++I EG++V +GDVLL L A+ E + Q+ Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS 141 Score = 31.0 bits (70), Expect = 0.016 Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 10/56 (17%) Query: 532 KVIATEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDPLMTL 587 V G+ G EI+ + V+ I VK G++V GD L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 26.7 bits (59), Expect = 0.026 Identities = 6/29 (20%), Positives = 16/29 (55%) Query: 7 FFIIIIALIVVAASFRFVQQRREKAANEA 35 +++I AA ++F++Q+ K ++ Sbjct: 173 ALMVLIQSTSEAARYKFIEQQIGKRVDKT 201
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.2 bits (68), Expect = 0.039 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 3/78 (3%) Query: 336 AEERRAPIERFIDRFSRIYTPVIMVIALLVTLIPPLMFDGGWQEWIYKGLTLLLIGCPCA 395 E++ P E S+I ++ + +L + P+ F GG IY+ ++ ++ A Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS---A 477 Query: 396 LVISTPAAITSGLAAAAR 413 + +S A+ A A Sbjct: 478 MALSVLVALILTPALCAT 495
>PF01206#SirA family protein Length = 76 Score = 101 bits (254), Expect = 2e-32 Identities = 28/72 (38%), Positives = 42/72 (58%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQTGETLLIIADDPATTRDIPGFCTFMEHDLLAQET 68 D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F H+LL Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 EGLPYRYLLRKA 80 E Y + L++A Sbjct: 65 EDGTYHFRLKRA 76
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.1 bits (117), Expect = 2e-08 Identities = 75/399 (18%), Positives = 141/399 (35%), Gaps = 34/399 (8%) Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHD--AMGFSAFWAGLIISLQYFATLLSR 70 ++ N ++ I+ + IGL + VLPG + D G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 71 PHAGRYADVLGPKKIVVFGLCGCFLSGLGYLLADIASAWPMISLLLLGLGRVILGI-GQS 129 P G +D G + +++ L G + + Y + A L +L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPF-----LWVLYIGRIVAGITGAT 112 Query: 130 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLCYAWGGLQGL--ALTVM 187 A G+ + + R + M G LG L + A + Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170 Query: 188 GVALLAVLLALPRPSVK----ANKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIA 238 G+ L LP + P + + +A +A V A Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230 Query: 239 TFITLFYDAK-GWDGAAFALTLFSVAFVGT---RLLFPNGINRLGGLNVAMICFGVEIIG 294 +F + + WD ++L + + + ++ RLG M+ + G Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290 Query: 295 LLLVGTAAMPWMAKIGVLLTGMGFSLVFPALGVVAVKAVPPQNQGAALATYTVFMDMSLG 354 +L+ A WMA ++L + PAL + + V + QG + ++ Sbjct: 291 YILLAFATRGWMAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348 Query: 355 VTGPLAGLVMTWAGVPV----IYLAAAGLVAMALLLTWR 389 + GPL + A + ++A A L + L R Sbjct: 349 IVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 29.6 bits (66), Expect = 0.006 Identities = 25/93 (26%), Positives = 43/93 (46%), Gaps = 6/93 (6%) Query: 30 RWASWLAGRVLLSRALSPL---PEMVYGEQGKPAFSAGTPLWFNLSHSGDTIALLLSDEG 86 R A LAGR+ AL + G++ +P + G L+ ++SH T ++S + Sbjct: 46 RKAEHLAGRIAAVHALREVGVRTVPGMGDKRQPLWPDG--LFGSISHCATTALAVISRQ- 102 Query: 87 EVGCDIEVIRPRDNWRSLANAVFSLGEHAEMEA 119 +G DIE I + LA ++ E ++A Sbjct: 103 RIGIDIEKIMSQHTATELAPSIIDSDERQILQA 135
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 47.2 bits (112), Expect = 3e-08 Identities = 44/171 (25%), Positives = 74/171 (43%), Gaps = 7/171 (4%) Query: 200 REREHGTVEHLLVMPVTPFEIMMAKV-WSMGLVVLVVSGLSLMLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G + +V LG + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG---IGVVAAALGY-TQWLSLL 148 Query: 259 FMLGV-ALSLLATISIGIFMGTIARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQAVQD 317 + L V AL+ LA S+G+ + +A S LV+ P+ LSG P + +P Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGLSIVWPQFLTLLAIGGVFFL-IALLRFR 367 +P +H + L + I+ + + + I FFL ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 77.2 bits (190), Expect = 9e-18 Identities = 70/409 (17%), Positives = 135/409 (33%), Gaps = 82/409 (20%) Query: 4 HLVWWGAGILVAVAAIAWWMLRPAGIPEGFAASNGRI--EATEVDIATKIAGRIDTILVS 61 LV + + V A +L E A +NG++ +I + I+V Sbjct: 58 RLVAY-FIMGFLVIAFILSVLGQV---EIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 62 EGQFVRQGEVLAKMDTRV----------------LQEQRLEAI----------------- 88 EG+ VR+G+VL K+ L++ R + + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 89 -------------------AQIKEAESAVAAARALLEQRQSEMRAAQSVVKQREAELDSV 129 Q ++ L+++++E + + + E Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 130 SKRHVRSRSLSQRGAVSVQQLDDDRAAAESARAALETAKAQVSAAKAAIEAARTSIIQ-- 187 R SL + A++ + + A L K+Q+ ++ I +A+ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 188 -----------AQTRVEAAQATERRIVADID--DSELKAPRDGRV-QYRVAEPGEVLSAG 233 QT T + S ++AP +V Q +V G V++ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 234 GRVLNMVDLSDVY-MTFFLPTEQAGLLKIGGEARLVLDAAPDLRIPATISFVASVAQFTP 292 ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410 Query: 293 KTVETHDERLKLMFRVKARIPPELLRQHLEYV--KTGLPGMAWVRLDER 339 +E D+RL L+F V I L + + +G+ A ++ R Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.2 bits (68), Expect = 0.016 Identities = 19/109 (17%), Positives = 45/109 (41%), Gaps = 10/109 (9%) Query: 226 FAAFSIFATISFYQGSSYLVPY-LSDVYGMTAEHAGIIGMIRAYVLAILIAPVVGLLADK 284 IF T++ G +VPY + DV+ ++ G + + + I+ + G+L D+ Sbjct: 263 LCGGIIFGTVA---GFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR 319 Query: 285 VGS--AIKVMNWLFIAGVIGVAMFLVIPQDPAMVWVLIGTLMIVGSINF 331 G + + + + L + ++ I + ++G ++F Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASFLL----ETTSWFMTIIIVFVLGGLSF 364
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.008 Identities = 14/42 (33%), Positives = 19/42 (45%) Query: 162 VVKEVNRDGEVVWEWRAWEHLNPEDFPIHDIFDRRHWPMING 203 V RDG W+WR W+ P FP H + R ++ G Sbjct: 194 VYSRSQRDGSEAWKWRGWDDPRPLYFPSHRAPESRTVVLVEG 235
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 8e-05 Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 5/52 (9%) Query: 76 VAPDALRHGIGKALL----EYVQQR-FPLLSLEVYQKNQSAVNFYHALGFRI 122 VA D + G+G ALL E+ ++ F L LE N SA +FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 118 bits (297), Expect = 4e-34 Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 11/124 (8%) Query: 104 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVVGYTDSTGSHDLNMRLS 161 + ++V F+ + ATLKP G L + L +V V+GYTD GS N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 162 QQRADSVASSLITQGVDASRIRTSGMGPANPIASNSTAEGK---------AQNRRVEITL 212 ++RA SV LI++G+ A +I GMG +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 213 SPLQ 216 ++ Sbjct: 335 KGIK 338
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 99.5 bits (248), Expect = 3e-26 Identities = 75/348 (21%), Positives = 125/348 (35%), Gaps = 67/348 (19%) Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLNI 47 +VTG AGFIG ++ K L + G ++ +DNL D +++ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 48 ADYMDKEDFLIQIMSGEELGDIEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100 AD + + + G E +F + +Y ++N + Y+ + Sbjct: 62 ADR----EGMTDLF---ASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109 Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158 L C +I LYASS++ YG F + P+++Y +K + Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169 Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218 G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224 Query: 219 AVNL------------WFLESGKSG-------IFNLGTGRAESFQAVADATLAY-HKKGS 258 + W +E+G ++N+G A + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRNA-GYDKPFKTVAEGVTEYMAW 305 +P G T AD L G+ P TV +GV ++ W Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327
>SECA#SecA protein signature. Length = 901 Score = 41.0 bits (96), Expect = 1e-05 Identities = 38/141 (26%), Positives = 60/141 (42%), Gaps = 18/141 (12%) Query: 233 NLSMLALRAGAQRYHAQPLSTNNILKDKLLAALPFKPTGAQARVVAEIERDM-ALDVPMM 291 LS L+ + A+ L +L++ + A A R ++ M DV ++ Sbjct: 37 KLSDEELKGKTAEFRAR-LEKGEVLENLIPEAF------AVVREASKRVFGMRHFDVQLL 89 Query: 292 ---RLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFE 342 L + + G GKTL A L A L A+ GK V ++ + LA++ A N R FE Sbjct: 90 GGMVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFE 148 Query: 343 PLGVEVGWLAGKQKGKARQAQ 363 LG+ VG A++ Sbjct: 149 FLGLTVGINLPGMPAPAKREA 169
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.7 bits (79), Expect = 2e-04 Identities = 27/108 (25%), Positives = 47/108 (43%), Gaps = 3/108 (2%) Query: 25 GYHIEHVENKSQQPGRTFDYQNLAASALDSENGLPQLGINAFGGHVQG-KNKSVDMAQFI 83 GY + S + +F+ NL + +E+ LG G +Q N V + + Sbjct: 793 GYVTCTTDKLSDKALNSFNPTNLRGNVNLTESANFVLGKANLFGTIQSRGNSQVRLTENS 852 Query: 84 HCHLP-DCSRYFAYLSNGHV-VPSIDLTEQEAEYAQYTIDHLNLNSGF 129 H HL + + L+NGH+ + S D + +Y T++ L+ N F Sbjct: 853 HWHLTGNSDVHQLDLANGHIHLNSADNSNNVTKYNTLTVNSLSGNGSF 900
>PERTACTIN#Pertactin signature. Length = 922 Score = 119 bits (300), Expect = 1e-29 Identities = 164/749 (21%), Positives = 289/749 (38%), Gaps = 90/749 (12%) Query: 230 TGDSSEGLRTGQSGSLIRLGDDATIETSGASSTGIYAASSSRTELGNNATITVNGASAHA 289 TG + G+ G+++ L ATI A + G + + Sbjct: 236 TGGRAAGV-AAMDGAIVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGG-FGPLLDGWYG 292 Query: 290 VYATNATVNLGENATISVNSASKAASYSKAPAGLYALSRGAINLAGGAAITMAGDNSSES 349 V +++TV+L A V + A+ +S G+++ G I G Sbjct: 293 VDVSDSTVDL---AQSIVEAPQLGAAIRAGRGARVTVSGGSLSAPHGNVIETGGGARRFP 349 Query: 350 YAISTETGGIVDGS--SGGRFVIDGDIRAAGATAASGTLPQ--------------QNSTI 393 S + + G+ G + T A G Q + + Sbjct: 350 PPASPLSITLQAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATELPPIPGASSGPL 409 Query: 394 KLNMTDNSRWDGASYITSATAGTGVISVQMSDATWNMTSSSTLTDLTLNSGATINFSH-- 451 + + +RW GA+ V S+ + +ATW MT +S + L L S +++F Sbjct: 410 DVALASQARWTGATRA--------VDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPA 461 Query: 452 EDGEPWQTLTINEDYVGNGGKLVFNTVLSDDDSETDRLQVLGNTSGNTFVAVNNIGGAGA 511 E G ++ L ++ G+G +F + D +D+L V+ + SG + V N G A Sbjct: 462 EAGR-FKVLMVDT-LAGSG---LFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEPA 516 Query: 512 QTIEGIEIVNVAGNSNGTFEKASR---IVAGAYDYNMVQKGKNWYLTSYIEPDEPIIPDP 568 + + +V S TF A++ + G Y Y + G + S + P P P Sbjct: 517 -SGNTMLLVQTPRGSAATFTLANKDGKVDIGTYRYRLAANGNGQW--SLVGAKAPPAPKP 573 Query: 569 VDPVIPDPVIPDPVDPDPVDPVIPDPVIPDPVDPEPVDPVIPDPTIPDIGQSDTPPITEH 628 P P P P P P P P P +P P P ++ + + Sbjct: 574 APQPGPQPGPQPPQPPQPPQP----PQPPQPPQRQPEAPAPQPPAGRELSAAANAAVNTG 629 Query: 629 QFRPEVGSYLANNYAANTLFMTRLHDRLGETQYTDMLTGEKKVTSLWMRNVGAHTRFNDG 688 + A + A L RLGE + G W R + ++ Sbjct: 630 GVGLASTLWYAESNA--------LSKRLGELRLNPDAGG------AWGRGFAQRQQLDNR 675 Query: 689 SGQLKTRINSYVLQLGGDLAQWSTDGLDRWHIGAMAGYANSQNRTQSSVSDYHSRGQVTG 748 +G+ + +LG D A + G RWH+G +AGY + D G Sbjct: 676 AGRRFDQ-KVAGFELGADHA-VAVAG-GRWHLGGLAGYTRGD---RGFTGD--GGGHTDS 727 Query: 749 YSVGLYGTWYANNIDRSGAYVDTWMLFNWFDN--KVMGQDQAA--EKYKSKGITASVEAG 804 VG Y T+ AN+ G Y+D + + +N KV G D A KY++ G+ S+EAG Sbjct: 728 
VHVGGYATYIANS----GFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGVSLEAG 783 Query: 805 YSFRLGESVHQSYWLQPKAQVVWMGVQADDNREANGTLVKDDTAGNLLTRMGVKAYINGH 864 F ++L+P+A++ V R ANG V+D+ ++L R+G++ Sbjct: 784 RRFAH----ADGWFLEPQAELAVFRVGGGAYRAANGLRVRDEGGSSVLGRLGLEV----G 835 Query: 865 NAIDDNKSREFQPFVEANWIHNTQPA-SVKMDDVS--SDMRGTKNIGELKVGIEGQITPR 921 I+ R+ QP+++A+ + A +V+ + ++ +++RGT+ EL +G+ + Sbjct: 836 KRIELAGGRQVQPYIKASVLQEFDGAGTVRTNGIAHRTELRGTR--AELGLGMAAALGRG 893 Query: 922 LNVWGNVAQQVGDQGYSNTQGLLGVKYSF 950 +++ + G + G +YS+ Sbjct: 894 HSLYASYEYSKGPKLAMPWTFHAGYRYSW 922
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.0 bits (83), Expect = 2e-04 Identities = 40/208 (19%), Positives = 77/208 (37%), Gaps = 13/208 (6%) Query: 33 ITVEFLPVSLLTP----MAQDLGISEGVAGQSVTVTAFVAMFSSLFITQIIQATDR--RY 86 + ++ + + L+ P + +DL S V + A A+ + +DR R Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 87 IVILFAVLLTA-SCLMVSFANSFTLLLLGRACLGLALGGFWAISASLTMRLVPARTVPKA 145 V+L ++ A +++ A +L +GR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 146 LSVIFGAVSIALVIAAPLGSFLGGIIGWRNVFNAAAVMGVLCVIWVVKSLP-SLPGEPSH 204 + +V LG +GG F AAA + L + LP S GE Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 205 QKQ---NMFSLLQRPGVMAGMIAIFMSF 229 ++ N + + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 26.7 bits (58), Expect = 0.043 Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 70 AEAQVIIEQANKRRAQILDEAKTEAEQERTKIVAQAQAEIEA 111 A+A + ANK R Q EAK +AE++ + A A A Sbjct: 210 AKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYA 251
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 28.3 bits (63), Expect = 0.028 Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 9/90 (10%) Query: 24 SH-ALNYLFADGQLKQGTLVAINAEKLLTAEDNPEVRALIGAAEFKYADGISVVRSIRKK 82 S A Y + + IN E+ T E + + + + V S+ + Sbjct: 204 SEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVP---SLFVESSVDDR 260 Query: 83 FPQAQVSRVAGADLWEAL----MARAGKEG 108 P VS+ ++ + +A GKEG Sbjct: 261 -PMKTVSQDTNIPIYAQIFTDSIAEQGKEG 289
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 179 bits (456), Expect = 7e-51 Identities = 100/448 (22%), Positives = 170/448 (37%), Gaps = 87/448 (19%) Query: 4 NLRNIAIIAHVDHGKTTLVDKLLQQSGTFDARAETQE--RVMDSNDLEKERGITILAKNT 61 + NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAHGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIIYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QAIVDHVPAPDVDLDGPLQMQISQLDYNNYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247 + I + + L ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NAKVGKVLTHLGLERIDSDIAEAGDIIAITGLG-ELN--ISDTICDPQNVEALPALSVDE 304 K+ ++ T + E D A +G+I+ + +LN + DT PQ + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPL 343 Query: 305 PTVSMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGEL 364 P + + + D L LR +S G++ Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKV 394 Query: 365 HLSVLIENMRRE-GFELAVSRPKVIFRE 391 + V ++ + E+ + P VI+ E Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.5 bits (74), Expect = 0.005 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.014 Identities = 12/62 (19%), Positives = 24/62 (38%), Gaps = 14/62 (22%) Query: 134 AWLEDKTNSNLLIEMVIPQADISFSDSLRLGYERGIILMKEIKKIYPDV-VIDMSVNSAA 192 W+ ++ ++V+P + L+ IKK PD+ V+ MS + Sbjct: 41 RWIAAGDGDLVVTDVVMPDEN-------------AFDLLPRIKKARPDLPVLVMSAQNTF 87 Query: 193 SS 194 + Sbjct: 88 MT 89
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 266 bits (682), Expect = 8e-87 Identities = 87/425 (20%), Positives = 175/425 (41%), Gaps = 25/425 (5%) Query: 9 LMMIIISLTILIIILTYFIEINSVVHGQGVITTKDNAQLISLSKGGTIQDIYVAEGDTVK 68 + I+ ++ IL+ ++ V G +T ++ I + +++I V EG++V+ Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119 Query: 69 KGELLAKVVNLDLQKEYQRYRTQKGYLDKDVNEI-------SFILDKENESGLITLDGTR 121 KG++L K+ L E +TQ L + + S L+K E L + Sbjct: 120 KGDVLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQ 177 Query: 122 SLSNKEVKANIELVHSQIRA-------KELKKTSLDSEISGLQEKLSSKEKELALLAEEI 174 ++S +EV L+ Q KEL +E + +++ E + + Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237 Query: 175 NILSPLVKKGISPYTNFLNKKQAYIKVKSEINDIESSITLKKDDIELVVNDIEALNNELR 234 + S L+ K L ++ Y++ +E+ +S + + +I + + + + Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297 Query: 235 LSLSKIISKNLQELEVVNSTLKVIEKQINEEDIYSPVDGVIYKINKSATTHGGVIQAADL 294 + + + + ++ L E++ I +PV + ++ T GGV+ A+ Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQL--KVHTEGGVVTTAET 355 Query: 295 LFEIKPKVRTMLADVKILPKYRDQIYVDEAVKLDVQSIIQPKIKSYNATIDNISPDSYEE 354 L I P+ T+ + K I V + + V++ + + NI+ D+ E+ Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415 Query: 355 NTGGTIQRYYKVIIAFDVNE----DDLRWLKPGMTVDASVITGKHSIMEYLLSPLMKGVD 410 G + VII+ + N + L GM V A + TG S++ YLLSPL + V Sbjct: 416 QRLGL---VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472 Query: 411 KAFSE 415 ++ E Sbjct: 473 ESLRE 477
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.3 bits (117), Expect = 4e-07 Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 30/190 (15%) Query: 96 DSAQVEKKGNGKRRNKKEEEELKKQLDEAENAKK--EADKAK-EEAEKAKEAAEKALNEA 152 A+ + + + L++ LD + AKK EA+ K EE K EA+ ++L Sbjct: 293 LEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352 Query: 153 FEVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQA---KATQASKQNDAEKVLP 208 + +K Q+E Q N + S+AS+Q+ + + +A KQ + Sbjct: 353 LDASREAKKQLEAEHQKLEEQN-------KISEASRQSLRRDLDASREAKKQVEKALEEA 405 Query: 209 QPI-------NKNTSTGK--SNSSKNEEN-KLDAESVKEPLKVTLALAAES----NSGSK 254 NK K + K E KL+AE+ + LK LA AE +G Sbjct: 406 NSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEA--KALKEKLAKQAEELAKLRAGKA 463 Query: 255 DDSITNFTKP 264 DS T KP Sbjct: 464 SDSQTPDAKP 473 Score = 47.4 bits (112), Expect = 1e-06 Identities = 35/136 (25%), Positives = 63/136 (46%), Gaps = 4/136 (2%) Query: 98 AQVEKKGNGKRRNKKEEEELKKQLDEAENAKKEADKAKEEAEKAKEAAEKALNEAFEVQN 157 A+ +K + ++ + L++ LD + AKK+ +KA EEA A EK E E + Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 424 Query: 158 SSKQIEEMLQNFL-ADNVA-KDNLAQQSD--ASQQNTQAKATQASKQNDAEKVLPQPINK 213 +++ + LQ L A+ A K+ LA+Q++ A + +A +Q K +P Sbjct: 425 LTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQA 484 Query: 214 NTSTGKSNSSKNEENK 229 + K N +K + Sbjct: 485 PQAGTKPNQNKAPMKE 500 Score = 43.1 bits (101), Expect = 2e-05 Identities = 17/115 (14%), Positives = 41/115 (35%), Gaps = 19/115 (16%) Query: 101 EKKGNGKRRNKKEEEELKKQLDEAENAKKEAD-------KAKEEAEKAKEAAEKALNEAF 153 ++ ++ + + + + E + + + A ++L Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLRRDL 318 Query: 154 EVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQAK---ATQASKQNDAE 204 + +K Q+E Q N + S+AS+Q+ + + +A KQ +AE Sbjct: 319 DASREAKKQLEAEHQKLEEQN-------KISEASRQSLRRDLDASREAKKQLEAE 366
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.9 bits (67), Expect = 0.003 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 6/65 (9%) Query: 89 LAVDKSLHGQGVARALVRDAGLRVIQVAETIGIRGMLVHALSDE--AREFYQRVGFVPSP 146 +AV K +GV AL+ A I+ A+ G+++ A FY + F+ Sbjct: 95 IAVAKDYRKKGVGTALLHKA----IEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150 Query: 147 MDPMM 151 +D M+ Sbjct: 151 VDTML 155
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 46.2 bits (109), Expect = 1e-08 Identities = 29/189 (15%), Positives = 53/189 (28%), Gaps = 15/189 (7%) Query: 3 REDILGEALKLLETQGIADTTLEMVAERVNRPLDTLQRFWPDKEAILYDALRYLSQQVDI 62 R+ IL AL+L QG++ T+L +A+ + + DK + + + Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 63 WRRQLLLDESLSAEQKLLARYSA-LSECVSNNRYPGCLFIAACTFYPDPTH----PIHQL 117 + L L V+ R + F+ + Q Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL---LMEIIFHKCEFVGEMAVVQQA 129 Query: 118 ANQQKRAAHDFTHELLTTL----EID---DPAMVARQMELVLEGCLSRMLVNRSQADVDT 170 ++D + L + A M + G + L D+ Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189 Query: 171 AQRLAEDIL 179 R IL Sbjct: 190 EARDYVAIL 198
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 31.1 bits (70), Expect = 0.007 Identities = 16/65 (24%), Positives = 26/65 (40%), Gaps = 7/65 (10%) Query: 130 PPPPPPPVVAKRVESAPRPTEPARNPFKSSDDRLTGVTSSNTVTRPAARASAGAGDKVVI 189 P P P P K+VE R +P + S + + RP + + A K V Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFE-------NTAPARPTSSTATAATSKPVT 151 Query: 190 AIDAG 194 ++ +G Sbjct: 152 SVASG 156
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 30.1 bits (68), Expect = 0.027 Identities = 26/161 (16%), Positives = 57/161 (35%), Gaps = 18/161 (11%) Query: 31 VENSLDAGATRVDIDIER---GGAKLIR-IRDNGCGIKKEELALALARHATSKIASLDDL 86 ++ SLD A + ++ I R A++ ++ N G E + A+ + +L++ Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEA 64 Query: 87 EAIISLGFRGEAL----------ASISSVSRLTLTSRTAEQAEAWQAYAEGRDMDVTVK- 135 + G++G L I RLT + Q +A Q +D+ +K Sbjct: 65 ITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKV 124 Query: 136 -PAAHPVGTTLEVLDLFYNTPARRKFMRTEK--TEFNHIDE 173 + +G + + + + + F + Sbjct: 125 NSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEH 165
>SECA#SecA protein signature. Length = 901 Score = 33.3 bits (76), Expect = 0.002 Identities = 26/144 (18%), Positives = 55/144 (38%), Gaps = 6/144 (4%) Query: 282 HVVDAADVRVQENIEAVNTVLEEIDAHEIPTLMVMNKIDMLDDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P + ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQSGVGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIEY 424 +R I R +++P EY Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.0 bits (64), Expect = 0.030 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 3/65 (4%) Query: 225 NRMRAEREAVARRHRSQGQEEAEKLRAAADYEVTK---TLAEAERQGRIMRGEGDAEAAK 281 N+ R + A A+R + + +RAA Y + +A A +G I +G A A+ Sbjct: 220 NKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQ 279 Query: 282 LFADA 286 +DA Sbjct: 280 AISDA 284
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.013 Identities = 12/55 (21%), Positives = 26/55 (47%), Gaps = 1/55 (1%) Query: 165 VVPDDSRLSFDILIPPEDVMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218 +VP+D L L+ +D+ +G ++++ P R + GK+ + D + Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 140 bits (353), Expect = 1e-42 Identities = 85/256 (33%), Positives = 133/256 (51%), Gaps = 8/256 (3%) Query: 7 LAGKNILITGAAQGIGYLLATGLGRYGARIIVNDITPERAETAVTKLQQEGIKAIAAPFN 66 + GK ITGAAQGIG +A L GA I D PE+ E V+ L+ E A A P + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTHKQDIEAAVEHIEKDIGVIDVLINNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQ 126 V I+ IE+++G ID+L+N AG+ R ++EW +VN T VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 AVTRRMVARQAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGI 186 +V++ M+ R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 187 APGYFKTEMTKALVEDE--------AFTSWLCKRTPAARWGDPQELIGAAVFLSSKASDF 238 +PG +T+M +L DE P + P ++ A +FL S + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 239 VNGHLLFVDGGMLVAV 254 + H L VDGG + V Sbjct: 246 ITMHNLCVDGGATLGV 261
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 26.3 bits (57), Expect = 0.046 Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 1/60 (1%) Query: 14 MVAQAGASHEAPVSNVAGYANPVWATTSEIGVSSGSSHMQTLEVATMATTLT-TSHSQFV 72 M A G+ V + PV S + QT+E+AT +TL+ T SQ + Sbjct: 118 MQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLI 177
>PF05775#Enterobacteria AfaD invasin protein Length = 142 Score = 30.3 bits (68), Expect = 0.015 Identities = 12/41 (29%), Positives = 18/41 (43%) Query: 174 VNVQLINSDGLKRTLKDGAVKGTCHIIGGQKQAGKRLWIAE 214 ++ L+N + L DG T II +G R+WI Sbjct: 24 ADITLMNHKYMGNLLHDGVKLATGRIICQDTHSGFRVWINA 64
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 724 bits (1870), Expect = 0.0 Identities = 261/859 (30%), Positives = 422/859 (49%), Gaps = 58/859 (6%) Query: 6 ITLFVLTSVFHSGNVFSRQYNFDYGSLSLPPGENASFLSVE----TLPGNYVVDVYLNNQ 61 + LFV + + S + F+ L+ P A E PG Y VD+YLNN Sbjct: 28 VRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNG 87 Query: 62 LKETTELYFKS--MTQTLEPCLTKEKLIKYGIAIQELHGLQF-DNEQCVLLEHSP--LKY 116 T ++ F + Q + PCLT+ +L G+ + G+ ++ CV L Sbjct: 88 YMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATA 147 Query: 117 TYNAANQSLLLNAPSKILSPIDSEIADENIWDDGINAFLLNYRANYLHS--KVGGE-DSY 173 + Q L L P +S +WD GINA LLNY + ++GG Sbjct: 148 QLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYA 207 Query: 174 FGQIQPGFNFGPWRLRNLSSW------QNLSSEKKFESAYIYAERGLKKIKSKLTVGDKY 227 + +Q G N G WRLR+ ++W + S+ K++ + ER + ++S+LT+GD Y Sbjct: 208 YLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGY 267 Query: 228 TSADLFDSVPFRGFSLNKDESMIPFSQRIYYPTIRGIAKTNATVEVRQNGYLIYSTSVPP 287 T D+FD + FRG L D++M+P SQR + P I GIA+ A V ++QNGY IY+++VPP Sbjct: 268 TQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPP 327 Query: 288 GQFEIGREQIADLGVGVGVLDVSIYEKNGQVQNYTVPYSTPVLSLPDGYSKYSVTIGRYR 347 G F I A G L V+I E +G Q +TVPYS+ L +G+++YS+T G YR Sbjct: 328 GPFTINDIYAAG---NSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384 Query: 348 EVNNDYIDPVFFEGTYIYGLPYGFTLFGGVQWANIYNSYAIGASKDIGEYGALSFDWKTS 407 N P FF+ T ++GLP G+T++GG Q A+ Y ++ G K++G GALS D + Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444 Query: 408 VSKT-DTSNENGHAYGIRYNKNIAQTNTEVSLASHYYYSKNYRTFSEAIHSSEHDEF--- 463 S D S +G + YNK++ ++ T + L + Y + Y F++ +S + Sbjct: 445 NSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIET 504 Query: 464 ----------------YDKNKKSTTSMLLSQALGSLGSVNLSYNYDKYWKHEGK-KSIIA 506 NK+ + ++Q LG ++ LS ++ YW + A Sbjct: 505 QDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQA 564 Query: 507 SYGKNLNGVSLSLSYTKSTSKISEENEDLFSFLLSVPLQKLTNHE-------MYATYQNS 559 ++ 
+LSY+ + + + + + + +++P + A+Y S Sbjct: 565 GLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMS 624 Query: 560 SSSKHDMNHDLGITGVAF-DSQLTWQARGQIE--DKSKNQKATFLNASWRGTYGEIGANY 616 M + G+ G D+ L++ + + + ++RG YG Y Sbjct: 625 HDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY 684 Query: 617 SHNEINRDIGMNVSGGVIAHSSGITFGQSISDTAALVEAKGVSGAKVLGLPGVRTDFRGY 676 SH++ + + VSGGV+AH++G+T GQ ++DT LV+A G AKV GVRTD+RGY Sbjct: 685 SHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGY 744 Query: 677 TISSYLTPYMNNFISIDPTTLPINTDIRQTDIQVVPTEGAIVKAVYKTSVGTNALIRITR 736 + Y T Y N +++D TL N D+ VVPT GAIV+A +K VG L+ +T Sbjct: 745 AVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTH 804 Query: 737 TNGKPLALGTVLSLKNNDGVIQSTSIVGEDGQAYVSGLSGVQKLIASWGNKPSDTCTVFY 796 N KPL G +++ +++ QS+ IV ++GQ Y+SG+ K+ WG + + C Y Sbjct: 805 -NNKPLPFGAMVTSESS----QSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859 Query: 797 SLPDKNKGQ-ISFLNGVCK 814 LP +++ Q ++ L+ C+ Sbjct: 860 QLPPESQQQLLTQLSAECR 878
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 30.7 bits (69), Expect = 0.030 Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 773 ADEIWGYLLGEREKYVFTGEWYDGLFGLEENEEFNDAFWDDVRYIK---DQINKELENQK 829 AD+ W + ++EK++ E + EE + N+ + ++ K + K + + K Sbjct: 344 ADKKWSHFGTQKEKWIGVAE--NHFSNTEEQAKINNKIKEAIKMFKELPEDFVKYINSDK 401 Query: 830 A 830 A Sbjct: 402 A 402
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 28.8 bits (64), Expect = 0.007 Identities = 12/32 (37%), Positives = 19/32 (59%) Query: 8 NSAILVHFTLKLDDGSTAESTRNNGKPALFRL 39 + + V +T L DG+ +ST GKPA F++ Sbjct: 144 SDTVTVEYTGTLIDGTVFDSTEKAGKPATFQV 175
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.033 Identities = 22/113 (19%), Positives = 46/113 (40%), Gaps = 12/113 (10%) Query: 199 WIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLV-FNGTLPWSDFFW 257 I W+ G I+ ++ +I + V + I L+ F T P + F Sbjct: 61 SFIKRQGWLKLNMG-QIILRVLPACVVIGM----VWFVANTSIWRLLAFINTKPVA-FTL 114 Query: 258 PFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAE 310 P AL ++ N+ TF+++L+ K ++A + ++ ++A+ Sbjct: 115 PLAL-SIIFNVVVVTFMWSLLYFGWHFF----KNYKQAEIDQWKMASMAQEAQ 162
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 398 bits (1024), Expect = e-144 Identities = 237/251 (94%), Positives = 248/251 (98%) Query: 1 MKLRLSALALGTTLLVGCASSGTEQQGRSDPFEGFNRTMYNFNFNVLDPYVVRPVAVAWR 60 MKLRLSALALGTTLLVGCASSGT+QQGRSDP EGFNRTMYNFNFNVLDPY+VRPVAVAWR Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 Query: 61 DYVPQPARNGLSNFTGNLEEPAIMVNYFLQGDPYQGMVHFTRFFLNTLLGMGGFIDVAGM 120 DYVPQPARNGLSNFTGNLEEPA+MVNYFLQGDPYQGMVHFTRFFLNT+LGMGGFIDVAGM Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 Query: 121 ANPKLQRVEPHRFGSTLGHYGVGYGPYMQLPFYGSFTLREDGGDMADTLYPVLSWLTWPM 180 ANPKLQR EPHRFGSTLGHYGVGYGPY+QLPFYGSFTLR+DGGDMAD LYPVLSWLTWPM Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180 Query: 181 SIGKWTIEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGKLKPQENPNAQA 240 S+GKWT+EGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGG+LKPQENPNAQA Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 Query: 241 IQDELKEIDSE 251 IQD+LK+IDSE Sbjct: 241 IQDDLKDIDSE 251
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 28.7 bits (64), Expect = 0.026 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 6/63 (9%) Query: 204 VRNIVGS-LLEVGAHNQPESWIAELLAARDRTLAAATAKAEGLYLVAVDYPDRFDLPKPP 262 +NI GS ++ + PES + E AA LAA +K++ V VDY + ++ KP Sbjct: 496 TKNIPGSHVIVKNIMDIPESTLLE--AAN---LAAYYSKSQNSSNVPVDYTEVKNVKKPN 550 Query: 263 MGP 265 Sbjct: 551 GAK 553
>PERTACTIN#Pertactin signature. Length = 922 Score = 28.5 bits (63), Expect = 0.023 Identities = 16/50 (32%), Positives = 18/50 (36%) Query: 105 SKPKPVEKPKPQPKPQQPVVAASTPTPAPQPVADDKPAPTGKAYVVQLGA 154 +K P KP PQP PQ P P P P +A Q A Sbjct: 565 AKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPA 614 Score = 27.8 bits (61), Expect = 0.049 Identities = 19/60 (31%), Positives = 22/60 (36%), Gaps = 4/60 (6%) Query: 99 PIPAETSKPKPVEKPKPQPKPQQPVVAASTPTPAPQPVADDKPAPTGKAYVVQLGALKNA 158 P P +P P P+P PQ P P QP A P G+ L A NA Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE----LSAAANA 624
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 33.6 bits (76), Expect = 0.002 Identities = 13/37 (35%), Positives = 24/37 (64%), Gaps = 2/37 (5%) Query: 469 KDVDQQYLDFLDSLRND-DAKAVLFQNEM-ENLEMHN 503 K +D ++L+ + SL +D D+ +LF + E LE++N Sbjct: 186 KSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNN 222
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 348 bits (894), Expect = e-118 Identities = 121/371 (32%), Positives = 185/371 (49%), Gaps = 24/371 (6%) Query: 122 NMSGVRRLQEQVVELNQLLYADHHE---KHHAIITENPEMLSNIAKAKRLAASNIPVTIV 178 +++ + + + + + + + ++ + M RL +++ + I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 179 GETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENS-QGY 237 GE+GTGKEL +R +H KR N PF+A+N A+P LIES LFG +GA+TGA+ G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 238 LELANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKL 297 E A GGTLFLDE+ MP++ Q++LLR LQ + +GG+ + SDVRIVAA N+ + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 298 IQQERLRADLFYRLSVGMLTLPPLRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARAD 357 I Q R DL+YRL+V L LPPLR R EDIP L +F+ + + D+ + A Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 358 LLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIF-------------------EQDELN 398 + H WPGNVR LEN + R + +D + + II ++ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 399 LGVPETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTT 458 V E + G + +A E LI AL +GN AA L ++R T Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 459 LQYKVQKYAIR 469 L+ K+++ + Sbjct: 466 LRKKIRELGVS 476
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 31.7 bits (72), Expect = 0.006 Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 20/133 (15%) Query: 87 VLKAIRDAGICAEANSQYEVRKCLEIGFRGDQIVFNGVVKKPADLEYAIANDLYLINVDS 146 + AI A N + E E G++G ++ G DLE + L Sbjct: 46 IWSAIGATDGFALLNLE-EAITLRERGWKGPILMLEGFFH-AQDLEIYDQHRLTT----C 99 Query: 147 LYELEHIDAIS-RKLKKVANVCVRVEPNVPSATHAELVTAFHAKSGLDLEQAEETCRRIL 205 ++ + A+ +LK ++ ++V + + + G ++ +++ Sbjct: 100 VHSNWQLKALQNARLKAPLDIYLKVN------------SGMN-RLGFQPDRVLTVWQQLR 146 Query: 206 AMPYVHLRGLHMH 218 AM V L H Sbjct: 147 AMANVGEMTLMSH 159
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.5 bits (121), Expect = 5e-09 Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 20/129 (15%) Query: 132 AMMVHIRHTAHSQ-LPEAITQAVIGRPINFQGLGGDDANRQAQGILERAAKRAGFQEVVF 190 M+ H HS + ++ P+ R+A + +A+ AG +EV Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGA-----TQVERRA---IRESAQGAGAREVFL 140 Query: 191 QYEPVAAGLDYEATLREEKRVLVVDIGGGTTDCSMLLMGPQWRQRADRENSLLGHSGCRV 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S R+ Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 34.3 bits (79), Expect = 7e-04 Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 12/81 (14%) Query: 377 ALDQPLARILEQVQLALDSAQEKPDV--------IYLTGGSARSPLIKKALSEQLPGIPV 428 AL +PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIPV Sbjct: 259 ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPV 315 Query: 429 AGGDD-FGSVTAGLARWAEVV 448 +D V G + E++ Sbjct: 316 VVAEDPLTCVARGGGKALEMI 336
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 26.8 bits (59), Expect = 0.023 Identities = 8/49 (16%), Positives = 25/49 (51%) Query: 16 RLFRRKNKLQREIQDIEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGI 64 +++ NKL++ + +++ N++ + L ++ I + + I+ I Sbjct: 385 SYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTNINNADNYDEIEEI 433
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 366 bits (940), Expect = e-132 Identities = 192/235 (81%), Positives = 209/235 (88%), Gaps = 7/235 (2%) Query: 1 MSNELPWQVWTPDDLAPPPETFVPVEAGNVTLTDDTPEPELTAEQQLEQELAQLKIQAHE 60 MS+ LPW+ WTPDDLAPP FVP+ T+ ++ AE LEQ+LAQL++QAHE Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEE-------AEPSLEQQLAQLQMQAHE 53 Query: 61 QGYNAGLAEGRQKGHAQGYQEGLAQGLEQGQAQAQTQQAPIHARMQQLVSEFQNTLDALD 120 QGY AG+AEGRQ+GH QGYQEGLAQGLEQG A+A++QQAPIHARMQQLVSEFQ TLDALD Sbjct: 54 QGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALD 113 Query: 121 SVIASRLMQMALEAARQVIGQTPAVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 180 SVIASRLMQMALEAARQVIGQTP VDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV Sbjct: 114 SVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 173 Query: 181 EEMLGATLSLHGWRLRGDPTLHHGGCKVSADEGDLDASVATRWQELCRLAAPGVL 235 ++MLGATLSLHGWRLRGDPTLH GGCKVSADEGDLDASVATRWQELCRLAAPGV+ Sbjct: 174 DDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 339 bits (870), Expect = e-118 Identities = 114/329 (34%), Positives = 196/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLSGTDKSVILLMTIGEDRAAEVFKHLSTREVQALSTAMANVRQISNKQLTDVLSEFE 60 +S L+G K+ ILL++IG + +++VFK+LS E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANEYLRSVLVKALGEERASSLLEDILETRDTTSGIETLNFMEPQSAAD 120 + + +Y R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRSQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEPPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 783 bits (2022), Expect = 0.0 Identities = 554/559 (99%), Positives = 559/559 (100%) Query: 5 ASTASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 64 ++TASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ Sbjct: 1 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 60 Query: 65 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 124 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF Sbjct: 61 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 120 Query: 125 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE 184 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE Sbjct: 121 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE 180 Query: 185 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV 244 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV Sbjct: 181 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV 240 Query: 245 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS 304 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS Sbjct: 241 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS 300 Query: 305 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNNAGPRNTQRN 364 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSN+AGPR+TQRN Sbjct: 301 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRN 360 Query: 365 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG 424 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG Sbjct: 361 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG 420 Query: 425 FSDKRGDTLNVVNSPFSAVDDTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 484 FSDKRGDTLNVVNSPFSAVD+TGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR Sbjct: 421 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 480 Query: 485 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 544 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD Sbjct: 481 
PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 540 Query: 545 NDPRVVALVIRQWMSNDHE 563 NDPRVVALVIRQWMSNDHE Sbjct: 541 NDPRVVALVIRQWMSNDHE 559
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 114 bits (286), Expect = 1e-36 Identities = 90/103 (87%), Positives = 96/103 (93%) Query: 2 AAIQGIEGVISQLQATAMAARGQDTHSQSTVSFAGQLHAALDRISDRQTAARVQAEKFTL 61 +AIQGIEGVISQLQATAM+AR Q++ Q T+SFAGQLHAALDRISD QTAAR QAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGIALNDVMADMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPG+ALNDVM DMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 6e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.4 bits (66), Expect = 0.027 Identities = 10/53 (18%), Positives = 17/53 (32%), Gaps = 2/53 (3%) Query: 184 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFIGMIGWALLT 234 R L R + + + A L + P R R M + ++L Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG 78
>PF05844#YopD protein Length = 295 Score = 31.9 bits (72), Expect = 0.002 Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQQGMFSLERDIEN 103 ++LL +L+R+ K+R+ G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 42.2 bits (99), Expect = 1e-06 Identities = 25/118 (21%), Positives = 46/118 (38%), Gaps = 11/118 (9%) Query: 162 FKTGSAEVEPYMRDILRAIAPVL---NGIPNRISLAGHTDDFPYANGEKGYSNWELSADR 218 F A ++P + L + L + + + G+TD G Y N LS R Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI----GSDAY-NQGLSERR 277 Query: 219 ANASRRELVAGGLDNGKVLRVVGMAATMRLSDRGPDDAINRR--ISLLVLNKQAEQAI 274 A + L++ G+ K+ GM + ++ D+ R I L +++ E + Sbjct: 278 AQSVVDYLISKGIPADKI-SARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>PF06580#Sensor histidine kinase Length = 349 Score = 41.8 bits (98), Expect = 7e-06 Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 378 ELDKSLIERIIDPLT--HLVRNSLDHGIEMPEKRLEAGKNVVGNLILSAEHQGGNICIEV 435 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 436 TDDGAGLNRERILAKAMSQGMAVNENMTDDEVGMLIFAPGFSTAEQVTDVSGRGVGMDVV 495 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 496 KRNIQEMGG---HVEIQSKQGSGTTIRILLP 523 + +Q + G +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.6 bits (160), Expect = 7e-14 Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 6/142 (4%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAARARIAAHKPM 141 +AE R ++ + M Sbjct: 116 GRALAEPKRRPSKLEDDSQDGM 137
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.7 bits (220), Expect = 7e-24 Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGFGFIISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG +++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADSAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 419 bits (1080), Expect = e-149 Identities = 101/351 (28%), Positives = 179/351 (50%), Gaps = 14/351 (3%) Query: 7 DDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVCIIWFGGESLARQLAGMLSAGLH 66 +KTE PTP ++ AR++GQ+ +S+E+ S +++ ++ + + ++ + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--IP 60 Query: 67 FDHRMVNDPNLILGQIILLIKAAMMALLPLIAGVVLVALISPVMLGGLIFSGKSLQPKFS 126 + + + + ++ PL+ L+A+ S V+ G + SG++++P Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 127 KLNPLPGIKRMFSAQTGAELLKAVLKSTLVGCVTGFYLWHHWPQMMRLMAESPIVAMGNA 186 K+NP+ G KR+FS ++ E LK++LK L+ + + + +++L P + Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQL----PTCGIECI 176 Query: 187 LDLVGLCALLVVLGVIPMVGF------DVFFQIFSHLKKLRMSRQDIRDEFKESEGDPHV 240 L+G +L L VI VGF D F+ + ++K+L+MS+ +I+ E+KE EG P + Sbjct: 177 TPLLG--QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 241 KGKIRQMQRAAAQRRMMEDVPKADVIVTNPTHYSVALQYDENKMSAPKVVAKGAGLIALR 300 K K RQ + R M E+V ++ V+V NPTH ++ + Y + P V K Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 301 IREIGAEHRVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWIWQLK 351 +R+I E VP L+ PLARALY A + IP + A AEVL W+ + Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>PF03309#Bvg accessory factor Length = 271 Score = 27.8 bits (62), Expect = 0.046 Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 2/84 (2%) Query: 84 LKQTFGV--KVITDVHEASQVQPVADVVDVIQLPAFLARQTDLVEAMAKTGAVINVKKPQ 141 + G + +T S V V V V+ + L+E +TG + V P+ Sbjct: 47 IDGLIGDDAERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPK 106 Query: 142 FVSPGQMGNIVDKFHEGGNDKVIL 165 V ++ N + +H+ G +++ Sbjct: 107 EVGADRIVNCLAAYHKYGTAAIVV 130
>INTIMIN#Intimin signature. Length = 939 Score = 246 bits (629), Expect = 1e-74 Identities = 126/444 (28%), Positives = 216/444 (48%), Gaps = 24/444 (5%) Query: 22 SFSLSLLLLAASGTIRAQAQDPFTQNRL----PDLGMMPESHEGEKHFAEMAKAFGEASM 77 F S L L S + A N+L PD+ + + ++A A + + Sbjct: 118 PFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQL 177 Query: 78 KNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSW 137 ++ L+ G+ A+ A G + Q + QL++WL +G+A V++ N F+GS + Sbjct: 178 QSRSLN-GDYAKDTALG----IAGNQASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDF 230 Query: 138 FIPLQDKQRYLTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYDNLLDENLQRAG 197 +P D ++ L + Q+G +N+G GQR+ +LGYN F D + R G Sbjct: 231 LLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLG 290 Query: 198 FGAEAWGEYLRLSANYYQPFADWQT--HTATLEQRMARGYDINAQMRLPFYQHINTSVSL 255 G E W +Y + S N Y + W + ++R A G+DI LP Y + + Sbjct: 291 IGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMY 350 Query: 256 EQYFGDSVDLFDSGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYR 315 EQY+GD+V LF+S NP A +G+NYTP+PL+TM ++ G + + Y+ Sbjct: 351 EQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQ 410 Query: 316 FGVPLKKQLAASEVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLATPPWDLTPGET 375 F P +Q+ V + ++L GSRYD QRN+ +EY+++ L++ + + T T Sbjct: 411 FDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERST 469 Query: 376 VALKLQVRSVHGIRHLSWQGDTQALSLTAG----TDTRSTEGWTIIMPAWDHREGAANRW 431 ++L V+S +G+ + W D AL G + ++S + + I+PA+ +G +N + Sbjct: 470 QKIQLIVKSKYGLDRIVW--DDSALRSQGGQIQHSGSQSAQDYQAILPAY--VQGGSNVY 525 Query: 432 RLSVVVEDEKGQRVSSNEITLALT 455 +++ D G SSN + L +T Sbjct: 526 KVTARAYDRNGN--SSNNVLLTIT 547
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.2 bits (177), Expect = 6e-17 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLVSMAPDISVVGEASNGEQGIDLAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A V SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKALSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 51.4 bits (123), Expect = 4e-09 Identities = 30/123 (24%), Positives = 54/123 (43%), Gaps = 17/123 (13%) Query: 473 SARFGFTVKLDYQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SHADDVVVTV 523 S +F ++ + Q+ P + VP L+Q E N +KH +++ Sbjct: 233 SIQFEDRLQFENQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKG 285 Query: 524 TQCGKQVKLKVQDNGCGVPENAERSNHYGMIIMRDRAQSLRG-DCQVRRRETGGTEVTVT 582 T+ V L+V++ G +N + S G+ +R+R Q L G + Q++ E G + Sbjct: 286 TKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 583 FIP 585 IP Sbjct: 346 LIP 348
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.025 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 128 TPFSTFIIISLLCGFAGANF-ASSMANISFFFPKQKQGGALGLNGGLGNMGVSVMQLV 184 + FS I+ + G A F A M ++ + PK+ +G A GL G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.1 bits (96), Expect = 7e-06 Identities = 17/48 (35%), Positives = 29/48 (60%) Query: 356 LTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 403 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 37.2 bits (86), Expect = 9e-05 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 2 SFSQAVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 + A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.5 bits (63), Expect = 0.010 Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 2/34 (5%) Query: 44 LKNQDPTNPLQNNELTTQLAQISTVSGIEKLNTT 77 L N+ P N L NN L TQL + V G E+L T+ Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.011 Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSNFSFGIGNAAGLLFAGIML-GFLRANHPTFG-YIPQ--GALNMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGSGISNGLGAVGGQM--LIAGLVVSLVPVVICFLF 493 M G G+ AG + +G A + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.3 bits (112), Expect = 6e-09 Identities = 17/80 (21%), Positives = 33/80 (41%) Query: 7 RRANDPKRREKIIQATLEAVKTYGVHAVTHRKIAAIAQVPLGSMTYYFAGMDALLSEAFT 66 + + R+ I+ L GV + + +IA A V G++ ++F L SE + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 67 LFTENMSRQYQDFFAQVTDA 86 L N+ ++ A+ Sbjct: 65 LSESNIGELELEYQAKFPGD 84
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.9 bits (75), Expect = 0.002 Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 200 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 257 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 258 DRYSRVTVVR-ASALM--GALGIGLIIFVDSDWVA-GVSVILWGLGASLGFPLTISAASD 313 DR + V+ + L ++ S ++ + +L GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 314 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 343 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 42.9 bits (101), Expect = 2e-06 Identities = 66/356 (18%), Positives = 126/356 (35%), Gaps = 51/356 (14%) Query: 48 QAGLDWVPTSMTAYLAGGMFLQWLLGPLSDRIGRRPVMLAGVVWFIVTCLATLLAKNIEQ 107 A +WV T+ + G + G LSD++G + ++L G++ + + + Sbjct: 48 PASTNWVNTAFMLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104 Query: 108 FT-FLRFLQGISLCFIGAVGYAAIQESFEEAVCIKITALMANVALIAPLLGPLVGAAWVH 166 RF+QG A+ + + K L+ ++ + +GP +G H Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164 Query: 167 VLPWEGMFILFAALAAIAFFGLQCAMPETATRRGE------------------------- 201 + W +L + I L + + +G Sbjct: 165 YIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSI 223 Query: 202 ------TLSFKALGRDYRLV---------IKNRRFVAGALALGFVSLPLLAWIAQSPIII 246 LSF + R V KN F+ G L G + + +++ P ++ Sbjct: 224 SFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283 Query: 247 ISGEQLSSYEYG-LLQVPVFGALIAGNLVLARLTSRRTVRSLIVMGGWPIVAGLIIAAAA 305 QLS+ E G ++ P ++I + L RR ++ +G + + A+ Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-- 341 Query: 306 TVVSSHAYLWMTAGLSVYAFGIGLANAGLVRLTLFSSDMSKGTVSAAMGMLQMLIF 361 + +MT + V+ G GL+ V T+ SS + + A M +L F Sbjct: 342 -FLLETTSWFMTIII-VFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 46.7 bits (111), Expect = 5e-08 Identities = 49/207 (23%), Positives = 78/207 (37%), Gaps = 25/207 (12%) Query: 1 MTQYASSLRSLAAGSVLLFLFASPVKAEEQTIAPPGVDAR-AWILMDYASGKVLAEGNAD 59 M + SL A ++ L + ASP E+ ++ + R I MD ASG+ L AD Sbjct: 1 MRYIRLCIISLLA-TLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRAD 59 Query: 60 EKLDPASLTKIMTSYVVGQALKAGKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQV 119 E+ S K++ V + AG +L + + +P V D + Sbjct: 60 ERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGM 113 Query: 120 SVADLNKGIIIQSGNDACIALADYVAGSQESFIGLMNAYAKRLGLTNTT---FQTVHGLD 176 +V +L I S N A L V G + A+ +++G T ++T Sbjct: 114 TVGELCAAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEA 168 Query: 177 APGQF---STARDMA------LLGKAL 194 PG +T MA L + L Sbjct: 169 LPGDARDTTTPASMAATLRKLLTSQRL 195
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.023 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 32/224 (14%), Positives = 72/224 (32%), Gaps = 25/224 (11%) Query: 6 TTTKGEQAKSQLIAAALAQFGEYGLHATT-RDIAALAGQNIAAITYYFGSKEDLYLACAQ 64 T + ++ + ++ AL F + G+ +T+ +IA AG AI ++F K DL+ + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 65 WIADFLGEKFRPHAEKAERLFSQPAPD-RDAIRELILLACKNMIMLLTQEDTVNLSKFIS 123 +GE E + P R+ + ++ L E + +F+ Sbjct: 65 LSESNIGELEL---EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV- 120 Query: 124 REQLSPTSAYQLVHEQVIDPLHTHLTRLVAA---YTGCDANDTRMILHTHALLGEVLAFR 180 E A + + + D + L + A +I+ + ++ Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM--- 175 Query: 181 LGKETILLRTGWPQFDEEKAELIYQTVTCHIDLILHGLTQRSLD 224 W + + ++ ++L Sbjct: 176 ---------ENWLFAPQSFDLK--KEARDYVAILLEMYLLCPTL 208
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 61.0 bits (148), Expect = 2e-12 Identities = 48/286 (16%), Positives = 104/286 (36%), Gaps = 28/286 (9%) Query: 55 ASLNVDEGDAIKAGQVLGELDHAPYENALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAA 114 NV E + L + + ++N Q + + +A+ +LA E Sbjct: 175 YFQNVSEEEV-LRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 115 AVRQAQAAYDYAQNFYNRQQGLWKSRTISA--NDLENARSSRDQAQATLKSAQDKLSQYR 172 R + + + L + N+L +S +Q ++ + SA+++ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL-V 292 Query: 173 TGNREQDI----AQAKASLEQAKAQLAQAQLDLQDTTLIAPANGTLLTRAV-EPGSMLNA 227 T + +I Q ++ +LA+ + Q + + AP + + V G ++ Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352 Query: 228 GSTVLTLSLT-RPVWVRAYVDERNLSQTQPGRDILLYTDGRPDKPYH---GKIGFVSPTA 283 T++ + + V A V +++ G++ ++ + P Y GK+ ++ A Sbjct: 353 AETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA 412 Query: 284 EFTPKTVETPDLRTDLVYRLRIIVT-------DADDALRQGMPVTV 322 D R LV+ + I + + + L GM VT Sbjct: 413 --------IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTA 450
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.009 Identities = 22/89 (24%), Positives = 29/89 (32%), Gaps = 21/89 (23%) Query: 294 PRFEDAFIDLLGGAGTSESPLGSILHTVEGTAGETVIEAQELTKKFGDFAATDHVNFVVQ 353 PR E + +LG P + Q + K HV V++ Sbjct: 548 PRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVME 590 Query: 354 RGEIFG----LLGPNGAGKSTTFKMMCGL 378 G F L G G GKST + GL Sbjct: 591 PGCKFDYSVVLEGTGGIGKSTLINTLVGL 619
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 45.7 bits (108), Expect = 1e-07 Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 5/139 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFVGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYL 333 P+ H D+ + I L Sbjct: 209 AARFLPLSHSIDLIRPIML 227
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 45.1 bits (106), Expect = 9e-07 Identities = 52/275 (18%), Positives = 85/275 (30%), Gaps = 34/275 (12%) Query: 366 PEPETPRQSFAPVAPTAVMTPP--QVQQPSAP-----------APQTSPAPLPASTSQVL 412 PE E Q V T + TP Q PS P AP PAP S + Sbjct: 983 PEVEKRNQ---TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 413 AARNQLQRAQGVTKTKK--SEPAAASRARPVNNSALERLASVSERVQARPAPSALETAPV 470 A N Q ++ V K ++ +E A + R + + + E A Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQN-----------REVAKEAKSNVKANTQTNEVAQS 1088 Query: 471 KKEAYRWKATTPVVQTKEVVATPKALKKALEHEKTPELAAKLAAEAIERDPWAAQVSQLS 530 E T +TKE K K +E EKT E K+ ++ + + V + Sbjct: 1089 GSET----KETQTTETKETATVEKEEKAKVETEKTQE-VPKVTSQVSPKQEQSETVQPQA 1143 Query: 531 LPKLVEQVALNAWKEQNGNAVCLHLRSTQRHLNSSGAQQKLAQALSDLTGTTVELTIVED 590 P +N + Q+ + +S+ Q + + VE Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203 Query: 591 DNPAVRTPLEWRQAIYEEKLAQARESIIADNNIQT 625 T + + ++ S+ + T Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.9 bits (69), Expect = 0.005 Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 6/63 (9%) Query: 222 AEPGALIRQLAQGAPQYKEQLMT--IAEWLEE---KGRTEGLQKGLEQGLAQGREAEARA 276 AEP +L +QLAQ Q EQ IAE ++ +G EGL +GLEQGLA+ + +A Sbjct: 36 AEP-SLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPI 94 Query: 277 IAR 279 AR Sbjct: 95 HAR 97
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 36.2 bits (83), Expect = 7e-04 Identities = 41/219 (18%), Positives = 81/219 (36%), Gaps = 18/219 (8%) Query: 92 RQKVAQAPEKMRQ-ATAALNALSDVDNDDEMRKTLSALSLRQLELRVA--QVLDDLQNSQ 148 R ++A+A EK R+ A AA A + + + + A + RQL+L A + L L Sbjct: 129 RLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEA 188 Query: 149 NDLAAYNSQLVSLQTQPERVQNAMYTASQQI-------QQIRNRLDGNNVGEAALRPSQQ 201 + +L + Q++ ++ + T + ++ L G A + Sbjct: 189 KAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYK 248 Query: 202 VLLQAQQALLNAQID--------QQRKSLEGNTVLQDTLQKQRDYVTANSNRLEHQLQLL 253 L + + L D + + G +++ QKQ NR+ + + Sbjct: 249 ELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQI 308 Query: 254 QEAVNSKRLTLTEKTAQEAISPDETARIQANPLVKQELD 292 Q+A++ A+ + + + Q N L Q D Sbjct: 309 QKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKD 347
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 204 bits (519), Expect = 8e-69 Identities = 187/214 (87%), Positives = 199/214 (92%) Query: 1 MARKTKQQALETRQHILDVALRLFSQQGVSATSLAEIANAAGVTRGAIYWHFKNKSDLFS 60 MARKTKQ+A ETRQHILDVALRLFSQQGVS+TSL EIA AAGVTRGAIYWHFK+KSDLFS Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSESNIGELEIEYQAKFPDDPLSVLREILVHILEATVTEERRRLLMEIIFHKCEFV 120 EIWELSESNIGELE+EYQAKFP DPLSVLREIL+H+LE+TVTEERRRLLMEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMVVVQQAQRSLCLESYDRIEQTLKHCINAKMLPENLLTRRAAILMRSFISGLMENWLF 180 GEM VVQQAQR+LCLESYDRIEQTLKHCI AKMLP +L+TRRAAI+MR +ISGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APQSFDLKKEARAYVTILLEMYQLCPTLRASTVN 214 APQSFDLKKEAR YV ILLEMY LCPTLR N Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATN 214
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 33/216 (15%), Positives = 75/216 (34%), Gaps = 27/216 (12%) Query: 100 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 159 + Y A +L + + ++ + Q +++ ++ L +Q T + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVQNGQASALATVQQLDPIYVDVTQ 218 + + + +P+S ++ + V TEG +V + + V + D + V Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTALV 372 Query: 219 SSNDFLRLKQELA------------NGSLKQENGKAKVDLVTSDGIKFPQSGTLEFSDVT 266 + D + G L KV + D I+ + G + ++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYL-----VGKVKNINLDAIEDQRLGLVFNVIIS 427 Query: 267 VDQTTGSITLRAIFPNPDHTLLPGMFVRARLQEGTK 302 +++ S + I L GM V A ++ G + Sbjct: 428 IEENCLSTGNKNIP------LSSGMAVTAEIKTGMR 457 Score = 32.9 bits (75), Expect = 0.002 Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 10/133 (7%) Query: 49 PLQITTELPGR-TVAYRIAEVRPQVSGIILKRNFV-EGSDIEAGVSLYQIDP-------A 99 ++I G+ T + R E++P + I+ K V EG + G L ++ Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTL 137 Query: 100 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 159 Q++ A+ + + Q + EL KL Y ++ L Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Query: 160 AKAAVETARINLA 172 + +NL Sbjct: 198 WQNQKYQKELNLD 210
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1366 bits (3538), Expect = 0.0 Identities = 808/1033 (78%), Positives = 916/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAIFKLPVAQYPTIALPAVTISATYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAI +LPVAQYPTIA PAV++SA YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDPISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPTELTKYQLTPVDVINAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ L KY+LTPVDVIN +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTDEFGKILLKVNQDGSQVRLRDVAKIELGGENYDVIAKFNGQPASGLGIKLATGANAL 300 + +EFGK+ L+VN DGS VRL+DVA++ELGGENY+VIA+ NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIRAELKKMEPFFPPGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+L +++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 TEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPVAKGDHGEGKKGFFGWFNRLFDKSTHHYTDSVGNILRSTGR 540 SVLVALILTPALCAT+LKPV+ H E K GFFGWFN FD S 
+HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLLLYLIIVVGMAYLFVRLPSSFLPDEDQGVFLTMVQLPAGATQERTQKVLDEVTDYYLN 600 YLL+Y +IV GM LF+RLPSSFLP+EDQGVFLTM+QLPAGATQERTQKVLD+VTDYYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKANVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEKNKVEAITQRATAAFSQIKD 660 EKANVESVF VNGF F+G+ QN G+AFVSLK W +R G++N EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLFGEVAKYPDLLVGVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQL G A++P LV VRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDINDWYVRGSDGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D++ YVR ++G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MAMMEELASKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFS 900 MA+ME LASKLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 VEATLEAVRMRLRPILMTSLAFMLGVMPLVISSGAGSGAQNAVGTGVLGGMVTATVLAIF 1020 VEATL AVRMRLRPILMTSLAF+LGV+PL IS+GAGSGAQNAVG GV+GGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 491 bits (1266), Expect = e-180 Identities = 239/295 (81%), Positives = 254/295 (86%), Gaps = 9/295 (3%) Query: 1 MKKTLLAVSAALALTSSFTANAAENDQPQYLSDWWHQSVNVVGSYHTRFSPKLNNDVYLE 60 MKKTLLA A +AL+++F A AAEND+PQYLSDWWHQSVNVVGSYHTRF P++ ND YLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 61 YEAFAKKDWFDFYGYIDIPKTFDWGNGNDKGIWSDGSPLFMEIEPRFSIDKLTGADLSFG 120 YEAFAKKDWFDFYGYID P F GN KGIW+ GSPLFMEIEPRFSIDKLT DLSFG Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFG-GNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFG 119 Query: 121 PFKEWYFANNYIYDMGDNKASRQSTWYMGLGTDIDTGLPMGLSLNVYAKYQWQNYGASNE 180 PFKEWYFANNYIYDMG N + QSTWYMGLGTDIDTGLPM LSLNVYAKYQWQNYGASNE Sbjct: 120 PFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNE 179 Query: 181 NEWDGYRFKVKYFVPITDLWGGKLNYIGFTNFDWGSDLGDDP--------NRTSNSIASS 232 NEWDGYRFKVKYFVP+TDLWGG L+YIGFTNFDWGSDLGDD RTSNSIASS Sbjct: 180 NEWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASS 239 Query: 233 HILALNYDHWHYSVVARYFHNGGQWQNGAKLNWGDGDFSAKSTGWGGYLVVGYNF 287 HILALNY HWHYS+VARYFHNGGQW + AKLN+GDG FS +STGWGGY VVGYNF Sbjct: 240 HILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 32.9 bits (75), Expect = 4e-04 Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 5/56 (8%) Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAQRLAVSERTIYRDIRDLSLSGVPVEG 53 + R +I +I+ + T L V++ T+ RDI++L L VP Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNN 57
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 341 bits (876), Expect = e-120 Identities = 102/306 (33%), Positives = 173/306 (56%), Gaps = 12/306 (3%) Query: 1 MRWDFWAFGISGLLLIAAIVIMGVRGFNWGLDFTGGTVIEITLEKPAEMDVMREALQKAG 60 RW + FG + +++IA++++ V G N+G+DF GGT I ++ V R AL+ Sbjct: 17 FRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALEPLE 76 Query: 61 YEEPQLQNFGS------SHDIMVRMPPTEGETGGQVLGSKVVTIINE------ATNQNAA 108 + + H M+R+ E G + G++ ++N+ A + Sbjct: 77 LGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPALK 136 Query: 109 VKRIEFVGPSVGADLAQTGAMALLVALISILVYVGFRFEWRLAAGVVIALAHDVIITLGI 168 + E VGP V +L T +LL A + I+ Y+ RFEW+ A G V+AL HDV++T+G+ Sbjct: 137 ITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGL 196 Query: 169 LSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQTLHR 228 ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +TL R Sbjct: 197 FAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSR 256 Query: 229 TLITSGTTLVVILMLYLFGGPVLEGFSLTMLIGVSIGTASSIYVASALALKLGMKREHML 288 T++T TTL+ ++ + ++GG V+ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 257 TVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEK 316 Query: 289 QQKVEK 294 + +K Sbjct: 317 KDPSDK 322
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 69.5 bits (170), Expect = 6e-15 Identities = 35/165 (21%), Positives = 79/165 (47%), Gaps = 4/165 (2%) Query: 422 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIF-FYKKFGLIATSALVANLVLIV 480 ++I ++GP + + + + + LA VV + ++ F +F L A ALV +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 481 GIMSLLPGATLSMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAINEGYAGA 538 G+ ++L + +A ++ +++ V++ +R++E L ++ +N Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 539 FSSIFDANITTLIKVIILYAVGTGAIKGFAITTGIGVATSMFTAI 583 S +TTL+ ++ + G I+GF GV T ++++ Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSV 298
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 17/123 (13%), Positives = 38/123 (30%), Gaps = 28/123 (22%) Query: 300 TFTFEVDDSLSVLGNEEQLRSAISNLVYNAVNH----TPAGTHITVSWRRVAHGAEFCIQ 355 F +++ ++ + + + LV N + H P G I + + ++ Sbjct: 241 QFENQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 356 DNGPGIAAEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHALNH---HESRLEIDSSPG 412 + G +G GL V+ L E+++++ G Sbjct: 298 NTGSLAL------------------KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 413 KGT 415 K Sbjct: 340 KVN 342
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 3e-20 Identities = 27/131 (20%), Positives = 52/131 (39%), Gaps = 9/131 (6%) Query: 6 LEQNGFQPVEAEDYDSAVNKLNEPWPDLILLDWMLPGGSGLQFIKHLKREAMTRDIPVVM 65 L + G+ + + + DL++ D ++P + + +K+ D+PV++ Sbjct: 23 LSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARP--DLPVLV 80 Query: 66 LTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDP 125 ++A+ ++ E GA DY+ KPF EL+ I + E L D Sbjct: 81 MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-------EPKRRPSKLEDDS 133 Query: 126 GSHRVMTGDSP 136 + G S Sbjct: 134 QDGMPLVGRSA 144
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 29.3 bits (65), Expect = 0.028 Identities = 14/70 (20%), Positives = 29/70 (41%), Gaps = 4/70 (5%) Query: 149 KQQQLLHAIADYYQQQYQEACQLRGERKLPVIATGHLTTVGASKSDAVRDIYIGTLDAFP 208 K+ Q+++ IA++Y +++ + E++ T D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAIN-EKEAFECIYDSRTRSA--GKD-IVSVKINIDKAKK 190 Query: 209 AQHFPPADYI 218 + P DYI Sbjct: 191 ILNLPECDYI 200
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.9 bits (114), Expect = 2e-07 Identities = 32/198 (16%), Positives = 71/198 (35%), Gaps = 13/198 (6%) Query: 373 TQQSHDRAQLSQWQQQLLSDTRQRDALPPLTLDLTPQALAEARALHTRQRPLRHRLAALQ 432 TQ S +A+L Q + Q+LS + + + LP L L P + R L ++ Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL------IK 192 Query: 433 GQILPKQKRQAQLQAAIARHHQEQTQYTQRLADKRLSYKTKAQELADVRTICEQ----EA 488 Q Q ++ Q + + + E+ R+ + + L D ++ + + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 489 RIKDLESQRAHLQS--GQPCPLCGSTTHPAIAAYQALELSANQTRRDALEKEVKTLAEEG 546 + + E++ + ++A + +L + + L+K +T Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI- 311 Query: 547 AALRGQLDALTQQLQRDE 564 L +L ++ Q Sbjct: 312 GLLTLELAKNEERQQASV 329
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.8 bits (119), Expect = 9e-09 Identities = 68/356 (19%), Positives = 120/356 (33%), Gaps = 35/356 (9%) Query: 5 IFSLALGTFGLGMAEFGIMGVLTELARDVGITIPAAGH---MISFYAFGVVLGAPVMALF 61 + ++AL G+G+ IM VL L RD+ + H +++ YA APV+ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRFSLKHILLFLVMLCVMGNAIFTFSSSYLMLAVGRLVSGFPHGAFFGVGAIVLSKIIR 121 S RF + +LL + + AI + +L +GR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMVSGMTVANLVGIPFGTYLSQEFSWRYTFLLIAVFNIAVLTAIFFWVPDI 181 G A G +S +V P L FS F A N F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 RDKAQGSLHEQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYIKPFMMYI 229 + L + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 230 SGFSETSMTFIMMLVGLGM---VLGNLLSGKLSGRYTPLRIAVVTDLVIVLSLMALFFFS 286 F + T + L G+ + +++G ++ R R ++ ++ + Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG---MIADGTGYILLA 295 Query: 287 GYKTASLTFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAIG 340 + F + + P +L E G G +A +L S +G Sbjct: 296 FATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351
>PF06291#Lambda prophage Bor protein Length = 102 Score = 30.0 bits (67), Expect = 0.002 Identities = 21/68 (30%), Positives = 30/68 (44%), Gaps = 11/68 (16%) Query: 28 VNDKEIICSPDESNTHTFVILEGVVSLVRGDKVLIGIVQAPFIFGLADGVAKKEAQYKLI 87 V +K +P E+ TH F VS + K V A I G A+ V K E Q + Sbjct: 29 VGNKPTAVTPKETITHHFF-----VSGIGQKKT----VDAAKICGGAENVVKTETQQTFV 79 Query: 88 AESGCIGY 95 +G +G+ Sbjct: 80 --NGLLGF 85
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 120 bits (303), Expect = 6e-30 Identities = 100/436 (22%), Positives = 165/436 (37%), Gaps = 59/436 (13%) Query: 597 TYSANGEADNSYTDNVVA---ATGNYKVRIDNATGAGSVADYKGNELIRVNDVNTDATFS 653 + N AD +D +V A+G +++ + N+ GS L+ + + ATF+ Sbjct: 483 LFRMNVFADLGLSDKLVVMQDASGQHRLWVRNS---GSEPASANTLLLVQTPLGSAATFT 539 Query: 654 AAN---KADLGAYTYQAKQEGNTV------------------------------------ 674 AN K D+G Y Y+ GN Sbjct: 540 LANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQ 599 Query: 675 VLEQMELTDYANMALSIP--SANTNIWNLEQDTVGTRLTNARHGLADNGGAWVSYFGGNF 732 EL+ AN A++ + +W E + + RL R D GGAW F Sbjct: 600 PPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRL-NPDAGGAWGRGFAQRQ 658 Query: 733 NGDNGTIN-YDQDVNGIMVGVDTKVDGNNAKWIVGAAAGFAKGDLS---DRTGQVDQDSQ 788 DN +DQ V G +G D V +W +G AG+ +GD D G D Sbjct: 659 QLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTD---- 714 Query: 789 SAYIYSSARFANN--IFVDGNLSYSHFNNDLSANMSDGTYVDGNTSSDAWGFGLKLGYDL 846 S ++ A + + ++D L S ND SDG V G + G L+ G Sbjct: 715 SVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRF 774 Query: 847 KLGDAGYVTPYGSVSGLFQSGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYSEDQAL 906 D ++ P ++ G Y+ +N ++V + S+ LG++ G + + + Sbjct: 775 THADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQV 834 Query: 907 TPYFKLAYVYD-DSNNDADVNGDSIDNGVEGSAVRVGLGTQFSFTKNFSAYTDANYLGGG 965 PY K + + + D NG + + G+ +GLG + + S Y Y G Sbjct: 835 QPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGP 894 Query: 966 DVDQDWSANVGVKYTW 981 + W+ + G +Y+W Sbjct: 895 KLAMPWTFHAGYRYSW 910
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.003 Identities = 19/69 (27%), Positives = 29/69 (42%) Query: 254 DVLREIRERTELPLGAYQVSGEYAMIKFAAMAGAIDEEKVVLESLGSIKRAGADLIFSYF 313 + E+ + +L L QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKNI 322 L+L E+ I Sbjct: 526 DLNLVERRI 534
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 341 bits (875), Expect = e-114 Identities = 119/376 (31%), Positives = 188/376 (50%), Gaps = 57/376 (15%) Query: 192 ALDMTRLTRRQRVDYPSGKGLQTRYELGDIRGQSPQMEQLRQTITLYARSRAAVLIQGET 251 AL + + + + G+S M+++ + + ++ ++I GE+ Sbjct: 118 ALAEPKRRPSKL--------EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGES 169 Query: 252 GTGKELAAQAIHQTFFHRQPHRQNKPSPPFVAVNCGAITESLLEAELFGYEEGAFTGSRR 311 GTGKEL A+A+H R+N P FVA+N AI L+E+ELFG+E+GAFTG++ Sbjct: 170 GTGKELVARALHD-----YGKRRNGP---FVAINMAAIPRDLIESELFGHEKGAFTGAQT 221 Query: 312 GGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVLEEKAVTRVGGHQPIPVEVRVISATH 371 G FE A GGTLFLDEIG+MP+ QTRLLRVL++ T VGG PI +VR+++AT+ Sbjct: 222 R-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280 Query: 372 CDLDREIMQGRFRPDLFYRLSILRLTLPPLRERQADILPLAESFLKQSLAAMEIPFTESI 431 DL + I QG FR DL+YRL+++ L LPPLR+R DI L F++Q + + Sbjct: 281 KDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLD----V 335 Query: 432 RHGLTQCQPLLLAWRWPGNIRELRNMMERLALFLS------------------------- 466 + + L+ A WPGN+REL N++ RL Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395 Query: 467 -VDPAPTLDRQFMRQLLPELMVNTAELTPST---------VDAHTLQDVLARFKGDKSAA 516 Q + + + + + + P + ++ + L +G++ A Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455 Query: 517 ARYLGISRTTLWRRLK 532 A LG++R TL ++++ Sbjct: 456 ADLLGLNRNTLRKKIR 471
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.4 bits (66), Expect = 0.014 Identities = 8/55 (14%), Positives = 17/55 (30%) Query: 277 FAIMKLPLADINAQNAMMHAGKSSEADVQGHVDGWINAHQQQFDGWVKEALAAQK 331 F + ++P + S +D + HV + + Q + Q Sbjct: 133 FVLNEMPARYRKNVLGAVLLSPSQSSDFEIHVSEMVTSDNQSARYLTLPEVNKQT 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.4 bits (110), Expect = 1e-07 Identities = 31/165 (18%), Positives = 66/165 (40%), Gaps = 2/165 (1%) Query: 34 LDTIAHHFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFE-RRTLIVSMTLLAAGGMLI 92 L IA+ F+ +S ++ TA L ++ G L D +R L+ + + G ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 93 TASSQSLSMMILGTALTGLFSVVAQILVPLA-ATLATPATRGKVVGTIMSGLLLGILLAR 151 S++I+ + G + LV + A RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 152 TVAGLLANLGGWRTVFWVASALMALMAVALWRGLPKLKSDTHLNY 196 + G++A+ W + + + + + +++ H + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 74.5 bits (183), Expect = 1e-16 Identities = 62/418 (14%), Positives = 125/418 (29%), Gaps = 97/418 (23%) Query: 19 KRKTALLLLTLLFVIIAVAYGIYWFLVLRHIEETDDA----YVAGNQVQIMAQVSGSVTK 74 + L F++ + VL +E A +G +I + V + Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 75 VWADNTDFVKEGDVLVTLDQT--------------------------------------- 95 + + V++GDVL+ L Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 96 -------------DAKQAFERAKTALASSVRQTHQLMINSKQ-------LQANIDVQKTA 135 + + K ++ Q +Q +N + + A I+ + Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 136 LAQAQSDLNRRVPLGNANLIGREELQHARDAVASAQAQLDVAIQQYNANQAMILNSNLED 195 +S L+ L + I + + + A +L V Q ++ IL++ E Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 196 QPAVQQAATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQ 236 Q Q E+ + + + I +P++ V + V G Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 237 ISPTTPLMAVVPATD-LWVDANFKETQLANMRIGQPVTIITDIYGDDVKY---TGKVVGL 292 ++ LM +VP D L V A + + + +GQ I + + +Y GKV + Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI 408 Query: 293 DMGTGSAFSLLPAQNATGNWIKVVQRLPVRVELDARQLEQHPLRIGLSTLVTVDTANR 350 + ++ G V+ + + PL G++ + T R Sbjct: 409 -----NLDAIE--DQRLGLVFNVIISIEENCLSTGNK--NIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 129 bits (326), Expect = 8e-35 Identities = 94/405 (23%), Positives = 164/405 (40%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRF 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ + Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFMWSTVAFAAASWACGVS-SGLNMLIFFRVVQGVVAGPLIPLSQSLLLNNYPPAK 135 G +L ++ + S V S ++LI R +QG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGIAVVLMTLHTLRGRETH 195 R A L V + GP +GG I+ HW ++ I + I I V + L+ Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRI 195 Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVIAISFLIVWELTD 255 D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 196 KG--HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DHPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAWTFEPGMDFGASAWPQFIQGF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 287 bits (736), Expect = e-103 Identities = 130/170 (76%), Positives = 145/170 (85%) Query: 2 PLLDSFAVDHTRMQAPAVRVAKTMNTPHGDAITVFDLRFCIPNKEVMPEKGIHTLEHLFA 61 PLLDSF VDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ EKGIHTLEHL+A Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60 Query: 62 GFMRDHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMADVLKVQDQNQIP 121 GFMR+HLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAM DVLKV++QN+IP Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120 Query: 122 ELNVYQCGTYQMHSLSEAQDIARHILERDVRVNSNKELALPKEKLQELHI 171 ELN YQCGT MHSL EA+ IA++ILE V VN N ELALP+ L+EL I Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 128 bits (322), Expect = 4e-39 Identities = 81/216 (37%), Positives = 129/216 (59%), Gaps = 3/216 (1%) Query: 1 MAKKTKADALKTRQHLIETAIAQFALRGVANTTLNDIADAADVTRGAIYWHFENKTQLFN 60 MA+KTK +A +TRQH+++ A+ F+ +GV++T+L +IA AA VTRGAIYWHF++K+ LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVW-LQQPPLRELIQDRLTGCWNDNPLQDPREKFIAALQYIAAVPRQQALMQILYHKCEF 119 E+W L + + EL + +PL RE I L+ R++ LM+I++HKCEF Sbjct: 61 EIWELSESNIGELELEYQAKF-PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 HNGM-ISEQAIREKMGFHHQSLLEVLQRCMDKKLISGSLDLDVILIILHGSFSGIVKNWL 178 M + +QA R + + + L+ C++ K++ L II+ G SG+++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 MNPTSYDLYKQAPALVDNVLKMLSPDGSVRQLMPNE 214 P S+DL K+A V +L+M ++R NE Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.5 bits (100), Expect = 2e-06 Identities = 24/137 (17%), Positives = 48/137 (35%), Gaps = 15/137 (10%) Query: 98 ATYQADYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 156 + K +L + E+ A + Q + I D RQ + Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 157 VAAKAAVESARINLAYTKVTSPISGRIGKSNV-TEGALVTNGQSTELATVQQLDPIYVDV 215 + + + +P+S ++ + V TEG +VT + T + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTA 370 Query: 216 TQSSND--FMRLKQSVE 230 + D F+ + Q+ Sbjct: 371 LVQNKDIGFINVGQNAI 387 Score = 37.1 bits (86), Expect = 1e-04 Identities = 22/127 (17%), Positives = 41/127 (32%), Gaps = 13/127 (10%) Query: 46 TAPLAVTTELPGR-TSAFRIAEVRPQVSGIVLKRNFTEGSDVEAGQSLYQIDPATYQADY 104 + + G+ T + R E++P + IV + EG V G L ++ +AD Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD- 135 Query: 105 DSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADAAVVAAKAAVE 164 K++++ A L RY L E ++ + Sbjct: 136 ------TLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 165 SARINLA 171 +L Sbjct: 185 LRLTSLI 191
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1391 bits (3602), Expect = 0.0 Identities = 917/1032 (88%), Positives = 974/1032 (94%) Query: 1 MANFFIRRPIFAWVLAIILMMAGALAIMQLPVAQYPTIAPPAVSISATYPGADAQTVQDT 60 MANFFIRRPIFAWVLAIILMMAGALAI+QLPVAQYPTIAPPAVS+SA YPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSFLMVAGFVSDNPNTTQDDISDYVASNIKDSISRLNGVGDVQLFGA 180 EVQQQGISVEKSSSS+LMVAGFVSDNP TTQDDISDYVASN+KD++SRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDANLLNKYQLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240 QYAMRIWLDA+LLNKY+LTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KDPEEFGKVTLRVNTDGSVVHLKDVARIELGGENYNVVARINGKPASGLGIKLATGANAL 300 K+PEEFGKVTLRVN+DGSVV LKDVAR+ELGGENYNV+ARINGKPA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIKAKLAELQPFFPQGMKVVYPYDTTPFVKISIHEVVKTLFEAIILVFLVMYLFLQ 360 DTA AIKAKLAELQPFFPQGMKV+YPYDTTPFV++SIHEVVKTLFEAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NIRATLIPTIAVPVVLLGTFAVLAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N+RATLIPTIAVPVVLLGTFA+LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 MEDNLSPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 MED L P+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATLLKPVSAEHHEKKSGFFGWFNTRFDHSVNHYTNSVSGIVRNTGRY 540 SVLVALILTPALCATLLKPVSAEHHE K GFFGWFNT FDHSVNHYTNSV I+ +TGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LIIYLLIVVGMAVLFLRLPTSFLPEEDQGVFLTMIQLPSGATQERTQKVLDQVTHYYLNN 600 L+IY LIV GM VLFLRLP+SFLPEEDQGVFLTMIQLP+GATQERTQKVLDQVT YYL N Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFTVNGFSFSGQGQNSGMAFVSLKPWEERNGEENSVEAVIARATRAFSQIRDG 660 EKANVESVFTVNGFSFSGQ QN+GMAFVSLKPWEERNG+ENS EAVI RA +IRDG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LVFPFNMPAIVELGTATGFDFELIDQGGLGHDALTKARNQLLGMVAKHPDLLVRVRPNGL 720 V PFNMPAIVELGTATGFDFELIDQ GLGHDALT+ARNQLLGM A+HP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKLDVDQEKAQALGVSLSDINETISAALGGYYVNDFIDRGRVKKVYVQADAQFRM 780 EDT QFKL+VDQEKAQALGVSLSDIN+TIS ALGG YVNDFIDRGRVKK+YVQADA+FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPGDINNLYVRSANGEMVPFSTFSSARWIYGSPRLERYNGMPSMELLGEAAPGRSTGEAM 840 LP D++ LYVRSANGEMVPFS F+++ W+YGSPRLERYNG+PSME+ GEAAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 SLMENLASQLPNGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 +LMENLAS+LP GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAASLRGLNNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGRGLI 960 MLVVPLG+VG LLAA+L NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEG+G++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEASRMRLRPILMTSLAFILGVMPLVISRGAGSGAQNAVGTGVMGGMLTATLLAIFF 1020 EATL A RMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGM++ATLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVKRRF 1032 VPVFFVV++R F Sbjct: 1021 VPVFFVVIRRCF 1032
>adhesinb#Adhesin B signature. Length = 310 Score = 29.0 bits (65), Expect = 0.001 Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%) Query: 1 MKR---LIPVALLTTLLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWETAGAIAGG 57 MK+ L+ + L LA C+ + +V TN+ + T++ IAG Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53 Query: 58 AAAVAGLT 65 + + Sbjct: 54 KINLHSIV 61
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 141 bits (356), Expect = 3e-44 Identities = 59/143 (41%), Positives = 86/143 (60%), Gaps = 2/143 (1%) Query: 4 ALPFLIFYASFSLLLGIYDARTGLLPDRFTCPLLWGGLLYHQICLPERLPDALWGAIAGY 63 L L+ + L D LLPD+ T PLLWGGLL++ + L DA+ GA+AGY Sbjct: 134 TLAALLLT-WVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGY 192 Query: 64 GGFALIYWGYRLRYQKEGLGYGDVKYLAALGAWHCWETLPLLVFLAAMLACGRFGVALLV 123 +YW ++L KEG+GYGD K LAALGAW W+ LP+++ L++++ G+ L++ Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAF-MGIGLIL 251 Query: 124 RGKSALINPLPFGPWLAVAGFIT 146 P+PFGP+LA+AG+I Sbjct: 252 LRNHHQSKPIPFGPYLAIAGWIA 274
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 36.8 bits (85), Expect = 1e-05 Identities = 18/103 (17%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+++ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKIIELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 616 bits (1591), Expect = 0.0 Identities = 178/698 (25%), Positives = 305/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMQDLANEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KQALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 +Q R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKTARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPENPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P+ ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKSGPLAGYPVVDLGVRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ + G L G+ V D + +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.020 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 219 SK 220 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 128 bits (323), Expect = 2e-38 Identities = 80/226 (35%), Positives = 122/226 (53%), Gaps = 9/226 (3%) Query: 28 AAKPAATADSKAAFKNDDQKAAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87 A A A + D K +Y++GA LG K + GI ++ D L G+QD Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66 Query: 88 A-DKSKLSDQEIEQTLQTFEARVKSAAQAKMEKDAADNEAKGKTFRDAFAKEKGVKTSST 146 + + L++++++ L F+ + + A+ K A +N+AKG F A + G+ + Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126 Query: 147 GLLYKVEKEGTGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206 GL YK+ GTG P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186 Query: 207 KNIKKGGKIKLVIPPELAYGKTGVPG-IPANSTLVFDVELLDIKPA 251 + + G ++ +P +LAYG V G I N TL+F + L+ +K A Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 27.7 bits (61), Expect = 0.025 Identities = 35/138 (25%), Positives = 52/138 (37%), Gaps = 22/138 (15%) Query: 11 YAHPESQDSVANRVLLKPAIQHNNVTVHDLYARYPDFFID--TPYEQ-----ALLREHDV 63 Y P + D N+V P + +HD+ + D F +P + L+ V Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 64 IVFQH--PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLVGKYWRSVITTGEPESA---- 117 Q P+ + P DR L F GPG N G Y +IT PE Sbjct: 69 ---QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLNS--GPYEEKIITELAPEDDDLVL 121 Query: 118 --YRYDALNRYPMSDVLR 133 +RY A R + +++R Sbjct: 122 TKWRYSAFKRTNLLEMMR 139
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.9 bits (69), Expect = 0.019 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 7/85 (8%) Query: 522 VQKQENQADDAPKENNANSAQSRKDQKRREAELRTLT---QPLRKEITRLEKEMEKLNAQ 578 + E + A +E N N ++ RE E T + + I+ L+ M L A Sbjct: 151 TRTAEEIGEQAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAA 210 Query: 579 LA----QAEEKLGDSSLYDPSRKAE 599 A A K + + + RKAE Sbjct: 211 KASIEAAAANKAREQAAAEAKRKAE 235
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 25.1 bits (54), Expect = 0.024 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 3/46 (6%) Query: 3 IPWQGLAPDTLDNLIESFV---LREGTDYGEHERSLEQKVADVKRQ 45 +PW+ PD L FV E T E E SLEQ++A ++ Q Sbjct: 5 LPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQ 50
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 36.0 bits (83), Expect = 1e-04 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 71 PEANDFSLLEHTFIEYGQTGKGQSRKYLHTYDEAVPWNQVPGTFTP 116 P+ + + E ++ KG SRK++ ++ + + GTF Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 31.2 bits (70), Expect = 0.014 Identities = 18/52 (34%), Positives = 27/52 (51%) Query: 630 RPGGSGDVNILESPDMPSHGLLSTLEQHLQRIIGHLNTMHTISSMAWRQRPH 681 R G + + + SP S+ +LS +E LQRI HL+ H+ S + R H Sbjct: 515 REGDTKNSSTEVSPYHRSNFMLSIVEPSLQRIQKHLDQTHSFSDIGSLVRAH 566
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 45/188 (23%) Query: 270 INKDIEECNAIIEQFIDYLR------TGQEMPM--EMADLNSVL-------GEVIAAESG 314 I +D + ++ + +R +++ + E+ ++S L + + Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ---- 241 Query: 315 YEREINTALQAGSIQVKMHPLSIKRAVANMVVNA--ARYGNGWIKVSSGTESHRAWFQVE 372 +E +IN A+ V++ P+ ++ V N + + G I + ++ +VE Sbjct: 242 FENQINPAIM----DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 373 DDGPGIKPEQRKHLFQPFVRGDSARSTSGTGLGLAIV-QRIIDNH--NGMLEIGTSERGG 429 + G ++ TG GL V +R+ + +++ + ++G Sbjct: 298 NTGSLALKNTKE----------------STGTGLQNVRERLQMLYGTEAQIKL-SEKQGK 340 Query: 430 LSIRAWLP 437 ++ +P Sbjct: 341 VNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.4 bits (245), Expect = 6e-26 Identities = 39/136 (28%), Positives = 72/136 (52%), Gaps = 3/136 (2%) Query: 6 KILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLS 65 ILV DDD +R +L + L+ G+ VR +NA + R + L+V D+++P E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 ICRRLRSQSNPMPIIMVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVL---RR 122 + R++ +P+++++A+ + I E GA DY+PKPF+ EL+ I L +R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 123 QANELPGAPSQEEAVI 138 + ++L ++ Sbjct: 125 RPSKLEDDSQDGMPLV 140
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 41.8 bits (98), Expect = 9e-06 Identities = 42/142 (29%), Positives = 66/142 (46%), Gaps = 30/142 (21%) Query: 1 MKKLTIGLIGNPNSGKTTLFNQL---TGARQRVGNW-AGVTV------ERKEG---QFAT 47 MK + IG++ + ++GKTTL L +GA +G+ G T ER+ G Q Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60 Query: 48 T-----DHQVTLVDLPGTYSLTTISSQTSLDEQIACHYILSGDADLLINVVDASNLE-RN 101 T + +V ++D PG + SL +L G A LLI+ D + R Sbjct: 61 TSFQWENTKVNIIDTPG-HMDFLAEVYRSLS-------VLDG-AILLISAKDGVQAQTRI 111 Query: 102 LYLTLQLLELGIPCIVALNMLD 123 L+ L+ ++GIP I +N +D Sbjct: 112 LFHALR--KMGIPTIFFINKID 131
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 29.0 bits (64), Expect = 0.019 Identities = 12/41 (29%), Positives = 23/41 (56%) Query: 234 MRIPQHKEKIMTIAERLRREGHRNGLQKGLQQGKQEGQRLA 274 +++ H++ R++GH+ G Q+GL QG ++G A Sbjct: 47 LQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEA 87
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 32.4 bits (73), Expect = 0.004 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%) Query: 275 RTPISGDYRGYQVFSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331 R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131 Query: 332 YAYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 132 GRY---------PTFSYQDWQS 144
>PF04619#Dr-family adhesin Length = 160 Score = 27.6 bits (61), Expect = 0.031 Identities = 11/60 (18%), Positives = 21/60 (35%), Gaps = 4/60 (6%) Query: 29 VGARYGHTMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGGW 84 +G ++ D + G+ FL+ D+N ++ W + D G W Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.041 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 43.2 bits (101), Expect = 1e-06 Identities = 46/176 (26%), Positives = 73/176 (41%), Gaps = 17/176 (9%) Query: 133 SGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQELADYTAKLRAAGMKCGYASGW 192 +G L++ P L YNKD L P PPKTW+E+ +L+A G + Sbjct: 126 NGKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQ 178 Query: 193 QGWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYVG 250 + + +A G F +N +D D ++ K + L++ + D Y Sbjct: 179 EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY-- 236 Query: 251 RKDESTEKFYNGDCAMTTASSGSLANIRQYAKFNYGVGMMPYDADIKGAPQNAIIG 306 + F G+ AMT + +NI +K NYGV ++P KG P +G Sbjct: 237 --SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.9 bits (70), Expect = 0.008 Identities = 59/306 (19%), Positives = 110/306 (35%), Gaps = 34/306 (11%) Query: 42 NEYFSLTNTQS--GMLMSWLGFVGIISGAVSGIIVDRFKNPKSILTIAYLTMAALAIWQS 99 + + + G+L++ + V G + DRF + +L ++ A + Sbjct: 33 RDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDYAIMA 91 Query: 100 FRPSYQAMFI--IVGFMSLVGNGLFLVSMTKIARLLASDNEQGRYFGFLESGRGIAGTVL 157 P ++I IV ++ V+ IA + D E+ R+FGF+ + G Sbjct: 92 TAPFLWVLYIGRIVAGIT---GATGAVAGAYIADITDGD-ERARHFGFMSACFGFG---- 143 Query: 158 TLCAVAIVGLHGSSAVSIGFILRFDAAIYIIPGFTSYYLFPKGVSAIENAA------PKK 211 + + GL G + F AA+ + T +L P+ P Sbjct: 144 MVAGPVLGGLMGGFSPHAPFF--AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLA 201 Query: 212 MSDLISLLKSVKLWLAAFIISCVIFVYQGGAYL-VPYLSDAYGMTPDQT----AVIGMIR 266 + V +A F I + V Q A L V + D + A G++ Sbjct: 202 SFRWARGMTVVAALMAVFFI--MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259 Query: 267 AYFLAFIISPFAGLLADK--IGSSLKVMASFFILGALITASFIFIPHDSRFLILLITLVL 324 + A I P A L ++ + + + +IL A T ++ P ++LL + + Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP----IMVLLASGGI 315 Query: 325 LLGALT 330 + AL Sbjct: 316 GMPALQ 321
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.2 bits (94), Expect = 1e-05 Identities = 68/410 (16%), Positives = 129/410 (31%), Gaps = 65/410 (15%) Query: 35 VAPIMSKELGFDPEA---MGLAFSSFGIAYVIMQLPGGWLLDRYGSRLVYGCALIGWSLV 91 V P + ++L + G+ + + + G L DR+G R V +L G ++ Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAV- 85 Query: 92 TMFQGTIYLYGSPLIVLVILRLLMGAIEAPAFPANSRLS--------VQWFPNNERGFVT 143 I L VL I R++ G A A + ++ + F GF++ Sbjct: 86 ---DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHF-----GFMS 137 Query: 144 SVYQAAQYISLGIITPLMTIILHNLSWHFVFYYIGAIGV---MLGIFWLMKVKDPMHHPK 200 + + + P++ ++ S H F+ A+ + G F L + P Sbjct: 138 ACFGFGM-----VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPL 192 Query: 201 VNQAEIDYIRSGGGEPSLGCKKEPQKITFAQIKTVCVNRMMIGVYIGQFCVTSITWFFLT 260 + A + ++ + F + + Sbjct: 193 ---------------------RREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231 Query: 261 WFPTYLYQAKGMSILKVGFVASIPAIAGFIGGLLGGVFSDWLLKRGYSLTVARKLPVICG 320 + + +G A G + L + + + R ++ G Sbjct: 232 LWVIFGEDRFHWDATTIGISL---AAFGILHSLAQAMITGPVAARLGERRA-----LMLG 283 Query: 321 MLLSCV--IVIANYTSSEFVVIAAMSLAFFAKGFGNLGWCVLSDTSPKEVLGIAGGVFNM 378 M+ I++A T + LA G L +LS +E G G Sbjct: 284 MIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEERQGQLQGSLAA 342 Query: 379 CGNMASIVTPLVIGVILANTQSFDFAILYVGSMGLIGLISYLFIVGPLDR 428 ++ SIV PL+ I A + + + G + G YL + L R Sbjct: 343 LTSLTSIVGPLLFTAIYAASITT-----WNGWAWIAGAALYLLCLPALRR 387
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.028 Identities = 21/87 (24%), Positives = 30/87 (34%), Gaps = 13/87 (14%) Query: 8 MTVI---GAGSYGTALAITLARNGHQVVLWGHD---PKHIATLEHDRCNVAFLPDVPFPD 61 M + AG G ++ L GHQVV G D + +L+ R + P F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVV--GIDNLNDYYDVSLKQARLELLAQPGFQF-- 56 Query: 62 TLHLESDLATALAASRNILVVVPSHVF 88 + DLA + VF Sbjct: 57 ---HKIDLADREGMTDLFASGHFERVF 80
>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature. Length = 170 Score = 234 bits (598), Expect = 2e-82 Identities = 91/153 (59%), Positives = 118/153 (77%), Gaps = 4/153 (2%) Query: 3 EQNNTEMAFQIQRIYTKDVSFEAPNAPHVFQKDWQPEVKLDLDTASSQLADDVYEVVLRV 62 Q + QIQRIY KDVSFEAPN PH+FQ+DW+P++ DL T + Q+ DD+YEV L + Sbjct: 12 TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQVGDDLYEVCLNI 71 Query: 63 TVTASLGEE--TAFLCEVQQAGIFSISGIEGTQMAHCLGAYCPNILFPYARECITSLVSR 120 +V ++ AF+CEV+QAG+F+ISG+E QMAHCL + CPN+LFPYARE ++SLV+R Sbjct: 72 SVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFPYARELVSSLVNR 131 Query: 121 GTFPQLNLAPVNFDALFMNYL--QQQAGEGTEE 151 GTFP LNL+PVNFDALFM+YL Q+QA + TEE Sbjct: 132 GTFPALNLSPVNFDALFMDYLQRQEQAEQTTEE 164
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.1 bits (112), Expect = 7e-08 Identities = 25/196 (12%), Positives = 62/196 (31%), Gaps = 21/196 (10%) Query: 45 RDQLKSIQADIAAKERDVRQQQQQRASLLAQLKAQEEAISAAARKLRETQSTLDQLNAQI 104 ++ + + Q +L +E S + + + LD+ A+ Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216 Query: 105 DEMNASIAKLEQQKASQERNLAAQLDAAFRQGEHTGIQLILSGEESQRGQRLQAYFGYLN 164 + A I + E ++ L + + ++ Q + ++A Sbjct: 217 LTVLARINRYENLSRVEKSRLDD-FSSLLHKQAIAKHAVL-----EQENKYVEA-----V 265 Query: 165 QARQETIAELKQTREQVATQKAELEEKQSQQQTLLYEQRAQ-QAKLEQARNERKKTLAGL 223 + ++L+Q ++ + K E + + + ++ Q + E K Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE-- 323 Query: 224 ESSIQQGQQQLSELRA 239 +QQ S +RA Sbjct: 324 -------RQQASVIRA 332
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 42.3 bits (99), Expect = 6e-07 Identities = 43/180 (23%), Positives = 63/180 (35%), Gaps = 22/180 (12%) Query: 1 MSTPANF--NGQRPAIDANDAVMLLIDHQSGLFQTVGD--MPMPELRACAAALAKIATLC 56 M T ++ N D N AV+L+ D Q+ P+ EL A L Sbjct: 11 MPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQL 70 Query: 57 NMPVITTASVPQ-------------GPNGPLIPE----IHANAPHA-QYVARKGEINAWD 98 +PV+ TA GP P I AP V K +A+ Sbjct: 71 GIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFK 130 Query: 99 NADFVQAVKATGRKTLIIAGTITSVCMAFPAISAVAEGYKVFAVIDASGTYSKMAQEITM 158 + ++ ++ GR LII G + A A E K F V DA +S ++ + Sbjct: 131 RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMAL 190
>cloacin#Cloacin signature. Length = 551 Score = 28.9 bits (64), Expect = 0.009 Identities = 13/47 (27%), Positives = 21/47 (44%) Query: 30 NGNGGGHGNNAANQGNNGNGHKGNAGQKTEHRKNGGKPDHVESDISY 76 N GGG G+ G +G+G+ G G GG V + +++ Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90
>PF06872#EspG protein Length = 398 Score = 28.5 bits (63), Expect = 0.021 Identities = 14/54 (25%), Positives = 27/54 (50%) Query: 111 LLLEAGMEVNDDFKEPTDHLAIYLELLSHLHFSLGESFQQRRMNKLRQKTLSSL 164 L+L+A +++N D+K+P + + +LL L L + + Q L+ L Sbjct: 29 LVLDATIKINSDYKKPWNEMTCAEKLLKILTLGLWNPKYSQDERQQFQGLLTVL 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.4 bits (188), Expect = 2e-18 Identities = 28/115 (24%), Positives = 56/115 (48%), Gaps = 1/115 (0%) Query: 4 HIVIVEDEPVTQARLQAYFEQEGYRVSVTDSGAGLRDIMEHEHVSLILLDINLPDENGLM 63 I++ +D+ + L + GY V +T + A L + L++ D+ +PDEN Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 LTRALRER-STVGIILVTGRCDQIDRIVGLEMGADDYVTKPLELRELVVRVKNLL 117 L +++ + +++++ + + I E GA DY+ KP +L EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 54.8 bits (132), Expect = 9e-10 Identities = 24/118 (20%), Positives = 49/118 (41%), Gaps = 3/118 (2%) Query: 681 RLLLIEDNMLTQRITAEMLTGKGVKVSVAESANDALRCLAEGESFDVALVDFDLPDYDGL 740 +L+ +D+ + + + L+ G V + +A R +A G+ D+ + D +PD + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63 Query: 741 TLAQQLMSLYPAMKRIGFSAH-VIDDNLRQRTAGLFCGIIQKPVPREELYRMIAHYLQ 797 L ++ P + + SA ++ G + + KP EL +I L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALA 120
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.6 bits (108), Expect = 2e-07 Identities = 65/384 (16%), Positives = 117/384 (30%), Gaps = 36/384 (9%) Query: 66 AEMGYVFSAFAWLYTLCQIPGGWFLDRIGSRLTYFIAIFGWSVATLLQGFATGLLSLIGL 125 A G + + +A + C G DR G R +++ G +V + A L L Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 126 RAITGIFEAPAFPANNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQEMLSWH 185 R + GI A + ERA GF ++ G+ P+L + S H Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160 Query: 186 WVFIVTGGIGIIWSLIWFKVYQPPRLTKSLSQAELEYIRDGGGLVDGDAPAKKEARQPLT 245 F + + L + + P ++EA PL Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE-------------------RRPLRREALNPLA 201 Query: 246 KADWKLVFHRKLVGVYLGQFAVNSTLWFFLTWFPNYLTQEKGITALKAGFMTTV-PFLAA 304 W + + F + + + A G L + Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS 260 Query: 305 FFGVLLSGWLADKLVKKGFSLGVARKTPIICGLLISTC--IMGANYTNDPFWIMALMAIA 362 +++G +A +L + ++ G++ I+ A T ++ +A Sbjct: 261 LAQAMITGPVAARL---------GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311 Query: 363 FFGNGFASITWSLISSLAPMRLIGLTGGMFNFIGGLGGISVPLVIGYL-AQSYGFAPALV 421 G G ++ +++S G G + L I PL+ + A S Sbjct: 312 SGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWA 370 Query: 422 YISVVALLGALSYILLVGDVKRVG 445 +I+ AL L G G Sbjct: 371 WIAGAALYLLCLPALRRGLWSGAG 394
>SECA#SecA protein signature. Length = 901 Score = 28.3 bits (63), Expect = 0.015 Identities = 14/74 (18%), Positives = 27/74 (36%) Query: 11 KAFGKQRRKTREELNQEARDRKRLKKHRGHAPGSRAAGGNSASGGGNQNQQKDPRIGSKT 70 K + + EE+ + + R+ + +SA+ Q + ++G Sbjct: 824 STLSKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRND 883 Query: 71 PVPLGVTEKVTQQH 84 P P G +K Q H Sbjct: 884 PCPCGSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 597 bits (1540), Expect = 0.0 Identities = 204/478 (42%), Positives = 299/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGNEVLAALASKTPDVLLSDIRMPGM 60 M + V DDD++IR VL +AL+ AG N + +A+ D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNIEVNGPTTDMIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLERRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L++ + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRIHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETETALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLTQDLPGELFEASTPDSPSHLPPDSWATLLAQWADRALRS---- 416 EN R LT + + + + EL S + ++Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.034 Identities = 33/189 (17%), Positives = 71/189 (37%), Gaps = 39/189 (20%) Query: 171 IIEQADRLRNLVDRL-------LGPQHPGMHIT--ESIHKVAERVVALVSMELPDNVRLI 221 I+E + R ++ L L ++ + + V + + L S++ D ++ Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYS-NARQVSLADELTVV-DSYLQLASIQFEDRLQFE 243 Query: 222 RDYDPSLPELPHDPEQIEQVLL-NIVRNALQALGPEGGEITLRTRTAFQLTLHGERYRLA 280 +P++ ++ P + Q L+ N +++ + L P+GG+I L+ Sbjct: 244 NQINPAIMDVQV-PPMLVQTLVENGIKHGIAQL-PQGGKILLKGT------KDNGTVT-- 293 Query: 281 ARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHAGK---IEFTSWPG 337 ++VE+ G + ++ TG GL R + G I+ + G Sbjct: 294 --LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 338 HTEFSVYLP 346 V +P Sbjct: 340 KVNAMVLIP 348
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 120 bits (303), Expect = 1e-39 Identities = 49/89 (55%), Positives = 66/89 (74%) Query: 2 NKTQLIDVIADKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61 NK LI +A+ EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90 NPQTG+EIKI A+ VPAF +GKALKDAVK Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF06580#Sensor histidine kinase Length = 349 Score = 36.4 bits (84), Expect = 2e-04 Identities = 25/132 (18%), Positives = 49/132 (37%), Gaps = 29/132 (21%) Query: 340 QLRFTANETLK-RIQADPDRLTQVLLNLYL-----NAI-HAIGRQ---GTISVEAKESGT 389 ++F + L+ Q +P + + + + N I H I + G I ++ + Sbjct: 233 SIQF--EDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDN- 289 Query: 390 DRVIITVTDSGKGIAPDQLEAIFTPYFTTKADGTGLGLAVVQNIIEQHGG---AIKVKSI 446 V + V ++G + E TG GL V+ ++ G IK+ Sbjct: 290 GTVTLEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEK 337 Query: 447 EGKGAVFTIWLP 458 +GK + +P Sbjct: 338 QGKVNA-MVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 518 bits (1336), Expect = 0.0 Identities = 180/475 (37%), Positives = 255/475 (53%), Gaps = 37/475 (7%) Query: 1 MIRGKIDILVVDDDVSHCTILQALLRGWGYNVALAYSGHDALAQVREKVFDLVLCDVRMA 60 M I LV DDD + T+L L GY+V + + + DLV+ DV M Sbjct: 1 MTGATI--LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 EMDGIATLKEIKALNPAIPILIMTAFSSVETAVEALKAGALDYLIKPLDFDRLQETLEKA 120 + + L IK P +P+L+M+A ++ TA++A + GA DYL KP D L + +A Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LAHTRETGAELPSASAAQFGMIGSSPAMQHLLNEIAMVAPSDATVLIHGDSGTGKELVAR 180 LA + ++L S ++G S AMQ + +A + +D T++I G+SGTGKELVAR Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 181 ALHACSARSDKPLVTLNGAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLD 240 ALH R + P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLD Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 241 EISDISPLMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAEEVSAGRFRQDLYY 300 EI D+ Q RLLR +Q+ E VG I DVR++AAT++DL + ++ G FR+DLYY Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 301 RLNVVAIEMPSLRQRREDIPLLADHFLRRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRE 360 RLNVV + +P LR R EDIP L HF+++ + VK F +A++L+ + WPGN+RE Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRE 357 Query: 361 LENAIERAVVLLTGEYISERELPLAIAATPIKTEYSGEIQP------------------- 401 LEN + R L + I+ + + + + Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 402 ---------------LVDVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441 L ++E +ILAAL T GN+ +AA LG+ R TL K+ Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.8 bits (90), Expect = 2e-06 Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 5/54 (9%) Query: 78 VDPDVRGQGIGKRLVEHALTLAP-----GLTTNVNEQNTQAVGFYKKMGFKVTG 126 V D R +G+G L+ A+ A GL + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 31.6 bits (71), Expect = 0.008 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFADAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.4 bits (76), Expect = 2e-04 Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 9/86 (10%) Query: 58 LALRNGEVVGMISLHMQFHLHHANWIG--EIQELVVLPPMRGQKIGSQLLAWAEEEARQA 115 L +G I + +NW G I+++ V R + +G+ LL A E A++ Sbjct: 69 LYYLENNCIGRIKIR-------SNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121 Query: 116 GAELTELSTNIKRRDAHRFYLREGYK 141 L T A FY + + Sbjct: 122 HFCGLMLETQDINISACHFYAKHHFI 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 44.0 bits (104), Expect = 9e-07 Identities = 53/284 (18%), Positives = 104/284 (36%), Gaps = 40/284 (14%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSIGGE 144 G L D++GR+ +L +++ ++ + P +W +L + ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALLLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKHWRS 260 PFF A L + L K E+ P SF+ + Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFRWARGMTVVA 213 Query: 261 LLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVMGLLS 319 L + ++ + + + H+ G+ + ++ L + G ++ Sbjct: 214 ALMAVFFIM--QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271 Query: 320 DRFGRRPFVIMGSIA-LFALAIPAFILINSNVIGLIFAGLLMLA 362 R G R +++G IA + AF + F +++LA Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIMVLLA 311 Score = 37.9 bits (88), Expect = 8e-05 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVIMGSIALFALAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP ++ ++L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLL---VSLAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLIAG 401 + + + +++ G ++A I V + + + R + ++A F ++AG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQDLMMPAYYLMVIAVIGLVTGI-SMKETANR 444 P L + S P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPESHKG 187
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 5e-05 Identities = 39/182 (21%), Positives = 78/182 (42%), Gaps = 34/182 (18%) Query: 181 ARLDQMMDSVSQLLQLARVGQSFSSGNYQEVKLLEDV-ILPSYDELNTM-LETR-QQTLL 237 + +M+ S+S+L++ S N ++V L +++ ++ SY +L ++ E R Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 238 LPESAADVVVRGDATLLRMLLRNLVENAHRY----SPEGTHITIHISADPDAI-MAVEDE 292 + + DV V ML++ LVEN ++ P+G I + + D + + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 293 GPGIDESKCGKLSEAFVRMDSRYGGIGLGLSIV-SRITQLHQGQFFLQNRTERTGTRAWV 351 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 352 LL 353 L+ Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.2 bits (229), Expect = 8e-24 Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 1/144 (0%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVSTARAAEHSLESGHYSLMVLDLGLPDEDGLH 61 IL+ +DD + L A GY S A + +G L+V D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLTRIRQKKYTLPVLILTARDTLNDRISGLDVGADDYLVKPFALEELHARI-RALLRRHN 120 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 NQGESELTVGNLTLNIGRHQAWRD 144 + E + +GR A ++ Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQE 148
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 48.4 bits (115), Expect = 8e-10 Identities = 17/59 (28%), Positives = 29/59 (49%) Query: 62 DEATLFNIAVDPDFQRRGLGRMLLEHLIDELEKRGVVTLWLEVRASNAAAIALYESLGF 120 A + +IAV D++++G+G LL I+ ++ L LE + N +A Y F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 213 bits (545), Expect = 6e-64 Identities = 109/452 (24%), Positives = 209/452 (46%), Gaps = 44/452 (9%) Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSSQHAKSDWMEMEKQRGISIT 71 K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT----TRTDNTLLERQRGITIQ 57 Query: 72 TSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131 T + F + + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 132 LRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETY 191 P + F+NK+D++ D + +++ +L K + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159 Query: 192 LYQTGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVQGASNEFDEELFLAGEI 251 LY E + + D + + ++ + + LEL Q S F + Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN-DDLLEKYMSGKSLEALELEQEES-----IRFHNCSL 213 Query: 252 TPVFFGTALGNFGVDHMLDGLVAWAPAPMPRQTDTRTVEASEEKFTGFVFKIQANMDPKH 311 PV+ G+A N G+D++++ + + + + G VFKI+ K Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSS---------THRGQSELCGKVFKIE--YSEK- 261 Query: 312 RDRVAFMRVVSGKYEKGMKLRQVRTGKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHG 371 R R+A++R+ SG +R K + I++ T + G+ +++AY G+I+ L N Sbjct: 262 RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNEF 320 Query: 372 TIQIGDTFTQGEMMKFTGIPNFA-PELFRRIRLKDPLKQKQLLKGLVQLSEEG-AVQVFR 429 +++ +++ P L + P +++ LL L+++S+ ++ + Sbjct: 321 -LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379 Query: 430 PISNNDLIVGAVGVLQFDVVVARLKSEYNVEA 461 + +++I+ +G +Q +V A L+ +Y+VE Sbjct: 380 DSATHEIILSFLGKVQMEVTCALLQEKYHVEI 411
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 27.0 bits (59), Expect = 0.004 Identities = 16/49 (32%), Positives = 20/49 (40%), Gaps = 8/49 (16%) Query: 10 WGIIFLVIALIA--------AALGFGGLAGTAAGAAKIVFVVGIVLFLV 50 W +FL + A AL F LAGT G I V GI+ + Sbjct: 460 WKPLFLTLEKKAADAGVSYVVALLFSLLAGTTLGIWGIAIVTGILCSYI 508
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 28.9 bits (65), Expect = 0.022 Identities = 32/141 (22%), Positives = 51/141 (36%), Gaps = 37/141 (26%) Query: 6 IDTHCHFDFPPFTGDERASIQRACEAGVEKIIVPATEAA-------------HFPRVLAL 52 +D+H HF P I+ A +G+ ++ T A H R++ Sbjct: 133 MDSHIHFICP-------QQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEA 185 Query: 53 AARFPSLYAALGLHPIVIERHVDDDPDKLQQALAQQQNVVAVGEIGLDLYRDDPQFARQE 112 A FP A G + P AL + V G L L+ D + Sbjct: 186 ADAFPMNLAFAG-------KGNASLPG----ALVEM---VLGGATSLKLHED---WGTTP 228 Query: 113 RLLDAQLQLAKRYDLPVILHS 133 +D L +A YD+ V++H+ Sbjct: 229 AAIDCCLSVADEYDVQVMIHT 249