>CHANLCOLICIN#Channel forming colicin signature.
          Length = 522

 Score = 25.8 bits (56), Expect = 0.033
 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 6/45 (13%)

Query: 51  KERQIEIEKK-----VNVTCSKARSLQEKLSVKYKK-RQDLLDVI 89
           K+ Q  +        V+ T S  ++L EK   KY K  Q+L D
Sbjct: 334 KKAQNNLLNSQIKDAVDATVSFYQTLTEKYGEKYSKMAQELADKS 378
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 61.0 bits (148), Expect = 9e-12 Identities = 50/175 (28%), Positives = 84/175 (48%), Gaps = 18/175 (10%) Query: 8 HVDHGKTTLLQAI---TGV------------NADRLPEEKQRGMTIDLGYAYWPLPDGRI 52 HVD GKTTL +++ +G D E+QRG+TI G + + ++ Sbjct: 11 HVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKV 70 Query: 53 MGFIDVPGHEKFLANMLAGVGGIDHALLVVACDDGVMAQTREHLAILRLSGRPALTVALT 112 ID PGH FLA + + +D A+L+++ DGV AQTR LR G P + + Sbjct: 71 -NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTI-FFIN 128 Query: 113 KADRVDDERIAQVHQQILQELVAQGWSAEQISLFVTAAVTERGIGELREHLAQCH 167 K D+ ++ V+Q I ++L A+ +++ L+ VT E + + + + Sbjct: 129 KIDQN-GIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182
>TCRTETB#Tetracycline resistance protein TetB signature.
          Length = 458

 Score = 39.9 bits (93), Expect = 2e-05
 Identities = 41/202 (20%), Positives = 80/202 (39%), Gaps = 5/202 (2%)

Query: 1   MQTSFSPATRLGRRALLFPLCLVLFEFAAYIANDMIQPGMLAVVAEFNASVEWVPTSMTA 60
           M TS+S +     + L++   L  F     +  ++  P +     +  AS  WV T+
Sbjct: 1   MNTSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFML 60

Query: 61  YLAGGMFLQWLLGPLSDRRGRRPVMLAGVAFFVVTCLAILLVNS-IEQFIAMRFLQGIGL 119
             + G  +    G LSD+ G + ++L G+       +   + +S     I  RF+QG G
Sbjct: 61  TFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGA 117

Query: 120 CFIGAVGYATIQESFEEAVCIKITALMANVALIAPLLGPLAGAALIHVAPWQTMFVLFAV 179
               A+    +     +    K   L+ ++  +   +GP  G  + H   W  +  L  +
Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL-LIPM 176

Query: 180 LGAISFAGLWRAMPETASLKGE 201
           +  I+   L + + +   +KG
Sbjct: 177 ITIITVPFLMKLLKKEVRIKGH 198
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 735 bits (1899), Expect = 0.0 Identities = 223/875 (25%), Positives = 375/875 (42%), Gaps = 62/875 (7%) Query: 5 SKRKKTIFLMVKVLTIILVWLFLPESTAVVKFNTNIIDAKDRSNIDLSRFEVDDYTPPGN 64 K + F + ++ P S+A + FN + ++ DLSRFE PPG Sbjct: 19 RKHRLAGFFV-RLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77 Query: 65 YLLDILIDDRLLPERYLVTYLAVDEGKSTKLCLTPDLVNLFGLSTEVRESMTLWNNDKCV 124 Y +DI +++ + R VT+ D + CLT + GL+T M L +D CV Sbjct: 78 YRVDIYLNNGYMATRD-VTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACV 136 Query: 125 AIDEK-KEIKIQYDKEKQYLIISIPQAWLAYNDPNWVPPSQWGNGVAGTLLDYNLFGYHY 183 + + Q D +Q L ++IPQA+++ ++PP W G+ LL+YN G Sbjct: 137 PLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV 196 Query: 184 SPNMGGSTTNFSSYGTTGANMGPWRIRADYQYINTETAGE--HYRNFDWSQVYAFRAIPS 241 +GG++ +G N+G WR+R + + + + + R I Sbjct: 197 QNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 242 IGAKFVGGQTYLNSSIFDSFRFLGTSLSSDERMLPPTLRGYAPQVMGIAHTNARVVLSQN 301 + ++ G Y IFD F G L+SD+ MLP + RG+AP + GIA A+V + QN Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 302 GRVLYQTNVAPGPFVIQDIS-EAVQGNIDVRVEEEDGRVTVFQVNAASVPFLTRKGAVRY 360 G +Y + V PGPF I DI G++ V ++E DG +F V +SVP L R+G RY Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 361 KAALGRPMLGGNNSASNPTFFSGEFSWGAFNHVSLYGGLMTTSQDYTSAALGIGQNLYDF 420 G GN P FF G ++YGG + Y + GIG+N+ Sbjct: 377 SITAGEYR-SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQ-LADRYRAFNFGIGKNMGAL 434 Query: 421 GALSIDITHSRAQLPNEEQQNGESYRVNYSKRFEQTDSQISFAGYRFSKKNFMSMSQYLD 480 GALS+D+T + + LP++ Q +G+S R Y+K ++ + I GYR+S + + + Sbjct: 435 GALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTY 494 Query: 481 -WLNGNTALQYD-------------------KQAYTVAANQYLAWPDITMYLSVTRRTYW 520 +NG D + + Q L T+YLS + +TYW Sbjct: 495 SRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLYLSGSHQTYW 553 Query: 521 NA-ASSNNYSLSMSKIFDIGTFKGISATISANKVNNQYANENQMFFSLSVPIGIGQQASY 579 + ++ F+ I+ T+S + N + +L+V I 
Sbjct: 554 GTSNVDEQFQAGLNT-----AFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608 Query: 580 DAQRG-RNTGYTQNISYFNNQNPKNI--------------WRISAGGGNPELQKGNGVFR 624 D++ R+ + ++S+ N N+ + + G Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 625 GGYQHSSPYGEFGLDGSHKNNEYNSINTNWYGSITATAYGVAAHQNKAGNEPRIMVDTGD 684 + YG + SH +++ + G + A A GV Q N+ ++V Sbjct: 669 ATLNYRGGYGNANIGYSH-SDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKAPG 725 Query: 685 VAGVSLNNNSAV-TNRFGVAVVSGATSYQQSDIRVDVQNLPDDIEVYNTVIQKTLTEGAI 743 + N + V T+ G AV+ AT Y+++ + +D L D++++ N V T GAI Sbjct: 726 AKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAI 785 Query: 744 GYREIRAVKGRQMMAIIRLKDGSSPPLGASVITDKTGAEVGIVGDDGLTYLAGLQDTERL 803 E +A G +++ + + P GA V T ++ GIV D+G YL+G+ ++ Sbjct: 786 VRAEFKARVGIKLLMTLT-HNNKPLPFGAMV-TSESSQSSGIVADNGQVYLSGMPLAGKV 843 Query: 804 TVQWGKK---QCTL--ILPKDKGM-NSGKVLLPCQ 832 V+WG++ C LP + ++ C+ Sbjct: 844 QVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.
          Length = 296

 Score = 57.6 bits (139), Expect = 1e-11
 Identities = 41/193 (21%), Positives = 83/193 (43%), Gaps = 7/193 (3%)

Query: 47  LNPQKVVILNPSVLDNADALHIKVAGVPQTSTHLPAFLSKYSGPE-YMNTGTLFEPDYEA 105
           ++P ++V L    ++   AL I   GV  T  +   ++S+   P+  ++ G   EP+ E
Sbjct: 33  IDPNRIVALEWLPVELLLALGIVPYGVADTINY-RLWVSEPPLPDSVIDVGLRTEPNLEL 91

Query: 106 LSQAKPDLIIAGGRAQDAYNKLSAIAPTIALDVDTQHFTQSLTQRT-EQLASIFGKEEEA 164
           L++ KP  ++       +   L+ IAP    +        ++ +++  ++A +   +  A
Sbjct: 92  LTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151

Query: 165 KTLLGNFSSQVNAIKQKSANAGS---AMVLMISGGKMSAYTPGSRFGFIFDELGFTPAAT 221
           +T L  +   + ++K +    G+    +  +I    M  + P S F  I DE G   A
Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQ 211

Query: 222 FAESGRHGNVVTS 234
             E+   G+   S
Sbjct: 212 -GETNFWGSTAVS 223
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 103 bits (258), Expect = 1e-24 Identities = 184/882 (20%), Positives = 312/882 (35%), Gaps = 117/882 (13%) Query: 238 GVNVGEGSSITMDGLIATG----NITNLFKVNGNASVSNANIELAAGGLLMAQGHSASNQ 293 GV G++I + G A G N + + S+ + + + + Sbjct: 60 GVRTASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGK 119 Query: 294 AVI---ILNNVDAISNGGGTTLVDVNKDADVTINGGAYHSKGNNAKGIWVRDNNSSLNVD 350 V L NV + G L + A +I G I +++ V Sbjct: 120 LVADHATLANVGDTWDDDGIALYVAGEQAQASIADSTLQGAG--GVQIE---RGANVTVQ 174 Query: 351 NVVIITEGVNATAIENRGTAIVKNTTVITQGNNSHGL---------YSEQSLDATNMAIS 401 I+ G++ A+++ + + V+ + N + + + T Sbjct: 175 RSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGGH 234 Query: 402 TAGIGSIGAAAAKGGNLNLNDALIETTGNS-------GMVLGTFADSSISAKNITGLSTG 454 G + G AA +G ++L A I G V G Sbjct: 235 ITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFG--PVL 292 Query: 455 AGAYALWVDDGSSILLEESQITTQGQGAGGIYASN---TGTGSHTAYTQVTLNNSQIHSE 511 G Y + V GSS+ L +S + GA T +G + + + Sbjct: 293 DGWYGVDVS-GSSVELAQSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARR 351 Query: 512 QGPGIWANGADINVDVKNGSQLTGGNGLLIYASSNAGAA----SNVNVNGDNHAVLLGDI 567 P A +++ ++ G+ G L+Y + GD A L I Sbjct: 352 FAPQ----AAPLSITLQAGAHAQGK--ALLYRVLPEPVKLTLTGGADAQGDIVATELPSI 405 Query: 568 HAAENSNINLALNNNSVWTGAATNAKQVDIDSSSIWNLTGDADVESMHVLGQMNFISNSS 627 +++AL + + WTGA + ID+++ W +T +++V ++ + S Sbjct: 406 PGTSIGPLDVALASQARWTGATRAVDSLSIDNAT-WVMTDNSNVGALRLAS-----DGSV 459 Query: 628 DTNSRAPYDNFSTLTINSNVTGSGSFTFNVQLGDNDSPVDRLYVIGNASGDHGVQVINQG 687 D A F LT+N+ + GSG F NV S D+L V+ +ASG H + V N G Sbjct: 460 DFQQPAEAGRFKVLTVNT-LAGSGLFRMNVFADLGLS--DKLVVMQDASGQHRLWVRNSG 516 Query: 688 GLGALTTGDGINLITVDGETHSGSFTMSN---SVSAGAYEYFLYKIDDYRWNLQSNLINP 744 + + + L+ + + +FT++N V G Y Y L + +W+L P Sbjct: 517 S--EPASANTLLLVQTPLGSAA-TFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPP 573 Query: 745 GPGPEPEIEPE---------EIAYRPEVPGYIAAPWLNAFYGFTTLG-----------SL 784 P P P+ P+ E G + NA +G +L Sbjct: 574 
APKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNAL 633 Query: 785 HERRGS--AEGAAEGFNQDSWGRIRGQHNNFE--AGRFSYDSNIWFMQLGHDVYQAKNAA 840 +R G A G WGR Q + AGR +D + +LG D A A Sbjct: 634 SKRLGELRLNPDAGGA----WGRGFAQRQQLDNRAGR-RFDQKVAGFELGAD--HAVAVA 686 Query: 841 GTQVTGGMMITLGKQNSDTRDRARAINPDLSIDTGKIKTEAYGFGGYYTLMTEEGGYLDI 900 G + G + G D G T++ GGY T + + G YLD Sbjct: 687 GGRWHLGGL--AGYTRGDRG----------FTGDGGGHTDSVHVGGYATYIADSGFYLDA 734 Query: 901 VSQATLYRNNYE------SQHNTKHNGYGVVMSAEVGQPYPLAAGWVVEPQGQLKYQYLH 954 +A+ N+++ K+ +GV S E G+ + A GW +EPQ +L Sbjct: 735 TLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAEL--AVFR 792 Query: 955 LSPKNF---NDAISEIGGTDYSVGQ--VRAGLRLFSDASEKRDIKPYLTTDVLHQLGRNP 1009 + N G +G+ + G R+ + + R ++PY+ VL + Sbjct: 793 AGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRI--ELAGGRQVQPYIKASVLQEFDGAG 850 Query: 1010 QVTVATVDIRPDFTKTFWQGGAGVTAKVNSQVDLYADAKYQK 1051 V + R + T + G G+ A + LYA +Y K Sbjct: 851 TVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSK 892 Score = 33.1 bits (75), Expect = 0.007 Identities = 29/149 (19%), Positives = 49/149 (32%), Gaps = 23/149 (15%) Query: 92 GGTLGLTGSTIKTENSVAFGVL--NDKGTVNLQGGTITTKGQTAYGVYSSGLGSNTDIHS 149 GG +G+TIK A G+L N + + G++T+ GQ + Sbjct: 59 GGVRTASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQ---------------LSD 103 Query: 150 SEITTSYSLTHAIYGAGGTGLTLNNTTLNTSGSGSYGIYLNGPGGSLTGADNTINSTHAT 209 I G +T +Y+ G + AD+T+ Sbjct: 104 DGIRRFLGTVTVKAGKLVADHATLANVGDTWDDDGIALYVAGEQAQASIADSTL------ 157 Query: 210 NGAGIYISSGGSNATLDNTTLNITKGAVG 238 GAG G+N T+ + + +G Sbjct: 158 QGAGGVQIERGANVTVQRSAIVDGGLHIG 186
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 254 bits (649), Expect = 1e-81 Identities = 188/433 (43%), Positives = 245/433 (56%), Gaps = 43/433 (9%) Query: 38 ISSHQSWKENTIHNKNTNLTYSF-SRAYTLWDYDRTFQQNAYVSLFNPAQIHQAKIAMQS 96 + SW + K+ NLT+ F ++ D F + FN QI QAK+++QS Sbjct: 58 TRENVSWNGTNVFGKSANLTFKFLQSVSSIPSGDTGFVK------FNAEQIEQAKLSLQS 111 Query: 97 WADVANISFTEASADSSANILFLNFQR-PGN-----VAGYAYHPNPGSFS-PIWINYSFS 149 W+DVAN++FTE + + SANI F N+ R YAY+P + W NY+ S Sbjct: 112 WSDVANLTFTEVTGNKSANITFGNYTRDASGNLDYGTQAYAYYPGNYQGAGSSWYNYNQS 171 Query: 150 DNQHPSRLNYGGGVLTHEIGHALGLGHS---HAPHGY-----------TQQMSVMSYLSE 195 + ++P YG THEIGHALGL H +A G + Q S+MSY E Sbjct: 172 NIRNPGSEEYGRQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGE 231 Query: 196 QDSGANYGQHYLSTPQMYDIAAIQYLYGANLHTRTGDTVYGFNSTSYRDHFTATHASDAL 255 ++GA+Y HY P + DIAAIQ LYGAN+ TRTGD+VYGFNS + RD +TAT +S AL Sbjct: 232 NETGADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKAL 291 Query: 256 IFCVWDAGGNDTFDFSGYKQNQMINLNELCFSDVGGLKGNVSIAADVTIENAIGGSGHDD 315 IF VWDAGG DTFDFSGY NQ INLNE FSDVGGLKGNVSIA VTIENAIGGSG+D Sbjct: 292 IFSVWDAGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDI 351 Query: 316 IIGNHTNNILTGN---------GGSDQLWGNGGNNTFRYASARDSMTTSPDTIHDFKSGR 366 ++GN +NIL G G+D L+G G +TF Y S +DS + D I DF+ G Sbjct: 352 LVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGI 411 Query: 367 DKIDLSQLMPSTDRVIFVDRLSFNGQ-TEMGQQYNEVADITYLMIDFDAQVSECDMMIKF 425 DKIDLS + + F G+ E+ Q++ IT L + S D +++ Sbjct: 412 DKIDLSAFRNEGQ--LSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGH-SSVDFLVRI 468 Query: 426 TGRHHFTANDFIL 438 G+ +D I+ Sbjct: 469 VGQ--AAQSDIIV 479
>HTHFIS#FIS bacterial regulatory protein HTH signature.
          Length = 484

 Score = 64.1 bits (156), Expect = 1e-12
 Identities = 26/125 (20%), Positives = 58/125 (46%), Gaps = 17/125 (13%)

Query: 735 MADQLVLVLEDEPDVRQTLCEQLHQLGYLTLETGDSRQALALMADVPDISIVISDLMLPG 794
           M    +LV +D+  +R  L + L + GY    T ++      +A      +V++D+++P
Sbjct: 1   MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPD 59

Query: 795 DMTGAEVLQQARSVYPHLKLLLISGQD---------LRRSKNFMPEVELLRKPFNQQQLV 845
                ++L + +   P L +L++S Q+          + + +++P      KPF+  +L+
Sbjct: 60  -ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLP------KPFDLTELI 112

Query: 846 QALQR 850
             + R
Sbjct: 113 GIIGR 117
>INTIMIN#Intimin signature. Length = 939 Score = 452 bits (1164), Expect = e-138 Identities = 266/882 (30%), Positives = 402/882 (45%), Gaps = 77/882 (8%) Query: 91 YTLGPGDSIQSIAKKYNITVDELKKLNAYRTFSKP-FASLTTGDEIEVPRKESSF----- 144 YTL G+++ ++K +I + + LN + S+ G +I +P K+ F Sbjct: 65 YTLKTGETVADLSKSQDINLSTIWSLNKHLYSSESEMMKAEPGQQIILPLKKLPFEYSAL 124 Query: 145 ---------------------FSNNPNENNKKDVDDLLARNAMGAG-----KLLSNDNTS 178 +P+ DD A +L S Sbjct: 125 PLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNG 184 Query: 179 DAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSLL 238 D A + A N+ ++ Q WL +GTA V L ++F D S+LD L+P DSE L Sbjct: 185 DYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNF--DGSSLDFLLPFYDSEKMLA 242 Query: 239 FTQLGVRNKDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLKF 298 F Q+G R DSR T N+GAG R + + M G N F D D +G N R+G+G E DY K Sbjct: 243 FGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKS 302 Query: 299 SANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALFG 358 S N YF ++GWH+S + YDERPA+GFDIR YLP+YP LG KLMYE+Y GD VALF Sbjct: 303 SVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFN 362 Query: 359 KDDRQKDPHAVTLGVNYTPVPLVTIGAEHREGKGNNNNTSVNVQLNYRMGQPWNDQIDQS 418 D Q +P A T+GVNYTP+PLVT+G ++R G GN N+ ++Q Y+ +PW+ QI+ Sbjct: 363 SDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQ 422 Query: 419 AVAANRTLAGSRYDLVERNNNIVLDYKKQELIHLVLPDRISGSGGGAITLTAQVRAKYGF 478 V RTL+GSRYDLV+RNNNI+L+YKKQ+++ L +P I+G+ + V++KYG Sbjct: 423 YVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKYGL 482 Query: 479 SRIEWDATPLENAGG---STSPLTQSSLSVTLPFYQHILRTSNTHTISAVAYDAQGNASN 535 RI WD + L + GG + + LP Y SN + ++A AYD GN+SN Sbjct: 483 DRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQ--GGSNVYKVTARAYDRNGNSSN 540 Query: 536 RAVTSIEVTRPETMV----ISHLATTVDNATANGIAANTVQATVTDGDGQPIIGQIINFA 591 + +I V +V ++ +A A+G A T ATV + Sbjct: 541 NVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNI 600 Query: 592 VNTQATLSTTEARTGANGIASTTLTHTVAGVSAVSATLGSSSRSVNTTFVADESTAEITA 651 V+ A LS A T 
+G A+ TL G VSA + ++N V + + Sbjct: 601 VSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASI 660 Query: 652 ANLTVTTNDSVANGSDTNAVRAKVTDAYTNAVANQSVIFSASNGATVIDQTVITNAEGIA 711 + +VANG D KV V+NQ V F+ + + + T T+ G A Sbjct: 661 TEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVTFT-TTLGKLSNSTEKTDTNGYA 718 Query: 712 DSTLTNTTAGVSAVTATLGSQS---QQVDTTFKPGSTAAISLVKLADRAVADGIDQNEIQ 768 TLT+TT G S V+A + + + + F T +++ V + +Q Sbjct: 719 KVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQ 778 Query: 769 -----VVLRDGTGN----AVPNVPMSIQADNGAIVVASTPNTGVDGTIN----ATFTNLR 815 + G G + S+ A +G + + T + + AT+T Sbjct: 779 YGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIAT 838 Query: 816 AGESVVS------VTSPALVGMTMTMTFSADQRTAVVSTLAAIDNNAKADG-TDTNVVRA 868 +V + A+ + + + A K + + + + Sbjct: 839 PNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898 Query: 869 WVVDANGNSVPGVSVTFDAGNGAVLAQNPV----VTDRNGYA 906 WV ++ GV+ T+D ++ QNP+ ++ N YA Sbjct: 899 WVQQTAQDAKSGVASTYD-----LVKQNPLNNIKASESNAYA 935 Score = 79.0 bits (194), Expect = 4e-16 Identities = 73/392 (18%), Positives = 117/392 (29%), Gaps = 21/392 (5%) Query: 1904 VAGAVATITLTTPVNGAVADGANSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIG 1963 V V T A ADG + + A V G A A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 1964 TTGADGIATATLTNTVAGTSNVVATIGSITDNIDT---VFVAGAVATITLTTPVNGAVAD 2020 T G AT TL + G V A +T ++ +FV A+IT Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 2021 GANSNSVQAVVSDSEGNPVTGATVVFSSSNATAQITTVIGTTGADGIATATLTNTVAGTS 2080 V PV+ V F+++ +T T +G A TLT+T G S Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGKS 730 Query: 2081 NVVATIDTVNANI---DTTFVPGAVATITLTTPVDGAVADGANSNSVQAVVTDSGGNPVT 2137 V A + V ++ + F V V + +Q + + Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790 Query: 2138 GAAVVFSSANATAQITTVIGTTGADGIATATLTNTVAGTSNVVATVDTVNANIDTTFVAG 2197 G S+ A A + G T T++ + T+ T N Sbjct: 791 
GKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPN---------- 840 Query: 2198 AVATITLTTPVNGAVANGADSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIGTTG 2257 + I + ++ S N + + +AN + Sbjct: 841 --SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898 Query: 2258 ADGIATATLINTVAGTSNVVATIDTVNANIDT 2289 + VA T ++V N Sbjct: 899 WVQQTAQDAKSGVASTYDLVKQNPLNNIKASE 930 Score = 76.6 bits (188), Expect = 2e-15 Identities = 74/382 (19%), Positives = 116/382 (30%), Gaps = 21/382 (5%) Query: 2292 VAGAVATITLTTPVDGAVANGADSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIG 2351 V V T A A+G ++ + A V G A A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 2352 TTGADGIATATLTNTVAGTSNVVATIGSITNNIDTA---FVAGAVATITLTTPVNGAVAD 2408 T G AT TL + G V A +T+ ++ FV A+IT Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 2409 GANSNSVQAVVTDSGGNPVNGAAVVFSSANATAQITTVIGTTGADGIATATLTNTVAGTS 2468 V G PV+ V F++ +T T +G A TLT+T G S Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGKS 730 Query: 2469 NVVATVDTVNANI---DTTFVAGAVATITLTTPVNGAVADGADSNSVQAVVSDSGGNPVA 2525 V A V V ++ + F V V + +Q + + Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790 Query: 2526 GAAVVFSSANATAQVTTVIGTTGADGIATATLTNTVAGTSNVVATIGSITNNIDTAFVAG 2585 G S+ A A V G T T++ + TI + Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIAT------------ 838 Query: 2586 AVATITLTTPVNGAVADGADSNSVQAVVSDSEGNAVTGAAVVFSSANATAQITTVIGTTG 2645 + I D ++ S N + + +AN + Sbjct: 839 PNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898 Query: 2646 ADGIATATLTNTVAGTSNVVAT 2667 + VA T ++V Sbjct: 899 WVQQTAQDAKSGVASTYDLVKQ 920 Score = 75.1 bits (184), Expect = 7e-15 Identities = 76/393 (19%), Positives = 121/393 (30%), Gaps = 23/393 (5%) Query: 3356 VAGAVATITLTTPVNGAVADGANSNSVQAVVTDSGGNPVNGAAVVFSSANATAQITTVIG 3415 V V T A ADG + + A V +G N V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQAN-VPVSFNIVSGTAVLSANSA 612 Query: 3416 
TTGADGIATATLTNTVAGTSNVAATI----DTVNANIDTTFVAGAVATITLTTPVNGAVA 3471 T G AT TL + G V+A +NAN FV A+IT Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANA-VIFVDQTKASITEIKADKTTAV 671 Query: 3472 DGANSNSVQAVVSDSEGNPVNGATVVFSSINATAQITTVIGTTGVDGIATATLTNTVAGT 3531 V PV+ V F++ +T T +G A TLT+T G Sbjct: 672 ANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGK 729 Query: 3532 SNVVATIDTVNANI---DTTFVAGAVATITLTTLVNGAVADGANSNSVQAVVSDSGGNPV 3588 S V A + V ++ + F +V V + +Q Sbjct: 730 SLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQ---YGQVNLKA 786 Query: 3589 TGAAVVFSSANATAQITTVIGTTGVDGIATATLTNTVAGTSNVVATIGSITNNIDTAFVA 3648 +G ++ +A I +V ++G +T GT+ + N T +A Sbjct: 787 SGGNGKYTWRSANPAIASVDASSGQ-------VTLKEKGTTTISVISSD--NQTATYTIA 837 Query: 3649 GAVATITLTTPVNGAVADGANSNSVQAVVTDSGGNPVNGAAVVFSSANATAQITTVIGTT 3708 + I D N+ S N + + +AN + Sbjct: 838 TPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTII 897 Query: 3709 GADGIATATLTNTVAGTSNVIATIDTVNANIDT 3741 + VA T +++ N Sbjct: 898 SWVQQTAQDAKSGVASTYDLVKQNPLNNIKASE 930 Score = 74.7 bits (183), Expect = 1e-14 Identities = 84/395 (21%), Positives = 132/395 (33%), Gaps = 34/395 (8%) Query: 4907 VLLSVTSTQAGVHPITGTLVSN--NYTDTFGATFIANKNTAQLSTLMVVD-----NNALA 4959 +L + + V+ +T N ++ T N + + V D +A A Sbjct: 513 ILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKA 572 Query: 4960 DGVTRNQVRAHVVDSTGNSVADIAVTFTANHGAQLSHVTVLTDDNGDAVNTLTNSLVGVT 5019 DG A V + + A LS + T+ +G A TL + G Sbjct: 573 DGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQV 632 Query: 5020 VVTAKLGTAGTPLTVDTV-FTAGPLATLTLVTMVDNAFADNSATNTVQATLK-DATGNPI 5077 VV+AK + L + V F A++T + D A + + + T+K P+ Sbjct: 633 VVSAKTAEMTSALNANAVIFVDQTKASITEIK-ADKTTAVANGQDAITYTVKVMKGDKPV 691 Query: 5078 VGEVVAFAASNGATITATDGGVSNANGIVLATLTNGAAGVSTVTATIE---TLTATTETT 5134 + V F + G +T+ ++ NG TLT+ G S V+A + E Sbjct: 692 SNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVE 749 Query: 5135 FIAMKNLD-VTVGDTTFDGDAGFPTTGFVGAAFKVNSGGDNSLYDWSSSAPALVSV-SGE 5192 F +D + PT + + G N Y W S+ PA+ SV 
+ Sbjct: 750 FFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASS 809 Query: 5193 GVVTFNAVFPTGTPAITISATPKGGGSPLSYSFRVNQWFINNNGVALNRADAATYCANAG 5252 G VT TIS + N + N + DA C N G Sbjct: 810 GQVTLK-----EKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFG 864 Query: 5253 YTTVSSSQVTNAIVWGMGTRAMGNLWSEWGDFNNY 5287 SS N++ WG N Y Sbjct: 865 GKLPSSQNELE------------NVFKAWGAANKY 887 Score = 74.3 bits (182), Expect = 1e-14 Identities = 77/393 (19%), Positives = 127/393 (32%), Gaps = 24/393 (6%) Query: 2680 VAGAVATITLTTPVNGAVADGTDSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIG 2739 V V T A ADGT++ + A V G A A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 2740 TTGADGIATATLTNTVAGTSNVVATIGSITNNIDTAFVAGAVATITLTTLVNG----AVA 2795 T G AT TL + G V A +T+ ++ V T T + AVA Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 2796 NGADSNSVQAVVSDSGGNVVAGATVVFSSTNATAQVTTVIGTTGADGIATATLTNTVAGT 2855 NG D+ + V G V+ V F++T +T T +G A TLT+T G Sbjct: 673 NGQDAITYT-VKVMKGDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGK 729 Query: 2856 SNVVATIDTVNANI---DTTFVAGAVATITLSVLVNDATADGADTNQVDALVQDANGNAI 2912 S V A + V ++ + F +V T + + + Sbjct: 730 SLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGG 789 Query: 2913 TGAAVVFSSANG-ATILSSTMNTGVNGVASTLLTHTVAGTSNVVATIDTVNANIDTAFVA 2971 G S+ A++ +S+ + +T ++ + TI T N Sbjct: 790 NGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPN--------- 840 Query: 2972 GAVATITLTTPVNGAVANGADSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIGTT 3031 + I + ++ S N + + +AN + Sbjct: 841 ---SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTII 897 Query: 3032 GVDGIATATLTNTVAGTSNVVATVDTVNANIDT 3064 + VA T ++V N Sbjct: 898 SWVQQTAQDAKSGVASTYDLVKQNPLNNIKASE 930 Score = 74.3 bits (182), Expect = 1e-14 Identities = 76/388 (19%), Positives = 121/388 (31%), Gaps = 26/388 (6%) Query: 1035 VAGAVATITLTTLVNGAVADGANSNSVQAVVSDSGGNPVTGAAVVFSSANATAQITTVIG 1094 V V T A ADG + + A V +G V F+ + TA ++ Sbjct: 554 
VVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQA-NVPVSFNIVSGTAVLSANSA 612 Query: 1095 TTGVDGIATATLTNTVAGTSNVVATIGSITNNIDTA---FVAGAVATITLTTPVNGAVAD 1151 T G AT TL + G V A +T+ ++ FV A+IT Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 1152 GANSNSVQAVVTDSGGNPVNGAAVVFSSANATAQITTVIGTTGADGIATATLTNTVAGTS 1211 V G PV+ V F++ +T T +G A TLT+T G S Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPGKS 730 Query: 1212 NVVATVDTVNANI---DTTFVAGAVATITLTTPVNGAVADGADSNSVQAVVSDSGGNPVA 1268 V A V V ++ + F V V + +Q + + Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790 Query: 1269 GAAVVFSSANATAQVTTVIGTTGADGIATATLTNTVAGTSNVVATIGSITNNIDTAFVAG 1328 G S+ A A V G T T++ + TI + Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIAT------------ 838 Query: 1329 AVATITLSVPVNDATADGVDTNQVDALVQDANGNAITGAAVVFSSTNGADIIVPTMNTGV 1388 + I ++ D V+T + ++ N + + + N + Sbjct: 839 PNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYY-----KSS 893 Query: 1389 NGVASTLLTHTVAGTSNVVATVDTVNAN 1416 + S + S V +T D V N Sbjct: 894 QTIISWVQQTAQDAKSGVASTYDLVKQN 921 Score = 72.8 bits (178), Expect = 3e-14 Identities = 85/390 (21%), Positives = 133/390 (34%), Gaps = 31/390 (7%) Query: 2970 VAGAVATITLTTPVNGAVANGADSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIG 3029 V V T A A+G ++ + A V G A A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 3030 TTGVDGIATATLTNTVAGTSNVVATV----DTVNANIDTAFVAGAVATITLTTPV-NGAV 3084 T G AT TL + G V A +NAN FV A+IT AV Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANA-VIFVDQTKASITEIKADKTTAV 671 Query: 3085 ANGADSNSVQAVVSDSGGNVVAGATVVFSSTNTTAQVTTVIGTTGADGIATATLTNTVAG 3144 ANG D+ + V G V+ V F++T +T T +G A TLT+T G Sbjct: 672 ANGQDAITYT-VKVMKGDKPVSNQEVTFTTTLGKLSNSTE--KTDTNGYAKVTLTSTTPG 728 Query: 3145 TSNVVATVDTVNANI---DTTFVAGAVATITLSVLVNDATADGADTNQVDALVQDANGNA 3201 S V A V V ++ + F +V T + + + Sbjct: 729 KSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASG 788 Query: 3202 
ITGAAVVFSSANG-ADIIAPTMNTGVNGVASTLLTHTMAGTSNVIATIDTVNANIDTTFV 3260 G S+ A + A + + +T ++ + TI T N Sbjct: 789 GNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPN-------- 840 Query: 3261 AGAVATITLSVPVNDATADGADTNQVDALVQDANGNAITGAAVVFSSANGATILSSTMNT 3320 + I ++ D +T + ++ N + + +AN S+ Sbjct: 841 ----SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSS--- 893 Query: 3321 GVNGVASTLLTHTQSGVSNVVATIDTVNAN 3350 + S + Q S V +T D V N Sbjct: 894 --QTIISWVQQTAQDAKSGVASTYDLVKQN 921 Score = 72.4 bits (177), Expect = 4e-14 Identities = 79/387 (20%), Positives = 128/387 (33%), Gaps = 25/387 (6%) Query: 1229 VAGAVATITLTTPVNGAVADGADSNSVQAVVSDSGGNPVAGAAVVFSSANATAQVTTVIG 1288 V V T A ADG ++ + A V +G A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQ-ANVPVSFNIVSGTAVLSANSA 612 Query: 1289 TTGADGIATATLTNTVAGTSNVVATIGSITNNIDTA---FVAGAVATITLSVPVNDATAD 1345 T G AT TL + G V A +T+ ++ FV A+IT + + TA Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASIT-EIKADKTTAV 671 Query: 1346 GVDTNQVDALVQDANGNAITGAAVVFSSTNGADIIVPTMNTGVNGVASTLLTHTVAGTSN 1405 + + V+ G+ V +T + T T NG A LT T G S Sbjct: 672 ANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSL 731 Query: 1406 VVATVDTVNANI---DTAFVPGAVATITLTTPVNGAVADGANSNSVQAVVSDSEGNAVAG 1462 V A V V ++ + F V V + +Q + + + G Sbjct: 732 VSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNG 791 Query: 1463 AAVVFSSANATAQITTVIGTTGADGIATATLTNTVAGTSNVVATIDTVNANIDTAFVPGA 1522 S+ A A + G T T++ + TI T N Sbjct: 792 KYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPN----------- 840 Query: 1523 VATITLSVLVNDATADGADTNQVDALVQDANGNAITGAAVVFSSANGADIIAPTMNTGVN 1582 + I ++ D +T + ++ N + + +AN + Sbjct: 841 -SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYY-----KSSQ 894 Query: 1583 GVASTLLTHTQSGVSNVVATIDTVNAN 1609 + S + Q S V +T D V N Sbjct: 895 TIISWVQQTAQDAKSGVASTYDLVKQN 921 Score = 72.0 bits (176), Expect = 5e-14 Identities = 74/380 (19%), Positives = 123/380 (32%), Gaps = 19/380 (5%) Query: 1615 
VAGAVAAITLTTPVDGAVADGTDSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIG 1674 V V T A ADGT++ + A V G A A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 1675 TTGADGIATATLTNTVAGTSNVAATIGSITDNIDT---VFVAGAVATITLSVPVNDATAD 1731 T G AT TL + G V+A +T ++ +FV A+IT + + TA Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASIT-EIKADKTTAV 671 Query: 1732 GADTNQVDALVQDVNGNAITGAAVVFSSANGATILSSTVNTGADGIASTTLTHTQSGVSN 1791 + + V+ + G+ V + + +ST T +G A TLT T G S Sbjct: 672 ANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSL 731 Query: 1792 VVATIDTVNANI---DTTFVAGAVATITLSVLVNDATADGADTNQVDALVQDANGNAITG 1848 V A + V ++ + F +V T + + + G Sbjct: 732 VSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNG 791 Query: 1849 AAVVFSSANGATIIVPTMNTGANGVASTLLTHTVAGTSNVVATIGSITNNIDTAFVAGAV 1908 S+ + +S +T GT+ + N T +A Sbjct: 792 KYTWRSANPAIASV---------DASSGQVTLKEKGTTTISVISSD--NQTATYTIATPN 840 Query: 1909 ATITLTTPVNGAVADGANSNSVQAVVSDSEGNAVAGAAVVFSSANATAQITTVIGTTGAD 1968 + I D N+ S N + + +AN + Sbjct: 841 SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTIISWV 900 Query: 1969 GIATATLTNTVAGTSNVVAT 1988 + VA T ++V Sbjct: 901 QQTAQDAKSGVASTYDLVKQ 920 Score = 59.7 bits (144), Expect = 3e-10 Identities = 82/360 (22%), Positives = 134/360 (37%), Gaps = 22/360 (6%) Query: 776 GNAVPNVPMSIQADNG-AIVVASTPNTGVDGTINATFTNLRAGESVVSVTSPALVGMTMT 834 G A NVP+S +G A++ A++ NT G T + + G+ VVS + MT Sbjct: 588 GVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKT---AEMTSA 644 Query: 835 MTFSA----DQRTAVVSTLAAIDNNAKADGTDTNVVRAWVVDANGNSVPGVSVTFDAGNG 890 + +A DQ A ++ + A A A+G D + V V VTF Sbjct: 645 LNANAVIFVDQTKASITEIKADKTTAVANGQDA-ITYTVKVMKGDKPVSNQEVTFTT-TL 702 Query: 891 AVLAQNPVVTDRNGYAENTLTNLAIG--TTTVKATTVTDPVGQTVNTHFVAGAVDTITLT 948 L+ + TD NGYA+ TLT+ G + + + V V F +D + Sbjct: 703 GKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIE 762 Query: 949 VLVNGAVANGVNTNSVQAVVSDSGGNPVNGAAVVFSSANATAQITTVIGTTGVDGIATAT 1008 ++ G V + T +Q + + NG S+ A A + G + T T Sbjct: 
763 IVGTG-VKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTT 821 Query: 1009 LTNTVAGTSNVVATIDTVNANIDTTFVAGAVATITLTTLVNGAVADGANSNSVQAVVSDS 1068 ++ + TI T N+ I V +T VN G S Q + + Sbjct: 822 ISVISSDNQTATYTIATPNSLI----VPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENV 877 Query: 1069 GGNPVT-GAAVVFSSANATAQITTVIGTTGVDGIATATLTNTVAGTSNVVATIGSITNNI 1127 GAA + ++ I + + T D + T + N + I + +N Sbjct: 878 ---FKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLV-KQNPLNNIKASESNA 933 Score = 58.9 bits (142), Expect = 6e-10 Identities = 61/380 (16%), Positives = 125/380 (32%), Gaps = 25/380 (6%) Query: 4037 VAGKAASIELTMTKDNAVANNIDTNEVQVLVTDADGNAINGAVVNLTSNSGMNITPNSVT 4096 V + + T K +A A+ + V N V + ++ NS Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSAN 613 Query: 4097 TGSDGTATATLTHTLAGSLPINARIDQVSKTINATFIADVSTAQIIASDMFIIVNDQVAN 4156 T G AT TL G + ++A+ +++ +NA + V + +++ VAN Sbjct: 614 TNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVAN 673 Query: 4157 GQAVNAVQARVTDSYGNPIQGQLVEFVLSNTGTIQYKLEETSVEGGVMVTFTNTLAGITN 4216 GQ +V P+ Q V F + G + E+T G VT T+T G + Sbjct: 674 GQDAITYTVKVMKG-DKPVSNQEVTFT-TTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSL 731 Query: 4217 VTATVV-SSRSSQNVDTTFIADVTTAHIAESDLMVIVDNAVANNSEKNEVHARVTDAKGN 4275 V+A V + + + F + + + IV V + + K + Sbjct: 732 VSARVSDVAVDVKAPEVEFFTTL----TIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKAS 787 Query: 4276 VLSGQTVIFTSGNGAAITTVNGISDGDGLTKATLTHTLAGTSVVTARVGNQVQSKDTTFI 4335 +G+ ++ A + +T GT+ ++ + ++ T+ Sbjct: 788 GGNGKYTWRSANPAIASVDAS---------SGQVTLKEKGTTTISVISSD---NQTATYT 835 Query: 4336 ADRTTATIRASDLTITRSNALADGVATNAARVIVTDAYGNPVPSMLVS------YTSENG 4389 + I + N + ++ + V + Y S Sbjct: 836 IATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQT 895 Query: 4390 ATLTPTLGSTDSSGMLSTTF 4409 + D+ +++T+ Sbjct: 896 IISWVQQTAQDAKSGVASTY 915 Score = 58.2 bits (140), Expect = 8e-10 Identities = 65/375 (17%), Positives = 122/375 (32%), Gaps = 32/375 (8%) Query: 4216 NVTATVVSSRSSQNVDTTFIADVTTAHIAESDLMVIVDNAVANNSEKNEVHARVTDAKGN 4275 NV T+ + Q VD + D T +A A+ +E A V Sbjct: 541 
NVLLTITVLSNGQVVDQVGVTDFTADK----------TSAKADGTEAITYTATVKKNGVA 590 Query: 4276 VLSGQTVIFTSGNGAAITTVNGISDGDGLTKATLTHTLAGTSVVTARVGNQVQSKDTTF- 4334 + A ++ + ++G G TL G VV+A+ + + Sbjct: 591 QANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAV 650 Query: 4335 -IADRTTATIRASDLTITRSNALADGVATNAARVIVTDAYGNPVPSMLVSYTSENGATLT 4393 D+T A+I +++ ++ A+A+G V V PV + V++T+ L+ Sbjct: 651 IFVDQTKASI--TEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVTFTT-TLGKLS 706 Query: 4394 PTLGSTDSSGMLSTTFTHTIAGISKVTATIVTMGISQAKDAVFIADRTTAHVSALTVEKN 4453 + TD++G T T T G S V+A + + + V + Sbjct: 707 NSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEV-----EFFTTLTIDDGNI 761 Query: 4454 DSLANNSDRNIVQAHIQDAHGN-VITGMNVNFSATENVTLAANMVTTNAQGYAENTLRHN 4512 + + + +Q N +G N ++ A++ ++ Q + Sbjct: 762 EIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTT 821 Query: 4513 APVTSAVTATVATDLVGLTEDVRFVAGAGARIELFRLNDGAVADGIQTNRVEARVYDVSD 4572 V S+ T +A + I D + T + S Sbjct: 822 ISVISSDNQTATYT----------IATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQ 871 Query: 4573 NLVPNSNVVFSADNG 4587 N + N + A N Sbjct: 872 NELENVFKAWGAANK 886 Score = 58.2 bits (140), Expect = 1e-09 Identities = 69/345 (20%), Positives = 120/345 (34%), Gaps = 26/345 (7%) Query: 4646 FLITHDNAVANGVTENRVLLQLLDANDNKVSGVEVNFTATNG-ASINA-SAITDTNGLAI 4703 F +A A+G + N + V V+F +G A ++A SA T+ +G A Sbjct: 563 FTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKAT 621 Query: 4704 GVLTNTLSGPSDVTVTLVTPGGTESLTVTPQFIADINTARIANGDFVIIDDGAVANSVDA 4763 L + P V V+ T T +L D A I AVAN DA Sbjct: 622 VTLKSD--KPGQVVVSAKTAEMTSALNANAVIFVDQTKASITE--IKADKTTAVANGQDA 677 Query: 4764 NEVRARVTDNQGNAIAGYSVTFASQNGATITTSGITGVDGWASAKLTHTKAGESGILARI 4823 +V ++ VTF + G ++ T +G+A LT T G+S + AR+ Sbjct: 678 ITYTVKVMKG-DKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARV 736 Query: 4824 SRPGSMVQVLTPYFIADVSTATLQLFNFNPIPIIADGVMQFFVLGRVFDANQNPVGGQQV 4883 S V+ F ++ + ++ V G++ Sbjct: 737 SDVAVDVKAPEVEFFTTLTIDDGNI-----------EIVGTGVKGKLPTVWLQYGQVNLK 785 Query: 4884 
AFSATNEVTLTESNGSISTPEGSVLLSVTSTQAGVHPITGTLVSNNYTDTFGATFIANKN 4943 A + T +N +I++ + S VT + G I+ N AT+ Sbjct: 786 ASGGNGKYTWRSANPAIASVDASS-GQVTLKEKGTTTISVISSDNQT-----ATYTIATP 839 Query: 4944 TAQLSTLMVVDNNALADGVTRNQVRAHVVDSTGNSVADIAVTFTA 4988 + + + D V + + S+ N + ++ + A Sbjct: 840 NSLI-VPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGA 883 Score = 57.8 bits (139), Expect = 1e-09 Identities = 48/189 (25%), Positives = 69/189 (36%), Gaps = 7/189 (3%) Query: 3744 VAGAVATITLTTPVNGAVADGADSNSVQAVVSDSEGNAVTGAAVVFSSANATAQITTVIG 3803 V V T A ADG ++ + A V G A V F+ + TA ++ Sbjct: 554 VVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSA 612 Query: 3804 TTGADGIATATLTNTVAGTSNVVATI----DTVNANIDTAFVAGELENIVVSIINNNALA 3859 T G AT TL + G V A +NAN + + A+A Sbjct: 613 NTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 3860 NGADTNIVEAFVTDRFGNGVANQSLMFGTNGASIVGSSTVTTNIDGRVRVSATHTVAGSS 3919 NG D V V+NQ + F T + +ST T+ +G +V+ T T G S Sbjct: 673 NGQDAITYTVKVMKG-DKPVSNQEVTFTTTL-GKLSNSTEKTDTNGYAKVTLTSTTPGKS 730 Query: 3920 NTVFAISGA 3928 +S Sbjct: 731 LVSARVSDV 739 Score = 57.8 bits (139), Expect = 1e-09 Identities = 68/382 (17%), Positives = 125/382 (32%), Gaps = 32/382 (8%) Query: 4325 NQVQSKDTTFIADRTTATIRASDLTITRSNALADGVATNAARVIVTDAYGNPVPSMLVSY 4384 N V T + + +D T +++A ADG V + Sbjct: 540 NNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFN 599 Query: 4385 TSENGATLTPTLGSTDSSGMLSTTFTHTIAGISKVTATIVTMGISQAKDAVFIADRTTAH 4444 A L+ +T+ SG + T G V+A M + +AV D+T A Sbjct: 600 IVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKAS 659 Query: 4445 VSALTVEKNDSLANNSDRNIVQAHIQDAHGNVITGMNVNFSATENVTLAANMVTTNAQGY 4504 ++ + +K ++AN D + ++ V F+ T L+ + T+ GY Sbjct: 660 ITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSNQEVTFT-TTLGKLSNSTEKTDTNGY 717 Query: 4505 AENTLRHNAPVTSAVTATVATDLVGLTEDVRFVAGAGARIELFR--LNDGAVADGIQTNR 4562 A+ TL P S V+A V+ V + +E F D + + T Sbjct: 718 AKVTLTSTTPGKSLVSARVSDVAVDV---------KAPEVEFFTTLTIDDGNIEIVGTG- 767 Query: 4563 
VEARVYDVSDNLVPNSNVVFSADNGGQLVQNDVQTDALGSAYVTVSNINTGVTKVSVTAD 4622 +P + + N N T + + + ++G VT Sbjct: 768 --------VKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ----VTLK 815 Query: 4623 GVSASTTTTFIADKDTVTLRAD------LFLITHDNAVANGVTENRVLLQLLDANDNKVS 4676 +T + +D T T + ++ + V + L ++ N++ Sbjct: 816 EKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELE 875 Query: 4677 GVEVNFTATNGASINASAITDT 4698 V + A N S+ T Sbjct: 876 NVFKAWGAANKYEYYKSSQTII 897 Score = 54.3 bits (130), Expect = 1e-08 Identities = 95/518 (18%), Positives = 171/518 (33%), Gaps = 62/518 (11%) Query: 4304 LTKATLTHTLAGTSVVTARVGNQVQSKDTTFIADRTTATIRASDLTITRSNALADGVATN 4363 + + H + GT T ++ V+SK + +R+ I S + + Sbjct: 453 ILSLNIPHDINGTERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQS------ 506 Query: 4364 AARVIVTDAYGNPVPSMLVSYTSENGATLTPTLGSTDSSGMLSTTFTHTIAGISKVTATI 4423 ++L +Y T + D +G S TI T+ Sbjct: 507 ----------AQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTI--------TV 548 Query: 4424 VTMGISQAKDAVFIADRTTAHVSALTVEKNDSLANNSDRNIVQAHIQDAHGNVITGMNVN 4483 ++ G Q D V + D T SA A+ ++ A ++ +G + V+ Sbjct: 549 LSNG--QVVDQVGVTDFTADKTSA--------KADGTEAITYTATVKK-NGVAQANVPVS 597 Query: 4484 FSATENV-TLAANMVTTNAQGYAENTLRHNAPVTSAVTATVA--TDLVGLTEDVRFVAGA 4540 F+ L+AN TN G A TL+ + P V+A A T + + Sbjct: 598 FNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTK 657 Query: 4541 GARIELFRLNDGAVADGIQTNRVEARVYDVSDNLVPNSNVVFSADNGGQLVQNDVQTDAL 4600 + E+ AVA+G +V D V N V F+ G+L + +TD Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSNQEVTFTTT-LGKLSNSTEKTDTN 715 Query: 4601 GSAYVTVSNINTGVTKVSVTADGVSASTTTTFIADKDTVTLRADLFLITHDNAVANGVTE 4660 G A VT+++ G + VS V+ + T+T+ + V GV Sbjct: 716 GYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDG-----NIEIVGTGVKG 770 Query: 4661 NRVLLQLLDANDN-KVSGVEVNFTATNGASINASAITDTNGLAIGVLTNTLSGPSDVTVT 4719 + L N K SG +T ++ A A D + + TL T++ Sbjct: 771 KLPTVWLQYGQVNLKASGGNGKYTWR--SANPAIASVDASSGQV-----TLKEKGTTTIS 823 Query: 4720 LVTPGGTESLTVTPQFIADINTARIANGDFVIIDDGAVANSVDANEVRARVTDNQGNAIA 4779 V ++ T T + ++ ++V+ + + N + Sbjct: 824 
-VISSDNQTATYTIATPNSLIVPNMSK-------RVTYNDAVNTCKNFGGKLPSSQNELE 875 Query: 4780 GYSVTFASQNGATITTSGITGVDGWASAKLTHTKAGES 4817 + + N + W K+G + Sbjct: 876 NVFKAWGAANKYE-YYKSSQTIISWVQQTAQDAKSGVA 912
>PF06580#Sensor histidine kinase Length = 349 Score = 225 bits (574), Expect = 1e-70 Identities = 64/213 (30%), Positives = 115/213 (53%), Gaps = 2/213 (0%) Query: 345 LGEGIAHLLSAQILAGEFEQQKQLLAQSEIKLLHAQVNPHFLFNALNTLSVVIRRNPDHA 404 L G + + + + + ++++ L AQ+NPHF+FNALN + +I +P A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 405 RNLVLSLSTFFRKNLKRS-HDVVTLSDEIEHVNAYLEIEKARFADRLTVTVSLPNELMEA 463 R ++ SLS R +L+ S V+L+DE+ V++YL++ +F DRL + +M+ Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 464 RLPAFSLQPVVENAIKHGISQMFSNGRVTLRGKLDDNTLVLEVEDNAGL-YQPQPDGDGL 522 ++P +Q +VEN IKHGI+Q+ G++ L+G D+ T+ LEVE+ L + + G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 523 GMSLVDRRIKARSGNEYGITVVSEAEVFTRIII 555 G+ V R++ G E I + + +++ Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL 346
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.8 bits (80), Expect = 3e-04 Identities = 7/47 (14%), Positives = 16/47 (34%) Query: 215 DSLTTAVETFECAVLTQRQRLYGNDKSRIAASLGLSLRALTYKLAKY 261 + E ++ ++ + A LGL+ L K+ + Sbjct: 427 GLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 36.6 bits (84), Expect = 2e-04 Identities = 14/32 (43%), Positives = 18/32 (56%) Query: 112 NPLHKRRFAQQILKRFDSASSSFSQRADEAQR 143 NP R Q+I + + S+FS RADEA R Sbjct: 178 NPTDTRSIRQRISDNYSNLGSNFSDRADEANR 209
>cloacin#Cloacin signature. Length = 551 Score = 32.0 bits (72), Expect = 0.021 Identities = 23/85 (27%), Positives = 37/85 (43%), Gaps = 3/85 (3%) Query: 945 RDYDAMGRRLWQSAGSDAPTVAADLLPRQG--DIWRKFSFDTAGELSMATDFIRGEQQYR 1002 D A G R+WQ AG A D+ +Q D K D LS A + + ++ + Sbjct: 380 HDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKK 439 Query: 1003 YDAEGRLTDSRERHQLSVAEDFAYD 1027 AE L D + + + +D+ +D Sbjct: 440 RSAENNLNDEKNKPRKGF-KDYGHD 463
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 347 bits (890), Expect = e-107 Identities = 162/367 (44%), Positives = 215/367 (58%), Gaps = 23/367 (6%) Query: 10 VAPLSLPKGGGAITGMGDSLGPIGPSGMATLTLPLPISAGRGYAPSLTLSYSSGSGNGPF 69 + P LPKGG A L GP G+A++TLPLPISA RG+AP+L L YSSG GNGPF Sbjct: 15 ITPPFLPKGGKA-------LSQSGPDGLASITLPLPISAERGFAPALALHYSSGGGNGPF 67 Query: 70 GLGWQLGTMAIRRRTNAQVPRYDEYDEFLAPNGEVMVVAADPQGNIERTEQSLNG----- 124 G+GW TM+I R T+ VP+Y++ DEFL P+GEV+V G Sbjct: 68 GVGWSCATMSIARSTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDAPNPVTCFAYGDVSFP 127 Query: 125 EQFSVIRYLPRIEGNFHRIEYWRPRTNNSQAPFWLVHSSDGQKHGLGYSASARIADPLHP 184 + ++V RY PR E +F+R+EYW +N FWL+H S+G H LG +A+AR++DP Sbjct: 128 QSYTVTRYQPRTESSFYRLEYWVGNSNGDD--FWLLHDSNGILHLLGKTAAARLSDPQAA 185 Query: 185 EHIAEWLLEESVSLSGEHICYQYQAEDEQDIDESEKQNHPAASAQRYLSTVVYGNREVAH 244 H A+WL+EESV+ +GEHI Y Y AE+ ++D + + SA RYLS V YGN A Sbjct: 186 SHTAQWLVEESVTPAGEHIYYSYLAENGDNVDLNGNEAGRDRSAMRYLSKVQYGNATPAA 245 Query: 245 ELYCLTQRPAPTSWLFSLIFDHGEYSNIAEQVPVIIKGKSWNFRQDAFSRFSCGFEVRTR 304 +LY T WLF+L+FD+GE + P SW RQD FS ++ GFE+R Sbjct: 246 DLYLWTSATPAVQWLFTLVFDYGERGVDPQVPPAFTAQNSWLARQDPFSLYNYGFEIRLH 305 Query: 305 RLCQQVLMYHNLSALKGDEPDAQATLVSRLRLHYQHDAYATQLVGCQQLAHEPDGTKRS- 363 RLC+QVLM+H+ DE TLVSRL L Y + TQL + LA+E DG +R+ Sbjct: 306 RLCRQVLMFHHFP----DELGEADTLVSRLLLEYDENPILTQLCAARTLAYEGDGYRRAP 361 Query: 364 ----LPP 366 +PP Sbjct: 362 VNNMMPP 368
>SALSPVAPROT#Salmonella virulence plasmid 28.1kDa A protein signature. Length = 255 Score = 32.1 bits (72), Expect = 0.007 Identities = 35/144 (24%), Positives = 69/144 (47%), Gaps = 36/144 (25%) Query: 118 QRRPDLQDLVLNNSNMNQEVSSL--------EILLNVLQTKAPLDELTKDTEAHVNDVSF 169 +RRPDL L++ N +NQ++ +L ++ L++L T L+++ +++ S Sbjct: 110 ERRPDLATLMVVNDAINQQIPTLLPYHFPHDQVELSLLNTDVSLEDI-------ISESSI 162 Query: 170 TLPYDDNLTVINAVLQDKSTSLREIAALLA--------ENNDPWANPITPALVQEQLGLN 221 P+ + N++ D S E+A+ L+ E ++ A +T + Q LGL Sbjct: 163 DWPW----FLSNSLTGDNSNYAMELASRLSPEQQTLPTEPDNSTATDLT-SFYQTNLGLK 217 Query: 222 PASYELIDIKSPLD--ESYAKRLA 243 A Y +P + ++A++LA Sbjct: 218 TADY------TPFEALNTFARQLA 235
>PF05860#haemagglutination activity domain. Length = 117 Score = 87.2 bits (216), Expect = 1e-22 Identities = 23/141 (16%), Positives = 46/141 (32%), Gaps = 24/141 (17%) Query: 68 AAIVADGSAPGNQQPTIISSANGTPQVNIQTPSSGGVSRNAYRQFDVDNRGVILNNGRGV 127 A I D + P N + I++ T + T + + + +++F V G N Sbjct: 1 AQITPDTTLPIN---SNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFN---- 52 Query: 128 NQTQIAGLVDGNPWLARGEASVILNEVNSRDPSQLNGYIEVAGRKAQVVIANPAGITCEG 187 I++ V S ++G I A + + NP GI Sbjct: 53 ---------------NPTNIQNIISRVTGGSVSNIDGLIRANAT-ANLFLINPNGIIFGQ 96 Query: 188 CGFINANRATLTTGQAQLNNG 208 ++ + + + +L Sbjct: 97 NARLDIGGSFVGSTANRLKFA 117
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 35.9 bits (82), Expect = 0.001 Identities = 40/191 (20%), Positives = 65/191 (34%), Gaps = 1/191 (0%) Query: 544 ANGSIGPIFDKEKEQNRLKEVQLIGEIGGQALDIASTQGKIIATHAANDKMKAVKPEDIV 603 A+ ++GP + + + ++G Q K I + A + + E Sbjct: 100 ADAALGPAKNLAPLDVINRSLTIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGE 159 Query: 604 AAEKQWEKAHPGKAATAEDINQQIYQTAYNQAFNESGFGTGGPVQRGMQAAIAAVQGLAG 663 A ++ P D + AYN + + AA A+++ A Sbjct: 160 QAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAA 219 Query: 664 GNMGAALTGASAPYLAGVIKQSTGDNPAANTMAHAVLGAVTAYASGNHALAGAAGAATAE 723 N A A A + AANT A G+V A A+G + A GAA+ Sbjct: 220 -NKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLA 278 Query: 724 LMAPTIISALG 734 I+ LG Sbjct: 279 QAISDAIAVLG 289
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 36.7 bits (84), Expect = 3e-04 Identities = 40/191 (20%), Positives = 64/191 (33%), Gaps = 1/191 (0%) Query: 140 ANGSIGPIFDKEKEQNRLKEVQLIGEIGGQALDIASTQGKIIATHAANDKMKAVKPEDIA 199 A+ ++GP + + + ++G Q K I + A + + E Sbjct: 100 ADAALGPAKNLAPLDVINRSLTIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGE 159 Query: 200 AAEKQWEKAHPGKAATAEDINQQIYQTAYNQAFNESGFGTGGPVQRGMQAATAAVQGLAG 259 A ++ P D + AYN + + AA A+++ A Sbjct: 160 QAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAA 219 Query: 260 GNLGAALTGASAPYLAGVIKQSTGDNPAANTMAHAVLGAVTAYARGNNALAGAAGAATAE 319 N A A A + AANT A G+V A A G + A GAA+ Sbjct: 220 -NKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLA 278 Query: 320 LMAPTIISALG 330 I+ LG Sbjct: 279 QAISDAIAVLG 289
>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature. Length = 239 Score = 27.8 bits (61), Expect = 0.013 Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 5/65 (7%) Query: 3 LSIAEIQKKVDEMALRAGLPRHSVNLCTEPIGEG-----TPYITFENNMYNYIYSERGYE 57 ++IA +++ E A AGLP + N P G G TP I+ N+ Y ++ + + Sbjct: 134 INIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSANSKYPRMFINQHQQ 193 Query: 58 FSRRV 62 S ++ Sbjct: 194 ASFKI 198
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 28.9 bits (64), Expect = 0.023 Identities = 28/101 (27%), Positives = 41/101 (40%), Gaps = 6/101 (5%) Query: 31 TAKTLTGSGTVIN-NTVINNGTAPGAIVAPRDRDSTGKNIAVEFNGISLTLPRSGLYQLK 89 + K GT +N V + P +VA RDS G N V+ +PR L Sbjct: 265 SEKVEVAPGTKVNMGPVTDRNGNPVQVVATFGRDSQG-NTTVDVQ----VIPRPDLTPGS 319 Query: 90 TDKGDYAPGPEAALSLANISPPSSLDATGQRGVPPPSDDLN 130 + + P PE + + + P+ + G R P P DLN Sbjct: 320 AEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLN 360
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.007 Identities = 11/35 (31%), Positives = 17/35 (48%) Query: 33 MVIVGPSGCAKSTMLRMIAGLEEISSGELTIADRK 67 +V+ G G KST++ + GL+ S I K Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 51.8 bits (124), Expect = 2e-09 Identities = 63/417 (15%), Positives = 120/417 (28%), Gaps = 96/417 (23%) Query: 7 SGRKRQLALIVAGVIIIAAAISGWLSVRQTTLNPLSEDAELGASVVH------IASSVPG 60 S R R +A + G ++IA +S + A + H I Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVL--------GQVEIVATANGKLTHSGRSKEIKPIENS 105 Query: 61 RIISINVEENSKVRRGDLLFSIEP-----DLYRLQ--VEQAQAELKMAEAAHDTQQR--- 110 + I V+E VR+GD+L + D + Q + QA+ E + + + Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKL 165 Query: 111 ---TVVAERSNAAITNEQIVR----------------AQANLKLATQT------------ 139 + E ++ E+++R Q L L + Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINR 225 Query: 140 -----------LARLQPLRPKGYVTAQQVDDAATAKHDAEVSLKQALKQSVAAEALVSST 188 L L K + V + +A L+ Q E+ + S Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285 Query: 189 -------------------ASSEALVVARRAALAIAERELANTQIHAPNDGRVVGLTV-S 228 + + LA E + I AP +V L V + Sbjct: 286 KEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHT 345 Query: 229 AGEFVAPDQAIFTLINTEH-WHASAFFRETELKHIKVGDCATVYVMADRQRAIQGRVEGI 287 G V + + ++ + +A + ++ I VG A + V A G + G Sbjct: 346 EGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY-GYLVGK 404 Query: 288 GWGVSSEDMLNIPRGLPYVPKSLNWVRVVQRFPVRISLEKPPEDLMRIGATAVVIVR 344 ++ + + + GL + V+ + G ++ Sbjct: 405 VKNINLDAIEDQRLGLVF--------NVIISIEENCLSTGNKNIPLSSGMAVTAEIK 453
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 33.6 bits (76), Expect = 0.001 Identities = 18/73 (24%), Positives = 35/73 (47%), Gaps = 12/73 (16%) Query: 293 FRSVRTKFVKSIANNPDVAKRFTLEQIDGLSNGITP-----------SGWVVHHKLPL-D 340 +R R +F ++AN+P+++K+F + + +G P +HHK+ + D Sbjct: 532 WRDFREQFWIAVANDPELSKQFNPGSLAVMRDGGAPYVRESEQAGGRIKIEIHHKVRVAD 591 Query: 341 DSGTNALDNLVLI 353 G + NLV + Sbjct: 592 GGGVYNMGNLVAV 604
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 125 bits (316), Expect = 3e-33 Identities = 62/265 (23%), Positives = 120/265 (45%), Gaps = 43/265 (16%) Query: 154 EYRGVINKIKLPQANQVNVKLTIVEITKDFTENIGLDW---------------NSIKSAA 198 + VI ++ + + QV V+ I E+ N+G+ W + A Sbjct: 332 DLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIA 390 Query: 199 GAFQF---------------------LNFNAQSISTLVHAINDEAIAKVLAEPNLSVLSG 237 GA Q+ F + + L+ A++ +LA P++ L Sbjct: 391 GANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDN 450 Query: 238 EYASFLVGGEIPIVSTNQNG------ISVEYKEFGIKLNIGAKVNEKKRIRVMLGEEVSS 291 A+F VG E+P+++ +Q +VE K GIKL + ++NE + + + +EVSS Sbjct: 451 MEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSS 510 Query: 292 IDKVFNLRGGDSYPSLRIRKANTTVELGDGESFILGGLISSTERESLKKIPFIGDVPLLG 351 + + D + R N V +G GE+ ++GGL+ + ++ K+P +GD+P++G Sbjct: 511 VADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIG 570 Query: 352 ALFRNAQTQRNQSELVVVATVNLVK 376 ALFR+ + ++ L++ +++ Sbjct: 571 ALFRSTSKKVSKRNLMLFIRPTVIR 595
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 41.7 bits (98), Expect = 2e-07 Identities = 19/141 (13%), Positives = 56/141 (39%), Gaps = 11/141 (7%) Query: 9 MVLIVSQLLFVCYSDIRHRIISNKFVISIACNAIILSL----------VTHHTVSIIIPI 58 +L+ L+ + + D+ ++ ++ + + ++ +L V ++ Sbjct: 137 ALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLW 196 Query: 59 VALFIGYIIFHFNVMGGGDVKLITALLLALTAEQSLNFIIYTAVMGGVVMVVGLLINRVD 118 + ++ MG GD KL+ AL L + ++ ++++G + + +L+ Sbjct: 197 SLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNHH 256 Query: 119 IQKRGVPYAVAITAGFLSSVL 139 K +P+ + ++L Sbjct: 257 QSK-PIPFGPYLAIAGWIALL 276
>PF07675#Cleaved Adhesin Length = 1358 Score = 29.7 bits (66), Expect = 0.029 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 7/66 (10%) Query: 360 NAISY-YSIFRDGNKVGTS-TNTTFTDTGLEPNKQYIYKVSATDSQGQISDFSTVVTATT 417 NA SY Y+I+R+ ++ + T TT+ D L Y Y V G+ S + TAT Sbjct: 1255 NAPSYTYTIYRNNTQIASGVTETTYRDPDL-ATGFYTYGVKVVYPNGE----SAIETATL 1309 Query: 418 LTTNLS 423 T+L+ Sbjct: 1310 NITSLA 1315
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 671 bits (1732), Expect = 0.0 Identities = 230/875 (26%), Positives = 375/875 (42%), Gaps = 67/875 (7%) Query: 2 RIIKKIPIAMTTSLIMLSGAVSA--------IDFNTDAMDANDKQNIDLSHFTNVGYIMP 53 I+K +A + ++ A +A + FN + + + DLS F N + P Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75 Query: 54 GEYRLEINVNNHRIPEQVIAFYARDDEPNSSEVCLPEAVVEQFGLKPDVLQKITFWHEGQ 113 G YR++I +NN + + + F D E CL A + GL + + + Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSE-QGIVPCLTRAQLASMGLNTASVSGMNLLADDA 134 Query: 114 CADLREL-AGLTTEVDLATSTLAINVPQDWMEYSDSNWVPSSQWDEGISGFLLDYNVNSL 172 C L + T ++D+ L + +PQ +M ++P WD GI+ LL+YN + Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 173 FSKPKESGSTRNISLNGTSGLNAGPWRLRGDYQGNYSHSSGEQNSSTSTFDWSRIYMYRA 232 + + G++ LN SGLN G WRLR + +Y+ S S + ++ R Sbjct: 195 SVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNK-WQHINTWLERD 253 Query: 233 IKSLAATLSVGENYFASSLFDTFRYAGASLSSDERMLPPNLRGYAPEVSGIARTNAKVTV 292 I L + L++G+ Y +FD + GA L+SD+ MLP + RG+AP + GIAR A+VT+ Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313 Query: 293 SQQGRILYQTTVASGPFRIQELSD-SVSGRLDVSVEEQDGTVQTFQVETAAVPYLTRPGA 351 Q G +Y +TV GPF I ++ SG L V+++E DG+ Q F V ++VP L R G Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373 Query: 352 IRYKTSVGQPSTLNHGTEGPVFASGEFSWGVSNRWSLFGGAIGSGDYNAVSVGVGRDLYA 411 RY + G+ + N E P F G+ W+++GG + Y A + G+G+++ A Sbjct: 374 TRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGA 433 Query: 412 FGAISTDITQTRASGLPNQETQSGKSLRVRYAKRFDELNSDISLAGNRFFEREFMSMNQY 471 GA+S D+TQ ++ LP+ G+S+R Y K +E ++I L G R+ + + Sbjct: 434 LGALSVDMTQANST-LPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 472 LGTRYFDNDL--------------------GRNKEMYTVTASKNFPDIQTNINFSYSYQN 511 +R ++ + +T ++ T + S S+Q Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTST-LYLSGSHQT 551 Query: 512 YWDQP-TSNSYSATVSHAFDAFSLKDMTVNLSASRSKNNGV--NDDVLYLSFSVPLGNQ- 567 YW + A ++ 
AF +D+ LS S +KN D +L L+ ++P + Sbjct: 552 YWGTSNVDEQFQAGLNTAF-----EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWL 606 Query: 568 ----------QTLSYSGQH-NGQGNNQTVNYSNSSAIDS--SYRLSAGVNNSNDNGARGQ 614 + SYS H + D+ SY + G D + Sbjct: 607 RSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGST 666 Query: 615 FSGFYIHRSSIAETSLNVAYAQDDFTSTGVSMRGGATVTAKGAALHGPGMSGGTRLMVNT 674 +R ++ DD + GG A G L P T ++V Sbjct: 667 GYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKA 723 Query: 675 DDIAGVPLEERNI-RSNRFGIAVLNNINSYYRTDTRIDINQLADDVEVKQSAVEFALTEG 733 +E + R++ G AVL Y +D N LAD+V++ + T G Sbjct: 724 PGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRG 783 Query: 734 AIGYRRFAMMKGEKVLATISLTDSSHPPFGSLVISAKGQELGIVSDDGFTYLSGVEPGET 793 AI F G K+L T++ ++ PFG++V S Q GIV+D+G YLSG+ Sbjct: 784 AIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK 842 Query: 794 LDVVW--SGAKQCQV--AIPAVIQPQA--QILLPC 822 + V W C +P Q Q Q+ C Sbjct: 843 VQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 49.6 bits (118), Expect = 7e-09 Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 12/97 (12%) Query: 129 QTEPNIKAVAKMRPDLIIISATGDDSTLELYDQLSAIAPTLVINYDDKS-----WQELTL 183 +TEPN++ + +M+P ++ SA S + L+ IAP N+ D ++ Sbjct: 84 RTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLT 139 Query: 184 QLGQATGHEGDAEQVI---DKFARRLNEVKQKITLPP 217 ++ + AE + + F R + K P Sbjct: 140 EMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARP 176
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 297 bits (763), Expect = e-101 Identities = 97/344 (28%), Positives = 173/344 (50%) Query: 5 SGEKSEKPTAGKLSKARKKGDIPRSKDVTMAAGLVTSFILLSLFLPYYKALVSQSFVSVA 64 SGEK+E+PT K+ ARKKG + +SK+V A +V +L YY S+ + A Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61 Query: 65 QLASQLDDQGALEQFLLANLFIFAKFLATLIPIPLFSMLATLIPGGWNFTPVKLIPDLKK 124 + + Q L F L L ++ + ++ G+ + + PD+KK Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121 Query: 125 LSPLAGIKRIFSASNGTEVLKMLAKCSIVLYTLYLVVHSSLDDLLHLQTLPLEEAITQGF 184 ++P+ G KRIFS + E LK + K ++ +++++ +L LL L T +E Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 185 AQYHHILLYFIAIVVVFAAIDIPLSHHLFTKKMKMTKQEVKQEHKNNDGNPEIKSRVRQL 244 +++ VV + D ++ + K++KM+K E+K+E+K +G+PEIKS+ RQ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 245 QRQYAIGQINKTVPSADVIITNPTHFSVALKYAPEKASAPYIVAKGKDDIALYIRSIAQK 304 ++ + + V + V++ NPTH ++ + Y + P + K D +R IA++ Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 305 HKIEIVEFPPLARAIYHTTKVNQQIPAQLYRAIAQVLTYVMQIK 348 + I++ PLARA+Y V+ IPA+ A A+VL ++ + Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 105 bits (264), Expect = 2e-29 Identities = 72/237 (30%), Positives = 127/237 (53%), Gaps = 3/237 (1%) Query: 19 LPFVRILSFLHFCPVIRHKAFTRKAKIGTALLLAILITPMISQPVVSEELLSIENLLLAG 78 P +R+L+ + P++ ++ ++ K+G A+++ I P + V S L LA Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPV--FSFFALWLAV 75 Query: 79 EQILWGWLFGSMLHLVLAALEAAGQILSMNMGLGMAMMNDPTSGASTAVISQIIFTFSVL 138 +QIL G G + AA+ AG+I+ + MGL A DP S + V+++I+ ++L Sbjct: 76 QQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALL 135 Query: 139 IFFTLNGHLLFVTILLKSFSSWPIG-EAINDFSLRSLALSLGWILSSATLLALPTTFIML 197 +F T NGHL +++L+ +F + PIG E +N + +L + I + +LALP ++L Sbjct: 136 LFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLLL 195 Query: 198 IVQGSFGLLNRISPTLNLFSLGFPIGMLFGLLCLLLLAINIPDHYLHLTNEILTQFE 254 + + GLLNR++P L++F +GFP+ + G+ + L I HL +EI Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLA 252
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 45.5 bits (108), Expect = 3e-10 Identities = 25/74 (33%), Positives = 37/74 (50%) Query: 14 GLHLVLMISIVAIVPSLLIGLLVSIFQATTQINEQTLSFLPRLVMTMLVLIFAGKWMMIK 73 L+LVL++S + + +IGLLV +FQ TQ+ EQTL F +L+ L L W Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70 Query: 74 LSDFTVSIFQQAAQ 87 L + + A Sbjct: 71 LLSYGRQVIFLALA 84
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 219 bits (559), Expect = 1e-73 Identities = 111/236 (47%), Positives = 155/236 (65%), Gaps = 4/236 (1%) Query: 19 LVGGLLYSPLLLAQEGGITLFNTVQTATGQDYNVKIEILILMTLLGLLPIMMLMMTCFTR 78 V L +PL AQ GIT + GQ +++ ++ L+ +T L +P ++LMMT FTR Sbjct: 9 PVLLWLITPLAFAQLPGIT--SQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTR 66 Query: 79 FIIVLAILRQALGLQQSPPNKVLTGIALALTLLVMRPVWTKIHQDAVIPFQQDEITLSQA 138 IIV +LR ALG +PPN+VL G+AL LT +M PV KI+ DA PF +++I++ +A Sbjct: 67 IIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEA 126 Query: 139 LGRAEAPLKNYMLAQTSTKSLDQMMAIA--QVSGEPQQQDLSVVTPAYVLSELKTAFQMG 196 L + PL+ +ML QT L +A P+ + ++ PAYV SELKTAFQ+G Sbjct: 127 LEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIG 186 Query: 197 FMIYIPFLVIDLIVASILMAMGMMMLSPLIVSLPFKLMLFVLCDGWTLMVGTLTAS 252 F I+IPFL+IDL++AS+LMA+GMMM+ P ++LPFKLMLFVL DGW L+VG+L S Sbjct: 187 FTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQS 242
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 72.6 bits (178), Expect = 2e-19 Identities = 35/77 (45%), Positives = 50/77 (64%) Query: 54 RKMSLFSRIPVTLTLEVASVEIPLSELLTVNNDSVIELDKLAGEPLDIRVNGIMFGQAEV 113 + + L IPV LT+E+ + + ELL + SV+ LD LAGEPLDI +NG + Q EV Sbjct: 52 QDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEV 111 Query: 114 VVINEKYGLRIININSQ 130 VV+ +KYG+RI +I + Sbjct: 112 VVVADKYGVRITDIITP 128
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 32.7 bits (74), Expect = 0.002 Identities = 26/103 (25%), Positives = 43/103 (41%), Gaps = 16/103 (15%) Query: 154 GEHLIINNSTAALIACWSYRIDFFLKDYNKSGFSIFIDAPHIDRLIDTIKTKSEKAVEKN 213 G+ L+I S A + C++ ++ F + I +D E N Sbjct: 172 GDVLLIRTSRA-EVYCYAKKLGHFNRVEGG----------IIVETLDI----QHIEEENN 216 Query: 214 VSLSERQLEHLVKKLPVTLTSQLSNINLTLAELMALKEGDIIS 256 + + L L +LPV L L N+TLAEL A+ + ++S Sbjct: 217 TTETAETLPGL-NQLPVKLEFVLYRKNVTLAELEAMGQQQLLS 258
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 375 bits (965), Expect = e-130 Identities = 127/345 (36%), Positives = 186/345 (53%), Gaps = 22/345 (6%) Query: 14 HGFVANAPSSVSVFSLARRVAEFNVPVLVTGETGTGKECVAKYIHQKAMGDASPYIAVNC 73 V + + ++ + R+ + ++ +++TGE+GTGKE VA+ +H P++A+N Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196 Query: 74 AAIPESMLEAILFGYEKGAFTGAIASVAGKFEQANGGTLLLDEIGDMPLALQVKLLRVLQ 133 AAIP ++E+ LFG+EKGAFTGA G+FEQA GGTL LDEIGDMP+ Q +LLRVLQ Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256 Query: 134 EQEVERLGGHKPIPLDIRIIASTNKDLSVEIAEGRFRQDLYYRLSVVPIHILPLRERPED 193 + E +GG PI D+RI+A+TNKDL I +G FR+DLYYRL+VVP+ + PLR+R ED Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316 Query: 194 ILPLVKAFINKYQSFLNVKIDITAEAQCELYKYTWPGNVRELENVIQRGIIMSNNGVI-- 251 I LV+ F+ + + EA + + WPGNVRELEN+++R + VI Sbjct: 317 IPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITR 376 Query: 252 ---------ELPSLGLPMAQGISSPVGETSLPF--------STIQPPDGENNIKLRGRLA 294 E+P + A S + + S Sbjct: 377 EIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEM 436 Query: 295 QYQYIVDLLQRHQGNKSKTAAFLGITPRALRYRLANMREDGIDIE 339 +Y I+ L +GN+ K A LG+ LR + +RE G+ + Sbjct: 437 EYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 44.3 bits (104), Expect = 5e-09 Identities = 23/73 (31%), Positives = 35/73 (47%), Gaps = 1/73 (1%) Query: 53 NNLSFSQVLNGAIKSVDQLQHVASEKQTAMDMGISD-DLTGTMLASQKASVAFSAMVQVR 111 +SF+ L+ A+ + Q A + +G L M QKASV+ +QVR Sbjct: 29 PTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVR 88 Query: 112 NKLTSALDDVMNT 124 NKL +A +VM+ Sbjct: 89 NKLVAAYQEVMSM 101
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 283 bits (724), Expect = 1e-90 Identities = 154/565 (27%), Positives = 258/565 (45%), Gaps = 62/565 (10%) Query: 12 GQLGENTKTILMSAVALLVTAAIIFSLWRSSQGYTALFGSQENIPITQVVEVLEGEAIAY 71 +L N + L+ A + V + LW + Y LF + + +V L I Y Sbjct: 17 NRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY 76 Query: 72 RINPDNGQVLVAENQLGKARILLAAKGITATLPIGYELMDKESMLGSSQFIQNVRYKRSL 131 R +G + V +++ + R+ LA +G+ +G+EL+D+E G SQF + V Y+R+L Sbjct: 77 RFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEK-FGISQFSEQVNYQRAL 135 Query: 132 EGELAQSMMALSAVEYARVHLGMSEASSFAISNHADNSASVVLRLRYGQTLSTEQVGAIV 191 EGELA+++ L V+ ARVHL M + S F + SASV + L G+ L Q+ A+V Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLF-VREQKSPSASVTVTLEPGRALDEGQISAVV 194 Query: 192 QLVAGSIPGMKPANVRVVDQHGELLSQAYQANSEGVPSVKSGTELAHYLQSTTEKNIANL 251 LV+ ++ G+ P NV +VDQ G LL+ Q+N+ G + + A+ ++S ++ I + Sbjct: 195 HLVSSAVAGLPPGNVTLVDQSGHLLT---QSNTSGRDLNDAQLKFANDVESRIQRRIEAI 251 Query: 252 LNSVIGANNYRISVSTQLDMSRIEETAEHYGPDPRIN------DENIQQENSNDDMAMGI 305 L+ ++G N V+ QLD + E+T EHY P+ + + E G+ Sbjct: 252 LSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGV 311 Query: 306 PGSLSNQPIPQSQAGQTPAAVSRSQAQ------------------------RKYIYDRNI 341 PG+LSNQP P ++A ++ AQ Y DR I Sbjct: 312 PGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTI 371 Query: 342 RHVRYPGYKLEKMTVAVVLN-KSLPVL--EQWTPEQQEELKRLIEDAAGIDVKRGDSLTI 398 RH + +E+++VAVV+N K+L T +Q ++++ L +A G KRGD+L + Sbjct: 372 RHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431 Query: 399 NMMAFAVP-TLIDEPVMPWWQEPSTFRWAELLGIGLLSLLVLW----FGVRPLMKRYSRK 453 F+ E +P+WQ+ S G LL L+V W VRP + R + Sbjct: 432 VNSPFSAVDNTGGE--LPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEE 489 Query: 454 GSENLPLAISSASADEALDHVDTGVDGAESSPRTENAFSASSLWKSDDLPEQGSGLETKI 513 + E + V+ + E Q G E Sbjct: 490 AK---AAQEQAQVRQETEEAVEVRLSKDEQL--------------QQRRANQRLGAEVMS 532 Query: 514 AHLQQLAQSETERTAEVIKQWINSN 538 +++++ ++ A VI+QW++++ Sbjct: 533 QRIREMSDNDPRVVALVIRQWMSND 
557
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 173 bits (440), Expect = 2e-53 Identities = 85/334 (25%), Positives = 166/334 (49%), Gaps = 2/334 (0%) Query: 15 KSDTKGRSRLEQASILLLSIGEEAAAMVMQQLSREEVVCVSQMMSRLHNIKLDQARQALD 74 D + ++A+ILL+SIG E ++ V + LS+EE+ ++ +++L I + L Sbjct: 9 ILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLL 68 Query: 75 DFFQDYREQSGINGASRSYLQAILNKALGSDIAKSVINGIYGDEIRHRMTRLQWVDTPQL 134 +F + Q I Y + +L K+LG+ A +IN + ++ D + Sbjct: 69 EFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANI 128 Query: 135 VALIDQEHLQLQAVFLAFLPPDVAAAVLAYLDKDRQDDILYRIAKLDDVNRDVVDEL-DR 193 + I QEH Q A+ L++L P A+ +L+ L + Q ++ RIA +D + +VV E+ Sbjct: 129 LNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERV 188 Query: 194 LIERGVAVLSEHGSKVIGIKQAANIVNRIPGNQQQ-LLDQLGERDEEVLNELKDEMYEFF 252 L ++ ++ SE + G+ I+N ++ +++ L E D E+ E+K +M+ F Sbjct: 189 LEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFE 248 Query: 253 ILSRQSEATLQRLMDLIPMSDWAIALKGTEPALRQAIYDVLPKRQIQQLQNATQRTGAVP 312 + + ++QR++ I + A ALK + +++ I+ + KR L+ + G Sbjct: 249 DIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTR 308 Query: 313 VSRVEHIRKVIMAQVRELAEAGEIQVQLFAEQTM 346 VE ++ I++ +R+L E GEI + E+ + Sbjct: 309 RKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDV 342
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 59.0 bits (142), Expect = 9e-13 Identities = 45/204 (22%), Positives = 100/204 (49%), Gaps = 11/204 (5%) Query: 18 QFPPLRKVRQVAPSAADQTLDPAEYQKQLMAGFQEGISQGFDKGLAEGKEEGYQEGVRLG 77 +F P+ + + A+ +L+ Q Q+ A QG+ G+AEG+++G+++G + G Sbjct: 21 EFVPIVEPEETIIEEAEPSLEQQLAQLQMQAH-----EQGYQAGIAEGRQQGHKQGYQEG 75 Query: 78 HDDGLKKGRIEGRQSELASFNDVIKPFSGYITQLHTYLETYEQRRRDELLQLVEKVTRQV 137 GL++G E +S+ A + ++ +++ T L+ + L+Q+ + RQV Sbjct: 76 LAQGLEQGLAEA-KSQQAPIHARMQQL---VSEFQTTLDALDSVIASRLMQMALEAARQV 131 Query: 138 IRCELALQPAQLLTLVEEALAALPMVPQQLKVYLNPAEFGRINDV--APEKVQAWGLAAD 195 I + + L+ +++ L P+ + ++ ++P + R++D+ A + W L D Sbjct: 132 IGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGD 191 Query: 196 PDMVGGECRIVTETTEIDVGCQHR 219 P + G C++ + ++D R Sbjct: 192 PTLHPGGCKVSADEGDLDASVATR 215
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.9 bits (67), Expect = 0.004 Identities = 6/37 (16%), Positives = 19/37 (51%) Query: 102 VNVVSEMADMMSASRSFETNVEVLNSVKSMQQSVLKL 138 VN+ E ++ + + N +VL + ++ +++ + Sbjct: 509 VNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 38.4 bits (89), Expect = 1e-06 Identities = 20/60 (33%), Positives = 27/60 (45%), Gaps = 5/60 (8%) Query: 2 SFSIANTALNAHTEQLNTISNNIANSATKGFKASR----TEFSSMYAQSQ-PLGVAVSGV 56 + A + LNA LNT SNNI++ G+ S++ A GV VSGV Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLAGELLIN#Flagellin signature. Length = 507 Score = 94.3 bits (234), Expect = 4e-23 Identities = 66/326 (20%), Positives = 123/326 (37%), Gaps = 9/326 (2%) Query: 5 IHTNASAKTAINSLSNAGLANAKSSQRLSTGFRINSPADNAAGLQITNRMEKFLNSAGQA 64 I+TN+ + N+L+ + + + + +RLS+G RINS D+AAG I NR + QA Sbjct: 4 INTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQA 63 Query: 65 KQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKELQ 124 +N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + +E+ Sbjct: 64 SRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEID 123 Query: 125 NALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLNLDSVIAELTESVTKQATPVKA 184 N T++N K+ + +M Q G + ++ ++L + + Sbjct: 124 RVSNQTQFNGVKVLSQDNQM----KIQVGANDGETITIDLQK-----IDVKSLGLDGFNV 174 Query: 185 NGSGSALEIEADTLHKATEKAKTAKEAADVATKDAQAKGAGTGATHRLTTAYDIPDYINE 244 NG A + + K T A+ D + T T + N Sbjct: 175 NGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANG 234 Query: 245 AGKSVSARTIATSADLKPIDLVDIAGAAVAMGKAHAAAEKEENLFQAKNSTGGGVMNMQL 304 + A K A A+ A ++ + + Sbjct: 235 QLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGND 294 Query: 305 ADKDLAMKADKKLSDVIDAYGAFRAT 330 + ++ + + + A A Sbjct: 295 GNGKVSTTINGEKVTLTVADITAGAA 320 Score = 62.0 bits (150), Expect = 1e-12 Identities = 54/337 (16%), Positives = 98/337 (29%), Gaps = 10/337 (2%) Query: 64 AKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSELGKEL 123 +++ S + D + K + A + D+ + +L + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 124 QNALNNTEYNSEKLFADGGKMRKELNFQSGTDAESSLKLNLDSVIAELTESVTKQATPVK 183 + G K + E T++ V Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 184 ANGSGSALEIEADTLHKATEKAKTAKEAADVATKDAQAKGAGTGATHRLTTAYDIPDYIN 243 +G K T A + N Sbjct: 301 TTINGE----------KVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKN 350 Query: 244 EAGKSVSARTIATSADLKPIDLVDIAGAAVAMGKAHAAAEKEENLFQAKNSTGGGVMNMQ 303 E+ K I + A A G A K + + + + Sbjct: 351 ESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDA 410 Query: 
304 LADKDLAMKADKKLSDVIDAYGAFRATLGANQNRLQSSSNNLDNMISNTAQALGSIKDTD 363 A K + + A R++LGA QNR S+ NL N ++N A I+D D Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470 Query: 364 FADEMKNHAQSEMLMQSSVMMLKKANAATQLISTLLQ 400 +A E+ N +++++L Q+ +L +AN Q + +LL+ Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>FLAGELLIN#Flagellin signature. Length = 507 Score = 63.5 bits (154), Expect = 1e-13 Identities = 43/238 (18%), Positives = 79/238 (33%), Gaps = 6/238 (2%) Query: 6 NSAGQAKQNIQESIAMLQIADGGLAESVKTLNAMKKLATQAANDTNSAADREAIQKEFSE 65 QA +N + I++ Q +G L E L +++L+ QA N TNS +D ++IQ E + Sbjct: 58 KGLTQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQ 117 Query: 66 LGKELQNALNNTEYNSEKLFADGGKMRKELNFQSG------TDAESSLKLDLNSVIAELT 119 +E+ N T++N K+ + +M+ ++ G L L+ Sbjct: 118 RLEEIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGP 177 Query: 120 ESVTKKATPITASATGTKEEQALEKLEDATKAADTAKKAADTAKTAMGTTKAGANAPKEI 179 + T + + A+ + A TA T A + Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLT 237 Query: 180 KIPTYINASGISIPAKTIASGTAVTQDDLNNIAGAVDVLTKEHAKAEKAAKDYAVISA 237 N + +GTA + I G + T ++ Sbjct: 238 TDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDG 295
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 36.1 bits (83), Expect = 1e-04 Identities = 25/116 (21%), Positives = 41/116 (35%), Gaps = 17/116 (14%) Query: 171 FQRSSAVLTPFFSRLLGELAPAFNEM---DNKIIITGHTDASRYRDQLLYNNWNLSGERA 227 F + A L P L +L + + D +++ G+TD D N LS RA Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTD-RIGSDAY---NQGLSERRA 278 Query: 228 LMAHKALVNGGLDEGRVLQI----------NAMADQMLLDPTDPLAAKNRRIEIMV 273 L++ G+ ++ N + A +RR+EI V Sbjct: 279 QSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 33.8 bits (77), Expect = 2e-04 Identities = 29/126 (23%), Positives = 49/126 (38%), Gaps = 7/126 (5%) Query: 32 SQAGQTQSVTHGTLVSVRPVTIQGGDGNNVAGAVGGAVVGGFLGNTIGGGTGRRLGTAAG 91 S G S+ G L+ ++ G DG A A G +V GF + + T+A Sbjct: 116 SSLGDATSLRGGNLIMT---SLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSAR 172 Query: 92 VVAGGVVGQQVQSLMNRSSGVELEVRRDDGSTFLVVQAQGVTQFHP---GQRVTIATSGS 148 V G ++ +++ S S + L++R D ST + V V F G + Sbjct: 173 VPNGAIIERELPSKFKDSVNLVLQLRNPDFSTAVRVADV-VNAFARARYGDPIAEPRDSQ 231 Query: 149 TVTITP 154 + + Sbjct: 232 EIAVQK 237
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.1 bits (112), Expect = 9e-08 Identities = 27/177 (15%), Positives = 55/177 (31%), Gaps = 17/177 (9%) Query: 39 RHSLLSHALFLLILGAGSVSAAPAPLPAVTVAVVASITPDNAVQYLGRIEAIQAVDVTTR 98 R L + L + + + V +T GR + I+ + Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIV-ATANGKLTHS------GRSKEIKPI----- 102 Query: 99 TEGFIARRLFTEGKMVKQGELLYEIDPALHQASVAQAQAQLDSATASANHAQVNLTRLQR 158 + + EG+ V++G++L ++ +A + Q+ L A Q+ ++ Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162 Query: 159 LGNNRSVSQAE-----VDEAQAQRDISRAAVAQAQANLQIQQLQLSFTQIHAPISGQ 210 E V E + R S + Q Q +L+ + A Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTV 219 Score = 40.6 bits (95), Expect = 1e-05 Identities = 24/175 (13%), Positives = 52/175 (29%), Gaps = 46/175 (26%) Query: 104 ARRLFTEGKMVKQGELL-YEIDPALHQASVAQAQAQLDSATASANHAQVNLTRLQRLGNN 162 L E Q + E++ +A A+++ + + L L + Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246 Query: 163 RSVS-------QAEVDEAQAQRDISRAAVAQ---------------------------AQ 188 ++++ + + EA + + ++ + Q Q Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQ 306 Query: 189 ANLQIQQL---------QLSFTQIHAPISGQ-MGHSRFNVGSLINPASGTLVNIV 233 I L + + I AP+S + G ++ A TL+ IV Sbjct: 307 TTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIV 360
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 817 bits (2113), Expect = 0.0 Identities = 393/965 (40%), Positives = 563/965 (58%), Gaps = 17/965 (1%) Query: 1 MLHFFIRRPKFAIVIALVITLVGWVSLYVIPVEQYPDITPPVVSVSAVYPGASARDVAQA 60 M +FFIRRP FA V+A+++ + G +++ +PV QYP I PP VSVSA YPGA A+ V Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VASPLEAQVNGVSHMLYMESTSANNGSYQLSITFASGTDPDMAAVEVQNRISQVSAQLPA 120 V +E +NG+ +++YM STS + GS +++TF SGTDPD+A V+VQN++ + LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVNENGISVRKRASNLLLGVSVFSPQQTHDALFVSNYTSIQLRDAIARISGVGDVQVFGA 180 EV + GISV K +S+ L+ S +S+Y + ++D ++R++GVGDVQ+FGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 RDYSMRVWLDPQRMESLNVSVQDIVAALQQQNVQAAAGQIGSSPSMPNQQQTLTISGQGR 240 + Y+MR+WLD + ++ D++ L+ QN Q AAGQ+G +P++P QQ +I Q R Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 241 LTDARQFADVIIRSNPQGGMIRLGDVARVALGAQNYQVSAAQNQTESAFLVVYPVPGANA 300 + +F V +R N G ++RL DVARV LG +NY V A N +A L + GANA Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 301 LNVANGVRDEMARLSAAFPADLTYEINYDSTLPVTATLHEIAVSLTLTLIVVLAVVYLFL 360 L+ A ++ ++A L FP + YD+T V ++HE+ +L +++V V+YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 361 QSLRATFIVALTVPVSLLGTFAVLYVFGYSANTLSLFAIILALTIVVDDAIVVVENVERL 420 Q++RAT I + VPV LLGTFA+L FGYS NTL++F ++LA+ ++VDDAIVVVENVER+ Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 421 LSNDPHLSPAEATRQAMSQIAGPIIATTLVLMAVFVPIAILPGIIGELYRQFAVTLSAAV 480 + D L P EAT ++MSQI G ++ +VL AVF+P+A G G +YRQF++T+ +A+ Sbjct: 420 MMED-KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478 Query: 481 ILSSINALTLSPALCAVLLKRRTL----ATTGMFGTINKGLDRARDGYVGLTGRINRRAV 536 LS + AL L+PALCA LLK + G FG N D + + Y G+I Sbjct: 479 
ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538 Query: 537 FSIAALLLVGLATWWGYSRLPTSFLPEEDQGYFFVSLQLPDGASLNRTQTVMDQMYQQVS 596 + L+ + RLP+SFLPEEDQG F +QLP GA+ RTQ V+DQ+ Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598 Query: 597 TNEA--VEDVIKITGFSLLSGNNAPNAGFAIVMLKPWGQRP----HIDRVLASIQANLAA 650 NE VE V + GFS A NAG A V LKPW +R + V+ + L Sbjct: 599 KNEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGK 656 Query: 651 IPSAMIMAVNPPAIAGLGSASGFDLRIQALLGQSPQELAQVSQGIIFAANQDP-TLSRVF 709 I ++ N PAI LG+A+GFD + G L Q ++ A Q P +L V Sbjct: 657 IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716 Query: 710 TTFSASVPETNLSIDRDRAALLQVPVSRIFQTLQTSLGGMNAGDFTLNNRMFRVQLQNDM 769 + L +D+++A L V +S I QT+ T+LGG DF R+ ++ +Q D Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776 Query: 770 NFRQRTAQINNLNVRSDNGALVSLANLVTLTPSVGAPFISNFNQFPSVAISGSAADGASS 829 FR ++ L VRS NG +V + T G+P + +N PS+ I G AA G SS Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836 Query: 830 GQAMAAMEALLAQNLPQGYSYSWSGMSWQEQQTGGQVVFIYLAALVFAYLFLVAQYESWS 889 G AMA ME L ++ LP G Y W+GMS+QE+ +G Q + + V +L L A YESWS Sbjct: 837 GDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 IPLVVVLSVVFAVGGAVAGLSAMGFANDVYAQIGLVLLIGLAAKNAILIVEFSK-ARREE 948 IP+ V+L V + G + + NDVY +GL+ IGL+AKNAILIVEF+K +E Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 949 GASMR 953 G + Sbjct: 956 GKGVV 960 Score = 93.0 bits (231), Expect = 3e-21 Identities = 74/516 (14%), Positives = 176/516 (34%), Gaps = 41/516 (7%) Query: 7 RRPKFAIVIALVITLVGWVSLYVIPVEQYPDITPPVVSVSAVYPGASARDVAQAVASP-- 64 ++ ++ AL++ + + L +P P+ V P + ++ Q V Sbjct: 536 STGRYLLIYALIVAGMVVLFLR-LPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVT 594 Query: 65 ---LEAQVNGVSHMLYMESTSANNGSYQLSITFAS---GTDPDMAAVEVQNRISQVSAQL 118 L+ + V + + S + + + F S + + + I + +L Sbjct: 595 DYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL 654 Query: 119 
PAEVNENGISVRKRASNLLLGVSVFSPQQTHDALFVSNYTSIQLRDAIARISGVGDVQVF 178 + I A L + F + D + + Q R+ + ++ + Sbjct: 655 GKIRDGFVIPFNMPAIVELGTATGFDFE-LIDQAGLGHDALTQARNQLLGMAAQHPASLV 713 Query: 179 GARD------YSMRVWLDPQRMESLNVSVQDIVAALQQQNVQAAAGQIGSSPSMPNQQQT 232 R ++ +D ++ ++L VS+ DI + ++ + Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND------FIDRGRV 767 Query: 233 LTISGQGRLTDARQFADVIIR---SNPQGGMIRLGDVARVALGAQNYQVSAAQNQTESAF 289 + Q R + + + + G M+ + ++ N S Sbjct: 768 KKLYVQAD-AKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERY-NGLPSME 825 Query: 290 LVVYPVPGANALNVANGVRDEMARLSAAFPADLTYEINYDSTLPVTATLHEIAVSLTLTL 349 + PG ++ + M L++ PA + Y+ ++ + Sbjct: 826 IQGEAAPGTSSGDA----MALMENLASKLPAGIGYD-----WTGMSYQERLSGNQAPALV 876 Query: 350 IVVLAVVYLFL----QSLRATFIVALTVPVSLLGTFAVLYVFGYSANTLSLFAIILALTI 405 + VV+L L +S V L VP+ ++G +F + + ++ + + Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936 Query: 406 VVDDAIVVVENVERLLSNDPHLSPAEATRQAMSQIAGPIIATTLVLMAVFVPIAILPGII 465 +AI++VE + L+ + EAT A+ PI+ T+L + +P+AI G Sbjct: 937 SAKNAILIVEFAKDLMEKEGK-GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAG 995 Query: 466 GELYRQFAVTLSAAVILSSINALTLSPALCAVLLKR 501 + + ++ +++ A+ P V+ + Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 69.1 bits (169), Expect = 8e-18 Identities = 25/57 (43%), Positives = 42/57 (73%) Query: 1 MMTAISFILGVMPLVFASGAGAMSRQIIGITVFGGMLMATAVGILFIPALYLHIQRL 57 +MT+++FILGV+PL ++GAG+ ++ +GI V GGM+ AT + I F+P ++ I+R Sbjct: 975 LMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031 Score = 27.1 bits (60), Expect = 0.006 Identities = 15/60 (25%), Positives = 27/60 (45%), Gaps = 2/60 (3%) Query: 1 MMTAISFILGVMPLVFASG-AGAMSRQIIGITVFGGMLMATAVGILFIPALYLHIQRLRE 59 + A+ +P+ F G GA+ RQ IT+ M ++ V ++ PAL + + Sbjct: 443 VGIAMVLSAVFIPMAFFGGSTGAIYRQF-SITIVSAMALSVLVALILTPALCATLLKPVS 501
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 78.2 bits (192), Expect = 6e-16 Identities = 173/846 (20%), Positives = 280/846 (33%), Gaps = 122/846 (14%) Query: 3529 ATLANNGTQSNDLSAQITGSGDLAFASANDGSTAS-----LSNSTNSYTGTTWVSSGNLR 3583 ATLAN G +D + +G+ A AS D + + N + + G L Sbjct: 125 ATLANVGDTWDDDGIALYVAGEQAQASIADSTLQGAGGVQIERGANVTVQRSAIVDGGLH 184 Query: 3584 LDADSALGQTSL------LAMSTATHVDINGTQQVVGELATEGGSTLDLNDGKLTVTGGG 3637 + A +L L L + T V +G V + G S L L+ G +T GG Sbjct: 185 IGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAV---SVLGASELTLDGGHIT---GG 238 Query: 3638 QIDGALTGGGELVLSGGLLNVSYDNTGFTGSTDIANGAVAHLSQAQGLGNGTINNNGTLH 3697 + G G +V L + + GAV + G G G Sbjct: 239 RAAGVAAMQGAVV---HLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFG------ 289 Query: 3698 LDNTIGTLFNALTGSDGEVLLSNNASVQLAGDNSGYSGLFTNQAGSILIANSAEHLGGSS 3757 + + + S V L A + G + A GGS Sbjct: 290 ---PVLDGWYGVDVSGSSVEL---AQSIVEAPELGAAIRVGRGA-------RVTVSGGSL 336 Query: 3758 IANSGALILNTGSVWEL--TNTISGTGTLVKRGSGTVKIEGDTVSAGLTTIEEGLLQLGS 3815 A G +I G+ +S T G + T+ G G Sbjct: 337 SAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGD 396 Query: 3816 SAVTQTLSLEESLQEDALLVSFASNMANLTSNVLITANGSLGGYGQVTGN-------VEN 3868 T+ L L V+ AS + + + +T N + + Sbjct: 397 IVATE-LPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLAS 455 Query: 3869 HGNLIMPNALTGGDFGTFTIDGNYTGDEGMITFNTILAGDTSVTDRLVITGGTAGQSYVT 3928 G++ G F T+ N G+ N D ++D+LV+ +GQ + Sbjct: 456 DGSVDFQQPAEAGRFKVLTV--NTLAGSGLFRMNVFA--DLGLSDKLVVMQDASGQHRLW 511 Query: 3929 VNNIGGVGARTFEGIKIIDVGGDSAGQFTL---NGRAVGGAYEYFLYQGG---------- 3975 V N G + + ++ SA FTL +G+ G Y Y L G Sbjct: 512 VRNSGS-EPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAK 570 Query: 3976 -------ASTPDDGDWYLRTQADDRRPEPASYTANLAAANNMFVTS-------------- 4014 A P + L+AA N V + Sbjct: 571 APPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAES 630 Query: 4015 --LSDRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSRDDSGELHTQDNR-YVMQLGGDVA 4071 LS R+GE G W R G R + D+ D + +LG D A Sbjct: 631 
NALSKRLGELRLNPDAGG------AWGR--GFAQRQQLDNRAGRRFDQKVAGFELGADHA 682 Query: 4072 QWSRNAQDLWRVGVMAGYANSSSSTVAKVAGYRSTGSVDGYSVGIYGSWLADNADDTGAY 4131 A W +G +AGY G D VG Y +++AD+ G Y Sbjct: 683 --VAVAGGRWHLGGLAGYTRGDRGFTGD-----GGGHTDSVHVGGYATYIADS----GFY 731 Query: 4132 VDSWVQYSWFDN--NVSGQDLAA--EKYDSKGFTASVEGGYAFKVGESVNQSYFIQPKAQ 4187 +D+ ++ S +N V+G D A KY + G AS+E G F + +F++P+A+ Sbjct: 732 LDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRF----THADGWFLEPQAE 787 Query: 4188 VVWMGVKADDHTETNGTVISGDGNGNIQTRLGAKAFINPSDKAKVSGPAFKPFVEANWIH 4247 + + NG + +G ++ RLG + G +P+++A+ + Sbjct: 788 LAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEV---GKRIELAGGRQVQPYIKASVLQ 844 Query: 4248 NTKDFGTT-LDGVTVKQAGTANIAELKLGVDGQINNQLNLWGNIGQQVGNKGYSETSVVL 4306 GT +G+ + AEL LG+ + +L+ + G K + Sbjct: 845 EFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHA 904 Query: 4307 GVKYNF 4312 G +Y++ Sbjct: 905 GYRYSW 910
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 66.2 bits (161), Expect = 6e-13 Identities = 182/867 (20%), Positives = 295/867 (34%), Gaps = 128/867 (14%) Query: 404 GILMMGMASE---GNSTIIINANNINSGSQSLKVNNYSHLGTAVSDITATGHLVSEQGVG 460 GIL+ A+E N ++ + + G + G V+D ++ Sbjct: 78 GILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDD 137 Query: 461 AIFSTYVSQGDAIAVINLNDITAAGSSVEIDTIASEGNSITYLTVTGQINASNGEGI--- 517 + YV+ A A I + + AG V+I+ A+ +TV G I Sbjct: 138 G-IALYVAGEQAQASIADSTLQGAGG-VQIERGAN-------VTVQRSAIVDGGLHIGAL 188 Query: 518 -TLSSQATDGSTLVNIDVNNIASEYDAIYLHNSVTGVDNGTSTIDLITRG---ALVSQQG 573 +L + S +V D N A SV G T IT G + + QG Sbjct: 189 QSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQG 248 Query: 574 YGINIE-TNTADTYVTVGGLVHGGNGTAIGIHRLENVQTSATLELQSGYALEGVTQALVF 632 ++++ GG V GG + G+ G V Sbjct: 249 AVVHLQRATIRRGDAPAGGAVPGG--------------AVPGGAVPGGFGPGGFGP--VL 292 Query: 633 TGSYA-EINDAALDLANSHLVLGGTGDAVFDLTRIDNREEAILDGDPNRITGFGTLTKTN 691 G Y +++ ++++LA S + G A+ R+ + G G+L+ + Sbjct: 293 DGWYGVDVSGSSVELAQSIVEAPELGAAI----RVGRGARVTVSG--------GSLSAPH 340 Query: 692 NSIWTLTGSNMADGDANAFLSANIAGGILVLDNATL---GLTPATTILNR--LSAADIAA 746 ++ TG A LS + G A L P L + DI A Sbjct: 341 GNVIE-TGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVA 399 Query: 747 D--PTRVATETGAL--TLAEGGALSSLGDSVLSGNLISAGGILLSNHYTGGNGAATDDRL 802 P+ T G L LA + +V S ++ +A ++ N G A+D + Sbjct: 400 TELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLASDGSV 459 Query: 803 TVTGTYFGENNGSGEGAWLALDTVLGD---------DDSATDRLVINGDATGTTSVRVNN 853 F + +G L ++T+ G D +D+LV+ DA+G + V N Sbjct: 460 D-----FQQPAEAGRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRN 514 Query: 854 AGGLGDKTRNGINLITVDGLAQDDTFLLAGDYVTTDGYQAVVAGAYAYTLQADGEAATAG 913 +G + N + L+ + TF LA DG V G Y Y L A+G Sbjct: 515 SGS-EPASANTLLLVQTPLGSA-ATFTLA----NKDG--KVDIGTYRYRLAANGNG---- 562 Query: 914 RNWYLSSELMLTEGVRYQVGVPLYEQYPQVLAALNTLPTLQQRVGNRYGAPGALA----D 969 W L P Q PQ P Q G A A Sbjct: 563 
-QWSLVGAKAPPAPKPAPQPGP---QPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGG 618 Query: 970 LNFDDNQW----------------------AWGRIEGSHQVTDPARSTSGSQREIDVWKL 1007 + W AWGR Q D + +G + + V Sbjct: 619 VGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLD---NRAGRRFDQKVAGF 675 Query: 1008 QTGIDVPLYQSQGGSLLTGGVNFTYGKAKADIHSFFGDGRINSAGYGLGTSLTWYGNNGV 1067 + G D + + G L G +T G F GDG ++ +G T+ ++G Sbjct: 676 ELGADHAVAVAGGRWHLGGLAGYTRGD-----RGFTGDGGGHTDSVHVGGYATYIADSGF 730 Query: 1068 YVDGQLQTMWFDSDLSSRTA-GHAVASGNNGRGYTSAIEAGKGYALGNGLSLTPQMQVTY 1126 Y+D L+ ++D + G+AV G +++EAG+ + +G L PQ ++ Sbjct: 731 YLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAV 790 Query: 1127 SRVDFDTFRDPFDSEVSLQEGDSLRGRIGVSLDKETTWSAKDGTTRRSHIYSHLDLHNEF 1186 R +R V + G S+ GR+G+ + K R+ Y + EF Sbjct: 791 FRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIEL----AGGRQVQPYIKASVLQEF 846 Query: 1187 LNGSKVQVSGVEFATRDKRQSVGLGAG 1213 V +G+ T + LG G Sbjct: 847 DGAGTVHTNGIAHRTELRGTRAELGLG 873
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 34.0 bits (77), Expect = 0.002 Identities = 42/189 (22%), Positives = 64/189 (33%), Gaps = 1/189 (0%) Query: 532 NTTVLNDRSTTVSGNHTETVTKDQAVTVSGNQTMDITQDQTITVTGTQRIDVTQDRIIDV 591 +T ++S +G + + + ++G + +I G Q+R Sbjct: 758 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLT 817 Query: 592 TAEQQTTVKADDRLLISGKQKTKIDLDQEYEVVGSQKKTIGANQTLKVGGYQKNTLEGYK 651 T T+ D LI+G T+ G + GY + GY Sbjct: 818 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYD 877 Query: 652 KSKIGG-NNTTTVGGHDKLTVGDTITITAGTSITLQCGASSIVMDEAGNIKITGVNITSA 710 S I G +T T G + LT G T TA + L G S + I G T Sbjct: 878 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQT 937 Query: 711 ASTTHTIKA 719 AS T+ A Sbjct: 938 ASFKSTLMA 946 Score = 33.6 bits (76), Expect = 0.004 Identities = 39/188 (20%), Positives = 59/188 (31%), Gaps = 15/188 (7%) Query: 532 NTTVLNDRSTTVSGNHTETVTKDQAVTVSGNQTMDITQDQTITVTGTQRIDVTQDRIIDV 591 +T +RS +G + + + ++G + +I G Q+ Sbjct: 806 STQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 865 Query: 592 TAEQQTTVKADDRLLISGKQKTKIDLDQEYEVVGSQKKTIGANQTLKVGGYQKNTLEGYK 651 T T+ D LI+G T+ T G N L G T + Sbjct: 866 TGYGSTSTAGYDSSLIAGYGSTQ---------------TAGYNSILTAGYGSTQTAQENS 910 Query: 652 KSKIGGNNTTTVGGHDKLTVGDTITITAGTSITLQCGASSIVMDEAGNIKITGVNITSAA 711 G +T+T G L G T TA TL G S + G TS A Sbjct: 911 DLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMA 970 Query: 712 STTHTIKA 719 ++ A Sbjct: 971 GYDSSLIA 978 Score = 32.8 bits (74), Expect = 0.007 Identities = 31/181 (17%), Positives = 66/181 (36%), Gaps = 9/181 (4%) Query: 532 NTTVLNDRSTTVSGNHTETVTKDQAVTVSGNQTMDITQDQTITVTGTQRIDVTQDRIIDV 591 +T + S +G + + ++ ++G + ++ + G +++ Sbjct: 902 STQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLT 961 Query: 592 TAEQQTTVKADDRLLISGKQKTKIDLDQEYEVVGSQKKTIGANQTLKVGGYQKNTLEGYK 651 T++ D LI+G T+ G Q + + + GY Sbjct: 962 AGYGSTSMAGYDSSLIAGYGSTQT--------AGYQSTLTAGYGSTQTAEHSSTLTAGYG 1013 Query: 652 
KSKIGGNNTTTVGGH-DKLTVGDTITITAGTSITLQCGASSIVMDEAGNIKITGVNITSA 710 + G +++ + G+ LT G +TAG TL G S++ G+ I+G + Sbjct: 1014 STATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLISGRRSSLT 1073 Query: 711 A 711 A Sbjct: 1074 A 1074 Score = 32.8 bits (74), Expect = 0.007 Identities = 23/104 (22%), Positives = 44/104 (42%), Gaps = 1/104 (0%) Query: 591 VTAEQQTTVKADDRLLISGKQKTKIDLDQEYEVVGSQKKTIGANQTLKVGGYQKNTLEGY 650 + + T + + +LI+GK ++ + + G+ + + + G G Sbjct: 1089 IAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAGD 1148 Query: 651 KKSKIGGNNT-TTVGGHDKLTVGDTITITAGTSITLQCGASSIV 693 + + GNN+ T G KLT G+ + AG L G +SI+ Sbjct: 1149 RSKLLAGNNSYLTAGDRSKLTAGNDCILMAGDRSKLTAGINSIL 1192 Score = 30.5 bits (68), Expect = 0.033 Identities = 26/90 (28%), Positives = 38/90 (42%), Gaps = 1/90 (1%) Query: 606 LISGKQKTKIDLDQEYEVVGS-QKKTIGANQTLKVGGYQKNTLEGYKKSKIGGNNTTTVG 664 LI+G + T+I ++ + G +T G TL G K G ++T T G Sbjct: 1088 LIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAG 1147 Query: 665 GHDKLTVGDTITITAGTSITLQCGASSIVM 694 KL G+ +TAG L G I+M Sbjct: 1148 DRSKLLAGNNSYLTAGDRSKLTAGNDCILM 1177
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.0 bits (70), Expect = 0.041 Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 3/37 (8%) Query: 218 KSLSLPTSVMLPIPMGRPVVVGGMPVLNLLALMMGLF 254 +S S+P SVML +P+G +VG + L ++ Sbjct: 892 ESWSIPVSVMLVVPLG---IVGVLLAATLFNQKNDVY 925
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 28.9 bits (64), Expect = 0.046 Identities = 13/44 (29%), Positives = 25/44 (56%) Query: 221 NALDEAAFANEYFMPEYVESFYTLNDSAKQHMLAEQRMTSDGIT 264 A+ + F EY+ E + + ++ D A +H +AEQR T + ++ Sbjct: 329 KAIPSSLFYEEYWQEELLMALRSMTDIAYKHEMAEQRRTIEKLS 372
>PF04183#IucA / IucC family Length = 580 Score = 735 bits (1900), Expect = 0.0 Identities = 381/576 (66%), Positives = 447/576 (77%), Gaps = 1/576 (0%) Query: 5 DYANWQQVNRHMIAKILSELEYERTLHAELHGETG-RITLPGAVYTFNGKRGIWGWLHID 63 ++ +W VNR ++AK+LSELEYE+ HAE G+ I LPGA + F +RGIWGWL ID Sbjct: 2 NHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIWGWLWID 61 Query: 64 PATLRCEGVPLAADHMLRQLALVLKMDDSQVAEHLEDLYATLRGDMQLLSARHGMSAEAL 123 TLRC P+ A +L QL VL M D+ VAEH++DLYATL GD+QLL AR G+SA L Sbjct: 62 AQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDL 121 Query: 124 IALNDDALQCLLAGHPKFIFNKGRRGWGLTALQHYAPEYQGQFRLHWVAAKRGSFIWCVD 183 I LN D LQCLL+GHPKF+FNKGRRGWG AL+ YAPEY FRLHW+A KR IW D Sbjct: 122 INLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCD 181 Query: 184 AEYPLDNLLNSAMDPAERQRFDRRWRECQLNDDWVPVPLHPWQWQQKIALHFLPQLAEGE 243 E + LL +AMDP E RF + W+E L+ +W+P+P+HPWQWQQKIA F+ AEG Sbjct: 182 NEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEGR 241 Query: 244 LIELGEFGDHYLAQQSLRTLTNVSRRVPFDIKLPLTIYNTSCYRGIPGKYISAGPAASRW 303 ++ LGEFGD +LAQQSLRTLTN SRR DIKLPLTIYNTSCYRGIPG+YI+AGP ASRW Sbjct: 242 MVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRW 301 Query: 304 LQQVFAQDRTLHESGAEILGEPAAGYMLHQTYATLAKAPYRCQEMLGVIWRENPSCYLRE 363 LQQVFA D TL +SGA ILGEPAAGY+ H+ YA LA+APYR QEMLGVIWRENP +L+ Sbjct: 302 LQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKP 361 Query: 364 GEHAILMATLMETNNQGHPLIAAYIARSGLSAEAWLEQMFRVVVVPMYHLMCCYGVALIA 423 E +LMATLME + PL AYI RSGL AE WL Q+FRVVVVP+YHL+C YGVALIA Sbjct: 362 DESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALIA 421 Query: 424 HGQNITLVMKDHAPQRILLKDFQGDMRLVDKDFPQAASLPNVVKDVTVRLSADYLIHDLQ 483 HGQNITL MK+ PQR+LLKDFQGDMRLV ++FP+ SLP V+DVT RLSADYLIHDLQ Sbjct: 422 HGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDLQ 481 Query: 484 TGHFVTVLRFISPLMQACNLSEYRFYQLLAQVLERYMAQHPDLADRFTLFNLFKPQIIRV 543 TGHFVTVLRFISPLM + E RFYQLLA VL YM +HP +++RF LF+LF+PQIIRV Sbjct: 482 
TGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIRV 541 Query: 544 VLNPVKLTYSEQDGGSRMLPDYLQDLDNPLYLVTKE 579 VLNPVKLT+ + DGGSRMLP+YL+DL NPL+LVT+E Sbjct: 542 VLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWLVTQE 577
>PF04183#IucA / IucC family Length = 580 Score = 320 bits (821), Expect = e-104 Identities = 101/457 (22%), Positives = 170/457 (37%), Gaps = 37/457 (8%) Query: 62 TQHHHYLFPAYLHQQGNDRQDDDTPVKLGIEQLVTLLLEKPTVKGELSDDVVARFRQRVL 121 + F A G D T L LL + +SD VA Q + Sbjct: 41 LPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLY 100 Query: 122 ESHDNTQQAINIRLDWPSLRDKPLNFAQAEQGLLAGHAFHPAPKSHQPFNEKQAQRYLPD 181 + Q + R + LN Q LL+GH K + + ++ +RY P+ Sbjct: 101 ATLLGDLQLLKARRGLSASDLINLNA-DRLQCLLSGHPKFVFNKGRRGWGKEALERYAPE 159 Query: 182 FASRFPLRWFAVDKRYLCGDSLKLTLQHRLQRFASESAPQLLAYFT--------DDVW-L 232 +A+ F L W AV + ++ H Q + PQ A F+ D W Sbjct: 160 YANTFRLHWLAVKREHMIWRCDNEMDIH--QLLTAAMDPQEFARFSQVWQENGLDHNWLP 217 Query: 233 LPMHPWQADHLLKQDWCQQLVQQNALHDLGEAGERWLPTSSSRSLYSPSNRD--MVKFSL 290 LP+HPWQ + D+ + + LGE G++WL S R+L + S R +K L Sbjct: 218 LPVHPWQWQQKIATDFIADFAEGR-MVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPL 276 Query: 291 SVRLTNSVRTLSVKEAKRGMRLARLAQTPRWQELQARY--------PTFRVMQEDGWAGL 342 ++ T+ R + + G +R Q + P + +G+A L Sbjct: 277 TIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAAL 336 Query: 343 RSADFTLQEESLLVLRDNLLFSQPDSQTNVLVTLTQAAPDGGDSLLASAVRRLAARLNLP 402 A + QE ++ R+N ++ VL+ + L + + R Sbjct: 337 ARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSG------ 390 Query: 403 LQQAAFCWLDAYCQHVLLPLFSTEADYGLVLLAHQQNILVEMQQDLPVGMLYRDCQGSGF 462 A WL + V++PL+ YG+ L+AH QNI + M++ +P +L +D QG Sbjct: 391 --LDAETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGD-- 446 Query: 463 TQSALPWLAEIGEAEAENSFSEQQLLRYFPYYLLVNS 499 + E+ E + + L++ Sbjct: 447 MRLVKEEFPEMDSLPQE----VRDVTSRLSADYLIHD 479
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 42/180 (23%), Positives = 73/180 (40%), Gaps = 16/180 (8%) Query: 24 FCVGLLGIGQNGLLVVLPVLVSRTHLSLSVWAG---LLTLGSMLFLVGSAWWGRQSEIRG 80 V L +G ++ VLP L+ S V A LL L +++ + G S+ G Sbjct: 12 STVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFG 71 Query: 81 CKFVVIMALAGYLLSFVLLALAVWGLSAGWLSEMAGLGWLIVARIIYGLTVSGMVPASQT 140 + V++++LAG + + ++A A W+ L + RI+ G+T + A Sbjct: 72 RRPVLLVSLAGAAVDYAIMATA----PFLWV--------LYIGRIVAGITGATGAVAGAY 119 Query: 141 WALQRAGYEQRMAALATISSGLSCGRLLGPLCAALALSIHPIAPLWLMAITPLIALLVVY 200 A G ++R +S+ G + GP+ L P AP + A + L Sbjct: 120 IADITDG-DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 320 bits (821), Expect = e-114 Identities = 114/216 (52%), Positives = 154/216 (71%) Query: 1 MLEIFDVRYDELTDIRSEDLYKLRKKTFKDRLNWEVNCSNGMEFDEYDNSDTRYLLGIYQ 60 MLEIFDV + L++ +S +L+ LRK+TFKDRLNW V C++GMEFD+YDN++T YL GI Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNNTTYLFGIKD 60 Query: 61 GQLICSVRFIELHLPNMITHTFNALFDDVALPKRGYIESSRFFVDKTRAKLLFGNHYPIS 120 +ICS+RFIE PNMIT TF F ++ +P+ Y+ESSRFFVDK+RAK + GN YPIS Sbjct: 61 NTVICSLRFIETKYPNMITGTFFPYFKEINIPEGNYLESSRFFVDKSRAKDILGNEYPIS 120 Query: 121 YLFFLSIINYSRHNGYTGIYTIVSRAMLTILKRSGWQVEVIKEAHITEKERIYLLHLPID 180 + FLS+INYS+ GY GIYTIVS MLTILKRSGW + V+++ ++ER+YL+ LP+D Sbjct: 121 SMLFLSMINYSKDKGYDGIYTIVSHPMLTILKRSGWGIRVVEQGLSEKEERVYLVFLPVD 180 Query: 181 RDNQARLLLQVNQRLQDPCSVLSTWPISLPVMPESA 216 +NQ L ++N+ + L WP+ +P A Sbjct: 181 DENQEALARRINRSGTFMSNELKQWPLRVPAAIAQA 216
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 84.2 bits (208), Expect = 1e-19 Identities = 41/146 (28%), Positives = 60/146 (41%), Gaps = 14/146 (9%) Query: 426 PPPPPPPAPPAPKTVRLDSLSLFDVGKFTLNAGSTKML---VTALIDIKAKPGWLIVVAG 482 P P P K L S LF+ K TL L + L ++ K G +VV G Sbjct: 201 APAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG-SVVVLG 259 Query: 483 HTDITGDAQANHILSLKRAEALRDWMLSTSDVSPTCFAVQGYGATRPIADNDT------- 535 +TD G N LS +RA+++ D+ L + + + +G G + P+ N Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDY-LISKGIPADKISARGMGESNPVTGNTCDNVKQRA 318 Query: 536 --PDGRALNRRVEISLVPQADACQVP 559 D A +RRVEI + D P Sbjct: 319 ALIDCLAPDRRVEIEVKGIKDVVTQP 344
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 30.5 bits (68), Expect = 0.034 Identities = 19/54 (35%), Positives = 24/54 (44%), Gaps = 9/54 (16%) Query: 622 GIGKTETALALADSLFGGEKSLITINLSEYQEAHTVSQLKGSPPGYVGYGQGGV 675 GIG +A A AD + KS + N S Y G+ PGYV Q G+ Sbjct: 23 GIGAPPSAHAGADDVVDSSKSFVMENFSSYH---------GTKPGYVDSIQKGI 67
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 30.1 bits (68), Expect = 0.010 Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 1/65 (1%) Query: 22 GQGKVADYIPALAEVPANKLGI-AVCTLDGQIFQAGDADERFSIQSISKVLSLTLALSRY 80 + + I + ++G+ + G+ A ADERF + S KV+ L+R Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80 Query: 81 SEQDI 85 D Sbjct: 81 DAGDE 85
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 52.7 bits (126), Expect = 2e-08 Identities = 39/162 (24%), Positives = 61/162 (37%), Gaps = 24/162 (14%) Query: 2144 DVAALFDLGGGDDVAKGYHKKKNIFTIGSGFKQYQGGENADTFILTSAAASKSHIL--SG 2201 D+AA+ L G + + + + Y +++ I + A + SG Sbjct: 250 DIAAIQRLYGANMTTRTGDSVYGFNS-NTDRDFYTATDSSKALIFSVWDAGGTDTFDFSG 308 Query: 2202 GEGNDTVALGEVLGNEIDSIIDISKGYYSQVNGGVEKQVALLYDFENILGHENVNDTIIG 2261 N + L S + KG S + GV EN +G ND ++G Sbjct: 309 YSNNQRI----NLNEGSFSDVGGLKGNVS-IAHGVTI--------ENAIGGSG-NDILVG 354 Query: 2262 NDVDNYLNGMGGDDKIWGNGGNDLLALQSGLAQGGTGLDSYH 2303 N DN L G G+D ++G G D L GG G D++ Sbjct: 355 NSADNILQGGAGNDVLYGGAGADTLY-------GGAGRDTFV 389 Score = 45.0 bits (106), Expect = 5e-06 Identities = 31/137 (22%), Positives = 47/137 (34%), Gaps = 21/137 (15%) Query: 2637 SSGNDEVVITSATFLPGNYIDTGDGNDAIIYIRGHEGT-MLKGGGGDDTYYYSAGSGAIN 2695 SGND +V SA N + G GND + G G L GG G DT+ Y +G + Sbjct: 346 GSGNDILVGNSA----DNILQGGAGNDVLY---GGAGADTLYGGAGRDTFVYGSGQDSTV 398 Query: 2696 IADTSGLDHLY-----------LDKHILLHTLSAERRENNLVLNIADNTSGRIIFVDWYL 2744 A D + + + ++L S I + + Sbjct: 399 AAYDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQWDAANS--ITNLWLHE 456 Query: 2745 ADENKVEFIWVEDSQIT 2761 A + V+F+ Q Sbjct: 457 AGHSSVDFLVRIVGQAA 473
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 54.4 bits (131), Expect = 1e-11 Identities = 20/75 (26%), Positives = 34/75 (45%) Query: 642 LSAGIASAMSMASLTQPYTSGSSMTTIGAASYRGQSALSLGVSSISDSGRWGSKLQASSN 701 L G+A+ +++ L QP G + + YR ++AL++GV S A + Sbjct: 5 LQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVAFNT 64 Query: 702 TQGDFGIGVGVGYQW 716 G G VGY++ Sbjct: 65 YNGGMSYGASVGYEF 79
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.5 bits (235), Expect = 1e-24 Identities = 36/124 (29%), Positives = 59/124 (47%) Query: 2 KPLIWLVEDEPSIADTLIYTLESEGFTLRWFDRGEPALAALSSGSPALAIVDVGLPDING 61 I + +D+ +I L L G+ +R +++G L + DV +PD N Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 FDLCRRMLAAAPDLPVIFLTARSEELDRIVGLEIGADDYIAKPFSPREVSARVRTILRRL 121 FDL R+ A PDLPV+ ++A++ + I E GA DY+ KPF E+ + L Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 122 QKSH 125 ++ Sbjct: 123 KRRP 126
>PF06580#Sensor histidine kinase Length = 349 Score = 34.8 bits (80), Expect = 7e-04 Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 20/74 (27%) Query: 376 LIDNA----LDFTPAGGEINVSGERQDDTYLITVEDSGCGIPDYAQEKIFDRFYSLPRAN 431 L++N + P GG+I + G + + T + VE++G SL N Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG----------------SLALKN 306 Query: 432 SPKSTGLGLNFVRE 445 + +STG GL VRE Sbjct: 307 TKESTGTGLQNVRE 320
>PF06917#Periplasmic pectate lyase Length = 555 Score = 29.5 bits (66), Expect = 0.006 Identities = 20/66 (30%), Positives = 27/66 (40%), Gaps = 2/66 (3%) Query: 28 MDALRFIPAQRDLGLDQYQLALMQFDA-VLSWGRFPYRDY-DPRNLCALLLVWMIENAPD 85 L F+ A DL Y+ A DA +WG+ YR Y RN L V+ + Sbjct: 213 TKGLTFVNAGTDLIYAAYKYAEYTGDAAAAAWGKHLYRQYVLARNPETGLPVYQFSSPQQ 272 Query: 86 HGPVPE 91 P+P Sbjct: 273 RQPIPA 278
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 29.9 bits (67), Expect = 0.032 Identities = 32/109 (29%), Positives = 45/109 (41%), Gaps = 25/109 (22%) Query: 20 AQLPPLLRRLYASRGVK---------DAQELERGVKGLLAWQKLDGIDAGVTLLQQALAD 70 A LPP +AS G + DA L RG G L L G D + + Q Sbjct: 99 ANLPP-----FASPGSRVDVTVSSLGDATSL-RG--GNLIMTSLSGADGQIYAVAQG--- 147 Query: 71 RRRIVIVGDFDA--DGATSTALAVLALRSMGGSNLDYLVPNRFEDGYGL 117 +IV F A D AT T + R G+ ++ +P++F+D L Sbjct: 148 ---ALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNL 193
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 23.2 bits (50), Expect = 0.048 Identities = 11/34 (32%), Positives = 20/34 (58%), Gaps = 3/34 (8%) Query: 11 SHEQVVARMLKKPAV---RAEYERLERQDFAIID 41 SH +++R L+ PAV + E+++ D I+D Sbjct: 189 SHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVD 222
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 33.1 bits (75), Expect = 0.005 Identities = 40/172 (23%), Positives = 63/172 (36%), Gaps = 26/172 (15%) Query: 437 LYLRNQSAATPWNFWAQTLYAHSRQSSGTYTPGYQTNGYGINVGVDRRFND--ESLFG-- 492 LY P N WA + S S G + YG + GVD N E++ G Sbjct: 1012 LYQFAPKYEKPTNVWANAIGGTSLNSGG------NASLYGTSAGVDAYLNGEVEAIVGGF 1065 Query: 493 VSLGYQNANIN---IHSYGNEKDVDSYELMAYTGWFDDRYFFNGNVNMGYNSNSSTRNIG 549 S GY + + ++S N + Y + F +++ F+ S+ S+ N Sbjct: 1066 GSYGYSSFSNQANSLNSGANNTNFGVYSRI-----FANQHEFDFEAQGALGSDQSSLNFK 1120 Query: 550 ENTGYQGNTKATADYNSLQMGYQVKAGMTFDL----DVVKLQPSVAYNYQWL 597 N YN L +A +D + + L+PSV +Y L Sbjct: 1121 SALLRDLNQS----YNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHL 1168
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 44.6 bits (105), Expect = 4e-08 Identities = 19/62 (30%), Positives = 31/62 (50%) Query: 105 IKLVGVIEHSAPSESIAILEVKGKQTTHLTRENINYEDIVIVKIFTDRVIIKRNGKYYSL 164 + L GV+ S SIAI+ +Q + E + + IV I DRV+++ G+Y L Sbjct: 95 LSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVL 154 Query: 165 II 166 + Sbjct: 155 GL 156
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 543 bits (1400), Expect = 0.0 Identities = 309/610 (50%), Positives = 431/610 (70%), Gaps = 15/610 (2%) Query: 3 ISGKGIKSIHGMIFLFTLIMPLDIISANFSVSFKDVDIKEFINSVSKNINKTIIIDPTVQ 62 I I+S + +F ++ + FS SFK DI+EFIN+VSKN+NKT+IIDP+V+ Sbjct: 2 IIANVIRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVR 61 Query: 63 GLISIRSYETLDKDTYYQLFLNVLDVYGYAAIEMPHNVLKVISSKRAKGVVAPLPKEGVT 122 G I++RSY+ L+++ YYQ FL+VLDVYG+A I M + VLKV+ SK AK P+ + Sbjct: 62 GTITVRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAP 121 Query: 123 FDGDELINRVIPLRYISAKKITPLLRQLNDNTESGSIINYDPSNILLITGRAAVVNRLHS 182 GDE++ RV+PL ++A+ + PLLRQLNDN GS+++Y+PSN+LL+TGRAAV+ RL + Sbjct: 122 GIGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLT 181 Query: 183 IVTDLDQAGDNEIELYKLNYAIAADVVKIVNEAINPINNLKQEVSIVGKVIADERTNSIL 242 IV +D AGD + L++A AADVVK+V E + S+V V+ADERTN++L Sbjct: 182 IVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVL 241 Query: 243 ISGDTYIRKKSILMIKKLDKRQSSDGNTKVVYMKYAQASKLLDVLNGISEGFHNEKKTKQ 302 +SG+ R++ I MIK+LD++Q++ GNTKV+Y+KYA+AS L++VL GIS +EK+ + Sbjct: 242 VSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK 301 Query: 303 SNQWNQRPVAIKAYDQTNALVITADPDMMLALGEVIEKLDIRRAQVLVEAIIVETQNGEG 362 + + IKA+ QTNAL++TA PD+M L VI +LDIRR QVLVEAII E Q+ +G Sbjct: 302 PVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADG 361 Query: 363 INLGVKWENKRSDDINF----IKNSDGLLNNNGWGIATTIT-----------GLTAGFYK 407 +NLG++W NK + F + S + N + T++ G+ AGFY+ Sbjct: 362 LNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQ 421 Query: 408 GNWDVLLSALSTNKNNNILATPSIVTLDNMEAEFNVGQEVPVLISTQTTTTDKVYNSISR 467 GNW +LL+ALS++ N+ILATPSIVTLDNMEA FNVGQEVPVL +QTT+ D ++N++ R Sbjct: 422 GNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVER 481 Query: 468 QSIGVMLKVKPQINKGDSVLLEIRQEVSSIADSSTVNTHNLGSVFNKRVVNNAVLVKSGE 527 +++G+ LKVKPQIN+GDSVLLEI QEVSS+AD+++ + +LG+ FN R VNNAVLV SGE Sbjct: 482 
KTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGE 541 Query: 528 TVVVGGLLDKKSSTIVNKVPFLGDLPLIGWLFRQTKEKVEKSNLILFIKPTILRESDDYS 587 TVVVGGLLDK S +KVP LGD+P+IG LFR T +KV K NL+LFI+PT++R+ D+Y Sbjct: 542 TVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601 Query: 588 VVTSKEYNKY 597 +S +Y + Sbjct: 602 QASSGQYTAF 611
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 356 bits (915), Expect = e-123 Identities = 171/406 (42%), Positives = 263/406 (64%), Gaps = 7/406 (1%) Query: 1 MAVFKYVAISRSGTKITGDIDAENIRIARYLLYKKNMHVLSI-------KKRILLFNKYV 53 MA + Y A+ G K G +A++ R AR LL ++ + LS+ +K Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 54 VKKNSNKTDLVLITRQIATLVNASMPLDEVLDIVGKQNSKSKMIEIIQRIRVNIQEGHSF 113 K + +DL L+TRQ+ATLV ASMPL+E LD V KQ+ K + +++ +R + EGHS Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 114 ADALSPFSAVFSPLYKTMVTAGEVSGHLGLVLVRLADHIEQTQKIQRKIIQALIYPCVLV 173 ADA+ F F LY MV AGE SGHL VL RLAD+ EQ Q+++ +I QA+IYPCVL Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 174 LISLSVIIILLTAVVPNIVEQFSFSETALPLSTKVLMILSYSIKENVIFIMAIGVSAVIF 233 +++++V+ ILL+ VVP +VEQF + ALPLST+VLM +S +++ +++ ++ + Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 234 LNRLLKINKINVFFHRHYLSLPMLGNMFVRINTSRYLRTLTTLHSNGVTIVQAMSISNAV 293 +L+ K V FHR L LP++G + +NT+RY RTL+ L+++ V ++QAM IS V Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 294 LTNVYIKNKLNISVKLVSEGCSLSSSLVDSGVFPPIILHMIISGERSGELDHMLETVAGV 353 ++N Y +++L+++ V EG SL +L + +FPP++ HMI SGERSGELD MLE A Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 354 QEEELMNQISIVMSLLEPTIIIVMAAFISFVILSILQPILEINSLV 399 Q+ E +Q+++ + L EP +++ MAA + F++L+ILQPIL++N+L+ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 207 bits (529), Expect = 2e-72 Identities = 87/136 (63%), Positives = 103/136 (75%) Query: 2 ANKKTKGFTLLEIMVVIVILGLLASLTIPSLMSNKNRADQQKAVSDISALENALDMYRLD 61 A K +GFTLLEIMVVIVI+G+LASL +P+LM NK +AD+QKAVSDI ALENALDMY+LD Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62 Query: 62 NGDYPTEQQGIAALVTKPNVPPLPQRYPSDGYIRRLPTDPWGNSYQMNNPGKHGQIDIFS 121 N YPT QG+ +LV P +PPL Y +GYI+RLP DPWGN Y + NPG+HG D+ S Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLS 122 Query: 122 IGPDRLPETEDDIGNW 137 GPD TEDDI NW Sbjct: 123 AGPDGEMGTEDDITNW 138
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 55.7 bits (134), Expect = 2e-12 Identities = 35/157 (22%), Positives = 57/157 (36%), Gaps = 10/157 (6%) Query: 4 SQRAFTLLELLLAMIIISGLYYSVLITLPKGSGVVKSE-AENLVQGLRYINQKIRHEGGV 62 QR FTLLE++L ++++ VL+ P ++ LR++ Q+ G Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61 Query: 63 FGLQLSETHWRFYKFCCDDCHGIKDNFKINTKINCIWQDAGNDKI-LSREYPDKLTSKLN 121 FG+ + W+F D G + W ++ S KLN Sbjct: 62 FGVSVHPDRWQFLVLEARD--GADPAPADDGWSGYRWLPLRAGRVATSGSIAG---GKLN 116 Query: 122 VYGEDSIIDNVIGDNIKPQLVFSPEEEYSDFSLVLRN 158 + GDN P ++ P E + F L L Sbjct: 117 LAFAQGEAWTP-GDN--PDVLIFPGGEMTPFRLTLGE 150
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 30.2 bits (68), Expect = 0.003 Identities = 13/44 (29%), Positives = 24/44 (54%), Gaps = 9/44 (20%) Query: 4 RPDCGFTLLEMLLAVVIFSMISFIIYSSLRITIKSNNVMGNKAQ 47 GFTLLE+++ +VI +++ ++ N+MGNK + Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVP---------NLMGNKEK 39
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 232 bits (594), Expect = 5e-78 Identities = 115/275 (41%), Positives = 151/275 (54%), Gaps = 4/275 (1%) Query: 6 VFFVSYLIFGAMVGSFLNVLIYRFPIMLANLSSR-SESHGEEIKMRSHLRNINLFQPGSF 64 ++F +F M+GSFLNV+I+R PIML S+ NL P S Sbjct: 14 LYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSC 73 Query: 65 CHHCNESIPIKYNIPILGWIFLRGASRCCNKKISTRYLFIEVLAVIQTLLVLMIFKEDLL 124 C HCN I NIP+L W++LRG R C IS RY +E+L + ++ V M Sbjct: 74 CPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWG 133 Query: 125 ICTSLVLIWSLTALAFIDFDTYLLPDCMTIPLLWLGLLINIDTVFAPLTSAVLGAVSGYL 184 +L+L W L AL FID D LLPD +T+PLLW GLL N+ F L AV+GA++GYL Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYL 193 Query: 185 FLWLSYWLFKIVRGVDGMGYGDFKLMAALGAWFGVSAVPFLILFSSFFGLVAYAIFYFFD 244 LW YW FK++ G +GMGYGDFKL+AALGAW G A+P ++L SS G Sbjct: 194 VLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLR 253 Query: 245 KKDNGKEINYIAFGPYISLAGVLYLFLGSHVTNLF 279 K I FGPY+++AG + L G +T + Sbjct: 254 NHHQSKP---IPFGPYLAIAGWIALLWGDSITRWY 285
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 29.1 bits (65), Expect = 0.016 Identities = 23/104 (22%), Positives = 46/104 (44%), Gaps = 13/104 (12%) Query: 186 GVAVSGNIHLWVADTQTPESRENWLT----TLEKIKALKPAIVVPGHFLDNAPQTLESVI 241 GVA + N LWV++ P+S + LE + +KP+ +V +P+ L + Sbjct: 58 GVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIA 117 Query: 242 FTQNYLTTLNAEIPKAKDSAELIAVMKKHYPELKDESSLELSAK 285 + + D + +A+ +K E+ D +L+ +A+ Sbjct: 118 PGRGF---------NFSDGKQPLAMARKSLTEMADLLNLQSAAE 152
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 58.4 bits (141), Expect = 2e-11 Identities = 33/149 (22%), Positives = 67/149 (44%), Gaps = 3/149 (2%) Query: 26 LPQVAGDLHISIPTAGWLISGYALGVAIGAPIMAVLTAKLPRKKTLLLLMVIFIIGNLMC 85 LP +A D + + W+ + + L +IG + L+ +L K+ LL ++I G+++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 86 ALAYSYDF-LMFARVITALCHGAFFGIGAVVAANLVAPNRRASAVALMFTGLTLANVLGV 144 + +S+ L+ AR I AF + VV A + R A L+ + + + +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 145 PLGTALGQAFGWRSTFW--VVSVIGLFSL 171 +G + W ++++I + L Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFL 185
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 977 bits (2528), Expect = 0.0 Identities = 328/570 (57%), Positives = 417/570 (73%), Gaps = 5/570 (0%) Query: 3 QISRQEYAGLFGPTTGDKIRLGDTNLFIEIEKDLRGYGEESVYGGGKSLRDGMGANNNLT 62 ++SR YA +FGPT GDK+RL DT LFIE+EKD +GEE +GGGK +RDGMG + +T Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMG-QSQVT 62 Query: 63 RDNGVLDLVITNVTIVDARLGVIKADVGIRDGKIAGIGKSGNPGVMDGVTQGMVVGVSTD 122 R+ G +D VITN I+D G++KAD+G++DG+IA IGK+GNP + GVT ++VG T+ Sbjct: 63 REGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGVT--IIVGPGTE 119 Query: 123 AISGEHLILTAAGIDSHIHLISPQQAYHALSNGVATFFGGGIGPTDGTNGTTVTPGPWNI 182 I+GE I+TA G+DSHIH I PQQ AL +G+ GGG GP GT TT TPGPW+I Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179 Query: 183 RQMLRSIEGLPVNVGILGKGNSYGRGPLLEQAIAGVVGYKVHEDWGATANALRHALRMAD 242 +M+ + + P+N+ GKGN+ G L+E + G K+HEDWG T A+ L +AD Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVAD 239 Query: 243 EVDIQVSVHTDSLNECGYVEDTIDAFEGRTIHTFHTEGAGGGHAPDIIRVASQTNVLPSS 302 E D+QV +HTD+LNE G+VEDTI A +GRTIH +HTEGAGGGHAPDIIR+ Q NV+PSS Sbjct: 240 EYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSS 299 Query: 303 TNPTLPYGVNSQAELFDMIMVCHNLNPNVPADVSFAESRVRPETIAAENVLHDMGVISMF 362 TNPT PY VN+ AE DM+MVCH+L+P +P D++FAESR+R ETIAAE++LHD+G S+ Sbjct: 300 TNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSII 359 Query: 363 SSDSQAMGRVGENWLRILQTADAMKAARGKLPEDAAGNDNFRVLRYVAKITINPAITQGV 422 SSDSQAMGRVGE +R QTAD MK RG+L E+ NDNFRV RY+AK TINPAI G+ Sbjct: 360 SSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGL 419 Query: 423 SHVIGSVEVGKMADLVLWDPRFFGAKPKMVIKGGMINWAAMGDPNASLPTPQPVFYRPMF 482 SH IGS+EVGK ADLVLW+P FFG KP MV+ GG I A MGDPNAS+PTPQPV YRPMF Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMF 479 Query: 483 GAMGKTLQDTCVTFVSQAALDDGVKEKAGLDRQVIAVKNCR-TISKRDLVRNDQTPNIEV 541 GA G++ ++ VTFVSQA+LD G+ + G+ ++++AV+N R I K ++ N TP+IEV Sbjct: 480 
GAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539 Query: 542 DPETFAVKVDGVHATCEPIATASMNQRYFF 571 DPET+ V+ DG TCEP M QRYF Sbjct: 540 DPETYEVRADGELLTCEPATVLPMAQRYFL 569
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.1 bits (182), Expect = 9e-18 Identities = 27/112 (24%), Positives = 52/112 (46%), Gaps = 1/112 (0%) Query: 1 MRIALESEGWRVFESETLQRGLIEAGTRKPDLIILDLGLPDGDGLNYIQDLRQWSA-IPI 59 + AL G+ V + DL++ D+ +PD + + + +++ +P+ Sbjct: 19 LNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV 78 Query: 60 IVLSARNNEEDKVAALDAGADDYLSKPFGISELLARVRVALRRHSGASQESP 111 +V+SA+N + A + GA DYL KPF ++EL+ + AL + Sbjct: 79 LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLE 130
>PF05272#Virulence-associated E family protein Length = 892 Score = 203 bits (518), Expect = 2e-57 Identities = 97/427 (22%), Positives = 168/427 (39%), Gaps = 40/427 (9%) Query: 309 EAEGASGEYL--PWPKFKRDKFDQIEGTITNVLMALRR-PDLCGVQIRLDEFRNDIILTT 365 + E GE+L + + ++ ++ ALR P L G + DE R + Sbjct: 424 DGEDPFGEWLDDEVARLRLRGRWLLKPRRAALIEALRSAPALAGC-VAFDELREQPVAVR 482 Query: 366 P---KGSHVQLRDEYYTKIHTTIEQKLGFKKFEEAAIKRAVRLIAFENRYDSLKDWISKL 422 + + L D ++ +E G + ++A+ + A NR +DW+ Sbjct: 483 AFPWRKAPGPLEDADVLRLADYVETTYGTGEASAQTTEQAINVAADMNRVHPFRDWVKAQ 542 Query: 423 PEWDGTPRIDTFFCRH-------WRIDQSAYTKAVGRYWWTLLAGRALEPGIKGDMAVVL 475 WD PR++ + ++ + Y + VG+Y R +EPG K D +VVL Sbjct: 543 Q-WDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVL 601 Query: 476 VSQQGKNKSEGIRSMAPTP---EHYMELDFEKPAAERIREMRGHNVIELGEMRGMNKAGI 532 G KS I ++ + + ++ K + E+I G EL EM +A Sbjct: 602 EGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIA---GIVAYELSEMTAFRRADA 658 Query: 533 GAVRVTISTRADRNRGLYREHYDILLRRCGFIATVNTDTPLTDSEGNRRWLPMTIPDDTD 592 AV+ S+R DR RG Y + R+ T N L D GNRR+ P+ +P + Sbjct: 659 EAVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYLFDITGNRRFWPVLVPGRAN 718 Query: 593 GKHIAKQIEAERDQLWAEAVRVFKQ---------DGIAWERAETLAKTILSDYEVKDDVW 643 ++ R QL+AEA+ ++ D + R E + + + + +W Sbjct: 719 LVW----LQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQELRLVETGVQ--GRLW 772 Query: 644 VSCISEWLETEAIELGDTSGITNGQRVPLTSKDLLVEAIGFKAPQVKRGDEMRVAKIMKD 703 E A E G + +T D LV+A+G + E +V + + Sbjct: 773 ALLTREGA--PAAEGAAQKGYS-VNTTFVTIAD-LVQALGADPGKSSPMLEGQVRDWLNE 828 Query: 704 LGYKKKR 710 G++ R Sbjct: 829 NGWEYLR 835
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 102 bits (256), Expect = 7e-26 Identities = 58/267 (21%), Positives = 108/267 (40%), Gaps = 30/267 (11%) Query: 144 GLVNPVQVSAEILKTLAQRAQ-AALAGELDGVVITVPAYFDDAQRQGTKDAARLAGLHVL 202 G++ V+ ++L+ ++ + V++ VP +R+ +++A+ AG + Sbjct: 79 GVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREV 138 Query: 203 RLLNEPTAAAIAYGLDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDD 262 L+ EP AAAI GL + V D+GGGT +++++ L+ V +GGD Sbjct: 139 FLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDR 193 Query: 263 FDHLLADWLREQAGVATRDDHGIQRQLLDTAIAAKI----ALSEAETAVVSVAG---WQG 315 FD + +++R G G TA K A E + V G +G Sbjct: 194 FDEAIINYVRRNYGSLI----GEA-----TAERIKHEIGSAYPGDEVREIEVRGRNLAEG 244 Query: 316 -----EVTREQLESLIAPLVKRTLMACRRALKD-AGVTADEILE--VVMVGGSTRVPLVR 367 + ++ + + + A AL+ A +I E +V+ GG + + Sbjct: 245 VPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLD 304 Query: 368 EQVGQFFGRTPLTSIDPDKVVAIGAAI 394 + + G + + DP VA G Sbjct: 305 RLLMEETGIPVVVAEDPLTCVARGGGK 331
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 148 bits (375), Expect = 6e-40 Identities = 111/438 (25%), Positives = 186/438 (42%), Gaps = 46/438 (10%) Query: 169 LVMDSLAGNGTFKLGSMLQQDASAPLNVTGNADGDFILQIDGSGIDPTNLN----VVSTG 224 L +++LAG+G F++ S L V +A G L + SG +P + N V + Sbjct: 473 LTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPL 532 Query: 225 GGDARFTLT--DGPIGLGNRVYNLVKDASGKITLVANESTVTPG---------------- 266 G A FTL DG + +G Y L + +G+ +LV ++ P Sbjct: 533 GSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQ 592 Query: 267 ----------TASILAVANT---------TPVIFNAELSSVQQRLDKQSTEANESGIWGT 307 + A AN ++ AE +++ +RL + + G WG Sbjct: 593 PEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGR 652 Query: 308 YLHNNFAVKGRAAN-FDQTLNGMTLGGDKATALTDGVLSVGGFASASTSSIKTDYQSKGN 366 + RA FDQ + G LG D A A+ G +GG A + G+ Sbjct: 653 GFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGH 712 Query: 367 VDSHSFGAYAQYLANNGYYVNGVVKANKFNQDIHVTSADNSA-SGNTNFSGMGVAVKAGK 425 DS G YA Y+A++G+Y++ ++A++ D V +D A G G+G +++AG+ Sbjct: 713 TDSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGR 772 Query: 426 HINH-NHLYVSPYVAMSAFSSGKSVVKLSNGMAAQSSSTRSMIGTLGVNAGYPFVLKNGV 484 H + ++ P ++ F +G + +NG+ + S++G LG+ G L G Sbjct: 773 RFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGR 832 Query: 485 EMKPYVSASVDHEFAANNKFRVNQEMFDNNLNGTRVNTGAGLNVNITPNLSVGSEVKVSS 544 +++PY+ ASV EF N L GTR G G+ + S+ + + S Sbjct: 833 QVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSK 892 Query: 545 GKNIKTPVTVNLNVGYRF 562 G + P T + GYR+ Sbjct: 893 GPKLAMPWT--FHAGYRY 908
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 34.2 bits (78), Expect = 0.001 Identities = 24/133 (18%), Positives = 42/133 (31%), Gaps = 9/133 (6%) Query: 99 TFTAAGDAAVTVLNASDFSLADKAT---ANNTTLTDGTFTVAGDAAVTATNMSGGKFAVK 155 T +T ++ SL D AT A+N+ G + V A ++ Sbjct: 219 VLTLQASEGITSRENAEISLYDGATLNLASNSVKLMGNVWMGRLQYVGAY--LAPSYSTI 276 Query: 156 GKAKIKDT----QLSAGNFTLAENATANDTTLNGGKFDVSNEATATNTTINNGLFTLKDG 211 +K+ L+ G+ A+ + G D+ A G + K Sbjct: 277 NTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYKDKPN 336 Query: 212 AHADSTTVNSGTF 224 +TT N+ Sbjct: 337 DKPSNTTQNNAKN 349 Score = 30.0 bits (67), Expect = 0.022 Identities = 37/199 (18%), Positives = 66/199 (33%), Gaps = 11/199 (5%) Query: 96 TGGTFTAAGDAAVTVLNASDFSLADKATANNTTLTDGTFTVAGDAAVTATNMSGGKFAVK 155 T T G+ L D + A + GT + A + G + K Sbjct: 275 TINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLWQSAGLNIIAPPEGGYKDK 334 Query: 156 GKAKIKDTQLSAGNFTLAENATANDTTLNGGKFDVSNEATATNTTINNGLF-----TLKD 210 K +T + E++ N T + + + T + +G F T+ + Sbjct: 335 PNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQKTEIQPTQVIDGPFAGGKNTVVN 394 Query: 211 GAHADST---TVNSGTFVMADQSTANGIQLVDSAFTLASGAKASGI--TKLTGGQAQVAG 265 ++ T+ G F + + A + + L++ A + LTG V G Sbjct: 395 INRINTNADGTIRVGGFKASLTTNAAHLHIGKGGINLSNQASGRSLLVENLTGN-ITVDG 453 Query: 266 SLESLSLTGGRADFANSAK 284 L + GG A +SA Sbjct: 454 PLRVNNQVGGYALAGSSAN 472
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 29.5 bits (66), Expect = 0.008 Identities = 17/89 (19%), Positives = 25/89 (28%) Query: 39 LGLAYLAQGDLTAARKNLEKAVEADPQDYRTQLGMAFYAQRIGENSAAEQRYQQAMKLAP 98 L G A K + D D R LG+ Q +G+ A Y + Sbjct: 42 LAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101 Query: 99 GNGTVLNNYGAFLCSLGQYVSAQQQFSAA 127 + L G+ A+ A Sbjct: 102 KEPRFPFHAAECLLQKGELAEAESGLFLA 130
>INTIMIN#Intimin signature. Length = 939 Score = 468 bits (1204), Expect = e-144 Identities = 269/857 (31%), Positives = 402/857 (46%), Gaps = 89/857 (10%) Query: 61 SKADTMVSYSSTEPYVLGSGETVAMVAKKYGITVDELKKIN--IYRTFSRPFTALTTGDE 118 SK T SY + Y L +GETVA ++K I + + +N +Y + S A G + Sbjct: 51 SKLLTHNSYQNRLFYTLKTGETVADLSKSQDINLSTIWSLNKHLYSSESEMMKA-EPGQQ 109 Query: 119 IDIPRKASPF-----------------------------SVDNNKDNRLSVENTLAGHAV 149 I +P K PF S D K N ++ +A Sbjct: 110 IILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSN--MTDDKALNYAA 167 Query: 150 AGATALS--------NGDVAKSGERMVRSAASNEFNNSAQQWLSQFGTARVQLNINDDFH 201 A +L NGD AK A N+ ++ Q WL +GTA V L ++F Sbjct: 168 QQAASLGSQLQSRSLNGDYAKD---TALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNF- 223 Query: 202 LDGSAADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGNWMYGANTFFDNDL 261 DGS+ D L+P YD+EK + F Q+GAR DSR T N+GAG R F M G N F D D Sbjct: 224 -DGSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDF 282 Query: 262 TGKNRRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGYDLRAEAYLPSYP 321 +G N R+G+G E W DY K S N YF ++ WH+S + DY+ERPANG+D+R YLPSYP Sbjct: 283 SGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYP 342 Query: 322 QLGGKAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPLVTIGAEHRAGKGGQNDSN 381 LG K MYE+Y GD+VALF D Q NP A T GVNYTPIPLVT+G ++R G G +ND Sbjct: 343 ALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLL 402 Query: 382 INFQLNYRLGETWQSHIDPSAVAASRTLAGSRYDLVERNNHIVLDYQKQNLVRLSLPDSL 441 + Q Y+ + W I+P V RTL+GSRYDLV+RNN+I+L+Y+KQ+++ L++P + Sbjct: 403 YSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDI 462 Query: 442 AGDPFSQLSVTAQVTATHGLERIDWQSAELMAAGGVLKQT---SKNGLEITLPEYQMNRT 498 G S + V + +GL+RI W + L + GG ++ + S + LP Y + Sbjct: 463 NGTERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYV--QG 520 Query: 499 GGNSYILNAIAYDTQGNASSQASMLITV--NAQKINIANST-LVAVPINIEANNSDTSVV 555 G N Y + A AYD GN+S+ + ITV N Q ++ T A + +A+ ++ Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580 Query: 556 
TLTLKDDN----NIPVTGQDVTFLSPLGTLSAMTDSGNGVYTATLTAGTVSGTTAVSSNI 611 T T+K + N+PV+ V+ + L SA T+ G+G T TL + + Sbjct: 581 TATVKKNGVAQANVPVSFNIVSGTAVLSANSANTN-GSGKATVTLKSDKPGQVVVSAKTA 639 Query: 612 NGSALDMTPATVTLNGNSGELSITHSMLVAAPVNIEANGSDTSVVTLTLRDSNN-NPVTG 670 ++ A + ++ + + + A ANG D +T T++ PV+ Sbjct: 640 EMTSALNANAVIFVDQTKA----SITEIKADKTTAVANGQDA--ITYTVKVMKGDKPVSN 693 Query: 671 QTVTFAGTLGTLG--AVTEGSSGVYTATLTAGIMVGTSSITASVNSTALGVTPATVTLNG 728 Q VTF TLG L ++G TLT+ G S ++A V+ A+ V V Sbjct: 694 QEVTFTTTLGKLSNSTEKTDTNGYAKVTLTST-TPGKSLVSARVSDVAVDVKAPEVEF-- 750 Query: 729 DSGNLSTTNSTLVAAPVNIEANSSDTSVVTLTLR-DNNNNPVTGQTVVFTSTL------- 780 T T+ + I + T+ L+ N +G +T Sbjct: 751 ------FTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS 804 Query: 781 --GTLGNVTEQASGVYTATLTAGTVSGVASLSVSVGGNALGVTPATVTLNGDSGNLSTTN 838 + G VT + G T ++ + + A+ +++ + + + D+ N Sbjct: 805 VDASSGQVTLKEKGTTTISVISSD-NQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNF 863 Query: 839 STLVAAPVNIEANSSDT 855 + + N N Sbjct: 864 GGKLPSSQNELENVFKA 880 Score = 86.7 bits (214), Expect = 7e-19 Identities = 78/412 (18%), Positives = 142/412 (34%), Gaps = 43/412 (10%) Query: 1164 TLRDNNNNPVTGQTVAFTSTLGTLGNVTEQASGLYTATLTAGTVSGVASLSVNVGGTALG 1223 LR + + L + S +Y T A +G + S NV T Sbjct: 491 ALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNG--NSSNNVLLTITV 548 Query: 1224 VTPATVTLNGDSGNLSTTNSTLVAAPVNIEANSSDTSVVTLTLRDNN----NNPVTGQTV 1279 ++ V + + + + +A+ ++ T T++ N N PV+ V Sbjct: 549 LSNGQVVDQVGVTDFTADKT-------SAKADGTEAITYTATVKKNGVAQANVPVSFNIV 601 Query: 1280 AFTSTLGTLGNVTEQASGLYTATLTAGTVSGVASLSVSVNSTALGVTPATVTLNGDSGNL 1339 + T+ L T SG T TL + V + + T+ A + ++ Sbjct: 602 SGTAVLSANSANTN-GSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVD------ 654 Query: 1340 STTNSTLVAAPVNIEANSSDTSVVTLTLR-DNNNNPVTGQTVAFTSTLGTLGN--VTEQA 1396 T S A ++ +T T++ + PV+ Q V FT+TLG L N Sbjct: 655 QTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDT 714 Query: 1397 SGVYTATLTAGTVAGVASLSVNVGGNALGVTPATVTLNGDSGNLSTTNSTLVAAPVNIEA 1456 +G TLT+ T G + +S V A+ V V T T+ + I Sbjct: 715 
NGYAKVTLTSTTP-GKSLVSARVSDVAVDVKAPEVEF--------FTTLTIDDGNIEIVG 765 Query: 1457 NSSDTSVVTLTLR-DNNNNPVTGQTVAFTSTL---------GTLGNVTEQASGVYTATLT 1506 + T+ L+ N +G +T + G VT + G T ++ Sbjct: 766 TGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVI 825 Query: 1507 AGTVSGVASLSVSVGSSALGVTPATVTLNGDSGNLSTTNSTLVAAPVNIEAN 1558 + + A+ +++ +S + + D+ N + + N N Sbjct: 826 SSD-NQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELEN 876 Score = 82.4 bits (203), Expect = 1e-17 Identities = 80/420 (19%), Positives = 140/420 (33%), Gaps = 51/420 (12%) Query: 760 TLRDNNNNPVTGQTVVFTSTLGTLGNVTEQASGVYTATLTA----GTVSGVASLSVSVGG 815 LR + L + S VY T A G S L+++V Sbjct: 491 ALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLS 550 Query: 816 NALGVTPATVTLNGDSGNLSTTNSTLVAAPVNIEANSSDTSVVTLTLRDNN----NNPVT 871 N V VT A + +A+ ++ T T++ N N PV+ Sbjct: 551 NGQVVDQVGVT-------------DFTADKTSAKADGTEAITYTATVKKNGVAQANVPVS 597 Query: 872 GQTVNFAGTLGTLGTVSEGSSGVYTTTLTAGTVAGVASLSVNVGGNALGVTPATVTLNGN 931 V+ L + + + SG T TL + V + + A + ++ Sbjct: 598 FNIVSGTAVL-SANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQT 656 Query: 932 SGNLSATNSTLVAAPVNIEANSSDTSVVTLTLR-DNNNNPVTGQTVAFTSTLGTLGN--V 988 A+ + + A AN D +T T++ + PV+ Q V FT+TLG L N Sbjct: 657 K----ASITEIKADKTTAVANGQDA--ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTE 710 Query: 989 TEQASGVYTATLTAGTVSGVASLSVSVNSNALGVTPATVTLNGDSGNLSTTNSTLVAAPV 1048 +G TLT+ T G + +S V+ A+ V V T T+ + Sbjct: 711 KTDTNGYAKVTLTSTTP-GKSLVSARVSDVAVDVKAPEVEF--------FTTLTIDDGNI 761 Query: 1049 NIEANSSDTSVVTLTLR-DNNNNPVTGQTVAFTSTL---------GTLGNVTEQASGLYT 1098 I + T+ L+ N +G +T + G VT + G T Sbjct: 762 EIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTT 821 Query: 1099 ATLTAGTVSGVASLSVNVGGNALGVTPATVTLNGDSGNLSATNSTLVAAPVNIEANSSDT 1158 ++ + + A+ ++ + + + D+ N + + N N Sbjct: 822 ISVISSD-NQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKA 880 Score = 60.9 bits (147), Expect = 4e-11 Identities = 52/280 (18%), Positives = 95/280 (33%), Gaps = 27/280 (9%) Query: 1568 
TLRDNNNNPVTGQTVAFTSTLGTLGNVTEQASGVYTATLTA----GTVSGVASLSVSVNS 1623 LR + + L + S VY T A G S L+++V S Sbjct: 491 ALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLS 550 Query: 1624 NALGVTPATVTLNGDSGNLSTTNSTLVAAPVNIEANSSDTSVVTLTLRDNN----NNPVT 1679 N V VT A + +A+ ++ T T++ N N PV+ Sbjct: 551 NGQVVDQVGVT-------------DFTADKTSAKADGTEAITYTATVKKNGVAQANVPVS 597 Query: 1680 GQTVVFTSTLGTLGNVTEQASGLYTATLTAGTVSGVASLSVSVGGNALGVTGNITLAPGA 1739 V T+ L T SG T TL + V + + + + N + Sbjct: 598 FNIVSGTAVLSANSANTN-GSGKATVTLKSDKPGQVVVSAKTAEMTS-ALNANAVIFVDQ 655 Query: 1740 LDAARSILAVNKPSINADDRIGSTITFTAQDAQ-GNAITGLDIAFMTDLENSQIMTLVDH 1798 A+ + + +K + A+ + IT+T + + ++ ++ F T L T Sbjct: 656 TKASITEIKADKTTAVANGQ--DAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTD 713 Query: 1799 NDGTYTANINGTQTGIANIAVQSSGATIAGLAATMVTITP 1838 +G + T G + ++ + S + + A V Sbjct: 714 TNGYAKVTLTSTTPGKSLVSARVSDVAVD-VKAPEVEFFT 752
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 77.9 bits (192), Expect = 9e-19 Identities = 33/150 (22%), Positives = 70/150 (46%), Gaps = 5/150 (3%) Query: 10 QSGSVLIVEDEPKLGQLLVDYLQAAGYRTQWLTNGAEVVATVRQTPPAIILLDLMLPGSD 69 ++L+ +D+ + +L L AGY + +N A + + +++ D+++P + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 70 GITLCREIR-RFSDIPIVMVTAKTEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-- 126 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 127 --RRCSQQRHQPTDDAPLLINESRFQASYQ 154 RR S+ D PL+ + Q Y+ Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYR 151
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 34.0 bits (78), Expect = 0.001 Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 21/90 (23%) Query: 170 LSTLLAAAVTWVLS-------------RGMLAPVKRLVEGTHRLAA------GDFST--R 208 L+TL+AA++ + ++A V+ V H LA G F Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136 Query: 209 VAVSSRDELGHLAQDFNQLASSLEKNEQMR 238 V++ + GHL N+LA E+ +QMR Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMR 166
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 126 bits (318), Expect = 5e-34 Identities = 97/435 (22%), Positives = 182/435 (41%), Gaps = 17/435 (3%) Query: 20 FMQTLDTTIVNTALPSIAASLGENPLRMQSVIVSYVLTVAVMLPASGWLADRIGVKWVFF 79 F L+ ++N +LP IA + P V +++LT ++ G L+D++G+K + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 SAIILFTFGSLMCAQSATLNE-LILSRVLQGVGGAMMVPVGRLTVMKIVPREQYMAAMAF 138 II+ FGS++ + LI++R +QG G A + + V + +P+E A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQIGPLVGPALGGFLVEFASWHWIFLINLP-VGVIGALATLLLMPNHKMSTRRFDI 197 + +G VGPA+GG + + HW +L+ +P + +I + L+ FDI Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFIMLAIGMATLTLALDGHTGLGLSPLAIAGLILCGVIALGSYWWHALGNRFALFSLHL 257 G I++++G+ L + ++ V++ + H L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FKNKIYTLGLVGSMSARIGSGMLPFMTPIFLQIGLGFSPFHAG-LMMIPMIIGSMGMKRI 316 KN + +G++ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 IVQVVNRFGYRRVLVNATLLLAVVSLSLPLVAIMGWTLLMPVVLFFQGMLNALRFSTMNT 376 +V+R G VL L+V L+ + + +++F G L+ + ++T Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV-IST 371 Query: 377 LTLKTLPDRLASSGNSLLSMAMQLSMSIGVSTAGILLGTFAHHQVATNTPATHSAFLYS- 435 + +L + A +G SLL+ LS G++ G LL Q S +LYS Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431 Query: 436 -YLCMAIIIALPALI 449 L + II + L+ Sbjct: 432 LLLLFSGIIVISWLV 446
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 862 bits (2229), Expect = 0.0 Identities = 285/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIQRPVATTLLTLAITLSGIIGFSLLPVSPLPQVDYPVIMVSASMPGADPETMASSVAT 65 FI+RP+ +L + + ++G + LPV+ P + P + VSA+ PGAD +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERALGRIAGVNEMTSTS-SLGSTRIILQFDLNRDINGAARDVQAALNAAQSLLPSGMP 124 +E+ + I + M+STS S GS I L F D + A VQ L A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKMNPSDAPIMIMTLTSDT--FSQGQLYDYASTKLAQKIAQTEGVSDVTVGGSSL 182 + S + +M+ SD +Q + DY ++ + +++ GV DV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVELNPSALFNQGVSLDAVRQAISAANVRRPQGSVDAAET------HWQVQANDEIK 236 A+R+ L+ L ++ V + N + G + + + A K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAEGYRPLIVHYN-NGSPVRLQDVANVIDSVQDVRNAGMSAGQPAVLLVISREPGANIIA 295 E + + + N +GS VRL+DVA V ++ G+PA L I GAN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDRIRAELPALRASIPASIQLNIAQDRSPTIRASLDEVERSLVIAVALVILVVFIFLRS 355 T I+A+L L+ P +++ D +P ++ S+ EV ++L A+ LV LV+++FL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGVKPMVAALRGVREVGFTVLSMSISLVAVFIPLLLMAGLPGRLFREFAVTLSVAIGIS 474 E + P A + + ++ ++ +++ L AVFIP+ G G ++R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LVISLTLTPMMCAWLLRSHPKGQQQRIRGFG----KVLLAIQQGYGRSLNWALSHTRWVM 530 ++++L LTP +CA LL+ + GF Y S+ L T + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 
VVLLSTIALNVWLYISIPKTFFPEQDTGRMMGFIQADQSISFQSMQQKLKDFMQIVGADP 590 ++ +A V L++ +P +F PE+D G + IQ + + Q+ L + Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 591 -----AVDSVTGFT-GGSRTNSGSMFISLKPLSER---QETAQQVITRLRGKLAKEPGAN 641 +V +V GF+ G N+G F+SLKP ER + +A+ VI R + +L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLSSVQDIRVGGRHSNAAYQFTLLADDLAALREWEPKVRAALAKL-----PQLADVNSD 696 + ++ I G + ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDKGAEMALTYDRETMARLGIDVSEANALLNNAFGQRQISTIYQPLNQYKVVMEVAPEY 756 + A+ L D+E LG+ +S+ N ++ A G ++ K+ ++ ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDVSSLDKMFVINSNGQSIPLSYFAKWQPANAPLAVNHQGLSAASTISFNLPDGGSLSE 816 +DK++V ++NG+ +P S F + + I G S + Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ATAAVERAMTELGVPSTVRGAFAGTAQVFQETLKSQLWLIMAAIATVYIVLGILYESYVH 876 A A +E ++L P+ + + G + + + L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALDAQRNGN 936 P++++ +P VG LLA LF+ + ++G++ IG+ KNAI++V+FA D Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 ISAREAIFQASLLRFRPIIMTTLAALFGALPLVLSSGDGAELRQPLGITIVGGLVVSQLL 996 EA A +R RPI+MT+LA + G LPL +S+G G+ + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVIYLYFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 77.6 bits (191), Expect = 2e-16 Identities = 59/350 (16%), Positives = 130/350 (37%), Gaps = 12/350 (3%) Query: 680 VRAALAKLPQLADVNSDQQDKGAEMALTYDRETMARLGI---DVSEANALLNNAFGQRQI 736 V+ L++L + DV M + D + + + + DV + N+ Q+ Sbjct: 162 VKDTLSRLNGVGDVQLFGAQY--AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL 219 Query: 737 STIYQPLNQYKVVMEVAPEYTQDVSSLDKMFV-INSNGQSIPLSYFAK--WQPANAPLAV 793 Q +A ++ 
K+ + +NS+G + L A+ N + Sbjct: 220 GGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA 279 Query: 794 NHQGLSAASTISFNLPDGGSLSEATAAVERAMTEL--GVPSTVRGAFA-GTAQVFQETLK 850 G AA +L + A++ + EL P ++ + T Q ++ Sbjct: 280 RINGKPAAGLGIKLATGANAL-DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIH 338 Query: 851 SQLWLIMAAIATVYIVLGILYESYVHPLTILSTLPSAGVGALLALELFDAPFSLIALIGI 910 + + AI V++V+ + ++ L +P +G L F + + + G+ Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGM 398 Query: 911 MLLIGIVKKNAIMMVDFALDAQRNGNISAREAIFQASLLRFRPIIMTTLAALFGALPLVL 970 +L IG++ +AI++V+ + +EA ++ ++ + +P+ Sbjct: 399 VLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAF 458 Query: 971 SSGDGAELRQPLGITIVGGLVVSQLLTLYTTPVIYLYFDRLRNRFSKQPL 1020 G + + ITIV + +S L+ L TP + + + + Sbjct: 459 FGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENK 508
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 873 bits (2256), Expect = 0.0 Identities = 288/1036 (27%), Positives = 502/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLFMIAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVVTSSI 72 + FI RP+ + I +++AG + LPV+ P + P + V YPGA V ++ Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMASQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L M+S S S G+ ITL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPYPPIYNKVNPADPPILTLAVTATAIPMTQVE--DMVETRIAQKISQVTGVGLVTLSGG 189 + I + ++ + TQ + D V + + +S++ GVG V L G Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAPAVAALGLDSETIRTAISNANVNSAKGSLDGP------TRSVTLSANDQ 243 Q A+R+ L+A + L + + N A G L G + ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MKSAEEYRDLII-AYQNGAPIRLQDVATIEQGAENNKLAAWANTQSAIVLNIQRQPGVNV 302 K+ EE+ + + +G+ +RL+DVA +E G EN + A N + A L I+ G N Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSIREMLPELIKSLPKSVDVKVLTDRTSTIRASVNDVQFELLLAIALVVMVIYLFL 362 + TA +I+ L EL P+ + V D T ++ S+++V L AI LV +V+YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNAAATIIPSIAVPLSLVGTFAAMYFLGFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N AT+IP+IAVP+ L+GTFA + G+SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLDAALKGAGEIGFTIISLTFSLIAVLIPLLFMEDIVGRLFREFAVTLAVAIL 481 + E P +A K +I ++ + L AV IP+ F G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SYESLRKQNRLSRASEKFFDWVIAHYAVALKKVLNHPWL 538 +S +V+L LTP +CA +L S E + FD + HY ++ K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 
TLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQVAAIILK 598 L + + V+L+L +P F P +D G+ ++ P + + QV LK Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VESLTSFVGVDGTNATLNNGRLQINLKPLSERDDRIP---QIITRLQESVSGVPG 653 + VES+ + G + N G ++LKP ER+ +I R + + + Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 IKLYLQPVQDLTIDTQLSRTQYQFTLQ---ATSLEELSTWVPKLVNELQQK-APFQDVTS 709 ++ P I + T + F L + L+ +L+ Q A V Sbjct: 660 --GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHDVQ 769 + + + VD++ A LG++++ I+ + A G ++ + ++ ++ D + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ATPGLAAFNDIRLTGSDGKGVPLNSIATIEERFGPLSINHLNQFPSATVSFNLAQGYSLG 829 + + + ++G+ VP ++ T +G + N PS + A G S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 EAVAAVTLAEKEIQLPADITTRFQGSTLAFQAALGSTLWLIIAAIVAMYIVLGVLYESFI 889 +A+A + +LPA I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMIDFALAAERDQ 949 P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA + Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVCMVGGLIVSQV 1009 G +A A +R RPILMT+LA + G LPL +S G G+ + +G+ ++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDKL 1025 L +F PV +++ + Sbjct: 1016 LAIFFVPVFFVVIRRC 1031 Score = 84.1 bits (208), Expect = 2e-18 Identities = 78/517 (15%), Positives = 191/517 (36%), Gaps = 25/517 (4%) Query: 533 LNHPWLTLSVAFSTLVLTVILYLLIPKGFFPLQDNGLIQGTLEAPQSVSFSNMAERQQQV 592 + P +A ++ + L +P +P + + P + + Q V Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGA----DAQTVQDTV 61 Query: 593 AAIILKDPAVESLTSFVGVDGTNAT-LNNGRLQINL--KPLSERDDRIPQIITRLQESVS 649 +I 
+++ + ++T + G + I L + ++ D Q+ +LQ + Sbjct: 62 TQVI-----EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 650 GVP-GIKLYLQPVQDLTIDTQLSRTQYQFTLQATSLEELSTWVPK-LVNELQQKAPFQDV 707 +P ++ V+ + + L + T+ +++S +V + + L + DV Sbjct: 117 LLPQEVQQQGISVEKSS-SSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDV 175 Query: 708 TSDWQDQGLVAFVNVDRDSASRLGITMAAIDNALYNAFGQRLISTIYTQSNQYRVVLEHD 767 + + +D D ++ +T + N L Q + L Sbjct: 176 QLFGAQYAMR--IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233 Query: 768 VQATPGLAAFNDIR----LTGSDGKGVPLNSIATIEERFGPLSIN-HLNQFPSATVSFNL 822 + A + SDG V L +A +E ++ +N P+A + L Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293 Query: 823 AQGYSLGEAVAAV--TLAEKEIQLPADI-TTRFQGSTLAFQAALGSTLWLIIAAIVAMYI 879 A G + + A+ LAE + P + +T Q ++ + + AI+ +++ Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 880 VLGVLYESFIHPITILSTLPTAGVGALLALMLTGNELDVIAIIGIILLIGIVKKNAIMMI 939 V+ + ++ + +P +G L G ++ + + G++L IG++ +AI+++ Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 940 DFALAAERDQGMTPYDAIYQACLLRFRPILMTTLAALFGALPLMLSTGVGAELRQPLGVC 999 + + + P +A ++ ++ + +P+ G + + + Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473 Query: 1000 MVGGLIVSQVLTLFTTPVIYLLFDKLARNTRGKNRHR 1036 +V + +S ++ L TP + K +N+ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 10/115 (8%) Query: 84 VIAANTVTVTSRVDGELMALHFTEGQQVKAGDLLAEIDPRPYEVQLTQAQGQLAKDQATL 143 + + + + + + EG+ V+ GD+L ++ A+ K Q++L Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSL 143 Query: 144 DNARRDLARYQKLSK---TGLISQQELDTQSSLVRQSEGSVKADQGAIDSAKLQL 195 AR + RYQ LS+ + + +L + SE V I Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198 Score = 42.5 bits (100), Expect = 3e-06 Identities = 23/124 (18%), Positives = 54/124 (43%), Gaps = 4/124 (3%) Query: 125 YEVQLTQAQGQLAKDQATLDNARRDLARYQKLSKTGLISQQELDTQSSLVRQSEGSVKAD 184 E + +A +L ++ L+ ++ + + L++Q + +RQ+ ++ Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAK--EEYQLVTQLFKNEILDKLRQTTDNIGLL 314 Query: 185 QGAIDSAKLQLTYSRITAPISGRV-GLKQVDVGNYITSGTATPIVVITQTHPVDVVFTLP 243 + + + S I AP+S +V LK G +T+ T +V++ + ++V + Sbjct: 315 TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQ 373 Query: 244 ESDI 247 DI Sbjct: 374 NKDI 377
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.007 Identities = 8/31 (25%), Positives = 14/31 (45%) Query: 34 LTLLGPSGSGKTTSLMMLAGFETPTQGEITL 64 + L G G GK+T + L G + + + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 27.4 bits (61), Expect = 0.019 Identities = 7/27 (25%), Positives = 16/27 (59%) Query: 1 MKILKRLIFICLVIIIIFFLIDCSMQK 27 +IL++L+ IC V ++ + D + + Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEY 207
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.5 bits (66), Expect = 0.026 Identities = 10/57 (17%), Positives = 24/57 (42%), Gaps = 1/57 (1%) Query: 468 ISRVAVTAAWRQQGIARRMIAAEQAHARQQQ-CDFLSVSFGYTAELAHFWHRCGFRL 523 I +AV +R++G+ ++ A++ C + + HF+ + F + Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 73.0 bits (179), Expect = 6e-16 Identities = 68/399 (17%), Positives = 140/399 (35%), Gaps = 36/399 (9%) Query: 48 DFVLITLVLTDIKQEFGLTLIQATSLISAAFISRWFGGLVLGAMGDRYGRKLAMIISIVL 107 + +++ + L DI +F + +A ++ G V G + D+ G K ++ I++ Sbjct: 29 NEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIII 88 Query: 108 FSFGTLACGLAPGYTTLFI-ARLIIGIGMAGEYGSSSTYVMESWPKNMRNKASGFLISGF 166 FG++ + + +L I AR I G G A V PK R KA G + S Sbjct: 89 NCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIV 148 Query: 167 SIGAVLAAQAYSYVVPAFGWRMLFYIGLLPIIFALWLRKNLPEAEDWEKAQSKQKKGKQV 226 ++G + + W L I ++ II +L K L + Sbjct: 149 AMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR-------------- 194 Query: 227 TDRNMVDILYRSHLSYLNIGLTIFAAVSLYLCFTGMVSTLLVVVLGILCAAIFIYFMVQT 286 + H I L + + ++ FT S +++ +L IF+ + + Sbjct: 195 ---------IKGHFDIKGIIL-MSVGIVFFMLFTTSYSISF-LIVSVLSFLIFVKHIRKV 243 Query: 287 SGD----RWPTGVMLMVVVFCAFLYSWPIQA---LLPTYLKMDLGYDPHTVGNILFFSG- 338 + + M+ V C + + ++P +K +G+++ F G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 339 FGAAVGCCVGGFLGDWLGTRK-AYVTSLLISQLLIIPLFAIQGSSILFLGGLLFLQQMLG 397 + +GG L D G + +S + F ++ +S ++F+ Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV-LGGL 362 Query: 398 QGIAGLLPKLLGGYFDTEQRAAGLGFTYNVGALGGALAP 436 ++ ++ ++ AG+ L Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGI 401 Score = 34.5 bits (79), Expect = 0.001 Identities = 35/173 (20%), Positives = 67/173 (38%), Gaps = 11/173 (6%) Query: 297 LMVVVFCAFLYSWPIQALLPTYLKMDLGYDPHTVGNILFFSGFGAAVGCCVGGFLGDWLG 356 L ++ F + L + LP + D P + + ++G V G L D LG Sbjct: 19 LCILSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG 77 Query: 357 TRKAYVTSLLISQL-LIIPLFAIQGSSILFLGGLLFLQQMLGQGIAGLLPKLLGGYFDTE 415 ++ + ++I+ +I S+L + F+Q L+ ++ Y E Sbjct: 78 IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA--RFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 416 QRAAGLGFTYNVGALGGALAPILGASIAQHLSLGTALGSLSFSLTFVVILLIG 468 R G ++ A+G + 
P +G IA ++ S+ L +I +I Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI-------HWSYLLLIPMITIIT 181
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.041 Identities = 10/23 (43%), Positives = 14/23 (60%) Query: 30 MVALLGPSGSGKTTLLRIIAGLE 52 V L G G GK+TL+ + GL+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.6 bits (230), Expect = 7e-24 Identities = 33/134 (24%), Positives = 62/134 (46%) Query: 2 KILIAEDNAHIRNGLMEVLAHEGYRPIAAENGVQALALYRQQQPDFIILDIMMPELDGYK 61 IL+A+D+A IR L + L+ GY N D ++ D++MP+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VCREIRKHDWQTPIIFLSAKDEEIDRVIGLELGADDYISKPFGIHEMRARIKTIVRRCLR 121 + I+K P++ +SA++ + + E GA DY+ KPF + E+ I + R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 KVPESAEDAGFPFG 135 + + +D+ Sbjct: 125 RPSKLEDDSQDGMP 138
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 60.2 bits (146), Expect = 5e-12 Identities = 40/259 (15%), Positives = 87/259 (33%), Gaps = 25/259 (9%) Query: 27 FRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQVTVGAQVSGQIKALHVTLGQQVE 86 F+ +S + + + E Q + K+ V +I + Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 87 KNQLVAEI--DDLAQQNALKDAEEALKNVQAQRAAKIA--TQKNNQLTYQRQQQILAKGV 142 + + + ++A+ + E + + Q +++ +++ L Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL---- 291 Query: 143 GVRADFDS-IKATLEATQAEISALDAQIAQAEIAVSTAKLNLGYTKISSPIAGTVVAIPV 201 V F + I L T I L ++A+ E + I +P++ V + V Sbjct: 292 -VTQLFKNEILDKLRQTTDNIGLLTLELAKNE-------ERQQASVIRAPVSVKVQQLKV 343 Query: 202 -EEGQTVNAVQSAPTIIKVAQLDTMTVEAQISEADVVKVKTGMPVYFTILGEPEKRF--- 257 EG V + ++ V + DT+ V A + D+ + G + P R+ Sbjct: 344 HTEGGVVTTAE--TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYL 401 Query: 258 SATLRAIEPAPDSINDETT 276 ++ I D+I D+ Sbjct: 402 VGKVKNI--NLDAIEDQRL 418 Score = 49.8 bits (119), Expect = 9e-09 Identities = 17/167 (10%), Positives = 56/167 (33%), Gaps = 17/167 (10%) Query: 10 RLIGWVVLLLFIGGLLFFRWISPPDKPSYITAVAEIRDLEQTVLADGTIKAQKQV-TVGA 68 RL+ + ++ + + + + +E A+G + + + Sbjct: 58 RLVAYFIMGFLVIAFI---L-------------SVLGQVEIVATANGKLTHSGRSKEIKP 101 Query: 69 QVSGQIKALHVTLGQQVEKNQLVAEIDDLAQQNALKDAEEALKNVQAQRAAKIATQKNNQ 128 + +K + V G+ V K ++ ++ L + + +L + ++ ++ + Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161 Query: 129 LTYQRQQQILAKGVGVRADFDSIKATLEATQAEISALDAQIAQAEIA 175 L + ++ + + + + + S Q Q E+ Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.1 bits (221), Expect = 1e-22 Identities = 32/122 (26%), Positives = 63/122 (51%), Gaps = 1/122 (0%) Query: 2 KILLVDDDLELGTMLKEYLGGEGFTAKHVLTGKAGIDGALSGDYTALILDIMLPDMSGID 61 IL+ DDD + T+L + L G+ + +GD ++ D+++PD + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRQVRK-KSRLPIIMLTAKGDNIDRVIGLEMGADDYMPKPCYPRELVARLRAVLRRFEE 120 +L +++K + LP+++++A+ + + E GA DY+PKP EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 QP 122 +P Sbjct: 125 RP 126
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 2e-04 Identities = 41/231 (17%), Positives = 83/231 (35%), Gaps = 59/231 (25%) Query: 239 ELRSPLARLQLAIGLAHQNPGNVDNAL----QRIEHESERLDKMIGEL-------LALSR 287 ++ S QL A NP + NAL I + + +M+ L L S Sbjct: 153 KMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLRYSN 212 Query: 288 AENHSLADD----DEYFDLQEL-------VKVVVNDARYEAQLPGVEIQLEVAAQSEYTV 336 A SLAD+ D Y L + + +N A + Q+P + +Q V Sbjct: 213 ARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLV-------- 264 Query: 337 KGNAELMRRAIENIVRNALRFSASGQQVKVTLSALDKRYQIQVIDQGPGVEENKLSSIFD 396 EN +++ + G ++ + + + ++V + G +N Sbjct: 265 -----------ENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------- 306 Query: 397 PFVRVKSAMSGKGYGLGLAITDK-VILAHGGQVEAR-NGEQGGLVITLRVP 445 + + G GL + + + +G + + + + +QG + + +P Sbjct: 307 ---------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.5 bits (74), Expect = 0.007 Identities = 35/164 (21%), Positives = 57/164 (34%), Gaps = 32/164 (19%) Query: 576 DDIRAVMELPQRLEAR----------VIGQPHALMQLGENIMTARAGLSDPRKPLGVFML 625 I + P+R ++ ++G+ A+ ++ + AR +D L + M+ Sbjct: 113 GIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL--ARLMQTD----LTL-MI 165 Query: 626 VGPSGVGKTETALAIAESMYGGEQNMITINMSEYQESHTVSSLKGSPPGYVGYGEGGVLT 685 G SG GK A A+ + + INM+ S L G E G T Sbjct: 166 TGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFT 217 Query: 686 EAVRRKPYSV-------VLLDEIEKAHSDVHELFFQVFDKGQME 722 A R + LDEI D +V +G+ Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYT 261
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 32.7 bits (74), Expect = 4e-04 Identities = 32/113 (28%), Positives = 49/113 (43%), Gaps = 18/113 (15%) Query: 1 MRKLNLAVCAVALSVISSTSYAAAGGTVTFNGKLIADTCQVDTASENITVTLPTLSIQSL 60 M+K+ V L + + + A +TF GKLI C V +N V + IQ+L Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTV----QNAEVNWGDIEIQNL 56 Query: 61 AVAEAQDGS--KDFEIKVLDCP-------ATLTQVGAHFNAIDSSGVNPATGN 104 Q G KDF + ++CP T+T G N+I + A+G+ Sbjct: 57 ----VQSGGNQKDFTVD-MNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGD 104
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 698 bits (1802), Expect = 0.0 Identities = 253/882 (28%), Positives = 411/882 (46%), Gaps = 57/882 (6%) Query: 35 SVLLVTKSISAVPMSQDTNESAAVIPVEFNADFIHGGG---VDVMRFMHENPVAPGVYDV 91 + V ++ +Q SA + FN F+ D+ RF + + PG Y V Sbjct: 24 AGFFVRLFVACAFAAQAPLSSAEL---YFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80 Query: 92 TVIINGKNRGKHRIRFELSEGESTAEPCFTLEQLDSIGLKIETSDTDLLVNGKAAPKDQC 151 + +N + F + E PC T QL S+GL +T + D C Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGL-----NTASVSGMNLLADDAC 135 Query: 152 YNLRALIKDSHVNYNSGDLELSLTVPQFNLVHHPRGYIDSSLWDAGGTVGFLDYNSNVYS 211 L ++I D+ + G L+LT+PQ + + RGYI LWD G G L+YN + Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN- 194 Query: 212 IFNGRSNSDVGSDNSNSYNSNIGLSAGINLGEWRFRKRLNTTWSNSSG-----MHTQNLY 266 + +G ++ +Y + L +G+N+G WR R ++++S Q++ Sbjct: 195 ----SVQNRIGGNSHYAY---LNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHIN 247 Query: 267 GYAATDITALKSQLTIGDTNTQGSLFDSYALRGVLLASDTRMLPEGIRNYSPIVRGIAET 326 + DI L+S+LT+GD TQG +FD RG LASD MLP+ R ++P++ GIA Sbjct: 248 TWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARG 307 Query: 327 NARVTVTQRGQIIYETVVTPGAFELTDIGTMSYGGDLQMTITESDGRTRIQRIPFSAPPM 386 A+VT+ Q G IY + V PG F + DI GDLQ+TI E+DG T+I +P+S+ P+ Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367 Query: 387 LLYQGVSRFDFSAGQL-NDSSINHNPAIVQGAYHYGLGNTYTLYGGAQVAENYRSVAIGN 445 L +G +R+ +AG+ + ++ P Q +GL +T+YGG Q+A+ YR+ G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 446 AFNT-PLGGVSMDITHAKSELAGDRRSSGNSYKIDYSKYVGETDTNLTLAAYRYSSGGYY 504 N LG +S+D+T A S L D + G S + Y+K + E+ TN+ L YRYS+ GY+ Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 505 SFREASLDRYGNSNGIDE---------------IDFRTRNRLSLSVSQRVADNMSVNLNS 549 +F + + R N + + + R +L L+V+Q++ ++ L+ Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547 Query: 550 SLYSYWGNQDASQQYSVGFNHSLRYFSYTVSAIRTSNSGNSSNGDNDREYENSYMLAVSI 609 S 
+YWG + +Q+ G N + ++T+S T N+ + + L V+I Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAW-------QKGRDQMLALNVNI 600 Query: 610 PIGG----SGKNKPLFSSLSTMVSHSEAGDTQLQLTTSGSRGDQNELTYGIGTSYGNRND 665 P K++ +S S +SH G G+ + N L+Y + T Y D Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 Query: 666 ASSEQSVIGNIGYQSSVGQLGMTASANNNASRQLSVSASGSLVAHQGGVIAGPRLGDAPF 725 +S + + Y+ G + S +++ +QL SG ++AH GV G L D Sbjct: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDD-IKQLYYGVSGGVLAHANGVTLGQPLNDT-V 718 Query: 726 AIINAQGAGGAKVFNGRGAKIDSNGYALVPSLTPYRENTIAIDYKDLPETVDILENHKVV 785 ++ A GA AKV N G + D GYA++P T YREN +A+D L + VD+ V Sbjct: 719 VLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANV 778 Query: 786 VPRMGAMIPVKMKTMTGNPMMLIVRDENKEFLPIGTDLLDADGVSQSIVGQGGMAFIRGW 845 VP GA++ + K G +++ + NK LP G + S IV G ++ G Sbjct: 779 VPTRGAIVRAEFKARVGIKLLMTLTHNNKP-LPFGAMVTSESSQSSGIVADNGQVYLSGM 837 Query: 846 DPVSQPITATLNGGIDKCVIKPDAKIDTATKTAQIIQLEVIC 887 + CV + ++ ++ + QL C Sbjct: 838 PLAGKVQVKWGEEENAHCVA--NYQLPPESQQQLLTQLSAEC 877
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 88.1 bits (218), Expect = 3e-21 Identities = 39/123 (31%), Positives = 63/123 (51%), Gaps = 16/123 (13%) Query: 347 FKGDSMFMVGSDNVRPEMIDVIKRVAQEVHRVK---GAILIVGHTDSMPINKPGFPNNQV 403 K D +F ++PE + ++ ++ + G+++++G+TD I + NQ Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQG 272 Query: 404 LSEKRAANVARYMEQAGIPTDKIRFEGKGETQPVSSN--DDATGRSQ-------NRRVEI 454 LSE+RA +V Y+ GIP DKI G GE+ PV+ N D+ R+ +RRVEI Sbjct: 273 LSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332 Query: 455 FVN 457 V Sbjct: 333 EVK 335
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 49.2 bits (117), Expect = 2e-08 Identities = 34/145 (23%), Positives = 43/145 (29%), Gaps = 11/145 (7%) Query: 698 LLPIVVPPVTSPPDPTLPPDPTLPPDPTLPPETTAPPETTAPPETTAPPETTAPPETTAP 757 LL V V P P P T+ L P P PPE PE PE Sbjct: 32 LLYTSVHQVIELPAPAQPISVTMVAPADLEP----PQAVQPPPEPVVEPE----PEPEPI 83 Query: 758 PETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAP 817 PE P P + P+ P + P +P E TAP T+ Sbjct: 84 PEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRP---ASPFENTAPARPTSS 140 Query: 818 PETTAPPETTAPPETTAPPETTAPP 842 T A + + + P Sbjct: 141 TATAATSKPVTSVASGPRALSRNQP 165 Score = 48.8 bits (116), Expect = 3e-08 Identities = 30/129 (23%), Positives = 39/129 (30%), Gaps = 9/129 (6%) Query: 722 PDPTLPPETT--APPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETT 779 P P P T AP + P PPE PE PE PE Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKP 99 Query: 780 APPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETT 839 P P + P+ P + P +P E TAP T+ T A + + Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRP---ASPFENTAPARPTSSTATAATSKPVTSVASG 156 Query: 840 APPETTAPP 848 + P Sbjct: 157 PRALSRNQP 165 Score = 47.3 bits (112), Expect = 1e-07 Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%) Query: 761 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 820 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 821 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPP 860 P+ P + P P +TA T+ P Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 Score = 47.3 bits (112), Expect = 1e-07 Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%) Query: 785 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 844 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 845 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPP 884 P+ P + P P +TA T+ P Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 
Score = 47.3 bits (112), Expect = 1e-07 Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%) Query: 797 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 856 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 857 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPP 896 P+ P + P P +TA T+ P Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 Score = 47.3 bits (112), Expect = 1e-07 Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%) Query: 803 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 862 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 863 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPP 902 P+ P + P P +TA T+ P Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 Score = 47.3 bits (112), Expect = 1e-07 Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%) Query: 809 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 868 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 869 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPP 908 P+ P + P P +TA T+ P Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 Score = 46.9 bits (111), Expect = 1e-07 Identities = 21/93 (22%), Positives = 25/93 (26%), Gaps = 4/93 (4%) Query: 827 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 886 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 887 TAPPETTAPPETTAPPETTAPPEPTRTPPGTQT 919 P+ P + P P R T T Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTAT 143 Score = 46.1 bits (109), Expect = 3e-07 Identities = 26/118 (22%), Positives = 35/118 (29%), Gaps = 7/118 (5%) Query: 791 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 850 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 851 
TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPP 908 P+ P + P +P E TAP T+ T A + + + P Sbjct: 111 VEQPKRDVKPVESRP---ASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQP 165 Score = 45.7 bits (108), Expect = 3e-07 Identities = 20/101 (19%), Positives = 26/101 (25%), Gaps = 4/101 (3%) Query: 821 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 880 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 881 TAPPETTAPPETTAPPETTAPPETTAPPEPTRTPPGTQTPP 921 P+ P + P P T T ++ Sbjct: 111 VEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVT 151 Score = 45.0 bits (106), Expect = 5e-07 Identities = 22/108 (20%), Positives = 30/108 (27%), Gaps = 5/108 (4%) Query: 815 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPETTAPPET 874 AP + P PPE PE PE PE P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPE----PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 875 TAPPETTAPPETTAPPETTAPPETTAPPETTAPPEPTRTPPGTQTPPP 922 P+ P + P + A P ++ T P + P Sbjct: 111 VEQPKRDVKPVES-RPASPFENTAPARPTSSTATAATSKPVTSVASGP 157
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 28.3 bits (63), Expect = 0.029 Identities = 15/99 (15%), Positives = 29/99 (29%), Gaps = 16/99 (16%) Query: 55 QLETLTQLLPEFTKQAELYKNLILSEKMRDEVLAGKRSPGTL--------GNDLPEWVAL 106 Q+ +PE ++ + ++ + + + E + Sbjct: 86 QVNQYLSKVPELEQKQNV-------SELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKM 138 Query: 107 LQQA-NQLHHDGDHQQSEALREQALQQAPESIGESAATG 144 L + L + L EQAL E GE+ G Sbjct: 139 LCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLG 177
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 28.9 bits (64), Expect = 0.011 Identities = 17/45 (37%), Positives = 23/45 (51%) Query: 102 LLSVLIYAISSVSDQGISGEMVDAKAVGISLFGPYVLAVELASML 146 L+S +Y+ + Q +SG+ VD K V SL P L SML Sbjct: 251 LVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESML 295
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 203 bits (517), Expect = 8e-70 Identities = 122/174 (70%), Positives = 135/174 (77%), Gaps = 3/174 (1%) Query: 1 MKKIACLSAVAACVLAVTAGSAFAGQSTVSGGYAQSDYQGVANKSSGFNLKYRYEWSDSQ 60 MKKIACLSA+AA LA TAG++ A STV+GGYAQSD QG NK GFNLKYRYE +S Sbjct: 1 MKKIACLSALAAV-LAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSP 59 Query: 61 LGYITSFTHTEKSGFGDEAVYNKAQYNAITGGPAYRINDWASIYGLVGVGHGRFTQNESA 120 LG I SFT+TEKS YNK QY IT GPAYRINDWASIYG+VGVG+G+F E Sbjct: 60 LGVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTE-- 117 Query: 121 FVGDKHSTSDYGFTYGAGLQFNPAENVALDVSYEQSRIRNVDVGTWVAGVGYTF 174 + KH TSDYGF+YGAGLQFNP ENVALD SYEQSRIR+VDVGTW+AGVGY F Sbjct: 118 YPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.2 bits (65), Expect = 0.024 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 40 NDYFVSMKEALEQAANDIGAKVYIADAGHDVSKQINDVED---MLQKKIDILLINPTDSV 96 D+F S++ L A D A+ + + Q + K+++I + D + Sbjct: 110 QDFFTSLQT-LVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQI 168
>PF05860#haemagglutination activity domain. Length = 117 Score = 82.2 bits (203), Expect = 2e-20 Identities = 21/141 (14%), Positives = 41/141 (29%), Gaps = 24/141 (17%) Query: 68 AAIVADASAPGNQQPTIINSANGTPQVNIQAPSSGGVSRNVYSQFDVDGRGVILNNGHGV 127 A I D + P N + I + T + + + + + +F V G N Sbjct: 1 AQITPDTTLPIN---SNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFN---- 52 Query: 128 NQTELGGFIDGNPWLARGEASIILNEVNSRDPSKLNGYIEVAGRKAQVVIANSAGITCEG 187 I++ V S ++G I A + + N GI Sbjct: 53 ---------------NPTNIQNIISRVTGGSVSNIDGLIRANAT-ANLFLINPNGIIFGQ 96 Query: 188 CGFINANRVTLTTGQAQLNNG 208 ++ + + +L Sbjct: 97 NARLDIGGSFVGSTANRLKFA 117
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 7e-04 Identities = 16/49 (32%), Positives = 23/49 (46%), Gaps = 1/49 (2%) Query: 32 IVLVGPSGCGKSTLLRMIAGLEDVNSGEIKI-EDKDVTQTNAGARGVSM 79 +VL G G GKSTL+ + GL+ + I KD + AG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYEL 647
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.1 bits (101), Expect = 9e-06 Identities = 39/287 (13%), Positives = 83/287 (28%), Gaps = 21/287 (7%) Query: 504 QLHEAEMAQPLEEATIERKRPEQPALATFSLPTEVPPEEAPTVAKAKPAVATPAAVSTDV 563 L+ E+ + T++ P +P+ P +A+ A P A +T Sbjct: 979 DLYNPEVEK--RNQTVDTTNITTPNNIQADVPSV--PSNNEEIARVDEAPVPPPAPATPS 1034 Query: 564 EQPGFFSRLFSGLKNMFGASAEAEVQPAEVVKTDASENRRNDRR-----NPRRQNNGRKE 618 E +++ E + E + DA+E +R + N + Sbjct: 1035 ET-----------TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN 1083 Query: 619 RNDRTPREGRDNSSRDNTNRDNTSRDNANRDGANRDNSNRDNSGRDNVSREGREDQRRNN 678 ++ E ++ + + ++ + + + + + +E E + Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143 Query: 679 RRPAQPTTTSQGQTEVVEADKAQREEQPQRRGDRQRRRQDEKRQAPQEIKADVAEAPVIE 738 + T + + + EQP + Q V E P Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE-QPVTESTTVNTGNSVVENPENT 1202 Query: 739 EVQPEQEERQQVMQRRQRRQLNQKVRIQSANDELNTLESPVSAPVAQ 785 Q + + + + VR N E T S + VA Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249 Score = 41.2 bits (96), Expect = 3e-05 Identities = 46/326 (14%), Positives = 92/326 (28%), Gaps = 35/326 (10%) Query: 671 REDQRRNNRRPAQPTTTSQGQTEVVEADKAQREEQPQRRGDR-QRRRQDEKRQAPQEIKA 729 ++R TT + Q + P + + R DE P Sbjct: 984 EVEKRNQTVDTTNITTPNNIQAD-----------VPSVPSNNEEIARVDEAPVPPPAPAT 1032 Query: 730 DVAEAPVIEEVQPEQEERQQVMQRRQRRQLNQKVRIQSANDELNTLESPVSAPVAQVVVA 789 + E ++ + + ++ Q R + + N + + VAQ Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQN-REVAKEAKSNVKANTQTNEVAQS--- 1088 Query: 790 EVQEEVKLLPQITAQTDDDSANERTTNNENGMPRRSRRSPRHLRVSGQRRRRYRDERYPA 849 E + T +T E+ +++ P+ ++ + + A Sbjct: 1089 -GSETKETQTTETKETATVEKEEK----AKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143 Query: 850 QSAMPLAGAFASPEMASGKVWVRYPVTPVVEQVVVEQIAIEQTTTVEQTAIVEQVSVANI 909 + A E P + EQ A E ++ VEQ Sbjct: 1144 EPARENDPTVNIKE----------PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193 Query: 910 VTAQLPVEQVQNTVAEQESSATPSVMTTPTVAVTLAPQHKPGGSSSSAAAVPGRAPIVAA 969 + P T +S + 
+ P + + P + + R+ VA Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNK---PKNRHRRSVRSVPHNVEPATTSSNDRST-VAL 1249 Query: 970 VPVVAETTAAETVVAKTEAAIDAVAV 995 + + T A A+ +A A+ V Sbjct: 1250 CDLTSTNTNAVLSDARAKAQFVALNV 1275
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.5 bits (152), Expect = 6e-13 Identities = 27/109 (24%), Positives = 52/109 (47%), Gaps = 5/109 (4%) Query: 1 MSKIRVLCVDDSALMRQLMTEIINSHPDMEMVAAAQDPLVARDLIKKFNPQVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + ++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKNSEITM-RALELGAIDFVTKP 108 + D L ++ + RP V+V ++ +N+ +T +A E GA D++ KP Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP 105
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.7 bits (220), Expect = 6e-24 Identities = 34/105 (32%), Positives = 53/105 (50%), Gaps = 3/105 (2%) Query: 7 RFLVVDDFSTMRRIVRNLLKELGFHNVEEAEDGVDALNKLRAGGFDFVVSDWNMPNMDGL 66 LV DD + +R ++ L G+ +V + + AG D VV+D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY-DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 DLLKTIRTDGALATLPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 DLL I+ A LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 62.9 bits (153), Expect = 2e-14 Identities = 22/78 (28%), Positives = 34/78 (43%) Query: 799 DSTLSAGIAGAMAMASLTQPYTPGASMATIGAASYRGQSALSVGVSSISDSGRWVSKLQA 858 L G+A A++ L QP G + + YR ++AL++GV S A Sbjct: 2 SKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVA 61 Query: 859 SSNTQGDMGVGVGVGYQW 876 + G M G VGY++ Sbjct: 62 FNTYNGGMSYGASVGYEF 79
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 201 bits (512), Expect = 3e-63 Identities = 85/354 (24%), Positives = 160/354 (45%), Gaps = 30/354 (8%) Query: 45 AWLEISQGALDFNTKKMLTLLDNKSTLCAILKGDAYGHDLTLVTPVMLKNNVQCIGVASN 104 + AL N ++ + + +++K +AYGH + + + + + + Sbjct: 5 IQASLDLQALKQNLS-IVRQAATHARVWSVVKANAYGHGIERIWSAIGATD--GFALLNL 61 Query: 105 QELKTVRDLGFTGQLIRVRSAT-LKEMQQAMAYDVEELIGDKTVAEQLNNIAKLNGKVLR 163 +E T+R+ G+ G ++ + ++++ + + + + L N L Sbjct: 62 EEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARL--KAPLD 119 Query: 164 IHLALNSAGMSRNGLEVSKARGLNDAKTIVGLKNLTIVGIMSHYPVEDASE-IKADLARF 222 I+L +NS GM+R G + + L + + + N+ + +MSH+ + + I +AR Sbjct: 120 IYLKVNS-GMNRLGFQPDRV--LTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARI 176 Query: 223 QQQAKDVIAVTGLKREKIKLHVANTFATLAVPDSWLDMVRVGGVFYG-------DTIAST 275 +Q A GL+ + ++N+ ATL P++ D VR G + YG IA+T Sbjct: 177 EQ------AAEGLECRR---SLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANT 227 Query: 276 EYKRVMTFKSNIASLNNYPKGGTVGYDRTYTLKRDSLLANIPVGYADGYRRVFSNAGHVI 335 + VMT S I + G VGY YT + + + + GYADGY R V+ Sbjct: 228 GLRPVMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVL 287 Query: 336 IQGQRLPVLGKTSMNTVMVDVTDLKKVSLGDEVVLFGKQGNAEIQAEEIEDLSG 389 + G R +G SM+ + VD+T + +G V L+GK EI+ +++ +G Sbjct: 288 VDGVRTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWGK----EIKIDDVAAAAG 337
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 460 bits (1185), Expect = e-151 Identities = 141/825 (17%), Positives = 291/825 (35%), Gaps = 78/825 (9%) Query: 46 TLYLELVVNDRNFGSA-VPISYRNNRYY----LSQSQLRTIGLPISEPLAPEIAIDN--- 97 T +++ +N+ + V + ++ L+++QL ++GL ++ + + Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNT-ASVSGMNLLADDAC 135 Query: 98 ------MAGVNVKYDGENQRLLINVPSEWLPKQQIEVTEQDDFNLAQSSLGALFNYDIYA 151 + + D QRL + +P ++ + + ++ L NY+ Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWD--PGINAGLLNYNFSG 193 Query: 152 TQGYPYSSLTHFSAWTEQRIFDRFGLLSNTGVYRTHFPSNNNTDDAKGYIRFDTQWQKND 211 A+ + G + S++++ +K + W + D Sbjct: 194 NSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERD 253 Query: 212 EEHLL-RYSAGDLITGALPWSSAIRLGGIQIARHFAIRPDLITYPLPQFSGQAAVPSTVD 270 L R + GD T + I G Q+A + PD P G A + V Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 271 LYIDNFRTQSANINPGPFVINNAPRINGAGQATIVTTDALGRQISTSVPFYVASTLLKPG 330 + + + ++ + PGPF IN+ +G + +A G +VP+ L + G Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 331 VWDFSLSGGALRRNYAIRSADYGEMVASGVVRYGTTPWLTLEGRGDIAKEMHVIGGGVNF 390 +S++ G R A + + +G T+ G +A G+ Sbjct: 373 HTRYSITAGEYRSGNAQQEKPR---FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429 Query: 391 RMGLLGVLNSAYSISNTSNGAFNNVAEPLNTNNATSNRLPPPAASRRGRGNQRSLGYSYS 450 MG LG L+ + +N++ + ++ S R + N + +GY YS Sbjct: 430 NMGALGALSVDMTQANST-------LPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYS 482 Query: 451 NA-FFNL--------NAQHIISSDEYSD----LANYKTPSLLSRRMTQLTGSLSLGSYGT 497 + +FN N +I + D +Y + R QLT + LG T Sbjct: 483 TSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTST 542 Query: 498 V----------GSGYFDVRDALGEQTRLINISYSTSLLRNSNFYSALNRELGRKGYNVQL 547 + G+ D + G T +I+++ S N + + + L Sbjct: 543 LYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQ------KGRDQMLAL 596 Query: 548 VWSIPLGPR-----------GSSSISATRTNDNQWIQQLNYSRSAPSNGGLGWNL--AYA 594 +IP S+S S + + + + + L +++ YA Sbjct: 597 
NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656 Query: 595 NSTNNNNQ-YQQADIVWRTSMMESRMGLYGNSNNYNYWGGLTGSLVVMNRSVYASNMIND 653 + N+ A + +R + +G + + + G++G ++ V +ND Sbjct: 657 GGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLND 716 Query: 654 AFALVSTNGFSNIPVSYENQLIGTTNAKGYLLIPTVASYYQAKFQIDPMNLPADVMLPNV 713 LV G + V ENQ T+ +GY ++P Y + + +D L +V L N Sbjct: 717 TVVLVKAPGAKDAKV--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774 Query: 714 ERRLAIGERSGYLINFPIKRISAVNIRITDASGQDLPKGSAIYTTGNIPISYVGWDGMVY 773 + + F + + + +T + + LP G+ + + + V +G VY Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVY 833 Query: 774 IEQVAQLNNLRI-IRADNGTQCYSQFKLKTTEGIQDAG--TTVCR 815 + + +++ + C + ++L Q + CR Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 28.9 bits (64), Expect = 0.016 Identities = 17/67 (25%), Positives = 26/67 (38%), Gaps = 7/67 (10%) Query: 53 DSSNF-GSINFGNITSLATAINATSGLNAGTITIQCNGNPSVTLALNSGANMTGNISAGR 111 + +N G++N + G TIQ GN V L NS ++TGN + Sbjct: 811 NPTNLRGNVNLTESANFVLGKANLFG------TIQSRGNSQVRLTENSHWHLTGNSDVHQ 864 Query: 112 HLLNSST 118 L + Sbjct: 865 LDLANGH 871
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.1 bits (75), Expect = 0.001 Identities = 17/90 (18%), Positives = 26/90 (28%), Gaps = 11/90 (12%) Query: 92 EEQHVEHARKQLEEAKARVQAQRAEQQAKKREAAIAAGETPEPRRPRPAGKKPAPRREAG 151 EE+ K E K V +Q + +Q + A EP R P + Sbjct: 1109 EEKAKVETEKTQEVPK--VTSQVSPKQEQSETVQPQA----EPAREN----DPTVNIKEP 1158 Query: 152 AAPENRKPRQS-PRPQQVRPPRPQVEENQP 180 + N P + V E+ Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTT 1188
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 829 bits (2144), Expect = 0.0 Identities = 284/874 (32%), Positives = 447/874 (51%), Gaps = 39/874 (4%) Query: 17 LPAFSFAICGIGGMLYIPSSAAENSEYVEFSDAFL----RFPVDATRYSEGNPVSPGERQ 72 F P S+AE + F+ FL + D +R+ G + PG + Sbjct: 24 AGFFVRLFVACAFAAQAPLSSAE----LYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79 Query: 73 VDIYLNDQWIGRQEMRFALPSPESKVATPCFDVKLFDELGVDTAKLSSDTVKLLESRGAC 132 VDIYLN+ ++ +++ F + PC +G++TA +S L + AC Sbjct: 80 VDIYLNNGYMATRDVTFN-TGDSEQGIVPCLTRAQLASMGLNTASVSGMN---LLADDAC 135 Query: 133 SPLSRLLEGGNAIFDDNQQRLDIQVPQAYLIRQARGYVHPKYWDDGVTAATLKYDYTGYR 192 PL+ ++ A D QQRL++ +PQA++ +ARGY+ P+ WD G+ A L Y+++G Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195 Query: 193 SNQNDIGSQTYQYLGLLGGLNWQSWRLYYRSALNRSDSQG-----FDYQNLATYVERAVP 247 G+ Y YL L GLN +WRL + + + S +Q++ T++ER + Sbjct: 196 VQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDII 255 Query: 248 SLYSKMTIGDSNTDGQVFDSLSYRGIELTSDDRMYADSQRGYAPVVRGVARTNARVVVRQ 307 L S++T+GD T G +FD +++RG +L SDD M DSQRG+APV+ G+AR A+V ++Q Sbjct: 256 PLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQ 315 Query: 308 QGRPIYETTVPPGPFVIDDLYPTGQGGNLNVTITEADGSEQTFIVPFASIAELLRPGTTR 367 G IY +TVPPGPF I+D+Y G G+L VTI EADGS Q F VP++S+ L R G TR Sbjct: 316 NGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTR 375 Query: 368 YSLMAGEYR-DNSMVDKPVLFMGTVRHGLSNLLTGNGGMVAAEGYLSASAGLAFNT-PVG 425 YS+ AGEYR N+ +KP F T+ HGL T GG A+ Y + + G+ N +G Sbjct: 376 YSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALG 435 Query: 426 AVAFNVTQAQTRLPNKDNQRGQSIGMTYAKSLPETNTNLTIASYHYSSNGFYTPAEAMRM 485 A++ ++TQA + LP+ GQS+ Y KSL E+ TN+ + Y YS++G++ A+ Sbjct: 436 ALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYS 495 Query: 486 RDYLQHGEVNNTQIDSSWPNGSDRYDDSFKYRRRNQAQVSIAQGLPDGYGSFYANANVQD 545 R + E + P +D Y+ + Y +R + Q+++ Q L + Y + + Q Sbjct: 496 RMNGYNIE-TQDGVIQVKPKFTDYYNLA--YNKRGKLQLTVTQQLGR-TSTLYLSGSHQT 551 Query: 546 
YWDGRNRDMNFQFGYTNSYKSLSYNVALNRLRDIPSGDWDNQLSVSLSIPLG------TH 599 YW N D FQ G +++ +++ ++ + ++ D L+++++IP + Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSK 611 Query: 600 AGAPRLSSSYSNTR---GSSAIQTGVSGSAGEDNQFSYGVSAANNRSDENGSYNTLGANG 656 + S+SYS + G GV G+ EDN SY V + S +T A Sbjct: 612 SQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATL 671 Query: 657 SWQAPYATVGGSYSKSNSYDQASASLSGGVVAYRGGVILAPALGDTVGIIEAPDAAGARV 716 +++ Y YS S+ Q +SGGV+A+ GV L L DTV +++AP A A+V Sbjct: 672 NYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKV 731 Query: 717 GSYSSMYLDRRGRAILPYLSPYRQNEVELDPKGLSADVEFKSTSQKVAPTAGAVALVTFE 776 + + + D RG A+LPY + YR+N V LD L+ +V+ + V PT GA+ F+ Sbjct: 732 ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFK 791 Query: 777 TSTGYSVLVRGHLADNTPLPFGAEVKDGGGTRVGFIAQGGQAMVRVNQQAGNLRVIWGDG 836 G +L+ +N PLPFGA V G +A GQ + AG ++V WG+ Sbjct: 792 ARVGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEE 850 Query: 837 IGESCSFDYKLPEGNLVKGHLVKGDYRRLEVICK 870 C +Y+LP + +L C+ Sbjct: 851 ENAHCVANYQLPP------ESQQQLLTQLSAECR 878
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 116 bits (292), Expect = 2e-30 Identities = 85/400 (21%), Positives = 165/400 (41%), Gaps = 14/400 (3%) Query: 25 IMMAVLDGTIANVALPTIARDLNTSPATSIWVVNAYQLAITISLLSMASLGDIIGYRRVY 84 +VL+ + NV+LP IA D N PA++ WV A+ L +I L D +G +R+ Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 85 QAGLLIFSVTSLFCALSDSLWTLT-FARVLQGFGAAALMSVNTALIRIIYPRAQLGRGIG 143 G++I S+ + S ++L AR +QG GAAA ++ ++ P+ G+ G Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 144 INTLIVAVSSAAGPSIAAAVLSVASWQWLFALNVPIGLLAWCLGIKFLPANNTKSNGNRF 203 + IVA+ GP+I + W +L + + I ++ +K L F Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLK--KEVRIKGHF 199 Query: 204 DITSCVLNALTFGLLITAISGFSQGQSPAVIAAQVVALLLIGFFFVRRQLTQSFPLLPVD 263 DI + L+ I F + I+ +V++L FV+ + P + Sbjct: 200 DIKGII-------LMSVGIVFFMLFTTSYSISFLIVSVLSF-LIFVKHIRKVTDPFVDPG 251 Query: 264 LLRIPIFALSIGTSIFSFAAQMLAMVSLPFFLQTVLGRDEVATG-LLLTPWPLATMVIAP 322 L + F + + F + +P+ ++ V G +++ P ++ ++ Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 323 IAGRLVERYHAGLLGGIGLAVFASGLFLLAVLPANPSDVDIIWRMILCGAGFGLFQTPNN 382 I G LV+R + IG+ + + L + + ++ G +T + Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTKTVIS 370 Query: 383 HTIISAAPQHRSGGASGMLGTARLLGQTSGAALVALMFNM 422 + S+ Q +G +L L + +G A+V + ++ Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 37.0 bits (85), Expect = 1e-04 Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 10/95 (10%) Query: 60 EVRIGDRVVNNLAPKSRGIAM-VFQNYALYPHMTVKENLAFGLKLSKLPKDQIEAQVAEA 118 +V +G V N A + G+A VF A E LA L++ DQI+ + ++ Sbjct: 504 KVALGMEVTNTAAQSAGGVAEGVFIKNA-------SEALA-DFMLARFAMDQIQQWLKQS 555 Query: 119 AKIL-ELEDLLDRLPRQLSGGQAQRVAVGRAIVKK 152 +I E + + L + +S Q R I+++ Sbjct: 556 VEIFGENQKVTAELQKAMSSAVQQNADASRFILRQ 590
>PF06917#Periplasmic pectate lyase Length = 555 Score = 995 bits (2575), Expect = 0.0 Identities = 553/555 (99%), Positives = 554/555 (99%) Query: 1 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD 60 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD Sbjct: 1 MGFTGNIKGSDAMMINWLSAIRSYVDLVQSVGHSQLNPSPLLADGFDVLTHQPVVWEFPD 60 Query: 61 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARVQSGYFMQHGVHNESGLFYWGGHR 120 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQAR+QSGYFMQHGVHNESGLFYWGGHR Sbjct: 61 GHHTPISNFASQQNWLRTLDALSLVTQDPQYHQQARIQSGYFMQHGVHNESGLFYWGGHR 120 Query: 121 FLNLDTLKTEGPASKDQVHELKHHLPYYDLLVTIDRERTLNFLQGFWHAHVEDWKTLDLG 180 FLNLDTLKTEGPASKDQVHELKHHLPYYDLLVTIDRERTLNFLQGFWHAHVEDWKTLDLG Sbjct: 121 FLNLDTLKTEGPASKDQVHELKHHLPYYDLLVTIDRERTLNFLQGFWHAHVEDWKTLDLG 180 Query: 181 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA 240 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA Sbjct: 181 RHGNYSKQRDPQVFTHPRYDVVNPAELPKLPETKGLTFVNAGTDLIYAAYKYAEYTGDAA 240 Query: 241 AAAWGKHLYCQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG 300 AAAWGKHLY QYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG Sbjct: 241 AAAWGKHLYRQYVLARNPETGLPVYQFSSPQQRQPIPADDNQTQSWYGDRAKRQFGPEFG 300 Query: 301 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR 360 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR Sbjct: 301 EIAREANVLFRDMRPLLIDNPLAMLDILRQQPDAEVLQWVIDGLKNYYRFAYDVESNTLR 360 Query: 361 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL 420 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL Sbjct: 361 PLWNDGQDMSGYVLPRDGYYGVKGTVISPFPLDVDYLLPLVRAWRLSEDEELLDLIGVLL 420 Query: 421 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH Sbjct: 421 LRWQLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQIGDDLFKRH 480 Query: 481 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR 540 YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR Sbjct: 481 
YHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKLAAIPQFITNGGYIHGDYRVNGESR 540 Query: 541 TLYDIDFIYPTLLNQ 555 TLYDIDFIYPTLLNQ Sbjct: 541 TLYDIDFIYPTLLNQ 555
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 120 bits (301), Expect = 5e-35 Identities = 75/252 (29%), Positives = 127/252 (50%), Gaps = 11/252 (4%) Query: 8 LKGKVALVTGCDTGLGQGMAIGLAEAGCDIIGVN-IVEPRETIEQ-VTALGRRFFSLTAD 65 ++GK+A +TG G+G+ +A LA G I V+ E E + + A R + AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 66 LSNIECIPSLLERAVAEFGHIDILVNNAGIIRREDAINFSEKDWDDVMNVNIKSVFFMSQ 125 + + I + R E G IDILVN AG++R + S+++W+ +VN VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 126 AVAKQFIKQGNGGKIINVASMLSYQGGIRVPSYTASKSAVMGVTRLLANEWAKHGINVNA 185 +V+K + + G I+ V S + + +Y +SK+A + T+ L E A++ I N Sbjct: 126 SVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 186 VAPGYMATNNTQQLRKDEERSKEILD--------RIPAGRWGLPDDLKGPVVFLASKASD 237 V+PG T+ L DE +++++ IP + P D+ V+FL S + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 238 YISGYTIAVDGG 249 +I+ + + VDGG Sbjct: 245 HITMHNLCVDGG 256
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 388 bits (997), Expect = e-138 Identities = 106/309 (34%), Positives = 179/309 (57%), Gaps = 7/309 (2%) Query: 21 LRAAALFTIVAFSSLISTAALAENNPSDTAKKFKVVTTFTIIQDIAQNIAGDVAVVESIT 80 ++ ++ S++I A + + + +K KVV T +II DI +NIAGD + SI Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 81 KPGAEIHDYQPTPRDIVKAQSADLILWNGMNLER----WFEKFFESIK---DVPSAVVTA 133 G + H+Y+P P D+ K ADLI +NG+NLE WF K E+ K + V+ Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSD 120 Query: 134 GITPLPIREGPYSGIANPHAWMSPSNALIYIENIRKALVEHDPANAETYNRNAQAYAEKI 193 G+ + + G +PHAW++ N +I+ +NI K L DP N E Y +N + Y +K+ Sbjct: 121 GVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKL 180 Query: 194 KALDAPLRERLSRIPAEQRWLVTSEGAFSYLAKDYGFKEVYLWPINAEQQGIPQQVRHVI 253 LD +++ ++IPAE++ +VTSEGAF Y +K YG Y+W IN E++G P+Q++ ++ Sbjct: 181 DKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV 240 Query: 254 DIIRENKIPVVFSESTISDKPAKQVSKETGAQYGGVLYVDSLSGEKGPVPTYISLINMTV 313 + +R+ K+P +F ES++ D+P K VS++T ++ DS++ + +Y S++ + Sbjct: 241 EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNL 300 Query: 314 DTIAKGFGQ 322 D IA+G + Sbjct: 301 DKIAEGLAK 309
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 162 bits (411), Expect = 1e-51 Identities = 94/249 (37%), Positives = 132/249 (53%), Gaps = 18/249 (7%) Query: 10 RRLTWSLIFSIGLHGSVVAALLYVSVEQMKIQPEIEDTPLAVTMVNIAEFAAPQPAAAAP 69 RR W + S+ +HG+VVA LLY SV Q+ P P++VTMV A Sbjct: 7 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQ-PISVTMVT-----------PAD 54 Query: 70 EPVQETPAVPEETPPVLEETPPEPEELPEPVPVPVPEPV-KPKPKPVKKEVKKEVKKPEV 128 + P E E P E P+ PV + +P KPKPKP + +E K +V Sbjct: 55 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDV 114 Query: 129 KKTQAPPDDKPFKSDEAALVANNAPVKSAPVASTPGLSTSAGPKALSKAKPSYPARALAL 188 K ++ P PF++ A + ++ + P S ++GP+ALS+ +P YPARA AL Sbjct: 115 KPVESRPA-SPFENTAPARLTSSTATAATS---KPVTSVASGPRALSRNQPQYPARAQAL 170 Query: 189 GIEGQVKVQYDIDESGRVTNVRVLEATPRNTFEREVKQVMRKWRFEA-VAAKNYVTTIVF 247 IEGQVKV++D+ GRV NV++L A P N FEREVK MR+WR+E V I+F Sbjct: 171 RIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILF 230 Query: 248 KLDGKMEMN 256 K++G E+ Sbjct: 231 KINGTTEIQ 239
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 161 bits (410), Expect = 2e-53 Identities = 72/180 (40%), Positives = 106/180 (58%), Gaps = 10/180 (5%) Query: 1 MKWITTLAPLSLALSLGISVANAASDASNTVSFGYAQSTLKIDGEKIGKDNKGFNLKYRH 60 MK I L+ L+ L+ + AA+ +TV+ GYAQS + K+ GFNLKYR+ Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAAT---STVTGGYAQSDAQGQMNKM----GGFNLKYRY 53 Query: 61 ELD-SVLGIVASFTHTKQNYGMPGDSDGKRKVEYYSLMVGPSWRFNEFVSAYALIGATQG 119 E D S LG++ SFT+T+++ K +YY + GP++R N++ S Y ++G G Sbjct: 54 EEDNSPLGVIGSFTYTEKSRTASSGDYNK--NQYYGITAGPAYRINDWASIYGVVGVGYG 111 Query: 120 KSTHTKPRMVSNTVSKTSMGYGAGLQFNPVKHVAIDTAYEYAKIEDVKIGTWIVGVGYRF 179 K T+ + S YGAGLQFNP+++VA+D +YE ++I V +GTWI GVGYRF Sbjct: 112 KFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 54 VVGESGCGKSTFARAI 69 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.4 bits (66), Expect = 0.032 Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 14/88 (15%) Query: 1 MKVTVFGI-GYVGLVQATVLAEVGHDVLCID-IDANKVADLKKGRIAIFEPGLAPLVK-- 56 MK V G G++G + L E GH V+ ID ++ LK+ R+ + K Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 57 -ENYEAGRLQFSTD---------AQAGV 74 + E F++ + V Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAV 88
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.7 bits (207), Expect = 5e-20 Identities = 31/115 (26%), Positives = 48/115 (41%), Gaps = 1/115 (0%) Query: 10 ILVVEDEVVFRTVLAEYLGSLGATIHQAENGLAALYQLKGHSPDLILCDLAMPKMGGIEF 69 ILV +D+ RTVL + L G + N + DL++ D+ MP + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 VEQLLLKGIKIPVLVISATDKMADIAQVLRLGVKDVLLKPIVDLNRLREAVLACL 124 + ++ +PVLV+SA + + G D L KP DL L + L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGIIGRAL 119
>SECA#SecA protein signature. Length = 901 Score = 46.4 bits (110), Expect = 1e-08 Identities = 15/23 (65%), Positives = 18/23 (78%) Query: 132 PSLGRNDTCLCGSGKKHKKCCGR 154 +GRND C CGSGKK+K+C GR Sbjct: 877 RKVGRNDPCPCGSGKKYKQCHGR 899 Score = 27.9 bits (62), Expect = 0.019 Identities = 8/14 (57%), Positives = 9/14 (64%) Query: 5 CPCGSILNYHECCG 18 CPCGS Y +C G Sbjct: 885 CPCGSGKKYKQCHG 898
>PF05860#haemagglutination activity domain. Length = 117 Score = 52.1 bits (125), Expect = 6e-10 Identities = 20/97 (20%), Positives = 33/97 (34%), Gaps = 20/97 (20%) Query: 59 VINIAPPSEHGLSHNQYMEFHVNEHGVVFNNSLERVVKNGVTYDANLNLRGSPARVILNE 118 +I + L H+ + EF V G F N+ + + I++ Sbjct: 23 IIERGTQAGSNLFHS-FQEFSVPTSGTAFFNN------------------PTNIQNIISR 63 Query: 119 VVGLNASVLAGHQDIVGIPADYILANANGISCQGCSF 155 V G + S + G A+ L N NGI + Sbjct: 64 VTGGSVSNIDGLIRANA-TANLFLINPNGIIFGQNAR 99
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 62.5 bits (152), Expect = 7e-13 Identities = 66/356 (18%), Positives = 127/356 (35%), Gaps = 15/356 (4%) Query: 14 FLLFDNLLVVLGFFVVFPLISIRFVDQLGWAALVV---GLALGLRQLVQQGLGIFGGAIA 70 +L L +G ++ P++ + L + V G+ L L L+Q GA++ Sbjct: 9 VILSTVALDAVGIGLIMPVLPG-LLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67 Query: 71 DRFGAKPMIVTGMLMRAAGFALMAMADEPWILWLACALSGLGGTLFDPPRTALVIKLTRP 130 DRFG +P+++ + A +A+MA A W+L++ ++G+ G A + +T Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126 Query: 131 HERGRFYSLLMMQDSAGAVIGALIGSWLLQYDFHFVCWTGAAIFVLAAGWNAWLLPAYRI 190 ER R + + G V G ++G + + H + AA+ L +LLP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186 Query: 191 STVRAPMKEGLMRVLRDRRFVTYVLTLTGYYMLAVQVMLMLPI--------VVNELAGSP 242 R P++ + L R+ + + + + L+ + + Sbjct: 187 GE-RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDA 245 Query: 243 AAVKWMYAIEAALSLTLLYPLARWSEKRFSLEQRLMAGLLIMTLSLFPIGMITHLQTLFM 302 + A L + R + LM G++ + T F Sbjct: 246 TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFP 305 Query: 303 FICFFYMGSILAEPARETLGASLADSRARGSYMGFSRLGLALGGALGYTGGGWMYD 358 + G I PA + + + D +G G +L +G +Y Sbjct: 306 IMVLLASGGI-GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 79.8 bits (197), Expect = 4e-19 Identities = 68/365 (18%), Positives = 121/365 (33%), Gaps = 85/365 (23%) Query: 3 NILITGASGFIGGAFMRRFACHDGIRLCGI-------------GRRSVEGFP--TSVRYQ 47 L+TGA+GFIG +R G ++ GI R + P + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEA-GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 48 ALDLARLATL--DFTPDVVIHAAGRAG---PWGTRSEYYRDNVVTTEQVIKFCQSRGNPR 102 D + L + V + R Y N+ +++ C+ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 103 LIYLSTAAVYYRYCHQLALTEQSEIGPEFANDYALTKHQGEALIEAYQG----EKTILRP 158 L+Y S+++V Y ++ + + + YA TK E + Y T LR Sbjct: 121 LLYASSSSV-YGLNRKMPFSTDDSV-DHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178 Query: 159 CAVFGP-GDQLLFPPLLDAASRHGLPLLISEVPARGELM----YIDVLCDYLLKAAIKPE 213 V+GP G + A G + +V G++ YID + + +++ Sbjct: 179 FTVYGPWGRPDMALFKFTKAMLEGKSI---DVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235 Query: 214 LR----------------PF--YNLSNVEPIEINEFLIDVLSK-LGLPAPKREVRVATAM 254 P+ YN+ N P+E+ ++ I L LG+ A K + + Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDY-IQALEDALGIEAKKNMLPLQ--- 291 Query: 255 LIAGIIEGTYRLLRIKSEPSITRFGVGVLGYSKTLDVSAAIHDFG-SPSRSLSQGLDAFI 313 G + T D A G +P ++ G+ F+ Sbjct: 292 --PGDVLETS------------------------ADTKALYEVIGFTPETTVKDGVKNFV 325 Query: 314 RWYKE 318 WY++ Sbjct: 326 NWYRD 330
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 47.8 bits (113), Expect = 4e-08 Identities = 101/420 (24%), Positives = 169/420 (40%), Gaps = 55/420 (13%) Query: 14 TLLMAGNASA---QETLRVLLEGHSTSDSIKALLPEFEKQTGIKVQAEIVPYSDLTSKAL 70 T++ + +A A + L + + G + + + +FEK TGIKV E + D + Sbjct: 17 TMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVE---HPDKLEEKF 73 Query: 71 LAFSSHSGRYDVVMDDWVHAV--GYASAGYITPVDQWMESDTAFYDGADFVKSYA---DT 125 ++ D++ W H GYA +G + + D AF D K Y D Sbjct: 74 PQVAATGDGPDIIF--WAHDRFGGYAQSGLLAEI----TPDKAFQD-----KLYPFTWDA 122 Query: 126 LRYKDGYYGLPVYGESTFLMYRKDLFEQYGIAVPKTFDELTAAAKTIKEKTEGKVAGITL 185 +RY P+ E+ L+Y KDL PKT++E+ A K +K K GK A + Sbjct: 123 VRYNGKLIAYPIAVEALSLIYNKDLLPN----PPKTWEEIPALDKELKAK--GKSA--LM 174 Query: 186 RGAQGIQNTFAWASFLWGYGGQWIDDNGK-----SAIASPQAVEATKSFVNILKNYGPIG 240 Q + F W G + +NGK + + A V+++KN Sbjct: 175 FNLQ--EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNA 232 Query: 241 AANFGWQENRLVFQQGKAAMTIDSTVNGGFNEDPKESMVVGKVGYAPVPVQPGDHPGNSG 300 ++ E F +G+ AMTI NG + ++ KV Y + + Sbjct: 233 DTDYSIAE--AAFNKGETAMTI----NGPW---AWSNIDTSKVNYGVTVLPTFKGQPSKP 283 Query: 301 ALQVHGLYISSDSKKQDAAWKFISWATDKQTQMKSVELNPNAGVSSLSAINSDAFTKRYG 360 + V I++ S ++ A +F+ +++V + L A+ ++ + Sbjct: 284 FVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKD-----KPLGAVALKSYEEELA 338 Query: 361 AFKDGMLAALQNGNAK--YLPTIPQSTQIINITGIALSEALAGTQTVENALQQANTRNDK 418 KD +AA K +P IPQ + A+ A +G QTV+ AL+ A TR K Sbjct: 339 --KDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 41.6 bits (97), Expect = 2e-05 Identities = 25/158 (15%), Positives = 46/158 (29%), Gaps = 1/158 (0%) Query: 27 ASNKTLAASIKTTKDQLKQLNGQAAKIE-GFRQNKAAVDRAAQALTAARNKARQLATELK 85 A L +++ + + + +E A +AL A N + + ++K Sbjct: 190 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 249 Query: 86 NSAAPTAKQAREFKRASEEAAKLKQKYNDLRTALHTQRAALQSSGVATNRLGQAQRTLKA 145 A A + + T A + L + L A Sbjct: 250 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNA 309 Query: 146 SITSTTAALAAQQRRLAQQAQQQQRLNAARNRFDASNQ 183 + S L A + Q + Q+L +AS Q Sbjct: 310 NRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 347 Score = 35.0 bits (80), Expect = 0.002 Identities = 30/169 (17%), Positives = 58/169 (34%), Gaps = 6/169 (3%) Query: 15 IDKITRPFKSMLASNKTLAASIKTTKDQLKQLNGQAAKIEGFRQNKAAVDRAAQALTAAR 74 + + + + ++K L + A +E +A +++A + A Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE---ARQAELEKALEG---AM 273 Query: 75 NKARQLATELKNSAAPTAKQAREFKRASEEAAKLKQKYNDLRTALHTQRAALQSSGVATN 134 N + + ++K A A E ++ L LR L R A + Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333 Query: 135 RLGQAQRTLKASITSTTAALAAQQRRLAQQAQQQQRLNAARNRFDASNQ 183 +L + + +AS S L A + Q + Q+L +AS Q Sbjct: 334 KLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 29.5 bits (66), Expect = 0.005 Identities = 11/51 (21%), Positives = 24/51 (47%), Gaps = 4/51 (7%) Query: 80 WSL-IEGNGAIHGMFVIESLERTKSIFFSDGSARKIEF-TLSLKRTDESLK 128 W + E NG + + I+S +S++ S+ + ++S+ R + L Sbjct: 929 WEIYFEDNGLVFEI--IDSNGNQESVYLSNIINDNWYYISISVDRLKDQLL 977
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 42.4 bits (99), Expect = 8e-06 Identities = 24/158 (15%), Positives = 47/158 (29%), Gaps = 1/158 (0%) Query: 27 ASNKTLAASIKTTKDQLKQLNSQAAKIE-GFRQNKAAVDRAAQALTAARDKARQLATELK 85 A L +++ + +++ +E A +AL A + + + ++K Sbjct: 190 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 249 Query: 86 NSAAPTAKQAREFKRASEEAAKLKQKYNDLRTALHTQRAALQSSGVATNRLGQAQRTLKA 145 A A + + T A + L + L A Sbjct: 250 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNA 309 Query: 146 SITSTTAALAAQQRRLAQQAQQQQRLNAARNRFDASNQ 183 + S L A + Q + Q+L +AS Q Sbjct: 310 NRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 347 Score = 33.1 bits (75), Expect = 0.006 Identities = 29/169 (17%), Positives = 59/169 (34%), Gaps = 6/169 (3%) Query: 15 IDKITRPFKSMLASNKTLAASIKTTKDQLKQLNSQAAKIEGFRQNKAAVDRAAQALTAAR 74 + + + + ++K L ++ A +E +A +++A + A Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE---ARQAELEKALEG---AM 273 Query: 75 DKARQLATELKNSAAPTAKQAREFKRASEEAAKLKQKYNDLRTALHTQRAALQSSGVATN 134 + + + ++K A A E ++ L LR L R A + Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333 Query: 135 RLGQAQRTLKASITSTTAALAAQQRRLAQQAQQQQRLNAARNRFDASNQ 183 +L + + +AS S L A + Q + Q+L +AS Q Sbjct: 334 KLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 29.1 bits (65), Expect = 0.008 Identities = 10/51 (19%), Positives = 24/51 (47%), Gaps = 4/51 (7%) Query: 80 WSL-IEGNGAIHGMFVIESLNRTKNIFFSDGSARKIEF-TLSLKRTDESLK 128 W + E NG + + I+S ++++ S+ + ++S+ R + L Sbjct: 929 WEIYFEDNGLVFEI--IDSNGNQESVYLSNIINDNWYYISISVDRLKDQLL 977
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.8 bits (176), Expect = 7e-17 Identities = 25/115 (21%), Positives = 46/115 (40%), Gaps = 2/115 (1%) Query: 2 ISVLLVDDHELVRAGIRRILDDIKGIKVAGEMQCGEDAVKWCRSHVVDIVLMDMNMPGIG 61 ++L+ DD +R + + L G V +W + D+V+ D+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSR-AGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEATRKILRFSPDTKVIMLTIHTENPLPAKVMQAGAGGYLSKGAAPQDVITAIR 116 + +I + PD V++++ K + GA YL K ++I I Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>FLAGELLIN#Flagellin signature. Length = 507 Score = 165 bits (419), Expect = 9e-49 Identities = 164/358 (45%), Positives = 191/358 (53%), Gaps = 3/358 (0%) Query: 3 VINTNSLSLLTQNNLNKSQSSLGTAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62 VINTNSLSLLTQNNLNKSQSSL +AIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62 Query: 63 AARNANDGISIAQTTEGSLNEINNNLQRVRELTVQAQNGSNSSSDLDSIQDEISLRLAEI 122 A+RNANDGISIAQTTEG+LNEINNNLQRVREL+VQA NG+NS SDL SIQDEI RL EI Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122 Query: 123 DRVSDQTQFNGKKVLAENTTMSIQVGANDGETIDINLQKIDSKSLGLGSYSVSGVSGALT 182 DRVS+QTQFNG KVL+++ M IQVGANDGETI I+LQKID KSLGL ++ V+G Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFN---VNGPKE 179 Query: 183 SLTDTSVTGVTTTTALDFSDISTFAKGATVHGIGDVGTDGAYADGYVIRTTDGKQYKGEV 242 + + T D + V+ V A + Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239 Query: 243 DATNGKVTFADDANGDPIDDATKLEAAAQFSPAGKATASPLETLDDAIKQVDGLRSSLGA 302 DA N A A + + + I G + Sbjct: 240 DAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKV 299 Query: 303 VQNRFESAVTNLNNTVTNLTSARSRIEDADYATEVSNMSRAQILQQAGTSVLSQANQV 360 VT +T + +++ Q T S Sbjct: 300 STTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357 Score = 101 bits (252), Expect = 1e-25 Identities = 82/241 (34%), Positives = 112/241 (46%), Gaps = 2/241 (0%) Query: 129 TQFNGKKVLAENTTMSIQVGANDGETIDINLQKIDSKSLGLGSYSVSGVSGALTSLTDTS 188 G K + + D N + + + + +V+ ++ ++ + Sbjct: 267 GAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAAT 326 Query: 189 VTGVTTTTALDFSDISTFAKG-ATVHGIGDVGTDGAYADGYVIRTTDGKQYKGEVDATNG 247 + + TF G T +G +Y Sbjct: 327 LQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKV 386 Query: 248 KVTFADDANGDPIDDATKLEAAAQFSPAGKATASPLETLDDAIKQVDGLRSSLGAVQNRF 307 + + L + PL ++D A+ +VD +RSSLGA+QNRF Sbjct: 387 TLAGKTMFIDKTASGVSTLINEDAAAAKKSTAN-PLASIDSALSKVDAVRSSLGAIQNRF 445 Query: 308 
ESAVTNLNNTVTNLTSARSRIEDADYATEVSNMSRAQILQQAGTSVLSQANQVPQTVLSL 367 +SA+TNL NTVTNL SARSRIEDADYATEVSNMS+AQILQQAGTSVL+QANQVPQ VLSL Sbjct: 446 DSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSL 505 Query: 368 L 368 L Sbjct: 506 L 506
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 80.5 bits (198), Expect = 2e-23 Identities = 59/102 (57%), Positives = 73/102 (71%) Query: 2 SVQGIEGVLQQLQVTALQASGSAKTLPAEAGFASELKAAIGKISENQQVARTSAQNFELG 61 ++QGIEGV+ QLQ TA+ A FA +L AA+ +IS+ Q ART A+ F LG Sbjct: 2 AIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLG 61 Query: 62 VPGVGLNDVMVNAQKSSVSLQLGIQVRNKLVAAYQEVMNMGV 103 PGV LNDVM + QK+SVS+Q+GIQVRNKLVAAYQEVM+M V Sbjct: 62 EPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 578 bits (1490), Expect = 0.0 Identities = 353/552 (63%), Positives = 441/552 (79%), Gaps = 9/552 (1%) Query: 19 LARLRANPKIPLLIAAAAAIAIIVALMLWAKSPDYRVLYSNLSDRDGGDIVTQLTQLNIP 78 L RLRANP+IPL++A +AA+AI+VA++LWAK+PDYR L+SNLSD+DGG IV QLTQ+NIP Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75 Query: 79 YRFADNGGALLIPAEKVHETRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQINYQRAL 138 YRFA+ GA+ +PA+KVHE RLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQ+NYQRAL Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135 Query: 139 EGELSRTIGTLGPVLNVRVHLAMPKPSLFVREQKSPTASVTLALQPGRALDDGQINAIVY 198 EGEL+RTI TLGPV + RVHLAMPKPSLFVREQKSP+ASVT+ L+PGRALD+GQI+A+V+ Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195 Query: 199 MVSSSVAGLPPGNVTVVDQTGRLLTQSDSAGRDLNASQLKFTSEVENRYQRRIENILAPM 258 +VSS+VAGLPPGNVT+VDQ+G LLTQS+++GRDLN +QLKF ++VE+R QRRIE IL+P+ Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255 Query: 259 VGNGNVHAQVTAQVDFASREQTDEEYKPNQAANQGAVRSQQVSTSEQLGGTNVGGVPGAL 318 VGNGNVHAQVTAQ+DFA++EQT+E Y PN A++ +RS+Q++ SEQ+G GGVPGAL Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315 Query: 319 SNQPPVAPIAPIEIPQPAGAAANNAAPANAAATANANTTATAAKASSSNSRHDQTTNFEV 378 SNQP P A A NA T +T+ + A +++ ++T+N+EV Sbjct: 316 SNQPAP--------PNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEV 367 Query: 379 DRTIRHTQQQAGMVQRLSVAVVVNYTSDKAGKPIALSKDQLAQVESLTREAMGFSTVRGD 438 DRTIRHT+ G ++RLSVAVVVNY + GKP+ L+ DQ+ Q+E LTREAMGFS RGD Sbjct: 368 DRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDKRGD 427 Query: 439 TLNVVNTPFTASDDTRGSSLPFWQQQSFFDQLLNAGRYLLILLVAWILWRKLLRPMLAKK 498 TLNVVN+PF+A D+T G LPFWQQQSF DQLL AGR+LL+L+VAWILWRK +RP L ++ Sbjct: 428 TLNVVNSPFSAVDNT-GGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRR 486 Query: 499 QVADKAAASVNNIVQTAQAAETVKQSKEELALRKKNQQRVSAEVQAQRIRELADKDPRVV 558 KAA + Q + A V+ SK+E +++ QR+ AEV +QRIRE++D DPRVV Sbjct: 487 
VEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVV 546 Query: 559 ALVIRQWMSNDQ 570 ALVIRQWMSND Sbjct: 547 ALVIRQWMSNDH 558
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 314 bits (806), Expect = e-108 Identities = 113/327 (34%), Positives = 192/327 (58%), Gaps = 2/327 (0%) Query: 2 SLTGTEKSAIMLMTLGEDHAAEVFKHLSSREVQQLSTTMASMRQVSHQQLVDVLAEFEDD 61 +LTG +K+AI+L+++G + +++VFK+LS E++ L+ +A + ++ + +VL EF++ Sbjct: 14 ALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKEL 73 Query: 62 AEQYAALSVNASDYLRSVLIKALGEERASSLLEDILESRETTSGMETLNFMEPQMAADLI 121 + DY R +L K+LG ++A ++ + L S + E + +P + I Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILNFI 132 Query: 122 RDEHPQIIATILVHLKRAQAADILALFDERLRNDVMLRIATFGGVQPAALAELTEVLNNL 181 + EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 133 QQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKK 192 Query: 182 LDGQ-NLKRSKMGGIRTAAEIINLMKTQQEETVMDAVREYDGELAQKIIDEMFLFENLVS 240 L + + GG+ EIIN+ + E+ +++++ E D ELA++I +MF+FE++V Sbjct: 193 LASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVL 252 Query: 241 VDDRSIQRLLQEIDNESLLIALKGADQALRERFLSNMSLRAAEILRDDLATRGPVRMSLV 300 +DDRSIQR+L+EID + L ALK D ++E+ NMS RAA +L++D+ GP R V Sbjct: 253 LDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDV 312 Query: 301 ENEQKSILLIVRRLAESGEIVIGGGED 327 E Q+ I+ ++R+L E GEIVI G + Sbjct: 313 EESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 221 bits (564), Expect = 3e-75 Identities = 128/233 (54%), Positives = 167/233 (71%), Gaps = 7/233 (3%) Query: 6 NALPWQPWSLKDFASQSEVPLSESMPDISLLFPNEPMEATAAVDEQQVLVNLQLEAEKQG 65 + LPW+ W+ D A P +E +P + P E + A +Q L LQ++A +QG Sbjct: 3 DNLPWKTWTPDDLAP----PQAEFVPIVE---PEETIIEEAEPSLEQQLAQLQMQAHEQG 55 Query: 66 RQQGFAKGLQEGLDKGYQTGLEEGHQQALADAQQQLAPMTAHWQVMVTDFQNTLDTLDSV 125 Q G A+G Q+G +GYQ GL +G +Q LA+A+ Q AP+ A Q +V++FQ TLD LDSV Sbjct: 56 YQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSV 115 Query: 126 IASRLVQIALAAAKQIIGQPAICDGTALLAQIQQMIQQEPMFAGKTQLRVNPDDLAIVEQ 185 IASRL+Q+AL AA+Q+IGQ D +AL+ QIQQ++QQEP+F+GK QLRV+PDDL V+ Sbjct: 116 IASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDD 175 Query: 186 RLGSTLSLHGWRLLGDSQIHAGGCKVSAEEGDLDASLATRWHELCRLAAPGEL 238 LG+TLSLHGWRL GD +H GGCKVSA+EGDLDAS+ATRW ELCRLAAPG + Sbjct: 176 MLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 109 bits (274), Expect = 9e-34 Identities = 81/144 (56%), Positives = 101/144 (70%) Query: 1 MKSQSPLVTLCDLAQKAVEQASTQLGHVRQSYQNAEQQLTMLLTYQDEYRERLNDTLCNG 60 M L TL DLA+K VE A+ LG +R+ Q AE+QL ML+ YQ+EYR LN + G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MASSSCQNYQQFIQTLEQAIDQHRKQLAQWSIKVEQAVKYWQEKQQRLNAFETLQERAET 120 + S+ NYQQFIQTLE+AI QHR+QL QW+ KV+ A+ W+EK+QRL A++TLQER T Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 TQRQQENRLDQKLMDEFAQRASQR 144 ENRLDQK MDEFAQRA+ R Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMR 144
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 137 bits (346), Expect = 1e-38 Identities = 94/199 (47%), Positives = 119/199 (59%), Gaps = 7/199 (3%) Query: 253 AAQSEVSLSSASSDKTQLNLTPV-TAALSSPMNTAAASSLVSAPANGYLSAPLGSQEWQQ 311 AQ L + + K ++ TP A +SP+ T + + A LSAPLGS EWQQ Sbjct: 183 PAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQ 242 Query: 312 SLGQQVLMFSRNGQQSAELRLHPQELGALQISLKMEDNQAQLHFASAHSQVRAALEAAMP 371 SL Q + +F+R GQQSAELRLHPQ+LG +QISLK++DNQAQ+ S H VRAALEAA+P Sbjct: 243 SLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALP 302 Query: 372 SLRHALAESGVQLGQSSVGSEGQWQQAQQQSQQNQQDVIARGQPTYGDVVAGPLTETPLA 431 LR LAESG+QLGQS++ E Q Q SQQ Q A +P G+ + L Sbjct: 303 VLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGE------DDDTLP 356 Query: 432 APTALQSLANGQGGVDVFA 450 P +LQ G GVD+FA Sbjct: 357 VPVSLQGRVTGNSGVDIFA 375
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 27.1 bits (60), Expect = 0.031 Identities = 26/156 (16%), Positives = 44/156 (28%), Gaps = 27/156 (17%) Query: 8 AKRKSSIWLILLVLVAIAASAGGGYSWWLLHKSKPTNTQIVAAIPVFMPLETFTVNLITP 67 A+R + ++ + A+AG V A+ PL+T +IT Sbjct: 28 AERSKKLAWVVAGVAGALATAG------------------VVAVAALTPLKTVEPYVITV 69 Query: 68 DNNLDRVLYIGLTLRLPDDTTRTKLNDYLPE--VRSR-----LLLLLSRQSADSLSNEEG 120 D N T + Y VR R + +S Sbjct: 70 DRNTGEASIAAKLHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPE 129 Query: 121 KQRLVN--DIKNILSPPMVKGQPNQVISDVLFTAFI 154 + R N SP + V ++ +F+ Sbjct: 130 QDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFL 165
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 334 bits (857), Expect = e-116 Identities = 78/288 (27%), Positives = 138/288 (47%), Gaps = 8/288 (2%) Query: 5 ILSQAEIDALLNGDS---GSEEPEIITANETDVKPYDPTTQRRVVRERLHALEIINERFA 61 +LSQ EID LL S S E ++ + YD + +E++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 62 RQFRMGLFNLLRRSPDITVGPIKIQPYHDFARNLPVPTNLNLVHLKPLRGTALFVFAPSL 121 R L LR + V + Y +F R++P P+ L ++ + PL+G A+ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 122 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVITRMLRLALDAYRDAWAAIYKIDVEYVRS 181 F +D LFGG G+ KV+ R+ T E V+ ++ L R++W + + + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 182 EIQVKFTNITTSPNDIVVSTPFQVEIGTLSGEFNICIPFAMIEPLRELLTNPPLENS--R 239 E +F I P+++VV + ++G G N CIP+ IEP+ L++ +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 240 QEDNYWRETLVKQVQHSELELVANFVDIPLRLSQILKLQPGDVLPIEK 287 + L ++ ++++VA + L + IL L+ GD++ + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHD 288
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 161 bits (410), Expect = 1e-54 Identities = 103/138 (74%), Positives = 117/138 (84%), Gaps = 1/138 (0%) Query: 1 MSDPKFPSADGKESVDDLWADAFNEQQATEKPTATTEGVFKSLEAPEGLGNLQDIDLILD 60 MSD PS + ++DDLWADA NEQ+AT +A + VF+ L + G +QDIDLI+D Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAA-DAVFQQLGGGDVSGAMQDIDLIMD 59 Query: 61 IPVKLSVELGRTKMTIKELLRLSQGSVVSLDGLAGEPLDILINGYLIAQGEVVVVADKYG 120 IPVKL+VELGRT+MTIKELLRL+QGSVV+LDGLAGEPLDILINGYLIAQGEVVVVADKYG Sbjct: 60 IPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYG 119 Query: 121 VRITDIITPSERMRRLSR 138 VRITDIITPSERMRRLSR Sbjct: 120 VRITDIITPSERMRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 306 bits (786), Expect = e-108 Identities = 196/240 (81%), Positives = 215/240 (89%), Gaps = 1/240 (0%) Query: 35 TTLGLLTLFCSPSVLAQLPGIISQPLANGGQSWSLPVQTLVFITTLSFLPAALLMMTSFT 94 LL L P AQLPGI SQPL GGQSWSLPVQTLVFIT+L+F+PA LLMMTSFT Sbjct: 7 VAPVLLWLIT-PLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFT 65 Query: 95 RIIIVLGLLRNAMGTPSAPPNQVMLGLALFLTFFIMSPVFDKVYQEAYLPFSQDKISMDV 154 RIIIV GLLRNA+GTPSAPPNQV+LGLALFLTFFIMSPV DK+Y +AY PFS++KISM Sbjct: 66 RIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQE 125 Query: 155 ALDKGSQPLREFMLRQTRESDLALYARLANLPPLEGPEMVPMRILLPAYVTSELKTAFQI 214 AL+KG+QPLREFMLRQTRE+DL L+ARLAN PL+GPE VPMRILLPAYVTSELKTAFQI Sbjct: 126 ALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQI 185 Query: 215 GFTVFIPFLIIDLVVASVLMALGMMMVPPASISLPFKLMLFVLVDGWQLLLGSLAQSFYS 274 GFT+FIPFLIIDLV+ASVLMALGMMMVPPA+I+LPFKLMLFVLVDGWQLL+GSLAQSFYS Sbjct: 186 GFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.1 bits (164), Expect = 1e-18 Identities = 24/78 (30%), Positives = 40/78 (51%) Query: 4 ESVMALGTEAMKIALALAAPLLLAALISGLIVSLLQAATQINEMTLSFIPKILAVFTTMV 63 + ++ G +A+ + L L+ + A I GL+V L Q TQ+ E TL F K+L V + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLNLILDYMRNLF 81 + W ++L Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 173 bits (440), Expect = 1e-55 Identities = 172/258 (66%), Positives = 215/258 (83%) Query: 1 MLSFDTHQLSVWVSQYFWPLVRVLALIGTAPLLSEKQINKKVKIGLGVLITFLIAPSLPP 60 ML + Q W++ YFWPL+RVLALI TAP+LSE+ + K+VK+GL ++ITF IAPSLP Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 VNIPLFSSAALWVAIQQILIGVALGVTMQFAFAAVRLSGEVIGLQMGLSFATFFDPSGGP 120 ++P+FS ALW+A+QQILIG+ALG TMQFAFAAVR +GE+IGLQMGLSFATF DP+ Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLSRLLNILVTLLFLSFDGHLWLISLLADSFHTLPIQFAPLNGNGFLTLAQSGSMIF 180 NMPVL+R++++L LLFL+F+GHLWLISLL D+FHTLPI PLN N FL L ++GS+IF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 MNGLMLALPLITLLLTLNMALGMLNRMTPQLSVFVIGFPLTLTVGIISLGLIMPLLAPFT 240 +NGLMLALPLITLLLTLN+ALG+LNRM PQLS+FVIGFPLTLTVGI + +MPL+APF Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFGEFFDRLAEVLSGM 258 EHLF E F+ LA+++S + Sbjct: 241 EHLFSEIFNLLADIISEL 258
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 437 bits (1126), Expect = e-150 Identities = 314/552 (56%), Positives = 397/552 (71%), Gaps = 9/552 (1%) Query: 3 NSLMNTAMSGLNAAQYALSTVSNNITNFQVAGYNRQNTVFAQNGGTITSAGFIGNGVTVT 62 +SL+N AMSGLNAAQ AL+T SNNI+++ VAGY RQ T+ AQ T+ + G++GNGV V+ Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 63 GVNREYNAFITNQLRASQTQSSGLATYYQQISQIDNLLSNASNNLSTTMQDFFSNLQNLV 122 GV REY+AFITNQLRA+QTQSSGL Y+Q+S+IDN+LS ++++L+T MQDFF++LQ LV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 123 SNADDDAARKTVLGKAEGLVNQFQNADKYLRDMDDGVNQKITDSATQINNYAEQIAKLND 182 SNA+D AAR+ ++GK+EGLVNQF+ D+YLRD D VN I S QINNYA+QIA LND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 183 QITRLRG-SSGSEPNALLDQRDQLVTELNQIMAVTVTQQDGDAYNVSFAGGLSLVQGPNA 241 QI+RL G +G+ PN LLDQRDQLV+ELNQI+ V V+ QDG YN++ A G SLVQG A Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 YKVEAIPSSADATRLTLGYKRGNGEATEVDESRITTGSLGGTLKFRSEALDSARNQLGQL 301 ++ A+PSSAD +R T+ Y G E+ E + TGSLGG L FRS+ LD RN LGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALVMADSFNTQHNAGFDINGDEGEDFFSFADPTVLKNAKNQGNASITVEYKDTSKVKASD 361 AL A++FNTQH AGFD NGD GEDFF+ P VL+N KN+G+ +I D S V A+D Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360 Query: 362 YTVEFDGTDWQVTRLSDNTKVQTTPGVNADGDPTLEFEGVAIKIDNGTPGPQAKDKFTIK 421 Y + FD WQVTRL+ NT TP D + + F+G+ + P D FT+K Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTP----DANGKVAFDGLELTFTG---TPAVNDSFTLK 413 Query: 422 TVSNVAANLQVAITDSSKIAAAGSADGGISDNTNAQALLDLQSKKLVEGK-TTLSGAYAG 480 VS+ N+ V ITD +KIA A D G SDN N QALLDLQS G + + AYA Sbjct: 414 PVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 481 LVSNVGNQTATAKTNSTAQANIVTQLTTEQQSISGVNLDEEYGDLQRFQQYYLANAQVLQ 540 LVS++GN+TAT KT+S Q N+VTQL+ +QQSISGVNLDEEYG+LQRFQQYYLANAQVLQ Sbjct: 474 
LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533 Query: 541 AASTLFNTLLSI 552 A+ +F+ L++I Sbjct: 534 TANAIFDALINI 545
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 314 bits (805), Expect = e-109 Identities = 181/316 (57%), Positives = 233/316 (73%), Gaps = 6/316 (1%) Query: 1 MSDLLAMSGAAYDAQSLEALKRDAARDPEGNLKQVAQQVEGMFVQMMLKSMRAALPQDGV 60 +SD ++ AA+DAQSL LK A DP N++ VA+QVEGMFVQMMLKSMR ALP+DG+ Sbjct: 2 ISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGL 61 Query: 61 MNSEQTKLYTSLYDQQIAQQMSA-KGLGLADMMVEQLS-GSTSASETAGTVPMMLDNEVL 118 +SE T+LYTS+YDQQIAQQM+A KGLGLA+MMV+Q++ E+ PM E + Sbjct: 62 FSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETV 121 Query: 119 QSMPAQALAQVMRRAIPTPPSSSMAAISPGNGNFVARMSIPAQIASQQSGIPHQLIMAQA 178 QAL+Q++++A+P S+ S F+A++S+PAQ+ASQQSG+PH LI+AQA Sbjct: 122 VRYQNQALSQLVQKAVPRNYDDSLPGDSK---AFLAQLSLPAQLASQQSGVPHHLILAQA 178 Query: 179 ALESGWGQREIPTADGKSSYNVFGIKAGSSWNGPVSEITTTEYEQGVAKKTKARFRVYGS 238 ALESGWGQR+I +G+ SYN+FG+KA +W GPV+EITTTEYE G AKK KA+FRVY S Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238 Query: 239 YVEAVSDYVKLLTQNPRYAHVAAAQSPEQGAHALQKAGYATDPQYAQKLVSVIQQMRSTG 298 Y+EA+SDYV LLT+NPRYA V A S EQGA ALQ AGYATDP YA+KL ++IQQM+S Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSIS 298 Query: 299 EQAVKAYGGSDLSQLF 314 ++ K Y ++ LF Sbjct: 299 DKVSKTY-SMNIDNLF 313
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 391 bits (1007), Expect = e-138 Identities = 155/366 (42%), Positives = 217/366 (59%), Gaps = 9/366 (2%) Query: 5 SLVTLLMVLLSLVWLPASAERIRDLVTVQGVRDNALIGYGLVVGLDGSGDQTMQTPFTTQ 64 +LV + LS A RI+D+ ++Q RDN LIGYGLVVGL G+GD +PFT Q Sbjct: 10 ALVFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQ 69 Query: 65 SLSNMLSQLGITVPPGTNMQLKNVAAVMVTAKLPAFSRAGQTIDVVVSSMGNAKSIRGGT 124 S+ ML LGIT G + KN+AAVMVTA LP F+ G +DV VSS+G+A S+RGG Sbjct: 70 SMRAMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGN 128 Query: 125 LLMTPLKGVDNQVYALAQGNVLVGGAGAAAGGSSVQVNQLAGGRISNGATIERELPTTFG 184 L+MT L G D Q+YA+AQG ++V G A +++ R+ NGA IERELP+ F Sbjct: 129 LIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFK 188 Query: 185 TDGIINLQLNSEDFTLAQQVSDAINR----QRGFGSATAIDARTIQVLVPRGGSSQVRFL 240 + LQL + DF+ A +V+D +N + G A D++ I V PR + R + Sbjct: 189 DSVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPR-VADLTRLM 247 Query: 241 ADIQNIPINVDPGDAKVIINSRTGSVVMNRNVVLDSCAVAQGNLSVVVDKQNIVSQPDTP 300 A+I+N+ + D AKV+IN RTG++V+ +V + AV+ G L+V V + V QP P Sbjct: 248 AEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-AP 305 Query: 301 FGGGQTVVTPNTQISVQQQGGVLQRVNASPNLNNVVRALNSLGATPIDLMSILQAMESAG 360 F GQT V P T I Q+G + V P+L +V LNS+G +++ILQ ++SAG Sbjct: 306 FSRGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAG 364 Query: 361 CLRAKL 366 L+A+L Sbjct: 365 ALQAEL 370
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 283 bits (725), Expect = 1e-99 Identities = 176/222 (79%), Positives = 193/222 (86%), Gaps = 2/222 (0%) Query: 9 PLMTMLL--LNGCAYIPHKPLVDGTTSAQPAPASAPLPNGSIFQTVQPMNYGYQPLFEDR 66 + ++L+ L GCA+IP PLV G TSAQP P P+ NGSIFQ+ QP+NYGYQPLFEDR Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDR 69 Query: 67 RPRNIGDTLTITLQENVSASKSSSANASRNGTSSFGVTTAPRYLDGLLGNGRADMEITGD 126 RPRNIGDTLTI LQENVSASKSSSANASR+G ++FG T PRYL GL GN RAD+E +G Sbjct: 70 RPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGG 129 Query: 127 NTFGGKGGANANNTFSGTITVTVDQVLANGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 186 NTF GKGGANA+NTFSGT+TVTVDQVL NGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189 Query: 187 SGSNSVTSTQVADARIEYVGNGYINEAQTMGWLQRFFLNVSP 228 SGSN+V STQVADARIEYVGNGYINEAQ MGWLQRFFLN+SP Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.9 bits (98), Expect = 2e-06 Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 14/80 (17%) Query: 4 SLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTLRQPGAQSSEQTTLP 63 + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTLG 48 Query: 64 SGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 22/41 (53%) Query: 220 ETSNVNVAEELVNMIQTQRAYEINSKAVSTSDQMLQKLAQL 260 S VN+ EE N+ + Q+ Y N++ + T++ + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 45.3 bits (107), Expect = 3e-07 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 8/87 (9%) Query: 6 AVSGMNAASSNLDVIGNNIANSATSGFKAGSVSFAD----MFAGSQTGMGVKVAGITQDF 61 A+SG+NAA + L+ NNI++ +G+ + A + AG G GV V+G+ +++ Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREY 66 Query: 62 NDGTATTTNRRLDLAISQNGFFRMQDS 88 + +L A +Q+ + Sbjct: 67 DA----FITNQLRAAQTQSSGLTARYE 89 Score = 40.7 bits (95), Expect = 9e-06 Identities = 15/49 (30%), Positives = 28/49 (57%) Query: 380 TLTSGALESSNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILQTLVSLR 428 L++ S V+L +E N+ Q+ Y +NAQ ++T + I L+++R Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.9 bits (64), Expect = 0.008 Identities = 15/34 (44%), Positives = 21/34 (61%), Gaps = 2/34 (5%) Query: 43 LKNQDPTNPMENNELTTQLAQINTVSGIEKLNTT 76 L N+ P N ++NN L TQL + V G E+L T+ Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.0 bits (72), Expect = 0.007 Identities = 21/74 (28%), Positives = 29/74 (39%), Gaps = 17/74 (22%) Query: 24 PGVKALDNVNLKVRPYSIHALMGENGAGKSTLLKCLFGIYKKDSGSIIFQGQEIEFKSSK 83 PG K D + L G G GKSTL+ L G+ F + + K Sbjct: 591 PGCKF-DYSVV---------LEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGK 633 Query: 84 EALEQGVSMVHQEL 97 ++ EQ +V EL Sbjct: 634 DSYEQIAGIVAYEL 647
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 32.0 bits (73), Expect = 0.009 Identities = 23/122 (18%), Positives = 48/122 (39%), Gaps = 5/122 (4%) Query: 669 TISLVTLFSVALLLISTMIIGIAESKRISKILKIMESVGGSLYTHIIFFIQQNVTPVLVA 728 + L+ L S +++ AE + + V L L+A Sbjct: 39 SAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA 98 Query: 729 VAIAF-PIGFIL----LQKWLSKYNFINNLSYLYAFGTLLLFMVSIVSVMTLSLILSHTK 783 +A GF++ ++ + K N I +++ +L+ F+ SI+ V+ LS+++ Sbjct: 99 IASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIII 158 Query: 784 KN 785 K Sbjct: 159 KG 160
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.005 Identities = 33/219 (15%), Positives = 70/219 (31%), Gaps = 36/219 (16%) Query: 114 EATSRMADIMEQINSLRNMRMRLEQDSRDTQLSLQEAQ-------HQIDIISKDLKRYKV 166 E + I EQ ++ +N + + E + + + + L + Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242 Query: 167 LDEKLLIAKSEL---ERQADRLIN---------WKTKSNILQK------HNSRNQKSFPS 208 L K IAK + E + +N + +S IL + Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILD 302 Query: 209 QFKNIDESIILLEKMMKMIEVGIEQLVIIAPIDGTLSVLDI-ELGQQIKSGEKI-SVIDN 266 + + ++I LL + E + VI AP+ + L + G + + E + ++ Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPE 362 Query: 267 LNSYYFNVYFSENYIDKIKPNTQIIAQINGQDTQLLIES 305 ++ I I GQ+ + +E+ Sbjct: 363 DDTLEVTALVQNKDIGFINV---------GQNAIIKVEA 392
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 35.1 bits (80), Expect = 5e-04 Identities = 40/135 (29%), Positives = 55/135 (40%), Gaps = 41/135 (30%) Query: 3 HCNTSDLLSLEQALTK-MLSQATPLPATEVIPLSEAAGRITASAIT----------SPIA 51 H NT ++++ AL+ M++QA PL E++ + AA S I P Sbjct: 354 HHNTEEIVAQSIALSSLMVAQAIPL-VGELVDIGFAAYNFVESIINLFQVVHNSYNRPAY 412 Query: 52 VP-----PFANSAMDGYAVRWHELSDEI--------------------PLPVAGVAFAGA 86 P PF + DGYAV W+ + D I PLP+AGV Sbjct: 413 SPGHKTQPFLH---DGYAVSWNTVEDSIIRTGFQGESGHDIKITAENTPLPIAGVLLPTI 469 Query: 87 PFK-DVWPEKTCIRI 100 P K DV KT I + Sbjct: 470 PGKLDVNKSKTHISV 484
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.009 Identities = 11/18 (61%), Positives = 13/18 (72%) Query: 352 GPNGIGKSTLLKTLLGEY 369 G GIGKSTL+ TL+G Sbjct: 603 GTGGIGKSTLINTLVGLD 620
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 30.5 bits (68), Expect = 0.003 Identities = 16/69 (23%), Positives = 26/69 (37%) Query: 40 YVYSSESTYGVEPNEKEVEEIIKMKPDVIDPGETLKLAPSILSLLKKNIRKDTGWRIGGR 99 Y+S G + + VE + ++ + E + +LS K NI K G Sbjct: 199 IQYNSNFRLGTKAQDGVVEALGRLIGNASADPEVINNCIYVLSDFKDNIDKYGSNYSKGN 258 Query: 100 YSFNSVGGG 108 FN + G Sbjct: 259 AVFNLMKGI 267
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 72.8 bits (178), Expect = 3e-17 Identities = 54/255 (21%), Positives = 95/255 (37%), Gaps = 32/255 (12%) Query: 10 VLVTGGTKGIGRATVESFVKAGAKVYGTYFWGDNLDELENHFSQYLNRPVFLQADISDEE 69 +TG +GIG A + GA + + + L+++ + AD+ D Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 70 ITTQLIEKIAQENKKIDILILNAAFAPQFKDTYKFRGLLDSIEHNSWPLITYIDC----- 124 ++ +I +E IDIL+ N A + GL+ S+ W ++ Sbjct: 71 AIDEITARIEREMGPIDILV-NVAGVLRP-------GLIHSLSDEEWEATFSVNSTGVFN 122 Query: 125 -----IKQHFGQYPGYVVAITSEGHRSCHITGYDYVAASKAVLETLTKYIG---ARENII 176 K + G +V + S + Y A+SKA TK +G A NI Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAY-ASSKAAAVMFTKCLGLELAEYNIR 181 Query: 177 INCISPGVVDTEAFELVFGKK--AQAFIRKFDPDF--------IVSPEAVGNVSVALCSG 226 N +SPG +T+ ++ + A+ I+ F + P + + + L SG Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 227 LMDAVRGQVITVDNG 241 + + VD G Sbjct: 242 QAGHITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 120 bits (303), Expect = 2e-35 Identities = 75/251 (29%), Positives = 117/251 (46%), Gaps = 16/251 (6%) Query: 2 NLFISGGASGIGRSVVIAALSKGWNV-GFSYHNNKEGAQQLLDIAVAEFPRQLCRAYQLD 60 FI+G A GIG +V S+G ++ Y+ K A A + D Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPAD 65 Query: 61 VIDSGAVEYVGDRLLVDFSNIDAVVCNAGIDLPGNLVSMTDEDWALVLNTNLTGTFYLIR 120 V DS A++ + R+ + ID +V AG+ PG + S++DE+W + N TG F R Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 YFLPLFLANKYGRIVTL-SSLAKDGSSGQAAYAASKAGLVGLTKTTAKEYGHFGITANVV 179 + + G IVT+ S+ A + AAYA+SKA V TK E + I N+V Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 180 VPGLINTEI-----IGDD-----IKGIKNFFAQYAPVGRLGSPSEVAEVILFLVAKESSY 229 PG T++ ++ IKG F P+ +L PS++A+ +LFLV+ ++ + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 230 VNGAVFNVTGG 240 + V GG Sbjct: 246 ITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 104 bits (261), Expect = 3e-29 Identities = 67/252 (26%), Positives = 106/252 (42%), Gaps = 10/252 (3%) Query: 3 KTILITGALSGIGNTATKLFSEMGYNVVFSGRRPEEGRVILDDLKRINKDVLYVNADMNS 62 K ITGA GIG + + G ++ PE+ ++ LK + AD+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 ESDIKHLIEMTLERFGSLDVAVNCAGTVGETAEIQAVTQDNFHLVFNTNVLGTLLAMKYQ 122 + I + G +D+ VN AG + I +++ + + F+ N G A + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 IPVMVERGKGSIINISSIAGLVGLPSTGIYVASKHAIEGLTKTAALEVATTGVRINSISP 182 M++R GSI+ + S V S Y +SK A TK LE+A +R N +SP Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 183 GPVEGKMFDRFLGHDENNKKAFIE--------MMPNKRFTTQEEVAHTIVFLAEDNVTAI 234 G E M L DEN + I+ +P K+ ++A ++FL I Sbjct: 188 GSTETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 235 TGQTITIDGGYT 246 T + +DGG T Sbjct: 247 TMHNLCVDGGAT 258
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 772 bits (1994), Expect = 0.0 Identities = 302/847 (35%), Positives = 446/847 (52%), Gaps = 42/847 (4%) Query: 7 VGAQRYSFDPNLL-VDGNNNTDTSLFEQGNE-LPGTYLVDIILNGNKVDSTNVTFHSEKS 64 + + F+P L D D S FE G E PGTY VDI LN + + +VTF++ S Sbjct: 42 LSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDS 101 Query: 65 PSGEPFLQSCLTKEQLSRYGVDVDAYPELSPALKNSQTNPCVNL-AAIPQASEEFQFYNM 123 G + CLT+ QL+ G++ + ++ + CV L + I A+ + Sbjct: 102 EQG---IVPCLTRAQLASMGLNTASVSGMNLL----ADDACVPLTSMIHDATAQLDVGQQ 154 Query: 124 QLVLSIPQAALR--PEGEVPIERWDDGITAFLLNYMANISETQFRQNGGYRRSQYIQLYP 181 +L L+IPQA + G +P E WD GI A LLNY + + Q R GG Y+ L Sbjct: 155 RLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRI-GGNSHYAYLNLQS 213 Query: 182 GLNLGAWRVRNATNWS-----QSGDRGGKWQSAYTYATRGIYRLKSRVTLGESYTPGDFF 236 GLN+GAWR+R+ T WS S KWQ T+ R I L+SR+TLG+ YT GD F Sbjct: 214 GLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIF 273 Query: 237 DSIPFRGVMLGDDPNMQPSNQRDFIPVVRGIARSQAQVEIRQNGYLIYSTVVPPGPFELS 296 D I FRG L D NM P +QR F PV+ GIAR AQV I+QNGY IY++ VPPGPF ++ Sbjct: 274 DGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTIN 333 Query: 297 DVIPSKSGSDLHVRVLESNGASQAFIVPYEVPAIALRKGHLRYNLVAGQYRPANADVETP 356 D+ + + DL V + E++G++Q F VPY + R+GH RY++ AG+YR NA E P Sbjct: 334 DIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKP 393 Query: 357 PVAQATVAYGLPWNLTAFIGEQWSRHYQATSAGLGVLLGEYGALSSSITQATSQYHHQQP 416 Q+T+ +GLP T + G Q + Y+A + G+G +G GALS +TQA S Sbjct: 394 RFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQ 453 Query: 417 VKGQAWEVRYNKTLQASDTSFSLVNSQYSTNGFSTLSDVLQSYRQSGSGDNRDKI----- 471 GQ+ YNK+L S T+ LV +YST+G+ +D S + + +D + Sbjct: 454 HDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKP 513 Query: 472 ---DENSRSRDLRNQISAVIGQSLGKFGYLNLNWSRQVYRGPIPAKNSLGIHYNLNVGNS 528 D + + + R ++ + Q LG+ L L+ S Q Y G N + Sbjct: 514 KFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDI 573 Query: 529 
FWALSW--VQNANENKNDRILSLSVSIPLGGHHD---------TYASYRMT-SSNGSNDH 576 W LS+ +NA + D++L+L+V+IP ASY M+ NG + Sbjct: 574 NWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTN 633 Query: 577 EIEMYGQAF-DSRLSWSVRQAEHYGQPNSGHNSGSLRLGWQGSYGNIAGNYYYTPSIRQL 635 +YG D+ LS+SV+ G + ++G L ++G YGN Y ++ I+QL Sbjct: 634 LAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQL 693 Query: 636 SADVSGGAIIHRHGLTLGPQINGTSVLVEVPGVGGVTTTEDRRLKTDFRGYSIVSGLSPY 695 VSGG + H +G+TLG +N T VLV+ PG ++TD+RGY+++ + Y Sbjct: 694 YYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEY 753 Query: 696 QEHDIVLETADLPPDAEVAKTDTKVLPTEGAIVRASFSPQIGAKALMTITRANGQTIPFG 755 +E+ + L+T L + ++ V+PT GAIVRA F ++G K LMT+T N + +PFG Sbjct: 754 RENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFG 812 Query: 756 AMASLVNQSANAAIVDEGGKAYLTGLPETGQLLVQWGKDAGQQCRVDYQLSPAEKGDAGL 815 AM + S ++ IV + G+ YL+G+P G++ V+WG++ C +YQL P E L Sbjct: 813 AMVTS-ESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQL-PPESQQQLL 870 Query: 816 YMLSGVC 822 LS C Sbjct: 871 TQLSAEC 877
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 27.5 bits (61), Expect = 0.019 Identities = 8/31 (25%), Positives = 11/31 (35%), Gaps = 3/31 (9%) Query: 68 TMFTLTMGDTAPHGGWRLIPTGDSKGGYMIS 98 T + + D R I T K MI+ Sbjct: 52 TTYLFGIKDNTVICSLRFIET---KYPNMIT 79
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.8 bits (64), Expect = 0.020 Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 11/73 (15%) Query: 194 WAGRPLPALGDVVEAAHALRDQGIAHVVISLGAEGALWVNASGAWL----AKPPACDVVS 249 W G AL + + A R +G+ ++ E A + G L AC + Sbjct: 86 WNGY---ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYA 142 Query: 250 ----TVGAGDSMV 258 +GA D+M+ Sbjct: 143 KHHFIIGAVDTML 155
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 31.3 bits (71), Expect = 0.009 Identities = 23/122 (18%), Positives = 45/122 (36%), Gaps = 22/122 (18%) Query: 349 GWQIDPVGLRYSLSVLYERYQKPLFIVENGFGAIDKVAADG-------MVHDDYRIAYLK 401 G++ +R L ++ +F + A+ + + G M+ + K Sbjct: 357 GFR----AIRLCLE------KQDIFRTQ--LRALLRASTYGNLKVMFPMIATLEELRQAK 404 Query: 402 AHIEQMKKAVFEDGVDLMGYTPWGC---IDCVSFTTGEYSKRYGFIYVDKNDDGTGTMAR 458 A +++ K + +GVD+ G I + ++K F + ND TMA Sbjct: 405 AIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAA 464 Query: 459 SR 460 R Sbjct: 465 DR 466
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 64.6 bits (157), Expect = 4e-15 Identities = 25/104 (24%), Positives = 41/104 (39%), Gaps = 4/104 (3%) Query: 14 PAQQRILLTAHRLFYQEGIRATGIDKIIKESGVTKVTFYRHFPSKNDLISAFLEYRHQRW 73 +Q IL A RLF Q+G+ +T + +I K +GVT+ Y HF K+DL S E Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70 Query: 74 INWFIEELKQQTLHHA----NLALALTKCMASWFEHPSFRGCAF 113 +E + + + + + + F Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF 114
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 502 bits (1294), Expect = 0.0 Identities = 242/388 (62%), Positives = 287/388 (73%), Gaps = 22/388 (5%) Query: 1 MKLRVLSFIIPALLVAGSASAAEIYNKDGNKLDLYGKIDGLHYFSDNKNLDGDQSYMRFG 60 MK +VL+ +IPALL AG+A AAEIYNKDGNKLDLYGK+DGLHYFSD+ + DGDQ+YMR G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 LKGETQITDQLTGYGQWEYQVNLNKAENEDGNHDSFTRVGFAGLKFADYGSLDYGRNYGV 120 KGETQI DQLTGYGQWEY V N E E N S+TR+ FAGLKF DYGS DYGRNYGV Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGAN--SWTRLAFAGLKFGDYGSFDYGRNYGV 118 Query: 121 LYDVTSWTDVLPEFGGDTYG-ADNFLSQRGNGMLTYRNTNFFGLVDGLNFALQYQGKNGS 179 LYDV WTD+LPEFGGD+Y ADN+++ R NG+ TYRNT+FFGLVDGLNFALQYQGKN S Sbjct: 119 LYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNES 178 Query: 180 SS---------ETNNGRGVADQNGDGYGMSLSYDLGWGVSASAAMASSLRTTAQNDLQ-- 228 S NNG + NGDG+G+S +YD+G G SA AA +S RT Q + Sbjct: 179 QSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGT 238 Query: 229 YGQGKRANAYTGGLKYDANNVYLAANYTQTYNLTRFGDFSNRSSDAAFGFADKAHNIEVV 288 G +A+A+T GLKYDANN+YLA Y++T N+T +G G A+K N EV Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYG---KTDKGYDGGVANKTQNFEVT 295 Query: 289 AQYQFDFGLRPSVAYLQSKGKDIGI----YGDQDLLKYVDIGATYFFNKNMSTYVDYKIN 344 AQYQFDFGLRP+V++L SKGKD+ D+DL+KY D+GATY+FNKN STYVDYKIN Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 345 LLDKND-FTKNARINTDDIVAVGMVYQF 371 LLD +D F K+A I+TDDIVA+GMVYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.4 bits (79), Expect = 6e-04 Identities = 56/301 (18%), Positives = 102/301 (33%), Gaps = 15/301 (4%) Query: 25 FIAGLGMAAWAPLVPFAKARIGLND---ASLGLLLLCIGIGSMLAMPLTGVLTAKWGCRA 81 + +G+ P++P + ++ A G+LL + P+ G L+ ++G R Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74 Query: 82 VILLAGAVLCLDLPLLVLMNTPATMAIALLVFGAAMGIIDVAMNIQAVIVEKASGRAMMS 141 V+L++ A +D ++ + I +V G VA A I RA Sbjct: 75 VLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT-DGDERARHF 133 Query: 142 GFHG-LFSVGGIVG------AGGVSALLWLGLNPLTAIMATVVLMIILLLAAN---KNLL 191 GF F G + G GG S + + +L + + L Sbjct: 134 GFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLR 193 Query: 192 RGSGEPHDGPLFVFPRGWVMFIGFLCFVMFLAEGSMLDWSAVFLTTLRGMSPSQAGMGYA 251 R + P + V + + F+M L +F + G+ A Sbjct: 194 REALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA 253 Query: 252 VFAIAMTLGR-LNGDRIVNGLGRYKVLLGGSLCSAIGIIIAISIDSSMAAIIGFMLVGFG 310 F I +L + + + LG + L+ G + G I+ A +L+ G Sbjct: 254 AFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313 Query: 311 A 311 Sbjct: 314 G 314
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 26.5 bits (58), Expect = 0.010 Identities = 5/41 (12%), Positives = 18/41 (43%), Gaps = 6/41 (14%) Query: 4 LSWIIFGLIAGILAKWIMP------GEDGGGFIMTIILGII 38 + I+ G I+G++ W+ ++ ++ ++ + Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 203
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 44.0 bits (104), Expect = 4e-07 Identities = 32/196 (16%), Positives = 67/196 (34%), Gaps = 24/196 (12%) Query: 17 IRFALLSFLLLSTGISVAPLAIARGSAVEVKGTAPLELASGSAM---VVDLQTNKVIYAN 73 +R+ L + L + +A S ++ E + +DL + + + A Sbjct: 1 MRYIRLCIISLLATLPLA----VHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAW 56 Query: 74 NADKVVPIASITKLMTAMVVLD----AKLPLDEILSVDIDQTKELKGVFSRVRVNSEISR 129 AD+ P+ S K++ VL L+ + + V S + ++ Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPV-SEKHLADGMTV 115 Query: 130 KDMLLLTLMSSENRAAASLAHHY--PGGYNAFIKAMNAKAKSL-----GMSSTHYVEPTG 182 ++ + S+N AA L P G AF++ + L ++ + Sbjct: 116 GELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDAR- 174 Query: 183 LSINNVSTARDLAKLL 198 + +T +A L Sbjct: 175 ----DTTTPASMAATL 186
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 28.5 bits (63), Expect = 0.041 Identities = 17/93 (18%), Positives = 40/93 (43%), Gaps = 8/93 (8%) Query: 78 KSIVDKNITTVSGDVNAVNSTIGKNIKTVSGSIEVEQSTVSGNLETTSGRI------DID 131 +N+ ++ ++ A N +I +G +S +G + T+ ++ + Sbjct: 753 SLYSGRNVANITSNITASNKA-QVHIGYKTGDTVCVRSDYTGYVTCTTDKLSDKALNSFN 811 Query: 132 TTKINGNVH-TTSGSISLNDSTIDGSVTCKAGS 163 T + GNV+ T S + L + + G++ + S Sbjct: 812 PTNLRGNVNLTESANFVLGKANLFGTIQSRGNS 844
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 519 bits (1337), Expect = 0.0 Identities = 171/475 (36%), Positives = 254/475 (53%), Gaps = 28/475 (5%) Query: 5 KAHILVVDDDLSHCTIIQALMKGWGYQTTPAHNGLEAIELAKEIPFDLILTDVRMSEMDG 64 A ILV DDD + T++ + GY N DL++TDV M + + Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 65 IEALKAIKAYNPAIPILIMTAYSNVESAVEAIKAGAYDYLTKPLDFDMLQLTLERALEHT 124 + L IK P +P+L+M+A + +A++A + GAYDYL KP D L + RAL Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE- 121 Query: 125 HLKNENKTLKQQIISNQNIIGRSPQMRYLMDMVGMIAPSEATVLICGESGTGKEIIARSV 184 K L+ ++GRS M+ + ++ + ++ T++I GESGTGKE++AR++ Sbjct: 122 -PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180 Query: 185 HANSSRKDQPLVIVNCAALSESLLESELFGHEKGAFTGADKRREGRFMEAHKATLFLDEI 244 H R++ P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A TLFLDEI Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240 Query: 245 GEISGLMQAKLLRAIQEREIQRVGSNQTLAIDVRLIAATNRNLKADVDSGKFRQDLYYRL 304 G++ Q +LLR +Q+ E VG + DVR++AATN++LK ++ G FR+DLYYRL Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300 Query: 305 NVVTIDTPALRERSEDIPPLSMHFLEKFALKNRKSIKGFTPQAMNMLLKYNWPGNVRELE 364 NVV + P LR+R+EDIP L HF+++ K +K F +A+ ++ + WPGNVRELE Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELE 359 Query: 365 NTVERAVILLTGDFISEKELPLNINHYIQENAGSENIGYEDAEKP--------------- 409 N V R L D I+ + + + I ++ + + Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419 Query: 410 ----------IQSLEWVEIDAILTALEKTGGNKTEAAKHLGITRKTLQAKLQKRN 454 + L +E IL AL T GN+ +AA LG+ R TL+ K+++ Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.017 Identities = 43/270 (15%), Positives = 91/270 (33%), Gaps = 58/270 (21%) Query: 322 EGLIIPLSISV-ANIVNHNGSFLGNIFIFRDMREVRQLQEEIRRKEKLAAIGNLAAGVA- 379 +PL++S+ N+V + F + + +Q + + + +A L A A Sbjct: 110 VAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQ 169 Query: 380 ---HEIRNPLSSIKGFAKYFEGHSPQGSEEQELAKVMIKEVDRLNRAVTELLGLVRPSDL 436 H + N L++I+ E+ A+ M+ + L R Sbjct: 170 INPHFMFNALNNIRALIL----------EDPTKAREMLTSLSELMRYSLR--------YS 211 Query: 437 RIQLVNINEIIAH-----SLHLIRQDADSKKITIQFISNENLPRVEIDPDRFTQALL-NL 490 + V++ + + L I+ ++ + N + V++ P Q L+ N Sbjct: 212 NARQVSLADELTVVDSYLQLASIQF---EDRLQFENQINPAIMDVQV-PPMLVQTLVENG 267 Query: 491 YLNAIQAMGRAGTLEIALALVEESKLRISVIDTGKGIRAEDLENIFNPYFTTKASGTGLG 550 + I + + G + + + + + V +TG E TG G Sbjct: 268 IKHGIAQLPQGGKILLK-GTKDNGTVTLEVENTGSLALKNTKE------------STGTG 314 Query: 551 LAIVQK------------VIEEHQGRITVT 568 L V++ + E QG++ Sbjct: 315 LQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 54.2 bits (130), Expect = 2e-11 Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 12/169 (7%) Query: 5 NEVGMHEASIAQIAKRAGVSNGIISHYFRDKNGLLEATMRYLIRHLGEAVKQHLAVLSVN 64 ++ G+ S+ +IAK AGV+ G I +F+DK+ L ++GE Sbjct: 25 SQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELE-LEYQAKFPG 83 Query: 65 DPRARLRAIAEGNFDDSQINSAAMKTWLAFWASSMHS----PQLYRLQQVNNRRLYSNLC 120 DP + LR I + + + + + + + Q+ Y + Sbjct: 84 DPLSVLREILIHVLEST-VTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIE 142 Query: 121 AEFKRCLPREQ------AQLAAKGMAGLIDGLWLRSALSGEHFNRQEAL 163 K C+ + + AA M G I GL + + F+ ++ Sbjct: 143 QTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEA 191
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 30.3 bits (68), Expect = 0.005 Identities = 21/112 (18%), Positives = 40/112 (35%), Gaps = 10/112 (8%) Query: 152 YNVAVSLALEKKQYDQAITAFQSFVKQYPKSTYQPNANYWLGQLYYNKGKKDDAAYYYAV 211 Y++A + + +Y+ A FQ+ Y LG G+ D A + Y+ Sbjct: 40 YSLAFN-QYQSGKYEDAHKVFQALCVLDH---YDSRFFLGLGACRQAMGQYDLAIHSYSY 95 Query: 212 VVKNYPKSPKSSEAMFKVGVIMQDKGQSDKAKA---VYQQVIKQYPNTDAAK 260 K P+ F + KG+ +A++ + Q++I Sbjct: 96 GAIMDIKEPRFP---FHAAECLLQKGELAEAESGLFLAQELIADKTEFKELS 144
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 116 bits (291), Expect = 6e-34 Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 4/119 (3%) Query: 50 EEQARLQMQELQKNNIVYFGFDKYDIGSDFAQMLDAHAAFLRSN--PSYKVVVEGHADER 107 +Q + + V F F+K + + LD + L + VVV G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 108 GTPEYNIALGERRASAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAFAKNRRAVL 166 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGNTCDN-VKQRAALI 321
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.5 bits (146), Expect = 7e-12 Identities = 29/193 (15%), Positives = 63/193 (32%), Gaps = 4/193 (2%) Query: 69 YNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQEDAK---LA 125 YN + +++ Q E R+ E ++ Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 126 AEEQKKQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKA 185 AE K++ +K + + A+ +++A A + K + E A + ++ + + + Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100 Query: 186 QAEAQKKAEAEAKKEAA-VAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAA 244 + A + E +AK E K + K+ + A+ A + K+ + Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160 Query: 245 AKKVAAAAEAKKK 257 A E K Sbjct: 1161 QTNTTADTEQPAK 1173 Score = 52.4 bits (125), Expect = 2e-09 Identities = 22/199 (11%), Positives = 68/199 (34%), Gaps = 5/199 (2%) Query: 72 QQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKELEKERLQAQ-EDAKLAAEEQK 130 +Q+ ++ EQ + Q E ++ ++ + + E + ++ ++ + ++ Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103 Query: 131 KQVAEQQKQIAEQQKQAAEQQKIAAAAVAKAKEEQKQAETAAAQAKAEADKIVKAQAEAQ 190 V +++K E +K + + + + + E Q + A+ I + Q++ Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163 Query: 191 KKAEAEAKKE----AAVAAAAKKQADADAKKAVEVAEKAAADAAEKKAAADAEKKAAAAK 246 A+ E + + VE E + +++ K Sbjct: 1164 TTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH 1223 Query: 247 KVAAAAEAKKKAAAEAAAS 265 + + + A +++ Sbjct: 1224 RRSVRSVPHNVEPATTSSN 1242 Score = 45.1 bits (106), Expect = 5e-07 Identities = 32/218 (14%), Positives = 65/218 (29%), Gaps = 10/218 (4%) Query: 52 GEVIDAVMVDPGAVTEQYNRQQQQQTDAKRAEQQRQKKAEQQAEELQQKQAAEQQRLKEL 111 EV + A T+ Q + + ++ A + EE + + + Q Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ----- 1120 Query: 112 EKERLQAQEDAKLAAEEQKKQVAEQQKQ--IAEQQKQAAEQQKIAAAAVAKAKEEQKQAE 169 E ++ +Q K E + AE ++ K+ Q A AKE E Sbjct: 1121 
EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180 Query: 170 TAAAQAKAEADKIVKAQAEAQKKAEAEAKKEAAVAAAAKKQADADAKKAVEVAEKA-AAD 228 ++ + E + + + ++ K + + V A Sbjct: 1181 QPVTESTTV--NTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238 Query: 229 AAEKKAAADAEKKAAAAKKVAAAAEAKKKAAAEAAAST 266 + + A + A ++A+ KA A Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVG 1276
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 27.2 bits (60), Expect = 0.033 Identities = 11/24 (45%), Positives = 18/24 (75%) Query: 150 LKARTLIQVLEPIKARGALETDLL 173 LKA +I +L+ IK+ GAL+ +L+ Sbjct: 348 LKADGIIAILQGIKSAGALQAELV 371
>PF06872#EspG protein Length = 398 Score = 25.1 bits (54), Expect = 0.031 Identities = 8/41 (19%), Positives = 27/41 (65%), Gaps = 1/41 (2%) Query: 11 LYSETCRVVGDTVLALHALGLPIDVESII-DSITAQRSHRS 50 ++ +T R +G++ L+L+ + +P D + ++ +++ + ++ S Sbjct: 202 VFMDTSRGLGNSKLSLNGVDIPADAQKLLRNTLGLKDTNSS 242
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 25.2 bits (55), Expect = 0.024 Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 2/30 (6%) Query: 8 KHPHVELCDLLKLQ--GWNDSGASAKAAIA 35 K P +E D+ +L W D G+S K IA Sbjct: 110 KLPDMEELDMKELSYLSWIDKGSSRKFIIA 139
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.6 bits (77), Expect = 0.001 Identities = 17/151 (11%), Positives = 49/151 (32%), Gaps = 10/151 (6%) Query: 299 RSQLNYSEENLKQARASLERLYTALRGTDANATPAGGVEFEARFRTAMDDDFNTPEAY-- 356 + ++ +L QAR R R + N P + E F+ +++ + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 357 SVLFDIAREVNRLK---NEDMAAANGLAAELRKLAQVLGLLEQDPELFLQGGAQ-ADDDE 412 + + + ++ A + A + + + + + L + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR----LDDFSSLLHKQA 248 Query: 413 VAKIEALIKQRNDARSSKDWALADAARDQLN 443 +AK L ++ + + + + +Q+ Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 27.1 bits (60), Expect = 0.028 Identities = 26/140 (18%), Positives = 55/140 (39%), Gaps = 2/140 (1%) Query: 11 QPVNVSVKRTSFSILGAISVSHLLNDMIQSLILAIYPLLQAE-FSLSFAQIGLITLTYQL 69 P+ +++ A+ + ++ + A++ + + F IG+ + + Sbjct: 198 NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGI 257 Query: 70 TASLLQPLI-GLYTDKHPQPYSLPIGMGFTLSGILLLAVATTFPVVLLAAALVGTGSSVF 128 SL Q +I G + + +L +GM +G +LLA AT + L+ +G Sbjct: 258 LHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317 Query: 129 HPESSRVARMASGGRHGCAQ 148 + ++R R G Q Sbjct: 318 PALQAMLSRQVDEERQGQLQ 337
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.016 Identities = 26/118 (22%), Positives = 41/118 (34%) Query: 136 IGGPLGDKIGRKYVIWGSILGVAPFTLALPYASLYWTGILTVFIGVILASAFSAILVYAQ 195 + G L D+ GR+ V+ S+ G A + A W + + I + + Y Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121 Query: 196 ELIPGKVGMVSGLFFGFAFGMGGIGAAVLGYVADLTSIELVYQICAFLPLLGIFTALL 253 ++ G F FG G + VLG + S + A L L T Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 67.1 bits (164), Expect = 9e-15 Identities = 58/329 (17%), Positives = 126/329 (38%), Gaps = 51/329 (15%) Query: 1 MKIALIGGSGFIGTNLARLLIDNSVDFSILDKVKS--DVYPER------------WVYCD 46 MK + G +GFIG ++++ L++ +D + DV ++ + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 47 VTDYDSLISTLI---GHDLIINLAAEH--KDNV-NPISLYYQVNVEGAKNICRAADSLNI 100 + D + ++ L + + + ++ NP + Y N+ G NI I Sbjct: 61 LADRE-GMTDLFASGHFERVFISPHRLAVRYSLENPHA-YADSNLTGFLNILEGCRHNKI 118 Query: 101 KNIVFTSSVAVYGFVEKD--TDESGKYAPFNHYGKSKLEAEKVYDSWFNSSADKKLVTLR 158 +++++ SS +VYG K + + P + Y +K E + + ++ LR Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHT-YSHLYGLPATGLR 177 Query: 159 PTVVFGIGNRGN--VYNLFKQIASGKFVMI-GRGENEKSMAYVENIAAFLVLTLSFP--- 212 V+G R + ++ K + GK + + G+ ++ Y+++IA ++ Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHA 237 Query: 213 ---------------AGYHLINYVDKPDFTMNELANVIYTCLGKKSKIVRVPYFFG--LF 255 A Y + N + + + + LG ++K +P G L Sbjct: 238 DTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLE 297 Query: 256 AGYIFDLLAKITGKELPVSSIR--IKKFC 282 L ++ G P ++++ +K F Sbjct: 298 TSADTKALYEVIGFT-PETTVKDGVKNFV 325
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 82.5 bits (204), Expect = 4e-20 Identities = 59/352 (16%), Positives = 122/352 (34%), Gaps = 59/352 (16%) Query: 5 RVFIAGHRGMVGSAIVRQLENRND--------------------IELIIRDR---TELDL 41 + + G G +G + ++L +EL+ + ++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 42 MSQSAVQKFFATEKIDEIYLAAAKVGGIQANNNYPAEFIYQNLMIECNIIHAAHLAGIQK 101 + + FA+ + ++++ ++ + N P + NL NI+ IQ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLEN-PHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 LLFLGSSCIYPKLAAQPMTEEALLTGVLEPTNEP---YAIAKIAGIKLCESYNRQYGRDY 158 LL+ SS +Y P + + + + P YA K A + +Y+ YG Sbjct: 121 LLYASSSSVYGLNRKMPFSTD-------DSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173 Query: 159 RSVMPTNLYGENDNFHPENSHVIPALLRRFHEAKIRNDKEMVVWGTGKPMREFLHVDDMA 218 + +YG P + + K + V+ GK R+F ++DD+A Sbjct: 174 TGLRFFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 219 AASVHVMELSDQIYQTNTQPMLSH------------INVGTGVDCTIRELAETMAKVVGF 266 A ++ L D I +TQ + N+G + + + + +G Sbjct: 225 EA---IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI 281 Query: 267 TGNLVFDSTKPDGTPRKLMDVSRLAK-LGWCYQISLEVGLTMTYQWFLAHQN 317 +P D L + +G+ + +++ G+ W+ Sbjct: 282 EAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 103 bits (258), Expect = 5e-27 Identities = 79/364 (21%), Positives = 128/364 (35%), Gaps = 64/364 (17%) Query: 6 LITGITGQDGSYLAEFLLEKGYEVHGIKRRASSFNTSRIDHIYQDRHET--NPRFFLHYG 63 L+TG G G ++++ LLE G++V GI + N + Q R E P F H Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 64 DLTDTSNLIRLVQEIQPDEIYNLGAQSHVAVSFESPEYTADVDAMGTLRLLEAIRINGLE 123 DL D + L + ++ + V S E+P AD + G L +LE R N ++ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 124 KKTRFYQASTSELYGLVQETPQRETTPF-YPRSPYAVAKMYAYWITVNYRESYGMYACNG 182 AS+S +YGL ++ P +P S YA K + Y YG+ A Sbjct: 120 ---HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176 Query: 183 ILFNHESPRRGETFVTRKITRAVANIALGLEKCLYLGNIDSLRDWGHAKDYV----RMQW 238 F P K T+A+ G +Y RD+ + D R+Q Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQD 232 Query: 239 MMLQQDKPED---------------FVIATGKQITVREFVRMSAREAGIELEFSGEGVEE 283 ++ D + I + + ++++ GIE Sbjct: 233 VIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIE---------- 282 Query: 284 VATVVAINGNHISSVNIGDVIVRVDPRYFRPAEVETLLGDPTKAKKVLGWVPEITVEEMC 343 N + +P +V D +V+G+ PE TV++ Sbjct: 283 ------AKKNMLP---------------LQPGDVLETSADTKALYEVIGFTPETTVKDGV 321 Query: 344 AEMV 347 V Sbjct: 322 KNFV 325
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 28.5 bits (63), Expect = 0.048 Identities = 16/67 (23%), Positives = 23/67 (34%), Gaps = 5/67 (7%) Query: 217 HYLPGRYHGLGRLSDEALNEA-----YNSAYALLYPSSYEGFGIPILEAMSAGCPVISVN 271 HYL GRY G + Y A + S GI ++++ G N Sbjct: 506 HYLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLAYDRNN 565 Query: 272 VSSIPEV 278 S+ V Sbjct: 566 RMSLYGV 572
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 61.0 bits (148), Expect = 8e-13 Identities = 58/321 (18%), Positives = 114/321 (35%), Gaps = 70/321 (21%) Query: 1 MKILITGVSGYLGSQLANALMLE-HEVAGTVRAGSVCNRITDIGNVNL------------ 47 MK L+TG +G++G ++ L+ H+V G + + D +V+L Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGI-------DNLNDYYDVSLKQARLELLAQPG 53 Query: 48 -----INVTDSGWIDKVL-SFSPDVVINTVALYGRKGELLS--ELVDANIQFPLRILE-- 97 I++ D + + S + V + + L + D+N+ L ILE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 98 --------MLVST----GKGLFFQCGTSLPAD--VSQYALTKNQFTELAREYCNKFSGKF 143 + S+ G T D VS YA TK +A Y + + Sbjct: 114 RHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173 Query: 144 IELKLEHFFGPFDDST----KFTTYVINSCRSHSDLKL-TAGLQRRDFIYINDLINA--- 195 L+ +GP+ KFT + + + G +RDF YI+D+ A Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFT----KAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229 Query: 196 ------------FKIMISKSESLISGESISIGSGHAVTIKEFVETVAKMTSYQGNLQFGA 243 + + S+ +IG+ V + ++++ + + Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM-- 287 Query: 244 IPTRENELMYSCASLARIQEL 264 +P + +++ + A + E+ Sbjct: 288 LPLQPGDVLETSADTKALYEV 308
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 72.5 bits (178), Expect = 2e-16 Identities = 64/352 (18%), Positives = 118/352 (33%), Gaps = 48/352 (13%) Query: 11 RVFVTGHTGFKGGWLSLWLQTMGATVKGYSLTAPTVPSLFETARVA----DGMQSEIGDI 66 + VTG GF G +S L G V G + AR+ G Q D+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 67 RDQNKLLESIREFQPEIVFHMAAQPLVRLSYSEPVETYSTNVMGTVYLLEAIRHVGGVKA 126 D+ + + E VF + VR S P +N+ G + +LE RH ++ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-KIQH 120 Query: 127 VVNITSDKCYDNKEWIWGYRENEAMGGYDPYSNSKGCAELVTSSYRNSFFNPAN------ 180 ++ +S Y + ++ Y+ +K EL+ +Y + + PA Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180 Query: 181 -YGQHG----------TAVATVRAGNVIGGGDWA-----LDRIVPDILRAFEQSQPVIIR 224 YG G A+ ++ +V G +D I I+R + Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI------ 234 Query: 225 NPHAIRPWQHVLEPLSGYLLLAQKLYTDGAEYAEGWNFGPNDADATPVKNIVEQMVKYWG 284 PHA W + T A A + ++ + + ++ + G Sbjct: 235 -PHADTQW-------------TVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280 Query: 285 EGASWQLDGNAHPHEAHYLKLDCSKAKMQLGWHPRWNLNTTLEYIVGWHKNW 336 A + P + D +G+ P + ++ V W++++ Sbjct: 281 IEAKKNML-PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.4 bits (94), Expect = 4e-05 Identities = 28/235 (11%), Positives = 58/235 (24%), Gaps = 21/235 (8%) Query: 35 SEVQSQLDLLSKQKILSPAEKLAQQDLTQTLE-YLDTIERTKQEANQLKQQLAQAPAKLR 93 S + +L K ++ + LE L+ + + L A L Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154 Query: 94 QATEGLE-ALKSSSADTMTKESLANYSLRQLESRLNETLDNLQSAQEDLSAYNSQLIALQ 152 LE AL+ + + +++ + + + L Sbjct: 155 ARKADLEKALEGAMNFS-----------TADSAKIKTLEAEKAALEARQAELEKALEGAM 203 Query: 153 TQPERVQSAMYSASMRLMQIRNQLNGLTPNQESLRPTQQ--QELLAEQVMLNGQLDLERK 210 + + + + + L E + L+ + Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263 Query: 211 NLEANTTLQDLLQKQRDYTTAHINQLERYVQLLQEVVSGKRLILSEKTVKEAQAQ 265 LE + +A I LE L+ K + + V A Q Sbjct: 264 ELEK---ALEGAMNFSTADSAKIKTLEAEKAALEAE---KADLEHQSQVLNANRQ 312 Score = 32.0 bits (72), Expect = 0.016 Identities = 36/201 (17%), Positives = 72/201 (35%), Gaps = 33/201 (16%) Query: 37 VQSQLDLLSKQKILSPAEKLAQQDLTQTLEYLDTIERTKQEANQLKQQ------------ 84 + L ++Q L A + A T + T+E K K Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311 Query: 85 --LAQAPAKLRQATEGLEA-----------LKSSSADTMTKESLANYSLRQLESRLNETL 131 L + R+A + LEA ++S + + +QLE+ + Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLE 371 Query: 132 DNLQSAQEDLSAYNSQLIALQTQPERVQSAMYSASMRLMQIRNQLNGLTPNQESLRPTQQ 191 + + ++ + L A + ++V+ A+ A+ +L + L +ES + T++ Sbjct: 372 EQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKEL---EESKKLTEK 428 Query: 192 QELLAEQVMLNGQLDLERKNL 212 E+ L +L+ E K L Sbjct: 429 -----EKAELQAKLEAEAKAL 444
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 26.4 bits (58), Expect = 0.033 Identities = 9/71 (12%), Positives = 27/71 (38%) Query: 47 IAGLNGQQPREGYNLQQMLEILTAQNVPIKLCKTCADARGIAGLTLVGGVEIGTLVELAQ 106 I +N ++ ++ ++E L VP ++ D R + ++ + I + Sbjct: 222 IWEINTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDS 281 Query: 107 WTLAAEKVLTF 117 ++ ++ Sbjct: 282 IAEQGKEGDSY 292
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 165 bits (420), Expect = 7e-54 Identities = 135/210 (64%), Positives = 164/210 (78%) Query: 1 MARKTKQKAEETRQQILDAAVREFSAHGVSRTSLTDIAIAAGVTRGAIYWHFKNKVDLFN 60 MARKTKQ+A+ETRQ ILD A+R FS GVS TSL +IA AAGVTRGAIYWHFK+K DLF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVWELSESKIDQLEIEYQAKYPDNPLRILRELLIYILVSTREDRRRRALMEIVFHKCEFV 120 E+WELSES I +LE+EYQAK+P +PL +LRE+LI++L ST + RRR LMEI+FHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMTSVHDARKVLDLASYERIESVLQGCIDANQLPVNLNTHRAAIIMRAYITGLMENWLF 180 GEM V A++ L L SY+RIE L+ CI+A LP +L T RAAIIMR YI+GLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 MPESFDIKQEAPVLIDAYLEMLGQSFSLRN 210 P+SFD+K+EA + LEM +LRN Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRN 210
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.8 bits (93), Expect = 1e-05 Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 45/166 (27%) Query: 96 QIDPATYQAAYDSAKGDLAKAQASAQIAHLTVNRYKPLLGTNYISKQ---EYDQALSDAQ 152 +++ +A + + + + +++ ++ + LL I+K E + +A Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265 Query: 153 QADATVLAAKAALES----------------------------------------ARINL 172 + +ES Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325 Query: 173 AYTQVRSPISGRTGKSAV-TEGALVTSGQASAMTTVQQLDPMYVDV 217 + +R+P+S + + V TEG +VT+ + M V + D + V Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTA 370
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1344 bits (3479), Expect = 0.0 Identities = 807/1032 (78%), Positives = 919/1032 (89%) Query: 1 MAKFFIDRPIFAWVIAIIIMLAGALAIMKLPVAQYPTIAPPAITIAANYPGADATTVQNT 60 MA FFI RPIFAWV+AII+M+AGALAI++LPVAQYPTIAPPA++++ANYPGADA TVQ+T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLLYMSSSSDSSGNVQLTLTFNSGTDPDIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNL+YMSS+SDS+G+V +TLTF SGTDPDIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVAGFISEDGTMQQEDIADYVGSNIKDPISRTPGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMVAGF+S++ Q+DI+DYV SN+KD +SR GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMDPHKLNNYKLTPVDVINAIKIQNNQVAAGQLGGTPPVPGQELNSSIIAQTRL 240 QYAMRIW+D LN YKLTPVDVIN +K+QN+Q+AAGQLGGTP +PGQ+LN+SIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TNAEEFSQILLKVNTDGSQVRLKDVAIVKLGAESYNIIARYNGKPAAGIGIKLATGANAL 300 N EEF ++ L+VN+DGS VRLKDVA V+LG E+YN+IAR NGKPAAG+GIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 NTSAAVKAELAKLQPFFPSGLTVVYPYDTTPFVKISINEVVKTLIEAIILVFLVMYLFLQ 360 +T+ A+KA+LA+LQPFFP G+ V+YPYDTTPFV++SI+EVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAILSAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFAIL+AFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 QEEGLPPKEATKKSMEQIQGALVGIALVLSAVFVPMAFFGGATGAIYRQFSITIVSAMVL 480 E+ LPPKEAT+KSM QIQGALVGIA+VLSAVF+PMAFFGG+TGAIYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIKKGDHGPKTGFFGWFNNMFEKSTHHYTDSVANILRSTGRY 540 SVLVALILTPALCAT+LKP+ H K GFFGWFN F+ S +HYT+SV IL STGRY Sbjct: 481 
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LVIYLAIVIGMAVLFMRLPSSFLPEEDQGVFLTMVQLPAGATQERTQKVLNQVTDYYLDK 600 L+IY IV GM VLF+RLPSSFLPEEDQGVFLTM+QLPAGATQERTQKVL+QVTDYYL Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKNVVNSVFTVNGFGFSGQGQNTGLAFVSLKNWDERKGEQNKVPAIVSRASAAFSKIKDG 660 EK V SVFTVNGF FSGQ QN G+AFVSLK W+ER G++N A++ RA KI+DG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 MVFAFNLPAIVELGTATGFDFQLIDQGNLGHQQLTDARNQLLGMAAQHPDMLVGVRPNGL 720 V FN+PAIVELGTATGFDF+LIDQ LGH LT ARNQLLGMAAQHP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKVEVDQEKAQALGVAISDINTTLGSAMGGSYVNDFIDRGRVKKVYVQADAPFRM 780 EDT QFK+EVDQEKAQALGV++SDIN T+ +A+GG+YVNDFIDRGRVKK+YVQADA FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPDDIDKWYVRNNMGQMVSFATFSTAKWEYGSPRLERYNGLPSMEILGQAAPGKSTGEAM 840 LP+D+DK YVR+ G+MV F+ F+T+ W YGSPRLERYNGLPSMEI G+AAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 DLMQELAAKLPSGVGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 LM+ LA+KLP+G+GYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAATLRGLENDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGLV 960 MLVVPLG+VG LLAATL +NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+V Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 ESTLESVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMITATVLAIFF 1020 E+TL +VRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGM++AT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPLFFVVVRRRF 1032 VP+FFVV+RR F Sbjct: 1021 VPVFFVVIRRCF 1032
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 90.3 bits (224), Expect = 4e-23 Identities = 31/119 (26%), Positives = 58/119 (48%), Gaps = 2/119 (1%) Query: 4 RILVVEDEAPIREMVCFVLEQNGYQPLEAEDYDSAVARLSEPFPDLVLLDWMLPGGSGIQ 63 ILV +D+A IR ++ L + GY + + ++ DLV+ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHMKREALTRDIPVMMLTARGEEEDRVRGLEVGADDYITKPFSPKELVARIKAVMRR 122 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.7 bits (103), Expect = 4e-06 Identities = 22/226 (9%), Positives = 70/226 (30%), Gaps = 21/226 (9%) Query: 741 QQLALITERQKNAQQTYQQLQSQYQHQQEALIAQQQVLNHTLTELSLSVPDADQQQNWLA 800 L +T A + QS + Q + + D+ Sbjct: 122 DVLLKLTALGAEAD--TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNV 179 Query: 801 QREEECQRWQQHQQEQQRLTIEQKTLETRIENERRHLQECIDQLSALSQQRQQAETLLQQ 860 EE + +++ ++ E ++ +R + +++ + + + Sbjct: 180 SEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR----VEKS 235 Query: 861 QIQQRRALFGEDIVAEVRQRLRLQQQQAELAQQNAEKALQQAQSQLNRLSGELTGLEQQC 920 ++ +L + + + + +Q+ + +A ++L +L +E + Sbjct: 236 RLDDFSSLLHKQAI----AKHAVLEQENKYV---------EAVNELRVYKSQLEQIESEI 282 Query: 921 QQYQQRATTTQAELQQALSTSEFADETALTAALLSEEERQHLQQLQ 966 ++ + + +T LL+ E ++ ++ Q Sbjct: 283 LSAKEEYQLVTQLFKN--EILDKLRQTTDNIGLLTLELAKNEERQQ 326 Score = 41.7 bits (98), Expect = 2e-05 Identities = 32/222 (14%), Positives = 74/222 (33%), Gaps = 8/222 (3%) Query: 321 QYLAQLTPLT--QAVEQATAARQQQQLNQHEQETLIEQRIVPLDNLITQQQQTLSQLAGQ 378 L +LT L + ++ Q +L Q + L + + + Q + Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 379 IQQLRAKEQQNSQQLALNEQKLLQTHQRLQQLADYANLHAHHQHWEKHLPLWHEQFRQLQ 438 + LR Q QK + ++ A+ + A +E + + Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 439 LQQQQSAQSEQQLHQQTTLLATLQQQATTLSAQEKQQQVALAEARAQASYLQQKL--LVL 496 + A ++ + +Q + +Q +Q + + A+ + + Q +L Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301 Query: 497 EQ----QQPSAQLRQQLNEFNEQRQICQQLAALSPLAQQIQA 534 ++ L +L + E++Q A +S QQ++ Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKV 343 Score = 38.7 bits (90), Expect = 1e-04 Identities = 36/222 (16%), Positives = 72/222 (32%), Gaps = 30/222 (13%) Query: 458 LATLQQQATTLSAQEKQQQVALAEARAQASYLQQKLLVLEQQ----QPSAQLRQQLNEFN 513 L L +A TL Q Q L + R Q +L L + +P Q + Sbjct: 127 LTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLR 186 
Query: 514 EQRQICQQLAALSPLAQQIQALYDKQQQQFTAQQQQLKQLEQQ---LTEKRQLYQQ-QKQ 569 I +Q + Q + DK++ + ++ + E + + + Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK 246 Query: 570 HLVDLEALLEREKQIVTLEAERAKLQPGDACPLCGAVEHPAITAYQAVKPSETAVRVAKL 629 + A+LE+E + V E + ++ E+ + AK Sbjct: 247 QAIAKHAVLEQENKYVEAVNELRVYK-------------------SQLEQIESEILSAKE 287 Query: 630 RL-QVEQLYTEGTELRTQVASMQQHQQRIEQELQDHRQQLAA 670 V QL+ E+ ++ + + EL + ++ A Sbjct: 288 EYQLVTQLFKN--EILDKLRQTTDNIGLLTLELAKNEERQQA 327 Score = 37.9 bits (88), Expect = 3e-04 Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 19/206 (9%) Query: 658 EQELQDHRQQLAAYQQRWQTLAQPLSL----AFTLNEPDALALWLEQHEQQEQACQLKLV 713 + Q Q Q R+Q L++ + L L + E+ + + Sbjct: 136 TLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL----- 190 Query: 714 EYERLTQQYQQAKDILTQLEQRQQEHQQQLALITERQKNAQQTYQQLQSQYQHQQEALIA 773 + +Q+ ++ Q E + + + + R + + +S+ +L+ Sbjct: 191 ----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS-SLLH 245 Query: 774 QQQVLNHTLTELSLSVPDADQQQNWLAQREEECQRWQQHQQEQQRLTIEQKTLETRIENE 833 +Q + H + E +A + + E+ + +E+ +L + E + Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK-- 303 Query: 834 RRHLQECIDQLSALSQQRQQAETLLQ 859 L++ D + L+ + + E Q Sbjct: 304 ---LRQTTDNIGLLTLELAKNEERQQ 326 Score = 33.6 bits (77), Expect = 0.005 Identities = 26/180 (14%), Positives = 71/180 (39%), Gaps = 13/180 (7%) Query: 844 LSALSQQRQQAETLLQQQIQQRRALFGEDIVAE-------VRQRLRLQQQQAELAQQNAE 896 + L Q R Q + + + ++ + +R +++Q + Q + Sbjct: 145 QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204 Query: 897 K--ALQQAQSQLNRLSGELTGLEQQCQQYQQRATTTQAEL-QQALSTSEFADETALTAAL 953 K L + +++ + + E + + R + L +QA++ ++ Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 954 LSE--EERQHLQQLQQQLNERRQQAQIRLQQAR-EILDQHLQLCPQGVDKSSELTLLQQQ 1010 ++E + L+Q++ ++ +++ Q+ Q + EILD+ Q + EL +++ Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.3 bits (63), Expect = 0.045 Identities = 11/37 (29%), Positives = 21/37 (56%) Query: 218 DVIAEQAMNNYERRFAKSLAHVINLFDPDVVVLGGGM 254 D + E+A +N +R F+ + + LF+P +VV + Sbjct: 351 DSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAV 387
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.014 Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 12/68 (17%) Query: 7 MVGARGAGKTTIGKALAQALGYRFVDTDL-------FMQQTSQMTVAEVVESEGWDGFRL 59 + G G GK+T+ L F DT +Q + + E+ E FR Sbjct: 601 LEGTGGIGKSTLINTLVGL--DFFSDTHFDIGTGKDSYEQIAGIVAYELSE---MTAFRR 655 Query: 60 RESMALQA 67 ++ A++A Sbjct: 656 ADAEAVKA 663
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 39.8 bits (93), Expect = 1e-05 Identities = 32/146 (21%), Positives = 57/146 (39%), Gaps = 21/146 (14%) Query: 119 DTMNALLDNRI---------VPVINENDAVATAEIKVGDNDNLSALAAILASADKLLLLT 169 +T+ L++ + VPVI E+ + E V D D A +AD ++LT Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235 Query: 170 DQAGLYTADPRNNPEAELIREVHGIDDVLRGMAGDSVSGLGTGGMATKLQAA-DVACRAG 228 D G + + +REV ++++ + G M K+ AA G Sbjct: 236 DVNGAALY--YGTEKEQWLREVK-VEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289 Query: 229 IDVVIAAGSQVGVIADVIDGTPVGTR 254 +IA + + ++G GT+ Sbjct: 290 ERAIIAHLEK---AVEALEGK-TGTQ 311 Score = 30.6 bits (69), Expect = 0.009 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 13/76 (17%) Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQQ----HAKGHRIVIVTSG-------- 51 + +V+ LG + L ++ + +++ VR+ A+Q A+G+ +VI Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 52 -AIAAGREHLGYPELP 66 + AG+ G P P Sbjct: 62 LHMDAGQATYGIPAQP 77
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 45.1 bits (106), Expect = 6e-07 Identities = 29/123 (23%), Positives = 45/123 (36%), Gaps = 23/123 (18%) Query: 366 PVAQITAPSSVQDNETITLSASAST---GQIASYQWEFQHFEPKVATTQNVTVRAVATQQ 422 P A I + SSV E I + S G+I +Y+W+F E + Sbjct: 775 PKAVIKSDSSVIVEEEINFDGTESKDEDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYE 834 Query: 423 PLAGKVTLTVTNNQGVQSRAEKTINIL------------PSGGIEQEHPLWDRNKVTTYG 470 V LTVT+N G + K I ++ P+ E+ + + K Sbjct: 835 -----VKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQI---AKSNMLV 886 Query: 471 EGT 473 +GT Sbjct: 887 KGT 889
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 58.0 bits (140), Expect = 3e-13 Identities = 25/83 (30%), Positives = 45/83 (54%) Query: 147 QGRISPGEVDEVQLTLLMDIAKVTKISLRAALHRHLVEGATEEWVCSVYKMNQEDFWQNM 206 + + PG + E+ LL+ I+ + + A+ +LV G + + VC Y+MN F + Sbjct: 20 ESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTL 79 Query: 207 RKLHRLNERVVQLLPFYTRQTSS 229 +L RLN +L P+YT ++S+ Sbjct: 80 GRLIRLNALAARLAPYYTDESSA 102
>PF05860#haemagglutination activity domain. Length = 117 Score = 67.9 bits (166), Expect = 2e-15 Identities = 27/115 (23%), Positives = 51/115 (44%), Gaps = 6/115 (5%) Query: 67 VLAHPVLPVNGHVVIGQGMLDQQSSTLTVTQQTDKLAINWDSFDIAHGHSVIYAQPGSQS 126 + LP+N ++ TQ L ++ F + + + P + Sbjct: 3 ITPDTTLPINSNITTEGN----TRIIERGTQAGSNLFHSFQEFSVPTSGTAFFNNPTNIQ 58 Query: 127 IALNQVQGQSASQIYGRLQANG--QVFLLNPRGILFGKEAQVNVGGLVASTKYMS 179 +++V G S S I G ++AN +FL+NP GI+FG+ A++++GG + Sbjct: 59 NIISRVTGGSVSNIDGLIRANATANLFLINPNGIIFGQNARLDIGGSFVGSTANR 113
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 4/42 (9%) Query: 31 VISIIGRSGSGKSTLLRCMNGLEDYQDGSIKLGGMTVTNRDS 72 + + G G GKSTL+ + GL+ + D +G T +DS Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG----TGKDS 635
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.0 bits (109), Expect = 1e-07 Identities = 34/163 (20%), Positives = 65/163 (39%), Gaps = 5/163 (3%) Query: 35 LETIATNFSLSVNQAGFIVTAAQLGYAVGLMFLVPLGDMFE-RRGLIVGMTLLAAGGMLI 93 L IA +F+ ++ TA L +++G L D +R L+ G+ + G +I Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGS-VI 95 Query: 94 TAMSQNLTMMIIGTALTGLFSVVA--QLLVPLAATLAAPEKRGKVVGIIMSGLLLGILLA 151 + + ++I A L++ + A E RGK G+I S + +G + Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155 Query: 152 RTVAGALATLGGWRTIYWVASALMFIMALVLWRCLPRYKQHTG 194 + G +A W + + + I L + L + + G Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITI-ITVPFLMKLLKKEVRIKG 197
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.017 Identities = 23/105 (21%), Positives = 37/105 (35%), Gaps = 12/105 (11%) Query: 12 LNSRAKRQKDFPYQEILLTRLSMHMHSKLLENRNKMLKAQGINETLFMALITLDAQESRS 71 + + P QE+ L + L R A+G + + T Sbjct: 745 PSPEDEEIYFRPEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTF------- 797 Query: 72 IQPSELSAALG-----SSRTNATRIADELEKKGWIERRESHNDRR 111 + ++L ALG SS ++ D L + GW RE+ RR Sbjct: 798 VTIADLVQALGADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 67.9 bits (166), Expect = 1e-14 Identities = 63/410 (15%), Positives = 118/410 (28%), Gaps = 99/410 (24%) Query: 25 LLLTAIFIMIGVAYLIYWFLVLRHHQ---ETDNAYISGNQVQIMSQVPGSVVSVHFENTD 81 L A FIM + ++ + SG +I V + + + Sbjct: 57 PRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116 Query: 82 FVKSGDVLVTLDPTD-------AEQAFEQAK----------------------------- 105 V+ GDVL+ L + + QA+ Sbjct: 117 SVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYF 176 Query: 106 ----------------TALANSVRQTHQLIINSKQYQ-------ANIALKKTELSQAQND 142 + Q +Q +N + + A I + ++ Sbjct: 177 QNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR 236 Query: 143 LKRRVVLGAAAVIGREELQHARDAVEAAQASLDMAVQQYNANQALVLNTPLE-------- 194 L L I + + + A L + Q ++ +L+ E Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296 Query: 195 -KQPAIEQAAAKMRDAWLT---------LQRTKVVSPISGYVSRRSVQ-VGAEISSGTPL 243 + + LT Q + + +P+S V + V G +++ L Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356 Query: 244 MAVVPADQ-LWIDANFKETQLVNMRIGQPATI-VTDF----YGDDVVYQGKVVGLDMGTG 297 M +VP D L + A + + + +GQ A I V F YG GKV + Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY---LVGKVKNI----- 408 Query: 298 SAFSLLPAQNATGNWIKVVQRLPVRIALDEKQLKEHPLRIGLSSLVKVDT 347 + ++ G V+ + K PL G++ ++ T Sbjct: 409 NLDAIE--DQRLGLVFNVIISIEENCLSTG--NKNIPLSSGMAVTAEIKT 454
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 32.2 bits (73), Expect = 0.003 Identities = 15/47 (31%), Positives = 22/47 (46%), Gaps = 5/47 (10%) Query: 27 AAERVISL-----SPSTTELAYAAGLGDKLVAVSAYSDYPESAKKLE 68 + ER + + ELA GDK ++ +Y DY E K+LE Sbjct: 468 SVERSVLITQQHWDTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLE 514
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 414 bits (1066), Expect = e-148 Identities = 157/305 (51%), Positives = 199/305 (65%), Gaps = 20/305 (6%) Query: 43 RRRLLMALTLSPLLLSLPSLVAAAPKSDQPLLNIDRVIDIQRDIDTKRVVALEWLPVELL 102 RRRLL A+ LSPLL + + AAA ID R+VALEWLPVELL Sbjct: 9 RRRLLTAMALSPLLWQMNTAHAAA-------------------IDPNRIVALEWLPVELL 49 Query: 103 LALGVTPFGVADIHNYRLWVGEPALPADVINVGQRTEPNLELLQQMAPSLILLSQGYGPS 162 LALG+ P+GVAD NYRLWV EP LP VI+VG RTEPNLELL +M PS ++ S GYGPS Sbjct: 50 LALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS 109 Query: 163 PEKLAPIAPTMSFAFNEQGSSPLAVGKNSLQTLGQRLGLETAAQQHLADFDHFMLAARAR 222 PE LA IAP F F++ G PLA+ + SL + L L++AA+ HLA ++ F+ + + R Sbjct: 110 PEMLARIAPGRGFNFSD-GKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPR 168 Query: 223 LSGDTQTPLLMFSLLDPRHALIIGNGSLFQDVLSTLNIENAWQGETNFWGSAVVGIERLA 282 PLL+ +L+DPRH L+ G SLFQ++L I NAWQGETNFWGS V I+RLA Sbjct: 169 FVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLA 228 Query: 283 TIKTARAVCFGHGNNEMLQQVARTPLWQSLSFVRENQLRLLPPVWFYGATLSAMRFVRLL 342 K +CF H N++ + + TPLWQ++ FVR + + +P VWFYGATLSAM FVR+L Sbjct: 229 AYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVL 288 Query: 343 EQAWG 347 + A G Sbjct: 289 DNAIG 293
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 41.0 bits (96), Expect = 3e-07 Identities = 21/66 (31%), Positives = 38/66 (57%) Query: 10 QKGFTLIELMVAVAIIAVLSGIGIPSYQRYIQKAALTDMLQAIVPYKMAVELCALEQSNL 69 Q+GFTL+E+MV + II VL+ + +P+ +KA + IV + A+++ L+ + Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66 Query: 70 DSCNAG 75 + N G Sbjct: 67 PTTNQG 72
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 266 bits (682), Expect = 2e-88 Identities = 103/376 (27%), Positives = 195/376 (51%), Gaps = 6/376 (1%) Query: 2 LLATERNSVYEHIIQHGLQPLGVKGGRRLSARYWQGERLVAMTRQLATLLQAGLPLVNSL 61 L+ + + G L ++ RLS L +TRQLATL+ A +PL +L Sbjct: 37 LVPLSVDENRGDQQKSGSTGLSLRRKIRLSTS-----DLALLTRQLATLVAASMPLEEAL 91 Query: 62 QLLAKEADDSAWRCLLDEISQQVAQGQSLSEVMEQYPHVFPRLYPPVVAVGELTGNLEQC 121 +AK+++ L+ + +V +G SL++ M+ +P F RLY +VA GE +G+L+ Sbjct: 92 DAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAV 151 Query: 122 CTQLVHHQERQQNLHKKVIKALKYPVVVCIVALVVSVIMLVMVLPEFAQIYQSFDTPLPG 181 +L + E++Q + ++ +A+ YP V+ +VA+ V I+L +V+P+ + + LP Sbjct: 152 LNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPL 211 Query: 182 LTASLLWLSTFLTFYGPYLALIIAIVCIGYFYTLRKKSRWQQWEQTILLSIPLVSTLIRG 241 T L+ +S + +GP++ L + + + LR++ R + + LL +PL+ + RG Sbjct: 212 STRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVSFHRR-LLHLPLIGRIARG 270 Query: 242 SCLSQIFQTLAITQQAGLPLSAGLDAAARSIHNYNYQQALRCIQKQISQGIPLYTTLNQH 301 ++ +TL+I + +PL + + + N + L + +G+ L+ L Q Sbjct: 271 LNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKALEQT 330 Query: 302 PLFPAICQQLIRVGEESGSLDVLLEKLACWHQQQTQNLADNVTQMLEPLLMLIIGSIVGV 361 LFP + + +I GE SG LD +LE+ A ++ + + EPLL++ + ++V Sbjct: 331 ALFPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLF 390 Query: 362 LVIAMYLPIFQLGDVI 377 +V+A+ PI QL ++ Sbjct: 391 IVLAILQPILQLNTLM 406
>SECA#SecA protein signature. Length = 901 Score = 1373 bits (3556), Expect = 0.0 Identities = 805/904 (89%), Positives = 852/904 (94%), Gaps = 3/904 (0%) Query: 1 MLIKLLTKVFGSRNDRTLRRMQKVVDVINRMEPDIEKLTDTELRAKTDEFRERLAKGEVL 60 MLIKLLTKVFGSRNDRTLRRM+KVV++IN MEP++EKL+D EL+ KT EFR RL KGEVL Sbjct: 1 MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVL 60 Query: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA Sbjct: 61 ENLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNA 120 Query: 121 LSGRGVHVVTVNDYLAQRDAENNRPLFEFLGLSIGINLPNMTAPAKRAAYAADITYGTNN 180 L+G+GVHVVTVNDYLAQRDAENNRPLFEFLGL++GINLP M APAKR AYAADITYGTNN Sbjct: 121 LTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNN 180 Query: 181 EFGFDYLRDNMAFSPEERVQRQLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYIRVN 240 E+GFDYLRDNMAFSPEERVQR+LHYALVDEVDSILIDEARTPLIISGPAEDSSEMY RVN Sbjct: 181 EYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN 240 Query: 241 KLIPKLIRQEKEDSDSFQGEGHFSVDEKSRQVHLTERGLILIEQMLVEAGIMDEGESLYS 300 K+IP LIRQEKEDS++FQGEGHFSVDEKSRQV+LTERGL+LIE++LV+ GIMDEGESLYS Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300 Query: 301 PANIMLMHHVTAALRAHVLFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360 PANIMLMHHVTAALRAH LFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK 360 Query: 361 EGVEIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTIVVPTNRPMIR 420 EGV+IQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDT+VVPTNRPMIR Sbjct: 361 EGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIR 420 Query: 421 KDLADLVYMTEQEKIGAIIEDIRERTANGQPVLVGTISIEKSEVVSAELTKAGIEHKVLN 480 KDL DLVYMTE EKI AIIEDI+ERTA GQPVLVGTISIEKSE+VS ELTKAGI+H VLN Sbjct: 421 KDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN 480 Query: 481 AKFHAMEAEIVSQAGQPGAVTIATNMAGRGTDIVLGGSWQSEIAALEDPTEEQIAAIKAA 540 AKFHA EA IV+QAG P AVTIATNMAGRGTDIVLGGSWQ+E+AALE+PT EQI IKA Sbjct: 481 
AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKAD 540 Query: 541 WQIRHDAVLASGGLHIIGTERHESRRIDNQLRGRAGRQGDAGSSRFYLSMEDALMRIFAS 600 WQ+RHDAVL +GGLHIIGTERHESRRIDNQLRGR+GRQGDAGSSRFYLSMEDALMRIFAS Sbjct: 541 WQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFAS 600 Query: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY Sbjct: 601 DRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRAIY 660 Query: 661 SQRNELLDVSDVSETINSIREDVFKTTIDSYIPTQSLEEMWDIEGLEQRLKNDFDLDMPI 720 SQRNELLDVSDVSETINSIREDVFK TID+YIP QSLEEMWDI GL++RLKNDFDLD+PI Sbjct: 661 SQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI 720 Query: 721 AKWLEDEPQLHEETLRERILQQAIETYQRKEEVVGIEMMRNFEKGVMLQTLDSLWKEHLA 780 A+WL+ EP+LHEETLRERIL Q+IE YQRKEEVVG EMMR+FEKGVMLQTLDSLWKEHLA Sbjct: 721 AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLA 780 Query: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFAMFAAMLESLKYEVISVLSKVQVRMPEEVEAL 840 AMDYLRQGIHLRGYAQKDPKQEYKRESF+MFAAMLESLKYEVIS LSKVQVRMPEEVE L Sbjct: 781 AMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEEL 840 Query: 841 EVQRREEAERLARQQQLSHQTDNSALMSEEEVKVANSLERKVGRNDPCPCGSGKKYKQCH 900 E QRR EAERLA+ QQLSHQ D+SA + + ERKVGRNDPCPCGSGKKYKQCH Sbjct: 841 EQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTG---ERKVGRNDPCPCGSGKKYKQCH 897 Query: 901 GRLQ 904 GRLQ Sbjct: 898 GRLQ 901
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 53.2 bits (128), Expect = 7e-10 Identities = 47/201 (23%), Positives = 72/201 (35%), Gaps = 18/201 (8%) Query: 171 IVKAVERCGLKVDQLIFAGLAASYAVLTEDERELGVCVVDIGGGTMDMAVYTGGALRHTK 230 I ++ + G + LI +AA+ G VVDIGGGT ++AV + + ++ Sbjct: 126 IRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSS 185 Query: 231 VIPYAGNVVTSDI------AYAFGTPPTDAEAIKVRHGCALGSIVSKDESVEVPSVGGRP 284 + G+ I Y AE IK G A + V V GR Sbjct: 186 SVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAY-----PGDEVREIEVRGRN 240 Query: 285 -----PRSLQRQTLAEVIEPRYTELLNLVNDEILQLQEQLRQQGVKHHLAAGIVLTGGAA 339 PR E++E E L + ++ EQ + G+VLTGG A Sbjct: 241 LAEGVPRGF-TLNSNEILEA-LQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGA 298 Query: 340 QIDGLAECAQRVFHAQVRIGQ 360 + L V + + Sbjct: 299 LLRNLDRLLMEETGIPVVVAE 319
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 38.2 bits (88), Expect = 1e-04 Identities = 52/236 (22%), Positives = 89/236 (37%), Gaps = 8/236 (3%) Query: 545 TGMSVSATGISVSTTGTSLSVTGMSTSVTGVSVGFTLIDTS--FTGVSTSFTGVGTSFTG 602 +G + I ++T G++LS T S + G T D+S G ++ T S Sbjct: 150 SGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLV 209 Query: 603 ASNSLTGVSNSMTGCSSSFTGTSNSMTGSSHSMTGMSTSITGHSMSQ-TGSSSSITGDST 661 A T + + + + T M GS + ST G S G S+ T Sbjct: 210 AGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 269 Query: 662 SFTGSSVSSTGSSVSTTGVSTSTTGSSTSTTGCSVSTTGSSTSTTGNSVSMTG----NST 717 S + ST ++ + ++ + T+ S+ ST T G + T T Sbjct: 270 SSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQT 329 Query: 718 STTGCSISTTGSSIGTVGSSISTTGSSVSTTGSSISTTGLSVSYTGAQYSDVGVDL 773 + G ++ S GT G S+ + +T ++ + L+ Y Q + G DL Sbjct: 330 AQKGSDLTAGYGSTGTAGDD-SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384 Score = 34.3 bits (78), Expect = 0.002 Identities = 39/147 (26%), Positives = 69/147 (46%), Gaps = 14/147 (9%) Query: 630 GSSHSMTGMSTSITGHSMSQT-GSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSS 688 GS+ + S+ I G+ +QT G S++T + + + S + T STST G++ Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTLTA---GYGSTQTAQNESDLITGYGSTSTAGAN 541 Query: 689 TSTTGCSVSTTGSSTSTTGNSVSMTGNSTSTT---GCSISTTGSSIGTVGSS---ISTTG 742 +S ++ GS+ + + NSV G ++ T G ++ S GT GS I+ G Sbjct: 542 SSL----IAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYG 597 Query: 743 SSVSTTGSSISTTGLSVSYTGAQYSDV 769 S+ + + S T G + T + S + Sbjct: 598 STQTASYHSSLTAGYGSTQTAREQSVL 624 Score = 34.0 bits (77), Expect = 0.003 Identities = 31/143 (21%), Positives = 63/143 (44%), Gaps = 6/143 (4%) Query: 630 GSSHSMTGMSTSITGHSMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSST 689 GS+ + S+ I G+ +QT +SI T+ GS+ ++ S T G +++T + Sbjct: 629 GSTSTAGADSSLIAGYGSTQTAGYNSIL---TAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685 Query: 690 STTGCSVSTTGSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSS---ISTTGSSVS 746 S+ +T ++ + + T+ G +++ S T G+ I+ GS+ + Sbjct: 686 
SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745 Query: 747 TTGSSISTTGLSVSYTGAQYSDV 769 + S T G + T + S + Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVL 768 Score = 32.0 bits (72), Expect = 0.012 Identities = 47/194 (24%), Positives = 78/194 (40%), Gaps = 17/194 (8%) Query: 590 STSFTGVGTSFT---GASNSLTGVSNSMTGCSSSFTGTSNSMTGSSHSMTGM----STSI 642 ST G +S G++ + S G S+ T S + + TG S+ I Sbjct: 294 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 353 Query: 643 TGHSMSQT-GSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSSTSTTGC--SVSTT 699 G+ +QT G SS+T + + + GS ++ ST T G+ +S S T Sbjct: 354 AGYGSTQTAGEDSSLTA---GYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTA 410 Query: 700 GSSTSTTGNSVSMTGNSTSTTGCSISTTGSSIGTVGSSISTTGSSVSTTGSSISTTGLSV 759 G ++ T S T+ G ++ S GT G S+ + +T ++ + L+ Sbjct: 411 GEESTQTAGYGS---TQTAQKGSDLTAGYGSTGTAGDD-SSLIAGYGSTQTAGEDSSLTA 466 Query: 760 SYTGAQYSDVGVDL 773 Y Q + G DL Sbjct: 467 GYGSTQTAQKGSDL 480 Score = 30.9 bits (69), Expect = 0.022 Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 15/187 (8%) Query: 590 STSFTGVGTSFTGASNSLTGVSNSMTGCSSSFTGTSNSMTGSSHSMTGM----STSITGH 645 S+ G G++ T NS+ G S+ T S S + T S+ I G+ Sbjct: 686 SSLIAGYGSTQTAGYNSIL-----TAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGY 740 Query: 646 SMSQTGSSSSITGDSTSFTGSSVSSTGSSVSTTGVSTSTTGSSTSTTGCSVSTTGSSTST 705 +QT S S T+ GS+ ++ SV TTG +++T + S+ +T ++ Sbjct: 741 GSTQTASYHSSL---TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYH 797 Query: 706 TGNSVSMTGNSTSTTGCSISTTGSSIGTVG---SSISTTGSSVSTTGSSISTTGLSVSYT 762 + + T+ ++T S T G S I+ GS+ + +SI T G + T Sbjct: 798 SILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQT 857 Query: 763 GAQYSDV 769 + SD+ Sbjct: 858 AQENSDL 864
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.5 bits (68), Expect = 0.026 Identities = 35/109 (32%), Positives = 47/109 (43%), Gaps = 22/109 (20%) Query: 102 SPQWHSRVVLPKGSRVTLSDSSLNNRLANFSTGRTLKIQPLVIENAECAST-PPAYLPLS 160 +PQ + + +G+RVT+S SL+ N VIE A PP PLS Sbjct: 309 APQLGAAIRAGRGARVTVSGGSLSAPHGN------------VIETGGGARRFPPPASPLS 356 Query: 161 VASQLQAGQAHLRLRLTTQGVASLSELDFAPMNLTLAGGIIQSNQLITT 209 + LQAG QG A L + P+ LTLAGG ++ T Sbjct: 357 I--TLQAGA-------RAQGRALLYRVLPEPVKLTLAGGAQGQGDIVAT 396
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 143 bits (363), Expect = 4e-40 Identities = 81/387 (20%), Positives = 149/387 (38%), Gaps = 84/387 (21%) Query: 5 IGIDLGTTNSCVAIMDGTKARVLENSEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + + VL PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEAQRDKDIMPYKIIAADNGDAWLEVKGQKMAPPQISAE 118 P N + AI+ + +D I + + + Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQH 93 Query: 119 VLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALA 178 +K++ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 94 FIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIG 150 Query: 179 YGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRL 236 GL + G+ V D+GGGT ++++I ++ V + +GG+ FD + Sbjct: 151 AGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAI 198 Query: 237 INYLVEEFKKDQGMDLRTDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADGSG 292 INY+ + G + AE+ K E+ SA + ++ + Sbjct: 199 INYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGV 245 Query: 293 PKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIQD--VILVGGQTRMPMV 349 P+ + + LE+L E + + + VAL+ SDI + ++L GG + + Sbjct: 246 PRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303 Query: 350 QKKVADFFGKEPRKDVNPDEAVAIGAA 376 + + + G +P VA G Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGG 330
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 28.8 bits (64), Expect = 0.039 Identities = 20/51 (39%), Positives = 30/51 (58%), Gaps = 2/51 (3%) Query: 60 TRNIDVSVGSI-TGLCAVTVGMALNAGFGLVASCLFALLVGMVAGFFNGIL 109 T ID S+ +I T L +V+ G++ A LV + + + LVG V G +GIL Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPV-SALVGAVTGIISGIL 410
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 582 bits (1501), Expect = 0.0 Identities = 194/599 (32%), Positives = 310/599 (51%), Gaps = 34/599 (5%) Query: 116 PTLLRARSVSPGTACGKLLSLIRADLNA--LGDLPVAQGIEREQQMLADGVAQLGKAWES 173 + + S G A K + +++ V+ IE+ L +L Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAI--- 58 Query: 174 LLVANSSTAANSSTTENSSTTENNSTTENNSTTENNSTTRAIREVHRSLLRDGTFRQRLL 233 ++ + + I H +L D + Sbjct: 59 ---------------------------KDQTEASMGADKAEIFAAHLLVLDDPELVDGIK 91 Query: 234 SHIIAGESCATAIVATAA-YFSQQLALAANTYLRERELDIRDVSFQLLQQIYGEQRFPSQ 292 I + A + + F N Y++ER DIRDVS ++L + G + S Sbjct: 92 GKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAADIRDVSKRVLGHLIGVET-GSL 150 Query: 293 QALSEDSLCIADELTPSQFLALDKRYLKGLLLGRGGSTSHTVILARSFNIPTLVGVDATA 352 ++E+++ IA++LTPS L+K+++KG GG TSH+ I++RS IP +VG Sbjct: 151 ATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSRSLEIPAVVGTKEVT 210 Query: 353 LQPYLNQSLQIDGELGLVVCLLDEPVRRYYRQEQWLHDQLREQQSRYQNMPGRTLDGVRM 412 + + +DG G+V+ E + Y +++ ++ +++ ++ P T DG + Sbjct: 211 EKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKDGAHV 270 Query: 413 VVAANITHAVEVEGAFNQGAESIGLFRTEMLYMDRAAAPSEEELYTLYAQALGAAKGKPM 472 +AANI +V+G G E IGL+RTE LYMDR P+EEE + Y + + GKP+ Sbjct: 271 ELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPV 330 Query: 473 IIRTIDIGGDKPVSYLNIPAESNPFLGYRAVRIYHEFLSLFHTQLRAILRASMHGPLKIM 532 +IRT+DIGGDK +SYL +P E NPFLG+RA+R+ E +F TQLRA+LRAS +G LK+M Sbjct: 331 VIRTLDIGGDKELSYLQLPKELNPFLGFRAIRLCLEKQDIFRTQLRALLRASTYGNLKVM 390 Query: 533 IPMISSMEEILWVKDQLAEVKQSLRINHLQFDETVPLGMMLEVPSVMFIIDQCCEEMDFL 592 PMI+++EE+ K + E K L + +++ +G+M+E+PS + +E+DF Sbjct: 391 FPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFF 450 Query: 593 SIGSNDLTQYLLAVDRDNAKVSEHYHCLSPALLRALDYAVCEVHRHGKWIGLCGELAAKD 652 SIG+NDL QY +A DR N +VS Y PA+LR +D + H GKW+G+CGE+A + Sbjct: 451 SIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDE 510 Query: 653 
SVLPLLVAMGLDEISMSASFIGATKARLAKLDRGECRLLLNRAMACRTSREVEHLLVQY 711 +PLL+ +GLDE SMSA+ I +++L KL + E + +A+ T+ EVE L+ + Sbjct: 511 VAIPLLLGLGLDEFSMSATSILPARSQLLKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 33.9 bits (77), Expect = 0.003 Identities = 28/196 (14%), Positives = 57/196 (29%), Gaps = 15/196 (7%) Query: 707 QTDLSDDLQALAAKETRLAEIASMLEEILESLTEEEKEQDTVKESQDGFANAELSKAAKA 766 T S ++ L A++ LA + LE+ LE ++ + A ++ A+ Sbjct: 136 STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL 195 Query: 767 FLKEQKDSKVKFAEDSYEAKIIRANKLIDEEKALKKTVKDAATALHLKTKTTIETFTDEQ 826 + A+ + + + KA + + A I+T E Sbjct: 196 EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE- 254 Query: 827 VSNLLHLKWIAPLSTELAAMPSTVISQLTSQVQALADKYAVTYSQVANEIKSTEQELAQM 886 A L A + + + + + E + E E A + Sbjct: 255 ---------KAALEARQAE-----LEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADL 300 Query: 887 MSELTGNEFDMQGLAE 902 + + Q L Sbjct: 301 EHQSQVLNANRQSLRR 316
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.3 bits (63), Expect = 0.042 Identities = 10/18 (55%), Positives = 14/18 (77%) Query: 33 VALVGESGSGKSITARAL 50 + + GESG+GK + ARAL Sbjct: 163 LMITGESGTGKELVARAL 180
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.034 Identities = 13/47 (27%), Positives = 17/47 (36%), Gaps = 8/47 (17%) Query: 40 CVVLHGHSGSGKSTLLRSLYANYLPDSGHI--------WIKHQGEWI 78 VVL G G GKSTL+ +L H + + G Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVA 644
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.015 Identities = 25/87 (28%), Positives = 38/87 (43%), Gaps = 18/87 (20%) Query: 289 LASLGVLDILSSD--------------YYPASLMDAAF-RIAHDE--SNRFSLPQAVNLV 331 L +G I+SSD + A M R+ + ++ F + + + Sbjct: 350 LHDIGAFSIISSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKY 409 Query: 332 TRNPARALGLNDR-GVIAEGKRADLIL 357 T NPA A GL+ G + GKRADL+L Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVL 436
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.9 bits (88), Expect = 8e-05 Identities = 25/177 (14%), Positives = 54/177 (30%), Gaps = 19/177 (10%) Query: 72 DLRQAIADIEAARAQYGVQRAAQLPTVNAGVNGSRGRGLSDTSDGNNNTAISQSYGAQAS 131 A AD ++ R Q R + LS + + N + + Sbjct: 128 TALGAEADTLKTQSSLLQARLEQT----------RYQILSRSIELNKLPELKLP--DEPY 175 Query: 132 VSAFELDLFGKKSSLSHAEFETYLATEEAAKTTRITLIADTATAWVTLAADQNQLLLAEE 191 + + +SL +F T+ + + A+ T + +N + + Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 192 TLKSAEQSLKLAQLRQKNGIASRIDVAAMETLYQSARADVAQYKTTVAQDKNALDLL 248 L + L ++ V E Y A ++ YK+ + Q ++ + Sbjct: 236 RLDD------FSSLL-HKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1152 bits (2982), Expect = 0.0 Identities = 586/1033 (56%), Positives = 763/1033 (73%), Gaps = 5/1033 (0%) Query: 3 ARFFIYRPVFAWVIAIVIMLGGVVALETLPIAQYPDVAPPSISIKATYTGASAETLENSV 62 A FFI RP+FAWV+AI++M+ G +A+ LP+AQYP +APP++S+ A Y GA A+T++++V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 63 TQVIEQELTGLDGLLYFSSSSGSDGNAKIVATFKQGTNADTAQVQVQNKVQQALTRLPTE 122 TQVIEQ + G+D L+Y SS+S S G+ I TF+ GT+ D AQVQVQNK+Q A LP E Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 VQAQGVTVTKSQTNFLLIMSLYDEKDKHTGTDIADYLVSNLQDPLARLEGVGSVQVFGSQ 182 VQ QG++V KS +++L++ + T DI+DY+ SN++D L+RL GVG VQ+FG+Q Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181 Query: 183 YAMRIWLNPTKLAAYNLMPSDVQSAITAQNTQVSAGKIGALPSGKEQQLTATVMAQSRLK 242 YAMRIWL+ L Y L P DV + + QN Q++AG++G P+ QQL A+++AQ+R K Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 243 TPEQFNNIIVKSDSTGAVVRLRDVARVELGNEDYSVTTRLNGHPAAGIAVMLAPGANALA 302 PE+F + ++ +S G+VVRL+DVARVELG E+Y+V R+NG PAAG+ + LA GANAL Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 303 TAERVKAKAAEFELNLPDGYKIAYPKDSTDFIKVSVEEVVKTLIEAILLVVIVMYIFLQN 362 TA+ +KAK AE + P G K+ YP D+T F+++S+ EVVKTL EAI+LV +VMY+FLQN Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 363 IRATLIPAIAVPVVLLGTFGVLAIFGYSINTLTLFGMVLSIGLLVDDAIVVVENVERVMR 422 +RATLIP IAVPVVLLGTF +LA FGYSINTLT+FGMVL+IGLLVDDAIVVVENVERVM Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 423 EDNLPPREATEKSMSEIASALIGIALVLSAVFLPMAFFGGATGVIYRQFSITIVSAMALS 482 ED LPP+EATEKSMS+I AL+GIA+VLSAVF+PMAFFGG+TG IYRQFSITIVSAMALS Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 483 VLVALTLTPALCATFLKPNHKPPSEH--GFFGGFNRRYDRMQTRYESLVGHVIHRSLRYL 540 VLVAL LTPALCAT LKP E+ GFFG FN +D Y + VG ++ + RYL Sbjct: 482 
VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 541 LIYAVLIGVMCVLFIRLPTGFLPTEDQGDVMVQYTLPAGATSGRTMEVSKAVENYFMTQE 600 LIYA+++ M VLF+RLP+ FLP EDQG + LPAGAT RT +V V +Y++ E Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 601 KDNTKAVFTISGFGFSGSGQNAGMAFIALKHWRDRPGSENTATAIADRAMKALSSIRDAQ 660 K N ++VFT++GF FSG QNAGMAF++LK W +R G EN+A A+ RA L IRD Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 661 IFSMTPPAVDGLGQSNGFTFELQATGDTSREQLLTLRDQLISKANKDPI-LASVRANTLQ 719 + PA+ LG + GF FEL + L R+QL+ A + P L SVR N L+ Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 720 QMPQLQVDIDNDKAAALGLSISDVNATLSAAWGGTYINDFIDRGRVKKVYMQGDVDTRSK 779 Q ++++D +KA ALG+S+SD+N T+S A GGTY+NDFIDRGRVKK+Y+Q D R Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 780 PEDLNQWFVRGSSDAMTSFSAFATTRWIYGPETLSRYNGQTSYEIQGQAASGSSSGTAMD 839 PED+++ +VR ++ M FSAF T+ W+YG L RYNG S EIQG+AA G+SSG AM Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 840 QMEKLAAELP-GTSYAWSGLSYQERLASGQALSLYAISILVVFLCLAALYESWSVPFSVM 898 ME LA++LP G Y W+G+SYQERL+ QA +L AIS +VVFLCLAALYESWS+P SVM Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901 Query: 899 MVIPLGIIGAVAAATLRGLENDIYFQVALLTTLGLASKNAILIVEFAEAAYLR-GEPLVV 957 +V+PLGI+G + AATL +ND+YF V LLTT+GL++KNAILIVEFA+ + G+ +V Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961 Query: 958 AALQGAATRLRPILMTSLAFIAGVMPLAMSTGAGANSRISIGSGIIGGTLTATVLAVFFV 1017 A L RLRPILMTSLAFI GV+PLA+S GAG+ ++ ++G G++GG ++AT+LA+FFV Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021 Query: 1018 PLFFVLIRRVFSG 1030 P+FFV+IRR F G Sbjct: 1022 PVFFVVIRRCFKG 1034
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.9 bits (101), Expect = 1e-06 Identities = 21/103 (20%), Positives = 38/103 (36%), Gaps = 10/103 (9%) Query: 58 ASYQAAYDTAKAALQNVQVSVKSAKLKAQRYAALAKENGVSQQDADDAQTSYQQALANVA 117 K+ L+ ++ + SAK + Q L K + +Q N+ Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN---------EILDKLRQTTDNIG 312 Query: 118 EKTAALETARINLAYTQVRAPISGRI-GISSVTPGALVTANQT 159 T L + +RAP+S ++ + T G +VT +T Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Score = 42.5 bits (100), Expect = 2e-06 Identities = 36/210 (17%), Positives = 75/210 (35%), Gaps = 16/210 (7%) Query: 19 TVAAMTSEVRPQVDGIIKKRLFTEGSEVTAGQVLYQIDPASYQAAYDTAKAALQNVQVSV 78 T + + E++P + I+K+ + EG V G VL ++ A+A Q S+ Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADTLKTQSSL 143 Query: 79 KSAKLKAQRYAALAKENGVSQQDADDAQTSYQQALANVAEKTAALETARINLAYTQVRAP 138 A+L+ RY L++ +++ + NV+E+ T+ I ++ + Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKL--PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ 201 Query: 139 -ISGRIGISSVTPGALVTANQTTALATIRNLDPIYVDLTQSSAQLLALRKQQQAGNDTVA 197 + + A + T LA I + + +L +Q V Sbjct: 202 KYQKELNL------DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL 255 Query: 198 NAPVQLTLEDGSVYAHEGSLQLTEVAVDEA 227 + + ++ L+ E + A Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSA 285
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 70.7 bits (173), Expect = 1e-14 Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 18/133 (13%) Query: 398 IMGHVDHGKTSLLDYI-----RSTKVASGEAG-------------GITQHIGAYHVETEN 439 ++ HVD GKT+L + + T++ S + G GIT G + EN Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67 Query: 440 GMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIQHAKAANVPVVVAV 499 + +DTPGH F + R D +L+++A DGV QT + +P + + Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127 Query: 500 NKIDKPEADPDRV 512 NKID+ D V Sbjct: 128 NKIDQNGIDLSTV 140
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 162 bits (412), Expect = 2e-52 Identities = 74/207 (35%), Positives = 115/207 (55%), Gaps = 5/207 (2%) Query: 3 TPSFDSVEAQASYGIGLQIGQQLQESGLQGLLPEALLAGLRDAMEGN----TPTVPVDVI 58 S + + + SY IG +G+ + G+ + P+ L G++D M G T DV+ Sbjct: 24 ATSLTTDKDKLSYSIGADLGKNFKNQGID-INPDVLAKGMQDGMSGAQLILTEEQMKDVL 82 Query: 59 HRALQEVHEKADKVRVERQQALVDEGKTFLEENAKRDDVTTTESGLQFSVLQAGDGPIPS 118 + +++ K ++ + +G FL N + + SGLQ+ ++ AG G P Sbjct: 83 SKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTGAKPG 142 Query: 119 RQDRVRVHYTGRLVDGTVFDSSVERGQPADFPVSGVIPGWIEALSMMPVGSKWKLYIPHN 178 + D V V YTG L+DGTVFDS+ + G+PA F VS VIPGW EAL +MP GS W++++P + Sbjct: 143 KSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPAD 202 Query: 179 LAYGERGAGATIPPFSALMFEVELLEI 205 LAYG R G I P L+F++ L+ + Sbjct: 203 LAYGPRSVGGPIGPNETLIFKIHLISV 229
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 28.6 bits (64), Expect = 0.042 Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 3/49 (6%) Query: 4 SFLLIVVVVLIALFASLFVVEEGQRGIVLRFGKVL--RDSDNKPLVYAP 50 F ++ V LI + +FV E+ QR I +++ K + R S Y P Sbjct: 221 EFGTVIAVGLIMVALVVFV-EQAQRRIPVQYAKRMIGRRSYGGTSTYIP 268
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 26.5 bits (58), Expect = 0.018 Identities = 19/73 (26%), Positives = 29/73 (39%), Gaps = 14/73 (19%) Query: 9 RVIVKRKEVESKSAGGIVLTGTAAGKSTRGEVLAVGNGRILDNGEIKPLDVKVGDVVIFN 68 R V E+E+ ++ T A V + NG +L NGE+ V N Sbjct: 240 RKNVTLAELEAMGQQQLLSLPTNAE----LNVEIMANGVLLGNGEL----------VQMN 285 Query: 69 DGYGVKAEKIDNE 81 D GV+ + +E Sbjct: 286 DTLGVEIHEWLSE 298
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 27.9 bits (62), Expect = 0.007 Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%) Query: 60 EGKLEQEYEVQLLFKSNTDH-QQALLTYIKQHHPYQTPELLVLPVR 104 +G E+E V L+F D Q+AL I + + + EL P+R Sbjct: 163 QGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQWPLR 208
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 48.1 bits (114), Expect = 3e-09 Identities = 34/173 (19%), Positives = 60/173 (34%), Gaps = 11/173 (6%) Query: 3 REQVLSNALNLLEQQGLANTTLEMLAKALSVEVSDLTRFWPDREALLYDCLRYHSQQIDT 62 R+ +L AL L QQG+++T+L +AKA V + + D+ L + I Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 63 WRRQLQLDETLSPQQKLLARY-QTLSEQVQNQRYPGCLFIAACSFYPDTEH----PIHQL 117 + Q P L L V +R + F+ + Q Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRL---LMEIIFHKCEFVGEMAVVQQA 129 Query: 118 AEQQKQASLHYTKALLQEMDAD---DADMVAQQMELILEGCLSKLLIKRQLAD 167 S + L+ AD++ ++ +I+ G +S L+ A Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182
>PF07520#Virulence protein SrfB Length = 1041 Score = 29.6 bits (66), Expect = 0.027 Identities = 16/83 (19%), Positives = 31/83 (37%), Gaps = 5/83 (6%) Query: 282 ILLPVIEEYNRP---QATRRFARIAQAMGVDTQDMSDE-QASHQAIAAIRQLSLQVGIPA 337 ++ VI P + + A + D Q + RQ S++V +P Sbjct: 639 LVHRVISAIVLPRLQDSIAQAGGQFVAERMRELFGGDIGGQEQQTVQRRRQFSIRVLVPL 698 Query: 338 GFSAL-GIEESDIEGWLDKALAD 359 + L E+++ +D +AD Sbjct: 699 AEAILSACEDAEEADRIDIPVAD 721
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 446 bits (1149), Expect = e-160 Identities = 147/357 (41%), Positives = 216/357 (60%), Gaps = 4/357 (1%) Query: 2 KAATAVIDRHALRHNLQQIRRLAPQSRLVAVVKANAYGHGLLAAAHTLQDADCYGVARIS 61 + A +D AL+ NL +R+ A +R+ +VVKANAYGHG+ + D + + + Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLE 62 Query: 62 EALMLRAGGIVKPILLLEGFFDAEDLPVLVANHIETAVHSLEQLVALEAATLSAPINAWM 121 EA+ LR G PIL+LEGFF A+DL + + + T VHS QL AL+ A L AP++ ++ Sbjct: 63 EAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYL 122 Query: 122 KLDTGMHRLGVRPDQAEAFYQRLSACRNVIQPVNIMSHFSRADEPEVAATQQQLACFDAF 181 K+++GM+RLG +PD+ +Q+L A NV + + +MSHF+ A+ P+ +A + Sbjct: 123 KVNSGMNRLGFQPDRVLTVWQQLRAMANVGE-MTLMSHFAEAEHPD--GISGAMARIEQA 179 Query: 182 AAGKPGKQSIAASGGILRWPQAHRDWVRPGIVLYGVSPF-DAPYGRDFGLLPAMTLKSSL 240 A G ++S++ S L P+AH DWVRPGI+LYG SP + GL P MTL S + Sbjct: 180 AEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEI 239 Query: 241 IAVREHKAGESVGYGGTWVSERDTRLGVIAIGYGDGYPRSAPSGTPVWLNGREVSIVGRV 300 I V+ KAGE VGYGG + + + R+G++A GY DGYPR AP+GTPV ++G VG V Sbjct: 240 IGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTV 299 Query: 301 SMDMISIDLGPESTDKVGDEALMWGAELPVERVAACTGISAYELITNLTSRVAMEYL 357 SMDM+++DL P +G +WG E+ ++ VAA G YEL+ L RV + + Sbjct: 300 SMDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.049 Identities = 10/21 (47%), Positives = 12/21 (57%) Query: 39 MVAIIGPNGAGKSTLLRLLTG 59 V + G G GKSTL+ L G Sbjct: 598 SVVLEGTGGIGKSTLINTLVG 618
>PF01206#SirA family protein Length = 76 Score = 92.1 bits (229), Expect = 1e-28 Identities = 17/71 (23%), Positives = 37/71 (52%) Query: 19 DYRLDMVGEPCPYPAVATLEAMPQLKPGEILEVISDCPQSINNIPLDARNYGYTVLDIQQ 78 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 79 DGPTIRYLIQR 89 + T + ++R Sbjct: 65 EDGTYHFRLKR 75
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 347 bits (891), Expect = e-121 Identities = 124/351 (35%), Positives = 199/351 (56%), Gaps = 2/351 (0%) Query: 1 MSTEKNEKPTPKRLKEAKEKGQVVKSVEITSGVQLVALVIYFLLTGYSLVEQAKALIRSS 60 MS EK E+PTPK++++A++KGQV KS E+ S +VAL + E L+ Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 61 IIQLQQPLTLALARIGAECMTVLMHIVVVLGGALIVVTIIAGIAQVGPLLATKAVSFKGE 120 Q P + AL+ + + ++ L ++ I + + Q G L++ +A+ + Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 121 RINPIQNAKQLFSLRSVFELMKSLLKVGVLTLIFGYLLIQYAPSFGYLTHCGSRCALPVF 180 +INPI+ AK++FS++S+ E +KS+LKV +L+++ ++ + L CG C P+ Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180 Query: 181 STLMGWLLGSLIACYLVFSLMDYAFQRYTIMKQLKMSHDEVKREHKDSNGDPHIKQKRRQ 240 ++ L+ ++V S+ DYAF+ Y +K+LKMS DE+KRE+K+ G P IK KRRQ Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240 Query: 241 LQHEVQSGSFATNVRRSTAVVRNPTHFAVCLIYHPEETPLPIVIEKGHDEQAALIVSLAE 300 E+QS + NV+RS+ VV NPTH A+ ++Y ETPLP+V K D Q + +AE Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300 Query: 301 QSGIPVVENIALARALHRDVACGDTIPEQFFEPVAALLRM--ALELDYQPS 349 + G+P+++ I LARAL+ D IP + E A +LR ++ Q S Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 140 bits (354), Expect = 1e-42 Identities = 52/230 (22%), Positives = 105/230 (45%), Gaps = 4/230 (1%) Query: 5 LPGLTALALAMMRPYGILLILPLFTARSLGSSLLRNGLIVAIALPVTPLFLSAPIITNSS 64 L L ++R ++ P+ + RS+ + + GL + I + P + + S Sbjct: 10 LSWLNLYFWPLLRVLALISTAPILSERSVPKRV-KLGLAMMITFAIAPSLPANDVPVFS- 67 Query: 65 PVTWIGVLCTELLIGVVMGFVAALPFWAMNMAGFLIDTLRGATMSTLFNPGMGVESSLFG 124 + + ++LIG+ +GF F A+ AG +I G + +T +P + + Sbjct: 68 -FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLA 126 Query: 125 VLFTQILTVLFLISGGFNQVLAALYGSYDSLPIGQGIQPAADLLLFLQTEWQMMFELCLC 184 + + +LFL G +++ L ++ +LPIG + L + +F L Sbjct: 127 RIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSL-IFLNGLM 185 Query: 185 FALPALLVMVLADLSLGLINRSARQLNVFFLAMPIKSALALFLLLISLPY 234 ALP + +++ +L+LGL+NR A QL++F + P+ + + L+ +P Sbjct: 186 LALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPL 235
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 72.5 bits (178), Expect = 1e-20 Identities = 32/79 (40%), Positives = 47/79 (59%) Query: 14 IVHLATELLWLVLLLSLPVVVVASTVGLVISLVQALTQIQDQTLQFLIKLLAVSATLLMT 73 +V + L+LVL+LS +VA+ +GL++ L Q +TQ+Q+QTL F IKLL V L + Sbjct: 4 LVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLL 63 Query: 74 YHWMGATLLNYTQQSFLQI 92 W G LL+Y +Q Sbjct: 64 SGWYGEVLLSYGRQVIFLA 82
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 227 bits (581), Expect = 1e-77 Identities = 86/220 (39%), Positives = 143/220 (65%), Gaps = 7/220 (3%) Query: 24 LNSSYQLIALLFMLSVLPLLVVMGTAFLKLSVVFSLLRNALGVQQVPPNIAIYGLALVLT 83 + + LIALL ++LP ++ GT F+K S+VF ++RNALG+QQ+P N+ + G+AL+L+ Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 84 IFIMAPVGLDVQARLQNEELSNDIGALAHQIDQNALVPYRDFLQRNTDIEQVTFFNDIVQ 143 +F+M P+ D ++E+++ + + + L YRD+L + +D E V FF + Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 144 NKWPE-------RYRDSVKPDSLLILMPAFTLSQLNEAFKIGLLLFLPFVAIDLIVSNIL 196 + R +D ++ S+ L+PA+ LS++ AFKIG L+LPFV +DL+VS++L Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 197 LAMGMMMVSPMTLSLPFKLLVFVLVDGWSLVLGQLVGSYL 236 LA+GMMM+SP+T+S P KL++FV +DGW+L+ L+ Y+ Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYM 220
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 50.4 bits (120), Expect = 3e-09 Identities = 29/111 (26%), Positives = 50/111 (45%), Gaps = 4/111 (3%) Query: 205 YIKLEGGNRMTIQQINEASDPLACGSRAESLPLAAVQFEDLPQTLVMEIGRLTLPLGEIK 264 + ++EGG + I + AE+LP LP L + R + L E++ Sbjct: 194 FNRVEGGIIVETLDIQHIEEENNTTETAETLP----GLNQLPVKLEFVLYRKNVTLAELE 249 Query: 265 QLAVGQTLACQTHCYGEVNICLNGQSVGRGSLLRCDEQLVVRIAQWGLQNG 315 + Q L+ T+ V I NG +G G L++ ++ L V I +W ++G Sbjct: 250 AMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.6 bits (69), Expect = 0.002 Identities = 15/118 (12%), Positives = 37/118 (31%), Gaps = 11/118 (9%) Query: 5 QQRTLQRLLALRQRQERRLRQQLGQLRREQQQQEQQQENGRRRHQQLCQQLQQLAQWCGM 64 ++ + Q Q+ + L + R E+ + + +L + Sbjct: 187 LTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL--- 243 Query: 65 LTPREADEQKVLRQAVYQAERQAKKQLNAWVAQGRQQVSAIERQ--QARLRRNQREQE 120 +Q + + AV + E + + + + Q+ IE + A+ Q Sbjct: 244 -----LHKQAIAKHAVLEQENKY-VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 63.1 bits (153), Expect = 1e-13 Identities = 44/188 (23%), Positives = 70/188 (37%), Gaps = 7/188 (3%) Query: 7 MLAIVLMTLSLSGCDME-LYSGLSEGEANQMLALLMLHQINAEKQIEKSGMVGLTVDKRQ 65 + +V M L D L+S LS+ + ++A L I + V + Sbjct: 35 VAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYR--FANGSGA-IEVPADK 91 Query: 66 FINAVELLRQNGFPRQRFITVDELFPANQLVTSPTQEQAKMVFLKEQQLENMLSHMDGVI 125 L Q G P+ + EL + S EQ E +L + + V Sbjct: 92 VHELRLRLAQQGLPKGGAVGF-ELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150 Query: 126 HADVTVAMPM-SVDGKNPLPHTASVFIKYSPEVNLQSYQ-SQIKGLVRDAVPGIDYAKIS 183 A V +AMP S+ + +ASV + P L Q S + LV AV G+ ++ Sbjct: 151 SARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVT 210 Query: 184 VVMQPANY 191 +V Q + Sbjct: 211 LVDQSGHL 218
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 478 bits (1231), Expect = e-166 Identities = 160/514 (31%), Positives = 269/514 (52%), Gaps = 21/514 (4%) Query: 4 IYIMRKITGLILLFFATLLPYGKFSYGKAIPWQGEPFFIYSRGMTVSELLKDLGMNYGIP 63 + R +TG +LL + S+ + + W P+ ++G ++ +LL D G NY Sbjct: 7 SFFKRVLTGTLLLLSSY-------SWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDAT 59 Query: 64 VVISSEINEHFTGKIRDKTPEKILSELAGRYNITWYYDGETLYFYPVQSIKREFISPDGL 123 VV+S +IN+ +G+ P+ L +A YN+ WYYDG LY + + I Sbjct: 60 VVVSDKINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQES 119 Query: 124 AANTLVKYLQRGDVLAGKNCAIKAIPHLDTLEVKGVPICIERVKSVSKMLS--EQVRHQN 181 A L + LQR + + + V G P +E V+ + L Q+R + Sbjct: 120 EAAELKQALQRSGIWE-PRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEK 178 Query: 182 QNKETVKVFPLKYASAADSDYQYRDQNVRLPGLVSVLRELNQGNNLPLAGGNQPDGNQAS 241 +++FPLKYASA+D YRD V PG+ ++L+ + + + QA+ Sbjct: 179 TGALAIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAA 238 Query: 242 S-----PVFSADPRQNAVIIRDRQANMPIYRSLITQLDQRPIQIEISVTIIDVDAGDISQ 296 + ADP NA+I+RD MP+Y+ LI LD+ +IE++++I+D++A +++ Sbjct: 239 TRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTE 298 Query: 297 LGVDWSASASIGGTGV------SFNSTFAKNNAEGFSTVIGDTGNFMVRLNALQKNSRAR 350 LGVDW G S A N A G + R+N L+ A+ Sbjct: 299 LGVDWRVGIRTGNNHQVVIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQ 358 Query: 351 ILSQPSVVTLNNIQAVLDKNVTFYTKLQGEKVAKLESVTSGSLLRVTPRMIETEGVQEVL 410 ++S+P+++T N QAV+D + T+Y K+ G++VA+L+ +T G++LR+TPR++ E+ Sbjct: 359 VVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEIS 418 Query: 411 LNLNIQDGQQQASTNSNEPLPEIRNSDISTQATLQVGQSLLLGGFIQDTQIESQNKIPLL 470 LNL+I+DG Q+ +++ E +P I + + T A + GQSL++GG +D + +K+PLL Sbjct: 419 LNLHIEDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLL 478 Query: 471 GDIPLLGGLFRSTDKQSHSVVRLFLIKAVPVNAG 504 GDIP +G LFR + + VRLF+I+ ++ G Sbjct: 479 GDIPYIGALFRRKSELTRRTVRLFIIEPRIIDEG 512
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 44.2 bits (104), Expect = 1e-07 Identities = 21/115 (18%), Positives = 43/115 (37%), Gaps = 9/115 (7%) Query: 18 RARTRRLLIDTAMSMYERGAFPSIT--EVASAAQLSRATAYRYFPTQSALVSAMVDESLG 75 TR+ ++D A+ ++ + S + E+A AA ++R Y +F +S L S + + S Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68 Query: 76 PILAW-------QPTQPDARQRIAELLSFAYPRMLQHEGVLRAALHLSLQQWADA 123 I P P + R + + +L + + + Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.7 bits (113), Expect = 5e-09 Identities = 32/177 (18%), Positives = 61/177 (34%), Gaps = 14/177 (7%) Query: 8 KRNRREEILQALAQMLESSDGSQRITTAKLAANVGVSEAALYRHFPSKTRMFDSLIEFIE 67 + R+ IL ++ G + ++A GV+ A+Y HF K+ +F + E E Sbjct: 9 AQETRQHILDVALRLFSQQ-GVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 68 DSLMSRINLILQDEKETFN-RLRLILLLVLGFAERNPGLTRIMT-------GHALMFEQD 119 ++ LR IL+ VL +M M Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127 Query: 120 RLQGRIN-QLFERIEMQLRQVLREKKLRDGQGFIHDEALLATQLLAFCEGMLSRFVR 175 + Q + + ++RIE L+ + K L A + + G++ ++ Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPAD----LMTRRAAIIMRGYISGLMENWLF 180
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.005 Identities = 7/21 (33%), Positives = 11/21 (52%) Query: 58 RVTKTAKFLGVSYPTLWRWMR 78 K A LG++ TL + +R Sbjct: 451 NQIKAADLLGLNRNTLRKKIR 471
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 27.4 bits (61), Expect = 0.042 Identities = 16/37 (43%), Positives = 19/37 (51%), Gaps = 4/37 (10%) Query: 146 DMLSWCHHL-PAKPEGRFYALKGVRPDDELAVLPEDI 181 DML CHHL P PE +A +R + A EDI Sbjct: 316 DMLMVCHHLSPTIPEDIAFAESRIRKETIAA---EDI 349
>cloacin#Cloacin signature. Length = 551 Score = 30.8 bits (69), Expect = 0.002 Identities = 28/106 (26%), Positives = 45/106 (42%), Gaps = 22/106 (20%) Query: 34 EKRQQEIADGLSSAE-------RAKKDLDLAQAN-ATDQLKKAKAEAQVIIEQASKRKAQ 85 E R+Q+ D E RA+ +L+ A + A +Q ++AKA Q Sbjct: 303 ENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAV-------------Q 349 Query: 86 ILDEAKAEAEQERNKIVAQAQAEIDAERKRAREELRKQVAMLAIAG 131 + + K+E NK +A A AEI + A + + M +AG Sbjct: 350 VYNSRKSEL-DAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAG 394
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 60.6 bits (147), Expect = 3e-13 Identities = 34/173 (19%), Positives = 63/173 (36%), Gaps = 20/173 (11%) Query: 4 RVVFIDDHDIVRSGFAQLLSLEEDIQVVGEFSSAKQARAGLPGLQANICICDISMPDENG 63 ++ DD +R+ Q LS V S+A + ++ + D+ MPDEN Sbjct: 5 TILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 64 LDLLKGLPS---GMGVIMLSMHDSPALVETALERGARGFLSKRCKPEDLISAVRTVGSGG 120 DLL + + V+++S ++ A E+GA +L K +LI + Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR----- 117 Query: 121 VYLMPEIAQQLARVAVDPLTRREREVAVLLAEG---MEVREIAESLGLSPKTV 170 A + L ++ L+ E+ + L + T+ Sbjct: 118 -------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 17/85 (20%), Positives = 33/85 (38%), Gaps = 10/85 (11%) Query: 426 VTNAYRHGAASR-----IEINARQDNQQIYLTISDNGK-GIDLASITPGYGLRGIQSRVS 479 V N +HG A I + +DN + L + + G + + G GL+ ++ R+ Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQ 323 Query: 480 A-FGGNVSLSV---DNGTCLNVTLP 500 +G + + V +P Sbjct: 324 MLYGTEAQIKLSEKQGKVNAMVLIP 348
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.9 bits (106), Expect = 5e-07 Identities = 33/157 (21%), Positives = 68/157 (43%), Gaps = 7/157 (4%) Query: 49 FNFIMPAMLTDLGLSMSDVGILGTLFYITYGCSKFVSGMISDRSNPRYFMGIGLVMTGII 108 N +P + D + + T F +T+ V G +SD+ + + G+++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFG 92 Query: 109 NILFGMSSSLLVLGALWILNAFFQGWG---WPPCSKILTSWY-SRSERGGWWAIWNTSHN 164 +++ + S +L I+ F QG G +P ++ + Y + RG + + + Sbjct: 93 SVIGFVGHSFF---SLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVA 149 Query: 165 FGGALIPLLVGVITLHFSWRYGMIIPGIIGVVIGLLM 201 G + P + G+I + W Y ++IP I + + LM Sbjct: 150 MGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLM 186
>PF05860#haemagglutination activity domain. Length = 117 Score = 59.4 bits (144), Expect = 4e-13 Identities = 20/128 (15%), Positives = 42/128 (32%), Gaps = 18/128 (14%) Query: 53 TPPSTCRALTSYCIGMTETVVNIQAPDENGLSHNKYSKFDVVANGLFDVTTLNNRLAQEV 112 TP +T ++ ++ + L H+ + +F V +G Sbjct: 4 TPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTA------------- 49 Query: 113 NGNSFLQDKSATIILNEVNSSHASLLDGNLRVDGGNAHIIIANPAGINCRGCSFTNASHV 172 F + I++ V S +DG +R A++ + NP GI + + Sbjct: 50 ---FFNNPTNIQNIISRVTGGSVSNIDGLIRA-NATANLFLINPNGIIFGQNARLDIGGS 105 Query: 173 TLTTGTPS 180 + + Sbjct: 106 FVGSTANR 113
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.5 bits (79), Expect = 9e-05 Identities = 28/116 (24%), Positives = 47/116 (40%), Gaps = 8/116 (6%) Query: 55 IEREALLLWIARDEIGIIGTIQLVLCQKPNGLNRAEIQKLLVHSRSRRTGIGHKLIIAAE 114 +E E ++ E IG I++ + N A I+ + V R+ G+G L+ A Sbjct: 60 VEEEGKAAFLYYLENNCIGRIKI----RSNWNGYALIEDIAVAKDYRKKGVGTALLHKAI 115 Query: 115 NTAVQLRRGLIYLDTQS-GSSAESFYRAQGYRYVG-EIPDYACTPNGNYHPTAIYF 168 A + + L+TQ SA FY + + Y+ P N AI++ Sbjct: 116 EWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTAN--EIAIFW 169
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPARHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160 G+P I F+NK D + L V +++E LS Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150
>SECETRNLCASE#Bacterial translocase SecE signature. Length = 127 Score = 161 bits (410), Expect = 7e-55 Identities = 109/127 (85%), Positives = 116/127 (91%) Query: 1 MSANTEAPGSGRGLETAKWLIVAVLLVVAIVGNYYYREYSLPLRALAVVVIIAVAGAVAL 60 MSANTEA GSGRGLE KW++V LL+VAIVGNY YR+ LPLRALAVV++IA AG VAL Sbjct: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60 Query: 61 MTAKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120 +T KGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS Sbjct: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120 Query: 121 FITGLRF 127 FITGLRF Sbjct: 121 FITGLRF 127
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 26.7 bits (59), Expect = 0.045 Identities = 14/71 (19%), Positives = 26/71 (36%), Gaps = 2/71 (2%) Query: 4 KVQAYVKLQVAAGMANPSPPVGPALGQQ-GVNIMEFCKAFNAKTESIEKGLPIPVVITVY 62 +V+ + N P G + G N ++ KA AK ++ P + + Sbjct: 267 RVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP 326 Query: 63 SDRSFTFVTKT 73 D + FV + Sbjct: 327 YDTT-PFVQLS 336
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 750 bits (1939), Expect = 0.0 Identities = 278/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%) Query: 1 MISGILVSPGIAFGKALLLKEDEIVINRKKISADQVEQEVERFKAGRAKAAEQLEAIKTK 60 I+GI S G+A KA + E + I + I V E+E+ A K+ E+L AIK + Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 61 AGVSLGEEKAAIFEGHIMLLEDEELEQEIIALIKDEHASADAAAYSVIEGQAKALEELDD 120 S+G +KA IF H+++L+D EL I I++E +A+ A V + E +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 121 EYLKERAADVRDIGKRLLKNILGLNIVDLSAIQDEVILVATDLTPSETAQLNLDKVLGFI 180 EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN V GF Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 181 TDIGGRTSHTSIMARSLELPAIVGTSNVTKQVKNDDYLILDAVNNKVYLNPTADVIEQLK 240 TDIGGRTSH++IM+RSLE+PA+VGT VT+++++ D +I+D + V +NPT + ++ + Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 241 AVKNQYITEKNELAKLKDLPAITLDGHQVEVVANIGTVRDIAGAERNGAEGVGLYRTEFL 300 + + +K E AKL P+ T DG VE+ ANIGT +D+ G NG EG+GLYRTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 301 FMDRDSLPTEEEQFQAYKAVAEAMGSQAVIVRTMDIGGDKDLPYMNLPKEENPFLGWRAI 360 +MDRD LPTEEEQF+AYK V + M + V++RT+DIGGDK+L Y+ LPKE NPFLG+RAI Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 361 RIAMDRKEILHAQLRAILRASAFGKLRIMFPMIISVEEVRELKAELELLKSQLREENKAF 420 R+ +++++I QLRA+LRAS +G L++MFPMI ++EE+R+ KA ++ K +L E Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 421 DETIEVGVMVETPAAAVIARHLAKEVDFFSIGTNDLTQYTLAVDRGNELISHLYNPMSPS 480 ++IEVG+MVE P+ AV A AKEVDFFSIGTNDL QYT+A DR NE +S+LY P P+ Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 481 VLGLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540 +L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + + Sbjct: 482 
ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 541 NFEDVKVLAEQALAQPTAKELMDLVTTFIEE 571 + E++K A++AL TA+E+ LV + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 31.2 bits (70), Expect = 0.003 Identities = 24/83 (28%), Positives = 38/83 (45%), Gaps = 7/83 (8%) Query: 207 PSCTFDGPQKVNFGLVTSSNL-NNGGIERDLDFNITCKTDYGHYSATAAISTQTPSDDNN 265 P+CT + VN+G + NL +GG ++D ++ C G T + QT N Sbjct: 37 PACTVQNAE-VNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLGTMKVTITSNGQT----GN 91 Query: 266 YIKVKDNQN-QEDRLLIKISDTN 287 I V + D LLI + ++N Sbjct: 92 SILVPNTSTASGDGLLIYLYNSN 114
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (296), Expect = 2e-38 Identities = 36/89 (40%), Positives = 55/89 (61%) Query: 4 TKAEMSEHLFEKLGLSKRDAKDLVELFFEEVRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + E L+K+D+ V+ F V L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>OUTRSURFACE#Outer surface protein signature. Length = 273 Score = 29.5 bits (66), Expect = 0.004 Identities = 18/52 (34%), Positives = 27/52 (51%), Gaps = 8/52 (15%) Query: 1 MKKYLLLFGVLSFMPLIAQSDVSLD------INMPGIN--LHLGDQDKRGYY 44 MKKYLL G++ + Q+ SLD +++PG L ++DK G Y Sbjct: 1 MKKYLLGIGLILALIACKQNVSSLDEKNSASVDLPGEMKVLVSKEKDKDGKY 52
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.003 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 10/66 (15%) Query: 29 LIGPNGAGKSTLLASLAGL------LPASGEIVLAGKSLQHYEGHELAR----QRAYLSQ 78 L G G GKSTL+ +L GL G + + + +EL+ +RA Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660 Query: 79 QQSALS 84 ++ S Sbjct: 661 VKAFFS 666
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 32.5 bits (74), Expect = 0.002 Identities = 25/90 (27%), Positives = 36/90 (40%), Gaps = 4/90 (4%) Query: 220 LMYDLITCLTTTPLRLLSLVGSAIALLGF-TFSVLLVALRLIFGPEWAGGGVFTLFAVLF 278 L L+ L + L V A+A G+ L A +L+ G E G G F L A L Sbjct: 166 LWGGLLFNLLGGFVSLGDAVIGAMA--GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALG 223 Query: 279 MFIGAQFV-GMGLLGEYIGRIYNDVRARPR 307 ++G Q + + LL +G R Sbjct: 224 AWLGWQALPIVLLLSSLVGAFMGIGLILLR 253
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 100 bits (251), Expect = 3e-25 Identities = 73/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%) Query: 317 RVLILGVNGFIGNHLTERLLQDDRYEVYGLDIGSD--------AISRFLGNPAFHFVEGD 368 + L+ G GFIG H+++RLL+ ++V G+D +D A L P F F + D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 369 ISIHSEWIE--YHIKKCDVILPLVAIATPIEYT-RNPLRVFELDFEENLKIVRDCVKYN- 424 ++ E + + + + + Y+ NP + + L I+ C Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118 Query: 425 KRIVFPSTSEVYGMCDDKEFDEDTSRLIVGPINKQRWIYSVSKQLLDRVIWAYGVKEGLK 484 + +++ S+S VYG+ F D ++ +Y+ +K+ + + Y GL Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTD------DSVDHPVSLYAATKKANELMAHTYSHLYGLP 172 Query: 485 FTLFRPFNWMGPRLDNLDAARIGSSRAITQLILNLVEGSPIKLVDGGAQKRCFTDIHDGI 544 T R F GP D A ++A+ +EG I + + G KR FT I D Sbjct: 173 ATGLRFFTVYGPWGRP-DMALFKFTKAM-------LEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 545 EALFRIIEN---------------RDGCCDGQIINIGNPTNEASIRELAEMLLTSFENHE 589 EA+ R+ + ++ NIGN ++ + + + L + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGN-SSPVELMDYIQALEDALGIEA 283 Query: 590 LRDHFPPFAGFKDIESSAYYGKGYQDVEYRTPSIKNARRILHWQPEIAMQQTVTETLDFF 649 ++ P G DV + K ++ + PE ++ V ++++ Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328 Query: 650 L 650 Sbjct: 329 R 329
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.1 bits (91), Expect = 3e-05 Identities = 72/440 (16%), Positives = 151/440 (34%), Gaps = 49/440 (11%) Query: 62 LGMSEADSITLFSSFSALVYGFVAIGGWLGDKVLGAKRVIVLGALTLAVGYSMIAYSGHE 121 A + + ++F A+ G L D+ LG KR+++ G + G S+I + GH Sbjct: 44 FNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ-LGIKRLLLFGIIINCFG-SVIGFVGHS 101 Query: 122 IF-WVYLGMATIAVGNGLFKANPSSLLSTCYSKDDPRLDGAFTMYYMSINIGSFFSMLAT 180 F + + G F A +++ K+ AF + + +G Sbjct: 102 FFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE--NRGKAFGLIGSIVAMGEGVGPAIG 159 Query: 181 PWLAAKYGWSVAFSLSVVGMLITLVNFWFCRKWVKNQGSKPDFLPLQFKKLLMVLVGIIA 240 +A WS + +IT++ F K +K + K ++++ VGI+ Sbjct: 160 GMIAHYIHWSYLLLI----PMITIITVPFLMKLLKKEVRIKG--HFDIKGIILMSVGIVF 213 Query: 241 LITLSNWLLHNQIIARWALALVSLGIIFIFTKET-----------LFLQGIARRRMIVAF 289 + + + +VS+ IF K L ++ Sbjct: 214 FMLFTT-------SYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGG 266 Query: 290 LLMLEAVIFFVLYSQMPTSLNFFAIHNVEHSIFGIGFEPEQFQALNPFWIMLASPILAAI 349 ++ F + +P + +H + + G F ++ I I Sbjct: 267 IIFGTVAGFVSM---VPYMMKD--VHQLSTAEIGSVI---------IFPGTMSVIIFGYI 312 Query: 350 YNKMGDRLPMPHKFAFGMMLCSAAFLVLPWGASFANEHGIVSVNW-LILSYALQSIGELM 408 + DR + G+ S +FL ASF E + ++ S + + Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFL----TASFLLETTSWFMTIIIVFVLGGLSFTKTV 368 Query: 409 ISGLGLAMVAQLVPQRLMGFIMGSWFLTTAAAALIAGKVAALTAVPSDAI-TDAHASLAI 467 IS + + + Q M + + FL+ I G + ++ + + + S + Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYL 428 Query: 468 YSHVFMQIGIVTAIIAVLMM 487 YS++ + + I ++ + Sbjct: 429 YSNLLLLFSGIIVISWLVTL 448
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.006 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 31.5 bits (71), Expect = 0.002 Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 3/46 (6%) Query: 144 LLLAIIVVAFVGPS-LEHAMFAVWLALLPRMVRTIYSAVHDELDKE 188 LL+ II + +GP L A +A R +R++ + V +EL +E Sbjct: 10 LLVFIIGLVVLGPQRLPVA--VKTVAGWIRALRSLATTVQNELTQE 53
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 346 bits (890), Expect = e-119 Identities = 124/344 (36%), Positives = 176/344 (51%), Gaps = 19/344 (5%) Query: 3 EQLDNLLGEANAFVDVLEQVSGLAKLNKPVLVIGERGTGKELIAHRLHYLSERWQGPFIS 62 + L+G + A ++ ++ L + + +++ GE GTGKEL+A LH +R GPF++ Sbjct: 134 QDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVA 193 Query: 63 LNCAALNENLLDSELFGHEAGAFTGAQKRHLGRFERADGGTLFLDELATAPMLVQEKLLR 122 +N AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLR Sbjct: 194 INMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLR 253 Query: 123 VIEYGHLERVGGSQPLQVDVRLVCATNDNLPALAAAGKFRADLLDRLAFDVVQLPPLRER 182 V++ G VGG P++ DVR+V ATN +L G FR DL RL ++LPPLR+R Sbjct: 254 VLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDR 313 Query: 183 QQDIMLLAEHFAILMCRELGLPLFSGFTATAKEQLLEYRWPGNVRELKNVVERSV----- 237 +DI L HF +E F A E + + WPGNVREL+N+V R Sbjct: 314 AEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQ 371 Query: 238 -----------YRHSDSSLPLNNIIINPFASNQKGEIEGVDTPNEGGAVLPALPVD-LKH 285 R P+ + + +E P Sbjct: 372 DVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDR 431 Query: 286 WLHTSEHQMLTRALKQARFNQRKAAHLLGLTYHQLRGLLKKHTI 329 L E+ ++ AL R NQ KAA LLGL + LR +++ + Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>cloacin#Cloacin signature. Length = 551 Score = 30.5 bits (68), Expect = 0.005 Identities = 33/146 (22%), Positives = 54/146 (36%), Gaps = 29/146 (19%) Query: 56 QLLRRIDHSESQQQEWQ------------EKAELALRKDKEDLARAALLEKQ-KVMTLVE 102 Q+ +R D +QQEW E+A L + ED+AR E+Q K + + Sbjct: 295 QVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVAR--NQERQAKAVQVYN 352 Query: 103 TLKREVATVDETLSRMKHEITELENKLTETRA--------------RQQALTLRHQAASS 148 + K E+ ++TL+ EI + + A R Q QAA Sbjct: 353 SRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFD 412 Query: 149 SRDVRRQLDSGKLDEAMARFEQFERR 174 + + L AM ++ E + Sbjct: 413 AAAKEKSDADAALSSAMESRKKKEDK 438
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 104 bits (260), Expect = 3e-28 Identities = 37/249 (14%), Positives = 83/249 (33%), Gaps = 41/249 (16%) Query: 33 QTALFFGKDDRTAVTNSRQWPWEAIGQVET---ASGNLCTATLISPRLVLTAGHCVLTP- 88 + +DR +T++ + + ++ + + ++ +LT H V Sbjct: 66 HANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATH 125 Query: 89 --PGNIDQAVALRFISDKGHWKYQITDLKTRVDAKLGQKLKADGDGWIVPPAAAAYDFAL 146 P + + + + + + +GD A F+ Sbjct: 126 GDPHALKAFPSAINQDNYPNGGFTAEQITKY---------SGEGD-------LAIVKFSP 169 Query: 147 IQLTNAAPIPIKPLPLWEGTANELTKALKLVNRKVTQAGYPLD-NLNTLYKHEDCLVTGW 205 + +KP + A VN+ +T GYP D + T+++ + + Sbjct: 170 NEQNKHIGEVVKPATM-------SNNAETQVNQNITVTGYPGDKPVATMWESKG--KITY 220 Query: 206 AQQGVLAHQCDTLPGDSGSPLLLKNGNSWSLIAIQSSAPAAKERYLADNRALSVT-AINN 264 + + + T G+SGSP+ + +I I + N A+ + + N Sbjct: 221 LKGEAMQYDLSTTGGNSGSPVFNEKNE---VIGIHWGGVPNE-----FNGAVFINENVRN 272 Query: 265 RLKKLVNKI 273 LK+ + I Sbjct: 273 FLKQNIEDI 281
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 29.4 bits (66), Expect = 0.030 Identities = 23/87 (26%), Positives = 28/87 (32%), Gaps = 14/87 (16%) Query: 402 YRNGCMQDIHWTDGAFGYFPTYTLGAMYAAQLFHAARSAIPALDSHIANGNLAPLLNWLQ 461 +R M + W YF G RS P + I PLL+WL Sbjct: 35 HRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSCCPHCNHPITALENIPLLSWL- 93 Query: 462 QNIWQHGS----------RYPTAELIT 478 W G RYP EL+T Sbjct: 94 ---WLRGRCRGCQAPISARYPLVELLT 117
>PF06580#Sensor histidine kinase Length = 349 Score = 29.4 bits (66), Expect = 0.030 Identities = 16/105 (15%), Positives = 31/105 (29%), Gaps = 28/105 (26%) Query: 327 LVNNALRY------SHQRLRIGLWFDGDNACLQVEDDGPGIPPEERTRIFEPFVRLDPSR 380 LV N +++ ++ + D L+VE+ G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE------------- 309 Query: 381 DRATGGCGLGLAIVHS-IALAY--QGSISVNTSPLGGASFRFSWP 422 G GL V + + Y + I + + G + P Sbjct: 310 -----STGTGLQNVRERLQMLYGTEAQIKL-SEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.9 bits (171), Expect = 6e-16 Identities = 26/133 (19%), Positives = 59/133 (44%), Gaps = 2/133 (1%) Query: 2 SKIVFVEDDPEVGKLIAAYLGKHDIDVFVEPRGDTAQAVIEQQQPDLVLLDIMLPGKDGM 61 + I+ +DD + ++ L + DV + T I DLV+ D+++P ++ Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 TLCRDLRPHYDG-PIVLLTSLDSDMNHILSLEMGANDYILKTTPPAVLLARLRLHLRQHN 120 L ++ P++++++ ++ M I + E GA DY+ K L+ + L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL-AEP 122 Query: 121 QRLRQQTPLQAKE 133 +R + +++ Sbjct: 123 KRRPSKLEDDSQD 135
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.8 bits (163), Expect = 3e-15 Identities = 34/166 (20%), Positives = 76/166 (45%), Gaps = 10/166 (6%) Query: 1 MTK-SVMIVDDHPAIRVAIHALLSQSKEFSTISESVDGSEALEKLKNNPVDLVIIDIELP 59 MT ++++ DD AIR ++ LS+ + + + + + DLV+ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 60 NFDGFSLLKKLQQRGFTGKSLFLSAKNEQVFAVRALQAGANGFISKNKDISEILFAAQNV 119 + + F LL ++++ L +SA+N + A++A + GA ++ K D++E++ Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 120 LRGYSFFPSETLTQ------LAGQ-PSSHDPVNRARLLSEREINVL 158 L PS+ L G+ + + L + ++ ++ Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 752 bits (1944), Expect = 0.0 Identities = 245/895 (27%), Positives = 395/895 (44%), Gaps = 77/895 (8%) Query: 2 RIAPWLSCLLTQSLLVTHISSAADKNNQDDYIFDDALVRGSSLGLGSIARFNKKNSYDAG 61 R+A + L ++ + F+ + + ++RF G Sbjct: 22 RLAGFFVRLFVACAFAAQAPLSSA-----ELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 62 QYQVDMYMNNKFVDRLKMLFVDKDNS--VEPCLSVAQLLQAGVKEEALKTAD--PKTPCL 117 Y+VD+Y+NN ++ + F D+ + PCL+ AQL G+ ++ + C+ Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACV 136 Query: 118 AFQSILPASDFRFDHAKLRFDLSIPQKFVKNVPRGYVDPKNLTAGNTIGFSNYNLNQYHV 177 S++ + + D + R +L+IPQ F+ N RGY+ P+ G G NYN + V Sbjct: 137 PLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV 196 Query: 178 DYNKEGIKRTTNSTYLSLNSGINIGMWRFRQQGSLRYDASRG-----TNWTSNRLYSQRA 232 G ++ YL+L SG+NIG WR R + Y++S W + +R Sbjct: 197 QNRIGG---NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERD 253 Query: 233 LPTIGSEITLGETFSSGQFFSSLGFLGVALSTDDRMLPESQRGYAPVVRGIARTNARVTV 292 + + S +TLG+ ++ G F + F G L++DD MLP+SQRG+APV+ GIAR A+VT+ Sbjct: 254 IIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTI 313 Query: 293 YQNNRSIYQTTVSPGAFEFNDLSVTHFGGDLTVEINEADGSVSTFQVPFASVPESLRPGY 352 QN IY +TV PG F ND+ GDL V I EADGS F VP++SVP R G+ Sbjct: 314 KQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGH 373 Query: 353 SRYSFAAGQVRDVGN---NETFSELTYQQGISNAITANTGIRLASGYQAIMLGGVF-THY 408 +RYS AG+ R F + T G+ T G +LA Y+A G Sbjct: 374 TRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGA 433 Query: 409 IGALGLNTTYSHARLPDGEQQQGWMAKASFSRTFQPTNTTLSVAGYRYSTDGYRDLSDVL 468 +GAL ++ T +++ LPD Q G + ++++ + T + + GYRYST GY + +D Sbjct: 434 LGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTT 493 Query: 469 GVR--------------ATSNDSSWNSSTYRQRSRAEISLNQNFHRYGSLYLTASSQDYR 514 R + + + Y +R + ++++ Q R +LYL+ S Q Y Sbjct: 494 YSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYW 553 Query: 515 DDRSRDSQLQLGYSNTFWRNTSFNLAISQQKTGGANKIYFVDPGSGMPASNGANTLATRE 574 + D Q Q G + + + 
++ L+ S K R+ Sbjct: 554 GTSNVDEQFQAGLNTA-FEDINWTLSYSLTKNAWQKG---------------------RD 591 Query: 575 TVAQMSISFPLGGSSSAP--------YVSAGAVNSRTSGASYQTSLSGTMGSDQTAGYSV 626 + ++++ P + S + + + GT+ D YSV Sbjct: 592 QMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSV 651 Query: 627 DVARNEP---TNENTLSGSLQKQLPTTSLSGSASRSPGYWQGSASARGSVAFHRGGVTLG 683 + +T +L + + + S S Q G V H GVTLG Sbjct: 652 QTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLG 711 Query: 684 PYLSDTFALIEAKGASGAKVMYGQGARIDRFGYALVPTLTPYRYNTLSLDPDGMDFNTEL 743 L+DT L++A GA AKV G R D GYA++P T YR N ++LD + + N +L Sbjct: 712 QPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDL 771 Query: 744 QDGERQIAPYAGSTVKVTFRTLNGYPALITIKMPDGSQLPMGTVVYNYNGKGTNDKNDIV 803 + + P G+ V+ F+ G L+T+ + LP G +V T++ + Sbjct: 772 DNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMV-------TSESSQSS 823 Query: 804 GMVGQSSQAYLRAEELSGTLTLVWGESSKERCQLDYDLGKPTDNDKQLYKLDALC 858 G+V + Q YL L+G + + WGE C +Y L P + L +L A C Sbjct: 824 GIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQL-PPESQQQLLTQLSAEC 877
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 30.2 bits (68), Expect = 0.022 Identities = 13/116 (11%), Positives = 26/116 (22%), Gaps = 15/116 (12%) Query: 311 ESGTSSGQTAIGIQTSLPGYLKALGLGLVNTAGGVSYLLSDSYG--TDSRIATGVGISLS 368 + + + ++P L + + S SY D + Sbjct: 584 NAWQKGRDQMLALNVNIP-----FSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVY 638 Query: 369 DSNGSTMNFVGWG-------GCAQTQDCLTTADAGWYPILTGASGNGSHSAGYNNY 417 + N + + G A + A+ SHS Sbjct: 639 GTLLED-NNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQL 693
>PF05272#Virulence-associated E family protein Length = 892 Score = 35.4 bits (81), Expect = 4e-04 Identities = 13/33 (39%), Positives = 16/33 (48%) Query: 33 VFIGPSGCGKSTLLRMIAGLETISSGEISIGDK 65 V G G GKSTL+ + GL+ S IG Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTG 632
>INTIMIN#Intimin signature. Length = 939 Score = 25.0 bits (54), Expect = 0.014 Identities = 13/30 (43%), Positives = 18/30 (60%) Query: 20 EERFQLLVESKILTKNGTYNSRFFTKETVE 49 E F+L +SK+LT N N F+T +T E Sbjct: 42 ENYFKLGSDSKLLTHNSYQNRLFYTLKTGE 71
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 103 bits (259), Expect = 6e-27 Identities = 69/256 (26%), Positives = 114/256 (44%), Gaps = 8/256 (3%) Query: 433 SVKPLQGQIVVVTGAGGGIGAAIAKEFSLLGAELAVLDIDSESAKNVAAQL---GPHALA 489 + K ++G+I +TGA GIG A+A+ + GA +A +D + E + V + L HA A Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 490 LQCDVTETASVQAAFEMIATKFGGVDIVVSNAGIALSGAIAELPEATLRTSFEVNFFAHQ 549 DV ++A++ I + G +DI+V+ AG+ G I L + +F VN Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121 Query: 550 RVAQQAVSIMKKQGIGGVLLFNISKQAINPGINFGAYGTSKAALLSLVRQYALEQGQDGI 609 ++ M + G ++ S A P + AY +SKAA + + LE + I Sbjct: 122 NASRSVSKYMMDRRSGSIVTVG-SNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 610 RVNAVNADRIRSGLLDDEMISLRARARGL--SEEKYMAGNLLGQEVTAQDVAKA--FVVS 665 R N V+ + + + + S E + G L + D+A A F+VS Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240 Query: 666 AMLDKSTGNVITVDGG 681 T + + VDGG Sbjct: 241 GQAGHITMHNLCVDGG 256
>FLAGELLIN#Flagellin signature. Length = 507 Score = 40.8 bits (95), Expect = 5e-06 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 7/137 (5%) Query: 4 STSMLYQQNMQGITNAQSLWMQTGQQLSTGKRVVNPSDDPMAASQAVMVSQAESENSQYT 63 S S+L Q N+ ++ S ++LS+G R+ + DD AA QA+ + Sbjct: 8 SLSLLTQNNLNKSQSSLS---SAIERLSSGLRINSAKDD--AAGQAIANRFTSNIKGLTQ 62 Query: 64 LARSFARQSSSLETT--VLAQTTSTIQSIQSLVISAKNDTLSDDDRASYATQLQGLKDQL 121 +R+ S +TT L + + +Q ++ L + A N T SD D S ++Q +++ Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122 Query: 122 LNQANTTDGNGRYIFAG 138 +N T NG + + Sbjct: 123 DRVSNQTQFNGVKVLSQ 139
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 36.3 bits (84), Expect = 2e-04 Identities = 13/26 (50%), Positives = 18/26 (69%) Query: 5 RILVLGASGYIGQHLVPLLSQQGHQV 30 + LV GA+G+IG H+ L + GHQV Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV 27
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 76.4 bits (188), Expect = 9e-18 Identities = 71/366 (19%), Positives = 126/366 (34%), Gaps = 73/366 (19%) Query: 1 MKVLVTGATSGLGRNAVEYLRRQEISVIA---------TGRNQAMGALLTKLGAKFIHAD 51 MK LVTGA +G + + L V+ QA LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTDLVSSQAKAMLADVDTLWHCS-------SFTSPWGTEQAFALANVRATRRLGEWAAAY 104 L D + ++ S +P A+A +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVENFIHISSPAIYFDYHHHRNIQEDFRPVRFANEFARSKAAGEEVIKLLALSNPQTH-- 162 +++ ++ SS ++Y + D + +A +K A E L+A + + Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171 Query: 163 -FTILRPQGLFGPHDK--VMLPRLLHMIKHYGTLLLPRGGDALVDMTYLENAVHAM---- 215 T LR ++GP + + L + + ++ + G D TY+++ A+ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 216 ---------WLATQSQKTLS---GRAYNITNQQPRPLRTIVQQLLDALDMKCRIRSVPYP 263 W S R YNI N P L +Q L DAL ++ + +P Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 264 MMDIMARAMEKMSNKAEKEPVLTHYAVAKLNFDLTLDTLRAEQELGYRPIISLDEGILRT 323 D VL A DT + +G+ P ++ +G+ Sbjct: 292 PGD-----------------VLETSA----------DTKALYEVIGFTPETTVKDGVKNF 324 Query: 324 ARWLKE 329 W ++ Sbjct: 325 VNWYRD 330
>PF04183#IucA / IucC family Length = 580 Score = 29.8 bits (67), Expect = 0.007 Identities = 14/65 (21%), Positives = 23/65 (35%), Gaps = 9/65 (13%) Query: 54 IQQIGGQQGLPDDNLSAQFRPYLSQSLYNDIQA--ARKQASNRTPAQVNKTQMISGDIFT 111 + Q+ + D + A+ L +L D+Q AR+ S +N D Sbjct: 78 LMQLKQVLSMSDATV-AEHMQDLYATLLGDLQLLKARRGLSASDLINLN------ADRLQ 130 Query: 112 SLREG 116 L G Sbjct: 131 CLLSG 135
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.016 Identities = 9/18 (50%), Positives = 12/18 (66%) Query: 48 LVLLGPSGAGKSSLLRVL 65 +VL G G GKS+L+ L Sbjct: 599 VVLEGTGGIGKSTLINTL 616
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 53.3 bits (128), Expect = 1e-10 Identities = 24/133 (18%), Positives = 57/133 (42%), Gaps = 24/133 (18%) Query: 1 MNNLNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLSKLDANVLITDLSMP 60 M +++ADD + + ++L + + + ++ L ++ D ++++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHYPDLAIIVLTMNNNPAILSSVLDLDIDGIV--LKQGA------ 112 + L+ IK+ PDL ++V++ N + ++GA Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQN-----------TFMTAIKASEKGAYDYLPK 104 Query: 113 PADLPKALAALQK 125 P DL + + + + Sbjct: 105 PFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.2 bits (203), Expect = 3e-18 Identities = 29/109 (26%), Positives = 50/109 (45%) Query: 837 ILVVDDHPINRRLLADQLTTLGYRVITANDGLDALVALNTNTVDMVLTDVNMPNMDGYRL 896 ILV DD R +L L+ GY V ++ + D+V+TDV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 897 TERLRQLNHNFPIIGVTANALAEGKQRCIEAGMDNCLSKPVTLDTLRQM 945 R+++ + P++ ++A + E G + L KP L L + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 31.6 bits (71), Expect = 0.002 Identities = 21/98 (21%), Positives = 35/98 (35%), Gaps = 26/98 (26%) Query: 54 GIFEKKVLDVGCGGGI---LAESMAREGAQVTGLDMGYEPLQVARLHALETGVKLEYVQE 110 GI K G GI +A ++A +GA + +D E L+ K+ + Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLE-----------KVVSSLK 53 Query: 111 TVENHAQQHPQHYDVVTCMEMLEHVPDPASVVRACAQL 148 HA+ P V D A++ A++ Sbjct: 54 AEARHAEAFPA------------DVRDSAAIDEITARI 79
>cloacin#Cloacin signature. Length = 551 Score = 34.3 bits (78), Expect = 6e-05 Identities = 20/50 (40%), Positives = 24/50 (48%) Query: 58 GGNGGQGTLHINNGSGGNGGNGAANNASGGNGGNGGNGATNGGSGGNGGN 107 G + G G NN GG G+G G+G GGNG + GGSG G Sbjct: 32 GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81 Score = 33.9 bits (77), Expect = 6e-05 Identities = 22/64 (34%), Positives = 26/64 (40%), Gaps = 10/64 (15%) Query: 58 GGNGG--QGTLHINNGSGGNGGNGAANNASG--------GNGGNGGNGATNGGSGGNGGN 107 G N G + +IN G G G G A++ SG G G G G GNGG Sbjct: 8 GHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGG 67 Query: 108 GGNG 111 GN Sbjct: 68 NGNS 71 Score = 30.5 bits (68), Expect = 0.001 Identities = 17/62 (27%), Positives = 20/62 (32%) Query: 36 SALDGKSGENGLDGLPDSNCKNGGNGGQGTLHINNGSGGNGGNGAANNASGGNGGNGGNG 95 + L G + G N GG G G GNGG + G GGN Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84 Query: 96 AT 97 A Sbjct: 85 AA 86 Score = 27.4 bits (60), Expect = 0.014 Identities = 16/37 (43%), Positives = 20/37 (54%), Gaps = 1/37 (2%) Query: 70 NGSGGNGGNGAANNASGG-NGGNGGNGATNGGSGGNG 105 +G G G N A++ SG NGG G G G S G+G Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSG 38
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 71.2 bits (174), Expect = 3e-17 Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 15/158 (9%) Query: 12 PSPATTRGEQARQQLLQAAIELFGELGLKGATTRDIAQRAGQNIAAITYYFNSKEGLYLA 71 ++ RQ +L A+ LF + G+ + +IA+ AG AI ++F K L+ Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 72 VAQYIADFIQQAFSPLAQEIDHFLQLPAEHQPPEQQLHYIRQGLLAFSHLMTQPETL-NL 130 + + I + + P L +R+ L+ E L Sbjct: 62 IWELSESNIGELELEYQAKF------------PGDPLSVLREILIHVLESTVTEERRRLL 109 Query: 131 SKIMAREQLSPSEAYPLIHTQAIAP--LHQTLNQLLAA 166 +I+ + E + Q + + Q L Sbjct: 110 MEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKH 147
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 72.2 bits (177), Expect = 4e-16 Identities = 54/261 (20%), Positives = 94/261 (36%), Gaps = 29/261 (11%) Query: 82 NALKQAQANVQSAQAQLALLKAGYREEEIAQVRSEVAQRQAAFD--YADNFLKRQQGLWA 139 N Q + N+ +A+ + A E R E R F + + L Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYE-NLSRVE-KSRLDDFSSLLHKQAIAKHAVLEQ 257 Query: 140 SKAVSA--NELENARTARNQAQANLQAAKDKLAQFLSGNRPQ---EIAQAEANLAQTEAE 194 NEL ++ Q ++ + +AK++ + + ++ Q N+ E Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317 Query: 195 LAQAQLNLQDTILLAPSAGTVLTRAV--EPGTILSASNTVFTVSLTDPVWVRAYVSERHL 252 LA+ + Q +++ AP + V V E G + +A + V D + V A V + + Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377 Query: 253 GQAIPGSEVEVFTDGRPDKPYH---GKIGFVSPTAEFTPKTVETPDLRTDLVYRLRIIIT 309 G G + + P Y GK+ ++ A D R LV+ + I I Sbjct: 378 GFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIE 429 Query: 310 DADES-------LRQGMPVTV 323 + S L GM VT Sbjct: 430 ENCLSTGNKNIPLSSGMAVTA 450
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.014 Identities = 21/91 (23%), Positives = 27/91 (29%), Gaps = 13/91 (14%) Query: 296 PRFEDAFIDLLGGGPDSESALAKIMPRVAGNPGETVIEAQALTKKFGDFAATDHVNFQVK 355 PR E + +LG PD + Q + K + K Sbjct: 548 PRLEKWLVHVLGKTPDD-------------YKPRRLRYLQLVGKYILMGHVARVMEPGCK 594 Query: 356 RGEIFGLLGPNGAGKSTTFKMMCGLLVPSDG 386 L G G GKST + GL SD Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLDFFSDT 625 Score = 30.0 bits (67), Expect = 0.030 Identities = 10/19 (52%), Positives = 12/19 (63%) Query: 40 LVGPDGAGKTTLLRMLAGL 58 L G G GK+TL+ L GL Sbjct: 601 LEGTGGIGKSTLINTLVGL 619
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 50.7 bits (121), Expect = 2e-09 Identities = 34/147 (23%), Positives = 60/147 (40%), Gaps = 1/147 (0%) Query: 197 AREREQGTMEQLLVSPLTTWQIFIGKAVPALIVATFQASIVLLIGIFFYQIPFAGSLALF 256 R Q T E +L + L I +G+ A A + + ++ + SL Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL-SLLYA 150 Query: 257 YGTMLLYGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPIWLQNIT 316 + L GL+ G+++++L + + + P + LSG V PV+ +PI Q Sbjct: 151 LPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAA 210 Query: 317 WINPIRHFTDITKQIYLKDASFDIIWH 343 P+ H D+ + I L D+ H Sbjct: 211 RFLPLSHSIDLIRPIMLGHPVVDVCQH 237
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 80.9 bits (199), Expect = 3e-20 Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 7/191 (3%) Query: 3 KAVLITGCSSGIGLVAAQDLKNRGYRVLAACRKPDDVAKMVQ-LGLEG-----IELDLDD 56 K ITG + GIG A+ L ++G + A P+ + K+V L E D+ D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 57 SASVERAAAQVIELTGGRLYGLFNNGGFGLYGSLHTISRQQLEKQFSTNLFGTHQLTQLL 116 SA+++ A++ G + L N G G +H++S ++ E FS N G ++ + Sbjct: 69 SAAIDEITARIEREMG-PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 117 LPAMLPHGEGRIIQTSSVMGLVSTAGRGAYAASKYALEAWSDALRMELQSSGIHVSLIEP 176 M+ G I+ S V AYA+SK A ++ L +EL I +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPISTHFTQNV 187 G T ++ Sbjct: 188 GSTETDMQWSL 198
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.4 bits (66), Expect = 0.013 Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 12/68 (17%) Query: 20 QSMSVPV-----LFYFWSERSQHCLQLTPTLDKLAAEYAGQFILARVDCDAQPMVASQFG 74 Q PV L Y+W ++ +T + +Y +F +V ++ FG Sbjct: 75 QQQGWPVVGWSSLKYYWKQKDPK--DVTQDTLAIIDKYQAEFGTQKV-----ILIGYSFG 127 Query: 75 LRSIPAVY 82 IP V Sbjct: 128 AEVIPFVL 135
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 29.3 bits (65), Expect = 0.021 Identities = 33/153 (21%), Positives = 61/153 (39%), Gaps = 20/153 (13%) Query: 130 SQRDNINSRLLHIVDEATNPWGIKITRIEIRDVRPP--TELISAMNAQMKAERTKRADIL 187 + RD + RL IV+EA + R P TEL A NA M+AE + Sbjct: 85 ANRDALTQRLKDIVNEA----------LRHNASRTPSATELAHANNAAMQAEDERLRLAK 134 Query: 188 EAEGVRQAAILRAEGEKQSQILKAEGERQSA-------FLQAEARERAAEAEAQATKMVS 240 E R+ A + ++++ + E ER+ A +AE + AA +E ++ Sbjct: 135 AEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIA 194 Query: 241 EAIAAGDIQAINYFVAQKYTDALQHIGSANNSK 273 + + + + T + S+ +++ Sbjct: 195 QKKLSAAQSEVVKMDGEIKT-LNSRLSSSIHAR 226
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 35.4 bits (81), Expect = 0.001 Identities = 46/241 (19%), Positives = 74/241 (30%), Gaps = 19/241 (7%) Query: 88 RKALEAVSGVISADVTLESANVYGKA-DIQTLIAAVEQAGYHATQQGIDSPKT-EPLTHS 145 A EA S V + T E A + + QT + +++ KT E + Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126 Query: 146 AQSQP-------ESLAAAPNTVPATNVALATSTVSDTNTVLPTNTALPTNTTSTTS-TAD 197 +Q P A P V + T A T++ T Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186 Query: 198 TASATSTAPVINPLPVTESVAQPAA-SEGESVQLLLTGMSCASCVSKVQNALQRVDGVQV 256 T T + V NP T + QP SE + S S V+ A Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA--TTSSNDR 1244 Query: 257 ARVNLAERSALVTGTQNNEALIAAVKNAGYGAEIIEDEGERRERQQQ------MSQASMK 310 + V L + ++ T ++A A A + + + E + +S SM Sbjct: 1245 STVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMN 1304 Query: 311 R 311 + Sbjct: 1305 K 1305
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 121 bits (305), Expect = 6e-40 Identities = 48/88 (54%), Positives = 65/88 (73%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIITSVTESLKEGDDVALVGFGTFAVRERSARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F VRER+AR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEISIPAAKVPGFRAGKGLKDAV 89 NPQTG+EI I A+KVP F+AGK LKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.4 bits (73), Expect = 0.010 Identities = 15/76 (19%), Positives = 32/76 (42%), Gaps = 6/76 (7%) Query: 296 DWMLQVPWNSRSKVKKDLVKAQEVLDTDHYGLERVKDRILEYLAVQSRVSKIKGP----- 350 DW+ W+ +++K LV D+ +++ + V+++ P Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596 Query: 351 -ILCLVGPPGVGKTSL 365 + L G G+GK++L Sbjct: 597 YSVVLEGTGGIGKSTL 612
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.4 bits (66), Expect = 0.032 Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 13/72 (18%) Query: 61 RSSLPTPHEIRHHLDDYVIGQEPAKKVLAVAVYNHYKRLRNGDTSNGIELGKSNILLIGP 120 P+ E ++G+ A + +Y RL D +++ G Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITGE 168 Query: 121 TGSGKTLLAETL 132 +G+GK L+A L Sbjct: 169 SGTGKELVARAL 180
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.7 bits (61), Expect = 0.014 Identities = 12/38 (31%), Positives = 19/38 (50%) Query: 2 LKKILFPLLAIFILAGCATTSNTLNVTPKVVLPTQDPT 39 +KK+LF ++ GCA + T+ P V P + T Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETIT 43
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.0 bits (109), Expect = 2e-07 Identities = 45/199 (22%), Positives = 78/199 (39%), Gaps = 15/199 (7%) Query: 221 RNNAWLI-LLLIVFYKMGDAFAASLSTTFLIRGVGFDAGEVGLVNKTLGLIATIIGALYG 279 R+N LI L ++ F+ + + ++S + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GLLMQRLSLFRALMIFGILQAVSNMGYWLLAITDKNIFSMGSAIFLENLCGGMGTAAFVA 339 L +L + R L+ I+ + ++ + FS+ + + G G AAF A Sbjct: 71 KL-SDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWSLFYLFSIAAAIP 394 L+M K F L+ ++ A+G VGP I G WS L + I Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GLLLLYVCRQTLDHTQKTD 413 L+ + ++ + D Sbjct: 182 VPFLMKLLKKEVRIKGHFD 200
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.5 bits (69), Expect = 0.012 Identities = 21/145 (14%), Positives = 47/145 (32%), Gaps = 18/145 (12%) Query: 281 ITIAAGILNFVVITASVSAINSDVFGVGRMLNGMAEQGHAPKAFTAISKRGVPWVTVLVM 340 + A I+ + +S +++ AEQ + P + ++ +V Sbjct: 29 VVSTALIVALSAMLMGLSDYY--FEHFSKLMLIPAEQSYLPFSQA---------LSYVVD 77 Query: 341 MCAMLIAVYLNYIMPENVFLVIASLATFATVWVWIMILFSQIAFRRSLSK-DQVKALDFP 399 + L +A+L A+ V L S A + + K + ++ Sbjct: 78 NVLLEFFYLCF------PLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI 131 Query: 400 LRGGTFTSVLAIIFLVFIIGLIGWF 424 + L I V ++ ++ W Sbjct: 132 FSIKSLVEFLKSILKVVLLSILIWI 156
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 140 bits (355), Expect = 1e-38 Identities = 94/404 (23%), Positives = 167/404 (41%), Gaps = 17/404 (4%) Query: 18 LSLATFMQVLDSTIANVAIPTIAGDLGSSNSQGTWVITSFGVANAISIPVTGWLAKRVGE 77 L + +F VL+ + NV++P IA D + WV T+F + +I V G L+ ++G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 78 VRLFLWSTGLFVLASWLCGMSNS-LGMLIFFRVIQGLVAGPLIPLSQSLLLNNYPPAKRS 136 RL L+ + S + + +S +LI R IQG A L ++ P R Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 137 MALALWSMTIVVAPIFGPILGGYISDNYHWGWIFFINIPIGLVVVLLAGSTLKGRETKTE 196 A L + + GP +GG I+ HW + + IP+ ++ + L +E + + Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIK 196 Query: 197 IRPIDTIGLVLLVVGIGALQIMLDQGKELDWFNSTEIIVLTVVAVVAITFLIVWELTDDH 256 D G++L+ VGI + ML F ++ I +V+V++ + Sbjct: 197 -GHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245 Query: 257 PVIDLSLFKSRNFTIGCLCLSLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGILP 316 P +D L K+ F IG LC + + G + ++P ++++V+ + G G + Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305 Query: 317 VLLS-PLIGRFAHRIDMRQLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFFQGFAIA 375 V++ + G R ++ +V F ++ E F G + Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFT 365 Query: 376 CFFMPLTTITLSGLPPERMAAASSLSNFMRTLAGSIGTSITTTL 419 ++TI S L + A SL NF L+ G +I L Sbjct: 366 K--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 1e-04 Identities = 16/54 (29%), Positives = 22/54 (40%) Query: 812 VLVRSDLKGLGLGRALLEKMIRYARSHGLSRLTAVTMPNNRGMIGLAQKLGFTI 865 + V D + G+G ALL K I +A+ + L T N K F I Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 219 bits (560), Expect = 4e-66 Identities = 115/462 (24%), Positives = 215/462 (46%), Gaps = 48/462 (10%) Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGHAIQTAGTVKGRGSSHHAKSDWMEMEKQRGISIT 71 K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT----TRTDNTLLERQRGITIQ 57 Query: 72 TSVMQFPYGGCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131 T + F + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 132 LRDTPILTFMNKLDREIRDPMEVLDEVERELNIACSPITWPIGCGKSFKGVYHLHKDETY 191 P + F+NK+D+ D V +++ +L+ K + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159 Query: 192 LYQSGKGHTIQEVRIVKGLNNPDLDVAVGEDLAKQFRQELELVQGASHEFDHEAFLSGDL 251 LY + E + + +DL +++ L + + F + L Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN------DDLLEKYMSGKSLEALELEQEESIRFHNCSL 213 Query: 252 TPVFFGTALGNFGVDHMLDGLVEWAPAPMPRKTDTRVVVASEEKFTGFVFKIQANMDPKH 311 PV+ G+A N G+D++++ + + R + + G VFKI+ K Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSSTHR---------GQSELCGKVFKIE--YSEK- 261 Query: 312 RDRVAFMRVVSGRFEKGMKLRQVRTKKDVVISDALTFMAGDRSHVEEAYAGDIIGLHNHG 371 R R+A++R+ SG +R + K+ + I++ T + G+ +++AY+G+I+ L N Sbjct: 262 RQRLAYIRLYSGVLHLRDSVR-ISEKEKIKITEMYTSINGELCKIDKAYSGEIVILQNEF 320 Query: 372 ---TIQIGDTFTQGEDMKFTGIPNFAPELFRRIRLRDPLKQKQLLKGLVQLSEEG-AVQV 427 +GDT + + I N P L + P +++ LL L+++S+ ++ Sbjct: 321 LKLNSVLGDTKLLPQRER---IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377 Query: 428 FRPLSNNDLIVGAVGVLQFEVVSSRLKSEYNVEAVYESVNVS 469 + + +++I+ +G +Q EV + L+ +Y+VE + V Sbjct: 378 YVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVI 419
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 47.2 bits (112), Expect = 2e-09 Identities = 22/80 (27%), Positives = 33/80 (41%), Gaps = 1/80 (1%) Query: 62 DEATLFNIAIDPQYQRQGYGRLLLEHLIEQLEARNIVTLWLEVRASNARAIALYESLGFN 121 A + +IA+ Y+++G G LL IE + + L LE + N A Y F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 122 EVSVRRNYYPS-ANGREDAI 140 +V Y + E AI Sbjct: 148 IGAVDTMLYSNFPTANEIAI 167
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.017 Identities = 8/38 (21%), Positives = 14/38 (36%), Gaps = 2/38 (5%) Query: 32 HLPEDTRLLIVA--QQLPEHGDPLLCDVLRSLGLTPHQ 67 L D +++A + E+ PL + GL Sbjct: 358 WLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAET 395
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 31.3 bits (70), Expect = 0.006 Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 18/105 (17%) Query: 106 WGTSGSSTVLVNAANFTAENLTIRNDFDFPANQAKAEGDPTKLKDTQAVALLLAEKSDKA 165 + S S + VNA N I+ + N+ + E K KD+ + ++ Sbjct: 21 FAISSSQAIEVNAMNEHYTESDIKRNHKTEKNKTEKE----KFKDSINNLVKTEFTNETL 76 Query: 166 RFRQVKLEGYQDTL----------YSKTGSRSYFTDCDISGHVDF 200 K++ QD L YS+ G YFTD D+ H + Sbjct: 77 ----DKIQQTQDLLKKIPKDVLEIYSELGGEIYFTDIDLVEHKEL 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 80.3 bits (198), Expect = 1e-17 Identities = 36/173 (20%), Positives = 65/173 (37%), Gaps = 14/173 (8%) Query: 695 HILLVDDSETNRDITGMMLQQLGHQVTLADSGTTALAIGRQHRFDLVLMDIRMPVLDGLA 754 IL+ DD R + L + G+ V + + T DLV+ D+ MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 755 TTARWRHDPANIDSHCMITALSANASPDEQIKTSQAGMNHYLSKPVTLGQLAEMLDLTAQ 814 R + + +SA + IK S+ G YL KP L E++ + + Sbjct: 65 LLPRIK----KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGR 117 Query: 815 FQLERGVDLSPQLSEPQPLLDL-ADSALSLKLYQSLQVLIQQAKDAIENLPVL 866 E S + Q + L SA ++Y+ ++ + +L ++ Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYR----VLARLMQT--DLTLM 164
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 58.7 bits (142), Expect = 2e-12 Identities = 25/127 (19%), Positives = 53/127 (41%), Gaps = 3/127 (2%) Query: 3 TKLLIVDDHELIIHGIKNMLAAYPRYLIVGQADNGLEVYNLCRQTEPDMVILDLGLPGMD 62 +L+ DD I + L+ V N ++ + D+V+ D+ +P + Sbjct: 4 ATILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GLDVIIQLLRRWPAMKILTLTARNEEHYASRTFNSGALGYVLKKSPQQILMAAIQTVAIG 122 D++ ++ + P + +L ++A+N A + GA Y+ K L+ I A+ Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-ALA 120 Query: 123 KRYIDPA 129 + P+ Sbjct: 121 EPKRRPS 127
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 31.1 bits (70), Expect = 0.008 Identities = 7/43 (16%), Positives = 18/43 (41%) Query: 293 AYGAPKAITSFVVPTGYSFNLDGSTLYQSIAAIFIAQLYGIEL 335 + A + + TGY + +T+++S I + ++ Sbjct: 186 SNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQY 228
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.007 Identities = 13/43 (30%), Positives = 19/43 (44%), Gaps = 7/43 (16%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERTTTGDIYIGDQRVTDLEPKD 75 +V+ G G GKSTL+ + GL+ + D KD Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKD 634
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 33.9 bits (77), Expect = 0.001 Identities = 41/173 (23%), Positives = 70/173 (40%), Gaps = 13/173 (7%) Query: 136 GRLLSQPFNSSTPVLYYNKEAFKKAGLDPEQPPKTWQELAADTAKLRAAGSSCGYASGWQ 195 G+L++ P L YNK+ L P PPKTW+E+ A +L+A G S + + Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 196 GWIQIENFSAWHGQPIASRNNGFDGTDAVLEFNKPLQVKHIQLLSDMNKKGDFTYFGRKD 255 + +A G N +D D + + + L D+ K Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKD--VGVDNAGAKAGLTFLVDLIKNKHMNADTDYS 237 Query: 256 ESTSKFYNGDCAITTASSGSLASIRHYAKFNFGVGMMPYDADAKNAPQNAIIG 308 + + F G+ A+T + ++I +K N+GV ++P K P +G Sbjct: 238 IAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.8 bits (62), Expect = 0.019 Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 10/83 (12%) Query: 1 MKKTVIAIITMATLTSTAAYANTIEKDIRVEAEIISLMDVKRADDSNINKIKLTYDTVTN 60 MKK++IA+ A + A D+ + I + ++ R+ N + T Sbjct: 1 MKKSLIALTLAALPVAAMA-------DVTLYGTIKAGVETSRSVAHNGAQ---AASVETG 50 Query: 61 DGTYSHSEAIKVKARKQLGDKLK 83 G I K ++ LG+ LK Sbjct: 51 TGIVDLGSKIGFKGQEDLGNGLK 73
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 72.2 bits (177), Expect = 4e-15 Identities = 40/265 (15%), Positives = 80/265 (30%), Gaps = 27/265 (10%) Query: 436 SLARYQSPYVS----RYAPDSGST---SGSYTRRIGPTQLSYQFNQYRNNRQHRIQSGWD 488 L R + Y+S Y S + ++ +N Q G D Sbjct: 536 QLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK----GRD 591 Query: 489 WQLPQFNLALSLGLQNGGQWNSHNNYGVFLNTTLSFGQSNASINTAYTQQQLNTSASYQK 548 L +++ W ++ + + + S+ S + + Sbjct: 592 ---QMLALNVNIPF---SHWLRSDSKSQWRHASASYSMS--HDLNGRMTNLAGVYGTLLE 643 Query: 549 EFIDNYGASTLGVSGSASGKLNSVGGFAKRSGSRGDISGRVGIDNQITNGGISYNGMLAL 608 + +Y T G ++ G G+ + + I +G + Sbjct: 644 DNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLA 703 Query: 609 SSQGVALGRSSYSGAALLIKAPALGGTPYSFHVEDSPI--TGGGTYAIPVPRYQDRFFVR 666 + GV LG+ + +L+KAP VE+ T YA+ +P + R Sbjct: 704 HANGVTLGQPL-NDTVVLVKAPGAKDAK----VENQTGVRTDWRGYAV-LPYATEYRENR 757 Query: 667 THTDRSDMDMNIQLPVNIVRAHPGQ 691 D + + N+ L + P + Sbjct: 758 VALDTNTLADNVDLDNAVANVVPTR 782
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 65.8 bits (160), Expect = 7e-15 Identities = 31/195 (15%), Positives = 67/195 (34%), Gaps = 8/195 (4%) Query: 70 ITQNIIEPAVEQRVNQPDDIVDLPTLPEQPEGQREITRKEPIKVKRPAENRATSRKPVNK 129 I+ ++ PA + + PE KE V + KP Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK-PKPKPKPKPV 108 Query: 130 ETQESDSKQSSPAAAASAMLSGTSQQVAAAVNSDSSHRQQAQVSWKSRLQGHLMGFKRYP 189 + E + P + A + ++ ++ + S S + +YP Sbjct: 109 KKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYP 168 Query: 190 SSARKQQQQGTAMIRFVVDKNGYVSSVQLSHSSGTSALDREALAIIKRAQPLPKPPAELL 249 + A+ + +G ++F V +G V +VQ+ + + +RE ++R + P P Sbjct: 169 ARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGS-- 226 Query: 250 SQGQITLSLPVDFNL 264 + + + F + Sbjct: 227 -----GIVVNILFKI 236
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 354 bits (909), Expect = e-120 Identities = 91/424 (21%), Positives = 172/424 (40%), Gaps = 8/424 (1%) Query: 25 RYLNIGGGLVVIGFIGFLLWAGLAPLDKGVAVTGLLVVAENRKVIQPLQGGRIQQLHVTE 84 R + ++ + + + L ++ G L + K I+P++ ++++ V E Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114 Query: 85 GDEIVSGQLLVTLDDTAIRNQRDNLQHQYLSALAQEARLTAEQNDLDVITFPQALLEH-- 142 G+ + G +L+ L Q L A ++ R +++ P+ L Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174 Query: 143 ATQPAVERNIILQQQLLHHRRQAHLSEIARLSTQLTRHQARLDGLQAMRSNHQRQSNLFQ 202 Q E ++ L+ + ++ + L + +A + A + ++ S + + Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234 Query: 203 QQLDSVQLLAKDGYIAKNKLLEMESQLTSLQARVEQGTSDIAEAHKLIDETEQHVLQRRE 262 +LD L IAK+ +LE E++ + S + + I ++ + Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294 Query: 263 QYQSENSEQLAKAQQNTQELVQRLNIAEYELSHTRIFAPVSGSVIALAQHTVGGVVSSGQ 322 +++E ++L + N L L E + I APVS V L HT GGVV++ + Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354 Query: 323 ALMEIVPSGQPLFVEAQLPVELIDKVTVGLPVDLNFSAFNQSNTPRLQGSVWRIGADRIQ 382 LM IVP L V A + + I + VG + AF + L G V I D I+ Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414 Query: 383 PPPTSPPYYPLTVAIDL-----DPTELAIRPGMAVDVFIRTGERSLLSYLFKPFTDRLHL 437 + + ++I+ + + GMAV I+TG RS++SYL P + + Sbjct: 415 DQRLGLVFNVI-ISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTE 473 Query: 438 ALAE 441 +L E Sbjct: 474 SLRE 477
>PF06438#Heme acquisition protein HasAp Length = 205 Score = 229 bits (586), Expect = 2e-79 Identities = 66/214 (30%), Positives = 104/214 (48%), Gaps = 18/214 (8%) Query: 1 MSTTIQYNSNYADYSISSYLREWANNFGDIDQAPAETKDRGSFSG-SSTLFSGTQYAIGS 59 MS +I Y++ Y+ ++++ YL +W+ FGD++ P + D + G + F G+QYA+ S Sbjct: 1 MSISISYSTTYSGWTVADYLADWSAYFGDVNHRPGQVVDGSNTGGFNPGPFDGSQYALKS 60 Query: 60 SHSNPEGMIAEGDLKYSFM--PQHTFHGQIDTLQFGKDLATNAGGPSAGKHLEKIDITFN 117 + S+ IA GDL Y+ P HT G++D++ G L G S G L+ +++F+ Sbjct: 61 TASDA-AFIAGGDLHYTLFSNPSHTLWGKLDSIALGDTL--TGGASSGGYALDSQEVSFS 117 Query: 118 ELDLSGEFDSGKSMTENHQGDMHKSVRGLMKGNPDPMLEVMKAKGINVDTAFKDLSIASQ 177 L L G+ G +HK V GLM G+ + + A VD + S Q Sbjct: 118 NLGLDSPIAQGRD------GTVHKVVYGLMSGDSSALQGQIDALLKAVDPSLSINSTFDQ 171 Query: 178 SPDSGYMSDAPM-----VDTVGVVDC-HDMLLAA 205 +G P V VGV + HD+ LAA Sbjct: 172 LAAAGVAHATPAAAAAEVGVVGVQELPHDLALAA 205
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 42.1 bits (99), Expect = 1e-06 Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 18/138 (13%) Query: 133 VQTLLAAGYMPIISSIG----ITVEGQLMNVNA----DQAATALAATLGAD-LILLSDVS 183 ++ L+ G + I S G I +G++ V A D A LA + AD ++L+DV+ Sbjct: 179 IKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238 Query: 184 GILDGKG----QRIAEMTAQKAEQLIAQGIITDG-MVVKVNAALDAARSLGRPVDIASWR 238 G G Q + E+ ++ + +G G M KV AA+ G IA Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHL- 297 Query: 239 HSEQLPALFNGVPIGTRI 256 E+ G GT++ Sbjct: 298 --EKAVEALEG-KTGTQV 312