>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 63.3 bits (154), Expect = 8e-14 Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 10/128 (7%) Query: 8 VLIIEDESELARLHAELVQKHPRLRLAGM----AASLAQARQLLHATPPQLVLLDNYLPD 63 +L+ +D++ + + + L AG ++ A + + A LV+ D +PD Sbjct: 6 ILVADDDAAIRTVLNQA------LSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59 Query: 64 GKGVTLMTDPALATSQCSVIFITAASDMETCSQAIRNGAFDYILKPVSWKRLSQSLERFI 123 L+ A V+ ++A + T +A GA+DY+ KP L + R + Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 124 QFYDQQRE 131 ++ Sbjct: 120 AEPKRRPS 127
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 31.4 bits (71), Expect = 0.019 Identities = 26/171 (15%), Positives = 50/171 (29%), Gaps = 9/171 (5%) Query: 578 LVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYH-EGLDAFEHTC 636 L+M L + R+D G + + + + LV + + + Sbjct: 167 LLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGS 226 Query: 637 PTGRTVYDSVHDELINYLAAPESTDGFDDLIKSCRQQHDALKAQLEQGRDRLLEI-HSNG 695 V D + P S +IK +Q Q QG +++ + ++ Sbjct: 227 MVANVVADE-RTNAVLVSGEPNSRQRIIAMIKQLDRQ------QATQGNTKVIYLKYAKA 279 Query: 696 GEKAQALAESIEEQDDDTSLIAFSMNLFDIVGINQDDRGENLIVLTPSDHM 746 + + L + L + I + LIV D M Sbjct: 280 SDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVM 330
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 27.8 bits (62), Expect = 0.048 Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 2/76 (2%) Query: 95 RALLEKTEHALHQHSMITILIGRFVGPTRPLVPMVAGMLDLPVAKFVLPNIIGCLLWPPL 154 R LEK + Q + +L G + + PM+A + +L AK ++ LL + Sbjct: 362 RLCLEKQDIFRTQ--LRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGV 419 Query: 155 YFLPGILAGAAIDIPA 170 I G ++IP+ Sbjct: 420 DVSDSIEVGIMVEIPS 435
>NEISSPPORIN#Neisseria sp. porin signature. Length = 348 Score = 36.5 bits (84), Expect = 2e-04 Identities = 17/81 (20%), Positives = 30/81 (37%), Gaps = 2/81 (2%) Query: 367 YHAGEHYQ-GNWFPAYGLLPRWHHASNHACEKPAGLETVTLTYYRDHVEHRVIGGIMRDL 425 YH G +YQ +F Y L + + E ++ + HR++GG + Sbjct: 180 YHVGLNYQNSGFFAQYAGLFQRYGEGTKKIEYDDQTYSIPSLFVEKLQVHRLVGGYDNNA 239 Query: 426 LAAHQVKLEIQELEYDAWHRG 446 L V + Q+ + G Sbjct: 240 LYV-SVAAQQQDAKLYGAMSG 259
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.9 bits (75), Expect = 0.002 Identities = 41/187 (21%), Positives = 67/187 (35%), Gaps = 17/187 (9%) Query: 16 AAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWVGLFYTVNAIAGILVSLWLAKRSDS 75 AA M V F+M + G + A +F +G+ I L + + Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272 Query: 76 RGDRRRLIMFCCLMAVGNALLFAFNRHYLTLITCGVMLASIANAAMPQLFALAREYADSS 135 R RR +M + +L AF V+LAS MP L A+ D Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG-IGMPALQAMLSRQVDEE 331 Query: 136 AREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTTMFSIAAG-----IFVISLALIAI 190 + + S SL ++GP L FT +++ + ++ AL + Sbjct: 332 RQGQLQGSLAALT--SLTSIVGPLL---------FTAIYAASITTWNGWAWIAGAALYLL 380 Query: 191 KLPSVPR 197 LP++ R Sbjct: 381 CLPALRR 387
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 61.0 bits (148), Expect = 3e-12 Identities = 71/415 (17%), Positives = 132/415 (31%), Gaps = 46/415 (11%) Query: 30 LSVGTMINYLDRTILGI---VAPQLSKEIHID---PAMMGIIFSAFAWTYALAQIPGGMF 83 L V LD +G+ V P L +++ A GI+ + +A G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 84 LDRFGNKVTYALSIFFWSLFTLLQSFTLGLKSLLLLRLGLGVSEAPCFPANSRIVSTWFP 143 DRFG + +S+ ++ + + L L + R+ G++ A ++ Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGAT-GAVAGAYIADITD 125 Query: 144 QHERARA----TATYTVGEYIGLAAFSPLLFLILEHHGWRTLFFLTGGLGILFTLVWWRF 199 ERAR +A + G G P+L ++ FF L L L Sbjct: 126 GDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180 Query: 200 YHEPHESRTANQAELEYIGANGINNKIQNVPFNWRDARRLLGCRQILGASLGQFAGNTTL 259 E H+ +N F W ++ + + Sbjct: 181 LPESHKGERRPLRREA------LNPLAS---FRWARGMTVVAALMAVFFIMQLVGQ---- 227 Query: 260 VFFLTWFPSYLANERHLPWLHVGFFATWPFLAAAIGILFGGWISDRLLKRTGSVNISRKL 319 + + + H +G + L I+ + R G Sbjct: 228 -VPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQAMITGPVAARLGERRA---- 279 Query: 320 PIISGLLLSSC--IIAANWVSANSTVIIIMSVAFFGQGMVGLGWTLISDIAPENMAGLTG 377 ++ G++ I+ A I++ +A G GM L ++S E G Sbjct: 280 -LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEERQGQLQ 337 Query: 378 GIFNFCANMASIIAPLIIGVIISATGNFFYALIYVGLTALIGVIAYIFIIGDIKR 432 G ++ SI+ PL+ I +A+ + G + G Y+ + ++R Sbjct: 338 GSLAALTSLTSIVGPLLFTAIYAAS-----ITTWNGWAWIAGAALYLLCLPALRR 387
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 6e-04 Identities = 44/285 (15%), Positives = 86/285 (30%), Gaps = 40/285 (14%) Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVSVGDKTETGKLIMIFD---------S 76 + V +T G S E+ + +VKEI V G+ G +++ Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 77 AEGAAAAAPAQEEKKEAAPAAAAPAAAAAAKEVHVPD---IGGDEVEVTEIMVKVG-DTI 132 + + A ++ + + + K P + +EV ++K T Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198 Query: 133 AAEQSLITVEGDKASMEVPAPFAGTVKEIKINTGDKVSTGSLIMIFEVAGAAPAAAPAQA 192 ++ + DK E A + ++ +K + A A Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL-----LHKQAIAKHA 253 Query: 193 AAPAAAAPAAAAGVKDVNVPDIGGDEVEVTEVMVK-----------VGDKVA-------- 233 A V + E E+ + + DK+ Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 234 AEQSLITVEGDKASMEVPAPFAGTVKEIKIST-GDKVKTGSLIMV 277 L E + + + AP + V+++K+ T G V T +MV Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Score = 32.5 bits (74), Expect = 0.006 Identities = 17/96 (17%), Positives = 31/96 (32%), Gaps = 5/96 (5%) Query: 17 ITEILVKVGDKVEAEQSLITV-----EGDKASMEVPSPQAGVVKEIKVSVGDKTETGKLI 71 + EI+VK G+ V L+ + E D + QA + + + E KL Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166 Query: 72 MIFDSAEGAAAAAPAQEEKKEAAPAAAAPAAAAAAK 107 + E +E + + + K Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202 Score = 29.8 bits (67), Expect = 0.033 Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 2/64 (3%) Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGTVKEIKISTGDKVKTGSLIMVFEVEGAAPAAAP 289 + VA +T G S E+ VKEI + G+ V+ G +++ GA Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 290 AQAA 293 Q++ Sbjct: 139 TQSS 142
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 55.4 bits (133), Expect = 8e-11 Identities = 51/213 (23%), Positives = 99/213 (46%), Gaps = 14/213 (6%) Query: 322 SAAAKLSASGEQQAYAIGASMGSEALNVLTTRRTQGVTVDAGLVLQGIEDAFRG-QLRLG 380 + A L+ ++ +Y+IGA +G + QG+ ++ ++ +G++D G QL L Sbjct: 22 TDATSLTTDKDKLSYSIGADLGKNF-------KNQGIDINPDVLAKGMQDGMSGAQLILT 74 Query: 381 EQER----NKALFDVSQQVFQNLNKIEQKNISAGKKYQQAFARKKDVV-FKEGVYSRVDY 435 E++ +K D+ + NK ++N + G + A K +V G+ ++ Sbjct: 75 EEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIID 134 Query: 436 LGKG-KISGNDLVTVVIKEMLTDGTVINDMEAKDQALTQKLDAYPPVFREPLKRLQNHGS 494 G G K +D VTV L DGTV + E + T ++ P + E L+ + + Sbjct: 135 AGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGST 194 Query: 495 VTLVVPPEKAYGSKGLPPKIPPGATMVYSVRIV 527 + VP + AYG + + I P T+++ + ++ Sbjct: 195 WEVFVPADLAYGPRSVGGPIGPNETLIFKIHLI 227
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 271 bits (694), Expect = 2e-93 Identities = 138/277 (49%), Positives = 168/277 (60%), Gaps = 15/277 (5%) Query: 1 MTTLAALSLHFPFVWYGFLLLFGLALGSFYNVVIYRLPRML---------------TQTA 45 M L L+ P++++ + LF L +GSF NVVI+RLP ML + Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGV 60 Query: 46 DDERITLSTPGSSCPQCRQPISWRDNIPLLSFLWLGRRARCCQAPIAWSYPLTELATGLL 105 D+ L P S CP C PI+ +NIPLLS+LWL R R CQAPI+ YPL EL T LL Sbjct: 61 DEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALL 120 Query: 106 FILAGALLAPGLPLAGGLVLLSFLLILARIDARTQLLPDRLTLPLLWAGLLFNLNEVYIA 165 + LAPG L+L L+ L ID LLPD+LTLPLLW GLLFNL +++ Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVS 180 Query: 166 LPDAVAGAMAGYLALWSVYWLFRLLTGKEALGYGDFKLLAALGAWCGWQVLPQVLLLASA 225 L DAV GAMAGYL LWS+YW F+LLTGKE +GYGDFKLLAALGAW GWQ LP VLLL+S Sbjct: 181 LGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSL 240 Query: 226 SGLVWTLLQRLWTRQSLQQPLAFGPWLALAGGGIFLW 262 G + L +P+ FGP+LA+AG LW Sbjct: 241 VGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 33.3 bits (76), Expect = 3e-04 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 5/63 (7%) Query: 4 KMRGFTLIETLLALAILAVLSAAAV-MVLQNVIRADGLTREKSQ-QIAALQRAFRQIADD 61 K RGFTL+E ++ + I+ VL++ V ++ N +AD ++K+ I AL+ A D Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD---KQKAVSDIVALENALDMYKLD 62 Query: 62 VTH 64 H Sbjct: 63 NHH 65
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 32.2 bits (73), Expect = 2e-04 Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 8/99 (8%) Query: 1 MKREAGMTLIEVMVALVIF-ALAGLAV---MQSTLQQTRQLGRMEEKILASWLADNQLVQ 56 ++ G TL+E+MV +VI LA L V M + + +Q + L + L +L Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 57 LRLEKRWPALS--WSETTVEAAGTRWFVRWQGVETALPQ 93 L T+ + +G LP Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANY--NKEGYIKRLPA 100
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 177 bits (450), Expect = 1e-59 Identities = 98/164 (59%), Positives = 125/164 (76%) Query: 1 MSQRGFTLLEMMLVLLLIGVSASMVLLAFPSARTQEATQILARFQTQLDFVRERGQQTGQ 60 M QRGFTLLEMML+LLL+GVSA MVLLAFP++R A Q LARF+ QL FV++RG QTGQ Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60 Query: 61 LFGIIIHPERWQFMRLQPADDSAPAAADDRWGNAQWLPLQAGRVTTAETLPRARLTLRFP 120 FG+ +HP+RWQF+ L+ D + PA ADD W +WLPL+AGRV T+ ++ +L L F Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAFA 120 Query: 121 DGQAWTPGEQPDVLIFPGGEVTPFQLRIDAATGINVDAQGDSQP 164 G+AWTPG+ PDVLIFPGGE+TPF+L + A GI +A+G+S P Sbjct: 121 QGEAWTPGDNPDVLIFPGGEMTPFRLTLGEAPGIAFNARGESLP 164
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 243 bits (621), Expect = 2e-86 Identities = 98/140 (70%), Positives = 112/140 (80%) Query: 1 MQRQRGFTLLEIMVVIVILGILASLVVPNLMGNKEKADRQKVVSDLVALEGALDMYKLDN 60 +QRGFTLLEIMVVIVI+G+LASLVVPNLMGNKEKAD+QK VSD+VALE ALDMYKLDN Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 61 SRYPNTEQGLQALVTAPAAEPHARNYPEGGYIRRLPQDPWGNEYQLLSPGQHGAIDVFSV 120 YP T QGL++LV AP P A NY + GYI+RLP DPWGN+Y L++PG+HGA D+ S Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSA 123 Query: 121 GPDGMPDTNDDIGNWTLGKK 140 GPDG T DDI NW L KK Sbjct: 124 GPDGEMGTEDDITNWGLSKK 143
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 512 bits (1321), Expect = 0.0 Identities = 277/407 (68%), Positives = 335/407 (82%), Gaps = 4/407 (0%) Query: 1 MALFRYQALDAQGKTRRGLQQADSARHARQLLRDKGWLALEVTTADPARRLWAGGSLT-- 58 MA + YQALDAQGK RG Q+ADSAR ARQLLR++G + L V ++ L+ Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 59 --RRTSAGDLALLTRQLATLVAAGIPLEKALDAVAQQCEKPSLRTLMAGVRSKVLEGHSL 116 R S DLALLTRQLATLVAA +PLE+ALDAVA+Q EKP L LMA VRSKV+EGHSL Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 117 AEAMRGYPACFDGLFCAMVAAGETSGHLDGVLNRLANYTEQRQQLRARLLQAMIYPIVLT 176 A+AM+ +P F+ L+CAMVAAGETSGHLD VLNRLA+YTEQRQQ+R+R+ QAMIYP VLT Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 177 LVAISVIAILLSTVVPKVVEQFVHLKQALPFSTRLLMSLSDIVRSAGPWLALLSLLALLA 236 +VAI+V++ILLS VVPKVVEQF+H+KQALP STR+LM +SD VR+ GPW+ L L +A Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 237 LRYLLRQPARRLAWDRMLLRLPVIGRVARSVNSARYARTLSILNASAVPLLLSMRISADV 296 R +LRQ RR+++ R LL LP+IGR+AR +N+ARYARTLSILNASAVPLL +MRIS DV Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 297 LSNAWARSQLAAASESVREGVSLHRALESTALFPPMMRYMIASGEQSGELTAMLERAAEN 356 +SN +AR +L+ A+++VREGVSLH+ALE TALFPPMMR+MIASGE+SGEL +MLERAA+N Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 357 QDRELSAQIQMALSLFEPLLVVTMAGMVLFIVLAILQPILQLNTLMS 403 QDRE S+Q+ +AL LFEPLLVV+MA +VLFIVLAILQPILQLNTLMS Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 839 bits (2169), Expect = 0.0 Identities = 606/646 (93%), Positives = 631/646 (97%) Query: 10 ALLILTPLLFSPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 69 LLI LLF PAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72 Query: 70 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRAKDAKTSAVPVASAAAPGEGDEVVTRVV 129 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVR+KDAKT+AVPVAS AAPG GDEVVTRVV Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVV 132 Query: 130 PLTNVAARDLAPLLRQLNDNAGAGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 189 PLTNVAARDLAPLLRQLNDNAG GSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR Sbjct: 133 PLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192 Query: 190 SVVTVPLSWASAAEVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 249 SVVTVPLSWASAA+VVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI Sbjct: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 252 Query: 250 IAMIKQLDRQQAVQGNTKVIYLKYAKAADLVEVLTGISSSLQSDKQSARPVAAIDKNIII 309 IAMIKQLDRQQA QGNTKVIYLKYAKA+DLVEVLTGISS++QS+KQ+A+PVAA+DKNIII Sbjct: 253 IAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIII 312 Query: 310 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 369 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN Sbjct: 313 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 372 Query: 370 AGMTQFTNSGLPISTAIAGANQYNKDGTISSSLASALGSFNGIAAGFYQGNWAMLLTALS 429 AGMTQFTNSGLPISTAIAGANQYNKDGT+SSSLASAL SFNGIAAGFYQGNWAMLLTALS Sbjct: 373 AGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALS 432 Query: 430 SSTKNDILATPSIVTLDNMQATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 489 SSTKNDILATPSIVTLDNM+ATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 492 Query: 490 QINEGDAVLLEIEQEVSSVADSASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKT 549 
QINEGD+VLLEIEQEVSSVAD+ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDK+ Sbjct: 493 QINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552 Query: 550 VTDTADKVPLLGDIPVIGALFRSDSKKVSKRNLMLFIRPTIIRDRDEYRQASSGQYTAFN 609 V+DTADKVPLLGDIPVIGALFRS SKKVSKRNLMLFIRPT+IRDRDEYRQASSGQYTAFN Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612 Query: 610 NAQTKQRGKESSEASLSNDLLHIYPQQETQAFRQVSAAIDAFNLGG 655 +AQ+KQRGKE+++A L+ DLL IYP+Q+T AFRQVSAAIDAFNLGG Sbjct: 613 DAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFRQVSAAIDAFNLGG 658
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 213 bits (544), Expect = 7e-71 Identities = 98/266 (36%), Positives = 159/266 (59%), Gaps = 7/266 (2%) Query: 17 KLLPQIVTLIILITAIPQLAKLTWRVVFPVSPEDISALPLTMPPAADPELKNVRPAFTLF 76 ++ +I+ ++++ QLA + WR+ P ++ + + PA + FTLF Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLP---DNAPVSSVQITPAQARQQPVTLNDFTLF 68 Query: 77 GLAVKISPTPT-DAASLNQVPVSSLKLRLAGLLASSNPARSIAIIEKGNQQVSLSTGDPL 135 G++ + + DA+ ++ +P S+L L L G++A + +RSIAII K N+Q S + + Sbjct: 69 GVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEV 128 Query: 136 PGYDARIAAILPDRIIVNYQGRKEAILLFNDSRAPSPPPTAAGNPPLVKRLREQPQNILT 195 PGY+A+I +I PDR+++ YQGR E + L++ + S G + + + Sbjct: 129 PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVP--GAQVNEQLQQRASTTMSD 186 Query: 196 YLSISPVLSGDKLLGYRLNPGKDASLFRQSGLQANDLAIALNGIDLRDQEQAQQALQNLA 255 Y+S SP+++ +KL GYRLNPG + F + GLQ ND+A+ALNG+DLRD EQA++A++ +A Sbjct: 187 YVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMA 246 Query: 256 DMTEITLTVEREGQRHDIAFAL-GDE 280 D+ TLTVER+GQR DI GDE Sbjct: 247 DVHNFTLTVERDGQRQDIYMEFGGDE 272
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 430 bits (1107), Expect = e-155 Identities = 213/291 (73%), Positives = 244/291 (83%) Query: 6 LITRRRLLIAMALSPLLWQMRGAQAADVDPQRVVALEWLPAELLLALGVTPYGVADIPNY 65 LI+RRRLL AMALSPLLWQM A AA +DP R+VALEWLP ELLLALG+ PYGVAD NY Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65 Query: 66 RLWVNEPALPDSVIDVGLRTEPNLELLTQMKPSFIVWSAGYGPSPEKLARIAPGRGFTFS 125 RLWV+EP LPDSVIDVGLRTEPNLELLT+MKPSF+VWSAGYGPSPE LARIAPGRGF FS Sbjct: 66 RLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFS 125 Query: 126 DGKRPLAMAQRSLLDMADLLGKTQQAKRHLAEFDALMESLRPRFAGRGDRPLLMISLLDP 185 DGK+PLAMA++SL +MADLL A+ HLA+++ + S++PRF RG RPLL+ +L+DP Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185 Query: 186 RHVLVFGENCLFQEVLDRFGIKNAWHGEAAFWGSVSVGIDRLAAFNEADVICFDHGNERD 245 RH+LVFG N LFQE+LD +GI NAW GE FWGS +V IDRLAA+ + DV+CFDH N +D Sbjct: 186 RHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245 Query: 246 MAQLLATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFARVLADAQGRPA 296 M L+ATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHF RVL +A G A Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.8 bits (67), Expect = 0.009 Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 14/66 (21%) Query: 120 AEAI-SLLRNNRVVILSAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165 AE I L+ +VI S G G P D A E+ AD+ + T Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235 Query: 166 KVDGVF 171 V+G Sbjct: 236 DVNGAA 241
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 61.4 bits (149), Expect = 2e-12 Identities = 82/390 (21%), Positives = 147/390 (37%), Gaps = 30/390 (7%) Query: 3 LALFALTIGAFAIGTTEFVIVGLVPTIAQQLSISLPSA---GLLVSIYALGVAIGAPVLT 59 + L + + A IG +I+ ++P + + L S G+L+++YAL APVL Sbjct: 9 VILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 60 ALTGRMPRKQLLLALMVLFTAGNILAWQAPGYETLILARLLTGLAHGVFFSIGSTIATSL 119 AL+ R R+ +LL + + AP L + R++ G+ G+ IA Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 120 VAKEKAASAIAIMFGGLTVALVTGVPFGTFIGQHFGWRETFLAVSILGVIALISSLLLVP 179 E+A M +V G G +G F F A + L + ++ L+P Sbjct: 125 DGDERAR-HFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLP 182 Query: 180 NNIPGRASASLRDQLRVLTHPRLLMIYAITALGYGGVFTAF-------TFLAPMMQELAG 232 + G R+ L L R + A F ++ Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 233 FSPSAVSWILLGYGVSVAIGNVW-GGKLADKHGAVSALK--FIFAALVLLLLVFQLTASV 289 + + + L +G+ ++ G +A + G AL I +LL F A+ Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF---ATR 299 Query: 290 HYAALATVLVMGVFAFGNVPGLQVYVVQKAEQYTPGAVDIASGLNIAAFNIGIALGSIVG 349 + A ++++ G +P LQ + ++ ++ G G A ++ +G ++ Sbjct: 300 GWMAFPIMVLLASGGIG-MPALQAMLSRQVDEERQGQ---LQGSLAALTSLTSIVGPLLF 355 Query: 350 GQTVERYGLAQTPWIG-AMIVLVALLLVVL 378 Y + T W G A I AL L+ L Sbjct: 356 TAI---YAASITTWNGWAWIAGAALYLLCL 382
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 100 bits (251), Expect = 1e-27 Identities = 73/253 (28%), Positives = 124/253 (49%), Gaps = 16/253 (6%) Query: 8 RTAIVTGGATGLGREFVLSLAKEGVNIC-FTYMREEEHPERLIETVKASANVEIIAVKTD 66 + A +TG A G+G +LA +G +I Y E+ E+++ ++KA A A D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL--EKVVSSLKAEARHAE-AFPAD 65 Query: 67 LSDEQSRENLFATCIDRLGKADILVNNAGIWLSGYVTEICPQDWDLVMNVNLKAIFHLSQ 126 + D + + + A +G DILVN AG+ G + + ++W+ +VN +F+ S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 LFVNHCLQHDQMGSILNITSQAAFHGSTTGHAHYAASKAGLVAFAISLAREVAKQKINVN 186 + + + GSI+ + S A T A YA+SKA V F L E+A+ I N Sbjct: 126 SVSKY-MMDRRSGSIVTVGSNPA-GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 187 NIAVGIMDTAMIRKN-IEQNPDYYVSR---------IPVGRVAQPQEIADIGVFMVSPKT 236 ++ G +T M ++N V + IP+ ++A+P +IAD +F+VS + Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 237 SYMTGATLDVTGG 249 ++T L V GG Sbjct: 244 GHITMHNLCVDGG 256
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.009 Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 4/56 (7%) Query: 44 SKIVNVLEAPFAGTLRRILAREGETLQVGAVLALAADASVSDAELDEFVARLATAK 99 SK + +E ++ I+ +EGE+++ G VL A ++A+ + + L A+ Sbjct: 96 SKEIKPIEN---SIVKEIIVKEGESVRKGDVLLK-LTALGAEADTLKTQSSLLQAR 147
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 718 bits (1855), Expect = 0.0 Identities = 224/866 (25%), Positives = 379/866 (43%), Gaps = 57/866 (6%) Query: 15 LGLFIFVSLSPLVMKAYATDNIQFNTDVLDVRDRKNIDLSQFSRSGYIMPGTYDMVVHIN 74 + +FV+ + ++ + FN L + DLS+F + PGTY + +++N Sbjct: 26 FFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLN 85 Query: 75 KNTLPEQEIPFYEPDDDPNGSRACINPKLVEQLGLKPGVLKDLAWWHKGGCLDKRS-VKG 133 + +++ F D G C+ + +GL + + C+ S + Sbjct: 86 NGYMATRDVTFN-TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144 Query: 134 MEIRGDLPTASLYLSIPQAYLEYTDENWDPPSRWDEGVAGLLLDYNLNASSQHQQSEGSN 193 + D+ L L+IPQA++ + PP WD G+ LL+YN + +S + G N Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRI-GGN 203 Query: 194 TQALSGNGTAGGNLGSWRFRADWQANLDHSNGSEQSTQKQFDLSRYYAYRAIPGLHSKLT 253 + N +G N+G+WR R + + + S+ S ++ ++ + R I L S+LT Sbjct: 204 SHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSS-SGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 254 LGENYLDSGMFDSFRFTGVSLISDDNMLPPNLRGYAPEVTGIAKTNAKVIISQQGRVLYE 313 LG+ Y +FD F G L SDDNMLP + RG+AP + GIA+ A+V I Q G +Y Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 314 TSVASGPFRIQDIN-EAVSGELNVRVEEQDGGVQEFVVNTANIPYLTRPGSVRFKLTMGK 372 ++V GPF I DI SG+L V ++E DG Q F V +++P L R G R+ +T G+ Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 373 PSDWQHHLRDPMFGTGELSWGISNGWSLYGGILRGGDYNALSLGIGRDLMFLGALSFDAT 432 P F L G+ GW++YGG Y A + GIG+++ LGALS D T Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442 Query: 433 HSRVRLPWEDLTLNGDSYRLSYSKNFEEYDSQVTFAGYRFSEQDFMTMSEYLDARSYGTR 492 + LP +D +G S R Y+K+ E + + GYR+S + ++ +R G Sbjct: 443 QANSTLP-DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501 Query: 493 -------------------SNGNGKEMYTVNLNKHFRKLELSSYINFSRETYWDRPTTDR 533 N + + + + + + Y++ S +TYW D Sbjct: 502 IETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT-STLYLSGSHQTYWGTSNVDE 560 Query: 534 -YNITLSHYFDLGRFRGVSISLSAYRNQYNGTEDNGAYVSMSIPWSD-----------SS 581 + L+ F+ + ++S S 
+N + D ++++IP+S + Sbjct: 561 QFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617 Query: 582 TVSYNA-MVSHNDNTHRVGYYDRVDEHN--NYQLSAG-----NSSRGVSVSGYYSHEGDM 633 + SY+ + T+ G Y + E N +Y + G + + G + ++ G Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677 Query: 634 ARMSANASYQDGRYSAIGLTMQGGLTLTSEGGAMHRSGMMGGTRMLIDTEGVPDVPVRGY 693 + S+ D + + GG+ + G + + + T +L+ G D V Sbjct: 678 GNANIGYSHSDD-IKQLYYGVSGGVLAHANGVTLGQ--PLNDTVVLVKAPGAKDAKVEN- 733 Query: 694 GSTSRTNAWGKAVISDVNSYYRNKASIDLNQLGDNIEATVSVVQATLTEGAIGYRKFDVI 753 + RT+ G AV+ Y N+ ++D N L DN++ +V T GAI +F Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793 Query: 754 SGAKAMAAIKLADGSEPPFGATVINKRKQETGIVNDSGNVYLSGINAGETMVVHWGGSAQ 813 G K + + + PFGA V ++ Q +GIV D+G VYLSG+ + V WG Sbjct: 794 VGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852 Query: 814 CEVRMPALL---QPDMLMNTLRLLCK 836 L L+ L C+ Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 40.1 bits (93), Expect = 9e-07 Identities = 37/160 (23%), Positives = 71/160 (44%), Gaps = 14/160 (8%) Query: 24 LSGLSTQATGVPNSAFQVKVNIVSPPCIINNNEDIIVSFGEMMATRVDGNHYRVPVNYTL 83 +S L T + + ++ N+ PPC INN ++I+V FG + VD + V N ++ Sbjct: 8 ISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISI 67 Query: 84 DCKNTSSRAMKLQMQGSSTSF-DGTLLGTDNPALGIKILND---ATPLSVNTWMNFTYPD 139 C S ++ +++ G++ +L T+ GI + +TPL++ Y Sbjct: 68 SCPYKSG-SLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRV 126 Query: 140 KPEL---------WAVPVKHSGVTLSTGEFFAVATLKIDY 170 L +VP ++ L+ G+F A++ + Y Sbjct: 127 TAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>TETREPRESSOR#Tetracycline repressor protein signature. Length = 218 Score = 27.6 bits (61), Expect = 0.004 Identities = 9/26 (34%), Positives = 17/26 (65%) Query: 25 IKGTSVKNIAKRLGLQIKTVYAHRSN 50 I G + + +A++LG++ T+Y H N Sbjct: 22 IDGLTTRKLAQKLGIEQPTLYWHVKN 47
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 37.9 bits (88), Expect = 4e-05 Identities = 31/140 (22%), Positives = 53/140 (37%), Gaps = 20/140 (14%) Query: 119 DTLRALLDNSI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169 +T++ L++ + VPVI E+ + E V D D A AD ++LT Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235 Query: 170 DQPGLFTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMGTKLQAA-DVACRAG 228 D G + + +++V +++ + G MG K+ AA G Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289 Query: 229 IDTIIAAGNRPDVIGHAMAG 248 IIA + A+ G Sbjct: 290 ERAIIAH---LEKAVEALEG 306 Score = 29.0 bits (65), Expect = 0.032 Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%) Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAMGHRIVIVTSG-------- 51 + +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 52 -AIAAGREHLGYPELP 66 + AG+ G P P Sbjct: 62 LHMDAGQATYGIPAQP 77
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 536 bits (1381), Expect = 0.0 Identities = 225/384 (58%), Positives = 263/384 (68%), Gaps = 35/384 (9%) Query: 1 MKKSTLALMMMGFVASTATQAAEVYNKNANKLDVYGKIKAMHYFSDYDSKDGDQTYVRFG 60 MK+ LAL++ +A+ A AAE+YNK+ NKLD+YGK+ +HYFSD SKDGDQTY+R G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 IKGETQINDDLTGYGRWESEFSGNKTESDSSQ-KTRLAFAGVKLKNYGSFDYGRNLGALY 119 KGETQIND LTGYG+WE N TE + + TRLAFAG+K +YGSFDYGRN G LY Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGLVDGLDLTLQYQGKNE--- 176 DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFGLVDGL+ LQYQGKNE Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 177 -------------GREAKKQNGDGVGTSLSYDFGGSDFAVSAAYTSSDRTNDQNLLAR-- 221 G + + NGDG G S +YD G F+ AAYT+SDRTN+Q Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239 Query: 222 GQGSKAEAWATGLKYDANNIYLATMYSETRKMTP-------ISGGFANKAQNFEAVAQYQ 274 G KA+AW GLKYDANNIYLATMYSETR MTP GG ANK QNFE AQYQ Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 275 FDFGLRPSLGYVLSKGKDIE----GVGSEDLVNYIDVGLTYYFNKNMNAFVDYKINQLKS 330 FDFGLRP++ +++SKGKD+ +DLV Y DVG TYYFNKN + +VDYKIN L Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359 Query: 331 DNKL----GINDDDIVALGMTYQF 350 D+ GI+ DDIVALGM YQF Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 66.4 bits (162), Expect = 3e-13 Identities = 33/247 (13%), Positives = 73/247 (29%), Gaps = 23/247 (9%) Query: 487 TLNLNALWSKLGTFSVSYNDDRRYNSHYYTADYYQTVYSGAFGSLGLRAGIQRYNNGDSS 546 L + + T +S + Y + +Q + AF + N Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588 Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTI 594 + +AL++++P +W + Q + A+ S + + Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647 Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYVNTNLTASGSVGWQGK 654 ++ +G + +G A + Y + + S +D SG V Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706 Query: 655 NIAASGRTDGNAGVIFNTGLED---DGQISARVNGRIFPLSGKRNYLPLSPYGRYEVELQ 711 + + ++ G +D + Q R + R G + Y V L Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761 Query: 712 NSKNSLD 718 + + + Sbjct: 762 TNTLADN 768
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 29.1 bits (65), Expect = 0.018 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%) Query: 159 LWLLYRTRY--GMAIRAVAFDVNTVRLMGIDANRIISLVFALGSSLAALGGVFYSIS 213 L L+ R R MA+ + + +NT ID NRI++L + L ALG V Y ++ Sbjct: 4 LPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 69.9 bits (171), Expect = 5e-15 Identities = 37/183 (20%), Positives = 88/183 (48%), Gaps = 5/183 (2%) Query: 433 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIL-FYKKFGLIATSALIANLILIV 491 ++I ++GP + + + + + LA VV + ++ + F +F L A AL+ +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 492 GIMSLIPGATLTMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAIDEGYRGA 549 G+ +++ + +A ++ +++ V++ +R++E L ++ ++ Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 550 FSSIFDANVTTLIKVIILYAVGTGAIKGFAITTGIGIATSMFTAIVGTRAIVNLLYGGKR 609 S +TTL+ ++ + G I+GF G+ T ++++ + IV L G R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV-LFIGLDR 312 Query: 610 VKK 612 K+ Sbjct: 313 NKE 315
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 83.3 bits (206), Expect = 1e-19 Identities = 57/231 (24%), Positives = 96/231 (41%), Gaps = 13/231 (5%) Query: 16 GLGTVFSLRMLGMFMVLPVLTTY--GMALQGASEALIGLAIGIYGLAQAVFQIPFGLLSD 73 L TV L +G+ +++PVL + A G+ + +Y L Q G LSD Sbjct: 10 ILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 74 RIGRKPLIVGGLLIFVLGSVIAALTDSIWGIILGRALQG-SGAIAAAVMALLSDLTREQN 132 R GR+P+++ L + I A +W + +GR + G +GA A A ++D+T Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128 Query: 133 RTKAMAFIGVSFGVTFAIAMVLGPIVTHQLGLHALFWMIAILATVGILLTLWVVPNSHNH 192 R + F+ FG VLG ++ HA F+ A L + L +++P SH Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 193 VLNRESGMVKGCFSKVLAEPRLLKLNFGIMCLHIMLMSTFVA-LPGQLEAA 242 + + G+ + ++ F+ L GQ+ AA Sbjct: 188 ERRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAA 231
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 50.6 bits (121), Expect = 7e-09 Identities = 35/181 (19%), Positives = 74/181 (40%), Gaps = 3/181 (1%) Query: 26 IFLGFCVIALDG-FDIAIMGFIAPTLKLEWGVSNHQLGLVISAALIGLALGAIFSGPLAD 84 I + C+++ + ++ P + ++ V +A ++ ++G G L+D Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 85 WLGRKKIIINSVFFFGFWTIATAFSHN-VEQMMFFRFMTGLGLGAAMPNIGTLVSEYAPE 143 LG K++++ + F ++ H+ ++ RF+ G G A + +V+ Y P+ Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 144 RQRSFIITVIFCGFTFGAAAGGFSASWLIPQFGWHSLMALGGILPLLFAPLLIWLLPESV 203 R +I G G + W L+ + I ++ P L+ LL + V Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKKEV 193 Query: 204 R 204 R Sbjct: 194 R 194
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 38.3 bits (89), Expect = 5e-05 Identities = 42/196 (21%), Positives = 75/196 (38%), Gaps = 15/196 (7%) Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGMVNKTLGLFATILGALYG 279 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GVLMQRLTLFRALLIFGLLQGVSNAGYWLLSITDKHLYSMATAVFFENLCGGMGTAAFVA 339 L +L + R LL ++ S+ +S + + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINC-------FGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWSTFYLFSVVAAVP 394 L+M K F L+ ++ A+G VGP I G WS L ++ + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GIALLLLCRQTLEHTQ 410 L+ L ++ + Sbjct: 182 VPFLMKLLKKEVRIKG 197
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKTEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 116 bits (293), Expect = 4e-38 Identities = 49/88 (55%), Positives = 65/88 (73%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDALIASVTESLQAGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPGFRAGKALKDAV 89 NPQTG+EI I A+KVP F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.024 Identities = 11/64 (17%), Positives = 23/64 (35%), Gaps = 10/64 (15%) Query: 193 LAVLSQHLGFTLQECMAFGDAMNDREMLGSVGRGFIMGN----------AMPQLKAELPH 242 VL+Q L + +A + + ++ + +P++K P Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75 Query: 243 LPVI 246 LPV+ Sbjct: 76 LPVL 79
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1365 bits (3535), Expect = 0.0 Identities = 805/1032 (78%), Positives = 911/1032 (88%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLSILKLPVAQYPTIAPPAISITAMYPGADAETVQNT 60 M NFFI RPIFAWV+AII+M+AG L+IL+LPVAQYPTIAPPA+S++A YPGADA+TVQ+T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDHLMYMSSNGDSTGTATITLTFESGTDPDIAQVQVQNKLALATPLLPQ 120 VTQVIEQNMNGID+LMYMSS DS G+ TITLTF+SGTDPDIAQVQVQNKL LATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKASSSFLMVVGVINTNGTMNQDDISDYVAANMKDPISRTSGVGDVQLFGS 180 EVQQQGISVEK+SSS+LMV G ++ N QDDISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMDPNKLNNFQLTPVDVISALKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW+D + LN ++LTPVDVI+ LK QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TNTEEFGNILLKVNQDGSQVRLRDVAKIELGGESYDVVAKFNGQPASGLGIKLATGANAL 300 N EEFG + L+VN DGS VRL+DVA++ELGGE+Y+V+A+ NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTANAIRAELAKMEPFFPSGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIQKGSHGATTGFFGWFNRMFDKSTHHYTDSVGNILRSTGRY 540 SVLVALILTPALCAT+LKP+ H GFFGWFN FD S +HYT+SVG IL STGRY Sbjct: 
481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LVLYLIIVVGMAWLFVRLPSSFLPDEDQGVFLSMAQLPAGATQERTQKVLDEMTNYYLTK 600 L++Y +IV GM LF+RLPSSFLP+EDQGVFL+M QLPAGATQERTQKVLD++T+YYL Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKDNVESVFAVNGFGFAGRGQNTGIAFVSLKDWSQRPGEENKVEAITARAMGYFSQIKDA 660 EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 MVFAFNLPAIVELGTATGFDFELIDQGGLGHEKLTQARNQLFGMVAQHPDVLTGVRPNGL 720 V FN+PAIVELGTATGFDFELIDQ GLGH+ LTQARNQL GM AQHP L VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYIMSEAKYRM 780 EDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+Y+ ++AK+RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPEDIGKWYVRGSDGQMVPFSAFSTSRWEYGSPRLERYNGLPSLEILGQAAPGKSTGEAM 840 LPED+ K YVR ++G+MVPFSAF+TS W YGSPRLERYNGLPS+EI G+AAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 ALMEELAGKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 ALME LA KLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGLI 960 MLVVPLG+VG LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEGKG++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEAVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMVTATILAIFF 1020 EATL AVRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGMV+AT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVRRRF 1032 VPVFFVV+RR F Sbjct: 1021 VPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.5 bits (100), Expect = 2e-06 Identities = 30/210 (14%), Positives = 75/210 (35%), Gaps = 19/210 (9%) Query: 100 TYQASYDSAKGDLAKAQAAANMDQLTVKRYQKLLGTKYISQQDYDTAVATA-QQSNAAVV 158 + Y A +L ++ + + ++ Q + + +Q+ + Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV---TQLFKNEILDKLRQTTDNIG 312 Query: 159 AAKAAVETARINLAYTKVTSPISGRIGKSAV-TEGALVQNGQTTALATVQQLDPIYVDVT 217 + + + +P+S ++ + V TEG +V +T + V + D + V Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371 Query: 218 QSSNDFLRLKQEL-ADGRLKQENGK------AKVELVTNDGLKYPQSGTLEFSDVTVDQT 270 + D + A +++ KV+ + D ++ + G + +++++ Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431 Query: 271 TGSITLRAIFPNPDHTLLPGMFVRARLEEG 300 S + I L GM V A ++ G Sbjct: 432 CLSTGNKNIP------LSSGMAVTAEIKTG 455 Score = 29.0 bits (65), Expect = 0.040 Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 3/78 (3%) Query: 48 APLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFV-EGSDIQAGVSLYQIDPATYQASY 105 ++I G+ T + R E++P + I+ K V EG ++ G L ++ +A Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADT 136 Query: 106 DSAKGDLAKAQAAANMDQ 123 + L +A+ Q Sbjct: 137 LKTQSSLLQARLEQTRYQ 154
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 185 bits (470), Expect = 2e-61 Identities = 170/213 (79%), Positives = 194/213 (91%) Query: 1 MARKTKQQARETRQLILDVALRLFSQQGVSSTSLATIAKAAGVTRGAIYWHFKNKSDLFN 60 MARKTKQ+A+ETRQ ILDVALRLFSQQGVSSTSL IAKAAGVTRGAIYWHFK+KSDLF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSDASISDLEIEYRAKFPNDPLSVIREILVYVLEATVTEERRRLMMEIIYHKCEFV 120 EIWELS+++I +LE+EY+AKFP DPLSV+REIL++VLE+TVTEERRRL+MEII+HKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMTVVQQAQRQLSLASYERIEQTLKECIAAKLLPANLLTRRAAVLMRSYLSGLMENWLF 180 GEM VVQQAQR L L SY+RIEQTLK CI AK+LPA+L+TRRAA++MR Y+SGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APDSFDLHAEARDYVAILLEMYQFCPTLRGPES 213 AP SFDL EARDYVAILLEMY CPTLR P + Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPAT 213
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 47.0 bits (111), Expect = 4e-07 Identities = 42/282 (14%), Positives = 95/282 (33%), Gaps = 4/282 (1%) Query: 31 RAADLPDRAEVQSQLNTLNKQKELTPQDKLVQQDLTQTLETLDKIERIKSETAQLRQQVE 90 + +DL + N +EL+ + ++++ E KI+ +++ A L + +E Sbjct: 72 KNSDLSFNNKALKDHND-ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130 Query: 91 QAPAKLRQAVESLNNLSDVPNDDATRKTLSTLSLRQLESRVTQTLDDLQNAQNDLATYNS 150 A + L A RK +L + T ++ + + A + Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190 Query: 151 QLVSLQTQPERVQNAMFNASQQLQQIRNRLNGTSVGD---ETLRPTQQVLLQAQQALLNA 207 + L+ E N S +++ + + E A A + Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250 Query: 208 QIEQQRKSLEGNTILQDTLQKQRDYVTAWSNRLEHQLQLLQEAVNSKRLTLTEKTAQEAV 267 ++ L+ L+ ++ TA S +++ K + A Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310 Query: 268 TPDETARIQANPLVKQELDINHQLSEKLIQATENGNQLVQRN 309 + A+ K++L+ HQ E+ + +E Q ++R+ Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.9 bits (67), Expect = 0.008 Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 6/61 (9%) Query: 74 RYIQIGTVMTEPDHRNKGLAGQLIHHILQDWQQEADAFFLFANPTTVD-----FYPKFGF 128 Y I + D+R KG+ L+ H +W +E L ++ FY K F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALL-HKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146 Query: 129 T 129 Sbjct: 147 I 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.030 Identities = 41/208 (19%), Positives = 76/208 (36%), Gaps = 14/208 (6%) Query: 44 SHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGVESTSAWSL 102 +H L + YA P+LG +DR G R ++ + + ++ + W L Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99 Query: 103 YVALAIIICGY-GLFKSNISCLLGELYAHDDPRRDGGFSLLYAAGNVGSIAAPIACGLAA 161 Y + I+ G G + + ++ D+ R F + A G +A P+ GL Sbjct: 100 Y--IGRIVAGITGATGAVAGAYIADITDGDE--RARHFGFMSACFGFGMVAGPVLGGLMG 155 Query: 162 QWYGWHIGFALAGIGMFIGLMIFLSGSRHFRHT-RGVDKPALRAVKFVLPTWGWLLVMLC 220 + H F A + + FL+G + +G +P R L ++ W M Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211 Query: 221 LAPVFFTLLLQNNWSGYLLAIVCLFAAQ 248 +A + + A+ +F Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGED 239
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 30.6 bits (69), Expect = 0.008 Identities = 23/130 (17%), Positives = 45/130 (34%), Gaps = 21/130 (16%) Query: 114 GETPLDEPISLSPPLSRVSLAAYCHKLNTFADLLLR------------DYDLQLAYHHHL 161 P+ E ++ L+ + L YC +LN F L + + Y + L Sbjct: 57 SSLPITE-VAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKRMISCQFTHPSKETYLYQL 115 Query: 162 ----MMLVEHDDELERFLSHTHDNVGLAFDTGHAFVAGVEIPRVLHKYGHRIRHLHLKDV 217 +L L + + + L F++ R+ +R+ LK Sbjct: 116 YASSNVL----QLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREALIPLLRNFELKLS 171 Query: 218 RPQVLGRLYR 227 + +++G YR Sbjct: 172 KNKIVGEEYR 181
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 89.7 bits (222), Expect = 1e-23 Identities = 62/253 (24%), Positives = 103/253 (40%), Gaps = 8/253 (3%) Query: 3 RVVVITGGGTGIGAACARLMRAAGDRVFITGRREAPLQAVANETGATA-----LVGDAAD 57 ++ ITG GIG A AR + + G + L+ V + A A D D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 58 GEVWRQRLLPAILDQTGGIDVLICSAGGMGNSPAAETSDRQWREALDGNLTSAFASVRAC 117 + + I + G ID+L+ AG + SD +W N T F + R+ Sbjct: 69 SAAIDE-ITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 118 LPSLIARR-GNVLFVASIASLAAGPQACGYVTAKHALIGLMRSVARDYGPQGVRANAICP 176 ++ RR G+++ V S + Y ++K A + + + + +R N + P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GWVTTPMADEEMHPLMQAEGLSLTEAYQRVCRDVPLRRPASPEEIAQACQFLCSPQAAII 236 G T M + + + + +PL++ A P +IA A FL S QA I Sbjct: 188 GSTETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 237 SGATLVADGGASI 249 + L DGGA++ Sbjct: 247 TMHNLCVDGGATL 259
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 103 bits (257), Expect = 7e-26 Identities = 76/409 (18%), Positives = 160/409 (39%), Gaps = 29/409 (7%) Query: 21 MLPLIDTSITNVALDAITHTLAASATQLELIVALYGVAFAVCLAMGSKLGDNYGRRRLFM 80 +++ + NV+L I + + + + F++ A+ KL D G +RL + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 81 WGVALFGIASLLCGMANSIGALL-AARTLQGAGAALIVPQILATLHVTLKGPAH-ARAIS 138 +G+ + S++ + +S +LL AR +QGAGAA P ++ + + +A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFG 142 Query: 139 LYGGIGGIAFIVGQMGGGWLVSADIAGLGWRNAFFINVPICLLVLALSRRYVPETRRETP 198 L G I + VG GG + + W ++ + +P+ ++ + + Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHY----IHW--SYLLLIPMITIITVPFLMKLLKKEVRIK 196 Query: 199 SRIDWQGTLYL-ALILCCLLFPMALGPELHWPLWLQLMLVAVLPLLFAMRQSALRQQQRG 257 D +G + + I+ +LF + L++ + L+F ++ ++ Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTSY-------SISFLIVSVLSFLIF------VKHIRKV 243 Query: 258 DHPLLPPRLLQLTSIRFGMAIALLFFSAWSGFMFCMALTMQEGLGMAPWQSGNSFIALG- 316 P + P L + G+ + F +GF+ + M++ ++ + G+ I G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 317 VAYFISALYAPRLIARYSMGRILLTGLAVQIAGLLLLCATFSRFGVATNALTLVPATALI 376 ++ I L+ R +L G+ L F + T + + + Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-----FLLETTSWFMTIIIVFV 358 Query: 377 GYGQALIVNSFYRIGMRDISASDAGAGSAILSTLQQATLGLGPAILGSL 425 G + I + +AGAG ++L+ + G G AI+G L Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.8 bits (67), Expect = 0.012 Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 10/87 (11%) Query: 199 AMAEHRGDPAWENKLARFFAASSEFEALWHQRYEVRGVENQIKHFNHPQLGRFSLQQMYW 258 A+ + ++E + + + + + YEV I H N PQ+G L Sbjct: 13 ALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEV-----VITHGNGPQVGSLLLHMDAG 67 Query: 259 YSAPRNGSRLLVYLPMDEAGEQALAWL 285 + + PMD AG + W+ Sbjct: 68 QATYGIPA-----QPMDVAGAMSQGWI 89
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 149 bits (379), Expect = 1e-40 Identities = 75/255 (29%), Positives = 122/255 (47%), Gaps = 21/255 (8%) Query: 89 QALCADRQDSLAQLIGAQGSLQEALRQCKAAISYPGAGLPLLLRGPTGTGKSFLARQLWH 148 L D QD L+G ++QE R + L L++ G +GTGK +AR Sbjct: 127 SKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTD---LTLMITGESGTGKELVAR---- 178 Query: 149 YAIDEGILPADAPFTVFNCAEYANNPELLTSKLFGHAKGAFTGADKAVPGLIETSNGGVL 208 A+ + + PF N A A +L+ S+LFGH KGAFTGA G E + GG L Sbjct: 179 -ALHDYGKRRNGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235 Query: 209 FIDEVHRLPPEGQEKLFHFMDNGSWRRLGESADERSATVRLIFASTEDLEK-----HFLA 263 F+DE+ +P + Q +L + G + +G + VR++ A+ +DL++ F Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVG-GRTPIRSDVRIVAATNKDLKQSINQGLFRE 294 Query: 264 TFIRRIPVI-VKILPIAERGQFERLAFIHHFFRREAQRLNHD-LELDGEIVSQLMRETLE 321 R+ V+ +++ P+ +R E + + F ++A++ D D E + + Sbjct: 295 DLYYRLNVVPLRLPPLRDRA--EDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWP 352 Query: 322 GNVGGLENLIRNICA 336 GNV LENL+R + A Sbjct: 353 GNVRELENLVRRLTA 367
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 56.6 bits (136), Expect = 3e-12 Identities = 21/106 (19%), Positives = 47/106 (44%), Gaps = 1/106 (0%) Query: 3 RPKSEDKKQALLEAATAAFAQSGI-AASTSAIARSAGVAEGTLFRYFATKDELLNELYLA 61 + ++++ +Q +L+ A F+Q G+ + S IA++AGV G ++ +F K +L +E++ Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65 Query: 62 IKLRLVRTMIAGLDPDEKRPKENARNIWNSYIDWGVRNPMEHKAIR 107 + + + P R I ++ V + Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.5 bits (100), Expect = 2e-06 Identities = 17/105 (16%), Positives = 35/105 (33%) Query: 53 RAVDIRARTEGVIVQRHFQDGQYVTEGDLLFTLDDAQPRAALALAQAELKSAEASLRQSQ 112 R+ +I+ ++ + ++G+ V +GD+L L A Q+ L A + Q Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154 Query: 113 QLLTRYERLINNHSISRNDVDTARMQRDVAAAAVQQAKARVEAQQ 157 L E ++ + + K + Q Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199 Score = 34.8 bits (80), Expect = 5e-04 Identities = 15/90 (16%), Positives = 31/90 (34%), Gaps = 4/90 (4%) Query: 91 RAALALAQAELKSAEASLRQSQQLLTRYE-RLINNHSISRNDVDTARMQRDVAAAAVQQA 149 A EL+ ++ L Q + + + + +N++ Q + Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317 Query: 150 KARVEAQQIVLSYTRITAPVTGRVGHSAFH 179 A+ E + + I APV+ +V H Sbjct: 318 LAKNEER---QQASVIRAPVSVKVQQLKVH 344
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 915 bits (2366), Expect = 0.0 Identities = 417/1034 (40%), Positives = 606/1034 (58%), Gaps = 17/1034 (1%) Query: 1 MLTFFIRRPRFAMVIALLLTFVGAVSLKLIPVEQYPAITPPVVNVSASWPGASASDVAEA 60 M FFIRRP FA V+A++L GA+++ +PV QYP I PP V+VSA++PGA A V + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 IAAPLETQLNGVDHMLYMESTSSDEGTYRLSITFAAGTDADLAAIDVQNRVAQALAQLPA 120 + +E +NG+D+++YM STS G+ +++TF +GTD D+A + VQN++ A LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQNGVQVRKRASNLLMGVSLYSPLGTLSPLFVSNYASTQVREALARLPGVGEVQMFGA 180 EVQQ G+ V K +S+ LM S + +S+Y ++ V++ L+RL GVG+VQ+FGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 RDYSMRIWLRPDRMNALNITTDDVAQALREQNVQGAAGQVGTPPVFNGQQQTLTINGLGR 240 + Y+MRIWL D +N +T DV L+ QN Q AAGQ+G P GQQ +I R Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 241 LNEAASFGEIILRRGAQGQLVRLADVATIELGARSYSSGAQLNGKASAYLGIYPTPTANA 300 FG++ LR + G +VRL DVA +ELG +Y+ A++NGK +A LGI ANA Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 301 LQVASAVRAELNRLHTRFPADLTWEVKFDTTRFVAATIKEIGVSLALTLLAVVVVVSLFL 360 L A A++A+L L FP + +DTT FV +I E+ +L ++ V +V+ LFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 361 QSWRATLIVVLAIPVSLIGTFAVLYLLGYSANTLSLFAIILALTMVVDDAIVVVENVETK 420 Q+ RATLI +A+PV L+GTFA+L GYS NTL++F ++LA+ ++VDDAIVVVENVE Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 421 MAE-GLDRLQATAQALRQIAGPVIATTLVLLAVFVPVALLPGIVGELYRQFAVTLSTAVA 479 M E L +AT +++ QI G ++ +VL AVF+P+A G G +YRQF++T+ +A+A Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 480 LSSLVALTLTPALCALLLRPRPARP----AAVWRAFNRLLDGTRDGYGRLVGWMNRRPLL 535 LS LVAL LTPALCA LL+P A + FN D + + Y VG + Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 
539 Query: 536 ALAATVAAGALVAFSFTSMPKGFLPQEDQGYLFASVQLPEAASLERTEAVMAQARKLLMA 595 L A + F +P FLP+EDQG +QLP A+ ERT+ V+ Q + Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 596 NPA--VEDVIQVSGFNILNGTSASNGGFISVMLKDWHQRPP----LDAVMADIQRQLLSL 649 N VE V V+GF+ A N G V LK W +R +AV+ + +L + Sbjct: 600 NEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657 Query: 650 PEATIMTFAPPTLPGLGNASGFDLRIMAQAGQSSAELEQVTREILQLANQHP-QLSRVFT 708 + ++ F P + LG A+GFD ++ QAG L Q ++L +A QHP L V Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 709 TWSSNVPQLTLTVDRDRAALLDVPVAQIFSSLQTAFGGTRAGDFSRNNRVYHVVMQNEMQ 768 + Q L VD+++A L V ++ I ++ TA GGT DF RV + +Q + + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 769 WRERAEQISELYVRSRDGERVRLSNLVTITPTVGAPFIQQYNQFPSVSVSGSAAEGVSSR 828 +R E + +LYVRS +GE V S T G+P +++YN PS+ + G AA G SS Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 829 TAMAAMEQILQAHLPPGYDYAWSGISWQEQQTGNQAVWIVLAAVAMAWLFLVAQYESWTL 888 AMA ME + LP G Y W+G+S+QE+ +GNQA +V + + +L L A YESW++ Sbjct: 838 DAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 889 PASVMLSVLFAIGGALLWLWTAGYANDVYVQIGLVLLIALAAKNAILIVEFARSRRE-EG 947 P SVML V I G LL NDVY +GL+ I L+AKNAILIVEFA+ E EG Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 948 LSIVDAAREGATRRFRAVMMTAVSFIIGIMPMMLATGAGAQSRRIIGTTVFSGMLVATMV 1007 +V+A R R ++MT+++FI+G++P+ ++ GAG+ ++ +G V GM+ AT++ Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 1008 GILFIPSLYVLFQR 1021 I F+P +V+ +R Sbjct: 1017 AIFFVPVFFVVIRR 1030 Score = 76.0 bits (187), Expect = 4e-16 Identities = 90/522 (17%), Positives = 182/522 (34%), Gaps = 45/522 (8%) Query: 531 RRPLLALAATVAAGALVAFSFTSMPKGFLPQEDQGYLFASVQLPEAASLERTEAVMAQAR 590 RRP+ A + A + +P P + S P A + + V Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV----- 61 Query: 591 
KLLMANPAVEDVIQVSGFNILNGTSASNGGFISVMLKDWHQRPPLDA---VMADIQRQLL 647 +++ + ++ TS S G +++ L P A V +Q Sbjct: 62 ----TQVIEQNMNGIDNLMYMSSTSDSAGS-VTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 648 SLPEATIMTFAPPTLPGLGNASGFDLRIMAQAGQS-SAELEQVTREILQLANQHPQLSRV 706 LP+ + ++S + +M S + Q +N LSR+ Sbjct: 117 LLPQE----VQQQGISVEKSSSSY---LMVAGFVSDNPGTTQDDISDYVASNVKDTLSRL 169 Query: 707 -----FTTWSSNVPQLTLTVDRDRAALLDVPVAQIFSSLQTAFGGTRAGDF------SRN 755 + + + + +D D + + + L+ AG Sbjct: 170 NGVGDVQLFGAQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQ 228 Query: 756 NRVYHVVMQNEMQWRERAEQISELYVR-SRDGERVRLSNLVTITPTVGA-PFIQQYNQFP 813 ++ Q + E+ ++ +R + DG VRL ++ + I + N P Sbjct: 229 QLNASIIAQTRFK---NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP 285 Query: 814 SVSVSGSAAEGVSSR-TAMAAMEQI--LQAHLPPG--YDYAWSGISWQEQQTGNQAVWIV 868 + + A G ++ TA A ++ LQ P G Y + + + ++ V + Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI-HEVVKTL 344 Query: 869 LAAVAMAWLFLVAQYESWTLPASVMLSVLFAIGGALLWLWTAGYANDVYVQIGLVLLIAL 928 A+ + +L + ++ ++V + G L GY+ + G+VL I L Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404 Query: 929 AAKNAILIVE-FARSRREEGLSIVDAAREGATRRFRAVMMTAVSFIIGIMPMMLATGAGA 987 +AI++VE R E+ L +A + ++ A++ A+ +PM G+ Sbjct: 405 LVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 988 QSRRIIGTTVFSGMLVATMVGILFIPSLYVLFQRMREWAHRR 1029 R T+ S M ++ +V ++ P+L + H Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 83.9 bits (207), Expect = 2e-21 Identities = 52/187 (27%), Positives = 84/187 (44%), Gaps = 5/187 (2%) Query: 6 VVFITGATSGFGEAAAQVFADAGWSLVLSGRRYPRLKALQ--DRLAARVPVHIIELDVRD 63 + FITGA G GEA A+ A G + +L+ + + AR DVRD Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRD 68 Query: 64 SEAVAAAVASLPADFADITTLINNAGLALSPLPAQEVALEDWKTMIDTNVTGLVTVTHAL 123 S A+ A + + I L+N AG+ L P ++ E+W+ N TG+ + ++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 124 LPTLIRHGAGASIINIGSIAGQWPYPGSHVYGASKAFVKQFSYNLRCDLLGTGVRVTDLA 183 ++ +G SI+ +GS P Y +SKA F+ L +L +R ++ Sbjct: 128 SKYMMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 184 PGIAETE 190 PG ET+ Sbjct: 187 PGSTETD 193
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 31.5 bits (71), Expect = 0.002 Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 1/61 (1%) Query: 107 RKTLIICGYSAEVGVLLTALGGLRQGYNVFIPVDCVGSQSLRTETVVLKQ-AEKAGAVIT 165 R LII G A +G L+TA + F D V SL + L+ A + + Sbjct: 143 RDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVM 202 Query: 166 S 166 + Sbjct: 203 T 203
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 175 bits (446), Expect = 1e-57 Identities = 100/212 (47%), Positives = 129/212 (60%), Gaps = 9/212 (4%) Query: 1 MRHHRTVLPLAGYTIQQIDFDPATFQPEDLFWLPYHASLTGWGRKRQAEHLAGRIAAAYA 60 M LP AG+ + +DFD ++F+ DL WLP+H L GRKR+AEHLAGRIAA +A Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 61 LREVGEKRLPAIGDQRQPLWPTPWFGSISHCGQRALAVIADRPVGVDIERRFTPQLAAEL 120 LREVG + +P +GD+RQPLWP FGSISHC ALAVI+ + +G+DIE+ + A EL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 121 ESSIISPAEKTALLRSGLPFPLALTLAFSAKESGFKATPAANQRALGFADFQIVEITAST 180 SII E+ L S LPFPLALTLAFSAKES +KA GF ++ +TA+ Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDR-VTLPGFNSAKVTSLTATH 179 Query: 181 LALMF--------AEQRYLLHWIASEEQVITL 204 ++L AE+ W + VITL Sbjct: 180 ISLHLLPAFAATMAERTVRTEWFQRDNSVITL 211
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.2 bits (81), Expect = 4e-04 Identities = 39/187 (20%), Positives = 71/187 (37%), Gaps = 8/187 (4%) Query: 24 IARFISILSLGLLGVAIPVQIQMMTHSTWQVGLSVTLTGASMFVGLMVGGVLADRYERKR 83 I F S+L+ +L V++P T + +G V G L+D+ KR Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 84 LILLARGTCGVGFVGLCLNALLPEPSLAAIYLLGIWDGFFASLGVTALLAATPALVGREN 143 L+L G V + + A ++ G F +L ++ + +EN Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKEN 136 Query: 144 LMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNFGLAAAGTFITTLTLLRLPQLPPPP 203 +A + V +G + P IGG++ + W++ L IT +T+ L +L Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKE 192 Query: 204 QPREHPL 210 + Sbjct: 193 VRIKGHF 199
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 52.3 bits (125), Expect = 6e-10 Identities = 59/288 (20%), Positives = 100/288 (34%), Gaps = 31/288 (10%) Query: 40 HTLPSQPLRIVSTSVTLTGSLLAIDAPVVASGATTPNNRVADSQGFLRQWSEVAKARKLA 99 H P RIV+ LLA+ VAD+ + SE + Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINYRLWVSEPPLPDSV- 78 Query: 100 RLYIG---EPSAEAVAAQMPDLILVSATGGDSALPLYDQLKTIAPTLVINYDDKS----- 151 + +G EP+ E + P ++ SA G P + L IAP N+ D Sbjct: 79 -IDVGLRTEPNLELLTEMKPSFMVWSAGYG----PSPEMLARIAPGRGFNFSDGKQPLAM 133 Query: 152 WQTLLTQLGQITGHEQQASARIADFNKQLVSLKEKMKLPPQPVTALVYTAAAHSANIWTP 211 + LT++ + + A +A + + S+K + L ++ P Sbjct: 134 ARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGP 193 Query: 212 ESAQGQMLEQLGFSLATLPGGLPASHSQGKRHDIVQLGGENLAAGLNGQSLFLFAGDQKD 271 S ++L++ G A + + + LAA + L + KD Sbjct: 194 NSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245 Query: 272 ADAIYANPLLAHLPAVAGKRVYPLGTETFRLDYYSALLVLQRLSSLFG 319 DA+ A PL +P V R + F SA+ ++ L + G Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIG 293
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 425 bits (1094), Expect = e-153 Identities = 152/303 (50%), Positives = 201/303 (66%), Gaps = 20/303 (6%) Query: 1 MAIPKLQAYALPEASDIPANKVNWAFEPSRAALLIHDMQEYFLNFWGENSAMMEKVVANI 60 MAIP +Q Y +P ASD+P NKV+W +P+RA LLIHDMQ YF++ + ++ + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDFCKQNGIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQQVIAALAPDEDDTV 120 L++ C Q GIPV YTAQP Q+ +DRALL D WGPGL P ++++I LAP++DD V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEEMLKETGRDQLIITGVYAHIGCMTTATDAFMRDIKPFFVADALAD 180 L KWRYSAF R+ L EM+++ GRDQLIITG+YAHIGC+ TA +AFM DIK FFV DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSREEHLMALKYVAGRSGRVVMTEELL--------PLPASKA-----------ALRALIL 221 FS E+H MAL+Y AGR VMT+ LL + + A +R I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 222 PLLDESDEPLD-DENLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWALLTR 280 LL E+ E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LLT Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300 Query: 281 EVQ 283 Q Sbjct: 301 RSQ 303
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 341 bits (875), Expect = e-121 Identities = 107/258 (41%), Positives = 149/258 (57%), Gaps = 20/258 (7%) Query: 5 GQTVWVTGAGKGIGYATALAFVEAGANVTGFD---------------LAFDGESYPFATE 49 G+ ++TGA +GIG A A GA++ D A E++P Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 50 TLDVADADQVREACSRLLANTERLDVLVNAAGILRMGATDQLSAEDWQQTFAVNVGGAFN 109 DV D+ + E +R+ +D+LVN AG+LR G LS E+W+ TF+VN G FN Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 110 LFQQTMAQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGSGVRC 169 + +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 170 NLVSPGSTDTDMQRTLWVSDDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASSH 229 N+VSPGST+TDMQ +LW ++ +Q I+G E FK GIPL K+A+P +IA+ +LFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 230 ASHITLQDIVVDGGSTLG 247 A HIT+ ++ VDGG+TLG Sbjct: 243 AGHITMHNLCVDGGATLG 260
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.3 bits (71), Expect = 0.014 Identities = 9/60 (15%), Positives = 25/60 (41%), Gaps = 1/60 (1%) Query: 172 VIILAVLAMIVVKALTHSPWG-TYTVAFTIPLAIFMGIYIRYLRPGRIGEVSVIGLVMLV 230 ++ ++ + + + A + W +V +PL I + L + ++GL+ + Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 135 bits (340), Expect = 8e-41 Identities = 86/253 (33%), Positives = 130/253 (51%), Gaps = 15/253 (5%) Query: 5 LTGKKALVTGASRGLGRAIALSLARAGAAVVITYEKSVDKAQAVADEIKALGRYGEAVQA 64 + GK A +TGA++G+G A+A +LA GA + + + +K + V +KA R+ EA A Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 65 DSASAQAIQDAVTHAARSLGGLDILVNNAGIARGGPLESMTLADIDALINVNIRGVVIAT 124 D + AI + R +G +DILVN AG+ R G + S++ + +A +VN GV A+ Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 125 QEALVHMAD--GGRIINIGSCLANRVAMPGISVYAMTKSALNALTRGLARDLGPRGITVN 182 + +M D G I+ +GS A ++ YA +K+A T+ L +L I N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 183 LVHPGPTNSDMN-----PEDGEQ------AEAQRQMIAVGHYGQPEDIAAAVTFLASPAA 231 +V PG T +DM E+G + E + I + +P DIA AV FL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 232 GQISGTGLDVDGG 244 G I+ L VDGG Sbjct: 244 GHITMHNLCVDGG 256
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.7 bits (85), Expect = 1e-04 Identities = 53/313 (16%), Positives = 106/313 (33%), Gaps = 26/313 (8%) Query: 99 LGLLLSAGMNLMMGMTTNALLLAIFWGINGWAQSMGVGPCAVSLARWYGVKERGTFYGIW 158 + L +A +M +L I + G + G A +A ER +G Sbjct: 78 VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY-IADITDGDERARHFGFM 136 Query: 159 STAHNIGEAVTYMVIAAVIAGFGWQMGYLSTAALGAAGVVLLVLFMHDSPQSSGFPSINV 218 S G V V+ ++ GF + + AAL + + +S + P Sbjct: 137 SACFGFG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP---- 191 Query: 219 IRDEPQEEVEARGSVFKNQLLALRNPALWTLALASAFMYIDRYAVNSWGIFFLEQDKAYS 278 EA + + +A+ + + W I F E + Sbjct: 192 ------LRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWD 244 Query: 279 TLEASGIIGVN-AIAGIAGTIIAGMLSDRF---FPRNRSVMAGFISLLNTAGFALMLWSP 334 IG++ A GI ++ M++ R++M G I+ + G+ L+ ++ Sbjct: 245 A----TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA--DGTGYILLAFAT 298 Query: 335 HNYYTDILAMIIFGATIGALTCFLGGLIAVDISSRKAAGAALGTIGIASYAGAGLGEFLT 394 + M++ A+ G L +++ + + G G++ + + +G L Sbjct: 299 R-GWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALTSLTSIVGPLLF 355 Query: 395 GIIIDKTAILENG 407 I + NG Sbjct: 356 TAIYAASITTWNG 368
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 36.7 bits (85), Expect = 1e-04 Identities = 15/32 (46%), Positives = 22/32 (68%) Query: 345 LETLLQENGNVVRAADRLGLHRNTLHQRIQRI 376 L L GN ++AAD LGL+RNTL ++I+ + Sbjct: 442 LAALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.3 bits (219), Expect = 2e-22 Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 1/122 (0%) Query: 4 VLIIEDEHAIRRFLRTALEADGMRVFEAETLQRGLIEAATRKPDLAILDLGLPDGDGIDF 63 +L+ +D+ AIR L AL G V A DL + D+ +PD + D Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 IRDLRQ-WSQMPIIVLSARSEEHDKIAALDAGADDYLSKPFGIGELQARLRVALRRHGAA 122 + +++ +P++V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 123 QA 124 + Sbjct: 126 PS 127
>PF06580#Sensor histidine kinase Length = 349 Score = 29.8 bits (67), Expect = 0.044 Identities = 24/130 (18%), Positives = 48/130 (36%), Gaps = 28/130 (21%) Query: 760 HIQLDLPDPLQLVHVDGPLFERVLINLLENAHKYAGAR----ASIGIRAEADARQLSLEV 815 + + + V V P ++ L+EN K+ A+ I ++ D ++LEV Sbjct: 241 QFENQINPAIMDVQV--PPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 816 WDNGPGIPAGQEQTIFDKFARGNKESAIPGVGLGLA-ICQAIVDVHGG--TISASNRPEG 872 + G ++ G GL + + + ++G I S + +G Sbjct: 297 ENTGS----------------LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEK-QG 339 Query: 873 GASFRVTLPG 882 + V +PG Sbjct: 340 KVNAMVLIPG 349
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 29.3 bits (65), Expect = 0.036 Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 4/115 (3%) Query: 361 ILTLSARWSAAY-GQSSMPLMVLGLAVMGFAELFIDPVAMSQITRIEIPGVTGVLTGIYM 419 +LT+ + +A + G +S+ L +GLAVM E+ +S I + P + VL + Sbjct: 324 LLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLME 383 Query: 420 LLSGAIANYLAGVIAD-QTSQASFDAAGAVNYSID--AYITVFSQITWGALACVG 471 L+ AI L G+ D +T++ + GA+ +I A I V + + GA A +G Sbjct: 384 LIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLG 438
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.4 bits (71), Expect = 0.003 Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 14 VDDAPHMQDYTLEAEEGRDM-MLLDALIQLKEKDPSLSFRR 53 +++ + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.5 bits (146), Expect = 7e-12 Identities = 37/270 (13%), Positives = 80/270 (29%), Gaps = 15/270 (5%) Query: 55 VDPGAVVNNYNRQQQQQA----SARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQER 110 VD + N Q + + A E E KQ + Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051 Query: 111 LQAQEAAKEAKEQQKQ-AEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAKAKADAQAK 169 + ++ A E Q ++ A+EA + A + AQ+ + + A + + K Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111 Query: 170 AAEQAAAKAAADAKKQAEAAAAKAAAEA-KKQAEAEAAKAAAEAQKKAEAAAAKKAQQEA 228 A + K ++ + + +E + QAE K+ ++ A E Sbjct: 1112 AKVETEKTQEV-PKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170 Query: 229 EKKAQQEAAKQAAAEKAAAEKAAA--------QKAAAEKAAAEKAAAAEKAAAAKAAAAE 280 K +Q E + A + +++ K ++ + Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230 Query: 281 KAAADKAAKAAAAKAAAAKKAAAAKEADGV 310 + A ++ ++ A + + V Sbjct: 1231 PHNVEPATTSSNDRSTVALCDLTSTNTNAV 1260 Score = 56.2 bits (135), Expect = 2e-10 Identities = 25/234 (10%), Positives = 70/234 (29%), Gaps = 6/234 (2%) Query: 65 NRQQQQQASARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQERLQAQEAAKEAKEQQ 124 NR+ ++A + A + + Q E +E Q E + +E+E E K + + Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124 Query: 125 KQAEEAAAKAAA-AAKAKADAQAKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKAAADAK 183 ++ + + + + +A+ + K + A++ ++ Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184 Query: 184 KQAEAAAAKAAAEAKKQAEAEAAK--AAAEAQKKAEAAAAKKAQQEAEKKAQQEAAKQAA 241 + + E + + +E+ K + + + E A ++ Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP---HNVEPATTSS 1241 Query: 242 AEKAAAEKAAAQKAAAEKAAAEKAAAAEKAAAAKAAAAEKAAADKAAKAAAAKA 295 +++ ++ A A+ A A + + Sbjct: 1242 NDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295 Score = 56.2 bits (135), Expect = 2e-10 Identities = 29/199 (14%), Positives = 63/199 (31%), Gaps = 5/199 (2%) Query: 86 
QQQAEELREKQAAEQERLKQLEQERLQAQEAAKEAKEQQKQAEEAAAKAAAAAKAKADAQ 145 ++ + + Q + + + ++ A A + + A+ Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE-N 1043 Query: 146 AKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKAAADAKKQAEAAAAKAAAEAKKQAEAEA 205 +K+ + K +A + A++A + A+ + A + E + E Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103 Query: 206 AKAAAEAQKKAEAAAAKKAQQEAEKKAQQEAAKQAAAEKAAAEKAAAQKAAAEKAAAEKA 265 A E + K E QE K Q + KQ +E + A++ E Sbjct: 1104 ATVEKEEKAKVETEK----TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159 Query: 266 AAAEKAAAAKAAAAEKAAA 284 + A + A E ++ Sbjct: 1160 SQTNTTADTEQPAKETSSN 1178 Score = 55.5 bits (133), Expect = 3e-10 Identities = 28/251 (11%), Positives = 64/251 (25%), Gaps = 1/251 (0%) Query: 59 AVVNNYNRQQQQQASARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQERLQAQEAAK 118 V N ++ + + A + Q ++ A+E + A + + + + Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 119 EAKEQQKQAEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAK-AKADAQAKAAEQAAAK 177 E KE +E AK + + ++ A+ + Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158 Query: 178 AAADAKKQAEAAAAKAAAEAKKQAEAEAAKAAAEAQKKAEAAAAKKAQQEAEKKAQQEAA 237 + AK + +Q E+ A + ++ Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218 Query: 238 KQAAAEKAAAEKAAAQKAAAEKAAAEKAAAAEKAAAAKAAAAEKAAADKAAKAAAAKAAA 297 + ++ + A + A + A A KA A A Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKA 1278 Query: 298 AKKAAAAKEAD 308 + + E + Sbjct: 1279 VSQHISQLEMN 1289
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 116 bits (293), Expect = 5e-38 Identities = 33/88 (37%), Positives = 57/88 (64%), Gaps = 1/88 (1%) Query: 2 TKSELIERLASQQSHIPAKAVEDAVKEMLEHMASTLAQGERIEIRGFGSFSLHYRAPRTG 61 K +LI ++A + + + K AV + ++S LA+GE++++ GFG+F + RA R G Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61 Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRDR 89 RNP+TG++++++ VP FK GK L+D Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDA 89
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.2 bits (68), Expect = 0.030 Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 1/74 (1%) Query: 128 ITYDSEQVASSSSSALITVVREGASIIGLFVMMFYYSWQLSLILIVLAPIVSVAIRVVSK 187 YD+ S ++ + E ++ L + +F + + +LI + P+V + + Sbjct: 325 YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILA 384 Query: 188 RFRNISKNMQNTMG 201 F S N G Sbjct: 385 AF-GYSINTLTMFG 397
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 48.1 bits (114), Expect = 2e-07 Identities = 46/281 (16%), Positives = 95/281 (33%), Gaps = 20/281 (7%) Query: 347 QEKIERYEADLDELQIRLEEQNEVVAEAVERQEENEARAEAAELEVDELKSQLADYQQAL 406 + K + L+ +E E ++ A E+ +N+ ++ EL+++ AD ++AL Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129 Query: 407 DVQQTRAIQYNQALQALERAKALCHLPDLTPESADEWLETFQAKEQEATEKMLSLEQKMS 466 + + + ++ LE KA L AD + + A + K+ Sbjct: 130 EGAMNFSTADSAKIKTLEAEKA-----ALAARKADL-----EKALEGAMNFSTADSAKIK 179 Query: 467 VAQTAHSQFEQAYQLVAAINGPLARNEAWDVARELLRDGVNQRHQAEQAQGLRSRLNELE 526 + + E + L + A + A A+ +LE Sbjct: 180 TLEAEKAALEARQA---ELEKALEGAMNFSTADSAKIKTLEAEKAALAAR-----KADLE 231 Query: 527 QRLREQQDAERQLAEFCKRQGKRYDIDDLETLHQELEARIASLADSVSNAQEQRMALRQE 586 + L + + K + + LE ELE + + + + L E Sbjct: 232 KALEGAMNFSTADSA--KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 289 Query: 587 LEQLQSRTQTLMRRAPVWLAAQNSLNQLCEQSGEQFASGQE 627 L++ L ++ V A + SL + + S E + Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330 Score = 38.5 bits (89), Expect = 2e-04 Identities = 61/363 (16%), Positives = 117/363 (32%), Gaps = 29/363 (7%) Query: 261 HLISEATNYVAADYMRHANERRIHLDKALEYRRDLFTSRSQLAAEQYKHVDMARELQEHN 320 + E + + + + +L+ + K + L E Sbjct: 53 EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKA 112 Query: 321 GAEGDLEADY----QAASDHLNLVQTALRQQEKIERYEADLDELQIRLEEQNEVVAEAVE 376 +LEA +A +N + + +E +A L + LE+ E Sbjct: 113 SKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 172 Query: 377 RQEENEARAEAAELEVDELKSQLADYQQALDVQQTRAIQYNQALQA-LERAKALCHLPDL 435 EA + ++ +++L + T + L+A A + Sbjct: 173 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEK 232 Query: 436 TPESADEWLETFQAKEQEATEKMLSLEQKM-SVAQTAHSQFEQAYQLVAAINGPLARNEA 494 E A + AK + + +LE + + + + A I A A Sbjct: 233 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 292 Query: 495 WDVARELLRDGVNQRHQAEQAQGLRSRLNELEQRLREQQDAERQLAEFCK------RQGK 548 + + L +Q A + Q 
LR L + + ++Q +AE Q E RQ Sbjct: 293 LEAEKADLEH-QSQVLNANR-QSLRRDL-DASREAKKQLEAEHQKLEEQNKISEASRQSL 349 Query: 549 RYDID--------------DLETLHQELEARIASLADSVSNAQEQRMALRQELEQLQSRT 594 R D+D LE ++ EA SL + ++E + + + LE+ S+ Sbjct: 350 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKL 409 Query: 595 QTL 597 L Sbjct: 410 AAL 412 Score = 36.2 bits (83), Expect = 0.001 Identities = 41/307 (13%), Positives = 102/307 (33%), Gaps = 18/307 (5%) Query: 935 QFEQLKEDYAYAQQTQRDARQQAFALAEVVQRRAHFSYSDSAEMLSGNSDLNEKLRQRLE 994 ++ +Q + Q+ E+ SD + D N++L + L Sbjct: 36 NTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELS 95 Query: 995 QAESERSRARDAMRAHAAQLSQYNQVLASLKSSYDTKKELLNDLYKELQDIGVRADAGAE 1054 A+ + + ++ A+++ + A L+ + + +++ + A A Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA 155 Query: 1055 ERA--RARRDELHMQLSNNRSRRNQLEKALTFCEAEMDNLTRKLRKLERDY-------CE 1105 +A + + + ++ LE EA L + L Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215 Query: 1106 MREQVVTAKAGWCAVMRLVKDNGVERRLHRRELAYLSAD------ELRSMSDKALGALRL 1159 + + A + + ++ ++ L A+ + GA+ Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275 Query: 1160 AVADNEHLRDVLRISEDPKRPERKIQFFVAVYQHLRERIRQDIIRTDDPVEAIEQMEIEL 1219 + AD+ ++ + + + ++ V R+ +R+D+ D EA +Q+E E Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL---DASREAKKQLEAEH 332 Query: 1220 SRLTEEL 1226 +L E+ Sbjct: 333 QKLEEQN 339 Score = 35.0 bits (80), Expect = 0.002 Identities = 51/327 (15%), Positives = 96/327 (29%), Gaps = 15/327 (4%) Query: 782 ARENRIETLHAERESLSERFATLSFDVQKTQRLHQAFSRFIGSHLAVAFEDDPEEEIRKL 841 A ++ + L E + E+ + + Q + E Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL------EKALEGAMNF 135 Query: 842 NSRRGELERALSAHESDNQQNRVQYEQAKEGVSALNRLLPRLNLLADDTLADRVDEIQER 901 ++ + L A ++ + E+A EG + + A E Sbjct: 136 STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL 195 Query: 902 LDEAQEAARFIQQHGNQLAKLEPIVSVLQSDPEQFEQLKEDYAYAQQTQRDARQQAFALA 961 + A F ++ LE + L + E+ E + A Sbjct: 196 
EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEK 255 Query: 962 EVVQRRAHFSYSDSAEMLSGNSDLNEKLRQRLEQAESERSRARDAMRAHAAQLSQYNQVL 1021 ++ R ++ + L G + + +++ E+E++ Q N Sbjct: 256 AALEAR----QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311 Query: 1022 ASLKSSYDTKKELLNDLYKELQDIGVRADAGAEERARARRDELHMQLSNNRSRRNQLEKA 1081 SL+ D +E L E Q + + R RRD L +R + QLE Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD-----LDASREAKKQLEAE 366 Query: 1082 LTFCEAEMDNLTRKLRKLERDYCEMRE 1108 E + + L RD RE Sbjct: 367 HQKLEEQNKISEASRQSLRRDLDASRE 393
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 496 bits (1278), Expect = e-179 Identities = 222/385 (57%), Positives = 266/385 (69%), Gaps = 29/385 (7%) Query: 2 MKRNILAVVIPALLVAGAANAAEIYNKNGNKLDFYGKMVGEHVWTTNGDTSSDDTTYARI 61 MKR +LA+VIPALL AGAA+AAEIYNK+GNKLD YGK+ G H ++ + + D TY R+ Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDD-SSKDGDQTYMRV 59 Query: 62 GLKGETQINDQLIGYGQWEYNMDASNVEGSQT-TKTRLAFAGLKAGEYGSFDYGRNYGAI 120 G KGETQINDQL GYGQWEYN+ A+ EG + TRLAFAGLK G+YGSFDYGRNYG + Sbjct: 60 GFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVL 119 Query: 121 YDVEAATDMLVEWGGDGWNYTDNYMTGRTNGVATYRNSDFFGLVDGLSFALQYQGKNDHD 180 YDVE TDML E+GGD + Y DNYMTGR NGVATYRN+DFFGLVDGL+FALQYQGKN+ Sbjct: 120 YDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQ 179 Query: 181 RA---------------IRKQNGDGFSTAATYAFDNGIALSAGYSSSNRSVDQKA----D 221 A IR NGDGF + TY G + A Y++S+R+ +Q Sbjct: 180 SADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTI 239 Query: 222 GNGDKAEAWATSAKYDANNIYAAVMYSQTYNMTP------EEDNHFAGKTQNFEAVVQYQ 275 GDKA+AW KYDANNIY A MYS+T NMTP D A KTQNFE QYQ Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 276 FDFGLRPSIGYVQTKGKDLQSRAGFSGGDADLVKYIEVGTWYYFNKNMNVYAAYKFNQLD 335 FDFGLRP++ ++ +KGKDL + +G D DLVKY +VG YYFNKN + Y YK N LD Sbjct: 300 FDFGLRPAVSFLMSKGKDL-TYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLD 358 Query: 336 DND-YTKAAGVATDDQAAVGIVYQF 359 D+D + K AG++TDD A+G+VYQF Sbjct: 359 DDDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 33.3 bits (76), Expect = 0.003 Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 2/37 (5%) Query: 121 DWVEAEQLFGCVR-QFNGAITLQPGLVHQANGGVLVL 156 D +E+E LFG + F GA T G QA GG L L Sbjct: 202 DLIESE-LFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 579 bits (1495), Expect = 0.0 Identities = 306/356 (85%), Positives = 323/356 (90%), Gaps = 10/356 (2%) Query: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYAGGKLGWSQYHDTGFYGNGFQNNNGPTRNDQ 60 MKKTAIAIAVALAGFATVAQAAPKDNTWY G KLGWSQYHDTGF NNNGPT +Q Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFI-----NNNGPTHENQ 55 Query: 61 LGAGAFGGYQVNPYLGFEMGYDWLGRMAYKGSVDNGAFKAQGVQLTAKLGYPITDDLDIY 120 LGAGAFGGYQVNPY+GFEMGYDWLGRM YKGSV+NGA+KAQGVQLTAKLGYPITDDLDIY Sbjct: 56 LGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIY 115 Query: 121 TRLGGMVWRADSKGNYASTGVSRSEHDTGVSPVFAGGVEWAVTRDIATRLEYQWVNNIGD 180 TRLGGMVWRAD+K N V HDTGVSPVFAGGVE+A+T +IATRLEYQW NNIGD Sbjct: 116 TRLGGMVWRADTKSN-----VYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGD 170 Query: 181 AGTVGTRPDNGMLSLGVSYRFGQEDAAPVVAPAPAPAPEVATKHFTLKSDVLFNFNKATL 240 A T+GTRPDNGMLSLGVSYRFGQ +AAPVVAPAPAPAPEV TKHFTLKSDVLFNFNKATL Sbjct: 171 AHTIGTRPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATL 230 Query: 241 KPEGQQALDQLYTQLSNMDPKDGSAVVLGYTDRIGSEAYNQQLSEKRAQSVVDYLVAKGI 300 KPEGQ ALDQLY+QLSN+DPKDGS VVLGYTDRIGS+AYNQ LSE+RAQSVVDYL++KGI Sbjct: 231 KPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGI 290 Query: 301 PAGKISARGMGESNPVTGNTCDNVKARAALIDCLAPDRRVEIEVKGYKEVVTQPAA 356 PA KISARGMGESNPVTGNTCDNVK RAALIDCLAPDRRVEIEVKG K+VVTQP A Sbjct: 291 PADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIKDVVTQPQA 346
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 30.1 bits (67), Expect = 0.035 Identities = 46/201 (22%), Positives = 78/201 (38%), Gaps = 28/201 (13%) Query: 291 PTISKLESDTAARHALLLKSWQKQCQEKKAQAKSWR------LWLEEEMGWQLPE---GD 341 P+IS A H + + WQ E K +R L L G+ L G Sbjct: 12 PSIS-----LAKAHERISQHWQNPVGELNIGGKRYRIIDNQVLRLNPHSGFSLFREGVGK 66 Query: 342 FWQDKKVQRRMASRLDRWVSLMRMHGGSQAEMIAGAPEAVRDLFSKRVKLMSPLMKDWKA 401 + K +A L +L + E+ + P A+ +LF + + PL WK Sbjct: 67 IFSGKMFNFSIARNLTD--TLHAAQKTTSQELRSDIPNALSNLFGAKPQTELPL--GWKG 122 Query: 402 ALKAENAVDFSGLIHQAVNILDKGRFVSPWKHILVDEFQDISPQRASLLAALRRQNSQTT 461 A D G+ + + +F HI + E +D + L+A + R ++ Sbjct: 123 E-PLSGAPDLEGM-----RVAETDKFAEGESHISIIETKD----KQRLVAKIERSIAEGH 172 Query: 462 LFAVGDDWQAIYRFSGAQLSL 482 LFA + ++ IY+ +G +L Sbjct: 173 LFAELEAYKHIYKTAGKHPNL 193
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 42.4 bits (99), Expect = 2e-05 Identities = 109/600 (18%), Positives = 200/600 (33%) Query: 166 TDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 225 T+ G S A + + +DS + S + +S + S SD + Sbjct: 183 TETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTA 242 Query: 226 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 285 S + DS + S + DS + S + SD + S + +DS Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302 Query: 286 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 345 + S + +S + S + SD + S + DS + S + Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 362 Query: 346 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 405 DS + S + SD + S + +DS + S + +S + S Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422 Query: 406 DSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 465 + SD + S + D+ + S + DS + S + SD + Sbjct: 423 TQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 482 Query: 466 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 525 S S + +S + S + S + S + ++SD + S S + ++S Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542 Query: 526 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 585 + S + +S + S + SD + S + SDS + S + Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTA 602 Query: 586 DSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 645 S + S + S + S S + +DS + S + +S + S Sbjct: 603 SYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 662 Query: 646 DSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 705 + SD + S S + +DS + S + +S + S + SD S Sbjct: 663 TQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTS 722 Query: 706 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDS 765 S S + +DS + S + S A S + S + S S + +DS Sbjct: 723 
GYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782 Score = 42.0 bits (98), Expect = 3e-05 Identities = 110/600 (18%), Positives = 202/600 (33%) Query: 158 SDSDSDSDTDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 217 +++ DS T G S A +D+ + S + +S + S SD + Sbjct: 183 TETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTA 242 Query: 218 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 277 S + DS + S + DS + S + SD + S + +DS Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302 Query: 278 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 337 + S + +S + S + SD + S + DS + S + Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 362 Query: 338 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 397 DS + S + SD + S + +DS + S + +S + S Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422 Query: 398 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDS 457 + SD + S + DS + + + DS + S + SD + Sbjct: 423 TQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 482 Query: 458 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 517 S S + +S + S + S + S + ++SD + S S + ++S Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542 Query: 518 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 577 + S + +S + S + SD + S + SDS + S + Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTA 602 Query: 578 DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 637 S + S + S + S S + +DS + S + +S + S Sbjct: 603 SYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 662 Query: 638 DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 697 + SD + S S + +DS + S + +S + S + SD S Sbjct: 663 TQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTS 722 Query: 698 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDS 757 S S + +DS + S + S + S A S + S S + +DS Sbjct: 723 
GYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782 Score = 41.7 bits (97), Expect = 4e-05 Identities = 115/618 (18%), Positives = 206/618 (33%) Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229 GS A +S + S SD + S + DS + S + DS Sbjct: 213 GSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 272 Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289 + S + SD + S + +DS + S + +S + S + Sbjct: 273 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 332 Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349 SD + S + DS + S + DS + S + SD + S Sbjct: 333 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTG 392 Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409 + +DS + S + +S + S + SD + S + DS + Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452 Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469 S + DS + S + SD + S S + +S + S + S Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTL 512 Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529 + S + ++SD + S S + ++S + S + +S + S + Sbjct: 513 TAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTARE 572 Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589 SD + S + SDS + S + S + S + S + S S Sbjct: 573 GSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTS 632 Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649 A +DS + S + +S + S + SD + S S + +DS + Sbjct: 633 TAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGY 692 Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709 + + +S + S + SD S S S + +DS + S + S Sbjct: 693 GSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSL 752 Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769 + S + S + S S + ADS + S + S + S + Sbjct: 753 
TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQE 812 Query: 770 DSDSDSDSDSDSDSDSDS 787 SD + S S + +DS Sbjct: 813 RSDLTTGYGSTSTAGADS 830 Score = 41.3 bits (96), Expect = 5e-05 Identities = 112/618 (18%), Positives = 203/618 (32%) Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229 GS S + S + S + S + +DS + S + +S Sbjct: 165 GSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQ 224 Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289 + S SD + S + DS + S + DS + S + Sbjct: 225 MAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 284 Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349 SD + S + +DS + S + +S + S + SD + S Sbjct: 285 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 344 Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409 + DS + S + DS + S + SD + S + +DS + Sbjct: 345 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGY 404 Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469 S + +S + S + SD + S + DS + S + DS Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464 Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529 + S + SD + S S + +S + S + S + S + + Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524 Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589 +SD + S S + ++S + S + +S + S + SD + S Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584 Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649 A SDS + S + S + S + S + S S + +DS + Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644 Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709 + + +S + S + SD + S S + +DS + S + +S Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704 Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769 
+ S + SD S S S + ADS + S + S + S + Sbjct: 705 TAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTARE 764 Query: 770 DSDSDSDSDSDSDSDSDS 787 S + S S + +DS Sbjct: 765 QSVLTTGYGSTSTAGADS 782 Score = 40.5 bits (94), Expect = 1e-04 Identities = 108/610 (17%), Positives = 202/610 (33%) Query: 154 ADSDSDSDSDSDTDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 213 D S+ + + P + DA +S + + + + S S + S Sbjct: 125 PDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTE 184 Query: 214 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 273 + S + S + +DS + S + +S + S SD + Sbjct: 185 TAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGY 244 Query: 274 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 333 S + DS + S + DS + S + SD + S + +DS Sbjct: 245 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 304 Query: 334 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 393 + S + +S + S + SD + S + DS + S + Sbjct: 305 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 364 Query: 394 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDS 453 DS + S + SD + S + +DS + S + +S + S Sbjct: 365 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 424 Query: 454 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 513 + SD + S + DS + S + DS + S + SD + Sbjct: 425 TAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGY 484 Query: 514 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 573 S S + +S + S + S + S + ++SD + S S + ++S Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSL 544 Query: 574 DSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 633 + S + +S A S + SD + S + SDS + S + Sbjct: 545 IAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASY 604 Query: 634 DSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 693 S + S + + + S S + +DS + S + +S + S Sbjct: 605 HSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 664 Query: 694 
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDS 753 + SD + S S + +DS + S + +S + S + SD S Sbjct: 665 TAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGY 724 Query: 754 DSDSDSDSDS 763 S S + +DS Sbjct: 725 GSTSTAGADS 734 Score = 40.1 bits (93), Expect = 1e-04 Identities = 106/585 (18%), Positives = 195/585 (33%) Query: 205 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 264 S + +DS + S + +S + S SD + S + DS Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257 Query: 265 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 324 + S + DS + S + SD + S + +DS + S + + Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317 Query: 325 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 384 S + S + SD + S + DS + S + DS + S Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377 Query: 385 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSD 444 + SD + S + +DS + S + +S + + + SD + Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437 Query: 445 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 504 S + DS + S + DS + S + SD + S S + +S Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497 Query: 505 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 564 + S + S + S + ++SD + S S + ++S + S + + Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557 Query: 565 SDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 624 S + S + SD + S + SDS + S + S + S Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617 Query: 625 SDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 684 + S + S S + +DS + S + +S + S + SD + Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677 Query: 685 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSD 744 S S + +DS + S + +S + S + SD S S S A +DS Sbjct: 678 
STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737 Query: 745 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 789 + S + S + S + S + S S + +DS Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782 Score = 39.7 bits (92), Expect = 2e-04 Identities = 105/585 (17%), Positives = 195/585 (33%) Query: 209 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 268 S + +DS + S + +S + S SD + S + DS Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257 Query: 269 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 328 + S + DS + S + SD + S + +DS + S + + Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317 Query: 329 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 388 S + S + SD + S + DS + S + DS + S Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377 Query: 389 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSD 448 + SD + S + +DS + S + +S A S + SD + Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437 Query: 449 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 508 S + DS + S + DS + S + SD + S S + +S Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497 Query: 509 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 568 + S + S + S + ++SD + S S + ++S + S + + Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557 Query: 569 SDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 628 S + S + SD + S + SDS + S + S + S Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617 Query: 629 SDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 688 + S + S S + +DS + S + +S + S + SD + Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677 Query: 689 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSD 748 S S + +DS + S + +S + S + SD S + S + +DS Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737 
Query: 749 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDT 793 + S + S + S + S + S S + +D+ Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782 Score = 39.7 bits (92), Expect = 2e-04 Identities = 105/585 (17%), Positives = 195/585 (33%) Query: 207 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 266 S + +DS + S + +S + S SD + S + DS Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257 Query: 267 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 326 + S + DS + S + SD + S + +DS + S + + Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317 Query: 327 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 386 S + S + SD + S + DS + S + DS + S Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377 Query: 387 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSD 446 + SD + S + +DS + S + +S + S + SD + Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437 Query: 447 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 506 S + DS + S + DS + S + SD + S S + +S Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497 Query: 507 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 566 + S + S + S + ++SD + S S + ++S + S + + Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557 Query: 567 SDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 626 S + S + SD + + + SDS + S + S + S Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617 Query: 627 SDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 686 + S + S S + +DS A S + +S + S + SD + Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677 Query: 687 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSD 746 S S + +DS + S + +S + S + SD S S + + +DS Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737 Query: 747 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 791 + S + 
S + S + S + S S + +DS Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782 Score = 39.0 bits (90), Expect = 2e-04 Identities = 112/620 (18%), Positives = 206/620 (33%) Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229 G S A ++ + S SD + S + DS + S + DS Sbjct: 211 GYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDS 270 Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289 + S + SD + S + +DS + S + +S + S + Sbjct: 271 SLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTA 330 Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349 SD + S + DS + S + DS + S + SD + S Sbjct: 331 QKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGS 390 Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409 + +DS + S + +S + S + SD + S + DS + Sbjct: 391 TGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIA 450 Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469 S + DS + + + SD + S S + +S + S + S Sbjct: 451 GYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGS 510 Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529 + S + ++SD + S S + ++S + S + +S + S + Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570 Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589 SD + S + SDS + S + S + S + S + S Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630 Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649 + + +DS + S + +S + S + SD + S S + +DS + Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690 Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709 S + +S + S + SD S S S + +DS + S + S Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750 Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769 + S + S + S S A +DS + S + S + S + Sbjct: 751 
SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810 Query: 770 DSDSDSDSDSDSDSDSDSDS 789 SD + S S + +DS Sbjct: 811 QERSDLTTGYGSTSTAGADS 830 Score = 38.6 bits (89), Expect = 3e-04 Identities = 105/593 (17%), Positives = 196/593 (33%) Query: 201 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 260 D D+ +S S + + + S S + S + S + S Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201 Query: 261 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 320 + +DS + S + +S + S SD + S + DS + Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261 Query: 321 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 380 S + DS + S + SD + S + +DS + S + +S Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321 Query: 381 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSD 440 + S + SD + S + DS + S + DS A S + Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381 Query: 441 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 500 SD + S + +DS + S + +S + S + SD + S Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441 Query: 501 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 560 + DS + S + DS + S + SD + S S + +S + Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501 Query: 561 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSD 620 S + S + S + ++SD + S S + ++S + S + +S Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561 Query: 621 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSD 680 + S + SD + S + SDS + S + S + S + Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621 Query: 681 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSD 740 S + S S + +DS + S + +S + S + SD + + S Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681 Query: 741 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDT 793 + 
+DS + S + +S + S + SD S S S + +D+ Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADS 734 Score = 38.6 bits (89), Expect = 4e-04 Identities = 110/610 (18%), Positives = 203/610 (33%) Query: 182 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 241 +S + S SD + S + DS + S + DS + S Sbjct: 221 ESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQ 280 Query: 242 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 301 + SD + S + +DS + S + +S + S + SD + Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340 Query: 302 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 361 S + DS + S + DS + S + SD + S + +DS Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400 Query: 362 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 421 + S + +S + S + SD + S + DS + S + Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460 Query: 422 DSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 481 DS + S + SD + S S + +S + S + S + S Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520 Query: 482 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 541 + ++SD + S S + ++S + S + +S + S + SD + Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580 Query: 542 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDS 601 S + SDS + S + S + S + S + + S + +DS Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640 Query: 602 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDS 661 + S + +S + S + SD + S S + +DS A S + Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700 Query: 662 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 721 +S + S + SD S S S + +DS + S + S + S Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760 Query: 722 DSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 781 + S + S + + +DS + S + S + S + SD + Sbjct: 
761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820 Query: 782 DSDSDSDSDS 791 S S + +DS Sbjct: 821 GSTSTAGADS 830 Score = 38.2 bits (88), Expect = 4e-04 Identities = 105/593 (17%), Positives = 196/593 (33%) Query: 199 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 258 D D+ +S S + + + S S + S + S + S Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201 Query: 259 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 318 + +DS + S + +S + S SD + S + DS + Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261 Query: 319 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 378 S + DS + S + SD + S + +DS + S + +S Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321 Query: 379 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSD 438 + S + SD + S + DS + S + DS + S + Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381 Query: 439 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 498 SD + S + +DS + S + +S + S + SD + S Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441 Query: 499 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 558 + DS + S + DS + S + SD + S S + +S + Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501 Query: 559 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSD 618 S + S + S + ++SD + + S + ++S + S + +S Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561 Query: 619 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSD 678 + S + SD + S + SDS A S + S + S + Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621 Query: 679 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAD 738 S + S S + +DS + S + +S + S + SD + S + Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681 Query: 739 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 791 + +DS + S + +S + S + SD S 
S S + +DS Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADS 734 Score = 37.8 bits (87), Expect = 6e-04 Identities = 115/619 (18%), Positives = 210/619 (33%) Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229 GS A ADS + S + +S + S + SD + S + DS Sbjct: 293 GSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSL 352 Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289 + S + DS + S + SD + S + +DS + S + Sbjct: 353 IAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGE 412 Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349 +S + S + SD + S + DS + S + DS + S Sbjct: 413 ESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQ 472 Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409 + SD + S S + +S + S + S + S + ++SD + Sbjct: 473 TAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGY 532 Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469 S S + ++S + S + +S + S + SD + S + SDS Sbjct: 533 GSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSI 592 Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529 + S + S + S + S + S S + +DS + S + Sbjct: 593 IAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGY 652 Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589 +S + S + SD + S S + +DS + S + +S + S Sbjct: 653 NSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 712 Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649 A SD S S S + +DS + S + S + S + S + Sbjct: 713 TAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGY 772 Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709 + S + +DS + S + S + S + SD + S S + +DS Sbjct: 773 GSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSL 832 Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769 + S + +S + S + +SD + S S + DS + S + Sbjct: 833 
IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGY 892 Query: 770 DSDSDSDSDSDSDSDSDSD 788 +S + S + +SD Sbjct: 893 NSILTAGYGSTQTAQENSD 911 Score = 37.0 bits (85), Expect = 0.001 Identities = 93/530 (17%), Positives = 172/530 (32%) Query: 271 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 330 D D+ +S S + + + S S + S + S + S Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201 Query: 331 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 390 + +DS + S + +S + S SD + S + DS + Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261 Query: 391 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSD 450 S + DS + S + SD + S + ADS + S + +S Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321 Query: 451 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 510 + S + SD + S + DS + S + DS + S + Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381 Query: 511 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 570 SD + S + +DS + S + +S + S + SD + S Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441 Query: 571 SDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 630 + DS + S + D+ + S + SD + S S + +S + Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501 Query: 631 SDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 690 S + S + S A ++SD + S S + ++S + S + +S Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561 Query: 691 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSD 750 + S + SD + S + SDS + S + S + S + Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621 Query: 751 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDTEPQANND 800 S + S S + +DS + S + +S + S Q +D Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSD 671 Score = 35.5 bits (81), Expect = 0.003 Identities = 94/549 (17%), Positives = 178/549 (32%) Query: 257 
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 316 D D+ +S S + + + S S + S + S + S Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201 Query: 317 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 376 + +DS + S + +S + S SD + S + DS + Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261 Query: 377 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSD 436 S + DS + S + SD + S + +DS + S A +S Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321 Query: 437 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 496 + S + SD + S + DS + S + DS + S + Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381 Query: 497 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 556 SD + S + +DS + S + +S + S + SD + S Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441 Query: 557 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSD 616 + DS + S + DS + S + SD + S S + +S + Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501 Query: 617 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSD 676 S + S + S + ++SD + S + + ++S + S + +S Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561 Query: 677 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 736 + S + SD + S + SDS + S + S + S + Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621 Query: 737 ADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDTEPQ 796 + + S S + +DS + S + +S + S + SD + + Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681 Query: 797 ANNDTHTAA 805 A D+ A Sbjct: 682 AGADSSLIA 690
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.046 Identities = 15/102 (14%), Positives = 43/102 (42%), Gaps = 7/102 (6%) Query: 147 SSVRAADAAVAQQQAMVMLNIDQVAHDTAGAVVQLQGYQKLVKIAQAQVDSLKHIGDLIR 206 S ++ + Q+ LN+D+ + + ++ Y+ L ++ ++++D Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF-------S 241 Query: 207 QRNDAGATSLSDVVQTDTRVEGAQATLIQYQAALERWKATLA 248 A + V++ + + A L Y++ LE+ ++ + Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL 283
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 263 bits (674), Expect = 6e-86 Identities = 105/462 (22%), Positives = 187/462 (40%), Gaps = 64/462 (13%) Query: 6 AAIFPLVKELDPVAAMADNER---DEAELV------KSRRLIALLALLLVVTGVWAWFAT 56 + + + K+LD D EL+ + R + + LV+ + + Sbjct: 20 SETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79 Query: 57 LDEVSTGTGKVIPSSREQVLQTLDGGILTELNVREGSRVAAGQVVARLDPTRSESNVGES 116 ++ V+T GK+ S R + ++ ++ I+ E+ V+EG V G V+ +L +E++ ++ Sbjct: 80 VEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139 Query: 117 QAKYRASLAASIRLTA-----EVNNQPLIFPPSLKAWPGLLAEE-TRLYHSRREQLTKSM 170 Q+ + R E+N P + P + + EE RL +EQ + Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199 Query: 171 RQLDQ------------------------SLSLVNSELAINEKLAKTGAASNVEVL---- 202 Q Q + S L L A + VL Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259 Query: 203 -----------------RLRQQAADIELKKIDLNTRYYVDAREQLSKANADVASLAEVIK 245 ++ + + + + + + ++L + ++ L + Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELA 319 Query: 246 GRADSVARLTVRSPVQGIVKNIKVNTIGGVIAPNGELMDIVPIDGRLLIEARISPRDIAF 305 + +R+PV V+ +KV+T GGV+ LM IVP D L + A + +DI F Sbjct: 320 KNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGF 379 Query: 306 IHPDQKALVKITAYDYAIYGALNGVVETISPDTIQDEAKPDVYYYRVFIRTDHNYLENKR 365 I+ Q A++K+ A+ Y YG L G V+ I+ D I+D+ V+ V I + N L Sbjct: 380 INVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFN--VIISIEENCLS-TG 436 Query: 366 GKRFLIGPGMIATVDIKTGEKTVMDYLVKPF-NRAKEALRER 406 K + GM T +IKTG ++V+ YL+ P E+LRER Sbjct: 437 NKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 74.7 bits (183), Expect = 6e-18 Identities = 44/184 (23%), Positives = 73/184 (39%), Gaps = 23/184 (12%) Query: 4 LPARPESLTFEPQQSALIVVDMQNAYASQGGYLDLAGFDVSATRPVIDNINTAVAAARAA 63 +P S +P ++ L++ DMQN + +D S + NI Sbjct: 17 MPQNKVSWVPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQL 70 Query: 64 GMLIIWFQNGWDDQYVEAGGPGSPNYHKSNALKTMRQRPELQGKLLAKGGWDYQLVDELT 123 G+ +++ PGS N L G L G ++ +++ EL Sbjct: 71 GIPVVY-----------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELA 113 Query: 124 PQEGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGIV 183 P++ D+VL K RYS F T L ++R G L+ TGI ++ T + F + Sbjct: 114 PEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFF 173 Query: 184 LEDA 187 + DA Sbjct: 174 VGDA 177
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.0 bits (158), Expect = 4e-15 Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 10/166 (6%) Query: 10 GKRSQAVSAKKEAILAAALEAFSQFGIHGTRLEQVAERAGVSKTNLLYYYPSKEALYVAV 69 K Q ++ IL AL FSQ G+ T L ++A+ AGV++ + +++ K L+ + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 70 LQQILAIWLAPLKAFREDI--SPLVAIREYIRLKLEVSRDHPQASKLF------CLEMLQ 121 + + ++ PL +RE + LE + + +L E + Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-TEERRRLLMEIIFHKCEFVG 121 Query: 122 GAPLLMGELTGDLKALVDEKSAIVSGWIDRGKL-APVDPQHLIFMI 166 ++ D + I+ L A + + ++ Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 28.2 bits (63), Expect = 0.028 Identities = 6/43 (13%), Positives = 15/43 (34%) Query: 60 FYGRHQAGILTPQQASMMLVAFDVLAADKADLERLFRLLTQRI 102 + L + S + A+ + +K + L R + + Sbjct: 74 NPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLL 116
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.9 bits (80), Expect = 5e-04 Identities = 57/351 (16%), Positives = 117/351 (33%), Gaps = 51/351 (14%) Query: 61 YALGILFLLPLGDRHDRRRLILVKSALLALLLLLCSLTGQLSSLLVVSLLI---GMAATM 117 +++G L D+ +RL+L + ++ + SLL+++ I G AA Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121 Query: 118 AQDIVPAAAILAPAGKQGKMVGTVMTGLLLGILLSRTVSGVVGAVFGWRVMYQAAAVSVA 177 A +V A + +GK G + + + +G + + G++ W + +++ Sbjct: 122 ALVMVVVARYIPKE-NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITII 180 Query: 178 --------------------LIGLVMWRVLPRFAVHSTLSYPQLMASMA----------- 206 + G+++ V F + T SY ++ Sbjct: 181 TVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHI 240 Query: 207 ----------HLWQRYPALRRAALAQGALSIAFSAFWSTLAVMLSEHYHMGSAVAGGFGI 256 L + P + G + + F S + M+ + + + +A G I Sbjct: 241 RKVTDPFVDPGLGKNIPFMIGVLCG-GIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII 299 Query: 257 --AGAAGALAAPLAGGLADKFGAGKVTQMGAALVTLSFALMFMLPLLPVHAQLALIALSA 314 + + + G L D+ G V +G +++SF L +I Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL 359 Query: 315 IGFDLGLQSSLVAHQNLVYGLEPQARGRLNALLFTVVFIGMSLGSVLGSKL 365 G + V + L+ Q G +LL F+ G + L Sbjct: 360 GGLSF---TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 156 bits (395), Expect = 4e-49 Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 13/250 (5%) Query: 4 EGKIALVTGASRGIGRAIAETLVARGAKVIGTATSESGAQAISDYLGANGK---GLMLNV 60 EGKIA +TGA++GIG A+A TL ++GA + + + + L A + +V Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 61 TDPASIESVLENVRAEFGEVDILVNNAGITRDNLLMRMKDDEWNDIIETNLSSVFRLSKA 120 D A+I+ + + E G +DILVN AG+ R L+ + D+EW N + VF S++ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 121 VMRAMMKKRHGRIITIGSVVGTMGNAGQANYAAAKAGLIGFSKSLAREVASRGITVNVVA 180 V + MM +R G I+T+GS + A YA++KA + F+K L E+A I N+V+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 181 PGFIETDMTRAL-----TDEQR-AGTLA----AVPAGRLGTPNEIASAVAFLASDEASYI 230 PG ETDM +L EQ G+L +P +L P++IA AV FL S +A +I Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 231 TGETLHVNGG 240 T L V+GG Sbjct: 247 TMHNLCVDGG 256
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.7 bits (61), Expect = 0.005 Identities = 16/48 (33%), Positives = 25/48 (52%) Query: 10 KIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPD 57 ++ G Q + QEE+ N F+E L ++ L +E+F TEI D Sbjct: 380 QLTGSQRALSQEEIQNKIDFMEFLAQNNAKLDNLSEKEKEKFRTEIKD 427
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 26.6 bits (58), Expect = 0.045 Identities = 17/64 (26%), Positives = 26/64 (40%) Query: 46 VEKQGLTVGIIILTIGVMAPIASGTLPPSTLIHSFMNWKSLLAIAVGVFVSWLGGRGVSL 105 V Q + L IG + + LPPS ++ N ++ A VS LG ++L Sbjct: 171 VTVQRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTL 230 Query: 106 MGSQ 109 G Sbjct: 231 DGGH 234
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 123 bits (309), Expect = 3e-36 Identities = 71/254 (27%), Positives = 121/254 (47%), Gaps = 10/254 (3%) Query: 2 SKKLADKVALVTGGSAGIGLASAKALAEQGAKVY---ITGRRQEELDAAVRFIGPAARAI 58 +K + K+A +TG + GIG A A+ LA QGA + + E++ ++++ A A Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62 Query: 59 RADAAVLSDLDAVFATIAEESGRLDVLFANAGGGDMLPLSAITEAHVDRIFATNVRGVVF 118 AD + +D + A I E G +D+L AG + ++++ + F+ N GV Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 119 TVQKALPLLAD--GASVILTGSTAAVKGTANFSIYSASKAAVRSLARSWALEVSDRGIRI 176 + + D S++ GS A + + Y++SKAA + LE+++ IR Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 177 NVVSPGPVRTPGLGGLVAEADRQ-----GLFDALAAGVPLGRLGEPEEIGRTVVFLASDE 231 N+VSPG T L A+ + G + G+PL +L +P +I V+FL S + Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 232 SSFINAAEIYVDGG 245 + I + VDGG Sbjct: 243 AGHITMHNLCVDGG 256
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 47.2 bits (112), Expect = 6e-08 Identities = 33/142 (23%), Positives = 60/142 (42%), Gaps = 1/142 (0%) Query: 40 LSALAADFHQTESGVGLAVTAYGWVGALAALLSGAMPARISRKALLVGLMLILALSCLAA 99 L +A DF++ + TA+ ++ + G + ++ K LL+ ++I + Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 100 TRSYSMFA-LMSARMIGALAHGAFWALIGIVAAQLVPPHRLGLATAIIFGGVSAASVVGV 158 +S F+ L+ AR I AF AL+ +V A+ +P G A +I V+ VG Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 159 PLASFIATLAGWRLAFMSMALL 180 + IA W + + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMIT 178
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.029 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 4/44 (9%) Query: 11 VRRQPLLQEVAFSVAPG----EVLTLMGPSGSGKSTLFAWMIGA 50 V + L+ VA + PG + L G G GKSTL ++G Sbjct: 576 VGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 47.9 bits (114), Expect = 2e-08 Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 11/83 (13%) Query: 2 MRIFLTGASGFIGSRILPALQASGHQVIGL---------ARSESTAQALKAAGAEVHRGT 52 M+ +TGA+GFIG + L +GHQV+G+ + ++ + L G + H+ Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 53 LDAPESL--LAGVGNADAVIHTA 73 L E + L G+ + V + Sbjct: 61 LADREGMTDLFASGHFERVFISP 83
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 87.2 bits (216), Expect = 1e-20 Identities = 77/398 (19%), Positives = 156/398 (39%), Gaps = 22/398 (5%) Query: 35 VINV-VPAMKSSLDISLETLTLAVSLSALFSGCFVVASGGLADKFGRMRMTTLGLGLSIV 93 V+NV +P + + + + + L G L+D+ G R+ G+ ++ Sbjct: 32 VLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF 91 Query: 94 GSAMLVVAQGP-GLFLAGRVLQGLSAACIMPATLALIKTWYEGRARQRAVSFWVIGSWGG 152 GS + V L + R +QG AA + ++ + R +A G Sbjct: 92 GSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMG 151 Query: 153 SGLCSFVGGAIATGLGWRWIFVFSIAVALLALFLLRGTPESRSASASQHKLDVGGLLSLI 212 G+ +GG IA + W ++ + + + FL++ + H D+ G++ + Sbjct: 152 EGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKK--EVRIKGH-FDIKGIILMS 208 Query: 213 VALVLVNLFISKGHGWGWSSPLSLTMLAGALAAGTIFIRNGMRKGEAALIDFALFSNRAY 272 V +V LF + + + L+ L IF+++ +RK +D L N + Sbjct: 209 VGIVFFMLF-TTSYSISFLIVSVLSFL--------IFVKH-IRKVTDPFVDPGLGKNIPF 258 Query: 273 GAAVLSNFLLNGAI-GTMMIASIWLQQGHHLTPLESGMMTLGYLVTVLAMIR--VGEKLL 329 VL ++ G + G + + ++ H L+ E G + + + T+ +I +G L+ Sbjct: 259 MIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII-FPGTMSVIIFGYIGGILV 317 Query: 330 QRYGARLPMMAGPVLTAIAIALISCTFLEKALYIGVVFASNVLFGLGLGCYATPSTDTAV 389 R G + G +++ L + LE + + + G GL T + Sbjct: 318 DRRGPLYVLNIGVTFLSVSF-LTASFLLETTSWF-MTIIIVFVLG-GLSFTKTVISTIVS 374 Query: 390 ANAPENKIGVASGIYKMGSSLGGAMGIAVTVSLFTLFL 427 ++ + + G + S L GIA+ L ++ L Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 344 bits (884), Expect = e-118 Identities = 127/346 (36%), Positives = 183/346 (52%), Gaps = 26/346 (7%) Query: 7 AQYKDNLLGEANSFLEVLEQVSRLAPLDKPVLVIGERGTGKELIANRLHYLSSRWQGPFI 66 +Q L+G + + E+ ++RL D +++ GE GTGKEL+A LH R GPF+ Sbjct: 133 SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFV 192 Query: 67 SLNCAALNDNLLDSELFGHEAGAFTGASKRHPGRFERADGGTLFLDELATAPMLVQEKLL 126 ++N AA+ +L++SELFGHE GAFTGA R GRFE+A+GGTLFLDE+ PM Q +LL Sbjct: 193 AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLL 252 Query: 127 RVIEYGELERVGGSQPLQVNVRLVCATNADLPQMVEEGHFRADLLDRLAFDVVQLPPLRD 186 RV++ GE VGG P++ +VR+V ATN DL Q + +G FR DL RL ++LPPLRD Sbjct: 253 RVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRD 312 Query: 187 RQSDIMLLANQFAIQMCRELGLPLFPGFSERATATLLGYRWPGNIRELKNVVERSVYRHG 246 R DI L F Q +E F + A + + WPGN+REL+N+V R + Sbjct: 313 RAEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYP 370 Query: 247 DSE--------HELDAIIINPFRQSPG---------------SPPEAAPGDELPALPLDL 283 I +P ++ A+ GD LP L Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL-Y 429 Query: 284 RDFQFQQEKRLLQRSLEQAKYHQKQAAELLGLTYHQLRALLKKHQL 329 + E L+ +L + +Q +AA+LLGL + LR +++ + Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 307 bits (789), Expect = e-101 Identities = 112/377 (29%), Positives = 172/377 (45%), Gaps = 38/377 (10%) Query: 174 VLTGAVAMLRSTVRMGRQLQTMTSQDTSAFSQILAVGPKMRHVVEQARKLAMLSAPLLIV 233 LT + ++ + ++ + D+ ++ M+ + +L L+I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 234 GDTGTGKDLLAHACHLASPRAGKPYLALNCGSIPEDAVESELFG-------DALQGKKGF 286 G++GTGK+L+A A H R P++A+N +IP D +ESELFG A G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 287 FEQANGGSVLLDEIGEMSPRMQTKLLRFLNDGTFRRVGEDHEVHVDVRVICATQKNLIEL 346 FEQA GG++ LDEIG+M QT+LLR L G + VG + DVR++ AT K+L + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 347 VQKGLFREDLYYRLNVLTLYLPPLRDCPQDIMPLTELFVARFADEQGIPRPKLSGDLSTV 406 + +GLFREDLYYRLNV+ L LPPLRD +DI L FV + E G+ + + + Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 407 LTRYSWPGNVRQLKNAVYRALTQLEGFELRPQDILLP---------------DHDVASLP 451 + + WPGNVR+L+N V R + + I S+ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 452 VGEEAM--------------EGSLDDITRRFERSVLTQ-LYRSYPSTRKLAKRLGVSHTA 496 E G D + E ++ L + + K A LG++ Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 497 IANKLREYGLSQKKGDE 513 + K+RE G+S + Sbjct: 466 LRKKIRELGVSVYRSSR 482
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.5 bits (74), Expect = 0.003 Identities = 21/113 (18%), Positives = 45/113 (39%), Gaps = 2/113 (1%) Query: 76 RKWLLLGLTALMAASGVIIALASSFPVYMLGRALIGIVIGGFWSMSAATAIRLVPQRQVP 135 ++ LL G+ S + S F + ++ R + G F ++ R +P+ Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 136 RALAIFNGGNALATVVAAPLGSYLGATVGWRGAFLCLVPLAVLAFVWQCISLP 188 +A + A+ V +G + + W ++L L+P+ + V + L Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFLMKLL 189
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 72.8 bits (178), Expect = 2e-17 Identities = 64/254 (25%), Positives = 102/254 (40%), Gaps = 15/254 (5%) Query: 6 KIALVTGGSRGLGRATVEALAQRGVNVVLTYKTRLAEANEVVTRVEALGARAIALPFSAG 65 KIA +TG ++G+G A LA +G ++ + +VV+ ++A A A P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 66 EIDTFDAFVSAFQGALTELGADKFDYLVNNAGNASGMGFLNATEAEFDALYRIHVKSVFF 125 D D+ A E D LVN AG + ++ E++A + ++ VF Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 126 LSQKLLPLLAD--GGRIVNVSSGLTRIVMANRAPYAIMKSAVETLTRYMAFELGSRGITV 183 S+ + + D G IV V S + + A YA K+A T+ + EL I Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 184 NCVAPGAIATDFSGGVVRDNPQVAQAVANMTA-------LGRPGLPEDIGPMIASLLSDD 236 N V+PG+ TD + D Q + L + P DI + L+S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 237 HRWVNAQRIEVSGG 250 + + V GG Sbjct: 243 AGHITMHNLCVDGG 256
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 91.5 bits (227), Expect = 3e-22 Identities = 45/146 (30%), Positives = 63/146 (43%), Gaps = 12/146 (8%) Query: 416 PPPPPRPVQRVAPNVIRLDSMSLFDTGKWVLKPGSTKRL--VSSLMDIKARPGWLIVVAG 473 P P P V L S LF+ K LKP L + S + +VV G Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLG 259 Query: 474 HTDSVGEEKANQLLSLKRAESVRDWMRDTGDVPDSCFAVQGYGESRPIATNDT------- 526 +TD +G + NQ LS +RA+SV D++ G +P + +G GES P+ N Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDYLISKG-IPADKISARGMGESNPVTGNTCDNVKQRA 318 Query: 527 --PEGRALNRRVEISLVPQVDACRLP 550 + A +RRVEI + D P Sbjct: 319 ALIDCLAPDRRVEIEVKGIKDVVTQP 344
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.020 Identities = 30/105 (28%), Positives = 42/105 (40%), Gaps = 17/105 (16%) Query: 580 GKRVVGQQAALSAIARRL-RAAKTGLTPENGPQGVFLLVGPSGTGKTETALALADALFGG 638 G +VG+ AA+ I R L R +T LT ++ G SGTGK A AL D Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLT--------LMITGESGTGKELVARALHDYGKRR 187 Query: 639 EKALITINLSEYQEPHTVSQLKGSPPGYVGYGQGGILTEAVRKRP 683 + IN++ S+L G + G T A + Sbjct: 188 NGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRST 224
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 81.6 bits (201), Expect = 9e-21 Identities = 64/249 (25%), Positives = 111/249 (44%), Gaps = 24/249 (9%) Query: 7 KSVLVLGGSRGIGAAIVRRFVADGASVVFSYSGSPQAAERLAAETGSTA-----VQADSA 61 K + G ++GIG A+ R + GA + + +P+ E++ + + A AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 62 DRDAVISLV----RDSGPLDVLVVNAGIALFGDALEQDSDAIDRLFRINIHAPYHASVEA 117 D A+ + R+ GP+D+LV AG+ G + + F +N ++AS Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 118 ARRMP--EGGRIIVIGSVNGDRMPLPGMAAYALSKSALQGLARGLARDFGPRGITVNVVQ 175 ++ M G I+ +GS N +P MAAYA SK+A + L + I N+V Sbjct: 128 SKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 176 PGPIDTDA--------NPENGPMKELMHSF---MAIKRHGRPEEVAGMVAWLAGPEASFV 224 PG +TD N +K + +F + +K+ +P ++A V +L +A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 225 TGAMHTIDG 233 T +DG Sbjct: 247 TMHNLCVDG 255
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 44.2 bits (104), Expect = 7e-08 Identities = 20/122 (16%), Positives = 47/122 (38%), Gaps = 2/122 (1%) Query: 7 GRTPGRPRQFDAEQAIETAQRLFHARGYDAVSVADLTHAFGINPPSFYAAFGSKLGLYTR 66 +T ++ + ++ A RLF +G + S+ ++ A G+ + Y F K L++ Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 67 VLQR-YSQIGAIPIDALLRDDQPVAASLIAVLQEAARRYVADPAAAGCLVLEGVHCQDAD 125 + + S IG + ++ + + L +L V + + + C+ Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121 Query: 126 AR 127 Sbjct: 122 EM 123
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 103 bits (259), Expect = 6e-29 Identities = 67/258 (25%), Positives = 110/258 (42%), Gaps = 16/258 (6%) Query: 3 KIALITGANRGLGRQTALDIARQGGDVIVTYRGSLEQAEAVVADIRALGRKAIALPLDMA 62 KIA ITGA +G+G A +A QG + + E+ E VV+ ++A R A A P D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 63 QTASFPAFADSLGSALASVWGRATFDHLINNAGHGEFAPLAETREAQFDGLFNVHVKGVF 122 +A+ + + D L+N AG + + +++ F+V+ GVF Sbjct: 68 DSAAIDEITARIEREMGP------IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121 Query: 123 FLVQTLLPLLAD--GGRIVNFSSGLTRVSYPGFSAYAAAKAAVEMLSVYMARELGGRGIT 180 +++ + D G IV S V +AYA++KAA M + + EL I Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 181 VNTIAPGAIATDFGGGL-VRDDAEVN------AQFAAMTALGRVGVPEDIGPMIASLLRD 233 N ++PG+ TD L ++ F L ++ P DI + L+ Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 234 DNRWVTAQRIEVSGGQTI 251 +T + V GG T+ Sbjct: 242 QAGHITMHNLCVDGGATL 259
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 40.2 bits (94), Expect = 6e-06 Identities = 29/129 (22%), Positives = 47/129 (36%), Gaps = 19/129 (14%) Query: 6 TVLVFGATGQQGGSVARALLHRGWRVRALVRDPFSAG---------AAALAARGAELVVG 56 LV GA G G V++ LL G +V + D + LA G + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI--DNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 57 TFEDRAAMRSAMA--GVDGVF------SVQPSSPGGTVTDEQEVRYGITIADLAVECGVK 108 DR M A + VF +V+ S + + + I + ++ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 109 HLVYSSGSA 117 HL+Y+S S+ Sbjct: 120 HLLYASSSS 128
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 58.9 bits (142), Expect = 3e-13 Identities = 28/166 (16%), Positives = 58/166 (34%), Gaps = 12/166 (7%) Query: 1 MRADARKNYDLLIEVARDVFVEQGAEA-SLRDIARRAGVGMGTLYRHFPNRDSLLEALLR 59 + +A++ +++VA +F +QG + SL +IA+ AGV G +Y HF ++ L + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 60 SRFAALTARAESLL------LAADPAAALLEWLAESVAFTHQHRGIIAPLMSAIDDPESA 113 + + + L+ L +V + + E A Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124 Query: 114 L-----HSACVALRAAGTSLLTRAQQAGLARPDLSGEELFDLIAAL 154 + + C+ L +A + DL ++ Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.3 bits (60), Expect = 0.002 Identities = 9/17 (52%), Positives = 13/17 (76%) Query: 1 MKKLLLAMAASMLLAGC 17 MKK+L + A +ML+ GC Sbjct: 6 MKKMLFSAALAMLITGC 22
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.001 Identities = 35/159 (22%), Positives = 66/159 (41%), Gaps = 27/159 (16%) Query: 59 ILSWL--SFSLTFFIRPIGGVIFAHIGDRIGRKKTLVLTLSLMGSATVAIGLLPTYEMVG 116 +W+ +F LTF IG ++ + D++G K+ L+ + + +V VG Sbjct: 50 STNWVNTAFMLTF---SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI-------GFVG 99 Query: 117 LWAPALLIILRIIQGMGIGGEWGGALLLAYEYAPEKRK----GFFGSIPQAGVTIGMLMA 172 +LLI+ R IQG G +++ Y P++ + G GSI G +G + Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159 Query: 173 TFIVSLMTLFDEAQFLAWGWRIPFLLSSVLVFLGLWIRK 211 I A ++ W + L+ + + ++ K Sbjct: 160 GMI---------AHYIHWSYL--LLIPMITIITVPFLMK 187
>PF01206#SirA family protein Length = 76 Score = 92.9 bits (231), Expect = 4e-29 Identities = 16/71 (22%), Positives = 38/71 (53%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPSLQKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + ++ GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 31.3 bits (71), Expect = 0.008 Identities = 25/134 (18%), Positives = 52/134 (38%), Gaps = 13/134 (9%) Query: 316 AQSQALLAKPELAQNPELYQQALTETLFNALPILLKGNPSVTISPLS-WRNAKGESTLNL 374 +Q + K + EL +++L A+ I+ ++ P R A + LN Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQ--KAVDIINNLGSALQSRPFEFVRRADPANILNF 131 Query: 375 SVLLKDPAQVTAPPQTLADSLDRVVQSLDGKVV--IPVDMATEFMTKIAGLEGYQPADAA 432 ++ PQT+A L + ++ +P ++ T +IA ++ P Sbjct: 132 ---IQQEH-----PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVR 183 Query: 433 KLADQQVKGLAAMG 446 ++ K LA++ Sbjct: 184 EVERVLEKKLASLS 197
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.0 bits (67), Expect = 0.010 Identities = 14/75 (18%), Positives = 26/75 (34%), Gaps = 3/75 (4%) Query: 172 AVREAVEAGTVTVTQARQLASLKPEEQREKVSEIEAATAGTTGHEKARRQRQILGEAKPR 231 V EA T + E +++ +E T E + R++ EAK Sbjct: 1019 RVDEAPVPPPAPATPSET-TETVAENSKQESKTVEKNEQDAT--ETTAQNREVAKEAKSN 1075 Query: 232 LKTRKEIIKALESAE 246 +K + + +S Sbjct: 1076 VKANTQTNEVAQSGS 1090
>cloacin#Cloacin signature. Length = 551 Score = 28.9 bits (64), Expect = 0.042 Identities = 26/118 (22%), Positives = 43/118 (36%), Gaps = 13/118 (11%) Query: 37 WNAAKSELDALDERIAREEELRRQDQDYIHE---NEPEQRQQQNRDPANPEAQANERRA- 92 +N+ KSELDA ++ +A +Q + H+ Q + N ++A Sbjct: 351 YNSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAA 410 Query: 93 --AAFNAFLRRGLGEMSAEERQALKELRAQGTTPDEKGGYTVPTQFRNKIVEALKDYG 148 AA SA E + KE + ++ +NK + KDYG Sbjct: 411 FDAAAKEKSDADAALSSAMESRKKKEDK-------KRSAENNLNDEKNKPRKGFKDYG 461
>adhesinb#Adhesin B signature. Length = 310 Score = 27.5 bits (61), Expect = 0.018 Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%) Query: 57 PEGQEPQEAPELTPSERAFRTMRADVTLFIDILLDTDLHPVFT 99 P GQ+P E E P + T +AD+ + I L+T + FT Sbjct: 62 PVGQDPHEY-EPLPEDVKK-TSQADLIFYNGINLETGGNAWFT 102
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 107 bits (269), Expect = 1e-29 Identities = 27/234 (11%), Positives = 70/234 (29%), Gaps = 47/234 (20%) Query: 26 LSAKDIKTLFFGHDDRKAVNRPEESPWDAIGQLET---ASGNLCTATLISPHLALTAGHC 82 L ++ + ++DR + + + ++ + + ++ LT H Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120 Query: 83 LLTPPRGKPDKAVALRFI------SRKGNWVYE---IHGIDGRVDPSLGRRLKADGDGWI 133 + AL+ N + I G D ++ + + Sbjct: 121 V----DATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQN-- 173 Query: 134 VPSAAAPSDFGLIVLRYAPSGIAPIPLFPGSKADLTAALKAADRKVTQSGYPEDH-LDNL 192 ++ + P + A ++ +T +GYP D + + Sbjct: 174 ---------------KHIGEVVKPATM-------SNNAETQVNQNITVTGYPGDKPVATM 211 Query: 193 YSHQDCIVTGWAQTSVLSHQCDTLPGDSGSPLLLKTEDGWQVIAVQSSAPGPQD 246 + + + + + + T G+SGSP+ + +VI + + Sbjct: 212 WESK--GKITYLKGEAMQYDLSTTGGNSGSPVF---NEKNEVIGIHWGGVPNEF 260
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.002 Identities = 21/125 (16%), Positives = 39/125 (31%), Gaps = 9/125 (7%) Query: 16 SSAAFAADAVSTTQAPAATHSTAAKTTHHKKHHKA--AAKPAAEQKAQAAKKHKKAEAKP 73 A A + A A + A T ++ + + + A K+ +AK Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114 Query: 74 AAAQKAQAAKKHKKVAAKPAAPQKAQAAKKHHKAAAKPAAQKAQAAKKHHKTTKHQAAKP 133 + + K +V+ K + Q A+PA + ++ Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQ-------AEPARENDPTVNIKEPQSQTNTTAD 1167 Query: 134 TAQPA 138 T QPA Sbjct: 1168 TEQPA 1172 Score = 29.6 bits (66), Expect = 0.005 Identities = 16/117 (13%), Positives = 31/117 (26%), Gaps = 7/117 (5%) Query: 23 DAVSTTQAPAATHSTAAKTTHHKKHHKAAAKPAAEQKAQAAKKHKKAEAKPAAAQKAQAA 82 V TT + A + + A A A P+ + A Sbjct: 990 QTVDTTNITTPNNIQADVP-------SVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042 Query: 83 KKHKKVAAKPAAPQKAQAAKKHHKAAAKPAAQKAQAAKKHHKTTKHQAAKPTAQPAA 139 ++ Q A ++ AK A +A + ++ + + Q Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 53.7 bits (129), Expect = 5e-10 Identities = 62/295 (21%), Positives = 108/295 (36%), Gaps = 15/295 (5%) Query: 55 VQPILPVLSNEFGVSPASSS---ISLSISTAMLAVGLLFTGPLSDAIGRKPVMVTALLLA 111 + P+LP L + S ++ I L++ M G LSD GR+PV++ +L A Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83 Query: 112 ACCSLLSTMMTSWHGILIMRALIGLSLSGVAAVGMTYLSEEIHPSFVAFSMGLYISGNSI 171 A + + I R + G++ AV Y+++ A G + Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142 Query: 172 GGMSGRLLTGVFTDFFGWRVALAAISGFALAAAIMFWRILPES--RHFRPTSLRPKTLLI 229 G ++G +L G+ F A + + +LPES RP L Sbjct: 143 GMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLA 201 Query: 230 NFRLHWRDRGLPLLFVEGFLLM---GAFVTLFN-YIGYRLMMSPWSLSQAVVGLLSVAYL 285 +FR + L F++ L+ + R ++ ++ + L Sbjct: 202 SFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSL 261 Query: 286 TGTWSSPKAGAMTVRFG-RGPVMLGFTAVMLCGLLLTLFSSLWLIFIGMLLFSAG 339 G + R G R +MLG A +LL + W+ F M+L ++G Sbjct: 262 AQAMI---TGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 438 bits (1127), Expect = e-159 Identities = 286/286 (100%), Positives = 286/286 (100%) Query: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60 Query: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120 Query: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA Sbjct: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180 Query: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG Sbjct: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240 Query: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR 286 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR Sbjct: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR 286
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 5e-06 Identities = 64/288 (22%), Positives = 111/288 (38%), Gaps = 39/288 (13%) Query: 33 PFFPVWLADVNHLTK--TETGIVFSSISLFAIIFQPVFGLMSDKLGLRKHLLWTITVLLI 90 P P L D+ H GI+ + +L PV G +SD+ G R LL V L Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLA 81 Query: 91 LFA-PFFIFVFSPLLQMNIIAGSLVGGIYLGIVFSSGSGAVEAYIERVSRANRFEYGKVR 149 A + I +P L + + G +V GI G + + + RA F + Sbjct: 82 GAAVDYAIMATAPFLWV-LYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGF---- 135 Query: 150 VAGCVGWALCAS--ITGVLFGIDPNITFWIASGFALVLGLLLWLSRPESSNS------AQ 201 ++ C G+ + A + G++ G P+ F+ A+ + L PES + Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRRE 195 Query: 202 VIEALGANRQAFSLRTAAELLRMPRFWGFIVYVVG--VASVYDVFDQQFANFFKSFFASP 259 + L + R A + A L+ FI+ +VG A+++ +F + ++ Sbjct: 196 ALNPLASFRWARGMTVVAALM----AVFFIMQLVGQVPAALWVIFGEDRFHW-------- 243 Query: 260 QRGTEVFGFVTTGGELLNALI-MFCAPAIVNRIGAKNALLTAGMIMSV 306 G +L++L + R+G + AL+ GMI Sbjct: 244 --DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMIADG 288
>PF05211#Neuraminyllactose-binding hemagglutinin Length = 260 Score = 27.7 bits (61), Expect = 0.039 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 7/55 (12%) Query: 43 LSLAIGVGELRCVIGPNGAGKTTLMDVITGKTRPQSGKALYDQSVDLTTLDPVAI 97 L + G+ ++ V+ P G K T+++ P SG++L ++DL+ LD Sbjct: 145 LLFSTGLDKMEGVLIPAGFVKVTILE-------PMSGESLDSFTMDLSELDIQEK 192
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 27.7 bits (61), Expect = 0.021 Identities = 14/32 (43%), Positives = 22/32 (68%), Gaps = 1/32 (3%) Query: 19 EQLAEMAGLSVRTIQRIENGER-PGLETLSAL 49 E+ + G+SV + QR++NGER G+E L+ L Sbjct: 23 EETGKHKGVSVISYQRVKNGERNKGIEALNRL 54
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 46.0 bits (109), Expect = 2e-07 Identities = 46/237 (19%), Positives = 89/237 (37%), Gaps = 14/237 (5%) Query: 7 RSTIALLASSLLLTIGRGATLPFMTIYLTRRFQLEVDVI---GYALSLALVVGVLFSMGF 63 R I +L++ L +G G +P + L R DV G L+L ++ + Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 64 GILADRFDKKRYMVWSVLVFILGFSAIPLVNNAPLVVIFFA--LINCAYSVFSTVLKAWF 121 G L+DRF ++ ++ S+ + ++ ++ AP + + + ++ V A+ Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYA---IMATAPFLWVLYIGRIVAGITGATGAVAGAYI 120 Query: 122 ADRLTAEKKARIFSLNYTILNIGWTVGPPIGTLLVMHSINLPFWLAAACAAFPLVFIQLF 181 AD +++AR F G GP +G L+ S + PF+ AAA + Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180 Query: 182 L----QRDGAAAAQPGAAPWSPSVLLRD-RALLWFTCSGLLASFVGGAFASCLSQYV 233 L + + + P + R + + VG A+ + Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237 Score = 39.4 bits (92), Expect = 2e-05 Identities = 26/158 (16%), Positives = 62/158 (39%), Gaps = 2/158 (1%) Query: 4 TLRRSTIALLASSLLLTIGRGATLPFMTIYLTRRFQLEVDVIGYALSLALVVGVLF-SMG 62 AL+A ++ + I+ RF + IG +L+ ++ L +M Sbjct: 207 RGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMI 266 Query: 63 FGILADRFDKKRYMVWSVLVFILGFSAIPLVNNAPLVVIFFALINCAYSVFSTVLKAWFA 122 G +A R ++R ++ ++ G+ + + L+ + L+A + Sbjct: 267 TGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLS 325 Query: 123 DRLTAEKKARIFSLNYTILNIGWTVGPPIGTLLVMHSI 160 ++ E++ ++ + ++ VGP + T + SI Sbjct: 326 RQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 58.7 bits (142), Expect = 2e-11 Identities = 39/156 (25%), Positives = 69/156 (44%), Gaps = 2/156 (1%) Query: 36 LSDIADSFGMETAQVGMMLTIYAWVVALMSLPFMLLTSKVERRRLLIGLFILFIASHVLS 95 L DIA+ F A + T + ++ + + L+ ++ +RLL+ I+ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FFAWN-FDVLVISRIGIAFAHAVFWSITSALAIRMAPPGKRAQALSLIATGTALAMVFGI 154 F + F +L+++R A F ++ + R P R +A LI + A+ G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PIGRIIGQYFGWRMTFLAIGLGALATLACLVKLLPP 190 IG +I Y W L I + + T+ L+KLL Sbjct: 157 AIGGMIAHYIHWSYLLL-IPMITIITVPFLMKLLKK 191
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 60.3 bits (146), Expect = 5e-12 Identities = 81/422 (19%), Positives = 147/422 (34%), Gaps = 45/422 (10%) Query: 1 MNTTANTTRIRWWIAGLMWLAIAINY--IDRTVLSAAAPHLIDELKLDPEMMGFIMAAFF 58 MNT+ + + +R L+WL I + ++ VL+ + P + ++ P ++ AF Sbjct: 1 MNTSYSQSNLRHNQI-LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFM 59 Query: 59 WSYSLLQIPAGWFADRFGQKKGLGLAVAWWSIATSMMGVATGFKSLLAL-RLALGVGEAA 117 ++S+ G +D+ G K+ L + + + V F SLL + R G G AA Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAA 119 Query: 118 AYPSNAGIAARWFPDKERATVSGLFDSASKFGGAIAMPLIVWMI-YTFDWRLTFLIIGSV 176 + AR+ P + R GL S G + P I MI + W LI Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG-PAIGGMIAHYIHWSYLLLIP--- 175 Query: 177 GILWVIAWYFIYAENPEEHKRISPSE---------------------------VRIIRDG 209 ++ +I F+ +E + + V ++ Sbjct: 176 -MITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234 Query: 210 QKQHHGDKTVLPMKWYKLLRYRNIWAMCIGFFTINYTSYFFITWLPTYLVKEKGMDFIKM 269 H K P L + + I T F++ +P + + ++ Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294 Query: 270 GMVAALPLLCGMVIEVFAGWASDRLVHKKVLSLTAT-RKLFLTIGLLMALCIGFAPFTDS 328 G V P G + + G+ LV ++ FL++ L A F T S Sbjct: 295 GSVIIFP---GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTA---SFLLETTS 348 Query: 329 VFMTVFLLCVAKSGTTVAASQVWALPGDVAPKNSVSIVAGLQNTVSNMGGAVGPIITGAI 388 FMT+ ++ V + + + + L N S + G I G + Sbjct: 349 WFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQ-EAGAGMSLLNFTSFLSEGTGIAIVGGL 407 Query: 389 VA 390 ++ Sbjct: 408 LS 409
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 35.7 bits (82), Expect = 3e-05 Identities = 13/73 (17%), Positives = 25/73 (34%), Gaps = 1/73 (1%) Query: 80 IDPQHRGQQLGEKLLAALEAKSRQRDCHTLRLETGIHQHAAIALYTRNGYQTRCAFAPYQ 139 + +R + +G LL +++ L LET +A Y ++ + A Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF-IIGAVDTML 155 Query: 140 PDPLSVFMEKPLF 152 E +F Sbjct: 156 YSNFPTANEIAIF 168
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 34.8 bits (80), Expect = 3e-04 Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 3/71 (4%) Query: 22 GRGKVADYIPALASVSGDKLGI-AISTVDGQHFAAGDAHERFSIQSISKVL--SLVVAMN 78 + + I S ++G+ + G+ A A ERF + S KV+ V+A Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80 Query: 79 HYQEEEIWQRV 89 +E++ +++ Sbjct: 81 DAGDEQLERKI 91
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.8 bits (72), Expect = 7e-04 Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 4/86 (4%) Query: 92 RADVAKLLVHQNVRRQGIAQALMSELERIARRERKTVLVLDTAT-GSGAEQFYARCGWEK 150 A + + V ++ R++G+ AL+ + A+ L+L+T A FYA+ + Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF-I 147 Query: 151 VGEIPR--YALMPDGEMTATSLFYKF 174 +G + Y+ P A +YKF Sbjct: 148 IGAVDTMLYSNFPTANEIAIFWYYKF 173
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.5 bits (71), Expect = 6e-04 Identities = 11/51 (21%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 81 YLEDLFVDPAFRGQGIARTMIKSLQSEGADKGWSRLYWHTRRDN-PARHLY 130 +ED+ V +R +G+ ++ + + L T+ N A H Y Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.0 bits (96), Expect = 6e-06 Identities = 81/397 (20%), Positives = 143/397 (36%), Gaps = 32/397 (8%) Query: 11 NLRIISIVVFTCICYLSIGLPLAVLPGYIHYQLGYSTFVA---GIVISLQYISTLVSRPH 67 N +I I+ + + IGL + VLPG + L +S V GI+++L + P Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLLR-DLVHSNDVTAHYGILLALYALMQFACAPV 62 Query: 68 AGRYTDIWGPKKVVSLGIVCCLLSGAFTLLAVVLQATPMLAIAALLAGRVFLGV-GESFT 126 G +D +G + V+ + + + ++ P L + L GR+ G+ G + Sbjct: 63 LGALSDRFGRRPVLLVSLA------GAAVDYAIMATAPFLWV--LYIGRIVAGITGATGA 114 Query: 127 ATGATLWGIKTVGAIHTSRVISWNGVATYVAMAVGAPLGVTLNHYFGISGF--ATVVVLV 184 GA + I +R + M G LG + + + F A + + Sbjct: 115 VAGAYIADITDGDE--RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172 Query: 185 AAIGLLF-------ARTRQDVKVTAGARAPFH-AVVRKIWPYGLGLAFGTVGFGVIATFI 236 + F R + A F A + + + F G + + Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232 Query: 237 TLYFAAHSWQ----GAAFTLSLFSVGFICVRLVLGNTIT-RFGGVPVSLACFIIESLGLL 291 + F + +L+ F + + ++ + R G + I + G + Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292 Query: 292 LIWLAPSAWMAGVGAFLTGSGFSLVFPALGVEAVKQVEEQNQGTALGTYSAFLDLALGLT 351 L+ A WMA L SG + PAL +QV+E+ QG G+ +A L + Sbjct: 293 LLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-SIV 350 Query: 352 GPLAGWVAGFYDLATLYLLAAIVVALAFLLIFRVHRQ 388 GPL + T A I A +LL R+ Sbjct: 351 GPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.006 Identities = 31/128 (24%), Positives = 50/128 (39%), Gaps = 4/128 (3%) Query: 246 VHLWALFGLAAAPSCLIWHKLVLKWGYRQALTRNLLVQALGVILPACSASLLFCVLSALL 305 + L+AL A AP + L ++G R L +L A+ + A + L + ++ Sbjct: 49 LALYALMQFACAP---VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 306 VGFTFMGTVTIALPKAKSLSHQVSFNMIAAMTALYGVGQIAGPLIAGALYQIAASFNPAL 365 G T A M+A +G G +AGP++ G + + P Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHA-PFF 164 Query: 366 YAAALALL 373 AAAL L Sbjct: 165 AAAALNGL 172
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 25.8 bits (56), Expect = 0.025 Identities = 10/29 (34%), Positives = 19/29 (65%) Query: 52 RRTPWARKEVEAMYLASLDDDAPVEKADP 80 +R WA KEV+A ++ + +D +E+ +P Sbjct: 415 KRLYWASKEVKAQFMRVVQNDKALEEGNP 443
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 43.7 bits (103), Expect = 1e-06 Identities = 31/157 (19%), Positives = 59/157 (37%), Gaps = 2/157 (1%) Query: 26 LGVFGLIVAEFLPASLLTPMASSLGVSEGMAGQAVTATALVALVTGLLIATATRNIDRRW 85 L F ++ L SL +A+ TA L + + + + + Sbjct: 22 LSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 86 VLMFFSVLQIVSSLMVAFADSLAFLL-LGRLLLGIAIGGFWAMSTATAMRLVPAAHVPKA 144 +L+F ++ S++ S LL + R + G F A+ R +P + KA Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140 Query: 145 LAIIFSAVSVATVVAAPLGSYLGELIGWRNVFILCAI 181 +I S V++ V +G + I W + ++ I Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI 177
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 28.3 bits (63), Expect = 0.033 Identities = 12/44 (27%), Positives = 17/44 (38%), Gaps = 2/44 (4%) Query: 223 PPAPTAASAADGTFTITLTSTGERWPVPGDKTIAQVLQEHGVAV 266 A+S I L+ G W DK + +LQ+ G V Sbjct: 40 TQVNAASSHTKPPLVIFLSGDGG-W-ATLDKAVGGILQQQGWPV 81
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 679 bits (1753), Expect = 0.0 Identities = 309/861 (35%), Positives = 449/861 (52%), Gaps = 52/861 (6%) Query: 12 VSLSILLGGQSALLHAQAT--FNMDLLEKNDHLPAVDLQRFNQQAGQPPGAYPVSWQVNG 69 V L + + + A FN L + DL RF PPG Y V +N Sbjct: 28 VRLFVACAFAAQAPLSSAELYFNPRFLADDP-QAVADLSRFENGQELPPGTYRVDIYLNN 86 Query: 70 VTLDARKTVTFRQND-RGQLTPCLKPEDLLQAGVNPAVLSQATGATSRSCPELNALLPGS 128 + + VTF D + PCL L G+N A +S +C L +++ + Sbjct: 87 GYMA-TRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDA 145 Query: 129 TVNFDFAHQRLVMTIPQTLMTHRARDNVPSALWDEGISAFQSNYRYSGASQRTREGSTER 188 T D QRL +TIPQ M++RAR +P LWD GI+A NY +SG S + R G Sbjct: 146 TAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSH 205 Query: 189 DNYLMLKSGVNVGAWRLRASNSLTAN-----SDDKPQWTTSGAWLERDLTRWQSELTLGD 243 YL L+SG+N+GAWRLR + + + N S K +W WLERD+ +S LTLGD Sbjct: 206 YAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGD 265 Query: 244 TFTSGDVFDAVQFQGISLASSDAMLPDSQKGFAPTIRGIARTNAQVTVRQNGYVLYQTYV 303 +T GD+FD + F+G LAS D MLPDSQ+GFAP I GIAR AQVT++QNGY +Y + V Sbjct: 266 GYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTV 325 Query: 304 TPGAFVIDDLYPTASSGNLEVAVKESDGEIRRFTQPYASVTSMQREGSLKYNLVAGRYHS 363 PG F I+D+Y +SG+L+V +KE+DG + FT PY+SV +QREG +Y++ AG Y S Sbjct: 326 PPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRS 385 Query: 364 DDASQR-PLMMQLSLMRGFAHNLTLFGGLQSAAQYHNLSLGAGQGLGEAGALSLQLLNAR 422 +A Q P Q +L+ G T++GG Q A +Y + G G+ +G GALS+ + A Sbjct: 386 GNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQAN 445 Query: 423 DR-HQQDPIDGRAWQLQYSKGFDRLGTQFTFTGWRYSHQRYATLSEAFSSPGSDDDLQDS 481 DG++ + Y+K + GT G+RYS Y ++ S + +++ Sbjct: 446 STLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQ 505 Query: 482 D-----------------NKKATLQITASQSLPYDITLYLSLDQDSYWSGGASQRTANMG 524 D NK+ LQ+T +Q L TLYLS +YW G Sbjct: 506 DGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAG 565 Query: 525 ISSQVHGIAWSLSYSDSRSSHGDEEDDEPHSDKVVTLSLSVPLSHLLPG--------SYA 576 
+++ I W+LSYS ++++ D+++ L++++P SH L + A Sbjct: 566 LNTAFEDINWTLSYSLTKNAWQKG------RDQMLALNVNIPFSHWLRSDSKSQWRHASA 619 Query: 577 GYTLTSSRHSVGSQMVSLNGTLLDNHALSYAVSQTRDRQ----NGSSGSLTAGYSSGRGD 632 Y+++ + + + + GTLL+++ LSY+V +GS+G T Y G G+ Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679 Query: 633 LNLGYSHDSQAARLNYGASGGILIHRHGVVFTPEMNGAVVLIDAGGAGGVTLANQKTIAT 692 N+GYSH +L YG SGG+L H +GV +N VVL+ A GA + NQ + T Sbjct: 680 ANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRT 739 Query: 693 NGDGYAVLPFATAYHRNDVSLDSHSLPENVDLANSTVTLVPTKDAVVLARFHTHFGYKAL 752 + GYAVLP+AT Y N V+LD+++L +NVDL N+ +VPT+ A+V A F G K L Sbjct: 740 DWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLL 799 Query: 753 FTLQSRGQPLPFGSEVRAKDTNS--IVASEGQVYLAGLAPKGTLYAQWGPGPQQRCSARY 810 TL +PLPFG+ V ++ + S IVA GQVYL+G+ G + +WG C A Y Sbjct: 800 MTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859 Query: 811 DLTPTLAQTPHPLILQQTLSC 831 L P Q + Q + C Sbjct: 860 QLPPESQQQL---LTQLSAEC 877
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 31.9 bits (72), Expect = 9e-04 Identities = 39/168 (23%), Positives = 75/168 (44%), Gaps = 22/168 (13%) Query: 24 ARAAGTLNFTGKIINESCQIANNGGDVNVDFGNVDMSALKSHEAKTAETPFTINLTGCPL 83 AA L F GK+I +C + N V++G++++ L ++ + FT+++ CP Sbjct: 22 VHAADNLTFKGKLIIPACTVQN----AEVNWGDIEIQNLV--QSGGNQKDFTVDMN-CPY 74 Query: 84 AQNISISLEGTPDTNANGTSAAVLALSDAADTAKGVGIEVFSSPDGS-----TEGTQLTF 138 S+ T+ T ++L + + + G+ I +++S + T G+Q+T Sbjct: 75 ----SLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQVTP 130 Query: 139 DKQSKTAVSQADENGDIAFNFIADLKSDSSQDVTAGNINATANIDIVY 186 K + TA ++ I K + Q + AG +ATA + Y Sbjct: 131 GKITGTAPAR-----KITLYAKLGYKGN-MQSLQAGTFSATATLVASY 172
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 26.5 bits (58), Expect = 0.015 Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 6/35 (17%) Query: 11 LCLAPLASSAALSGQVH------FSGRVINPACVI 39 LCL + + +S VH F G++I PAC + Sbjct: 7 LCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTV 41
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.8 bits (93), Expect = 2e-05 Identities = 26/118 (22%), Positives = 48/118 (40%), Gaps = 1/118 (0%) Query: 55 GLVMSVLLVGAALGSVFGGKFADYFGRRKYLLFLSFVFLIGALLSAAAPDITTLLIARAL 114 G+++++ + + G +D FGRR LL + + A AP + L I R + Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 115 LGYAVGGASVTAPTFISEVAPTEMRGKLTGLNEVAIVIGQLAAFAINAIIGIIWGHLP 172 G G A +I+++ + R + G G +A + ++G H P Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP 162 Score = 32.5 bits (74), Expect = 0.004 Identities = 32/137 (23%), Positives = 51/137 (37%), Gaps = 22/137 (16%) Query: 321 LVDRFKRKTIIIYGFAIMATLHLIIAAVDYTLVGDLKATAIWLLGALFVGVMQGSMGFIT 380 L DRF R+ +++ L AAVDY ++ + +G + G + G+ G + Sbjct: 66 LSDRFGRRPVLLVS--------LAGAAVDYAIMATAPFLWVLYIGRIVAG-ITGATGAVA 116 Query: 381 WVVLAELFPLKFRGLSMGISVFFMWIMNAVVSYLFPL------LQAKLGLGPVFFIFAAI 434 +A++ R G M+A + L FF AA+ Sbjct: 117 GAYIADITDGDERARHFGF-------MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAAL 169 Query: 435 NYLAILFVVFALPETSN 451 N L L F LPE+ Sbjct: 170 NGLNFLTGCFLLPESHK 186 Score = 30.2 bits (68), Expect = 0.018 Identities = 30/152 (19%), Positives = 51/152 (33%), Gaps = 8/152 (5%) Query: 48 ALTPTTEGLVMSVL-LVGAALGSVFGGKFADYFGRRKYLLFLSFVFLIGALLSAAAPDIT 106 TT G+ ++ ++ + ++ G A G R+ L+ G +L A A Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 107 TLLIARALLGYAVGGASVTAPTFISEVAPTEMRGKLTGLN----EVAIVIGQLAAFAINA 162 LL + G +S E +G+L G + ++G L AI A Sbjct: 302 MAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 163 IIGIIWGHLPDVWRYMLLVQAIPAICLFVGMW 194 W W + + L G+W Sbjct: 361 ASITTWNGW--AWIAGAALYLLCLPALRRGLW 390
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 28.8 bits (64), Expect = 0.012 Identities = 13/50 (26%), Positives = 30/50 (60%), Gaps = 2/50 (4%) Query: 296 INNVRQLLEHDSGEVLLDTLSSFIANNAEPGKTSLLLGIHRNTLTYRLQQ 345 +N++ +L+ + + LLD + + N + +L++GI+R TL +L++ Sbjct: 47 VNDLYELVLAEVEQPLLDMVMQYTRGNQT--RAALMMGINRGTLRKKLKK 94
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 246 bits (630), Expect = 3e-79 Identities = 114/474 (24%), Positives = 195/474 (41%), Gaps = 73/474 (15%) Query: 7 SILLIDDDADVLDAYTQLLEQAGYHVSACNNPFDAREQVPKDWPGIVLSDVCMPGCSGID 66 +IL+ DDDA + Q L +AGY V +N + +V++DV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LMTLFHQDDDLLPILLITGHGDVPMAVEAVKKGAWDFLQKPIDPGKLLTLVDAALRQRQS 126 L+ + LP+L+++ A++A +KGA+D+L KP D +L+ ++ AL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 127 VIARRQYCQQKLQVELIGRSQWTVRYRQRLQQLAETDIAVWLYGEPGTGRMTGARYLHQL 186 ++ + Q L+GRS + L +L +TD+ + + GE GTG+ AR LH Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 187 GRHAEGPFIA--CELTPAN----------------AHTLNE-LIAQAQGGTLVLSHPEHL 227 G+ GPF+A P + A T + QA+GGTL L + Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 228 THEQQHQLVQ-LQSHEKRP----------FRLISIGSASLVELAASSQIVAELYYCFAMT 276 + Q +L++ LQ E R+++ + L + +LYY + Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 277 QIGCQPLSKRPDDIEPLFHHYLQKTCQRLNHPVPEVDAGLLKGMMRRVWPNNVRELANAA 336 + PL R +DI L H++Q+ + V D L+ M WP NVREL N Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 337 ELFAV--------------------------------GVLPLAETVNPLMH--------- 355 G L +++ V M Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 356 IGEPTPLDQRVEDVERQIITEALNIHQGRINEVAEYLLIPRKKLYLRMKKYGLN 409 + D+ + ++E +I AL +G + A+ L + R L ++++ G++ Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 30.6 bits (69), Expect = 0.010 Identities = 5/35 (14%), Positives = 15/35 (42%), Gaps = 4/35 (11%) Query: 312 QRLVQRMFDTAISFRLAQLKDAWRALHSAEVRLKR 346 +++ + LA ++++W + RL + Sbjct: 150 NSVMEGVIVRI----LANVRESWTQVIDLRPRLGQ 180
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.009 Identities = 65/387 (16%), Positives = 126/387 (32%), Gaps = 39/387 (10%) Query: 52 TPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 108 P L L S G+L + + V+ +L+D+ + + L A+ Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87 Query: 109 VGLGFSTAFWVFAALVVLNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 168 + + WV ++ G+ G IA+ ER R F +S G G+ Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144 Query: 169 VA-PIVGAAFAILGTEHWQSASYIVPACVAVVFAISVLVLGKGSPREEGLPSLAEMMPEE 227 VA P++G P A + G ++PE Sbjct: 145 VAGPVLGGLMGGFSPH--------APFFAAAALNGLNFLTG------------CFLLPE- 183 Query: 228 KVVLKTKHGQKAPENMSAFQIFCTYVLRNKNAWYVSFVDVFVYMVRFGMISWLPIYLLTV 287 + G++ P A ++ + + VF M G + + Sbjct: 184 -----SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 288 KHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKLFKGRRMPLAIICMTLIFICLIGYW 344 F + ++ + ++ ++ G ++ +L + R + L +I +I L Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 345 KSESLLMVTVFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGASLGTS 404 + + V A G + Q + S Q E GS L ++ I G L T+ Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS-LTSIVGPLLFTA 357 Query: 405 LFGVMVDKMGWHGGFYLLMGGIVCCIL 431 ++ + W+G ++ + L Sbjct: 358 IYAASITT--WNGWAWIAGAALYLLCL 382
>INTIMIN#Intimin signature. Length = 939 Score = 32.3 bits (73), Expect = 5e-04 Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 7/70 (10%) Query: 84 SDGVKVTQSGAESR-FYTVKSGDTLSAISKAMYGSANDYQRIFEANKPMLTHPD---KIY 139 SD +T + ++R FYT+K+G+T++ +SK+ + I+ NK + + K Sbjct: 49 SDSKLLTHNSYQNRLFYTLKTGETVADLSKS---QDINLSTIWSLNKHLYSSESEMMKAE 105 Query: 140 PGQVLIIPAK 149 PGQ +I+P K Sbjct: 106 PGQQIILPLK 115
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 29.4 bits (66), Expect = 0.014 Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 3/40 (7%) Query: 41 LTHPD-GFTLIDGGLAVEGLKDPSGYWG-SAVEQFKPVMS 78 L HP+ F + G+ + G PSG W A +PVM+ Sbjct: 196 LWHPEAHFDWVRPGIILYGAS-PSGQWRDIANTGLRPVMT 234
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.6 bits (69), Expect = 0.013 Identities = 26/141 (18%), Positives = 48/141 (34%), Gaps = 4/141 (2%) Query: 18 MVIAFVQFTNALEYMMFSPVFTFMAADF---AVPVTFSGYVSGMYTSGAVLSGIIAFYWI 74 +VI +A+ + PV + D G + +Y + Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67 Query: 75 DRCNKKHFLIANMVLLAMATLLTTFTTSFPLLLTLRFFAGLVGGTTMAVGITILINHTPA 134 DR ++ L+ ++ A+ + +L R AG+ G T AV + + T Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDG 126 Query: 135 DLRGKMLATVIASFSMVSIVG 155 D R + + A F + G Sbjct: 127 DERARHFGFMSACFGFGMVAG 147
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.015 Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 4/42 (9%) Query: 31 VISIIGRSGSGKSTLLRCINGLEGYQEGSIKLGGMTITNRDS 72 + + G G GKSTL+ + GL+ + + +G T +DS Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG----TGKDS 635
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 29.1 bits (65), Expect = 0.020 Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 13/72 (18%) Query: 58 RLGYDKYKDMRDELRTL-------RQSGMPLTDQRDAV------QGNTLLARHYKQEMAN 104 L Y K D+ + L + +Q+ P+ + Q N L+ M + Sbjct: 273 YLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMND 332 Query: 105 LTQWVNALDARQ 116 L + + LD R+ Sbjct: 333 LERVIAQLDIRR 344
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 63.5 bits (154), Expect = 5e-14 Identities = 48/185 (25%), Positives = 84/185 (45%), Gaps = 3/185 (1%) Query: 2 LRGKRAVITGGGTGFGQALSVWLAREGVEVDFCARRADDIQKTCSIITAEGGMAKGHLCD 61 + GK A ITG G G+A++ LA +G + + ++K S + AE A+ D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 62 LTLPESLSQFSSQLLTLDKPIDILILNAAQWLSGTLDDQSDTEIINTISSGLTGSILLTQ 121 + ++ + ++++ PIDIL+ A G + SD E T S TG ++ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 122 ALLPGLRRSESADIVSIISSCGIPNFTDSIAHPAFFASKHGLSGFTTKLSYQLSKENIRV 181 ++ + S IV++ S+ P + A+ +SK FT L +L++ NIR Sbjct: 126 SVSKYMMDRRSGSIVTVGSN---PAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 182 TGLYP 186 + P Sbjct: 183 NIVSP 187
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 50.8 bits (121), Expect = 3e-10 Identities = 27/106 (25%), Positives = 48/106 (45%), Gaps = 2/106 (1%) Query: 54 ADNESTAIHPDVAPAENEV--VKRRVGAFSFTELEMILRAQGIENLILTGVTTSRVVLST 111 + I ++AP ++++ K R AF T L ++R +G + LI+TG+ L T Sbjct: 101 SGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVT 160 Query: 112 VGQAFDLDYRLIVVNDYCADPDPDTNMFLLKKVLPQHAFVTSSSEI 157 +AF D + V D AD + + L+ + AF + + Sbjct: 161 ACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSL 206
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.4 bits (240), Expect = 5e-25 Identities = 42/127 (33%), Positives = 65/127 (51%), Gaps = 1/127 (0%) Query: 6 HILVVDDDRDIRELIVDYLEKSGYRASGAANGKAMWSVLKNHQIDLIVLDIMMPGEDGLT 65 ILV DDD IR ++ L ++GY +N +W + DL+V D++MP E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 LCRQLRANPQQDIPVLMLTARTDDSDRILGLEMGADDYLIKPFVARELLARIKAILRRTR 125 L +++ + D+PVL+++A+ I E GA DYL KPF EL+ I L + Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 126 ALPPNLQ 132 P L+ Sbjct: 124 RRPSKLE 130
>PF04619#Dr-family adhesin Length = 160 Score = 27.2 bits (60), Expect = 0.014 Identities = 11/21 (52%), Positives = 14/21 (66%), Gaps = 2/21 (9%) Query: 1 MKKLAL--AMACLFAVGVAQA 19 MKKLA+ A + +FAV A A Sbjct: 1 MKKLAIMAAASMVFAVSSAHA 21
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.4 bits (76), Expect = 6e-04 Identities = 12/47 (25%), Positives = 19/47 (40%) Query: 246 RGTGIGRRLLSEAMAFCDSRQFSAVQLWTFKGLDAARKLYESFGFTL 292 R G+G LL +A+ + F + L T +A Y F + Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 56.0 bits (135), Expect = 1e-10 Identities = 80/390 (20%), Positives = 134/390 (34%), Gaps = 37/390 (9%) Query: 5 IFSLALGTFGLGMAEFGIMGVLPDMAHDVGISIPAA---GNMIAWYAFGVVIGAPIMALL 61 + ++AL G+G+ IM VLP + D+ S G ++A YA AP++ L Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRFSLKSVMLFLAGLCILGNTLFTFSSSYAMLALGRLVSGFPHGAFFGVGAIILSKIAP 121 S RF + V+L + + + +L +GR+V+G GA I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMIGGMTIANLVGVPGGTWLGHHFSWRYTFALIAVFNVAVFLAIFCWVPTL 181 G A G + +V P L FS F A N FL +P Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 YDRASTRLREQ---------FRFLASPAPWLI---FAATMFGNAGVFAWFSYIKPFMLNV 229 + LR + + + L+ F + G W + + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE------ 238 Query: 230 SGFAESKMMLIMMLAGLGM---VVGNLFSGKISGRYSPLRIAAMTDGVIAVTLLLIFAFG 286 F + + LA G+ + + +G ++ R R + +L+ Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 287 EQKVASLALAFICCAGLFALSAPLQILLLQNAKGGEMLGAAGGQIAF--NLGSAIGAFCG 344 +A + + G+ LQ +L + E G G +A +L S +G Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVGPLLF 355 Query: 345 GMMIAQGFG-WNS-VALPAAALSFLAMSAL 372 + A WN + AAL L + AL Sbjct: 356 TAIYAASITTWNGWAWIAGAALYLLCLPAL 385
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 318 bits (817), Expect = e-106 Identities = 83/395 (21%), Positives = 169/395 (42%), Gaps = 15/395 (3%) Query: 20 IDATVLHVAAPTLSVALGSSGNELLWIIDIYSLVMAGMVLPMGALGDKIGFKRLLLLGSA 79 ++ VL+V+ P ++ W+ + L + G L D++G KRLLL G Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 80 IFGIASLCAALSPT-AMTLIASRALLAVGAAMIVPATLAGIRSTFAEASQRNMALGLWAA 138 I S+ + + LI +R + GAA PA + + + + R A GL + Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFGLIGS 146 Query: 139 VGSGGAAFGPLVGGILLEHFYWGSVFLINVPIVLVVIAINAKVVPRQPARREQPLNLLQA 198 + + G GP +GG++ + +W +L+ +P++ ++ + ++ R + ++ Sbjct: 147 IVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI 204 Query: 199 LVLIAAILMLVFSAKSALKGQLALWLTALVALGGAAMLTWFIRKQLSAARPMVDMRLFTH 258 +++ I+ + S L + + + + F++ P VD L + Sbjct: 205 ILMSVGIVFFMLFTTSYSISFLIVSVLSFLI---------FVKHIRKVTDPFVDPGLGKN 255 Query: 259 RIILSGVMMAMTALITLVGFELLMAQELQFVHQKTPFEAG-IFMLPVMVASGFSGPIAGL 317 + GV+ T+ GF ++ ++ VHQ + E G + + P ++ G I G+ Sbjct: 256 IPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGI 315 Query: 318 LVSRLGLREVATGGMLLSAFSFLGLALTDFSTQQWLAWGLMTLLGFSVASALLASSSAIM 377 LV R G V G+ + SFL + +T ++ ++ +LG + + S+ Sbjct: 316 LVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSS 375 Query: 378 AAAPKEKAAAAGAIETMAYELGAGLGIALFGLILT 412 + +E A + ++ L G GIA+ G +L+ Sbjct: 376 SLKQQEAGAGMSLLNFTSF-LSEGTGIAIVGGLLS 409
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 51.9 bits (124), Expect = 1e-10 Identities = 26/180 (14%), Positives = 64/180 (35%), Gaps = 11/180 (6%) Query: 5 QRDARREGIMQAAMRLALRGGFAAMTVRQIAREAQVAAGQLHHHFTSIGELKAQVFIRLI 64 + R+ I+ A+RL + G ++ ++ +IA+ A V G ++ HF +L ++++ Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 65 REMLDMPLVAED-------ASWRERL---FSMIGSEDGRLEPYIRLWREGQVLADSDPDI 114 + ++ L + + RE L +E+ R + + + Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERR-RLLMEIIFHKCEFVGEMAVV 126 Query: 115 KAAYLLTMNMWHAETVAIIEQGLASGEFRSAEPAADIAWRFIALVCGLDGIYALDAQALD 174 + A + ++ + + + A + GL + Q+ D Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.005 Identities = 12/31 (38%), Positives = 14/31 (45%) Query: 38 LLGPSGCGKSTLLRLLAGLSVPASGEIRFGD 68 L G G GKSTL+ L GL + G Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 37.0 bits (85), Expect = 2e-04 Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 25/96 (26%) Query: 32 AQLEARLNLAEQ--QASEASRR-------AQRAEQQTAAAEQRAAAAEQQVQALSQQTTA 82 QLEA E+ + SEASR+ A R ++ A ++ AL + Sbjct: 361 KQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKA--LEEANSKLAALEKLNKE 418 Query: 83 REQKQQATNQQ--------------LSEQLAKRAPD 104 E+ ++ T ++ L E+LAK+A + Sbjct: 419 LEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEE 454 Score = 32.3 bits (73), Expect = 0.005 Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 13/82 (15%) Query: 26 SVEQRLAQLEARLN-LAEQ-QASEASRRAQRAEQQTAAAEQRAAAAEQQVQALSQQTTAR 83 + + QLEA L EQ + SEASR++ R + + ++ AE Q Sbjct: 320 ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQ--------KLE 371 Query: 84 EQKQ--QATNQQLSEQL-AKRA 102 EQ + +A+ Q L L A R Sbjct: 372 EQNKISEASRQSLRRDLDASRE 393 Score = 30.4 bits (68), Expect = 0.018 Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 12/77 (15%) Query: 27 VEQRLAQLEARLNLAEQ--QASEASRRAQRAEQQTAAAEQRAAAAEQQVQALSQQTTARE 84 +E A LEA E Q A+R++ R + + ++ AE Q E Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ--------KLEE 337 Query: 85 QKQ--QATNQQLSEQLA 99 Q + +A+ Q L L Sbjct: 338 QNKISEASRQSLRRDLD 354
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.7 bits (235), Expect = 2e-25 Identities = 57/180 (31%), Positives = 80/180 (44%), Gaps = 10/180 (5%) Query: 2 QTIMITGCSSGFGLETARYFLEQGWKVIATMRAPQEGVLPASDRLRLVR------LDVTS 55 + ITG + G G AR QG + A P++ S R DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 56 AQSIAEAI----AEVGEIDVLVNNAGVGMLNALEGAPREAIANLFATNTLGTIAMTQAVI 111 + +I E E+G ID+LVN AGV + E F+ N+ G +++V Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 112 PRFRARRSGTIVNITSAVTLQPMPLLAVYTASKAAVNAFTESLALELRAFNIRVGLILPG 171 RRSG+IV + S P +A Y +SKAA FT+ L LEL +NIR ++ PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 30.2 bits (68), Expect = 0.004 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 5/44 (11%) Query: 10 QRQALICQILQENGRVVCAELAARLQ-----VSEHTIRRDLHEL 48 QR I +I+ N EL L+ V++ T+ RD+ EL Sbjct: 5 QRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48
>cloacin#Cloacin signature. Length = 551 Score = 30.8 bits (69), Expect = 0.002 Identities = 15/44 (34%), Positives = 20/44 (45%) Query: 23 PAYANPGNGNGNGGGNHGNSGNHGNSGNHGNNGNSGDHGNKGQN 66 +++ N G G G+ + G GN G NGNSG G N Sbjct: 37 SGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80 Score = 29.7 bits (66), Expect = 0.006 Identities = 13/39 (33%), Positives = 16/39 (41%) Query: 27 NPGNGNGNGGGNHGNSGNHGNSGNHGNNGNSGDHGNKGQ 65 NP G G + G HGN G +GN+G G Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82 Score = 29.7 bits (66), Expect = 0.007 Identities = 11/32 (34%), Positives = 16/32 (50%) Query: 26 ANPGNGNGNGGGNHGNSGNHGNSGNHGNNGNS 57 + G G+G GN G +GN G G N ++ Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 108 bits (272), Expect = 1e-27 Identities = 81/384 (21%), Positives = 156/384 (40%), Gaps = 14/384 (3%) Query: 39 PAIQQSLGGSPAALSWLTNGFMLTFGSFLLAAGVTADAIDRKRIFIAGAALFCLSSLLFC 98 P I PA+ +W+ FMLTF G +D + KR+ + G + C S++ Sbjct: 38 PDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF 97 Query: 99 LTHNLFLSGVL-RALQGLAAAMILASGSAALAQLYDGAQRTRAFSILGTVFGVGLAFGPL 157 + H+ F ++ R +QG AA A +A+ R +AF ++G++ +G GP Sbjct: 98 VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157 Query: 158 LIGFMTDAVGWRGVYALFALLSAIVLLIGLAYLPAAEKSEPRTPDNLGLTLFTLALMLFT 217 + G + + W Y L + I+ + L L E D G+ L ++ ++ F Sbjct: 158 IGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFM 215 Query: 218 ASLMVIPARGFLSLTTLALLIASGGLFVAFVVRCRRVNNPVLELSLLRHPRFVGVLLLPV 277 + FL ++ L+ LI FV R+V +P ++ L ++ F+ +L Sbjct: 216 LF-TTSYSISFLIVSVLSFLI--------FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGG 266 Query: 278 ATCCCYVVLLIIVPLHFMGGEGMSESQ-SALYLMALTTPMLVFPSVAALLTRWFSPGQVS 336 + +VP +S ++ ++ + T +++F + +L P V Sbjct: 267 IIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVL 326 Query: 337 TAGLMMASVGLLLLGDAFHSNHLPQLVLALILCGAGAALPWGLMDGLAISAVPVAKAGMA 396 G+ SV L + + ++ G + ++ + S++ +AG Sbjct: 327 NIGVTFLSVSFLTASFLLETTSWFM-TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAG 385 Query: 397 AGLFNTVRVAGEGIALAVVSAVLT 420 L N EG +A+V +L+ Sbjct: 386 MSLLNFTSFLSEGTGIAIVGGLLS 409
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 32.2 bits (73), Expect = 0.004 Identities = 17/72 (23%), Positives = 32/72 (44%), Gaps = 9/72 (12%) Query: 188 LVSRYHDPRPESLRRVVMAPTTVLHSAPGAQ-LREMAKLARQLGIRL------HSHLSET 240 + S + P PE L R+ AP + + G Q L K ++ L +HL++ Sbjct: 101 VWSAGYGPSPEMLARI--APGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQY 158 Query: 241 VDYLDAARQKFA 252 D++ + + +F Sbjct: 159 EDFIRSMKPRFV 170
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 31.2 bits (70), Expect = 0.010 Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 8/95 (8%) Query: 144 STYRLASLGGLYGGGFGGIGS--INYGPLAAPGNVLSVKVMTVEPAPRVLTVPAPEALLL 201 + LA + GG+ I + I+ GPL P L KV +E A ++ P P L++ Sbjct: 277 TPNELADVNDYMRGGYTAINNYLISNGPLNNPNPELDSKVNNIENALKL--TPIPSNLIV 334 Query: 202 HHAYGTNGIILEVELALAPAHQWIERLDVFDDFAD 236 + G E L L +++ D F + Sbjct: 335 YRRSGP----QEFGLTLTSPEYDFNKIENIDAFKE 365
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 29.5 bits (66), Expect = 0.027 Identities = 32/129 (24%), Positives = 47/129 (36%), Gaps = 15/129 (11%) Query: 6 ALAALALLMLAAYRGY----SVILFAPIAALGAVLLTDPGAVGPA----------FTGLF 51 ALA + ++AA GY S++ P+ AL + G V A + L Sbjct: 126 ALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLV 185 Query: 52 MEKMVGFVKLYFPVFLLGAVFGKLIELSGFSRSIVAAAIRILGRRHAIPVIVLVCALLTY 111 + ++ FPV L VF S SI +LG + V V AL Y Sbjct: 186 ITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP-VVDVCQHVGALCIY 244 Query: 112 GGVSLFVVA 120 + F+ Sbjct: 245 IVIPFFLST 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (288), Expect = 4e-33 Identities = 66/255 (25%), Positives = 105/255 (41%), Gaps = 9/255 (3%) Query: 3 LHGKTALVTGSTSGIGLGIAKVLAQAGAQLVLNGFGDSSHARAE--VAALGKIPGYHDAD 60 + GK A +TG+ GIG +A+ LA GA + + + + A + AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 61 LRDVGQIEAMMRYAESTFGGVDIVINNAGIQHVAPVEQFPVDKWNDILAINLSSVFHTTR 120 +RD I+ + E G +DI++N AG+ + ++W ++N + VF+ +R Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 LALPGMRQRNWGRIINIASVHGLVASKEKSAYVAAKHAVVGLTKTVALETARSGITCNAI 180 M R G I+ + S V +AY ++K A V TK + LE A I CN + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 181 CPGWVLTPLVQQQIDKRIAEGVDPEQASAQLLAEKQ---PSGEFVTPQQLGEMALFLCSD 237 PG T + EQ L + P + P + + LFL S Sbjct: 186 SPGSTETDMQWSLWADENGA----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 238 AAAQVRGAAWNMDGG 252 A + +DGG Sbjct: 242 QAGHITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 124 bits (312), Expect = 1e-36 Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 10/254 (3%) Query: 3 KVALVTGAGQGIGKAIALRLVKDGFAVAIADYNDATAKAVASEINQAGGRAMAVKVDVSD 62 K+A +TGA QGIG+A+A L G +A DYN + V S + A A DV D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 RDQVFAAVEQARKTLGGFDVIVNNAGVAPSTPIESITPEIVDKVYNINVKGVIWGIQAAV 122 + + + +G D++VN AGV I S++ E + +++N GV ++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 123 EAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVRGLTQTAARDLAPLGITVNGYCP 182 + G I+ S V +A Y+SSK A T+ +LA I N P Sbjct: 129 KYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 183 GIVKTPM----WAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPD 238 G +T M WA+ + G F I L +L++P D+A V +L S Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGS-----LETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 239 SDYMTGQSLLIDGG 252 + ++T +L +DGG Sbjct: 243 AGHITMHNLCVDGG 256
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1060 bits (2744), Expect = 0.0 Identities = 502/1031 (48%), Positives = 690/1031 (66%), Gaps = 7/1031 (0%) Query: 1 MPHFFIERPIFAWVIALFIVLTGLLSIPRLPVAQYPEVAPPGIIISVSYPGASPEVMNTS 60 M +FFI RPIFAWV+A+ +++ G L+I +LPVAQYP +APP + +S +YPGA + + + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VVSLIEREISSVDNLLYFESSSDTTGMASITVTFKPGTDIKLAQMDLQNQIKIVESRLPQ 120 V +IE+ ++ +DNL+Y S+SD+ G +IT+TF+ GTD +AQ+ +QN++++ LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 SVRQNGINVEAANSGFLMMVGLKSPSGAYQEADLSDYFARNVTDELRRVPGVGKVQLFGG 180 V+Q GI+VE ++S +LM+ G S + + D+SDY A NV D L R+ GVG VQLFG Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 EKALRIWLDPMKLHSYGLSVTDVLSAISQQNVIVSPGRTGDEPATSSQEVTYPITVKGQL 240 + A+RIWLD L+ Y L+ DV++ + QN ++ G+ G PA Q++ I + + Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 SSVEEFRNITIKSQVSAARVTLADVARVESGLQSYAFGIRENGVPATAAAIQLSPGANAI 300 + EEF +T++ + V L DVARVE G ++Y R NG PA I+L+ GANA+ Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 STASGIRARLTELSGVLPEGMTFTVPFDTAPFVKLSILKVVETFVEAMVLVFFVMLLFLH 360 TA I+A+L EL P+GM P+DT PFV+LSI +VV+T EA++LVF VM LFL Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 KIRCTLIPAIVAPVALLGTFTVMLLSGYSINILTMFGMILAIGIIVDDAIVVVENVERLM 420 +R TLIP I PV LLGTF ++ GYSIN LTMFGM+LAIG++VDDAIVVVENVER+M Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 EDKKMSPQDATREAMREITPAIIGITLVLTAVFIPMAFASGSVGIIYRQFSISMAISILL 480 + K+ P++AT ++M +I A++GI +VL+AVFIPMAF GS G IYRQFSI++ ++ L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SAFLALTLTPALCATLLKP-HGIHQGKSSVFSAWFNAHFHRLTSFYATGLGFVLKRTGRM 539 S +AL LTPALCATLLKP H F WFN F + Y +G +L TGR Sbjct: 481 
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 540 MMIYAALCLALFAGLSTLPSSFLPDEDQGYFMSSIQLPSDATMQRTLKVVDTFEEEI--A 597 ++IYA + + LPSSFLP+EDQG F++ IQLP+ AT +RT KV+D + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 598 HRQAVESNIMILGFGFSGSGQNSAMAFTTLKDWRQRKGT--TAQEEADHIRSQMANVPDA 655 + VES + GF FSG QN+ MAF +LK W +R G +A+ + ++ + D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 656 VTMSLLPPAISDMGTSSGFTYYLQDRGGKGYQALKKAADELIVQANHNP-HLADVYIDGL 714 + PAI ++GT++GF + L D+ G G+ AL +A ++L+ A +P L V +GL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 715 GEGTSLSLHVDREKAEAMGVSFDEINQTISVAAGSNYVNDYTNNGRVQQVIVQADAPYRM 774 + L VD+EKA+A+GVS +INQTIS A G YVND+ + GRV+++ VQADA +RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 775 QPEQLLALSVKNRLGQMLPLSTFVTLSWNVAPQQLIRYQGYPAIRITGSSAQGKSSGTAM 834 PE + L V++ G+M+P S F T W +L RY G P++ I G +A G SSG AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 835 AAMDNLAKHLPPGFAGEWAGSSLQEKESASQLPGLIVLSVLVVFMVLAALYESWSIPFAV 894 A M+NLA LP G +W G S QE+ S +Q P L+ +S +VVF+ LAALYESWSIP +V Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 895 MLVVPLGLLGAVLAVSVTNMTNDVFFKVGLITLIGLSAKNAILIIEFARQLM-KEGKSLI 953 MLVVPLG++G +LA ++ N NDV+F VGL+T IGLSAKNAILI+EFA+ LM KEGK ++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 954 DATLTAAKLRLRPILMTSLAFTLGVVPLMLASGASDSTQHAIGTGVFGGMISGTLLAIFF 1013 +ATL A ++RLRPILMTSLAF LGV+PL +++GA Q+A+G GV GGM+S TLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1014 VPVFFVTITRF 1024 VPVFFV I R Sbjct: 1021 VPVFFVVIRRC 1031
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 77.9 bits (192), Expect = 7e-19 Identities = 29/118 (24%), Positives = 59/118 (50%), Gaps = 1/118 (0%) Query: 7 IIVAEDDDDIAAILTGYLRKAGMKTLRAEDGEQAINLTRLNKPDLLLLDIHLPVYDGWNV 66 I+VA+DD I +L L +AG + DL++ D+ +P + +++ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 67 LTTLRKE-TNVPVIMVTALDQDVDKLMGLRLGADDYVIKPFNPSEVIARVEAVLRRTR 123 L ++K ++PV++++A + + + GA DY+ KPF+ +E+I + L + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.043 Identities = 12/34 (35%), Positives = 17/34 (50%) Query: 31 VVSLLGPSGSGKTTLLRAVAGLEKPTSGRIAIGN 64 V L G G GK+TL+ + GL+ + IG Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 38.8 bits (90), Expect = 2e-05 Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 5/82 (6%) Query: 107 DIDLEAVAAARPDLIITEPSRHVSVEQLEKIAPTVSIDHLQGSAP-----EIYRKLAQLT 161 + +LE + +P ++ S E L +IAP + G P + ++A L Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLL 145 Query: 162 GTQPRLAILERRYQEQIKQLKA 183 Q +Y++ I+ +K Sbjct: 146 NLQSAAETHLAQYEDFIRSMKP 167
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.022 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 4/32 (12%) Query: 38 LRPG---ESVALL-GPSGCGKSTLLRLLAGLE 65 + PG + +L G G GKSTL+ L GL+ Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 50.3 bits (120), Expect = 8e-09 Identities = 79/400 (19%), Positives = 150/400 (37%), Gaps = 52/400 (13%) Query: 14 VTIGLCFMVALMEGLDLQAAGIAAVGMAQAFALDKMQMGWIFSAGILGLLPGALVGGMLA 73 + I LC + L+ ++ +A F W+ +A +L G V G L+ Sbjct: 15 ILIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 74 DRHGRKRILLGSVLLFGLFSLATALAWS-FPTLLLARLLTGVGLGAALPNLIA-LTSEAA 131 D+ G KR+LL +++ S+ + S F L++AR + G G AA P L+ + + Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132 Query: 132 GSRFRGRAVSLMYCGVPIGAALAAALGFSGLAAAWQTIFWIGGVVPLLLIPLLMRWLPES 191 RG+A L+ V +G + A+G + + ++ ++ +P LM+ L + Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 192 QAFQRA---------EASVPLRTLFAPGQAAATLLLWLGYFFTLLVVYMLINWLPMLLVG 242 + + LF + + L++ + F + V ++ P + G Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL-IFVKHIRKVTDPFVDPG 251 Query: 243 QGFRASQAAGVMFSLQI-GAACGTLLLGALMDK--------------LTPLRMSLLIYS- 286 G GV+ I G G + + M K + P MS++I+ Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 287 --GILAS------LLALGSASSLTGMLLAGFV----------AGLFATGGQSVLYALAPL 328 GIL +L +G L A F+ +F GG S + Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 329 FYPAAIRATGVGTAVA----VGRLGAMSGPLLAGKMLALG 364 ++++ G ++ L +G + G +L++ Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411
>PF06438#Heme acquisition protein HasAp Length = 205 Score = 29.1 bits (65), Expect = 0.014 Identities = 25/121 (20%), Positives = 40/121 (33%), Gaps = 17/121 (14%) Query: 139 VGSRIRDWSIGFVD-------TVADNASCGLYVIGGPAQRPAGLDLKQCAMHMTRNQE-L 190 V + DWS F D V + + G GP D Q A+ T + Sbjct: 16 VADYLADWSAYFGDVNHRPGQVVDGSNTGGFN--PGP------FDGSQYALKSTASDAAF 67 Query: 191 VSSGRGSECLGHPLNAAVWLARKLASLGEPLRAGDIVLTGALG-PMVTINEGDSFAAHIE 249 ++ G L + +W +LG+ L G AL V+ + + + Sbjct: 68 IAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLGLDSPIAQ 127 Query: 250 G 250 G Sbjct: 128 G 128
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.5 bits (113), Expect = 6e-08 Identities = 69/370 (18%), Positives = 124/370 (33%), Gaps = 43/370 (11%) Query: 64 GILFSAFAWTYALAQIPGGLFLDRFGNKVTYFLSLTLWSLFTLFHGMAVGLKTLLLCRFG 123 GIL + +A G DRFG + +SL ++ A L L + R Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 124 LGISEAPCFPVNSRVVSAWFPQQERAKA----TAVYTVGEYLGLACFAPLLFWIMDGFGW 179 GI+ A V ++ ERA+ +A + G G P+L +M GF Sbjct: 106 AGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSP 159 Query: 180 RVLFVSVGAVGILFALVWWRCYREPHEDPRLSQQEREHIENGGGLSAPTDQQVAFSWPLV 239 F + A+ L L E H+ R + + +F W Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREA-----------LNPLASFRWARG 208 Query: 240 RQLLSKRQIIGASIGQFAGNTVLVFFLTWFPTWLATERHMPWLKVGFFSILPFVAAAGGV 299 +++ + + ++ + E W + + AA G+ Sbjct: 209 MTVVAALMAVFFIMQLVGQVPAALWVIF-------GEDRFHWDA----TTIGISLAAFGI 257 Query: 300 M---FGGWLSDKLLKATGSANLGRKLPIVAGLL--MASCIITANWLESDLAVILVMSFAF 354 + ++ + LG + ++ G++ I+ A +A +++ A Sbjct: 258 LHSLAQAMITGPVAA-----RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312 Query: 355 FGQGMVGLGWTLISDIAPKGLGGLTGGLFNFCANLAGILTPLVIGFIVAGFGNFFYALIY 414 G GM L ++S + G G +L I+ PL+ I A + + Sbjct: 313 GGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAW 371 Query: 415 IGGAALLGVV 424 I GAAL + Sbjct: 372 IAGAALYLLC 381
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 29/162 (17%), Positives = 61/162 (37%), Gaps = 4/162 (2%) Query: 6 HDEAQSLKARIFSAAIAVFAEHGLSGARMEQIATEAQTTKRMVVYYFKSKEQLYQEVLQH 65 EAQ + I A+ +F++ G+S + +IA A T+ + ++FK K L+ E+ + Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65 Query: 66 VYARIRETEQQLGLENVPPVEALVR---LVRWSVRYHATHADYMRVICMENMQR-GKWLK 121 + I E E + + +++R + + I + G+ Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125 Query: 122 SSGELKPLNRTALSILEDILLRGQQQGVFQAGLDARDVHRLI 163 + L + +E L + + A L R ++ Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 440 bits (1132), Expect = e-140 Identities = 231/1055 (21%), Positives = 423/1055 (40%), Gaps = 71/1055 (6%) Query: 8 LSALAVRERSVTLFLIILISVAGLVAFFGLGRAEDPPFTVKQMTVITVWPGATAQEMQDQ 67 ++ +R L I++ +AG +A L A+ P ++V +PGA AQ +QD Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 68 VAEPLEKRLQELKWYDRTETYT-RPGMALITLSLQDQTPP----SEVPEQFYQARKKLGD 122 V + +E+ + + + + G ITL+ Q T P +V + A L Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLL-- 118 Query: 123 EAKNLPAGVSGPMMNDEFADVTFALFAL--KARGEQPRQLVRD--AEALRQQLLHVSGVK 178 P V ++ E + ++ + A + + D A ++ L ++GV Sbjct: 119 -----PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVG 173 Query: 179 KVNILGEQ-AERIYLSFSHDRLATLGLSPEAIFAALNSQNVLTAAGAI---ETRGGQIF- 233 V + G Q A RI+L D L L+P + L QN AAG + GQ Sbjct: 174 DVQLFGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLN 231 Query: 234 --IRLDGAFDRLQQIRDTPIIAG--GRTLKLADVATVERGYEDPATFLIRHQGEPALLLG 289 I F ++ + G ++L DVA VE G E+ R G+PA LG Sbjct: 232 ASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA-RINGKPAAGLG 290 Query: 290 VVMREGWNGLALGKALDAETASINQSLPLGMSLTKVTDQSVNISAAVDEFMIKFFVA-LL 348 + + G N L KA+ A+ A + P GM + D + + ++ E + F A +L Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML 350 Query: 349 VVMTVCFVSMGWRVGVVVAAAVPLTLAVVFVVMEATGKNFDRITLGSLILALGLLVDDAI 408 V + + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410 Query: 409 IAIEMMV-VKMEEGYDRLKASAYAWSHTAAPMLAGTLVTAVGFMPNGFAQSTAGEYASNV 467 + +E + V ME+ +A+ + S ++ +V + F+P F + G Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470 Query: 468 FWIVGIALIASWIVAVIFTPWLGVHLLPDRKPAAAGHAALYDT----------PRYQRFR 517 + A+ S +VA+I TP L LL KP +A H + Sbjct: 471 SITIVSAMALSVLVALILTPALCATLL---KPVSAEHHENKGGFFGWFNTTFDHSVNHYT 527 Query: 518 RLLTRVIAHKWRVAAGVVALLIVAILGMSVVKKQFFPTSDRPEVLVEVQLPYGSSISQTS 577 + +++ R ++ 
++ + F P D+ L +QLP G++ +T Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587 Query: 578 AAAAKIEHWLQRQPEAKIVTSYIGQGAPRFYLAMAPELPDP--SFAKLVVLTDGQGARE- 634 ++ + + +A + + + G + + + + +F L + G Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENS 642 Query: 635 --ALKRRLREAV-----VNGLAPEARVRVTQLVFGPYSPYPVAWRVMGPDPHALLDIAER 687 A+ R + + + V + + +G D AL + Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHD--ALTQARNQ 700 Query: 688 VKSVLQASPL-MRTVNTDWGSRVPVMHFSLNQDRLQASGLSSQSVAQQLQFLLSGIPITT 746 + + P + +V + ++Q++ QA G+S + Q + L G + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 747 VREDIRAVQVIGRAAGDIRLDPAKIADFTLVGSGGQRVPLSQIGDVSIRMEDPLLRRRDR 806 + R ++ +A R+ P + + + G+ VP S P L R + Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820 Query: 807 TPTITVRGDVAENLQPPDVSIALMKPLQPIIDSLPPGYRIETAGSIEESGKATRAMVPLF 866 P++ ++G+ A P S M ++ + LP G + G + + L Sbjct: 821 LPSMEIQGEAA----PGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALV 876 Query: 867 PIMIALTLLIIILQVRSLSAMVMVFLTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGI 926 I + L + S S V V L PLG++GV+ LFNQ + +VGL+ G+ Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936 Query: 927 LMRNTLILIGQIHHNQQA-GLDPFHAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT 985 +N ++++ + G A + A R RP+L+T+LA IL +PL S G+ Sbjct: 937 SAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996 Query: 986 -----LAYTLIGGTLGGTIMTLIFLPAMYAIWFRI 1015 + ++GG + T++ + F+P + + R Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031 Score = 76.8 bits (189), Expect = 3e-16 Identities = 57/323 (17%), Positives = 122/323 (37%), Gaps = 20/323 (6%) Query: 712 MHFSLNQDRLQASGLSSQSVAQQLQF----LLSGIPITTVREDIRAVQVIGRAAGDIRLD 767 M L+ D L L+ V QL+ + +G T + + A + + Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242 Query: 768 PAKIADFTLVGSG-GQRVPLSQIGDVSIRMED-PLLRRRDRTPTITVRGDVAENLQPPDV 825 P + TL + G V L + V + E+ ++ R + P + +A D Sbjct: 243 
PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 826 SIALMKPLQPIIDSLPPGYRIE----TAGSIEESGKATRAMVPLFPIMIALTLLIIILQV 881 + A+ L + P G ++ T ++ S +V I L L++ L + Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLS---IHEVVKTLFEAIMLVFLVMYLFL 359 Query: 882 RSLSAMVMVFLTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGILMRNTLILIGQIH-H 940 +++ A ++ + P+ L+G L F + G++ G+L+ + ++++ + Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 941 NQQAGLDPFHAVVEATVQRARPVLLTALAAILAFIPL-----THSVFWGTLAYTLIGGTL 995 + L P A ++ Q ++ A+ FIP+ + + + T++ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 996 GGTIMTLIFLPAMYAIWFRIRPE 1018 ++ LI PA+ A + Sbjct: 480 LSVLVALILTPALCATLLKPVSA 502
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.7 bits (98), Expect = 3e-06 Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 9/92 (9%) Query: 70 GKVLERRVETGQSVKRGQLLLRLDPADLALQAQSQQRAVDAARVRAKKAANDLARYRGLV 129 V E V+ G+SV++G +LL+L Q ++ AR L + R + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR---------LEQTRYQI 155 Query: 130 ASGAISAAEFDQINAAAEAARADLRAAQAQAN 161 S +I + ++ E ++ + Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Score = 31.3 bits (71), Expect = 0.005 Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 4/84 (4%) Query: 178 GVVVETLAEPGQVVSAGQVVIRLARAGQREARVQLPETLRPAVGSEALATRYGSESQPV- 236 +V E + + G+ V G V+++L G ++ +L A TRY S+ + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA---RLEQTRYQILSRSIE 161 Query: 237 TATLRLLSDAADATTRTFEARYVL 260 L L + + VL Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVL 185 Score = 28.6 bits (64), Expect = 0.044 Identities = 12/128 (9%), Positives = 37/128 (28%), Gaps = 15/128 (11%) Query: 103 SQQRAVDAARVRAKKAANDLARYRGLVAS--GAISAAEFDQINAAAEA----------AR 150 ++ ++ + A N+L Y+ + I +A+ + Sbjct: 250 AKHAVLEQENKYVE-AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTT 308 Query: 151 ADLRAAQAQANVAQNATGYAGLLADADGVVVE-TLAEPGQVVSAGQVVIRLARAGQR-EA 208 ++ + + + + A V + + G VV+ + ++ + E Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368 Query: 209 RVQLPETL 216 + Sbjct: 369 TALVQNKD 376
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 58.9 bits (142), Expect = 3e-13 Identities = 31/183 (16%), Positives = 52/183 (28%), Gaps = 10/183 (5%) Query: 4 FSRYGYEKTTVTDLAKAIGFSKAYIYKFFDSKQAIGEAICASRLEKIMVAVSEAIADAPS 63 FS+ G T++ ++AKA G ++ IY F K + I I E A P Sbjct: 24 FSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 83 Query: 64 ASEK-----LRRLFR-ALTEAGSELFFE--DRKLYDIAAVAARDKWPSTEQYAGHLQQLI 115 L + +TE L E K + +A + I Sbjct: 84 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA--QRNLCLESYDRI 141 Query: 116 GQILVEGRQAGEFERKTPLDEATLAVYMVMCPFINPVQLQYNLDTAPTAAVLLASLILRS 175 Q L +A A + + + + A +++L Sbjct: 142 EQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEM 201 Query: 176 LSP 178 Sbjct: 202 YLL 204
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 37.1 bits (86), Expect = 6e-05 Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 28/167 (16%) Query: 6 KVLILGASGGIGGEVARRLVADNWQVRA-----------LKRGAQIRDPEDGIQWIAGDA 54 K L+ GA+G IG V++RL+ QV LK+ + G Q+ D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 LDGGQVAA--AAAGCDVIVH-----AV-----NPPGYRHWRQQVLPMLRNTLQAAERQR- 101 D + A+ + + AV NP Y + L N L+ + Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD--SNLTGFL-NILEGCRHNKI 118 Query: 102 ALVVLPGTVYNYGPDA-FPLIAEEAAQQPVTRKGAIRVAMELTLKDY 147 ++ + YG + P +++ PV+ A + A EL Y Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (300), Expect = 5e-39 Identities = 34/89 (38%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 229 bits (585), Expect = 1e-77 Identities = 169/241 (70%), Positives = 192/241 (79%), Gaps = 7/241 (2%) Query: 18 MTLDLPRRFPWPTLLSVAIHGAVVAGLLYTSVHQVIEQPSPTQPIEITMVAPADLEPPPA 77 MTLDLPRRFPWPTLLSV IHGAVVAGLLYTSVHQVIE P+P QPI +TMV PADLEPP A Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 78 AQPVVEPVVVPEPEPEPEVVPEPPKEAPVVIHKPEPKPKPKPKPKPKPEKKVEQPKREVK 137 QP EPVV PEPEPEP P APVVI KP+PKPKPKPKP K + EQPKR+VK Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKE--APVVIEKPKPKPKPKPKPVKKVQ---EQPKRDVK 115 Query: 138 PAAEPRPASPFENNNTAPARTAPSTSTAAAKPTVTAPSGPRAISRVQPSYPARAQALRIE 197 P E RPASPFEN A T+ + + A +KP + SGPRA+SR QP YPARAQALRIE Sbjct: 116 P-VESRPASPFENTAPA-RLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIE 173 Query: 198 GTVRVKFDVSPDGRIDNLQILSAQPANMFEREVKSAMRRWRYEQGRPGTGVTMTIKFRLN 257 G V+VKFDV+PDGR+DN+QILSA+PANMFEREVK+AMRRWRYE G+PG+G+ + I F++N Sbjct: 174 GQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKIN 233 Query: 258 G 258 G Sbjct: 234 G 234
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.008 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 55 VVGESGCGKSTFARAI 70 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.5 bits (66), Expect = 0.040 Identities = 27/112 (24%), Positives = 44/112 (39%), Gaps = 8/112 (7%) Query: 392 YNTSDLHKKLAIAAASLWRK------NLGIDVKLVNQEWKTFLDTRHQGTYDVARAGWCA 445 YN + ++ I A + G+ V V +E+ F+ + + +G A Sbjct: 28 YNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREYDAFITNQLRAA-QTQSSGLTA 86 Query: 446 DYNEPTSFLNTMLSDSSMNTAHYKSPAFDKIMAESVKASDEAQRTAAYAKAE 497 Y E S ++ MLS S+ + A F + A D A R A K+E Sbjct: 87 RY-EQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNAEDPAARQALIGKSE 137
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.8 bits (215), Expect = 5e-21 Identities = 37/152 (24%), Positives = 61/152 (40%), Gaps = 3/152 (1%) Query: 10 ILIVEDEPVFRSLLHGWLTSLGATTFQAEDGKDALHKMTEVHPDLMICDISMPRMNGLEL 69 IL+ +D+ R++L+ L+ G + + DL++ D+ MP N +L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 VETLRNRGEQLPILMISATENMADIAKALRLGVQDVLLKPVKDFDRLRETVYACLYPAMF 129 + ++ LP+L++SA KA G D L KP D L + L A Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122 Query: 130 SSRVEEEERLFEDWDALVSNPIAASRLLQELQ 161 R + E +D LV A + + L Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.6 bits (69), Expect = 0.015 Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 128 TPFSIFVIISLLCGFAGANF-ASSMANISFFFPKAKQGGALGVNGGLGNMGVSVMQLV 184 + FS+ ++ + G A F A M ++ + PK +G A G+ G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158
>PF06580#Sensor histidine kinase Length = 349 Score = 48.3 bits (115), Expect = 5e-08 Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 9/116 (7%) Query: 476 FGFTVQLDYQLPPRFVPSHQAIHLLQIAREALSNALKHASAT-----EVTVTVSQRDNQV 530 F +Q + Q+ P + L+Q E N +KH A ++ + ++ + V Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTV 292 Query: 531 RLVVADNGRGVPDHAERSNHYGLIIMRDRAQSLRG-DCQVRRRETGGTEVIVTFIP 585 L V + G + + S GL +R+R Q L G + Q++ E G + IP Sbjct: 293 TLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.5 bits (183), Expect = 8e-18 Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 2/118 (1%) Query: 6 RATILLIDDHPMLRTGVKQLISMAPDIQVIGEASNGAQGIELAESLDPDLILLDLNMPGM 65 ATIL+ DD +RT + Q +S A V SN A + D DL++ D+ MP Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 66 NGLETLDKLREKSLSGRVVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 N + L ++++ V+V S N + A ++GA YL K + +L+ + +A Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>INTIMIN#Intimin signature. Length = 939 Score = 230 bits (588), Expect = 3e-69 Identities = 123/448 (27%), Positives = 209/448 (46%), Gaps = 22/448 (4%) Query: 1 MPVSFRLLPTLTFLLLLPGVPVWALTASDTTRPAQAQDPLPDMGIAPQVDDDARHFAEVA 60 +P + LP LL P+ A + PD+ + DD A ++A Sbjct: 117 LPFEYSALP------LLGSAPLVAAGGVAGHTNKLTKMS-PDVTKSNMTDDKALNYAAQQ 169 Query: 61 KKFGEASMSDNDLTAGEQAQLFAISKIGNEVSHQLESWLSPWGNANVDLLVDKEGKFTGS 120 + + L G+ A+ A+ GN+ S QL++WL +G A V+L F GS Sbjct: 170 AASLGSQLQSRSLN-GDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGN--NFDGS 226 Query: 121 KGSWFVPLQDNDRYLTWNQYSVTRREHDLVGNIGLGQRWRVGGWLLGYNSFYDKVLSESL 180 + +P D+++ L + Q + N+G GQR+ + +LGYN F D+ S Sbjct: 227 SLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDN 286 Query: 181 ARGSVGAEAWGEYLRLSANYYHPLGDW-QLRDNQTQEQRMAAGYDVTAQARLPFYQHINT 239 R +G E W +Y + S N Y + W + + + ++R A G+D+ LP Y + Sbjct: 287 TRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGA 346 Query: 240 SVSVEQYFGDSVDLFHSGTGYHNPVAVSVGLNYTPVPLVTVTAKHKQGENGVSQNNVGLK 299 + EQY+GD+V LF+S NP A +VG+NYTP+PLVT+ ++ G + ++ Sbjct: 347 KLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQ 406 Query: 300 LNYRFGVPLKQQLAADEVAISNSLRGSRFDSPERDNLPVVEYRQRKNLTVYLATP-PWDL 358 Y+F P QQ+ V +L GSR+D +R+N ++EY +K + L P + Sbjct: 407 FRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEY--KKQDILSLNIPHDING 464 Query: 359 QSGETVQLKLQIHSLHGIKALHWQGDTQALSLTPPVDASSPDG---WSIIMPVWNSEPGA 415 T +++L + S +G+ + W D+ S + S + I+P + G Sbjct: 465 TERSTQKIQLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAYV--QGG 521 Query: 416 ANRWRLSVVVEDKQGQRVSSNEIALALT 443 +N ++++ D+ G SSN + L +T Sbjct: 522 SNVYKVTARAYDRNGN--SSNNVLLTIT 547
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 104 bits (260), Expect = 1e-26 Identities = 51/152 (33%), Positives = 70/152 (46%), Gaps = 13/152 (8%) Query: 407 DWAPPPPPRPVIKQVVQGPQTIRLDSMALFDTGKSTLKPGSTKLL--VNSLLGIKAKPGW 464 + AP P P VQ + L S LF+ K+TLKP L + S L Sbjct: 195 EAAPVVAPAPAPAPEVQT-KHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG 253 Query: 465 LIVVAGHTDSIGNDRSNQQLSLKRAEAVRDWMRDTGDVPESCFAVQGYGASRPVASN--- 521 +VV G+TD IG+D NQ LS +RA++V D++ G +P + +G G S PV N Sbjct: 254 SVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKG-IPADKISARGMGESNPVTGNTCD 312 Query: 522 ------ETPEGRAQNRRVEISLVPQKDACLAP 547 + A +RRVEI + KD P Sbjct: 313 NVKQRAALIDCLAPDRRVEIEVKGIKDVVTQP 344
>TRNSINTIMINR#Translocated intimin receptor (Tir) signature. Length = 549 Score = 28.9 bits (64), Expect = 0.013 Identities = 14/56 (25%), Positives = 32/56 (57%), Gaps = 2/56 (3%) Query: 11 LKAGLVSSKKMAKVQRTAKKSRVQAREAREAVEENKKAQLERDKQLSEQQKQAVLA 66 + +G + + ++ + AK++ AR+ +AVE N +AQ + Q + +Q++ L+ Sbjct: 308 IPSGELKDDIVEQIAQQAKEAGEVARQ--QAVESNAQAQQRYEDQHARRQEELQLS 361
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.011 Identities = 14/76 (18%), Positives = 32/76 (42%), Gaps = 15/76 (19%) Query: 18 FIKDENGENRYFHVIKVANPDLIKKDAAVTFEPTTNNKGLSAYAVKVIPESKYIYIAGER 77 ++ D G R++ V+ +L+ L + ++ E+ ++Y+AGER Sbjct: 698 YLFDITGNRRFWPVLVPGRANLV---------------WLQKFRGQLFAEALHLYLAGER 742 Query: 78 LKLTSIKSYVVYREEE 93 + + +R E+ Sbjct: 743 YFPSPEDEEIYFRPEQ 758
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.2 bits (94), Expect = 1e-05 Identities = 74/369 (20%), Positives = 138/369 (37%), Gaps = 62/369 (16%) Query: 67 LMRPIGAIVLGAYIDKVGRRKGLIVTLSIMATGTFLIVLIPSYQTIGLWAPLLVLIGRLL 126 LM+ A VLGA D+ GRR L+V+L+ A ++ P LW ++ IGR++ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIV 105 Query: 127 QGFSAGAELGGVSVYLAEIATSGRKGFYTSWQSGSQQVAIMVAAAMGFALNAVLEPSAIS 186 G + GA Y+A+I + + + S ++ +G + Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF------- 157 Query: 187 DWGWRIPFLFGCLIVPFIFIL------------RR--KLEETQEFTARRHHLAMRQVFAT 232 PF + F+ RR + E + R M V A Sbjct: 158 --SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAAL 215 Query: 233 LLANWQVVIAGMMMVAMTTTAFYLITVYAPTFGKKVLMLSASD-SLLVTLLVAISNFFWL 291 + V M +V A ++I FG+ A+ + + + + Sbjct: 216 M-----AVFFIMQLVGQVPAALWVI------FGEDRFHWDATTIGISLAAFGILHSLAQA 264 Query: 292 PVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFLMMLSVLLWLSFIYGMYNGA 351 + G ++ R G R + + ++A T +LA A M +++ L+ G Sbjct: 265 MITGPVAARLGERR-ALMLGMIADGT---GYILLAFATRGWMAFPIMVLLAS-----GGI 315 Query: 352 MIPALTEIMPAEV------RVAGFSLAYSLATAVFGGFTPVISTALIEYTGDKASPGYWM 405 +PAL ++ +V ++ G A + T++ G P++ TA+ + + W+ Sbjct: 316 GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG---PLLFTAIYAASITTWNGWAWI 372 Query: 406 SFAAICGLL 414 + AA+ L Sbjct: 373 AGAALYLLC 381 Score = 36.3 bits (84), Expect = 2e-04 Identities = 39/157 (24%), Positives = 62/157 (39%), Gaps = 20/157 (12%) Query: 273 ASDSLLVTLLVAISNFFWLPVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFL 332 + ++ L A+ F PV GALSDRFGRR VL L++LA A ++A AP Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVL----LVSLAGAAVDYAIMATAPFLW 97 Query: 333 MMLSVLLWLSFIYGMYNGAMIPALT----EIMPAEVRVAGFSLAYSLATAVFGGFT--PV 386 + L++ I GA +I + R F ++ G PV Sbjct: 98 V-----LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF---MSACFGFGMVAGPV 149 Query: 387 ISTALIEYTGDKASPGYWMSFAAICGLLATCYLYRRS 423 + + ++ +P + + L C+L S Sbjct: 150 LGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPES 184
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 44.2 bits (104), Expect = 3e-08 Identities = 21/64 (32%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 73 STWLGRNGIYMEDLYVTPDYRGIGAGKALLKTIAQYAVQRQCGRLEWSVLDWNQPAIDFY 132 S W G +ED+ V DYR G G ALL ++A + L D N A FY Sbjct: 84 SNWNGY--ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141 Query: 133 LSIG 136 Sbjct: 142 AKHH 145
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 27.2 bits (60), Expect = 0.043 Identities = 9/37 (24%), Positives = 14/37 (37%) Query: 1 MKLRWLLILVVFLAGCSSKHDYTNPPWNPEVPVKRAM 37 ++L L + L GC+S +P R M Sbjct: 3 LRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTM 39
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 539 bits (1391), Expect = 0.0 Identities = 286/356 (80%), Positives = 319/356 (89%) Query: 1 MTRPVVASIDLLALRQNLQIVRRAAPGSRLWAVVKANAYGHGVARVWSALSAADGFALLN 60 MTRP+ AS+DL AL+QNL IVR+AA +R+W+VVKANAYGHG+ R+WSA+ A DGFALLN Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60 Query: 61 LEEAILLREQGWKGPILLLEGFFHADELAVLDQYRLTTSVHSNWQIKALQQAKLRAPLDI 120 LEEAI LRE+GWKGPIL+LEGFFHA +L + DQ+RLTT VHSNWQ+KALQ A+L+APLDI Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120 Query: 121 YLKVNSGMNRLGFMPERVHTVWQQLRAISNVGEMTLMSHFAEAENPQGIVEPMRRIEQAA 180 YLKVNSGMNRLGF P+RV TVWQQLRA++NVGEMTLMSHFAEAE+P GI M RIEQAA Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180 Query: 181 EGLDCPRSLANSAATLWHPEAHFDWVRPGIVLYGASPSGQWQDIANTGLKPVMTLRSEII 240 EGL+C RSL+NSAATLWHPEAHFDWVRPGI+LYGASPSGQW+DIANTGL+PVMTL SEII Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240 Query: 241 GVQNLRPGEAIGYGGLYRTTQEQRIGIVACGYADGYPRVAPSGTPVLVDGVRTTTVGRVS 300 GVQ L+ GE +GYGG Y EQRIGIVA GYADGYPR AP+GTPVLVDGVRT TVG VS Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300 Query: 301 MDMLAVDLTPCPQAGIGAPVELWGKEIKIDDVAASSGTVGYELMCALAPRVPVVTL 356 MDMLAVDLTPCPQAGIG PVELWGKEIKIDDVAA++GTVGYELMCALA RVPVVT+ Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 0.001 Identities = 18/50 (36%), Positives = 20/50 (40%), Gaps = 10/50 (20%) Query: 18 PGVKALSDISFDCYPGQIHALMGENGAGKSTLLKILSGNYIPTAGHLQIG 67 PG K FD L G G GKSTL+ L G + H IG Sbjct: 591 PGCK------FDYSV----VLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 104 bits (262), Expect = 2e-26 Identities = 81/395 (20%), Positives = 158/395 (40%), Gaps = 20/395 (5%) Query: 25 FMEFLDGTVIATALPDMARDFGVTAVELNIGISAYLITLAVLIPASGWIADRFGARAIFT 84 F L+ V+ +LPD+A DF N +A+++T ++ G ++D+ G + + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 85 LALAIFTLASVFCGLS-TEVHIFVAMRILQGVGGALMVPVGRLAVLRTTPKHQLIKAIAT 143 + I SV + + + + R +QG G A + + V R PK KA Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 144 LTWPALVAPIIGPPLGGFITRYASWHWIFFINVPLGLAAIILSLRIIPDIRETERRSFDL 203 + + +GP +GG I Y HW + + +P+ + L + + FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 204 SGFITTSVAMVSLVTAMERLGDRQPQIWPTLALAALGFGCLLYSIRHFRRAAAPMVRLDA 263 G I SV +V + L ++ L F ++H R+ P V Sbjct: 202 KGIILMSVGIVFFMLFTTS------YSISFLIVSVLSFLIF---VKHIRKVTDPFVDPGL 252 Query: 264 LQVPTFRVTMYGGSLFRASISAVPFLLPLLFQVGFGMDPFHSGLLVLAVFVGNLTI---K 320 + F + + G + +++ ++P + + + G +++ F G +++ Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII--FPGTMSVIIFG 310 Query: 321 PATTPLIRWLGFRRLLLINGALNVCSLLACALLTPQTPVW-AIMLILYLGGVFRSIQFTG 379 L+ G +L I S L + L T + I+++ LGG+ S T Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKTV 368 Query: 380 VSTLAFADVPAAQMSDANTLFSTASQLAVGLGITL 414 +ST+ + + + +L + S L+ G GI + Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>ENTEROTOXINA#Heat-labile enterotoxin A chain signature. Length = 258 Score = 28.8 bits (64), Expect = 0.025 Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 5/36 (13%) Query: 8 PDEQRRLQSLRSSGLLNSGKEERFDRLTRLARSLYN 43 PDE +R S GL+ G E FDR T++ +LY+ Sbjct: 31 PDEIKR-----SGGLMPRGHNEYFDRGTQMNINLYD 61
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.021 Identities = 33/153 (21%), Positives = 57/153 (37%), Gaps = 35/153 (22%) Query: 78 LGGIIFGHFGDRLGRKRMLMMTVWMMGIATACIGLLPSFNQIGWWAPVLLVFLRAVQGFA 137 +G ++G D+LG KR+L + GI C G + F +G LL+ R +QG Sbjct: 64 IGTAVYGKLSDQLGIKRLL-----LFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGA- 115 Query: 138 VGGEWGGAALLS---------VENAPQGKK-AFYSSGVQVGYGVGLLLSTGLVSLISSLT 187 G AA + + +GK S V +G GVG + + I Sbjct: 116 -----GAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI---- 166 Query: 188 SDQQFLSWGWRLPFLFSVVLVLIALWIRNGMAE 220 W L ++ ++ ++ + + Sbjct: 167 --------HWSYLLLIPMITIITVPFLMKLLKK 191
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 27.5 bits (61), Expect = 0.011 Identities = 14/84 (16%), Positives = 33/84 (39%), Gaps = 6/84 (7%) Query: 16 RLYRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIIASMKSDYEDR 75 Y++ NKL++ + +++ N++ + L ++ I + + I+ I E Sbjct: 385 SYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTNINNADNYDEIEEIKK------ELI 438 Query: 76 VDDYIIKNAELSKERRDISKKLKV 99 YI ++ SK + Sbjct: 439 ETGYIKFKKIYKSKKSKTSKPMHF 462 Score = 26.4 bits (58), Expect = 0.025 Identities = 10/45 (22%), Positives = 17/45 (37%), Gaps = 1/45 (2%) Query: 13 EFVRLYRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMS 57 R ++ L ++ E K LL N+ +K G+S Sbjct: 311 NINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYA-LKKGLS 354
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.5 bits (66), Expect = 0.028 Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 12/83 (14%) Query: 25 AISASDSISKTVTEILNNVKA--NGDAALREYSAKFDKTTVAALQVSEAEIAAAGERLSD 82 + + I T L + D +++ + + VS +E+A A L + Sbjct: 138 NLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGN--------VSSSELAKASIELIN 189 Query: 83 ELKQAMAVAVKNIETFHNAQQLQ 105 +L +A N+ +F +QQL Sbjct: 190 QLVDTVASLNNNVNSF--SQQLN 210
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.3 bits (73), Expect = 0.004 Identities = 14/66 (21%), Positives = 28/66 (42%) Query: 333 HELESSNLSTEKLQASIASQDQVLKAREEEIDELRASVAQKKERIDRLMERNAYLETEYQ 392 + + + + L+A A+ + E + L A+ + +D E LE E+Q Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333 Query: 393 KQQDQL 398 K ++Q Sbjct: 334 KLEEQN 339
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.031 Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 1/30 (3%) Query: 5 KILIVG-AGFSGAVIGRQLAEQGHQVHIID 33 K L+ G AGF G + ++L E GHQV ID Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 583 bits (1505), Expect = 0.0 Identities = 260/334 (77%), Positives = 301/334 (90%) Query: 1 MKFLVTGAAGFIGFHIAQRLLNEGHDVVGIDNMNDYYDVSLKQARLDRLASPAFHFQQLD 60 MK+LVTGAAGFIGFH+++RLL GH VVGIDN+NDYYDVSLKQARL+ LA P F F ++D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 61 LADREGMAKLFATEQFDRVIHLAAQAGVRYSLENPYAYADANLMGYLNILEGCRHTKVKH 120 LADREGM LFA+ F+RV + VRYSLENP+AYAD+NL G+LNILEGCRH K++H Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 121 LVYASSSSVYGLNRKMPFSTEDSVDHPVSLYAATKKANELMAHTYSHLYGIPTTGLRFFT 180 L+YASSSSVYGLNRKMPFST+DSVDHPVSLYAATKKANELMAHTYSHLYG+P TGLRFFT Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180 Query: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIVEAVVRVQDVIPQANAD 240 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI EA++R+QDVIP A+ Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240 Query: 241 WTVEDGSPATSSAPYRVYNIGNSSPVELMDYITALEEALGMEAQKNMMPIQPGDVLDTSA 300 WTVE G+PA S APYRVYNIGNSSPVELMDYI ALE+ALG+EA+KNM+P+QPGDVL+TSA Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSA 300 Query: 301 DTQPLYDLVGFKPQTSVKDGVKNFVEWFKDYYQI 334 DT+ LY+++GF P+T+VKDGVKNFV W++D+Y++ Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 27.7 bits (61), Expect = 0.049 Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 14/64 (21%) Query: 127 SGISIGKGSMVSWRVQFLDEDFHLVSYNDKKPKDGKITIGENCLIGNNVAINKGCI-IAD 185 +G+S+ +G V+W+V N + + K IG+ LI NKG + + D Sbjct: 425 AGVSVAEGKTVTWKVH-----------NPQYDRLAK--IGKGTLIVEGTGDNKGSLKVGD 471 Query: 186 GCVV 189 G V+ Sbjct: 472 GTVI 475
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 47.8 bits (114), Expect = 5e-08 Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 20/129 (15%) Query: 132 AMMLH-IKLQAESQLPEQIDQAVIGRPINFQGLGGDEANAQAQGILERAALRAGFRDVVF 190 M+ H IK + + ++ P+ + E A + +A AG R+V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140 Query: 191 QFEPVAAGLDFEATLSEEKRVLVVDIGGGTTDCSLLLMGPQWRERADRQQSLLGHSGCRI 250 EP+AA + +SE +VVDIGGGTT+ +++ + ++ S RI Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 33.6 bits (77), Expect = 0.001 Identities = 30/126 (23%), Positives = 50/126 (39%), Gaps = 32/126 (25%) Query: 332 RLSYRLV---RSAEESKIALSSAAS-------------VETALPFIQDELATAIAQQGLE 375 R +Y + +AE K + SA + +P L + + Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP-RGFTLNSNEILE--- 258 Query: 376 AALDQPLTRIMEQVRLALDSSQTTPDV--------IYLTGGSARSPLIKKALAAQLPGIP 427 AL +PLT I+ V +AL+ Q P++ + LTGG A + + L + GIP Sbjct: 259 -ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIP 314 Query: 428 LAGGDD 433 + +D Sbjct: 315 VVVAED 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.6 bits (160), Expect = 1e-14 Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 6/141 (4%) Query: 4 RLAIIEDNADLLDELLAWLGYRGFEVWGTRSAEAFWRQLHSHPVDIVLVDIGLPGEDGFS 63 + + +D+A + L L G++V T +A WR + + D+V+ D+ +P E+ F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 VLNYLHELGHY-GLVVVSARGQQQDKLQALSLGADAYLIKPVNFAH-LAETLTALGARLR 121 +L + + ++V+SA+ ++A GA YL KP + + AL R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 QDRP----AAPPAEAIGTPPA 138 + + +G A Sbjct: 125 RPSKLEDDSQDGMPLVGRSAA 145
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.4 bits (105), Expect = 6e-07 Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 16/139 (11%) Query: 55 GAALAPVQAATATEEAVPRYLTGLGTVTAA-NTVTVRSRVDGQLLSLHFQEGQQVKAGDL 113 +A + + E V T G +T + + ++ + + + +EG+ V+ GD+ Sbjct: 67 FLVIAFILSVLGQVEIV---ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDV 123 Query: 114 LAQIDPSQFKVALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVE 173 L ++ A+ K Q++L AR + RYQ L ++ EL+ L + Sbjct: 124 LLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQILSRS-----IELNKLPELKLP 171 Query: 174 SAGTVKADEAAVASAQLQL 192 + L Sbjct: 172 DEPYFQNVSEEEVLRLTSL 190 Score = 36.3 bits (84), Expect = 2e-04 Identities = 26/170 (15%), Positives = 63/170 (37%), Gaps = 17/170 (10%) Query: 125 ALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVESAGTVKADEAA 184 +A +L ++ L ++ ++ + ++ +++ + Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEY------QLVTQLFKNEILDKLRQTTDNIGL 313 Query: 185 V----ASAQLQLDWTRITAPIDGRV-GLKQVDIGNQISSGDTTGIVVLTQTHPIDVVFTL 239 + A + + + I AP+ +V LK G +++ +T ++V ++V + Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED-DTLEVTALV 372 Query: 240 PESSIATVVQAQKAGKALSVEAWDRTNKQKISVGE--LLSLDNQIDATTG 287 I + Q A + VEA+ T + G+ ++LD D G Sbjct: 373 QNKDIGFINVGQNA--IIKVEAFPYTRYGYLV-GKVKNINLDAIEDQRLG 419
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 898 bits (2323), Expect = 0.0 Identities = 294/1036 (28%), Positives = 509/1036 (49%), Gaps = 29/1036 (2%) Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRFLPVSALPEVDYPTIQVVTLYPGASPDVVTSAI 72 + FI RP+ +L + +++AG + LPV+ P + P + V YPGA V + Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPVYSKVNPADPPIMTLAVTSSAIPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + + S + +M S TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITSANVNSAKGSLDGP------ARAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSAEDYRRLII-AYQNGAPIRLGDVASVEQGAENSWLGAWANQQRAIVMNVQRQPGANI 302 ++ E++ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IDTADSIRQMLPQLTESLPKSVKVQVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362 +DTA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNVPATIIPGVAVPLSLVGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SHESLRKQNRFSRASERFFERVIAVYGRWLSRVLNHPWL 538 +S +V+L LTP +CA +L S E + F F+ + Y + ++L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 
TLGVALSTLALSIILWVFIPKGFFPIQDNGIIQGTLQAPQSVSFASMAERQRQVANIILK 598 L + +A ++L++ +P F P +D G+ +Q P + + QV + LK Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VESLTSFVGVDGTNPALNSARLQINLKPLDERDDR---VQTVISRLQQAVDGVPG 653 + VES+ + G + A N+ ++LKP +ER+ + VI R + + + Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 VALYLQPTQDLTIDTTVSRTQYQFTLQ---ANSLEALSTWVPPLLSRLQAQP-QLADVSS 709 ++ P I + T + F L +AL+ LL P L V Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDKGLAAYIKVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEQDTE 769 + + ++VD++ A LG+S++D++ + A G ++ + ++ ++ D + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ATPGLAALENIRLTSSDGGIVPLTAIATVEQRFTPLSVNHLDQFPVTTISFNVPDNYSLG 829 ++ + + S++G +VP +A T + + + P I S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 EAVEAILAAEQSLDFPTDIRTQFQGSSLAFQSALGSTVWLVVAAVVAMYIVLGVLYESFI 889 +A+ + L P I + G S + + LV + V +++ L LYES+ Sbjct: 838 DAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALWLAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMPPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLMLSQV 1009 G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 890 bits (2302), Expect = 0.0 Identities = 280/1035 (27%), Positives = 502/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILISLAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ ++++ + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVDLNPQALFNQGVSLDAVRTAISDANVRKPQG------ALEDSAHRWQVQTNDELK 236 A+R+ L+ L ++ V + N + G AL + K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAADYQPLIVHY-QNGAAVRLGDVATVSDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 ++ + + +G+ VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRARLPELQQTIPAAIDLQIAQDRSPTIRASLEEVEQTLVISVALVILVVFLFLRS 355 T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGSREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LAVSLTLTPMMCGWLLKSGKPHQPTRNRGFG----RLLVAVQGGYGKSLKWVLKHSRLTG 530 + V+L LTP +C LLK GF Y S+ +L + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 
LVVLGTIALSVWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 L+ +A V L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 R-EDPAVDNVTGFT-GGSRVNSGMMFITLKPRDQRH---ETAQQVIDRLRKKLANEPGAN 641 + +V V GF+ G N+GM F++LKP ++R+ +A+ VI R + +L Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLSALREWEPKIRKALAAL-----PELADVNSD 696 + + I G + ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMDLVYDRDTMSRLGISVQDANNLLNNAFGQRQISTIYQPLNQYKVVMEVDPAY 756 ++ A+ L D++ LG+S+ D N ++ A G ++ K+ ++ D + Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDVSALDKMFVINSDDKPIPLAYFAKWQPANAPLSVNHQGLSAASTISFNLPTGRSLSE 816 +DK++V +++ + +P + F + + I G S + Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ASEAIDRAMTQLGVPSSVRGSFAGTAQVFQQTMNAQVILILAAIATVYIVLGVLYESYVH 876 A ++ ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALEIFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRNGN 936 P++++ +P VG LLA +F+ + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPEEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 123 bits (311), Expect = 5e-33 Identities = 93/435 (21%), Positives = 186/435 (42%), Gaps = 17/435 (3%) Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMIIVSYVLTVAVMLPASGWLADRVGVRNIFF 79 F L+ ++N +LP +A + P + + +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 TAIVLFTAGSLFCAQA-STLDQLVMARVLQGVGGAMMVPVGRLTVMKIVPRDQYMAAMTF 138 I++ GS+ S L+MAR +QG G A + + V + +P++ A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAIATLCLMPNYTMQTRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP + I+ + L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFLLLAAGMATLTLALDGQKGLGISPAWLAGLVAVGLCALLLYLWHARGNARALFSLNL 257 G +L++ G+ L + ++ + V + + L+++ H R L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRNRTFSLGLGGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 +N F +G+ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVLVASTLGLAAVSLLFMFSALAGWYYVLPLVLFLQGMINASRFSSMNT 376 +V+R G VL L+ L F +++ +++F+ G ++ ++ + ++T Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK-TVIST 371 Query: 377 LTLKDLPDDLASSGNSLLSMVMQLSMSIGVTIAGLLLGLYGQQHMSLDAASTHQVFLYT- 435 + L A +G SLL+ LS G+ I G LL + L +LY+ Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431 Query: 436 -YLSMAAIIALPALI 449 L + II + L+ Sbjct: 432 LLLLFSGIIVISWLV 446
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 36.3 bits (84), Expect = 2e-04 Identities = 28/93 (30%), Positives = 36/93 (38%), Gaps = 21/93 (22%) Query: 198 LATLLAA-------------LATFPLARGLLAPVKRLVEGTHKLAA------GDFST--R 236 LATL+AA + P L+A V+ V H LA G F Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136 Query: 237 VTVTGGDELGRLAQDFNQLASTLERNQQMRRDL 269 V G+ G L N+LA E+ QQMR + Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMRSRI 169
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.8 bits (189), Expect = 2e-18 Identities = 31/148 (20%), Positives = 67/148 (45%), Gaps = 3/148 (2%) Query: 11 PRILIVEDEPKLGQLLIDYLQAAGYAPTLINHGDKVLPYVRQTPPHLILLDLMLPGTDGL 70 IL+ +D+ + +L L AGY + ++ + ++ L++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDVPVVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL--RR 127 L I+ D+PV++++A+ + + E GA DY+ KP+ E++ + L + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 128 CKPQRDLQALDAQSPLIVDEGRFQASWR 155 +P + PL+ Q +R Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYR 151
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.008 Identities = 19/93 (20%), Positives = 28/93 (30%), Gaps = 35/93 (37%) Query: 36 LVGESGSGKTTVLKCLAGLFTHWQGELTI---------------------------DAQP 68 L G G GK+T++ L GL I DA+ Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660 Query: 69 LGHEISRERCRQVQMVFQDPYGSL---HPRHTI 98 + S + R ++ YG HPR + Sbjct: 661 VKAFFSSRKDR-----YRGAYGRYVQDHPRQVV 688
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.1 bits (91), Expect = 3e-05 Identities = 31/160 (19%), Positives = 63/160 (39%), Gaps = 5/160 (3%) Query: 221 LYTNRSILFSSIVRIINTLSLFGFAVIMPMMFVDELGFTTSEWLQVWAAFFFTTIFSNVF 280 L N + + I ++ GF ++P M D +T+E + + F S + Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE---IGSVIIFPGTMSVII 308 Query: 281 WGIVAEKMGWMKVIRWFGCIGMALSSLAFYYLP-QHFGHNFAMALVPAIALGIFVAAFVP 339 +G + + + + IG+ S++F ++ M ++ LG Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368 Query: 340 MAAVFP-ALEPNHKGAAISVYNLSAGLSNFLAPAIAVVLL 378 ++ + +L+ GA +S+ N ++ LS AI LL Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 34.7 bits (80), Expect = 0.001 Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 17/105 (16%) Query: 13 QAARGESPFDLLLIDAQIVDMATGEIRPADVGIVGEMIASVHPRGSRE----------DA 62 Q R D ++ +A I+D G I AD+G+ IA++ G+ + Sbjct: 60 QVTREGGAVDTVITNALILD-HWG-IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117 Query: 63 HEVRSLAGGYLSPGLMDTHVHLESSHLPPERYAEIVLTQGTTAVF 107 EV + G ++ G MD+H+H + P++ E L G T + Sbjct: 118 TEVIAGEGKIVTAGGMDSHIHF----ICPQQ-IEEALMSGLTCML 157
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.0 bits (75), Expect = 2e-04 Identities = 15/85 (17%), Positives = 35/85 (41%), Gaps = 7/85 (8%) Query: 57 DEQLWVAECDGQPVGFAAV---WTVDNFLHHLFVDPDWQGKHIGSALLAQVERSFTASGT 113 + ++ + +G + W + + V D++ K +G+ALL + + Sbjct: 64 GKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123 Query: 114 LKCLMENKN----ALRFYQRHGWTI 134 ++E ++ A FY +H + I Sbjct: 124 CGLMLETQDINISACHFYAKHHFII 148
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.6 bits (173), Expect = 3e-16 Identities = 36/168 (21%), Positives = 70/168 (41%), Gaps = 10/168 (5%) Query: 3 RVLIVDDEPLARENLRILLETQRDIEIVGECGNAVEAIGAVHKLRPDVLFLDIQMPRISG 62 +L+ DD+ R L L + ++ NA + D++ D+ MP + Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 63 LEMVGMLDPEHRPYI--VFLTAFD--EYAVKAFEEHAFDYLLKPIEAARLEKTLARLRQE 118 +++ + + RP + + ++A + A+KA E+ A+DYL KP + L + R E Sbjct: 63 FDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 119 RNLQDVSLLDDAQQTLKYIPCTGHSRIWLLQMEDVAFVSSRMSGIYVT 166 + L DD+Q + + G S +A + + +T Sbjct: 122 PKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMIT 166
>PF06580#Sensor histidine kinase Length = 349 Score = 211 bits (539), Expect = 1e-65 Identities = 59/216 (27%), Positives = 117/216 (54%), Gaps = 3/216 (1%) Query: 343 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTLKAVIRRDSDQA 402 L G + + + ++ ++++ L AQ+NPHF+FNALN ++A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 403 GQLVQYLSTFFRKNLKR-PTEIVTLADEIEHVNAYLQIEKARFQANLQIQMAVPEGLAHH 461 +++ LS R +L+ V+LADE+ V++YLQ+ +F+ LQ + + + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 462 QLPAFTLQPIVENAIKHGTSQHLGVGEITIRASQDDRWLQLDIEDNAGL-YRANPQASGL 520 Q+P +Q +VEN IKHG +Q G+I ++ ++D+ + L++E+ L + +++G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 521 GMNLVDRRLRARFGADCGISVTCEPERFTRVTLRLP 556 G+ V RL+ +G + I ++ + + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 547 bits (1411), Expect = 0.0 Identities = 266/383 (69%), Positives = 299/383 (78%), Gaps = 18/383 (4%) Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLYGKIDGLHYFSDDKSVDGDQTYMRVG 60 MK KVL+L++PALL AGAA+AAEIYNKDGNKLDLYGK+DGLHYFSDD S DGDQTYMRVG Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 VKGETQINDQLTGYGQWEYNVQANNTESSSDQAWTRLAFAGLKFGDAGSFDYGRNYGVVY 120 KGETQINDQLTGYGQWEYNVQAN TE +WTRLAFAGLKFGD GSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DVTSWTDVLPEFGGDTYG-SDNFLQSRANGVATYRNSDFFGLVDGLNFALQYQGKNGSVS 179 DV WTD+LPEFGGD+Y +DN++ RANGVATYRN+DFFGLVDGLNFALQYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 180 ------GEGATNNGRGWSKQNGDGFGTSLTYDIWDGISAGFAYSHSKRTDEQNSVPA-LG 232 G NNG NGDGFG S TYDI G SAG AY+ S RT+EQ + + Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240 Query: 233 RGDNAETYTGGLKYDANNIYLASQYTQTYNATRAGSL------GFANKAQNFEVVAQYQF 286 GD A+ +T GLKYDANNIYLA+ Y++T N T G G ANK QNFEV AQYQF Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300 Query: 287 DFGLRPSVAYLQSKGKDLER---GYGDQDILKYVDVGATYYFNKNMSTYVDYKINLLD-D 342 DFGLRP+V++L SKGKDL D+D++KY DVGATYYFNKN STYVDYKINLLD D Sbjct: 301 DFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDDD 360 Query: 343 NSFTRNAGISTDDVVALGLVYQF 365 + F ++AGISTDD+VALG+VYQF Sbjct: 361 DPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.8 bits (119), Expect = 2e-09 Identities = 31/170 (18%), Positives = 66/170 (38%), Gaps = 27/170 (15%) Query: 1 MNTMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPDLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ PDL ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLEKISASGYGDKRL---SPKESEVLRL 161 DL + + + + R K+ L S E+ R+ Sbjct: 107 DLTELIGIIGRAL----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.7 bits (207), Expect = 9e-19 Identities = 29/104 (27%), Positives = 46/104 (44%) Query: 825 ILVVDDHPINRRLLADQLGSLGYQCVTANDGIDALNVLSKQHIDIVLSDVNMPNMDGYRL 884 ILV DD R +L L GY ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 885 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLD 928 RI++ LPV+ ++A + E G L KP L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 28.8 bits (64), Expect = 0.039 Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 4/54 (7%) Query: 252 NYSYDWMFKPGAMAQIAQYADGIGPDYHMLVAEGSKPGAVKLTAMVKEAHASHL 305 +Y YD+ F A+ +G Y+ L + K + + A+ A + HL Sbjct: 1143 SYGYDFAFFRNALVLKPS----VGVSYNHLGSTNFKSNSNQKVALKNGASSQHL 1192
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.029 Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 6/119 (5%) Query: 59 GFSRGDLGFALSGISIAYGFSK-FIMGSVSDRSNPRIFLPAGLILAALVMLVMGFVPWAT 117 + +G +L+ I + ++ I G V+ R R L G+I +++ F Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 118 SSIMIMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLF 176 + IM +L G+G P + ++ +G + ++ + PLLF Sbjct: 302 MAFPIMVLLASG-----GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 35.5 bits (82), Expect = 4e-04 Identities = 31/195 (15%), Positives = 61/195 (31%), Gaps = 39/195 (20%) Query: 268 GYGLTEFASTVCAKEADGAADVGEAL----PGREVKIVAGEIWLRASSMAAGYWRDGQLL 323 G+G+ S + A + ++ EA+ G + I+ E + A + D L Sbjct: 40 GHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIY---DQHRL 96 Query: 324 SLTNNEGWFATRDRGALHNGRLTVVGRMDNLFFSGGEGIQPEEVERVILAHPQVQQVFIV 383 + + W + A L + ++++ G QP+ V V + V + Sbjct: 97 TTCVHSNWQLKALQNARLKAPLDIYLKVNSGM--NRLGFQPDRVLTVWQQLRAMANVGEM 154 Query: 384 PL-------DNAEYGQRPVAVVECDDGCELSALAAWSAERLARFQQPVRWLRLPETLKNG 436 L ++ + +AR +Q L +L N Sbjct: 155 TLMSHFAEAEHPDGIS----------------------GAMARIEQAAEGLECRRSLSNS 192 Query: 437 GIKISRRALC-EWVR 450 + +WVR Sbjct: 193 AATLWHPEAHFDWVR 207
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 381 bits (981), Expect = e-138 Identities = 223/253 (88%), Positives = 238/253 (94%), Gaps = 2/253 (0%) Query: 1 MNYRLSALALGATLLVGCASSSSGDRPQGRSDPLEGFNRTMFNFNFNVVDPYVLRPVAVA 60 M RLSALALG TLLVGCASS G QGRSDPLEGFNRTM+NFNFNV+DPY++RPVAVA Sbjct: 1 MKLRLSALALGTTLLVGCASS--GTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVA 58 Query: 61 WRDYVPQPARNGLSNFTSNLEEPAVMVNYFLQGDPYKGMVHFTRFFLNTILGMGGLIDVA 120 WRDYVPQPARNGLSNFT NLEEPAVMVNYFLQGDPY+GMVHFTRFFLNTILGMGG IDVA Sbjct: 59 WRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVA 118 Query: 121 GMANPQLQRVEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDEGGDMADGLYPVLSWLTW 180 GMANP+LQR EPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRD+GGDMAD LYPVLSWLTW Sbjct: 119 GMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTW 178 Query: 181 PMSIGKWAVEGIETRAQLLDSDGLLRQSSDPYILMREAYFQRHDFIANGGKLTPADNPNA 240 PMS+GKW +EGIETRAQLLDSDGLLRQSSDPYI++REAYFQRHDFIANGG+L P +NPNA Sbjct: 179 PMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNA 238 Query: 241 QAIQDELKDIDSQ 253 QAIQD+LKDIDS+ Sbjct: 239 QAIQDDLKDIDSE 251
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 33.3 bits (76), Expect = 0.001 Identities = 73/361 (20%), Positives = 117/361 (32%), Gaps = 19/361 (5%) Query: 31 LLPDIRAASGMSYTLAALLTALPVIAMGVLALAAGWVDRYIGQKRSIALSLLIIAAGALL 90 LL D+ ++ ++ LL ++ + DR+ G++ + +SL A + Sbjct: 31 LLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF-GRRPVLLVSLAGAAVDYAI 89 Query: 91 REIAPNSGLLLTSALAGGIGIGIIQAAIPAVIKHLFPRRT-PLVMGLWSAALMGGGGLGA 149 AP +L + GI G A A I + G SA G G Sbjct: 90 MATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGP 148 Query: 150 AFTPWLA--SHSAAWHDALAWWALPALLALL----SWLAICRHLPRAPHQTSASSRVAII 203 + S A + A A L L S R L R AS R A Sbjct: 149 VLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARG 208 Query: 204 GQRRAWTLGLYFG--LINAGYASLIAWLPPYYIQLGDSAQYSGSLLALLTVGQTAGALLL 261 A + ++F L+ A+L D+ SL A + A A++ Sbjct: 209 MTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW-DATTIGISLAAFGILHSLAQAMIT 267 Query: 262 PALARQEDRRQLLLLALALQLIGFCGFIWLPEHFSALWAIACGVGLGGAFPLC---LVLA 318 +A + R+ L+L + G+ + + A + G P L Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327 Query: 319 LDHAGQPAVAGRLVAFMQGIGFIIAGLSPWLSGLLRSLSGNYTLDWSWHAICVLLLMALT 378 +D Q + G L A + P L + + S W+W A L L+ L Sbjct: 328 VDEERQGQLQGSLAALTSLTSIV----GPLLFTAIYAASITTWNGWAWIAGAALYLLCLP 383 Query: 379 L 379 Sbjct: 384 A 384
>PF06580#Sensor histidine kinase Length = 349 Score = 217 bits (555), Expect = 5e-68 Identities = 58/207 (28%), Positives = 101/207 (48%), Gaps = 11/207 (5%) Query: 348 RAEQLREMANKAELRALQSKINPHFLFNALNAISSSIRLNPDTARQLIFNLSRYLRYNIE 407 ++ MA +A+L AL+++INPHF+FNALN I + I +P AR+++ +LS +RY++ Sbjct: 150 DQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLR 209 Query: 408 LKDDEQIDIKRELYQIKDYIAIEQARFGDKLTVIYDIDDDV-SCVIPSLLIQPLVENAIV 466 + Q+ + EL + Y+ + +F D+L I+ + +P +L+Q LVEN I Sbjct: 210 YSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIK 269 Query: 467 HGIQPCKGKGVVTIGINECGNRVRISVRDTGNGIDPAVVARVEADEMPGNKIGLLNVHHR 526 HGI G + + + V + V +TG+ + GL NV R Sbjct: 270 HGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK--------NTKESTGTGLQNVRER 321 Query: 527 VKLLYGE--GLHIRNLTPGTEIAFYVP 551 +++LYG + + +P Sbjct: 322 LQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 54.8 bits (132), Expect = 6e-11 Identities = 23/142 (16%), Positives = 58/142 (40%), Gaps = 9/142 (6%) Query: 2 KVIIVEDEFLAQQELSWLINTHSQMEIVGSFDDGLDVLKFLQHNKVDAIFLDINIPSLDG 61 +++ +D+ + L+ ++ V + + +++ D + D+ +P + Sbjct: 5 TILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 V-LLAQNISQFAHKPFIVFITAWK--EHAVEAFELEAFDYILKPYQESRIINMLQKLTTA 118 LL + P +V ++A A++A E A+DY+ KP+ + +I ++ + A Sbjct: 63 FDLLPRIKKARPDLPVLV-MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---A 118 Query: 119 WQQQNNAASGLASAAPRENDTI 140 + S L + + Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLV 140
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 51.2 bits (122), Expect = 4e-09 Identities = 26/126 (20%), Positives = 40/126 (31%), Gaps = 7/126 (5%) Query: 98 ERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAPQTGWQ-QPQPAQPPVQPQHQPQ 156 E + Q +E + + Q+ + + Q Q + QP +P + Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150 Query: 157 PVV--QQPVAPQPVTPTVAQPQPAAPQQPAPQPVAASQPAVAEPQPVE---PQQPAAPQP 211 P V ++P + T QP QPV S VE PA QP Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNV-EQPVTESTTVNTGNSVVENPENTTPATTQP 1209 Query: 212 KERKET 217 E+ Sbjct: 1210 TVNSES 1215 Score = 45.4 bits (107), Expect = 3e-07 Identities = 22/135 (16%), Positives = 41/135 (30%), Gaps = 1/135 (0%) Query: 78 AHGEHEAPRQAPQHQYQPPYERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAPQT 137 A E E ++ P+ Q +++ + +P+ + P P Q Q Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171 Query: 138 GWQQPQPAQPPVQPQHQPQPVVQQPVAPQPVTPTVAQPQPAAPQQPAPQPVAASQPAVAE 197 + + PV P+ TP QP + P+ + + Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK-NRHRRSVRSV 1230 Query: 198 PQPVEPQQPAAPQPK 212 P VEP ++ Sbjct: 1231 PHNVEPATTSSNDRS 1245 Score = 37.4 bits (86), Expect = 9e-05 Identities = 30/170 (17%), Positives = 44/170 (25%), Gaps = 25/170 (14%) Query: 83 EAPR---QAPQHQYQPPYERQMQQPARPEEPVRQPPQPPRQAPVPP-QQQPA-------- 130 E P+ Q Q Q + +PAR +P +P Q +QPA Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180 Query: 131 -PHAAPQTGWQQPQPAQPPVQPQH---QPQPVVQQPVAPQPVTPTVAQPQPAAPQ----- 181 P T + P QP + P+ + P + Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240 Query: 182 QPAPQPVAASQPAVAEPQPVEPQQPAAPQPKERKETVIVMNVAAHHGAQL 231 VA V A Q + V + H +QL Sbjct: 1241 SNDRSTVALCDLTSTNTNAVLSDARAKAQ----FVALNVGKAVSQHISQL 1286 Score = 33.9 bits (77), Expect = 0.001 Identities = 26/161 (16%), Positives = 41/161 (25%), Gaps = 36/161 (22%) Query: 86 RQAPQHQYQPPYERQMQQP---------ARPEEPVRQPPQPP---------------RQA 121 + P Q P AR +E PP P Sbjct: 
990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049 Query: 122 PVPPQQQPAPHAAPQTGWQQPQPAQPPVQPQHQPQPVVQQPVAPQPVTPTVAQPQP---- 177 V +Q A Q + + A+ V+ Q V Q + T + Sbjct: 1050 TVEKNEQDATETTAQNR-EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108 Query: 178 -------AAPQQPAPQPVAASQPAVAEPQPVEPQQPAAPQP 211 Q P+ + P + + V+PQ A + Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 746 bits (1927), Expect = 0.0 Identities = 275/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%) Query: 1 MISGILASPGIAFGKALLLKEDEIVIDRKKISADKVDQEVERFLSGRAKASAQLEVIKTK 60 I+GI AS G+A KA + E + I++ I V E+E+ + K+ +L IK + Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 61 AGETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAANEVIDGQATALEELDD 120 + G +K IF H+++L+D EL I I+++ M A+ A EV D + E +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 121 EYLKERAADVRDIGKRLLRNILGLAIIDLSAIQDEVILVAADLTPSETAQLNLKKVLGFI 180 EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN + V GF Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 181 TDAGGRTSHTSIMARSLELPAIVGTGSITAQVKNGDYLILDAVNNQVLINPSNEQIEALR 240 TD GGRTSH++IM+RSLE+PA+VGT +T ++++GD +I+D + V++NP+ E+++A Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 241 SLQAQVAEEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFL 300 +A ++K E AKL P+ T DG VE+ ANIGT +DV+G NG EG+GLYRTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 301 FMDRDALPTEEEQFAAYKAVAEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAV 360 +MDRD LPTEEEQF AYK V + + V++RT+DIGGDKEL Y+ PKE NPFLG+RA+ Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 361 RIAMDRKEILRDQVRAILRASAFGKLRIMFPMIISVEEVRALKKEIEIYKQELRDEGKAF 420 R+ +++++I R Q+RA+LRAS +G L++MFPMI ++EE+R K ++ K +L EG Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 421 DESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS 480 +SIE+G+MVE P+ A A AKEVDFFSIGTNDL QYT+A DR N+ +S+LYQP P+ Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 481 VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540 +L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + + Sbjct: 482 
ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 541 NFEDAKVLAEQALAQPTTDELMTLVNKFIEE 571 + E+ K A++AL T +E+ LV K + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 45.9 bits (109), Expect = 7e-08 Identities = 47/179 (26%), Positives = 70/179 (39%), Gaps = 41/179 (22%) Query: 33 LGIDLGTCD----------------VVSMVVDRDGQP---VAVCLDWADVV--------- 64 L IDLGT + VV++ DR G P AV D ++ Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIAA 72 Query: 65 ----RDGIVWDFFGAVTLVRRHLATLEQQLGCRFT-HAATSFPPGTDP---RISINVLES 116 +DG++ DFF +++ + + R + P G R + Sbjct: 73 IRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQG 132 Query: 117 AGLEISHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKQGRVTYSADEATGG 170 AG +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 133 AGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.006 Identities = 21/60 (35%), Positives = 27/60 (45%) Query: 51 LTLAMLMMAAVSPFVARLLARFGGRLVVTSGTLLIAASCAMMAWRPSLAGWYGAWLLTGI 110 L L LM A +P + L RFG R V+ A A+MA P L Y ++ GI Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 32.2 bits (73), Expect = 5e-04 Identities = 13/54 (24%), Positives = 20/54 (37%) Query: 17 LQDAFNQAVAAAGGDKTAWLLDALRSKLNQPESNPQLRLLELVERMEVAAAALA 70 + D N A GD A D+ + +P RL+ +E + V A Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPA 261
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 37.7 bits (87), Expect = 2e-04 Identities = 40/233 (17%), Positives = 91/233 (39%), Gaps = 22/233 (9%) Query: 457 ISLSDSQAKALQQARQELFITKQTGEAKQQAQAW----RDAERQGLKAGTQAFREYYQVK 512 L+ + A+Q + L + K +A+++A+A ++AE++ + + Q+K Sbjct: 113 TELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLK 172 Query: 513 L--QTYKQQEANAEAAKNERNAQSEANSAAKQAGTQAERNARILEDYQQKAALSADSTSD 570 L K+ A +E AK AQ + ++A Q+E E L++ +S Sbjct: 173 LAEAEEKRLAALSEEAKAVEIAQKKLSAA------QSEVVKMDGE----IKTLNSRLSSS 222 Query: 571 LSREQAILAAKQKLINPTPQQVAQVERDAAAAWDKAAALKAQNAVPERKENADYAAQRKA 630 + A + N Q A+ + K + +A + + R + A Sbjct: 223 IHARDAEMKTLAGKRNELAQASAKYKELDELV--KKLSPRANDPLQNRPFFEATRRRVGA 280 Query: 631 LDSLKDQKNANGELIISQEQYNRASEQL-EEQHQVNLAKIRAQQVVSPTQEAQ 682 ++++ ++ S+ + NR + + + Q ++ ++ EA+ Sbjct: 281 GKIREEKQK---QVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAE 330
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 109 bits (274), Expect = 3e-28 Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 68/368 (18%) Query: 23 GIDLGTTNSLVATVRSGQAETLPD----HQGRYLLPSVVNYHASGLTVGYDARLNAAQDP 78 IDLGT N+L+ G P Q R P V VG+DA+ + P Sbjct: 14 SIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSV------AAVGHDAKQMLGRTP 67 Query: 79 ANTISSVKRMMGRSLADIQNRYSHLPYQLQASENGLPMIQTAGGLLNPIRVSADILKALA 138 N I++++ M +AD V+ +L+ Sbjct: 68 GN-IAAIRPMKDGVIADF-------------------------------FVTEKMLQHFI 95 Query: 139 ARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYGLDS 197 + V++ VP +R+ +++A+ AG + L+ EP AAAI GL Sbjct: 96 KQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPV 155 Query: 198 GQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYLREQAGF-- 255 + V D+GGGT +++++ L+ V +GGD FD + +Y+R G Sbjct: 156 SEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLI 210 Query: 256 SDRSDNRLQRELLDAAIAAKIALSDAEAAHVEVGG---WQG-----DITRSQFNDLIAPL 307 + + R++ E+ A E +EV G +G + ++ + + Sbjct: 211 GEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP 263 Query: 308 VKRTLMACRRALKDAGVE-AQEVLE--VVMVGGSTRVPLVRERVGEFFGRTPLTSIDPDK 364 + + A AL+ E A ++ E +V+ GG + + + E G + + DP Sbjct: 264 LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLT 323 Query: 365 VVAIGAAI 372 VA G Sbjct: 324 CVARGGGK 331
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.0 bits (233), Expect = 5e-25 Identities = 67/258 (25%), Positives = 120/258 (46%), Gaps = 11/258 (4%) Query: 5 LAGKVALVTASTAGIGFAIAKGLAESGAEVILNGRSEQSVNAAIARLQNEVPGAKARPAI 64 + GK+A +T + GIG A+A+ LA GA + + + + ++ L+ E A+A PA Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA- 64 Query: 65 ADLSDADG----AAQLLRAVTGVDILVNNPGIYGPQDFYATDDATWDNYWQTNVMSGVRL 120 D+ D+ A++ R + +DILVN G+ P ++ D W+ + N Sbjct: 65 -DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 121 SRGLLPAMVSKGWGRVVFISSESARNIPADMIHYGVTKTAQLSLARGLAKYVAGSGVTVN 180 SR + M+ + G +V + S A M Y +K A + + L +A + N Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 181 SVLPGPTISDGFAEMLKDDVAKTGKSLEELAKAFVMTHRPSSVIQRAASVAEVANMVVYV 240 V PG T +D + D+ E++ K + T + +++ A +++A+ V+++ Sbjct: 184 IVSPGSTETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238 Query: 241 CSPQASATSGAALRVDGG 258 S QA + L VDGG Sbjct: 239 VSGQAGHITMHNLCVDGG 256
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 719 bits (1857), Expect = 0.0 Identities = 323/851 (37%), Positives = 459/851 (53%), Gaps = 46/851 (5%) Query: 20 PADSAERYNAQFVNG-----IDPLAFNQFVASDGDVMPGTYDVNIYINDLLVDSRPVRFS 74 + + +N +F+ D F ++ PGTY V+IY+N+ + +R V F+ Sbjct: 42 LSSAELYFNPRFLADDPQAVADLSRFEN----GQELPPGTYRVDIYLNNGYMATRDVTFN 97 Query: 75 EDSAHGGLAPCLSAAEYIRYGVKIDD-------DHQPCFALSQTIRQAEQQLDIANHRLN 127 + G+ PCL+ A+ G+ C L+ I A QLD+ RLN Sbjct: 98 TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLN 157 Query: 128 IHIPQQYIEHYPRDYVSPMRFDEGINAAFVNYSYS-TDANNGDGGSHQYQYLSLNSGINI 186 + IPQ ++ + R Y+ P +D GINA +NY++S N GG+ Y YL+L SG+NI Sbjct: 158 LTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNI 217 Query: 187 ASWRLRNNAYWNKF-----SGQADKWQSIASWAETNIIPWRSRLVVGQTSTDNSVFDSVQ 241 +WRLR+N W+ SG +KWQ I +W E +IIP RSRL +G T +FD + Sbjct: 218 GAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGIN 277 Query: 242 FRGVQLGTDAEMRPSSQTGFAPVIRGVANSNARVEVRQNNYLIYSENVPAGPFELNDINA 301 FRG QL +D M P SQ GFAPVI G+A A+V ++QN Y IY+ VP GPF +NDI A Sbjct: 278 FRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYA 337 Query: 302 VNRSGDFYVTVIEADGSQTTFTVAYTTLPQLVRAGQWNYQLSAGKYH-DGADGYAPALMQ 360 SGD VT+ EADGS FTV Y+++P L R G Y ++AG+Y A P Q Sbjct: 338 AGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQ 397 Query: 361 SSLSYGLNNTFTLYGGALAAENYRAGAFGVGSNLGEIGALSADYTLAGTTLANGQRKQGG 420 S+L +GL +T+YGG A+ YRA FG+G N+G +GALS D T A +TL + + G Sbjct: 398 STLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQ 457 Query: 421 SVRFLYAKSFLSSKTDFQIAGYRYSTAGYYSLSDAVNERRRWHNGLYENDYWPSDEYESW 480 SVRFLY KS S T+ Q+ GYRYST+GY++ +D R +N ++ + Sbjct: 458 SVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTD 517 Query: 481 QASAPQHYYTSWFYNKKHRFDISARQTLGKNSAFFLNFSQQNYWNSSGSDISLQAGFNST 540 Y + YNK+ + ++ Q LG+ S +L+ S Q YW +S D QAG N+ Sbjct: 518 --------YYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTA 569 Query: 541 
IHNVNYGLYYQNTRSHFTHD-DNSITLRVSIPF-------TLQENRRINTAFTLAHSKSS 592 ++N+ L Y T++ + D + L V+IPF + + R + +++++H + Sbjct: 570 FEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629 Query: 593 GTSGQAGVNGTLLDDDRLSWAVTSAYDD----TSHSTNSASLGYLGQYGNLYTGYAYSKN 648 + AGV GTLL+D+ LS++V + Y S ST A+L Y G YGN GY++S + Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689 Query: 649 HRQASLNLSGGVVAHRGGVTLSQPLGSTFALVEAKDAQGVGIENQTGVRIDPFGYAVVPQ 708 +Q +SGGV+AH GVTL QPL T LV+A A+ +ENQTGVR D GYAV+P Sbjct: 690 IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPY 749 Query: 709 SVPYRVNSVALNPQDFDAFLDVPNAVADTVPTRGAITRVRFDTFRGYSVLIHTTLADGSY 768 + YR N VAL+ +D+ NAVA+ VPTRGAI R F G +L+ T + Sbjct: 750 ATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLM-TLTHNNKP 808 Query: 769 PPLGAELYRASGISNGLVGPGGDVYVSGVDSGEKLQMKWGETHQQSCEITLPELRQEPQQ 828 P GA + S S+G+V G VY+SG+ K+Q+KWGE C Q Sbjct: 809 LPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQ--LPPESQ 866 Query: 829 ATAWRELSLIC 839 +LS C Sbjct: 867 QQLLTQLSAEC 877
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 28.6 bits (63), Expect = 0.037 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 6/74 (8%) Query: 228 PGYYEKTR--PFT-VTYGLVKQGNGSDCGTEPMLATFSTTNTIQESAIILPQPDSGFGIA 284 PGY EK P T V G V NG+ S NT + +I P+PD G A Sbjct: 262 PGYSEKVEVAPGTKVNMGPVTDRNGNPVQVVATFGRDSQGNTTVDVQVI-PRPDLTPGSA 320 Query: 285 ISPNASMHPLIEMN 298 +PNA PL E++ Sbjct: 321 EAPNA--QPLPEVS 332
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 739 bits (1909), Expect = 0.0 Identities = 333/853 (39%), Positives = 480/853 (56%), Gaps = 49/853 (5%) Query: 25 LATVPTMMFCLSPLSRALADDYFDPAALEFADPQQQTSDLHYFAKPGGQQPGTYPVTVVV 84 + + + A+ YF+P L D Q +DL F PGTY V + + Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLA--DDPQAVADLSRFENGQELPPGTYRVDIYL 84 Query: 85 NDQELGQADITFV--DDGGQLRPVLTPGQLAEYGVNVSAFPAFQALHEGETFTRIEKFIP 142 N+ + D+TF D + P LT QLA G+N ++ L + + I Sbjct: 85 NNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP-LTSMIH 143 Query: 143 DASSRFSFANQRLTLSIPQAAMNVQSRGYVDPSRWDDGVPAAFVDYYFSGAQIKNADEGE 202 DA+++ QRL L+IPQA M+ ++RGY+ P WD G+ A ++Y FSG ++N G Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN-RIGG 202 Query: 203 SSRSNYLNLRSGLNLGAWRLRNISSMQYDQ------QRRHWDTQSTWLQRDVRSLKSLLR 256 +S YLNL+SGLN+GAWRLR+ ++ Y+ + W +TWL+RD+ L+S L Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 257 IGDTYTTGDVFDSIQFRGVQLMSDDEMLPDSQRGFAPTIRGVAHSNAKVTVSQHGYVIYE 316 +GD YT GD+FD I FRG QL SDD MLPDSQRGFAP I G+A A+VT+ Q+GY IY Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 317 TFVSPGAFAISDLYPTSQSGDLEVKVTESNGAVRTFTQPYSAVPYMLREGRGKFSLSAGR 376 + V PG F I+D+Y SGDL+V + E++G+ + FT PYS+VP + REG ++S++AG Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 377 YHSGGESVRSPEFLQGTLFYGLTAGFTLYGGTQLARDYQAWALGLGRGFGEFGSLGGDVT 436 Y SG P F Q TL +GL AG+T+YGGTQLA Y+A+ G+G+ G G+L D+T Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442 Query: 437 QAVTRTPSGKRYTGHSLRAQYQKNFVSSGTAFSLASYRYSSSGYYDFAEASALESAQGQV 496 QA + P ++ G S+R Y K+ SGT L YRYS+SGY++FA+ + + Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502 Query: 497 D--------------------NRRRREELSVSQSLGGLGSLAVSAWSQEYWHRQSRDETV 536 + N+R + +L+V+Q LG +L +S Q YW + DE Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQF 562 Query: 537 
HLGFYSAWKGISWGVGYYYTRTSGQQKNDRSWSFNINIPLGGPLSDSA--------VSYN 588 G +A++ I+W + Y T+ + Q+ D+ + N+NIP L + SY+ Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYS 622 Query: 589 TTSDSNGYTSQQMSLYGAVPTRPNLFYSVQQGYGNQGRGSNSS---ASLDYHGGFGNAQI 645 + D NG + +YG + NL YSVQ GY G G++ S A+L+Y GG+GNA I Sbjct: 623 MSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANI 682 Query: 646 GYRHDAASNQLTWGGAGSVVAHPHGVTFGQTVGESFAIVRAPGAAGVAVQNGNNVHTDWR 705 GY H QL +G +G V+AH +GVT GQ + ++ +V+APGA V+N V TDWR Sbjct: 683 GYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR 742 Query: 706 GYAVVPSLTAYRKNVITLDTESMADDTDVDQQGQTVIPGGGAVVMANYQTHIGNRVLFTL 765 GYAV+P T YR+N + LDT ++AD+ D+D V+P GA+V A ++ +G ++L TL Sbjct: 743 GYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL 802 Query: 766 RNAQGPLPFGASARLVKEEESGNPPGGMVADGGQVYLSGVPQEGTLAVSWIVNNQSQSCT 825 + PLPFGA V E S + G+VAD GQVYLSG+P G + V W ++ C Sbjct: 803 THNNKPLPFGAM---VTSESSQSS--GIVADNGQVYLSGMPLAGKVQVKWG-EEENAHCV 856 Query: 826 LHFHLPDNPQQSL 838 ++ LP QQ L Sbjct: 857 ANYQLPPESQQQL 869
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 285 bits (732), Expect = 4e-92 Identities = 121/366 (33%), Positives = 173/366 (47%), Gaps = 52/366 (14%) Query: 268 LTTPQGRYHYRLREPTRRRVAVSAPPAMHLPFTSPREGEKLLRLLNAGIALCIEGETGSG 327 L P+ R + V AM + L RL+ + L I GE+G+G Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY------RVLARLMQTDLTLMITGESGTG 172 Query: 328 KEYVSRTLHQHSRWRSGKFVAINCAAIPESLIESELFGYQPGAFTGASKNGYIGKIREAD 387 KE V+R LH + + R+G FVAIN AAIP LIESELFG++ GAFTGA G+ +A+ Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRS-TGRFEQAE 231 Query: 388 GGVLFLDEIGDMPLALQTRLLRVLQEKEVAPLGASRSVPVNFALICATHRNLTQRVSAGE 447 GG LFLDEIGDMP+ QTRLLRVLQ+ E +G + + ++ AT+++L Q ++ G Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGL 291 Query: 448 FREDLLWRLREYALALPPLREWS----ALETFIATLWHDLGGASRRVTLSNALLVHLSQL 503 FREDL +RL L LPPLR+ + L G +R L + Sbjct: 292 FREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKR--FDQEALELMKAH 349 Query: 504 PWPGNVRQLQSVLKVMLALADEGDTLTPDALPEAYRAAPAPLPRGG-------------- 549 PWPGNVR+L+++++ + AL D +T + + R+ P Sbjct: 350 PWPGNVRELENLVRRLTALY-PQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAV 408 Query: 550 ------------------------LQAHDEQLIVDTLARVNGNVSRAAQILGIARSTLYR 585 L + LI+ L GN +AA +LG+ R+TL + Sbjct: 409 EENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRK 468 Query: 586 RAARAG 591 + G Sbjct: 469 KIRELG 474
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 29.5 bits (66), Expect = 0.035 Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 6/89 (6%) Query: 562 LNHSATHLMHAALRQVLGTHVAQKGSLVNDKALRFDFSHFEAMKPEEIRAVEDLVNAQIR 621 ++H+ M+A +V T KG + LRF + + + + I +E L +R Sbjct: 8 VDHTR---MNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGFMR 64 Query: 622 RNLAIET-NIMDID--AARASGAMALFGE 647 +L ++ I+DI R M+L G Sbjct: 65 NHLNGDSVEIIDISPMGCRTGFYMSLIGT 93
>adhesinb#Adhesin B signature. Length = 310 Score = 237 bits (606), Expect = 2e-79 Identities = 92/308 (29%), Positives = 170/308 (55%), Gaps = 17/308 (5%) Query: 1 MKRSAIVVALALGLMAQGAMAKT----------LNVVSSFSVLGDIAQQVGGEHVHVDTL 50 MK+ +V L L + A + LNVV++ S++ DI + + G+ +++ ++ Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 51 VGPDGDPHTFEPSPKDSALLSKADVVVVNGLGLE----GWLDRLIKASGFKGE--LVVAS 104 V DPH +EP P+D S+AD++ NG+ LE W +L++ + K S Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120 Query: 105 KGVKTHTLDEEGKTVT-DPHAWNSAANGALYAQNILEGLVKADPEDKAALTSSGKRYIDQ 163 +GV L+ + + DPHAW + NG +YAQNI + L + DP +K + K Y+++ Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180 Query: 164 LTSLDGWAKAQFSAIPLAKRKVLTSHDAFGYFGRAYHVTFLAPQGLSSESEASAAQVAAL 223 L++LD AK +F+ IP K+ ++TS F YF +AY+V +++E E + Q+ L Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240 Query: 224 IKQIKADGVHTWFMENQLDPRLVKQIASATGAQPGGELYPEALSKPGGVADSYVKMMRHN 283 +++++ V + F+E+ +D R +K ++ T +++ +++++ G DSY MM++N Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300 Query: 284 VELIAKSM 291 +E IA+ + Sbjct: 301 LEKIAEGL 308
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 28.2 bits (63), Expect = 0.040 Identities = 14/74 (18%), Positives = 27/74 (36%), Gaps = 3/74 (4%) Query: 19 ALVVCLALSLSTTMLGVFLLLRRMSLMGDALSHAILP-GVAVGYLLSGMSLLAMTLGG-- 75 + + +ALS L + LM + LP A+ Y++ + L L Sbjct: 31 STALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPL 90 Query: 76 FIAGIVVALVAGWV 89 ++A+ + V Sbjct: 91 LTVAALMAIASHVV 104
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 372 bits (956), Expect = e-126 Identities = 135/390 (34%), Positives = 197/390 (50%), Gaps = 38/390 (9%) Query: 149 IAALVAGALN----------NALLIARLEAQNVLPAQAVNYPLPERQEIIGLSGPMLQLK 198 I A GA + +I R A+ + + ++G S M ++ Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150 Query: 199 KEIDIVAASDLNVLISGETGTGKELVAKAVHQGSPRAANPLVYLNCAALPESVAESELFG 258 + + + +DL ++I+GE+GTGKELVA+A+H R P V +N AA+P + ESELFG Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210 Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318 H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270 Query: 319 VDVRVLAATNRDLRQEVVEGRFRADLYHRLSVFPLSVPPLRERESDVVLLAGYFCEQCRL 378 DVR++AATN+DL+Q + +G FR DLY+RL+V PL +PPLR+R D+ L +F +Q Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329 Query: 379 RMGLARVILAEAARNRLQKWSWPGNVRELEHAIHRAVVLARATQAGDEVVLEPQHFQFAV 438 + GL + A ++ WPGNVRELE+ + R L D + E + Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYP----QDVITREIIENELRS 385 Query: 439 AAPMLPTETAAAAPATGNIN-----------------------LREATDSFQREAISRAL 475 P P E AAA + +I+ + I AL Sbjct: 386 EIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAAL 445 Query: 476 EANQGNWAATARALELDVANLHRLAKRLGL 505 A +GN A L L+ L + + LG+ Sbjct: 446 TATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>adhesinb#Adhesin B signature. Length = 310 Score = 330 bits (848), Expect = e-116 Identities = 86/296 (29%), Positives = 165/296 (55%), Gaps = 7/296 (2%) Query: 9 SLLLASALALLAATPASAQEKFRVITTFTVIADMAQNVAGDAAVVSSITKPGAEIHDYQP 68 + + +A + ++ + K V+ T ++IAD+ +N+AGD + SI G + H+Y+P Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72 Query: 69 TPGDIKRAQGAQLILSNGLNLER----WFARFYQHLQGVPE---VVVSEGIQPMGISAGP 121 P D+K+ A LI NG+NLE WF + ++ + VSEG+ + + Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132 Query: 122 YSGKPNPHAWMSADNALIYVDNIRDALVKYDPPHADTYRRNAEAYKEKIRQTMAPLQARL 181 GK +PHAW++ +N +IY NI L + DP + +TY +N +AY EK+ + + Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192 Query: 182 AQLPADKRWLVTSEGAFSYLARDYGLRELYLWPINADQQGTPQQVRKVIDTMKKERIPTI 241 +P +K+ +VTSEG F Y ++ Y + Y+W IN +++GTP Q++ +++ ++K ++P++ Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252 Query: 242 FSESTISDKPARQVAREAGAHYGGVLYVDSLSAADGPVPTWLDLLRVTTETIVNGI 297 F ES++ D+P + V+++ ++ DS++ ++ +++ E I G+ Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 36.5 bits (84), Expect = 7e-05 Identities = 44/191 (23%), Positives = 75/191 (39%), Gaps = 16/191 (8%) Query: 52 PPAAQKLPDVGYLRQLNAEGILALRPQLVLASAQAQPSLVLHKVQASGVKVVNVPGGESL 111 PP + DVG + N E + ++P ++ SA PS + A G G + L Sbjct: 72 PPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPL 131 Query: 112 SAIDNKVAVIAEALGKTAAGDALRQQLQQQIAAIPTQPV---AKRVLFILSHGGMNTLVA 168 + + +A+ L +A + Q + I ++ + V A+ +L + LV Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191 Query: 169 GQHTAADGAIRAAGLQNAMQG---FDHYRAMSQEGVAA-SQADLVVISADGLKGMGGEAG 224 G ++ + G+ NA QG F A+S + +AA D++ D K M Sbjct: 192 GPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDM----- 246 Query: 225 LWKLPGLAQTP 235 L TP Sbjct: 247 ----DALMATP 253
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.048 Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 4/42 (9%) Query: 15 GKRQIIDNVSVALRGG----EMTALIGPNGAGKSTLLRLLTG 52 GK ++ +V+ + G L G G GKSTL+ L G Sbjct: 577 GKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVG 618
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.2 bits (94), Expect = 1e-05 Identities = 27/119 (22%), Positives = 53/119 (44%), Gaps = 4/119 (3%) Query: 40 VFLALGGVFLDAYDLTTLSYGIDDVVREFQLSPLL---TGLVTSSIMVGTIVGNIIGGWL 96 + + L V LDA + + + ++R+ S + G++ + + + G L Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 97 TDKYGRYSVFMADMFFFVISAIAAGLAPNVWVLIGARFLMGIGVGIDLPVAMSYLAEFS 155 +D++GR V + + + AP +WVL R + GI G VA +Y+A+ + Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 378 bits (972), Expect = e-127 Identities = 141/356 (39%), Positives = 196/356 (55%), Gaps = 24/356 (6%) Query: 4 PESPSTAPALI--DPASKAFQSLLDKLAPTEATVLIVGETGTGKEVVARYLHHHSARRQQ 61 + L+ A + +L +L T+ T++I GE+GTGKE+VAR LH + RR Sbjct: 130 EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNG 189 Query: 62 PFLAVNCGALTESLAEAELFGHEKGAFTGAQQGQPGWFEAAEGGTLLLDEIGELSLPLQV 121 PF+A+N A+ L E+ELFGHEKGAFTGAQ G FE AEGGTL LDEIG++ + Q Sbjct: 190 PFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQT 249 Query: 122 KLLRVLQEREITRVGSRKAIKVNVRVIAATHVDLAQAIRERRFREDLYYRLNIAVVPLPP 181 +LLRVLQ+ E T VG R I+ +VR++AAT+ DL Q+I + FREDLYYRLN+ + LPP Sbjct: 250 RLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPP 309 Query: 182 LRQRRQDIPLLAHHFLSLYARRLGRPTLRLAPESLARLMDYSWPGNIRELENTLHNAVLL 241 LR R +DIP L HF+ + G R E+L + + WPGN+RELEN + L Sbjct: 310 LRDRAEDIPDLVRHFVQQA-EKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTAL 368 Query: 242 SKEEEISPAQLRLATLNDAP-----------------GPASDHELDDFIRHQLALPGEPL 284 ++ I+ + ++ P ++ F ALP L Sbjct: 369 YPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL 428 Query: 285 WQRVTSA----LIRHAMAHCDDNQSQAAALLGISRHTLRTQLANLGLIKSRRRPPA 336 + RV + LI A+ NQ +AA LLG++R+TLR ++ LG+ R A Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRSA 484
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 9/58 (15%), Positives = 17/58 (29%), Gaps = 1/58 (1%) Query: 152 ADFVICFYNPRSRGREGHLARAFTLLAASKSADTPVGVVKSAGRKKQEKWLTTLGEMD 209 D+ + G+ + L S + +G K + + L EM Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFD-IGTGKDSYEQIAGIVAYELSEMT 651
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 558 bits (1439), Expect = 0.0 Identities = 198/395 (50%), Positives = 270/395 (68%), Gaps = 5/395 (1%) Query: 4 KIMAINAGSSSLKFQLLNMPQGALLCQGLIERIGLPEARFTLKTSAQKWQETLPIADHHE 63 KI+ IN GSSSLK+QL+ G +L +GL ERIG+ ++ T + +K + + DH + Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 64 AVTLLLEALTGR--GILSSLQEIDGVGHRVAHGGERFKDAALVCDDTLREIERLAELAPL 121 A+ L+L+AL G++ + EID VGHRV HGGE F + L+ DD L+ I ELAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 122 HNPVNALGIRLFRQLLPAVPAVAVFDTAFHQTLAPEAWLYPLPWRYYAELGIRRYGFHGT 181 HNP N GI+ Q++P VP VAVFDTAFHQT+ A+LYP+P+ YY + IR+YGFHGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 182 SHHYVSSALAEKLGVPLSALRVVSCHLGNGCSVCAIKGGQSVNTSMGFTPQSGVMMGTRS 241 SH YVS AE L P+ +L++++CHLGNG S+ A+K G+S++TSMGFTP G+ MGTRS Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 242 GDIDPSILPWLVEKEGKSAQQLSQLLNNESGLLGVSGVSSDYRDVEQAADA-GNERAALA 300 G IDPSI+ +L+EKE SA+++ +LN +SG+ G+SG+SSD+RD+E AA G++RA LA Sbjct: 242 GSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLA 301 Query: 301 LSLFAERIRATIGSYIMQMGGLDALIFTGGIGENSARARATICRNLHFLGLALDDEKNQR 360 L++FA R++ TIGSY MGG+D ++FT GIGEN R I L FLG LD EKN+ Sbjct: 302 LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKV 361 Query: 361 SA--TFIQADNALVKVAVINTNEELMIARDVMRLA 393 I ++ V V V+ TNEE MIA+D ++ Sbjct: 362 RGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIV 396
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 25.7 bits (56), Expect = 0.035 Identities = 10/23 (43%), Positives = 15/23 (65%) Query: 7 RQRGFSLPETVLAMALMVLTVTA 29 RQRGF+L E +L + LM ++ Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGM 24
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 977 bits (2526), Expect = 0.0 Identities = 666/866 (76%), Positives = 758/866 (87%), Gaps = 6/866 (0%) Query: 11 LGCRTARRLVSPALALWLC------SQPFAARADLYFNPRFLADDPAAVADLSGFEKGQE 64 C R+ + L +Q + A+LYFNPRFLADDP AVADLS FE GQE Sbjct: 13 TQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQE 72 Query: 65 VPPGTYRVDIYLNNGFMTTRDVTFQADAQGHGLSPCLTRGQLASMGVDTGRVPGMATLDS 124 +PPGTYRVDIYLNNG+M TRDVTF G+ PCLTR QLASMG++T V GM L Sbjct: 73 LPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLAD 132 Query: 125 TACVPLTTLISEATTRFDVGQQRLYLTVPQAFMGNQARGYIPPELWDNGITAGLINYNFT 184 ACVPLT++I +AT + DVGQQRL LT+PQAFM N+ARGYIPPELWD GI AGL+NYNF+ Sbjct: 133 DACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFS 192 Query: 185 GNNAHNTTGGSSRYAYLNLQSGLNIGAWRLRDNSTWSYSSGGSTSSNENRWQHVNSWLER 244 GN+ N GG+S YAYLNLQSGLNIGAWRLRDN+TWSY+S S+S ++N+WQH+N+WLER Sbjct: 193 GNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLER 252 Query: 245 DITPLRSRLTLGDSYTNGDVFDGINFRGAQLASDDNMLPDSQKGFAPVIHGIARGTAQVS 304 DI PLRSRLTLGD YT GD+FDGINFRGAQLASDDNMLPDSQ+GFAPVIHGIARGTAQV+ Sbjct: 253 DIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 305 IRQNGYEIYQSTVPPGPFTIDDLYAAGNGGDLQVTIKEADGSRQVFSVPWSTVPVLQREG 364 I+QNGY+IY STVPPGPFTI+D+YAAGN GDLQVTIKEADGS Q+F+VP+S+VP+LQREG Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 365 HTRFALTAGEYRSGNSQQETPDFFQGTAMHGLPAGWTLYGGTQLADRYRAFNLGVGKNMG 424 HTR+++TAGEYRSGN+QQE P FFQ T +HGLPAGWT+YGGTQLADRYRAFN G+GKNMG Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432 Query: 425 YFGALSLDITQANATLADDSEHQGQSVRFLYNKSLDETGTNLQLVGYRYSTRGYYNFADT 484 GALS+D+TQAN+TL DDS+H GQSVRFLYNKSL+E+GTN+QLVGYRYST GY+NFADT Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 485 TYRRMSGYSVETQDGVIQVKPKFTDYYNLAYSKRGKVQLSVTQQLGRTATLYLSGSHQTY 544 TY RM+GY++ETQDGVIQVKPKFTDYYNLAY+KRGK+QL+VTQQLGRT+TLYLSGSHQTY Sbjct: 493 
TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTY 552 Query: 545 WGTDDADEQLQAGLNAAVDDINWSLSYSLTKNAWQQGRDQMLAININIPFSHWLRSDSRS 604 WGT + DEQ QAGLN A +DINW+LSYSLTKNAWQ+GRDQMLA+N+NIPFSHWLRSDS+S Sbjct: 553 WGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKS 612 Query: 605 VWRHASASYSLSHDLNGRMTNLAGLYGTLLEDNNLSYSVQTGYAGGGNGDNGSTGYTALN 664 WRHASASYS+SHDLNGRMTNLAG+YGTLLEDNNLSYSVQTGYAGGG+G++GSTGY LN Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672 Query: 665 YRGGYGNANVGYSRSDGFKQLYYGVSGGVLAHANGITLSQPLNDTVVLVKAPGAGGVKVE 724 YRGGYGNAN+GYS SD KQLYYGVSGGVLAHANG+TL QPLNDTVVLVKAPGA KVE Sbjct: 673 YRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVE 732 Query: 725 NQTGVRTDWRGYAVLPYATEYRENRIALDTNTLADNVDLDDAVVSVVPTHGAIVRANFNA 784 NQTGVRTDWRGYAVLPYATEYRENR+ALDTNTLADNVDLD+AV +VVPT GAIVRA F A Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792 Query: 785 QVGMKILMTLTHRGKPVPFGALATGDSNQSGSIVADNGQVYLSGMPLAGKVRVKWGDGPD 844 +VG+K+LMTLTH KP+PFGA+ T +S+QS IVADNGQVYLSGMPLAGKV+VKWG+ + Sbjct: 793 RVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852 Query: 845 AQCVADYRLPPESQQQALSQLSVACR 870 A CVA+Y+LPPESQQQ L+QLS CR Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.4 bits (63), Expect = 0.002 Identities = 11/55 (20%), Positives = 23/55 (41%) Query: 11 YVNDAQGNQVAEIVFVPTGEHLSIIEHTDVDPSLKGQGVGKQLVAKVVEKMRQEQ 65 ++ + N + I ++IE V + +GVG L+ K +E ++ Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 68.3 bits (167), Expect = 7e-15 Identities = 49/362 (13%), Positives = 106/362 (29%), Gaps = 81/362 (22%) Query: 11 KKWPLLALVLAAILALILVIWQL-----QTSPETNDAYVYADTIDVVPEVSGRIVEMPIR 65 + P L +I I + + + ++ P + + E+ ++ Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 66 DNQRVKKGDLLFRIDPRP---------------------YQAMLDDA------------- 91 + + V+KGD+L ++ YQ + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 92 ------------------KARLTTLDAQIMLTQRTIKAQEYNAQSVAAAVERARALVKQT 133 K + +T Q + + + +V A + R L + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 134 TSTRIRLEPLVPQGFASQEDLDQARTAEKAARAELEATLLQAKQASAAVTGVDAMVAQRA 193 S L+ + ++ + + A EL Q +Q + + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 194 GVL-------------------AQIALAELHLEFTEVRAPFNGVVVALKT-TVGQYASAL 233 + ++A E + + +RAP + V LK T G + Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 234 KPVFTLL-DDDRWYVIANFRETDLNNVRPGVAARITVMT-NHNRT--FNGVVDSVGSGVL 289 + + ++ +DD V A + D+ + G A I V + R G V ++ + Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413 Query: 290 PE 291 + Sbjct: 414 ED 415
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 29.3 bits (66), Expect = 0.049 Identities = 17/109 (15%), Positives = 41/109 (37%), Gaps = 13/109 (11%) Query: 394 LASLLALLLIVFVQPWTDSLTGLLAMSLPV---LALAAWIAAGSERIAYAGIQIGFTFA- 449 + L+ + P++ +L+ ++ L L A IA +Q GF + Sbjct: 53 FSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISG 112 Query: 450 ---------LAFLSWFAPLTNLTELRDRVLGILLGVLVSSIVHLYLWPD 489 + + + ++ L + + IL VL+S ++ + + + Sbjct: 113 EAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGN 161
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.5 bits (165), Expect = 4e-14 Identities = 36/156 (23%), Positives = 61/156 (39%), Gaps = 9/156 (5%) Query: 258 QAIRILIAEDLPANRQLLRRQLDTLGYAADEAKDGAEALKLIQQQRYDLLITDLNMPVMD 317 IL+A+D A R +L + L GY + A + I DL++TD+ MP + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 318 GITLTCRVREYDTRMVIWGLTANLVAGEKERCLASGMNLCLFKPLDLSQL----ATALCE 373 L R+++ + + ++A + G L KP DL++L AL E Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 374 INIPQSGSSLDEFLNMKIFTALTLGDKKLMRQMLEQ 409 S D M + +G M+++ Sbjct: 122 PKRRPSKLEDDSQDGMPL-----VGRSAAMQEIYRV 152
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 2e-16 Identities = 23/122 (18%), Positives = 56/122 (45%), Gaps = 4/122 (3%) Query: 1 MSKTANLSAIIIDDHPLARMAIRNLLENEGFNIVAEAGDGGEALMAVAEYQPDVVIVDVD 60 M+ + ++ DD R + L G+++ + +A D+V+ DV Sbjct: 1 MTGA---TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVV 56 Query: 61 IPVMSGIEVVEKLRKKQFSHIIIVVSAKNDLFYGKRSADAGANAFISKKEGINNIISAIH 120 +P + +++ +++K + ++V+SA+N ++++ GA ++ K + +I I Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116 Query: 121 AA 122 A Sbjct: 117 RA 118
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 63.7 bits (155), Expect = 4e-13 Identities = 37/218 (16%), Positives = 74/218 (33%), Gaps = 24/218 (11%) Query: 132 IAYQQALADYQRRSRLQGAAAISRENMQHAKDAVDSSKAALDVAVQAYRGNRVLIQNTAL 191 IA L + + + ++ + + S+K + Q + +N L Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF-------KNEIL 301 Query: 192 EKQPEVLMAAESMRE----AWVALQRTKVRSPVTGYLAQRNVQ-VGETIGSGQALMSIIP 246 +K + + Q + +R+PV+ + Q V G + + + LM I+P Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361 Query: 247 VEQV-WINANFKETQLSGVKIGQKVSI-VTDF-YGSDVVFNGRVDGINMGTGSAFSVLPA 303 + + A + + + +GQ I V F Y G+V I + ++ Sbjct: 362 EDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI-----NLDAIE-- 414 Query: 304 QNATGNWIKVVQRLPVRITLDAEQIKAYPLRIGLSATV 341 G V+ + K PL G++ T Sbjct: 415 DQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAVTA 450
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 130 bits (328), Expect = 4e-35 Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 19/411 (4%) Query: 18 VTLALSMATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAISIPVTGRLAQ 77 + + L + +F +L+ + NV++P I+ WV T+F + +I V G+L+ Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 78 RFGERKLFLTSVTLFALASLCCGLS-TNLDTLIGFRVVQGLVAGPLIPLSQSLLLRNYPP 136 + G ++L L + + S+ + + LI R +QG A L ++ R P Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 137 EKRNIALALWSMTVIIAPIFGPIIGGYICDNYDWGWIFLINVPLGVIVVVLTSWLLKGRE 196 E R A L V + GP IGG I W ++ LI + + V L L K Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194 Query: 197 TPTEPVKINLIALSLLVLGVGSLQIMLDKGKDLDWFNSTTIIVLAIIAVIAIILLVIWEA 256 ++ + L+ +G+ + ML F ++ I I++V++ ++ V Sbjct: 195 IKGH---FDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241 Query: 257 TDNNPIIDLSLFRSRNFTIGILCIACAYLIYAGAIVLMPQLLQTVFEYTSVSAGLAYAPI 316 +P +D L ++ F IG+LC + AG + ++P +++ V + ++ G Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 317 GIMPLLL-APLIGRYGHKIDMRMLVTFSFIVYALCYYWRSVTFSSAINF-TWVIIPQFMQ 374 G M +++ + G + ++ ++ + S + F T +I+ Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361 Query: 375 GFAVACFFLPLTTISLSGLPPEKFAAATSLSNFFRSLSGSIGTTITMTLWS 425 ++TI S L ++ A SL NF LS G I L S Sbjct: 362 LSFTK---TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 969 bits (2506), Expect = 0.0 Identities = 326/570 (57%), Positives = 418/570 (73%), Gaps = 5/570 (0%) Query: 3 TISRKEYASLFGPTVGDKIRLGETDLYIEIEKDLRGYGDESVYGGGKSLRDGMGSNNTLT 62 +SR YA++FGPTVGDK+RL +T+L+IE+EKD +G+E +GGGK +RDGMG + T Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQV-T 62 Query: 63 RDNGVLDLVITNVTILDAKLGVIKADVGIKDGLIVGIGKSGNPAIMDGVTQNMIVGLSTD 122 R+ G +D VITN ILD G++KAD+G+KDG I IGK+GNP + GV +IVG T+ Sbjct: 63 REGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTE 119 Query: 123 AISGEHLILTAAGIDSHIHLISPQQAYSALSNGVATFFGGGIGPTDGTNGTTVTPGPWNI 182 I+GE I+TA G+DSHIH I PQQ AL +G+ GGG GP GT TT TPGPW+I Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179 Query: 183 RKMLRAVEGLPVNVGLLGKGNAFGRAPLVEQIIAGVAGLKVHEDWGATPNALRHSLRIAD 242 +M+ A + P+N+ GKGNA LVE ++ G LK+HEDWG TP A+ L +AD Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVAD 239 Query: 243 EMDIQVSVHTDSLNEAGYVENTIEAFEGRTIHTFHTEGAGGGHAPDIIKVASQLNVLPSS 302 E D+QV +HTD+LNE+G+VE+TI A +GRTIH +HTEGAGGGHAPDII++ Q NV+PSS Sbjct: 240 EYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSS 299 Query: 303 TNPTLPFGINTQAELFDMIMVCHNLNPNVAADVSFAESRVRPETIAAENVLHDMGVISMF 362 TNPT P+ +NT AE DM+MVCH+L+P + D++FAESR+R ETIAAE++LHD+G S+ Sbjct: 300 TNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSII 359 Query: 363 SSDSQAMGRVGENWLRVVQTAHAMKVARGKLPEDSDGNDNFRVLRYVAKLTINPAIAHGV 422 SSDSQAMGRVGE +R QTA MK RG+L E++ NDNFRV RY+AK TINPAIAHG+ Sbjct: 360 SSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGL 419 Query: 423 SHIIGSVEVGKMADLVLWDPRSFGAKPKMVIKGGMINWALMGDPNASLPTPQPVFYRPMF 482 SH IGS+EVGK ADLVLW+P FG KP MV+ GG I A MGDPNAS+PTPQPV YRPMF Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMF 479 Query: 483 GAMGKTLQDTCATFVSQAALDDGVKEKAGLERQVIAINNCR-SVTKRDLVRNSATPHIEV 541 GA G++ ++ TFVSQA+LD G+ + G+ ++++A+ N R + K ++ NS TPHIEV Sbjct: 480 
GAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539 Query: 542 DPETFAVKVDGEHATCNPVTTAVMNQKYFF 571 DPET+ V+ DGE TC P T M Q+YF Sbjct: 540 DPETYEVRADGELLTCEPATVLPMAQRYFL 569
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.5 bits (113), Expect = 9e-09 Identities = 21/126 (16%), Positives = 49/126 (38%), Gaps = 10/126 (7%) Query: 1 MSA---VIIDDHPFARLALKTVLENQNI-VVTGEAADDFHAIQLVDRLQPDIVIVDVMLI 56 M+ ++ DD R L L V A + + D+V+ DV++ Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAAT--LWRWIAAGDGDLVVTDVVMP 58 Query: 57 GSSGIDVVTKLRQNHYAGSIVMVSGKNQIFYRKCSVDAGANAFISK----KESMDNFVAA 112 + D++ ++++ ++++S +N + + GA ++ K E + A Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 113 IQAVQR 118 + +R Sbjct: 119 LAEPKR 124
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 41.7 bits (98), Expect = 9e-07 Identities = 13/66 (19%), Positives = 25/66 (37%), Gaps = 1/66 (1%) Query: 15 AIKTLLENKGVSVTGEAINGMDALRIVDQLQPNTIIVDVDLPDIDGIGLVETLRKRLYKG 74 + L G V N R + + ++ DV +PD + L+ ++K Sbjct: 18 VLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDL 76 Query: 75 SIIVTS 80 ++V S Sbjct: 77 PVLVMS 82
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 126 bits (318), Expect = 4e-34 Identities = 77/398 (19%), Positives = 173/398 (43%), Gaps = 13/398 (3%) Query: 29 TLMGVFDGTMINIALPSMAQEMQVPASIAVWFANGYLLAAAMTLAIFAALAARLGYRPVF 88 + V + ++N++LP +A + P + W ++L ++ A++ L+ +LG + + Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 89 LAGLTTFTLTSLGCALA-NKPEVLIGMRVLQGIGGAATLSIAPAILRSVFPGRLLGRILG 147 L G+ S+ + + +LI R +QG G AA ++ ++ P G+ G Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 148 FHALLIASSSAIGPVLGGTILHTLSWQWLFAINVLPGTLALLLAVRALPRDAIRMQAPFD 207 ++A +GP +GG I H + W +L I ++ T+ + + L + +R++ FD Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TIITVPFLMKLLKKEVRIKGHFD 200 Query: 208 TVGAILSALLLGSTIMAANSLQNATSQFGSLCWMALAALSGMAFIWQIRRTGHPLLPPSM 267 G IL ++ + ++ S S+ ++ ++ LS + F+ IR+ P + P + Sbjct: 201 IKGIILMSVGIVFFMLFTTS--------YSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 268 FKNERFTLAAFTSMVAFVSQGITFIALPFLFQSEYGYSP-VVSALLFTPWPLGIVLIAPH 326 KN F + + F + +P++ + + S + +++ P + +++ Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 327 AGRWADTISAPAISTLGLVIFVVGLILLATLPASPSMWDICLRSLVCGIGFGCFQSPNNR 386 G D + +G+ V + + L + S + + V G G ++ + Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371 Query: 387 EMLSNVIREHASYASGVLSIMRTFGQCLGAAAVAVLLA 424 + S++ ++ A +L+ + G A V LL+ Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 33.4 bits (76), Expect = 0.002 Identities = 37/158 (23%), Positives = 62/158 (39%), Gaps = 27/158 (17%) Query: 2 SVNVKAATLTNLVKYKTDRASLRSVRDDMKKLQKDFSKTEGTIAKAKMQADKQAYTAQMQ 61 +NV L N + ++ ++ + D +++L+ F ++ + Q Sbjct: 283 GINVPDTGLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQ 342 Query: 62 QQKQVQQQQKQAAKQATVDAKA-------KQIEA------------------RKLAAAQS 96 QQ Q QQQQ QA Q V A A QI KLAA Q Sbjct: 343 QQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLVKLQRHAGIRKAMEKLAAQQE 402 Query: 97 KAAKIQMQQ--QQKQASVAENARLKERKALFDIGRMEG 132 + AK Q + +Q+Q + ++ K ++ FD+ + G Sbjct: 403 EDAKNQGKGDCKQQQGASEKSKEGKVKETEFDLSMVVG 440
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.8 bits (93), Expect = 6e-06 Identities = 18/82 (21%), Positives = 33/82 (40%), Gaps = 12/82 (14%) Query: 146 AAGAGKVVYVGNQLRGYGNLIMIKHSEDYITAYAHNDKLMVNNGQSVKAGQQIATMGSTD 205 A GK+ + G IK E+ I +++V G+SV+ G + + + Sbjct: 84 ATANGKLTHSGRSK-------EIKPIENSIV-----KEIIVKEGESVRKGDVLLKLTALG 131 Query: 206 ADSVRLHFQIRYRATAIDPLRY 227 A++ L Q ++ RY Sbjct: 132 AEADTLKTQSSLLQARLEQTRY 153
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 30.6 bits (69), Expect = 0.005 Identities = 11/38 (28%), Positives = 16/38 (42%) Query: 1 MKKGLLMFTLLAASLSGAAHADSAAIKQSLAKLGVQST 38 MKK + L +++ A A S KL V +T Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVAT 38
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 108 bits (272), Expect = 1e-30 Identities = 77/262 (29%), Positives = 118/262 (45%), Gaps = 9/262 (3%) Query: 1 MNAQ-IEGRVAVVTGGSSGIGFETLRLLLGEGAKVAFCGRNPDRLASAHAALQNE--YPE 57 MNA+ IEG++A +TG + GIG R L +GA +A NP++L ++L+ E + E Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 58 GEVFSWRCDVLNEAEVEAFAAAVAARFGGVDMLINNAGQGYVAHFADTPREAWLHEAELK 117 ++ DV + A ++ A + G +D+L+N AG E W + Sbjct: 61 ----AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116 Query: 118 LFGVINPVKAFQSLLEASDIASITCVNSLLALQPEEHMIATSAARAALLNMTLTLSKELV 177 GV N ++ + SI V S A P M A ++++AA + T L EL Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 178 DKGIRVNSILLGMVESGQWQRRFESRSDKSQSWQQWTADIARKRGIPMARLGKPQEPAQA 237 + IR N + G E+ + + Q + K GIP+ +L KP + A A Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF--KTGIPLKKLAKPSDIADA 234 Query: 238 LLFLASPLASFTTGAALDVSGG 259 +LFL S A T L V GG Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 725 bits (1874), Expect = 0.0 Identities = 296/879 (33%), Positives = 465/879 (52%), Gaps = 50/879 (5%) Query: 14 KLFRYSPVAGFLLVCI------NPAWAGDYFDPGFLGNSGDNTAVDLSAFSEAGGVQPGK 67 + R + L V + A YF+P FL DLS F + PG Sbjct: 19 RKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFL-ADDPQAVADLSRFENGQELPPGT 77 Query: 68 YTVWVFVNQRNAGQYTLDFQKNTQGKIA-PVLTPSELETFGVNVRQLPDLKDLPATAEID 126 Y V +++N + F + P LT ++L + G+N + + L A + Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137 Query: 127 NIGALIPQATTMLDLARLRLDISVPQAAMQPEVRGAVDPSQWEEGISALMANYSLSAGRT 186 + ++I AT LD+ + RL++++PQA M RG + P W+ GI+A + NY+ S Sbjct: 138 -LTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV 196 Query: 187 TNSGQNQTSHNNNLFATVRAGANTGPWRLRSTMTHTRVENNGGNNALTTTQTRFSNTYLA 246 N + + + +++G N G WRLR T + N+ +++ + + + NT+L Sbjct: 197 QNRIGGNS---HYAYLNLQSGLNIGAWRLRDNTTWSY--NSSDSSSGSKNKWQHINTWLE 251 Query: 247 RDIRGWRSNLLMGESSTGSDVFDGIPFRGVKLSSNEQMLPSQLRGYAPAISGVANSNARV 306 RDI RS L +G+ T D+FDGI FRG +L+S++ MLP RG+AP I G+A A+V Sbjct: 252 RDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQV 311 Query: 307 TVRQNGNVVYETYVAPGPFYINDIQQAGLSGDYDVKVTEADGTERQFIVPYSSLPVMLRP 366 T++QNG +Y + V PGPF INDI AG SGD V + EADG+ + F VPYSS+P++ R Sbjct: 312 TIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQRE 371 Query: 367 GGWKYELTAGQY--DGNLTDGSRRADFMLGTVVYGLPGDVTLFGGILAAKDYQAFNIGTG 424 G +Y +TAG+Y + R F T+++GLP T++GG A Y+AFN G G Sbjct: 372 GHTRYSITAGEYRSGNAQQEKPR---FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIG 428 Query: 425 VSLGYVGALSADITNSSAKFDNESTLIGQSYRVRYSKSLLSTGTSVDLTALRYSTEDYYS 484 ++G +GALS D+T +++ ++S GQS R Y+KSL +GT++ L RYST Y++ Sbjct: 429 KNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488 Query: 485 FSEFNSQGHQLQEGVSPWSLQ--------------RRRNSFQTQLSQQLGDWGTMYFRAS 530 F++ + + +R Q ++QQLG T+Y S Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548 Query: 531 RDDYWGGERTLTGMSLGYSNSLKGVSYGVNYNIDRTKDANGNWPENRQISFNVSVPFSIF 
590 YWG G + + + +++ ++Y++ + G ++ ++ NV++PFS + Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR---DQMLALNVNIPFSHW 605 Query: 591 GYSRN---LQSMYATTTLTHDNTGRTLSQTGLSGNTL-DGKLSYSASQSW---GNQGQIS 643 S + + A+ +++HD GR + G+ G L D LSYS + G+ S Sbjct: 606 LRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGS 665 Query: 644 NTNLNTGYQGSKGSISGGYSYSSDMQAINMSASGGVMVHSGGITLSRAMGDSVALVSAPG 703 Y+G G+ + GYS+S D++ + SGGV+ H+ G+TL + + D+V LV APG Sbjct: 666 TGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPG 725 Query: 704 AAGVSVNGGTAV-TDWRGYAVVPYLTDYTRNSVGVDPSTLPENVDLTQTNLNVYPTKGAV 762 A V T V TDWRGYAV+PY T+Y N V +D +TL +NVDL NV PT+GA+ Sbjct: 726 AKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAI 785 Query: 763 VKANFATRGGYQVLMTLKLDNGVVPFGAVATLLNAGMAEVNSSIVGDDGQVYLTGLPERG 822 V+A F R G ++LMTL +N +PFGA+ T ++ +S IV D+GQVYL+G+P G Sbjct: 786 VRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQ----SSGIVADNGQVYLSGMPLAG 841 Query: 823 ELLVKWGETAARQCRVSFDISGLSTSPDKPVRQVTYTCQ 861 ++ VKWGE C ++ + S + + Q++ C+ Sbjct: 842 KVQVKWGEEENAHCVANYQLPP--ESQQQLLTQLSAECR 878
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 29.0 bits (65), Expect = 0.029 Identities = 10/37 (27%), Positives = 20/37 (54%) Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383 G FD + GH+ + +L D++ VAV + + + + Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 27.3 bits (60), Expect = 0.016 Identities = 24/66 (36%), Positives = 31/66 (46%), Gaps = 11/66 (16%) Query: 12 ITTIGVYDWEQTIEQK----LVFDI-EIAWDNRKAAASDDVSDCLSYADISERVIAHVEG 66 I IG+ D++ E K L F+I E A+ A A LS D S+RV+A G Sbjct: 148 IKIIGI-DFDIETEYKWFYSLQFNIKESAFTTGYAIA-----SWLSEQDESKRVVASFGG 201 Query: 67 GKFALV 72 G F V Sbjct: 202 GAFPGV 207
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 1082 bits (2800), Expect = 0.0 Identities = 411/566 (72%), Positives = 473/566 (83%), Gaps = 2/566 (0%) Query: 4 ISRQAYADMFGPTVGDKVRLADTELWIEVEDDLTTYGEEVKFGGGKVIRDGMGQGQML-A 62 +SR AYA+MFGPTVGDKVRLADTEL+IEVE D TT+GEEVKFGGGKVIRDGMGQ Q+ Sbjct: 5 MSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTRE 64 Query: 63 ADCVDLVLTNALIVDHWGIVKADIGVKDGRIFAIGKAGNPDIQPNVTIPIGAATEVIAAE 122 VD V+TNALI+DHWGIVKADIG+KDGRI AIGKAGNPD+QP VTI +G TEVIA E Sbjct: 65 GGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124 Query: 123 GKIVTAGGIDTHIHWICPQQAEEALVSGVTTMVGGGTGPAAGTHATTCTPGPWYISRMLQ 182 GKIVTAGG+D+HIH+ICPQQ EEAL+SG+T M+GGGTGPA GT ATTCTPGPW+I+RM++ Sbjct: 125 GKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIE 184 Query: 183 AADSLPVNIGLLGKGNVSQPDALREQVAAGVIGLKIHEDWGATPAAIDCALTVADEMDVQ 242 AAD+ P+N+ GKGN S P AL E V G LK+HEDWG TPAAIDC L+VADE DVQ Sbjct: 185 AADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDVQ 244 Query: 243 VALHSDTLNESGFVEDPLAAIGGRTIHTFHTEGAGGGHAPDIITACAHPNILPSSTNPTL 302 V +H+DTLNESGFVED +AAI GRTIH +HTEGAGGGHAPDII C PN++PSSTNPT Sbjct: 245 VMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPTR 304 Query: 303 PYTLNTIDEHLDMLMVCHHLDPDIAEDVAFAESRIRRETIAAEDVLHDLGAFSLTSSDSQ 362 PYT+NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAED+LHD+GAFS+ SSDSQ Sbjct: 305 PYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQ 364 Query: 363 AMGRVGEVILRTWQVAHRMKVQRGALAEETGDNDNFRVKRYIAKYTINPALTHGIAHEVG 422 AMGRVGEV +RTWQ A +MK QRG L EETGDNDNFRVKRYIAKYTINPA+ HG++HE+G Sbjct: 365 AMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIG 424 Query: 423 SIEVGKLADLVVWSPAFFGVKPATVIKGGMIAIAPMGDINASIPTPQPVHYRPMFGALGS 482 S+EVGK ADLV+W+PAFFGVKP V+ GG IA APMGD NASIPTPQPVHYRPMFGA G Sbjct: 425 SLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYGR 484 Query: 483 ARHHCRLTFLSQAAAANGVAERLNLRSAIAVVKGCR-TVQKADMVHNSLQPNITVDAQTY 541 +R + +TF+SQA+ G+A RL + + V+ R + KA M+HNSL P+I VD +TY Sbjct: 485 
SRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPETY 544 Query: 542 EVRVDGELITSEPADVLPMAQRYFLF 567 EVR DGEL+T EPA VLPMAQRYFLF Sbjct: 545 EVRADGELLTCEPATVLPMAQRYFLF 570
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 28.3 bits (63), Expect = 0.019 Identities = 10/60 (16%), Positives = 24/60 (40%), Gaps = 6/60 (10%) Query: 31 KEIGDAD----HGLNMHRGFSKVVEKLP--SIADKDIGFILKNTGMTLLSNVGGASGPLF 84 K+ +AD +G+N+ G + KL + ++ + + G+ ++ G Sbjct: 77 KKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKE 136
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 151 bits (382), Expect = 4e-42 Identities = 65/221 (29%), Positives = 112/221 (50%), Gaps = 5/221 (2%) Query: 258 VEGAALRYPLALIQP----LRPAAADAAREQQRLRQAIDQTLADLIALTELAENKFHADI 313 G A+ ++P + + D + E ++L A++++ +L A+ + E AD Sbjct: 11 SSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADK 70 Query: 314 AAIFAGHHTLLDDDDLFDAANDRLLTEQCTAEWAWHQVLMELSQQYRQLDDPYLQARYID 373 A IFA H +LDD +L D ++ EQ AE+A +V + +D+ Y++ R D Sbjct: 71 AEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAAD 130 Query: 374 IEDILQRTLRHLQGVQE-RVPTPGEPTIIIADNIYPSTVLQLDASFVKGLCLRDGSEQAH 432 I D+ +R L HL GV+ + T E T+IIA+++ PS QL+ FVKG G +H Sbjct: 131 IRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSH 190 Query: 433 GAIIARAAGIAWLSQQGEALNSVQPGETIVLDMRHQRLIRD 473 AI++R+ I + E +Q G+ +++D +I + Sbjct: 191 SAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVN 231
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 50.8 bits (121), Expect = 1e-08 Identities = 42/204 (20%), Positives = 69/204 (33%), Gaps = 21/204 (10%) Query: 18 KEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETV-VESEKA 76 QA EE V E A P P+ P E AE+ + ++TV + A Sbjct: 1002 NIQADVPSVPSNNEEIARVDE---APVPPPAPATPSETTETVAENSKQESKTVEKNEQDA 1058 Query: 77 HLAEPASAQ--EEEWVETPALTEETPVV----EPEPAVSEPPEQPAVVEP------LAEE 124 + + +E A T+ V E + + ++ A VE E+ Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118 Query: 125 VIAEPVVAEAVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQ-----ALAAEAAEEAAV 179 P V V+ + + VQPQ A E D ++ +E ++Q A E ++ Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178 Query: 180 VVPAPEDEAPLEALAQEQEKPTKE 203 V + + E P Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENT 1202 Score = 47.0 bits (111), Expect = 2e-07 Identities = 28/163 (17%), Positives = 46/163 (28%), Gaps = 9/163 (5%) Query: 17 QKEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEA-----FAEDVVEVTETVV 71 + +ET+T + E EE VET PK + +E V E Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147 Query: 72 ESEKAHLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPPE----QPAVVEPLAEEVIA 127 E++ + +Q +T +ET +P Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207 Query: 128 EPVVAEAVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQALA 170 +P V + +P + E A S + AL Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250 Score = 43.9 bits (103), Expect = 2e-06 Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 14/188 (7%) Query: 17 QKEQAQETET-EQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEK 75 K++++ E EQ E A E+ + + + +V + E++ Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN------EVAQSGSETKETQT 1097 Query: 76 AHLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPP--EQPAVVEPLAEEVIAEPVVAE 133 E A+ ++EE + TE+T P+ P EQ V+P AE A Sbjct: 1098 TETKETATVEKEE--KAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEP--ARENDPT 1152 Query: 134 
AVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEAL 193 ++P + +E + ++ + Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212 Query: 194 AQEQEKPT 201 ++ KP Sbjct: 1213 SESSNKPK 1220 Score = 40.0 bits (93), Expect = 3e-05 Identities = 28/180 (15%), Positives = 51/180 (28%), Gaps = 23/180 (12%) Query: 17 QKEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEKA 76 Q + +ET T +K E+ + E+ V +PK + +E V E E++ Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK---QEQSETVQPQAEPARENDPT 1152 Query: 77 HLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPPEQPAVVEPLAEEVIAEPVVAEAVA 136 + +Q +T +ET +P +V Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN----------------TGNSVV 1196 Query: 137 EQPVEGI--VVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEALA 194 E P QP P + +++ E A A + + Sbjct: 1197 ENPENTTPATTQPTVNSESSN-KPKNRHRRSVRSVPHN-VEPATTSSNDRSTVALCDLTS 1254
>PF01206#SirA family protein Length = 76 Score = 103 bits (259), Expect = 3e-33 Identities = 27/71 (38%), Positives = 43/71 (60%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRTMPVGETLLIIADDPATTRDIPGFCRFMEHELVAQET 68 D +LDA GL CP P++ +KT+ TM GE L ++A DP + +D F + HEL+ Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 EALPYRYLIRK 79 E Y + +++ Sbjct: 65 EDGTYHFRLKR 75
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.1 bits (117), Expect = 2e-08 Identities = 75/365 (20%), Positives = 134/365 (36%), Gaps = 30/365 (8%) Query: 13 LRLNLRIVSVVIFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70 ++ N ++ ++ + IGL + VLPG + D++ G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 71 PHAGRYADLLGPKKIVVFGLGGCFLSGLSYLLAAWGSGWPLISLLLLCLGRVILGI-GQS 129 P G +D G + +++ L + + Y + A L +L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSL---AGAAVDYAIMATAP-----FLWVLYIGRIVAGITGAT 112 Query: 130 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLC--YSHIGLSGLAGVIM 187 A G+ + + R + M G LG L +S A + Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170 Query: 188 AVALVAILCALP-------RAAVKAAKGKAMSFR-AVLGRVWPYGMALA-LASAGFGVIA 238 + + LP R + A SFR A V MA+ + V A Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230 Query: 239 TFITLFYDAK-GWDGAAFALTLFSCAFVGA---RLLFPNAINRLGGLNVAMLCFSVEAIG 294 +F + + WD ++L + + + ++ RLG ML + G Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290 Query: 295 LLLVGFADTPMMAKIGTFLTGAGFSLVFPALGVVAVKAVPQHNQGSALATYTVFMDLSLG 354 +L+ FA MA L A + PAL + + V + QG + L+ Sbjct: 291 YILLAFATRGWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348 Query: 355 VSGPL 359 + GPL Sbjct: 349 IVGPL 353
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 32.2 bits (73), Expect = 2e-04 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 1/32 (3%) Query: 35 RHILFWLGMALLCLGCGMLLW-LSVLQSIPVS 65 R ILF+L M L C M+ W + + + PVS Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPVS 46
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 113 bits (284), Expect = 2e-29 Identities = 74/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%) Query: 317 RVLILGVNGFIGNHLTERLLQDDNYEIYGLDIGSD--------AISRFLDCPRFHFVEGD 368 + L+ G GFIG H+++RLL + +++ G+D +D A L P F F + D Sbjct: 2 KYLVTGAAGFIGFHVSKRLL-EAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 369 ISIHSEWIE--YHIKKCDVVLPLVAIATPIEYT-RNPLRVFELDFEENLKIIRDCVKYN- 424 ++ E + + + V + Y+ NP + + L I+ C Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118 Query: 425 KRIIFPSTSEVYGMCTDKNFDEDSSNLVVGPINKQRWIYSVSKQLLDRVIWAYGDKYGLK 484 + +++ S+S VYG+ F D S V P++ +Y+ +K+ + + Y YGL Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS--VDHPVS----LYAATKKANELMAHTYSHLYGLP 172 Query: 485 FTLFRPFNWMGPRLDNLNAARIGSSRAITQLILNLVEGSPIKLIEGGKQKRCFTDISDGI 544 T R F GP A+ + ++EG I + GK KR FT I D Sbjct: 173 ATGLRFFTVYGPWGR--------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 545 EALFRIIEN---------------KDGRCDGQIINIGNPDNEASIKELAEMLLACFERHP 589 EA+ R+ + ++ NIGN + + + L Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283 Query: 590 LRDRFPPFAGFREVESSDYYGKGYQDVEHRKPSIRNAKRCLNWEPKVEMEETVEHTLDFF 649 ++ P G DV + + + P+ +++ V++ ++++ Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328 Query: 650 L 650 Sbjct: 329 R 329
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.6 bits (69), Expect = 0.011 Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 5/97 (5%) Query: 182 TFIPILANTFARRAVEIPVMHAEREFGDSKYSFMRLINLMYDLVTCLTTTPLRLLSIFGS 241 P L T + + FG +F +N + V + + R L I+ Sbjct: 487 ILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYAL 546 Query: 242 VIALLGFAFGLLLVVLRLAFGPQWAAEGVFMLFAVLF 278 ++A + F L +F P+ +GVF+ L Sbjct: 547 IVAGMVVLFLRLPS----SFLPE-EDQGVFLTMIQLP 578
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 31.2 bits (70), Expect = 0.008 Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 3/70 (4%) Query: 1 MKRQL---SLLAVALLLAQPVLAKDIPLNRAAALANSVTPAASSQAYDDLEQQALAQLRH 57 MK+ L S A ALL P+ A +P+N A + A++ A D++ Sbjct: 1 MKKNLRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIA 60 Query: 58 ALQGNAATLT 67 A+ + Sbjct: 61 AVAKSDTMPA 70
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 85.5 bits (211), Expect = 6e-22 Identities = 67/258 (25%), Positives = 113/258 (43%), Gaps = 16/258 (6%) Query: 4 RIALVTGGSRGLGKNAALKLAAKGTDILLTYHSNRQAALDVVAEIEKKGVKAAALALNVG 63 +IA +TG ++G+G+ A LA++G I N + VV+ ++ + A A +V Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 64 DSTTFDAFASEVAQVLAQKWGRTTFDYLLNNAGIGLNAPFAETSEAQFDELMNIQFKGPF 123 DS D E+ + ++ G D L+N AG+ S+ +++ ++ G F Sbjct: 68 DSAAID----EITARIEREMGP--IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121 Query: 124 FLTQRLLPLLQD--GGRILNVSSGLARFALPGYAAYAAMKGAMEVLTRYQAKELGGRGIS 181 ++ + + D G I+ V S A AAYA+ K A + T+ EL I Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 182 VNIIAPGAIETDFGGG-EVRDNAE--VNRHIAAQTALG----RVGLPDDIGDAIAALLSD 234 NI++PG+ ETD +N V + G ++ P DI DA+ L+S Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 235 ELAWMNAQRVEVSGGMFL 252 + + + V GG L Sbjct: 242 QAGHITMHNLCVDGGATL 259
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 30.1 bits (67), Expect = 0.015 Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 6/40 (15%) Query: 19 EKSKSTLEALNDTAVGQKASQALKTVTGTAAKVQRNPVIA 58 EK+K LE + TA+ LKTVTGT NPV A Sbjct: 273 EKAKQYLEEFHQTALEHPELSELKTVTGT------NPVFA 306
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.006 Identities = 79/365 (21%), Positives = 126/365 (34%), Gaps = 59/365 (16%) Query: 79 IGSALFGHFGDRVGRKVTLVASLLTMGISTVVIGLLPGYESIGIVAPMLLALARFGQGLG 138 IG+A++G D++G K LL GI G + G+ +G LL +ARF QG G Sbjct: 64 IGTAVYGKLSDQLGIK-----RLLLFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGAG 116 Query: 139 LGGEWGGAALLATENAPARKR----ALYGSFPQLGAPIGFFFANGTFLLLSW-------- 186 ++ P R L GS +G +G + W Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 187 -----LLTDQQFMEWGWRV--PF-IFSAVLVIIG-------------LYVRVSLHETPVF 225 + + ++ R+ F I +L+ +G ++ VS+ +F Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236 Query: 226 AKVAAAKKQVKIPLGTLLTKHVRVTVLGTFIMLATYTLFYIMTVYSMTFSTGAAPNGLGL 285 K + G + VL I+ T F M Y M + +G Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG- 295 Query: 286 PRNEVLWMLMMAVIGFGVMVPVAGLLADAFGRRKSMII-ITTMIILFALFAFKPLLGSGN 344 + +++ M+VI FG + G+L D G + I +T + + F +F S Sbjct: 296 --SVIIFPGTMSVIIFG---YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF 350 Query: 345 PLLVFAFLLLGLSLMGL---TFGPMGALLPELFPTEVRYTGASFS-YNVSSILGASVAPY 400 ++ F+L GLS T L E GA S N +S L Sbjct: 351 MTIIIVFVLGGLSFTKTVISTIVSSS-----LKQQEA---GAGMSLLNFTSFLSEGTGIA 402 Query: 401 IAAWL 405 I L Sbjct: 403 IVGGL 407 Score = 29.8 bits (67), Expect = 0.025 Identities = 19/101 (18%), Positives = 38/101 (37%), Gaps = 2/101 (1%) Query: 255 FIMLATYTLFYIMTVYSMTFSTGAAPNGLGLPRNEVLWMLMMAVIGFGVMVPVAGLLADA 314 I L + F ++ + S N P W+ ++ F + V G L+D Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 315 FGRRKSMIIITTMIILFALFAFKPLLGSGNPLLVFAFLLLG 355 G ++ ++ + ++ F + S LL+ A + G Sbjct: 76 LGIKRLLLFGIIINCFGSVIGF--VGHSFFSLLIMARFIQG 114
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 29.2 bits (65), Expect = 0.045 Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 5/44 (11%) Query: 93 QGSQIAGFSASYIWDLIVRFINWSMVGAFFVLLVLWLFISQWLR 136 G ++ + D ++ W VL+V W+ + +R Sbjct: 442 TGGELPFWQQQSFIDQLLAAGRW-----LLVLVVAWILWRKAVR 480
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 51.9 bits (124), Expect = 1e-10 Identities = 38/185 (20%), Positives = 72/185 (38%), Gaps = 15/185 (8%) Query: 1 MAEK-QTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMF 59 MA K + + R+ IL AL L S G + ++A + GV+ A+Y HF K+ +F Sbjct: 1 MARKTKQEAQETRQHILDV-ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59 Query: 60 DSLIEFIEDSLITRIN-LILKDEKDTTARLRLIVLLILGFGERNPGLTRILT-------G 111 + E E ++ K D + LR I++ +L ++ Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 112 HALMFEQDRLQGRIN-QLFERIEAQLRQVMREKKMREGEGYTLDETLLASQLLAFCEGML 170 M + Q + + ++RIE L+ + K + L A + + G++ Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPAD----LMTRRAAIIMRGYISGLM 175 Query: 171 SRFVR 175 ++ Sbjct: 176 ENWLF 180
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 44.5 bits (105), Expect = 2e-07 Identities = 35/180 (19%), Positives = 67/180 (37%), Gaps = 6/180 (3%) Query: 175 IMGSILSTTLILMTALSITRERENGALENLLVSPLSGLEVIIGKITPFVIIGLFQATLIL 234 + S ++ + R E +L + L ++++G++ I Sbjct: 74 VATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIG 133 Query: 235 IAAVLLFDIPLHGSVFLLFFVLLIYVFLCLSIGIGISGLAQNQLQALQMSSFYFIPSLML 294 + A L S+ V+ + S+G+ ++ LA + + + P L L Sbjct: 134 VVAAALGYTQWL-SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFL 192 Query: 295 SGFVSPFISMPDWAKAIGSCLPLTYFIRLVKGIMLKGYSATALLPDLLPLIGLAVIVIGV 354 SG V P +P + LPL++ I L++ IML D+ +G I I + Sbjct: 193 SGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-----VDVCQHVGALCIYIVI 247
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 57.5 bits (139), Expect = 2e-11 Identities = 48/351 (13%), Positives = 121/351 (34%), Gaps = 85/351 (24%) Query: 21 IERILINKGDNVAAGQELVKIESFDA-------QNIFLRAEEKLSAESALLRNLESGERP 73 ++ I++ +G++V G L+K+ + A Q+ L+A + + L R++E + P Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166 Query: 74 E-----------------------------------------------ELDIIRSQIKKA 86 E E + ++I + Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY 226 Query: 87 QSAESQVKRQLGRYRNLYANHAISLAEWEDIRDELTQKGAQVEEL---INQLKARQLPAR 143 ++ K +L + +L AI+ + ++ + ++ + Q+++ L A+ Sbjct: 227 ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286 Query: 144 Q--------------DEISKQRSMVAAAKLERDKALWDVQQTTIVSPVNAKVFDI-IYRA 188 + D++ + + LE K Q + I +PV+ KV + ++ Sbjct: 287 EEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTE 346 Query: 189 GERPSAGKPIISLLPPEN-IKVRFFIPEAKLGKFKIGSKVKLICDG----CAEPIAGIIN 243 G + + ++ ++P ++ ++V + +G +G + + + G + Sbjct: 347 GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406 Query: 244 YISPEA---EFTPPVIYSTKRREKLIFMAEAIPALQQAGRMKIGQPFDVEI 291 I+ +A + V E+ + + + G EI Sbjct: 407 NINLDAIEDQRLGLVFNVIISIEE-----NCLSTGNKNIPLSSGMAVTAEI 452
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 138 bits (350), Expect = 4e-38 Identities = 94/418 (22%), Positives = 180/418 (43%), Gaps = 19/418 (4%) Query: 20 LLLVMLLSALDQTIVSTALPTIVGELDGL-DKLSWVVTAYILSSTIAVPLYGKFGDLFGR 78 L ++ S L++ +++ +LP I + + +WV TA++L+ +I +YGK D G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 79 KIVLQVAIGLFLVGSALCGLAQNMTQLVLM-RGLQGLGGGGLMVISMAAVADVIPPANRG 137 K +L I + GS + + + L++M R +QG G + M VA IP NRG Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 138 RYQGLFGGVFGLATVIGPLIGGFLVQHASWRWIFYINLPLGLFALLVIGAVFHSSNKRSQ 197 + GL G + + +GP IGG + + W ++ +P+ + R + Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIK 196 Query: 198 HQIDWLGAIYLSMALLCIILFTSEGGSVHAWNDPQLWCILAFGIVGIIGFIYEERMAAEP 257 D G I +S+ ++ +LFT+ L ++ + F+ R +P Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHIRKVTDP 246 Query: 258 IIPLALFRNRSFLLCSLIGFVIGMSLFGSVTFLPLYLQVVKEATPTEAGLQLI-PLMGGL 316 + L +N F++ L G +I ++ G V+ +P ++ V + + E G +I P + Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306 Query: 317 LLTSIISGRIISRTGKYRLFPILGTLLGVTGMVLLTRITIHSPLWQLYLFTGVLGAGLGL 376 ++ I G ++ R G +G + + + + + + VLG GL Sbjct: 307 IIFGYIGGILVDRRGP-LYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSF 364 Query: 377 VMQVLVLAVQNAMPAQMYGVATSGVTLFRSIGGSIGVALFGAVFTHVLQSNLQQLLPE 434 V+ V +++ Q G S + + G+A+ G + + + Q+LLP Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS--IPLLDQRLLPM 420
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 72.4 bits (177), Expect = 7e-18 Identities = 33/175 (18%), Positives = 70/175 (40%), Gaps = 10/175 (5%) Query: 12 RPGRPRGKKPGTANREQLMDIALTLFARDGAGRVSLNAIAKEAGVTPAMLHYYFSSRDAL 71 R + ++ R+ ++D+AL LF++ G SL IAK AGVT ++++F + L Sbjct: 3 RKTKQEAQE----TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58 Query: 72 VTQLIEERFMPLRNHISRIFVDHPQDPVL----ALTMMVETLAHMAEKNAWFAPLWM-QE 126 +++ E + P DP+ L ++E+ + ++ E Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118 Query: 127 IIGEMPILRQHMDARFGEERFQVMLGTVRRWQQEGKINPALAPELLFTTVISLVL 181 +GEM +++Q E + + T++ + + L + + Sbjct: 119 FVGEMAVVQQ-AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.5 bits (84), Expect = 1e-05 Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 6/72 (8%) Query: 51 GLIAKRKGNW---LCIEYLWVSETTRGRGLGSELMQEAEQQAQAQGCSHLLVDTFSFQ-- 105 G I R NW IE + V++ R +G+G+ L+ +A + A+ L+++T Sbjct: 78 GRIKIRS-NWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS 136 Query: 106 ALPFYQKLGYQL 117 A FY K + + Sbjct: 137 ACHFYAKHHFII 148
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.5 bits (100), Expect = 2e-06 Identities = 24/133 (18%), Positives = 51/133 (38%), Gaps = 10/133 (7%) Query: 42 PVPVVSQLTGRTTAS-LSAEVRPQVGGIIQKRLFTEGDMVKAGQALYQIDPSSYRATWNE 100 V +V+ G+ T S S E++P I+++ + EG+ V+ G L ++ A + Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 101 AAAALKQAQALVASDCQKAQRYASLVRDNGVSRQDADDAASTCAQDKASV--------ES 152 ++L QA+ Q R L + + D + ++ + + Sbjct: 139 TQSSLLQARLEQTRY-QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Query: 153 KKAALESARINLN 165 + +NL+ Sbjct: 198 WQNQKYQKELNLD 210
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1146 bits (2966), Expect = 0.0 Identities = 583/1031 (56%), Positives = 754/1031 (73%), Gaps = 6/1031 (0%) Query: 3 SRFFVRRPVFAWVIAILIMLAGVLAIRTLPVGQYPDVAPPAVKISATYTGASAETLENSV 62 + FF+RRP+FAWV+AI++M+AG LAI LPV QYP +APPAV +SA Y GA A+T++++V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 63 TQVIEQQLTGLDHLLYFSSTSSSDGSVSITVTFEQGTDPDTAQVQVQNKVQQAESRLPSE 122 TQVIEQ + G+D+L+Y SSTS S GSV+IT+TF+ GTDPD AQVQVQNK+Q A LP E Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 VQQSGVTVEKSQSSFLLILAVYDKTNRATSSDISDWLVSNMQDPLARVEGVGSLQVFGAE 182 VQQ G++VEKS SS+L++ T DISD++ SN++D L+R+ GVG +Q+FGA+ Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181 Query: 183 YAMRVWMDPTKLASYSLMPSDVQSAIEAQNVQVSAGKIGALPSSNAQQLTATVRAQSRLQ 242 YAMR+W+D L Y L P DV + ++ QN Q++AG++G P+ QQL A++ AQ+R + Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 243 TPDQFKAIIVKSQADGSVVRLSDVARVEMGSEDYTATANLNGHPAAGIAVMMAPGANALD 302 P++F + ++ +DGSVVRL DVARVE+G E+Y A +NG PAAG+ + +A GANALD Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 303 TATLVKSKIAEFQRQMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIILVVCVMYLFLQN 362 TA +K+K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 363 FRATLIPAVAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422 RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSVTIISAMMLS 482 ++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFS+TI+SAM LS Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 483 VVVALTLTPALCGALL----SHSKPHTKGFFGAFNRLWRRTEAGYQRRVLGGLRRGAVMM 538 V+VAL LTPALC LL + + GFFG FN + + Y V L + Sbjct: 482 
VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 539 GAYALICGAMALAMWKLPGSFLPVEDQGEIMVQYTLPAGATAVRTAEVRRQVTDWFLTKE 598 YALI M + +LP SFLP EDQG + LPAGAT RT +V QVTD++L E Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 599 KANTDVIFTVDGFSFSGSGQNAGMAFVSLKNWSQRKGDDNTAQAIALRATKELGTIRDAT 658 KAN + +FTV+GFSFSG QNAGMAFVSLK W +R GD+N+A+A+ RA ELG IRD Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 659 LFAMTPPSVDGLGQSNGFTFELMASGGTDRDSLMKLRSQLLAAANQS-SELQSVRANDLP 717 + P++ LG + GF FEL+ G D+L + R+QLL A Q + L SVR N L Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 718 QMPQLQVDIDNNKAVSLGLSLSDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGESDARAV 777 Q ++++D KA +LG+SLSD+ T+S+A GGTYVNDFIDRGRVKK+Y+Q ++ R + Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 778 PSDLGKWFVRGSDNSMTPFSAFATTHWQYGPESLVRYNGSAAFEIQGENAAGFSSGAAMD 837 P D+ K +VR ++ M PFSAF T+HW YG L RYNG + EIQGE A G SSG AM Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 838 KMEKLADSLPAGSTWAWSGISLQEKLASGQAMSLYAISILVVFLCLAALYESWSVPFSVI 897 ME LA LPAG + W+G+S QE+L+ QA +L AIS +VVFLCLAALYESWS+P SV+ Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901 Query: 898 MVIPLGLLGAALAATLRGLSNDVYFQVALLTTIGLSSKNAILIVEFAESAVD-EGYSLSR 956 +V+PLG++G LAATL NDVYF V LLTTIGLS+KNAILIVEFA+ ++ EG + Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961 Query: 957 AAIRAAQTRLRPIVMTSLAFIAGVLPLAIATGAGANSRVAIGTGIIGGTLTATLLAVFFV 1016 A + A + RLRPI+MTSLAFI GVLPLAI+ GAG+ ++ A+G G++GG ++ATLLA+FFV Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021 Query: 1017 PLFFVLVKRLF 1027 P+FFV+++R F Sbjct: 1022 PVFFVVIRRCF 1032
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 67.2 bits (164), Expect = 2e-14 Identities = 78/339 (23%), Positives = 130/339 (38%), Gaps = 25/339 (7%) Query: 27 LPALPEITQQLQATSTQTQLSLTAALIGLGLGQLFFGP----LSDHIGRLKPLALSLLLF 82 +P LP + + L S L L Q P LSD GR L +SL Sbjct: 25 MPVLPGLLRDLVH-SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83 Query: 83 IFSSAMCALTRDINMLIVWRFLQGFAGAGGSVLSRSIARDKYQGTLLTQFFALLMTVNGI 142 A+ A + +L + R + G GA G+V IA D G + F + G Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGF 142 Query: 143 APVLSPVLGGYVITAFDWRILFWTMAAIGGVLLVMSLAILRETRPATAAHASRQRPGQPV 202 V PVLGG + F F+ AA+ G+ + +L E+ R+ Sbjct: 143 GMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNP-- 199 Query: 203 LKNRRFLRFCLIQAFMMA-----GLFSYIGSSSFVMQSE--YGMSAMQFSLLFGLNGI-G 254 L + R+ R + A +MA L + ++ +V+ E + A + GI Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259 Query: 255 LIIAAMFFSRLARRFSAESLLRGGLTLAVSCAAIMLLFA---WLHLPVLALVGL--FFTV 309 + AM +A R L G+ +A I+L FA W+ P++ L+ Sbjct: 260 SLAQAMITGPVAARLGERRALMLGM-IADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318 Query: 310 SLMSGISTVAGAEAMSAVDAAQSG--TASALMGTLMFVF 346 +L + +S E + + + + ++++G L+F Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.7 bits (98), Expect = 4e-06 Identities = 58/348 (16%), Positives = 110/348 (31%), Gaps = 35/348 (10%) Query: 66 AEMGYVFSAFAWLYTLCQIPGGWFLDRVGSRLTYFIAIFGWSVATLLQGFATGLMSLIGL 125 A G + + +A + C G DR G R +++ G +V + A L L Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 126 RAITGIFEAPAFPTNNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQELLSWH 185 R + GI A + ERA GF ++ G+ P+L + S H Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160 Query: 186 WVFIVTGGIGIIWSLIWFKVYQPPRLTKSISKAELDYIRDGGGLVDGDAPVKKEARQPLS 245 F + + L + + P+++EA PL+ Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE-------------------RRPLRREALNPLA 201 Query: 246 KADWKLVFHRKLVGVYLGQFAVTSTLWFFLTWFPNYLTQEKGITALKAGFMTTV-PFLAA 304 W + + F + + + A G L + Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS 260 Query: 305 FFGVLLSGWLADKLVKKGYSLGVARKTPIICGLLISTC--IMGANYTNDPIWIMALMALA 362 +++G +A +L + ++ G++ I+ A T + ++ LA Sbjct: 261 LAQAMITGPVAARL---------GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311 Query: 363 FFGNGFASITWSLVSSLAPMRLIGLTGGVFNFVGGLGGITVPLVIGYL 410 G G ++ +++S G G + L I PL+ + Sbjct: 312 SGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 814 bits (2105), Expect = 0.0 Identities = 476/549 (86%), Positives = 510/549 (92%), Gaps = 2/549 (0%) Query: 1 MDSQRNLLIIALLFVSFMIWQAWEQDKNPQPQ-QQTTQTTTTAAGSAADQGVPASGQGKL 59 MDSQRNLL+IALLFVSFMIWQAWEQDKNPQPQ QQTTQTTTTAAGSAADQGVPASGQGKL Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 Query: 60 ITVKTDVLELTINTNGGDIEQALLLAYPKTLKSTEPFQLLETTPQFVYQAQSGLTGRDGP 119 I+VKTDVL+LTINT GGD+EQALL AYPK L ST+PFQLLET+PQF+YQAQSGLTGRDGP Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 Query: 120 DNPANGPRPLYNVDKEAFVLADGQDELVIPLTYTDKAGNVFTKTFTLKRGGYAVNVGYSV 179 DNPANGPRPLYNV+K+A+VLA+GQ+EL +P+TYTD AGN FTKTF LKRG YAVNV Y+V Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 Query: 180 QNASEKPLEVSTFGQLKQTAALPTSRDTQTGGLSTMHTFRGAAFSTADSKYEKYKFDTIL 239 QNA EKPLE+S+FGQLKQ+ LP DT + + +HTFRGAA+ST D KYEKYKFDTI Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFA-LHTFRGAAYSTPDEKYEKYKFDTIA 239 Query: 240 DNENLNVSTKNGWVAMLQQYFTTAWVPRNNGTNNFYTANLGNGVVAIGYKSQPVLVQPGQ 299 DNENLN+S+K GWVAMLQQYF TAW+P N+GTNNFYTANLGNG+ AIGYKSQPVLVQPGQ Sbjct: 240 DNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQ 299 Query: 300 TDKLQSTLWVGPAIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKFIHSFLGNWGFSII 359 T + STLWVGP IQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLK+IHSF+GNWGFSII Sbjct: 300 TGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSII 359 Query: 360 VITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNP 419 +ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNP Sbjct: 360 IITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNP 419 Query: 420 LGGCFPLIIQMPIFLALYYMLSASVELRHAPFILWIHDLSAQDPYYILPIIMGATMFFIQ 479 LGGCFPL+IQMPIFLALYYML SVELR APF LWIHDLSAQDPYYILPI+MG TMFFIQ Sbjct: 420 LGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQ 479 Query: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVVYYIVSNLVTIIQQQLIYRGLEKRG 539 
KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLV+YYIVSNLVTIIQQQLIYRGLEKRG Sbjct: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRG 539 Query: 540 LHSREKKKS 548 LHSREKKKS Sbjct: 540 LHSREKKKS 548
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.6 bits (87), Expect = 5e-06 Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 3/55 (5%) Query: 69 IVDVAVDPAHQGKGLGRLVMEKLVAWLDANAFDGSYV-TLVADVP--ELYAKFGF 120 I D+AV ++ KG+G ++ K + W N F G + T ++ YAK F Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 58.3 bits (141), Expect = 2e-11 Identities = 67/311 (21%), Positives = 118/311 (37%), Gaps = 14/311 (4%) Query: 5 LLCSFALVLLYPSGIDMYLVGLPRIAQDLGASEAQLHIAFSVYLAGMASAML----FAGR 60 L+ + V L GI + + LP + +DL S + + + LA A G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGA 65 Query: 61 IADRSGRKPVAIVGAAIFVIASLICAQAHTSSHFLIGRFIQGIGAGSCYVVAFAILRDTL 120 ++DR GR+PV +V A + I A A IGR + GI G+ VA A + D Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124 Query: 121 DDRRRAKVLSLLNGITCIIPVLAPVLGHLIMLKYPWQSLFYTMTGMGVMVAVLSVFILRE 180 D RA+ ++ V PVLG L M + + F+ + + + F+L E Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 181 TRPTAPPQAASPQHDAGESLLNRFFLSRLLITTLSVTVILTYVNVSPVLMMEEMGFDRGT 240 + + S + ++ ++V I+ V P + G DR Sbjct: 184 SHKGERRPLRREALNPLASFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 241 YSMAM------ALMAMISMAVSFSTPFALSLFNPRTLMLTSQVLFLAAGVTLSLATRQAV 294 + A + S+A + T + R ++ + + L+ ATR + Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302 Query: 295 TLIGLGMICAG 305 + ++ +G Sbjct: 303 AFPIMVLLASG 313
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.5 bits (69), Expect = 0.013 Identities = 22/114 (19%), Positives = 43/114 (37%), Gaps = 3/114 (2%) Query: 331 LTAVVVGILFLLVIFLSPLAGMVPGYAAAGALIYVGVLMTSSLARVKWSDLTEAVPA--- 387 L+ VV +L PL + A A ++ G L++ + + A Sbjct: 72 LSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI 131 Query: 388 FITAVMMPFSFSITEGIALGFISYCVMKIGTGRLRELSPCVIIVSLLFVLKIVF 441 F ++ F SI + + L + + ++K L +L C I + +I+ Sbjct: 132 FSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILR 185
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 30.3 bits (68), Expect = 0.014 Identities = 14/23 (60%), Positives = 16/23 (69%) Query: 1 MKRHAIYFALALAGAAFTLQAAP 23 MK+ AI A+ALAG A QAAP Sbjct: 1 MKKTAIAIAVALAGFATVAQAAP 23
>TRNSINTIMINR#Translocated intimin receptor (Tir) signature. Length = 549 Score = 28.9 bits (64), Expect = 0.009 Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 3/41 (7%) Query: 84 SQILDEAKAEAEQERTKIV---AQAQAEIDAERKRAREELR 121 QI +AK E R + V AQAQ + + R +EEL+ Sbjct: 319 EQIAQQAKEAGEVARQQAVESNAQAQQRYEDQHARRQEELQ 359
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 134 bits (339), Expect = 7e-37 Identities = 102/408 (25%), Positives = 177/408 (43%), Gaps = 20/408 (4%) Query: 12 LPWIAAMAFFMQALDATILNTALPAIAHSLNRSPLAMQSAIISYTLTVAMLIPVSGWLAD 71 L W+ ++FF L+ +LN +LP IA+ N+ P + ++ LT ++ V G L+D Sbjct: 16 LIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 72 RFGTRRVFIIAVSLFTLGSLACALSSSLTELVIF-RVIQGIGGAMMMPVARLALLRAYPR 130 + G +R+ + + + GS+ + S L+I R IQG G A + + + R P+ Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 131 SELLPVLNFVTMPGLVGPILGPVLGGVFVTWASWHWIFLINIP-IGVIGILYARKYMPNF 189 + +G +GP +GG+ + HW +L+ IP I +I + + K + Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 190 TTPRRRFDIGGFLLFGLSLVLFSSGIELFGEKIVATWQALAVIAVSLLLLVAYVRHARRH 249 + FDI G +L + +V F + L V +S L+ +V+H R+ Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTT------SYSISFLIVSVLSFLI---FVKHIRKV 243 Query: 250 PTPLISLSLFKTHTFSVGIAGNLATRLGTGCVPFLMPLMLQVGFGY-PAIIAGCMIAPTA 308 P + L K F +G+ ++P M++ A I +I P Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 309 IGSIIAKSTVTQVLRWFGYRKTLVGITVF--IGLMIAQFSLQSPEMPLWMLLLPLFVLGM 366 + II ++ G L F + + A F L++ +M ++ +FVLG Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT--SWFMTIIIVFVLGG 361 Query: 367 AMSTQFTAMNTITLADLTDDNASSGNSLLAVTQQLSISLGVAISAAVL 414 T+ T ++TI + L A +G SLL T LS G+AI +L Sbjct: 362 LSFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature. Length = 398 Score = 29.2 bits (65), Expect = 0.023 Identities = 11/31 (35%), Positives = 13/31 (41%) Query: 69 NPWLKWDVQGLEGLNKKNWYLLISNHHSWAD 99 N W K +G K +Y S HSW D Sbjct: 214 NAWSKEYARGFAKTGKSIYYSHASMSHSWDD 244
>SECA#SecA protein signature. Length = 901 Score = 27.9 bits (62), Expect = 0.022 Identities = 12/57 (21%), Positives = 23/57 (40%) Query: 15 KSREELNQEARDRKRQKKHRGHAAGSRANGGDAASAGKKQRQAQDPRVGSKKPIPLG 71 + EE+ + + R+ + + D+A+A Q + +VG P P G Sbjct: 832 RMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCG 888
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 600 bits (1549), Expect = 0.0 Identities = 203/478 (42%), Positives = 297/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIAWIVDDDSSIRWVLERALTGAGLSCTTFESGNEVLDALTTKTPDVLLSDIRMPGM 60 M + DDD++IR VL +AL+ AG + + + D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVDRAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNAPISSPTADIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRSKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTVRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQIAARELGVEAKQLHPETETALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K+ E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLTQDLPSELFETTIPDSPTQMQPDSWATLLGQWADRALRS---- 416 EN R LT + + + + +EL + S + + Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPEMERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L EME L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (289), Expect = 4e-33 Identities = 79/271 (29%), Positives = 126/271 (46%), Gaps = 26/271 (9%) Query: 7 LKDNVIIVTGGASGIGLAIVDELLSQGAHVQMIDIHGGDRHHNGDNYHF-------WPTD 59 ++ + +TG A GIG A+ L SQGAH+ +D + + +P D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 60 ISSATEVQQTIDAIIQRWSRIDGLVNNAGVNFPRLLVDEKAPAGRYELNEAAFEKMVNIN 119 + + + + I + ID LVN AGV P L+ + L++ +E ++N Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLI---------HSLSDEEWEATFSVN 116 Query: 120 QKGVFFMSQAVARQMVKQRAGVIVNVSSESGLEGSEGQSCYAATKAALNSFTRSWSKELG 179 GVF S++V++ M+ +R+G IV V S + YA++KAA FT+ EL Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 180 KYGIRVVGVAPGILEKTGLRTPEYEEALAWTRNITVEQLREGYT---KNAIPIGRAGKLS 236 +Y IR V+PG E + W EQ+ +G K IP+ + K S Sbjct: 177 EYNIRCNIVSPGSTETDMQWS-------LWADENGAEQVIKGSLETFKTGIPLKKLAKPS 229 Query: 237 EVADFVCYLLSARASYITGVTTNIAGGKTRG 267 ++AD V +L+S +A +IT + GG T G Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATLG 260
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.3 bits (63), Expect = 0.045 Identities = 6/21 (28%), Positives = 12/21 (57%) Query: 24 QAQIARELGIYRTTISRLLKR 44 Q + A LG+ R T+ + ++ Sbjct: 452 QIKAADLLGLNRNTLRKKIRE 472
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 25.3 bits (55), Expect = 0.037 Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 5/75 (6%) Query: 2 ALPRITQKEMTEREQRELKTLLDRARIAHGRPLSNAETNSVKKEYIDKLMAQREAEAKKA 61 LP E + + +EL L+ R + ++NA N + ++ AQ++ Sbjct: 290 GLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQQQG---- 345 Query: 62 RQVKKQQAYKTDKEA 76 Q ++QQA T +EA Sbjct: 346 -QGQQQQAQATAQEA 359
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 31.8 bits (72), Expect = 0.007 Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 13/79 (16%) Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHVALRNRSNTPIVVDGKDVMPEVN 121 AK +DL + + S + + D+ ++ + ++N IV DVM ++ Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334 Query: 122 AVLEKM-----KTFSEAII 135 V+ ++ + EAII Sbjct: 335 RVIAQLDIRRPQVLVEAII 353
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 26.8 bits (59), Expect = 0.035 Identities = 11/43 (25%), Positives = 14/43 (32%) Query: 102 GHRYGEHIFHAVETRAKTAGESWLWLEVLAANPAARRFYERQG 144 G + H AK L LE N +A FY + Sbjct: 103 KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 28.0 bits (62), Expect = 0.005 Identities = 13/55 (23%), Positives = 26/55 (47%), Gaps = 9/55 (16%) Query: 36 TRGEIIAVGKGRILENGT--VQPLDVKV-------GDIVIFNDGYGVKTEKIDNE 81 T E+ A+G+ ++L T +++ G++V ND GV+ + +E Sbjct: 244 TLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSE 298
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 114 bits (287), Expect = 6e-30 Identities = 70/430 (16%), Positives = 142/430 (33%), Gaps = 67/430 (15%) Query: 30 AWLVALLSFAFLAILIATTVFCSFTQRIDVQGEVITLPHSVNVYAPQQGFVISQYVKVGD 89 A+ + F +A +++ V G++ S + + V VK G+ Sbjct: 61 AYFIMG--FLVIAFILS--VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116 Query: 90 IVTKGQPLYEIDISRNTTTGNVSAVQIEVINEKIANAEDIISK----------------- 132 V KG L ++ + Q ++ ++ I Sbjct: 117 SVRKGDVLLKLTALG--AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174 Query: 133 -----LNHNKEETTISLEKQLKTINDSLKETNRMLANAQAGLKKMH-------------- 173 T +++Q T + + L +A + Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234 Query: 174 DNLSSYDKYLSDGLITKDQYNYQHSLYFQQQSTYQSLVSQKMQLESQVTQLNSDKITKIA 233 L + L I K Q + Y + + + SQ Q+ES++ + Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294 Query: 234 DFDNQISSQ----ENQINDYKNQLVESNAN-GNIIIKATTEGRIESLTV-TKGQMVDKGS 287 F N+I + + I +L ++ +I+A +++ L V T+G +V Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354 Query: 288 SLAQIKPTGDIEYYLILWLPNNSIPYVKPGDEINIRYAAFPSDKFGQFPGKILSIS--SV 345 +L I P D + + N I ++ G I+ AFP ++G GK+ +I+ ++ Sbjct: 355 TLMVIVPEDD-TLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413 Query: 346 PTSRQEMSEYTNVTNGTNQQELALYKTIVKIENKTFEYNGKTLSLSNGLKAQAVVFLEER 405 R + + I+ IE K + LS+G+ A + R Sbjct: 414 EDQRLGLV----------------FNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457 Query: 406 PLYMWMFTPV 415 + ++ +P+ Sbjct: 458 SVISYLLSPL 467
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 733 bits (1894), Expect = 0.0 Identities = 318/881 (36%), Positives = 478/881 (54%), Gaps = 63/881 (7%) Query: 8 LFKLSTIFFAMLPA-LLSGLNNKAQARDFFDPSFISSLNGSDPSTTPDLSVFQTQNAQAP 66 +L+ F + A + + A +F+P F++ DP DLS F+ P Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLAD----DPQAVADLSRFENGQELPP 75 Query: 67 GDYRVDIMFNGRYLDTRTIKFVANNRASSDNREPALVPCLSLKALAEYGVRIKSFPELA- 125 G YRVDI N Y+ TR + F + +VPCL+ LA G+ S + Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQG------IVPCLTRAQLASMGLNTASVSGMNL 129 Query: 126 EDQNGCANF-SVIPDTKADFDFTAQRLNISIPQAALSTTAQGYIPPDQFDDGINALLVNY 184 + C S+I D A D QRLN++IPQA +S A+GYIPP+ +D GINA L+NY Sbjct: 130 LADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNY 189 Query: 185 QFSGS---NDMQANDEYYSLNLQSGLNVGPWRIRNLSTWNKN-----NGDAGDWDSAYLY 236 FSG+ N + N Y LNLQSGLN+G WR+R+ +TW+ N +G W + Sbjct: 190 NFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249 Query: 237 MQRSIRSINSNLVMGESSSLSTIFDSVPFTGIQLATDTTMLPESMRGYAPIIRGIAKTNA 296 ++R I + S L +G+ + IFD + F G QLA+D MLP+S RG+AP+I GIA+ A Sbjct: 250 LERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTA 309 Query: 297 RVVIKQNGYQVYQTYVAPGAFEITDMYPSGGSGDLYVSVEESDGSKQEFVVPFATLPVMV 356 +V IKQNGY +Y + V PG F I D+Y +G SGDL V+++E+DGS Q F VP++++P++ Sbjct: 310 QVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQ 369 Query: 357 RENQLEYEITSGKYRPYDGGVDETPFTQATATYGVSSSLTLYGGMQAASRYQALSTGLGY 416 RE Y IT+G+YR + ++ F Q+T +G+ + T+YGG Q A RY+A + G+G Sbjct: 370 REGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429 Query: 417 NLGELGAASADVTQAWSKKKDDEKTSGQSWRVRYGKNIVETGTNVTIAGYRYSTRGFNTL 476 N+G LGA S D+TQA S DD + GQS R Y K++ E+GTN+ + GYRYST G+ Sbjct: 430 NMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489 Query: 477 SEVLDSYSNDG------------------NYTSRSLRNRTNLTVNQSLGKGLGSLSISGL 518 ++ S N + + R + LTV Q LG+ +L +SG Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGS 548 Query: 519 
IEDYWDDKRTNKSISVGYNGGFRNVNYYLGYSYNRYTWSGNNSGKDAQDDQRITLTVTLP 578 + YW ++ G N F ++N+ L YS + W DQ + L V +P Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-------DQMLALNVNIP 601 Query: 579 LSNWLPG--------TYTSYQLTNSNPGSTDQSVSIGGVGLDNDSLEWSLQQGYSNREYY 630 S+WL SY +++ G + G L++++L +S+Q GY+ Sbjct: 602 FSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDG 661 Query: 631 SGDMRG----TYNGARGSLNAGYSYDNNSQRIDYGANGSIVAHADGITLGQDITDAAVLV 686 + G Y G G+ N GYS+ ++ +++ YG +G ++AHA+G+TLGQ + D VLV Sbjct: 662 NSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLV 721 Query: 687 KAPGLDNVKLTNDNTISTDYRGYAIVPYVTPYRRTDITLDSTTLGEDMELPETTKSVVPT 746 KAPG + K+ N + TD+RGYA++PY T YR + LD+ TL ++++L +VVPT Sbjct: 722 KAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPT 781 Query: 747 RGAIVRANYDGNIGQRAFVHLKTASGQDVPYGAMVLLAGDSKSQPSIVSDAGMVYMSGLQ 806 RGAIVRA + +G + + L T + + +P+GAMV IV+D G VY+SG+ Sbjct: 782 RGAIVRAEFKARVGIKLLMTL-THNNKPLPFGAMVTSESS--QSSGIVADNGQVYLSGMP 838 Query: 807 ETGILNVQWGKSAAQQCNASFTLPAREGKASGISQIETVCR 847 G + V+WG+ C A++ LP + ++Q+ CR Sbjct: 839 LAGKVQVKWGEEENAHCVANYQLPPESQQQ-LLTQLSAECR 878
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 250 bits (640), Expect = 2e-88 Identities = 71/152 (46%), Positives = 104/152 (68%), Gaps = 1/152 (0%) Query: 25 PPGVTVVSPFDVQRYLGTWYEIARFDHPFESGLEKVTIAWHPRDDGGLDVVNKGYNPDRG 84 P V VS F++ YLG WYE+AR DH FE GL +VT + R+DGG+ V+N+GY+ ++G Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79 Query: 85 MWQKTDGVAYFTGEPSRAALKISFFGPFYGSYNVIALDKE-YRYALVCGPDRDYLWLLAR 143 W++ +G AYF + LK+SFFGPFYGSY V LD+E Y YA V GP+ +YLWLL+R Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139 Query: 144 APTIAPEVRQQMLDIATRQGFDVGKLVWVNQR 175 PT+ + + ++++ +GFD +L++V Q+ Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQQ 171
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 32.7 bits (74), Expect = 0.008 Identities = 27/103 (26%), Positives = 47/103 (45%), Gaps = 2/103 (1%) Query: 709 RVEAVNMDERKIDFTLISSERAPRNVGKTAREKAKKSTSGKPGGRRRQVGKQVNFEPDSA 768 E V + ++ T+ +E+ RE AK++ S + Q E Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095 Query: 769 FRKE-KETARPKKEKKAKKPSAKTQKIAAATKAKRAAKKKIAE 810 E KETA +KE+KAK + KTQ++ T ++ + K++ +E Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVT-SQVSPKQEQSE 1137
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.8 bits (67), Expect = 0.037 Identities = 25/80 (31%), Positives = 31/80 (38%), Gaps = 11/80 (13%) Query: 11 QPTPLNNSNLFLSD--TALREAVVREGAGWDGDLLASIGQQLGTAESLELGRLANSNPPE 68 LN L D L+ + AG G A GQQL A + R NP E Sbjct: 189 DADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL-NASIIAQTRFK--NPEE 245 Query: 69 L----LRYDATGA--RLDDV 82 LR ++ G+ RL DV Sbjct: 246 FGKVTLRVNSDGSVVRLKDV 265
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 36.6 bits (85), Expect = 2e-04 Identities = 14/30 (46%), Positives = 21/30 (70%) Query: 462 WTLNSARHHGMEEMTGSLEPGKRADIAVFD 491 +T+N A HG+ GSLE GKRAD+ +++ Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWN 438
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 40.5 bits (94), Expect = 9e-06 Identities = 85/307 (27%), Positives = 133/307 (43%), Gaps = 36/307 (11%) Query: 120 LQKEFWPAMHKNAQVMGTTYAIPFHNSTPILYYNKTMFDRAGIKQPPQTWAELLADAKKL 179 Q + +P + G A P L YNK + + PP+TW E+ A K+L Sbjct: 111 FQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDL-----LPNPPKTWEEIPALDKEL 165 Query: 180 TDESKGQWGIMLPSTNDDYGGWIFSALVRANGG---KYFNEDYP-GEVYYNSPTAIGALR 235 ++KG+ +M + + Y W L+ A+GG KY N Y +V ++ A L Sbjct: 166 --KAKGKSALMF-NLQEPYFTW---PLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLT 219 Query: 236 FWQDLIYKDKVMPSGVLNSKQISAAFFSGKLGMAMLSTGALGFMRENSKDFELGVAMLPA 295 F DLI K+K M + + AAF G+ M + G + ++ GV +LP Sbjct: 220 FLVDLI-KNKHMNADT-DYSIAEAAFNKGETAMTI--NGPWAWSNIDTSKVNYGVTVLPT 275 Query: 296 -KEQRAVPIGGASLVSFKGINDA--QKKAAYQFL-TYLVSPEVNGAWSRFTGYFSPRKAS 351 K Q + P G V GIN A K+ A +FL YL++ E A ++ + S Sbjct: 276 FKGQPSKPFVG---VLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKS 332 Query: 352 YDTPEMKAYLQQDPRAAIALEQLKYAHPWYSTWETVAVRKAMENQLAAVVNDA--KVTPE 409 Y + L +DPR A +E + + + A A+ AV+N A + T + Sbjct: 333 Y-----EEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVR---TAVINAASGRQTVD 384 Query: 410 AAVQAAQ 416 A++ AQ Sbjct: 385 EALKDAQ 391
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 14/33 (42%), Positives = 18/33 (54%) Query: 30 VVLVGPSGCGKSTLLRLLAGLEPVSEGQIWLHD 62 VVL G G GKSTL+ L GL+ S+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.6 bits (74), Expect = 3e-04 Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 4/58 (6%) Query: 70 EDIAELKRMFSVNKASGTGTALLRYLEGEAKSLGYNEIRLETRKVNTRAVAFYVKHNY 127 EDIA K + G GTALL AK + + LET+ +N A FY KH++ Sbjct: 93 EDIAVAKD----YRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.0 bits (70), Expect = 0.009 Identities = 11/87 (12%), Positives = 28/87 (32%), Gaps = 3/87 (3%) Query: 17 PEFRQEALKLAERIGVAAAARELNLYESQLYNWRSKQQNQFSSSEREQEMSAEIARLKRQ 76 PE + + + R +L + Q W+ Q+ Q + ++ AE + + Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ-NQKYQKELNLDKKR--AERLTVLAR 222 Query: 77 LAERDEELAILPKGRDILREAPEMKYV 103 + + + D + + Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAI 249
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 61.6 bits (149), Expect = 2e-13 Identities = 51/233 (21%), Positives = 85/233 (36%), Gaps = 16/233 (6%) Query: 4 VLITGASSGIGAGLAKSFAADGHLVIACGRDASRLAALQQLSPNINVRL-----FDMTDR 58 ITGA+ GIG +A++ A+ G + A + +L + S R D+ D Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVS-SLKAEARHAEAFPADVRDS 69 Query: 59 DACRQALTGCFA-----DLIILCAGTCEYLDHGQVDAALVERVMATNFLGPVNCLAALQT 113 A + D+++ AG + E + N G N ++ Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129 Query: 114 QLEA--GDRVVLVSSMAHWLPFPRAEAYGASKAALTWFANSLRLDWEPKGVAVTVVSPGF 171 + +V V S +P AY +SKAA F L L+ + +VSPG Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189 Query: 172 VDTPLTRKNDFAMPGRVSVDRAVAA-IRHGLAKGKNHIAFPTGFSLALRLLAS 223 +T + G V + + G+ K +A P+ + A+ L S Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKK--LAKPSDIADAVLFLVS 240
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 233 bits (595), Expect = 1e-81 Identities = 85/151 (56%), Positives = 111/151 (73%), Gaps = 1/151 (0%) Query: 25 PKGVQPISGFDASRYLGKWYEVARLENRFERGLEQVTATYGARSDGGISVVNRGYDPVKK 84 P+ V+P+S F+ + YLGKWYEVARL++ FERGL QVTA Y R+DGGISV+NRGY K Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79 Query: 85 RWNESDGKAYFTGAPTTAALKVSFFGPFYGGYNVIRLD-DDYQYALVSGPNRDYLWILSR 143 W E++GKAYF T LKVSFFGPFYG Y V LD ++Y YA VSGPN +YLW+LSR Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139 Query: 144 TPTIPAAVKQDYLNTARELGFDVDRLVWIRQ 174 TPT+ + ++ ++E GFD +RL++++Q Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQ 170
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 679 bits (1753), Expect = 0.0 Identities = 223/1059 (21%), Positives = 436/1059 (41%), Gaps = 54/1059 (5%) Query: 1 MIEWIIRRSVANRFLVMMAALFLSIWGTWTIIHTPVDALPDLSDVQVIVKTRYPGQAPQI 60 M + IRR + A+ L + G I+ PV P ++ V V YPG Q Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56 Query: 61 VENQVTWPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119 V++ VT + M + + S G + + F+ GTDP A+ +V L Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 120 KLPAGVSAEMGP-DATGVGWVFEYALVDRSGKHDLAELRSLQDWFLKYELKTIPNVSEVA 178 LP V + + + ++ V + ++ +K L + V +V Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 179 SVGGVVKEYQIVVDPMKLTQYGISLGEVKSALDASNQEAGGSSVELA------EAEYMVR 232 G +I +D L +Y ++ +V + L N + + + + Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235 Query: 233 ASGYLQTLDDFKNIVLKTGDNGVPVYLGDVARVQIGPEMRRGIAELNGEGEVAGGVVILR 292 A + ++F + L+ +G V L DVARV++G E IA +NG+ AG + L Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294 Query: 293 SGKNAREVISAVKAKLASLQSSLPEGVEVVTTYDRSQLIDRAIDNLSYKLLEEFIVVALV 352 +G NA + A+KAKLA LQ P+G++V+ YD + + +I + L E ++V LV Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354 Query: 353 CALFLWHVRSALVAIISLPLGLCFAFIMMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412 LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 413 NAHKRLEEWEHQHPGEKLSNDTRWKIITEASVEVGPALFISLLIITLSFIPIFTLEGQGG 472 N + + E + P + ++ ++ AL ++++ FIP+ G G Sbjct: 415 NVERVMME-DKLPP---------KEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 473 KLFGPLAFTKTWSMAGAALLAIVAIPILMGFWIRGRIPAESSNPLNRF----------LI 522 ++ + T +MA + L+A++ P L ++ AE F + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPV-SAEHHENKGGFFGWFNTTFDHSV 523 Query: 523 RIYHPLLLKVLHWPKTTLLIALLSILTVAWPLNRVGGEFLPQINEGDLLYMPSTLPGISA 582 Y + K+L 
LLI L + + R+ FLP+ ++G L M G + Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583 Query: 583 AQAADMLQKTDKLIMT--VPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639 + +L + + V VF G + + + LKP ++ Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640 Query: 640 MTMEKIVEELDKTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTNLADIDAIAGQ 699 + E ++ + + + +++ + I +G + Q Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700 Query: 700 IEVVARSVPG-VTSALAERLVGGRYLNIDIHREKAARYGMTVGDVQLFVSSAIGGAMVGE 758 + +A P + S L +++ +EKA G+++ D+ +S+A+GG V + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 759 TVEGVERYPINIRYPQSYRDSPETLRQLPILTPLKQQIVLADVAEVKVVTGPSMLKTENA 818 ++ + ++ +R PE + +L + + + + + V G L+ N Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820 Query: 819 RPTSWIYIDARDRDMVSVVHDLQQAIGKEVKLKPGISVSYSGQFELLERAIQKLKLMVPM 878 P+ I +A L + + KL GI ++G + + +V + Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAI 878 Query: 879 TLMIIFVLLYLAFRRVGEALLIITSVPFALVGGIWFLYWMGFHLSVATGTGFIALAGVAA 938 + +++F+ L + + ++ VP +VG + V G + G++A Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 939 EFGVVMLMYLRHAIEAEPSLENPQTFSVDKLDEALYRGAVLRVRPKAMTVAVIIAGLLPI 998 + ++++ + + +E E + EA +R+RP MT I G+LP+ Sbjct: 939 KNAILIVEFAKDLMEKEGK----------GVVEATLMAVRMRLRPILMTSLAFILGVLPL 988 Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037 GAGS + + ++GGM++A LL++F +P + + Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 30.8 bits (69), Expect = 0.012 Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 9/166 (5%) Query: 140 RLKNLSEADRQNFFASEEARRAVHILLIANVSQSYFNQRLAAAQLQVANDTLQNYQQSYA 199 A + A + A A L + + +A+++ + A Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263 Query: 200 FVEKQLLTGSTTVLALEQARGMIESTRADIAKRQGQLAQANNALQLLLGSYQHLPDDSAS 259 +EK L + I++ A+ A + + A + Q+L + Q L D + Sbjct: 264 ELEKAL---EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320 Query: 260 SAVDLQGVTLPPSLSSAILLQRPDILEAEHSLQAANANIGAARAAF 305 S + + L ++ I EA S Q+ ++ A+R A Sbjct: 321 SREAKKQL----EAEHQKLEEQNKISEA--SRQSLRRDLDASREAK 360
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.6 bits (204), Expect = 2e-20 Identities = 35/117 (29%), Positives = 61/117 (52%) Query: 2 KILIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTSDYDLLILDIMLPDVNGWD 61 IL+ +D+ L + L+ AG+ V + N + D DL++ D+++PD N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 IVRMLRTAGKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118 ++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 29.4 bits (65), Expect = 0.027 Identities = 14/56 (25%), Positives = 27/56 (48%) Query: 358 RFVGSPCRVTGDPLMLRRAISNLLSNAIRYTPAGQAVTIQLSESAETVRLVVENPG 413 R+V R +P RR++++++ +R P A + +ES+E + E G Sbjct: 199 RYVSQQTRANPNPYTSRRSVASIVGTLVRMAPVIGACMARQAESSEAMAAWSERAG 254
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 66.0 bits (161), Expect = 4e-14 Identities = 35/221 (15%), Positives = 73/221 (33%), Gaps = 30/221 (13%) Query: 1 MMTPEQKFARWVRVSIAAFLGI-FAWFIVADIWIPLTPDSTVMRVVTP------VSSRVS 53 + TP + R V I FL I F ++ + +T +T + + Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLG----QVEIVATANGKLTHSGRSKEIKPIEN 104 Query: 54 GYVSHVYVHNNSQVKKGDLLYELDPTPFINKVEAAQIALEQAKLSNQQLDAQIAAARAN- 112 V + V V+KGD+L +L Q +L QA+L + + N Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164 Query: 113 -------------LRTAQYTARNDKVTLDRYQRLSTMQNVSQSDLDKVRTTWQTSEQSVS 159 + + R + +++ + + +LDK R T ++ Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224 Query: 160 ALNAQIQNLLIQRGERDDKRNVTLQKY--RNALEEAQLNLA 198 + + DD ++ ++ ++A+ E + Sbjct: 225 RYENLSRVE---KSRLDDFSSLLHKQAIAKHAVLEQENKYV 262 Score = 49.1 bits (117), Expect = 1e-08 Identities = 30/204 (14%), Positives = 71/204 (34%), Gaps = 21/204 (10%) Query: 86 EAAQIALEQAKLSNQQLDAQIAAARANLRTAQYTARNDKVTLDRYQRLSTMQNVSQSDLD 145 Q Q +L+ + A+ A + + +R +K LD + L Q +++ + Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL 255 Query: 146 KVRTTWQTSEQSVSALNAQIQNL--------LIQRGERDDKRNVTLQKYRNA-------- 189 + + + + +Q++ + + +N L K R Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315 Query: 190 --LEEAQLNLAWTKVRAETDGMVSNLQLN-PGIYATAATAVLALVNNNTDIVAD--FREK 244 L + + + +RA V L+++ G T A ++ +V + + + K Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNK 375 Query: 245 SLRHTAVNTDAAVVFDALPGQVFP 268 + V +A + +A P + Sbjct: 376 DIGFINVGQNAIIKVEAFPYTRYG 399
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 159 bits (404), Expect = 9e-44 Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 27/301 (8%) Query: 86 MAELLAESDRQPEQADHFSLLTGHDGSLRKPIEQMKTALFYPNCGLPLLITGDSGTGKSY 145 +AE + + + L G ++++ + + L L+ITG+SGTGK Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM---QTDLTLMITGESGTGKEL 175 Query: 146 MAELMHEFAIAQGLLAPDAPFVSFNCAQYASNPELLAANLFGYVKGAFTGAQSDKAGAFE 205 +A +H++ + + PFV+ N A A +L+ + LFG+ KGAFTGAQ+ G FE Sbjct: 176 VARALHDYGKRR-----NGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228 Query: 206 AANGGMLFLDEVHRLDAQGQEKLFTWLDRKEIYRVGETAQGLPISLRLVFATTEDIHS-- 263 A GG LFLDE+ + Q +L L + E VG + +R+V AT +D+ Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGR-TPIRSDVRIVAATNKDLKQSI 287 Query: 264 ---TFLTTFLRRIPIL-VSLPDLQHRSREEKEALTLQFFWQEARTLAAR-LQLTPRLLQV 318 F R+ ++ + LP L R R E ++ F Q+A + L++ Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPL--RDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALEL 345 Query: 319 LTQYVYRGNVGELKNVVKYAVASAWARSPGREMLTVTLHDLPENVMAATPALSEAMGQQE 378 + + + GNV EL+N+V+ A P +T + + + P Sbjct: 346 MKAHPWPGNVRELENLVRRLT----ALYPQD---VITREIIENELRSEIPDSPIEKAAAR 398 Query: 379 P 379 Sbjct: 399 S 399
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 35.2 bits (80), Expect = 2e-04 Identities = 15/38 (39%), Positives = 23/38 (60%) Query: 239 PQYEETLMSIAQKLKQEGRQQGRLEGREEGHLEGLQEG 276 P E+ L + + ++G Q G EGR++GH +G QEG Sbjct: 38 PSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75 Score = 29.0 bits (64), Expect = 0.023 Identities = 12/22 (54%), Positives = 17/22 (77%) Query: 255 EGRQQGRLEGREEGHLEGLQEG 276 EGRQQG +G +EG +GL++G Sbjct: 62 EGRQQGHKQGYQEGLAQGLEQG 83
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 368 bits (947), Expect = e-132 Identities = 169/262 (64%), Positives = 209/262 (79%) Query: 1 MAWRSLPLSDELIWRAPLPTAEHALAESIREKIATLRPHLLDFLRLDEPAPRHALTLAEW 60 MA+RS PL +++IWR L + LA+++R IA R HLL+F+RLDEPAP +A+TLA+W Sbjct: 1 MAYRSAPLYEDVIWRTHLQPQDPTLAQAVRATIAKHREHLLEFIRLDEPAPLNAMTLAQW 60 Query: 61 SQPIALRSLLATWSDHIYRHQPTLPREQKPLLSLWAQWYIGLLVPPLMLALLNEPQGLSL 120 S P L SLLA +SDHIYR+QP + RE KPL+SLWAQWYIGL+VPPLMLALL + + L + Sbjct: 61 SSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDV 120 Query: 121 APEHFHVEFHESGRAACFWIDVHSDADIERLSPQARMDALVTRTLQPVVEALAATGEINS 180 +PEHFH EFHE+GR ACFW+DV D + SPQ RM+ L+++ L PVV+AL ATGEIN Sbjct: 121 SPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQHRMETLISQALVPVVQALEATGEING 180 Query: 181 KLIWSNTGYLINWYLGEMRALLGDERLAALRQHCFFKKQLADGQDNPLWRTVMLREGQLV 240 KLIWSNTGYLINWYL EM+ LLG+ + +LR FF+K L +G+DNPLWRTV+LR+G LV Sbjct: 181 KLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240 Query: 241 RRTCCQRYRLPDVQQCGDCTLR 262 RRTCCQRYRLPDVQQCGDCTL+ Sbjct: 241 RRTCCQRYRLPDVQQCGDCTLK 262
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.8 bits (93), Expect = 2e-05 Identities = 57/343 (16%), Positives = 113/343 (32%), Gaps = 8/343 (2%) Query: 60 VTGFLSDRFGRKPFIYLGILSYLIFFVGILLTKNIYLAYVFGIMAGLANSFLDSGTYPAL 119 V G LSDRFGR+P + + + + + + +++ Y+ I+AG+ + Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121 Query: 120 MESFPHSASRANVLIKAFVSAGQFLLPFIISFLIWANLWFGWSFVIAAALFVLSGIYLLK 179 + +R + A G P + + F AAAL L+ + Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGCF 179 Query: 180 MPFPDSQAAKKKEAPTAQAETAVRPQANK-LDMVIFTLYGYIGMATFYLVSQWL-AQYGQ 237 + P+S +++ + + + +V + + M V L +G+ Sbjct: 180 L-LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 238 FVVGL-PYASAIKLLSIYTVGSLVCVFVTAAFVKEVFSSAIAMIIYTGLSMISLLLVCLF 296 I L + + SL +T + M+ +LL Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 297 PTPMMVTGFAFVIGFAAAGGVLQLGATIMAMSFPNGKGKATGIFYTAGSIASFTIPLITA 356 M + LQ A + +G+ G S+ S PL+ Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQ--AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356 Query: 357 KLSQISIASIMWFDFLIAVIGFVIALYIGYRQLQARAAQKVSR 399 + SI + + ++ +++ L R L + A Q+ R Sbjct: 357 AIYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRADR 399
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 33.6 bits (77), Expect = 8e-04 Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 1/40 (2%) Query: 216 EGDEKAELALSRYEQRLAKSLAHVVNILDP-DVIVLGGGM 254 GD++A+LAL+ + R+ K++ + DVIV G+ Sbjct: 293 NGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGI 332
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.7 bits (85), Expect = 4e-04 Identities = 26/207 (12%), Positives = 54/207 (26%), Gaps = 23/207 (11%) Query: 196 ARHALEKFEAQAAGIVLLTEAQQQALQESLQVLTDEEKALLAQQQSQQQQLQWLTRRDEL 255 A K ++ L + + Q L S+++ E L + Q + + R L Sbjct: 132 AEADTLKTQSSLL-QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190 Query: 256 AQQQQQAATRQQ-QARQALADAAPALAKLE------------LAQPAAQLRPLWERQQEQ 302 ++Q Q+ Q L + L +Q Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250 Query: 303 TAGLTQTRQRISEVNARLLASTALRARIRQGALRAQQQRQAELADLAQWLAAHERFRLWG 362 + + + E L + +I L A+++ Q L Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ---------LVTQLFKNEIL 301 Query: 363 QEIAGWRAQFSQLTRDKQQLTAQSTRL 389 ++ LT + + + Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQAS 328 Score = 36.0 bits (83), Expect = 7e-04 Identities = 26/205 (12%), Positives = 64/205 (31%), Gaps = 23/205 (11%) Query: 307 TQTRQRISEVNARLLASTALRARIRQGALRAQQQRQAELADLAQWLAAHERFRLWGQEIA 366 + + LL + + R + + + + EL + + + + Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTS 189 Query: 367 GWRAQFSQLTRDKQQLTAQSTRLAALRQKLATLPASPLTLSADEVAAAIEQQTQS--RPL 424 + QFS K Q L R + T+ A ++ E + +E+ L Sbjct: 190 LIKEQFSTWQNQKYQKELN---LDKKRAERLTVLA---RINRYENLSRVEKSRLDDFSSL 243 Query: 425 -------RQRLISLHEQHQLLRKRLRQNAESVQQAQAEQVKLNATLTLRREQYKDKNQHY 477 + ++ ++ LR ++Q ++E + L + +K+ Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN----- 298 Query: 478 LDLKALCQREETIKDLESYRDRLEA 502 + L + +T ++ L Sbjct: 299 ---EILDKLRQTTDNIGLLTLELAK 320 Score = 31.3 bits (71), Expect = 0.019 Identities = 40/206 (19%), Positives = 80/206 (38%), Gaps = 22/206 (10%) Query: 675 AQWQAQQTQHDAIQQQIAALRPMLETLPTSDETEVEAESAIPD-------NWREIHEECL 727 A+ +TQ +Q ++ R + L S E E +PD + E+ Sbjct: 132 AEADTLKTQSSLLQARLEQTR--YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTS 189 Query: 728 SLHSQLVAQQQQETQEKARLDQSQAQFTSALAASRFSDREAFLAALLDDETAQRLTQLKQ 787 + Q Q Q+ Q++ LD+ +A+ + LA + + + RL 
Sbjct: 190 LIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE-------KSRLDDFSS 242 Query: 788 TLEQQLQQAAALCEQATRQHEAHLALRPQGVDADVPTLQTQLHALAQRLRDNT-TRQGEI 846 L +Q A+ EQ + EA LR + + +++++ + + + T + EI Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVY--KSQLEQIESEILSAKEEYQLVTQLFKNEI 300 Query: 847 RQQLRQDAESRQQQQALGQQIAEAAQ 872 +LRQ + L ++A+ + Sbjct: 301 LDKLRQ---TTDNIGLLTLELAKNEE 323 Score = 31.0 bits (70), Expect = 0.027 Identities = 36/299 (12%), Positives = 87/299 (29%), Gaps = 62/299 (20%) Query: 418 QTQSRPLRQRLISLHEQHQLLRKRLRQNAESVQQAQAEQVKLNATLTLRREQYKDKNQHY 477 + + S Q +L + R + + S++ + ++KL + ++ + Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188 Query: 478 LDLKALCQREETIKDLESYRDRLEAGKPCPLCGACEHPAIEQYASLTLTDNQRRRDALEK 537 +K +++++ + L L + R + Sbjct: 189 SLIKE---------QFSTWQNQKYQKE------------------LNLDKKRAERLTVLA 221 Query: 538 EVAALKEEGLLILGQVKALTQQLQRDTEAAGRLAEEEQALTKAWQETCASLHIARDIAQE 597 + + + ++ + L + A + E+E +A E + +Q Sbjct: 222 RINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE------LRVYKSQL 275 Query: 598 INDWMQEQERYEQQLYQLSQRLMLQSQLNDQQALE--RQAEQQLAATRQGLESALQALAL 655 E+ E ++ + L +QL + L+ RQ + L + Sbjct: 276 --------EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327 Query: 656 SL---PAEGTEAAWLHARESEFAQWQAQQTQHDAIQQQIAALRPMLETLPTSDETEVEA 711 S+ P QQ + + ++ +P D EV A Sbjct: 328 SVIRAPVSVK----------------VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.7 bits (238), Expect = 5e-25 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGLQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKLLKREAMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPSSHRVMTGDSP 152 E L D + G S Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 31.4 bits (71), Expect = 0.006 Identities = 16/98 (16%), Positives = 30/98 (30%), Gaps = 25/98 (25%) Query: 325 LVYNAVNH----TPPGTEIRVSWQRTPQGALFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380 LV N + H P G +I + + VE+ G Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306 Query: 381 SRQTGGSGLGLAIVKHAVNH---HDSRLEIDSTVGKGT 415 +G GL V+ + ++++++ GK Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.030 Identities = 17/75 (22%), Positives = 29/75 (38%) Query: 357 FLVIASLATFATVWVWIMILLSQIAFRRRLSPEEVKALKFKVPGGVVTTVIGLLFLAFII 416 F A+L + ++ S RR L E + L +T V L+ + FI+ Sbjct: 163 FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIM 222 Query: 417 ALIGYHPDTRISLYV 431 L+G P ++ Sbjct: 223 QLVGQVPAALWVIFG 237
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 36.1 bits (83), Expect = 6e-05 Identities = 35/186 (18%), Positives = 67/186 (36%), Gaps = 21/186 (11%) Query: 7 LDPTNSALIFIDHQPQM--SFGVANIDRQTLKNNTVALAKAGKIFNVPVIYT------SV 58 DP + L+ D Q +F L N L +PV+YT + Sbjct: 26 PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85 Query: 59 ETKSFSGYIW-PELLAVHPDVKPIERTS-------MNSWEDDAF-----VAAVKATGRKK 105 + ++ W P L + + K I + + W AF + ++ GR + Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQ 145 Query: 106 LVISALWTEVCLTFPALMALEAGYEVYVVTDTSGGTSVDAHERSIDRMVQAGAVPVTWQQ 165 L+I+ ++ + A A + + V D S++ H+ +++ A V Sbjct: 146 LIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDS 205 Query: 166 VLLEYQ 171 +L + Q Sbjct: 206 LLDQLQ 211
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.033 Identities = 22/76 (28%), Positives = 31/76 (40%), Gaps = 14/76 (18%) Query: 4 TATLILTHGQIHTLDRANPLAEAVAIADGKIVATGS------HDRIMSFAAEGTQIVDLK 57 T LIL H I D + + DG+I A G + GT+++ + Sbjct: 73 TNALILDHWGIVKAD--------IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124 Query: 58 GHTVIPGLNDSHLHLI 73 G V G DSH+H I Sbjct: 125 GKIVTAGGMDSHIHFI 140
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 342 bits (880), Expect = e-120 Identities = 101/308 (32%), Positives = 172/308 (55%), Gaps = 12/308 (3%) Query: 18 DFMRWDYWAFGISGFLLIVSIAIIGVRGFNWGLDFTGGTVIEITLEKPVDLDQMRDSLQK 77 DF RW + FG + ++I S+ + V G N+G+DF GGT I +D+ R +L+ Sbjct: 15 DFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALEP 74 Query: 78 AGFEEPQVQNFGSSR------DIMVRMPPVHDANGSQELGSKVVTVINE------STSQN 125 + + M+R+ D G++ G++ ++N+ + Sbjct: 75 LELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPA 134 Query: 126 AAVKRIEFVGPSVGADLAQTGALALIAALVCILIYVGFRFEWRLAAGVVIALAHDVVITM 185 + E VGP V +L T +L+AA V I+ Y+ RFEW+ A G V+AL HDV++T+ Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 186 GVLSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQTL 245 G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +TL Sbjct: 195 GLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETL 254 Query: 246 HRTLITSGTTLMVILMLFLFGGPILEGFSLTMLIGVSIGTASSIYVASALALKLGMKREH 305 RT++T TTL+ ++ + ++GG ++ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 255 SRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNK 314 Query: 306 LIQQKVEK 313 + +K Sbjct: 315 EKKDPSDK 322
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 32.5 bits (74), Expect = 6e-04 Identities = 18/66 (27%), Positives = 28/66 (42%), Gaps = 7/66 (10%) Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAERLGVSERTVYRDIRDLSLSGVPVEGEAGS 57 + R +I +I+ + T L + V++ TV RDI++L L V V GS Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHL--VKVPTNNGS 59 Query: 58 GYRLLA 63 L Sbjct: 60 YKYSLP 65
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 537 bits (1384), Expect = 0.0 Identities = 294/294 (100%), Positives = 294/294 (100%) Query: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120 Query: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN Sbjct: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 Query: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH Sbjct: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240 Query: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF Sbjct: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 44.9 bits (106), Expect = 2e-07 Identities = 31/137 (22%), Positives = 56/137 (40%), Gaps = 1/137 (0%) Query: 197 AREREQGTLDQLLVSPLATWQIFVGKAVPALIVATLQATIVLAIGIWAYQIPFAGSLLLF 256 R Q T + +L + L I +G+ A A L + + + SLL Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL-SLLYA 150 Query: 257 YFTMVIYGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPQWLQDLT 316 + + GL+ G+++++L + + + P + LSG V PV+ +P Q Sbjct: 151 LPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAA 210 Query: 317 WINPIRHFTDITKQIYL 333 P+ H D+ + I L Sbjct: 211 RFLPLSHSIDLIRPIML 227
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.019 Identities = 11/27 (40%), Positives = 14/27 (51%) Query: 30 IRAGYVTGLVGPDGAGKTTLMRMLAGL 56 + Y L G G GK+TL+ L GL Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 76.4 bits (188), Expect = 1e-17 Identities = 49/290 (16%), Positives = 107/290 (36%), Gaps = 26/290 (8%) Query: 55 ASLTVDEGDSIRAGQTLGELDRAPYENALLQAQANVSTAQAQYDLMMAGYRAEEIAQAAA 114 V E + +R + E + ++N Q + N+ +A+ ++A E Sbjct: 175 YFQNVSEEEVLRLTSLIKE-QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 115 AVKQAQAAYDYAQNFYQRQ--LGLRASSAISANDLENARSSRDQAQATLKSAQDKLRQYR 172 + + + + L + N+L +S +Q ++ + SA+++ + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 173 AGNRPQ---EIAQAKASLEQAQAALAQAKLDLHDTVLTAPSDGTLMTRAV-EPGTMLNAG 228 + + ++ Q ++ LA+ + +V+ AP + V G ++ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 229 GTVLTLSLT-HPVWVRAYVDEKNLGQAQPGQEVLLYTDSRPDKPYH---GKIGFVSPSAE 284 T++ + + V A V K++G GQ ++ ++ P Y GK+ ++ A Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA- 412 Query: 285 FTPKTVETPDLRTDLVYRLRIVVTDADGA-------LRQGMPVTISFSHG 327 D R LV+ + I + + + L GM VT G Sbjct: 413 -------IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 61.2 bits (148), Expect = 1e-13 Identities = 24/154 (15%), Positives = 54/154 (35%), Gaps = 16/154 (10%) Query: 4 KGEQAKNQLIAAAIAQFGEYGQHATT-RDIAAQAGQNIAAITYYFGSKDDLYLACAQWIA 62 + ++ + ++ A+ F + G +T+ +IA AG AI ++F K DL+ + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 63 DFIGDNFRPQAEAAEHLLAGEAPDRQAIRDLILSACHNMILLLTQDDTVNLSKFISREQL 122 IG+ +R++++ + + T++ L + I + Sbjct: 68 SNIGELELEYQAKF------PGDPLSVLREILIHVLESTV---TEERRRLLMEIIFHKCE 118 Query: 123 APTA------AYHLIHQQVIAPLHHYLTRLIAAW 150 A + + + L I A Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAK 152
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.9 bits (101), Expect = 2e-06 Identities = 58/400 (14%), Positives = 129/400 (32%), Gaps = 50/400 (12%) Query: 22 LTMIFLVYAINYADRTNIGAVLPFIIDEFHINNFEAGAIASMFFLGYAVSQIP----AGF 77 L +I A++ I VLP ++ + +N + L YA+ Q G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLAL-YALMQFACAPVLGA 65 Query: 78 FIAKRGTRGLVSLSIFGFSAFTWLMGTVSSVFGLKLVRLGLGLSEGPCPVGLASTINNWF 137 + G R ++ +S+ G + +M T ++ L + R+ G++ V + I + Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AGAYIADIT 124 Query: 138 PPKEKATATGVYIAATMFAPIIVPPLAVWIAVTWGWRWVFFSFAIPGIVAAIAWYLLVKS 197 E+A G +++A ++ P+ + + FF+ A + + L+ Sbjct: 125 DGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL-- 181 Query: 198 KPAESGFVSQSELAEINAGRESHNNSVR-ENILIADRFTWLDKIIRVKKMAPIDTAKGLF 256 ESH R + + +A + + Sbjct: 182 -------------------PESHKGERRPLRREALNPLASFRWARGMTVVAAL-----MA 217 Query: 257 TSKNILGDCLAYFMMVSVLYGLLTWIPLYLVKERGFDVMSMGFVASMPCIGGFIGAIGGG 316 +F+M V ++ +D ++G + G + ++ Sbjct: 218 V----------FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQA 264 Query: 317 WISDKLLGR-RRKPTMMFTAVSTVVMMLIMLNIPASTLAVCIGLFFVGFCLNIGWPAFTA 375 I+ + R + +M ++ +++ +A I + IG PA A Sbjct: 265 MITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG--GIGMPALQA 322 Query: 376 YGMAVSDSKTYPIASSIINSGGNLGGFVAPMAAGFLLDKT 415 D + + + +L V P+ + + Sbjct: 323 MLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 118 bits (296), Expect = 2e-34 Identities = 74/255 (29%), Positives = 119/255 (46%), Gaps = 16/255 (6%) Query: 3 LKDKVAIITGAASARGLGFATAKLFAENGAKVVIIDLNGEAS---KTAAAALGEGHLGLA 59 ++ K+A ITGAA +G+G A A+ A GA + +D N E ++ A Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63 Query: 60 ANVADEVQVQAAIEQILAKYGRVDVLVNNAGITQPLKLMDIKRANYDAVLDVSLRGTLLM 119 A+V D + +I + G +D+LVN AG+ +P + + ++A V+ G Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 120 SQAVIPTMRAQKSGSIVCISSVSAQRGGGIFGGPHYSAAKAGVLGLARAMARELGPDNVR 179 S++V M ++SGSIV + S A G Y+++KA + + + EL N+R Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 180 VNCITPGLIQTDITAGKLTDD---------MTANILAGIPMNRLGDAIDIARAALFLGSD 230 N ++PG +TD+ D+ GIP+ +L DIA A LFL S Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 231 LSSYSTGITLDVNGG 245 + + T L V+GG Sbjct: 242 QAGHITMHNLCVDGG 256
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.026 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGGIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 34.8 bits (80), Expect = 4e-04 Identities = 39/156 (25%), Positives = 61/156 (39%), Gaps = 14/156 (8%) Query: 15 CALLFLVAPAV-QAAEQLPDAPS-IDAR-AWILMDYASGKVLSEGNADEKLDPASLTKIM 71 A L L A Q EQ+ + S + R I MD ASG+ L+ ADE+ S K++ Sbjct: 12 LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71 Query: 72 TSYVVGQAIKAGKIKLTDMVTVGRDAWATGNPALRGSSVMFLKPGMQVSVEDLNKGVIIQ 131 V + AG +L + + +P L GM +V +L I Sbjct: 72 LCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSE----KHLADGM--TVGELCAAAITM 125 Query: 132 SGNDASIAIADYVAGSQDAFVSLMNGYAKKMGLTNT 167 S N A+ + V G + + +++G T Sbjct: 126 SDNSAANLLLATVGGPAG-----LTAFLRQIGDNVT 156
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.2 bits (107), Expect = 3e-07 Identities = 61/267 (22%), Positives = 106/267 (39%), Gaps = 19/267 (7%) Query: 71 LLGPLSDRIGRRPVMLTGVVWFIVTCLATLLAQTIEQFTLLRFLQGISLCFIGAVGYAAI 130 +LG LSDR GRRPV+L + V A + + R + GI+ GAV A I Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120 Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVGAAWVHILPWEMMFVLFAVLAAISFFGLQR 190 + + + M+ + GP++G P F A L ++F Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCF 179 Query: 191 AMPET--ATRLGEKLSVKELGRDYRLVLKNLRFVAGALATGFVSLPLLAWIAQSP--VII 246 +PE+ R + +R + + VA +A F ++ + Q P + + Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWA-RGMTVVAALMAVFF----IMQLVGQVPAALWV 234 Query: 247 ISGEQATSYEYGMLQVPI--FGAL--IAGNLVLARLTARRTVRSLIIMGGWPIMFGLILS 302 I GE ++ + + + FG L +A ++ + AR R +++G G IL Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294 Query: 303 AAATVVSSHAYLWMTAGLSFYAFGIGL 329 A AT ++ + + GIG+ Sbjct: 295 AFAT----RGWMAFPIMVLLASGGIGM 317
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 8e-04 Identities = 34/155 (21%), Positives = 62/155 (40%), Gaps = 19/155 (12%) Query: 17 LFMFFFIPGLLMASWATRTPAIRDLLALSTAEMGVVLFGLSVGSMSGILCS---AWLVKR 73 + I G + + ++D+ LSTAE+G V+ + G+MS I+ LV R Sbjct: 262 VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI--IFPGTMSVIIFGYIGGILVDR 319 Query: 74 FGTRKVIRTTM-----SFAVLGMLVLSLALWVTSAPLFAFGLAIFGASFGSAEVAINVEG 128 G V+ + SF L+ + + ++T +F G F + S V+ +++ Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQ 379 Query: 129 AAIEREMNKTVLPMMHGFYSFGTLFGAGVGMAVTG 163 M +F + G G+A+ G Sbjct: 380 QEAGAGM---------SLLNFTSFLSEGTGIAIVG 405 Score = 30.6 bits (69), Expect = 0.013 Identities = 34/172 (19%), Positives = 63/172 (36%), Gaps = 17/172 (9%) Query: 218 LLIGVIVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTLGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVAVVRGSAVMGALGIGLIIFVDNPWVAGISVLLWGIGASLGF-PLTISAASDTGP 334 DR + V+ ++ F+ + I LG T + S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLE---TTSWFMTIIIVFVLGGLSFTKTVISTIVS 374 Query: 335 DAPKRVSVVAITGYLAFLVGPPLLGFLGEHFGLRSAMMVVLGLVMAAALVAR 386 + K+ A L F FL E G+ ++G +++ L+ + Sbjct: 375 SSLKQQEAGAGMSLLNF------TSFLSEGTGI-----AIVGGLLSIPLLDQ 415
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.4 bits (120), Expect = 4e-10 Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 2/76 (2%) Query: 1 MAR--RPNDPQRRERILQATLDTIAAHGIHAVTHRKIATCANVPLGSLTYYFSGIEALIE 58 MAR + + R+ IL L + G+ + + +IA A V G++ ++F L Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 59 EAFSLFTAEMSAQYQQ 74 E + L + + + Sbjct: 61 EIWELSESNIGELELE 76
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.1 bits (73), Expect = 0.005 Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSSFSFGIGNAAGLLFAG-IMLGFLRANHPTFG-YIPQ--GALNMVKEFGL 449 L++ + +L+ G I+ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGAGINNGLGAVGGQM--LAAGLIVSLVPVVICFLF 493 M G G+ AG + +G AA + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 28.1 bits (62), Expect = 0.049 Identities = 16/54 (29%), Positives = 23/54 (42%) Query: 2 SKLDAFIQQAVTAMPISGTSLIASLYGDALLQRGGEVWLGSVAALLEGLGFGER 55 +L A AV + S + +AL +R GE+ L A G GF +R Sbjct: 604 RELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQR 657
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 28.4 bits (63), Expect = 0.019 Identities = 34/139 (24%), Positives = 52/139 (37%), Gaps = 38/139 (27%) Query: 70 QQEALALSDELIAELKGNDVIVIAAPMYNFNIPTQLKNYFDL---VARAGVTFRY----- 121 QQ D EL+ N + +I +F+I T+ K ++ L + + T Y Sbjct: 129 QQSIKQYIDAHREELERNQIKIIGI---DFDIETEYKWFYSLQFNIKESAFTTGYAIASW 185 Query: 122 -TEKGPEGLVTGKRAVVVTSRGGIHKDTPTDLVTPYLSTFLGFIGITDVNFVFAEGIAY- 179 +E+ KR VV S GG F G+T N FA+GI Y Sbjct: 186 LSEQDES-----KR--VVASFGGGA-----------------FPGVTTFNEGFAKGILYY 221 Query: 180 -GPEVAAKAQSDAKAAIDS 197 ++K + +DS Sbjct: 222 NQKHKSSKIYHTSPVKLDS 240
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.4 bits (79), Expect = 8e-04 Identities = 28/235 (11%), Positives = 67/235 (28%), Gaps = 53/235 (22%) Query: 69 SDVLIARERVNEYQARAYAADSSLFPSLDASLTGTRARTQSAATGLPIHSTLYKGGLTAS 128 S +L AR YQ + + + + P L L + + ++L K + Sbjct: 141 SSLLQARLEQTRYQILSRSIELNKLPELK--LPDEPYFQNVSEEEVLRLTSLIKEQFST- 197 Query: 129 YDVDIWGANRSAANAAGASLEAQKAAAAAANLSVASSVAVGYVTLLSLDEQLRVTQQTLT 188 W + A++ A + + RV + L Sbjct: 198 -----WQNQKYQKELNLDKKRAERLTVLA--------------RINRYENLSRVEKSRLD 238 Query: 189 SREDAWRLAKRQFETGYTSRLELM-------QADSELRSTRAQIPPLQHQIAQQENALSV 241 L + ++ ++ +A +ELR ++Q+ ++ +I + + Sbjct: 239 DFS---SLLHK----QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291 Query: 242 LLGDNPGAVKRGEFAQLTPLRLPSQLPSTLLNRRPDIAQAERQLVAADATLASSQ 296 + +++ L +I +L + +S Sbjct: 292 VTQL-----------------FKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 101 bits (254), Expect = 5e-26 Identities = 63/409 (15%), Positives = 125/409 (30%), Gaps = 83/409 (20%) Query: 21 SIFTAAAIGLVGVLVILYAWQLPPFTRHSQFTDNAYVRGQTTFISPQVNGYITAVNVKDF 80 A I V+ + + L + G++ I P N + + VK+ Sbjct: 57 PRLVAYFIMGFLVIAFILSV-LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115 Query: 81 AIVQPGEVLFQIDDR-----IYKQRVHQAQATL------AMKEAALRNNL---------- 119 V+ G+VL ++ K + QA L + + N L Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175 Query: 120 ------------------------QQRKSAEATIAKNEAALQNARAQNLKIQADLKRIQQ 155 Q+ E + K A A+ + + + + Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 156 -------LTADGSLS---IRERDSARASA----AQGAADIEQAKAALEMSRQD------- 194 L +++ + E+++ A + +EQ ++ + ++++ Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 195 -RESTIVNRDSLEADVASAKAALELAQIDLQNTQIIAPTGGQLGQISVR-LGAYVSAGTH 252 + + ++ L + Q + I AP ++ Q+ V G V+ Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Query: 253 LTSLVPPQH--WVIANLKETQLAEVRVGQPVTFTVDALNGETFH---GKVQSISPATGVE 307 L +VP V A ++ + + VGQ V+A + GKV++I+ Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412 Query: 308 FSAISPDNATGNFVKIAQRIPVRITVNDGQNNSERLRPGMSVQVTIDTR 356 D G + I +N L GM+V I T Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLSTGNKN--IPLSSGMAVTAEIKTG 455
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.6 bits (95), Expect = 1e-05 Identities = 71/399 (17%), Positives = 134/399 (33%), Gaps = 51/399 (12%) Query: 53 EMILRLG-PVISKEFSLSPEQWGNIVALIMVALAVLDIPGSIWSDRYGSGWKRARFQVPL 111 EM+L + P I+ +F+ P + M+ ++ SD+ G + L Sbjct: 30 EMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG-------IKRLL 82 Query: 112 VLGYTALSFISGIKAISHGLTAFVLL-RVGVNLGAGWGEPVGVSNTAEWWPKEKRGFALG 170 + G F S I + H + +++ R GA + + A + PKE RG A G Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 171 VHHTGYPIGALLSGVVASLVLATFGEGSWRYCFLL--ALLVAIPLMIFWAKYSTAARINT 228 + + +G + + ++ W Y L+ ++ +P ++ K RI Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIH---WSYLLLIPMITIITVPFLMKLLK--KEVRIK- 196 Query: 229 LYQHIDSQG----------LTRPATQES---------------SHVAKGEGMKTFLRTLR 263 H D +G T S H+ K + Sbjct: 197 --GHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGK 254 Query: 264 NRNISLTAGNTLLTQIVYMGINVVLPPYLYHVSGLSLAASAGLSIIF--TLTGTLGQVIW 321 N + + G ++P + V LS A G IIF T++ + I Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE-IGSVIIFPGTMSVIIFGYIG 313 Query: 322 PWLSDSFGRKRTLIVCGLWMSIG---IALFYFATNMPRLIAIQLFFGLVANAVWPIYYAM 378 L D G L + ++S+ + T+ I I G ++ + + Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS-FTKTVISTI 372 Query: 379 ASDSAEERATSTANGIITTAMFIGGGISPLLMGWLIQFG 417 S S +++ ++ F+ G ++G L+ Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 31/141 (21%), Positives = 57/141 (40%), Gaps = 18/141 (12%) Query: 51 SVDIGLSATAFGLGAGLFFLTYAVLEIPSNLFLTRIGARRWIARIMITWGILSCG----- 105 + IG+S AFG+ L +T A R R + G+++ G Sbjct: 245 ATTIGISLAAFGILHSLA-----------QAMITGPVAARLGERRALMLGMIADGTGYIL 293 Query: 106 MAFVTGPTSFYVMRLLLGAAEAGLYPGIIYYLTLWFGREERAKATGLFLLGVCLANIIGA 165 +AF T + + +LL + G+ P + L+ E + + G L +I+G Sbjct: 294 LAFATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGP 352 Query: 166 PLGGLLLSLDGMSGWHGWQWM 186 L + + ++ W+GW W+ Sbjct: 353 LLFTAIYAAS-ITTWNGWAWI 372
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 126 bits (317), Expect = 2e-37 Identities = 75/254 (29%), Positives = 122/254 (48%), Gaps = 12/254 (4%) Query: 3 LASKTAIVTGAARGIGFGIAKVLAREGARVIIADRDAHG-EAAAASLRESGAQALFFSCN 61 + K A +TGAA+GIG +A+ LA +GA + D + E +SL+ A F + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 62 IAEKTQVEALFSQAEEAFGPVDILVNNAGINRDAMLHKLTEADWDTVIDVNLKGTFLCMQ 121 + + ++ + ++ E GP+DILVN AG+ R ++H L++ +W+ VN G F + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 122 QAAIRMRERGAGRIINIAS-ASWLGNVGQTNYSASKAGVVGMTKTACRELAKKGVTVNAI 180 + M +R +G I+ + S + + Y++SKA V TK ELA+ + N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 181 CPGFIDTDMTRG--VPENVWQIMIS--------KIPAGYAGEAKDVGECVAFLASDGARY 230 PG +TDM EN + +I IP + D+ + V FL S A + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 231 INGEVINVGGGMVL 244 I + V GG L Sbjct: 246 ITMHNLCVDGGATL 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.029 Identities = 10/20 (50%), Positives = 12/20 (60%) Query: 33 LLGPNGCGKSSLLRVLAGLR 52 L G G GKS+L+ L GL Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 65.6 bits (160), Expect = 7e-14 Identities = 77/373 (20%), Positives = 136/373 (36%), Gaps = 30/373 (8%) Query: 1 MLLGSQFVFNIGFYAVVPFLALFLRDDMLLSGGLI---GLILGLRTFSQQGMFILGGTLA 57 ++L + + +G ++P L LRD ++ S + G++L L Q + G L+ Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRD-LVHSNDVTAHYGILLALYALMQFACAPVLGALS 67 Query: 58 DRYGAKAIILAGCVVRVAGFLLLACGASLWPIILGACLTGVGGALFSPSIEALLARAGTH 117 DR+G + ++L + ++A LW + +G + G+ GA + Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGA-------VAGAYI 120 Query: 118 SQANGKRSRAEWFALFAVCGELGAVIGPVAGGVLSGIGFRHIALAGAGIFLLALAVLFFC 177 + RA F + C G V GPV GG++ G A A + L F Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180 Query: 178 LPADGHTTTTRRRVPWWTPLRQPRFVAFILAYSSWLLSY------NQLYLALPV--EIQR 229 LP R PL R+ + ++ + + Q+ AL V R Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240 Query: 230 SGGREQDLAPLFMLASLLIITLQLPLA-RFARRMGAVRILPVGFLLLSASFASVALFAAA 288 + +L Q + A R+G R L +G + + +A Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA--- 297 Query: 289 PPAEGWLRLMPAAGFVTLLTLGQMLLVPAAKDLIPLFAEESTLGAHYGALATAGGCAVLA 348 GW+ P + LL G + + PA + ++ +E G G+LA + Sbjct: 298 --TRGWM-AFPI---MVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIV 350 Query: 349 GNLLLGHLLDLAL 361 G LL + ++ Sbjct: 351 GPLLFTAIYAASI 363
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.027 Identities = 11/25 (44%), Positives = 14/25 (56%) Query: 47 IVGESGSGKSTVGRALLQLHPKKAR 71 I GESG+GK V RAL ++ Sbjct: 165 ITGESGTGKELVARALHDYGKRRNG 189
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 35.5 bits (81), Expect = 3e-04 Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 10/95 (10%) Query: 60 EVRIGDKIVNNLAPKSRGIAM-VFQNYALYPHMTVRENLAFGLKLSKLPKAQIDRQVEEA 118 +V +G ++ N A + G+A VF A E LA L++ QI + ++++ Sbjct: 504 KVALGMEVTNTAAQSAGGVAEGVFIKNA-------SEALA-DFMLARFAMDQIQQWLKQS 555 Query: 119 AKIL-ELEELLDRLPRQLSGGQAQRVAVGRAIVKK 152 +I E +++ L + +S Q R I+++ Sbjct: 556 VEIFGENQKVTAELQKAMSSAVQQNADASRFILRQ 590
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.7 bits (66), Expect = 0.025 Identities = 72/298 (24%), Positives = 118/298 (39%), Gaps = 41/298 (13%) Query: 128 NGKLNGIPISVTARVFYFNDEAWKKAGIPFPKTWDELMAAGKTFESKLGKQYYPVVLEHQ 187 NGKL PI+V A +N + PKTW+E+ A K ++K GK L+ Sbjct: 126 NGKLIAYPIAVEALSLIYNKDLLPNP----PKTWEEIPALDKELKAK-GKSALMFNLQEP 180 Query: 188 ----DVLALLNSYMVQKYNQPAIDEKGRKFSYSKAQWADFFGMYKKLIDSHVMPDTRYYA 243 ++A Y KY D K + A+ F + + + H+ DT Y Sbjct: 181 YFTWPLIAADGGYAF-KYENGKYDIKDVGVDNAGAKAGLTF-LVDLIKNKHMNADTDYSI 238 Query: 244 SFGKSNMYEMKPWIQGEWGGTYMWNSTINKYSDNLKPPAKLVLGEYPMLP--GATDAGLF 301 + N E I G W + + S +N Y + P K P P G AG Sbjct: 239 AEAAFNKGETAMTINGPWAWSNIDTSKVN-YGVTVLPTFK----GQPSKPFVGVLSAG-- 291 Query: 302 FKPAQMLSIGKSTKNPQAAAKVINFLLNSKEGVDILGLERGVPLSKAAVTYLTEDGVIKA 361 I ++ N + A + + L + EG++ + ++ PL A+ E+ A Sbjct: 292 --------INAASPNKELAKEFLENYLLTDEGLEAVNKDK--PLGAVALKSYEEE---LA 338 Query: 362 DDPAVSGLKLAQSLPTALPVSPYFDDPQIVA---QFGTTLQYIDYGKKSVEEAAEDFQ 416 DP ++A ++ A + PQ+ A T + G+++V+EA +D Q Sbjct: 339 KDP-----RIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQ 391
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.0 bits (65), Expect = 0.036 Identities = 31/142 (21%), Positives = 48/142 (33%), Gaps = 8/142 (5%) Query: 43 AGDTGIIYAVLSVSALFAQVCYGFIQDKLGLRKHLLWYITALLILSGPAYLLFGHLLKIN 102 GI+ A+ ++ G + D+ G R LL L + Y + + Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLAGAAVDYAIMATAPFLW 97 Query: 103 VL-LGSIFGGIYIGLTFNGGIGVLESYTERVARQSQFEFGRARMWGSLGWAVATFFAGLL 161 VL +G I GI G T + T+ R F F A G GL+ Sbjct: 98 VLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACF--GFGMVAGPVLGGLM 154 Query: 162 FNINPQLNFLVASCSGLVFFIL 183 +P F A+ + F+ Sbjct: 155 GGFSPHAPFFAAAALNGLNFLT 176
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 54.9 bits (132), Expect = 4e-10 Identities = 66/371 (17%), Positives = 131/371 (35%), Gaps = 34/371 (9%) Query: 38 LDIGVISGALPFITDHFTLSSQLQEWVVSSMMLGAAIGALFNGWLSFRLGRKYSLMAGAV 97 L+ V++ +LP I + F WV ++ ML +IG G LS +LG K L+ G + Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 98 LFVAGSIGSAFAAS-VEVLLVARVVLGVAVGIASYTAPLYLSEMASENVRGKMISMYQLM 156 + GS+ S +L++AR + G + ++ + RGK + + Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147 Query: 157 VTLGIVLAFLSDTAFSYSGNWRAMLGVLALPAVILIILVVFLPNSPRWLAEKGRHIEAEE 216 V +G + ++ +W L L +I II V FL + H + + Sbjct: 148 VAMGEGVGPAIGGMIAHYIHW----SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203 Query: 217 VLRMLRDTSEKARDELNEIRESLKLKQGGWALFKV----------------NRNVRRAVF 260 ++ M + L + + +F N V Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263 Query: 261 LGMLLQAMQQFTGMNIIMYYAPRIFKMAGFTTTEQQMIATLVVGLTFMFATFIAVFTVDK 320 G + G ++ Y + + +T E + ++ + +I VD+ Sbjct: 264 CGGI--IFGTVAGFVSMVPYMMK--DVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR 319 Query: 321 AGRKPALKIGFSVMALGTLVLGYCLMQFDNGTASSGLSWLSVGMTMMCIAGYAMSAAPVV 380 G L IG + +++ L + T SW + + + G + + + Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASF----LLETT-----SWFMTIIIVFVLGGLSFTKTVIS 370 Query: 381 WILCSEIQPLK 391 I+ S ++ + Sbjct: 371 TIVSSSLKQQE 381
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 110 bits (276), Expect = 2e-31 Identities = 72/257 (28%), Positives = 132/257 (51%), Gaps = 11/257 (4%) Query: 3 LDAFSLQGKVAVVSGCDTGLGQGMALGLAEAGCDIVGI--NIVEPVETIERVTALGRRFL 60 ++A ++GK+A ++G G+G+ +A LA G I + N + + + + A R Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 61 SLTADLRQIDGIPQLLERAVAEFGHIDILVNNAGLIRREDALAFSEKDWDDVMNLNIKSV 120 + AD+R I ++ R E G IDILVN AG++R + S+++W+ ++N V Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 121 FFMSQAAAKHFIAQGSGGKIINIASMLSFQGGIRVPSYTASKSAVMGVTRLLANEWAKHN 180 F S++ +K+ + + G I+ + S + + +Y +SK+A + T+ L E A++N Sbjct: 121 FNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179 Query: 181 INVNAIAPGYMATNNTQQLRADEQRSSEILD--------RIPAGRWGLPADLMGPVVFLA 232 I N ++PG T+ L ADE + +++ IP + P+D+ V+FL Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 233 SSASDYINGYTVAVDGG 249 S + +I + + VDGG Sbjct: 240 SGQAGHITMHNLCVDGG 256
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.010 Identities = 33/188 (17%), Positives = 73/188 (38%), Gaps = 12/188 (6%) Query: 33 SFYGIRPLLILFMAATVYDGGMGLARENASAIVGIFAGSMYLAALPGGWLADNWLGQQRA 92 SF+ + ++L ++ + + + F + + G L+D LG +R Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ-LGIKRL 81 Query: 93 VWYGSILIALGHLSIALSAWLGNDLFFIGLMFIVL---GSGLFKTCISVMVGTLYKKGDA 149 + +G I+ G ++ ++G+ F + +M + G+ F + V+V K Sbjct: 82 LLFGIIINCFG----SVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK--E 135 Query: 150 RRDGGFSLFYMGINIGSFIAPLISGWLIKSHGWHWGFGIGGIGMLVALIIFRVFAVPSMK 209 R F L + +G + P I G + +H HW + + + + + F + + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGG--MIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 210 RYDAEVGL 217 R + Sbjct: 194 RIKGHFDI 201
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 119 bits (299), Expect = 1e-35 Identities = 77/201 (38%), Positives = 124/201 (61%), Gaps = 3/201 (1%) Query: 1 MARKTKEEAQRTRQLLIESAIQQFALRGVTNTTLTDIADAAGVTRGAVYWHFASKTELFN 60 MARKTK+EAQ TRQ +++ A++ F+ +GV++T+L +IA AAGVTRGA+YWHF K++LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EMW-QQQPPLRDLIQPSQAIEYEHEPLNALRERFIAGLRYIAANPRQRALMQILYQRCEF 119 E+W + + +L QA ++ +PL+ LRE I L R+R LM+I++ +CEF Sbjct: 61 EIWELSESNIGELELEYQA-KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 SSDMLSEYEIRQRIGF-NYSLISGILQCCVRNNILPAETNIEMILIVLHSAFSGLIKNWL 178 +M + ++ + +Y I L+ C+ +LPA+ I++ SGL++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 LDPQRFDLYQQAPALVDNIMA 199 PQ FDL ++A V ++ Sbjct: 180 FAPQSFDLKKEARDYVAILLE 200
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.7 bits (85), Expect = 1e-04 Identities = 34/211 (16%), Positives = 67/211 (31%), Gaps = 30/211 (14%) Query: 97 ATYQAAWNSAKGDEAKAEAAAAIAHLTVKRYVPLLGTKYISQQEYDQAVATA-RQADADV 155 K + E+ A + Q + + RQ ++ Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 156 IATKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVTNGQSDALATVQQLDPIYVDV 214 + + + +P+S ++ + V TEG +VT ++ + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTA 370 Query: 215 TESSNDFMRLKQESLQRGGDTKSVELVMENGQAYP-LKGSLQ--FSDVTVDESTG----- 266 + D + G +++ Y L G ++ D D+ G Sbjct: 371 LVQNKDIGFINV------GQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNV 424 Query: 267 --SITLRAIFPNPQHV-LLPGMFVRARIDEG 294 SI + +++ L GM V A I G Sbjct: 425 IISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455 Score = 35.2 bits (81), Expect = 3e-04 Identities = 23/127 (18%), Positives = 41/127 (32%), Gaps = 15/127 (11%) Query: 46 APLSVTTELPGR-TSAFRVAEVRPQVSGIILKRNFV-EGSDVEAGQSLYQIDPATYQAAW 103 + + G+ T + R E++P + I+ K V EG V G L ++ +A Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEA-- 134 Query: 104 NSAKGDEAKAEAAAAIAHLTVKRYVPLLGTKYISQQEYDQAVATARQADADVIATKAAVE 163 D K +++ A L RY L E ++ + Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 164 TARINLA 170 +L Sbjct: 185 LRLTSLI 191
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.5 bits (58), Expect = 0.005 Identities = 20/65 (30%), Positives = 27/65 (41%), Gaps = 3/65 (4%) Query: 1 MKKYLIVALLASLLAGCAHDSPCV---PVYDSQGRLVHTNTCMKGTTEDNWETAGAIAGG 57 MKK L A LA L+ GCA + V P + + + + G + A I GG Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 65 Query: 58 AAAVA 62 A V Sbjct: 66 AENVV 70
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 122 bits (309), Expect = 3e-37 Identities = 68/150 (45%), Positives = 86/150 (57%), Gaps = 7/150 (4%) Query: 2 LAALPFLLCYSGLTVALCHQDLRHGLLPDRYTCPLLWSGLLFYLCLAPHQLHDAVWGAIA 61 L LL L VAL DL LLPD+ T PLLW GLLF L L DAV GA+A Sbjct: 132 WGTLAALLLTWVL-VALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMA 190 Query: 62 GYLSLAAIYWLYRGIRGYEGLGYGDIKYLAALGAWHGWRLLPQLVLVASLLAGIAWAGAG 121 GYL L ++YW ++ + G EG+GYGD K LAALGAW GW+ LP ++L++SL+ Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFM---GI 247 Query: 122 LYASCGRKSKWGRSNPLPFGPFLAAAGFWC 151 +S P+PFGP+LA AG+ Sbjct: 248 GLILLRNHH---QSKPIPFGPYLAIAGWIA 274
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 36.8 bits (85), Expect = 9e-06 Identities = 18/103 (17%), Positives = 42/103 (40%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+++ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAQNLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 4e-18 Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160 G+P I F+NK D + L V +++E LS Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 613 bits (1583), Expect = 0.0 Identities = 179/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRVNIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W +VNIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMQDLADEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KKALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 ++ R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLRGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 36.2 bits (83), Expect = 2e-04 Identities = 32/208 (15%), Positives = 59/208 (28%), Gaps = 12/208 (5%) Query: 125 SSSQQTASGEKSINLSDDQSASMPAAGQDQTAAANSTSQQDVTVPPIAANPTQGQAAVAP 184 S + A + +T A NS + N A Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV----EKNEQDATETTAQ 1064 Query: 185 QGQQRIEVQGDLNNALTQQ---QGQLDGAVANSTLPTEPATVAPIRNGANGTAAPRQATE 241 + E + ++ Q + +T E ATV T ++ + Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124 Query: 242 RQTAATPRPAERKHTVIEAKPQPKPQAVAKTPVESKPVQPKHVESTATTAPAKTSVSESK 301 + +P+ + + +A+P + V K Q + + T PAK + S + Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPT----VNIKEPQSQTNTTADTEQPAKETSSNVE 1180 Query: 302 PVATAQSKPTTTTAAPAATAAAAAPAAK 329 T +S T + PA Sbjct: 1181 QPVT-ESTTVNTGNSVVENPENTTPATT 1207
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.6 bits (69), Expect = 0.002 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 32 FYDSDQEIEKRTGADVGWVFDVEGEEGFRD----------REEKIINELTEKQGIVLATG 81 FYD + KR + GW+ + G+R E + I +L E+ IV+A+G Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193 Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112 GG V + +GV E I+K LA Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 226 bits (578), Expect = 3e-70 Identities = 77/282 (27%), Positives = 122/282 (43%), Gaps = 17/282 (6%) Query: 139 GGKLLSARGHLMADKRTNRLLIRDDARHLPALKAWAQEMDLPVGQVELAAHIVSMSETSL 198 SA+ + AD N +++RD +P + +D P ++E+A IV ++ L Sbjct: 237 AATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQL 296 Query: 199 RELGVKWRLAEAGSPPGSGQITTLSSDVSVNDASTRAGFNIGKINGRLLEL---ELSALE 255 ELGV WR+ I T ++ G ++ R L+ ++ LE Sbjct: 297 TELGVDWRVGIRTGNNHQVVIKTTGDQSNIAS----NGALGSLVDARGLDYLLARVNLLE 352 Query: 256 RKQQVEIIASPRLLASHMQPASIKQGSEIPYQVSSGESGATSVEFKEAVLG--MEVTPTV 313 + ++++ P LL A I SE Y +G+ A E K G + +TP V Sbjct: 353 NEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA---ELKGITYGTMLRMTPRV 408 Query: 314 LQQG---RVRLKLRISENTPGQVLKQENGEALAIDKQEIETLVEVRSGETLALGGIFSQK 370 L QG + L L I + I + ++T+ V G++L +GGI+ + Sbjct: 409 LTQGDKSEISLNLHIEDGNQKPNS-SGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDE 467 Query: 371 NKTARDSVPLLGDIPVLGRLFRRDGKDNERRELVVFITPRIL 412 A VPLLGDIP +G LFRR + R + I PRI+ Sbjct: 468 LSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 34.4 bits (78), Expect = 2e-04 Identities = 29/118 (24%), Positives = 44/118 (37%), Gaps = 16/118 (13%) Query: 37 LLVSRTARLQRDFLATLHTTADAQLLASLKQREQAMREAWQQHQRQRQQYQRRSAIAAWQ 96 L + LQ A + A+ K REQA EA +++ + Q R A Sbjct: 192 LFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEA-----KRKAEEQARQQAAIRA 246 Query: 97 PRLQALAAD----LPAQAWLTRLEYQGVLLTLDGLALNLQALTSVEAALTRVAGFAPA 150 A+ A+ A +G++ G A QA++ A L RV AP+ Sbjct: 247 ANTYAMPANGSVVATAAG-------RGLIQVAQGAASLAQAISDAIAVLGRVLASAPS 297
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 1e-05 Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 13/63 (20%) Query: 80 MAVAAGHQGCGIGSALMREMID------LCDNWLRVERIELTVFADNAPAIAVYKKYGFE 133 +AVA ++ G+G+AL+ + I+ C L + I N A Y K+ F Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI-------NISACHFYAKHHFI 147 Query: 134 IEG 136 I Sbjct: 148 IGA 150
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 33.2 bits (75), Expect = 0.003 Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%) Query: 276 RTPVSGEYRGYEVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQVMAEAEKHA 334 R P+ GE R + SMPPP G H +I N+ F Q G+ G A +++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133 Query: 335 YADRSEYLGDPDFVNVPWQA 354 Y P F WQ+ Sbjct: 134 Y---------PTFSYQDWQS 144
>PF04619#Dr-family adhesin Length = 160 Score = 28.4 bits (63), Expect = 0.018 Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 29 VGARYGHTMIEFDAKLSKDGQIFLLHDDNLERTSNGWGVAGELAW----DDLLKVDAGSW 84 +G ++ D + G+ FL+ D+N ++ AW K D GSW Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.036 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 38.2 bits (88), Expect = 6e-05 Identities = 44/175 (25%), Positives = 73/175 (41%), Gaps = 17/175 (9%) Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPDQPPKTWQDLAAYTAKLKAAGMKCGYASGWQ 193 G L++ P L YNKD L P+ PPKTW+++ A +LKA G + + Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLPN-PPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 194 GWIQIENFSAWHGLPVATKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYFGR 251 + +A G +N +D D ++ K + L++ + D Y Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236 Query: 252 KDESTEKFYNGDCAITTASSGSLADIRQYAKFNYGVGMMPYDADVKGAPQNAIIG 306 + F G+ A+T + ++I +K NYGV ++P KG P +G Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.049 Identities = 23/137 (16%), Positives = 43/137 (31%), Gaps = 23/137 (16%) Query: 5 RTMTQQKLSFWLALYIGWFMNVAVFFRRFDGYAQEFTFWKGLSGVVELVATVFVTFFLLR 64 T Q +W IGW + F G+A + K S + + ++ Sbjct: 3 STHRQANKYYWYCQGIGWGVYTLTGF----GFASLYGSPKLHSMIFNIAISLMGLVLTHA 58 Query: 65 LLSLFGRRIWRILATLIVLFSAAASYYMTFLNVVIGYGIIASVMTTDIDLSKEVIGWHLI 124 S R+ W L ++ + + G++ V T I W L+ Sbjct: 59 YRSFIKRQGWLKLNMGQIILRVLPA--------CVVIGMVWFVANTSI--------WRLL 102 Query: 125 LWLVAVSAPPLLFIWSN 141 ++ + P+ F Sbjct: 103 AFI---NTKPVAFTLPL 116
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 4e-06 Identities = 48/276 (17%), Positives = 92/276 (33%), Gaps = 33/276 (11%) Query: 44 PVSQVAFSFGLLSLGLALS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSSSL 99 + V +G+L AL + V G L +RFG + V + S + + + A + L Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96 Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFSIGSYGLGSLGFK 152 +L++ AG+ AG + + F + LG Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152 Query: 153 FIDSHLLATVGLEKTFVIWGAIVLVMIVFGATLMKDAPNHPAATAANGVVENDFTLAESM 212 L+ F A+ + + G L+ + + N Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPE-SHKGERRPLRREALNPLASFRWA 206 Query: 213 R--KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQGMVHLDVATAANAVTVISIAN-L 265 R ++AV F+ + L+VI + H D T ++ I + L Sbjct: 207 RGMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSL 261 Query: 266 SGRLVLGILSDKISRIRVITIGQVVSLVGMAALLFA 301 + ++ G ++ ++ R + +G + G L FA Sbjct: 262 AQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Score = 34.0 bits (78), Expect = 9e-04 Identities = 31/119 (26%), Positives = 51/119 (42%), Gaps = 2/119 (1%) Query: 270 VLGILSDKISRIRVITIGQVVSLVGMAALLFAPLNALTFFAAIACVAFNFGGTITVFPSL 329 VLG LSD+ R V+ + + V A + AP + + I VA G T V + Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI--VAGITGATGAVAGAY 119 Query: 330 VSEFFGLNNLAKNYGVIYLGFGIGSICGSLIASLFGGFYVTFCVIFALLILSLALSTTI 388 +++ + A+++G + FG G + G ++ L GGF A + L T Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 5/52 (9%) Query: 76 VAPGATRQGIGRALLDEVKQ-----HYAWLSLEVYQKNESAVSFYHAQGFRI 122 VA ++G+G ALL + + H+ L LE N SA FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 118 bits (296), Expect = 6e-34 Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 11/124 (8%) Query: 108 LNMPNNVTFDSNSANLKPAGANTLTGVAMVLKEYEKT--AVNVVGYTDSTGSKDLNMRLS 165 + ++V F+ N A LKP G L + L + +V V+GYTD GS N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASALITQGVAANRIRTTGMGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A++I GMG +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SPLQ 220 ++ Sbjct: 335 KGIK 338
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.020 Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 15/55 (27%) Query: 74 GHIELGKWADLVILAPA----------TADLIARVAAGMANDLVSTICLATPSPV 118 G +E+GK ADLV+ PA IA G N + TP PV Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNA-----SIPTPQPV 473
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.5 bits (74), Expect = 0.003 Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 3/151 (1%) Query: 65 AFVAMFSSLFITTVIGKTDRRYVVILFSLLLTLSCLLVSFADSFTLLLLGRACLGLALGG 124 A + + + + + RR V+++ + +++ A +L +GR G+ G Sbjct: 53 ALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GA 111 Query: 125 FWAMSASLTMRLVPMRVVPKALSIIFGAVSIALVIAAPLGSFLGGLIGWRNVFNGAAVMG 184 A++ + + + + +V LG +GG F AA + Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALN 170 Query: 185 VLCTLWVLKALP-SLPGESASQQQNMFGLLK 214 L L LP S GE ++ L Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLA 201
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.9 bits (80), Expect = 7e-04 Identities = 27/168 (16%), Positives = 62/168 (36%), Gaps = 17/168 (10%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89 Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 F + +G S F ++ + F Q G + + + ++ P+ RG G Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212 +G + A+Y+ + + + P + I+ L Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.8 bits (85), Expect = 1e-04 Identities = 65/407 (15%), Positives = 130/407 (31%), Gaps = 58/407 (14%) Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILASNVLTRSDIGLLATLFYITYGLSKFFSG 86 RH + IWL F+ N ++P+I + + T F +T+ + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSDARYFMGLGLIATGVVNILFGFSSSLWAFALLWALNAFFQGWGS---PVCARLL 143 +SD+ + + G+I +++ S F L + F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPMVVGAAALHYGWRAGMTIAGCLAILAGLYLC 202 A Y + RG + L + +G + P + G A + W + I I Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV----- 182 Query: 203 WRLRDRPQAVGLPAVGDWRHDALEIAQQQEGAGMSRKAILTRYVLANPYIWLLSLCYVLV 262 P + L +I G + I+ + Y + VL Sbjct: 183 ------PFLMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANSAVTMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNGNRGPMNLIFAAGILLSVGGLWLMPFASYVMQAACFFTTGFFVFGPQMLI- 365 GS +F G + + GIL+ G + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352 Query: 366 --------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395 G++ + ++ AGA + ++L Sbjct: 353 IIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.5 bits (157), Expect = 2e-14 Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 5/116 (4%) Query: 2 TTIALIDDHLIVRSGFAQLLGLEADFQVVAEFGSGREALTGLPGRGVQVCICDISMPDIS 61 TI + DD +R+ Q L + V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALIEQALNAGARGFLSKRCSPDELIAAVRT 114 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 1e-22 Identities = 33/132 (25%), Positives = 62/132 (46%), Gaps = 1/132 (0%) Query: 2 KPVILVVDDDRAMGELLSDVLGVHAFEVLVSQTGNDALTTVAQRADIALVLLDMILPDTH 61 ILV DDD A+ +L+ L ++V ++ +A D LV+ D+++PD + Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-AGDGDLVVTDVVMPDEN 61 Query: 62 GLQVLQQLQRTRPELPVVMLSGLGSESDVVVGLEMGADDYIAKPFSSRVVVARVKAVLRR 121 +L ++++ RP+LPV+++S + + E GA DY+ KPF ++ + L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 122 SGALAGEASGAG 133 + Sbjct: 122 PKRRPSKLEDDS 133
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 29.4 bits (66), Expect = 0.019 Identities = 16/65 (24%), Positives = 25/65 (38%), Gaps = 5/65 (7%) Query: 55 KLAGDNVKVTLVSSGYDLGQQVAQIDNFIAAKVDMIIL---NAADSKGIGPAVKRAKEAG 111 L +KV + I I KVD+I + D + AVK+A + Sbjct: 111 DLLI--IKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQ 168 Query: 112 IVVVA 116 I+V+ Sbjct: 169 ILVMC 173
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 28.7 bits (64), Expect = 0.040 Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 2/40 (5%) Query: 227 FVYGMSGLLSGLGGVMSASRLYSANGNLGVGYELDAIAAV 266 ++G+ + GG A R + N G+G + A+ A+ Sbjct: 400 LLHGLPAGWTIYGGTQLADRYRAF--NFGIGKNMGALGAL 437
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 55.6 bits (134), Expect = 4e-10 Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 8/121 (6%) Query: 646 LVLEDEEDVRQTLCEQLHQLGWLTLETASGEEALQLLEASPDIALLISDLMLPGALSGAD 705 LV +D+ +R L + L + G+ T++ + + A L+++D+++P + D Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE-NAFD 64 Query: 706 VIHTARRRFPALPVLLISGQDLRPAQNPALPE--VEWLRKPF----TRAQLAQALSAAYA 759 ++ ++ P LPVL++S Q+ A + ++L KPF + +AL+ Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 760 R 760 R Sbjct: 125 R 125
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.8 bits (67), Expect = 0.002 Identities = 24/115 (20%), Positives = 47/115 (40%), Gaps = 12/115 (10%) Query: 13 LSLTSLAARADIIDDAIGNIQQAINDAYNPGSSRSDDDDRYDDDGRYDDGRYQGS----- 67 L LT+L A AD + +Q + SRS + ++ + D+ +Q Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 68 -------RQQSRDSQRQYDERQRQLDERRRQLDERQRQLDRDRRQLESDQRRLDD 115 ++Q Q Q +++ LD++R + +++R ++ RLDD Sbjct: 185 LRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.027 Identities = 14/42 (33%), Positives = 19/42 (45%) Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDSGHIHIRHGDEWVDLV 77 VVL G G GKSTL+ +L H I G + + + Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI 639
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 48.4 bits (115), Expect = 2e-08 Identities = 40/171 (23%), Positives = 71/171 (41%), Gaps = 7/171 (4%) Query: 200 REREHGTIEHLLVMPITPFEIMLAKI-WSMGLVVLVVSGLSLILMVQGILQVPIEGSIPL 258 R T E +L + +I+L ++ W+ L +G+ ++ G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTLARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQLVQD 317 + L V AL+ A S+G+ + LA S LV+ P+ LSG P + +P + Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGASFAIVWPQFLTLL-AIGGVFFTIALLRFR 367 +P +H + L + I+ + + + F + ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 29.3 bits (65), Expect = 0.020 Identities = 20/97 (20%), Positives = 44/97 (45%), Gaps = 3/97 (3%) Query: 139 LGVTQSYTCKLEEISDFRNQMRVQFWRDFLGNSPS-IPPVLYGLHEPRPSLEK--DDEQE 195 +G S +++ D ++ + + G P + + G+ +P+ + DD+ + Sbjct: 24 IGAPPSAHAGADDVVDSSKSFVMENFSSYHGTKPGYVDSIQKGIQKPKSGTQGNYDDDWK 83 Query: 196 VFYTTALTPEMANGHLQHAHPVTLEGGEYVMFTYEGL 232 FY+T + A + + +P++ + G V TY GL Sbjct: 84 GFYSTDNKYDAAGYSVDNENPLSGKAGGVVKVTYPGL 120
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 97.6 bits (243), Expect = 1e-25 Identities = 36/146 (24%), Positives = 64/146 (43%) Query: 1 MQQPRIWLVEDEQSIADTLVYMLQQEGFQVSVFGRGLPALEAAAHQAPDVAILDVGLPDI 60 M I + +D+ +I L L + G+ V + A D+ + DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRRLLMRYPALPVLFLTARSDEVDKLLGLEIGADDYIAKPFSPREVCARVRTVLR 120 + F+L R+ P LPVL ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RLQKFAAPSPVVRVGEFVLDEQAAAI 146 ++ + L ++AA+ Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAM 146
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 9e-05 Identities = 28/94 (29%), Positives = 41/94 (43%), Gaps = 20/94 (21%) Query: 379 AIDFTPQGGEIALAAEKRNEEVQLSVIDNGCGIPDYALERIFERFYSLPREDGHKSSGLG 438 I PQGG+I L K N V L V + G SL ++ +S+G G Sbjct: 271 GIAQLPQGGKILLKGTKDNGTVTLEVENTG----------------SLALKNTKESTGTG 314 Query: 439 LAFVREVARLHHGD---INLHNRPEGGVVATLRL 469 L VRE ++ +G I L + +G V A + + Sbjct: 315 LQNVRERLQMLYGTEAQIKLSEK-QGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 2e-20 Identities = 31/122 (25%), Positives = 60/122 (49%), Gaps = 1/122 (0%) Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSENDINLVIMDINLPGK 60 M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 NGLLLARELRE-QADVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119 N L +++ + D+ ++ ++ ++ + I E GA DY+ KPF+ EL L+ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 120 RT 121 Sbjct: 121 EP 122