>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature. Length = 574 Score = 889 bits (2299), Expect = 0.0 Identities = 566/574 (98%), Positives = 570/574 (99%) Query: 1 MKDMSNKKTFKKYSRVAGLLTAALIIGNLVTANAESNKQNTASTETTTTNEQPKPESSEL 60 MKDMSNKK FKKYSRVAGLLTAALI+GNLVTANA+SNKQNTA+TETTTTNEQPKPESSEL Sbjct: 1 MKDMSNKKIFKKYSRVAGLLTAALIVGNLVTANADSNKQNTANTETTTTNEQPKPESSEL 60 Query: 61 TTEKAGQKTDDMLNSNDMIKLAPKEMPLESAEKEEKKSEDKKKSEEDHTEEINDKIYSLN 120 TTEKAGQK DDMLNSNDMIKLAPKEMPLESAEKEEKKSED KKSEEDHTEEINDKIYSLN Sbjct: 61 TTEKAGQKMDDMLNSNDMIKLAPKEMPLESAEKEEKKSEDNKKSEEDHTEEINDKIYSLN 120 Query: 121 YNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISIIDSVTDRTYPAA 180 YNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISIIDSVTDRTYPAA Sbjct: 121 YNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTPVDISIIDSVTDRTYPAA 180 Query: 181 LQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWH 240 LQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWH Sbjct: 181 LQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWH 240 Query: 241 DNYSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAY 300 DNYSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAY Sbjct: 241 DNYSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAY 300 Query: 301 KQIFYTVSANLPNNPADVFDKSVTFKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSK 360 KQIFYTVSANLPNNPADVFDKSVT KELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSK Sbjct: 301 KQIFYTVSANLPNNPADVFDKSVTLKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSK 360 Query: 361 SNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAEHNKVVTKDFDVIRNVIK 420 SNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAEHNKVVTKDFDVIRNVIK Sbjct: 361 SNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAEHNKVVTKDFDVIRNVIK 420 Query: 421 DNATFSRKNPAYPISYTSVFLKNNKIAGVNNRTEYVETTSTEYTSGKINLSHQGAYVAQY 480 DNATFSRKNPAYPISYTSVFLKNNKIAGVNNR+EYVETTSTEYTSGKINLSHQGAYVAQY Sbjct: 421 DNATFSRKNPAYPISYTSVFLKNNKIAGVNNRSEYVETTSTEYTSGKINLSHQGAYVAQY 480 Query: 481 EILWDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEW 540 
EILWDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEW Sbjct: 481 EILWDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEW 540 Query: 541 WRKVIDERDVKLSKEINVNISGSTLSPYGSITYK 574 WRKVIDERDVKLSKEINVNISGSTLSPYGSITYK Sbjct: 541 WRKVIDERDVKLSKEINVNISGSTLSPYGSITYK 574
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 29.3 bits (65), Expect = 0.035 Identities = 16/72 (22%), Positives = 32/72 (44%), Gaps = 2/72 (2%) Query: 123 TSDVVILADGVIEIIDLKYGKGMPVSANQNPQMGLYALGAYASYDMV--YDFDRIKMTII 180 + + ++ D +E+I+ ANQ + + G + D Y FD K + Sbjct: 850 SKKIKVVEDKPVEVINESEPNNDFEKANQIAKSNMLVKGTLSEEDYSDKYYFDVAKKGNV 909 Query: 181 QPRLDSVSSVDI 192 + L++++SV I Sbjct: 910 KITLNNLNSVGI 921
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 554 bits (1429), Expect = 0.0 Identities = 255/334 (76%), Positives = 288/334 (86%), Gaps = 2/334 (0%) Query: 1 MTETIPLRVQFKRMTAEEWARSTVILLEGEIGLETDTGYAKFGDGKNRFSKLKYLNKPDL 60 MTETIPLRVQFKRMTAEEW RS VILLE EIG ETDTGYAKFGDGKN+FSKLKYLNKPDL Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYLNKPDL 60 Query: 61 DAFAQKKETDNKIAKLESIKADKDTVYLKAESKIELDKKLSLAGGIVTGQLRLKPN-SGI 119 AFAQK+ET++KI KLES KADK+ VYLKAESKIELDKKL+L GG++TGQL+ KPN SGI Sbjct: 61 GAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGI 120 Query: 120 EKSSSTGGAINIDMSKSKGAAMVMYTNKDTTDGPLMILRSNKDTFDQSVQFVDYRGKTNA 179 + SSS GGAINIDMSKS+GA +V+Y+N DT+DGPLM LR+ K+TF+QS FVDY GKTNA Sbjct: 121 KPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNA 180 Query: 180 VNIVMRQPPTPNFSSALNITSANEGGSAMQIRGVEKALGTLKITHENPSVDKEYDKNAAA 239 VNI MRQP TPNFSSALNITS NE GSAMQIRGVEKALGTLKITHENP+V+ YD+NAAA Sbjct: 181 VNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAA 240 Query: 240 LSIDIVKKKKGGGDGTAAQGIFINSSSGTTGKLLRIRNKNEDKFYVNPDGGFHSYADSIV 299 LSIDIVKK+K GG GTAAQGI+INS+SGTTGKLLRIRN +DKFYV DGGF++ S + Sbjct: 241 LSIDIVKKQK-GGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQI 299 Query: 300 DGNLTVKNPTSGKHAATKDYVDKKFDELKKLIQK 333 DGNL +KNPT+ HAATK YVD + +LK L+ Sbjct: 300 DGNLKLKNPTADDHAATKAYVDSEVKKLKALLMD 333
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 28.3 bits (63), Expect = 0.039 Identities = 29/130 (22%), Positives = 44/130 (33%), Gaps = 16/130 (12%) Query: 109 SAKMDFNSNATINFNSRDNALVRKDGT--HTAFVHFSNATPKGYTGSALY------ASIG 160 S DF A I + L +K GT ++ F P+ Y A A+I Sbjct: 511 STPADFRMLAPIVL---EQVL-KKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIV 566 Query: 161 ITSSGDGVNSASSGRFAGLRSFRYAT---GYNHTAAVDQTELYGDNVLIADDFSINRGFK 217 T + S G Y + + + +V TEL G +V + R Sbjct: 567 DTQLKNNEVILS-GEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRRPN 625 Query: 218 FRPDKMEKVL 227 R DK+ + Sbjct: 626 SRIDKVRYMF 635
>SECA#SecA protein signature. Length = 901 Score = 31.0 bits (70), Expect = 0.011 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Query: 165 VIKH-YEKLAKGKQAIVYTHSVEASHLVSDMFNQAGYQSQSVSGKTPKSEREEAMQAFRD 223 +I+ E+ AKG+ +V T S+E S LVS+ +AG + ++ K +E QA Sbjct: 438 IIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYP 497 Query: 224 GKLRILVN 231 + I N Sbjct: 498 AAVTIATN 505
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.004 Identities = 23/164 (14%), Positives = 56/164 (34%), Gaps = 11/164 (6%) Query: 8 EQSGAQEEAKEQTFDDILSDPKKQAEFDKRVAKAIDTARN-KWVAETEEKENEAK----- 61 E + E ++ ++ ++ + E + ++ +T T EKE +AK Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQ-TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118 Query: 62 --RLAKMNAEQKAQHEKAKLEARIAELEAER--TLSEMKSAARTMLSEANINISDALLSQ 117 + K+ ++ + E+++ AE E T++ + ++T + + S Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178 Query: 118 LVSTTDADKTKNAVEAFSEAFSEAIEKEVKERLKSPTPKKSNGN 161 + T N + E + + S + K Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 37.4 bits (86), Expect = 3e-04 Identities = 32/139 (23%), Positives = 57/139 (41%), Gaps = 9/139 (6%) Query: 294 VFSQLYLESFWGDTPVGRAD----NNWGGI----TWTGATTRPSGINVSQGQSRAEGGYY 345 + +Q LES WG + R + N G+ W G T + G+++ + Sbjct: 174 ILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKF 233 Query: 346 NHYASVDDYLKDYAYLLAEQGIY-AVKGKLTIDEYTRGLFRVGGATYDYAAAGYDHYAPL 404 Y+S + L DY LL Y AV + ++ + L G AT + A + Sbjct: 234 RVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQ 293 Query: 405 MRDIRAGINRNNNGAMDNV 423 M+ I +++ + +DN+ Sbjct: 294 MKSISDKVSKTYSMNIDNL 312
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 28.9 bits (64), Expect = 0.007 Identities = 15/63 (23%), Positives = 25/63 (39%), Gaps = 5/63 (7%) Query: 25 VSAPVKHVLDNNKKAMEALESAIVKISDD-----LKDNNFKWTESKNHRDRLQKVQDQHE 79 + APV +D + L + + +SD LKDN F + R + D Sbjct: 38 IDAPVTASIDLQSVSYTDLATQLNDVSDFGKMIILKDNGFNRQVHVSMDKRTKIQLDNEN 97 Query: 80 IRI 82 +R+ Sbjct: 98 VRL 100
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 27.8 bits (62), Expect = 0.009 Identities = 11/54 (20%), Positives = 21/54 (38%), Gaps = 5/54 (9%) Query: 46 VARNAVEAVEQIAYDKDIK---GIEKLTEAKIAVRDELSKHNVYLSDK--QMEV 94 ++V V Q + D + G+ K A R + K ++ + +EV Sbjct: 486 RTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539
>BACTRLTOXIN#Bacterial toxin signature. Length = 266 Score = 277 bits (711), Expect = 1e-96 Identities = 114/257 (44%), Positives = 161/257 (62%), Gaps = 19/257 (7%) Query: 11 MVFFVLVTFLGLTISQEVFA--QQDPDPSQLHRSS-LVKNLQNIYFLYEGDPVTHENVKS 67 ++ F L+ + + V A Q DP P LH+SS + N+ +LY+ V+ VKS Sbjct: 11 ILIFALIL---VISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKS 67 Query: 68 VDQLLSHDLIYNVSGP---NYDKLKTELKNQEMATLFKDKNIDIYGVEYYHLCYLCE--- 121 VD+ L+HDLIYN+S NYDK+KTEL N+++A +KD+ +D+YG YY CY Sbjct: 68 VDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFSSKDN 127 Query: 122 ---NAERSACIYGGVTNHEGNHLEIP--KKIVVKVSIDGIQSLSFDIETNKKMVTAQELD 176 C+YGG+T HEGNH + + ++V+V + ++SF+++T+KK VTAQELD Sbjct: 128 VGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTAQELD 187 Query: 177 YKVRKYLTDNKQLYTNGPSKYETGYIKFIPKNKESFWFDFFPEP--EFTQSKYLMIYKDN 234 K R +L + K LY S YETGYIKFI N +FW+D P P +F QSKYLM+Y DN Sbjct: 188 IKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDN 247 Query: 235 ETLDSNTSQIEVYLTTK 251 +T+DS + +IEV+LTTK Sbjct: 248 KTVDSKSVKIEVHLTTK 264
>PF03309#Bvg accessory factor Length = 271 Score = 29.7 bits (67), Expect = 0.012 Identities = 13/65 (20%), Positives = 23/65 (35%), Gaps = 7/65 (10%) Query: 18 LLCIDIGGTSLKFALCHN----GQLSQQSSFPT--PSSLEKFYQLLDQEVARYSAYHFSG 71 LL ID+ T L ++ QQ T + ++ +D + A +G Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDG-LIGDDAERLTG 60 Query: 72 IAISS 76 + S Sbjct: 61 ASGLS 65
>PF06580#Sensor histidine kinase Length = 349 Score = 180 bits (459), Expect = 6e-54 Identities = 71/324 (21%), Positives = 132/324 (40%), Gaps = 34/324 (10%) Query: 251 LSKAYRMQYNRSGDLLAYVAVRKSYLLAEAVRTVFVYGLVSLLLAWLLLQLL-FRVFRNY 309 L+ AYR R G L + + A + + V+ W LL + + Sbjct: 55 LTHAYRSFIKRQG-WLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFT 113 Query: 310 IQQVSEITDTVEMVAAGDLSLTIDNSHMELELYHISEAINQMLASIKAYIDEVYVLEVEQ 369 + I V +V + M LY + +A ID+ + Sbjct: 114 LPLALSIIFNVVVV-----------TFMWSLLYF---GWHFFKNYKQAEIDQWK-MASMA 158 Query: 370 RDAQMRALQSQINPHFLYNTLEYIRMYALSCQQEELADVIYAFASLLRNNI--SQDKMTT 427 ++AQ+ AL++QINPHF++N L IR L + +++ + + L+R ++ S + + Sbjct: 159 QEAQLMALKAQINPHFMFNALNNIRALILE-DPTKAREMLTSLSELMRYSLRYSNARQVS 217 Query: 428 LKEELAFCEKYIYLYQMRYPDSFAYHVKIDESIADLAIPKFVIQPLVENYFVHGIDYSRH 487 L +EL + Y+ L +++ D + +I+ +I D+ +P ++Q LVEN HGI Sbjct: 218 LADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ 277 Query: 488 DNALSIKALDETDHLLIQVLDNGRGISQERLADMEKRLQEHQTTGNISIGLQNVYLRLFH 547 + +K + + ++V + G L T + GLQNV RL Sbjct: 278 GGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKESTGTGLQNVRERLQM 324 Query: 548 HFRDRVSWSMAKEPNGGFIIQIRI 571 + ++++ G + I Sbjct: 325 LYGTEAQIKLSEKQ-GKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.9 bits (210), Expect = 1e-19 Identities = 31/133 (23%), Positives = 50/133 (37%), Gaps = 6/133 (4%) Query: 4 KVLLVDDEYMILQGLTMIIDWQALGFEVVQTARSGKEALAYLTQYPVDVMISDVTMPGMT 63 +L+ DD+ I L + G++V + ++ D++++DV MP Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 64 GLDLIEAAKTYHPQLQTLILSGYQEFSYVQKAMELETKGYLLKPVDKAELQAKMKQFKDC 123 DL+ K P L L++S F KA E YL KP D L + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD---LTELIGIIGRA 118 Query: 124 LDAQQAESIRQEA 136 L + + E Sbjct: 119 LAEPKRRPSKLED 131
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.1 bits (60), Expect = 0.046 Identities = 8/24 (33%), Positives = 15/24 (62%) Query: 151 GELAKILKQNGVNIGQNKLFQWLR 174 EL ILK++G N+ Q + + ++ Sbjct: 23 DELVDILKKDGYNVTQATVSRDIK 46
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 27.8 bits (61), Expect = 0.026 Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 1/89 (1%) Query: 71 QAEAKVEKYKETIRRAMELSQKKKVDAGMFKVSLRKSKKVEILDETKIPLDYMQEKIEYK 130 + A E Y E+ + ++K K + FK S+ K E +ET + Q+ ++ Sbjct: 30 EVNAMNEHYTESDIKRNHKTEKNKTEKEKFKDSINNLVKTEFTNETLDKIQQTQDLLKKI 89 Query: 131 PMKS-EISKALKSGIDISGVELIETESLQ 158 P EI L I + ++L+E + LQ Sbjct: 90 PKDVLEIYSELGGEIYFTDIDLVEHKELQ 118
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 47.8 bits (113), Expect = 2e-07 Identities = 50/287 (17%), Positives = 97/287 (33%), Gaps = 29/287 (10%) Query: 454 LTKESDETKKLKKEQEGLVESNKQLRDSVREGVQERKKGLESVKESTAAHQKLADEIIKL 513 T +S + K L+ E+ L L + + + +A + L E L Sbjct: 136 STADSAKIKTLEAEKAALAARKADLE-------KALEGAMNFSTADSAKIKTLEAEKAAL 188 Query: 514 AAKENKTAGEKRNLKNKIDELNGSIDGLNLAYDKNSNSLSHNADQIKSRISAMEAESTWQ 573 A++ + N + I L + + ++ + S Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKA----DLEKALEGAMNFS--- 241 Query: 574 TAQQNLLNIEQKRSEVSKKLAENAELRKKWNEEANVSDSVRKEKIAELTEEEAKLKNMQT 633 + I+ +E + A AEL K E A + KI L E+A L+ + Sbjct: 242 --TADSAKIKTLEAEKAALEARQAELEKA-LEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 634 QLQEEYNKTSATQQAAADAMAAAEESGSARQVIAYENMSEAQRTAIDNMRTKYSELLETT 693 L+ + +A +Q+ + A+ E + +Q+ A E Q + R L+ + Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASRE--AKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 356 Query: 694 TSIFDAIE----------QKTALSVDQMNANLEKNRAATEQWATNLE 730 +E + + S + +L+ +R A +Q LE Sbjct: 357 REAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALE 403 Score = 30.8 bits (69), Expect = 0.032 Identities = 43/240 (17%), Positives = 77/240 (32%), Gaps = 33/240 (13%) Query: 454 LTKESDETKKLKKEQEGLVESNKQLRDSVREGVQERKKGLESVKESTAAHQKLADEIIKL 513 T +S + K L+ E+ L L ++ + +K A L +L Sbjct: 206 STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL 265 Query: 514 AAKENKTAGEKRNLKNKIDELNGSIDGL----------NLAYDKNSNSLSHNADQIKSRI 563 KI L L + + N SL + D + Sbjct: 266 EKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAK 325 Query: 564 SAMEAESTWQTAQQNLLNIEQKRSEVSKKLAENAELRK-------KWNEEANVSDSVR-- 614 +EAE Q ++ E R + + L + E +K K E+ +S++ R Sbjct: 326 KQLEAEH--QKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 383 Query: 615 -----------KEKI-AELTEEEAKLKNMQTQLQEEYNKTSATQQAAADAMAAAEESGSA 662 K+++ L E +KL ++ +E T++ A+ A E A Sbjct: 384 LRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKA 443
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 507 bits (1307), Expect = 0.0 Identities = 260/343 (75%), Positives = 287/343 (83%), Gaps = 15/343 (4%) Query: 1 MSENIPLRVQFKRMKAAEWARSDVILLESEIGFETDTGFARAGDGHNRFSDLGYISPLDY 60 M+E IPLRVQFKRM A EW RSDVILLESEIGFETDTG+A+ GDG N+FS L Y+ Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL----- 55 Query: 61 NLLTNKPNIDGLATKVETAQKLQQ----KADKETVYTKAESKQELDKKLNLKGGVMTGQL 116 NKP++ A K ET K+ + KADK VY KAESK ELDKKLNLKGGVMTGQL Sbjct: 56 ----NKPDLGAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQL 111 Query: 117 KFKPAAT-VAYSSSTGGAVNIDLSSSRGAGVVVYSDNDTSDGPLMSLRTGKETFNQSALF 175 +FKP + + SSS GGA+NID+S S GAGVVVYS+NDTSDGPLMSLRTGKETFNQSALF Sbjct: 112 QFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALF 171 Query: 176 VDYKGTTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQLRGSEKALGTLKITHENPSIG 235 VDY G TNAVNIAMRQPTTPNFSSALNITSGNENGSAMQ+RG EKALGTLKITHENP++ Sbjct: 172 VDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVE 231 Query: 236 ADYDKNAAALSIDIVKKTNGA-GTAAQGIYINSTSGTTGKLLRIRNLSDDKFYVKSDGGF 294 A+YD+NAAALSIDIVKK G GTAAQGIYINSTSGTTGKLLRIRNL DDKFYVK DGGF Sbjct: 232 ANYDENAAALSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGF 291 Query: 295 YAKETSQIDGNLKLKDPTANDHAATKAYVDKAISELKKLILKK 337 YAK+TSQIDGNLKLK+PTA+DHAATKAYVD + +LK L++ K Sbjct: 292 YAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMDK 334
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.4 bits (79), Expect = 0.001 Identities = 17/173 (9%), Positives = 44/173 (25%), Gaps = 7/173 (4%) Query: 117 TEIVNSARGVATRISEDTDKKLALINDTIDGIRREYRDADRKLSASYQAGIEGLKATMAN 176 + + + +++ + + E Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214 Query: 177 DKIGLQAEIKA--SAQGLSQKYDNELRQLSAKITTTSSGTTEAYESKLAGLRAEFTRSNQ 234 +++ + A I + + + ++ L K + E+K E R + Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHK-QAIAKHAVLEQENKYVEAVNEL-RVYK 272 Query: 235 GTRTELESQISGLRAVQQTTASQISQEIRNREGAVSRVQQGLDSYQRRLQSAE 287 ++ES+I + Q EI + + + L E Sbjct: 273 SQLEQIESEILSAKEEYQLVTQLFKNEIL---DKLRQTTDNIGLLTLELAKNE 322
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 87.1 bits (215), Expect = 2e-21 Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 8/125 (6%) Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADASWTGKSFDTKTQEEYQPGIVTDIV 75 L AQA LESGWG+ P LFG+KA +W G + T E Y+ G + Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230 Query: 76 DRFRAYDSWTDSIIDHGKFLNDNPRYKAVIGETDYKKACYAIKAAGYATASSYVELLIQL 135 +FR Y S+ +++ D+ L NPRY AV ++ A++ AGYAT Y L + Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290 Query: 136 IEEND 140 I++ Sbjct: 291 IQQMK 295
>BACTRLTOXIN#Bacterial toxin signature. Length = 266 Score = 44.9 bits (106), Expect = 6e-08 Identities = 44/222 (19%), Positives = 85/222 (38%), Gaps = 36/222 (16%) Query: 6 LKEIYN-KEIIEKNNISINAKQGTQLIFNTDENTTVWNDNTFKKVISSNLSPSQERMFNV 64 +K +Y+ + S++ LI+N + D KV + L+ + + Sbjct: 51 MKYLYDDHYVSATKVKSVDKFLAHDLIYNISDKKLKNYD----KVKTELLNEDLAKKYK- 105 Query: 65 GDHVNIFAIVKSYHVVCKEQFNYSD---------GGIIKTSDVKPEE---KAIYINIFGE 112 + V+++ + + N GGI K + + + + ++ Sbjct: 106 DEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYEN 165 Query: 113 KELRTLTAKDKITFKNNIVTLQEIDVRLRKSLMGDSKIKLYEYD-SLYKKGFWDIHYKDG 171 K T ++ VT QE+D++ R L+ +K LYE++ S Y+ G+ +G Sbjct: 166 KRN---TISFEVQTDKKSVTAQELDIKARNFLI--NKKNLYEFNSSPYETGYIKFIENNG 220 Query: 172 GIRHTNLFTYPD-----------YTDNETIDMSKVSHFDVHL 202 ++ P Y DN+T+D SK +VHL Sbjct: 221 NTFWYDMMPAPGDKFDQSKYLMMYNDNKTVD-SKSVKIEVHL 261
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 92.6 bits (230), Expect = 1e-23 Identities = 42/163 (25%), Positives = 73/163 (44%), Gaps = 12/163 (7%) Query: 2 LIVEDEYLVRQGIRSLVDFSQFKIDRVNEAENGQLAWDLFQKEPYDIVLTDINMPKLNGI 61 L+ +D+ +R + + + + V N W D+V+TD+ MP N Sbjct: 7 LVADDDAAIRTVLNQALSRAGY---DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 QLAELIKQESPQTHLVFLTGYDDFNYALSALKLGADDYLLKPFSKADVEDMLGKLQQKLD 121 L IK+ P ++ ++ + F A+ A + GA DYL KPF D+ +++G + + L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRALA 120 Query: 122 LSKKTETIQELVEQPQKEVSAIAMAIHE------RLADSDLTL 158 K+ + E Q + + A+ E RL +DLTL Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>PF06580#Sensor histidine kinase Length = 349 Score = 183 bits (466), Expect = 7e-55 Identities = 57/203 (28%), Positives = 101/203 (49%), Gaps = 10/203 (4%) Query: 362 EKAIGQYRLQALASQINPHFLYNTLDTIIWMAEFNDSKRVVEVTKSLAKYFRLALNQGN- 420 + +L AL +QINPHF++N L+ I + D + E+ SL++ R +L N Sbjct: 155 ASMAQEAQLMALKAQINPHFMFNALNNIRALIL-EDPTKAREMLTSLSELMRYSLRYSNA 213 Query: 421 EYIRLADELDHVSQYLFIQKQRYGDKLSYEVQGLDVYADFVIPKLILQPLVENAIYHGIK 480 + LADEL V YL + ++ D+L +E Q D +P +++Q LVEN I HGI Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273 Query: 481 EVDRKGMIKVTVSDTAQHLMLTVWDNGKGIEDSSLTNSQSLLARGGVGLKNVDQRLKLHY 540 ++ + G I + + + L V + G ++ ++ G GL+NV +RL++ Y Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKEST-------GTGLQNVRERLQMLY 326 Query: 541 GEGYHMTIHSQSDQFTEIQLSLP 563 G + + + + + + +P Sbjct: 327 GTEAQIKLSEKQGKVNAM-VLIP 348
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 123 bits (310), Expect = 7e-39 Identities = 60/146 (41%), Positives = 92/146 (63%), Gaps = 2/146 (1%) Query: 1 MNKKETRHRLIRSLISETTIHTQQELQERLQKNGITITQATLSRDMKELNLVKVTSGNDT 60 MNK + RH IR +I+ I TQ EL + L+K+G +TQAT+SRD+KEL+LVKV + N + Sbjct: 1 MNKGQ-RHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGS 59 Query: 61 HYEALAISQTRWEH-RLRFYMEDALVMLKIVQHQIILKTLPGLAQSFGSILDAMQIPEIV 119 + +L Q +L+ + DA V + H I+LKT+PG AQ+ G+++D + EI+ Sbjct: 60 YKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIM 119 Query: 120 ATVCGDDTCLIVCEDNEQAKACYETL 145 T+CGDDT LI+C ++ K + + Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKI 145
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 579 bits (1493), Expect = 0.0 Identities = 192/410 (46%), Positives = 277/410 (67%), Gaps = 9/410 (2%) Query: 5 TPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDE 64 PI+++SEIG+LKKVLLHRPG+E+ENL P ++ LFDDIP+LE A++EH+ FA L++ Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65 Query: 65 GIEVLYLETLAAESLVTP-EIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEK 123 +E+ Y+E L +E LV+ + FI +++ EA I+ T +++ ++ +I K Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSL-TIDNMISK 124 Query: 124 TMAGVQKSELPEIPASEKGLTDLVESSYPFAIDPMPNLYFTRDPFATIGTGVSLNHMFSE 183 ++GV EL +S L DLV + F IDPMPN+ FTRDPFA+IG GV++N MF++ Sbjct: 125 MISGVVTEELKNYTSS---LDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTK 181 Query: 184 TRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRTDAA 243 R RET++ +YIF +HP+Y VP+ +R E +EGGDELVL+K +L +GIS+RT+A Sbjct: 182 VRQRETIFAEYIFKYHPVYKE-NVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAK 240 Query: 244 SIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSV 303 S+EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y + Sbjct: 241 SVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVL 300 Query: 304 TYDNE--ELHIVEEKGDLADLLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAP 361 TY+ ++HI +EK + D+L+ LG K+D+I+C G +L+ REQWNDG+N L IAP Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359 Query: 362 GVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPRCMSMPFEREDI 411 G ++ Y+RN +TN + E G+K+ +I SEL RGRGGPRCMSMP REDI Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 407 bits (1048), Expect = e-146 Identities = 141/315 (44%), Positives = 204/315 (64%), Gaps = 6/315 (1%) Query: 3 KQKIVVALGGNAIL--STDASAKAQQEALISTSKSLVKLIKEGHEVIVTHGNGPQVGNLL 60 +++V+ALGGNA+ S + + + T++ + ++I G+EV++THGNGPQVG+LL Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 61 LQQAAADSEKN-PAMPLDTCVAMTEGSIGFWLVNALDNELQAQGIQKEVAAVVTQVIVDA 119 L A + PA P+D AM++G IG+ + AL NEL+ +G++K+V ++TQ IVD Sbjct: 62 LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK 121 Query: 120 KDPAFENPTKPIGPFLTEEDAKKQMAESGASFKEDAGRGWRKVVPSPKPVGIKEANVIRS 179 DPAF+NPTKP+GPF EE AK+ E G KED+GRGWR+VVPSP P G EA I+ Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181 Query: 180 LVDSGVVVVSAGGGGVPVVEDATSKTLTGVEAVIDKDFASQTLSELVDADLFIVLTGVDN 239 LV+ GV+V+++GGGGVPV+ + + GVEAVIDKD A + L+E V+AD+F++LT V+ Sbjct: 182 LVERGVIVIASGGGGVPVILED--GEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239 Query: 240 VYVNFNKPDQTKLEEVTVSQMKEYITQDQFAPGSMLPKVEAAIAFVENKPNAKAIITSLE 299 + + + L EV V ++++Y + F GSM PKV AAI F+E +AII LE Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEW-GGERAIIAHLE 298 Query: 300 NIDNVLSANAGTQII 314 L GTQ++ Sbjct: 299 KAVEALEGKTGTQVL 313
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 153 bits (388), Expect = 2e-50 Identities = 58/157 (36%), Positives = 94/157 (59%), Gaps = 2/157 (1%) Query: 5 IGLYTGSFDPVTNGHLDIVKRASGLFDQIYVGIFDNPTKKSYFKLEVRKAMLTQALADFT 64 +Y GSFDP+T GHLDI++R LFDQ+YV + NP K+ F ++ R + +A+A Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61 Query: 65 NVIVVTSHERLAIDVAKELRVTHLIRGLRNATDFEYEENLEYFNHLLAPNIETVYLISRN 124 N V + E L ++ A++ + ++RGLR +DFE E + N LA ++ETV+L + Sbjct: 62 NAQVDSF-EGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120 Query: 125 KWQALSSSRVRELIHFQSSLEGLVPQSVIAQV-EKMN 160 ++ LSSS V+E+ F ++E VP V A + ++ + Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 124 bits (312), Expect = 5e-41 Identities = 82/91 (90%), Positives = 87/91 (95%) Query: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSTIEAFLAEGEKVQLIGFGNFEVRERAARK 60 MANKQDLIAKVAEATELTKKDSAAAVDAVFS + ++LA+GEKVQLIGFGNFEVRERAARK Sbjct: 1 MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARK 60 Query: 61 GRNPQTGAEIEIAASKVPAFKAGKALKDAVK 91 GRNPQTG EI+I ASKVPAFKAGKALKDAVK Sbjct: 61 GRNPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.0 bits (62), Expect = 0.026 Identities = 13/37 (35%), Positives = 18/37 (48%) Query: 119 KSEETEDYITDYVEGLVAAGLGAYQEDNLHMKVKLRS 155 K E +D YVE A Y E+N ++K+RS Sbjct: 48 KQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRS 84
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 39.6 bits (92), Expect = 5e-05 Identities = 25/133 (18%), Positives = 59/133 (44%), Gaps = 7/133 (5%) Query: 503 LGASGQGLSSMLSSAWGNIQTVVSTAKNMITLAIDGIKL--VFSNLGNAGNILKGLLSAA 560 G G + + G ++ST +N + A+ +K+ + + GN+ L+ A Sbjct: 124 AGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSELAKA 183 Query: 561 WSAMQNAVVIAKGIINSAISAIKTAFSSFGNLVSSVSGTIKSVIGSLKNAFYSLASIDLV 620 + N +V +N+ +++ ++ G+++S+ + + N +L ++D + Sbjct: 184 SIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKH-----LNGVGNKLQNLPNLDNI 238 Query: 621 GAGRAIMQGFLNG 633 GAG + G L+ Sbjct: 239 GAGLDTVSGILSA 251
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 511 bits (1318), Expect = 0.0 Identities = 242/358 (67%), Positives = 284/358 (79%), Gaps = 39/358 (10%) Query: 1 MSADEWARSDVILLEGEIGFETDTGYAKFGNGKSKFSALKYLTGPKGPKGDTGFQGKTGG 60 M+A+EW RSDVILLE EIGFETDTGYAKFG+GK++FS LKYL Sbjct: 14 MTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL------------------ 55 Query: 61 TGPRGPAGKPGTTDYNQLQNKPNLDAFARKQETDSKITELKSNKADKNAVYLKAESNAKL 120 NKP+L AFA+K+ET+SKIT+L+S+KADKNAVYLKAES +L Sbjct: 56 -------------------NKPDLGAFAQKEETNSKITKLESSKADKNAVYLKAESKIEL 96 Query: 121 DEKLSLTGGIVTGQLQFKPN-SGIKPSSSVGGAINIDMSKSEGAAMVMYTNKDTTDGPLM 179 D+KL+L GG++TGQLQFKPN SGIKPSSSVGGAINIDMSKSEGA +V+Y+N DT+DGPLM Sbjct: 97 DKKLNLKGGVMTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLM 156 Query: 180 ILRSDKDTFDQSAQFVDYSGKTNAVNIVMRQPSAPNFSSALNITSANEGGSAMQIRGVEK 239 LR+ K+TF+QSA FVDYSGKTNAVNI MRQP+ PNFSSALNITS NE GSAMQIRGVEK Sbjct: 157 SLRTGKETFNQSALFVDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEK 216 Query: 240 ALGTLKITHENPNVKANYDENAAALSIDIVKKTN-GEGTAAQGIYINSSTGTTGKMLRIR 298 ALGTLKITHENPNV+ANYDENAAALSIDIVKK G+GTAAQGIYINS++GTTGK+LRIR Sbjct: 217 ALGTLKITHENPNVEANYDENAAALSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIR 276 Query: 299 NKNEDKFYVGPDGGFHSGANSTVAGNLTVKDPTSGKHAATKDYVDEKIAELKKLILKK 356 N +DKFYV DGGF++ S + GNL +K+PT+ HAATK YVD ++ +LK L++ K Sbjct: 277 NLGDDKFYVKHDGGFYAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMDK 334
>PF06580#Sensor histidine kinase Length = 349 Score = 25.6 bits (56), Expect = 0.038 Identities = 7/45 (15%), Positives = 18/45 (40%) Query: 29 LFLAIAIFGMMVTVSYFSYRDARQYYEPQIYGLRTQLSMTQKQLK 73 + + + M ++ YF + + Y + +I + + QL Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLM 164
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 495 bits (1275), Expect = e-179 Identities = 242/354 (68%), Positives = 275/354 (77%), Gaps = 35/354 (9%) Query: 1 MTTQGWESSSDILMEREIGIDMTTGYPKVGDGKNKFKDLKDLRGPMGPQGPTGERGPIGP 60 MT + W S IL+E EIG + TGY K GDGKN+F LK L Sbjct: 14 MTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL------------------ 55 Query: 61 TGPIGKPGTTDYNQLQNKPNLDAFAQKKETNSKITKLESSKADKSAVYSKAESKIELDKK 120 NKP+L AFAQK+ETNSKITKLESSKADK+AVY KAESKIELDKK Sbjct: 56 ----------------NKPDLGAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKK 99 Query: 121 LSLTGGIVTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAAMVMYTNKDTTDGPLMILR 180 L+L GG++TGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGA +V+Y+N DT+DGPLM LR Sbjct: 100 LNLKGGVMTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLR 159 Query: 181 SDKETFNQSALFVDYSGKTNAVNIVMRQPSTPNFSSALNITSANEGGSAMQIRGVEKALG 240 + KETFNQSALFVDYSGKTNAVNI MRQP+TPNFSSALNITS NE GSAMQIRGVEKALG Sbjct: 160 TGKETFNQSALFVDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALG 219 Query: 241 TLKITHENPNVKANYDENAAALSIDIVKKTN-GEGTAAQGIYINSSTGTTGKMLRIRNKN 299 TLKITHENPNV+ANYDENAAALSIDIVKK G+GTAAQGIYINS++GTTGK+LRIRN Sbjct: 220 TLKITHENPNVEANYDENAAALSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLG 279 Query: 300 EDKFYVGPDGGFHSGANSTVTGNLTVKDPTSEKHAATKKYVDEKIAELKKLIQK 353 +DKFYV DGGF++ S + GNL +K+PT++ HAATK YVD ++ +LK L+ Sbjct: 280 DDKFYVKHDGGFYAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMD 333
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 6e-04 Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 29/195 (14%) Query: 171 LKLDLNKANEQTASLQASINGLRQEYQDAERKLSASYQTGINGLKA-TMANDKY--DLKA 227 LKL A T Q+S L Q + R S +N L + ++ Y ++ Sbjct: 125 LKLTALGAEADTLKTQSS---LLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 228 EIQATARGLSQE----YDNKLHQLSAKIKTTSSG------TTEAYENKLAGLRAEFTR-- 275 E L +E + N+ +Q + + YEN ++ Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 276 --SNQG-----TRTELESQISGLRAVQQTTASQISQEIRDRTGAVSRVQQDLESYQR--- 325 ++ E E++ + SQ+ Q + A Q + ++ Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301 Query: 326 -RLQDAEDNYSSLTH 339 +L+ DN LT Sbjct: 302 DKLRQTTDNIGLLTL 316
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 93.6 bits (232), Expect = 1e-23 Identities = 45/123 (36%), Positives = 64/123 (52%), Gaps = 8/123 (6%) Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADSSWTGKSFDTKTQEEYQAGVVTDIV 75 L AQA LESGWG+ P LFG+KA +W G + T E Y+ G + Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230 Query: 76 DRFRAYDSWTDSIIDHGKFLNDNPRYKAVVGETDYKKACHAIKDAGYATASGYAELLIQI 135 +FR Y S+ +++ D+ L NPRY AV ++ A++DAGYAT YA L + Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290 Query: 136 IKE 138 I++ Sbjct: 291 IQQ 293
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 115 bits (290), Expect = 4e-29 Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 8/156 (5%) Query: 12 KIRNFSIIAHIDHGKSTLADRILEK---TETVSSREMQAQLLDSMDLERERGITIKLNAI 68 KI N ++AH+D GK+TL + +L + S + D+ LER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 ELNYTARDGETYIFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLA 128 + E ++IDTPGH+DF EV RSL+ +GAIL++ A G++AQT + Sbjct: 62 SFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 129 LDNDLEILPVINKIDLPAADPERVCHEVEDVIGLDA 164 + + INKID D V ++++ + + Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEI 152 Score = 93.4 bits (232), Expect = 6e-22 Identities = 44/214 (20%), Positives = 93/214 (43%), Gaps = 16/214 (7%) Query: 171 SAKAGIGIEEILEQIVEKVPAPTGDVDAPLQALIFDSVYDAYRGVILQVRIVNGIVKPGD 230 SAK IGI+ ++E I K + T + L +F Y R + +R+ +G++ D Sbjct: 220 SAKNNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRD 279 Query: 231 KIQMMSNGKTFDVTEVGIFTP-KAVGRDFLATGDVGYVAASIKTVADTRVGDTVTLANNP 289 +++ K +TE+ + D +G++ + + +GDT L Sbjct: 280 SVRISEKEKI-KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQRE 337 Query: 290 AKEALHGYKQMNPMVFAGIYPIESNKYNDLREALEKLQLNDASLQFE--PETSQALGFGF 347 E P++ + P + + L +AL ++ +D L++ T + + Sbjct: 338 RIENPL------PLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII---- 387 Query: 348 RCGFLGLLHMDVIQERLEREFNIDLIMTAPSVVY 381 FLG + M+V L+ ++++++ + P+V+Y Sbjct: 388 -LSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420 Score = 43.3 bits (102), Expect = 2e-06 Identities = 21/104 (20%), Positives = 41/104 (39%), Gaps = 12/104 (11%) Query: 393 VSNPSEFPDPTRVAFIE----------EPYVKAQIMVPQEFVGAVMELSQRKRGDFVTMD 442 VS P++F + + EPY+ +I PQE++ + + + V Sbjct: 510 VSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ 569 Query: 443 YIDDNRVNVIYQIPLAEIVFDFFDKLKSSTRGYASFDYDMSEYR 486 + +N V + +IP I ++ L T G + ++ Y Sbjct: 570 -LKNNEVILSGEIPARCI-QEYRSDLTFFTNGRSVCLTELKGYH 611
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 70.1 bits (171), Expect = 3e-15 Identities = 36/105 (34%), Positives = 48/105 (45%), Gaps = 12/105 (11%) Query: 250 QPGKPAPKTPEVPQKPDTAPHTPKTPQIPGQSKDVTPAPQNPSNRGLNKPQTQGGNQLAK 309 + K A + ++ + TP P + +G NQ Sbjct: 447 KLAKQAEELAKLRAGKASDSQTPDAK----------PGNKAVPGKGQAPQAGTKPNQ--N 494 Query: 310 TPAAHDTHRQLPATGETTNPFFTAAAVAIMTTAGVVAVAKRQENN 354 +T RQLP+TGET NPFFTAAA+ +M TAGV AV KR+E N Sbjct: 495 KAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 539
>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature. Length = 483 Score = 36.6 bits (84), Expect = 1e-04 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%) Query: 31 SGSQSDKLVIYNWGDYIDPALLKKFTKETGIEVQYETFDSNEAMYTKIKQGGTTYDIAVP 90 S S V+ N+ YI P LL++ + + + T+ SNE + TY +AV Sbjct: 21 SSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGF--ANNTYSVAVA 76 Query: 91 SDYTIDKMIKENLLNKLDKSKL 112 S Y + ++I+ +LL+ +D S+ Sbjct: 77 STYAVSELIERDLLSPIDWSQF 98
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 8e-15 Identities = 23/131 (17%), Positives = 50/131 (38%), Gaps = 2/131 (1%) Query: 3 VLIIEDDPMVDFIHRNYLEKLNLFDRIISSDSMKAVQSILTDYVIDLILLDIHITDGNGI 62 +L+ +DD + + L + + + + + + DL++ D+ + D N Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 63 QFLEKWRAQHIPCEVIIISAANDGNIIRDGFHLGIIDYLIKPFTFERFQESIQQFVTHRE 122 L + + V+++SA N G DYL KPF I + + + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 123 HLANQQLEQAQ 133 ++ + +Q Sbjct: 124 RRPSKLEDDSQ 134
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 66.2 bits (161), Expect = 3e-14 Identities = 76/299 (25%), Positives = 120/299 (40%), Gaps = 45/299 (15%) Query: 36 DLKVAMVTDTGGVDDKSFNQSAWEGLQSWGKEMGLQKGTGFDYFQSTSESEYATNLDTAV 95 LK ++TD G +DDKSFNQSA+E L++ + K TG + S + + ++A+ Sbjct: 61 KLKPVLITDEGKIDDKSFNQSAFEALKA------INKQTGIEINNVEPSSNFESAYNSAL 114 Query: 96 SGGYQLIYGIGFALKDAIAKAAGD------NEGVKFVIIDDIIEGKDNV-ASVTFADHEA 148 S G+++ GF + +I + +K + ID IE + S+ F E+ Sbjct: 115 SAGHKIWVLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKES 174 Query: 149 AYLAGIAAAKITKTK-----TVGFVGGMEGTVITRFEKGFEAGVKS---------VDDTI 194 A+ G A A + V GG +T F +GF G+ + T Sbjct: 175 AFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTS 234 Query: 195 QVKVDYAGSFGDAAKGKTIAAAQYAAGADVIYQAAGG---TGAGVFNEAKAINEKRSEAD 251 VK+D +G I + ADV Y G F + N+ + Sbjct: 235 PVKLD-SGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQ---- 289 Query: 252 KVWVIGVDRDQKDEGKYTSKDGKEANFVLASSIKEVGKAVQLINKQVADKKFPGGKTTV 310 +VIGVD DQ +D +L S +K + +AV + +K G K V Sbjct: 290 --YVIGVDSDQG-----MIQDKDR---ILTSVLKHIKQAVYETLLDLILEKEEGYKPYV 338
>PF06580#Sensor histidine kinase Length = 349 Score = 39.1 bits (91), Expect = 2e-05 Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 5/75 (6%) Query: 312 YGKIFYFQNQVNRSLRMDKALLKQLITILFDNAIKY----TDKNGIIEIIVKTTDKNLLI 367 + F+NQ+N ++ D + L+ L +N IK+ + G I + + + + Sbjct: 236 FEDRLQFENQINPAIM-DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTL 294 Query: 368 SVIDNGPGITDEEKK 382 V + G K+ Sbjct: 295 EVENTGSLALKNTKE 309
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 7e-04 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 2/41 (4%) Query: 36 KGELVVIL-GASGAGKSTVLNILGGMD-TVDAGQVIIDGKD 74 K + V+L G G GKST++N L G+D D I GKD Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 41.2 bits (96), Expect = 6e-07 Identities = 13/48 (27%), Positives = 25/48 (52%) Query: 4 RHTETKAYVKTALTTLLTEQSFETLTVSDLTKKAGINRGTFYLHYTDK 51 ET+ ++ L ++Q + ++ ++ K AG+ RG Y H+ DK Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK 55
>PF06580#Sensor histidine kinase Length = 349 Score = 28.3 bits (63), Expect = 0.024 Identities = 19/109 (17%), Positives = 32/109 (29%), Gaps = 15/109 (13%) Query: 19 LVGLVLLSVFGWVVGITGGYIYLPYSYRWLSWGMDNFPNLLDSALSYYYFWTALVLFVIT 78 ++ + +S+ G V +T Y WL M A V+ Sbjct: 42 MIFNIAISLMGLV--LTHAYRSFIKRQGWLKLNMGQI---------ILRVLPACVVIG-- 88 Query: 79 FLALLVIILYPRIYTEVQLRHKNKKGTLLLKKSAIESYVATAIQTAGLM 127 + + R+ + K TL L S I + V + L Sbjct: 89 MVWFVANTSIWRLLAFIN--TKPVAFTLPLALSIIFNVVVVTFMWSLLY 135
>BACTRLTOXIN#Bacterial toxin signature. Length = 266 Score = 353 bits (907), Expect = e-126 Identities = 149/261 (57%), Positives = 193/261 (73%), Gaps = 6/261 (2%) Query: 6 RILVVACVVFCAQLLSIS---VFASSQPDPTPEQLNKSSQFTGVMGNLRCLYDNHFVEGT 62 R+ + ++ A +L IS V A SQPDP P+ L+KSS+FTG MGN++ LYD+H+V T Sbjct: 4 RLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSAT 63 Query: 63 NVRSTGQLLQHDLIFPIKDLKLKNYDSVKTEFNSKDLAAKYKNKDVDIFGSNYYYNCYYS 122 V+S + L HDLI+ I D KLKNYD VKTE ++DLA KYK++ VD++GSNYY NCY+S Sbjct: 64 KVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123 Query: 123 EGNSCKNA--KKTCMYGGVTEHHRNQI-EGKFPNITVKVYEDNENILSFDITTNKKQVTV 179 ++ KTCMYGG+T+H N G N+ V+VYE+ N +SF++ T+KK VT Sbjct: 124 SKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTA 183 Query: 180 QELDCKTRKILVSRKNLYEFNNSPYETGYIKFIESSGDSFWYDMMPAPGAIFDQSKYLML 239 QELD K R L+++KNLYEFN+SPYETGYIKFIE++G++FWYDMMPAPG FDQSKYLM+ Sbjct: 184 QELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMM 243 Query: 240 YNDNKTVSSSAIAIEVHLTKK 260 YNDNKTV S ++ IEVHLT K Sbjct: 244 YNDNKTVDSKSVKIEVHLTTK 264
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 32.0 bits (72), Expect = 0.002 Identities = 13/42 (30%), Positives = 22/42 (52%) Query: 64 TKYAVAESVQKVEELSLAQKEIEQNAEQAKVTAEAAEKQAKS 105 T+ + VE+ A+ E E+ E KVT++ + KQ +S Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS 1136
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 385 bits (990), Expect = e-138 Identities = 168/235 (71%), Positives = 197/235 (83%), Gaps = 2/235 (0%) Query: 1 MSLAGGIVTGQLRLKPN-SGIEKSSSTGGAINIDMSKSKGAAMVMYTNKDTTDGPLMILR 59 ++L GG++TGQL+ KPN SGI+ SSS GGAINIDMSKS+GA +V+Y+N DT+DGPLM LR Sbjct: 100 LNLKGGVMTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLR 159 Query: 60 SNKDTFDQSVQFVDYRGKTNAVNIVMRQPPTPNFSSALNITSANEGGSAMQIRGVEKALG 119 + K+TF+QS FVDY GKTNAVNI MRQP TPNFSSALNITS NE GSAMQIRGVEKALG Sbjct: 160 TGKETFNQSALFVDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALG 219 Query: 120 TLKITHENPSVDKEYDKNAAALSIDIVKKKKGGGDGTAAQGIFINSSSGTTGKLLRIRNK 179 TLKITHENP+V+ YD+NAAALSIDIVKK+K GG GTAAQGI+INS+SGTTGKLLRIRN Sbjct: 220 TLKITHENPNVEANYDENAAALSIDIVKKQK-GGKGTAAQGIYINSTSGTTGKLLRIRNL 278 Query: 180 NEDKFYVNPDGGFHSYADSIVDGNLTVKNPTSGKHAATKDYVDKKFDELKKLIQK 234 +DKFYV DGGF++ S +DGNL +KNPT+ HAATK YVD + +LK L+ Sbjct: 279 GDDKFYVKHDGGFYAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMD 333
>PF06872#EspG protein Length = 398 Score = 31.2 bits (70), Expect = 0.002 Identities = 15/35 (42%), Positives = 22/35 (62%), Gaps = 3/35 (8%) Query: 59 RGVGDVKMETEAIDIPFD---VLKKILGYKDGSSS 90 RG+G+ K+ +DIP D +L+ LG KD +SS Sbjct: 208 RGLGNSKLSLNGVDIPADAQKLLRNTLGLKDTNSS 242
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 32.7 bits (74), Expect = 0.002 Identities = 30/150 (20%), Positives = 51/150 (34%), Gaps = 21/150 (14%) Query: 134 KAAVQRAVEQVTVNYDIYEALGSKRNELYAEIEKSLSERLAKESIELVSVTLTDQDAGDE 193 A V A A S+ E AE K S+ + K + T +++ E Sbjct: 1017 IARVDEAPVP-----PPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE 1071 Query: 194 -----------IEKAIKDESVKQKQVDSAKQ-----DKEKAKIEAETKQIQAQAEADAQV 237 E A K+ Q K+ +EKAK+E E Q + + Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131 Query: 238 IKAKGEAESNNTKAASITDNLIKMKEAEAR 267 + + E + A D + +KE +++ Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161
>PF06580#Sensor histidine kinase Length = 349 Score = 25.2 bits (55), Expect = 0.048 Identities = 7/45 (15%), Positives = 18/45 (40%) Query: 29 LFLAIAIFGIMVTVSYFSYRDAQQYYEPQITGLRTQLSRTQKQLK 73 + + + M ++ YF + + Y + +I + + QL Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLM 164
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 48.5 bits (115), Expect = 2e-09 Identities = 13/64 (20%), Positives = 30/64 (46%) Query: 5 RQIKKTKTAIYSAFIALLQKKEYSKITVRDMITLANVGRSTFYAHYESKEMLQKELCEEL 64 ++ ++T+ I + L ++ S ++ ++ A V R Y H++ K L E+ E Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66 Query: 65 FHHL 68 ++ Sbjct: 67 ESNI 70
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.4 bits (105), Expect = 5e-07 Identities = 21/112 (18%), Positives = 45/112 (40%), Gaps = 13/112 (11%) Query: 170 QQLQDLNDAYADAQAEVNKAQIALNDTVVISSVSGTVVE-----VNNDIDPSSKNSQTLV 224 +L+ D E+ K + +V+ + VS V + + ++TL+ Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT----AETLM 357 Query: 225 HVATEGQ-LQVKGTLTEYDLANVKVGQSVKIKSKVYSNQEW---TGKISYVS 272 + E L+V + D+ + VGQ+ IK + + + GK+ ++ Sbjct: 358 VIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409 Score = 37.5 bits (87), Expect = 9e-05 Identities = 24/185 (12%), Positives = 53/185 (28%), Gaps = 29/185 (15%) Query: 21 ITLVLIITGVVLWKQQQNTLTADIAKEPYSTVSVTEGSIASSTLFSGTVKALSEEYIYFD 80 ++ + + + + V+ G + S S +K + + Sbjct: 62 YFIMGFLVIAFIL--------SVLG--QVEIVATANGKLTHSG-RSKEIKPIENSIV--- 107 Query: 81 ANKGNDATVTVKVGDQVTQGQQLVQYNTTTA-------QSAYDTAVRSLNKIGRQINHLK 133 + VK G+ V +G L++ A QS+ A + ++ Sbjct: 108 ------KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161 Query: 134 TYGVPAV--STETNKDEATGEETTTTVQPSAQQNANYKQQLQDLNDAYADAQAEVNKAQI 191 +P + E + EE +Q + ++ Q +AE Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA 221 Query: 192 ALNDT 196 +N Sbjct: 222 RINRY 226
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 30.4 bits (68), Expect = 0.012 Identities = 28/122 (22%), Positives = 50/122 (40%), Gaps = 11/122 (9%) Query: 15 KKTSYVTFFLMPILTTLLALSLSFSNNNQAKIGILDKDNSQISKQFIAQLKQNKKYDIFT 74 KK+ + L PI L A+++S NN+++ I +KD S+ + + K ++ Sbjct: 2 KKSKKILLGLSPIAAILPAVAVSCGNNDESNISFKEKDISKYTTTNANGKQVVKNAELLK 61 Query: 75 KIKKEHI--DHYLQDKSL-----EAVLTIDKGFS-DKVLQGKSQKL--NIRSIANSEITE 124 +K I + + DKS EA+ I+K + S S ++ Sbjct: 62 -LKPVLITDEGKIDDKSFNQSAFEALKAINKQTGIEINNVEPSSNFESAYNSALSAGHKI 120 Query: 125 WV 126 WV Sbjct: 121 WV 122
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.9 bits (70), Expect = 0.004 Identities = 15/76 (19%), Positives = 32/76 (42%), Gaps = 1/76 (1%) Query: 37 SYQDFLDVLLSLFQFVVIILVLFFYSATINLGEVLTFLTQTSWHWQILCYLVLYLMAIIE 96 S + ++ L S+ + V++ ++++ NL +L T L +L + +I Sbjct: 133 SIKSLVEFLKSILKVVLLSILIWII-IKGNLVTLLQLPTCGIECITPLLGQILRQLMVIC 191 Query: 97 MTLLVLILIFDVLLQK 112 V+I I D + Sbjct: 192 TVGFVVISIADYAFEY 207
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.3 bits (73), Expect = 0.004 Identities = 41/225 (18%), Positives = 86/225 (38%), Gaps = 22/225 (9%) Query: 171 NLYDNIARYKERLKDKSDQLTTFRNARKYAFISNLVGGKKQFEANVSEIKRLEYDLAHLQ 230 ++ + E + S + + A + L + + E + + Sbjct: 225 ARKADLEKALEGAMNFSTADSA-KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283 Query: 231 DTHQDKIDSDDIEKNQQKLQLRNTKLELESSLRD------KQRRLKLLDISIEFGLYPTE 284 T + + + + EK + Q + +S RD +++L+ +E +E Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343 Query: 285 SDLTELQQYFPDTNLKKLYEVEAYHKKLETIL------------DSEFSTE-RESLIAEI 331 + L++ D + + ++EA H+KLE D + S E ++ + + Sbjct: 344 ASRQSLRRDL-DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKAL 402 Query: 332 DDLESQLTTLNQELQELGNIPNLS-SEYLENYSKLTATINALKEQ 375 ++ S+L L + +EL L+ E E +KL A ALKE+ Sbjct: 403 EEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEK 447
>PF05043#Transcriptional activator Length = 493 Score = 25.7 bits (56), Expect = 0.033 Identities = 10/82 (12%), Positives = 28/82 (34%), Gaps = 10/82 (12%) Query: 10 YLTNLPALAHDSLLLSN----VSYQAT-----EALLKLYDQSRSLNKQVFLAFDKASSYS 60 L+++ + D + S+ + S + F+ F++ Sbjct: 45 DLSHVKSAFPDLIFHSSTNGIRIINTDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAE 104 Query: 61 PDANQL-LSENTVLRLSSNGNE 81 + +S +++ R+ S N+ Sbjct: 105 SICKEFYISSSSLYRIISQINK 126
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.9 bits (88), Expect = 6e-05 Identities = 28/141 (19%), Positives = 59/141 (41%), Gaps = 13/141 (9%) Query: 52 SVIGVLFNLFGGVIADSFKR----KKIIITTNILCGTACLVLSFLTKEQWLVYAIVLTNV 107 + G+L +L +I ++ ++ I GT ++L+F T W+ + I+ V Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT-RGWMAFPIM---V 308 Query: 108 ILAFMSAFSSPSYKAFTKEIVKKDGISQLNSLLETTSTVIKVTVPMVAIFLYKLLGIHGV 167 +LA P+ +A V ++ QL L +++ + P++ +Y + Sbjct: 309 LLAS-GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA----ASI 363 Query: 168 LLLDGLSFLIAALLISFILPV 188 +G +++ A L LP Sbjct: 364 TTWNGWAWIAGAALYLLCLPA 384
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 47.8 bits (113), Expect = 2e-07 Identities = 47/313 (15%), Positives = 94/313 (30%), Gaps = 10/313 (3%) Query: 209 AKVAKQFLELDANRKQLQLDILVKDIDIAQERQTKDTEALAVLQQDLASYYAKRQSMEED 268 + VA + + Q + D + + + + + + L+ + + +E Sbjct: 41 SAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEK 100 Query: 269 YQKFKQKKQVLSQESDQTQTTLLELTKLIADLEKQIELVKLESGQ---EAEKKAEAKKHL 325 +K + + + + + +L K + + E A K L Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160 Query: 326 EQLQEQLDGFQAEEKQRTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEF 385 E+ E F + + + L L + +L + FS+ ++TL E Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220 Query: 386 VLLMQKEAALSNQLTALKAHLDKEKQARQHKAQEYQLLVTKLDQLNDESQKAQAHYKAQK 445 L ++A L L + + E L + +L + A A Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280 Query: 446 EQVEMLLQNYQEGDKRVQELERDYQLNQERLFDLLDQ-------KKGKEARKASLESIQK 498 +++ L + +LE Q+ L KK EA LE K Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340 Query: 499 SHSQFYAGVRAVL 511 +R L Sbjct: 341 ISEASRQSLRRDL 353 Score = 30.4 bits (68), Expect = 0.049 Identities = 39/243 (16%), Positives = 89/243 (36%), Gaps = 18/243 (7%) Query: 169 KYKTRKKETQIKLNQTQDNLDRLEDIIYELDTQLAPLEKQAKVAKQFLELDANRKQLQLD 228 + + + LE L+ + A LEK + A F ++ Sbjct: 229 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA----DSAKIK 284 Query: 229 ILVKDIDIAQERQTKDTEALAVLQ-------QDLASYYAKRQSMEEDYQKFKQKKQVLSQ 281 L + + + VL +DL + ++ +E ++QK +++ ++ Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344 Query: 282 ESDQTQTTLLELTKLIADLEKQIELVKLESGQEAEKKAEAKKHLEQLQEQLDGFQAEEKQ 341 + L + LE + + ++ E+ ++ + L+ LD + +KQ Sbjct: 345 SRQSLRRDLDASREAKKQLEAEHQKLE-------EQNKISEASRQSLRRDLDASREAKKQ 397 Query: 342 RTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEFVLLMQKEAALSNQLTA 401 + L + +L +++ EL + + + +L L E L +K A + +L Sbjct: 398 VEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAK 457 Query: 402 LKA 404 L+A Sbjct: 458 LRA 460
>PF06580#Sensor histidine kinase Length = 349 Score = 40.6 bits (95), Expect = 8e-06 Identities = 29/188 (15%), Positives = 70/188 (37%), Gaps = 35/188 (18%) Query: 253 DETNRMMRMISDLL--NLSRIDNQVTQLAVEMTNFTAFITSILNRFDLVKNQHTGTGKVY 310 + M+ +S+L+ +L + + LA E+T +++ +F +++ ++ Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF---EDRLQFENQIN 247 Query: 311 EIVRDYPITSVWIEIDNDKMTQVIENILNNAIKYSPDGGKITVRMKTTDTQLIISISDQG 370 + D + + ++ ++EN + + I P GGKI ++ + + + + + G Sbjct: 248 PAIMDVQVPPMLVQT-------LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300 Query: 371 LGIPKTDLPLIFDRFYRVDKARSRAQGGTGLGLAIAKEIIKQ---QHHGFIWAKSDYGKG 427 K + TG GL +E ++ I GK Sbjct: 301 SLALKNT------------------KESTGTGLQNVRERLQMLYGTEAQ-IKLSEKQGKV 341 Query: 428 STFTIVLP 435 +++P Sbjct: 342 -NAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.8 bits (228), Expect = 1e-23 Identities = 29/133 (21%), Positives = 65/133 (48%), Gaps = 1/133 (0%) Query: 3 KILIVDDEKPISDIIKFNLTKEGYDIVTAFDGREAVTIFEEEKPDLIILDLMLPELDGLE 62 IL+ DD+ I ++ L++ GYD+ + DL++ D+++P+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VAKEIRKT-SHVPIIMLSAKDSEFDKVIGLEIGADDYVTKPFSNRELLARVKAHLRRTET 121 + I+K +P++++SA+++ + E GA DY+ KPF EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 IETAVAEENASSG 134 + + +++ Sbjct: 125 RPSKLEDDSQDGM 137
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.4 bits (97), Expect = 6e-06 Identities = 27/101 (26%), Positives = 40/101 (39%), Gaps = 4/101 (3%) Query: 66 LTVSYGLAKFYMGALGDRVSLRKLFSISLGASALICILIGFFNSSMVVLGILLVLCGVVQ 125 LT S G A G L D++ +++L + + + IGF S L I+ Sbjct: 60 LTFSIGTA--VYGKLSDQLGIKRLLLFGIIINCFGSV-IGFVGHSFFSLLIMARFIQGAG 116 Query: 126 GALAPA-SQAMIANYFPNKTRGGAIAGWNISQNMGSALLPL 165 A PA ++A Y P + RG A MG + P Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 99.0 bits (246), Expect = 3e-27 Identities = 67/252 (26%), Positives = 109/252 (43%), Gaps = 24/252 (9%) Query: 3 KVVLVTGCASGIGYAQARYFLRQGHHVYGVDKSDKPDLNGNFHFIKLDLSSELSPL---- 58 K+ +TG A GIG A AR QG H+ VD + + +E P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 59 -----------FKVVPSVDILCNTAGILDAYKPLLDVSDEEVEHLFDINFFATVKLTRHY 107 + + +DIL N AG+L + +SDEE E F +N +R Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRP-GLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 108 LRRMVEKQSGVIINMCSIASFIAGGGGVAYTSSKHALAGFTRQLALDYAKDQIHIFGIAP 167 + M++++SG I+ + S + + AY SSK A FT+ L L+ A+ I ++P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 168 GAVKTAM-----TASDFEP---GGLADWVARETPIGRWTEPDEVAELTGFLASGKARSMQ 219 G+ +T M + G + P+ + +P ++A+ FL SG+A + Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 220 GEIVKIDGGWTL 231 + +DGG TL Sbjct: 248 MHNLCVDGGATL 259
>INTIMIN#Intimin signature. Length = 939 Score = 27.3 bits (60), Expect = 0.042 Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 6/60 (10%) Query: 65 NGVKQSYPGEKEIKIINPSTQEVTRCYRISGWRADSQGSYTVTLDSPLQETDVVSLQIAD 124 NGV Q+ I T ++ + + G TVTL S VVS + A+ Sbjct: 587 NGVAQA--NVPVSFNIVSGTAVLSA----NSANTNGSGKATVTLKSDKPGQVVVSAKTAE 640
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 38.5 bits (89), Expect = 1e-05 Identities = 42/170 (24%), Positives = 70/170 (41%), Gaps = 27/170 (15%) Query: 81 INTSLDKAKGKLSQLTPELRDQVAQLDAATHRLVIPWNIVVYRYVYETFLRDIGVSHADL 140 IN L + G L+ PEL +V ++ A IP N++VYR G L Sbjct: 295 INNYL-ISNGPLNNPNPELDSKVNNIENALKLTPIPSNLIVYRRS--------GPQEFGL 345 Query: 141 TSYYRNHQFDPHILCKIK---------LGTRYTKHSFMSTT--ALKNGAMTHRPVEVRIC 189 T + F+ KI+ G T +F+ST+ ++ A R + +RI Sbjct: 346 TLTSPEYDFN-----KIENIDAFKEKWEGKVITYPNFISTSIGSVNMSAFAKRKIILRIN 400 Query: 190 VKKGAKAAFVEPYSAVPSEVELLFPRGCQLEV--VGAYVSQDHKKLHIEA 237 + K + A++ E E+L G + ++ V +Y KL ++A Sbjct: 401 IPKDSPGAYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGTVTKLILDA 450
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 92.2 bits (229), Expect = 5e-22 Identities = 42/160 (26%), Positives = 64/160 (40%), Gaps = 24/160 (15%) Query: 240 DIDWTQTDDDTKYESHGMHVTGIVAGNSKEAAATGERFLGIAPEAQVMFMRVFANDVMGS 299 + D D HG HV G +A +G+APEA ++ ++V G Sbjct: 74 EGDPEIFKDY---NGHGTHVAGTIAAT-----ENENGVVGVAPEADLLIIKVLNKQGSGQ 125 Query: 300 AESLFIKAIEDAVALGADVINLSLGTANGAQLSGSKPLMEAIEKAKKAGVSVVVAAGNER 359 + + I+ I A+ D+I++SLG L EA++KA + + V+ AAGNE Sbjct: 126 YDWI-IQGIYYAIEQKVDIISMSLGGP-----EDVPELHEAVKKAVASQILVMCAAGNEG 179 Query: 360 VYGSDHDDPLATNPDYGLVGSPSTGRTPTSVAAINSKWVI 399 D+ +G P SV AIN Sbjct: 180 DGDDRTDE----------LGYPGCYNEVISVGAINFDRHA 209 Score = 78.7 bits (194), Expect = 2e-17 Identities = 36/147 (24%), Positives = 58/147 (39%), Gaps = 18/147 (12%) Query: 537 FDSVVSKAPSQKGNEMNHFSNWGLTSDGYLKPDITAPGGDIYSTYNDNHYGSQTGTSMAS 596 ++ V+S + FSN + D+ APG DI ST Y + +GTSMA+ Sbjct: 194 YNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSMAT 247 Query: 597 PQIAGASLLVKQ-YLEKTQPNLPKEKIADIVKNLLMSNAQIHVNPETKTTTSPRQQGAGL 655 P +AGA L+KQ + +L + L+ SP+ +G GL Sbjct: 248 PHVAGALALIKQLANASFERDL----TEPELYAQLIKRT-------IPLGNSPKMEGNGL 296 Query: 656 LNIDGAVTSGLYVTGKDNYGSISLGNI 682 L + + G +S ++ Sbjct: 297 LYLTAVEELSRIFDTQRVAGILSTASL 323 Score = 40.6 bits (95), Expect = 4e-05 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 1/34 (2%) Query: 103 HDWVKTKGAWDKGYKGQGKVVAVIDTGIDPAHQS 136 + ++ W++ G+G VAV+DTG D H Sbjct: 26 VEMIQAPAVWNQTR-GRGVKVAVLDTGCDADHPD 58
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 46.0 bits (109), Expect = 3e-08 Identities = 21/118 (17%), Positives = 51/118 (43%), Gaps = 6/118 (5%) Query: 2 KILLIDDHRLFAKSIQLLFQQYD-EVDVIDTITSHFNDVTIDLSKYDIILLDINLTNISK 60 IL+ DD + + +V + + + + D+++ D+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVM---PD 59 Query: 61 ENGLEIAKELIQSTPHLKVVMLTGYVKSIYRERAKKVGAYGFVDKNIDPKQLISILKK 118 EN ++ + ++ P L V++++ + +A + GAY ++ K D +LI I+ + Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 29.8 bits (67), Expect = 0.006 Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 11/85 (12%) Query: 1 MKKKERHEKILDILKVDGFIKVKDIIDEM-----NISDMTARRDLDTLADKGLL-IRTHG 54 M K +RH KI +I+ + +++D + N++ T RD+ L L+ + T+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVKVPTNN 57 Query: 55 GAQYLDYSSAKDEGHEKTHTEKKVL 79 G+ YS D+ K+ L Sbjct: 58 GSYK--YSLPADQRFNPLSKLKRSL 80
>PF04605#Virulence-associated protein D (VapD) Length = 125 Score = 29.8 bits (67), Expect = 0.008 Identities = 7/44 (15%), Positives = 17/44 (38%), Gaps = 2/44 (4%) Query: 227 INGYKVNSWNDLTEAV-NLATRD-LGPSQTIKVTYKSHQRLKTV 268 + ++ L E + +L +D + +Q+LK + Sbjct: 80 FDITEIGEQYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLKDL 123
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 9/56 (16%) Query: 34 IVFVGPSGCGKSTTLRMIAGLEDISEGELKIGGEVVNDKSPKDRDIAMVFQNYALY 89 +V G G GKST + + GL+ S+ IG +D Y Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.0 bits (78), Expect = 6e-04 Identities = 10/30 (33%), Positives = 19/30 (63%) Query: 243 ALWSEHGNLVQTAQRLYIHRNSLQYKLDKF 272 AL + GN ++ A L ++RN+L+ K+ + Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>STREPKINASE#Streptococcus streptokinase protein signature. Length = 440 Score = 801 bits (2069), Expect = 0.0 Identities = 392/440 (89%), Positives = 409/440 (92%) Query: 1 MKNYLSIGVIALLFALTFGTVKPVHAIAGYGWLPDRPPVNNSQLVVSMAGIVEGTDKKVF 60 MKNYLS G+ ALLFALTFGTV V AIAG WL DRP VNNSQLVVS+AG VEGT++ + Sbjct: 1 MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDIS 60 Query: 61 INFFEIDLTSQHAHGGKTEQGLSPKSKPFATDNGAMPHKLEKADLLKAIQKQLIANVHSN 120 + FFEIDLTS+ AHGGKTEQGLSPKSKPFATD+GAM HKLEKADLLKAIQ+QLIANVHSN Sbjct: 61 LKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSN 120 Query: 121 DGYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPVQNQ 180 D YFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKP+QNQ Sbjct: 121 DDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQ 180 Query: 181 AKSVDVKYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKTHPGY 240 AKSVDV+YTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNK HPGY Sbjct: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY 240 Query: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKDREQAYGINKKSGLNEEINNTDLISEKY 300 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVK+REQAY INKKSGLNEEINNTDLISEKY Sbjct: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKY 300 Query: 301 YILKKGESPYDPFDRSHLKLFTIKYVDVNTNELLKSEQLLTASERNLDFRDLYDPCDKAK 360 Y+LKKGE PYDPFDRSHLKLFTIKYVDV+TNELLKSEQLLTASERNLDFRDLYDP DKAK Sbjct: 301 YVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360 Query: 361 LLYNNLDAFDIMDYTLTGKVEDNHDKNNRIVTVYMGKRPKGAKGSYHLAYDKDLYTEEER 420 LLYNNLDAF IMDYTLTGKVEDNHD NRI+TVYMGKRP+G SYHLAYDKD YTEEER Sbjct: 361 LLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEER 420 Query: 421 KAYSYLRDTETPIPDNPKDK 440 + YSYLR T TPIPDNP DK Sbjct: 421 EVYSYLRYTGTPIPDNPNDK 440
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 56.6 bits (136), Expect = 3e-13 Identities = 36/85 (42%), Positives = 43/85 (50%), Gaps = 1/85 (1%) Query: 2 PEQPGEKAPEKSKEVTPAPEKPADKEANQTPE-RRNGNMAKTPVANNHRRLPATGEQANP 60 + S+ P A Q P+ N K P+ R+LP+TGE ANP Sbjct: 455 LAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANP 514 Query: 61 FFTAAAVAVMTTAGVLAVTKRKENN 85 FFTAAA+ VM TAGV AV KRKE N Sbjct: 515 FFTAAALTVMATAGVAAVVKRKEEN 539
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 47.4 bits (113), Expect = 7e-08 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 6/53 (11%) Query: 46 IAIKDGLIVALG-SGEPDAE-----LVGPQTIMRSYKGKIATPGIIDCHTHLV 92 I +KDG I A+G +G PD + +VGP T + + +GKI T G +D H H + Sbjct: 88 IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI 140
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.2 bits (81), Expect = 0.001 Identities = 24/161 (14%), Positives = 57/161 (35%), Gaps = 16/161 (9%) Query: 266 GLSQLTQATTLSDEKAKGIQSLIVGLPVLNQGIQQLNTELSTLQPPNLNADELGNSLGAI 325 L +S+E+ + SLI +Q +T + LN D+ + Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIK---------EQFSTWQNQKYQKELNLDKKR-AERLT 218 Query: 326 AQAAKQVIAEETAAQNEELSALQA----TSVYQSLTAEQQGELAAALSQSDKSQTVSAAQ 381 A + + L + ++ + EQ+ + A ++ S + Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA--VNELRVYKSQLE 276 Query: 382 TILSSVQTLSTSLQSLSQEDQSKQLEQLKEAVAQIANQSNQ 422 I S + + Q ++Q +++ L++L++ I + + Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.3 bits (112), Expect = 4e-09 Identities = 20/134 (14%), Positives = 46/134 (34%), Gaps = 11/134 (8%) Query: 4 RKENTKQAILKAMVMLLKTESFDDITTVKLSKRAGISRSSFYTHYKDKYEMID------- 56 + T+Q IL + L + + +++K AG++R + Y H+KDK ++ Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 57 -YYQQTFFHKLEYIFEKKYQNKEQAFLEVFEFLQREQLLSSLLSANGTKEIQA---FIIN 112 + + + V E E+ L+ K ++ Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127 Query: 113 KVRLLITTDLQDKF 126 + + + + D+ Sbjct: 128 QAQRNLCLESYDRI 141
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 88.0 bits (218), Expect = 5e-22 Identities = 59/291 (20%), Positives = 118/291 (40%), Gaps = 20/291 (6%) Query: 4 SLLKGQGLADMLSGLG--FSDAILTQISLADRHGNIETTLVAIQHYLNQMARIRRKTVEV 61 +++G LAD + F ++ + G+++ L + Y Q ++R + + Sbjct: 113 KVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQA 172 Query: 62 ITYPLILLLFLFVMMLGLRRYLVPQLETQNQ---------------ITYFLNHFPAFFIG 106 + YP +L + ++ L +VP++ Q ++ + F + + Sbjct: 173 MIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLL 232 Query: 107 FCSGLILLFGMVWLRWRSQSRLKLYSRLSRYPFLGKLLKQYLTSYYAREWGTLIGQGLDL 166 + F + LR + + R+ + RL P +G++ + T+ YAR L + L Sbjct: 233 ALLAGFMAFRV-MLR-QEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPL 290 Query: 167 MTILDIMAIEKSSL-MKELAEDIRMSLLEGQAFHIKVATYPFFKKELSLMIEYGEIKSKL 225 + + I S+ + ++ EG + H + F + MI GE +L Sbjct: 291 LQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGEL 350 Query: 226 GAELEIYAQESWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAAILLPIYQ 276 + LE A +F SQ+ L +P + + +A ++ I AIL PI Q Sbjct: 351 DSMLERAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQ 401 Score = 34.8 bits (80), Expect = 3e-04 Identities = 32/129 (24%), Positives = 60/129 (46%), Gaps = 6/129 (4%) Query: 154 REWGTLIGQGLDLMTILDIMAIE-KSSLMKELAEDIRMSLLEGQAFHIKVATYP-FFKKE 211 R+ TL+ + L LD +A + + + +L +R ++EG + + +P F++ Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERL 134 Query: 212 LSLMIEYGEIKSKLGAELEIYA--QESWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAA 269 M+ GE L A L A E +Q S++ Q +I P + VVA+ +V I + Sbjct: 135 YCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQA--MIYPCVLTVVAIAVVSILLS 192 Query: 270 ILLPIYQNM 278 +++P Sbjct: 193 VVVPKVVEQ 201
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 37.9 bits (88), Expect = 7e-07 Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 4/82 (4%) Query: 1 MLLVILVISVLMLLFVPNLSKQKDRVTETGNAAVVKLVENQAELYELSQGSKPSLSQ-LK 59 +++VI++I VL L VPNL K++ + + + +EN ++Y+L P+ +Q L+ Sbjct: 15 IMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYPTTNQGLE 74 Query: 60 A--DGSITEKQEKAY-QDYYDK 78 + + Y ++ Y K Sbjct: 75 SLVEAPTLPPLAANYNKEGYIK 96
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 26.9 bits (59), Expect = 0.034 Identities = 17/71 (23%), Positives = 25/71 (35%), Gaps = 9/71 (12%) Query: 37 LLKRSHYLARHDQDNWLLFSHQL--REELSGARFYKVADNK-LYVEKGKKVLAFGQFKSH 93 K S ++ D D ++ R ++ +Y VA N YV KV G + Sbjct: 217 TFKYSGWVESSDNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRV 276 Query: 94 DFRKSASNGKG 104 N KG Sbjct: 277 T------NKKG 281
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 502 bits (1293), Expect = e-180 Identities = 209/401 (52%), Positives = 281/401 (70%), Gaps = 7/401 (1%) Query: 3 KTIAINAGSSSLKWQLYQMPEEAVLAQGIIERIGLKDSISTVKYDGKKEEQILDIHDHTE 62 K + IN GSSSLK+QL + + VLA+G+ ERIG+ DS+ T +G+K + D+ DH + Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 63 AVKILLNDLI--HFGIIAAYDEITGVGHRVVAGGELFKESVVVNDKVLEQIEELSVLAPL 120 A+K++L+ L+ +G+I EI VGHRVV GGE F SV++ D VL+ I + LAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 121 HNPGAAAGIRAFRDILPDITSVCVFDTSFHTSMAKHTYLYPIPQKYYTDYKVRKYGAHGT 180 HNP GI+A I+PD+ V VFDT+FH +M + YLYPIP +YYT YK+RKYG HGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 181 SHKYVAQEAAKMLGRPLEELKLITAHIGNGVSITANYHGKSVDTSMGFTPLAGPMMGTRS 240 SHKYV+Q AA++L +P+E LK+IT H+GNG SI A +GKS+DTSMGFTPL G MGTRS Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 241 GDIDPAIIPYLIEQDPELKDAADVVNMLNKKSGLSGVSGISSDMRDI-EAGLQEDNPDAV 299 G IDP+II YL+E+ E A +VVN+LNKKSG+ G+SGISSD RD+ +A + + A Sbjct: 242 GSIDPSIISYLMEK--ENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299 Query: 300 LAYNIFIDRIKKCIGQYFAVLNGADALVFTAGMGENAPLMRQDVIGGLTWFGMDIDPEKN 359 LA N+F R+KK IG Y A + G D +VFTAG+GEN P +R+ ++ GL + G +D EKN Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKN 359 Query: 360 -VFGYRGDISTPESKVKVLVISTDEELCIARDVERL-KNTK 398 V G IST +SKV V+V+ T+EE IA+D E++ ++ K Sbjct: 360 KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVESLK 400
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.1 bits (73), Expect = 0.002 Identities = 11/63 (17%), Positives = 26/63 (41%), Gaps = 2/63 (3%) Query: 50 ERGDHQLYFLDIEIGEYTRCGLELAAAIRQKDPNAVIVFVTTHSEFAPISFKYKVSALDF 109 GD L D+ + + +L I++ P+ ++ ++ + F + A D+ Sbjct: 44 AAGDGDLVVTDVVMPD--ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDY 101 Query: 110 IDK 112 + K Sbjct: 102 LPK 104
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 162 bits (411), Expect = 2e-48 Identities = 66/237 (27%), Positives = 115/237 (48%), Gaps = 20/237 (8%) Query: 31 VTAQSSSGWDQLVYLFARAIQWL-----SFDGSIGVGIILFTLTIRLMLMPLFNMQIKSS 85 + GW + ++ + L SF G+ G II+ T +R ++ PL Q S Sbjct: 324 LDLTVDYGWLWFI---SQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSM 380 Query: 86 QKMQDIQPELRELQRKYAGKDTQTRMKLAEESQALYKKYGVNPYASLLPLLIQMPVMIAL 145 KM+ +QP+++ ++ + + ++++E ALYK VNP PLLIQMP+ +AL Sbjct: 381 AKMRMLQPKIQAMRERLGDD----KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLAL 436 Query: 146 FQALTRVSFLKTGTF-LWV-ELAQHDHLYLLPVLAAVFTFLSTWLTNLAAKEKNVMMTVM 203 + L L+ F LW+ +L+ D Y+LP+L V F ++ + M + Sbjct: 437 YYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMS--PTTVTDPMQQKI 494 Query: 204 IYVMPLMIFFMGFNLASGVVLYWTVSNAFQVVQLLLLNNPFKIIAERQRLANEEKER 260 + MP++ SG+VLY+ VSN ++Q L+ E++ L + EK++ Sbjct: 495 MTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYR----GLEKRGLHSREKKK 547
>adhesinb#Adhesin B signature. Length = 310 Score = 30.2 bits (68), Expect = 0.010 Identities = 15/34 (44%), Positives = 20/34 (58%), Gaps = 2/34 (5%) Query: 3 MKKLASLVMLGASVLGLAACGGKSQKEAGASKSD 36 MKK LV+L + +GLAAC SQK + + S Sbjct: 1 MKKCRFLVLLLLAFVGLAACS--SQKSSTETGSS 32
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 25.2 bits (55), Expect = 0.017 Identities = 10/27 (37%), Positives = 14/27 (51%) Query: 33 QAISNGDEKPEDALKAFTEKANKTIKK 59 A NGD++ + AL F + KTI Sbjct: 289 AAFKNGDKRAQLALNVFAYRVKKTIGS 315
>SECA#SecA protein signature. Length = 901 Score = 1052 bits (2723), Expect = 0.0 Identities = 394/903 (43%), Positives = 560/903 (62%), Gaps = 73/903 (8%) Query: 1 MANILRKVIENDKG-ELRKLEKIAKKVESYADQMASLSDRDLQGKTLEFKERYQKGETLE 59 + +L KV + LR++ K+ + + +M LSD +L+GKT EF+ R +KGE LE Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61 Query: 60 QLLPEAFAVVREAAKRVLGLFPYRVQIMGGIVLHNGDVPEMRTGEGKTLTATMPVYLNAI 119 L+PEAFAVVREA+KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNA+ Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121 Query: 120 AGEGVHVITVNEYLSTRDATEMGEVYSWLGLSVGINLAAKSPAEKREAYNCDITYSTNSE 179 G+GVHV+TVN+YL+ RDA ++ +LGL+VGINL KREAY DITY TN+E Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181 Query: 180 VGFDYLRDNMVVRQEDMVQRPLNFALVDEVDSVLIDEARTPLIVSGAVSSETNQLYIRAD 239 GFDYLRDNM E+ VQR L++ALVDEVDS+LIDEARTPLI+SG + ++Y R + Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240 Query: 240 MFVKTLT------------SVDYVIDVPTKTIGLSDSGIDKAESYFNLS-------NLYD 280 + L + +D ++ + L++ G+ E +LY Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300 Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEDGEILIVDQFTGRTMEGRRFSDGLHQAIEA 340 N+ L H + ALRA+ + D+DY+V +DGE++IVD+ TGRTM+GRR+SDGLHQA+EA Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359 Query: 341 KEGVRIQEESKTSASITYQNMFRMYKKLAGMTGTAKTEEEEFREVYNMRIIPIPTNRPIA 400 KEGV+IQ E++T ASIT+QN FR+Y+KLAGMTGTA TE EF +Y + + +PTNRP+ Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419 Query: 401 RIDHTDLLYPTLESKFRAVVEDVKTRHAKGQPILVGTVAVETSDLISRKLVEAGIPHEVL 460 R D DL+Y T K +A++ED+K R AKGQP+LVGT+++E S+L+S +L +AGI H VL Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479 Query: 461 NAKNHFKEAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497 NAK H EA I+ AG AVTIATNMAGRGTDI LG Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539 Query: 498 ------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMRRFG 551 + V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LMR F Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599 Query: 552 SDRIKAFLDRMKLDEEDTVIKSGMLGRQVESAQKRVEGNNYDTRKQVLQYDDVMREQREI 611 SDR+ + ++ + + I+ + + + +AQ++VE N+D RKQ+L+YDDV +QR Sbjct: 600 SDRVSGMMRKLGM-KPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRA 658 Query: 612 IYANRRDVITANRDLGPEIKAMIKRTIDRAVDAHARSNR---KDAVDAIVTFARTSLVPE 668 IY+ R +++ + D+ I ++ + +DA+ + + + + Sbjct: 659 IYSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLD 717 Query: 669 ESIS--AKELRGLKDEQIKEKLYQRALAIYDQQLSKLRDQEAIIEFQKVLILMIVDNKWT 726 I+ + L +E ++E++ +++ +Y ++ + E + F+K ++L +D+ W Sbjct: 718 LPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWK 776 Query: 727 EHIDALDQLRNAVGLRGYAQNNPVVEYQAEGFKMFQDMIGAIEFDVTRTMMKAQIH-EQE 785 EH+ A+D LR + LRGYAQ +P EY+ E F MF M+ +++++V T+ K Q+ +E Sbjct: 777 EHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836 Query: 786 RERASQRATTAAPQNIQSQQSANTDD-------------LPKVERNEACPCGSGKKFKNC 832 E Q+ A + Q QQ ++ DD KV RN+ CPCGSGKK+K C Sbjct: 837 VEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQC 896 Query: 833 HGR 835 HGR Sbjct: 897 HGR 899
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 345 bits (888), Expect = e-120 Identities = 121/368 (32%), Positives = 194/368 (52%), Gaps = 23/368 (6%) Query: 7 RPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSN 66 RP A ++LQA+K+N++ V++ + ++VVKA+AYGHG ++ A+ DG+ + N Sbjct: 3 RPIQASLDLQALKQNLSIVRQAAT-HARVWSVVKANAYGHGIERIWSAI-GATDGFALLN 60 Query: 67 LDEALQLRQAGIDKEILIL-GVLLPNELELAVANAITVTIAS---LDWIALARLEKKECQ 122 L+EA+ LR+ G IL+L G +LE+ + +T + S L + ARL+ Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAP--- 117 Query: 123 GLKVHVKVDSGMGRIGLRSSKEVNLLIDSLKELGADVEGIFTHFATADEADDTKFNQQLQ 182 L +++KV+SGM R+G + + + + + +HFA A+ D + Sbjct: 118 -LDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGIS--GAMA 174 Query: 183 FFKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGS-DLSLPFPLQE 241 ++ GL SNSA ++WH + F+ VR GI+ YG +PSG L+ Sbjct: 175 RIEQAAEGL---ECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRP 231 Query: 242 ALSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNM-QGFSVLVDGQ 300 ++L S ++ V+ + AG+ VGYG YTA+ + +G V GYADG+ R+ G VLVDG Sbjct: 232 VMTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGV 291 Query: 301 FCEIIGRVSMDQLTIRLSKA--YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLL 358 +G VSMD L + L+ +GT V L G K I D+A T+ YE++C L Sbjct: 292 RTMTVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCAL 347 Query: 359 SDRIPRIY 366 + R+P + Sbjct: 348 ALRVPVVT 355
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 30.3 bits (68), Expect = 0.013 Identities = 12/36 (33%), Positives = 15/36 (41%) Query: 117 KPTDQPKPTDQPKPSPSKVDTAPASSLSRQLPEART 152 KP K +QPK V++ PAS P T Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLT 134
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 71.1 bits (174), Expect = 1e-15 Identities = 55/265 (20%), Positives = 104/265 (39%), Gaps = 24/265 (9%) Query: 304 VACVNQHPKTAKETEQQRIVATSVAVVDICDRLNLDLVGVCDSKLYTL----PKRYDAVK 359 + A + RIVA V++ L + GV D+ Y L P D+V Sbjct: 20 PLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVI 79 Query: 360 RVGLPMNPDIELIASLKPTWILSPNSLQEDLEPKYQKLDTEYGFLNLRSVEG------MY 413 VGL P++EL+ +KP++++ P + L +G Sbjct: 80 DVGLRTEPNLELLTEMKPSFMV----WSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMAR 135 Query: 414 QSIDDLGNLFQRQQEAKELRQQYQDYYRAFQAKRKGK-KKPKVLILMGLPGSYLVATNQS 472 +S+ ++ +L Q A+ QY+D+ R+ + + + +P +L + P LV S Sbjct: 136 KSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNS 195 Query: 473 YVGNLLDLAGGENVYQ--SDEKEFLSVNPEDMLA-KEPDLILRTAHAIPDKVKVMFDKEF 529 +LD G N +Q ++ +V+ + + A K+ D++ D +M Sbjct: 196 LFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALM----- 250 Query: 530 AENDIWKHFTAVKEGKVYDLDNTLF 554 +W+ V+ G+ + F Sbjct: 251 -ATPLWQAMPFVRAGRFQRVPAVWF 274
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 28.2 bits (63), Expect = 0.045 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 5/76 (6%) Query: 264 LASVATSIVGVVSFLGL---IVPHMSRLLVGSKHQILIPFSALLGAFVFLLADTLGRSLA 320 + S A + +GL H S+L++ Q +PFS L V + L Sbjct: 29 VVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFF-YLC 87 Query: 321 YPLEISPAIIMSIVGG 336 +PL ++ A +M+I Sbjct: 88 FPL-LTVAALMAIASH 102
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 31.7 bits (72), Expect = 0.003 Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 9/76 (11%) Query: 50 LAQSLKTKKNQLVGLLLPDISNPFF-PRLARGAEEYLKEKGYRVMLGNISDSEALEE--- 105 +++ L +Q+VG+ D N ++ L + E L + G++ +++D E + + Sbjct: 16 VSKRLLEAGHQVVGI---DNLNDYYDVSLKQARLELLAQPGFQFHKIDLADREGMTDLFA 72 Query: 106 --EYVHVLLQSNAAGI 119 + V + + + Sbjct: 73 SGHFERVFISPHRLAV 88
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 29.4 bits (66), Expect = 0.009 Identities = 42/160 (26%), Positives = 59/160 (36%), Gaps = 25/160 (15%) Query: 70 SLIIILWASMVHWVSASYCYLLLFSLLFSLF--DWRSQ------EYPFILWLFSFVSLLL 121 +L+ + A + + LLL +L +L D P + F L Sbjct: 118 ALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG 177 Query: 122 FYSIN---------YLSLILLLLGLLAHLRPFSIGAGDFFYLASLALVLDLTSLIWLIQL 172 F S+ YL L L +G GDF LA+L L +L ++ L Sbjct: 178 FVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLL 237 Query: 173 ASLAGITACLLL-------GIKRIPFIPYLSFGLFWIVLL 205 +SL G + L K IPF PYL+ WI LL Sbjct: 238 SSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIA-GWIALL 276
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 149 bits (377), Expect = 9e-49 Identities = 48/154 (31%), Positives = 84/154 (54%), Gaps = 4/154 (2%) Query: 19 KKEASNNEKT--KAVLNQAVADLSVAASIVHQVHWYMRGPGFLYLHPKMDELLDSLNANL 76 K E + +T + LN +++ + S +H+ HWY++GP F LH K +EL D + Sbjct: 2 KTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETV 61 Query: 77 DEMSERLITIGGAPYSTLAEFSKHSKLDETKGTYDKTVAQHLARLVEVYLYLSSLYQVGL 136 D ++ERL+ IGG P +T+ E+++H+ + + + + ++ + LV Y +SS + + Sbjct: 62 DTIAERLLAIGGQPVATVKEYTEHASITDGGN--ETSASEMVQALVNDYKQISSESKFVI 119 Query: 137 DITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERG 170 + +E D T DLF E EK +WML + G Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153
>PF03309#Bvg accessory factor Length = 271 Score = 31.7 bits (72), Expect = 0.003 Identities = 29/126 (23%), Positives = 43/126 (34%), Gaps = 14/126 (11%) Query: 5 LLGIDLGGTTIKFGILTAAGEVQE---KWAIETNILEGGKHIVPDIIASIKHRLDLYGLS 61 LL ID+ T G+++ +G+ + +W I T + D +A L G Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTE-----PEVTADELALTIDG--LIGDD 54 Query: 62 SADFVGIGMGSPGAVDRDTNTVTGAFNLNWKETQEVGSVVEKELGIPFAIDNDANVAALG 121 + G S V + V W V GIP +DN V A Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110 Query: 122 ERWVGA 127 +R V Sbjct: 111 DRIVNC 116
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 186 bits (473), Expect = 4e-53 Identities = 102/477 (21%), Positives = 187/477 (39%), Gaps = 97/477 (20%) Query: 8 IRNVAIIAHVDHGKTTLVDELLKQSHTLDERKELQE--RAMDSNDLEKERGITILAKNTA 65 I N+ ++AHVD GKTTL + LL S + E + + D+ LE++RGITI T+ Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62 Query: 66 VAYNDVRINIMDTPGHADFGGEVERIMKMVDGVVLVVDAYEGTMPQTRFVLKKALEQNLI 125 + + ++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + + Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 126 PIVVVNKIDKPSARP-------------------------------------AEVVDEVL 148 I +NKID+ + V E Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182 Query: 149 ELFIELGADDEQLE-----------------FPVVYASAINGTSSLSDDPADQEHTMAPI 191 + +E + LE FPV + SA N + + Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIG------------IDNL 230 Query: 192 FDTIIDHIPAPVDNSDEPLQFQVSLLDYNDFVGRIGIGRVFRGTVKVGDQVTLSKLDGTT 251 + I + + L +V ++Y++ R+ R++ G + + D V +S Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS----EK 286 Query: 252 KNFRVTKLFGFFGLERREIQEAKAGDLIAVSGMEDIFVGETITPTDCVEALPILRIDEPT 311 + ++T+++ E +I +A +G+++ + E + + + T + + P Sbjct: 287 EKIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345 Query: 312 LQMTFLVNNSPFAGREGKWITSRKVEER--LLAELQT----DVSLRVDPTDSPDKWTVSG 365 LQ T + K ++R LL L D LR + + +S Sbjct: 346 LQTT---------------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSF 390 Query: 366 RGELHLSILIETMRRE-GYELQVSRPEVIIKEIDGVKCEPFERVQIDTPEEYQGAII 421 G++ + + ++ + E+++ P VI E K E + I+ P A I Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAE--YTIHIEVPPNPFWASI 445 Score = 42.5 bits (100), Expect = 4e-06 Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 1/79 (1%) Query: 403 EPFERVQIDTPEEYQGAIIQSLSERKGDMLDMQMVGNGQTRLIFLIPARGLIGYSTEFLS 462 EP+ +I P+EY + +++D Q + N + L IPAR + Y ++ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595 
Query: 463 MTRGYGIMNHTFDQYLPVV 481 T G + Y Sbjct: 596 FTNGRSVCLTELKGYHVTT 614
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 30.4 bits (68), Expect = 0.010 Identities = 19/99 (19%), Positives = 31/99 (31%), Gaps = 10/99 (10%) Query: 154 FEQEDQLSKVKHLGAVTKVFKDANQMPESTQLE-AVKEYFSRDLKTLLFIGGSAGAHVFN 212 FE ++K + + N + S+ E A S K + G Sbjct: 83 FEALKAINKQTGI--------EINNVEPSSNFESAYNSALSAGHKIWVLNGFKHQQS-IK 133 Query: 213 QFISDHPELKQRYNIINITGDPHLNELSSHLYRVDYVTD 251 Q+I H E +R I I D + Y + + Sbjct: 134 QYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIK 172
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 47.4 bits (113), Expect = 5e-08 Identities = 42/191 (21%), Positives = 79/191 (41%), Gaps = 16/191 (8%) Query: 170 RKTVERAGIKVENIIISPLAMAKTILNEGEREFGATVIDMGGGQTTVASMRAQELQYTNI 229 R++ + AG + +I P+A A G+ V+D+GGG T VA + + Y++ Sbjct: 127 RESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSS 186 Query: 230 YAEGGEYITKDISKVLKTSLAI------AEALKFNFGQAEISEASITETVK-VDVV-GSE 281 GG+ + I ++ + AE +K G A + V+ ++ G Sbjct: 187 VRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246 Query: 282 EPVEVTERYLSEIISARIRHILDRVKQDLER------GRLLDLPGGIVLIGGGAIMPGVV 335 + + E + + I+ V LE+ + + G+VL GGGA++ + Sbjct: 247 RGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISE--RGMVLTGGGALLRNLD 304 Query: 336 EIAQEIFGVTV 346 + E G+ V Sbjct: 305 RLLMEETGIPV 315
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 136 bits (344), Expect = 1e-38 Identities = 70/231 (30%), Positives = 116/231 (50%), Gaps = 22/231 (9%) Query: 38 WEFLGKPMSYFIDYFANNAGLGYGLAIIIVTIIVRTLILPLGLYQSWKASYQS-EKMAFL 96 F+ +P+ + + + G +G +III+T IVR ++ PL KA Y S KM L Sbjct: 333 LWFISQPLFKLLKWIHSFVG-NWGFSIIIITFIVRGIMYPLT-----KAQYTSMAKMRML 386 Query: 97 KPVFEPINKRIKQANSQEEKMAAQTELMAAQRAHGINPLGGIGCLPLLIQMPFFSAMYFA 156 +P + + +R+ ++K E+MA +A +NPLGG C PLLIQMP F A+Y+ Sbjct: 387 QPKIQAMRERLG-----DDKQRISQEMMALYKAEKVNPLGG--CFPLLIQMPIFLALYYM 439 Query: 157 AQYTKGVSTSTFMG--IDLGSR--SLVLTAIIAALYFFQSWLSMMAVSEEQREQMKTMMY 212 + + + F DL ++ +L ++ FF +S V++ + +M Sbjct: 440 LMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPM---QQKIMT 496 Query: 213 TMPIMMIFMSFSLPAGVGLYWLVGGFFSIIQQ-LITTYLLKPRLHKQIKEE 262 MP++ P+G+ LY++V +IIQQ LI L K LH + K++ Sbjct: 497 FMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKK 547
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.8 bits (72), Expect = 7e-04 Identities = 26/120 (21%), Positives = 45/120 (37%), Gaps = 29/120 (24%) Query: 46 VALIDQEIVGYIEGPVVTTPILEDSLFHGVTKNPKTGGYIAITSLSIAKHFQQQGVGTAL 105 + ++ +G I+ + N GY I +++AK ++++GVGTAL Sbjct: 69 LYYLENNCIGRIK----------------IRSN--WNGYALIEDIAVAKDYRKKGVGTAL 110 Query: 106 LAALKDLVVAQQRTGLILTCHDYLIS---YYEMNGFINQGISESQHGGT--------LWY 154 L + GL+L D IS +Y + FI + + WY Sbjct: 111 LHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFPTANEIAIFWY 170
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 31.3 bits (71), Expect = 0.008 Identities = 16/55 (29%), Positives = 24/55 (43%), Gaps = 9/55 (16%) Query: 304 IINDTII--IDDFA-----HHPTEIVATIDAARQKYPSKEIVAIFQPHTFTRTIA 351 +I D ++ I D H+P I I A Q P +VA+F F +T+ Sbjct: 103 LITDDVLKAITDCIELAPLHNPANIEG-IKACTQIMPDVPMVAVFDT-AFHQTMP 155
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 37.1 bits (86), Expect = 1e-04 Identities = 21/87 (24%), Positives = 40/87 (45%), Gaps = 8/87 (9%) Query: 36 GVTRDRIYATGEWLNRQFSLIDTGGIDDVDAPFMEQIKHQAQIAMEEADVIVFVVSGKEG 95 G+T + +W N + ++IDT G D F+ ++ ++ D + ++S K+G Sbjct: 53 GITIQTGITSFQWENTKVNIIDTPGHMD----FLAEVYR----SLSVLDGAILLISAKDG 104 Query: 96 VTDADEYVSKILYRTNTPVILAVNKVD 122 V + L + P I +NK+D Sbjct: 105 VQAQTRILFHALRKMGIPTIFFINKID 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.024 Identities = 9/16 (56%), Positives = 12/16 (75%) Query: 45 IIGASGSGKSLLAHAI 60 I G SG+GK L+A A+ Sbjct: 165 ITGESGTGKELVARAL 180
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 34.0 bits (77), Expect = 0.002 Identities = 24/87 (27%), Positives = 35/87 (40%), Gaps = 2/87 (2%) Query: 197 IPKKDLSPSELAAAQAYWSQKQGRGARPSDY-RPTPAPGRRKAPIPDVTPNPGQGHQPD- 254 IP+ DL+P A A + P++ P PG R P PD NP D Sbjct: 310 IPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDG 369 Query: 255 NGGYHPAPPRPNDASQNKHQRDEFKGK 281 G P P D +H+++ +G+ Sbjct: 370 QPGTRPDSPAVPDRPNGRHRKERKEGE 396
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 250 bits (640), Expect = 2e-84 Identities = 83/323 (25%), Positives = 144/323 (44%), Gaps = 34/323 (10%) Query: 1 MKKGFFLMAMVVSLVMIAGCDKSANPKQPTQGMSVVTSFYPMYAMTKEVSGDLNDVR-MI 59 MKK L+ + +S +++ C Q + VV + + +TK ++GD D+ ++ Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 60 QSGAGIHSFEPSVNDVAAIYDADLFVYHSHTLE----AWARDLDPNLKKSKVDVFEASKP 115 G H +EP DV +ADL Y+ LE AW L N KK++ + A Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFA--- 117 Query: 116 LTLDRVKGLEDMEVTQGIDPATLY--------DPHTWTDPVLAGEEAVNIAKELGRLDPK 167 V+ G+D L DPH W + A NIAK+L DP Sbjct: 118 -------------VSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPN 164 Query: 168 HKDSYTKNAKAFKKEAEQLTEEYTQKFKKVR--SKTFVTQHTAFSYLAKRFGLKQLGISG 225 +K+ Y KN K + + ++L +E KF K+ K VT AF Y +K +G+ I Sbjct: 165 NKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWE 224 Query: 226 ISPEQEPSPRQLKEIQDFVKEYNVKTIFAEDNVNPKIAHAIAKSTGAKVKT---LSPLEA 282 I+ E+E +P Q+K + + +++ V ++F E +V+ + +++ T + + Sbjct: 225 INTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAE 284 Query: 283 APSGNKTYLENLRANLEVLYQQL 305 +Y ++ NL+ + + L Sbjct: 285 QGKEGDSYYSMMKYNLDKIAEGL 307
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 107 bits (268), Expect = 3e-27 Identities = 50/226 (22%), Positives = 85/226 (37%), Gaps = 47/226 (20%) Query: 119 KAGKGAGTVVAVIDAGFDKNHEAWRLTDKSKARYQSKEDLEKAKKDHGITYGEWVNDKVA 178 +G G VAV+D G D +H DL KA+ G + + Sbjct: 36 NQTRGRGVKVAVLDTGCDADHP----------------DL-KARIIGGRNFTDDDEGDPE 78 Query: 179 YYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAMPEAQLLLMRVEIVNGLA 238 + DY+ HGTHV+G ++ + G PEA LL+++V G Sbjct: 79 IFKDYNG---------HGTHVAGTIAATENE-----NGVVGVAPEADLLIIKVLNKQGSG 124 Query: 239 DYARNYAQAIRDAVNLGAKVINMSFGNAALAYANLPDETKKAFDYAKSKGVSIVTSAGND 298 Y Q I A+ +I+MS G E +A A + + ++ +AGN+ Sbjct: 125 QYD-WIIQGIYYAIEQKVDIISMSLGGPED-----VPELHEAVKKAVASQILVMCAAGNE 178 Query: 299 SSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASYSPDKQLTETAT 344 +T +G P + ++V + + D+ +E + Sbjct: 179 GDGDDRT----------DELGYPGCYNEVISVGAINFDRHASEFSN 214 Score = 79.9 bits (197), Expect = 6e-18 Identities = 37/139 (26%), Positives = 58/139 (41%), Gaps = 22/139 (15%) Query: 459 NATPKVLPTASGTK---LSRFSSWGLTADGNIKPDIAAPGQDILSSVANNKYAKLSGTSM 515 +V+ + S FS+ + D+ APG+DILS+V KYA SGTSM Sbjct: 192 GCYNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSM 245 Query: 516 SAPLVAGIMGL-LQKQYETQYPDMTPSERLDLAKKVLMSSATALYDEDEKAYFSPRQQGA 574 + P VAG + L Q + D+T E L+ L + SP+ +G Sbjct: 246 ATPHVAGALALIKQLANASFERDLTEPE----LYAQLIKRTIPLGN-------SPKMEGN 294 Query: 575 GAVDAKKASA-ATMYVTDK 592 G + + ++ T + Sbjct: 295 GLLYLTAVEELSRIFDTQR 313
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 212 bits (540), Expect = 1e-63 Identities = 281/586 (47%), Positives = 342/586 (58%), Gaps = 52/586 (8%) Query: 1 MAKNNTNRHYSLRKLKTGTASVAVALTVLGTGLVAGQTVKAD----ARSVNGEFPRHVKL 56 M KNNTNRHYSLRKLKTGTASVAVALTVLG GLV + +++ E + Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERAD 60 Query: 57 KNEIEN-LLDQVTQLYTKHNSNYQQYNAQAGRLDLRQKAEYLKGLNDWAERLLQELNGED 115 K EIEN L + +N + +N + K + K +E+ + E Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120 Query: 116 VKKVLGKVAFEKDDLEKEVKELKEKIDKKEKEYQDLDKDFDLAKQGYVLSDKRHQQELEE 175 K L K + + LE Sbjct: 121 RKADLEKALEGAMNFSTADSA--------------------------------KIKTLEA 148 Query: 176 KEKKVTEATAKVGQISEELETVKQKVESTMQDLTEKQNRVSQLEQELATTKQNAKEDFEL 235 ++ + A + + E + ++ L ++ + + EL + A Sbjct: 149 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA 208 Query: 236 AALANAADKQKLEAKIADLETKLKEAKEDFELAALGHQHAHNEYQAKLAEKDDQIKQLEE 295 + + + L K D E A G + AK+ + + LE Sbjct: 209 DS--------AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260 Query: 296 QKQILDASRKGTARDLEAVRQAKKATEAELNNLKAELAKVTEQKQILDASRKGTARDLEA 355 ++ L+ + +G A K EAE L+AE A + Q Q+L+A+R+ RDL+A Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320 Query: 356 VRQAKAQVEAALKQLEEQNRISEASRKGLRRDLDASREAKKQVEKDLANLTAELDKVKEE 415 R+AK Q+EA ++LEEQN+ISEASR+ LRRDLDASREAKKQ+E AE K++E+ Sbjct: 321 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLE-------AEHQKLEEQ 373 Query: 416 KQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAEL 475 +IS+ASRQ LRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAEL Sbjct: 374 NKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAEL 433 Query: 476 QAKLEAEAKALKEQLAKQAEELAKLRAGKASDSQIPDTKPGNKAVPGKGQAPQAGTKPNQ 535 QAKLEAEAKALKE+LAKQAEELAKLRAGKASDSQ PD KPGNKAVPGKGQAPQAGTKPNQ Sbjct: 434 QAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQ 493 Query: 536 NKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 581 
NKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN Sbjct: 494 NKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 539
>PF05043#Transcriptional activator Length = 493 Score = 521 bits (1343), Expect = 0.0 Identities = 108/475 (22%), Positives = 218/475 (45%), Gaps = 18/475 (3%) Query: 34 ELSKALNISMLTLQTCLTNMQ-FMKEVGGITYKNGYITIWYHQHCGLQEVYQKALRHSQS 92 EL++ LN + ++ L++++ ++ + NG I ++ VY +HS Sbjct: 30 ELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRIINT-DDSDIEMVYHHFFKHSTH 88 Query: 93 FKLLETLFFRDFNSLEELAEELFVSLSTLKRLIKKTNAYLMHTFGITILTSPVQVSGDEH 152 F +LE +FF + E + +E ++S S+L R+I + N + F + +PVQ+ G+E Sbjct: 89 FSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNER 148 Query: 153 QIRLFYLKYFSEAYKISEWPFGEILNLKNCERLLSLMIKEVDVRVNFTLFQHLKILSSVN 212 IR F+ +YFSE Y EWPF + + +LL L+ KE +N + + LK+L N Sbjct: 149 DIRYFFAQYFSEKYYFLEWPFEN-FSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTN 207 Query: 213 LIRYYKGHSAVYDNKKTSHRFSQLIQSSLEIQDLSRLFYLKFGLYLDETTIAEMFSNHVN 272 L R GH D + + + + I+ +++ F ++ + LDE + ++F ++ Sbjct: 208 LYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEGVAQSFESEYNISLDEEVVCQLFVSYFQ 267 Query: 273 DQLEIGYAF--DSIKQDSPTGCRKVTNWVHLTNWVHLLDELEIRLNLSVTNKYEVAVILH 330 I + +K+DS V HL + +D++ ++ + + NK + LH Sbjct: 268 KMFFIDESLFMKCVKKDS-----YVEKSYHLLS--DFIDQISVKYQIEIENKDNLIWHLH 320 Query: 331 NTTVLKEEDITANYLFFDYKKSYLNFYKQEHPHLYKAFVAGVEKLMRSEKEPISTELTNQ 390 NT L +++ ++ FD K + + ++ P + + + + S+ + N Sbjct: 321 NTAHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSHYLETLEVCSSSMMVNH 380 Query: 391 LIYAFFITWENSFLEVNQKDEKIRLLVI----ERSFNSVGNFLKKYIGEFFSITNFNELD 446 L Y F ++ + + Q K+++LV+ + V L Y F + + EL+ Sbjct: 381 LSYTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLSYYCSNNFELEVWTELE 440 Query: 447 ALTIDLEEIEKQYDVIVTDVMVGKSDELEIFFFYKMIPEAIIDKLNAFLNISSAD 501 LE + YD+I+++ ++ + + + + ++I LNA + I + Sbjct: 441 LSKESLE--DSPYDIIISNFIIPPIENKRLIYSNNINTVSLIYLLNAMMFIRLDE 493
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.4 bits (94), Expect = 2e-05 Identities = 25/151 (16%), Positives = 48/151 (31%), Gaps = 6/151 (3%) Query: 42 TADTDTDDESETAKKDKKSKETASQHDTQKDHKPSHNHPTPPSNDTKQTDQASSEATDKP 101 T +T T + ETA +K+ K TQ+ P P + +T Q +E + Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEV--PKVTSQVSPKQEQSETVQPQAEP-ARE 1148 Query: 102 NKDKNDTKQPNSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKSADKTP 161 N + K+P S +T D + + + + + Sbjct: 1149 NDPTVNIKEPQSQTNTTA---DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205 Query: 162 EKGPEKATEKTPEPNRDAPKPIQPPLAAAAP 192 P +E + +P + ++ P Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 1e-20 Identities = 31/128 (24%), Positives = 55/128 (42%), Gaps = 1/128 (0%) Query: 3 KILVVEDDDTISQVICEFLKANNYDPDCVFDGQAALDKWQTTSYDLIILDIMLPSLSGLE 62 ILV +DD I V+ + L YD + DL++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VLKTIRKT-SDVPIIMLTALDDEYTQLVSFNHLISDYVTKPFSPLILIKRIENVLRVSTP 121 +L I+K D+P+++++A + T + + DY+ KPF LI I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 DEKRQIGD 129 + D Sbjct: 125 RPSKLEDD 132
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 54.8 bits (132), Expect = 3e-10 Identities = 35/144 (24%), Positives = 55/144 (38%), Gaps = 10/144 (6%) Query: 60 DISLTLAGEVTANNSSKVKIDSSKGEVKEVFVKKGDVVKVGQPLFSYETSQRLTAQSSEF 119 +I T G++T + SK VKE+ VK+G+ V+ G L +LTA +E Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL------KLTALGAEA 134 Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESLLEQIRSAEDSVSQAL 179 D + Q + A L+ Y I K PDE + + E +L Sbjct: 135 DTL----KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190 Query: 180 SDAKTADSDVKTAQIELDKANATA 203 + + + Q EL+ A Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRA 214 Score = 39.4 bits (92), Expect = 2e-05 Identities = 28/180 (15%), Positives = 61/180 (33%), Gaps = 16/180 (8%) Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSRYNTAPDESL---LEQIRSAEDSVS 176 D + ++ +AK + Y VNE+ KS+ E L E + + Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298 Query: 177 QALSDAKTADSDVKTAQIELDKANATATTEKGKLEYDTVKSDTAGTIVSLNTDLPNQSKS 236 + L + ++ +EL K + + +++ + + L Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEE-------RQQASVIRAPVSVKVQQLKVHTEGGVV- 350 Query: 237 KKENETFMEII-DKSKMLVKGNISEFDRDKLKIGQKVEV-IDRKDNSK--KWTGKVTQVG 292 ET M I+ + + V + D + +GQ + ++ ++ GKV + Sbjct: 351 -TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.5 bits (126), Expect = 1e-09 Identities = 67/331 (20%), Positives = 123/331 (37%), Gaps = 20/331 (6%) Query: 45 TGLLMMITSLMGFVGTLYGGHLSDALGRKKVIMIGSVGTTLGWFLTILANLPNAAIPWLT 104 G+L+ + +LM F G LSD GR+ V+++ G + + + A W+ Sbjct: 45 YGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVL 99 Query: 105 FAGILLVEIASSFYGPAYEAMLIDLTDESNRRFVYTINYWFINIAVMFGAGLSGLFYDHH 164 + G ++ I + G A + D+TD R + ++ G L GL Sbjct: 100 YIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFS 158 Query: 165 FLALLVALLLVNVLCFGVAYYYFDETRPETH--AFDHGKGLLDSFRNYRKVFHDRAFVLF 222 A A +N L F + E+ L SFR A + Sbjct: 159 PHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFR--------WARGMT 210 Query: 223 TLGAIFSGSIWMQMDNYVPVHLKLYFQPTAVLGFQVTSSKMLSLMVLTNTLLIVLFMTVV 282 + A+ + MQ+ VP L + F T L+ + ++L + Sbjct: 211 VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT--- 267 Query: 283 NKLTEKWKLLPQLVVGSLLFTLGMLLAFTFTQFYAIWLSVVLLTFGEMINVPASQVLRAD 342 + + L++G + G +L T+ + + +VLL G + +PA Q + + Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIG-MPALQAMLSR 326 Query: 343 MMDHSQIGSYTGFVSMAQPLGAILASLLVSV 373 +D + G G ++ L +I+ LL + Sbjct: 327 QVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 32.0 bits (72), Expect = 0.004 Identities = 37/130 (28%), Positives = 57/130 (43%), Gaps = 14/130 (10%) Query: 192 NLLLSYEETVYEDKSLIDGQLTTVELTAAGKLLQYVHKTQMRELSH--------LQALVH 243 ++LL Y++ ED+ L + VE A KLL + + R+LS Q++V Sbjct: 23 SILLRYQD---EDRRLQVEEEAIVEQIAGLKLLLDTLRAENRQLSREEIYALLRKQSIVR 79 Query: 244 YEIKDYLQMSYATKSSLDLVENARTNKKHGSLYWLLDETKTAMGM-RLLRSWIDRPLVSK 302 +IKD + +E R + S YWL E + R R +I R + + Sbjct: 80 RQIKDLELQIIQIQEKRSELEKKREEFQEKSKYWLRKEGNYQRWIIRQKRLYIQREIQQE 139 Query: 303 EAILERQEII 312 EA E +EII Sbjct: 140 EA--ESEEII 147
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 131 bits (332), Expect = 2e-42 Identities = 56/145 (38%), Positives = 86/145 (59%), Gaps = 4/145 (2%) Query: 1 MNKMERQQQIKRIIQAEHIGTQEDIKNHLQKEGIVVTQATLSRDLRAIGLLKLRDEQGKL 60 MNK +R +I+ II A I TQ+++ + L+K+G VTQAT+SRD++ + L+K+ G Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGSY 60 Query: 61 YYSL-SEPVATPFSPEVRF---YVLKVDRAGFMLVLHTNLGEADVLANLIDNDAIEDILG 116 YSL ++ P S R +K+D A ++VL T G A + L+DN E+I+G Sbjct: 61 KYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMG 120 Query: 117 TIAGADTLLVICRDEEIAKRFEKDL 141 TI G DT+L+ICR + K +K + Sbjct: 121 TICGDDTILIICRTHDDTKVVQKKI 145
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 29.6 bits (66), Expect = 0.036 Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 7/65 (10%) Query: 208 EEAREWFRKLEDGDKEATELWQWFRDESLLEFNRLYDQLHVTFDSYNGEAFYNDKMDEVL 267 +EA + L+ +KEA EL++ + + + Y Q F Y E+ N + E Sbjct: 63 KEAERVEKNLDTLEKEALELYK----KDSEQISN-YSQTRQYFYDYQIES--NPREKEYK 115 Query: 268 ELLEA 272 L A Sbjct: 116 NLRNA 120