PredictBias

identification of genomic and pathogenicity islands in prokaryotic genome
Home | Help | Analyzed genomes
 
A) Input parameters
Genomesequence.gbThreshold dinucleotide bias2
Threshold codon bias4Threshold %GC bias3
E-value (RPSBlast)0.05Genome (non-pathogenic)
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in HE579073 (download)
S.NoStartEndBiasVirulenceInsertion elementsPrediction
1SAI8T7_1000090SAI8T7_1000540Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10000902132.318871Amino acid permease
SAI8T7_10001004142.259430Putative homoserine-o-acetyltransferase
SAI8T7_10001106162.069609Putative uncharacterized protein
SAI8T7_10001207162.265987Putative uncharacterized protein
SAI8T7_10001308172.127422Replicative DNA helicase
SAI8T7_10001408181.632741Adenylosuccinate synthetase
SAI8T7_10001704172.322553**Response regulator
SAI8T7_10001803172.250801Sensor protein kinase walK
SAI8T7_10001901172.806363Putative uncharacterized protein
SAI8T7_1000200-2121.265369Putative uncharacterized protein
SAI8T7_1000210-3121.380986Putative uncharacterized protein
SAI8T7_1000220-1120.0156445'-nucleotidase
SAI8T7_1000230115-2.644080Putative uncharacterized protein
SAI8T7_1000240213-3.561537Ribosomal RNA large subunit methyltransferase H
SAI8T7_1000250212-3.503346Putative uncharacterized protein SA0024
SAI8T7_1000260012-3.104242Glycerophosphoryl diester phosphodiesterase
SAI8T7_1000270012-3.021076Penicillin binding protein 2 prime
SAI8T7_1000280016-2.048999Methicillin-resistance MecR1 regulatory protein
SAI8T7_1000290-117-0.921644MW0034 protein
SAI8T7_1000300-114-1.611023Putative uncharacterized protein
SAI8T7_1000310-213-1.577651Putative uncharacterized protein
SAI8T7_1000320-313-1.571238Cassette chromosome recombinase B1
SAI8T7_1000330-214-2.060547Cassette chromosome recombinase A1
SAI8T7_1000340114-3.369116Putative uncharacterized protein
SAI8T7_1000350316-4.872253Putative uncharacterized protein
SAI8T7_1000360316-4.505663Metallo-beta-lactamase family protein
SAI8T7_1000370416-5.252601Putative uncharacterized protein
SAI8T7_1000380819-6.366616Putative uncharacterized protein
SAI8T7_1000390817-6.148122Putative uncharacterized protein
SAI8T7_1000400718-5.504262Glycosyl transferase, group 1 family protein
SAI8T7_1000410616-4.236084Putative uncharacterized protein
SAI8T7_1000420617-3.305871Putative uncharacterized protein
SAI8T7_1000430413-2.048607Putative uncharacterized protein
SAI8T7_1000440310-0.337575Putative uncharacterized protein SA0080
SAI8T7_1000450110-0.966141Putative uncharacterized protein SA0082
SAI8T7_1000460011-0.870206Putative uncharacterized protein
SAI8T7_1000470012-0.867438Similar to sulfide-quinone reductase
SAI8T7_1000480013-1.141924Probable tRNA-dihydrouridine synthase
SAI8T7_1000490115-2.398227Putative uncharacterized protein
SAI8T7_1000500315-2.507726Putative uncharacterized protein
SAI8T7_1000510618-2.930266Putative uncharacterized protein
SAI8T7_1000520819-2.9849071-phosphatidylinositol phosphodiesterase
SAI8T7_1000530518-2.381547Putative uncharacterized protein
SAI8T7_1000540213-1.165551Putative lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000170HTHFIS942e-24 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 2e-24
Identities = 33/129 (25%), Positives = 66/129 (51%), Gaps = 2/129 (1%)

Query: 1 MQMARKVVVVDDEKPIADILEFNLKKEGYDVYCAYDGNDAVDLIYEEEPDIVLLDIMLPG 60
M A ++V DD+ I +L L + GYDV + I + D+V+ D+++P
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 RDGMEVCREVRKKYE-MPIIMLTAKDSEIDKVLGLELGADDYVTKPFSTRELIARVKANL 119
+ ++ ++K +P+++++A+++ + + E GA DY+ KPF ELI + L
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 120 RRHYSQPAQ 128
+P++
Sbjct: 120 AEPKRRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000370PF01206634e-15 SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 63.2 bits (154), Expect = 4e-15
Identities = 19/70 (27%), Positives = 37/70 (52%)

Query: 119 KTFNYSNLQCPGPIVNISKEIKNIAIGDQIEVVVTDHGFLNDIKSWVKQTGHTLVRLNDF 178
++ + + L CP PI+ K + + G+ + V+ TD G + D +S+ KQTGH L+ +
Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65

Query: 179 GNEIRAIIQK 188
+++
Sbjct: 66 DGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000380VACCYTOTOXIN280.034 Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.5 bits (63), Expect = 0.034
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 7/48 (14%)

Query: 119 LALILMFIKVTPSTSHIKFNRVLLI--TIGGI-----IGLVSGIVGAG 159
LAL+ + +TP SH F ++I +GGI +G VSG++G G
Sbjct: 17 LALVGALVSITPQQSHAAFFTTVIIPAIVGGIATGAAVGTVSGLLGWG 64


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000450PF01206614e-14 SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 60.5 bits (147), Expect = 4e-14
Identities = 20/70 (28%), Positives = 37/70 (52%)

Query: 118 KQFNYRGFQCPGPIVKISQEMKNIEVGDQIEVKVTDPGFPSDIKSWVKQTRHTLVKLDEN 177
+ + G CP PI+K + + + G+ + V TDPG D +S+ KQT H L++ E
Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65

Query: 178 NNGINAIIQK 187
+ + +++
Sbjct: 66 DGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000500GPOSANCHOR397e-05 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 39.3 bits (91), Expect = 7e-05
Identities = 18/134 (13%), Positives = 50/134 (37%), Gaps = 5/134 (3%)

Query: 515 INSEKTSIEEQVYHLDNETLRDNKEIEDLDNRINYIVKQIETLNELIKSIKESNKGFINK 574
++ ++++ L E +++ D ++ +I+ L ++++ +G +N
Sbjct: 76 LSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNF 135

Query: 575 LKAMFNSEEDESYKDHNKEKQQLLTQQLELEKCKKNKHEDLVSKLKEKEKLIKQLTKVQL 634
+ K EK L ++ +LEK + + + + L + ++
Sbjct: 136 ST-----ADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 635 QLDELNSQLQELEA 648
+ EL L+
Sbjct: 191 RQAELEKALEGAMN 204


2SAI8T7_1000630SAI8T7_1000880Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1000630212-1.046292Integral membrane protein DUF6
SAI8T7_1000640112-0.186214HTH-type transcriptional regulator NorG
SAI8T7_10006501161.482226Putative uncharacterized protein
SAI8T7_10006602161.454260L-lactate permease homologue
SAI8T7_10006701171.883168Gram-positive signal peptide protein, YSIRK
SAI8T7_1000680-2101.428782Transcriptional regulator, MarR family
SAI8T7_1000690-192.536286Lipoprotein
SAI8T7_10007000102.961472Lipoprotein
SAI8T7_10007100112.694207Iron-regulated ABC transporter
SAI8T7_10007201153.136375Probable siderophore biosynthesis protein SbnA
SAI8T7_10007303163.489799Probable ornithine cyclodeaminase protein
SAI8T7_10007402163.356143Putative uncharacterized protein
SAI8T7_10007502173.591111Putative uncharacterized protein
SAI8T7_10007600152.722747Putative uncharacterized protein
SAI8T7_10007700143.295728Similar to siderophore biosynthesis protein
SAI8T7_1000780-1142.656087Putative uncharacterized protein
SAI8T7_1000790-2131.873213Probable diaminopimelate decarboxylase protein
SAI8T7_1000800-2111.277577Putative uncharacterized protein sbnI
SAI8T7_1000810-1110.019912Putative uncharacterized protein
SAI8T7_1000820111-0.255790Diacetyl reductase [(S)-acetoin forming]
SAI8T7_1000830212-1.048501Polysaccharide biosynthesis family protein
SAI8T7_1000840311-0.619802Similar to capsular polysaccharide biosynthesis
SAI8T7_1000850311-0.474623Putative glycosyltransferase
SAI8T7_1000860311-0.114336Putative uncharacterized protein
SAI8T7_10008704110.094585Putative uncharacterized protein
SAI8T7_10008803151.599117Superoxide dismutase [Mn/Fe] 2
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000710FERRIBNDNGPP692e-15 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 68.8 bits (168), Expect = 2e-15
Identities = 46/191 (24%), Positives = 78/191 (40%), Gaps = 38/191 (19%)

Query: 53 PKRVVTLYQGATDVAVSLGVKPVGAVES-----WTQKPKFEYIKNDLKDTKI-VGQEPAP 106
P R+V L ++ ++LG+ P G ++ W +P L D+ I VG P
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPP-------LPDSVIDVGLRTEP 87

Query: 107 NLEEISKLKPDLIVASKVRNEKVYDQLSKIAPTVSTDTVFKFKD----------TTKLMG 156
NLE ++++KP +V S + L++IAP F F D + M
Sbjct: 88 NLELLTEMKPSFMVWS-AGYGPSPEMLARIAPGR----GFNFSDGKQPLAMARKSLTEMA 142

Query: 157 KALGKEKEAEDLLKKYDDKVAAFQKDAKAKY--KDAWPLKASVVNF-RADHTRIYA-GGY 212
L + AE L +Y+D F + K ++ + A PL + H ++
Sbjct: 143 DLLNLQSAAETHLAQYED----FIRSMKPRFVKRGARPL--LLTTLIDPRHMLVFGPNSL 196

Query: 213 AGKILNDLGFK 223
+IL++ G
Sbjct: 197 FQEILDEYGIP 207


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000730SYCECHAPRONE310.002 Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE

chaperone signature.
Length = 130

Score = 31.2 bits (70), Expect = 0.002
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 21 VDALTEALTAHAHNDFVQ-PLKPYLRQDPENGH 52
+D E T +HN F Q LKP L D GH
Sbjct: 54 LDNNDEKETLLSHNIFSQDILKPILSWDEVGGH 86


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000740PF04183316e-103 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 316 bits (812), Expect = e-103
Identities = 119/527 (22%), Positives = 208/527 (39%), Gaps = 46/527 (8%)

Query: 79 RVSKQPLTAAEFWQTIANMNCDLSHEWEVARVEEGLTTAATQLAKQLSELDLASHPFV-- 136
R + +P+ A + + +S +++ T L + L++ +
Sbjct: 66 RCADEPVLAQTLLMQLKQVL-SMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINL 124

Query: 137 -MSEQFASLKDRPFHPLAKEKRGLREADYQVYQAELNQSFPLMVAAVKKTHMIHGDTANI 195
L P K +RG + + Y E +F L AVK+ HMI +
Sbjct: 125 NADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEM 184

Query: 196 DELENLTVPIKEQA----TDMLNDQGLSIDDYVLFPVHPWQYQHILPNVFATEISEKLVV 251
D + LT + Q + + + GL +++ PVHPWQ+Q + F + +E +V
Sbjct: 185 DIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIATDFIADFAEGRMV 243

Query: 252 LLPLKFGD-YLSSSSMRSLIDIGAPYN-HVKVPFAMQSLGALRLTPTRYMKNGEQAEQLL 309
L +FGD +L+ S+R+L + +K+P + + R P RY+ G A + L
Sbjct: 244 SLG-EFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWL 302

Query: 310 RQLIEKDEALAKYVMV-CDETA-------WWSYMGQDNDIFKDQLGHLTVQLRKYPEVLA 361
+Q+ D L + V E A ++ + + +++ LG V R+ P
Sbjct: 303 QQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLG---VIWRENPCRWL 359

Query: 362 KNDTQQLVSMAALAANDRTLYQMICGKDNISKNDVMTLFEDIAQVFLKVTLSFM-QYGAL 420
K D + V MA L D + + S D T + +V + + +YG
Sbjct: 360 KPD-ESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVA 418

Query: 421 PELHGQNILLSFEDGRVQKCVLRD-HDTVRIYKPWLTAHQLSLPKYV--VREDTPNTLIN 477
HGQNI L+ ++G Q+ +L+D +R+ K SLP+ V V +
Sbjct: 419 LIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD-SLPQEVRDVTSRLSADYLI 477

Query: 478 EDLETFFAYFQTLAVSVNLYAIIDAIQDLFGVSEHELMSLLKQILKNEVATISWVTTDQL 537
DL+T V + I + GV E LL +L + + Q+
Sbjct: 478 HDLQTGHF--------VTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMK-----KHPQM 524

Query: 538 AVRHILFDKQTWPFKQILLP---LLY-QRDSGGGSMPSGLTTVPNPM 580
+ R LF +++L L + D G +P+ L + NP+
Sbjct: 525 SERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPL 571


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000750TCRTETA733e-16 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 72.9 bits (179), Expect = 3e-16
Identities = 66/333 (19%), Positives = 131/333 (39%), Gaps = 22/333 (6%)

Query: 17 GIAIAAPAVTTMIASPIWGKLGDKISRKWMVLRALLGLAVCLFLMALCTTPLQFVLVRLL 76
GI +A A+ +P+ G L D+ R+ ++L +L G AV +MA + R++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 77 QGLFGGVVDASSAFASAEAPAEDRGKVLGRLQSSVSAGSLVGPLIGGVTASILGFSALLM 136
G+ G + A+ + ++R + G + + G + GP++GG+ A
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAPFF 164

Query: 137 SIAVITFIVCIFGALKLIETTHMPKSQTPNINKGIRRSFQCLLCTQQTCRFIIVGVLANF 196
+ A + + + G L E+ + SF+ + V A
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFR--------WARGMTVVAALM 216

Query: 197 AMYGMLTALSPLASSVNHTAIDDR-----SVIGFLQSAF-WTASILSAPLWGRFNDKSYV 250
A++ ++ + + +++ +DR + IG +AF S+ A + G +
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 251 KSVYIFATIACGCSAILQGLATNIEFLMAARILQGLTYSAL--IQSVMFVVVNACHQ-QL 307
+ + IA G IL AT +L + +Q+++ V+ Q QL
Sbjct: 277 RRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQL 336

Query: 308 KGTFVGTTNSMLVVGQIIGSLSGAAITSYTTPA 340
+G+ T+ + I+G L AI + +
Sbjct: 337 QGSLAALTS----LTSIVGPLLFTAIYAASITT 365


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000760PF041833002e-97 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 300 bits (769), Expect = 2e-97
Identities = 109/474 (22%), Positives = 192/474 (40%), Gaps = 52/474 (10%)

Query: 11 WLIDGKSKKITTYKEAIARILQHMAQSADNQTA-VQQHMAQIMSDI--DNSIHRTARYLQ 67
ID ++ + +L + Q A V +HM + + + D + + R L
Sbjct: 58 LWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLS 117

Query: 68 SNTIDYVEDRYIVSEQSLYLGHPFHPTPKSASGFSEADLEKYAPECHTSFQLHYLAVHQD 127
++ + + Q L GHP K G+ + LE+YAPE +F+LH+LAV ++
Sbjct: 118 ASDL---INLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKRE 174

Query: 128 -------------VLLTRYVEGKEDQVEKVLYQLADIDISEIPKDFILLPTHPYQINVLR 174
LLT ++ +E ++Q +D +++ LP HP+Q
Sbjct: 175 HMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-----HNWLPLPVHPWQWQQK- 228

Query: 175 QHPQYMQYSEQGLIKDLGVSGDLVYPTSSVRTVF--SKALNIYLKLPIHVKITNFIRTND 232
++ +G + LG GD S+RT+ S+ + +KLP+ + T+ R
Sbjct: 229 IATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIP 288

Query: 233 LEQIERTIDAAQVIASVKDE-----------VETPHFKLMFEEGYRALLPNPLGQTVEPE 281
I A++ + V + P + EGY AL P E
Sbjct: 289 GRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQ---E 345

Query: 282 MDLLTNSAMIVREGIPNY-HADKDIHVLASLFETMPD-SPISKLAQVIEQSGLAPEAWLE 339
M +I RE + D+ ++A+L E + P+ I++SGL E WL
Sbjct: 346 M-----LGVIWRENPCRWLKPDESPVLMATLMECDENNQPL--AGAYIDRSGLDAETWLT 398

Query: 340 CYLDRTLLPILKLFSNTGISLEAHVQNTLIELKDGIPDVCFVRDLEG-ICLSRTIATEKQ 398
++P+ L G++L AH QN + +K+G+P ++D +G + L + E
Sbjct: 399 QLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD 458

Query: 399 LVPNVVAASSPVVYAHDEAWHRLKYYVVVNHLGHLVSTIGKATRNEVVLWQLVA 452
+P V + + A D H L+ V L + + + E +QL+A
Sbjct: 459 SLPQEVRDVTSRLSA-DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLA 511


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000770PF04183497e-173 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 497 bits (1282), Expect = e-173
Identities = 142/579 (24%), Positives = 251/579 (43%), Gaps = 40/579 (6%)

Query: 1 MHQLVSSLIYENIVVYKASYQDGVGHFTIEGHDSEYRFTAEKTHSFDRIRITSPIERVVG 60
+ +++S L YE + + A Q G + I +++RF AE+ + + I + R
Sbjct: 14 VAKMLSELEYEQV--FHAESQ-GDDRYCINLPGAQWRFIAERG-IWGWLWIDAQTLRC-- 67

Query: 61 DEADTTTDYTQLLREAVFTFPKNDEKLEQFIVELLQTELKDTQSMQYRESNPPATPETFN 120
AD LL + +D + + + +L T L D Q ++ R + N
Sbjct: 68 --ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLN 125

Query: 121 -DYEFYAMEGHQYHPSYKSRLGFTLSDNLKFGPDFVPNVKLQWLAIDKDKVETTVSRNVV 179
D + GH K R G+ ++ P++ +L WLA+ ++ + +
Sbjct: 126 ADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMD 185

Query: 180 VNEMLRQQVGDKTYEHFVQQIEASGKHVNDVEMIPVHPWQFEHVIQVDLAEERLNGTVLW 239
++++L + + + F Q + +G N + +PVHPWQ++ I D + G ++
Sbjct: 186 IHQLLTAAMDPQEFARFSQVWQENGLDHNWL-PLPVHPWQWQQKIATDFIADFAEGRMVS 244

Query: 240 LGESDELYHPQQSIRTMSPIDTT-KYYLKVPISITNTSTKRVLAPHTIENAAQITDWLKQ 298
LGE + + QQS+RT++ +K+P++I NTS R + I + WL+Q
Sbjct: 245 LGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQ 304

Query: 299 IQQQDMYLKDE----LKTVFLGEVLGQSYLNTQLSPYKQTQVYGALGVIWRENIYHMLID 354
+ D L L G V + Y +PY+ ++ LGVIWREN L
Sbjct: 305 VFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEM---LGVIWRENPCRWLKP 361

Query: 355 EEDAIPFNALYASDKDGLPFIEKWIKQYG--SEAWTKQFLAVAIRPMIHMLYYHGIAFES 412
+E + L D++ P +I + G +E W Q V + P+ H+L +G+A +
Sbjct: 362 DESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALIA 421

Query: 413 HAQNMMLIHENGWPTRIALKDFHDGVRFKREHLSEAASHLTLKPMPEAHKKVNSNSFIET 472
H QN+ L + G P R+ LKDF +R +E E S +P+ + V S
Sbjct: 422 HGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDS------LPQEVRDVTS------ 469

Query: 473 DDERLVRDFLH---DAFFFINIAEIILFIEKQYGIDEQRQWQWVKDIIEAYQEAFPELNN 529
RL D+L F+ + I + + G+ E+R +Q + ++ Y + P+++
Sbjct: 470 ---RLSADYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSE 526

Query: 530 -YQHFDLFEPTIQVEKLTTRRL-LSDSELRIHHVTNPLG 566
+ F LF P I L +L D + + N L
Sbjct: 527 RFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLE 565


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000820DHBDHDRGNASE1284e-38 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 128 bits (323), Expect = 4e-38
Identities = 66/250 (26%), Positives = 113/250 (45%), Gaps = 2/250 (0%)

Query: 5 KVALVTGGAQGIGFKIAERLVEDGFKVAVVDFNEEGAKAAALKLSSDGTKAIAIKADVSN 64
K+A +TG AQGIG +A L G +A VD+N E + L ++ A A ADV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 RDDVFNAVRQTAAQFGDFHVMVNNAGLGPTTPIDTITEEQFKTVYGVNVAGVLWGIQAAH 124
+ + + G ++VN AG+ I ++++E+++ + VN GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 EQFKKFNHGGKIINATSQAGVEGNPGLSLYCSTKFAVRGLTQVAAQDLASEGITVNAFAP 184
+ G I+ S ++ Y S+K A T+ +LA I N +P
Sbjct: 129 KYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 185 GIVQTPMMESIAVATAEEAGKPEAWGWEQFTSQIALGRVSQPEDVSNVVSFLAGKDSDYI 244
G +T M S+ + E F + I L ++++P D+++ V FL + +I
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSL-ETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 245 TGQTIIVDGG 254
T + VDGG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000830NUCEPIMERASE2179e-71 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 217 bits (554), Expect = 9e-71
Identities = 79/327 (24%), Positives = 139/327 (42%), Gaps = 33/327 (10%)

Query: 6 RVLITGGAGFIGSHLVDDL-QQDYDVYVLDNYRTG-----KRENIKSLADDHVF--ELDI 57
+ L+TG AGFIG H+ L + + V +DN K+ ++ LA ++D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 58 REYDAVEQIMKTYQFDYVIHLAALVSVAESVEKPILSQEINVVATLRLLEIIKKYNSHIK 117
+ + + + + F+ V ++V S+E P + N+ L +LE + I+
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQ 119

Query: 118 RFIFASSAAVYGDLPDLPKSDQSLI-LPLSPYAIDKYYGERTTLNYCSLYNIPTAVVKFF 176
++ASS++VYG +P S + P+S YA K E Y LY +P ++FF
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 177 NVFGPRQDPKSQYSGVISKMFDSFEHNKPFTFFGDGLQTRDFVYVYDVVQSVRLIMEH-- 234
V+GP P M K + G RDF Y+ D+ +++ + +
Sbjct: 180 TVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 235 ---------------KDAIGHGYNIGTGTFTNLLEVYRIIGELYGKSVEHEFKEARKGDI 279
A YNIG + L++ + + + G + + GD+
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDV 295

Query: 280 KHSYADISNL-KALGFVPKYTVETGLK 305
+ AD L + +GF P+ TV+ G+K
Sbjct: 296 LETSADTKALYEVIGFTPETTVKDGVK 322


3SAI8T7_1001200SAI8T7_1001490Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10012002111.309514Putative aldehyde dehydrogenase AldA
SAI8T7_10012101141.159836Cation efflux family protein
SAI8T7_1001220-1111.945785Putative uncharacterized protein
SAI8T7_1001230-1121.451998Putative uncharacterized protein
SAI8T7_10012400141.504206Putative uncharacterized protein
SAI8T7_10012501141.279741Putative uncharacterized protein
SAI8T7_10012602141.283404Putative uncharacterized protein
SAI8T7_10012702131.165875Formate dehydrogenase
SAI8T7_10012803141.468170Similar to integral membrane protein LmrP
SAI8T7_10012903131.731674Gramicidin S synthetase 2 related protein
SAI8T7_10013002142.429596Putative uncharacterized protein
SAI8T7_10013102152.006495Putative uncharacterized protein
SAI8T7_10013201151.927579Acetylglutamate kinase
SAI8T7_10013300142.270038Arginine biosynthesis bifunctional protein ArgJ
SAI8T7_1001340-1162.396710N-acetyl-gamma-glutamyl-phosphate reductase
SAI8T7_1001350-1142.432958Ornithine aminotransferase 1
SAI8T7_10013600152.446599Similar to branched-chain amino acid transport
SAI8T7_10013701123.324564Isochorismatase hydrolase
SAI8T7_10013801113.151056Putative indole-3-pyruvate decarboxylase
SAI8T7_10013900112.061049PTS system glucose-specific EIICBA component
SAI8T7_10014000141.055144Putative uncharacterized protein
SAI8T7_1001410-1140.369666N-acetylmuramic acid 6-phosphate etherase
SAI8T7_1001420-215-0.232952PTS system EIIBC component MW0166
SAI8T7_1001430116-1.431049RpiR family transcriptional regulator
SAI8T7_1001440115-3.570784Type-1 restriction enzyme R protein
SAI8T7_1001450218-5.479838Putative uncharacterized protein
SAI8T7_1001460319-5.792263Putative uncharacterized protein
SAI8T7_1001470015-4.293212Similar to ABC transporter ATP-binding protein
SAI8T7_1001480-114-3.907492Similar to SA0193/BacI-like protein
SAI8T7_1001490-111-3.396289Putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001280TCRTETA320.004 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.004
Identities = 61/337 (18%), Positives = 127/337 (37%), Gaps = 33/337 (9%)

Query: 7 TLKVRLISNFLQLIITTAFIPFIALYLTDMLS----QSIVGIYLVGLVVLKFPLSIISGY 62
L V L + L + +P + L D++ + GI L +++F + + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 63 LIEIFPKKLLVLIYQATMVIMLVFMGVFGSHQLWQI-IGFCVAYAIFTIVWGLQFPVMDT 121
L + F ++ ++L+ A + M + LW + IG VA + G V
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMAT--APFLWVLYIGRIVAG-----ITGATGAVAGA 118

Query: 122 LIMDAITEDVEHYIYKISYWMTNLSVAIGALLGGLMYGYSMLLLFLIAACIFLIVLFILY 181
I D D + + G +LGGLM G+S F AA + +
Sbjct: 119 YIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178

Query: 182 IWLPQDRNQVKQSDDKRHASRYQKLQIMNIFRSYKLVLKDRNYMLLISGFSIIMMGEFSI 241
LP+ ++ + + + L+ F + ++G+
Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAA--------LMAVFFIMQLVGQVPA 230

Query: 242 SSYIAIRLKDQF--ETISIGSYDITGAKMLAILLMINTVVVILLTYSISKVVLKIDFKKA 299
+ + I +D+F + +IG LA +++++ ++T ++ ++ ++A
Sbjct: 231 ALW-VIFGEDRFHWDATTIGI-------SLAAFGILHSLAQAMITGPVAA---RLGERRA 279

Query: 300 LITGLLIYIVGYSGLTYLNQFGLLVVFMIIATVGEII 336
L+ G++ GY L + + + M++ G I
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIG 316


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001290NUCEPIMERASE538e-09 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 52.9 bits (127), Expect = 8e-09
Identities = 54/266 (20%), Positives = 101/266 (37%), Gaps = 55/266 (20%)

Query: 2052 NTLLTGATGFLGAYLIEALQGYSHRIYCFIRADNEEIAWYKLMTNLNDYFS----EETVE 2107
L+TGA GF+G ++ + L H++ + D NLNDY+ + +E
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV---VGID-----------NLNDYYDVSLKQARLE 47

Query: 2108 MM----LSNIEVIVGDFECMDDVVLPENMDTIIH----AGARTDHFGDDDEFEKVNVQGT 2159
++ ++ + D E M D+ + + + R + + N+ G
Sbjct: 48 LLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVR-YSLENPHAYADSNLTGF 106

Query: 2160 VDVIRLAQQHH-ARLIYVSTISV-GTYFDIDTEDVTFSEADVYKGQLLTSPYTRSKFYSE 2217
++++ + + L+Y S+ SV G + FS D + S Y +K +E
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYG-----LNRKMPFSTDDSVDHPV--SLYAATKKANE 159

Query: 2218 LKVLEAVNN-GLDGRIVRVGNLTSPYNGRWHM------RNIKTNRFSMVMNDLLQLDCIG 2270
L + GL +R + P+ GR M + + + V N
Sbjct: 160 LMAHTYSHLYGLPATGLRFFTVYGPW-GRPDMALFKFTKAMLEGKSIDVYNY-------- 210

Query: 2271 VSMAEMPVDFSFVDTTARQIVALAQV 2296
+M DF+++D A I+ L V
Sbjct: 211 ---GKMKRDFTYIDDIAEAIIRLQDV 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001300ENTSNTHTASED290.009 Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 29.2 bits (65), Expect = 0.009
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 84 GQP-----IYVSLSYSYPYIVCVVDKEPVGIDIEKISQRLDWRTLVTCFSTNEAHQI 135
QP ++ S+S+ + V+ ++ +GIDIEKI + L ++ QI
Sbjct: 76 RQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATELAPSIIDSDERQI 132


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001320CARBMTKINASE320.002 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 31.7 bits (72), Expect = 0.002
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 155 INADTLAYFIASSLKAPIYV-LSNIAGVLIN-----DVVIPQLPLVDIHQYIEHGD-IYG 207
I+ D +A + A I++ L+++ G + + + ++ + ++ +Y E G G
Sbjct: 213 IDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAG 272

Query: 208 GMIPKVLDAKNAIENGCPKVIIAS 231
M PKVL A IE G + IIA
Sbjct: 273 SMGPKVLAAIRFIEWGGERAIIAH 296


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001370ISCHRISMTASE604e-13 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 59.6 bits (144), Expect = 4e-13
Identities = 31/99 (31%), Positives = 51/99 (51%)

Query: 86 LDKRDDDFVIDKRHFSAFVGTDLDLQLRRRGIDTIVLGGVATHIGVDTTARDAYQLNYNQ 145
L DDD V+ K +SAF T+L +R+ G D +++ G+ HIG TA +A+ +
Sbjct: 112 LAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKA 171

Query: 146 FFVTDMMSAQNETLHQFPIDNVFPLMGQTITTNDFLNIL 184
FFV D ++ + HQ ++ T+ T+ L+ L
Sbjct: 172 FFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSLLDQL 210


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001430DNABINDINGHU300.002 Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 30.4 bits (69), Expect = 0.002
Identities = 16/75 (21%), Positives = 28/75 (37%), Gaps = 15/75 (20%)

Query: 87 ELIENESVETLKNKMIARATNTMRFVATNIMDAQIDAICDVLKNARTIFLFGFGASSLTI 146
+LI +A AT + + +DA A+ L + L GFG +
Sbjct: 6 DLIA----------KVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVR- 54

Query: 147 GDLFQKLSRIGLNVR 161
++ +R G N +
Sbjct: 55 ----ERAARKGRNPQ 65


4SAI8T7_1002070SAI8T7_1002120Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10020701183.066791Putative methyltransferase
SAI8T7_10020801163.283583Probable ribokinase
SAI8T7_10020902153.621451D-ribose pyranase
SAI8T7_10021003143.279484Putative ribose uptake protein rbsU
SAI8T7_10021103142.287867Similar to sugar-binding transcriptional
SAI8T7_10021202151.341901Major facilitator transporter
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1002120TCRTETB1004e-25 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 100 bits (251), Expect = 4e-25
Identities = 91/392 (23%), Positives = 158/392 (40%), Gaps = 18/392 (4%)

Query: 38 PLVGQTYQTSPAVLNLSISLTSFATGIFMVAAGDIADKIGQLRMTYMGLIISMFASLLLI 97
P + + PA N + I G ++D++G R+ G+II+ F S++
Sbjct: 38 PDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF 97

Query: 98 ISDITA-LLIIGRILQGLSAAILLPSTVGVLNNQFKGEHLRRAISYLMISTVGGIGLAGV 156
+ LLI+ R +QG AA + V+ E+ +A + G G+
Sbjct: 98 VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157

Query: 157 IGGLIATNFGWQMNFIISIVIAFIAILLLKGTPEKVSQHSHRHPFDYKGMSIFAVMIGSF 216
IGG+IA W +I ++ L+K ++V + FD KG+ + +V I F
Sbjct: 158 IGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV---RIKGHFDIKGIILMSVGIVFF 214

Query: 217 TLLLTQGFEQGWFSTFSFICLSIFIITTLIFIIIERRHEVPFIDFSVLRNRPFIGAFLNN 276
L T ++S L + +++ LIF+ R+ PF+D + +N PF+ L
Sbjct: 215 MLFTT---------SYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCG 265

Query: 277 FVLNSGLGVTVVFFIYA-QTHLGLSAAQSGLVTLPYAIVAVAMIR-LGEKATLRFGGKLM 334
++ + V Y + LS A+ G V + ++V + +G R G +
Sbjct: 266 GIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYV 325

Query: 335 LIIGPLFPVIGITIISMTQLSASQYVIAVIIGFVICAIGNGLVATPGLTIAIFSMPNEKV 394
L IG F + S + S + + I V G T TI S+ ++
Sbjct: 326 LNIGVTFLSVSFLTASFLLETTSWF---MTIIIVFVLGGLSFTKTVISTIVSSSLKQQEA 382

Query: 395 GLATGLYKMSGTLGGAFGIALSTTVFSMLQLN 426
G L + L GIA+ + S+ L+
Sbjct: 383 GAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414


5SAI8T7_1002430SAI8T7_1002480Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10024300123.093866Putative uncharacterized protein
SAI8T7_10024400113.597552Putative N-acetylmannosamine-6-phosphate
SAI8T7_10024500103.416329Nucleoside recognition domain protein
SAI8T7_10024601113.643833Lipase=2
SAI8T7_10024702122.919400Putative uncharacterized protein
SAI8T7_10024801133.046606Putative trimethylamine dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1002440PHPHTRNFRASE270.046 Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase

signature.
Length = 572

Score = 27.4 bits (61), Expect = 0.046
Identities = 17/82 (20%), Positives = 27/82 (32%), Gaps = 12/82 (14%)

Query: 65 DYDHSDVFITATSKEVDELIESQCEVIALDATLQQ---RPKETLDELVSYIRTHAPNVEI 121
D V + T +EV E + + P T D +VE+
Sbjct: 222 DGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKD---------GAHVEL 272

Query: 122 MADIATVEEAKNAARLGFDYIG 143
A+I T ++ G + IG
Sbjct: 273 AANIGTPKDVDGVLANGGEGIG 294


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1002460GPOSANCHOR471e-07 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.4 bits (112), Expect = 1e-07
Identities = 41/309 (13%), Positives = 85/309 (27%), Gaps = 12/309 (3%)

Query: 1 MLRGQEERKYSIRKYSIGVVSVLAATMFVVSSHEAQASEKTPTSNAAAQKETLNQPGEQG 60
M + R YS+RK G SV A + + +E + + +
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERAD 60

Query: 61 NAITSHQMQSGKQLDDMHKENGKSGTVTEGKDTLQSSKHQSTQNSKTIRTQ---NDNQVK 117
+ K D E + L ++K + +N K++ +
Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120

Query: 118 QDSERQGSKQSHQN------NATNNTERQNDQVQNTHHAERNGSQSTTSQSNDVDKSQPS 171
+ ++ + + + N E + + + + S +
Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180

Query: 172 IPAQKVLPNHDKAAPTSTTPPSNDKTAPKSTKAQDATTDKHPNQQDTHQPAHQIIDAKQD 231
+ A+K +A + + + S K + +K + A
Sbjct: 181 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNF 240

Query: 232 DTVRQSEQKPQVGDLSKHIDGQNSPEKPTDKNTDNKQLIKDALQAPKTRSTTNAAADAKK 291
T ++ K + + Q EK KT AA +A+K
Sbjct: 241 STADSAKIKTLEAEKAALEARQAELEK---ALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 292 VRPLKANQV 300
+QV
Sbjct: 298 ADLEHQSQV 306


6SAI8T7_1003020SAI8T7_1003310Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1003020216-1.075687Exotoxin=6
SAI8T7_1003030115-1.175554Exotoxin=7
SAI8T7_1003040314-1.689685Putative uncharacterized protein (fragment)
SAI8T7_1003050415-2.798544Exotoxin=8
SAI8T7_1003060216-2.991972Superantigen-like protein
SAI8T7_1003070215-1.204044Superantigen-like protein 5
SAI8T7_1003080113-1.942571Exotoxin=11
SAI8T7_1003090113-2.000277Superantigen-like protein 7
SAI8T7_100310039-1.605994Exotoxin=13
SAI8T7_100311049-1.710183Exotoxin=14
SAI8T7_1003120411-1.932228Type I restriction enzyme EcoR124II M protein
SAI8T7_10031301014-3.878715Type I restriction modification DNA specificity
SAI8T7_10031401013-3.605067Exotoxin=15
SAI8T7_10031501113-3.550002Putative uncharacterized protein
SAI8T7_10031601321-4.088587Tandem lipoprotein within Pathogenicity island
SAI8T7_10031701222-3.884319Uncharacterized lipoprotein SAV0437
SAI8T7_10031801222-3.442082Tandem lipoprotein within Pathogenicity island
SAI8T7_10031901222-3.554770Uncharacterized lipoprotein SAV0439
SAI8T7_10032001121-3.539695Uncharacterized lipoprotein SA0400
SAI8T7_10032101221-3.501573Uncharacterized lipoprotein SAV0441
SAI8T7_10032201119-3.347690Tandem lipoprotein within Pathogenicity island
SAI8T7_1003230714-2.027657Tandem lipoprotein
SAI8T7_1003240114-0.576526Uncharacterized lipoprotein SAOUHSC_00402
SAI8T7_1003250-1131.640724Uncharacterized lipoprotein SAV0444
SAI8T7_1003260-1131.708234Lipoprotein
SAI8T7_10032700132.152735Putative uncharacterized protein
SAI8T7_10032803162.776193Putative cobalamin synthesis protein
SAI8T7_10032902152.474119NADH dehydrogenase subunit 5
SAI8T7_10033001152.589768UPF0753 protein SaurJH1_0488
SAI8T7_10033102161.150839Putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003020TOXICSSTOXIN953e-26 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 95.5 bits (237), Expect = 3e-26
Identities = 46/214 (21%), Positives = 84/214 (39%), Gaps = 11/214 (5%)

Query: 18 TGVITSNVQSVQAKTEVKQQSESELKHYYNKPVLERKNVTGYKYTEKGKDYIDVIVDNQY 77
T V S+ Q ++ + +L +Y+ N + + + + +
Sbjct: 25 TPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNS---EVLDNSLGSMRIKNTDGS 81

Query: 78 SQISLVGSDKDKFKDGDNSNIDVFILREGDSRQATN-----YSIGGVTKTNSQPFIDYIH 132
+ + S +D+ R S+ + + I GVT T P I
Sbjct: 82 ISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIE 139

Query: 133 TPILEIKKGKEEPQSSLYQIYKEDISLKELDYRLRERAIKQHGLYSNGLKQGQI-TITMK 191
P+ GK+ P + K+ +++ LD+ +R + + HGLY + K G ITM
Sbjct: 140 LPLKVKVHGKDSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMN 199

Query: 192 DGKSHTIDLSQKLEKERMGDSIDGRQIQKILVEM 225
DG ++ DLS+K E I+ +I+ I E+
Sbjct: 200 DGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003030TOXICSSTOXIN803e-21 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 80.5 bits (198), Expect = 3e-21
Identities = 32/168 (19%), Positives = 63/168 (37%), Gaps = 18/168 (10%)

Query: 1 MNIIDGNSVNNLALIGKDKQHYHTGVHRNLNIFYVN-----EDKRFEGAKYSIGGITSAN 55
M I + + +L + +++ + I G+T+
Sbjct: 73 MRIKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTE 132

Query: 56 DKA--VDLIAEARVIKADHIGEYDYDFFPFKIDKEAMSLKEIDFKLRKYLIDNYGLYGEM 113
++L + +V D +Y F DK+ +++ +DF++R L +GLY
Sbjct: 133 KLPTPIELPLKVKVHGKDSPLKYGPKF-----DKKQLAISTLDFEIRHQLTQIHGLYRSS 187

Query: 114 ST----GKITVKKKYYGKYTFELDKKLQEDRMSDVINVTDIDRIEIKV 157
KIT+ Y +L KK + + IN+ +I IE ++
Sbjct: 188 DKTGGYWKITMNDG--STYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003050TOXICSSTOXIN895e-24 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 88.9 bits (220), Expect = 5e-24
Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 5/128 (3%)

Query: 67 VFIVLEDNKYQLKKYSVGGITKTNSKKVDHKAELSVTKKDNQGMISRDVSEYMITKEEIS 126
++ + + G+T T + L V + K++++
Sbjct: 109 TKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYG---PKFDKKQLA 165

Query: 127 LKELDFKLRKQLIEKHNLYGNM--GSGTIVIKMKNGGKYTFELHKKLQEHRMADVIEGTN 184
+ LDF++R QL + H LY + G I M +G Y +L KK + + I
Sbjct: 166 ISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDE 225

Query: 185 IDKIEVNI 192
I IE I
Sbjct: 226 IKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003060TOXICSSTOXIN1018e-28 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 101 bits (253), Expect = 8e-28
Identities = 45/216 (20%), Positives = 81/216 (37%), Gaps = 13/216 (6%)

Query: 87 TKVETPQSPTTKQVPTEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNIVPDYF 146
T V + K N KDL +Y+ S F N ++ ++R N
Sbjct: 25 TPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTN-SEVLDNSLGSMRIKNTDGSI- 82

Query: 147 IYKIALVGKDDKKYDEGVHRNVDVFVVLEEKNKYGVE----RYSVGGITKSNSKKVDHKA 202
+ + VD+ +K+++ E + + G+T + +
Sbjct: 83 --SLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIEL 140

Query: 203 GVRITKEDNKGTISHDVSEFKITKEQISLKELDFKLRKQLIENHNLYGNV--GSGKIVIN 260
+++ + + K K+Q+++ LDF++R QL + H LY + G I
Sbjct: 141 PLKVKVHGKDSPLKYG---PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKIT 197

Query: 261 MKNGGKYTFELHKKLQENRMADVIDGTNIDNIEVNI 296
M +G Y +L KK + N I+ I IE I
Sbjct: 198 MNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003070TOXICSSTOXIN1352e-41 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 135 bits (340), Expect = 2e-41
Identities = 49/201 (24%), Positives = 73/201 (36%), Gaps = 14/201 (6%)

Query: 44 NVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKNRKFTRVQIFGKDIERFKAR 103
+ +I DL D+YS S N S G + + IF
Sbjct: 41 STNDNIKDLLDWYSSGSDTFTNSEVLDNSLGS---MRIKNTDGSISLIIFPSPYYSPAFT 97

Query: 104 KNPGLDI-----FVVKEAENRNGTVFSYGGVTKKNQDAYYDYINAPRFQIKRDEGDGIAT 158
K +D+ + F GVT + I P +K D
Sbjct: 98 KGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELPLK-VKVHGKDSPLK 154

Query: 159 YGRVHYIYKEEISLKELDFKLRQYLIQNFDLYKKFPKDSKI-KVIMKDGGYYTFELNKKL 217
YG K+++++ LDF++R L Q LY+ K K+ M DG Y +L+KK
Sbjct: 155 YG--PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKF 212

Query: 218 QTNRMSDVIDGRNIEKIEANI 238
+ N I+ I+ IEA I
Sbjct: 213 EYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003080TOXICSSTOXIN1921e-63 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 192 bits (488), Expect = 1e-63
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 16/197 (8%)

Query: 42 DIKDLYRYYSSESFEFSNI--------SGKVENYNGSNVVRFNQEKQNHQLFLLGKDKDK 93
+IKDL +YSS S F+N S +++N +GS + F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 94 YKKGLEGQNVFVVKELIDPNGRLSTVGGVTKKNNKSSETNTHLFVNKVYGGNLDASIDSF 153
K L + + + + GVT + L V KV+G +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKV-KVHGKDSPLKYGP- 157

Query: 154 LINKEEVSLKELDFKIRKQLVEKYGLYKGTTKYGKI-TINLKDEKKEVIDLGDKLQFERM 212
+K+++++ LDF+IR QL + +GLY+ + K G I + D DL K ++
Sbjct: 158 KFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 213 GDVLNSKDIQNIAVTIN 229
+N +I+ I IN
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003090TOXICSSTOXIN1252e-37 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 125 bits (314), Expect = 2e-37
Identities = 47/199 (23%), Positives = 73/199 (36%), Gaps = 19/199 (9%)

Query: 44 DTNKLHQYYSGPSYELTNV--------SGQSQGYYDSNVLLFNQQNQKFQVFLLGKDENK 95
+ L +YS S TN S + + S L+ F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 96 YKEKTHGLDVFAVPELVDLDGRIFSVSGVTKKNVKSIFESLRTPNLLVKKIDDKDGFSID 155
K + + F +SGVT L L K+ KD +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP----LKVKVHGKDSP-LK 154

Query: 156 EFFFIQKEEVSLKELDFKIRKLLIKKYKLYEGSA-DKGRIVINMKDENKYEIDLSDKLDF 214
K+++++ LDF+IR L + + LY S G I M D + Y+ DLS K ++
Sbjct: 155 YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEY 214

Query: 215 ERMADVINSEQIKNIEVNL 233
IN ++IK IE +
Sbjct: 215 NTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003100TOXICSSTOXIN1301e-39 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 130 bits (329), Expect = 1e-39
Identities = 39/197 (19%), Positives = 69/197 (35%), Gaps = 15/197 (7%)

Query: 43 INMLHQYYSEESFESTNISVKSEDYYGSNVLNFNQRNKTFKVFLLGDDKNKY------KE 96
I L +YS S TN V + + + + + K
Sbjct: 46 IKDLLDWYSSGSDTFTNSEVLD---NSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKV 102

Query: 97 KTHGLDVFAVPELIDIKGGIYSVGGITKKNVRSVFGFVSNPSLQVKKVDAKHGFSINELF 156
+ + + + G+T + P L+VK F
Sbjct: 103 DLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELP-LKVKVHGKDSPLKYGPKF 159

Query: 157 FIQKEEVSLKELDFKIRKMLVEKYRLYKGAS-DKGRIVINMKDEKKYVIDLSEKLSFDRM 215
K+++++ LDF+IR L + + LY+ + G I M D Y DLS+K ++
Sbjct: 160 --DKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 216 FDVMDSKQIKNIEVNLN 232
++ +IK IE +N
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003110TOXICSSTOXIN1934e-64 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 193 bits (491), Expect = 4e-64
Identities = 51/202 (25%), Positives = 92/202 (45%), Gaps = 10/202 (4%)

Query: 31 KQNQKSVNKHDKEALYRYYTGKTMEMKNISALKHGKNNLRFKFRGIKIQVLLPGNDKSKF 90
K + S N + K+ L Y +G + N L + ++R K I +++ +
Sbjct: 36 KTAKASTNDNIKDLLDWYSSG-SDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSP 94

Query: 91 QQRSYEGLDVFFVQEKRDKHD-----IFYTVGGVIQNNKTSGVVSAPILNISKEKGEDAF 145
E +D+ + K+ +H I + + GV K + P L + K G+D+
Sbjct: 95 AFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP-LKV-KVHGKDSP 152

Query: 146 VKGYPYYIKKEKITLKELDYKLRKHLIEKYGLYKTISKDGRV-KISLKDGSFYNLDLRSK 204
+K Y K+++ + LD+++R L + +GLY++ K G KI++ DGS Y DL K
Sbjct: 153 LK-YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKK 211

Query: 205 LKFKYMGEVIESKQIKDIEVNL 226
++ I +IK IE +
Sbjct: 212 FEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003140TOXICSSTOXIN1084e-31 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 108 bits (270), Expect = 4e-31
Identities = 47/225 (20%), Positives = 86/225 (38%), Gaps = 19/225 (8%)

Query: 16 LTTGMITTTAQPVKASTLEVRSQAT-------QDLSEYYKGRGFELTNVTGYKYG-NKVT 67
L T PV S+ ++ A +DL ++Y TN +
Sbjct: 15 LLLATTATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNSEVLDNSLGSMR 74

Query: 68 FIDNSQQIDVTLTGNE----KLTVKDDDEVSNVDVFVVREGSDKSAITTSIGGITKTNGT 123
+ I + + + T + +++ + S+ + I I G+T T
Sbjct: 75 IKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTE-- 132

Query: 124 QHKDTVQNVNLSVSKSTGQHTTSVTSEYYSIYKEEISLKELDFKLRKHLIDKHDLYKTEP 183
T + L V K G+ S K+++++ LDF++R L H LY++
Sbjct: 133 -KLPTPIELPLKV-KVHGK--DSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSD 188

Query: 184 KDSKI-RITMKNGGYYTFELNKKLQPHRMGDTIDSRNIEKIEVNL 227
K +ITM +G Y +L+KK + + I+ I+ IE +
Sbjct: 189 KTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003200BCTERIALGSPC310.002 Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 31.5 bits (71), Expect = 0.002
Identities = 17/83 (20%), Positives = 34/83 (40%), Gaps = 9/83 (10%)

Query: 184 INSNVPSYDAKFKMSNKDENVKQLRSRYNIPTDKAPILKMHIDGDLKGSSVGYKKLEIDF 243
+N VP Y+AK D V Q + RY + + + S G +++
Sbjct: 124 VNEEVPGYNAKIVSIRPDRVVLQYQGRYEV---------LGLYSQEDSGSDGVPGAQVNE 174

Query: 244 SKEENSELSIVDSLNFQPAKNKD 266
++ + ++ D ++F P N +
Sbjct: 175 QLQQRASTTMSDYVSFSPIMNDN 197


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003240BCTERIALGSPC353e-04 Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 34.6 bits (79), Expect = 3e-04
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 9/83 (10%)

Query: 177 INENVPSYDAKFKMSNKDENVKQLRSRYNIPTDKAPVLKMHIDGDLKGSSVGYKKLEIDF 236
+NE VP Y+AK D V Q + RY + + + S G +++
Sbjct: 124 VNEEVPGYNAKIVSIRPDRVVLQYQGRYEV---------LGLYSQEDSGSDGVPGAQVNE 174

Query: 237 SKGEKSDLSVIDSLNFQPAKVDE 259
+++ ++ D ++F P D
Sbjct: 175 QLQQRASTTMSDYVSFSPIMNDN 197


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003260BCTERIALGSPC290.025 Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 28.8 bits (64), Expect = 0.025
Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 9/83 (10%)

Query: 201 INENVPSYDAKFKMSNKDENVKQLRSRYNIPTDKSPVLKMHIDGNLKGSSVGDRKLEIDF 260
+NE VP Y+AK D V Q + RY + + + S G +++
Sbjct: 124 VNEEVPGYNAKIVSIRPDRVVLQYQGRYEV---------LGLYSQEDSGSDGVPGAQVNE 174

Query: 261 SKRENSHLSVIDSLDYQPAKVDE 283
++ + ++ D + + P D
Sbjct: 175 QLQQRASTTMSDYVSFSPIMNDN 197


7SAI8T7_1004070SAI8T7_1004310Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10040702140.482410Putative TrmH family tRNA/rRNA
SAI8T7_10040802150.167784Putative uncharacterized protein
SAI8T7_10040903222.286925Putative uncharacterized protein
SAI8T7_10041004282.929455Transcription antitermination protein NusG
SAI8T7_10041104303.08398650S ribosomal protein L1
SAI8T7_10041204323.54336650S ribosomal protein L10
SAI8T7_10041305353.900909Putative uncharacterized protein SA0499
SAI8T7_10041404343.839032DNA-directed RNA polymerase subunit beta
SAI8T7_10041502333.410681DNA-directed RNA polymerase subunit beta'
SAI8T7_10041601313.47212130S ribosomal protein S7
SAI8T7_10041700232.551012Elongation factor G
SAI8T7_10041800151.590508Elongation factor Tu
SAI8T7_10041902130.611565Aminoacylase
SAI8T7_10042002120.515831Putative pyridoxal phosphate-dependent
SAI8T7_10042102130.402366Molecular chaperone Hsp31 and glyoxalase 3
SAI8T7_1004220213-0.519064Ribulokinase
SAI8T7_1004230011-0.333110Uncharacterized epimerase/dehydratase SACOL0599
SAI8T7_1004240-112-1.099289Probable branched-chain-amino-acid
SAI8T7_1004250213-0.239297Putative phosphoglycolate phosphatase
SAI8T7_1004260418-0.092767Deoxypurine kinase
SAI8T7_1004270417-0.328749Deoxynucleoside kinase
SAI8T7_10042805180.687528Putative uncharacterized protein
SAI8T7_10042906190.737880FMN-dependent NADPH-azoreductase
SAI8T7_10043005170.397149Serine-aspartate repeat-containing protein C
SAI8T7_1004310315-0.480159Serine-aspartate repeat-containing protein D
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1004170TCRTETOQM6190.0 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 619 bits (1598), Expect = 0.0
Identities = 170/671 (25%), Positives = 299/671 (44%), Gaps = 66/671 (9%)

Query: 31 KTRNIGIMAHIDAGKTTTTERILYYTGRIHKIGETHEGASQMDWMEQEQDRGITITSAAT 90
K NIG++AH+DAGKTT TE +LY +G I ++G +G ++ D E+ RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 91 TAAWEGHRVNIIDTPGHVDFTVEVERSLRVLDGAVTVLDAQSGVEPQTETVWRQATTYGV 150
+ WE +VNIIDTPGH+DF EV RSL VLDGA+ ++ A+ GV+ QT ++ G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 151 PRIVFVNKMDKLGANFEYSVSTLHDRLQANAAPIQLPIGAEDEFEAIIDLVEMKCFKYTN 210
P I F+NK+D+ G + + ++L A Q K Y N
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ------------------KVELYPN 163

Query: 211 DLGTEIEEIEIPEDHLDRAEEARASLIEAVAETSDELMEKYLGDEEISVSELKEAIRQAT 270
T E E + V E +D+L+EKY+ + + EL++
Sbjct: 164 MCVTNFTESE---------------QWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRF 208

Query: 271 TNVEFYPVLCGTAFKNKGVQLMLDAVIDYLPSPLDVKPIIGHRASNPEEEVIAKADDSAE 330
N +PV G+A N G+ +++ + + S +E
Sbjct: 209 HNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH--------------------RGQSE 248

Query: 331 FAALAFKVMTDPYVGKLTFFRVYSGTMTSGSYVKNSTKGKRERVGRLLQMHANSRQEIDT 390
FK+ +L + R+YSG + V+ S K K ++ + +ID
Sbjct: 249 LCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDK 307

Query: 391 VYSGDIAAAVG----LKDTGTGDTLCGEKNDIILESMEFPEPVIHLSVEPKSKADQDKMT 446
YSG+I L GDT + + I E P P++ +VEP ++ +
Sbjct: 308 AYSGEIVILQNEFLKLNSV-LGDTKLLPQRERI----ENPLPLLQTTVEPSKPQQREMLL 362

Query: 447 QALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFNVECNVGAPMVSYRE 506
AL+++ + DP + D T ++I+ +G++ +++ ++++++VE + P V Y E
Sbjct: 363 DALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYME 422

Query: 507 TFKSSAQVQGKFSRQSGGRGQYGDVHIEFTPNETGAGFEFENAIVGGVVPREYIPSVEAG 566
+ + + + + + +P G+G ++E+++ G + + + +V G
Sbjct: 423 R--PLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG 480

Query: 567 LKDAMENGVLAGYPLIDVKAKLYDGSYHDVDSSEMAFKIAASLALKEAAKKCDPVILEPM 626
++ E G L G+ + D K G Y+ S+ F++ A + L++ KK +LEP
Sbjct: 481 IRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPY 539

Query: 627 MKVTIEMPEEYMGDIMGDVTSRRGRVDGMEPRGNAQVVNAYVPLSEMFGYATSLRSNTQG 686
+ I P+EY+ D + + + N +++ +P + Y + L T G
Sbjct: 540 LSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNG 599

Query: 687 RGTYTMYFDHY 697
R Y
Sbjct: 600 RSVCLTELKGY 610


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1004180TCRTETOQM862e-20 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 86.1 bits (213), Expect = 2e-20
Identities = 52/149 (34%), Positives = 81/149 (54%), Gaps = 7/149 (4%)

Query: 14 NIGTIGHVDHGKTTLTAAI---ATVLAKNGDSVAQSYDMIDNAPEEKERGITINTSHIEY 70
NIG + HVD GKTTLT ++ + + + G SV + DN E++RGITI T +
Sbjct: 5 NIGVLAHVDAGKTTLTESLLYNSGAITELG-SVDKGTTRTDNTLLERQRGITIQTGITSF 63

Query: 71 QTDKRHYAHVDCPGHADYVKNMITGAAQMDGGILVVSAADGPMPQTREHILLSRNVGVPA 130
Q + +D PGH D++ + + +DG IL++SA DG QTR R +G+P
Sbjct: 64 QWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPT 123

Query: 131 LVVFLNKVDMVDDEELLELVEMEVRDLLS 159
+ F+NK+D + L V ++++ LS
Sbjct: 124 -IFFINKIDQNGID--LSTVYQDIKEKLS 149


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1004230NUCEPIMERASE931e-23 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 92.9 bits (231), Expect = 1e-23
Identities = 77/328 (23%), Positives = 126/328 (38%), Gaps = 42/328 (12%)

Query: 3 KIMITGALGQIGTELVVKCREIYGTDNVLATDIREPEADSPVQNGPFEIL---------- 52
K ++TGA G IG + + E V+ D D ++ E+L
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLE--AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 53 DVTDRDRMFELVRDFEADSLMHMAALLSAT-AEKNPILAWDLNMGGLMNALEAARTYNL- 110
D+ DR+ M +L + + L+ + +NP D N+ G +N LE R +
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 111 HFFTPSSIGAFGDSTPKVNTPQVTIQQPTTMYGVNKVAGELLCQYYFKRFGVDTRSVRFP 170
H SS +G + + ++ P ++Y K A EL+ Y +G+ +RF
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF- 178

Query: 171 GLISHVKEPGGGTTDYAVEIYFKAVREGHYTSFIDKGTYM-DMMYMDDAIEAIIKLMEA- 228
V P G D A+ + KA+ EG + G D Y+DD EAII+L +
Sbjct: 179 ---FTVYGP-WGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234

Query: 229 --DDAKLETRNG-----------YNLSAMSFDPEMVKEAIQ--EYYPNFTLDYDVDPIRQ 273
D + G YN+ S P + + IQ E ++ P++
Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSS--PVELMDYIQALEDALGIEAKKNMLPLQP 292

Query: 274 GIANSWPDS-IDTSCSRGEWGFDPKYDL 300
G ++ DT GF P+ +
Sbjct: 293 GDV---LETSADTKALYEVIGFTPETTV 317


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1004310GPOSANCHOR360.001 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.2 bits (83), Expect = 0.001
Identities = 27/200 (13%), Positives = 59/200 (29%), Gaps = 8/200 (4%)

Query: 22 KFSIRKYTVGTASILVGTTLI-FGLGNQ-EAKAAESTNKELNE--ATTSASDNQSSDKVD 77
+S+RK GTAS+ V T++ GL +A +T + + +D +
Sbjct: 9 HYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADKFEIENNT 68

Query: 78 MQQLNQEDNTKNDNQKEMVSSQGNET-TSNGNKSIEKESVQSTTGNKVEVSTAKSDEQAS 136
++ N + + N K+ E + +S+ E+ K+D + +
Sbjct: 69 LKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKA 128

Query: 137 PKSTNEDLNTKQTISNQEGLQPDLLENKSVVNVQPTNEENKKVDAKTESTTLNVKSDAIK 196
+ + L + ++ T + +A K
Sbjct: 129 LEGAMNFSTADSAKIKTLEAEKAALAAR---KADLEKALEGAMNFSTADSAKIKTLEAEK 185

Query: 197 SNAETLVDNNSNSNNENNAD 216
+ E +
Sbjct: 186 AALEARQAELEKALEGAMNF 205


8SAI8T7_1004710SAI8T7_1004850Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1004710113-3.022826Periplasmic binding protein
SAI8T7_1004720113-3.458323Transport system permease protein
SAI8T7_1004730013-3.612555L-2-haloalkanoic acid dehalogenase
SAI8T7_1004740013-3.687743MW0576 protein
SAI8T7_1004750212-3.047839Putative uncharacterized protein
SAI8T7_1004760211-2.450569Putative uncharacterized protein
SAI8T7_1004770212-2.259607Putative esterase/lipase
SAI8T7_1004780111-1.914146Putative uncharacterized protein
SAI8T7_1004790-110-1.772849Putative uncharacterized protein
SAI8T7_100480009-1.732532Putative antiporter subunit mnhA2
SAI8T7_1004810010-1.848298Putative antiporter subunit mnhD2
SAI8T7_1004820110-2.177194Putative antiporter subunit mnhE2
SAI8T7_1004830111-1.534494Monovalent cation/H+ antiporter subunit G
SAI8T7_1004840110-1.811177Putative uncharacterized protein
SAI8T7_1004850211-1.960363Transposase
9SAI8T7_1005010SAI8T7_1005240Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1005010491.165170Ferrichrome transport permease
SAI8T7_1005020390.247415Iron-hydroxamate transport permease
SAI8T7_100503009-0.076097Putative dihydroxyacetone kinase
SAI8T7_1005040-19-0.894833Putative uncharacterized protein
SAI8T7_1005050-110-1.563061Putative uncharacterized protein
SAI8T7_1005060-29-1.849522Putative uncharacterized protein MW0616
SAI8T7_1005070-211-2.168194SA0610 protein
SAI8T7_1005080-212-2.998513Putative uncharacterized protein
SAI8T7_1005090-312-3.622045Putative uncharacterized protein
SAI8T7_1005100-38-2.672065Response regulator protein graR
SAI8T7_1005110010-1.706438Sensor histidine kinase graS
SAI8T7_1005120010-1.782226ABC transporter, ATP-binding protein
SAI8T7_1005130-18-3.249860ABC transporter permease
SAI8T7_100514009-2.028610Putative pit accessory protein
SAI8T7_100515009-1.503835SA0619 protein
SAI8T7_1005160-111-2.730141Secretory antigen SsaA homologue
SAI8T7_1005170-115-3.548779Putative Transporter
SAI8T7_1005180-113-2.938515Similar to AraC/XylS family transcriptional
SAI8T7_1005190214-1.140573Probable transcriptional regulatory protein
SAI8T7_1005200212-2.433506Putative uncharacterized protein
SAI8T7_1005210312-2.334828Putative uncharacterized protein
SAI8T7_1005220310-1.442845MW0634 protein
SAI8T7_1005230411-1.278947Transporter, major facilitator family protein
SAI8T7_1005240216-1.835246Putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1005080SACTRNSFRASE473e-09 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 47.2 bits (112), Expect = 3e-09
Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 3/97 (3%)

Query: 50 EYITSPHKVIFVAESDEQLVGFAFVNTTPFQRIKHVAKIDLGVKKLYQHRGIGQALLDAI 109
Y+ K F+ + +G + + + + D+ V K Y+ +G+G ALL
Sbjct: 58 SYVEEEGKAAFLYYLENNCIGRIKIRSN-WNGYALIE--DIAVAKDYRKKGVGTALLHKA 114

Query: 110 MAWCLNNQIHRIEANVPLNNQPALELFKSADFQIEGV 146
+ W N + N A + F I V
Sbjct: 115 IEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1005100HTHFIS645e-14 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 5e-14
Identities = 26/111 (23%), Positives = 57/111 (51%), Gaps = 1/111 (0%)

Query: 3 ILLVEDDNTLFQELKKELEQWDFNVAGIEDFGKVMDTFESFNPEIVILDVQLPKYDGFYW 62
IL+ +DD + L + L + ++V + + + + ++V+ DV +P + F
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 63 CRKMREV-SNVPILFLSSRDNPMDQVMSMELGADDYMQKPFYTNVLIAKLQ 112
++++ ++P+L +S+++ M + + E GA DY+ KPF LI +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1005120PF05272361e-04 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 36.2 bits (83), Expect = 1e-04
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 8/56 (14%)

Query: 57 GPSGSGKTTLLNVLSSIDYISQGSITLKGKK--LEKLSNK------ELSDIRKHDI 104
G G GK+TL+N L +D+ S + K E+++ E++ R+ D
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADA 658


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1005230TCRTETA583e-11 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 57.5 bits (139), Expect = 3e-11
Identities = 73/365 (20%), Positives = 134/365 (36%), Gaps = 41/365 (11%)

Query: 35 KNYKLFVA--NMFLLGMGIAVTVPYLVLFATKDLGMTTNQ---YGLLLASAAISQFTVNS 89
N L V + L +GI + +P L +DL + + YG+LLA A+ QF
Sbjct: 3 PNRPLIVILSTVALDAVGIGLIMPVLP-GLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 90 IIARFSDTHHFNRKIIIILALLMGALGFSIYFFVDTIWLFILLYAIFQGLFAPAMPQLYA 149
++ SD F R+ +++++L A+ ++I +W+ + + I G+
Sbjct: 62 VLGALSD--RFGRRPVLLVSLAGAAVDYAIMATAPFLWV-LYIGRIVAGITGATGA---V 115

Query: 150 SARESINVSSSKDRAQFANTVLRSMFSLGFLFGPFIGAQLIGLKGYAGLFGGTISIILFT 209
+ +++ +RA+ + + F G + GP +G + G +A F L
Sbjct: 116 AGAYIADITDGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNF 174

Query: 210 LVLQVFFYKDLNIKHPISTQQHVEKIAPNMFKDKTL--------LLPFIAFILLHIGQWM 261
L + ++ + + A N L + FI+ +GQ
Sbjct: 175 LTGCFLLPESHK-----GERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229

Query: 262 YTMNMPLFVTDYLKENEQHVGYLASLCAGLEVPFMIIL-GVLSSRLHTRTLLIYGAIFGG 320
+ +F D + +G + L ++ G +++RL R L+ G I G
Sbjct: 230 AAL-WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADG 288

Query: 321 LFYFSIGVFKNFYMMLAGQVFLAIFLAVLLGIGISYFQDILPDFPGYASTLFSNAMVIGQ 380
Y + +M F + L GIG +P S GQ
Sbjct: 289 TGYILLAFATRGWM-----AFPIMVLLASGGIG-------MPALQAMLSRQVDEERQ-GQ 335

Query: 381 LGGNL 385
L G+L
Sbjct: 336 LQGSL 340



Score = 49.4 bits (118), Expect = 1e-08
Identities = 44/186 (23%), Positives = 73/186 (39%), Gaps = 13/186 (6%)

Query: 239 MFKDKTLLLPFIAFILLHIGQWMYTMNMPLFVTDYLKENEQ--HVGYLASLCAGLEVPFM 296
M ++ L++ L +G + +P + D + N+ H G L +L A ++
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 297 IILGVLSSRLHTRTLLIYGAIFGGLFYFSIGVFKNFYMMLAGQVFLAIFLAVLLGIGISY 356
+LG LS R R +L+ + Y + +++ G++ I A G +Y
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AY 119

Query: 357 FQDILPD-----FPGYASTLFSNAMVIGQLGGNLLGGAMSHWVGLENVFFVSAASIMLGM 411
DI G+ S F MV G + G L+GG H FF +AA L
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPH-----APFFAAAALNGLNF 174

Query: 412 ILIFFT 417
+ F
Sbjct: 175 LTGCFL 180


10SAI8T7_1006040SAI8T7_1006330Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10060402192.245305Nucleoside-diphosphate sugar epimerase
SAI8T7_10060502212.745992Putative uncharacterized protein
SAI8T7_10060603302.935445Glycolytic operon regulator
SAI8T7_10060703342.444332Glyceraldehyde-3-phosphate dehydrogenase 1
SAI8T7_10060801281.780237Phosphoglycerate kinase
SAI8T7_10060901191.502026Triosephosphate isomerase
SAI8T7_10061001160.5559542,3-bisphosphoglycerate-independent
SAI8T7_1006110014-0.582274Enolase
SAI8T7_1006120011-1.341374Putative membrane spanning protein
SAI8T7_1006130311-0.486227Putative Carboxyesterase homologue
SAI8T7_10061406141.607096Ribonuclease R
SAI8T7_10061507150.784605Putative uncharacterized protein
SAI8T7_10061605151.109777Putative uncharacterized protein
SAI8T7_10061705151.254877Putative uncharacterized protein
SAI8T7_10061804141.209504Transposase for insertion sequence element IS256
SAI8T7_10061904150.626807Clumping factor A
SAI8T7_1006200012-2.151795Coagulase family protein
SAI8T7_1006210113-1.650844Putative Extracellular matrix protein-binding
SAI8T7_1006220-115-1.933508Putative uncharacterized protein
SAI8T7_1006230-112-1.550712Thermonuclease
SAI8T7_1006240012-1.827080Putative uncharacterized protein
SAI8T7_1006250-112-1.750278Phosphoglycerate mutase family protein
SAI8T7_1006260116-2.120414Similar to transporter, LysE family
SAI8T7_1006270521-1.091789Putative uncharacterized protein
SAI8T7_1006280722-0.2826873-dehydroquinate dehydratase
SAI8T7_1006290925-0.266086Putative uncharacterized protein
SAI8T7_1006300926-0.112363Putative uncharacterized protein truncated-SA
SAI8T7_10063108240.275741Transposase A from transposon Tn554
SAI8T7_10063206210.842260Transposase B from transposon Tn554
SAI8T7_10063304160.276107Streptomycin=3''-adenylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006050IGASERPTASE300.010 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 29.6 bits (66), Expect = 0.010
Identities = 24/162 (14%), Positives = 62/162 (38%), Gaps = 5/162 (3%)

Query: 49 SNKAKERMLNEQKQEQKEKRQKENAEKERKKKQQEEKEQNELDSQANQYQQLPQQNQYQY 108
S + N +++ + ++ +++A E + +E ++ + + +AN Q+ +
Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDA-TETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092

Query: 109 VPPQQQAPTKQRPAKEENDDKASKDESKDKDDNASQDKSDDNQKKTDDNKQPAQPKPQP- 167
Q + ++E K +++++ SQ Q +T + + P
Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152

Query: 168 ---QQPTPKPNNNQQNNQSNQQAKPQAPQQNSQSTTNKQNNA 206
++P + N Q ++ Q ++STT N+
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006170ALARACEMASE270.049 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 26.7 bits (59), Expect = 0.049
Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 135 MYDIYP-PYDGIPDEAFLI-KELKVNSLAGKTGTINY 169
D+ P P GI L KE+K++ +A GT+ Y
Sbjct: 305 AVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGY 341


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006190ICENUCLEATIN437e-06 Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 42.8 bits (100), Expect = 7e-06
Identities = 72/369 (19%), Positives = 131/369 (35%), Gaps = 6/369 (1%)

Query: 514 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASD 573
S G+DS + S +G +ST +G S + SD + S + DS+
Sbjct: 294 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 353

Query: 574 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSD 633
+ S + DS + S + SD + S + +DS+ + S + +
Sbjct: 354 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 413

Query: 634 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
S + S + SD + S + DS + S + DS + S
Sbjct: 414 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 473

Query: 694 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 753
+ SD + S S + +S + S + S + S + ++SD +
Sbjct: 474 AQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYG 533

Query: 754 SDSDSDSDSD------SDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 807
S S + ++S S + +S + S + SD + S + SDS
Sbjct: 534 STSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSII 593

Query: 808 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSD 867
+ S + S + S + S +G S S++ +DS + GS + +
Sbjct: 594 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYN 653

Query: 868 SNSDSESGS 876
S + GS
Sbjct: 654 SILTAGYGS 662



Score = 42.8 bits (100), Expect = 7e-06
Identities = 74/379 (19%), Positives = 135/379 (35%), Gaps = 6/379 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + E+S G S + S +G ST +G DS+ + S + DS
Sbjct: 309 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 368

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + SD + S + +DS+ + S + +S + S +
Sbjct: 369 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 428

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
SD + S + DS + S + DS + S + SD + S S
Sbjct: 429 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTS 488

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD----- 739
+ +S + S + S + S + ++SD + S S + ++S
Sbjct: 489 TAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGY 548

Query: 740 -SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 798
S + +S + S + SD + S + SDS + S + S
Sbjct: 549 GSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSL 608

Query: 799 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 858
+ S + S + S S + +DS + S +G +S ++ S T+
Sbjct: 609 TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQE 668

Query: 859 GSDNDSDSDSNSDSESGSN 877
GSD + S S + + S+
Sbjct: 669 GSDLTAGYGSTSTAGADSS 687



Score = 42.0 bits (98), Expect = 1e-05
Identities = 69/363 (19%), Positives = 125/363 (34%), Gaps = 6/363 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + EDS G S + S +G ST +G+DS+ + S + +S
Sbjct: 261 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQ 320

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + SD + S + DS+ + S + DS + S +
Sbjct: 321 TAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 380

Query: 625 DSDSDSDSDSDSDSDSDSD------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
SD + S + +DS S + +S + S + SD + S
Sbjct: 381 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 440

Query: 679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 738
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 441 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGY 500

Query: 739 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 798
S + S + S + ++SD + S S + ++S + S + +S
Sbjct: 501 GSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVL 560

Query: 799 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 858
+ S + SD + S + SDS + S + S ++ S T+
Sbjct: 561 TAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTARE 620

Query: 859 GSD 861
S
Sbjct: 621 QSV 623



Score = 42.0 bits (98), Expect = 1e-05
Identities = 72/369 (19%), Positives = 132/369 (35%), Gaps = 6/369 (1%)

Query: 514 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASD 573
S G DS + S +G DS+ +G S + SD + S + +DS+
Sbjct: 342 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLI 401

Query: 574 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSD 633
+ S + +S + S + SD + S + DS+ + S + D
Sbjct: 402 AGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 461

Query: 634 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
S + S + SD + S S + +S + S + S + S
Sbjct: 462 SSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQT 521

Query: 694 SDSDSDSDSDSDSDSDSDSDSD------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 747
+ ++SD + S S + ++S S + +S + S + SD +
Sbjct: 522 AQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYG 581

Query: 748 SDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 807
S + SDS + S + S + S + S + S S + +DS
Sbjct: 582 STGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 641

Query: 808 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSD 867
+ S + +S + S + SD +G S S++ +DS + GS + +
Sbjct: 642 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYN 701

Query: 868 SNSDSESGS 876
S + GS
Sbjct: 702 SILTAGYGS 710



Score = 41.3 bits (96), Expect = 2e-05
Identities = 76/379 (20%), Positives = 138/379 (36%), Gaps = 6/379 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSA------SDSDSASDSDS 558
G + EDS G S + S +G ST +G+DS+ S + +S
Sbjct: 357 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQ 416

Query: 559 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 618
+ S + SD + S + DS+ + S + DS + S +
Sbjct: 417 TAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 476

Query: 619 ASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
SD + S S + +S + S + S + S + ++SD + S S
Sbjct: 477 GSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTS 536

Query: 679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 738
+ ++S + S + +S + S + SD + S + SDS +
Sbjct: 537 TAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGY 596

Query: 739 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 798
S + S + S + S + S S + +DS + S + +S
Sbjct: 597 GSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 656

Query: 799 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 858
+ S ++ SD + S S + +DS + S +G +S ++ S T+
Sbjct: 657 TAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQE 716

Query: 859 GSDNDSDSDSNSDSESGSN 877
GSD S S S + + S+
Sbjct: 717 GSDLTSGYGSTSTAGADSS 735



Score = 41.3 bits (96), Expect = 2e-05
Identities = 74/372 (19%), Positives = 133/372 (35%), Gaps = 2/372 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + + SD G S + DS +G ST +G DS+ + S + SD
Sbjct: 229 GSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 288

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + + S + S + +S + S + SD + S +
Sbjct: 289 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 348

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + DS + S + SD + S + +DS + S
Sbjct: 349 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQ 408

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ +S + S + SD + S + DS + S + DS +
Sbjct: 409 TAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 468

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD + S S + +S + S + S + S + ++SD
Sbjct: 469 GSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDL 528

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
+ S S + ++S + S + +S +G S ++ SD T+ GS +
Sbjct: 529 ITGYGSTSTAGANSSLIAGYGSTQTASYNSV--LTAGYGSTQTAREGSDLTAGYGSTGTA 586

Query: 865 DSDSNSDSESGS 876
SDS+ + GS
Sbjct: 587 GSDSSIIAGYGS 598



Score = 40.9 bits (95), Expect = 3e-05
Identities = 74/369 (20%), Positives = 135/369 (36%), Gaps = 6/369 (1%)

Query: 514 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA-- 571
S G+DS + S +G +ST +G S + SD + S + DS+
Sbjct: 390 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 449

Query: 572 ----SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSD 627
S + DS + S + SD + S S + +S+ + S +
Sbjct: 450 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYG 509

Query: 628 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 687
S + S + ++SD + S S + ++S + S + +S + S
Sbjct: 510 STLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQT 569

Query: 688 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 747
+ SD + S + SDS + S + S + S + S +
Sbjct: 570 AREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYG 629

Query: 748 SDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 807
S S + +DS + S + +S + S ++ SD + S S + +DS
Sbjct: 630 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLI 689

Query: 808 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSD 867
+ S + +S + S + SD SG S S++ +DS + GS +
Sbjct: 690 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYH 749

Query: 868 SNSDSESGS 876
S+ + GS
Sbjct: 750 SSLTAGYGS 758



Score = 40.9 bits (95), Expect = 3e-05
Identities = 71/378 (18%), Positives = 130/378 (34%), Gaps = 4/378 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G E + S G S + +DS +G ST +G +S+ + S SD
Sbjct: 181 GSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDL 240

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + S + S + DS+ + S + SD + S + +
Sbjct: 241 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 300

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + +S + S + SD + S + DS + S
Sbjct: 301 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 360

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ DS + S + SD + S + +DS + S + +S +
Sbjct: 361 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY 420

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD + S + DS + S + +S + S + SD
Sbjct: 421 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 480

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSD----SDSTSDTGS 860
+ S S + +S + S + S + GS + ++SD STS G+
Sbjct: 481 TAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGA 540

Query: 861 DNDSDSDSNSDSESGSNN 878
++ + S + N+
Sbjct: 541 NSSLIAGYGSTQTASYNS 558



Score = 39.4 bits (91), Expect = 8e-05
Identities = 70/381 (18%), Positives = 126/381 (33%), Gaps = 14/381 (3%)

Query: 510 IPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDS--------------ASDSDSASD 555
+P D D +SGS + + + ST S +S +
Sbjct: 138 LPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYG 197

Query: 556 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD 615
S + +DS + S + +S + S SD + S + DS+
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 616 SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 675
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 676 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 735
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 736 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSD 795
+ SD + S + +DS + S + +S + S ++ SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 796 SDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDST 855
S + DS + S + DS + S + SD +G S S++ +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 856 SDTGSDNDSDSDSNSDSESGS 876
+ GS + S + GS
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGS 518



Score = 38.6 bits (89), Expect = 1e-04
Identities = 73/381 (19%), Positives = 139/381 (36%), Gaps = 4/381 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + E+S G S + S +G ST +G DS+ + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + SD + S S + +S+ + S + S + S + +
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
+SD + S S + ++S + S + +S + S + SD + S
Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ SDS + S + S + S + S + S S + +DS +
Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + +S + S + SD + S S + ++S + S + +S
Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS----DSDSTSDTGS 860
+ S + SD S S S + +DS+ + GS +S S ST
Sbjct: 705 TAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTARE 764

Query: 861 DNDSDSDSNSDSESGSNNNVV 881
+ + S S +G++++++
Sbjct: 765 QSVLTTGYGSTSTAGADSSLI 785



Score = 37.4 bits (86), Expect = 4e-04
Identities = 70/375 (18%), Positives = 131/375 (34%), Gaps = 6/375 (1%)

Query: 513 DSDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 570
S + GS + S +G ST +G+DS + S + +S + S
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 571 ASDSDSASDSDSASDSDSASDSDSA----SDSDSASDSDSASDSDSASDSDSASDSDSDS 626
SD + S + DS+ S + DS+ + S + SD +
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 627 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 686
S + +DS + S + +S + S + SD + S + DS
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDS 350

Query: 687 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 746
+ S + DS + S + SD + S + +DS + S +
Sbjct: 351 SLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTA 410

Query: 747 DSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDS 806
+S + S + SD + S + DS + S + DS + S
Sbjct: 411 GEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 470

Query: 807 DSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDS 866
+ SD + S S + +S + S + S+ + ST +++D +
Sbjct: 471 TQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLIT 530

Query: 867 DSNSDSESGSNNNVV 881
S S +G+N++++
Sbjct: 531 GYGSTSTAGANSSLI 545



Score = 37.0 bits (85), Expect = 4e-04
Identities = 75/372 (20%), Positives = 135/372 (36%), Gaps = 2/372 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + + SD G S + DS +G ST +G DS+ + S + SD
Sbjct: 421 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 480

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S S + S + S + S + S + ++SD + S S + +
Sbjct: 481 TAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGA 540

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
+S + S + +S + S + SD + S + SDS + S
Sbjct: 541 NSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQ 600

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ S + S + S + S S + +DS + S + +S +
Sbjct: 601 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGY 660

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD + S S + +DS + S + S + S + SD
Sbjct: 661 GSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDL 720

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
S S S + +DS + S + S+ +G S ++ S T+ GS + +
Sbjct: 721 TSGYGSTSTAGADSSLIAGYGSTQTASYHSS--LTAGYGSTQTAREQSVLTTGYGSTSTA 778

Query: 865 DSDSNSDSESGS 876
+DS+ + GS
Sbjct: 779 GADSSLIAGYGS 790



Score = 37.0 bits (85), Expect = 4e-04
Identities = 75/373 (20%), Positives = 135/373 (36%), Gaps = 2/373 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + + SD G S S + +S +G ST +G S + S + ++SD
Sbjct: 469 GSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDL 528

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S S + + S + S + +S + S + SD + S + S
Sbjct: 529 ITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGS 588

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + S + S + S + S S + +DS + S
Sbjct: 589 DSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQ 648

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ +S + S + SD + S S + +DS + S + +S +
Sbjct: 649 TAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGY 708

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD S S S + +DS + S + S + S + S
Sbjct: 709 GSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 768

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
+ S S + +DS + S + S +G S ++ SD T+ GS + +
Sbjct: 769 TTGYGSTSTAGADSSLIAGYGSTQTAGYHSI--LTAGYGSTQTAQERSDLTTGYGSTSTA 826

Query: 865 DSDSNSDSESGSN 877
+DS+ + GS
Sbjct: 827 GADSSLIAGYGST 839



Score = 37.0 bits (85), Expect = 4e-04
Identities = 67/339 (19%), Positives = 124/339 (36%), Gaps = 2/339 (0%)

Query: 514 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 571
DS + GS + GSD +G STS +G +S+ + S + S + S
Sbjct: 460 EDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGST 519

Query: 572 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 631
+ + SD + S S + ++S+ + S ++ +S + S + SD +
Sbjct: 520 QTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAG 579

Query: 632 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
S + SDS + S + S + S + S + S S + +DS
Sbjct: 580 YGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSS 639

Query: 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 751
+ S + +S + S + SD + S S + +DS + S +
Sbjct: 640 LIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAG 699

Query: 752 SDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESD 811
+S + S + SD S S S + ++S + S + S + S
Sbjct: 700 YNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGST 759

Query: 812 SDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ S + S S + +DS+ + GS + S
Sbjct: 760 QTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHS 798



Score = 33.6 bits (76), Expect = 0.005
Identities = 56/335 (16%), Positives = 107/335 (31%)

Query: 527 NSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 586
S +D + + + S ++ D D+ +S S +
Sbjct: 97 TKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPT 156

Query: 587 DSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDS 646
+ + S S + S + +S + S + +DS + S
Sbjct: 157 QTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQ 216

Query: 647 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 706
+ +S + S SD + S + DS + S + DS +
Sbjct: 217 TAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 276

Query: 707 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 766
S + SD + S + +DS + S + +S + S + SD
Sbjct: 277 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 336

Query: 767 DSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDS 826
+ S + DS + S + DS + S ++ SD + S + +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 827 DSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSD 861
DS + S +G +S ++ S T+ GSD
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSD 431



Score = 33.2 bits (75), Expect = 0.005
Identities = 66/339 (19%), Positives = 125/339 (36%), Gaps = 2/339 (0%)

Query: 514 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 571
S + GS + + SD +G STS +G++S+ + S ++ +S + S
Sbjct: 508 YGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGST 567

Query: 572 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 631
+ SD + S + SDS+ + S ++ S + S + S +
Sbjct: 568 QTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTG 627

Query: 632 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
S S + +DS + S + +S + S + SD + S S + +DS
Sbjct: 628 YGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSS 687

Query: 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 751
+ S + +S + S + SD S S S + +DS + S +
Sbjct: 688 LIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTAS 747

Query: 752 SDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESD 811
S + S + S + S S + ++S + S + S + S
Sbjct: 748 YHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGST 807

Query: 812 SDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ SD + S S + +DS+ + GS + +S
Sbjct: 808 QTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNS 846



Score = 32.8 bits (74), Expect = 0.008
Identities = 74/372 (19%), Positives = 135/372 (36%), Gaps = 2/372 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + +SD G S S + ++S +G ST + +S + S + SD
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + S S + S + S+ + S + S + S S + +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + +S + S + SD + S S + +DS + S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ +S + S + SD S S S + +DS + S + S +
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + S + S S + +DS + S + S + S + SD
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
+ S S + +DS + S + +S +G S ++ +SD T+ GS + +
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSI--LTAGYGSTQTAQENSDLTTGYGSTSTA 874

Query: 865 DSDSNSDSESGS 876
DS+ + GS
Sbjct: 875 GYDSSLIAGYGS 886



Score = 32.8 bits (74), Expect = 0.008
Identities = 67/360 (18%), Positives = 131/360 (36%)

Query: 522 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 581
+G S +G S + ++S + S S + ++S+ + S ++ +S +
Sbjct: 506 AGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYG 565

Query: 582 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 641
S + SD + S + SDS+ + S ++ S + S + S
Sbjct: 566 STQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLT 625

Query: 642 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
+ S S + +DS + S + +S + S + SD + S S + +D
Sbjct: 626 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
S + S + +S + S + SD S S S + +DS + S
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 762 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSE 821
+ S + S + S + S S + +DS + S + S +
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYG 805

Query: 822 SDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVV 881
S + SD + S S + +DSS + ST G ++ + S + N+++
Sbjct: 806 STQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 865



Score = 31.6 bits (71), Expect = 0.018
Identities = 77/387 (19%), Positives = 138/387 (35%), Gaps = 14/387 (3%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSA--------------SDS 550
G + E SD G S + SDS +G ST + S+ S
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 551 DSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 610
+ S S + +DS+ + S + +S + S + SD + S S + +
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 611 DSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 670
DS+ + S + +S + S + SD S S S + +DS + S
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 671 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 730
+ S + S + S + S S + +DS + S + S +
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 731 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDS 790
S + SD + S S + +DS + S + +S + S ++ +SD
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 791 DSDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ S S + DS + S + +S + S + SD +G S S++
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 851 DSDSTSDTGSDNDSDSDSNSDSESGSN 877
+S + GS + S + GS+
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSS 951



Score = 31.3 bits (70), Expect = 0.026
Identities = 68/339 (20%), Positives = 125/339 (36%), Gaps = 2/339 (0%)

Query: 514 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 571
+S + GS + GSD +G ST +GSDS+ + S ++ S + S
Sbjct: 556 YNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGST 615

Query: 572 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 631
+ S + S S + +DS+ + S + +S + S + SD +
Sbjct: 616 QTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAG 675

Query: 632 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
S S + +DS + S + +S + S + SD S S S + +DS
Sbjct: 676 YGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSS 735

Query: 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 751
+ S + S + S + S + S S + +DS + S +
Sbjct: 736 LIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAG 795

Query: 752 SDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESD 811
S + S + SD + S S + ++S + S + +S + S
Sbjct: 796 YHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGST 855

Query: 812 SDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ +SD + S S + DS+ + GS + +S
Sbjct: 856 QTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNS 894



Score = 31.3 bits (70), Expect = 0.026
Identities = 67/360 (18%), Positives = 128/360 (35%)

Query: 522 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 581
+ +S +G S + S + S + SDS+ + S ++ S +
Sbjct: 554 ASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYG 613

Query: 582 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 641
S + S + S S + +DS+ + S + +S + S + SD
Sbjct: 614 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLT 673

Query: 642 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
+ S S + +DS + S + +S + S + SD S S S + +D
Sbjct: 674 AGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGAD 733

Query: 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
S + S + S + S + S + S S + +DS + S
Sbjct: 734 SSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQT 793

Query: 762 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSE 821
+ S + S + SD + S S + +DS + S + +S +
Sbjct: 794 AGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYG 853

Query: 822 SDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVV 881
S + +SD + S S + DSS + ST G ++ + S + N+++
Sbjct: 854 STQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 913


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006200IGASERPTASE310.016 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.016
Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 4/162 (2%)

Query: 261 ALKLKADTEAAKNDVSKRSKRSLNTQNNKST-TQEISEEQKAEYQRKSEALKERFINRQK 319
+KA+T+ + S + T K T T E E+ K E ++ E K
Sbjct: 1073 KSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT-SQVSP 1131

Query: 320 SKNESVVSLIDDEDDNENDRQLVVSAPSKKPTTPTTYTETTTQVPMPTVERQTQQQIVYK 379
+ +S E END + + P + T + + + T+ V
Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNT 1191

Query: 380 TPKPLAGLNGESHDFTTTHQSPTTSNHTHNNVVEFEETSALP 421
+ N E+ TT + + + ++P
Sbjct: 1192 GNSVVE--NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006240PF05704280.035 Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 27.5 bits (61), Expect = 0.035
Identities = 13/69 (18%), Positives = 24/69 (34%), Gaps = 7/69 (10%)

Query: 116 EWVKKNYENTNHRYLVTLNLNSK-------KFTYCTKIIYQAYKFGVSEKSVKSYGLHII 168
W + Y N + +++ N + + YK + +Y HI
Sbjct: 239 YWKEIPYVNNVNPHMLQYLGNLPYDNSMFNYIKSTSPVQKLTYKLDYNNLKRNTYYDHIF 298

Query: 169 SPYAIKDNF 177
S +KDN+
Sbjct: 299 SIDKLKDNY 307


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006270SACTRNSFRASE310.001 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.1 bits (70), Expect = 0.001
Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 6/91 (6%)

Query: 53 IVFGCYENETLIATAALEQI--RYVGKEHKSLIKYNFVTNNDKSINSELINFIINYARQN 110
F Y I + Y E ++ K K + + L++ I +A++N
Sbjct: 66 AAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAK----DYRKKGVGTALLHKAIEWAKEN 121

Query: 111 NYESLLTSIVSNNIGAKVFYSALGFDILGFE 141
++ L+ NI A FY+ F I +
Sbjct: 122 HFCGLMLETQDINISACHFYAKHHFIIGAVD 152


11SAI8T7_1006820SAI8T7_1006970Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_100682009-3.009045Similar to hydrolase, haloacid dehalogenase-like
SAI8T7_1006830111-4.004782Putative O-acetyltransferase MW0856
SAI8T7_1006840112-4.364554Chaperone protein ClpB
SAI8T7_1006850417-5.698086Putative uncharacterized protein
SAI8T7_1006860315-2.826499HMGL-like protein
SAI8T7_1006870314-1.779588Putative uncharacterized protein
SAI8T7_1006880111-0.646008Putative uncharacterized protein
SAI8T7_10068900111.636458Putative uncharacterized protein
SAI8T7_1006900-1101.6149203-oxoacyl-[acyl-carrier-protein] synthase 3
SAI8T7_10069100110.9771873-oxoacyl-[acyl-carrier-protein] synthase 2
SAI8T7_1006920110-0.354410Oligopeptide transport system permease protein
SAI8T7_1006930110-0.690431Oligopeptide transport system permease protein
SAI8T7_1006940211-1.118185Oligopeptide transport system ATP-binding
SAI8T7_1006950312-2.123458Oligopeptide transport system ATP-binding
SAI8T7_1006960314-2.358895Similar to peptide binding protein OppA
SAI8T7_1006970313-2.194237Similar to oligopeptide ABC
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006840IGASERPTASE367e-04 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 7e-04
Identities = 17/143 (11%), Positives = 48/143 (33%), Gaps = 14/143 (9%)

Query: 420 QLEIEESALKNESDNASKQRLQELQEELANEKEKQAALQSRVESEKEKIANLQEKRAQLD 479
E+ +S + + Q + + ++EK + + + + + K+ Q +
Sbjct: 1082 TNEVAQSGSETKE----TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137

Query: 480 ESRQALEDAQTNNNLEKAAELQYGTIPQLEKELRELEDNFQDEQGEDTDRMIREVVTDEE 539
+ E A+ N+ E Q + + ++ ++T + + VT+
Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQ-------SQTNTTAD---TEQPAKETSSNVEQPVTEST 1187

Query: 540 IGDIVSQWTGIPVSKLVETEREK 562
+ + P + T +
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPT 1210


12SAI8T7_1007980SAI8T7_1008030Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1007980413-1.065845Similar to methyltransferase
SAI8T7_1007990414-1.048854Phosphopantetheine adenylyltransferase
SAI8T7_1008000512-1.600947UPF0348 protein NWMN_0989
SAI8T7_1008010412-1.702031Putative uncharacterized protein
SAI8T7_1008020413-1.754714Iron-regulated surface determinant protein B
SAI8T7_1008030213-2.757987Iron-regulated surface determinant protein A
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1007990LPSBIOSNTHSS2191e-76 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 219 bits (560), Expect = 1e-76
Identities = 77/155 (49%), Positives = 112/155 (72%)

Query: 5 IAVIPGSFDPITYGHLDIIERSTDRFDEIHVCVLKNSKKEGTFSLEERMDLIEQSVKHLP 64
A+ PGSFDPIT+GHLDIIER FD+++V VL+N K+ FS++ER++ I +++ HLP
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVKVHQFSGLLVDYCEQVGAKTIIRGLRAVSDFEYELRLTSMNKKLNNEIETLYMMSSTN 124
N +V F GL V+Y Q A I+RGLR +SDFE EL++ + NK L +++ET+++ +ST
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 125 YSFISSSIVKEVAAYRADISEFVPPYVEKALKKKF 159
YSF+SSS+VKEVA + ++ FVP +V AL +F
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQF 156


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008020IGASERPTASE366e-04 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 6e-04
Identities = 37/194 (19%), Positives = 71/194 (36%), Gaps = 15/194 (7%)

Query: 447 RIVDKEAFTKANTDKSNKKEQQDNSAKKEA---------TPATPSKPTPSPVEKESQKQD 497
+ VD T N +++ N+ + PATPS+ T + E Q+
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 498 SQKDDNKQLPSVEKENDASSESGKDKTPATKPT------KGEVESSSTTPTKVVSTTQNV 551
+ + + + +N ++ K A T E + + TT TK +T +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 552 AKPTTASSKTTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLP 611
K + KT + TS S + + S +Q + T + +Q N
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 612 QTGEESNKDMTLPL 625
Q +E++ ++ P+
Sbjct: 1170 QPAKETSSNVEQPV 1183



Score = 30.0 bits (67), Expect = 0.035
Identities = 27/156 (17%), Positives = 45/156 (28%), Gaps = 5/156 (3%)

Query: 37 EAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPET-KPVANAVSVSNKEVEAPTSETKE 95
A EE TE + V S + ++ ET +P A ++ V +++
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 96 AKEVKEVKAPKETKAVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEM 155
+ KET + + T N + NP + P
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE----NPENTTPATTQPTVNSESSNK 1218

Query: 156 KKENGEQQFYHYASSVKPARVIFTDSKPEIELGLQS 191
K + +V+PA D L S
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008030IGASERPTASE340.001 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.001
Identities = 27/132 (20%), Positives = 44/132 (33%), Gaps = 4/132 (3%)

Query: 184 ADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVSTDTT 243
+ A+ + P PA P TE + K + + +V +
Sbjct: 1015 EEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKS 1074

Query: 244 KDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNET 303
+ TQT + AQ+ E + QT + V K+Q+ KVT +
Sbjct: 1075 NVKANTQTN---EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS-QVS 1130

Query: 304 PKQASKAKELPK 315
PKQ P+
Sbjct: 1131 PKQEQSETVQPQ 1142


13SAI8T7_1008880SAI8T7_1008990Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1008880211-1.833700Ribonuclease HII
SAI8T7_1008890210-1.230184Succinyl-CoA ligase [ADP-forming] subunit beta
SAI8T7_1008900211-1.134523Succinyl-CoA ligase [ADP-forming] subunit alpha
SAI8T7_1008910210-1.850900Probable cell wall hydrolase lytN
SAI8T7_1008920110-0.804860FmhC protein
SAI8T7_1008930190.808711Similar to DNA processing Smf protein
SAI8T7_1008940391.069846DNA topoisomerase
SAI8T7_10089503120.974671Methylenetetrahydrofolate--tRNA-(uracil-5-)-
SAI8T7_10089605161.044862Tyrosine recombinase XerC
SAI8T7_10089704181.666387ATP-dependent protease subunit HslV
SAI8T7_10089805191.173932ATP-dependent protease ATPase subunit HslU
SAI8T7_10089904200.500853GTP-sensing transcriptional pleiotropic
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008910GPOSANCHOR362e-04 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 36.2 bits (83), Expect = 2e-04
Identities = 15/90 (16%), Positives = 32/90 (35%), Gaps = 4/90 (4%)

Query: 12 MNKQQSKVRYSIRKVSIGILSISIGMFLALGMSNKAYADEIDKSKDFTRGYEQNVFAKSE 71
M K + YS+RK+ G S+++ AL + ++ + + K +
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAV----ALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQ 56

Query: 72 LNANKNTTKDKIKNEGAVKTSDTSLKLDNK 101
A+K ++ S + L +
Sbjct: 57 ERADKFEIENNTLKLKNSDLSFNNKALKDH 86


14SAI8T7_1009620SAI8T7_1009680Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1009620211-2.274390Putative uncharacterized protein
SAI8T7_1009630312-2.123750LexA repressor
SAI8T7_1009640311-1.632073Transketolase
SAI8T7_1009650210-1.472199Putative uncharacterized protein
SAI8T7_1009660310-1.408928Possible exonuclease SbcD
SAI8T7_1009670311-1.103097Putative Nuclease sbcCD subunit C
SAI8T7_10096802131.032397Putative uncharacterized protein MW1234
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1009640CHANLCOLICIN300.041 Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.7 bits (66), Expect = 0.041
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 5/58 (8%)

Query: 296 QNTMLKRANEDESQ-----WNSLLEKYAETYPELAEEFKLAISGKLPKNYKDELPRFE 348
QN +L +D + +L EKY E Y ++A+E GK N + L FE
Sbjct: 337 QNNLLNSQIKDAVDATVSFYQTLTEKYGEKYSKMAQELADKSKGKKIGNVNEALAAFE 394


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1009670FbpA_PF05833340.005 Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 33.7 bits (77), Expect = 0.005
Identities = 39/249 (15%), Positives = 83/249 (33%), Gaps = 4/249 (1%)

Query: 241 LQARSKEILAFVNESKETAIKEYEIIEKKTLENNILKDNINQLNKNKIDFVQLKEQQPEI 300
I F E+ + + + +L N ID ++
Sbjct: 174 FDFSYDMIENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVE 233

Query: 301 DEIEAKLKLLQDITNLLNYIENREKIETKIAN--SKKDISKTNNKILNLDCDKRNIDKEK 358
+ ++ + Y +N + N SK+D K + + K+K
Sbjct: 234 VCKDLFKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDK 293

Query: 359 K--MLEENGDLIESKTSFIDKTRVLFNDINKYQQSYLNIECLITEGEQLGDELNNLIKGL 416
+ ++ DL + + I++ +N + + + GE L + L KGL
Sbjct: 294 SDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGL 353

Query: 417 EKVEDSIGNNESDYEKIIELNNAITNINNEINIIKENEKAKAELDKLLGSKQELENQINE 476
+E + +E+ I L+ T N + K+ K K + + E ++N
Sbjct: 354 SHIELANYYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQLLQNEEELNY 413

Query: 477 ETTIMKNLE 485
+++ N+
Sbjct: 414 LYSVLTNIN 422


15SAI8T7_1010260SAI8T7_1010360Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1010260213-1.688552UDP-N-acetylglucosamine--N-acetylmuramyl-
SAI8T7_1010270313-0.882417Putative uncharacterized protein
SAI8T7_1010280110-0.988746C-terminal processing peptidase family protein
SAI8T7_1010290012-0.000506Glucose-specific phosphotransferase enzyme IIA
SAI8T7_1010300-110-0.523722Peptide methionine sulfoxide reductase MsrA 2
SAI8T7_10103109111.874837DegV domain-containing protein SACOL1460
SAI8T7_10103209101.841688Dihydrofolate reductase
SAI8T7_10103309101.808400Putative Thymidylate synthase
SAI8T7_10103409101.872246Conserved virulence factor C
SAI8T7_10103509111.889078Putative uncharacterized protein
SAI8T7_10103609101.907854Putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1010270SACTRNSFRASE325e-04 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.2 bits (73), Expect = 5e-04
Identities = 33/140 (23%), Positives = 54/140 (38%), Gaps = 19/140 (13%)

Query: 30 EQWDDQYPLLEHFEEDIAKDYLYVLEENDKIYGFIVVDQDQAEWYDDIDWPVNREGAFVI 89
+Q++D + + EE+ +LY LE + G I + N G +I
Sbjct: 48 KQYEDDDMDVSYVEEEGKAAFLYYLE--NNCIGRIKIRS-------------NWNGYALI 92

Query: 90 HRLTGSKEY--KGAATELFNYVIDVVKARGAEVILTDTFALNKPAQGLFAKFGFHKVGEQ 147
+ +K+Y KG T L + I+ K ++ +T +N A +AK F
Sbjct: 93 EDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVD 152

Query: 148 LMEYP--PYDKGEPFYAYYK 165
M Y P + YYK
Sbjct: 153 TMLYSNFPTANEIAIFWYYK 172


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1010360GPOSANCHOR451e-05 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 45.4 bits (107), Expect = 1e-05
Identities = 49/323 (15%), Positives = 96/323 (29%), Gaps = 9/323 (2%)

Query: 2582 TKVRAAQTKIDQAKALLQNKEDNSQLVTSKNNLQSSVNQVPSTAGMTQQSIDN------- 2634
T +A Q L + +E + N L+ + + + D
Sbjct: 37 TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSN 96

Query: 2635 YNAKKREAETEITAAQRVIDNGDATAQQISDEKHRVDNALTALNQAKHDLTADTHALEQA 2694
K R+ + ++ I +A + N TA + L A+ AL
Sbjct: 97 AKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAAR 156

Query: 2695 VQQLNRTGTTTGKKPASITAYNNSIRALQSDLTSAKNSANAIIQKPIRTVQEVQSALTNV 2754
L + + +A ++ A ++ L + + ++ + + + +
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 2755 NRVNERLTQAINQLVPLADNSALRTAKTKLDEEINKSVTTDGMTQSSIQTYENAKRAGQT 2814
L L T +I ++ E A
Sbjct: 217 EAEKAALAARKADL--EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 274

Query: 2815 ETTNAQNVINNGDATDQQIAAEKTKVEEKYNSLKQAIAGLTPDLAPLQTAKTQLQNDIDQ 2874
+T I +A + AEK +E + L L DL + AK QL+ + +
Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQK 334

Query: 2875 PTSTTGMTSASVAAFNDKLSAAR 2897
++ AS + L A+R
Sbjct: 335 LEEQNKISEASRQSLRRDLDASR 357



Score = 40.8 bits (95), Expect = 3e-04
Identities = 56/339 (16%), Positives = 103/339 (30%), Gaps = 24/339 (7%)

Query: 2732 SANAIIQKPIRTVQEVQSALTNVNRVNERLTQAINQLVPLAD-----NSALRTAKTKLDE 2786
+ + T+++VQ N L + L N L + E
Sbjct: 40 VSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKE 99

Query: 2787 EINKSVTTDGMTQSSIQTYENAKRAGQTETTNAQNVINNGDATDQQIAAEKTKVEEKYNS 2846
++ K+ + S IQ E K + A N A + + AEK + +
Sbjct: 100 KLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 159

Query: 2847 LKQAIAGLTPDLAPLQTAKTQLQNDIDQPTSTTGMTSASVAAFNDKLSAARTKIQEIDRV 2906
L++A+ G L+ + A A L A +
Sbjct: 160 LEKALEGAMNFSTADSAKIKTLEAEKAA-------LEARQAELEKALEGAM------NFS 206

Query: 2907 LASHPDVATIRQNVTAANAAKTALDQARNGLTVDKAPLENAKNQLQHSIDTQTSTTGMTQ 2966
A + T+ A A K L++A G L+ + +
Sbjct: 207 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 2967 DSINAYNAKLTAARNKVQQINQVLAGSPTVDQINTNTSAANQAKSDLDHARQALTPDKAP 3026
++ TA K++ + + + L+ RQ+L D
Sbjct: 267 KALEGAMNFSTADSAKIKTLEA------EKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320

Query: 3027 LQNAKTQLEQSINQPTDTTGMTTASLNAYNQKLQAARQK 3065
+ AK QLE + + ++ AS + + L A+R+
Sbjct: 321 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREA 359


16SAI8T7_1011450SAI8T7_1011560Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10114502150.576102Putative uncharacterized protein MW1528
SAI8T7_1011460113-0.402074Ribosomal RNA small subunit methyltransferase E
SAI8T7_1011470212-0.369411Ribosomal protein L11 methyltransferase
SAI8T7_1011480112-0.341166Chaperone protein DnaJ
SAI8T7_1011490112-1.532314Chaperone protein DnaK
SAI8T7_1011500012-3.441310Protein grpE
SAI8T7_1011510013-3.319726Heat-inducible transcription repressor HrcA
SAI8T7_1011520013-3.149398Oxygen-independent coproporphyrinogen oxidase
SAI8T7_1011530-113-3.490806Elongation factor 4
SAI8T7_1011540116-4.591237Putative uncharacterized protein
SAI8T7_1011550116-3.636942Similar to late competence protein ComEC
SAI8T7_1011560218-2.224068Possible competence protein ComEB
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011490SHAPEPROTEIN1632e-47 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 163 bits (415), Expect = 2e-47
Identities = 79/363 (21%), Positives = 145/363 (39%), Gaps = 58/363 (15%)

Query: 10 SKIIGIDLGTTNSCVTVLEG----DEPKVIQ-NPEGSRTTPSVVAFKNGETQVGEVAKRQ 64
S + IDLGT N+ + V +EP V+ + + + SV A VG AK+
Sbjct: 10 SNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAA-------VGHDAKQM 62

Query: 65 AITNPNTVQSIKRHMGTDYKVDIEGKSYTPQEISAMILQNLKNTAESYLGEKVDKAVITV 124
P + +I+ K + + +++ ++ + + + + ++ V
Sbjct: 63 LGRTPGNIAAIR-----PMKDGVIADFFVTEKMLQHFIKQVHSNS---FMRPSPRVLVCV 114

Query: 125 PAYFNDAERQATKDAGKIAGLEVERIINEPTAAALAYGLDKTDKDEKVLVFDLGGGTFDV 184
P ER+A +++ + AG +I EP AAA+ GL + +V D+GGGT +V
Sbjct: 115 PVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGL-PVSEATGSMVVDIGGGTTEV 173

Query: 185 SILELGDGVFEVLSTAGDNKLGGDDFDQVIIDYLVAEFKKENGVDLSQDKMALQRLKDAA 244
+++ L V + ++GGD FD+ II+Y+ + G + A
Sbjct: 174 AVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIG-------------EATA 215

Query: 245 EKAKKDLS----GVSQTQISLPFISAGENGPLHLEVNLTRSKFEELSDSL------IRRT 294
E+ K ++ G +I + + E P +N + E L + L +
Sbjct: 216 ERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLN-SNEILEALQEPLTGIVSAVMVA 274

Query: 295 MEPTRQAMKDAGLTNSDIDE--VILVGGSTRIPAVQEAVKKEIGKEPNKGVNPDEVVAMG 352
+E + SDI E ++L GG + + + +E G +P VA G
Sbjct: 275 LEQCPPELA------SDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARG 328

Query: 353 AAI 355

Sbjct: 329 GGK 331


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011530TCRTETOQM1842e-52 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 184 bits (468), Expect = 2e-52
Identities = 106/439 (24%), Positives = 184/439 (41%), Gaps = 89/439 (20%)

Query: 12 NIRNFSIIAHIDHGKSTLADRILEN---TKSVETRDMQDQLLDSMDLERERGITIKLNAV 68
I N ++AH+D GK+TL + +L N + + D D+ LER+RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 RLKYEAKDGNTYTFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLA 128
++E N +IDTPGH+DF EV RSL+ +GAIL++ A G++AQT +
Sbjct: 62 SFQWENTKVN-----IIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 129 LDNELELLPVINKIDLPAAEPERV--------------KQEIE--------DMIGLDQDD 166
+ + INKID + V KQ++E + +Q D
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 167 VVLA---------------------------------------SAKSNIGIEEILEKIVE 187
V+ SAK+NIGI+ ++E I
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236

Query: 188 VVPAPDGDPEAPLKALIFDSEYDPYRGVISSIRIVDGVVKAGDKIRMMATGKEFEVTEVG 247
+ ++ L +F EY R ++ IR+ GV+ D +R+ K ++TE
Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITE-- 293

Query: 248 INTPKQ---LPVDELTVGDVGYIIASIKNVDDSRVGDTITLASRPASEPLQGYKKMNPMV 304
+ T +D+ G++ + ++ +GDT L P E ++ P++
Sbjct: 294 MYTSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLL---PQRERIENPL---PLL 346

Query: 305 YCGLFPIDNKNYNDLREALEKLQLNDASLEFE--PESSQALGFGYRTGFLGMLHMEIIQE 362
+ P + L +AL ++ +D L + + + + FLG + ME+
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCA 401

Query: 363 RIEREFGIELIATAPSVIY 381
++ ++ +E+ P+VIY
Sbjct: 402 LLQEKYHVEIEIKEPTVIY 420



Score = 35.6 bits (82), Expect = 6e-04
Identities = 12/75 (16%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 408 IFEPYVRATMMVPNDYVGAVMELCQRKRGQFINMDYLDDIRVNIVYELPLAEVVFDFFDQ 467
+ EPY+ + P +Y+ + ++ ++ V + E+P + ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNE-VILSGEIPARC-IQEYRSD 592

Query: 468 LKSNTKGYASFDYEF 482
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


17SAI8T7_1011950SAI8T7_1012020Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1011950213-0.874566GTPase obg
SAI8T7_1011960522-1.353435Similar to cell shape determinant mreD
SAI8T7_1011970824-0.391426Rod shape-determining protein MreC
SAI8T7_1011980926-0.283450Putative uncharacterized protein
SAI8T7_1011990925-0.056321Putative uncharacterized protein
SAI8T7_1012000924-0.771946rRNA adenine N-6-methyltransferase
SAI8T7_1012010620-0.756404Streptomycin=3''-adenylyltransferase
SAI8T7_1012020210-0.602333Transposase B from transposon Tn554
18SAI8T7_1013300SAI8T7_1013410Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1013300210-2.682918Glutamyl endopeptidase
SAI8T7_1013310312-3.902729Serine protease splA
SAI8T7_1013320413-4.308101Putative Probable beta-lactamase
SAI8T7_1013330414-4.573290Putative uncharacterized protein
SAI8T7_1013340516-4.607500Leukotoxin, LukD
SAI8T7_1013350718-5.810955Leukotoxin S-subunit
SAI8T7_1013360920-7.177831Putative uncharacterized protein
SAI8T7_10133701020-6.561952Putative uncharacterized protein
SAI8T7_1013380819-7.051920Protein of hypothetical function DUF1828
SAI8T7_1013390719-6.235395Enterotoxin type G
SAI8T7_1013400415-4.069391Enterotoxin SeN
SAI8T7_1013410212-2.749384Extracellular enterotoxin type I
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013300V8PROTEASE1824e-58 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 182 bits (462), Expect = 4e-58
Identities = 65/230 (28%), Positives = 108/230 (46%), Gaps = 29/230 (12%)

Query: 43 EVQQTAKA-----ENNVTKIQDTNIFPYTGVVAFKS--------ATGFVVGKNTILTNKH 89
++Q A N+ +I DT Y V + A+G VVGK+T+LTNKH
Sbjct: 60 PLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKH 119

Query: 90 V-SKNYKVGDRITAHP---NSDKGNGGIYSIKKIINYPGKEDVSVIQVEERAIERGPKGF 145
V + + A P N D G ++ ++I Y G+ D+++++ +
Sbjct: 120 VVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK----- 174

Query: 146 NFNDNVTPFKYAAGA--KAGERIKVIGYPHPYKNKYVLYESTGPVMSVEGSSIVYSAHTE 203
+ + V P + A + + I V GYP K ++ES G + ++G ++ Y T
Sbjct: 175 HIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMWESKGKITYLKGEAMQYDLSTT 233

Query: 204 SGNSGSPVLNSNNELVGIHFASDVKNDDNRNAYGVYFTPEIKKFIAENID 253
GNSGSPV N NE++GIH+ V N+ N V+ ++ F+ +NI+
Sbjct: 234 GGNSGSPVFNEKNEVIGIHWGG-VPNEFNG---AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013310V8PROTEASE1381e-41 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 138 bits (349), Expect = 1e-41
Identities = 63/212 (29%), Positives = 100/212 (47%), Gaps = 18/212 (8%)

Query: 36 EKNVKEITDATKAPYNSVVAFA--------GGTGVVVGKNTIVTNKHIAKSNDIFKNRVA 87
+ +ITD T Y V +GVVVGK+T++TNKH+ + + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHYS---SKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFAEGA-- 142
A S G + + I +Y G+ DLAIV + + + + V + A
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQNKHIGEVVKPATMSNNAET 191

Query: 143 KAKDRISVIGYPKGAQTKYKMFESTGTINHISGTFIEFDAYAQPCNSGSPVLNSKHELIG 202
+ I+V GYP G + M+ES G I ++ G +++D NSGSPV N K+E+IG
Sbjct: 192 QVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIG 250

Query: 203 ILYAGSGKDESEKNFGVYFTPQLKEFIQNNIE 234
I + G +E N V+ ++ F++ NIE
Sbjct: 251 IHWGGVP---NEFNGAVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013340BICOMPNTOXIN396e-141 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 396 bits (1020), Expect = e-141
Identities = 96/329 (29%), Positives = 177/329 (53%), Gaps = 24/329 (7%)

Query: 1 MKMKKLVKSSVASSIALLLLSNTVDAAQHITPVSEKKVDDKITLYKTTATSDNDKLNISQ 60
M K++ ++++ S+ L + ++ A+ + I + K T ++K ++Q
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 ILTFNFIKDKSYDKDTLVLKAAGNINSGYKKPNPKDYNYSQ-FYWGGKYNVSVSSESNDA 119
+ F+F+KDK Y+KD L+LK G I+S N K N+ + W +YN+ + + +
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTN-DKY 119

Query: 120 VNVVDYAPKNQNEEFQVQQTLGYSYGGDINISNGLSGGLNGSKSFSETINYKQESYRTTI 179
V++++Y PKN+ E V QTLGY+ GG+ + L G NGS ++S++I+Y Q++Y + +
Sbjct: 120 VSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGG--NGSFNYSKSISYTQQNYVSEV 177

Query: 180 DRKTNHKSIGWGVEAHKIMNNGWGPYGRDSYDPTYGNELFLGGRQSSSNAGQNFLPTHQM 239
+++ N KS+ WGV+A+ + ++LF+G + S + F+P ++
Sbjct: 178 EQQ-NSKSVLWGVKANSFAT-------ESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSEL 229

Query: 240 PLLARGNFNPEFISVLSHKQNDTKKSKIKVTYQREMD---------RYTNQWNRLHWIGN 290
P L + FNP FI+ +SH++ + S+ ++TY R MD Y N + H + N
Sbjct: 230 PPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHN 289

Query: 291 NYKNQNTVTFTSTYEVDWQNHTVKLIGTD 319
+ N+N +T YEV+W+ H +K+ G +
Sbjct: 290 AFVNRN---YTVKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013350BICOMPNTOXIN433e-156 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 433 bits (1116), Expect = e-156
Identities = 214/318 (67%), Positives = 256/318 (80%), Gaps = 10/318 (3%)

Query: 1 MFKKKMLAATLSVGLIAPLASPIQE-SRANTNIENIGDGA--EVIKRTEDVSSKKWGVTQ 57
M K K+L TLSV L+APLA+P+ E ++A + E+IG G+ E+IKRTED +S KWGVTQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 58 NVQFDFVKDKKYNKDALIVKMQGFINSRTSFSDVKGSGYELTKRMIWPFQYNIGLTTKDP 117
N+QFDFVKDKKYNKDALI+KMQGFI+SRT++ + K + + K M WPFQYNIGL T D
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNH--VKAMRWPFQYNIGLKTNDK 118

Query: 118 NVSLINYLPKNKIETTDVGQTLGYNIGGNFQSAPSIGGNGSFNYSKTISYTQKSYVSEVD 177
VSLINYLPKNKIE+T+V QTLGYNIGGNFQSAPS+GGNGSFNYSK+ISYTQ++YVSEV+
Sbjct: 119 YVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVE 178

Query: 178 KQNSKSVKWGVKANEFVTPDGKKSAHDRYLFVQSPNGPTGSAREYFAPDNQLPPLVQSGF 237
+QNSKSV WGVKAN F T G+KSA D LFV + R+YF PD++LPPLVQSGF
Sbjct: 179 QQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPH-SKDPRDYFVPDSELPPLVQSGF 237

Query: 238 NPSFITTLSHEKGSSDTSEFEISYGRNLDITYA----TLFPRTGIYAERKHNAFVNRNFV 293
NPSFI T+SHEKGSSDTSEFEI+YGRN+D+T+A T + + + R HNAFVNRN+
Sbjct: 238 NPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYT 297

Query: 294 VRYEVNWKTHEIKVKGHN 311
V+YEVNWKTHEIKVKG N
Sbjct: 298 VKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013390BACTRLTOXIN1954e-64 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 195 bits (497), Expect = 4e-64
Identities = 109/261 (41%), Positives = 155/261 (59%), Gaps = 11/261 (4%)

Query: 4 LSTVIIILILEIVFHNMN-YVNAQPDPKLDELNKVSDYKNNKGTMGNVMNLYTSPPVEGR 62
+S VI+I L +V N +QPDP D+L+K S++ GTMGN+ LY V
Sbjct: 7 ISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFT---GTMGNMKYLYDDHYVSAT 63

Query: 63 GVINSRQFLSHDLIFPI---EYKSYNEVKTELENTELANNYKDKKVDIFGVPYFYTCIIP 119
V + +FL+HDLI+ I + K+Y++VKTEL N +LA YKD+ VD++G Y+ C
Sbjct: 64 KVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123

Query: 120 KSEPDINQNFGGCCMYGGLTF---NSSENERDKLITVQVTIDNRQSLGFTITTNKNMVTI 176
+ G CMYGG+T N +N + + V+V + R ++ F + T+K VT
Sbjct: 124 SKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTA 183

Query: 177 QELDYKARHWLTKEKKLYEFDGSAFESGYIKFTEKNNTSFWFDLFPKKELVPFVPYKFLN 236
QELD KAR++L +K LYEF+ S +E+GYIKF E N +FW+D+ P F K+L
Sbjct: 184 QELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDK-FDQSKYLM 242

Query: 237 IYGDNKVVDSKSIKMEVFLNT 257
+Y DNK VDSKS+K+EV L T
Sbjct: 243 MYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013400BACTRLTOXIN1559e-49 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 155 bits (394), Expect = 9e-49
Identities = 76/265 (28%), Positives = 124/265 (46%), Gaps = 21/265 (7%)

Query: 2 RLFYIAAIII-TLLCLINNNYVNAEV----DKKDLKKKSDLDSSKLFNLTSYYTDITWQL 56
RLF I+I L+ +I+ V AE DL K S+ + + N+ Y D +
Sbjct: 4 RLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEF-TGTMGNMKYLYDDH--YV 60

Query: 57 DESNKISTDQLLNNTIILKNIDISVLKTSSLKVEFNSSDLANQFKGKNIDIYGLYFGNKC 116
+ S D+ L + +I D + +K E + DLA ++K + +D+YG + C
Sbjct: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC 120

Query: 117 -------VGLTEEKTSCLYGGVTIHDGNQLDEEKV--IGVNVFKDGVQQEGFVIKTKKAK 167
VG +C+YGG+T H+GN D + + V V+++ F ++T K
Sbjct: 121 YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS 180

Query: 168 VTVQELDTKVRFKLENLYKIYNKDTGNIQKGCIFFHSHNHQDQSFYYDLYNVKGSVG--A 225
VT QELD K R L N +Y ++ + G I F +N +F+YD+ G +
Sbjct: 181 VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENN--GNTFWYDMMPAPGDKFDQS 238

Query: 226 EFFQFYSDNRTVSSSNYHIDVFLYK 250
++ Y+DN+TV S + I+V L
Sbjct: 239 KYLMMYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013410BACTRLTOXIN1082e-30 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 108 bits (270), Expect = 2e-30
Identities = 54/227 (23%), Positives = 98/227 (43%), Gaps = 37/227 (16%)

Query: 30 VGNLRNFYTKHDYIDLKGVTDKNLPIANQLEFS------TGTNDLISESNNWDEISKFKG 83
+GN++ Y H K + +A+ L ++ + + +E N D K+K
Sbjct: 48 MGNMKYLYDDHYVSATKVKSVDKF-LAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKD 106

Query: 84 KKLDIFGIDY-------------NGPCKSKYMYGGATL-SGQYLNSARKIPINLWVNGKH 129
+ +D++G +Y MYGG T G + ++ + + V
Sbjct: 107 EVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENK 166

Query: 130 KTISTDKIATNKKLVTAQEIDVKLRRYLQEEYNIYGHNNTGKGKEYGYKSKFYSGFNNGK 189
+ + ++ T+KK VTAQE+D+K R +L + N+Y N+ + G
Sbjct: 167 RNTISFEVQTDKKSVTAQELDIKARNFLINKKNLY-EFNSSP-------------YETGY 212

Query: 190 VLFHLNNEKSFSYDLF-YTGDGLPVS-FLKIYEDNKIIESEKFHLDV 234
+ F NN +F YD+ GD S +L +Y DNK ++S+ ++V
Sbjct: 213 IKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEV 259


19SAI8T7_1014590SAI8T7_1014650Y        NNGenomic Island
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1014590011-3.607005Peptidase C45 acyl-coenzyme
SAI8T7_1014600012-4.763581Putative uncharacterized protein
SAI8T7_1014610111-5.254071Putative uncharacterized protein
SAI8T7_1014620212-4.736594Putative uncharacterized protein
SAI8T7_1014630011-3.452830Thioredoxin
SAI8T7_1014640-111-4.279029Putative uncharacterized protein
SAI8T7_1014650-112-3.714843Similar to ABC transporter (ATP-binding
20SAI8T7_1014740SAI8T7_1014820Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10147402131.602720Phage tail protein
SAI8T7_10147502121.239175Bacteriophage tail length tape measure protein
SAI8T7_10147601160.105008Putative uncharacterized protein
SAI8T7_10147702180.263446Phage capsid protein
SAI8T7_10147802210.028924PhiN315 scaffolding protein-like protein
SAI8T7_10147904230.262874Putative uncharacterized protein
SAI8T7_10148002230.485456Phage terminase large subunit
SAI8T7_10148104271.196516Putative uncharacterized protein
SAI8T7_10148204260.530546Replicative DnaB-like helicase family protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1014750TYPE4SSCAGA360.001 Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 36.2 bits (83), Expect = 0.001
Identities = 90/447 (20%), Positives = 172/447 (38%), Gaps = 69/447 (15%)

Query: 15 DSANLNRSLTEIKRNFRTLNSDLKLTGN--NFKYTEKSTDSYKQRIKELDGTIAGYKKNI 72
D + N+ L NF +D K TGN K +K + ++ + L+ + K +
Sbjct: 576 DFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVE---KKL 632

Query: 73 DDLAKQYDKVSQEQGENSTK--------------------AQNLRQEYNKQANELNFLEK 112
+ + +K+ + NS K AQNL+ + +++L + K
Sbjct: 633 ESKSGNKNKMEAKAQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDKLENVNK 692

Query: 113 ELEKTTAEFEEFKKAQVEAQRMAESGWGKTSKIFESMGPKLTKMGDGLKSIGKGMMIGVT 172
L+ F+EFK + + K + +++ + +G + I K
Sbjct: 693 NLKDFDKSFDEFKNGK-------NKDFSKAEETLKALKGSVKDLGINPEWISK------- 738

Query: 173 APVLGIAAASGKAFAEVDKGLDTVTQATGATGGELKKLQNSFKDVYGN--FPADAETVGG 230
V + AA + +K VTQA L+NS KDV N + +
Sbjct: 739 --VENLNAALNEFKNGKNKDFSKVTQAK-------SDLENSVKDVIINQKVTDKVDNLNQ 789

Query: 231 VLGEVNTRLGFTGKELENATESFLKFSHITGSDGVQAVQLITRAMGDAGIEASEYQSVLD 290
+ F+ +E A FS + Q + + +A ++ YQSV +
Sbjct: 790 AVSVAKATGDFSR--VEQALADLKNFSKEQLAQQAQKNESL-----NARKKSEIYQSVKN 842

Query: 291 MVAKAAQASGISVDTLADSITKYGAPMRAMGFEMKESIALFSQWEKSGVNTEIAFSGLKK 350
V +G+S A +++K + ++ E+ + F+ +G+ E ++ + K
Sbjct: 843 GVNGTLVGNGLS-QAEATTLSKNFSDIKK---ELNAKLGNFNNNNNNGLKNEPIYAKVNK 898

Query: 351 AISNWGKAGKNP--REEFKKTLAEIERTPDIASATSLAIEA--FGAKAGPDLADAIKGGR 406
+ + + P + KK A+I+R IAS + +A F K + D K G
Sbjct: 899 KKAGQAASLEEPIYAQVAKKVNAKIDRLNQIASGLGVVGQAAGFPLKRHDKVDDLSKVGL 958

Query: 407 FSYQEFLKTIEDSQGTVNQTFKDSESG 433
QE + I++ +NQ ++++G
Sbjct: 959 SRNQELAQKIDN----LNQAVSEAKAG 981


21SAI8T7_1015800SAI8T7_1016100Y        Y        YPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1015800320-0.101199Cation-efflux system membrane protein homolog
SAI8T7_1015810825-0.318930Putative Lytic regulatory protein truncated with
SAI8T7_1015820823-0.630034Putative uncharacterized protein
SAI8T7_1015830720-0.754410rRNA adenine N-6-methyltransferase
SAI8T7_1015840618-0.538295Streptomycin=3''-adenylyltransferase
SAI8T7_1015850111-0.425604Transposase B from transposon Tn554
SAI8T7_1015860-113-0.042467Transposase A from transposon Tn554
SAI8T7_1015870-1110.145975Putative Lytic regulatory protein truncated with
SAI8T7_1015880-1110.614231Putative uncharacterized protein
SAI8T7_10158907131.140842Similar to ATP-binding protein homolog
SAI8T7_10159007141.143645Glucosamine--fructose-6-phosphate
SAI8T7_10159107120.822547PTS system mannitol-specific EIICB component
SAI8T7_10159207110.616605PRD domain protein
SAI8T7_10159309150.545200Mannitol-1-phosphate=5-dehydrogenase
SAI8T7_10159408150.821496FmtB protein
SAI8T7_1015950213-0.319174Phosphoglucosamine mutase
SAI8T7_1015960315-0.550028Putative uncharacterized protein SA1966
SAI8T7_1015970316-0.612308Conserved hypothetoical protein
SAI8T7_1015980416-1.376195Arginase
SAI8T7_1016060514-1.280895******ATP-binding protein Mrp-like protein
SAI8T7_1016070312-2.391229Similar to multidrug resistance protein
SAI8T7_1016080213-2.274026Drug resistance transporter, EmrB/QacA
SAI8T7_1016090112-1.537205Putative Multidrug resistance efflux pump sepA
SAI8T7_1016100211-0.897888SA1972 protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015890PF05272290.017 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.017
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 8/58 (13%)

Query: 32 ILYGLNGAGKTTLLNILNAYEPATTGGVNLFGKMPGKVGYSAETVRQHIGFVSHSLLE 89
+L G G GK+TL+N L G++ F +G ++ Q G V++ L E
Sbjct: 600 VLEGTGGIGKSTLINTL--------VGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSE 649


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015940IGASERPTASE457e-06 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.7 bits (105), Expect = 7e-06
Identities = 57/316 (18%), Positives = 103/316 (32%), Gaps = 20/316 (6%)

Query: 2122 PQANNNSSADASTNSPTMDNDVTSKPEVESTNNG---TTDKPVTETDNATPAESTTNN-- 2176
P+ + +TN T +N P V S N + PV ATP+E+T
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 2177 ----NSTTTATNENAPTGSTATAPTTASTEAASSADSKDNASVNDSKQNAEVNNSAESQS 2232
S T NE T +TA A ++ + V S + + E++
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 2233 TNGKVAQPKS--ENKAKAEKDGRDSTNQSMVESTTETLPSADITEPNVPSNTSKDKEEST 2290
T + K+ E + E S E + P A+ N P+ K+ + T
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 2291 TNQTDAGQLKSETNVASNEADKSPSKADT----EVSNKPSTSASSEAKDKMTSTNVSQKD 2346
D Q ET+ + + +T + + +T A+++ S+N +
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 2347 DTATADTNDTQKSVGPVANNKAKDMQTNDTQKSVGSAANNKATQNDGANASPATVSNG-S 2405
+ + ++N + S N + A A ++ G +
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRS----TVALCDLTSTNTNAVLSDARAKAQFVALNVGKA 1278

Query: 2406 HSMHQDMLNVTKPEEN 2421
S H L + +
Sbjct: 1279 VSQHISQLEMNNEGQY 1294



Score = 42.0 bits (98), Expect = 4e-05
Identities = 47/318 (14%), Positives = 104/318 (32%), Gaps = 9/318 (2%)

Query: 1073 NNGSTTEEKEAAKQQVQTEKTAADAAIDAAHSNVEVEAAKNAEIAKI-EAIQPATTTKDN 1131
NG +++ QT T + ++V + N EIA++ EA P
Sbjct: 974 VNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATP 1033

Query: 1132 AKQAIATKANERKTAIAQTQDITAEEIAAANADVDNAVTQANSNIEAANSQNDVDQAKTT 1191
++ N ++ ++T + ++ A +A SN++A N+V Q+ +
Sbjct: 1034 SETTETVAENSKQE--SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 1192 GETSIDQVTPTVNKKATARNEITAILNNKLQEIQATPDATDEEKQAADAEANTENGKANQ 1251
+ T T K TA E + ++ Q P T + + +
Sbjct: 1092 TKE-----TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 1252 AISAATTNAQVDEAKANAEAAINAVTPKVVKKQAAKDEIDQLQATQTNVINNDQNATNEE 1311
+ T N + +++ N A + T +V+ N +N T
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 1312 KEAAIQQLATAVTDAKNNITAATDDNGVDTAKDAGKNSIQSTQPATAVKSNAKNEVDQAV 1371
+ + ++ ++ + + + V+ A + + + +N + A
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR-STVALCDLTSTNTNAVLSDAR 1265

Query: 1372 TTQNQAIDNTTGATTEEK 1389
N A ++
Sbjct: 1266 AKAQFVALNVGKAVSQHI 1283



Score = 33.9 bits (77), Expect = 0.010
Identities = 61/312 (19%), Positives = 105/312 (33%), Gaps = 18/312 (5%)

Query: 1021 DIDNATANTDVDNAKTTNEATIAAITPDANVKPAAKQAIADKVQAQETAIDANNGSTTEE 1080
D+ N TTN T I D P+ + IA +A S T E
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 1081 K--EAAKQQVQTEKTAADAAIDAAHSNVEVEAAKNAEIAKIEAIQPATTTKDNAKQAIAT 1138
E +KQ+ +T + A + N EV + + A T + Q+ +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNV-------KANTQTNEVAQSGSE 1091

Query: 1139 KANERKTAIAQTQDITAEEIAAANADVDNAVTQANSNIEAANSQNDVDQAKTTGETSIDQ 1198
+ T +T + EE A + V + S + Q++ Q + D
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND- 1150

Query: 1199 VTPTVN-KKATARNEITAILNNKLQEIQATPDATDEEKQAADAEANTENGKANQAISAAT 1257
PTVN K+ ++ TA +E + + E + + N + AT
Sbjct: 1151 --PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT--TPAT 1206

Query: 1258 TNAQVDEAKANAEAAINAVTPKVVKKQ---AAKDEIDQLQATQTNVINNDQNATNEEKEA 1314
T V+ +N + + + V A D+ ++ + + NA + A
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARA 1266

Query: 1315 AIQQLATAVTDA 1326
Q +A V A
Sbjct: 1267 KAQFVALNVGKA 1278



Score = 33.5 bits (76), Expect = 0.013
Identities = 45/305 (14%), Positives = 93/305 (30%), Gaps = 14/305 (4%)

Query: 804 KNEEIFKIENITDSTQTKMDAYKEVRQAATARKAQNATVSNATDEEVAEANAAVDAAQTE 863
E K E+ T + + A++A++ +N EVA++ + QT
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 864 GLHDIQVVKSQQEVADTKAKVLDKINAIQTQAKVKPAADTEVENAYNTRKQEIQNSNAST 923
+ V+ +++ A + + ++ + +Q K V+ ++ N
Sbjct: 1099 ETKETATVEKEEK-AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 924 TEEKEAAYTELDAKKQEARTNLDAANTNSDVTTAKDNGIAAINQVQAATTKKSDAKAEIA 983
+ + + + +E +N++ T T N + + T + +E +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVT-ESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216

Query: 984 QKASERKTAIEAMNDSTTEEQQAAKDKVDQAVVTANADIDNATANTDVDNAKTTNEATIA 1043
K R + E V + N A AK A
Sbjct: 1217 NKPKNR-HRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVA--- 1272

Query: 1044 AITPDANVKPAAKQAIADKV---QAQETAIDANNGSTTEEKEAAKQQVQTEKTAADAAID 1100
NV A Q I+ + Q +N + ++ ++ T D
Sbjct: 1273 -----LNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKNYSSSQYRRFSSKSTQTQLGWD 1327

Query: 1101 AAHSN 1105
SN
Sbjct: 1328 QTISN 1332



Score = 32.3 bits (73), Expect = 0.034
Identities = 32/210 (15%), Positives = 63/210 (30%), Gaps = 4/210 (1%)

Query: 34 TTASAAEQNQPAQNQPAQPADANTQPNANAGAQANPAAQPANQGGQANPAGGAAQPAGQG 93
T + +Q P Q QP A + +P Q N QPA +
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175

Query: 94 NQADPNNAAQAQPGNQ--AAPANQAGQGNNQATPNNNATPANQTQPANAPA-AAQPAAPV 150
+ ++ N + N P N+ +N+ + + + + P
Sbjct: 1176 SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235

Query: 151 AANAQTQDPNASNTGE-GSINTTLTFDDPAISTDENRQDPTVTVTDKVNGYSLINNGKIG 209
A + D + + S NT D + V+ ++ + N G+
Sbjct: 1236 PATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295

Query: 210 FVNSELRRSDMFDKNNPQNYQAKGNVAALG 239
S + + + + + +K LG
Sbjct: 1296 VWVSNTSMNKNYSSSQYRRFSSKSTQTQLG 1325


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016070TCRTETB501e-09 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 50.3 bits (120), Expect = 1e-09
Identities = 37/147 (25%), Positives = 69/147 (46%), Gaps = 2/147 (1%)

Query: 1 MIIMMSMVGPALLIPLYVQNSLSLSALLSGLVIM-PGAIINGIMSVFTGKFYDKYGPRPL 59
II ++ G ++P +++ LS G VI+ PG + I G D+ GP +
Sbjct: 266 GIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYV 325

Query: 60 IYTGFTILTITTIMLCFLHTDTSYTYLIVVYAIRMFSVSLLMMPINTTGINSLRNEEISH 119
+ G T L+++ + FL TS+ I++ + S I+T +SL+ +E
Sbjct: 326 LNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL-SFTKTVISTIVSSSLKQQEAGA 384

Query: 120 GTAIMNFGRVMAGSLGTALMVTLMSFG 146
G +++NF ++ G A++ L+S
Sbjct: 385 GMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016080TCRTETB913e-23 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 91.5 bits (227), Expect = 3e-23
Identities = 58/257 (22%), Positives = 120/257 (46%), Gaps = 12/257 (4%)

Query: 8 TTRRRNFIVAVMLISAFVAILNQTLLNTALPSIMRELNINESTSQWLVTGFMLVNGVMIP 67
+ R N I+ + I +F ++LN+ +LN +LP I + N +++ W+ T FML +
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 68 LTAYLMDRIKTRPLYLAAMGTFLLGSIVAALAPN-FGVLMLARVIQAMGAGVLMPLMQFT 126
+ L D++ + L L + GS++ + + F +L++AR IQ GA L+
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 127 LFTLFSKEHRGFAMGLAGLVIQFAPAIGPTVTGLIIDQASWRVPFIIIVGIALVAFVFGL 186
+ KE+RG A GL G ++ +GP + G+I W +++++ + + V L
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFL 185

Query: 187 VSISSYNEVKYTKLDKRSVMYSTIGFGLMLYAFSSAGDLGFTSPIVIGALIISMVIIYLF 246
+ + D + ++ ++G + FT+ I LI+S++ +F
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFML---------FTTSYSISFLIVSVLSFLIF 236

Query: 247 IRRQFNITNVLLNLKVF 263
++ +T+ ++ +
Sbjct: 237 VKHIRKVTDPFVDPGLG 253


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016100TCRTETB1035e-26 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 103 bits (257), Expect = 5e-26
Identities = 91/405 (22%), Positives = 175/405 (43%), Gaps = 14/405 (3%)

Query: 9 VIALILIMFMSAIESSIISLALPTIKQDLNA-GNLISLIFTAYFIALVIANPIVGELLSR 67
+I L ++ F S + +++++LP I D N + + TA+ + I + G+L +
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 68 FKIIYVAIAGLLLFSIGSFMCGLS-TNFTMLIISRVIQGFGSGVLMSLSQIVPKLAFEIP 126
I + + G+++ GS + + + F++LI++R IQG G+ +L +V
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135

Query: 127 LRYKIMGIVGSVWGISSIIGPLLGGGILEFATWHWLFYINIPIAIIAIILVIWTFHFPEE 186
R K G++GS+ + +GP +GG I HW + + IP +I II V + ++
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIP--MITIITVPFLMKLLKK 191

Query: 187 ETVAKSKFDTKGLTLFYVFIGLIMFALLNQQLLLLNFLSFILAIVVAMCLFKVEKHVSSP 246
E K FD KG+ L V I M + + I++++ + K + V+ P
Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSY-----SISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 247 FLPVVEF-NRSITLVFITDLLTAICLMGFNLYIPVYLQEQLGLSPLQSG-LVIFPLSVAW 304
F+ N + + + + GF +P +++ LS + G ++IFP +++
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 305 ITLNFNLHRIEAKLSRKVIYLLSFTLLLVSSIIISFGIKL-PVLIAFVLILAGLSFGYIY 363
I + + + + + T L VS + SF ++ + +++ +
Sbjct: 307 IIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK 366

Query: 364 TKDSVIVQEETSPLQMKKMMSFYGLTKNLGASIGSTIMGYLYAIQ 408
T S IV + MS T L G I+G L +I
Sbjct: 367 TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


22SAI8T7_1018420SAI8T7_1018580Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1018420215-1.493621Putative Transposase
SAI8T7_1018430415-1.514951Putative uncharacterized protein
SAI8T7_1018440616-1.434940Uncharacterized oxidoreductase SAV2478
SAI8T7_1018450917-2.469082Putative lipoprotein
SAI8T7_1018460817-3.723550Putative uncharacterized protein
SAI8T7_10184701019-4.780336Putative lipoprotein
SAI8T7_1018480415-3.709489Putative lipoprotein
SAI8T7_1018490315-3.530426Putative uncharacterized protein
SAI8T7_1018500314-2.796412Similar to putative helicase
SAI8T7_1018510414-2.230657Type III restriction enzyme, res subunit
SAI8T7_1018520314-1.624735Putative uncharacterized protein
SAI8T7_1018530211-0.850073Phosphomannomutase
SAI8T7_10185404120.951006Putative uncharacterized protein
SAI8T7_10185503121.753085Surface protein G
SAI8T7_1018560091.927933Cell wall anchor protein
SAI8T7_1018570-1102.277169Putative SarU
SAI8T7_1018580-1103.157516UTP--glucose-1-phosphate uridylyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1018440DHBDHDRGNASE1022e-28 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 102 bits (254), Expect = 2e-28
Identities = 66/245 (26%), Positives = 118/245 (48%), Gaps = 23/245 (9%)

Query: 7 KIAVVTGAGSGIGEAIATLLHEEGAKVVLAGRNKDKLQNVANQLAQDS--VKVVPTDVTN 64
KIA +TGA GIGEA+A L +GA + N +KL+ V + L ++ + P DV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 KEEVDELMKIAQQTFGGLDIVINSAGQMLSSKITDYQVDEWDSMIDVNIKGTLYTAQAAL 124
+DE+ ++ G +DI++N AG + I +EW++ VN G +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 PTMLEQSSGHLINIASISGFEVTKSSTIYSATKAAVHTITQGLEKELAKTGVKVTSISPG 184
M+++ SG ++ + S S Y+++KAA T+ L ELA+ ++ +SPG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 185 MVDTAITAAYNPSD--------------------RKKLDPQDIAEAVLYALT-QPKHVNV 223
+T + + + +K P DIA+AVL+ ++ Q H+ +
Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248

Query: 224 NEITV 228
+ + V
Sbjct: 249 HNLCV 253


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1018550V8PROTEASE342e-04 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 34.2 bits (78), Expect = 2e-04
Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
P +P P NP+ P+ P P+ P NPNNP
Sbjct: 290 NNPDNPDNPNNPDNPNNPDEPNNPDNPNNP 319



Score = 32.7 bits (74), Expect = 7e-04
Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
+ P +P P+NP P P +P P NP+NP
Sbjct: 293 DNPDNPNNPDNPNNPDEPNNPDNPNNPDNP 322



Score = 32.3 bits (73), Expect = 0.001
Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
++P +P P+NP P P +P P NP+NP
Sbjct: 287 DQPNNPDNPDNPNNPDNPNNPDEPNNPDNP 316



Score = 30.7 bits (69), Expect = 0.003
Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNN 121
+ P +P P NP++P+ P +P+ P NP+N
Sbjct: 296 DNPNNPDNPNNPDEPNNPDNPNNPDNPDN 324



Score = 29.6 bits (66), Expect = 0.007
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 98 PKGPENPEKPSRPTHPSGPVNPNNPGLSKDRAKP-NGPVHSMDKND 142
P P+NP+ P+ P +P+ P PNNP + P NG ++ D D
Sbjct: 289 PNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNGDNNNSDNPD 334



Score = 27.7 bits (61), Expect = 0.027
Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 102 ENPEKPSRPTHPSGPVNPNNPGLSKDRAKPNGP 134
+ P P P +P+ P NPNNP + PN P
Sbjct: 287 DQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNP 319


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1018560GPOSANCHOR300.014 Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.014
Identities = 17/108 (15%), Positives = 33/108 (30%), Gaps = 1/108 (0%)

Query: 16 FLSNKLNKYSIRKFTVGTASILIG-SLMYLGTQQEAEAAENNIENPTTLKDNVQSKEVKI 74
+N YS+RK GTAS+ + +++ G T +
Sbjct: 2 TKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADK 61

Query: 75 EEVTNKDTAPQGVEAKSEVTSNKDTIEHEASVKAEDISKKEDTPKEVA 122
E+ N + + + KD + + K K ++
Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLS 109


23SAI8T7_1019280SAI8T7_1019380Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10192802170.372307Putative X-Pro dipeptidyl-peptidase (S15) family
SAI8T7_1019290724-0.467340LPXTG-motif cell wall anchor domain
SAI8T7_10193001026-0.493443Putative uncharacterized protein
SAI8T7_1019310926-0.107757Putative uncharacterized protein
SAI8T7_10193208230.176867rRNA adenine N-6-methyltransferase
SAI8T7_10193305200.744114Streptomycin=3''-adenylyltransferase
SAI8T7_10193402180.915916Transposase B from transposon Tn554
SAI8T7_1019350-2161.403280Transposase A from transposon Tn554
SAI8T7_1019360-1122.160097Truncated-SA protein
SAI8T7_10193700102.955664Pantothenate synthetase
SAI8T7_10193801103.3111123-methyl-2-oxobutanoate
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019290SUBTILISIN280.025 Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 28.3 bits (63), Expect = 0.025
Identities = 7/36 (19%), Positives = 14/36 (38%)

Query: 24 ANAENNKPEGVNENSNLIPVRQPDANYPGPVSDIAR 59
A N GV ++L+ ++ + G I +
Sbjct: 96 ATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQ 131


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019300V8PROTEASE663e-15 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 65.8 bits (160), Expect = 3e-15
Identities = 33/121 (27%), Positives = 58/121 (47%), Gaps = 8/121 (6%)

Query: 1 MNAPTKEDIAIIKLNS-----NLGNKTGYLTLNTHISK--GENIEISGFPGDKSDNRQYK 53
+ D+AI+K + ++G T++ + +NI ++G+PGDK ++
Sbjct: 154 TKYSGEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWE 213

Query: 54 GKGKLESFDENEMYYTVDTFSGQSGSAIRDSKNNIIGVHAYG-RYNHNSGVRINDLKLDY 112
KGK+ M Y + T G SGS + + KN +IG+H G N V IN+ ++
Sbjct: 214 SKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNEFNGAVFINENVRNF 273

Query: 113 I 113
+
Sbjct: 274 L 274


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019360V8PROTEASE492e-09 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 48.8 bits (116), Expect = 2e-09
Identities = 13/91 (14%), Positives = 28/91 (30%), Gaps = 2/91 (2%)

Query: 72 VFGKDQRTVVNNILQRPYKQTVLLNMTFSNNRVYKGTGTMIGKDIVLTAAHNVYSKDDKG 131
+ + R + + Y + + + +G ++GKD +LT H V
Sbjct: 70 ILPNNDRHQITDTTNGHYAPVTYIQVEAPTGT-FIASGVVVGKDTLLTNKH-VVDATHGD 127

Query: 132 WAKKIDVYAGVNGQTYTIGKAFSHKFFVSKT 162
+ +N Y G + +
Sbjct: 128 PHALKAFPSAINQDNYPNGGFTAEQITKYSG 158


24SAI8T7_1019840SAI8T7_1019890Y        Y        NPathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10198407132.228436Protein translocase subunit SecA 2
SAI8T7_10198509142.631603Putative uncharacterized protein
SAI8T7_10198609142.789941Putative uncharacterized protein
SAI8T7_101987011152.787090Putative uncharacterized protein
SAI8T7_101988011162.981833Accessory Sec system protein translocase subunit
SAI8T7_101989012173.725961Serine-rich adhesin for platelets
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019840SECA6480.0 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 648 bits (1672), Expect = 0.0
Identities = 282/823 (34%), Positives = 441/823 (53%), Gaps = 68/823 (8%)

Query: 2 KRINTWSDEVKSYSDDALKQKTIEFKERLASGVDTLDTLLPEAYAVAREASWRVLGMYPK 61
IN E++ SD+ LK KT EF+ RL G + L+ L+PEA+AV REAS RV GM
Sbjct: 26 NIINAMEPEMEKLSDEELKGKTAEFRARLEKG-EVLENLIPEAFAVVREASKRVFGMRHF 84

Query: 62 EVQLIGAIVLHEGNIAEMQTGEGKTLTATMPLYLNALSGKGTYLITTNDYLAKRDFEEMQ 121
+VQL+G +VL+E IAEM+TGEGKTLTAT+P YLNAL+GKG +++T NDYLA+RD E +
Sbjct: 85 DVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNR 144

Query: 122 PLYEWLGLTASLGFVDIVDYEYQKGEKRNIYEHDIIYTTNGRLGFDYLIDNLADSAEGKF 181
PL+E+LGLT V I KR Y DI Y TN GFDYL DN+A S E +
Sbjct: 145 PLFEFLGLT-----VGINLPGMPAPAKREAYAADITYGTNNEYGFDYLRDNMAFSPEERV 199

Query: 182 LPQLNYGIIDEVDSIILDAAQTPLVISGAPRLQSNLFHIVKEFVDTLIE----------- 230
+L+Y ++DEVDSI++D A+TPL+ISG S ++ V + + LI
Sbjct: 200 QRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLIRQEKEDSETFQG 259

Query: 231 DVHFKMKKTKKEIWLLNQGIEAAQSYFNV-------EDLYSEQAMVLVRNINLALRAQYL 283
+ HF + + +++ L +G+ + E LYS ++L+ ++ ALRA L
Sbjct: 260 EGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAHAL 319

Query: 284 FESNVDYFVYNGDIVLIDRITGRMLPGTKLQAGLHQAIEAKEGMEVSTDKSVMATITFQN 343
F +VDY V +G+++++D TGR + G + GLHQA+EAKEG+++ + +A+ITFQN
Sbjct: 320 FTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQN 379

Query: 344 LFKLFESFSGMTATGKLGESEFFDLYSKIVVQAPTDKAIQRIDEPDKVFRSVDEKNIAMI 403
F+L+E +GMT T EF +Y V PT++ + R D PD V+ + EK A+I
Sbjct: 380 YFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAII 439

Query: 404 HDIVELHETGRPVLLITRTAEAAEYFSKVLFQMDIPNNLLIAQNVAKEAQMIAEAGQIGS 463
DI E G+PVL+ T + E +E S L + I +N+L A+ A EA ++A+AG +
Sbjct: 440 EDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA 499

Query: 464 MTVATSMAGRGTDIKLG-----------------------------EGVEALGGLAVIIH 494
+T+AT+MAGRGTDI LG + V GGL +I
Sbjct: 500 VTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGT 559

Query: 495 EHMENSRVDRQLRGRSGRQGDPGSSCIYISLDDYLVKRWSDSNLAENNQLYSLDAQRLSQ 554
E E+ R+D QLRGRSGRQGD GSS Y+S++D L++ ++ ++ + + +
Sbjct: 560 ERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIE 619

Query: 555 SNLFNRKVKQIVVKAQRISEEQGVKAREMANEFEKSISIQRDLVYEERNRVLEIDDAENR 614
+ + AQR E + R+ E++ + QR +Y +RN +L++ D
Sbjct: 620 HPWVTKAIA----NAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQRNELLDVSDVSET 675

Query: 615 DFKALAKDVFEMFVNEE---KVLTKSRVVEYIYQNLSFQFNKDVACVNFKDKQAVVT--- 668
++ +DVF+ ++ + L + + + + L F+ D+ + DK+ +
Sbjct: 676 -INSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEET 734

Query: 669 ---FLLEQFEKQLALNRKNMQSAYYYNIFVQKVFLKAIDSCWLEQVDYLQQLKASVNQRQ 725
+L Q + ++ + A F + V L+ +DS W E + + L+ ++ R
Sbjct: 735 LRERILAQSIEVYQ-RKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRG 793

Query: 726 NGQRNAIFEYHRVALDSFEVMTRNIKKRMVKNICQSMITFDKE 768
Q++ EY R + F M ++K ++ + + + +E
Sbjct: 794 YAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019880SECYTRNLCASE1282e-35 Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 128 bits (324), Expect = 2e-35
Identities = 93/440 (21%), Positives = 181/440 (41%), Gaps = 52/440 (11%)

Query: 4 LLQQYEYKIIYKRMLYTCFILFIYILGTNISI--VSYNDMQ------VKHESFFKIAISN 55
+ + + K++L+T I+ +Y +GT+I I V Y ++Q ++ F +
Sbjct: 5 FARAFRTPDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGLVNMF 64

Query: 56 MGGDVNTLNIFTLGLVPWLTSMIILMLISYRNMDKYMKQTSLEKHYKE------------ 103
GG + + IF LG++P++T+ IIL L++ + LE KE
Sbjct: 65 SGGALLQITIFALGIMPYITASIILQLLT-------VVIPRLEALKKEGQAGTAKITQYT 117

Query: 104 RILTLILSVIQSYFVIHEYVSKERVHQDN-------------IYLTILILVTGTILLVWL 150
R LT+ L+++Q ++ S + + ++ + GT +++WL
Sbjct: 118 RYLTVALAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWL 177

Query: 151 ADKNSRYGIAGPMPIVMVSIIKSMMHQKMEYI------DASHIVIALLIILVIITLFILL 204
+ + GI M I+M I + + I I +I + +I + +++
Sbjct: 178 GELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVV 237

Query: 205 FIELVEVRIPYI----DLMNVSATNMKSYLSWKVNPAGSITLMMSISAFVFLKSGIHFIL 260
F+E + RIP + S +Y+ KVN AG I ++ + S F
Sbjct: 238 FVEQAQRRIPVQYAKRMIGRRSYGGTSTYIPLKVNQAGVIPVIFASSLLYIPALVAQFAG 297

Query: 261 SMFNKSISDDMPMLTFDSPVGISVYLVIQMLLGYFLSRFLINTKQKSKDFLKSGNYFSGV 320
+ + D P+ I Y ++ + +F N ++ + + K G + G+
Sbjct: 298 GNSGWKSWVEQNLTKGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGGFIPGI 357

Query: 321 KPGKDTERYLNYQARRVCWFGSALVTVIIGIPLYFTLFVPHLSTEIYFS-VQLIVLVYIS 379
+ G+ T YL+Y R+ W GS + +I +P L S F ++++V +
Sbjct: 358 RAGRPTAEYLSYVLNRITWPGSLYLGLIALVP-TMALVGFGASQNFPFGGTSILIIVGVG 416

Query: 380 INIAETIRTYLYFDKYKPFL 399
+ + I + L Y+ FL
Sbjct: 417 LETVKQIESQLQQRNYEGFL 436


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019890ICENUCLEATIN553e-09 Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 55.1 bits (132), Expect = 3e-09
Identities = 237/1070 (22%), Positives = 425/1070 (39%), Gaps = 12/1070 (1%)

Query: 1098 SDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSD 1157
+ + ++ + S S + + T +T S ST ++ +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1158 STSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSE 1217
ST T +S I+ ST + + + S + ++ST + S ES
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1218 STSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSE 1277
+ ST + S + S T+ S+ + ST + S+ T+ ST T +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1278 SVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSL 1337
S T+ ST T+ ++S+ ++ S T+ +S + ST + S + ST
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1338 SGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQ 1397
+G S + S+ + DS+ + S T+ S T+ S T+ + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1398 SGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQ 1457
S + S + ST T++ S T+ Y S T+ +S+ + S T+ S+
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1458 SGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSA 1517
+G ST + GS+ + S ST+ ES+ + S + S + ST + +
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1518 SASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSE 1577
S + STS + + S+ + S ++ S+ + ST + ST
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1578 SGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSD 1637
+GS S + ST T+ S T+ S + S T+ STS + + S +
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1638 SQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVM 1697
S + S T+ ST + SD T+ S ST+G+ S I+ ST T+ S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 1698 SASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESS 1757
+ S + S S S S + +DS ++G S + S T+ S +
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 1758 SLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTST 1817
S+ + S S + +DSS ++ S +++ S + S T+ S T+G STST
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 1818 SLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQ 1877
+ + S ++ S + S T+ S + S + STS + DS ++
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYG 885

Query: 1878 SMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSD 1937
S + S + S+ T+ D + S S + S GS T++ ST
Sbjct: 886 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLM 945

Query: 1938 SMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSIST 1997
+ GS + S + GSTS++ S+ + S + QST T+ GS T+ +
Sbjct: 946 AGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHS 1005

Query: 1998 SMSMSASTSSSQSTSVSTSLSTSDSISDST----------SISISGSQSTVESESTSDST 2047
S + S++ + + S+ ++ S S S ISG +S + + S
Sbjct: 1006 STLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLI 1065

Query: 2048 SISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS--ESLSTSGSGSTSVSDSTSMSESDST 2105
S S + S+ ++ S +G ST I+ S+ +G GS+ + S S +
Sbjct: 1066 SGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGAD 1125

Query: 2106 SVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSI 2155
SV M+ ++ + +DS + S L+ ++S T+ S +G+ I
Sbjct: 1126 SVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCI 1175



Score = 53.2 bits (127), Expect = 1e-08
Identities = 176/773 (22%), Positives = 305/773 (39%), Gaps = 2/773 (0%)

Query: 1408 SASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASL 1467
+ + +E + S + + T D+T S ST + + +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1468 SGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSST 1527
S SQ I+ S T+ +ST ++ ST +G+ ST + S + +SS
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1528 SLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSE 1587
+ ST M+ S+ + S + S+ + ST + S T+ GST +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1588 SDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTST 1647
SD T+ S + + S+ +G ST T+ +S T+ ST SD + ST T
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1648 STSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSM 1707
+ S + S + DS+ + GS + SD T+ S + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1708 SESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSD 1767
S ES + S + GS + GS + + S+ +G S
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1768 SVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSE 1827
+ S ++ S S S + S + GST T+ GS T+ S + +E
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1828 STSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTST 1887
S ++ S S + + S + GS + S+ T+ S + S +G ST T
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1888 SVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGSVSVST 1947
+ SDS + S + S+ + ST + S + S ST+ + S ++
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1948 STSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMSASTSS 2007
ST + S T+ S+ T+ S + S ST+ + S ++ ST + S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 2008 SQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTST 2067
+ S T+ SD S S S +G+ S++ + S T+ S + S T+
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 2068 SDSTS--GSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDSESV 2125
S T+ GSTST+ ++S +G GST + S+ + S +Q++SD T+ S S
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 2126 STSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTSTSESNSMHPS 2178
+ + S+ ++ ST T+ S +G S + S + S S + + S
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878



Score = 53.2 bits (127), Expect = 1e-08
Identities = 233/1050 (22%), Positives = 411/1050 (39%), Gaps = 2/1050 (0%)

Query: 759 TSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTSVSLSDSVSASKSLSTSES 818
TS + A ++ + S + D+ S S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 819 NSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNSNSTEKSESLSTSTSDSLRTSTS 878
+++ ST QS + GS + S I+ ST + + ST + T T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 879 LSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTSNAISTSTSLSESASTSDSISIS 938
+S M+ GS S +G ST + DS+ A ST + S+ + S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 939 NSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVS 998
A S T+ S T+ + S+ + ST + +ST T+G GS A S
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY--GSTQTAQKGSDL 336

Query: 999 TSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQSGSTSESLSDSQSTSDSDS 1058
T+ S T+ S I+ GS + S + ST + S+ + ST + +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1059 KSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSAS 1118
S ++ S T+ ST + S + GS + S + S + S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1119 TASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSDSTSLSTSESDSISESTSTSDS 1178
TA +S + ST + S + S T+ S + S + ST T+
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1179 ISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSESTSESVSSSTSESTSLS 1238
S + +ES I+ S ST+ + S + + S + S T+ S+ T+ S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1239 DSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTSLSMSTSTSLSNSTSLS 1298
+ S T+ S S+ +G S T++ S T+ + S + S+ T+ S ST+ +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1299 TSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDS 1358
S + S + S+ T+ ST + S T+ GSTS + +DS+ + S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1359 TSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTS 1418
T+ S+ + GST T+ S S S S + + + S ++ +S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1419 ESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQS 1478
S + + S ST+ + S + S T+ S T+ S ++ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 1479 ISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQ 1538
+ S ST+ + S+ ++ ST +G S T+ S ++ +S T+ STS +
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 1539 SSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDS 1598
S + S + S T+ ST + S T+ GSTS + ES + S
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 1599 QSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLS 1658
++ +ST +G S+ T+ S T+ STSM S + ST T+ S T+
Sbjct: 937 TASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGY 996

Query: 1659 DSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVS 1718
S + ST + GS + + + S + S+ S + S ++ SV
Sbjct: 997 GSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVL 1056

Query: 1719 ESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLS 1778
+ S S S+ + GS +++ + ES+ ++G++SM + S ++
Sbjct: 1057 TAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGY 1116

Query: 1779 VSTSLRSSESVSESDSLSDSKSTSGSTSTS 1808
ST + ++SV + + + ST T+
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTA 1146



Score = 52.8 bits (126), Expect = 2e-08
Identities = 229/1007 (22%), Positives = 395/1007 (39%), Gaps = 10/1007 (0%)

Query: 1163 TSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSES 1222
TS I + + +E + S ++ ES S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 1223 TSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTS 1282
+ ST T S + GST T+ +ST + ST T+ ++ST S T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1283 LSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTS 1342
S+ + ST SD T+ S + S+ + S + S+ +G S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1343 ESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDS 1402
+ S + ST + + S +G ST T+ S T+ S + S + +
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1403 NSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTS 1462
S + S+ + S T+ S T+ ST T+ SD T+ GST
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA------GYGSTG 392

Query: 1463 TSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASES 1522
T+ + S + S + S T+ ST + S +G ST T+ +S+ +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 1523 DSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTS 1582
S+ T+ DS+ + S +Q S + STST+ S++ ++ ST +G S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSL--IAGYGSTQTAGYGS 510

Query: 1583 ESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMS 1642
T+ ST T+ ++S + S S + + S+ + ST ++ S+ T+ S +
Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570

Query: 1643 LSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASIS 1702
S T+ ST + S S + S T+ S + ST T+ S + + S
Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630

Query: 1703 DSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGS 1762
S + ++S + S + +S +G S + S T+ S S + + S +
Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690

Query: 1763 QSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGS 1822
S + +S + S ++++ S+ S S ST+G+ S+ +G ST T+ S
Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750

Query: 1823 ESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGS 1882
+ S + S T+ S S +G+ S + ST + S + S +
Sbjct: 751 SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810

Query: 1883 ESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGS 1942
+ S + S+ST+ DS I+ S + +S +G ST T+ SD +G
Sbjct: 811 QERSDLTTGYGSTSTAG--ADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGY 868

Query: 1943 VSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMS 2002
S ST+ S I+G S + S T+ S +Q S +G STS + S
Sbjct: 869 GSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSL 928

Query: 2003 ASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDS 2062
+ S T+ S + S T+ S + S S + S + ST +
Sbjct: 929 IAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGY 988

Query: 2063 TSTSTSDSTSGSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDS 2122
ST T+ S T+ S + GS +T+ +DS+ ++ S+ S + + S
Sbjct: 989 QSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTL 1048

Query: 2123 ESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTST 2169
S S T+ S S S T+ GS I+ S+ ++G ST
Sbjct: 1049 ISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPEST 1095



Score = 52.1 bits (124), Expect = 3e-08
Identities = 193/856 (22%), Positives = 350/856 (40%), Gaps = 14/856 (1%)

Query: 1329 DSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTS 1388
S + ++ + + + S + + + + T +T S ST +
Sbjct: 97 TKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPT 156

Query: 1389 LSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDS 1448
++ + S + SQ + ST T+ S + Y S T+ ++ST + S
Sbjct: 157 QTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQ 216

Query: 1449 TSISKSTSQSGSTSTSASLSGSESESDSQSISTSASEST--SESASTSLSDSTSTSNSGS 1506
T+ +S+ +G ST + GS+ + S T+ +S+ + ST + S+ +G
Sbjct: 217 TAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 276

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
ST T+ S + S+ T+ +DS+ + S + S + ST T+ + S +
Sbjct: 277 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 336

Query: 1567 ASLSTSVSTSESGST------SESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
+ S T+ S+ S T+ DS+ T+ S T++ S + ST T+ +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1621 RSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S+ + S +T+ +S + ST T+ S + S T+ S+ +G S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1681 ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGS 1740
+ DS+ T+ S + SD + S + + S + S +G S +G
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1741 LSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKS 1800
S T+ +S+ ++ S S + + S ++ S+ + S+ ++ S + S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1801 TSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTS 1860
T+G ST T+GS S+ + GS + S + S T+ S +G S S + +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1861 LST------SDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDS 1914
S+ S + S+ ++ S + S + STS + DS I+ S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1915 MSTSDSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLS 1974
+ +S +G ST T+ SD SG S ST+ + S I+G S +S S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1975 DSMSQSQSTSTSASGSLSTSISTSMSMSASTSSSQSTSVSTSLSTSDSISDSTSISISGS 2034
S ++ S +G STS + + S + S T+ S+ T+ S T+ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 2035 QSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSISESLSTSGSGSTSVS 2094
+ S ST+ + S + ST + S T+ S T+ S+ + GS ST+
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 2095 DSTSMSESDSTSVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQS 2154
DS+ ++ ST + + S + S T+ S ST+ ES + GS
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 2155 ISDSTSTSMSGSTSTS 2170
+ ST M+G S+
Sbjct: 937 TASFKSTLMAGYGSSQ 952



Score = 51.3 bits (122), Expect = 5e-08
Identities = 200/886 (22%), Positives = 350/886 (39%), Gaps = 6/886 (0%)

Query: 797 KSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNS 856
++ + + SA + + ++ V+ + + S V+S + D +
Sbjct: 89 RAEVLHVGTKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATI 148

Query: 857 NSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTS 916
S + + + T + S ++ GS + ST I+G ST + +DST
Sbjct: 149 ESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTL 208

Query: 917 NAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLS 976
A ST + S+ + S S T+ S T+ S+ + ST +
Sbjct: 209 VAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 268

Query: 977 DSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMS 1036
DS+ T+G S + S + S + ++ + S + +S + S
Sbjct: 269 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 328

Query: 1037 TSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSAS 1096
T+Q GS + S T+ DS + + GST T+ S+ S T+ S
Sbjct: 329 TAQKGSDLTAGYGSTGTAGDDSSLI----AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384

Query: 1097 QSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVS 1156
+ S T+ +DS+ + ST ++ S + S + S T+ T T+
Sbjct: 385 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 444

Query: 1157 DSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLS 1216
DS+ ++ S + S+ + + ++ S + STS + +S+ S
Sbjct: 445 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQ 504

Query: 1217 ESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKS 1276
+ ST + ST + + SD + GSTST+ +NS+ + ST T+ S T
Sbjct: 505 TAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGY 564

Query: 1277 ESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTS 1336
S T+ S T+ ST + S S + S ++ S+ + S + S
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 1337 LSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMN 1396
+G S S + + SS + ST + S T+G ST T+ SD T+ S S +
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 1397 QSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTS 1456
S + + S + S T+ S T+ S TS STST+ + S + ST
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 1457 QSGSTSTSASLSGSESESDSQSISTS--ASESTSESASTSLSDSTSTSNSGSASTSTSLS 1514
+ S+ + GS + QS+ T+ S ST+ + S+ ++ ST +G S T+
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 1515 NSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVS 1574
S ++ S T+ STS + S + S + S T+ ST + S
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 1575 TSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTS 1634
T+ GSTS + +S + S + S +G ST T+ +S T+ STS
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 1635 TSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S + ST T++ S + S + S+ + GS S++
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMA 970



Score = 49.0 bits (116), Expect = 3e-07
Identities = 206/894 (23%), Positives = 361/894 (40%), Gaps = 6/894 (0%)

Query: 733 TDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTS 792
T G+ T + T + S ++ GSTQ + ST A S T+ GS + +
Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340

Query: 793 ASTSKSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDS 852
ST + S + S + +S+ + ST S ++ GS + + S
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 853 ISNSNSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLS 912
I+ ST+ + ST T+ T T+ S + GS + S+ I+G ST +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 913 DSTSNAISTSTSLSESASTSDSISISNSIANSQSA------STSKSDSQSTSISLSTSDS 966
DS+ A ST ++ S + S S A +S+ ST + ST + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
+ + S+ ++ STS + + S IA S T++ +S+ T+ S + GS +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S+ +G S + S+ + S + QS T+ STS + S
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
+ GS + +S+ + S T+ S TA S S + + S+ + ST +
Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
NS T+ S T+ S+ S STST+ + S I+ ST + S+ T+ S
Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + S ST+ + SS + S + S T+ S T+ S T+
Sbjct: 761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTS 1326
S ST+ S ++ S T+ S T+ S + +S + S ST+ S+
Sbjct: 821 GSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSL 880

Query: 1327 KSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTS 1386
+ ST T+ S + ST +++ SD T+ S S + S+ + S ++
Sbjct: 881 IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASF 940

Query: 1387 TSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLS 1446
S ++ + S+ + STS + +S + T + QS T+ S
Sbjct: 941 KSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQ 1000

Query: 1447 DSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGS 1506
+ S T+ GST+T+ + S + S S S T+ ST +S S +G
Sbjct: 1001 TAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGY 1060

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
S+ S S+ + S+ + S+ + S + + S ++ S+ T+ ST+
Sbjct: 1061 GSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTL 1120

Query: 1567 ASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
S + SV + + ++S T+ S + + S +G S T+ +D
Sbjct: 1121 ISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDC 1174



Score = 47.8 bits (113), Expect = 6e-07
Identities = 241/1091 (22%), Positives = 433/1091 (39%), Gaps = 10/1091 (0%)

Query: 907 TSASLSDSTSNAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDS 966
TSA A + + ++ S ++ + N T D+ S S + +
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
++T S T S ++G S + ST + ST +DS +G S +
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S + S + S + GST T+ S+ S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
T+ S + S T+ +DS+ + ST ++ S + S + S T+
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
T T+ DS+ ++ S + S+ + + ++ S + ST + + S
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 398

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + EST + ST + SD T+ GST T+ +S+ + ST T+
Sbjct: 399 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 458

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMS--TSDSIS 1324
+S+ T S T+ S T+ STS + S + S + S T+ S
Sbjct: 459 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGS 518

Query: 1325 TSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTS------TS 1378
T + + S + GSTS + ++S+ + S T+ S+ + GST T+ T+
Sbjct: 519 TQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTA 578

Query: 1379 TSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSES 1438
S T+ S S + S ++ S + ST T+ S T+ Y S ST+ ++S
Sbjct: 579 GYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 638

Query: 1439 TSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDS 1498
+ + S T+ S +G ST + GS+ + S ST+ ++S+ + S +
Sbjct: 639 SLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTA 698

Query: 1499 TSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTS 1558
S + ST + S S STS + + S+ + S ++ S + S
Sbjct: 699 GYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGS 758

Query: 1559 TSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTS 1618
T + STS +G+ S + ST T+ S T+ S + S T+
Sbjct: 759 TQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTT 818

Query: 1619 DSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMS 1678
STS + + S + S + S T+ ST + SD T+ S ST+G S
Sbjct: 819 GYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878

Query: 1679 VSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDS 1738
I+ ST T+ S + + S + S + S S + +S ++G S +
Sbjct: 879 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTA 938

Query: 1739 GSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDS 1798
S + S + S + S S++ DSS ++ S +++ S + S
Sbjct: 939 SFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGS 998

Query: 1799 KSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGS 1858
T+ +ST T+G ST+T+ + S ++ S S S T+ S +SG S+ +
Sbjct: 999 TQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTA 1058

Query: 1859 TSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTS 1918
S+ S S + S + S+ ++ +S+ + ++ SM I+ S +
Sbjct: 1059 GYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNR--SMLIAGKGSSQTAGY 1116

Query: 1919 DSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMS 1978
S+ ISG++S + ++G+ S T+ S ++G+ S + S T+ +D +
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCIL 1176

Query: 1979 QSQSTSTSASG 1989
+ S +G
Sbjct: 1177 MAGDRSKLTAG 1187


25SAI8T7_1000710SAI8T7_1000830N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10007100112.694207Iron-regulated ABC transporter
SAI8T7_10007201153.136375Probable siderophore biosynthesis protein SbnA
SAI8T7_10007303163.489799Probable ornithine cyclodeaminase protein
SAI8T7_10007402163.356143Putative uncharacterized protein
SAI8T7_10007502173.591111Putative uncharacterized protein
SAI8T7_10007600152.722747Putative uncharacterized protein
SAI8T7_10007700143.295728Similar to siderophore biosynthesis protein
SAI8T7_1000780-1142.656087Putative uncharacterized protein
SAI8T7_1000790-2131.873213Probable diaminopimelate decarboxylase protein
SAI8T7_1000800-2111.277577Putative uncharacterized protein sbnI
SAI8T7_1000810-1110.019912Putative uncharacterized protein
SAI8T7_1000820111-0.255790Diacetyl reductase [(S)-acetoin forming]
SAI8T7_1000830212-1.048501Polysaccharide biosynthesis family protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000710FERRIBNDNGPP692e-15 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 68.8 bits (168), Expect = 2e-15
Identities = 46/191 (24%), Positives = 78/191 (40%), Gaps = 38/191 (19%)

Query: 53 PKRVVTLYQGATDVAVSLGVKPVGAVES-----WTQKPKFEYIKNDLKDTKI-VGQEPAP 106
P R+V L ++ ++LG+ P G ++ W +P L D+ I VG P
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPP-------LPDSVIDVGLRTEP 87

Query: 107 NLEEISKLKPDLIVASKVRNEKVYDQLSKIAPTVSTDTVFKFKD----------TTKLMG 156
NLE ++++KP +V S + L++IAP F F D + M
Sbjct: 88 NLELLTEMKPSFMVWS-AGYGPSPEMLARIAPGR----GFNFSDGKQPLAMARKSLTEMA 142

Query: 157 KALGKEKEAEDLLKKYDDKVAAFQKDAKAKY--KDAWPLKASVVNF-RADHTRIYA-GGY 212
L + AE L +Y+D F + K ++ + A PL + H ++
Sbjct: 143 DLLNLQSAAETHLAQYED----FIRSMKPRFVKRGARPL--LLTTLIDPRHMLVFGPNSL 196

Query: 213 AGKILNDLGFK 223
+IL++ G
Sbjct: 197 FQEILDEYGIP 207


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000730SYCECHAPRONE310.002 Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE

chaperone signature.
Length = 130

Score = 31.2 bits (70), Expect = 0.002
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 21 VDALTEALTAHAHNDFVQ-PLKPYLRQDPENGH 52
+D E T +HN F Q LKP L D GH
Sbjct: 54 LDNNDEKETLLSHNIFSQDILKPILSWDEVGGH 86


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000740PF04183316e-103 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 316 bits (812), Expect = e-103
Identities = 119/527 (22%), Positives = 208/527 (39%), Gaps = 46/527 (8%)

Query: 79 RVSKQPLTAAEFWQTIANMNCDLSHEWEVARVEEGLTTAATQLAKQLSELDLASHPFV-- 136
R + +P+ A + + +S +++ T L + L++ +
Sbjct: 66 RCADEPVLAQTLLMQLKQVL-SMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINL 124

Query: 137 -MSEQFASLKDRPFHPLAKEKRGLREADYQVYQAELNQSFPLMVAAVKKTHMIHGDTANI 195
L P K +RG + + Y E +F L AVK+ HMI +
Sbjct: 125 NADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEM 184

Query: 196 DELENLTVPIKEQA----TDMLNDQGLSIDDYVLFPVHPWQYQHILPNVFATEISEKLVV 251
D + LT + Q + + + GL +++ PVHPWQ+Q + F + +E +V
Sbjct: 185 DIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIATDFIADFAEGRMV 243

Query: 252 LLPLKFGD-YLSSSSMRSLIDIGAPYN-HVKVPFAMQSLGALRLTPTRYMKNGEQAEQLL 309
L +FGD +L+ S+R+L + +K+P + + R P RY+ G A + L
Sbjct: 244 SLG-EFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWL 302

Query: 310 RQLIEKDEALAKYVMV-CDETA-------WWSYMGQDNDIFKDQLGHLTVQLRKYPEVLA 361
+Q+ D L + V E A ++ + + +++ LG V R+ P
Sbjct: 303 QQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLG---VIWRENPCRWL 359

Query: 362 KNDTQQLVSMAALAANDRTLYQMICGKDNISKNDVMTLFEDIAQVFLKVTLSFM-QYGAL 420
K D + V MA L D + + S D T + +V + + +YG
Sbjct: 360 KPD-ESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVA 418

Query: 421 PELHGQNILLSFEDGRVQKCVLRD-HDTVRIYKPWLTAHQLSLPKYV--VREDTPNTLIN 477
HGQNI L+ ++G Q+ +L+D +R+ K SLP+ V V +
Sbjct: 419 LIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD-SLPQEVRDVTSRLSADYLI 477

Query: 478 EDLETFFAYFQTLAVSVNLYAIIDAIQDLFGVSEHELMSLLKQILKNEVATISWVTTDQL 537
DL+T V + I + GV E LL +L + + Q+
Sbjct: 478 HDLQTGHF--------VTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMK-----KHPQM 524

Query: 538 AVRHILFDKQTWPFKQILLP---LLY-QRDSGGGSMPSGLTTVPNPM 580
+ R LF +++L L + D G +P+ L + NP+
Sbjct: 525 SERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPL 571


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000750TCRTETA733e-16 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 72.9 bits (179), Expect = 3e-16
Identities = 66/333 (19%), Positives = 131/333 (39%), Gaps = 22/333 (6%)

Query: 17 GIAIAAPAVTTMIASPIWGKLGDKISRKWMVLRALLGLAVCLFLMALCTTPLQFVLVRLL 76
GI +A A+ +P+ G L D+ R+ ++L +L G AV +MA + R++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 77 QGLFGGVVDASSAFASAEAPAEDRGKVLGRLQSSVSAGSLVGPLIGGVTASILGFSALLM 136
G+ G + A+ + ++R + G + + G + GP++GG+ A
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAPFF 164

Query: 137 SIAVITFIVCIFGALKLIETTHMPKSQTPNINKGIRRSFQCLLCTQQTCRFIIVGVLANF 196
+ A + + + G L E+ + SF+ + V A
Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFR--------WARGMTVVAALM 216

Query: 197 AMYGMLTALSPLASSVNHTAIDDR-----SVIGFLQSAF-WTASILSAPLWGRFNDKSYV 250
A++ ++ + + +++ +DR + IG +AF S+ A + G +
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 251 KSVYIFATIACGCSAILQGLATNIEFLMAARILQGLTYSAL--IQSVMFVVVNACHQ-QL 307
+ + IA G IL AT +L + +Q+++ V+ Q QL
Sbjct: 277 RRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQL 336

Query: 308 KGTFVGTTNSMLVVGQIIGSLSGAAITSYTTPA 340
+G+ T+ + I+G L AI + +
Sbjct: 337 QGSLAALTS----LTSIVGPLLFTAIYAASITT 365


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000760PF041833002e-97 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 300 bits (769), Expect = 2e-97
Identities = 109/474 (22%), Positives = 192/474 (40%), Gaps = 52/474 (10%)

Query: 11 WLIDGKSKKITTYKEAIARILQHMAQSADNQTA-VQQHMAQIMSDI--DNSIHRTARYLQ 67
ID ++ + +L + Q A V +HM + + + D + + R L
Sbjct: 58 LWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLS 117

Query: 68 SNTIDYVEDRYIVSEQSLYLGHPFHPTPKSASGFSEADLEKYAPECHTSFQLHYLAVHQD 127
++ + + Q L GHP K G+ + LE+YAPE +F+LH+LAV ++
Sbjct: 118 ASDL---INLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKRE 174

Query: 128 -------------VLLTRYVEGKEDQVEKVLYQLADIDISEIPKDFILLPTHPYQINVLR 174
LLT ++ +E ++Q +D +++ LP HP+Q
Sbjct: 175 HMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-----HNWLPLPVHPWQWQQK- 228

Query: 175 QHPQYMQYSEQGLIKDLGVSGDLVYPTSSVRTVF--SKALNIYLKLPIHVKITNFIRTND 232
++ +G + LG GD S+RT+ S+ + +KLP+ + T+ R
Sbjct: 229 IATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIP 288

Query: 233 LEQIERTIDAAQVIASVKDE-----------VETPHFKLMFEEGYRALLPNPLGQTVEPE 281
I A++ + V + P + EGY AL P E
Sbjct: 289 GRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQ---E 345

Query: 282 MDLLTNSAMIVREGIPNY-HADKDIHVLASLFETMPD-SPISKLAQVIEQSGLAPEAWLE 339
M +I RE + D+ ++A+L E + P+ I++SGL E WL
Sbjct: 346 M-----LGVIWRENPCRWLKPDESPVLMATLMECDENNQPL--AGAYIDRSGLDAETWLT 398

Query: 340 CYLDRTLLPILKLFSNTGISLEAHVQNTLIELKDGIPDVCFVRDLEG-ICLSRTIATEKQ 398
++P+ L G++L AH QN + +K+G+P ++D +G + L + E
Sbjct: 399 QLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD 458

Query: 399 LVPNVVAASSPVVYAHDEAWHRLKYYVVVNHLGHLVSTIGKATRNEVVLWQLVA 452
+P V + + A D H L+ V L + + + E +QL+A
Sbjct: 459 SLPQEVRDVTSRLSA-DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLA 511


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000770PF04183497e-173 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 497 bits (1282), Expect = e-173
Identities = 142/579 (24%), Positives = 251/579 (43%), Gaps = 40/579 (6%)

Query: 1 MHQLVSSLIYENIVVYKASYQDGVGHFTIEGHDSEYRFTAEKTHSFDRIRITSPIERVVG 60
+ +++S L YE + + A Q G + I +++RF AE+ + + I + R
Sbjct: 14 VAKMLSELEYEQV--FHAESQ-GDDRYCINLPGAQWRFIAERG-IWGWLWIDAQTLRC-- 67

Query: 61 DEADTTTDYTQLLREAVFTFPKNDEKLEQFIVELLQTELKDTQSMQYRESNPPATPETFN 120
AD LL + +D + + + +L T L D Q ++ R + N
Sbjct: 68 --ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLN 125

Query: 121 -DYEFYAMEGHQYHPSYKSRLGFTLSDNLKFGPDFVPNVKLQWLAIDKDKVETTVSRNVV 179
D + GH K R G+ ++ P++ +L WLA+ ++ + +
Sbjct: 126 ADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMD 185

Query: 180 VNEMLRQQVGDKTYEHFVQQIEASGKHVNDVEMIPVHPWQFEHVIQVDLAEERLNGTVLW 239
++++L + + + F Q + +G N + +PVHPWQ++ I D + G ++
Sbjct: 186 IHQLLTAAMDPQEFARFSQVWQENGLDHNWL-PLPVHPWQWQQKIATDFIADFAEGRMVS 244

Query: 240 LGESDELYHPQQSIRTMSPIDTT-KYYLKVPISITNTSTKRVLAPHTIENAAQITDWLKQ 298
LGE + + QQS+RT++ +K+P++I NTS R + I + WL+Q
Sbjct: 245 LGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQ 304

Query: 299 IQQQDMYLKDE----LKTVFLGEVLGQSYLNTQLSPYKQTQVYGALGVIWRENIYHMLID 354
+ D L L G V + Y +PY+ ++ LGVIWREN L
Sbjct: 305 VFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEM---LGVIWRENPCRWLKP 361

Query: 355 EEDAIPFNALYASDKDGLPFIEKWIKQYG--SEAWTKQFLAVAIRPMIHMLYYHGIAFES 412
+E + L D++ P +I + G +E W Q V + P+ H+L +G+A +
Sbjct: 362 DESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVALIA 421

Query: 413 HAQNMMLIHENGWPTRIALKDFHDGVRFKREHLSEAASHLTLKPMPEAHKKVNSNSFIET 472
H QN+ L + G P R+ LKDF +R +E E S +P+ + V S
Sbjct: 422 HGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDS------LPQEVRDVTS------ 469

Query: 473 DDERLVRDFLH---DAFFFINIAEIILFIEKQYGIDEQRQWQWVKDIIEAYQEAFPELNN 529
RL D+L F+ + I + + G+ E+R +Q + ++ Y + P+++
Sbjct: 470 ---RLSADYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSE 526

Query: 530 -YQHFDLFEPTIQVEKLTTRRL-LSDSELRIHHVTNPLG 566
+ F LF P I L +L D + + N L
Sbjct: 527 RFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLE 565


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000820DHBDHDRGNASE1284e-38 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 128 bits (323), Expect = 4e-38
Identities = 66/250 (26%), Positives = 113/250 (45%), Gaps = 2/250 (0%)

Query: 5 KVALVTGGAQGIGFKIAERLVEDGFKVAVVDFNEEGAKAAALKLSSDGTKAIAIKADVSN 64
K+A +TG AQGIG +A L G +A VD+N E + L ++ A A ADV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 RDDVFNAVRQTAAQFGDFHVMVNNAGLGPTTPIDTITEEQFKTVYGVNVAGVLWGIQAAH 124
+ + + G ++VN AG+ I ++++E+++ + VN GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 EQFKKFNHGGKIINATSQAGVEGNPGLSLYCSTKFAVRGLTQVAAQDLASEGITVNAFAP 184
+ G I+ S ++ Y S+K A T+ +LA I N +P
Sbjct: 129 KYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 185 GIVQTPMMESIAVATAEEAGKPEAWGWEQFTSQIALGRVSQPEDVSNVVSFLAGKDSDYI 244
G +T M S+ + E F + I L ++++P D+++ V FL + +I
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSL-ETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 245 TGQTIIVDGG 254
T + VDGG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1000830NUCEPIMERASE2179e-71 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 217 bits (554), Expect = 9e-71
Identities = 79/327 (24%), Positives = 139/327 (42%), Gaps = 33/327 (10%)

Query: 6 RVLITGGAGFIGSHLVDDL-QQDYDVYVLDNYRTG-----KRENIKSLADDHVF--ELDI 57
+ L+TG AGFIG H+ L + + V +DN K+ ++ LA ++D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 58 REYDAVEQIMKTYQFDYVIHLAALVSVAESVEKPILSQEINVVATLRLLEIIKKYNSHIK 117
+ + + + + F+ V ++V S+E P + N+ L +LE + I+
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQ 119

Query: 118 RFIFASSAAVYGDLPDLPKSDQSLI-LPLSPYAIDKYYGERTTLNYCSLYNIPTAVVKFF 176
++ASS++VYG +P S + P+S YA K E Y LY +P ++FF
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 177 NVFGPRQDPKSQYSGVISKMFDSFEHNKPFTFFGDGLQTRDFVYVYDVVQSVRLIMEH-- 234
V+GP P M K + G RDF Y+ D+ +++ + +
Sbjct: 180 TVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 235 ---------------KDAIGHGYNIGTGTFTNLLEVYRIIGELYGKSVEHEFKEARKGDI 279
A YNIG + L++ + + + G + + GD+
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDV 295

Query: 280 KHSYADISNL-KALGFVPKYTVETGLK 305
+ AD L + +GF P+ TV+ G+K
Sbjct: 296 LETSADTKALYEVIGFTPETTVKDGVK 322


26SAI8T7_1001280SAI8T7_1001320N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10012803141.468170Similar to integral membrane protein LmrP
SAI8T7_10012903131.731674Gramicidin S synthetase 2 related protein
SAI8T7_10013002142.429596Putative uncharacterized protein
SAI8T7_10013102152.006495Putative uncharacterized protein
SAI8T7_10013201151.927579Acetylglutamate kinase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001280TCRTETA320.004 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.004
Identities = 61/337 (18%), Positives = 127/337 (37%), Gaps = 33/337 (9%)

Query: 7 TLKVRLISNFLQLIITTAFIPFIALYLTDMLS----QSIVGIYLVGLVVLKFPLSIISGY 62
L V L + L + +P + L D++ + GI L +++F + + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 63 LIEIFPKKLLVLIYQATMVIMLVFMGVFGSHQLWQI-IGFCVAYAIFTIVWGLQFPVMDT 121
L + F ++ ++L+ A + M + LW + IG VA + G V
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMAT--APFLWVLYIGRIVAG-----ITGATGAVAGA 118

Query: 122 LIMDAITEDVEHYIYKISYWMTNLSVAIGALLGGLMYGYSMLLLFLIAACIFLIVLFILY 181
I D D + + G +LGGLM G+S F AA + +
Sbjct: 119 YIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178

Query: 182 IWLPQDRNQVKQSDDKRHASRYQKLQIMNIFRSYKLVLKDRNYMLLISGFSIIMMGEFSI 241
LP+ ++ + + + L+ F + ++G+
Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAA--------LMAVFFIMQLVGQVPA 230

Query: 242 SSYIAIRLKDQF--ETISIGSYDITGAKMLAILLMINTVVVILLTYSISKVVLKIDFKKA 299
+ + I +D+F + +IG LA +++++ ++T ++ ++ ++A
Sbjct: 231 ALW-VIFGEDRFHWDATTIGI-------SLAAFGILHSLAQAMITGPVAA---RLGERRA 279

Query: 300 LITGLLIYIVGYSGLTYLNQFGLLVVFMIIATVGEII 336
L+ G++ GY L + + + M++ G I
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIG 316


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001290NUCEPIMERASE538e-09 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 52.9 bits (127), Expect = 8e-09
Identities = 54/266 (20%), Positives = 101/266 (37%), Gaps = 55/266 (20%)

Query: 2052 NTLLTGATGFLGAYLIEALQGYSHRIYCFIRADNEEIAWYKLMTNLNDYFS----EETVE 2107
L+TGA GF+G ++ + L H++ + D NLNDY+ + +E
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV---VGID-----------NLNDYYDVSLKQARLE 47

Query: 2108 MM----LSNIEVIVGDFECMDDVVLPENMDTIIH----AGARTDHFGDDDEFEKVNVQGT 2159
++ ++ + D E M D+ + + + R + + N+ G
Sbjct: 48 LLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVR-YSLENPHAYADSNLTGF 106

Query: 2160 VDVIRLAQQHH-ARLIYVSTISV-GTYFDIDTEDVTFSEADVYKGQLLTSPYTRSKFYSE 2217
++++ + + L+Y S+ SV G + FS D + S Y +K +E
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYG-----LNRKMPFSTDDSVDHPV--SLYAATKKANE 159

Query: 2218 LKVLEAVNN-GLDGRIVRVGNLTSPYNGRWHM------RNIKTNRFSMVMNDLLQLDCIG 2270
L + GL +R + P+ GR M + + + V N
Sbjct: 160 LMAHTYSHLYGLPATGLRFFTVYGPW-GRPDMALFKFTKAMLEGKSIDVYNY-------- 210

Query: 2271 VSMAEMPVDFSFVDTTARQIVALAQV 2296
+M DF+++D A I+ L V
Sbjct: 211 ---GKMKRDFTYIDDIAEAIIRLQDV 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001300ENTSNTHTASED290.009 Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 29.2 bits (65), Expect = 0.009
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 84 GQP-----IYVSLSYSYPYIVCVVDKEPVGIDIEKISQRLDWRTLVTCFSTNEAHQI 135
QP ++ S+S+ + V+ ++ +GIDIEKI + L ++ QI
Sbjct: 76 RQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATELAPSIIDSDERQI 132


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001320CARBMTKINASE320.002 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 31.7 bits (72), Expect = 0.002
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 155 INADTLAYFIASSLKAPIYV-LSNIAGVLIN-----DVVIPQLPLVDIHQYIEHGD-IYG 207
I+ D +A + A I++ L+++ G + + + ++ + ++ +Y E G G
Sbjct: 213 IDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAG 272

Query: 208 GMIPKVLDAKNAIENGCPKVIIAS 231
M PKVL A IE G + IIA
Sbjct: 273 SMGPKVLAAIRFIEWGGERAIIAH 296


27SAI8T7_1001680SAI8T7_1001750N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10016801181.484855MFS family major facilitator transporter, hexose
SAI8T7_10016901140.302156Uncharacterized response regulatory protein
SAI8T7_10017000150.601654Uncharacterized sensor-like histidine kinase
SAI8T7_1001710-2142.074289Putative Similar to periplasmic-iron-binding
SAI8T7_1001720-1142.615695Formate acetyltransferase
SAI8T7_1001730-2132.798598Pyruvate formate-lyase-activating enzyme
SAI8T7_1001740-1132.931975Glycerophosphoryl diester phosphodiesterase
SAI8T7_1001750-1144.075711Staphylocoagulase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001680TCRTETA388e-05 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.9 bits (88), Expect = 8e-05
Identities = 53/361 (14%), Positives = 121/361 (33%), Gaps = 40/361 (11%)

Query: 36 AFFVVFFVYMAMYLIRNNFKAAQPFLKEEIGLSTLELGYIGL---AFSITYGLGKTLLGY 92
V + + LI P L ++ S + G+ +++ +LG
Sbjct: 10 ILSTVALDAVGIGLI----MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 93 FVDGRNTKRIISFLLILSAITVLIMGFVLSYFGSVMGLLIVLWGLNGVFQSVGGPASYST 152
D + ++ L +A+ IM + F V+ + ++ G+ G G + +
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMAT--APFLWVLYIGRIVAGITG----ATGAVAGAY 119

Query: 153 ISRWAPRTKRGRYLGFWNTSHNIGGAIAGGVALWGANVFFHGNVIGMFIFPSVIALLIGI 212
I+ +R R+ GF + G + H F + + L +
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----FFAAAALNGLNFL 175

Query: 213 ATLFIGKDDPEELGWNRAEEIWEEPVDKENIDSQGMTKWEIFKKYILGNPVIWILCVSNV 272
F+ + + + P+ +E ++ +W V ++ V +
Sbjct: 176 TGCFLLPE---------SHKGERRPLRREALNPLASFRWARGMT-----VVAALMAVFFI 221

Query: 273 FVYIVRIGIDNWAPLYVSEHLHFSKGDAVNTIFYFEI-GALVASLLWGYVSDLLKGRRAI 331
+ ++ W ++ + H+ ++ F I +L +++ G V+ L RRA+
Sbjct: 222 MQLVGQVPAALWV-IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRAL 280

Query: 332 VAIGCMFMITFVVLFYTNATSVMMVNISLFALGALIFGPQLLIGVSLTGFVPKNAISVAN 391
+ +++L + + + L A G I P +L + +
Sbjct: 281 MLGMIADGTGYILLAFATRGWMAFPIMVLLASGG-IGMP------ALQAMLSRQVDEERQ 333

Query: 392 G 392
G
Sbjct: 334 G 334


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001690HTHFIS441e-07 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 44.1 bits (104), Expect = 1e-07
Identities = 30/114 (26%), Positives = 50/114 (43%), Gaps = 9/114 (7%)

Query: 1 MPRKNGVDLLNDI--ALLDCNVIILSSYDDFEYMKAGIQHHVLDYLLKPVDHAQLEVILG 58
MP +N DLL I A D V+++S+ + F + DYL KP D L ++G
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD---LTELIG 113

Query: 59 RLVRTLLEQQSQNGRSLASCHDAFQPLLKVEYDDYYVNQIVDQIKQSYQTKVTV 112
+ R L E + + + D PL+ + +I + + QT +T+
Sbjct: 114 IIGRALAEPKRRPSKLEDDSQD-GMPLVG---RSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001700PF065801466e-42 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 146 bits (371), Expect = 6e-42
Identities = 55/226 (24%), Positives = 109/226 (48%), Gaps = 16/226 (7%)

Query: 278 YIYDLFESNEQLIHSIEHTERRLRDIQLKEIERQFQPHFLFNTMQTIQYLITLSPKLAQT 337
+ + F++ +Q ++ QL ++ Q PHF+FN + I+ LI P A+
Sbjct: 136 FGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKARE 195

Query: 338 VVQQLSQMLRYSLR-TNSHTVELNEELNYIEQYVAIQNIRFDDMIKLHIESSEEARHQTI 396
++ LS+++RYSLR +N+ V L +EL ++ Y+ + +I+F+D ++ + + +
Sbjct: 196 MLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV 255

Query: 397 GKMMLQPLIENAIKHGRDTESLDITIRLTLARQN--LHVLVCDNGIGMSSSRLQYVRQSL 454
M++Q L+EN IKHG I L + N + + V + G L+ ++S
Sbjct: 256 PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA----LKNTKES- 310

Query: 455 NNDVFDTKHLGLNHLHNKAMIQYGSHARLHIFSKRNQGTLICYKIP 500
GL ++ + + YG+ A++ + K+ + + IP
Sbjct: 311 -------TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL-IP 348


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001720SHAPEPROTEIN320.006 Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 32.4 bits (74), Expect = 0.006
Identities = 18/54 (33%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 262 AYLAAIKEQNGAAMSLGRTSTFLDIYAERDLKAGVITESEV-QEIIDHFIMKLR 314
+AA+ A LGRT +I A R +K GVI + V ++++ HFI ++
Sbjct: 50 KSVAAVGHD--AKQMLGRTPG--NIAAIRPMKDGVIADFFVTEKMLQHFIKQVH 99


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1001750IGASERPTASE320.009 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.009
Identities = 37/215 (17%), Positives = 66/215 (30%), Gaps = 16/215 (7%)

Query: 249 ETKQNRPNSITKYDPTKHNFKEKSENKPNFDKLVEETKKAVKEADESWKNKTVKKYEETV 308
E+K N + T N + E K N + + A ++ T K TV
Sbjct: 1047 ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV 1106

Query: 309 TKSPVVKEEKKVEEPQLPKVGNQQEVKTTAGKAEETTQPVAQPLVKIPQETIYGETVKGP 368
K +E+ KVE + +V + + ET QP A+P TV
Sbjct: 1107 EK----EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP------ARENDPTVNIK 1156

Query: 369 EYPTMENKTLQGEIVQGPDFLTMEQNRPSLSDNYTQPTTPNPILEGLEGSSSKLEIKPQG 428
E + N + Q P T ++++ T T + + + + +P
Sbjct: 1157 EPQSQTNT--TADTEQ-PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA--TTQPTV 1211

Query: 429 TESTLKGIQGESSDIEVKPQATETTEASQYGPRPQ 463
+ + V+ A+
Sbjct: 1212 NSESSNKPK-NRHRRSVRSVPHNVEPATTSSNDRS 1245


28SAI8T7_1003010SAI8T7_1003140N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1003010-214-0.479160Putative nucleoside-diphosphate-sugar epimerase
SAI8T7_1003020216-1.075687Exotoxin=6
SAI8T7_1003030115-1.175554Exotoxin=7
SAI8T7_1003040314-1.689685Putative uncharacterized protein (fragment)
SAI8T7_1003050415-2.798544Exotoxin=8
SAI8T7_1003060216-2.991972Superantigen-like protein
SAI8T7_1003070215-1.204044Superantigen-like protein 5
SAI8T7_1003080113-1.942571Exotoxin=11
SAI8T7_1003090113-2.000277Superantigen-like protein 7
SAI8T7_100310039-1.605994Exotoxin=13
SAI8T7_100311049-1.710183Exotoxin=14
SAI8T7_1003120411-1.932228Type I restriction enzyme EcoR124II M protein
SAI8T7_10031301014-3.878715Type I restriction modification DNA specificity
SAI8T7_10031401013-3.605067Exotoxin=15
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003010NUCEPIMERASE300.009 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 30.1 bits (68), Expect = 0.009
Identities = 28/167 (16%), Positives = 61/167 (36%), Gaps = 32/167 (19%)

Query: 1 MNIMLTGATGHLGTHITNQAIANHIDHFHIGVRNVEKVPD----------DWRGKVSVRQ 50
M ++TGA G +G H++ + + H +G+ N+ D + +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEA--GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK 58

Query: 51 LDYFNQESMVEAFK--GMDTVVFI-------PSIIHP-SFKRIPEV--ENLVYAAKQSGV 98
+D ++E M + F + V S+ +P ++ N++ + + +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 99 AHIIFIG---YYADQHNNPFHMS-----PYFGYASRLLSTSGIDYTY 137
H+++ Y PF P YA+ + + +TY
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003020TOXICSSTOXIN953e-26 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 95.5 bits (237), Expect = 3e-26
Identities = 46/214 (21%), Positives = 84/214 (39%), Gaps = 11/214 (5%)

Query: 18 TGVITSNVQSVQAKTEVKQQSESELKHYYNKPVLERKNVTGYKYTEKGKDYIDVIVDNQY 77
T V S+ Q ++ + +L +Y+ N + + + + +
Sbjct: 25 TPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNS---EVLDNSLGSMRIKNTDGS 81

Query: 78 SQISLVGSDKDKFKDGDNSNIDVFILREGDSRQATN-----YSIGGVTKTNSQPFIDYIH 132
+ + S +D+ R S+ + + I GVT T P I
Sbjct: 82 ISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIE 139

Query: 133 TPILEIKKGKEEPQSSLYQIYKEDISLKELDYRLRERAIKQHGLYSNGLKQGQI-TITMK 191
P+ GK+ P + K+ +++ LD+ +R + + HGLY + K G ITM
Sbjct: 140 LPLKVKVHGKDSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMN 199

Query: 192 DGKSHTIDLSQKLEKERMGDSIDGRQIQKILVEM 225
DG ++ DLS+K E I+ +I+ I E+
Sbjct: 200 DGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003030TOXICSSTOXIN803e-21 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 80.5 bits (198), Expect = 3e-21
Identities = 32/168 (19%), Positives = 63/168 (37%), Gaps = 18/168 (10%)

Query: 1 MNIIDGNSVNNLALIGKDKQHYHTGVHRNLNIFYVN-----EDKRFEGAKYSIGGITSAN 55
M I + + +L + +++ + I G+T+
Sbjct: 73 MRIKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTE 132

Query: 56 DKA--VDLIAEARVIKADHIGEYDYDFFPFKIDKEAMSLKEIDFKLRKYLIDNYGLYGEM 113
++L + +V D +Y F DK+ +++ +DF++R L +GLY
Sbjct: 133 KLPTPIELPLKVKVHGKDSPLKYGPKF-----DKKQLAISTLDFEIRHQLTQIHGLYRSS 187

Query: 114 ST----GKITVKKKYYGKYTFELDKKLQEDRMSDVINVTDIDRIEIKV 157
KIT+ Y +L KK + + IN+ +I IE ++
Sbjct: 188 DKTGGYWKITMNDG--STYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003050TOXICSSTOXIN895e-24 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 88.9 bits (220), Expect = 5e-24
Identities = 28/128 (21%), Positives = 48/128 (37%), Gaps = 5/128 (3%)

Query: 67 VFIVLEDNKYQLKKYSVGGITKTNSKKVDHKAELSVTKKDNQGMISRDVSEYMITKEEIS 126
++ + + G+T T + L V + K++++
Sbjct: 109 TKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYG---PKFDKKQLA 165

Query: 127 LKELDFKLRKQLIEKHNLYGNM--GSGTIVIKMKNGGKYTFELHKKLQEHRMADVIEGTN 184
+ LDF++R QL + H LY + G I M +G Y +L KK + + I
Sbjct: 166 ISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDE 225

Query: 185 IDKIEVNI 192
I IE I
Sbjct: 226 IKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003060TOXICSSTOXIN1018e-28 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 101 bits (253), Expect = 8e-28
Identities = 45/216 (20%), Positives = 81/216 (37%), Gaps = 13/216 (6%)

Query: 87 TKVETPQSPTTKQVPTEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNIVPDYF 146
T V + K N KDL +Y+ S F N ++ ++R N
Sbjct: 25 TPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTN-SEVLDNSLGSMRIKNTDGSI- 82

Query: 147 IYKIALVGKDDKKYDEGVHRNVDVFVVLEEKNKYGVE----RYSVGGITKSNSKKVDHKA 202
+ + VD+ +K+++ E + + G+T + +
Sbjct: 83 --SLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIEL 140

Query: 203 GVRITKEDNKGTISHDVSEFKITKEQISLKELDFKLRKQLIENHNLYGNV--GSGKIVIN 260
+++ + + K K+Q+++ LDF++R QL + H LY + G I
Sbjct: 141 PLKVKVHGKDSPLKYG---PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKIT 197

Query: 261 MKNGGKYTFELHKKLQENRMADVIDGTNIDNIEVNI 296
M +G Y +L KK + N I+ I IE I
Sbjct: 198 MNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003070TOXICSSTOXIN1352e-41 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 135 bits (340), Expect = 2e-41
Identities = 49/201 (24%), Positives = 73/201 (36%), Gaps = 14/201 (6%)

Query: 44 NVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKNRKFTRVQIFGKDIERFKAR 103
+ +I DL D+YS S N S G + + IF
Sbjct: 41 STNDNIKDLLDWYSSGSDTFTNSEVLDNSLGS---MRIKNTDGSISLIIFPSPYYSPAFT 97

Query: 104 KNPGLDI-----FVVKEAENRNGTVFSYGGVTKKNQDAYYDYINAPRFQIKRDEGDGIAT 158
K +D+ + F GVT + I P +K D
Sbjct: 98 KGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELPLK-VKVHGKDSPLK 154

Query: 159 YGRVHYIYKEEISLKELDFKLRQYLIQNFDLYKKFPKDSKI-KVIMKDGGYYTFELNKKL 217
YG K+++++ LDF++R L Q LY+ K K+ M DG Y +L+KK
Sbjct: 155 YG--PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKF 212

Query: 218 QTNRMSDVIDGRNIEKIEANI 238
+ N I+ I+ IEA I
Sbjct: 213 EYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003080TOXICSSTOXIN1921e-63 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 192 bits (488), Expect = 1e-63
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 16/197 (8%)

Query: 42 DIKDLYRYYSSESFEFSNI--------SGKVENYNGSNVVRFNQEKQNHQLFLLGKDKDK 93
+IKDL +YSS S F+N S +++N +GS + F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 94 YKKGLEGQNVFVVKELIDPNGRLSTVGGVTKKNNKSSETNTHLFVNKVYGGNLDASIDSF 153
K L + + + + GVT + L V KV+G +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKV-KVHGKDSPLKYGP- 157

Query: 154 LINKEEVSLKELDFKIRKQLVEKYGLYKGTTKYGKI-TINLKDEKKEVIDLGDKLQFERM 212
+K+++++ LDF+IR QL + +GLY+ + K G I + D DL K ++
Sbjct: 158 KFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 213 GDVLNSKDIQNIAVTIN 229
+N +I+ I IN
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003090TOXICSSTOXIN1252e-37 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 125 bits (314), Expect = 2e-37
Identities = 47/199 (23%), Positives = 73/199 (36%), Gaps = 19/199 (9%)

Query: 44 DTNKLHQYYSGPSYELTNV--------SGQSQGYYDSNVLLFNQQNQKFQVFLLGKDENK 95
+ L +YS S TN S + + S L+ F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 96 YKEKTHGLDVFAVPELVDLDGRIFSVSGVTKKNVKSIFESLRTPNLLVKKIDDKDGFSID 155
K + + F +SGVT L L K+ KD +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP----LKVKVHGKDSP-LK 154

Query: 156 EFFFIQKEEVSLKELDFKIRKLLIKKYKLYEGSA-DKGRIVINMKDENKYEIDLSDKLDF 214
K+++++ LDF+IR L + + LY S G I M D + Y+ DLS K ++
Sbjct: 155 YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEY 214

Query: 215 ERMADVINSEQIKNIEVNL 233
IN ++IK IE +
Sbjct: 215 NTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003100TOXICSSTOXIN1301e-39 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 130 bits (329), Expect = 1e-39
Identities = 39/197 (19%), Positives = 69/197 (35%), Gaps = 15/197 (7%)

Query: 43 INMLHQYYSEESFESTNISVKSEDYYGSNVLNFNQRNKTFKVFLLGDDKNKY------KE 96
I L +YS S TN V + + + + + K
Sbjct: 46 IKDLLDWYSSGSDTFTNSEVLD---NSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKV 102

Query: 97 KTHGLDVFAVPELIDIKGGIYSVGGITKKNVRSVFGFVSNPSLQVKKVDAKHGFSINELF 156
+ + + + G+T + P L+VK F
Sbjct: 103 DLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELP-LKVKVHGKDSPLKYGPKF 159

Query: 157 FIQKEEVSLKELDFKIRKMLVEKYRLYKGAS-DKGRIVINMKDEKKYVIDLSEKLSFDRM 215
K+++++ LDF+IR L + + LY+ + G I M D Y DLS+K ++
Sbjct: 160 --DKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 216 FDVMDSKQIKNIEVNLN 232
++ +IK IE +N
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003110TOXICSSTOXIN1934e-64 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 193 bits (491), Expect = 4e-64
Identities = 51/202 (25%), Positives = 92/202 (45%), Gaps = 10/202 (4%)

Query: 31 KQNQKSVNKHDKEALYRYYTGKTMEMKNISALKHGKNNLRFKFRGIKIQVLLPGNDKSKF 90
K + S N + K+ L Y +G + N L + ++R K I +++ +
Sbjct: 36 KTAKASTNDNIKDLLDWYSSG-SDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSP 94

Query: 91 QQRSYEGLDVFFVQEKRDKHD-----IFYTVGGVIQNNKTSGVVSAPILNISKEKGEDAF 145
E +D+ + K+ +H I + + GV K + P L + K G+D+
Sbjct: 95 AFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP-LKV-KVHGKDSP 152

Query: 146 VKGYPYYIKKEKITLKELDYKLRKHLIEKYGLYKTISKDGRV-KISLKDGSFYNLDLRSK 204
+K Y K+++ + LD+++R L + +GLY++ K G KI++ DGS Y DL K
Sbjct: 153 LK-YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKK 211

Query: 205 LKFKYMGEVIESKQIKDIEVNL 226
++ I +IK IE +
Sbjct: 212 FEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1003140TOXICSSTOXIN1084e-31 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 108 bits (270), Expect = 4e-31
Identities = 47/225 (20%), Positives = 86/225 (38%), Gaps = 19/225 (8%)

Query: 16 LTTGMITTTAQPVKASTLEVRSQAT-------QDLSEYYKGRGFELTNVTGYKYG-NKVT 67
L T PV S+ ++ A +DL ++Y TN +
Sbjct: 15 LLLATTATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNSEVLDNSLGSMR 74

Query: 68 FIDNSQQIDVTLTGNE----KLTVKDDDEVSNVDVFVVREGSDKSAITTSIGGITKTNGT 123
+ I + + + T + +++ + S+ + I I G+T T
Sbjct: 75 IKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTE-- 132

Query: 124 QHKDTVQNVNLSVSKSTGQHTTSVTSEYYSIYKEEISLKELDFKLRKHLIDKHDLYKTEP 183
T + L V K G+ S K+++++ LDF++R L H LY++
Sbjct: 133 -KLPTPIELPLKV-KVHGK--DSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSD 188

Query: 184 KDSKI-RITMKNGGYYTFELNKKLQPHRMGDTIDSRNIEKIEVNL 227
K +ITM +G Y +L+KK + + I+ I+ IE +
Sbjct: 189 KTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


29SAI8T7_1006170SAI8T7_1006240N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10061705151.254877Putative uncharacterized protein
SAI8T7_10061804141.209504Transposase for insertion sequence element IS256
SAI8T7_10061904150.626807Clumping factor A
SAI8T7_1006200012-2.151795Coagulase family protein
SAI8T7_1006210113-1.650844Putative Extracellular matrix protein-binding
SAI8T7_1006220-115-1.933508Putative uncharacterized protein
SAI8T7_1006230-112-1.550712Thermonuclease
SAI8T7_1006240012-1.827080Putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006170ALARACEMASE270.049 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 26.7 bits (59), Expect = 0.049
Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 135 MYDIYP-PYDGIPDEAFLI-KELKVNSLAGKTGTINY 169
D+ P P GI L KE+K++ +A GT+ Y
Sbjct: 305 AVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGY 341


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006190ICENUCLEATIN437e-06 Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 42.8 bits (100), Expect = 7e-06
Identities = 72/369 (19%), Positives = 131/369 (35%), Gaps = 6/369 (1%)

Query: 514 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASD 573
S G+DS + S +G +ST +G S + SD + S + DS+
Sbjct: 294 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 353

Query: 574 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSD 633
+ S + DS + S + SD + S + +DS+ + S + +
Sbjct: 354 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 413

Query: 634 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
S + S + SD + S + DS + S + DS + S
Sbjct: 414 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 473

Query: 694 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 753
+ SD + S S + +S + S + S + S + ++SD +
Sbjct: 474 AQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYG 533

Query: 754 SDSDSDSDSD------SDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 807
S S + ++S S + +S + S + SD + S + SDS
Sbjct: 534 STSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSII 593

Query: 808 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSD 867
+ S + S + S + S +G S S++ +DS + GS + +
Sbjct: 594 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYN 653

Query: 868 SNSDSESGS 876
S + GS
Sbjct: 654 SILTAGYGS 662



Score = 42.8 bits (100), Expect = 7e-06
Identities = 74/379 (19%), Positives = 135/379 (35%), Gaps = 6/379 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + E+S G S + S +G ST +G DS+ + S + DS
Sbjct: 309 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 368

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + SD + S + +DS+ + S + +S + S +
Sbjct: 369 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 428

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
SD + S + DS + S + DS + S + SD + S S
Sbjct: 429 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTS 488

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD----- 739
+ +S + S + S + S + ++SD + S S + ++S
Sbjct: 489 TAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGY 548

Query: 740 -SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 798
S + +S + S + SD + S + SDS + S + S
Sbjct: 549 GSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSL 608

Query: 799 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 858
+ S + S + S S + +DS + S +G +S ++ S T+
Sbjct: 609 TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQE 668

Query: 859 GSDNDSDSDSNSDSESGSN 877
GSD + S S + + S+
Sbjct: 669 GSDLTAGYGSTSTAGADSS 687



Score = 42.0 bits (98), Expect = 1e-05
Identities = 69/363 (19%), Positives = 125/363 (34%), Gaps = 6/363 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + EDS G S + S +G ST +G+DS+ + S + +S
Sbjct: 261 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQ 320

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + SD + S + DS+ + S + DS + S +
Sbjct: 321 TAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 380

Query: 625 DSDSDSDSDSDSDSDSDSD------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
SD + S + +DS S + +S + S + SD + S
Sbjct: 381 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 440

Query: 679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 738
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 441 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGY 500

Query: 739 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 798
S + S + S + ++SD + S S + ++S + S + +S
Sbjct: 501 GSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVL 560

Query: 799 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 858
+ S + SD + S + SDS + S + S ++ S T+
Sbjct: 561 TAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTARE 620

Query: 859 GSD 861
S
Sbjct: 621 QSV 623



Score = 42.0 bits (98), Expect = 1e-05
Identities = 72/369 (19%), Positives = 132/369 (35%), Gaps = 6/369 (1%)

Query: 514 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASD 573
S G DS + S +G DS+ +G S + SD + S + +DS+
Sbjct: 342 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLI 401

Query: 574 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSD 633
+ S + +S + S + SD + S + DS+ + S + D
Sbjct: 402 AGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGED 461

Query: 634 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 693
S + S + SD + S S + +S + S + S + S
Sbjct: 462 SSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQT 521

Query: 694 SDSDSDSDSDSDSDSDSDSDSD------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 747
+ ++SD + S S + ++S S + +S + S + SD +
Sbjct: 522 AQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYG 581

Query: 748 SDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 807
S + SDS + S + S + S + S + S S + +DS
Sbjct: 582 STGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 641

Query: 808 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSD 867
+ S + +S + S + SD +G S S++ +DS + GS + +
Sbjct: 642 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYN 701

Query: 868 SNSDSESGS 876
S + GS
Sbjct: 702 SILTAGYGS 710



Score = 41.3 bits (96), Expect = 2e-05
Identities = 76/379 (20%), Positives = 138/379 (36%), Gaps = 6/379 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSA------SDSDSASDSDS 558
G + EDS G S + S +G ST +G+DS+ S + +S
Sbjct: 357 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQ 416

Query: 559 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 618
+ S + SD + S + DS+ + S + DS + S +
Sbjct: 417 TAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 476

Query: 619 ASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
SD + S S + +S + S + S + S + ++SD + S S
Sbjct: 477 GSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTS 536

Query: 679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 738
+ ++S + S + +S + S + SD + S + SDS +
Sbjct: 537 TAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGY 596

Query: 739 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 798
S + S + S + S + S S + +DS + S + +S
Sbjct: 597 GSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 656

Query: 799 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 858
+ S ++ SD + S S + +DS + S +G +S ++ S T+
Sbjct: 657 TAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQE 716

Query: 859 GSDNDSDSDSNSDSESGSN 877
GSD S S S + + S+
Sbjct: 717 GSDLTSGYGSTSTAGADSS 735



Score = 41.3 bits (96), Expect = 2e-05
Identities = 74/372 (19%), Positives = 133/372 (35%), Gaps = 2/372 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + + SD G S + DS +G ST +G DS+ + S + SD
Sbjct: 229 GSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 288

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + + S + S + +S + S + SD + S +
Sbjct: 289 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 348

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + DS + S + SD + S + +DS + S
Sbjct: 349 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQ 408

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ +S + S + SD + S + DS + S + DS +
Sbjct: 409 TAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 468

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD + S S + +S + S + S + S + ++SD
Sbjct: 469 GSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDL 528

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
+ S S + ++S + S + +S +G S ++ SD T+ GS +
Sbjct: 529 ITGYGSTSTAGANSSLIAGYGSTQTASYNSV--LTAGYGSTQTAREGSDLTAGYGSTGTA 586

Query: 865 DSDSNSDSESGS 876
SDS+ + GS
Sbjct: 587 GSDSSIIAGYGS 598



Score = 40.9 bits (95), Expect = 3e-05
Identities = 74/369 (20%), Positives = 135/369 (36%), Gaps = 6/369 (1%)

Query: 514 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA-- 571
S G+DS + S +G +ST +G S + SD + S + DS+
Sbjct: 390 STGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLI 449

Query: 572 ----SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSD 627
S + DS + S + SD + S S + +S+ + S +
Sbjct: 450 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYG 509

Query: 628 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 687
S + S + ++SD + S S + ++S + S + +S + S
Sbjct: 510 STLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQT 569

Query: 688 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 747
+ SD + S + SDS + S + S + S + S +
Sbjct: 570 AREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYG 629

Query: 748 SDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 807
S S + +DS + S + +S + S ++ SD + S S + +DS
Sbjct: 630 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLI 689

Query: 808 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSD 867
+ S + +S + S + SD SG S S++ +DS + GS +
Sbjct: 690 AGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYH 749

Query: 868 SNSDSESGS 876
S+ + GS
Sbjct: 750 SSLTAGYGS 758



Score = 40.9 bits (95), Expect = 3e-05
Identities = 71/378 (18%), Positives = 130/378 (34%), Gaps = 4/378 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G E + S G S + +DS +G ST +G +S+ + S SD
Sbjct: 181 GSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDL 240

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + S + S + DS+ + S + SD + S + +
Sbjct: 241 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 300

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + +S + S + SD + S + DS + S
Sbjct: 301 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 360

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ DS + S + SD + S + +DS + S + +S +
Sbjct: 361 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY 420

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD + S + DS + S + +S + S + SD
Sbjct: 421 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 480

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSD----SDSTSDTGS 860
+ S S + +S + S + S + GS + ++SD STS G+
Sbjct: 481 TAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGA 540

Query: 861 DNDSDSDSNSDSESGSNN 878
++ + S + N+
Sbjct: 541 NSSLIAGYGSTQTASYNS 558



Score = 39.4 bits (91), Expect = 8e-05
Identities = 70/381 (18%), Positives = 126/381 (33%), Gaps = 14/381 (3%)

Query: 510 IPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDS--------------ASDSDSASD 555
+P D D +SGS + + + ST S +S +
Sbjct: 138 LPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYG 197

Query: 556 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD 615
S + +DS + S + +S + S SD + S + DS+
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 616 SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 675
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 676 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 735
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 736 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSD 795
+ SD + S + +DS + S + +S + S ++ SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 796 SDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDST 855
S + DS + S + DS + S + SD +G S S++ +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 856 SDTGSDNDSDSDSNSDSESGS 876
+ GS + S + GS
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGS 518



Score = 38.6 bits (89), Expect = 1e-04
Identities = 73/381 (19%), Positives = 139/381 (36%), Gaps = 4/381 (1%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + E+S G S + S +G ST +G DS+ + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + SD + S S + +S+ + S + S + S + +
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
+SD + S S + ++S + S + +S + S + SD + S
Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ SDS + S + S + S + S + S S + +DS +
Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + +S + S + SD + S S + ++S + S + +S
Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS----DSDSTSDTGS 860
+ S + SD S S S + +DS+ + GS +S S ST
Sbjct: 705 TAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTARE 764

Query: 861 DNDSDSDSNSDSESGSNNNVV 881
+ + S S +G++++++
Sbjct: 765 QSVLTTGYGSTSTAGADSSLI 785



Score = 37.4 bits (86), Expect = 4e-04
Identities = 70/375 (18%), Positives = 131/375 (34%), Gaps = 6/375 (1%)

Query: 513 DSDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 570
S + GS + S +G ST +G+DS + S + +S + S
Sbjct: 171 THQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGS 230

Query: 571 ASDSDSASDSDSASDSDSASDSDSA----SDSDSASDSDSASDSDSASDSDSASDSDSDS 626
SD + S + DS+ S + DS+ + S + SD +
Sbjct: 231 TQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 290

Query: 627 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 686
S + +DS + S + +S + S + SD + S + DS
Sbjct: 291 GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDS 350

Query: 687 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 746
+ S + DS + S + SD + S + +DS + S +
Sbjct: 351 SLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTA 410

Query: 747 DSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDS 806
+S + S + SD + S + DS + S + DS + S
Sbjct: 411 GEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 470

Query: 807 DSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDS 866
+ SD + S S + +S + S + S+ + ST +++D +
Sbjct: 471 TQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLIT 530

Query: 867 DSNSDSESGSNNNVV 881
S S +G+N++++
Sbjct: 531 GYGSTSTAGANSSLI 545



Score = 37.0 bits (85), Expect = 4e-04
Identities = 75/372 (20%), Positives = 135/372 (36%), Gaps = 2/372 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + + SD G S + DS +G ST +G DS+ + S + SD
Sbjct: 421 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 480

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S S + S + S + S + S + ++SD + S S + +
Sbjct: 481 TAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGA 540

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
+S + S + +S + S + SD + S + SDS + S
Sbjct: 541 NSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQ 600

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ S + S + S + S S + +DS + S + +S +
Sbjct: 601 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGY 660

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD + S S + +DS + S + S + S + SD
Sbjct: 661 GSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDL 720

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
S S S + +DS + S + S+ +G S ++ S T+ GS + +
Sbjct: 721 TSGYGSTSTAGADSSLIAGYGSTQTASYHSS--LTAGYGSTQTAREQSVLTTGYGSTSTA 778

Query: 865 DSDSNSDSESGS 876
+DS+ + GS
Sbjct: 779 GADSSLIAGYGS 790



Score = 37.0 bits (85), Expect = 4e-04
Identities = 75/373 (20%), Positives = 135/373 (36%), Gaps = 2/373 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + + SD G S S + +S +G ST +G S + S + ++SD
Sbjct: 469 GSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDL 528

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S S + + S + S + +S + S + SD + S + S
Sbjct: 529 ITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGS 588

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + S + S + S + S S + +DS + S
Sbjct: 589 DSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQ 648

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ +S + S + SD + S S + +DS + S + +S +
Sbjct: 649 TAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGY 708

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + SD S S S + +DS + S + S + S + S
Sbjct: 709 GSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 768

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
+ S S + +DS + S + S +G S ++ SD T+ GS + +
Sbjct: 769 TTGYGSTSTAGADSSLIAGYGSTQTAGYHSI--LTAGYGSTQTAQERSDLTTGYGSTSTA 826

Query: 865 DSDSNSDSESGSN 877
+DS+ + GS
Sbjct: 827 GADSSLIAGYGST 839



Score = 37.0 bits (85), Expect = 4e-04
Identities = 67/339 (19%), Positives = 124/339 (36%), Gaps = 2/339 (0%)

Query: 514 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 571
DS + GS + GSD +G STS +G +S+ + S + S + S
Sbjct: 460 EDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGST 519

Query: 572 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 631
+ + SD + S S + ++S+ + S ++ +S + S + SD +
Sbjct: 520 QTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAG 579

Query: 632 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
S + SDS + S + S + S + S + S S + +DS
Sbjct: 580 YGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSS 639

Query: 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 751
+ S + +S + S + SD + S S + +DS + S +
Sbjct: 640 LIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAG 699

Query: 752 SDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESD 811
+S + S + SD S S S + ++S + S + S + S
Sbjct: 700 YNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGST 759

Query: 812 SDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ S + S S + +DS+ + GS + S
Sbjct: 760 QTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHS 798



Score = 33.6 bits (76), Expect = 0.005
Identities = 56/335 (16%), Positives = 107/335 (31%)

Query: 527 NSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 586
S +D + + + S ++ D D+ +S S +
Sbjct: 97 TKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPT 156

Query: 587 DSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDS 646
+ + S S + S + +S + S + +DS + S
Sbjct: 157 QTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQ 216

Query: 647 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 706
+ +S + S SD + S + DS + S + DS +
Sbjct: 217 TAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 276

Query: 707 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 766
S + SD + S + +DS + S + +S + S + SD
Sbjct: 277 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 336

Query: 767 DSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDS 826
+ S + DS + S + DS + S ++ SD + S + +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 827 DSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSD 861
DS + S +G +S ++ S T+ GSD
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSD 431



Score = 33.2 bits (75), Expect = 0.005
Identities = 66/339 (19%), Positives = 125/339 (36%), Gaps = 2/339 (0%)

Query: 514 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 571
S + GS + + SD +G STS +G++S+ + S ++ +S + S
Sbjct: 508 YGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGST 567

Query: 572 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 631
+ SD + S + SDS+ + S ++ S + S + S +
Sbjct: 568 QTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTG 627

Query: 632 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
S S + +DS + S + +S + S + SD + S S + +DS
Sbjct: 628 YGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSS 687

Query: 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 751
+ S + +S + S + SD S S S + +DS + S +
Sbjct: 688 LIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTAS 747

Query: 752 SDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESD 811
S + S + S + S S + ++S + S + S + S
Sbjct: 748 YHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGST 807

Query: 812 SDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ SD + S S + +DS+ + GS + +S
Sbjct: 808 QTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNS 846



Score = 32.8 bits (74), Expect = 0.008
Identities = 74/372 (19%), Positives = 135/372 (36%), Gaps = 2/372 (0%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 564
G + +SD G S S + ++S +G ST + +S + S + SD
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 565 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 624
+ S + S S + S + S+ + S + S + S S + +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 625 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 684
DS + S + +S + S + SD + S S + +DS + S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 685 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
+ +S + S + SD S S S + +DS + S + S +
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 745 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDS 804
S + S + S S + +DS + S + S + S + SD
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 805 DSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDS 864
+ S S + +DS + S + +S +G S ++ +SD T+ GS + +
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSI--LTAGYGSTQTAQENSDLTTGYGSTSTA 874

Query: 865 DSDSNSDSESGS 876
DS+ + GS
Sbjct: 875 GYDSSLIAGYGS 886



Score = 32.8 bits (74), Expect = 0.008
Identities = 67/360 (18%), Positives = 131/360 (36%)

Query: 522 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 581
+G S +G S + ++S + S S + ++S+ + S ++ +S +
Sbjct: 506 AGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYG 565

Query: 582 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 641
S + SD + S + SDS+ + S ++ S + S + S
Sbjct: 566 STQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLT 625

Query: 642 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
+ S S + +DS + S + +S + S + SD + S S + +D
Sbjct: 626 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
S + S + +S + S + SD S S S + +DS + S
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 762 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSE 821
+ S + S + S + S S + +DS + S + S +
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYG 805

Query: 822 SDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVV 881
S + SD + S S + +DSS + ST G ++ + S + N+++
Sbjct: 806 STQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 865



Score = 31.6 bits (71), Expect = 0.018
Identities = 77/387 (19%), Positives = 138/387 (35%), Gaps = 14/387 (3%)

Query: 505 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSA--------------SDS 550
G + E SD G S + SDS +G ST + S+ S
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 551 DSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 610
+ S S + +DS+ + S + +S + S + SD + S S + +
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 611 DSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 670
DS+ + S + +S + S + SD S S S + +DS + S
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 671 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 730
+ S + S + S + S S + +DS + S + S +
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 731 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDS 790
S + SD + S S + +DS + S + +S + S ++ +SD
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 791 DSDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ S S + DS + S + +S + S + SD +G S S++
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 851 DSDSTSDTGSDNDSDSDSNSDSESGSN 877
+S + GS + S + GS+
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSS 951



Score = 31.3 bits (70), Expect = 0.026
Identities = 68/339 (20%), Positives = 125/339 (36%), Gaps = 2/339 (0%)

Query: 514 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 571
+S + GS + GSD +G ST +GSDS+ + S ++ S + S
Sbjct: 556 YNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGST 615

Query: 572 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 631
+ S + S S + +DS+ + S + +S + S + SD +
Sbjct: 616 QTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAG 675

Query: 632 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
S S + +DS + S + +S + S + SD S S S + +DS
Sbjct: 676 YGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSS 735

Query: 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 751
+ S + S + S + S + S S + +DS + S +
Sbjct: 736 LIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAG 795

Query: 752 SDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESD 811
S + S + SD + S S + ++S + S + +S + S
Sbjct: 796 YHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGST 855

Query: 812 SDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDS 850
+ +SD + S S + DS+ + GS + +S
Sbjct: 856 QTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNS 894



Score = 31.3 bits (70), Expect = 0.026
Identities = 67/360 (18%), Positives = 128/360 (35%)

Query: 522 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 581
+ +S +G S + S + S + SDS+ + S ++ S +
Sbjct: 554 ASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYG 613

Query: 582 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 641
S + S + S S + +DS+ + S + +S + S + SD
Sbjct: 614 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLT 673

Query: 642 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
+ S S + +DS + S + +S + S + SD S S S + +D
Sbjct: 674 AGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGAD 733

Query: 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
S + S + S + S + S + S S + +DS + S
Sbjct: 734 SSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQT 793

Query: 762 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSE 821
+ S + S + SD + S S + +DS + S + +S +
Sbjct: 794 AGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYG 853

Query: 822 SDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVV 881
S + +SD + S S + DSS + ST G ++ + S + N+++
Sbjct: 854 STQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 913


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006200IGASERPTASE310.016 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.016
Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 4/162 (2%)

Query: 261 ALKLKADTEAAKNDVSKRSKRSLNTQNNKST-TQEISEEQKAEYQRKSEALKERFINRQK 319
+KA+T+ + S + T K T T E E+ K E ++ E K
Sbjct: 1073 KSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT-SQVSP 1131

Query: 320 SKNESVVSLIDDEDDNENDRQLVVSAPSKKPTTPTTYTETTTQVPMPTVERQTQQQIVYK 379
+ +S E END + + P + T + + + T+ V
Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNT 1191

Query: 380 TPKPLAGLNGESHDFTTTHQSPTTSNHTHNNVVEFEETSALP 421
+ N E+ TT + + + ++P
Sbjct: 1192 GNSVVE--NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1006240PF05704280.035 Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 27.5 bits (61), Expect = 0.035
Identities = 13/69 (18%), Positives = 24/69 (34%), Gaps = 7/69 (10%)

Query: 116 EWVKKNYENTNHRYLVTLNLNSK-------KFTYCTKIIYQAYKFGVSEKSVKSYGLHII 168
W + Y N + +++ N + + YK + +Y HI
Sbjct: 239 YWKEIPYVNNVNPHMLQYLGNLPYDNSMFNYIKSTSPVQKLTYKLDYNNLKRNTYYDHIF 298

Query: 169 SPYAIKDNF 177
S +KDN+
Sbjct: 299 SIDKLKDNY 307


30SAI8T7_1007990SAI8T7_1008060N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1007990414-1.048854Phosphopantetheine adenylyltransferase
SAI8T7_1008000512-1.600947UPF0348 protein NWMN_0989
SAI8T7_1008010412-1.702031Putative uncharacterized protein
SAI8T7_1008020413-1.754714Iron-regulated surface determinant protein B
SAI8T7_1008030213-2.757987Iron-regulated surface determinant protein A
SAI8T7_1008040-114-2.668950Iron-regulated surface determinant protein C
SAI8T7_1008050-213-2.067117Putative uncharacterized protein
SAI8T7_1008060-112-0.398815High-affinity heme uptake system protein isdE
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1007990LPSBIOSNTHSS2191e-76 Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 219 bits (560), Expect = 1e-76
Identities = 77/155 (49%), Positives = 112/155 (72%)

Query: 5 IAVIPGSFDPITYGHLDIIERSTDRFDEIHVCVLKNSKKEGTFSLEERMDLIEQSVKHLP 64
A+ PGSFDPIT+GHLDIIER FD+++V VL+N K+ FS++ER++ I +++ HLP
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVKVHQFSGLLVDYCEQVGAKTIIRGLRAVSDFEYELRLTSMNKKLNNEIETLYMMSSTN 124
N +V F GL V+Y Q A I+RGLR +SDFE EL++ + NK L +++ET+++ +ST
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 125 YSFISSSIVKEVAAYRADISEFVPPYVEKALKKKF 159
YSF+SSS+VKEVA + ++ FVP +V AL +F
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQF 156


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008020IGASERPTASE366e-04 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 6e-04
Identities = 37/194 (19%), Positives = 71/194 (36%), Gaps = 15/194 (7%)

Query: 447 RIVDKEAFTKANTDKSNKKEQQDNSAKKEA---------TPATPSKPTPSPVEKESQKQD 497
+ VD T N +++ N+ + PATPS+ T + E Q+
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 498 SQKDDNKQLPSVEKENDASSESGKDKTPATKPT------KGEVESSSTTPTKVVSTTQNV 551
+ + + + +N ++ K A T E + + TT TK +T +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 552 AKPTTASSKTTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLP 611
K + KT + TS S + + S +Q + T + +Q N
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 612 QTGEESNKDMTLPL 625
Q +E++ ++ P+
Sbjct: 1170 QPAKETSSNVEQPV 1183



Score = 30.0 bits (67), Expect = 0.035
Identities = 27/156 (17%), Positives = 45/156 (28%), Gaps = 5/156 (3%)

Query: 37 EAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPET-KPVANAVSVSNKEVEAPTSETKE 95
A EE TE + V S + ++ ET +P A ++ V +++
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 96 AKEVKEVKAPKETKAVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEM 155
+ KET + + T N + NP + P
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE----NPENTTPATTQPTVNSESSNK 1218

Query: 156 KKENGEQQFYHYASSVKPARVIFTDSKPEIELGLQS 191
K + +V+PA D L S
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008030IGASERPTASE340.001 IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.001
Identities = 27/132 (20%), Positives = 44/132 (33%), Gaps = 4/132 (3%)

Query: 184 ADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVSTDTT 243
+ A+ + P PA P TE + K + + +V +
Sbjct: 1015 EEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKS 1074

Query: 244 KDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNET 303
+ TQT + AQ+ E + QT + V K+Q+ KVT +
Sbjct: 1075 NVKANTQTN---EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS-QVS 1130

Query: 304 PKQASKAKELPK 315
PKQ P+
Sbjct: 1131 PKQEQSETVQPQ 1142


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008060FERRIBNDNGPP452e-07 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 44.6 bits (105), Expect = 2e-07
Identities = 34/209 (16%), Positives = 79/209 (37%), Gaps = 11/209 (5%)

Query: 55 PNRYKDVPEIGQPMEPNVEAVKKLKPTHVLSVSTIKDEMQPFYKQLNMKGYFYDFDS--L 112
P V ++G EPN+E + ++KP+ ++ + + + +G+ + L
Sbjct: 72 PPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPL 131

Query: 113 KGMQKSITQLGDQFNRKAQAKELNDHLNSVKQKIENKAAKQKKHPKVLILMGVPGSYLVA 172
+KS+T++ D N ++ A+ + ++ + K+ P +L + P LV
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 173 TDKSYIGDLVKIAGGENVIKVKDRQYISSNT---ENLLNINPDIILRLPHGMPEEVKKMF 229
S +++ G N + + + S + L +L H +++ +
Sbjct: 192 GPNSLFQEILDEYGIPNAWQ-GETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDAL- 249

Query: 230 QKEFKQNDIWKHFKAVKNNHVYDLEEVPF 258
+W+ V+ + V F
Sbjct: 250 ----MATPLWQAMPFVRAGRFQRVPAVWF 274


31SAI8T7_1008260SAI8T7_1008310N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1008260013-1.494506Alpha-hemolysin
SAI8T7_1008270111-0.822916Superantigen-like protein
SAI8T7_1008280111-0.710644SA1010 protein
SAI8T7_1008290111-0.559013Putative uncharacterized protein
SAI8T7_1008300011-0.487684Ornithine carbamoyltransferase
SAI8T7_1008310014-0.204395Carbamate kinase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008260BICOMPNTOXIN314e-109 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 314 bits (805), Expect = e-109
Identities = 72/318 (22%), Positives = 144/318 (45%), Gaps = 24/318 (7%)

Query: 12 VTTTLLLGSILMNPVANAADSDINIKTGTTDIGSNTTVKTGDLVTYDKEN--GMHKKVFY 69
+TTTL + L+ P+AN + T DIG + ++ N G+ + + +
Sbjct: 7 LTTTLSVS--LLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQF 64

Query: 70 SFIDDKNHNKKLLVIRTKGTIAGQYRVYSEEGANKS-GLAWPSAFKVQLQLPDNEVAQIS 128
F+ DK +NK L+++ +G I+ + Y+ + N + WP + + L+ D V+ I
Sbjct: 65 DFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLI- 123

Query: 129 DYYPRNSIDTKEYMSTLTYGFNGNVTGDDTGKIGGLIGANVSIGHTLKYVQPDFKTILES 188
+Y P+N I++ TL Y GN + +GG N S ++ Y Q ++ + +E
Sbjct: 124 NYLPKNKIESTNVSQTLGYNIGGNFQSAPS--LGGNGSFNYS--KSISYTQQNYVSEVEQ 179

Query: 189 PTDKKVGWKVIFNNMVNQNWGPYDRDSWNPVYGNQLFMKTRNGSMKAAENFLDPNKASSL 248
K V W V N+ ++ + + LF+ + S + F+ ++ L
Sbjct: 180 QNSKSVLWGVKANSFATESG-------QKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPL 232

Query: 249 LSSGFSPDFATVITMDRKASKQQTNIDVIYERVRD-----DYQLHWTSTNWKGTNTKDKW 303
+ SGF+P F ++ + K S + ++ Y R D H+ ++ G + +
Sbjct: 233 VQSGFNPSFIATVSHE-KGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAF 291

Query: 304 TDRS-SERYKIDWEKEEM 320
+R+ + +Y+++W+ E+
Sbjct: 292 VNRNYTVKYEVNWKTHEI 309


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008270TOXICSSTOXIN486e-09 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 48.5 bits (115), Expect = 6e-09
Identities = 53/223 (23%), Positives = 89/223 (39%), Gaps = 12/223 (5%)

Query: 34 MSKNITKNIILTTTLLLLGTVLPQNQKPVFSFYSEAKAYSIGQDETNINELIKYYTQPHF 93
M+K + N + + LLL T P+ S A + D NI +L+ +Y+
Sbjct: 1 MNKKLLMNFFIVSPLLLATTATDFTPVPLSSNQIIKTAKASTND--NIKDLLDWYSSGSD 58

Query: 94 SFSNKWLYQYDNENIYVELKRYSWSAHISLWGAESWGNINQLKDRYVDVFGLKD-KDTDQ 152
+F+N DN + +K S + ++ + + + K VD+ + K
Sbjct: 59 TFTN--SEVLDNSLGSMRIKNTDGSISLIIFPSP-YYSPAFTKGEKVDLNTKRTKKSQHT 115

Query: 153 LWWSYRETFTGGVTPAAK-PSDKTYNLFVQYKDKLQTIIGAHKIYQGNKPVLTLKEIDFR 211
+Y GVT K P+ L V+ K + K +K L + +DF
Sbjct: 116 SEGTYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGPKF---DKKQLAISTLDFE 172

Query: 212 AREALIKNKILY-TENRNKGKLKIT-GGGNNYTIDLSKRLHSD 252
R L + LY + ++ G KIT G+ Y DLSK+ +
Sbjct: 173 IRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYN 215


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008280TOXICSSTOXIN577e-12 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 56.6 bits (136), Expect = 7e-12
Identities = 55/228 (24%), Positives = 91/228 (39%), Gaps = 15/228 (6%)

Query: 16 LLLGTAFTQFPNTPINSSSEAKAYYINQNETNVNELTKYYSQKYLTFSNSTLWQKDNGTI 75
LLL T T F P++S+ K + N+ N+ +L +YS TF+NS + G++
Sbjct: 15 LLLATTATDFTPVPLSSNQIIKTAKASTND-NIKDLLDWYSSGSDTFTNSEVLDNSLGSM 73

Query: 76 HATLLQFSWYSHIQVYGPESWGNINQLRNKSVDIFGI---KDQETIDSFALSQETFTGGV 132
++ + S + P + + + + VD+ K Q T + + + GV
Sbjct: 74 R---IKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQI--SGV 128

Query: 133 TPA-ATSNDKHYKLNVTYKDKAETFTGGFPVYEGNKPVLTLKELDFRIRQTLIKSKKLYN 191
T L V K + + + +K L + LDF IR L + LY
Sbjct: 129 TNTEKLPTPIELPLKV--KVHGKDSPLKYG-PKFDKKQLAISTLDFEIRHQLTQIHGLYR 185

Query: 192 NSYNKGQI-KITGTDNN-YTIDLSKRLPSTDANRYVKKPQNAKIEVIL 237
+S G KIT D + Y DLSK+ + + IE +
Sbjct: 186 SSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008290TOXICSSTOXIN621e-13 Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 61.6 bits (149), Expect = 1e-13
Identities = 62/222 (27%), Positives = 96/222 (43%), Gaps = 17/222 (7%)

Query: 2 KKNIMNKLVLSTALLLLGTTSTQLPKTPISFSSEAKAYNISENETNINELIKYYTQPHFS 61
KK +MN ++S LLL TT+T P+S + K S N+ NI +L+ +Y+ +
Sbjct: 3 KKLLMNFFIVSP--LLLATTATDFTPVPLSSNQIIKTAKASTND-NIKDLLDWYSSGSDT 59

Query: 62 LSGKWLWQKPNGSIHATLQTWVWYSHIQVFGSESWGNINQLRNKYVDIFGT---KDEDTV 118
+ + GS+ ++ + +F S + + + + VD+ K + T
Sbjct: 60 FTNSEVLDNSLGSMR--IKNTDGSISLIIFPSP-YYSPAFTKGEKVDLNTKRTKKSQHTS 116

Query: 119 EGYWTYDETFTGGVTPA-ATSSDKPYRLFLKYSDKQQTIIGGHEFYKGNKPVLTLKELDF 177
EG TY GVT + L +K K + G +F +K L + LDF
Sbjct: 117 EG--TYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGPKF---DKKQLAISTLDF 171

Query: 178 RIRQTLIKNKKLYNGEFNKGQI-KIT-ADGNNYTIDLSKKLK 217
IR L + LY G KIT DG+ Y DLSKK +
Sbjct: 172 EIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFE 213


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1008310CARBMTKINASE389e-139 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 389 bits (1001), Expect = e-139
Identities = 145/314 (46%), Positives = 211/314 (67%), Gaps = 7/314 (2%)

Query: 1 MMAKIVVALGGNALGK-----SPQEQLELVKNTAKSLVGLITKGHEIVISHGNGPQVGSI 55
M ++V+ALGGNAL + S +E ++ V+ TA+ + +I +G+E+VI+HGNGPQVGS+
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 56 NLGLNYAAEHNQGPAFPFAECGAMSQAYIGYQLQESLQNELHSIGMDKQVVTLVTQVEVD 115
L ++ PA P GAMSQ +IGY +Q++L+NEL GM+K+VVT++TQ VD
Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120

Query: 116 ENDPAFNNPSKPIGLFYNKEEAEQIQKEKGFIFVEDAGRGYRRVVPSPQPISIIELESIK 175
+NDPAF NP+KP+G FY++E A+++ +EKG+I ED+GRG+RRVVPSP P +E E+IK
Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIK 180

Query: 176 TLIKNDTLVIAAGGGGIPVIREQHDGFKGIDAVIDKDKTSALLGANIQCDQLIILTAIDY 235
L++ +VIA+GGGG+PVI E KG++AVIDKD L + D +ILT ++
Sbjct: 181 KLVERGVIVIASGGGGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239

Query: 236 VYINFNTENQQPLKTTNVDELKRYIDENQFAKGSMLPKIEAAISFIENNPKGSVLITSLN 295
+ + TE +Q L+ V+EL++Y +E F GSM PK+ AAI FIE + ++ I L
Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAI-IAHLE 298

Query: 296 ELDAALEGKVGTVI 309
+ ALEGK GT +
Sbjct: 299 KAVEALEGKTGTQV 312


32SAI8T7_1011200SAI8T7_1011290N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1011200013-3.450553Similar to DNA transport mechinery protein
SAI8T7_1011210010-2.031559Similar to late competence protein comGA
SAI8T7_101122009-1.725568Similar to metallo-beta-lactamase superfamily
SAI8T7_101123009-1.779948Glucokinase
SAI8T7_1011240-110-2.387884Putative uncharacterized protein
SAI8T7_1011250-210-1.1619705-formyltetrahydrofolate cyclo-ligase
SAI8T7_1011260-210-0.917319Penicillin-binding protein 3
SAI8T7_1011270-111-1.515539Superoxide dismutase [Mn/Fe] 1
SAI8T7_1011280-211-1.834325ABC-3 protein
SAI8T7_1011290-212-1.488081ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011200BCTERIALGSPF812e-19 Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 81.4 bits (201), Expect = 2e-19
Identities = 51/265 (19%), Positives = 109/265 (41%), Gaps = 3/265 (1%)

Query: 43 ERFGNIIDVLEETVNYMKVNRKSEQRLLKTLQYPLILVSIFIAMIIILNLTVIPQFQQLY 102
E G++ VL +Y + ++ R+ + + YP +L + IA++ IL V+P+ + +
Sbjct: 143 ETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQF 202

Query: 103 TSMNIQLSSFQKTLSFFITSLPTIIVVMLIIVSMLAIIMKLIYNNLNMLNKIN-FVMKLP 161
M L + L ++ T ML+ + + +++ + ++ LP
Sbjct: 203 IHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVSFHRRLLHLP 262

Query: 162 LISGYFQLFKTYFVTNELVLFYKNGITLQSIVDVYINHSS-DPFRQFLGKYLLTYSEMGY 220
LI + T L + + + L + + + S D R L E G
Sbjct: 263 LIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMSNDYARHRLSLATDAVRE-GV 321

Query: 221 GLPQILEKLKCFKPQLIKFVLQGEKRGKLEVELKLYSQILVKQIEDKAIKQTQFLQPILF 280
L + LE+ F P + + GE+ G+L+ L+ + ++ + +P+L
Sbjct: 322 SLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQDREFSSQMTLALGLFEPLLV 381

Query: 281 LILGLFIVAIYLVIMLPMFQMMQSI 305
+ + ++ I L I+ P+ Q+ +
Sbjct: 382 VSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011220SHIGARICIN270.039 Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 27.5 bits (61), Expect = 0.039
Identities = 20/99 (20%), Positives = 38/99 (38%), Gaps = 11/99 (11%)

Query: 82 DFLKDPVKNGADKFKQYGLPIITSKVTPEK-------LNEGSTEIE-GFKFNVLHTPGHS 133
F+ + K + K Y +P++ S + + N I ++ G+
Sbjct: 39 VFISNLRKALPYERKLYDIPLLRSTLPGSQRYALIHLTNYADETISVAIDVTNVYVMGYR 98

Query: 134 PGSLTYVFDEFAVVG--DTLFNNGIGRTDL-YKGDYETL 169
G +Y F+E + +F + + L Y G+YE L
Sbjct: 99 AGDTSYFFNEASATEAAKYVFKDAKRKVTLPYSGNYERL 137


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011230PF03309300.012 Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 29.7 bits (67), Expect = 0.012
Identities = 32/154 (20%), Positives = 51/154 (33%), Gaps = 37/154 (24%)

Query: 5 ILAADVGGTTCKLGIFTPELEQ---LHKWSIHTD---TSDSTGYTLLKGIYDSFVEKVNE 58
+LA DV T +G+ + + + +W I T+ T+D + G+
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELA-LTIDGLI--------- 51

Query: 59 NNYNFSNVLGVGIG--VPGPVDFEKGTVNGAVNLYWPE------KVNVREIFEQFVDCPV 110
+ + G VP V E V + YWP + VR VD P
Sbjct: 52 -GDDAERLTGASGLSTVP-SVLHE---VRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPK 106

Query: 111 YVDND--ANIAALGEKHKGAGEGADDVVAITLGT 142
V D N A K+ + + G+
Sbjct: 107 EVGADRIVNCLAAYHKYGT------AAIVVDFGS 134


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011240TCRTETA330.002 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 29/170 (17%), Positives = 54/170 (31%), Gaps = 51/170 (30%)

Query: 241 MLTVYFIAGLFGN--------FVSLSFNTTTISVGASGAIFGLIGSIFAMMY---VSKTF 289
++ V+FI L G F F+ ++G S A FG++ S+ M V+
Sbjct: 215 LMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARL 274

Query: 290 NKK----------MLGQLLIA-----------LVILVGVSLFMS------NINIVAHIGG 322
++ G +L+A +V+L + M + + G
Sbjct: 275 GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334

Query: 323 FIGGLLITL-----------IGYYYKVNRNIF--WILLIGMLVIFIALQI 359
+ G L L Y + + W + G + + L
Sbjct: 335 QLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPA 384


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1011290PF05272330.001 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 33.1 bits (75), Expect = 0.001
Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 39 LAIVGPNGAGKSTLLKLILGLLPLQSGEIFVEG-IDFKNKKTS 80
+ + G G GKSTL+ ++GL + F + D K S
Sbjct: 599 VVLEGTGGIGKSTLINTLVGL------DFFSDTHFDIGTGKDS 635


33SAI8T7_1013280SAI8T7_1013430N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1013280110-2.442092Trypsin
SAI8T7_1013290111-2.749488Serine protease splC
SAI8T7_1013300210-2.682918Glutamyl endopeptidase
SAI8T7_1013310312-3.902729Serine protease splA
SAI8T7_1013320413-4.308101Putative Probable beta-lactamase
SAI8T7_1013330414-4.573290Putative uncharacterized protein
SAI8T7_1013340516-4.607500Leukotoxin, LukD
SAI8T7_1013350718-5.810955Leukotoxin S-subunit
SAI8T7_1013360920-7.177831Putative uncharacterized protein
SAI8T7_10133701020-6.561952Putative uncharacterized protein
SAI8T7_1013380819-7.051920Protein of hypothetical function DUF1828
SAI8T7_1013390719-6.235395Enterotoxin type G
SAI8T7_1013400415-4.069391Enterotoxin SeN
SAI8T7_1013410212-2.749384Extracellular enterotoxin type I
SAI8T7_1013420112-1.335346Enterotoxin
SAI8T7_1013430112-0.894221Enterotoxin
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013280V8PROTEASE1122e-31 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 112 bits (280), Expect = 2e-31
Identities = 58/227 (25%), Positives = 100/227 (44%), Gaps = 26/227 (11%)

Query: 38 IQQTAKA-----ENSVKLITNTNVAPYSGVTWMGA--------GTGFVVGNHTIITNKHV 84
++Q A N IT+T Y+ VT++ +G VVG T++TNKHV
Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120

Query: 85 TYHM-KVGDEIKAHPNGFY--NNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFK 141
+KA P+ N G + +I Y G+ D+A+V+ + +
Sbjct: 121 VDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSP---NEQNKHIG 177

Query: 142 DFTSKFNIA--SEAKENEPISVIGYPNPNGNKLQMYESTGKVLSVNGNIVTSDAVVQPGS 199
+ ++ +E + N+ I+V GYP + M+ES GK+ + G + D G+
Sbjct: 178 EVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGN 236

Query: 200 SGSPILNSKREAIGVMYASDKPTGESTRSFAVYFSPEIKKFIADNLD 246
SGSP+ N K E IG+ + AV+ + ++ F+ N++
Sbjct: 237 SGSPVFNEKNEVIGIHWGGVPNEFNG----AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013290V8PROTEASE1771e-56 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 177 bits (451), Expect = 1e-56
Identities = 64/217 (29%), Positives = 106/217 (48%), Gaps = 23/217 (10%)

Query: 37 EKNVTQVKDTNNFPYNGVVSFK--------NATGFVIGKNTIITNKHV-SKDYKVGDRIT 87
+ Q+ DT N Y V + A+G V+GK+T++TNKHV + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHP---NGDKGNGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAK 144
A P N D G + + I+ Y G+ D++++ + + E V+ +
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK-----HIGEVVKPATMSN 187

Query: 145 DA--KVDDKIKVIGYPLPAQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNN 202
+A +V+ I V GYP +ES G I +K + +D GNSGSPV N N
Sbjct: 188 NAETQVNQNITVTGYPGDK-PVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKN 246

Query: 203 EVIGVVYGGIGKIGSEYNGAVYFTPQIKDFIQKHIEQ 239
EVIG+ +GG + +E+NGAV+ +++F++++IE
Sbjct: 247 EVIGIHWGG---VPNEFNGAVFINENVRNFLKQNIED 280


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013300V8PROTEASE1824e-58 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 182 bits (462), Expect = 4e-58
Identities = 65/230 (28%), Positives = 108/230 (46%), Gaps = 29/230 (12%)

Query: 43 EVQQTAKA-----ENNVTKIQDTNIFPYTGVVAFKS--------ATGFVVGKNTILTNKH 89
++Q A N+ +I DT Y V + A+G VVGK+T+LTNKH
Sbjct: 60 PLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKH 119

Query: 90 V-SKNYKVGDRITAHP---NSDKGNGGIYSIKKIINYPGKEDVSVIQVEERAIERGPKGF 145
V + + A P N D G ++ ++I Y G+ D+++++ +
Sbjct: 120 VVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK----- 174

Query: 146 NFNDNVTPFKYAAGA--KAGERIKVIGYPHPYKNKYVLYESTGPVMSVEGSSIVYSAHTE 203
+ + V P + A + + I V GYP K ++ES G + ++G ++ Y T
Sbjct: 175 HIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMWESKGKITYLKGEAMQYDLSTT 233

Query: 204 SGNSGSPVLNSNNELVGIHFASDVKNDDNRNAYGVYFTPEIKKFIAENID 253
GNSGSPV N NE++GIH+ V N+ N V+ ++ F+ +NI+
Sbjct: 234 GGNSGSPVFNEKNEVIGIHWGG-VPNEFNG---AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013310V8PROTEASE1381e-41 V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 138 bits (349), Expect = 1e-41
Identities = 63/212 (29%), Positives = 100/212 (47%), Gaps = 18/212 (8%)

Query: 36 EKNVKEITDATKAPYNSVVAFA--------GGTGVVVGKNTIVTNKHIAKSNDIFKNRVA 87
+ +ITD T Y V +GVVVGK+T++TNKH+ + + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHYS---SKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFAEGA-- 142
A S G + + I +Y G+ DLAIV + + + + V + A
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQNKHIGEVVKPATMSNNAET 191

Query: 143 KAKDRISVIGYPKGAQTKYKMFESTGTINHISGTFIEFDAYAQPCNSGSPVLNSKHELIG 202
+ I+V GYP G + M+ES G I ++ G +++D NSGSPV N K+E+IG
Sbjct: 192 QVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIG 250

Query: 203 ILYAGSGKDESEKNFGVYFTPQLKEFIQNNIE 234
I + G +E N V+ ++ F++ NIE
Sbjct: 251 IHWGGVP---NEFNGAVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013340BICOMPNTOXIN396e-141 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 396 bits (1020), Expect = e-141
Identities = 96/329 (29%), Positives = 177/329 (53%), Gaps = 24/329 (7%)

Query: 1 MKMKKLVKSSVASSIALLLLSNTVDAAQHITPVSEKKVDDKITLYKTTATSDNDKLNISQ 60
M K++ ++++ S+ L + ++ A+ + I + K T ++K ++Q
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 ILTFNFIKDKSYDKDTLVLKAAGNINSGYKKPNPKDYNYSQ-FYWGGKYNVSVSSESNDA 119
+ F+F+KDK Y+KD L+LK G I+S N K N+ + W +YN+ + + +
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTN-DKY 119

Query: 120 VNVVDYAPKNQNEEFQVQQTLGYSYGGDINISNGLSGGLNGSKSFSETINYKQESYRTTI 179
V++++Y PKN+ E V QTLGY+ GG+ + L G NGS ++S++I+Y Q++Y + +
Sbjct: 120 VSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGG--NGSFNYSKSISYTQQNYVSEV 177

Query: 180 DRKTNHKSIGWGVEAHKIMNNGWGPYGRDSYDPTYGNELFLGGRQSSSNAGQNFLPTHQM 239
+++ N KS+ WGV+A+ + ++LF+G + S + F+P ++
Sbjct: 178 EQQ-NSKSVLWGVKANSFAT-------ESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSEL 229

Query: 240 PLLARGNFNPEFISVLSHKQNDTKKSKIKVTYQREMD---------RYTNQWNRLHWIGN 290
P L + FNP FI+ +SH++ + S+ ++TY R MD Y N + H + N
Sbjct: 230 PPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHN 289

Query: 291 NYKNQNTVTFTSTYEVDWQNHTVKLIGTD 319
+ N+N +T YEV+W+ H +K+ G +
Sbjct: 290 AFVNRN---YTVKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013350BICOMPNTOXIN433e-156 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 433 bits (1116), Expect = e-156
Identities = 214/318 (67%), Positives = 256/318 (80%), Gaps = 10/318 (3%)

Query: 1 MFKKKMLAATLSVGLIAPLASPIQE-SRANTNIENIGDGA--EVIKRTEDVSSKKWGVTQ 57
M K K+L TLSV L+APLA+P+ E ++A + E+IG G+ E+IKRTED +S KWGVTQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 58 NVQFDFVKDKKYNKDALIVKMQGFINSRTSFSDVKGSGYELTKRMIWPFQYNIGLTTKDP 117
N+QFDFVKDKKYNKDALI+KMQGFI+SRT++ + K + + K M WPFQYNIGL T D
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNH--VKAMRWPFQYNIGLKTNDK 118

Query: 118 NVSLINYLPKNKIETTDVGQTLGYNIGGNFQSAPSIGGNGSFNYSKTISYTQKSYVSEVD 177
VSLINYLPKNKIE+T+V QTLGYNIGGNFQSAPS+GGNGSFNYSK+ISYTQ++YVSEV+
Sbjct: 119 YVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVE 178

Query: 178 KQNSKSVKWGVKANEFVTPDGKKSAHDRYLFVQSPNGPTGSAREYFAPDNQLPPLVQSGF 237
+QNSKSV WGVKAN F T G+KSA D LFV + R+YF PD++LPPLVQSGF
Sbjct: 179 QQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPH-SKDPRDYFVPDSELPPLVQSGF 237

Query: 238 NPSFITTLSHEKGSSDTSEFEISYGRNLDITYA----TLFPRTGIYAERKHNAFVNRNFV 293
NPSFI T+SHEKGSSDTSEFEI+YGRN+D+T+A T + + + R HNAFVNRN+
Sbjct: 238 NPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYT 297

Query: 294 VRYEVNWKTHEIKVKGHN 311
V+YEVNWKTHEIKVKG N
Sbjct: 298 VKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013390BACTRLTOXIN1954e-64 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 195 bits (497), Expect = 4e-64
Identities = 109/261 (41%), Positives = 155/261 (59%), Gaps = 11/261 (4%)

Query: 4 LSTVIIILILEIVFHNMN-YVNAQPDPKLDELNKVSDYKNNKGTMGNVMNLYTSPPVEGR 62
+S VI+I L +V N +QPDP D+L+K S++ GTMGN+ LY V
Sbjct: 7 ISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFT---GTMGNMKYLYDDHYVSAT 63

Query: 63 GVINSRQFLSHDLIFPI---EYKSYNEVKTELENTELANNYKDKKVDIFGVPYFYTCIIP 119
V + +FL+HDLI+ I + K+Y++VKTEL N +LA YKD+ VD++G Y+ C
Sbjct: 64 KVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123

Query: 120 KSEPDINQNFGGCCMYGGLTF---NSSENERDKLITVQVTIDNRQSLGFTITTNKNMVTI 176
+ G CMYGG+T N +N + + V+V + R ++ F + T+K VT
Sbjct: 124 SKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTA 183

Query: 177 QELDYKARHWLTKEKKLYEFDGSAFESGYIKFTEKNNTSFWFDLFPKKELVPFVPYKFLN 236
QELD KAR++L +K LYEF+ S +E+GYIKF E N +FW+D+ P F K+L
Sbjct: 184 QELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDK-FDQSKYLM 242

Query: 237 IYGDNKVVDSKSIKMEVFLNT 257
+Y DNK VDSKS+K+EV L T
Sbjct: 243 MYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013400BACTRLTOXIN1559e-49 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 155 bits (394), Expect = 9e-49
Identities = 76/265 (28%), Positives = 124/265 (46%), Gaps = 21/265 (7%)

Query: 2 RLFYIAAIII-TLLCLINNNYVNAEV----DKKDLKKKSDLDSSKLFNLTSYYTDITWQL 56
RLF I+I L+ +I+ V AE DL K S+ + + N+ Y D +
Sbjct: 4 RLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEF-TGTMGNMKYLYDDH--YV 60

Query: 57 DESNKISTDQLLNNTIILKNIDISVLKTSSLKVEFNSSDLANQFKGKNIDIYGLYFGNKC 116
+ S D+ L + +I D + +K E + DLA ++K + +D+YG + C
Sbjct: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC 120

Query: 117 -------VGLTEEKTSCLYGGVTIHDGNQLDEEKV--IGVNVFKDGVQQEGFVIKTKKAK 167
VG +C+YGG+T H+GN D + + V V+++ F ++T K
Sbjct: 121 YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS 180

Query: 168 VTVQELDTKVRFKLENLYKIYNKDTGNIQKGCIFFHSHNHQDQSFYYDLYNVKGSVG--A 225
VT QELD K R L N +Y ++ + G I F +N +F+YD+ G +
Sbjct: 181 VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENN--GNTFWYDMMPAPGDKFDQS 238

Query: 226 EFFQFYSDNRTVSSSNYHIDVFLYK 250
++ Y+DN+TV S + I+V L
Sbjct: 239 KYLMMYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013410BACTRLTOXIN1082e-30 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 108 bits (270), Expect = 2e-30
Identities = 54/227 (23%), Positives = 98/227 (43%), Gaps = 37/227 (16%)

Query: 30 VGNLRNFYTKHDYIDLKGVTDKNLPIANQLEFS------TGTNDLISESNNWDEISKFKG 83
+GN++ Y H K + +A+ L ++ + + +E N D K+K
Sbjct: 48 MGNMKYLYDDHYVSATKVKSVDKF-LAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKD 106

Query: 84 KKLDIFGIDY-------------NGPCKSKYMYGGATL-SGQYLNSARKIPINLWVNGKH 129
+ +D++G +Y MYGG T G + ++ + + V
Sbjct: 107 EVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENK 166

Query: 130 KTISTDKIATNKKLVTAQEIDVKLRRYLQEEYNIYGHNNTGKGKEYGYKSKFYSGFNNGK 189
+ + ++ T+KK VTAQE+D+K R +L + N+Y N+ + G
Sbjct: 167 RNTISFEVQTDKKSVTAQELDIKARNFLINKKNLY-EFNSSP-------------YETGY 212

Query: 190 VLFHLNNEKSFSYDLF-YTGDGLPVS-FLKIYEDNKIIESEKFHLDV 234
+ F NN +F YD+ GD S +L +Y DNK ++S+ ++V
Sbjct: 213 IKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEV 259


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013420BACTRLTOXIN1232e-36 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 123 bits (310), Expect = 2e-36
Identities = 64/231 (27%), Positives = 111/231 (48%), Gaps = 36/231 (15%)

Query: 28 NLRNYYGSYPIEDHQSINPENNHLSHQLVFSMDNST------VTAEFKNVDDVKKFKNHA 81
N++ Y + + + + L+H L++++ + V E N D KK+K+
Sbjct: 50 NMKYLYDDHYVS-ATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEV 108

Query: 82 VDVYGLSYSGYCLKNKY------------IYGGVTLA-GDYLEKSRRIPINLWVNGEHQT 128
VDVYG +Y C + +YGG+T G++ + + + V +
Sbjct: 109 VDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRN 168

Query: 129 ISTDKVSTNKKLVTAQEIDTKLRRYLQEEYNIYGFNDTNKGRNYGNKSKFSSGFNAGKIL 188
+ +V T+KK VTAQE+D K R +L + N+Y FN SS + G I
Sbjct: 169 TISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFN--------------SSPYETGYIK 214

Query: 189 FHLNDGSSFSYDLFDT-GTGQAES-FLKIYNDNKTVETEKFHLDVEISYKD 237
F N+G++F YD+ G +S +L +YNDNKTV+++ ++V ++ K+
Sbjct: 215 FIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTKN 265


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013430BACTRLTOXIN1701e-54 Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 170 bits (433), Expect = 1e-54
Identities = 91/262 (34%), Positives = 136/262 (51%), Gaps = 20/262 (7%)

Query: 5 LLLILNLIAICSVNNAYANEE-DPKIESLCKKSSVDPIALHNINDDYINNRFTTVKSIVS 63
++LI LI + S N A + DP + L K S + N+ Y ++ + K V
Sbjct: 10 VILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTG-TMGNMKYLYDDHYVSATK--VK 66

Query: 64 TTEKFLDFDLLFKSINWLDGISAEFKDLKVEFSSSAISKEFLGKTVDIYGVYYKAHCH-- 121
+ +KFL DL++ D + +K E + ++K++ + VD+YG Y +C+
Sbjct: 67 SVDKFLAHDLIYNI---SDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123

Query: 122 -----GEHQVDTACTYGGVTPHENNKLSEP--KNIGVAVYKDNVNVNTFIVTTDKKKVTA 174
G+ C YGG+T HE N +N+ V VY++ N +F V TDKK VTA
Sbjct: 124 SKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTA 183

Query: 175 QELDIKVRTKLNNAYKLYDRMTSDVQKGYIKFHSHSEHKESFYYDLFYIKGNLPDQ--YL 232
QELDIK R L N LY+ +S + GYIKF ++ +F+YD+ G+ DQ YL
Sbjct: 184 QELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNG--NTFWYDMMPAPGDKFDQSKYL 241

Query: 233 QIYNDNKTIDSSDYHIDVYLFT 254
+YNDNKT+DS I+V+L T
Sbjct: 242 MMYNDNKTVDSKSVKIEVHLTT 263


34SAI8T7_1013610SAI8T7_1013680N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_101361039-2.756393Cmp-binding-factor=1
SAI8T7_1013620210-3.101219Putative uncharacterized protein
SAI8T7_1013630-113-2.197925Probable phosphoesterase
SAI8T7_1013640-211-1.496601Hypothetical protein
SAI8T7_1013650-29-0.840633UPF0754 membrane protein SaurJH1_1933
SAI8T7_1013660-29-0.314845Putative uncharacterized protein
SAI8T7_1013670-29-0.334651Two-component response regulator homolog
SAI8T7_1013680-290.667096Sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013610SSPANPROTEIN290.034 Salmonella invasion protein InvJ signature.
		>SSPANPROTEIN#Salmonella invasion protein InvJ signature.

Length = 336

Score = 28.6 bits (63), Expect = 0.034
Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 128 PAASSHHHNFASGLSYHVLTMLRIAKSICDI 158
PA S HH+ SGL ++ + LRIA+ + +
Sbjct: 72 PAKSEHHNGNVSGLHHNGKSELRIAEKLLKV 102


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013620cloacin375e-04 Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 36.6 bits (84), Expect = 5e-04
Identities = 40/175 (22%), Positives = 68/175 (38%), Gaps = 36/175 (20%)

Query: 145 KQKEVALHDHSQEWKSLEQQLNIEPITFPEKGVDR-YEKARAHKQSLERDIGLRNERLAQ 203
KQ++ + QEW + T P + +R YE+ARA D+ ER A+
Sbjct: 297 KQRQDEENRRQQEWDA----------THPVEAAERNYERARAELNQANEDVARNQERQAK 346

Query: 204 LKEEATQLEPVKQSDIDAF-ISLNQQENEIKNKEFELTAIE-------------KDIANK 249
A Q+ ++S++DA +L EI K+F A +
Sbjct: 347 ----AVQVYNSRKSELDAANKTLADAIAEI--KQFNRFAHDPMAGGHRMWQMAGLKAQRA 400

Query: 250 QRDKDELQANIGWSETHHDVDSSEAMKSYVSEQIKNKQEQAAYIKQLERSLEENK 304
Q D + QA + + ++A S E K K+++ + E +L + K
Sbjct: 401 QTDVNNKQA--AFDAAAKEKSDADAALSSAMESRKKKEDKK---RSAENNLNDEK 450


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013670HTHFIS808e-20 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 8e-20
Identities = 36/143 (25%), Positives = 61/143 (42%), Gaps = 7/143 (4%)

Query: 3 KVILVDDHYIVRQGLRFLLSTIENIEVLQDFADGETFLEYLKEHEHPDIVLLDLVMPGMN 62
+++ DD +R L L + +V ++ T ++ D+V+ D+VMP N
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWI-AAGDGDLVVTDVVMPDEN 61

Query: 63 GIEITEYIKAHYPEIKVLVLTSYVDDEHVISAINKGADGYEMKDVEPQQLIETIRRVMNG 122
++ IK P++ VLV+++ I A KGA Y K + +LI I R +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 EKMIHPK----AQDVFETVSQKP 141
K K +QD V +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1013680PF06580408e-06 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 8e-06
Identities = 21/111 (18%), Positives = 45/111 (40%), Gaps = 16/111 (14%)

Query: 278 IDLSNEIEENIYRA------LQECINNVKKHA-----DTNKMDLTLKQMNDILYIDVIDY 326
+ N+I I +Q + N KH K+ L + N + ++V +
Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 327 GQGFEIDNVQIASSHGINNIKQRVKLLRGK---VTFHSQPTKGTQIQFTIP 374
G + N + ++ G+ N+++R+++L G + + K + IP
Sbjct: 300 GSL-ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


35SAI8T7_1015130SAI8T7_1015250N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1015130-19-0.6309673-isopropylmalate dehydratase small subunit
SAI8T7_1015140-18-0.146910L-threonine dehydratase biosynthetic IlvA
SAI8T7_1015180-1110.267230**Similar to RNA binding protein, contains S1
SAI8T7_1015190-315-0.127266RNA polymerase sigma factor
SAI8T7_1015200-316-0.663052Putative Anti-sigma B factor
SAI8T7_1015210-218-0.354410Putative RsbU
SAI8T7_1015220-1180.386689Alanine racemase
SAI8T7_1015230017-0.369311Putative uncharacterized protein
SAI8T7_1015240-113-0.867402Putative uncharacterized protein SA1877
SAI8T7_1015250-113-0.733336Putative uncharacterized protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015130NEISSPPORIN280.016 Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 28.4 bits (63), Expect = 0.016
Identities = 18/67 (26%), Positives = 24/67 (35%), Gaps = 4/67 (5%)

Query: 40 GFGPFAFDEWRYLPDGSDNPDFNPNKPQYKGASILITGDNFGCGSSREHAAWALKDYGFH 99
A E RYL D+P+F + G+ DN G H ++ GF
Sbjct: 137 EISGMAQREHRYLSVRYDSPEF----AGFSGSVQYAPKDNSGSNGESYHVGLNYQNSGFF 192

Query: 100 IIIAGSF 106
AG F
Sbjct: 193 AQYAGLF 199


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015190FIMREGULATRY270.027 Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein

signature.
Length = 104

Score = 26.8 bits (59), Expect = 0.027
Identities = 14/58 (24%), Positives = 25/58 (43%), Gaps = 4/58 (6%)

Query: 135 ILEKILPILSDREREIIQCTFIEGLSQKETGERIGLSQMHVSRLQRTAIKKLQEAAHK 192
+L I I SDR ++ + G S+KE E+ ++ + S T + +L
Sbjct: 35 LLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFS----TTLGRLIRLNAL 88


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015200PF06580357e-05 Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.2 bits (81), Expect = 7e-05
Identities = 9/48 (18%), Positives = 17/48 (35%)

Query: 61 VTNAVKHAYKENNNVGIINIYFEILEDKIKIVISDKGDSFDYETTKSK 108
V N +KH + G I + + + + + G T +S
Sbjct: 264 VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKEST 311


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015220ALARACEMASE328e-113 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 328 bits (843), Expect = e-113
Identities = 110/366 (30%), Positives = 173/366 (47%), Gaps = 17/366 (4%)

Query: 15 RSAYMNVDLNAVASNFKVFSTLHPNKTVMAVVKANAYGLGSVKVARHLMENGATFFAVAT 74
R ++DL A+ N + + V +VVKANAYG G ++ + FA+
Sbjct: 3 RPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDG--FALLN 60

Query: 75 LDEAIELRMHGITAKILVL-GVLPAKDIDKAIQHRVALTVPSKQWLKEAIKNISGEQEKK 133
L+EAI LR G IL+L G A+D++ QHR+ V W +A++N +
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCV-HSNWQLKALQNA--RLKAP 117

Query: 134 LWLHIKLDTGMGRLGIKDTNTYQEVIEIIQQYEQLVFEGVFTHFACADEPGDMTTEQYQR 193
L +++K+++GM RLG + + V + ++ + + +HFA A+ P D + R
Sbjct: 118 LDIYLKVNSGMNRLGFQ-PDRVLTVWQQLRAMANVGEMTLMSHFAEAEHP-DGISGAMAR 175

Query: 194 FKDMVNEAIKPEYIHCQNSAGSLLMDCQFCNAIRPGISLYGYYPSEYVQQKVKVHLKPSV 253
+ NSA +L + +RPGI LYG PS + L+P +
Sbjct: 176 IEQAAEG--LECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVM 233

Query: 254 QLIANVVQTKTLQAGESVSYGATYTATDPTTIALLPIGYADGYLR-IMQGSFVNVNGHQC 312
L + ++ +TL+AGE V YG YTA D I ++ GYADGY R G+ V V+G +
Sbjct: 234 TLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRT 293

Query: 313 EVIGRVCMDQTIVKVPD--QVKAGDSVILIDNHRESPQSVEVVAEKQHTINYEVLCNLSR 370
+G V MD V + Q G V L ++ VA T+ YE++C L+
Sbjct: 294 MTVGTVSMDMLAVDLTPCPQAGIGTPVELWGKEI----KIDDVAAAAGTVGYELMCALAL 349

Query: 371 RLPRIY 376
R+P +
Sbjct: 350 RVPVVT 355


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1015250SECFTRNLCASE270.046 Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 26.7 bits (59), Expect = 0.046
Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 28 LSVAVYVIFYFIWLRFEW 45
L A VI ++IW+RFEW
Sbjct: 159 LLAATVVIMFYIWVRFEW 176


36SAI8T7_1016170SAI8T7_1016220N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1016170-28-1.734840Iron complex transport system substrate-binding
SAI8T7_1016180-28-1.495369Putative uncharacterized protein
SAI8T7_1016190-27-0.597128Putative uncharacterized protein
SAI8T7_1016200-28-0.383817Transporter
SAI8T7_1016210-280.522783Putative uncharacterized protein
SAI8T7_1016220-191.058634Alkaline shock protein 23
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016170FERRIBNDNGPP966e-25 Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 95.8 bits (238), Expect = 6e-25
Identities = 64/257 (24%), Positives = 107/257 (41%), Gaps = 24/257 (9%)

Query: 63 DAKRIVVLEYSFADALAALDVKPVGIADDGKKKRIIK--PVREKIGDYTSVGTRKQPNLE 120
D RIV LE+ + L AL + P G+AD + + P+ + + D VG R +PNLE
Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVID---VGLRTEPNLE 90

Query: 121 EISKLKPDLIIADSSRHKGINKELNKIAPTLSLKSFDGDYKQNI--NSFKTIAKALNKEK 178
++++KP ++ S+ + + L +IAP DG + S +A LN +
Sbjct: 91 LLTEMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQS 149

Query: 179 EGEKRLAEHDKLINKYKDEIKFDRNQKVLPAVV---AKAGLLAHPNYSYVGQFLNELGFK 235
E LA+++ I K R + L + L+ PN S + L+E G
Sbjct: 150 AAETHLAQYEDFIRSMKPRF-VKRGARPLLLTTLIDPRHMLVFGPN-SLFQEILDEYGIP 207

Query: 236 NALSDDVTKGLSKYLKGPYLQLDTEHLADLNPERMIIMTDHAKKDSAEFKKLQEDATWKK 295
NA +G + + + + LA ++ DH +S + L W+
Sbjct: 208 NAW-----QGETNFWG--STAVSIDRLAAYKDVDVLCF-DHD--NSKDMDALMATPLWQA 257

Query: 296 LNAVKNNRVDIVDRDVW 312
+ V+ R V VW
Sbjct: 258 MPFVRAGRFQRVP-AVW 273


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016180ALARACEMASE391e-05 Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 39.4 bits (92), Expect = 1e-05
Identities = 59/325 (18%), Positives = 119/325 (36%), Gaps = 33/325 (10%)

Query: 4 VNINISKIKYNAKVLQTVFQSKNIQFTPVIKCIAGDRTIVESLKALG-INHVAESRLDNI 62
++++ +K N +++ + + + V+K A I A+G + A L+
Sbjct: 7 ASLDLQALKQNLSIVRQA--ATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEA 64

Query: 63 ISIADQDLTYTLLRTPAKKEISDMIEKVDMSIQTELSTIHQINEVAEV-LGKKHKILLMV 121
I++ ++ +L D+ + T + + Q+ + L I L V
Sbjct: 65 ITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKV 124

Query: 122 DWKDGREGVLTYDVLDYIKEIIHLKNIHFVGLAFNFMCFKSDAPSDDDIFMINRFVSAVE 181
+ R G VL +++ + N+ + L +F ++ P D + R A E
Sbjct: 125 NSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAE--AEHP-DGISGAMARIEQAAE 181

Query: 182 REIGYRLKIISGGNSSMLPQLLYNDLGKINELRIGETLFRGVDTTTNQAIAML-YQDAIT 240
+ R + + + P+ ++ +R G L+ + + IA + +T
Sbjct: 182 -GLECRRSLSNSAATLWHPEAHFD------WVRPGIILYGASPSGQWRDIANTGLRPVMT 234

Query: 241 LEAEILEIK-----PRVN-----TQTHESFLQAIVDIGYLD---TKVDNISPM---DQHI 284
L +EI+ ++ RV T E + IV GY D +P+
Sbjct: 235 LSSEIIGVQTLKAGERVGYGGRYTARDEQRI-GIVAAGYADGYPRHAPTGTPVLVDGVRT 293

Query: 285 NILGA-SSDHLMLDLNGQGHYQVGD 308
+G S D L +DL +G
Sbjct: 294 MTVGTVSMDMLAVDLTPCPQAGIGT 318


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016190PF041832592e-81 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 259 bits (663), Expect = 2e-81
Identities = 92/456 (20%), Positives = 176/456 (38%), Gaps = 56/456 (12%)

Query: 121 EGHPTHPLTKTKLPLTMEEVRAYAPEFEKEIPLQIMMIEKDHVVCTAMDGND--QFIIDE 178
GHP K + E + YAPE+ L + ++++H++ + D Q +
Sbjct: 134 SGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLTAA 193

Query: 179 IIPEYYNQIRVFLKSLGLKSEDYRAILVHPWQYDHTIGKYFEAWIAKKILIPT-PFTILS 237
+ P+ + + + GL ++ + VHPWQ+ I F A A+ ++ F
Sbjct: 194 MDPQEFARFSQVWQENGL-DHNWLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQW 252

Query: 238 KATLSFRTMSLIDKP--YHVKLPVDAQATSAVRTVSTVTTVDGPKLSYALQN-------- 287
A S RT++ + +KLP+ TS R + GP S LQ
Sbjct: 253 LAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATL 312

Query: 288 ------MLNQYPGFKVAMEPFGEYANVDKDRARQLACIIRQKPE--IDGKGATVVSASLV 339
+L + V+ E + A L I R+ P + + V+ A+L+
Sbjct: 313 VQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLM 372

Query: 340 NKNPIDQKVIVDSYLEWLNQGITKESITTFIERYAQALIPPLIAFIQNYGIALEAHMQNT 399
+ +Q + +Y++ G+ E+ ++ + + ++ PL + YG+AL AH QN
Sbjct: 373 ECDENNQPLA-GAYID--RSGLDAET---WLTQLFRVVVVPLYHLLCRYGVALIAHGQNI 426

Query: 400 VVNLGPHFDIQFLVRDLGGS-RI------DLETLQHRVSDI--KITNDSLIADSIDAVIA 450
+ + + L++D G R+ ++++L V D+ +++ D LI D
Sbjct: 427 TLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDLQTGHFV 486

Query: 451 KFQHAVIQNQMAELIHHFNQYDCVEETELFNIVQQVVA--HAINPTLPHANELKDILFGP 508
I V E + ++ V++ +P + L LF P
Sbjct: 487 TV---------LRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFS-LFRP 536

Query: 509 TITVKALLNMRM-----ENKVKQYLNI--ELDNPIK 537
I L +++ + + N +L NP+
Sbjct: 537 QIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLW 572


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016200TCRTETA423e-06 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 3e-06
Identities = 53/340 (15%), Positives = 106/340 (31%), Gaps = 26/340 (7%)

Query: 6 FSSSFLLFLGNWIGQIGLNWFVLTTYHN--------AVYLGIVNFCRLVPILLLSVWAGA 57
S+ L +G IGL VL + GI+ + + GA
Sbjct: 11 LSTVALDAVG-----IGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 58 IADKYDKGRLLRITISSSFLVTAILCVLTYSFTAIPISVIIIYAT-LRGILSAVETPLRQ 116
++D++ + R + S A+ Y+ A + ++Y + ++ +
Sbjct: 66 LSDRFGR----RPVLLVSLAGAAV----DYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117

Query: 117 AILPDLSDKISTTQAVSFHSFIINICRSIGPAIAGVILAVYHAPTTFLAQA--ICYFIAA 174
A + D++D + F S GP + G++ F A A F+
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 175 LLCLPLHFKVTKIPEDATRYMPLKVIIDYFKLHMEGRQIFITSLLIMATGFSYTTLLPVL 234
LP K + P PL + + + ++ G L +
Sbjct: 178 CFLLPESHKGERRPLRREALNPLASFRWARGMTVVA-ALMAVFFIMQLVGQVPAALWVIF 236

Query: 235 TNKVFPGKSEIFGIAMTMCAIGGIIATLVL-PKVLKYIGMVNMYYLSSLLFGIALLGVVF 293
F + GI++ I +A ++ V +G L + G + + F
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 294 HNIVIMFICITLIGLFSQWARTTNRVYFQNNVKDYERGKV 333
M I ++ + V + +G++
Sbjct: 297 ATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQL 336



Score = 30.6 bits (69), Expect = 0.012
Identities = 37/180 (20%), Positives = 71/180 (39%), Gaps = 21/180 (11%)

Query: 10 FLLFLGNWIGQIGLNWFVLTTYH----NAVYLGI-VNFCRLVPILLLSVWAGAIADKYDK 64
+ F+ +GQ+ +V+ +A +GI + ++ L ++ G +A + +
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 65 GRLLRITISSSFLVTAILCVLTYSFTAIPISVIIIYATLRGILSAVETPLRQAILPDLSD 124
R L + + + +L T + A PI V++ + P QA+L D
Sbjct: 277 RRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA-------SGGIGMPALQAMLSRQVD 329

Query: 125 KISTTQAVSFHSFIINICRSIGPAIAGVILAVYHAPT----TFLAQAICYFIAALLCLPL 180
+ Q + + ++ +GP + I A T ++A A Y LLCLP
Sbjct: 330 EERQGQLQGSLAALTSLTSIVGPLLFTAIYA-ASITTWNGWAWIAGAALY----LLCLPA 384


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016210PF041832703e-84 IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 270 bits (691), Expect = 3e-84
Identities = 93/475 (19%), Positives = 181/475 (38%), Gaps = 45/475 (9%)

Query: 197 SEQAVIEGHPLHPGAKLRKGLNALQTFLYSSEFNQPIKLKIVLIHSKLSRTMSLSKDYDT 256
Q ++ GHP K R+G Y+ E+ +L + + + M D +
Sbjct: 128 RLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKRE---HMIWRCDNEM 184

Query: 257 TVHQLF-----PDLIKQLENEFTPNFNFNDYHIMIVHPWQLDDVLHSDYQAEVDKELIIE 311
+HQL P + + N +++ + VHPWQ + +D+ A+ + ++
Sbjct: 185 DIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEGRMVS 244

Query: 312 AKHTLD-YYAGLSFRTLVPKYPAMSPHIKLSTNVHITGEIRTLSEQTTHNGPLMTRILND 370
D + A S RTL IKL ++ T R + + GPL +R L
Sbjct: 245 LGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQ 304

Query: 371 ILEKDVIFKSYASTIIDEVAGIHFYNEQDEVDYQTER--SEQLGTLFRKNIYQMIPQEVT 428
+ D + I+ E A + +E + E LG ++R+N + + + +
Sbjct: 305 VFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDES 364

Query: 429 PMIPSSLVATYPFNNESPIVTLIKRYQSAASLSDFESSAKSWIETYSKALLGLVIPLVTK 488
P++ ++L+ N P+ A + A++W+ + ++ + L+ +
Sbjct: 365 PVLMATLMECDE--NNQPLA--------GAYIDRSGLDAETWLTQLFRVVVVPLYHLLCR 414

Query: 489 YGIALEAHLQNAIATFRKDGLLDTMYIRDFEG-LRIDKAQLNEMGYSTSHFHEKSRILTD 547
YG+AL AH QN I K+G+ + ++DF+G +R+ K + EM S E + +
Sbjct: 415 YGVALIAHGQN-ITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD---SLPQEVRDVTSR 470

Query: 548 SKTSVFNKAFYSTVQNHLGELILTISKASNDSNLERHMWYIVRDVLDNIFDQLVLSTHKS 607
+ + I + ER + ++ VL + + H
Sbjct: 471 LSADYLIHDLQTGHFVTVLRFISPLMVRLGVP--ERRFYQLLAAVLSDYMKK-----HPQ 523

Query: 608 NQVNENRINEIKDTMFAPFIDYKCVTTMRLE----DEAHHY--TYIK-VNNPLYR 655
+ +F P I + ++L D Y++ + NPL+
Sbjct: 524 MSERFALFS-----LFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWL 573


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1016220TCRTETOQM290.012 Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 28.7 bits (64), Expect = 0.012
Identities = 14/43 (32%), Positives = 21/43 (48%), Gaps = 5/43 (11%)

Query: 99 VDLKVILEYGE-----SAPKIFRKVTELVKEQVKYITGLDVVE 136
D K+ +YG S P FR + +V EQV G +++E
Sbjct: 495 TDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLE 537


37SAI8T7_1017370SAI8T7_1017480N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1017370010-1.582581Putative uncharacterized protein
SAI8T7_101738018-1.288830Similar to esterase
SAI8T7_101739008-1.501127Putative uncharacterized protein
SAI8T7_1017400-18-1.549153SA2142 protein
SAI8T7_1017410-212-1.367301Putative uncharacterized protein SA2143
SAI8T7_1017420-213-1.810388Similar to transcriptional regulator
SAI8T7_1017430-214-1.189832TcaB protein
SAI8T7_1017440-211-1.467736Putative Membrane-associated protein TcaA
SAI8T7_1017450-110-1.454390Putative uncharacterized protein
SAI8T7_1017460-211-0.213017Putative hemin import ATP-binding protein HrtA
SAI8T7_1017470012-0.718543Putative hemin transport system permease protein
SAI8T7_1017480-111-0.395938Response regulator receiver domain protein
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017370PF06438260.050 Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 26.5 bits (58), Expect = 0.050
Identities = 12/28 (42%), Positives = 17/28 (60%), Gaps = 2/28 (7%)

Query: 68 NLAYTLFTLEEHTTY--LSELSLGDVFT 93
+L YTLF+ HT + L ++LGD T
Sbjct: 72 DLHYTLFSNPSHTLWGKLDSIALGDTLT 99


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017400TCRTETB1582e-44 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 158 bits (402), Expect = 2e-44
Identities = 92/415 (22%), Positives = 187/415 (45%), Gaps = 16/415 (3%)

Query: 140 KILAALLFGMFIAILNQTLLNVALPKINTEFNISASTGQWLMTGFMLVNGILIPITAYLF 199
+IL L F ++LN+ +LNV+LP I +FN ++ W+ T FML I + L
Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 200 NKYSYRKLFLVALVLFTIGSLICAISMN-FPIMMVGRVLQAIGAGVLMPLGSIVIITIYP 258
++ ++L L +++ GS+I + + F ++++ R +Q GA L +V+ P
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 259 PEKRGAAMGTMGIAMILAPAIGPTLSGYIVQNYHWNVMFYGMFIIGIIAILVGFVWFKLY 318
E RG A G +G + + +GP + G I HW+ + +I II + K
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIP-MITIITVPFLMKLLKKE 192

Query: 319 QYTTNPKADIPGIIFSTIGFGALLYGFSEAGNKGWGSVEIETMFAIGIIFIILFVIRELR 378
DI GII ++G + + + ++ ++FV +
Sbjct: 193 VRIKGH-FDIKGIILMSVGIVFFMLF---TTSYSIS------FLIVSVLSFLIFVKHIRK 242

Query: 379 MKSPMLNLEVLKFPTFTLTTIINMVVMLSLYGGMILLPIYLQNLRGFSALDSG-LLLLPG 437
+ P ++ + K F + + ++ ++ G + ++P ++++ S + G +++ PG
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 438 SLIMGLLGPFAGKLLDTIGLKPLAIFGIAVMTYATWELTKLNMDTP-YMTIMGIYVLRSF 496
++ + + G G L+D G + G+ ++ + + L T +MTI+ ++VL
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL--G 360

Query: 497 GMAFIMMPMVTAAINALPGRLASHGNAFLNTMRQLAGSIGTAILVTVMTTQTTQH 551
G++F + T ++L + A G + LN L+ G AI+ +++
Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017410RTXTOXIND591e-12 Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 59.1 bits (143), Expect = 1e-12
Identities = 26/133 (19%), Positives = 45/133 (33%), Gaps = 13/133 (9%)

Query: 87 MDLKMPQKGTIAKLD-GMEGSMVQAGNPIAYAYNLDD-LYVTANIDEKDIKDVEVGKDVD 144
++ P + +L EG +V + DD L VTA + KDI + VG++
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAI 387

Query: 145 VTIDGQKAS----IKGKVDSIGKATAASFSLMPSSNSDGNYTKVSQVIPVKITLESEPSK 200
+ ++ + + GKV +I G V I +
Sbjct: 388 IKVEAFPYTRYGYLVGKVKNINLDAI-------EDQRLGLVFNVIISIEENCLSTGNKNI 440

Query: 201 QVVPGMNAEVKIH 213
+ GM +I
Sbjct: 441 PLSSGMAVTAEIK 453



Score = 32.5 bits (74), Expect = 0.001
Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 2/77 (2%)

Query: 9 VITVVVLLAIGIAGFYFWNKTTSYVTTDNAKV--NGDQIKIASPASGQIKSLNVKQGDKL 66
++ ++ + IA V T N K+ +G +I + +K + VK+G+ +
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 67 DKGDKVATVTVQGQDGE 83
KGD + +T G + +
Sbjct: 119 RKGDVLLKLTALGAEAD 135


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017420HTHTETR454e-08 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.6 bits (105), Expect = 4e-08
Identities = 13/69 (18%), Positives = 24/69 (34%)

Query: 2 KRQAKIEIQNALVDLMAEYPFQEISTKMICAYCNINRSTFYDYYKDKFDLLDTINSKHKE 61
++ + I + + L ++ S I + R Y ++KDK DL I +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 62 KFQFLLSAL 70
L
Sbjct: 69 NIGELELEY 77


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017430TCRTETA642e-13 Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 64.5 bits (157), Expect = 2e-13
Identities = 68/386 (17%), Positives = 141/386 (36%), Gaps = 16/386 (4%)

Query: 15 IIILGSLTAIGALSIDMFLPGLPDIRHDF---QTTTSNAQLTLSMFMIGLAFGNLFAGPI 71
+I++ S A+ A+ I + +P LP + D T++ + L+++ + G +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 72 SDSTGRRKPLIIAMIIFTLASLGIVFVHNIWLMVALRFLQGVTGGAAAVISRAIASDMYS 131
SD GRR L++++ + + +W++ R + G+TG AV IA D+
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITD 125

Query: 132 GNELTKFMALLMLVNGIAPVVAPTIGGIILNYSVWRMVFVILTIFGFVMVIGSLLKVPES 191
G+E + + G V P +GG++ +S F + + +PES
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPES 184

Query: 192 LTVTNRESSSGLKTMFKNFKILLKTPRFVLPMLIQGMTFVILFTYISASPFII--QKIYG 249
R +F+ + V ++ + L + A+ ++I + +
Sbjct: 185 HKGERRPLRREALNPLASFR-WARGMTVVAALMAVFFI-MQLVGQVPAALWVIFGEDRFH 242

Query: 250 MTAIQFSWMFAGIGITLIISSQLTGYLVDFIDSQKLMRGMTMIQIIGVILVTIVLLNHWN 309
A A GI ++ + + ++ R M+ +I I+L
Sbjct: 243 WDATTIGISLAAFGILHSLAQ---AMITGPVAARLGERRALMLGMIADGTGYILLAFATR 299

Query: 310 FWILAIGFIILIAPVTGVATIGFTIAMDESSSGRGSSSSLLGLVQFLFGGVASPLVGVKG 369
W+ ++L + G+ + ++ +G L + L + PL+
Sbjct: 300 GWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSL-TSIVGPLLFTAI 358

Query: 370 EDNPIPY---IIIIIATAVILIILQI 392
I I A+ L+ L
Sbjct: 359 YAASITTWNGWAWIAGAALYLLCLPA 384


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017460PF05272290.014 Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.014
Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 35 VILNGASGSGKTTLLTILGGL 55
V+L G G GK+TL+ L GL
Sbjct: 599 VVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017480HTHFIS793e-19 FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 3e-19
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 14 LVVDDDPRILNYIASHLQTEHIDAYTQPSGEAALKLLEKQRVDIAVVDIMMDGMDGFQLC 73
LV DDD I + L D + + + D+ V D++M + F L
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 74 NTLKNDY-DIPVIMLTARDALSDKERAFISGTDDYVTKPFEVKELIFRIRAVLRRYN 129
+K D+PV++++A++ +A G DY+ KPF++ ELI I L
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


38SAI8T7_1017880SAI8T7_1017970N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1017880-214-0.547367Similar to multidrug resistance protein homolog
SAI8T7_1017890018-0.9797352,3-bisphosphoglycerate-dependent
SAI8T7_1017900017-1.329491Similar to cation efflux family protein
SAI8T7_1017910117-1.417049Immunoglobulin-binding protein sbi
SAI8T7_1017920016-1.818509Gamma-hemolysin component A, HlgA
SAI8T7_1017930-216-1.547727Gamma-hemolysin component C
SAI8T7_1017940-215-1.321672Gamma-hemolysin component B
SAI8T7_1017950-218-1.318028Putative uncharacterized protein
SAI8T7_1017960-216-1.8344926-carboxyhexanoate--CoA ligase
SAI8T7_1017970-214-2.052065Similar to 8-amino-7-oxononanoate synthase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017880TCRTETB1293e-35 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 129 bits (327), Expect = 3e-35
Identities = 91/398 (22%), Positives = 177/398 (44%), Gaps = 14/398 (3%)

Query: 18 FFGLLNETLLVTALPSIMKDFEISYTQVQWLTTAFLLTNGIVIPLSALVIQRYTTRQVFL 77
FF +LNE +L +LP I DF W+ TAF+LT I + + + +++ L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 VGISIFFLGTLLGGLS-PHFATLLVARIIQALGAGIMMPLMMTTILDVFQPHERGKYMGI 136
GI I G+++G + F+ L++AR IQ GA L+M + RGK G+
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 137 FGLVIGLAPAIGPTLSGYLVEYLNWRSLFHVVAPIAAVTFLIGFKTIKNVGTTIKVPIDF 196
G ++ + +GP + G + Y++W L + P+ + + + IK D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI--PMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 197 ISVIFSVLGFGGLLYGTSSISEKGFDNPIVLVSMIGGVVLVALFVLRQYRLSTPLLNFAV 256
+I +G + T+S S F +I V+ +FV +++ P ++ +
Sbjct: 202 KGIILMSVGIVFFMLFTTSYS-ISF--------LIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 257 FKNKQFTVGIIIMGVTMVSMIGSETILPIFVQNLLHRSALDSG-LTLLPGAIVMAFMSMT 315
KN F +G++ G+ ++ G +++P ++++ S + G + + PG + +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 316 SGALYEKFGPRKLALVGMAIVVITTAYFVVMDEQTSTIMLATVYAIRMVGIALGLIPVMT 375
G L ++ GP + +G+ + ++ + E TS M + + G++ + T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 376 HTMNQLKPEMNAHGSSMTNTVQQIAGSIGTAALITILS 413
+ LK + G S+ N ++ G A + +LS
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017920BICOMPNTOXIN427e-154 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 427 bits (1100), Expect = e-154
Identities = 213/312 (68%), Positives = 247/312 (79%), Gaps = 8/312 (2%)

Query: 13 MIKNKILTATLAVGLIAPLANPFIEISKAENKIEDIGQGA--EIIKRTQDITSKRLAITQ 70
M+KNKILT TL+V L+APLANP +E +KA N EDIG+G+ EIIKRT+D TS + +TQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 71 NIQFDFVKDKKYNKDALVVKMQGFISSRTTYSDLKKYPYIKRMIWPFQYNISLKTKDSNV 130
NIQFDFVKDKKYNKDAL++KMQGFISSRTTY + KK ++K M WPFQYNI LKT D V
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV 120

Query: 131 DLINYLPKNKIDSADVSQKLGYNIGGNFQSAPSIGGSGSFNYSKTISYNQKNYVTEVESQ 190
LINYLPKNKI+S +VSQ LGYNIGGNFQSAPS+GG+GSFNYSK+ISY Q+NYV+EVE Q
Sbjct: 121 SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ 180

Query: 191 NSKGVKWGVKANSFVTPNGQVSAYDQYLF-AQDPTGPAARDYFVPDNQLPPLIQSGFNPS 249
NSK V WGVKANSF T +GQ SA+D LF P RDYFVPD++LPPL+QSGFNPS
Sbjct: 181 NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS 240

Query: 250 FITTLSHERGKGDKSEFEITYGRNMDATYA-----YVTRHRLAVDRKHDAFKNRNVTVKY 304
FI T+SHE+G D SEFEITYGRNMD T+A + L R H+AF NRN TVKY
Sbjct: 241 FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY 300

Query: 305 EVNWKTHEVKIK 316
EVNWKTHE+K+K
Sbjct: 301 EVNWKTHEIKVK 312


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017930BICOMPNTOXIN467e-169 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 467 bits (1203), Expect = e-169
Identities = 313/315 (99%), Positives = 313/315 (99%)

Query: 1 MLKNKILATTLSVSLLALLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60
MLKNKIL TTLSVSLLA LANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV 120
NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV 120

Query: 121 SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ 180
SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ
Sbjct: 121 SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ 180

Query: 181 NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS 240
NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS
Sbjct: 181 NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS 240

Query: 241 FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY 300
FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY
Sbjct: 241 FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY 300

Query: 301 EVNWKTHEIKVKGQN 315
EVNWKTHEIKVKGQN
Sbjct: 301 EVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017940BICOMPNTOXIN383e-136 Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 383 bits (985), Expect = e-136
Identities = 87/322 (27%), Positives = 160/322 (49%), Gaps = 18/322 (5%)

Query: 1 MKMNKLVKSSVATSMALLLLSGTANAEGKITPVSVKKVDDKVTLYKTTATADSDKFKISQ 60
M NK++ ++++ S+ L + + + K T S+K+ ++Q
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 ILTFNFIKDKSYDKDTLVLKATGNINSGFVKPNPNDYDFSK-LYWGAKYNVSISSQSNDS 119
+ F+F+KDK Y+KD L+LK G I+S N + K + W +YN+ + + ++
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKT-NDKY 119

Query: 120 VNVVDYAPKNQNEEFQVQNTLGYTFGGDISISNGLSGGLNGNTAFSETINYKQESYRTTL 179
V++++Y PKN+ E V TLGY GG+ + L G NG+ +S++I+Y Q++Y + +
Sbjct: 120 VSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGG--NGSFNYSKSISYTQQNYVSEV 177

Query: 180 SRNTNYKNVGWGVEAHKIMNNGWGPYGRDSFHPTYGNELFLAGRQSSAYAGQNFIAQHQM 239
+ N K+V WGV+A+ + ++LF+ + S F+ ++
Sbjct: 178 EQQ-NSKSVLWGVKANSFATESGQ-------KSAFDSDLFVGYKPHSKDPRDYFVPDSEL 229

Query: 240 PLLSRSNFNPEFLSVLSHRQDGAKKSKITVTYQREMDL-----YQIRWNGFYWAGANYKN 294
P L +S FNP F++ +SH + + S+ +TY R MD+ + Y G N
Sbjct: 230 PPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHN 289

Query: 295 -FKTRTFKSTYEIDWENHKVKL 315
F R + YE++W+ H++K+
Sbjct: 290 AFVNRNYTVKYEVNWKTHEIKV 311


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1017970CLENTEROTOXN280.044 Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 28.1 bits (62), Expect = 0.044
Identities = 8/47 (17%), Positives = 15/47 (31%), Gaps = 3/47 (6%)

Query: 172 GGVILSSND---VKDMLINHGRPLIYSSSLPIYNLYFIKRNIEKLIN 215
IL+ N+ L I + + FI+ ++E
Sbjct: 59 SSQILNPNETGTFSQSLTKSKEVSINVNFSVGFTSEFIQASVEYGFG 105


39SAI8T7_1019110SAI8T7_1019180N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10191101110.839411Acetyltransferase, GNAT family
SAI8T7_10191201121.496157Probable transglycosylase isaA
SAI8T7_1019130-1110.966870Putative regulatory protein
SAI8T7_1019140-1100.532243Putative uncharacterized protein
SAI8T7_1019150-3111.366605Putative uncharacterized protein
SAI8T7_1019160-3111.393406Putative uncharacterized protein
SAI8T7_1019170-2131.606375HTH-type transcriptional regulator SAV2578
SAI8T7_10191800142.105867Putative short chain oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019110SACTRNSFRASE438e-08 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 43.0 bits (101), Expect = 8e-08
Identities = 31/101 (30%), Positives = 46/101 (45%), Gaps = 5/101 (4%)

Query: 41 DDQPDLENIEHNYLNSGGQFWLAINNHQNIVGTIGLIRLDNNMSALKKMFVDKGYRNLKI 100
DD D+ +E G +L N +G I + N + ++ + V K YR +
Sbjct: 52 DDDMDVSYVE----EEGKAAFLYYLE-NNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGV 106

Query: 101 GKKLLDKVIMTCKEQNIDGIYLGTIDKFISAQYFYSNNGFR 141
G LL K I KE + G+ L T D ISA +FY+ + F
Sbjct: 107 GTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019140HTHTETR431e-07 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 43.5 bits (102), Expect = 1e-07
Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 34/200 (17%)

Query: 11 KSIDPRIVRTKQLLVDAFLKISREKKLSQITVKDITDIATLNRATFYAHFTDKEDLLDYT 70
+ T+Q ++D L++ ++ +S ++ +I A + R Y HF DK DL
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 71 LSV---TILKDLNDNLSISNVINEKVLRNIFISIASYIKDAAKSCELNSEAFCNKAHQRI 127
+ I + + + VLR I I + + L F
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF------HK 116

Query: 128 NNELEDIFAIM-LENSYPEHQRDIIVNS-------------------ASFLAAGISGLAL 167
+ ++ + + + D I + A + ISGL
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 168 HWFNTSQ-----ETADVFID 182
+W Q + A ++
Sbjct: 177 NWLFAPQSFDLKKEARDYVA 196


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019160NUCEPIMERASE362e-04 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.5 bits (82), Expect = 2e-04
Identities = 35/138 (25%), Positives = 53/138 (38%), Gaps = 35/138 (25%)

Query: 1 MKDILVIGATGKQGNAVVKQLLEDGWYVSAL--------TRNKNNRKLSDIGHPHLSIVE 52
MK LV GA G G V K+LLE G V + K R L + P +
Sbjct: 1 MK-YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQAR-LELLAQPGFQFHK 58

Query: 53 GDLSD-----------------NVSLQSAMKGKYGLYSIQ-PIVKDDVSEELRQGMKIIE 94
DL+D + A++ YS++ P D + L + I+E
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVR-----YSLENPHAYADSN--LTGFLNILE 111

Query: 95 IAEQENIQHIVYSTAGGV 112
IQH++Y+++ V
Sbjct: 112 GCRHNKIQHLLYASSSSV 129


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019170HTHTETR622e-14 TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 2e-14
Identities = 25/80 (31%), Positives = 44/80 (55%)

Query: 1 MRKDAKENRQRIEEIAHKLFDEEGVENISMNRIAKELGIGMGTLYRHFKDKSDLCYYVIQ 60
+++A+E RQ I ++A +LF ++GV + S+ IAK G+ G +Y HFKDKSDL + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 RDLDIFITHFKQIKDDYHSN 80
+ + + +
Sbjct: 65 LSESNIGELELEYQAKFPGD 84


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019180DHBDHDRGNASE656e-15 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 65.5 bits (159), Expect = 6e-15
Identities = 46/194 (23%), Positives = 74/194 (38%), Gaps = 18/194 (9%)

Query: 2 LITGGNKGLGYASAEALKALGYKVYIGSRND---VRGQQASQKLGVHYVQ--LDVTSDYS 56
ITG +G+G A A L + G + N + + + H DV +
Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71

Query: 57 VKNAYNMIAEKEGRLDILINNAGISGQFSAPSKLTPRDVEEVYQTNVFGIVRMMNTFVPL 116
+ I + G +DIL+N AG+ + L+ + E + N G+ +
Sbjct: 72 IDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130

Query: 117 LEKSEQPVVVNVSSGLGSFGMVTNPETAESKVNSLAYCSSKSAVTMLTLQYAKGLP--NM 174
+ +V V S P T+ + AY SSK+A M T L N+
Sbjct: 131 MMDRRSGSIVTVGSNPAG-----VPRTSMA-----AYASSKAAAVMFTKCLGLELAEYNI 180

Query: 175 QINAADPGATNTDL 188
+ N PG+T TD+
Sbjct: 181 RCNIVSPGSTETDM 194


40SAI8T7_1019650SAI8T7_1019720N        Y        NPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_10196501152.113291Clumping factor B
SAI8T7_1019660-1120.127131HTH-type transcriptional regulator ArcR
SAI8T7_1019670-3101.107033Carbamate kinase 2
SAI8T7_1019680-2100.360113Arginine/ornithine antiporter
SAI8T7_1019690-3100.041238Ornithine carbamoyltransferase, catabolic
SAI8T7_1019700-18-0.761311Arginine deiminase
SAI8T7_1019710-1100.438555Arginine repressor 1
SAI8T7_1019720-1100.974584Zinc metalloproteinase aureolysin
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019650PF05616512e-08 Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 50.9 bits (121), Expect = 2e-08
Identities = 36/125 (28%), Positives = 57/125 (45%), Gaps = 14/125 (11%)

Query: 508 NVDPVTNRDYSIFGWNNENVVRYGGGSADGDSAVNPK-----DPTPG----PPVDPEPSP 558
N+ PVT+R+ N VV G + G++ V+ + D TPG P P P
Sbjct: 277 NMGPVTDRN-----GNPVQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEV 331

Query: 559 DPEPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDSDSGSDSDSESDSDSDSDSDSDS 618
P P +P P+ +P P+P+PDPD + D++ +D G+ DS + D +
Sbjct: 332 SPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKE 391

Query: 619 DSDSE 623
+ E
Sbjct: 392 RKEGE 396



Score = 35.1 bits (80), Expect = 0.001
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 1/63 (1%)

Query: 538 DSAVNPK-DPTPGPPVDPEPSPDPEPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDS 596
+ A NP + PG +PEP PD P+ PD + P P+ PD + +
Sbjct: 336 NPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKERKEG 395

Query: 597 DSG 599
+ G
Sbjct: 396 EDG 398


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019670CARBMTKINASE388e-138 Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 388 bits (999), Expect = e-138
Identities = 137/314 (43%), Positives = 198/314 (63%), Gaps = 5/314 (1%)

Query: 1 MKEKIVIALGGNAIQT--KEATAEAQQTAIRRAMQNLKPLFDSPARIVISHGNGPQIGSL 58
M +++VIALGGNA+Q ++ + E +R+ + + + +VI+HGNGPQ+GSL
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 59 LIQQAKSNSDT-TPAMPLDTCGAMSQGMIGYWLETEINRILTEMNSDRTVGTIVTRVEVD 117
L+ + PA P+D GAMSQG IGY ++ + L + ++ V TI+T+ VD
Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120

Query: 118 KDDPRFNNPTKPIGPFYTKEEVEELQKEQPDSVFKEDAGRGYRKVVASPLPQSILEHQLI 177
K+DP F NPTKP+GPFY +E + L +E + KED+GRG+R+VV SP P+ +E + I
Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLARE-KGWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179

Query: 178 RTLADGKNIVIACGGGGIPVIKKENTYEGVEAVIDKDFASEKLATLIEADTLMILTNVEN 237
+ L + IVIA GGGG+PVI ++ +GVEAVIDKD A EKLA + AD MILT+V
Sbjct: 180 KKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239

Query: 238 VFINFNEPNQQQIDDIDVATLKKYAAQGKFAEGSMLPKIEAAIRFVESGENKKVIITNLE 297
+ + +Q + ++ V L+KY +G F GSM PK+ AAIRF+E G ++ II +LE
Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWG-GERAIIAHLE 298

Query: 298 QAYEALIGNKGTHI 311
+A EAL G GT +
Sbjct: 299 KAVEALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019700ARGDEIMINASE5060.0 Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 506 bits (1305), Expect = 0.0
Identities = 193/409 (47%), Positives = 275/409 (67%), Gaps = 8/409 (1%)

Query: 5 PIKVNSEIGALKTVLLKRPGKELENLVPDYLDGLLFDDIPYLEVAQKEHDHFAQVLREEG 64
PI + SEIG LK VLL RPG+ELENL P + LFDDIPYLEVA++EH+ FA +L+
Sbjct: 7 PINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNL 66

Query: 65 VEVLYLEKLAAESIENPQ-VRSEFIDDVLAESKKTILGHEEEIKTLFATLSNQELVDKIM 123
VE+ Y+E L +E + + + ++FI + E++ +K F++L+ ++ K++
Sbjct: 67 VEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKMI 126

Query: 124 SGVRKEEINPKCTHLVEYMDDKYPFYLDPMPNLYFTRDPQASIGHGITINRMFWRARRRE 183
SGV EE+ + L + ++ F +DPMPN+ FTRDP ASIG+G+TIN+MF + R+RE
Sbjct: 127 SGVVTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVRQRE 186

Query: 184 SIFIQYIVKHHPRFKDANIPIWLDRDCPFNIEGGDELVLSKDVLAIGVSERTSAQAIEKL 243
+IF +YI K+HP +K N+PIWL+R ++EGGDELVL+K +L IG+SERT A+++EKL
Sbjct: 187 TIFAEYIFKYHPVYK-ENVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSVEKL 245

Query: 244 ARRIFENPQATFKKVVAIEIPTSRTFMHLDTVFTMIDYDKFTMHSAILKAEGNMNIFIIE 303
A +F+N + +F ++A +IP +R++MHLDTVFT IDY FT ++ + +I+++
Sbjct: 246 AISLFKN-KTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSD---DMYFSIYVLT 301

Query: 304 YDDVNKDIAIK-QSSHLKDTLEDVLGIDDIQFIPTGNGDVIDGAREQWNDGSNTLCIRPG 362
Y+ + I IK + + +KD L LG I I GD+I GAREQWNDG+N L I PG
Sbjct: 302 YNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAPG 360

Query: 363 VVVTYDRNYVSNDLLRQKGIKVIEISGSELVRGRGGPRCMSQPLFREDI 411
++ Y RN+V+N L + GIKV I SEL RGRGGPRCMS PL REDI
Sbjct: 361 EIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019710ARGREPRESSOR837e-23 Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 82.6 bits (204), Expect = 7e-23
Identities = 38/147 (25%), Positives = 78/147 (53%), Gaps = 2/147 (1%)

Query: 18 MKKSKRLEIVSTIVKKHKIYKKEQIISYIEEYFGVRYSATTIAKDLKELNIYRVPIDCET 77
M K +R + I+ ++I +++++ +++ G + T+++D+KEL++ +VP + +
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKD-GYNVTQATVSRDIKELHLVKVPTNNGS 59

Query: 78 WIYKAINNQTEQEMREKFRHYCEHEVLSSIINGSYIIVKTSPGFAQGINYFIDQLNIEEI 137
+ Y ++ K + + I++KT PG AQ I +D L+ EEI
Sbjct: 60 YKY-SLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEI 118

Query: 138 LGTVSGNDTTLILTASNDMAEYVYAKL 164
+GT+ G+DT LI+ ++D + V K+
Sbjct: 119 MGTICGDDTILIICRTHDDTKVVQKKI 145


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019720THERMOLYSIN440e-152 Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 440 bits (1133), Expect = e-152
Identities = 173/480 (36%), Positives = 249/480 (51%), Gaps = 42/480 (8%)

Query: 64 NIYQDYAVTDVKTDKKGFTHYTLQPSVDGVHAPDKEVKVHADKSGKVVLING----DTDA 119
+ ++ K D+ G T + ++ + H + G++ ++G + D
Sbjct: 71 QARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVN-DGELSSLSGTLIPNLDK 129

Query: 120 KKVKPTNKVTLSKDDAADKAFKAVKIDKHKAKNLKDKVIKENKVEIDGDSNKYVYNVELI 179
+ +K +++ + + K A ++ K + ++ + D ++ + Y V +
Sbjct: 130 RTLKTEAAISIQQAEMIAKQDVADRVTKERPAA-EEGKPTRLVIYPDEETPRLAYEVNVR 188

Query: 180 TVTPEISHWKVKIDAQTGEILEKMNLVKEA-----------AETGKGKGVLGDTKDINI- 227
+TP +W IDA G++L K N + EA + G G+GVLGD K IN
Sbjct: 189 FLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYINTT 248

Query: 228 -NSIDGGFSLEDLTHQGKLSAFSFNDQTG-QATLITNEDENFVKDEQRAGVDANYYAKQT 285
+S G + L+D T + + ++T +L + D F A VDA+YYA
Sbjct: 249 YSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVV 308

Query: 286 YDYYKDTFGRESYDNQGSPIVSLTHVNNYGGQDNRNNAAWIGDKMIYGDGDGRTFTSLSG 345
YDYYK+ GR SYD + I S H YG NNA W G +M+YGDGDG+TF SG
Sbjct: 309 YDYYKNVHGRLSYDGSNAAIRSTVH---YG--RGYNNAFWNGSQMVYGDGDGQTFLPFSG 363

Query: 346 ANDVVAHELTHGVTQETANLEYKDQSGALNESFSDVFGYFVD-----DEDFLMGEDVYTP 400
DVV HELTH VT TA L Y+++SGA+NE+ SD+FG V+ + D+ +GED+YTP
Sbjct: 364 GIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPDWEIGEDIYTP 423

Query: 401 GKEGDALRSMSNPEQFGQPAHMKDYVFTEKDNGGVHTNSGIPNKAAYNVIQ--------- 451
G GDALRSMS+P ++G P H +DNGGVHTNSGI NKAAY + Q
Sbjct: 424 GVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVHYGVSV 483

Query: 452 -AIGKSKSEQIYYRALTEYLTSNSNFKDCKDALYQAAKDLYDEQTAE--QVYEAWNEVGV 508
IG+ K +I+YRAL YLT SNF + A QAA DLY + E V +A+N VGV
Sbjct: 484 TGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543


41SAI8T7_1019780SAI8T7_1019930N        Y        YPathogenicity Island (unbiased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
SAI8T7_1019780-190.060528Similar to phage infection protein
SAI8T7_1019790111-0.143239Similar to autolysin
SAI8T7_10198001160.009697Isochorismatase transposase
SAI8T7_10198100140.319629Putative Cell-wall-anchored protein SasF
SAI8T7_10198200150.707440Putative uncharacterized protein
SAI8T7_10198300130.032032Similar to lipopolysaccharide biosynthesis
SAI8T7_10198407132.228436Protein translocase subunit SecA 2
SAI8T7_10198509142.631603Putative uncharacterized protein
SAI8T7_10198609142.789941Putative uncharacterized protein
SAI8T7_101987011152.787090Putative uncharacterized protein
SAI8T7_101988011162.981833Accessory Sec system protein translocase subunit
SAI8T7_101989012173.725961Serine-rich adhesin for platelets
SAI8T7_1019900-1171.320632Putative uncharacterized protein
SAI8T7_1019910-215-0.438092Putative uncharacterized protein
SAI8T7_1019920-116-1.627754Similar to methionine sulfoxide reductase
SAI8T7_1019930-117-1.200076Putative acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019780ABC2TRNSPORT396e-05 ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 39.1 bits (91), Expect = 6e-05
Identities = 37/172 (21%), Positives = 67/172 (38%), Gaps = 28/172 (16%)

Query: 817 NKHKSLESVLTTRQVFLGKAGFFIMLGML-----QALIVSVGDLLILKAGVESP---VLF 868
++ E++L T Q+ LG I+LG + +A + G ++ A + +L+
Sbjct: 95 EGQRTWEAMLYT-QLRLGD----IVLGEMAWAATKAALAGAGIGVVAAALGYTQWLSLLY 149

Query: 869 VLITI-FCSIIFNSIVYTCVSLLGNPGKAIAIVLLVLQIAG----GGGTFPIQTTPQFFQ 923
L I + F S+ +L P I L I G FP+ P FQ
Sbjct: 150 ALPVIALTGLAFASLGMVVTAL--APSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQ 207

Query: 924 NISPYLPFTYAIDSLRETV-----GGIVPEILITKLIILTLFGIGFFVVGLI 970
+ +LP +++ID +R + + + + I+ F F L+
Sbjct: 208 TAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPF---FLSTALL 256


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019790FLGFLGJ645e-13 Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 63.6 bits (154), Expect = 5e-13
Identities = 50/176 (28%), Positives = 84/176 (47%), Gaps = 19/176 (10%)

Query: 304 SNNDDSGQFNVVDSKDTRQFVKSIAKDAHRIGQDNDIYASVMIAQAILESDSGRSALAKS 363
N DDS D++ F+ ++ A Q + + +++AQA LES G+ + +
Sbjct: 139 RNYDDSLPG------DSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRE 192

Query: 364 ---PNHNLFGIK--GAFEGNSVPFNTLEADGNKLYSINAGFRKYPSTKESLKDYSDLIKN 418
P++NLFG+K G ++G T E + + + A FR Y S E+L DY L+
Sbjct: 193 NGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTR 252

Query: 419 GIDGNRTIYKPTWKSEADSYKDATSHLSKTYATDPNYAKKLNSIIKHYQLTQFDDE 474
+ + A + +DA YATDP+YA+KL ++I+ Q+ D+
Sbjct: 253 NPRYAAVTTAASAEQGAQALQDA------GYATDPHYARKLTNMIQ--QMKSISDK 300


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019800ISCHRISMTASE773e-19 Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 77.0 bits (189), Expect = 3e-19
Identities = 41/183 (22%), Positives = 77/183 (42%), Gaps = 10/183 (5%)

Query: 5 RKTALLVLDMQE----GIASSVPRIKNIIKANQRAIEAARQHRIPVIFIRLVLDKHFNDV 60
+ LL+ DMQ + + + ++ Q IPV++ ++ +D
Sbjct: 29 NRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDDR 88

Query: 61 SSSNKVFSTIKAQGYAITEADASTRILEDLAPLEDEPIISKRRFSAFTGSYLEVYLRAND 120
+ + G + +I+ +LAP +D+ +++K R+SAF + L +R
Sbjct: 89 ALLTDFW------GPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEG 142

Query: 121 INHLVLTGVSTSGAVLSTALESVDKDYYITVLEDAVGDRSDDKHDFIIEQILSRSCDIES 180
+ L++TG+ L TA E+ +D + DAV D S +KH +E R
Sbjct: 143 RDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVM 202

Query: 181 VES 183
+S
Sbjct: 203 TDS 205


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019840SECA6480.0 SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 648 bits (1672), Expect = 0.0
Identities = 282/823 (34%), Positives = 441/823 (53%), Gaps = 68/823 (8%)

Query: 2 KRINTWSDEVKSYSDDALKQKTIEFKERLASGVDTLDTLLPEAYAVAREASWRVLGMYPK 61
IN E++ SD+ LK KT EF+ RL G + L+ L+PEA+AV REAS RV GM
Sbjct: 26 NIINAMEPEMEKLSDEELKGKTAEFRARLEKG-EVLENLIPEAFAVVREASKRVFGMRHF 84

Query: 62 EVQLIGAIVLHEGNIAEMQTGEGKTLTATMPLYLNALSGKGTYLITTNDYLAKRDFEEMQ 121
+VQL+G +VL+E IAEM+TGEGKTLTAT+P YLNAL+GKG +++T NDYLA+RD E +
Sbjct: 85 DVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNR 144

Query: 122 PLYEWLGLTASLGFVDIVDYEYQKGEKRNIYEHDIIYTTNGRLGFDYLIDNLADSAEGKF 181
PL+E+LGLT V I KR Y DI Y TN GFDYL DN+A S E +
Sbjct: 145 PLFEFLGLT-----VGINLPGMPAPAKREAYAADITYGTNNEYGFDYLRDNMAFSPEERV 199

Query: 182 LPQLNYGIIDEVDSIILDAAQTPLVISGAPRLQSNLFHIVKEFVDTLIE----------- 230
+L+Y ++DEVDSI++D A+TPL+ISG S ++ V + + LI
Sbjct: 200 QRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLIRQEKEDSETFQG 259

Query: 231 DVHFKMKKTKKEIWLLNQGIEAAQSYFNV-------EDLYSEQAMVLVRNINLALRAQYL 283
+ HF + + +++ L +G+ + E LYS ++L+ ++ ALRA L
Sbjct: 260 EGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAHAL 319

Query: 284 FESNVDYFVYNGDIVLIDRITGRMLPGTKLQAGLHQAIEAKEGMEVSTDKSVMATITFQN 343
F +VDY V +G+++++D TGR + G + GLHQA+EAKEG+++ + +A+ITFQN
Sbjct: 320 FTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQN 379

Query: 344 LFKLFESFSGMTATGKLGESEFFDLYSKIVVQAPTDKAIQRIDEPDKVFRSVDEKNIAMI 403
F+L+E +GMT T EF +Y V PT++ + R D PD V+ + EK A+I
Sbjct: 380 YFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAII 439

Query: 404 HDIVELHETGRPVLLITRTAEAAEYFSKVLFQMDIPNNLLIAQNVAKEAQMIAEAGQIGS 463
DI E G+PVL+ T + E +E S L + I +N+L A+ A EA ++A+AG +
Sbjct: 440 EDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA 499

Query: 464 MTVATSMAGRGTDIKLG-----------------------------EGVEALGGLAVIIH 494
+T+AT+MAGRGTDI LG + V GGL +I
Sbjct: 500 VTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGT 559

Query: 495 EHMENSRVDRQLRGRSGRQGDPGSSCIYISLDDYLVKRWSDSNLAENNQLYSLDAQRLSQ 554
E E+ R+D QLRGRSGRQGD GSS Y+S++D L++ ++ ++ + + +
Sbjct: 560 ERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIE 619

Query: 555 SNLFNRKVKQIVVKAQRISEEQGVKAREMANEFEKSISIQRDLVYEERNRVLEIDDAENR 614
+ + AQR E + R+ E++ + QR +Y +RN +L++ D
Sbjct: 620 HPWVTKAIA----NAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQRNELLDVSDVSET 675

Query: 615 DFKALAKDVFEMFVNEE---KVLTKSRVVEYIYQNLSFQFNKDVACVNFKDKQAVVT--- 668
++ +DVF+ ++ + L + + + + L F+ D+ + DK+ +
Sbjct: 676 -INSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAEWLDKEPELHEET 734

Query: 669 ---FLLEQFEKQLALNRKNMQSAYYYNIFVQKVFLKAIDSCWLEQVDYLQQLKASVNQRQ 725
+L Q + ++ + A F + V L+ +DS W E + + L+ ++ R
Sbjct: 735 LRERILAQSIEVYQ-RKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRG 793

Query: 726 NGQRNAIFEYHRVALDSFEVMTRNIKKRMVKNICQSMITFDKE 768
Q++ EY R + F M ++K ++ + + + +E
Sbjct: 794 YAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019880SECYTRNLCASE1282e-35 Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 128 bits (324), Expect = 2e-35
Identities = 93/440 (21%), Positives = 181/440 (41%), Gaps = 52/440 (11%)

Query: 4 LLQQYEYKIIYKRMLYTCFILFIYILGTNISI--VSYNDMQ------VKHESFFKIAISN 55
+ + + K++L+T I+ +Y +GT+I I V Y ++Q ++ F +
Sbjct: 5 FARAFRTPDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGLVNMF 64

Query: 56 MGGDVNTLNIFTLGLVPWLTSMIILMLISYRNMDKYMKQTSLEKHYKE------------ 103
GG + + IF LG++P++T+ IIL L++ + LE KE
Sbjct: 65 SGGALLQITIFALGIMPYITASIILQLLT-------VVIPRLEALKKEGQAGTAKITQYT 117

Query: 104 RILTLILSVIQSYFVIHEYVSKERVHQDN-------------IYLTILILVTGTILLVWL 150
R LT+ L+++Q ++ S + + ++ + GT +++WL
Sbjct: 118 RYLTVALAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWL 177

Query: 151 ADKNSRYGIAGPMPIVMVSIIKSMMHQKMEYI------DASHIVIALLIILVIITLFILL 204
+ + GI M I+M I + + I I +I + +I + +++
Sbjct: 178 GELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVV 237

Query: 205 FIELVEVRIPYI----DLMNVSATNMKSYLSWKVNPAGSITLMMSISAFVFLKSGIHFIL 260
F+E + RIP + S +Y+ KVN AG I ++ + S F
Sbjct: 238 FVEQAQRRIPVQYAKRMIGRRSYGGTSTYIPLKVNQAGVIPVIFASSLLYIPALVAQFAG 297

Query: 261 SMFNKSISDDMPMLTFDSPVGISVYLVIQMLLGYFLSRFLINTKQKSKDFLKSGNYFSGV 320
+ + D P+ I Y ++ + +F N ++ + + K G + G+
Sbjct: 298 GNSGWKSWVEQNLTKGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGGFIPGI 357

Query: 321 KPGKDTERYLNYQARRVCWFGSALVTVIIGIPLYFTLFVPHLSTEIYFS-VQLIVLVYIS 379
+ G+ T YL+Y R+ W GS + +I +P L S F ++++V +
Sbjct: 358 RAGRPTAEYLSYVLNRITWPGSLYLGLIALVP-TMALVGFGASQNFPFGGTSILIIVGVG 416

Query: 380 INIAETIRTYLYFDKYKPFL 399
+ + I + L Y+ FL
Sbjct: 417 LETVKQIESQLQQRNYEGFL 436


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019890ICENUCLEATIN553e-09 Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 55.1 bits (132), Expect = 3e-09
Identities = 237/1070 (22%), Positives = 425/1070 (39%), Gaps = 12/1070 (1%)

Query: 1098 SDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSD 1157
+ + ++ + S S + + T +T S ST ++ +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1158 STSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSE 1217
ST T +S I+ ST + + + S + ++ST + S ES
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1218 STSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSE 1277
+ ST + S + S T+ S+ + ST + S+ T+ ST T +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1278 SVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSL 1337
S T+ ST T+ ++S+ ++ S T+ +S + ST + S + ST
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1338 SGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQ 1397
+G S + S+ + DS+ + S T+ S T+ S T+ + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1398 SGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQ 1457
S + S + ST T++ S T+ Y S T+ +S+ + S T+ S+
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1458 SGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSA 1517
+G ST + GS+ + S ST+ ES+ + S + S + ST + +
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1518 SASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSE 1577
S + STS + + S+ + S ++ S+ + ST + ST
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1578 SGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSD 1637
+GS S + ST T+ S T+ S + S T+ STS + + S +
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1638 SQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVM 1697
S + S T+ ST + SD T+ S ST+G+ S I+ ST T+ S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 1698 SASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESS 1757
+ S + S S S S + +DS ++G S + S T+ S +
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 1758 SLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTST 1817
S+ + S S + +DSS ++ S +++ S + S T+ S T+G STST
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 1818 SLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQ 1877
+ + S ++ S + S T+ S + S + STS + DS ++
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYG 885

Query: 1878 SMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSD 1937
S + S + S+ T+ D + S S + S GS T++ ST
Sbjct: 886 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLM 945

Query: 1938 SMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSIST 1997
+ GS + S + GSTS++ S+ + S + QST T+ GS T+ +
Sbjct: 946 AGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHS 1005

Query: 1998 SMSMSASTSSSQSTSVSTSLSTSDSISDST----------SISISGSQSTVESESTSDST 2047
S + S++ + + S+ ++ S S S ISG +S + + S
Sbjct: 1006 STLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLI 1065

Query: 2048 SISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS--ESLSTSGSGSTSVSDSTSMSESDST 2105
S S + S+ ++ S +G ST I+ S+ +G GS+ + S S +
Sbjct: 1066 SGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGAD 1125

Query: 2106 SVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSI 2155
SV M+ ++ + +DS + S L+ ++S T+ S +G+ I
Sbjct: 1126 SVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCI 1175



Score = 53.2 bits (127), Expect = 1e-08
Identities = 176/773 (22%), Positives = 305/773 (39%), Gaps = 2/773 (0%)

Query: 1408 SASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASL 1467
+ + +E + S + + T D+T S ST + + +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1468 SGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSST 1527
S SQ I+ S T+ +ST ++ ST +G+ ST + S + +SS
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1528 SLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSE 1587
+ ST M+ S+ + S + S+ + ST + S T+ GST +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1588 SDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTST 1647
SD T+ S + + S+ +G ST T+ +S T+ ST SD + ST T
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1648 STSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSM 1707
+ S + S + DS+ + GS + SD T+ S + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1708 SESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSD 1767
S ES + S + GS + GS + + S+ +G S
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1768 SVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSE 1827
+ S ++ S S S + S + GST T+ GS T+ S + +E
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1828 STSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTST 1887
S ++ S S + + S + GS + S+ T+ S + S +G ST T
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1888 SVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGSVSVST 1947
+ SDS + S + S+ + ST + S + S ST+ + S ++
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1948 STSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMSASTSS 2007
ST + S T+ S+ T+ S + S ST+ + S ++ ST + S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 2008 SQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTST 2067
+ S T+ SD S S S +G+ S++ + S T+ S + S T+
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 2068 SDSTS--GSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDSESV 2125
S T+ GSTST+ ++S +G GST + S+ + S +Q++SD T+ S S
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 2126 STSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTSTSESNSMHPS 2178
+ + S+ ++ ST T+ S +G S + S + S S + + S
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878



Score = 53.2 bits (127), Expect = 1e-08
Identities = 233/1050 (22%), Positives = 411/1050 (39%), Gaps = 2/1050 (0%)

Query: 759 TSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTSVSLSDSVSASKSLSTSES 818
TS + A ++ + S + D+ S S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 819 NSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNSNSTEKSESLSTSTSDSLRTSTS 878
+++ ST QS + GS + S I+ ST + + ST + T T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 879 LSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTSNAISTSTSLSESASTSDSISIS 938
+S M+ GS S +G ST + DS+ A ST + S+ + S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 939 NSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVS 998
A S T+ S T+ + S+ + ST + +ST T+G GS A S
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY--GSTQTAQKGSDL 336

Query: 999 TSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQSGSTSESLSDSQSTSDSDS 1058
T+ S T+ S I+ GS + S + ST + S+ + ST + +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1059 KSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSAS 1118
S ++ S T+ ST + S + GS + S + S + S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1119 TASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSDSTSLSTSESDSISESTSTSDS 1178
TA +S + ST + S + S T+ S + S + ST T+
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1179 ISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSESTSESVSSSTSESTSLS 1238
S + +ES I+ S ST+ + S + + S + S T+ S+ T+ S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1239 DSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTSLSMSTSTSLSNSTSLS 1298
+ S T+ S S+ +G S T++ S T+ + S + S+ T+ S ST+ +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1299 TSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDS 1358
S + S + S+ T+ ST + S T+ GSTS + +DS+ + S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1359 TSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTS 1418
T+ S+ + GST T+ S S S S + + + S ++ +S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1419 ESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQS 1478
S + + S ST+ + S + S T+ S T+ S ++ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 1479 ISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQ 1538
+ S ST+ + S+ ++ ST +G S T+ S ++ +S T+ STS +
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 1539 SSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDS 1598
S + S + S T+ ST + S T+ GSTS + ES + S
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 1599 QSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLS 1658
++ +ST +G S+ T+ S T+ STSM S + ST T+ S T+
Sbjct: 937 TASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGY 996

Query: 1659 DSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVS 1718
S + ST + GS + + + S + S+ S + S ++ SV
Sbjct: 997 GSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVL 1056

Query: 1719 ESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLS 1778
+ S S S+ + GS +++ + ES+ ++G++SM + S ++
Sbjct: 1057 TAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGY 1116

Query: 1779 VSTSLRSSESVSESDSLSDSKSTSGSTSTS 1808
ST + ++SV + + + ST T+
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTA 1146



Score = 52.8 bits (126), Expect = 2e-08
Identities = 229/1007 (22%), Positives = 395/1007 (39%), Gaps = 10/1007 (0%)

Query: 1163 TSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSES 1222
TS I + + +E + S ++ ES S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 1223 TSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTS 1282
+ ST T S + GST T+ +ST + ST T+ ++ST S T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1283 LSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTS 1342
S+ + ST SD T+ S + S+ + S + S+ +G S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1343 ESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDS 1402
+ S + ST + + S +G ST T+ S T+ S + S + +
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1403 NSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTS 1462
S + S+ + S T+ S T+ ST T+ SD T+ GST
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA------GYGSTG 392

Query: 1463 TSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASES 1522
T+ + S + S + S T+ ST + S +G ST T+ +S+ +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 1523 DSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTS 1582
S+ T+ DS+ + S +Q S + STST+ S++ ++ ST +G S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSL--IAGYGSTQTAGYGS 510

Query: 1583 ESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMS 1642
T+ ST T+ ++S + S S + + S+ + ST ++ S+ T+ S +
Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570

Query: 1643 LSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASIS 1702
S T+ ST + S S + S T+ S + ST T+ S + + S
Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630

Query: 1703 DSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGS 1762
S + ++S + S + +S +G S + S T+ S S + + S +
Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690

Query: 1763 QSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGS 1822
S + +S + S ++++ S+ S S ST+G+ S+ +G ST T+ S
Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750

Query: 1823 ESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGS 1882
+ S + S T+ S S +G+ S + ST + S + S +
Sbjct: 751 SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810

Query: 1883 ESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGS 1942
+ S + S+ST+ DS I+ S + +S +G ST T+ SD +G
Sbjct: 811 QERSDLTTGYGSTSTAG--ADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGY 868

Query: 1943 VSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMS 2002
S ST+ S I+G S + S T+ S +Q S +G STS + S
Sbjct: 869 GSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSL 928

Query: 2003 ASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDS 2062
+ S T+ S + S T+ S + S S + S + ST +
Sbjct: 929 IAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGY 988

Query: 2063 TSTSTSDSTSGSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDS 2122
ST T+ S T+ S + GS +T+ +DS+ ++ S+ S + + S
Sbjct: 989 QSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTL 1048

Query: 2123 ESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTST 2169
S S T+ S S S T+ GS I+ S+ ++G ST
Sbjct: 1049 ISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPEST 1095



Score = 52.1 bits (124), Expect = 3e-08
Identities = 193/856 (22%), Positives = 350/856 (40%), Gaps = 14/856 (1%)

Query: 1329 DSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTS 1388
S + ++ + + + S + + + + T +T S ST +
Sbjct: 97 TKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPT 156

Query: 1389 LSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDS 1448
++ + S + SQ + ST T+ S + Y S T+ ++ST + S
Sbjct: 157 QTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQ 216

Query: 1449 TSISKSTSQSGSTSTSASLSGSESESDSQSISTSASEST--SESASTSLSDSTSTSNSGS 1506
T+ +S+ +G ST + GS+ + S T+ +S+ + ST + S+ +G
Sbjct: 217 TAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 276

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
ST T+ S + S+ T+ +DS+ + S + S + ST T+ + S +
Sbjct: 277 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 336

Query: 1567 ASLSTSVSTSESGST------SESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
+ S T+ S+ S T+ DS+ T+ S T++ S + ST T+ +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1621 RSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S+ + S +T+ +S + ST T+ S + S T+ S+ +G S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1681 ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGS 1740
+ DS+ T+ S + SD + S + + S + S +G S +G
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1741 LSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKS 1800
S T+ +S+ ++ S S + + S ++ S+ + S+ ++ S + S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1801 TSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTS 1860
T+G ST T+GS S+ + GS + S + S T+ S +G S S + +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1861 LST------SDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDS 1914
S+ S + S+ ++ S + S + STS + DS I+ S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1915 MSTSDSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLS 1974
+ +S +G ST T+ SD SG S ST+ + S I+G S +S S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1975 DSMSQSQSTSTSASGSLSTSISTSMSMSASTSSSQSTSVSTSLSTSDSISDSTSISISGS 2034
S ++ S +G STS + + S + S T+ S+ T+ S T+ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 2035 QSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSISESLSTSGSGSTSVS 2094
+ S ST+ + S + ST + S T+ S T+ S+ + GS ST+
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 2095 DSTSMSESDSTSVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQS 2154
DS+ ++ ST + + S + S T+ S ST+ ES + GS
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 2155 ISDSTSTSMSGSTSTS 2170
+ ST M+G S+
Sbjct: 937 TASFKSTLMAGYGSSQ 952



Score = 51.3 bits (122), Expect = 5e-08
Identities = 200/886 (22%), Positives = 350/886 (39%), Gaps = 6/886 (0%)

Query: 797 KSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNS 856
++ + + SA + + ++ V+ + + S V+S + D +
Sbjct: 89 RAEVLHVGTKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATI 148

Query: 857 NSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTS 916
S + + + T + S ++ GS + ST I+G ST + +DST
Sbjct: 149 ESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTL 208

Query: 917 NAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLS 976
A ST + S+ + S S T+ S T+ S+ + ST +
Sbjct: 209 VAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 268

Query: 977 DSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMS 1036
DS+ T+G S + S + S + ++ + S + +S + S
Sbjct: 269 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 328

Query: 1037 TSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSAS 1096
T+Q GS + S T+ DS + + GST T+ S+ S T+ S
Sbjct: 329 TAQKGSDLTAGYGSTGTAGDDSSLI----AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384

Query: 1097 QSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVS 1156
+ S T+ +DS+ + ST ++ S + S + S T+ T T+
Sbjct: 385 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 444

Query: 1157 DSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLS 1216
DS+ ++ S + S+ + + ++ S + STS + +S+ S
Sbjct: 445 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQ 504

Query: 1217 ESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKS 1276
+ ST + ST + + SD + GSTST+ +NS+ + ST T+ S T
Sbjct: 505 TAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGY 564

Query: 1277 ESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTS 1336
S T+ S T+ ST + S S + S ++ S+ + S + S
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 1337 LSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMN 1396
+G S S + + SS + ST + S T+G ST T+ SD T+ S S +
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 1397 QSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTS 1456
S + + S + S T+ S T+ S TS STST+ + S + ST
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 1457 QSGSTSTSASLSGSESESDSQSISTS--ASESTSESASTSLSDSTSTSNSGSASTSTSLS 1514
+ S+ + GS + QS+ T+ S ST+ + S+ ++ ST +G S T+
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 1515 NSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVS 1574
S ++ S T+ STS + S + S + S T+ ST + S
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 1575 TSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTS 1634
T+ GSTS + +S + S + S +G ST T+ +S T+ STS
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 1635 TSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S + ST T++ S + S + S+ + GS S++
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMA 970



Score = 49.0 bits (116), Expect = 3e-07
Identities = 206/894 (23%), Positives = 361/894 (40%), Gaps = 6/894 (0%)

Query: 733 TDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTS 792
T G+ T + T + S ++ GSTQ + ST A S T+ GS + +
Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340

Query: 793 ASTSKSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDS 852
ST + S + S + +S+ + ST S ++ GS + + S
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 853 ISNSNSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLS 912
I+ ST+ + ST T+ T T+ S + GS + S+ I+G ST +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 913 DSTSNAISTSTSLSESASTSDSISISNSIANSQSA------STSKSDSQSTSISLSTSDS 966
DS+ A ST ++ S + S S A +S+ ST + ST + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
+ + S+ ++ STS + + S IA S T++ +S+ T+ S + GS +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S+ +G S + S+ + S + QS T+ STS + S
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
+ GS + +S+ + S T+ S TA S S + + S+ + ST +
Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
NS T+ S T+ S+ S STST+ + S I+ ST + S+ T+ S
Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + S ST+ + SS + S + S T+ S T+ S T+
Sbjct: 761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTS 1326
S ST+ S ++ S T+ S T+ S + +S + S ST+ S+
Sbjct: 821 GSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSL 880

Query: 1327 KSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTS 1386
+ ST T+ S + ST +++ SD T+ S S + S+ + S ++
Sbjct: 881 IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASF 940

Query: 1387 TSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLS 1446
S ++ + S+ + STS + +S + T + QS T+ S
Sbjct: 941 KSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQ 1000

Query: 1447 DSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGS 1506
+ S T+ GST+T+ + S + S S S T+ ST +S S +G
Sbjct: 1001 TAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGY 1060

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
S+ S S+ + S+ + S+ + S + + S ++ S+ T+ ST+
Sbjct: 1061 GSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTL 1120

Query: 1567 ASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
S + SV + + ++S T+ S + + S +G S T+ +D
Sbjct: 1121 ISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDC 1174



Score = 47.8 bits (113), Expect = 6e-07
Identities = 241/1091 (22%), Positives = 433/1091 (39%), Gaps = 10/1091 (0%)

Query: 907 TSASLSDSTSNAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDS 966
TSA A + + ++ S ++ + N T D+ S S + +
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
++T S T S ++G S + ST + ST +DS +G S +
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S + S + S + GST T+ S+ S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
T+ S + S T+ +DS+ + ST ++ S + S + S T+
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
T T+ DS+ ++ S + S+ + + ++ S + ST + + S
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 398

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + EST + ST + SD T+ GST T+ +S+ + ST T+
Sbjct: 399 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 458

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMS--TSDSIS 1324
+S+ T S T+ S T+ STS + S + S + S T+ S
Sbjct: 459 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGS 518

Query: 1325 TSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTS------TS 1378
T + + S + GSTS + ++S+ + S T+ S+ + GST T+ T+
Sbjct: 519 TQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTA 578

Query: 1379 TSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSES 1438
S T+ S S + S ++ S + ST T+ S T+ Y S ST+ ++S
Sbjct: 579 GYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 638

Query: 1439 TSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDS 1498
+ + S T+ S +G ST + GS+ + S ST+ ++S+ + S +
Sbjct: 639 SLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTA 698

Query: 1499 TSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTS 1558
S + ST + S S STS + + S+ + S ++ S + S
Sbjct: 699 GYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGS 758

Query: 1559 TSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTS 1618
T + STS +G+ S + ST T+ S T+ S + S T+
Sbjct: 759 TQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTT 818

Query: 1619 DSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMS 1678
STS + + S + S + S T+ ST + SD T+ S ST+G S
Sbjct: 819 GYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878

Query: 1679 VSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDS 1738
I+ ST T+ S + + S + S + S S + +S ++G S +
Sbjct: 879 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTA 938

Query: 1739 GSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDS 1798
S + S + S + S S++ DSS ++ S +++ S + S
Sbjct: 939 SFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGS 998

Query: 1799 KSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGS 1858
T+ +ST T+G ST+T+ + S ++ S S S T+ S +SG S+ +
Sbjct: 999 TQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTA 1058

Query: 1859 TSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTS 1918
S+ S S + S + S+ ++ +S+ + ++ SM I+ S +
Sbjct: 1059 GYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNR--SMLIAGKGSSQTAGY 1116

Query: 1919 DSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMS 1978
S+ ISG++S + ++G+ S T+ S ++G+ S + S T+ +D +
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCIL 1176

Query: 1979 QSQSTSTSASG 1989
+ S +G
Sbjct: 1177 MAGDRSKLTAG 1187


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019910NUCEPIMERASE270.043 Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.4 bits (61), Expect = 0.043
Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 23 IPRPIAFVTTLNQDASVNAAPFSFFNIVNNHP 54
IP T + + AP+ +NI N+ P
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSP 265


ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
SAI8T7_1019930SACTRNSFRASE451e-08 Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 45.3 bits (107), Expect = 1e-08
Identities = 24/101 (23%), Positives = 46/101 (45%), Gaps = 5/101 (4%)

Query: 48 EKNDEVIGYIN--GPVIKERYISDDLFKNVSINNSEGGYISVLGLVVAPNYQGQGIAGRL 105
E +D + Y+ G Y+ ++ + I ++ GY + + VA +Y+ +G+ L
Sbjct: 51 EDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTAL 110

Query: 106 LNYFETLAKNHHRHGVTLTCRE---SLISFYEKYGYRNEGV 143
L+ AK +H G+ L ++ S FY K+ + V
Sbjct: 111 LHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151



 
Contact Sachin Pundhir for Bugs/Comments.
For best view 1024 x 768 resolution & IE 6.0 or above recommended.