PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: example.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
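This page does not define how the bias scores are computed. As an illustration only, the sketch below shows one plausible reading of a %GC deviation: the GC percentage of a gene minus the genome-wide GC percentage, compared against the "Threshold %GC bias" value. The function names, the example sequence, the genome-wide value of 33.0, and the use of `>=` for the comparison are all assumptions, not the tool's documented method.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def gc_deviation(gene_seq: str, genome_gc: float) -> float:
    """Gene %GC minus genome-wide %GC, in percentage points."""
    return gc_percent(gene_seq) - genome_gc


# Hypothetical inputs for illustration only.
GC_THRESHOLD = 3.0                      # "Threshold %GC bias" from the input parameters
gene = "ATGAAATTTGGGCCCAAATTTTAA"       # made-up 24-base sequence
dev = gc_deviation(gene, genome_gc=33.0)  # genome-wide value is an assumption

# Assumed rule: a gene counts as biased when |deviation| reaches the threshold.
is_biased = abs(dev) >= GC_THRESHOLD
print(f"deviation = {dev:.2f}, biased = {is_biased}")
```

A signed deviation is kept so that both GC-rich and GC-poor genes can be distinguished, which matches the mix of positive and negative %GCBias values in the tables.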
 
B) Compare a potential GI or PAI in related non-pathogenic sp. (phylogenetic tree)
Potential GI or PAI start    end  
Select Organism     
 
C) Potential GIs and PAIs in BA000017
S.No  Start    End      Bias  Virulence  Insertion elements  Prediction
1     SAV0012  SAV0084  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV0012        2       13   1.890215  putative homoserine-o-acetyltransferase
SAV0013        3       15   1.993232  conserved hypothetical protein
SAV0014        5       15   2.247792  conserved hypothetical protein
SAV0015        7       17   2.348525  50S ribosomal protein L9
SAV0016        7       18   2.269088  replicative DNA helicase
SAV0017        7       19   1.782587  adenylosuccinate synthase
SAV0018        4       18   2.462301  response regulator
SAV0019        2       18   2.636001  two-component sensor histidine kinase
SAV0020       -1       14   1.518866  conserved hypothetical protein
SAV0021       -3       13   1.025075  conserved hypothetical protein
SAV0022       -3       14   1.256413  conserved hypothetical protein
SAV0023       -2       15   1.005179  probable 5'-nucleotidase precursor
SAV0024        5       18  -1.838365  conserved hypothetical protein orfX
SAV0025        5       19  -2.972988  hypothetical protein
SAV0026        2       18  -0.884722  hypothetical protein
SAV0027        3       20   0.328512  transposase for IS-like element
SAV0028        5       22   0.468472  truncated replication protein for plasmid
SAV0029        4       23   1.462415  truncated replication protein for pUB110
SAV0030        5       26   4.175656  hypothetical protein
SAV0031        3       24   3.675701  plasmid recombination enzyme
SAV0032        3       25   4.509846  hypothetical protein
SAV0033        2       21   3.051039  hypothetical protein
SAV0034        2       21   2.046834  bleomycin resistance protein
SAV0035        1       15  -0.361521  kanamycin nucleotidyltransferase
SAV0036       -2       11  -2.877188  transposase for IS-like element
SAV0038       -1       13  -3.655468  probable HMG-CoA synthase
SAV0039       -1       11  -3.842673  glycerophosphoryl diester phosphodiesterase
SAV0040       -1       10  -4.247316  conserved hypothetical protein
SAV0041       -1       11  -3.695293  penicillin binding protein 2 prime
SAV0042       -1       10  -3.510454  methicillin resistance protein
SAV0043        1       12  -3.244833  methicillin resistance regulatory protein
SAV0044        3       13  -2.473495  xylose repressor homolog
SAV0045        3       14  -1.323326  conserved hypothetical protein
SAV0046        4       16  -1.359852  conserved hypothetical protein
SAV0047        3       20  -1.537984  conserved hypothetical protein
SAV0048        7       26  -0.577757  conserved hypothetical protein
SAV0049        7       26  -0.299902  conserved hypothetical protein
SAV0050       10       25  -0.003115  hypothetical protein
SAV0051       11       25  -0.399289  hypothetical protein
SAV0052       10       24  -0.163676  rRNA methylase
SAV0053        8       24   0.221378  O-nucleotidylltransferase
SAV0054        7       23  -0.351711  transposase C
SAV0055        7       23   0.049496  transposase B
SAV0056        2       22   0.494721  transposase A
SAV0057        3       26   1.188143  truncated hypothetical protein
SAV0058        0       21   1.136246  hypothetical protein
SAV0059        0       20   1.032582  conserved hypothetical protein
SAV0060        0       19  -0.379894  hypothetical protein
SAV0061       -1       18  -0.487091  cassette chromosome recombinase B
SAV0062        0       16  -1.294388  cassette chromosome recombinase A
SAV0063        4       17  -2.224021  hypothetical protein
SAV0064        3       19  -2.928920  hypothetical protein
SAV0065        2       19  -1.900007  hypothetical protein
SAV0066       -2       20   2.157479  transposase
SAV0067       -3       21   2.618221  transposase
SAV0068       -3       23   3.795070  transposase
SAV0069       -3       24   4.034962  conserved hypothetical protein
SAV0070       -2       23   3.641019  transcription regulator protein kdpE
SAV0071       -3       22   3.571093  sensor protein KdpD
SAV0072       -2       21   3.825886  K+transporting ATPase chain A
SAV0073       -1       21   3.047925  potassium-transporting ATPase B chain homologue
SAV0074        2       23   0.284430  potassium-transporting ATPase C chain homolog
SAV0075        3       23  -1.019424  hypothetical protein
SAV0076        4       23  -1.749043  hypothetical protein
SAV0077        6       22  -4.877886  hypothetical protein
SAV0078        5       21  -4.581648  hypothetical protein
SAV0079        6       19  -4.072382  hypothetical protein
SAV0080        6       19  -3.972091  hypothetical protein
SAV0081        5       17  -3.595267  hypothetical protein
SAV0082        5       14  -3.082278  hypothetical protein
SAV0083        4       12  -0.867010  conserved hypothetical protein
SAV0084        3       11  -0.062112  conserved hypothetical protein
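For downstream analysis, the per-gene rows above (locus tag, DNBias, CDNBias, %GCBias, product) can be split with a regular expression. The sketch below is a heuristic, not part of PredictBias: it assumes locus tags of the form `SAV` plus four digits, DNBias and CDNBias of at most two digits, a non-negative CDNBias, and a %GCBias with one integer digit and six decimals, as seen in this report. Whitespace between columns is treated as optional, and when the split is ambiguous the shortest DNBias that fits is chosen. The names `ROW_RE` and `parse_row` are hypothetical.

```python
import re

# Heuristic pattern for one gene row of this report.
# Columns may or may not be separated by whitespace.
ROW_RE = re.compile(
    r"^(?P<locus>SAV\d{4})\s*"      # locus tag, e.g. SAV0012
    r"(?P<dn>-?\d{1,2}?)\s*"        # DNBias: optional sign, shortest match first
    r"(?P<cdn>\d{1,2})\s*"          # CDNBias: assumed non-negative, <= 2 digits
    r"(?P<gc>-?\d\.\d{6})\s*"       # %GCBias: one digit, six decimals
    r"(?P<product>.*)$"             # remainder is the product description
)


def parse_row(line: str) -> dict:
    """Split one gene row into typed fields; raise if it does not fit."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    return {
        "locus": m["locus"],
        "dn_bias": int(m["dn"]),
        "cdn_bias": int(m["cdn"]),
        "gc_bias": float(m["gc"]),
        "product": m["product"],
    }


rows = [
    parse_row("SAV0012 2 13 1.890215 putative homoserine-o-acetyltransferase"),
    parse_row("SAV00501025-0.003115hypothetical protein"),
]
for row in rows:
    print(row)
```

Because the split between DNBias and CDNBias is inferred rather than delimited in run-together rows, parsed values are worth spot-checking against neighbouring genes before they are used in any statistics.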
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SAV0018   HTHFIS  94     2e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.1 bits (234), Expect = 2e-24
Identities = 33/129 (25%), Positives = 66/129 (51%), Gaps = 2/129 (1%)

Query: 1 MQMARKVVVVDDEKPIADILEFNLKKEGYDVYCAYDGNDAVDLIYEEEPDIVLLDIMLPG 60
M A ++V DD+ I +L L + GYDV + I + D+V+ D+++P
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 RDGMEVCREVRKKYE-MPIIMLTAKDSEIDKVLGLELGADDYVTKPFSTRELIARVKANL 119
+ ++ ++K +P+++++A+++ + + E GA DY+ KPF ELI + L
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 120 RRHYSQPAQ 128
+P++
Sbjct: 120 AEPKRRPSK 128
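The summary lines of each alignment block (`Score = ...`, `Expect = ...`, `Identities = ...`) can be read programmatically and compared with the RPSBlast E-value setting of 0.05 listed under the input parameters. The following is a minimal sketch with hypothetical helper names; it assumes the line layout shown in this report, treats the abbreviated form `e-103` as `1e-103`, and assumes a hit passes when its E-value is at or below the cutoff.

```python
import re

SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_evalue(text: str) -> float:
    """Convert an E-value string; 'e-103' is shorthand for 1e-103."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit(block: str) -> dict:
    """Extract bit score, raw score, E-value and identity from one block."""
    score = SCORE_RE.search(block)
    ident = IDENT_RE.search(block)
    if score is None or ident is None:
        raise ValueError("block lacks Score/Expect or Identities line")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": parse_evalue(score.group(3)),
        "identity": matches / length,
    }


EVALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" from the input parameters

example = """Score = 94.1 bits (234), Expect = 2e-24
Identities = 33/129 (25%), Positives = 66/129 (51%), Gaps = 2/129 (1%)"""

hit = parse_hit(example)
passes = hit["evalue"] <= EVALUE_CUTOFF  # assumed comparison rule
print(hit, passes)
```

Keeping the identity as a fraction rather than the rounded percentage printed in the report avoids compounding rounding when hits are ranked or filtered.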


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0047   PF01206  66     6e-16    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 65.6 bits (160), Expect = 6e-16
Identities = 24/70 (34%), Positives = 39/70 (55%)

Query: 118 KQFDFRGLQCPGPIVNISKEINNISTGEQIEVTVTDPGFNSDIKSWAKQTGNTLVNLTEE 177
+ D GL CP PI+ K + ++ GE + V TDPG D +S++KQTG+ L+ EE
Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65

Query: 178 ANVINAIIQK 187
+ +++
Sbjct: 66 DGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SAV0070   HTHFIS  101    8e-27    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 101 bits (252), Expect = 8e-27
Identities = 34/124 (27%), Positives = 62/124 (50%), Gaps = 1/124 (0%)

Query: 2 KTTLLVVEDDEAILHLIDVALTMNYYKVVTAKTGKEADFRLRTEQPDIILLDLGLPDIDG 61
T+LV +DD AI +++ AL+ Y V + D+++ D+ +PD +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 LSLIQQFRD-FVDTPIIVISARTEEQTIVEVLDRGANDYMTKPFNIDELRARIRVALRMS 120
L+ + + D P++V+SA+ T ++ ++GA DY+ KPF++ EL I AL
Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122

Query: 121 RSTE 124
+
Sbjct: 123 KRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0071   PF06580  31     0.020    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.0 bits (70), Expect = 0.020
Identities = 9/56 (16%), Positives = 23/56 (41%), Gaps = 7/56 (12%)

Query: 753 KLILQVLFNLIDNALKHAESHSE----IKLHVQHETNKIKFEMIDCGKGIPEEERQ 804
+++Q L ++N +KH + I L + + E+ + G + ++
Sbjct: 257 PMLVQTL---VENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE 309


2     SAV0114  SAV0133  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV0114       -1       11   3.091156  lipoprotein
SAV0115        0       11   2.784854  lipoprotein
SAV0116        1       15   3.174745  probable cysteine synthase A protein
SAV0117        2       16   3.473641  probable ornithine cyclodeaminase protein
SAV0118        2       16   3.354600  conserved hypothetical protein
SAV0119        1       17   3.575234  hypothetical protein
SAV0120       -1       15   2.719635  hypothetical protein
SAV0121        0       14   3.316831  similar to siderophore biosynthesis protein
SAV0122        0       16   3.522319  hypothetical protein
SAV0123       -2       14   1.871772  probable diaminopimelate decarboxylase protein
SAV0124       -1       14   1.540040  hypothetical protein
SAV0125       -2       11   0.875680  hypothetical protein
SAV0126       -1       12   0.219160  acetoin#diacetylreductase
SAV0127        1       12  -1.337123  hypothetical protein
SAV0128        1       12  -1.010987  similar to NAD-dependent epimerase/dehydratase
SAV0129        2       12  -0.565201  similar to capsular polysaccharide biosynthesis
SAV0130        2       11  -0.477875  hypothetical protein
SAV0131        3       11  -0.038727  hypothetical protein
SAV0132        3       11   0.244951  hypothetical protein
SAV0133        2       15   1.825627  superoxide dismutase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0115   FERRIBNDNGPP  70     7e-16    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 70.4 bits (172), Expect = 7e-16
Identities = 47/191 (24%), Positives = 78/191 (40%), Gaps = 38/191 (19%)

Query: 53 PKRVVTLYQGATDVAVSLGVKPVGAVES-----WTQKPKFEYIKNDLKDTKI-VGQEPAP 106
P R+V L ++ ++LG+ P G ++ W +P L D+ I VG P
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPP-------LPDSVIDVGLRTEP 87

Query: 107 NLEEISKLKPDLIVASKVRNEKVYDQLSKIAPTVSTDTVFKFKD----------TTKLMG 156
NLE ++++KP +V S + L++IAP F F D + M
Sbjct: 88 NLELLTEMKPSFMVWS-AGYGPSPEMLARIAPGR----GFNFSDGKQPLAMARKSLTEMA 142

Query: 157 KALGKEKEAEDLLKKYDDKVAAFQKDAKAKY--KDAWPLKASVVNF-RADHTRIYA-GGY 212
L + AE L +Y+D F + K ++ + A PL + H ++
Sbjct: 143 DLLNLQSAAETHLAQYED----FIRSMKPRFVKRGARPL--LLTTLIDPRHMLVFGPNSL 196

Query: 213 AGEILNDLGFK 223
EIL++ G
Sbjct: 197 FQEILDEYGIP 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0117   SYCECHAPRONE  31     0.002    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 31.2 bits (70), Expect = 0.002
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 25 VDALTEALTAHAHNDFVQ-PLKPYLRQDPENGH 56
+D E T +HN F Q LKP L D GH
Sbjct: 54 LDNNDEKETLLSHNIFSQDILKPILSWDEVGGH 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0118   PF04183  316    e-103    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 316 bits (812), Expect = e-103
Identities = 119/527 (22%), Positives = 208/527 (39%), Gaps = 46/527 (8%)

Query: 79 RVSKQPLTAAEFWQTIANMNCDLSHEWEVARVEEGLTTAATQLAKQLSELDLASHPFV-- 136
R + +P+ A + + +S +++ T L + L++ +
Sbjct: 66 RCADEPVLAQTLLMQLKQVL-SMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINL 124

Query: 137 -MSEQFASLKDRPFHPLAKEKRGLREADYQVYQAELNQSFPLMVAAVKKTHMIHGDTANI 195
L P K +RG + + Y E +F L AVK+ HMI +
Sbjct: 125 NADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEM 184

Query: 196 DELENLTVPIKEQA----TDMLNDQGLSIDDYVLFPVHPWQYQHILPNVFATEISEKLVV 251
D + LT + Q + + + GL +++ PVHPWQ+Q + F + +E +V
Sbjct: 185 DIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIATDFIADFAEGRMV 243

Query: 252 LLPLKFGD-YLSSSSMRSLIDIGAPYN-HVKVPFAMQSLGALRLTPTRYMKNGEQAEQLL 309
L +FGD +L+ S+R+L + +K+P + + R P RY+ G A + L
Sbjct: 244 SLG-EFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWL 302

Query: 310 RQLIEKDEALAKYVMV-CDETA-------WWSYMGQDNDIFKDQLGHLTVQLRKYPEVLA 361
+Q+ D L + V E A ++ + + +++ LG V R+ P
Sbjct: 303 QQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLG---VIWRENPCRWL 359

Query: 362 KNDTQQLVSMAALAANDRTLYQMICGKDNISKNDVMTLFEDIAQVFLKVTLSFM-QYGAL 420
K D + V MA L D + + S D T + +V + + +YG
Sbjct: 360 KPD-ESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVA 418

Query: 421 PELHGQNILLSFEDGRVQKCVLRD-HDTVRIYKPWLTAHQLSLPKYV--VREDTPNTLIN 477
HGQNI L+ ++G Q+ +L+D +R+ K SLP+ V V +
Sbjct: 419 LIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD-SLPQEVRDVTSRLSADYLI 477

Query: 478 EDLETFFAYFQTLAVSVNLYAIIDAIQDLFGVSEHELMSLLKQILKNEVATISWVTTDQL 537
DL+T V + I + GV E LL +L + + Q+
Sbjct: 478 HDLQTGHF--------VTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMK-----KHPQM 524

Query: 538 AVRHILFDKQTWPFKQILLP---LLY-QRDSGGGSMPSGLTTVPNPM 580
+ R LF +++L L + D G +P+ L + NP+
Sbjct: 525 SERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPL 571


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0119   TCRTETA  80     2e-18    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 79.9 bits (197), Expect = 2e-18
Identities = 71/372 (19%), Positives = 149/372 (40%), Gaps = 24/372 (6%)

Query: 13 ILWLSQFIAIAGLTVLVPLLPIYMASLQNLSVVEIQLWSGIAIAAPAVTTMIASPIWGKL 72
++ + + G+ +++P+LP + L + ++ GI +A A+ +P+ G L
Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDL--VHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 73 GDKISRKWMVLRALLGLAVCLFLMALCTTPLQFVLVRLLQGLFGGVVDASSAFASAEAPA 132
D+ R+ ++L +L G AV +MA + R++ G+ G + A+ +
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDG 126

Query: 133 EDRGKVLGRLQSSVSAGSLVGPLIGGVTASILGFSALLMSIAVITFIVCIFGALKLIETT 192
++R + G + + G + GP++GG+ A + A + + + G L E+
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESH 185

Query: 193 HMPKSQTPNINKGIRRSFQCLLCTQQTCRFIIVGVLANFAMYGMLTALSPLASSVNHTAI 252
+ SF+ + V A A++ ++ + + +++
Sbjct: 186 KGERRPLRREALNPLASFR--------WARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237

Query: 253 DDR-----SVIGFLQSAF-WTASILSAPLWGRFNDKSYVKSVYIFATIACGCSAILQGLA 306
+DR + IG +AF S+ A + G + + + IA G IL A
Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297

Query: 307 TNIEFLMAARILQGLTYSAL--IQSVMFVVVNACHQ-QLKGTFVGTTNSMLVVGQIIGSL 363
T +L + +Q+++ V+ Q QL+G+ T+ + I+G L
Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS----LTSIVGPL 353

Query: 364 SGAAITSYTTPA 375
AI + +
Sbjct: 354 LFTAIYAASITT 365


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0120   PF04183  301    4e-97    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 301 bits (772), Expect = 4e-97
Identities = 118/540 (21%), Positives = 212/540 (39%), Gaps = 63/540 (11%)

Query: 3 NKELIQHAAYAAIERILNEYFREENLYQVPPQNHQWSIQLSELE-TLTGEFRYWSAMGHH 61
N + + ++L+E E+ + + ++ I L + E W G
Sbjct: 2 NHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIW---GW- 57

Query: 62 MYHPEVWLIDGKSKKITTYKEAIARILQHMAQSADNQTA-VQQHMAQIMSDI--DNSIHR 118
ID ++ + +L + Q A V +HM + + + D + +
Sbjct: 58 ------LWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLK 111

Query: 119 TARYLQSNTIDYVEDRYIVSEQSLYLGHPFHPTPKSASGFSEADLEKYAPECHTSFQLHY 178
R L ++ + + Q L GHP K G+ + LE+YAPE +F+LH+
Sbjct: 112 ARRGLSASDL---INLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHW 168

Query: 179 LAVHQD-------------VLLTRYVEGKEDQVEKVLYQLADIDISEIPKDFILLPTHPY 225
LAV ++ LLT ++ +E ++Q +D +++ LP HP+
Sbjct: 169 LAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-----HNWLPLPVHPW 223

Query: 226 QINVLRQHPQYMQYSEQGLIKDLGVSGDLVYPTSSVRTVF--SKALNIYLKLPIHVKITN 283
Q ++ +G + LG GD S+RT+ S+ + +KLP+ + T+
Sbjct: 224 QWQQK-IATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTS 282

Query: 284 FIRTNDLEQIERTIDAAQVIASVKDE-----------VETPHFKLMFEEGYRALLPNPLG 332
R I A++ + V + P + EGY AL P
Sbjct: 283 CYRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYR 342

Query: 333 QTVEPEMDLLTNSAMIVREGIPNY-HADKDIHVLASLFETMPD-SPISKLAQVIEQSGLA 390
EM +I RE + D+ ++A+L E + P+ I++SGL
Sbjct: 343 YQ---EM-----LGVIWRENPCRWLKPDESPVLMATLMECDENNQPL--AGAYIDRSGLD 392

Query: 391 PEAWLECYLDRTLLPILKLFSNTGISLEAHVQNTLIELKDGIPDVCFVRDLEG-ICLSRT 449
E WL ++P+ L G++L AH QN + +K+G+P ++D +G + L +
Sbjct: 393 AETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKE 452

Query: 450 IATEKQLVPNVVAASSPVVYAHDEAWHRLKYYVVVNHLGHLVSTIGKATRNEVVLWQLVA 509
E +P V + + A D H L+ V L + + + E +QL+A
Sbjct: 453 EFPEMDSLPQEVRDVTSRLSA-DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLA 511


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0121   PF04183  514    e-179    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 514 bits (1324), Expect = e-179
Identities = 146/592 (24%), Positives = 257/592 (43%), Gaps = 40/592 (6%)

Query: 1 MNQTILNRVKTRVMHQLVSSLIYENIVVYKASYQDGVGHFTIEGHDSEYRFTAEKTHSFD 60
MN + V R++ +++S L YE + + A Q G + I +++RF AE+ +
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQV--FHAESQ-GDDRYCINLPGAQWRFIAERG-IWG 56

Query: 61 RIRITSPIERVVGDEADTTTDYTQLLREAVFTFPKNDEKLEQFIVELLQTELKDTQSMQY 120
+ I + R AD LL + +D + + + +L T L D Q ++
Sbjct: 57 WLWIDAQTLRC----ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKA 112

Query: 121 RESNPPATPETFN-DYEFYAMEGHQYHPSYKSRLGFTLSDNLKFGPDFVPNVKLQWLAID 179
R + N D + GH K R G+ ++ P++ +L WLA+
Sbjct: 113 RRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVK 172

Query: 180 KDKVETTVSRNVVVNEMLRQQVGDKTYEHFVQQIEASGKHVNDVEMIPVHPWQFEHVIQV 239
++ + + ++++L + + + F Q + +G N + +PVHPWQ++ I
Sbjct: 173 REHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWL-PLPVHPWQWQQKIAT 231

Query: 240 DLAEERLNGTVLWLGESDELYHPQQSIRTMSPIDTT-KYYLKVPISITNTSTKRVLAPHT 298
D + G ++ LGE + + QQS+RT++ +K+P++I NTS R +
Sbjct: 232 DFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRY 291

Query: 299 IENAAQITDWLKQIQQQDMYLKDE----LKTVFLGEVLGQSYLNTQLSPYKQTQVYGALG 354
I + WL+Q+ D L L G V + Y +PY+ ++ LG
Sbjct: 292 IAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEM---LG 348

Query: 355 VIWRENIYHMLIDEEDAIPFNALYASDKDGLPFIEKWIKQYG--SEAWTKQFLAVAIRPM 412
VIWREN L +E + L D++ P +I + G +E W Q V + P+
Sbjct: 349 VIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPL 408

Query: 413 IHMLYYHGIAFESHAQNMMLIHENGWPTRIALKDFHDGVRFKREHLSEAASHLTLKPMPE 472
H+L +G+A +H QN+ L + G P R+ LKDF +R +E E S +P+
Sbjct: 409 YHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDS------LPQ 462

Query: 473 AHKKVNSNSFIETDDERLVRDFLH---DAFFFINIAEIILFIEKQYGIDEQRQWQWVKDI 529
+ V S RL D+L F+ + I + + G+ E+R +Q + +
Sbjct: 463 EVRDVTS---------RLSADYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAV 513

Query: 530 IEAYQEAFPELNN-YQHFDLFEPTIQVEKLTTRRL-LSDSELRIHHVTNPLG 579
+ Y + P+++ + F LF P I L +L D + + N L
Sbjct: 514 LSDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLE 565


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0126   DHBDHDRGNASE  128    4e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 128 bits (323), Expect = 4e-38
Identities = 66/250 (26%), Positives = 113/250 (45%), Gaps = 2/250 (0%)

Query: 5 KVALVTGGAQGIGFKIAERLVEDGFKVAVVDFNEEGAKAAALKLSSDGTKAIAIKADVSN 64
K+A +TG AQGIG +A L G +A VD+N E + L ++ A A ADV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 RDDVFNAVRQTAAQFGDFHVMVNNAGLGPTTPIDTITEEQFKTVYGVNVAGVLWGIQAAH 124
+ + + G ++VN AG+ I ++++E+++ + VN GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 EQFKKFNHGGKIINATSQAGVEGNPGLSLYCSTKFAVRGLTQVAAQDLASEGITVNAFAP 184
+ G I+ S ++ Y S+K A T+ +LA I N +P
Sbjct: 129 KYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 185 GIVQTPMMESIAVATAEEAGKPEAWGWEQFTSQIALGRVSQPEDVSNVVSFLAGKDSDYI 244
G +T M S+ + E F + I L ++++P D+++ V FL + +I
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSL-ETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 245 TGQTIIVDGG 254
T + VDGG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0128   NUCEPIMERASE  217    9e-71    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 217 bits (554), Expect = 9e-71
Identities = 79/327 (24%), Positives = 139/327 (42%), Gaps = 33/327 (10%)

Query: 3 RVLITGGAGFIGSHLVDDL-QQDYDVYVLDNYRTG-----KRENIKSLADDHVF--ELDI 54
+ L+TG AGFIG H+ L + + V +DN K+ ++ LA ++D+
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 REYDAVEQIMKTYQFDYVIHLAALVSVAESVEKPILSQEINVVATLRLLEIIKKYNSHIK 114
+ + + + + F+ V ++V S+E P + N+ L +LE + I+
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQ 119

Query: 115 RFIFASSAAVYGDLPDLPKSDQSLI-LPLSPYAIDKYYGERTTLNYCSLYNIPTAVVKFF 173
++ASS++VYG +P S + P+S YA K E Y LY +P ++FF
Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179

Query: 174 NVFGPRQDPKSQYSGVISKMFDSFEHNKPFTFFGDGLQTRDFVYVYDVVQSVRLIMEH-- 231
V+GP P M K + G RDF Y+ D+ +++ + +
Sbjct: 180 TVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235

Query: 232 ---------------KDAIGHGYNIGTGTFTNLLEVYRIIGELYGKSVEHEFKEARKGDI 276
A YNIG + L++ + + + G + + GD+
Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDV 295

Query: 277 KHSYADISNL-KALGFVPKYTVETGLK 302
+ AD L + +GF P+ TV+ G+K
Sbjct: 296 LETSADTKALYEVIGFTPETTVKDGVK 322


3     SAV0168  SAV0201  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV0168        2       18   1.056489  similar to cation-efflux system membrane protein
SAV0169        2       16   2.059133  hypothetical protein
SAV0170        0       14   1.889934  hypothetical protein
SAV0171       -1       14   1.841156  hypothetical protein
SAV0172       -2       12   1.866417  hypothetical protein
SAV0173       -2       11   1.233856  hypothetical protein
SAV0174        0       14   1.349404  hypothetical protein
SAV0175        2       14   1.261843  hypothetical protein
SAV0176        2       14   1.217189  conserved hypothetical protein
SAV0177        2       13   1.286703  NAD-dependent formate dehydrogenase
SAV0178        2       14   1.608224  similar to integral membrane protein LmrP
SAV0179        3       13   1.872859  similar to surfactin synthetase
SAV0180        2       14   2.555673  conserved hypothetical protein
SAV0181        2       15   2.131966  conserved hypothetical protein
SAV0182        1       15   2.097856  hypothetical protein
SAV0183        0       14   2.436282  arginine biosynthesis bifunctional protein
SAV0184       -1       16   2.558530  N-acetylglutamate gamma-semialdehyde
SAV0185       -1       14   2.601331  ornithine aminotransferase
SAV0186       -1       15   2.629363  similar to branched-chain amino acid transport
SAV0187        1       12   3.511227  hypothetical protein
SAV0188        0       11   3.283205  putative indole-3-pyruvate decarboxylase
SAV0189        0       13   2.994324  PTS enzyme II
SAV0190       -1       15   1.417086  conserved hypothetical protein
SAV0191       -1       14   0.860223  similar to glucokinase regulatory protein
SAV0192       -2       14  -0.240748  similar to PTS system sucrose-specific IIBC
SAV0193        1       16  -1.486411  similar to transcription regulator RpiR family
SAV0194        0       14  -2.031568  hypothetical protein
SAV0195        0       15  -1.577118  probable type I restriction enzyme restriction
SAV0196        1       20  -4.792797  hypothetical protein
SAV0197        2       19  -5.130632  conserved hypothetical protein
SAV0198        3       20  -5.856690  similar to ABC transporter ATP-binding protein
SAV0199        4       21  -6.625156  similar to SA0193/BacI-like protein
SAV0200       -1       13  -4.750793  hypothetical protein
SAV0201       -2       12  -4.311485  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV0178   TCRTETA  32     0.004    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.004
Identities = 61/337 (18%), Positives = 127/337 (37%), Gaps = 33/337 (9%)

Query: 7 TLKVRLISNFLQLIITTAFIPFIALYLTDMLS----QSIVGIYLVGLVVLKFPLSIISGY 62
L V L + L + +P + L D++ + GI L +++F + + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 63 LIEIFPKKLLVLIYQATMVIMLVFMGVFGSHQLWQI-IGFCVAYAIFTIVWGLQFPVMDT 121
L + F ++ ++L+ A + M + LW + IG VA + G V
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMAT--APFLWVLYIGRIVAG-----ITGATGAVAGA 118

Query: 122 LIMDAITEDVEHYIYKISYWMTNLSVAIGALLGGLMYGYSMLLLFLIAACIFLIVLFILY 181
I D D + + G +LGGLM G+S F AA + +
Sbjct: 119 YIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178

Query: 182 IWLPQDRNQVKQSDDKRHASRYQKLQIMNIFRSYKLVLKDRNYMLLISGFSIIMMGEFSI 241
LP+ ++ + + + L+ F + ++G+
Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAA--------LMAVFFIMQLVGQVPA 230

Query: 242 SSYIAIRLKDQF--ETISIGSYDITGAKMLAILLMINTVVVILLTYSISKVVLKIDFKKA 299
+ + I +D+F + +IG LA +++++ ++T ++ ++ ++A
Sbjct: 231 ALW-VIFGEDRFHWDATTIGI-------SLAAFGILHSLAQAMITGPVAA---RLGERRA 279

Query: 300 LITGLLIYIVGYSGLTYLNQFGLLVVFMIIATVGEII 336
L+ G++ GY L + + + M++ G I
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIG 316


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0179   NUCEPIMERASE  53     8e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 52.9 bits (127), Expect = 8e-09
Identities = 54/266 (20%), Positives = 101/266 (37%), Gaps = 55/266 (20%)

Query: 2046 NTLLTGATGFLGAYLIEALQGYSHRIYCFIRADNEEIAWYKLMTNLNDYFS----EETVE 2101
L+TGA GF+G ++ + L H++ + D NLNDY+ + +E
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV---VGID-----------NLNDYYDVSLKQARLE 47

Query: 2102 MM----LSNIEVIVGDFECMDDVVLPENMDTIIH----AGARTDHFGDDDEFEKVNVQGT 2153
++ ++ + D E M D+ + + + R + + N+ G
Sbjct: 48 LLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVR-YSLENPHAYADSNLTGF 106

Query: 2154 VDVIRLAQQHH-ARLIYVSTISV-GTYFDIDTEDVTFSEADVYKGQLLTSPYTRSKFYSE 2211
++++ + + L+Y S+ SV G + FS D + S Y +K +E
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYG-----LNRKMPFSTDDSVDHPV--SLYAATKKANE 159

Query: 2212 LKVLEAVNN-GLDGRIVRVGNLTSPYNGRWHM------RNIKTNRFSMVMNDLLQLDCIG 2264
L + GL +R + P+ GR M + + + V N
Sbjct: 160 LMAHTYSHLYGLPATGLRFFTVYGPW-GRPDMALFKFTKAMLEGKSIDVYNY-------- 210

Query: 2265 VSMAEMPVDFSFVDTTARQIVALAQV 2290
+M DF+++D A I+ L V
Sbjct: 211 ---GKMKRDFTYIDDIAEAIIRLQDV 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0180   ENTSNTHTASED  29     0.009    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 29.2 bits (65), Expect = 0.009
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 84 GQP-----IYVSLSYSYPYIVCVVDKEPVGIDIEKISQRLDWRTLVTCFSTNEAHQI 135
QP ++ S+S+ + V+ ++ +GIDIEKI + L ++ QI
Sbjct: 76 RQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATELAPSIIDSDERQI 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0182   CARBMTKINASE  32     0.002    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 31.7 bits (72), Expect = 0.002
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 155 INADTLAYFIASSLKAPIYV-LSNIAGVLIN-----DVVIPQLPLVDIHQYIEHGD-IYG 207
I+ D +A + A I++ L+++ G + + + ++ + ++ +Y E G G
Sbjct: 213 IDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAG 272

Query: 208 GMIPKVLDAKNAIENGCPKVIIAS 231
M PKVL A IE G + IIA
Sbjct: 273 SMGPKVLAAIRFIEWGGERAIIAH 296


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0187   ISCHRISMTASE  59     3e-13    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 59.3 bits (143), Expect = 3e-13
Identities = 31/99 (31%), Positives = 51/99 (51%)

Query: 66 LDKRDDDFVIDKRHFSAFVGTDLDLQLRRRGIDTIVLGGVATHIGVDTTARDAYQLNYNQ 125
L DDD V+ K +SAF T+L +R+ G D +++ G+ HIG TA +A+ +
Sbjct: 112 LAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKA 171

Query: 126 FFVTDMMSAQNETLHQFPIDNVFPLMGQTITTNDFLNIL 164
FFV D ++ + HQ ++ T+ T+ L+ L
Sbjct: 172 FFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSLLDQL 210


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0193   DNABINDINGHU  30     0.002    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 30.4 bits (69), Expect = 0.002
Identities = 16/75 (21%), Positives = 28/75 (37%), Gaps = 15/75 (20%)

Query: 86 ELIENESVETLKNKMIARATNTMRFVATNIMDAQIDAICDVLKNARTIFLFGFGASSLTI 145
+LI +A AT + + +DA A+ L + L GFG +
Sbjct: 6 DLIA----------KVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVR- 54

Query: 146 GDLFQKLSRIGLNVR 160
++ +R G N +
Sbjct: 55 ----ERAARKGRNPQ 65


4     SAV0289  SAV0300  Y     N          N                   Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV0289        3       17  -0.018031  hypothetical protein
SAV0290        2       17  -0.771680  hypothetical protein
SAV0291        2       18  -1.774556  hypothetical protein
SAV0292        2       19  -1.583673  hypothetical protein
SAV0293        3       18  -2.239369  conserved hypothetical protein
SAV0294        6       21  -4.694054  conserved hypothetical protein
SAV0295        3       19  -5.007448  hypothetical protein
SAV0296        7       22  -4.461062  hypothetical protein
SAV0297        9       21  -3.986066  hypothetical protein
SAV0298       11       18  -4.168561  conserved hypothetical protein
SAV0299        8       17  -3.875109  conserved hypothetical protein
SAV0300        2       14  -2.931036  conserved hypothetical protein
5     SAV0315  SAV0322  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV0315       -2       13   3.000203  N-acetylneuraminate lyase subunit
SAV0316        0       14   2.858962  hypothetical protein
SAV0317       -1       13   3.258091  conserved hypothetical protein
SAV0318        0       11   3.751353  Putative N-acetylmannosamine-6-phosphate
SAV0319       -1       11   3.624201  conserved hypothetical protein
SAV0320       -1       11   3.749637  glycerol ester hydrolase
SAV0321        1       13   3.534683  hypothetical protein
SAV0322        1       15   3.433645  putative trimethylamine dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0318   PHPHTRNFRASE  27     0.046    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 27.4 bits (61), Expect = 0.046
Identities = 17/82 (20%), Positives = 27/82 (32%), Gaps = 12/82 (14%)

Query: 65 DYDHSDVFITATSKEVDELIESQCEVIALDATLQQ---RPKETLDELVSYIRTHAPNVEI 121
D V + T +EV E + + P T D +VE+
Sbjct: 222 DGIEGIVIVNPTEEEVKAYEEKRAAFEKQKQEWAKLVGEPSTTKD---------GAHVEL 272

Query: 122 MADIATVEEAKNAARLGFDYIG 143
A+I T ++ G + IG
Sbjct: 273 AANIGTPKDVDGVLANGGEGIG 294


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV0320   GPOSANCHOR  47     1e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.4 bits (112), Expect = 1e-07
Identities = 41/309 (13%), Positives = 85/309 (27%), Gaps = 12/309 (3%)

Query: 1 MLRGQEERKYSIRKYSIGVVSVLAATMFVVSSHEAQASEKTPTSNAAAQKETLNQPGEQG 60
M + R YS+RK G SV A + + +E + + +
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERAD 60

Query: 61 NAITSHQMQSGKQLDDMHKENGKSGTVTEGKDTLQSSKHQSTQNSKTIRTQ---NDNQVK 117
+ K D E + L ++K + +N K++ +
Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120

Query: 118 QDSERQGSKQSHQN------NATNNTERQNDQVQNTHHAERNGSQSTTSQSNDVDKSQPS 171
+ ++ + + + N E + + + + S +
Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180

Query: 172 IPAQKVLPNHDKAAPTSTTPPSNDKTAPKSTKAQDATTDKHPNQQDTHQPAHQIIDAKQD 231
+ A+K +A + + + S K + +K + A
Sbjct: 181 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNF 240

Query: 232 DTVRQSEQKPQVGDLSKHIDGQNSPEKPTDKNTDNKQLIKDALQAPKTRSTTNAAADAKK 291
T ++ K + + Q EK KT AA +A+K
Sbjct: 241 STADSAKIKTLEAEKAALEARQAELEK---ALEGAMNFSTADSAKIKTLEAEKAALEAEK 297

Query: 292 VRPLKANQV 300
+QV
Sbjct: 298 ADLEHQSQV 306


6     SAV0391  SAV0416  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV0391        3       19   1.832618  GMP synthase
SAV0392        8       24   0.652803  integrase homolog
SAV0393        7       23   1.941071  hypothetical protein
SAV0394        6       23   1.728725  conserved hypothetical protein
SAV0395        6       25   2.808284  hypothetical protein
SAV0396        4       27   3.775505  hypothetical protein
SAV0397        4       26   3.900181  putative transcriptional regulator
SAV0398        4       27   3.940082  tetracycline resistance protein
SAV0399        4       28   3.854680  hypothetical protein
SAV0400        4       28   4.248875  similar to lipoprotein, NLP/P60 family
SAV0401        4       27   2.829428  putative membrane protein
SAV0402        5       26   1.785474  hypothetical protein
SAV0403        7       21   1.220076  hypothetical protein
SAV0404        6       22   1.915245  hypothetical protein
SAV0405        6       25   2.882968  hypothetical protein
SAV0406        6       26   3.111561  hypothetical protein
SAV0407        5       29   4.410959  hypothetical protein
SAV0408        5       23   1.725064  putative phage replication protein
SAV0409        6       20  -0.372039  similar to DNA translocase FtsK
SAV0410        6       20   0.466871  hypothetical protein
SAV0411        6       20  -0.506165  hypothetical protein
SAV0412        5       19  -1.488520  hypothetical protein
SAV0413        5       19  -2.072741  hypothetical protein
SAV0414        6       19  -1.324919  hypothetical protein
SAV0415        5       27   0.067609  transposase
SAV0416        1       17  -3.484364  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SAV0398   TCRTETOQM  1103   0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 1103 bits (2855), Expect = 0.0
Identities = 604/639 (94%), Positives = 621/639 (97%)

Query: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60
MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI
Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60

Query: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120
TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG
Sbjct: 61 TSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMG 120

Query: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180
IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE
Sbjct: 121 IPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIE 180

Query: 181 GNDDLLEKYMSGKSLEALELEQEESIRFQNCSLFPLYHGSAKSNIGIDNLIEVITNKFYS 240
GNDDLLEKYMSGKSLEALELEQEESIRF NCSLFP+YHGSAK+NIGIDNLIEVITNKFYS
Sbjct: 181 GNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYS 240

Query: 241 STHRGPSELCGNVFKIEYTKKRQRLAYIRLYSGVLHLRDSVRVSEKEKIKVTEMYTSING 300
STHRG SELCG VFKIEY++KRQRLAYIRLYSGVLHLRDSVR+SEKEKIK+TEMYTSING
Sbjct: 241 STHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSING 300

Query: 301 ELCKIDRAYSGEIVILQNEFLKLNSVLGDTKLLPQRKKIENPHPLLQTTVEPSKPEQREM 360
ELCKID+AYSGEIVILQNEFLKLNSVLGDTKLLPQR++IENP PLLQTTVEPSKP+QREM
Sbjct: 301 ELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREM 360

Query: 361 LLDALLEISDSDPLLRYYVDSTTHEIILSFLGKVQMEVISALLQEKYHVEIELKEPTVIY 420
LLDALLEISDSDPLLRYYVDS THEIILSFLGKVQMEV ALLQEKYHVEIE+KEPTVIY
Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420

Query: 421 MERPLKNAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG 480
MERPLK AEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG
Sbjct: 421 MERPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG 480

Query: 481 IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLTPIVLEQAFRKAGTELLEPYL 540
IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRML PIVLEQ +KAGTELLEPYL
Sbjct: 481 IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYL 540

Query: 541 SFKVYAPQEYLSRAYNDAPKYCANIVNTQLKNNEVIIIGEIPARCIQDYRNDLTFFTNGL 600
SFK+YAPQEYLSRAY DAPKYCANIV+TQLKNNEVI+ GEIPARCIQ+YR+DLTFFTNG
Sbjct: 541 SFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGR 600

Query: 601 SVCLAELKGYQVTTGEPVCQTRRLNSRIDKVRYMFNKIT 639
SVCL ELKGY VTTGEPVCQ RR NSRIDKVRYMFNKIT
Sbjct: 601 SVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT 639
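
Note: the percentages on the "Identities" summary line can be re-derived from the raw counts. The following is a minimal Python sketch, not part of the PredictBias output; the helper names `parse_blast_stats` and `floor_percent` are hypothetical, and the use of truncation rather than rounding is an assumption that happens to match the figures printed in this report.

```python
import re

def parse_blast_stats(line):
    """Parse 'Identities = a/b (p%), Positives = c/d (q%)[, Gaps = e/f (r%)]'.

    Returns a dict mapping each label to (count, alignment_length, reported_percent).
    """
    stats = {}
    for name, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \((\d+)%\)", line):
        stats[name] = (int(num), int(den), int(pct))
    return stats

def floor_percent(count, length):
    """Percentage truncated to an integer (assumed to be how the report prints it)."""
    return (100 * count) // length

line = "Identities = 604/639 (94%), Positives = 621/639 (97%)"
for name, (count, length, reported) in parse_blast_stats(line).items():
    # Compare the recomputed value with the one printed in the report.
    print(name, floor_percent(count, length), reported)
```

Truncation is used here because rounding 604/639 (94.5%) to the nearest integer would give 95, not the 94 shown in the summary line.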


7  SAV0425  SAV0452  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0425   0       17       -3.343438  exotoxin 10
SAV0426   1       16       -0.863801  exotoxin 11
SAV0427   0       16       -1.378472  exotoxin 12
SAV0428   -1      14       -1.181689  exotoxin 13
SAV0429   2       11       -1.018756  exotoxin 14
SAV0430   1       10       -0.523393  hypothetical protein
SAV0431   2       8        -1.131205  probable type I site-specific deoxyribonuclease
SAV0432   8       14       -3.516278  probable restriction modification system
SAV0433   7       12       -3.282493  exotoxin 15
SAV0434   10      14       -3.525899  hypothetical protein
SAV0435   10      19       -4.174495  Conserved hypothetical protein
SAV0436   12      21       -4.317324  hypothetical protein
SAV0437   12      22       -4.099922  hypothetical protein
SAV0438   12      21       -3.846788  hypothetical protein
SAV0439   12      21       -3.851671  hypothetical protein
SAV0440   11      20       -3.431344  hypothetical protein
SAV0441   11      20       -3.351607  hypothetical protein
SAV0442   10      19       -3.501842  hypothetical protein
SAV0443   10      18       -3.448079  hypothetical protein
SAV0444   9       18       -3.295197  hypothetical protein
SAV0445   3       13       -1.554965  hypothetical protein
SAV0446   3       15       -1.151650  hypothetical protein
SAV0447   0       16       1.611104   conserved hypothetical protein
SAV0448   2       16       3.640983   hypothetical protein
SAV0449   2       17       3.741553   hypothetical protein
SAV0450   2       16       3.616674   putative cobalamin synthesis protein
SAV0451   3       17       3.462956   hypothetical protein
SAV0452   2       17       3.126418   NADH dehydrogenase subunit 5
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0425   TOXICSSTOXIN  135    2e-41    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 135 bits (340), Expect = 2e-41
Identities = 49/201 (24%), Positives = 73/201 (36%), Gaps = 14/201 (6%)

Query: 39 NVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKNRKFTRVQIFGKDIERFKAR 98
+ +I DL D+YS S N S G + + IF
Sbjct: 41 STNDNIKDLLDWYSSGSDTFTNSEVLDNSLGS---MRIKNTDGSISLIIFPSPYYSPAFT 97

Query: 99 KNPGLDI-----FVVKEAENRNGTVFSYGGVTKKNQDAYYDYINAPRFQIKRDEGDGIAT 153
K +D+ + F GVT + I P +K D
Sbjct: 98 KGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELPLK-VKVHGKDSPLK 154

Query: 154 YGRVHYIYKEEISLKELDFKLRQYLIQNFDLYKKFPKDSKI-KVIMKDGGYYTFELNKKL 212
YG K+++++ LDF++R L Q LY+ K K+ M DG Y +L+KK
Sbjct: 155 YG--PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKF 212

Query: 213 QTNRMSDVIDGRNIEKIEANI 233
+ N I+ I+ IEA I
Sbjct: 213 EYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0426   TOXICSSTOXIN  192    1e-63    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 192 bits (488), Expect = 1e-63
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 16/197 (8%)

Query: 42 DIKDLYRYYSSESFEFSNI--------SGKVENYNGSNVVRFNQEKQNHQLFLLGKDKDK 93
+IKDL +YSS S F+N S +++N +GS + F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 94 YKKGLEGQNVFVVKELIDPNGRLSTVGGVTKKNNKSSETNTHLFVNKVYGGNLDASIDSF 153
K L + + + + GVT + L V KV+G +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKV-KVHGKDSPLKYGP- 157

Query: 154 LINKEEVSLKELDFKIRKQLVEKYGLYKGTTKYGKI-TINLKDEKKEVIDLGDKLQFERM 212
+K+++++ LDF+IR QL + +GLY+ + K G I + D DL K ++
Sbjct: 158 KFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 213 GDVLNSKDIQNIAVTIN 229
+N +I+ I IN
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0427   TOXICSSTOXIN  125    2e-37    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 125 bits (314), Expect = 2e-37
Identities = 47/199 (23%), Positives = 73/199 (36%), Gaps = 19/199 (9%)

Query: 42 DTNKLHQYYSGPSYELTNV--------SGQSQGYYDSNVLLFNQQNQKFQVFLLGKDENK 93
+ L +YS S TN S + + S L+ F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 94 YKEKTHGLDVFAVPELVDLDGRIFSVSGVTKKNVKSIFESLRTPNLLVKKIDDKDGFSID 153
K + + F +SGVT L L K+ KD +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP----LKVKVHGKDSP-LK 154

Query: 154 EFFFIQKEEVSLKELDFKIRKLLIKKYKLYEGSA-DKGRIVINMKDENKYEIDLSDKLDF 212
K+++++ LDF+IR L + + LY S G I M D + Y+ DLS K ++
Sbjct: 155 YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEY 214

Query: 213 ERMADVINSEQIKNIEVNL 231
IN ++IK IE +
Sbjct: 215 NTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0428   TOXICSSTOXIN  130    1e-39    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 130 bits (329), Expect = 1e-39
Identities = 39/197 (19%), Positives = 69/197 (35%), Gaps = 15/197 (7%)

Query: 43 INMLHQYYSEESFESTNISVKSEDYYGSNVLNFNQRNKTFKVFLLGDDKNKY------KE 96
I L +YS S TN V + + + + + K
Sbjct: 46 IKDLLDWYSSGSDTFTNSEVLD---NSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKV 102

Query: 97 KTHGLDVFAVPELIDIKGGIYSVGGITKKNVRSVFGFVSNPSLQVKKVDAKHGFSINELF 156
+ + + + G+T + P L+VK F
Sbjct: 103 DLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELP-LKVKVHGKDSPLKYGPKF 159

Query: 157 FIQKEEVSLKELDFKIRKMLVEKYRLYKGAS-DKGRIVINMKDEKKYVIDLSEKLSFDRM 215
K+++++ LDF+IR L + + LY+ + G I M D Y DLS+K ++
Sbjct: 160 --DKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 216 FDVMDSKQIKNIEVNLN 232
++ +IK IE +N
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0429   TOXICSSTOXIN  193    4e-64    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 193 bits (491), Expect = 4e-64
Identities = 51/202 (25%), Positives = 92/202 (45%), Gaps = 10/202 (4%)

Query: 31 KQNQKSVNKHDKEALYRYYTGKTMEMKNISALKHGKNNLRFKFRGIKIQVLLPGNDKSKF 90
K + S N + K+ L Y +G + N L + ++R K I +++ +
Sbjct: 36 KTAKASTNDNIKDLLDWYSSG-SDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSP 94

Query: 91 QQRSYEGLDVFFVQEKRDKHD-----IFYTVGGVIQNNKTSGVVSAPILNISKEKGEDAF 145
E +D+ + K+ +H I + + GV K + P L + K G+D+
Sbjct: 95 AFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP-LKV-KVHGKDSP 152

Query: 146 VKGYPYYIKKEKITLKELDYKLRKHLIEKYGLYKTISKDGRV-KISLKDGSFYNLDLRSK 204
+K Y K+++ + LD+++R L + +GLY++ K G KI++ DGS Y DL K
Sbjct: 153 LK-YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKK 211

Query: 205 LKFKYMGEVIESKQIKDIEVNL 226
++ I +IK IE +
Sbjct: 212 FEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0433   TOXICSSTOXIN  108    4e-31    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 108 bits (270), Expect = 4e-31
Identities = 47/225 (20%), Positives = 86/225 (38%), Gaps = 19/225 (8%)

Query: 16 LTTGMITTTAQPVKASTLEVRSQAT-------QDLSEYYKGRGFELTNVTGYKYG-NKVT 67
L T PV S+ ++ A +DL ++Y TN +
Sbjct: 15 LLLATTATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNSEVLDNSLGSMR 74

Query: 68 FIDNSQQIDVTLTGNE----KLTVKDDDEVSNVDVFVVREGSDKSAITTSIGGITKTNGT 123
+ I + + + T + +++ + S+ + I I G+T T
Sbjct: 75 IKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTE-- 132

Query: 124 QHKDTVQNVNLSVSKSTGQHTTSVTSEYYSIYKEEISLKELDFKLRKHLIDKHDLYKTEP 183
T + L V K G+ S K+++++ LDF++R L H LY++
Sbjct: 133 -KLPTPIELPLKV-KVHGK--DSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSD 188

Query: 184 KDSKI-RITMKNGGYYTFELNKKLQPHRMGDTIDSRNIEKIEVNL 227
K +ITM +G Y +L+KK + + I+ I+ IE +
Sbjct: 189 KTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0440   BCTERIALGSPC  32     0.002    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 31.9 bits (72), Expect = 0.002
Identities = 17/83 (20%), Positives = 34/83 (40%), Gaps = 9/83 (10%)

Query: 177 INSNVPSYDAKFKMSNKDENVKQLRSRYNIPTDKAPILKMHIDGDLKGSSVGYKKLEIDF 236
+N VP Y+AK D V Q + RY + + + S G +++
Sbjct: 124 VNEEVPGYNAKIVSIRPDRVVLQYQGRYEV---------LGLYSQEDSGSDGVPGAQVNE 174

Query: 237 SKEENSELSIVDSLNFQPAKNKD 259
++ + ++ D ++F P N +
Sbjct: 175 QLQQRASTTMSDYVSFSPIMNDN 197
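
Note: the Score column in the hit summary row appears to be the bit score from the "Score = ..." line rounded to the nearest integer (31.9 is listed as 32 here). The following is a minimal Python sketch of that relationship, not part of the PredictBias output; the helper names `parse_score_line` and `round_half_up` are hypothetical, and half-up rounding is an assumption inferred from the values in this report.

```python
import math
import re

def parse_score_line(line):
    """Parse 'Score = 31.9 bits (72), Expect = 0.002' into (bits, raw_score, e_value)."""
    match = re.search(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)", line)
    if match is None:
        raise ValueError("not a BLAST score line: " + line)
    bits, raw, expect = match.groups()
    return float(bits), int(raw), float(expect)

def round_half_up(value):
    """Round to the nearest integer, with .5 going up (assumed summary-row behaviour)."""
    return math.floor(value + 0.5)

bits, raw, e_value = parse_score_line("Score = 31.9 bits (72), Expect = 0.002")
print(round_half_up(bits), e_value)
```

Python's built-in `round` is avoided because it rounds halves to the nearest even integer, which would not reproduce a summary score of 41 for a bit score of 40.5.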


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0443   BCTERIALGSPC  35     3e-04    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 34.6 bits (79), Expect = 3e-04
Identities = 18/83 (21%), Positives = 34/83 (40%), Gaps = 9/83 (10%)

Query: 181 INENVPSYDAKFKMSNKDENVKQLRSRYNIPTDKAPVLKMHIDGDLKGSSVGYKKLEIDF 240
+NE VP Y+AK D V Q + RY + + + S G +++
Sbjct: 124 VNEEVPGYNAKIVSIRPDRVVLQYQGRYEV---------LGLYSQEDSGSDGVPGAQVNE 174

Query: 241 SKGEKSDLSVIDSLNFQPAKVDE 263
+++ ++ D ++F P D
Sbjct: 175 QLQQRASTTMSDYVSFSPIMNDN 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0448   adhesinb      27     0.014    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.1 bits (60), Expect = 0.014
Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 14/94 (14%)

Query: 14 DISTTVETLNLISKMEAQKENIRSVIAPEHKHKYKDIENGLKGEE---KVLIEQMAQHCE 70
+S V+ + L + E KE+ H + ++ENG+ + K L E+ + E
Sbjct: 118 AVSEGVDVIYLEGQSEKGKED---------PHAWLNLENGIIYAQNIAKRLSEKDPANKE 168

Query: 71 AFKANFKGAAQ--GDWVKSAMSEIDSIKDDLKKI 102
++ N K + K A + ++I + K I
Sbjct: 169 TYEKNLKAYVEKLSALDKEAKEKFNNIPGEKKMI 202


8  SAV0610  SAV0630  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0610   1       14       -3.317671  similar to iron(III) ABC transporter permease
SAV0611   -1      13       -3.924554  L-2-haloalkanoic acid dehalogenase
SAV0612   -1      12       -3.809575  similar to
SAV0613   0       13       -4.276875  hypothetical protein
SAV0614   -2      14       -4.376550  hypothetical protein
SAV0615   -3      14       -3.515715  putative esterase/lipase
SAV0616   0       13       -2.518313  staphylococcal accessory regulator A
SAV0617   1       13       -2.093769  conserved hypothetical protein
SAV0618   1       15       -2.041900  hypothetical protein
SAV0619   0       12       -1.454876  hypothetical protein
SAV0620   1       12       -1.938036  hypothetical protein
SAV0621   2       12       -1.706755  putative NADH dehydrogenase I chain L
SAV0623   0       14       -1.529407  Na_ antiporter
SAV0624   0       11       -1.650025  Na_ antiporter
SAV0625   -1      9        -1.785784  MnhD homologue
SAV0626   -1      10       -2.252711  Na_ antiporter
SAV0627   0       10       -1.722608  similar to Na_ antiporter
SAV0628   0       11       -1.587389  conserved hypothetical protein
SAV0629   0       10       -1.866764  conserved hypothetical protein
SAV0630   2       10       -2.057539  transposase for IS1181
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0627   TCRTETB       27     0.012    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 27.2 bits (60), Expect = 0.012
Identities = 16/73 (21%), Positives = 38/73 (52%)

Query: 5 ITHIMIISSLIIFGIALIICLFRLIKGPTTADRVVTFDTTSAVVMSIVGVLSVLMGTVSF 64
I H + S L++ + II + L+K R+ +++ VG++ ++ T S+
Sbjct: 162 IAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSY 221

Query: 65 LDSIMLIAIISFV 77
S ++++++SF+
Sbjct: 222 SISFLIVSVLSFL 234


9  SAV0658  SAV0674  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0658   -3      11       -3.567327  hypothetical protein
SAV0659   -3      8        -2.655000  two-component response regulator
SAV0660   0       9        -1.725492  putative two-component sensor histidine kinase
SAV0661   0       9        -1.828049  ABC transporter ATP-binding protein
SAV0662   -1      8        -3.245522  ABC transporter permease
SAV0663   0       10       -3.092324  putative pit accessory protein
SAV0664   0       9        -2.434933  low-affinity inorganic phosphate transporter
SAV0665   -1      11       -3.789135  secretory antigen SsaA homologue
SAV0666   -1      16       -5.310794  conserved hypothetical protein
SAV0667   -1      17       -4.763139  similar to AraC/XylS family transcriptional
SAV0668   1       13       -2.464823  hypothetical protein
SAV0669   2       12       -1.743480  conserved hypothetical protein
SAV0670   3       12       -2.824529  conserved hypothetical protein
SAV0671   3       12       -2.834408  conserved hypothetical protein
SAV0672   2       12       -2.769483  LysR family transcriptional regulator
SAV0673   3       11       -3.042106  sugar efflux transporter
SAV0674   2       14       -3.105877  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0659   HTHFIS        64     5e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.7 bits (155), Expect = 5e-14
Identities = 26/111 (23%), Positives = 57/111 (51%), Gaps = 1/111 (0%)

Query: 3 ILLVEDDNTLFQELKKELEQWDFNVAGIEDFGKVMDTFESFNPEIVILDVQLPKYDGFYW 62
IL+ +DD + L + L + ++V + + + + ++V+ DV +P + F
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 63 CRKMREV-SNVPILFLSSRDNPMDQVMSMELGADDYMQKPFYTNVLIAKLQ 112
++++ ++P+L +S+++ M + + E GA DY+ KPF LI +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0661   PF05272       36     1e-04    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 36.2 bits (83), Expect = 1e-04
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 8/56 (14%)

Query: 40 GPSGSGKTTLLNVLSSIDYISQGSITLKGKK--LEKLSNK------ELSDIRKHDI 87
G G GK+TL+N L +D+ S + K E+++ E++ R+ D
Sbjct: 603 GTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADA 658


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0673   TCRTETA       56     7e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 56.4 bits (136), Expect = 7e-11
Identities = 73/365 (20%), Positives = 134/365 (36%), Gaps = 41/365 (11%)

Query: 9 KNYKLFVA--NMFLLGMGIAVTVPYLVLFATKDLGMTTNQ---YGLLLASAAISQFTVNS 63
N L V + L +GI + +P L +DL + + YG+LLA A+ QF
Sbjct: 3 PNRPLIVILSTVALDAVGIGLIMPVLP-GLLRDLVHSNDVTAHYGILLALYALMQFACAP 61

Query: 64 IIARFSDTHHFNRKIIIILALLMGALGFSIYFFVDTIWLFILLYAIFQGLFAPAMPQLYA 123
++ SD F R+ +++++L A+ ++I +W+ + + I G+
Sbjct: 62 VLGALSD--RFGRRPVLLVSLAGAAVDYAIMATAPFLWV-LYIGRIVAGITGATGA---V 115

Query: 124 SARESINVSSSKDRAQFANTVLRSMFSLGFLFGPFIGAQLIGLKGYAGLFGGTISIILFT 183
+ +++ +RA+ + + F G + GP +G + G +A F L
Sbjct: 116 AGAYIADITDGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNF 174

Query: 184 LVLQVFFYKDLNIKHPISTQQHVEKIAPNMFKDKTL--------LLPFIAFILLHIGQWM 235
L + ++ + + A N L + FI+ +GQ
Sbjct: 175 LTGCFLLPESHK-----GERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229

Query: 236 YTMNMPLFVTDYLKENEQHVGYLASLCAGLEVPFMIIL-GVLSSRLHTRTLLIYGAIFGG 294
+ +F D + +G + L ++ G +++RL R L+ G I G
Sbjct: 230 AAL-WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADG 288

Query: 295 LFYFSIGVFKNFYMMLAGQVFLAIFLAVLLGIGISYFQDILPDFPGYASTLFSNAMVIGQ 354
Y + +M F + L GIG +P S GQ
Sbjct: 289 TGYILLAFATRGWM-----AFPIMVLLASGGIG-------MPALQAMLSRQVDEERQ-GQ 335

Query: 355 LGGNL 359
L G+L
Sbjct: 336 LQGSL 340



Score = 48.7 bits (116), Expect = 2e-08
Identities = 44/186 (23%), Positives = 73/186 (39%), Gaps = 13/186 (6%)

Query: 213 MFKDKTLLLPFIAFILLHIGQWMYTMNMPLFVTDYLKENEQ--HVGYLASLCAGLEVPFM 270
M ++ L++ L +G + +P + D + N+ H G L +L A ++
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 271 IILGVLSSRLHTRTLLIYGAIFGGLFYFSIGVFKNFYMMLAGQVFLAIFLAVLLGIGISY 330
+LG LS R R +L+ + Y + +++ G++ I A G +Y
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AY 119

Query: 331 FQDILPD-----FPGYASTLFSNAMVIGQLGGNLLGGAMSHWVGLENVFFVSAASIMLGM 385
DI G+ S F MV G + G L+GG H FF +AA L
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPH-----APFFAAAALNGLNF 174

Query: 386 ILIFFT 391
+ F
Sbjct: 175 LTGCFL 180


10  SAV0796  SAV0821  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0796   0       20       -3.540162  hypothetical protein
SAV0797   -1      20       -3.038929  hypothetical protein
SAV0798   -1      16       -3.616275  hypothetical protein
SAV0799   2       21       -4.575113  hypothetical protein
SAV0800   1       22       -4.665040  similar to bacteriophage terminase small
SAV0801   2       24       -6.019336  hypothetical protein
SAV0802   2       23       -5.961571  hypothetical protein
SAV0803   5       21       -6.546198  ferric hydroxamate receptor 1
SAV0804   5       20       -6.719213  hypothetical protein
SAV0805   3       16       -4.862425  hypothetical protein
SAV0806   6       16       0.835152   hypothetical protein
SAV0807   6       15       -0.084400  hypothetical protein
SAV0808   4       14       0.054362   conserved hypothetical protein
SAV0809   3       14       0.425051   hypothetical protein
SAV0810   3       13       0.904291   conserved hypothetical protein
SAV0811   3       14       1.049288   fibrinogen-binding protein
SAV0812   1       15       -2.142071  similar to secreted von Willebrand
SAV0813   2       16       -1.568633  extracellular ECM and plasma binding protein
SAV0814   1       18       -1.690023  hypothetical protein
SAV0815   1       20       -1.449488  staphylococcal nuclease
SAV0816   2       20       -3.371182  cold-shock protein C
SAV0817   1       18       -4.291674  hypothetical protein
SAV0818   1       17       -4.708102  hypothetical protein
SAV0819   1       16       -2.786024  conserved hypothetical protein
SAV0820   3       16       -3.062364  hypothetical protein
SAV0821   0       14       -3.119346  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0803   FERRIBNDNGPP  60     2e-12    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 59.6 bits (144), Expect = 2e-12
Identities = 49/248 (19%), Positives = 96/248 (38%), Gaps = 21/248 (8%)

Query: 48 PKRVAVLTGFYVGDFIKLGIKPIAVSDITK-DSSILKPYL-KGVDYIG---ENDVEKVAK 102
P R+ L V + LGI P V+D + +P L V +G E ++E + +
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTE 94

Query: 103 AKPDLIVVDA-MDKNIKKYQKIAPTVPYTYNKYNH-----KEILKEIGNLTNNEDKAKKW 156
KP +V A + + +IAP + ++ ++ L E+ +L N + A+
Sbjct: 95 MKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETH 154

Query: 157 IEEWEDKTRKDKKEIQSKIGQATASVFEPDEKQIYIYNSTWGRGLDIVHDAFGMPMTKQY 216
+ ++ED R K + + D + + ++ + D +G+P Q
Sbjct: 155 LAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGP--NSLFQEILDEYGIPNAWQG 212

Query: 217 KDKLQEDKKGYASISKENISKYA-GDYIFLSKPSYGKFD-FEKTHTWQNIEAVKKGHVIS 274
+ + G ++S + ++ Y D + + D T WQ + V+ G
Sbjct: 213 ----ETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRF-- 266

Query: 275 YKAEDYWF 282
+ WF
Sbjct: 267 QRVPAVWF 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0810   ALARACEMASE   27     0.049    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 26.7 bits (59), Expect = 0.049
Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 2/37 (5%)

Query: 135 MYDIYP-PYDGIPDEAFLI-KELKVNSLAGKTGTINY 169
D+ P P GI L KE+K++ +A GT+ Y
Sbjct: 305 AVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGY 341
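
Note: this hit (Expect = 0.049) sits just under the E-value (RPSBlast) cutoff of 0.05 listed in the input parameters. The following is a minimal Python sketch of such a cutoff, applied to three of the hits reported for island 10; it is not part of the PredictBias output. The `Hit` type and `filter_hits` function are hypothetical names, the fourth record is an invented example that is not in the report, and whether the real comparison is inclusive is an assumption.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    locus_tag: str
    profile: str
    bit_score: float
    e_value: float

def filter_hits(hits, max_e_value=0.05):
    """Keep hits whose E-value does not exceed the cutoff (inclusive bound is an assumption)."""
    return [hit for hit in hits if hit.e_value <= max_e_value]

hits = [
    Hit("SAV0803", "FERRIBNDNGPP", 59.6, 2e-12),
    Hit("SAV0810", "ALARACEMASE", 26.7, 0.049),
    Hit("SAV0811", "ICENUCLEATIN", 40.5, 4e-05),
    Hit("EXAMPLE_ORF", "EXAMPLE_PROFILE", 20.0, 0.2),  # hypothetical record, not from the report
]

kept = filter_hits(hits)
# List the retained hits from most to least significant.
for hit in sorted(kept, key=lambda h: h.e_value):
    print(hit.locus_tag, hit.profile, hit.e_value)
```

Lowering `max_e_value` would drop weak matches such as the ALARACEMASE hit first, since they lie closest to the cutoff.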


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0811   ICENUCLEATIN  41     4e-05    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 40.5 bits (94), Expect = 4e-05
Identities = 64/322 (19%), Positives = 116/322 (36%), Gaps = 4/322 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + EDS G S + S +G ST +G+DS+ + S + +S
Sbjct: 357 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQ 416

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S + SD + S + DS+ + S + DS + S +
Sbjct: 417 TAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 476

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
SD + S S + +S + S + S + S + ++SD + S S
Sbjct: 477 GSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTS 536

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
+ ++S + S + +S + S + SD + S + SDS +
Sbjct: 537 TAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGY 596

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDS----GSDSDSSSDSDSDS 847
S + S + S + S + S S + +DS G S ++ +S
Sbjct: 597 GSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 656

Query: 848 TSDTGSDNDSDSDSNSDSESGS 869
T+ GS + S+ + GS
Sbjct: 657 TAGYGSTQTAQEGSDLTAGYGS 678



Score = 40.1 bits (93), Expect = 4e-05
Identities = 64/322 (19%), Positives = 117/322 (36%), Gaps = 4/322 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + EDS G S + S +G ST +G+DS+ + S + +S
Sbjct: 261 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQ 320

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S + SD + S + DS+ + S + DS + S +
Sbjct: 321 TAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 380

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
SD + S + +DS + S + +S + S + SD + S
Sbjct: 381 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 440

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 441 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGY 500

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDS----GSDSDSSSDSDSDS 847
S + S + S ++++SD + S S + ++S G S ++ +S
Sbjct: 501 GSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVL 560

Query: 848 TSDTGSDNDSDSDSNSDSESGS 869
T+ GS + S+ + GS
Sbjct: 561 TAGYGSTQTAREGSDLTAGYGS 582



Score = 40.1 bits (93), Expect = 5e-05
Identities = 63/319 (19%), Positives = 116/319 (36%), Gaps = 4/319 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + + SD G S + +DS +G ST +G +S + S +
Sbjct: 373 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYG----STQTAQK 428

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
SD + S + DS+ + S + DS + S + SD + S S
Sbjct: 429 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTS 488

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
+ +S + S + S + S + ++SD + S S + ++S +
Sbjct: 489 TAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGY 548

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
S + +S + S + SD + S + SDS + S + S
Sbjct: 549 GSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSL 608

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 851
+ S + S + S S + +DS + S +G +S ++ S T+
Sbjct: 609 TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQE 668

Query: 852 GSDNDSDSDSNSDSESGSN 870
GSD + S S + + S+
Sbjct: 669 GSDLTAGYGSTSTAGADSS 687



Score = 39.4 bits (91), Expect = 7e-05
Identities = 61/306 (19%), Positives = 112/306 (36%), Gaps = 2/306 (0%)

Query: 566 GSDSGSDSNSDSGSDSGSDSTSDSGSD--SASDSDSASDSDSASDSDSASDSDSASDSDS 623
GS + S + GS T+ GSD + S + DS+ + S + DS
Sbjct: 309 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 368

Query: 624 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDS 683
+ S + SD + S + +DS+ + S + +S + S +
Sbjct: 369 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 428

Query: 684 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 743
SD + S + DS + S + DS + S + SD + S S
Sbjct: 429 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTS 488

Query: 744 DSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSES 803
+ +S + S + S + S ++++SD + S S + ++S +
Sbjct: 489 TAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGY 548

Query: 804 DSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNS 863
S + +S + S + SD +G S ++ SDS + GS + S+
Sbjct: 549 GSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSL 608

Query: 864 DSESGS 869
+ GS
Sbjct: 609 TAGYGS 614



Score = 38.2 bits (88), Expect = 2e-04
Identities = 61/306 (19%), Positives = 112/306 (36%), Gaps = 2/306 (0%)

Query: 566 GSDSGSDSNSDSGSDSGSDSTSDSGSD--SASDSDSASDSDSASDSDSASDSDSASDSDS 623
GS + S + GS T+ GSD + S + DS+ + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 624 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDS 683
+ S + SD + S S + +S+ + S + S + S + +
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 684 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 743
+SD + S S + ++S + S + +S + S + SD + S
Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584

Query: 744 DSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSES 803
+ SDS + S + S + S + S + S S + +DS +
Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644

Query: 804 DSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNS 863
S + +S + S + SD +G S S++ +DS + GS + +S
Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704

Query: 864 DSESGS 869
+ GS
Sbjct: 705 TAGYGS 710



Score = 37.8 bits (87), Expect = 3e-04
Identities = 64/322 (19%), Positives = 113/322 (35%), Gaps = 4/322 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + E+S G S + S +G ST +G DS+ + S + DS
Sbjct: 309 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 368

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S + SD + S + +DS+ + S + +S + S +
Sbjct: 369 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 428

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
SD + S + DS + S + DS + S + SD + S S
Sbjct: 429 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTS 488

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
+ +S + S + S + S + + SD + S S + ++S +
Sbjct: 489 TAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGY 548

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDS----GSDSDSSSDSDSDS 847
S + +S + S + SD + S + SDS G S ++ S
Sbjct: 549 GSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSL 608

Query: 848 TSDTGSDNDSDSDSNSDSESGS 869
T+ GS + S + GS
Sbjct: 609 TAGYGSTQTAREQSVLTTGYGS 630



Score = 37.8 bits (87), Expect = 3e-04
Identities = 66/331 (19%), Positives = 121/331 (36%), Gaps = 4/331 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + E+S G S + S +G ST +G DS+ + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S + SD + S S + +S+ + S + S + S + +
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
+SD + S S + ++S + S + +S + S + SD + S
Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
+ SDS + S + S + S + S + S S + +DS +
Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDS----GSDSDSSSDSDSDS 847
S + +S + S ++ SD + S S + +DS G S ++ +S
Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704

Query: 848 TSDTGSDNDSDSDSNSDSESGSNNNVVPPNS 878
T+ GS + S+ S GS + +S
Sbjct: 705 TAGYGSTQTAQEGSDLTSGYGSTSTAGADSS 735



Score = 37.4 bits (86), Expect = 3e-04
Identities = 57/301 (18%), Positives = 105/301 (34%)

Query: 569 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 628
+G +S+ +G S S + S + DS+ + S + DS +
Sbjct: 218 AGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYG 277

Query: 629 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 688
S + SD + S + +DS+ + S + +S + S + SD
Sbjct: 278 STQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLT 337

Query: 689 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 748
+ S + DS + S + DS + S + SD + S + +D
Sbjct: 338 AGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGAD 397

Query: 749 SDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSD 808
S + S + +S + S ++ SD + S + DS + S
Sbjct: 398 SSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQT 457

Query: 809 SDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESG 868
+ DS + S + SD +G S S++ +S + GS + S + G
Sbjct: 458 AGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYG 517

Query: 869 S 869
S
Sbjct: 518 S 518



Score = 37.0 bits (85), Expect = 4e-04
Identities = 60/303 (19%), Positives = 106/303 (34%), Gaps = 4/303 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + + SD G S + DS +G ST +G DS+ + S + SD
Sbjct: 229 GSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 288

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S + +DS+ + S + +S + S + SD + S
Sbjct: 289 TA----GYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 344

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
+ DS + S + DS + S + SD + S + +DS +
Sbjct: 345 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGY 404

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
S + +S + S + SD + S + DS + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 851
+ S ++ SD + S S + +S + S +G S ++ S T+
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 852 GSD 854
SD
Sbjct: 525 ESD 527



Score = 37.0 bits (85), Expect = 4e-04
Identities = 59/306 (19%), Positives = 105/306 (34%), Gaps = 2/306 (0%)

Query: 566 GSDSGSDSNSDSGSDSGSDSTSDSGSD--SASDSDSASDSDSASDSDSASDSDSASDSDS 623
GS S + GS T+ S + S + +DS + S + +S
Sbjct: 165 GSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQ 224

Query: 624 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDS 683
+ S SD + S + DS+ + S + DS + S +
Sbjct: 225 MAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 284

Query: 684 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 743
SD + S + +DS + S + +S + S + SD + S
Sbjct: 285 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 344

Query: 744 DSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSES 803
+ DS + S + DS + S ++ SD + S + +DS +
Sbjct: 345 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGY 404

Query: 804 DSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNS 863
S + +S + S + SD +G S ++ DS + GS + DS+
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 864 DSESGS 869
+ GS
Sbjct: 465 TAGYGS 470



Score = 36.3 bits (83), Expect = 6e-04
Identities = 57/310 (18%), Positives = 104/310 (33%)

Query: 545 PEQPDEPGEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSD 604
P PD E++ D+ +S S + + +T S S +
Sbjct: 122 PGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYG 181

Query: 605 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 664
S + +S + S + +DS + S + +S + S SD
Sbjct: 182 STETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLT 241

Query: 665 SASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 724
+ S + DS + S + DS + S + SD + S + +D
Sbjct: 242 AGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGAD 301

Query: 725 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSD 784
S + S + +S + S + SD + S + DS + S
Sbjct: 302 SSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQT 361

Query: 785 SDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSD 844
+ DS + S ++ SD + S + +DS + S +G +S ++
Sbjct: 362 AGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYG 421

Query: 845 SDSTSDTGSD 854
S T+ GSD
Sbjct: 422 STQTAQKGSD 431



Score = 36.3 bits (83), Expect = 6e-04
Identities = 62/306 (20%), Positives = 116/306 (37%), Gaps = 2/306 (0%)

Query: 566 GSDSGSDSNSDSGSDSGSDSTSDSGSD--SASDSDSASDSDSASDSDSASDSDSASDSDS 623
GS + +S + GS T+ GSD + S S + +S+ + S + S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTL 512

Query: 624 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDS 683
+ S + + SD + S S + ++S+ + S ++ +S + S +
Sbjct: 513 TAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTARE 572

Query: 684 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 743
SD + S + SDS + S + S + S + S + S S
Sbjct: 573 GSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTS 632

Query: 744 DSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSES 803
+ +DS + S + +S + S ++ SD + S S + +DS +
Sbjct: 633 TAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGY 692

Query: 804 DSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNS 863
S + +S + S + SD SG S S++ +DS + GS + S+
Sbjct: 693 GSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSL 752

Query: 864 DSESGS 869
+ GS
Sbjct: 753 TAGYGS 758



Score = 36.3 bits (83), Expect = 8e-04
Identities = 66/315 (20%), Positives = 118/315 (37%), Gaps = 6/315 (1%)

Query: 561 SDSDPGSDSGSDSNSDSGSD--SGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSA 618
DS + GS + GSD +G STS +G +S+ + S + S + S
Sbjct: 460 EDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGST 519

Query: 619 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSD 678
+ + SD + S S + ++S+ + S ++ +S + S + SD +
Sbjct: 520 QTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAG 579

Query: 679 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 738
S + SDS + S + S + S + S + S S + +DS
Sbjct: 580 YGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSS 639

Query: 739 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSD 798
+ S + +S + S + SD + S S + +DS + S +
Sbjct: 640 LIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAG 699

Query: 799 SDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDS----GSDSDSSSDSDSDSTSDTGSD 854
+S + S ++ SD S S S + +DS G S ++ S T+ GS
Sbjct: 700 YNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGST 759

Query: 855 NDSDSDSNSDSESGS 869
+ S + GS
Sbjct: 760 QTAREQSVLTTGYGS 774



Score = 35.5 bits (81), Expect = 0.001
Identities = 66/322 (20%), Positives = 118/322 (36%), Gaps = 4/322 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + + SD G S S + +S +G ST +G S + S + ++SD
Sbjct: 469 GSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDL 528

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S S + + S + S + +S + S + SD + S + S
Sbjct: 529 ITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGS 588

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD----SDSDS 727
DS + S + S + S + S + S S + +DS S
Sbjct: 589 DSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQ 648

Query: 728 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDS 787
+ +S + S + SD + S S + +DS + S + +S +
Sbjct: 649 TAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGY 708

Query: 788 DSDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDS 847
S + SD S S S + +DS + S + S +G S ++ S
Sbjct: 709 GSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 768

Query: 848 TSDTGSDNDSDSDSNSDSESGS 869
T+ GS + + +DS+ + GS
Sbjct: 769 TTGYGSTSTAGADSSLIAGYGS 790



Score = 34.3 bits (78), Expect = 0.002
Identities = 58/298 (19%), Positives = 105/298 (35%), Gaps = 2/298 (0%)

Query: 575 SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 634
S + GSD T+ GS + DS+ + S + DS + S + SD
Sbjct: 326 STQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLT 385

Query: 635 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSD 694
+ S + +DS+ + S + +S + S + SD + S + D
Sbjct: 386 AGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDD 445

Query: 695 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 754
S + S + DS + S + SD + S S + +S + S
Sbjct: 446 SSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQT 505

Query: 755 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSD--SDSDSDSDSESDSDSDSDSD 812
+ S + S + ++SD + S S + ++S + S + +S +
Sbjct: 506 AGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYG 565

Query: 813 SESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSN 870
S + SD + S +GSDS + S T+ S + S + S
Sbjct: 566 STQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSV 623



Score = 34.3 bits (78), Expect = 0.003
Identities = 58/301 (19%), Positives = 112/301 (37%)

Query: 569 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 628
+G S +G S + ++S + S S + ++S+ + S ++ +S +
Sbjct: 506 AGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYG 565

Query: 629 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 688
S + SD + S + SDS+ + S ++ S + S + S
Sbjct: 566 STQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLT 625

Query: 689 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 748
+ S S + +DS + S + +S + S + SD + S S + +D
Sbjct: 626 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 749 SDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSD 808
S + S + +S + S ++ SD S S S + +DS + S
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 809 SDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESG 868
+ S + S + S +G S S++ +DS + GS + S + G
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYG 805

Query: 869 S 869
S
Sbjct: 806 S 806



Score = 34.3 bits (78), Expect = 0.003
Identities = 62/319 (19%), Positives = 111/319 (34%), Gaps = 4/319 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + E+S G S S +G ST +G DS+ + S + DS
Sbjct: 213 GSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 272

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
+ S + SD + S + +DS+ + S + +S + S +
Sbjct: 273 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 332

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
SD + S + DS + S + DS + S + SD + S
Sbjct: 333 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTG 392

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
+ +DS + S + +S + S + SD + S + DS +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 851
S + DS + S ++ SD + S S +G +S + S T+
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGY----GSTSTAGYESSLIAGYGSTQTAGY 508

Query: 852 GSDNDSDSDSNSDSESGSN 870
GS + S +++ S+
Sbjct: 509 GSTLTAGYGSTQTAQNESD 527



Score = 34.3 bits (78), Expect = 0.003
Identities = 60/315 (19%), Positives = 109/315 (34%), Gaps = 2/315 (0%)

Query: 561 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASD 620
S G+DS + S +G +S+ +G S SD + S + DS+
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 621 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSD 680
+ S + DS + S + SD + S + +DS+ + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 681 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 740
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 741 SDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 800
+ SD + S + +DS + S + +S + S + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 801 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSD--SDSDSTSDTGSDNDSD 858
S + DS + S + DS + S + SD + STS G ++
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 859 SDSNSDSESGSNNNV 873
+ S +G + +
Sbjct: 498 AGYGSTQTAGYGSTL 512



Score = 34.0 bits (77), Expect = 0.004
Identities = 58/306 (18%), Positives = 113/306 (36%), Gaps = 2/306 (0%)

Query: 566 GSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 625
GS + +S + GS T+ S + S + SD + S + +DS+
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 626 DSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDS 685
+ S + +S + S + SD + S + DS + S +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 686 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 745
DS + S + SD + S S + +S + S + S + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 746 DSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDS 805
+ ++SD + S S + ++S+ + S + +S + S + SD +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 806 DSDSDSDSESDSDSDSDSDSDSASDSD--SGSDSDSSSDSDSDSTSDTGSDNDSDSDSNS 863
S + S+S + S ++ S +G S ++ S T+ GS + + +DS+
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 864 DSESGS 869
+ GS
Sbjct: 641 IAGYGS 646



Score = 34.0 bits (77), Expect = 0.004
Identities = 60/303 (19%), Positives = 108/303 (35%), Gaps = 4/303 (1%)

Query: 552 GEIEPIPEDSDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDS 611
G + +SD G S S + ++S +G ST + +S + S +
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYG----STQTARE 572

Query: 612 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDS 671
SD + S + SDS+ + S ++ S + S + S + S S
Sbjct: 573 GSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTS 632

Query: 672 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 731
+ +DS + S + +S + S + SD + S S + +DS +
Sbjct: 633 TAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGY 692

Query: 732 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDS 791
S + +S + S + SD S S S + +DS + S + S
Sbjct: 693 GSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSL 752

Query: 792 DSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDT 851
+ S + S + S S + +DS + S +G S ++ S T+
Sbjct: 753 TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQE 812

Query: 852 GSD 854
SD
Sbjct: 813 RSD 815



Score = 33.6 bits (76), Expect = 0.005
Identities = 62/313 (19%), Positives = 115/313 (36%), Gaps = 2/313 (0%)

Query: 561 SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASD 620
S S G +S + S +G ST +G S + + SD + S S + ++S+
Sbjct: 486 STSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLI 545

Query: 621 SDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSD 680
+ S ++ +S + S + SD + S + SDS+ + S +
Sbjct: 546 AGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYH 605

Query: 681 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 740
S + S + S + S S + +DS + S + +S + S
Sbjct: 606 SSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQT 665

Query: 741 SDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSD 800
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 666 AQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYG 725

Query: 801 SESDSDSDSDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSD--SDSDSTSDTGSDNDSD 858
S S + +DS + S + S + S + S + STS G+D+
Sbjct: 726 STSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLI 785

Query: 859 SDSNSDSESGSNN 871
+ S +G ++
Sbjct: 786 AGYGSTQTAGYHS 798



Score = 33.2 bits (75), Expect = 0.007
Identities = 58/301 (19%), Positives = 108/301 (35%)

Query: 569 SGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 628
+ +S +G S + S + S + SDS+ + S ++ S +
Sbjct: 554 ASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYG 613

Query: 629 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSD 688
S + S + S S + +DS+ + S + +S + S + SD
Sbjct: 614 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLT 673

Query: 689 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 748
+ S S + +DS + S + +S + S + SD S S S + +D
Sbjct: 674 AGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGAD 733

Query: 749 SDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSD 808
S + S + S + S + S + S S + +DS + S
Sbjct: 734 SSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQT 793

Query: 809 SDSDSESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESG 868
+ S + S + SD +G S S++ +DS + GS + +S + G
Sbjct: 794 AGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYG 853

Query: 869 S 869
S
Sbjct: 854 S 854



Score = 32.8 bits (74), Expect = 0.009
Identities = 50/278 (17%), Positives = 94/278 (33%)

Query: 592 DSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 651
D+ +S S + + + S S + S + +S + S + +
Sbjct: 145 DATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGA 204

Query: 652 DSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 711
DS + S + +S + S SD + S + DS + S
Sbjct: 205 DSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 264

Query: 712 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDS 771
+ DS + S + SD + S + +DS + S + +S +
Sbjct: 265 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY 324

Query: 772 DSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSDSASDS 831
S ++ SD + S + DS + S + DS + S + SD
Sbjct: 325 GSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384

Query: 832 DSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGS 869
+G S ++ +DS + GS + +S + GS
Sbjct: 385 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422



Score = 32.0 bits (72), Expect = 0.015
Identities = 55/298 (18%), Positives = 100/298 (33%), Gaps = 2/298 (0%)

Query: 575 SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 634
S + S + GS + +DS + S + +S + S SD
Sbjct: 182 STETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLT 241

Query: 635 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSD 694
+ S + DS+ + S + DS + S + SD + S + +D
Sbjct: 242 AGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGAD 301

Query: 695 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 754
S + S + +S + S + SD + S + DS + S
Sbjct: 302 SSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQT 361

Query: 755 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSD--SDSDSDSDSESDSDSDSDSD 812
+ DS + S + SD + S + +DS + S + +S +
Sbjct: 362 AGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYG 421

Query: 813 SESDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSN 870
S + SD + S +G DS + S T+ S + S ++ GS+
Sbjct: 422 STQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSD 479



Score = 32.0 bits (72), Expect = 0.015
Identities = 60/300 (20%), Positives = 110/300 (36%)

Query: 575 SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 634
S + GSD T+ GS + SDS+ + S ++ S + S + S
Sbjct: 566 STQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLT 625

Query: 635 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSD 694
+ S S + +DS+ + S + +S + S + SD + S S + +D
Sbjct: 626 TGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGAD 685

Query: 695 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 754
S + S + +S + S + SD S S S + +DS + S
Sbjct: 686 SSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQT 745

Query: 755 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSE 814
+ S + S + S + S S + +DS + S + S +
Sbjct: 746 ASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYG 805

Query: 815 SDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVV 874
S + SD + S S + +DSS + ST G ++ + S + N+++
Sbjct: 806 STQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 865



Score = 30.5 bits (68), Expect = 0.043
Identities = 59/300 (19%), Positives = 110/300 (36%)

Query: 575 SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD 634
S + S T+ GS S + +DS+ + S + +S + S + SD
Sbjct: 614 STQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLT 673

Query: 635 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSD 694
+ S S + +DS+ + S + +S + S + SD S S S + +D
Sbjct: 674 AGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGAD 733

Query: 695 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 754
S + S + S + S + S + S S + +DS + S
Sbjct: 734 SSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQT 793

Query: 755 SDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSE 814
+ S + S + SD + S S + +DS + S + +S +
Sbjct: 794 AGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYG 853

Query: 815 SDSDSDSDSDSDSASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVV 874
S + +SD + S S + DSS + ST G ++ + S + N+++
Sbjct: 854 STQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLT 913


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0812   IGASERPTASE   31     0.017    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.8 bits (69), Expect = 0.017
Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 4/162 (2%)

Query: 261 ALKLKADTEAAKNDVSKRSKRSLNTQNNKST-TQEISEEQKAEYQRKSEALKERFINRQK 319
+KA+T+ + S + T K T T E E+ K E ++ E K
Sbjct: 1073 KSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT-SQVSP 1131

Query: 320 SKNESVVSLIDDEDDNENDRQLVVSAPSKKPTTPTTYTETTTQVPMPTVERQTQQQIVYK 379
+ +S E END + + P + T + + + T+ V
Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNT 1191

Query: 380 TPKPLAGLNGESHDFTTTHQSPTTSNHTHNNVVEFEETSALP 421
+ N E+ TT + + + ++P
Sbjct: 1192 GNSVVE--NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP 1231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0819   PF05704       28     0.035    Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 27.5 bits (61), Expect = 0.035
Identities = 13/69 (18%), Positives = 24/69 (34%), Gaps = 7/69 (10%)

Query: 116 EWVKKNYENTNHRYLVTLNLNSK-------KFTYCTKIIYQAYKFGVSEKSVKSYGLHII 168
W + Y N + +++ N + + YK + +Y HI
Sbjct: 239 YWKEIPYVNNVNPHMLQYLGNLPYDNSMFNYIKSTSPVQKLTYKLDYNNLKRNTYYDHIF 298

Query: 169 SPYAIKDNF 177
S +KDN+
Sbjct: 299 SIDKLKDNY 307


11  SAV0847  SAV0914  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0847   2       23       -2.367732  integrase
SAV0848   4       25       -3.184471  excisionase
SAV0849   3       25       -1.031126  hypothetical protein
SAV0850   3       28       0.758344   hypothetical protein
SAV0851   7       28       0.503537   hypothetical protein
SAV0852   5       27       0.668757   hypothetical protein
SAV0854   6       30       0.477063   hypothetical protein
SAV0855   6       29       0.749778   similar to anti-repressor
SAV0856   8       35       -0.300378  hypothetical protein
SAV0857   8       35       -0.419959  hypothetical protein
SAV0858   8       35       -0.301244  hypothetical protein
SAV0859   8       32       1.226758   hypothetical protein
SAV0860   5       25       0.213540   hypothetical protein
SAV0861   4       28       2.290898   hypothetical protein
SAV0862   2       25       1.250480   hypothetical protein
SAV0863   2       27       0.619467   hypothetical protein
SAV0864   3       28       0.801824   single strand DNA binding protein
SAV0865   3       27       0.481161   hypothetical protein
SAV0866   5       31       1.124799   hypothetical protein
SAV0867   5       27       -0.666115  hypothetical protein
SAV0868   6       30       0.431898   hypothetical protein
SAV0869   6       32       1.492038   hypothetical protein
SAV0870   4       28       2.170343   hypothetical protein
SAV0871   2       26       1.133823   hypothetical protein
SAV0872   2       25       1.292388   hypothetical protein
SAV0873   2       25       1.580444   phi PVL ORF 50
SAV0874   2       26       0.428352   phi PVL ORF 51 homolog
SAV0875   3       25       -1.282414  phi PVL ORF 52 homolog
SAV0876   3       21       -2.108320  similar to phi ETA orf 34-like protein
SAV0877   4       25       -1.166674  hypothetical protein
SAV0878   6       27       -1.794749  hypothetical protein
SAV0879   7       29       -1.894187  hypothetical protein
SAV0880   9       27       -1.327083  hypothetical protein
SAV0881   5       25       -0.312432  hypothetical protein
SAV0882   6       24       -0.139199  int gene activator RinB
SAV0883   5       21       0.011300   hypothetical protein
SAV0884   5       21       -0.054874  hypothetical protein
SAV0885   6       19       0.196020   phage terminase, small subunit
SAV0886   5       20       0.289324   phage terminase large subunit
SAV0887   6       19       0.528298   hypothetical protein
SAV0888   5       18       0.892105   hypothetical protein
SAV0889   5       21       0.648141   hypothetical protein
SAV0890   4       22       0.808791   hypothetical protein
SAV0891   0       21       0.988567   hypothetical protein
SAV0892   1       19       0.855143   hypothetical protein
SAV0893   -1      22       1.412514   hypothetical protein
SAV0894   0       21       1.745435   hypothetical protein
SAV0895   0       20       1.591245   hypothetical protein
SAV0896   2       18       1.185178   hypothetical protein
SAV0897   2       18       0.575919   hypothetical protein
SAV0898   1       17       0.876571   hypothetical protein
SAV0899   1       18       0.923489   hypothetical protein
SAV0900   1       17       0.952848   hypothetical protein
SAV0901   2       17       1.025075   hypothetical protein
SAV0902   2       18       0.978252   phi ETA orf 54-like protein
SAV0903   2       18       1.377375   phi ETA orf 55-like protein
SAV0904   2       20       1.578555   phi ETA orf 56-like protein
SAV0905   1       19       1.238440   phiETA ORF57-like protein
SAV0906   2       20       0.990442   phiETA ORF58-like protein
SAV0907   2       19       1.266225   phiETA ORF59-like protein
SAV0908   2       19       2.714831   phiETA ORF60-like protein
SAV0909   2       20       2.968194   cell wall hydrolase
SAV0910   1       19       2.898385   tail fiber
SAV0911   0       21       2.518529   phi ETA orf 63-like protein
SAV0912   -1      23       1.672078   holin
SAV0913   0       20       0.248489   amidase
SAV0914   -2      16       -3.547775  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0851   TETREPRESSOR  26     0.026    Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 26.4 bits (58), Expect = 0.026
Identities = 12/27 (44%), Positives = 19/27 (70%), Gaps = 3/27 (11%)

Query: 16 ELMNDKNID---QRELAEAIGVSQPTV 39
EL+N+ ID R+LA+ +G+ QPT+
Sbjct: 15 ELLNETGIDGLTTRKLAQKLGIEQPTL 41


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0873   PF06580       27     0.019    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.1 bits (60), Expect = 0.019
Identities = 9/36 (25%), Positives = 18/36 (50%), Gaps = 5/36 (13%)

Query: 67 ERLEQARLERKLERQRKKEAELR----RKKPH-LFN 97
+ +QA +++ +EA+L + PH +FN
Sbjct: 142 KNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFN 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0890   IGASERPTASE   28     0.020    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.5 bits (63), Expect = 0.020
Identities = 29/181 (16%), Positives = 65/181 (35%), Gaps = 8/181 (4%)

Query: 24 KSKDNNDDEGKDKQ-DKKTNSEEEIEKRLQEEYNKRLKEELSRRMKQKEKEKQEAVDEAK 82
+D + ++++ K+ S + + E + + ++ + KE E ++
Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK-- 1111

Query: 83 RLAKMNKDQIAEYEREQMEKELEQLRSEKQLNEMRSEARKMLSE-AEVDSSDEVVNLVVT 141
AK+ ++ E + + +Q +SE + + + S
Sbjct: 1112 --AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 142 DTAEQTKSNVEA--FSNAVKKAVNEAVKVNARQSPLTGGDSFNHSTKNKPQNLAEIARQK 199
A++T SNVE + N V+ +P T + N + NKP+N + +
Sbjct: 1170 QPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRS 1229

Query: 200 R 200

Sbjct: 1230 V 1230


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0891   RTXTOXINA     26     0.028    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 26.1 bits (57), Expect = 0.028
Identities = 19/46 (41%), Positives = 25/46 (54%), Gaps = 3/46 (6%)

Query: 21 PQVFNP--DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE 64
QVF+P N+ + + K TLL F TP+L E + Q GKYE
Sbjct: 522 KQVFDPLKGNIDLSDSKSSTLLK-FVTPLLTPGEEIRERRQSGKYE 566


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0895   YERSSTKINASE  27     0.014    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 27.0 bits (59), Expect = 0.014
Identities = 10/29 (34%), Positives = 18/29 (62%)

Query: 61 RIKESISHPVSHVLVNGIRYKIVDTRIYR 89
RI + +PV + + G RY+I+D ++ R
Sbjct: 22 RISQHWQNPVGELNIGGKRYRIIDNQVLR 50


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0901   GPOSANCHOR    50     5e-08    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 49.7 bits (118), Expect = 5e-08
Identities = 32/192 (16%), Positives = 69/192 (35%), Gaps = 21/192 (10%)

Query: 11 EASVAKFKKQIDSAVKSVQKFKRVADQTKDVELNANDKKLQKTIKVAKKSLDAFSNKKVK 70
A + + + + ++ + + IK + A ++
Sbjct: 210 SAKIKTLEAEKAALAARKADLEKALEGAMNFS-----TADSAKIKTLEAEKAALEARQ-- 262

Query: 71 AKLDAKIEDLQQKALEASFELNQLDSKEVTPEVKLQKQKLTKDIAEAEAK--LSELEKKR 128
A+L+ +E + S ++ L++++ E + + + A + +L+ R
Sbjct: 263 AELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASR 322

Query: 129 VNI-DVNADNSKFNRVLKVSKASLEALNRSKAKAILDVDNSVANSK---IKRTK-EELKS 183
+ A++ K K+S+AS ++L R D+D S K + K EE
Sbjct: 323 EAKKQLEAEHQKLEEQNKISEASRQSLRR-------DLDASREAKKQLEAEHQKLEEQNK 375

Query: 184 IPNKTRSRLDVD 195
I +R L D
Sbjct: 376 ISEASRQSLRRD 387



Score = 47.4 bits (112), Expect = 3e-07
Identities = 36/134 (26%), Positives = 63/134 (47%), Gaps = 23/134 (17%)

Query: 7 KATIEASVAKFKKQIDSAVKSVQKFKRVADQTKDV--ELNANDKKLQKTIKVAKKSLDAF 64
KA +EA A + Q + Q +R D +++ +L A +KL++ K+++ S
Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASR--- 346

Query: 65 SNKKVKAKLDAKIEDLQQKALEASFELNQLDSKEVTPEVKLQ------------KQKLTK 112
+ ++ LDA E +K LEA E +L+ + E Q K+++ K
Sbjct: 347 --QSLRRDLDASRE--AKKQLEA--EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEK 400

Query: 113 DIAEAEAKLSELEK 126
+ EA +KL+ LEK
Sbjct: 401 ALEEANSKLAALEK 414



Score = 45.4 bits (107), Expect = 1e-06
Identities = 37/224 (16%), Positives = 68/224 (30%), Gaps = 27/224 (12%)

Query: 9 TIEASVAKFKKQIDSAVKSVQKFKRVADQTKDVELNANDK-KLQKTIKVAKKSLDAFSNK 67
+ + K +K S + K + + + D+E K+L +
Sbjct: 93 ELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTL-----E 147

Query: 68 KVKAKLDAKIEDLQQKALEASFELNQLDSKEVTPEVKLQKQKLTKDIAEAEAKLSELEKK 127
KA L A+ DL++ ALE + + DS + + L + A EA+ +ELEK
Sbjct: 148 AEKAALAARKADLEK-ALEGAMNFSTADSAK--------IKTLEAEKAALEARQAELEKA 198

Query: 128 -----RVNIDVNADNSKFNRVLKVSKASLEALNRSKAKA-----ILDVDNSVANSKIKRT 177
+ +A A L ++ A ++
Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258

Query: 178 KEELKSIPNKTRS--RLDVDTRLSIPTIYAFKKSLDALPNKKTT 219
+ + I T+ A K +L+A
Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEH 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0912   SECETRNLCASE  26     0.033    Bacterial translocase SecE signature.
		>SECETRNLCASE#Bacterial translocase SecE signature.

Length = 127

Score = 26.4 bits (58), Expect = 0.033
Identities = 14/81 (17%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 22 LLLFIKQITDLFGLDLSTQLNQASAIIGAILTLLTGIGVITDPTSKGVSDSSIAQTYQAP 81
+++ + + G L + + ++ + GV T+KG + + A
Sbjct: 20 VVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVALL-TTKGKATVAFA------ 72

Query: 82 RDSKKEEQQVTWKSSQDSSLT 102
R+++ E ++V W + Q++ T
Sbjct: 73 REARTEVRKVIWPTRQETLHT 93


12  SAV0973  SAV0980  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0973   0       10       -3.083557  hypothetical protein
SAV0974   1       11       -4.018932  similar to lipopolysaccharide modification
SAV0975   1       13       -4.208312  ClpB chaperone homologue
SAV0976   3       17       -6.455079  hypothetical protein
SAV0977   4       19       -6.245195  hypothetical protein
SAV0978   3       19       -4.117927  conserved hypothetical protein
SAV0979   2       14       -0.803051  hypothetical protein
SAV0980   2       14       1.274404   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0975   IGASERPTASE   36     7e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 7e-04
Identities = 17/143 (11%), Positives = 48/143 (33%), Gaps = 14/143 (9%)

Query: 420 QLEIEESALKNESDNASKQRLQELQEELANEKEKQAALQSRVESEKEKIANLQEKRAQLD 479
E+ +S + + Q + + ++EK + + + + + K+ Q +
Sbjct: 1082 TNEVAQSGSETKE----TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137

Query: 480 ESRQALEDAQTNNNLEKAAELQYGTIPQLEKELRELEDNFQDEQGEDTDRMIREVVTDEE 539
+ E A+ N+ E Q + + ++ ++T + + VT+
Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQ-------SQTNTTAD---TEQPAKETSSNVEQPVTEST 1187

Query: 540 IGDIVSQWTGIPVSKLVETEREK 562
+ + P + T +
Sbjct: 1188 TVNTGNSVVENPENTTPATTQPT 1210


13  SAV1026  SAV1036  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1026   -2      17       -3.515214  competence transcription factor
SAV1027   -2      18       -2.925589  hypothetical protein
SAV1028   -2      13       -4.872377  lipoate-protein ligase homolog
SAV1029   1       16       -5.153630  conserved hypothetical protein
SAV1030   1       16       -5.334041  hypothetical protein
SAV1031   1       15       -5.339113  hypothetical protein
SAV1032   -1      15       -5.182914  hypothetical protein
SAV1033   0       15       -4.449165  hypothetical protein
SAV1034   0       19       -2.975977  conserved hypothetical protein
SAV1035   0       16       -2.662544  similar to ABC transporter (ATP-binding
SAV1036   4       15       -0.913531  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1032   PRTACTNFAMLY  25     0.043    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 25.4 bits (55), Expect = 0.043
Identities = 13/34 (38%), Positives = 18/34 (52%), Gaps = 4/34 (11%)

Query: 11 ALGLSTAAYASTEYAEGGT----WSHGVGSKYVW 40
ALG + YAS EY++G W+ G +Y W
Sbjct: 877 ALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910


14  SAV1125  SAV1131  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1125   5       16       -0.689681  phosphopantetheine adenyltransferase homologue
SAV1126   5       16       -0.619331  conserved hypothetical protein
SAV1127   5       15       -1.334840  conserved hypothetical protein
SAV1128   5       14       -1.687776  ribosomal protein L32
SAV1129   4       13       -1.557964  iron-regulated cell wall-anchored protein SirH
SAV1130   1       14       -2.564557  cell surface protein
SAV1131   -1      15       -3.121953  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1125   LPSBIOSNTHSS  219    1e-76    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 219 bits (560), Expect = 1e-76
Identities = 77/155 (49%), Positives = 112/155 (72%)

Query: 5 IAVIPGSFDPITYGHLDIIERSTDRFDEIHVCVLKNSKKEGTFSLEERMDLIEQSVKHLP 64
A+ PGSFDPIT+GHLDIIER FD+++V VL+N K+ FS++ER++ I +++ HLP
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 65 NVKVHQFSGLLVDYCEQVGAKTIIRGLRAVSDFEYELRLTSMNKKLNNEIETLYMMSSTN 124
N +V F GL V+Y Q A I+RGLR +SDFE EL++ + NK L +++ET+++ +ST
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 125 YSFISSSIVKEVAAYRADISEFVPPYVEKALKKKF 159
YSF+SSS+VKEVA + ++ FVP +V AL +F
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQF 156


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1129   IGASERPTASE   36     6e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 35.8 bits (82), Expect = 6e-04
Identities = 37/194 (19%), Positives = 71/194 (36%), Gaps = 15/194 (7%)

Query: 447 RIVDKEAFTKANTDKSNKKEQQDNSAKKEA---------TPATPSKPTPSPVEKESQKQD 497
+ VD T N +++ N+ + PATPS+ T + E Q+
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 498 SQKDDNKQLPSVEKENDASSESGKDKTPATKPT------KGEVESSSTTPTKVVSTTQNV 551
+ + + + +N ++ K A T E + + TT TK +T +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 552 AKPTTASSKTTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLP 611
K + KT + TS S + + S +Q + T + +Q N
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTE 1169

Query: 612 QTGEESNKDMTLPL 625
Q +E++ ++ P+
Sbjct: 1170 QPAKETSSNVEQPV 1183



Score = 30.0 bits (67), Expect = 0.035
Identities = 27/156 (17%), Positives = 45/156 (28%), Gaps = 5/156 (3%)

Query: 37 EAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPET-KPVANAVSVSNKEVEAPTSETKE 95
A EE TE + V S + ++ ET +P A ++ V +++
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 96 AKEVKEVKAPKETKAVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEM 155
+ KET + + T N + NP + P
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE----NPENTTPATTQPTVNSESSNK 1218

Query: 156 KKENGEQQFYHYASSVKPARVIFTDSKPEIELGLQS 191
K + +V+PA D L S
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1130   IGASERPTASE  34     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 33.9 bits (77), Expect = 0.001
Identities = 27/132 (20%), Positives = 44/132 (33%), Gaps = 4/132 (3%)

Query: 184 ADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVSTDTT 243
+ A+ + P PA P TE + K + + +V +
Sbjct: 1015 EEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKS 1074

Query: 244 KDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNET 303
+ TQT + AQ+ E + QT + V K+Q+ KVT +
Sbjct: 1075 NVKANTQTN---EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS-QVS 1130

Query: 304 PKQASKAKELPK 315
PKQ P+
Sbjct: 1131 PKQEQSETVQPQ 1142


15  SAV1147  SAV1163  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1147        2       12   1.900286  succinate dehydrogenase cytochrome b-558
SAV1148        2       12   1.969916  succinate dehydrogenase flavoprotein subunit
SAV1149        0       14  -0.303700  iron-sulphur subunit of succinate dehydrogenase
SAV1151       -2       16  -1.217946  glutamate racemase
SAV1152       -1       14  -3.683437  conserved hypothetical protein
SAV1153       -1       14  -4.065884  conserved hypothetical protein
SAV1154        2       15  -4.598340  hypothetical protein
SAV1155        2       16  -4.286482  hypothetical protein
SAV1156        2       16  -4.189324  hypothetical protein
SAV1157        0       13  -3.761420  hypothetical protein
SAV1158       -1       14  -2.128603  Fibrinogen-binding protein precursor
SAV1159        1       16  -2.364478  fibrinogen-binding protein precursor
SAV1160        2       17  -1.890913  hypothetical protein
SAV1161        1       16  -1.869504  hypothetical protein
SAV1162        2       15  -2.214594  transposase for insertion sequence IS1181
SAV1163        2       17  -2.177724  alpha-hemolysin precursor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1148   MICOLLPTASE  30     0.027    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 30.5 bits (68), Expect = 0.027
Identities = 22/127 (17%), Positives = 47/127 (37%), Gaps = 13/127 (10%)

Query: 378 DFSQHGGNRLGANSLLSAIYGGTVAGPNAIDYISNIDRSYTDMDESIFEKRKAEEQERFD 437
+ ++G N N++ + + G + I D T+ F R ER +
Sbjct: 246 NIDKYGSNYSKGNAVFNLMKGIDYYTNSVIYNTKGYDAKNTE-----FYNRIDPYMERLE 300

Query: 438 KLLAMR---GTENAYKLHRELGEIMTANVTVVRENEKLLETDKKIVELMKRYEDIDMEDT 494
L + +NA+ ++ L T + RE+ + + + + MK Y + +
Sbjct: 301 SLCTIGDKLNNDNAWLVNNALYY--TGRMGKFREDPSISQ--RALERAMKEYPYLSYQYI 356

Query: 495 QTWSNQA 501
+ +N
Sbjct: 357 E-AANDL 362


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1163   BICOMPNTOXIN  313    e-109    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 313 bits (803), Expect = e-109
Identities = 72/318 (22%), Positives = 144/318 (45%), Gaps = 24/318 (7%)

Query: 9 VTTTLLLGSILMNPVANAADSDINIKTGTTDIGSNTTVKTGDLVTYDKEN--GMHKKVFY 66
+TTTL + L+ P+AN + T DIG + ++ N G+ + + +
Sbjct: 7 LTTTLSVS--LLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQF 64

Query: 67 SFIDDKNHNKKLLVIRTKGTIAGQYRVYSEEGANKS-GLAWPSAFKVQLQLPDNEVAQIS 125
F+ DK +NK L+++ +G I+ + Y+ + N + WP + + L+ D V+ I
Sbjct: 65 DFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLI- 123

Query: 126 DYYPRNSIDTKEYMSTLTYGFNGNVTGDDTGKIGGLIGANVSIGHTLKYVQPDFKTILES 185
+Y P+N I++ TL Y GN + +GG N S ++ Y Q ++ + +E
Sbjct: 124 NYLPKNKIESTNVSQTLGYNIGGNFQSAPS--LGGNGSFNYS--KSISYTQQNYVSEVEQ 179

Query: 186 PTDKKVGWKVIFNNMVNQNWGPYDRDSWNPVYGNQLFMKTRNGSMKAAENFLDPNKASSL 245
K V W V N+ ++ + + LF+ + S + F+ ++ L
Sbjct: 180 QNSKSVLWGVKANSFATESG-------QKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPL 232

Query: 246 LSSGFSPDFATVITMDRKASKQQTNIDVIYERVRD-----DYQLHWTSTNWKGTNTKDKW 300
+ SGF+P F ++ + K S + ++ Y R D H+ ++ G + +
Sbjct: 233 VQSGFNPSFIATVSHE-KGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAF 291

Query: 301 TDRS-SERYKIDWEKEEM 317
+R+ + +Y+++W+ E+
Sbjct: 292 VNRNYTVKYEVNWKTHEI 309


16  SAV1244  SAV1255  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1244        2       11  -1.671433  RNase HII
SAV1245        2       10  -1.094059  succinyl-CoA synthetase
SAV1246        2       10  -0.998323  succinyl-CoA synthetase
SAV1247        2       10  -1.718383  LytN protein
SAV1248        1        9  -0.687498  FmhC protein
SAV1249        1        8   0.942684  similar to DNA processing Smf protein
SAV1250        2        9   1.202316  DNA topoisomerase I topA homologue
SAV1251        2       12   1.110841  glucose-inhibited division protein gid
SAV1252        4       15   1.182132  site-specific recombinase XerC homologue
SAV1253        3       18   1.858526  heat shock protein HslV
SAV1254        4       19   1.384791  heat shock protein HslU
SAV1255        3       21   0.541725  transcription pleiotropic repressor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1247   GPOSANCHOR  35     3e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.4 bits (81), Expect = 3e-04
Identities = 15/90 (16%), Positives = 32/90 (35%), Gaps = 4/90 (4%)

Query: 1 MNKQQSKVRYSIRKVSIGILSISIGMFLALGMSNKAYADEIDKSKDFTRGYEQNVFAKSE 60
M K + YS+RK+ G S+++ AL + ++ + + K +
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAV----ALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQ 56

Query: 61 LNANKNTTKDKIKNEGAVKTSDTSLKLDNK 90
A+K ++ S + L +
Sbjct: 57 ERADKFEIENNTLKLKNSDLSFNNKALKDH 86


17  SAV1311  SAV1322  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1311        4       19  -4.746562  hypothetical protein
SAV1312        2       15  -3.233340  hypothetical protein
SAV1313        2       14  -3.441182  hypothetical protein
SAV1314        2       14  -3.579380  hypothetical protein
SAV1315        1       14  -3.581040  possible low specificity-threonine aldolase
SAV1316       -2       16  -5.267021  hypothetical protein
SAV1317       -1       16  -4.820421  cardiolipin synthetase homolog
SAV1318       -2       17  -6.119694  ABC transporter homolog
SAV1319       -2       17  -5.457142  ABC transporter homolog
SAV1320       -2       15  -4.959849  hypothetical protein
SAV1321       -3       13  -4.640334  similar to two-component sensor histidine
SAV1322       -1       14  -3.201036  similar to two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1320   ABC2TRNSPORT  29     0.016    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 29.1 bits (65), Expect = 0.016
Identities = 11/34 (32%), Positives = 15/34 (44%)

Query: 167 IVTIGLAVLGGLWFPINTFPNWLQHVAHVLPSYH 200
+V + L G FP++ P Q A LP H
Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSH 217


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1321   PF04647  33     0.001    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 32.8 bits (75), Expect = 0.001
Identities = 18/112 (16%), Positives = 42/112 (37%), Gaps = 9/112 (8%)

Query: 35 WLYIISVIVFSLSYLILVIVNNRLNTLMFYILLIIHYFIICYFVFSVHPMLSLFFFYSAF 94
+ S++VF++ I +++ L+ I I + + V +P
Sbjct: 79 RCTLTSLLVFNVLAYIAHLIDPAYFQLLILIAFITSLLALLFLVPVDNP---------RN 129

Query: 95 AVPFTFKNNVKKTATNLFILTMIICTIITYLLYNNYFVAMMVYYVVISLIML 146
+ T + K T++ ++ + +I Y LY + ++ V+ L
Sbjct: 130 LISNTEQRKTLKLKTSMVLMVLFGGSIGAYRLYTHQIALAILLGVLWQTFTL 181


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SAV1322   HTHFIS  62     9e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 62.2 bits (151), Expect = 9e-14
Identities = 23/116 (19%), Positives = 53/116 (45%), Gaps = 2/116 (1%)

Query: 2 TSLIIAEDQNMLRQAMVQLIKLHGDFEILADTDNGLDAMKLIEEYNPNVVILDIEMPGMT 61
++++A+D +R + Q + G +++ T N + I + ++V+ D+ MP
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG-YDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLEVLAEIRKKHLNIKVIIVTTFKRPGYFEKAVVNDVDAYVLKERSIEELVETINK 117
++L I+K ++ V++++ KA Y+ K + EL+ I +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


18  SAV1341  SAV1348  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1341        4       12  -2.398108  conserved hypothetical protein
SAV1342        3       11  -2.264310  transketolase
SAV1343        3       12  -2.539598  conserved hypothetical protein
SAV1344        2        9  -1.247603  conserved hypothetical protein
SAV1345        2        9  -1.183335  similar to exonuclease SbcD
SAV1346        1       10  -0.898720  similar to exonuclease SbcC
SAV1347        2       13   1.286346  large-conductance mechanosensitive channel
SAV1348        2       13   1.506825  glycine betaine transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1342   CHANLCOLICIN  30     0.041    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 29.7 bits (66), Expect = 0.041
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 5/58 (8%)

Query: 296 QNTMLKRANEDESQ-----WNSLLEKYAETYPELAEEFKLAISGKLPKNYKDELPRFE 348
QN +L +D + +L EKY E Y ++A+E GK N + L FE
Sbjct: 337 QNNLLNSQIKDAVDATVSFYQTLTEKYGEKYSKMAQELADKSKGKKIGNVNEALAAFE 394


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1346   FbpA_PF05833  34     0.005    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 33.7 bits (77), Expect = 0.005
Identities = 39/249 (15%), Positives = 83/249 (33%), Gaps = 4/249 (1%)

Query: 241 LQARSKEILAFVNESKETAIKEYEIIEKKTLENNILKDNINQLNKNKIDFVQLKEQQPEI 300
I F E+ + + + +L N ID ++
Sbjct: 174 FDFSYDMIENFTKENSLQLNDNIFSKIFTGVSKTLSSEICFRLKNNSIDLSLSNLKEIVE 233

Query: 301 DEIEAKLKLLQDITNLLNYIENREKIETKIAN--SKKDISKTNNKILNLDCDKRNIDKEK 358
+ ++ + Y +N + N SK+D K + + K+K
Sbjct: 234 VCKDLFKEIQSNKFEFNCYTKNNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDK 293

Query: 359 K--MLEENGDLIESKTSFIDKTRVLFNDINKYQQSYLNIECLITEGEQLGDELNNLIKGL 416
+ ++ DL + + I++ +N + + + GE L + L KGL
Sbjct: 294 SDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYALKKGL 353

Query: 417 EKVEDSIGNNESDYEKIIELNNAITNINNEINIIKENEKAKAELDKLLGSKQELENQINE 476
+E + +E+ I L+ T N + K+ K K + + E ++N
Sbjct: 354 SHIELANYYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAANEQLLQNEEELNY 413

Query: 477 ETTIMKNLE 485
+++ N+
Sbjct: 414 LYSVLTNIN 422


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1347   MECHCHANNEL  145    2e-48    Bacterial mechano-sensitive ion channel signature.
		>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature.

Length = 136

Score = 145 bits (367), Expect = 2e-48
Identities = 62/131 (47%), Positives = 90/131 (68%), Gaps = 11/131 (8%)

Query: 1 MLKEFKEFALKGNVLDLAIAVVMGAAFNKIISSLVENIIMPLIGKIFGSVDFAK------ 54
++KEF+EFA++GNV+DLA+ V++GAAF KI+SSLV +IIMP +G + G +DF +
Sbjct: 3 IIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLR 62

Query: 55 ----EWSFWGIKYGLFIQSVIDFIIIAFALFIFVKIANTL-MKKEEAEEEAVVEENVVLL 109
+ + YG+FIQ+V DF+I+AFA+F+ +K+ N L KKEE + VLL
Sbjct: 63 DAQGDIPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAPTKEEVLL 122

Query: 110 TEIRDLLREKK 120
TEIRDLL+E+
Sbjct: 123 TEIRDLLKEQN 133


19  SAV1429  SAV1435  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1429       12       12   1.964232  conserved hypothetical protein
SAV1430       10       12   2.133353  conserved hypothetical protein
SAV1431       10       12   2.043310  conserved hypothetical protein
SAV1432        9       11   2.009722  conserved hypothetical protein
SAV1433        9       11   2.097117  similar to cell wall enzyme EbsB
SAV1434        9       11   2.127745  hypothetical protein ebhA
SAV1435        4        9   1.727463  hypothetical protein ebhB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1434   CHANLCOLICIN  33     0.039    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 33.1 bits (75), Expect = 0.039
Identities = 31/156 (19%), Positives = 64/156 (41%), Gaps = 4/156 (2%)

Query: 1315 ITQATSQVTTKEHALNGAQNLAQAKTTAKNNLNNLTSINNAQKDALTRNIDGATTVAGVN 1374
+++ V + L+ AQ+ LN+ S + +DA + + G +
Sbjct: 184 LSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRN--ELA 241

Query: 1375 QETAKATELNNAMHSLQNGINDETQTKQTQKYLDAEPSKKSAYDQAVNAAKAILTKASGQ 1434
Q +AK EL+ + L ND Q + + ++ A T+ +
Sbjct: 242 QASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRI 301

Query: 1435 NVDKAAVEQALQNVNSTKTALNGDAKLNEAKAAAKQ 1470
N D +++A+ V++ + A G A+++EA+ K+
Sbjct: 302 NADITQIQKAISQVSNNRNA--GIARVHEAEENLKK 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1435   GPOSANCHOR  61     8e-11    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 60.8 bits (147), Expect = 8e-11
Identities = 65/380 (17%), Positives = 120/380 (31%), Gaps = 36/380 (9%)

Query: 2732 SANAIIQKPIRTVQEVQSALTNVNRVNERLTQAINQLVPLAD-----NSALRTAKTKLDE 2786
+ + T+++VQ N L + L N L + E
Sbjct: 40 VSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKE 99

Query: 2787 EINKSVTTDGMTQSSIQAYENAKRAGQTETTNAQNVINNGDATDQQIAAEKTKVEEKYNS 2846
++ K+ + S IQ E K + A N A + + AEK + +
Sbjct: 100 KLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 159

Query: 2847 LKQAIAGLTPDLAPLQTAKTQLQNDIDQPTSTTGMTSASVAAFNDKLSAARTKIQEIDRV 2906
L++A+ G L+ + A A L A +
Sbjct: 160 LEKALEGAMNFSTADSAKIKTLEAEKAA-------LEARQAELEKALEGAM------NFS 206

Query: 2907 LASHPDVATIRQNVTAANAAKTALDQARNGLTVDKAPLENAKNQLQHSIDTQTSTTGMTQ 2966
A + T+ A A K L++A G L+ + +
Sbjct: 207 TADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE 266

Query: 2967 DSINAYNAKLTAARNKVQQINQVLAGSPTVDQINTNTSAANQAKSDLDHARQALTPDKAP 3026
++ TA K++ + + + L+ RQ+L D
Sbjct: 267 KALEGAMNFSTADSAKIKTLEA------EKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320

Query: 3027 LQNAKTQLEQSINQPTDTTGMTTASLNAYNQKLQAAR----------QKLTEINQVLNGN 3076
+ AK QLE + + ++ AS + + L A+R QKL E N++
Sbjct: 321 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA- 379

Query: 3077 PTVQNINDKVAEANQAKDQL 3096
+ Q++ + + +AK Q+
Sbjct: 380 -SRQSLRRDLDASREAKKQV 398



Score = 60.5 bits (146), Expect = 1e-10
Identities = 80/479 (16%), Positives = 156/479 (32%), Gaps = 36/479 (7%)

Query: 2582 TKVRAAQTKIDQAKALLQNKEDNSQLVTSKNNLQSSVNQVPSTAGMTQQSIDN------- 2634
T +A Q L + +E + N L+ + + + D
Sbjct: 37 TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSN 96

Query: 2635 YNAKKREAETEITAAQRVIDNGDATAQQISDEKHRVDNALTALNQAKHDLTADTHALEQA 2694
K R+ + ++ I +A + N TA + L A+ AL
Sbjct: 97 AKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAAR 156

Query: 2695 VQQLNRTGTTTGKKPASITAYNNSIRALQSDLTSAKNSANAIIQKPIRTVQEVQSALTNV 2754
L + + +A ++ A ++ L + + ++ + + + +
Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 216

Query: 2755 NRVNERLTQAINQLVPLADNSALRTAKTKLDEEINKSVTTDGMTQSSIQAYENAKRAGQT 2814
L L T +I ++ E A
Sbjct: 217 EAEKAALAARKADL--EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 274

Query: 2815 ETTNAQNVINNGDATDQQIAAEKTKVEEKYNSLKQAIAGLTPDLAPLQTAKTQLQNDIDQ 2874
+T I +A + AEK +E + L L DL + AK QL+ + +
Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQK 334

Query: 2875 PTSTTGMTSASVAAFNDKLSAARTKIQEIDRVLA--------SHPDVATIRQNVTAANAA 2926
++ AS + L A+R ++++ S ++R+++ A+ A
Sbjct: 335 LEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREA 394

Query: 2927 KTALDQARNGLTVDKAPLENAKNQLQHSIDTQTSTTGMTQDSINAYNAKLTAARN----- 2981
K +++A A LE +L+ S +T+ AKL A
Sbjct: 395 KKQVEKALEEANSKLAALEKLNKELEESKK-------LTEKEKAELQAKLEAEAKALKEK 447

Query: 2982 ---KVQQINQVLAGSPTVDQINTNTSAANQAKSDLDHARQALT---PDKAPLQNAKTQL 3034
+ +++ ++ AG + Q + N+A A QA T +KAP++ K QL
Sbjct: 448 LAKQAEELAKLRAGKASDSQ-TPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQL 505



Score = 42.4 bits (99), Expect = 4e-05
Identities = 33/297 (11%), Positives = 80/297 (26%), Gaps = 35/297 (11%)

Query: 1 MNYRDKIQKFSIRKYTVGTFSTVIATLVFLGF--NTSQAHAAETNQPASVVKQKQQS--- 55
M + + +S+RK GT S +A V + +A + + +K Q
Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERAD 60

Query: 56 ------NNEQTENRESQVQNSQNSQNSQSLS-------------------ATHENEQPNN 90
N + +N + N ++ L+ + ++
Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120

Query: 91 SQANLVNQKVAQSSTTNDEQPASQNVNTKKDSATAATTQ--PDKEESKHKQNESQSANKN 148
+A+L + + + + + +K + A E + + + K
Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180

Query: 149 -GNDNRAAHVENHEANVVTASDSSDNGNVQHDRNELQAFFDANYHDYRFIDRENADSGTF 207
+ A E + + L+A A +++ +
Sbjct: 181 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGA--M 238

Query: 208 NYVKGIFDKINTLLGSNDPINNKDLQLAYKELEQAVALIRTMPQRQQTSRRSNRIQT 264
N+ KI TL + + +L + + ++
Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEA 295


20  SAV1536  SAV1546  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1536        0       13  -3.147586  glycine dehydrogenase subunit 1
SAV1537        2       17  -4.889704  aminomethyltransferase
SAV1538        2       22  -6.321614  similar to shikimate kinase
SAV1539        2       21  -5.336821  hypothetical protein
SAV1540        1       19  -4.330067  hypothetical protein
SAV1541        0       17  -4.110510  similar to competence protein
SAV1542        1       14  -2.566870  exogenous DNA-binding protein comGC
SAV1543        1       15  -2.650674  similar to DNA transport machinery protein
SAV1544       -1       12  -2.814919  similar to late competence protein comGA
SAV1545        0       14  -3.084806  similar to metallo-beta-lactamase superfamily
SAV1546       -1       12  -3.261156  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1541   BCTERIALGSPH  40     5e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 40.3 bits (94), Expect = 5e-07
Identities = 14/79 (17%), Positives = 38/79 (48%), Gaps = 4/79 (5%)

Query: 5 KQSAFTMIEMLVVMMLISIFLLLTMTSKGLSNLRVIDDEA-NIISFITELNYIKSQAIAN 63
+Q FT++EM+++++L+ + + + + S D A + F +L +++ + +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPAS---RDDSAAQTLARFEAQLRFVQQRGLQT 58

Query: 64 QGYINVRFYENSDTIKVIE 82
+ V + + V+E
Sbjct: 59 GQFFGVSVHPDRWQFLVLE 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1542   BCTERIALGSPG  46     9e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 46.4 bits (110), Expect = 9e-10
Identities = 19/76 (25%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 3 KFLKKTQAFTLIEMLLVLLIISLLLILIIPNI--AKQTAHIQSTGCNAQVKMVNSQIEAY 60
+ K + FTL+E+++V++II +L L++PN+ K+ A Q + + + + ++ Y
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKA--VSDIVALENALDMY 59

Query: 61 ALKHNRNPSSIEDLIA 76
L ++ P++ + L +
Sbjct: 60 KLDNHHYPTTNQGLES 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1543   BCTERIALGSPF  84     4e-20    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 84.1 bits (208), Expect = 4e-20
Identities = 65/347 (18%), Positives = 137/347 (39%), Gaps = 6/347 (1%)

Query: 14 KKRQLSKAQQIDLLSNLCNLLKYGFTLYQSFQFLNLQMTYKN-KQLGTTILSEISNGAPC 72
+K +LS + L L L+ L ++ + Q + QL + S++ G
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 73 NQIL-SLIGYSDTI-VMQVYLAERFGNIIDVLEETVNYMKVNRKSEQRLLKTLQYPLILV 130
+ G + + V E G++ VL +Y + ++ R+ + + YP +L
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 131 SIFIAMIIILNLTVIPQFQQLYTSMNIQLSSFQKTLSFFITSLPTIIVVMLIIVSMLAII 190
+ IA++ IL V+P+ + + M L + L ++ T ML+ + +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 191 MKLIYNNLNMLNKIN-FVMKLPLISGYFQLFKTYFVTNELVLFYKNGITLQSIVDVYINH 249
+++ + ++ LPLI + T L + + + L + + +
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 250 SS-DPFRQFLGKYLLTYSEMGYGLPQILEKLKCFKPQLIKFVLQGEKRGKLEVELKLYSQ 308
S D R L E G L + LE+ F P + + GE+ G+L+ L+ +
Sbjct: 301 MSNDYARHRLSLATDAVRE-GVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 309 ILVKQIEDKAIKQTQFLQPILFLILGLFIVAIYLVIMLPMFQMMQSI 355
++ + +P+L + + ++ I L I+ P+ Q+ +
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1545   SHIGARICIN  27     0.039    Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 27.5 bits (61), Expect = 0.039
Identities = 20/99 (20%), Positives = 38/99 (38%), Gaps = 11/99 (11%)

Query: 82 DFLKDPVKNGADKFKQYGLPIITSKVTPEK-------LNEGSTEIE-GFKFNVLHTPGHS 133
F+ + K + K Y +P++ S + + N I ++ G+
Sbjct: 39 VFISNLRKALPYERKLYDIPLLRSTLPGSQRYALIHLTNYADETISVAIDVTNVYVMGYR 98

Query: 134 PGSLTYVFDEFAVVG--DTLFNNGIGRTDL-YKGDYETL 169
G +Y F+E + +F + + L Y G+YE L
Sbjct: 99 AGDTSYFFNEASATEAAKYVFKDAKRKVTLPYSGNYERL 137


21  SAV1585  SAV1599  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1585       -1       11  -3.275317  GTP-binding protein
SAV1586       -1       15  -4.554028  30S ribosomal protein S20
SAV1587       -1       14  -4.379575  conserved hypothetical protein
SAV1588       -1       14  -3.642108  similar to late competence protein ComEC
SAV1589        2       16  -1.845026  late competence operon required for DNA binding
SAV1590        2       17  -2.275176  similar to ComEA
SAV1591       -1       14  -1.865296  similar to methyltransferase
SAV1592       -1       11  -0.968903  similar to homolog of plant Iojap proteins
SAV1593        0       12  -0.987785  similar to hydrolase (HAD superfamily)
SAV1594       -1       12  -0.792957  probable nicotinic acid mononucleotide
SAV1595        0       12  -1.407957  conserved hypothetical protein
SAV1596        0       13  -1.524946  shikimate dehydrogenase
SAV1597       -3       14  -1.595011  similar to GTPase family protein
SAV1598       -2       18  -4.040095  similar to hydrolase, haloacid dehalogenase-like
SAV1599        1       13  -3.237554  5'-methylthioadenosine
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SAV1585   TCRTETOQM  184    2e-52    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 184 bits (468), Expect = 2e-52
Identities = 106/439 (24%), Positives = 184/439 (41%), Gaps = 89/439 (20%)

Query: 12 NIRNFSIIAHIDHGKSTLADRILEN---TKSVETRDMQDQLLDSMDLERERGITIKLNAV 68
I N ++AH+D GK+TL + +L N + + D D+ LER+RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 RLKYEAKDGNTYTFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLA 128
++E N +IDTPGH+DF EV RSL+ +GAIL++ A G++AQT +
Sbjct: 62 SFQWENTKVN-----IIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116

Query: 129 LDNELELLPVINKIDLPAAEPERV--------------KQEIE--------DMIGLDQDD 166
+ + INKID + V KQ++E + +Q D
Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 167 VVLA---------------------------------------SAKSNIGIEEILEKIVE 187
V+ SAK+NIGI+ ++E I
Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236

Query: 188 VVPAPDGDPEAPLKALIFDSEYDPYRGVISSIRIVDGVVKAGDKIRMMATGKEFEVTEVG 247
+ ++ L +F EY R ++ IR+ GV+ D +R+ K ++TE
Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITE-- 293

Query: 248 INTPKQ---LPVDELTVGDVGYIIASIKNVDDSRVGDTITLASRPASEPLQGYKKMNPMV 304
+ T +D+ G++ + ++ +GDT L P E ++ P++
Sbjct: 294 MYTSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLL---PQRERIENPL---PLL 346

Query: 305 YCGLFPIDNKNYNDLREALEKLQLNDASLEFE--PESSQALGFGYRTGFLGMLHMEIIQE 362
+ P + L +AL ++ +D L + + + + FLG + ME+
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCA 401

Query: 363 RIEREFGIELIATAPSVIY 381
++ ++ +E+ P+VIY
Sbjct: 402 LLQEKYHVEIEIKEPTVIY 420



Score = 35.6 bits (82), Expect = 6e-04
Identities = 12/75 (16%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 408 IFEPYVRATMMVPNDYVGAVMELCQRKRGQFINMDYLDDIRVNIVYELPLAEVVFDFFDQ 467
+ EPY+ + P +Y+ + ++ ++ V + E+P + ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNE-VILSGEIPARC-IQEYRSD 592

Query: 468 LKSNTKGYASFDYEF 482
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1590   IGASERPTASE  28     0.034    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.034
Identities = 33/190 (17%), Positives = 71/190 (37%), Gaps = 27/190 (14%)

Query: 37 QDDYTSRNFENKDTALKQSTSE---------NNSLSKLEDVQVKDGDNSKNKGPVYVDVK 87
+DD+ +RNF+ + + S ++++ QV G + + V D
Sbjct: 733 EDDWINRNFKATTMNVTGNASLYSGRNVANITSNITASNKAQVHIGYKTGDTVCVRSDYT 792

Query: 88 GAVKHPNVYKMTSKDRVVDLLDKAQLLDDADVSRINLSEKLTDQKMIFIPHKGQKNVEPQ 147
G V D+ ++ + L +V+ + + + +F + + N + +
Sbjct: 793 GYV---TCTTDKLSDKALNSFNPTNL--RGNVNLTESANFVLGKANLFGTIQSRGNSQVR 847

Query: 148 IEVNS-------VHVKNGNTNNTKVNLNTASVSELMSVPGVGQAKANAIVEYRNQQGAFQ 200
+ NS V + N ++LN+A S +V N++ + G+F
Sbjct: 848 LTENSHWHLTGNSDVHQLDLANGHIHLNSADNSN--NVTKYNTLTVNSL----SGNGSFY 901

Query: 201 EIDDLKKVKG 210
+ DL +G
Sbjct: 902 YLTDLSNKQG 911


22  SAV1642  SAV1658  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1642        2       15   0.982494  Holliday junction DNA helicase
SAV1643        0       13   0.197026  ACT domain protein pheB
SAV1644        0       12   0.094261  Spo0B-associated GTP-binding protein
SAV1645       -1       14  -2.050375  50S ribosomal protein L27
SAV1646       -2       15  -3.118177  conserved hypothetical protein
SAV1647        1       15  -2.854384  50S ribosomal protein L21
SAV1648        0       20  -2.344956  similar to cell shape determinant mreD
SAV1649        3       21  -2.060998  cell shape determinant mreC
SAV1650        3       23  -1.547161  hypothetical protein
SAV1651        6       27   0.182533  hypothetical protein
SAV1652        7       25   0.157262  hypothetical protein
SAV1653       10       25   0.018824  hypothetical protein
SAV1654       11       25  -0.399289  hypothetical protein
SAV1655       10       23  -0.406585  rRNA methylase
SAV1656        9       23  -0.925552  O-nucleotidyltransferase
SAV1657        5       18  -1.394855  transposase C
SAV1658        2       10  -0.572239  transposase B
23  SAV1753  SAV1758  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1753        2       10   0.472477  similar to 16S pseudouridylate synthase
SAV1754        3       10   0.758120  spore cortex protein homolog
SAV1755        2       10   1.106729  putative flavoprotein
SAV1756        3        9   1.118277  conserved hypothetical protein
SAV1757        2        9   1.308622  Mrp protein
SAV1758        2        9   1.288385  Mrp protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1758   IGASERPTASE  37     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.6 bits (84), Expect = 0.001
Identities = 48/321 (14%), Positives = 90/321 (28%), Gaps = 18/321 (5%)

Query: 1323 QNQTNDQVDTTTNQAVNAIDNVEAEVVIKPKAIADIEKAVKEKQQQIDNSLDSTDNEKEV 1382
+ N VDTT N I V + IA +++A S + +
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044

Query: 1383 ASQALAKEKEKALAAIDQAQTNSQVNQAATNGVSAIKIIQPETKVKPAAREKINQKANEL 1442
++ EK + A AQ +A + K E +
Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEA-----------KSNVKANTQTNEVAQSGSETK 1093

Query: 1443 RAKINQDKEATAEERQ----VALDKINEFVNQAMTDITNNRTNQQVDDTTSQALDSIALV 1498
+ + KE E++ V +K E ++ V A ++ V
Sbjct: 1094 ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153

Query: 1499 APEHIVRAAARDAVKQQYEAKKQEIEQAEHATDEEKQVALNQLANNEKLALQNINQAVTN 1558
+ A +Q AK+ + T+ N + N + Q N
Sbjct: 1154 NIKEPQSQTNTTADTEQ-PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 1559 NDVKRVETNGIATLKGVQPHIVIKPEAQQAIKATAENQVESIKDTPHATVDELDEANQLI 1618
++ N PH V +P + + + +A + + Q +
Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNV-EPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFV 1271

Query: 1619 S-DTLKQAQQEIENTNQDAAV 1638
+ + K Q I +
Sbjct: 1272 ALNVGKAVSQHISQLEMNNEG 1292


24  SAV1774  SAV1792  Y  N  Y  Genomic Island
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1774        0       18  -3.402591  arsenical pump membrane protein homolog
SAV1775       -1       16  -4.565951  conserved hypothetical protein
SAV1776        1       13  -3.751313  hypothetical protein
SAV1777        2       14  -4.158410  hypothetical protein
SAV1778        2       16  -3.255823  hypothetical protein
SAV1779        3       15  -2.969590  hypothetical protein
SAV1780        1       14  -2.453453  hypothetical protein
SAV1781        1       16  -2.062694  putative transaldolase
SAV1782        1       18  -3.666786  hypothetical protein
SAV1783        1       16  -1.900774  conserved hypothetical protein
SAV1784        1       15  -2.482127  hypothetical protein
SAV1785       -1       11  -0.749654  truncated transposase
SAV1786       -1       11   0.046764  truncated transposase
SAV1787       -1       12   0.523862  truncated transposase
SAV1788       -1       13   1.856262  plant metabolite dehydrogenase homolog
SAV1789       -1       13   1.981475  hypothetical protein
SAV1790        0       15   3.155142  S-adenosylmethionine synthetase
SAV1791       -2       14   2.819285  phosphoenolpyruvate carboxykinase
SAV1792        1       18   3.262972  hypothetical protein
25  SAV1802  SAV1827  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias    %GCBias  Product
SAV1802        4       19  -3.467483  hypothetical protein
SAV1803        1       17  -0.688891  hypothetical protein
SAV1804       -1       14  -0.592496  truncated transposase
SAV1805       -1       13  -0.060134  truncated transposase
SAV1806       -1       12   0.052938  transposase homolog for IS232
SAV1807       -1       11   0.235443  probable specificity determinant HsdS
SAV1808       -1       12   1.254979  type I restriction enzyme EcoR124II M protein
SAV1809        2       10  -0.608202  serine protease
SAV1810        2       11  -1.411772  serine protease
SAV1811        1       13  -2.004907  serine protease
SAV1812        0       13  -3.188821  serine protease
SAV1813        1       15  -4.595692  serine protease
SAV1814        1       14  -4.505012  hypothetical protein
SAV1815        2       12  -3.700100  probable beta-lactamase
SAV1816        2       14  -4.425045  truncated hypothetical protein
SAV1817        3       14  -4.424193  hypothetical protein
SAV1818        4       14  -4.472887  hypothetical protein
SAV1819        5       15  -4.507246  leukotoxin F-subunit
SAV1820        6       17  -5.799965  leukotoxin S-subunit
SAV1821        6       20  -7.121026  hypothetical protein
SAV1822        7       18  -6.698566  conserved hypothetical protein
SAV1823        8       21  -7.212088  hypothetical protein
SAV1824        8       22  -6.949343  extracellular enterotoxin type G precursor
SAV1825        5       21  -7.331276  enterotoxin
SAV1826        4       17  -5.879637  enterotoxin
SAV1827        2       14  -3.508722  enterotoxin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1809   V8PROTEASE  115    6e-33    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 115 bits (290), Expect = 6e-33
Identities = 60/227 (26%), Positives = 103/227 (45%), Gaps = 26/227 (11%)

Query: 30 IQQTAKA-----ENTVKQITNTNVAPYSGVTWMGA--------GTGFVVGNHTIITNKHV 76
++Q A N QIT+T Y+ VT++ +G VVG T++TNKHV
Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120

Query: 77 TYHM-KVGDEIKAHPNGFY--NNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFK 133
+KA P+ N G + +I Y G+ D+A+V+ + +
Sbjct: 121 VDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSP---NEQNKHIG 177

Query: 134 DFTSKFNIA--SEAKENEPISVIGYPNPNGNKLQMYESTGKVLSVNGNIVSSDAIIQPGS 191
+ ++ +E + N+ I+V GYP + M+ES GK+ + G + D G+
Sbjct: 178 EVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGN 236

Query: 192 SGSPILNSKHEAIGVIYAGNKPSGESTRGFAVYFSPEIKKFIADNLD 238
SGSP+ N K+E IG+ + G AV+ + ++ F+ N++
Sbjct: 237 SGSPVFNEKNEVIGIHWGGVPNEF----NGAVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1810   V8PROTEASE  112    1e-31    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 112 bits (281), Expect = 1e-31
Identities = 58/227 (25%), Positives = 100/227 (44%), Gaps = 26/227 (11%)

Query: 30 IQQTAKA-----ENSVKLITNTNVAPYSGVTWMGA--------GTGFVVGNHTIITNKHV 76
++Q A N IT+T Y+ VT++ +G VVG T++TNKHV
Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120

Query: 77 TYHM-KVGDEIKAHPNGFY--NNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFK 133
+KA P+ N G + +I Y G+ D+A+V+ + +
Sbjct: 121 VDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSP---NEQNKHIG 177

Query: 134 DFTSKFNIA--SEAKENEPISVIGYPNPNGNKLQMYESTGKVLSVNGNIVTSDAVVQPGS 191
+ ++ +E + N+ I+V GYP + M+ES GK+ + G + D G+
Sbjct: 178 EVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGN 236

Query: 192 SGSPILNSKREAIGVMYASDKPTGESTRSFAVYFSPEIKKFIADNLD 238
SGSP+ N K E IG+ + AV+ + ++ F+ N++
Sbjct: 237 SGSPVFNEKNEVIGIHWGGVPNEFNG----AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1811   V8PROTEASE  178    7e-57    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 178 bits (452), Expect = 7e-57
Identities = 64/217 (29%), Positives = 106/217 (48%), Gaps = 23/217 (10%)

Query: 37 EKNVTQVKDTNNFPYNGVVSFK--------DATGFVIGKNTIITNKHV-SKDYKVGDRIT 87
+ Q+ DT N Y V + A+G V+GK+T++TNKHV + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHP---NGDKGNGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAK 144
A P N D G + + I+ Y G+ D++++ + + E V+ +
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK-----HIGEVVKPATMSN 187

Query: 145 DA--KVDDKIKVIGYPLPAQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNN 202
+A +V+ I V GYP +ES G I +K + +D GNSGSPV N N
Sbjct: 188 NAETQVNQNITVTGYPGDK-PVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKN 246

Query: 203 EVIGVVYGGIGKIGSEYNGAVYFTPQIKDFIQKHIEQ 239
EVIG+ +GG + +E+NGAV+ +++F++++IE
Sbjct: 247 EVIGIHWGG---VPNEFNGAVFINENVRNFLKQNIED 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1812   V8PROTEASE    177    2e-56    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 177 bits (450), Expect = 2e-56
Identities = 65/230 (28%), Positives = 108/230 (46%), Gaps = 29/230 (12%)

Query: 29 EVQQTAKA-----ENNVTKIQDTNIFPYTGVVAFKS--------ATGFVVGKNTILTNKH 75
++Q A N+ +I DT Y V + A+G VVGK+T+LTNKH
Sbjct: 60 PLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKH 119

Query: 76 V-SKNYKVGDRITAHP---NSDKGNGGIYSIKKIINYPGKEDVSVIQVEERAIERGPKGF 131
V + + A P N D G ++ ++I Y G+ D+++++ +
Sbjct: 120 VVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK----- 174

Query: 132 NFNDNVTPFKYAAGA--KAGERIKVIGYPHPYKNKYVLYESTGPVMSVEGSSIVYSAHTE 189
+ + V P + A + + I V GYP K ++ES G + ++G ++ Y T
Sbjct: 175 HIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMWESKGKITYLKGEAMQYDLSTT 233

Query: 190 SGNSGSPVLNSNNELVGIHFASDVKNDDNRNAYGVYFTPEIKKFIAENID 239
GNSGSPV N NE++GIH+ V N+ N V+ ++ F+ +NI+
Sbjct: 234 GGNSGSPVFNEKNEVIGIHWGG-VPNEFNG---AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1813   V8PROTEASE    138    1e-41    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 138 bits (349), Expect = 1e-41
Identities = 64/212 (30%), Positives = 101/212 (47%), Gaps = 18/212 (8%)

Query: 36 EKNVKEITDATKAPYNSVVAFA--------GGTGVVVGKNTIVTNKHIAKSNDIFKNRVA 87
+ +ITD T Y V +GVVVGK+T++TNKH+ + + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHYS---SKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFAEGA-- 142
A S G + + I +Y G+ DLAIV + + + + V + A
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQNKHIGEVVKPATMSNNAET 191

Query: 143 KAKDRISVIGYPKGAQTKYKMFESTGTINHISGTFIEFDAYAQPGNSGSPVLNSKHELIG 202
+ I+V GYP G + M+ES G I ++ G +++D GNSGSPV N K+E+IG
Sbjct: 192 QVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIG 250

Query: 203 ILYAGSGKDESEKNFGVYFTPQLKEFIQNNIE 234
I + G +E N V+ ++ F++ NIE
Sbjct: 251 IHWGGVP---NEFNGAVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1819   BICOMPNTOXIN  396    e-141    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 396 bits (1020), Expect = e-141
Identities = 96/329 (29%), Positives = 177/329 (53%), Gaps = 24/329 (7%)

Query: 1 MKMKKLVKSSVASSIALLLLSNTVDAAQHITPVSEKKVDDKITLYKTTATSDNDKLNISQ 60
M K++ ++++ S+ L + ++ A+ + I + K T ++K ++Q
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 ILTFNFIKDKSYDKDTLVLKAAGNINSGYKKPNPKDYNYSQ-FYWGGKYNVSVSSESNDA 119
+ F+F+KDK Y+KD L+LK G I+S N K N+ + W +YN+ + + +
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTN-DKY 119

Query: 120 VNVVDYAPKNQNEEFQVQQTLGYSYGGDINISNGLSGGLNGSKSFSETINYKQESYRTTI 179
V++++Y PKN+ E V QTLGY+ GG+ + L G NGS ++S++I+Y Q++Y + +
Sbjct: 120 VSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGG--NGSFNYSKSISYTQQNYVSEV 177

Query: 180 DRKTNHKSIGWGVEAHKIMNNGWGPYGRDSYDPTYGNELFLGGRQSSSNAGQNFLPTHQM 239
+++ N KS+ WGV+A+ + ++LF+G + S + F+P ++
Sbjct: 178 EQQ-NSKSVLWGVKANSFAT-------ESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSEL 229

Query: 240 PLLARGNFNPEFISVLSHKQNDTKKSKIKVTYQREMD---------RYTNQWNRLHWIGN 290
P L + FNP FI+ +SH++ + S+ ++TY R MD Y N + H + N
Sbjct: 230 PPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHN 289

Query: 291 NYKNQNTVTFTSTYEVDWQNHTVKLIGTD 319
+ N+N +T YEV+W+ H +K+ G +
Sbjct: 290 AFVNRN---YTVKYEVNWKTHEIKVKGQN 315
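
Some Expect values in this report use the BLAST shorthand without a mantissa (for example `e-141`, meaning 1e-141), which most numeric parsers reject. The sketch below normalizes such strings and compares them with the 0.05 E-value setting listed under the input parameters. The names `parse_evalue`, `passes_cutoff` and `E_VALUE_CUTOFF` are hypothetical, and whether the tool applies the cutoff inclusively is an assumption.

```python
E_VALUE_CUTOFF = 0.05  # E-value (RPSBlast) setting shown in the input parameters

def parse_evalue(text):
    """Convert an Expect string such as '7e-04', '0.035' or 'e-141' to a float."""
    text = text.strip()
    if text.lower().startswith("e"):
        # Shorthand form: the mantissa 1 is implied.
        text = "1" + text
    return float(text)

def passes_cutoff(text, cutoff=E_VALUE_CUTOFF):
    """True when the hit is at least as significant as the cutoff (assumed inclusive)."""
    return parse_evalue(text) <= cutoff

for raw in ["e-141", "7e-04", "0.035"]:
    print(raw, parse_evalue(raw), passes_cutoff(raw))
```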


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1820   BICOMPNTOXIN  433    e-156    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 433 bits (1116), Expect = e-156
Identities = 214/318 (67%), Positives = 256/318 (80%), Gaps = 10/318 (3%)

Query: 1 MFKKKMLAATLSVGLIAPLASPIQE-SRANTNIENIGDGA--EVIKRTEDVSSKKWGVTQ 57
M K K+L TLSV L+APLA+P+ E ++A + E+IG G+ E+IKRTED +S KWGVTQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 58 NVQFDFVKDKKYNKDALIVKMQGFINSRTSFSDVKGSGYELTKRMIWPFQYNIGLTTKDP 117
N+QFDFVKDKKYNKDALI+KMQGFI+SRT++ + K + + K M WPFQYNIGL T D
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNH--VKAMRWPFQYNIGLKTNDK 118

Query: 118 NVSLINYLPKNKIETTDVGQTLGYNIGGNFQSAPSIGGNGSFNYSKTISYTQKSYVSEVD 177
VSLINYLPKNKIE+T+V QTLGYNIGGNFQSAPS+GGNGSFNYSK+ISYTQ++YVSEV+
Sbjct: 119 YVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVE 178

Query: 178 KQNSKSVKWGVKANEFVTPDGKKSAHDRYLFVQSPNGPTGSAREYFAPDNQLPPLVQSGF 237
+QNSKSV WGVKAN F T G+KSA D LFV + R+YF PD++LPPLVQSGF
Sbjct: 179 QQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPH-SKDPRDYFVPDSELPPLVQSGF 237

Query: 238 NPSFITTLSHEKGSSDTSEFEISYGRNLDITYA----TLFPRTGIYAERKHNAFVNRNFV 293
NPSFI T+SHEKGSSDTSEFEI+YGRN+D+T+A T + + + R HNAFVNRN+
Sbjct: 238 NPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYT 297

Query: 294 VRYEVNWKTHEIKVKGHN 311
V+YEVNWKTHEIKVKG N
Sbjct: 298 VKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1824   BACTRLTOXIN   195    4e-64    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 195 bits (497), Expect = 4e-64
Identities = 109/261 (41%), Positives = 155/261 (59%), Gaps = 11/261 (4%)

Query: 4 LSTVIIILILEIVFHNMN-YVNAQPDPKLDELNKVSDYKNNKGTMGNVMNLYTSPPVEGR 62
+S VI+I L +V N +QPDP D+L+K S++ GTMGN+ LY V
Sbjct: 7 ISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFT---GTMGNMKYLYDDHYVSAT 63

Query: 63 GVINSRQFLSHDLIFPI---EYKSYNEVKTELENTELANNYKDKKVDIFGVPYFYTCIIP 119
V + +FL+HDLI+ I + K+Y++VKTEL N +LA YKD+ VD++G Y+ C
Sbjct: 64 KVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123

Query: 120 KSEPDINQNFGGCCMYGGLTF---NSSENERDKLITVQVTIDNRQSLGFTITTNKNMVTI 176
+ G CMYGG+T N +N + + V+V + R ++ F + T+K VT
Sbjct: 124 SKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTA 183

Query: 177 QELDYKARHWLTKEKKLYEFDGSAFESGYIKFTEKNNTSFWFDLFPKKELVPFVPYKFLN 236
QELD KAR++L +K LYEF+ S +E+GYIKF E N +FW+D+ P F K+L
Sbjct: 184 QELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDK-FDQSKYLM 242

Query: 237 IYGDNKVVDSKSIKMEVFLNT 257
+Y DNK VDSKS+K+EV L T
Sbjct: 243 MYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1825   BACTRLTOXIN   154    3e-48    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 154 bits (391), Expect = 3e-48
Identities = 76/265 (28%), Positives = 124/265 (46%), Gaps = 21/265 (7%)

Query: 9 RLFYIAAIII-TLLCLINNNYVNAEV----DKKDLKKKSDLDSSKLFNLTSYYTDITWQL 63
RLF I+I L+ +I+ V AE DL K S+ + + N+ Y D +
Sbjct: 4 RLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEF-TGTMGNMKYLYDDH--YV 60

Query: 64 DESNKISTDQLLNNTIILKNIDISVLKTSSLKVEFNSSDLANQFKGKNIDIYGLYFGNKC 123
+ S D+ L + +I D + +K E + DLA ++K + +D+YG + C
Sbjct: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC 120

Query: 124 -------VGLTEEKTSCLYGGVTIHDGNQLDEEKV--IGVNVFKDGVQQEGFVIKTKKAK 174
VG +C+YGG+T H+GN D + + V V+++ F ++T K
Sbjct: 121 YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS 180

Query: 175 VTVQELDTKVRFKLENLYKIYNKDTGNIQKGCIFFHSHNHQDQSFYYDLYNVKGSVG--A 232
VT QELD K R L N +Y ++ + G I F +N +F+YD+ G +
Sbjct: 181 VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENN--GNTFWYDMMPAPGDKFDQS 238

Query: 233 EFFQFYSDNRTVSSSNYHIDVFLYK 257
++ Y+DN+TV S + I+V L
Sbjct: 239 KYLMMYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1826   BACTRLTOXIN   159    5e-52    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 159 bits (404), Expect = 5e-52
Identities = 84/135 (62%), Positives = 111/135 (82%), Gaps = 4/135 (2%)

Query: 2 KKTCMYGGVTEHDGNQIDKNNSTDNSHNILIKVYENERNSLSFDIPTNKKNITAQEIDYK 61
KTCMYGG+T+H+GN D N N+L++VYEN+RN++SF++ T+KK++TAQE+D K
Sbjct: 134 GKTCMYGGITKHEGNHFDNGNLQ----NVLVRVYENKRNTISFEVQTDKKSVTAQELDIK 189

Query: 62 VRNYLLKHKNLYEFNSSPYETGYIKFIEGSGHSFWYDLMPESGKKFYPTKYLLIYNDNKT 121
RN+L+ KNLYEFNSSPYETGYIKFIE +G++FWYD+MP G KF +KYL++YNDNKT
Sbjct: 190 ARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKT 249

Query: 122 VESKSINVEVHLTKK 136
V+SKS+ +EVHLT K
Sbjct: 250 VDSKSVKIEVHLTTK 264


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1827   BACTRLTOXIN   108    3e-32    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 108 bits (272), Expect = 3e-32
Identities = 48/94 (51%), Positives = 68/94 (72%), Gaps = 5/94 (5%)

Query: 21 NGNPKPEQLNKASEFTGLMDNMRYLYDDKHVSETNIKSQEKFLQHDLLFKINGSKI---- 76
+P P+ L+K+SEFTG M NM+YLYDD +VS T +KS +KFL HDL++ I+ K+
Sbjct: 30 QPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVDKFLAHDLIYNISDKKLKNYD 89

Query: 77 -LKTEFNNKSLSDKYKNKNVDLFGTNYYNQCYFS 109
+KTE N+ L+ KYK++ VD++G+NYY CYFS
Sbjct: 90 KVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123


26  SAV1838  SAV1843  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1838   3       10       -2.599772  Hit-like protein involved in cell-cycle
SAV1839   2       10       -2.754032  conserved hypothetical protein
SAV1840   3       11       -3.006521  conserved hypothetical protein
SAV1841   3       9        -2.611236  peptidyl-prolyl cis/trans isomerase
SAV1842   2       10       -2.537139  cmp-binding-factor 1
SAV1843   2       10       -2.855420  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1842   SSPANPROTEIN  29     0.035    Salmonella invasion protein InvJ signature.
		>SSPANPROTEIN#Salmonella invasion protein InvJ signature.

Length = 336

Score = 28.6 bits (63), Expect = 0.035
Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 146 PAASSHHHNFASGLSYHVLTMLRIAKSICDI 176
PA S HH+ SGL ++ + LRIA+ + +
Sbjct: 72 PAKSEHHNGNVSGLHHNGKSELRIAEKLLKV 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1843   cloacin       36     7e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 7e-04
Identities = 40/175 (22%), Positives = 68/175 (38%), Gaps = 36/175 (20%)

Query: 239 KQKEVALHDHSQEWKSLEQQLNIEPITFPEKGVDR-YEKARAHKQSLERDIGLRNERLAQ 297
KQ++ + QEW + T P + +R YE+ARA D+ ER A+
Sbjct: 297 KQRQDEENRRQQEWDA----------THPVEAAERNYERARAELNQANEDVARNQERQAK 346

Query: 298 LKEEATQLEPVKQSDIDAF-ISLNQQENEIKNKEFELTAIE-------------KDIANK 343
A Q+ ++S++DA +L EI K+F A +
Sbjct: 347 ----AVQVYNSRKSELDAANKTLADAIAEI--KQFNRFAHDPMAGGHRMWQMAGLKAQRA 400

Query: 344 QRDKDELQANIGWSETHHDVDSSEAMKSYVSEQIKNKQEQAAYIKQLERSLEENK 398
Q D + QA + + ++A S E K K+++ + E +L + K
Sbjct: 401 QTDVNNKQA--AFDAAAKEKSDADAALSSAMESRKKKEDKK---RSAENNLNDEK 450


27  SAV1923  SAV1944  Y  N  Y  Genomic Island
LocusTag   DNBias  CDNBias  %GCBias    Product
SAV1923    0       15       -3.378003  truncated transposase
SAV1924    0       13       -3.182718  hypothetical protein
SAV1925    -1      11       -3.494493  hypothetical protein
SAV1926    -1      11       -4.699812  hypothetical protein
SAV1927    0       11       -5.136740  conserved hypothetical protein
SAV1928    1       11       -4.608886  hypothetical protein
SAV1929    1       12       -4.180801  hypothetical protein
SAV1930    0       13       -4.800748  conserved hypothetical protein
SAV1931    0       13       -3.127154  similar to ABC transporter (ATP-binding
SAV1932    -1      14       -3.205491  hypothetical protein
SAV1933    -1      13       -3.578531  similar to ABC transporter (ATP-binding
SAV1934    0       13       -3.811245  transcription regulator GntR family
SAV1935    1       15       -4.201144  hypothetical protein
SAV1936    0       14       -4.274718  similar to aspartate transaminase protein
SAV1937    3       16       -6.208262  truncated map-w protein
SAV1938    3       17       -5.012827  truncated map-w protein
SAV1939    5       28       -3.437095  truncated beta-hemolysin
SAV1940    4       23       -2.873801  hypothetical protein
SAV1941    3       20       -1.182126  hypothetical protein
SAV1942    3       18       -0.196946  hypothetical protein
SAV1943    4       18       0.436562   truncated amidase
SAV1943.1  3       15       -0.671613  truncated amidase
SAV1944    3       15       -1.075029  staphylokinase precursor
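
In the island summary rows of this section, every island flagged Y for both Bias and Virulence is labelled "Pathogenicity Island (biased-composition)", while the one island flagged Y for Bias and N for Virulence is labelled "Genomic Island". The sketch below only reproduces that observed pattern. It is an inference from these rows, not PredictBias's documented decision logic; the function name `island_label` is hypothetical, and the Bias = N case is left undefined because no such row appears here.

```python
def island_label(bias, virulence):
    """Reproduce the Prediction column for the flag combinations seen in this report."""
    if not bias:
        # No Bias = N rows appear in this section, so the label is unknown.
        raise ValueError("label for Bias = N is not shown in this report")
    if virulence:
        return "Pathogenicity Island (biased-composition)"
    return "Genomic Island"

# (S.No, Bias, Virulence) for islands 26 to 28 as listed in this section.
for number, bias, virulence in [(26, True, True), (27, True, False), (28, True, True)]:
    print(number, island_label(bias, virulence))
```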
28  SAV1954  SAV2008  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1954   3       14       1.574181   phi PVL orfs 18-19-like protein
SAV1955   3       13       1.430395   phi PVL ORF 15 and 16 homolog
SAV1956   4       16       -0.729148  hypothetical protein
SAV1957   5       20       -0.658983  hypothetical protein
SAV1958   3       17       -0.551951  hypothetical protein
SAV1959   1       17       0.659314   hypothetical protein
SAV1960   2       17       -0.016029  hypothetical protein
SAV1961   1       17       -0.108772  hypothetical protein
SAV1962   0       17       0.038894   hypothetical protein
SAV1963   0       17       -0.054840  hypothetical protein
SAV1964   0       18       -0.391285  phiN315 scaffolding protein-like protein
SAV1965   1       20       -1.115383  hypothetical protein
SAV1966   1       21       -0.565318  phage terminase large subunit
SAV1967   4       28       0.374258   hypothetical protein
SAV1968   7       30       -0.103291  hypothetical protein
SAV1969   5       31       2.599832   hypothetical protein
SAV1970   3       29       1.951386   phi PVL ORF 60 homolog
SAV1971   3       31       3.484000   hypothetical protein
SAV1972   4       34       4.099820   hypothetical protein
SAV1973   4       35       4.014571   hypothetical protein
SAV1974   5       39       5.035386   dUTP pyrophosphatase
SAV1975   6       37       3.012586   hypothetical protein
SAV1976   3       36       3.270868   similar to phi PVL ORF 52 homolog
SAV1977   2       30       1.673128   phi PV83 orf 27-like protein
SAV1978   0       24       1.098709   PVL orf 51-like protein
SAV1979   -1      22       0.809673   phi PVL ORF 50 homolog
SAV1980   0       17       0.082022   hypothetical protein
SAV1981   3       17       -0.949968  hypothetical protein
SAV1982   3       19       -0.843157  hypothetical protein
SAV1983   4       17       -0.976888  single-strand DNA-binding protein
SAV1984   5       18       -1.413277  hypothetical protein
SAV1985   6       20       -1.563705  hypothetical protein
SAV1986   8       24       -1.991900  hypothetical protein
SAV1987   6       27       -2.295190  hypothetical protein
SAV1988   5       29       -2.473603  phi PVL orf 39-like protein
SAV1989   4       24       -1.124586  hypothetical protein
SAV1990   2       25       -1.197502  phi PVL ORF 38 homolog
SAV1991   3       21       -1.457562  hypothetical protein
SAV1992   3       20       -1.607490  hypothetical protein
SAV1993   3       19       -0.167407  phi PVL orf 35-like protein
SAV1994   5       19       -1.185537  anti repressor
SAV1995   6       19       -2.586163  phi PVL orf 33-like protein
SAV1996   5       18       -2.394818  phi PVL orf 32-like protein
SAV1997   3       21       -2.516061  hypothetical protein
SAV1998   3       19       -1.954482  repressor homolog
SAV1999   3       17       -2.610416  hypothetical protein
SAV2000   1       15       -2.530659  hypothetical protein
SAV2001   0       13       -1.674630  hypothetical protein
SAV2002   0       14       -2.412652  integrase
SAV2003   1       14       -2.685300  truncated beta-hemolysin
SAV2004   1       12       -3.269562  hypothetical protein
SAV2005   0       14       -3.513925  hypothetical protein
SAV2006   2       16       -3.259046  similar to succinyl-diaminopimelate
SAV2007   5       15       -3.475538  similar to Na+transporting ATP synthase
SAV2008   2       15       -2.983541  extracellular enterotoxin L
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1955   GPOSANCHOR    35     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.4 bits (81), Expect = 0.002
Identities = 22/145 (15%), Positives = 44/145 (30%), Gaps = 18/145 (12%)

Query: 3 ERIKGLSIGLDLDAANLNRSFAEIKRNFKTLNSDLKLTGNNFKYTEKSTDSYQQRIKELD 62
+IK L AA ++ +D E + + R EL+
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT----LEAEKAALEARQAELE 266

Query: 63 GTIIGYKKNVDDLAKQYDKVSQEQGE--------------NSAEAQKLRQEYNKQANELN 108
+ G + + + E+ +A Q LR++ +
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326

Query: 109 YLERELQKTSAEFEEFKKAQVEAQR 133
LE E QK + + + ++ +R
Sbjct: 327 QLEAEHQKLEEQNKISEASRQSLRR 351



Score = 32.0 bits (72), Expect = 0.021
Identities = 12/134 (8%), Positives = 33/134 (24%), Gaps = 14/134 (10%)

Query: 18 NLNRSFAEIKRNFKTLNSDLKLTGNNFKYTEKSTDSYQQRIKELDGTIIGYKKNVDDLAK 77
L + K + + L + + E ++ ++ + L
Sbjct: 89 ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEA 148

Query: 78 QYDKVSQEQ--------------GENSAEAQKLRQEYNKQANELNYLERELQKTSAEFEE 123
+ ++ + +SA+ + L E LE+ L+
Sbjct: 149 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA 208

Query: 124 FKKAQVEAQRMAES 137
+ +
Sbjct: 209 DSAKIKTLEAEKAA 222


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1979   PF06580       28     0.012    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 27.5 bits (61), Expect = 0.012
Identities = 9/36 (25%), Positives = 18/36 (50%), Gaps = 5/36 (13%)

Query: 67 ERLEQARLERKLERKRKKEAELR----RKKPH-LFN 97
+ +QA +++ +EA+L + PH +FN
Sbjct: 142 KNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFN 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1986   CHANLCOLICIN  30     0.022    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 30.4 bits (68), Expect = 0.022
Identities = 57/314 (18%), Positives = 122/314 (38%), Gaps = 40/314 (12%)

Query: 194 EIETKKKILTDKIKQINKDIKDIPIRINQTQ----------QNKQDVPEFDNDRYA---- 239
E + K K D + Q KDI + +R N ++ N E + R A
Sbjct: 78 EAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEE 137

Query: 240 IIKQEIEQLENERIDIQNGAEEINLRNQLADKQSELKRIEDNNSAS----------NENK 289
++E E E + + +EI ++Q +L E+ A+ + K
Sbjct: 138 KARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKK 197

Query: 290 IHTLTNELHVENGTVANLKTRLKQ-NKQQIAHEENRRNQLLENHKGLKS--DLEKAENQK 346
+ +E+ +G + L +RL + A + + E + +L++ +
Sbjct: 198 LSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKL 257

Query: 347 FEYLDDNVCSCCGQQLLAEQVN--EAREKALQKFNASKSKELETIQVSINHIISEGKKIK 404
+D + + + +V + RE+ ++ AS+++ IN I ++ +I+
Sbjct: 258 SPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETR--------INRINADITQIQ 309

Query: 405 PIIEKLVDDNNNLQIKINEAEERSERIQNKINKLKTTHVDVRQTDEYKAVMLEINEINQK 464
I ++ ++ N +++EAEE ++ QN + + Y+ + + E K
Sbjct: 310 KAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLTEKYGE---K 366

Query: 465 RSNIRKTIQDKVSG 478
S + + + DK G
Sbjct: 367 YSKMAQELADKSKG 380


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2001   IGASERPTASE   33     5e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 5e-04
Identities = 20/104 (19%), Positives = 39/104 (37%), Gaps = 2/104 (1%)

Query: 26 ESKKEVKSKEKKIEKEKENKSKKDKEKEVA--TQQQPDNQTVEQPQSQEQSVQQPQQQIP 83
+S E K + KE K++K K TQ+ P + P+ ++ QPQ +
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 84 QNSVPQQNVQVQQNKKQKVDLNNMPPTDFSTEGMSEQAQKQIEE 127
+ + P N++ Q++ P + S+ +
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2004   BICOMPNTOXIN  214    8e-70    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 214 bits (547), Expect = 8e-70
Identities = 82/315 (26%), Positives = 145/315 (46%), Gaps = 18/315 (5%)

Query: 16 ISTALTVFPATSYAKINSEIKAVSEKNLDGDTKMYTRTATTSDSQKNITQSLQFNFLTEP 75
+S +L A + + D ++ RT + ++ +TQ++QF+F+ +
Sbjct: 11 LSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDK 70

Query: 76 NYDKETVFIKAKGTIGSGLRILDPNGY-WNSTLRWPGSYSVSIQNVDDNNNTNVTDFAPK 134
Y+K+ + +K +G I S + +RWP Y++ ++ ++ ++ ++ PK
Sbjct: 71 KYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKT--NDKYVSLINYLPK 128

Query: 135 NQDESREVKYTYGYKTGGDFSINRGGLTGNITKESNYSETISYQQPSYRTLLDQSTSHKG 194
N+ ES V T GY GG+F L GN + NYS++ISY Q +Y + ++Q K
Sbjct: 129 NKIESTNVSQTLGYNIGGNFQSAPS-LGGNGSF--NYSKSISYTQQNYVSEVEQQN-SKS 184

Query: 195 VGWKVEAHLINNMGHDHTRQLTNDSDNRTKSEIFSLTRNGNLWAKDNFTPKDKMPVTVSE 254
V W V+A+ + S++F + + +D F P ++P V
Sbjct: 185 VLWGVKANSFATESGQKSAF---------DSDLFVGYKPHSKDPRDYFVPDSELPPLVQS 235

Query: 255 GFNPEFLAVMSHDKKDKGKSQFVVHYKRSMDEFKIDWNRHGFWG-YWSGENHVDK-KEEK 312
GFNP F+A +SH+K S+F + Y R+MD + Y G +
Sbjct: 236 GFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRN 295

Query: 313 LSALYEVDWKTHDVK 327
+ YEV+WKTH++K
Sbjct: 296 YTVKYEVNWKTHEIK 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2005   BICOMPNTOXIN  165    1e-50    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 165 bits (419), Expect = 1e-50
Identities = 103/343 (30%), Positives = 163/343 (47%), Gaps = 42/343 (12%)

Query: 4 KKRVLIASSLSCAILLLSAATTQANSAHKDSQDQNKKEHVDKSQQKDKRNVTNKDKNSTV 63
K ++ ++LS ++L A N+
Sbjct: 2 LKNKILTTTLSVSLLAPLANPLLENAKAA-----------------------------ND 32

Query: 64 PDDIGKNGKIT--KRTETVYDEKTNILQNLQFDFIDDPTYDKNVLLVKKQGSIHSNLKFE 121
+DIGK I KRTE K + QN+QFDF+ D Y+K+ L++K QG I S +
Sbjct: 33 TEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDKKYNKDALILKMQGFISSRTTYY 92

Query: 122 SHKEEKNSNWLKYPSEYHVDFQVKRNRKTEILDQLPKNKISTAKVDSTFSYSSGGKFDST 181
++K+ + +++P +Y++ + ++ +++ LPKNKI + V T Y+ GG F S
Sbjct: 93 NYKKTNHVKAMRWPFQYNIGLKTN-DKYVSLINYLPKNKIESTNVSQTLGYNIGGNFQSA 151

Query: 182 KGIGRTSSNSYSKTISYNQQNYDTIASGKNNNWHVHWSVIANDLKYGGEVKNRNDELLFY 241
+G S +YSK+ISY QQNY + + N+ V W V AN K+ D LF
Sbjct: 152 PSLGGNGSFNYSKSISYTQQNYVSEVE-QQNSKSVLWGVKANSFATESGQKSAFDSDLFV 210

Query: 242 RNTRIATVENPELSFASKYRYPALVRSGFNPEFLTYLSNEK-SNEKTQFEVTYTRNQDIL 300
+ +P F P LV+SGFNP F+ +S+EK S++ ++FE+TY RN D+
Sbjct: 211 GYKPHSK--DPRDYFVPDSELPPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVT 268

Query: 301 KN-RPGIHYAPSILEKNKDG-----QRLIVTYEVDWKNKTVKV 337
+ HY S L+ ++ + V YEV+WK +KV
Sbjct: 269 HAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKYEVNWKTHEIKV 311


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2008   BACTRLTOXIN   118    1e-34    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 118 bits (298), Expect = 1e-34
Identities = 74/279 (26%), Positives = 117/279 (41%), Gaps = 54/279 (19%)

Query: 1 MKKRLLFVIVITLFIFS---SNHTVLSNGDVGP---------------GNLRNFYTKYEY 42
M KRL VI +F S VL+ P GN++ Y Y
Sbjct: 1 MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLY-DDHY 59

Query: 43 VNLKNVKDKNSPESHRLEYS-----YKN-DTLYAEFDNEYITSDLKGKNVDVFGISYKYG 96
V+ VK + +H L Y+ KN D + E NE + K + VDV+G +Y
Sbjct: 60 VSATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVN 119

Query: 97 S-------------NSRTIYGGVTKAENNKLDSPRIIPINLIINGKHQTVTTKSVSTDKK 143
+YGG+TK E N D+ + + + + + + V TDKK
Sbjct: 120 CYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKK 179

Query: 144 MVTAQEIDVKLRKYLQDEFNIYGHNDTGKGKEYGTSSKFYSGFDKGSVVFHMNDGSNFSY 203
VTAQE+D+K R +L ++ N+Y ++ ++ G + F N+G+ F Y
Sbjct: 180 SVTAQELDIKARNFLINKKNLY-EFNSSP-------------YETGYIKFIENNGNTFWY 225

Query: 204 DLFYT--GYGLPESFLKIYKDNKTVDSTQFHLDVEISKR 240
D+ +L +Y DNKTVDS ++V ++ +
Sbjct: 226 DMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTK 264


29  SAV2022  SAV2035  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2022   2       20       -2.264421  hypothetical protein
SAV2023   4       22       -2.636367  hypothetical protein
SAV2024   1       14       -0.098298  hypothetical protein
SAV2025   1       13       0.109147   hypothetical protein
SAV2026   1       13       -0.580234  hypothetical protein
SAV2027   -2      11       0.246887   hypothetical protein
SAV2028   -2      11       0.742744   similar to integrase
SAV2029   -1      12       0.626032   GroEL protein
SAV2030   3       13       -0.946630  GroES protein
SAV2031   3       12       -2.019500  conserved hypothetical protein
SAV2032   3       13       -1.439378  hypothetical protein
SAV2033   -2      13       -4.214650  similar to nitroreductase family protein
SAV2034   0       11       -4.226883  hypothetical protein
SAV2035   -1      13       -3.348558  delta-hemolysin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2032   TONBPROTEIN   46     7e-08    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 46.1 bits (109), Expect = 7e-08
Identities = 29/96 (30%), Positives = 36/96 (37%), Gaps = 7/96 (7%)

Query: 103 DSKPDPNNQNPSPNPKPDPDNPKPKPDPDKPKPNLDPKPDPDNPKPKPDPKPDPDKPK-P 161
D +P Q P P+P P+P K P + KP P KPKP PKP + P
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP---KPKPKPKPVKKVQEQP 110

Query: 162 NPDPKP---DPDKPKPNPNPKPDPNKPNPNPSPDPD 194
D KP P P N P + + P
Sbjct: 111 KRDVKPVESRPASPFENTAPARLTSSTATAATSKPV 146



Score = 35.7 bits (82), Expect = 2e-04
Identities = 24/109 (22%), Positives = 30/109 (27%), Gaps = 10/109 (9%)

Query: 98 QNPSTDSKPDPNNQNPSPNPKPDPDNPKPKPDPD-------KPKPNLDPKPDPDNPKPKP 150
+ P P P P P+P P+ PK P KPKP K +
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVK 115

Query: 151 DPKPDPDKPKPNPDPKPDPDKPKPNPNPKP---DPNKPNPNPSPDPDQP 196
+ P P N P KP + P P P
Sbjct: 116 PVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYP 164


30  SAV2145  SAV2166  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2145   2       15       -3.066761  repressor protein
SAV2146   2       17       -3.620827  cation-efflux system membrane protein homolog
SAV2147   1       17       -3.735674  lytic regulatory protein truncated with Tn554
SAV2148   1       20       -1.218639  hypothetical protein
SAV2149   0       13       0.263539   hypothetical protein
SAV2150   0       14       -0.113812  hypothetical protein
SAV2151   0       13       0.626032   hypothetical protein
SAV2152   -2      11       0.515266   conserved hypothetical protein
SAV2153   -1      13       0.707880   similar to ATP-binding protein homolog
SAV2154   0       15       0.567502   glucosamine-fructose-6-phosphate
SAV2155   8       13       1.077917   hypothetical protein
SAV2156   7       13       1.236136   mannitol specific IIBC component of PTS system
SAV2157   8       12       0.780878   hypothetical protein
SAV2158   9       15       0.900975   PTS system, mannitol specific IIA component
SAV2159   8       15       0.650901   mannitol-1-phosphate 5-dehydrogenase
SAV2160   7       15       0.925480   FmtB protein
SAV2161   1       13       -0.179360  phosphoglucosamine-mutase
SAV2162   2       15       -1.021720  conserved hypothetical protein
SAV2163   3       15       -0.762487  conserved hypothetical protein
SAV2164   4       16       -1.080076  arginase
SAV2165   5       14       -0.993571  ATP-binding protein Mrp-like protein
SAV2166   3       13       -2.157159  similar to multidrug resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2153   PF05272       29     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.017
Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 8/58 (13%)

Query: 32 ILYGLNGAGKTTLLNILNAYEPATTGGVNLFGKMPGKVGYSAETVRQHIGFVSHSLLE 89
+L G G GK+TL+N L G++ F +G ++ Q G V++ L E
Sbjct: 600 VLEGTGGIGKSTLINTL--------VGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSE 649


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2160   IGASERPTASE   45     7e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 44.7 bits (105), Expect = 7e-06
Identities = 57/316 (18%), Positives = 103/316 (32%), Gaps = 20/316 (6%)

Query: 2122 PQANNNSSADASTNSPTMDNDVTSKPEVESTNNG---TTDKPVTETDNATPAESTTNN-- 2176
P+ + +TN T +N P V S N + PV ATP+E+T
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 2177 ----NSTTTATNENAPTGSTATAPTTASTEAASSADSKDNASVNDSKQNAEVNNSAESQS 2232
S T NE T +TA A ++ + V S + + E++
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102

Query: 2233 TNGKVAQPKS--ENKAKAEKDGRDSTNQSMVESTTETLPSADITEPNVPSNTSKDKEEST 2290
T + K+ E + E S E + P A+ N P+ K+ + T
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 2291 TNQTDAGQLKSETNVASNEADKSPSKADT----EVSNKPSTSASSEAKDKMTSTNVSQKD 2346
D Q ET+ + + +T + + +T A+++ S+N +
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 2347 DTATADTNDTQKSVGPVANNKAKDMQTNDTQKSVGSAANNKATQNDGANASPATVSNG-S 2405
+ + ++N + S N + A A ++ G +
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRS----TVALCDLTSTNTNAVLSDARAKAQFVALNVGKA 1278

Query: 2406 HSMHQDMLNVTKPEEN 2421
S H L + +
Sbjct: 1279 VSQHISQLEMNNEGQY 1294



Score = 42.0 bits (98), Expect = 4e-05
Identities = 47/318 (14%), Positives = 104/318 (32%), Gaps = 9/318 (2%)

Query: 1073 NNGSTTEEKEAAKQQVQTEKTAADAAIDAAHSNVEVEAAKNAEIAKI-EAIQPATTTKDN 1131
NG +++ QT T + ++V + N EIA++ EA P
Sbjct: 974 VNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATP 1033

Query: 1132 AKQAIATKANERKTAIAQTQDITAEEIAAANADVDNAVTQANSNIEAANSQNDVDQAKTT 1191
++ N ++ ++T + ++ A +A SN++A N+V Q+ +
Sbjct: 1034 SETTETVAENSKQE--SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 1192 GETSIDQVTPTVNKKATARNEITAILNNKLQEIQATPDATDEEKQAADAEANTENGKANQ 1251
+ T T K TA E + ++ Q P T + + +
Sbjct: 1092 TKE-----TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 1252 AISAATTNAQVDEAKANAEAAINAVTPKVVKKQAAKDEIDQLQATQTNVINNDQNATNEE 1311
+ T N + +++ N A + T +V+ N +N T
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206

Query: 1312 KEAAIQQLATAVTDAKNNITAATDDNGVDTAKDAGKNSIQSTQPATAVKSNAKNEVDQAV 1371
+ + ++ ++ + + + V+ A + + + +N + A
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR-STVALCDLTSTNTNAVLSDAR 1265

Query: 1372 TTQNQAIDNTTGATTEEK 1389
N A ++
Sbjct: 1266 AKAQFVALNVGKAVSQHI 1283



Score = 33.9 bits (77), Expect = 0.010
Identities = 61/312 (19%), Positives = 105/312 (33%), Gaps = 18/312 (5%)

Query: 1021 DIDNATANTDVDNAKTTNEATIAAITPDANVKPAAKQAIADKVQAQETAIDANNGSTTEE 1080
D+ N TTN T I D P+ + IA +A S T E
Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038

Query: 1081 K--EAAKQQVQTEKTAADAAIDAAHSNVEVEAAKNAEIAKIEAIQPATTTKDNAKQAIAT 1138
E +KQ+ +T + A + N EV + + A T + Q+ +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNV-------KANTQTNEVAQSGSE 1091

Query: 1139 KANERKTAIAQTQDITAEEIAAANADVDNAVTQANSNIEAANSQNDVDQAKTTGETSIDQ 1198
+ T +T + EE A + V + S + Q++ Q + D
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND- 1150

Query: 1199 VTPTVN-KKATARNEITAILNNKLQEIQATPDATDEEKQAADAEANTENGKANQAISAAT 1257
PTVN K+ ++ TA +E + + E + + N + AT
Sbjct: 1151 --PTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT--TPAT 1206

Query: 1258 TNAQVDEAKANAEAAINAVTPKVVKKQ---AAKDEIDQLQATQTNVINNDQNATNEEKEA 1314
T V+ +N + + + V A D+ ++ + + NA + A
Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARA 1266

Query: 1315 AIQQLATAVTDA 1326
Q +A V A
Sbjct: 1267 KAQFVALNVGKA 1278



Score = 33.5 bits (76), Expect = 0.013
Identities = 45/305 (14%), Positives = 93/305 (30%), Gaps = 14/305 (4%)

Query: 804 KNEEIFKIENITDSTQTKMDAYKEVRQAATARKAQNATVSNATDEEVAEANAAVDAAQTE 863
E K E+ T + + A++A++ +N EVA++ + QT
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 864 GLHDIQVVKSQQEVADTKAKVLDKINAIQTQAKVKPAADTEVENAYNTRKQEIQNSNAST 923
+ V+ +++ A + + ++ + +Q K V+ ++ N
Sbjct: 1099 ETKETATVEKEEK-AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 924 TEEKEAAYTELDAKKQEARTNLDAANTNSDVTTAKDNGIAAINQVQAATTKKSDAKAEIA 983
+ + + + +E +N++ T T N + + T + +E +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVT-ESTTVNTGNSVVENPENTTPATTQPTVNSESS 1216

Query: 984 QKASERKTAIEAMNDSTTEEQQAAKDKVDQAVVTANADIDNATANTDVDNAKTTNEATIA 1043
K R + E V + N A AK A
Sbjct: 1217 NKPKNR-HRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVA--- 1272

Query: 1044 AITPDANVKPAAKQAIADKV---QAQETAIDANNGSTTEEKEAAKQQVQTEKTAADAAID 1100
NV A Q I+ + Q +N + ++ ++ T D
Sbjct: 1273 -----LNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKNYSSSQYRRFSSKSTQTQLGWD 1327

Query: 1101 AAHSN 1105
SN
Sbjct: 1328 QTISN 1332



Score = 32.3 bits (73), Expect = 0.034
Identities = 32/210 (15%), Positives = 63/210 (30%), Gaps = 4/210 (1%)

Query: 34 TTASAAEQNQPAQNQPAQPADANTQPNANAGAQANPAAQPANQGGQANPAGGAAQPAGQG 93
T + +Q P Q QP A + +P Q N QPA +
Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175

Query: 94 NQADPNNAAQAQPGNQ--AAPANQAGQGNNQATPNNNATPANQTQPANAPA-AAQPAAPV 150
+ ++ N + N P N+ +N+ + + + + P
Sbjct: 1176 SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235

Query: 151 AANAQTQDPNASNTGE-GSINTTLTFDDPAISTDENRQDPTVTVTDKVNGYSLINNGKIG 209
A + D + + S NT D + V+ ++ + N G+
Sbjct: 1236 PATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295

Query: 210 FVNSELRRSDMFDKNNPQNYQAKGNVAALG 239
S + + + + + +K LG
Sbjct: 1296 VWVSNTSMNKNYSSSQYRRFSSKSTQTQLG 1325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2166   TCRTETB       144    2e-40    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 144 bits (365), Expect = 2e-40
Identities = 98/416 (23%), Positives = 194/416 (46%), Gaps = 14/416 (3%)

Query: 7 TTRRRNFIVAVMLISAFVAILNQTLLNTALPSIMRELNINESTSQWLVTGFMLVNGVMIP 66
+ R N I+ + I +F ++LN+ +LN +LP I + N +++ W+ T FML +
Sbjct: 8 SNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTA 67

Query: 67 LTAYLMDRIKTRPLYLAAMGTFLLGSIVAALAPN-FGVLMLARVIQAMGAGVLMPLMQFT 125
+ L D++ + L L + GS++ + + F +L++AR IQ GA L+
Sbjct: 68 VYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 126 LFTLFSKEHRGFAMGLAGLVIQFAPAIGPTVTGLIIDQASWRVPFIIIVGIALVAFVFGL 185
+ KE+RG A GL G ++ +GP + G+I W +++++ + + V L
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFL 185

Query: 186 VSISSYNEVKYTKLDKRSVMYSTIGFGLMLYAFSSAGDLGFTSPIVIGALIISMVIIYLF 245
+ + D + ++ ++G + FT+ I LI+S++ +F
Sbjct: 186 MKLLKKEVRIKGHFDIKGIILMSVGIVFFML---------FTTSYSISFLIVSVLSFLIF 236

Query: 246 IRRQFNITNVLLNLRVFKNRTFALCTISSMIIMMSMVGPALLIPLYVQNSLSLSALLSGL 305
++ +T+ ++ + KN F + + II ++ G ++P +++ LS G
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGS 296

Query: 306 VIM-PGAIINGIMSVFTGKFYDKYGPRPLIYTGFTILTITTIMLCFLHTDTSYTYLIVVY 364
VI+ PG + I G D+ GP ++ G T L+++ + FL TS+ I++
Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIV 356

Query: 365 AIRMFSVSLLMMPINTTGINSLRNEEISHGTAIMNFGRVMAGSLGTALMVTLMSFG 420
+ S I+T +SL+ +E G +++NF ++ G A++ L+S
Sbjct: 357 FVLGGL-SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411
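
The statistics lines that head each alignment (Score, Expect, Identities, Positives, Gaps) can be checked mechanically. The following is a minimal sketch in Python, not part of the PredictBias output; the function names `parse_hsp_header` and `floor_percent` are hypothetical. It assumes the header layout shown in this report. The percentages printed in the alignments here appear to be truncated rather than rounded (for example 194/416 is reported as 46%), so the sketch recomputes them with integer division; that is an observation from this output, not a documented guarantee.

```python
import re

# Patterns for the two header lines of an alignment block as printed in this report.
HSP_SCORE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
HSP_COUNT = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def parse_hsp_header(text):
    """Return the bit score, raw score, E-value and match counts of one block."""
    match = HSP_SCORE.search(text)
    if match is None:
        raise ValueError("no 'Score = ... Expect = ...' line found")
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    # Each count is stored as (matches, alignment length, printed percentage).
    for name, num, den, pct in HSP_COUNT.findall(text):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats


def floor_percent(num, den):
    """Percentage truncated to an integer (assumed from the values in this report)."""
    return 100 * num // den


header = (
    "Score = 144 bits (365), Expect = 2e-40\n"
    "Identities = 98/416 (23%), Positives = 194/416 (46%), Gaps = 14/416 (3%)"
)
stats = parse_hsp_header(header)
for key in ("identities", "positives", "gaps"):
    num, den, printed = stats[key]
    print(key, num, den, printed, floor_percent(num, den))
```

Blocks without gaps simply have no "Gaps" entry in the returned dictionary, so callers should use `stats.get("gaps")` when that field is optional.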


31  SAV2479  SAV2502  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2479   3       16       -1.206480  hypothetical protein
SAV2480   5       15       -1.637471  hypothetical protein
SAV2481   6       17       -2.261654  hypothetical protein
SAV2482   7       17       -2.386771  hypothetical protein
SAV2483   7       17       -3.646941  hypothetical protein
SAV2484   7       17       -3.781933  hypothetical protein
SAV2485   9       18       -4.537763  hypothetical protein
SAV2486   4       15       -3.523460  hypothetical protein
SAV2487   3       15       -3.384460  hypothetical protein
SAV2488   1       15       -3.541613  similar to putative helicase
SAV2489   0       15       -2.806645  similar to putative helicase
SAV2490   -1      15       -2.224875  similar to mutator protein mutT
SAV2491   -2      14       -1.210015  similar to phosphomannomutase
SAV2492   3       18       -1.368469  hypothetical protein
SAV2493   4       19       -2.186472  hypothetical protein
SAV2494   6       17       -3.051160  hypothetical protein
SAV2495   4       14       -1.668170  conserved hypothetical protein
SAV2496   4       13       -1.678355  hypothetical protein
SAV2497   3       10       -0.480364  similar to accumulation-associated protein
SAV2498   3       11       0.985850   staphylococcal accessory regulator A homolog
SAV2499   3       10       1.226138   staphylococcal accessory regulator A homolog
SAV2500   1       9        2.190022   UTP-glucose-1-phosphate uridyltransferase
SAV2501   0       8        2.422853   transposase
SAV2502   1       10       3.076256   fibronectin-binding protein homolog
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV2496   V8PROTEASE  34     2e-04    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 34.2 bits (78), Expect = 2e-04
Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
P +P P NP+ P+ P P+ P NPNNP
Sbjct: 290 NNPDNPDNPNNPDNPNNPDEPNNPDNPNNP 319



Score = 32.7 bits (74), Expect = 7e-04
Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
+ P +P P+NP P P +P P NP+NP
Sbjct: 293 DNPDNPNNPDNPNNPDEPNNPDNPNNPDNP 322



Score = 32.3 bits (73), Expect = 0.001
Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
++P +P P+NP P P +P P NP+NP
Sbjct: 287 DQPNNPDNPDNPNNPDNPNNPDEPNNPDNP 316



Score = 30.7 bits (69), Expect = 0.003
Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNN 121
+ P +P P NP++P+ P +P+ P NP+N
Sbjct: 296 DNPNNPDNPNNPDEPNNPDNPNNPDNPDN 324



Score = 29.6 bits (66), Expect = 0.007
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 98 PKGPENPEKPSRPTHPSGPVNPNNPGLSKDRAKP-NGPVHSMDKND 142
P P+NP+ P+ P +P+ P PNNP + P NG ++ D D
Sbjct: 289 PNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNGDNNNSDNPD 334



Score = 27.7 bits (61), Expect = 0.027
Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 102 ENPEKPSRPTHPSGPVNPNNPGLSKDRAKPNGP 134
+ P P P +P+ P NPNNP + PN P
Sbjct: 287 DQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNP 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV2497   GPOSANCHOR  30     0.014    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.014
Identities = 17/108 (15%), Positives = 33/108 (30%), Gaps = 1/108 (0%)

Query: 14 FLSNKLNKYSIRKFTVGTASILIG-SLMYLGTQQEAEAAENNIENPTTLKDNVQSKEVKI 72
+N YS+RK GTAS+ + +++ G T +
Sbjct: 2 TKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADK 61

Query: 73 EEVTNKDTAPQGVEAKSEVTSNKDTIEHEASVKAEDISKKEDTPKEVA 120
E+ N + + + KD + + K K ++
Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLS 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV2502   TONBPROTEIN  51     6e-09    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 50.8 bits (121), Expect = 6e-09
Identities = 23/67 (34%), Positives = 27/67 (40%)

Query: 828 EVPSEPETPTPPTPEVPSEPGEPTPPKPEVPSEPETPVPPTPEVPSEPGKPVPPAKEEPK 887
EP P PE EP P PE P E + P KPV +E+PK
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 888 KPSKPVE 894
+ KPVE
Sbjct: 112 RDVKPVE 118



Score = 50.8 bits (121), Expect = 6e-09
Identities = 20/81 (24%), Positives = 23/81 (28%), Gaps = 4/81 (4%)

Query: 825 PTPEVPSE----PETPTPPTPEVPSEPGEPTPPKPEVPSEPETPVPPTPEVPSEPGKPVP 880
P P P P V P P+PE PE P + KP P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 881 PAKEEPKKPSKPVEQGKVVTP 901
K K +P K V
Sbjct: 99 KPKPVKKVQEQPKRDVKPVES 119



Score = 50.0 bits (119), Expect = 1e-08
Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 3/84 (3%)

Query: 822 PTPPTPEVPSEPETPTPPTPEVPSEPGEPTP---PKPEVPSEPETPVPPTPEVPSEPGKP 878
P V PE P PE P P + +P+ P +V +P +
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113

Query: 879 VPPAKEEPKKPSKPVEQGKVVTPV 902
V P + P P + ++ +
Sbjct: 114 VKPVESRPASPFENTAPARLTSST 137



Score = 45.0 bits (106), Expect = 5e-07
Identities = 22/105 (20%), Positives = 33/105 (31%), Gaps = 4/105 (3%)

Query: 810 EGQQTIEEDTTPPTPPTPEVPSEPETPTPPTPEVPSEPGEPTPPKPEVPSEPETPVPPTP 869
E Q ++ P P PE PE PP PKP+ + P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPE---PPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112

Query: 870 EV-PSEPGKPVPPAKEEPKKPSKPVEQGKVVTPVIEINEKVKAVA 913
+V P E P P + + PV + +A++
Sbjct: 113 DVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALS 157



Score = 42.3 bits (99), Expect = 5e-06
Identities = 27/88 (30%), Positives = 31/88 (35%), Gaps = 12/88 (13%)

Query: 834 ETPTPPTP-------EVPSEPGEPTPPKPEVPSEPETPVPPTPEVPSEPGKPVPPAKEEP 886
E P P P EP + P PE EPE P PE P K P E+P
Sbjct: 37 ELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPP----KEAPVVIEKP 92

Query: 887 KKPSKPVEQGKVVTPVIEINEKVKAVAP 914
K KP + V + VK V
Sbjct: 93 KPKPKPKPK-PVKKVQEQPKRDVKPVES 119



Score = 32.3 bits (73), Expect = 0.008
Identities = 17/81 (20%), Positives = 25/81 (30%), Gaps = 5/81 (6%)

Query: 853 PKPEVPSE----PETPVPPTPEVPSEPGKPVPPAKEEPKKPSKPVEQGKVVTPVIEINEK 908
P P P + P V P V P E P P ++ VV + K
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPP-KEAPVVIEKPKPKPK 97

Query: 909 VKAVAPTKQKQSKKSELPETG 929
K K ++ K ++
Sbjct: 98 PKPKPVKKVQEQPKRDVKPVE 118


32  SAV2649  SAV2654  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
SAV2649   6       13       2.221023  preprotein translocase secA homolog
SAV2650   8       14       2.654091  hypothetical protein
SAV2651   9       14       2.762771  hypothetical protein
SAV2652   11      14       2.707409  hypothetical protein
SAV2653   11      15       3.006538  similar to preprotein translocase secY
SAV2654   13      16       3.961392  serine-threonine rich antigen
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
SAV2649   SECA  656    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 656 bits (1695), Expect = 0.0
Identities = 287/835 (34%), Positives = 450/835 (53%), Gaps = 68/835 (8%)

Query: 10 NELRLKSIRKIVKRINTWSDEVKSYSDDALKQKTIEFKERLASGVDTLDTLLPEAYAVAR 69
N+ L+ +RK+V IN E++ SD+ LK KT EF+ RL G + L+ L+PEA+AV R
Sbjct: 14 NDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKG-EVLENLIPEAFAVVR 72

Query: 70 EASWRVLGMYPKEVQLIGAIVLHEGNIAEMQTGEGKTLTATMPLYLNALSGKGTYLITTN 129
EAS RV GM +VQL+G +VL+E IAEM+TGEGKTLTAT+P YLNAL+GKG +++T N
Sbjct: 73 EASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVN 132

Query: 130 DYLAKRDFEEMQPLYEWLGLTASLGFVDIVDYEYQKGEKRNIYEHDIIYTTNGRLGFDYL 189
DYLA+RD E +PL+E+LGLT V I KR Y DI Y TN GFDYL
Sbjct: 133 DYLAQRDAENNRPLFEFLGLT-----VGINLPGMPAPAKREAYAADITYGTNNEYGFDYL 187

Query: 190 IDNLADSAEGKFLPQLNYGIIDEVDSIILDAAQTPLVISGAPRLQSNLFHIVKEFVDTLI 249
DN+A S E + +L+Y ++DEVDSI++D A+TPL+ISG S ++ V + + LI
Sbjct: 188 RDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLI 247

Query: 250 E-----------DVHFKMKKTKKEIWLLNQGIEAAQSYFNV-------EDLYSEQAMVLV 291
+ HF + + +++ L +G+ + E LYS ++L+
Sbjct: 248 RQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLM 307

Query: 292 RNINLALRAQYLFESNVDYFVYNGDIVLIDRITGRMLPGTKLQAGLHQAIEAKEGMEVST 351
++ ALRA LF +VDY V +G+++++D TGR + G + GLHQA+EAKEG+++
Sbjct: 308 HHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQN 367

Query: 352 DKSVMATITFQNLFKLFESFSGMTATGKLGESEFFDLYSKIVVQAPTDKAIQRIDEPDKV 411
+ +A+ITFQN F+L+E +GMT T EF +Y V PT++ + R D PD V
Sbjct: 368 ENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDLV 427

Query: 412 FRSVDEKNIAMIHDIVELHETGRPVLLITRTAEAAEYFSKVLFQMDIPNNLLIAQNVAKE 471
+ + EK A+I DI E G+PVL+ T + E +E S L + I +N+L A+ A E
Sbjct: 428 YMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANE 487

Query: 472 AQMIAEAGQIGSMTVATSMAGRGTDIKLG-----------------------------EG 502
A ++A+AG ++T+AT+MAGRGTDI LG +
Sbjct: 488 AAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDA 547

Query: 503 VEALGGLAVIIHEHMENSRVDRQLRGRSGRQGDPGSSCIYISLDDYLVKRWSDSNLAENN 562
V GGL +I E E+ R+D QLRGRSGRQGD GSS Y+S++D L++ ++ ++
Sbjct: 548 VLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMM 607

Query: 563 QLYSLDAQRLSQSNLFNRKVKQIVVKAQRISEEQGVKAREMANEFEKSISIQRDLVYEER 622
+ + + + + AQR E + R+ E++ + QR +Y +R
Sbjct: 608 RKLGMKPGEAIEHPWVTKAIA----NAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQR 663

Query: 623 NRVLEIDDAENRDFKALAKDVFEMFVNEE---KVLTKSRVVEYIYQNLSFQFNKDVACVN 679
N +L++ D ++ +DVF+ ++ + L + + + + L F+ D+
Sbjct: 664 NELLDVSDVSET-INSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAE 722

Query: 680 FKDKQAVVT------FLLEQFEKQLALNRKNMQSAYYYNIFVQKVFLKAIDSCWLEQVDY 733
+ DK+ + +L Q + ++ + A F + V L+ +DS W E +
Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQ-RKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAA 781

Query: 734 LQQLKASVNQRQNGQRNAIFEYHRVALDSFEVMTRNIKKRMVKNICQSMITFDKE 788
+ L+ ++ R Q++ EY R + F M ++K ++ + + + +E
Sbjct: 782 MDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2653   SECYTRNLCASE  130    4e-36    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 130 bits (329), Expect = 4e-36
Identities = 93/440 (21%), Positives = 181/440 (41%), Gaps = 52/440 (11%)

Query: 4 LLQQYEYKIIYKRMLYTCFILFIYILGTNISI--VSYNDMQ------VKHESFFKIAISN 55
+ + + K++L+T I+ +Y +GT+I I V Y ++Q ++ F +
Sbjct: 5 FARAFRTPDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGLVNMF 64

Query: 56 MGGDVNTLNIFTLGLVPWLTSMIILMLISYRNMDKYMKQTSLEKHYKE------------ 103
GG + + IF LG++P++T+ IIL L++ + LE KE
Sbjct: 65 SGGALLQITIFALGIMPYITASIILQLLT-------VVIPRLEALKKEGQAGTAKITQYT 117

Query: 104 RILTLILSVIQSYFVIHEYVSKERVHQDN-------------IYLTILILVTGTMLLVWL 150
R LT+ L+++Q ++ S + + ++ + GT +++WL
Sbjct: 118 RYLTVALAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWL 177

Query: 151 ADKNSRYGIAGPMPIVMVSIIKSMMHQKMEYI------DASHIVIALLIILVIITLFILL 204
+ + GI M I+M I + + I I +I + +I + +++
Sbjct: 178 GELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVV 237

Query: 205 FIELVEVRIPYI----DLMNVSATNMKSYLSWKVNPAGSITLMMSISAFVFLKSGIHFIL 260
F+E + RIP + S +Y+ KVN AG I ++ + S F
Sbjct: 238 FVEQAQRRIPVQYAKRMIGRRSYGGTSTYIPLKVNQAGVIPVIFASSLLYIPALVAQFAG 297

Query: 261 SMFNKSISDDMPMLTFDSPVGISVYLVIQMLLGYFLSRFLINTKQKSKDFLKSGNYFSGV 320
+ + D P+ I Y ++ + +F N ++ + + K G + G+
Sbjct: 298 GNSGWKSWVEQNLTKGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGGFIPGI 357

Query: 321 KPGKDTERYLNYQARRVCWFGSALVTVIIGIPLYFTLFVPHLSTEIYFS-VQLIVLVYIS 379
+ G+ T YL+Y R+ W GS + +I +P L S F ++++V +
Sbjct: 358 RAGRPTAEYLSYVLNRITWPGSLYLGLIALVP-TMALVGFGASQNFPFGGTSILIIVGVG 416

Query: 380 INIAETIRTYLYFDKYKPFL 399
+ + I + L Y+ FL
Sbjct: 417 LETVKQIESQLQQRNYEGFL 436


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV2654   ICENUCLEATIN  55     3e-09    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 55.1 bits (132), Expect = 3e-09
Identities = 237/1070 (22%), Positives = 425/1070 (39%), Gaps = 12/1070 (1%)

Query: 1098 SDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSD 1157
+ + ++ + S S + + T +T S ST ++ +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1158 STSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSE 1217
ST T +S I+ ST + + + S + ++ST + S ES
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1218 STSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSE 1277
+ ST + S + S T+ S+ + ST + S+ T+ ST T +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1278 SVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSL 1337
S T+ ST T+ ++S+ ++ S T+ +S + ST + S + ST
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1338 SGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQ 1397
+G S + S+ + DS+ + S T+ S T+ S T+ + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1398 SGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQ 1457
S + S + ST T++ S T+ Y S T+ +S+ + S T+ S+
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1458 SGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSA 1517
+G ST + GS+ + S ST+ ES+ + S + S + ST + +
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1518 SASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSE 1577
S + STS + + S+ + S ++ S+ + ST + ST
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1578 SGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSD 1637
+GS S + ST T+ S T+ S + S T+ STS + + S +
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1638 SQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVM 1697
S + S T+ ST + SD T+ S ST+G+ S I+ ST T+ S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 1698 SASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESS 1757
+ S + S S S S + +DS ++G S + S T+ S +
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 1758 SLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTST 1817
S+ + S S + +DSS ++ S +++ S + S T+ S T+G STST
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 1818 SLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQ 1877
+ + S ++ S + S T+ S + S + STS + DS ++
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYG 885

Query: 1878 SMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSD 1937
S + S + S+ T+ D + S S + S GS T++ ST
Sbjct: 886 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLM 945

Query: 1938 SMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSIST 1997
+ GS + S + GSTS++ S+ + S + QST T+ GS T+ +
Sbjct: 946 AGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHS 1005

Query: 1998 SMSMSASTSSSQSTSVSTSLSTSDSISDST----------SISISGSQSTVESESTSDST 2047
S + S++ + + S+ ++ S S S ISG +S + + S
Sbjct: 1006 STLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLI 1065

Query: 2048 SISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS--ESLSTSGSGSTSVSDSTSMSESDST 2105
S S + S+ ++ S +G ST I+ S+ +G GS+ + S S +
Sbjct: 1066 SGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGAD 1125

Query: 2106 SVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSI 2155
SV M+ ++ + +DS + S L+ ++S T+ S +G+ I
Sbjct: 1126 SVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCI 1175



Score = 53.2 bits (127), Expect = 1e-08
Identities = 176/773 (22%), Positives = 305/773 (39%), Gaps = 2/773 (0%)

Query: 1408 SASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASL 1467
+ + +E + S + + T D+T S ST + + +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1468 SGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSST 1527
S SQ I+ S T+ +ST ++ ST +G+ ST + S + +SS
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1528 SLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSE 1587
+ ST M+ S+ + S + S+ + ST + S T+ GST +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1588 SDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTST 1647
SD T+ S + + S+ +G ST T+ +S T+ ST SD + ST T
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1648 STSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSM 1707
+ S + S + DS+ + GS + SD T+ S + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1708 SESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSD 1767
S ES + S + GS + GS + + S+ +G S
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1768 SVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSE 1827
+ S ++ S S S + S + GST T+ GS T+ S + +E
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1828 STSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTST 1887
S ++ S S + + S + GS + S+ T+ S + S +G ST T
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1888 SVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGSVSVST 1947
+ SDS + S + S+ + ST + S + S ST+ + S ++
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1948 STSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMSASTSS 2007
ST + S T+ S+ T+ S + S ST+ + S ++ ST + S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 2008 SQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTST 2067
+ S T+ SD S S S +G+ S++ + S T+ S + S T+
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 2068 SDSTS--GSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDSESV 2125
S T+ GSTST+ ++S +G GST + S+ + S +Q++SD T+ S S
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 2126 STSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTSTSESNSMHPS 2178
+ + S+ ++ ST T+ S +G S + S + S S + + S
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878



Score = 53.2 bits (127), Expect = 1e-08
Identities = 233/1050 (22%), Positives = 411/1050 (39%), Gaps = 2/1050 (0%)

Query: 759 TSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTSVSLSDSVSASKSLSTSES 818
TS + A ++ + S + D+ S S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 819 NSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNSNSTEKSESLSTSTSDSLRTSTS 878
+++ ST QS + GS + S I+ ST + + ST + T T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 879 LSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTSNAISTSTSLSESASTSDSISIS 938
+S M+ GS S +G ST + DS+ A ST + S+ + S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 939 NSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVS 998
A S T+ S T+ + S+ + ST + +ST T+G GS A S
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY--GSTQTAQKGSDL 336

Query: 999 TSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQSGSTSESLSDSQSTSDSDS 1058
T+ S T+ S I+ GS + S + ST + S+ + ST + +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1059 KSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSAS 1118
S ++ S T+ ST + S + GS + S + S + S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1119 TASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSDSTSLSTSESDSISESTSTSDS 1178
TA +S + ST + S + S T+ S + S + ST T+
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1179 ISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSESTSESVSSSTSESTSLS 1238
S + +ES I+ S ST+ + S + + S + S T+ S+ T+ S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1239 DSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTSLSMSTSTSLSNSTSLS 1298
+ S T+ S S+ +G S T++ S T+ + S + S+ T+ S ST+ +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1299 TSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDS 1358
S + S + S+ T+ ST + S T+ GSTS + +DS+ + S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1359 TSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTS 1418
T+ S+ + GST T+ S S S S + + + S ++ +S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1419 ESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQS 1478
S + + S ST+ + S + S T+ S T+ S ++ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 1479 ISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQ 1538
+ S ST+ + S+ ++ ST +G S T+ S ++ +S T+ STS +
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 1539 SSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDS 1598
S + S + S T+ ST + S T+ GSTS + ES + S
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 1599 QSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLS 1658
++ +ST +G S+ T+ S T+ STSM S + ST T+ S T+
Sbjct: 937 TASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGY 996

Query: 1659 DSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVS 1718
S + ST + GS + + + S + S+ S + S ++ SV
Sbjct: 997 GSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVL 1056

Query: 1719 ESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLS 1778
+ S S S+ + GS +++ + ES+ ++G++SM + S ++
Sbjct: 1057 TAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGY 1116

Query: 1779 VSTSLRSSESVSESDSLSDSKSTSGSTSTS 1808
ST + ++SV + + + ST T+
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTA 1146



Score = 52.8 bits (126), Expect = 2e-08
Identities = 229/1007 (22%), Positives = 395/1007 (39%), Gaps = 10/1007 (0%)

Query: 1163 TSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSES 1222
TS I + + +E + S ++ ES S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 1223 TSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTS 1282
+ ST T S + GST T+ +ST + ST T+ ++ST S T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1283 LSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTS 1342
S+ + ST SD T+ S + S+ + S + S+ +G S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1343 ESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDS 1402
+ S + ST + + S +G ST T+ S T+ S + S + +
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1403 NSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTS 1462
S + S+ + S T+ S T+ ST T+ SD T+ GST
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA------GYGSTG 392

Query: 1463 TSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASES 1522
T+ + S + S + S T+ ST + S +G ST T+ +S+ +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 1523 DSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTS 1582
S+ T+ DS+ + S +Q S + STST+ S++ ++ ST +G S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSL--IAGYGSTQTAGYGS 510

Query: 1583 ESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMS 1642
T+ ST T+ ++S + S S + + S+ + ST ++ S+ T+ S +
Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570

Query: 1643 LSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASIS 1702
S T+ ST + S S + S T+ S + ST T+ S + + S
Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630

Query: 1703 DSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGS 1762
S + ++S + S + +S +G S + S T+ S S + + S +
Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690

Query: 1763 QSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGS 1822
S + +S + S ++++ S+ S S ST+G+ S+ +G ST T+ S
Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750

Query: 1823 ESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGS 1882
+ S + S T+ S S +G+ S + ST + S + S +
Sbjct: 751 SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810

Query: 1883 ESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGS 1942
+ S + S+ST+ DS I+ S + +S +G ST T+ SD +G
Sbjct: 811 QERSDLTTGYGSTSTAG--ADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGY 868

Query: 1943 VSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMS 2002
S ST+ S I+G S + S T+ S +Q S +G STS + S
Sbjct: 869 GSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSL 928

Query: 2003 ASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDS 2062
+ S T+ S + S T+ S + S S + S + ST +
Sbjct: 929 IAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGY 988

Query: 2063 TSTSTSDSTSGSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDS 2122
ST T+ S T+ S + GS +T+ +DS+ ++ S+ S + + S
Sbjct: 989 QSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTL 1048

Query: 2123 ESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTST 2169
S S T+ S S S T+ GS I+ S+ ++G ST
Sbjct: 1049 ISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPEST 1095



Score = 52.1 bits (124), Expect = 3e-08
Identities = 193/856 (22%), Positives = 350/856 (40%), Gaps = 14/856 (1%)

Query: 1329 DSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTS 1388
S + ++ + + + S + + + + T +T S ST +
Sbjct: 97 TKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPT 156

Query: 1389 LSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDS 1448
++ + S + SQ + ST T+ S + Y S T+ ++ST + S
Sbjct: 157 QTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQ 216

Query: 1449 TSISKSTSQSGSTSTSASLSGSESESDSQSISTSASEST--SESASTSLSDSTSTSNSGS 1506
T+ +S+ +G ST + GS+ + S T+ +S+ + ST + S+ +G
Sbjct: 217 TAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 276

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
ST T+ S + S+ T+ +DS+ + S + S + ST T+ + S +
Sbjct: 277 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 336

Query: 1567 ASLSTSVSTSESGST------SESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
+ S T+ S+ S T+ DS+ T+ S T++ S + ST T+ +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1621 RSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S+ + S +T+ +S + ST T+ S + S T+ S+ +G S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1681 ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGS 1740
+ DS+ T+ S + SD + S + + S + S +G S +G
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1741 LSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKS 1800
S T+ +S+ ++ S S + + S ++ S+ + S+ ++ S + S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1801 TSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTS 1860
T+G ST T+GS S+ + GS + S + S T+ S +G S S + +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1861 LST------SDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDS 1914
S+ S + S+ ++ S + S + STS + DS I+ S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1915 MSTSDSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLS 1974
+ +S +G ST T+ SD SG S ST+ + S I+G S +S S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1975 DSMSQSQSTSTSASGSLSTSISTSMSMSASTSSSQSTSVSTSLSTSDSISDSTSISISGS 2034
S ++ S +G STS + + S + S T+ S+ T+ S T+ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 2035 QSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSISESLSTSGSGSTSVS 2094
+ S ST+ + S + ST + S T+ S T+ S+ + GS ST+
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 2095 DSTSMSESDSTSVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQS 2154
DS+ ++ ST + + S + S T+ S ST+ ES + GS
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 2155 ISDSTSTSMSGSTSTS 2170
+ ST M+G S+
Sbjct: 937 TASFKSTLMAGYGSSQ 952



Score = 51.3 bits (122), Expect = 5e-08
Identities = 200/886 (22%), Positives = 350/886 (39%), Gaps = 6/886 (0%)

Query: 797 KSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNS 856
++ + + SA + + ++ V+ + + S V+S + D +
Sbjct: 89 RAEVLHVGTKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATI 148

Query: 857 NSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTS 916
S + + + T + S ++ GS + ST I+G ST + +DST
Sbjct: 149 ESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTL 208

Query: 917 NAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLS 976
A ST + S+ + S S T+ S T+ S+ + ST +
Sbjct: 209 VAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 268

Query: 977 DSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMS 1036
DS+ T+G S + S + S + ++ + S + +S + S
Sbjct: 269 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 328

Query: 1037 TSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSAS 1096
T+Q GS + S T+ DS + + GST T+ S+ S T+ S
Sbjct: 329 TAQKGSDLTAGYGSTGTAGDDSSLI----AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384

Query: 1097 QSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVS 1156
+ S T+ +DS+ + ST ++ S + S + S T+ T T+
Sbjct: 385 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 444

Query: 1157 DSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLS 1216
DS+ ++ S + S+ + + ++ S + STS + +S+ S
Sbjct: 445 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQ 504

Query: 1217 ESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKS 1276
+ ST + ST + + SD + GSTST+ +NS+ + ST T+ S T
Sbjct: 505 TAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGY 564

Query: 1277 ESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTS 1336
S T+ S T+ ST + S S + S ++ S+ + S + S
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 1337 LSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMN 1396
+G S S + + SS + ST + S T+G ST T+ SD T+ S S +
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 1397 QSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTS 1456
S + + S + S T+ S T+ S TS STST+ + S + ST
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 1457 QSGSTSTSASLSGSESESDSQSISTS--ASESTSESASTSLSDSTSTSNSGSASTSTSLS 1514
+ S+ + GS + QS+ T+ S ST+ + S+ ++ ST +G S T+
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 1515 NSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVS 1574
S ++ S T+ STS + S + S + S T+ ST + S
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 1575 TSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTS 1634
T+ GSTS + +S + S + S +G ST T+ +S T+ STS
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 1635 TSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S + ST T++ S + S + S+ + GS S++
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMA 970



Score = 49.0 bits (116), Expect = 3e-07
Identities = 206/894 (23%), Positives = 361/894 (40%), Gaps = 6/894 (0%)

Query: 733 TDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTS 792
T G+ T + T + S ++ GSTQ + ST A S T+ GS + +
Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340

Query: 793 ASTSKSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDS 852
ST + S + S + +S+ + ST S ++ GS + + S
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 853 ISNSNSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLS 912
I+ ST+ + ST T+ T T+ S + GS + S+ I+G ST +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 913 DSTSNAISTSTSLSESASTSDSISISNSIANSQSA------STSKSDSQSTSISLSTSDS 966
DS+ A ST ++ S + S S A +S+ ST + ST + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
+ + S+ ++ STS + + S IA S T++ +S+ T+ S + GS +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S+ +G S + S+ + S + QS T+ STS + S
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
+ GS + +S+ + S T+ S TA S S + + S+ + ST +
Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
NS T+ S T+ S+ S STST+ + S I+ ST + S+ T+ S
Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + S ST+ + SS + S + S T+ S T+ S T+
Sbjct: 761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTS 1326
S ST+ S ++ S T+ S T+ S + +S + S ST+ S+
Sbjct: 821 GSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSL 880

Query: 1327 KSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTS 1386
+ ST T+ S + ST +++ SD T+ S S + S+ + S ++
Sbjct: 881 IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASF 940

Query: 1387 TSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLS 1446
S ++ + S+ + STS + +S + T + QS T+ S
Sbjct: 941 KSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQ 1000

Query: 1447 DSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGS 1506
+ S T+ GST+T+ + S + S S S T+ ST +S S +G
Sbjct: 1001 TAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGY 1060

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
S+ S S+ + S+ + S+ + S + + S ++ S+ T+ ST+
Sbjct: 1061 GSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTL 1120

Query: 1567 ASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
S + SV + + ++S T+ S + + S +G S T+ +D
Sbjct: 1121 ISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDC 1174



Score = 47.8 bits (113), Expect = 6e-07
Identities = 241/1091 (22%), Positives = 433/1091 (39%), Gaps = 10/1091 (0%)

Query: 907 TSASLSDSTSNAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDS 966
TSA A + + ++ S ++ + N T D+ S S + +
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
++T S T S ++G S + ST + ST +DS +G S +
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S + S + S + GST T+ S+ S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
T+ S + S T+ +DS+ + ST ++ S + S + S T+
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
T T+ DS+ ++ S + S+ + + ++ S + ST + + S
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 398

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + EST + ST + SD T+ GST T+ +S+ + ST T+
Sbjct: 399 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 458

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMS--TSDSIS 1324
+S+ T S T+ S T+ STS + S + S + S T+ S
Sbjct: 459 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGS 518

Query: 1325 TSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTS------TS 1378
T + + S + GSTS + ++S+ + S T+ S+ + GST T+ T+
Sbjct: 519 TQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTA 578

Query: 1379 TSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSES 1438
S T+ S S + S ++ S + ST T+ S T+ Y S ST+ ++S
Sbjct: 579 GYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 638

Query: 1439 TSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDS 1498
+ + S T+ S +G ST + GS+ + S ST+ ++S+ + S +
Sbjct: 639 SLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTA 698

Query: 1499 TSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTS 1558
S + ST + S S STS + + S+ + S ++ S + S
Sbjct: 699 GYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGS 758

Query: 1559 TSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTS 1618
T + STS +G+ S + ST T+ S T+ S + S T+
Sbjct: 759 TQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTT 818

Query: 1619 DSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMS 1678
STS + + S + S + S T+ ST + SD T+ S ST+G S
Sbjct: 819 GYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878

Query: 1679 VSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDS 1738
I+ ST T+ S + + S + S + S S + +S ++G S +
Sbjct: 879 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTA 938

Query: 1739 GSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDS 1798
S + S + S + S S++ DSS ++ S +++ S + S
Sbjct: 939 SFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGS 998

Query: 1799 KSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGS 1858
T+ +ST T+G ST+T+ + S ++ S S S T+ S +SG S+ +
Sbjct: 999 TQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTA 1058

Query: 1859 TSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTS 1918
S+ S S + S + S+ ++ +S+ + ++ SM I+ S +
Sbjct: 1059 GYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNR--SMLIAGKGSSQTAGY 1116

Query: 1919 DSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMS 1978
S+ ISG++S + ++G+ S T+ S ++G+ S + S T+ +D +
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCIL 1176

Query: 1979 QSQSTSTSASG 1989
+ S +G
Sbjct: 1177 MAGDRSKLTAG 1187


33  SAV0086  SAV0093  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0086   1       9        -0.063652  conserved hypothetical protein
SAV0087   1       10       0.040091   conserved hypothetical protein
SAV0088   0       12       -1.093858  similar to sulfide-quinone reductase
SAV0089   -1      13       -1.182170  probable dehydrogenases, nifR3 family
SAV0090   -1      14       -2.071292  hypothetical protein
SAV0091   0       13       -2.097007  hypothetical protein
SAV0092   0       14       -2.291803  hypothetical protein
SAV0093   2       15       -2.418737  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0086   PF01206       61     4e-14    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 60.5 bits (147), Expect = 4e-14
Identities = 20/70 (28%), Positives = 37/70 (52%)

Query: 118 KQFNYRGFQCPGPIVKISQEMKNIEVGDQIEVKVTDPGFPSDIKSWVKQTRHTLVKLDEN 177
+ + G CP PI+K + + + G+ + V TDPG D +S+ KQT H L++ E
Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65

Query: 178 NNGINAIIQK 187
+ + +++
Sbjct: 66 DGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0090   60KDINNERMP   26     0.007    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 26.4 bits (58), Expect = 0.007
Identities = 8/38 (21%), Positives = 16/38 (42%), Gaps = 4/38 (10%)

Query: 14 WDLFFAIPMFLLFAYL----PNYNFITIFLNIVIIIFF 47
W F + P+F L ++ N+ F I + ++
Sbjct: 332 WLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIM 369


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0091   LCRVANTIGEN   25     0.037    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 25.4 bits (55), Expect = 0.037
Identities = 12/38 (31%), Positives = 24/38 (63%)

Query: 9 DLFLNHVNSNAVKTRKMMGEYIIYYDSVVIGGLYDNRL 46
++F N V ++ ++ K + Y + D+++ GG YDN+L
Sbjct: 57 EVFANRVITDDIELLKKILAYFLPEDAILKGGHYDNQL 94


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0093   GPOSANCHOR    39     7e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 39.3 bits (91), Expect = 7e-05
Identities = 18/134 (13%), Positives = 50/134 (37%), Gaps = 5/134 (3%)

Query: 515 INSEKTSIEEQVYHLDNETLRDNKEIEDLDNRINYIVKQIETLNELIKSIKESNKGFINK 574
++ ++++ L E +++ D ++ +I+ L ++++ +G +N
Sbjct: 76 LSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNF 135

Query: 575 LKAMFNSEEDESYKDHNKEKQQLLTQQLELEKCKKNKHEDLVSKLKEKEKLIKQLTKVQL 634
+ K EK L ++ +LEK + + + + L + ++
Sbjct: 136 ST-----ADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 635 QLDELNSQLQELEA 648
+ EL L+
Sbjct: 191 RQAELEKALEGAMN 204


34  SAV0115  SAV0126  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0115   0       11       2.784854   lipoprotein
SAV0116   1       15       3.174745   probable cysteine synthase A protein
SAV0117   2       16       3.473641   probable ornithine cyclodeaminase protein
SAV0118   2       16       3.354600   conserved hypothetical protein
SAV0119   1       17       3.575234   hypothetical protein
SAV0120   -1      15       2.719635   hypothetical protein
SAV0121   0       14       3.316831   similar to siderophore biosynthesis protein
SAV0122   0       16       3.522319   hypothetical protein
SAV0123   -2      14       1.871772   probable diaminopimelate decarboxylase protein
SAV0124   -1      14       1.540040   hypothetical protein
SAV0125   -2      11       0.875680   hypothetical protein
SAV0126   -1      12       0.219160   acetoin#diacetylreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0115   FERRIBNDNGPP  70     7e-16    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 70.4 bits (172), Expect = 7e-16
Identities = 47/191 (24%), Positives = 78/191 (40%), Gaps = 38/191 (19%)

Query: 53 PKRVVTLYQGATDVAVSLGVKPVGAVES-----WTQKPKFEYIKNDLKDTKI-VGQEPAP 106
P R+V L ++ ++LG+ P G ++ W +P L D+ I VG P
Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPP-------LPDSVIDVGLRTEP 87

Query: 107 NLEEISKLKPDLIVASKVRNEKVYDQLSKIAPTVSTDTVFKFKD----------TTKLMG 156
NLE ++++KP +V S + L++IAP F F D + M
Sbjct: 88 NLELLTEMKPSFMVWS-AGYGPSPEMLARIAPGR----GFNFSDGKQPLAMARKSLTEMA 142

Query: 157 KALGKEKEAEDLLKKYDDKVAAFQKDAKAKY--KDAWPLKASVVNF-RADHTRIYA-GGY 212
L + AE L +Y+D F + K ++ + A PL + H ++
Sbjct: 143 DLLNLQSAAETHLAQYED----FIRSMKPRFVKRGARPL--LLTTLIDPRHMLVFGPNSL 196

Query: 213 AGEILNDLGFK 223
EIL++ G
Sbjct: 197 FQEILDEYGIP 207


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0117   SYCECHAPRONE  31     0.002    Gram-negative bacterial type III secretion SycE cha...
		>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature.

Length = 130

Score = 31.2 bits (70), Expect = 0.002
Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 1/33 (3%)

Query: 25 VDALTEALTAHAHNDFVQ-PLKPYLRQDPENGH 56
+D E T +HN F Q LKP L D GH
Sbjct: 54 LDNNDEKETLLSHNIFSQDILKPILSWDEVGGH 86


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0118   PF04183       316    e-103    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 316 bits (812), Expect = e-103
Identities = 119/527 (22%), Positives = 208/527 (39%), Gaps = 46/527 (8%)

Query: 79 RVSKQPLTAAEFWQTIANMNCDLSHEWEVARVEEGLTTAATQLAKQLSELDLASHPFV-- 136
R + +P+ A + + +S +++ T L + L++ +
Sbjct: 66 RCADEPVLAQTLLMQLKQVL-SMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINL 124

Query: 137 -MSEQFASLKDRPFHPLAKEKRGLREADYQVYQAELNQSFPLMVAAVKKTHMIHGDTANI 195
L P K +RG + + Y E +F L AVK+ HMI +
Sbjct: 125 NADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEM 184

Query: 196 DELENLTVPIKEQA----TDMLNDQGLSIDDYVLFPVHPWQYQHILPNVFATEISEKLVV 251
D + LT + Q + + + GL +++ PVHPWQ+Q + F + +E +V
Sbjct: 185 DIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQWQQKIATDFIADFAEGRMV 243

Query: 252 LLPLKFGD-YLSSSSMRSLIDIGAPYN-HVKVPFAMQSLGALRLTPTRYMKNGEQAEQLL 309
L +FGD +L+ S+R+L + +K+P + + R P RY+ G A + L
Sbjct: 244 SLG-EFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWL 302

Query: 310 RQLIEKDEALAKYVMV-CDETA-------WWSYMGQDNDIFKDQLGHLTVQLRKYPEVLA 361
+Q+ D L + V E A ++ + + +++ LG V R+ P
Sbjct: 303 QQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLG---VIWRENPCRWL 359

Query: 362 KNDTQQLVSMAALAANDRTLYQMICGKDNISKNDVMTLFEDIAQVFLKVTLSFM-QYGAL 420
K D + V MA L D + + S D T + +V + + +YG
Sbjct: 360 KPD-ESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPLYHLLCRYGVA 418

Query: 421 PELHGQNILLSFEDGRVQKCVLRD-HDTVRIYKPWLTAHQLSLPKYV--VREDTPNTLIN 477
HGQNI L+ ++G Q+ +L+D +R+ K SLP+ V V +
Sbjct: 419 LIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD-SLPQEVRDVTSRLSADYLI 477

Query: 478 EDLETFFAYFQTLAVSVNLYAIIDAIQDLFGVSEHELMSLLKQILKNEVATISWVTTDQL 537
DL+T V + I + GV E LL +L + + Q+
Sbjct: 478 HDLQTGHF--------VTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMK-----KHPQM 524

Query: 538 AVRHILFDKQTWPFKQILLP---LLY-QRDSGGGSMPSGLTTVPNPM 580
+ R LF +++L L + D G +P+ L + NP+
Sbjct: 525 SERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPL 571


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0119   TCRTETA       80     2e-18    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 79.9 bits (197), Expect = 2e-18
Identities = 71/372 (19%), Positives = 149/372 (40%), Gaps = 24/372 (6%)

Query: 13 ILWLSQFIAIAGLTVLVPLLPIYMASLQNLSVVEIQLWSGIAIAAPAVTTMIASPIWGKL 72
++ + + G+ +++P+LP + L + ++ GI +A A+ +P+ G L
Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRDL--VHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 73 GDKISRKWMVLRALLGLAVCLFLMALCTTPLQFVLVRLLQGLFGGVVDASSAFASAEAPA 132
D+ R+ ++L +L G AV +MA + R++ G+ G + A+ +
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDG 126

Query: 133 EDRGKVLGRLQSSVSAGSLVGPLIGGVTASILGFSALLMSIAVITFIVCIFGALKLIETT 192
++R + G + + G + GP++GG+ A + A + + + G L E+
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESH 185

Query: 193 HMPKSQTPNINKGIRRSFQCLLCTQQTCRFIIVGVLANFAMYGMLTALSPLASSVNHTAI 252
+ SF+ + V A A++ ++ + + +++
Sbjct: 186 KGERRPLRREALNPLASFR--------WARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237

Query: 253 DDR-----SVIGFLQSAF-WTASILSAPLWGRFNDKSYVKSVYIFATIACGCSAILQGLA 306
+DR + IG +AF S+ A + G + + + IA G IL A
Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297

Query: 307 TNIEFLMAARILQGLTYSAL--IQSVMFVVVNACHQ-QLKGTFVGTTNSMLVVGQIIGSL 363
T +L + +Q+++ V+ Q QL+G+ T+ + I+G L
Sbjct: 298 TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS----LTSIVGPL 353

Query: 364 SGAAITSYTTPA 375
AI + +
Sbjct: 354 LFTAIYAASITT 365


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0120   PF04183       301    4e-97    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 301 bits (772), Expect = 4e-97
Identities = 118/540 (21%), Positives = 212/540 (39%), Gaps = 63/540 (11%)

Query: 3 NKELIQHAAYAAIERILNEYFREENLYQVPPQNHQWSIQLSELE-TLTGEFRYWSAMGHH 61
N + + ++L+E E+ + + ++ I L + E W G
Sbjct: 2 NHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIW---GW- 57

Query: 62 MYHPEVWLIDGKSKKITTYKEAIARILQHMAQSADNQTA-VQQHMAQIMSDI--DNSIHR 118
ID ++ + +L + Q A V +HM + + + D + +
Sbjct: 58 ------LWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLK 111

Query: 119 TARYLQSNTIDYVEDRYIVSEQSLYLGHPFHPTPKSASGFSEADLEKYAPECHTSFQLHY 178
R L ++ + + Q L GHP K G+ + LE+YAPE +F+LH+
Sbjct: 112 ARRGLSASDL---INLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHW 168

Query: 179 LAVHQD-------------VLLTRYVEGKEDQVEKVLYQLADIDISEIPKDFILLPTHPY 225
LAV ++ LLT ++ +E ++Q +D +++ LP HP+
Sbjct: 169 LAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-----HNWLPLPVHPW 223

Query: 226 QINVLRQHPQYMQYSEQGLIKDLGVSGDLVYPTSSVRTVF--SKALNIYLKLPIHVKITN 283
Q ++ +G + LG GD S+RT+ S+ + +KLP+ + T+
Sbjct: 224 QWQQK-IATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTS 282

Query: 284 FIRTNDLEQIERTIDAAQVIASVKDE-----------VETPHFKLMFEEGYRALLPNPLG 332
R I A++ + V + P + EGY AL P
Sbjct: 283 CYRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYR 342

Query: 333 QTVEPEMDLLTNSAMIVREGIPNY-HADKDIHVLASLFETMPD-SPISKLAQVIEQSGLA 390
EM +I RE + D+ ++A+L E + P+ I++SGL
Sbjct: 343 YQ---EM-----LGVIWRENPCRWLKPDESPVLMATLMECDENNQPL--AGAYIDRSGLD 392

Query: 391 PEAWLECYLDRTLLPILKLFSNTGISLEAHVQNTLIELKDGIPDVCFVRDLEG-ICLSRT 449
E WL ++P+ L G++L AH QN + +K+G+P ++D +G + L +
Sbjct: 393 AETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKE 452

Query: 450 IATEKQLVPNVVAASSPVVYAHDEAWHRLKYYVVVNHLGHLVSTIGKATRNEVVLWQLVA 509
E +P V + + A D H L+ V L + + + E +QL+A
Sbjct: 453 EFPEMDSLPQEVRDVTSRLSA-DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLA 511


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0121   PF04183       514    e-179    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 514 bits (1324), Expect = e-179
Identities = 146/592 (24%), Positives = 257/592 (43%), Gaps = 40/592 (6%)

Query: 1 MNQTILNRVKTRVMHQLVSSLIYENIVVYKASYQDGVGHFTIEGHDSEYRFTAEKTHSFD 60
MN + V R++ +++S L YE + + A Q G + I +++RF AE+ +
Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQV--FHAESQ-GDDRYCINLPGAQWRFIAERG-IWG 56

Query: 61 RIRITSPIERVVGDEADTTTDYTQLLREAVFTFPKNDEKLEQFIVELLQTELKDTQSMQY 120
+ I + R AD LL + +D + + + +L T L D Q ++
Sbjct: 57 WLWIDAQTLRC----ADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKA 112

Query: 121 RESNPPATPETFN-DYEFYAMEGHQYHPSYKSRLGFTLSDNLKFGPDFVPNVKLQWLAID 179
R + N D + GH K R G+ ++ P++ +L WLA+
Sbjct: 113 RRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVK 172

Query: 180 KDKVETTVSRNVVVNEMLRQQVGDKTYEHFVQQIEASGKHVNDVEMIPVHPWQFEHVIQV 239
++ + + ++++L + + + F Q + +G N + +PVHPWQ++ I
Sbjct: 173 REHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWL-PLPVHPWQWQQKIAT 231

Query: 240 DLAEERLNGTVLWLGESDELYHPQQSIRTMSPIDTT-KYYLKVPISITNTSTKRVLAPHT 298
D + G ++ LGE + + QQS+RT++ +K+P++I NTS R +
Sbjct: 232 DFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRY 291

Query: 299 IENAAQITDWLKQIQQQDMYLKDE----LKTVFLGEVLGQSYLNTQLSPYKQTQVYGALG 354
I + WL+Q+ D L L G V + Y +PY+ ++ LG
Sbjct: 292 IAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEM---LG 348

Query: 355 VIWRENIYHMLIDEEDAIPFNALYASDKDGLPFIEKWIKQYG--SEAWTKQFLAVAIRPM 412
VIWREN L +E + L D++ P +I + G +E W Q V + P+
Sbjct: 349 VIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRSGLDAETWLTQLFRVVVVPL 408

Query: 413 IHMLYYHGIAFESHAQNMMLIHENGWPTRIALKDFHDGVRFKREHLSEAASHLTLKPMPE 472
H+L +G+A +H QN+ L + G P R+ LKDF +R +E E S +P+
Sbjct: 409 YHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDS------LPQ 462

Query: 473 AHKKVNSNSFIETDDERLVRDFLH---DAFFFINIAEIILFIEKQYGIDEQRQWQWVKDI 529
+ V S RL D+L F+ + I + + G+ E+R +Q + +
Sbjct: 463 EVRDVTS---------RLSADYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAV 513

Query: 530 IEAYQEAFPELNN-YQHFDLFEPTIQVEKLTTRRL-LSDSELRIHHVTNPLG 579
+ Y + P+++ + F LF P I L +L D + + N L
Sbjct: 514 LSDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLE 565


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0126   DHBDHDRGNASE  128    4e-38    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 128 bits (323), Expect = 4e-38
Identities = 66/250 (26%), Positives = 113/250 (45%), Gaps = 2/250 (0%)

Query: 5 KVALVTGGAQGIGFKIAERLVEDGFKVAVVDFNEEGAKAAALKLSSDGTKAIAIKADVSN 64
K+A +TG AQGIG +A L G +A VD+N E + L ++ A A ADV +
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 65 RDDVFNAVRQTAAQFGDFHVMVNNAGLGPTTPIDTITEEQFKTVYGVNVAGVLWGIQAAH 124
+ + + G ++VN AG+ I ++++E+++ + VN GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 125 EQFKKFNHGGKIINATSQAGVEGNPGLSLYCSTKFAVRGLTQVAAQDLASEGITVNAFAP 184
+ G I+ S ++ Y S+K A T+ +LA I N +P
Sbjct: 129 KYMMD-RRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 185 GIVQTPMMESIAVATAEEAGKPEAWGWEQFTSQIALGRVSQPEDVSNVVSFLAGKDSDYI 244
G +T M S+ + E F + I L ++++P D+++ V FL + +I
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSL-ETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 245 TGQTIIVDGG 254
T + VDGG
Sbjct: 247 TMHNLCVDGG 256


35  SAV0178  SAV0182  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0178   2       14       1.608224   similar to integral membrane protein LmrP
SAV0179   3       13       1.872859   similar to surfactin synthetase
SAV0180   2       14       2.555673   conserved hypothetical protein
SAV0181   2       15       2.131966   conserved hypothetical protein
SAV0182   1       15       2.097856   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0178   TCRTETA       32     0.004    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.004
Identities = 61/337 (18%), Positives = 127/337 (37%), Gaps = 33/337 (9%)

Query: 7 TLKVRLISNFLQLIITTAFIPFIALYLTDMLS----QSIVGIYLVGLVVLKFPLSIISGY 62
L V L + L + +P + L D++ + GI L +++F + + G
Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 63 LIEIFPKKLLVLIYQATMVIMLVFMGVFGSHQLWQI-IGFCVAYAIFTIVWGLQFPVMDT 121
L + F ++ ++L+ A + M + LW + IG VA + G V
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMAT--APFLWVLYIGRIVAG-----ITGATGAVAGA 118

Query: 122 LIMDAITEDVEHYIYKISYWMTNLSVAIGALLGGLMYGYSMLLLFLIAACIFLIVLFILY 181
I D D + + G +LGGLM G+S F AA + +
Sbjct: 119 YIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178

Query: 182 IWLPQDRNQVKQSDDKRHASRYQKLQIMNIFRSYKLVLKDRNYMLLISGFSIIMMGEFSI 241
LP+ ++ + + + L+ F + ++G+
Sbjct: 179 FLLPESHKGERRPLRREALNPLASFRWARGMTVVAA--------LMAVFFIMQLVGQVPA 230

Query: 242 SSYIAIRLKDQF--ETISIGSYDITGAKMLAILLMINTVVVILLTYSISKVVLKIDFKKA 299
+ + I +D+F + +IG LA +++++ ++T ++ ++ ++A
Sbjct: 231 ALW-VIFGEDRFHWDATTIGI-------SLAAFGILHSLAQAMITGPVAA---RLGERRA 279

Query: 300 LITGLLIYIVGYSGLTYLNQFGLLVVFMIIATVGEII 336
L+ G++ GY L + + + M++ G I
Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIG 316


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0179   NUCEPIMERASE  53     8e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 52.9 bits (127), Expect = 8e-09
Identities = 54/266 (20%), Positives = 101/266 (37%), Gaps = 55/266 (20%)

Query: 2046 NTLLTGATGFLGAYLIEALQGYSHRIYCFIRADNEEIAWYKLMTNLNDYFS----EETVE 2101
L+TGA GF+G ++ + L H++ + D NLNDY+ + +E
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQV---VGID-----------NLNDYYDVSLKQARLE 47

Query: 2102 MM----LSNIEVIVGDFECMDDVVLPENMDTIIH----AGARTDHFGDDDEFEKVNVQGT 2153
++ ++ + D E M D+ + + + R + + N+ G
Sbjct: 48 LLAQPGFQFHKIDLADREGMTDLFASGHFERVFISPHRLAVR-YSLENPHAYADSNLTGF 106

Query: 2154 VDVIRLAQQHH-ARLIYVSTISV-GTYFDIDTEDVTFSEADVYKGQLLTSPYTRSKFYSE 2211
++++ + + L+Y S+ SV G + FS D + S Y +K +E
Sbjct: 107 LNILEGCRHNKIQHLLYASSSSVYG-----LNRKMPFSTDDSVDHPV--SLYAATKKANE 159

Query: 2212 LKVLEAVNN-GLDGRIVRVGNLTSPYNGRWHM------RNIKTNRFSMVMNDLLQLDCIG 2264
L + GL +R + P+ GR M + + + V N
Sbjct: 160 LMAHTYSHLYGLPATGLRFFTVYGPW-GRPDMALFKFTKAMLEGKSIDVYNY-------- 210

Query: 2265 VSMAEMPVDFSFVDTTARQIVALAQV 2290
+M DF+++D A I+ L V
Sbjct: 211 ---GKMKRDFTYIDDIAEAIIRLQDV 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0180   ENTSNTHTASED  29     0.009    Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 29.2 bits (65), Expect = 0.009
Identities = 15/57 (26%), Positives = 27/57 (47%), Gaps = 5/57 (8%)

Query: 84 GQP-----IYVSLSYSYPYIVCVVDKEPVGIDIEKISQRLDWRTLVTCFSTNEAHQI 135
QP ++ S+S+ + V+ ++ +GIDIEKI + L ++ QI
Sbjct: 76 RQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATELAPSIIDSDERQI 132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0182   CARBMTKINASE  32     0.002    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 31.7 bits (72), Expect = 0.002
Identities = 23/84 (27%), Positives = 41/84 (48%), Gaps = 7/84 (8%)

Query: 155 INADTLAYFIASSLKAPIYV-LSNIAGVLIN-----DVVIPQLPLVDIHQYIEHGD-IYG 207
I+ D +A + A I++ L+++ G + + + ++ + ++ +Y E G G
Sbjct: 213 IDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREVKVEELRKYYEEGHFKAG 272

Query: 208 GMIPKVLDAKNAIENGCPKVIIAS 231
M PKVL A IE G + IIA
Sbjct: 273 SMGPKVLAAIRFIEWGGERAIIAH 296


36  SAV0221  SAV0230  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0221   1       19       1.321752   hexose phosphate transport protein
SAV0222   2       18       1.313656   hexose phosphate transport protein
SAV0223   1       15       0.404814   similar to two-component response regulator
SAV0224   0       15       0.465966   similar to two-component sensor histidine
SAV0225   -2      15       0.850928   similar to periplasmic-iron-binding protein
SAV0226   -2      14       2.187668   formate acetyltransferase
SAV0227   -2      12       2.401308   formate acetyltransferase activating enzyme
SAV0228   -1      13       2.800990   putative glycerophosphodiester
SAV0229   -1      14       4.107078   hypothetical protein
SAV0230   -1      13       4.189958   staphylocoagulase precursor
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0221   TCRTETB       36     1e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.0 bits (83), Expect = 1e-04
Identities = 29/166 (17%), Positives = 61/166 (36%), Gaps = 17/166 (10%)

Query: 48 FKAAQPFLKEEIGLSTLELGYIGLAFSITYGLGKTLLGYFVDGRNTKRIISFLLILSAIT 107
+ P + + ++ AF +T+ +G + G D KR LL+ I
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR----LLLFG-II 87

Query: 108 VLIMGFVLSYFGSVMGLLIVLWGLNGVFQSVGGPASYSTI----SRWAPRTKRGRYLGFW 163
+ G V+ + G L++ + Q G A + + +R+ P+ RG+ G
Sbjct: 88 INCFGSVIGFVGHSFFSLLI---MARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 164 NTSHNIGGAIAGGVALWGANVFFHGNVIGMFIFPSVIALLIGIATL 209
+ +G + + A+ + + P + +I + L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0223   HTHFIS        81     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.6 bits (199), Expect = 2e-19
Identities = 42/169 (24%), Positives = 72/169 (42%), Gaps = 12/169 (7%)

Query: 3 KVVICDDERIIREGLKQIIPWGDYHFNTIYTAKDGVEALSLIQQHQPELVITDIRMPRKN 62
+++ DD+ IR L Q + Y + + I +LV+TD+ MP +N
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY---DVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 63 GVDLLNDI--ALLDCNVIILSSYDDFEYMKAGIQHHVLDYLLKPVDHAQLEVILGRLVRT 120
DLL I A D V+++S+ + F + DYL KP D L ++G + R
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFD---LTELIGIIGRA 118

Query: 121 LLEQQSQNGRSLASCHDAFQPLLKVEYDDYYVNQIVDQIKQSYQTKVTV 169
L E + + + D PL+ + +I + + QT +T+
Sbjct: 119 LAEPKRRPSKLEDDSQD-GMPLVG---RSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0224   PF06580       147    6e-42    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 147 bits (372), Expect = 6e-42
Identities = 55/226 (24%), Positives = 109/226 (48%), Gaps = 16/226 (7%)

Query: 288 YIYDLFESNEQLIHSIEHTERRLRDIQLKEIERQFQPHFLFNTMQTIQYLITLSPKLAQT 347
+ + F++ +Q ++ QL ++ Q PHF+FN + I+ LI P A+
Sbjct: 136 FGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKARE 195

Query: 348 VVQQLSQMLRYSLR-TNSHTVELNEELNYIEQYVAIQNIRFDDMIKLHIESSEEARHQTI 406
++ LS+++RYSLR +N+ V L +EL ++ Y+ + +I+F+D ++ + + +
Sbjct: 196 MLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV 255

Query: 407 GKMMLQPLIENAIKHGRDTESLDITIRLTLARQN--LHVLVCDNGIGMSSSRLQYVRQSL 464
M++Q L+EN IKHG I L + N + + V + G L+ ++S
Sbjct: 256 PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA----LKNTKES- 310

Query: 465 NNDVFDTKHLGLNHLHNKAMIQYGSHARLHIFSKRNQGTLICYKIP 510
GL ++ + + YG+ A++ + K+ + + IP
Sbjct: 311 -------TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVL-IP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0226   SHAPEPROTEIN  32     0.006    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 32.4 bits (74), Expect = 0.006
Identities = 18/54 (33%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 257 AYLAAIKEQNGAAMSLGRTSTFLDIYAERDLKAGVITESEV-QEIIDHFIMKLR 309
+AA+ A LGRT +I A R +K GVI + V ++++ HFI ++
Sbjct: 50 KSVAAVGHD--AKQMLGRTPG--NIAAIRPMKDGVIADFFVTEKMLQHFIKQVH 99


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0230   IGASERPTASE   32     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.0 bits (72), Expect = 0.009
Identities = 37/215 (17%), Positives = 66/215 (30%), Gaps = 16/215 (7%)

Query: 249 ETKQNRPNSITKYDPTKHNFKEKSENKPNFDKLVEETKKAVKEADESWKNKTVKKYEETV 308
E+K N + T N + E K N + + A ++ T K TV
Sbjct: 1047 ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV 1106

Query: 309 TKSPVVKEEKKVEEPQLPKVGNQQEVKTTAGKAEETTQPVAQPLVKIPQETIYGETVKGP 368
K +E+ KVE + +V + + ET QP A+P TV
Sbjct: 1107 EK----EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP------ARENDPTVNIK 1156

Query: 369 EYPTMENKTLQGEIVQGPDFLTMEQNRPSLSDNYTQPTTPNPILEGLEGSSSKLEIKPQG 428
E + N + Q P T ++++ T T + + + + +P
Sbjct: 1157 EPQSQTNT--TADTEQ-PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA--TTQPTV 1211

Query: 429 TESTLKGIQGESSDIEVKPQATETTEASQYGPRPQ 463
+ + V+ A+
Sbjct: 1212 NSESSNKPK-NRHRRSVRSVPHNVEPATTSSNDRS 1245


37  SAV0421  SAV0433  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV0421   -3      14       -1.305710  putative nucleoside-diphosphate-sugar epimerase
SAV0422   1       15       -2.373553  exotoxin 6
SAV0423   1       14       -2.540353  exotoxin 7
SAV0424   1       15       -2.524454  exotoxin 8
SAV0425   0       17       -3.343438  exotoxin 10
SAV0426   1       16       -0.863801  exotoxin 11
SAV0427   0       16       -1.378472  exotoxin 12
SAV0428   -1      14       -1.181689  exotoxin 13
SAV0429   2       11       -1.018756  exotoxin 14
SAV0430   1       10       -0.523393  hypothetical protein
SAV0431   2       8        -1.131205  probable type I site-specific deoxyribonuclease
SAV0432   8       14       -3.516278  probable restriction modification system
SAV0433   7       12       -3.282493  exotoxin 15
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0421   NUCEPIMERASE  30     0.009    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 30.1 bits (68), Expect = 0.009
Identities = 28/167 (16%), Positives = 61/167 (36%), Gaps = 32/167 (19%)

Query: 1 MNIMLTGATGHLGTHITNQAIANHIDHFHIGVRNVEKVPD----------DWRGKVSVRQ 50
M ++TGA G +G H++ + + H +G+ N+ D + +
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEA--GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK 58

Query: 51 LDYFNQESMVEAFK--GMDTVVFI-------PSIIHP-SFKRIPEV--ENLVYAAKQSGV 98
+D ++E M + F + V S+ +P ++ N++ + + +
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 99 AHIIFIG---YYADQHNNPFHMS-----PYFGYASRLLSTSGIDYTY 137
H+++ Y PF P YA+ + + +TY
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0422   TOXICSSTOXIN  95     3e-26    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 95.5 bits (237), Expect = 3e-26
Identities = 46/214 (21%), Positives = 84/214 (39%), Gaps = 11/214 (5%)

Query: 18 TGVITSNVQSVQAKTEVKQQSESELKHYYNKPVLERKNVTGYKYTEKGKDYIDVIVDNQY 77
T V S+ Q ++ + +L +Y+ N + + + + +
Sbjct: 25 TPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNS---EVLDNSLGSMRIKNTDGS 81

Query: 78 SQISLVGSDKDKFKDGDNSNIDVFILREGDSRQATN-----YSIGGVTKTNSQPFIDYIH 132
+ + S +D+ R S+ + + I GVT T P I
Sbjct: 82 ISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIE 139

Query: 133 TPILEIKKGKEEPQSSLYQIYKEDISLKELDYRLRERAIKQHGLYSNGLKQGQI-TITMK 191
P+ GK+ P + K+ +++ LD+ +R + + HGLY + K G ITM
Sbjct: 140 LPLKVKVHGKDSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMN 199

Query: 192 DGKSHTIDLSQKLEKERMGDSIDGRQIQKILVEM 225
DG ++ DLS+K E I+ +I+ I E+
Sbjct: 200 DGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0423   TOXICSSTOXIN  85     3e-22    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 85.1 bits (210), Expect = 3e-22
Identities = 38/203 (18%), Positives = 76/203 (37%), Gaps = 22/203 (10%)

Query: 37 ISENSKKLKAYYTQPSIEYKNVTGYISFIQPSIKFMNIIDGNSVNNLALIGKDKQHYHTG 96
++N K L +Y+ S + N + S+ M I + + +L +
Sbjct: 42 TNDNIKDLLDWYSSGSDTFTN----SEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFT 97

Query: 97 VHRNLNIFYVN-----EDKRFEGAKYSIGGITSANDKA--VDLIAEARVIKADHIGEYDY 149
+++ + I G+T+ ++L + +V D +Y
Sbjct: 98 KGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGP 157

Query: 150 DFFPFKIVKEAMSLKEIDFKLRKYLIDNYGLYGEMST----GKITVKKKYYGKYTFELDK 205
F K+ +++ +DF++R L +GLY KIT+ Y +L K
Sbjct: 158 KF-----DKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDG--STYQSDLSK 210

Query: 206 KLQEDRMSDVINVTDIDRIEIKV 228
K + + IN+ +I IE ++
Sbjct: 211 KFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0424   TOXICSSTOXIN  94     2e-24    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 93.6 bits (232), Expect = 2e-24
Identities = 44/223 (19%), Positives = 78/223 (34%), Gaps = 21/223 (9%)

Query: 140 TPQPMQSTKSDTPQSPTIKQAQTDMTPKYEDLRAYYTKPSFEFEKQFGFLLKPWTTVRFM 199
TP P+ S + IK A+ +DL +Y+ S F L ++R
Sbjct: 25 TPVPLSSNQ-------IIKTAKASTNDNIKDLLDWYSSGSDTF-TNSEVLDNSLGSMRIK 76

Query: 200 NVIPNRFIYKIALVGKDEKKYKDGPYDNIDV-----FIVLEDNKYQLKKYSVGGITKTNS 254
N + + + + +D+ ++ + + G+T T
Sbjct: 77 NTDGSI---SLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEK 133

Query: 255 KKVDHKAELSVTKKDNQGMISRDVSEYMITKEEISLKELDFKLRKQLIEKHNLYGNM--G 312
+ L V + K+++++ LDF++R QL + H LY +
Sbjct: 134 LPTPIELPLKVKVHGKDSPLKYG---PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKT 190

Query: 313 SGTIVIKMKNGGKYTFELHKKLQEHRMADVIEGTNIDKIEVNI 355
G I M +G Y +L KK + + I I IE I
Sbjct: 191 GGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0425   TOXICSSTOXIN  135    2e-41    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 135 bits (340), Expect = 2e-41
Identities = 49/201 (24%), Positives = 73/201 (36%), Gaps = 14/201 (6%)

Query: 39 NVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKNRKFTRVQIFGKDIERFKAR 98
+ +I DL D+YS S N S G + + IF
Sbjct: 41 STNDNIKDLLDWYSSGSDTFTNSEVLDNSLGS---MRIKNTDGSISLIIFPSPYYSPAFT 97

Query: 99 KNPGLDI-----FVVKEAENRNGTVFSYGGVTKKNQDAYYDYINAPRFQIKRDEGDGIAT 153
K +D+ + F GVT + I P +K D
Sbjct: 98 KGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELPLK-VKVHGKDSPLK 154

Query: 154 YGRVHYIYKEEISLKELDFKLRQYLIQNFDLYKKFPKDSKI-KVIMKDGGYYTFELNKKL 212
YG K+++++ LDF++R L Q LY+ K K+ M DG Y +L+KK
Sbjct: 155 YG--PKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKF 212

Query: 213 QTNRMSDVIDGRNIEKIEANI 233
+ N I+ I+ IEA I
Sbjct: 213 EYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0426   TOXICSSTOXIN  192    1e-63    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 192 bits (488), Expect = 1e-63
Identities = 49/197 (24%), Positives = 82/197 (41%), Gaps = 16/197 (8%)

Query: 42 DIKDLYRYYSSESFEFSNI--------SGKVENYNGSNVVRFNQEKQNHQLFLLGKDKDK 93
+IKDL +YSS S F+N S +++N +GS + F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 94 YKKGLEGQNVFVVKELIDPNGRLSTVGGVTKKNNKSSETNTHLFVNKVYGGNLDASIDSF 153
K L + + + + GVT + L V KV+G +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKV-KVHGKDSPLKYGP- 157

Query: 154 LINKEEVSLKELDFKIRKQLVEKYGLYKGTTKYGKI-TINLKDEKKEVIDLGDKLQFERM 212
+K+++++ LDF+IR QL + +GLY+ + K G I + D DL K ++
Sbjct: 158 KFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 213 GDVLNSKDIQNIAVTIN 229
+N +I+ I IN
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0427   TOXICSSTOXIN  125    2e-37    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 125 bits (314), Expect = 2e-37
Identities = 47/199 (23%), Positives = 73/199 (36%), Gaps = 19/199 (9%)

Query: 42 DTNKLHQYYSGPSYELTNV--------SGQSQGYYDSNVLLFNQQNQKFQVFLLGKDENK 93
+ L +YS S TN S + + S L+ F G+
Sbjct: 45 NIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGE---- 100

Query: 94 YKEKTHGLDVFAVPELVDLDGRIFSVSGVTKKNVKSIFESLRTPNLLVKKIDDKDGFSID 153
K + + F +SGVT L L K+ KD +
Sbjct: 101 -KVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP----LKVKVHGKDSP-LK 154

Query: 154 EFFFIQKEEVSLKELDFKIRKLLIKKYKLYEGSA-DKGRIVINMKDENKYEIDLSDKLDF 212
K+++++ LDF+IR L + + LY S G I M D + Y+ DLS K ++
Sbjct: 155 YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEY 214

Query: 213 ERMADVINSEQIKNIEVNL 231
IN ++IK IE +
Sbjct: 215 NTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0428   TOXICSSTOXIN  130    1e-39    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 130 bits (329), Expect = 1e-39
Identities = 39/197 (19%), Positives = 69/197 (35%), Gaps = 15/197 (7%)

Query: 43 INMLHQYYSEESFESTNISVKSEDYYGSNVLNFNQRNKTFKVFLLGDDKNKY------KE 96
I L +YS S TN V + + + + + K
Sbjct: 46 IKDLLDWYSSGSDTFTNSEVLD---NSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKV 102

Query: 97 KTHGLDVFAVPELIDIKGGIYSVGGITKKNVRSVFGFVSNPSLQVKKVDAKHGFSINELF 156
+ + + + G+T + P L+VK F
Sbjct: 103 DLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLP--TPIELP-LKVKVHGKDSPLKYGPKF 159

Query: 157 FIQKEEVSLKELDFKIRKMLVEKYRLYKGAS-DKGRIVINMKDEKKYVIDLSEKLSFDRM 215
K+++++ LDF+IR L + + LY+ + G I M D Y DLS+K ++
Sbjct: 160 --DKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTE 217

Query: 216 FDVMDSKQIKNIEVNLN 232
++ +IK IE +N
Sbjct: 218 KPPINIDEIKTIEAEIN 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0429   TOXICSSTOXIN  193    4e-64    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 193 bits (491), Expect = 4e-64
Identities = 51/202 (25%), Positives = 92/202 (45%), Gaps = 10/202 (4%)

Query: 31 KQNQKSVNKHDKEALYRYYTGKTMEMKNISALKHGKNNLRFKFRGIKIQVLLPGNDKSKF 90
K + S N + K+ L Y +G + N L + ++R K I +++ +
Sbjct: 36 KTAKASTNDNIKDLLDWYSSG-SDTFTNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSP 94

Query: 91 QQRSYEGLDVFFVQEKRDKHD-----IFYTVGGVIQNNKTSGVVSAPILNISKEKGEDAF 145
E +D+ + K+ +H I + + GV K + P L + K G+D+
Sbjct: 95 AFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELP-LKV-KVHGKDSP 152

Query: 146 VKGYPYYIKKEKITLKELDYKLRKHLIEKYGLYKTISKDGRV-KISLKDGSFYNLDLRSK 204
+K Y K+++ + LD+++R L + +GLY++ K G KI++ DGS Y DL K
Sbjct: 153 LK-YGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKK 211

Query: 205 LKFKYMGEVIESKQIKDIEVNL 226
++ I +IK IE +
Sbjct: 212 FEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV0433   TOXICSSTOXIN  108    4e-31    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 108 bits (270), Expect = 4e-31
Identities = 47/225 (20%), Positives = 86/225 (38%), Gaps = 19/225 (8%)

Query: 16 LTTGMITTTAQPVKASTLEVRSQAT-------QDLSEYYKGRGFELTNVTGYKYG-NKVT 67
L T PV S+ ++ A +DL ++Y TN +
Sbjct: 15 LLLATTATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNSEVLDNSLGSMR 74

Query: 68 FIDNSQQIDVTLTGNE----KLTVKDDDEVSNVDVFVVREGSDKSAITTSIGGITKTNGT 123
+ I + + + T + +++ + S+ + I I G+T T
Sbjct: 75 IKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTE-- 132

Query: 124 QHKDTVQNVNLSVSKSTGQHTTSVTSEYYSIYKEEISLKELDFKLRKHLIDKHDLYKTEP 183
T + L V K G+ S K+++++ LDF++R L H LY++
Sbjct: 133 -KLPTPIELPLKV-KVHGK--DSPLKYGPKFDKKQLAISTLDFEIRHQLTQIHGLYRSSD 188

Query: 184 KDSKI-RITMKNGGYYTFELNKKLQPHRMGDTIDSRNIEKIEVNL 227
K +ITM +G Y +L+KK + + I+ I+ IE +
Sbjct: 189 KTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


38  SAV1016  SAV1023  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1016   -2      9        0.410718   multidrug resistance protein-related protein
SAV1017   -2      10       1.218680   similar to cell wall synthesis protein
SAV1018   -1      8        0.333140   UDP-N-acetylmuramoylalanyl-D-glutamate-2,
SAV1019   -2      10       -0.810131  hypothetical protein
SAV1020   -2      10       -0.807549  peptide chain release factor 3
SAV1021   -2      9        -1.194233  peptide chain release factor 3
SAV1022   -2      10       -1.288409  toxic anion resistance protein homologue
SAV1023   -1      10       -2.094196  serine protease HtrA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1016   TCRTETA  54     4e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 54.1 bits (130), Expect = 4e-10
Identities = 54/304 (17%), Positives = 115/304 (37%), Gaps = 19/304 (6%)

Query: 68 VIGFLLKKFGTKIVLTTGFILAFTSLFLVIWFPASPFVIIFSAMMLGIAVSPIWVI---M 124
V+G L +FG + VL A ++ P +V+ ++ GI + V +
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL-WVLYIGRIVAGITGATGAVAGAYI 120

Query: 125 LSSVEEDKRGKQMGYVYFSWLLGLLVGMVFMNLLIKVHPTRFAFMMSLVVLIAWILYYFV 184
+ D+R + G++ + G++ G V L+ P F + + + ++ F+
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 185 DVKLTNYNTRPVK-------AQLRQIVDVTKRHLLLFPGILLQ--GAAIAALVPILPTYA 235
+ RP++ A R +T L+ ++Q G AAL I +
Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI---FG 237

Query: 236 TKVINVSTIEYTVAIIIGGIGCAVSMLFLSKLIDNRSRNFMYGVILSGFILYMILIFTLS 295
+ +++ GI +++ ++ + R ++ G I L+
Sbjct: 238 EDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE--RRALMLGMIADGTGYILLA 295

Query: 296 MIVNIHIVWIIALAIGLMYGILLPAWNTFMARFIKSDEQEETWGVFNSIQGFGSMIGPLF 355
+ + I + + GI +PA ++R + + Q + G ++ S++GPL
Sbjct: 296 FATRGWMAFPIMVLLAS-GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLL 354

Query: 356 GGLI 359
I
Sbjct: 355 FTAI 358


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SAV1020   TCRTETOQM  49     9e-11    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 49.1 bits (117), Expect = 9e-11
Identities = 27/53 (50%), Positives = 32/53 (60%), Gaps = 7/53 (13%)

Query: 15 IISHPDAGKTTLTEKLLYFSGAIREAGTVKGKKTGKFGDQVTGNESLN-QRGV 66
+++H DAGKTTLTE LLY SGAI E G+V G T N L QRG+
Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDK------GTTRTDNTLLERQRGI 54


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SAV1021   TCRTETOQM  108    1e-27    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 108 bits (271), Expect = 1e-27
Identities = 74/343 (21%), Positives = 139/343 (40%), Gaps = 44/343 (12%)

Query: 3 LFKVCKMRGIPIFTFINKLDRVGKEPFELLDEIEETLNIETYPMNWPIGMGQSFFGIIDR 62
LF + GIP FINK+D+ G + + +I+E L+ E +I +
Sbjct: 112 LFHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEI---------------VIKQ 156

Query: 63 KSKTIEPFRDEENILHLNDDFELEEDHAITNDSAF-EQAIEELMLVEEAGEAFDNDALLS 121
K + N+ N + D I + E+ + L E ++ +
Sbjct: 157 KVE------LYPNMCVTNFTESEQWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFHN 210

Query: 122 GDLTPVFFGSALANFGVQNFLNAYVDFAPMPNARQTKEDVEVSPFDDSFSGFIFKIQANM 181
L PV+ GSA N G+ N + + R E G +FKI+
Sbjct: 211 CSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTHRGQSE----------LCGKVFKIE--Y 258

Query: 182 DPKHRDRIAFMRVVSGAFERGMDVTLQRTNKKQKITRSTSFMADDKETVNHAVAGDIIGL 241
K R R+A++R+ SG V + K KIT + + + ++ A +G+I+ L
Sbjct: 259 SEK-RQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSINGELCKIDKAYSGEIVIL 316

Query: 242 YDTG---NYQIGDTLVGGKQTYSFQDLPQFTPEIFMKVSAKNVMKQKHFHKGIEQLVQEG 298
+ N +GDT + ++ LP + V +++ + ++
Sbjct: 317 QNEFLKLNSVLGDTKLLPQRERIENPLPL----LQTTVEPSKPQQREMLLDALLEISDSD 372

Query: 299 -AIQYYKTLHTNQIILGAVGQLQFEVFEHRMKNEYNVDVVMEP 340
++YY T++IIL +G++Q EV ++ +Y+V++ ++
Sbjct: 373 PLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKE 415


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1023   V8PROTEASE  38     1e-04    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 37.7 bits (87), Expect = 1e-04
Identities = 35/159 (22%), Positives = 53/159 (33%), Gaps = 35/159 (22%)

Query: 499 IVTNAHVV----GDKENQKITFSNNKS--------VVGKVLGKDKWSDLAVVK------A 540
++TN HVV GD K S ++ DLA+VK
Sbjct: 114 LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQN 173

Query: 541 TSSDSSVKEIAIGDSNNLVLGEPILVVGNPLGVDF-KGTVTEGIISGLNRNVPIDFDKDN 599
VK + ++ + + I V G P ++G I+ L
Sbjct: 174 KHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGE--------- 224

Query: 600 KYDMLMKAFQIDASVNPGNSGGAVVNREGKLIGVVAAKI 638
A Q D S GNSG V N + ++IG+ +
Sbjct: 225 -------AMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGV 256


39  SAV1163  SAV1170  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1163   2       17       -2.177724  alpha-hemolysin precursor
SAV1164   0       14       -2.268740  hypothetical protein
SAV1165   0       13       -1.609964  hypothetical protein
SAV1166   1       11       -0.611707  hypothetical protein
SAV1167   1       11       -0.492101  hypothetical protein
SAV1168   0       11       -0.222209  hypothetical protein
SAV1169   0       12       -0.034859  ornithine carbamoyltransferase
SAV1170   0       15       -0.615032  putative carbamate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1163   BICOMPNTOXIN  313    e-109    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 313 bits (803), Expect = e-109
Identities = 72/318 (22%), Positives = 144/318 (45%), Gaps = 24/318 (7%)

Query: 9 VTTTLLLGSILMNPVANAADSDINIKTGTTDIGSNTTVKTGDLVTYDKEN--GMHKKVFY 66
+TTTL + L+ P+AN + T DIG + ++ N G+ + + +
Sbjct: 7 LTTTLSVS--LLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQF 64

Query: 67 SFIDDKNHNKKLLVIRTKGTIAGQYRVYSEEGANKS-GLAWPSAFKVQLQLPDNEVAQIS 125
F+ DK +NK L+++ +G I+ + Y+ + N + WP + + L+ D V+ I
Sbjct: 65 DFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLI- 123

Query: 126 DYYPRNSIDTKEYMSTLTYGFNGNVTGDDTGKIGGLIGANVSIGHTLKYVQPDFKTILES 185
+Y P+N I++ TL Y GN + +GG N S ++ Y Q ++ + +E
Sbjct: 124 NYLPKNKIESTNVSQTLGYNIGGNFQSAPS--LGGNGSFNYS--KSISYTQQNYVSEVEQ 179

Query: 186 PTDKKVGWKVIFNNMVNQNWGPYDRDSWNPVYGNQLFMKTRNGSMKAAENFLDPNKASSL 245
K V W V N+ ++ + + LF+ + S + F+ ++ L
Sbjct: 180 QNSKSVLWGVKANSFATESG-------QKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPL 232

Query: 246 LSSGFSPDFATVITMDRKASKQQTNIDVIYERVRD-----DYQLHWTSTNWKGTNTKDKW 300
+ SGF+P F ++ + K S + ++ Y R D H+ ++ G + +
Sbjct: 233 VQSGFNPSFIATVSHE-KGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAF 291

Query: 301 TDRS-SERYKIDWEKEEM 317
+R+ + +Y+++W+ E+
Sbjct: 292 VNRNYTVKYEVNWKTHEI 309


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1166   TOXICSSTOXIN  50     2e-09    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 49.7 bits (118), Expect = 2e-09
Identities = 53/223 (23%), Positives = 89/223 (39%), Gaps = 12/223 (5%)

Query: 1 MSKNITKNIILTTTLLLLGTVLPQNQKPVFSFYSEAKAYSIGQDETNINELIKYYTQPHF 60
M+K + N + + LLL T P+ S A + D NI +L+ +Y+
Sbjct: 1 MNKKLLMNFFIVSPLLLATTATDFTPVPLSSNQIIKTAKASTND--NIKDLLDWYSSGSD 58

Query: 61 SFSNKWLYQYDNGNIYVELKRYSWSAHISLWGAESWGNINQLKDRYVDVFGLKD-KDTDQ 119
+F+N DN + +K S + ++ + + + K VD+ + K
Sbjct: 59 TFTN--SEVLDNSLGSMRIKNTDGSISLIIFPSP-YYSPAFTKGEKVDLNTKRTKKSQHT 115

Query: 120 LWWSYRETFTGGVTPAAK-PSDKTYNLFVQYKDKLQTIIGAHKIYQGNKPVLTLKEIDFR 178
+Y GVT K P+ L V+ K + K +K L + +DF
Sbjct: 116 SEGTYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGPKF---DKKQLAISTLDFE 172

Query: 179 AREALIKNKILY-TENRNKGKLKIT-GGGNNYTIDLSKRLHSD 219
R L + LY + ++ G KIT G+ Y DLSK+ +
Sbjct: 173 IRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYN 215


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1167   TOXICSSTOXIN  57     4e-12    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 57.4 bits (138), Expect = 4e-12
Identities = 56/230 (24%), Positives = 89/230 (38%), Gaps = 19/230 (8%)

Query: 16 LLLGTAFTQFPNTPINSSSEAKAYYINQNETNVNELTKYYSQKYLTFSNSTLWQKDNGTI 75
LLL T T F P++S+ K + N+ N+ +L +YS TF+NS + G
Sbjct: 15 LLLATTATDFTPVPLSSNQIIKTAKASTND-NIKDLLDWYSSGSDTFTNSEVLDNSLG-- 71

Query: 76 HATSLQFSWY--SHIQVYGPESWGNINQLRNKSVDIFGI---KDQETIDSFALSQETFTG 130
S++ S + P + + + + VD+ K Q T + + +
Sbjct: 72 ---SMRIKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQI--S 126

Query: 131 GVTPA-ATSNDKHYKLNVTYKDKAETFTGGFPVYEGNKPVLTLKELDFRIRQTLIKSKKL 189
GVT L V K + + + +K L + LDF IR L + L
Sbjct: 127 GVTNTEKLPTPIELPLKV--KVHGKDSPLKYG-PKFDKKQLAISTLDFEIRHQLTQIHGL 183

Query: 190 YNNSYNKGQI-KITGTDNN-YTIDLSKRLPSTDANRYVKKPQNAKIEVIL 237
Y +S G KIT D + Y DLSK+ + + IE +
Sbjct: 184 YRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEI 233


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1168   TOXICSSTOXIN  62     1e-13    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 61.6 bits (149), Expect = 1e-13
Identities = 62/222 (27%), Positives = 96/222 (43%), Gaps = 17/222 (7%)

Query: 2 KKNIMNKLVLSTALLLLGTTSTQLPKTPISFSSEAKAYNISENETNINELIKYYTQPHFS 61
KK +MN ++S LLL TT+T P+S + K S N+ NI +L+ +Y+ +
Sbjct: 3 KKLLMNFFIVSP--LLLATTATDFTPVPLSSNQIIKTAKASTND-NIKDLLDWYSSGSDT 59

Query: 62 LSGKWLWQKPNGSIHATLQTWVWYSHIQVFGSESWGNINQLRNKYVDIFGT---KDEDTV 118
+ + GS+ ++ + +F S + + + + VD+ K + T
Sbjct: 60 FTNSEVLDNSLGSMR--IKNTDGSISLIIFPSP-YYSPAFTKGEKVDLNTKRTKKSQHTS 116

Query: 119 EGYWTYDETFTGGVTPA-ATSSDKPYRLFLKYSDKQQTIIGGHEFYKGNKPVLTLKELDF 177
EG TY GVT + L +K K + G +F +K L + LDF
Sbjct: 117 EG--TYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGPKF---DKKQLAISTLDF 171

Query: 178 RIRQTLIKNKKLYNGEFNKGQI-KIT-ADGNNYTIDLSKKLK 217
IR L + LY G KIT DG+ Y DLSKK +
Sbjct: 172 EIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFE 213


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1170   CARBMTKINASE  388    e-138    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 388 bits (998), Expect = e-138
Identities = 144/311 (46%), Positives = 210/311 (67%), Gaps = 7/311 (2%)

Query: 3 KIVVALGGNALGK-----SPQEQLELVKNTAKSLVGLITKGHEIVISHGNGPQVGSINLG 57
++V+ALGGNAL + S +E ++ V+ TA+ + +I +G+E+VI+HGNGPQVGS+ L
Sbjct: 4 RVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLLH 63

Query: 58 LNYAAEHNQGPAFPFAECGAMSQAYIGYQLQESLQNELHSIGMDKQVVTLVTQVEVDEND 117
++ PA P GAMSQ +IGY +Q++L+NEL GM+K+VVT++TQ VD+ND
Sbjct: 64 MDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKND 123

Query: 118 PAFNNPSKPIGLFYNKEEAEQIQKEKGFIFVEDAGRGYRRVVPSPQPISIIELESIKTLI 177
PAF NP+KP+G FY++E A+++ +EKG+I ED+GRG+RRVVPSP P +E E+IK L+
Sbjct: 124 PAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLV 183

Query: 178 KNDTLVIAAGGGGIPVIREQHDGFKGIDAVIDKDKTSALLGANIQCDQLIILTAIDYVYI 237
+ +VIA+GGGG+PVI E KG++AVIDKD L + D +ILT ++ +
Sbjct: 184 ERGVIVIASGGGGVPVILE-DGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242

Query: 238 NFNTENQQPLKTTNVDELKRYIDENQFAKGSMLPKIEAAISFIENNPKGSVLITSLNELD 297
+ TE +Q L+ V+EL++Y +E F GSM PK+ AAI FIE + ++ I L +
Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAI-IAHLEKAV 301

Query: 298 AALEGKVGTVI 308
ALEGK GT +
Sbjct: 302 EALEGKTGTQV 312


40  SAV1231  SAV1236  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1231   2       11       -0.357400  3-oxoacyl-#acyl-carrier protein reductase
SAV1232   1       11       -0.427788  HmrB protein
SAV1233   1       11       -0.347032  RNase III
SAV1234   1       12       -0.514758  chromosome segregation SMC protein
SAV1235   -1      14       0.497507   signal recognition particle
SAV1236   1       15       0.739644   conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1231   DHBDHDRGNASE  145    1e-44    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 145 bits (366), Expect = 1e-44
Identities = 85/250 (34%), Positives = 136/250 (54%), Gaps = 13/250 (5%)

Query: 5 KSALVTGASRGIGRSIALQLAEEGYNV-AVNYAGSKEKAEAVVEEIKAKGVDSFAIQANV 63
K A +TGA++GIG ++A LA +G ++ AV+Y + EK E VV +KA+ + A A+V
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDY--NPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 64 ADADEVKAMIKEVVSQFGSLDVLVNNAGITRDNLLMRMKEQEWDDVIDTNLKGVFNCIQK 123
D+ + + + + G +D+LVN AG+ R L+ + ++EW+ N GVFN +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 124 ATPQMLRQRSGAIINLSSVVGAVGNPGQANYVATKAGVIGLTKSAARELASRGITVNAVA 183
+ M+ +RSG+I+ + S V A Y ++KA + TK ELA I N V+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 184 PGFIVSDMTDAL--SDELKEQML--------TQIPLARFGQDTDIANTVAFLASDKAKYI 233
PG +DM +L + EQ++ T IPL + + +DIA+ V FL S +A +I
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 234 TGQTIHVNGG 243
T + V+GG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1232   ACRIFLAVINRP  26     0.012    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 26.3 bits (58), Expect = 0.012
Identities = 10/42 (23%), Positives = 17/42 (40%), Gaps = 2/42 (4%)

Query: 33 GADSLDIAELVMELEDEFGTEIPDEEAEKINTVGDAVKFINS 74
GA++LD A+ + E P + K+ D F+
Sbjct: 296 GANALDTAKAIKAKLAELQPFFP--QGMKVLYPYDTTPFVQL 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1234   GPOSANCHOR  55     2e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 54.7 bits (131), Expect = 2e-09
Identities = 53/326 (16%), Positives = 119/326 (36%), Gaps = 23/326 (7%)

Query: 170 KYKKRKAESLNKLDQTEDNLTRVEDILYDLEGRV-EPLKEEAAIAKEYKTLSHQMKHSDI 228
K K +E +K+ + E +E L + + E L+ + +
Sbjct: 103 KNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE- 161

Query: 229 VVTVHDIDQYTNDNRQLDQRLNDLQGQQANKEADKQRLSQQIQQYKG-------KRHQLD 281
++ N + ++ L+ ++A EA + L + ++ K L+
Sbjct: 162 ----KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLE 217

Query: 282 NDVESLNYQLVKATEAFEKYTGQLNVLEERKKNQSETNARYEEEQENLMELLENISNEIS 341
+ +L + +A E + K A E Q L + LE N +
Sbjct: 218 AEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFST 277

Query: 342 EAQDTYKSLKSKQKELNAVIRELEEQLYVSD----------EAHDEKLEEIKNEYYTLMS 391
K+L++++ L A +LE Q V + +A E ++++ E+ L
Sbjct: 278 ADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337

Query: 392 EQSDVNNDIRFLKHTIEENEAKKSRLDSRLVEVFEQLKDIQGQIKTTKKEYQQTNKELSA 451
+ + L+ ++ + K +L++ ++ EQ K + ++ +++ + +
Sbjct: 338 QNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ 397

Query: 452 VDKEIKNIEKDLTDTKKAQNEYEEKL 477
V+K ++ L +K E EE
Sbjct: 398 VEKALEEANSKLAALEKLNKELEESK 423



Score = 53.1 bits (127), Expect = 5e-09
Identities = 31/315 (9%), Positives = 94/315 (29%), Gaps = 18/315 (5%)

Query: 177 ESLNKLDQTEDNLTRVEDILYDLEGRVEPLKEEAAIAKEYKTLSHQMKHSDIVVTVHDID 236
E +K + + L L ++ +E + + I
Sbjct: 57 ERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQ 116

Query: 237 QYTNDNRQLDQRLNDLQGQQANKEADKQRLSQQIQQYKGKRHQLDNDVESLNYQLVKATE 296
+ L++ L A + L + ++ L+ +E +
Sbjct: 117 ELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 176

Query: 297 AFEKYTGQLNVLEERKKNQSETNARYEEEQENLMELLENISNEISEAQDTYKSLKSKQKE 356
+ + LE R+ + ++ + E + L+ +
Sbjct: 177 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEG 236

Query: 357 LNAVIRELEEQLYVSDEAHDEKLEEIKNEYYTLMSEQSDVNNDIRFLKHTIEENEAKKSR 416
++ + + + ++ L N I+ EA+K+
Sbjct: 237 AMNFSTADSAKI----KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 292

Query: 417 LDSRLVEVFEQLKDIQGQIKTTKK--------------EYQQTNKELSAVDKEIKNIEKD 462
L++ ++ Q + + ++ ++ E+Q+ ++ + +++ +D
Sbjct: 293 LEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352

Query: 463 LTDTKKAQNEYEEKL 477
L +++A+ + E +
Sbjct: 353 LDASREAKKQLEAEH 367


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1235   SUBTILISIN  36     3e-04    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 35.6 bits (82), Expect = 3e-04
Identities = 16/79 (20%), Positives = 29/79 (36%), Gaps = 11/79 (13%)

Query: 192 VGVNGVGKTTTIGKLAYRYKMEGKKVMLAAGDTFRAGAIDQLKVWGERVGVDVISQSEG- 250
GV GV + L + +L + + I Q + VD+IS S G
Sbjct: 101 NGVVGVAPEADL--LIIK--------VLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGG 150

Query: 251 SDPAAVMYDAINAAKNKGV 269
+ +++A+ A +
Sbjct: 151 PEDVPELHEAVKKAVASQI 169


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1236   BONTOXILYSIN  26     0.037    Bontoxilysin signature.
		>BONTOXILYSIN#Bontoxilysin signature.

Length = 1196

Score = 26.0 bits (57), Expect = 0.037
Identities = 11/42 (26%), Positives = 23/42 (54%)

Query: 10 LRMNYLFDFYQSLLTNKQRNYLELFYLEDYSLSEIADTFNVS 51
L +NY + S++ ++ N L+ FY + Y + D +N++
Sbjct: 334 LNLNYFCQSFNSIIPDRFSNALKHFYRKQYYTMDYTDNYNIN 375


41  SAV1414  SAV1421  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1414   2       13       -2.778704  putative protein histidine kinase
SAV1415   -1      13       -2.153676  truncated (putative response regulator)
SAV1416   0       15       -2.308874  hypothetical protein
SAV1417   0       15       -1.805374  conserved hypothetical protein
SAV1418   1       17       -1.850596  undecaprenyl-PP-MurNAc-pentapeptide-UDPGlcNAc
SAV1419   1       15       -0.728949  conserved hypothetical protein
SAV1420   1       13       -1.112045  probable carboxy-terminal processing proteinase
SAV1421   1       14       -0.207481  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1414   PF06580  37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 1e-04
Identities = 31/185 (16%), Positives = 68/185 (36%), Gaps = 35/185 (18%)

Query: 277 IEEMNRIIKLVEELLELTKGDVNDISSEAQTVHINDE---IRSRIHSLKQLHPD-YQFDT 332
+E+ + +++ L EL + + S A+ V + DE + S + D QF+
Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRY--SNARQVSLADELTVVDSYLQLASIQFEDRLQFEN 244

Query: 333 DLTSKNLEIKMKPHQFEQLFLIFIDNAIKYDVKNKK----IKVKTRLKNKQKIIEITDHG 388
+ +++++ P L ++N IK+ + I +K N +E+ + G
Sbjct: 245 QINPAIMDVQVPPM----LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300

Query: 389 IGIPEEDQDFIFDRFYRVDKSRSRSQGGNGLGLSIAQKIIQL---NGGSIKIKSEINKGT 445
+ ++ G GL ++ +Q+ IK+ + K
Sbjct: 301 SLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342

Query: 446 TFKII 450
+I
Sbjct: 343 AMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SAV1415   HTHFIS  93     5e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.6 bits (230), Expect = 5e-24
Identities = 30/125 (24%), Positives = 63/125 (50%), Gaps = 4/125 (3%)

Query: 2 TQILIVEDEQNLARFLELELTHENYNVDTEYDGQDGLDKALSHYYDLIILDLMLPSINGL 61
IL+ +D+ + L L+ Y+V + + DL++ D+++P N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EICRKIRQQQS-TPIIIITAKSDTYDKVAGLDYGADDYIVKPFDIEELLARIRAIL---R 117
++ +I++ + P+++++A++ + + GA DY+ KPFD+ EL+ I L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 118 RQPQK 122
R+P K
Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1419   SACTRNSFRASE  32     5e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.2 bits (73), Expect = 5e-04
Identities = 33/140 (23%), Positives = 54/140 (38%), Gaps = 19/140 (13%)

Query: 30 EQWDDQYPLLEHFEEDIAKDYLYVLEENDKIYGFIVVDQDQAEWYDDIDWPVNREGAFVI 89
+Q++D + + EE+ +LY LE + G I + N G +I
Sbjct: 48 KQYEDDDMDVSYVEEEGKAAFLYYLE--NNCIGRIKIRS-------------NWNGYALI 92

Query: 90 HRLTGSKEY--KGAATELFNYVIDVVKARGAEVILTDTFALNKPAQGLFAKFGFHKVGEQ 147
+ +K+Y KG T L + I+ K ++ +T +N A +AK F
Sbjct: 93 EDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVD 152

Query: 148 LMEYP--PYDKGEPFYAYYK 165
M Y P + YYK
Sbjct: 153 TMLYSNFPTANEIAIFWYYK 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits       Score  E-value  Comments
SAV1421   RTXTOXINA  25     0.022    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 25.3 bits (55), Expect = 0.022
Identities = 10/25 (40%), Positives = 15/25 (60%)

Query: 15 GRHDDKGRLAEEIFDDLAFPKHDDD 39
G +DK LA+ F D+AF + +D
Sbjct: 866 GGKEDKLSLADIDFRDVAFKREGND 890


42  SAV1541  SAV1549  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1541   0       17       -4.110510  similar to competence protein
SAV1542   1       14       -2.566870  exogenous DNA-binding protein comGC
SAV1543   1       15       -2.650674  similar to DNA transport mechinery protein
SAV1544   -1      12       -2.814919  similar to late competence protein comGA
SAV1545   0       14       -3.084806  similar to metallo-beta-lactamase superfamily
SAV1546   -1      12       -3.261156  conserved hypothetical protein
SAV1547   -1      10       -2.103873  glucokinase
SAV1548   -1      12       -2.266969  conserved hypothetical protein
SAV1549   -1      12       -2.282343  conserved hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1541   BCTERIALGSPH  40     5e-07    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 40.3 bits (94), Expect = 5e-07
Identities = 14/79 (17%), Positives = 38/79 (48%), Gaps = 4/79 (5%)

Query: 5 KQSAFTMIEMLVVMMLISIFLLLTMTSKGLSNLRVIDDEA-NIISFITELNYIKSQAIAN 63
+Q FT++EM+++++L+ + + + + S D A + F +L +++ + +
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPAS---RDDSAAQTLARFEAQLRFVQQRGLQT 58

Query: 64 QGYINVRFYENSDTIKVIE 82
+ V + + V+E
Sbjct: 59 GQFFGVSVHPDRWQFLVLE 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1542   BCTERIALGSPG  46     9e-10    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 46.4 bits (110), Expect = 9e-10
Identities = 19/76 (25%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 3 KFLKKTQAFTLIEMLLVLLIISLLLILIIPNI--AKQTAHIQSTGCNAQVKMVNSQIEAY 60
+ K + FTL+E+++V++II +L L++PN+ K+ A Q + + + + ++ Y
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKA--VSDIVALENALDMY 59

Query: 61 ALKHNRNPSSIEDLIA 76
L ++ P++ + L +
Sbjct: 60 KLDNHHYPTTNQGLES 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1543   BCTERIALGSPF  84     4e-20    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 84.1 bits (208), Expect = 4e-20
Identities = 65/347 (18%), Positives = 137/347 (39%), Gaps = 6/347 (1%)

Query: 14 KKRQLSKAQQIDLLSNLCNLLKYGFTLYQSFQFLNLQMTYKN-KQLGTTILSEISNGAPC 72
+K +LS + L L L+ L ++ + Q + QL + S++ G
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 73 NQIL-SLIGYSDTI-VMQVYLAERFGNIIDVLEETVNYMKVNRKSEQRLLKTLQYPLILV 130
+ G + + V E G++ VL +Y + ++ R+ + + YP +L
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 131 SIFIAMIIILNLTVIPQFQQLYTSMNIQLSSFQKTLSFFITSLPTIIVVMLIIVSMLAII 190
+ IA++ IL V+P+ + + M L + L ++ T ML+ + +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 191 MKLIYNNLNMLNKIN-FVMKLPLISGYFQLFKTYFVTNELVLFYKNGITLQSIVDVYINH 249
+++ + ++ LPLI + T L + + + L + + +
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 250 SS-DPFRQFLGKYLLTYSEMGYGLPQILEKLKCFKPQLIKFVLQGEKRGKLEVELKLYSQ 308
S D R L E G L + LE+ F P + + GE+ G+L+ L+ +
Sbjct: 301 MSNDYARHRLSLATDAVRE-GVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAAD 359

Query: 309 ILVKQIEDKAIKQTQFLQPILFLILGLFIVAIYLVIMLPMFQMMQSI 355
++ + +P+L + + ++ I L I+ P+ Q+ +
Sbjct: 360 NQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1545   SHIGARICIN  27     0.039    Ribosome inactivating protein family signature.
		>SHIGARICIN#Ribosome inactivating protein family signature.

Length = 289

Score = 27.5 bits (61), Expect = 0.039
Identities = 20/99 (20%), Positives = 38/99 (38%), Gaps = 11/99 (11%)

Query: 82 DFLKDPVKNGADKFKQYGLPIITSKVTPEK-------LNEGSTEIE-GFKFNVLHTPGHS 133
F+ + K + K Y +P++ S + + N I ++ G+
Sbjct: 39 VFISNLRKALPYERKLYDIPLLRSTLPGSQRYALIHLTNYADETISVAIDVTNVYVMGYR 98

Query: 134 PGSLTYVFDEFAVVG--DTLFNNGIGRTDL-YKGDYETL 169
G +Y F+E + +F + + L Y G+YE L
Sbjct: 99 AGDTSYFFNEASATEAAKYVFKDAKRKVTLPYSGNYERL 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1547   PF03309  30     0.012    Bvg accessory factor
		>PF03309#Bvg accessory factor

Length = 271

Score = 29.7 bits (67), Expect = 0.012
Identities = 32/154 (20%), Positives = 51/154 (33%), Gaps = 37/154 (24%)

Query: 5 ILAADVGGTTCKLGIFTPELEQ---LHKWSIHTD---TSDSTGYTLLKGIYDSFVEKVNE 58
+LA DV T +G+ + + + +W I T+ T+D + G+
Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELA-LTIDGLI--------- 51

Query: 59 NNYNFSNVLGVGIG--VPGPVDFEKGTVNGAVNLYWPE------KVNVREIFEQFVDCPV 110
+ + G VP V E V + YWP + VR VD P
Sbjct: 52 -GDDAERLTGASGLSTVP-SVLHE---VRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPK 106

Query: 111 YVDND--ANIAALGEKHKGAGEGADDVVAITLGT 142
V D N A K+ + + G+
Sbjct: 107 EVGADRIVNCLAAYHKYGT------AAIVVDFGS 134


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1549   TCRTETA  33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.002
Identities = 29/170 (17%), Positives = 54/170 (31%), Gaps = 51/170 (30%)

Query: 241 MLTVYFIAGLFGN--------FVSLSFNTTTISVGASGAIFGLIGSIFAMMY---VSKTF 289
++ V+FI L G F F+ ++G S A FG++ S+ M V+
Sbjct: 215 LMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARL 274

Query: 290 NKK----------MLGQLLIA-----------LVILVGVSLFMS------NINIVAHIGG 322
++ G +L+A +V+L + M + + G
Sbjct: 275 GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334

Query: 323 FIGGLLITL-----------IGYYYKVNRNIF--WILLIGMLVIFIALQI 359
+ G L L Y + + W + G + + L
Sbjct: 335 QLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPA 384


43   SAV1809   SAV1813   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1809   2       10       -0.608202  serine protease
SAV1810   2       11       -1.411772  serine protease
SAV1811   1       13       -2.004907  serine protease
SAV1812   0       13       -3.188821  serine protease
SAV1813   1       15       -4.595692  serine protease
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1809   V8PROTEASE  115    6e-33    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 115 bits (290), Expect = 6e-33
Identities = 60/227 (26%), Positives = 103/227 (45%), Gaps = 26/227 (11%)

Query: 30 IQQTAKA-----ENTVKQITNTNVAPYSGVTWMGA--------GTGFVVGNHTIITNKHV 76
++Q A N QIT+T Y+ VT++ +G VVG T++TNKHV
Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120

Query: 77 TYHM-KVGDEIKAHPNGFY--NNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFK 133
+KA P+ N G + +I Y G+ D+A+V+ + +
Sbjct: 121 VDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSP---NEQNKHIG 177

Query: 134 DFTSKFNIA--SEAKENEPISVIGYPNPNGNKLQMYESTGKVLSVNGNIVSSDAIIQPGS 191
+ ++ +E + N+ I+V GYP + M+ES GK+ + G + D G+
Sbjct: 178 EVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGN 236

Query: 192 SGSPILNSKHEAIGVIYAGNKPSGESTRGFAVYFSPEIKKFIADNLD 238
SGSP+ N K+E IG+ + G AV+ + ++ F+ N++
Sbjct: 237 SGSPVFNEKNEVIGIHWGGVPNEF----NGAVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1810   V8PROTEASE  112    1e-31    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 112 bits (281), Expect = 1e-31
Identities = 58/227 (25%), Positives = 100/227 (44%), Gaps = 26/227 (11%)

Query: 30 IQQTAKA-----ENSVKLITNTNVAPYSGVTWMGA--------GTGFVVGNHTIITNKHV 76
++Q A N IT+T Y+ VT++ +G VVG T++TNKHV
Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120

Query: 77 TYHM-KVGDEIKAHPNGFY--NNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFK 133
+KA P+ N G + +I Y G+ D+A+V+ + +
Sbjct: 121 VDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSP---NEQNKHIG 177

Query: 134 DFTSKFNIA--SEAKENEPISVIGYPNPNGNKLQMYESTGKVLSVNGNIVTSDAVVQPGS 191
+ ++ +E + N+ I+V GYP + M+ES GK+ + G + D G+
Sbjct: 178 EVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGN 236

Query: 192 SGSPILNSKREAIGVMYASDKPTGESTRSFAVYFSPEIKKFIADNLD 238
SGSP+ N K E IG+ + AV+ + ++ F+ N++
Sbjct: 237 SGSPVFNEKNEVIGIHWGGVPNEFNG----AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1811   V8PROTEASE  178    7e-57    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 178 bits (452), Expect = 7e-57
Identities = 64/217 (29%), Positives = 106/217 (48%), Gaps = 23/217 (10%)

Query: 37 EKNVTQVKDTNNFPYNGVVSFK--------DATGFVIGKNTIITNKHV-SKDYKVGDRIT 87
+ Q+ DT N Y V + A+G V+GK+T++TNKHV + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHP---NGDKGNGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAK 144
A P N D G + + I+ Y G+ D++++ + + E V+ +
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK-----HIGEVVKPATMSN 187

Query: 145 DA--KVDDKIKVIGYPLPAQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNN 202
+A +V+ I V GYP +ES G I +K + +D GNSGSPV N N
Sbjct: 188 NAETQVNQNITVTGYPGDK-PVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKN 246

Query: 203 EVIGVVYGGIGKIGSEYNGAVYFTPQIKDFIQKHIEQ 239
EVIG+ +GG + +E+NGAV+ +++F++++IE
Sbjct: 247 EVIGIHWGG---VPNEFNGAVFINENVRNFLKQNIED 280


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1812   V8PROTEASE  177    2e-56    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 177 bits (450), Expect = 2e-56
Identities = 65/230 (28%), Positives = 108/230 (46%), Gaps = 29/230 (12%)

Query: 29 EVQQTAKA-----ENNVTKIQDTNIFPYTGVVAFKS--------ATGFVVGKNTILTNKH 75
++Q A N+ +I DT Y V + A+G VVGK+T+LTNKH
Sbjct: 60 PLEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKH 119

Query: 76 V-SKNYKVGDRITAHP---NSDKGNGGIYSIKKIINYPGKEDVSVIQVEERAIERGPKGF 131
V + + A P N D G ++ ++I Y G+ D+++++ +
Sbjct: 120 VVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNEQNK----- 174

Query: 132 NFNDNVTPFKYAAGA--KAGERIKVIGYPHPYKNKYVLYESTGPVMSVEGSSIVYSAHTE 189
+ + V P + A + + I V GYP K ++ES G + ++G ++ Y T
Sbjct: 175 HIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMWESKGKITYLKGEAMQYDLSTT 233

Query: 190 SGNSGSPVLNSNNELVGIHFASDVKNDDNRNAYGVYFTPEIKKFIAENID 239
GNSGSPV N NE++GIH+ V N+ N V+ ++ F+ +NI+
Sbjct: 234 GGNSGSPVFNEKNEVIGIHWGG-VPNEFNG---AVFINENVRNFLKQNIE 279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1813   V8PROTEASE  138    1e-41    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 138 bits (349), Expect = 1e-41
Identities = 64/212 (30%), Positives = 101/212 (47%), Gaps = 18/212 (8%)

Query: 36 EKNVKEITDATKAPYNSVVAFA--------GGTGVVVGKNTIVTNKHIAKSNDIFKNRVA 87
+ +ITD T Y V +GVVVGK+T++TNKH+ + + +
Sbjct: 73 NNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALK 132

Query: 88 AHYS---SKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFAEGA-- 142
A S G + + I +Y G+ DLAIV + + + + V + A
Sbjct: 133 AFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQNKHIGEVVKPATMSNNAET 191

Query: 143 KAKDRISVIGYPKGAQTKYKMFESTGTINHISGTFIEFDAYAQPGNSGSPVLNSKHELIG 202
+ I+V GYP G + M+ES G I ++ G +++D GNSGSPV N K+E+IG
Sbjct: 192 QVNQNITVTGYP-GDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIG 250

Query: 203 ILYAGSGKDESEKNFGVYFTPQLKEFIQNNIE 234
I + G +E N V+ ++ F++ NIE
Sbjct: 251 IHWGGVP---NEFNGAVFINENVRNFLKQNIE 279


44   SAV1819   SAV1830   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1819   5       15       -4.507246  leukotoxin F-subunit
SAV1820   6       17       -5.799965  leukotoxin S-subunit
SAV1821   6       20       -7.121026  hypothetical protein
SAV1822   7       18       -6.698566  conserved hypothetical protein
SAV1823   8       21       -7.212088  hypothetical protein
SAV1824   8       22       -6.949343  extracellular enterotoxin type G precursor
SAV1825   5       21       -7.331276  enterotoxin
SAV1826   4       17       -5.879637  enterotoxin
SAV1827   2       14       -3.508722  enterotoxin
SAV1828   1       12       -2.705536  extracellular enterotoxin type I precursor
SAV1829   0       13       -1.292214  enterotoxin
SAV1830   0       13       -0.855541  enterotoxin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1819   BICOMPNTOXIN  396    e-141    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 396 bits (1020), Expect = e-141
Identities = 96/329 (29%), Positives = 177/329 (53%), Gaps = 24/329 (7%)

Query: 1 MKMKKLVKSSVASSIALLLLSNTVDAAQHITPVSEKKVDDKITLYKTTATSDNDKLNISQ 60
M K++ ++++ S+ L + ++ A+ + I + K T ++K ++Q
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 ILTFNFIKDKSYDKDTLVLKAAGNINSGYKKPNPKDYNYSQ-FYWGGKYNVSVSSESNDA 119
+ F+F+KDK Y+KD L+LK G I+S N K N+ + W +YN+ + + +
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTN-DKY 119

Query: 120 VNVVDYAPKNQNEEFQVQQTLGYSYGGDINISNGLSGGLNGSKSFSETINYKQESYRTTI 179
V++++Y PKN+ E V QTLGY+ GG+ + L G NGS ++S++I+Y Q++Y + +
Sbjct: 120 VSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGG--NGSFNYSKSISYTQQNYVSEV 177

Query: 180 DRKTNHKSIGWGVEAHKIMNNGWGPYGRDSYDPTYGNELFLGGRQSSSNAGQNFLPTHQM 239
+++ N KS+ WGV+A+ + ++LF+G + S + F+P ++
Sbjct: 178 EQQ-NSKSVLWGVKANSFAT-------ESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSEL 229

Query: 240 PLLARGNFNPEFISVLSHKQNDTKKSKIKVTYQREMD---------RYTNQWNRLHWIGN 290
P L + FNP FI+ +SH++ + S+ ++TY R MD Y N + H + N
Sbjct: 230 PPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHN 289

Query: 291 NYKNQNTVTFTSTYEVDWQNHTVKLIGTD 319
+ N+N +T YEV+W+ H +K+ G +
Sbjct: 290 AFVNRN---YTVKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1820   BICOMPNTOXIN  433    e-156    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 433 bits (1116), Expect = e-156
Identities = 214/318 (67%), Positives = 256/318 (80%), Gaps = 10/318 (3%)

Query: 1 MFKKKMLAATLSVGLIAPLASPIQE-SRANTNIENIGDGA--EVIKRTEDVSSKKWGVTQ 57
M K K+L TLSV L+APLA+P+ E ++A + E+IG G+ E+IKRTED +S KWGVTQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 58 NVQFDFVKDKKYNKDALIVKMQGFINSRTSFSDVKGSGYELTKRMIWPFQYNIGLTTKDP 117
N+QFDFVKDKKYNKDALI+KMQGFI+SRT++ + K + + K M WPFQYNIGL T D
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNH--VKAMRWPFQYNIGLKTNDK 118

Query: 118 NVSLINYLPKNKIETTDVGQTLGYNIGGNFQSAPSIGGNGSFNYSKTISYTQKSYVSEVD 177
VSLINYLPKNKIE+T+V QTLGYNIGGNFQSAPS+GGNGSFNYSK+ISYTQ++YVSEV+
Sbjct: 119 YVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVE 178

Query: 178 KQNSKSVKWGVKANEFVTPDGKKSAHDRYLFVQSPNGPTGSAREYFAPDNQLPPLVQSGF 237
+QNSKSV WGVKAN F T G+KSA D LFV + R+YF PD++LPPLVQSGF
Sbjct: 179 QQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPH-SKDPRDYFVPDSELPPLVQSGF 237

Query: 238 NPSFITTLSHEKGSSDTSEFEISYGRNLDITYA----TLFPRTGIYAERKHNAFVNRNFV 293
NPSFI T+SHEKGSSDTSEFEI+YGRN+D+T+A T + + + R HNAFVNRN+
Sbjct: 238 NPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYT 297

Query: 294 VRYEVNWKTHEIKVKGHN 311
V+YEVNWKTHEIKVKG N
Sbjct: 298 VKYEVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1824   BACTRLTOXIN  195    4e-64    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 195 bits (497), Expect = 4e-64
Identities = 109/261 (41%), Positives = 155/261 (59%), Gaps = 11/261 (4%)

Query: 4 LSTVIIILILEIVFHNMN-YVNAQPDPKLDELNKVSDYKNNKGTMGNVMNLYTSPPVEGR 62
+S VI+I L +V N +QPDP D+L+K S++ GTMGN+ LY V
Sbjct: 7 ISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFT---GTMGNMKYLYDDHYVSAT 63

Query: 63 GVINSRQFLSHDLIFPI---EYKSYNEVKTELENTELANNYKDKKVDIFGVPYFYTCIIP 119
V + +FL+HDLI+ I + K+Y++VKTEL N +LA YKD+ VD++G Y+ C
Sbjct: 64 KVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123

Query: 120 KSEPDINQNFGGCCMYGGLTF---NSSENERDKLITVQVTIDNRQSLGFTITTNKNMVTI 176
+ G CMYGG+T N +N + + V+V + R ++ F + T+K VT
Sbjct: 124 SKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKSVTA 183

Query: 177 QELDYKARHWLTKEKKLYEFDGSAFESGYIKFTEKNNTSFWFDLFPKKELVPFVPYKFLN 236
QELD KAR++L +K LYEF+ S +E+GYIKF E N +FW+D+ P F K+L
Sbjct: 184 QELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDK-FDQSKYLM 242

Query: 237 IYGDNKVVDSKSIKMEVFLNT 257
+Y DNK VDSKS+K+EV L T
Sbjct: 243 MYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1825   BACTRLTOXIN  154    3e-48    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 154 bits (391), Expect = 3e-48
Identities = 76/265 (28%), Positives = 124/265 (46%), Gaps = 21/265 (7%)

Query: 9 RLFYIAAIII-TLLCLINNNYVNAEV----DKKDLKKKSDLDSSKLFNLTSYYTDITWQL 63
RLF I+I L+ +I+ V AE DL K S+ + + N+ Y D +
Sbjct: 4 RLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEF-TGTMGNMKYLYDDH--YV 60

Query: 64 DESNKISTDQLLNNTIILKNIDISVLKTSSLKVEFNSSDLANQFKGKNIDIYGLYFGNKC 123
+ S D+ L + +I D + +K E + DLA ++K + +D+YG + C
Sbjct: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC 120

Query: 124 -------VGLTEEKTSCLYGGVTIHDGNQLDEEKV--IGVNVFKDGVQQEGFVIKTKKAK 174
VG +C+YGG+T H+GN D + + V V+++ F ++T K
Sbjct: 121 YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS 180

Query: 175 VTVQELDTKVRFKLENLYKIYNKDTGNIQKGCIFFHSHNHQDQSFYYDLYNVKGSVG--A 232
VT QELD K R L N +Y ++ + G I F +N +F+YD+ G +
Sbjct: 181 VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENN--GNTFWYDMMPAPGDKFDQS 238

Query: 233 EFFQFYSDNRTVSSSNYHIDVFLYK 257
++ Y+DN+TV S + I+V L
Sbjct: 239 KYLMMYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1826   BACTRLTOXIN  159    5e-52    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 159 bits (404), Expect = 5e-52
Identities = 84/135 (62%), Positives = 111/135 (82%), Gaps = 4/135 (2%)

Query: 2 KKTCMYGGVTEHDGNQIDKNNSTDNSHNILIKVYENERNSLSFDIPTNKKNITAQEIDYK 61
KTCMYGG+T+H+GN D N N+L++VYEN+RN++SF++ T+KK++TAQE+D K
Sbjct: 134 GKTCMYGGITKHEGNHFDNGNLQ----NVLVRVYENKRNTISFEVQTDKKSVTAQELDIK 189

Query: 62 VRNYLLKHKNLYEFNSSPYETGYIKFIEGSGHSFWYDLMPESGKKFYPTKYLLIYNDNKT 121
RN+L+ KNLYEFNSSPYETGYIKFIE +G++FWYD+MP G KF +KYL++YNDNKT
Sbjct: 190 ARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKT 249

Query: 122 VESKSINVEVHLTKK 136
V+SKS+ +EVHLT K
Sbjct: 250 VDSKSVKIEVHLTTK 264


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1827   BACTRLTOXIN  108    3e-32    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 108 bits (272), Expect = 3e-32
Identities = 48/94 (51%), Positives = 68/94 (72%), Gaps = 5/94 (5%)

Query: 21 NGNPKPEQLNKASEFTGLMDNMRYLYDDKHVSETNIKSQEKFLQHDLLFKINGSKI---- 76
+P P+ L+K+SEFTG M NM+YLYDD +VS T +KS +KFL HDL++ I+ K+
Sbjct: 30 QPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVDKFLAHDLIYNISDKKLKNYD 89

Query: 77 -LKTEFNNKSLSDKYKNKNVDLFGTNYYNQCYFS 109
+KTE N+ L+ KYK++ VD++G+NYY CYFS
Sbjct: 90 KVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFS 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1828   BACTRLTOXIN  108    2e-30    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 108 bits (270), Expect = 2e-30
Identities = 54/227 (23%), Positives = 98/227 (43%), Gaps = 37/227 (16%)

Query: 30 VGNLRNFYTKHDYIDLKGVTDKNLPIANQLEFS------TGTNDLISESNNWDEISKFKG 83
+GN++ Y H K + +A+ L ++ + + +E N D K+K
Sbjct: 48 MGNMKYLYDDHYVSATKVKSVDKF-LAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKD 106

Query: 84 KKLDIFGIDY-------------NGPCKSKYMYGGATL-SGQYLNSARKIPINLWVNGKH 129
+ +D++G +Y MYGG T G + ++ + + V
Sbjct: 107 EVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENK 166

Query: 130 KTISTDKIATNKKLVTAQEIDVKLRRYLQEEYNIYGHNNTGKGKEYGYKSKFYSGFNNGK 189
+ + ++ T+KK VTAQE+D+K R +L + N+Y N+ + G
Sbjct: 167 RNTISFEVQTDKKSVTAQELDIKARNFLINKKNLY-EFNSSP-------------YETGY 212

Query: 190 VLFHLNNEKSFSYDLF-YTGDGLPVS-FLKIYEDNKIIESEKFHLDV 234
+ F NN +F YD+ GD S +L +Y DNK ++S+ ++V
Sbjct: 213 IKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEV 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1829   BACTRLTOXIN  123    2e-36    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 123 bits (310), Expect = 2e-36
Identities = 64/231 (27%), Positives = 111/231 (48%), Gaps = 36/231 (15%)

Query: 28 NLRNYYGSYPIEDHQSINPENNHLSHQLVFSMDNST------VTAEFKNVDDVKKFKNHA 81
N++ Y + + + + L+H L++++ + V E N D KK+K+
Sbjct: 50 NMKYLYDDHYVS-ATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEV 108

Query: 82 VDVYGLSYSGYCLKNKY------------IYGGVTLA-GDYLEKSRRIPINLWVNGEHQT 128
VDVYG +Y C + +YGG+T G++ + + + V +
Sbjct: 109 VDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRN 168

Query: 129 ISTDKVSTNKKLVTAQEIDTKLRRYLQEEYNIYGFNDTNKGRNYGNKSKFSSGFNAGKIL 188
+ +V T+KK VTAQE+D K R +L + N+Y FN SS + G I
Sbjct: 169 TISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFN--------------SSPYETGYIK 214

Query: 189 FHLNDGSSFSYDLFDT-GTGQAES-FLKIYNDNKTVETEKFHLDVEISYKD 237
F N+G++F YD+ G +S +L +YNDNKTV+++ ++V ++ K+
Sbjct: 215 FIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTKN 265


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1830   BACTRLTOXIN  171    7e-55    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 171 bits (435), Expect = 7e-55
Identities = 91/270 (33%), Positives = 140/270 (51%), Gaps = 20/270 (7%)

Query: 3 NSKVMLNVLLLILNLIAICSVNNAYANEE-DPKIESLCKKSSVDPIALHNINDDYINNRF 61
++ ++ ++LI LI + S N A + DP + L K S + N+ Y ++
Sbjct: 2 YKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTG-TMGNMKYLYDDHYV 60

Query: 62 TTVKSIVSTTEKFLDFDLLFKSINWLDGISAEFKDLKVEFSSSAISKEFLGKTVDIYGVY 121
+ K V + +KFL DL++ D + +K E + ++K++ + VD+YG
Sbjct: 61 SATK--VKSVDKFLAHDLIYNI---SDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSN 115

Query: 122 YKAHCH-------GEHQVDTACTYGGVTPHENNKLSEP--KNIGVAVYKDNVNVNTFIVT 172
Y +C+ G+ C YGG+T HE N +N+ V VY++ N +F V
Sbjct: 116 YYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQ 175

Query: 173 TDKKKVTAQELDIKVRTKLNNAYKLYDRMTSDVQKGYIKFHSHSEHKESFYYDLFYIKGN 232
TDKK VTAQELDIK R L N LY+ +S + GYIKF ++ +F+YD+ G+
Sbjct: 176 TDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNG--NTFWYDMMPAPGD 233

Query: 233 LPDQ--YLQIYNDNKTIDSSDYHIDVYLFT 260
DQ YL +YNDNKT+DS I+V+L T
Sbjct: 234 KFDQSKYLMMYNDNKTVDSKSVKIEVHLTT 263


45   SAV1842   SAV1849   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV1842   2       10       -2.537139  cmp-binding-factor 1
SAV1843   2       10       -2.855420  conserved hypothetical protein
SAV1844   -1      13       -2.130253  probable phosphoesterase
SAV1845   -2      10       -1.464602  conserved hypothetical protein
SAV1846   -2      10       -0.746619  conserved hypothetical protein
SAV1847   -3      9        -0.490436  hypothetical protein
SAV1848   -3      9        -0.552274  two-component response regulator homolog
SAV1849   -3      10       -0.509204  two-component sensor histidine kinase homolog
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1842   SSPANPROTEIN  29     0.035    Salmonella invasion protein InvJ signature.
		>SSPANPROTEIN#Salmonella invasion protein InvJ signature.

Length = 336

Score = 28.6 bits (63), Expect = 0.035
Identities = 12/31 (38%), Positives = 19/31 (61%)

Query: 146 PAASSHHHNFASGLSYHVLTMLRIAKSICDI 176
PA S HH+ SGL ++ + LRIA+ + +
Sbjct: 72 PAKSEHHNGNVSGLHHNGKSELRIAEKLLKV 102


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1843   cloacin  36     7e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 35.8 bits (82), Expect = 7e-04
Identities = 40/175 (22%), Positives = 68/175 (38%), Gaps = 36/175 (20%)

Query: 239 KQKEVALHDHSQEWKSLEQQLNIEPITFPEKGVDR-YEKARAHKQSLERDIGLRNERLAQ 297
KQ++ + QEW + T P + +R YE+ARA D+ ER A+
Sbjct: 297 KQRQDEENRRQQEWDA----------THPVEAAERNYERARAELNQANEDVARNQERQAK 346

Query: 298 LKEEATQLEPVKQSDIDAF-ISLNQQENEIKNKEFELTAIE-------------KDIANK 343
A Q+ ++S++DA +L EI K+F A +
Sbjct: 347 ----AVQVYNSRKSELDAANKTLADAIAEI--KQFNRFAHDPMAGGHRMWQMAGLKAQRA 400

Query: 344 QRDKDELQANIGWSETHHDVDSSEAMKSYVSEQIKNKQEQAAYIKQLERSLEENK 398
Q D + QA + + ++A S E K K+++ + E +L + K
Sbjct: 401 QTDVNNKQA--AFDAAAKEKSDADAALSSAMESRKKKEDKK---RSAENNLNDEK 450


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits    Score  E-value  Comments
SAV1848   HTHFIS  80     8e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 80.3 bits (198), Expect = 8e-20
Identities = 36/143 (25%), Positives = 61/143 (42%), Gaps = 7/143 (4%)

Query: 3 KVILVDDHYIVRQGLRFLLSTIENIEVLQDFADGETFLEYLKEHEHPDIVLLDLVMPGMN 62
+++ DD +R L L + +V ++ T ++ D+V+ D+VMP N
Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWI-AAGDGDLVVTDVVMPDEN 61

Query: 63 GIEITEYIKAHYPEIKVLVLTSYVDDEHVISAINKGADGYEMKDVEPQQLIETIRRVMNG 122
++ IK P++ VLV+++ I A KGA Y K + +LI I R +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 EKMIHPK----AQDVFETVSQKP 141
K K +QD V +
Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV1849   PF06580  40     8e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 8e-06
Identities = 21/111 (18%), Positives = 45/111 (40%), Gaps = 16/111 (14%)

Query: 272 IDLSNEIEENIYRA------LQECINNVKKHA-----DTNKMDLTLKQMNDILYIDVIDY 320
+ N+I I +Q + N KH K+ L + N + ++V +
Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 321 GQGFEIDNVQIASSHGINNIKQRVKLLRGK---VTFHSQPTKGTQIQFTIP 368
G + N + ++ G+ N+++R+++L G + + K + IP
Sbjct: 300 GSL-ALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


46   SAV1948   SAV1955   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias   Product
SAV1948   0       15       0.041738  enterotoxin P
SAV1949   0       16       0.591800  hypothetical protein
SAV1950   0       13       1.439057  hypothetical protein
SAV1951   0       13       1.414632  similar to phi PVL ORF 22 homolog
SAV1952   0       13       1.613642  hypothetical protein
SAV1953   1       14       1.473232  phi PVL ORF 20 and 21 homolog
SAV1954   3       14       1.574181  phi PVL orfs 18-19-like protein
SAV1955   3       13       1.430395  phi PVL ORF 15 and 16 homolog
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV1948   BACTRLTOXIN  298    e-104    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 298 bits (764), Expect = e-104
Identities = 82/269 (30%), Positives = 141/269 (52%), Gaps = 16/269 (5%)

Query: 1 MSKMKKTAFTLLLFIALTLTTSPLVNGSEKSEEINEKDLRKKSELQGTALGNLKQIYYYN 60
M K + +L+F + + ++P V + + + + DL K SE GT +GN+K +Y +
Sbjct: 1 MYKRLFISRVILIFALILVISTPNVLAESQPDPMPD-DLHKSSEFTGT-MGNMKYLYDDH 58

Query: 61 EKAKTENKESHDQFLQHTILFKGFFTDHSWYNDLLVDFDSKDIVDKYKGKKVDLYGAYYG 120
+ T+ K S D+FL H +++ Y+ + + ++D+ KYK + VD+YG+ Y
Sbjct: 59 YVSATKVK-SVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYY 117

Query: 121 YQC-------AGGTPNKTACMYGGVTLHDNNRLTEEKKVPINLWL-DGKQNTVPLETVKT 172
C G CMYGG+T H+ N + + + + K+NT+ E V+T
Sbjct: 118 VNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFE-VQT 176

Query: 173 NKKNVTVQELDLQARRYLQEKYNLYNSDVFDGKVQRGLIVFHTSTEPSVNYDLFGAQGQY 232
+KK+VT QELD++AR +L K NLY + + G I F + + YD+ A G
Sbjct: 177 DKKSVTAQELDIKARNFLINKKNLYEFN--SSPYETGYIKFIENNGNTFWYDMMPAPGDK 234

Query: 233 --SNTLLRIYRDNKTINSENMHIDIYLYT 259
+ L +Y DNKT++S+++ I+++L T
Sbjct: 235 FDQSKYLMMYNDNKTVDSKSVKIEVHLTT 263


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1950   DPTHRIATOXIN  27     0.022    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 27.0 bits (59), Expect = 0.022
Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 2/84 (2%)

Query: 35 LREEHKQHHNELRESHKELKDKQDKVVDENLEQTKILNRIEERYQTQVDVAQKNEEKTL- 93
+R++ K L+E H +K+K + ++ + + K +EE +QT ++ + +E KT+
Sbjct: 241 IRDKTKTKIESLKE-HGPIKNKMSESPNKTVSEEKAKQYLEEFHQTALEHPELSELKTVT 299

Query: 94 AQNKWLVGAIWALVTIVMIAVITA 117
N GA +A + + VI +
Sbjct: 300 GTNPVFAGANYAAWAVNVAQVIDS 323


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV1953   CHANLCOLICIN  40     4e-05    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 40.4 bits (94), Expect = 4e-05
Identities = 38/217 (17%), Positives = 76/217 (35%), Gaps = 20/217 (9%)

Query: 588 AIEAARESTKEQLRDYVKTSDYKTDKDGIVERLDTA-EAERTTLKGEIKDKVTLNEYRNG 646
A+E A++ + VK + + A +AE TL G+ NE
Sbjct: 190 AVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGK------RNELAQA 243

Query: 647 LEEQKQYTD--DQLSDLSNNPEIKASIEQANQEAQEALKSYIDAQDDLKEKESQAYADGK 704
+ K+ + +LS +N+P +A + A K + Q + E++
Sbjct: 244 SAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINA 303

Query: 705 ISEEEQRAIQDAQAKLEEAKQNAELKARNAEKKANAYTDNKVKESTDAQ----RKTLTRY 760
+ Q+AI N +K N ++++K++ DA + +Y
Sbjct: 304 DITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLTEKY 363

Query: 761 GSQIIQNGKEI-------KLRTTKEEFNATNRTLSNI 790
G + + +E+ K+ E A + +
Sbjct: 364 GEKYSKMAQELADKSKGKKIGNVNEALAAFEKYKDVL 400


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits        Score  E-value  Comments
SAV1955   GPOSANCHOR  35     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 35.4 bits (81), Expect = 0.002
Identities = 22/145 (15%), Positives = 44/145 (30%), Gaps = 18/145 (12%)

Query: 3 ERIKGLSIGLDLDAANLNRSFAEIKRNFKTLNSDLKLTGNNFKYTEKSTDSYQQRIKELD 62
+IK L AA ++ +D E + + R EL+
Sbjct: 211 AKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT----LEAEKAALEARQAELE 266

Query: 63 GTIIGYKKNVDDLAKQYDKVSQEQGE--------------NSAEAQKLRQEYNKQANELN 108
+ G + + + E+ +A Q LR++ +
Sbjct: 267 KALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKK 326

Query: 109 YLERELQKTSAEFEEFKKAQVEAQR 133
LE E QK + + + ++ +R
Sbjct: 327 QLEAEHQKLEEQNKISEASRQSLRR 351



Score = 32.0 bits (72), Expect = 0.021
Identities = 12/134 (8%), Positives = 33/134 (24%), Gaps = 14/134 (10%)

Query: 18 NLNRSFAEIKRNFKTLNSDLKLTGNNFKYTEKSTDSYQQRIKELDGTIIGYKKNVDDLAK 77
L + K + + L + + E ++ ++ + L
Sbjct: 89 ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEA 148

Query: 78 QYDKVSQEQ--------------GENSAEAQKLRQEYNKQANELNYLERELQKTSAEFEE 123
+ ++ + +SA+ + L E LE+ L+
Sbjct: 149 EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA 208

Query: 124 FKKAQVEAQRMAES 137
+ +
Sbjct: 209 DSAKIKTLEAEKAA 222


47   SAV2001   SAV2011   N   Y   Y   Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2001   0       13       -1.674630  hypothetical protein
SAV2002   0       14       -2.412652  integrase
SAV2003   1       14       -2.685300  truncated beta-hemolysin
SAV2004   1       12       -3.269562  hypothetical protein
SAV2005   0       14       -3.513925  hypothetical protein
SAV2006   2       16       -3.259046  similar to succinyl-diaminopimelate
SAV2007   5       15       -3.475538  similar to Na+transporting ATP synthase
SAV2008   2       15       -2.983541  extracellular enterotoxin L
SAV2009   1       17       -2.372186  enterotoxin typeC3
SAV2010   -1      16       -2.029017  hypothetical protein
SAV2011   -2      20       -1.733285  toxic shock syndrome toxin-1
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV2001   IGASERPTASE  33     5e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 5e-04
Identities = 20/104 (19%), Positives = 39/104 (37%), Gaps = 2/104 (1%)

Query: 26 ESKKEVKSKEKKIEKEKENKSKKDKEKEVA--TQQQPDNQTVEQPQSQEQSVQQPQQQIP 83
+S E K + KE K++K K TQ+ P + P+ ++ QPQ +
Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146

Query: 84 QNSVPQQNVQVQQNKKQKVDLNNMPPTDFSTEGMSEQAQKQIEE 127
+ + P N++ Q++ P + S+ +
Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2004   BICOMPNTOXIN  214    8e-70    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 214 bits (547), Expect = 8e-70
Identities = 82/315 (26%), Positives = 145/315 (46%), Gaps = 18/315 (5%)

Query: 16 ISTALTVFPATSYAKINSEIKAVSEKNLDGDTKMYTRTATTSDSQKNITQSLQFNFLTEP 75
+S +L A + + D ++ RT + ++ +TQ++QF+F+ +
Sbjct: 11 LSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDK 70

Query: 76 NYDKETVFIKAKGTIGSGLRILDPNGY-WNSTLRWPGSYSVSIQNVDDNNNTNVTDFAPK 134
Y+K+ + +K +G I S + +RWP Y++ ++ ++ ++ ++ PK
Sbjct: 71 KYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKT--NDKYVSLINYLPK 128

Query: 135 NQDESREVKYTYGYKTGGDFSINRGGLTGNITKESNYSETISYQQPSYRTLLDQSTSHKG 194
N+ ES V T GY GG+F L GN + NYS++ISY Q +Y + ++Q K
Sbjct: 129 NKIESTNVSQTLGYNIGGNFQSAPS-LGGNGSF--NYSKSISYTQQNYVSEVEQQN-SKS 184

Query: 195 VGWKVEAHLINNMGHDHTRQLTNDSDNRTKSEIFSLTRNGNLWAKDNFTPKDKMPVTVSE 254
V W V+A+ + S++F + + +D F P ++P V
Sbjct: 185 VLWGVKANSFATESGQKSAF---------DSDLFVGYKPHSKDPRDYFVPDSELPPLVQS 235

Query: 255 GFNPEFLAVMSHDKKDKGKSQFVVHYKRSMDEFKIDWNRHGFWG-YWSGENHVDK-KEEK 312
GFNP F+A +SH+K S+F + Y R+MD + Y G +
Sbjct: 236 GFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRN 295

Query: 313 LSALYEVDWKTHDVK 327
+ YEV+WKTH++K
Sbjct: 296 YTVKYEVNWKTHEIK 310


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2005   BICOMPNTOXIN  165    1e-50    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 165 bits (419), Expect = 1e-50
Identities = 103/343 (30%), Positives = 163/343 (47%), Gaps = 42/343 (12%)

Query: 4 KKRVLIASSLSCAILLLSAATTQANSAHKDSQDQNKKEHVDKSQQKDKRNVTNKDKNSTV 63
K ++ ++LS ++L A N+
Sbjct: 2 LKNKILTTTLSVSLLAPLANPLLENAKAA-----------------------------ND 32

Query: 64 PDDIGKNGKIT--KRTETVYDEKTNILQNLQFDFIDDPTYDKNVLLVKKQGSIHSNLKFE 121
+DIGK I KRTE K + QN+QFDF+ D Y+K+ L++K QG I S +
Sbjct: 33 TEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDKKYNKDALILKMQGFISSRTTYY 92

Query: 122 SHKEEKNSNWLKYPSEYHVDFQVKRNRKTEILDQLPKNKISTAKVDSTFSYSSGGKFDST 181
++K+ + +++P +Y++ + ++ +++ LPKNKI + V T Y+ GG F S
Sbjct: 93 NYKKTNHVKAMRWPFQYNIGLKTN-DKYVSLINYLPKNKIESTNVSQTLGYNIGGNFQSA 151

Query: 182 KGIGRTSSNSYSKTISYNQQNYDTIASGKNNNWHVHWSVIANDLKYGGEVKNRNDELLFY 241
+G S +YSK+ISY QQNY + + N+ V W V AN K+ D LF
Sbjct: 152 PSLGGNGSFNYSKSISYTQQNYVSEVE-QQNSKSVLWGVKANSFATESGQKSAFDSDLFV 210

Query: 242 RNTRIATVENPELSFASKYRYPALVRSGFNPEFLTYLSNEK-SNEKTQFEVTYTRNQDIL 300
+ +P F P LV+SGFNP F+ +S+EK S++ ++FE+TY RN D+
Sbjct: 211 GYKPHSK--DPRDYFVPDSELPPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVT 268

Query: 301 KN-RPGIHYAPSILEKNKDG-----QRLIVTYEVDWKNKTVKV 337
+ HY S L+ ++ + V YEV+WK +KV
Sbjct: 269 HAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKYEVNWKTHEIKV 311


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits         Score  E-value  Comments
SAV2008   BACTRLTOXIN  118    1e-34    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 118 bits (298), Expect = 1e-34
Identities = 74/279 (26%), Positives = 117/279 (41%), Gaps = 54/279 (19%)

Query: 1 MKKRLLFVIVITLFIFS---SNHTVLSNGDVGP---------------GNLRNFYTKYEY 42
M KRL VI +F S VL+ P GN++ Y Y
Sbjct: 1 MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLY-DDHY 59

Query: 43 VNLKNVKDKNSPESHRLEYS-----YKN-DTLYAEFDNEYITSDLKGKNVDVFGISYKYG 96
V+ VK + +H L Y+ KN D + E NE + K + VDV+G +Y
Sbjct: 60 VSATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVN 119

Query: 97 S-------------NSRTIYGGVTKAENNKLDSPRIIPINLIINGKHQTVTTKSVSTDKK 143
+YGG+TK E N D+ + + + + + + V TDKK
Sbjct: 120 CYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKK 179

Query: 144 MVTAQEIDVKLRKYLQDEFNIYGHNDTGKGKEYGTSSKFYSGFDKGSVVFHMNDGSNFSY 203
VTAQE+D+K R +L ++ N+Y ++ ++ G + F N+G+ F Y
Sbjct: 180 SVTAQELDIKARNFLINKKNLY-EFNSSP-------------YETGYIKFIENNGNTFWY 225

Query: 204 DLFYT--GYGLPESFLKIYKDNKTVDSTQFHLDVEISKR 240
D+ +L +Y DNKTVDS ++V ++ +
Sbjct: 226 DMMPAPGDKFDQSKYLMMYNDNKTVDSKSVKIEVHLTTK 264


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2009   BACTRLTOXIN    396    e-143    Bacterial toxin signature.
		>BACTRLTOXIN#Bacterial toxin signature.

Length = 266

Score = 396 bits (1019), Expect = e-143
Identities = 266/266 (100%), Positives = 266/266 (100%)

Query: 1 MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYV 60
MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYV
Sbjct: 1 MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYV 60

Query: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC 120
SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC
Sbjct: 61 SATKVKSVDKFLAHDLIYNISDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNC 120

Query: 121 YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS 180
YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS
Sbjct: 121 YFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLVRVYENKRNTISFEVQTDKKS 180

Query: 181 VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY 240
VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY
Sbjct: 181 VTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY 240

Query: 241 LMMYNDNKTVDSKSVKIEVHLTTKNG 266
LMMYNDNKTVDSKSVKIEVHLTTKNG
Sbjct: 241 LMMYNDNKTVDSKSVKIEVHLTTKNG 266


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2011   TOXICSSTOXIN   358    e-129    Staphylococcal toxic shock syndrome toxin signature.
		>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature.

Length = 234

Score = 358 bits (920), Expect = e-129
Identities = 232/234 (99%), Positives = 232/234 (99%)

Query: 1 MNKKLLMNFFIVSPLLLATIATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTF 60
MNKKLLMNFFIVSPLLLAT ATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTF
Sbjct: 1 MNKKLLMNFFIVSPLLLATTATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTF 60

Query: 61 TNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTY 120
TNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTY
Sbjct: 61 TNSEVLDNSLGSMRIKNTDGSISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTY 120

Query: 121 IHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYWPKFDKKQLAISTLDFEIRHQLTQI 180
IHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKY PKFDKKQLAISTLDFEIRHQLTQI
Sbjct: 121 IHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGPKFDKKQLAISTLDFEIRHQLTQI 180

Query: 181 HGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEIN 234
HGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEIN
Sbjct: 181 HGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEIN 234


48  SAV2032  SAV2039  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2032   3       13       -1.439378  hypothetical protein
SAV2033   -2      13       -4.214650  similar to nitroreductase family protein
SAV2034   0       11       -4.226883  hypothetical protein
SAV2035   -1      13       -3.348558  delta-hemolysin
SAV2036   -1      13       -2.286476  accessory gene regulator B
SAV2037   -1      12       -1.469112  AgrD protein
SAV2038   -1      12       -1.320995  accessory gene regulator C
SAV2039   0       14       -0.419671  accessory gene regulator A
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2032   TONBPROTEIN    46     7e-08    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 46.1 bits (109), Expect = 7e-08
Identities = 29/96 (30%), Positives = 36/96 (37%), Gaps = 7/96 (7%)

Query: 103 DSKPDPNNQNPSPNPKPDPDNPKPKPDPDKPKPNLDPKPDPDNPKPKPDPKPDPDKPK-P 161
D +P Q P P+P P+P K P + KP P KPKP PKP + P
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP---KPKPKPKPVKKVQEQP 110

Query: 162 NPDPKP---DPDKPKPNPNPKPDPNKPNPNPSPDPD 194
D KP P P N P + + P
Sbjct: 111 KRDVKPVESRPASPFENTAPARLTSSTATAATSKPV 146



Score = 35.7 bits (82), Expect = 2e-04
Identities = 24/109 (22%), Positives = 30/109 (27%), Gaps = 10/109 (9%)

Query: 98 QNPSTDSKPDPNNQNPSPNPKPDPDNPKPKPDPD-------KPKPNLDPKPDPDNPKPKP 150
+ P P P P P+P P+ PK P KPKP K +
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVK 115

Query: 151 DPKPDPDKPKPNPDPKPDPDKPKPNPNPKP---DPNKPNPNPSPDPDQP 196
+ P P N P KP + P P P
Sbjct: 116 PVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYP 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2036   PF04647        130    2e-40    Accessory gene regulator B
		>PF04647#Accessory gene regulator B

Length = 212

Score = 130 bits (329), Expect = 2e-40
Identities = 38/173 (21%), Positives = 72/173 (41%), Gaps = 7/173 (4%)

Query: 18 RNNLDHIQFLQVRLGMQIIVGNFFKILVTYSISIFLSVFLFTLVTHLSYMLIRYNAHGAH 77
+ ++R G+++ +G F+I++ ++ + + LS + R + GAH
Sbjct: 14 DRSDYPFNQEEIRYGIEVFLGTVFQIIIILLVAFVIGLAKEVAFCLLSAAVYRRFSGGAH 73

Query: 78 AKSSILCYIQSILTFVFVPYFLINIDINFTYLLALS--IIGLISVVIYAPAATKKQPIPI 135
+ C + S+L F + Y ID + LL L I L++++ P + I
Sbjct: 74 CEKYYRCTLTSLLVFNVLAYIAHLIDPAYFQLLILIAFITSLLALLFLVPVDNPRNLISN 133

Query: 136 KLVKRKKYLSIIMYLLVLILSLIIHPF-----YAQFMLLGILVESITLLPIFF 183
++ L M L+VL I A +LLG+L ++ TL +
Sbjct: 134 TEQRKTLKLKTSMVLMVLFGGSIGAYRLYTHQIALAILLGVLWQTFTLTALGH 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2038   FLGBIOSNFLIP   29     0.024    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 29.0 bits (65), Expect = 0.024
Identities = 21/143 (14%), Positives = 52/143 (36%), Gaps = 12/143 (8%)

Query: 20 IVTILVTMIIMYLSNFATVGLFLTL------RKYTTDPAILLPLYILSFS-SVSLLATYL 72
+ V + ++ FA + + + ++ L+ + L+F ++ L+ T
Sbjct: 5 LSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSF 64

Query: 73 VRISLK-KFKKSYLSLNKT--YMIIISFVLFATFAFFYIYSTNTSSNGDSLIPYALVFIG 129
RI + ++ L +++ LF TF F + D+ P++ I
Sbjct: 65 TRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTF--FIMSPVIDKIYVDAYQPFSEEKIS 122

Query: 130 LIIFISVVILIMSLFTLKEMKYK 152
+ + + F L++ +
Sbjct: 123 MQEALEKGAQPLREFMLRQTREA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2039   HTHFIS         34     5e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.7 bits (77), Expect = 5e-04
Identities = 19/135 (14%), Positives = 44/135 (32%), Gaps = 13/135 (9%)

Query: 2 KIFICEDDPKQRENMVTIIKNYIMIEEKPMEIALATDNPYEVLEQAKNMNDIGCYFLDIQ 61
I + +DD R + + + T N + D D+
Sbjct: 5 TILVADDDAAIRTVLNQAL-------SRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVV 56

Query: 62 LSTDINGIKLGSEIRKHDPVGNIIFVTSHSELTYLTFVYKVAAMDFIFK----DDPAELR 117
+ D N L I+K P ++ +++ + + A D++ K + +
Sbjct: 57 MP-DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115

Query: 118 TRIIDCLETAHTRLQ 132
R + + ++L+
Sbjct: 116 GRALAEPKRRPSKLE 130


49  SAV2177  SAV2182  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2177   -3      8        -1.641869  similar to ferrichrome ABC transporter
SAV2178   -3      8        -1.544451  conserved hypothetical protein
SAV2179   -3      8        -1.270349  conserved hypothetical protein
SAV2180   -3      8        -0.489876  transporter
SAV2181   -2      9        -0.206472  hypothetical protein
SAV2182   0       11       1.117277   alkaline shock protein 23
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2177   FERRIBNDNGPP   96     5e-25    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 96.2 bits (239), Expect = 5e-25
Identities = 64/257 (24%), Positives = 107/257 (41%), Gaps = 24/257 (9%)

Query: 53 DAKRIVVLEYSFADALAALDVKPVGIADDGKKKRIIK--PVREKIGDYTSVGTRKQPNLE 110
D RIV LE+ + L AL + P G+AD + + P+ + + D VG R +PNLE
Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVID---VGLRTEPNLE 90

Query: 111 EISKLKPDLIIADSSRHKGINKELNKIAPTLSLKSFDGDYKQNI--NSFKTIAKALNKEK 168
++++KP ++ S+ + + L +IAP DG + S +A LN +
Sbjct: 91 LLTEMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQS 149

Query: 169 EGEKRLAEHDKLINKYKDEIKFDRNQKVLPAVV---AKAGLLAHPNYSYVGQFLNELGFK 225
E LA+++ I K R + L + L+ PN S + L+E G
Sbjct: 150 AAETHLAQYEDFIRSMKPRF-VKRGARPLLLTTLIDPRHMLVFGPN-SLFQEILDEYGIP 207

Query: 226 NALSDDVTKGLSKYLKGPYLQLDTEHLADLNPERMIIMTDHAKKDSAEFKKLQEDATWKK 285
NA +G + + + + LA ++ DH +S + L W+
Sbjct: 208 NAW-----QGETNFWG--STAVSIDRLAAYKDVDVLCF-DHD--NSKDMDALMATPLWQA 257

Query: 286 LNAVKNNRVDIVDRDVW 302
+ V+ R V VW
Sbjct: 258 MPFVRAGRFQRVP-AVW 273


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2178   ALARACEMASE    39     1e-05    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 39.4 bits (92), Expect = 1e-05
Identities = 59/325 (18%), Positives = 119/325 (36%), Gaps = 33/325 (10%)

Query: 4 VNINISKIKYNAKVLQTVFQSKNIQFTPVIKCIAGDRTIVESLKALG-INHVAESRLDNI 62
++++ +K N +++ + + + V+K A I A+G + A L+
Sbjct: 7 ASLDLQALKQNLSIVRQA--ATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEA 64

Query: 63 ISIADQDLTYTLLRTPAKKEISDMIEKVDMSIQTELSTIHQINEVAEV-LGKKHKILLMV 121
I++ ++ +L D+ + T + + Q+ + L I L V
Sbjct: 65 ITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKV 124

Query: 122 DWKDGREGVLTYDVLDYIKEIIHLKNIHFVGLAFNFMCFKSDAPSDDDIFMINRFVSAVE 181
+ R G VL +++ + N+ + L +F ++ P D + R A E
Sbjct: 125 NSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAE--AEHP-DGISGAMARIEQAAE 181

Query: 182 REIGYRLKIISGGNSSMLPQLLYNDLGKINELRIGETLFRGVDTTTNQAIAML-YQDAIT 240
+ R + + + P+ ++ +R G L+ + + IA + +T
Sbjct: 182 -GLECRRSLSNSAATLWHPEAHFD------WVRPGIILYGASPSGQWRDIANTGLRPVMT 234

Query: 241 LEAEILEIK-----PRVN-----TQTHESFLQAIVDIGYLD---TKVDNISPM---DQHI 284
L +EI+ ++ RV T E + IV GY D +P+
Sbjct: 235 LSSEIIGVQTLKAGERVGYGGRYTARDEQRI-GIVAAGYADGYPRHAPTGTPVLVDGVRT 293

Query: 285 NILGA-SSDHLMLDLNGQGHYQVGD 308
+G S D L +DL +G
Sbjct: 294 MTVGTVSMDMLAVDLTPCPQAGIGT 318


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2179   PF04183        258    1e-80    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 258 bits (661), Expect = 1e-80
Identities = 92/456 (20%), Positives = 176/456 (38%), Gaps = 56/456 (12%)

Query: 166 EGHPTHPLTKTKLPLTMEEVRAYAPEFEKEIPLQIMMIEKDHVVCTAMDGND--QFIIDE 223
GHP K + E + YAPE+ L + ++++H++ + D Q +
Sbjct: 134 SGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLTAA 193

Query: 224 IIPEYYNQIRVFLKSLGLKSEDYRAILVHPWQYDHTIGKYFEAWIAKKILIPT-PFTILS 282
+ P+ + + + GL ++ + VHPWQ+ I F A A+ ++ F
Sbjct: 194 MDPQEFARFSQVWQENGL-DHNWLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQW 252

Query: 283 KATLSFRTMSLIDKP--YHVKLPVDAQATSAVRTVSTVTTVDGPKLSYALQN-------- 332
A S RT++ + +KLP+ TS R + GP S LQ
Sbjct: 253 LAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATL 312

Query: 333 ------MLNQYPGFKVAMEPFGEYANVDKDRARQLACIIRQKPE--IDGKGATVVSASLV 384
+L + V+ E + A L I R+ P + + V+ A+L+
Sbjct: 313 VQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLM 372

Query: 385 NKNPIDQKVIVDSYLEWLNQGITKESITTFIERYAQALIPPLIAFIQNYGIALEAHMQNT 444
+ +Q + +Y++ G+ E+ ++ + + ++ PL + YG+AL AH QN
Sbjct: 373 ECDENNQPLA-GAYID--RSGLDAET---WLTQLFRVVVVPLYHLLCRYGVALIAHGQNI 426

Query: 445 VVNLGPHFDIQFLVRDLGGS-RI------DLETLQHRVSDI--KITNDSLIADSIDAVIA 495
+ + + L++D G R+ ++++L V D+ +++ D LI D
Sbjct: 427 TLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDLQTGHFV 486

Query: 496 KFQHAVIQNQMAELIHHFNQYDCVEETELFNIVQQVVA--HAINPTLPHANELKDILFGP 553
I V E + ++ V++ +P + L LF P
Sbjct: 487 TV---------LRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFS-LFRP 536

Query: 554 TITVKALLNMRM-----ENKVKQYLNI--ELDNPIK 582
I L +++ + + N +L NP+
Sbjct: 537 QIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLW 572


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2180   TCRTETA        42     3e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 3e-06
Identities = 53/340 (15%), Positives = 106/340 (31%), Gaps = 26/340 (7%)

Query: 6 FSSSFLLFLGNWIGQIGLNWFVLTTYHN--------AVYLGIVNFCRLVPILLLSVWAGA 57
S+ L +G IGL VL + GI+ + + GA
Sbjct: 11 LSTVALDAVG-----IGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65

Query: 58 IADKYDKGRLLRITISSSFLVTAILCVLTYSFTAIPISVIIIYAT-LRGILSAVETPLRQ 116
++D++ + R + S A+ Y+ A + ++Y + ++ +
Sbjct: 66 LSDRFGR----RPVLLVSLAGAAV----DYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117

Query: 117 AILPDLSDKISTTQAVSFHSFIINICRSIGPAIAGVILAVYHAPTTFLAQA--ICYFIAA 174
A + D++D + F S GP + G++ F A A F+
Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177

Query: 175 LLCLPLHFKVTKIPEDATRYMPLKVIIDYFKLHMEGRQIFITSLLIMATGFSYTTLLPVL 234
LP K + P PL + + + ++ G L +
Sbjct: 178 CFLLPESHKGERRPLRREALNPLASFRWARGMTVVA-ALMAVFFIMQLVGQVPAALWVIF 236

Query: 235 TNKVFPGKSEIFGIAMTMCAIGGIIATLVL-PKVLKYIGMVNMYYLSSLLFGIALLGVVF 293
F + GI++ I +A ++ V +G L + G + + F
Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296

Query: 294 HNIVIMFICITLIGLFSQWARTTNRVYFQNNVKDYERGKV 333
M I ++ + V + +G++
Sbjct: 297 ATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQL 336



Score = 30.6 bits (69), Expect = 0.012
Identities = 37/180 (20%), Positives = 71/180 (39%), Gaps = 21/180 (11%)

Query: 10 FLLFLGNWIGQIGLNWFVLTTYH----NAVYLGI-VNFCRLVPILLLSVWAGAIADKYDK 64
+ F+ +GQ+ +V+ +A +GI + ++ L ++ G +A + +
Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276

Query: 65 GRLLRITISSSFLVTAILCVLTYSFTAIPISVIIIYATLRGILSAVETPLRQAILPDLSD 124
R L + + + +L T + A PI V++ + P QA+L D
Sbjct: 277 RRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA-------SGGIGMPALQAMLSRQVD 329

Query: 125 KISTTQAVSFHSFIINICRSIGPAIAGVILAVYHAPT----TFLAQAICYFIAALLCLPL 180
+ Q + + ++ +GP + I A T ++A A Y LLCLP
Sbjct: 330 EERQGQLQGSLAALTSLTSIVGPLLFTAIYA-ASITTWNGWAWIAGAALY----LLCLPA 384


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2181   PF04183        270    3e-84    IucA / IucC family
		>PF04183#IucA / IucC family

Length = 580

Score = 270 bits (691), Expect = 3e-84
Identities = 93/475 (19%), Positives = 181/475 (38%), Gaps = 45/475 (9%)

Query: 197 SEQAVIEGHPLHPGAKLRKGLNALQTFLYSSEFNQPIKLKIVLIHSKLSRTMSLSKDYDT 256
Q ++ GHP K R+G Y+ E+ +L + + + M D +
Sbjct: 128 RLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKRE---HMIWRCDNEM 184

Query: 257 TVHQLF-----PDLIKQLENEFTPNFNFNDYHIMIVHPWQLDDVLHSDYQAEVDKELIIE 311
+HQL P + + N +++ + VHPWQ + +D+ A+ + ++
Sbjct: 185 DIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEGRMVS 244

Query: 312 AKHTLD-YYAGLSFRTLVPKYPAMSPHIKLSTNVHITGEIRTLSEQTTHNGPLMTRILND 370
D + A S RTL IKL ++ T R + + GPL +R L
Sbjct: 245 LGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQ 304

Query: 371 ILEKDVIFKSYASTIIDEVAGIHFYNEQDEVDYQTER--SEQLGTLFRKNIYQMIPQEVT 428
+ D + I+ E A + +E + E LG ++R+N + + + +
Sbjct: 305 VFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDES 364

Query: 429 PMIPSSLVATYPFNNESPIVTLIKRYQSAASLSDFESSAKSWIETYSKALLGLVIPLVTK 488
P++ ++L+ N P+ A + A++W+ + ++ + L+ +
Sbjct: 365 PVLMATLMECDE--NNQPLA--------GAYIDRSGLDAETWLTQLFRVVVVPLYHLLCR 414

Query: 489 YGIALEAHLQNAIATFRKDGLLDTMYIRDFEG-LRIDKAQLNEMGYSTSHFHEKSRILTD 547
YG+AL AH QN I K+G+ + ++DF+G +R+ K + EM S E + +
Sbjct: 415 YGVALIAHGQN-ITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMD---SLPQEVRDVTSR 470

Query: 548 SKTSVFNKAFYSTVQNHLGELILTISKASNDSNLERHMWYIVRDVLDNIFDQLVLSTHKS 607
+ + I + ER + ++ VL + + H
Sbjct: 471 LSADYLIHDLQTGHFVTVLRFISPLMVRLGVP--ERRFYQLLAAVLSDYMKK-----HPQ 523

Query: 608 NQVNENRINEIKDTMFAPFIDYKCVTTMRLE----DEAHHY--TYIK-VNNPLYR 655
+ +F P I + ++L D Y++ + NPL+
Sbjct: 524 MSERFALFS-----LFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWL 573


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2182   TCRTETOQM      29     0.012    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 28.7 bits (64), Expect = 0.012
Identities = 14/43 (32%), Positives = 21/43 (48%), Gaps = 5/43 (11%)

Query: 99 VDLKVILEYGE-----SAPKIFRKVTELVKEQVKYITGLDVVE 136
D K+ +YG S P FR + +V EQV G +++E
Sbjct: 495 TDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLE 537


50  SAV2352  SAV2361  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2352   -1      9        -1.486098  similar to multidrug resistance protein
SAV2353   -2      11       -1.945729  similar to multidrug resistance protein A
SAV2354   -2      13       -1.653501  similar to transcriptional regulator
SAV2355   -2      13       -1.669568  TcaB protein
SAV2356   -2      12       -1.604821  TcaA protein
SAV2357   0       10       -1.703411  TcaR transcription regulator
SAV2358   0       11       -1.726087  hypothetical protein
SAV2359   -1      11       -1.632313  similar to ABC transporter (ATP-binding
SAV2360   -2      11       -0.926544  putative ABC transporter, permease protein
SAV2361   0       13       -0.635225  similar to two component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2352   TCRTETB        158    2e-44    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 158 bits (402), Expect = 2e-44
Identities = 92/415 (22%), Positives = 187/415 (45%), Gaps = 16/415 (3%)

Query: 140 KILAALLFGMFIAILNQTLLNVALPKINTEFNISASTGQWLMTGFMLVNGILIPITAYLF 199
+IL L F ++LN+ +LNV+LP I +FN ++ W+ T FML I + L
Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 200 NKYSYRKLFLVALVLFTIGSLICAISMN-FPIMMVGRVLQAIGAGVLMPLGSIVIITIYP 258
++ ++L L +++ GS+I + + F ++++ R +Q GA L +V+ P
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133

Query: 259 PEKRGAAMGTMGIAMILAPAIGPTLSGYIVQNYHWNVMFYGMFIIGIIAILVGFVWFKLY 318
E RG A G +G + + +GP + G I HW+ + +I II + K
Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIP-MITIITVPFLMKLLKKE 192

Query: 319 QYTTNPKADIPGIIFSTIGFGALLYGFSEAGNKGWGSVEIETMFAIGIIFIILFVIRELR 378
DI GII ++G + + + ++ ++FV +
Sbjct: 193 VRIKGH-FDIKGIILMSVGIVFFMLF---TTSYSIS------FLIVSVLSFLIFVKHIRK 242

Query: 379 MKSPMLNLEVLKFPTFTLTTIINMVVMLSLYGGMILLPIYLQNLRGFSALDSG-LLLLPG 437
+ P ++ + K F + + ++ ++ G + ++P ++++ S + G +++ PG
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 438 SLIMGLLGPFAGKLLDTIGLKPLAIFGIAVMTYATWELTKLNMDTP-YMTIMGIYVLRSF 496
++ + + G G L+D G + G+ ++ + + L T +MTI+ ++VL
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL--G 360

Query: 497 GMAFIMMPMVTAAINALPGRLASHGNAFLNTMRQLAGSIGTAILVTVMTTQTTQH 551
G++F + T ++L + A G + LN L+ G AI+ +++
Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2353   RTXTOXIND      59     1e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 59.1 bits (143), Expect = 1e-12
Identities = 26/133 (19%), Positives = 45/133 (33%), Gaps = 13/133 (9%)

Query: 87 MDLKMPQKGTIAKLD-GMEGSMVQAGNPIAYAYNLDD-LYVTANIDEKDIKDVEVGKDVD 144
++ P + +L EG +V + DD L VTA + KDI + VG++
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAI 387

Query: 145 VTIDGQKAS----IKGKVDSIGKATAASFSLMPSSNSDGNYTKVSQVIPVKITLESEPSK 200
+ ++ + + GKV +I G V I +
Sbjct: 388 IKVEAFPYTRYGYLVGKVKNINLDAI-------EDQRLGLVFNVIISIEENCLSTGNKNI 440

Query: 201 QVVPGMNAEVKIH 213
+ GM +I
Sbjct: 441 PLSSGMAVTAEIK 453



Score = 32.5 bits (74), Expect = 0.001
Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 2/77 (2%)

Query: 9 VITVVVLLAIGIAGFYFWNKTTSYVTTDNAKV--NGDQIKIASPASGQIKSLNVKQGDKL 66
++ ++ + IA V T N K+ +G +I + +K + VK+G+ +
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118

Query: 67 DKGDKVATVTVQGQDGE 83
KGD + +T G + +
Sbjct: 119 RKGDVLLKLTALGAEAD 135


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2354   HTHTETR        45     4e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.6 bits (105), Expect = 4e-08
Identities = 13/69 (18%), Positives = 24/69 (34%)

Query: 2 KRQAKIEIQNALVDLMAEYPFQEISTKMICAYCNINRSTFYDYYKDKFDLLDTINSKHKE 61
++ + I + + L ++ S I + R Y ++KDK DL I +
Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68

Query: 62 KFQFLLSAL 70
L
Sbjct: 69 NIGELELEY 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2355   TCRTETA        65     1e-13    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 65.2 bits (159), Expect = 1e-13
Identities = 69/386 (17%), Positives = 141/386 (36%), Gaps = 16/386 (4%)

Query: 15 IIILGSLTAIGALSIDMFLPGLPDIRHDF---QTTTSNAQLTLSMFMIGLAFGNLFAGPI 71
+I++ S A+ A+ I + +P LP + D T++ + L+++ + G +
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 72 SDSTGRRKPLIIAMIIFTLASLGIVFVHNIWLMVALRFLQGVTGGAAAVISRAIASDMYS 131
SD GRR L++++ + + +W++ R + G+TG AV IA D+
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITD 125

Query: 132 GNELTKFMALLMLVNGIAPVVAPTIGGIILNYSVWRMVFVILTIFGFVMVIGSLLKVPES 191
G+E + + G V P +GG++ +S F + + +PES
Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPES 184

Query: 192 LTVTNRESSSGLKTMFKNFKILLKTPRFVLPMLIQGMTFVILFTYISASPFII--QKIYG 249
R +F+ + V ++ + L + A+ ++I + +
Sbjct: 185 HKGERRPLRREALNPLASFR-WARGMTVVAALMAVFFI-MQLVGQVPAALWVIFGEDRFH 242

Query: 250 MTAIQFSWMFAGIGITLIISSQLTGYLVDFIDSQKLMRGMTMIQIIGVILVTIVLLNHWN 309
A A GI ++ + + ++ R M+ +I I+L
Sbjct: 243 WDATTIGISLAAFGILHSLAQ---AMITGPVAARLGERRALMLGMIADGTGYILLAFATR 299

Query: 310 FWILAIGFIILIAPVTGVATLGFTIAMDESSSGRGSSSSLLGLVQFLFGGVASPLVGVKG 369
W+ ++L + G+ L ++ +G L + L + PL+
Sbjct: 300 GWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSL-TSIVGPLLFTAI 358

Query: 370 EDNPIPY---IIIIIATAVILIILQI 392
I I A+ L+ L
Sbjct: 359 YAASITTWNGWAWIAGAALYLLCLPA 384


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2359   PF05272        29     0.014    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.014
Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 35 VILNGASGSGKTTLLTILGGL 55
V+L G G GK+TL+ L GL
Sbjct: 599 VVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2361   HTHFIS         79     2e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-19
Identities = 31/117 (26%), Positives = 51/117 (43%), Gaps = 1/117 (0%)

Query: 5 LVVDDDPRILNYIASHLQTEHIDAYTQPSGEAALKLLEKQRVDIAVVDIMMDGMDGFQLC 64
LV DDD I + L D + + + D+ V D++M + F L
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66

Query: 65 NTLKN-DYDIPVIMLTARDALSDKERAFISGTDDYVTKPFEVKELIFRIRAVLRRYN 120
+K D+PV++++A++ +A G DY+ KPF++ ELI I L
Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


51  SAV2369  SAV2375  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2369   -2      10       0.320929   similar to transcription repressor of
SAV2370   -3      11       0.160350   probable dehydrogenase
SAV2371   -1      10       -1.224284  conserved hypothetical protein
SAV2372   -1      9        -0.835361  similar to thioredoxin reductase
SAV2373   -1      8        -1.189177  hypothetical protein
SAV2374   -1      9        -1.400606  hypothetical protein
SAV2375   0       11       -1.458212  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2369   SACTRNSFRASE   51     9e-11    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 51.5 bits (123), Expect = 9e-11
Identities = 24/112 (21%), Positives = 45/112 (40%), Gaps = 7/112 (6%)

Query: 40 FFKDNYTVEKFTQEINHVDSFHYFYQEDGANVGYIKMNINSAQTEEMGETYLEVQRIYFL 99
+FK + + + Y + +G IK+ N Y ++ I
Sbjct: 46 YFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNG-------YALIEDIAVA 98

Query: 100 KDFQGGGRGSQLIELAEKIAQEHNKHKIWLGVWEHNPRAQAFYKRHGFKVVG 151
KD++ G G+ L+ A + A+E++ + L + N A FY +H F +
Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2370   PF03627        29     0.024    PapG
		>PF03627#PapG

Length = 336

Score = 29.1 bits (65), Expect = 0.024
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 5/48 (10%)

Query: 21 TFKQLSPTDLPKGDVLIKVHY-SGINYKDALATQDH----NAVVKSYP 63
FK P DLP GD + + Y SG+ A V K+ P
Sbjct: 155 IFKVALPADLPLGDYSVTIPYTSGMQRHFASYLGARFKIPYNVAKTLP 202


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2371   SACTRNSFRASE   33     2e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 2e-04
Identities = 15/59 (25%), Positives = 30/59 (50%), Gaps = 9/59 (15%)

Query: 66 IVDIAVSKSYQGQDYGSLIMEHIMKYIKN-----VSVESAYVSLIADYPADKLYAKFGF 119
I DIAV+K Y+ + G+ ++ +++ K + +E+ +++ A YAK F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINI----SACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2375   HTHTETR        53     5e-11    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 53.5 bits (128), Expect = 5e-11
Identities = 34/203 (16%), Positives = 77/203 (37%), Gaps = 21/203 (10%)

Query: 5 RRIRKTKSSIKQAFTKLLQEKDLEKITIRDITTRADINRGTFYLHYEDKYMLLADMEDEY 64
+ ++T+ I +L ++ + ++ +I A + RG Y H++DK L +++ +
Sbjct: 7 QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELS 66

Query: 65 ISELTTY----------TQFDLLRGSSIEDIANTFVNNILKNIFQHIHDNLEFY---HTI 111
S + +LR I + +T + + + I EF +
Sbjct: 67 ESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVV 126

Query: 112 LQLERTSQLEL--KINEHIKNNMQR-YISINHSIGGVPEMYFYSYVSGATISIIKYWVMD 168
Q +R LE +I + +K+ ++ + + + Y+SG + W+
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAII-MRGYISGLMEN----WLFA 181

Query: 169 KQPISVDELAKHVHNIIFNGPLR 191
Q + + A+ I+ L
Sbjct: 182 PQSFDLKKEARDYVAILLEMYLL 204


52  SAV2415  SAV2424  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2415   -3      14       -0.370766  similar to multidrug resistance protein homolog
SAV2416   -1      18       -0.820611  phosphoglycerate mutase
SAV2417   0       17       -1.172765  similar to cation efflux family protein
SAV2418   0       17       -1.260000  IgG-binding protein
SAV2419   -1      15       -1.662918  gamma-hemolysin chain II precursor
SAV2420   -2      15       -1.488109  gamma-hemolysin component C
SAV2421   -3      15       -1.307984  gamma-hemolysin component B
SAV2422   -2      18       -1.272449  hypothetical protein
SAV2423   -3      16       -1.741430  6-carboxyhexanoate-CoA ligase
SAV2424   -2      14       -1.949146  similar to 8-amino-7-oxononanoate synthase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2415   TCRTETB        129    3e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 129 bits (327), Expect = 3e-35
Identities = 91/398 (22%), Positives = 177/398 (44%), Gaps = 14/398 (3%)

Query: 18 FFGLLNETLLVTALPSIMKDFEISYTQVQWLTTAFLLTNGIVIPLSALVIQRYTTRQVFL 77
FF +LNE +L +LP I DF W+ TAF+LT I + + + +++ L
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 78 VGISIFFLGTLLGGLS-PHFATLLVARIIQALGAGIMMPLMMTTILDVFQPHERGKYMGI 136
GI I G+++G + F+ L++AR IQ GA L+M + RGK G+
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 137 FGLVIGLAPAIGPTLSGYLVEYLNWRSLFHVVAPIAAVTFLIGFKTIKNVGTTIKVPIDF 196
G ++ + +GP + G + Y++W L + P+ + + + IK D
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI--PMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 197 ISVIFSVLGFGGLLYGTSSISEKGFDNPIVLVSMIGGVVLVALFVLRQYRLSTPLLNFAV 256
+I +G + T+S S F +I V+ +FV +++ P ++ +
Sbjct: 202 KGIILMSVGIVFFMLFTTSYS-ISF--------LIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 257 FKNKQFTVGIIIMGVTMVSMIGSETILPIFVQNLLHRSALDSG-LTLLPGAIVMAFMSMT 315
KN F +G++ G+ ++ G +++P ++++ S + G + + PG + +
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 316 SGALYEKFGPRKLALVGMAIVVITTAYFVVMDEQTSTIMLATVYAIRMVGIALGLIPVMT 375
G L ++ GP + +G+ + ++ + E TS M + + G++ + T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 376 HTMNQLKPEMNAHGSSMTNTVQQIAGSIGTAALITILS 413
+ LK + G S+ N ++ G A + +LS
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2419   BICOMPNTOXIN   428    e-154    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 428 bits (1103), Expect = e-154
Identities = 213/312 (68%), Positives = 247/312 (79%), Gaps = 8/312 (2%)

Query: 1 MIKNKILTATLAVGLIAPLANPFIEISKAENKIEDIGQGA--EIIKRTQDITSKRLAITQ 58
M+KNKILT TL+V L+APLANP +E +KA N EDIG+G+ EIIKRT+D TS + +TQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 59 NIQFDFVKDKKYNKDALVVKMQGFISSRTTYSDLKKYPYIKRMIWPFQYNISLKTKDSNV 118
NIQFDFVKDKKYNKDAL++KMQGFISSRTTY + KK ++K M WPFQYNI LKT D V
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV 120

Query: 119 DLINYLPKNKIDSADVSQKLGYNIGGNFQSAPSIGGSGSFNYSKTISYNQKNYVTEVESQ 178
LINYLPKNKI+S +VSQ LGYNIGGNFQSAPS+GG+GSFNYSK+ISY Q+NYV+EVE Q
Sbjct: 121 SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ 180

Query: 179 NSKGVKWGVKANSFVTPNGQVSAYDQYLF-AQDPTGPAARDYFVPDNQLPPLIQSGFNPS 237
NSK V WGVKANSF T +GQ SA+D LF P RDYFVPD++LPPL+QSGFNPS
Sbjct: 181 NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS 240

Query: 238 FITTLSHERGKGDKSEFEITYGRNMDATYA-----YVTRHRLAVDRKHDAFKNRNVTVKY 292
FI T+SHE+G D SEFEITYGRNMD T+A + L R H+AF NRN TVKY
Sbjct: 241 FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY 300

Query: 293 EVNWKTHEVKIK 304
EVNWKTHE+K+K
Sbjct: 301 EVNWKTHEIKVK 312


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2420   BICOMPNTOXIN   469    e-170    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 469 bits (1207), Expect = e-170
Identities = 314/315 (99%), Positives = 314/315 (99%)

Query: 1 MLKNKILATTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60
MLKNKIL TTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV 120
NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYV 120

Query: 121 SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ 180
SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ
Sbjct: 121 SLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSFNYSKSISYTQQNYVSEVEQQ 180

Query: 181 NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS 240
NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS
Sbjct: 181 NSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS 240

Query: 241 FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY 300
FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY
Sbjct: 241 FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKY 300

Query: 301 EVNWKTHEIKVKGQN 315
EVNWKTHEIKVKGQN
Sbjct: 301 EVNWKTHEIKVKGQN 315


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits           Score  E-value  Comments
SAV2421   BICOMPNTOXIN   382    e-136    Staphylococcal bi-component toxin signature.
		>BICOMPNTOXIN#Staphylococcal bi-component toxin signature.

Length = 315

Score = 382 bits (983), Expect = e-136
Identities = 87/322 (27%), Positives = 160/322 (49%), Gaps = 18/322 (5%)

Query: 1 MKMNKLVKSSVATSMALLLLSGTANAEGKITPVSVKKVDDKVTLYKTTATADSDKFKISQ 60
M NK++ ++++ S+ L + + + K T S+K+ ++Q
Sbjct: 1 MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQ 60

Query: 61 ILTFNFIKDKSYDKDTLVLKATGNINSGFVKPNPNDYDFSK-LYWGAKYNVSISSQSNDS 119
+ F+F+KDK Y+KD L+LK G I+S N + K + W +YN+ + + ++
Sbjct: 61 NIQFDFVKDKKYNKDALILKMQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKT-NDKY 119

Query: 120 VNVVDYAPKNQNEEFQVQNTLGYTFGGDISISNGLSGGLNGNTAFSETINYKQESYRTTL 179
V++++Y PKN+ E V TLGY GG+ + L G NG+ +S++I+Y Q++Y + +
Sbjct: 120 VSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGG--NGSFNYSKSISYTQQNYVSEV 177

Query: 180 SRNTNYKNVGWGVEAHKIMNNGWGPYGRDSFHPTYGNELFLAGRQSSAYAGQNFIAQHQM 239
+ N K+V WGV+A+ + ++LF+ + S F+ ++
Sbjct: 178 EQQ-NSKSVLWGVKANSFATESGQ-------KSAFDSDLFVGYKPHSKDPRDYFVPDSEL 229

Query: 240 PLLSRSNFNPEFLSVLSHRQDGAKKSKITVTYQREMDL-----YQICWNGFYWAGANYKN 294
P L +S FNP F++ +SH + + S+ +TY R MD+ + Y G N
Sbjct: 230 PPLVQSGFNPSFIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHN 289

Query: 295 -FKTRTFKSTYEIDWENHKVKL 315
F R + YE++W+ H++K+
Sbjct: 290 AFVNRNYTVKYEVNWKTHEIKV 311


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2424   CLENTEROTOXN  28     0.047    Clostridium enterotoxin signature.
		>CLENTEROTOXN#Clostridium enterotoxin signature.

Length = 319

Score = 28.5 bits (63), Expect = 0.047
Identities = 8/47 (17%), Positives = 15/47 (31%), Gaps = 3/47 (6%)

Query: 233 GGVILSSND---VKDMLINHGRPLIYSSSLPIYNLYFIKRNIEKLIN 276
IL+ N+ L I + + FI+ ++E
Sbjct: 59 SSQILNPNETGTFSQSLTKSKEVSINVNFSVGFTSEFIQASVEYGFG 105
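
The statistics lines that precede each alignment (`Score = ..., Expect = ...` and `Identities = ...`) can be read programmatically and compared with the RPSBlast E-value cut-off of 0.05 listed under the input parameters. The following is a minimal sketch, not part of PredictBias itself: the function names `parse_expect` and `summarize` are hypothetical, and the use of `<=` for the cut-off comparison is an assumption. Note that BLAST abbreviates very small E-values as `e-170`, which has to be read as `1e-170` before conversion to a number.

```python
import re

# Patterns for the two statistics lines printed before each alignment, e.g.
#   Score = 469 bits (1207), Expect = e-170
#   Identities = 314/315 (99%), Positives = 314/315 (99%)
SCORE_RE = re.compile(r"Score = ([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(r"Identities = (\d+)/(\d+)")


def parse_expect(text):
    """Convert a BLAST Expect string to a float ('e-170' means 1e-170)."""
    return float("1" + text if text.startswith("e") else text)


def summarize(score_line, identities_line, threshold=0.05):
    """Extract bit score, raw score, E-value and percent identity.

    `threshold` defaults to the 0.05 cut-off shown in the input parameters;
    treating the cut-off as inclusive (<=) is an assumption.
    """
    bits, raw, expect = SCORE_RE.search(score_line).groups()
    matched, aligned = map(int, IDENT_RE.search(identities_line).groups())
    e_value = parse_expect(expect)
    return {
        "bits": float(bits),
        "raw_score": int(raw),
        "e_value": e_value,
        "identity_pct": 100.0 * matched / aligned,
        "passes": e_value <= threshold,
    }


if __name__ == "__main__":
    # Statistics lines copied from the SAV2424 hit above.
    hit = summarize(
        "Score = 28.5 bits (63), Expect = 0.047",
        "Identities = 8/47 (17%), Positives = 15/47 (31%), Gaps = 3/47 (6%)",
    )
    print(hit)
```

A hit such as SAV2424 (Expect = 0.047) sits just under the cut-off with low identity, so a filter like this is best read alongside the alignment rather than on its own.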


53  SAV2496  SAV2503  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2496   4       13       -1.678355  hypothetical protein
SAV2497   3       10       -0.480364  similar to accumulation-associated protein
SAV2498   3       11       0.985850   staphylococcal accessory regulator A homolog
SAV2499   3       10       1.226138   staphylococcal accessory regulator A homolog
SAV2500   1       9        2.190022   UTP-glucose-1-phosphate uridyltransferase
SAV2501   0       8        2.422853   transposase
SAV2502   1       10       3.076256   fibronectin-binding protein homolog
SAV2503   -1      10       2.989681   fibronectin-binding protein homolog
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2496   V8PROTEASE    34     2e-04    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 34.2 bits (78), Expect = 2e-04
Identities = 14/30 (46%), Positives = 18/30 (60%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
P +P P NP+ P+ P P+ P NPNNP
Sbjct: 290 NNPDNPDNPNNPDNPNNPDEPNNPDNPNNP 319



Score = 32.7 bits (74), Expect = 7e-04
Identities = 13/30 (43%), Positives = 18/30 (60%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
+ P +P P+NP P P +P P NP+NP
Sbjct: 293 DNPDNPNNPDNPNNPDEPNNPDNPNNPDNP 322



Score = 32.3 bits (73), Expect = 0.001
Identities = 13/30 (43%), Positives = 19/30 (63%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNNP 122
++P +P P+NP P P +P P NP+NP
Sbjct: 287 DQPNNPDNPDNPNNPDNPNNPDEPNNPDNP 316



Score = 30.7 bits (69), Expect = 0.003
Identities = 12/29 (41%), Positives = 20/29 (68%)

Query: 93 EKPKDPKGPENPEKPSRPTHPSGPVNPNN 121
+ P +P P NP++P+ P +P+ P NP+N
Sbjct: 296 DNPNNPDNPNNPDEPNNPDNPNNPDNPDN 324



Score = 29.6 bits (66), Expect = 0.007
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 1/46 (2%)

Query: 98 PKGPENPEKPSRPTHPSGPVNPNNPGLSKDRAKP-NGPVHSMDKND 142
P P+NP+ P+ P +P+ P PNNP + P NG ++ D D
Sbjct: 289 PNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPDNPDNGDNNNSDNPD 334



Score = 27.7 bits (61), Expect = 0.027
Identities = 13/33 (39%), Positives = 17/33 (51%)

Query: 102 ENPEKPSRPTHPSGPVNPNNPGLSKDRAKPNGP 134
+ P P P +P+ P NPNNP + PN P
Sbjct: 287 DQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNP 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2497   GPOSANCHOR    30     0.014    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.4 bits (68), Expect = 0.014
Identities = 17/108 (15%), Positives = 33/108 (30%), Gaps = 1/108 (0%)

Query: 14 FLSNKLNKYSIRKFTVGTASILIG-SLMYLGTQQEAEAAENNIENPTTLKDNVQSKEVKI 72
+N YS+RK GTAS+ + +++ G T +
Sbjct: 2 TKNNTNRHYSLRKLKTGTASVAVALTVLGAGLVVNTNEVSAVATRSQTDTLEKVQERADK 61

Query: 73 EEVTNKDTAPQGVEAKSEVTSNKDTIEHEASVKAEDISKKEDTPKEVA 120
E+ N + + + KD + + K K ++
Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLS 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2502   TONBPROTEIN   51     6e-09    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 50.8 bits (121), Expect = 6e-09
Identities = 23/67 (34%), Positives = 27/67 (40%)

Query: 828 EVPSEPETPTPPTPEVPSEPGEPTPPKPEVPSEPETPVPPTPEVPSEPGKPVPPAKEEPK 887
EP P PE EP P PE P E + P KPV +E+PK
Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPK 111

Query: 888 KPSKPVE 894
+ KPVE
Sbjct: 112 RDVKPVE 118



Score = 50.8 bits (121), Expect = 6e-09
Identities = 20/81 (24%), Positives = 23/81 (28%), Gaps = 4/81 (4%)

Query: 825 PTPEVPSE----PETPTPPTPEVPSEPGEPTPPKPEVPSEPETPVPPTPEVPSEPGKPVP 880
P P P P V P P+PE PE P + KP P
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKP 98

Query: 881 PAKEEPKKPSKPVEQGKVVTP 901
K K +P K V
Sbjct: 99 KPKPVKKVQEQPKRDVKPVES 119



Score = 50.0 bits (119), Expect = 1e-08
Identities = 17/84 (20%), Positives = 28/84 (33%), Gaps = 3/84 (3%)

Query: 822 PTPPTPEVPSEPETPTPPTPEVPSEPGEPTP---PKPEVPSEPETPVPPTPEVPSEPGKP 878
P V PE P PE P P + +P+ P +V +P +
Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113

Query: 879 VPPAKEEPKKPSKPVEQGKVVTPV 902
V P + P P + ++ +
Sbjct: 114 VKPVESRPASPFENTAPARLTSST 137



Score = 45.0 bits (106), Expect = 5e-07
Identities = 22/105 (20%), Positives = 33/105 (31%), Gaps = 4/105 (3%)

Query: 810 EGQQTIEEDTTPPTPPTPEVPSEPETPTPPTPEVPSEPGEPTPPKPEVPSEPETPVPPTP 869
E Q ++ P P PE PE PP PKP+ + P
Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPE---PPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112

Query: 870 EV-PSEPGKPVPPAKEEPKKPSKPVEQGKVVTPVIEINEKVKAVA 913
+V P E P P + + PV + +A++
Sbjct: 113 DVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALS 157



Score = 42.3 bits (99), Expect = 5e-06
Identities = 27/88 (30%), Positives = 31/88 (35%), Gaps = 12/88 (13%)

Query: 834 ETPTPPTP-------EVPSEPGEPTPPKPEVPSEPETPVPPTPEVPSEPGKPVPPAKEEP 886
E P P P EP + P PE EPE P PE P K P E+P
Sbjct: 37 ELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPP----KEAPVVIEKP 92

Query: 887 KKPSKPVEQGKVVTPVIEINEKVKAVAP 914
K KP + V + VK V
Sbjct: 93 KPKPKPKPK-PVKKVQEQPKRDVKPVES 119



Score = 32.3 bits (73), Expect = 0.008
Identities = 17/81 (20%), Positives = 25/81 (30%), Gaps = 5/81 (6%)

Query: 853 PKPEVPSE----PETPVPPTPEVPSEPGKPVPPAKEEPKKPSKPVEQGKVVTPVIEINEK 908
P P P + P V P V P E P P ++ VV + K
Sbjct: 39 PAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPP-KEAPVVIEKPKPKPK 97

Query: 909 VKAVAPTKQKQSKKSELPETG 929
K K ++ K ++
Sbjct: 98 PKPKPVKKVQEQPKRDVKPVE 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2503   PF03544       58     3e-11    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 58.1 bits (140), Expect = 3e-11
Identities = 23/115 (20%), Positives = 33/115 (28%), Gaps = 5/115 (4%)

Query: 869 TPPTPPTPEVPSEPETPMPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPT 928
P P + + + + P + P EP P PE PE P + P
Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI--PEPPKEAPVVIEKPKPKPK 102

Query: 929 PPTPEVPSEPETPTPPTPEVPAEPGKPVPPAKEEPKKPSKPVEQGKVVTPVIEIN 983
P V + P P E P P +P+ PV +
Sbjct: 103 PKPKPV---KKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154



Score = 56.5 bits (136), Expect = 9e-11
Identities = 21/102 (20%), Positives = 30/102 (29%), Gaps = 7/102 (6%)

Query: 877 EVPSEPETPMPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPS 936
+V P P + + + + P + P EP P PE PE P +
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI--PEPPKEAPVVIEK 96

Query: 937 EPETPTPPTPEVPAEP-----GKPVPPAKEEPKKPSKPVEQG 973
P P V KPV P + + P
Sbjct: 97 PKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPT 138



Score = 54.2 bits (130), Expect = 6e-10
Identities = 21/108 (19%), Positives = 31/108 (28%)

Query: 871 PTPPTPEVPSEPETPMPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPTPP 930
P + P EP P PE EP P E P P P + +P+ P
Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKP 120

Query: 931 TPEVPSEPETPTPPTPEVPAEPGKPVPPAKEEPKKPSKPVEQGKVVTP 978
P+ P T P + + + + + P
Sbjct: 121 VESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYP 168



Score = 53.4 bits (128), Expect = 9e-10
Identities = 24/120 (20%), Positives = 37/120 (30%), Gaps = 6/120 (5%)

Query: 861 QQTIEEDTTPPTPPTPEVPSEPETPMPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEV 920
+E PP P V EPE P P + P P P+
Sbjct: 57 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116

Query: 921 PSEPETPTPPTPEVPSEPETPTPPTP------EVPAEPGKPVPPAKEEPKKPSKPVEQGK 974
+P P +P + P PT T V + P ++ +P+ P++
Sbjct: 117 DVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRI 176



Score = 51.5 bits (123), Expect = 4e-09
Identities = 25/103 (24%), Positives = 36/103 (34%), Gaps = 2/103 (1%)

Query: 905 EVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPAEPGKPVPPAKEEPK 964
+V P P + + + + P + P EP P PE EP K P E+PK
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 98

Query: 965 KPSKPVEQGKVVTPVIEINEKVKAVAPTKKAQSKKSELPETGG 1007
KP K V V + VK V + + +
Sbjct: 99 PKPKPKP--KPVKKVEQPKRDVKPVESRPASPFENTAPARPTS 139



Score = 48.4 bits (115), Expect = 4e-08
Identities = 26/102 (25%), Positives = 35/102 (34%), Gaps = 7/102 (6%)

Query: 891 EVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPA 950
+V P P + + + + P + P EP P PE PE P +
Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI--PEPPKEAPVVIE- 95

Query: 951 EPGKPVPPAKEEPKKPSKPVEQGKVVTPVIEINEKVKAVAPT 992
KP P K +P KP K VEQ K +E
Sbjct: 96 ---KPKPKPKPKP-KPVKKVEQPKRDVKPVESRPASPFENTA 133


54  SAV2571  SAV2579  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2571   -1      17       0.283235   hypothetical protein
SAV2572   -3      18       0.482907   hypothetical protein
SAV2573   -2      17       -0.363297  conserved hypothetical protein
SAV2574   -3      12       0.854393   hypothetical protein
SAV2575   -2      8        0.607159   conserved hypothetical protein
SAV2576   -3      8        1.273329   conserved hypothetical protein
SAV2577   -3      12       1.784392   hypothetical protein
SAV2578   -2      13       1.614227   hypothetical protein
SAV2579   0       14       2.120788   putative short chain oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2571   HTHTETR       44     9e-08    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 43.8 bits (103), Expect = 9e-08
Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 34/200 (17%)

Query: 5 KSIDPRIVRTKQLLVDAFLKISREKKLSQITVKDITDIATLNRATFYAHFTDKEDLLDYT 64
+ T+Q ++D L++ ++ +S ++ +I A + R Y HF DK DL
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 65 LSV---TILKDLNDNLSISNVINEKVLRNIFISIASYIKDAAKSCELNSEAFCNKAHQRI 121
+ I + + + VLR I I + + L F
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIF------HK 116

Query: 122 NNELEDIFAIM-LENSYPEHQRDIIVNS-------------------ASFLAAGISGLAL 161
+ ++ + + + D I + A + ISGL
Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176

Query: 162 HWFNTSQ-----ETADVFID 176
+W Q + A ++
Sbjct: 177 NWLFAPQSFDLKKEARDYVA 196


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2575   TRNSINTIMINR  28     0.011    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 28.2 bits (62), Expect = 0.011
Identities = 14/45 (31%), Positives = 23/45 (51%)

Query: 56 FQNVSQQSLNTEPNEVMISLGVNTNEEVDQLVNKVKEAGGAVVQE 100
F+N Q +N + N I G ++ V+Q+ + KEAG Q+
Sbjct: 291 FKNPENQKVNIDANGNAIPSGELKDDIVEQIAQQAKEAGEVARQQ 335


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2576   NUCEPIMERASE  36     2e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 35.5 bits (82), Expect = 2e-04
Identities = 35/138 (25%), Positives = 53/138 (38%), Gaps = 35/138 (25%)

Query: 1 MKDILVIGATGKQGNAVVKQLLEDGWYVSAL--------TRNKNNRKLSDIGHPHLSIVE 52
MK LV GA G G V K+LLE G V + K R L + P +
Sbjct: 1 MK-YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQAR-LELLAQPGFQFHK 58

Query: 53 GDLSD-----------------NVSLQSAMKGKYGLYSIQ-PIVKDDVSEELRQGMKIIE 94
DL+D + A++ YS++ P D + L + I+E
Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVR-----YSLENPHAYADSN--LTGFLNILE 111

Query: 95 IAEQENIQHIVYSTAGGV 112
IQH++Y+++ V
Sbjct: 112 GCRHNKIQHLLYASSSSV 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2578   HTHTETR       62     2e-14    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 62.3 bits (151), Expect = 2e-14
Identities = 25/80 (31%), Positives = 44/80 (55%)

Query: 1 MRKDAKENRQRIEEIAHKLFDEEGVENISMNRIAKELGIGMGTLYRHFKDKSDLCYYVIQ 60
+++A+E RQ I ++A +LF ++GV + S+ IAK G+ G +Y HFKDKSDL + +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 61 RDLDIFITHFKQIKDDYHSN 80
+ + + +
Sbjct: 65 LSESNIGELELEYQAKFPGD 84


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2579   DHBDHDRGNASE  70     2e-16    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 69.7 bits (170), Expect = 2e-16
Identities = 48/197 (24%), Positives = 76/197 (38%), Gaps = 18/197 (9%)

Query: 3 KIVLITGGNKGLGYASAEALKALGYKVYIGSRND---VRGQQASQKLGVHYVQ--LDVTS 57
KI ITG +G+G A A L + G + N + + + H DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 58 DYSVKNAYNMIAEKEGRLDILINNAGISGQFSAPSKLTPRDVEEVYQTNVFGIVRMMNTF 117
++ I + G +DIL+N AG+ + L+ + E + N G+ +
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 118 VPLLEKSEQPVVVNVSSGLGSFGMVTNPETAESKVNSLAYCSSKSAVTMLTLQYAKGLP- 176
+ +V V S P T+ + AY SSK+A M T L
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAG-----VPRTSMA-----AYASSKAAAVMFTKCLGLELAE 177

Query: 177 -NMQINAADPGATNTDL 192
N++ N PG+T TD+
Sbjct: 178 YNIRCNIVSPGSTETDM 194


55  SAV2630  SAV2637  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2630   0       14       2.088328   Clumping factor B
SAV2631   -1      11       0.241845   transcriptional regulator
SAV2632   -3      9        1.192782   carbamate kinase
SAV2633   -2      10       0.502867   arginine/ornithine antiporter
SAV2634   -3      9        0.181193   ornithine transcarbamoylase
SAV2635   -2      8        -0.553350  arginine deiminase
SAV2636   -2      9        0.701121   similar to arginine repressor
SAV2637   -1      9        1.159582   zinc metalloproteinase aureolysin
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2630   PF05616       51     2e-08    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 50.9 bits (121), Expect = 2e-08
Identities = 36/125 (28%), Positives = 57/125 (45%), Gaps = 14/125 (11%)

Query: 508 NVDPVTNRDYSIFGWNNENVVRYGGGSADGDSAVNPK-----DPTPG----PPVDPEPSP 558
N+ PVT+R+ N VV G + G++ V+ + D TPG P P P
Sbjct: 277 NMGPVTDRN-----GNPVQVVATFGRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEV 331

Query: 559 DPEPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDSDSGSDSDSESDSDSDSDSDSDS 618
P P +P P+ +P P+P+PDPD + D++ +D G+ DS + D +
Sbjct: 332 SPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKE 391

Query: 619 DSDSE 623
+ E
Sbjct: 392 RKEGE 396



Score = 35.1 bits (80), Expect = 0.001
Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 1/63 (1%)

Query: 538 DSAVNPK-DPTPGPPVDPEPSPDPEPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDS 596
+ A NP + PG +PEP PD P+ PD + P P+ PD + +
Sbjct: 336 NPANNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPDRPNGRHRKERKEG 395

Query: 597 DSG 599
+ G
Sbjct: 396 EDG 398


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2632   CARBMTKINASE  391    e-139    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 391 bits (1006), Expect = e-139
Identities = 137/314 (43%), Positives = 198/314 (63%), Gaps = 5/314 (1%)

Query: 10 MKEKIVIALGGNAIQT--KEATAEAQQTAIRRAMQNLKPLFDSPARIVISHGNGPQIGSL 67
M +++VIALGGNA+Q ++ + E +R+ + + + +VI+HGNGPQ+GSL
Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60

Query: 68 LIQQAKSNSDT-TPAMPLDTCGAMSQGMIGYWLETEINRILTEMNSDRTVGTIVTRVEVD 126
L+ + PA P+D GAMSQG IGY ++ + L + ++ V TI+T+ VD
Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120

Query: 127 KDDPRFNNPTKPIGPFYTKEEVEELQKEQPDSVFKEDAGRGYRKVVASPLPQSILEHQLI 186
K+DP F NPTKP+GPFY +E + L +E + KED+GRG+R+VV SP P+ +E + I
Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLARE-KGWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179

Query: 187 RTLADGKNIVIACGGGGIPVIKKENTYEGVEAVIDKDFASEKLATLIEADTLMILTNVEN 246
+ L + IVIA GGGG+PVI ++ +GVEAVIDKD A EKLA + AD MILT+V
Sbjct: 180 KKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239

Query: 247 VFINFNEPNQQQIDDIDVATLKKYAAQGKFAEGSMLPKIEAAIRFVESGENKKVIITNLE 306
+ + +Q + ++ V L+KY +G F GSM PK+ AAIRF+E G ++ II +LE
Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWG-GERAIIAHLE 298

Query: 307 QAYEALIGNKGTHI 320
+A EAL G GT +
Sbjct: 299 KAVEALEGKTGTQV 312


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2635   ARGDEIMINASE  506    0.0      Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 506 bits (1305), Expect = 0.0
Identities = 193/409 (47%), Positives = 275/409 (67%), Gaps = 8/409 (1%)

Query: 5 PIKVNSEIGALKTVLLKRPGKELENLVPDYLDGLLFDDIPYLEVAQKEHDHFAQVLREEG 64
PI + SEIG LK VLL RPG+ELENL P + LFDDIPYLEVA++EH+ FA +L+
Sbjct: 7 PINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNL 66

Query: 65 VEVLYLEKLAAESIENPQ-VRSEFIDDVLAESKKTILGHEEEIKTLFATLSNQELVDKIM 123
VE+ Y+E L +E + + + ++FI + E++ +K F++L+ ++ K++
Sbjct: 67 VEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKMI 126

Query: 124 SGVRKEEINPKCTHLVEYMDDKYPFYLDPMPNLYFTRDPQASIGHGITINRMFWRARRRE 183
SGV EE+ + L + ++ F +DPMPN+ FTRDP ASIG+G+TIN+MF + R+RE
Sbjct: 127 SGVVTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVRQRE 186

Query: 184 SIFIQYIVKHHPRFKDANIPIWLDRDCPFNIEGGDELVLSKDVLAIGVSERTSAQAIEKL 243
+IF +YI K+HP +K N+PIWL+R ++EGGDELVL+K +L IG+SERT A+++EKL
Sbjct: 187 TIFAEYIFKYHPVYK-ENVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAKSVEKL 245

Query: 244 ARRIFENPQATFKKVVAIEIPTSRTFMHLDTVFTMIDYDKFTMHSAILKAEGNMNIFIIE 303
A +F+N + +F ++A +IP +R++MHLDTVFT IDY FT ++ + +I+++
Sbjct: 246 AISLFKN-KTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSD---DMYFSIYVLT 301

Query: 304 YDDVNKDIAIK-QSSHLKDTLEDVLGIDDIQFIPTGNGDVIDGAREQWNDGSNTLCIRPG 362
Y+ + I IK + + +KD L LG I I GD+I GAREQWNDG+N L I PG
Sbjct: 302 YNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAPG 360

Query: 363 VVVTYDRNYVSNDLLRQKGIKVIEISGSELVRGRGGPRCMSQPLFREDI 411
++ Y RN+V+N L + GIKV I SEL RGRGGPRCMS PL REDI
Sbjct: 361 EIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2636   ARGREPRESSOR  82     7e-23    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 82.2 bits (203), Expect = 7e-23
Identities = 38/147 (25%), Positives = 78/147 (53%), Gaps = 2/147 (1%)

Query: 1 MKKSKRLEIVSTIVKKHKIYKKEQIISYIEEYFGVRYSATTIAKDLKELNIYRVPIDCET 60
M K +R + I+ ++I +++++ +++ G + T+++D+KEL++ +VP + +
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKD-GYNVTQATVSRDIKELHLVKVPTNNGS 59

Query: 61 WIYKAINNQTEQEMREKFRHYCEHEVLSSIINGSYIIVKTSPGFAQGINYFIDQLNIEEI 120
+ Y ++ K + + I++KT PG AQ I +D L+ EEI
Sbjct: 60 YKY-SLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEI 118

Query: 121 LGTVSGNDTTLILTASNDMAEYVYAKL 147
+GT+ G+DT LI+ ++D + V K+
Sbjct: 119 MGTICGDDTILIICRTHDDTKVVQKKI 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2637   THERMOLYSIN   440    e-152    Thermolysin metalloprotease (M4) family signature.
		>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature.

Length = 544

Score = 440 bits (1133), Expect = e-152
Identities = 173/480 (36%), Positives = 249/480 (51%), Gaps = 42/480 (8%)

Query: 64 NIYQDYAVTDVKTDKKGFTHYTLQPSVDGVHAPDKEVKVHADKSGKVVLING----DTDA 119
+ ++ K D+ G T + ++ + H + G++ ++G + D
Sbjct: 71 QARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVN-DGELSSLSGTLIPNLDK 129

Query: 120 KKVKPTNKVTLSKDDAADKAFKAVKIDKHKAKNLKDKVIKENKVEIDGDSNKYVYNVELI 179
+ +K +++ + + K A ++ K + ++ + D ++ + Y V +
Sbjct: 130 RTLKTEAAISIQQAEMIAKQDVADRVTKERPAA-EEGKPTRLVIYPDEETPRLAYEVNVR 188

Query: 180 TVTPEISHWKVKIDAQTGEILEKMNLVKEA-----------AETGKGKGVLGDTKDINI- 227
+TP +W IDA G++L K N + EA + G G+GVLGD K IN
Sbjct: 189 FLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYINTT 248

Query: 228 -NSIDGGFSLEDLTHQGKLSAFSFNDQTG-QATLITNEDENFVKDEQRAGVDANYYAKQT 285
+S G + L+D T + + ++T +L + D F A VDA+YYA
Sbjct: 249 YSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVV 308

Query: 286 YDYYKDTFGRESYDNQGSPIVSLTHVNNYGGQDNRNNAAWIGDKMIYGDGDGRTFTSLSG 345
YDYYK+ GR SYD + I S H YG NNA W G +M+YGDGDG+TF SG
Sbjct: 309 YDYYKNVHGRLSYDGSNAAIRSTVH---YG--RGYNNAFWNGSQMVYGDGDGQTFLPFSG 363

Query: 346 ANDVVAHELTHGVTQETANLEYKDQSGALNESFSDVFGYFVD-----DEDFLMGEDVYTP 400
DVV HELTH VT TA L Y+++SGA+NE+ SD+FG V+ + D+ +GED+YTP
Sbjct: 364 GIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNPDWEIGEDIYTP 423

Query: 401 GKEGDALRSMSNPEQFGQPAHMKDYVFTEKDNGGVHTNSGIPNKAAYNVIQ--------- 451
G GDALRSMS+P ++G P H +DNGGVHTNSGI NKAAY + Q
Sbjct: 424 GVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVHYGVSV 483

Query: 452 -AIGKSKSEQIYYRALTEYLTSNSNFKDCKDALYQAAKDLYDEQTAE--QVYEAWNEVGV 508
IG+ K +I+YRAL YLT SNF + A QAA DLY + E V +A+N VGV
Sbjct: 484 TGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543


56  SAV2643  SAV2649  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2643   -1      9        0.132570   similar to phage infection protein
SAV2644   0       11       -0.126021  similar to autolysin precursor
SAV2645   0       16       -0.069691  hypothetical protein
SAV2646   -1      14       0.250200   conserved hypothetical protein
SAV2647   0       15       0.618851   hypothetical protein
SAV2648   0       13       0.017985   similar to lipopolysaccharide biosynthesis
SAV2649   6       13       2.221023   preprotein translocase secA homolog
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2643   ABC2TRNSPORT  39     6e-05    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 39.1 bits (91), Expect = 6e-05
Identities = 37/172 (21%), Positives = 67/172 (38%), Gaps = 28/172 (16%)

Query: 817 NKHKSLESVLTTRQVFLGKAGFFIMLGML-----QALIVSVGDLLILKAGVESP---VLF 868
++ E++L T Q+ LG I+LG + +A + G ++ A + +L+
Sbjct: 95 EGQRTWEAMLYT-QLRLGD----IVLGEMAWAATKAALAGAGIGVVAAALGYTQWLSLLY 149

Query: 869 VLITI-FCSIIFNSIVYTCVSLLGNPGKAIAIVLLVLQIAG----GGGTFPIQTTPQFFQ 923
L I + F S+ +L P I L I G FP+ P FQ
Sbjct: 150 ALPVIALTGLAFASLGMVVTAL--APSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQ 207

Query: 924 NISPYLPFTYAIDSLRETV-----GGIVPEILITKLIILTLFGIGFFVVGLI 970
+ +LP +++ID +R + + + + I+ F F L+
Sbjct: 208 TAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPF---FLSTALL 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2644   FLGFLGJ       64     5e-13    Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 63.6 bits (154), Expect = 5e-13
Identities = 50/176 (28%), Positives = 84/176 (47%), Gaps = 19/176 (10%)

Query: 304 SNNDDSGQFNVVDSKDTRQFVKSIAKDAHRIGQDNDIYASVMIAQAILESDSGRSALAKS 363
N DDS D++ F+ ++ A Q + + +++AQA LES G+ + +
Sbjct: 139 RNYDDSLPG------DSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRE 192

Query: 364 ---PNHNLFGIK--GAFEGNSVPFNTLEADGNKLYSINAGFRKYPSTKESLKDYSDLIKN 418
P++NLFG+K G ++G T E + + + A FR Y S E+L DY L+
Sbjct: 193 NGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTR 252

Query: 419 GIDGNRTIYKPTWKSEADSYKDATSHLSKTYATDPNYAKKLNSIIKHYQLTQFDDE 474
+ + A + +DA YATDP+YA+KL ++I+ Q+ D+
Sbjct: 253 NPRYAAVTTAASAEQGAQALQDA------GYATDPHYARKLTNMIQ--QMKSISDK 300


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2645   ISCHRISMTASE  77     3e-19    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 77.0 bits (189), Expect = 3e-19
Identities = 41/183 (22%), Positives = 77/183 (42%), Gaps = 10/183 (5%)

Query: 3 RKTALLVLDMQE----GIASSVPRIKNIIKANQRAIEAARQHRIPVIFIRLVLDKHFNDV 58
+ LL+ DMQ + + + ++ Q IPV++ ++ +D
Sbjct: 29 NRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDDR 88

Query: 59 SSSNKVFSTIKAQGYAITEADASTRILEDLAPLEDEPIISKRRFSAFTGSYLEVYLRAND 118
+ + G + +I+ +LAP +D+ +++K R+SAF + L +R
Sbjct: 89 ALLTDFW------GPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEG 142

Query: 119 INHLVLTGVSTSGAVLSTALESVDKDYYITVLEDAVGDRSDDKHDFIIEQILSRSCDIES 178
+ L++TG+ L TA E+ +D + DAV D S +KH +E R
Sbjct: 143 RDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVM 202

Query: 179 VES 181
+S
Sbjct: 203 TDS 205


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2649   SECA          656    0.0      SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 656 bits (1695), Expect = 0.0
Identities = 287/835 (34%), Positives = 450/835 (53%), Gaps = 68/835 (8%)

Query: 10 NELRLKSIRKIVKRINTWSDEVKSYSDDALKQKTIEFKERLASGVDTLDTLLPEAYAVAR 69
N+ L+ +RK+V IN E++ SD+ LK KT EF+ RL G + L+ L+PEA+AV R
Sbjct: 14 NDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKG-EVLENLIPEAFAVVR 72

Query: 70 EASWRVLGMYPKEVQLIGAIVLHEGNIAEMQTGEGKTLTATMPLYLNALSGKGTYLITTN 129
EAS RV GM +VQL+G +VL+E IAEM+TGEGKTLTAT+P YLNAL+GKG +++T N
Sbjct: 73 EASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVN 132

Query: 130 DYLAKRDFEEMQPLYEWLGLTASLGFVDIVDYEYQKGEKRNIYEHDIIYTTNGRLGFDYL 189
DYLA+RD E +PL+E+LGLT V I KR Y DI Y TN GFDYL
Sbjct: 133 DYLAQRDAENNRPLFEFLGLT-----VGINLPGMPAPAKREAYAADITYGTNNEYGFDYL 187

Query: 190 IDNLADSAEGKFLPQLNYGIIDEVDSIILDAAQTPLVISGAPRLQSNLFHIVKEFVDTLI 249
DN+A S E + +L+Y ++DEVDSI++D A+TPL+ISG S ++ V + + LI
Sbjct: 188 RDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVNKIIPHLI 247

Query: 250 E-----------DVHFKMKKTKKEIWLLNQGIEAAQSYFNV-------EDLYSEQAMVLV 291
+ HF + + +++ L +G+ + E LYS ++L+
Sbjct: 248 RQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLM 307

Query: 292 RNINLALRAQYLFESNVDYFVYNGDIVLIDRITGRMLPGTKLQAGLHQAIEAKEGMEVST 351
++ ALRA LF +VDY V +G+++++D TGR + G + GLHQA+EAKEG+++
Sbjct: 308 HHVTAALRAHALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQN 367

Query: 352 DKSVMATITFQNLFKLFESFSGMTATGKLGESEFFDLYSKIVVQAPTDKAIQRIDEPDKV 411
+ +A+ITFQN F+L+E +GMT T EF +Y V PT++ + R D PD V
Sbjct: 368 ENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMIRKDLPDLV 427

Query: 412 FRSVDEKNIAMIHDIVELHETGRPVLLITRTAEAAEYFSKVLFQMDIPNNLLIAQNVAKE 471
+ + EK A+I DI E G+PVL+ T + E +E S L + I +N+L A+ A E
Sbjct: 428 YMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANE 487

Query: 472 AQMIAEAGQIGSMTVATSMAGRGTDIKLG-----------------------------EG 502
A ++A+AG ++T+AT+MAGRGTDI LG +
Sbjct: 488 AAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDA 547

Query: 503 VEALGGLAVIIHEHMENSRVDRQLRGRSGRQGDPGSSCIYISLDDYLVKRWSDSNLAENN 562
V GGL +I E E+ R+D QLRGRSGRQGD GSS Y+S++D L++ ++ ++
Sbjct: 548 VLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMM 607

Query: 563 QLYSLDAQRLSQSNLFNRKVKQIVVKAQRISEEQGVKAREMANEFEKSISIQRDLVYEER 622
+ + + + + AQR E + R+ E++ + QR +Y +R
Sbjct: 608 RKLGMKPGEAIEHPWVTKAIA----NAQRKVESRNFDIRKQLLEYDDVANDQRRAIYSQR 663

Query: 623 NRVLEIDDAENRDFKALAKDVFEMFVNEE---KVLTKSRVVEYIYQNLSFQFNKDVACVN 679
N +L++ D ++ +DVF+ ++ + L + + + + L F+ D+
Sbjct: 664 NELLDVSDVSET-INSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPIAE 722

Query: 680 FKDKQAVVT------FLLEQFEKQLALNRKNMQSAYYYNIFVQKVFLKAIDSCWLEQVDY 733
+ DK+ + +L Q + ++ + A F + V L+ +DS W E +
Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQ-RKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAA 781

Query: 734 LQQLKASVNQRQNGQRNAIFEYHRVALDSFEVMTRNIKKRMVKNICQSMITFDKE 788
+ L+ ++ R Q++ EY R + F M ++K ++ + + + +E
Sbjct: 782 MDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836


57  SAV2653  SAV2665  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag  DNBias  CDNBias  %GCBias    Product
SAV2653   11      15       3.006538   similar to preprotein translocase secY
SAV2654   13      16       3.961392   serine-threonine rich antigen
SAV2655   -1      15       1.211608   conserved hypothetical protein
SAV2656   -2      14       -0.012697  hypothetical protein
SAV2657   -2      14       0.089940   hypothetical protein
SAV2658   -2      15       0.609443   hypothetical protein
SAV2659   -2      15       -0.375373  conserved hypothetical protein
SAV2660   -1      16       -1.523849  similar to methionine sulfoxide reductase
SAV2661   -2      17       -1.212499  putative acetyltransferase
SAV2662   -2      17       -1.328218  capsular polysaccharide biosynthesis
SAV2663   -1      16       -1.947783  capsular polysaccharide biosynthesis
SAV2664   -1      15       -3.584731  capsular polysaccharide biosynthesis
SAV2665   0       16       -3.257885  ica operon transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2653   SECYTRNLCASE  130    4e-36    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 130 bits (329), Expect = 4e-36
Identities = 93/440 (21%), Positives = 181/440 (41%), Gaps = 52/440 (11%)

Query: 4 LLQQYEYKIIYKRMLYTCFILFIYILGTNISI--VSYNDMQ------VKHESFFKIAISN 55
+ + + K++L+T I+ +Y +GT+I I V Y ++Q ++ F +
Sbjct: 5 FARAFRTPDLRKKLLFTLAIIVVYRVGTHIPIPGVDYKNVQQCVREASGNQGLFGLVNMF 64

Query: 56 MGGDVNTLNIFTLGLVPWLTSMIILMLISYRNMDKYMKQTSLEKHYKE------------ 103
GG + + IF LG++P++T+ IIL L++ + LE KE
Sbjct: 65 SGGALLQITIFALGIMPYITASIILQLLT-------VVIPRLEALKKEGQAGTAKITQYT 117

Query: 104 RILTLILSVIQSYFVIHEYVSKERVHQDN-------------IYLTILILVTGTMLLVWL 150
R LT+ L+++Q ++ S + + ++ + GT +++WL
Sbjct: 118 RYLTVALAILQGTGLVATARSAPLFGRCSVGGQIVPDQSIFTTITMVICMTAGTCVVMWL 177

Query: 151 ADKNSRYGIAGPMPIVMVSIIKSMMHQKMEYI------DASHIVIALLIILVIITLFILL 204
+ + GI M I+M I + + I I +I + +I + +++
Sbjct: 178 GELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEFGTVIAVGLIMVALVV 237

Query: 205 FIELVEVRIPYI----DLMNVSATNMKSYLSWKVNPAGSITLMMSISAFVFLKSGIHFIL 260
F+E + RIP + S +Y+ KVN AG I ++ + S F
Sbjct: 238 FVEQAQRRIPVQYAKRMIGRRSYGGTSTYIPLKVNQAGVIPVIFASSLLYIPALVAQFAG 297

Query: 261 SMFNKSISDDMPMLTFDSPVGISVYLVIQMLLGYFLSRFLINTKQKSKDFLKSGNYFSGV 320
+ + D P+ I Y ++ + +F N ++ + + K G + G+
Sbjct: 298 GNSGWKSWVEQNLTKGDHPIYIVTYFLLIVFFAFFYVAISFNPEEVADNMKKYGGFIPGI 357

Query: 321 KPGKDTERYLNYQARRVCWFGSALVTVIIGIPLYFTLFVPHLSTEIYFS-VQLIVLVYIS 379
+ G+ T YL+Y R+ W GS + +I +P L S F ++++V +
Sbjct: 358 RAGRPTAEYLSYVLNRITWPGSLYLGLIALVP-TMALVGFGASQNFPFGGTSILIIVGVG 416

Query: 380 INIAETIRTYLYFDKYKPFL 399
+ + I + L Y+ FL
Sbjct: 417 LETVKQIESQLQQRNYEGFL 436


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2654   ICENUCLEATIN  55     3e-09    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 55.1 bits (132), Expect = 3e-09
Identities = 237/1070 (22%), Positives = 425/1070 (39%), Gaps = 12/1070 (1%)

Query: 1098 SDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSD 1157
+ + ++ + S S + + T +T S ST ++ +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1158 STSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSE 1217
ST T +S I+ ST + + + S + ++ST + S ES
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1218 STSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSE 1277
+ ST + S + S T+ S+ + ST + S+ T+ ST T +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1278 SVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSL 1337
S T+ ST T+ ++S+ ++ S T+ +S + ST + S + ST
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1338 SGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQ 1397
+G S + S+ + DS+ + S T+ S T+ S T+ + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1398 SGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQ 1457
S + S + ST T++ S T+ Y S T+ +S+ + S T+ S+
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1458 SGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSA 1517
+G ST + GS+ + S ST+ ES+ + S + S + ST + +
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1518 SASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSE 1577
S + STS + + S+ + S ++ S+ + ST + ST
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1578 SGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSD 1637
+GS S + ST T+ S T+ S + S T+ STS + + S +
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1638 SQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVM 1697
S + S T+ ST + SD T+ S ST+G+ S I+ ST T+ S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 1698 SASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESS 1757
+ S + S S S S + +DS ++G S + S T+ S +
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 1758 SLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTST 1817
S+ + S S + +DSS ++ S +++ S + S T+ S T+G STST
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 1818 SLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQ 1877
+ + S ++ S + S T+ S + S + STS + DS ++
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYG 885

Query: 1878 SMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSD 1937
S + S + S+ T+ D + S S + S GS T++ ST
Sbjct: 886 STQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASFKSTLM 945

Query: 1938 SMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSIST 1997
+ GS + S + GSTS++ S+ + S + QST T+ GS T+ +
Sbjct: 946 AGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQTAEHS 1005

Query: 1998 SMSMSASTSSSQSTSVSTSLSTSDSISDST----------SISISGSQSTVESESTSDST 2047
S + S++ + + S+ ++ S S S ISG +S + + S
Sbjct: 1006 STLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGYGSSLI 1065

Query: 2048 SISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS--ESLSTSGSGSTSVSDSTSMSESDST 2105
S S + S+ ++ S +G ST I+ S+ +G GS+ + S S +
Sbjct: 1066 SGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGAD 1125

Query: 2106 SVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSI 2155
SV M+ ++ + +DS + S L+ ++S T+ S +G+ I
Sbjct: 1126 SVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCI 1175



Score = 53.2 bits (127), Expect = 1e-08
Identities = 176/773 (22%), Positives = 305/773 (39%), Gaps = 2/773 (0%)

Query: 1408 SASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASL 1467
+ + +E + S + + T D+T S ST + + +
Sbjct: 106 LHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYG 165

Query: 1468 SGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSST 1527
S SQ I+ S T+ +ST ++ ST +G+ ST + S + +SS
Sbjct: 166 STLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQM 225

Query: 1528 SLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSE 1587
+ ST M+ S+ + S + S+ + ST + S T+ GST +
Sbjct: 226 AGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 285

Query: 1588 SDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTST 1647
SD T+ S + + S+ +G ST T+ +S T+ ST SD + ST T
Sbjct: 286 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 345

Query: 1648 STSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSM 1707
+ S + S + DS+ + GS + SD T+ S + S +
Sbjct: 346 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYG 405

Query: 1708 SESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSD 1767
S ES + S + GS + GS + + S+ +G S
Sbjct: 406 STQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLT 465

Query: 1768 SVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSE 1827
+ S ++ S S S + S + GST T+ GS T+ S + +E
Sbjct: 466 AGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNE 525

Query: 1828 STSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTST 1887
S ++ S S + + S + GS + S+ T+ S + S +G ST T
Sbjct: 526 SDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGT 585

Query: 1888 SVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGSVSVST 1947
+ SDS + S + S+ + ST + S + S ST+ + S ++
Sbjct: 586 AGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYG 645

Query: 1948 STSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMSASTSS 2007
ST + S T+ S+ T+ S + S ST+ + S ++ ST + S +
Sbjct: 646 STQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILT 705

Query: 2008 SQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTST 2067
+ S T+ SD S S S +G+ S++ + S T+ S + S T+
Sbjct: 706 AGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQ 765

Query: 2068 SDSTS--GSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDSESV 2125
S T+ GSTST+ ++S +G GST + S+ + S +Q++SD T+ S S
Sbjct: 766 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTST 825

Query: 2126 STSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTSTSESNSMHPS 2178
+ + S+ ++ ST T+ S +G S + S + S S + + S
Sbjct: 826 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878



Score = 53.2 bits (127), Expect = 1e-08
Identities = 233/1050 (22%), Positives = 411/1050 (39%), Gaps = 2/1050 (0%)

Query: 759 TSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTSVSLSDSVSASKSLSTSES 818
TS + A ++ + S + D+ S S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 819 NSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNSNSTEKSESLSTSTSDSLRTSTS 878
+++ ST QS + GS + S I+ ST + + ST + T T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 879 LSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTSNAISTSTSLSESASTSDSISIS 938
+S M+ GS S +G ST + DS+ A ST + S+ + S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 939 NSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVS 998
A S T+ S T+ + S+ + ST + +ST T+G GS A S
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGY--GSTQTAQKGSDL 336

Query: 999 TSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQSGSTSESLSDSQSTSDSDS 1058
T+ S T+ S I+ GS + S + ST + S+ + ST + +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1059 KSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSAS 1118
S ++ S T+ ST + S + GS + S + S + S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1119 TASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSDSTSLSTSESDSISESTSTSDS 1178
TA +S + ST + S + S T+ S + S + ST T+
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1179 ISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSESTSESVSSSTSESTSLS 1238
S + +ES I+ S ST+ + S + + S + S T+ S+ T+ S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1239 DSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTSLSMSTSTSLSNSTSLS 1298
+ S T+ S S+ +G S T++ S T+ + S + S+ T+ S ST+ +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1299 TSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDS 1358
S + S + S+ T+ ST + S T+ GSTS + +DS+ + S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1359 TSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTS 1418
T+ S+ + GST T+ S S S S + + + S ++ +S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1419 ESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQS 1478
S + + S ST+ + S + S T+ S T+ S ++ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 1479 ISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQ 1538
+ S ST+ + S+ ++ ST +G S T+ S ++ +S T+ STS +
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 1539 SSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDS 1598
S + S + S T+ ST + S T+ GSTS + ES + S
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 1599 QSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLS 1658
++ +ST +G S+ T+ S T+ STSM S + ST T+ S T+
Sbjct: 937 TASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGY 996

Query: 1659 DSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVS 1718
S + ST + GS + + + S + S+ S + S ++ SV
Sbjct: 997 GSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVL 1056

Query: 1719 ESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLS 1778
+ S S S+ + GS +++ + ES+ ++G++SM + S ++
Sbjct: 1057 TAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGY 1116

Query: 1779 VSTSLRSSESVSESDSLSDSKSTSGSTSTS 1808
ST + ++SV + + + ST T+
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTA 1146



Score = 52.8 bits (126), Expect = 2e-08
Identities = 229/1007 (22%), Positives = 395/1007 (39%), Gaps = 10/1007 (0%)

Query: 1163 TSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLSESTSES 1222
TS I + + +E + S ++ ES S +++
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 1223 TSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVSTS 1282
+ ST T S + GST T+ +ST + ST T+ ++ST S T+
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1283 LSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTS 1342
S+ + ST SD T+ S + S+ + S + S+ +G S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1343 ESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDS 1402
+ S + ST + + S +G ST T+ S T+ S + S + +
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1403 NSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTSQSGSTS 1462
S + S+ + S T+ S T+ ST T+ SD T+ GST
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA------GYGSTG 392

Query: 1463 TSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASASES 1522
T+ + S + S + S T+ ST + S +G ST T+ +S+ +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 1523 DSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTS 1582
S+ T+ DS+ + S +Q S + STST+ S++ ++ ST +G S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSL--IAGYGSTQTAGYGS 510

Query: 1583 ESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMS 1642
T+ ST T+ ++S + S S + + S+ + ST ++ S+ T+ S +
Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570

Query: 1643 LSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVSISLSDSTSTSTSASEVMSASIS 1702
S T+ ST + S S + S T+ S + ST T+ S + + S
Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630

Query: 1703 DSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLSGS 1762
S + ++S + S + +S +G S + S T+ S S + + S +
Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690

Query: 1763 QSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGS 1822
S + +S + S ++++ S+ S S ST+G+ S+ +G ST T+ S
Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750

Query: 1823 ESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGS 1882
+ S + S T+ S S +G+ S + ST + S + S +
Sbjct: 751 SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810

Query: 1883 ESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDSSNISGSNSTSTSLSTSDSMSGS 1942
+ S + S+ST+ DS I+ S + +S +G ST T+ SD +G
Sbjct: 811 QERSDLTTGYGSTSTAG--ADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGY 868

Query: 1943 VSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMSMS 2002
S ST+ S I+G S + S T+ S +Q S +G STS + S
Sbjct: 869 GSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSL 928

Query: 2003 ASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDS 2062
+ S T+ S + S T+ S + S S + S + ST +
Sbjct: 929 IAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGY 988

Query: 2063 TSTSTSDSTSGSTSTSISESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDS 2122
ST T+ S T+ S + GS +T+ +DS+ ++ S+ S + + S
Sbjct: 989 QSTLTAGYGSTQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTL 1048

Query: 2123 ESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTSTSMSGSTST 2169
S S T+ S S S T+ GS I+ S+ ++G ST
Sbjct: 1049 ISGLRSVLTAGYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPEST 1095



Score = 52.1 bits (124), Expect = 3e-08
Identities = 193/856 (22%), Positives = 350/856 (40%), Gaps = 14/856 (1%)

Query: 1329 DSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTS 1388
S + ++ + + + S + + + + T +T S ST +
Sbjct: 97 TKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPT 156

Query: 1389 LSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDS 1448
++ + S + SQ + ST T+ S + Y S T+ ++ST + S
Sbjct: 157 QTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQ 216

Query: 1449 TSISKSTSQSGSTSTSASLSGSESESDSQSISTSASEST--SESASTSLSDSTSTSNSGS 1506
T+ +S+ +G ST + GS+ + S T+ +S+ + ST + S+ +G
Sbjct: 217 TAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGY 276

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
ST T+ S + S+ T+ +DS+ + S + S + ST T+ + S +
Sbjct: 277 GSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDL 336

Query: 1567 ASLSTSVSTSESGST------SESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
+ S T+ S+ S T+ DS+ T+ S T++ S + ST T+ +
Sbjct: 337 TAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGA 396

Query: 1621 RSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S+ + S +T+ +S + ST T+ S + S T+ S+ +G S
Sbjct: 397 DSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQ 456

Query: 1681 ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGS 1740
+ DS+ T+ S + SD + S + + S + S +G S +G
Sbjct: 457 TAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGY 516

Query: 1741 LSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKS 1800
S T+ +S+ ++ S S + + S ++ S+ + S+ ++ S + S
Sbjct: 517 GSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDL 576

Query: 1801 TSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGSTS 1860
T+G ST T+GS S+ + GS + S + S T+ S +G S S + +
Sbjct: 577 TAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGA 636

Query: 1861 LST------SDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDS 1914
S+ S + S+ ++ S + S + STS + DS I+ S
Sbjct: 637 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQ 696

Query: 1915 MSTSDSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLS 1974
+ +S +G ST T+ SD SG S ST+ + S I+G S +S S+ T+
Sbjct: 697 TAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGY 756

Query: 1975 DSMSQSQSTSTSASGSLSTSISTSMSMSASTSSSQSTSVSTSLSTSDSISDSTSISISGS 2034
S ++ S +G STS + + S + S T+ S+ T+ S T+ S
Sbjct: 757 GSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDL 816

Query: 2035 QSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSISESLSTSGSGSTSVS 2094
+ S ST+ + S + ST + S T+ S T+ S+ + GS ST+
Sbjct: 817 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 876

Query: 2095 DSTSMSESDSTSVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQS 2154
DS+ ++ ST + + S + S T+ S ST+ ES + GS
Sbjct: 877 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQ 936

Query: 2155 ISDSTSTSMSGSTSTS 2170
+ ST M+G S+
Sbjct: 937 TASFKSTLMAGYGSSQ 952



Score = 51.3 bits (122), Expect = 5e-08
Identities = 200/886 (22%), Positives = 350/886 (39%), Gaps = 6/886 (0%)

Query: 797 KSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNS 856
++ + + SA + + ++ V+ + + S V+S + D +
Sbjct: 89 RAEVLHVGTKTSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATI 148

Query: 857 NSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTS 916
S + + + T + S ++ GS + ST I+G ST + +DST
Sbjct: 149 ESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTL 208

Query: 917 NAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDSKSMSTSESLS 976
A ST + S+ + S S T+ S T+ S+ + ST +
Sbjct: 209 VAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 268

Query: 977 DSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMS 1036
DS+ T+G S + S + S + ++ + S + +S + S
Sbjct: 269 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 328

Query: 1037 TSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSAS 1096
T+Q GS + S T+ DS + + GST T+ S+ S T+ S
Sbjct: 329 TAQKGSDLTAGYGSTGTAGDDSSLI----AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDL 384

Query: 1097 QSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVS 1156
+ S T+ +DS+ + ST ++ S + S + S T+ T T+
Sbjct: 385 TAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGD 444

Query: 1157 DSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQSASAFLSESLS 1216
DS+ ++ S + S+ + + ++ S + STS + +S+ S
Sbjct: 445 DSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQ 504

Query: 1217 ESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKS 1276
+ ST + ST + + SD + GSTST+ +NS+ + ST T+ S T
Sbjct: 505 TAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGY 564

Query: 1277 ESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTS 1336
S T+ S T+ ST + S S + S ++ S+ + S + S
Sbjct: 565 GSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVL 624

Query: 1337 LSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMN 1396
+G S S + + SS + ST + S T+G ST T+ SD T+ S S +
Sbjct: 625 TTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGA 684

Query: 1397 QSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLSDSTSISKSTS 1456
S + + S + S T+ S T+ S TS STST+ + S + ST
Sbjct: 685 DSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQ 744

Query: 1457 QSGSTSTSASLSGSESESDSQSISTS--ASESTSESASTSLSDSTSTSNSGSASTSTSLS 1514
+ S+ + GS + QS+ T+ S ST+ + S+ ++ ST +G S T+
Sbjct: 745 TASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGY 804

Query: 1515 NSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVS 1574
S ++ S T+ STS + S + S + S T+ ST + S
Sbjct: 805 GSTQTAQERSDLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDL 864

Query: 1575 TSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDSRSTSASTSTSMRTS 1634
T+ GSTS + +S + S + S +G ST T+ +S T+ STS
Sbjct: 865 TTGYGSTSTAGYDSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGY 924

Query: 1635 TSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS 1680
S + ST T++ S + S + S+ + GS S++
Sbjct: 925 ESSLIAGYGSTQTASFKSTLMAGYGSSQTAREQSSLTAGYGSTSMA 970



Score = 49.0 bits (116), Expect = 3e-07
Identities = 206/894 (23%), Positives = 361/894 (40%), Gaps = 6/894 (0%)

Query: 733 TDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTS 792
T G+ T + T + S ++ GSTQ + ST A S T+ GS + +
Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340

Query: 793 ASTSKSTSVSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDS 852
ST + S + S + +S+ + ST S ++ GS + + S
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 853 ISNSNSTEKSESLSTSTSDSLRTSTSLSDSLSMSTSGSLSKSQSLSTSISGSSSTSASLS 912
I+ ST+ + ST T+ T T+ S + GS + S+ I+G ST +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 913 DSTSNAISTSTSLSESASTSDSISISNSIANSQSA------STSKSDSQSTSISLSTSDS 966
DS+ A ST ++ S + S S A +S+ ST + ST + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
+ + S+ ++ STS + + S IA S T++ +S+ T+ S + GS +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S+ +G S + S+ + S + QS T+ STS + S
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
+ GS + +S+ + S T+ S TA S S + + S+ + ST +
Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
NS T+ S T+ S+ S STST+ + S I+ ST + S+ T+ S
Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + S ST+ + SS + S + S T+ S T+ S T+
Sbjct: 761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTS 1326
S ST+ S ++ S T+ S T+ S + +S + S ST+ S+
Sbjct: 821 GSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSL 880

Query: 1327 KSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTSTSTSLSDSTS 1386
+ ST T+ S + ST +++ SD T+ S S + S+ + S ++
Sbjct: 881 IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTASF 940

Query: 1387 TSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTSTSTSLS 1446
S ++ + S+ + STS + +S + T + QS T+ S
Sbjct: 941 KSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGSTQ 1000

Query: 1447 DSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGS 1506
+ S T+ GST+T+ + S + S S S T+ ST +S S +G
Sbjct: 1001 TAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTAGY 1060

Query: 1507 ASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTI 1566
S+ S S+ + S+ + S+ + S + + S ++ S+ T+ ST+
Sbjct: 1061 GSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTL 1120

Query: 1567 ASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTSDS 1620
S + SV + + ++S T+ S + + S +G S T+ +D
Sbjct: 1121 ISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDC 1174



Score = 47.8 bits (113), Expect = 6e-07
Identities = 241/1091 (22%), Positives = 433/1091 (39%), Gaps = 10/1091 (0%)

Query: 907 TSASLSDSTSNAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSISLSTSDS 966
TSA A + + ++ S ++ + N T D+ S S + +
Sbjct: 99 TSAMQFILHHRADYVACTEMQAGPGSPDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQT 158

Query: 967 KSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDS 1026
++T S T S ++G S + ST + ST +DS +G S +
Sbjct: 159 IEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTA 218

Query: 1027 KSMSVSSSMSTSQSGSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSES 1086
S + S S + S + S + GST T+ S+ S
Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGS 278

Query: 1087 QSTSGSMSASQSDSMSISTSFSDSTSDSKSASTASSESISQSASTSTSGSVSTSTSLSTS 1146
T+ S + S T+ +DS+ + ST ++ S + S + S T+
Sbjct: 279 TQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTA 338

Query: 1147 NSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTSDSESQS 1206
T T+ DS+ ++ S + S+ + + ++ S + ST + + S
Sbjct: 339 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 398

Query: 1207 ASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTS 1266
+ S + EST + ST + SD T+ GST T+ +S+ + ST T+
Sbjct: 399 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 458

Query: 1267 ISESTSTFKSESVSTSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMS--TSDSIS 1324
+S+ T S T+ S T+ STS + S + S + S T+ S
Sbjct: 459 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGS 518

Query: 1325 TSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTSMSISMSQSTSGSTSTS------TS 1378
T + + S + GSTS + ++S+ + S T+ S+ + GST T+ T+
Sbjct: 519 TQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTA 578

Query: 1379 TSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSES 1438
S T+ S S + S ++ S + ST T+ S T+ Y S ST+ ++S
Sbjct: 579 GYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 638

Query: 1439 TSTSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDS 1498
+ + S T+ S +G ST + GS+ + S ST+ ++S+ + S +
Sbjct: 639 SLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTA 698

Query: 1499 TSTSNSGSASTSTSLSNSASASESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTS 1558
S + ST + S S STS + + S+ + S ++ S + S
Sbjct: 699 GYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGS 758

Query: 1559 TSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQSTSRSTSASGSASTSTSTS 1618
T + STS +G+ S + ST T+ S T+ S + S T+
Sbjct: 759 TQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTT 818

Query: 1619 DSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMS 1678
STS + + S + S + S T+ ST + SD T+ S ST+G S
Sbjct: 819 GYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDS 878

Query: 1679 VSISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDS 1738
I+ ST T+ S + + S + S + S S + +S ++G S +
Sbjct: 879 SLIAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYESSLIAGYGSTQTA 938

Query: 1739 GSLSVSTSLRKSESVSESSSLSGSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDS 1798
S + S + S + S S++ DSS ++ S +++ S + S
Sbjct: 939 SFKSTLMAGYGSSQTAREQSSLTAGYGSTSMAGYDSSLIAGYGSTQTAGYQSTLTAGYGS 998

Query: 1799 KSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDSTSTSDSDSLSGSISLSGS 1858
T+ +ST T+G ST+T+ + S ++ S S S T+ S +SG S+ +
Sbjct: 999 TQTAEHSSTLTAGYGSTATAGADSSLIAGYGSSLTSGIRSFLTAGYGSTLISGLRSVLTA 1058

Query: 1859 TSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTS 1918
S+ S S + S + S+ ++ +S+ + ++ SM I+ S +
Sbjct: 1059 GYGSSLISGRRSSLTAGYGSNQIASHRSSLIAGPESTQITGNR--SMLIAGKGSSQTAGY 1116

Query: 1919 DSSNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMS 1978
S+ ISG++S + ++G+ S T+ S ++G+ S + S T+ +D +
Sbjct: 1117 RSTLISGADSVQMAGERGKLIAGADSTQTAGDRSKLLAGNNSYLTAGDRSKLTAGNDCIL 1176

Query: 1979 QSQSTSTSASG 1989
+ S +G
Sbjct: 1177 MAGDRSKLTAG 1187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2658   ENTEROTOXINA  28     0.005    Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 28.4 bits (63), Expect = 0.005
Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 30 IELFEHTFGLQKELVKYVGIAEATTAALYSASFINKNISRLASLSTIGILSVAA 83
I L++H G Q V+Y +T+ +L SA ++I L+ ST I +A
Sbjct: 57 INLYDHARGTQTGFVRYDDGYVSTSLSLRSAHLAGQSI--LSGYSTYYIYVIAT 108


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2659   NUCEPIMERASE  27     0.043    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 27.4 bits (61), Expect = 0.043
Identities = 9/32 (28%), Positives = 14/32 (43%)

Query: 23 IPRPIAFVTTLNQDASVNAAPFSFFNIVNNHP 54
IP T + + AP+ +NI N+ P
Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSP 265


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits          Score  E-value  Comments
SAV2661   SACTRNSFRASE  45     1e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 45.3 bits (107), Expect = 1e-08
Identities = 24/101 (23%), Positives = 46/101 (45%), Gaps = 5/101 (4%)

Query: 48 EKNDEVIGYIN--GPVIKERYISDDLFKNVSINNSEGGYISVLGLVVAPNYQGQGIAGRL 105
E +D + Y+ G Y+ ++ + I ++ GY + + VA +Y+ +G+ L
Sbjct: 51 EDDDMDVSYVEEEGKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTAL 110

Query: 106 LNYFETLAKNHHRHGVTLTCRE---SLISFYEKYGYRNEGV 143
L+ AK +H G+ L ++ S FY K+ + V
Sbjct: 111 LHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits     Score  E-value  Comments
SAV2665   HTHTETR  68     2e-16    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 67.7 bits (165), Expect = 2e-16
Identities = 16/48 (33%), Positives = 31/48 (64%)

Query: 2 KDKIIDNAITLFSEKGYDGTTLDDIAKSVNIKKASLYYHFDSKKSIYE 49
+ I+D A+ LFS++G T+L +IAK+ + + ++Y+HF K ++
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60
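
Each alignment above opens with a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. The following is a minimal Python sketch for pulling those numbers out of the text for downstream filtering. It is not part of PredictBias; the function name parse_hsp_header and the returned field names are hypothetical, and the patterns assume the exact line layout shown in this report.

```python
import re

# Patterns assume the layout used in this report, e.g.
#   "Score = 28.4 bits (63), Expect = 0.005"
#   "Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)"
SCORE_RE = re.compile(r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def parse_hsp_header(score_line, ident_line):
    """Hypothetical helper: extract score, E-value and identity counts."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(ident_line)
    if score is None or ident is None:
        raise ValueError("lines do not match the expected BLAST header layout")
    evalue_text = score.group(3)
    # Some BLAST versions print very small E-values as "e-100"; normalise them.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    identical, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(evalue_text),
        "identical": identical,
        "aligned": aligned,
        # Fraction of aligned positions that are identical.
        "identity_fraction": identical / aligned,
    }


if __name__ == "__main__":
    hit = parse_hsp_header(
        "Score = 28.4 bits (63), Expect = 0.005",
        "Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 2/54 (3%)",
    )
    print(hit)
```

The identity fraction is recomputed from the counts rather than read from the printed percentage, so it is not affected by rounding in the report.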



 
Contact Sachin Pundhir for Bugs/Comments.