PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: cg43genbank.gb
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic):
 
C) Potential GIs and PAIs in NC_022566
S.No  Start         End           Bias  Virulence  Insertion elements  Prediction
1     D364_RS00220  D364_RS00335  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS00220  2       13       0.855319    co-chaperone DjlA
D364_RS00225  1       16       1.463700    response regulator
D364_RS00230  0       17       1.910492    sensor histidine kinase
D364_RS00235  0       18       2.389179    2-hydroxycarboxylate transporter family protein
D364_RS00245  0       20       2.383828    citrate lyase holo-[acyl-carrier protein]
D364_RS00250  0       19       2.678709    bifunctional tRNA pseudouridine(32) synthase/23S
D364_RS00255  0       18       2.654928    RNA polymerase-associated protein RapA
D364_RS00260  -2      15       2.691285    DNA polymerase II
D364_RS00265  1       15       3.031495    L-ribulose-5-phosphate 4-epimerase
D364_RS00270  3       15       3.775141    L-arabinose isomerase
D364_RS00275  3       14       3.757734    ribulokinase
D364_RS00285  4       13       3.069288    arabinose operon transcriptional regulator AraC
D364_RS00290  4       12       2.930785    DedA family protein
D364_RS00295  4       12       2.764140    thiamine ABC transporter ATP-binding protein
D364_RS00300  1       9        0.691351    thiamine/thiamine pyrophosphate ABC transporter
D364_RS00305  0       11       -2.252688   thiamine ABC transporter substrate binding
D364_RS00310  0       12       -2.387563   HTH-type transcriptional regulator SgrR
D364_RS00315  -1      12       -4.384139   glucose uptake inhibitor SgrT
D364_RS00325  -1      10       -3.902600   MFS transporter
D364_RS00330  0       14       -4.668657   MFS transporter
D364_RS00335  0       11       -4.123491   LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00225  HTHFIS        63     8e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 63.3 bits (154), Expect = 8e-14
Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 10/128 (7%)

Query: 8 VLIIEDESELARLHAELVQKHPRLRLAGM----AASLAQARQLLHATPPQLVLLDNYLPD 63
+L+ +D++ + + + L AG ++ A + + A LV+ D +PD
Sbjct: 6 ILVADDDAAIRTVLNQA------LSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 64 GKGVTLMTDPALATSQCSVIFITAASDMETCSQAIRNGAFDYILKPVSWKRLSQSLERFI 123
L+ A V+ ++A + T +A GA+DY+ KP L + R +
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 124 QFYDQQRE 131
++
Sbjct: 120 AEPKRRPS 127
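
The `Score`/`Expect` and `Identities` lines that head each alignment in this report can be read programmatically. The following is a minimal Python sketch and is not part of PredictBias itself: the function names are hypothetical, and treating the 0.05 E-value from the input parameters as an inclusive upper bound is an assumption. It also handles the abbreviated form `e-155` that appears later in the report, which `float()` cannot parse without a leading digit.

```python
import re

# Patterns for the two summary lines that precede each alignment.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_expect(text):
    """Convert an Expect value to float; 'e-155' is read as '1e-155'."""
    return float("1" + text if text.startswith("e") else text)


def parse_summary(score_line, stats_line):
    """Return bit score, raw score, Expect and the match fractions."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)
    hit = {
        "bits": float(match.group(1)),
        "raw": int(match.group(2)),
        "expect": parse_expect(match.group(3)),
    }
    # Gaps may be absent when an alignment has none, so only the
    # fields actually present on the line are added.
    for name, num, den in FRACTION_RE.findall(stats_line):
        hit[name.lower()] = int(num) / int(den)
    return hit


def passes_cutoff(hit, e_value_cutoff=0.05):
    """Assumed rule: keep hits whose Expect is at most the cutoff."""
    return hit["expect"] <= e_value_cutoff


hit = parse_summary(
    "Score = 63.3 bits (154), Expect = 8e-14",
    "Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 10/128 (7%)",
)
print(hit["bits"], hit["expect"], round(100 * hit["identities"]))
```

The percentages printed in the report are the rounded fractions, for example 26/128 for the identities of the first hit.
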


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00255  BCTERIALGSPD  31     0.019    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.4 bits (71), Expect = 0.019
Identities = 26/171 (15%), Positives = 50/171 (29%), Gaps = 9/171 (5%)

Query: 578 LVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYH-EGLDAFEHTC 636
L+M L + R+D G + + + + LV + + +
Sbjct: 167 LLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGS 226

Query: 637 PTGRTVYDSVHDELINYLAAPESTDGFDDLIKSCRQQHDALKAQLEQGRDRLLEI-HSNG 695
V D + P S +IK +Q Q QG +++ + ++
Sbjct: 227 MVANVVADE-RTNAVLVSGEPNSRQRIIAMIKQLDRQ------QATQGNTKVIYLKYAKA 279

Query: 696 GEKAQALAESIEEQDDDTSLIAFSMNLFDIVGINQDDRGENLIVLTPSDHM 746
+ + L + L + I + LIV D M
Sbjct: 280 SDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVM 330


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00290  PHPHTRNFRASE  28     0.048    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 27.8 bits (62), Expect = 0.048
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 95 RALLEKTEHALHQHSMITILIGRFVGPTRPLVPMVAGMLDLPVAKFVLPNIIGCLLWPPL 154
R LEK + Q + +L G + + PM+A + +L AK ++ LL +
Sbjct: 362 RLCLEKQDIFRTQ--LRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGV 419

Query: 155 YFLPGILAGAAIDIPA 170
I G ++IP+
Sbjct: 420 DVSDSIEVGIMVEIPS 435


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00310  NEISSPPORIN   37     2e-04    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 36.5 bits (84), Expect = 2e-04
Identities = 17/81 (20%), Positives = 30/81 (37%), Gaps = 2/81 (2%)

Query: 367 YHAGEHYQ-GNWFPAYGLLPRWHHASNHACEKPAGLETVTLTYYRDHVEHRVIGGIMRDL 425
YH G +YQ +F Y L + + E ++ + HR++GG +
Sbjct: 180 YHVGLNYQNSGFFAQYAGLFQRYGEGTKKIEYDDQTYSIPSLFVEKLQVHRLVGGYDNNA 239

Query: 426 LAAHQVKLEIQELEYDAWHRG 446
L V + Q+ + G
Sbjct: 240 LYV-SVAAQQQDAKLYGAMSG 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00325  TCRTETA       33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 41/187 (21%), Positives = 67/187 (35%), Gaps = 17/187 (9%)

Query: 16 AAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWVGLFYTVNAIAGILVSLWLAKRSDS 75
AA M V F+M + G + A +F +G+ I L + +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272

Query: 76 RGDRRRLIMFCCLMAVGNALLFAFNRHYLTLITCGVMLASIANAAMPQLFALAREYADSS 135
R RR +M + +L AF V+LAS MP L A+ D
Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG-IGMPALQAMLSRQVDEE 331

Query: 136 AREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTTMFSIAAG-----IFVISLALIAI 190
+ + S SL ++GP L FT +++ + ++ AL +
Sbjct: 332 RQGQLQGSLAALT--SLTSIVGPLL---------FTAIYAASITTWNGWAWIAGAALYLL 380

Query: 191 KLPSVPR 197
LP++ R
Sbjct: 381 CLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00330  TCRTETA       61     3e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 61.0 bits (148), Expect = 3e-12
Identities = 71/415 (17%), Positives = 132/415 (31%), Gaps = 46/415 (11%)

Query: 30 LSVGTMINYLDRTILGI---VAPQLSKEIHID---PAMMGIIFSAFAWTYALAQIPGGMF 83
L V LD +G+ V P L +++ A GI+ + +A G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 84 LDRFGNKVTYALSIFFWSLFTLLQSFTLGLKSLLLLRLGLGVSEAPCFPANSRIVSTWFP 143
DRFG + +S+ ++ + + L L + R+ G++ A ++
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGAT-GAVAGAYIADITD 125

Query: 144 QHERARA----TATYTVGEYIGLAAFSPLLFLILEHHGWRTLFFLTGGLGILFTLVWWRF 199
ERAR +A + G G P+L ++ FF L L L
Sbjct: 126 GDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 200 YHEPHESRTANQAELEYIGANGINNKIQNVPFNWRDARRLLGCRQILGASLGQFAGNTTL 259
E H+ +N F W ++ + +
Sbjct: 181 LPESHKGERRPLRREA------LNPLAS---FRWARGMTVVAALMAVFFIMQLVGQ---- 227

Query: 260 VFFLTWFPSYLANERHLPWLHVGFFATWPFLAAAIGILFGGWISDRLLKRTGSVNISRKL 319
+ + + H +G + L I+ + R G
Sbjct: 228 -VPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQAMITGPVAARLGERRA---- 279

Query: 320 PIISGLLLSSC--IIAANWVSANSTVIIIMSVAFFGQGMVGLGWTLISDIAPENMAGLTG 377
++ G++ I+ A I++ +A G GM L ++S E G
Sbjct: 280 -LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEERQGQLQ 337

Query: 378 GIFNFCANMASIIAPLIIGVIISATGNFFYALIYVGLTALIGVIAYIFIIGDIKR 432
G ++ SI+ PL+ I +A+ + G + G Y+ + ++R
Sbjct: 338 GSLAALTSLTSIVGPLLFTAIYAAS-----ITTWNGWAWIAGAALYLLCLPALRR 387


2     D364_RS00560  D364_RS27235  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS00560  2       31       -2.509589   MFS transporter
D364_RS00565  3       34       -2.239447   aromatic amino acid transporter AroP
D364_RS00575  4       33       -2.314623   pyruvate dehydrogenase complex transcriptional
D364_RS00580  2       29       -1.587534   pyruvate dehydrogenase (acetyl-transferring),
D364_RS00585  0       25       -0.302789   pyruvate dehydrogenase complex
D364_RS00590  -1      20       -0.292045   dihydrolipoyl dehydrogenase
D364_RS00595  -3      13       -0.590988   hypothetical protein
D364_RS00600  -3      13       -1.022948   DUF2950 family protein
D364_RS00605  -1      13       -1.928368   DUF3300 domain-containing protein
D364_RS00615  -1      15       -3.014207   bifunctional aconitate hydratase
D364_RS00620  1       20       -5.464322   protein YacL
D364_RS00625  -1      17       -3.141758   adenosylmethionine decarboxylase
D364_RS00630  -2      13       -1.043659   polyamine aminopropyltransferase
D364_RS00635  -1      14       -0.872403   FKBP-type peptidyl-prolyl cis-trans isomerase
D364_RS00640  -1      15       -0.177345   hypothetical protein
D364_RS00645  0       13       -0.226287   YacC family pilotin-like protein
D364_RS00650  0       13       -0.230036   multicopper oxidase CueO
D364_RS00655  0       16       -1.337618   glucose/quinate/shikimate family membrane-bound
D364_RS27235  2       12       -2.845961   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00585  RTXTOXIND     36     6e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 35.6 bits (82), Expect = 6e-04
Identities = 44/285 (15%), Positives = 86/285 (30%), Gaps = 40/285 (14%)

Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGVVKEIKVSVGDKTETGKLIMIFD---------S 76
+ V +T G S E+ + +VKEI V G+ G +++
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 77 AEGAAAAAPAQEEKKEAAPAAAAPAAAAAAKEVHVPD---IGGDEVEVTEIMVKVG-DTI 132
+ + A ++ + + + K P + +EV ++K T
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 133 AAEQSLITVEGDKASMEVPAPFAGTVKEIKINTGDKVSTGSLIMIFEVAGAAPAAAPAQA 192
++ + DK E A + ++ +K + A A
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL-----LHKQAIAKHA 253

Query: 193 AAPAAAAPAAAAGVKDVNVPDIGGDEVEVTEVMVK-----------VGDKVA-------- 233
A V + E E+ + + DK+
Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 234 AEQSLITVEGDKASMEVPAPFAGTVKEIKIST-GDKVKTGSLIMV 277
L E + + + AP + V+++K+ T G V T +MV
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358



Score = 32.5 bits (74), Expect = 0.006
Identities = 17/96 (17%), Positives = 31/96 (32%), Gaps = 5/96 (5%)

Query: 17 ITEILVKVGDKVEAEQSLITV-----EGDKASMEVPSPQAGVVKEIKVSVGDKTETGKLI 71
+ EI+VK G+ V L+ + E D + QA + + + E KL
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166

Query: 72 MIFDSAEGAAAAAPAQEEKKEAAPAAAAPAAAAAAK 107
+ E +E + + + K
Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202



Score = 29.8 bits (67), Expect = 0.033
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 2/64 (3%)

Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGTVKEIKISTGDKVKTGSLIMVFEVEGAAPAAAP 289
+ VA +T G S E+ VKEI + G+ V+ G +++ GA
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 290 AQAA 293
Q++
Sbjct: 139 TQSS 142


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00635  INFPOTNTIATR  55     8e-11    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 55.4 bits (133), Expect = 8e-11
Identities = 51/213 (23%), Positives = 99/213 (46%), Gaps = 14/213 (6%)

Query: 322 SAAAKLSASGEQQAYAIGASMGSEALNVLTTRRTQGVTVDAGLVLQGIEDAFRG-QLRLG 380
+ A L+ ++ +Y+IGA +G + QG+ ++ ++ +G++D G QL L
Sbjct: 22 TDATSLTTDKDKLSYSIGADLGKNF-------KNQGIDINPDVLAKGMQDGMSGAQLILT 74

Query: 381 EQER----NKALFDVSQQVFQNLNKIEQKNISAGKKYQQAFARKKDVV-FKEGVYSRVDY 435
E++ +K D+ + NK ++N + G + A K +V G+ ++
Sbjct: 75 EEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIID 134

Query: 436 LGKG-KISGNDLVTVVIKEMLTDGTVINDMEAKDQALTQKLDAYPPVFREPLKRLQNHGS 494
G G K +D VTV L DGTV + E + T ++ P + E L+ + +
Sbjct: 135 AGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGST 194

Query: 495 VTLVVPPEKAYGSKGLPPKIPPGATMVYSVRIV 527
+ VP + AYG + + I P T+++ + ++
Sbjct: 195 WEVFVPADLAYGPRSVGGPIGPNETLIFKIHLI 227


3     D364_RS00715  D364_RS00860  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS00715  1       11       4.095123    polynucleotide adenylyltransferase PcnB
D364_RS00720  2       12       4.896968    tRNA glutamyl-Q(34) synthetase GluQRS
D364_RS00725  2       13       5.530097    RNA polymerase-binding protein DksA
D364_RS00730  4       17       6.696522    DNA/RNA nuclease SfsA
D364_RS00735  7       19       8.593306    RNA 2',3'-cyclic phosphodiesterase
D364_RS00740  7       19       8.543021    ATP-dependent helicase HrpB
D364_RS00745  6       21       7.954679    A24 family peptidase
D364_RS00750  6       22       8.033545    type II secretion system protein N
D364_RS00755  5       23       7.850772    type II secretion system protein M
D364_RS00760  3       23       7.067034    type II secretion system protein GspL
D364_RS00765  3       24       6.851746    type II secretion system minor pseudopilin GspK
D364_RS00770  4       23       6.961594    type II secretion system minor pseudopilin GspJ
D364_RS00775  0       20       4.677759    type II secretion system minor pseudopilin GspI
D364_RS00780  -2      18       3.334144    type II secretion system minor pseudopilin GspH
D364_RS00785  -2      18       2.314383    type II secretion system major pseudopilin GspG
D364_RS00790  -1      15       2.122476    type II secretion system inner membrane protein
D364_RS00795  -1      14       1.441440    type II secretion system ATPase GspE
D364_RS00800  -1      13       -0.055300   GspD family T2SS secretin variant PulD
D364_RS00805  0       14       -0.585212   type II secretion system protein GspC
D364_RS27240  1       13       1.031304    hypothetical protein
D364_RS00815  2       13       1.186294    pullulanase-type alpha-1,6-glucosidase
D364_RS00820  0       14       1.316895    type II secretion system assembly factor GspB
D364_RS00825  0       13       1.557668    type II secretion system pilot lipoprotein GspS
D364_RS27245  0       15       3.158895    hypothetical protein
D364_RS00840  0       16       3.640354    bifunctional glycosyl
D364_RS00845  -1      14       3.666622    ferrichrome porin FhuA
D364_RS00850  1       15       4.337298    Fe3+-hydroxamate ABC transporter ATP-binding
D364_RS00855  0       14       4.014611    Fe(3+)-hydroxamate ABC transporter
D364_RS00860  0       14       3.682168    Fe(3+)-hydroxamate ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00745  PREPILNPTASE  271    2e-93    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 271 bits (694), Expect = 2e-93
Identities = 138/277 (49%), Positives = 168/277 (60%), Gaps = 15/277 (5%)

Query: 1 MTTLAALSLHFPFVWYGFLLLFGLALGSFYNVVIYRLPRML---------------TQTA 45
M L L+ P++++ + LF L +GSF NVVI+RLP ML +
Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGV 60

Query: 46 DDERITLSTPGSSCPQCRQPISWRDNIPLLSFLWLGRRARCCQAPIAWSYPLTELATGLL 105
D+ L P S CP C PI+ +NIPLLS+LWL R R CQAPI+ YPL EL T LL
Sbjct: 61 DEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALL 120

Query: 106 FILAGALLAPGLPLAGGLVLLSFLLILARIDARTQLLPDRLTLPLLWAGLLFNLNEVYIA 165
+ LAPG L+L L+ L ID LLPD+LTLPLLW GLLFNL +++
Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVS 180

Query: 166 LPDAVAGAMAGYLALWSVYWLFRLLTGKEALGYGDFKLLAALGAWCGWQVLPQVLLLASA 225
L DAV GAMAGYL LWS+YW F+LLTGKE +GYGDFKLLAALGAW GWQ LP VLLL+S
Sbjct: 181 LGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSL 240

Query: 226 SGLVWTLLQRLWTRQSLQQPLAFGPWLALAGGGIFLW 262
G + L +P+ FGP+LA+AG LW
Sbjct: 241 VGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00770  BCTERIALGSPG  33     3e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 33.3 bits (76), Expect = 3e-04
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 5/63 (7%)

Query: 4 KMRGFTLIETLLALAILAVLSAAAV-MVLQNVIRADGLTREKSQ-QIAALQRAFRQIADD 61
K RGFTL+E ++ + I+ VL++ V ++ N +AD ++K+ I AL+ A D
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD---KQKAVSDIVALENALDMYKLD 62

Query: 62 VTH 64
H
Sbjct: 63 NHH 65


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00775  BCTERIALGSPG  32     2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 2e-04
Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 8/99 (8%)

Query: 1 MKREAGMTLIEVMVALVIF-ALAGLAV---MQSTLQQTRQLGRMEEKILASWLADNQLVQ 56
++ G TL+E+MV +VI LA L V M + + +Q + L + L +L
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 57 LRLEKRWPALS--WSETTVEAAGTRWFVRWQGVETALPQ 93
L T+ + +G LP
Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANY--NKEGYIKRLPA 100


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00780  BCTERIALGSPH  177    1e-59    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 177 bits (450), Expect = 1e-59
Identities = 98/164 (59%), Positives = 125/164 (76%)

Query: 1 MSQRGFTLLEMMLVLLLIGVSASMVLLAFPSARTQEATQILARFQTQLDFVRERGQQTGQ 60
M QRGFTLLEMML+LLL+GVSA MVLLAFP++R A Q LARF+ QL FV++RG QTGQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 LFGIIIHPERWQFMRLQPADDSAPAAADDRWGNAQWLPLQAGRVTTAETLPRARLTLRFP 120
FG+ +HP+RWQF+ L+ D + PA ADD W +WLPL+AGRV T+ ++ +L L F
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAFA 120

Query: 121 DGQAWTPGEQPDVLIFPGGEVTPFQLRIDAATGINVDAQGDSQP 164
G+AWTPG+ PDVLIFPGGE+TPF+L + A GI +A+G+S P
Sbjct: 121 QGEAWTPGDNPDVLIFPGGEMTPFRLTLGEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00785  BCTERIALGSPG  243    2e-86    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 243 bits (621), Expect = 2e-86
Identities = 98/140 (70%), Positives = 112/140 (80%)

Query: 1 MQRQRGFTLLEIMVVIVILGILASLVVPNLMGNKEKADRQKVVSDLVALEGALDMYKLDN 60
+QRGFTLLEIMVVIVI+G+LASLVVPNLMGNKEKAD+QK VSD+VALE ALDMYKLDN
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 61 SRYPNTEQGLQALVTAPAAEPHARNYPEGGYIRRLPQDPWGNEYQLLSPGQHGAIDVFSV 120
YP T QGL++LV AP P A NY + GYI+RLP DPWGN+Y L++PG+HGA D+ S
Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSA 123

Query: 121 GPDGMPDTNDDIGNWTLGKK 140
GPDG T DDI NW L KK
Sbjct: 124 GPDGEMGTEDDITNWGLSKK 143


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00790  BCTERIALGSPF  512    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 512 bits (1321), Expect = 0.0
Identities = 277/407 (68%), Positives = 335/407 (82%), Gaps = 4/407 (0%)

Query: 1 MALFRYQALDAQGKTRRGLQQADSARHARQLLRDKGWLALEVTTADPARRLWAGGSLT-- 58
MA + YQALDAQGK RG Q+ADSAR ARQLLR++G + L V ++ L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 59 --RRTSAGDLALLTRQLATLVAAGIPLEKALDAVAQQCEKPSLRTLMAGVRSKVLEGHSL 116
R S DLALLTRQLATLVAA +PLE+ALDAVA+Q EKP L LMA VRSKV+EGHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 AEAMRGYPACFDGLFCAMVAAGETSGHLDGVLNRLANYTEQRQQLRARLLQAMIYPIVLT 176
A+AM+ +P F+ L+CAMVAAGETSGHLD VLNRLA+YTEQRQQ+R+R+ QAMIYP VLT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 177 LVAISVIAILLSTVVPKVVEQFVHLKQALPFSTRLLMSLSDIVRSAGPWLALLSLLALLA 236
+VAI+V++ILLS VVPKVVEQF+H+KQALP STR+LM +SD VR+ GPW+ L L +A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 237 LRYLLRQPARRLAWDRMLLRLPVIGRVARSVNSARYARTLSILNASAVPLLLSMRISADV 296
R +LRQ RR+++ R LL LP+IGR+AR +N+ARYARTLSILNASAVPLL +MRIS DV
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 297 LSNAWARSQLAAASESVREGVSLHRALESTALFPPMMRYMIASGEQSGELTAMLERAAEN 356
+SN +AR +L+ A+++VREGVSLH+ALE TALFPPMMR+MIASGE+SGEL +MLERAA+N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 357 QDRELSAQIQMALSLFEPLLVVTMAGMVLFIVLAILQPILQLNTLMS 403
QDRE S+Q+ +AL LFEPLLVV+MA +VLFIVLAILQPILQLNTLMS
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00800  BCTERIALGSPD  839    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 839 bits (2169), Expect = 0.0
Identities = 606/646 (93%), Positives = 631/646 (97%)

Query: 10 ALLILTPLLFSPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 69
LLI LLF PAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML
Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 70 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRAKDAKTSAVPVASAAAPGEGDEVVTRVV 129
NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVR+KDAKT+AVPVAS AAPG GDEVVTRVV
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVV 132

Query: 130 PLTNVAARDLAPLLRQLNDNAGAGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 189
PLTNVAARDLAPLLRQLNDNAG GSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR
Sbjct: 133 PLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192

Query: 190 SVVTVPLSWASAAEVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 249
SVVTVPLSWASAA+VVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI
Sbjct: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 252

Query: 250 IAMIKQLDRQQAVQGNTKVIYLKYAKAADLVEVLTGISSSLQSDKQSARPVAAIDKNIII 309
IAMIKQLDRQQA QGNTKVIYLKYAKA+DLVEVLTGISS++QS+KQ+A+PVAA+DKNIII
Sbjct: 253 IAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIII 312

Query: 310 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 369
KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN
Sbjct: 313 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 372

Query: 370 AGMTQFTNSGLPISTAIAGANQYNKDGTISSSLASALGSFNGIAAGFYQGNWAMLLTALS 429
AGMTQFTNSGLPISTAIAGANQYNKDGT+SSSLASAL SFNGIAAGFYQGNWAMLLTALS
Sbjct: 373 AGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALS 432

Query: 430 SSTKNDILATPSIVTLDNMQATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 489
SSTKNDILATPSIVTLDNM+ATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP
Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 492

Query: 490 QINEGDAVLLEIEQEVSSVADSASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKT 549
QINEGD+VLLEIEQEVSSVAD+ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDK+
Sbjct: 493 QINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552

Query: 550 VTDTADKVPLLGDIPVIGALFRSDSKKVSKRNLMLFIRPTIIRDRDEYRQASSGQYTAFN 609
V+DTADKVPLLGDIPVIGALFRS SKKVSKRNLMLFIRPT+IRDRDEYRQASSGQYTAFN
Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612

Query: 610 NAQTKQRGKESSEASLSNDLLHIYPQQETQAFRQVSAAIDAFNLGG 655
+AQ+KQRGKE+++A L+ DLL IYP+Q+T AFRQVSAAIDAFNLGG
Sbjct: 613 DAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFRQVSAAIDAFNLGG 658


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00805  BCTERIALGSPC  213    7e-71    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 213 bits (544), Expect = 7e-71
Identities = 98/266 (36%), Positives = 159/266 (59%), Gaps = 7/266 (2%)

Query: 17 KLLPQIVTLIILITAIPQLAKLTWRVVFPVSPEDISALPLTMPPAADPELKNVRPAFTLF 76
++ +I+ ++++ QLA + WR+ P ++ + + PA + FTLF
Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLP---DNAPVSSVQITPAQARQQPVTLNDFTLF 68

Query: 77 GLAVKISPTPT-DAASLNQVPVSSLKLRLAGLLASSNPARSIAIIEKGNQQVSLSTGDPL 135
G++ + + DA+ ++ +P S+L L L G++A + +RSIAII K N+Q S + +
Sbjct: 69 GVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEV 128

Query: 136 PGYDARIAAILPDRIIVNYQGRKEAILLFNDSRAPSPPPTAAGNPPLVKRLREQPQNILT 195
PGY+A+I +I PDR+++ YQGR E + L++ + S G + + +
Sbjct: 129 PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVP--GAQVNEQLQQRASTTMSD 186

Query: 196 YLSISPVLSGDKLLGYRLNPGKDASLFRQSGLQANDLAIALNGIDLRDQEQAQQALQNLA 255
Y+S SP+++ +KL GYRLNPG + F + GLQ ND+A+ALNG+DLRD EQA++A++ +A
Sbjct: 187 YVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMA 246

Query: 256 DMTEITLTVEREGQRHDIAFAL-GDE 280
D+ TLTVER+GQR DI GDE
Sbjct: 247 DVHNFTLTVERDGQRQDIYMEFGGDE 272


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00855  FERRIBNDNGPP  430    e-155    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 430 bits (1107), Expect = e-155
Identities = 213/291 (73%), Positives = 244/291 (83%)

Query: 6 LITRRRLLIAMALSPLLWQMRGAQAADVDPQRVVALEWLPAELLLALGVTPYGVADIPNY 65
LI+RRRLL AMALSPLLWQM A AA +DP R+VALEWLP ELLLALG+ PYGVAD NY
Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65

Query: 66 RLWVNEPALPDSVIDVGLRTEPNLELLTQMKPSFIVWSAGYGPSPEKLARIAPGRGFTFS 125
RLWV+EP LPDSVIDVGLRTEPNLELLT+MKPSF+VWSAGYGPSPE LARIAPGRGF FS
Sbjct: 66 RLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFS 125

Query: 126 DGKRPLAMAQRSLLDMADLLGKTQQAKRHLAEFDALMESLRPRFAGRGDRPLLMISLLDP 185
DGK+PLAMA++SL +MADLL A+ HLA+++ + S++PRF RG RPLL+ +L+DP
Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185

Query: 186 RHVLVFGENCLFQEVLDRFGIKNAWHGEAAFWGSVSVGIDRLAAFNEADVICFDHGNERD 245
RH+LVFG N LFQE+LD +GI NAW GE FWGS +V IDRLAA+ + DV+CFDH N +D
Sbjct: 186 RHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245

Query: 246 MAQLLATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFARVLADAQGRPA 296
M L+ATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHF RVL +A G A
Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296


4     D364_RS00910  D364_RS00970  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS00910  3       22       -2.598900   DUF3461 family protein
D364_RS00915  2       24       -2.273697   2,3,4,5-tetrahydropyridine-2,6-dicarboxylate
D364_RS00920  3       24       -2.606434   bifunctional
D364_RS00925  3       31       -3.568981   type I methionyl aminopeptidase
D364_RS00930  3       30       -3.571285   30S ribosomal protein S2
D364_RS00935  1       21       -2.601206   elongation factor Ts
D364_RS00940  -1      19       -2.154726   UMP kinase
D364_RS00945  0       18       -2.678484   ribosome recycling factor
D364_RS00950  0       17       -2.813638   1-deoxy-D-xylulose-5-phosphate reductoisomerase
D364_RS00955  1       19       -3.564159   (2E,6E)-farnesyl-diphosphate-specific
D364_RS00960  1       19       -3.573044   phosphatidate cytidylyltransferase
D364_RS00965  2       21       -3.953510   sigma E protease regulator RseP
D364_RS00970  2       21       -3.113103   outer membrane protein assembly factor BamA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00940  CARBMTKINASE  30     0.009    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.8 bits (67), Expect = 0.009
Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 14/66 (21%)

Query: 120 AEAI-SLLRNNRVVILSAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165
AE I L+ +VI S G G P D A E+ AD+ + T
Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235

Query: 166 KVDGVF 171
V+G
Sbjct: 236 DVNGAA 241


5     D364_RS01050  D364_RS01155  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS01050  2       13       -1.257644   envelope stress response activation lipoprotein
D364_RS01055  0       15       -1.414963   YaeF family permuted papain-like enzyme
D364_RS01060  -1      17       -1.986215   proline--tRNA ligase
D364_RS01065  -1      17       -2.949706   tRNA
D364_RS01070  -2      19       -3.431639   Rcs stress response system protein RcsF
D364_RS01075  -2      16       -3.009241   methionine ABC transporter substrate-binding
D364_RS01080  -2      14       -1.559395   methionine ABC transporter permease MetI
D364_RS01085  -2      12       -1.379126   methionine ABC transporter ATP-binding protein
D364_RS01090  -2      12       -0.844881   D-glycero-beta-D-manno-heptose 1,7-bisphosphate
D364_RS01125  -1      14       -0.936484   ***2,5-didehydrogluconate reductase DkgB
D364_RS01130  -1      16       -2.460109   LysR family transcriptional regulator
D364_RS01135  0       18       -2.743129   MFS transporter
D364_RS01140  0       20       -3.697140   endonuclease/exonuclease/phosphatase family
D364_RS01145  -1      21       -4.351424   class I SAM-dependent methyltransferase
D364_RS01150  -1      19       -4.299897   murein transglycosylase D
D364_RS01155  -1      20       -4.429930   hydroxyacylglutathione hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01135  TCRTETA       61     2e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 61.4 bits (149), Expect = 2e-12
Identities = 82/390 (21%), Positives = 147/390 (37%), Gaps = 30/390 (7%)

Query: 3 LALFALTIGAFAIGTTEFVIVGLVPTIAQQLSISLPSA---GLLVSIYALGVAIGAPVLT 59
+ L + + A IG +I+ ++P + + L S G+L+++YAL APVL
Sbjct: 9 VILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64

Query: 60 ALTGRMPRKQLLLALMVLFTAGNILAWQAPGYETLILARLLTGLAHGVFFSIGSTIATSL 119
AL+ R R+ +LL + + AP L + R++ G+ G+ IA
Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124

Query: 120 VAKEKAASAIAIMFGGLTVALVTGVPFGTFIGQHFGWRETFLAVSILGVIALISSLLLVP 179
E+A M +V G G +G F F A + L + ++ L+P
Sbjct: 125 DGDERAR-HFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 180 NNIPGRASASLRDQLRVLTHPRLLMIYAITALGYGGVFTAF-------TFLAPMMQELAG 232
+ G R+ L L R + A F ++
Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242

Query: 233 FSPSAVSWILLGYGVSVAIGNVW-GGKLADKHGAVSALK--FIFAALVLLLLVFQLTASV 289
+ + + L +G+ ++ G +A + G AL I +LL F A+
Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF---ATR 299

Query: 290 HYAALATVLVMGVFAFGNVPGLQVYVVQKAEQYTPGAVDIASGLNIAAFNIGIALGSIVG 349
+ A ++++ G +P LQ + ++ ++ G G A ++ +G ++
Sbjct: 300 GWMAFPIMVLLASGGIG-MPALQAMLSRQVDEERQGQ---LQGSLAALTSLTSIVGPLLF 355

Query: 350 GQTVERYGLAQTPWIG-AMIVLVALLLVVL 378
Y + T W G A I AL L+ L
Sbjct: 356 TAI---YAASITTWNGWAWIAGAALYLLCL 382


6     D364_RS01300  D364_RS01475  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS01300  0       22       3.408554    4-aminobutyrate--2-oxoglutarate transaminase
D364_RS01305  -1      19       3.003308    NADP-dependent succinate-semialdehyde
D364_RS01310  0       12       -0.312889   L-2-hydroxyglutarate oxidase
D364_RS01315  -1      13       -2.775764   carbon starvation induced protein CsiD
D364_RS01320  0       18       -3.931576   YjhX family toxin
D364_RS01325  0       18       -4.571484   class I SAM-dependent methyltransferase
D364_RS01330  1       21       -6.651054   glutathione S-transferase family protein
D364_RS01335  1       23       -7.681071   PTS galactitol transporter subunit IIC
D364_RS01340  1       23       -6.945877   2-hydroxyacid dehydrogenase
D364_RS01345  0       20       -5.123733   SDR family oxidoreductase
D364_RS01350  0       21       -3.238370   carbohydrate kinase
D364_RS01355  -1      22       -1.987066   glycerol-3-phosphate responsive antiterminator
D364_RS01360  0       22       -0.893773   phospho-2-dehydro-3-deoxyheptonate aldolase
D364_RS01365  -2      22       -1.404221   Zn-finger protein
D364_RS01370  -1      24       -2.283072   dihydrolipoyl dehydrogenase
D364_RS01375  -1      28       -3.528436   2-oxo acid dehydrogenase subunit E2
D364_RS01380  0       37       -8.296077   alpha-ketoacid dehydrogenase subunit beta
D364_RS01385  1       45       -10.995245  thiamine pyrophosphate-dependent dehydrogenase
D364_RS01390  2       51       -14.286633  AcoK
D364_RS26835  7       74       -22.535317  hypothetical protein
D364_RS26840  7       73       -22.327555  hypothetical protein
D364_RS26845  6       62       -17.380827  hypothetical protein
D364_RS25685  6       55       -15.033241  hypothetical protein
D364_RS01395  7       54       -13.529700  winged helix-turn-helix domain-containing
D364_RS01400  8       53       -12.070544  type 1 fimbrial protein
D364_RS01405  8       53       -11.711846  type 1 fimbrial protein
D364_RS01410  8       51       -11.929330  outer membrane usher protein
D364_RS01415  8       55       -14.418803  fimbria/pilus periplasmic chaperone
D364_RS01420  9       62       -16.336989  hypothetical protein
D364_RS01425  9       63       -17.417677  fimbrial protein
D364_RS26850  6       58       -15.317011  hypothetical protein
D364_RS26855  0       19       -4.234034   hypothetical protein
D364_RS25690  -1      15       -2.300286   helix-turn-helix transcriptional regulator
D364_RS27250  -1      14       -1.336174   hypothetical protein
D364_RS01440  -3      13       -0.275265   tyrosine-type recombinase/integrase
D364_RS01450  -1      17       2.156176    *glutamate-5-semialdehyde dehydrogenase
D364_RS01455  -2      16       1.825044    glutamate 5-kinase
D364_RS01465  -1      16       1.749157    phosphoporin PhoE
D364_RS01470  2       19       3.584899    lipoprotein
D364_RS01475  1       17       3.195912    DMT family transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01345  DHBDHDRGNASE  100    1e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (251), Expect = 1e-27
Identities = 73/253 (28%), Positives = 124/253 (49%), Gaps = 16/253 (6%)

Query: 8 RTAIVTGGATGLGREFVLSLAKEGVNIC-FTYMREEEHPERLIETVKASANVEIIAVKTD 66
+ A +TG A G+G +LA +G +I Y E+ E+++ ++KA A A D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL--EKVVSSLKAEARHAE-AFPAD 65

Query: 67 LSDEQSRENLFATCIDRLGKADILVNNAGIWLSGYVTEICPQDWDLVMNVNLKAIFHLSQ 126
+ D + + + A +G DILVN AG+ G + + ++W+ +VN +F+ S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 LFVNHCLQHDQMGSILNITSQAAFHGSTTGHAHYAASKAGLVAFAISLAREVAKQKINVN 186
+ + + GSI+ + S A T A YA+SKA V F L E+A+ I N
Sbjct: 126 SVSKY-MMDRRSGSIVTVGSNPA-GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 187 NIAVGIMDTAMIRKN-IEQNPDYYVSR---------IPVGRVAQPQEIADIGVFMVSPKT 236
++ G +T M ++N V + IP+ ++A+P +IAD +F+VS +
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 237 SYMTGATLDVTGG 249
++T L V GG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01375  RTXTOXIND  31  0.009  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.3 bits (71), Expect = 0.009
Identities = 14/56 (25%), Positives = 29/56 (51%), Gaps = 4/56 (7%)

Query: 44 SKIVNVLEAPFAGTLRRILAREGETLQVGAVLALAADASVSDAELDEFVARLATAK 99
SK + +E ++ I+ +EGE+++ G VL A ++A+ + + L A+
Sbjct: 96 SKEIKPIEN---SIVKEIIVKEGESVRKGDVLLK-LTALGAEADTLKTQSSLLQAR 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01410  PF00577  718  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 718 bits (1855), Expect = 0.0
Identities = 224/866 (25%), Positives = 379/866 (43%), Gaps = 57/866 (6%)

Query: 15 LGLFIFVSLSPLVMKAYATDNIQFNTDVLDVRDRKNIDLSQFSRSGYIMPGTYDMVVHIN 74
+ +FV+ + ++ + FN L + DLS+F + PGTY + +++N
Sbjct: 26 FFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLN 85

Query: 75 KNTLPEQEIPFYEPDDDPNGSRACINPKLVEQLGLKPGVLKDLAWWHKGGCLDKRS-VKG 133
+ +++ F D G C+ + +GL + + C+ S +
Sbjct: 86 NGYMATRDVTFN-TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144

Query: 134 MEIRGDLPTASLYLSIPQAYLEYTDENWDPPSRWDEGVAGLLLDYNLNASSQHQQSEGSN 193
+ D+ L L+IPQA++ + PP WD G+ LL+YN + +S + G N
Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRI-GGN 203

Query: 194 TQALSGNGTAGGNLGSWRFRADWQANLDHSNGSEQSTQKQFDLSRYYAYRAIPGLHSKLT 253
+ N +G N+G+WR R + + + S+ S ++ ++ + R I L S+LT
Sbjct: 204 SHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSS-SGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 254 LGENYLDSGMFDSFRFTGVSLISDDNMLPPNLRGYAPEVTGIAKTNAKVIISQQGRVLYE 313
LG+ Y +FD F G L SDDNMLP + RG+AP + GIA+ A+V I Q G +Y
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 314 TSVASGPFRIQDIN-EAVSGELNVRVEEQDGGVQEFVVNTANIPYLTRPGSVRFKLTMGK 372
++V GPF I DI SG+L V ++E DG Q F V +++P L R G R+ +T G+
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 373 PSDWQHHLRDPMFGTGELSWGISNGWSLYGGILRGGDYNALSLGIGRDLMFLGALSFDAT 432
P F L G+ GW++YGG Y A + GIG+++ LGALS D T
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442

Query: 433 HSRVRLPWEDLTLNGDSYRLSYSKNFEEYDSQVTFAGYRFSEQDFMTMSEYLDARSYGTR 492
+ LP +D +G S R Y+K+ E + + GYR+S + ++ +R G
Sbjct: 443 QANSTLP-DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501

Query: 493 -------------------SNGNGKEMYTVNLNKHFRKLELSSYINFSRETYWDRPTTDR 533
N + + + + + + Y++ S +TYW D
Sbjct: 502 IETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT-STLYLSGSHQTYWGTSNVDE 560

Query: 534 -YNITLSHYFDLGRFRGVSISLSAYRNQYNGTEDNGAYVSMSIPWSD-----------SS 581
+ L+ F+ + ++S S +N + D ++++IP+S +
Sbjct: 561 QFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 582 TVSYNA-MVSHNDNTHRVGYYDRVDEHN--NYQLSAG-----NSSRGVSVSGYYSHEGDM 633
+ SY+ + T+ G Y + E N +Y + G + + G + ++ G
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 634 ARMSANASYQDGRYSAIGLTMQGGLTLTSEGGAMHRSGMMGGTRMLIDTEGVPDVPVRGY 693
+ S+ D + + GG+ + G + + + T +L+ G D V
Sbjct: 678 GNANIGYSHSDD-IKQLYYGVSGGVLAHANGVTLGQ--PLNDTVVLVKAPGAKDAKVEN- 733

Query: 694 GSTSRTNAWGKAVISDVNSYYRNKASIDLNQLGDNIEATVSVVQATLTEGAIGYRKFDVI 753
+ RT+ G AV+ Y N+ ++D N L DN++ +V T GAI +F
Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793

Query: 754 SGAKAMAAIKLADGSEPPFGATVINKRKQETGIVNDSGNVYLSGINAGETMVVHWGGSAQ 813
G K + + + PFGA V ++ Q +GIV D+G VYLSG+ + V WG
Sbjct: 794 VGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852

Query: 814 CEVRMPALL---QPDMLMNTLRLLCK 836
L L+ L C+
Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01425  FIMBRIALPAPF  40  9e-07  Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 40.1 bits (93), Expect = 9e-07
Identities = 37/160 (23%), Positives = 71/160 (44%), Gaps = 14/160 (8%)

Query: 24 LSGLSTQATGVPNSAFQVKVNIVSPPCIINNNEDIIVSFGEMMATRVDGNHYRVPVNYTL 83
+S L T + + ++ N+ PPC INN ++I+V FG + VD + V N ++
Sbjct: 8 ISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISI 67

Query: 84 DCKNTSSRAMKLQMQGSSTSF-DGTLLGTDNPALGIKILND---ATPLSVNTWMNFTYPD 139
C S ++ +++ G++ +L T+ GI + +TPL++ Y
Sbjct: 68 SCPYKSG-SLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRV 126

Query: 140 KPEL---------WAVPVKHSGVTLSTGEFFAVATLKIDY 170
L +VP ++ L+ G+F A++ + Y
Sbjct: 127 TAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS25690  TETREPRESSOR  28  0.004  Tetracycline repressor protein signature.
		>TETREPRESSOR#Tetracycline repressor protein signature.

Length = 218

Score = 27.6 bits (61), Expect = 0.004
Identities = 9/26 (34%), Positives = 17/26 (65%)

Query: 25 IKGTSVKNIAKRLGLQIKTVYAHRSN 50
I G + + +A++LG++ T+Y H N
Sbjct: 22 IDGLTTRKLAQKLGIEQPTLYWHVKN 47


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01455  CARBMTKINASE  38  4e-05  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.9 bits (88), Expect = 4e-05
Identities = 31/140 (22%), Positives = 53/140 (37%), Gaps = 20/140 (14%)

Query: 119 DTLRALLDNSI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169
+T++ L++ + VPVI E+ + E V D D A AD ++LT
Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235

Query: 170 DQPGLFTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMGTKLQAA-DVACRAG 228
D G + + +++V +++ + G MG K+ AA G
Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289

Query: 229 IDTIIAAGNRPDVIGHAMAG 248
IIA + A+ G
Sbjct: 290 ERAIIAH---LEKAVEALEG 306



Score = 29.0 bits (65), Expect = 0.032
Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%)

Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAMGHRIVIVTSG-------- 51
+ +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 52 -AIAAGREHLGYPELP 66
+ AG+ G P P
Sbjct: 62 LHMDAGQATYGIPAQP 77


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01465  ECOLIPORIN  536  0.0  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 536 bits (1381), Expect = 0.0
Identities = 225/384 (58%), Positives = 263/384 (68%), Gaps = 35/384 (9%)

Query: 1 MKKSTLALMMMGFVASTATQAAEVYNKNANKLDVYGKIKAMHYFSDYDSKDGDQTYVRFG 60
MK+ LAL++ +A+ A AAE+YNK+ NKLD+YGK+ +HYFSD SKDGDQTY+R G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 IKGETQINDDLTGYGRWESEFSGNKTESDSSQ-KTRLAFAGVKLKNYGSFDYGRNLGALY 119
KGETQIND LTGYG+WE N TE + + TRLAFAG+K +YGSFDYGRN G LY
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 120 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGLVDGLDLTLQYQGKNE--- 176
DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFGLVDGL+ LQYQGKNE
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 177 -------------GREAKKQNGDGVGTSLSYDFGGSDFAVSAAYTSSDRTNDQNLLAR-- 221
G + + NGDG G S +YD G F+ AAYT+SDRTN+Q
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239

Query: 222 GQGSKAEAWATGLKYDANNIYLATMYSETRKMTP-------ISGGFANKAQNFEAVAQYQ 274
G KA+AW GLKYDANNIYLATMYSETR MTP GG ANK QNFE AQYQ
Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299

Query: 275 FDFGLRPSLGYVLSKGKDIE----GVGSEDLVNYIDVGLTYYFNKNMNAFVDYKINQLKS 330
FDFGLRP++ +++SKGKD+ +DLV Y DVG TYYFNKN + +VDYKIN L
Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359

Query: 331 DNKL----GINDDDIVALGMTYQF 350
D+ GI+ DDIVALGM YQF
Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383


7  D364_RS01525  D364_RS01620  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS01525  -1      19       3.160865    fimbrial biogenesis outer membrane usher
D364_RS01530  -2      19       2.875647    fimbrial adhesin EcpD
D364_RS01535  0       19       4.512380    fimbria/pilus periplasmic chaperone
D364_RS01540  -1      20       4.448893    amino acid ABC transporter permease
D364_RS01545  -1      21       3.386486    amino acid ABC transporter permease
D364_RS01550  -1      22       2.537766    amino acid ABC transporter ATP-binding protein
D364_RS01555  -1      22       1.750998    ABC transporter substrate-binding protein
D364_RS26865  0       22       1.349816    hypothetical protein
D364_RS01565  0       21       0.863592    ABC transporter ATP-binding protein
D364_RS01570  1       20       1.570578    ABC transporter ATP-binding protein
D364_RS01575  -1      19       2.205160    branched-chain amino acid ABC transporter
D364_RS01580  0       19       3.058776    branched-chain amino acid ABC transporter
D364_RS01585  0       18       3.587678    ABC transporter substrate-binding protein
D364_RS27260  0       16       3.911204    hypothetical protein
D364_RS01595  0       17       5.225315    ethanolamine ammonia-lyase subunit EutC
D364_RS01600  -1      17       4.610926    ethanolamine ammonia-lyase subunit EutB
D364_RS01605  0       17       4.572964    ethanolamine permease
D364_RS27265  0       17       4.491599    hypothetical protein
D364_RS01615  -1      17       4.044724    S-methylmethionine permease
D364_RS01620  -1      15       4.073764    homocysteine S-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01525  PF00577  66  3e-13  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 66.4 bits (162), Expect = 3e-13
Identities = 33/247 (13%), Positives = 73/247 (29%), Gaps = 23/247 (9%)

Query: 487 TLNLNALWSKLGTFSVSYNDDRRYNSHYYTADYYQTVYSGAFGSLGLRAGIQRYNNGDSS 546
L + + T +S + Y + +Q + AF + N
Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588

Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTI 594
+ +AL++++P +W + Q + A+ S + +
Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647

Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYVNTNLTASGSVGWQGK 654
++ +G + +G A + Y + + S +D SG V
Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706

Query: 655 NIAASGRTDGNAGVIFNTGLED---DGQISARVNGRIFPLSGKRNYLPLSPYGRYEVELQ 711
+ + ++ G +D + Q R + R G + Y V L
Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761

Query: 712 NSKNSLD 718
+ + +
Sbjct: 762 TNTLADN 768


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01580  FERRIBNDNGPP  29  0.018  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 29.1 bits (65), Expect = 0.018
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 159 LWLLYRTRY--GMAIRAVAFDVNTVRLMGIDANRIISLVFALGSSLAALGGVFYSIS 213
L L+ R R MA+ + + +NT ID NRI++L + L ALG V Y ++
Sbjct: 4 LPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60


8  D364_RS01815  D364_RS01875  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS01815  1       19       -4.800493   class II fructose-1,6-bisphosphate aldolase
D364_RS01820  2       16       -4.415146   kinase
D364_RS01825  2       17       -3.386823   PTS ascorbate transporter subunit IIC
D364_RS01830  1       13       -2.475140   PTS sugar transporter subunit IIB
D364_RS01835  2       12       -1.974057   PTS sugar transporter subunit IIA
D364_RS01845  2       16       -1.553617   peroxiredoxin
D364_RS01850  2       16       -1.098568   ACP phosphodiesterase
D364_RS27270  3       20       -1.484758   hypothetical protein
D364_RS01860  3       19       -1.165335   tRNA preQ1(34) S-adenosylmethionine
D364_RS01865  2       18       -0.889636   tRNA guanosine(34) transglycosylase Tgt
D364_RS01870  2       21       -1.011973   preprotein translocase subunit YajC
D364_RS01875  2       18       -0.421322   protein translocase subunit SecD
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01875  SECFTRNLCASE  70  5e-15  Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 69.9 bits (171), Expect = 5e-15
Identities = 37/183 (20%), Positives = 88/183 (48%), Gaps = 5/183 (2%)

Query: 433 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIL-FYKKFGLIATSALIANLILIV 491
++I ++GP + + + + + LA VV + ++ + F +F L A AL+ +++L V
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 492 GIMSLIPGATLTMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAIDEGYRGA 549
G+ +++ + +A ++ +++ V++ +R++E L ++ ++
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 550 FSSIFDANVTTLIKVIILYAVGTGAIKGFAITTGIGIATSMFTAIVGTRAIVNLLYGGKR 609
S +TTL+ ++ + G I+GF G+ T ++++ + IV L G R
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV-LFIGLDR 312

Query: 610 VKK 612
K+
Sbjct: 313 NKE 315


9  D364_RS01975  D364_RS02120  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS01975  -2      15       3.691351    YajQ family cyclic di-GMP-binding protein
D364_RS01980  -2      15       4.135509    MFS transporter
D364_RS01985  -2      14       5.024469    Gfo/Idh/MocA family oxidoreductase
D364_RS01990  -2      14       5.711334    aromatic acid/H+ symport family MFS transporter
D364_RS01995  -1      14       6.057419    sugar phosphate isomerase/epimerase
D364_RS02000  0       14       5.293547    FAD-dependent oxidoreductase
D364_RS02005  -1      13       2.746605    NIPSNAP family protein
D364_RS02010  0       13       2.236299    FAD-dependent oxidoreductase
D364_RS02015  2       13       -0.642818   shikimate dehydrogenase
D364_RS02020  3       22       -3.423140   helix-turn-helix domain-containing protein
D364_RS02025  4       27       -5.051189   hypothetical protein
D364_RS02030  2       22       -3.473912   heme o synthase
D364_RS02035  2       23       -3.524355   cytochrome o ubiquinol oxidase subunit IV
D364_RS02040  1       22       -3.446088   cytochrome o ubiquinol oxidase subunit III
D364_RS02045  2       25       -3.633568   cytochrome o ubiquinol oxidase subunit I
D364_RS02050  2       20       -3.311220   cytochrome o ubiquinol oxidase subunit II
D364_RS02055  3       20       -2.183334   muropeptide MFS transporter AmpG
D364_RS02060  4       23       -3.312019   lipoprotein
D364_RS02065  5       25       -3.656390   transcriptional regulator BolA
D364_RS02070  4       23       -2.849137   trigger factor
D364_RS02075  1       19       -2.324978   ATP-dependent Clp endopeptidase proteolytic
D364_RS02080  2       18       -2.410337   ATP-dependent protease ATP-binding subunit ClpX
D364_RS02085  1       16       -2.189389   endopeptidase La
D364_RS02090  3       11       -0.116158   DNA-binding protein HU-beta
D364_RS02095  2       11       0.165018    peptidylprolyl isomerase
D364_RS02100  0       11       0.934402    helix-hairpin-helix domain-containing protein
D364_RS02105  0       10       0.549370    YbgC/FadM family acyl-CoA thioesterase
D364_RS02110  2       9        1.457657    7-cyano-7-deazaguanine synthase QueC
D364_RS02115  3       11       2.325983    SgrR family transcriptional regulator
D364_RS02120  2       12       1.502331    HMP-PP phosphatase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01980  TCRTETA  83  1e-19  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 83.3 bits (206), Expect = 1e-19
Identities = 57/231 (24%), Positives = 96/231 (41%), Gaps = 13/231 (5%)

Query: 16 GLGTVFSLRMLGMFMVLPVLTTY--GMALQGASEALIGLAIGIYGLAQAVFQIPFGLLSD 73
L TV L +G+ +++PVL + A G+ + +Y L Q G LSD
Sbjct: 10 ILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68

Query: 74 RIGRKPLIVGGLLIFVLGSVIAALTDSIWGIILGRALQG-SGAIAAAVMALLSDLTREQN 132
R GR+P+++ L + I A +W + +GR + G +GA A A ++D+T
Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128

Query: 133 RTKAMAFIGVSFGVTFAIAMVLGPIVTHQLGLHALFWMIAILATVGILLTLWVVPNSHNH 192
R + F+ FG VLG ++ HA F+ A L + L +++P SH
Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187

Query: 193 VLNRESGMVKGCFSKVLAEPRLLKLNFGIMCLHIMLMSTFVA-LPGQLEAA 242
+ + G+ + ++ F+ L GQ+ AA
Sbjct: 188 ERRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAA 231


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS01990  TCRTETB  51  7e-09  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 50.6 bits (121), Expect = 7e-09
Identities = 35/181 (19%), Positives = 74/181 (40%), Gaps = 3/181 (1%)

Query: 26 IFLGFCVIALDG-FDIAIMGFIAPTLKLEWGVSNHQLGLVISAALIGLALGAIFSGPLAD 84
I + C+++ + ++ P + ++ V +A ++ ++G G L+D
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 85 WLGRKKIIINSVFFFGFWTIATAFSHN-VEQMMFFRFMTGLGLGAAMPNIGTLVSEYAPE 143
LG K++++ + F ++ H+ ++ RF+ G G A + +V+ Y P+
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 144 RQRSFIITVIFCGFTFGAAAGGFSASWLIPQFGWHSLMALGGILPLLFAPLLIWLLPESV 203
R +I G G + W L+ + I ++ P L+ LL + V
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMIT-IITVPFLMKLLKKEV 193

Query: 204 R 204
R
Sbjct: 194 R 194


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02055  TCRTETB  38  5e-05  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 38.3 bits (89), Expect = 5e-05
Identities = 42/196 (21%), Positives = 75/196 (38%), Gaps = 15/196 (7%)

Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGMVNKTLGLFATILGALYG 279
R+N LI L ++ + + + ++++ + VN L +I A+YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 280 GVLMQRLTLFRALLIFGLLQGVSNAGYWLLSITDKHLYSMATAVFFENLCGGMGTAAFVA 339
L +L + R LL ++ S+ +S + + G G AAF A
Sbjct: 71 K-LSDQLGIKRLLLFGIIINC-------FGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122

Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWSTFYLFSVVAAVP 394
L+M K F L+ ++ A+G VGP I G WS L ++ +
Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181

Query: 395 GIALLLLCRQTLEHTQ 410
L+ L ++ +
Sbjct: 182 VPFLMKLLKKEVRIKG 197


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02085  GPOSANCHOR  34  0.002  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.3 bits (78), Expect = 0.002
Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%)

Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249
LE A +E + +L R +++ ++ S+ +Q++A ++L E + +
Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344

Query: 250 ENEALKRKIDAAKMPKEAKEKTEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308
++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ +
Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397

Query: 309 VKKDLRQAQEILD 321
V+K L +A L
Sbjct: 398 VEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02090  DNABINDINGHU  116  4e-38  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 116 bits (293), Expect = 4e-38
Identities = 49/88 (55%), Positives = 65/88 (73%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDALIASVTESLQAGDDVALVGFGTFAVKERAARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEITIAAAKVPGFRAGKALKDAV 89
NPQTG+EI I A+KVP F+AGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02120  HTHFIS  29  0.024  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.024
Identities = 11/64 (17%), Positives = 23/64 (35%), Gaps = 10/64 (15%)

Query: 193 LAVLSQHLGFTLQECMAFGDAMNDREMLGSVGRGFIMGN----------AMPQLKAELPH 242
VL+Q L + +A + + ++ + +P++K P
Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75

Query: 243 LPVI 246
LPV+
Sbjct: 76 LPVL 79


10  D364_RS02280  D364_RS02345  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS02280  2       16       -4.059797   YlaC family protein
D364_RS27275  2       18       -3.721087   hypothetical protein
D364_RS02290  1       18       -4.029612   maltose O-acetyltransferase
D364_RS02295  1       13       -2.928275   hemolysin expression modulator Hha
D364_RS02300  1       12       -2.183507   Hha toxicity modulator TomB
D364_RS02305  1       13       -1.824594   multidrug efflux RND transporter permease
D364_RS02310  2       12       -1.110592   multidrug efflux RND transporter periplasmic
D364_RS02315  3       12       -0.812223   multidrug efflux transporter transcriptional
D364_RS02320  3       13       -0.160091   mechanosensitive channel MscK
D364_RS02325  4       15       2.572245    Rpn family recombination-promoting
D364_RS02330  4       15       2.402799    pleiotropic regulatory protein RsmS
D364_RS02335  3       16       3.414070    primosomal replication protein N''
D364_RS02340  2       14       1.286724    DUF454 family protein
D364_RS02345  2       16       1.177347    adenine phosphoribosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02305  ACRIFLAVINRP  1365  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1365 bits (3535), Expect = 0.0
Identities = 805/1032 (78%), Positives = 911/1032 (88%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLSILKLPVAQYPTIAPPAISITAMYPGADAETVQNT 60
M NFFI RPIFAWV+AII+M+AG L+IL+LPVAQYPTIAPPA+S++A YPGADA+TVQ+T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDHLMYMSSNGDSTGTATITLTFESGTDPDIAQVQVQNKLALATPLLPQ 120
VTQVIEQNMNGID+LMYMSS DS G+ TITLTF+SGTDPDIAQVQVQNKL LATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGISVEKASSSFLMVVGVINTNGTMNQDDISDYVAANMKDPISRTSGVGDVQLFGS 180
EVQQQGISVEK+SSS+LMV G ++ N QDDISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMDPNKLNNFQLTPVDVISALKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW+D + LN ++LTPVDVI+ LK QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TNTEEFGNILLKVNQDGSQVRLRDVAKIELGGESYDVVAKFNGQPASGLGIKLATGANAL 300
N EEFG + L+VN DGS VRL+DVA++ELGGE+Y+V+A+ NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTANAIRAELAKMEPFFPSGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIQKGSHGATTGFFGWFNRMFDKSTHHYTDSVGNILRSTGRY 540
SVLVALILTPALCAT+LKP+ H GFFGWFN FD S +HYT+SVG IL STGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LVLYLIIVVGMAWLFVRLPSSFLPDEDQGVFLSMAQLPAGATQERTQKVLDEMTNYYLTK 600
L++Y +IV GM LF+RLPSSFLP+EDQGVFL+M QLPAGATQERTQKVLD++T+YYL
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKDNVESVFAVNGFGFAGRGQNTGIAFVSLKDWSQRPGEENKVEAITARAMGYFSQIKDA 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 MVFAFNLPAIVELGTATGFDFELIDQGGLGHEKLTQARNQLFGMVAQHPDVLTGVRPNGL 720
V FN+PAIVELGTATGFDFELIDQ GLGH+ LTQARNQL GM AQHP L VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYIMSEAKYRM 780
EDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+Y+ ++AK+RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPEDIGKWYVRGSDGQMVPFSAFSTSRWEYGSPRLERYNGLPSLEILGQAAPGKSTGEAM 840
LPED+ K YVR ++G+MVPFSAF+TS W YGSPRLERYNGLPS+EI G+AAPG S+G+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 ALMEELAGKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900
ALME LA KLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGVVGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGLI 960
MLVVPLG+VG LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEGKG++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 EATLEAVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMVTATILAIFF 1020
EATL AVRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGMV+AT+LAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 VPVFFVVVRRRF 1032
VPVFFVV+RR F
Sbjct: 1021 VPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02310  RTXTOXIND  43  2e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 30/210 (14%), Positives = 75/210 (35%), Gaps = 19/210 (9%)

Query: 100 TYQASYDSAKGDLAKAQAAANMDQLTVKRYQKLLGTKYISQQDYDTAVATA-QQSNAAVV 158
+ Y A +L ++ + + ++ Q + + +Q+ +
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV---TQLFKNEILDKLRQTTDNIG 312

Query: 159 AAKAAVETARINLAYTKVTSPISGRIGKSAV-TEGALVQNGQTTALATVQQLDPIYVDVT 217
+ + + +P+S ++ + V TEG +V +T + V + D + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371

Query: 218 QSSNDFLRLKQEL-ADGRLKQENGK------AKVELVTNDGLKYPQSGTLEFSDVTVDQT 270
+ D + A +++ KV+ + D ++ + G + +++++
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 271 TGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
S + I L GM V A ++ G
Sbjct: 432 CLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 29.0 bits (65), Expect = 0.040
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 3/78 (3%)

Query: 48 APLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFV-EGSDIQAGVSLYQIDPATYQASY 105
++I G+ T + R E++P + I+ K V EG ++ G L ++ +A
Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADT 136

Query: 106 DSAKGDLAKAQAAANMDQ 123
+ L +A+ Q
Sbjct: 137 LKTQSSLLQARLEQTRYQ 154


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02315  HTHTETR  185  2e-61  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 185 bits (470), Expect = 2e-61
Identities = 170/213 (79%), Positives = 194/213 (91%)

Query: 1 MARKTKQQARETRQLILDVALRLFSQQGVSSTSLATIAKAAGVTRGAIYWHFKNKSDLFN 60
MARKTKQ+A+ETRQ ILDVALRLFSQQGVSSTSL IAKAAGVTRGAIYWHFK+KSDLF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSDASISDLEIEYRAKFPNDPLSVIREILVYVLEATVTEERRRLMMEIIYHKCEFV 120
EIWELS+++I +LE+EY+AKFP DPLSV+REIL++VLE+TVTEERRRL+MEII+HKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMTVVQQAQRQLSLASYERIEQTLKECIAAKLLPANLLTRRAAVLMRSYLSGLMENWLF 180
GEM VVQQAQR L L SY+RIEQTLK CI AK+LPA+L+TRRAA++MR Y+SGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APDSFDLHAEARDYVAILLEMYQFCPTLRGPES 213
AP SFDL EARDYVAILLEMY CPTLR P +
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPAT 213


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02320  GPOSANCHOR  47  4e-07  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.0 bits (111), Expect = 4e-07
Identities = 42/282 (14%), Positives = 95/282 (33%), Gaps = 4/282 (1%)

Query: 31 RAADLPDRAEVQSQLNTLNKQKELTPQDKLVQQDLTQTLETLDKIERIKSETAQLRQQVE 90
+ +DL + N +EL+ + ++++ E KI+ +++ A L + +E
Sbjct: 72 KNSDLSFNNKALKDHND-ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130

Query: 91 QAPAKLRQAVESLNNLSDVPNDDATRKTLSTLSLRQLESRVTQTLDDLQNAQNDLATYNS 150
A + L A RK +L + T ++ + + A +
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 151 QLVSLQTQPERVQNAMFNASQQLQQIRNRLNGTSVGD---ETLRPTQQVLLQAQQALLNA 207
+ L+ E N S +++ + + E A A +
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 208 QIEQQRKSLEGNTILQDTLQKQRDYVTAWSNRLEHQLQLLQEAVNSKRLTLTEKTAQEAV 267
++ L+ L+ ++ TA S +++ K + A
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 268 TPDETARIQANPLVKQELDINHQLSEKLIQATENGNQLVQRN 309
+ A+ K++L+ HQ E+ + +E Q ++R+
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352


11  D364_RS02585  D364_RS02685  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS02585  1       14       -3.816730   type 1 fimbrial protein
D364_RS02590  1       16       -4.535001   GNAT family N-acetyltransferase
D364_RS02595  -1      16       -2.535139   LpxL/LpxP family Kdo(2)-lipid IV(A)
D364_RS02600  1       19       -2.715402   DUF1471 domain-containing protein
D364_RS02605  0       18       -2.571797   lysine decarboxylation/transport transcriptional
D364_RS02610  0       20       -1.581744   cadaverine/lysine antiporter
D364_RS02615  0       18       -0.371625   lysine decarboxylase CadA
D364_RS02620  -2      13       2.320677    dipeptide permease DtpD
D364_RS02625  -2      13       2.849126    lysine--tRNA ligase
D364_RS02630  0       19       4.840019    hypothetical protein
D364_RS02635  0       18       4.000821    myo-inosose-2 dehydratase
D364_RS02640  1       17       2.460829    Gfo/Idh/MocA family oxidoreductase
D364_RS02645  3       18       2.012817    Gfo/Idh/MocA family oxidoreductase
D364_RS02650  3       18       2.082171    Gfo/Idh/MocA family oxidoreductase
D364_RS02655  2       18       1.560845    LacI family DNA-binding transcriptional
D364_RS02660  3       17       1.930328    sugar ABC transporter substrate-binding protein
D364_RS02665  2       18       2.773604    sugar ABC transporter ATP-binding protein
D364_RS02670  1       20       3.529784    ABC transporter permease
D364_RS02680  0       21       4.719856    hypothetical protein
D364_RS02685  0       21       3.234491    LacI family DNA-binding transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02590  SACTRNSFRASE  30  0.008  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 29.9 bits (67), Expect = 0.008
Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 6/61 (9%)

Query: 74 RYIQIGTVMTEPDHRNKGLAGQLIHHILQDWQQEADAFFLFANPTTVD-----FYPKFGF 128
Y I + D+R KG+ L+ H +W +E L ++ FY K F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALL-HKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146

Query: 129 T 129

Sbjct: 147 I 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02620  TCRTETA  29  0.030  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.030
Identities = 41/208 (19%), Positives = 76/208 (36%), Gaps = 14/208 (6%)

Query: 44 SHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGVESTSAWSL 102
+H L + YA P+LG +DR G R ++ + + ++ + W L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YVALAIIICGY-GLFKSNISCLLGELYAHDDPRRDGGFSLLYAAGNVGSIAAPIACGLAA 161
Y + I+ G G + + ++ D+ R F + A G +A P+ GL
Sbjct: 100 Y--IGRIVAGITGATGAVAGAYIADITDGDE--RARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHIGFALAGIGMFIGLMIFLSGSRHFRHT-RGVDKPALRAVKFVLPTWGWLLVMLC 220
+ H F A + + FL+G + +G +P R L ++ W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLLQNNWSGYLLAIVCLFAAQ 248
+A + + A+ +F
Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGED 239


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS02635  PF08280  31  0.008  M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 30.6 bits (69), Expect = 0.008
Identities = 23/130 (17%), Positives = 45/130 (34%), Gaps = 21/130 (16%)

Query: 114 GETPLDEPISLSPPLSRVSLAAYCHKLNTFADLLLR------------DYDLQLAYHHHL 161
P+ E ++ L+ + L YC +LN F L + + Y + L
Sbjct: 57 SSLPITE-VAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKRMISCQFTHPSKETYLYQL 115

Query: 162 ----MMLVEHDDELERFLSHTHDNVGLAFDTGHAFVAGVEIPRVLHKYGHRIRHLHLKDV 217
+L L + + + L F++ R+ +R+ LK
Sbjct: 116 YASSNVL----QLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREALIPLLRNFELKLS 171

Query: 218 RPQVLGRLYR 227
+ +++G YR
Sbjct: 172 KNKIVGEEYR 181


12  D364_RS02750  D364_RS02790  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS02750       2       13  -0.694564  hypothetical protein
D364_RS02755       2       10   0.825978  RcnB family protein
D364_RS27290       2       10   0.740333  hypothetical protein
D364_RS27295       2       13   2.457659  hypothetical protein
D364_RS02770       2       14   2.233518  amidohydrolase
D364_RS02775       1       14   2.691659  ABC transporter substrate-binding protein
D364_RS02780       1       14   2.466533  aldo/keto reductase
D364_RS02785       1       14   3.313910  MoaF N-terminal domain-containing protein
D364_RS02790       1       11   3.564201  SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02790  DHBDHDRGNASE     90    1e-23  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 89.7 bits (222), Expect = 1e-23
Identities = 62/253 (24%), Positives = 103/253 (40%), Gaps = 8/253 (3%)

Query: 3 RVVVITGGGTGIGAACARLMRAAGDRVFITGRREAPLQAVANETGATA-----LVGDAAD 57
++ ITG GIG A AR + + G + L+ V + A A D D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 58 GEVWRQRLLPAILDQTGGIDVLICSAGGMGNSPAAETSDRQWREALDGNLTSAFASVRAC 117
+ + I + G ID+L+ AG + SD +W N T F + R+
Sbjct: 69 SAAIDE-ITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 118 LPSLIARR-GNVLFVASIASLAAGPQACGYVTAKHALIGLMRSVARDYGPQGVRANAICP 176
++ RR G+++ V S + Y ++K A + + + + +R N + P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 177 GWVTTPMADEEMHPLMQAEGLSLTEAYQRVCRDVPLRRPASPEEIAQACQFLCSPQAAII 236
G T M + + + + +PL++ A P +IA A FL S QA I
Sbjct: 188 GSTETDM-QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 237 SGATLVADGGASI 249
+ L DGGA++
Sbjct: 247 TMHNLCVDGGATL 259


13  D364_RS02875  D364_RS03100  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS02875       2       16   3.335882  sugar phosphate isomerase/epimerase
D364_RS02880       2       17   3.548637  Gfo/Idh/MocA family oxidoreductase
D364_RS02885       1       16   3.112784  Exc2 family lipoprotein
D364_RS02890      -1       13  -0.162745  LacI family transcriptional regulator
D364_RS02895       0       12  -1.145852  mechanosensitive ion channel family protein
D364_RS02900       0       12  -1.114575  MFS transporter
D364_RS02905       1       15  -2.831288  helix-turn-helix transcriptional regulator
D364_RS02910       1       18  -3.831408  DUF2157 domain-containing protein
D364_RS02915       1       21  -4.954017  sigma-54-dependent transcriptional regulator
D364_RS02920       0       19  -3.633638  SIS domain-containing protein
D364_RS02925      -2       16  -3.047120  SIS domain-containing protein
D364_RS02930      -2       14  -3.005237  PTS system mannose/fructose/sorbose family
D364_RS02935      -3       15  -2.195311  PTS
D364_RS02940      -2       11  -1.054660  PTS sugar transporter subunit IIB
D364_RS02945      -2       13  -0.725558  PTS sugar transporter subunit IIA
D364_RS02950      -2       15  -0.150436  oxygen-insensitive NAD(P)H nitroreductase
D364_RS02955      -1       15   1.979348  MmcQ/YjbR family DNA-binding protein
D364_RS02960      -1       15   3.189685  TetR/AcrR family transcriptional regulator
D364_RS02965       1       19   4.642961  MBL fold metallo-hydrolase
D364_RS02970       1       21   4.914612  RamA family antibiotic efflux transcriptional
D364_RS02975       2       21   5.022894  DUF1158 domain-containing protein
D364_RS02985       1       20   5.226835  cation-transporting P-type ATPase
D364_RS02990       1       19   5.851499  efflux RND transporter periplasmic adaptor
D364_RS02995       2       19   5.546731  efflux RND transporter permease subunit
D364_RS03000      -1       14   3.473036  phosphodiester glycosidase family protein
D364_RS03005      -1       14   3.403162  glutamate--cysteine ligase
D364_RS03010      -1       15   2.124800  DMT family transporter
D364_RS03015      -1       16   1.295974  PLP-dependent aminotransferase family protein
D364_RS03020      -1       16   0.570632  PLP-dependent aminotransferase family protein
D364_RS03025      -1       17   0.368185  transporter substrate-binding domain-containing
D364_RS03030      -2       18   1.680510  amino acid ABC transporter permease
D364_RS03035      -1       18  -1.426679  ABC transporter permease subunit
D364_RS03040      -1       19  -2.470185  amino acid ABC transporter ATP-binding protein
D364_RS03045       0       24  -4.189289  SDR family NAD(P)-dependent oxidoreductase
D364_RS03050       1       30  -6.286873  aspartate aminotransferase family protein
D364_RS27300       1       39  -7.944981  hypothetical protein
D364_RS03065      -1       31  -5.707511  AraC family transcriptional regulator
D364_RS03070      -3       20  -1.500849  aldo/keto reductase
D364_RS03075       0       14   1.870439  isochorismatase family protein
D364_RS03080      -1       13   3.072100  IS110 family transposase
D364_RS03085       0       18   5.167156  ABC transporter ATP-binding protein
D364_RS03090       0       18   5.012607  ABC transporter ATP-binding protein
D364_RS03095      -1       18   4.338375  ABC transporter permease
D364_RS03100      -2       18   3.701926  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02900  TCRTETB         103    7e-26  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 103 bits (257), Expect = 7e-26
Identities = 76/409 (18%), Positives = 160/409 (39%), Gaps = 29/409 (7%)

Query: 21 MLPLIDTSITNVALDAITHTLAASATQLELIVALYGVAFAVCLAMGSKLGDNYGRRRLFM 80
+++ + NV+L I + + + + F++ A+ KL D G +RL +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 81 WGVALFGIASLLCGMANSIGALL-AARTLQGAGAALIVPQILATLHVTLKGPAH-ARAIS 138
+G+ + S++ + +S +LL AR +QGAGAA P ++ + + +A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFG 142

Query: 139 LYGGIGGIAFIVGQMGGGWLVSADIAGLGWRNAFFINVPICLLVLALSRRYVPETRRETP 198
L G I + VG GG + + W ++ + +P+ ++ + +
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHY----IHW--SYLLLIPMITIITVPFLMKLLKKEVRIK 196

Query: 199 SRIDWQGTLYL-ALILCCLLFPMALGPELHWPLWLQLMLVAVLPLLFAMRQSALRQQQRG 257
D +G + + I+ +LF + L++ + L+F ++ ++
Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTSY-------SISFLIVSVLSFLIF------VKHIRKV 243

Query: 258 DHPLLPPRLLQLTSIRFGMAIALLFFSAWSGFMFCMALTMQEGLGMAPWQSGNSFIALG- 316
P + P L + G+ + F +GF+ + M++ ++ + G+ I G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 317 VAYFISALYAPRLIARYSMGRILLTGLAVQIAGLLLLCATFSRFGVATNALTLVPATALI 376
++ I L+ R +L G+ L F + T + + +
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-----FLLETTSWFMTIIIVFV 358

Query: 377 GYGQALIVNSFYRIGMRDISASDAGAGSAILSTLQQATLGLGPAILGSL 425
G + I + +AGAG ++L+ + G G AI+G L
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02905  CARBMTKINASE     30    0.012  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.8 bits (67), Expect = 0.012
Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 10/87 (11%)

Query: 199 AMAEHRGDPAWENKLARFFAASSEFEALWHQRYEVRGVENQIKHFNHPQLGRFSLQQMYW 258
A+ + ++E + + + + + YEV I H N PQ+G L
Sbjct: 13 ALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEV-----VITHGNGPQVGSLLLHMDAG 67

Query: 259 YSAPRNGSRLLVYLPMDEAGEQALAWL 285
+ + PMD AG + W+
Sbjct: 68 QATYGIPA-----QPMDVAGAMSQGWI 89


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02915  HTHFIS          149    1e-40  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 149 bits (379), Expect = 1e-40
Identities = 75/255 (29%), Positives = 122/255 (47%), Gaps = 21/255 (8%)

Query: 89 QALCADRQDSLAQLIGAQGSLQEALRQCKAAISYPGAGLPLLLRGPTGTGKSFLARQLWH 148
L D QD L+G ++QE R + L L++ G +GTGK +AR
Sbjct: 127 SKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTD---LTLMITGESGTGKELVAR---- 178

Query: 149 YAIDEGILPADAPFTVFNCAEYANNPELLTSKLFGHAKGAFTGADKAVPGLIETSNGGVL 208
A+ + + PF N A A +L+ S+LFGH KGAFTGA G E + GG L
Sbjct: 179 -ALHDYGKRRNGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235

Query: 209 FIDEVHRLPPEGQEKLFHFMDNGSWRRLGESADERSATVRLIFASTEDLEK-----HFLA 263
F+DE+ +P + Q +L + G + +G + VR++ A+ +DL++ F
Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVG-GRTPIRSDVRIVAATNKDLKQSINQGLFRE 294

Query: 264 TFIRRIPVI-VKILPIAERGQFERLAFIHHFFRREAQRLNHD-LELDGEIVSQLMRETLE 321
R+ V+ +++ P+ +R E + + F ++A++ D D E + +
Sbjct: 295 DLYYRLNVVPLRLPPLRDRA--EDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWP 352

Query: 322 GNVGGLENLIRNICA 336
GNV LENL+R + A
Sbjct: 353 GNVRELENLVRRLTA 367


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02960  HTHTETR          57    3e-12  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 56.6 bits (136), Expect = 3e-12
Identities = 21/106 (19%), Positives = 47/106 (44%), Gaps = 1/106 (0%)

Query: 3 RPKSEDKKQALLEAATAAFAQSGI-AASTSAIARSAGVAEGTLFRYFATKDELLNELYLA 61
+ ++++ +Q +L+ A F+Q G+ + S IA++AGV G ++ +F K +L +E++
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 62 IKLRLVRTMIAGLDPDEKRPKENARNIWNSYIDWGVRNPMEHKAIR 107
+ + + P R I ++ V +
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME 111


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02990  RTXTOXIND        43    2e-06  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 17/105 (16%), Positives = 35/105 (33%)

Query: 53 RAVDIRARTEGVIVQRHFQDGQYVTEGDLLFTLDDAQPRAALALAQAELKSAEASLRQSQ 112
R+ +I+ ++ + ++G+ V +GD+L L A Q+ L A + Q
Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154

Query: 113 QLLTRYERLINNHSISRNDVDTARMQRDVAAAAVQQAKARVEAQQ 157
L E ++ + + K + Q
Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199



Score = 34.8 bits (80), Expect = 5e-04
Identities = 15/90 (16%), Positives = 31/90 (34%), Gaps = 4/90 (4%)

Query: 91 RAALALAQAELKSAEASLRQSQQLLTRYE-RLINNHSISRNDVDTARMQRDVAAAAVQQA 149
A EL+ ++ L Q + + + + +N++ Q +
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317

Query: 150 KARVEAQQIVLSYTRITAPVTGRVGHSAFH 179
A+ E + + I APV+ +V H
Sbjct: 318 LAKNEER---QQASVIRAPVSVKVQQLKVH 344


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02995  ACRIFLAVINRP    915      0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 915 bits (2366), Expect = 0.0
Identities = 417/1034 (40%), Positives = 606/1034 (58%), Gaps = 17/1034 (1%)

Query: 1 MLTFFIRRPRFAMVIALLLTFVGAVSLKLIPVEQYPAITPPVVNVSASWPGASASDVAEA 60
M FFIRRP FA V+A++L GA+++ +PV QYP I PP V+VSA++PGA A V +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 IAAPLETQLNGVDHMLYMESTSSDEGTYRLSITFAAGTDADLAAIDVQNRVAQALAQLPA 120
+ +E +NG+D+++YM STS G+ +++TF +GTD D+A + VQN++ A LP
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQNGVQVRKRASNLLMGVSLYSPLGTLSPLFVSNYASTQVREALARLPGVGEVQMFGA 180
EVQQ G+ V K +S+ LM S + +S+Y ++ V++ L+RL GVG+VQ+FGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 RDYSMRIWLRPDRMNALNITTDDVAQALREQNVQGAAGQVGTPPVFNGQQQTLTINGLGR 240
+ Y+MRIWL D +N +T DV L+ QN Q AAGQ+G P GQQ +I R
Sbjct: 181 Q-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 241 LNEAASFGEIILRRGAQGQLVRLADVATIELGARSYSSGAQLNGKASAYLGIYPTPTANA 300
FG++ LR + G +VRL DVA +ELG +Y+ A++NGK +A LGI ANA
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 301 LQVASAVRAELNRLHTRFPADLTWEVKFDTTRFVAATIKEIGVSLALTLLAVVVVVSLFL 360
L A A++A+L L FP + +DTT FV +I E+ +L ++ V +V+ LFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 361 QSWRATLIVVLAIPVSLIGTFAVLYLLGYSANTLSLFAIILALTMVVDDAIVVVENVETK 420
Q+ RATLI +A+PV L+GTFA+L GYS NTL++F ++LA+ ++VDDAIVVVENVE
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 421 MAE-GLDRLQATAQALRQIAGPVIATTLVLLAVFVPVALLPGIVGELYRQFAVTLSTAVA 479
M E L +AT +++ QI G ++ +VL AVF+P+A G G +YRQF++T+ +A+A
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 480 LSSLVALTLTPALCALLLRPRPARP----AAVWRAFNRLLDGTRDGYGRLVGWMNRRPLL 535
LS LVAL LTPALCA LL+P A + FN D + + Y VG +
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 536 ALAATVAAGALVAFSFTSMPKGFLPQEDQGYLFASVQLPEAASLERTEAVMAQARKLLMA 595
L A + F +P FLP+EDQG +QLP A+ ERT+ V+ Q +
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 596 NPA--VEDVIQVSGFNILNGTSASNGGFISVMLKDWHQRPP----LDAVMADIQRQLLSL 649
N VE V V+GF+ A N G V LK W +R +AV+ + +L +
Sbjct: 600 NEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657

Query: 650 PEATIMTFAPPTLPGLGNASGFDLRIMAQAGQSSAELEQVTREILQLANQHP-QLSRVFT 708
+ ++ F P + LG A+GFD ++ QAG L Q ++L +A QHP L V
Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 709 TWSSNVPQLTLTVDRDRAALLDVPVAQIFSSLQTAFGGTRAGDFSRNNRVYHVVMQNEMQ 768
+ Q L VD+++A L V ++ I ++ TA GGT DF RV + +Q + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 769 WRERAEQISELYVRSRDGERVRLSNLVTITPTVGAPFIQQYNQFPSVSVSGSAAEGVSSR 828
+R E + +LYVRS +GE V S T G+P +++YN PS+ + G AA G SS
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 829 TAMAAMEQILQAHLPPGYDYAWSGISWQEQQTGNQAVWIVLAAVAMAWLFLVAQYESWTL 888
AMA ME + LP G Y W+G+S+QE+ +GNQA +V + + +L L A YESW++
Sbjct: 838 DAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 889 PASVMLSVLFAIGGALLWLWTAGYANDVYVQIGLVLLIALAAKNAILIVEFARSRRE-EG 947
P SVML V I G LL NDVY +GL+ I L+AKNAILIVEFA+ E EG
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 948 LSIVDAAREGATRRFRAVMMTAVSFIIGIMPMMLATGAGAQSRRIIGTTVFSGMLVATMV 1007
+V+A R R ++MT+++FI+G++P+ ++ GAG+ ++ +G V GM+ AT++
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 1008 GILFIPSLYVLFQR 1021
I F+P +V+ +R
Sbjct: 1017 AIFFVPVFFVVIRR 1030



Score = 76.0 bits (187), Expect = 4e-16
Identities = 90/522 (17%), Positives = 182/522 (34%), Gaps = 45/522 (8%)

Query: 531 RRPLLALAATVAAGALVAFSFTSMPKGFLPQEDQGYLFASVQLPEAASLERTEAVMAQAR 590
RRP+ A + A + +P P + S P A + + V
Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV----- 61

Query: 591 KLLMANPAVEDVIQVSGFNILNGTSASNGGFISVMLKDWHQRPPLDA---VMADIQRQLL 647
+++ + ++ TS S G +++ L P A V +Q
Sbjct: 62 ----TQVIEQNMNGIDNLMYMSSTSDSAGS-VTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 648 SLPEATIMTFAPPTLPGLGNASGFDLRIMAQAGQS-SAELEQVTREILQLANQHPQLSRV 706
LP+ + ++S + +M S + Q +N LSR+
Sbjct: 117 LLPQE----VQQQGISVEKSSSSY---LMVAGFVSDNPGTTQDDISDYVASNVKDTLSRL 169

Query: 707 -----FTTWSSNVPQLTLTVDRDRAALLDVPVAQIFSSLQTAFGGTRAGDF------SRN 755
+ + + + +D D + + + L+ AG
Sbjct: 170 NGVGDVQLFGAQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQ 228

Query: 756 NRVYHVVMQNEMQWRERAEQISELYVR-SRDGERVRLSNLVTITPTVGA-PFIQQYNQFP 813
++ Q + E+ ++ +R + DG VRL ++ + I + N P
Sbjct: 229 QLNASIIAQTRFK---NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP 285

Query: 814 SVSVSGSAAEGVSSR-TAMAAMEQI--LQAHLPPG--YDYAWSGISWQEQQTGNQAVWIV 868
+ + A G ++ TA A ++ LQ P G Y + + + ++ V +
Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI-HEVVKTL 344

Query: 869 LAAVAMAWLFLVAQYESWTLPASVMLSVLFAIGGALLWLWTAGYANDVYVQIGLVLLIAL 928
A+ + +L + ++ ++V + G L GY+ + G+VL I L
Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404

Query: 929 AAKNAILIVE-FARSRREEGLSIVDAAREGATRRFRAVMMTAVSFIIGIMPMMLATGAGA 987
+AI++VE R E+ L +A + ++ A++ A+ +PM G+
Sbjct: 405 LVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 988 QSRRIIGTTVFSGMLVATMVGILFIPSLYVLFQRMREWAHRR 1029
R T+ S M ++ +V ++ P+L + H
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03045  DHBDHDRGNASE     84    2e-21  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 83.9 bits (207), Expect = 2e-21
Identities = 52/187 (27%), Positives = 84/187 (44%), Gaps = 5/187 (2%)

Query: 6 VVFITGATSGFGEAAAQVFADAGWSLVLSGRRYPRLKALQ--DRLAARVPVHIIELDVRD 63
+ FITGA G GEA A+ A G + +L+ + + AR DVRD
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRD 68

Query: 64 SEAVAAAVASLPADFADITTLINNAGLALSPLPAQEVALEDWKTMIDTNVTGLVTVTHAL 123
S A+ A + + I L+N AG+ L P ++ E+W+ N TG+ + ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 124 LPTLIRHGAGASIINIGSIAGQWPYPGSHVYGASKAFVKQFSYNLRCDLLGTGVRVTDLA 183
++ +G SI+ +GS P Y +SKA F+ L +L +R ++
Sbjct: 128 SKYMMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 184 PGIAETE 190
PG ET+
Sbjct: 187 PGSTETD 193


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03075  ISCHRISMTASE     32    0.002  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.5 bits (71), Expect = 0.002
Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 1/61 (1%)

Query: 107 RKTLIICGYSAEVGVLLTALGGLRQGYNVFIPVDCVGSQSLRTETVVLKQ-AEKAGAVIT 165
R LII G A +G L+TA + F D V SL + L+ A + +
Sbjct: 143 RDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVM 202

Query: 166 S 166
+
Sbjct: 203 T 203


14  D364_RS03200  D364_RS03450  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS03200      -1       16   3.495746  transketolase
D364_RS03205      -1       19   6.310618  transketolase
D364_RS03210       0       20   6.509840  enterobactin synthase subunit EntD
D364_RS03215       0       21   6.787627  siderophore enterobactin receptor FepA
D364_RS03220       3       20   8.753009  enterochelin esterase
D364_RS03225       3       21   8.626252  MbtH family NRPS accessory protein
D364_RS03230       3       21   8.346267  enterobactin non-ribosomal peptide synthetase
D364_RS03235       1       18   7.251484  iron-enterobactin ABC transporter ATP-binding
D364_RS03240       0       18   6.505416  iron-enterobactin ABC transporter permease
D364_RS03245       0       18   5.674318  Fe(3+)-siderophore ABC transporter permease
D364_RS03250       0       18   5.086914  enterobactin transporter EntS
D364_RS03255       0       18   4.637292  Fe2+-enterobactin ABC transporter
D364_RS03260      -2       18   3.823537  isochorismate synthase EntC
D364_RS03265      -2       19   3.108156  (2,3-dihydroxybenzoyl)adenylate synthase EntE
D364_RS03270      -1       19   2.386985  enterobactin biosynthesis bifunctional
D364_RS03275      -1       17   1.308716  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase
D364_RS03280       1       15   1.040220  proofreading thioesterase EntH
D364_RS03285       1       14   1.720741  carbon starvation protein CstA
D364_RS03290      -2       14   2.405227  YbdD/YjiX family protein
D364_RS03295      -2       15   2.615792  helix-turn-helix domain-containing protein
D364_RS03300      -1       16   3.011214  type II toxin-antitoxin system RelE/ParE family
D364_RS03305      -1       17   3.785731  3-oxoacyl-ACP reductase FabG
D364_RS03310      -2       18   3.708760  oxidoreductase
D364_RS03315      -1       16   3.858160  sugar ABC transporter ATP-binding protein
D364_RS03320      -1       16   4.264646  ABC transporter permease
D364_RS03325      -2       17   4.207556  sugar ABC transporter substrate-binding protein
D364_RS03330      -1       16   4.782310  LVIVD repeat-containing protein
D364_RS03335      -2       18   5.161137  S-methyl-5-thioribose kinase
D364_RS03340      -1       17   5.191916  S-methyl-5-thioribose-1-phosphate isomerase
D364_RS03345       1       19   4.791331  histidine phosphatase family protein
D364_RS03350       2       21   4.526187  MFS transporter
D364_RS03355       3       20   4.723725  inositol monophosphatase
D364_RS03360       2       20   5.363649  substrate-binding domain-containing protein
D364_RS03365       3       19   5.435524  cytosine deaminase
D364_RS03370       2       18   5.858679  PucR family transcriptional regulator
D364_RS03375       1       18   6.101363  cytosine permease
D364_RS03380       2       18   6.500311  methylthioribulose 1-phosphate dehydratase
D364_RS03385       2       18   7.003801  carbohydrate kinase
D364_RS03390       0       16   5.106586  2-hydroxyacid dehydrogenase
D364_RS03395       1       15   5.043890  sugar kinase
D364_RS03400       0       15   4.386505  ABC transporter permease
D364_RS03405      -1       13   3.618194  ABC transporter permease
D364_RS03410      -1       13   3.347774  sugar ABC transporter ATP-binding protein
D364_RS03415      -2       12   2.244928  sugar ABC transporter substrate-binding protein
D364_RS03420      -3       12   2.055665  molybdopterin-dependent oxidoreductase
D364_RS03425      -1       14  -1.930363  1,2-dihydroxy-3-keto-5-methylthiopentene
D364_RS03430      -1       14  -2.503046  acireductone synthase
D364_RS03435      -1       13  -3.095071  pyridoxal phosphate-dependent aminotransferase
D364_RS03440      -2       11  -3.978324  ParB/RepB/Spo0J family partition protein
D364_RS03445      -2       11  -4.498295  DUF3440 domain-containing protein
D364_RS03450      -2       13  -4.162983  LysR family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03210  ENTSNTHTASED    175    1e-57  Enterobactin synthetase component D signature.
		>ENTSNTHTASED#Enterobactin synthetase component D signature.

Length = 234

Score = 175 bits (446), Expect = 1e-57
Identities = 100/212 (47%), Positives = 129/212 (60%), Gaps = 9/212 (4%)

Query: 1 MRHHRTVLPLAGYTIQQIDFDPATFQPEDLFWLPYHASLTGWGRKRQAEHLAGRIAAAYA 60
M LP AG+ + +DFD ++F+ DL WLP+H L GRKR+AEHLAGRIAA +A
Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60

Query: 61 LREVGEKRLPAIGDQRQPLWPTPWFGSISHCGQRALAVIADRPVGVDIERRFTPQLAAEL 120
LREVG + +P +GD+RQPLWP FGSISHC ALAVI+ + +G+DIE+ + A EL
Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120

Query: 121 ESSIISPAEKTALLRSGLPFPLALTLAFSAKESGFKATPAANQRALGFADFQIVEITAST 180
SII E+ L S LPFPLALTLAFSAKES +KA GF ++ +TA+
Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDR-VTLPGFNSAKVTSLTATH 179

Query: 181 LALMF--------AEQRYLLHWIASEEQVITL 204
++L AE+ W + VITL
Sbjct: 180 ISLHLLPAFAATMAERTVRTEWFQRDNSVITL 211


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03250  TCRTETB          35    4e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 39/187 (20%), Positives = 71/187 (37%), Gaps = 8/187 (4%)

Query: 24 IARFISILSLGLLGVAIPVQIQMMTHSTWQVGLSVTLTGASMFVGLMVGGVLADRYERKR 83
I F S+L+ +L V++P T + +G V G L+D+ KR
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 84 LILLARGTCGVGFVGLCLNALLPEPSLAAIYLLGIWDGFFASLGVTALLAATPALVGREN 143
L+L G V + + A ++ G F +L ++ + +EN
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKEN 136

Query: 144 LMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNFGLAAAGTFITTLTLLRLPQLPPPP 203
+A + V +G + P IGG++ + W++ L IT +T+ L +L
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKE 192

Query: 204 QPREHPL 210
+
Sbjct: 193 VRIKGHF 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03255  FERRIBNDNGPP     52    6e-10  Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 52.3 bits (125), Expect = 6e-10
Identities = 59/288 (20%), Positives = 100/288 (34%), Gaps = 31/288 (10%)

Query: 40 HTLPSQPLRIVSTSVTLTGSLLAIDAPVVASGATTPNNRVADSQGFLRQWSEVAKARKLA 99
H P RIV+ LLA+ VAD+ + SE +
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINYRLWVSEPPLPDSV- 78

Query: 100 RLYIG---EPSAEAVAAQMPDLILVSATGGDSALPLYDQLKTIAPTLVINYDDKS----- 151
+ +G EP+ E + P ++ SA G P + L IAP N+ D
Sbjct: 79 -IDVGLRTEPNLELLTEMKPSFMVWSAGYG----PSPEMLARIAPGRGFNFSDGKQPLAM 133

Query: 152 WQTLLTQLGQITGHEQQASARIADFNKQLVSLKEKMKLPPQPVTALVYTAAAHSANIWTP 211
+ LT++ + + A +A + + S+K + L ++ P
Sbjct: 134 ARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGP 193

Query: 212 ESAQGQMLEQLGFSLATLPGGLPASHSQGKRHDIVQLGGENLAAGLNGQSLFLFAGDQKD 271
S ++L++ G A + + + LAA + L + KD
Sbjct: 194 NSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245

Query: 272 ADAIYANPLLAHLPAVAGKRVYPLGTETFRLDYYSALLVLQRLSSLFG 319
DA+ A PL +P V R + F SA+ ++ L + G
Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIG 293


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03270  ISCHRISMTASE    425    e-153  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 425 bits (1094), Expect = e-153
Identities = 152/303 (50%), Positives = 201/303 (66%), Gaps = 20/303 (6%)

Query: 1 MAIPKLQAYALPEASDIPANKVNWAFEPSRAALLIHDMQEYFLNFWGENSAMMEKVVANI 60
MAIP +Q Y +P ASD+P NKV+W +P+RA LLIHDMQ YF++ + ++ + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDFCKQNGIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQQVIAALAPDEDDTV 120
L++ C Q GIPV YTAQP Q+ +DRALL D WGPGL P ++++I LAP++DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEEMLKETGRDQLIITGVYAHIGCMTTATDAFMRDIKPFFVADALAD 180
L KWRYSAF R+ L EM+++ GRDQLIITG+YAHIGC+ TA +AFM DIK FFV DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSREEHLMALKYVAGRSGRVVMTEELL--------PLPASKA-----------ALRALIL 221
FS E+H MAL+Y AGR VMT+ LL + + A +R I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 222 PLLDESDEPLD-DENLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWALLTR 280
LL E+ E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LLT
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300

Query: 281 EVQ 283
Q
Sbjct: 301 RSQ 303


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03275  DHBDHDRGNASE    341    e-121  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 341 bits (875), Expect = e-121
Identities = 107/258 (41%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GQTVWVTGAGKGIGYATALAFVEAGANVTGFD---------------LAFDGESYPFATE 49
G+ ++TGA +GIG A A GA++ D A E++P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 TLDVADADQVREACSRLLANTERLDVLVNAAGILRMGATDQLSAEDWQQTFAVNVGGAFN 109
DV D+ + E +R+ +D+LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMAQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NLVSPGSTDTDMQRTLWVSDDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASSH 229
N+VSPGST+TDMQ +LW ++ +Q I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03285  ACRIFLAVINRP     31    0.014  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.014
Identities = 9/60 (15%), Positives = 25/60 (41%), Gaps = 1/60 (1%)

Query: 172 VIILAVLAMIVVKALTHSPWG-TYTVAFTIPLAIFMGIYIRYLRPGRIGEVSVIGLVMLV 230
++ ++ + + + A + W +V +PL I + L + ++GL+ +
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03305  DHBDHDRGNASE    135    8e-41  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (340), Expect = 8e-41
Identities = 86/253 (33%), Positives = 130/253 (51%), Gaps = 15/253 (5%)

Query: 5 LTGKKALVTGASRGLGRAIALSLARAGAAVVITYEKSVDKAQAVADEIKALGRYGEAVQA 64
+ GK A +TGA++G+G A+A +LA GA + + + +K + V +KA R+ EA A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 65 DSASAQAIQDAVTHAARSLGGLDILVNNAGIARGGPLESMTLADIDALINVNIRGVVIAT 124
D + AI + R +G +DILVN AG+ R G + S++ + +A +VN GV A+
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 125 QEALVHMAD--GGRIINIGSCLANRVAMPGISVYAMTKSALNALTRGLARDLGPRGITVN 182
+ +M D G I+ +GS A ++ YA +K+A T+ L +L I N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 183 LVHPGPTNSDMN-----PEDGEQ------AEAQRQMIAVGHYGQPEDIAAAVTFLASPAA 231
+V PG T +DM E+G + E + I + +P DIA AV FL S A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 232 GQISGTGLDVDGG 244
G I+ L VDGG
Sbjct: 244 GHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03350  TCRTETA          37    1e-04  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 1e-04
Identities = 53/313 (16%), Positives = 106/313 (33%), Gaps = 26/313 (8%)

Query: 99 LGLLLSAGMNLMMGMTTNALLLAIFWGINGWAQSMGVGPCAVSLARWYGVKERGTFYGIW 158
+ L +A +M +L I + G + G A +A ER +G
Sbjct: 78 VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY-IADITDGDERARHFGFM 136

Query: 159 STAHNIGEAVTYMVIAAVIAGFGWQMGYLSTAALGAAGVVLLVLFMHDSPQSSGFPSINV 218
S G V V+ ++ GF + + AAL + + +S + P
Sbjct: 137 SACFGFG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP---- 191

Query: 219 IRDEPQEEVEARGSVFKNQLLALRNPALWTLALASAFMYIDRYAVNSWGIFFLEQDKAYS 278
EA + + +A+ + + W I F E +
Sbjct: 192 ------LRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWD 244

Query: 279 TLEASGIIGVN-AIAGIAGTIIAGMLSDRF---FPRNRSVMAGFISLLNTAGFALMLWSP 334
IG++ A GI ++ M++ R++M G I+ + G+ L+ ++
Sbjct: 245 A----TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA--DGTGYILLAFAT 298

Query: 335 HNYYTDILAMIIFGATIGALTCFLGGLIAVDISSRKAAGAALGTIGIASYAGAGLGEFLT 394
+ M++ A+ G L +++ + + G G++ + + +G L
Sbjct: 299 R-GWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALTSLTSIVGPLLF 355

Query: 395 GIIIDKTAILENG 407
I + NG
Sbjct: 356 TAIYAASITTWNG 368


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03370  HTHFIS           37    1e-04  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 36.7 bits (85), Expect = 1e-04
Identities = 15/32 (46%), Positives = 22/32 (68%)

Query: 345 LETLLQENGNVVRAADRLGLHRNTLHQRIQRI 376
L L GN ++AAD LGL+RNTL ++I+ +
Sbjct: 442 LAALTATRGNQIKAADLLGLNRNTLRKKIREL 473


15  D364_RS03715  D364_RS03955  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS03715       0       20  -3.313577  glucosamine-6-phosphate deaminase
D364_RS03720      -1       18  -2.557789  PTS N-acetyl glucosamine transporter subunit
D364_RS03725       0       17  -4.410750  glutamine--tRNA ligase
D364_RS03730       0       22  -5.159432  methionine synthase
D364_RS03735      -1       19  -3.620905  hypothetical protein
D364_RS25765       1       18  -1.364282  ferric iron uptake transcriptional regulator
D364_RS03745       2       17  -0.706513  flavodoxin FldA
D364_RS03755      -1       13   0.557676  LexA regulated protein
D364_RS03760      -2       14   3.137415  esterase
D364_RS03770      -1       13   3.025810  replication initiation negative regulator SeqA
D364_RS03775      -1       17   3.959818  phosphoglucomutase
D364_RS03780       1       19   4.521138  hypothetical protein
D364_RS03785       0       19   4.797041  two-component system response regulator KdpE
D364_RS03790       0       20   4.861604  two-component system sensor histidine kinase
D364_RS03795       0       15   3.915586  potassium-transporting ATPase subunit KdpC
D364_RS03800      -1       17   3.506073  potassium-transporting ATPase subunit KdpB
D364_RS03805       0       13   2.891508  potassium-transporting ATPase subunit KdpA
D364_RS26825      -1       12   2.507921  K(+)-transporting ATPase subunit F
D364_RS03810      -1       12   2.472067  DUF2517 family protein
D364_RS03815       0       12   2.967438  deoxyribodipyrimidine photo-lyase
D364_RS03820       0       13   2.702182  dipeptide permease DtpD
D364_RS03825       2       15   2.163272  type 2 GTP cyclohydrolase I
D364_RS03830       3       16   1.894592  5-oxoprolinase subunit PxpB
D364_RS03835       2       16   1.316446  biotin-dependent carboxyltransferase family
D364_RS03840       0       13  -0.111442  5-oxoprolinase subunit PxpA
D364_RS03845      -1       15  -1.699558  DUF969 domain-containing protein
D364_RS03850       0       16  -2.775627  DUF979 domain-containing protein
D364_RS03855       2       18  -1.899276  pyroglutamyl-peptidase I
D364_RS03860       2       19  -2.294156  endonuclease VIII
D364_RS03870       2       22  -1.973215  citrate (Si)-synthase
D364_RS03875       2       23  -1.101374  succinate dehydrogenase cytochrome b556 subunit
D364_RS03880       2       27  -0.725280  succinate dehydrogenase membrane anchor subunit
D364_RS03885       2       28  -0.541000  succinate dehydrogenase flavoprotein subunit
D364_RS03890       1       27  -1.254277  succinate dehydrogenase iron-sulfur subunit
D364_RS03895       1       28  -0.895385  2-oxoglutarate dehydrogenase E1 component
D364_RS03900       1       31  -0.983383  2-oxoglutarate dehydrogenase complex
D364_RS03905       0       28  -1.559395  ADP-forming succinate--CoA ligase subunit beta
D364_RS03910       0       23  -2.144675  succinate--CoA ligase subunit alpha
D364_RS03920       0       20  -2.706761  cytochrome ubiquinol oxidase subunit I
D364_RS03925       1       20  -2.815895  cytochrome d ubiquinol oxidase subunit II
D364_RS03930       8       24  -2.737102  cytochrome bd-I oxidase subunit CydX
D364_RS03935       4       23  -2.024935  cyd operon protein YbgE
D364_RS03940       4       26  -2.344549  tol-pal system-associated acyl-CoA thioesterase
D364_RS03945       3       24  -2.270044  Tol-Pal system protein TolQ
D364_RS03950       3       21  -1.297376  colicin uptake protein TolR
D364_RS03955       3       19  -1.427035  cell envelope integrity protein TolA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03785  HTHFIS        88     2e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 2e-22
Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 1/122 (0%)

Query: 4 VLIIEDEHAIRRFLRTALEADGMRVFEAETLQRGLIEAATRKPDLAILDLGLPDGDGIDF 63
+L+ +D+ AIR L AL G V A DL + D+ +PD + D
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 IRDLRQ-WSQMPIIVLSARSEEHDKIAALDAGADDYLSKPFGIGELQARLRVALRRHGAA 122
+ +++ +P++V+SA++ I A + GA DYL KPF + EL + AL
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 123 QA 124
+
Sbjct: 126 PS 127


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03790  PF06580       30     0.044    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.044
Identities = 24/130 (18%), Positives = 48/130 (36%), Gaps = 28/130 (21%)

Query: 760 HIQLDLPDPLQLVHVDGPLFERVLINLLENAHKYAGAR----ASIGIRAEADARQLSLEV 815
+ + + V V P ++ L+EN K+ A+ I ++ D ++LEV
Sbjct: 241 QFENQINPAIMDVQV--PPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 816 WDNGPGIPAGQEQTIFDKFARGNKESAIPGVGLGLA-ICQAIVDVHGG--TISASNRPEG 872
+ G ++ G GL + + + ++G I S + +G
Sbjct: 297 ENTGS----------------LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEK-QG 339

Query: 873 GASFRVTLPG 882
+ V +PG
Sbjct: 340 KVNAMVLIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03820  BACINVASINB   29     0.036    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 29.3 bits (65), Expect = 0.036
Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 4/115 (3%)

Query: 361 ILTLSARWSAAY-GQSSMPLMVLGLAVMGFAELFIDPVAMSQITRIEIPGVTGVLTGIYM 419
+LT+ + +A + G +S+ L +GLAVM E+ +S I + P + VL +
Sbjct: 324 LLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLME 383

Query: 420 LLSGAIANYLAGVIAD-QTSQASFDAAGAVNYSID--AYITVFSQITWGALACVG 471
L+ AI L G+ D +T++ + GA+ +I A I V + + GA A +G
Sbjct: 384 LIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLG 438


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03890  TCRTETOQM     31     0.003    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.4 bits (71), Expect = 0.003
Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 14 VDDAPHMQDYTLEAEEGRDM-MLLDALIQLKEKDPSLSFRR 53
+++ + T+E + + MLLDAL+++ + DP L +
Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03955  IGASERPTASE   60     7e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 60.5 bits (146), Expect = 7e-12
Identities = 37/270 (13%), Positives = 80/270 (29%), Gaps = 15/270 (5%)

Query: 55 VDPGAVVNNYNRQQQQQA----SARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQER 110
VD + N Q + + A E E KQ +
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051

Query: 111 LQAQEAAKEAKEQQKQ-AEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAKAKADAQAK 169
+ ++ A E Q ++ A+EA + A + AQ+ + + A + + K
Sbjct: 1052 EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 170 AAEQAAAKAAADAKKQAEAAAAKAAAEA-KKQAEAEAAKAAAEAQKKAEAAAAKKAQQEA 228
A + K ++ + + +E + QAE K+ ++ A E
Sbjct: 1112 AKVETEKTQEV-PKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170

Query: 229 EKKAQQEAAKQAAAEKAAAEKAAA--------QKAAAEKAAAEKAAAAEKAAAAKAAAAE 280
K +Q E + A + +++ K ++ +
Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230

Query: 281 KAAADKAAKAAAAKAAAAKKAAAAKEADGV 310
+ A ++ ++ A + + V
Sbjct: 1231 PHNVEPATTSSNDRSTVALCDLTSTNTNAV 1260



Score = 56.2 bits (135), Expect = 2e-10
Identities = 25/234 (10%), Positives = 70/234 (29%), Gaps = 6/234 (2%)

Query: 65 NRQQQQQASARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQERLQAQEAAKEAKEQQ 124
NR+ ++A + A + + Q E +E Q E + +E+E E K + +
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124

Query: 125 KQAEEAAAKAAA-AAKAKADAQAKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKAAADAK 183
++ + + + + +A+ + K + A++ ++
Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184

Query: 184 KQAEAAAAKAAAEAKKQAEAEAAK--AAAEAQKKAEAAAAKKAQQEAEKKAQQEAAKQAA 241
+ + E + + +E+ K + + + E A ++
Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVP---HNVEPATTSS 1241

Query: 242 AEKAAAEKAAAQKAAAEKAAAEKAAAAEKAAAAKAAAAEKAAADKAAKAAAAKA 295
+++ ++ A A+ A A + +
Sbjct: 1242 NDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYN 1295



Score = 56.2 bits (135), Expect = 2e-10
Identities = 29/199 (14%), Positives = 63/199 (31%), Gaps = 5/199 (2%)

Query: 86 QQQAEELREKQAAEQERLKQLEQERLQAQEAAKEAKEQQKQAEEAAAKAAAAAKAKADAQ 145
++ + + Q + + + ++ A A + + A+
Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE-N 1043

Query: 146 AKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKAAADAKKQAEAAAAKAAAEAKKQAEAEA 205
+K+ + K +A + A++A + A+ + A + E + E
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 206 AKAAAEAQKKAEAAAAKKAQQEAEKKAQQEAAKQAAAEKAAAEKAAAQKAAAEKAAAEKA 265
A E + K E QE K Q + KQ +E + A++ E
Sbjct: 1104 ATVEKEEKAKVETEK----TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 266 AAAEKAAAAKAAAAEKAAA 284
+ A + A E ++
Sbjct: 1160 SQTNTTADTEQPAKETSSN 1178



Score = 55.5 bits (133), Expect = 3e-10
Identities = 28/251 (11%), Positives = 64/251 (25%), Gaps = 1/251 (0%)

Query: 59 AVVNNYNRQQQQQASARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQERLQAQEAAK 118
V N ++ + + A + Q ++ A+E + A + + + +
Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098

Query: 119 EAKEQQKQAEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAK-AKADAQAKAAEQAAAK 177
E KE +E AK + + ++ A+ +
Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158

Query: 178 AAADAKKQAEAAAAKAAAEAKKQAEAEAAKAAAEAQKKAEAAAAKKAQQEAEKKAQQEAA 237
+ AK + +Q E+ A + ++
Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218

Query: 238 KQAAAEKAAAEKAAAQKAAAEKAAAEKAAAAEKAAAAKAAAAEKAAADKAAKAAAAKAAA 297
+ ++ + A + A + A A KA A A
Sbjct: 1219 PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKA 1278

Query: 298 AKKAAAAKEAD 308
+ + E +
Sbjct: 1279 VSQHISQLEMN 1289


16  D364_RS04100  D364_RS04135  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS04100  -1      18       -3.927699  pyridoxal phosphatase
D364_RS04105  -1      19       -4.403348  6-phosphogluconolactonase
D364_RS04110  1       24       -7.265157  two-component-system connector protein AriR
D364_RS04115  2       26       -7.943535  hypothetical protein
D364_RS04120  1       27       -7.947169  two-component-system connector protein YcgZ
D364_RS04125  2       25       -8.237088  diguanylate phosphodiesterase
D364_RS04130  0       21       -8.312812  MerR family transcriptional regulator
D364_RS04135  0       16       -4.455215  helix-turn-helix transcriptional regulator
17  D364_RS04190  D364_RS04245  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS04190  -1      14       3.491797   formimidoylglutamase
D364_RS04200  -1      15       3.942616   histidine utilization repressor
D364_RS27310  -1      15       3.648783   urocanate hydratase
D364_RS27315  -1      14       3.194019   histidine ammonia-lyase
D364_RS04215  -1      12       3.953157   amino acid permease
D364_RS04220  0       12       4.181648   kinase inhibitor
D364_RS04225  -1      12       4.985758   adenosylmethionine--8-amino-7-oxononanoate
D364_RS04235  1       11       4.751509   biotin synthase BioB
D364_RS04240  0       13       4.490629   8-amino-7-oxononanoate synthase
D364_RS04245  0       12       3.334060   malonyl-ACP O-methyltransferase BioC
18  D364_RS04915  D364_RS05075  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS04915  2       24       -2.820194  dimethylsulfoxide reductase subunit B
D364_RS04920  1       24       -2.533241  dimethyl sulfoxide reductase anchor subunit
D364_RS04930  1       24       -3.131192  MFS transporter
D364_RS04935  2       26       -3.327680  pyruvate formate lyase 1-activating protein
D364_RS04940  1       24       -2.252140  formate C-acetyltransferase
D364_RS04945  -1      12       -0.894714  formate transporter FocA
D364_RS04950  3       24       -1.543800  YcaO-like family protein
D364_RS04955  4       25       -2.338304  DUF421 domain-containing protein
D364_RS04960  0       21       -0.249173  3-phosphoserine/phosphohydroxythreonine
D364_RS04965  -1      18       -0.621031  3-phosphoshikimate 1-carboxyvinyltransferase
D364_RS04970  0       19       -0.381135  (d)CMP kinase
D364_RS04975  0       16       0.241147   30S ribosomal protein S1
D364_RS04980  1       14       1.787450   integration host factor subunit beta
D364_RS04985  1       14       2.330157   ComEC family protein
D364_RS04990  -1      14       1.592127   lipid A ABC transporter ATP-binding
D364_RS04995  -1      16       3.260948   tetraacyldisaccharide 4'-kinase
D364_RS05000  1       16       1.766629   winged helix-turn-helix domain-containing
D364_RS05005  1       16       0.384613   protein YcaR
D364_RS05010  0       15       0.288867   3-deoxy-manno-octulosonate cytidylyltransferase
D364_RS05015  3       13       1.047823   YcbJ family phosphotransferase
D364_RS05020  2       13       0.601305   envelope biogenesis factor ElyC
D364_RS05025  1       14       -0.173342  tRNA uridine 5-oxyacetic acid(34)
D364_RS05030  1       15       0.098342   chromosome partition protein MukF
D364_RS05035  1       14       0.170823   chromosome partition protein MukE
D364_RS05040  1       13       -0.232845  chromosome partition protein MukB
D364_RS05045  1       16       -1.844561  L,D-transpeptidase
D364_RS05050  2       22       -3.090564  YcbK family protein
D364_RS05055  1       19       -2.660281  MBL fold metallo-hydrolase
D364_RS05060  0       19       -3.722340  aspartate/tyrosine/aromatic aminotransferase
D364_RS05065  1       19       -4.156502  porin OmpK35
D364_RS05070  -1      21       -4.324703  asparagine--tRNA ligase
D364_RS05075  0       23       -4.371284  diaminopropionate ammonia-lyase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS04980  DNABINDINGHU  116    5e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 116 bits (293), Expect = 5e-38
Identities = 33/88 (37%), Positives = 57/88 (64%), Gaps = 1/88 (1%)

Query: 2 TKSELIERLASQQSHIPAKAVEDAVKEMLEHMASTLAQGERIEIRGFGSFSLHYRAPRTG 61
K +LI ++A + + + K AV + ++S LA+GE++++ GFG+F + RA R G
Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61

Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRDR 89
RNP+TG++++++ VP FK GK L+D
Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDA 89


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS04990  ACRIFLAVINRP  30     0.030    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.2 bits (68), Expect = 0.030
Identities = 14/74 (18%), Positives = 29/74 (39%), Gaps = 1/74 (1%)

Query: 128 ITYDSEQVASSSSSALITVVREGASIIGLFVMMFYYSWQLSLILIVLAPIVSVAIRVVSK 187
YD+ S ++ + E ++ L + +F + + +LI + P+V + +
Sbjct: 325 YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILA 384

Query: 188 RFRNISKNMQNTMG 201
F S N G
Sbjct: 385 AF-GYSINTLTMFG 397


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS05040  GPOSANCHOR    48     2e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 48.1 bits (114), Expect = 2e-07
Identities = 46/281 (16%), Positives = 95/281 (33%), Gaps = 20/281 (7%)

Query: 347 QEKIERYEADLDELQIRLEEQNEVVAEAVERQEENEARAEAAELEVDELKSQLADYQQAL 406
+ K + L+ +E E ++ A E+ +N+ ++ EL+++ AD ++AL
Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129

Query: 407 DVQQTRAIQYNQALQALERAKALCHLPDLTPESADEWLETFQAKEQEATEKMLSLEQKMS 466
+ + + ++ LE KA L AD + + A + K+
Sbjct: 130 EGAMNFSTADSAKIKTLEAEKA-----ALAARKADL-----EKALEGAMNFSTADSAKIK 179

Query: 467 VAQTAHSQFEQAYQLVAAINGPLARNEAWDVARELLRDGVNQRHQAEQAQGLRSRLNELE 526
+ + E + L + A + A A+ +LE
Sbjct: 180 TLEAEKAALEARQA---ELEKALEGAMNFSTADSAKIKTLEAEKAALAAR-----KADLE 231

Query: 527 QRLREQQDAERQLAEFCKRQGKRYDIDDLETLHQELEARIASLADSVSNAQEQRMALRQE 586
+ L + + K + + LE ELE + + + + L E
Sbjct: 232 KALEGAMNFSTADSA--KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 289

Query: 587 LEQLQSRTQTLMRRAPVWLAAQNSLNQLCEQSGEQFASGQE 627
L++ L ++ V A + SL + + S E +
Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330



Score = 38.5 bits (89), Expect = 2e-04
Identities = 61/363 (16%), Positives = 117/363 (32%), Gaps = 29/363 (7%)

Query: 261 HLISEATNYVAADYMRHANERRIHLDKALEYRRDLFTSRSQLAAEQYKHVDMARELQEHN 320
+ E + + + + +L+ + K + L E
Sbjct: 53 EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKA 112

Query: 321 GAEGDLEADY----QAASDHLNLVQTALRQQEKIERYEADLDELQIRLEEQNEVVAEAVE 376
+LEA +A +N + + +E +A L + LE+ E
Sbjct: 113 SKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 172

Query: 377 RQEENEARAEAAELEVDELKSQLADYQQALDVQQTRAIQYNQALQA-LERAKALCHLPDL 435
EA + ++ +++L + T + L+A A +
Sbjct: 173 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEK 232

Query: 436 TPESADEWLETFQAKEQEATEKMLSLEQKM-SVAQTAHSQFEQAYQLVAAINGPLARNEA 494
E A + AK + + +LE + + + + A I A A
Sbjct: 233 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 292

Query: 495 WDVARELLRDGVNQRHQAEQAQGLRSRLNELEQRLREQQDAERQLAEFCK------RQGK 548
+ + L +Q A + Q LR L + + ++Q +AE Q E RQ
Sbjct: 293 LEAEKADLEH-QSQVLNANR-QSLRRDL-DASREAKKQLEAEHQKLEEQNKISEASRQSL 349

Query: 549 RYDID--------------DLETLHQELEARIASLADSVSNAQEQRMALRQELEQLQSRT 594
R D+D LE ++ EA SL + ++E + + + LE+ S+
Sbjct: 350 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKL 409

Query: 595 QTL 597
L
Sbjct: 410 AAL 412



Score = 36.2 bits (83), Expect = 0.001
Identities = 41/307 (13%), Positives = 102/307 (33%), Gaps = 18/307 (5%)

Query: 935 QFEQLKEDYAYAQQTQRDARQQAFALAEVVQRRAHFSYSDSAEMLSGNSDLNEKLRQRLE 994
++ +Q + Q+ E+ SD + D N++L + L
Sbjct: 36 NTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELS 95

Query: 995 QAESERSRARDAMRAHAAQLSQYNQVLASLKSSYDTKKELLNDLYKELQDIGVRADAGAE 1054
A+ + + ++ A+++ + A L+ + + +++ + A A
Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA 155

Query: 1055 ERA--RARRDELHMQLSNNRSRRNQLEKALTFCEAEMDNLTRKLRKLERDY-------CE 1105
+A + + + ++ LE EA L + L
Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215

Query: 1106 MREQVVTAKAGWCAVMRLVKDNGVERRLHRRELAYLSAD------ELRSMSDKALGALRL 1159
+ + A + + ++ ++ L A+ + GA+
Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275

Query: 1160 AVADNEHLRDVLRISEDPKRPERKIQFFVAVYQHLRERIRQDIIRTDDPVEAIEQMEIEL 1219
+ AD+ ++ + + + ++ V R+ +R+D+ D EA +Q+E E
Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL---DASREAKKQLEAEH 332

Query: 1220 SRLTEEL 1226
+L E+
Sbjct: 333 QKLEEQN 339



Score = 35.0 bits (80), Expect = 0.002
Identities = 51/327 (15%), Positives = 96/327 (29%), Gaps = 15/327 (4%)

Query: 782 ARENRIETLHAERESLSERFATLSFDVQKTQRLHQAFSRFIGSHLAVAFEDDPEEEIRKL 841
A ++ + L E + E+ + + Q + E
Sbjct: 82 ALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL------EKALEGAMNF 135

Query: 842 NSRRGELERALSAHESDNQQNRVQYEQAKEGVSALNRLLPRLNLLADDTLADRVDEIQER 901
++ + L A ++ + E+A EG + + A E
Sbjct: 136 STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL 195

Query: 902 LDEAQEAARFIQQHGNQLAKLEPIVSVLQSDPEQFEQLKEDYAYAQQTQRDARQQAFALA 961
+ A F ++ LE + L + E+ E + A
Sbjct: 196 EKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEK 255

Query: 962 EVVQRRAHFSYSDSAEMLSGNSDLNEKLRQRLEQAESERSRARDAMRAHAAQLSQYNQVL 1021
++ R ++ + L G + + +++ E+E++ Q N
Sbjct: 256 AALEAR----QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANR 311

Query: 1022 ASLKSSYDTKKELLNDLYKELQDIGVRADAGAEERARARRDELHMQLSNNRSRRNQLEKA 1081
SL+ D +E L E Q + + R RRD L +R + QLE
Sbjct: 312 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD-----LDASREAKKQLEAE 366

Query: 1082 LTFCEAEMDNLTRKLRKLERDYCEMRE 1108
E + + L RD RE
Sbjct: 367 HQKLEEQNKISEASRQSLRRDLDASRE 393


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS05065  ECOLIPORIN    496    e-179    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 496 bits (1278), Expect = e-179
Identities = 222/385 (57%), Positives = 266/385 (69%), Gaps = 29/385 (7%)

Query: 2 MKRNILAVVIPALLVAGAANAAEIYNKNGNKLDFYGKMVGEHVWTTNGDTSSDDTTYARI 61
MKR +LA+VIPALL AGAA+AAEIYNK+GNKLD YGK+ G H ++ + + D TY R+
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDD-SSKDGDQTYMRV 59

Query: 62 GLKGETQINDQLIGYGQWEYNMDASNVEGSQT-TKTRLAFAGLKAGEYGSFDYGRNYGAI 120
G KGETQINDQL GYGQWEYN+ A+ EG + TRLAFAGLK G+YGSFDYGRNYG +
Sbjct: 60 GFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVL 119

Query: 121 YDVEAATDMLVEWGGDGWNYTDNYMTGRTNGVATYRNSDFFGLVDGLSFALQYQGKNDHD 180
YDVE TDML E+GGD + Y DNYMTGR NGVATYRN+DFFGLVDGL+FALQYQGKN+
Sbjct: 120 YDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQ 179

Query: 181 RA---------------IRKQNGDGFSTAATYAFDNGIALSAGYSSSNRSVDQKA----D 221
A IR NGDGF + TY G + A Y++S+R+ +Q
Sbjct: 180 SADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTI 239

Query: 222 GNGDKAEAWATSAKYDANNIYAAVMYSQTYNMTP------EEDNHFAGKTQNFEAVVQYQ 275
GDKA+AW KYDANNIY A MYS+T NMTP D A KTQNFE QYQ
Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299

Query: 276 FDFGLRPSIGYVQTKGKDLQSRAGFSGGDADLVKYIEVGTWYYFNKNMNVYAAYKFNQLD 335
FDFGLRP++ ++ +KGKDL + +G D DLVKY +VG YYFNKN + Y YK N LD
Sbjct: 300 FDFGLRPAVSFLMSKGKDL-TYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLD 358

Query: 336 DND-YTKAAGVATDDQAAVGIVYQF 359
D+D + K AG++TDD A+G+VYQF
Sbjct: 359 DDDPFYKDAGISTDDIVALGMVYQF 383


19  D364_RS05175  D364_RS05265  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS05175  3       24       -1.988187  membrane integrity-associated transporter
D364_RS05180  3       26       -2.642642  ribosome modulation factor
D364_RS05185  2       23       -2.540426  bifunctional 3-hydroxydecanoyl-ACP
D364_RS05190  2       18       -1.497887  Lon protease family protein
D364_RS05195  2       21       -2.314169  macrodomain Ter protein MatP
D364_RS05200  1       17       -1.612257  porin OmpA
D364_RS05205  1       15       -1.449680  cell division inhibitor SulA
D364_RS05210  0       15       -1.039209  TfoX/Sxy family DNA transformation protein
D364_RS05215  6       22       -3.185208  TIGR01666 family membrane protein
D364_RS05220  6       27       -4.303484  YccF domain-containing protein
D364_RS05225  4       28       -4.736856  DNA helicase IV
D364_RS05230  5       31       -5.576412  methylglyoxal synthase
D364_RS05235  3       32       -6.255685  CoA-binding protein
D364_RS26885  3       33       -6.535882  BapA prefix-like domain-containing protein
D364_RS05250  2       36       -7.332589  TolC family outer membrane protein
D364_RS05255  1       29       -6.091482  type I secretion system permease/ATPase
D364_RS05260  2       26       -4.902376  HlyD family type I secretion periplasmic adaptor
D364_RS05265  1       22       -3.609380  glycosyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS05190  HTHFIS        33     0.003    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 33.3 bits (76), Expect = 0.003
Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 2/37 (5%)

Query: 121 DWVEAEQLFGCVR-QFNGAITLQPGLVHQANGGVLVL 156
D +E+E LFG + F GA T G QA GG L L
Sbjct: 202 DLIESE-LFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS05200  OUTRMMBRANEA  579    0.0      Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 579 bits (1495), Expect = 0.0
Identities = 306/356 (85%), Positives = 323/356 (90%), Gaps = 10/356 (2%)

Query: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYAGGKLGWSQYHDTGFYGNGFQNNNGPTRNDQ 60
MKKTAIAIAVALAGFATVAQAAPKDNTWY G KLGWSQYHDTGF NNNGPT +Q
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFI-----NNNGPTHENQ 55

Query: 61 LGAGAFGGYQVNPYLGFEMGYDWLGRMAYKGSVDNGAFKAQGVQLTAKLGYPITDDLDIY 120
LGAGAFGGYQVNPY+GFEMGYDWLGRM YKGSV+NGA+KAQGVQLTAKLGYPITDDLDIY
Sbjct: 56 LGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIY 115

Query: 121 TRLGGMVWRADSKGNYASTGVSRSEHDTGVSPVFAGGVEWAVTRDIATRLEYQWVNNIGD 180
TRLGGMVWRAD+K N V HDTGVSPVFAGGVE+A+T +IATRLEYQW NNIGD
Sbjct: 116 TRLGGMVWRADTKSN-----VYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGD 170

Query: 181 AGTVGTRPDNGMLSLGVSYRFGQEDAAPVVAPAPAPAPEVATKHFTLKSDVLFNFNKATL 240
A T+GTRPDNGMLSLGVSYRFGQ +AAPVVAPAPAPAPEV TKHFTLKSDVLFNFNKATL
Sbjct: 171 AHTIGTRPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATL 230

Query: 241 KPEGQQALDQLYTQLSNMDPKDGSAVVLGYTDRIGSEAYNQQLSEKRAQSVVDYLVAKGI 300
KPEGQ ALDQLY+QLSN+DPKDGS VVLGYTDRIGS+AYNQ LSE+RAQSVVDYL++KGI
Sbjct: 231 KPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGI 290

Query: 301 PAGKISARGMGESNPVTGNTCDNVKARAALIDCLAPDRRVEIEVKGYKEVVTQPAA 356
PA KISARGMGESNPVTGNTCDNVK RAALIDCLAPDRRVEIEVKG K+VVTQP A
Sbjct: 291 PADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGIKDVVTQPQA 346


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS05225  YERSSTKINASE  30     0.035    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.1 bits (67), Expect = 0.035
Identities = 46/201 (22%), Positives = 78/201 (38%), Gaps = 28/201 (13%)

Query: 291 PTISKLESDTAARHALLLKSWQKQCQEKKAQAKSWR------LWLEEEMGWQLPE---GD 341
P+IS A H + + WQ E K +R L L G+ L G
Sbjct: 12 PSIS-----LAKAHERISQHWQNPVGELNIGGKRYRIIDNQVLRLNPHSGFSLFREGVGK 66

Query: 342 FWQDKKVQRRMASRLDRWVSLMRMHGGSQAEMIAGAPEAVRDLFSKRVKLMSPLMKDWKA 401
+ K +A L +L + E+ + P A+ +LF + + PL WK
Sbjct: 67 IFSGKMFNFSIARNLTD--TLHAAQKTTSQELRSDIPNALSNLFGAKPQTELPL--GWKG 122

Query: 402 ALKAENAVDFSGLIHQAVNILDKGRFVSPWKHILVDEFQDISPQRASLLAALRRQNSQTT 461
A D G+ + + +F HI + E +D + L+A + R ++
Sbjct: 123 E-PLSGAPDLEGM-----RVAETDKFAEGESHISIIETKD----KQRLVAKIERSIAEGH 172

Query: 462 LFAVGDDWQAIYRFSGAQLSL 482
LFA + ++ IY+ +G +L
Sbjct: 173 LFAELEAYKHIYKTAGKHPNL 193


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS26885  ICENUCLEATIN  42     2e-05    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 42.4 bits (99), Expect = 2e-05
Identities = 109/600 (18%), Positives = 200/600 (33%)

Query: 166 TDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 225
T+ G S A + + +DS + S + +S + S SD +
Sbjct: 183 TETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTA 242

Query: 226 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 285
S + DS + S + DS + S + SD + S + +DS
Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302

Query: 286 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 345
+ S + +S + S + SD + S + DS + S +
Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 362

Query: 346 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 405
DS + S + SD + S + +DS + S + +S + S
Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422

Query: 406 DSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 465
+ SD + S + D+ + S + DS + S + SD +
Sbjct: 423 TQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 482

Query: 466 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 525
S S + +S + S + S + S + ++SD + S S + ++S
Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542

Query: 526 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 585
+ S + +S + S + SD + S + SDS + S +
Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTA 602

Query: 586 DSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 645
S + S + S + S S + +DS + S + +S + S
Sbjct: 603 SYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 662

Query: 646 DSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 705
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 663 TQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTS 722

Query: 706 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDS 765
S S + +DS + S + S A S + S + S S + +DS
Sbjct: 723 GYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 42.0 bits (98), Expect = 3e-05
Identities = 110/600 (18%), Positives = 202/600 (33%)

Query: 158 SDSDSDSDTDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 217
+++ DS T G S A +D+ + S + +S + S SD +
Sbjct: 183 TETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTA 242

Query: 218 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 277
S + DS + S + DS + S + SD + S + +DS
Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302

Query: 278 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 337
+ S + +S + S + SD + S + DS + S +
Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 362

Query: 338 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 397
DS + S + SD + S + +DS + S + +S + S
Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422

Query: 398 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDS 457
+ SD + S + DS + + + DS + S + SD +
Sbjct: 423 TQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 482

Query: 458 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 517
S S + +S + S + S + S + ++SD + S S + ++S
Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542

Query: 518 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 577
+ S + +S + S + SD + S + SDS + S +
Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTA 602

Query: 578 DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 637
S + S + S + S S + +DS + S + +S + S
Sbjct: 603 SYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 662

Query: 638 DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 697
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 663 TQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTS 722

Query: 698 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDS 757
S S + +DS + S + S + S A S + S S + +DS
Sbjct: 723 GYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 41.7 bits (97), Expect = 4e-05
Identities = 115/618 (18%), Positives = 206/618 (33%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
GS A +S + S SD + S + DS + S + DS
Sbjct: 213 GSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 272

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S + SD + S + +DS + S + +S + S +
Sbjct: 273 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 332

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
SD + S + DS + S + DS + S + SD + S
Sbjct: 333 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTG 392

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ +DS + S + +S + S + SD + S + DS +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S + DS + S + SD + S S + +S + S + S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTL 512

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + ++SD + S S + ++S + S + +S + S +
Sbjct: 513 TAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTARE 572

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
SD + S + SDS + S + S + S + S + S S
Sbjct: 573 GSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTS 632

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
A +DS + S + +S + S + SD + S S + +DS +
Sbjct: 633 TAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGY 692

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
+ + +S + S + SD S S S + +DS + S + S
Sbjct: 693 GSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSL 752

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + S + S S + ADS + S + S + S +
Sbjct: 753 TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQE 812

Query: 770 DSDSDSDSDSDSDSDSDS 787
SD + S S + +DS
Sbjct: 813 RSDLTTGYGSTSTAGADS 830



Score = 41.3 bits (96), Expect = 5e-05
Identities = 112/618 (18%), Positives = 203/618 (32%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
GS S + S + S + S + +DS + S + +S
Sbjct: 165 GSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQ 224

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S SD + S + DS + S + DS + S +
Sbjct: 225 MAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 284

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
SD + S + +DS + S + +S + S + SD + S
Sbjct: 285 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 344

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ DS + S + DS + S + SD + S + +DS +
Sbjct: 345 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGY 404

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S + +S + S + SD + S + DS + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + SD + S S + +S + S + S + S + +
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
+SD + S S + ++S + S + +S + S + SD + S
Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
A SDS + S + S + S + S + S S + +DS +
Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
+ + +S + S + SD + S S + +DS + S + +S
Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + SD S S S + ADS + S + S + S +
Sbjct: 705 TAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTARE 764

Query: 770 DSDSDSDSDSDSDSDSDS 787
S + S S + +DS
Sbjct: 765 QSVLTTGYGSTSTAGADS 782



Score = 40.5 bits (94), Expect = 1e-04
Identities = 108/610 (17%), Positives = 202/610 (33%)

Query: 154 ADSDSDSDSDSDTDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 213
D S+ + + P + DA +S + + + + S S + S
Sbjct: 125 PDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTE 184

Query: 214 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 273
+ S + S + +DS + S + +S + S SD +
Sbjct: 185 TAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGY 244

Query: 274 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 333
S + DS + S + DS + S + SD + S + +DS
Sbjct: 245 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 304

Query: 334 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 393
+ S + +S + S + SD + S + DS + S +
Sbjct: 305 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 364

Query: 394 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDS 453
DS + S + SD + S + +DS + S + +S + S
Sbjct: 365 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 424

Query: 454 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 513
+ SD + S + DS + S + DS + S + SD +
Sbjct: 425 TAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGY 484

Query: 514 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 573
S S + +S + S + S + S + ++SD + S S + ++S
Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSL 544

Query: 574 DSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 633
+ S + +S A S + SD + S + SDS + S +
Sbjct: 545 IAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASY 604

Query: 634 DSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 693
S + S + + + S S + +DS + S + +S + S
Sbjct: 605 HSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 664

Query: 694 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDS 753
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 665 TAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGY 724

Query: 754 DSDSDSDSDS 763
S S + +DS
Sbjct: 725 GSTSTAGADS 734



Score = 40.1 bits (93), Expect = 1e-04
Identities = 106/585 (18%), Positives = 195/585 (33%)

Query: 205 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 264
S + +DS + S + +S + S SD + S + DS
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 265 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 324
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 325 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 384
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 385 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSD 444
+ SD + S + +DS + S + +S + + + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 445 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 504
S + DS + S + DS + S + SD + S S + +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 505 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 564
+ S + S + S + ++SD + S S + ++S + S + +
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557

Query: 565 SDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 624
S + S + SD + S + SDS + S + S + S
Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 625 SDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 684
+ S + S S + +DS + S + +S + S + SD +
Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677

Query: 685 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSD 744
S S + +DS + S + +S + S + SD S S S A +DS
Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737

Query: 745 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 789
+ S + S + S + S + S S + +DS
Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 39.7 bits (92), Expect = 2e-04
Identities = 105/585 (17%), Positives = 195/585 (33%)

Query: 209 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 268
S + +DS + S + +S + S SD + S + DS
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 269 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 328
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 329 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 388
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 389 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSD 448
+ SD + S + +DS + S + +S A S + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 449 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 508
S + DS + S + DS + S + SD + S S + +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 509 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 568
+ S + S + S + ++SD + S S + ++S + S + +
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557

Query: 569 SDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 628
S + S + SD + S + SDS + S + S + S
Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 629 SDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 688
+ S + S S + +DS + S + +S + S + SD +
Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677

Query: 689 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSD 748
S S + +DS + S + +S + S + SD S + S + +DS
Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737

Query: 749 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDT 793
+ S + S + S + S + S S + +D+
Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 39.7 bits (92), Expect = 2e-04
Identities = 105/585 (17%), Positives = 195/585 (33%)

Query: 207 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 266
S + +DS + S + +S + S SD + S + DS
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 267 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 326
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 327 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 386
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 387 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSD 446
+ SD + S + +DS + S + +S + S + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 447 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 506
S + DS + S + DS + S + SD + S S + +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 507 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 566
+ S + S + S + ++SD + S S + ++S + S + +
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557

Query: 567 SDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 626
S + S + SD + + + SDS + S + S + S
Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 627 SDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 686
+ S + S S + +DS A S + +S + S + SD +
Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677

Query: 687 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSD 746
S S + +DS + S + +S + S + SD S S + + +DS
Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737

Query: 747 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 791
+ S + S + S + S + S S + +DS
Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 39.0 bits (90), Expect = 2e-04
Identities = 112/620 (18%), Positives = 206/620 (33%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
G S A ++ + S SD + S + DS + S + DS
Sbjct: 211 GYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDS 270

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S + SD + S + +DS + S + +S + S +
Sbjct: 271 SLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTA 330

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
SD + S + DS + S + DS + S + SD + S
Sbjct: 331 QKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGS 390

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ +DS + S + +S + S + SD + S + DS +
Sbjct: 391 TGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIA 450

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S + DS + + + SD + S S + +S + S + S
Sbjct: 451 GYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGS 510

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + ++SD + S S + ++S + S + +S + S +
Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
SD + S + SDS + S + S + S + S + S
Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
+ + +DS + S + +S + S + SD + S S + +DS +
Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
S + +S + S + SD S S S + +DS + S + S
Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + S + S S A +DS + S + S + S +
Sbjct: 751 SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810

Query: 770 DSDSDSDSDSDSDSDSDSDS 789
SD + S S + +DS
Sbjct: 811 QERSDLTTGYGSTSTAGADS 830



Score = 38.6 bits (89), Expect = 3e-04
Identities = 105/593 (17%), Positives = 196/593 (33%)

Query: 201 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 260
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 261 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 320
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 321 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 380
S + DS + S + SD + S + +DS + S + +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 381 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSD 440
+ S + SD + S + DS + S + DS A S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 441 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 500
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 501 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 560
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 561 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSD 620
S + S + S + ++SD + S S + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 621 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSD 680
+ S + SD + S + SDS + S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 681 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSD 740
S + S S + +DS + S + +S + S + SD + + S
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 741 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDT 793
+ +DS + S + +S + S + SD S S S + +D+
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADS 734



Score = 38.6 bits (89), Expect = 4e-04
Identities = 110/610 (18%), Positives = 203/610 (33%)

Query: 182 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 241
+S + S SD + S + DS + S + DS + S
Sbjct: 221 ESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQ 280

Query: 242 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 301
+ SD + S + +DS + S + +S + S + SD +
Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340

Query: 302 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 361
S + DS + S + DS + S + SD + S + +DS
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 362 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 421
+ S + +S + S + SD + S + DS + S +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 422 DSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 481
DS + S + SD + S S + +S + S + S + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 482 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 541
+ ++SD + S S + ++S + S + +S + S + SD +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 542 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDS 601
S + SDS + S + S + S + S + + S + +DS
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 602 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDS 661
+ S + +S + S + SD + S S + +DS A S +
Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700

Query: 662 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 721
+S + S + SD S S S + +DS + S + S + S
Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760

Query: 722 DSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 781
+ S + S + + +DS + S + S + S + SD +
Sbjct: 761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820

Query: 782 DSDSDSDSDS 791
S S + +DS
Sbjct: 821 GSTSTAGADS 830



Score = 38.2 bits (88), Expect = 4e-04
Identities = 105/593 (17%), Positives = 196/593 (33%)

Query: 199 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 258
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 259 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 318
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 319 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 378
S + DS + S + SD + S + +DS + S + +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 379 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSD 438
+ S + SD + S + DS + S + DS + S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 439 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 498
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 499 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 558
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 559 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSD 618
S + S + S + ++SD + + S + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 619 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSD 678
+ S + SD + S + SDS A S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 679 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAD 738
S + S S + +DS + S + +S + S + SD + S +
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 739 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 791
+ +DS + S + +S + S + SD S S S + +DS
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADS 734



Score = 37.8 bits (87), Expect = 6e-04
Identities = 115/619 (18%), Positives = 210/619 (33%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
GS A ADS + S + +S + S + SD + S + DS
Sbjct: 293 GSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSL 352

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S + DS + S + SD + S + +DS + S +
Sbjct: 353 IAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGE 412

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
+S + S + SD + S + DS + S + DS + S
Sbjct: 413 ESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQ 472

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ SD + S S + +S + S + S + S + ++SD +
Sbjct: 473 TAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGY 532

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S S + ++S + S + +S + S + SD + S + SDS
Sbjct: 533 GSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSI 592

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + S + S + S + S S + +DS + S +
Sbjct: 593 IAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGY 652

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
+S + S + SD + S S + +DS + S + +S + S
Sbjct: 653 NSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 712

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
A SD S S S + +DS + S + S + S + S +
Sbjct: 713 TAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGY 772

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
+ S + +DS + S + S + S + SD + S S + +DS
Sbjct: 773 GSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSL 832

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + +S + S + +SD + S S + DS + S +
Sbjct: 833 IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGY 892

Query: 770 DSDSDSDSDSDSDSDSDSD 788
+S + S + +SD
Sbjct: 893 NSILTAGYGSTQTAQENSD 911



Score = 37.0 bits (85), Expect = 0.001
Identities = 93/530 (17%), Positives = 172/530 (32%)

Query: 271 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 330
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 331 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 390
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 391 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSD 450
S + DS + S + SD + S + ADS + S + +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 451 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 510
+ S + SD + S + DS + S + DS + S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 511 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 570
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 571 SDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 630
+ DS + S + D+ + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 631 SDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 690
S + S + S A ++SD + S S + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 691 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSD 750
+ S + SD + S + SDS + S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 751 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDTEPQANND 800
S + S S + +DS + S + +S + S Q +D
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSD 671



Score = 35.5 bits (81), Expect = 0.003
Identities = 94/549 (17%), Positives = 178/549 (32%)

Query: 257 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 316
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 317 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 376
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 377 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSD 436
S + DS + S + SD + S + +DS + S A +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 437 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 496
+ S + SD + S + DS + S + DS + S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 497 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 556
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 557 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSD 616
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 617 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSD 676
S + S + S + ++SD + S + + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 677 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 736
+ S + SD + S + SDS + S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 737 ADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDTEPQ 796
+ + S S + +DS + S + +S + S + SD + +
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 797 ANNDTHTAA 805
A D+ A
Sbjct: 682 AGADSSLIA 690


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05250  RTXTOXIND  29  0.046  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.046
Identities = 15/102 (14%), Positives = 43/102 (42%), Gaps = 7/102 (6%)

Query: 147 SSVRAADAAVAQQQAMVMLNIDQVAHDTAGAVVQLQGYQKLVKIAQAQVDSLKHIGDLIR 206
S ++ + Q+ LN+D+ + + ++ Y+ L ++ ++++D
Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF-------S 241

Query: 207 QRNDAGATSLSDVVQTDTRVEGAQATLIQYQAALERWKATLA 248
A + V++ + + A L Y++ LE+ ++ +
Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL 283


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05260  RTXTOXIND  263  6e-86  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 263 bits (674), Expect = 6e-86
Identities = 105/462 (22%), Positives = 187/462 (40%), Gaps = 64/462 (13%)

Query: 6 AAIFPLVKELDPVAAMADNER---DEAELV------KSRRLIALLALLLVVTGVWAWFAT 56
+ + + K+LD D EL+ + R + + LV+ + +
Sbjct: 20 SETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79

Query: 57 LDEVSTGTGKVIPSSREQVLQTLDGGILTELNVREGSRVAAGQVVARLDPTRSESNVGES 116
++ V+T GK+ S R + ++ ++ I+ E+ V+EG V G V+ +L +E++ ++
Sbjct: 80 VEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139

Query: 117 QAKYRASLAASIRLTA-----EVNNQPLIFPPSLKAWPGLLAEE-TRLYHSRREQLTKSM 170
Q+ + R E+N P + P + + EE RL +EQ +
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199

Query: 171 RQLDQ------------------------SLSLVNSELAINEKLAKTGAASNVEVL---- 202
Q Q + S L L A + VL
Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 203 -----------------RLRQQAADIELKKIDLNTRYYVDAREQLSKANADVASLAEVIK 245
++ + + + + + + ++L + ++ L +
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELA 319

Query: 246 GRADSVARLTVRSPVQGIVKNIKVNTIGGVIAPNGELMDIVPIDGRLLIEARISPRDIAF 305
+ +R+PV V+ +KV+T GGV+ LM IVP D L + A + +DI F
Sbjct: 320 KNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGF 379

Query: 306 IHPDQKALVKITAYDYAIYGALNGVVETISPDTIQDEAKPDVYYYRVFIRTDHNYLENKR 365
I+ Q A++K+ A+ Y YG L G V+ I+ D I+D+ V+ V I + N L
Sbjct: 380 INVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFN--VIISIEENCLS-TG 436

Query: 366 GKRFLIGPGMIATVDIKTGEKTVMDYLVKPF-NRAKEALRER 406
K + GM T +IKTG ++V+ YL+ P E+LRER
Sbjct: 437 NKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


20  D364_RS05420  D364_RS05525  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS05420       2       12   3.479187  NAD(P)H:quinone oxidoreductase
D364_RS27335       3       13   3.937252  hypothetical protein
D364_RS05430       1       14   5.331123  general stress protein
D364_RS05435       1       16   5.364523  DMT family transporter
D364_RS05440       0       17   5.094228  pyrimidine utilization transport protein G
D364_RS05445       0       17   3.910488  pyrimidine utilization flavin reductase protein
D364_RS05450      -1       15   3.169921  malonic semialdehyde reductase
D364_RS05455       0       14   3.018058  pyrimidine utilization protein D
D364_RS05460      -2       19   3.444322  pyrimidine utilization protein C
D364_RS05465      -2       18   2.724418  pyrimidine utilization protein B
D364_RS05470      -2       16   2.859485  pyrimidine utilization protein A
D364_RS05475      -1       14   1.821487  HTH-type transcriptional regulator RutR
D364_RS05480      -1       15   1.876496  DUF1311 domain-containing protein
D364_RS05485      -1       16   1.431591  trifunctional transcriptional regulator/proline
D364_RS05490      -1       14  -0.859210  sodium/proline symporter PutP
D364_RS05495       0       11  -1.190276  DUF3574 domain-containing protein
D364_RS05500      -1       10  -3.510538  nucleoside transporter
D364_RS05505      -2       11  -2.476026  FTR1 family protein
D364_RS05510      -2       11  -2.214224  iron uptake system protein EfeO
D364_RS05515      -3       13  -1.058777  deferrochelatase/peroxidase EfeB
D364_RS05520      -2       19  -3.492445  phosphate starvation-inducible protein PhoH
D364_RS05525      -2       25  -4.607810  DsbA family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05465  ISCHRISMTASE  75  6e-18  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 74.7 bits (183), Expect = 6e-18
Identities = 44/184 (23%), Positives = 73/184 (39%), Gaps = 23/184 (12%)

Query: 4 LPARPESLTFEPQQSALIVVDMQNAYASQGGYLDLAGFDVSATRPVIDNINTAVAAARAA 63
+P S +P ++ L++ DMQN + +D S + NI
Sbjct: 17 MPQNKVSWVPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQL 70

Query: 64 GMLIIWFQNGWDDQYVEAGGPGSPNYHKSNALKTMRQRPELQGKLLAKGGWDYQLVDELT 123
G+ +++ PGS N L G L G ++ +++ EL
Sbjct: 71 GIPVVY-----------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELA 113

Query: 124 PQEGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGIV 183
P++ D+VL K RYS F T L ++R G L+ TGI ++ T + F +
Sbjct: 114 PEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFF 173

Query: 184 LEDA 187
+ DA
Sbjct: 174 VGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05475  HTHTETR  65  4e-15  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.0 bits (158), Expect = 4e-15
Identities = 30/166 (18%), Positives = 62/166 (37%), Gaps = 10/166 (6%)

Query: 10 GKRSQAVSAKKEAILAAALEAFSQFGIHGTRLEQVAERAGVSKTNLLYYYPSKEALYVAV 69
K Q ++ IL AL FSQ G+ T L ++A+ AGV++ + +++ K L+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 LQQILAIWLAPLKAFREDI--SPLVAIREYIRLKLEVSRDHPQASKLF------CLEMLQ 121
+ + ++ PL +RE + LE + + +L E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTV-TEERRRLLMEIIFHKCEFVG 121

Query: 122 GAPLLMGELTGDLKALVDEKSAIVSGWIDRGKL-APVDPQHLIFMI 166
++ D + I+ L A + + ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05515  PF05932  28  0.028  Tir chaperone protein (CesT)
		>PF05932#Tir chaperone protein (CesT)

Length = 127

Score = 28.2 bits (63), Expect = 0.028
Identities = 6/43 (13%), Positives = 15/43 (34%)

Query: 60 FYGRHQAGILTPQQASMMLVAFDVLAADKADLERLFRLLTQRI 102
+ L + S + A+ + +K + L R + +
Sbjct: 74 NPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLL 116


21  D364_RS05675  D364_RS05780  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS05675       2       14   1.966435  ribosomal protein S5-alanine
D364_RS05680       2       12   2.759668  YceH family protein
D364_RS05685      -2       22   2.999994  Gfo/Idh/MocA family oxidoreductase
D364_RS05690      -1       21   2.686865  murein biosynthesis integral membrane protein
D364_RS05695       1       23   2.756531  MFS transporter
D364_RS05700       2       27   1.171463  LysR family transcriptional regulator
D364_RS27340       5       30   0.785786  hypothetical protein
D364_RS05705       4       28   0.068467  23S rRNA pseudouridine(955/2504/2580) synthase
D364_RS05710       2       18  -1.896355  septum formation inhibitor Maf
D364_RS05715       3       19  -1.210717  23S rRNA accumulation protein YceD
D364_RS05720       4       22  -1.666655  50S ribosomal protein L32
D364_RS05725       4       21  -1.335580  phosphate acyltransferase PlsX
D364_RS05730       2       20  -0.972198  ketoacyl-ACP synthase III
D364_RS05735       1       19  -0.604811  ACP S-malonyltransferase
D364_RS05740       0       19  -0.849248  3-oxoacyl-ACP reductase FabG
D364_RS05750       0       15   0.792897  acyl carrier protein
D364_RS05755       0       15   1.107977  beta-ketoacyl-ACP synthase II
D364_RS05760       1       15   0.336321  aminodeoxychorismate lyase
D364_RS05765       0       16   0.126266  cell division protein YceG
D364_RS05770       0       16   0.645911  dTMP kinase
D364_RS05775       3       17   0.641217  DNA polymerase III subunit delta'
D364_RS05780       2       17   0.262696  metal-dependent hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTagHitsScoreE-valueComments
D364_RS05695TCRTETB355e-04 Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 5e-04
Identities = 57/351 (16%), Positives = 117/351 (33%), Gaps = 51/351 (14%)

Query: 61 YALGILFLLPLGDRHDRRRLILVKSALLALLLLLCSLTGQLSSLLVVSLLI---GMAATM 117
+++G L D+ +RL+L + ++ + SLL+++ I G AA
Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 118 AQDIVPAAAILAPAGKQGKMVGTVMTGLLLGILLSRTVSGVVGAVFGWRVMYQAAAVSVA 177
A +V A + +GK G + + + +G + + G++ W + +++
Sbjct: 122 ALVMVVVARYIPKE-NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITII 180

Query: 178 --------------------LIGLVMWRVLPRFAVHSTLSYPQLMASMA----------- 206
+ G+++ V F + T SY ++
Sbjct: 181 TVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHI 240

Query: 207 ----------HLWQRYPALRRAALAQGALSIAFSAFWSTLAVMLSEHYHMGSAVAGGFGI 256
L + P + G + + F S + M+ + + + +A G I
Sbjct: 241 RKVTDPFVDPGLGKNIPFMIGVLCG-GIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII 299

Query: 257 --AGAAGALAAPLAGGLADKFGAGKVTQMGAALVTLSFALMFMLPLLPVHAQLALIALSA 314
+ + + G L D+ G V +G +++SF L +I
Sbjct: 300 FPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVL 359

Query: 315 IGFDLGLQSSLVAHQNLVYGLEPQARGRLNALLFTVVFIGMSLGSVLGSKL 365
G + V + L+ Q G +LL F+ G + L
Sbjct: 360 GGLSF---TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05745  DHBDHDRGNASE  156  4e-49  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 156 bits (395), Expect = 4e-49
Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 13/250 (5%)

Query: 4 EGKIALVTGASRGIGRAIAETLVARGAKVIGTATSESGAQAISDYLGANGK---GLMLNV 60
EGKIA +TGA++GIG A+A TL ++GA + + + + L A + +V
Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66

Query: 61 TDPASIESVLENVRAEFGEVDILVNNAGITRDNLLMRMKDDEWNDIIETNLSSVFRLSKA 120
D A+I+ + + E G +DILVN AG+ R L+ + D+EW N + VF S++
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 121 VMRAMMKKRHGRIITIGSVVGTMGNAGQANYAAAKAGLIGFSKSLAREVASRGITVNVVA 180
V + MM +R G I+T+GS + A YA++KA + F+K L E+A I N+V+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 181 PGFIETDMTRAL-----TDEQR-AGTLA----AVPAGRLGTPNEIASAVAFLASDEASYI 230
PG ETDM +L EQ G+L +P +L P++IA AV FL S +A +I
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 231 TGETLHVNGG 240
T L V+GG
Sbjct: 247 TMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS05750  TYPE4SSCAGA  28  0.005  Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 27.7 bits (61), Expect = 0.005
Identities = 16/48 (33%), Positives = 25/48 (52%)

Query: 10 KIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPD 57
++ G Q + QEE+ N F+E L ++ L +E+F TEI D
Sbjct: 380 QLTGSQRALSQEEIQNKIDFMEFLAQNNAKLDNLSEKEKEKFRTEIKD 427


22  D364_RS06150  D364_RS06320  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS06150       3       19  -2.041491  type II asparaginase
D364_RS06165       4       20  -2.946022  kdo(2)-lipid IV(A) palmitoleoyltransferase
D364_RS06170       0       19  -2.194273  leucine efflux protein LeuE
D364_RS06175      -1       19  -1.374928  DUF2534 family protein
D364_RS26895      -1       18   0.203092  hypothetical protein
D364_RS26900       0       19   2.587614  protein YoaJ
D364_RS26905      -2       19   3.537452  YoaK family small membrane protein
D364_RS06190       2       16   5.509382  sensor domain-containing diguanylate cyclase
D364_RS06195       4       18   6.811603  DUF333 domain-containing protein
D364_RS06200       3       18   6.461883  DUF488 family protein
D364_RS06205       2       18   6.370908  DUF523 domain-containing protein
D364_RS06210       1       16   3.597245  CTP synthase
D364_RS06215       1       18   2.826347  CynX/NimT family MFS transporter
D364_RS06220      -2       16   0.201887  helix-turn-helix transcriptional regulator
D364_RS06225      -1       18   0.028151  DUF441 domain-containing protein
D364_RS06230       0       20   0.495529  YbaK/prolyl-tRNA synthetase associated
D364_RS06235       0       19   1.010450  glycosyltransferase family 9 protein
D364_RS06240       1       17   3.894911  DinI family protein
D364_RS06245       0       16   4.782047  translesion error-prone DNA polymerase V
D364_RS06250       0       15   4.991962  Y-family DNA polymerase
D364_RS06255       0       15   5.606264  LysR family transcriptional regulator
D364_RS06260       0       13   5.380652  SDR family oxidoreductase
D364_RS06265       0       17   3.324368  MFS transporter
D364_RS06270       0       16  -0.002056  LysR family transcriptional regulator
D364_RS27365       0       14  -3.331337  hypothetical protein
D364_RS06280       1       12  -3.216496  hypothetical protein
D364_RS06285       0       11  -2.516547  glutathione peroxidase
D364_RS06290      -1       11  -2.409344  hypothetical protein
D364_RS06295       1       17  -2.719799  YeaH/YhbH family protein
D364_RS06300       1       18  -2.748227  protein kinase YeaG
D364_RS06305       1       20  -1.890587  MipA/OmpV family protein
D364_RS06310       1       21  -1.975199  aldo/keto reductase
D364_RS06315       2       22  -2.549420  D-hexose-6-phosphate mutarotase
D364_RS06320       2       23  -1.843885  glyceraldehyde-3-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06225  PRTACTNFAMLY  27  0.045  Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 26.6 bits (58), Expect = 0.045
Identities = 17/64 (26%), Positives = 26/64 (40%)

Query: 46 VEKQGLTVGIIILTIGVMAPIASGTLPPSTLIHSFMNWKSLLAIAVGVFVSWLGGRGVSL 105
V Q + L IG + + LPPS ++ N ++ A VS LG ++L
Sbjct: 171 VTVQRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTL 230

Query: 106 MGSQ 109
G
Sbjct: 231 DGGH 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06260  DHBDHDRGNASE  123  3e-36  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 123 bits (309), Expect = 3e-36
Identities = 71/254 (27%), Positives = 121/254 (47%), Gaps = 10/254 (3%)

Query: 2 SKKLADKVALVTGGSAGIGLASAKALAEQGAKVY---ITGRRQEELDAAVRFIGPAARAI 58
+K + K+A +TG + GIG A A+ LA QGA + + E++ ++++ A A
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62

Query: 59 RADAAVLSDLDAVFATIAEESGRLDVLFANAGGGDMLPLSAITEAHVDRIFATNVRGVVF 118
AD + +D + A I E G +D+L AG + ++++ + F+ N GV
Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 119 TVQKALPLLAD--GASVILTGSTAAVKGTANFSIYSASKAAVRSLARSWALEVSDRGIRI 176
+ + D S++ GS A + + Y++SKAA + LE+++ IR
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 177 NVVSPGPVRTPGLGGLVAEADRQ-----GLFDALAAGVPLGRLGEPEEIGRTVVFLASDE 231
N+VSPG T L A+ + G + G+PL +L +P +I V+FL S +
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 232 SSFINAAEIYVDGG 245
+ I + VDGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06265  TCRTETB  47  6e-08  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 47.2 bits (112), Expect = 6e-08
Identities = 33/142 (23%), Positives = 60/142 (42%), Gaps = 1/142 (0%)

Query: 40 LSALAADFHQTESGVGLAVTAYGWVGALAALLSGAMPARISRKALLVGLMLILALSCLAA 99
L +A DF++ + TA+ ++ + G + ++ K LL+ ++I +
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 100 TRSYSMFA-LMSARMIGALAHGAFWALIGIVAAQLVPPHRLGLATAIIFGGVSAASVVGV 158
+S F+ L+ AR I AF AL+ +V A+ +P G A +I V+ VG
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 159 PLASFIATLAGWRLAFMSMALL 180
+ IA W + +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMIT 178


23  D364_RS06370  D364_RS27370  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
D364_RS06370      -1       13    3.394780  selenide, water dikinase SelD
D364_RS06375      -1       12    3.920311  DNA topoisomerase III
D364_RS06380       0       14    4.775880  NAD(P)H-quinone oxidoreductase
D364_RS06385       3       15    7.167712  NADP-specific glutamate dehydrogenase
D364_RS06395       4       17    8.659065  pyrimidine (deoxy)nucleoside triphosphate
D364_RS06400       4       17    8.535586  CDP-alcohol phosphatidyltransferase family
D364_RS06405       4       16    8.677357  sulfurtransferase
D364_RS06410       4       16    8.684579  ATP-binding cassette domain-containing protein
D364_RS06415       3       16    7.190342  ABC transporter permease subunit
D364_RS06420      -1       18    4.871800  ABC transporter substrate-binding protein
D364_RS06425       0       19    3.744201  carboxymuconolactone decarboxylase family
D364_RS06430       0       18    4.602041  TVP38/TMEM64 family protein
D364_RS06435      -1       16    4.739661  TVP38/TMEM64 family protein
D364_RS06440      -1       16    5.012024  exodeoxyribonuclease III
D364_RS06445      -1       16    5.056360  aspartate aminotransferase family protein
D364_RS06450       0       17    5.219193  arginine N-succinyltransferase
D364_RS06455      -1       15    4.371195  succinylglutamate-semialdehyde dehydrogenase
D364_RS06460       0       14    2.756157  N-succinylarginine dihydrolase
D364_RS06465       0       13    0.384946  succinylglutamate desuccinylase
D364_RS06475       0       18   -2.403035  ATP-independent periplasmic protein-refolding
D364_RS06480      -1       14   -2.415686  excinuclease Cho
D364_RS06485      -1       17   -4.075557  ammonia-dependent NAD(+) synthetase
D364_RS06490       1       17   -6.080930  osmotically-inducible lipoprotein OsmE
D364_RS06495       0       13   -5.399170  PTS N,N'-diacetylchitobiose transporter subunit
D364_RS27370      -2       11   -3.842183  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06410  PF05272  28  0.029  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.029
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 4/44 (9%)

Query: 11 VRRQPLLQEVAFSVAPG----EVLTLMGPSGSGKSTLFAWMIGA 50
V + L+ VA + PG + L G G GKSTL ++G
Sbjct: 576 VGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619


24  D364_RS06835  D364_RS07200  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
D364_RS06835      -1       16    3.560026  peptide ABC transporter ATP-binding protein
D364_RS06840       1       14    4.590046  peptide ABC transporter permease SapC
D364_RS06845       1       14    4.463962  putrescine ABC transporter permease SapB
D364_RS06850       1       16    5.024685  peptide ABC transporter substrate-binding
D364_RS06855       2       16    5.293730  SDR family oxidoreductase
D364_RS06860       2       16    4.465990  helix-turn-helix transcriptional regulator
D364_RS06865       3       17    4.023940  helix-turn-helix domain-containing protein
D364_RS06870       2       16    2.628133  amidohydrolase
D364_RS06875       4       17    2.429241  MFS transporter
D364_RS06880       5       15    1.971466  phage shock protein operon transcriptional
D364_RS06885       4       19    2.689317  phage shock protein PspA
D364_RS06890       2       19    3.071713  envelope stress response membrane protein PspB
D364_RS06895       1       15    2.558558  envelope stress response membrane protein PspC
D364_RS06900       1       14    2.831347  phage shock protein PspD
D364_RS06905       1       14    3.008290  YcjX family protein
D364_RS06910       1       12    2.880562  YcjF family protein
D364_RS06915       2       11    2.995630  LysR family transcriptional regulator
D364_RS06920       2       13    3.630739  6-phospho-beta-glucosidase
D364_RS06925       2       14    3.906361  transcriptional regulator TyrR
D364_RS06930      -1       12    2.919210  AbrB family transcriptional regulator
D364_RS06935      -1       11    1.690846  thiol peroxidase
D364_RS06940       1       14   -2.648256  L-Ala-D/L-Glu epimerase
D364_RS06945       3       16   -4.638234  murein tripeptide amidase MpaA
D364_RS27380       4       19   -6.073279  hypothetical protein
D364_RS06955       3       20   -4.354407  peptide ABC transporter substrate-binding
D364_RS06960       2       27   -5.824924  hypothetical protein
D364_RS06965       2       29   -5.637151  hypothetical protein
D364_RS06970       1       23   -2.811480  LysR family transcriptional regulator
D364_RS06975      -1       20   -1.403614  alpha/beta hydrolase
D364_RS06980      -1       21   -0.201797  MFS transporter
D364_RS06985       0       24   -6.014123  aldo/keto reductase
D364_RS06995       0       22   -4.993483  hypothetical protein
D364_RS25905       1       22   -5.810063  LysR family transcriptional regulator
D364_RS07000       1       14   -3.724736  SDR family oxidoreductase
D364_RS07005      -1       11   -3.489930  HD domain-containing protein
D364_RS07010      -2       11   -3.346765  DJ-1/PfpI family protein
D364_RS07015       0        8    2.381899  type VI secretion system contractile sheath
D364_RS07025       1        9    1.774217  type VI secretion system contractile sheath
D364_RS07030       3       10    3.534396  type VI secretion system baseplate subunit TssK
D364_RS07035       3       12    3.832625  type VI secretion system protein TssL, short
D364_RS07040       3       13    3.111871  OmpA family protein
D364_RS07050       1       13    1.585920  type VI secretion system effector Hcp
D364_RS07055       0       14    1.922137  type VI secretion system ATPase TssH
D364_RS07060       2       18    1.032287  type VI secretion system tip protein VgrG
D364_RS07065       2       15    0.882985  DUF4123 domain-containing protein
D364_RS07070       2       15    1.124410  T6SS immunity phospholipase A1-binding
D364_RS07075       1       13    1.146387  DUF2235 domain-containing protein
D364_RS07080       2       12    1.717525  PAAR domain-containing protein
D364_RS07085       1       12    1.617795  hypothetical protein
D364_RS07090       1       11    1.357119  type VI secretion protein VasK
D364_RS07095      -1       13    1.368392  type VI secretion system protein TssA
D364_RS07100      -2       13    2.587614  type VI secretion system baseplate subunit TssF
D364_RS07105      -1       16    4.078088  type VI secretion system baseplate subunit TssG
D364_RS07110      -1       16    4.585260  type VI secretion system lipoprotein TssJ
D364_RS07120       1       17    5.024983  hypothetical protein
D364_RS07125       1       16    5.252732  SDR family oxidoreductase
D364_RS07130       1       17    5.275865  TetR/AcrR family transcriptional regulator
D364_RS07135       0       17    5.221170  SDR family oxidoreductase
D364_RS07140       1       19    4.670438  AraC family transcriptional regulator
D364_RS07145       1       19    3.655028  DUF1471 domain-containing protein
D364_RS07150       0       19    4.870157  aromatic alcohol reductase
D364_RS07155       2       21    5.553292  LysR family transcriptional regulator
D364_RS27385      -1       20    6.084614  hypothetical protein
D364_RS07160       0       19    5.432610  MBL fold metallo-hydrolase
D364_RS07165       0       17    3.850024  NmrA/HSCARG family protein
D364_RS07170       0       16    2.625940  TetR/AcrR family transcriptional regulator;
D364_RS07175      -1       16    1.698391  NADP-dependent oxidoreductase
D364_RS07180      -1       16    1.648433  aminotransferase class I/II-fold pyridoxal
D364_RS07185      -1       15    0.333895  lipoprotein
D364_RS07190       1       17   -0.466888  ATP-binding cassette domain-containing protein
D364_RS07195       1       15    1.001201  ABC transporter permease
D364_RS07200       2       14    0.956252  aspartate/tyrosine/aromatic aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06835  HTHFIS  31  0.007  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.007
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGKSLIAKAI 53
+ GESG+GK L+A+A+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06855  NUCEPIMERASE  48  2e-08  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 47.9 bits (114), Expect = 2e-08
Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 11/83 (13%)

Query: 2 MRIFLTGASGFIGSRILPALQASGHQVIGL---------ARSESTAQALKAAGAEVHRGT 52
M+ +TGA+GFIG + L +GHQV+G+ + ++ + L G + H+
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 53 LDAPESL--LAGVGNADAVIHTA 73
L E + L G+ + V +
Sbjct: 61 LADREGMTDLFASGHFERVFISP 83


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06875  TCRTETB  87  1e-20  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 87.2 bits (216), Expect = 1e-20
Identities = 77/398 (19%), Positives = 156/398 (39%), Gaps = 22/398 (5%)

Query: 35 VINV-VPAMKSSLDISLETLTLAVSLSALFSGCFVVASGGLADKFGRMRMTTLGLGLSIV 93
V+NV +P + + + + + L G L+D+ G R+ G+ ++
Sbjct: 32 VLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF 91

Query: 94 GSAMLVVAQGP-GLFLAGRVLQGLSAACIMPATLALIKTWYEGRARQRAVSFWVIGSWGG 152
GS + V L + R +QG AA + ++ + R +A G
Sbjct: 92 GSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMG 151

Query: 153 SGLCSFVGGAIATGLGWRWIFVFSIAVALLALFLLRGTPESRSASASQHKLDVGGLLSLI 212
G+ +GG IA + W ++ + + + FL++ + H D+ G++ +
Sbjct: 152 EGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKK--EVRIKGH-FDIKGIILMS 208

Query: 213 VALVLVNLFISKGHGWGWSSPLSLTMLAGALAAGTIFIRNGMRKGEAALIDFALFSNRAY 272
V +V LF + + + L+ L IF+++ +RK +D L N +
Sbjct: 209 VGIVFFMLF-TTSYSISFLIVSVLSFL--------IFVKH-IRKVTDPFVDPGLGKNIPF 258

Query: 273 GAAVLSNFLLNGAI-GTMMIASIWLQQGHHLTPLESGMMTLGYLVTVLAMIR--VGEKLL 329
VL ++ G + G + + ++ H L+ E G + + + T+ +I +G L+
Sbjct: 259 MIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII-FPGTMSVIIFGYIGGILV 317

Query: 330 QRYGARLPMMAGPVLTAIAIALISCTFLEKALYIGVVFASNVLFGLGLGCYATPSTDTAV 389
R G + G +++ L + LE + + + G GL T +
Sbjct: 318 DRRGPLYVLNIGVTFLSVSF-LTASFLLETTSWF-MTIIIVFVLG-GLSFTKTVISTIVS 374

Query: 390 ANAPENKIGVASGIYKMGSSLGGAMGIAVTVSLFTLFL 427
++ + + G + S L GIA+ L ++ L
Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06880  HTHFIS  344  e-118  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 344 bits (884), Expect = e-118
Identities = 127/346 (36%), Positives = 183/346 (52%), Gaps = 26/346 (7%)

Query: 7 AQYKDNLLGEANSFLEVLEQVSRLAPLDKPVLVIGERGTGKELIANRLHYLSSRWQGPFI 66
+Q L+G + + E+ ++RL D +++ GE GTGKEL+A LH R GPF+
Sbjct: 133 SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFV 192

Query: 67 SLNCAALNDNLLDSELFGHEAGAFTGASKRHPGRFERADGGTLFLDELATAPMLVQEKLL 126
++N AA+ +L++SELFGHE GAFTGA R GRFE+A+GGTLFLDE+ PM Q +LL
Sbjct: 193 AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLL 252

Query: 127 RVIEYGELERVGGSQPLQVNVRLVCATNADLPQMVEEGHFRADLLDRLAFDVVQLPPLRD 186
RV++ GE VGG P++ +VR+V ATN DL Q + +G FR DL RL ++LPPLRD
Sbjct: 253 RVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRD 312

Query: 187 RQSDIMLLANQFAIQMCRELGLPLFPGFSERATATLLGYRWPGNIRELKNVVERSVYRHG 246
R DI L F Q +E F + A + + WPGN+REL+N+V R +
Sbjct: 313 RAEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYP 370

Query: 247 DSE--------HELDAIIINPFRQSPG---------------SPPEAAPGDELPALPLDL 283
I +P ++ A+ GD LP L
Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL-Y 429

Query: 284 RDFQFQQEKRLLQRSLEQAKYHQKQAAELLGLTYHQLRALLKKHQL 329
+ E L+ +L + +Q +AA+LLGL + LR +++ +
Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06925  HTHFIS  307  e-101  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 307 bits (789), Expect = e-101
Identities = 112/377 (29%), Positives = 172/377 (45%), Gaps = 38/377 (10%)

Query: 174 VLTGAVAMLRSTVRMGRQLQTMTSQDTSAFSQILAVGPKMRHVVEQARKLAMLSAPLLIV 233
LT + ++ + ++ + D+ ++ M+ + +L L+I
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166

Query: 234 GDTGTGKDLLAHACHLASPRAGKPYLALNCGSIPEDAVESELFG-------DALQGKKGF 286
G++GTGK+L+A A H R P++A+N +IP D +ESELFG A G
Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226

Query: 287 FEQANGGSVLLDEIGEMSPRMQTKLLRFLNDGTFRRVGEDHEVHVDVRVICATQKNLIEL 346
FEQA GG++ LDEIG+M QT+LLR L G + VG + DVR++ AT K+L +
Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286

Query: 347 VQKGLFREDLYYRLNVLTLYLPPLRDCPQDIMPLTELFVARFADEQGIPRPKLSGDLSTV 406
+ +GLFREDLYYRLNV+ L LPPLRD +DI L FV + E G+ + + +
Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345

Query: 407 LTRYSWPGNVRQLKNAVYRALTQLEGFELRPQDILLP---------------DHDVASLP 451
+ + WPGNVR+L+N V R + + I S+
Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405

Query: 452 VGEEAM--------------EGSLDDITRRFERSVLTQ-LYRSYPSTRKLAKRLGVSHTA 496
E G D + E ++ L + + K A LG++
Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465

Query: 497 IANKLREYGLSQKKGDE 513
+ K+RE G+S +
Sbjct: 466 LRKKIRELGVSVYRSSR 482


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS06980  TCRTETB  33  0.003  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 32.5 bits (74), Expect = 0.003
Identities = 21/113 (18%), Positives = 45/113 (39%), Gaps = 2/113 (1%)

Query: 76 RKWLLLGLTALMAASGVIIALASSFPVYMLGRALIGIVIGGFWSMSAATAIRLVPQRQVP 135
++ LL G+ S + S F + ++ R + G F ++ R +P+
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 136 RALAIFNGGNALATVVAAPLGSYLGATVGWRGAFLCLVPLAVLAFVWQCISLP 188
+A + A+ V +G + + W ++L L+P+ + V + L
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFLMKLL 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07005  DHBDHDRGNASE  73  2e-17  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 72.8 bits (178), Expect = 2e-17
Identities = 64/254 (25%), Positives = 102/254 (40%), Gaps = 15/254 (5%)

Query: 6 KIALVTGGSRGLGRATVEALAQRGVNVVLTYKTRLAEANEVVTRVEALGARAIALPFSAG 65
KIA +TG ++G+G A LA +G ++ + +VV+ ++A A A P
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 66 EIDTFDAFVSAFQGALTELGADKFDYLVNNAGNASGMGFLNATEAEFDALYRIHVKSVFF 125
D D+ A E D LVN AG + ++ E++A + ++ VF
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 126 LSQKLLPLLAD--GGRIVNVSSGLTRIVMANRAPYAIMKSAVETLTRYMAFELGSRGITV 183
S+ + + D G IV V S + + A YA K+A T+ + EL I
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 184 NCVAPGAIATDFSGGVVRDNPQVAQAVANMTA-------LGRPGLPEDIGPMIASLLSDD 236
N V+PG+ TD + D Q + L + P DI + L+S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 237 HRWVNAQRIEVSGG 250
+ + V GG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07045  OMPADOMAIN  92  3e-22  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 91.5 bits (227), Expect = 3e-22
Identities = 45/146 (30%), Positives = 63/146 (43%), Gaps = 12/146 (8%)

Query: 416 PPPPPRPVQRVAPNVIRLDSMSLFDTGKWVLKPGSTKRL--VSSLMDIKARPGWLIVVAG 473
P P P V L S LF+ K LKP L + S + +VV G
Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLG 259

Query: 474 HTDSVGEEKANQLLSLKRAESVRDWMRDTGDVPDSCFAVQGYGESRPIATNDT------- 526
+TD +G + NQ LS +RA+SV D++ G +P + +G GES P+ N
Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDYLISKG-IPADKISARGMGESNPVTGNTCDNVKQRA 318

Query: 527 --PEGRALNRRVEISLVPQVDACRLP 550
+ A +RRVEI + D P
Sbjct: 319 ALIDCLAPDRRVEIEVKGIKDVVTQP 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07055  HTHFIS  31  0.020  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.3 bits (71), Expect = 0.020
Identities = 30/105 (28%), Positives = 42/105 (40%), Gaps = 17/105 (16%)

Query: 580 GKRVVGQQAALSAIARRL-RAAKTGLTPENGPQGVFLLVGPSGTGKTETALALADALFGG 638
G +VG+ AA+ I R L R +T LT ++ G SGTGK A AL D
Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLT--------LMITGESGTGKELVARALHDYGKRR 187

Query: 639 EKALITINLSEYQEPHTVSQLKGSPPGYVGYGQGGILTEAVRKRP 683
+ IN++ S+L G + G T A +
Sbjct: 188 NGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRST 224


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07125  DHBDHDRGNASE  82  9e-21  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 81.6 bits (201), Expect = 9e-21
Identities = 64/249 (25%), Positives = 111/249 (44%), Gaps = 24/249 (9%)

Query: 7 KSVLVLGGSRGIGAAIVRRFVADGASVVFSYSGSPQAAERLAAETGSTA-----VQADSA 61
K + G ++GIG A+ R + GA + + +P+ E++ + + A AD
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 62 DRDAVISLV----RDSGPLDVLVVNAGIALFGDALEQDSDAIDRLFRINIHAPYHASVEA 117
D A+ + R+ GP+D+LV AG+ G + + F +N ++AS
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 118 ARRMP--EGGRIIVIGSVNGDRMPLPGMAAYALSKSALQGLARGLARDFGPRGITVNVVQ 175
++ M G I+ +GS N +P MAAYA SK+A + L + I N+V
Sbjct: 128 SKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 176 PGPIDTDA--------NPENGPMKELMHSF---MAIKRHGRPEEVAGMVAWLAGPEASFV 224
PG +TD N +K + +F + +K+ +P ++A V +L +A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 225 TGAMHTIDG 233
T +DG
Sbjct: 247 TMHNLCVDG 255


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07130  HTHTETR  44  7e-08  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 44.2 bits (104), Expect = 7e-08
Identities = 20/122 (16%), Positives = 47/122 (38%), Gaps = 2/122 (1%)

Query: 7 GRTPGRPRQFDAEQAIETAQRLFHARGYDAVSVADLTHAFGINPPSFYAAFGSKLGLYTR 66
+T ++ + ++ A RLF +G + S+ ++ A G+ + Y F K L++
Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61

Query: 67 VLQR-YSQIGAIPIDALLRDDQPVAASLIAVLQEAARRYVADPAAAGCLVLEGVHCQDAD 125
+ + S IG + ++ + + L +L V + + + C+
Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 126 AR 127

Sbjct: 122 EM 123


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07135  DHBDHDRGNASE  103  6e-29  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 103 bits (259), Expect = 6e-29
Identities = 67/258 (25%), Positives = 110/258 (42%), Gaps = 16/258 (6%)

Query: 3 KIALITGANRGLGRQTALDIARQGGDVIVTYRGSLEQAEAVVADIRALGRKAIALPLDMA 62
KIA ITGA +G+G A +A QG + + E+ E VV+ ++A R A A P D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 63 QTASFPAFADSLGSALASVWGRATFDHLINNAGHGEFAPLAETREAQFDGLFNVHVKGVF 122
+A+ + + D L+N AG + + +++ F+V+ GVF
Sbjct: 68 DSAAIDEITARIEREMGP------IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 123 FLVQTLLPLLAD--GGRIVNFSSGLTRVSYPGFSAYAAAKAAVEMLSVYMARELGGRGIT 180
+++ + D G IV S V +AYA++KAA M + + EL I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 181 VNTIAPGAIATDFGGGL-VRDDAEVN------AQFAAMTALGRVGVPEDIGPMIASLLRD 233
N ++PG+ TD L ++ F L ++ P DI + L+
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 234 DNRWVTAQRIEVSGGQTI 251
+T + V GG T+
Sbjct: 242 QAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07170  NUCEPIMERASE  40  6e-06  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 40.2 bits (94), Expect = 6e-06
Identities = 29/129 (22%), Positives = 47/129 (36%), Gaps = 19/129 (14%)

Query: 6 TVLVFGATGQQGGSVARALLHRGWRVRALVRDPFSAG---------AAALAARGAELVVG 56
LV GA G G V++ LL G +V + D + LA G +
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI--DNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 57 TFEDRAAMRSAMA--GVDGVF------SVQPSSPGGTVTDEQEVRYGITIADLAVECGVK 108
DR M A + VF +V+ S + + + I + ++
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 109 HLVYSSGSA 117
HL+Y+S S+
Sbjct: 120 HLLYASSSS 128


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07175  HTHTETR  59  3e-13  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 3e-13
Identities = 28/166 (16%), Positives = 58/166 (34%), Gaps = 12/166 (7%)

Query: 1 MRADARKNYDLLIEVARDVFVEQGAEA-SLRDIARRAGVGMGTLYRHFPNRDSLLEALLR 59
+ +A++ +++VA +F +QG + SL +IA+ AGV G +Y HF ++ L +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 60 SRFAALTARAESLL------LAADPAAALLEWLAESVAFTHQHRGIIAPLMSAIDDPESA 113
+ + + L+ L +V + + E A
Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124

Query: 114 L-----HSACVALRAAGTSLLTRAQQAGLARPDLSGEELFDLIAAL 154
+ + C+ L +A + DL ++
Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170


25  D364_RS07295  D364_RS07370  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
D364_RS07295       0       16    4.547171  DedA family protein
D364_RS07300       0       17    5.352323  YdbH family protein
D364_RS07305       0       18    6.576753  YnbE family lipoprotein
D364_RS07310       0       13    4.240874  YdbL family protein
D364_RS07315      -1       11    3.951796  transcriptional regulator FeaR
D364_RS07320      -1       11    3.650748  aldehyde dehydrogenase family protein
D364_RS07325      -1       11    2.733990  oxidoreductase
D364_RS07330      -1       11    1.937680  zinc-dependent alcohol dehydrogenase family
D364_RS07335      -2       12    0.836330  primary-amine oxidase
D364_RS07340      -2       11    2.214701  phenylacetic acid degradation bifunctional
D364_RS07345       0       14    2.144439  1,2-phenylacetyl-CoA epoxidase subunit A
D364_RS07350       1       14    2.838781  1,2-phenylacetyl-CoA epoxidase subunit B
D364_RS07355       1       11    3.479187  phenylacetate-CoA oxygenase subunit PaaC
D364_RS07360       0       13    3.166413  phenylacetate-CoA oxygenase subunit PaaJ
D364_RS07365       0       13    3.785302  phenylacetate-CoA oxygenase/reductase subunit
D364_RS07370       0       11    3.076818  2,3-dehydroadipyl-CoA hydratase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07305  PF06291  27  0.002  Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 27.3 bits (60), Expect = 0.002
Identities = 9/17 (52%), Positives = 13/17 (76%)

Query: 1 MKKLLLAMAASMLLAGC 17
MKK+L + A +ML+ GC
Sbjct: 6 MKKMLFSAALAMLITGC 22


26  D364_RS07445  D364_RS07605  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
D364_RS07445       2       13   -0.877009  MarR family transcriptional regulator
D364_RS07450       2       13   -0.134223  LysR family transcriptional regulator
D364_RS07455       0       11    2.701189  aldo/keto reductase
D364_RS07460      -2       14    3.887310  Glu/Leu/Phe/Val dehydrogenase
D364_RS27390       0       16    4.889607  hypothetical protein
D364_RS07470      -1       19    5.131050  aspartate aminotransferase family protein
D364_RS07475       0       19    4.673029  arginine N-succinyltransferase
D364_RS07480       1       19    5.337868  succinylglutamate-semialdehyde dehydrogenase
D364_RS07485       1       19    4.436218  N-succinylarginine dihydrolase
D364_RS07490       1       18    4.018417  succinylglutamate desuccinylase
D364_RS07495       1       15    2.651098  amino acid permease
D364_RS07500       0       12    2.216390  MHS family MFS transporter
D364_RS07505       0       13    1.489449  YdcF family protein
D364_RS27395       0       12   -1.040100  hypothetical protein
D364_RS07515       1       12    0.769130  aldehyde dehydrogenase
D364_RS07520       2       16    0.503101  type I glyceraldehyde-3-phosphate dehydrogenase
D364_RS07525       4       16   -0.456051  YqaE/Pmp3 family membrane protein
D364_RS07530       3       15   -0.186807  DUF1398 domain-containing protein
D364_RS07535       1       16   -3.371451  sulfurtransferase-like selenium metabolism
D364_RS07540       0       15   -2.228864  selenium metabolism membrane protein YedE/FdhT
D364_RS07545      -1       17   -2.449756  DUF1304 domain-containing protein
D364_RS07550      -2       12   -1.759804  hypothetical protein
D364_RS07555      -3       13    0.090889  putative selenium delivery protein YdfZ
D364_RS07560      -2       12    0.790540  IS481-like element ISKpn28 family transposase
D364_RS07565      -1       16    3.022913  magnesium transporter CorA
D364_RS07570      -2       18    2.764786  pyridoxal phosphate-dependent aminotransferase
D364_RS07575      -1       16    2.558260  maltose/glucose-specific PTS transporter subunit
D364_RS07580       0       16    2.816653  Mal regulon transcriptional regulator MalI
D364_RS07585       1       16    2.129379  YdgA family protein
D364_RS07590       0       14    1.521101  mannose-6-phosphate isomerase
D364_RS07595       1       12    1.033304  fumarate hydratase
D364_RS07600       1       13    1.144540  class II fumarate hydratase
D364_RS07605       2       16    0.859682  DNA replication terminus site-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07500  TCRTETB  34  0.001  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.7 bits (77), Expect = 0.001
Identities = 35/159 (22%), Positives = 66/159 (41%), Gaps = 27/159 (16%)

Query: 59 ILSWL--SFSLTFFIRPIGGVIFAHIGDRIGRKKTLVLTLSLMGSATVAIGLLPTYEMVG 116
+W+ +F LTF IG ++ + D++G K+ L+ + + +V VG
Sbjct: 50 STNWVNTAFMLTF---SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI-------GFVG 99

Query: 117 LWAPALLIILRIIQGMGIGGEWGGALLLAYEYAPEKRK----GFFGSIPQAGVTIGMLMA 172
+LLI+ R IQG G +++ Y P++ + G GSI G +G +
Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159

Query: 173 TFIVSLMTLFDEAQFLAWGWRIPFLLSSVLVFLGLWIRK 211
I A ++ W + L+ + + ++ K
Sbjct: 160 GMI---------AHYIHWSYL--LLIPMITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07535  PF01206  93  4e-29  SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 92.9 bits (231), Expect = 4e-29
Identities = 16/71 (22%), Positives = 38/71 (53%)

Query: 7 DYRLDMVGEPCPYPAVATLEAMPSLQKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66
D LD G CP P + + + ++ GE+L V++ P S+ + ++ G+ +L+ ++
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 67 DGPTIRYLIQK 77
+ T + +++
Sbjct: 65 EDGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS07585  FLGMOTORFLIG  31  0.008  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 31.3 bits (71), Expect = 0.008
Identities = 25/134 (18%), Positives = 52/134 (38%), Gaps = 13/134 (9%)

Query: 316 AQSQALLAKPELAQNPELYQQALTETLFNALPILLKGNPSVTISPLS-WRNAKGESTLNL 374
+Q + K + EL +++L A+ I+ ++ P R A + LN
Sbjct: 74 MMAQEFIQKGGIDYARELLEKSLGTQ--KAVDIINNLGSALQSRPFEFVRRADPANILNF 131

Query: 375 SVLLKDPAQVTAPPQTLADSLDRVVQSLDGKVV--IPVDMATEFMTKIAGLEGYQPADAA 432
++ PQT+A L + ++ +P ++ T +IA ++ P
Sbjct: 132 ---IQQEH-----PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVR 183

Query: 433 KLADQQVKGLAAMG 446
++ K LA++
Sbjct: 184 EVERVLEKKLASLS 197


27  D364_RS07680  D364_RS07985  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias     %GCBias  Product
D364_RS07680      -2       22   -4.568628  fumarate/nitrate reduction transcriptional
D364_RS07685       0       26   -3.833378  methylated-DNA--[protein]-cysteine
D364_RS07690      -1       31   -5.627421  hypothetical protein
D364_RS07695       1       31   -5.135137  pyocin activator PrtN family protein
D364_RS07700       2       36   -5.687206  hypothetical protein
D364_RS07705       3       40   -6.775079  hypothetical protein
D364_RS07710       4       40   -6.410746  hypothetical protein
D364_RS07715       4       42   -7.198301  ParB/RepB/Spo0J family partition protein
D364_RS07720       4       45   -7.655802  hypothetical protein
D364_RS07725       4       46   -8.487188  helix-turn-helix domain-containing protein
D364_RS07735       3       40   -7.830673  Cro/Cl family transcriptional regulator
D364_RS25940       3       34   -6.422099  hypothetical protein
D364_RS07740       3       31   -5.308293  DUF4222 domain-containing protein
D364_RS25945       2       33   -6.125776  Dam family site-specific
D364_RS07750       1       37   -8.060902  RusA family crossover junction
D364_RS07760       0       43   -9.161859  DUF968 domain-containing protein
D364_RS07765       1       42   -8.861529  DUF1133 family protein
D364_RS07770       1       48  -10.287622  MFS transporter
D364_RS26915       3       60  -14.787314  phage holin family protein
D364_RS25950       3       61  -15.141388  glycoside hydrolase family 19 protein
D364_RS07775       3       57  -14.554128  hypothetical protein
D364_RS07780       3       54  -14.150434  DUF4747 family protein
D364_RS07785       1       57  -13.385909  hypothetical protein
D364_RS07795       2       51  -12.440965  HNH endonuclease
D364_RS07800       3       45   -9.023528  phage terminase small subunit P27 family
D364_RS25960       1       37   -6.214025  phage portal protein
D364_RS07820       1       34   -6.104557  HK97 family phage prohead protease
D364_RS07830       0       26   -5.185413  phage major capsid protein
D364_RS07835       0       26   -4.474669  head-tail connector protein
D364_RS07840       1       30   -4.976530  phage head closure protein
D364_RS07845       3       35   -5.482562  HK97 gp10 family phage protein
D364_RS07850       2       35   -4.840180  DUF3168 domain-containing protein
D364_RS07860       3       46   -7.331114  immunoglobulin domain-containing protein
D364_RS07865       3       45   -7.401314  phage tail assembly chaperone
D364_RS07870       2       46   -7.624068  DUF4035 domain-containing protein
D364_RS07875       2       45   -7.675644  hypothetical protein
D364_RS07880       1       40   -6.650100  phage tail tape measure protein
D364_RS07885       1       38   -6.013002  hypothetical protein
D364_RS07890       2       35   -5.493522  DUF1833 family protein
D364_RS07895       3       38   -7.062214  nitrite transporter
D364_RS07905       2       37   -6.684086  hypothetical protein
D364_RS07910       0       40   -7.809619  hypothetical protein
D364_RS07915       3       46  -10.357375  hypothetical protein
D364_RS07920       0       33   -7.854357  hypothetical protein
D364_RS07925      -1       21   -1.841353  hypothetical protein
D364_RS07930      -1       17    0.743439  site-specific integrase
D364_RS07935      -1       13    0.777601  AbgT family transporter
D364_RS07940      -1       12    0.858524  amidohydrolase
D364_RS07945      -1       14    3.675983  M20 family metallo-hydrolase
D364_RS07950      -1       15    4.601296  LysR family transcriptional regulator
D364_RS07955      -1       17    4.507307  DNA endonuclease SmrA
D364_RS07960       0       18    4.536083  helix-turn-helix domain-containing protein
D364_RS07965       1       18    6.086401  3-oxoacid CoA-transferase subunit A
D364_RS07970       0       17    6.333959  CoA transferase subunit B
D364_RS07975       0       17    6.030791  3-oxoadipyl-CoA thiolase
D364_RS07980       0       14    5.227547  3-carboxy-cis,cis-muconate cycloisomerase
D364_RS07985      -1       13    4.212324  3-oxoadipate enol-lactonase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS07720    IGASERPTASE    30    0.010    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.010
Identities = 14/75 (18%), Positives = 26/75 (34%), Gaps = 3/75 (4%)

Query: 172 AVREAVEAGTVTVTQARQLASLKPEEQREKVSEIEAATAGTTGHEKARRQRQILGEAKPR 231
V EA T + E +++ +E T E + R++ EAK
Sbjct: 1019 RVDEAPVPPPAPATPSET-TETVAENSKQESKTVEKNEQDAT--ETTAQNREVAKEAKSN 1075

Query: 232 LKTRKEIIKALESAE 246
+K + + +S
Sbjct: 1076 VKANTQTNEVAQSGS 1090


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS07840    cloacin    29    0.042    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 28.9 bits (64), Expect = 0.042
Identities = 26/118 (22%), Positives = 43/118 (36%), Gaps = 13/118 (11%)

Query: 37 WNAAKSELDALDERIAREEELRRQDQDYIHE---NEPEQRQQQNRDPANPEAQANERRA- 92
+N+ KSELDA ++ +A +Q + H+ Q + N ++A
Sbjct: 351 YNSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAA 410

Query: 93 --AAFNAFLRRGLGEMSAEERQALKELRAQGTTPDEKGGYTVPTQFRNKIVEALKDYG 148
AA SA E + KE + ++ +NK + KDYG
Sbjct: 411 FDAAAKEKSDADAALSSAMESRKKKEDK-------KRSAENNLNDEKNKPRKGFKDYG 461


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS07870    adhesinb    28    0.018    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 27.5 bits (61), Expect = 0.018
Identities = 15/43 (34%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 57 PEGQEPQEAPELTPSERAFRTMRADVTLFIDILLDTDLHPVFT 99
P GQ+P E E P + T +AD+ + I L+T + FT
Sbjct: 62 PVGQDPHEY-EPLPEDVKK-TSQADLIFYNGINLETGGNAWFT 102


28    D364_RS08060    D364_RS08160    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
D364_RS080600133.805560LysR family transcriptional regulator
D364_RS080651164.521967malonate decarboxylase subunit epsilon
D364_RS080701174.751598malonate decarboxylase holo-ACP synthase
D364_RS08075-1173.364305AEC family transporter
D364_RS08080-2142.785860biotin-independent malonate decarboxylase
D364_RS08085-2121.331193biotin-independent malonate decarboxylase
D364_RS080900100.848342malonate decarboxylase acyl carrier protein
D364_RS080950110.737512triphosphoribosyl-dephospho-CoA synthase
D364_RS08100-110-0.865037malonate decarboxylase subunit alpha
D364_RS08105420-2.675544AI-2E family transporter
D364_RS08110320-0.447161multidrug efflux SMR transporter subunit KpnE
D364_RS081151161.310831multidrug efflux SMR transporter subunit KpnF
D364_RS081201171.720964hypothetical protein
D364_RS08125-1151.174262serine protease
D364_RS081400131.010163acid resistance repetitive basic protein Asr
D364_RS081451112.030203carboxypeptidase M32
D364_RS081501112.228282MFS transporter
D364_RS081552112.208644LysR family transcriptional regulator
D364_RS081602111.942882ROK family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08125    V8PROTEASE    107    1e-29    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 107 bits (269), Expect = 1e-29
Identities = 27/234 (11%), Positives = 70/234 (29%), Gaps = 47/234 (20%)

Query: 26 LSAKDIKTLFFGHDDRKAVNRPEESPWDAIGQLET---ASGNLCTATLISPHLALTAGHC 82
L ++ + ++DR + + + ++ + + ++ LT H
Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120

Query: 83 LLTPPRGKPDKAVALRFI------SRKGNWVYE---IHGIDGRVDPSLGRRLKADGDGWI 133
+ AL+ N + I G D ++ + +
Sbjct: 121 V----DATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQN-- 173

Query: 134 VPSAAAPSDFGLIVLRYAPSGIAPIPLFPGSKADLTAALKAADRKVTQSGYPEDH-LDNL 192
++ + P + A ++ +T +GYP D + +
Sbjct: 174 ---------------KHIGEVVKPATM-------SNNAETQVNQNITVTGYPGDKPVATM 211

Query: 193 YSHQDCIVTGWAQTSVLSHQCDTLPGDSGSPLLLKTEDGWQVIAVQSSAPGPQD 246
+ + + + + + T G+SGSP+ + +VI + +
Sbjct: 212 WESK--GKITYLKGEAMQYDLSTTGGNSGSPVF---NEKNEVIGIHWGGVPNEF 260


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08140    IGASERPTASE    30    0.002    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.002
Identities = 21/125 (16%), Positives = 39/125 (31%), Gaps = 9/125 (7%)

Query: 16 SSAAFAADAVSTTQAPAATHSTAAKTTHHKKHHKA--AAKPAAEQKAQAAKKHKKAEAKP 73
A A + A A + A T ++ + + + A K+ +AK
Sbjct: 1055 EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114

Query: 74 AAAQKAQAAKKHKKVAAKPAAPQKAQAAKKHHKAAAKPAAQKAQAAKKHHKTTKHQAAKP 133
+ + K +V+ K + Q A+PA + ++
Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQ-------AEPARENDPTVNIKEPQSQTNTTAD 1167

Query: 134 TAQPA 138
T QPA
Sbjct: 1168 TEQPA 1172



Score = 29.6 bits (66), Expect = 0.005
Identities = 16/117 (13%), Positives = 31/117 (26%), Gaps = 7/117 (5%)

Query: 23 DAVSTTQAPAATHSTAAKTTHHKKHHKAAAKPAAEQKAQAAKKHKKAEAKPAAAQKAQAA 82
V TT + A + + A A A P+ + A
Sbjct: 990 QTVDTTNITTPNNIQADVP-------SVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042

Query: 83 KKHKKVAAKPAAPQKAQAAKKHHKAAAKPAAQKAQAAKKHHKTTKHQAAKPTAQPAA 139
++ Q A ++ AK A +A + ++ + + Q
Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08150    TCRTETA    54    5e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 53.7 bits (129), Expect = 5e-10
Identities = 62/295 (21%), Positives = 108/295 (36%), Gaps = 15/295 (5%)

Query: 55 VQPILPVLSNEFGVSPASSS---ISLSISTAMLAVGLLFTGPLSDAIGRKPVMVTALLLA 111
+ P+LP L + S ++ I L++ M G LSD GR+PV++ +L A
Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83

Query: 112 ACCSLLSTMMTSWHGILIMRALIGLSLSGVAAVGMTYLSEEIHPSFVAFSMGLYISGNSI 171
A + + I R + G++ AV Y+++ A G +
Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142

Query: 172 GGMSGRLLTGVFTDFFGWRVALAAISGFALAAAIMFWRILPES--RHFRPTSLRPKTLLI 229
G ++G +L G+ F A + + +LPES RP L
Sbjct: 143 GMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLA 201

Query: 230 NFRLHWRDRGLPLLFVEGFLLM---GAFVTLFN-YIGYRLMMSPWSLSQAVVGLLSVAYL 285
+FR + L F++ L+ + R ++ ++ + L
Sbjct: 202 SFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSL 261

Query: 286 TGTWSSPKAGAMTVRFG-RGPVMLGFTAVMLCGLLLTLFSSLWLIFIGMLLFSAG 339
G + R G R +MLG A +LL + W+ F M+L ++G
Sbjct: 262 AQAMI---TGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313


29    D364_RS08250    D364_RS08830    Y    Y    N    Pathogenicity Island (biased-composition)
LocusTag    DNBias    CDNBias    %GCBias    Product
D364_RS08250145-9.156031nuclear transport factor 2 family protein
D364_RS25975246-9.310162helix-turn-helix transcriptional regulator
D364_RS08260341-10.875203SOS response-associated peptidase
D364_RS27400253-16.851609hypothetical protein
D364_RS26920249-14.470128small membrane protein
D364_RS08275239-9.320946LuxR family transcriptional regulator
D364_RS27405123-3.272866hypothetical protein
D364_RS27410121-3.134932hypothetical protein
D364_RS25980124-4.525624hypothetical protein
D364_RS08290025-3.858608Hsp20 family protein
D364_RS08295-120-1.7691972-oxo-tetronate isomerase
D364_RS08300-121-2.127997diguanylate phosphodiesterase
D364_RS26925-124-2.095769small membrane protein
D364_RS08305-2170.362286helix-turn-helix transcriptional regulator
D364_RS08310-2152.336902GntP family transporter
D364_RS083150153.271824HPr family phosphocarrier protein
D364_RS083200153.924675aldolase
D364_RS083250164.213630four-carbon acid sugar kinase family protein
D364_RS08330-1143.572281NAD(P)-dependent oxidoreductase
D364_RS08335-1131.219111DeoR/GlpR family DNA-binding transcription
D364_RS08340-2112.419984class A broad-spectrum beta-lactamase SHV-1
D364_RS08345-2123.275608hypothetical protein
D364_RS08350-2132.871519AAA family ATPase
D364_RS08355-1132.923841MFS transporter
D364_RS083600143.764033beta-galactosidase
D364_RS083652165.100653LacI family DNA-binding transcriptional
D364_RS083701164.450355GNAT family N-acetyltransferase
D364_RS08375-1152.155407DUF4177 domain-containing protein
D364_RS08380-1133.542862transcriptional regulator FtrA
D364_RS083850133.956261Rhodanese-related sulfurtransferase
D364_RS083900143.527403cobalamin-independent methionine synthase II
D364_RS083951153.618941urea ABC transporter substrate-binding protein
D364_RS084053174.452213urea ABC transporter permease subunit UrtB
D364_RS084104173.657411urea ABC transporter permease subunit UrtC
D364_RS084154172.776284urea ABC transporter ATP-binding protein UrtD
D364_RS084203161.654025urea ABC transporter ATP-binding subunit UrtE
D364_RS084251130.286349helix-turn-helix domain-containing protein
D364_RS08430114-1.181486efflux MFS transporter YdeE
D364_RS08435115-3.260339O-acetylserine/cysteine exporter
D364_RS08440014-3.018075multiple antibiotic resistance protein MarB
D364_RS08445-212-1.724582MDR efflux pump AcrAB transcriptional activator
D364_RS08450118-3.978604multiple antibiotic resistance transcriptional
D364_RS27415124-5.063920hypothetical protein
D364_RS08460019-3.540981MarC family NAAT transporter
D364_RS08465-117-2.228593sugar transporter
D364_RS08470-118-1.113875PhzF family phenazine biosynthesis protein
D364_RS08475-117-0.788934MFS transporter
D364_RS08480-2141.904373phosphoglycerate dehydrogenase
D364_RS084850154.279343dihydrodipicolinate synthase family protein
D364_RS084902154.506651iron-containing alcohol dehydrogenase
D364_RS084951174.732858four-carbon acid sugar kinase family protein
D364_RS085000174.287870D-threonate 4-phosphate dehydrogenase
D364_RS085051184.136751DeoR/GlpR family DNA-binding transcription
D364_RS085101184.385227L-lactate dehydrogenase
D364_RS08515-1132.642438GNAT family N-acetyltransferase
D364_RS08520-1122.415989LysR family transcriptional regulator
D364_RS08525-1111.466396succinate-semialdehyde dehydrogenase
D364_RS08530-211-0.011779glutaminase B
D364_RS08535-313-0.629954DUF4186 domain-containing protein
D364_RS08540-315-2.347529sensor domain-containing diguanylate cyclase
D364_RS08545-123-4.717534GNAT family N-acetyltransferase
D364_RS08550026-5.317167tagaturonate reductase
D364_RS08555036-7.847192GNAT family N-acetyltransferase
D364_RS08560-130-5.164692hypothetical protein
D364_RS08565-129-5.589426tautomerase family protein
D364_RS26935-219-1.518755carboxymuconolactone decarboxylase family
D364_RS08575-2151.225697trans-aconitate 2-methyltransferase
D364_RS085802141.888908hypothetical protein
D364_RS085852142.452314MFS transporter
D364_RS085902133.151976prolyl endopeptidase
D364_RS085952123.663073YbfB/YjiJ family MFS transporter
D364_RS086003134.014713LysR family transcriptional regulator
D364_RS086051133.353863nitronate monooxygenase
D364_RS086100123.607300ABC transporter permease
D364_RS08615-1122.965341ATP-binding cassette domain-containing protein
D364_RS08620-1131.794871MetQ/NlpA family ABC transporter
D364_RS08625-1142.244711isopenicillin N synthase family oxygenase
D364_RS08630-1143.206959VF530 family protein
D364_RS08635-1154.792466cupin
D364_RS086400175.824497VOC family protein
D364_RS086450175.799490AraC family transcriptional regulator
D364_RS086501186.166239MFS transporter
D364_RS086601155.377221molybdenum cofactor-independent xanthine
D364_RS086652175.255761molybdenum cofactor-independent xanthine
D364_RS086751174.447170LysR family hpxDE operon transcriptional
D364_RS08680-1153.671486FAD-dependent urate hydroxylase HpxO
D364_RS08685-1173.038358purine permease
D364_RS08690-2153.4190502-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline
D364_RS086950132.922977hydroxyisourate hydrolase
D364_RS08700-1132.234063type II toxin-antitoxin system antitoxin HipB
D364_RS08705-1121.782778type II toxin-antitoxin system HipA family
D364_RS08710-1131.423550glycoside hydrolase family 10 protein
D364_RS08715-1150.502365type 1 fimbrial protein
D364_RS08720-115-0.640915fimbrial biogenesis outer membrane usher
D364_RS08725-120-4.439332molecular chaperone
D364_RS08730-118-4.865376type 1 fimbrial protein
D364_RS08735-216-3.187718hypothetical protein
D364_RS08740-217-3.444409type I methionyl aminopeptidase
D364_RS08745-319-4.122492ParD-like family protein
D364_RS08750-219-4.942463hypothetical protein
D364_RS08755-315-4.487291EAL domain-containing protein
D364_RS08760-214-3.326477siderophore esterase IroE
D364_RS27420-212-3.838299hypothetical protein
D364_RS08770-212-3.641058hypothetical protein
D364_RS08780-212-2.415572PTS sugar transporter subunit IIA
D364_RS08785-212-0.422284sugar porter family MFS transporter
D364_RS08795-1161.1906996-phospho-alpha-glucosidase
D364_RS088000150.268074PTS transporter subunit EIIC
D364_RS088052161.873885GntR family transcriptional regulator
D364_RS088101151.876537sulfite exporter TauE/SafE family protein
D364_RS08815-1141.719960LysR family transcriptional regulator
D364_RS088200130.657480NAD(P)-dependent oxidoreductase
D364_RS088251130.708919helix-turn-helix domain-containing protein
D364_RS088302131.692326glycerate kinase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08345    BLACTAMASEA    438    e-159    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 438 bits (1127), Expect = e-159
Identities = 286/286 (100%), Positives = 286/286 (100%)

Query: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60
MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE
Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60

Query: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120
RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA
Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120

Query: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180
AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA
Sbjct: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180

Query: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240
SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG
Sbjct: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240

Query: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR 286
IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR
Sbjct: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR 286


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08360    TCRTETA    41    5e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.3 bits (97), Expect = 5e-06
Identities = 64/288 (22%), Positives = 111/288 (38%), Gaps = 39/288 (13%)

Query: 33 PFFPVWLADVNHLTK--TETGIVFSSISLFAIIFQPVFGLMSDKLGLRKHLLWTITVLLI 90
P P L D+ H GI+ + +L PV G +SD+ G R LL V L
Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLA 81

Query: 91 LFA-PFFIFVFSPLLQMNIIAGSLVGGIYLGIVFSSGSGAVEAYIERVSRANRFEYGKVR 149
A + I +P L + + G +V GI G + + + RA F +
Sbjct: 82 GAAVDYAIMATAPFLWV-LYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGF---- 135

Query: 150 VAGCVGWALCAS--ITGVLFGIDPNITFWIASGFALVLGLLLWLSRPESSNS------AQ 201
++ C G+ + A + G++ G P+ F+ A+ + L PES +
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRRE 195

Query: 202 VIEALGANRQAFSLRTAAELLRMPRFWGFIVYVVG--VASVYDVFDQQFANFFKSFFASP 259
+ L + R A + A L+ FI+ +VG A+++ +F + ++
Sbjct: 196 ALNPLASFRWARGMTVVAALM----AVFFIMQLVGQVPAALWVIFGEDRFHW-------- 243

Query: 260 QRGTEVFGFVTTGGELLNALI-MFCAPAIVNRIGAKNALLTAGMIMSV 306
G +L++L + R+G + AL+ GMI
Sbjct: 244 --DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMIADG 288


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08415    PF05211    28    0.039    Neuraminyllactose-binding hemagglutinin
		>PF05211#Neuraminyllactose-binding hemagglutinin

Length = 260

Score = 27.7 bits (61), Expect = 0.039
Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 7/55 (12%)

Query: 43 LSLAIGVGELRCVIGPNGAGKTTLMDVITGKTRPQSGKALYDQSVDLTTLDPVAI 97
L + G+ ++ V+ P G K T+++ P SG++L ++DL+ LD
Sbjct: 145 LLFSTGLDKMEGVLIPAGFVKVTILE-------PMSGESLDSFTMDLSELDIQEK 192


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08425    TYPE3OMBPROT    28    0.021    Type III secretion system outer membrane B protein ...
		>TYPE3OMBPROT#Type III secretion system outer membrane B protein

family signature.
Length = 538

Score = 27.7 bits (61), Expect = 0.021
Identities = 14/32 (43%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 19 EQLAEMAGLSVRTIQRIENGER-PGLETLSAL 49
E+ + G+SV + QR++NGER G+E L+ L
Sbjct: 23 EETGKHKGVSVISYQRVKNGERNKGIEALNRL 54


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08430    TCRTETA    46    2e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 46.0 bits (109), Expect = 2e-07
Identities = 46/237 (19%), Positives = 89/237 (37%), Gaps = 14/237 (5%)

Query: 7 RSTIALLASSLLLTIGRGATLPFMTIYLTRRFQLEVDVI---GYALSLALVVGVLFSMGF 63
R I +L++ L +G G +P + L R DV G L+L ++ +
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63

Query: 64 GILADRFDKKRYMVWSVLVFILGFSAIPLVNNAPLVVIFFA--LINCAYSVFSTVLKAWF 121
G L+DRF ++ ++ S+ + ++ ++ AP + + + ++ V A+
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYA---IMATAPFLWVLYIGRIVAGITGATGAVAGAYI 120

Query: 122 ADRLTAEKKARIFSLNYTILNIGWTVGPPIGTLLVMHSINLPFWLAAACAAFPLVFIQLF 181
AD +++AR F G GP +G L+ S + PF+ AAA +
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 182 L----QRDGAAAAQPGAAPWSPSVLLRD-RALLWFTCSGLLASFVGGAFASCLSQYV 233
L + + + P + R + + VG A+ +
Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG 237



Score = 39.4 bits (92), Expect = 2e-05
Identities = 26/158 (16%), Positives = 62/158 (39%), Gaps = 2/158 (1%)

Query: 4 TLRRSTIALLASSLLLTIGRGATLPFMTIYLTRRFQLEVDVIGYALSLALVVGVLF-SMG 62
AL+A ++ + I+ RF + IG +L+ ++ L +M
Sbjct: 207 RGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMI 266

Query: 63 FGILADRFDKKRYMVWSVLVFILGFSAIPLVNNAPLVVIFFALINCAYSVFSTVLKAWFA 122
G +A R ++R ++ ++ G+ + + L+ + L+A +
Sbjct: 267 TGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLS 325

Query: 123 DRLTAEKKARIFSLNYTILNIGWTVGPPIGTLLVMHSI 160
++ E++ ++ + ++ VGP + T + SI
Sbjct: 326 RQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08465    TCRTETB    59    2e-11    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 58.7 bits (142), Expect = 2e-11
Identities = 39/156 (25%), Positives = 69/156 (44%), Gaps = 2/156 (1%)

Query: 36 LSDIADSFGMETAQVGMMLTIYAWVVALMSLPFMLLTSKVERRRLLIGLFILFIASHVLS 95
L DIA+ F A + T + ++ + + L+ ++ +RLL+ I+ V+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FFAWN-FDVLVISRIGIAFAHAVFWSITSALAIRMAPPGKRAQALSLIATGTALAMVFGI 154
F + F +L+++R A F ++ + R P R +A LI + A+ G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 155 PIGRIIGQYFGWRMTFLAIGLGALATLACLVKLLPP 190
IG +I Y W L I + + T+ L+KLL
Sbjct: 157 AIGGMIAHYIHWSYLLL-IPMITIITVPFLMKLLKK 191


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08475    TCRTETB    60    5e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 60.3 bits (146), Expect = 5e-12
Identities = 81/422 (19%), Positives = 147/422 (34%), Gaps = 45/422 (10%)

Query: 1 MNTTANTTRIRWWIAGLMWLAIAINY--IDRTVLSAAAPHLIDELKLDPEMMGFIMAAFF 58
MNT+ + + +R L+WL I + ++ VL+ + P + ++ P ++ AF
Sbjct: 1 MNTSYSQSNLRHNQI-LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFM 59

Query: 59 WSYSLLQIPAGWFADRFGQKKGLGLAVAWWSIATSMMGVATGFKSLLAL-RLALGVGEAA 117
++S+ G +D+ G K+ L + + + V F SLL + R G G AA
Sbjct: 60 LTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAA 119

Query: 118 AYPSNAGIAARWFPDKERATVSGLFDSASKFGGAIAMPLIVWMI-YTFDWRLTFLIIGSV 176
+ AR+ P + R GL S G + P I MI + W LI
Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG-PAIGGMIAHYIHWSYLLLIP--- 175

Query: 177 GILWVIAWYFIYAENPEEHKRISPSE---------------------------VRIIRDG 209
++ +I F+ +E + + V ++
Sbjct: 176 -MITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234

Query: 210 QKQHHGDKTVLPMKWYKLLRYRNIWAMCIGFFTINYTSYFFITWLPTYLVKEKGMDFIKM 269
H K P L + + I T F++ +P + + ++
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 270 GMVAALPLLCGMVIEVFAGWASDRLVHKKVLSLTAT-RKLFLTIGLLMALCIGFAPFTDS 328
G V P G + + G+ LV ++ FL++ L A F T S
Sbjct: 295 GSVIIFP---GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTA---SFLLETTS 348

Query: 329 VFMTVFLLCVAKSGTTVAASQVWALPGDVAPKNSVSIVAGLQNTVSNMGGAVGPIITGAI 388
FMT+ ++ V + + + + L N S + G I G +
Sbjct: 349 WFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQ-EAGAGMSLLNFTSFLSEGTGIAIVGGL 407

Query: 389 VA 390
++
Sbjct: 408 LS 409


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08515    SACTRNSFRASE    36    3e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.7 bits (82), Expect = 3e-05
Identities = 13/73 (17%), Positives = 25/73 (34%), Gaps = 1/73 (1%)

Query: 80 IDPQHRGQQLGEKLLAALEAKSRQRDCHTLRLETGIHQHAAIALYTRNGYQTRCAFAPYQ 139
+ +R + +G LL +++ L LET +A Y ++ + A
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF-IIGAVDTML 155

Query: 140 PDPLSVFMEKPLF 152
E +F
Sbjct: 156 YSNFPTANEIAIF 168


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08530    BLACTAMASEA    35    3e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.8 bits (80), Expect = 3e-04
Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 3/71 (4%)

Query: 22 GRGKVADYIPALASVSGDKLGI-AISTVDGQHFAAGDAHERFSIQSISKVL--SLVVAMN 78
+ + I S ++G+ + G+ A A ERF + S KV+ V+A
Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80

Query: 79 HYQEEEIWQRV 89
+E++ +++
Sbjct: 81 DAGDEQLERKI 91


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08545    SACTRNSFRASE    32    7e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.8 bits (72), Expect = 7e-04
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 4/86 (4%)

Query: 92 RADVAKLLVHQNVRRQGIAQALMSELERIARRERKTVLVLDTAT-GSGAEQFYARCGWEK 150
A + + V ++ R++G+ AL+ + A+ L+L+T A FYA+ +
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF-I 147

Query: 151 VGEIPR--YALMPDGEMTATSLFYKF 174
+G + Y+ P A +YKF
Sbjct: 148 IGAVDTMLYSNFPTANEIAIFWYYKF 173


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08555    SACTRNSFRASE    31    6e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 31.5 bits (71), Expect = 6e-04
Identities = 11/51 (21%), Positives = 21/51 (41%), Gaps = 1/51 (1%)

Query: 81 YLEDLFVDPAFRGQGIARTMIKSLQSEGADKGWSRLYWHTRRDN-PARHLY 130
+ED+ V +R +G+ ++ + + L T+ N A H Y
Sbjct: 91 LIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08595    TCRTETA    41    6e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.0 bits (96), Expect = 6e-06
Identities = 81/397 (20%), Positives = 143/397 (36%), Gaps = 32/397 (8%)

Query: 11 NLRIISIVVFTCICYLSIGLPLAVLPGYIHYQLGYSTFVA---GIVISLQYISTLVSRPH 67
N +I I+ + + IGL + VLPG + L +S V GI+++L + P
Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLLR-DLVHSNDVTAHYGILLALYALMQFACAPV 62

Query: 68 AGRYTDIWGPKKVVSLGIVCCLLSGAFTLLAVVLQATPMLAIAALLAGRVFLGV-GESFT 126
G +D +G + V+ + + + ++ P L + L GR+ G+ G +
Sbjct: 63 LGALSDRFGRRPVLLVSLA------GAAVDYAIMATAPFLWV--LYIGRIVAGITGATGA 114

Query: 127 ATGATLWGIKTVGAIHTSRVISWNGVATYVAMAVGAPLGVTLNHYFGISGF--ATVVVLV 184
GA + I +R + M G LG + + + F A + +
Sbjct: 115 VAGAYIADITDGDE--RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172

Query: 185 AAIGLLF-------ARTRQDVKVTAGARAPFH-AVVRKIWPYGLGLAFGTVGFGVIATFI 236
+ F R + A F A + + + F G + +
Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 237 TLYFAAHSWQ----GAAFTLSLFSVGFICVRLVLGNTIT-RFGGVPVSLACFIIESLGLL 291
+ F + +L+ F + + ++ + R G + I + G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 292 LIWLAPSAWMAGVGAFLTGSGFSLVFPALGVEAVKQVEEQNQGTALGTYSAFLDLALGLT 351
L+ A WMA L SG + PAL +QV+E+ QG G+ +A L +
Sbjct: 293 LLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-SIV 350

Query: 352 GPLAGWVAGFYDLATLYLLAAIVVALAFLLIFRVHRQ 388
GPL + T A I A +LL R+
Sbjct: 351 GPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08605    TCRTETA    31    0.006    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.006
Identities = 31/128 (24%), Positives = 50/128 (39%), Gaps = 4/128 (3%)

Query: 246 VHLWALFGLAAAPSCLIWHKLVLKWGYRQALTRNLLVQALGVILPACSASLLFCVLSALL 305
+ L+AL A AP + L ++G R L +L A+ + A + L + ++
Sbjct: 49 LALYALMQFACAP---VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 306 VGFTFMGTVTIALPKAKSLSHQVSFNMIAAMTALYGVGQIAGPLIAGALYQIAASFNPAL 365
G T A M+A +G G +AGP++ G + + P
Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHA-PFF 164

Query: 366 YAAALALL 373
AAAL L
Sbjct: 165 AAAALNGL 172


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08640    MICOLLPTASE    26    0.025    Microbial collagenase metalloprotease (M9) signature.
		>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature.

Length = 1104

Score = 25.8 bits (56), Expect = 0.025
Identities = 10/29 (34%), Positives = 19/29 (65%)

Query: 52 RRTPWARKEVEAMYLASLDDDAPVEKADP 80
+R WA KEV+A ++ + +D +E+ +P
Sbjct: 415 KRLYWASKEVKAQFMRVVQNDKALEEGNP 443


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08660    TCRTETB    44    1e-06    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 43.7 bits (103), Expect = 1e-06
Identities = 31/157 (19%), Positives = 59/157 (37%), Gaps = 2/157 (1%)

Query: 26 LGVFGLIVAEFLPASLLTPMASSLGVSEGMAGQAVTATALVALVTGLLIATATRNIDRRW 85
L F ++ L SL +A+ TA L + + + + +
Sbjct: 22 LSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 86 VLMFFSVLQIVSSLMVAFADSLAFLL-LGRLLLGIAIGGFWAMSTATAMRLVPAAHVPKA 144
+L+F ++ S++ S LL + R + G F A+ R +P + KA
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140

Query: 145 LAIIFSAVSVATVVAAPLGSYLGELIGWRNVFILCAI 181
+I S V++ V +G + I W + ++ I
Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI 177


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08665    PF06057    28    0.033    Type IV secretory pathway VirJ component
		>PF06057#Type IV secretory pathway VirJ component

Length = 243

Score = 28.3 bits (63), Expect = 0.033
Identities = 12/44 (27%), Positives = 17/44 (38%), Gaps = 2/44 (4%)

Query: 223 PPAPTAASAADGTFTITLTSTGERWPVPGDKTIAQVLQEHGVAV 266
A+S I L+ G W DK + +LQ+ G V
Sbjct: 40 TQVNAASSHTKPPLVIFLSGDGG-W-ATLDKAVGGILQQQGWPV 81


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08720    PF00577    679    0.0    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 679 bits (1753), Expect = 0.0
Identities = 309/861 (35%), Positives = 449/861 (52%), Gaps = 52/861 (6%)

Query: 12 VSLSILLGGQSALLHAQAT--FNMDLLEKNDHLPAVDLQRFNQQAGQPPGAYPVSWQVNG 69
V L + + + A FN L + DL RF PPG Y V +N
Sbjct: 28 VRLFVACAFAAQAPLSSAELYFNPRFLADDP-QAVADLSRFENGQELPPGTYRVDIYLNN 86

Query: 70 VTLDARKTVTFRQND-RGQLTPCLKPEDLLQAGVNPAVLSQATGATSRSCPELNALLPGS 128
+ + VTF D + PCL L G+N A +S +C L +++ +
Sbjct: 87 GYMA-TRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDA 145

Query: 129 TVNFDFAHQRLVMTIPQTLMTHRARDNVPSALWDEGISAFQSNYRYSGASQRTREGSTER 188
T D QRL +TIPQ M++RAR +P LWD GI+A NY +SG S + R G
Sbjct: 146 TAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSH 205

Query: 189 DNYLMLKSGVNVGAWRLRASNSLTAN-----SDDKPQWTTSGAWLERDLTRWQSELTLGD 243
YL L+SG+N+GAWRLR + + + N S K +W WLERD+ +S LTLGD
Sbjct: 206 YAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGD 265

Query: 244 TFTSGDVFDAVQFQGISLASSDAMLPDSQKGFAPTIRGIARTNAQVTVRQNGYVLYQTYV 303
+T GD+FD + F+G LAS D MLPDSQ+GFAP I GIAR AQVT++QNGY +Y + V
Sbjct: 266 GYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTV 325

Query: 304 TPGAFVIDDLYPTASSGNLEVAVKESDGEIRRFTQPYASVTSMQREGSLKYNLVAGRYHS 363
PG F I+D+Y +SG+L+V +KE+DG + FT PY+SV +QREG +Y++ AG Y S
Sbjct: 326 PPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRS 385

Query: 364 DDASQR-PLMMQLSLMRGFAHNLTLFGGLQSAAQYHNLSLGAGQGLGEAGALSLQLLNAR 422
+A Q P Q +L+ G T++GG Q A +Y + G G+ +G GALS+ + A
Sbjct: 386 GNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQAN 445

Query: 423 DR-HQQDPIDGRAWQLQYSKGFDRLGTQFTFTGWRYSHQRYATLSEAFSSPGSDDDLQDS 481
DG++ + Y+K + GT G+RYS Y ++ S + +++
Sbjct: 446 STLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQ 505

Query: 482 D-----------------NKKATLQITASQSLPYDITLYLSLDQDSYWSGGASQRTANMG 524
D NK+ LQ+T +Q L TLYLS +YW G
Sbjct: 506 DGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAG 565

Query: 525 ISSQVHGIAWSLSYSDSRSSHGDEEDDEPHSDKVVTLSLSVPLSHLLPG--------SYA 576
+++ I W+LSYS ++++ D+++ L++++P SH L + A
Sbjct: 566 LNTAFEDINWTLSYSLTKNAWQKG------RDQMLALNVNIPFSHWLRSDSKSQWRHASA 619

Query: 577 GYTLTSSRHSVGSQMVSLNGTLLDNHALSYAVSQTRDRQ----NGSSGSLTAGYSSGRGD 632
Y+++ + + + + GTLL+++ LSY+V +GS+G T Y G G+
Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679

Query: 633 LNLGYSHDSQAARLNYGASGGILIHRHGVVFTPEMNGAVVLIDAGGAGGVTLANQKTIAT 692
N+GYSH +L YG SGG+L H +GV +N VVL+ A GA + NQ + T
Sbjct: 680 ANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRT 739

Query: 693 NGDGYAVLPFATAYHRNDVSLDSHSLPENVDLANSTVTLVPTKDAVVLARFHTHFGYKAL 752
+ GYAVLP+AT Y N V+LD+++L +NVDL N+ +VPT+ A+V A F G K L
Sbjct: 740 DWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLL 799

Query: 753 FTLQSRGQPLPFGSEVRAKDTNS--IVASEGQVYLAGLAPKGTLYAQWGPGPQQRCSARY 810
TL +PLPFG+ V ++ + S IVA GQVYL+G+ G + +WG C A Y
Sbjct: 800 MTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859

Query: 811 DLTPTLAQTPHPLILQQTLSC 831
L P Q + Q + C
Sbjct: 860 QLPPESQQQL---LTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score    E-value    Comments
D364_RS08730    FIMBRIALPAPE    32    9e-04    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein

signature.
Length = 173

Score = 31.9 bits (72), Expect = 9e-04
Identities = 39/168 (23%), Positives = 75/168 (44%), Gaps = 22/168 (13%)

Query: 24 ARAAGTLNFTGKIINESCQIANNGGDVNVDFGNVDMSALKSHEAKTAETPFTINLTGCPL 83
AA L F GK+I +C + N V++G++++ L ++ + FT+++ CP
Sbjct: 22 VHAADNLTFKGKLIIPACTVQN----AEVNWGDIEIQNLV--QSGGNQKDFTVDMN-CPY 74

Query: 84 AQNISISLEGTPDTNANGTSAAVLALSDAADTAKGVGIEVFSSPDGS-----TEGTQLTF 138
S+ T+ T ++L + + + G+ I +++S + T G+Q+T
Sbjct: 75 ----SLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQVTP 130

Query: 139 DKQSKTAVSQADENGDIAFNFIADLKSDSSQDVTAGNINATANIDIVY 186
K + TA ++ I K + Q + AG +ATA + Y
Sbjct: 131 GKITGTAPAR-----KITLYAKLGYKGN-MQSLQAGTFSATATLVASY 172


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS08735  FIMBRIALPAPE  27  0.015  Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 26.5 bits (58), Expect = 0.015
Identities = 12/35 (34%), Positives = 18/35 (51%), Gaps = 6/35 (17%)

Query: 11 LCLAPLASSAALSGQVH------FSGRVINPACVI 39
LCL + + +S VH F G++I PAC +
Sbjct: 7 LCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTV 41


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS08785  TCRTETA  40  2e-05  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 2e-05
Identities = 26/118 (22%), Positives = 48/118 (40%), Gaps = 1/118 (0%)

Query: 55 GLVMSVLLVGAALGSVFGGKFADYFGRRKYLLFLSFVFLIGALLSAAAPDITTLLIARAL 114
G+++++ + + G +D FGRR LL + + A AP + L I R +
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 115 LGYAVGGASVTAPTFISEVAPTEMRGKLTGLNEVAIVIGQLAAFAINAIIGIIWGHLP 172
G G A +I+++ + R + G G +A + ++G H P
Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP 162



Score = 32.5 bits (74), Expect = 0.004
Identities = 32/137 (23%), Positives = 51/137 (37%), Gaps = 22/137 (16%)

Query: 321 LVDRFKRKTIIIYGFAIMATLHLIIAAVDYTLVGDLKATAIWLLGALFVGVMQGSMGFIT 380
L DRF R+ +++ L AAVDY ++ + +G + G + G+ G +
Sbjct: 66 LSDRFGRRPVLLVS--------LAGAAVDYAIMATAPFLWVLYIGRIVAG-ITGATGAVA 116

Query: 381 WVVLAELFPLKFRGLSMGISVFFMWIMNAVVSYLFPL------LQAKLGLGPVFFIFAAI 434
+A++ R G M+A + L FF AA+
Sbjct: 117 GAYIADITDGDERARHFGF-------MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAAL 169

Query: 435 NYLAILFVVFALPETSN 451
N L L F LPE+
Sbjct: 170 NGLNFLTGCFLLPESHK 186



Score = 30.2 bits (68), Expect = 0.018
Identities = 30/152 (19%), Positives = 51/152 (33%), Gaps = 8/152 (5%)

Query: 48 ALTPTTEGLVMSVL-LVGAALGSVFGGKFADYFGRRKYLLFLSFVFLIGALLSAAAPDIT 106
TT G+ ++ ++ + ++ G A G R+ L+ G +L A A
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 107 TLLIARALLGYAVGGASVTAPTFISEVAPTEMRGKLTGLN----EVAIVIGQLAAFAINA 162
LL + G +S E +G+L G + ++G L AI A
Sbjct: 302 MAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360

Query: 163 IIGIIWGHLPDVWRYMLLVQAIPAICLFVGMW 194
W W + + L G+W
Sbjct: 361 ASITTWNGW--AWIAGAALYLLCLPALRRGLW 390


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS08825  DNABINDNGFIS  29  0.012  DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 28.8 bits (64), Expect = 0.012
Identities = 13/50 (26%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 296 INNVRQLLEHDSGEVLLDTLSSFIANNAEPGKTSLLLGIHRNTLTYRLQQ 345
+N++ +L+ + + LLD + + N + +L++GI+R TL +L++
Sbjct: 47 VNDLYELVLAEVEQPLLDMVMQYTRGNQT--RAALMMGINRGTLRKKLKK 94


30  D364_RS08895  D364_RS09010  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS08895  -1      29       -4.547487   deoxyribose-phosphate aldolase
D364_RS08900  -2      32       -5.421400   pentose kinase
D364_RS08905  -1      30       -6.281648   DeoR/GlpR family DNA-binding transcription
D364_RS08910  -1      32       -6.335930   AraC family transcriptional regulator
D364_RS08915  -1      29       -6.734178   hypothetical protein
D364_RS08920  -1      31       -6.659032   alpha/beta hydrolase
D364_RS08925  1       30       -5.648908   DUF2255 family protein
D364_RS08930  0       31       -5.361254   helix-turn-helix domain-containing protein
D364_RS08935  2       34       -5.967120   type I glyceraldehyde-3-phosphate dehydrogenase
D364_RS27425  3       30       -4.066723   hypothetical protein
D364_RS08950  2       30       -3.309473   hypothetical protein
D364_RS08955  1       31       -3.138239   aldo/keto reductase
D364_RS08960  0       32       -4.066791   SMP-30/gluconolactonase/LRE family protein
D364_RS08965  -1      30       -3.663124   aldo/keto reductase
D364_RS08970  -2      26       -3.343149   aldo/keto reductase
D364_RS08975  -1      27       -3.642980   LysR family transcriptional regulator
D364_RS08980  -2      23       -4.033406   GrpB family protein
D364_RS08985  -2      23       -4.392931   sigma-54 dependent transcriptional regulator
D364_RS08990  -2      21       -3.329958   two-component system sensor histidine kinase
D364_RS08995  -2      21       -3.158961   phosphoglycerate transport regulator PgtC
D364_RS09000  -2      20       -3.536735   phosphoglycerate transporter PgtP
D364_RS09005  0       20       -3.470355   peptidoglycan-binding protein LysM
D364_RS09010  2       23       -3.590728   alpha/beta fold hydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS08990  HTHFIS  246  3e-79  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 246 bits (630), Expect = 3e-79
Identities = 114/474 (24%), Positives = 195/474 (41%), Gaps = 73/474 (15%)

Query: 7 SILLIDDDADVLDAYTQLLEQAGYHVSACNNPFDAREQVPKDWPGIVLSDVCMPGCSGID 66
+IL+ DDDA + Q L +AGY V +N + +V++DV MP + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 LMTLFHQDDDLLPILLITGHGDVPMAVEAVKKGAWDFLQKPIDPGKLLTLVDAALRQRQS 126
L+ + LP+L+++ A++A +KGA+D+L KP D +L+ ++ AL + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 127 VIARRQYCQQKLQVELIGRSQWTVRYRQRLQQLAETDIAVWLYGEPGTGRMTGARYLHQL 186
++ + Q L+GRS + L +L +TD+ + + GE GTG+ AR LH
Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 187 GRHAEGPFIA--CELTPAN----------------AHTLNE-LIAQAQGGTLVLSHPEHL 227
G+ GPF+A P + A T + QA+GGTL L +
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 228 THEQQHQLVQ-LQSHEKRP----------FRLISIGSASLVELAASSQIVAELYYCFAMT 276
+ Q +L++ LQ E R+++ + L + +LYY +
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 277 QIGCQPLSKRPDDIEPLFHHYLQKTCQRLNHPVPEVDAGLLKGMMRRVWPNNVRELANAA 336
+ PL R +DI L H++Q+ + V D L+ M WP NVREL N
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362

Query: 337 ELFAV--------------------------------GVLPLAETVNPLMH--------- 355
G L +++ V M
Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422

Query: 356 IGEPTPLDQRVEDVERQIITEALNIHQGRINEVAEYLLIPRKKLYLRMKKYGLN 409
+ D+ + ++E +I AL +G + A+ L + R L ++++ G++
Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09000  FLGMOTORFLIM  31  0.010  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 30.6 bits (69), Expect = 0.010
Identities = 5/35 (14%), Positives = 15/35 (42%), Gaps = 4/35 (11%)

Query: 312 QRLVQRMFDTAISFRLAQLKDAWRALHSAEVRLKR 346
+++ + LA ++++W + RL +
Sbjct: 150 NSVMEGVIVRI----LANVRESWTQVIDLRPRLGQ 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09005  TCRTETA  31  0.009  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.009
Identities = 65/387 (16%), Positives = 126/387 (32%), Gaps = 39/387 (10%)

Query: 52 TPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 108
P L L S G+L + + V+ +L+D+ + + L A+
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 109 VGLGFSTAFWVFAALVVLNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 168
+ + WV ++ G+ G IA+ ER R F +S G G+
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144

Query: 169 VA-PIVGAAFAILGTEHWQSASYIVPACVAVVFAISVLVLGKGSPREEGLPSLAEMMPEE 227
VA P++G P A + G ++PE
Sbjct: 145 VAGPVLGGLMGGFSPH--------APFFAAAALNGLNFLTG------------CFLLPE- 183

Query: 228 KVVLKTKHGQKAPENMSAFQIFCTYVLRNKNAWYVSFVDVFVYMVRFGMISWLPIYLLTV 287
+ G++ P A ++ + + VF M G + +
Sbjct: 184 -----SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 288 KHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKLFKGRRMPLAIICMTLIFICLIGYW 344
F + ++ + ++ ++ G ++ +L + R + L +I +I L
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 345 KSESLLMVTVFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGASLGTS 404
+ + V A G + Q + S Q E GS L ++ I G L T+
Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS-LTSIVGPLLFTA 357

Query: 405 LFGVMVDKMGWHGGFYLLMGGIVCCIL 431
++ + W+G ++ + L
Sbjct: 358 IYAASITT--WNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09010  INTIMIN  32  5e-04  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 32.3 bits (73), Expect = 5e-04
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 7/70 (10%)

Query: 84 SDGVKVTQSGAESR-FYTVKSGDTLSAISKAMYGSANDYQRIFEANKPMLTHPD---KIY 139
SD +T + ++R FYT+K+G+T++ +SK+ + I+ NK + + K
Sbjct: 49 SDSKLLTHNSYQNRLFYTLKTGETVADLSKS---QDINLSTIWSLNKHLYSSESEMMKAE 105

Query: 140 PGQVLIIPAK 149
PGQ +I+P K
Sbjct: 106 PGQQIILPLK 115


31  D364_RS09070  D364_RS09135  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09070  -2      39       -5.844030   M20 family metallopeptidase
D364_RS09075  1       45       -7.691024   cell envelope integrity protein TolA
D364_RS09080  1       44       -7.992067   N-acyl homoserine lactonase family protein
D364_RS09085  1       44       -8.101609   LysR family transcriptional regulator
D364_RS09090  2       44       -9.086282   LysR family transcriptional regulator
D364_RS09095  1       40       -9.418031   MFS transporter
D364_RS09100  1       32       -8.333291   LysE family translocator
D364_RS09105  1       34       -9.827843   PTS fructose transporter subunit IIC
D364_RS09110  2       37       -11.292585  PTS fructose transporter subunit IIB
D364_RS09115  1       34       -9.930201   PTS sugar transporter subunit IIA
D364_RS09120  1       32       -9.211202   ribulose-phosphate 3-epimerase
D364_RS09125  1       25       -5.874719   PTS ascorbate transporter subunit IIC
D364_RS09130  1       24       -5.128118   PTS sugar transporter subunit IIB
D364_RS09135  0       22       -3.645975   PTS sugar transporter subunit IIA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09080  ALARACEMASE  29  0.014  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 29.4 bits (66), Expect = 0.014
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 3/40 (7%)

Query: 41 LTHPD-GFTLIDGGLAVEGLKDPSGYWG-SAVEQFKPVMS 78
L HP+ F + G+ + G PSG W A +PVM+
Sbjct: 196 LWHPEAHFDWVRPGIILYGAS-PSGQWRDIANTGLRPVMT 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09095  TCRTETA  31  0.013  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 30.6 bits (69), Expect = 0.013
Identities = 26/141 (18%), Positives = 48/141 (34%), Gaps = 4/141 (2%)

Query: 18 MVIAFVQFTNALEYMMFSPVFTFMAADF---AVPVTFSGYVSGMYTSGAVLSGIIAFYWI 74
+VI +A+ + PV + D G + +Y +
Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 75 DRCNKKHFLIANMVLLAMATLLTTFTTSFPLLLTLRFFAGLVGGTTMAVGITILINHTPA 134
DR ++ L+ ++ A+ + +L R AG+ G T AV + + T
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDG 126

Query: 135 DLRGKMLATVIASFSMVSIVG 155
D R + + A F + G
Sbjct: 127 DERARHFGFMSACFGFGMVAG 147


32  D364_RS09180  D364_RS09240  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09180  -1      15       4.664959    FAD-NAD(P)-binding protein
D364_RS09185  0       16       4.937019    NADH:flavin oxidoreductase/NADH oxidase
D364_RS09190  1       17       4.235673    hypothetical protein
D364_RS09195  2       17       4.711680    LysR family transcriptional regulator
D364_RS09200  3       17       4.818828    nitronate monooxygenase
D364_RS09205  2       19       4.317401    allantoate amidohydrolase
D364_RS09210  2       19       2.778749    alanine--glyoxylate aminotransferase family
D364_RS09215  1       18       4.456763    amino acid ABC transporter ATP-binding protein
D364_RS09220  2       19       5.012991    amino acid ABC transporter permease
D364_RS09225  2       19       6.313155    amino acid ABC transporter permease
D364_RS09230  3       20       6.789815    transporter substrate-binding domain-containing
D364_RS09235  3       17       6.066909    MurR/RpiR family transcriptional regulator HpxU
D364_RS09240  1       13       3.537046    oxamate amidohydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09215  PF05272  29  0.015  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.015
Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 4/42 (9%)

Query: 31 VISIIGRSGSGKSTLLRCINGLEGYQEGSIKLGGMTITNRDS 72
+ + G G GKSTL+ + GL+ + + +G T +DS
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG----TGKDS 635


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09235  BCTERIALGSPD  29  0.020  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 29.1 bits (65), Expect = 0.020
Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 13/72 (18%)

Query: 58 RLGYDKYKDMRDELRTL-------RQSGMPLTDQRDAV------QGNTLLARHYKQEMAN 104
L Y K D+ + L + +Q+ P+ + Q N L+ M +
Sbjct: 273 YLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMND 332

Query: 105 LTQWVNALDARQ 116
L + + LD R+
Sbjct: 333 LERVIAQLDIRR 344


33  D364_RS09285  D364_RS09330  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09285  -1      24       -3.372992   beta-ketoacyl-ACP synthase II
D364_RS09290  0       40       -6.785914   ASCH domain-containing protein
D364_RS09305  3       55       -11.643276  hypothetical protein
D364_RS26010  2       57       -11.376416  helix-turn-helix transcriptional regulator
D364_RS09315  1       52       -10.293387  GMP synthase
D364_RS09320  1       47       -8.955248   SDR family oxidoreductase
D364_RS09325  0       44       -7.365430   cysteine hydrolase
D364_RS09330  0       28       -3.962916   histidine phosphatase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09325  DHBDHDRGNASE  64  5e-14  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 63.5 bits (154), Expect = 5e-14
Identities = 48/185 (25%), Positives = 84/185 (45%), Gaps = 3/185 (1%)

Query: 2 LRGKRAVITGGGTGFGQALSVWLAREGVEVDFCARRADDIQKTCSIITAEGGMAKGHLCD 61
+ GK A ITG G G+A++ LA +G + + ++K S + AE A+ D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 LTLPESLSQFSSQLLTLDKPIDILILNAAQWLSGTLDDQSDTEIINTISSGLTGSILLTQ 121
+ ++ + ++++ PIDIL+ A G + SD E T S TG ++
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 ALLPGLRRSESADIVSIISSCGIPNFTDSIAHPAFFASKHGLSGFTTKLSYQLSKENIRV 181
++ + S IV++ S+ P + A+ +SK FT L +L++ NIR
Sbjct: 126 SVSKYMMDRRSGSIVTVGSN---PAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 182 TGLYP 186
+ P
Sbjct: 183 NIVSP 187


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09330  ISCHRISMTASE  51  3e-10  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 50.8 bits (121), Expect = 3e-10
Identities = 27/106 (25%), Positives = 48/106 (45%), Gaps = 2/106 (1%)

Query: 54 ADNESTAIHPDVAPAENEV--VKRRVGAFSFTELEMILRAQGIENLILTGVTTSRVVLST 111
+ I ++AP ++++ K R AF T L ++R +G + LI+TG+ L T
Sbjct: 101 SGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVT 160

Query: 112 VGQAFDLDYRLIVVNDYCADPDPDTNMFLLKKVLPQHAFVTSSSEI 157
+AF D + V D AD + + L+ + AF + +
Sbjct: 161 ACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSL 206


34  D364_RS09385  D364_RS09495  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09385  0       14       3.415609    Gfo/Idh/MocA family oxidoreductase
D364_RS09390  0       15       4.394453    DUF4056 domain-containing protein
D364_RS09395  0       13       4.718188    thiaminase II
D364_RS27435  -2      13       3.216892    ABC transporter ATP-binding protein
D364_RS09410  -1      13       3.294658    ABC transporter permease
D364_RS09415  0       13       3.700318    ABC transporter substrate-binding protein
D364_RS09420  0       13       3.521963    diaminobutyrate--2-oxoglutarate transaminase
D364_RS09425  -1      13       3.938149    aspartate aminotransferase family protein
D364_RS09430  1       13       4.284418    LysR family transcriptional regulator
D364_RS09435  1       16       4.901186    cytochrome c biogenesis protein/redoxin
D364_RS09440  3       16       4.422015    HAMP domain-containing protein
D364_RS09445  2       17       3.747060    response regulator
D364_RS09450  2       16       2.710493    alpha/beta hydrolase
D364_RS09455  2       15       1.485724    dipeptidase
D364_RS09460  2       13       1.327421    pyrroloquinoline quinone precursor peptide PqqA
D364_RS09465  -1      15       1.060207    pyrroloquinoline quinone biosynthesis protein
D364_RS09470  2       14       5.159942    pyrroloquinoline-quinone synthase PqqC
D364_RS09480  0       13       4.729726    pyrroloquinoline quinone biosynthesis peptide
D364_RS09485  0       13       4.807166    pyrroloquinoline quinone biosynthesis protein
D364_RS09490  1       12       4.646194    pyrroloquinoline quinone biosynthesis protein
D364_RS09495  1       13       4.487940    glutamine amidotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09455  HTHFIS  96  5e-25  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 96.4 bits (240), Expect = 5e-25
Identities = 42/127 (33%), Positives = 65/127 (51%), Gaps = 1/127 (0%)

Query: 6 HILVVDDDRDIRELIVDYLEKSGYRASGAANGKAMWSVLKNHQIDLIVLDIMMPGEDGLT 65
ILV DDD IR ++ L ++GY +N +W + DL+V D++MP E+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 66 LCRQLRANPQQDIPVLMLTARTDDSDRILGLEMGADDYLIKPFVARELLARIKAILRRTR 125
L +++ + D+PVL+++A+ I E GA DYL KPF EL+ I L +
Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 126 ALPPNLQ 132
P L+
Sbjct: 124 RRPSKLE 130


35  D364_RS09570  D364_RS09595  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09570  2       20       0.987061    ABC transporter substrate-binding protein
D364_RS09575  2       20       -2.345307   DUF2236 domain-containing protein
D364_RS09580  2       19       -2.691705   OsmC family protein
D364_RS27440  2       17       -1.070440   hypothetical protein
D364_RS09590  2       17       -0.800995   YdeI family stress tolerance OB fold protein
D364_RS09595  2       18       -1.250800   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09590  PF04619  27  0.014  Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 27.2 bits (60), Expect = 0.014
Identities = 11/21 (52%), Positives = 14/21 (66%), Gaps = 2/21 (9%)

Query: 1 MKKLAL--AMACLFAVGVAQA 19
MKKLA+ A + +FAV A A
Sbjct: 1 MKKLAIMAAASMVFAVSSAHA 21


36  D364_RS09675  D364_RS09735  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09675  0       12       3.076352    ATP-binding cassette domain-containing protein
D364_RS09680  0       11       3.153346    helix-turn-helix domain-containing GNAT family
D364_RS09685  -1      15       3.817664    inorganic diphosphatase
D364_RS09695  -2      14       4.480464    alcohol dehydrogenase AdhP
D364_RS09700  -2      14       5.263959    TIGR04028 family ABC transporter
D364_RS09705  -1      13       5.382751    ABC transporter permease
D364_RS09710  -2      14       5.408173    ABC transporter permease
D364_RS09715  -1      14       4.054339    ABC transporter ATP-binding protein
D364_RS09720  -1      12       2.872139    putative FMN-dependent luciferase-like
D364_RS09725  1       13       0.437836    alkylhydroperoxidase domain protein
D364_RS09730  1       15       -2.916313   MFS transporter AraJ
D364_RS09735  0       16       -3.642441   VOC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09680  SACTRNSFRASE  33  6e-04  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.4 bits (76), Expect = 6e-04
Identities = 12/47 (25%), Positives = 19/47 (40%)

Query: 246 RGTGIGRRLLSEAMAFCDSRQFSAVQLWTFKGLDAARKLYESFGFTL 292
R G+G LL +A+ + F + L T +A Y F +
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09730  TCRTETA  56  1e-10  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 56.0 bits (135), Expect = 1e-10
Identities = 80/390 (20%), Positives = 134/390 (34%), Gaps = 37/390 (9%)

Query: 5 IFSLALGTFGLGMAEFGIMGVLPDMAHDVGISIPAA---GNMIAWYAFGVVIGAPIMALL 61
+ ++AL G+G+ IM VLP + D+ S G ++A YA AP++ L
Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 62 SSRFSLKSVMLFLAGLCILGNTLFTFSSSYAMLALGRLVSGFPHGAFFGVGAIILSKIAP 121
S RF + V+L + + + +L +GR+V+G GA I
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124

Query: 122 PGKVTAAVAGMIGGMTIANLVGVPGGTWLGHHFSWRYTFALIAVFNVAVFLAIFCWVPTL 181
G A G + +V P L FS F A N FL +P
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 182 YDRASTRLREQ---------FRFLASPAPWLI---FAATMFGNAGVFAWFSYIKPFMLNV 229
+ LR + + + L+ F + G W + +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE------ 238

Query: 230 SGFAESKMMLIMMLAGLGM---VVGNLFSGKISGRYSPLRIAAMTDGVIAVTLLLIFAFG 286
F + + LA G+ + + +G ++ R R + +L+
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 287 EQKVASLALAFICCAGLFALSAPLQILLLQNAKGGEMLGAAGGQIAF--NLGSAIGAFCG 344
+A + + G+ LQ +L + E G G +A +L S +G
Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVGPLLF 355

Query: 345 GMMIAQGFG-WNS-VALPAAALSFLAMSAL 372
+ A WN + AAL L + AL
Sbjct: 356 TAIYAASITTWNGWAWIAGAALYLLCLPAL 385


37  D364_RS09795  D364_RS09940  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS09795  2       14       2.877880    muconolactone Delta-isomerase
D364_RS09800  2       14       1.924846    muconate cycloisomerase family protein
D364_RS09805  -1      11       1.918843    helix-turn-helix domain-containing protein
D364_RS09810  -1      12       1.561855    efflux MFS transporter KmrA
D364_RS09815  -2      11       1.283671    TetR family transcriptional regulator
D364_RS09820  -2      14       0.439498    NarK family nitrate/nitrite MFS transporter
D364_RS09825  -2      12       0.282171    nitrate reductase subunit alpha
D364_RS09830  -2      13       2.089724    nitrate reductase subunit beta
D364_RS09835  0       14       3.158139    nitrate reductase molybdenum cofactor assembly
D364_RS09840  0       15       4.234333    respiratory nitrate reductase subunit gamma
D364_RS09845  0       14       4.530496    N-hydroxyarylamine O-acetyltransferase
D364_RS09850  -1      14       3.491099    iron ABC transporter permease
D364_RS09855  -1      13       3.499455    ABC transporter substrate-binding protein
D364_RS09860  -1      15       1.738261    ABC transporter ATP-binding protein
D364_RS09865  -2      14       1.081122    flavin reductase family protein
D364_RS09870  -1      15       -0.753954   helix-turn-helix domain-containing protein
D364_RS09880  -1      13       -1.327972   family 78 glycoside hydrolase catalytic domain
D364_RS09885  0       15       -1.650470   MFS transporter
D364_RS09890  2       14       -0.687690   carbohydrate porin
D364_RS26050  0       19       -0.866173   hypothetical protein
D364_RS09895  -1      15       2.556336    helix-turn-helix domain-containing protein
D364_RS09900  1       18       3.661755    tautomerase PptA
D364_RS09905  0       17       3.586702    glutathione S-transferase family protein
D364_RS09910  0       14       2.132186    transporter substrate-binding domain-containing
D364_RS09915  -1      16       2.193772    amino acid ABC transporter ATP-binding protein
D364_RS09920  0       15       3.275608    amino acid ABC transporter permease
D364_RS09925  0       15       3.199181    GNAT family N-acetyltransferase
D364_RS09930  1       15       3.553024    ABC transporter substrate-binding protein
D364_RS09935  -1      13       3.039287    LLM class flavin-dependent oxidoreductase
D364_RS09940  -2      14       3.638841    M20 peptidase aminoacylase family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09815  TCRTETB  318  e-106  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 318 bits (817), Expect = e-106
Identities = 83/395 (21%), Positives = 169/395 (42%), Gaps = 15/395 (3%)

Query: 20 IDATVLHVAAPTLSVALGSSGNELLWIIDIYSLVMAGMVLPMGALGDKIGFKRLLLLGSA 79
++ VL+V+ P ++ W+ + L + G L D++G KRLLL G
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 80 IFGIASLCAALSPT-AMTLIASRALLAVGAAMIVPATLAGIRSTFAEASQRNMALGLWAA 138
I S+ + + LI +R + GAA PA + + + + R A GL +
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFGLIGS 146

Query: 139 VGSGGAAFGPLVGGILLEHFYWGSVFLINVPIVLVVIAINAKVVPRQPARREQPLNLLQA 198
+ + G GP +GG++ + +W +L+ +P++ ++ + ++ R + ++
Sbjct: 147 IVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI 204

Query: 199 LVLIAAILMLVFSAKSALKGQLALWLTALVALGGAAMLTWFIRKQLSAARPMVDMRLFTH 258
+++ I+ + S L + + + + F++ P VD L +
Sbjct: 205 ILMSVGIVFFMLFTTSYSISFLIVSVLSFLI---------FVKHIRKVTDPFVDPGLGKN 255

Query: 259 RIILSGVMMAMTALITLVGFELLMAQELQFVHQKTPFEAG-IFMLPVMVASGFSGPIAGL 317
+ GV+ T+ GF ++ ++ VHQ + E G + + P ++ G I G+
Sbjct: 256 IPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGI 315

Query: 318 LVSRLGLREVATGGMLLSAFSFLGLALTDFSTQQWLAWGLMTLLGFSVASALLASSSAIM 377
LV R G V G+ + SFL + +T ++ ++ +LG + + S+
Sbjct: 316 LVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSS 375

Query: 378 AAAPKEKAAAAGAIETMAYELGAGLGIALFGLILT 412
+ +E A + ++ L G GIA+ G +L+
Sbjct: 376 SLKQQEAGAGMSLLNFTSF-LSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09820  HTHTETR  52  1e-10  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 1e-10
Identities = 26/180 (14%), Positives = 64/180 (35%), Gaps = 11/180 (6%)

Query: 5 QRDARREGIMQAAMRLALRGGFAAMTVRQIAREAQVAAGQLHHHFTSIGELKAQVFIRLI 64
+ R+ I+ A+RL + G ++ ++ +IA+ A V G ++ HF +L ++++
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 65 REMLDMPLVAED-------ASWRERL---FSMIGSEDGRLEPYIRLWREGQVLADSDPDI 114
+ ++ L + + RE L +E+ R + + +
Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERR-RLLMEIIFHKCEFVGEMAVV 126

Query: 115 KAAYLLTMNMWHAETVAIIEQGLASGEFRSAEPAADIAWRFIALVCGLDGIYALDAQALD 174
+ A + ++ + + + A + GL + Q+ D
Sbjct: 127 QQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFD 186


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09865  PF05272  30  0.005  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.005
Identities = 12/31 (38%), Positives = 14/31 (45%)

Query: 38 LLGPSGCGKSTLLRLLAGLSVPASGEIRFGD 68
L G G GKSTL+ L GL + G
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS09890  GPOSANCHOR  37  2e-04  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.0 bits (85), Expect = 2e-04
Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 25/96 (26%)

Query: 32 AQLEARLNLAEQ--QASEASRR-------AQRAEQQTAAAEQRAAAAEQQVQALSQQTTA 82
QLEA E+ + SEASR+ A R ++ A ++ AL +
Sbjct: 361 KQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKA--LEEANSKLAALEKLNKE 418

Query: 83 REQKQQATNQQ--------------LSEQLAKRAPD 104
E+ ++ T ++ L E+LAK+A +
Sbjct: 419 LEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEE 454



Score = 32.3 bits (73), Expect = 0.005
Identities = 24/82 (29%), Positives = 36/82 (43%), Gaps = 13/82 (15%)

Query: 26 SVEQRLAQLEARLN-LAEQ-QASEASRRAQRAEQQTAAAEQRAAAAEQQVQALSQQTTAR 83
+ + QLEA L EQ + SEASR++ R + + ++ AE Q
Sbjct: 320 ASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQ--------KLE 371

Query: 84 EQKQ--QATNQQLSEQL-AKRA 102
EQ + +A+ Q L L A R
Sbjct: 372 EQNKISEASRQSLRRDLDASRE 393



Score = 30.4 bits (68), Expect = 0.018
Identities = 19/77 (24%), Positives = 30/77 (38%), Gaps = 12/77 (15%)

Query: 27 VEQRLAQLEARLNLAEQ--QASEASRRAQRAEQQTAAAEQRAAAAEQQVQALSQQTTARE 84
+E A LEA E Q A+R++ R + + ++ AE Q E
Sbjct: 286 LEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ--------KLEE 337

Query: 85 QKQ--QATNQQLSEQLA 99
Q + +A+ Q L L
Sbjct: 338 QNKISEASRQSLRRDLD 354


38  D364_RS10465  D364_RS10905  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS10465  0       24       -3.344303   riboflavin synthase
D364_RS10470  1       22       -1.417521   MdtK family multidrug efflux MATE transporter
D364_RS10475  1       27       -1.029894   acid resistance repetitive basic protein Asr
D364_RS10480  1       23       0.599859    TIGR00730 family Rossman fold protein
D364_RS10500  0       23       1.009085    ***hypothetical protein
D364_RS26060  1       21       2.896434    hypothetical protein
D364_RS10510  1       19       4.261784    SDR family oxidoreductase
D364_RS10515  1       19       2.517438    AraC family transcriptional regulator
D364_RS10520  0       21       0.138907    LysR family transcriptional regulator
D364_RS10525  0       23       -2.061968   hypothetical protein
D364_RS10530  -1      21       -3.405510   MBL fold metallo-hydrolase
D364_RS27450  2       18       -4.208264   hypothetical protein
D364_RS10540  1       15       -3.299567   helix-turn-helix domain-containing protein
D364_RS10545  0       14       -1.707173   hypothetical protein
D364_RS10550  0       13       0.158464    DUF1471 domain-containing protein
D364_RS10560  -1      14       0.468780    carbonic anhydrase
D364_RS10565  -1      15       1.188001    LysE family transporter
D364_RS10570  -1      14       1.470134    DeoR/GlpR family DNA-binding transcription
D364_RS10575  0       16       2.748371    glyoxalase/bleomycin resistance/extradiol
D364_RS10580  -1      16       3.020317    CatA-like O-acetyltransferase
D364_RS10585  -1      15       2.732661    ABC transporter substrate-binding protein
D364_RS10590  2       16       3.220573    ABC transporter permease
D364_RS10595  2       16       3.119435    ABC transporter permease
D364_RS10600  0       14       2.859126    ABC transporter ATP-binding protein
D364_RS10605  0       15       1.399159    ATP-binding cassette domain-containing protein
D364_RS10610  1       19       -1.070303   membrane protein
D364_RS10615  1       23       -2.350126   DinB family protein
D364_RS10620  3       28       -3.339282   hypothetical protein
D364_RS10625  2       23       -2.630864   MFS transporter
D364_RS10630  0       20       -1.587192   LysR family transcriptional regulator
D364_RS10635  0       18       -1.217856   zinc-dependent alcohol dehydrogenase family
D364_RS10640  1       15       0.215961    LysR family transcriptional regulator
D364_RS26065  0       14       1.742294    GFA family protein
D364_RS10645  1       15       4.309868    isopenicillin N synthase family oxygenase
D364_RS10650  1       15       4.755770    amidohydrolase family protein
D364_RS10655  1       16       4.017892    ABC transporter substrate-binding protein
D364_RS10660  0       16       3.831859    LysR family transcriptional regulator
D364_RS10670  0       16       4.535855    PDR/VanB family oxidoreductase
D364_RS10675  0       20       4.856646    hypothetical protein
D364_RS10680  0       18       3.769516    RidA family protein
D364_RS10685  0       18       4.031327    creatininase family protein
D364_RS10690  -2      16       1.230161    ABC transporter substrate-binding protein
D364_RS10695  -2      16       1.097109    ABC transporter ATP-binding protein
D364_RS10700  -1      16       1.053120    ABC transporter permease
D364_RS10705  -2      14       1.508179    FAD-binding oxidoreductase
D364_RS26070  0       15       0.452741    hypothetical protein
D364_RS10715  0       16       2.570023    DUF535 family protein
D364_RS26985  0       22       6.807023    hypothetical protein
D364_RS10725  -1      21       6.616847    LysR family transcriptional regulator
D364_RS10730  0       25       6.981052    CoA transferase subunit A
D364_RS10735  0       24       6.557445    CoA transferase subunit B
D364_RS10740  1       24       6.229698    acetyl-CoA C-acetyltransferase
D364_RS10745  1       23       5.171644    3-hydroxyacyl-CoA dehydrogenase family protein
D364_RS10750  -1      22       3.989275    GntP family permease
D364_RS10755  -2      17       1.642346    3-hydroxybutyrate dehydrogenase
D364_RS10760  -1      18       -3.376349   LysR family transcriptional regulator
D364_RS10765  -1      24       -5.083591   acetolactate decarboxylase
D364_RS10770  0       30       -7.013727   acetolactate synthase AlsS
D364_RS10775  2       45       -10.681625  (S)-acetoin forming diacetyl reductase
D364_RS10780  3       52       -12.907890  GNAT family N-acetyltransferase
D364_RS10785  3       53       -13.762344  multidrug efflux RND transporter permease
D364_RS10790  4       60       -16.081193  ABC transporter six-transmembrane
D364_RS10795  2       50       -12.423935  transporter
D364_RS10800  1       43       -9.809536   response regulator
D364_RS10805  1       36       -7.853175   HAMP domain-containing histidine kinase CrrB
D364_RS10810  1       16       -1.739851   LD-carboxypeptidase
D364_RS10815  1       12       2.203935    c-type cytochrome biogenesis protein CcmI
D364_RS10820  3       21       6.023677    cytochrome c-type biogenesis protein CcmH
D364_RS10825  3       22       5.477968    DsbE family thiol:disulfide interchange protein
D364_RS10830  3       23       6.349466    heme lyase CcmF/NrfE family subunit
D364_RS10835  4       23       7.100504    cytochrome c maturation protein CcmE
D364_RS10840  1       21       6.765156    heme exporter protein CcmD
D364_RS10845  0       22       5.452556    heme ABC transporter permease
D364_RS10850  0       21       4.852804    heme exporter protein CcmB
D364_RS10855  0       18       4.401873    cytochrome c biogenesis heme-transporting ATPase
D364_RS10860  0       17       3.782861    cytochrome c
D364_RS10865  -2      18       2.988192    GMC family oxidoreductase
D364_RS10870  0       15       1.406230    gluconate 2-dehydrogenase subunit 3 family
D364_RS10875  -2      14       2.409260    VOC family protein
D364_RS10880  -2      16       2.846125    ABC transporter ATP-binding protein
D364_RS10885  -1      17       2.948409    iron ABC transporter permease
D364_RS10890  -1      16       3.339218    ABC transporter substrate-binding protein
D364_RS10895  -1      15       3.819995    malate/lactate/ureidoglycolate dehydrogenase
D364_RS10905  0       16       4.985458    formate dehydrogenase subunit alpha
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10510  DHBDHDRGNASE  95     2e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.7 bits (235), Expect = 2e-25
Identities = 57/180 (31%), Positives = 80/180 (44%), Gaps = 10/180 (5%)

Query: 2 QTIMITGCSSGFGLETARYFLEQGWKVIATMRAPQEGVLPASDRLRLVR------LDVTS 55
+ ITG + G G AR QG + A P++ S R DV
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 56 AQSIAEAI----AEVGEIDVLVNNAGVGMLNALEGAPREAIANLFATNTLGTIAMTQAVI 111
+ +I E E+G ID+LVN AGV + E F+ N+ G +++V
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 112 PRFRARRSGTIVNITSAVTLQPMPLLAVYTASKAAVNAFTESLALELRAFNIRVGLILPG 171
RRSG+IV + S P +A Y +SKAA FT+ L LEL +NIR ++ PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10570  ARGREPRESSOR  30     0.004    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 30.2 bits (68), Expect = 0.004
Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 5/44 (11%)

Query: 10 QRQALICQILQENGRVVCAELAARLQ-----VSEHTIRRDLHEL 48
QR I +I+ N EL L+ V++ T+ RD+ EL
Sbjct: 5 QRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10610  cloacin       31     0.002    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 30.8 bits (69), Expect = 0.002
Identities = 15/44 (34%), Positives = 20/44 (45%)

Query: 23 PAYANPGNGNGNGGGNHGNSGNHGNSGNHGNNGNSGDHGNKGQN 66
+++ N G G G+ + G GN G NGNSG G N
Sbjct: 37 SGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80



Score = 29.7 bits (66), Expect = 0.006
Identities = 13/39 (33%), Positives = 16/39 (41%)

Query: 27 NPGNGNGNGGGNHGNSGNHGNSGNHGNNGNSGDHGNKGQ 65
NP G G + G HGN G +GN+G G
Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82



Score = 29.7 bits (66), Expect = 0.007
Identities = 11/32 (34%), Positives = 16/32 (50%)

Query: 26 ANPGNGNGNGGGNHGNSGNHGNSGNHGNNGNS 57
+ G G+G GN G +GN G G N ++
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10625  TCRTETB       108    1e-27    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 108 bits (272), Expect = 1e-27
Identities = 81/384 (21%), Positives = 156/384 (40%), Gaps = 14/384 (3%)

Query: 39 PAIQQSLGGSPAALSWLTNGFMLTFGSFLLAAGVTADAIDRKRIFIAGAALFCLSSLLFC 98
P I PA+ +W+ FMLTF G +D + KR+ + G + C S++
Sbjct: 38 PDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF 97

Query: 99 LTHNLFLSGVL-RALQGLAAAMILASGSAALAQLYDGAQRTRAFSILGTVFGVGLAFGPL 157
+ H+ F ++ R +QG AA A +A+ R +AF ++G++ +G GP
Sbjct: 98 VGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157

Query: 158 LIGFMTDAVGWRGVYALFALLSAIVLLIGLAYLPAAEKSEPRTPDNLGLTLFTLALMLFT 217
+ G + + W Y L + I+ + L L E D G+ L ++ ++ F
Sbjct: 158 IGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFM 215

Query: 218 ASLMVIPARGFLSLTTLALLIASGGLFVAFVVRCRRVNNPVLELSLLRHPRFVGVLLLPV 277
+ FL ++ L+ LI FV R+V +P ++ L ++ F+ +L
Sbjct: 216 LF-TTSYSISFLIVSVLSFLI--------FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGG 266

Query: 278 ATCCCYVVLLIIVPLHFMGGEGMSESQ-SALYLMALTTPMLVFPSVAALLTRWFSPGQVS 336
+ +VP +S ++ ++ + T +++F + +L P V
Sbjct: 267 IIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVL 326

Query: 337 TAGLMMASVGLLLLGDAFHSNHLPQLVLALILCGAGAALPWGLMDGLAISAVPVAKAGMA 396
G+ SV L + + ++ G + ++ + S++ +AG
Sbjct: 327 NIGVTFLSVSFLTASFLLETTSWFM-TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAG 385

Query: 397 AGLFNTVRVAGEGIALAVVSAVLT 420
L N EG +A+V +L+
Sbjct: 386 MSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10650  FERRIBNDNGPP  32     0.004    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 32.2 bits (73), Expect = 0.004
Identities = 17/72 (23%), Positives = 32/72 (44%), Gaps = 9/72 (12%)

Query: 188 LVSRYHDPRPESLRRVVMAPTTVLHSAPGAQ-LREMAKLARQLGIRL------HSHLSET 240
+ S + P PE L R+ AP + + G Q L K ++ L +HL++
Sbjct: 101 VWSAGYGPSPEMLARI--APGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQY 158

Query: 241 VDYLDAARQKFA 252
D++ + + +F
Sbjct: 159 EDFIRSMKPRFV 170


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10705  BINARYTOXINA  31     0.010    Clostridial binary toxin A signature.
		>BINARYTOXINA#Clostridial binary toxin A signature.

Length = 454

Score = 31.2 bits (70), Expect = 0.010
Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 8/95 (8%)

Query: 144 STYRLASLGGLYGGGFGGIGS--INYGPLAAPGNVLSVKVMTVEPAPRVLTVPAPEALLL 201
+ LA + GG+ I + I+ GPL P L KV +E A ++ P P L++
Sbjct: 277 TPNELADVNDYMRGGYTAINNYLISNGPLNNPNPELDSKVNNIENALKL--TPIPSNLIV 334

Query: 202 HHAYGTNGIILEVELALAPAHQWIERLDVFDDFAD 236
+ G E L L +++ D F +
Sbjct: 335 YRRSGP----QEFGLTLTSPEYDFNKIENIDAFKE 365


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10750  ABC2TRNSPORT  30     0.027    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 29.5 bits (66), Expect = 0.027
Identities = 32/129 (24%), Positives = 47/129 (36%), Gaps = 15/129 (11%)

Query: 6 ALAALALLMLAAYRGY----SVILFAPIAALGAVLLTDPGAVGPA----------FTGLF 51
ALA + ++AA GY S++ P+ AL + G V A + L
Sbjct: 126 ALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLV 185

Query: 52 MEKMVGFVKLYFPVFLLGAVFGKLIELSGFSRSIVAAAIRILGRRHAIPVIVLVCALLTY 111
+ ++ FPV L VF S SI +LG + V V AL Y
Sbjct: 186 ITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP-VVDVCQHVGALCIY 244

Query: 112 GGVSLFVVA 120
+ F+
Sbjct: 245 IVIPFFLST 253


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10755  DHBDHDRGNASE  115    4e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (288), Expect = 4e-33
Identities = 66/255 (25%), Positives = 105/255 (41%), Gaps = 9/255 (3%)

Query: 3 LHGKTALVTGSTSGIGLGIAKVLAQAGAQLVLNGFGDSSHARAE--VAALGKIPGYHDAD 60
+ GK A +TG+ GIG +A+ LA GA + + + + A + AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LRDVGQIEAMMRYAESTFGGVDIVINNAGIQHVAPVEQFPVDKWNDILAINLSSVFHTTR 120
+RD I+ + E G +DI++N AG+ + ++W ++N + VF+ +R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 LALPGMRQRNWGRIINIASVHGLVASKEKSAYVAAKHAVVGLTKTVALETARSGITCNAI 180
M R G I+ + S V +AY ++K A V TK + LE A I CN +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 CPGWVLTPLVQQQIDKRIAEGVDPEQASAQLLAEKQ---PSGEFVTPQQLGEMALFLCSD 237
PG T + EQ L + P + P + + LFL S
Sbjct: 186 SPGSTETDMQWSLWADENGA----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 238 AAAQVRGAAWNMDGG 252
A + +DGG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10775  DHBDHDRGNASE  124    1e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (312), Expect = 1e-36
Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 10/254 (3%)

Query: 3 KVALVTGAGQGIGKAIALRLVKDGFAVAIADYNDATAKAVASEINQAGGRAMAVKVDVSD 62
K+A +TGA QGIG+A+A L G +A DYN + V S + A A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 RDQVFAAVEQARKTLGGFDVIVNNAGVAPSTPIESITPEIVDKVYNINVKGVIWGIQAAV 122
+ + + +G D++VN AGV I S++ E + +++N GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 EAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVRGLTQTAARDLAPLGITVNGYCP 182
+ G I+ S V +A Y+SSK A T+ +LA I N P
Sbjct: 129 KYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 183 GIVKTPM----WAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPD 238
G +T M WA+ + G F I L +L++P D+A V +L S
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGS-----LETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 239 SDYMTGQSLLIDGG 252
+ ++T +L +DGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10785  ACRIFLAVINRP  1060   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1060 bits (2744), Expect = 0.0
Identities = 502/1031 (48%), Positives = 690/1031 (66%), Gaps = 7/1031 (0%)

Query: 1 MPHFFIERPIFAWVIALFIVLTGLLSIPRLPVAQYPEVAPPGIIISVSYPGASPEVMNTS 60
M +FFI RPIFAWV+A+ +++ G L+I +LPVAQYP +APP + +S +YPGA + + +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVSLIEREISSVDNLLYFESSSDTTGMASITVTFKPGTDIKLAQMDLQNQIKIVESRLPQ 120
V +IE+ ++ +DNL+Y S+SD+ G +IT+TF+ GTD +AQ+ +QN++++ LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 SVRQNGINVEAANSGFLMMVGLKSPSGAYQEADLSDYFARNVTDELRRVPGVGKVQLFGG 180
V+Q GI+VE ++S +LM+ G S + + D+SDY A NV D L R+ GVG VQLFG
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EKALRIWLDPMKLHSYGLSVTDVLSAISQQNVIVSPGRTGDEPATSSQEVTYPITVKGQL 240
+ A+RIWLD L+ Y L+ DV++ + QN ++ G+ G PA Q++ I + +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 SSVEEFRNITIKSQVSAARVTLADVARVESGLQSYAFGIRENGVPATAAAIQLSPGANAI 300
+ EEF +T++ + V L DVARVE G ++Y R NG PA I+L+ GANA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 STASGIRARLTELSGVLPEGMTFTVPFDTAPFVKLSILKVVETFVEAMVLVFFVMLLFLH 360
TA I+A+L EL P+GM P+DT PFV+LSI +VV+T EA++LVF VM LFL
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 KIRCTLIPAIVAPVALLGTFTVMLLSGYSINILTMFGMILAIGIIVDDAIVVVENVERLM 420
+R TLIP I PV LLGTF ++ GYSIN LTMFGM+LAIG++VDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 EDKKMSPQDATREAMREITPAIIGITLVLTAVFIPMAFASGSVGIIYRQFSISMAISILL 480
+ K+ P++AT ++M +I A++GI +VL+AVFIPMAF GS G IYRQFSI++ ++ L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SAFLALTLTPALCATLLKP-HGIHQGKSSVFSAWFNAHFHRLTSFYATGLGFVLKRTGRM 539
S +AL LTPALCATLLKP H F WFN F + Y +G +L TGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 540 MMIYAALCLALFAGLSTLPSSFLPDEDQGYFMSSIQLPSDATMQRTLKVVDTFEEEI--A 597
++IYA + + LPSSFLP+EDQG F++ IQLP+ AT +RT KV+D +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 HRQAVESNIMILGFGFSGSGQNSAMAFTTLKDWRQRKGT--TAQEEADHIRSQMANVPDA 655
+ VES + GF FSG QN+ MAF +LK W +R G +A+ + ++ + D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 656 VTMSLLPPAISDMGTSSGFTYYLQDRGGKGYQALKKAADELIVQANHNP-HLADVYIDGL 714
+ PAI ++GT++GF + L D+ G G+ AL +A ++L+ A +P L V +GL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 715 GEGTSLSLHVDREKAEAMGVSFDEINQTISVAAGSNYVNDYTNNGRVQQVIVQADAPYRM 774
+ L VD+EKA+A+GVS +INQTIS A G YVND+ + GRV+++ VQADA +RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 775 QPEQLLALSVKNRLGQMLPLSTFVTLSWNVAPQQLIRYQGYPAIRITGSSAQGKSSGTAM 834
PE + L V++ G+M+P S F T W +L RY G P++ I G +A G SSG AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 835 AAMDNLAKHLPPGFAGEWAGSSLQEKESASQLPGLIVLSVLVVFMVLAALYESWSIPFAV 894
A M+NLA LP G +W G S QE+ S +Q P L+ +S +VVF+ LAALYESWSIP +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 895 MLVVPLGLLGAVLAVSVTNMTNDVFFKVGLITLIGLSAKNAILIIEFARQLM-KEGKSLI 953
MLVVPLG++G +LA ++ N NDV+F VGL+T IGLSAKNAILI+EFA+ LM KEGK ++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 954 DATLTAAKLRLRPILMTSLAFTLGVVPLMLASGASDSTQHAIGTGVFGGMISGTLLAIFF 1013
+ATL A ++RLRPILMTSLAF LGV+PL +++GA Q+A+G GV GGM+S TLLAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1014 VPVFFVTITRF 1024
VPVFFV I R
Sbjct: 1021 VPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10800  HTHFIS        78     7e-19    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 77.9 bits (192), Expect = 7e-19
Identities = 29/118 (24%), Positives = 59/118 (50%), Gaps = 1/118 (0%)

Query: 7 IIVAEDDDDIAAILTGYLRKAGMKTLRAEDGEQAINLTRLNKPDLLLLDIHLPVYDGWNV 66
I+VA+DD I +L L +AG + DL++ D+ +P + +++
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 67 LTTLRKE-TNVPVIMVTALDQDVDKLMGLRLGADDYVIKPFNPSEVIARVEAVLRRTR 123
L ++K ++PV++++A + + + GA DY+ KPF+ +E+I + L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10885  PF05272       29     0.043    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.043
Identities = 12/34 (35%), Positives = 17/34 (50%)

Query: 31 VVSLLGPSGSGKTTLLRAVAGLEKPTSGRIAIGN 64
V L G G GK+TL+ + GL+ + IG
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


39  D364_RS10950  D364_RS11130  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS10950  -2      19       5.229253    basic amino acid ABC transporter
D364_RS10955  -2      17       5.310231    MurR/RpiR family transcriptional regulator
D364_RS10965  0       17       4.583084    ABC transporter substrate-binding protein
D364_RS10970  0       17       4.862368    iron ABC transporter permease
D364_RS10975  1       17       5.374496    iron ABC transporter permease
D364_RS10980  0       16       4.507036    ABC transporter ATP-binding protein
D364_RS10985  -1      16       3.545341    TonB-dependent siderophore receptor
D364_RS10990  -1      17       4.277301    ABC transporter ATP-binding protein
D364_RS10995  1       18       6.169960    ABC transporter permease subunit
D364_RS11000  1       17       6.268132    ABC transporter substrate-binding protein
D364_RS11005  0       16       5.741288    ABC transporter substrate-binding protein
D364_RS11010  -1      17       6.927058    acyl-CoA/acyl-ACP dehydrogenase
D364_RS11015  1       18       6.940477    LLM class flavin-dependent oxidoreductase
D364_RS11020  2       17       7.325061    LysR family transcriptional regulator
D364_RS11025  4       18       7.036548    cysteine dioxygenase
D364_RS11030  2       17       6.616204    rhodanese homology domain-containing protein
D364_RS11035  2       18       6.974836    polyphenol oxidase family protein
D364_RS11040  2       19       5.694437    3-(3-hydroxy-phenyl)propionate transporter MhpT
D364_RS11045  1       19       6.000841    4-hydroxy-2-oxovalerate aldolase
D364_RS11050  0       19       5.003683    acetaldehyde dehydrogenase (acetylating)
D364_RS11055  -1      18       5.039925    2-keto-4-pentenoate hydratase
D364_RS11065  -1      18       5.208016    alpha/beta fold hydrolase
D364_RS11070  -1      18       5.991797    3-carboxyethylcatechol 2,3-dioxygenase
D364_RS11075  0       15       6.078955    bifunctional
D364_RS11080  2       17       6.430699    DNA-binding transcriptional regulator
D364_RS11085  3       19       6.102826    ABC transporter substrate-binding protein
D364_RS11090  1       19       6.331435    iron ABC transporter permease
D364_RS11095  0       19       5.166535    ABC transporter ATP-binding protein
D364_RS11100  -1      19       4.357011    substrate-binding domain-containing protein
D364_RS11105  -2      19       2.895548    shikimate dehydrogenase
D364_RS11110  -1      18       1.809105    MFS transporter
D364_RS11115  0       14       -0.221791   sugar phosphate isomerase/epimerase and
D364_RS11120  2       28       -4.358322   TetR family transcriptional regulator
D364_RS11125  2       23       -3.069317   fumarate hydratase FumD
D364_RS11130  1       21       -3.031142   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10970  FERRIBNDNGPP  39     2e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 38.8 bits (90), Expect = 2e-05
Identities = 17/82 (20%), Positives = 33/82 (40%), Gaps = 5/82 (6%)

Query: 107 DIDLEAVAAARPDLIITEPSRHVSVEQLEKIAPTVSIDHLQGSAP-----EIYRKLAQLT 161
+ +LE + +P ++ S E L +IAP + G P + ++A L
Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLL 145

Query: 162 GTQPRLAILERRYQEQIKQLKA 183
Q +Y++ I+ +K
Sbjct: 146 NLQSAAETHLAQYEDFIRSMKP 167


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10995  PF05272       29     0.022    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.022
Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 4/32 (12%)

Query: 38 LRPG---ESVALL-GPSGCGKSTLLRLLAGLE 65
+ PG + +L G G GKSTL+ L GL+
Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11045  TCRTETB       50     8e-09    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 50.3 bits (120), Expect = 8e-09
Identities = 79/400 (19%), Positives = 150/400 (37%), Gaps = 52/400 (13%)

Query: 14 VTIGLCFMVALMEGLDLQAAGIAAVGMAQAFALDKMQMGWIFSAGILGLLPGALVGGMLA 73
+ I LC + L+ ++ +A F W+ +A +L G V G L+
Sbjct: 15 ILIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73

Query: 74 DRHGRKRILLGSVLLFGLFSLATALAWS-FPTLLLARLLTGVGLGAALPNLIA-LTSEAA 131
D+ G KR+LL +++ S+ + S F L++AR + G G AA P L+ + +
Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132

Query: 132 GSRFRGRAVSLMYCGVPIGAALAAALGFSGLAAAWQTIFWIGGVVPLLLIPLLMRWLPES 191
RG+A L+ V +G + A+G + + ++ ++ +P LM+ L +
Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192

Query: 192 QAFQRA---------EASVPLRTLFAPGQAAATLLLWLGYFFTLLVVYMLINWLPMLLVG 242
+ + LF + + L++ + F + V ++ P + G
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL-IFVKHIRKVTDPFVDPG 251

Query: 243 QGFRASQAAGVMFSLQI-GAACGTLLLGALMDK--------------LTPLRMSLLIYS- 286
G GV+ I G G + + M K + P MS++I+
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311

Query: 287 --GILAS------LLALGSASSLTGMLLAGFV----------AGLFATGGQSVLYALAPL 328
GIL +L +G L A F+ +F GG S +
Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371

Query: 329 FYPAAIRATGVGTAVA----VGRLGAMSGPLLAGKMLALG 364
++++ G ++ L +G + G +L++
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11060  PF06438       29     0.014    Heme acquisition protein HasAp
		>PF06438#Heme acquisition protein HasAp

Length = 205

Score = 29.1 bits (65), Expect = 0.014
Identities = 25/121 (20%), Positives = 40/121 (33%), Gaps = 17/121 (14%)

Query: 139 VGSRIRDWSIGFVD-------TVADNASCGLYVIGGPAQRPAGLDLKQCAMHMTRNQE-L 190
V + DWS F D V + + G GP D Q A+ T +
Sbjct: 16 VADYLADWSAYFGDVNHRPGQVVDGSNTGGFN--PGP------FDGSQYALKSTASDAAF 67

Query: 191 VSSGRGSECLGHPLNAAVWLARKLASLGEPLRAGDIVLTGALG-PMVTINEGDSFAAHIE 249
++ G L + +W +LG+ L G AL V+ + + +
Sbjct: 68 IAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLGLDSPIAQ 127

Query: 250 G 250
G
Sbjct: 128 G 128


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11110  TCRTETA       48     6e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 47.5 bits (113), Expect = 6e-08
Identities = 69/370 (18%), Positives = 124/370 (33%), Gaps = 43/370 (11%)

Query: 64 GILFSAFAWTYALAQIPGGLFLDRFGNKVTYFLSLTLWSLFTLFHGMAVGLKTLLLCRFG 123
GIL + +A G DRFG + +SL ++ A L L + R
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 124 LGISEAPCFPVNSRVVSAWFPQQERAKA----TAVYTVGEYLGLACFAPLLFWIMDGFGW 179
GI+ A V ++ ERA+ +A + G G P+L +M GF
Sbjct: 106 AGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSP 159

Query: 180 RVLFVSVGAVGILFALVWWRCYREPHEDPRLSQQEREHIENGGGLSAPTDQQVAFSWPLV 239
F + A+ L L E H+ R + + +F W
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREA-----------LNPLASFRWARG 208

Query: 240 RQLLSKRQIIGASIGQFAGNTVLVFFLTWFPTWLATERHMPWLKVGFFSILPFVAAAGGV 299
+++ + + ++ + E W + + AA G+
Sbjct: 209 MTVVAALMAVFFIMQLVGQVPAALWVIF-------GEDRFHWDA----TTIGISLAAFGI 257

Query: 300 M---FGGWLSDKLLKATGSANLGRKLPIVAGLL--MASCIITANWLESDLAVILVMSFAF 354
+ ++ + LG + ++ G++ I+ A +A +++ A
Sbjct: 258 LHSLAQAMITGPVAA-----RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312

Query: 355 FGQGMVGLGWTLISDIAPKGLGGLTGGLFNFCANLAGILTPLVIGFIVAGFGNFFYALIY 414
G GM L ++S + G G +L I+ PL+ I A + +
Sbjct: 313 GGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAW 371

Query: 415 IGGAALLGVV 424
I GAAL +
Sbjct: 372 IAGAALYLLC 381


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11120  HTHTETR       66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 65.8 bits (160), Expect = 2e-15
Identities = 29/162 (17%), Positives = 61/162 (37%), Gaps = 4/162 (2%)

Query: 6 HDEAQSLKARIFSAAIAVFAEHGLSGARMEQIATEAQTTKRMVVYYFKSKEQLYQEVLQH 65
EAQ + I A+ +F++ G+S + +IA A T+ + ++FK K L+ E+ +
Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65

Query: 66 VYARIRETEQQLGLENVPPVEALVR---LVRWSVRYHATHADYMRVICMENMQR-GKWLK 121
+ I E E + + +++R + + I + G+
Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125

Query: 122 SSGELKPLNRTALSILEDILLRGQQQGVFQAGLDARDVHRLI 163
+ L + +E L + + A L R ++
Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


40  D364_RS11180  D364_RS11220  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS11180  1       17       4.515787    Fe-S cluster assembly scaffold SufA
D364_RS11185  2       17       4.520656    hypothetical protein
D364_RS27465  1       15       5.027846    FMN-dependent L-lactate dehydrogenase LldD
D364_RS11195  0       16       5.335771    efflux RND transporter permease subunit
D364_RS11200  0       15       4.555362    efflux RND transporter periplasmic adaptor
D364_RS11205  0       15       5.096837    TetR/AcrR family transcriptional regulator
D364_RS11210  0       14       3.274113    NAD(P)H-binding protein
D364_RS11215  0       14       4.118795    LysR family transcriptional regulator
D364_RS11220  0       14       3.028651    ABC transporter substrate-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11200  ACRIFLAVINRP  440    e-140    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 440 bits (1132), Expect = e-140
Identities = 231/1055 (21%), Positives = 423/1055 (40%), Gaps = 71/1055 (6%)

Query: 8 LSALAVRERSVTLFLIILISVAGLVAFFGLGRAEDPPFTVKQMTVITVWPGATAQEMQDQ 67
++ +R L I++ +AG +A L A+ P ++V +PGA AQ +QD
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 68 VAEPLEKRLQELKWYDRTETYT-RPGMALITLSLQDQTPP----SEVPEQFYQARKKLGD 122
V + +E+ + + + + G ITL+ Q T P +V + A L
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLL-- 118

Query: 123 EAKNLPAGVSGPMMNDEFADVTFALFAL--KARGEQPRQLVRD--AEALRQQLLHVSGVK 178
P V ++ E + ++ + A + + D A ++ L ++GV
Sbjct: 119 -----PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVG 173

Query: 179 KVNILGEQ-AERIYLSFSHDRLATLGLSPEAIFAALNSQNVLTAAGAI---ETRGGQIF- 233
V + G Q A RI+L D L L+P + L QN AAG + GQ
Sbjct: 174 DVQLFGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLN 231

Query: 234 --IRLDGAFDRLQQIRDTPIIAG--GRTLKLADVATVERGYEDPATFLIRHQGEPALLLG 289
I F ++ + G ++L DVA VE G E+ R G+PA LG
Sbjct: 232 ASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA-RINGKPAAGLG 290

Query: 290 VVMREGWNGLALGKALDAETASINQSLPLGMSLTKVTDQSVNISAAVDEFMIKFFVA-LL 348
+ + G N L KA+ A+ A + P GM + D + + ++ E + F A +L
Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML 350

Query: 349 VVMTVCFVSMGWRVGVVVAAAVPLTLAVVFVVMEATGKNFDRITLGSLILALGLLVDDAI 408
V + + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410

Query: 409 IAIEMMV-VKMEEGYDRLKASAYAWSHTAAPMLAGTLVTAVGFMPNGFAQSTAGEYASNV 467
+ +E + V ME+ +A+ + S ++ +V + F+P F + G
Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470

Query: 468 FWIVGIALIASWIVAVIFTPWLGVHLLPDRKPAAAGHAALYDT----------PRYQRFR 517
+ A+ S +VA+I TP L LL KP +A H +
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLL---KPVSAEHHENKGGFFGWFNTTFDHSVNHYT 527

Query: 518 RLLTRVIAHKWRVAAGVVALLIVAILGMSVVKKQFFPTSDRPEVLVEVQLPYGSSISQTS 577
+ +++ R ++ ++ + F P D+ L +QLP G++ +T
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 578 AAAAKIEHWLQRQPEAKIVTSYIGQGAPRFYLAMAPELPDP--SFAKLVVLTDGQGARE- 634
++ + + +A + + + G + + + + +F L + G
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENS 642

Query: 635 --ALKRRLREAV-----VNGLAPEARVRVTQLVFGPYSPYPVAWRVMGPDPHALLDIAER 687
A+ R + + + V + + +G D AL +
Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHD--ALTQARNQ 700

Query: 688 VKSVLQASPL-MRTVNTDWGSRVPVMHFSLNQDRLQASGLSSQSVAQQLQFLLSGIPITT 746
+ + P + +V + ++Q++ QA G+S + Q + L G +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 747 VREDIRAVQVIGRAAGDIRLDPAKIADFTLVGSGGQRVPLSQIGDVSIRMEDPLLRRRDR 806
+ R ++ +A R+ P + + + G+ VP S P L R +
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 807 TPTITVRGDVAENLQPPDVSIALMKPLQPIIDSLPPGYRIETAGSIEESGKATRAMVPLF 866
P++ ++G+ A P S M ++ + LP G + G + + L
Sbjct: 821 LPSMEIQGEAA----PGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALV 876

Query: 867 PIMIALTLLIIILQVRSLSAMVMVFLTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGI 926
I + L + S S V V L PLG++GV+ LFNQ + +VGL+ G+
Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936

Query: 927 LMRNTLILIGQIHHNQQA-GLDPFHAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT 985
+N ++++ + G A + A R RP+L+T+LA IL +PL S G+
Sbjct: 937 SAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996

Query: 986 -----LAYTLIGGTLGGTIMTLIFLPAMYAIWFRI 1015
+ ++GG + T++ + F+P + + R
Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 76.8 bits (189), Expect = 3e-16
Identities = 57/323 (17%), Positives = 122/323 (37%), Gaps = 20/323 (6%)

Query: 712 MHFSLNQDRLQASGLSSQSVAQQLQF----LLSGIPITTVREDIRAVQVIGRAAGDIRLD 767
M L+ D L L+ V QL+ + +G T + + A + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242

Query: 768 PAKIADFTLVGSG-GQRVPLSQIGDVSIRMED-PLLRRRDRTPTITVRGDVAENLQPPDV 825
P + TL + G V L + V + E+ ++ R + P + +A D
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 826 SIALMKPLQPIIDSLPPGYRIE----TAGSIEESGKATRAMVPLFPIMIALTLLIIILQV 881
+ A+ L + P G ++ T ++ S +V I L L++ L +
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLS---IHEVVKTLFEAIMLVFLVMYLFL 359

Query: 882 RSLSAMVMVFLTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGILMRNTLILIGQIH-H 940
+++ A ++ + P+ L+G L F + G++ G+L+ + ++++ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 941 NQQAGLDPFHAVVEATVQRARPVLLTALAAILAFIPL-----THSVFWGTLAYTLIGGTL 995
+ L P A ++ Q ++ A+ FIP+ + + + T++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 996 GGTIMTLIFLPAMYAIWFRIRPE 1018
++ LI PA+ A +
Sbjct: 480 LSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11205  RTXTOXIND     42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 9/92 (9%)

Query: 70 GKVLERRVETGQSVKRGQLLLRLDPADLALQAQSQQRAVDAARVRAKKAANDLARYRGLV 129
V E V+ G+SV++G +LL+L Q ++ AR L + R +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR---------LEQTRYQI 155

Query: 130 ASGAISAAEFDQINAAAEAARADLRAAQAQAN 161
S +I + ++ E ++ +
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187



Score = 31.3 bits (71), Expect = 0.005
Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 4/84 (4%)

Query: 178 GVVVETLAEPGQVVSAGQVVIRLARAGQREARVQLPETLRPAVGSEALATRYGSESQPV- 236
+V E + + G+ V G V+++L G ++ +L A TRY S+ +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA---RLEQTRYQILSRSIE 161

Query: 237 TATLRLLSDAADATTRTFEARYVL 260
L L + + VL
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVL 185



Score = 28.6 bits (64), Expect = 0.044
Identities = 12/128 (9%), Positives = 37/128 (28%), Gaps = 15/128 (11%)

Query: 103 SQQRAVDAARVRAKKAANDLARYRGLVAS--GAISAAEFDQINAAAEA----------AR 150
++ ++ + A N+L Y+ + I +A+ +
Sbjct: 250 AKHAVLEQENKYVE-AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTT 308

Query: 151 ADLRAAQAQANVAQNATGYAGLLADADGVVVE-TLAEPGQVVSAGQVVIRLARAGQR-EA 208
++ + + + + A V + + G VV+ + ++ + E
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 209 RVQLPETL 216
+
Sbjct: 369 TALVQNKD 376


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11210  HTHTETR       59     3e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 3e-13
Identities = 31/183 (16%), Positives = 52/183 (28%), Gaps = 10/183 (5%)

Query: 4 FSRYGYEKTTVTDLAKAIGFSKAYIYKFFDSKQAIGEAICASRLEKIMVAVSEAIADAPS 63
FS+ G T++ ++AKA G ++ IY F K + I I E A P
Sbjct: 24 FSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 83

Query: 64 ASEK-----LRRLFR-ALTEAGSELFFE--DRKLYDIAAVAARDKWPSTEQYAGHLQQLI 115
L + +TE L E K + +A + I
Sbjct: 84 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA--QRNLCLESYDRI 141

Query: 116 GQILVEGRQAGEFERKTPLDEATLAVYMVMCPFINPVQLQYNLDTAPTAAVLLASLILRS 175
Q L +A A + + + + A +++L
Sbjct: 142 EQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEM 201

Query: 176 LSP 178

Sbjct: 202 YLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11215  NUCEPIMERASE  37     6e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.1 bits (86), Expect = 6e-05
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 28/167 (16%)

Query: 6 KVLILGASGGIGGEVARRLVADNWQVRA-----------LKRGAQIRDPEDGIQWIAGDA 54
K L+ GA+G IG V++RL+ QV LK+ + G Q+ D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 LDGGQVAA--AAAGCDVIVH-----AV-----NPPGYRHWRQQVLPMLRNTLQAAERQR- 101
D + A+ + + AV NP Y + L N L+ +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD--SNLTGFL-NILEGCRHNKI 118

Query: 102 ALVVLPGTVYNYGPDA-FPLIAEEAAQQPVTRKGAIRVAMELTLKDY 147
++ + YG + P +++ PV+ A + A EL Y
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165


41  D364_RS11355  D364_RS11525  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS11355  3       22       -1.860159   RidA family protein
D364_RS11360  4       25       -1.980689   integration host factor subunit alpha
D364_RS11365  3       25       -2.603270   phenylalanine--tRNA ligase subunit beta
D364_RS11370  3       31       -6.399721   phenylalanine--tRNA ligase subunit alpha
D364_RS27475  2       31       -7.636222   pheST operon leader peptide PheM
D364_RS11375  1       26       -6.502462   50S ribosomal protein L20
D364_RS11380  0       19       -4.116156   50S ribosomal protein L35
D364_RS11385  0       17       -3.658944   translation initiation factor IF-3
D364_RS11390  -1      15       -2.661895   threonine--tRNA ligase
D364_RS11395  0       17       -0.179179   hypothetical protein
D364_RS11400  0       16       -0.203090   YdiY family protein
D364_RS11405  0       18       -0.105829   6-phosphofructokinase II
D364_RS11410  1       26       -4.447283   type V toxin-antitoxin system endoribonuclease
D364_RS11415  1       27       -4.709119   fructosamine kinase family protein
D364_RS11420  -1      19       -3.744670   TonB system transport protein TonB
D364_RS11425  -1      16       -4.833594   YciI family protein
D364_RS27480  0       16       -4.948338   hypothetical protein
D364_RS11435  0       12       -4.015536   YceI family protein
D364_RS27005  0       10       -1.829991   YciY family protein
D364_RS11445  0       11       -1.485440   cardiolipin synthase
D364_RS11450  0       13       -1.971902   HI1450 family dsDNA-mimic protein
D364_RS11455  -1      11       -2.791227   ion transporter
D364_RS11460  -1      11       -2.700435   murein tripeptide/oligopeptide ABC transporter
D364_RS11465  -1      12       -3.547319   ABC transporter ATP-binding protein
D364_RS11470  1       22       -4.126388   oligopeptide ABC transporter permease OppC
D364_RS11475  1       22       -4.868919   oligopeptide ABC transporter permease OppB
D364_RS11480  1       25       -5.280030   oligopeptide ABC transporter substrate-binding
D364_RS11490  1       30       -5.601761   YchE family NAAT transporter
D364_RS11495  2       32       -6.039800   hypothetical protein
D364_RS11500  1       28       -5.694699   bifunctional acetaldehyde-CoA/alcohol
D364_RS11505  -1      32       -6.075825   thymidine kinase
D364_RS27485  0       32       -6.079577   hypothetical protein
D364_RS11515  -1      29       -5.813687   DNA-binding transcriptional regulator H-NS
D364_RS11520  -2      25       -4.742308   UTP--glucose-1-phosphate uridylyltransferase
D364_RS11525  -1      22       -3.223790   two-component system response regulator RssB
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11360  DNABINDINGHU  119  5e-39  Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 119 bits (300), Expect = 5e-39
Identities = 34/89 (38%), Positives = 55/89 (61%)

Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63
K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92
NP+TGE+I I A +V F+ G+ LK V+
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11420  TONBPROTEIN  229  1e-77  Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 229 bits (585), Expect = 1e-77
Identities = 169/241 (70%), Positives = 192/241 (79%), Gaps = 7/241 (2%)

Query: 18 MTLDLPRRFPWPTLLSVAIHGAVVAGLLYTSVHQVIEQPSPTQPIEITMVAPADLEPPPA 77
MTLDLPRRFPWPTLLSV IHGAVVAGLLYTSVHQVIE P+P QPI +TMV PADLEPP A
Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60

Query: 78 AQPVVEPVVVPEPEPEPEVVPEPPKEAPVVIHKPEPKPKPKPKPKPKPEKKVEQPKREVK 137
QP EPVV PEPEPEP P APVVI KP+PKPKPKPKP K + EQPKR+VK
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKE--APVVIEKPKPKPKPKPKPVKKVQ---EQPKRDVK 115

Query: 138 PAAEPRPASPFENNNTAPARTAPSTSTAAAKPTVTAPSGPRAISRVQPSYPARAQALRIE 197
P E RPASPFEN A T+ + + A +KP + SGPRA+SR QP YPARAQALRIE
Sbjct: 116 P-VESRPASPFENTAPA-RLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIE 173

Query: 198 GTVRVKFDVSPDGRIDNLQILSAQPANMFEREVKSAMRRWRYEQGRPGTGVTMTIKFRLN 257
G V+VKFDV+PDGR+DN+QILSA+PANMFEREVK+AMRRWRYE G+PG+G+ + I F++N
Sbjct: 174 GQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKIN 233

Query: 258 G 258
G
Sbjct: 234 G 234


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11460  HTHFIS  31  0.008  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.008
Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 55 VVGESGCGKSTFARAI 70
+ GESG GK ARA+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11480  FLGHOOKAP1  30  0.040  Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 29.5 bits (66), Expect = 0.040
Identities = 27/112 (24%), Positives = 44/112 (39%), Gaps = 8/112 (7%)

Query: 392 YNTSDLHKKLAIAAASLWRK------NLGIDVKLVNQEWKTFLDTRHQGTYDVARAGWCA 445
YN + ++ I A + G+ V V +E+ F+ + + +G A
Sbjct: 28 YNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREYDAFITNQLRAA-QTQSSGLTA 86

Query: 446 DYNEPTSFLNTMLSDSSMNTAHYKSPAFDKIMAESVKASDEAQRTAAYAKAE 497
Y E S ++ MLS S+ + A F + A D A R A K+E
Sbjct: 87 RY-EQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNAEDPAARQALIGKSE 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11525  HTHFIS  87  5e-21  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 5e-21
Identities = 37/152 (24%), Positives = 61/152 (40%), Gaps = 3/152 (1%)

Query: 10 ILIVEDEPVFRSLLHGWLTSLGATTFQAEDGKDALHKMTEVHPDLMICDISMPRMNGLEL 69
IL+ +D+ R++L+ L+ G + + DL++ D+ MP N +L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 70 VETLRNRGEQLPILMISATENMADIAKALRLGVQDVLLKPVKDFDRLRETVYACLYPAMF 129
+ ++ LP+L++SA KA G D L KP D L + L A
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122

Query: 130 SSRVEEEERLFEDWDALVSNPIAASRLLQELQ 161
R + E +D LV A + + L
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


42  D364_RS27490  D364_RS11635  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS27490  -2  12  3.455167  hypothetical protein
D364_RS11585  -1  13  4.776052  NarK family nitrate/nitrite MFS transporter
D364_RS11590  0  14  5.242307  nitrate/nitrite two-component system sensor
D364_RS11595  0  16  5.836702  two-component system response regulator NarL
D364_RS11600  0  17  6.330240  YchO/YchP family invasin
D364_RS11605  1  17  6.635784  nitrate reductase
D364_RS11610  1  16  5.788674  nitrite reductase large subunit NirB
D364_RS11615  4  17  4.644827  ABC transporter ATP-binding protein
D364_RS11620  4  17  4.873805  nitrate ABC transporter permease
D364_RS11625  3  17  4.000721  ABC transporter substrate-binding protein
D364_RS11635  4  15  1.917227  nitrate regulatory protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11585  TCRTETB  31  0.015  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.015
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 128 TPFSIFVIISLLCGFAGANF-ASSMANISFFFPKAKQGGALGVNGGLGNMGVSVMQLV 184
+ FS+ ++ + G A F A M ++ + PK +G A G+ G + MG V +
Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11590  PF06580  48  5e-08  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 48.3 bits (115), Expect = 5e-08
Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 9/116 (7%)

Query: 476 FGFTVQLDYQLPPRFVPSHQAIHLLQIAREALSNALKHASAT-----EVTVTVSQRDNQV 530
F +Q + Q+ P + L+Q E N +KH A ++ + ++ + V
Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 531 RLVVADNGRGVPDHAERSNHYGLIIMRDRAQSLRG-DCQVRRRETGGTEVIVTFIP 585
L V + G + + S GL +R+R Q L G + Q++ E G + IP
Sbjct: 293 TLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11595  HTHFIS  74  8e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 8e-18
Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 2/118 (1%)

Query: 6 RATILLIDDHPMLRTGVKQLISMAPDIQVIGEASNGAQGIELAESLDPDLILLDLNMPGM 65
ATIL+ DD +RT + Q +S A V SN A + D DL++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 66 NGLETLDKLREKSLSGRVVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123
N + L ++++ V+V S N + A ++GA YL K + +L+ + +A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11600  INTIMIN  230  3e-69  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 230 bits (588), Expect = 3e-69
Identities = 123/448 (27%), Positives = 209/448 (46%), Gaps = 22/448 (4%)

Query: 1 MPVSFRLLPTLTFLLLLPGVPVWALTASDTTRPAQAQDPLPDMGIAPQVDDDARHFAEVA 60
+P + LP LL P+ A + PD+ + DD A ++A
Sbjct: 117 LPFEYSALP------LLGSAPLVAAGGVAGHTNKLTKMS-PDVTKSNMTDDKALNYAAQQ 169

Query: 61 KKFGEASMSDNDLTAGEQAQLFAISKIGNEVSHQLESWLSPWGNANVDLLVDKEGKFTGS 120
+ + L G+ A+ A+ GN+ S QL++WL +G A V+L F GS
Sbjct: 170 AASLGSQLQSRSLN-GDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGN--NFDGS 226

Query: 121 KGSWFVPLQDNDRYLTWNQYSVTRREHDLVGNIGLGQRWRVGGWLLGYNSFYDKVLSESL 180
+ +P D+++ L + Q + N+G GQR+ + +LGYN F D+ S
Sbjct: 227 SLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDN 286

Query: 181 ARGSVGAEAWGEYLRLSANYYHPLGDW-QLRDNQTQEQRMAAGYDVTAQARLPFYQHINT 239
R +G E W +Y + S N Y + W + + + ++R A G+D+ LP Y +
Sbjct: 287 TRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGA 346

Query: 240 SVSVEQYFGDSVDLFHSGTGYHNPVAVSVGLNYTPVPLVTVTAKHKQGENGVSQNNVGLK 299
+ EQY+GD+V LF+S NP A +VG+NYTP+PLVT+ ++ G + ++
Sbjct: 347 KLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQ 406

Query: 300 LNYRFGVPLKQQLAADEVAISNSLRGSRFDSPERDNLPVVEYRQRKNLTVYLATP-PWDL 358
Y+F P QQ+ V +L GSR+D +R+N ++EY +K + L P +
Sbjct: 407 FRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEY--KKQDILSLNIPHDING 464

Query: 359 QSGETVQLKLQIHSLHGIKALHWQGDTQALSLTPPVDASSPDG---WSIIMPVWNSEPGA 415
T +++L + S +G+ + W D+ S + S + I+P + G
Sbjct: 465 TERSTQKIQLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAYV--QGG 521

Query: 416 ANRWRLSVVVEDKQGQRVSSNEIALALT 443
+N ++++ D+ G SSN + L +T
Sbjct: 522 SNVYKVTARAYDRNGN--SSNNVLLTIT 547


43  D364_RS11770  D364_RS11900  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS11770  2  26  -3.197726  type VI secretion protein VasK
D364_RS11775  6  54  -12.383041  hypothetical protein
D364_RS11780  10  63  -15.160116  PAAR domain-containing protein
D364_RS11785  4  42  -9.611602  hypothetical protein
D364_RS27495  4  42  -9.214528  SUMF1/EgtB/PvdO family nonheme iron enzyme
D364_RS27500  2  31  -4.158574  SUMF1/EgtB/PvdO family nonheme iron enzyme
D364_RS11795  1  29  -2.672476  SUMF1/EgtB/PvdO family nonheme iron enzyme
D364_RS11805  0  21  0.533031  type VI secretion system tip protein VgrG
D364_RS11810  -1  16  2.570691  OmpA family protein
D364_RS27505  1  14  4.110726  type VI secretion system protein TssL, short
D364_RS11815  1  12  4.620061  type VI secretion system baseplate subunit TssK
D364_RS11820  -1  12  1.871034  gamma-glutamylcyclotransferase
D364_RS11825  -1  14  1.514876  DUF2058 domain-containing protein
D364_RS11835  -2  17  1.444605  RluA family pseudouridine synthase
D364_RS11840  -3  15  1.315239  cold shock domain-containing protein
D364_RS11845  -2  17  2.826334  LysR family transcriptional regulator
D364_RS11850  -2  17  2.312247  substrate-binding domain-containing protein
D364_RS11855  -1  14  3.176176  MFS transporter
D364_RS11860  -2  12  1.832253  LysR family transcriptional regulator
D364_RS11865  -1  14  1.523145  GNAT family N-acetyltransferase
D364_RS11870  1  14  1.621086  phosphoenolpyruvate carboxykinase (ATP)
D364_RS11875  -1  14  1.441819  LysR family transcriptional regulator
D364_RS11880  0  16  1.695469  ABC transporter ATP-binding protein
D364_RS11885  1  21  1.938756  ATP-binding cassette domain-containing protein
D364_RS11890  2  22  3.194497  high-affinity branched-chain amino acid ABC
D364_RS11895  1  22  3.017210  branched-chain amino acid ABC transporter
D364_RS11900  0  22  3.698573  branched-chain amino acid ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11815  OMPADOMAIN  104  1e-26  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 104 bits (260), Expect = 1e-26
Identities = 51/152 (33%), Positives = 70/152 (46%), Gaps = 13/152 (8%)

Query: 407 DWAPPPPPRPVIKQVVQGPQTIRLDSMALFDTGKSTLKPGSTKLL--VNSLLGIKAKPGW 464
+ AP P P VQ + L S LF+ K+TLKP L + S L
Sbjct: 195 EAAPVVAPAPAPAPEVQT-KHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG 253

Query: 465 LIVVAGHTDSIGNDRSNQQLSLKRAEAVRDWMRDTGDVPESCFAVQGYGASRPVASN--- 521
+VV G+TD IG+D NQ LS +RA++V D++ G +P + +G G S PV N
Sbjct: 254 SVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKG-IPADKISARGMGESNPVTGNTCD 312

Query: 522 ------ETPEGRAQNRRVEISLVPQKDACLAP 547
+ A +RRVEI + KD P
Sbjct: 313 NVKQRAALIDCLAPDRRVEIEVKGIKDVVTQP 344


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11840  TRNSINTIMINR  29  0.013  Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 28.9 bits (64), Expect = 0.013
Identities = 14/56 (25%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 11 LKAGLVSSKKMAKVQRTAKKSRVQAREAREAVEENKKAQLERDKQLSEQQKQAVLA 66
+ +G + + ++ + AK++ AR+ +AVE N +AQ + Q + +Q++ L+
Sbjct: 308 IPSGELKDDIVEQIAQQAKEAGEVARQ--QAVESNAQAQQRYEDQHARRQEELQLS 361


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11850  PF05272  29  0.011  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.011
Identities = 14/76 (18%), Positives = 32/76 (42%), Gaps = 15/76 (19%)

Query: 18 FIKDENGENRYFHVIKVANPDLIKKDAAVTFEPTTNNKGLSAYAVKVIPESKYIYIAGER 77
++ D G R++ V+ +L+ L + ++ E+ ++Y+AGER
Sbjct: 698 YLFDITGNRRFWPVLVPGRANLV---------------WLQKFRGQLFAEALHLYLAGER 742

Query: 78 LKLTSIKSYVVYREEE 93
+ + +R E+
Sbjct: 743 YFPSPEDEEIYFRPEQ 758


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11865  TCRTETA  40  1e-05  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.2 bits (94), Expect = 1e-05
Identities = 74/369 (20%), Positives = 138/369 (37%), Gaps = 62/369 (16%)

Query: 67 LMRPIGAIVLGAYIDKVGRRKGLIVTLSIMATGTFLIVLIPSYQTIGLWAPLLVLIGRLL 126
LM+ A VLGA D+ GRR L+V+L+ A ++ P LW ++ IGR++
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIV 105

Query: 127 QGFSAGAELGGVSVYLAEIATSGRKGFYTSWQSGSQQVAIMVAAAMGFALNAVLEPSAIS 186
G + GA Y+A+I + + + S ++ +G +
Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF------- 157

Query: 187 DWGWRIPFLFGCLIVPFIFIL------------RR--KLEETQEFTARRHHLAMRQVFAT 232
PF + F+ RR + E + R M V A
Sbjct: 158 --SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAAL 215

Query: 233 LLANWQVVIAGMMMVAMTTTAFYLITVYAPTFGKKVLMLSASD-SLLVTLLVAISNFFWL 291
+ V M +V A ++I FG+ A+ + + + +
Sbjct: 216 M-----AVFFIMQLVGQVPAALWVI------FGEDRFHWDATTIGISLAAFGILHSLAQA 264

Query: 292 PVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFLMMLSVLLWLSFIYGMYNGA 351
+ G ++ R G R + + ++A T +LA A M +++ L+ G
Sbjct: 265 MITGPVAARLGERR-ALMLGMIADGT---GYILLAFATRGWMAFPIMVLLAS-----GGI 315

Query: 352 MIPALTEIMPAEV------RVAGFSLAYSLATAVFGGFTPVISTALIEYTGDKASPGYWM 405
+PAL ++ +V ++ G A + T++ G P++ TA+ + + W+
Sbjct: 316 GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG---PLLFTAIYAASITTWNGWAWI 372

Query: 406 SFAAICGLL 414
+ AA+ L
Sbjct: 373 AGAALYLLC 381



Score = 36.3 bits (84), Expect = 2e-04
Identities = 39/157 (24%), Positives = 62/157 (39%), Gaps = 20/157 (12%)

Query: 273 ASDSLLVTLLVAISNFFWLPVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFL 332
+ ++ L A+ F PV GALSDRFGRR VL L++LA A ++A AP
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVL----LVSLAGAAVDYAIMATAPFLW 97

Query: 333 MMLSVLLWLSFIYGMYNGAMIPALT----EIMPAEVRVAGFSLAYSLATAVFGGFT--PV 386
+ L++ I GA +I + R F ++ G PV
Sbjct: 98 V-----LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF---MSACFGFGMVAGPV 149

Query: 387 ISTALIEYTGDKASPGYWMSFAAICGLLATCYLYRRS 423
+ + ++ +P + + L C+L S
Sbjct: 150 LGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPES 184


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11875  SACTRNSFRASE  44  3e-08  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.2 bits (104), Expect = 3e-08
Identities = 21/64 (32%), Positives = 26/64 (40%), Gaps = 2/64 (3%)

Query: 73 STWLGRNGIYMEDLYVTPDYRGIGAGKALLKTIAQYAVQRQCGRLEWSVLDWNQPAIDFY 132
S W G +ED+ V DYR G G ALL ++A + L D N A FY
Sbjct: 84 SNWNGY--ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 133 LSIG 136

Sbjct: 142 AKHH 145


44  D364_RS27515  D364_RS12100  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS27515  4  18  3.345520  PhnD/SsuA/transferrin family substrate-binding
D364_RS11950  4  18  3.650045  fatty acid desaturase
D364_RS11960  2  17  2.321268  proteasome-type protease
D364_RS11965  -1  16  2.269693  transglutaminase family protein
D364_RS11975  0  17  2.742215  alpha-E domain-containing protein
D364_RS11980  0  16  3.442353  circularly permuted type 2 ATP-grasp protein
D364_RS26125  -1  17  3.946868  hypothetical protein
D364_RS11985  -1  18  4.350345  alpha,alpha-trehalase
D364_RS11990  0  19  4.625797  TonB-dependent siderophore receptor
D364_RS11995  -1  20  4.441287  membrane-bound lytic murein transglycosylase
D364_RS12000  -1  18  2.984374  muramoyltetrapeptide carboxypeptidase
D364_RS12005  -1  16  1.885212  potassium/proton antiporter
D364_RS12010  -1  15  1.411917  catabolic alanine racemase DadX
D364_RS12015  -1  13  0.239632  D-amino acid dehydrogenase
D364_RS27520  0  13  -1.243904  hypothetical protein
D364_RS12025  1  14  -0.926291  SpoVR family protein
D364_RS12030  1  14  0.093367  fatty acid metabolism transcriptional regulator
D364_RS12035  2  15  -0.377740  sodium/proton antiporter NhaB
D364_RS12040  1  19  -3.080402  disulfide bond formation protein DsbB
D364_RS12045  0  20  -3.701909  DUF1971 domain-containing protein
D364_RS12050  1  19  -3.943615  YcgN family cysteine cluster protein
D364_RS12055  1  14  -2.071103  fumarylacetoacetate hydrolase family protein
D364_RS12060  -1  10  -2.157449  YcgL domain-containing protein
D364_RS12070  -2  9  -1.053603  septum site-determining protein MinC
D364_RS12075  -2  11  0.335761  septum site-determining protein MinD
D364_RS12080  -1  14  2.005060  cell division topological specificity factor
D364_RS12085  0  13  2.121605  ribonuclease D
D364_RS12090  -1  14  1.672451  long-chain-fatty-acid--CoA ligase FadD
D364_RS12095  -1  13  3.162040  Slp family lipoprotein
D364_RS12100  -1  14  3.573224  tRNA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS11995  VACJLIPOPROT  27  0.043  VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 27.2 bits (60), Expect = 0.043
Identities = 9/37 (24%), Positives = 14/37 (37%)

Query: 1 MKLRWLLILVVFLAGCSSKHDYTNPPWNPEVPVKRAM 37
++L L + L GC+S +P R M
Sbjct: 3 LRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTM 39


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12010  ALARACEMASE  539  0.0  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 539 bits (1391), Expect = 0.0
Identities = 286/356 (80%), Positives = 319/356 (89%)

Query: 1 MTRPVVASIDLLALRQNLQIVRRAAPGSRLWAVVKANAYGHGVARVWSALSAADGFALLN 60
MTRP+ AS+DL AL+QNL IVR+AA +R+W+VVKANAYGHG+ R+WSA+ A DGFALLN
Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60

Query: 61 LEEAILLREQGWKGPILLLEGFFHADELAVLDQYRLTTSVHSNWQIKALQQAKLRAPLDI 120
LEEAI LRE+GWKGPIL+LEGFFHA +L + DQ+RLTT VHSNWQ+KALQ A+L+APLDI
Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120

Query: 121 YLKVNSGMNRLGFMPERVHTVWQQLRAISNVGEMTLMSHFAEAENPQGIVEPMRRIEQAA 180
YLKVNSGMNRLGF P+RV TVWQQLRA++NVGEMTLMSHFAEAE+P GI M RIEQAA
Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180

Query: 181 EGLDCPRSLANSAATLWHPEAHFDWVRPGIVLYGASPSGQWQDIANTGLKPVMTLRSEII 240
EGL+C RSL+NSAATLWHPEAHFDWVRPGI+LYGASPSGQW+DIANTGL+PVMTL SEII
Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240

Query: 241 GVQNLRPGEAIGYGGLYRTTQEQRIGIVACGYADGYPRVAPSGTPVLVDGVRTTTVGRVS 300
GVQ L+ GE +GYGG Y EQRIGIVA GYADGYPR AP+GTPVLVDGVRT TVG VS
Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300

Query: 301 MDMLAVDLTPCPQAGIGAPVELWGKEIKIDDVAASSGTVGYELMCALAPRVPVVTL 356
MDMLAVDLTPCPQAGIG PVELWGKEIKIDDVAA++GTVGYELMCALA RVPVVT+
Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356


45  D364_RS12480  D364_RS12515  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS12480  -1  12  -3.463198  L-arabinose ABC transporter ATP-binding protein
D364_RS12485  0  15  -4.922164  arabinose ABC transporter substrate-binding
D364_RS27525  0  14  -3.213094  non-heme ferritin-like protein
D364_RS12490  0  11  -2.963648  succinate dehydrogenase
D364_RS12495  0  12  -3.724637  MFS transporter
D364_RS12500  -1  13  -3.586048  DUF2766 domain-containing protein
D364_RS12505  1  17  -3.786492  MFS transporter
D364_RS12510  0  14  -2.146842  RpiB/LacA/LacB family sugar-phosphate isomerase
D364_RS12515  -1  16  -3.318513  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12480  PF05272  34  0.001  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.3 bits (78), Expect = 0.001
Identities = 18/50 (36%), Positives = 20/50 (40%), Gaps = 10/50 (20%)

Query: 18 PGVKALSDISFDCYPGQIHALMGENGAGKSTLLKILSGNYIPTAGHLQIG 67
PG K FD L G G GKSTL+ L G + H IG
Sbjct: 591 PGCK------FDYSV----VLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12510  TCRTETB  104  2e-26  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 104 bits (262), Expect = 2e-26
Identities = 81/395 (20%), Positives = 158/395 (40%), Gaps = 20/395 (5%)

Query: 25 FMEFLDGTVIATALPDMARDFGVTAVELNIGISAYLITLAVLIPASGWIADRFGARAIFT 84
F L+ V+ +LPD+A DF N +A+++T ++ G ++D+ G + +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 85 LALAIFTLASVFCGLS-TEVHIFVAMRILQGVGGALMVPVGRLAVLRTTPKHQLIKAIAT 143
+ I SV + + + + R +QG G A + + V R PK KA
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 144 LTWPALVAPIIGPPLGGFITRYASWHWIFFINVPLGLAAIILSLRIIPDIRETERRSFDL 203
+ + +GP +GG I Y HW + + +P+ + L + + FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 204 SGFITTSVAMVSLVTAMERLGDRQPQIWPTLALAALGFGCLLYSIRHFRRAAAPMVRLDA 263
G I SV +V + L ++ L F ++H R+ P V
Sbjct: 202 KGIILMSVGIVFFMLFTTS------YSISFLIVSVLSFLIF---VKHIRKVTDPFVDPGL 252

Query: 264 LQVPTFRVTMYGGSLFRASISAVPFLLPLLFQVGFGMDPFHSGLLVLAVFVGNLTI---K 320
+ F + + G + +++ ++P + + + G +++ F G +++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII--FPGTMSVIIFG 310

Query: 321 PATTPLIRWLGFRRLLLINGALNVCSLLACALLTPQTPVW-AIMLILYLGGVFRSIQFTG 379
L+ G +L I S L + L T + I+++ LGG+ S T
Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKTV 368

Query: 380 VSTLAFADVPAAQMSDANTLFSTASQLAVGLGITL 414
+ST+ + + + +L + S L+ G GI +
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


46  D364_RS12760  D364_RS12890  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS12760  2  26  -4.482767  *EmmdR/YeeO family multidrug/toxin efflux MATE
D364_RS12775  2  61  -17.761492  *type II toxin-antitoxin system PemK/MazF family
D364_RS12780  2  47  -14.222858  antitoxin
D364_RS12785  1  40  -11.874695  helix-turn-helix domain-containing protein
D364_RS12790  0  34  -10.620155  hypothetical protein
D364_RS26170  0  32  -8.790011  hypothetical protein
D364_RS26175  -1  25  -5.622331  sensor domain-containing diguanylate cyclase
D364_RS12805  -1  13  2.133472  alpha/beta hydrolase
D364_RS12810  0  14  2.257423  hypothetical protein
D364_RS12815  -1  17  1.740727  LysR family transcriptional regulator
D364_RS12820  1  16  -1.411133  DUF1869 domain-containing protein
D364_RS12825  1  19  -4.484030  DUF1971 domain-containing protein
D364_RS12830  0  16  -4.462744  hypothetical protein
D364_RS12835  0  15  -2.198248  EAL domain-containing protein
D364_RS27530  -1  13  -0.627489  molybdopterin-dependent oxidoreductase
D364_RS12845  -1  12  -0.192819  shikimate transporter
D364_RS12850  -1  14  3.113701  lipoate--protein ligase family protein
D364_RS12860  -1  15  4.555534  LysR family transcriptional regulator
D364_RS12865  2  16  5.849971  tautomerase family protein
D364_RS12870  3  16  5.449811  HPP family protein
D364_RS12875  3  15  4.001974  adenosylhomocysteinase
D364_RS12880  4  16  4.056572  nicotinate-nucleotide--dimethylbenzimidazole
D364_RS12885  3  17  3.600353  adenosylcobinamide-GDP ribazoletransferase
D364_RS12890  2  14  2.840420  DUF496 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12810  ENTEROTOXINA  29  0.025  Heat-labile enterotoxin A chain signature.
		>ENTEROTOXINA#Heat-labile enterotoxin A chain signature.

Length = 258

Score = 28.8 bits (64), Expect = 0.025
Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 5/36 (13%)

Query: 8 PDEQRRLQSLRSSGLLNSGKEERFDRLTRLARSLYN 43
PDE +R S GL+ G E FDR T++ +LY+
Sbjct: 31 PDEIKR-----SGGLMPRGHNEYFDRGTQMNINLYD 61


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12860  TCRTETB  30  0.021  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 29.8 bits (67), Expect = 0.021
Identities = 33/153 (21%), Positives = 57/153 (37%), Gaps = 35/153 (22%)

Query: 78 LGGIIFGHFGDRLGRKRMLMMTVWMMGIATACIGLLPSFNQIGWWAPVLLVFLRAVQGFA 137
+G ++G D+LG KR+L + GI C G + F +G LL+ R +QG
Sbjct: 64 IGTAVYGKLSDQLGIKRLL-----LFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGA- 115

Query: 138 VGGEWGGAALLS---------VENAPQGKK-AFYSSGVQVGYGVGLLLSTGLVSLISSLT 187
G AA + + +GK S V +G GVG + + I
Sbjct: 116 -----GAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI---- 166

Query: 188 SDQQFLSWGWRLPFLFSVVLVLIALWIRNGMAE 220
W L ++ ++ ++ + +
Sbjct: 167 --------HWSYLLLIPMITIITVPFLMKLLKK 191


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12900  FbpA_PF05833  28  0.011  Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 27.5 bits (61), Expect = 0.011
Identities = 14/84 (16%), Positives = 33/84 (39%), Gaps = 6/84 (7%)

Query: 16 RLYRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIIASMKSDYEDR 75
Y++ NKL++ + +++ N++ + L ++ I + + I+ I E
Sbjct: 385 SYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTNINNADNYDEIEEIKK------ELI 438

Query: 76 VDDYIIKNAELSKERRDISKKLKV 99
YI ++ SK +
Sbjct: 439 ETGYIKFKKIYKSKKSKTSKPMHF 462



Score = 26.4 bits (58), Expect = 0.025
Identities = 10/45 (22%), Positives = 17/45 (37%), Gaps = 1/45 (2%)

Query: 13 EFVRLYRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMS 57
R ++ L ++ E K LL N+ +K G+S
Sbjct: 311 NINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYA-LKKGLS 354


47  D364_RS27030  D364_RS13200  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS27030  -1  15  4.058950  SDR family oxidoreductase
D364_RS12950  0  18  4.553195  his operon leader peptide
D364_RS12955  0  24  3.897596  ATP phosphoribosyltransferase
D364_RS27035  0  27  3.470623  histidinol dehydrogenase
D364_RS12965  1  25  4.024747  histidinol-phosphate transaminase
D364_RS12975  -2  18  -3.404384  bifunctional
D364_RS12980  -2  28  -8.496684  imidazole glycerol phosphate synthase subunit
D364_RS12985  -1  35  -10.880953  1-(5-phosphoribosyl)-5-[(5-
D364_RS12990  1  46  -14.190322  imidazole glycerol phosphate synthase subunit
D364_RS12995  1  56  -17.534502  bifunctional phosphoribosyl-AMP
D364_RS13000  2  62  -18.712384  glycosyltransferase
D364_RS13005  3  59  -18.921956  glycosyltransferase family 4 protein
D364_RS13010  3  47  -16.100304  glycosyltransferase
D364_RS13015  2  47  -16.217442  UDP-galactopyranose mutase
D364_RS13020  1  37  -14.289579  DUF4422 domain-containing protein
D364_RS13025  2  20  -7.472900  O-antigen export ABC transporter ATP-binding
D364_RS13030  3  18  -4.097946  O-antigen export ABC transporter permease RfbA
D364_RS13035  3  20  -2.628501  NAD-dependent epimerase
D364_RS27040  3  21  -6.796133  small membrane protein
D364_RS13050  3  22  -8.250467  UDP-glucose 6-dehydrogenase
D364_RS13055  3  34  -11.916670  O9 family phosphomannomutase RfbK1
D364_RS13060  3  52  -17.244113  mannose-1-phosphate
D364_RS13065  3  71  -22.494547  NADP-dependent phosphogluconate dehydrogenase
D364_RS13070  4  83  -26.173945  undecaprenyl-phosphate glucose
D364_RS26180  4  83  -26.880975  acyltransferase
D364_RS26185  4  82  -27.058535  O-antigen ligase family protein
D364_RS13080  6  84  -26.605750  glycosyltransferase
D364_RS13085  5  83  -26.230852  polysaccharide export protein
D364_RS13090  4  81  -24.192935  capsule assembly Wzi family protein
D364_RS13095  5  70  -19.832844  hypothetical protein
D364_RS13100  4  67  -18.107098  phosphatase PAP2 family protein
D364_RS13105  4  59  -14.416435  GalU regulator GalF
D364_RS26190  4  32  -7.148870  TerC family protein
D364_RS13110  1  20  -4.547918  outer membrane assembly protein AsmA
D364_RS13115  1  10  -0.582408  dCTP deaminase
D364_RS27535  2  11  0.975551  uridine kinase
D364_RS13125  1  11  0.241900  DNA-3-methyladenine glycosylase 2
D364_RS13130  2  11  1.518689  molecular chaperone
D364_RS13135  2  12  2.441031  sensor histidine kinase
D364_RS13140  2  14  3.840114  response regulator transcription factor
D364_RS13145  0  13  4.007831  small membrane protein
D364_RS13150  1  14  3.323355  positive transcription regulator
D364_RS13155  1  12  3.135501  dicarboxylate/amino acid:cation symporter
D364_RS13160  0  14  1.705991  MdtA/MuxA family multidrug efflux RND
D364_RS13165  0  13  1.894713  MdtB/MuxB family multidrug efflux RND
D364_RS13170  1  19  3.042537  multidrug efflux RND transporter permease
D364_RS27045  2  24  3.710510  MFS transporter
D364_RS13180  3  25  4.192576  two-component system sensor histidine kinase
D364_RS13185  3  27  5.111754  two-component system response regulator BaeR
D364_RS13190  3  27  5.409100  tRNA 5-hydroxyuridine modification protein YegQ
D364_RS13195  2  27  4.568418  lipid kinase YegS
D364_RS13200  0  22  3.763669  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS12965  RTXTOXINA  30  0.028  Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.5 bits (66), Expect = 0.028
Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 12/83 (14%)

Query: 25 AISASDSISKTVTEILNNVKA--NGDAALREYSAKFDKTTVAALQVSEAEIAAAGERLSD 82
+ + I T L + D +++ + + VS +E+A A L +
Sbjct: 138 NLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGN--------VSSSELAKASIELIN 189

Query: 83 ELKQAMAVAVKNIETFHNAQQLQ 105
+L +A N+ +F +QQL
Sbjct: 190 QLVDTVASLNNNVNSF--SQQLN 210


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13000  GPOSANCHOR  32  0.004  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.3 bits (73), Expect = 0.004
Identities = 14/66 (21%), Positives = 28/66 (42%)

Query: 333 HELESSNLSTEKLQASIASQDQVLKAREEEIDELRASVAQKKERIDRLMERNAYLETEYQ 392
+ + + + L+A A+ + E + L A+ + +D E LE E+Q
Sbjct: 274 NFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQ 333

Query: 393 KQQDQL 398
K ++Q
Sbjct: 334 KLEEQN 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13015  NUCEPIMERASE  29  0.031  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.031
Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 1/30 (3%)

Query: 5 KILIVG-AGFSGAVIGRQLAEQGHQVHIID 33
K L+ G AGF G + ++L E GHQV ID
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGID 31


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13035  NUCEPIMERASE  583  0.0  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 583 bits (1505), Expect = 0.0
Identities = 260/334 (77%), Positives = 301/334 (90%)

Query: 1 MKFLVTGAAGFIGFHIAQRLLNEGHDVVGIDNMNDYYDVSLKQARLDRLASPAFHFQQLD 60
MK+LVTGAAGFIGFH+++RLL GH VVGIDN+NDYYDVSLKQARL+ LA P F F ++D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 61 LADREGMAKLFATEQFDRVIHLAAQAGVRYSLENPYAYADANLMGYLNILEGCRHTKVKH 120
LADREGM LFA+ F+RV + VRYSLENP+AYAD+NL G+LNILEGCRH K++H
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 121 LVYASSSSVYGLNRKMPFSTEDSVDHPVSLYAATKKANELMAHTYSHLYGIPTTGLRFFT 180
L+YASSSSVYGLNRKMPFST+DSVDHPVSLYAATKKANELMAHTYSHLYG+P TGLRFFT
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180

Query: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIVEAVVRVQDVIPQANAD 240
VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI EA++R+QDVIP A+
Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240

Query: 241 WTVEDGSPATSSAPYRVYNIGNSSPVELMDYITALEEALGMEAQKNMMPIQPGDVLDTSA 300
WTVE G+PA S APYRVYNIGNSSPVELMDYI ALE+ALG+EA+KNM+P+QPGDVL+TSA
Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSA 300

Query: 301 DTQPLYDLVGFKPQTSVKDGVKNFVEWFKDYYQI 334
DT+ LY+++GF P+T+VKDGVKNFV W++D+Y++
Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS26180  IGASERPTASE  28  0.049  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 27.7 bits (61), Expect = 0.049
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 14/64 (21%)

Query: 127 SGISIGKGSMVSWRVQFLDEDFHLVSYNDKKPKDGKITIGENCLIGNNVAINKGCI-IAD 185
+G+S+ +G V+W+V N + + K IG+ LI NKG + + D
Sbjct: 425 AGVSVAEGKTVTWKVH-----------NPQYDRLAK--IGKGTLIVEGTGDNKGSLKVGD 471

Query: 186 GCVV 189
G V+
Sbjct: 472 GTVI 475


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13160  SHAPEPROTEIN  48  5e-08  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 47.8 bits (114), Expect = 5e-08
Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 20/129 (15%)

Query: 132 AMMLH-IKLQAESQLPEQIDQAVIGRPINFQGLGGDEANAQAQGILERAALRAGFRDVVF 190
M+ H IK + + ++ P+ + E A + +A AG R+V
Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140

Query: 191 QFEPVAAGLDFEATLSEEKRVLVVDIGGGTTDCSLLLMGPQWRERADRQQSLLGHSGCRI 250
EP+AA + +SE +VVDIGGGTT+ +++ + ++ S RI
Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189

Query: 251 GGNDLDIAL 259
GG+ D A+
Sbjct: 190 GGDRFDEAI 198



Score = 33.6 bits (77), Expect = 0.001
Identities = 30/126 (23%), Positives = 50/126 (39%), Gaps = 32/126 (25%)

Query: 332 RLSYRLV---RSAEESKIALSSAAS-------------VETALPFIQDELATAIAQQGLE 375
R +Y + +AE K + SA + +P L + +
Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP-RGFTLNSNEILE--- 258

Query: 376 AALDQPLTRIMEQVRLALDSSQTTPDV--------IYLTGGSARSPLIKKALAAQLPGIP 427
AL +PLT I+ V +AL+ Q P++ + LTGG A + + L + GIP
Sbjct: 259 -ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIP 314

Query: 428 LAGGDD 433
+ +D
Sbjct: 315 VVVAED 320


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13170  HTHFIS  66  1e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 1e-14
Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 6/141 (4%)

Query: 4 RLAIIEDNADLLDELLAWLGYRGFEVWGTRSAEAFWRQLHSHPVDIVLVDIGLPGEDGFS 63
+ + +D+A + L L G++V T +A WR + + D+V+ D+ +P E+ F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 VLNYLHELGHY-GLVVVSARGQQQDKLQALSLGADAYLIKPVNFAH-LAETLTALGARLR 121
+L + + ++V+SA+ ++A GA YL KP + + AL R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 QDRP----AAPPAEAIGTPPA 138
+ + +G A
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13190  RTXTOXIND  44  6e-07  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.4 bits (105), Expect = 6e-07
Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 16/139 (11%)

Query: 55 GAALAPVQAATATEEAVPRYLTGLGTVTAA-NTVTVRSRVDGQLLSLHFQEGQQVKAGDL 113
+A + + E V T G +T + + ++ + + + +EG+ V+ GD+
Sbjct: 67 FLVIAFILSVLGQVEIV---ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDV 123

Query: 114 LAQIDPSQFKVALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVE 173
L ++ A+ K Q++L AR + RYQ L ++ EL+ L +
Sbjct: 124 LLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQILSRS-----IELNKLPELKLP 171

Query: 174 SAGTVKADEAAVASAQLQL 192
+ L
Sbjct: 172 DEPYFQNVSEEEVLRLTSL 190



Score = 36.3 bits (84), Expect = 2e-04
Identities = 26/170 (15%), Positives = 63/170 (37%), Gaps = 17/170 (10%)

Query: 125 ALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVESAGTVKADEAA 184
+A +L ++ L ++ ++ + ++ +++ +
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEY------QLVTQLFKNEILDKLRQTTDNIGL 313

Query: 185 V----ASAQLQLDWTRITAPIDGRV-GLKQVDIGNQISSGDTTGIVVLTQTHPIDVVFTL 239
+ A + + + I AP+ +V LK G +++ +T ++V ++V +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED-DTLEVTALV 372

Query: 240 PESSIATVVQAQKAGKALSVEAWDRTNKQKISVGE--LLSLDNQIDATTG 287
I + Q A + VEA+ T + G+ ++LD D G
Sbjct: 373 QNKDIGFINVGQNA--IIKVEAFPYTRYGYLV-GKVKNINLDAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13195  ACRIFLAVINRP  898  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 898 bits (2323), Expect = 0.0
Identities = 294/1036 (28%), Positives = 509/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRFLPVSALPEVDYPTIQVVTLYPGASPDVVTSAI 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA V +
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSSAIPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITSANVNSAKGSLDGP------ARAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEDYRRLII-AYQNGAPIRLGDVASVEQGAENSWLGAWANQQRAIVMNVQRQPGANI 302
++ E++ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 IDTADSIRQMLPQLTESLPKSVKVQVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362
+DTA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNVPATIIPGVAVPLSLVGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SHESLRKQNRFSRASERFFERVIAVYGRWLSRVLNHPWL 538
+S +V+L LTP +CA +L S E + F F+ + Y + ++L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLGVALSTLALSIILWVFIPKGFFPIQDNGIIQGTLQAPQSVSFASMAERQRQVANIILK 598
L + +A ++L++ +P F P +D G+ +Q P + + QV + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VESLTSFVGVDGTNPALNSARLQINLKPLDERDDR---VQTVISRLQQAVDGVPG 653
+ VES+ + G + A N+ ++LKP +ER+ + VI R + + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 654 VALYLQPTQDLTIDTTVSRTQYQFTLQ---ANSLEALSTWVPPLLSRLQAQP-QLADVSS 709
++ P I + T + F L +AL+ LL P L V
Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLAAYIKVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEQDTE 769
+ + ++VD++ A LG+S++D++ + A G ++ + ++ ++ D +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 ATPGLAALENIRLTSSDGGIVPLTAIATVEQRFTPLSVNHLDQFPVTTISFNVPDNYSLG 829
++ + + S++G +VP +A T + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 EAVEAILAAEQSLDFPTDIRTQFQGSSLAFQSALGSTVWLVVAAVVAMYIVLGVLYESFI 889
+A+ + L P I + G S + + LV + V +++ L LYES+
Sbjct: 838 DAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALWLAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMPPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLMLSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13200  ACRIFLAVINRP  890  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 890 bits (2302), Expect = 0.0
Identities = 280/1035 (27%), Positives = 502/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILISLAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ ++++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAQTIAQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+++++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVDLNPQALFNQGVSLDAVRTAISDANVRKPQG------ALEDSAHRWQVQTNDELK 236
A+R+ L+ L ++ V + N + G AL + K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAADYQPLIVHY-QNGAAVRLGDVATVSDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
++ + + +G+ VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRARLPELQQTIPAAIDLQIAQDRSPTIRASLEEVEQTLVISVALVILVVFLFLRS 355
T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414
RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGSREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LAVSLTLTPMMCGWLLKSGKPHQPTRNRGFG----RLLVAVQGGYGKSLKWVLKHSRLTG 530
+ V+L LTP +C LLK GF Y S+ +L +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 LVVLGTIALSVWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
L+ +A V L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 R-EDPAVDNVTGFT-GGSRVNSGMMFITLKPRDQRH---ETAQQVIDRLRKKLANEPGAN 641
+ +V V GF+ G N+GM F++LKP ++R+ +A+ VI R + +L
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLSALREWEPKIRKALAAL-----PELADVNSD 696
+ + I G + ++ L D + + R L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDNGAEMDLVYDRDTMSRLGISVQDANNLLNNAFGQRQISTIYQPLNQYKVVMEVDPAY 756
++ A+ L D++ LG+S+ D N ++ A G ++ K+ ++ D +
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDVSALDKMFVINSDDKPIPLAYFAKWQPANAPLSVNHQGLSAASTISFNLPTGRSLSE 816
+DK++V +++ + +P + F + + I G S +
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASEAIDRAMTQLGVPSSVRGSFAGTAQVFQQTMNAQVILILAAIATVYIVLGVLYESYVH 876
A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALEIFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRNGN 936
P++++ +P VG LLA +F+ + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPEEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13205  TCRTETB  123  5e-33  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (311), Expect = 5e-33
Identities = 93/435 (21%), Positives = 186/435 (42%), Gaps = 17/435 (3%)

Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMIIVSYVLTVAVMLPASGWLADRVGVRNIFF 79
F L+ ++N +LP +A + P + + +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTAGSLFCAQA-STLDQLVMARVLQGVGGAMMVPVGRLTVMKIVPRDQYMAAMTF 138
I++ GS+ S L+MAR +QG G A + + V + +P++ A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAIATLCLMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP + I+ + L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAAGMATLTLALDGQKGLGISPAWLAGLVAVGLCALLLYLWHARGNARALFSLNL 257
G +L++ G+ L + ++ + V + + L+++ H R L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRNRTFSLGLGGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+N F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVASTLGLAAVSLLFMFSALAGWYYVLPLVLFLQGMINASRFSSMNT 376
+V+R G VL L+ L F +++ +++F+ G ++ ++ + ++T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK-TVIST 371

Query: 377 LTLKDLPDDLASSGNSLLSMVMQLSMSIGVTIAGLLLGLYGQQHMSLDAASTHQVFLYT- 435
+ L A +G SLL+ LS G+ I G LL + L +LY+
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431

Query: 436 -YLSMAAIIALPALI 449
L + II + L+
Sbjct: 432 LLLLFSGIIVISWLV 446


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13210  BCTERIALGSPF  36  2e-04  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 36.3 bits (84), Expect = 2e-04
Identities = 28/93 (30%), Positives = 36/93 (38%), Gaps = 21/93 (22%)

Query: 198 LATLLAA-------------LATFPLARGLLAPVKRLVEGTHKLAA------GDFST--R 236
LATL+AA + P L+A V+ V H LA G F
Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136

Query: 237 VTVTGGDELGRLAQDFNQLASTLERNQQMRRDL 269
V G+ G L N+LA E+ QQMR +
Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMRSRI 169


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13215  HTHFIS  77  2e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 2e-18
Identities = 31/148 (20%), Positives = 67/148 (45%), Gaps = 3/148 (2%)

Query: 11 PRILIVEDEPKLGQLLIDYLQAAGYAPTLINHGDKVLPYVRQTPPHLILLDLMLPGTDGL 70
IL+ +D+ + +L L AGY + ++ + ++ L++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDVPVVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL--RR 127
L I+ D+PV++++A+ + + E GA DY+ KP+ E++ + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 128 CKPQRDLQALDAQSPLIVDEGRFQASWR 155
+P + PL+ Q +R
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13230  PF05272  30  0.008  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.008
Identities = 19/93 (20%), Positives = 28/93 (30%), Gaps = 35/93 (37%)

Query: 36 LVGESGSGKTTVLKCLAGLFTHWQGELTI---------------------------DAQP 68
L G G GK+T++ L GL I DA+
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660

Query: 69 LGHEISRERCRQVQMVFQDPYGSL---HPRHTI 98
+ S + R ++ YG HPR +
Sbjct: 661 VKAFFSSRKDR-----YRGAYGRYVQDHPRQVV 688


48  D364_RS13290  D364_RS13395  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS13290       0       12   3.257470  RbtT/DalT/CsbX family MFS transporter
D364_RS13300       0       14   3.069819  class I fructose-bisphosphate aldolase
D364_RS13310       0       15   6.038282  bifunctional hydroxymethylpyrimidine
D364_RS13315       0       14   5.532354  hydroxyethylthiazole kinase
D364_RS13325      -1       13   4.366878  universal stress protein
D364_RS13330      -1       13   4.633940  LysR family transcriptional regulator
D364_RS13335       0       13   4.184441  adenine deaminase
D364_RS13340      -2       18   2.192536  NCS2 family permease
D364_RS13345      -1       18   1.278534  LysR family transcriptional regulator
D364_RS13350       0       20   0.753997  RcnB family protein
D364_RS13355       0       20   1.344450  GNAT family N-acetyltransferase
D364_RS13360       0       21   1.040736  iron-sulfur cluster carrier protein ApbC
D364_RS13365       0       19   1.659304  methionine--tRNA ligase
D364_RS13370       3       22   3.149917  DUF1456 family protein
D364_RS13375       4       19   4.720531  two-component system response regulator BtsR
D364_RS13380       3       19   4.993629  sensor histidine kinase
D364_RS13385       0       18   4.352229  protein YohO
D364_RS13390      -1       15   3.636065  ABC transporter permease
D364_RS13395      -1       15   3.246285  ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13300  TCRTETB  39  3e-05  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.1 bits (91), Expect = 3e-05
Identities = 31/160 (19%), Positives = 63/160 (39%), Gaps = 5/160 (3%)

Query: 221 LYTNRSILFSSIVRIINTLSLFGFAVIMPMMFVDELGFTTSEWLQVWAAFFFTTIFSNVF 280
L N + + I ++ GF ++P M D +T+E + + F S +
Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE---IGSVIIFPGTMSVII 308

Query: 281 WGIVAEKMGWMKVIRWFGCIGMALSSLAFYYLP-QHFGHNFAMALVPAIALGIFVAAFVP 339
+G + + + + IG+ S++F ++ M ++ LG
Sbjct: 309 FGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTV 368

Query: 340 MAAVFP-ALEPNHKGAAISVYNLSAGLSNFLAPAIAVVLL 378
++ + +L+ GA +S+ N ++ LS AI LL
Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13335  UREASE  35  0.001  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 34.7 bits (80), Expect = 0.001
Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 17/105 (16%)

Query: 13 QAARGESPFDLLLIDAQIVDMATGEIRPADVGIVGEMIASVHPRGSRE----------DA 62
Q R D ++ +A I+D G I AD+G+ IA++ G+ +
Sbjct: 60 QVTREGGAVDTVITNALILD-HWG-IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 63 HEVRSLAGGYLSPGLMDTHVHLESSHLPPERYAEIVLTQGTTAVF 107
EV + G ++ G MD+H+H + P++ E L G T +
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIHF----ICPQQ-IEEALMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13355  SACTRNSFRASE  33  2e-04  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.0 bits (75), Expect = 2e-04
Identities = 15/85 (17%), Positives = 35/85 (41%), Gaps = 7/85 (8%)

Query: 57 DEQLWVAECDGQPVGFAAV---WTVDNFLHHLFVDPDWQGKHIGSALLAQVERSFTASGT 113
+ ++ + +G + W + + V D++ K +G+ALL + +
Sbjct: 64 GKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123

Query: 114 LKCLMENKN----ALRFYQRHGWTI 134
++E ++ A FY +H + I
Sbjct: 124 CGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13375  HTHFIS  71  3e-16  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.6 bits (173), Expect = 3e-16
Identities = 36/168 (21%), Positives = 70/168 (41%), Gaps = 10/168 (5%)

Query: 3 RVLIVDDEPLARENLRILLETQRDIEIVGECGNAVEAIGAVHKLRPDVLFLDIQMPRISG 62
+L+ DD+ R L L + ++ NA + D++ D+ MP +
Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 63 LEMVGMLDPEHRPYI--VFLTAFD--EYAVKAFEEHAFDYLLKPIEAARLEKTLARLRQE 118
+++ + + RP + + ++A + A+KA E+ A+DYL KP + L + R E
Sbjct: 63 FDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 119 RNLQDVSLLDDAQQTLKYIPCTGHSRIWLLQMEDVAFVSSRMSGIYVT 166
+ L DD+Q + + G S +A + + +T
Sbjct: 122 PKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMIT 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13380  PF06580  211  1e-65  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 211 bits (539), Expect = 1e-65
Identities = 59/216 (27%), Positives = 117/216 (54%), Gaps = 3/216 (1%)

Query: 343 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTLKAVIRRDSDQA 402
L G + + + ++ ++++ L AQ+NPHF+FNALN ++A+I D +A
Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193

Query: 403 GQLVQYLSTFFRKNLKR-PTEIVTLADEIEHVNAYLQIEKARFQANLQIQMAVPEGLAHH 461
+++ LS R +L+ V+LADE+ V++YLQ+ +F+ LQ + + +
Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253

Query: 462 QLPAFTLQPIVENAIKHGTSQHLGVGEITIRASQDDRWLQLDIEDNAGL-YRANPQASGL 520
Q+P +Q +VEN IKHG +Q G+I ++ ++D+ + L++E+ L + +++G
Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313

Query: 521 GMNLVDRRLRARFGADCGISVTCEPERFTRVTLRLP 556
G+ V RL+ +G + I ++ + + + +P
Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


49  D364_RS13720  D364_RS13810  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS13720      -2       12   3.098270  SulP family inorganic anion transporter
D364_RS13725      -1       14   3.887270  magnesium transporter
D364_RS13730      -2       15   2.570956  multidrug ABC transporter permease/ATP-binding
D364_RS13735      -1       11   0.961393  DNA oxidative demethylase AlkB
D364_RS13740      -1       12  -0.144771  bifunctional DNA-binding transcriptional
D364_RS13750       0       10  -1.568524  FAD:protein FMN transferase ApbE
D364_RS13755       1       10  -1.903257  porin OmpC
D364_RS13760       1       11  -1.522904  phosphotransferase RcsD
D364_RS13765       2       16  -1.417141  transcriptional regulator RcsB
D364_RS13770       2       16  -1.140723  two-component system sensor histidine kinase
D364_RS13775       3       19  -1.500326  DNA topoisomerase (ATP-hydrolyzing) subunit A
D364_RS13780       0       18  -1.916056  bifunctional 3-demethylubiquinone
D364_RS13785       0       18  -0.511733  ribonucleoside-diphosphate reductase subunit
D364_RS13790      -2       13   1.196069  ribonucleotide-diphosphate reductase subunit
D364_RS13795      -2       10   1.941343  2Fe-2S ferredoxin-like protein
D364_RS13800      -2       10   2.208268  glycerophosphodiester phosphodiesterase
D364_RS13805      -2       10   3.345907  glycerol-3-phosphate transporter
D364_RS13810      -2       11   3.857292  anaerobic glycerol-3-phosphate dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13755  ECOLIPORIN  547  0.0  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 547 bits (1411), Expect = 0.0
Identities = 266/383 (69%), Positives = 299/383 (78%), Gaps = 18/383 (4%)

Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLYGKIDGLHYFSDDKSVDGDQTYMRVG 60
MK KVL+L++PALL AGAA+AAEIYNKDGNKLDLYGK+DGLHYFSDD S DGDQTYMRVG
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 VKGETQINDQLTGYGQWEYNVQANNTESSSDQAWTRLAFAGLKFGDAGSFDYGRNYGVVY 120
KGETQINDQLTGYGQWEYNVQAN TE +WTRLAFAGLKFGD GSFDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 121 DVTSWTDVLPEFGGDTYG-SDNFLQSRANGVATYRNSDFFGLVDGLNFALQYQGKNGSVS 179
DV WTD+LPEFGGD+Y +DN++ RANGVATYRN+DFFGLVDGLNFALQYQGKN S S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 180 ------GEGATNNGRGWSKQNGDGFGTSLTYDIWDGISAGFAYSHSKRTDEQNSVPA-LG 232
G NNG NGDGFG S TYDI G SAG AY+ S RT+EQ + +
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240

Query: 233 RGDNAETYTGGLKYDANNIYLASQYTQTYNATRAGSL------GFANKAQNFEVVAQYQF 286
GD A+ +T GLKYDANNIYLA+ Y++T N T G G ANK QNFEV AQYQF
Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300

Query: 287 DFGLRPSVAYLQSKGKDLER---GYGDQDILKYVDVGATYYFNKNMSTYVDYKINLLD-D 342
DFGLRP+V++L SKGKDL D+D++KY DVGATYYFNKN STYVDYKINLLD D
Sbjct: 301 DFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDDD 360

Query: 343 NSFTRNAGISTDDVVALGLVYQF 365
+ F ++AGISTDD+VALG+VYQF
Sbjct: 361 DPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13765  HTHFIS  50  2e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.8 bits (119), Expect = 2e-09
Identities = 31/170 (18%), Positives = 66/170 (38%), Gaps = 27/170 (15%)

Query: 1 MNTMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60
M +++ADD + + ++L + + + ++ L + D +++TD+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GDKYGDGITLIKYIKRHFPDLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114
+ L+ IK+ PDL ++V++ N +A+ ++GA P
Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106

Query: 115 DLPKALAALQKGKKFTPESVSRLLEKISASGYGDKRL---SPKESEVLRL 161
DL + + + + R K+ L S E+ R+
Sbjct: 107 DLTELIGIIGRAL----AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13770  HTHFIS  84  9e-19  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.7 bits (207), Expect = 9e-19
Identities = 29/104 (27%), Positives = 46/104 (44%)

Query: 825 ILVVDDHPINRRLLADQLGSLGYQCVTANDGIDALNVLSKQHIDIVLSDVNMPNMDGYRL 884
ILV DD R +L L GY ++ ++ D+V++DV MP+ + + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 885 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLD 928
RI++ LPV+ ++A + E G L KP L
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13800  VACCYTOTOXIN  29  0.039  Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.8 bits (64), Expect = 0.039
Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 4/54 (7%)

Query: 252 NYSYDWMFKPGAMAQIAQYADGIGPDYHMLVAEGSKPGAVKLTAMVKEAHASHL 305
+Y YD+ F A+ +G Y+ L + K + + A+ A + HL
Sbjct: 1143 SYGYDFAFFRNALVLKPS----VGVSYNHLGSTNFKSNSNQKVALKNGASSQHL 1192


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13805  TCRTETA  29  0.029  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.029
Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 6/119 (5%)

Query: 59 GFSRGDLGFALSGISIAYGFSK-FIMGSVSDRSNPRIFLPAGLILAALVMLVMGFVPWAT 117
+ +G +L+ I + ++ I G V+ R R L G+I +++ F
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 118 SSIMIMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLF 176
+ IM +L G+G P + ++ +G + ++ + PLLF
Sbjct: 302 MAFPIMVLLASG-----GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355


50  D364_RS13865  D364_RS13890  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS13865       1       13   4.202101  nucleoside triphosphatase NudI
D364_RS13870       2       14   4.946567  o-succinylbenzoate--CoA ligase
D364_RS13875       1       13   4.820469  o-succinylbenzoate synthase
D364_RS13880       0       13   4.176891  1,4-dihydroxy-2-naphthoyl-CoA synthase
D364_RS13885       1       13   4.284242  2-succinyl-6-hydroxy-2,
D364_RS13890       1       14   3.118093  2-succinyl-5-enolpyruvyl-6-hydroxy-3-
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS13870  ALARACEMASE  36  4e-04  Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 35.5 bits (82), Expect = 4e-04
Identities = 31/195 (15%), Positives = 61/195 (31%), Gaps = 39/195 (20%)

Query: 268 GYGLTEFASTVCAKEADGAADVGEAL----PGREVKIVAGEIWLRASSMAAGYWRDGQLL 323
G+G+ S + A + ++ EA+ G + I+ E + A + D L
Sbjct: 40 GHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIY---DQHRL 96

Query: 324 SLTNNEGWFATRDRGALHNGRLTVVGRMDNLFFSGGEGIQPEEVERVILAHPQVQQVFIV 383
+ + W + A L + ++++ G QP+ V V + V +
Sbjct: 97 TTCVHSNWQLKALQNARLKAPLDIYLKVNSGM--NRLGFQPDRVLTVWQQLRAMANVGEM 154

Query: 384 PL-------DNAEYGQRPVAVVECDDGCELSALAAWSAERLARFQQPVRWLRLPETLKNG 436
L ++ + +AR +Q L +L N
Sbjct: 155 TLMSHFAEAEHPDGIS----------------------GAMARIEQAAEGLECRRSLSNS 192

Query: 437 GIKISRRALC-EWVR 450
+ +WVR
Sbjct: 193 AATLWHPEAHFDWVR 207


51  D364_RS27540  D364_RS14080  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS27540       1       15  -3.546899  GNAT family N-acetyltransferase
D364_RS14060       1       13  -3.572449  histidine ABC transporter ATP-binding protein
D364_RS14065       2       15  -4.591625  histidine ABC transporter permease HisM
D364_RS14070       2       13  -3.980671  histidine ABC transporter permease HisQ
D364_RS14075       0       14  -3.097485  histidine ABC transporter substrate-binding
D364_RS14080       1       14  -3.972689  lysine/arginine/ornithine ABC transporter
52  D364_RS14215  D364_RS14435  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS14215       1       17  -3.789100  YfcZ/YiiS family protein
D364_RS14220       0       15  -2.926313  long-chain fatty acid transporter FadL
D364_RS14225       0       20  -3.426064  phospholipid-binding lipoprotein MlaA
D364_RS14230       3       20  -1.346759  formate/nitrite transporter family protein
D364_RS14240       4       21  -0.944831  *WbuC family cupin fold metalloprotein
D364_RS14245       2       18  -1.253580  hypothetical protein
D364_RS14250       2       17   0.223233  transcriptional regulator CynR
D364_RS14255       1       18  -0.259003  nucleoside deaminase
D364_RS14260       1       14  -1.430799  cyanate transporter
D364_RS27055       1       15  -4.900906  membrane protein YpdK
D364_RS14265       1       12  -3.802676  alanine transaminase
D364_RS14270       0       13  -2.153279  hypothetical protein
D364_RS27550      -1       11  -0.365630  hypothetical protein
D364_RS14280      -1       11  -0.362584  sensor histidine kinase
D364_RS14285      -1       11   0.529193  LytTR family DNA-binding domain-containing
D364_RS14290      -1       11   1.456593  glucokinase
D364_RS14295      -1       11   0.264153  ion channel protein
D364_RS14300      -2       14  -2.775539  indolepyruvate decarboxylase
D364_RS14305      -2       16  -4.325066  L-glyceraldehyde 3-phosphate reductase
D364_RS14310      -2       16  -4.588082  DUF2502 domain-containing protein
D364_RS14315      -3       15  -4.592100  Nramp family divalent metal transporter
D364_RS14320      -2       17  -4.718802  nucleoside permease NupC
D364_RS14325      -2       16  -3.787159  EAL domain-containing protein
D364_RS14340      -1       12  -1.822158  **putative DNA-binding transcriptional regulator
D364_RS14345      -1       10  -0.470005  putative DNA-binding transcriptional regulator
D364_RS14350      -2       10  -0.053474  glutamate--tRNA ligase
D364_RS14375      -1       10   0.565893  ****LysR family transcriptional regulator
D364_RS14380      -2        9   0.106161  bile acid:sodium symporter
D364_RS14385      -1       11  -0.014609  DUF3820 family protein
D364_RS14390       0       15  -1.593182  NAD-dependent DNA ligase LigA
D364_RS14395       0       22  -3.808336  cell division protein ZipA
D364_RS14405       0       24  -4.945649  sulfate transporter CysZ
D364_RS14410       0       22  -3.908791  cysteine synthase A
D364_RS14420       0       16  -1.635152  phosphocarrier protein Hpr
D364_RS14425      -1       15  -1.514308  phosphoenolpyruvate-protein phosphotransferase
D364_RS14430      -2       12   1.506075  PTS glucose transporter subunit IIA
D364_RS14435       0       14   3.068073  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS14225  VACJLIPOPROT  381  e-138  VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 381 bits (981), Expect = e-138
Identities = 223/253 (88%), Positives = 238/253 (94%), Gaps = 2/253 (0%)

Query: 1 MNYRLSALALGATLLVGCASSSSGDRPQGRSDPLEGFNRTMFNFNFNVVDPYVLRPVAVA 60
M RLSALALG TLLVGCASS G QGRSDPLEGFNRTM+NFNFNV+DPY++RPVAVA
Sbjct: 1 MKLRLSALALGTTLLVGCASS--GTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVA 58

Query: 61 WRDYVPQPARNGLSNFTSNLEEPAVMVNYFLQGDPYKGMVHFTRFFLNTILGMGGLIDVA 120
WRDYVPQPARNGLSNFT NLEEPAVMVNYFLQGDPY+GMVHFTRFFLNTILGMGG IDVA
Sbjct: 59 WRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVA 118

Query: 121 GMANPQLQRVEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDEGGDMADGLYPVLSWLTW 180
GMANP+LQR EPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRD+GGDMAD LYPVLSWLTW
Sbjct: 119 GMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTW 178

Query: 181 PMSIGKWAVEGIETRAQLLDSDGLLRQSSDPYILMREAYFQRHDFIANGGKLTPADNPNA 240
PMS+GKW +EGIETRAQLLDSDGLLRQSSDPYI++REAYFQRHDFIANGG+L P +NPNA
Sbjct: 179 PMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNA 238

Query: 241 QAIQDELKDIDSQ 253
QAIQD+LKDIDS+
Sbjct: 239 QAIQDDLKDIDSE 251


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14260  TCRTETA          33  0.001    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 33.3 bits (76), Expect = 0.001
Identities = 73/361 (20%), Positives = 117/361 (32%), Gaps = 19/361 (5%)

Query: 31 LLPDIRAASGMSYTLAALLTALPVIAMGVLALAAGWVDRYIGQKRSIALSLLIIAAGALL 90
LL D+ ++ ++ LL ++ + DR+ G++ + +SL A +
Sbjct: 31 LLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF-GRRPVLLVSLAGAAVDYAI 89

Query: 91 REIAPNSGLLLTSALAGGIGIGIIQAAIPAVIKHLFPRRT-PLVMGLWSAALMGGGGLGA 149
AP +L + GI G A A I + G SA G G
Sbjct: 90 MATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGP 148

Query: 150 AFTPWLA--SHSAAWHDALAWWALPALLALL----SWLAICRHLPRAPHQTSASSRVAII 203
+ S A + A A L L S R L R AS R A
Sbjct: 149 VLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARG 208

Query: 204 GQRRAWTLGLYFG--LINAGYASLIAWLPPYYIQLGDSAQYSGSLLALLTVGQTAGALLL 261
A + ++F L+ A+L D+ SL A + A A++
Sbjct: 209 MTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW-DATTIGISLAAFGILHSLAQAMIT 267

Query: 262 PALARQEDRRQLLLLALALQLIGFCGFIWLPEHFSALWAIACGVGLGGAFPLC---LVLA 318
+A + R+ L+L + G+ + + A + G P L
Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327

Query: 319 LDHAGQPAVAGRLVAFMQGIGFIIAGLSPWLSGLLRSLSGNYTLDWSWHAICVLLLMALT 378
+D Q + G L A + P L + + S W+W A L L+ L
Sbjct: 328 VDEERQGQLQGSLAALTSLTSIV----GPLLFTAIYAASITTWNGWAWIAGAALYLLCLP 383

Query: 379 L 379

Sbjct: 384 A 384


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14280  PF06580         217  5e-68    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 217 bits (555), Expect = 5e-68
Identities = 58/207 (28%), Positives = 101/207 (48%), Gaps = 11/207 (5%)

Query: 348 RAEQLREMANKAELRALQSKINPHFLFNALNAISSSIRLNPDTARQLIFNLSRYLRYNIE 407
++ MA +A+L AL+++INPHF+FNALN I + I +P AR+++ +LS +RY++
Sbjct: 150 DQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKAREMLTSLSELMRYSLR 209

Query: 408 LKDDEQIDIKRELYQIKDYIAIEQARFGDKLTVIYDIDDDV-SCVIPSLLIQPLVENAIV 466
+ Q+ + EL + Y+ + +F D+L I+ + +P +L+Q LVEN I
Sbjct: 210 YSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIK 269

Query: 467 HGIQPCKGKGVVTIGINECGNRVRISVRDTGNGIDPAVVARVEADEMPGNKIGLLNVHHR 526
HGI G + + + V + V +TG+ + GL NV R
Sbjct: 270 HGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK--------NTKESTGTGLQNVRER 321

Query: 527 VKLLYGE--GLHIRNLTPGTEIAFYVP 551
+++LYG + + +P
Sbjct: 322 LQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14285  HTHFIS           55  6e-11    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.8 bits (132), Expect = 6e-11
Identities = 23/142 (16%), Positives = 58/142 (40%), Gaps = 9/142 (6%)

Query: 2 KVIIVEDEFLAQQELSWLINTHSQMEIVGSFDDGLDVLKFLQHNKVDAIFLDINIPSLDG 61
+++ +D+ + L+ ++ V + + +++ D + D+ +P +
Sbjct: 5 TILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62

Query: 62 V-LLAQNISQFAHKPFIVFITAWK--EHAVEAFELEAFDYILKPYQESRIINMLQKLTTA 118
LL + P +V ++A A++A E A+DY+ KP+ + +I ++ + A
Sbjct: 63 FDLLPRIKKARPDLPVLV-MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---A 118

Query: 119 WQQQNNAASGLASAAPRENDTI 140
+ S L + +
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14395  IGASERPTASE      51  4e-09    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 51.2 bits (122), Expect = 4e-09
Identities = 26/126 (20%), Positives = 40/126 (31%), Gaps = 7/126 (5%)

Query: 98 ERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAPQTGWQ-QPQPAQPPVQPQHQPQ 156
E + Q +E + + Q+ + + Q Q + QP +P +
Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150

Query: 157 PVV--QQPVAPQPVTPTVAQPQPAAPQQPAPQPVAASQPAVAEPQPVE---PQQPAAPQP 211
P V ++P + T QP QPV S VE PA QP
Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNV-EQPVTESTTVNTGNSVVENPENTTPATTQP 1209

Query: 212 KERKET 217
E+
Sbjct: 1210 TVNSES 1215



Score = 45.4 bits (107), Expect = 3e-07
Identities = 22/135 (16%), Positives = 41/135 (30%), Gaps = 1/135 (0%)

Query: 78 AHGEHEAPRQAPQHQYQPPYERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAPQT 137
A E E ++ P+ Q +++ + +P+ + P P Q Q
Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171

Query: 138 GWQQPQPAQPPVQPQHQPQPVVQQPVAPQPVTPTVAQPQPAAPQQPAPQPVAASQPAVAE 197
+ + PV P+ TP QP + P+ + +
Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK-NRHRRSVRSV 1230

Query: 198 PQPVEPQQPAAPQPK 212
P VEP ++
Sbjct: 1231 PHNVEPATTSSNDRS 1245



Score = 37.4 bits (86), Expect = 9e-05
Identities = 30/170 (17%), Positives = 44/170 (25%), Gaps = 25/170 (14%)

Query: 83 EAPR---QAPQHQYQPPYERQMQQPARPEEPVRQPPQPPRQAPVPP-QQQPA-------- 130
E P+ Q Q Q + +PAR +P +P Q +QPA
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 131 -PHAAPQTGWQQPQPAQPPVQPQH---QPQPVVQQPVAPQPVTPTVAQPQPAAPQ----- 181
P T + P QP + P+ + P +
Sbjct: 1181 QPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTS 1240

Query: 182 QPAPQPVAASQPAVAEPQPVEPQQPAAPQPKERKETVIVMNVAAHHGAQL 231
VA V A Q + V + H +QL
Sbjct: 1241 SNDRSTVALCDLTSTNTNAVLSDARAKAQ----FVALNVGKAVSQHISQL 1286



Score = 33.9 bits (77), Expect = 0.001
Identities = 26/161 (16%), Positives = 41/161 (25%), Gaps = 36/161 (22%)

Query: 86 RQAPQHQYQPPYERQMQQP---------ARPEEPVRQPPQPP---------------RQA 121
+ P Q P AR +E PP P
Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESK 1049

Query: 122 PVPPQQQPAPHAAPQTGWQQPQPAQPPVQPQHQPQPVVQQPVAPQPVTPTVAQPQP---- 177
V +Q A Q + + A+ V+ Q V Q + T +
Sbjct: 1050 TVEKNEQDATETTAQNR-EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108

Query: 178 -------AAPQQPAPQPVAASQPAVAEPQPVEPQQPAAPQP 211
Q P+ + P + + V+PQ A +
Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14425  PHPHTRNFRASE    746  0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 746 bits (1927), Expect = 0.0
Identities = 275/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%)

Query: 1 MISGILASPGIAFGKALLLKEDEIVIDRKKISADKVDQEVERFLSGRAKASAQLEVIKTK 60
I+GI AS G+A KA + E + I++ I V E+E+ + K+ +L IK +
Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61

Query: 61 AGETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAANEVIDGQATALEELDD 120
+ G +K IF H+++L+D EL I I+++ M A+ A EV D + E +D+
Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121

Query: 121 EYLKERAADVRDIGKRLLRNILGLAIIDLSAIQDEVILVAADLTPSETAQLNLKKVLGFI 180
EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN + V GF
Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181

Query: 181 TDAGGRTSHTSIMARSLELPAIVGTGSITAQVKNGDYLILDAVNNQVLINPSNEQIEALR 240
TD GGRTSH++IM+RSLE+PA+VGT +T ++++GD +I+D + V++NP+ E+++A
Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241

Query: 241 SLQAQVAEEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFL 300
+A ++K E AKL P+ T DG VE+ ANIGT +DV+G NG EG+GLYRTEFL
Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301

Query: 301 FMDRDALPTEEEQFAAYKAVAEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAV 360
+MDRD LPTEEEQF AYK V + + V++RT+DIGGDKEL Y+ PKE NPFLG+RA+
Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361

Query: 361 RIAMDRKEILRDQVRAILRASAFGKLRIMFPMIISVEEVRALKKEIEIYKQELRDEGKAF 420
R+ +++++I R Q+RA+LRAS +G L++MFPMI ++EE+R K ++ K +L EG
Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421

Query: 421 DESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS 480
+SIE+G+MVE P+ A A AKEVDFFSIGTNDL QYT+A DR N+ +S+LYQP P+
Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481

Query: 481 VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540
+L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + +
Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541

Query: 541 NFEDAKVLAEQALAQPTTDELMTLVNKFIEE 571
+ E+ K A++AL T +E+ LV K +
Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572


53  D364_RS14520  D364_RS14590  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS14520      -2       12   3.161940  ethanolamine utilization microcompartment
D364_RS14525      -1       13   2.987248  ethanolamine utilization microcompartment
D364_RS14530       0       14   3.419172  ethanolamine ammonia-lyase subunit EutC
D364_RS14535       0       15   3.707564  ethanolamine ammonia-lyase subunit alpha
D364_RS14540       0       15   3.719430  ethanolamine ammonia-lyase reactivating factor
D364_RS14545       2       13   4.390020  ethanolamine utilization protein EutH
D364_RS14550       3       13   3.903403  ethanolamine utilization ethanol dehydrogenase
D364_RS14555       3       15   5.742578  ethanolamine utilization protein EutJ
D364_RS14560       2       15   5.203903  aldehyde dehydrogenase EutE
D364_RS14565       1       17   4.959956  ethanolamine utilization microcompartment
D364_RS14570       1       15   4.044446  ethanolamine utilization microcompartment
D364_RS14575       1       16   3.487336  phosphate acetyltransferase
D364_RS14580       1       15   2.699257  ethanolamine utilization cob(I)yrinic acid
D364_RS14585       1       14   1.346267  ethanolamine utilization acetate kinase EutQ
D364_RS14590       2       13   1.498738  ethanolamine utilization acetate kinase EutP
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14560  SHAPEPROTEIN     46  7e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 45.9 bits (109), Expect = 7e-08
Identities = 47/179 (26%), Positives = 70/179 (39%), Gaps = 41/179 (22%)

Query: 33 LGIDLGTCD----------------VVSMVVDRDGQP---VAVCLDWADVV--------- 64
L IDLGT + VV++ DR G P AV D ++
Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIAA 72

Query: 65 ----RDGIVWDFFGAVTLVRRHLATLEQQLGCRFT-HAATSFPPGTDP---RISINVLES 116
+DG++ DFF +++ + + R + P G R +
Sbjct: 73 IRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQG 132

Query: 117 AGLEISHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKQGRVTYSADEATGG 170
AG +++EP A A L + G VVDIGGGTT +A++ V YS+ GG
Sbjct: 133 AGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191


54  D364_RS14745  D364_RS15075  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS14745      -2        8  -3.038899  phosphoribosylformylglycinamidine cyclo-ligase
D364_RS14750      -2       11  -2.637905  phosphoribosylglycinamide formyltransferase
D364_RS14755      -2       10  -2.343284  polyphosphate kinase 1
D364_RS14760       0       17  -2.300743  exopolyphosphatase
D364_RS14765       0       22  -3.931745  sensor domain-containing phosphodiesterase
D364_RS14770      -2       14  -2.430006  DUF2633 family protein
D364_RS14775      -2       14  -1.823907  LysR family transcriptional regulator
D364_RS14780      -2       14  -1.532114  MFS transporter
D364_RS14785      -1       14  -2.079135  anaerobic sulfatase maturase
D364_RS14790      -2       13  -0.354357  sulfatase-like hydrolase/transferase
D364_RS14795       0       18   1.501565  glutamine-hydrolyzing GMP synthase
D364_RS14800      -2       15  -0.677688  IMP dehydrogenase
D364_RS14805       1       25  -3.281707  exodeoxyribonuclease VII large subunit
D364_RS14810       3       36  -6.138299  zinc ribbon domain-containing protein
D364_RS14815       4       40  -6.149536  Gfo/Idh/MocA family oxidoreductase
D364_RS14825       4       46  -8.555207  hypothetical protein
D364_RS26770       4       46  -8.879682  hypothetical protein
D364_RS14835       4       44  -8.457675  DNA-protecting protein DprA
D364_RS14850       3       42  -7.251439  hypothetical protein
D364_RS14860       4       41  -6.238364  phage tail length tape measure family protein
D364_RS14865       2       26  -0.734423  hypothetical protein
D364_RS27560       2       25  -0.014152  hypothetical protein
D364_RS14875       2       25  -0.314111  hypothetical protein
D364_RS14880       2       25   1.159011  PerC family transcriptional regulator
D364_RS14885       3       24   0.527230  single-stranded DNA-binding protein
D364_RS14890       2       25  -0.468727  TOPRIM and DUF927 domain-containing protein
D364_RS14895       1       35  -4.305677  hypothetical protein
D364_RS14900       0       36  -5.320749  hypothetical protein
D364_RS14905       0       34  -5.365239  hypothetical protein
D364_RS14910       0       35  -5.289923  hypothetical protein
D364_RS26235       2       43  -9.651427  ash family protein
D364_RS14925       1       40  -9.721545  DUF4222 domain-containing protein
D364_RS14930       0       28  -7.275279  hypothetical protein
D364_RS14935       0       23  -5.837782  hypothetical protein
D364_RS14940      -1       22  -5.725617  AlpA family phage regulatory protein
D364_RS14945       0       18  -4.429158  hypothetical protein
D364_RS14950       0       13  -1.687468  site-specific integrase
D364_RS14955       1       15  -0.452586  ribosome biogenesis GTPase Der
D364_RS14960       1       16  -0.545455  outer membrane protein assembly factor BamB
D364_RS14965       2       18  -0.736437  YfgM family protein
D364_RS14970       0       11   2.056047  histidine--tRNA ligase
D364_RS14975      -1       12   3.162987  flavodoxin-dependent
D364_RS14980       0       11   3.877632  cytoskeleton protein RodZ
D364_RS14985      -1       10   3.744423  bifunctional tRNA
D364_RS14990      -1       11   3.911288  nucleoside-diphosphate kinase
D364_RS14995       0       11   3.944663  peptidoglycan glycosyltransferase PbpC
D364_RS15000       0       13   2.800267  3-mercaptopyruvate sulfurtransferase
D364_RS15005      -2       13   0.737801  hypothetical protein
D364_RS15010      -2       13   1.316305  enhanced serine sensitivity protein SseB
D364_RS15015       0       16   0.946585  aminopeptidase PepB
D364_RS15020       0       16   0.962621  Fe-S cluster assembly protein IscX
D364_RS15025       2       18  -0.005383  ISC system 2Fe-2S type ferredoxin
D364_RS15030       1       19  -0.509565  Fe-S protein assembly chaperone HscA
D364_RS15035       1       19  -0.425090  co-chaperone HscB
D364_RS15040       1       21  -2.124698  iron-sulfur cluster assembly protein IscA
D364_RS15045       2       23  -1.878821  Fe-S cluster assembly scaffold IscU
D364_RS15050      -1       19  -1.030660  cysteine desulfurase
D364_RS15055      -1       15   0.564042  Fe-S cluster assembly transcriptional regulator
D364_RS15060       0       14   1.099435  tRNA
D364_RS15065       0       11   0.892107  inositol-1-monophosphatase
D364_RS15070       1       10   1.261624  hypothetical protein
D364_RS15075       2       13   0.805556  nickel/cobalt transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14780  TCRTETA          31  0.006    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.006
Identities = 21/60 (35%), Positives = 27/60 (45%)

Query: 51 LTLAMLMMAAVSPFVARLLARFGGRLVVTSGTLLIAASCAMMAWRPSLAGWYGAWLLTGI 110
L L LM A +P + L RFG R V+ A A+MA P L Y ++ GI
Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14825  FLGPRINGFLGI     32  5e-04    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 32.2 bits (73), Expect = 5e-04
Identities = 13/54 (24%), Positives = 20/54 (37%)

Query: 17 LQDAFNQAVAAAGGDKTAWLLDALRSKLNQPESNPQLRLLELVERMEVAAAALA 70
+ D N A GD A D+ + +P RL+ +E + V A
Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPA 261


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS14860  CHANLCOLICIN     38  2e-04    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 37.7 bits (87), Expect = 2e-04
Identities = 40/233 (17%), Positives = 91/233 (39%), Gaps = 22/233 (9%)

Query: 457 ISLSDSQAKALQQARQELFITKQTGEAKQQAQAW----RDAERQGLKAGTQAFREYYQVK 512
L+ + A+Q + L + K +A+++A+A ++AE++ + + Q+K
Sbjct: 113 TELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLK 172

Query: 513 L--QTYKQQEANAEAAKNERNAQSEANSAAKQAGTQAERNARILEDYQQKAALSADSTSD 570
L K+ A +E AK AQ + ++A Q+E E L++ +S
Sbjct: 173 LAEAEEKRLAALSEEAKAVEIAQKKLSAA------QSEVVKMDGE----IKTLNSRLSSS 222

Query: 571 LSREQAILAAKQKLINPTPQQVAQVERDAAAAWDKAAALKAQNAVPERKENADYAAQRKA 630
+ A + N Q A+ + K + +A + + R + A
Sbjct: 223 IHARDAEMKTLAGKRNELAQASAKYKELDELV--KKLSPRANDPLQNRPFFEATRRRVGA 280

Query: 631 LDSLKDQKNANGELIISQEQYNRASEQL-EEQHQVNLAKIRAQQVVSPTQEAQ 682
++++ ++ S+ + NR + + + Q ++ ++ EA+
Sbjct: 281 GKIREEKQK---QVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAE 330


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15035  SHAPEPROTEIN    109  3e-28    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 109 bits (274), Expect = 3e-28
Identities = 81/368 (22%), Positives = 141/368 (38%), Gaps = 68/368 (18%)

Query: 23 GIDLGTTNSLVATVRSGQAETLPD----HQGRYLLPSVVNYHASGLTVGYDARLNAAQDP 78
IDLGT N+L+ G P Q R P V VG+DA+ + P
Sbjct: 14 SIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSV------AAVGHDAKQMLGRTP 67

Query: 79 ANTISSVKRMMGRSLADIQNRYSHLPYQLQASENGLPMIQTAGGLLNPIRVSADILKALA 138
N I++++ M +AD V+ +L+
Sbjct: 68 GN-IAAIRPMKDGVIADF-------------------------------FVTEKMLQHFI 95

Query: 139 ARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYGLDS 197
+ V++ VP +R+ +++A+ AG + L+ EP AAAI GL
Sbjct: 96 KQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPV 155

Query: 198 GQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYLREQAGF-- 255
+ V D+GGGT +++++ L+ V +GGD FD + +Y+R G
Sbjct: 156 SEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLI 210

Query: 256 SDRSDNRLQRELLDAAIAAKIALSDAEAAHVEVGG---WQG-----DITRSQFNDLIAPL 307
+ + R++ E+ A E +EV G +G + ++ + +
Sbjct: 211 GEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP 263

Query: 308 VKRTLMACRRALKDAGVE-AQEVLE--VVMVGGSTRVPLVRERVGEFFGRTPLTSIDPDK 364
+ + A AL+ E A ++ E +V+ GG + + + E G + + DP
Sbjct: 264 LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLT 323

Query: 365 VVAIGAAI 372
VA G
Sbjct: 324 CVARGGGK 331


55  D364_RS15400  D364_RS15450  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS15400       1       21  -3.010652  50S ribosomal protein L19
D364_RS15410       1       21  -3.448663  tRNA (guanosine(37)-N1)-methyltransferase TrmD
D364_RS15415       0       15  -2.608310  ribosome maturation factor RimM
D364_RS15420       1       16  -2.349932  30S ribosomal protein S16
D364_RS15425       1       16  -1.484886  signal recognition particle protein
D364_RS15430       2       16  -0.102391  inner membrane protein YpjD
D364_RS15435       3       14  -0.577105  HlyC/CorC family transporter
D364_RS15440       3       14  -0.344003  nucleotide exchange factor GrpE
D364_RS15445       3       15  -0.612217  NAD(+) kinase
D364_RS15450       2       15  -0.567225  DNA repair protein RecN
56  D364_RS15525  D364_RS15680  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS15525       0       20   3.370394  sugar phosphate isomerase/epimerase
D364_RS15530       0       21   4.049454  isopenicillin N synthase family oxygenase
D364_RS15535       0       20   3.799490  ABC transporter substrate-binding protein
D364_RS15540       0       21   3.871501  amino acid ABC transporter permease/ATP-binding
D364_RS15550      -1       13   0.590046  class II aldolase/adducin family protein
D364_RS15555      -1       13  -1.152665  glyoxylate/hydroxypyruvate reductase A
D364_RS15560       1       18  -4.731525  SDR family NAD(P)-dependent oxidoreductase
D364_RS15565       1       22  -5.496251  hypothetical protein
D364_RS15570       1       25  -6.368678  transporter substrate-binding domain-containing
D364_RS15575       1       24  -5.753414  helix-turn-helix transcriptional regulator
D364_RS15580       0       17  -1.905037  type 1 fimbrial protein
D364_RS15585       0       17  -1.505953  molecular chaperone
D364_RS15590       0       16  -0.845908  fimbrial biogenesis outer membrane usher
D364_RS15595       0       14   0.504546  fimbrial protein
D364_RS15600      -1       14   2.780692  fimbrial protein
D364_RS15605      -1       13   3.324023  fimbrial biogenesis outer membrane usher
D364_RS15610      -1       12   3.352502  molecular chaperone
D364_RS15615      -2       14   4.117859  fimbrial protein
D364_RS15620       0       14   5.099449  aldehyde dehydrogenase
D364_RS15625       0       16   4.208650  sigma-54-dependent Fis family transcriptional
D364_RS15630      -1       19   2.595255  LysR family transcriptional regulator
D364_RS15635       1       18  -0.338738  NAD(P)H-dependent oxidoreductase
D364_RS15640      -1       18  -1.652622  metalloregulator ArsR/SmtB family transcription
D364_RS15645       2       20  -3.295280  rhodanese family protein
D364_RS15650       1       21  -5.099679  lipid A hydroxylase LpxO
D364_RS15655      -1       22   0.015190  hypothetical protein
D364_RS15660       0       20   0.096761  DNA-binding protein StpA
D364_RS15670       0       19   1.647764  L-alanine exporter AlaE
D364_RS15675       1       21   2.015060  DUF2002 family protein
D364_RS15680      -2       14   3.065695  DUF883 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15560  DHBDHDRGNASE     94  5e-25    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 94.0 bits (233), Expect = 5e-25
Identities = 67/258 (25%), Positives = 120/258 (46%), Gaps = 11/258 (4%)

Query: 5 LAGKVALVTASTAGIGFAIAKGLAESGAEVILNGRSEQSVNAAIARLQNEVPGAKARPAI 64
+ GK+A +T + GIG A+A+ LA GA + + + + ++ L+ E A+A PA
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA- 64

Query: 65 ADLSDADG----AAQLLRAVTGVDILVNNPGIYGPQDFYATDDATWDNYWQTNVMSGVRL 120
D+ D+ A++ R + +DILVN G+ P ++ D W+ + N
Sbjct: 65 -DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 121 SRGLLPAMVSKGWGRVVFISSESARNIPADMIHYGVTKTAQLSLARGLAKYVAGSGVTVN 180
SR + M+ + G +V + S A M Y +K A + + L +A + N
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 181 SVLPGPTISDGFAEMLKDDVAKTGKSLEELAKAFVMTHRPSSVIQRAASVAEVANMVVYV 240
V PG T +D + D+ E++ K + T + +++ A +++A+ V+++
Sbjct: 184 IVSPGSTETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238

Query: 241 CSPQASATSGAALRVDGG 258
S QA + L VDGG
Sbjct: 239 VSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15590  PF00577         719  0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 719 bits (1857), Expect = 0.0
Identities = 323/851 (37%), Positives = 459/851 (53%), Gaps = 46/851 (5%)

Query: 20 PADSAERYNAQFVNG-----IDPLAFNQFVASDGDVMPGTYDVNIYINDLLVDSRPVRFS 74
+ + +N +F+ D F ++ PGTY V+IY+N+ + +R V F+
Sbjct: 42 LSSAELYFNPRFLADDPQAVADLSRFEN----GQELPPGTYRVDIYLNNGYMATRDVTFN 97

Query: 75 EDSAHGGLAPCLSAAEYIRYGVKIDD-------DHQPCFALSQTIRQAEQQLDIANHRLN 127
+ G+ PCL+ A+ G+ C L+ I A QLD+ RLN
Sbjct: 98 TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLN 157

Query: 128 IHIPQQYIEHYPRDYVSPMRFDEGINAAFVNYSYS-TDANNGDGGSHQYQYLSLNSGINI 186
+ IPQ ++ + R Y+ P +D GINA +NY++S N GG+ Y YL+L SG+NI
Sbjct: 158 LTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNI 217

Query: 187 ASWRLRNNAYWNKF-----SGQADKWQSIASWAETNIIPWRSRLVVGQTSTDNSVFDSVQ 241
+WRLR+N W+ SG +KWQ I +W E +IIP RSRL +G T +FD +
Sbjct: 218 GAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGIN 277

Query: 242 FRGVQLGTDAEMRPSSQTGFAPVIRGVANSNARVEVRQNNYLIYSENVPAGPFELNDINA 301
FRG QL +D M P SQ GFAPVI G+A A+V ++QN Y IY+ VP GPF +NDI A
Sbjct: 278 FRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYA 337

Query: 302 VNRSGDFYVTVIEADGSQTTFTVAYTTLPQLVRAGQWNYQLSAGKYH-DGADGYAPALMQ 360
SGD VT+ EADGS FTV Y+++P L R G Y ++AG+Y A P Q
Sbjct: 338 AGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQ 397

Query: 361 SSLSYGLNNTFTLYGGALAAENYRAGAFGVGSNLGEIGALSADYTLAGTTLANGQRKQGG 420
S+L +GL +T+YGG A+ YRA FG+G N+G +GALS D T A +TL + + G
Sbjct: 398 STLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQ 457

Query: 421 SVRFLYAKSFLSSKTDFQIAGYRYSTAGYYSLSDAVNERRRWHNGLYENDYWPSDEYESW 480
SVRFLY KS S T+ Q+ GYRYST+GY++ +D R +N ++ +
Sbjct: 458 SVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTD 517

Query: 481 QASAPQHYYTSWFYNKKHRFDISARQTLGKNSAFFLNFSQQNYWNSSGSDISLQAGFNST 540
Y + YNK+ + ++ Q LG+ S +L+ S Q YW +S D QAG N+
Sbjct: 518 --------YYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTA 569

Query: 541 IHNVNYGLYYQNTRSHFTHD-DNSITLRVSIPF-------TLQENRRINTAFTLAHSKSS 592
++N+ L Y T++ + D + L V+IPF + + R + +++++H +
Sbjct: 570 FEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629

Query: 593 GTSGQAGVNGTLLDDDRLSWAVTSAYDD----TSHSTNSASLGYLGQYGNLYTGYAYSKN 648
+ AGV GTLL+D+ LS++V + Y S ST A+L Y G YGN GY++S +
Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689

Query: 649 HRQASLNLSGGVVAHRGGVTLSQPLGSTFALVEAKDAQGVGIENQTGVRIDPFGYAVVPQ 708
+Q +SGGV+AH GVTL QPL T LV+A A+ +ENQTGVR D GYAV+P
Sbjct: 690 IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPY 749

Query: 709 SVPYRVNSVALNPQDFDAFLDVPNAVADTVPTRGAITRVRFDTFRGYSVLIHTTLADGSY 768
+ YR N VAL+ +D+ NAVA+ VPTRGAI R F G +L+ T +
Sbjct: 750 ATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLM-TLTHNNKP 808

Query: 769 PPLGAELYRASGISNGLVGPGGDVYVSGVDSGEKLQMKWGETHQQSCEITLPELRQEPQQ 828
P GA + S S+G+V G VY+SG+ K+Q+KWGE C Q
Sbjct: 809 LPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQ--LPPESQ 866

Query: 829 ATAWRELSLIC 839
+LS C
Sbjct: 867 QQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15595  PF05616          29  0.037    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.037
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 228 PGYYEKTR--PFT-VTYGLVKQGNGSDCGTEPMLATFSTTNTIQESAIILPQPDSGFGIA 284
PGY EK P T V G V NG+ S NT + +I P+PD G A
Sbjct: 262 PGYSEKVEVAPGTKVNMGPVTDRNGNPVQVVATFGRDSQGNTTVDVQVI-PRPDLTPGSA 320

Query: 285 ISPNASMHPLIEMN 298
+PNA PL E++
Sbjct: 321 EAPNA--QPLPEVS 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15605  PF00577         739  0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 739 bits (1909), Expect = 0.0
Identities = 333/853 (39%), Positives = 480/853 (56%), Gaps = 49/853 (5%)

Query: 25 LATVPTMMFCLSPLSRALADDYFDPAALEFADPQQQTSDLHYFAKPGGQQPGTYPVTVVV 84
+ + + A+ YF+P L D Q +DL F PGTY V + +
Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLA--DDPQAVADLSRFENGQELPPGTYRVDIYL 84

Query: 85 NDQELGQADITFV--DDGGQLRPVLTPGQLAEYGVNVSAFPAFQALHEGETFTRIEKFIP 142
N+ + D+TF D + P LT QLA G+N ++ L + + I
Sbjct: 85 NNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP-LTSMIH 143

Query: 143 DASSRFSFANQRLTLSIPQAAMNVQSRGYVDPSRWDDGVPAAFVDYYFSGAQIKNADEGE 202
DA+++ QRL L+IPQA M+ ++RGY+ P WD G+ A ++Y FSG ++N G
Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN-RIGG 202

Query: 203 SSRSNYLNLRSGLNLGAWRLRNISSMQYDQ------QRRHWDTQSTWLQRDVRSLKSLLR 256
+S YLNL+SGLN+GAWRLR+ ++ Y+ + W +TWL+RD+ L+S L
Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 257 IGDTYTTGDVFDSIQFRGVQLMSDDEMLPDSQRGFAPTIRGVAHSNAKVTVSQHGYVIYE 316
+GD YT GD+FD I FRG QL SDD MLPDSQRGFAP I G+A A+VT+ Q+GY IY
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 317 TFVSPGAFAISDLYPTSQSGDLEVKVTESNGAVRTFTQPYSAVPYMLREGRGKFSLSAGR 376
+ V PG F I+D+Y SGDL+V + E++G+ + FT PYS+VP + REG ++S++AG
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 377 YHSGGESVRSPEFLQGTLFYGLTAGFTLYGGTQLARDYQAWALGLGRGFGEFGSLGGDVT 436
Y SG P F Q TL +GL AG+T+YGGTQLA Y+A+ G+G+ G G+L D+T
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442

Query: 437 QAVTRTPSGKRYTGHSLRAQYQKNFVSSGTAFSLASYRYSSSGYYDFAEASALESAQGQV 496
QA + P ++ G S+R Y K+ SGT L YRYS+SGY++FA+ + +
Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502

Query: 497 D--------------------NRRRREELSVSQSLGGLGSLAVSAWSQEYWHRQSRDETV 536
+ N+R + +L+V+Q LG +L +S Q YW + DE
Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQF 562

Query: 537 HLGFYSAWKGISWGVGYYYTRTSGQQKNDRSWSFNINIPLGGPLSDSA--------VSYN 588
G +A++ I+W + Y T+ + Q+ D+ + N+NIP L + SY+
Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYS 622

Query: 589 TTSDSNGYTSQQMSLYGAVPTRPNLFYSVQQGYGNQGRGSNSS---ASLDYHGGFGNAQI 645
+ D NG + +YG + NL YSVQ GY G G++ S A+L+Y GG+GNA I
Sbjct: 623 MSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANI 682

Query: 646 GYRHDAASNQLTWGGAGSVVAHPHGVTFGQTVGESFAIVRAPGAAGVAVQNGNNVHTDWR 705
GY H QL +G +G V+AH +GVT GQ + ++ +V+APGA V+N V TDWR
Sbjct: 683 GYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR 742

Query: 706 GYAVVPSLTAYRKNVITLDTESMADDTDVDQQGQTVIPGGGAVVMANYQTHIGNRVLFTL 765
GYAV+P T YR+N + LDT ++AD+ D+D V+P GA+V A ++ +G ++L TL
Sbjct: 743 GYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL 802

Query: 766 RNAQGPLPFGASARLVKEEESGNPPGGMVADGGQVYLSGVPQEGTLAVSWIVNNQSQSCT 825
+ PLPFGA V E S + G+VAD GQVYLSG+P G + V W ++ C
Sbjct: 803 THNNKPLPFGAM---VTSESSQSS--GIVADNGQVYLSGMPLAGKVQVKWG-EEENAHCV 856

Query: 826 LHFHLPDNPQQSL 838
++ LP QQ L
Sbjct: 857 ANYQLPPESQQQL 869


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15625  HTHFIS          285  4e-92    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 285 bits (732), Expect = 4e-92
Identities = 121/366 (33%), Positives = 173/366 (47%), Gaps = 52/366 (14%)

Query: 268 LTTPQGRYHYRLREPTRRRVAVSAPPAMHLPFTSPREGEKLLRLLNAGIALCIEGETGSG 327
L P+ R + V AM + L RL+ + L I GE+G+G
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY------RVLARLMQTDLTLMITGESGTG 172

Query: 328 KEYVSRTLHQHSRWRSGKFVAINCAAIPESLIESELFGYQPGAFTGASKNGYIGKIREAD 387
KE V+R LH + + R+G FVAIN AAIP LIESELFG++ GAFTGA G+ +A+
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRS-TGRFEQAE 231

Query: 388 GGVLFLDEIGDMPLALQTRLLRVLQEKEVAPLGASRSVPVNFALICATHRNLTQRVSAGE 447
GG LFLDEIGDMP+ QTRLLRVLQ+ E +G + + ++ AT+++L Q ++ G
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGL 291

Query: 448 FREDLLWRLREYALALPPLREWS----ALETFIATLWHDLGGASRRVTLSNALLVHLSQL 503
FREDL +RL L LPPLR+ + L G +R L +
Sbjct: 292 FREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKR--FDQEALELMKAH 349

Query: 504 PWPGNVRQLQSVLKVMLALADEGDTLTPDALPEAYRAAPAPLPRGG-------------- 549
PWPGNVR+L+++++ + AL D +T + + R+ P
Sbjct: 350 PWPGNVRELENLVRRLTALY-PQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAV 408

Query: 550 ------------------------LQAHDEQLIVDTLARVNGNVSRAAQILGIARSTLYR 585
L + LI+ L GN +AA +LG+ R+TL +
Sbjct: 409 EENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRK 468

Query: 586 RAARAG 591
+ G
Sbjct: 469 KIRELG 474


57  D364_RS15785  D364_RS15845  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias    %GCBias  Product
D364_RS15785       2       18  -0.428519  DedA family protein
D364_RS15790       3       23  -0.710872  fructose-1-phosphate/6-phosphogluconate
D364_RS15795       4       22  -0.631974  BON domain-containing protein
D364_RS15815       3       21  -0.008731  ***carbon storage regulator CsrA
D364_RS15820       1       17   1.084556  alanine--tRNA ligase
D364_RS15825       0       14   2.398644  recombination regulator RecX
D364_RS15830      -1       15   2.625865  recombinase RecA
D364_RS15835       0       16   3.238065  nicotinamide-nucleotide amidase
D364_RS15840      -1       15   3.282943  metal ABC transporter substrate-binding protein
D364_RS15845      -1       16   3.708760  metal ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15820  LUXSPROTEIN      29  0.035    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 29.5 bits (66), Expect = 0.035
Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 6/89 (6%)

Query: 562 LNHSATHLMHAALRQVLGTHVAQKGSLVNDKALRFDFSHFEAMKPEEIRAVEDLVNAQIR 621
++H+ M+A +V T KG + LRF + + + + I +E L +R
Sbjct: 8 VDHTR---MNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYAGFMR 64

Query: 622 RNLAIET-NIMDID--AARASGAMALFGE 647
+L ++ I+DI R M+L G
Sbjct: 65 NHLNGDSVEIIDISPMGCRTGFYMSLIGT 93


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15840  adhesinb        237  2e-79    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 237 bits (606), Expect = 2e-79
Identities = 92/308 (29%), Positives = 170/308 (55%), Gaps = 17/308 (5%)

Query: 1 MKRSAIVVALALGLMAQGAMAKT----------LNVVSSFSVLGDIAQQVGGEHVHVDTL 50
MK+ +V L L + A + LNVV++ S++ DI + + G+ +++ ++
Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60

Query: 51 VGPDGDPHTFEPSPKDSALLSKADVVVVNGLGLE----GWLDRLIKASGFKGE--LVVAS 104
V DPH +EP P+D S+AD++ NG+ LE W +L++ + K S
Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120

Query: 105 KGVKTHTLDEEGKTVT-DPHAWNSAANGALYAQNILEGLVKADPEDKAALTSSGKRYIDQ 163
+GV L+ + + DPHAW + NG +YAQNI + L + DP +K + K Y+++
Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180

Query: 164 LTSLDGWAKAQFSAIPLAKRKVLTSHDAFGYFGRAYHVTFLAPQGLSSESEASAAQVAAL 223
L++LD AK +F+ IP K+ ++TS F YF +AY+V +++E E + Q+ L
Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240

Query: 224 IKQIKADGVHTWFMENQLDPRLVKQIASATGAQPGGELYPEALSKPGGVADSYVKMMRHN 283
+++++ V + F+E+ +D R +K ++ T +++ +++++ G DSY MM++N
Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300

Query: 284 VELIAKSM 291
+E IA+ +
Sbjct: 301 LEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS15845  TYPE3IMSPROT  28     0.040    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 28.2 bits (63), Expect = 0.040
Identities = 14/74 (18%), Positives = 27/74 (36%), Gaps = 3/74 (4%)

Query: 19 ALVVCLALSLSTTMLGVFLLLRRMSLMGDALSHAILP-GVAVGYLLSGMSLLAMTLGG-- 75
+ + +ALS L + LM + LP A+ Y++ + L L
Sbjct: 31 STALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPL 90

Query: 76 FIAGIVVALVAGWV 89
++A+ + V
Sbjct: 91 LTVAALMAIASHVV 104


58  D364_RS15890  D364_RS16000  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS15890  0       13       4.185662    arabinose-5-phosphate isomerase GutQ
D364_RS15895  0       14       4.096505    nitric oxide reductase transcriptional regulator
D364_RS15905  -1      14       3.795985    anaerobic nitric oxide reductase
D364_RS15910  0       15       4.715593    NADH:flavorubredoxin reductase NorW
D364_RS15915  -1      15       3.999094    HoxN/HupN/NixA family nickel/cobalt transporter
D364_RS15920  -1      15       4.155057    carbamoyltransferase HypF
D364_RS15925  -1      17       2.186032    electron transport protein HydN
D364_RS15930  -1      18       2.114521    LacI family DNA-binding transcriptional
D364_RS15940  0       18       2.050067    PTS cellobiose/arbutin/salicin transporter
D364_RS15945  -1      18       1.306336    6-phospho-beta-glucosidase
D364_RS15950  1       21       2.413925    type I toxin-antitoxin system SymE family toxin
D364_RS15955  0       22       2.547271    hydrogenase maturation peptidase HycI
D364_RS15960  0       23       3.758907    formate hydrogenlyase maturation HycH family
D364_RS15965  0       22       4.058263    NADH-quinone oxidoreductase subunit B family
D364_RS15970  1       23       3.694413    formate hydrogenlyase complex iron-sulfur
D364_RS15975  2       21       3.690499    formate hydrogenlyase subunit HycE
D364_RS15980  2       15       4.131599    respiratory chain complex I subunit 1 family
D364_RS15985  2       17       4.126760    formate hydrogenlyase subunit 3
D364_RS15990  1       19       3.057743    4Fe-4S dicluster domain-containing protein
D364_RS15995  0       18       3.923428    formate hydrogenlyase regulator HycA
D364_RS16000  -1      18       3.831112    hydrogenase maturation nickel metallochaperone
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS15895  HTHFIS  372    e-126    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 372 bits (956), Expect = e-126
Identities = 135/390 (34%), Positives = 197/390 (50%), Gaps = 38/390 (9%)

Query: 149 IAALVAGALN----------NALLIARLEAQNVLPAQAVNYPLPERQEIIGLSGPMLQLK 198
I A GA + +I R A+ + + ++G S M ++
Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150

Query: 199 KEIDIVAASDLNVLISGETGTGKELVAKAVHQGSPRAANPLVYLNCAALPESVAESELFG 258
+ + + +DL ++I+GE+GTGKELVA+A+H R P V +N AA+P + ESELFG
Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210

Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318
H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 319 VDVRVLAATNRDLRQEVVEGRFRADLYHRLSVFPLSVPPLRERESDVVLLAGYFCEQCRL 378
DVR++AATN+DL+Q + +G FR DLY+RL+V PL +PPLR+R D+ L +F +Q
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329

Query: 379 RMGLARVILAEAARNRLQKWSWPGNVRELEHAIHRAVVLARATQAGDEVVLEPQHFQFAV 438
+ GL + A ++ WPGNVRELE+ + R L D + E +
Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYP----QDVITREIIENELRS 385

Query: 439 AAPMLPTETAAAAPATGNIN-----------------------LREATDSFQREAISRAL 475
P P E AAA + +I+ + I AL
Sbjct: 386 EIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAAL 445

Query: 476 EANQGNWAATARALELDVANLHRLAKRLGL 505
A +GN A L L+ L + + LG+
Sbjct: 446 TATRGNQIKAADLLGLNRNTLRKKIRELGV 475


59  D364_RS16060  D364_RS16145  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS16060  1       14       3.354745    PTS lactose/cellobiose transporter subunit IIA
D364_RS16065  1       14       3.284819    ROK family protein
D364_RS16070  1       14       2.691791    LacI family transcriptional regulator
D364_RS16075  2       15       1.924370    metal ABC transporter substrate-binding protein
D364_RS16080  2       14       -0.046206   manganese/iron ABC transporter ATP-binding
D364_RS16085  1       13       0.815326    iron/manganese ABC transporter permease subunit
D364_RS16090  1       12       0.555999    metal ABC transporter permease
D364_RS16095  0       15       0.710099    nitrous oxide-stimulated promoter family
D364_RS16100  0       15       1.308586    LysR family transcriptional regulator
D364_RS16105  -1      16       2.223066    AEC family transporter
D364_RS16110  0       18       4.330328    thiamine pyrophosphate-requiring protein
D364_RS16115  1       19       4.636770    SelT/SelW/SelH family protein
D364_RS16120  1       16       3.170512    TonB-dependent receptor
D364_RS16125  2       13       2.236737    hemin-degrading factor
D364_RS16130  2       19       -0.869559   hemin ABC transporter substrate-binding protein
D364_RS16135  -1      12       0.518506    iron ABC transporter permease
D364_RS16140  -2      14       -1.864471   heme ABC transporter ATP-binding protein
D364_RS16145  -2      16       -3.795815   Hcp family type VI secretion system effector
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
D364_RS16075  adhesinb  330    e-116    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 330 bits (848), Expect = e-116
Identities = 86/296 (29%), Positives = 165/296 (55%), Gaps = 7/296 (2%)

Query: 9 SLLLASALALLAATPASAQEKFRVITTFTVIADMAQNVAGDAAVVSSITKPGAEIHDYQP 68
+ + +A + ++ + K V+ T ++IAD+ +N+AGD + SI G + H+Y+P
Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72

Query: 69 TPGDIKRAQGAQLILSNGLNLER----WFARFYQHLQGVPE---VVVSEGIQPMGISAGP 121
P D+K+ A LI NG+NLE WF + ++ + VSEG+ + +
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 122 YSGKPNPHAWMSADNALIYVDNIRDALVKYDPPHADTYRRNAEAYKEKIRQTMAPLQARL 181
GK +PHAW++ +N +IY NI L + DP + +TY +N +AY EK+ + +
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192

Query: 182 AQLPADKRWLVTSEGAFSYLARDYGLRELYLWPINADQQGTPQQVRKVIDTMKKERIPTI 241
+P +K+ +VTSEG F Y ++ Y + Y+W IN +++GTP Q++ +++ ++K ++P++
Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252

Query: 242 FSESTISDKPARQVAREAGAHYGGVLYVDSLSAADGPVPTWLDLLRVTTETIVNGI 297
F ES++ D+P + V+++ ++ DS++ ++ +++ E I G+
Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS16130  FERRIBNDNGPP  36     7e-05    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 36.5 bits (84), Expect = 7e-05
Identities = 44/191 (23%), Positives = 75/191 (39%), Gaps = 16/191 (8%)

Query: 52 PPAAQKLPDVGYLRQLNAEGILALRPQLVLASAQAQPSLVLHKVQASGVKVVNVPGGESL 111
PP + DVG + N E + ++P ++ SA PS + A G G + L
Sbjct: 72 PPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPL 131

Query: 112 SAIDNKVAVIAEALGKTAAGDALRQQLQQQIAAIPTQPV---AKRVLFILSHGGMNTLVA 168
+ + +A+ L +A + Q + I ++ + V A+ +L + LV
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 169 GQHTAADGAIRAAGLQNAMQG---FDHYRAMSQEGVAA-SQADLVVISADGLKGMGGEAG 224
G ++ + G+ NA QG F A+S + +AA D++ D K M
Sbjct: 192 GPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDM----- 246

Query: 225 LWKLPGLAQTP 235
L TP
Sbjct: 247 ----DALMATP 253


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS16140  PF05272  28     0.048    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.1 bits (62), Expect = 0.048
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 4/42 (9%)

Query: 15 GKRQIIDNVSVALRGG----EMTALIGPNGAGKSTLLRLLTG 52
GK ++ +V+ + G L G G GKSTL+ L G
Sbjct: 577 GKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVG 618


60  D364_RS16400  D364_RS16455  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS16400  -2      11       3.100408    aliphatic sulfonate ABC transporter
D364_RS16405  -1      13       3.221977    acyl-CoA dehydrogenase family protein
D364_RS16410  -1      13       4.218669    MFS transporter
D364_RS16415  0       14       5.281992    methionine ABC transporter substrate-binding
D364_RS16420  2       15       6.097290    ABC transporter permease
D364_RS16425  2       14       6.647147    ATP-binding cassette domain-containing protein
D364_RS16430  1       16       6.585135    LLM class flavin-dependent oxidoreductase
D364_RS16435  0       15       6.856432    FAD-dependent oxidoreductase
D364_RS16440  -1      15       5.327876    FMNH2-dependent alkanesulfonate monooxygenase
D364_RS27565  -1      15       3.479464    hypothetical protein
D364_RS16450  -2      15       3.657734    sigma-54-dependent Fis family transcriptional
D364_RS16455  -1      15       3.232554    flap endonuclease Xni
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS16410  TCRTETA  40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.2 bits (94), Expect = 1e-05
Identities = 27/119 (22%), Positives = 53/119 (44%), Gaps = 4/119 (3%)

Query: 40 VFLALGGVFLDAYDLTTLSYGIDDVVREFQLSPLL---TGLVTSSIMVGTIVGNIIGGWL 96
+ + L V LDA + + + ++R+ S + G++ + + + G L
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 97 TDKYGRYSVFMADMFFFVISAIAAGLAPNVWVLIGARFLMGIGVGIDLPVAMSYLAEFS 155
+D++GR V + + + AP +WVL R + GI G VA +Y+A+ +
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS16450  HTHFIS  378    e-127    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 378 bits (972), Expect = e-127
Identities = 141/356 (39%), Positives = 196/356 (55%), Gaps = 24/356 (6%)

Query: 4 PESPSTAPALI--DPASKAFQSLLDKLAPTEATVLIVGETGTGKEVVARYLHHHSARRQQ 61
+ L+ A + +L +L T+ T++I GE+GTGKE+VAR LH + RR
Sbjct: 130 EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNG 189

Query: 62 PFLAVNCGALTESLAEAELFGHEKGAFTGAQQGQPGWFEAAEGGTLLLDEIGELSLPLQV 121
PF+A+N A+ L E+ELFGHEKGAFTGAQ G FE AEGGTL LDEIG++ + Q
Sbjct: 190 PFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQT 249

Query: 122 KLLRVLQEREITRVGSRKAIKVNVRVIAATHVDLAQAIRERRFREDLYYRLNIAVVPLPP 181
+LLRVLQ+ E T VG R I+ +VR++AAT+ DL Q+I + FREDLYYRLN+ + LPP
Sbjct: 250 RLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPP 309

Query: 182 LRQRRQDIPLLAHHFLSLYARRLGRPTLRLAPESLARLMDYSWPGNIRELENTLHNAVLL 241
LR R +DIP L HF+ + G R E+L + + WPGN+RELEN + L
Sbjct: 310 LRDRAEDIPDLVRHFVQQA-EKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTAL 368

Query: 242 SKEEEISPAQLRLATLNDAP-----------------GPASDHELDDFIRHQLALPGEPL 284
++ I+ + ++ P ++ F ALP L
Sbjct: 369 YPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL 428

Query: 285 WQRVTSA----LIRHAMAHCDDNQSQAAALLGISRHTLRTQLANLGLIKSRRRPPA 336
+ RV + LI A+ NQ +AA LLG++R+TLR ++ LG+ R A
Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRSA 484


61  D364_RS16515  D364_RS16625  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS16515  -1      14       4.110329    YgdI/YgdR family lipoprotein
D364_RS16520  0       14       4.711237    cysteine desulfurase CsdA
D364_RS16525  0       16       4.327588    cysteine desulfurase sulfur acceptor subunit
D364_RS16530  1       18       4.154638    tRNA cyclic N6-threonylcarbamoyladenosine(37)
D364_RS16535  2       18       4.131547    murein transglycosylase A
D364_RS16555  3       21       4.572417    ***nicotinate-nucleotide--dimethylbenzimidazole
D364_RS16560  3       20       3.909992    bifunctional adenosylcobinamide
D364_RS16565  3       21       4.361911    cobyric acid synthase
D364_RS16570  2       20       3.681002    energy-coupling factor ABC transporter
D364_RS16575  3       18       4.417188    energy-coupling factor ABC transporter
D364_RS16580  2       19       4.185591    energy-coupling factor ABC transporter
D364_RS16585  1       19       5.253099    cobalt ECF transporter S component CbiM
D364_RS16590  -1      19       5.416778    cobalt-factor II C(20)-methyltransferase
D364_RS16595  -1      19       5.131418    sirohydrochlorin cobaltochelatase
D364_RS16600  -1      20       6.035809    cobalt-precorrin-6A reductase
D364_RS16605  -1      20       5.449294    precorrin-3B C(17)-methyltransferase
D364_RS16610  -1      21       6.031449    cobalt-precorrin 5A hydrolase
D364_RS16615  1       19       5.631395    cobalt-precorrin-4 methyltransferase
D364_RS16620  2       17       5.355026    decarboxylating cobalt-precorrin-6B
D364_RS16625  0       15       3.588651    cobalt-precorrin-7 (C(5))-methyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS16605  PF05272  30     0.007    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.007
Identities = 9/58 (15%), Positives = 17/58 (29%), Gaps = 1/58 (1%)

Query: 152 ADFVICFYNPRSRGREGHLARAFTLLAASKSADTPVGVVKSAGRKKQEKWLTTLGEMD 209
D+ + G+ + L S + +G K + + L EM
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFD-IGTGKDSYEQIAGIVAYELSEMT 651


62  D364_RS16690  D364_RS16810  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS16690  0       19       3.648751    glycerol dehydratase reactivase beta/small
D364_RS16695  2       18       4.476050    propanediol utilization microcompartment protein
D364_RS16700  0       18       4.019084    BMC domain-containing protein
D364_RS16705  2       16       4.192235    phosphate propanoyltransferase
D364_RS16710  1       15       4.818857    microcompartment protein PduM
D364_RS16715  0       15       4.571796    ethanolamine utilization protein EutN
D364_RS16720  0       15       4.406673    two-domain cob(I)yrinic acid a,c-diamide
D364_RS16725  0       14       4.102213    CoA-acylating propionaldehyde dehydrogenase
D364_RS16730  0       13       4.370263    1-propanol dehydrogenase PduQ
D364_RS16735  1       16       4.525803    SLBB domain-containing protein
D364_RS16740  2       19       4.499684    propanediol utilization microcompartment protein
D364_RS16745  0       17       3.720128    propanediol utilization microcompartment protein
D364_RS16750  0       17       2.952615    propanediol utilization protein PduV
D364_RS16755  1       16       2.723835    acetate/propionate family kinase
D364_RS16760  2       16       4.371082    L-threonine kinase
D364_RS16765  1       16       4.476866    threonine-phosphate decarboxylase
D364_RS16770  0       14       3.376499    N-acetylmuramoyl-L-alanine amidase AmiC
D364_RS16775  2       15       3.579774    amino-acid N-acetyltransferase
D364_RS16780  2       15       4.043108    hypothetical protein
D364_RS16785  2       16       4.360709    exodeoxyribonuclease V subunit alpha
D364_RS16790  2       15       3.910640    exodeoxyribonuclease V subunit beta
D364_RS16795  2       14       3.323261    pitrilysin
D364_RS16800  3       17       4.026998    exodeoxyribonuclease V subunit gamma
D364_RS16805  0       18       3.580917    prepilin-type N-terminal cleavage/methylation
D364_RS16810  1       16       3.711699    DUF2509 family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS16755  ACETATEKNASE  558    0.0      Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 558 bits (1439), Expect = 0.0
Identities = 198/395 (50%), Positives = 270/395 (68%), Gaps = 5/395 (1%)

Query: 4 KIMAINAGSSSLKFQLLNMPQGALLCQGLIERIGLPEARFTLKTSAQKWQETLPIADHHE 63
KI+ IN GSSSLK+QL+ G +L +GL ERIG+ ++ T + +K + + DH +
Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61

Query: 64 AVTLLLEALTGR--GILSSLQEIDGVGHRVAHGGERFKDAALVCDDTLREIERLAELAPL 121
A+ L+L+AL G++ + EID VGHRV HGGE F + L+ DD L+ I ELAPL
Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121

Query: 122 HNPVNALGIRLFRQLLPAVPAVAVFDTAFHQTLAPEAWLYPLPWRYYAELGIRRYGFHGT 181
HNP N GI+ Q++P VP VAVFDTAFHQT+ A+LYP+P+ YY + IR+YGFHGT
Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181

Query: 182 SHHYVSSALAEKLGVPLSALRVVSCHLGNGCSVCAIKGGQSVNTSMGFTPQSGVMMGTRS 241
SH YVS AE L P+ +L++++CHLGNG S+ A+K G+S++TSMGFTP G+ MGTRS
Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241

Query: 242 GDIDPSILPWLVEKEGKSAQQLSQLLNNESGLLGVSGVSSDYRDVEQAADA-GNERAALA 300
G IDPSI+ +L+EKE SA+++ +LN +SG+ G+SG+SSD+RD+E AA G++RA LA
Sbjct: 242 GSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLA 301

Query: 301 LSLFAERIRATIGSYIMQMGGLDALIFTGGIGENSARARATICRNLHFLGLALDDEKNQR 360
L++FA R++ TIGSY MGG+D ++FT GIGEN R I L FLG LD EKN+
Sbjct: 302 LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKV 361

Query: 361 SA--TFIQADNALVKVAVINTNEELMIARDVMRLA 393
I ++ V V V+ TNEE MIA+D ++
Sbjct: 362 RGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIV 396


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS16805  BCTERIALGSPH  26     0.035    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 25.7 bits (56), Expect = 0.035
Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 7 RQRGFSLPETVLAMALMVLTVTA 29
RQRGF+L E +L + LM ++
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGM 24


63  D364_RS17015  D364_RS17045  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS17015  -2      20       -5.267932   amino acid permease
D364_RS17020  -1      21       -5.461465   aldo/keto reductase family oxidoreductase
D364_RS17025  1       21       -5.154111   transcriptional activator MrkH
D364_RS17030  1       15       -4.416157   transcriptional regulator MrkI
D364_RS17035  2       15       -3.243976   phosphodiesterase MrkJ
D364_RS17040  4       14       -2.625884   type 3 fimbria minor subunit MrkF
D364_RS17045  4       13       -1.617313   type 3 fimbria adhesin subunit MrkD
64  D364_RS17100  D364_RS17170  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS17100  2       14       0.928801    type 1 fimbrial major subunit FimA
D364_RS17110  3       14       1.930566    type 1 fimbrial protein
D364_RS17115  3       15       2.226203    fimbria/pilus periplasmic chaperone
D364_RS17120  2       14       2.335184    fimbrial biogenesis usher protein
D364_RS17130  -1      13       1.246025    type 1 fimbrial protein
D364_RS17135  -1      13       0.670435    type 1 fimbrial protein
D364_RS17140  -1      15       1.361947    fimbrial protein
D364_RS17145  -1      12       3.036599    EAL domain-containing protein
D364_RS17150  -1      15       4.083321    N-acetyltransferase
D364_RS17155  -1      14       4.330985    (4Fe-4S)-binding protein
D364_RS17160  -1      14       3.966980    hypothetical protein
D364_RS17165  -1      14       4.128799    multidrug transporter subunit MdtN
D364_RS17170  0       13       3.610438    multidrug efflux transporter permease subunit
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17125  PF00577  977    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 977 bits (2526), Expect = 0.0
Identities = 666/866 (76%), Positives = 758/866 (87%), Gaps = 6/866 (0%)

Query: 11 LGCRTARRLVSPALALWLC------SQPFAARADLYFNPRFLADDPAAVADLSGFEKGQE 64
C R+ + L +Q + A+LYFNPRFLADDP AVADLS FE GQE
Sbjct: 13 TQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQE 72

Query: 65 VPPGTYRVDIYLNNGFMTTRDVTFQADAQGHGLSPCLTRGQLASMGVDTGRVPGMATLDS 124
+PPGTYRVDIYLNNG+M TRDVTF G+ PCLTR QLASMG++T V GM L
Sbjct: 73 LPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLAD 132

Query: 125 TACVPLTTLISEATTRFDVGQQRLYLTVPQAFMGNQARGYIPPELWDNGITAGLINYNFT 184
ACVPLT++I +AT + DVGQQRL LT+PQAFM N+ARGYIPPELWD GI AGL+NYNF+
Sbjct: 133 DACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFS 192

Query: 185 GNNAHNTTGGSSRYAYLNLQSGLNIGAWRLRDNSTWSYSSGGSTSSNENRWQHVNSWLER 244
GN+ N GG+S YAYLNLQSGLNIGAWRLRDN+TWSY+S S+S ++N+WQH+N+WLER
Sbjct: 193 GNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLER 252

Query: 245 DITPLRSRLTLGDSYTNGDVFDGINFRGAQLASDDNMLPDSQKGFAPVIHGIARGTAQVS 304
DI PLRSRLTLGD YT GD+FDGINFRGAQLASDDNMLPDSQ+GFAPVIHGIARGTAQV+
Sbjct: 253 DIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312

Query: 305 IRQNGYEIYQSTVPPGPFTIDDLYAAGNGGDLQVTIKEADGSRQVFSVPWSTVPVLQREG 364
I+QNGY+IY STVPPGPFTI+D+YAAGN GDLQVTIKEADGS Q+F+VP+S+VP+LQREG
Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372

Query: 365 HTRFALTAGEYRSGNSQQETPDFFQGTAMHGLPAGWTLYGGTQLADRYRAFNLGVGKNMG 424
HTR+++TAGEYRSGN+QQE P FFQ T +HGLPAGWT+YGGTQLADRYRAFN G+GKNMG
Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432

Query: 425 YFGALSLDITQANATLADDSEHQGQSVRFLYNKSLDETGTNLQLVGYRYSTRGYYNFADT 484
GALS+D+TQAN+TL DDS+H GQSVRFLYNKSL+E+GTN+QLVGYRYST GY+NFADT
Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492

Query: 485 TYRRMSGYSVETQDGVIQVKPKFTDYYNLAYSKRGKVQLSVTQQLGRTATLYLSGSHQTY 544
TY RM+GY++ETQDGVIQVKPKFTDYYNLAY+KRGK+QL+VTQQLGRT+TLYLSGSHQTY
Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTY 552

Query: 545 WGTDDADEQLQAGLNAAVDDINWSLSYSLTKNAWQQGRDQMLAININIPFSHWLRSDSRS 604
WGT + DEQ QAGLN A +DINW+LSYSLTKNAWQ+GRDQMLA+N+NIPFSHWLRSDS+S
Sbjct: 553 WGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKS 612

Query: 605 VWRHASASYSLSHDLNGRMTNLAGLYGTLLEDNNLSYSVQTGYAGGGNGDNGSTGYTALN 664
WRHASASYS+SHDLNGRMTNLAG+YGTLLEDNNLSYSVQTGYAGGG+G++GSTGY LN
Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672

Query: 665 YRGGYGNANVGYSRSDGFKQLYYGVSGGVLAHANGITLSQPLNDTVVLVKAPGAGGVKVE 724
YRGGYGNAN+GYS SD KQLYYGVSGGVLAHANG+TL QPLNDTVVLVKAPGA KVE
Sbjct: 673 YRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVE 732

Query: 725 NQTGVRTDWRGYAVLPYATEYRENRIALDTNTLADNVDLDDAVVSVVPTHGAIVRANFNA 784
NQTGVRTDWRGYAVLPYATEYRENR+ALDTNTLADNVDLD+AV +VVPT GAIVRA F A
Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792

Query: 785 QVGMKILMTLTHRGKPVPFGALATGDSNQSGSIVADNGQVYLSGMPLAGKVRVKWGDGPD 844
+VG+K+LMTLTH KP+PFGA+ T +S+QS IVADNGQVYLSGMPLAGKV+VKWG+ +
Sbjct: 793 RVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852

Query: 845 AQCVADYRLPPESQQQALSQLSVACR 870
A CVA+Y+LPPESQQQ L+QLS CR
Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17150  SACTRNSFRASE  28     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.002
Identities = 11/55 (20%), Positives = 23/55 (41%)

Query: 11 YVNDAQGNQVAEIVFVPTGEHLSIIEHTDVDPSLKGQGVGKQLVAKVVEKMRQEQ 65
++ + N + I ++IE V + +GVG L+ K +E ++
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS17165  RTXTOXIND  68     7e-15    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 68.3 bits (167), Expect = 7e-15
Identities = 49/362 (13%), Positives = 106/362 (29%), Gaps = 81/362 (22%)

Query: 11 KKWPLLALVLAAILALILVIWQL-----QTSPETNDAYVYADTIDVVPEVSGRIVEMPIR 65
+ P L +I I + + + ++ P + + E+ ++
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 66 DNQRVKKGDLLFRIDPRP---------------------YQAMLDDA------------- 91
+ + V+KGD+L ++ YQ +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 92 ------------------KARLTTLDAQIMLTQRTIKAQEYNAQSVAAAVERARALVKQT 133
K + +T Q + + + +V A + R L +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 134 TSTRIRLEPLVPQGFASQEDLDQARTAEKAARAELEATLLQAKQASAAVTGVDAMVAQRA 193
S L+ + ++ + + A EL Q +Q + +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 194 GVL-------------------AQIALAELHLEFTEVRAPFNGVVVALKT-TVGQYASAL 233
+ ++A E + + +RAP + V LK T G +
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 KPVFTLL-DDDRWYVIANFRETDLNNVRPGVAARITVMT-NHNRT--FNGVVDSVGSGVL 289
+ + ++ +DD V A + D+ + G A I V + R G V ++ +
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 290 PE 291
+
Sbjct: 414 ED 415


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17170  TYPE3IMSPROT  29     0.049    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.3 bits (66), Expect = 0.049
Identities = 17/109 (15%), Positives = 41/109 (37%), Gaps = 13/109 (11%)

Query: 394 LASLLALLLIVFVQPWTDSLTGLLAMSLPV---LALAAWIAAGSERIAYAGIQIGFTFA- 449
+ L+ + P++ +L+ ++ L L A IA +Q GF +
Sbjct: 53 FSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISG 112

Query: 450 ---------LAFLSWFAPLTNLTELRDRVLGILLGVLVSSIVHLYLWPD 489
+ + + ++ L + + IL VL+S ++ + + +
Sbjct: 113 EAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGN 161


65  D364_RS17215  D364_RS17515  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS17215  -1      25       -3.409535   LysR family transcriptional regulator
D364_RS17220  0       27       -3.685317   hypothetical protein
D364_RS17225  0       27       -3.482562   NAD(P)H-dependent oxidoreductase
D364_RS17230  0       28       -3.584594   aldo/keto reductase
D364_RS17235  0       34       -5.308432   carboxymuconolactone decarboxylase family
D364_RS17240  1       36       -6.622634   SDR family oxidoreductase
D364_RS17245  0       34       -6.773567   cyclic diguanylate phosphodiesterase
D364_RS17250  1       34       -7.930227   hypothetical protein
D364_RS17255  1       36       -8.931205   CrcB family protein
D364_RS17260  2       37       -10.133486  fluoride efflux transporter CrcB
D364_RS17265  2       37       -10.348462  response regulator
D364_RS17270  4       40       -11.420631  acid-sensing system DNA-binding response
D364_RS17275  5       39       -10.870159  EmrA/EmrK family multidrug efflux transporter
D364_RS17280  6       41       -11.770338  DHA2 family efflux MFS transporter permease
D364_RS17285  5       33       -9.108293   DUF308 domain-containing protein
D364_RS17290  5       30       -7.990600   acid-activated periplasmic chaperone HdeB
D364_RS17295  4       30       -7.591566   ammonium transporter
D364_RS17300  3       27       -7.429823   urease subunit gamma
D364_RS17305  3       28       -7.449130   urease subunit beta
D364_RS17310  3       28       -6.591901   urease subunit alpha
D364_RS17315  3       34       -6.984194   urease accessory protein UreE
D364_RS17320  4       35       -6.347390   urease accessory protein UreF
D364_RS17325  4       36       -6.038117   urease accessory protein UreG
D364_RS17330  3       36       -5.284478   urease accessory protein UreD
D364_RS17335  3       36       -4.782803   urea transporter
D364_RS17340  2       37       -5.263982   ABC transporter substrate-binding protein
D364_RS17345  3       39       -5.425020   ABC transporter permease
D364_RS17350  2       38       -5.610343   ABC transporter permease
D364_RS17355  3       41       -7.418344   ATP-binding cassette domain-containing protein
D364_RS17360  2       49       -10.413941  ATP-binding cassette domain-containing protein
D364_RS17365  2       50       -10.951174  acid-activated periplasmic chaperone HdeA
D364_RS17370  2       52       -11.401224  SEL1-like repeat protein
D364_RS26290  2       53       -12.033781  response regulator
D364_RS17385  2       55       -12.793712  FUSC family protein
D364_RS17390  2       52       -11.083261  response regulator
D364_RS25615  2       43       -7.160389   alpha/beta fold hydrolase
D364_RS17405  3       43       -7.896416   LysR family transcriptional regulator
D364_RS17410  2       48       -9.265549   MFS transporter
D364_RS27130  1       50       -10.602150  tyrosine-type recombinase/integrase
D364_RS17420  1       52       -10.723839  hypothetical protein
D364_RS17425  2       54       -12.292993  major capsid protein
D364_RS17430  5       62       -16.158265  hypothetical protein
D364_RS17435  5       68       -17.527171  hypothetical protein
D364_RS17440  5       64       -18.959183  hypothetical protein
D364_RS17445  6       62       -19.756938  hypothetical protein
D364_RS27585  6       60       -20.470030  helix-turn-helix domain-containing protein
D364_RS17450  2       62       -21.443881  hypothetical protein
D364_RS27135  3       62       -19.049813  hypothetical protein
D364_RS25530  4       66       -19.971349  hypothetical protein
D364_RS27590  3       51       -14.531652  recombinase family protein
D364_RS17460  2       42       -11.545910  DUF4062 domain-containing protein
D364_RS27140  0       28       -8.308883   *peptidoglycan DD-metalloendopeptidase family
D364_RS17465  1       27       -6.063346   isopentenyl-diphosphate Delta-isomerase
D364_RS17470  -1      17       -2.694535   lysine--tRNA ligase
D364_RS17480  -1      14       0.546277    peptide chain release factor 2
D364_RS17490  0       14       0.812768    single-stranded-DNA-specific exonuclease RecJ
D364_RS17495  0       15       0.561033    bifunctional protein-disulfide
D364_RS17500  -2      15       1.719720    site-specific tyrosine recombinase XerD
D364_RS17505  0       14       2.306217    flavodoxin FldB
D364_RS17510  2       14       1.315857    protein YgfX
D364_RS17515  2       14       1.141336    FAD assembly factor SdhE
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17265  HTHFIS  68     4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-14
Identities = 36/156 (23%), Positives = 61/156 (39%), Gaps = 9/156 (5%)

Query: 258 QAIRILIAEDLPANRQLLRRQLDTLGYAADEAKDGAEALKLIQQQRYDLLITDLNMPVMD 317
IL+A+D A R +L + L GY + A + I DL++TD+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 318 GITLTCRVREYDTRMVIWGLTANLVAGEKERCLASGMNLCLFKPLDLSQL----ATALCE 373
L R+++ + + ++A + G L KP DL++L AL E
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 374 INIPQSGSSLDEFLNMKIFTALTLGDKKLMRQMLEQ 409
S D M + +G M+++
Sbjct: 122 PKRRPSKLEDDSQDGMPL-----VGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17270  HTHFIS  70     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 2e-16
Identities = 23/122 (18%), Positives = 56/122 (45%), Gaps = 4/122 (3%)

Query: 1 MSKTANLSAIIIDDHPLARMAIRNLLENEGFNIVAEAGDGGEALMAVAEYQPDVVIVDVD 60
M+ + ++ DD R + L G+++ + +A D+V+ DV
Sbjct: 1 MTGA---TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVV 56

Query: 61 IPVMSGIEVVEKLRKKQFSHIIIVVSAKNDLFYGKRSADAGANAFISKKEGINNIISAIH 120
+P + +++ +++K + ++V+SA+N ++++ GA ++ K + +I I
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 121 AA 122
A
Sbjct: 117 RA 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS17275  RTXTOXIND  64     4e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 63.7 bits (155), Expect = 4e-13
Identities = 37/218 (16%), Positives = 74/218 (33%), Gaps = 24/218 (11%)

Query: 132 IAYQQALADYQRRSRLQGAAAISRENMQHAKDAVDSSKAALDVAVQAYRGNRVLIQNTAL 191
IA L + + + ++ + + S+K + Q + +N L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF-------KNEIL 301

Query: 192 EKQPEVLMAAESMRE----AWVALQRTKVRSPVTGYLAQRNVQ-VGETIGSGQALMSIIP 246
+K + + Q + +R+PV+ + Q V G + + + LM I+P
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361

Query: 247 VEQV-WINANFKETQLSGVKIGQKVSI-VTDF-YGSDVVFNGRVDGINMGTGSAFSVLPA 303
+ + A + + + +GQ I V F Y G+V I + ++
Sbjct: 362 EDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI-----NLDAIE-- 414

Query: 304 QNATGNWIKVVQRLPVRITLDAEQIKAYPLRIGLSATV 341
G V+ + K PL G++ T
Sbjct: 415 DQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAVTA 450


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17280  TCRTETB  130    4e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 130 bits (328), Expect = 4e-35
Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 19/411 (4%)

Query: 18 VTLALSMATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAISIPVTGRLAQ 77
+ + L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 78 RFGERKLFLTSVTLFALASLCCGLS-TNLDTLIGFRVVQGLVAGPLIPLSQSLLLRNYPP 136
+ G ++L L + + S+ + + LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 137 EKRNIALALWSMTVIIAPIFGPIIGGYICDNYDWGWIFLINVPLGVIVVVLTSWLLKGRE 196
E R A L V + GP IGG I W ++ LI + + V L L K
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194

Query: 197 TPTEPVKINLIALSLLVLGVGSLQIMLDKGKDLDWFNSTTIIVLAIIAVIAIILLVIWEA 256
++ + L+ +G+ + ML F ++ I I++V++ ++ V
Sbjct: 195 IKGH---FDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 257 TDNNPIIDLSLFRSRNFTIGILCIACAYLIYAGAIVLMPQLLQTVFEYTSVSAGLAYAPI 316
+P +D L ++ F IG+LC + AG + ++P +++ V + ++ G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 317 GIMPLLL-APLIGRYGHKIDMRMLVTFSFIVYALCYYWRSVTFSSAINF-TWVIIPQFMQ 374
G M +++ + G + ++ ++ + S + F T +I+
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 375 GFAVACFFLPLTTISLSGLPPEKFAAATSLSNFFRSLSGSIGTTITMTLWS 425
++TI S L ++ A SL NF LS G I L S
Sbjct: 362 LSFTK---TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17310  UREASE  969    0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 969 bits (2506), Expect = 0.0
Identities = 326/570 (57%), Positives = 418/570 (73%), Gaps = 5/570 (0%)

Query: 3 TISRKEYASLFGPTVGDKIRLGETDLYIEIEKDLRGYGDESVYGGGKSLRDGMGSNNTLT 62
+SR YA++FGPTVGDK+RL +T+L+IE+EKD +G+E +GGGK +RDGMG + T
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQV-T 62

Query: 63 RDNGVLDLVITNVTILDAKLGVIKADVGIKDGLIVGIGKSGNPAIMDGVTQNMIVGLSTD 122
R+ G +D VITN ILD G++KAD+G+KDG I IGK+GNP + GV +IVG T+
Sbjct: 63 REGGAVDTVITNALILDH-WGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTE 119

Query: 123 AISGEHLILTAAGIDSHIHLISPQQAYSALSNGVATFFGGGIGPTDGTNGTTVTPGPWNI 182
I+GE I+TA G+DSHIH I PQQ AL +G+ GGG GP GT TT TPGPW+I
Sbjct: 120 VIAGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHI 179

Query: 183 RKMLRAVEGLPVNVGLLGKGNAFGRAPLVEQIIAGVAGLKVHEDWGATPNALRHSLRIAD 242
+M+ A + P+N+ GKGNA LVE ++ G LK+HEDWG TP A+ L +AD
Sbjct: 180 ARMIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVAD 239

Query: 243 EMDIQVSVHTDSLNEAGYVENTIEAFEGRTIHTFHTEGAGGGHAPDIIKVASQLNVLPSS 302
E D+QV +HTD+LNE+G+VE+TI A +GRTIH +HTEGAGGGHAPDII++ Q NV+PSS
Sbjct: 240 EYDVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSS 299

Query: 303 TNPTLPFGINTQAELFDMIMVCHNLNPNVAADVSFAESRVRPETIAAENVLHDMGVISMF 362
TNPT P+ +NT AE DM+MVCH+L+P + D++FAESR+R ETIAAE++LHD+G S+
Sbjct: 300 TNPTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSII 359

Query: 363 SSDSQAMGRVGENWLRVVQTAHAMKVARGKLPEDSDGNDNFRVLRYVAKLTINPAIAHGV 422
SSDSQAMGRVGE +R QTA MK RG+L E++ NDNFRV RY+AK TINPAIAHG+
Sbjct: 360 SSDSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGL 419

Query: 423 SHIIGSVEVGKMADLVLWDPRSFGAKPKMVIKGGMINWALMGDPNASLPTPQPVFYRPMF 482
SH IGS+EVGK ADLVLW+P FG KP MV+ GG I A MGDPNAS+PTPQPV YRPMF
Sbjct: 420 SHEIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMF 479

Query: 483 GAMGKTLQDTCATFVSQAALDDGVKEKAGLERQVIAINNCR-SVTKRDLVRNSATPHIEV 541
GA G++ ++ TFVSQA+LD G+ + G+ ++++A+ N R + K ++ NS TPHIEV
Sbjct: 480 GAYGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEV 539

Query: 542 DPETFAVKVDGEHATCNPVTTAVMNQKYFF 571
DPET+ V+ DGE TC P T M Q+YF
Sbjct: 540 DPETYEVRADGELLTCEPATVLPMAQRYFL 569


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17385  HTHFIS  48     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.5 bits (113), Expect = 9e-09
Identities = 21/126 (16%), Positives = 49/126 (38%), Gaps = 10/126 (7%)

Query: 1 MSA---VIIDDHPFARLALKTVLENQNI-VVTGEAADDFHAIQLVDRLQPDIVIVDVMLI 56
M+ ++ DD R L L V A + + D+V+ DV++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAAT--LWRWIAAGDGDLVVTDVVMP 58

Query: 57 GSSGIDVVTKLRQNHYAGSIVMVSGKNQIFYRKCSVDAGANAFISK----KESMDNFVAA 112
+ D++ ++++ ++++S +N + + GA ++ K E + A
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 113 IQAVQR 118
+ +R
Sbjct: 119 LAEPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17405  HTHFIS  42     9e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.7 bits (98), Expect = 9e-07
Identities = 13/66 (19%), Positives = 25/66 (37%), Gaps = 1/66 (1%)

Query: 15 AIKTLLENKGVSVTGEAINGMDALRIVDQLQPNTIIVDVDLPDIDGIGLVETLRKRLYKG 74
+ L G V N R + + ++ DV +PD + L+ ++K
Sbjct: 18 VLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDL 76

Query: 75 SIIVTS 80
++V S
Sbjct: 77 PVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17425  TCRTETB  126    4e-34    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 126 bits (318), Expect = 4e-34
Identities = 77/398 (19%), Positives = 173/398 (43%), Gaps = 13/398 (3%)

Query: 29 TLMGVFDGTMINIALPSMAQEMQVPASIAVWFANGYLLAAAMTLAIFAALAARLGYRPVF 88
+ V + ++N++LP +A + P + W ++L ++ A++ L+ +LG + +
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 89 LAGLTTFTLTSLGCALA-NKPEVLIGMRVLQGIGGAATLSIAPAILRSVFPGRLLGRILG 147
L G+ S+ + + +LI R +QG G AA ++ ++ P G+ G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 148 FHALLIASSSAIGPVLGGTILHTLSWQWLFAINVLPGTLALLLAVRALPRDAIRMQAPFD 207
++A +GP +GG I H + W +L I ++ T+ + + L + +R++ FD
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TIITVPFLMKLLKKEVRIKGHFD 200

Query: 208 TVGAILSALLLGSTIMAANSLQNATSQFGSLCWMALAALSGMAFIWQIRRTGHPLLPPSM 267
G IL ++ + ++ S S+ ++ ++ LS + F+ IR+ P + P +
Sbjct: 201 IKGIILMSVGIVFFMLFTTS--------YSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 268 FKNERFTLAAFTSMVAFVSQGITFIALPFLFQSEYGYSP-VVSALLFTPWPLGIVLIAPH 326
KN F + + F + +P++ + + S + +++ P + +++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 327 AGRWADTISAPAISTLGLVIFVVGLILLATLPASPSMWDICLRSLVCGIGFGCFQSPNNR 386
G D + +G+ V + + L + S + + V G G ++ +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 387 EMLSNVIREHASYASGVLSIMRTFGQCLGAAAVAVLLA 424
+ S++ ++ A +L+ + G A V LL+
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17435  56KDTSANTIGN  33     0.002    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 33.4 bits (76), Expect = 0.002
Identities = 37/158 (23%), Positives = 62/158 (39%), Gaps = 27/158 (17%)

Query: 2 SVNVKAATLTNLVKYKTDRASLRSVRDDMKKLQKDFSKTEGTIAKAKMQADKQAYTAQMQ 61
+NV L N + ++ ++ + D +++L+ F ++ + Q
Sbjct: 283 GINVPDTGLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQ 342

Query: 62 QQKQVQQQQKQAAKQATVDAKA-------KQIEA------------------RKLAAAQS 96
QQ Q QQQQ QA Q V A A QI KLAA Q
Sbjct: 343 QQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLVKLQRHAGIRKAMEKLAAQQE 402

Query: 97 KAAKIQMQQ--QQKQASVAENARLKERKALFDIGRMEG 132
+ AK Q + +Q+Q + ++ K ++ FD+ + G
Sbjct: 403 EDAKNQGKGDCKQQQGASEKSKEGKVKETEFDLSMVVG 440


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS17480  RTXTOXIND  40     6e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 39.8 bits (93), Expect = 6e-06
Identities = 18/82 (21%), Positives = 33/82 (40%), Gaps = 12/82 (14%)

Query: 146 AAGAGKVVYVGNQLRGYGNLIMIKHSEDYITAYAHNDKLMVNNGQSVKAGQQIATMGSTD 205
A GK+ + G IK E+ I +++V G+SV+ G + + +
Sbjct: 84 ATANGKLTHSGRSK-------EIKPIENSIV-----KEIIVKEGESVRKGDVLLKLTALG 131

Query: 206 ADSVRLHFQIRYRATAIDPLRY 227
A++ L Q ++ RY
Sbjct: 132 AEADTLKTQSSLLQARLEQTRY 153


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17510  ADHESNFAMILY  31     0.005    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 30.6 bits (69), Expect = 0.005
Identities = 11/38 (28%), Positives = 16/38 (42%)

Query: 1 MKKGLLMFTLLAASLSGAAHADSAAIKQSLAKLGVQST 38
MKK + L +++ A A S KL V +T
Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVAT 38


66  D364_RS17655  D364_RS17720  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS17655227-1.741821oxidative stress defense protein
D364_RS17665332-1.470134arginine exporter ArgO
D364_RS17675434-1.230891small-conductance mechanosensitive channel MscS
D364_RS17680332-1.582932class II fructose-bisphosphate aldolase
D364_RS17685122-0.119689phosphoglycerate kinase
D364_RS176900160.921818erythrose-4-phosphate dehydrogenase
D364_RS176950151.779418transketolase
D364_RS177001151.739495M48 family metallopeptidase
D364_RS177052162.361452OprD family outer membrane porin
D364_RS177102192.922216thiamine pyrophosphate-binding protein
D364_RS177152192.332995aspartate dehydrogenase
D364_RS177202152.602465SDR family oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17725  DHBDHDRGNASE  108    1e-30    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 108 bits (272), Expect = 1e-30
Identities = 77/262 (29%), Positives = 118/262 (45%), Gaps = 9/262 (3%)

Query: 1 MNAQ-IEGRVAVVTGGSSGIGFETLRLLLGEGAKVAFCGRNPDRLASAHAALQNE--YPE 57
MNA+ IEG++A +TG + GIG R L +GA +A NP++L ++L+ E + E
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 58 GEVFSWRCDVLNEAEVEAFAAAVAARFGGVDMLINNAGQGYVAHFADTPREAWLHEAELK 117
++ DV + A ++ A + G +D+L+N AG E W +
Sbjct: 61 ----AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116

Query: 118 LFGVINPVKAFQSLLEASDIASITCVNSLLALQPEEHMIATSAARAALLNMTLTLSKELV 177
GV N ++ + SI V S A P M A ++++AA + T L EL
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 178 DKGIRVNSILLGMVESGQWQRRFESRSDKSQSWQQWTADIARKRGIPMARLGKPQEPAQA 237
+ IR N + G E+ + + Q + K GIP+ +L KP + A A
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF--KTGIPLKKLAKPSDIADA 234

Query: 238 LLFLASPLASFTTGAALDVSGG 259
+LFL S A T L V GG
Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256


67  D364_RS17905  D364_RS17955  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS17905-213-3.743622DUF554 domain-containing protein
D364_RS17910019-3.429122*RNA-directed DNA polymerase
D364_RS17915322-3.456768fimbrial biogenesis outer membrane usher
D364_RS17920532-5.337889molecular chaperone
D364_RS17930636-6.061090molecular chaperone
D364_RS17935736-6.048731fimbrial protein
D364_RS17940934-4.766697fimbrial protein
D364_RS17945837-5.859857molecular chaperone
D364_RS17950748-8.167891fimbrial-like protein
D364_RS17955333-4.341622cystathionine beta-synthase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17945  PF00577  725    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 725 bits (1874), Expect = 0.0
Identities = 296/879 (33%), Positives = 465/879 (52%), Gaps = 50/879 (5%)

Query: 14 KLFRYSPVAGFLLVCI------NPAWAGDYFDPGFLGNSGDNTAVDLSAFSEAGGVQPGK 67
+ R + L V + A YF+P FL DLS F + PG
Sbjct: 19 RKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFL-ADDPQAVADLSRFENGQELPPGT 77

Query: 68 YTVWVFVNQRNAGQYTLDFQKNTQGKIA-PVLTPSELETFGVNVRQLPDLKDLPATAEID 126
Y V +++N + F + P LT ++L + G+N + + L A +
Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137

Query: 127 NIGALIPQATTMLDLARLRLDISVPQAAMQPEVRGAVDPSQWEEGISALMANYSLSAGRT 186
+ ++I AT LD+ + RL++++PQA M RG + P W+ GI+A + NY+ S
Sbjct: 138 -LTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV 196

Query: 187 TNSGQNQTSHNNNLFATVRAGANTGPWRLRSTMTHTRVENNGGNNALTTTQTRFSNTYLA 246
N + + + +++G N G WRLR T + N+ +++ + + + NT+L
Sbjct: 197 QNRIGGNS---HYAYLNLQSGLNIGAWRLRDNTTWSY--NSSDSSSGSKNKWQHINTWLE 251

Query: 247 RDIRGWRSNLLMGESSTGSDVFDGIPFRGVKLSSNEQMLPSQLRGYAPAISGVANSNARV 306
RDI RS L +G+ T D+FDGI FRG +L+S++ MLP RG+AP I G+A A+V
Sbjct: 252 RDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQV 311

Query: 307 TVRQNGNVVYETYVAPGPFYINDIQQAGLSGDYDVKVTEADGTERQFIVPYSSLPVMLRP 366
T++QNG +Y + V PGPF INDI AG SGD V + EADG+ + F VPYSS+P++ R
Sbjct: 312 TIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQRE 371

Query: 367 GGWKYELTAGQY--DGNLTDGSRRADFMLGTVVYGLPGDVTLFGGILAAKDYQAFNIGTG 424
G +Y +TAG+Y + R F T+++GLP T++GG A Y+AFN G G
Sbjct: 372 GHTRYSITAGEYRSGNAQQEKPR---FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIG 428

Query: 425 VSLGYVGALSADITNSSAKFDNESTLIGQSYRVRYSKSLLSTGTSVDLTALRYSTEDYYS 484
++G +GALS D+T +++ ++S GQS R Y+KSL +GT++ L RYST Y++
Sbjct: 429 KNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488

Query: 485 FSEFNSQGHQLQEGVSPWSLQ--------------RRRNSFQTQLSQQLGDWGTMYFRAS 530
F++ + + +R Q ++QQLG T+Y S
Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548

Query: 531 RDDYWGGERTLTGMSLGYSNSLKGVSYGVNYNIDRTKDANGNWPENRQISFNVSVPFSIF 590
YWG G + + + +++ ++Y++ + G ++ ++ NV++PFS +
Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR---DQMLALNVNIPFSHW 605

Query: 591 GYSRN---LQSMYATTTLTHDNTGRTLSQTGLSGNTL-DGKLSYSASQSW---GNQGQIS 643
S + + A+ +++HD GR + G+ G L D LSYS + G+ S
Sbjct: 606 LRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGS 665

Query: 644 NTNLNTGYQGSKGSISGGYSYSSDMQAINMSASGGVMVHSGGITLSRAMGDSVALVSAPG 703
Y+G G+ + GYS+S D++ + SGGV+ H+ G+TL + + D+V LV APG
Sbjct: 666 TGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPG 725

Query: 704 AAGVSVNGGTAV-TDWRGYAVVPYLTDYTRNSVGVDPSTLPENVDLTQTNLNVYPTKGAV 762
A V T V TDWRGYAV+PY T+Y N V +D +TL +NVDL NV PT+GA+
Sbjct: 726 AKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAI 785

Query: 763 VKANFATRGGYQVLMTLKLDNGVVPFGAVATLLNAGMAEVNSSIVGDDGQVYLTGLPERG 822
V+A F R G ++LMTL +N +PFGA+ T ++ +S IV D+GQVYL+G+P G
Sbjct: 786 VRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQ----SSGIVADNGQVYLSGMPLAG 841

Query: 823 ELLVKWGETAARQCRVSFDISGLSTSPDKPVRQVTYTCQ 861
++ VKWGE C ++ + S + + Q++ C+
Sbjct: 842 KVQVKWGEEENAHCVANYQLPP--ESQQQLLTQLSAECR 878


68  D364_RS18210  D364_RS18330  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS182102122.973511bifunctional
D364_RS182151122.606123bifunctional [glutamate--ammonia
D364_RS276050133.486320inorganic triphosphatase
D364_RS182250133.284396SH3 domain-containing protein
D364_RS182300122.813303multifunctional CCA addition/repair protein
D364_RS18235-2121.573194undecaprenyl-diphosphate phosphatase
D364_RS18240-1120.679771bifunctional dihydroneopterin
D364_RS18245-2141.070094glycerol-3-phosphate 1-O-acyltransferase PlsY
D364_RS18250-1150.203309hypothetical protein
D364_RS182551191.592949short-chain dehydrogenase
D364_RS182601203.480592urease accessory protein UreD
D364_RS276102204.195801urease subunit gamma
D364_RS182702204.117650urease subunit beta
D364_RS182751215.133002urease subunit alpha
D364_RS182801224.458873urease accessory protein UreE
D364_RS182851203.797468urease accessory protein UreF
D364_RS182900193.864551urease accessory protein UreG
D364_RS18295-3173.011362cytosine permease
D364_RS18300-2121.572058tRNA
D364_RS18305014-0.34059730S ribosomal protein S21
D364_RS18310-113-0.082302DNA primase
D364_RS18315113-0.652265RNA polymerase sigma factor RpoD
D364_RS18320114-1.765390G/U mismatch-specific DNA glycosylase
D364_RS18330212-0.621329*hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS18225  LPSBIOSNTHSS  29     0.029    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.0 bits (65), Expect = 0.029
Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383
G FD + GH+ + +L D++ VAV + + + +
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS18255  LIPPROTEIN48  27     0.016    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 27.3 bits (60), Expect = 0.016
Identities = 24/66 (36%), Positives = 31/66 (46%), Gaps = 11/66 (16%)

Query: 12 ITTIGVYDWEQTIEQK----LVFDI-EIAWDNRKAAASDDVSDCLSYADISERVIAHVEG 66
I IG+ D++ E K L F+I E A+ A A LS D S+RV+A G
Sbjct: 148 IKIIGI-DFDIETEYKWFYSLQFNIKESAFTTGYAIA-----SWLSEQDESKRVVASFGG 201

Query: 67 GKFALV 72
G F V
Sbjct: 202 GAFPGV 207


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS18290  UREASE  1082   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1082 bits (2800), Expect = 0.0
Identities = 411/566 (72%), Positives = 473/566 (83%), Gaps = 2/566 (0%)

Query: 4 ISRQAYADMFGPTVGDKVRLADTELWIEVEDDLTTYGEEVKFGGGKVIRDGMGQGQML-A 62
+SR AYA+MFGPTVGDKVRLADTEL+IEVE D TT+GEEVKFGGGKVIRDGMGQ Q+
Sbjct: 5 MSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTRE 64

Query: 63 ADCVDLVLTNALIVDHWGIVKADIGVKDGRIFAIGKAGNPDIQPNVTIPIGAATEVIAAE 122
VD V+TNALI+DHWGIVKADIG+KDGRI AIGKAGNPD+QP VTI +G TEVIA E
Sbjct: 65 GGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124

Query: 123 GKIVTAGGIDTHIHWICPQQAEEALVSGVTTMVGGGTGPAAGTHATTCTPGPWYISRMLQ 182
GKIVTAGG+D+HIH+ICPQQ EEAL+SG+T M+GGGTGPA GT ATTCTPGPW+I+RM++
Sbjct: 125 GKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIE 184

Query: 183 AADSLPVNIGLLGKGNVSQPDALREQVAAGVIGLKIHEDWGATPAAIDCALTVADEMDVQ 242
AAD+ P+N+ GKGN S P AL E V G LK+HEDWG TPAAIDC L+VADE DVQ
Sbjct: 185 AADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDVQ 244

Query: 243 VALHSDTLNESGFVEDPLAAIGGRTIHTFHTEGAGGGHAPDIITACAHPNILPSSTNPTL 302
V +H+DTLNESGFVED +AAI GRTIH +HTEGAGGGHAPDII C PN++PSSTNPT
Sbjct: 245 VMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPTR 304

Query: 303 PYTLNTIDEHLDMLMVCHHLDPDIAEDVAFAESRIRRETIAAEDVLHDLGAFSLTSSDSQ 362
PYT+NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAED+LHD+GAFS+ SSDSQ
Sbjct: 305 PYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQ 364

Query: 363 AMGRVGEVILRTWQVAHRMKVQRGALAEETGDNDNFRVKRYIAKYTINPALTHGIAHEVG 422
AMGRVGEV +RTWQ A +MK QRG L EETGDNDNFRVKRYIAKYTINPA+ HG++HE+G
Sbjct: 365 AMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIG 424

Query: 423 SIEVGKLADLVVWSPAFFGVKPATVIKGGMIAIAPMGDINASIPTPQPVHYRPMFGALGS 482
S+EVGK ADLV+W+PAFFGVKP V+ GG IA APMGD NASIPTPQPVHYRPMFGA G
Sbjct: 425 SLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYGR 484

Query: 483 ARHHCRLTFLSQAAAANGVAERLNLRSAIAVVKGCR-TVQKADMVHNSLQPNITVDAQTY 541
+R + +TF+SQA+ G+A RL + + V+ R + KA M+HNSL P+I VD +TY
Sbjct: 485 SRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPETY 544

Query: 542 EVRVDGELITSEPADVLPMAQRYFLF 567
EVR DGEL+T EPA VLPMAQRYFLF
Sbjct: 545 EVRADGELLTCEPATVLPMAQRYFLF 570


69  D364_RS18435  D364_RS18515  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS184350163.320366hypothetical protein
D364_RS184400163.725035dihydroxyacetone kinase subunit DhaK
D364_RS184450143.487362dihydroxyacetone kinase ADP-binding subunit
D364_RS184500153.295600dihydroxyacetone kinase subunit DhaM
D364_RS18455-1152.568637glycerone kinase
D364_RS184600171.053466siderophore-interacting protein
D364_RS18470-1171.309081PadR family transcriptional regulator
D364_RS184750171.958967putrescine aminotransferase
D364_RS18480-1172.858654(4S)-4-hydroxy-5-phosphonooxypentane-2,3-dione
D364_RS184850163.5431283-hydroxy-5-phosphonooxypentane-2,4-dione
D364_RS18490-1144.284880autoinducer 2 ABC transporter substrate-binding
D364_RS184950134.671474histidine kinase
D364_RS185000134.091829autoinducer 2 ABC transporter permease LsrC
D364_RS185051134.113398autoinducer 2 ABC transporter ATP-binding
D364_RS185100143.998604transcriptional regulator LsrR
D364_RS185150133.143276autoinducer-2 kinase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS18445  ADHESNFAMILY  28     0.019    Adhesin family signature.
		>ADHESNFAMILY#Adhesin family signature.

Length = 309

Score = 28.3 bits (63), Expect = 0.019
Identities = 10/60 (16%), Positives = 24/60 (40%), Gaps = 6/60 (10%)

Query: 31 KEIGDAD----HGLNMHRGFSKVVEKLP--SIADKDIGFILKNTGMTLLSNVGGASGPLF 84
K+ +AD +G+N+ G + KL + ++ + + G+ ++ G
Sbjct: 77 KKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKE 136


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS18450  PHPHTRNFRASE  151    4e-42    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 151 bits (382), Expect = 4e-42
Identities = 65/221 (29%), Positives = 112/221 (50%), Gaps = 5/221 (2%)

Query: 258 VEGAALRYPLALIQP----LRPAAADAAREQQRLRQAIDQTLADLIALTELAENKFHADI 313
G A+ ++P + + D + E ++L A++++ +L A+ + E AD
Sbjct: 11 SSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADK 70

Query: 314 AAIFAGHHTLLDDDDLFDAANDRLLTEQCTAEWAWHQVLMELSQQYRQLDDPYLQARYID 373
A IFA H +LDD +L D ++ EQ AE+A +V + +D+ Y++ R D
Sbjct: 71 AEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAAD 130

Query: 374 IEDILQRTLRHLQGVQE-RVPTPGEPTIIIADNIYPSTVLQLDASFVKGLCLRDGSEQAH 432
I D+ +R L HL GV+ + T E T+IIA+++ PS QL+ FVKG G +H
Sbjct: 131 IRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSH 190

Query: 433 GAIIARAAGIAWLSQQGEALNSVQPGETIVLDMRHQRLIRD 473
AI++R+ I + E +Q G+ +++D +I +
Sbjct: 191 SAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVN 231


70  D364_RS20050  D364_RS20175  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS200502111.168041RNA polymerase sigma factor RpoH
D364_RS200554130.095096permease-like cell division protein FtsX
D364_RS200604130.584955cell division ATP-binding protein FtsE
D364_RS200653130.407646signal recognition particle-docking protein
D364_RS200702123.26788616S rRNA (guanine(966)-N(2))-methyltransferase
D364_RS200751144.392090DUF1145 family protein
D364_RS200801123.304279DUF2500 domain-containing protein
D364_RS200851133.579469lysoplasmalogenase
D364_RS200902134.034530Zn(II)/Cd(II)/Pb(II) translocating P-type ATPase
D364_RS200952133.969723sulfurtransferase TusA
D364_RS201004161.8827957-cyano-7-deazaguanine/7-aminomethyl-7-
D364_RS201053162.061922DcrB family lipoprotein
D364_RS201104152.778726MFS transporter
D364_RS201152133.238159AI-2E family transporter
D364_RS201200142.3875894-amino-4-deoxy-L-arabinose-phosphoundecaprenol
D364_RS20125-2172.2498764-amino-4-deoxy-L-arabinose-phosphoundecaprenol
D364_RS20130-2162.271434lipid IV(A)
D364_RS20135-2161.5241274-deoxy-4-formamido-L-arabinose-
D364_RS20140-1162.083651bifunctional UDP-4-amino-4-deoxy-L-arabinose
D364_RS20145-2152.330692undecaprenyl-phosphate
D364_RS20150-1153.088867UDP-4-amino-4-deoxy-L-arabinose
D364_RS201550153.769488phenolic acid decarboxylase
D364_RS201602164.475475LysR family transcriptional regulator
D364_RS201654155.648205nickel ABC transporter substrate-binding
D364_RS201704165.098132nickel ABC transporter permease subunit NikB
D364_RS201756144.186842nickel ABC transporter permease subunit NikC
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
D364_RS20070  IGASERPTASE  51     1e-08    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.8 bits (121), Expect = 1e-08
Identities = 42/204 (20%), Positives = 69/204 (33%), Gaps = 21/204 (10%)

Query: 18 KEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETV-VESEKA 76
QA EE V E A P P+ P E AE+ + ++TV + A
Sbjct: 1002 NIQADVPSVPSNNEEIARVDE---APVPPPAPATPSETTETVAENSKQESKTVEKNEQDA 1058

Query: 77 HLAEPASAQ--EEEWVETPALTEETPVV----EPEPAVSEPPEQPAVVEP------LAEE 124
+ + +E A T+ V E + + ++ A VE E+
Sbjct: 1059 TETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118

Query: 125 VIAEPVVAEAVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQ-----ALAAEAAEEAAV 179
P V V+ + + VQPQ A E D ++ +E ++Q A E ++
Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178

Query: 180 VVPAPEDEAPLEALAQEQEKPTKE 203
V + + E P
Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENT 1202



Score = 47.0 bits (111), Expect = 2e-07
Identities = 28/163 (17%), Positives = 46/163 (28%), Gaps = 9/163 (5%)

Query: 17 QKEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEA-----FAEDVVEVTETVV 71
+ +ET+T + E EE VET PK + +E V E
Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147

Query: 72 ESEKAHLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPPE----QPAVVEPLAEEVIA 127
E++ + +Q +T +ET +P
Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207

Query: 128 EPVVAEAVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQALA 170
+P V + +P + E A S + AL
Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250



Score = 43.9 bits (103), Expect = 2e-06
Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 14/188 (7%)

Query: 17 QKEQAQETET-EQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEK 75
K++++ E EQ E A E+ + + + +V + E++
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN------EVAQSGSETKETQT 1097

Query: 76 AHLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPP--EQPAVVEPLAEEVIAEPVVAE 133
E A+ ++EE + TE+T P+ P EQ V+P AE A
Sbjct: 1098 TETKETATVEKEE--KAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEP--ARENDPT 1152

Query: 134 AVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEAL 193
++P + +E + ++ +
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212

Query: 194 AQEQEKPT 201
++ KP
Sbjct: 1213 SESSNKPK 1220



Score = 40.0 bits (93), Expect = 3e-05
Identities = 28/180 (15%), Positives = 51/180 (28%), Gaps = 23/180 (12%)

Query: 17 QKEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEKA 76
Q + +ET T +K E+ + E+ V +PK + +E V E E++
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK---QEQSETVQPQAEPARENDPT 1152

Query: 77 HLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPPEQPAVVEPLAEEVIAEPVVAEAVA 136
+ +Q +T +ET +P +V
Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN----------------TGNSVV 1196

Query: 137 EQPVEGI--VVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEALA 194
E P QP P + +++ E A A + +
Sbjct: 1197 ENPENTTPATTQPTVNSESSN-KPKNRHRRSVRSVPHN-VEPATTSSNDRSTVALCDLTS 1254


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS20100  PF01206  103    3e-33    SirA family protein
		>PF01206#SirA family protein

Length = 76

Score = 103 bits (259), Expect = 3e-33
Identities = 27/71 (38%), Positives = 43/71 (60%)

Query: 9 DHTLDALGLRCPEPVMMVRKTVRTMPVGETLLIIADDPATTRDIPGFCRFMEHELVAQET 68
D +LDA GL CP P++ +KT+ TM GE L ++A DP + +D F + HEL+ Q+
Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64

Query: 69 EALPYRYLIRK 79
E Y + +++
Sbjct: 65 EDGTYHFRLKR 75


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS20115  TCRTETA  49     2e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 49.1 bits (117), Expect = 2e-08
Identities = 75/365 (20%), Positives = 134/365 (36%), Gaps = 30/365 (8%)

Query: 13 LRLNLRIVSVVIFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70
++ N ++ ++ + IGL + VLPG + D++ G++++L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 71 PHAGRYADLLGPKKIVVFGLGGCFLSGLSYLLAAWGSGWPLISLLLLCLGRVILGI-GQS 129
P G +D G + +++ L + + Y + A L +L +GR++ GI G +
Sbjct: 61 PVLGALSDRFGRRPVLLVSL---AGAAVDYAIMATAP-----FLWVLYIGRIVAGITGAT 112

Query: 130 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLC--YSHIGLSGLAGVIM 187
A G+ + + R + M G LG L +S A +
Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 188 AVALVAILCALP-------RAAVKAAKGKAMSFR-AVLGRVWPYGMALA-LASAGFGVIA 238
+ + LP R + A SFR A V MA+ + V A
Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230

Query: 239 TFITLFYDAK-GWDGAAFALTLFSCAFVGA---RLLFPNAINRLGGLNVAMLCFSVEAIG 294
+F + + WD ++L + + + ++ RLG ML + G
Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290

Query: 295 LLLVGFADTPMMAKIGTFLTGAGFSLVFPALGVVAVKAVPQHNQGSALATYTVFMDLSLG 354
+L+ FA MA L A + PAL + + V + QG + L+
Sbjct: 291 YILLAFATRGWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348

Query: 355 VSGPL 359
+ GPL
Sbjct: 349 IVGPL 353


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20130  BCTERIALGSPC  32     2e-04    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 32.2 bits (73), Expect = 2e-04
Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 35 RHILFWLGMALLCLGCGMLLW-LSVLQSIPVS 65
R ILF+L M L C M+ W + + + PVS
Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPVS 46


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20145  NUCEPIMERASE  113    2e-29    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 113 bits (284), Expect = 2e-29
Identities = 74/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%)

Query: 317 RVLILGVNGFIGNHLTERLLQDDNYEIYGLDIGSD--------AISRFLDCPRFHFVEGD 368
+ L+ G GFIG H+++RLL + +++ G+D +D A L P F F + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLL-EAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 369 ISIHSEWIE--YHIKKCDVVLPLVAIATPIEYT-RNPLRVFELDFEENLKIIRDCVKYN- 424
++ E + + + V + Y+ NP + + L I+ C
Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 425 KRIIFPSTSEVYGMCTDKNFDEDSSNLVVGPINKQRWIYSVSKQLLDRVIWAYGDKYGLK 484
+ +++ S+S VYG+ F D S V P++ +Y+ +K+ + + Y YGL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS--VDHPVS----LYAATKKANELMAHTYSHLYGLP 172

Query: 485 FTLFRPFNWMGPRLDNLNAARIGSSRAITQLILNLVEGSPIKLIEGGKQKRCFTDISDGI 544
T R F GP A+ + ++EG I + GK KR FT I D
Sbjct: 173 ATGLRFFTVYGPWGR--------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 545 EALFRIIEN---------------KDGRCDGQIINIGNPDNEASIKELAEMLLACFERHP 589
EA+ R+ + ++ NIGN + + + L
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283

Query: 590 LRDRFPPFAGFREVESSDYYGKGYQDVEHRKPSIRNAKRCLNWEPKVEMEETVEHTLDFF 649
++ P G DV + + + P+ +++ V++ ++++
Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 650 L 650

Sbjct: 329 R 329


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20150  ACRIFLAVINRP  31     0.011    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.6 bits (69), Expect = 0.011
Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 5/97 (5%)

Query: 182 TFIPILANTFARRAVEIPVMHAEREFGDSKYSFMRLINLMYDLVTCLTTTPLRLLSIFGS 241
P L T + + FG +F +N + V + + R L I+
Sbjct: 487 ILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYAL 546

Query: 242 VIALLGFAFGLLLVVLRLAFGPQWAAEGVFMLFAVLF 278
++A + F L +F P+ +GVF+ L
Sbjct: 547 IVAGMVVLFLRLPS----SFLPE-EDQGVFLTMIQLP 578


71  D364_RS20235  D364_RS20380  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS20235-2193.169278oligopeptidase A
D364_RS20240-2183.557183phosphatase PAP2 family protein
D364_RS20245-2183.57851723S rRNA (adenine(2030)-N(6))-methyltransferase
D364_RS20250-2153.667636glutathione-disulfide reductase
D364_RS202550154.042265glutathione S-transferase
D364_RS202600143.885283iron-containing alcohol dehydrogenase
D364_RS20265-1152.980589alpha,alpha-trehalase
D364_RS20270-1153.123499GNAT family N-acetyltransferase
D364_RS202750162.762150SDR family oxidoreductase
D364_RS20280-2152.527774LysR family transcriptional regulator
D364_RS20285-2141.796877inner membrane protein YhjD
D364_RS20290-2121.922654MHS family MFS transporter
D364_RS20295-1133.197754AsmA family protein
D364_RS203050123.344438sugar kinase
D364_RS203100143.334313insulinase family protein
D364_RS20315-1153.189853dicarboxylate/amino acid:cation symporter
D364_RS203200153.791671biofilm formation regulator HmsP
D364_RS20330-1163.927519cellulose biosynthesis protein BcsC
D364_RS20335-3121.558374cellulase
D364_RS20340-3121.042707cellulose biosynthesis cyclic di-GMP-binding
D364_RS20345-2110.435396UDP-forming cellulose synthase catalytic
D364_RS203500120.362443cellulose biosynthesis protein BcsQ
D364_RS20355-112-0.090405YhjR family protein
D364_RS203600122.406773cellulose biosynthesis protein BcsE
D364_RS203650143.727270cellulose biosynthesis protein BcsF
D364_RS20370-1143.029343cellulose biosynthesis protein BcsG
D364_RS20375-1142.944487endoglucanase
D364_RS20380-1123.068761hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20240  SURFACELAYER  31     0.008    Lactobacillus surface layer protein signature.
		>SURFACELAYER#Lactobacillus surface layer protein signature.

Length = 439

Score = 31.2 bits (70), Expect = 0.008
Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 3/70 (4%)

Query: 1 MKRQL---SLLAVALLLAQPVLAKDIPLNRAAALANSVTPAASSQAYDDLEQQALAQLRH 57
MK+ L S A ALL P+ A +P+N A + A++ A D++
Sbjct: 1 MKKNLRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIA 60

Query: 58 ALQGNAATLT 67
A+ +
Sbjct: 61 AVAKSDTMPA 70


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20275  DHBDHDRGNASE  85     6e-22    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 85.5 bits (211), Expect = 6e-22
Identities = 67/258 (25%), Positives = 113/258 (43%), Gaps = 16/258 (6%)

Query: 4 RIALVTGGSRGLGKNAALKLAAKGTDILLTYHSNRQAALDVVAEIEKKGVKAAALALNVG 63
+IA +TG ++G+G+ A LA++G I N + VV+ ++ + A A +V
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 64 DSTTFDAFASEVAQVLAQKWGRTTFDYLLNNAGIGLNAPFAETSEAQFDELMNIQFKGPF 123
DS D E+ + ++ G D L+N AG+ S+ +++ ++ G F
Sbjct: 68 DSAAID----EITARIEREMGP--IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121

Query: 124 FLTQRLLPLLQD--GGRILNVSSGLARFALPGYAAYAAMKGAMEVLTRYQAKELGGRGIS 181
++ + + D G I+ V S A AAYA+ K A + T+ EL I
Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 182 VNIIAPGAIETDFGGG-EVRDNAE--VNRHIAAQTALG----RVGLPDDIGDAIAALLSD 234
NI++PG+ ETD +N V + G ++ P DI DA+ L+S
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 235 ELAWMNAQRVEVSGGMFL 252
+ + + V GG L
Sbjct: 242 QAGHITMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20285  DPTHRIATOXIN  30     0.015    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 30.1 bits (67), Expect = 0.015
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 6/40 (15%)

Query: 19 EKSKSTLEALNDTAVGQKASQALKTVTGTAAKVQRNPVIA 58
EK+K LE + TA+ LKTVTGT NPV A
Sbjct: 273 EKAKQYLEEFHQTALEHPELSELKTVTGT------NPVFA 306


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20290  TCRTETB       32     0.006    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.8 bits (72), Expect = 0.006
Identities = 79/365 (21%), Positives = 126/365 (34%), Gaps = 59/365 (16%)

Query: 79 IGSALFGHFGDRVGRKVTLVASLLTMGISTVVIGLLPGYESIGIVAPMLLALARFGQGLG 138
IG+A++G D++G K LL GI G + G+ +G LL +ARF QG G
Sbjct: 64 IGTAVYGKLSDQLGIK-----RLLLFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGAG 116

Query: 139 LGGEWGGAALLATENAPARKR----ALYGSFPQLGAPIGFFFANGTFLLLSW-------- 186
++ P R L GS +G +G + W
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176

Query: 187 -----LLTDQQFMEWGWRV--PF-IFSAVLVIIG-------------LYVRVSLHETPVF 225
+ + ++ R+ F I +L+ +G ++ VS+ +F
Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236

Query: 226 AKVAAAKKQVKIPLGTLLTKHVRVTVLGTFIMLATYTLFYIMTVYSMTFSTGAAPNGLGL 285
K + G + VL I+ T F M Y M + +G
Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG- 295

Query: 286 PRNEVLWMLMMAVIGFGVMVPVAGLLADAFGRRKSMII-ITTMIILFALFAFKPLLGSGN 344
+ +++ M+VI FG + G+L D G + I +T + + F +F S
Sbjct: 296 --SVIIFPGTMSVIIFG---YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF 350

Query: 345 PLLVFAFLLLGLSLMGL---TFGPMGALLPELFPTEVRYTGASFS-YNVSSILGASVAPY 400
++ F+L GLS T L E GA S N +S L
Sbjct: 351 MTIIIVFVLGGLSFTKTVISTIVSSS-----LKQQEA---GAGMSLLNFTSFLSEGTGIA 402

Query: 401 IAAWL 405
I L
Sbjct: 403 IVGGL 407



Score = 29.8 bits (67), Expect = 0.025
Identities = 19/101 (18%), Positives = 38/101 (37%), Gaps = 2/101 (1%)

Query: 255 FIMLATYTLFYIMTVYSMTFSTGAAPNGLGLPRNEVLWMLMMAVIGFGVMVPVAGLLADA 314
I L + F ++ + S N P W+ ++ F + V G L+D
Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75

Query: 315 FGRRKSMIIITTMIILFALFAFKPLLGSGNPLLVFAFLLLG 355
G ++ ++ + ++ F + S LL+ A + G
Sbjct: 76 LGIKRLLLFGIIINCFGSVIGF--VGHSFFSLLIMARFIQG 114


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20370  FLGMRINGFLIF  29     0.045    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.2 bits (65), Expect = 0.045
Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 5/44 (11%)

Query: 93 QGSQIAGFSASYIWDLIVRFINWSMVGAFFVLLVLWLFISQWLR 136
G ++ + D ++ W VL+V W+ + +R
Sbjct: 442 TGGELPFWQQQSFIDQLLAAGRW-----LLVLVVAWILWRKAVR 480


72  D364_RS20845  D364_RS20880  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS20845  0       23       -5.676780   dUTP diphosphatase
D364_RS20850  1       29       -7.714392   nucleoid occlusion factor SlmA
D364_RS20855  1       33       -8.770358   orotate phosphoribosyltransferase
D364_RS20860  2       45       -12.458765  ribonuclease PH
D364_RS20865  2       43       -12.461356  ABC transporter permease
D364_RS20870  1       43       -11.959950  ABC transporter ATP-binding protein
D364_RS20875  2       39       -10.328235  HlyD family efflux transporter periplasmic
D364_RS27630  2       32       -6.920902   hypothetical protein
D364_RS20880  1       28       -5.802109   YicC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20850  HTHTETR       52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 1e-10
Identities = 38/185 (20%), Positives = 72/185 (38%), Gaps = 15/185 (8%)

Query: 1 MAEK-QTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMF 59
MA K + + R+ IL AL L S G + ++A + GV+ A+Y HF K+ +F
Sbjct: 1 MARKTKQEAQETRQHILDV-ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 60 DSLIEFIEDSLITRIN-LILKDEKDTTARLRLIVLLILGFGERNPGLTRILT-------G 111
+ E E ++ K D + LR I++ +L ++
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 112 HALMFEQDRLQGRIN-QLFERIEAQLRQVMREKKMREGEGYTLDETLLASQLLAFCEGML 170
M + Q + + ++RIE L+ + K + L A + + G++
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPAD----LMTRRAAIIMRGYISGLM 175

Query: 171 SRFVR 175
++
Sbjct: 176 ENWLF 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20865  ABC2TRNSPORT  45     2e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 44.5 bits (105), Expect = 2e-07
Identities = 35/180 (19%), Positives = 67/180 (37%), Gaps = 6/180 (3%)

Query: 175 IMGSILSTTLILMTALSITRERENGALENLLVSPLSGLEVIIGKITPFVIIGLFQATLIL 234
+ S ++ + R E +L + L ++++G++ I
Sbjct: 74 VATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIG 133

Query: 235 IAAVLLFDIPLHGSVFLLFFVLLIYVFLCLSIGIGISGLAQNQLQALQMSSFYFIPSLML 294
+ A L S+ V+ + S+G+ ++ LA + + + P L L
Sbjct: 134 VVAAALGYTQWL-SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFL 192

Query: 295 SGFVSPFISMPDWAKAIGSCLPLTYFIRLVKGIMLKGYSATALLPDLLPLIGLAVIVIGV 354
SG V P +P + LPL++ I L++ IML D+ +G I I +
Sbjct: 193 SGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-----VDVCQHVGALCIYIVI 247


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20875  RTXTOXIND     58     2e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 57.5 bits (139), Expect = 2e-11
Identities = 48/351 (13%), Positives = 121/351 (34%), Gaps = 85/351 (24%)

Query: 21 IERILINKGDNVAAGQELVKIESFDA-------QNIFLRAEEKLSAESALLRNLESGERP 73
++ I++ +G++V G L+K+ + A Q+ L+A + + L R++E + P
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166

Query: 74 E-----------------------------------------------ELDIIRSQIKKA 86
E E + ++I +
Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY 226

Query: 87 QSAESQVKRQLGRYRNLYANHAISLAEWEDIRDELTQKGAQVEEL---INQLKARQLPAR 143
++ K +L + +L AI+ + ++ + ++ + Q+++ L A+
Sbjct: 227 ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286

Query: 144 Q--------------DEISKQRSMVAAAKLERDKALWDVQQTTIVSPVNAKVFDI-IYRA 188
+ D++ + + LE K Q + I +PV+ KV + ++
Sbjct: 287 EEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTE 346

Query: 189 GERPSAGKPIISLLPPEN-IKVRFFIPEAKLGKFKIGSKVKLICDG----CAEPIAGIIN 243
G + + ++ ++P ++ ++V + +G +G + + + G +
Sbjct: 347 GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406

Query: 244 YISPEA---EFTPPVIYSTKRREKLIFMAEAIPALQQAGRMKIGQPFDVEI 291
I+ +A + V E+ + + + G EI
Sbjct: 407 NINLDAIEDQRLGLVFNVIISIEE-----NCLSTGNKNIPLSSGMAVTAEI 452


73  D364_RS21075  D364_RS21200  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS21075  0       16       3.020432    helix-turn-helix domain-containing protein
D364_RS21080  -1      19       4.048881    MBL fold metallo-hydrolase
D364_RS21085  0       18       3.400333    NUDIX hydrolase
D364_RS21090  -1      20       3.551255    DMT family transporter
D364_RS21095  0       19       3.499844    lipoprotein NlpA
D364_RS21100  0       18       3.733261    AraC family transcriptional regulator
D364_RS21105  3       19       4.535413    peroxidase-related enzyme
D364_RS21110  2       18       3.407731    nuclear transport factor 2 family protein
D364_RS21115  2       19       3.033316    pyridoxamine 5'-phosphate oxidase family
D364_RS21120  3       18       2.423871    LysR family transcriptional regulator
D364_RS21125  2       19       1.406858    MFS transporter
D364_RS21130  0       16       1.377785    TetR/AcrR family transcriptional regulator
D364_RS21135  -2      16       2.867023    GNAT family N-acetyltransferase
D364_RS21145  -2      18       3.502250    hypothetical protein
D364_RS27635  -1      20       4.208955    hypothetical protein
D364_RS27640  -1      21       4.537983    multidrug efflux RND transporter periplasmic
D364_RS21165  -1      22       5.026285    multidrug efflux RND transporter permease
D364_RS21170  -1      22       4.776610    multidrug efflux RND transporter outer membrane
D364_RS21175  1       20       4.183612    multidrug efflux MFS transporter
D364_RS21180  1       18       3.657223    N-acetylmuramic acid 6-phosphate etherase
D364_RS21185  0       15       3.815875    PTS N-acetylmuramic acid transporter subunit
D364_RS21190  0       14       3.860559    DMT family transporter
D364_RS21195  -1      14       3.153802    hypothetical protein
D364_RS21200  -1      14       3.490543    NAD(P)-dependent oxidoreductase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21130  TCRTETB       138    4e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 138 bits (350), Expect = 4e-38
Identities = 94/418 (22%), Positives = 180/418 (43%), Gaps = 19/418 (4%)

Query: 20 LLLVMLLSALDQTIVSTALPTIVGELDGL-DKLSWVVTAYILSSTIAVPLYGKFGDLFGR 78
L ++ S L++ +++ +LP I + + +WV TA++L+ +I +YGK D G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 79 KIVLQVAIGLFLVGSALCGLAQNMTQLVLM-RGLQGLGGGGLMVISMAAVADVIPPANRG 137
K +L I + GS + + + L++M R +QG G + M VA IP NRG
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 138 RYQGLFGGVFGLATVIGPLIGGFLVQHASWRWIFYINLPLGLFALLVIGAVFHSSNKRSQ 197
+ GL G + + +GP IGG + + W ++ +P+ + R +
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIK 196

Query: 198 HQIDWLGAIYLSMALLCIILFTSEGGSVHAWNDPQLWCILAFGIVGIIGFIYEERMAAEP 257
D G I +S+ ++ +LFT+ L ++ + F+ R +P
Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 258 IIPLALFRNRSFLLCSLIGFVIGMSLFGSVTFLPLYLQVVKEATPTEAGLQLI-PLMGGL 316
+ L +N F++ L G +I ++ G V+ +P ++ V + + E G +I P +
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 317 LLTSIISGRIISRTGKYRLFPILGTLLGVTGMVLLTRITIHSPLWQLYLFTGVLGAGLGL 376
++ I G ++ R G +G + + + + + + VLG GL
Sbjct: 307 IIFGYIGGILVDRRGP-LYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSF 364

Query: 377 VMQVLVLAVQNAMPAQMYGVATSGVTLFRSIGGSIGVALFGAVFTHVLQSNLQQLLPE 434
V+ V +++ Q G S + + G+A+ G + + + Q+LLP
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS--IPLLDQRLLPM 420


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21135  HTHTETR       72     7e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 7e-18
Identities = 33/175 (18%), Positives = 70/175 (40%), Gaps = 10/175 (5%)

Query: 12 RPGRPRGKKPGTANREQLMDIALTLFARDGAGRVSLNAIAKEAGVTPAMLHYYFSSRDAL 71
R + ++ R+ ++D+AL LF++ G SL IAK AGVT ++++F + L
Sbjct: 3 RKTKQEAQE----TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 72 VTQLIEERFMPLRNHISRIFVDHPQDPVL----ALTMMVETLAHMAEKNAWFAPLWM-QE 126
+++ E + P DP+ L ++E+ + ++ E
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 127 IIGEMPILRQHMDARFGEERFQVMLGTVRRWQQEGKINPALAPELLFTTVISLVL 181
+GEM +++Q E + + T++ + + L + +
Sbjct: 119 FVGEMAVVQQ-AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21145  SACTRNSFRASE  36     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 6/72 (8%)

Query: 51 GLIAKRKGNW---LCIEYLWVSETTRGRGLGSELMQEAEQQAQAQGCSHLLVDTFSFQ-- 105
G I R NW IE + V++ R +G+G+ L+ +A + A+ L+++T
Sbjct: 78 GRIKIRS-NWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS 136

Query: 106 ALPFYQKLGYQL 117
A FY K + +
Sbjct: 137 ACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21165  RTXTOXIND     43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 24/133 (18%), Positives = 51/133 (38%), Gaps = 10/133 (7%)

Query: 42 PVPVVSQLTGRTTAS-LSAEVRPQVGGIIQKRLFTEGDMVKAGQALYQIDPSSYRATWNE 100
V +V+ G+ T S S E++P I+++ + EG+ V+ G L ++ A +
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 101 AAAALKQAQALVASDCQKAQRYASLVRDNGVSRQDADDAASTCAQDKASV--------ES 152
++L QA+ Q R L + + D + ++ + +
Sbjct: 139 TQSSLLQARLEQTRY-QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197

Query: 153 KKAALESARINLN 165
+ +NL+
Sbjct: 198 WQNQKYQKELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21170  ACRIFLAVINRP  1146   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1146 bits (2966), Expect = 0.0
Identities = 583/1031 (56%), Positives = 754/1031 (73%), Gaps = 6/1031 (0%)

Query: 3 SRFFVRRPVFAWVIAILIMLAGVLAIRTLPVGQYPDVAPPAVKISATYTGASAETLENSV 62
+ FF+RRP+FAWV+AI++M+AG LAI LPV QYP +APPAV +SA Y GA A+T++++V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 63 TQVIEQQLTGLDHLLYFSSTSSSDGSVSITVTFEQGTDPDTAQVQVQNKVQQAESRLPSE 122
TQVIEQ + G+D+L+Y SSTS S GSV+IT+TF+ GTDPD AQVQVQNK+Q A LP E
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 VQQSGVTVEKSQSSFLLILAVYDKTNRATSSDISDWLVSNMQDPLARVEGVGSLQVFGAE 182
VQQ G++VEKS SS+L++ T DISD++ SN++D L+R+ GVG +Q+FGA+
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181

Query: 183 YAMRVWMDPTKLASYSLMPSDVQSAIEAQNVQVSAGKIGALPSSNAQQLTATVRAQSRLQ 242
YAMR+W+D L Y L P DV + ++ QN Q++AG++G P+ QQL A++ AQ+R +
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 243 TPDQFKAIIVKSQADGSVVRLSDVARVEMGSEDYTATANLNGHPAAGIAVMMAPGANALD 302
P++F + ++ +DGSVVRL DVARVE+G E+Y A +NG PAAG+ + +A GANALD
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 303 TATLVKSKIAEFQRQMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIILVVCVMYLFLQN 362
TA +K+K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 363 FRATLIPAVAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422
RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSVTIISAMMLS 482
++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFS+TI+SAM LS
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 483 VVVALTLTPALCGALL----SHSKPHTKGFFGAFNRLWRRTEAGYQRRVLGGLRRGAVMM 538
V+VAL LTPALC LL + + GFFG FN + + Y V L +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 539 GAYALICGAMALAMWKLPGSFLPVEDQGEIMVQYTLPAGATAVRTAEVRRQVTDWFLTKE 598
YALI M + +LP SFLP EDQG + LPAGAT RT +V QVTD++L E
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 599 KANTDVIFTVDGFSFSGSGQNAGMAFVSLKNWSQRKGDDNTAQAIALRATKELGTIRDAT 658
KAN + +FTV+GFSFSG QNAGMAFVSLK W +R GD+N+A+A+ RA ELG IRD
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 659 LFAMTPPSVDGLGQSNGFTFELMASGGTDRDSLMKLRSQLLAAANQS-SELQSVRANDLP 717
+ P++ LG + GF FEL+ G D+L + R+QLL A Q + L SVR N L
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 718 QMPQLQVDIDNNKAVSLGLSLSDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGESDARAV 777
Q ++++D KA +LG+SLSD+ T+S+A GGTYVNDFIDRGRVKK+Y+Q ++ R +
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 778 PSDLGKWFVRGSDNSMTPFSAFATTHWQYGPESLVRYNGSAAFEIQGENAAGFSSGAAMD 837
P D+ K +VR ++ M PFSAF T+HW YG L RYNG + EIQGE A G SSG AM
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 838 KMEKLADSLPAGSTWAWSGISLQEKLASGQAMSLYAISILVVFLCLAALYESWSVPFSVI 897
ME LA LPAG + W+G+S QE+L+ QA +L AIS +VVFLCLAALYESWS+P SV+
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 898 MVIPLGLLGAALAATLRGLSNDVYFQVALLTTIGLSSKNAILIVEFAESAVD-EGYSLSR 956
+V+PLG++G LAATL NDVYF V LLTTIGLS+KNAILIVEFA+ ++ EG +
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 957 AAIRAAQTRLRPIVMTSLAFIAGVLPLAIATGAGANSRVAIGTGIIGGTLTATLLAVFFV 1016
A + A + RLRPI+MTSLAFI GVLPLAI+ GAG+ ++ A+G G++GG ++ATLLA+FFV
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 1017 PLFFVLVKRLF 1027
P+FFV+++R F
Sbjct: 1022 PVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21180  TCRTETA       67     2e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 67.2 bits (164), Expect = 2e-14
Identities = 78/339 (23%), Positives = 130/339 (38%), Gaps = 25/339 (7%)

Query: 27 LPALPEITQQLQATSTQTQLSLTAALIGLGLGQLFFGP----LSDHIGRLKPLALSLLLF 82
+P LP + + L S L L Q P LSD GR L +SL
Sbjct: 25 MPVLPGLLRDLVH-SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83

Query: 83 IFSSAMCALTRDINMLIVWRFLQGFAGAGGSVLSRSIARDKYQGTLLTQFFALLMTVNGI 142
A+ A + +L + R + G GA G+V IA D G + F + G
Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGF 142

Query: 143 APVLSPVLGGYVITAFDWRILFWTMAAIGGVLLVMSLAILRETRPATAAHASRQRPGQPV 202
V PVLGG + F F+ AA+ G+ + +L E+ R+
Sbjct: 143 GMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNP-- 199

Query: 203 LKNRRFLRFCLIQAFMMA-----GLFSYIGSSSFVMQSE--YGMSAMQFSLLFGLNGI-G 254
L + R+ R + A +MA L + ++ +V+ E + A + GI
Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259

Query: 255 LIIAAMFFSRLARRFSAESLLRGGLTLAVSCAAIMLLFA---WLHLPVLALVGL--FFTV 309
+ AM +A R L G+ +A I+L FA W+ P++ L+
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGM-IADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318

Query: 310 SLMSGISTVAGAEAMSAVDAAQSG--TASALMGTLMFVF 346
+L + +S E + + + + ++++G L+F
Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357


74  D364_RS21430  D364_RS21680  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS21430  3       11       -1.494832   YceK/YidQ family lipoprotein
D364_RS21435  3       12       -0.910796   DUF3748 domain-containing protein
D364_RS21440  3       14       -0.644899   MFS transporter
D364_RS21445  3       15       -0.876563   galactonate dehydratase
D364_RS21450  3       13       -1.515620   2-dehydro-3-deoxy-6-phosphogalactonate aldolase
D364_RS21455  3       17       -1.639594   2-dehydro-3-deoxygalactonokinase
D364_RS21460  0       15       -1.292414   D-galactonate utilization transcriptional
D364_RS21465  -2      11       -1.801353   BamA/TamA family outer membrane protein
D364_RS21470  -1      11       -1.914799   hypothetical protein
D364_RS21475  -1      13       -1.755528   sugar-phosphatase
D364_RS21480  0       16       -1.946440   DNA topoisomerase (ATP-hydrolyzing) subunit B
D364_RS21485  1       18       -1.849862   DNA replication/repair protein RecF
D364_RS21490  3       17       -1.633447   DNA polymerase III subunit beta
D364_RS21495  3       17       -1.785005   chromosomal replication initiator protein DnaA
D364_RS21500  2       17       -2.269367   50S ribosomal protein L34
D364_RS21505  0       14       -0.825557   ribonuclease P protein component
D364_RS21510  -1      13       -0.214129   membrane protein insertion efficiency factor
D364_RS26470  -2      11       0.411491    membrane protein insertase YidC
D364_RS21520  -1      11       0.645419    tRNA uridine-5-carboxymethylaminomethyl(34)
D364_RS21525  0       13       0.839221    GNAT family N-acetyltransferase
D364_RS21530  2       12       1.713018    MFS transporter
D364_RS21535  0       11       0.747597    HTH-type transcriptional regulator YidZ
D364_RS21540  0       10       0.170106    phosphopantetheinyl transferase
D364_RS21545  0       12       -0.829568   NAD(P)H-dependent oxidoreductase
D364_RS21550  -2      13       -1.287091   NCS2 family permease
D364_RS21555  -1      12       -3.063392   6-phosphogluconate phosphatase
D364_RS21560  -2      10       -3.153707   glucosamine-6-phosphate deaminase
D364_RS21570  -2      9        -4.244868   esterase family protein
D364_RS21575  -2      11       -3.852256   carbohydrate porin
D364_RS21580  -2      12       -3.477036   phosphate signaling complex protein PhoU
D364_RS21585  -2      16       -3.846590   phosphate ABC transporter ATP-binding protein
D364_RS21595  -1      22       -2.178158   phosphate ABC transporter permease PstA
D364_RS21600  -2      21       -1.600226   phosphate ABC transporter permease PstC
D364_RS21605  -2      20       -1.376927   phosphate ABC transporter substrate-binding
D364_RS21610  0       25       -2.056437   glutamine--fructose-6-phosphate transaminase
D364_RS21620  2       32       -2.513601   bifunctional UDP-N-acetylglucosamine
D364_RS21625  2       34       -3.271243   F0F1 ATP synthase subunit epsilon
D364_RS21645  3       41       -4.577307   F0F1 ATP synthase subunit beta
D364_RS21650  4       43       -4.544938   F0F1 ATP synthase subunit gamma
D364_RS21655  3       36       -5.312425   F0F1 ATP synthase subunit alpha
D364_RS21660  4       38       -5.628144   F0F1 ATP synthase subunit delta
D364_RS21665  4       27       -6.271864   F0F1 ATP synthase subunit B
D364_RS21670  2       23       -4.089527   F0F1 ATP synthase subunit C
D364_RS21675  1       20       -3.624638   F0F1 ATP synthase subunit A
D364_RS21680  1       18       -3.860354   F0F1 ATP synthase subunit I
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21450  TCRTETA       42     4e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.7 bits (98), Expect = 4e-06
Identities = 58/348 (16%), Positives = 110/348 (31%), Gaps = 35/348 (10%)

Query: 66 AEMGYVFSAFAWLYTLCQIPGGWFLDRVGSRLTYFIAIFGWSVATLLQGFATGLMSLIGL 125
A G + + +A + C G DR G R +++ G +V + A L L
Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102

Query: 126 RAITGIFEAPAFPTNNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQELLSWH 185
R + GI A + ERA GF ++ G+ P+L + S H
Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160

Query: 186 WVFIVTGGIGIIWSLIWFKVYQPPRLTKSISKAELDYIRDGGGLVDGDAPVKKEARQPLS 245
F + + L + + P+++EA PL+
Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE-------------------RRPLRREALNPLA 201

Query: 246 KADWKLVFHRKLVGVYLGQFAVTSTLWFFLTWFPNYLTQEKGITALKAGFMTTV-PFLAA 304
W + + F + + + A G L +
Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS 260

Query: 305 FFGVLLSGWLADKLVKKGYSLGVARKTPIICGLLISTC--IMGANYTNDPIWIMALMALA 362
+++G +A +L + ++ G++ I+ A T + ++ LA
Sbjct: 261 LAQAMITGPVAARL---------GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311

Query: 363 FFGNGFASITWSLVSSLAPMRLIGLTGGVFNFVGGLGGITVPLVIGYL 410
G G ++ +++S G G + L I PL+ +
Sbjct: 312 SGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21525  60KDINNERMP   814    0.0      60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 814 bits (2105), Expect = 0.0
Identities = 476/549 (86%), Positives = 510/549 (92%), Gaps = 2/549 (0%)

Query: 1 MDSQRNLLIIALLFVSFMIWQAWEQDKNPQPQ-QQTTQTTTTAAGSAADQGVPASGQGKL 59
MDSQRNLL+IALLFVSFMIWQAWEQDKNPQPQ QQTTQTTTTAAGSAADQGVPASGQGKL
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60

Query: 60 ITVKTDVLELTINTNGGDIEQALLLAYPKTLKSTEPFQLLETTPQFVYQAQSGLTGRDGP 119
I+VKTDVL+LTINT GGD+EQALL AYPK L ST+PFQLLET+PQF+YQAQSGLTGRDGP
Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120

Query: 120 DNPANGPRPLYNVDKEAFVLADGQDELVIPLTYTDKAGNVFTKTFTLKRGGYAVNVGYSV 179
DNPANGPRPLYNV+K+A+VLA+GQ+EL +P+TYTD AGN FTKTF LKRG YAVNV Y+V
Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180

Query: 180 QNASEKPLEVSTFGQLKQTAALPTSRDTQTGGLSTMHTFRGAAFSTADSKYEKYKFDTIL 239
QNA EKPLE+S+FGQLKQ+ LP DT + + +HTFRGAA+ST D KYEKYKFDTI
Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFA-LHTFRGAAYSTPDEKYEKYKFDTIA 239

Query: 240 DNENLNVSTKNGWVAMLQQYFTTAWVPRNNGTNNFYTANLGNGVVAIGYKSQPVLVQPGQ 299
DNENLN+S+K GWVAMLQQYF TAW+P N+GTNNFYTANLGNG+ AIGYKSQPVLVQPGQ
Sbjct: 240 DNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQ 299

Query: 300 TDKLQSTLWVGPAIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKFIHSFLGNWGFSII 359
T + STLWVGP IQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLK+IHSF+GNWGFSII
Sbjct: 300 TGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSII 359

Query: 360 VITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNP 419
+ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNP
Sbjct: 360 IITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNP 419

Query: 420 LGGCFPLIIQMPIFLALYYMLSASVELRHAPFILWIHDLSAQDPYYILPIIMGATMFFIQ 479
LGGCFPL+IQMPIFLALYYML SVELR APF LWIHDLSAQDPYYILPI+MG TMFFIQ
Sbjct: 420 LGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQ 479

Query: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVVYYIVSNLVTIIQQQLIYRGLEKRG 539
KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLV+YYIVSNLVTIIQQQLIYRGLEKRG
Sbjct: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRG 539

Query: 540 LHSREKKKS 548
LHSREKKKS
Sbjct: 540 LHSREKKKS 548


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21535  SACTRNSFRASE  38     5e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 5e-06
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 69 IVDVAVDPAHQGKGLGRLVMEKLVAWLDANAFDGSYV-TLVADVP--ELYAKFGF 120
I D+AV ++ KG+G ++ K + W N F G + T ++ YAK F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21540  TCRTETA       58     2e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 58.3 bits (141), Expect = 2e-11
Identities = 67/311 (21%), Positives = 118/311 (37%), Gaps = 14/311 (4%)

Query: 5 LLCSFALVLLYPSGIDMYLVGLPRIAQDLGASEAQLHIAFSVYLAGMASAML----FAGR 60
L+ + V L GI + + LP + +DL S + + + LA A G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGA 65

Query: 61 IADRSGRKPVAIVGAAIFVIASLICAQAHTSSHFLIGRFIQGIGAGSCYVVAFAILRDTL 120
++DR GR+PV +V A + I A A IGR + GI G+ VA A + D
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124

Query: 121 DDRRRAKVLSLLNGITCIIPVLAPVLGHLIMLKYPWQSLFYTMTGMGVMVAVLSVFILRE 180
D RA+ ++ V PVLG L M + + F+ + + + F+L E
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 181 TRPTAPPQAASPQHDAGESLLNRFFLSRLLITTLSVTVILTYVNVSPVLMMEEMGFDRGT 240
+ + S + ++ ++V I+ V P + G DR
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242

Query: 241 YSMAM------ALMAMISMAVSFSTPFALSLFNPRTLMLTSQVLFLAAGVTLSLATRQAV 294
+ A + S+A + T + R ++ + + L+ ATR +
Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302

Query: 295 TLIGLGMICAG 305
+ ++ +G
Sbjct: 303 AFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21560  TYPE3IMSPROT  31     0.013    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 30.5 bits (69), Expect = 0.013
Identities = 22/114 (19%), Positives = 43/114 (37%), Gaps = 3/114 (2%)

Query: 331 LTAVVVGILFLLVIFLSPLAGMVPGYAAAGALIYVGVLMTSSLARVKWSDLTEAVPA--- 387
L+ VV +L PL + A A ++ G L++ + + A
Sbjct: 72 LSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI 131

Query: 388 FITAVMMPFSFSITEGIALGFISYCVMKIGTGRLRELSPCVIIVSLLFVLKIVF 441
F ++ F SI + + L + + ++K L +L C I + +I+
Sbjct: 132 FSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILR 185


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21580  OUTRMMBRANEA  30     0.014    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 30.3 bits (68), Expect = 0.014
Identities = 14/23 (60%), Positives = 16/23 (69%)

Query: 1 MKRHAIYFALALAGAAFTLQAAP 23
MK+ AI A+ALAG A QAAP
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAP 23


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21670  TRNSINTIMINR  29     0.009    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 28.9 bits (64), Expect = 0.009
Identities = 15/41 (36%), Positives = 21/41 (51%), Gaps = 3/41 (7%)

Query: 84 SQILDEAKAEAEQERTKIV---AQAQAEIDAERKRAREELR 121
QI +AK E R + V AQAQ + + R +EEL+
Sbjct: 319 EQIAQQAKEAGEVARQQAVESNAQAQQRYEDQHARRQEELQ 359


75  D364_RS21755  D364_RS21850  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS21755  1       22       -3.627216   multidrug transporter subunit MdtD
D364_RS21760  -1      24       -4.294977   FadR family transcriptional regulator
D364_RS21790  -1      22       -5.740960   *molybdopterin-guanine dinucleotide biosynthesis
D364_RS21795  -1      15       -3.650203   molybdenum cofactor guanylyltransferase MobA
D364_RS21800  0       15       -3.700702   YihD family protein
D364_RS21805  0       14       -3.213785   serine/threonine protein kinase
D364_RS21810  1       13       -3.172505   thiol:disulfide interchange protein DsbA
D364_RS21815  1       14       -2.817161   acyltransferase
D364_RS21820  1       14       -1.668853   DNA polymerase I
D364_RS21825  0       16       -1.529759   ribosome biogenesis GTP-binding protein
D364_RS21830  -2      14       -1.753459   Der GTPase-activating protein YihI
D364_RS21835  1       19       -2.122762   oxygen-independent coproporphyrinogen III
D364_RS27205  1       21       -2.172648   YshB family small membrane protein
D364_RS21840  0       19       -2.393813   nitrogen regulation protein NR(I)
D364_RS21845  2       24       -2.888352   nitrogen regulation protein NR(II)
D364_RS21850  2       24       -2.964043   glutamate--ammonia ligase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21760  TCRTETB       134    7e-37    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 134 bits (339), Expect = 7e-37
Identities = 102/408 (25%), Positives = 177/408 (43%), Gaps = 20/408 (4%)

Query: 12 LPWIAAMAFFMQALDATILNTALPAIAHSLNRSPLAMQSAIISYTLTVAMLIPVSGWLAD 71
L W+ ++FF L+ +LN +LP IA+ N+ P + ++ LT ++ V G L+D
Sbjct: 16 LIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 72 RFGTRRVFIIAVSLFTLGSLACALSSSLTELVIF-RVIQGIGGAMMMPVARLALLRAYPR 130
+ G +R+ + + + GS+ + S L+I R IQG G A + + + R P+
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 131 SELLPVLNFVTMPGLVGPILGPVLGGVFVTWASWHWIFLINIP-IGVIGILYARKYMPNF 189
+ +G +GP +GG+ + HW +L+ IP I +I + + K +
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKE 192

Query: 190 TTPRRRFDIGGFLLFGLSLVLFSSGIELFGEKIVATWQALAVIAVSLLLLVAYVRHARRH 249
+ FDI G +L + +V F + L V +S L+ +V+H R+
Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTT------SYSISFLIVSVLSFLI---FVKHIRKV 243

Query: 250 PTPLISLSLFKTHTFSVGIAGNLATRLGTGCVPFLMPLMLQVGFGY-PAIIAGCMIAPTA 308
P + L K F +G+ ++P M++ A I +I P
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 309 IGSIIAKSTVTQVLRWFGYRKTLVGITVF--IGLMIAQFSLQSPEMPLWMLLLPLFVLGM 366
+ II ++ G L F + + A F L++ +M ++ +FVLG
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETT--SWFMTIIIVFVLGG 361

Query: 367 AMSTQFTAMNTITLADLTDDNASSGNSLLAVTQQLSISLGVAISAAVL 414
T+ T ++TI + L A +G SLL T LS G+AI +L
Sbjct: 362 LSFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS21815  PRPHPHLPASEC  29  0.023  Prokaryotic zinc-dependent phospholipase C signature.
		>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature.

Length = 398

Score = 29.2 bits (65), Expect = 0.023
Identities = 11/31 (35%), Positives = 13/31 (41%)

Query: 69 NPWLKWDVQGLEGLNKKNWYLLISNHHSWAD 99
N W K +G K +Y S HSW D
Sbjct: 214 NAWSKEYARGFAKTGKSIYYSHASMSHSWDD 244


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS21830  SECA  28  0.022  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.9 bits (62), Expect = 0.022
Identities = 12/57 (21%), Positives = 23/57 (40%)

Query: 15 KSREELNQEARDRKRQKKHRGHAAGSRANGGDAASAGKKQRQAQDPRVGSKKPIPLG 71
+ EE+ + + R+ + + D+A+A Q + +VG P P G
Sbjct: 832 RMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCG 888


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS21840  HTHFIS  600  0.0  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 600 bits (1549), Expect = 0.0
Identities = 203/478 (42%), Positives = 297/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIAWIVDDDSSIRWVLERALTGAGLSCTTFESGNEVLDALTTKTPDVLLSDIRMPGM 60
M + DDD++IR VL +AL+ AG + + + D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVDRAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNAPISSPTADIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRSKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTVRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQIAARELGVEAKQLHPETETALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K+ E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLTQDLPSELFETTIPDSPTQMQPDSWATLLGQWADRALRS---- 416
EN R LT + + + + +EL + S + + Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPEMERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L EME L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


76  D364_RS23000  D364_RS23075  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS23000  -1  15  -3.552737  PTS mannose/fructose/sorbose transporter subunit
D364_RS23005  0  15  -4.583508  mannose/fructose/sorbose PTS transporter subunit
D364_RS23010  -1  13  -2.411083  PTS sugar transporter subunit IIA
D364_RS23015  -1  13  -1.587561  SDR family oxidoreductase
D364_RS23020  0  15  -1.225054  sugar-binding transcriptional regulator
D364_RS23025  1  15  -0.785181  23S rRNA pseudouridine(2604) synthase RluF
D364_RS23030  -1  14  -0.429360  DUF3811 domain-containing protein
D364_RS23035  0  14  -0.203047  ketopantoate/pantoate/pantothenate transporter
D364_RS23040  0  15  -1.125235  lysine-sensitive aspartokinase 3
D364_RS23045  1  14  -3.038610  glucose-6-phosphate isomerase
D364_RS23050  4  24  -4.815353  N-acetyltransferase
D364_RS23055  4  25  -5.823034  GNAT family N-acetyltransferase
D364_RS23060  3  21  -4.552907  Rid family detoxifying hydrolase
D364_RS23065  1  16  -4.391372  AzlD domain-containing protein
D364_RS23070  2  17  -4.102340  AzlC family ABC transporter permease
D364_RS23075  0  16  -3.447081  serine dehydratase subunit alpha family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23015  DHBDHDRGNASE  115  4e-33  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (289), Expect = 4e-33
Identities = 79/271 (29%), Positives = 126/271 (46%), Gaps = 26/271 (9%)

Query: 7 LKDNVIIVTGGASGIGLAIVDELLSQGAHVQMIDIHGGDRHHNGDNYHF-------WPTD 59
++ + +TG A GIG A+ L SQGAH+ +D + + +P D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 60 ISSATEVQQTIDAIIQRWSRIDGLVNNAGVNFPRLLVDEKAPAGRYELNEAAFEKMVNIN 119
+ + + + I + ID LVN AGV P L+ + L++ +E ++N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLI---------HSLSDEEWEATFSVN 116

Query: 120 QKGVFFMSQAVARQMVKQRAGVIVNVSSESGLEGSEGQSCYAATKAALNSFTRSWSKELG 179
GVF S++V++ M+ +R+G IV V S + YA++KAA FT+ EL
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 180 KYGIRVVGVAPGILEKTGLRTPEYEEALAWTRNITVEQLREGYT---KNAIPIGRAGKLS 236
+Y IR V+PG E + W EQ+ +G K IP+ + K S
Sbjct: 177 EYNIRCNIVSPGSTETDMQWS-------LWADENGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 237 EVADFVCYLLSARASYITGVTTNIAGGKTRG 267
++AD V +L+S +A +IT + GG T G
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23020  HTHFIS  28  0.045  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.045
Identities = 6/21 (28%), Positives = 12/21 (57%)

Query: 24 QAQIARELGIYRTTISRLLKR 44
Q + A LG+ R T+ + ++
Sbjct: 452 QIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23030  56KDTSANTIGN  25  0.037  Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 25.3 bits (55), Expect = 0.037
Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 5/75 (6%)

Query: 2 ALPRITQKEMTEREQRELKTLLDRARIAHGRPLSNAETNSVKKEYIDKLMAQREAEAKKA 61
LP E + + +EL L+ R + ++NA N + ++ AQ++
Sbjct: 290 GLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQQQG---- 345

Query: 62 RQVKKQQAYKTDKEA 76
Q ++QQA T +EA
Sbjct: 346 -QGQQQQAQATAQEA 359


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23045  BCTERIALGSPD  32  0.007  Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.007
Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 13/79 (16%)

Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHVALRNRSNTPIVVDGKDVMPEVN 121
AK +DL + + S + + D+ ++ + ++N IV DVM ++
Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334

Query: 122 AVLEKM-----KTFSEAII 135
V+ ++ + EAII
Sbjct: 335 RVIAQLDIRRPQVLVEAII 353


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23055  SACTRNSFRASE  27  0.035  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.035
Identities = 11/43 (25%), Positives = 14/43 (32%)

Query: 102 GHRYGEHIFHAVETRAKTAGESWLWLEVLAANPAARRFYERQG 144
G + H AK L LE N +A FY +
Sbjct: 103 KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145


77  D364_RS23695  D364_RS23785  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS23695  1  23  -3.383114  membrane protein FxsA
D364_RS23700  -1  32  -7.210033  L-methionine/branched-chain amino acid
D364_RS23710  -1  43  -11.761631  co-chaperone GroES
D364_RS23715  0  43  -11.543727  chaperonin GroEL
D364_RS23720  2  54  -15.137147  DUF4156 domain-containing protein
D364_RS23725  2  43  -12.640117  hypothetical protein
D364_RS23730  3  44  -12.516782  HlyD family secretion protein
D364_RS23735  4  39  -11.141476  hypothetical protein
D364_RS23740  4  35  -9.771638  fimbrial protein
D364_RS23745  2  30  -8.515201  fimbrial biogenesis outer membrane usher
D364_RS23750  2  22  -6.587229  molecular chaperone
D364_RS23755  0  26  -7.146008  type 1 fimbrial protein
D364_RS23760  1  27  -6.237238  EF-P beta-lysylation protein EpmB
D364_RS23765  -1  20  -5.202924  elongation factor P
D364_RS23770  -1  15  -1.280141  entericidin A/B family lipoprotein
D364_RS23775  0  21  -1.673038  lipoprotein toxin entericidin B
D364_RS23780  3  20  -0.840272  quaternary ammonium compound efflux SMR
D364_RS23785  2  18  -0.904248  lipocalin family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23710  TYPE3OMOPROT  28  0.005  Type III secretion system outer membrane O protein ...
		>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature.

Length = 303

Score = 28.0 bits (62), Expect = 0.005
Identities = 13/55 (23%), Positives = 26/55 (47%), Gaps = 9/55 (16%)

Query: 36 TRGEIIAVGKGRILENGT--VQPLDVKV-------GDIVIFNDGYGVKTEKIDNE 81
T E+ A+G+ ++L T +++ G++V ND GV+ + +E
Sbjct: 244 TLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSE 298


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23735  RTXTOXIND  114  6e-30  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 114 bits (287), Expect = 6e-30
Identities = 70/430 (16%), Positives = 142/430 (33%), Gaps = 67/430 (15%)

Query: 30 AWLVALLSFAFLAILIATTVFCSFTQRIDVQGEVITLPHSVNVYAPQQGFVISQYVKVGD 89
A+ + F +A +++ V G++ S + + V VK G+
Sbjct: 61 AYFIMG--FLVIAFILS--VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGE 116

Query: 90 IVTKGQPLYEIDISRNTTTGNVSAVQIEVINEKIANAEDIISK----------------- 132
V KG L ++ + Q ++ ++ I
Sbjct: 117 SVRKGDVLLKLTALG--AEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 133 -----LNHNKEETTISLEKQLKTINDSLKETNRMLANAQAGLKKMH-------------- 173
T +++Q T + + L +A +
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234

Query: 174 DNLSSYDKYLSDGLITKDQYNYQHSLYFQQQSTYQSLVSQKMQLESQVTQLNSDKITKIA 233
L + L I K Q + Y + + + SQ Q+ES++ +
Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294

Query: 234 DFDNQISSQ----ENQINDYKNQLVESNAN-GNIIIKATTEGRIESLTV-TKGQMVDKGS 287
F N+I + + I +L ++ +I+A +++ L V T+G +V
Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354

Query: 288 SLAQIKPTGDIEYYLILWLPNNSIPYVKPGDEINIRYAAFPSDKFGQFPGKILSIS--SV 345
+L I P D + + N I ++ G I+ AFP ++G GK+ +I+ ++
Sbjct: 355 TLMVIVPEDD-TLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 346 PTSRQEMSEYTNVTNGTNQQELALYKTIVKIENKTFEYNGKTLSLSNGLKAQAVVFLEER 405
R + + I+ IE K + LS+G+ A + R
Sbjct: 414 EDQRLGLV----------------FNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457

Query: 406 PLYMWMFTPV 415
+ ++ +P+
Sbjct: 458 SVISYLLSPL 467


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23750  PF00577  733  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 733 bits (1894), Expect = 0.0
Identities = 318/881 (36%), Positives = 478/881 (54%), Gaps = 63/881 (7%)

Query: 8 LFKLSTIFFAMLPA-LLSGLNNKAQARDFFDPSFISSLNGSDPSTTPDLSVFQTQNAQAP 66
+L+ F + A + + A +F+P F++ DP DLS F+ P
Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLAD----DPQAVADLSRFENGQELPP 75

Query: 67 GDYRVDIMFNGRYLDTRTIKFVANNRASSDNREPALVPCLSLKALAEYGVRIKSFPELA- 125
G YRVDI N Y+ TR + F + +VPCL+ LA G+ S +
Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQG------IVPCLTRAQLASMGLNTASVSGMNL 129

Query: 126 EDQNGCANF-SVIPDTKADFDFTAQRLNISIPQAALSTTAQGYIPPDQFDDGINALLVNY 184
+ C S+I D A D QRLN++IPQA +S A+GYIPP+ +D GINA L+NY
Sbjct: 130 LADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNY 189

Query: 185 QFSGS---NDMQANDEYYSLNLQSGLNVGPWRIRNLSTWNKN-----NGDAGDWDSAYLY 236
FSG+ N + N Y LNLQSGLN+G WR+R+ +TW+ N +G W +
Sbjct: 190 NFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249

Query: 237 MQRSIRSINSNLVMGESSSLSTIFDSVPFTGIQLATDTTMLPESMRGYAPIIRGIAKTNA 296
++R I + S L +G+ + IFD + F G QLA+D MLP+S RG+AP+I GIA+ A
Sbjct: 250 LERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTA 309

Query: 297 RVVIKQNGYQVYQTYVAPGAFEITDMYPSGGSGDLYVSVEESDGSKQEFVVPFATLPVMV 356
+V IKQNGY +Y + V PG F I D+Y +G SGDL V+++E+DGS Q F VP++++P++
Sbjct: 310 QVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQ 369

Query: 357 RENQLEYEITSGKYRPYDGGVDETPFTQATATYGVSSSLTLYGGMQAASRYQALSTGLGY 416
RE Y IT+G+YR + ++ F Q+T +G+ + T+YGG Q A RY+A + G+G
Sbjct: 370 REGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429

Query: 417 NLGELGAASADVTQAWSKKKDDEKTSGQSWRVRYGKNIVETGTNVTIAGYRYSTRGFNTL 476
N+G LGA S D+TQA S DD + GQS R Y K++ E+GTN+ + GYRYST G+
Sbjct: 430 NMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489

Query: 477 SEVLDSYSNDG------------------NYTSRSLRNRTNLTVNQSLGKGLGSLSISGL 518
++ S N + + R + LTV Q LG+ +L +SG
Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGS 548

Query: 519 IEDYWDDKRTNKSISVGYNGGFRNVNYYLGYSYNRYTWSGNNSGKDAQDDQRITLTVTLP 578
+ YW ++ G N F ++N+ L YS + W DQ + L V +P
Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-------DQMLALNVNIP 601

Query: 579 LSNWLPG--------TYTSYQLTNSNPGSTDQSVSIGGVGLDNDSLEWSLQQGYSNREYY 630
S+WL SY +++ G + G L++++L +S+Q GY+
Sbjct: 602 FSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDG 661

Query: 631 SGDMRG----TYNGARGSLNAGYSYDNNSQRIDYGANGSIVAHADGITLGQDITDAAVLV 686
+ G Y G G+ N GYS+ ++ +++ YG +G ++AHA+G+TLGQ + D VLV
Sbjct: 662 NSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLV 721

Query: 687 KAPGLDNVKLTNDNTISTDYRGYAIVPYVTPYRRTDITLDSTTLGEDMELPETTKSVVPT 746
KAPG + K+ N + TD+RGYA++PY T YR + LD+ TL ++++L +VVPT
Sbjct: 722 KAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPT 781

Query: 747 RGAIVRANYDGNIGQRAFVHLKTASGQDVPYGAMVLLAGDSKSQPSIVSDAGMVYMSGLQ 806
RGAIVRA + +G + + L T + + +P+GAMV IV+D G VY+SG+
Sbjct: 782 RGAIVRAEFKARVGIKLLMTL-THNNKPLPFGAMVTSESS--QSSGIVADNGQVYLSGMP 838

Query: 807 ETGILNVQWGKSAAQQCNASFTLPAREGKASGISQIETVCR 847
G + V+WG+ C A++ LP + ++Q+ CR
Sbjct: 839 LAGKVQVKWGEEENAHCVANYQLPPESQQQ-LLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23795  BCTLIPOCALIN  250  2e-88  Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 250 bits (640), Expect = 2e-88
Identities = 71/152 (46%), Positives = 104/152 (68%), Gaps = 1/152 (0%)

Query: 25 PPGVTVVSPFDVQRYLGTWYEIARFDHPFESGLEKVTIAWHPRDDGGLDVVNKGYNPDRG 84
P V VS F++ YLG WYE+AR DH FE GL +VT + R+DGG+ V+N+GY+ ++G
Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79

Query: 85 MWQKTDGVAYFTGEPSRAALKISFFGPFYGSYNVIALDKE-YRYALVCGPDRDYLWLLAR 143
W++ +G AYF + LK+SFFGPFYGSY V LD+E Y YA V GP+ +YLWLL+R
Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139

Query: 144 APTIAPEVRQQMLDIATRQGFDVGKLVWVNQR 175
PT+ + + ++++ +GFD +L++V Q+
Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQQ 171


78  D364_RS23900  D364_RS23945  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS23900  2  21  -1.330416  protease modulator HflC
D364_RS23905  3  22  -1.609291  hypothetical protein
D364_RS23910  4  21  -1.261047  DUF2065 domain-containing protein
D364_RS23915  3  18  -0.366369  adenylosuccinate synthase
D364_RS23925  1  13  1.264390  nitric oxide-sensing transcriptional repressor
D364_RS23930  2  13  1.251712  ribonuclease R
D364_RS23935  1  10  1.744821  23S rRNA
D364_RS23940  1  10  2.412606  isovaleryl-CoA dehydrogenase
D364_RS23945  2  13  2.814638  DUF1471 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23940  IGASERPTASE  33  0.008  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.7 bits (74), Expect = 0.008
Identities = 27/103 (26%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 709 RVEAVNMDERKIDFTLISSERAPRNVGKTAREKAKKSTSGKPGGRRRQVGKQVNFEPDSA 768
E V + ++ T+ +E+ RE AK++ S + Q E
Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095

Query: 769 FRKE-KETARPKKEKKAKKPSAKTQKIAAATKAKRAAKKKIAE 810
E KETA +KE+KAK + KTQ++ T ++ + K++ +E
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVT-SQVSPKQEQSE 1137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS23950  ACRIFLAVINRP  30  0.037  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 29.8 bits (67), Expect = 0.037
Identities = 25/80 (31%), Positives = 31/80 (38%), Gaps = 11/80 (13%)

Query: 11 QPTPLNNSNLFLSD--TALREAVVREGAGWDGDLLASIGQQLGTAESLELGRLANSNPPE 68
LN L D L+ + AG G A GQQL A + R NP E
Sbjct: 189 DADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL-NASIIAQTRFK--NPEE 245

Query: 69 L----LRYDATGA--RLDDV 82
LR ++ G+ RL DV
Sbjct: 246 FGKVTLRVNSDGSVVRLKDV 265


79  D364_RS24320  D364_RS24380  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS24320  2  22  2.759095  alpha,alpha-phosphotrehalase
D364_RS24325  1  22  2.559998  PTS trehalose transporter subunit IIBC
D364_RS24330  0  20  2.388360  trehalose operon repressor TreR
D364_RS24335  0  21  1.655206  magnesium-translocating P-type ATPase
D364_RS24340  -1  17  -3.662188  2-iminobutanoate/2-iminopropanoate deaminase
D364_RS24345  -2  16  -5.067889  aspartate carbamoyltransferase regulatory
D364_RS24350  -2  26  -5.836035  aspartate carbamoyltransferase
D364_RS26580  0  35  -7.217465  pyrBI operon leader peptide
D364_RS24355  -1  35  -7.727118  YhcH/YjgK/YiaL family protein
D364_RS24365  -1  30  -6.899305  SIS domain-containing protein
D364_RS24370  -2  27  -6.236331  PfkB domain-containing protein
D364_RS24375  -2  20  -4.801667  amidohydrolase family protein
D364_RS24380  -1  12  -3.033161  YfcC family protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24375  UREASE  37  2e-04  Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 36.6 bits (85), Expect = 2e-04
Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 462 WTLNSARHHGMEEMTGSLEPGKRADIAVFD 491
+T+N A HG+ GSLE GKRAD+ +++
Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWN 438


80  D364_RS24435  D364_RS24765  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
D364_RS24435  -1  18  3.628460  DUF853 domain-containing protein
D364_RS24440  -1  19  3.889493  TIM barrel protein
D364_RS24445  -1  21  3.946756  CoA-acylating methylmalonate-semialdehyde
D364_RS24450  -1  20  3.229376  5-deoxy-glucuronate isomerase
D364_RS24455  -1  20  2.778887  MurR/RpiR family transcriptional regulator
D364_RS24460  -1  22  2.766026  5-dehydro-2-deoxygluconokinase
D364_RS24470  -1  23  2.363052  3D-(3,5/4)-trihydroxycyclohexane-1,2-dione
D364_RS24475  -1  23  1.576849  Gfo/Idh/MocA family oxidoreductase
D364_RS24480  -1  23  1.612458  sugar phosphate isomerase/epimerase
D364_RS24485  0  25  2.343377  myo-inosose-2 dehydratase
D364_RS24490  0  26  3.236412  ABC transporter substrate-binding protein
D364_RS24495  2  18  4.680291  ABC transporter ATP-binding protein
D364_RS24500  2  20  5.244438  phosphodiesterase
D364_RS24505  1  16  3.119784  carbohydrate ABC transporter permease
D364_RS24510  -1  18  0.179665  sugar ABC transporter permease
D364_RS24515  -1  26  -2.967434  NAD(P)-dependent alcohol dehydrogenase
D364_RS24520  -1  38  -5.880365  carbonic anhydrase
D364_RS24525  0  39  -9.936019  *tyrosine-type recombinase/integrase
D364_RS24535  0  46  -12.090454  nucleotidyl transferase AbiEii/AbiGii toxin
D364_RS26600  2  59  -16.296876  hypothetical protein
D364_RS26605  1  46  -10.459951  hypothetical protein
D364_RS24545  1  41  -7.625419  GNAT family N-acetyltransferase
D364_RS24550  0  29  -3.968319  TraI domain-containing protein
D364_RS24555  0  27  -3.260339  hypothetical protein
D364_RS24560  0  30  -5.932562  DUF1738 domain-containing protein
D364_RS24565  1  31  -5.802795  hypothetical protein
D364_RS24570  3  36  -7.948976  type I restriction-modification system subunit
D364_RS24575  4  38  -8.468517  restriction endonuclease subunit S
D364_RS24580  4  40  -9.001476  type I restriction endonuclease subunit R
D364_RS24585  4  42  -9.514539  HNH endonuclease
D364_RS24590  4  45  -11.272867  restriction endonuclease
D364_RS24595  4  52  -13.984406  polynucleotide adenylyltransferase region
D364_RS24600  2  65  -18.685826  patatin-like phospholipase family protein
D364_RS24605  3  67  -19.580650  hypothetical protein
D364_RS24610  4  70  -20.723849  hypothetical protein
D364_RS24615  2  66  -18.785781  IS3 family transposase
D364_RS26610  3  62  -16.529463  hypothetical protein
D364_RS24620  3  47  -12.698248  PAAR domain-containing protein
D364_RS27685  4  46  -9.995140  glycosyltransferase
D364_RS24625  3  44  -8.154795  polysaccharide pyruvyl transferase family
D364_RS24635  2  54  -12.909609  HNH endonuclease
D364_RS26615  3  58  -15.973939  glycoside hydrolase family 43 protein
D364_RS24640  2  55  -15.559476  glycoside-pentoside-hexuronide (GPH):cation
D364_RS26620  2  52  -14.061721  xylose isomerase
D364_RS26820  3  35  -10.898934  carbohydrate porin
D364_RS24670  3  27  -10.476511  DUF523 and DUF1722 domain-containing protein
D364_RS24675  2  12  -5.709069  DUF3833 domain-containing protein
D364_RS26630  2  12  -2.434862  hypothetical protein
D364_RS24690  1  15  -0.819126  DUF2878 domain-containing protein
D364_RS24695  1  17  -0.717543  cyclopropane-fatty-acyl-phospholipid synthase
D364_RS24700  -1  17  0.204474  DUF1365 domain-containing protein
D364_RS24705  -2  20  2.665161  FAD-dependent oxidoreductase
D364_RS24715  -1  24  4.847066  SDR family NAD(P)-dependent oxidoreductase
D364_RS24720  2  23  6.194179  nuclear transport factor 2 family protein
D364_RS24725  2  23  6.717984  MerR family transcriptional regulator
D364_RS24730  3  24  7.506293  lipocalin family protein
D364_RS24735  3  24  7.045818  DUF1294 domain-containing protein
D364_RS24740  1  23  6.357061  CusA/CzcA family heavy metal efflux RND
D364_RS24745  1  25  6.033003  efflux RND transporter periplasmic adaptor
D364_RS24750  1  23  5.963042  cation efflux system protein CusF
D364_RS24755  1  22  4.199656  efflux transporter outer membrane subunit
D364_RS24760  0  21  3.531218  copper response regulator transcription factor
D364_RS24765  -1  21  3.171858  Cu(+)/Ag(+) sensor histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24490  MALTOSEBP  40  9e-06  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 40.5 bits (94), Expect = 9e-06
Identities = 85/307 (27%), Positives = 133/307 (43%), Gaps = 36/307 (11%)

Query: 120 LQKEFWPAMHKNAQVMGTTYAIPFHNSTPILYYNKTMFDRAGIKQPPQTWAELLADAKKL 179
Q + +P + G A P L YNK + + PP+TW E+ A K+L
Sbjct: 111 FQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDL-----LPNPPKTWEEIPALDKEL 165

Query: 180 TDESKGQWGIMLPSTNDDYGGWIFSALVRANGG---KYFNEDYP-GEVYYNSPTAIGALR 235
++KG+ +M + + Y W L+ A+GG KY N Y +V ++ A L
Sbjct: 166 --KAKGKSALMF-NLQEPYFTW---PLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLT 219

Query: 236 FWQDLIYKDKVMPSGVLNSKQISAAFFSGKLGMAMLSTGALGFMRENSKDFELGVAMLPA 295
F DLI K+K M + + AAF G+ M + G + ++ GV +LP
Sbjct: 220 FLVDLI-KNKHMNADT-DYSIAEAAFNKGETAMTI--NGPWAWSNIDTSKVNYGVTVLPT 275

Query: 296 -KEQRAVPIGGASLVSFKGINDA--QKKAAYQFL-TYLVSPEVNGAWSRFTGYFSPRKAS 351
K Q + P G V GIN A K+ A +FL YL++ E A ++ + S
Sbjct: 276 FKGQPSKPFVG---VLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKS 332

Query: 352 YDTPEMKAYLQQDPRAAIALEQLKYAHPWYSTWETVAVRKAMENQLAAVVNDA--KVTPE 409
Y + L +DPR A +E + + + A A+ AV+N A + T +
Sbjct: 333 Y-----EEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVR---TAVINAASGRQTVD 384

Query: 410 AAVQAAQ 416
A++ AQ
Sbjct: 385 EALKDAQ 391


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24495  PF05272  35  6e-04  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 34.7 bits (79), Expect = 6e-04
Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 30 VVLVGPSGCGKSTLLRLLAGLEPVSEGQIWLHD 62
VVL G G GKSTL+ L GL+ S+ +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24560  SACTRNSFRASE  33  3e-04  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 3e-04
Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 4/58 (6%)

Query: 70 EDIAELKRMFSVNKASGTGTALLRYLEGEAKSLGYNEIRLETRKVNTRAVAFYVKHNY 127
EDIA K + G GTALL AK + + LET+ +N A FY KH++
Sbjct: 93 EDIAVAKD----YRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24635  RTXTOXIND  31  0.009  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.0 bits (70), Expect = 0.009
Identities = 11/87 (12%), Positives = 28/87 (32%), Gaps = 3/87 (3%)

Query: 17 PEFRQEALKLAERIGVAAAARELNLYESQLYNWRSKQQNQFSSSEREQEMSAEIARLKRQ 76
PE + + + R +L + Q W+ Q+ Q + ++ AE + +
Sbjct: 166 PELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ-NQKYQKELNLDKKR--AERLTVLAR 222

Query: 77 LAERDEELAILPKGRDILREAPEMKYV 103
+ + + D + +
Sbjct: 223 INRYENLSRVEKSRLDDFSSLLHKQAI 249


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24755  DHBDHDRGNASE  62  2e-13  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 61.6 bits (149), Expect = 2e-13
Identities = 51/233 (21%), Positives = 85/233 (36%), Gaps = 16/233 (6%)

Query: 4 VLITGASSGIGAGLAKSFAADGHLVIACGRDASRLAALQQLSPNINVRL-----FDMTDR 58
ITGA+ GIG +A++ A+ G + A + +L + S R D+ D
Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVS-SLKAEARHAEAFPADVRDS 69

Query: 59 DACRQALTGCFA-----DLIILCAGTCEYLDHGQVDAALVERVMATNFLGPVNCLAALQT 113
A + D+++ AG + E + N G N ++
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129

Query: 114 QLEA--GDRVVLVSSMAHWLPFPRAEAYGASKAALTWFANSLRLDWEPKGVAVTVVSPGF 171
+ +V V S +P AY +SKAA F L L+ + +VSPG
Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189

Query: 172 VDTPLTRKNDFAMPGRVSVDRAVAA-IRHGLAKGKNHIAFPTGFSLALRLLAS 223
+T + G V + + G+ K +A P+ + A+ L S
Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKK--LAKPSDIADAVLFLVS 240


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24770  BCTLIPOCALIN  233  1e-81  Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 233 bits (595), Expect = 1e-81
Identities = 85/151 (56%), Positives = 111/151 (73%), Gaps = 1/151 (0%)

Query: 25 PKGVQPISGFDASRYLGKWYEVARLENRFERGLEQVTATYGARSDGGISVVNRGYDPVKK 84
P+ V+P+S F+ + YLGKWYEVARL++ FERGL QVTA Y R+DGGISV+NRGY K
Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79

Query: 85 RWNESDGKAYFTGAPTTAALKVSFFGPFYGGYNVIRLD-DDYQYALVSGPNRDYLWILSR 143
W E++GKAYF T LKVSFFGPFYG Y V LD ++Y YA VSGPN +YLW+LSR
Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139

Query: 144 TPTIPAAVKQDYLNTARELGFDVDRLVWIRQ 174
TPT+ + ++ ++E GFD +RL++++Q
Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQ 170


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24780  ACRIFLAVINRP  679  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 679 bits (1753), Expect = 0.0
Identities = 223/1059 (21%), Positives = 436/1059 (41%), Gaps = 54/1059 (5%)

Query: 1 MIEWIIRRSVANRFLVMMAALFLSIWGTWTIIHTPVDALPDLSDVQVIVKTRYPGQAPQI 60
M + IRR + A+ L + G I+ PV P ++ V V YPG Q
Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56

Query: 61 VENQVTWPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119
V++ VT + M + + S G + + F+ GTDP A+ +V L
Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 120 KLPAGVSAEMGP-DATGVGWVFEYALVDRSGKHDLAELRSLQDWFLKYELKTIPNVSEVA 178
LP V + + + ++ V + ++ +K L + V +V
Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176

Query: 179 SVGGVVKEYQIVVDPMKLTQYGISLGEVKSALDASNQEAGGSSVELA------EAEYMVR 232
G +I +D L +Y ++ +V + L N + + + +
Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 233 ASGYLQTLDDFKNIVLKTGDNGVPVYLGDVARVQIGPEMRRGIAELNGEGEVAGGVVILR 292
A + ++F + L+ +G V L DVARV++G E IA +NG+ AG + L
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294

Query: 293 SGKNAREVISAVKAKLASLQSSLPEGVEVVTTYDRSQLIDRAIDNLSYKLLEEFIVVALV 352
+G NA + A+KAKLA LQ P+G++V+ YD + + +I + L E ++V LV
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 CALFLWHVRSALVAIISLPLGLCFAFIMMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412
LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 NAHKRLEEWEHQHPGEKLSNDTRWKIITEASVEVGPALFISLLIITLSFIPIFTLEGQGG 472
N + + E + P + ++ ++ AL ++++ FIP+ G G
Sbjct: 415 NVERVMME-DKLPP---------KEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 473 KLFGPLAFTKTWSMAGAALLAIVAIPILMGFWIRGRIPAESSNPLNRF----------LI 522
++ + T +MA + L+A++ P L ++ AE F +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPV-SAEHHENKGGFFGWFNTTFDHSV 523

Query: 523 RIYHPLLLKVLHWPKTTLLIALLSILTVAWPLNRVGGEFLPQINEGDLLYMPSTLPGISA 582
Y + K+L LLI L + + R+ FLP+ ++G L M G +
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583

Query: 583 AQAADMLQKTDKLIMT--VPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639
+ +L + + V VF G + + + LKP ++
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640

Query: 640 MTMEKIVEELDKTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTNLADIDAIAGQ 699
+ E ++ + + + +++ + I +G + Q
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 700 IEVVARSVPG-VTSALAERLVGGRYLNIDIHREKAARYGMTVGDVQLFVSSAIGGAMVGE 758
+ +A P + S L +++ +EKA G+++ D+ +S+A+GG V +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 759 TVEGVERYPINIRYPQSYRDSPETLRQLPILTPLKQQIVLADVAEVKVVTGPSMLKTENA 818
++ + ++ +R PE + +L + + + + + V G L+ N
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 819 RPTSWIYIDARDRDMVSVVHDLQQAIGKEVKLKPGISVSYSGQFELLERAIQKLKLMVPM 878
P+ I +A L + + KL GI ++G + + +V +
Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 879 TLMIIFVLLYLAFRRVGEALLIITSVPFALVGGIWFLYWMGFHLSVATGTGFIALAGVAA 938
+ +++F+ L + + ++ VP +VG + V G + G++A
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 939 EFGVVMLMYLRHAIEAEPSLENPQTFSVDKLDEALYRGAVLRVRPKAMTVAVIIAGLLPI 998
+ ++++ + + +E E + EA +R+RP MT I G+LP+
Sbjct: 939 KNAILIVEFAKDLMEKEGK----------GVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037
GAGS + + ++GGM++A LL++F +P + +
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24795  GPOSANCHOR  31  0.012  Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.8 bits (69), Expect = 0.012
Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 9/166 (5%)

Query: 140 RLKNLSEADRQNFFASEEARRAVHILLIANVSQSYFNQRLAAAQLQVANDTLQNYQQSYA 199
A + A + A A L + + +A+++ + A
Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263

Query: 200 FVEKQLLTGSTTVLALEQARGMIESTRADIAKRQGQLAQANNALQLLLGSYQHLPDDSAS 259
+EK L + I++ A+ A + + A + Q+L + Q L D +
Sbjct: 264 ELEKAL---EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320

Query: 260 SAVDLQGVTLPPSLSSAILLQRPDILEAEHSLQAANANIGAARAAF 305
S + + L ++ I EA S Q+ ++ A+R A
Sbjct: 321 SREAKKQL----EAEHQKLEEQNKISEA--SRQSLRRDLDASREAK 360


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
D364_RS24805  HTHFIS  83  2e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 35/117 (29%), Positives = 61/117 (52%)

Query: 2 KILIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTSDYDLLILDIMLPDVNGWD 61
IL+ +D+ L + L+ AG+ V + N + D DL++ D+++PD N +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IVRMLRTAGKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118
++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
D364_RS24810  BORPETOXINA  29     0.027    Bordetella pertussis toxin A subunit signature.
		>BORPETOXINA#Bordetella pertussis toxin A subunit signature.

Length = 269

Score = 29.4 bits (65), Expect = 0.027
Identities = 14/56 (25%), Positives = 27/56 (48%)

Query: 358 RFVGSPCRVTGDPLMLRRAISNLLSNAIRYTPAGQAVTIQLSESAETVRLVVENPG 413
R+V R +P RR++++++ +R P A + +ES+E + E G
Sbjct: 199 RYVSQQTRANPNPYTSRRSVASIVGTLVRMAPVIGACMARQAESSEAMAAWSERAG 254


81  D364_RS24875  D364_RS24905  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS24875  2       12       -0.745124  PLP-dependent aminotransferase family protein
D364_RS24880  2       14       0.018259   DUF1127 domain-containing protein
D364_RS24885  3       15       1.169620   MarR family transcriptional regulator
D364_RS24890  3       16       2.328849   HlyD family secretion protein
D364_RS24900  1       18       3.081986   DUF2955 domain-containing protein
D364_RS24905  1       16       3.005155   DUF3343 domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS24935  RTXTOXIND  66     4e-14    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 66.0 bits (161), Expect = 4e-14
Identities = 35/221 (15%), Positives = 73/221 (33%), Gaps = 30/221 (13%)

Query: 1 MMTPEQKFARWVRVSIAAFLGI-FAWFIVADIWIPLTPDSTVMRVVTP------VSSRVS 53
+ TP + R V I FL I F ++ + +T +T + +
Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLG----QVEIVATANGKLTHSGRSKEIKPIEN 104

Query: 54 GYVSHVYVHNNSQVKKGDLLYELDPTPFINKVEAAQIALEQAKLSNQQLDAQIAAARAN- 112
V + V V+KGD+L +L Q +L QA+L + + N
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164

Query: 113 -------------LRTAQYTARNDKVTLDRYQRLSTMQNVSQSDLDKVRTTWQTSEQSVS 159
+ + R + +++ + + +LDK R T ++
Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224

Query: 160 ALNAQIQNLLIQRGERDDKRNVTLQKY--RNALEEAQLNLA 198
+ + DD ++ ++ ++A+ E +
Sbjct: 225 RYENLSRVE---KSRLDDFSSLLHKQAIAKHAVLEQENKYV 262



Score = 49.1 bits (117), Expect = 1e-08
Identities = 30/204 (14%), Positives = 71/204 (34%), Gaps = 21/204 (10%)

Query: 86 EAAQIALEQAKLSNQQLDAQIAAARANLRTAQYTARNDKVTLDRYQRLSTMQNVSQSDLD 145
Q Q +L+ + A+ A + + +R +K LD + L Q +++ +
Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL 255

Query: 146 KVRTTWQTSEQSVSALNAQIQNL--------LIQRGERDDKRNVTLQKYRNA-------- 189
+ + + + +Q++ + + +N L K R
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315

Query: 190 --LEEAQLNLAWTKVRAETDGMVSNLQLN-PGIYATAATAVLALVNNNTDIVAD--FREK 244
L + + + +RA V L+++ G T A ++ +V + + + K
Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNK 375

Query: 245 SLRHTAVNTDAAVVFDALPGQVFP 268
+ V +A + +A P +
Sbjct: 376 DIGFINVGQNAIIKVEAFPYTRYG 399


82  D364_RS25040  D364_RS25110  Y  N  N  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias   Product
D364_RS25040  2       22       1.860552  5-carboxymethyl-2-hydroxymuconate semialdehyde
D364_RS25045  1       20       0.565693  fumarylacetoacetate hydrolase family protein
D364_RS25055  1       15       3.203539  fumarylacetoacetate hydrolase family protein
D364_RS25060  1       16       3.517722  hypothetical protein
D364_RS25065  2       16       5.631989  homoprotocatechuate degradation operon regulator
D364_RS25075  1       18       6.222241  FAD-dependent oxidoreductase
D364_RS26640  1       16       5.899918  (2Fe-2S)-binding protein
D364_RS25095  0       14       5.488706  FAD-binding oxidoreductase
D364_RS25105  1       13       3.514258  4-hydroxyproline epimerase
D364_RS25110  1       14       3.206077  AraC family transcriptional regulator
83  D364_RS25185  D364_RS25295  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS25185  -1      17       -3.464935  sigma-54-dependent transcriptional regulator
D364_RS25190  1       15       -4.782409  PTS sugar transporter subunit IIA
D364_RS25195  -1      15       -3.514196  PTS sugar transporter subunit IIB
D364_RS25200  -2      10       -2.693740  PTS
D364_RS25205  -2      10       -2.924702  PTS system mannose/fructose/sorbose family
D364_RS25210  -2      11       -3.112531  SIS domain-containing protein
D364_RS25215  -2      11       -2.867075  SIS domain-containing protein
D364_RS25220  -1      10       -2.439321  Rpn family recombination-promoting
D364_RS25225  -1      11       -2.331799  phosphatidylglycerol--membrane-oligosaccharide
D364_RS25230  1       13       -0.708429  DUF2501 domain-containing protein
D364_RS25235  0       14       -1.091787  DNA replication protein DnaC
D364_RS25240  -1      19       -1.535825  primosomal protein DnaT
D364_RS25245  -1      22       -3.408770  threonine/serine exporter
D364_RS25250  -1      23       -3.508514  threonine/serine exporter ThrE family protein
D364_RS25260  -2      22       -3.728575  4-hydroxybenzoate 3-monooxygenase
D364_RS25265  -2      24       -4.485025  helix-turn-helix domain-containing protein
D364_RS25270  -2      23       -5.450797  response regulator transcription factor
D364_RS25275  0       24       -6.062272  DNA-binding transcriptional activator BglJ
D364_RS25280  1       22       -5.235648  YbaK/EbsC family protein
D364_RS25285  0       14       -3.469575  PTS cellobiose transporter subunit IIC
D364_RS25290  1       19       -2.745434  siderophore-iron reductase FhuF
D364_RS25295  0       20       -3.290208  GGDEF domain-containing protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS25185  HTHFIS  159    9e-44    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 159 bits (404), Expect = 9e-44
Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 27/301 (8%)

Query: 86 MAELLAESDRQPEQADHFSLLTGHDGSLRKPIEQMKTALFYPNCGLPLLITGDSGTGKSY 145
+AE + + + L G ++++ + + L L+ITG+SGTGK
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM---QTDLTLMITGESGTGKEL 175

Query: 146 MAELMHEFAIAQGLLAPDAPFVSFNCAQYASNPELLAANLFGYVKGAFTGAQSDKAGAFE 205
+A +H++ + + PFV+ N A A +L+ + LFG+ KGAFTGAQ+ G FE
Sbjct: 176 VARALHDYGKRR-----NGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228

Query: 206 AANGGMLFLDEVHRLDAQGQEKLFTWLDRKEIYRVGETAQGLPISLRLVFATTEDIHS-- 263
A GG LFLDE+ + Q +L L + E VG + +R+V AT +D+
Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGR-TPIRSDVRIVAATNKDLKQSI 287

Query: 264 ---TFLTTFLRRIPIL-VSLPDLQHRSREEKEALTLQFFWQEARTLAAR-LQLTPRLLQV 318
F R+ ++ + LP L R R E ++ F Q+A + L++
Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPL--RDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALEL 345

Query: 319 LTQYVYRGNVGELKNVVKYAVASAWARSPGREMLTVTLHDLPENVMAATPALSEAMGQQE 378
+ + + GNV EL+N+V+ A P +T + + + P
Sbjct: 346 MKAHPWPGNVRELENLVRRLT----ALYPQD---VITREIIENELRSEIPDSPIEKAAAR 398

Query: 379 P 379

Sbjct: 399 S 399


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS25220  FLGFLIH  35     2e-04    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 35.2 bits (80), Expect = 2e-04
Identities = 15/38 (39%), Positives = 23/38 (60%)

Query: 239 PQYEETLMSIAQKLKQEGRQQGRLEGREEGHLEGLQEG 276
P E+ L + + ++G Q G EGR++GH +G QEG
Sbjct: 38 PSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75



Score = 29.0 bits (64), Expect = 0.023
Identities = 12/22 (54%), Positives = 17/22 (77%)

Query: 255 EGRQQGRLEGREEGHLEGLQEG 276
EGRQQG +G +EG +GL++G
Sbjct: 62 EGRQQGHKQGYQEGLAQGLEQG 83


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS25290  2FE2SRDCTASE  368    e-132    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 368 bits (947), Expect = e-132
Identities = 169/262 (64%), Positives = 209/262 (79%)

Query: 1 MAWRSLPLSDELIWRAPLPTAEHALAESIREKIATLRPHLLDFLRLDEPAPRHALTLAEW 60
MA+RS PL +++IWR L + LA+++R IA R HLL+F+RLDEPAP +A+TLA+W
Sbjct: 1 MAYRSAPLYEDVIWRTHLQPQDPTLAQAVRATIAKHREHLLEFIRLDEPAPLNAMTLAQW 60

Query: 61 SQPIALRSLLATWSDHIYRHQPTLPREQKPLLSLWAQWYIGLLVPPLMLALLNEPQGLSL 120
S P L SLLA +SDHIYR+QP + RE KPL+SLWAQWYIGL+VPPLMLALL + + L +
Sbjct: 61 SSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDV 120

Query: 121 APEHFHVEFHESGRAACFWIDVHSDADIERLSPQARMDALVTRTLQPVVEALAATGEINS 180
+PEHFH EFHE+GR ACFW+DV D + SPQ RM+ L+++ L PVV+AL ATGEIN
Sbjct: 121 SPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQHRMETLISQALVPVVQALEATGEING 180

Query: 181 KLIWSNTGYLINWYLGEMRALLGDERLAALRQHCFFKKQLADGQDNPLWRTVMLREGQLV 240
KLIWSNTGYLINWYL EM+ LLG+ + +LR FF+K L +G+DNPLWRTV+LR+G LV
Sbjct: 181 KLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240

Query: 241 RRTCCQRYRLPDVQQCGDCTLR 262
RRTCCQRYRLPDVQQCGDCTL+
Sbjct: 241 RRTCCQRYRLPDVQQCGDCTLK 262


84  D364_RS00290  D364_RS00345  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS00290  4       12       2.930785   DedA family protein
D364_RS00295  4       12       2.764140   thiamine ABC transporter ATP-binding protein
D364_RS00300  1       9        0.691351   thiamine/thiamine pyrophosphate ABC transporter
D364_RS00305  0       11       -2.252688  thiamine ABC transporter substrate binding
D364_RS00310  0       12       -2.387563  HTH-type transcriptional regulator SgrR
D364_RS00315  -1      12       -4.384139  glucose uptake inhibitor SgrT
D364_RS00325  -1      10       -3.902600  MFS transporter
D364_RS00330  0       14       -4.668657  MFS transporter
D364_RS00335  0       11       -4.123491  LysR family transcriptional regulator
D364_RS00340  -1      12       -2.169589  type I 3-dehydroquinate dehydratase
D364_RS00345  -1      12       -1.841408  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00290  PHPHTRNFRASE  28     0.048    Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.

Length = 572

Score = 27.8 bits (62), Expect = 0.048
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 95 RALLEKTEHALHQHSMITILIGRFVGPTRPLVPMVAGMLDLPVAKFVLPNIIGCLLWPPL 154
R LEK + Q + +L G + + PM+A + +L AK ++ LL +
Sbjct: 362 RLCLEKQDIFRTQ--LRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGV 419

Query: 155 YFLPGILAGAAIDIPA 170
I G ++IP+
Sbjct: 420 DVSDSIEVGIMVEIPS 435


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
D364_RS00310  NEISSPPORIN  37     2e-04    Neisseria sp. porin signature.
		>NEISSPPORIN#Neisseria sp. porin signature.

Length = 348

Score = 36.5 bits (84), Expect = 2e-04
Identities = 17/81 (20%), Positives = 30/81 (37%), Gaps = 2/81 (2%)

Query: 367 YHAGEHYQ-GNWFPAYGLLPRWHHASNHACEKPAGLETVTLTYYRDHVEHRVIGGIMRDL 425
YH G +YQ +F Y L + + E ++ + HR++GG +
Sbjct: 180 YHVGLNYQNSGFFAQYAGLFQRYGEGTKKIEYDDQTYSIPSLFVEKLQVHRLVGGYDNNA 239

Query: 426 LAAHQVKLEIQELEYDAWHRG 446
L V + Q+ + G
Sbjct: 240 LYV-SVAAQQQDAKLYGAMSG 259


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS00325  TCRTETA  33     0.002    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.9 bits (75), Expect = 0.002
Identities = 41/187 (21%), Positives = 67/187 (35%), Gaps = 17/187 (9%)

Query: 16 AAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWVGLFYTVNAIAGILVSLWLAKRSDS 75
AA M V F+M + G + A +F +G+ I L + +
Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272

Query: 76 RGDRRRLIMFCCLMAVGNALLFAFNRHYLTLITCGVMLASIANAAMPQLFALAREYADSS 135
R RR +M + +L AF V+LAS MP L A+ D
Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG-IGMPALQAMLSRQVDEE 331

Query: 136 AREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTTMFSIAAG-----IFVISLALIAI 190
+ + S SL ++GP L FT +++ + ++ AL +
Sbjct: 332 RQGQLQGSLAALT--SLTSIVGPLL---------FTAIYAASITTWNGWAWIAGAALYLL 380

Query: 191 KLPSVPR 197
LP++ R
Sbjct: 381 CLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS00330  TCRTETA  61     3e-12    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 61.0 bits (148), Expect = 3e-12
Identities = 71/415 (17%), Positives = 132/415 (31%), Gaps = 46/415 (11%)

Query: 30 LSVGTMINYLDRTILGI---VAPQLSKEIHID---PAMMGIIFSAFAWTYALAQIPGGMF 83
L V LD +G+ V P L +++ A GI+ + +A G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 84 LDRFGNKVTYALSIFFWSLFTLLQSFTLGLKSLLLLRLGLGVSEAPCFPANSRIVSTWFP 143
DRFG + +S+ ++ + + L L + R+ G++ A ++
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGAT-GAVAGAYIADITD 125

Query: 144 QHERARA----TATYTVGEYIGLAAFSPLLFLILEHHGWRTLFFLTGGLGILFTLVWWRF 199
ERAR +A + G G P+L ++ FF L L L
Sbjct: 126 GDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 200 YHEPHESRTANQAELEYIGANGINNKIQNVPFNWRDARRLLGCRQILGASLGQFAGNTTL 259
E H+ +N F W ++ + +
Sbjct: 181 LPESHKGERRPLRREA------LNPLAS---FRWARGMTVVAALMAVFFIMQLVGQ---- 227

Query: 260 VFFLTWFPSYLANERHLPWLHVGFFATWPFLAAAIGILFGGWISDRLLKRTGSVNISRKL 319
+ + + H +G + L I+ + R G
Sbjct: 228 -VPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQAMITGPVAARLGERRA---- 279

Query: 320 PIISGLLLSSC--IIAANWVSANSTVIIIMSVAFFGQGMVGLGWTLISDIAPENMAGLTG 377
++ G++ I+ A I++ +A G GM L ++S E G
Sbjct: 280 -LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEERQGQLQ 337

Query: 378 GIFNFCANMASIIAPLIIGVIISATGNFFYALIYVGLTALIGVIAYIFIIGDIKR 432
G ++ SI+ PL+ I +A+ + G + G Y+ + ++R
Sbjct: 338 GSLAALTSLTSIVGPLLFTAIYAAS-----ITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS00345  TCRTETA  40     2e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 2e-05
Identities = 57/343 (16%), Positives = 113/343 (32%), Gaps = 8/343 (2%)

Query: 60 VTGFLSDRFGRKPFIYLGILSYLIFFVGILLTKNIYLAYVFGIMAGLANSFLDSGTYPAL 119
V G LSDRFGR+P + + + + + + +++ Y+ I+AG+ +
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 120 MESFPHSASRANVLIKAFVSAGQFLLPFIISFLIWANLWFGWSFVIAAALFVLSGIYLLK 179
+ +R + A G P + + F AAAL L+ +
Sbjct: 122 DITDGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 180 MPFPDSQAAKKKEAPTAQAETAVRPQANK-LDMVIFTLYGYIGMATFYLVSQWL-AQYGQ 237
+ P+S +++ + + + +V + + M V L +G+
Sbjct: 180 L-LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 238 FVVGL-PYASAIKLLSIYTVGSLVCVFVTAAFVKEVFSSAIAMIIYTGLSMISLLLVCLF 296
I L + + SL +T + M+ +LL
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 297 PTPMMVTGFAFVIGFAAAGGVLQLGATIMAMSFPNGKGKATGIFYTAGSIASFTIPLITA 356
M + LQ A + +G+ G S+ S PL+
Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQ--AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356

Query: 357 KLSQISIASIMWFDFLIAVIGFVIALYIGYRQLQARAAQKVSR 399
+ SI + + ++ +++ L R L + A Q+ R
Sbjct: 357 AIYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRADR 399


85  D364_RS00745  D364_RS00805  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS00745  6       21       7.954679   A24 family peptidase
D364_RS00750  6       22       8.033545   type II secretion system protein N
D364_RS00755  5       23       7.850772   type II secretion system protein M
D364_RS00760  3       23       7.067034   type II secretion system protein GspL
D364_RS00765  3       24       6.851746   type II secretion system minor pseudopilin GspK
D364_RS00770  4       23       6.961594   type II secretion system minor pseudopilin GspJ
D364_RS00775  0       20       4.677759   type II secretion system minor pseudopilin GspI
D364_RS00780  -2      18       3.334144   type II secretion system minor pseudopilin GspH
D364_RS00785  -2      18       2.314383   type II secretion system major pseudopilin GspG
D364_RS00790  -1      15       2.122476   type II secretion system inner membrane protein
D364_RS00795  -1      14       1.441440   type II secretion system ATPase GspE
D364_RS00800  -1      13       -0.055300  GspD family T2SS secretin variant PulD
D364_RS00805  0       14       -0.585212  type II secretion system protein GspC
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00745  PREPILNPTASE  271    2e-93    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 271 bits (694), Expect = 2e-93
Identities = 138/277 (49%), Positives = 168/277 (60%), Gaps = 15/277 (5%)

Query: 1 MTTLAALSLHFPFVWYGFLLLFGLALGSFYNVVIYRLPRML---------------TQTA 45
M L L+ P++++ + LF L +GSF NVVI+RLP ML +
Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGV 60

Query: 46 DDERITLSTPGSSCPQCRQPISWRDNIPLLSFLWLGRRARCCQAPIAWSYPLTELATGLL 105
D+ L P S CP C PI+ +NIPLLS+LWL R R CQAPI+ YPL EL T LL
Sbjct: 61 DEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALL 120

Query: 106 FILAGALLAPGLPLAGGLVLLSFLLILARIDARTQLLPDRLTLPLLWAGLLFNLNEVYIA 165
+ LAPG L+L L+ L ID LLPD+LTLPLLW GLLFNL +++
Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVS 180

Query: 166 LPDAVAGAMAGYLALWSVYWLFRLLTGKEALGYGDFKLLAALGAWCGWQVLPQVLLLASA 225
L DAV GAMAGYL LWS+YW F+LLTGKE +GYGDFKLLAALGAW GWQ LP VLLL+S
Sbjct: 181 LGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSL 240

Query: 226 SGLVWTLLQRLWTRQSLQQPLAFGPWLALAGGGIFLW 262
G + L +P+ FGP+LA+AG LW
Sbjct: 241 VGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00770  BCTERIALGSPG  33     3e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 33.3 bits (76), Expect = 3e-04
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 5/63 (7%)

Query: 4 KMRGFTLIETLLALAILAVLSAAAV-MVLQNVIRADGLTREKSQ-QIAALQRAFRQIADD 61
K RGFTL+E ++ + I+ VL++ V ++ N +AD ++K+ I AL+ A D
Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD---KQKAVSDIVALENALDMYKLD 62

Query: 62 VTH 64
H
Sbjct: 63 NHH 65


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00775  BCTERIALGSPG  32     2e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 32.2 bits (73), Expect = 2e-04
Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 8/99 (8%)

Query: 1 MKREAGMTLIEVMVALVIF-ALAGLAV---MQSTLQQTRQLGRMEEKILASWLADNQLVQ 56
++ G TL+E+MV +VI LA L V M + + +Q + L + L +L
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 57 LRLEKRWPALS--WSETTVEAAGTRWFVRWQGVETALPQ 93
L T+ + +G LP
Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANY--NKEGYIKRLPA 100


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00780  BCTERIALGSPH  177    1e-59    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 177 bits (450), Expect = 1e-59
Identities = 98/164 (59%), Positives = 125/164 (76%)

Query: 1 MSQRGFTLLEMMLVLLLIGVSASMVLLAFPSARTQEATQILARFQTQLDFVRERGQQTGQ 60
M QRGFTLLEMML+LLL+GVSA MVLLAFP++R A Q LARF+ QL FV++RG QTGQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 LFGIIIHPERWQFMRLQPADDSAPAAADDRWGNAQWLPLQAGRVTTAETLPRARLTLRFP 120
FG+ +HP+RWQF+ L+ D + PA ADD W +WLPL+AGRV T+ ++ +L L F
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAFA 120

Query: 121 DGQAWTPGEQPDVLIFPGGEVTPFQLRIDAATGINVDAQGDSQP 164
G+AWTPG+ PDVLIFPGGE+TPF+L + A GI +A+G+S P
Sbjct: 121 QGEAWTPGDNPDVLIFPGGEMTPFRLTLGEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00785  BCTERIALGSPG  243    2e-86    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 243 bits (621), Expect = 2e-86
Identities = 98/140 (70%), Positives = 112/140 (80%)

Query: 1 MQRQRGFTLLEIMVVIVILGILASLVVPNLMGNKEKADRQKVVSDLVALEGALDMYKLDN 60
+QRGFTLLEIMVVIVI+G+LASLVVPNLMGNKEKAD+QK VSD+VALE ALDMYKLDN
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63

Query: 61 SRYPNTEQGLQALVTAPAAEPHARNYPEGGYIRRLPQDPWGNEYQLLSPGQHGAIDVFSV 120
YP T QGL++LV AP P A NY + GYI+RLP DPWGN+Y L++PG+HGA D+ S
Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSA 123

Query: 121 GPDGMPDTNDDIGNWTLGKK 140
GPDG T DDI NW L KK
Sbjct: 124 GPDGEMGTEDDITNWGLSKK 143


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00790  BCTERIALGSPF  512    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 512 bits (1321), Expect = 0.0
Identities = 277/407 (68%), Positives = 335/407 (82%), Gaps = 4/407 (0%)

Query: 1 MALFRYQALDAQGKTRRGLQQADSARHARQLLRDKGWLALEVTTADPARRLWAGGSLT-- 58
MA + YQALDAQGK RG Q+ADSAR ARQLLR++G + L V ++ L+
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 59 --RRTSAGDLALLTRQLATLVAAGIPLEKALDAVAQQCEKPSLRTLMAGVRSKVLEGHSL 116
R S DLALLTRQLATLVAA +PLE+ALDAVA+Q EKP L LMA VRSKV+EGHSL
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 117 AEAMRGYPACFDGLFCAMVAAGETSGHLDGVLNRLANYTEQRQQLRARLLQAMIYPIVLT 176
A+AM+ +P F+ L+CAMVAAGETSGHLD VLNRLA+YTEQRQQ+R+R+ QAMIYP VLT
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 177 LVAISVIAILLSTVVPKVVEQFVHLKQALPFSTRLLMSLSDIVRSAGPWLALLSLLALLA 236
+VAI+V++ILLS VVPKVVEQF+H+KQALP STR+LM +SD VR+ GPW+ L L +A
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 237 LRYLLRQPARRLAWDRMLLRLPVIGRVARSVNSARYARTLSILNASAVPLLLSMRISADV 296
R +LRQ RR+++ R LL LP+IGR+AR +N+ARYARTLSILNASAVPLL +MRIS DV
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 297 LSNAWARSQLAAASESVREGVSLHRALESTALFPPMMRYMIASGEQSGELTAMLERAAEN 356
+SN +AR +L+ A+++VREGVSLH+ALE TALFPPMMR+MIASGE+SGEL +MLERAA+N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 357 QDRELSAQIQMALSLFEPLLVVTMAGMVLFIVLAILQPILQLNTLMS 403
QDRE S+Q+ +AL LFEPLLVV+MA +VLFIVLAILQPILQLNTLMS
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00800  BCTERIALGSPD  839    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 839 bits (2169), Expect = 0.0
Identities = 606/646 (93%), Positives = 631/646 (97%)

Query: 10 ALLILTPLLFSPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 69
LLI LLF PAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML
Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72

Query: 70 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRAKDAKTSAVPVASAAAPGEGDEVVTRVV 129
NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVR+KDAKT+AVPVAS AAPG GDEVVTRVV
Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVV 132

Query: 130 PLTNVAARDLAPLLRQLNDNAGAGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 189
PLTNVAARDLAPLLRQLNDNAG GSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR
Sbjct: 133 PLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192

Query: 190 SVVTVPLSWASAAEVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 249
SVVTVPLSWASAA+VVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI
Sbjct: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 252

Query: 250 IAMIKQLDRQQAVQGNTKVIYLKYAKAADLVEVLTGISSSLQSDKQSARPVAAIDKNIII 309
IAMIKQLDRQQA QGNTKVIYLKYAKA+DLVEVLTGISS++QS+KQ+A+PVAA+DKNIII
Sbjct: 253 IAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIII 312

Query: 310 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 369
KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN
Sbjct: 313 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 372

Query: 370 AGMTQFTNSGLPISTAIAGANQYNKDGTISSSLASALGSFNGIAAGFYQGNWAMLLTALS 429
AGMTQFTNSGLPISTAIAGANQYNKDGT+SSSLASAL SFNGIAAGFYQGNWAMLLTALS
Sbjct: 373 AGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALS 432

Query: 430 SSTKNDILATPSIVTLDNMQATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 489
SSTKNDILATPSIVTLDNM+ATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP
Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 492

Query: 490 QINEGDAVLLEIEQEVSSVADSASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKT 549
QINEGD+VLLEIEQEVSSVAD+ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDK+
Sbjct: 493 QINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552

Query: 550 VTDTADKVPLLGDIPVIGALFRSDSKKVSKRNLMLFIRPTIIRDRDEYRQASSGQYTAFN 609
V+DTADKVPLLGDIPVIGALFRS SKKVSKRNLMLFIRPT+IRDRDEYRQASSGQYTAFN
Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612

Query: 610 NAQTKQRGKESSEASLSNDLLHIYPQQETQAFRQVSAAIDAFNLGG 655
+AQ+KQRGKE+++A L+ DLL IYP+Q+T AFRQVSAAIDAFNLGG
Sbjct: 613 DAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFRQVSAAIDAFNLGG 658


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS00805  BCTERIALGSPC  213    7e-71    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 213 bits (544), Expect = 7e-71
Identities = 98/266 (36%), Positives = 159/266 (59%), Gaps = 7/266 (2%)

Query: 17 KLLPQIVTLIILITAIPQLAKLTWRVVFPVSPEDISALPLTMPPAADPELKNVRPAFTLF 76
++ +I+ ++++ QLA + WR+ P ++ + + PA + FTLF
Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLP---DNAPVSSVQITPAQARQQPVTLNDFTLF 68

Query: 77 GLAVKISPTPT-DAASLNQVPVSSLKLRLAGLLASSNPARSIAIIEKGNQQVSLSTGDPL 135
G++ + + DA+ ++ +P S+L L L G++A + +RSIAII K N+Q S + +
Sbjct: 69 GVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEV 128

Query: 136 PGYDARIAAILPDRIIVNYQGRKEAILLFNDSRAPSPPPTAAGNPPLVKRLREQPQNILT 195
PGY+A+I +I PDR+++ YQGR E + L++ + S G + + +
Sbjct: 129 PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVP--GAQVNEQLQQRASTTMSD 186

Query: 196 YLSISPVLSGDKLLGYRLNPGKDASLFRQSGLQANDLAIALNGIDLRDQEQAQQALQNLA 255
Y+S SP+++ +KL GYRLNPG + F + GLQ ND+A+ALNG+DLRD EQA++A++ +A
Sbjct: 187 YVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMA 246

Query: 256 DMTEITLTVEREGQRHDIAFAL-GDE 280
D+ TLTVER+GQR DI GDE
Sbjct: 247 DVHNFTLTVERDGQRQDIYMEFGGDE 272


86  D364_RS01755  D364_RS01810  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS01755  2       11       2.736936   fructokinase
D364_RS01760  1       12       2.221339   exonuclease subunit SbcC
D364_RS01765  -2      14       1.568321   exonuclease subunit SbcD
D364_RS01770  -2      11       1.203021   phosphate response regulator transcription
D364_RS01775  -2      13       1.322824   phosphate regulon sensor histidine kinase PhoR
D364_RS01780  -1      12       1.505171   branched-chain amino acid transporter carrier
D364_RS01785  -1      14       1.852450   proline-specific permease ProY
D364_RS01790  -1      14       1.775089   maltodextrin glucosidase
D364_RS01795  -1      15       -0.560443  LysR family transcriptional regulator
D364_RS01800  0       13       -1.180663  hydrolase
D364_RS01805  0       13       -1.569151  antibiotic biosynthesis monooxygenase
D364_RS01810  1       15       -2.336804  amidohydrolase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01755  ACETATEKNASE  34     8e-04    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 33.6 bits (77), Expect = 8e-04
Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 216 EGDEKAELALSRYEQRLAKSLAHVVNILDP-DVIVLGGGM 254
GD++A+LAL+ + R+ K++ + DVIV G+
Sbjct: 293 NGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGI 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS01760  RTXTOXIND  37     4e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.7 bits (85), Expect = 4e-04
Identities = 26/207 (12%), Positives = 54/207 (26%), Gaps = 23/207 (11%)

Query: 196 ARHALEKFEAQAAGIVLLTEAQQQALQESLQVLTDEEKALLAQQQSQQQQLQWLTRRDEL 255
A K ++ L + + Q L S+++ E L + Q + + R L
Sbjct: 132 AEADTLKTQSSLL-QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190

Query: 256 AQQQQQAATRQQ-QARQALADAAPALAKLE------------LAQPAAQLRPLWERQQEQ 302
++Q Q+ Q L + L +Q
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 303 TAGLTQTRQRISEVNARLLASTALRARIRQGALRAQQQRQAELADLAQWLAAHERFRLWG 362
+ + + E L + +I L A+++ Q L
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ---------LVTQLFKNEIL 301

Query: 363 QEIAGWRAQFSQLTRDKQQLTAQSTRL 389
++ LT + + +
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQAS 328



Score = 36.0 bits (83), Expect = 7e-04
Identities = 26/205 (12%), Positives = 64/205 (31%), Gaps = 23/205 (11%)

Query: 307 TQTRQRISEVNARLLASTALRARIRQGALRAQQQRQAELADLAQWLAAHERFRLWGQEIA 366
+ + LL + + R + + + + EL + + + +
Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTS 189

Query: 367 GWRAQFSQLTRDKQQLTAQSTRLAALRQKLATLPASPLTLSADEVAAAIEQQTQS--RPL 424
+ QFS K Q L R + T+ A ++ E + +E+ L
Sbjct: 190 LIKEQFSTWQNQKYQKELN---LDKKRAERLTVLA---RINRYENLSRVEKSRLDDFSSL 243

Query: 425 -------RQRLISLHEQHQLLRKRLRQNAESVQQAQAEQVKLNATLTLRREQYKDKNQHY 477
+ ++ ++ LR ++Q ++E + L + +K+
Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN----- 298

Query: 478 LDLKALCQREETIKDLESYRDRLEA 502
+ L + +T ++ L
Sbjct: 299 ---EILDKLRQTTDNIGLLTLELAK 320



Score = 31.3 bits (71), Expect = 0.019
Identities = 40/206 (19%), Positives = 80/206 (38%), Gaps = 22/206 (10%)

Query: 675 AQWQAQQTQHDAIQQQIAALRPMLETLPTSDETEVEAESAIPD-------NWREIHEECL 727
A+ +TQ +Q ++ R + L S E E +PD + E+
Sbjct: 132 AEADTLKTQSSLLQARLEQTR--YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTS 189

Query: 728 SLHSQLVAQQQQETQEKARLDQSQAQFTSALAASRFSDREAFLAALLDDETAQRLTQLKQ 787
+ Q Q Q+ Q++ LD+ +A+ + LA + + + RL
Sbjct: 190 LIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE-------KSRLDDFSS 242

Query: 788 TLEQQLQQAAALCEQATRQHEAHLALRPQGVDADVPTLQTQLHALAQRLRDNT-TRQGEI 846
L +Q A+ EQ + EA LR + + +++++ + + + T + EI
Sbjct: 243 LLHKQAIAKHAVLEQENKYVEAVNELRVY--KSQLEQIESEILSAKEEYQLVTQLFKNEI 300

Query: 847 RQQLRQDAESRQQQQALGQQIAEAAQ 872
+LRQ + L ++A+ +
Sbjct: 301 LDKLRQ---TTDNIGLLTLELAKNEE 323



Score = 31.0 bits (70), Expect = 0.027
Identities = 36/299 (12%), Positives = 87/299 (29%), Gaps = 62/299 (20%)

Query: 418 QTQSRPLRQRLISLHEQHQLLRKRLRQNAESVQQAQAEQVKLNATLTLRREQYKDKNQHY 477
+ + S Q +L + R + + S++ + ++KL + ++ +
Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188

Query: 478 LDLKALCQREETIKDLESYRDRLEAGKPCPLCGACEHPAIEQYASLTLTDNQRRRDALEK 537
+K +++++ + L L + R +
Sbjct: 189 SLIKE---------QFSTWQNQKYQKE------------------LNLDKKRAERLTVLA 221

Query: 538 EVAALKEEGLLILGQVKALTQQLQRDTEAAGRLAEEEQALTKAWQETCASLHIARDIAQE 597
+ + + ++ + L + A + E+E +A E + +Q
Sbjct: 222 RINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNE------LRVYKSQL 275

Query: 598 INDWMQEQERYEQQLYQLSQRLMLQSQLNDQQALE--RQAEQQLAATRQGLESALQALAL 655
E+ E ++ + L +QL + L+ RQ + L +
Sbjct: 276 --------EQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQA 327

Query: 656 SL---PAEGTEAAWLHARESEFAQWQAQQTQHDAIQQQIAALRPMLETLPTSDETEVEA 711
S+ P QQ + + ++ +P D EV A
Sbjct: 328 SVIRAPVSVK----------------VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS01770  HTHFIS  96     5e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 95.7 bits (238), Expect = 5e-25
Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)

Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGLQ 63
ILV +D+A IR ++ L + G+ + + + DL++ D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIKLLKREAMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123
+ +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I +
Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 124 SPMAVEEVIEMQGLSLDPSSHRVMTGDSP 152
E L D + G S
Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS01775  PF06580  31     0.006    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.006
Identities = 16/98 (16%), Positives = 30/98 (30%), Gaps = 25/98 (25%)

Query: 325 LVYNAVNH----TPPGTEIRVSWQRTPQGALFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380
LV N + H P G +I + + VE+ G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 381 SRQTGGSGLGLAIVKHAVNH---HDSRLEIDSTVGKGT 415
+G GL V+ + ++++++ GK
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS01785  TCRTETA  29     0.030    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.4 bits (66), Expect = 0.030
Identities = 17/75 (22%), Positives = 29/75 (38%)

Query: 357 FLVIASLATFATVWVWIMILLSQIAFRRRLSPEEVKALKFKVPGGVVTTVIGLLFLAFII 416
F A+L + ++ S RR L E + L +T V L+ + FI+
Sbjct: 163 FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIM 222

Query: 417 ALIGYHPDTRISLYV 431
L+G P ++
Sbjct: 223 QLVGQVPAALWVIFG 237


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01800  ISCHRISMTASE  36     6e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 36.1 bits (83), Expect = 6e-05
Identities = 35/186 (18%), Positives = 67/186 (36%), Gaps = 21/186 (11%)

Query: 7 LDPTNSALIFIDHQPQM--SFGVANIDRQTLKNNTVALAKAGKIFNVPVIYT------SV 58
DP + L+ D Q +F L N L +PV+YT +
Sbjct: 26 PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85

Query: 59 ETKSFSGYIW-PELLAVHPDVKPIERTS-------MNSWEDDAF-----VAAVKATGRKK 105
+ ++ W P L + + K I + + W AF + ++ GR +
Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQ 145

Query: 106 LVISALWTEVCLTFPALMALEAGYEVYVVTDTSGGTSVDAHERSIDRMVQAGAVPVTWQQ 165
L+I+ ++ + A A + + V D S++ H+ +++ A V
Sbjct: 146 LIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDS 205

Query: 166 VLLEYQ 171
+L + Q
Sbjct: 206 LLDQLQ 211


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS01810  UREASE  30     0.033    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 30.1 bits (68), Expect = 0.033
Identities = 22/76 (28%), Positives = 31/76 (40%), Gaps = 14/76 (18%)

Query: 4 TATLILTHGQIHTLDRANPLAEAVAIADGKIVATGS------HDRIMSFAAEGTQIVDLK 57
T LIL H I D + + DG+I A G + GT+++ +
Sbjct: 73 TNALILDHWGIVKAD--------IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124

Query: 58 GHTVIPGLNDSHLHLI 73
G V G DSH+H I
Sbjct: 125 GKIVTAGGMDSHIHFI 140
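
Each hit in this report carries a "Score = ..., Expect = ..." line followed by an "Identities = ..." line. The sketch below shows one way those two lines could be parsed for downstream filtering. It is not part of PredictBias: the function name `parse_hit_stats` and the regular expressions are assumptions made for illustration, written against the line layout seen in this report.

```python
import re

# Hypothetical helper patterns, based on the line layout in this report.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),"
    r"\s*Expect\s*=\s*(?P<evalue>\S+)"
)
FRACTION_RE = re.compile(
    r"(?P<label>Identities|Positives|Gaps)\s*=\s*(?P<num>\d+)/(?P<den>\d+)"
)


def parse_hit_stats(score_line, counts_line):
    """Return bit score, raw score, E-value and match counts for one hit."""
    match = STATS_RE.search(score_line)
    if match is None:
        raise ValueError("not a Score/Expect line: %r" % score_line)

    evalue_text = match.group("evalue")
    # The report abbreviates 1e-120 as "e-120"; prefix a 1 so float() accepts it.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text

    stats = {
        "bits": float(match.group("bits")),
        "raw_score": int(match.group("raw")),
        "evalue": float(evalue_text),
    }
    # Identities / Positives / Gaps are stored as (matches, alignment length).
    for frac in FRACTION_RE.finditer(counts_line):
        stats[frac.group("label").lower()] = (
            int(frac.group("num")),
            int(frac.group("den")),
        )
    return stats


stats = parse_hit_stats(
    "Score = 30.1 bits (68), Expect = 0.033",
    "Identities = 22/76 (28%), Positives = 31/76 (40%), Gaps = 14/76 (18%)",
)
print(stats)
```

For the UREASE hit above, integer division of the stored counts (for example `22 * 100 // 76`) gives 28, 40 and 18, matching the percentages printed in the report. Keeping the raw counts rather than the printed percentages avoids losing precision when hits are compared.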


87  D364_RS01875  D364_RS01895  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS01875   2      18       -0.421322  protein translocase subunit SecD
D364_RS01880   0      16       -0.412211  protein translocase subunit SecF
D364_RS01885  -2      14        1.391974  VOC family protein
D364_RS01890  -1      13        0.759599  YafY family transcriptional regulator
D364_RS01895   0      16       -0.237084  nucleoside-specific channel-forming protein Tsx
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01875  SECFTRNLCASE  70     5e-15    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 69.9 bits (171), Expect = 5e-15
Identities = 37/183 (20%), Positives = 88/183 (48%), Gaps = 5/183 (2%)

Query: 433 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIL-FYKKFGLIATSALIANLILIV 491
++I ++GP + + + + + LA VV + ++ + F +F L A AL+ +++L V
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 492 GIMSLIPGATLTMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAIDEGYRGA 549
G+ +++ + +A ++ +++ V++ +R++E L ++ ++
Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253

Query: 550 FSSIFDANVTTLIKVIILYAVGTGAIKGFAITTGIGIATSMFTAIVGTRAIVNLLYGGKR 609
S +TTL+ ++ + G I+GF G+ T ++++ + IV L G R
Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV-LFIGLDR 312

Query: 610 VKK 612
K+
Sbjct: 313 NKE 315


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01880  SECFTRNLCASE  342    e-120    Bacterial translocase SecF protein signature.
		>SECFTRNLCASE#Bacterial translocase SecF protein signature.

Length = 333

Score = 342 bits (880), Expect = e-120
Identities = 101/308 (32%), Positives = 172/308 (55%), Gaps = 12/308 (3%)

Query: 18 DFMRWDYWAFGISGFLLIVSIAIIGVRGFNWGLDFTGGTVIEITLEKPVDLDQMRDSLQK 77
DF RW + FG + ++I S+ + V G N+G+DF GGT I +D+ R +L+
Sbjct: 15 DFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALEP 74

Query: 78 AGFEEPQVQNFGSSR------DIMVRMPPVHDANGSQELGSKVVTVINE------STSQN 125
+ + M+R+ D G++ G++ ++N+ +
Sbjct: 75 LELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPA 134

Query: 126 AAVKRIEFVGPSVGADLAQTGALALIAALVCILIYVGFRFEWRLAAGVVIALAHDVVITM 185
+ E VGP V +L T +L+AA V I+ Y+ RFEW+ A G V+AL HDV++T+
Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194

Query: 186 GVLSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQTL 245
G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +TL
Sbjct: 195 GLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETL 254

Query: 246 HRTLITSGTTLMVILMLFLFGGPILEGFSLTMLIGVSIGTASSIYVASALALKLGMKREH 305
RT++T TTL+ ++ + ++GG ++ GF M+ GV GT SS+YVA + L +G+ R
Sbjct: 255 SRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNK 314

Query: 306 LIQQKVEK 313
+ +K
Sbjct: 315 EKKDPSDK 322


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS01890  ARGREPRESSOR  33     6e-04    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 32.5 bits (74), Expect = 6e-04
Identities = 18/66 (27%), Positives = 28/66 (42%), Gaps = 7/66 (10%)

Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAERLGVSERTVYRDIRDLSLSGVPVEGEAGS 57
+ R +I +I+ + T L + V++ TV RDI++L L V V GS
Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHL--VKVPTNNGS 59

Query: 58 GYRLLA 63
L
Sbjct: 60 YKYSLP 65


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
D364_RS01895  CHANNELTSX  537    0.0      Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 537 bits (1384), Expect = 0.0
Identities = 294/294 (100%), Positives = 294/294 (100%)

Query: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60
MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE
Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60

Query: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120
YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP
Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120

Query: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180
FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN
Sbjct: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180

Query: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240
EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH
Sbjct: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240

Query: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF
Sbjct: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294


88  D364_RS02305  D364_RS02320  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS02305   1      13       -1.824594  multidrug efflux RND transporter permease
D364_RS02310   2      12       -1.110592  multidrug efflux RND transporter periplasmic
D364_RS02315   3      12       -0.812223  multidrug efflux transporter transcriptional
D364_RS02320   3      13       -0.160091  mechanosensitive channel MscK
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS02305  ACRIFLAVINRP  1365   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1365 bits (3535), Expect = 0.0
Identities = 805/1032 (78%), Positives = 911/1032 (88%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLSILKLPVAQYPTIAPPAISITAMYPGADAETVQNT 60
M NFFI RPIFAWV+AII+M+AG L+IL+LPVAQYPTIAPPA+S++A YPGADA+TVQ+T
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDHLMYMSSNGDSTGTATITLTFESGTDPDIAQVQVQNKLALATPLLPQ 120
VTQVIEQNMNGID+LMYMSS DS G+ TITLTF+SGTDPDIAQVQVQNKL LATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGISVEKASSSFLMVVGVINTNGTMNQDDISDYVAANMKDPISRTSGVGDVQLFGS 180
EVQQQGISVEK+SSS+LMV G ++ N QDDISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMDPNKLNNFQLTPVDVISALKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW+D + LN ++LTPVDVI+ LK QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TNTEEFGNILLKVNQDGSQVRLRDVAKIELGGESYDVVAKFNGQPASGLGIKLATGANAL 300
N EEFG + L+VN DGS VRL+DVA++ELGGE+Y+V+A+ NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTANAIRAELAKMEPFFPSGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIQKGSHGATTGFFGWFNRMFDKSTHHYTDSVGNILRSTGRY 540
SVLVALILTPALCAT+LKP+ H GFFGWFN FD S +HYT+SVG IL STGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LVLYLIIVVGMAWLFVRLPSSFLPDEDQGVFLSMAQLPAGATQERTQKVLDEMTNYYLTK 600
L++Y +IV GM LF+RLPSSFLP+EDQGVFL+M QLPAGATQERTQKVLD++T+YYL
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKDNVESVFAVNGFGFAGRGQNTGIAFVSLKDWSQRPGEENKVEAITARAMGYFSQIKDA 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 MVFAFNLPAIVELGTATGFDFELIDQGGLGHEKLTQARNQLFGMVAQHPDVLTGVRPNGL 720
V FN+PAIVELGTATGFDFELIDQ GLGH+ LTQARNQL GM AQHP L VRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYIMSEAKYRM 780
EDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+Y+ ++AK+RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPEDIGKWYVRGSDGQMVPFSAFSTSRWEYGSPRLERYNGLPSLEILGQAAPGKSTGEAM 840
LPED+ K YVR ++G+MVPFSAF+TS W YGSPRLERYNGLPS+EI G+AAPG S+G+AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 ALMEELAGKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900
ALME LA KLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGVVGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGLI 960
MLVVPLG+VG LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEGKG++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 EATLEAVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMVTATILAIFF 1020
EATL AVRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGMV+AT+LAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 VPVFFVVVRRRF 1032
VPVFFVV+RR F
Sbjct: 1021 VPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS02310  RTXTOXIND  43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 30/210 (14%), Positives = 75/210 (35%), Gaps = 19/210 (9%)

Query: 100 TYQASYDSAKGDLAKAQAAANMDQLTVKRYQKLLGTKYISQQDYDTAVATA-QQSNAAVV 158
+ Y A +L ++ + + ++ Q + + +Q+ +
Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV---TQLFKNEILDKLRQTTDNIG 312

Query: 159 AAKAAVETARINLAYTKVTSPISGRIGKSAV-TEGALVQNGQTTALATVQQLDPIYVDVT 217
+ + + +P+S ++ + V TEG +V +T + V + D + V
Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371

Query: 218 QSSNDFLRLKQEL-ADGRLKQENGK------AKVELVTNDGLKYPQSGTLEFSDVTVDQT 270
+ D + A +++ KV+ + D ++ + G + +++++
Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431

Query: 271 TGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
S + I L GM V A ++ G
Sbjct: 432 CLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 29.0 bits (65), Expect = 0.040
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 3/78 (3%)

Query: 48 APLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFV-EGSDIQAGVSLYQIDPATYQASY 105
++I G+ T + R E++P + I+ K V EG ++ G L ++ +A
Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADT 136

Query: 106 DSAKGDLAKAQAAANMDQ 123
+ L +A+ Q
Sbjct: 137 LKTQSSLLQARLEQTRYQ 154


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS02315  HTHTETR  185    2e-61    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 185 bits (470), Expect = 2e-61
Identities = 170/213 (79%), Positives = 194/213 (91%)

Query: 1 MARKTKQQARETRQLILDVALRLFSQQGVSSTSLATIAKAAGVTRGAIYWHFKNKSDLFN 60
MARKTKQ+A+ETRQ ILDVALRLFSQQGVSSTSL IAKAAGVTRGAIYWHFK+KSDLF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSDASISDLEIEYRAKFPNDPLSVIREILVYVLEATVTEERRRLMMEIIYHKCEFV 120
EIWELS+++I +LE+EY+AKFP DPLSV+REIL++VLE+TVTEERRRL+MEII+HKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMTVVQQAQRQLSLASYERIEQTLKECIAAKLLPANLLTRRAAVLMRSYLSGLMENWLF 180
GEM VVQQAQR L L SY+RIEQTLK CI AK+LPA+L+TRRAA++MR Y+SGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APDSFDLHAEARDYVAILLEMYQFCPTLRGPES 213
AP SFDL EARDYVAILLEMY CPTLR P +
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPAT 213


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
D364_RS02320  GPOSANCHOR  47     4e-07    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 47.0 bits (111), Expect = 4e-07
Identities = 42/282 (14%), Positives = 95/282 (33%), Gaps = 4/282 (1%)

Query: 31 RAADLPDRAEVQSQLNTLNKQKELTPQDKLVQQDLTQTLETLDKIERIKSETAQLRQQVE 90
+ +DL + N +EL+ + ++++ E KI+ +++ A L + +E
Sbjct: 72 KNSDLSFNNKALKDHND-ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130

Query: 91 QAPAKLRQAVESLNNLSDVPNDDATRKTLSTLSLRQLESRVTQTLDDLQNAQNDLATYNS 150
A + L A RK +L + T ++ + + A +
Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190

Query: 151 QLVSLQTQPERVQNAMFNASQQLQQIRNRLNGTSVGD---ETLRPTQQVLLQAQQALLNA 207
+ L+ E N S +++ + + E A A +
Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250

Query: 208 QIEQQRKSLEGNTILQDTLQKQRDYVTAWSNRLEHQLQLLQEAVNSKRLTLTEKTAQEAV 267
++ L+ L+ ++ TA S +++ K + A
Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310

Query: 268 TPDETARIQANPLVKQELDINHQLSEKLIQATENGNQLVQRN 309
+ A+ K++L+ HQ E+ + +E Q ++R+
Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352


89  D364_RS03250  D364_RS03305  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
D364_RS03250   0      18       5.086914  enterobactin transporter EntS
D364_RS03255   0      18       4.637292  Fe2+-enterobactin ABC transporter
D364_RS03260  -2      18       3.823537  isochorismate synthase EntC
D364_RS03265  -2      19       3.108156  (2,3-dihydroxybenzoyl)adenylate synthase EntE
D364_RS03270  -1      19       2.386985  enterobactin biosynthesis bifunctional
D364_RS03275  -1      17       1.308716  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase
D364_RS03280   1      15       1.040220  proofreading thioesterase EntH
D364_RS03285   1      14       1.720741  carbon starvation protein CstA
D364_RS03290  -2      14       2.405227  YbdD/YjiX family protein
D364_RS03295  -2      15       2.615792  helix-turn-helix domain-containing protein
D364_RS03300  -1      16       3.011214  type II toxin-antitoxin system RelE/ParE family
D364_RS03305  -1      17       3.785731  3-oxoacyl-ACP reductase FabG
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS03250  TCRTETB  35     4e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 35.2 bits (81), Expect = 4e-04
Identities = 39/187 (20%), Positives = 71/187 (37%), Gaps = 8/187 (4%)

Query: 24 IARFISILSLGLLGVAIPVQIQMMTHSTWQVGLSVTLTGASMFVGLMVGGVLADRYERKR 83
I F S+L+ +L V++P T + +G V G L+D+ KR
Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80

Query: 84 LILLARGTCGVGFVGLCLNALLPEPSLAAIYLLGIWDGFFASLGVTALLAATPALVGREN 143
L+L G V + + A ++ G F +L ++ + +EN
Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKEN 136

Query: 144 LMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNFGLAAAGTFITTLTLLRLPQLPPPP 203
+A + V +G + P IGG++ + W++ L IT +T+ L +L
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKE 192

Query: 204 QPREHPL 210
+
Sbjct: 193 VRIKGHF 199


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03255  FERRIBNDNGPP  52     6e-10    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 52.3 bits (125), Expect = 6e-10
Identities = 59/288 (20%), Positives = 100/288 (34%), Gaps = 31/288 (10%)

Query: 40 HTLPSQPLRIVSTSVTLTGSLLAIDAPVVASGATTPNNRVADSQGFLRQWSEVAKARKLA 99
H P RIV+ LLA+ VAD+ + SE +
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINYRLWVSEPPLPDSV- 78

Query: 100 RLYIG---EPSAEAVAAQMPDLILVSATGGDSALPLYDQLKTIAPTLVINYDDKS----- 151
+ +G EP+ E + P ++ SA G P + L IAP N+ D
Sbjct: 79 -IDVGLRTEPNLELLTEMKPSFMVWSAGYG----PSPEMLARIAPGRGFNFSDGKQPLAM 133

Query: 152 WQTLLTQLGQITGHEQQASARIADFNKQLVSLKEKMKLPPQPVTALVYTAAAHSANIWTP 211
+ LT++ + + A +A + + S+K + L ++ P
Sbjct: 134 ARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGP 193

Query: 212 ESAQGQMLEQLGFSLATLPGGLPASHSQGKRHDIVQLGGENLAAGLNGQSLFLFAGDQKD 271
S ++L++ G A + + + LAA + L + KD
Sbjct: 194 NSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245

Query: 272 ADAIYANPLLAHLPAVAGKRVYPLGTETFRLDYYSALLVLQRLSSLFG 319
DA+ A PL +P V R + F SA+ ++ L + G
Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIG 293


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03270  ISCHRISMTASE  425    e-153    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 425 bits (1094), Expect = e-153
Identities = 152/303 (50%), Positives = 201/303 (66%), Gaps = 20/303 (6%)

Query: 1 MAIPKLQAYALPEASDIPANKVNWAFEPSRAALLIHDMQEYFLNFWGENSAMMEKVVANI 60
MAIP +Q Y +P ASD+P NKV+W +P+RA LLIHDMQ YF++ + ++ + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDFCKQNGIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQQVIAALAPDEDDTV 120
L++ C Q GIPV YTAQP Q+ +DRALL D WGPGL P ++++I LAP++DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEEMLKETGRDQLIITGVYAHIGCMTTATDAFMRDIKPFFVADALAD 180
L KWRYSAF R+ L EM+++ GRDQLIITG+YAHIGC+ TA +AFM DIK FFV DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSREEHLMALKYVAGRSGRVVMTEELL--------PLPASKA-----------ALRALIL 221
FS E+H MAL+Y AGR VMT+ LL + + A +R I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 222 PLLDESDEPLD-DENLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWALLTR 280
LL E+ E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LLT
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300

Query: 281 EVQ 283
Q
Sbjct: 301 RSQ 303


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03275  DHBDHDRGNASE  341    e-121    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 341 bits (875), Expect = e-121
Identities = 107/258 (41%), Positives = 149/258 (57%), Gaps = 20/258 (7%)

Query: 5 GQTVWVTGAGKGIGYATALAFVEAGANVTGFD---------------LAFDGESYPFATE 49
G+ ++TGA +GIG A A GA++ D A E++P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 50 TLDVADADQVREACSRLLANTERLDVLVNAAGILRMGATDQLSAEDWQQTFAVNVGGAFN 109
DV D+ + E +R+ +D+LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 110 LFQQTMAQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGSGVRC 169
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 170 NLVSPGSTDTDMQRTLWVSDDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASSH 229
N+VSPGST+TDMQ +LW ++ +Q I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 230 ASHITLQDIVVDGGSTLG 247
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03285  ACRIFLAVINRP  31     0.014    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.3 bits (71), Expect = 0.014
Identities = 9/60 (15%), Positives = 25/60 (41%), Gaps = 1/60 (1%)

Query: 172 VIILAVLAMIVVKALTHSPWG-TYTVAFTIPLAIFMGIYIRYLRPGRIGEVSVIGLVMLV 230
++ ++ + + + A + W +V +PL I + L + ++GL+ +
Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS03305  DHBDHDRGNASE  135    8e-41    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 135 bits (340), Expect = 8e-41
Identities = 86/253 (33%), Positives = 130/253 (51%), Gaps = 15/253 (5%)

Query: 5 LTGKKALVTGASRGLGRAIALSLARAGAAVVITYEKSVDKAQAVADEIKALGRYGEAVQA 64
+ GK A +TGA++G+G A+A +LA GA + + + +K + V +KA R+ EA A
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64

Query: 65 DSASAQAIQDAVTHAARSLGGLDILVNNAGIARGGPLESMTLADIDALINVNIRGVVIAT 124
D + AI + R +G +DILVN AG+ R G + S++ + +A +VN GV A+
Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124

Query: 125 QEALVHMAD--GGRIINIGSCLANRVAMPGISVYAMTKSALNALTRGLARDLGPRGITVN 182
+ +M D G I+ +GS A ++ YA +K+A T+ L +L I N
Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183

Query: 183 LVHPGPTNSDMN-----PEDGEQ------AEAQRQMIAVGHYGQPEDIAAAVTFLASPAA 231
+V PG T +DM E+G + E + I + +P DIA AV FL S A
Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243

Query: 232 GQISGTGLDVDGG 244
G I+ L VDGG
Sbjct: 244 GHITMHNLCVDGG 256


90  D364_RS04345  D364_RS04400  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS04345  -1      14        0.473847  ABC transporter permease
D364_RS04350  -1      14        0.626165  ABC transporter permease
D364_RS04355   2      14       -1.389297  ATP-binding cassette domain-containing protein
D364_RS04360   4      15       -2.315895  secretion protein HlyD
D364_RS04365   6      15       -3.181486  transcriptional regulator CecR
D364_RS04370   5      14       -2.504436  LysR family transcriptional regulator
D364_RS27320   5      12       -0.551814  hypothetical protein
D364_RS04380   1      10        0.116039  MFS transporter
D364_RS04385  -1      12        2.307001  glucose 1-dehydrogenase
D364_RS04390  -1      14        2.168608  transketolase
D364_RS04395  -1      14        2.053670  transketolase family protein
D364_RS04400  -1      14        2.348969  ATP-dependent RNA helicase RhlE
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS04345  ABC2TRNSPORT  45     2e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 44.9 bits (106), Expect = 2e-07
Identities = 31/137 (22%), Positives = 56/137 (40%), Gaps = 1/137 (0%)

Query: 197 AREREQGTLDQLLVSPLATWQIFVGKAVPALIVATLQATIVLAIGIWAYQIPFAGSLLLF 256
R Q T + +L + L I +G+ A A L + + + SLL
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWL-SLLYA 150

Query: 257 YFTMVIYGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPQWLQDLT 316
+ + GL+ G+++++L + + + P + LSG V PV+ +P Q
Sbjct: 151 LPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAA 210

Query: 317 WINPIRHFTDITKQIYL 333
P+ H D+ + I L
Sbjct: 211 RFLPLSHSIDLIRPIML 227


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04355  PF05272  31     0.019    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.019
Identities = 11/27 (40%), Positives = 14/27 (51%)

Query: 30 IRAGYVTGLVGPDGAGKTTLMRMLAGL 56
+ Y L G G GK+TL+ L GL
Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS04360  RTXTOXIND  76     1e-17    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 76.4 bits (188), Expect = 1e-17
Identities = 49/290 (16%), Positives = 107/290 (36%), Gaps = 26/290 (8%)

Query: 55 ASLTVDEGDSIRAGQTLGELDRAPYENALLQAQANVSTAQAQYDLMMAGYRAEEIAQAAA 114
V E + +R + E + ++N Q + N+ +A+ ++A E
Sbjct: 175 YFQNVSEEEVLRLTSLIKE-QFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 115 AVKQAQAAYDYAQNFYQRQ--LGLRASSAISANDLENARSSRDQAQATLKSAQDKLRQYR 172
+ + + + L + N+L +S +Q ++ + SA+++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 173 AGNRPQ---EIAQAKASLEQAQAALAQAKLDLHDTVLTAPSDGTLMTRAV-EPGTMLNAG 228
+ + ++ Q ++ LA+ + +V+ AP + V G ++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 229 GTVLTLSLT-HPVWVRAYVDEKNLGQAQPGQEVLLYTDSRPDKPYH---GKIGFVSPSAE 284
T++ + + V A V K++G GQ ++ ++ P Y GK+ ++ A
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA- 412

Query: 285 FTPKTVETPDLRTDLVYRLRIVVTDADGA-------LRQGMPVTISFSHG 327
D R LV+ + I + + + L GM VT G
Sbjct: 413 -------IEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04365  HTHTETR  61     1e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 61.2 bits (148), Expect = 1e-13
Identities = 24/154 (15%), Positives = 54/154 (35%), Gaps = 16/154 (10%)

Query: 4 KGEQAKNQLIAAAIAQFGEYGQHATT-RDIAAQAGQNIAAITYYFGSKDDLYLACAQWIA 62
+ ++ + ++ A+ F + G +T+ +IA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 63 DFIGDNFRPQAEAAEHLLAGEAPDRQAIRDLILSACHNMILLLTQDDTVNLSKFISREQL 122
IG+ +R++++ + + T++ L + I +
Sbjct: 68 SNIGELELEYQAKF------PGDPLSVLREILIHVLESTV---TEERRRLLMEIIFHKCE 118

Query: 123 APTA------AYHLIHQQVIAPLHHYLTRLIAAW 150
A + + + L I A
Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAK 152


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04380  TCRTETA  43     2e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 42.9 bits (101), Expect = 2e-06
Identities = 58/400 (14%), Positives = 129/400 (32%), Gaps = 50/400 (12%)

Query: 22 LTMIFLVYAINYADRTNIGAVLPFIIDEFHINNFEAGAIASMFFLGYAVSQIP----AGF 77
L +I A++ I VLP ++ + +N + L YA+ Q G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLAL-YALMQFACAPVLGA 65

Query: 78 FIAKRGTRGLVSLSIFGFSAFTWLMGTVSSVFGLKLVRLGLGLSEGPCPVGLASTINNWF 137
+ G R ++ +S+ G + +M T ++ L + R+ G++ V + I +
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AGAYIADIT 124

Query: 138 PPKEKATATGVYIAATMFAPIIVPPLAVWIAVTWGWRWVFFSFAIPGIVAAIAWYLLVKS 197
E+A G +++A ++ P+ + + FF+ A + + L+
Sbjct: 125 DGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL-- 181

Query: 198 KPAESGFVSQSELAEINAGRESHNNSVR-ENILIADRFTWLDKIIRVKKMAPIDTAKGLF 256
ESH R + + +A + +
Sbjct: 182 -------------------PESHKGERRPLRREALNPLASFRWARGMTVVAAL-----MA 217

Query: 257 TSKNILGDCLAYFMMVSVLYGLLTWIPLYLVKERGFDVMSMGFVASMPCIGGFIGAIGGG 316
+F+M V ++ +D ++G + G + ++
Sbjct: 218 V----------FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQA 264

Query: 317 WISDKLLGR-RRKPTMMFTAVSTVVMMLIMLNIPASTLAVCIGLFFVGFCLNIGWPAFTA 375
I+ + R + +M ++ +++ +A I + IG PA A
Sbjct: 265 MITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG--GIGMPALQA 322

Query: 376 YGMAVSDSKTYPIASSIINSGGNLGGFVAPMAAGFLLDKT 415
D + + + +L V P+ + +
Sbjct: 323 MLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS04385  DHBDHDRGNASE  118    2e-34    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 118 bits (296), Expect = 2e-34
Identities = 74/255 (29%), Positives = 119/255 (46%), Gaps = 16/255 (6%)

Query: 3 LKDKVAIITGAASARGLGFATAKLFAENGAKVVIIDLNGEAS---KTAAAALGEGHLGLA 59
++ K+A ITGAA +G+G A A+ A GA + +D N E ++ A
Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 60 ANVADEVQVQAAIEQILAKYGRVDVLVNNAGITQPLKLMDIKRANYDAVLDVSLRGTLLM 119
A+V D + +I + G +D+LVN AG+ +P + + ++A V+ G
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123

Query: 120 SQAVIPTMRAQKSGSIVCISSVSAQRGGGIFGGPHYSAAKAGVLGLARAMARELGPDNVR 179
S++V M ++SGSIV + S A G Y+++KA + + + EL N+R
Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181

Query: 180 VNCITPGLIQTDITAGKLTDD---------MTANILAGIPMNRLGDAIDIARAALFLGSD 230
N ++PG +TD+ D+ GIP+ +L DIA A LFL S
Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 231 LSSYSTGITLDVNGG 245
+ + T L V+GG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits  Score  E-value  Comments
D364_RS04400  SECA  30     0.026    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.026
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGGIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


91  D364_RS04625  D364_RS04665  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
D364_RS04625   0      12       1.321917  serine-type D-Ala-D-Ala carboxypeptidase
D364_RS04630   1      14       2.450075  DNA-binding transcriptional repressor DeoR
D364_RS04635   1      13       2.057238  undecaprenyl-diphosphate phosphatase
D364_RS04640   0      13       1.782439  multidrug efflux MFS transporter KdeA
D364_RS04645   0      15       1.338765  Cof-type HAD-IIB family hydrolase
D364_RS04650   0      16       1.767381  MFS transporter
D364_RS04655  -1      15       0.841306  TetR family transcriptional regulator
D364_RS04665  -1      15       1.071111  aspartate:alanine antiporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
D364_RS04625  BLACTAMASEA  35     4e-04    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 34.8 bits (80), Expect = 4e-04
Identities = 39/156 (25%), Positives = 61/156 (39%), Gaps = 14/156 (8%)

Query: 15 CALLFLVAPAV-QAAEQLPDAPS-IDAR-AWILMDYASGKVLSEGNADEKLDPASLTKIM 71
A L L A Q EQ+ + S + R I MD ASG+ L+ ADE+ S K++
Sbjct: 12 LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71

Query: 72 TSYVVGQAIKAGKIKLTDMVTVGRDAWATGNPALRGSSVMFLKPGMQVSVEDLNKGVIIQ 131
V + AG +L + + +P L GM +V +L I
Sbjct: 72 LCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSE----KHLADGM--TVGELCAAAITM 125

Query: 132 SGNDASIAIADYVAGSQDAFVSLMNGYAKKMGLTNT 167
S N A+ + V G + + +++G T
Sbjct: 126 SDNSAANLLLATVGGPAG-----LTAFLRQIGDNVT 156


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04640  TCRTETA  45     3e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 45.2 bits (107), Expect = 3e-07
Identities = 61/267 (22%), Positives = 106/267 (39%), Gaps = 19/267 (7%)

Query: 71 LLGPLSDRIGRRPVMLTGVVWFIVTCLATLLAQTIEQFTLLRFLQGISLCFIGAVGYAAI 130
+LG LSDR GRRPV+L + V A + + R + GI+ GAV A I
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120

Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVGAAWVHILPWEMMFVLFAVLAAISFFGLQR 190
+ + + M+ + GP++G P F A L ++F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCF 179

Query: 191 AMPET--ATRLGEKLSVKELGRDYRLVLKNLRFVAGALATGFVSLPLLAWIAQSP--VII 246
+PE+ R + +R + + VA +A F ++ + Q P + +
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWA-RGMTVVAALMAVFF----IMQLVGQVPAALWV 234

Query: 247 ISGEQATSYEYGMLQVPI--FGAL--IAGNLVLARLTARRTVRSLIIMGGWPIMFGLILS 302
I GE ++ + + + FG L +A ++ + AR R +++G G IL
Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294

Query: 303 AAATVVSSHAYLWMTAGLSFYAFGIGL 329
A AT ++ + + GIG+
Sbjct: 295 AFAT----RGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04650  TCRTETB  34     8e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 8e-04
Identities = 34/155 (21%), Positives = 62/155 (40%), Gaps = 19/155 (12%)

Query: 17 LFMFFFIPGLLMASWATRTPAIRDLLALSTAEMGVVLFGLSVGSMSGILCS---AWLVKR 73
+ I G + + ++D+ LSTAE+G V+ + G+MS I+ LV R
Sbjct: 262 VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI--IFPGTMSVIIFGYIGGILVDR 319

Query: 74 FGTRKVIRTTM-----SFAVLGMLVLSLALWVTSAPLFAFGLAIFGASFGSAEVAINVEG 128
G V+ + SF L+ + + ++T +F G F + S V+ +++
Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQ 379

Query: 129 AAIEREMNKTVLPMMHGFYSFGTLFGAGVGMAVTG 163
M +F + G G+A+ G
Sbjct: 380 QEAGAGM---------SLLNFTSFLSEGTGIAIVG 405



Score = 30.6 bits (69), Expect = 0.013
Identities = 34/172 (19%), Positives = 63/172 (36%), Gaps = 17/172 (9%)

Query: 218 LLIGVIVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTLGRFTGGWFI 275
+IGV+ + F + + P +M D H S GS+I T+ + + + GG +
Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317

Query: 276 DRYSRVAVVRGSAVMGALGIGLIIFVDNPWVAGISVLLWGIGASLGF-PLTISAASDTGP 334
DR + V+ ++ F+ + I LG T + S
Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLE---TTSWFMTIIIVFVLGGLSFTKTVISTIVS 374

Query: 335 DAPKRVSVVAITGYLAFLVGPPLLGFLGEHFGLRSAMMVVLGLVMAAALVAR 386
+ K+ A L F FL E G+ ++G +++ L+ +
Sbjct: 375 SSLKQQEAGAGMSLLNF------TSFLSEGTGI-----AIVGGLLSIPLLDQ 415


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04655  HTHTETR  50     4e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 50.4 bits (120), Expect = 4e-10
Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 2/76 (2%)

Query: 1 MAR--RPNDPQRRERILQATLDTIAAHGIHAVTHRKIATCANVPLGSLTYYFSGIEALIE 58
MAR + + R+ IL L + G+ + + +IA A V G++ ++F L
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 59 EAFSLFTAEMSAQYQQ 74
E + L + + +
Sbjct: 61 EIWELSESNIGELELE 76


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS04665  TCRTETA  32     0.005    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.005
Identities = 23/106 (21%), Positives = 35/106 (33%), Gaps = 6/106 (5%)

Query: 394 LMIGMITFQFSSFSFGIGNAAGLLFAG-IMLGFLRANHPTFG-YIPQ--GALNMVKEFGL 449
L++ + +L+ G I+ G A G YI + FG
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135

Query: 450 MVFMAGVGLSAGAGINNGLGAVGGQM--LAAGLIVSLVPVVICFLF 493
M G G+ AG + +G AA + L + CFL
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181


92  D364_RS05225  D364_RS05260  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS05225   4      28       -4.736856  DNA helicase IV
D364_RS05230   5      31       -5.576412  methylglyoxal synthase
D364_RS05235   3      32       -6.255685  CoA-binding protein
D364_RS26885   3      33       -6.535882  BapA prefix-like domain-containing protein
D364_RS05250   2      36       -7.332589  TolC family outer membrane protein
D364_RS05255   1      29       -6.091482  type I secretion system permease/ATPase
D364_RS05260   2      26       -4.902376  HlyD family type I secretion periplasmic adaptor
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS05225  YERSSTKINASE  30     0.035    Yersinia serine/threonine protein kinase signature.
		>YERSSTKINASE#Yersinia serine/threonine protein kinase signature.

Length = 732

Score = 30.1 bits (67), Expect = 0.035
Identities = 46/201 (22%), Positives = 78/201 (38%), Gaps = 28/201 (13%)

Query: 291 PTISKLESDTAARHALLLKSWQKQCQEKKAQAKSWR------LWLEEEMGWQLPE---GD 341
P+IS A H + + WQ E K +R L L G+ L G
Sbjct: 12 PSIS-----LAKAHERISQHWQNPVGELNIGGKRYRIIDNQVLRLNPHSGFSLFREGVGK 66

Query: 342 FWQDKKVQRRMASRLDRWVSLMRMHGGSQAEMIAGAPEAVRDLFSKRVKLMSPLMKDWKA 401
+ K +A L +L + E+ + P A+ +LF + + PL WK
Sbjct: 67 IFSGKMFNFSIARNLTD--TLHAAQKTTSQELRSDIPNALSNLFGAKPQTELPL--GWKG 122

Query: 402 ALKAENAVDFSGLIHQAVNILDKGRFVSPWKHILVDEFQDISPQRASLLAALRRQNSQTT 461
A D G+ + + +F HI + E +D + L+A + R ++
Sbjct: 123 E-PLSGAPDLEGM-----RVAETDKFAEGESHISIIETKD----KQRLVAKIERSIAEGH 172

Query: 462 LFAVGDDWQAIYRFSGAQLSL 482
LFA + ++ IY+ +G +L
Sbjct: 173 LFAELEAYKHIYKTAGKHPNL 193


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS26885  ICENUCLEATIN  42     2e-05    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 42.4 bits (99), Expect = 2e-05
Identities = 109/600 (18%), Positives = 200/600 (33%)

Query: 166 TDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 225
T+ G S A + + +DS + S + +S + S SD +
Sbjct: 183 TETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTA 242

Query: 226 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 285
S + DS + S + DS + S + SD + S + +DS
Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302

Query: 286 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 345
+ S + +S + S + SD + S + DS + S +
Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 362

Query: 346 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 405
DS + S + SD + S + +DS + S + +S + S
Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422

Query: 406 DSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 465
+ SD + S + D+ + S + DS + S + SD +
Sbjct: 423 TQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 482

Query: 466 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 525
S S + +S + S + S + S + ++SD + S S + ++S
Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542

Query: 526 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 585
+ S + +S + S + SD + S + SDS + S +
Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTA 602

Query: 586 DSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 645
S + S + S + S S + +DS + S + +S + S
Sbjct: 603 SYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 662

Query: 646 DSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 705
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 663 TQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTS 722

Query: 706 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDS 765
S S + +DS + S + S A S + S + S S + +DS
Sbjct: 723 GYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 42.0 bits (98), Expect = 3e-05
Identities = 110/600 (18%), Positives = 202/600 (33%)

Query: 158 SDSDSDSDTDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 217
+++ DS T G S A +D+ + S + +S + S SD +
Sbjct: 183 TETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTA 242

Query: 218 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 277
S + DS + S + DS + S + SD + S + +DS
Sbjct: 243 GYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADS 302

Query: 278 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 337
+ S + +S + S + SD + S + DS + S +
Sbjct: 303 SLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTA 362

Query: 338 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 397
DS + S + SD + S + +DS + S + +S + S
Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGS 422

Query: 398 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDS 457
+ SD + S + DS + + + DS + S + SD +
Sbjct: 423 TQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTA 482

Query: 458 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 517
S S + +S + S + S + S + ++SD + S S + ++S
Sbjct: 483 GYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANS 542

Query: 518 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 577
+ S + +S + S + SD + S + SDS + S +
Sbjct: 543 SLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTA 602

Query: 578 DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 637
S + S + S + S S + +DS + S + +S + S
Sbjct: 603 SYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGS 662

Query: 638 DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 697
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 663 TQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTS 722

Query: 698 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDS 757
S S + +DS + S + S + S A S + S S + +DS
Sbjct: 723 GYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 41.7 bits (97), Expect = 4e-05
Identities = 115/618 (18%), Positives = 206/618 (33%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
GS A +S + S SD + S + DS + S + DS
Sbjct: 213 GSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 272

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S + SD + S + +DS + S + +S + S +
Sbjct: 273 TAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQK 332

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
SD + S + DS + S + DS + S + SD + S
Sbjct: 333 GSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTG 392

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ +DS + S + +S + S + SD + S + DS +
Sbjct: 393 TAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGY 452

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S + DS + S + SD + S S + +S + S + S
Sbjct: 453 GSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTL 512

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + ++SD + S S + ++S + S + +S + S +
Sbjct: 513 TAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTARE 572

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
SD + S + SDS + S + S + S + S + S S
Sbjct: 573 GSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTS 632

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
A +DS + S + +S + S + SD + S S + +DS +
Sbjct: 633 TAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGY 692

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
+ + +S + S + SD S S S + +DS + S + S
Sbjct: 693 GSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSL 752

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + S + S S + ADS + S + S + S +
Sbjct: 753 TAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQE 812

Query: 770 DSDSDSDSDSDSDSDSDS 787
SD + S S + +DS
Sbjct: 813 RSDLTTGYGSTSTAGADS 830



Score = 41.3 bits (96), Expect = 5e-05
Identities = 112/618 (18%), Positives = 203/618 (32%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
GS S + S + S + S + +DS + S + +S
Sbjct: 165 GSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQ 224

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S SD + S + DS + S + DS + S +
Sbjct: 225 MAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQK 284

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
SD + S + +DS + S + +S + S + SD + S
Sbjct: 285 GSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTG 344

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ DS + S + DS + S + SD + S + +DS +
Sbjct: 345 TAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGY 404

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S + +S + S + SD + S + DS + S + DS
Sbjct: 405 GSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSL 464

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + SD + S S + +S + S + S + S + +
Sbjct: 465 TAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQN 524

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
+SD + S S + ++S + S + +S + S + SD + S
Sbjct: 525 ESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTG 584

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
A SDS + S + S + S + S + S S + +DS +
Sbjct: 585 TAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGY 644

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
+ + +S + S + SD + S S + +DS + S + +S
Sbjct: 645 GSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSIL 704

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + SD S S S + ADS + S + S + S +
Sbjct: 705 TAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTARE 764

Query: 770 DSDSDSDSDSDSDSDSDS 787
S + S S + +DS
Sbjct: 765 QSVLTTGYGSTSTAGADS 782



Score = 40.5 bits (94), Expect = 1e-04
Identities = 108/610 (17%), Positives = 202/610 (33%)

Query: 154 ADSDSDSDSDSDTDPGGSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 213
D S+ + + P + DA +S + + + + S S + S
Sbjct: 125 PDVTSEVKVGNRSLPVTDDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTE 184

Query: 214 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 273
+ S + S + +DS + S + +S + S SD +
Sbjct: 185 TAGDSSTLIAGYGSTGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGY 244

Query: 274 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 333
S + DS + S + DS + S + SD + S + +DS
Sbjct: 245 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 304

Query: 334 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 393
+ S + +S + S + SD + S + DS + S +
Sbjct: 305 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 364

Query: 394 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDS 453
DS + S + SD + S + +DS + S + +S + S
Sbjct: 365 DSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQ 424

Query: 454 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 513
+ SD + S + DS + S + DS + S + SD +
Sbjct: 425 TAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGY 484

Query: 514 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 573
S S + +S + S + S + S + ++SD + S S + ++S
Sbjct: 485 GSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSL 544

Query: 574 DSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 633
+ S + +S A S + SD + S + SDS + S +
Sbjct: 545 IAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASY 604

Query: 634 DSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 693
S + S + + + S S + +DS + S + +S + S
Sbjct: 605 HSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 664

Query: 694 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDS 753
+ SD + S S + +DS + S + +S + S + SD S
Sbjct: 665 TAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGY 724

Query: 754 DSDSDSDSDS 763
S S + +DS
Sbjct: 725 GSTSTAGADS 734



Score = 40.1 bits (93), Expect = 1e-04
Identities = 106/585 (18%), Positives = 195/585 (33%)

Query: 205 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 264
S + +DS + S + +S + S SD + S + DS
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 265 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 324
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 325 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 384
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 385 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSD 444
+ SD + S + +DS + S + +S + + + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 445 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 504
S + DS + S + DS + S + SD + S S + +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 505 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 564
+ S + S + S + ++SD + S S + ++S + S + +
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557

Query: 565 SDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 624
S + S + SD + S + SDS + S + S + S
Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 625 SDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 684
+ S + S S + +DS + S + +S + S + SD +
Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677

Query: 685 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSD 744
S S + +DS + S + +S + S + SD S S S A +DS
Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737

Query: 745 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 789
+ S + S + S + S + S S + +DS
Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 39.7 bits (92), Expect = 2e-04
Identities = 105/585 (17%), Positives = 195/585 (33%)

Query: 209 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 268
S + +DS + S + +S + S SD + S + DS
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 269 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 328
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 329 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 388
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 389 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSD 448
+ SD + S + +DS + S + +S A S + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 449 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 508
S + DS + S + DS + S + SD + S S + +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 509 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 568
+ S + S + S + ++SD + S S + ++S + S + +
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557

Query: 569 SDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 628
S + S + SD + S + SDS + S + S + S
Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 629 SDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 688
+ S + S S + +DS + S + +S + S + SD +
Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677

Query: 689 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSD 748
S S + +DS + S + +S + S + SD S + S + +DS
Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737

Query: 749 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDT 793
+ S + S + S + S + S S + +D+
Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 39.7 bits (92), Expect = 2e-04
Identities = 105/585 (17%), Positives = 195/585 (33%)

Query: 207 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 266
S + +DS + S + +S + S SD + S + DS
Sbjct: 198 STGTAGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLI 257

Query: 267 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 326
+ S + DS + S + SD + S + +DS + S + +
Sbjct: 258 AGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEE 317

Query: 327 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 386
S + S + SD + S + DS + S + DS + S
Sbjct: 318 STQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQT 377

Query: 387 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSD 446
+ SD + S + +DS + S + +S + S + SD +
Sbjct: 378 AQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYG 437

Query: 447 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 506
S + DS + S + DS + S + SD + S S + +S
Sbjct: 438 STGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLI 497

Query: 507 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 566
+ S + S + S + ++SD + S S + ++S + S + +
Sbjct: 498 AGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYN 557

Query: 567 SDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 626
S + S + SD + + + SDS + S + S + S
Sbjct: 558 SVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQT 617

Query: 627 SDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 686
+ S + S S + +DS A S + +S + S + SD +
Sbjct: 618 AREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYG 677

Query: 687 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSD 746
S S + +DS + S + +S + S + SD S S + + +DS
Sbjct: 678 STSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLI 737

Query: 747 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 791
+ S + S + S + S + S S + +DS
Sbjct: 738 AGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADS 782



Score = 39.0 bits (90), Expect = 2e-04
Identities = 112/620 (18%), Positives = 206/620 (33%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
G S A ++ + S SD + S + DS + S + DS
Sbjct: 211 GYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDS 270

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S + SD + S + +DS + S + +S + S +
Sbjct: 271 SLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTA 330

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
SD + S + DS + S + DS + S + SD + S
Sbjct: 331 QKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGS 390

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ +DS + S + +S + S + SD + S + DS +
Sbjct: 391 TGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIA 450

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S + DS + + + SD + S S + +S + S + S
Sbjct: 451 GYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGS 510

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + ++SD + S S + ++S + S + +S + S +
Sbjct: 511 TLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTA 570

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
SD + S + SDS + S + S + S + S + S
Sbjct: 571 REGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGS 630

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
+ + +DS + S + +S + S + SD + S S + +DS +
Sbjct: 631 TSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIA 690

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
S + +S + S + SD S S S + +DS + S + S
Sbjct: 691 GYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHS 750

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + S + S S A +DS + S + S + S +
Sbjct: 751 SLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTA 810

Query: 770 DSDSDSDSDSDSDSDSDSDS 789
SD + S S + +DS
Sbjct: 811 QERSDLTTGYGSTSTAGADS 830



Score = 38.6 bits (89), Expect = 3e-04
Identities = 105/593 (17%), Positives = 196/593 (33%)

Query: 201 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 260
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 261 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 320
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 321 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 380
S + DS + S + SD + S + +DS + S + +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 381 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSD 440
+ S + SD + S + DS + S + DS A S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 441 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 500
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 501 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 560
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 561 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSD 620
S + S + S + ++SD + S S + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 621 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSD 680
+ S + SD + S + SDS + S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 681 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSD 740
S + S S + +DS + S + +S + S + SD + + S
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 741 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDT 793
+ +DS + S + +S + S + SD S S S + +D+
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADS 734



Score = 38.6 bits (89), Expect = 4e-04
Identities = 110/610 (18%), Positives = 203/610 (33%)

Query: 182 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 241
+S + S SD + S + DS + S + DS + S
Sbjct: 221 ESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQ 280

Query: 242 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 301
+ SD + S + +DS + S + +S + S + SD +
Sbjct: 281 TAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGY 340

Query: 302 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 361
S + DS + S + DS + S + SD + S + +DS
Sbjct: 341 GSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSL 400

Query: 362 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 421
+ S + +S + S + SD + S + DS + S +
Sbjct: 401 IAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGE 460

Query: 422 DSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 481
DS + S + SD + S S + +S + S + S + S
Sbjct: 461 DSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQ 520

Query: 482 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 541
+ ++SD + S S + ++S + S + +S + S + SD +
Sbjct: 521 TAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGY 580

Query: 542 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDS 601
S + SDS + S + S + S + S + + S + +DS
Sbjct: 581 GSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSL 640

Query: 602 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDS 661
+ S + +S + S + SD + S S + +DS A S +
Sbjct: 641 IAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGY 700

Query: 662 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 721
+S + S + SD S S S + +DS + S + S + S
Sbjct: 701 NSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQ 760

Query: 722 DSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 781
+ S + S + + +DS + S + S + S + SD +
Sbjct: 761 TAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGY 820

Query: 782 DSDSDSDSDS 791
S S + +DS
Sbjct: 821 GSTSTAGADS 830



Score = 38.2 bits (88), Expect = 4e-04
Identities = 105/593 (17%), Positives = 196/593 (33%)

Query: 199 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 258
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 259 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 318
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 319 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 378
S + DS + S + SD + S + +DS + S + +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 379 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSD 438
+ S + SD + S + DS + S + DS + S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 439 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 498
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 499 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 558
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 559 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSD 618
S + S + S + ++SD + + S + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 619 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSD 678
+ S + SD + S + SDS A S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 679 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAD 738
S + S S + +DS + S + +S + S + SD + S +
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 739 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 791
+ +DS + S + +S + S + SD S S S + +DS
Sbjct: 682 AGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTSGYGSTSTAGADS 734



Score = 37.8 bits (87), Expect = 6e-04
Identities = 115/619 (18%), Positives = 210/619 (33%)

Query: 170 GSESDADADSDTDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 229
GS A ADS + S + +S + S + SD + S + DS
Sbjct: 293 GSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSL 352

Query: 230 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 289
+ S + DS + S + SD + S + +DS + S +
Sbjct: 353 IAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGE 412

Query: 290 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 349
+S + S + SD + S + DS + S + DS + S
Sbjct: 413 ESTQTAGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQ 472

Query: 350 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 409
+ SD + S S + +S + S + S + S + ++SD +
Sbjct: 473 TAQKGSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTLTAGYGSTQTAQNESDLITGY 532

Query: 410 DSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 469
S S + ++S + S + +S + S + SD + S + SDS
Sbjct: 533 GSTSTAGANSSLIAGYGSTQTASYNSVLTAGYGSTQTAREGSDLTAGYGSTGTAGSDSSI 592

Query: 470 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 529
+ S + S + S + S + S S + +DS + S +
Sbjct: 593 IAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGYGSTSTAGADSSLIAGYGSTQTAGY 652

Query: 530 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 589
+S + S + SD + S S + +DS + S + +S + S
Sbjct: 653 NSILTAGYGSTQTAQEGSDLTAGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQ 712

Query: 590 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 649
A SD S S S + +DS + S + S + S + S +
Sbjct: 713 TAQEGSDLTSGYGSTSTAGADSSLIAGYGSTQTASYHSSLTAGYGSTQTAREQSVLTTGY 772

Query: 650 DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 709
+ S + +DS + S + S + S + SD + S S + +DS
Sbjct: 773 GSTSTAGADSSLIAGYGSTQTAGYHSILTAGYGSTQTAQERSDLTTGYGSTSTAGADSSL 832

Query: 710 DSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 769
+ S + +S + S + +SD + S S + DS + S +
Sbjct: 833 IAGYGSTQTAGYNSILTAGYGSTQTAQENSDLTTGYGSTSTAGYDSSLIAGYGSTQTAGY 892

Query: 770 DSDSDSDSDSDSDSDSDSD 788
+S + S + +SD
Sbjct: 893 NSILTAGYGSTQTAQENSD 911



Score = 37.0 bits (85), Expect = 0.001
Identities = 93/530 (17%), Positives = 172/530 (32%)

Query: 271 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 330
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 331 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 390
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 391 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSD 450
S + DS + S + SD + S + ADS + S + +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 451 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 510
+ S + SD + S + DS + S + DS + S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 511 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 570
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 571 SDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 630
+ DS + S + D+ + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 631 SDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 690
S + S + S A ++SD + S S + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 691 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSD 750
+ S + SD + S + SDS + S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 751 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDTEPQANND 800
S + S S + +DS + S + +S + S Q +D
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSD 671



Score = 35.5 bits (81), Expect = 0.003
Identities = 94/549 (17%), Positives = 178/549 (32%)

Query: 257 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 316
D D+ +S S + + + S S + S + S + S
Sbjct: 142 DDIDATIESGSTQPTQTIEIATYGSTLSGTHQSQLIAGYGSTETAGDSSTLIAGYGSTGT 201

Query: 317 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 376
+ +DS + S + +S + S SD + S + DS +
Sbjct: 202 AGADSTLVAGYGSTQTAGEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIAGYG 261

Query: 377 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSD 436
S + DS + S + SD + S + +DS + S A +S
Sbjct: 262 STQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQT 321

Query: 437 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 496
+ S + SD + S + DS + S + DS + S +
Sbjct: 322 AGYGSTQTAQKGSDLTAGYGSTGTAGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKG 381

Query: 497 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 556
SD + S + +DS + S + +S + S + SD + S
Sbjct: 382 SDLTAGYGSTGTAGADSSLIAGYGSTQTAGEESTQTAGYGSTQTAQKGSDLTAGYGSTGT 441

Query: 557 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSD 616
+ DS + S + DS + S + SD + S S + +S +
Sbjct: 442 AGDDSSLIAGYGSTQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTSTAGYESSLIAGYG 501

Query: 617 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSD 676
S + S + S + ++SD + S + + ++S + S + +S
Sbjct: 502 STQTAGYGSTLTAGYGSTQTAQNESDLITGYGSTSTAGANSSLIAGYGSTQTASYNSVLT 561

Query: 677 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 736
+ S + SD + S + SDS + S + S + S +
Sbjct: 562 AGYGSTQTAREGSDLTAGYGSTGTAGSDSSIIAGYGSTQTASYHSSLTAGYGSTQTAREQ 621

Query: 737 ADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDTEPQ 796
+ + S S + +DS + S + +S + S + SD + +
Sbjct: 622 SVLTTGYGSTSTAGADSSLIAGYGSTQTAGYNSILTAGYGSTQTAQEGSDLTAGYGSTST 681

Query: 797 ANNDTHTAA 805
A D+ A
Sbjct: 682 AGADSSLIA 690


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS05250  RTXTOXIND  29     0.046    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.046
Identities = 15/102 (14%), Positives = 43/102 (42%), Gaps = 7/102 (6%)

Query: 147 SSVRAADAAVAQQQAMVMLNIDQVAHDTAGAVVQLQGYQKLVKIAQAQVDSLKHIGDLIR 206
S ++ + Q+ LN+D+ + + ++ Y+ L ++ ++++D
Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF-------S 241

Query: 207 QRNDAGATSLSDVVQTDTRVEGAQATLIQYQAALERWKATLA 248
A + V++ + + A L Y++ LE+ ++ +
Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEIL 283


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS05260  RTXTOXIND  263    6e-86    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 263 bits (674), Expect = 6e-86
Identities = 105/462 (22%), Positives = 187/462 (40%), Gaps = 64/462 (13%)

Query: 6 AAIFPLVKELDPVAAMADNER---DEAELV------KSRRLIALLALLLVVTGVWAWFAT 56
+ + + K+LD D EL+ + R + + LV+ + +
Sbjct: 20 SETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79

Query: 57 LDEVSTGTGKVIPSSREQVLQTLDGGILTELNVREGSRVAAGQVVARLDPTRSESNVGES 116
++ V+T GK+ S R + ++ ++ I+ E+ V+EG V G V+ +L +E++ ++
Sbjct: 80 VEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139

Query: 117 QAKYRASLAASIRLTA-----EVNNQPLIFPPSLKAWPGLLAEE-TRLYHSRREQLTKSM 170
Q+ + R E+N P + P + + EE RL +EQ +
Sbjct: 140 QSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199

Query: 171 RQLDQ------------------------SLSLVNSELAINEKLAKTGAASNVEVL---- 202
Q Q + S L L A + VL
Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 203 -----------------RLRQQAADIELKKIDLNTRYYVDAREQLSKANADVASLAEVIK 245
++ + + + + + + ++L + ++ L +
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELA 319

Query: 246 GRADSVARLTVRSPVQGIVKNIKVNTIGGVIAPNGELMDIVPIDGRLLIEARISPRDIAF 305
+ +R+PV V+ +KV+T GGV+ LM IVP D L + A + +DI F
Sbjct: 320 KNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGF 379

Query: 306 IHPDQKALVKITAYDYAIYGALNGVVETISPDTIQDEAKPDVYYYRVFIRTDHNYLENKR 365
I+ Q A++K+ A+ Y YG L G V+ I+ D I+D+ V+ V I + N L
Sbjct: 380 INVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFN--VIISIEENCLS-TG 436

Query: 366 GKRFLIGPGMIATVDIKTGEKTVMDYLVKPF-NRAKEALRER 406
K + GM T +IKTG ++V+ YL+ P E+LRER
Sbjct: 437 NKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478


93  D364_RS07400  D364_RS07435  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS07400  -1      11       0.574469   phenylacetic acid degradation operon negative
D364_RS07405  -1      11       1.234290   phenylacetic acid degradation protein PaaY
D364_RS07410  -1      12       1.399959   GFA family protein
D364_RS07415  -1      11       1.755964   FMN-dependent NADH-azoreductase
D364_RS07420  -1      12       1.924602   ATP-dependent RNA helicase HrpA
D364_RS07425  -1      14       1.210020   O-methyltransferase
D364_RS07430  -2      13       0.602388   efflux transporter outer membrane subunit
D364_RS07435  -2      12       -0.142346  HlyD family secretion protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS07400  PRTACTNFAMLY  28     0.049    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 28.1 bits (62), Expect = 0.049
Identities = 16/54 (29%), Positives = 23/54 (42%)

Query: 2 SKLDAFIQQAVTAMPISGTSLIASLYGDALLQRGGEVWLGSVAALLEGLGFGER 55
+L A AV + S + +AL +R GE+ L A G GF +R
Sbjct: 604 RELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQR 657


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS07415  LIPPROTEIN48  28     0.019    Mycoplasma P48 major surface lipoprotein signature.
		>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature.

Length = 428

Score = 28.4 bits (63), Expect = 0.019
Identities = 34/139 (24%), Positives = 52/139 (37%), Gaps = 38/139 (27%)

Query: 70 QQEALALSDELIAELKGNDVIVIAAPMYNFNIPTQLKNYFDL---VARAGVTFRY----- 121
QQ D EL+ N + +I +F+I T+ K ++ L + + T Y
Sbjct: 129 QQSIKQYIDAHREELERNQIKIIGI---DFDIETEYKWFYSLQFNIKESAFTTGYAIASW 185

Query: 122 -TEKGPEGLVTGKRAVVVTSRGGIHKDTPTDLVTPYLSTFLGFIGITDVNFVFAEGIAY- 179
+E+ KR VV S GG F G+T N FA+GI Y
Sbjct: 186 LSEQDES-----KR--VVASFGGGA-----------------FPGVTTFNEGFAKGILYY 221

Query: 180 -GPEVAAKAQSDAKAAIDS 197
++K + +DS
Sbjct: 222 NQKHKSSKIYHTSPVKLDS 240


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS07430  RTXTOXIND  34     8e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.4 bits (79), Expect = 8e-04
Identities = 28/235 (11%), Positives = 67/235 (28%), Gaps = 53/235 (22%)

Query: 69 SDVLIARERVNEYQARAYAADSSLFPSLDASLTGTRARTQSAATGLPIHSTLYKGGLTAS 128
S +L AR YQ + + + + P L L + + ++L K +
Sbjct: 141 SSLLQARLEQTRYQILSRSIELNKLPELK--LPDEPYFQNVSEEEVLRLTSLIKEQFST- 197

Query: 129 YDVDIWGANRSAANAAGASLEAQKAAAAAANLSVASSVAVGYVTLLSLDEQLRVTQQTLT 188
W + A++ A + + RV + L
Sbjct: 198 -----WQNQKYQKELNLDKKRAERLTVLA--------------RINRYENLSRVEKSRLD 238

Query: 189 SREDAWRLAKRQFETGYTSRLELM-------QADSELRSTRAQIPPLQHQIAQQENALSV 241
L + ++ ++ +A +ELR ++Q+ ++ +I + +
Sbjct: 239 DFS---SLLHK----QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291

Query: 242 LLGDNPGAVKRGEFAQLTPLRLPSQLPSTLLNRRPDIAQAERQLVAADATLASSQ 296
+ +++ L +I +L + +S
Sbjct: 292 VTQL-----------------FKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS07435  RTXTOXIND  101    5e-26    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 101 bits (254), Expect = 5e-26
Identities = 63/409 (15%), Positives = 125/409 (30%), Gaps = 83/409 (20%)

Query: 21 SIFTAAAIGLVGVLVILYAWQLPPFTRHSQFTDNAYVRGQTTFISPQVNGYITAVNVKDF 80
A I V+ + + L + G++ I P N + + VK+
Sbjct: 57 PRLVAYFIMGFLVIAFILSV-LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115

Query: 81 AIVQPGEVLFQIDDR-----IYKQRVHQAQATL------AMKEAALRNNL---------- 119
V+ G+VL ++ K + QA L + + N L
Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175

Query: 120 ------------------------QQRKSAEATIAKNEAALQNARAQNLKIQADLKRIQQ 155
Q+ E + K A A+ + + + +
Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235

Query: 156 -------LTADGSLS---IRERDSARASA----AQGAADIEQAKAALEMSRQD------- 194
L +++ + E+++ A + +EQ ++ + ++++
Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295

Query: 195 -RESTIVNRDSLEADVASAKAALELAQIDLQNTQIIAPTGGQLGQISVR-LGAYVSAGTH 252
+ + ++ L + Q + I AP ++ Q+ V G V+
Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355

Query: 253 LTSLVPPQH--WVIANLKETQLAEVRVGQPVTFTVDALNGETFH---GKVQSISPATGVE 307
L +VP V A ++ + + VGQ V+A + GKV++I+
Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412

Query: 308 FSAISPDNATGNFVKIAQRIPVRITVNDGQNNSERLRPGMSVQVTIDTR 356
D G + I +N L GM+V I T
Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLSTGNKN--IPLSSGMAVTAEIKTG 455


94  D364_RS08825  D364_RS08855  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
D364_RS08825  1       13       0.708919  helix-turn-helix domain-containing protein
D364_RS08830  2       13       1.692326  glycerate kinase
D364_RS08835  1       14       1.462704  MFS transporter
D364_RS08840  1       15       1.721607  SMP-30/gluconolactonase/LRE family protein
D364_RS08845  1       14       1.413382  MFS transporter
D364_RS08850  0       12       0.763579  thiolase family protein
D364_RS08855  -1      12       0.265302  3-oxoacyl-ACP reductase FabG
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS08825  DNABINDNGFIS  29     0.012    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 28.8 bits (64), Expect = 0.012
Identities = 13/50 (26%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

Query: 296 INNVRQLLEHDSGEVLLDTLSSFIANNAEPGKTSLLLGIHRNTLTYRLQQ 345
+N++ +L+ + + LLD + + N + +L++GI+R TL +L++
Sbjct: 47 VNDLYELVLAEVEQPLLDMVMQYTRGNQT--RAALMMGINRGTLRKKLKK 94


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS08835  TCRTETB  41     1e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 40.6 bits (95), Expect = 1e-05
Identities = 71/399 (17%), Positives = 134/399 (33%), Gaps = 51/399 (12%)

Query: 53 EMILRLG-PVISKEFSLSPEQWGNIVALIMVALAVLDIPGSIWSDRYGSGWKRARFQVPL 111
EM+L + P I+ +F+ P + M+ ++ SD+ G + L
Sbjct: 30 EMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLG-------IKRLL 82

Query: 112 VLGYTALSFISGIKAISHGLTAFVLL-RVGVNLGAGWGEPVGVSNTAEWWPKEKRGFALG 170
+ G F S I + H + +++ R GA + + A + PKE RG A G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 171 VHHTGYPIGALLSGVVASLVLATFGEGSWRYCFLL--ALLVAIPLMIFWAKYSTAARINT 228
+ + +G + + ++ W Y L+ ++ +P ++ K RI
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIH---WSYLLLIPMITIITVPFLMKLLK--KEVRIK- 196

Query: 229 LYQHIDSQG----------LTRPATQES---------------SHVAKGEGMKTFLRTLR 263
H D +G T S H+ K +
Sbjct: 197 --GHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGK 254

Query: 264 NRNISLTAGNTLLTQIVYMGINVVLPPYLYHVSGLSLAASAGLSIIF--TLTGTLGQVIW 321
N + + G ++P + V LS A G IIF T++ + I
Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAE-IGSVIIFPGTMSVIIFGYIG 313

Query: 322 PWLSDSFGRKRTLIVCGLWMSIG---IALFYFATNMPRLIAIQLFFGLVANAVWPIYYAM 378
L D G L + ++S+ + T+ I I G ++ + +
Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS-FTKTVISTI 372

Query: 379 ASDSAEERATSTANGIITTAMFIGGGISPLLMGWLIQFG 417
S S +++ ++ F+ G ++G L+
Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS08845  TCRTETA  36     2e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.3 bits (84), Expect = 2e-04
Identities = 31/141 (21%), Positives = 57/141 (40%), Gaps = 18/141 (12%)

Query: 51 SVDIGLSATAFGLGAGLFFLTYAVLEIPSNLFLTRIGARRWIARIMITWGILSCG----- 105
+ IG+S AFG+ L +T A R R + G+++ G
Sbjct: 245 ATTIGISLAAFGILHSLA-----------QAMITGPVAARLGERRALMLGMIADGTGYIL 293

Query: 106 MAFVTGPTSFYVMRLLLGAAEAGLYPGIIYYLTLWFGREERAKATGLFLLGVCLANIIGA 165
+AF T + + +LL + G+ P + L+ E + + G L +I+G
Sbjct: 294 LAFATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGP 352

Query: 166 PLGGLLLSLDGMSGWHGWQWM 186
L + + ++ W+GW W+
Sbjct: 353 LLFTAIYAAS-ITTWNGWAWI 372


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS08855  DHBDHDRGNASE  126    2e-37    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 126 bits (317), Expect = 2e-37
Identities = 75/254 (29%), Positives = 122/254 (48%), Gaps = 12/254 (4%)

Query: 3 LASKTAIVTGAARGIGFGIAKVLAREGARVIIADRDAHG-EAAAASLRESGAQALFFSCN 61
+ K A +TGAA+GIG +A+ LA +GA + D + E +SL+ A F +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 62 IAEKTQVEALFSQAEEAFGPVDILVNNAGINRDAMLHKLTEADWDTVIDVNLKGTFLCMQ 121
+ + ++ + ++ E GP+DILVN AG+ R ++H L++ +W+ VN G F +
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 122 QAAIRMRERGAGRIINIAS-ASWLGNVGQTNYSASKAGVVGMTKTACRELAKKGVTVNAI 180
+ M +R +G I+ + S + + Y++SKA V TK ELA+ + N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 CPGFIDTDMTRG--VPENVWQIMIS--------KIPAGYAGEAKDVGECVAFLASDGARY 230
PG +TDM EN + +I IP + D+ + V FL S A +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 231 INGEVINVGGGMVL 244
I + V GG L
Sbjct: 246 ITMHNLCVDGGATL 259


95  D364_RS08985  D364_RS09005  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS08985  -2      23       -4.392931  sigma-54 dependent transcriptional regulator
D364_RS08990  -2      21       -3.329958  two-component system sensor histidine kinase
D364_RS08995  -2      21       -3.158961  phosphoglycerate transport regulator PgtC
D364_RS09000  -2      20       -3.536735  phosphoglycerate transporter PgtP
D364_RS09005  0       20       -3.470355  peptidoglycan-binding protein LysM
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS08990  HTHFIS  246    3e-79    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 246 bits (630), Expect = 3e-79
Identities = 114/474 (24%), Positives = 195/474 (41%), Gaps = 73/474 (15%)

Query: 7 SILLIDDDADVLDAYTQLLEQAGYHVSACNNPFDAREQVPKDWPGIVLSDVCMPGCSGID 66
+IL+ DDDA + Q L +AGY V +N + +V++DV MP + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 67 LMTLFHQDDDLLPILLITGHGDVPMAVEAVKKGAWDFLQKPIDPGKLLTLVDAALRQRQS 126
L+ + LP+L+++ A++A +KGA+D+L KP D +L+ ++ AL + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 127 VIARRQYCQQKLQVELIGRSQWTVRYRQRLQQLAETDIAVWLYGEPGTGRMTGARYLHQL 186
++ + Q L+GRS + L +L +TD+ + + GE GTG+ AR LH
Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 187 GRHAEGPFIA--CELTPAN----------------AHTLNE-LIAQAQGGTLVLSHPEHL 227
G+ GPF+A P + A T + QA+GGTL L +
Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243

Query: 228 THEQQHQLVQ-LQSHEKRP----------FRLISIGSASLVELAASSQIVAELYYCFAMT 276
+ Q +L++ LQ E R+++ + L + +LYY +
Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303

Query: 277 QIGCQPLSKRPDDIEPLFHHYLQKTCQRLNHPVPEVDAGLLKGMMRRVWPNNVRELANAA 336
+ PL R +DI L H++Q+ + V D L+ M WP NVREL N
Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362

Query: 337 ELFAV--------------------------------GVLPLAETVNPLMH--------- 355
G L +++ V M
Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422

Query: 356 IGEPTPLDQRVEDVERQIITEALNIHQGRINEVAEYLLIPRKKLYLRMKKYGLN 409
+ D+ + ++E +I AL +G + A+ L + R L ++++ G++
Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS09000  FLGMOTORFLIM  31     0.010    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 30.6 bits (69), Expect = 0.010
Identities = 5/35 (14%), Positives = 15/35 (42%), Gaps = 4/35 (11%)

Query: 312 QRLVQRMFDTAISFRLAQLKDAWRALHSAEVRLKR 346
+++ + LA ++++W + RL +
Sbjct: 150 NSVMEGVIVRI----LANVRESWTQVIDLRPRLGQ 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS09005  TCRTETA  31     0.009    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.009
Identities = 65/387 (16%), Positives = 126/387 (32%), Gaps = 39/387 (10%)

Query: 52 TPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 108
P L L S G+L + + V+ +L+D+ + + L A+
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 109 VGLGFSTAFWVFAALVVLNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 168
+ + WV ++ G+ G IA+ ER R F +S G G+
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144

Query: 169 VA-PIVGAAFAILGTEHWQSASYIVPACVAVVFAISVLVLGKGSPREEGLPSLAEMMPEE 227
VA P++G P A + G ++PE
Sbjct: 145 VAGPVLGGLMGGFSPH--------APFFAAAALNGLNFLTG------------CFLLPE- 183

Query: 228 KVVLKTKHGQKAPENMSAFQIFCTYVLRNKNAWYVSFVDVFVYMVRFGMISWLPIYLLTV 287
+ G++ P A ++ + + VF M G + +
Sbjct: 184 -----SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 288 KHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKLFKGRRMPLAIICMTLIFICLIGYW 344
F + ++ + ++ ++ G ++ +L + R + L +I +I L
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 345 KSESLLMVTVFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGASLGTS 404
+ + V A G + Q + S Q E GS L ++ I G L T+
Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS-LTSIVGPLLFTA 357

Query: 405 LFGVMVDKMGWHGGFYLLMGGIVCCIL 431
++ + W+G ++ + L
Sbjct: 358 IYAASITT--WNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS09010  INTIMIN  32     5e-04    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 32.3 bits (73), Expect = 5e-04
Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 7/70 (10%)

Query: 84 SDGVKVTQSGAESR-FYTVKSGDTLSAISKAMYGSANDYQRIFEANKPMLTHPD---KIY 139
SD +T + ++R FYT+K+G+T++ +SK+ + I+ NK + + K
Sbjct: 49 SDSKLLTHNSYQNRLFYTLKTGETVADLSKS---QDINLSTIWSLNKHLYSSESEMMKAE 105

Query: 140 PGQVLIIPAK 149
PGQ +I+P K
Sbjct: 106 PGQQIILPLK 115


96  D364_RS10750  D364_RS10785  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS10750  -1      22       3.989275    GntP family permease
D364_RS10755  -2      17       1.642346    3-hydroxybutyrate dehydrogenase
D364_RS10760  -1      18       -3.376349   LysR family transcriptional regulator
D364_RS10765  -1      24       -5.083591   acetolactate decarboxylase
D364_RS10770  0       30       -7.013727   acetolactate synthase AlsS
D364_RS10775  2       45       -10.681625  (S)-acetoin forming diacetyl reductase
D364_RS10780  3       52       -12.907890  GNAT family N-acetyltransferase
D364_RS10785  3       53       -13.762344  multidrug efflux RND transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10750  ABC2TRNSPORT  30     0.027    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 29.5 bits (66), Expect = 0.027
Identities = 32/129 (24%), Positives = 47/129 (36%), Gaps = 15/129 (11%)

Query: 6 ALAALALLMLAAYRGY----SVILFAPIAALGAVLLTDPGAVGPA----------FTGLF 51
ALA + ++AA GY S++ P+ AL + G V A + L
Sbjct: 126 ALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLV 185

Query: 52 MEKMVGFVKLYFPVFLLGAVFGKLIELSGFSRSIVAAAIRILGRRHAIPVIVLVCALLTY 111
+ ++ FPV L VF S SI +LG + V V AL Y
Sbjct: 186 ITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP-VVDVCQHVGALCIY 244

Query: 112 GGVSLFVVA 120
+ F+
Sbjct: 245 IVIPFFLST 253


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10755  DHBDHDRGNASE  115    4e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (288), Expect = 4e-33
Identities = 66/255 (25%), Positives = 105/255 (41%), Gaps = 9/255 (3%)

Query: 3 LHGKTALVTGSTSGIGLGIAKVLAQAGAQLVLNGFGDSSHARAE--VAALGKIPGYHDAD 60
+ GK A +TG+ GIG +A+ LA GA + + + + A + AD
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 61 LRDVGQIEAMMRYAESTFGGVDIVINNAGIQHVAPVEQFPVDKWNDILAINLSSVFHTTR 120
+RD I+ + E G +DI++N AG+ + ++W ++N + VF+ +R
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 121 LALPGMRQRNWGRIINIASVHGLVASKEKSAYVAAKHAVVGLTKTVALETARSGITCNAI 180
M R G I+ + S V +AY ++K A V TK + LE A I CN +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 181 CPGWVLTPLVQQQIDKRIAEGVDPEQASAQLLAEKQ---PSGEFVTPQQLGEMALFLCSD 237
PG T + EQ L + P + P + + LFL S
Sbjct: 186 SPGSTETDMQWSLWADENGA----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241

Query: 238 AAAQVRGAAWNMDGG 252
A + +DGG
Sbjct: 242 QAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10775  DHBDHDRGNASE  124    1e-36    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 124 bits (312), Expect = 1e-36
Identities = 74/254 (29%), Positives = 114/254 (44%), Gaps = 10/254 (3%)

Query: 3 KVALVTGAGQGIGKAIALRLVKDGFAVAIADYNDATAKAVASEINQAGGRAMAVKVDVSD 62
K+A +TGA QGIG+A+A L G +A DYN + V S + A A DV D
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 63 RDQVFAAVEQARKTLGGFDVIVNNAGVAPSTPIESITPEIVDKVYNINVKGVIWGIQAAV 122
+ + + +G D++VN AGV I S++ E + +++N GV ++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 123 EAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVRGLTQTAARDLAPLGITVNGYCP 182
+ G I+ S V +A Y+SSK A T+ +LA I N P
Sbjct: 129 KYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 183 GIVKTPM----WAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPD 238
G +T M WA+ + G F I L +L++P D+A V +L S
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGS-----LETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 239 SDYMTGQSLLIDGG 252
+ ++T +L +DGG
Sbjct: 243 AGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS10785  ACRIFLAVINRP  1060   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1060 bits (2744), Expect = 0.0
Identities = 502/1031 (48%), Positives = 690/1031 (66%), Gaps = 7/1031 (0%)

Query: 1 MPHFFIERPIFAWVIALFIVLTGLLSIPRLPVAQYPEVAPPGIIISVSYPGASPEVMNTS 60
M +FFI RPIFAWV+A+ +++ G L+I +LPVAQYP +APP + +S +YPGA + + +
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VVSLIEREISSVDNLLYFESSSDTTGMASITVTFKPGTDIKLAQMDLQNQIKIVESRLPQ 120
V +IE+ ++ +DNL+Y S+SD+ G +IT+TF+ GTD +AQ+ +QN++++ LPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 SVRQNGINVEAANSGFLMMVGLKSPSGAYQEADLSDYFARNVTDELRRVPGVGKVQLFGG 180
V+Q GI+VE ++S +LM+ G S + + D+SDY A NV D L R+ GVG VQLFG
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 EKALRIWLDPMKLHSYGLSVTDVLSAISQQNVIVSPGRTGDEPATSSQEVTYPITVKGQL 240
+ A+RIWLD L+ Y L+ DV++ + QN ++ G+ G PA Q++ I + +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 SSVEEFRNITIKSQVSAARVTLADVARVESGLQSYAFGIRENGVPATAAAIQLSPGANAI 300
+ EEF +T++ + V L DVARVE G ++Y R NG PA I+L+ GANA+
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 STASGIRARLTELSGVLPEGMTFTVPFDTAPFVKLSILKVVETFVEAMVLVFFVMLLFLH 360
TA I+A+L EL P+GM P+DT PFV+LSI +VV+T EA++LVF VM LFL
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 KIRCTLIPAIVAPVALLGTFTVMLLSGYSINILTMFGMILAIGIIVDDAIVVVENVERLM 420
+R TLIP I PV LLGTF ++ GYSIN LTMFGM+LAIG++VDDAIVVVENVER+M
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 EDKKMSPQDATREAMREITPAIIGITLVLTAVFIPMAFASGSVGIIYRQFSISMAISILL 480
+ K+ P++AT ++M +I A++GI +VL+AVFIPMAF GS G IYRQFSI++ ++ L
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SAFLALTLTPALCATLLKP-HGIHQGKSSVFSAWFNAHFHRLTSFYATGLGFVLKRTGRM 539
S +AL LTPALCATLLKP H F WFN F + Y +G +L TGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 540 MMIYAALCLALFAGLSTLPSSFLPDEDQGYFMSSIQLPSDATMQRTLKVVDTFEEEI--A 597
++IYA + + LPSSFLP+EDQG F++ IQLP+ AT +RT KV+D +
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 598 HRQAVESNIMILGFGFSGSGQNSAMAFTTLKDWRQRKGT--TAQEEADHIRSQMANVPDA 655
+ VES + GF FSG QN+ MAF +LK W +R G +A+ + ++ + D
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 656 VTMSLLPPAISDMGTSSGFTYYLQDRGGKGYQALKKAADELIVQANHNP-HLADVYIDGL 714
+ PAI ++GT++GF + L D+ G G+ AL +A ++L+ A +P L V +GL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 715 GEGTSLSLHVDREKAEAMGVSFDEINQTISVAAGSNYVNDYTNNGRVQQVIVQADAPYRM 774
+ L VD+EKA+A+GVS +INQTIS A G YVND+ + GRV+++ VQADA +RM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 775 QPEQLLALSVKNRLGQMLPLSTFVTLSWNVAPQQLIRYQGYPAIRITGSSAQGKSSGTAM 834
PE + L V++ G+M+P S F T W +L RY G P++ I G +A G SSG AM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 835 AAMDNLAKHLPPGFAGEWAGSSLQEKESASQLPGLIVLSVLVVFMVLAALYESWSIPFAV 894
A M+NLA LP G +W G S QE+ S +Q P L+ +S +VVF+ LAALYESWSIP +V
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 895 MLVVPLGLLGAVLAVSVTNMTNDVFFKVGLITLIGLSAKNAILIIEFARQLM-KEGKSLI 953
MLVVPLG++G +LA ++ N NDV+F VGL+T IGLSAKNAILI+EFA+ LM KEGK ++
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 954 DATLTAAKLRLRPILMTSLAFTLGVVPLMLASGASDSTQHAIGTGVFGGMISGTLLAIFF 1013
+ATL A ++RLRPILMTSLAF LGV+PL +++GA Q+A+G GV GGM+S TLLAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1014 VPVFFVTITRF 1024
VPVFFV I R
Sbjct: 1021 VPVFFVVIRRC 1031


97  D364_RS11195  D364_RS11235  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS11195  0       16       5.335771   efflux RND transporter permease subunit
D364_RS11200  0       15       4.555362   efflux RND transporter periplasmic adaptor
D364_RS11205  0       15       5.096837   TetR/AcrR family transcriptional regulator
D364_RS11210  0       14       3.274113   NAD(P)H-binding protein
D364_RS11215  0       14       4.118795   LysR family transcriptional regulator
D364_RS11220  0       14       3.028651   ABC transporter substrate-binding protein
D364_RS11225  0       15       0.384105   iron ABC transporter permease
D364_RS11230  -1      13       0.842128   ABC transporter ATP-binding protein
D364_RS11235  1       15       -0.818983  MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11200  ACRIFLAVINRP  440    e-140    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 440 bits (1132), Expect = e-140
Identities = 231/1055 (21%), Positives = 423/1055 (40%), Gaps = 71/1055 (6%)

Query: 8 LSALAVRERSVTLFLIILISVAGLVAFFGLGRAEDPPFTVKQMTVITVWPGATAQEMQDQ 67
++ +R L I++ +AG +A L A+ P ++V +PGA AQ +QD
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 68 VAEPLEKRLQELKWYDRTETYT-RPGMALITLSLQDQTPP----SEVPEQFYQARKKLGD 122
V + +E+ + + + + G ITL+ Q T P +V + A L
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLL-- 118

Query: 123 EAKNLPAGVSGPMMNDEFADVTFALFAL--KARGEQPRQLVRD--AEALRQQLLHVSGVK 178
P V ++ E + ++ + A + + D A ++ L ++GV
Sbjct: 119 -----PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVG 173

Query: 179 KVNILGEQ-AERIYLSFSHDRLATLGLSPEAIFAALNSQNVLTAAGAI---ETRGGQIF- 233
V + G Q A RI+L D L L+P + L QN AAG + GQ
Sbjct: 174 DVQLFGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLN 231

Query: 234 --IRLDGAFDRLQQIRDTPIIAG--GRTLKLADVATVERGYEDPATFLIRHQGEPALLLG 289
I F ++ + G ++L DVA VE G E+ R G+PA LG
Sbjct: 232 ASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA-RINGKPAAGLG 290

Query: 290 VVMREGWNGLALGKALDAETASINQSLPLGMSLTKVTDQSVNISAAVDEFMIKFFVA-LL 348
+ + G N L KA+ A+ A + P GM + D + + ++ E + F A +L
Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML 350

Query: 349 VVMTVCFVSMGWRVGVVVAAAVPLTLAVVFVVMEATGKNFDRITLGSLILALGLLVDDAI 408
V + + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410

Query: 409 IAIEMMV-VKMEEGYDRLKASAYAWSHTAAPMLAGTLVTAVGFMPNGFAQSTAGEYASNV 467
+ +E + V ME+ +A+ + S ++ +V + F+P F + G
Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470

Query: 468 FWIVGIALIASWIVAVIFTPWLGVHLLPDRKPAAAGHAALYDT----------PRYQRFR 517
+ A+ S +VA+I TP L LL KP +A H +
Sbjct: 471 SITIVSAMALSVLVALILTPALCATLL---KPVSAEHHENKGGFFGWFNTTFDHSVNHYT 527

Query: 518 RLLTRVIAHKWRVAAGVVALLIVAILGMSVVKKQFFPTSDRPEVLVEVQLPYGSSISQTS 577
+ +++ R ++ ++ + F P D+ L +QLP G++ +T
Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587

Query: 578 AAAAKIEHWLQRQPEAKIVTSYIGQGAPRFYLAMAPELPDP--SFAKLVVLTDGQGARE- 634
++ + + +A + + + G + + + + +F L + G
Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENS 642

Query: 635 --ALKRRLREAV-----VNGLAPEARVRVTQLVFGPYSPYPVAWRVMGPDPHALLDIAER 687
A+ R + + + V + + +G D AL +
Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHD--ALTQARNQ 700

Query: 688 VKSVLQASPL-MRTVNTDWGSRVPVMHFSLNQDRLQASGLSSQSVAQQLQFLLSGIPITT 746
+ + P + +V + ++Q++ QA G+S + Q + L G +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 747 VREDIRAVQVIGRAAGDIRLDPAKIADFTLVGSGGQRVPLSQIGDVSIRMEDPLLRRRDR 806
+ R ++ +A R+ P + + + G+ VP S P L R +
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 807 TPTITVRGDVAENLQPPDVSIALMKPLQPIIDSLPPGYRIETAGSIEESGKATRAMVPLF 866
P++ ++G+ A P S M ++ + LP G + G + + L
Sbjct: 821 LPSMEIQGEAA----PGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALV 876

Query: 867 PIMIALTLLIIILQVRSLSAMVMVFLTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGI 926
I + L + S S V V L PLG++GV+ LFNQ + +VGL+ G+
Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936

Query: 927 LMRNTLILIGQIHHNQQA-GLDPFHAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT 985
+N ++++ + G A + A R RP+L+T+LA IL +PL S G+
Sbjct: 937 SAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996

Query: 986 -----LAYTLIGGTLGGTIMTLIFLPAMYAIWFRI 1015
+ ++GG + T++ + F+P + + R
Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031



Score = 76.8 bits (189), Expect = 3e-16
Identities = 57/323 (17%), Positives = 122/323 (37%), Gaps = 20/323 (6%)

Query: 712 MHFSLNQDRLQASGLSSQSVAQQLQF----LLSGIPITTVREDIRAVQVIGRAAGDIRLD 767
M L+ D L L+ V QL+ + +G T + + A + +
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242

Query: 768 PAKIADFTLVGSG-GQRVPLSQIGDVSIRMED-PLLRRRDRTPTITVRGDVAENLQPPDV 825
P + TL + G V L + V + E+ ++ R + P + +A D
Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302

Query: 826 SIALMKPLQPIIDSLPPGYRIE----TAGSIEESGKATRAMVPLFPIMIALTLLIIILQV 881
+ A+ L + P G ++ T ++ S +V I L L++ L +
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLS---IHEVVKTLFEAIMLVFLVMYLFL 359

Query: 882 RSLSAMVMVFLTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGILMRNTLILIGQIH-H 940
+++ A ++ + P+ L+G L F + G++ G+L+ + ++++ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 941 NQQAGLDPFHAVVEATVQRARPVLLTALAAILAFIPL-----THSVFWGTLAYTLIGGTL 995
+ L P A ++ Q ++ A+ FIP+ + + + T++
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 996 GGTIMTLIFLPAMYAIWFRIRPE 1018
++ LI PA+ A +
Sbjct: 480 LSVLVALILTPALCATLLKPVSA 502


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11205  RTXTOXIND     42     3e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 41.7 bits (98), Expect = 3e-06
Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 9/92 (9%)

Query: 70 GKVLERRVETGQSVKRGQLLLRLDPADLALQAQSQQRAVDAARVRAKKAANDLARYRGLV 129
V E V+ G+SV++G +LL+L Q ++ AR L + R +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR---------LEQTRYQI 155

Query: 130 ASGAISAAEFDQINAAAEAARADLRAAQAQAN 161
S +I + ++ E ++ +
Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187



Score = 31.3 bits (71), Expect = 0.005
Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 4/84 (4%)

Query: 178 GVVVETLAEPGQVVSAGQVVIRLARAGQREARVQLPETLRPAVGSEALATRYGSESQPV- 236
+V E + + G+ V G V+++L G ++ +L A TRY S+ +
Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA---RLEQTRYQILSRSIE 161

Query: 237 TATLRLLSDAADATTRTFEARYVL 260
L L + + VL
Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVL 185



Score = 28.6 bits (64), Expect = 0.044
Identities = 12/128 (9%), Positives = 37/128 (28%), Gaps = 15/128 (11%)

Query: 103 SQQRAVDAARVRAKKAANDLARYRGLVAS--GAISAAEFDQINAAAEA----------AR 150
++ ++ + A N+L Y+ + I +A+ +
Sbjct: 250 AKHAVLEQENKYVE-AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTT 308

Query: 151 ADLRAAQAQANVAQNATGYAGLLADADGVVVE-TLAEPGQVVSAGQVVIRLARAGQR-EA 208
++ + + + + A V + + G VV+ + ++ + E
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 209 RVQLPETL 216
+
Sbjct: 369 TALVQNKD 376


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11210  HTHTETR       59     3e-13    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 58.9 bits (142), Expect = 3e-13
Identities = 31/183 (16%), Positives = 52/183 (28%), Gaps = 10/183 (5%)

Query: 4 FSRYGYEKTTVTDLAKAIGFSKAYIYKFFDSKQAIGEAICASRLEKIMVAVSEAIADAPS 63
FS+ G T++ ++AKA G ++ IY F K + I I E A P
Sbjct: 24 FSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPG 83

Query: 64 ASEK-----LRRLFR-ALTEAGSELFFE--DRKLYDIAAVAARDKWPSTEQYAGHLQQLI 115
L + +TE L E K + +A + I
Sbjct: 84 DPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA--QRNLCLESYDRI 141

Query: 116 GQILVEGRQAGEFERKTPLDEATLAVYMVMCPFINPVQLQYNLDTAPTAAVLLASLILRS 175
Q L +A A + + + + A +++L
Sbjct: 142 EQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEM 201

Query: 176 LSP 178

Sbjct: 202 YLL 204


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11215  NUCEPIMERASE  37     6e-05    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 37.1 bits (86), Expect = 6e-05
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 28/167 (16%)

Query: 6 KVLILGASGGIGGEVARRLVADNWQVRA-----------LKRGAQIRDPEDGIQWIAGDA 54
K L+ GA+G IG V++RL+ QV LK+ + G Q+ D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 55 LDGGQVAA--AAAGCDVIVH-----AV-----NPPGYRHWRQQVLPMLRNTLQAAERQR- 101
D + A+ + + AV NP Y + L N L+ +
Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD--SNLTGFL-NILEGCRHNKI 118

Query: 102 ALVVLPGTVYNYGPDA-FPLIAEEAAQQPVTRKGAIRVAMELTLKDY 147
++ + YG + P +++ PV+ A + A EL Y
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY 165


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11235  PF05272       28     0.029    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.029
Identities = 10/20 (50%), Positives = 12/20 (60%)

Query: 33 LLGPNGCGKSSLLRVLAGLR 52
L G G GKS+L+ L GL
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11240  TCRTETA       66     7e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 65.6 bits (160), Expect = 7e-14
Identities = 77/373 (20%), Positives = 136/373 (36%), Gaps = 30/373 (8%)

Query: 1 MLLGSQFVFNIGFYAVVPFLALFLRDDMLLSGGLI---GLILGLRTFSQQGMFILGGTLA 57
++L + + +G ++P L LRD ++ S + G++L L Q + G L+
Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRD-LVHSNDVTAHYGILLALYALMQFACAPVLGALS 67

Query: 58 DRYGAKAIILAGCVVRVAGFLLLACGASLWPIILGACLTGVGGALFSPSIEALLARAGTH 117
DR+G + ++L + ++A LW + +G + G+ GA +
Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGA-------VAGAYI 120

Query: 118 SQANGKRSRAEWFALFAVCGELGAVIGPVAGGVLSGIGFRHIALAGAGIFLLALAVLFFC 177
+ RA F + C G V GPV GG++ G A A + L F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180

Query: 178 LPADGHTTTTRRRVPWWTPLRQPRFVAFILAYSSWLLSY------NQLYLALPV--EIQR 229
LP R PL R+ + ++ + + Q+ AL V R
Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240

Query: 230 SGGREQDLAPLFMLASLLIITLQLPLA-RFARRMGAVRILPVGFLLLSASFASVALFAAA 288
+ +L Q + A R+G R L +G + + +A
Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA--- 297

Query: 289 PPAEGWLRLMPAAGFVTLLTLGQMLLVPAAKDLIPLFAEESTLGAHYGALATAGGCAVLA 348
GW+ P + LL G + + PA + ++ +E G G+LA +
Sbjct: 298 --TRGWM-AFPI---MVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIV 350

Query: 349 GNLLLGHLLDLAL 361
G LL + ++
Sbjct: 351 GPLLFTAIYAASI 363


98  D364_RS11585  D364_RS11600  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS11585  -1      13       4.776052   NarK family nitrate/nitrite MFS transporter
D364_RS11590  0       14       5.242307   nitrate/nitrite two-component system sensor
D364_RS11595  0       16       5.836702   two-component system response regulator NarL
D364_RS11600  0       17       6.330240   YchO/YchP family invasin
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11585  TCRTETB       31     0.015    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 30.6 bits (69), Expect = 0.015
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 1/58 (1%)

Query: 128 TPFSIFVIISLLCGFAGANF-ASSMANISFFFPKAKQGGALGVNGGLGNMGVSVMQLV 184
+ FS+ ++ + G A F A M ++ + PK +G A G+ G + MG V +
Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11590  PF06580       48     5e-08    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 48.3 bits (115), Expect = 5e-08
Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 9/116 (7%)

Query: 476 FGFTVQLDYQLPPRFVPSHQAIHLLQIAREALSNALKHASAT-----EVTVTVSQRDNQV 530
F +Q + Q+ P + L+Q E N +KH A ++ + ++ + V
Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTV 292

Query: 531 RLVVADNGRGVPDHAERSNHYGLIIMRDRAQSLRG-DCQVRRRETGGTEVIVTFIP 585
L V + G + + S GL +R+R Q L G + Q++ E G + IP
Sbjct: 293 TLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11595  HTHFIS        74     8e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 74.5 bits (183), Expect = 8e-18
Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 2/118 (1%)

Query: 6 RATILLIDDHPMLRTGVKQLISMAPDIQVIGEASNGAQGIELAESLDPDLILLDLNMPGM 65
ATIL+ DD +RT + Q +S A V SN A + D DL++ D+ MP
Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 66 NGLETLDKLREKSLSGRVVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123
N + L ++++ V+V S N + A ++GA YL K + +L+ + +A
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11600  INTIMIN       230    3e-69    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 230 bits (588), Expect = 3e-69
Identities = 123/448 (27%), Positives = 209/448 (46%), Gaps = 22/448 (4%)

Query: 1 MPVSFRLLPTLTFLLLLPGVPVWALTASDTTRPAQAQDPLPDMGIAPQVDDDARHFAEVA 60
+P + LP LL P+ A + PD+ + DD A ++A
Sbjct: 117 LPFEYSALP------LLGSAPLVAAGGVAGHTNKLTKMS-PDVTKSNMTDDKALNYAAQQ 169

Query: 61 KKFGEASMSDNDLTAGEQAQLFAISKIGNEVSHQLESWLSPWGNANVDLLVDKEGKFTGS 120
+ + L G+ A+ A+ GN+ S QL++WL +G A V+L F GS
Sbjct: 170 AASLGSQLQSRSLN-GDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGN--NFDGS 226

Query: 121 KGSWFVPLQDNDRYLTWNQYSVTRREHDLVGNIGLGQRWRVGGWLLGYNSFYDKVLSESL 180
+ +P D+++ L + Q + N+G GQR+ + +LGYN F D+ S
Sbjct: 227 SLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDN 286

Query: 181 ARGSVGAEAWGEYLRLSANYYHPLGDW-QLRDNQTQEQRMAAGYDVTAQARLPFYQHINT 239
R +G E W +Y + S N Y + W + + + ++R A G+D+ LP Y +
Sbjct: 287 TRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGA 346

Query: 240 SVSVEQYFGDSVDLFHSGTGYHNPVAVSVGLNYTPVPLVTVTAKHKQGENGVSQNNVGLK 299
+ EQY+GD+V LF+S NP A +VG+NYTP+PLVT+ ++ G + ++
Sbjct: 347 KLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQ 406

Query: 300 LNYRFGVPLKQQLAADEVAISNSLRGSRFDSPERDNLPVVEYRQRKNLTVYLATP-PWDL 358
Y+F P QQ+ V +L GSR+D +R+N ++EY +K + L P +
Sbjct: 407 FRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEY--KKQDILSLNIPHDING 464

Query: 359 QSGETVQLKLQIHSLHGIKALHWQGDTQALSLTPPVDASSPDG---WSIIMPVWNSEPGA 415
T +++L + S +G+ + W D+ S + S + I+P + G
Sbjct: 465 TERSTQKIQLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAYV--QGG 521

Query: 416 ANRWRLSVVVEDKQGQRVSSNEIALALT 443
+N ++++ D+ G SSN + L +T
Sbjct: 522 SNVYKVTARAYDRNGN--SSNNVLLTIT 547


99  D364_RS11825  D364_RS11865  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS11825  -1      14       1.514876   DUF2058 domain-containing protein
D364_RS11835  -2      17       1.444605   RluA family pseudouridine synthase
D364_RS11840  -3      15       1.315239   cold shock domain-containing protein
D364_RS11845  -2      17       2.826334   LysR family transcriptional regulator
D364_RS11850  -2      17       2.312247   substrate-binding domain-containing protein
D364_RS11855  -1      14       3.176176   MFS transporter
D364_RS11860  -2      12       1.832253   LysR family transcriptional regulator
D364_RS11865  -1      14       1.523145   GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11840  TRNSINTIMINR  29     0.013    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 28.9 bits (64), Expect = 0.013
Identities = 14/56 (25%), Positives = 32/56 (57%), Gaps = 2/56 (3%)

Query: 11 LKAGLVSSKKMAKVQRTAKKSRVQAREAREAVEENKKAQLERDKQLSEQQKQAVLA 66
+ +G + + ++ + AK++ AR+ +AVE N +AQ + Q + +Q++ L+
Sbjct: 308 IPSGELKDDIVEQIAQQAKEAGEVARQ--QAVESNAQAQQRYEDQHARRQEELQLS 361


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11850  PF05272       29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.011
Identities = 14/76 (18%), Positives = 32/76 (42%), Gaps = 15/76 (19%)

Query: 18 FIKDENGENRYFHVIKVANPDLIKKDAAVTFEPTTNNKGLSAYAVKVIPESKYIYIAGER 77
++ D G R++ V+ +L+ L + ++ E+ ++Y+AGER
Sbjct: 698 YLFDITGNRRFWPVLVPGRANLV---------------WLQKFRGQLFAEALHLYLAGER 742

Query: 78 LKLTSIKSYVVYREEE 93
+ + +R E+
Sbjct: 743 YFPSPEDEEIYFRPEQ 758


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11865  TCRTETA       40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.2 bits (94), Expect = 1e-05
Identities = 74/369 (20%), Positives = 138/369 (37%), Gaps = 62/369 (16%)

Query: 67 LMRPIGAIVLGAYIDKVGRRKGLIVTLSIMATGTFLIVLIPSYQTIGLWAPLLVLIGRLL 126
LM+ A VLGA D+ GRR L+V+L+ A ++ P LW ++ IGR++
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIV 105

Query: 127 QGFSAGAELGGVSVYLAEIATSGRKGFYTSWQSGSQQVAIMVAAAMGFALNAVLEPSAIS 186
G + GA Y+A+I + + + S ++ +G +
Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF------- 157

Query: 187 DWGWRIPFLFGCLIVPFIFIL------------RR--KLEETQEFTARRHHLAMRQVFAT 232
PF + F+ RR + E + R M V A
Sbjct: 158 --SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAAL 215

Query: 233 LLANWQVVIAGMMMVAMTTTAFYLITVYAPTFGKKVLMLSASD-SLLVTLLVAISNFFWL 291
+ V M +V A ++I FG+ A+ + + + +
Sbjct: 216 M-----AVFFIMQLVGQVPAALWVI------FGEDRFHWDATTIGISLAAFGILHSLAQA 264

Query: 292 PVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFLMMLSVLLWLSFIYGMYNGA 351
+ G ++ R G R + + ++A T +LA A M +++ L+ G
Sbjct: 265 MITGPVAARLGERR-ALMLGMIADGT---GYILLAFATRGWMAFPIMVLLAS-----GGI 315

Query: 352 MIPALTEIMPAEV------RVAGFSLAYSLATAVFGGFTPVISTALIEYTGDKASPGYWM 405
+PAL ++ +V ++ G A + T++ G P++ TA+ + + W+
Sbjct: 316 GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG---PLLFTAIYAASITTWNGWAWI 372

Query: 406 SFAAICGLL 414
+ AA+ L
Sbjct: 373 AGAALYLLC 381



Score = 36.3 bits (84), Expect = 2e-04
Identities = 39/157 (24%), Positives = 62/157 (39%), Gaps = 20/157 (12%)

Query: 273 ASDSLLVTLLVAISNFFWLPVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFL 332
+ ++ L A+ F PV GALSDRFGRR VL L++LA A ++A AP
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVL----LVSLAGAAVDYAIMATAPFLW 97

Query: 333 MMLSVLLWLSFIYGMYNGAMIPALT----EIMPAEVRVAGFSLAYSLATAVFGGFT--PV 386
+ L++ I GA +I + R F ++ G PV
Sbjct: 98 V-----LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF---MSACFGFGMVAGPV 149

Query: 387 ISTALIEYTGDKASPGYWMSFAAICGLLATCYLYRRS 423
+ + ++ +P + + L C+L S
Sbjct: 150 LGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPES 184


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS11875  SACTRNSFRASE  44     3e-08    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 44.2 bits (104), Expect = 3e-08
Identities = 21/64 (32%), Positives = 26/64 (40%), Gaps = 2/64 (3%)

Query: 73 STWLGRNGIYMEDLYVTPDYRGIGAGKALLKTIAQYAVQRQCGRLEWSVLDWNQPAIDFY 132
S W G +ED+ V DYR G G ALL ++A + L D N A FY
Sbjct: 84 SNWNGY--ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141

Query: 133 LSIG 136

Sbjct: 142 AKHH 145


100  D364_RS13130  D364_RS13205  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS13130  2       11       1.518689   molecular chaperone
D364_RS13135  2       12       2.441031   sensor histidine kinase
D364_RS13140  2       14       3.840114   response regulator transcription factor
D364_RS13145  0       13       4.007831   small membrane protein
D364_RS13150  1       14       3.323355   positive transcription regulator
D364_RS13155  1       12       3.135501   dicarboxylate/amino acid:cation symporter
D364_RS13160  0       14       1.705991   MdtA/MuxA family multidrug efflux RND
D364_RS13165  0       13       1.894713   MdtB/MuxB family multidrug efflux RND
D364_RS13170  1       19       3.042537   multidrug efflux RND transporter permease
D364_RS27045  2       24       3.710510   MFS transporter
D364_RS13180  3       25       4.192576   two-component system sensor histidine kinase
D364_RS13185  3       27       5.111754   two-component system response regulator BaeR
D364_RS13190  3       27       5.409100   tRNA 5-hydroxyuridine modification protein YegQ
D364_RS13195  2       27       4.568418   lipid kinase YegS
D364_RS13200  0       22       3.763669   ABC transporter ATP-binding protein
D364_RS13205  -1      16       2.723723   ABC transporter ATP-binding protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13160  SHAPEPROTEIN  48     5e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 47.8 bits (114), Expect = 5e-08
Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 20/129 (15%)

Query: 132 AMMLH-IKLQAESQLPEQIDQAVIGRPINFQGLGGDEANAQAQGILERAALRAGFRDVVF 190
M+ H IK + + ++ P+ + E A + +A AG R+V
Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140

Query: 191 QFEPVAAGLDFEATLSEEKRVLVVDIGGGTTDCSLLLMGPQWRERADRQQSLLGHSGCRI 250
EP+AA + +SE +VVDIGGGTT+ +++ + ++ S RI
Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189

Query: 251 GGNDLDIAL 259
GG+ D A+
Sbjct: 190 GGDRFDEAI 198



Score = 33.6 bits (77), Expect = 0.001
Identities = 30/126 (23%), Positives = 50/126 (39%), Gaps = 32/126 (25%)

Query: 332 RLSYRLV---RSAEESKIALSSAAS-------------VETALPFIQDELATAIAQQGLE 375
R +Y + +AE K + SA + +P L + +
Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP-RGFTLNSNEILE--- 258

Query: 376 AALDQPLTRIMEQVRLALDSSQTTPDV--------IYLTGGSARSPLIKKALAAQLPGIP 427
AL +PLT I+ V +AL+ Q P++ + LTGG A + + L + GIP
Sbjct: 259 -ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIP 314

Query: 428 LAGGDD 433
+ +D
Sbjct: 315 VVVAED 320


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13170  HTHFIS        66     1e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 65.6 bits (160), Expect = 1e-14
Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 6/141 (4%)

Query: 4 RLAIIEDNADLLDELLAWLGYRGFEVWGTRSAEAFWRQLHSHPVDIVLVDIGLPGEDGFS 63
+ + +D+A + L L G++V T +A WR + + D+V+ D+ +P E+ F
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 VLNYLHELGHY-GLVVVSARGQQQDKLQALSLGADAYLIKPVNFAH-LAETLTALGARLR 121
+L + + ++V+SA+ ++A GA YL KP + + AL R
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 QDRP----AAPPAEAIGTPPA 138
+ + +G A
Sbjct: 125 RPSKLEDDSQDGMPLVGRSAA 145


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13190  RTXTOXIND     44     6e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.4 bits (105), Expect = 6e-07
Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 16/139 (11%)

Query: 55 GAALAPVQAATATEEAVPRYLTGLGTVTAA-NTVTVRSRVDGQLLSLHFQEGQQVKAGDL 113
+A + + E V T G +T + + ++ + + + +EG+ V+ GD+
Sbjct: 67 FLVIAFILSVLGQVEIV---ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDV 123

Query: 114 LAQIDPSQFKVALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVE 173
L ++ A+ K Q++L AR + RYQ L ++ EL+ L +
Sbjct: 124 LLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQILSRS-----IELNKLPELKLP 171

Query: 174 SAGTVKADEAAVASAQLQL 192
+ L
Sbjct: 172 DEPYFQNVSEEEVLRLTSL 190



Score = 36.3 bits (84), Expect = 2e-04
Identities = 26/170 (15%), Positives = 63/170 (37%), Gaps = 17/170 (10%)

Query: 125 ALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVESAGTVKADEAA 184
+A +L ++ L ++ ++ + ++ +++ +
Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEY------QLVTQLFKNEILDKLRQTTDNIGL 313

Query: 185 V----ASAQLQLDWTRITAPIDGRV-GLKQVDIGNQISSGDTTGIVVLTQTHPIDVVFTL 239
+ A + + + I AP+ +V LK G +++ +T ++V ++V +
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED-DTLEVTALV 372

Query: 240 PESSIATVVQAQKAGKALSVEAWDRTNKQKISVGE--LLSLDNQIDATTG 287
I + Q A + VEA+ T + G+ ++LD D G
Sbjct: 373 QNKDIGFINVGQNA--IIKVEAFPYTRYGYLV-GKVKNINLDAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13195  ACRIFLAVINRP  898    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 898 bits (2323), Expect = 0.0
Identities = 294/1036 (28%), Positives = 509/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRFLPVSALPEVDYPTIQVVTLYPGASPDVVTSAI 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA V +
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSSAIPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITSANVNSAKGSLDGP------ARAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEDYRRLII-AYQNGAPIRLGDVASVEQGAENSWLGAWANQQRAIVMNVQRQPGANI 302
++ E++ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 IDTADSIRQMLPQLTESLPKSVKVQVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362
+DTA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNVPATIIPGVAVPLSLVGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SHESLRKQNRFSRASERFFERVIAVYGRWLSRVLNHPWL 538
+S +V+L LTP +CA +L S E + F F+ + Y + ++L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLGVALSTLALSIILWVFIPKGFFPIQDNGIIQGTLQAPQSVSFASMAERQRQVANIILK 598
L + +A ++L++ +P F P +D G+ +Q P + + QV + LK
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VESLTSFVGVDGTNPALNSARLQINLKPLDERDDR---VQTVISRLQQAVDGVPG 653
+ VES+ + G + A N+ ++LKP +ER+ + VI R + + +
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 654 VALYLQPTQDLTIDTTVSRTQYQFTLQ---ANSLEALSTWVPPLLSRLQAQP-QLADVSS 709
++ P I + T + F L +AL+ LL P L V
Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLAAYIKVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEQDTE 769
+ + ++VD++ A LG+S++D++ + A G ++ + ++ ++ D +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 ATPGLAALENIRLTSSDGGIVPLTAIATVEQRFTPLSVNHLDQFPVTTISFNVPDNYSLG 829
++ + + S++G +VP +A T + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 EAVEAILAAEQSLDFPTDIRTQFQGSSLAFQSALGSTVWLVVAAVVAMYIVLGVLYESFI 889
+A+ + L P I + G S + + LV + V +++ L LYES+
Sbjct: 838 DAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALWLAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMPPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLMLSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13200  ACRIFLAVINRP  890    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 890 bits (2302), Expect = 0.0
Identities = 280/1035 (27%), Positives = 502/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILISLAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ ++++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAQTIAQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+++++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVDLNPQALFNQGVSLDAVRTAISDANVRKPQG------ALEDSAHRWQVQTNDELK 236
A+R+ L+ L ++ V + N + G AL + K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAADYQPLIVHY-QNGAAVRLGDVATVSDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
++ + + +G+ VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRARLPELQQTIPAAIDLQIAQDRSPTIRASLEEVEQTLVISVALVILVVFLFLRS 355
T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414
RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGSREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LAVSLTLTPMMCGWLLKSGKPHQPTRNRGFG----RLLVAVQGGYGKSLKWVLKHSRLTG 530
+ V+L LTP +C LLK GF Y S+ +L +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 LVVLGTIALSVWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
L+ +A V L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 R-EDPAVDNVTGFT-GGSRVNSGMMFITLKPRDQRH---ETAQQVIDRLRKKLANEPGAN 641
+ +V V GF+ G N+GM F++LKP ++R+ +A+ VI R + +L
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLSALREWEPKIRKALAAL-----PELADVNSD 696
+ + I G + ++ L D + + R L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QQDNGAEMDLVYDRDTMSRLGISVQDANNLLNNAFGQRQISTIYQPLNQYKVVMEVDPAY 756
++ A+ L D++ LG+S+ D N ++ A G ++ K+ ++ D +
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDVSALDKMFVINSDDKPIPLAYFAKWQPANAPLSVNHQGLSAASTISFNLPTGRSLSE 816
+DK++V +++ + +P + F + + I G S +
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASEAIDRAMTQLGVPSSVRGSFAGTAQVFQQTMNAQVILILAAIATVYIVLGVLYESYVH 876
A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALEIFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRNGN 936
P++++ +P VG LLA +F+ + ++G++ IG+ KNAI++V+FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPEEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13205  TCRTETB       123    5e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 123 bits (311), Expect = 5e-33
Identities = 93/435 (21%), Positives = 186/435 (42%), Gaps = 17/435 (3%)

Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMIIVSYVLTVAVMLPASGWLADRVGVRNIFF 79
F L+ ++N +LP +A + P + + +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTAGSLFCAQA-STLDQLVMARVLQGVGGAMMVPVGRLTVMKIVPRDQYMAAMTF 138
I++ GS+ S L+MAR +QG G A + + V + +P++ A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAIATLCLMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP + I+ + L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAAGMATLTLALDGQKGLGISPAWLAGLVAVGLCALLLYLWHARGNARALFSLNL 257
G +L++ G+ L + ++ + V + + L+++ H R L
Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRNRTFSLGLGGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+N F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVASTLGLAAVSLLFMFSALAGWYYVLPLVLFLQGMINASRFSSMNT 376
+V+R G VL L+ L F +++ +++F+ G ++ ++ + ++T
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTK-TVIST 371

Query: 377 LTLKDLPDDLASSGNSLLSMVMQLSMSIGVTIAGLLLGLYGQQHMSLDAASTHQVFLYT- 435
+ L A +G SLL+ LS G+ I G LL + L +LY+
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSN 431

Query: 436 -YLSMAAIIALPALI 449
L + II + L+
Sbjct: 432 LLLLFSGIIVISWLV 446


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13210  BCTERIALGSPF  36     2e-04    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 36.3 bits (84), Expect = 2e-04
Identities = 28/93 (30%), Positives = 36/93 (38%), Gaps = 21/93 (22%)

Query: 198 LATLLAA-------------LATFPLARGLLAPVKRLVEGTHKLAA------GDFST--R 236
LATL+AA + P L+A V+ V H LA G F
Sbjct: 77 LATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYC 136

Query: 237 VTVTGGDELGRLAQDFNQLASTLERNQQMRRDL 269
V G+ G L N+LA E+ QQMR +
Sbjct: 137 AMVAAGETSGHLDAVLNRLADYTEQRQQMRSRI 169


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13215  HTHFIS        77     2e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 76.8 bits (189), Expect = 2e-18
Identities = 31/148 (20%), Positives = 67/148 (45%), Gaps = 3/148 (2%)

Query: 11 PRILIVEDEPKLGQLLIDYLQAAGYAPTLINHGDKVLPYVRQTPPHLILLDLMLPGTDGL 70
IL+ +D+ + +L L AGY + ++ + ++ L++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDVPVVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL--RR 127
L I+ D+PV++++A+ + + E GA DY+ KP+ E++ + L +
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 128 CKPQRDLQALDAQSPLIVDEGRFQASWR 155
+P + PL+ Q +R
Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYR 151


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13230  PF05272       30     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.008
Identities = 19/93 (20%), Positives = 28/93 (30%), Gaps = 35/93 (37%)

Query: 36 LVGESGSGKTTVLKCLAGLFTHWQGELTI---------------------------DAQP 68
L G G GK+T++ L GL I DA+
Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660

Query: 69 LGHEISRERCRQVQMVFQDPYGSL---HPRHTI 98
+ S + R ++ YG HPR +
Sbjct: 661 VKAFFSSRKDR-----YRGAYGRYVQDHPRQVV 688


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS13235  HTHFIS        29     0.027    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.6 bits (64), Expect = 0.027
Identities = 11/25 (44%), Positives = 14/25 (56%)

Query: 47 IVGESGSGKSTVGRALLQLHPKKAR 71
I GESG+GK V RAL ++
Sbjct: 165 ITGESGTGKELVARALHDYGKRRNG 189


101  D364_RS15590  D364_RS15625  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS15590   0      16       -0.845908  fimbrial biogenesis outer membrane usher
D364_RS15595   0      14        0.504546  fimbrial protein
D364_RS15600  -1      14        2.780692  fimbrial protein
D364_RS15605  -1      13        3.324023  fimbrial biogenesis outer membrane usher
D364_RS15610  -1      12        3.352502  molecular chaperone
D364_RS15615  -2      14        4.117859  fimbrial protein
D364_RS15620   0      14        5.099449  aldehyde dehydrogenase
D364_RS15625   0      16        4.208650  sigma-54-dependent Fis family transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS15590  PF00577  719    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 719 bits (1857), Expect = 0.0
Identities = 323/851 (37%), Positives = 459/851 (53%), Gaps = 46/851 (5%)

Query: 20 PADSAERYNAQFVNG-----IDPLAFNQFVASDGDVMPGTYDVNIYINDLLVDSRPVRFS 74
+ + +N +F+ D F ++ PGTY V+IY+N+ + +R V F+
Sbjct: 42 LSSAELYFNPRFLADDPQAVADLSRFEN----GQELPPGTYRVDIYLNNGYMATRDVTFN 97

Query: 75 EDSAHGGLAPCLSAAEYIRYGVKIDD-------DHQPCFALSQTIRQAEQQLDIANHRLN 127
+ G+ PCL+ A+ G+ C L+ I A QLD+ RLN
Sbjct: 98 TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLN 157

Query: 128 IHIPQQYIEHYPRDYVSPMRFDEGINAAFVNYSYS-TDANNGDGGSHQYQYLSLNSGINI 186
+ IPQ ++ + R Y+ P +D GINA +NY++S N GG+ Y YL+L SG+NI
Sbjct: 158 LTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNI 217

Query: 187 ASWRLRNNAYWNKF-----SGQADKWQSIASWAETNIIPWRSRLVVGQTSTDNSVFDSVQ 241
+WRLR+N W+ SG +KWQ I +W E +IIP RSRL +G T +FD +
Sbjct: 218 GAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGIN 277

Query: 242 FRGVQLGTDAEMRPSSQTGFAPVIRGVANSNARVEVRQNNYLIYSENVPAGPFELNDINA 301
FRG QL +D M P SQ GFAPVI G+A A+V ++QN Y IY+ VP GPF +NDI A
Sbjct: 278 FRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYA 337

Query: 302 VNRSGDFYVTVIEADGSQTTFTVAYTTLPQLVRAGQWNYQLSAGKYH-DGADGYAPALMQ 360
SGD VT+ EADGS FTV Y+++P L R G Y ++AG+Y A P Q
Sbjct: 338 AGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQ 397

Query: 361 SSLSYGLNNTFTLYGGALAAENYRAGAFGVGSNLGEIGALSADYTLAGTTLANGQRKQGG 420
S+L +GL +T+YGG A+ YRA FG+G N+G +GALS D T A +TL + + G
Sbjct: 398 STLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQ 457

Query: 421 SVRFLYAKSFLSSKTDFQIAGYRYSTAGYYSLSDAVNERRRWHNGLYENDYWPSDEYESW 480
SVRFLY KS S T+ Q+ GYRYST+GY++ +D R +N ++ +
Sbjct: 458 SVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTD 517

Query: 481 QASAPQHYYTSWFYNKKHRFDISARQTLGKNSAFFLNFSQQNYWNSSGSDISLQAGFNST 540
Y + YNK+ + ++ Q LG+ S +L+ S Q YW +S D QAG N+
Sbjct: 518 --------YYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTA 569

Query: 541 IHNVNYGLYYQNTRSHFTHD-DNSITLRVSIPF-------TLQENRRINTAFTLAHSKSS 592
++N+ L Y T++ + D + L V+IPF + + R + +++++H +
Sbjct: 570 FEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629

Query: 593 GTSGQAGVNGTLLDDDRLSWAVTSAYDD----TSHSTNSASLGYLGQYGNLYTGYAYSKN 648
+ AGV GTLL+D+ LS++V + Y S ST A+L Y G YGN GY++S +
Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689

Query: 649 HRQASLNLSGGVVAHRGGVTLSQPLGSTFALVEAKDAQGVGIENQTGVRIDPFGYAVVPQ 708
+Q +SGGV+AH GVTL QPL T LV+A A+ +ENQTGVR D GYAV+P
Sbjct: 690 IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPY 749

Query: 709 SVPYRVNSVALNPQDFDAFLDVPNAVADTVPTRGAITRVRFDTFRGYSVLIHTTLADGSY 768
+ YR N VAL+ +D+ NAVA+ VPTRGAI R F G +L+ T +
Sbjct: 750 ATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLM-TLTHNNKP 808

Query: 769 PPLGAELYRASGISNGLVGPGGDVYVSGVDSGEKLQMKWGETHQQSCEITLPELRQEPQQ 828
P GA + S S+G+V G VY+SG+ K+Q+KWGE C Q
Sbjct: 809 LPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQ--LPPESQ 866

Query: 829 ATAWRELSLIC 839
+LS C
Sbjct: 867 QQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS15595  PF05616  29     0.037    Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.037
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 6/74 (8%)

Query: 228 PGYYEKTR--PFT-VTYGLVKQGNGSDCGTEPMLATFSTTNTIQESAIILPQPDSGFGIA 284
PGY EK P T V G V NG+ S NT + +I P+PD G A
Sbjct: 262 PGYSEKVEVAPGTKVNMGPVTDRNGNPVQVVATFGRDSQGNTTVDVQVI-PRPDLTPGSA 320

Query: 285 ISPNASMHPLIEMN 298
+PNA PL E++
Sbjct: 321 EAPNA--QPLPEVS 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS15605  PF00577  739    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 739 bits (1909), Expect = 0.0
Identities = 333/853 (39%), Positives = 480/853 (56%), Gaps = 49/853 (5%)

Query: 25 LATVPTMMFCLSPLSRALADDYFDPAALEFADPQQQTSDLHYFAKPGGQQPGTYPVTVVV 84
+ + + A+ YF+P L D Q +DL F PGTY V + +
Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLA--DDPQAVADLSRFENGQELPPGTYRVDIYL 84

Query: 85 NDQELGQADITFV--DDGGQLRPVLTPGQLAEYGVNVSAFPAFQALHEGETFTRIEKFIP 142
N+ + D+TF D + P LT QLA G+N ++ L + + I
Sbjct: 85 NNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP-LTSMIH 143

Query: 143 DASSRFSFANQRLTLSIPQAAMNVQSRGYVDPSRWDDGVPAAFVDYYFSGAQIKNADEGE 202
DA+++ QRL L+IPQA M+ ++RGY+ P WD G+ A ++Y FSG ++N G
Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN-RIGG 202

Query: 203 SSRSNYLNLRSGLNLGAWRLRNISSMQYDQ------QRRHWDTQSTWLQRDVRSLKSLLR 256
+S YLNL+SGLN+GAWRLR+ ++ Y+ + W +TWL+RD+ L+S L
Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 257 IGDTYTTGDVFDSIQFRGVQLMSDDEMLPDSQRGFAPTIRGVAHSNAKVTVSQHGYVIYE 316
+GD YT GD+FD I FRG QL SDD MLPDSQRGFAP I G+A A+VT+ Q+GY IY
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 317 TFVSPGAFAISDLYPTSQSGDLEVKVTESNGAVRTFTQPYSAVPYMLREGRGKFSLSAGR 376
+ V PG F I+D+Y SGDL+V + E++G+ + FT PYS+VP + REG ++S++AG
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 377 YHSGGESVRSPEFLQGTLFYGLTAGFTLYGGTQLARDYQAWALGLGRGFGEFGSLGGDVT 436
Y SG P F Q TL +GL AG+T+YGGTQLA Y+A+ G+G+ G G+L D+T
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442

Query: 437 QAVTRTPSGKRYTGHSLRAQYQKNFVSSGTAFSLASYRYSSSGYYDFAEASALESAQGQV 496
QA + P ++ G S+R Y K+ SGT L YRYS+SGY++FA+ + +
Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502

Query: 497 D--------------------NRRRREELSVSQSLGGLGSLAVSAWSQEYWHRQSRDETV 536
+ N+R + +L+V+Q LG +L +S Q YW + DE
Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQF 562

Query: 537 HLGFYSAWKGISWGVGYYYTRTSGQQKNDRSWSFNINIPLGGPLSDSA--------VSYN 588
G +A++ I+W + Y T+ + Q+ D+ + N+NIP L + SY+
Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYS 622

Query: 589 TTSDSNGYTSQQMSLYGAVPTRPNLFYSVQQGYGNQGRGSNSS---ASLDYHGGFGNAQI 645
+ D NG + +YG + NL YSVQ GY G G++ S A+L+Y GG+GNA I
Sbjct: 623 MSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANI 682

Query: 646 GYRHDAASNQLTWGGAGSVVAHPHGVTFGQTVGESFAIVRAPGAAGVAVQNGNNVHTDWR 705
GY H QL +G +G V+AH +GVT GQ + ++ +V+APGA V+N V TDWR
Sbjct: 683 GYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR 742

Query: 706 GYAVVPSLTAYRKNVITLDTESMADDTDVDQQGQTVIPGGGAVVMANYQTHIGNRVLFTL 765
GYAV+P T YR+N + LDT ++AD+ D+D V+P GA+V A ++ +G ++L TL
Sbjct: 743 GYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL 802

Query: 766 RNAQGPLPFGASARLVKEEESGNPPGGMVADGGQVYLSGVPQEGTLAVSWIVNNQSQSCT 825
+ PLPFGA V E S + G+VAD GQVYLSG+P G + V W ++ C
Sbjct: 803 THNNKPLPFGAM---VTSESSQSS--GIVADNGQVYLSGMPLAGKVQVKWG-EEENAHCV 856

Query: 826 LHFHLPDNPQQSL 838
++ LP QQ L
Sbjct: 857 ANYQLPPESQQQL 869


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS15625  HTHFIS  285    4e-92    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 285 bits (732), Expect = 4e-92
Identities = 121/366 (33%), Positives = 173/366 (47%), Gaps = 52/366 (14%)

Query: 268 LTTPQGRYHYRLREPTRRRVAVSAPPAMHLPFTSPREGEKLLRLLNAGIALCIEGETGSG 327
L P+ R + V AM + L RL+ + L I GE+G+G
Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY------RVLARLMQTDLTLMITGESGTG 172

Query: 328 KEYVSRTLHQHSRWRSGKFVAINCAAIPESLIESELFGYQPGAFTGASKNGYIGKIREAD 387
KE V+R LH + + R+G FVAIN AAIP LIESELFG++ GAFTGA G+ +A+
Sbjct: 173 KELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRS-TGRFEQAE 231

Query: 388 GGVLFLDEIGDMPLALQTRLLRVLQEKEVAPLGASRSVPVNFALICATHRNLTQRVSAGE 447
GG LFLDEIGDMP+ QTRLLRVLQ+ E +G + + ++ AT+++L Q ++ G
Sbjct: 232 GGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGL 291

Query: 448 FREDLLWRLREYALALPPLREWS----ALETFIATLWHDLGGASRRVTLSNALLVHLSQL 503
FREDL +RL L LPPLR+ + L G +R L +
Sbjct: 292 FREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKR--FDQEALELMKAH 349

Query: 504 PWPGNVRQLQSVLKVMLALADEGDTLTPDALPEAYRAAPAPLPRGG-------------- 549
PWPGNVR+L+++++ + AL D +T + + R+ P
Sbjct: 350 PWPGNVRELENLVRRLTALY-PQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAV 408

Query: 550 ------------------------LQAHDEQLIVDTLARVNGNVSRAAQILGIARSTLYR 585
L + LI+ L GN +AA +LG+ R+TL +
Sbjct: 409 EENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRK 468

Query: 586 RAARAG 591
+ G
Sbjct: 469 KIRELG 474


102  D364_RS16940  D364_RS16980  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS16940  -2      14       -0.857323  ABC transporter ATP-binding protein
D364_RS16945  -1      13       -1.205545  ABC transporter substrate-binding protein
D364_RS16950  -2      11       -0.179673  carbohydrate porin
D364_RS16955   0      13        0.198254  glycoside hydrolase family 32 protein
D364_RS16960   0      14        0.907023  MFS transporter
D364_RS16965  -1      12       -0.347379  hypothetical protein
D364_RS27580   0      12       -0.574137  LacI family DNA-binding transcriptional
D364_RS16975   0      12        1.110281  sugar porter family MFS transporter
D364_RS16980  -2      11        1.364285  2-dehydro-3-deoxy-D-gluconate 5-dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
D364_RS16945  BACINVASINB  36     3e-04    Salmonella/Shigella invasin protein B signature.
		>BACINVASINB#Salmonella/Shigella invasin protein B signature.

Length = 593

Score = 35.5 bits (81), Expect = 3e-04
Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 10/95 (10%)

Query: 60 EVRIGDKIVNNLAPKSRGIAM-VFQNYALYPHMTVRENLAFGLKLSKLPKAQIDRQVEEA 118
+V +G ++ N A + G+A VF A E LA L++ QI + ++++
Sbjct: 504 KVALGMEVTNTAAQSAGGVAEGVFIKNA-------SEALA-DFMLARFAMDQIQQWLKQS 555

Query: 119 AKIL-ELEELLDRLPRQLSGGQAQRVAVGRAIVKK 152
+I E +++ L + +S Q R I+++
Sbjct: 556 VEIFGENQKVTAELQKAMSSAVQQNADASRFILRQ 590


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS16950  MALTOSEBP  30     0.025    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 29.7 bits (66), Expect = 0.025
Identities = 72/298 (24%), Positives = 118/298 (39%), Gaps = 41/298 (13%)

Query: 128 NGKLNGIPISVTARVFYFNDEAWKKAGIPFPKTWDELMAAGKTFESKLGKQYYPVVLEHQ 187
NGKL PI+V A +N + PKTW+E+ A K ++K GK L+
Sbjct: 126 NGKLIAYPIAVEALSLIYNKDLLPNP----PKTWEEIPALDKELKAK-GKSALMFNLQEP 180

Query: 188 ----DVLALLNSYMVQKYNQPAIDEKGRKFSYSKAQWADFFGMYKKLIDSHVMPDTRYYA 243
++A Y KY D K + A+ F + + + H+ DT Y
Sbjct: 181 YFTWPLIAADGGYAF-KYENGKYDIKDVGVDNAGAKAGLTF-LVDLIKNKHMNADTDYSI 238

Query: 244 SFGKSNMYEMKPWIQGEWGGTYMWNSTINKYSDNLKPPAKLVLGEYPMLP--GATDAGLF 301
+ N E I G W + + S +N Y + P K P P G AG
Sbjct: 239 AEAAFNKGETAMTINGPWAWSNIDTSKVN-YGVTVLPTFK----GQPSKPFVGVLSAG-- 291

Query: 302 FKPAQMLSIGKSTKNPQAAAKVINFLLNSKEGVDILGLERGVPLSKAAVTYLTEDGVIKA 361
I ++ N + A + + L + EG++ + ++ PL A+ E+ A
Sbjct: 292 --------INAASPNKELAKEFLENYLLTDEGLEAVNKDK--PLGAVALKSYEEE---LA 338

Query: 362 DDPAVSGLKLAQSLPTALPVSPYFDDPQIVA---QFGTTLQYIDYGKKSVEEAAEDFQ 416
DP ++A ++ A + PQ+ A T + G+++V+EA +D Q
Sbjct: 339 KDP-----RIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQ 391


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS16965  TCRTETA  29     0.036    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.0 bits (65), Expect = 0.036
Identities = 31/142 (21%), Positives = 48/142 (33%), Gaps = 8/142 (5%)

Query: 43 AGDTGIIYAVLSVSALFAQVCYGFIQDKLGLRKHLLWYITALLILSGPAYLLFGHLLKIN 102
GI+ A+ ++ G + D+ G R LL L + Y + +
Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLAGAAVDYAIMATAPFLW 97

Query: 103 VL-LGSIFGGIYIGLTFNGGIGVLESYTERVARQSQFEFGRARMWGSLGWAVATFFAGLL 161
VL +G I GI G T + T+ R F F A G GL+
Sbjct: 98 VLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACF--GFGMVAGPVLGGLM 154

Query: 162 FNINPQLNFLVASCSGLVFFIL 183
+P F A+ + F+
Sbjct: 155 GGFSPHAPFFAAAALNGLNFLT 176


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS16980  TCRTETB  55     4e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 54.9 bits (132), Expect = 4e-10
Identities = 66/371 (17%), Positives = 131/371 (35%), Gaps = 34/371 (9%)

Query: 38 LDIGVISGALPFITDHFTLSSQLQEWVVSSMMLGAAIGALFNGWLSFRLGRKYSLMAGAV 97
L+ V++ +LP I + F WV ++ ML +IG G LS +LG K L+ G +
Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87

Query: 98 LFVAGSIGSAFAAS-VEVLLVARVVLGVAVGIASYTAPLYLSEMASENVRGKMISMYQLM 156
+ GS+ S +L++AR + G + ++ + RGK + +
Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147

Query: 157 VTLGIVLAFLSDTAFSYSGNWRAMLGVLALPAVILIILVVFLPNSPRWLAEKGRHIEAEE 216
V +G + ++ +W L L +I II V FL + H + +
Sbjct: 148 VAMGEGVGPAIGGMIAHYIHW----SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203

Query: 217 VLRMLRDTSEKARDELNEIRESLKLKQGGWALFKV----------------NRNVRRAVF 260
++ M + L + + +F N V
Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263

Query: 261 LGMLLQAMQQFTGMNIIMYYAPRIFKMAGFTTTEQQMIATLVVGLTFMFATFIAVFTVDK 320
G + G ++ Y + + +T E + ++ + +I VD+
Sbjct: 264 CGGI--IFGTVAGFVSMVPYMMK--DVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR 319

Query: 321 AGRKPALKIGFSVMALGTLVLGYCLMQFDNGTASSGLSWLSVGMTMMCIAGYAMSAAPVV 380
G L IG + +++ L + T SW + + + G + + +
Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASF----LLETT-----SWFMTIIIVFVLGGLSFTKTVIS 370

Query: 381 WILCSEIQPLK 391
I+ S ++ +
Sbjct: 371 TIVSSSLKQQE 381


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS16985  DHBDHDRGNASE  110    2e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 110 bits (276), Expect = 2e-31
Identities = 72/257 (28%), Positives = 132/257 (51%), Gaps = 11/257 (4%)

Query: 3 LDAFSLQGKVAVVSGCDTGLGQGMALGLAEAGCDIVGI--NIVEPVETIERVTALGRRFL 60
++A ++GK+A ++G G+G+ +A LA G I + N + + + + A R
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 61 SLTADLRQIDGIPQLLERAVAEFGHIDILVNNAGLIRREDALAFSEKDWDDVMNLNIKSV 120
+ AD+R I ++ R E G IDILVN AG++R + S+++W+ ++N V
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 121 FFMSQAAAKHFIAQGSGGKIINIASMLSFQGGIRVPSYTASKSAVMGVTRLLANEWAKHN 180
F S++ +K+ + + G I+ + S + + +Y +SK+A + T+ L E A++N
Sbjct: 121 FNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179

Query: 181 INVNAIAPGYMATNNTQQLRADEQRSSEILD--------RIPAGRWGLPADLMGPVVFLA 232
I N ++PG T+ L ADE + +++ IP + P+D+ V+FL
Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 233 SSASDYINGYTVAVDGG 249
S + +I + + VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


103  D364_RS17150  D364_RS17185  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias   Product
D364_RS17150  -1      15       4.083321  N-acetyltransferase
D364_RS17155  -1      14       4.330985  (4Fe-4S)-binding protein
D364_RS17160  -1      14       3.966980  hypothetical protein
D364_RS17165  -1      14       4.128799  multidrug transporter subunit MdtN
D364_RS17170   0      13       3.610438  multidrug efflux transporter permease subunit
D364_RS17175   0      14       2.447893  MdtP family multidrug efflux transporter outer
D364_RS17180  -1      15       0.936132  DUF1889 family protein
D364_RS17185  -1      13       1.425878  peptide MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17150  SACTRNSFRASE  28     0.002    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.4 bits (63), Expect = 0.002
Identities = 11/55 (20%), Positives = 23/55 (41%)

Query: 11 YVNDAQGNQVAEIVFVPTGEHLSIIEHTDVDPSLKGQGVGKQLVAKVVEKMRQEQ 65
++ + N + I ++IE V + +GVG L+ K +E ++
Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS17165  RTXTOXIND  68     7e-15    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 68.3 bits (167), Expect = 7e-15
Identities = 49/362 (13%), Positives = 106/362 (29%), Gaps = 81/362 (22%)

Query: 11 KKWPLLALVLAAILALILVIWQL-----QTSPETNDAYVYADTIDVVPEVSGRIVEMPIR 65
+ P L +I I + + + ++ P + + E+ ++
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 66 DNQRVKKGDLLFRIDPRP---------------------YQAMLDDA------------- 91
+ + V+KGD+L ++ YQ +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 92 ------------------KARLTTLDAQIMLTQRTIKAQEYNAQSVAAAVERARALVKQT 133
K + +T Q + + + +V A + R L +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 134 TSTRIRLEPLVPQGFASQEDLDQARTAEKAARAELEATLLQAKQASAAVTGVDAMVAQRA 193
S L+ + ++ + + A EL Q +Q + +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 194 GVL-------------------AQIALAELHLEFTEVRAPFNGVVVALKT-TVGQYASAL 233
+ ++A E + + +RAP + V LK T G +
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 234 KPVFTLL-DDDRWYVIANFRETDLNNVRPGVAARITVMT-NHNRT--FNGVVDSVGSGVL 289
+ + ++ +DD V A + D+ + G A I V + R G V ++ +
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413

Query: 290 PE 291
+
Sbjct: 414 ED 415


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17170  TYPE3IMSPROT  29     0.049    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.3 bits (66), Expect = 0.049
Identities = 17/109 (15%), Positives = 41/109 (37%), Gaps = 13/109 (11%)

Query: 394 LASLLALLLIVFVQPWTDSLTGLLAMSLPV---LALAAWIAAGSERIAYAGIQIGFTFA- 449
+ L+ + P++ +L+ ++ L L A IA +Q GF +
Sbjct: 53 FSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISG 112

Query: 450 ---------LAFLSWFAPLTNLTELRDRVLGILLGVLVSSIVHLYLWPD 489
+ + + ++ L + + IL VL+S ++ + + +
Sbjct: 113 EAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGN 161


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17185  TCRTETB  31     0.010    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.010
Identities = 33/188 (17%), Positives = 73/188 (38%), Gaps = 12/188 (6%)

Query: 33 SFYGIRPLLILFMAATVYDGGMGLARENASAIVGIFAGSMYLAALPGGWLADNWLGQQRA 92
SF+ + ++L ++ + + + F + + G L+D LG +R
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ-LGIKRL 81

Query: 93 VWYGSILIALGHLSIALSAWLGNDLFFIGLMFIVL---GSGLFKTCISVMVGTLYKKGDA 149
+ +G I+ G ++ ++G+ F + +M + G+ F + V+V K
Sbjct: 82 LLFGIIINCFG----SVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK--E 135

Query: 150 RRDGGFSLFYMGINIGSFIAPLISGWLIKSHGWHWGFGIGGIGMLVALIIFRVFAVPSMK 209
R F L + +G + P I G + +H HW + + + + + F + +
Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGG--MIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193

Query: 210 RYDAEVGL 217
R +
Sbjct: 194 RIKGHFDI 201


104  D364_RS17265  D364_RS17280  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS17265  2       37       -10.348462  response regulator
D364_RS17270  4       40       -11.420631  acid-sensing system DNA-binding response
D364_RS17275  5       39       -10.870159  EmrA/EmrK family multidrug efflux transporter
D364_RS17280  6       41       -11.770338  DHA2 family efflux MFS transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17265  HTHFIS  68     4e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 67.5 bits (165), Expect = 4e-14
Identities = 36/156 (23%), Positives = 61/156 (39%), Gaps = 9/156 (5%)

Query: 258 QAIRILIAEDLPANRQLLRRQLDTLGYAADEAKDGAEALKLIQQQRYDLLITDLNMPVMD 317
IL+A+D A R +L + L GY + A + I DL++TD+ MP +
Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 318 GITLTCRVREYDTRMVIWGLTANLVAGEKERCLASGMNLCLFKPLDLSQL----ATALCE 373
L R+++ + + ++A + G L KP DL++L AL E
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 374 INIPQSGSSLDEFLNMKIFTALTLGDKKLMRQMLEQ 409
S D M + +G M+++
Sbjct: 122 PKRRPSKLEDDSQDGMPL-----VGRSAAMQEIYRV 152


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17270  HTHFIS  70     2e-16    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 70.2 bits (172), Expect = 2e-16
Identities = 23/122 (18%), Positives = 56/122 (45%), Gaps = 4/122 (3%)

Query: 1 MSKTANLSAIIIDDHPLARMAIRNLLENEGFNIVAEAGDGGEALMAVAEYQPDVVIVDVD 60
M+ + ++ DD R + L G+++ + +A D+V+ DV
Sbjct: 1 MTGA---TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVV 56

Query: 61 IPVMSGIEVVEKLRKKQFSHIIIVVSAKNDLFYGKRSADAGANAFISKKEGINNIISAIH 120
+P + +++ +++K + ++V+SA+N ++++ GA ++ K + +I I
Sbjct: 57 MPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116

Query: 121 AA 122
A
Sbjct: 117 RA 118


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS17275  RTXTOXIND  64     4e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 63.7 bits (155), Expect = 4e-13
Identities = 37/218 (16%), Positives = 74/218 (33%), Gaps = 24/218 (11%)

Query: 132 IAYQQALADYQRRSRLQGAAAISRENMQHAKDAVDSSKAALDVAVQAYRGNRVLIQNTAL 191
IA L + + + ++ + + S+K + Q + +N L
Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF-------KNEIL 301

Query: 192 EKQPEVLMAAESMRE----AWVALQRTKVRSPVTGYLAQRNVQ-VGETIGSGQALMSIIP 246
+K + + Q + +R+PV+ + Q V G + + + LM I+P
Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361

Query: 247 VEQV-WINANFKETQLSGVKIGQKVSI-VTDF-YGSDVVFNGRVDGINMGTGSAFSVLPA 303
+ + A + + + +GQ I V F Y G+V I + ++
Sbjct: 362 EDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI-----NLDAIE-- 414

Query: 304 QNATGNWIKVVQRLPVRITLDAEQIKAYPLRIGLSATV 341
G V+ + K PL G++ T
Sbjct: 415 DQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAVTA 450


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17280  TCRTETB  130    4e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 130 bits (328), Expect = 4e-35
Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 19/411 (4%)

Query: 18 VTLALSMATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAISIPVTGRLAQ 77
+ + L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 78 RFGERKLFLTSVTLFALASLCCGLS-TNLDTLIGFRVVQGLVAGPLIPLSQSLLLRNYPP 136
+ G ++L L + + S+ + + LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 137 EKRNIALALWSMTVIIAPIFGPIIGGYICDNYDWGWIFLINVPLGVIVVVLTSWLLKGRE 196
E R A L V + GP IGG I W ++ LI + + V L L K
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR 194

Query: 197 TPTEPVKINLIALSLLVLGVGSLQIMLDKGKDLDWFNSTTIIVLAIIAVIAIILLVIWEA 256
++ + L+ +G+ + ML F ++ I I++V++ ++ V
Sbjct: 195 IKGH---FDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 257 TDNNPIIDLSLFRSRNFTIGILCIACAYLIYAGAIVLMPQLLQTVFEYTSVSAGLAYAPI 316
+P +D L ++ F IG+LC + AG + ++P +++ V + ++ G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 317 GIMPLLL-APLIGRYGHKIDMRMLVTFSFIVYALCYYWRSVTFSSAINF-TWVIIPQFMQ 374
G M +++ + G + ++ ++ + S + F T +I+
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 375 GFAVACFFLPLTTISLSGLPPEKFAAATSLSNFFRSLSGSIGTTITMTLWS 425
++TI S L ++ A SL NF LS G I L S
Sbjct: 362 LSFTK---TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


105  D364_RS26290  D364_RS17420  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS26290  2       53       -12.033781  response regulator
D364_RS17385  2       55       -12.793712  FUSC family protein
D364_RS17390  2       52       -11.083261  response regulator
D364_RS25615  2       43        -7.160389  alpha/beta fold hydrolase
D364_RS17405  3       43        -7.896416  LysR family transcriptional regulator
D364_RS17410  2       48        -9.265549  MFS transporter
D364_RS27130  1       50       -10.602150  tyrosine-type recombinase/integrase
D364_RS17420  1       52       -10.723839  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17385  HTHFIS  48     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.5 bits (113), Expect = 9e-09
Identities = 21/126 (16%), Positives = 49/126 (38%), Gaps = 10/126 (7%)

Query: 1 MSA---VIIDDHPFARLALKTVLENQNI-VVTGEAADDFHAIQLVDRLQPDIVIVDVMLI 56
M+ ++ DD R L L V A + + D+V+ DV++
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAAT--LWRWIAAGDGDLVVTDVVMP 58

Query: 57 GSSGIDVVTKLRQNHYAGSIVMVSGKNQIFYRKCSVDAGANAFISK----KESMDNFVAA 112
+ D++ ++++ ++++S +N + + GA ++ K E + A
Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118

Query: 113 IQAVQR 118
+ +R
Sbjct: 119 LAEPKR 124


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS17405  HTHFIS  42     9e-07    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 41.7 bits (98), Expect = 9e-07
Identities = 13/66 (19%), Positives = 25/66 (37%), Gaps = 1/66 (1%)

Query: 15 AIKTLLENKGVSVTGEAINGMDALRIVDQLQPNTIIVDVDLPDIDGIGLVETLRKRLYKG 74
+ L G V N R + + ++ DV +PD + L+ ++K
Sbjct: 18 VLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDL 76

Query: 75 SIIVTS 80
++V S
Sbjct: 77 PVLVMS 82


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS17425  TCRTETB  126    4e-34    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 126 bits (318), Expect = 4e-34
Identities = 77/398 (19%), Positives = 173/398 (43%), Gaps = 13/398 (3%)

Query: 29 TLMGVFDGTMINIALPSMAQEMQVPASIAVWFANGYLLAAAMTLAIFAALAARLGYRPVF 88
+ V + ++N++LP +A + P + W ++L ++ A++ L+ +LG + +
Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82

Query: 89 LAGLTTFTLTSLGCALA-NKPEVLIGMRVLQGIGGAATLSIAPAILRSVFPGRLLGRILG 147
L G+ S+ + + +LI R +QG G AA ++ ++ P G+ G
Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142

Query: 148 FHALLIASSSAIGPVLGGTILHTLSWQWLFAINVLPGTLALLLAVRALPRDAIRMQAPFD 207
++A +GP +GG I H + W +L I ++ T+ + + L + +R++ FD
Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI--TIITVPFLMKLLKKEVRIKGHFD 200

Query: 208 TVGAILSALLLGSTIMAANSLQNATSQFGSLCWMALAALSGMAFIWQIRRTGHPLLPPSM 267
G IL ++ + ++ S S+ ++ ++ LS + F+ IR+ P + P +
Sbjct: 201 IKGIILMSVGIVFFMLFTTS--------YSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 268 FKNERFTLAAFTSMVAFVSQGITFIALPFLFQSEYGYSP-VVSALLFTPWPLGIVLIAPH 326
KN F + + F + +P++ + + S + +++ P + +++
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 327 AGRWADTISAPAISTLGLVIFVVGLILLATLPASPSMWDICLRSLVCGIGFGCFQSPNNR 386
G D + +G+ V + + L + S + + V G G ++ +
Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIST 371

Query: 387 EMLSNVIREHASYASGVLSIMRTFGQCLGAAAVAVLLA 424
+ S++ ++ A +L+ + G A V LL+
Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS17435  56KDTSANTIGN  33     0.002    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 33.4 bits (76), Expect = 0.002
Identities = 37/158 (23%), Positives = 62/158 (39%), Gaps = 27/158 (17%)

Query: 2 SVNVKAATLTNLVKYKTDRASLRSVRDDMKKLQKDFSKTEGTIAKAKMQADKQAYTAQMQ 61
+NV L N + ++ ++ + D +++L+ F ++ + Q
Sbjct: 283 GINVPDTGLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQ 342

Query: 62 QQKQVQQQQKQAAKQATVDAKA-------KQIEA------------------RKLAAAQS 96
QQ Q QQQQ QA Q V A A QI KLAA Q
Sbjct: 343 QQGQGQQQQAQATAQEAVAAAAVRLLNGSDQIAQLYKDLVKLQRHAGIRKAMEKLAAQQE 402

Query: 97 KAAKIQMQQ--QQKQASVAENARLKERKALFDIGRMEG 132
+ AK Q + +Q+Q + ++ K ++ FD+ + G
Sbjct: 403 EDAKNQGKGDCKQQQGASEKSKEGKVKETEFDLSMVVG 440


106  D364_RS19245  D364_RS19260  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS19245  0       17       -3.712147  DNA-binding transcriptional regulator Fis
D364_RS19250  0       17       -3.333328  acrEF/envCD operon transcriptional regulator
D364_RS19255  1       18       -2.960039  efflux RND transporter periplasmic adaptor
D364_RS19260  1       19       -3.145978  membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19245  DNABINDNGFIS  157    3e-54    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 157 bits (399), Expect = 3e-54
Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS19250  HTHTETR  119    1e-35    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 119 bits (299), Expect = 1e-35
Identities = 77/201 (38%), Positives = 124/201 (61%), Gaps = 3/201 (1%)

Query: 1 MARKTKEEAQRTRQLLIESAIQQFALRGVTNTTLTDIADAAGVTRGAVYWHFASKTELFN 60
MARKTK+EAQ TRQ +++ A++ F+ +GV++T+L +IA AAGVTRGA+YWHF K++LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EMW-QQQPPLRDLIQPSQAIEYEHEPLNALRERFIAGLRYIAANPRQRALMQILYQRCEF 119
E+W + + +L QA ++ +PL+ LRE I L R+R LM+I++ +CEF
Sbjct: 61 EIWELSESNIGELELEYQA-KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 120 SSDMLSEYEIRQRIGF-NYSLISGILQCCVRNNILPAETNIEMILIVLHSAFSGLIKNWL 178
+M + ++ + +Y I L+ C+ +LPA+ I++ SGL++NWL
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 179 LDPQRFDLYQQAPALVDNIMA 199
PQ FDL ++A V ++
Sbjct: 180 FAPQSFDLKKEARDYVAILLE 200


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits       Score  E-value  Comments
D364_RS19255  RTXTOXIND  37     1e-04    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 36.7 bits (85), Expect = 1e-04
Identities = 34/211 (16%), Positives = 67/211 (31%), Gaps = 30/211 (14%)

Query: 97 ATYQAAWNSAKGDEAKAEAAAAIAHLTVKRYVPLLGTKYISQQEYDQAVATA-RQADADV 155
K + E+ A + Q + + RQ ++
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311

Query: 156 IATKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVTNGQSDALATVQQLDPIYVDV 214
+ + + +P+S ++ + V TEG +VT ++ + V + D + V
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTA 370

Query: 215 TESSNDFMRLKQESLQRGGDTKSVELVMENGQAYP-LKGSLQ--FSDVTVDESTG----- 266
+ D + G +++ Y L G ++ D D+ G
Sbjct: 371 LVQNKDIGFINV------GQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNV 424

Query: 267 --SITLRAIFPNPQHV-LLPGMFVRARIDEG 294
SI + +++ L GM V A I G
Sbjct: 425 IISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 35.2 bits (81), Expect = 3e-04
Identities = 23/127 (18%), Positives = 41/127 (32%), Gaps = 15/127 (11%)

Query: 46 APLSVTTELPGR-TSAFRVAEVRPQVSGIILKRNFV-EGSDVEAGQSLYQIDPATYQAAW 103
+ + G+ T + R E++P + I+ K V EG V G L ++ +A
Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEA-- 134

Query: 104 NSAKGDEAKAEAAAAIAHLTVKRYVPLLGTKYISQQEYDQAVATARQADADVIATKAAVE 163
D K +++ A L RY L E ++ +
Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEV 184

Query: 164 TARINLA 170
+L
Sbjct: 185 LRLTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19265  PF06291       27     0.005    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.5 bits (58), Expect = 0.005
Identities = 20/65 (30%), Positives = 27/65 (41%), Gaps = 3/65 (4%)

Query: 1 MKKYLIVALLASLLAGCAHDSPCV---PVYDSQGRLVHTNTCMKGTTEDNWETAGAIAGG 57
MKK L A LA L+ GCA + V P + + + + G + A I GG
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 65

Query: 58 AAAVA 62
A V
Sbjct: 66 AENVV 70


107  D364_RS19495  D364_RS19515  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS19495  5       49       -6.279207   A24 family peptidase
D364_RS19500  4       50       -6.513197   bacterioferritin
D364_RS19505  6       56       -6.947900   bacterioferritin-associated ferredoxin
D364_RS19510  7       60       -5.953506   elongation factor Tu
D364_RS19515  7       58       -5.165499   elongation factor G
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19500  PREPILNPTASE  122    3e-37    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 122 bits (309), Expect = 3e-37
Identities = 68/150 (45%), Positives = 86/150 (57%), Gaps = 7/150 (4%)

Query: 2 LAALPFLLCYSGLTVALCHQDLRHGLLPDRYTCPLLWSGLLFYLCLAPHQLHDAVWGAIA 61
L LL L VAL DL LLPD+ T PLLW GLLF L L DAV GA+A
Sbjct: 132 WGTLAALLLTWVL-VALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMA 190

Query: 62 GYLSLAAIYWLYRGIRGYEGLGYGDIKYLAALGAWHGWRLLPQLVLVASLLAGIAWAGAG 121
GYL L ++YW ++ + G EG+GYGD K LAALGAW GW+ LP ++L++SL+
Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFM---GI 247

Query: 122 LYASCGRKSKWGRSNPLPFGPFLAAAGFWC 151
+S P+PFGP+LA AG+
Sbjct: 248 GLILLRNHH---QSKPIPFGPYLAIAGWIA 274


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19505  HELNAPAPROT   37     9e-06    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 36.8 bits (85), Expect = 9e-06
Identities = 18/103 (17%), Positives = 42/103 (40%), Gaps = 10/103 (9%)

Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95
E ++ E D ER+L + G P +++ + G EM+++ +
Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109

Query: 96 EGAQNLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138
+ + + I A+ D + D+ + ++ + E + L + L
Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19515  TCRTETOQM     80     4e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.5 bits (196), Expect = 4e-18
Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%)

Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66
+N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59

Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126
+ +D PGH D++ + + +DGAIL+++A DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160
G+P I F+NK D + L V +++E LS
Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19520  TCRTETOQM     613    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 613 bits (1583), Expect = 0.0
Identities = 179/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWSGMAKQYEPHRVNIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128
+ W +VNIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188
K +P I F+NK+D+ G + V IK +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNDADQGVTFEYEDIPADMQDLADEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248
N+ +++Q ++ E +++L+EKY+ G+ L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KKALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308
++ R N + V GSA N G+ +++ + + S
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368
FKI L + R+YSGV++ D+V S K + + +
Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299

Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424
+ +I + +G+I L V GDT P ER+E P P++ VEP
Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354

Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484
+E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE +
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414

Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544
+P V Y E K E + + + + + PL GS G ++ + + G
Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468

Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604
+ + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++
Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527

Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLRGQESEVTGVKIHAEVPLSEMF 664
KKA LLEP + ++ P+E D + + + + V + E+P +
Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587

Query: 665 GYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIEAR 702
Y + L T GR+ E Y + V + R
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622


108  D364_RS19700  D364_RS19725  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS19700  1       18       1.188131    cell division protein DamX
D364_RS19705  2       20       2.156821    3-dehydroquinate synthase
D364_RS19710  3       22       3.668165    shikimate kinase AroK
D364_RS19715  0       15       3.117640    DNA uptake porin HofQ
D364_RS26400  -2      15       2.481771    DUF2531 family protein
D364_RS19720  -2      11       1.648890    hypothetical protein
D364_RS19725  -2      11       1.027275    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19700  IGASERPTASE   36     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 36.2 bits (83), Expect = 2e-04
Identities = 32/208 (15%), Positives = 59/208 (28%), Gaps = 12/208 (5%)

Query: 125 SSSQQTASGEKSINLSDDQSASMPAAGQDQTAAANSTSQQDVTVPPIAANPTQGQAAVAP 184
S + A + +T A NS + N A
Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV----EKNEQDATETTAQ 1064

Query: 185 QGQQRIEVQGDLNNALTQQ---QGQLDGAVANSTLPTEPATVAPIRNGANGTAAPRQATE 241
+ E + ++ Q + +T E ATV T ++ +
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124

Query: 242 RQTAATPRPAERKHTVIEAKPQPKPQAVAKTPVESKPVQPKHVESTATTAPAKTSVSESK 301
+ +P+ + + +A+P + V K Q + + T PAK + S +
Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPT----VNIKEPQSQTNTTADTEQPAKETSSNVE 1180

Query: 302 PVATAQSKPTTTTAAPAATAAAAAPAAK 329
T +S T + PA
Sbjct: 1181 QPVT-ESTTVNTGNSVVENPENTTPATT 1207


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19710  CARBMTKINASE  31     0.002    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 30.6 bits (69), Expect = 0.002
Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%)

Query: 32 FYDSDQEIEKRTGADVGWVFDVEGEEGFRD----------REEKIINELTEKQGIVLATG 81
FYD + KR + GW+ + G+R E + I +L E+ IV+A+G
Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193

Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112
GG V + +GV E I+K LA
Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19715  TYPE3OMGPROT  226    3e-70    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 226 bits (578), Expect = 3e-70
Identities = 77/282 (27%), Positives = 122/282 (43%), Gaps = 17/282 (6%)

Query: 139 GGKLLSARGHLMADKRTNRLLIRDDARHLPALKAWAQEMDLPVGQVELAAHIVSMSETSL 198
SA+ + AD N +++RD +P + +D P ++E+A IV ++ L
Sbjct: 237 AATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQL 296

Query: 199 RELGVKWRLAEAGSPPGSGQITTLSSDVSVNDASTRAGFNIGKINGRLLEL---ELSALE 255
ELGV WR+ I T ++ G ++ R L+ ++ LE
Sbjct: 297 TELGVDWRVGIRTGNNHQVVIKTTGDQSNIAS----NGALGSLVDARGLDYLLARVNLLE 352

Query: 256 RKQQVEIIASPRLLASHMQPASIKQGSEIPYQVSSGESGATSVEFKEAVLG--MEVTPTV 313
+ ++++ P LL A I SE Y +G+ A E K G + +TP V
Sbjct: 353 NEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA---ELKGITYGTMLRMTPRV 408

Query: 314 LQQG---RVRLKLRISENTPGQVLKQENGEALAIDKQEIETLVEVRSGETLALGGIFSQK 370
L QG + L L I + I + ++T+ V G++L +GGI+ +
Sbjct: 409 LTQGDKSEISLNLHIEDGNQKPNS-SGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDE 467

Query: 371 NKTARDSVPLLGDIPVLGRLFRRDGKDNERRELVVFITPRIL 412
A VPLLGDIP +G LFRR + R + I PRI+
Sbjct: 468 LSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19725  PYOCINKILLER  34     2e-04    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 34.4 bits (78), Expect = 2e-04
Identities = 29/118 (24%), Positives = 44/118 (37%), Gaps = 16/118 (13%)

Query: 37 LLVSRTARLQRDFLATLHTTADAQLLASLKQREQAMREAWQQHQRQRQQYQRRSAIAAWQ 96
L + LQ A + A+ K REQA EA +++ + Q R A
Sbjct: 192 LFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEA-----KRKAEEQARQQAAIRA 246

Query: 97 PRLQALAAD----LPAQAWLTRLEYQGVLLTLDGLALNLQALTSVEAALTRVAGFAPA 150
A+ A+ A +G++ G A QA++ A L RV AP+
Sbjct: 247 ANTYAMPANGSVVATAAG-------RGLIQVAQGAASLAQAISDAIAVLGRVLASAPS 297


109  D364_RS19940  D364_RS19975  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS19940  -1      12       1.064345    N-acetyltransferase
D364_RS19945  0       12       0.834240    gamma-glutamyltransferase
D364_RS19950  -2      12       0.831352    DUF2756 family protein
D364_RS19955  -3      14       0.366204    glycerophosphodiester phosphodiesterase
D364_RS19960  -3      12       -0.004733   sn-glycerol-3-phosphate import ATP-binding
D364_RS19965  -3      13       -1.060750   sn-glycerol-3-phosphate ABC transporter permease
D364_RS19970  -1      15       -1.079440   sn-glycerol-3-phosphate ABC transporter permease
D364_RS19975  -2      15       -0.260495   sn-glycerol-3-phosphate ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19945  SACTRNSFRASE  37     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.8 bits (85), Expect = 1e-05
Identities = 17/63 (26%), Positives = 27/63 (42%), Gaps = 13/63 (20%)

Query: 80 MAVAAGHQGCGIGSALMREMID------LCDNWLRVERIELTVFADNAPAIAVYKKYGFE 133
+AVA ++ G+G+AL+ + I+ C L + I N A Y K+ F
Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDI-------NISACHFYAKHHFI 147

Query: 134 IEG 136
I
Sbjct: 148 IGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19950  NAFLGMOTY     33     0.003    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 33.2 bits (75), Expect = 0.003
Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%)

Query: 276 RTPVSGEYRGYEVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQVMAEAEKHA 334
R P+ GE R + SMPPP G H +I N+ F Q G+ G A +++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133

Query: 335 YADRSEYLGDPDFVNVPWQA 354
Y P F WQ+
Sbjct: 134 Y---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19960  PF04619       28     0.018    Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 28.4 bits (63), Expect = 0.018
Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 4/60 (6%)

Query: 29 VGARYGHTMIEFDAKLSKDGQIFLLHDDNLERTSNGWGVAGELAW----DDLLKVDAGSW 84
+G ++ D + G+ FL+ D+N ++ AW K D GSW
Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19965  PF05272       29     0.036    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.036
Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61
+V+ G G GKSTL+ + GL+ +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS19980  MALTOSEBP     38     6e-05    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 38.2 bits (88), Expect = 6e-05
Identities = 44/175 (25%), Positives = 73/175 (41%), Gaps = 17/175 (9%)

Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPDQPPKTWQDLAAYTAKLKAAGMKCGYASGWQ 193
G L++ P L YNKD L P+ PPKTW+++ A +LKA G + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLPN-PPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 194 GWIQIENFSAWHGLPVATKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYFGR 251
+ +A G +N +D D ++ K + L++ + D Y
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236

Query: 252 KDESTEKFYNGDCAITTASSGSLADIRQYAKFNYGVGMMPYDADVKGAPQNAIIG 306
+ F G+ A+T + ++I +K NYGV ++P KG P +G
Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286


110  D364_RS20110  D364_RS20145  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS20110  4       15       2.778726    MFS transporter
D364_RS20115  2       13       3.238159    AI-2E family transporter
D364_RS20120  0       14       2.387589    4-amino-4-deoxy-L-arabinose-phosphoundecaprenol
D364_RS20125  -2      17       2.249876    4-amino-4-deoxy-L-arabinose-phosphoundecaprenol
D364_RS20130  -2      16       2.271434    lipid IV(A)
D364_RS20135  -2      16       1.524127    4-deoxy-4-formamido-L-arabinose-
D364_RS20140  -1      16       2.083651    bifunctional UDP-4-amino-4-deoxy-L-arabinose
D364_RS20145  -2      15       2.330692    undecaprenyl-phosphate
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20115  TCRTETA       49     2e-08    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 49.1 bits (117), Expect = 2e-08
Identities = 75/365 (20%), Positives = 134/365 (36%), Gaps = 30/365 (8%)

Query: 13 LRLNLRIVSVVIFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70
++ N ++ ++ + IGL + VLPG + D++ G++++L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 71 PHAGRYADLLGPKKIVVFGLGGCFLSGLSYLLAAWGSGWPLISLLLLCLGRVILGI-GQS 129
P G +D G + +++ L + + Y + A L +L +GR++ GI G +
Sbjct: 61 PVLGALSDRFGRRPVLLVSL---AGAAVDYAIMATAP-----FLWVLYIGRIVAGITGAT 112

Query: 130 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLC--YSHIGLSGLAGVIM 187
A G+ + + R + M G LG L +S A +
Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170

Query: 188 AVALVAILCALP-------RAAVKAAKGKAMSFR-AVLGRVWPYGMALA-LASAGFGVIA 238
+ + LP R + A SFR A V MA+ + V A
Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230

Query: 239 TFITLFYDAK-GWDGAAFALTLFSCAFVGA---RLLFPNAINRLGGLNVAMLCFSVEAIG 294
+F + + WD ++L + + + ++ RLG ML + G
Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290

Query: 295 LLLVGFADTPMMAKIGTFLTGAGFSLVFPALGVVAVKAVPQHNQGSALATYTVFMDLSLG 354
+L+ FA MA L A + PAL + + V + QG + L+
Sbjct: 291 YILLAFATRGWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348

Query: 355 VSGPL 359
+ GPL
Sbjct: 349 IVGPL 353


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20130  BCTERIALGSPC  32     2e-04    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 32.2 bits (73), Expect = 2e-04
Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 35 RHILFWLGMALLCLGCGMLLW-LSVLQSIPVS 65
R ILF+L M L C M+ W + + + PVS
Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPVS 46


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20145  NUCEPIMERASE  113    2e-29    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 113 bits (284), Expect = 2e-29
Identities = 74/361 (20%), Positives = 138/361 (38%), Gaps = 60/361 (16%)

Query: 317 RVLILGVNGFIGNHLTERLLQDDNYEIYGLDIGSD--------AISRFLDCPRFHFVEGD 368
+ L+ G GFIG H+++RLL + +++ G+D +D A L P F F + D
Sbjct: 2 KYLVTGAAGFIGFHVSKRLL-EAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 369 ISIHSEWIE--YHIKKCDVVLPLVAIATPIEYT-RNPLRVFELDFEENLKIIRDCVKYN- 424
++ E + + + V + Y+ NP + + L I+ C
Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118

Query: 425 KRIIFPSTSEVYGMCTDKNFDEDSSNLVVGPINKQRWIYSVSKQLLDRVIWAYGDKYGLK 484
+ +++ S+S VYG+ F D S V P++ +Y+ +K+ + + Y YGL
Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS--VDHPVS----LYAATKKANELMAHTYSHLYGLP 172

Query: 485 FTLFRPFNWMGPRLDNLNAARIGSSRAITQLILNLVEGSPIKLIEGGKQKRCFTDISDGI 544
T R F GP A+ + ++EG I + GK KR FT I D
Sbjct: 173 ATGLRFFTVYGPWGR--------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224

Query: 545 EALFRIIEN---------------KDGRCDGQIINIGNPDNEASIKELAEMLLACFERHP 589
EA+ R+ + ++ NIGN + + + L
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283

Query: 590 LRDRFPPFAGFREVESSDYYGKGYQDVEHRKPSIRNAKRCLNWEPKVEMEETVEHTLDFF 649
++ P G DV + + + P+ +++ V++ ++++
Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328

Query: 650 L 650

Sbjct: 329 R 329


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20150  ACRIFLAVINRP  31     0.011    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 30.6 bits (69), Expect = 0.011
Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 5/97 (5%)

Query: 182 TFIPILANTFARRAVEIPVMHAEREFGDSKYSFMRLINLMYDLVTCLTTTPLRLLSIFGS 241
P L T + + FG +F +N + V + + R L I+
Sbjct: 487 ILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYAL 546

Query: 242 VIALLGFAFGLLLVVLRLAFGPQWAAEGVFMLFAVLF 278
++A + F L +F P+ +GVF+ L
Sbjct: 547 IVAGMVVLFLRLPS----SFLPE-EDQGVFLTMIQLP 578


111  D364_RS20450  D364_RS20480  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS20450  -2      12       0.044649    kdo(2)-lipid A phosphoethanolamine
D364_RS20455  -2      10       0.749833    MFS transporter
D364_RS20460  -1      9        1.666448    hypothetical protein
D364_RS20465  -1      10       2.241797    DNA-3-methyladenine glycosylase I
D364_RS20470  -1      12       2.249112    N-acetyltransferase
D364_RS20475  -1      15       2.523303    molybdopterin guanine dinucleotide-containing
D364_RS20480  -2      13       2.161462    OmpA family lipoprotein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20455  PF06580       29     0.049    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.1 bits (65), Expect = 0.049
Identities = 23/137 (16%), Positives = 43/137 (31%), Gaps = 23/137 (16%)

Query: 5 RTMTQQKLSFWLALYIGWFMNVAVFFRRFDGYAQEFTFWKGLSGVVELVATVFVTFFLLR 64
T Q +W IGW + F G+A + K S + + ++
Sbjct: 3 STHRQANKYYWYCQGIGWGVYTLTGF----GFASLYGSPKLHSMIFNIAISLMGLVLTHA 58

Query: 65 LLSLFGRRIWRILATLIVLFSAAASYYMTFLNVVIGYGIIASVMTTDIDLSKEVIGWHLI 124
S R+ W L ++ + + G++ V T I W L+
Sbjct: 59 YRSFIKRQGWLKLNMGQIILRVLPA--------CVVIGMVWFVANTSI--------WRLL 102

Query: 125 LWLVAVSAPPLLFIWSN 141
++ + P+ F
Sbjct: 103 AFI---NTKPVAFTLPL 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20460  TCRTETA       41     4e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 41.3 bits (97), Expect = 4e-06
Identities = 48/276 (17%), Positives = 92/276 (33%), Gaps = 33/276 (11%)

Query: 44 PVSQVAFSFGLLSLGLALS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSSSL 99
+ V +G+L AL + V G L +RFG + V + S + + + A + L
Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96

Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFSIGSYGLGSLGFK 152
+L++ AG+ AG + + F + LG
Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152

Query: 153 FIDSHLLATVGLEKTFVIWGAIVLVMIVFGATLMKDAPNHPAATAANGVVENDFTLAESM 212
L+ F A+ + + G L+ + + N
Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPE-SHKGERRPLRREALNPLASFRWA 206

Query: 213 R--KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQGMVHLDVATAANAVTVISIAN-L 265
R ++AV F+ + L+VI + H D T ++ I + L
Sbjct: 207 RGMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSL 261

Query: 266 SGRLVLGILSDKISRIRVITIGQVVSLVGMAALLFA 301
+ ++ G ++ ++ R + +G + G L FA
Sbjct: 262 AQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297



Score = 34.0 bits (78), Expect = 9e-04
Identities = 31/119 (26%), Positives = 51/119 (42%), Gaps = 2/119 (1%)

Query: 270 VLGILSDKISRIRVITIGQVVSLVGMAALLFAPLNALTFFAAIACVAFNFGGTITVFPSL 329
VLG LSD+ R V+ + + V A + AP + + I VA G T V +
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI--VAGITGATGAVAGAY 119

Query: 330 VSEFFGLNNLAKNYGVIYLGFGIGSICGSLIASLFGGFYVTFCVIFALLILSLALSTTI 388
+++ + A+++G + FG G + G ++ L GGF A + L T
Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20475  SACTRNSFRASE  34     1e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.8 bits (77), Expect = 1e-04
Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 5/52 (9%)

Query: 76 VAPGATRQGIGRALLDEVKQ-----HYAWLSLEVYQKNESAVSFYHAQGFRI 122
VA ++G+G ALL + + H+ L LE N SA FY F I
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20485  OMPADOMAIN    118    6e-34    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 118 bits (296), Expect = 6e-34
Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 11/124 (8%)

Query: 108 LNMPNNVTFDSNSANLKPAGANTLTGVAMVLKEYEKT--AVNVVGYTDSTGSKDLNMRLS 165
+ ++V F+ N A LKP G L + L + +V V+GYTD GS N LS
Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274

Query: 166 QQRADSVASALITQGVAANRIRTTGMGPANPIASNSTAEGK---------AQNRRVEITL 216
++RA SV LI++G+ A++I GMG +NP+ N+ K A +RRVEI +
Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334

Query: 217 SPLQ 220
++
Sbjct: 335 KGIK 338


112  D364_RS20840  D364_RS20875  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS20840  0       17       -2.921715   bifunctional phosphopantothenoylcysteine
D364_RS20845  0       23       -5.676780   dUTP diphosphatase
D364_RS20850  1       29       -7.714392   nucleoid occlusion factor SlmA
D364_RS20855  1       33       -8.770358   orotate phosphoribosyltransferase
D364_RS20860  2       45       -12.458765  ribonuclease PH
D364_RS20865  2       43       -12.461356  ABC transporter permease
D364_RS20870  1       43       -11.959950  ABC transporter ATP-binding protein
D364_RS20875  2       39       -10.328235  HlyD family efflux transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20840  UREASE        30     0.020    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 30.1 bits (68), Expect = 0.020
Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 15/55 (27%)

Query: 74 GHIELGKWADLVILAPA----------TADLIARVAAGMANDLVSTICLATPSPV 118
G +E+GK ADLV+ PA IA G N + TP PV
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNA-----SIPTPQPV 473


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20850  HTHTETR       52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.9 bits (124), Expect = 1e-10
Identities = 38/185 (20%), Positives = 72/185 (38%), Gaps = 15/185 (8%)

Query: 1 MAEK-QTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMF 59
MA K + + R+ IL AL L S G + ++A + GV+ A+Y HF K+ +F
Sbjct: 1 MARKTKQEAQETRQHILDV-ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59

Query: 60 DSLIEFIEDSLITRIN-LILKDEKDTTARLRLIVLLILGFGERNPGLTRILT-------G 111
+ E E ++ K D + LR I++ +L ++
Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 112 HALMFEQDRLQGRIN-QLFERIEAQLRQVMREKKMREGEGYTLDETLLASQLLAFCEGML 170
M + Q + + ++RIE L+ + K + L A + + G++
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPAD----LMTRRAAIIMRGYISGLM 175

Query: 171 SRFVR 175
++
Sbjct: 176 ENWLF 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20865  ABC2TRNSPORT  45     2e-07    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 44.5 bits (105), Expect = 2e-07
Identities = 35/180 (19%), Positives = 67/180 (37%), Gaps = 6/180 (3%)

Query: 175 IMGSILSTTLILMTALSITRERENGALENLLVSPLSGLEVIIGKITPFVIIGLFQATLIL 234
+ S ++ + R E +L + L ++++G++ I
Sbjct: 74 VATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIG 133

Query: 235 IAAVLLFDIPLHGSVFLLFFVLLIYVFLCLSIGIGISGLAQNQLQALQMSSFYFIPSLML 294
+ A L S+ V+ + S+G+ ++ LA + + + P L L
Sbjct: 134 VVAAALGYTQWL-SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFL 192

Query: 295 SGFVSPFISMPDWAKAIGSCLPLTYFIRLVKGIMLKGYSATALLPDLLPLIGLAVIVIGV 354
SG V P +P + LPL++ I L++ IML D+ +G I I +
Sbjct: 193 SGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-----VDVCQHVGALCIYIVI 247


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS20875  RTXTOXIND     58     2e-11    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 57.5 bits (139), Expect = 2e-11
Identities = 48/351 (13%), Positives = 121/351 (34%), Gaps = 85/351 (24%)

Query: 21 IERILINKGDNVAAGQELVKIESFDA-------QNIFLRAEEKLSAESALLRNLESGERP 73
++ I++ +G++V G L+K+ + A Q+ L+A + + L R++E + P
Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166

Query: 74 E-----------------------------------------------ELDIIRSQIKKA 86
E E + ++I +
Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY 226

Query: 87 QSAESQVKRQLGRYRNLYANHAISLAEWEDIRDELTQKGAQVEEL---INQLKARQLPAR 143
++ K +L + +L AI+ + ++ + ++ + Q+++ L A+
Sbjct: 227 ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286

Query: 144 Q--------------DEISKQRSMVAAAKLERDKALWDVQQTTIVSPVNAKVFDI-IYRA 188
+ D++ + + LE K Q + I +PV+ KV + ++
Sbjct: 287 EEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTE 346

Query: 189 GERPSAGKPIISLLPPEN-IKVRFFIPEAKLGKFKIGSKVKLICDG----CAEPIAGIIN 243
G + + ++ ++P ++ ++V + +G +G + + + G +
Sbjct: 347 GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVK 406

Query: 244 YISPEA---EFTPPVIYSTKRREKLIFMAEAIPALQQAGRMKIGQPFDVEI 291
I+ +A + V E+ + + + G EI
Sbjct: 407 NINLDAIEDQRLGLVFNVIISIEE-----NCLSTGNKNIPLSSGMAVTAEI 452


113  D364_RS21125  D364_RS21175  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
D364_RS21125  2       19       1.406858    MFS transporter
D364_RS21130  0       16       1.377785    TetR/AcrR family transcriptional regulator
D364_RS21135  -2      16       2.867023    GNAT family N-acetyltransferase
D364_RS21145  -2      18       3.502250    hypothetical protein
D364_RS27635  -1      20       4.208955    hypothetical protein
D364_RS27640  -1      21       4.537983    multidrug efflux RND transporter periplasmic
D364_RS21165  -1      22       5.026285    multidrug efflux RND transporter permease
D364_RS21170  -1      22       4.776610    multidrug efflux RND transporter outer membrane
D364_RS21175  1       20       4.183612    multidrug efflux MFS transporter
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21130  TCRTETB       138    4e-38    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 138 bits (350), Expect = 4e-38
Identities = 94/418 (22%), Positives = 180/418 (43%), Gaps = 19/418 (4%)

Query: 20 LLLVMLLSALDQTIVSTALPTIVGELDGL-DKLSWVVTAYILSSTIAVPLYGKFGDLFGR 78
L ++ S L++ +++ +LP I + + +WV TA++L+ +I +YGK D G
Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78

Query: 79 KIVLQVAIGLFLVGSALCGLAQNMTQLVLM-RGLQGLGGGGLMVISMAAVADVIPPANRG 137
K +L I + GS + + + L++M R +QG G + M VA IP NRG
Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138

Query: 138 RYQGLFGGVFGLATVIGPLIGGFLVQHASWRWIFYINLPLGLFALLVIGAVFHSSNKRSQ 197
+ GL G + + +GP IGG + + W ++ +P+ + R +
Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIK 196

Query: 198 HQIDWLGAIYLSMALLCIILFTSEGGSVHAWNDPQLWCILAFGIVGIIGFIYEERMAAEP 257
D G I +S+ ++ +LFT+ L ++ + F+ R +P
Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHIRKVTDP 246

Query: 258 IIPLALFRNRSFLLCSLIGFVIGMSLFGSVTFLPLYLQVVKEATPTEAGLQLI-PLMGGL 316
+ L +N F++ L G +I ++ G V+ +P ++ V + + E G +I P +
Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306

Query: 317 LLTSIISGRIISRTGKYRLFPILGTLLGVTGMVLLTRITIHSPLWQLYLFTGVLGAGLGL 376
++ I G ++ R G +G + + + + + + VLG GL
Sbjct: 307 IIFGYIGGILVDRRGP-LYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSF 364

Query: 377 VMQVLVLAVQNAMPAQMYGVATSGVTLFRSIGGSIGVALFGAVFTHVLQSNLQQLLPE 434
V+ V +++ Q G S + + G+A+ G + + + Q+LLP
Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS--IPLLDQRLLPM 420


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21135  HTHTETR       72     7e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 7e-18
Identities = 33/175 (18%), Positives = 70/175 (40%), Gaps = 10/175 (5%)

Query: 12 RPGRPRGKKPGTANREQLMDIALTLFARDGAGRVSLNAIAKEAGVTPAMLHYYFSSRDAL 71
R + ++ R+ ++D+AL LF++ G SL IAK AGVT ++++F + L
Sbjct: 3 RKTKQEAQE----TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58

Query: 72 VTQLIEERFMPLRNHISRIFVDHPQDPVL----ALTMMVETLAHMAEKNAWFAPLWM-QE 126
+++ E + P DP+ L ++E+ + ++ E
Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118

Query: 127 IIGEMPILRQHMDARFGEERFQVMLGTVRRWQQEGKINPALAPELLFTTVISLVL 181
+GEM +++Q E + + T++ + + L + +
Sbjct: 119 FVGEMAVVQQ-AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21145  SACTRNSFRASE  36     1e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 36.5 bits (84), Expect = 1e-05
Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 6/72 (8%)

Query: 51 GLIAKRKGNW---LCIEYLWVSETTRGRGLGSELMQEAEQQAQAQGCSHLLVDTFSFQ-- 105
G I R NW IE + V++ R +G+G+ L+ +A + A+ L+++T
Sbjct: 78 GRIKIRS-NWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS 136

Query: 106 ALPFYQKLGYQL 117
A FY K + +
Sbjct: 137 ACHFYAKHHFII 148


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21165  RTXTOXIND     43     2e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 42.5 bits (100), Expect = 2e-06
Identities = 24/133 (18%), Positives = 51/133 (38%), Gaps = 10/133 (7%)

Query: 42 PVPVVSQLTGRTTAS-LSAEVRPQVGGIIQKRLFTEGDMVKAGQALYQIDPSSYRATWNE 100
V +V+ G+ T S S E++P I+++ + EG+ V+ G L ++ A +
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 101 AAAALKQAQALVASDCQKAQRYASLVRDNGVSRQDADDAASTCAQDKASV--------ES 152
++L QA+ Q R L + + D + ++ + +
Sbjct: 139 TQSSLLQARLEQTRY-QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197

Query: 153 KKAALESARINLN 165
+ +NL+
Sbjct: 198 WQNQKYQKELNLD 210


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21170  ACRIFLAVINRP  1146   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1146 bits (2966), Expect = 0.0
Identities = 583/1031 (56%), Positives = 754/1031 (73%), Gaps = 6/1031 (0%)

Query: 3 SRFFVRRPVFAWVIAILIMLAGVLAIRTLPVGQYPDVAPPAVKISATYTGASAETLENSV 62
+ FF+RRP+FAWV+AI++M+AG LAI LPV QYP +APPAV +SA Y GA A+T++++V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 63 TQVIEQQLTGLDHLLYFSSTSSSDGSVSITVTFEQGTDPDTAQVQVQNKVQQAESRLPSE 122
TQVIEQ + G+D+L+Y SSTS S GSV+IT+TF+ GTDPD AQVQVQNK+Q A LP E
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 VQQSGVTVEKSQSSFLLILAVYDKTNRATSSDISDWLVSNMQDPLARVEGVGSLQVFGAE 182
VQQ G++VEKS SS+L++ T DISD++ SN++D L+R+ GVG +Q+FGA+
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181

Query: 183 YAMRVWMDPTKLASYSLMPSDVQSAIEAQNVQVSAGKIGALPSSNAQQLTATVRAQSRLQ 242
YAMR+W+D L Y L P DV + ++ QN Q++AG++G P+ QQL A++ AQ+R +
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 243 TPDQFKAIIVKSQADGSVVRLSDVARVEMGSEDYTATANLNGHPAAGIAVMMAPGANALD 302
P++F + ++ +DGSVVRL DVARVE+G E+Y A +NG PAAG+ + +A GANALD
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 303 TATLVKSKIAEFQRQMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIILVVCVMYLFLQN 362
TA +K+K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 363 FRATLIPAVAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422
RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSVTIISAMMLS 482
++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFS+TI+SAM LS
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 483 VVVALTLTPALCGALL----SHSKPHTKGFFGAFNRLWRRTEAGYQRRVLGGLRRGAVMM 538
V+VAL LTPALC LL + + GFFG FN + + Y V L +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 539 GAYALICGAMALAMWKLPGSFLPVEDQGEIMVQYTLPAGATAVRTAEVRRQVTDWFLTKE 598
YALI M + +LP SFLP EDQG + LPAGAT RT +V QVTD++L E
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 599 KANTDVIFTVDGFSFSGSGQNAGMAFVSLKNWSQRKGDDNTAQAIALRATKELGTIRDAT 658
KAN + +FTV+GFSFSG QNAGMAFVSLK W +R GD+N+A+A+ RA ELG IRD
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 659 LFAMTPPSVDGLGQSNGFTFELMASGGTDRDSLMKLRSQLLAAANQS-SELQSVRANDLP 717
+ P++ LG + GF FEL+ G D+L + R+QLL A Q + L SVR N L
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 718 QMPQLQVDIDNNKAVSLGLSLSDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGESDARAV 777
Q ++++D KA +LG+SLSD+ T+S+A GGTYVNDFIDRGRVKK+Y+Q ++ R +
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 778 PSDLGKWFVRGSDNSMTPFSAFATTHWQYGPESLVRYNGSAAFEIQGENAAGFSSGAAMD 837
P D+ K +VR ++ M PFSAF T+HW YG L RYNG + EIQGE A G SSG AM
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 838 KMEKLADSLPAGSTWAWSGISLQEKLASGQAMSLYAISILVVFLCLAALYESWSVPFSVI 897
ME LA LPAG + W+G+S QE+L+ QA +L AIS +VVFLCLAALYESWS+P SV+
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 898 MVIPLGLLGAALAATLRGLSNDVYFQVALLTTIGLSSKNAILIVEFAESAVD-EGYSLSR 956
+V+PLG++G LAATL NDVYF V LLTTIGLS+KNAILIVEFA+ ++ EG +
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 957 AAIRAAQTRLRPIVMTSLAFIAGVLPLAIATGAGANSRVAIGTGIIGGTLTATLLAVFFV 1016
A + A + RLRPI+MTSLAFI GVLPLAI+ GAG+ ++ A+G G++GG ++ATLLA+FFV
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 1017 PLFFVLVKRLF 1027
P+FFV+++R F
Sbjct: 1022 PVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21180  TCRTETA       67     2e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 67.2 bits (164), Expect = 2e-14
Identities = 78/339 (23%), Positives = 130/339 (38%), Gaps = 25/339 (7%)

Query: 27 LPALPEITQQLQATSTQTQLSLTAALIGLGLGQLFFGP----LSDHIGRLKPLALSLLLF 82
+P LP + + L S L L Q P LSD GR L +SL
Sbjct: 25 MPVLPGLLRDLVH-SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83

Query: 83 IFSSAMCALTRDINMLIVWRFLQGFAGAGGSVLSRSIARDKYQGTLLTQFFALLMTVNGI 142
A+ A + +L + R + G GA G+V IA D G + F + G
Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGF 142

Query: 143 APVLSPVLGGYVITAFDWRILFWTMAAIGGVLLVMSLAILRETRPATAAHASRQRPGQPV 202
V PVLGG + F F+ AA+ G+ + +L E+ R+
Sbjct: 143 GMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNP-- 199

Query: 203 LKNRRFLRFCLIQAFMMA-----GLFSYIGSSSFVMQSE--YGMSAMQFSLLFGLNGI-G 254
L + R+ R + A +MA L + ++ +V+ E + A + GI
Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259

Query: 255 LIIAAMFFSRLARRFSAESLLRGGLTLAVSCAAIMLLFA---WLHLPVLALVGL--FFTV 309
+ AM +A R L G+ +A I+L FA W+ P++ L+
Sbjct: 260 SLAQAMITGPVAARLGERRALMLGM-IADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318

Query: 310 SLMSGISTVAGAEAMSAVDAAQSG--TASALMGTLMFVF 346
+L + +S E + + + + ++++G L+F
Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357


114  D364_RS21300  D364_RS21330  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS21300  -1      12       -1.198166  purine ribonucleoside efflux pump NepI
D364_RS21305  -1      12       0.132065   DNA-binding transcriptional regulator
D364_RS21310  -1      12       1.607499   DUF1198 family protein
D364_RS21315  -1      13       1.909999   hexose-6-phosphate:phosphate antiporter
D364_RS21320  -1      13       2.403473   MFS transporter
D364_RS21325  0       13       3.195788   signal transduction histidine-protein
D364_RS21330  0       13       2.522998   transcriptional regulator UhpA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21300  TCRTETA       32     0.003    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.5 bits (74), Expect = 0.003
Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 3/151 (1%)

Query: 65 AFVAMFSSLFITTVIGKTDRRYVVILFSLLLTLSCLLVSFADSFTLLLLGRACLGLALGG 124
A + + + + + RR V+++ + +++ A +L +GR G+ G
Sbjct: 53 ALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GA 111

Query: 125 FWAMSASLTMRLVPMRVVPKALSIIFGAVSIALVIAAPLGSFLGGLIGWRNVFNGAAVMG 184
A++ + + + + +V LG +GG F AA +
Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALN 170

Query: 185 VLCTLWVLKALP-SLPGESASQQQNMFGLLK 214
L L LP S GE ++ L
Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLA 201


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21320  TCRTETB       35     7e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.9 bits (80), Expect = 7e-04
Identities = 27/168 (16%), Positives = 62/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89

Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
F + +G S F ++ + F Q G + + + ++ P+ RG G
Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21325  TCRTETB       37     1e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.8 bits (85), Expect = 1e-04
Identities = 65/407 (15%), Positives = 130/407 (31%), Gaps = 58/407 (14%)

Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILASNVLTRSDIGLLATLFYITYGLSKFFSG 86
RH + IWL F+ N ++P+I + + T F +T+ + G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 87 IVSDRSDARYFMGLGLIATGVVNILFGFSSSLWAFALLWALNAFFQGWGS---PVCARLL 143
+SD+ + + G+I +++ S F L + F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPMVVGAAALHYGWRAGMTIAGCLAILAGLYLC 202
A Y + RG + L + +G + P + G A + W + I I
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV----- 182

Query: 203 WRLRDRPQAVGLPAVGDWRHDALEIAQQQEGAGMSRKAILTRYVLANPYIWLLSLCYVLV 262
P + L +I G + I+ + Y + VL
Sbjct: 183 ------PFLMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANSAVTMFELGGFI-----------GALVA 306
+++ R + + + + + + + + + GF+ A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 307 GWGSDKLFNGNRGPMNLIFAAGILLSVGGLWLMPFASYVMQAACFFTTGFFVFGPQMLI- 365
GS +F G + + GIL+ G + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352

Query: 366 --------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395
G++ + ++ AGA + ++L
Sbjct: 353 IIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21335  HTHFIS        64     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 2e-14
Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 5/116 (4%)

Query: 2 TTIALIDDHLIVRSGFAQLLGLEADFQVVAEFGSGREALTGLPGRGVQVCICDISMPDIS 61
TI + DD +R+ Q L + V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALIEQALNAGARGFLSKRCSPDELIAAVRT 114
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


115  D364_RS26470  D364_RS21550  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS26470  -2      11       0.411491   membrane protein insertase YidC
D364_RS21520  -1      11       0.645419   tRNA uridine-5-carboxymethylaminomethyl(34)
D364_RS21525  0       13       0.839221   GNAT family N-acetyltransferase
D364_RS21530  2       12       1.713018   MFS transporter
D364_RS21535  0       11       0.747597   HTH-type transcriptional regulator YidZ
D364_RS21540  0       10       0.170106   phosphopantetheinyl transferase
D364_RS21545  0       12       -0.829568  NAD(P)H-dependent oxidoreductase
D364_RS21550  -2      13       -1.287091  NCS2 family permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21525  60KDINNERMP   814    0.0      60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 814 bits (2105), Expect = 0.0
Identities = 476/549 (86%), Positives = 510/549 (92%), Gaps = 2/549 (0%)

Query: 1 MDSQRNLLIIALLFVSFMIWQAWEQDKNPQPQ-QQTTQTTTTAAGSAADQGVPASGQGKL 59
MDSQRNLL+IALLFVSFMIWQAWEQDKNPQPQ QQTTQTTTTAAGSAADQGVPASGQGKL
Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60

Query: 60 ITVKTDVLELTINTNGGDIEQALLLAYPKTLKSTEPFQLLETTPQFVYQAQSGLTGRDGP 119
I+VKTDVL+LTINT GGD+EQALL AYPK L ST+PFQLLET+PQF+YQAQSGLTGRDGP
Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120

Query: 120 DNPANGPRPLYNVDKEAFVLADGQDELVIPLTYTDKAGNVFTKTFTLKRGGYAVNVGYSV 179
DNPANGPRPLYNV+K+A+VLA+GQ+EL +P+TYTD AGN FTKTF LKRG YAVNV Y+V
Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180

Query: 180 QNASEKPLEVSTFGQLKQTAALPTSRDTQTGGLSTMHTFRGAAFSTADSKYEKYKFDTIL 239
QNA EKPLE+S+FGQLKQ+ LP DT + + +HTFRGAA+ST D KYEKYKFDTI
Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFA-LHTFRGAAYSTPDEKYEKYKFDTIA 239

Query: 240 DNENLNVSTKNGWVAMLQQYFTTAWVPRNNGTNNFYTANLGNGVVAIGYKSQPVLVQPGQ 299
DNENLN+S+K GWVAMLQQYF TAW+P N+GTNNFYTANLGNG+ AIGYKSQPVLVQPGQ
Sbjct: 240 DNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQ 299

Query: 300 TDKLQSTLWVGPAIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKFIHSFLGNWGFSII 359
T + STLWVGP IQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLK+IHSF+GNWGFSII
Sbjct: 300 TGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSII 359

Query: 360 VITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNP 419
+ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNP
Sbjct: 360 IITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNP 419

Query: 420 LGGCFPLIIQMPIFLALYYMLSASVELRHAPFILWIHDLSAQDPYYILPIIMGATMFFIQ 479
LGGCFPL+IQMPIFLALYYML SVELR APF LWIHDLSAQDPYYILPI+MG TMFFIQ
Sbjct: 420 LGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQ 479

Query: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVVYYIVSNLVTIIQQQLIYRGLEKRG 539
KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLV+YYIVSNLVTIIQQQLIYRGLEKRG
Sbjct: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRG 539

Query: 540 LHSREKKKS 548
LHSREKKKS
Sbjct: 540 LHSREKKKS 548


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21535  SACTRNSFRASE  38     5e-06    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 37.6 bits (87), Expect = 5e-06
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 3/55 (5%)

Query: 69 IVDVAVDPAHQGKGLGRLVMEKLVAWLDANAFDGSYV-TLVADVP--ELYAKFGF 120
I D+AV ++ KG+G ++ K + W N F G + T ++ YAK F
Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21540  TCRTETA       58     2e-11    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 58.3 bits (141), Expect = 2e-11
Identities = 67/311 (21%), Positives = 118/311 (37%), Gaps = 14/311 (4%)

Query: 5 LLCSFALVLLYPSGIDMYLVGLPRIAQDLGASEAQLHIAFSVYLAGMASAML----FAGR 60
L+ + V L GI + + LP + +DL S + + + LA A G
Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGA 65

Query: 61 IADRSGRKPVAIVGAAIFVIASLICAQAHTSSHFLIGRFIQGIGAGSCYVVAFAILRDTL 120
++DR GR+PV +V A + I A A IGR + GI G+ VA A + D
Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124

Query: 121 DDRRRAKVLSLLNGITCIIPVLAPVLGHLIMLKYPWQSLFYTMTGMGVMVAVLSVFILRE 180
D RA+ ++ V PVLG L M + + F+ + + + F+L E
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 181 TRPTAPPQAASPQHDAGESLLNRFFLSRLLITTLSVTVILTYVNVSPVLMMEEMGFDRGT 240
+ + S + ++ ++V I+ V P + G DR
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242

Query: 241 YSMAM------ALMAMISMAVSFSTPFALSLFNPRTLMLTSQVLFLAAGVTLSLATRQAV 294
+ A + S+A + T + R ++ + + L+ ATR +
Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302

Query: 295 TLIGLGMICAG 305
+ ++ +G
Sbjct: 303 AFPIMVLLASG 313


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS21560  TYPE3IMSPROT  31     0.013    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 30.5 bits (69), Expect = 0.013
Identities = 22/114 (19%), Positives = 43/114 (37%), Gaps = 3/114 (2%)

Query: 331 LTAVVVGILFLLVIFLSPLAGMVPGYAAAGALIYVGVLMTSSLARVKWSDLTEAVPA--- 387
L+ VV +L PL + A A ++ G L++ + + A
Sbjct: 72 LSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI 131

Query: 388 FITAVMMPFSFSITEGIALGFISYCVMKIGTGRLRELSPCVIIVSLLFVLKIVF 441
F ++ F SI + + L + + ++K L +L C I + +I+
Sbjct: 132 FSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILR 185


116  D364_RS23015  D364_RS23055  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS23015  -1      13       -1.587561  SDR family oxidoreductase
D364_RS23020  0       15       -1.225054  sugar-binding transcriptional regulator
D364_RS23025  1       15       -0.785181  23S rRNA pseudouridine(2604) synthase RluF
D364_RS23030  -1      14       -0.429360  DUF3811 domain-containing protein
D364_RS23035  0       14       -0.203047  ketopantoate/pantoate/pantothenate transporter
D364_RS23040  0       15       -1.125235  lysine-sensitive aspartokinase 3
D364_RS23045  1       14       -3.038610  glucose-6-phosphate isomerase
D364_RS23050  4       24       -4.815353  N-acetyltransferase
D364_RS23055  4       25       -5.823034  GNAT family N-acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23015  DHBDHDRGNASE  115    4e-33    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 115 bits (289), Expect = 4e-33
Identities = 79/271 (29%), Positives = 126/271 (46%), Gaps = 26/271 (9%)

Query: 7 LKDNVIIVTGGASGIGLAIVDELLSQGAHVQMIDIHGGDRHHNGDNYHF-------WPTD 59
++ + +TG A GIG A+ L SQGAH+ +D + + +P D
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 60 ISSATEVQQTIDAIIQRWSRIDGLVNNAGVNFPRLLVDEKAPAGRYELNEAAFEKMVNIN 119
+ + + + I + ID LVN AGV P L+ + L++ +E ++N
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLI---------HSLSDEEWEATFSVN 116

Query: 120 QKGVFFMSQAVARQMVKQRAGVIVNVSSESGLEGSEGQSCYAATKAALNSFTRSWSKELG 179
GVF S++V++ M+ +R+G IV V S + YA++KAA FT+ EL
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 180 KYGIRVVGVAPGILEKTGLRTPEYEEALAWTRNITVEQLREGYT---KNAIPIGRAGKLS 236
+Y IR V+PG E + W EQ+ +G K IP+ + K S
Sbjct: 177 EYNIRCNIVSPGSTETDMQWS-------LWADENGAEQVIKGSLETFKTGIPLKKLAKPS 229

Query: 237 EVADFVCYLLSARASYITGVTTNIAGGKTRG 267
++AD V +L+S +A +IT + GG T G
Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATLG 260


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23020  HTHFIS        28     0.045    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 28.3 bits (63), Expect = 0.045
Identities = 6/21 (28%), Positives = 12/21 (57%)

Query: 24 QAQIARELGIYRTTISRLLKR 44
Q + A LG+ R T+ + ++
Sbjct: 452 QIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23030  56KDTSANTIGN  25     0.037    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 25.3 bits (55), Expect = 0.037
Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 5/75 (6%)

Query: 2 ALPRITQKEMTEREQRELKTLLDRARIAHGRPLSNAETNSVKKEYIDKLMAQREAEAKKA 61
LP E + + +EL L+ R + ++NA N + ++ AQ++
Sbjct: 290 GLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQQQG---- 345

Query: 62 RQVKKQQAYKTDKEA 76
Q ++QQA T +EA
Sbjct: 346 -QGQQQQAQATAQEA 359


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23045  BCTERIALGSPD  32     0.007    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.007
Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 13/79 (16%)

Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHVALRNRSNTPIVVDGKDVMPEVN 121
AK +DL + + S + + D+ ++ + ++N IV DVM ++
Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334

Query: 122 AVLEKM-----KTFSEAII 135
V+ ++ + EAII
Sbjct: 335 RVIAQLDIRRPQVLVEAII 353


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23055  SACTRNSFRASE  27     0.035    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.8 bits (59), Expect = 0.035
Identities = 11/43 (25%), Positives = 14/43 (32%)

Query: 102 GHRYGEHIFHAVETRAKTAGESWLWLEVLAANPAARRFYERQG 144
G + H AK L LE N +A FY +
Sbjct: 103 KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145


117  D364_RS23440  D364_RS23500  N        Y        N        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS23440  -2      15       -0.421871  response regulator transcription factor
D364_RS23445  -1      14       0.351178   type IV toxin-antitoxin system AbiEi family
D364_RS23450  0       13       3.315979   nucleotidyl transferase AbiEii/AbiGii toxin
D364_RS23455  2       18       4.812797   carbohydrate kinase family protein
D364_RS23460  1       19       4.513066   ABC transporter substrate-binding protein
D364_RS23465  3       19       4.858330   ribose ABC transporter permease
D364_RS23470  3       18       5.132149   sugar ABC transporter ATP-binding protein
D364_RS23475  3       18       5.083084   hybrid sensor histidine kinase/response
D364_RS23480  2       18       3.901547   membrane protein
D364_RS23485  1       21       3.849929   phosphonate metabolism protein PhnP
D364_RS23490  0       22       4.309723   ribose 1,5-bisphosphokinase
D364_RS23495  1       22       4.462277   alpha-D-ribose 1-methylphosphonate
D364_RS23500  -1      22       4.594852   phosphonate C-P lyase system protein PhnL
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23440  HTHFIS        90     1e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 1e-22
Identities = 33/132 (25%), Positives = 62/132 (46%), Gaps = 1/132 (0%)

Query: 2 KPVILVVDDDRAMGELLSDVLGVHAFEVLVSQTGNDALTTVAQRADIALVLLDMILPDTH 61
ILV DDD A+ +L+ L ++V ++ +A D LV+ D+++PD +
Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-AGDGDLVVTDVVMPDEN 61

Query: 62 GLQVLQQLQRTRPELPVVMLSGLGSESDVVVGLEMGADDYIAKPFSSRVVVARVKAVLRR 121
+L ++++ RP+LPV+++S + + E GA DY+ KPF ++ + L
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 122 SGALAGEASGAG 133
+
Sbjct: 122 PKRRPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23460  SUBTILISIN    29     0.019    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 29.4 bits (66), Expect = 0.019
Identities = 16/65 (24%), Positives = 25/65 (38%), Gaps = 5/65 (7%)

Query: 55 KLAGDNVKVTLVSSGYDLGQQVAQIDNFIAAKVDMIIL---NAADSKGIGPAVKRAKEAG 111
L +KV + I I KVD+I + D + AVK+A +
Sbjct: 111 DLLI--IKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQ 168

Query: 112 IVVVA 116
I+V+
Sbjct: 169 ILVMC 173


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23465  PF00577       29     0.040    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 28.7 bits (64), Expect = 0.040
Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 2/40 (5%)

Query: 227 FVYGMSGLLSGLGGVMSASRLYSANGNLGVGYELDAIAAV 266
++G+ + GG A R + N G+G + A+ A+
Sbjct: 400 LLHGLPAGWTIYGGTQLADRYRAF--NFGIGKNMGALGAL 437


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23475  HTHFIS        56     4e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 55.6 bits (134), Expect = 4e-10
Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 8/121 (6%)

Query: 646 LVLEDEEDVRQTLCEQLHQLGWLTLETASGEEALQLLEASPDIALLISDLMLPGALSGAD 705
LV +D+ +R L + L + G+ T++ + + A L+++D+++P + D
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE-NAFD 64

Query: 706 VIHTARRRFPALPVLLISGQDLRPAQNPALPE--VEWLRKPF----TRAQLAQALSAAYA 759
++ ++ P LPVL++S Q+ A + ++L KPF + +AL+
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 760 R 760
R
Sbjct: 125 R 125


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23480  RTXTOXIND     30     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.002
Identities = 24/115 (20%), Positives = 47/115 (40%), Gaps = 12/115 (10%)

Query: 13 LSLTSLAARADIIDDAIGNIQQAINDAYNPGSSRSDDDDRYDDDGRYDDGRYQGS----- 67
L LT+L A AD + +Q + SRS + ++ + D+ +Q
Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184

Query: 68 -------RQQSRDSQRQYDERQRQLDERRRQLDERQRQLDRDRRQLESDQRRLDD 115
++Q Q Q +++ LD++R + +++R ++ RLDD
Sbjct: 185 LRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS23500  PF05272       28     0.027    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.027
Identities = 14/42 (33%), Positives = 19/42 (45%)

Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDSGHIHIRHGDEWVDLV 77
VVL G G GKSTL+ +L H I G + + +
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQI 639


118  D364_RS24730  D364_RS24790  N        Y        Y        Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS24730  3       24       7.506293   lipocalin family protein
D364_RS24735  3       24       7.045818   DUF1294 domain-containing protein
D364_RS24740  1       23       6.357061   CusA/CzcA family heavy metal efflux RND
D364_RS24745  1       25       6.033003   efflux RND transporter periplasmic adaptor
D364_RS24750  1       23       5.963042   cation efflux system protein CusF
D364_RS24755  1       22       4.199656   efflux transporter outer membrane subunit
D364_RS24760  0       21       3.531218   copper response regulator transcription factor
D364_RS24765  -1      21       3.171858   Cu(+)/Ag(+) sensor histidine kinase
D364_RS24770  -1      20       2.726946   DUF1778 domain-containing protein
D364_RS24775  -1      20       2.457795   GNAT family N-acetyltransferase
D364_RS24780  0       19       2.137869   cupin domain-containing protein
D364_RS24785  0       16       1.083715   *hypothetical protein
D364_RS24790  -1      15       -0.729873  ABC transporter permease
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS24770  BCTLIPOCALIN  233    1e-81    Bacterial lipocalin signature.
		>BCTLIPOCALIN#Bacterial lipocalin signature.

Length = 171

Score = 233 bits (595), Expect = 1e-81
Identities = 85/151 (56%), Positives = 111/151 (73%), Gaps = 1/151 (0%)

Query: 25 PKGVQPISGFDASRYLGKWYEVARLENRFERGLEQVTATYGARSDGGISVVNRGYDPVKK 84
P+ V+P+S F+ + YLGKWYEVARL++ FERGL QVTA Y R+DGGISV+NRGY K
Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79

Query: 85 RWNESDGKAYFTGAPTTAALKVSFFGPFYGGYNVIRLD-DDYQYALVSGPNRDYLWILSR 143
W E++GKAYF T LKVSFFGPFYG Y V LD ++Y YA VSGPN +YLW+LSR
Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139

Query: 144 TPTIPAAVKQDYLNTARELGFDVDRLVWIRQ 174
TPT+ + ++ ++E GFD +RL++++Q
Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQ 170


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS24780  ACRIFLAVINRP  679    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 679 bits (1753), Expect = 0.0
Identities = 223/1059 (21%), Positives = 436/1059 (41%), Gaps = 54/1059 (5%)

Query: 1 MIEWIIRRSVANRFLVMMAALFLSIWGTWTIIHTPVDALPDLSDVQVIVKTRYPGQAPQI 60
M + IRR + A+ L + G I+ PV P ++ V V YPG Q
Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56

Query: 61 VENQVTWPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119
V++ VT + M + + S G + + F+ GTDP A+ +V L
Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 120 KLPAGVSAEMGP-DATGVGWVFEYALVDRSGKHDLAELRSLQDWFLKYELKTIPNVSEVA 178
LP V + + + ++ V + ++ +K L + V +V
Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176

Query: 179 SVGGVVKEYQIVVDPMKLTQYGISLGEVKSALDASNQEAGGSSVELA------EAEYMVR 232
G +I +D L +Y ++ +V + L N + + + +
Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 233 ASGYLQTLDDFKNIVLKTGDNGVPVYLGDVARVQIGPEMRRGIAELNGEGEVAGGVVILR 292
A + ++F + L+ +G V L DVARV++G E IA +NG+ AG + L
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294

Query: 293 SGKNAREVISAVKAKLASLQSSLPEGVEVVTTYDRSQLIDRAIDNLSYKLLEEFIVVALV 352
+G NA + A+KAKLA LQ P+G++V+ YD + + +I + L E ++V LV
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 CALFLWHVRSALVAIISLPLGLCFAFIMMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412
LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 NAHKRLEEWEHQHPGEKLSNDTRWKIITEASVEVGPALFISLLIITLSFIPIFTLEGQGG 472
N + + E + P + ++ ++ AL ++++ FIP+ G G
Sbjct: 415 NVERVMME-DKLPP---------KEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 473 KLFGPLAFTKTWSMAGAALLAIVAIPILMGFWIRGRIPAESSNPLNRF----------LI 522
++ + T +MA + L+A++ P L ++ AE F +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPV-SAEHHENKGGFFGWFNTTFDHSV 523

Query: 523 RIYHPLLLKVLHWPKTTLLIALLSILTVAWPLNRVGGEFLPQINEGDLLYMPSTLPGISA 582
Y + K+L LLI L + + R+ FLP+ ++G L M G +
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583

Query: 583 AQAADMLQKTDKLIMT--VPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639
+ +L + + V VF G + + + LKP ++
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640

Query: 640 MTMEKIVEELDKTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTNLADIDAIAGQ 699
+ E ++ + + + +++ + I +G + Q
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 700 IEVVARSVPG-VTSALAERLVGGRYLNIDIHREKAARYGMTVGDVQLFVSSAIGGAMVGE 758
+ +A P + S L +++ +EKA G+++ D+ +S+A+GG V +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 759 TVEGVERYPINIRYPQSYRDSPETLRQLPILTPLKQQIVLADVAEVKVVTGPSMLKTENA 818
++ + ++ +R PE + +L + + + + + V G L+ N
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 819 RPTSWIYIDARDRDMVSVVHDLQQAIGKEVKLKPGISVSYSGQFELLERAIQKLKLMVPM 878
P+ I +A L + + KL GI ++G + + +V +
Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 879 TLMIIFVLLYLAFRRVGEALLIITSVPFALVGGIWFLYWMGFHLSVATGTGFIALAGVAA 938
+ +++F+ L + + ++ VP +VG + V G + G++A
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 939 EFGVVMLMYLRHAIEAEPSLENPQTFSVDKLDEALYRGAVLRVRPKAMTVAVIIAGLLPI 998
+ ++++ + + +E E + EA +R+RP MT I G+LP+
Sbjct: 939 KNAILIVEFAKDLMEKEGK----------GVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037
GAGS + + ++GGM++A LL++F +P + +
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS24795  GPOSANCHOR    31     0.012    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.8 bits (69), Expect = 0.012
Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 9/166 (5%)

Query: 140 RLKNLSEADRQNFFASEEARRAVHILLIANVSQSYFNQRLAAAQLQVANDTLQNYQQSYA 199
A + A + A A L + + +A+++ + A
Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263

Query: 200 FVEKQLLTGSTTVLALEQARGMIESTRADIAKRQGQLAQANNALQLLLGSYQHLPDDSAS 259
+EK L + I++ A+ A + + A + Q+L + Q L D +
Sbjct: 264 ELEKAL---EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320

Query: 260 SAVDLQGVTLPPSLSSAILLQRPDILEAEHSLQAANANIGAARAAF 305
S + + L ++ I EA S Q+ ++ A+R A
Sbjct: 321 SREAKKQL----EAEHQKLEEQNKISEA--SRQSLRRDLDASREAK 360


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS24805  HTHFIS        83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 82.6 bits (204), Expect = 2e-20
Identities = 35/117 (29%), Positives = 61/117 (52%)

Query: 2 KILIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTSDYDLLILDIMLPDVNGWD 61
IL+ +D+ L + L+ AG+ V + N + D DL++ D+++PD N +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IVRMLRTAGKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118
++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
D364_RS24810  BORPETOXINA  29     0.027    Bordetella pertussis toxin A subunit signature.
		>BORPETOXINA#Bordetella pertussis toxin A subunit signature.

Length = 269

Score = 29.4 bits (65), Expect = 0.027
Identities = 14/56 (25%), Positives = 27/56 (48%)

Query: 358 RFVGSPCRVTGDPLMLRRAISNLLSNAIRYTPAGQAVTIQLSESAETVRLVVENPG 413
R+V R +P RR++++++ +R P A + +ES+E + E G
Sbjct: 199 RYVSQQTRANPNPYTSRRSVASIVGTLVRMAPVIGACMARQAESSEAMAAWSERAG 254


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS24860  ABC2TRNSPORT  48     2e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 48.4 bits (115), Expect = 2e-08
Identities = 40/171 (23%), Positives = 71/171 (41%), Gaps = 7/171 (4%)

Query: 200 REREHGTIEHLLVMPITPFEIMLAKI-WSMGLVVLVVSGLSLILMVQGILQVPIEGSIPL 258
R T E +L + +I+L ++ W+ L +G+ ++ G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 259 FMLGV-ALSLFATTSIGIFMGTLARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQLVQD 317
+ L V AL+ A S+G+ + LA S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 318 IMLTMPTTHFVSLAQAILYRGASFAIVWPQFLTLL-AIGGVFFTIALLRFR 367
+P +H + L + I+ + + + F + ALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


119  D364_RS25445  D364_RS25470  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
D364_RS25445  -1      14       -0.459978  MDR efflux pump AcrAB transcriptional activator
D364_RS25450  -2      14       -0.659424  protein CreA
D364_RS25455                              two-component system response regulator CreB
D364_RS25460                              two-component system sensor histidine kinase
D364_RS25465                              cell envelope integrity protein CreD
D364_RS25470                              two-component system response regulator ArcA
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
D364_RS25445  DPTHRIATOXIN  29     0.020    Diphtheria toxin signature.
		>DPTHRIATOXIN#Diphtheria toxin signature.

Length = 567

Score = 29.3 bits (65), Expect = 0.020
Identities = 20/97 (20%), Positives = 44/97 (45%), Gaps = 3/97 (3%)

Query: 139 LGVTQSYTCKLEEISDFRNQMRVQFWRDFLGNSPS-IPPVLYGLHEPRPSLEK--DDEQE 195
+G S +++ D ++ + + G P + + G+ +P+ + DD+ +
Sbjct: 24 IGAPPSAHAGADDVVDSSKSFVMENFSSYHGTKPGYVDSIQKGIQKPKSGTQGNYDDDWK 83

Query: 196 VFYTTALTPEMANGHLQHAHPVTLEGGEYVMFTYEGL 232
FY+T + A + + +P++ + G V TY GL
Sbjct: 84 GFYSTDNKYDAAGYSVDNENPLSGKAGGVVKVTYPGL 120


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS25455  HTHFIS  98     1e-25    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 97.6 bits (243), Expect = 1e-25
Identities = 36/146 (24%), Positives = 64/146 (43%)

Query: 1 MQQPRIWLVEDEQSIADTLVYMLQQEGFQVSVFGRGLPALEAAAHQAPDVAILDVGLPDI 60
M I + +D+ +I L L + G+ V + A D+ + DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGFELCRRLLMRYPALPVLFLTARSDEVDKLLGLEIGADDYIAKPFSPREVCARVRTVLR 120
+ F+L R+ P LPVL ++A++ + + E GA DY+ KPF E+ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 RLQKFAAPSPVVRVGEFVLDEQAAAI 146
++ + L ++AA+
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAM 146


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
D364_RS25460  PF06580  38     9e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.5 bits (87), Expect = 9e-05
Identities = 28/94 (29%), Positives = 41/94 (43%), Gaps = 20/94 (21%)

Query: 379 AIDFTPQGGEIALAAEKRNEEVQLSVIDNGCGIPDYALERIFERFYSLPREDGHKSSGLG 438
I PQGG+I L K N V L V + G SL ++ +S+G G
Sbjct: 271 GIAQLPQGGKILLKGTKDNGTVTLEVENTG----------------SLALKNTKESTGTG 314

Query: 439 LAFVREVARLHHGD---INLHNRPEGGVVATLRL 469
L VRE ++ +G I L + +G V A + +
Sbjct: 315 LQNVRERLQMLYGTEAQIKLSEK-QGKVNAMVLI 347


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits    Score  E-value  Comments
D364_RS25470  HTHFIS  83     2e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 83.0 bits (205), Expect = 2e-20
Identities = 31/122 (25%), Positives = 60/122 (49%), Gaps = 1/122 (0%)

Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSENDINLVIMDINLPGK 60
M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELRE-QADVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119
N L +++ + D+ ++ ++ ++ + I E GA DY+ KPF+ EL L+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RT 121

Sbjct: 121 EP 122



 
Contact Sachin Pundhir for Bugs/Comments.