PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
A) Input parameters
Genome                        PeCan4.gbk
Threshold dinucleotide bias   2
Threshold codon bias          4
Threshold %GC bias            3
E-value (RPSBlast)            0.05
Genome (non-pathogenic)
 
C) Potential GIs and PAIs in CP002074
S.No  Start       End         Bias  Virulence  Insertion elements  Prediction
1     HPPC_00250  HPPC_00400  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_00250  2       13       0.223117   Proline/pyrroline-5-carboxylate dehydrogenase
HPPC_00255  4       17       -0.504410  hypothetical protein
HPPC_00260  4       18       -0.613749  hypothetical protein
HPPC_00265  3       15       0.662139   hypothetical protein
HPPC_00280  1       14       1.423868   hypothetical protein
HPPC_00285  1       13       1.479405   hypothetical protein
HPPC_00290  0       14       1.555817   hypothetical protein
HPPC_00305  2       14       2.556814   hypothetical protein
HPPC_00335  1       14       2.471316   urease accessory protein
HPPC_00340  4       22       3.032744   Urease accessory protein UreG
HPPC_00345  4       22       2.739821   urease accessory protein UreF
HPPC_00350  4       22       2.441197   urease accessory protein UreE
HPPC_00355  3       20       2.400003   urease accessory protein / pH-dependent
HPPC_00360  1       19       1.961266   hypothetical protein
HPPC_00365  1       17       2.706579   urease subunit beta
HPPC_00370  -2      11       1.944791   urease subunit alpha
HPPC_00375  0       15       2.134611   *lipoprotein signal peptidase
HPPC_00380  0       16       2.450059   phosphoglucosamine mutase
HPPC_00385  1       16       1.481296   30S ribosomal protein S20
HPPC_00390  2       16       1.478419   peptide chain release factor 1
HPPC_00395  2       17       1.046384   putative Outer membrane protein
HPPC_00400  2       16       0.880133   hypothetical protein
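The three bias columns can be set against the input thresholds listed under "A) Input parameters" (dinucleotide 2, codon 4, %GC 3). This report does not state how PredictBias applies those thresholds, so the rule in the sketch below (absolute value at or above the threshold) is an assumption made purely for illustration. The names OrfBias, THRESHOLDS and exceeded are hypothetical, and the three sample rows are read from the island 1 table.

```python
from typing import NamedTuple


class OrfBias(NamedTuple):
    """One row of the per-ORF table: locus tag plus the three bias columns."""
    locus_tag: str
    dn_bias: int      # DNBias column
    cdn_bias: int     # CDNBias column
    gc_bias: float    # %GCBias column


# Thresholds as listed in the input parameters of this report.
THRESHOLDS = {"dn_bias": 2, "cdn_bias": 4, "gc_bias": 3}


def exceeded(orf, thresholds=THRESHOLDS):
    """Return the bias columns whose absolute value meets the threshold.

    ASSUMPTION: "abs(value) >= threshold" is an illustrative rule only;
    the report does not document the comparison PredictBias performs.
    """
    return [
        name
        for name, limit in thresholds.items()
        if abs(getattr(orf, name)) >= limit
    ]


rows = [
    OrfBias("HPPC_00290", 0, 14, 1.555817),
    OrfBias("HPPC_00340", 4, 22, 3.032744),
    OrfBias("HPPC_00370", -2, 11, 1.944791),
]

for orf in rows:
    print(orf.locus_tag, exceeded(orf))
```

Under this assumed rule, HPPC_00340 meets all three thresholds, while HPPC_00290 meets only the codon-bias threshold.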
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00250  ANTHRAXTOXNA  31     0.036    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 30.9 bits (69), Expect = 0.036
Identities = 36/173 (20%), Positives = 71/173 (41%), Gaps = 19/173 (10%)

Query: 121 QEESQLKERILKRKNEKIILNVNFIGEEVLGEEEANARFEKY---SQALKSNYIQYISIK 177
Q+ S+ ++ + + EK+ F+ E+ + + Y S+ K Y +
Sbjct: 118 QDLSEEEKNSMNSRGEKVPFASRFVFEKKRETPKLIINIKDYAINSEQSKEVYYEIGKGI 177

Query: 178 ITTIFSQINILDFEY-----SKKEIVKRLDALYALALEEEKKQGMPKFINLDMEEFRDLE 232
I S+ LD E+ S + D L++ +E K + K I+++ ++
Sbjct: 178 SLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKE-KLELNNKSIDINF-----IK 231

Query: 233 LTVESFMESIAK-----FDLNAGIVLQAYIPDSYEYLKKLHAFSKERVLKGLK 280
+ F + + F + VL+ Y PD +EY+ KL E++ + LK
Sbjct: 232 ENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFEYMNKLEKGGFEKISESLK 284
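
Each hit block opens with a statistics header in the usual BLAST text layout (a Score/Expect line followed by an Identities/Positives/Gaps line). The following is a minimal Python sketch for reading those two lines and checking a hit against the RPSBlast E-value cutoff of 0.05 given in the input parameters. The function names are hypothetical and are not part of PredictBias, and treating the cutoff as "less than or equal" is an assumption. The percentages printed in this report are consistent with integer truncation (36/173 is shown as 20%, 19/173 as 10%), so the sketch truncates as well.

```python
import re

SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s+bits\s+\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_hit_stats(score_line, counts_line):
    """Parse the two statistics lines that precede an alignment."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError(f"not a Score/Expect line: {score_line!r}")
    expect = match.group(3)
    # Some BLAST text reports abbreviate 1e-100 as "e-100".
    if expect.startswith("e"):
        expect = "1" + expect
    stats = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "evalue": float(expect),
    }
    # Gaps may be absent (ungapped alignment); only present fields are stored.
    for name, num, den in COUNT_RE.findall(counts_line):
        stats[name.lower()] = (int(num), int(den))
    return stats


def percent(count):
    """Truncated integer percentage of a (numerator, denominator) pair."""
    num, den = count
    return 100 * num // den


def passes_cutoff(stats, cutoff=0.05):
    """ASSUMPTION: a hit is kept when its E-value is <= the cutoff."""
    return stats["evalue"] <= cutoff


stats = parse_hit_stats(
    "Score = 30.9 bits (69), Expect = 0.036",
    "Identities = 36/173 (20%), Positives = 71/173 (41%), Gaps = 19/173 (10%)",
)
print(percent(stats["identities"]), passes_cutoff(stats))  # 20 True
```

An E-value of 0.036 is close to the 0.05 cutoff, so a hit like this one sits at the weak end of what the report lists.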


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00265  GPOSANCHOR    31     0.004    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 30.8 bits (69), Expect = 0.004
Identities = 30/190 (15%), Positives = 57/190 (30%), Gaps = 3/190 (1%)

Query: 55 KENEKISGLENANDQLWQAKDKLTKENTELTHKNAVLTEKTAELKTENDKLNHLVIALNN 114
K + + L+ N L L N ELT + + EK + + + L
Sbjct: 61 KFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEA 120

Query: 115 EQGSLKQERAKLQDEHGFLEKRCTNLEKENQRLTDKLKQLESAQKNLENSKNQLLQAREK 174
+ L++ + + LE E L + LE A + N +
Sbjct: 121 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180

Query: 175 IAEEKTELEREMARLKSLEATDKSELDLQNRR---FKSAIEDLKRQNRKLEEENIALKER 231
+ EK LE A L+ + + + ++ L + LE+
Sbjct: 181 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNF 240

Query: 232 VDGLKEQLSK 241
++
Sbjct: 241 STADSAKIKT 250



Score = 30.4 bits (68), Expect = 0.006
Identities = 40/234 (17%), Positives = 84/234 (35%), Gaps = 6/234 (2%)

Query: 16 EELEDRIGELENENAELFTTKEKLTKENTDLAYKNNKLFKENEKISGLENANDQLWQAKD 75
E++++R + E EN L L+ N L N++L +E + + ++
Sbjct: 53 EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELS---NAKEKLRKNDKSLS 109

Query: 76 KLTKENTELTHKNAVLTEKTAELKTENDKLNHLVIALNNEQGSLKQERAKLQDEHGFLEK 135
+ + EL + A L + + + + L E+ +L +A L+
Sbjct: 110 EKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 169

Query: 136 RCTNLEKENQRLTDKLKQLESAQKNLENSKNQLLQAREKIAEEKTELEREMARLKSLEAT 195
T + + L + LE+ Q LE + + + + LE E A L + +A
Sbjct: 170 FSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 229

Query: 196 ---DKSELDLQNRRFKSAIEDLKRQNRKLEEENIALKERVDGLKEQLSKQPKPQ 246
+ + I+ L+ + LE L++ ++G +
Sbjct: 230 LEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283



Score = 30.0 bits (67), Expect = 0.008
Identities = 36/175 (20%), Positives = 57/175 (32%), Gaps = 3/175 (1%)

Query: 16 EELEDRIGELENENAELFTTKEKLTKENTDLAYKNNKLFKENEKISGLENANDQLWQAKD 75
+ LE L A+L E +T + K L E LE +L +A +
Sbjct: 144 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAA---LEARQAELEKALE 200

Query: 76 KLTKENTELTHKNAVLTEKTAELKTENDKLNHLVIALNNEQGSLKQERAKLQDEHGFLEK 135
+T + K L + A L L + N + + L+ E LE
Sbjct: 201 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260

Query: 136 RCTNLEKENQRLTDKLKQLESAQKNLENSKNQLLQAREKIAEEKTELEREMARLK 190
R LEK + + + K LE K L + + + L L+
Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLR 315



Score = 28.9 bits (64), Expect = 0.023
Identities = 42/237 (17%), Positives = 79/237 (33%), Gaps = 7/237 (2%)

Query: 12 SQIREELEDRIGELENENAELFTTKEKLTKENTDLAYKNNKLFKENEKISGLENANDQLW 71
+ + + LE K+ + A + + + G N +
Sbjct: 186 AALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADS 245

Query: 72 QAKDKLTKENTELTHKNAVLTEKTAELKTENDKLNHLVIALNNEQGSLKQERAKLQDEHG 131
L E L + A L + + + + L E+ +L+ E+A L+ +
Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305

Query: 132 FLEKRCTNLEKENQRLTDKLKQLESAQKNLENSKNQLLQAR-------EKIAEEKTELER 184
L +L ++ + KQLE+ + LE +R + E K +LE
Sbjct: 306 VLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEA 365

Query: 185 EMARLKSLEATDKSELDLQNRRFKSAIEDLKRQNRKLEEENIALKERVDGLKEQLSK 241
E +L+ ++ R ++ E K+ + LEE N L KE
Sbjct: 366 EHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEES 422


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00365  UREASE        1045   0.0      Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 1045 bits (2703), Expect = 0.0
Identities = 354/569 (62%), Positives = 443/569 (77%), Gaps = 4/569 (0%)

Query: 3 KISRKEYVSMYGPTTGDKVRLGDTDLIAEVEHDYTIYGEELKFGGGKTLREGMSQSN-NP 61
++SR Y +M+GPT GDKVRL DT+L EVE D+T +GEE+KFGGGK +R+GM QS
Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63

Query: 62 SKEELDLIITNALIVDYTGIYKADIGIKDGKIAGIGKGGNKDMQDGVKNNLSVGPATEAL 121
+D +ITNALI+D+ GI KADIG+KDG+IA IGK GN DMQ GV + VGP TE +
Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGV--TIIVGPGTEVI 121

Query: 122 AGEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGGTGPADGTNATTITPGRRNLKW 181
AGEG IVTAGG+D+HIHFI PQQI A SG+T M+GGGTGPA GT ATT TPG ++
Sbjct: 122 AGEGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIAR 181

Query: 182 MLRAAEEYSMNLGFLAKGNASNDASLADQIEAGAIGFKIHEDWGTTPSAINHALDVADKY 241
M+ AA+ + MNL F KGNAS +L + + GA K+HEDWGTTP+AI+ L VAD+Y
Sbjct: 182 MIEAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEY 241

Query: 242 DVQVAIHTDTLNEAGCVEDTMAAIAGRTMHTFHTEGAGGGHAPDIIKVAGEHNILPASTN 301
DVQV IHTDTLNE+G VEDT+AAI GRT+H +HTEGAGGGHAPDII++ G+ N++P+STN
Sbjct: 242 DVQVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTN 301

Query: 302 PTIPFTVNTEAEHMDMLMVCHHLDKSIKEDVQFADSRIRPQTIAAEDTLHDMGIFSITSS 361
PT P+TVNT AEH+DMLMVCHHL +I ED+ FA+SRIR +TIAAED LHD+G FSI SS
Sbjct: 302 PTRPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISS 361

Query: 362 DSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNFRIKRYLSKYTINPAIAHGISE 421
DSQAMGRVGEV RTWQTADK K++ GRLKEE GDNDNFR+KRY++KYTINPAIAHG+S
Sbjct: 362 DSQAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSH 421

Query: 422 YVGSVEVGKVADLVLWSPAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFAH 481
+GS+EVGK ADLVLW+PAFFGVKP+M++ GG IA + MGD NASIPTPQPV+YR MF
Sbjct: 422 EIGSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGA 481

Query: 482 HGKAKYDANITFVSQAAYDKGIKEELGLERQVLPVKNCR-NITKKDMQFNDTTAHIEVNP 540
+G+++ ++++TFVSQA+ D G+ LG+ ++++ V+N R I K M N T HIEV+P
Sbjct: 482 YGRSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDP 541

Query: 541 ETYHVFVDGKEVTSKPANKVSLAQLFSIF 569
ETY V DG+ +T +PA + +AQ + +F
Sbjct: 542 ETYEVRADGELLTCEPATVLPMAQRYFLF 570


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00400  TACYTOLYSIN   30     0.032    Bacterial thiol-activated pore-forming cytolysin sig...
		>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature.

Length = 574

Score = 29.9 bits (67), Expect = 0.032
Identities = 17/46 (36%), Positives = 24/46 (52%), Gaps = 3/46 (6%)

Query: 499 NADTGNTDTGNTDTGNTDDASNMNNG---NDDAGNANDDMSNGNDM 541
NAD+ +T NT+T T++ + + AG DDM N NDM
Sbjct: 33 NADSNKQNTANTETTTTNEQPKPESSELTTEKAGQKMDDMLNSNDM 78


2     HPPC_01505  HPPC_01710  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_01505  1       16       3.751082   50S ribosomal protein L21
HPPC_01510  1       16       3.725245   50S ribosomal protein L27
HPPC_01515  0       16       3.745435   dipeptide ABC transporter, periplasmic
HPPC_01520  0       14       4.326942   dipeptide permease protein
HPPC_01525  -1      12       3.683555   dipeptide transport system permease protein
HPPC_01530  -2      12       3.194265   dipeptide ABC transporter
HPPC_01535  -2      12       2.983574   dipeptide transport system atp-binding protein
HPPC_01540  -1      11       2.510477   GTPase ObgE
HPPC_01545  -2      12       1.859918   hypothetical protein
HPPC_01550  1       15       2.170632   hypothetical protein
HPPC_01555  0       14       0.805776   glutamate-1-semialdehyde aminotransferase
HPPC_01560  1       16       -0.009214  hypothetical protein
HPPC_01565  3       15       0.327658   hypothetical protein
HPPC_01570  2       13       0.551135   hypothetical protein
HPPC_01575  2       11       0.375401   hypothetical protein
HPPC_01590  0       11       -0.758066  DNA methylase
HPPC_01595  1       13       0.069158   hypothetical protein
HPPC_01600  0       14       -0.315542  ATP-binding protein
HPPC_01605  1       15       -1.036323  nitrite extrusion protein (narK)
HPPC_01610  2       16       -1.294171  putative heme iron utilization protein
HPPC_01615  0       16       -1.296125  arginyl-tRNA synthetase
HPPC_01620  1       13       -0.905015  Sec-independent protein translocase protein
HPPC_01625  0       12       -1.031484  guanylate kinase
HPPC_01630  0       12       -1.056938  poly E-rich protein
HPPC_01635  -2      13       -1.651461  nuclease NucT
HPPC_01640  0       13       -1.793381  outer membrane protein HorC
HPPC_01645  2       14       -2.080507  flagellar basal body L-ring protein
HPPC_01650  3       14       -1.680674  CMP-N-acetylneuraminic acid synthetase
HPPC_01655  2       13       -0.985474  CMP-N-acetylneuraminic acid synthetase (neuA)
HPPC_01660  2       12       -0.824878  flagellar biosynthesis protein G
HPPC_01665  1       12       0.687010   tetraacyldisaccharide 4'-kinase
HPPC_01670  1       14       1.886711   NAD synthetase
HPPC_01675  -1      17       2.619625   *ketol-acid reductoisomerase
HPPC_01680  0       16       1.586920   cell division inhibitor
HPPC_01685  2       17       1.224658   cell division topological specificity factor
HPPC_01690  2       15       1.147759   DNA processing chain A
HPPC_01695  1       19       1.231257   Holliday junction resolvase-like protein
HPPC_01700  2       22       0.392109   hypothetical protein
HPPC_01705  2       23       0.627653   cysteine-rich protein H
HPPC_01710  3       22       -0.207854  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01605  TCRTETA       44     5e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 44.0 bits (104), Expect = 5e-07
Identities = 55/271 (20%), Positives = 103/271 (38%), Gaps = 16/271 (5%)

Query: 28 LILSGSLTPHQSFQLGIAVLMGYVFGSFLIQFLSPLMSLESIAKISFGLIALSFLICYFD 87
L+ S +T H L + LM + L LS + +S A+ + I
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGA-LSDRFGRRPVLLVSLAGAAVDYAI--MA 91

Query: 88 SIPFFW-LWIWRFIAGVASSALMILVAPLSLPYVKENKRALVGGFIFSAVGIGSVFSGFV 146
+ PF W L+I R +AG+ + A + ++RA GF+ + G G V +
Sbjct: 92 TAPFLWVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150

Query: 147 LPWISSYNIKWAWIFLGGSCLIAFILSLTGLKN-HSLKKKSVKKEESAFKIPFHL----- 200
+ ++ + + F+ L H +++ +++E F
Sbjct: 151 GGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT 210

Query: 201 ---WLLLISCALNAIGFLPHTLFWVDYLIRHLNISPAIAGTSWAFFG-FGATLGSLISGP 256
L+ + + +G +P L WV + + G S A FG + ++I+GP
Sbjct: 211 VVAALMAVFFIMQLVGQVPAAL-WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 257 MAQKLGAKNANIFILILKSIACFLPIFFHQI 287
+A +LG + A + +I L F +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRG 300


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01625  PF05272       29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 8 LILSGPSGAGKSTLTKYL 25
++L G G GKSTL L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01630  IGASERPTASE   73     1e-15    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 72.8 bits (178), Expect = 1e-15
Identities = 41/218 (18%), Positives = 78/218 (35%), Gaps = 18/218 (8%)

Query: 162 LPTLNDQEEKEEEKEEVKETPQEEEKPKDNEIQEGETLKDEEVSKELETQ-------EEV 214
P+ + E K+E K + E+ + Q E K+ + + + TQ
Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 215 KEETQEQAKEQEPIKEETQEIKEEKQEKTQDSPSAQELEAMQELVKEIQENSNGQEDKKE 274
+ETQ ++ E+ ++ K E ++ + ++ QE + +Q + +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 275 TQESTEAPQEKETQENTEIPQETEKQELETPQEEKQESTETPQEKTQDVEIPQETPQENT 334
T E + T +TE P + +E P E VE P+ T T
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV----VENPENTTPATT 1207

Query: 335 ETPQESTETPQKETQEKKVQENHYESIEDIPEPVMAQA 372
+ P ++E+ K + H S+ +P V
Sbjct: 1208 Q-PTVNSES------SNKPKNRHRRSVRSVPHNVEPAT 1238



Score = 52.4 bits (125), Expect = 3e-09
Identities = 45/261 (17%), Positives = 93/261 (35%), Gaps = 16/261 (6%)

Query: 199 LKDEEVSKELETQEEVKEETQEQAKEQEPIKEETQEIKEEKQEKTQDSPSAQELEAMQEL 258
L + EV K +T + T + P E E P+ E
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 259 VKEIQENSNGQEDKKETQESTEAPQEKE--TQENTEIPQETEKQELETPQEEKQESTETP 316
V E + + +K E + Q +E + + + T+ E+ E +E+ T
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 317 QEKTQDVEIPQETPQENTET---PQESTETPQKETQEKKVQ--------ENHYESIEDIP 365
++T VE ++ E +T P+ +++ K+ Q + VQ + +I++
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 366 EPVMAQAMGEALPFLNESVAKIPNNENDTETPKKSVIKTPQEKEGSDKTSSPLELRLNLQ 425
A E S + P E+ T SV++ P+ + +
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQP---TVNSESS 1216

Query: 426 DLLKSLNQESLKNLLENKTLS 446
+ K+ ++ S++++ N +
Sbjct: 1217 NKPKNRHRRSVRSVPHNVEPA 1237



Score = 42.4 bits (99), Expect = 4e-06
Identities = 23/168 (13%), Positives = 47/168 (27%)

Query: 148 KALVQEEPNNEEQLLPTLNDQEEKEEEKEEVKETPQEEEKPKDNEIQEGETLKDEEVSKE 207
A E + EKEE+ + E QE K + E + + E
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 208 LETQEEVKEETQEQAKEQEPIKEETQEIKEEKQEKTQDSPSAQELEAMQELVKEIQENSN 267
+ + +E + + Q KE Q + + +V+ + +
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204

Query: 268 GQEDKKETQESTEAPQEKETQENTEIPQETEKQELETPQEEKQESTET 315
ES+ P+ + + +P E + +
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01645  FLGLRINGFLGH  193    4e-64    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 193 bits (492), Expect = 4e-64
Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSASGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEAEYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ + E S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01660  SACTRNSFRASE  28     0.017    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.017
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 102 RGETILKALEYIAFE---EFQLNSLHLEVMENNFKAIAFYEKNHYELEG 147
R + + AL + A E E L LE + N A FY K+H+ +
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


3     HPPC_02275  HPPC_02395  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_02275  -1      12       -3.581949  molybdenum ABC transporter ModB
HPPC_02280  -1      11       -4.035116  molybdenum ABC transporter ModD
HPPC_02285  -2      9        -2.305758  glutamyl-tRNA synthetase
HPPC_02290  -1      12       -3.029383  hypothetical protein
HPPC_02295  -2      13       -2.838163  outer membrane protein (omp22)
HPPC_02300  -2      12       -2.819763  type II adenine specific methyltransferase
HPPC_02305  0       15       -1.606166  type II adenine specific methyltransferase
HPPC_02310  0       18       -0.905627  GTP-binding protein
HPPC_02315  2       21       -2.706153  type II adenine specific DNA methyltransferase
HPPC_02320  6       19       0.389319   hypothetical protein
HPPC_02325  7       16       0.518003   type II restriction endonuclease
HPPC_02330  7       17       0.984925   type II DNA modification enzyme
HPPC_02350  8       19       1.414828   hypothetical protein
HPPC_02355  8       20       1.341412   catalase-like protein
HPPC_02360  10      25       1.390100   outer membrane protein HofC
HPPC_02365  7       21       -0.552193  hypothetical protein
HPPC_02370  6       23       -2.317409  hypothetical protein
HPPC_02375  5       25       -2.622257  outer membrane protein HopH
HPPC_02380  8       28       -3.271727  hypothetical protein
HPPC_02385  6       22       -2.202132  hypothetical protein
HPPC_02390  5       21       -1.756118  hypothetical protein
HPPC_02395  5       20       -1.864369  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02280  PF05272       30     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.4 bits (68), Expect = 0.009
Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 30 VVALLGESGAGKSTILRILAGLE 52
V L G G GKST++ L GL+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02310  TCRTETOQM     198    1e-57    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 198 bits (505), Expect = 1e-57
Identities = 116/461 (25%), Positives = 190/461 (41%), Gaps = 67/461 (14%)

Query: 3 NIRNIAVIAHVDHGKTTLVDGLLSQSGTFSEREKVDE--RVMDSNDLERERGITILSKNT 60
I NI V+AHVD GKTTL + LL SG +E VD+ D+ LER+RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 61 AIYYKDTKINIIDTPGHADFGGEVERVLKMVDGVLLLVDAQEGVMPQTKFVVKKALSFGI 120
+ +++TK+NIIDTPGH DF EV R L ++DG +LL+ A++GV QT+ + GI
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 121 CPIVVVNKIDKPAAEPDRVVDEVFDLF---------VAMGASDKQLDFPV-----VYAAA 166
I +NKID+ + V ++ + V + + +F
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 167 RDGYAMKSLDDE----------------------------KKNL--EPLFETILEHVPSP 196
D K + + K N+ + L E I S
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSS 241

Query: 197 SGSVDEPLQMQIFTLDYDNYVGKIGIARVFNGSVKKNESVLLMKSDGSKENGRITKLTGF 256
+ L ++F ++Y ++ R+++G + +SV + KE +IT++
Sbjct: 242 THRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI----SEKEKIKITEMYTS 297

Query: 257 LGLARTEIENAYAGDIVALAG--FNAMDV-GDSVVDPTNPMPLDPMHLEEPTMSVYFAVN 313
+ +I+ AY+G+IV L V GD+ + P +P P + +
Sbjct: 298 INGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENP----LPLLQTTVEPS 353

Query: 314 DSPLAGLEGKHVTANKLKDRLLKEMQTNIAMKCEEMGEGKFKVSGRGELQITILAENLRR 373
+ + D LL+ + + +S G++Q+ + L+
Sbjct: 354 KPQQREMLLDALLEISDSDPLLRYYVDSAT--------HEIILSFLGKVQMEVTCALLQE 405

Query: 374 E-GFEFSISRPEVIIKEENGVKCEPFEHLVIDTPQDFSGAI 413
+ E I P VI E K E H+ + P F +I
Sbjct: 406 KYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP-PNPFWASI 445



Score = 41.8 bits (98), Expect = 8e-06
Identities = 20/80 (25%), Positives = 30/80 (37%), Gaps = 1/80 (1%)

Query: 396 EPFEHLVIDTPQDFSGAIIERLGKRKAEMKAMNPMSDGYTRLEFEIPARGLIGYRSEFLT 455
EP+ I PQ++ K A + + + L EIPAR + YRS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595

Query: 456 DTKGEGVMNHSFLEFRPFSG 475
T G V + +G
Sbjct: 596 FTNGRSVCLTELKGYHVTTG 615


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02395  IGASERPTASE   30     0.044    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.0 bits (67), Expect = 0.044
Identities = 33/144 (22%), Positives = 48/144 (33%), Gaps = 6/144 (4%)

Query: 17 VDYGNKRVGLNNTWNNKDLENHWVISSYELRDTTEKPTHFPTSQAITKEKDIHSLNSVEP 76
VD G + L N DL N E R+ T T+ T I + N+ E
Sbjct: 962 VDLGAWKYKLRNVNGRYDLYN----PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEI 1017

Query: 77 NPTTNELKTQDLSPLEQARAEKLAK-LESEKLESEKEFLKAKEQE-QQRKAALKKKLEHE 134
E +A+ + E EK A E Q R+ A + K +
Sbjct: 1018 ARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVK 1077

Query: 135 RGNAGNIESQTKIEVGEDIPTQTQ 158
N +Q+ E E T+T+
Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETK 1101


4     HPPC_02505  HPPC_02705  Y     Y          Y                   Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_02505  2       13       -2.226225  urease-enhancing factor
HPPC_02510  2       11       -1.983215  hypothetical protein
HPPC_02515  0       11       -1.669043  glutamine synthetase
HPPC_02520  0       11       -2.765777  hypothetical protein
HPPC_02525  -2      8        -0.916192  50S ribosomal protein L9
HPPC_02530  -2      10       -1.091519  ATP-dependent protease peptidase subunit
HPPC_02535  -2      13       -1.815957  ATP-dependent protease ATP-binding subunit HslU
HPPC_02540  3       15       -2.162233  GTP-binding protein Era
HPPC_02545  3       16       -2.153983  hypothetical protein
HPPC_02550  6       17       -1.843190  hypothetical protein
HPPC_02555  9       18       -1.845640  IS606 transposase
HPPC_02560  9       18       -2.189703  cag pathogenicity island protein 1
HPPC_02565  9       18       -2.167029  cag pathogenicity island protein (cag3)
HPPC_02570  9       18       -2.981219  cag pathogenicity island protein (cag4)
HPPC_02575  8       22       -3.154838  cag pathogenicity island protein Beta
HPPC_02580  9       25       -4.263518  virB11-like cag pathogenicity islandencoded
HPPC_02585  8       27       -4.756303  cag pathogenicity island protein Z
HPPC_02595  10      26       -4.280396  hypothetical protein
HPPC_02605  11      27       -4.581438  cag pathogenicity island protein X
HPPC_02610  10      30       -4.814066  cag pathogenicity island protein W
HPPC_02615  12      29       -5.727300  cag island protein
HPPC_02620  12      31       -5.687143  cag pathogenicity island protein (cagU, cag11)
HPPC_02625  14      29       -6.162400  CAG pathogenicity island protein 12
HPPC_02630  9       24       -6.979339  CAG pathogenicity island protein 13
HPPC_02635  7       20       -6.462031  hypothetical protein
HPPC_02640  8       20       -5.445538  cag pathogenicity island protein (cagQ, cag14)
HPPC_02645  6       18       -4.180076  hypothetical protein
HPPC_02650  6       17       -3.113808  hypothetical protein
HPPC_02655  6       18       -2.580948  cag pathogenicity island protein (cagM, cag16)
HPPC_02660  6       20       -2.003575  cag pathogenicity island protein (cagN, cag17)
HPPC_02665  6       21       -2.487817  cag pathogenicity island protein L
HPPC_02670  5       21       -2.860575  cag pathogenicity island protein I
HPPC_02675  6       20       -3.587655  cag pathogenicity island protein H
HPPC_02680  7       23       -3.939642  hypothetical protein
HPPC_02685  7       22       -4.286811  cag pathogenicity island protein G
HPPC_02690  7       22       -3.266272  cag island protein
HPPC_02695  5       20       -2.348992  cag pathogenicity island protein E
HPPC_02700  3       18       -0.774425  cag pathogenicity island protein D
HPPC_02705  2       18       -0.168817  cag pathogenicity island protein C
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02505  VACJLIPOPROT  27     0.002    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 26.8 bits (59), Expect = 0.002
Identities = 12/32 (37%), Positives = 19/32 (59%), Gaps = 4/32 (12%)

Query: 1 MKIIRNSVFIGASLLGGCASV----ETRFDSL 28
MK+ +++ +G +LL GCAS + R D L
Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPL 32


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02535  HTHFIS        29     0.044    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.044
Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 51 TPKNILMIGSTGVGKTEIARRI---AKIMKLPFVKV 83
T +++ G +G GK +AR + K PFV +
Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02540  PF03944       30     0.009    delta endotoxin
		>PF03944#delta endotoxin

Length = 633

Score = 30.4 bits (68), Expect = 0.009
Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 68 LHHQEKLLNQCMLSQALKAMGDAELCVFLASVHDDLKGYEEFLSLCQKPHILALSKIDTA 127
L E+ LNQ + + + A +AEL A+V + + + FL+ + L+++
Sbjct: 94 LRETERFLNQRLNTDTV-ARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNT 152

Query: 128 THKQVLQKLQEYQQYASQFVDLVPLSAKKSQNLN 161
+ L +L ++Q Q + L+PL A+ + NL+
Sbjct: 153 MQQLFLNRLPQFQMQGYQLL-LLPLFAQAA-NLH 184


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02565  IGASERPTASE   32     0.009    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.6 bits (71), Expect = 0.009
Identities = 32/183 (17%), Positives = 59/183 (32%), Gaps = 9/183 (4%)

Query: 224 DEVCSPLRDEMVAMPTNDSVTQKPNIIAPYSLYRLKETNNANEAQPSPYATQTAPENSKE 283
D P +E +A + P P N+ E++ Q A E + +
Sbjct: 1006 DVPSVPSNNEEIARVDE-APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ 1064

Query: 284 KLIEELIANSQLVANEEEREKKLLAEKEKQEAELAKYKLKDLENQKKLKALEAELKKKNA 343
A S + AN + E + K+ + +E ++K K +K
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK----VETEKTQ 1120

Query: 344 KKPRVVEVPIPHQTSDSDKTMRVIKEKENYNGLLVDKETTIKRSYEGTLISENSYSKKTP 403
+ P+V P Q + +EN + + + +S T +K+T
Sbjct: 1121 EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE----PQSQTNTTADTEQPAKETS 1176

Query: 404 LNP 406
N
Sbjct: 1177 SNV 1179


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02605  TYPE4SSCAGX   859    0.0      Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 859 bits (2219), Expect = 0.0
Identities = 512/522 (98%), Positives = 515/522 (98%), Gaps = 1/522 (0%)

Query: 1 MEQAFFKKIVNCFCLGYLFLSSVIEAAP-DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 59
M QAFFKKIV CFCLGYLFLSS IEA DIKNFNRGRVKVVNKKIAYLGDEKPITIWTS
Sbjct: 1 MGQAFFKKIVGCFCLGYLFLSSAIEAVALDIKNFNRGRVKVVNKKIAYLGDEKPITIWTS 60

Query: 60 LDNVTVIQLEKDETISYITTGFNKGWNIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 119
LDNVTVIQLEKDETISYITTGFNKGW+IVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR
Sbjct: 61 LDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALMTR 120

Query: 120 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 179
DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL
Sbjct: 121 DYQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANL 180

Query: 180 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 239
ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA
Sbjct: 181 ENLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQA 240

Query: 240 EETIKQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 299
EE ++QRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD
Sbjct: 241 EEAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKD 300

Query: 300 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 359
NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE
Sbjct: 301 NFASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQRELIKQENLNTTAYINRVMMASNE 360

Query: 360 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 419
QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF
Sbjct: 361 QIINKEKIREEKQKIILDQAKALETQYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEIF 420

Query: 420 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 479
DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK
Sbjct: 421 DDGTFTYFGFKNITLQPAIFVVQPDGKLSMTDAAIDPNMTNSGLRWYRVNEIAEKFKLIK 480

Query: 480 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 521
DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK
Sbjct: 481 DKALVTVINKGYGKNPLTKNYNIKNYGELERVIKKLPLVRDK 522


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02615  PF04335       119    3e-35    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 119 bits (300), Expect = 3e-35
Identities = 43/205 (20%), Positives = 73/205 (35%), Gaps = 10/205 (4%)

Query: 27 KLNKANRTFKRAFYL---SMVLNIAAVTSIVMMMPLKKTDIFVYGIDRYTGEFKIVKRSD 83
KL A R+ K A+ + + L A V ++ + PLK + +V +DR TGE I +
Sbjct: 24 KLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLH 83

Query: 84 A-RQIVNSEAVVDSATSKFVSLLFGYSKNSLRDRKDQLMQYCDVSFQTQAMRMFNENIRQ 142
I EAV + +V G+ + + D +M Q + R + + Q
Sbjct: 84 GDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDNPQ 143

Query: 143 FVDKVRA-EAIISSNIQREKVKNSPLTRLTFFITIKITPDTMENYEYITKKQVTIYYDFA 201
+ A + I + +F +T T TI Y
Sbjct: 144 SPQNILANRTDVFVEI-KRVSFLGGNVAQVYFTKESVTGSNS----TKTDAVATIKYKVD 198

Query: 202 RGNSSQENLIINPFGFKVFDIQITD 226
S + + NP G++V +
Sbjct: 199 GTPSKEVDRFKNPLGYQVESYRADV 223


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02660  TYPE4SSCAGX   33     0.002    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 32.8 bits (74), Expect = 0.002
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 13/116 (11%)

Query: 24 AINTALLPSEYKKLVALGFKKLYQRHDDKEVTEEEKKFATNALREKLRNDRARAEQIQKN 83
A+N AL+ +Y++ L KKL D + EE+KK L ++ EQ QK
Sbjct: 112 AVNFALMTRDYQEF--LKTKKLIVDAPDPKELEEQKK--------ALEKEKEAKEQAQK- 160

Query: 84 IEAFEKKNNSSIQKKATKHKGLQELNETNANPLNGNPNSNSSTETKSNKDDNFDEM 139
A + K +++A L+ L +NP N + N N S K +++ D+M
Sbjct: 161 --AQKDKREKRKEERAKNRANLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQM 214


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02695  ACRIFLAVINRP  34     0.004    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 33.7 bits (77), Expect = 0.004
Identities = 21/88 (23%), Positives = 32/88 (36%), Gaps = 18/88 (20%)

Query: 19 EVQKRQFQKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSI-------FILFVT 71
+ K K+ EL+ +G+ +D F+ SI F +
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMK--VLYPYD--------TTPFVQLSIHEVVKTLFEAIML 350

Query: 72 VVLSVILF-QAYEPVLIVAIVIVLVALG 98
V L + LF Q LI I + +V LG
Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLG 378


5     HPPC_02870  HPPC_02970  Y     Y          N                   Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_02870  2       13       -1.489906  hypothetical protein
HPPC_02875  1       14       -0.867326  hypothetical protein
HPPC_02880  1       15       -0.816813  dihydroorotase
HPPC_02885  0       17       -2.875014  hypothetical protein
HPPC_02890  -2      15       -3.112741  hypothetical protein
HPPC_02895  -1      15       -2.445112  flagellar motor switch protein
HPPC_02900  0       13       -1.270611  endonuclease III
HPPC_02905  1       13       -0.872470  hypothetical protein
HPPC_02910  1       12       -0.635057  hypothetical protein
HPPC_02915  0       11       1.968538   aminodeoxychorismate lyase (pabC)
HPPC_02920  0       13       0.843916   2-oxoglutarate-acceptor oxidoreductase subunit
HPPC_02925  0       14       0.359659   2-oxoglutarate-acceptor oxidoreductase subunit
HPPC_02930  2       16       -1.002470  2-oxoglutarate-acceptor oxidoreductase subunit
HPPC_02935  3       19       -2.823559  2-oxoglutarate-acceptor oxidoreductase subunit
HPPC_02960  5       22       -3.470391  adenine specific DNA methyltransferase
HPPC_02965  4       20       -3.282006  modification methylase MjaII
HPPC_02970  4       23       -2.827617  type II restriction endonuclease TdeIII
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02870  TYPE3IMSPROT  30     0.006    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.006
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 87 LQSYSVMLFFNLLLLIDILGFLPFSIYHHFMASLIFSALFCGSLFLSSPLLGMIALVALS 146
L Y F L+L+ +LPFS S + + +L PLL + AL+A++
Sbjct: 45 LSDYYFEHFSKLMLIPAEQSYLPFSQ----ALSYVVDNVLLEFFYLCFPLLTVAALMAIA 100

Query: 147 SSLL 150
S ++
Sbjct: 101 SHVV 104


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02885  TONBPROTEIN   50     3e-09    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 50.0 bits (119), Expect = 3e-09
Identities = 24/57 (42%), Positives = 28/57 (49%)

Query: 83 APKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVE 139
P P +P P P P P IEKPKP+PKPKPKP K + +K VE
Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVE 118



Score = 45.4 bits (107), Expect = 9e-08
Identities = 38/228 (16%), Positives = 69/228 (30%), Gaps = 53/228 (23%)

Query: 84 PKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVEKVEE 143
P + P +P P P P P P E P KPKPKP+PK K V+KV+E
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP--------KPVKKVQE 108

Query: 144 KKVVEEKKEEKKIVEQKVEQKKIEEKKPVKKEFDPNQLSFLPKEVAPPRQENNKGLDNQT 203
+ + K E + ++ + + +
Sbjct: 109 QPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQ------ 162

Query: 204 RRDIDELYGEEFGDLGTAEKDFIRNNLRDIGRITQKYLEYPQVAAYLGQDGTNAVEFYLH 263
YP A L +G V+F +
Sbjct: 163 ---------------------------------------YPARAQALRIEGQVKVKFDVT 183

Query: 264 PNGDITDLKIIIGSEYKMLDDNTLKTIQIAYKDYPRPKTKTLIRIRVR 311
P+G + +++I+ M + ++ + +P + ++ I +
Sbjct: 184 PDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231



Score = 38.0 bits (88), Expect = 2e-05
Identities = 16/54 (29%), Positives = 23/54 (42%)

Query: 74 QDPSKNTQGAPKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKK 127
Q + +P P P P+ PKP KPKP+P K + +PK+
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112



Score = 33.8 bits (77), Expect = 6e-04
Identities = 14/56 (25%), Positives = 23/56 (41%)

Query: 74 QDPSKNTQGAPKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKKPN 129
+P + P+P P++ P P P PKP K + +PK +P +
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120



Score = 30.3 bits (68), Expect = 0.008
Identities = 12/52 (23%), Positives = 16/52 (30%)

Query: 75 DPSKNTQGAPKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPK 126
+P P + P P P P K E+PK + KP
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPAS 123


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02895  FLGMOTORFLIN  100    1e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 100 bits (250), Expect = 1e-30
Identities = 25/77 (32%), Positives = 47/77 (61%)

Query: 34 LICDYKNLLDMEIVFSAELGSTQIPLLQILRFEKGSVIDLQKPAGESVDTFVNGRVIGKG 93
+ D ++D+ + + ELG T++ + ++LR +GSV+ L AGE +D +NG +I +G
Sbjct: 50 AMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQG 109

Query: 94 EVMVFERNLAIRLNEIL 110
EV+V +R+ +I+
Sbjct: 110 EVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02900  OMS28PORIN    28     0.029    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 27.8 bits (61), Expect = 0.029
Identities = 28/112 (25%), Positives = 54/112 (48%), Gaps = 11/112 (9%)

Query: 27 NQTTELHHKNPYELLVATILSAQCTDARVNKITPKLFEKYPSVKDLAL-----ASLEEVK 81
N+ E+ K E A ++ + T +I + K P+ K+L L A +E+VK
Sbjct: 132 NKVVEMSKKAVQETQKAVSVAGEATFLIEKQI---MLNKSPNNKELELTKEEFAKVEQVK 188

Query: 82 EIIKSVSYFNNKSKHLINMAQKVVRDFKGVIPSTQKELMSLDGVGQKTANVV 133
E + + +++ + AQKV+ G+ PS + ++++ V + +NVV
Sbjct: 189 ETLMASERALDET---VQEAQKVLNMVNGLNPSNKDQVLAKKDVAKAISNVV 237


6  HPPC_03275  HPPC_03305  Y  N  N  Genomic Island
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_03275  2       11       1.796646   DNA gyrase subunit A
HPPC_03280  2       14       2.829105   diacylglycerol kinase
HPPC_03285  2       14       2.545944   hypothetical protein
HPPC_03290  3       15       2.994160   hypothetical protein
HPPC_03295  3       14       3.369473   hypothetical protein
HPPC_03300  2       14       3.466332   N-methylhydantoinase
HPPC_03305  1       13       3.654511   hydantoin utilization protein A
7  HPPC_03365  HPPC_03455  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_03365  3       11       1.045437   iron(III) dicitrate transport protein
HPPC_03370  -2      11       1.385390   flagellar biosynthesis protein FliP
HPPC_03375  -2      11       1.466894   hypothetical protein
HPPC_03380  -1      10       1.182216   bifunctional N-acetylglucosamine-1-phosphate
HPPC_03385  0       11       0.452792   hypothetical protein
HPPC_03390  -1      9        -0.146648  hypothetical protein
HPPC_03395  -1      11       -1.607594  ribonucleotide-diphosphate reductase subunit
HPPC_03400  -1      14       -3.017079  lipopolysaccharide biosynthesis protein wbpB
HPPC_03405  -1      17       -5.501376  hypothetical protein
HPPC_03410  2       19       -6.697770  methylated-DNA--protein-cysteine
HPPC_03415  0       16       -4.818945  integrase-recombinase protein
HPPC_03420  1       18       -4.306154  hypothetical protein
HPPC_03425  -1      11       -1.040042  hypothetical protein
HPPC_03430  -1      10       -0.811093  hypothetical protein
HPPC_03435  -1      10       0.176104   hypothetical protein
HPPC_03440  0       12       0.627729   aspartate aminotransferase
HPPC_03445  0       12       0.437951   outer membrane protein HorF
HPPC_03450  0       12       0.124866   hypothetical protein
HPPC_03455  2       14       -0.195501  anaerobic glycerol-3-phosphate dehydrogenase,
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_03370  FLGBIOSNFLIP  275    4e-96    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP

signature.
Length = 245

Score = 275 bits (705), Expect = 4e-96
Identities = 113/245 (46%), Positives = 162/245 (66%), Gaps = 2/245 (0%)

Query: 1 MRFFIFLILICPLICPLMSADSALPSVNLSLNAPSDPKQLVTTLNVIALLTLLVLAPSLI 60
MR + + + L A + LP + S P + + + +T L P+++
Sbjct: 1 MRRLLSVAPVL-LWLITPLAFAQLPGIT-SQPLPGGGQSWSLPVQTLVFITSLTFIPAIL 58

Query: 61 LVMTSFTRLIVVFSFLRTALGTQQTPPTQILVSLSLILTFFIMEPSLKKAYDTGIKPYMD 120
L+MTSFTR+I+VF LR ALGT PP Q+L+ L+L LTFFIM P + K Y +P+ +
Sbjct: 59 LMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE 118

Query: 121 KKISYTEAFEKSALPFKEFMLKNTREKDLALFFRIRNLPNPKTPDEVSLSVLIPAFMISE 180
+KIS EA EK A P +EFML+ TRE DL LF R+ N + P+ V + +L+PA++ SE
Sbjct: 119 EKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSE 178

Query: 181 LKTAFQIGFLLYLPFLVIDMVISSILMAMGMMMLPPVMISLPFKILVFILVDGFNLLTEN 240
LKTAFQIGF +++PFL+ID+VI+S+LMA+GMMM+PP I+LPFK+++F+LVDG+ LL +
Sbjct: 179 LKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGS 238

Query: 241 LVASF 245
L SF
Sbjct: 239 LAQSF 243


8  HPPC_03585  HPPC_03695  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_03585  2       13       -2.663468  NAD(P)H-flavin oxidoreductase
HPPC_03590  2       11       -1.982876  poly(A) polymerase
HPPC_03595  1       13       -1.641241  ExsB trans-regulatory protein
HPPC_03600  2       13       -1.468735  outer membrane protein HopH
HPPC_03605  2       14       -0.914270  hypothetical protein
HPPC_03610  3       13       -0.777337  RNA polymerase factor sigma-54
HPPC_03615  1       11       -0.068138  putative abc transporter, ATP-binding protein
HPPC_03620  1       11       -0.201522  hypothetical protein
HPPC_03625  1       10       -0.482759  DNA polymerase III subunits gamma and tau
HPPC_03630  1       13       1.279505   putative L-lysine exporter; putative membrane
HPPC_03635  1       13       2.822129   hypothetical protein
HPPC_03640  3       16       2.790169   hypothetical protein
HPPC_03645  2       14       2.738835   outer membrane protein SabB/HopO
HPPC_03650  1       13       3.063211   hypothetical protein
HPPC_03655  1       12       2.437892   L-asparaginase II
HPPC_03660  0       13       1.498927   anaerobic C4-dicarboxylate transporter
HPPC_03665  0       13       -0.080086  outer membrane protein SabA
HPPC_03670  1       14       -1.053170  putative Outer membrane protein
HPPC_03675  2       12       -1.035845  putative transcriptional regulator
HPPC_03680  2       11       -1.410241  tRNA(Ile)-lysidine synthase
HPPC_03695  2       10       -0.023801  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_03625  IGASERPTASE   32     0.007    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.007
Identities = 22/86 (25%), Positives = 30/86 (34%), Gaps = 4/86 (4%)

Query: 479 KIALKNHSENKNALEVVKEF---KFPYSKPKPNTETTAEMKENDTKEAVEKETKENDTKE 535
K KN + +E K T A+ +TKE ETKE T E
Sbjct: 1049 KTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGS-ETKETQTTETKETATVE 1107

Query: 536 TIKKETKEKEIKENDTKEVQETQPKE 561
+K E E + K + PK+
Sbjct: 1108 KEEKAKVETEKTQEVPKVTSQVSPKQ 1133


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_03635  SECA          28     0.012    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 28.3 bits (63), Expect = 0.012
Identities = 12/43 (27%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 71 RIARKNLSKMSEEDFKKMREEVRK--ELEEKTKGLSDEEIKAK 111
++ K ++ ++MR+ V +E + + LSDEE+K K
Sbjct: 4 KLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGK 46


9  HPPC_04390  HPPC_04420  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_04390  1       15       3.048923   hydrogenase nickel incorporation protein
HPPC_04395  2       13       3.005088   flagellar hook protein FlgE
HPPC_04400  2       14       2.619625   CDP-diacylglycerol pyrophosphatase
HPPC_04405  2       13       2.624065   alkylphosphonate uptake protein
HPPC_04410  2       14       2.264021   hypothetical protein
HPPC_04415  3       13       2.359528   hypothetical protein
HPPC_04420  3       16       1.964263   catalase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_04395  FLGHOOKAP1    42     7e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.9 bits (98), Expect = 7e-06
Identities = 13/49 (26%), Positives = 27/49 (55%)

Query: 669 SISGSKLESSNVDLSRSLTNLIVVQRGFQANSKAVTTSDQILNTLLNLK 717
+S + S V+L NL Q+ + AN++ + T++ I + L+N++
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 39.2 bits (91), Expect = 5e-05
Identities = 11/35 (31%), Positives = 20/35 (57%)

Query: 4 SLWSGVNGMQAHQIALDIESNNIANVNTTGFKYSR 38
+ + ++G+ A Q AL+ SNNI++ N G+
Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37


10  HPPC_05530  HPPC_05560  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_05530  1       18       -3.701289  F0F1 ATP synthase subunit B'
HPPC_05535  2       17       -3.120856  plasmid replication-partition related protein
HPPC_05540  2       18       -3.725906  SpoOJ regulator (soj)
HPPC_05545  3       18       -4.135378  biotin--protein ligase
HPPC_05550  3       18       -3.667359  methionyl-tRNA formyltransferase
HPPC_05555  2       18       -4.320922  hypothetical protein
HPPC_05560  3       15       -0.458716  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_05550  FERRIBNDNGPP  30     0.009    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 30.3 bits (68), Expect = 0.009
Identities = 11/33 (33%), Positives = 20/33 (60%)

Query: 70 DPEVQILKDLKPDFIVVVAYGKILPKEVLTIAP 102
+P +++L ++KP F+V A P+ + IAP
Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPSPEMLARIAP 118


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_05555  FbpA_PF05833  33     0.005    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 32.9 bits (75), Expect = 0.005
Identities = 22/140 (15%), Positives = 55/140 (39%), Gaps = 13/140 (9%)

Query: 342 QTKLNTESFKRIIETLRSKIKENQQKMRDKSKEMSRSFKLESTKNEIKEIRDLIDTANQQ 401
K ++ K L+ + N + K K ++ + K K+ K +L+
Sbjct: 289 YAKDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYA 348

Query: 402 IANHNEMIK----NIQNQKKICVEQTWKFLVNEFKS---DIQEYNKKHCGLEKGIKKFEN 454
+ I+ +N + + ++E K+ ++Q Y KK+ L+K +
Sbjct: 349 LKKGLSHIELANYYSENYDTVKIT------LDENKTPSQNVQSYYKKYNKLKKSEEAANE 402

Query: 455 EISEIEDKIKELENEIKELE 474
++ + E+++ L + + +
Sbjct: 403 QLLQNEEELNYLYSVLTNIN 422


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_05560  RTXTOXIND     42     4e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 41.7 bits (98), Expect = 4e-06
Identities = 26/184 (14%), Positives = 64/184 (34%), Gaps = 18/184 (9%)

Query: 37 LAQQKEFEKEVKEKRAQYQSHFKVLEQKEEALKEREKEQKAKFDDAVKQASALALQDERA 96
++E + + Q+ + QKE L ++ E+ + + ++ R
Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237

Query: 97 KIIEEARKNAFLEQQKGLELLQKELDEKSKQVQELHQKEAEIERLKRENNEAESRLKAEN 156
+ + LE + + E EL ++++E+++ E A+ +
Sbjct: 238 DDFSSLLHKQAIAKHAVLEQ-ENKYVEAV---NELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 157 EKKLNEKLETERERIEKALHEKNELKFKQQEEQLEMLRNELKNAQRKAELSSQQFQGEVQ 216
+ NE L+ R+ + +L + + +A +S +VQ
Sbjct: 294 QLFKNEILDKLRQTTDNI---------GLLTLELAKNEERQQASVIRAPVS-----VKVQ 339

Query: 217 ELAI 220
+L +
Sbjct: 340 QLKV 343


11  HPPC_06015  HPPC_06170  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_06015  2       11       -0.501909  DNA polymerase III subunit delta'
HPPC_06020  0       12       -0.431713  dihydropteroate synthase
HPPC_06025  1       13       0.091713   hypothetical protein
HPPC_06030  -1      12       1.055092   membrane transport protein
HPPC_06035  -1      11       1.238457   hypothetical protein
HPPC_06040  1       15       2.021217   hypothetical protein
HPPC_06045  1       14       2.422427   hypothetical protein
HPPC_06050  1       14       3.169680   carbamoyl phosphate synthase small subunit
HPPC_06055  -1      11       3.269304   formamidase
HPPC_06060  -1      11       2.170632   hypothetical protein
HPPC_06065  1       12       1.267023   hypothetical protein
HPPC_06070  -1      11       2.661317   hypothetical protein
HPPC_06075  0       10       2.119627   Maf-like protein
HPPC_06080  1       11       2.219760   alanyl-tRNA synthetase
HPPC_06085  2       19       2.020572   hypothetical protein
HPPC_06090  2       18       2.170899   hypothetical protein
HPPC_06095  3       18       2.387877   outer membrane protein (omp19)
HPPC_06100  -1      14       -1.889460  hypothetical protein
HPPC_06105  0       12       -1.154200  30S ribosomal protein S18
HPPC_06110  1       12       -1.265625  single-stranded DNA-binding protein
HPPC_06115  2       11       -1.829163  30S ribosomal protein S6
HPPC_06120  3       10       -1.188117  hypothetical protein
HPPC_06125  2       9        -0.738332  DNA polymerase III subunit delta
HPPC_06130  1       10       -0.512827  ribonuclease R
HPPC_06135  1       11       0.236424   shikimate 5-dehydrogenase
HPPC_06140  0       10       0.419698   hypothetical protein
HPPC_06145  0       11       0.996273   oligopeptide ABC transporter, permease protein
HPPC_06160  2       11       0.400654   tryptophanyl-tRNA synthetase
HPPC_06165  1       13       1.122198   biotin synthesis protein (bioC)
HPPC_06170  2       14       1.743382   preprotein translocase subunit SecG
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06060  VACJLIPOPROT  28     0.016    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 27.9 bits (62), Expect = 0.016
Identities = 19/83 (22%), Positives = 27/83 (32%), Gaps = 10/83 (12%)

Query: 69 LGLGSMF-----AGLGVVLAVPTLVGWLFGRSGLTAGERFVVFILGVFSLASLAGMIYCG 123
LG+G A + P G G G+ G + G F+L G +
Sbjct: 109 LGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADA 168

Query: 124 YGIITDFSALSAAMWWWSLSKIT 146
+ LS W S+ K T
Sbjct: 169 L-----YPVLSWLTWPMSVGKWT 186


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06085  PF05844       25     0.035    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 25.0 bits (54), Expect = 0.035
Identities = 13/65 (20%), Positives = 28/65 (43%), Gaps = 1/65 (1%)

Query: 10 SVLKANNPHFDKIFEKHNQLDDDIKTAEQQNASDAEVSHMKKQKLKLKDEIHSMIIEYRE 69
L+A F+ + I++ Q + +V + Q ++E+++ I + +
Sbjct: 197 VALRAAGRAFESRNGALQVANTVIQSFVQMANASVQVRQGESQASAREEEVNATIGQ-SQ 255

Query: 70 KQKSE 74
KQK E
Sbjct: 256 KQKVE 260


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06140  IGASERPTASE   32     0.001    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 32.3 bits (73), Expect = 0.001
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 3/86 (3%)

Query: 34 KKDSASISQNLEKTEIERQNSALSPKQEEANATTTATEESPTKDTAPPLETTAQEKETKQ 93
+ + + +QN E + + N + + E + + T+E+ T +T ET EKE K
Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK---ETATVEKEEKA 1112

Query: 94 ETKQEQEKENEPKQDSVSPTQNNQKA 119
+ + E+ +E VSP Q +
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSET 1138



Score = 30.0 bits (67), Expect = 0.007
Identities = 15/87 (17%), Positives = 36/87 (41%), Gaps = 3/87 (3%)

Query: 45 EKTEIERQNSALSPKQEEANATTTATEESPTKDTAPPLETTAQEKETKQETKQEQEKENE 104
+ E+ + S +SPKQE++ E + D ++ + T +T+Q ++ +
Sbjct: 1118 KTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS- 1176

Query: 105 PKQDSVSPTQNNQKALTTSTMGKKPLE 131
+ P + T +++ + P
Sbjct: 1177 --SNVEQPVTESTTVNTGNSVVENPEN 1201



Score = 28.9 bits (64), Expect = 0.016
Identities = 26/131 (19%), Positives = 48/131 (36%), Gaps = 29/131 (22%)

Query: 33 AKKDSASISQNLEKTEIERQNSALSPKQEEANATTTATEESPTKDT-------------A 79
AK+ +++ N + E+ + E TT T+E+ T +
Sbjct: 1069 AKEAKSNVKANTQTNEVAQS------GSETKETQTTETKETATVEKEEKAKVETEKTQEV 1122

Query: 80 PPLETTAQEKETKQETKQEQ---EKENEPKQDSVSPTQNNQK-------ALTTSTMGKKP 129
P + + K+ + ET Q Q +EN+P + P A TS+ ++P
Sbjct: 1123 PKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQP 1182

Query: 130 LEYKVAVSGVN 140
+ V+ N
Sbjct: 1183 VTESTTVNTGN 1193



Score = 27.7 bits (61), Expect = 0.035
Identities = 24/152 (15%), Positives = 48/152 (31%), Gaps = 13/152 (8%)

Query: 44 LEKTEIERQNSALSPKQEEANATTTA---TEESPTKDTAPPLETTAQEKETKQETKQEQE 100
L E+E++N + A + S ++ A E ++ +
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 101 KENEPKQDSVSPTQNNQKALTTSTMGKKPLEYKVAVSGVNVRAFPSTKGKIIGSLAKDKS 160
KQ+S + +N Q A T+ ++ + A + K +
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAK----------EAKSNVKANTQTNEVAQSG 1089

Query: 161 VKVLEIQNDWAKIEFSNEKKGYVFLKLLKKAE 192
+ E Q K + EK+ ++ K E
Sbjct: 1090 SETKETQTTETKETATVEKEEKAKVETEKTQE 1121


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06170  SECGEXPORT    49     5e-10    Protein-export SecG membrane protein signature.
		>SECGEXPORT#Protein-export SecG membrane protein signature.

Length = 110

Score = 48.8 bits (116), Expect = 5e-10
Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 3/84 (3%)

Query: 1 MTSALLGLQIVLAVLIVVVVLLQ--KSSSIGLGAYSGSNDSLFGAKGPASFMAKLTMFLG 58
M ALL + +++A+ +V +++LQ K + +G +G++ +LFG+ G +FM ++T L
Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60

Query: 59 LLFVANTIALGYFYNKEYGKSILD 82
LF ++ LG N +
Sbjct: 61 TLFFIISLVLGNI-NSNKTNKGSE 83


12  HPPC_06710  HPPC_06740  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_06710  0       11       -3.283195  signal-transducing protein, histidine kinase
HPPC_06715  2       14       -3.856809  putative transcriptional regulator
HPPC_06720  2       17       -4.073890  type IIS restriction enzyme M1 protein (mod)
HPPC_06725  2       20       -3.991504  type IIS restriction enzyme M2 protein (mod)
HPPC_06730  4       22       -3.820942  hypothetical protein
HPPC_06735  4       28       -4.320175  hypothetical protein
HPPC_06740  1       22       -3.533596  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06715  HTHFIS        91     2e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 90.7 bits (225), Expect = 2e-23
Identities = 37/118 (31%), Positives = 56/118 (47%), Gaps = 2/118 (1%)

Query: 1 MQK-KIFLLEDDYLLSESVKEFLEHLGYEVSCAFNGKEAYERLSVERFNLLLLDVQVPEM 59
M I + +DD + + + L GY+V N + ++ +L++ DV +P+
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 60 NSLELFRRIKNDFLISTPVIFITALQDNATLKNAFNLGASDYLKKPFDLDELEARIKR 117
N+ +L RIK PV+ ++A T A GA DYL KPFDL EL I R
Sbjct: 61 NAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


13  HPPC_06980  HPPC_07090  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_06980  -2      14       -3.085691  putative type I restriction enzyme specificity
HPPC_06985  -1      12       -0.809908  formyltetrahydrofolate hydrolase
HPPC_06990  0       14       -1.311591  Signal peptide protease IV (Protease IV)
HPPC_06995  0       19       -1.839811  hypothetical protein
HPPC_07000  1       19       -0.546378  hypothetical protein
HPPC_07005  2       18       1.209691   conserved hypothetical lipoprotein
HPPC_07010  1       17       0.676909   hypothetical protein
HPPC_07015  1       15       -0.768977  hypothetical protein
HPPC_07020  0       15       0.046106   hypothetical protein
HPPC_07025  1       17       0.273799   peptidyl-prolyl cis-trans isomerase B,
HPPC_07030  3       18       -1.517803  carbon storage regulator
HPPC_07035  3       18       -1.409111  4-diphosphocytidyl-2-C-methyl-D-erythritol
HPPC_07040  2       21       -1.814380  SsrA-binding protein
HPPC_07045  3       21       0.375401   biopolymer transport protein
HPPC_07050  4       18       0.070057   hypothetical protein
HPPC_07055  3       18       -0.556780  biopolymer transport protein
HPPC_07060  1       18       -0.316423  50S ribosomal protein L34
HPPC_07065  0       17       0.637410   ribonuclease P protein component
HPPC_07070  0       17       0.841730   hypothetical protein
HPPC_07075  1       16       0.447321   putative inner membrane protein translocase
HPPC_07080  0       14       0.121029   hypothetical protein
HPPC_07085  0       12       0.621177   tRNA modification GTPase TrmE
HPPC_07090  3       13       1.078116   putative Outer membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07075  60KDINNERMP   430    e-148    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 430 bits (1108), Expect = e-148
Identities = 164/581 (28%), Positives = 278/581 (47%), Gaps = 81/581 (13%)

Query: 10 RLILAIALSFLFIALYSYFFQKPNKTTTQTTKQETTNNHTTTSPNAPNAQHFSVTQTIPQ 69
R +L IAL F+ ++ Q + + + T TTT+ + Q Q
Sbjct: 5 RNLLVIALLFVSFMIW----QAWEQDKNPQPQAQQTTQTTTTAAGSAADQG---VPASGQ 57

Query: 70 ENLLSTISFEHARIEIDSLG-RIKQVYLKDKKYLTPKEKGFLEHVG--HLFSSKEN---- 122
L+ ++ + + I++ G ++Q L P L L +
Sbjct: 58 GKLI-SVKTDVLDLTINTRGGDVEQALL-------PAYPKELNSTQPFQLLETSPQFIYQ 109

Query: 123 AQPPL--KELPLLAADKLKPLEVRFLDPTLNNKAFNTPYSASKTTLGPNEQLV--LTQDL 178
AQ L ++ P A+ +PL +N A G NE V D
Sbjct: 110 AQSGLTGRDGPDNPANGPRPL-------------YNVEKDAYVLAEGQNELQVPMTYTDA 156

Query: 179 GALTIIKTLTFYDDLHYDLKIAFKSPNN------------------LIPSYVITNGYRPV 220
T KT Y + + + N L P + +
Sbjct: 157 AGNTFTKTFVLKRG-DYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFAL 215

Query: 221 ADLDSYTFSGVLLENTDKKIEKIE---DKDAKEIKRFSNTLFLSSVDRYFTTLLFTKDSQ 277
+TF G D+K EK + D + + S +++ + +YF T +
Sbjct: 216 -----HTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHN-D 269

Query: 278 GFEALIDSEIGTKNPLGFISLKNEA-----------TLHGYIGPKDYRSLKAISPMLTDV 326
G + +G N + I K++ ++GP+ + A++P L
Sbjct: 270 GTNNFYTANLG--NGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLT 327

Query: 327 IEYGLITFFAKGVFVLLDYLYQFVGNWGWAIILLTIIVRLILYPLSYKGMVSMQKLKELA 386
++YG + F ++ +F LL +++ FVGNWG++II++T IVR I+YPL+ SM K++ L
Sbjct: 328 VDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQ 387

Query: 387 PKMKELQEKYKGEPQKLQAHMMQLYKKHGANPLGGCLPLILQIPVFFAIYRVLYNAVELK 446
PK++ ++E+ + Q++ MM LYK NPLGGC PL++Q+P+F A+Y +L +VEL+
Sbjct: 388 PKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELR 447

Query: 447 SSEWILWIHDLSIMDPYFILPLLMGASMYWHQSVTPNTMTDPMQAKIFKLLPLLFTIFLI 506
+ + LWIHDLS DPY+ILP+LMG +M++ Q ++P T+TDPMQ KI +P++FT+F +
Sbjct: 448 QAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFL 507

Query: 507 TFPAGLVLYWTTNNILSVLQQLIINKVLENKKRMHAQNKKE 547
FP+GLVLY+ +N+++++QQ +I + LE K+ +H++ KK+
Sbjct: 508 WFPSGLVLYYIVSNLVTIIQQQLIYRGLE-KRGLHSREKKK 547


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07080  IGASERPTASE   31     0.005    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 31.2 bits (70), Expect = 0.005
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 64 KEESVKETHTKEIHQSAEEKKQKLETETPQEE-KITPKPPKKNLKEES 110
+ + + T TKE +E+K K+ETE QE K+T + K + E+
Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07085  TCRTETOQM     33     0.002    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family

signature.
Length = 639

Score = 33.3 bits (76), Expect = 0.002
Identities = 32/134 (23%), Positives = 52/134 (38%), Gaps = 25/134 (18%)

Query: 216 LSIVGKPNAGKSSLLNAMLLEERA---LVSDIKGTTR-DTIEE-------------VIEL 258
+ ++ +AGK++L ++L A L S KGTTR D +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 259 KGHKVRLIDTAGIRESADRIERLGIEKSLKSLENCDIILGVFDLSKPLEKEDFNLMDTLN 318
+ KV +IDT G + + R SL L D + + ++ + L L
Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYR-----SLSVL---DGAILLISAKDGVQAQTRILFHALR 117

Query: 319 RAKKPCIVVLNKND 332
+ P I +NK D
Sbjct: 118 KMGIPTIFFINKID 131


14  HPPC_07190  HPPC_07250  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_07190  0       21       -5.025094  outer membrane protein (omp31)
HPPC_07195  1       19       -5.205994  DNA polymerase I (polA)
HPPC_07200  2       30       -6.832456  hypothetical protein
HPPC_07205  1       28       -6.233827  DNA adenine methylase
HPPC_07210  0       21       -3.612881  putative RNA methylase
HPPC_07215  1       20       -2.097890  hypothetical protein
HPPC_07220  4       22       1.262935   hypothetical protein
HPPC_07225  3       15       0.492995   thymidylate kinase
HPPC_07230  4       13       0.353461   phosphopantetheine adenylyltransferase
HPPC_07235  4       12       0.403353   3-octaprenyl-4-hydroxybenzoate carboxy-lyase
HPPC_07240  4       13       0.204877   hypothetical protein
HPPC_07245  4       13       0.256117   flagellar basal body P-ring biosynthesis protein
HPPC_07250  2       12       0.328213   DNA helicase II
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07230  LPSBIOSNTHSS  220    6e-77    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein

signature.
Length = 166

Score = 220 bits (562), Expect = 6e-77
Identities = 61/147 (41%), Positives = 94/147 (63%)

Query: 4 IGIYPGTFDPVTNGHIDIIHRSSELFEKLIIAVAYSSAKNPMFSLKERLKMMQLATKSFK 63
IYPG+FDP+T GH+DII R LF+++ +AV + K PMFS++ERL+ + A
Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61

Query: 64 NVECVAFEGLLADLAKEYHCKVLVRGLRVVSDFEYELQMGYANKSLNHELETLYFMPTLQ 123
N + +FEGL + A++ ++RGLRV+SDFE ELQM NK+L +LET++ + +
Sbjct: 62 NAQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTE 121

Query: 124 NAFISSSIVRSIIAHKGDASHLVPKEI 150
+F+SSS+V+ + G+ H VP +
Sbjct: 122 YSFLSSSLVKEVARFGGNVEHFVPSHV 148


15  HPPC_07345  HPPC_07825  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_07345  2       13       -1.784156  peptidyl-tRNA hydrolase
HPPC_07350  0       12       -1.375441  hypothetical protein
HPPC_07355  1       11       -1.120394  hypothetical protein
HPPC_07360  2       12       -0.308800  outer membrane protein HopK
HPPC_07365  2       12       -0.114645  hypothetical protein
HPPC_07370  0       11       0.452887   hypothetical protein
HPPC_07375  0       11       0.790171   putative cation transporting P-type ATPase
HPPC_07380  1       11       1.163423   hypothetical protein
HPPC_07385  1       11       1.013983   riboflavin biosynthesis protein (ribG)
HPPC_07390  1       12       1.255189   sodium/glutamate symport carrier
HPPC_07395  2       14       2.475115   hypothetical protein
HPPC_07400  0       13       1.845098   ferrodoxin-like protein
HPPC_07405  -2      11       1.648213   putative glycerol-3-phosphate acyltransferase
HPPC_07410  -2      13       0.014752   dihydroneopterin aldolase
HPPC_07415  -2      10       -2.327688  FrpB-like protein
HPPC_07420  -2      8        -2.519870  iron-regulated outer membrane protein
HPPC_07425  -1      9        -4.228097  selenocysteine synthase
HPPC_07430  -1      9        -4.937203  transcription elongation factor NusA
HPPC_07435  -1      10       -4.955086  putative type IIS restriction-modification
HPPC_07440  1       14       -5.782938  hypothetical protein
HPPC_07445  1       10       -3.316467  type III restriction enzyme
HPPC_07450  1       11       -3.438250  type III DNA modification enzyme
HPPC_07455  0       11       -2.946993  type III DNA modification enzyme
HPPC_07460  0       13       -1.166655  ATP-dependent DNA helicase RecG
HPPC_07465  0       14       -0.843871  hypothetical protein
HPPC_07470  -1      14       -0.806034  hypothetical protein
HPPC_07475  0       11       -0.162496  exodeoxyribonuclease III
HPPC_07480  2       12       0.282207   *hypothetical protein
HPPC_07485  3       15       0.405450   chromosomal replication initiation protein
HPPC_07490  3       19       -1.815648  purine nucleoside phosphorylase
HPPC_07495  1       17       -2.761357  hypothetical protein
HPPC_07500  1       15       -2.322327  glucosamine--fructose-6-phosphate
HPPC_07505  -1      14       -3.402868  FAD-dependent thymidylate synthase
HPPC_07510  -1      14       -3.711249  hypothetical protein
HPPC_07515  -2      12       -1.361688  putative type I restriction enzyme (specificity
HPPC_07520  -3      11       -0.378990  hypothetical protein
HPPC_07525  -3      10       0.565103   type I restriction enzyme M protein (hsdM)
HPPC_07530  -1      10       1.739504   typeI restriction enzyme R protein
HPPC_07535  3       13       3.664752   hypothetical protein
HPPC_07540  3       12       4.140107   Iron(III) dicitrate transport protein FecA
HPPC_07545  -1      11       2.187612   arginase
HPPC_07550  0       11       0.585688   amino acid permease
HPPC_07555  0       11       -0.129085  alanine dehydrogenase
HPPC_07570  0       10       -1.600054  putative outer membrane protein
HPPC_07575  2       12       -2.460621  probable inorganic polyphosphate/ATP-NAD kinase
HPPC_07580  3       11       -2.669535  DNA repair protein RecN
HPPC_07585  0       12       -2.546896  fibronectin/fibrinogen-binding protein
HPPC_07590  1       16       -0.259825  hypothetical protein
HPPC_07595  1       16       -0.371953  hypothetical protein
HPPC_07600  -1      12       -1.144004  hypothetical protein
HPPC_07605  0       13       -1.580091  DNA polymerase III subunit epsilon
HPPC_07610  0       16       -2.384793  ribulose-phosphate 3-epimerase
HPPC_07615  0       16       -2.926486  fructose-1,6-bisphosphatase
HPPC_07620  4       18       -5.017192  hypothetical protein
HPPC_07625  4       19       -5.208512  putative type II methylase protein
HPPC_07630  6       21       -6.113013  hypothetical protein
HPPC_07635  8       23       -7.228583  hypothetical protein
HPPC_07640  8       22       -7.217972  hypothetical protein
HPPC_07655  6       21       -7.845561  integrase/recombinase (xerD)
HPPC_07660  4       22       -7.500638  hypothetical protein
HPPC_07665  4       25       -7.821878  hypothetical protein
HPPC_07670  5       24       -7.941610  hypothetical protein
HPPC_07675  7       24       -7.369871  hypothetical protein
HPPC_07680  7       23       -7.390167  hypothetical protein
HPPC_07685  7       20       -6.959242  hypothetical protein
HPPC_07690  7       19       -6.499961  hypothetical protein
HPPC_07695  7       19       -6.533984  VirB4-like protein
HPPC_07700  6       18       -5.970459  DNA topoisomerase I (topA)
HPPC_07705  5       17       -6.021108  VirB8 type IV secretion protein
HPPC_07710  3       18       -5.876781  VirB9 type IV secretion protein
HPPC_07715  3       22       -6.433230  VirB10 type IV secretion protein
HPPC_07720  4       26       -8.646978  hypothetical protein
HPPC_07725  4       27       -9.741995  hypothetical protein
HPPC_07730  3       26       -7.943966  hypothetical protein
HPPC_07735  2       26       -7.483412  VirB11 type IV secretion ATPase
HPPC_07740  2       25       -7.551194  hypothetical protein
HPPC_07745  5       26       -8.230161  hypothetical protein
HPPC_07750  4       21       -7.309970  hypothetical protein
HPPC_07755  5       19       -6.693715  conjugal transfer protein (traG)
HPPC_07760  6       17       -6.248004  hypothetical protein
HPPC_07765  5       18       -6.700264  hypothetical protein
HPPC_07770  6       20       -8.102692  hypothetical protein
HPPC_07775  5       21       -6.291266  hypothetical protein
HPPC_07780  5       21       -6.706696  hypothetical protein
HPPC_07785  5       22       -6.042633  chromosome partitioning protein ParA
HPPC_07800  5       25       -6.951332  hypothetical protein
HPPC_07805  4       23       -6.553144  hypothetical protein
HPPC_07820  2       19       -3.222925  hypothetical protein
HPPC_07825  1       17       -3.186802  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07345  PF06340       29     0.013    Vibrio cholerae toxin co-regulated pilus biosynthesis pr...
		>PF06340#Vibrio cholerae toxin co-regulated pilus biosynthesis

protein F (TcpF)
Length = 338

Score = 28.8 bits (64), Expect = 0.013
Identities = 14/38 (36%), Positives = 22/38 (57%)

Query: 103 NGGHNGLKSIDTLCSNSYYRLRVGISKGIGVIEHVLSK 140
N NG K +T+ SNS Y+ ++ ++KG G + SK
Sbjct: 294 NDMRNGYKWSNTMFSNSNYKTQILLTKGDGSGVKLYSK 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07385  CARBMTKINASE  29     0.019    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 29.4 bits (66), Expect = 0.019
Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 3/43 (6%)

Query: 246 ILSKHSIDPNSKVFSAPNRLVNAFYDP---KDLPLEKGFNFIE 285
I+++ +D N F P + V FYD K L EKG+ E
Sbjct: 113 IITQTIVDKNDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKE 155


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07485  HTHFIS        35     4e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 35.2 bits (81), Expect = 4e-04
Identities = 9/51 (17%), Positives = 24/51 (47%), Gaps = 4/51 (7%)

Query: 125 TVYEIAKKVAQSDTPPYNPVLFYGGTGLGKTHILNAIGNHALEKHKKVVLV 175
+Y + ++ Q+D ++ G +G GK + A+ ++ ++ V +
Sbjct: 148 EIYRVLARLMQTDLT----LMITGESGTGKELVARALHDYGKRRNGPFVAI 194


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07525  TYPE4SSCAGA   33     0.006    Type IV secretion system CagA exotoxin signature.
		>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature.

Length = 1147

Score = 32.8 bits (74), Expect = 0.006
Identities = 72/270 (26%), Positives = 117/270 (43%), Gaps = 33/270 (12%)

Query: 545 HLEPGFNPKTL--IESVCSKVLKEFEKVEILDKYGVYQLFKDYYNEVLQDDWFLLSFNDF 602
HLE GFN + + + + + F + + DK L N++++D LS N
Sbjct: 527 HLEVGFNKVAIFNLPDLNNLAITSFVRRNLEDKLTTKGLSPQEANKLIKD---FLSSNKE 583

Query: 603 LSAKELRELNPLKDKNKKANYLEEPDFVIQKTYYKSDLIPKNLIKQRFFEKE-AKELEEL 661
L K L + D NY E ++K + DL K+L K+ EKE K+LE
Sbjct: 584 LVGKTLNFNKAVADAKNTGNYDE-----VKKA--QKDL-EKSLRKREHLEKEVEKKLESK 635

Query: 662 ENALNEKEADFEEFIEEHSGEEGLFYELKINESVLKKELKNATDLEDKEILKTALELLEA 721
N+ EA + +S ++ +F IN+ + A K I + + LE
Sbjct: 636 SGNKNKMEAK----AQANSQKDEIF--ALINKEANRDARAIAYAQNLKGIKRELSDKLEN 689

Query: 722 KNKALKMKNKAHEELE-------LKAFHQYKNLKLGEIKDLIIQDKWLKSLKNALENKIL 774
NK LK +K+ +E + KA K LK G +KDL I +W+ ++N +
Sbjct: 690 VNKNLKDFDKSFDEFKNGKNKDFSKAEETLKALK-GSVKDLGINPEWISKVEN-----LN 743

Query: 775 KRINAFTSALNKIISNYSNSLLELDKEVKE 804
+N F + NK S + + +L+ VK+
Sbjct: 744 AALNEFKNGKNKDFSKVTQAKSDLENSVKD 773


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07585  FbpA_PF05833  110    3e-28    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 110 bits (276), Expect = 3e-28
Identities = 71/358 (19%), Positives = 146/358 (40%), Gaps = 25/358 (6%)

Query: 97 AKDLAYKSENFILRLEMIPKKANLMILDKEKCVIEA--FRFNDRVAKNDILGALPLN-TY 153
+ ++ ++ +N + L + K + + I++ F FN N +G LN
Sbjct: 209 SSEICFRLKNNSIDLSLSNLKEIVEVCKDLFKEIQSNKFEFNCYTKNNSFVGFYCLNLMS 268

Query: 154 EHQERDLDFKGLLDILEKDFLSYQHKE-LEHKKNHIIKRLNIQKERLKEKLEKLEDPKNL 212
+ + + + +LE + + + L+ K + + K + R +K + L +
Sbjct: 269 KEDYKKIQYDSSSKLLENFYYAKDKSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKK 328

Query: 213 QLEAKELQTQASLLLTYQHLINKHESRVVLKDFED---KECAIEIDKSMPLNAFINKKFT 269
+ + LL + + K S + L ++ I +D++ + + +
Sbjct: 329 CEDKDIFKLYGELLTANIYALKKGLSHIELANYYSENYDTVKITLDENKTPSQNVQSYYK 388

Query: 270 LSKKKKQKSQFLYLEEENLKEKIAFKENQINYVKGA---------KEESVLEMFM---PV 317
K K+ + + +E++ + + + + A K+E + ++ +
Sbjct: 389 KYNKLKKSEEAANEQLLQNEEELNYLYSVLTNINNADNYDEIEEIKKELIETGYIKFKKI 448

Query: 318 KNSKTKRPMSGYEVLYYKDFKIGLGKNQKENIKL-LQDARANDLWMHIRDIPGSHLIVFC 376
SK + + I +GKN +N L L+ A +D+W H ++IPGSH+IV
Sbjct: 449 YKSKKSKTSKPMHFISKDGIDIYVGKNNIQNDYLTLKFANKHDIWFHTKNIPGSHVIVKN 508

Query: 377 QKNAPKDEVIMELAKMLIKMQKDVFNS-YEIDYTQRKFVKIIKGAN---VIYSKYRTI 430
+ P + ++E A + K +S +DYT+ K VK GA VIYS +TI
Sbjct: 509 IMDIP-ESTLLEAANLAAYYSKSQNSSNVPVDYTEVKNVKKPNGAKPGMVIYSTNQTI 565



Score = 34.5 bits (79), Expect = 9e-04
Identities = 21/92 (22%), Positives = 48/92 (52%), Gaps = 5/92 (5%)

Query: 46 NAPYIGLSKKPLESVLKNTLALDFCLNKFTKNAKILQANIIDNDRI--LEITGAKDLAYK 103
N P I L+ + +K + L K+ NAKI+ + I+ DRI ++ +L +
Sbjct: 55 NYPRIHLTDLTKPNPIKAPMFCMV-LRKYISNAKIVDIHQINQDRIVVIDFESTDELGFN 113

Query: 104 SENFILRLEMIPKKANLMILDK-EKCVIEAFR 134
S + L +E++ + +N+ ++ K + ++++ +
Sbjct: 114 SI-YSLIIEIMGRHSNMTLIRKRDNIIMDSIK 144


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07595  FbpA_PF05833  27     0.014    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 26.8 bits (59), Expect = 0.014
Identities = 12/73 (16%), Positives = 28/73 (38%), Gaps = 7/73 (9%)

Query: 23 ISKQELEHPITEPIYTEALIDTYTFDLSSQQTIRDIYKAMYRNNALIRDAYAKIDSLEKR 82
+ +E I ++ L + Y + D K+ +++ L + I+ K+
Sbjct: 266 LMSKEDYKKIQYDSSSKLLENFYY-----AKDKSDRLKS--KSSDLQKIVMNNINRCTKK 318

Query: 83 IRKLEQQLKQAKK 95
+ L LK+ +
Sbjct: 319 DKILNNTLKKCED 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07705  PF04335       93     4e-24    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 93.0 bits (231), Expect = 4e-24
Identities = 36/213 (16%), Positives = 71/213 (33%), Gaps = 13/213 (6%)

Query: 144 FEEVRD-ASVIYHLEKKLGDYIFYVACFFFGTTVLLVILLIVLLPLKQKEPYLVQFSNNK 202
FEE ++ + VA V+ + L PLK EPY++ N
Sbjct: 14 FEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNT 73

Query: 203 ENFALVQ--KADSTITANKALIRSLVGAYVLNRESITHIEQHEKIRQNTIKEQSSNEVWY 260
++ D+TIT ++A+ + + YV RE + + + + S+
Sbjct: 74 GEASIAAKLHGDATITYDEAVRKYFLATYVRYREGWIAAAR--EEYFDAVMVMSARPEQD 131

Query: 261 EFEKLIA-----YHDSIYTNPLLIRKVKIANI-YLDKDLAYIDIEVGLYHSGELESLKRY 314
+ + +I N V+I + +L ++A + +G +
Sbjct: 132 RWSRFYKTDNPQSPQNILAN-RTDVFVEIKRVSFLGGNVAQVYFTKESV-TGSNSTKTDA 189

Query: 315 KVVMSFEFKKQEINFDSMSLNPTGFIVTGYDVT 347
+ ++ NP G+ V Y
Sbjct: 190 VATIKYKVDGTPSKEVDRFKNPLGYQVESYRAD 222


16  HPPC_00185  HPPC_00210  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_00185  -2      13       0.315483   comB8 competence protein
HPPC_00190  -2      14       0.530017   comB9 competence protein
HPPC_00195  -1      12       1.372736   comB10 competence protein
HPPC_00200  -1      12       1.192255   mannose-6-phosphate isomerase
HPPC_00205  -1      13       1.463267   GDP-D-mannose dehydratase
HPPC_00210  -2      10       1.762961   nodulation protein (nolK)
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00185  PF04335       132    3e-40    VirB8 type IV secretion protein
		>PF04335#VirB8 type IV secretion protein

Length = 227

Score = 132 bits (333), Expect = 3e-40
Identities = 37/202 (18%), Positives = 72/202 (35%), Gaps = 4/202 (1%)

Query: 40 QSVFRLERNRLKIAYKLLGFMSFIALVLAIVLISILPLQKTEHHF--VDFLNQDKHYAII 97
+ K+A+ + G +A + + ++ PL+ E + VD + A
Sbjct: 22 RDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAK 81

Query: 98 QRADKSISSNEALARSLIGAYVLNRESINRIDDKSRYELVRLQSSSKVWQRFEDLIKTQN 157
D +I+ +EA+ + + YV RE + ++ V + S+ R+ KT N
Sbjct: 82 LHGDATITYDEAVRKYFLATYVRYREGWIAAAREEYFDAVMVMSARPEQDRWSRFYKTDN 141

Query: 158 SIYAQSHLEREVHI-VNIAIYQQDNNPIASVSIAAKLMNENKLVYEKRYKIVLSYLFDTP 216
Q+ L + V I +A V + + + + + Y D
Sbjct: 142 PQSPQNILANRTDVFVEIKRVSFLGGNVAQVYFTKESVTGSNST-KTDAVATIKYKVDGT 200

Query: 217 DFDYASMPKNPTGFKITRYSIT 238
KNP G+++ Y
Sbjct: 201 PSKEVDRFKNPLGYQVESYRAD 222


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00190  TYPE4SSCAGX   31     0.005    Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 31.3 bits (70), Expect = 0.005
Identities = 23/80 (28%), Positives = 41/80 (51%), Gaps = 13/80 (16%)

Query: 182 NNKPLKEEKEEIKEKEEETITIGDSTNAMKIVKKDIQKGYRALKSSQRKWYCLGICSKKS 241
N + ++EEK++I + + + NA+K + + + Y ++ + K+S
Sbjct: 364 NKEKIREEKQKIILDQAKALETQYVHNALK--RNPVPRNYNYYQAPE----------KRS 411

Query: 242 KLSLMPEEIFNDKQFTYFKF 261
K +MP EIF+D FTYF F
Sbjct: 412 K-HIMPSEIFDDGTFTYFGF 430


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00205  NUCEPIMERASE  89     5e-22    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 89.1 bits (221), Expect = 5e-22
Identities = 46/181 (25%), Positives = 72/181 (39%), Gaps = 19/181 (10%)

Query: 6 VLITGVTGQDGSYLAEYLLNLGYEVHGLKRRSSSINTSRIDHLYEDLHSEHKRRFFLHYG 65
L+TG G G ++++ LL G++V G+ + + S E L F H
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFHKI 59

Query: 66 DMTDSSNLIHLIATTKPTEIYNLAAQSHVKVSFETPEYTANADGIGTLRILEAMRILGLE 125
D+ D + L A+ ++ + V+ S E P A+++ G L ILE R ++
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119

Query: 126 KKTRFYQASTSELYGEVLETPQNENTPF-------NPRSPYAVAKMYAFYITKNYREAYN 178
AS+S +YG N PF +P S YA K + Y Y
Sbjct: 120 ---HLLYASSSSVYGL------NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170

Query: 179 L 179
L
Sbjct: 171 L 171


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_00210  NUCEPIMERASE  49     6e-09    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 49.4 bits (118), Expect = 6e-09
Identities = 52/346 (15%), Positives = 107/346 (30%), Gaps = 54/346 (15%)

Query: 5 ILITGAYGMVGQNTALYFKKNKPDV-----------TLLTPKKSELY-----------LL 42
L+TGA G +G + + + V L + EL L
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62

Query: 43 DKDNVQAYLKEYKPTGIIHCAGRVGGIVANMNDLSAYMVENLLMGLYLFSSALDLGVKKA 102
D++ + + R + ++ + AY NL L + ++
Sbjct: 63 DREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121

Query: 103 INLASSCAYPKFAPNPLKESDLLNGSLEPTNEGYALAKLSVMKYCEYVSAEKGVFYKTLV 162
+ +SS Y P D ++ + YA K + S G+ L
Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSL----YAATKKANELMAHTYSHLYGLPATGLR 177

Query: 163 PCNLYGEFDKFEEKIAHMIPGLIARMHTAKLKNEKEFAMWGDGTARREYLNAKDLARFIS 222
+YG + + P + T + K ++ G +R++ D+A I
Sbjct: 178 FFTVYGPWGR---------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228

Query: 223 LAYENIASIPS-----------------VMNVGSGVDYSIEEYYKMVAQVLDYKGAFVKD 265
+ I + V N+G+ + +Y + + L +
Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNML 288

Query: 266 LSKPVGMQQKLMDISK-QKALKWELEIPLEQGIKEAYEYYLKLLEV 310
+P + + D + + + E ++ G+K +Y +V
Sbjct: 289 PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334


17  HPPC_01230  HPPC_01265  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_01230  -2      13       1.259680   neutrophil activating protein (napA)
HPPC_01235  -3      12       1.046710   histidine kinase sensor protein
HPPC_01240  -3      11       1.833942   hypothetical protein
HPPC_01245  -3      11       2.291658   flagellar basal body P-ring protein
HPPC_01250  -2      11       1.968475   ATP-dependent RNA helicase
HPPC_01255  -2      9        1.590679   hypothetical protein
HPPC_01260  -2      11       1.220240   hypothetical protein
HPPC_01265  -3      10       2.086315   oligopeptide permease ATPase protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01230  HELNAPAPROT   150    2e-49    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 150 bits (379), Expect = 2e-49
Identities = 39/140 (27%), Positives = 75/140 (53%), Gaps = 1/140 (0%)

Query: 5 EILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIVQLGHH 64
L ++ +L+ K+H FHW VKG FF +H+ EE+Y+ A+ D +AER++ +G
Sbjct: 15 NSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERLLAIGGQ 74

Query: 65 PLVTLSEAIKLTRVKEETKTSFHSKDIFKEILEDYKHLEKEFKELSNTAEKEGDKVTVTY 124
P+ T+ E + + + + + ++ + ++ DYK + E K + AE+ D T
Sbjct: 75 PVATVKEYTEHASITDGGNET-SASEMVQALVNDYKQISSESKFVIGLAEENQDNATADL 133

Query: 125 ADDQLAKLQKSIWMLQAHLA 144
+ +++K +WML ++L
Sbjct: 134 FVGLIEEVEKQVWMLSSYLG 153


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01235  PF06580       30     0.015    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 29.8 bits (67), Expect = 0.015
Identities = 10/71 (14%), Positives = 25/71 (35%), Gaps = 13/71 (18%)

Query: 281 IVLQNFLYNAIDAIEALEESEQ-GQVKIEAFIQNEFIVFTIIDNGKEVENKSALFEPFET 339
+++Q + N I + + Q G++ ++ N + + + G +
Sbjct: 258 MLVQTLVENGI--KHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK------- 308

Query: 340 TKLKGNGLGLA 350
+ G GL
Sbjct: 309 ---ESTGTGLQ 316


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01245  FLGPRINGFLGI  364    e-127    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 364 bits (936), Expect = e-127
Identities = 118/345 (34%), Positives = 191/345 (55%), Gaps = 26/345 (7%)

Query: 19 AEKIGDIASVVGVRDNQLIGYGLVIGLNGTGDK-SGSKFTMQSISNMLESVNVKISADDI 77
+I DIAS+ RDNQLIGYGLV+GL GTGD S FT QS+ ML+++ +
Sbjct: 28 TSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQS 87

Query: 78 KSKNVAAVMITASLPPFARQGDKIDIQISSIGDAKSIQGGTLVMTPLNAVDGNIYALAQG 137
+KN+AAVM+TA+LPPFA G ++D+ +SS+GDA S++GG L+MT L+ DG IYA+AQG
Sbjct: 88 NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQG 147

Query: 138 AIVSGN-----------SNNLLSANIINGATIEREVSYDLFHKNAMVLSLKSPNFKNAIQ 186
A++ SA + NGA IERE+ +VL L++P+F A++
Sbjct: 148 ALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVNLVLQLRNPDFSTAVR 207

Query: 187 VQNTLNKV----FGNKVAIALDPKTIQITRPERFSMVEFLALVQEIPINYSAKNKIIIDE 242
V + +N +G+ +A D + I + +P + +A ++ + + K++I+E
Sbjct: 208 VADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPAKVVINE 267

Query: 243 KSGTIVSGVDIMVHPIVVTSQDITLKITKEP--------LNDSKNTQDLDNNMSLDTAHN 294
++GTIV G D+ + + V+ +T+++T+ P Q + M++
Sbjct: 268 RTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSK 327

Query: 295 TLSSNGKSITIAGVVKALQKIGVSAKGMVSILQALKKSGAISAEM 339
G + +V L IG+ A G+++ILQ +K +GA+ AE+
Sbjct: 328 VAIVEGPDLR--TLVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01250  SECA          31     0.012    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.0 bits (70), Expect = 0.012
Identities = 26/104 (25%), Positives = 49/104 (47%), Gaps = 7/104 (6%)

Query: 224 PSNTTNT--DITQRFYVINEHERAEAIM-HLLDTQAPKKSI-VFTRTKKEADELHQFLAS 279
P+N D+ Y + E E+ +AI+ + + A + + V T + ++++ + L
Sbjct: 413 PTNRPMIRKDLPDLVY-MTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTK 471

Query: 280 KNYKSTALHGDMDQRDRRASIMAFKKNDADVLVATDVASRGLDI 323
K L+ + A+I+A A V +AT++A RG DI
Sbjct: 472 AGIKHNVLNAKFHANE--AAIVAQAGYPAAVTIATNMAGRGTDI 513


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01265  HTHFIS        32     0.004    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.5 bits (74), Expect = 0.004
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 7/50 (14%)

Query: 30 VAIVGESGSGKSSIANLIMRLNPR----FKPHNGEVLFETTNLLKESEAF 75
+ I GESG+GK +A + R F N + L ESE F
Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRD---LIESELF 209


18  HPPC_01625  HPPC_01660  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_01625  0       12       -1.031484  guanylate kinase
HPPC_01630  0       12       -1.056938  poly E-rich protein
HPPC_01635  -2      13       -1.651461  nuclease NucT
HPPC_01640  0       13       -1.793381  outer membrane protein HorC
HPPC_01645  2       14       -2.080507  flagellar basal body L-ring protein
HPPC_01650  3       14       -1.680674  CMP-N-acetylneuraminic acid synthetase
HPPC_01655  2       13       -0.985474  CMP-N-acetylneuraminic acid synthetase (neuA)
HPPC_01660  2       12       -0.824878  flagellar biosynthesis protein G
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01625  PF05272       29     0.011    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.011
Identities = 9/18 (50%), Positives = 11/18 (61%)

Query: 8 LILSGPSGAGKSTLTKYL 25
++L G G GKSTL L
Sbjct: 599 VVLEGTGGIGKSTLINTL 616


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01630  IGASERPTASE   73     1e-15    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 72.8 bits (178), Expect = 1e-15
Identities = 41/218 (18%), Positives = 78/218 (35%), Gaps = 18/218 (8%)

Query: 162 LPTLNDQEEKEEEKEEVKETPQEEEKPKDNEIQEGETLKDEEVSKELETQ-------EEV 214
P+ + E K+E K + E+ + Q E K+ + + + TQ
Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091

Query: 215 KEETQEQAKEQEPIKEETQEIKEEKQEKTQDSPSAQELEAMQELVKEIQENSNGQEDKKE 274
+ETQ ++ E+ ++ K E ++ + ++ QE + +Q + +
Sbjct: 1092 TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151

Query: 275 TQESTEAPQEKETQENTEIPQETEKQELETPQEEKQESTETPQEKTQDVEIPQETPQENT 334
T E + T +TE P + +E P E VE P+ T T
Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV----VENPENTTPATT 1207

Query: 335 ETPQESTETPQKETQEKKVQENHYESIEDIPEPVMAQA 372
+ P ++E+ K + H S+ +P V
Sbjct: 1208 Q-PTVNSES------SNKPKNRHRRSVRSVPHNVEPAT 1238



Score = 52.4 bits (125), Expect = 3e-09
Identities = 45/261 (17%), Positives = 93/261 (35%), Gaps = 16/261 (6%)

Query: 199 LKDEEVSKELETQEEVKEETQEQAKEQEPIKEETQEIKEEKQEKTQDSPSAQELEAMQEL 258
L + EV K +T + T + P E E P+ E
Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039

Query: 259 VKEIQENSNGQEDKKETQESTEAPQEKE--TQENTEIPQETEKQELETPQEEKQESTETP 316
V E + + +K E + Q +E + + + T+ E+ E +E+ T
Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099

Query: 317 QEKTQDVEIPQETPQENTET---PQESTETPQKETQEKKVQ--------ENHYESIEDIP 365
++T VE ++ E +T P+ +++ K+ Q + VQ + +I++
Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159

Query: 366 EPVMAQAMGEALPFLNESVAKIPNNENDTETPKKSVIKTPQEKEGSDKTSSPLELRLNLQ 425
A E S + P E+ T SV++ P+ + +
Sbjct: 1160 SQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQP---TVNSESS 1216

Query: 426 DLLKSLNQESLKNLLENKTLS 446
+ K+ ++ S++++ N +
Sbjct: 1217 NKPKNRHRRSVRSVPHNVEPA 1237



Score = 42.4 bits (99), Expect = 4e-06
Identities = 23/168 (13%), Positives = 47/168 (27%)

Query: 148 KALVQEEPNNEEQLLPTLNDQEEKEEEKEEVKETPQEEEKPKDNEIQEGETLKDEEVSKE 207
A E + EKEE+ + E QE K + E + + E
Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144

Query: 208 LETQEEVKEETQEQAKEQEPIKEETQEIKEEKQEKTQDSPSAQELEAMQELVKEIQENSN 267
+ + +E + + Q KE Q + + +V+ + +
Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204

Query: 268 GQEDKKETQESTEAPQEKETQENTEIPQETEKQELETPQEEKQESTET 315
ES+ P+ + + +P E + +
Sbjct: 1205 ATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01645  FLGLRINGFLGH  193    4e-64    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 193 bits (492), Expect = 4e-64
Identities = 52/172 (30%), Positives = 84/172 (48%), Gaps = 18/172 (10%)

Query: 56 GERPLFADRRAMKPNDLITIIVSEKASANYSSS----KDYKSASGGNSTPPRLTYNGLDE 111
G +PLF DRR D +TI++ E SA+ SSS +D K+ G ++ P L GL
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYL--QGLFG 118

Query: 112 RKKQEAEYLDDKNNYNFTKSSNNTNFKGGGSQKKSEDLEIVLSARIIKVLENGNYFIYGN 171
+ + E S F G G S L+ + +VL NGN + G
Sbjct: 119 NARADVEA------------SGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGE 166

Query: 172 KEVLVDGEKQILKVSGVIRPYDIERNNTIQSKFLADAKIEYTNLGHLSDSNK 223
K++ ++ + ++ SGV+ P I +NT+ S +ADA+IEY G+++++
Sbjct: 167 KQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQN 218


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01660  SACTRNSFRASE  28     0.017    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 28.0 bits (62), Expect = 0.017
Identities = 15/49 (30%), Positives = 22/49 (44%), Gaps = 3/49 (6%)

Query: 102 RGETILKALEYIAFE---EFQLNSLHLEVMENNFKAIAFYEKNHYELEG 147
R + + AL + A E E L LE + N A FY K+H+ +
Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


19  HPPC_01760  HPPC_01810  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_01760  -2      11       1.218938   flagellar MS-ring protein
HPPC_01765  -1      13       1.763257   flagellar motor switch protein G
HPPC_01770  -1      10       1.658026   flagellar assembly protein H
HPPC_01775  -1      10       1.722345   1-deoxy-D-xylulose-5-phosphate synthase
HPPC_01780  -1      11       -0.263960  GTP-binding protein LepA
HPPC_01795  0       12       0.206652   hypothetical protein
HPPC_01800  0       12       0.073021   flagellar basal-body rod protein
HPPC_01805  0       11       -0.324506  alpha-ketoglutarate permease (kgtP)
HPPC_01810  0       12       -0.608683  cell division protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01760  FLGMRINGFLIF  556    0.0      Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 556 bits (1435), Expect = 0.0
Identities = 178/582 (30%), Positives = 295/582 (50%), Gaps = 66/582 (11%)

Query: 11 VDFFIKLNKKQKIALIAAGVLITALLVFLLLYPFKEKDYAQGGYGVLFEGLDSSDNALIL 70
+++ +L +I LI AG A++V ++L+ K DY LF L D I+
Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWA-KTPDYR-----TLFSNLSDQDGGAIV 66

Query: 71 QHLQQNQIPYKVSKDD-TILIPRDKVYEERITLASQGIPKTSKVGFEIFDTKDFGATDFD 129
L Q IPY+ + I +P DKV+E R+ LA QG+PK VGFE+ D + FG + F
Sbjct: 67 AQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFS 126

Query: 130 QNIKLIRAIEGELSRTIESLNPILKANVHIAIPKDSVFVAKEVPPSASVMLKLKPDMKLS 189
+ + RA+EGEL+RTIE+L P+ A VH+A+PK S+FV ++ PSASV + L+P L
Sbjct: 127 EQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALD 186

Query: 190 PTQILGIKNLIAAAVPKLTTENVKIVNENGESIGEGDILENSKELALEQLRYKQNFENIL 249
QI + +L+++AV L NV +V+++G + + + + ++L QL++ + E+ +
Sbjct: 187 EGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNT--SGRDLNDAQLKFANDVESRI 244

Query: 250 ENKIVNILAPIVGGKNKVVARVNAEFDFSQKKSTKETFDPNN-----VVRSEQNLEEKKE 304
+ +I IL+PIVG N V A+V A+ DF+ K+ T+E + PN +RS Q ++
Sbjct: 245 QRRIEAILSPIVGNGN-VHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQV 303

Query: 305 GAPKKQVGGVPGVVSN-IGPVQGLKDNKEPEKYEKSQN---------------------- 341
GA GGVPG +SN P P + +QN
Sbjct: 304 GAGYP--GGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNE 361

Query: 342 TTNYEVGKTISEIKGEFGTLVRLNAAVVVDGKYKIALKDGANTLEYEPLSDESLQKINAL 401
T+NYEV +TI K G + RL+ AVVV+ K L DG + PL+ + +++I L
Sbjct: 362 TSNYEVDRTIRHTKMNVGDIERLSVAVVVNYK---TLADG----KPLPLTADQMKQIEDL 414

Query: 402 VKQAIGYNQNRGDDVAVSNFEFNPMAPMIDNATLSEKIMHKTQKILGSFTPLIKYILVFI 461
++A+G++ RGD + V N F+ + T E + Q + +++LV +
Sbjct: 415 TREAMGFSDKRGDTLNVVNSPFSAVDN-----TGGELPFWQQQSFIDQLLAAGRWLLVLV 469

Query: 462 VLFIFYKKVIVPFSERMLEVVPDEDKEVKSMFEEMDEEEDELNKLGDLRKKVEDQLGLNA 521
V +I ++K + P R +E ++ + E + E L+K L+++ +Q
Sbjct: 470 VAWILWRKAVRPQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQ----- 524

Query: 522 TFSEEEVRYEIVLEKIRGTLKERPDEIAMLFKLLIKDEISSD 563
+ E++ ++IR E D + L+I+ +S+D
Sbjct: 525 -----RLGAEVMSQRIR----EMSDNDPRVVALVIRQWMSND 557


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01765  FLGMOTORFLIG  350    e-122    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 350 bits (900), Expect = e-122
Identities = 121/338 (35%), Positives = 208/338 (61%), Gaps = 4/338 (1%)

Query: 8 KQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQIGAAV 67
K+ + L+ +K AILL+ +G + + ++ ++L + I ++ +I +L ++ V
Sbjct: 7 KEILDVSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNV 66

Query: 68 LEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEAKKVMDKLTKSLQTQKNFAYLGKIKP 127
L EF + + ++I GG++YARELL ++LG+++A +++ L +LQ+ + F ++ + P
Sbjct: 67 LLEFKELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQS-RPFEFVRRADP 125

Query: 128 QQLADFIINEHPQTIALILAHMEAPNAAETLSYFPDEMKAEISIRMANLGEISPQVVKRV 187
+ +FI EHPQTIALIL++++ A+ LS P E++ ++ R+A + SP+VV+ V
Sbjct: 126 ANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREV 185

Query: 188 STVLENKLESLTSYK-IEVGGLRAVAEIFNRLGQKSAKTTLARIESVDNKLAGAIKEMMF 246
VLE KL SL+S GG+ V EI N +K+ K + +E D +LA IK+ MF
Sbjct: 186 ERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMF 245

Query: 247 TFEDISKLDNFAIREILKVADKKDLSLALKTSTQDLTDKFLNNMSSRAAEQFVEEMQYLG 306
FEDI LD+ +I+ +L+ D ++L+ ALK+ + +K NMS RAA E+M++LG
Sbjct: 246 VFEDIVLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLG 305

Query: 307 AVKIKDVDVAQRKIIEIVQSLQEKG--VIQTGEEEDVI 342
+ KDV+ +Q+KI+ +++ L+E+G VI G EEDV+
Sbjct: 306 PTRRKDVEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343



Score = 30.2 bits (68), Expect = 0.010
Identities = 20/102 (19%), Positives = 41/102 (40%), Gaps = 3/102 (2%)

Query: 4 KLTPKQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQI 63
+ P + + IA++L + IL L + T ++++I ++ T ++
Sbjct: 122 RADPANILNFIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEV 181

Query: 64 GAA---VLEEFFAIFQSNQYINTGGLEYARELLTRTLGSEEA 102
VLE+ A S Y + GG++ E++ E
Sbjct: 182 VREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEK 223


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01770  FLGFLIH       37     3e-05    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 37.5 bits (86), Expect = 3e-05
Identities = 45/207 (21%), Positives = 91/207 (43%), Gaps = 14/207 (6%)

Query: 49 PLEKKAIENDLIDCLLKKTDELSSHLVKLQMQFEKAQEES-KALIENAKNDGYKIGFKEG 107
E I + + L L +LQMQ A E+ +A I + G+K G++EG
Sbjct: 19 QAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQ---AHEQGYQAGIAEGRQQGHKQGYQEG 75

Query: 108 EEKMRNELTHSVNEEKNQLLHAITALDEKMKKSQDHLMALE----KELSAIAIDIAKEVI 163
+ L + E K+Q + + + + Q L AL+ L +A++ A++VI
Sbjct: 76 ---LAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVI 132

Query: 164 LKEVEDNSQKVALALAEELLKNVLDATDIHLKVNPLDYPYLNERLQNASKI---KLESNE 220
+ ++ + + + L + L + L+V+P D +++ L + +L +
Sbjct: 133 GQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDP 192

Query: 221 AISKGGVMITSSNGSLDGNLMERFKTL 247
+ GG +++ G LD ++ R++ L
Sbjct: 193 TLHPGGCKVSADEGDLDASVATRWQEL 219


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01780  TCRTETOQM     114    7e-29    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 114 bits (287), Expect = 7e-29
Identities = 53/162 (32%), Positives = 88/162 (54%), Gaps = 7/162 (4%)

Query: 9 NIRNFSIIAHIDHGKSTLADCLIFECNAIS---NREMKSQVMDTMDIEKERGITIKAQSV 65
I N ++AH+D GK+TL + L++ AI+ + + + D +E++RGITI+
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 66 RLNYTLKGEDYVLNLIDTPGHVDFSYEVSRSLCSCEGALLVVDATQGVEAQTIANVYIAL 125
+ E+ +N+IDTPGH+DF EV RSL +GA+L++ A GV+AQT +
Sbjct: 62 SFQW----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117

Query: 126 DNHLEILPVINKIDLPNANVLEVKQDIEDTIGIDCSGANEVS 167
+ + INKID ++ V QDI++ + + +V
Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVE 159



Score = 82.2 bits (203), Expect = 2e-18
Identities = 50/215 (23%), Positives = 90/215 (41%), Gaps = 17/215 (7%)

Query: 167 SAKAKLGIKDLLEKIITTIPAPSGDPNAPLKALIYDSWFDNYLGALALVRIMDGSINTEQ 226
SAK +GI +L+E I + + + L ++ + LA +R+ G ++
Sbjct: 220 SAKNNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRD 279

Query: 227 EILVMGTGKRHGVLGLYYPNPLKKIPTKSLECGEIGIV---SLGLKSVTDIAVGDTLTDA 283
+ + K + +Y + GEI I+ L L SV +GDT
Sbjct: 280 SVRISEKEKI-KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV----LGDTKLL- 333

Query: 284 KNPTPKPIEGFMPAKPFVFAGLYPIETDRFEDLREALLKLQLNDCALNFEPESSVALGFG 343
P + IE P + + P + + E L +ALL++ +D L + +S+
Sbjct: 334 --PQRERIEN---PLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATH---E 385

Query: 344 FRVGFLGLLHMEVIKERLEREFGLNLIATAPTVVY 378
+ FLG + MEV L+ ++ + + PTV+Y
Sbjct: 386 IILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420



Score = 31.0 bits (70), Expect = 0.015
Identities = 15/75 (20%), Positives = 28/75 (37%), Gaps = 2/75 (2%)

Query: 405 IKEPFVRATIITPSEFLGNLMQLLNNKRGIQEKMEYLNQSRVMLTYSLPSNEIVMDFYDK 464
+ EP++ I P E+L + L + V+L+ +P+ I ++
Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCI-QEYRSD 592

Query: 465 LKSCTKGYASFDYEP 479
L T G + E
Sbjct: 593 LTFFTNGRSVCLTEL 607


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01800  FLGHOOKAP1    30     0.008    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 30.3 bits (68), Expect = 0.008
Identities = 9/40 (22%), Positives = 16/40 (40%)

Query: 3 NGYYAATGAMATQFNRLDLTSNNLANLNTNGFKRDDAITG 42
+ A + L+ SNN+++ N G+ R I
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01805  TCRTETB       39     2e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.5 bits (92), Expect = 2e-05
Identities = 58/315 (18%), Positives = 104/315 (33%), Gaps = 67/315 (21%)

Query: 35 APYFAKEFTHTNDPTLALISAFLVFMLGFFMRPLGSLFFGKLGDKKGRKTSMVYSIILMA 94
P A +F T + +AF++ G+ +GKL D+ G K +++ II+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSI------GTAVYGKLSDQLGIKRLLLFGIIINC 90

Query: 95 LGSFLLALLPTKEIVGEWAFLFLLLARLLQGFSVGGE------YGVVATYLSELGKNGKK 148
GS + VG F L++AR +QG G VVA Y+ + +
Sbjct: 91 FGSVIGF-------VGHSFFSLLIMARFIQG--AGAAAFPALVMVVVARYIPKENRGKAF 141

Query: 149 GFYGSFQYVTLVGGQLLAIFSLFIVENIYTHEQISAFAWRYLFALGGILALLSLFLRNIM 208
G GS + +G + I I+ W YL + I + FL ++
Sbjct: 142 GLIGS---IVAMGEGVGPAIGGMIAHYIH---------WSYLLLIPMITIITVPFLMKLL 189

Query: 209 EETMDSQTTSKTTIREETQRGSLKELLNHKKALM-------IVFGLTMGGSLCFYTFTVY 261
+ + +K + K ++ + T +
Sbjct: 190 K-----------------KEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 262 LKIFLTNSSSFSPK-------ESSFIMLLALSYFIFLQPLCG---MLADKIKRTQMLMVF 311
IF+ + + ++ M+ L I + G M+ +K L
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 312 AITGLIVTPVVFYGI 326
I +I+ P I
Sbjct: 293 EIGSVIIFPGTMSVI 307


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_01810  IGASERPTASE   37     3e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 37.4 bits (86), Expect = 3e-04
Identities = 32/144 (22%), Positives = 45/144 (31%), Gaps = 19/144 (13%)

Query: 259 NPTNPTLKELKQETKEREPTPTKETLT-----------------PTTPKPATLKPIMPTP 301
N + + ETKE + T TKET T P + K
Sbjct: 1079 NTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSET 1138

Query: 302 IMPASAPIIENDNKTENQKTPNPPKKEESPQENPQKENQKENIEEKENLKEEEKEIQDAP 361
+ P + P END T N K P + E P KE N+E+
Sbjct: 1139 VQPQAEPAREND-PTVNIKEPQSQTNTTADTEQPAKETSS-NVEQPVTESTTVNTGNSVV 1196

Query: 362 SFSPLTPTSAKKPVMVKELSENKE 385
T + +P + E S +
Sbjct: 1197 ENPENTTPATTQPTVNSESSNKPK 1220


20  HPPC_02870  HPPC_02900  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_02870  2       13       -1.489906  hypothetical protein
HPPC_02875  1       14       -0.867326  hypothetical protein
HPPC_02880  1       15       -0.816813  dihydroorotase
HPPC_02885  0       17       -2.875014  hypothetical protein
HPPC_02890  -2      15       -3.112741  hypothetical protein
HPPC_02895  -1      15       -2.445112  flagellar motor switch protein
HPPC_02900  0       13       -1.270611  endonuclease III
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02870  TYPE3IMSPROT  30     0.006    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 29.7 bits (67), Expect = 0.006
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 4/64 (6%)

Query: 87 LQSYSVMLFFNLLLLIDILGFLPFSIYHHFMASLIFSALFCGSLFLSSPLLGMIALVALS 146
L Y F L+L+ +LPFS S + + +L PLL + AL+A++
Sbjct: 45 LSDYYFEHFSKLMLIPAEQSYLPFSQ----ALSYVVDNVLLEFFYLCFPLLTVAALMAIA 100

Query: 147 SSLL 150
S ++
Sbjct: 101 SHVV 104


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02885  TONBPROTEIN   50     3e-09    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 50.0 bits (119), Expect = 3e-09
Identities = 24/57 (42%), Positives = 28/57 (49%)

Query: 83 APKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVE 139
P P +P P P P P IEKPKP+PKPKPKP K + +K VE
Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVE 118



Score = 45.4 bits (107), Expect = 9e-08
Identities = 38/228 (16%), Positives = 69/228 (30%), Gaps = 53/228 (23%)

Query: 84 PKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKKPNHKHKALKKVEKVEE 143
P + P +P P P P P P E P KPKPKP+PK K V+KV+E
Sbjct: 57 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP--------KPVKKVQE 108

Query: 144 KKVVEEKKEEKKIVEQKVEQKKIEEKKPVKKEFDPNQLSFLPKEVAPPRQENNKGLDNQT 203
+ + K E + ++ + + +
Sbjct: 109 QPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQ------ 162

Query: 204 RRDIDELYGEEFGDLGTAEKDFIRNNLRDIGRITQKYLEYPQVAAYLGQDGTNAVEFYLH 263
YP A L +G V+F +
Sbjct: 163 ---------------------------------------YPARAQALRIEGQVKVKFDVT 183

Query: 264 PNGDITDLKIIIGSEYKMLDDNTLKTIQIAYKDYPRPKTKTLIRIRVR 311
P+G + +++I+ M + ++ + +P + ++ I +
Sbjct: 184 PDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFK 231



Score = 38.0 bits (88), Expect = 2e-05
Identities = 16/54 (29%), Positives = 23/54 (42%)

Query: 74 QDPSKNTQGAPKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKK 127
Q + +P P P P+ PKP KPKP+P K + +PK+
Sbjct: 59 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKR 112



Score = 33.8 bits (77), Expect = 6e-04
Identities = 14/56 (25%), Positives = 23/56 (41%)

Query: 74 QDPSKNTQGAPKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPKKPN 129
+P + P+P P++ P P P PKP K + +PK +P +
Sbjct: 65 PEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120



Score = 30.3 bits (68), Expect = 0.008
Identities = 12/52 (23%), Positives = 16/52 (30%)

Query: 75 DPSKNTQGAPKPTLAGPQKPPTPPTPPIPPTPPKPIEKPKPEPKPKPKPEPK 126
+P P + P P P P K E+PK + KP
Sbjct: 72 EPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPAS 123


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_02895  FLGMOTORFLIN  100    1e-30    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 100 bits (250), Expect = 1e-30
Identities = 25/77 (32%), Positives = 47/77 (61%)

Query: 34 LICDYKNLLDMEIVFSAELGSTQIPLLQILRFEKGSVIDLQKPAGESVDTFVNGRVIGKG 93
+ D ++D+ + + ELG T++ + ++LR +GSV+ L AGE +D +NG +I +G
Sbjct: 50 AMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQG 109

Query: 94 EVMVFERNLAIRLNEIL 110
EV+V +R+ +I+
Sbjct: 110 EVVVVADKYGVRITDII 126


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
HPPC_02900  OMS28PORIN  28     0.029    OMS28 porin signature.
		>OMS28PORIN#OMS28 porin signature.

Length = 257

Score = 27.8 bits (61), Expect = 0.029
Identities = 28/112 (25%), Positives = 54/112 (48%), Gaps = 11/112 (9%)

Query: 27 NQTTELHHKNPYELLVATILSAQCTDARVNKITPKLFEKYPSVKDLAL-----ASLEEVK 81
N+ E+ K E A ++ + T +I + K P+ K+L L A +E+VK
Sbjct: 132 NKVVEMSKKAVQETQKAVSVAGEATFLIEKQI---MLNKSPNNKELELTKEEFAKVEQVK 188

Query: 82 EIIKSVSYFNNKSKHLINMAQKVVRDFKGVIPSTQKELMSLDGVGQKTANVV 133
E + + +++ + AQKV+ G+ PS + ++++ V + +NVV
Sbjct: 189 ETLMASERALDET---VQEAQKVLNMVNGLNPSNKDQVLAKKDVAKAISNVV 237


21  HPPC_03035  HPPC_03105  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_03035  -2      11       0.697171   flagellin A
HPPC_03040  -3      11       0.910545   3-methyladenine DNA glycosylase
HPPC_03045  -1      11       1.331630   hypothetical protein
HPPC_03050  0       9        0.589272   uroporphyrinogen decarboxylase
HPPC_03055  1       9        0.087220   outer-membrane protein of the hefABC efflux
HPPC_03060  1       9        0.055401   membrane fusion protein of the hefABC efflux
HPPC_03065  0       8        0.004074   cytoplasmic pump protein of the hefABC efflux
HPPC_03070  0       9        -0.686170  hypothetical protein
HPPC_03075  -1      8        -0.498144  putative vacuolating cytotoxin (VacA)-like
HPPC_03080  -1      9        -0.904700  ABC-type multidrug transport system, permease
HPPC_03095  -1      9        0.138350   hypothetical protein
HPPC_03100  -2      9        0.037445   NAD-dependent DNA ligase LigA
HPPC_03105  -1      12       0.010559   putative chemotaxis protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
HPPC_03035  FLAGELLIN  244    6e-77    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 244 bits (624), Expect = 6e-77
Identities = 126/518 (24%), Positives = 209/518 (40%), Gaps = 22/518 (4%)

Query: 2 AFQVNTNINAMNAHVQSALTQNALKTSLERLSSGLRINKAADDASGMTVADSLRSQASSL 61
A +NTN ++ +Q++L +++ERLSSGLRIN A DDA+G +A+ S L
Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60

Query: 62 GQAIANTNDGMGIIQVADKAMDEQLKILDTVKVKATQAAQDGQTTESRKAIQSDIVRLIQ 121
QA N NDG+ I Q + A++E L V+ + QA + K+IQ +I + ++
Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120

Query: 122 GLDNIGNTTTYNGQALLSGQFTNKEFQVGAYSNQSIKASIGSTTSDKIGQVRI-ATGALI 180
+D + N T +NG +LS + QVGA ++I + +G G
Sbjct: 121 EIDRVSNQTQFNGVKVLSQDN-QMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKE 179

Query: 181 TASGDISLTFKQVDGVNDVTLESVKVSSSAGTGIGVLAEVINKNSNRTGVKAYASVITTS 240
GD+ +FK V G + + + K +G V ++ V A +TT
Sbjct: 180 ATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTD 239

Query: 241 DVAVQSGSLSNLTLNGIHLGNIADIKKNDSDGRLVAAINAVTSETGVEAYTDQKGRLNLR 300
D N + K A A+ + + + +
Sbjct: 240 DAE-----------NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTID 288

Query: 301 SIDGRGIEIKTDSVSNGPSALTMVNGGQDLTKGSTNYGRLSLTRLDAKSINV------VS 354
+ G K + NG V S + +N +
Sbjct: 289 TKTGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKT 348

Query: 355 ASDSQHLGFTAIGFGESQVAETTVNLRDVTGNFNANVKSASGANYNAVIASGNQSL---G 411
++S L ++ TVN + T N + + +G + S
Sbjct: 349 KNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINE 408

Query: 412 SGVTTLRGAMVVIDIAESAMKMLDKVRSDLGSVQNQMISTVNNISITQVNVKAAESQIRD 471
+ + +SA+ +D VRS LG++QN+ S + N+ T N+ +A S+I D
Sbjct: 409 DAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIED 468

Query: 472 VDFAEESANFNKNNILAQSGSYAMSQANTVQQNILRLL 509
D+A E +N +K IL Q+G+ ++QAN V QN+L LL
Sbjct: 469 ADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLL 506


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
HPPC_03040  PF05272  30     0.008    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.008
Identities = 13/95 (13%), Positives = 26/95 (27%), Gaps = 20/95 (21%)

Query: 60 ILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDLSGNILKDFQSFENFKQEVT 119
L + + +A+ E + VR + +KA E+
Sbjct: 458 ALRSAPALA-GCVAFDELREQPVAVRAFPW--RKAPGP-------------LEDADVLRL 501

Query: 120 REWLLDQKGIGKESADAILCYVCAKEVMVVDKYSY 154
+++ G G+ SA + D
Sbjct: 502 ADYVETTYGTGEASAQTTEQAINV----AADMNRV 532


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
HPPC_03055  RTXTOXIND  30     0.023    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.023
Identities = 16/113 (14%), Positives = 41/113 (36%), Gaps = 16/113 (14%)

Query: 203 LARMIALQKKLEQIKTDIKRVTKLYDEGLTTIDDL-----QSLKAQGNLSEY--DILDIQ 255
LAR+ + K+ + + L + + + ++A L Y + I+
Sbjct: 220 LARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIE 279

Query: 256 FALEQNRLTLEYLTNLSVKNLKKTTIDAPNLQLRERQD-LVSLREQISALKYQ 307
+ + + +T K +D +LR+ D + L +++ + +
Sbjct: 280 SEILSAKEEYQLVTQ----LFKNEILD----KLRQTTDNIGLLTLELAKNEER 324


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
HPPC_03060  RTXTOXIND  51     1e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.0 bits (122), Expect = 1e-09
Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 5/82 (6%)

Query: 27 NVQAVQDSKLTLDSTGIVDSIKVTEGSVVKKGDVLLLLYNQDKQAQSDSTEQQLIFAKKQ 86
+ ++ IV I V EG V+KGDVLL L +A + T+ L+ A+ +
Sbjct: 95 RSKEIKPI-----ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLE 149

Query: 87 YQRYSKTGGAVDKNTLESYEFN 108
RY +++ N L +
Sbjct: 150 QTRYQILSRSIELNKLPELKLP 171



Score = 29.4 bits (66), Expect = 0.012
Identities = 21/152 (13%), Positives = 47/152 (30%), Gaps = 25/152 (16%)

Query: 70 QAQSDSTEQQLIFAKKQYQR--YSKTGGAVDKNTLESYEFNYRRLESDYAYSIAVLNKTI 127
+++ S +++ + ++ K D + + E ++
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDN--IGLLTLELAKNEER-------QQASV 329

Query: 128 LRAPFDGVIASKNIQVGEGVSANNTVLLRLVSHARKLVIE--FDSKYINAVKVG------ 179
+RAP + + GV L+ +V L + +K I + VG
Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIK 389

Query: 180 -DTYTYSIDGDSNQHEAKITKIYP--TVDENT 208
+ + Y+ G K+ I D+
Sbjct: 390 VEAFPYTRYGYL---VGKVKNINLDAIEDQRL 418


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_03065  ACRIFLAVINRP  894    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 894 bits (2312), Expect = 0.0
Identities = 287/1038 (27%), Positives = 519/1038 (50%), Gaps = 41/1038 (3%)

Query: 1 MYKTAINRPITTLMFALAIVFFGTMGFKKLSVALFPKIDLPTVVVTTTYPGASAEIIESK 60
M I RPI + A+ ++ G + +L VA +P I P V V+ YPGA A+ ++
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTDKIEEAVMGIDGIKKVTSTSSKNVSIVV-IEFELEKPNEEALNDVVNKISSVR-FDDS 118
VT IE+ + GID + ++STS S+ + + F+ + A V NK+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 119 NIKKPSINKFDTDSQAIISLFVSSSSVPAT--TLNDYAKNTIKPMLQKINGVGGVQLNGF 176
+++ I+ + S ++ S + T ++DY + +K L ++NGVG VQL G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG- 179

Query: 177 RERQIRIYADPTLMNKYNLTYADLFSTLKAENVEIDGGRIVNS------QRELSILINAN 230
+ +RI+ D L+NKY LT D+ + LK +N +I G++ + Q SI+
Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 231 SYSVADVEKIQV-----GNHVRLGDIAKIEIGLEEDNTFASFKDKPGVILEIQKIAGANE 285
+ + K+ + G+ VRL D+A++E+G E N A KP L I+ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 286 IEIVDRVYEALKHIQAISP-NYEIRPFLDTTGYIRTSIEDVKFDLVLGAILAVLVVFAFL 344
++ + L +Q P ++ DTT +++ SI +V L +L LV++ FL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 345 RNGTITLVSAISIPISIMGTFALIQWMGFSLNMLTMVALTLAIGIIIDDAIVVIENIHK- 403
+N TL+ I++P+ ++GTFA++ G+S+N LTM + LAIG+++DDAIVV+EN+ +
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 404 KLEMGMDKRKASYEGVREIGFALVAISAMLLSVFVPIGNMKGIIGRFFQSFGITVALAIA 463
+E + ++A+ + + +I ALV I+ +L +VF+P+ G G ++ F IT+ A+A
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 464 LSYVVVVTIIPMVSSVVVNPKHS-------RFYVWSEPFFKALESYYTRLLQWVLNHKLI 516
LS +V + + P + + ++ P + F+ W F ++YT + +L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 517 IFIAVVLVFVGSLFVASKIGMEFMLKEDRGRFLVWLKAKPGVSIDY----MTQKSKIFQK 572
+ L+ G + + ++ F+ +ED+G FL ++ G + + + Q + + K
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 573 AIEKHAEVEFTTLQVGY-GTTQNPFKAKIFVQLKPLKERKKEHELGQFELMSALKKELKS 631
+ + E FT + G QN FV LKP +ER E ++ K EL
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNA--GMAFVSLKPWEERNG-DENSAEAVIHRAKMELGK 656

Query: 632 MPESKGLESINLSEVSLIGGGGDSSPFQTFVFSHSQEAVDKSVANLKKFLLESPELKGKV 691
+ + + N+ + G ++ F + + D + L + + +
Sbjct: 657 IRDGF-VIPFNMPAIV---ELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712

Query: 692 ESYHTSTSESQPQLQLKILRQNANKYGVSAQTIGSVVSSAFSGTSQASVFKQDGKEYDMI 751
S + E Q +L++ ++ A GVS I +S+A G + + F G+ +
Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGG-TYVNDFIDRGRVKKLY 771

Query: 752 IRVPDNKRVSVEDIKRLQVRNKYDKLMFLDALVEITETKSPSSISRYNRQRSVTVLAQPA 811
++ R+ ED+ +L VR+ +++ A + RYN S+ + + A
Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831

Query: 812 -GISLGEILTQVSKNTKEWLVEGANYRFTGEADNAKETNGEFLIAIATAFVLIYMILAAL 870
G S G+ + + +N L G Y +TG + + + + +A +FV++++ LAAL
Sbjct: 832 PGTSSGDAMALM-ENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890

Query: 871 YESILEPFIIMVTMPLSFSGAFFALSLVHQPLSMFSMIGLILLIGMVGKNATLLIDVANE 930
YES P +M+ +PL G A +L +Q ++ M+GL+ IG+ KNA L+++ A +
Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950

Query: 931 -ERKKGLNIQEAILFAGKTRLRPILMTTIAMVCGMLPLALASGDGAAMKSPIGIAMSGGL 989
K+G + EA L A + RLRPILMT++A + G+LPLA+++G G+ ++ +GI + GG+
Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010

Query: 990 MISMVLSLLIVPVFYRLL 1007
+ + +L++ VPVF+ ++
Sbjct: 1011 VSATLLAIFFVPVFFVVI 1028


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_03075  VACCYTOTOXIN  278    1e-77    Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 278 bits (712), Expect = 1e-77
Identities = 104/394 (26%), Positives = 182/394 (46%), Gaps = 14/394 (3%)

Query: 2797 NAVNWLNALFVAKGGNPLFAPYYLQDTPTEHIVTLMKDVSSALGMLSKPNLKNNSTDVLQ 2856
+ L L + + +A + T I + ++ L ++ K + L
Sbjct: 907 QGRDLLQTLLI-DSHDAGYARTMIDATSANEITKQLNTATTTLNNIASLEHKTSGLQTLS 965

Query: 2857 LNTYTQQMGRLAKLSNFASFDSTDFSERLSSLKNQRFTDAIPNAMDVILKYSQRDKLKNN 2916
L+ RL LS + F++RL +LK+QRF + +A +V+ +++ + + N
Sbjct: 966 LSNAMILNSRLVNLSRRHTNHIDSFAKRLQALKDQRFAS-LESAAEVLYQFAPKYEKPTN 1024

Query: 2917 LWATGVGGVSFVENGTGTLYGVNVGYDRFIKG---VIVGGYAAYGYSGFYER--ITSSKS 2971
+WA +GG S G +LYG + G D ++ G IVGG+ +YGYS F + +S +
Sbjct: 1025 VWANAIGGTSLNSGGNASLYGTSAGVDAYLNGEVEAIVGGFGSYGYSSFSNQANSLNSGA 1084

Query: 2972 DNVDVGLYARAFIKKSELTFSVNEAWGANKTQISSSDALLSMINQSYQYNTWTTNARVNY 3031
+N + G+Y+R F + E F A G++++ ++ ALL +NQSY Y ++ R +Y
Sbjct: 1085 NNTNFGVYSRIFANQHEFDFEAQGALGSDQSSLNFKSALLRDLNQSYNYLAYSAATRASY 1144

Query: 3032 GYDFMFKNKSIILKPQIGLRYYYIGMSGLEGVMDNALYNQFKANADPSKKSVLTIDLALE 3091
GYDF F +++LKP +G+ Y ++G + + + S + + +E
Sbjct: 1145 GYDFAFFRNALVLKPSVGVSYNHLGSTNFKS----NSNQKVALKNGASSQHLFNASANVE 1200

Query: 3092 NRHYFSTNSYFYAIGGFGRDLLVNSMGDKLVRFIGNNTLSYRKGELYNTFASITTGGEVR 3151
R+Y+ SYFY G ++ N V + R NT A + GGE++
Sbjct: 1201 ARYYYGDTSYFYMNAGVLQEFA-NFGSSNAVSLNTFKVNATRNP--LNTHARVMMGGELK 1257

Query: 3152 LFKSFYANAGVGARFGLDYKMINITGNIGMRLAF 3185
L K + N G L + + N+GMR +F
Sbjct: 1258 LAKEVFLNLGFVYLHNLISNIGHFASNLGMRYSF 1291


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
HPPC_03095  LCRVANTIGEN  31     6e-04    Low calcium response V antigen signature.
		>LCRVANTIGEN#Low calcium response V antigen signature.

Length = 326

Score = 31.2 bits (70), Expect = 6e-04
Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 16 KRKKLLTELAELEAEIKVSSERKSSFNISLSPS 48
R KL ELAEL AE+K+ S ++ N LS S
Sbjct: 149 ARSKLREELAELTAELKIYSVIQAEINKHLSSS 181


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
HPPC_03105  HTHFIS  54     2e-10    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 54.5 bits (131), Expect = 2e-10
Identities = 24/110 (21%), Positives = 44/110 (40%), Gaps = 6/110 (5%)

Query: 194 ILIAEDSLSALKTLEKIVQTLELRYLAFPNGRELLDYLYEKEHYQQVGVVITDLEMPVIS 253
IL+A+D + L + + N L ++ +V+TD+ MP +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA----GDGDLVVTDVVMPDEN 61

Query: 254 GFEVLKTIKADSRTEHLPVIINSSMSSDSNRQLAQSLEADGFVVKSNILE 303
F++L IK LPV++ S+ ++ A A ++ K L
Sbjct: 62 AFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


22  HPPC_04550  HPPC_04580  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias   Product
HPPC_04550  -1      12       1.900505  acetate kinase
HPPC_04555  -1      13       2.452631  acetate kinase
HPPC_04560  0       15       1.697587  phosphotransacetylase
HPPC_04565  0       14       0.503881  phosphotransacetylase
HPPC_04570  0       14       0.340692  hypothetical protein
HPPC_04575  -1      13       0.015449  flagellar basal body rod modification protein
HPPC_04580  0       13       1.037927  flagellar hook protein FlgE
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_04550  ACETATEKNASE  120    6e-36    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 120 bits (302), Expect = 6e-36
Identities = 51/119 (42%), Positives = 75/119 (63%), Gaps = 2/119 (1%)

Query: 1 MRNIEARK-EKGDKEAKLAFEMCAYHIKKYIGAYMVALGRVDAIIFTGGMGENYPALRES 59
R++E + GDK A+LA + AY +KK IG+Y A+G VD I+FT G+GEN P +RE
Sbjct: 283 FRDLEDAAFKNGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREF 342

Query: 60 VCEGLENLGIALHKITNDNPGNGLVDLSQPNTKIQVLLIPTDEELEIALQAKEIIEKLK 118
+ +GLE LG L K N G +S ++K+ V+++PT+EE IA ++I+E LK
Sbjct: 343 ILDGLEFLGFKLDKEKNKVRGEE-AIISTADSKVNVMVVPTNEEYMIAKDTEKIVESLK 400


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_04555  ACETATEKNASE  360    e-126    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 360 bits (926), Expect = e-126
Identities = 142/282 (50%), Positives = 198/282 (70%), Gaps = 6/282 (2%)

Query: 1 MEILVLNLGSSSIKFKLFDMKENKPLASGLAEKIGEEIGQLKIKSHLHHNDQELKEKLVI 60
M+ILV+N GSSS+K++L + K+ LA GLAE+IG L N +++K K +
Sbjct: 1 MKILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHN----ANGEKIKIKKDM 56

Query: 61 KDHASGLLMIRESLT--KMGIIKDFNQIDAIGHRVVQGGDKFHAPVLVNELVMQEIGKLS 118
KDH + ++ ++L G+IKD ++IDA+GHRVV GG+ F + VL+ + V++ I
Sbjct: 57 KDHKDAIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCI 116

Query: 119 VLAPLHNPANLAGIEFVQKAHPHIPQIAVFDTAFHSSMPDFAYMYALPYELYEKYQIRRY 178
LAPLHNPAN+ GI+ + P +P +AVFDTAFH +MPD+AY+Y +PYE Y KY+IR+Y
Sbjct: 117 ELAPLHNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKY 176

Query: 179 GFHGTSHHYVAKEAAKYLRIPYEKFNAISLHLGNGASVAAIRDGKSVDTSMGLTPLEGLI 238
GFHGTSH YV++ AA+ L P E I+ HLGNG+S+AA+++GKS+DTSMG TPLEGL
Sbjct: 177 GFHGTSHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLA 236

Query: 239 MGTRCGDIDPTVVEYIAQCANKSLEEVMKILNYESGLKGICG 280
MGTR G IDP+++ Y+ + N S EEV+ ILN +SG+ GI G
Sbjct: 237 MGTRSGSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISG 278


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
HPPC_04570  IGASERPTASE  45     5e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.4 bits (107), Expect = 5e-07
Identities = 42/231 (18%), Positives = 76/231 (32%), Gaps = 10/231 (4%)

Query: 280 KRDKTLSKKKPEKTQTKTQTTAPSIAPENAPKIPLKTPPLMPLIGANPPPNDNAPTLLEK 339
KR++T+ T Q PS+ N + P+ P A P E
Sbjct: 987 KRNQTVDTTNIT-TPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPS---------ET 1036

Query: 340 EEKTKEVSENKEKTKESTNSAQNAQNTQASDKTSENKSVTPKETIKHFTQQLKQEIQEYK 399
E E S+ + KT E Q + E KS T + Q E +E +
Sbjct: 1037 TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096

Query: 400 PPMSKISMDLFPKELGKVEVIIQKVGKNLKVSVISHNNSLQTFLDNQQDLKNSLNALGFE 459
+K + + +E KVE + + V +T + + + + +
Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156

Query: 460 GVDLSFSQDSSKEQPKEQLRELFKEQESSPLKENALKSYQENTDHENKETS 510
+ + EQP ++ ++ + N S EN ++ T+
Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207



Score = 41.2 bits (96), Expect = 1e-05
Identities = 58/278 (20%), Positives = 101/278 (36%), Gaps = 26/278 (9%)

Query: 21 KNEVKDTKNAP----KSASKDFSKILNQKISKDKTAPKENPLKATPKGTK----ENAKAL 72
+N+ DT N A N++I++ AP P ATP T EN+K
Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQE 1047

Query: 73 EKTPTPHHQHAQNLAKDQQAPTLKDLLNHKKTTASHEAQHETHEMHETNPKTPNETLNKN 132
KT + Q A + + N K T ++E E ET ET
Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107

Query: 133 EKKPNGVVSN------AHQSNLTNKNPLTPTNHANHAIKKPTAPTHNAKDPKTLKDIQ-T 185
+++ V + S ++ K + T + PT N K+P++ +
Sbjct: 1108 KEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167

Query: 186 LSQKHDLNASNIQVVAPLEKKETSLKAGDQIALKTTQTPINHTLAKNDAKNTANLSSVLQ 245
Q +SN++ + T++ G+ + + P N T A + S+ +
Sbjct: 1168 TEQPAKETSSNVE---QPVTESTTVNTGNSV----VENPENTTPATTQPTVNSESSNKPK 1220

Query: 246 SLEKK----ESHHKEHATPPSNEKKTPPLKEALQMNAI 279
+ ++ H+ E AT SN++ T L + N
Sbjct: 1221 NRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTN 1258


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
HPPC_04580  FLGHOOKAP1  35     7e-04    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 35.3 bits (81), Expect = 7e-04
Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 2 NDTLLNAYSGIKTHQFGIDSLSNNIANVNTLGY 34
+ + NA SG+ Q +++ SNNI++ N GY
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGY 33



Score = 33.0 bits (75), Expect = 0.003
Identities = 10/48 (20%), Positives = 20/48 (41%)

Query: 557 IRHKYLETSNVNAGNALTNLILMQRGYSMNARAFGAGDDMIKEAISLK 604
+ ++ S VN NL Q+ Y NA+ + + I+++
Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546


23  HPPC_06540  HPPC_06575  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_06540  -1      11       0.891947   cation efflux system protein
HPPC_06545  1       11       -0.139536  hypothetical protein
HPPC_06550  2       12       0.207686   branched-chain amino acid transport protein
HPPC_06555  2       12       0.099787   chaperone protein DnaJ
HPPC_06560  1       12       -0.408913  hypothetical protein
HPPC_06565  -2      11       0.018059   tRNA-specific 2-thiouridylase MnmA
HPPC_06570  -1      12       0.704619   hypothetical protein
HPPC_06575  1       14       0.901140   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06540  ACRIFLAVINRP  810    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 810 bits (2094), Expect = 0.0
Identities = 223/1062 (20%), Positives = 447/1062 (42%), Gaps = 73/1062 (6%)

Query: 5 IIDLSVKNKLLTTLITLLIFLASLWAIKSVRLDALPDLSPAQVVVQITYPNQSPKIVQEQ 64
+ + ++ + ++ +++ +A AI + + P ++P V V YP + VQ+
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 65 VTYPLVSTFMSIANIDTVRGIS-SYESGLIYIIFKDGVNLYWARDRVLEQLNRVSN-LPK 122
VT + I N+ + S S S I + F+ G + A+ +V +L + LP+
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 123 DAKV-EIGSDSTSIGWAYQYALSSGSKNLS--DLKVLQDFYYRYALLGVDGVSEVASVGG 179
+ + I + +S + S + + D+ + L ++GV +V G
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 180 FVKDYEVTLQNDSLIRYNLSLEQVANAIKNSNNDTGGGVI------LENGFEKIIRSHGY 233
+ L D L +Y L+ V N +K N+ G + I +
Sbjct: 181 -QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 234 IQSLNDLEEIVVK-KEGAIPLKIKDIASVRLVPKPRRGAANLNGDKEVVGGIVMVRYHAD 292
++ + ++ ++ +++KD+A V L + A +NG K G + + A+
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING-KPAAGLGIKLATGAN 298

Query: 293 TYKVLKAIKEKIATLQASNP-DVKITSVYDRSELIEKGIDNLIHTLIEESAIVLVIIAIF 351
KAIK K+A LQ P +K+ YD + ++ I ++ TL E +V +++ +F
Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358

Query: 352 LLHFRSALVVIITLPLSVCISFLLMRYFNIEASIMSLGGIAIAIGAMVDAAIVMVENAHK 411
L + R+ L+ I +P+ + +F ++ F + +++ G+ +AIG +VD AIV+VEN +
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 412 HLQHIDTKDNAQRVNAIMQGVKHVGGAIFFALMIIVVSFLPIFALTGQEEKLFAPLAYTK 471
+ +D A + + + GA+ M++ F+P+ G ++ + T
Sbjct: 419 VMM----EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474

Query: 472 TFAMLVGALLSITIVPVLMVWLIKGRILEESKNPINAFF----------MKIYGVSLNVV 521
AM + L+++ + P L L+K E +N FF + Y S+ +
Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEHHENK-GGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 522 LKFRYAFLIASVLGLGGLYVAYKKLNWEFIPQINEGVIMYMPVTLNGVGID-------TA 574
L +L+ L + G+ V + +L F+P+ ++GV + M G +
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593

Query: 575 LEYLKKSNSAIKQLDFVKQVFGKVGRANTSTDAAGLGMIETYIELKPKNEWKEKLSYKEV 634
+Y K+ A + F F G+A + A ++ LKP W+E+ +
Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMA--------FVSLKP---WEERNGDENS 642

Query: 635 RDKL--EKTLQLKGLTNSWTYPIRGRTDMLLTGIRTPLGIKL-------YGNDTDKLQEL 685
+ + ++L + + + P + L G T +L + T +L
Sbjct: 643 AEAVIHRAKMELGKIRDGFVIPFNMPAIVEL-GTATGFDFELIDQAGLGHDALTQARNQL 701

Query: 686 AILMEQQLKTLKESLSVFAERSNNGYYITLDLNDENLARYGINKNAVLDAIKFALGGATL 745
+ Q +L +SV + L+++ E G++ + + I ALGG +
Sbjct: 702 LGMAAQHPASL---VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758

Query: 746 TTMIKGVESYPISLRLEDTERNTIEKLKNLYIKTAYNYM-PLRELAHVYYDNSPAVLKSE 804
I + ++ + R E + LY+++A M P ++ L+
Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERY 818

Query: 805 KGLNVNFIYIVPQNGISSDTYRQLAQKALEKIQLPNGYYYEFSGESQYLEEAFKTLQYIV 864
GL I G SS L + K LP G Y+++G S + +V
Sbjct: 819 NGLPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALV 876

Query: 865 PVSVFIIFILIVFALKNLTNSLLCFFTLPFAFLGGLIFMNLMGFNMSVAALVGFLALLGV 924
+S ++F+ + ++ + + +P +G L+ L V +VG L +G+
Sbjct: 877 AISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGL 936

Query: 925 ASETAIVMIIYLEDAFQKFIKTPLKEQNSTALKEAIMHGAVLRVRPKLMTFFSILASLIP 984
+++ AI+++ + +D L E+ + EA + +R+RP LMT + + ++P
Sbjct: 937 SAKNAILIVEFAKD---------LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987

Query: 985 IMYSHGTGSEIMKSIAAPMLGGMISSVVLTLFIIPTAYFVIK 1026
+ S+G GS ++ ++GGM+S+ +L +F +P + VI+
Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029



Score = 73.3 bits (180), Expect = 3e-15
Identities = 76/531 (14%), Positives = 194/531 (36%), Gaps = 60/531 (11%)

Query: 527 AFLIASVLGLGGLYVAYKKLNWEFIPQINEGVIMYMPVTLNGVGI------DTALEYLKK 580
A+++A +L + G A +L P I + V+ N G DT + +++
Sbjct: 12 AWVLAIILMMAGAL-AILQLPVAQYPTIAPPAVS---VSANYPGADAQTVQDTVTQVIEQ 67

Query: 581 SNSAIKQLDFVKQVFGKVGRANTSTDAAGLGMIETYIEL-----KPKNEWKEKLSYKEVR 635
+ + I L ++ ++++D+AG I + + + + KL
Sbjct: 68 NMNGIDNLMYM----------SSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQ--LAT 115

Query: 636 DKLEKTLQLKGLTNSWTYPIRGRTDMLLTGIRTPLGIKLYGNDTDKLQ-ELAILMEQQLK 694
L + +Q +G++ + + ++ Q +++ + +K
Sbjct: 116 PLLPQEVQQQGISVEKSSSS------------YLMVAGFVSDNPGTTQDDISDYVASNVK 163

Query: 695 TLKESLSVFAERSNNG--YYITLDLNDENLARYGINKNAVLDAIKFA---LGGATLTTMI 749
L+ + G Y + + L+ + L +Y + V++ +K + L
Sbjct: 164 DTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTP 223

Query: 750 KGVESYPISLRLEDTERNTIEKLKNLYIKTAYNYMP--LRELAHVYYD-NSPAVLKSEKG 806
+ + T E+ + ++ + L+++A V + V+ G
Sbjct: 224 ALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING 283

Query: 807 LNVNFIYIVPQNGISSDTYRQLAQKALEKIQ--LPNGYYYE-FSGESQYLEEAFKTLQYI 863
+ I G ++ + + L ++Q P G + +++ + +
Sbjct: 284 KPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKT 343

Query: 864 VPVSVFIIFILIVFALKNLTNSLLCFFTLPFAFLGGLIFMNLMGFNMSVAALVGFLALLG 923
+ ++ ++F+++ L+N+ +L+ +P LG + G++++ + G + +G
Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIG 403

Query: 924 VASETAIVMIIYLEDAFQKFIKTPLKEQNSTALKEAIMHGAVLRVRPKLMTFFSILASLI 983
+ + AIV++ +E + + KEA + + A I
Sbjct: 404 LLVDDAIVVVENVERVMME---------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFI 454

Query: 984 PIMYSHGTGSEIMKSIAAPMLGGMISSVVLTLFIIPTAYFVIKNARVRKHE 1034
P+ + G+ I + + ++ M SV++ L + P + +H
Sbjct: 455 PMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHH 505


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
HPPC_06550  TCRTETB  28     0.048    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 27.5 bits (61), Expect = 0.048
Identities = 18/88 (20%), Positives = 42/88 (47%), Gaps = 6/88 (6%)

Query: 144 IGSLIGSLAGSHFSFD---TQGMEFVMTAIFIVLFMEQYKRTTNHKN--AWLGIIIAVVC 198
+G IG + + + M ++T F++ +++ R H + + + + +V
Sbjct: 154 VGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVF 213

Query: 199 LVLFGTEYFLLIALVLMVLALILFRKQL 226
+LF T Y + L++ VL+ ++F K +
Sbjct: 214 FMLFTTSYSISF-LIVSVLSFLIFVKHI 240


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
HPPC_06560  cloacin  33     0.003    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.8 bits (74), Expect = 0.003
Identities = 21/99 (21%), Positives = 47/99 (47%), Gaps = 9/99 (9%)

Query: 28 AKLSRSNEQLSDMLYKLNESLRIYQSVLSNNQDQL----KEIKKANSTLNSQRRFFNASQ 83
++L +N+ L+D + ++ + R ++ + ++A + +N+++ F+A+
Sbjct: 356 SELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDAAA 415

Query: 84 IRLMDTDALLKQSALELEKLQALEKRLKEGMEQERLIEE 122
D DA L SA+E K +K K+ + L +E
Sbjct: 416 KEKSDADAAL-SSAMESRK----KKEDKKRSAENNLNDE 449


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_06575  LPSBIOSNTHSS  49     6e-10    Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 49.0 bits (117), Expect = 6e-10
Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 4/71 (5%)

Query: 11 ALYGGSFDPLHKAHLAIIDQTLELLPFAQLIVLPAYQNPFKKPCFLDAKTRFKELERALK 70
A+Y GSFDP+ HL II++ L F Q+ V +NP K+P F + R +++ +A+
Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRL--FDQVYVAVL-RNPNKQPMF-SVQERLEQIAKAIA 58

Query: 71 GMPRVLLSDFE 81
+P + FE
Sbjct: 59 HLPNAQVDSFE 69


24  HPPC_07955  HPPC_07995  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
HPPC_07955  -2      13       1.945892   flagellar hook-basal body protein FliE
HPPC_07960  -2      12       1.561914   flagellar basal body rod protein FlgC
HPPC_07965  -2      14       1.065194   flagellar basal body rod protein FlgB
HPPC_07970  0       13       1.323745   probable cell division protein ftsW
HPPC_07975  -1      13       -0.021340  iron(III) ABC transporter, periplasmic
HPPC_07980  0       14       -0.023459  hypothetical protein
HPPC_07985  0       14       0.163695   alkyl hydroperoxide reductase
HPPC_07990  0       12       -0.268420  outer membrane protein
HPPC_07995  1       12       -0.327759  penicillin-binding protein 2
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07955  FLGHOOKFLIE   77     7e-22    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 76.6 bits (188), Expect = 7e-22
Identities = 19/77 (24%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 34 EQKGGEFSKLLKQSINELNNTQEQSDKALADMATGQIK-DLHQAAIAIGKAETSMKLMLE 92
Q F+ L +++ +++TQ + G+ L+ + KA SM++ ++
Sbjct: 27 PQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQ 86

Query: 93 VRNKAISAYKELLRTQI 109
VRNK ++AY+E++ Q+
Sbjct: 87 VRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
HPPC_07960  FLGHOOKAP1  28     0.013    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 28.4 bits (63), Expect = 0.013
Identities = 10/38 (26%), Positives = 15/38 (39%)

Query: 121 NVNAVVEMADLVEATRAYQANVAAFQSAKNMAQNAIGM 158
VN E +L + Y AN Q+A + I +
Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07975  FERRIBNDNGPP  35     3e-04    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 34.9 bits (80), Expect = 3e-04
Identities = 28/183 (15%), Positives = 77/183 (42%), Gaps = 10/183 (5%)

Query: 108 NVELLKKLSPDLVVTFVG-NPKAVEHAKKFGISFLSFQETT--IAEAMQAMQ--AQATVL 162
N+ELL ++ P +V G P A+ +F + +A A +++ A L
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 163 EIDASKKFAKMQETLDFIAERL-KNVKKKKGVELFHKAN--KISGHQAISSDILEKGGID 219
+ A A+ ++ + + R K + + + G ++ +IL++ GI
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIP 207

Query: 220 N-FGLKYVKFGRADISVEKIVK-ENPEIIFIWWVSPLTPEDVLNNPKFATIKAIKNKQVY 277
N + + +G +S++++ ++ +++ + + ++ P + + ++ +
Sbjct: 208 NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQ 267

Query: 278 KLP 280
++P
Sbjct: 268 RVP 270


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07980  FERRIBNDNGPP  34     5e-04    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 34.2 bits (78), Expect = 5e-04
Identities = 32/183 (17%), Positives = 74/183 (40%), Gaps = 10/183 (5%)

Query: 107 NVELLKKLSPDLVVTFVGNPKAVEHAKKF--GISFLSFQEKTIAEVMEDID---AQAKAL 161
N+ELL ++ P +V G + E + G F K + A L
Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147

Query: 162 EIDASKKLAKMQETLDFIKERL-KDVKKKKGVELFHKAN--KISGHQALDSDILEKGGID 218
+ A LA+ ++ + +K R K + + + G +L +IL++ GI
Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIP 207

Query: 219 N-FGLKYVKFGRADVSVEKIVK-ENPEVIFIWWISPLSPEDVLNNPKFSTIKAIKNKQVY 276
N + + +G VS++++ ++ +V+ + + ++ P + + ++ +
Sbjct: 208 NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQ 267

Query: 277 KLP 279
++P
Sbjct: 268 RVP 270


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
HPPC_07995  TYPE3IMPPROT  29     0.030    Type III secretion system inner membrane P protein ...
		>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature.

Length = 224

Score = 29.4 bits (66), Expect = 0.030
Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 4 LRYKLLLFVFIGFWGLLVLNLFI 26
KL+LFV + W LL L +
Sbjct: 195 TPIKLVLFVALDGWTLLSKGLIL 217



 
Contact Sachin Pundhir for Bugs/Comments.