PredictBias

Identification of genomic and pathogenicity islands in prokaryotic genomes
 
A) Input parameters
Genome: 1029.gbk
Threshold dinucleotide bias: 2
Threshold codon bias: 4
Threshold %GC bias: 3
E-value (RPSBlast): 0.05
Genome (non-pathogenic): (none listed)
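The E-value (RPSBlast) cutoff of 0.05 listed above can be read as a filter on the "Score = ..., Expect = ..." lines reported for each hit in section C. The following is a minimal Python sketch of such a filter, not PredictBias's own code: the names `parse_stat_line` and `passes_cutoff` are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption.

```python
import re

# Matches BLAST-style statistic lines such as
# "Score = 40.6 bits (95), Expect = 1e-05".
STAT_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)


def parse_stat_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    m = STAT_RE.search(line)
    if m is None:
        return None
    expect = m.group(3)
    # Very small E-values are printed without a mantissa in this report
    # (e.g. "e-178"), which float() cannot parse directly.
    if expect.startswith("e"):
        expect = "1" + expect
    return float(m.group(1)), int(m.group(2)), float(expect)


def passes_cutoff(line, cutoff=0.05):
    """True if the line parses and its E-value is at or below the cutoff (assumed inclusive)."""
    parsed = parse_stat_line(line)
    return parsed is not None and parsed[2] <= cutoff


if __name__ == "__main__":
    for text in (
        "Score = 40.6 bits (95), Expect = 1e-05",
        "Score = 549 bits (1416), Expect = e-178",
    ):
        print(parse_stat_line(text), passes_cutoff(text))
```

Every hit listed in this report has an E-value below 0.05, which is consistent with this reading of the parameter.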
 
C) Potential GIs and PAIs in NC_008563
S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
1     APECO1_1939  APECO1_1915  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_1939  0       15       3.027324    electron transfer flavoprotein FixB
APECO1_1938  0       17       2.810903    oxidoreductase FixC
APECO1_1937  0       17       2.630430    metabolite transport protein YaaU
APECO1_1936  -1      16       2.761076    glutathione-regulated potassium-efflux system
APECO1_1935  -1      15       3.018160    glutathione-regulated potassium-efflux system
APECO1_1934  -1      14       2.513949    dihydrofolate reductase
APECO1_1933  0       15       1.870097    diadenosine tetraphosphatase
APECO1_1932  -1      15       1.408786    ApaG protein
APECO1_1931  -2      15       1.585990    dimethyladenosine transferase
APECO1_1930  -1      17       2.184949    4-hydroxythreonine-4-phosphate dehydrogenase
APECO1_1929  -2      18       2.396123    peptidyl-prolyl cis-trans isomerase SurA
APECO1_1928  -2      18       2.510145    organic solvent tolerance protein
APECO1_1927  -2      19       3.288150    Dna-J like membrane chaperone protein
APECO1_1926  -2      20       4.144538    23S rRNA/tRNA pseudouridine synthase A
APECO1_1925  -2      19       3.904148    ATP-dependent helicase HepA
APECO1_1924  -2      12       2.363715    DNA polymerase II
APECO1_1923  -2      11       0.040915    L-ribulose-5-phosphate 4-epimerase
APECO1_1922  -1      12       -0.055459   L-arabinose isomerase
APECO1_1921  1       17       -0.418879   ribulokinase
APECO1_1920  1       18       -0.618448   AraC family transcriptional regulator
APECO1_1919  2       15       -0.746657   hypothetical protein
APECO1_1918  1       14       2.066168    hypothetical protein
APECO1_1917  2       16       3.404323    integral membrane protein YabI
APECO1_1916  1       16       3.576718    thiamine transporter ATP-binding subunit
APECO1_1915  1       18       3.405661    thiamine transporter membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1937  TCRTETA  41     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 40.6 bits (95), Expect = 1e-05
Identities = 59/319 (18%), Positives = 106/319 (33%), Gaps = 29/319 (9%)

Query: 31 GYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDI 90
G ++ ++ L L + + A + LL L + G +SD+ GRR + L+ +
Sbjct: 22 GLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFAC-APVLGALSDRFGRRPVLLVSL 80

Query: 91 IAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAM 150
V A M + + +L + ++ + GA +A + I + + +RA F++A
Sbjct: 81 AGAAV-DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSAC 139

Query: 151 WYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECE 210
+ G ++G + + AA+ L L G F LPES +
Sbjct: 140 FGFGMVAGPVLGGLMGGFSPHAPFFAA-AALNGLNFLTGCFLLPESHK------------ 186

Query: 211 EMMIKLFGEPVAFEEEQPQQ-TRFRDLFNRRHFPFVLFVA-AIWTCQVIPMFAIYTFGPQ 268
GE E FR ++ V + +P FG
Sbjct: 187 -------GERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGED 239

Query: 269 IVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGL 328
+G + A ++ SL + P L G R L+ +L
Sbjct: 240 RFHWDATTIGISLAAFGILHSLAQAMITGPVAARL---GERRALMLGMIADGTGYILLAF 296

Query: 329 IPDMGIWLVVMAFAVYAFF 347
W+ + A
Sbjct: 297 ATR--GWMAFPIMVLLASG 313
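
The Identities, Positives and Gaps figures in the alignment header above are counts over the alignment length. A small Python sketch (the helper name `alignment_fractions` is hypothetical) recomputes the percentages from such a line. Integer truncation is used because this report shows 293/296 as 98%, which suggests the percentages are truncated rather than rounded; that reading is an inference from the output, not a documented rule.

```python
import re

# Captures "Identities = 59/319", "Positives = 106/319", "Gaps = 29/319".
FRACTION_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def alignment_fractions(line):
    """Map each statistic name to (count, alignment_length, truncated_percent)."""
    result = {}
    for name, num, den in FRACTION_RE.findall(line):
        count, length = int(num), int(den)
        # Floor division reproduces the truncated percentages seen in the report.
        result[name] = (count, length, 100 * count // length)
    return result


if __name__ == "__main__":
    header = "Identities = 59/319 (18%), Positives = 106/319 (33%), Gaps = 29/319 (9%)"
    for name, (count, length, pct) in alignment_fractions(header).items():
        print(f"{name}: {count}/{length} = {pct}%")
```

Lines that omit the Gaps field, as some ungapped alignments in this report do, simply yield a dictionary without that key.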


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1927  56KDTSANTIGN  29     0.020    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 29.2 bits (65), Expect = 0.020
Identities = 32/120 (26%), Positives = 51/120 (42%), Gaps = 18/120 (15%)

Query: 188 IAEELGISRAQFD-----QFLRMMQGGAQFGGGYQQQSGGGNWQQAQRGPTLEDACNVLG 242
EEL R FD F+ + QQQ G G QQAQ T ++A
Sbjct: 310 TLEEL---RDSFDGYINNAFVNQIHLNFVMPPQAQQQQGQGQQQQAQ--ATAQEAVAAAA 364

Query: 243 VKPTDDATTIKRAYRKLMS-EHHPDKLVAKGLPPEMMEMAKQKAQEIQ-QAYELIKQQKG 300
V+ + + I + Y+ L+ + H G+ M ++A Q+ ++ + Q KQQ+G
Sbjct: 365 VRLLNGSDQIAQLYKDLVKLQRH------AGIRKAMEKLAAQQEEDAKNQGKGDCKQQQG 418


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1918  SECYTRNLCASE  26     0.047    Preprotein translocase SecY subunit signature.
		>SECYTRNLCASE#Preprotein translocase SecY subunit signature.

Length = 437

Score = 26.3 bits (58), Expect = 0.047
Identities = 13/29 (44%), Positives = 17/29 (58%), Gaps = 1/29 (3%)

Query: 2 SKYIYILLSF-LVLFFIFFYAYISLMSKE 29
IYI+ F L++FF FFY IS +E
Sbjct: 314 DHPIYIVTYFLLIVFFAFFYVAISFNPEE 342


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1915  PF06580  32     0.005    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.1 bits (73), Expect = 0.005
Identities = 17/80 (21%), Positives = 28/80 (35%), Gaps = 5/80 (6%)

Query: 4 RRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGDWVAVWQDS-YLWHVVRFSFWQ 62
R GWL + L V A +W+ A ++W+ ++
Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVAN----TSIWRLLAFINTKPVAFTLP 115

Query: 63 AFLSALLSVVPAIFLARALY 82
LS + +VV F+ LY
Sbjct: 116 LALSIIFNVVVVTFMWSLLY 135


S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
2     APECO1_1879  APECO1_1870  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_1879  2       15       -2.014297   N-acetyl-anhydromuranmyl-L-alanine amidase
APECO1_1878  2       16       -3.622430   regulatory protein AmpE
APECO1_1877  3       14       -3.033781   aromatic amino acid transporter
APECO1_1876  3       17       -1.970057   uropathogenic specific protein
APECO1_1875  5       31       -0.196318   hypothetical protein
APECO1_1874  5       36       0.569169    hypothetical protein
APECO1_1873  4       31       1.367729    hypothetical protein
APECO1_1872  3       33       2.474054    transcriptional regulator PdhR
APECO1_1871  3       34       2.331410    pyruvate dehydrogenase subunit E1
APECO1_1870  2       27       2.175875    dihydrolipoamide acetyltransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1876  PYOCINKILLER  184    1e-52    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 184 bits (467), Expect = 1e-52
Identities = 97/287 (33%), Positives = 136/287 (47%), Gaps = 27/287 (9%)

Query: 313 ALAGSTATTRVRFFWGTDIHGKPQVYGVHTGEGTPY-ENVRVANMQWNEQTQRYEFT--- 368
A+A ++ T + + G V + +G + V V +N T YE T
Sbjct: 345 AVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPS 404

Query: 369 PAHDVDGPLITWTPENPEHGYVPGHTGN--DRPPLEQPTILVTPIPDGTDTYTTPPFPVP 426
+ ++TWTP +P P T +P +TP+ T +P
Sbjct: 405 TTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPVYEGATLTPV-----KATPETYPGV 459

Query: 427 DPKEFNDYILVFPAGSGIKPIYVYLKEDPRKLPGVVTGRGVPLSPGTRWLDMSVSNNGNG 486
D I+ FPA SGIKPIYV + DPR +PG TG+G P+ WL ++ G G
Sbjct: 460 ITLP-EDLIIGFPADSGIKPIYVMFR-DPRDVPGAATGKGQPV--SGNWLG--AASQGEG 513

Query: 487 APIPAHIADKLRGREFKTFDEFREALWLEVSQDPELIAQFSSGNQTRIKQGLTAKAPIDG 546
APIP+ IADKLRG+ FK + +FRE W+ V+ DPEL QF+ G+ ++ G
Sbjct: 514 APIPSQIADKLRGKTFKNWRDFREQFWIAVANDPELSKQFNPGSLAVMRDGGAPYVR--- 570

Query: 547 WYYGPKEIV---KKFQIHHRVAVEYGGSVYDIDNLRIVTPRLHDEIH 590
E K +IHH+V V GG VY++ NL VTP+ H EIH
Sbjct: 571 ----ESEQAGGRIKIEIHHKVRVADGGGVYNMGNLVAVTPKRHIEIH 613


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1874  PF04605  26     0.018    Virulence-associated protein D (VapD)
		>PF04605#Virulence-associated protein D (VapD)

Length = 125

Score = 26.0 bits (57), Expect = 0.018
Identities = 13/34 (38%), Positives = 20/34 (58%)

Query: 1 MYNFKDEIEDYTEREFIELLGEFTNPTGDNAQLK 34
Y+ K+ I+D ++F + L EFT T N +LK
Sbjct: 88 QYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLK 121


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1870  RTXTOXIND  34     0.002    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 34.0 bits (78), Expect = 0.002
Identities = 42/281 (14%), Positives = 90/281 (32%), Gaps = 32/281 (11%)

Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGIVKEIKVSVGDKTQTGALIMIFDSADGAADAAP 85
+ V +T G S E+ + IVKEI V G+ + G +++ + AD
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 86 AQA--------EEKKEAAPAAA-----PAAAAAKDVNVPDIGSDEVEVTEILVKVG-DKV 131
Q+ + + + + P + ++ +EV L+K
Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTW 198

Query: 132 EAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDKVSTGSLIMVFEVAGEAGAVAPAAK 191
+ ++ + DK E A + ++ +K + A +
Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK-QAIAKHAVLEQ 257

Query: 192 QEAAPAAAPASAAGVKEVNVPDIGGDEV-------------EVTEVMVKVGDKVAA-EQS 237
+ A ++ + E+ + + + D +
Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317

Query: 238 LITVEGDKASMEVPAPFAGVVKELKVN-VGDKVKTGSLIMI 277
L E + + + AP + V++LKV+ G V T +M+
Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358



Score = 29.8 bits (67), Expect = 0.035
Identities = 20/95 (21%), Positives = 35/95 (36%), Gaps = 3/95 (3%)

Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGVVKELKVNVGDKVKTGSLIMIFEVEGAAPAAAP 289
+ VA +T G S E+ +VKE+ V G+ V+ G +++ GA A
Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE-ADTL 137

Query: 290 AKQEAAAPAPAAKAEAPAAAPAAKAEGKSEFAEND 324
Q + A + + + + E D
Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172


S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
3     APECO1_1854  APECO1_1832  Y     Y          N                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_1854  -1      15       -3.551187   aspartate alpha-decarboxylase
APECO1_1853  1       19       -4.598941   hypothetical protein
APECO1_1852  3       25       -5.589083   pantoate--beta-alanine ligase
APECO1_1851  3       28       -6.921203   3-methyl-2-oxobutanoate
APECO1_1850  3       32       -8.236654   fimbrial-like adhesin protein
APECO1_1849  3       31       -7.844316   hypothetical protein
APECO1_1848  2       31       -7.305289   hypothetical protein
APECO1_1847  0       21       -4.590124   fimbrial-like adhesin protein
APECO1_1846  -1      18       -3.157714   outer membrane usher protein
APECO1_1845  -1      17       -0.662509   chaperone protein EcpD
APECO1_1844  0       14       0.249009    fimbrial-like adhesin protein
APECO1_1843  0       14       2.201478    2-amino-4-hydroxy-6-
APECO1_1842  0       14       3.540276    poly(A) polymerase I
APECO1_1841  -1      15       3.241011    glutamyl-Q tRNA(Asp) synthetase
APECO1_1840  1       13       2.466862    RNA polymerase-binding transcription factor
APECO1_1839  0       13       2.737810    sugar fermentation stimulation protein A
APECO1_1838  0       13       3.066966    2'-5' RNA ligase
APECO1_1837  -1      15       3.857447    ATP-dependent RNA helicase HrpB
APECO1_1836  -2      16       3.595004    penicillin-binding protein 1b
APECO1_1835  0       14       3.439504    ferrichrome outer membrane transporter
APECO1_1834  0       16       4.431659    iron-hydroxamate transporter ATP-binding
APECO1_1833  1       14       4.103800    iron-hydroxamate transporter substrate-binding
APECO1_1832  0       14       3.982298    iron-hydroxamate transporter permease subunit
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1851  FLGMRINGFLIF  29     0.018    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 29.2 bits (65), Expect = 0.018
Identities = 27/100 (27%), Positives = 40/100 (40%), Gaps = 22/100 (22%)

Query: 110 MVKIEGGEWL----VETVQMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAGDRLL- 164
V +E G L + V L AV GL P +V + D++G LL
Sbjct: 176 TVTLEPGRALDEGQISAVVHLVSSAVA-----GLPPGNVTLV---------DQSG-HLLT 220

Query: 165 -SDALALEAAGAQLLVLECVPVELAKRITEALAIPVIGIG 203
S+ + AQL V + +RI L+ P++G G
Sbjct: 221 QSNTSGRDLNDAQLKFANDVESRIQRRIEAILS-PIVGNG 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1846  PF00577  805    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 805 bits (2081), Expect = 0.0
Identities = 263/869 (30%), Positives = 428/869 (49%), Gaps = 40/869 (4%)

Query: 12 IATFCALLYSNSALCAELVEYDHTFLMGKDASNIDLSRYTEGNPTLPGIYDVSVYVNDQP 71
+ CA AE + ++ FL + DLSR+ G PG Y V +Y+N+
Sbjct: 30 LFVACAFAAQAPLSSAE-LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGY 88

Query: 72 IMSQSIAFAVIEGKKNAQACITQKNLLQFHISSPDKNSEKAILLKRDDDLGDCLNLAEMI 131
+ ++ + F + ++ C+T+ L +++ + + C+ L MI
Sbjct: 89 MATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLL------ADDACVPLTSMI 142

Query: 132 PQSSIRYDVNDQRLDIDVPQAWIMKNYQNYVDPSLWENGINAAMLSYNLNGYHSESP-GR 190
++ + DV QRL++ +PQA++ + Y+ P LW+ GINA +L+YN +G ++ G
Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG 202

Query: 191 TNDSIYAAFNGGINLGAWRLRASGNYNWITNVHS-----DYDFQNRYLQRDLASLRSQLV 245
+ Y G+N+GAWRLR + +++ ++ S + N +L+RD+ LRS+L
Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262

Query: 246 IGESYTTGETFDSVRIRGIRLYSDSRMLPPVLASFAPIIHGVANTNAKVTVMQNGYKIYE 305
+G+ YT G+ FD + RG +L SD MLP FAP+IHG+A A+VT+ QNGY IY
Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322

Query: 306 TTVPPGAFAIDDLSPSGYGSDLIVTIEEADGTKRTFSQPFSSVVQMLRPGVGRWDISAGQ 365
+TVPPG F I+D+ +G DL VTI+EADG+ + F+ P+SSV + R G R+ I+AG+
Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382

Query: 366 VLKD-SIQDEPNLFQASYYYGLNNYLTGYTGIQLTDNNYTAGLLGLGMNT-PVGAFSVDV 423
+ Q++P FQ++ +GL T Y G QL D Y A G+G N +GA SVD+
Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFNFGIGKNMGALGALSVDM 441

Query: 424 THSNVSIPDDKTYQGQSYRISWNKLFENTSTSLNIAAYRYSTQHYLGLNDALTLIDEVEH 483
T +N ++PDD + GQS R +NK + T++ + YRYST Y D +
Sbjct: 442 TQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501

Query: 484 PE-----QDLEPKSMRNYSRM---KNQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASGQ 535
E ++PK Y+ + ++ +++ Q L + YLSGS YW +
Sbjct: 502 IETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL----GRTSTLYLSGSHQTYWGTSN 557

Query: 536 NSTNYSIGYSNSASWGSYSISAQRSLNE-DGQTDDSIYLSFTIPIENLLGTEHRSS-GFQ 593
+ G + + ++++S + N D + L+ IP + L ++ +S
Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617

Query: 594 SIDTQLNSDFKGNNQLNISSSGYSDT-NRISYSVNTGYMMNKSSDDLSYIGGYASYESPW 652
S ++ D G G N +SYSV TGY + S +Y +
Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677

Query: 653 GTLSGSASASSDNSRQFSLNTDGGFVLHSGGLTFSNDSFSDSDTLAVIQAPGAKGARINY 712
G + S S D +Q GG + H+ G+T +DT+ +++APGAK A++
Sbjct: 678 GNANIGYSHSDDI-KQLYYGVSGGVLAHANGVTLGQPL---NDTVVLVKAPGAKDAKVEN 733

Query: 713 GNST-VDRWGYGVTSALSPYHENRIALDINDLENDVELKSTSTVAVPRQGAVVFADFETV 771
D GY V + Y ENR+ALD N L ++V+L + VP +GA+V A+F+
Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793

Query: 772 QGQSAIMNIVRSDGKNIPFAADIYDEQNNIIGNVGQGGQAFVRGIGQEGNIRITWIEEGK 831
G +M + + K +PF A + E + G V GQ ++ G+ G +++ W EE
Sbjct: 794 VGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852

Query: 832 PVSCFAHYQQNTTSEKIAQSIILNGLRCQ 860
C A+YQ S++ Q + C+
Sbjct: 853 A-HCVANYQLPPESQQ--QLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1833  FERRIBNDNGPP  509    0.0      Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 509 bits (1313), Expect = 0.0
Identities = 293/296 (98%), Positives = 294/296 (99%)

Query: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAVIDPNRIVALEWLPVELLLALGIVPYGVA 60
MSGLPLISRRRLLTAMALSPLLWQMNTAHAA IDPNRIVALEWLPVELLLALGIVPYGVA
Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60

Query: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120
DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR
Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120

Query: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAHYEDFIRSMKPRFVKRGARPLLLT 180
GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLA YEDFIRSMKPRFVKRGARPLLLT
Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180

Query: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240
TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH
Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240

Query: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRILDNAIGGKA 296
DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVR+LDNAIGGKA
Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296


S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
4     APECO1_1783  APECO1_1752  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_1783  -1      21       -3.006104   **2,5-diketo-D-gluconate reductase B
APECO1_1782  -2      23       -2.491295   LysR family transcriptional regulator
APECO1_1781  -1      22       -2.569035   hypothetical protein
APECO1_1780  -1      21       -2.686138   hypothetical protein
APECO1_1779  -1      22       -4.224647   membrane-bound lytic murein transglycosylase D
APECO1_1778  1       24       -5.482347   hydroxyacylglutathione hydrolase
APECO1_1777  0       16       -1.490230   S-adenosyl-L-methionine-dependent
APECO1_1776  0       16       1.973745    ribonuclease H
APECO1_1775  0       18       2.916276    DNA polymerase III subunit epsilon
APECO1_1773  1       19       3.840268    *aminopeptidase
APECO1_1772  1       23       5.916957    hypothetical protein
APECO1_1771  1       24       5.870456    hypothetical protein
APECO1_1770  0       25       6.144073    hypothetical protein
APECO1_1769  0       25       5.904119    hypothetical protein
APECO1_1768  0       26       6.311619    hypothetical protein
APECO1_1767  -1      26       5.752372    ATPase
APECO1_1766  0       25       4.369841    hypothetical protein
APECO1_1765  1       23       4.243873    hypothetical protein
APECO1_1764  1       21       3.246906    hypothetical protein
APECO1_1763  1       19       2.921941    hypothetical protein
APECO1_1762  1       17       1.862686    hypothetical protein
APECO1_1761  3       23       1.965537    hypothetical protein
APECO1_1760  3       21       -1.188940   hypothetical protein
APECO1_1759  4       24       -3.401902   hypothetical protein
APECO1_1758  5       32       -5.549509   hypothetical protein
APECO1_1757  5       35       -6.238139   hypothetical protein
APECO1_1756  6       40       -7.997694   hypothetical protein
APECO1_1755  9       55       -14.157008  hypothetical protein
APECO1_1753  4       46       -10.676075  hypothetical protein
APECO1_1752  2       37       -8.222454   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1778  BINARYTOXINB  34     5e-04    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 34.3 bits (78), Expect = 5e-04
Identities = 12/55 (21%), Positives = 28/55 (50%), Gaps = 4/55 (7%)

Query: 186 NDYYRKVKELRAKNQITLPVILKNERQINVFLRT----EDIDLINVINEETLLQQ 236
+ ++ EL A N T+ +K ++N+ +R D + I V +E+++++
Sbjct: 589 QNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESVVKE 643


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1756  ICENUCLEATIN  32     0.012    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 31.6 bits (71), Expect = 0.012
Identities = 30/107 (28%), Positives = 42/107 (39%), Gaps = 10/107 (9%)

Query: 519 TETIGNDQKITVGLG--QTVNVGSKKEGGHDQKVTVANDQHLTIKNDRHKVVNNNQTSKV 576
T+T G D +T G G QT GS G+ T D L + QT+
Sbjct: 359 TQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAG------YGSTQTAGE 412

Query: 577 TGTDTEEVVKKQSIKIGDNYELKVEHGTNIISGDSIELICGQGESGT 623
T T Q+ + G + L +G+ +GD LI G G + T
Sbjct: 413 ESTQTAGYGSTQTAQKGSD--LTAGYGSTGTAGDDSSLIAGYGSTQT 457


S.No  Start        End          Bias  Virulence  Insertion elements  Prediction
5     APECO1_1728  APECO1_1681  Y     Y          Y                   Pathogenicity Island (biased-composition)

LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_1728  -2      14       -3.382011   outer membrane phosphoporin protein E
APECO1_1727  -2      17       -3.438103   gamma-glutamyl kinase
APECO1_1726  1       27       -7.110311   gamma-glutamyl phosphate reductase
APECO1_1725  10      47       -13.030561  *hypothetical protein
APECO1_1724  9       48       -13.777247  hypothetical protein
APECO1_1723  8       42       -11.861613  hypothetical protein
APECO1_1722  8       41       -10.837266  hypothetical protein
APECO1_1721  7       41       -10.739971  hypothetical protein
APECO1_1720  6       39       -9.678521   hypothetical protein
APECO1_1719  6       40       -9.675736   hypothetical protein
APECO1_1718  5       40       -10.601889  hypothetical protein
APECO1_1717  6       31       -7.606072   hypothetical protein
APECO1_1716  7       28       -6.466223   hypothetical protein
APECO1_1715  7       28       -5.961659   integrase
APECO1_1714  8       27       -5.484538   integrase
APECO1_1713  6       24       -4.406436   hypothetical protein
APECO1_1712  5       20       -2.334383   vacuolating autotransporter
APECO1_1711  1       19       0.677047    hypothetical protein
APECO1_1710  1       19       1.001098    hypothetical protein
APECO1_1709  2       21       1.796845    ferredoxin
APECO1_1708  3       21       0.837732    hypothetical protein
APECO1_1707  2       21       0.538081    hypothetical protein
APECO1_1706  3       21       0.383143    hypothetical protein
APECO1_1705  4       22       -4.279879   hypothetical protein
APECO1_1704  4       27       -6.189477   hypothetical protein
APECO1_1703  3       33       -7.846635   hypothetical protein
APECO1_1702  2       32       -5.958004   hypothetical protein
APECO1_1701  1       28       -5.109737   50S ribosomal protein L31
APECO1_1700  1       26       -4.480814   NADH-dependent flavin oxidoreductase
APECO1_1699  0       19       -1.842208   hypothetical protein
APECO1_1698  1       19       -2.055393   LysR family transcriptional regulator
APECO1_1697  0       18       -1.907758   transcriptional regulator
APECO1_1696  0       17       -2.321301   aldo/keto reductase
APECO1_1695  -1      19       -3.056694   2,5-diketo-D-gluconic acid reductase A
APECO1_1694  0       21       -3.039236   attaching and effacing protein, pathogenesis
APECO1_1693  1       24       -5.440604   transcriptional regulator
APECO1_1692  2       24       -4.038320   2,5-diketo-D-gluconic acid reductase A
APECO1_1691  0       22       -2.768852   hypothetical protein
APECO1_1690  0       22       -3.712572   pyridine nucleotide-disulfide oxidoreductase
APECO1_1689  1       19       -3.786487   DNA-binding transcriptional regulator
APECO1_1688  1       19       -4.202364   hypothetical protein
APECO1_1687  1       22       -4.360315   hypothetical protein
APECO1_1686  3       25       -5.505710   hypothetical protein
APECO1_1685  5       27       -6.584329   hypothetical protein
APECO1_1684  1       16       -2.385286   hypothetical protein
APECO1_1683  -1      14       1.023586    hypothetical protein
APECO1_1682  0       15       2.123733    hypothetical protein
APECO1_1681  0       18       3.082490    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
APECO1_1728  ECOLIPORIN  548    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 548 bits (1413), Expect = 0.0
Identities = 231/384 (60%), Positives = 267/384 (69%), Gaps = 34/384 (8%)

Query: 3 MKKSTLALVVMGIVASVSVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFG 62
MK+ LALV+ ++A+ + AAEIYNKDGNKLD+YGKV +HY SD+ SKDGDQ+Y+R G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 63 FKGETQINVQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALY 122
FKGETQIN QLTGYG+WE N E + A TRLAFAGLK+ D GSFDYGRN G LY
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 123 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNEN-- 180
DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFG++DGLN LQYQGKNE+
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 181 --------------RDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSR-- 224
D++ NGDGFG S TYD G F+ AYT SDRTNEQ
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239

Query: 225 GTGKRAEAWATGLKYDANNIYLATFYSETRKMTP-------ISGGFANKTQNFEAVAQYQ 277
G +A+AW GLKYDANNIYLAT YSETR MTP GG ANKTQNFE AQYQ
Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299

Query: 278 FDFGLRPSLGYVLSKGKDIE----GIGDEDLVNYVDVGATYYFNKNMSAFVDYKINQLDS 333
FDFGLRP++ +++SKGKD+ D+DLV Y DVGATYYFNKN S +VDYKIN LD
Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359

Query: 334 DNKL----NINNDDIVAVGMTYQF 353
D+ I+ DDIVA+GM YQF
Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1727  CARBMTKINASE  37     6e-05    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 37.5 bits (87), Expect = 6e-05
Identities = 28/127 (22%), Positives = 48/127 (37%), Gaps = 17/127 (13%)

Query: 119 DTLRALLDNNI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169
+T++ L++ + VPVI E+ + E V D D A AD ++LT
Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235

Query: 170 DQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAA-DVACRAG 228
D G + + +++V +++ + G M K+ AA G
Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289

Query: 229 IDTIIAA 235
IIA
Sbjct: 290 ERAIIAH 296



Score = 30.2 bits (68), Expect = 0.013
Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%)

Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAAGHRIVIVTSG-------- 51
+ +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI
Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61

Query: 52 -AIAAGREHLGYPELP 66
+ AG+ G P P
Sbjct: 62 LHMDAGQATYGIPAQP 77


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1717  TRNSINTIMINR  29     0.048    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 29.3 bits (65), Expect = 0.048
Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 1/43 (2%)

Query: 60 VDKIAQQ-KAKTEKQNQRAQAAAARAQARQQQQIAARKEKAEL 101
V++IAQQ K E Q+A + A+AQ R + Q A R+E+ +L
Sbjct: 318 VEQIAQQAKEAGEVARQQAVESNAQAQQRYEDQHARRQEELQL 360


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_1712  IGASERPTASE  615    0.0      IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 615 bits (1586), Expect = 0.0
Identities = 267/886 (30%), Positives = 407/886 (45%), Gaps = 123/886 (13%)

Query: 42 SLALSALLPTVAGASTVGGNNPYQTYRDFAENKGQFQAGATNIPIFNNKGELVGHL--DK 99
+L ++ L A+ V + YQ +RDFAENKG+F GATN+ + + + +G +
Sbjct: 12 ALTVAYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNNKDLGTALPNG 71

Query: 100 APMVDFSSVNVSSNPGVATLINPQYIASVKH-NKGYQSVSFG------------------ 140
PM+DFS V + +ATLINPQY+ VKH + G + FG
Sbjct: 72 IPMIDFSVV--DVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGNAKAHRDVS 129

Query: 141 DGQNSYHIVDRNEHSSS-----------------DLHTPRLDKLVTEVAPATVTSSST-- 181
+N Y V++NE+ + D + PRLDK VTEVAP +++S+
Sbjct: 130 SEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDA 189

Query: 182 ADILNPSKYSAFYRAGSGSQYIQDSQGKRHWVTGGYGYLTGGILPTSFFYH--------- 232
+ +KY AF R GSGSQ+I + + + Y
Sbjct: 190 GTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNLKLVGDAYTYGIAGTPYK 249

Query: 233 --GSDGIQLYMGGNIHDHSI---------LPSFGEAGDSGSPLFGWNTAKGQWELVGVYS 281
+ + G + +HS L ++ GDSGSPLF ++ KG+W +G Y
Sbjct: 250 VNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYD 309

Query: 282 ---GVGGGTNLIYSLIPQSFLSQIYSEDNDAPVFFNASSGAPLQWKFDSSTGTGSLKQGS 338
G + +++ F + ++D+ + + + + S+ T ++ G
Sbjct: 310 FWAGYNKKSWQEWNIYKSQFTKDVLNKDSAGSLIGSK-----TDYSWSSNGKTSTITGGE 364

Query: 339 DEYAMHGQKGSDL-NAGKNLTFLGHNGQIDLENSVTQGAGSLTFTDDYTVT-TSNGSTWT 396
+ G D N GK++TF G +G + L N++ QGAG L F DY V TS+ +TW
Sbjct: 365 KSLNVDLADGKDKPNHGKSVTFEG-SGTLTLNNNIDQGAGGLFFEGDYEVKGTSDNTTWK 423

Query: 397 GAGIIVDKDASVNWQVNGVKGDNLHKIGEGTLVVQGTGVNEGGLKVGDGTVVLNQQADSS 456
GAG+ V + +V W+V+ + D L KIG+GTL+V+GTG N+G LKVGDGTV+L QQ + S
Sbjct: 424 GAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKVGDGTVILKQQTNGS 483

Query: 457 GHVQAFSSVNIASGRPTVVLADNQQVNPDNISWGYRGGVLDVNGNDLTFHKLNAADYGAT 516
G AF+SV I SGR T+VL D++QV+P++I +G+RGG LD+NGN LTF + D GA
Sbjct: 484 GQ-HAFASVGIVSGRSTLVLNDDKQVDPNSIYFGFRGGRLDLNGNSLTFDHIRNIDDGAR 542

Query: 517 LGNS-SDKTANITLD---YQTRPANVKV---------NEWSSSNRGTVGSLYIYNNPYTH 563
L N +NIT+ T P + N ++ G LY+ YT
Sbjct: 543 LVNHNMTNASNITITGESLITDPNTITPYNIDAPDEDNPYAFRRIKDGGQLYLNLENYT- 601

Query: 564 TVDYFILK--TSSYGWFP-TGQVSNEHWEYVGHDQNSAQALLANRINNK------GYLY- 613
Y+ L+ S+ P SNE+W Y+G + A+ + N INN+ GY
Sbjct: 602 ---YYALRKGASTRSELPKNSGESNENWLYMGKTSDEAKRNVMNHINNERMNGFNGYFGE 658

Query: 614 -HGKLLGNINFSNKATPGTTGALVMDGSANMSGTFTQENGRLTIQGHPVIHASTSQSIAN 672
GK GN+N + K ++ G N++G T E G L + G P HA + IA
Sbjct: 659 EEGKNNGNLNVTFKGKSE-QNRFLLTGGTNLNGDLTVEKGTLFLSGRPTPHA---RDIAG 714

Query: 673 TVSSLGDNSVLTQPTSFTQDDWENRTFSFGSLVLK-DTDFGLGRN-ATLNTTIQADNSS- 729
S+ D +DDW NR F ++ + + GRN A + + I A N +
Sbjct: 715 ISSTKKDPHFAENNEVVVEDDWINRNFKATTMNVTGNASLYSGRNVANITSNITASNKAQ 774

Query: 730 ----VTLGDSRVFIDKKDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQS--VLNINE 783
GD+ G T T ++ + A + + G VNL + VL
Sbjct: 775 VHIGYKTGDTVCVRSDYTGYVTC-TTDKLSDKALNSFNPTNLRGNVNLTESANFVLGKAN 833

Query: 784 IFNGGIQANNSTVNISSDS-------AVLENSTLTSTALNLNKGAN 822
+F NS V ++ +S + + L + ++LN N
Sbjct: 834 LFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLANGHIHLNSADN 879



Score = 53.5 bits (128), Expect = 5e-09
Identities = 65/329 (19%), Positives = 121/329 (36%), Gaps = 54/329 (16%)

Query: 760 KDADKSVFNGTVNLDNQSVLNINEIFNGGIQANNSTVNISSDSAVLENSTLTSTALNLNK 819
K +D++ N +++N+ + N F NN +N++ +N L + NLN
Sbjct: 631 KTSDEAKRNVMNHINNERMNGFNGYFGEEEGKNNGNLNVTFKGKSEQNRFLLTGGTNLNG 690

Query: 820 GANVLASQSFVSDGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSW----------NLKG 869
V F+S P + ++S + W N+ G
Sbjct: 691 DLTVEKGTLFLSGRPTPHARDIAGISSTKKDPHFAENNEVVVEDDWINRNFKATTMNVTG 750

Query: 870 DDARLNVGPYSMLSGNINVQDKGTVTLG--------------GEGELSPDLTLQNQMLYS 915
+ + + + ++ NI +K V +G G + D L ++ L S
Sbjct: 751 NASLYSGRNVANITSNITASNKAQVHIGYKTGDTVCVRSDYTGYVTCTTD-KLSDKALNS 809

Query: 916 LFN-----------------GYRNTWSGSLNAPDATVSMT-DTQWSMNGNSTAGNMKLNR 957
G N + + ++ V +T ++ W + GNS + L
Sbjct: 810 FNPTNLRGNVNLTESANFVLGKANLFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLAN 869

Query: 958 TIVGFNGGTSS-----FTTLTTDNLDAVQSAFVMRTDL--NKADKLVINKSATGHDNSIW 1010
+ N +S + TLT ++L +F TDL + DK+V+ KSATG+
Sbjct: 870 GHIHLNSADNSNNVTKYNTLTVNSLSG-NGSFYYLTDLSNKQGDKVVVTKSATGNFTLQV 928

Query: 1011 VNFLKKPSDKDTLDIPLVSAPEATADNLF 1039
+ +P+ ++ L A +A D+L
Sbjct: 929 ADKTGEPNHN---ELTLFDASKAQRDHLN 954


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1706  PF00577  63     5e-12    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 62.6 bits (152), Expect = 5e-12
Identities = 29/247 (11%), Positives = 73/247 (29%), Gaps = 23/247 (9%)

Query: 487 TLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQSVYSGTFGSLGLRAGIQRYNNGDSS 546
L + + T +S + Y + +Q+ + F + N
Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588

Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTV 594
+ +AL++++P +W + Q + A+ S + +
Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647

Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYINTNLTANGSVGWQGK 654
++ +G + +G A + Y + + S +D +G V
Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706

Query: 655 NIAASGRTDGNAGVIFDTGLEN---DGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQ 711
+ + ++ G ++ + Q + + R G + Y V L
Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761

Query: 712 NSKNSLD 718
+ + +
Sbjct: 762 TNTLADN 768


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1694  INTIMIN  549    e-178    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 549 bits (1416), Expect = e-178
Identities = 234/832 (28%), Positives = 356/832 (42%), Gaps = 78/832 (9%)

Query: 41 PVMAARAQHAVQPRLSMENTTVTADNNVEKNVASLAANAGTFLSSQPDS-----DATRNF 95
P++AA +L+ + VT N + + AA L SQ S D ++
Sbjct: 131 PLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDT 190

Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKNFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155
G+A +A+ ++Q WL YGTA V L NF SSL+ L P YD+ + F Q
Sbjct: 191 ALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDFLLPFYDSEKMLAFGQVGA 248

Query: 156 HRTDDRTQSNIGFGWRHFSENDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGY 215
D R +N+G G R F + M G N FID D S +TR+G+G EYWRDY K S NGY
Sbjct: 249 RYIDSRFTANLGAGQRFF-LPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGY 307

Query: 216 IRASGWKKSPDVEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275
R SGW +S + +DY ERPANG+DIR GYLP++P LGA LMYEQYYGD V LF DK Q
Sbjct: 308 FRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQ 367

Query: 276 KDPHAITAEVNYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLEKQLDTDSIRER 335
+P A T VNYTP+PL+T+ ++ G END + ++ Y+ +P +Q++ + E
Sbjct: 368 SNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNEL 427

Query: 336 RMLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTVSLGLVVSKATHGLKNVQ 395
R L+GSRYDLV+RNNNI+LEY+K +++ + +P I G T + L+V K+ +GL +
Sbjct: 428 RTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIV-KSKYGLDRIV 486

Query: 396 WEAPSLLAAGGKITGQG----NQWQVTLPAYQAGKDNYYAISAIAYDNKGNASKRVQTEV 451
W+ +L + GG+I G +Q LPAY G N Y ++A AYD GN+S V +
Sbjct: 487 WDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTI 546

Query: 452 VISGAGMSADRTALTLDGQSRIQMLANGNEQKPLVLSLR----DAEGQPVTGMKDQIKTE 507
+ G D+ +T + A+G E +++ PV+
Sbjct: 547 TVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVS--------- 597

Query: 508 LTFKPAGNIVTRTLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDDMSK 567
NIV+ T + + A +G + G+ ++ +M+
Sbjct: 598 ------FNIVSGTAVLSANSAN-------TNGSGKATVTLKSDK-PGQVVVSAKTAEMTS 643

Query: 568 TVTAELRATMMDVSNSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRLVP 627
+ A + S VA+GQ A T T+ V PV+ +
Sbjct: 644 ALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVK-VMKGDKPVSNQEVTF---- 698

Query: 628 QDTNGVTVGAIS--EIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGP--- 682
T + + G T++ST G +V A + ++F
Sbjct: 699 -TTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTID 757

Query: 683 ----------LDAAHSSITLNPDK---PVVGGTVTAIWTAKDANDNPVTGLNPDAPSLSG 729
+ ++ L + GG W + + V + +L
Sbjct: 758 DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDA-SSGQVTLKE 816

Query: 730 AAAAGSTASGWTDNGDGTWTAQISLGTTAGELDVMPKLNGQDAAANAAKVTVVADALSSN 789
+ +DN T+T T L V L S+
Sbjct: 817 KGTTTISVIS-SDNQTATYTIA-----TPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSS 870

Query: 790 QSKV-------SVAEDHVKAGESTTVTLVAKDAHGNAISGLSLSASLTGTAS 834
Q+++ A + S T+ + +A SG++ + L
Sbjct: 871 QNELENVFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLVKQNP 922



Score = 73.6 bits (180), Expect = 4e-15
Identities = 75/364 (20%), Positives = 120/364 (32%), Gaps = 39/364 (10%)

Query: 882 TVIAGEMSSANSTLVADNKTPTVKTTTELTFTMKDAYGNPVTGLKPDAPVFSGAASTGSE 941
V + ++ ++ AD T T + PV+ + SG A
Sbjct: 557 QVGVTDFTADKTSAKADGTEAITYTAT-VKKNGVAQANVPVSFN-----IVSGTAV---- 606

Query: 942 RPSAGNWTEKGNGVYVSTLTLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDM 1001
SA + G+G TL + +A+ V+ V D +KA I ++
Sbjct: 607 -LSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFV--DQTKASITEI 663

Query: 1002 TVKVNNQLANGQSANQITLTV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVD 1060
+ANGQ A IT TV V P+ QEVT T G S + T T+ G
Sbjct: 664 KADKTTAVANGQDA--ITYTVKVMKGDKPVSNQEVTFTTTLGKLSNS--TEKTDTNGYAK 719

Query: 1061 IELMSTVAGELEIEASVKNSQKTVKVKFKADFSTGQASLEVDAA-AQKVANGKDAFTLTA 1119
+ L ST G+ + A V + VK F +L +D + V G T
Sbjct: 720 VTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF----TTLTIDDGNIEIVGTGVKGKLPTV 775

Query: 1120 TVK-DQYGNLLPGAVVVFNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTYEITASA 1178
++ Q G + A+ I G+ LK GT I+ +
Sbjct: 776 WLQYGQVNLKASGGNGKYTW-----RSANPAIASVDASSGQVTLK----EKGTTTISVIS 826

Query: 1179 GNDQPSNAQSVTFVADKTTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLLKDSEVT 1238
++Q T+ + I + D ++ N L++
Sbjct: 827 SDNQT-----ATYTIATPNSLI-VPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKA 880

Query: 1239 LTAS 1242
A+
Sbjct: 881 WGAA 884



Score = 54.7 bits (131), Expect = 2e-09
Identities = 46/249 (18%), Positives = 82/249 (32%), Gaps = 24/249 (9%)

Query: 1168 TAGTYEITASA----GNDQPSNAQSVTFVADKTTAT---ISSIEVIGNRAVADGKTKQTY 1220
+ Y++TA A GN + ++T +++ ++ A ADG TY
Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580

Query: 1221 KVTVTDANNNLLKDSEVTLTASPENLVLTPNGTATTNEQGQAIFTATTTVAATYTLTAKV 1280
TV S + +A TN G+A T + ++AK
Sbjct: 581 TATVKKNGVAQANVPVSFNIVS--GTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK- 637

Query: 1281 EQADGQESTKTAESKFVADDKNAVLAASPERVDSLVADGKTTATLTVTLMSGVNPVGGTM 1340
A+ + FV K ++ ++ + VA+G+ T TV +M G PV
Sbjct: 638 -TAEMTSALNANAVIFVDQTKASITEIKADK-TTAVANGQDAITYTVKVMKGDKPVSNQE 695

Query: 1341 WVDIEA--PEGVTEADYQFLPSKNDHFASGKITRTFSTNKPGTYTFTFNSLTYGGYEMKP 1398
+ K D +G T ++ PG + ++ ++K
Sbjct: 696 VTFTTTLGKLSNSTE-------KTD--TNGYAKVTLTSTTPGKSLVS-ARVSDVAVDVKA 745

Query: 1399 VTVTINAVP 1407
V
Sbjct: 746 PEVEFFTTL 754



Score = 45.8 bits (108), Expect = 1e-06
Identities = 55/368 (14%), Positives = 104/368 (28%), Gaps = 56/368 (15%)

Query: 779 VTVVADALSSNQSKV---SVAEDHVKAGESTTVTLVA------KDAHGNAISGLSLSASL 829
+TV+++ +Q V + + KA + +T A +S +S
Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSG-- 603

Query: 830 TGTASEGATVSSWTEKGDGSYVAT--LTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGE 887
A +S+ + +GS AT L + + A A + +
Sbjct: 604 ------TAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALN--ANAVIFVDQ 655

Query: 888 MSSANSTLVADNKTPTVKTTTELTFTMKDAY-GNPVTGLKPDAPVFSGAASTGSERPSAG 946
++ + + AD T +T+T+K PV+ + +T + S
Sbjct: 656 TKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV-------TFTTTLGKLSNS 708

Query: 947 NWTEKGNGVYVSTLTLGSAAGQLSVMPRVNGQN-AVAQPLVLNVAG---DASKAEIRDMT 1002
NG TLT + G+ V RV+ V P V D EI
Sbjct: 709 TEKTDTNGYAKVTLT-STTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEI---- 763

Query: 1003 VKVNNQLANGQSANQITLTV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDI 1061
+ G T+ + G T ++ ++G+V +
Sbjct: 764 ------VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTW---RSANPAIASVDASSGQVTL 814

Query: 1062 ELMSTVAGELEIEASVKNSQKTVKVKFKADFSTGQASLEVDAAAQKVANGKDAFTLTATV 1121
+ G I ++Q + + + +
Sbjct: 815 K----EKGTTTISVISSDNQ---TATYTIATPNSLIVPNMSKRVT-YNDAVNTCKNFGGK 866

Query: 1122 KDQYGNLL 1129
N L
Sbjct: 867 LPSSQNEL 874


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1693  HTHTETR  28     0.028    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.028
Identities = 12/42 (28%), Positives = 19/42 (45%)

Query: 14 RQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRN 55
RQ IL L S+ +IA+ +G +R I F++
Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1684  PRTACTNFAMLY  121    2e-30    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 121 bits (305), Expect = 2e-30
Identities = 112/509 (22%), Positives = 187/509 (36%), Gaps = 73/509 (14%)

Query: 324 LDINLSDSSVWKGKVSGAGDASVSLQNGSVWNVTGSSTVDALAVKDSTVNITKATVNTGT 383
LD+ L+ + W G S+S+ N W +T +S V AL + + G
Sbjct: 413 LDVALASQARWTGATRAVD--SLSIDNA-TWVMTDNSNVGALRLASDGSVDFQQPAEAGR 469

Query: 384 FA-------SQNGTLI----VDASSENTLDISGKASGDLRVY---------SAGSLDLIN 423
F + +G D + L + ASG R++ SA +L L+
Sbjct: 470 FKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQ 529

Query: 424 EQ----TAFISTGKDSTLKATGTTEGGLYQYDLTQGADGNFYFVKNTHK----------- 468
F KD G + G Y+Y L +G + V
Sbjct: 530 TPLGSAATFTLANKD------GKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGP 583

Query: 469 -----------------------ASNASSVIQAMA-AAPANVANLQADTLSARQDAVRLS 504
++ A++ + + + +++ LS R +RL+
Sbjct: 584 QPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLN 643

Query: 505 ENDKGGVWIQYFGGKQKHTTAGNASYDLDVNGVMLGGDTRFMTEDGSWLAGVAMSSAKGD 564
D GG W + F +Q+ +D V G LG D G W G +GD
Sbjct: 644 P-DAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGD 702

Query: 565 MT-TMQSKGDTEGYSFHAYLSRQYNNGIFIDTAAQFGHYSNTADVRLMNGGGTIKADFNT 623
T G T+ Y + ++G ++D + N V +G +K + T
Sbjct: 703 RGFTGDGGGHTDSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGY-AVKGKYRT 761

Query: 624 NGFGAMVKGGYTWKDGNGLFIQPYAKLSALTLEGVDYQL-NGVDVHSDSYNSVLGEAGTR 682
+G GA ++ G + +G F++P A+L+ G Y+ NG+ V + +SVLG G
Sbjct: 762 HGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLE 821

Query: 683 VGYDFAVGNA-TVKPYLNLAALNEFSDGNKVRLGDESVNASIDGAAFRVGAGVQADITKN 741
VG + V+PY+ + L EF V + + G +G G+ A + +
Sbjct: 822 VGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRG 881

Query: 742 MGAYASLDYTKGDDIENPLQGVVGINVTW 770
YAS +Y+KG + P G +W
Sbjct: 882 HSLYASYEYSKGPKLAMPWTFHAGYRYSW 910


6  APECO1_1557  APECO1_1544  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1557   1       19       -4.096472  hypothetical protein
APECO1_1556   0       13       -1.678381  hypothetical protein
APECO1_1555   0       13       -1.008608  hypothetical protein
APECO1_1554   2       17       -0.868015  hypothetical protein
APECO1_15532  1       15       -0.247168  maltose O-acetyltransferase
APECO1_1553   1       15       0.390590   Hha protein
APECO1_1552   1       16       0.939629   acridine efflux pump
APECO1_1551   2       13       0.615254   acriflavine resistance protein A precursor
APECO1_1550   2       14       1.697683   DNA-binding transcriptional repressor AcrR
APECO1_1549   3       17       2.352064   potassium efflux protein KefA
APECO1_1548   3       18       4.397134   primosomal replication protein N''
APECO1_1547   4       23       3.089447   hypothetical protein
APECO1_1546   4       27       3.104728   adenine phosphoribosyltransferase
APECO1_1545   3       22       3.012176   DNA polymerase III subunits gamma and tau
APECO1_1544   2       22       1.489073   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1555  BCTERIALGSPF  29     0.035    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.4 bits (66), Expect = 0.035
Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%)

Query: 247 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 306
W+ L L+ G +A +LR R+ + + P++ G+I
Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272

Query: 307 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDWLRQHPQQHISINLE 365
AR+ +T + S +PL Q +S + ++ R D +R+ H + LE
Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328

Query: 366 STVLTSEKIPQLLREMI 382
T L P ++R MI
Sbjct: 329 QTAL----FPPMMRHMI 341


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1552  ACRIFLAVINRP  1369   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1369 bits (3546), Expect = 0.0
Identities = 802/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1551  RTXTOXIND  44     7e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 7e-07
Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%)

Query: 100 TYQAAYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + +P+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268
+ D + KV I D I+ + G + ++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429

Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
+ S + I L GM V A ++ G
Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.0 bits (78), Expect = 9e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQAAYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1550  HTHTETR  222    5e-76    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 222 bits (567), Expect = 5e-76
Identities = 215/215 (100%), Positives = 215/215 (100%)

Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60
MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180
GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1549  RTXTOXIND  32     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.017
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_1545  IGASERPTASE  39     9e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 9e-05
Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558
K A E+ +V+ PK + + E +N ++++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617
Q N ++ A+ S+ ++ E T V N V P A + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


7  APECO1_1518  APECO1_1486  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1518   0       18       3.074329   metal resistance protein
APECO1_1517   1       17       4.210351   thioredoxin
APECO1_1516   1       16       3.784002   short chain dehydrogenase
APECO1_1515   1       16       3.595418   multifunctional acyl-CoA thioesterase I/protease
APECO1_1513   0       15       3.536251   ABC transporter ATP-binding protein
APECO1_1512   -2      12       2.830947   hypothetical protein
APECO1_1511   -2      15       0.150655   tRNA 2-selenouridine synthase
APECO1_1510   -1      15       -0.686411  AllS family transcriptional regulator
APECO1_1509   1       17       -1.752237  ureidoglycolate hydrolase
APECO1_1508   0       16       -1.450332  DNA-binding transcriptional repressor AllR
APECO1_1507   1       16       -1.526888  glyoxylate carboligase
APECO1_1506   2       17       -1.170147  hydroxypyruvate isomerase
APECO1_1505   2       15       -0.961683  2-hydroxy-3-oxopropionate reductase
APECO1_1504   3       14       -0.781615  allantoin permease
APECO1_1503   3       13       0.204379   allantoinase
APECO1_1502   3       17       1.127405   purine permease YbbY
APECO1_1501   3       15       2.167442   glycerate kinase
APECO1_1500   2       15       2.219008   hypothetical protein
APECO1_1499   2       16       3.545533   allantoate amidohydrolase
APECO1_1498   0       15       4.325377   ureidoglycolate dehydrogenase
APECO1_1497   1       16       5.131868   membrane protein FdrA
APECO1_1496   1       17       4.709674   hypothetical protein
APECO1_1495   1       16       4.007454   hypothetical protein
APECO1_1494   0       18       3.313006   carbamate kinase
APECO1_1493   1       17       2.594102   phosphoribosylaminoimidazole carboxylase ATPase
APECO1_1492   2       16       2.201759   phosphoribosylaminoimidazole carboxylase
APECO1_1491   0       13       1.142989   hypothetical protein
APECO1_1490   1       14       1.302566   UDP-2,3-diacylglucosamine hydrolase
APECO1_1488   -1      14       -0.726544  cysteinyl-tRNA synthetase
APECO1_1487   1       21       -3.836579  hypothetical protein
APECO1_1486   -1      18       -3.628729  bifunctional 5,10-methylene-tetrahydrofolate
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1516  DHBDHDRGNASE  78     4e-19    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 77.8 bits (191), Expect = 4e-19
Identities = 49/212 (23%), Positives = 81/212 (38%), Gaps = 7/212 (3%)

Query: 16 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNS----MGFTGVLIDLDL-- 69
K ITG + GIG A L QG H+ A P+ +E++ S D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 70 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 129
++D + + + N AG G + ++S + E FS N G + +
Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 130 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRYSGIKVSLIEP 189
M+ G IV S + AYA+SK A ++ L +EL I+ +++ P
Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187

Query: 190 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 221
G T ++ ++ G F G
Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1513  PF05272  29     0.013    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.013
Identities = 12/20 (60%), Positives = 13/20 (65%)

Query: 41 LVGESGSGKSTLLAILAGLD 60
L G G GKSTL+ L GLD
Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1508  PF09025  28     0.019    YopR Core
		>PF09025#YopR Core

Length = 143

Score = 28.4 bits (63), Expect = 0.019
Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 8/61 (13%)

Query: 126 EAVLIGQLECKSMVRMCAPLGSR--------LPLHASGAGKALLYPLAEEELMSIILQTG 177
+ + +LE K+M+R PLG + L G L LA EL +I G
Sbjct: 68 QGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQVLIPLNG 127

Query: 178 L 178
+
Sbjct: 128 M 128


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_1503  UREASE  56     1e-10    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 56.3 bits (136), Expect = 1e-10
Identities = 39/163 (23%), Positives = 60/163 (36%), Gaps = 32/163 (19%)

Query: 4 DLIIKNGTVILENEARVVDIAVKDGKIAAIG-------QD-----LGDAKDVMDASGLVV 51
D +I N ++ DI +KDG+IAAIG Q +G +V+ G +V
Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128

Query: 52 SPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRAS------- 104
+ G +D+H H P + A G+T M+ PA A+
Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177

Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFK 146
I +AA ++ A G + L E+ G K
Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1494  CARBMTKINASE  383    e-136    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 383 bits (984), Expect = e-136
Identities = 125/310 (40%), Positives = 175/310 (56%), Gaps = 16/310 (5%)

Query: 2 KTLVVALGGNALLQRGEALTAKNQYRNIASAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60
K +V+ALGGNAL QRG+ + + N+ +A + AR Y + I HGNGPQVG L L
Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62

Query: 61 QNLAWKE---VEPYPLDVLVAESQGMIGYMLAQGLSAQPQM----PPVTTVLTRIEVSPD 113
A + + P+DV A SQG IGYM+ Q L + + V T++T+ V +
Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122

Query: 114 DPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRD-GKYLRRVVASPQPRKILDSEAIELL 172
DPAF P K +GP Y E + L GW +K D G+ RRVV SP P+ +++E I+ L
Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182

Query: 173 LKEGHVVICSGGGGVPVAEDG---AGSEAVIDKDLAAALLAEQINADGLVILTDADAVYE 229
++ G +VI SGGGGVPV + G EAVIDKDLA LAE++NAD +ILTD +
Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242

Query: 230 NWGTPQQRAIRRATPDELAPFAKAD----GSMGPKVTAVSGYVRSRGKPAWIGALSRIEE 285
+GT +++ +R +EL + + GSMGPKV A ++ G+ A I L + E
Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302

Query: 286 TLAGEAGTCI 295
L G+ GT +
Sbjct: 303 ALEGKTGTQV 312
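
Note on reading the statistics lines: each alignment in this report is preceded by a "Score = ... bits (...), Expect = ..." line and an "Identities = a/b (p%)" line. The sketch below shows one way these two lines could be parsed. The names parse_hsp, parse_evalue, STAT_RE and IDENT_RE are illustrative and not part of PredictBias or BLAST, and reading the abbreviated form "e-136" as 1e-136 is an assumption about how the report formats very small E-values.

```python
import re

# Hypothetical helper patterns for the two statistics lines of a hit.
STAT_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),"
    r"\s*Expect\s*=\s*(?P<evalue>\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(?P<match>\d+)/(?P<length>\d+)")


def parse_evalue(text):
    """Convert an E-value string to float.

    Assumption: a value written as "e-136" stands for 1e-136.
    """
    return float("1" + text if text.startswith("e") else text)


def parse_hsp(score_line, identities_line):
    """Extract bit score, raw score, E-value and percent identity."""
    s = STAT_RE.search(score_line)
    i = IDENT_RE.search(identities_line)
    if s is None or i is None:
        raise ValueError("lines do not look like BLAST statistics lines")
    matches, length = int(i["match"]), int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": parse_evalue(s["evalue"]),
        # Recomputed from the counts, so it is not rounded to a whole percent.
        "identity_pct": 100.0 * matches / length,
    }


if __name__ == "__main__":
    hit = parse_hsp(
        "Score = 383 bits (984), Expect = e-136",
        "Identities = 125/310 (40%), Positives = 175/310 (56%), Gaps = 16/310 (5%)",
    )
    print(hit)
```

The returned E-value can then be compared against the E-value (RPSBlast) threshold listed in the input parameters, for example hit["evalue"] <= 0.05.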


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1488  RTXTOXIND  29     0.030    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.4 bits (66), Expect = 0.030
Identities = 16/150 (10%), Positives = 44/150 (29%), Gaps = 8/150 (5%)

Query: 299 RSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTP----- 353
+ ++ +L QAR R R + P E F +++
Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192

Query: 354 EAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEV 413
E +S + + + A + + + + + + + + F +
Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249

Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRLN 443
A+ L Q+ + + +++
Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIE 279


8  APECO1_1464  APECO1_1446  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1464   0       12       3.587667   enterobactin/ferric enterobactin esterase
APECO1_1463   0       13       3.934114   enterobactin synthase subunit F
APECO1_1462   1       13       3.355432   ferric enterobactin transport protein FepE
APECO1_1461   1       13       5.485203   iron-enterobactin transporter ATP-binding
APECO1_1460   1       16       5.522040   iron-enterobactin transporter permease
APECO1_1459   -1      17       5.175460   iron-enterobactin transporter membrane protein
APECO1_1458   -1      18       4.713563   enterobactin exporter EntS
APECO1_1457   -2      17       4.568048   iron-enterobactin transporter periplasmic
APECO1_1456   -2      21       4.932488   isochorismate synthase
APECO1_1455   -1      21       4.874017   enterobactin synthase subunit E
APECO1_1454   0       18       4.442057   2,3-dihydro-2,3-dihydroxybenzoate synthetase
APECO1_1453   0       17       3.620091   2,3-dihydroxybenzoate-2,3-dehydrogenase
APECO1_1452   0       16       1.816407   hypothetical protein
APECO1_1451   -1      13       -0.596254  carbon starvation protein
APECO1_1450   -1      16       -3.465432  hypothetical protein
APECO1_1449   -2      17       -4.772003  aminotransferase
APECO1_1448   -1      18       -4.760203  hypothetical protein
APECO1_1447   -2      19       -4.639740  hypothetical protein
APECO1_1446   -1      21       -3.832464  transcriptional regulator YbdO
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1458  TCRTETA  37     9e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.1 bits (86), Expect = 9e-05
Identities = 82/394 (20%), Positives = 147/394 (37%), Gaps = 38/394 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSVRPGLLMLLSTLG---AFLAISLFGLMP 309
A IG AA L + A+ +G +A + ++L + ++ ++
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403
+ A + + +G+ + L LL L LRR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1457  FERRIBNDNGPP  63     2e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 62.7 bits (152), Expect = 2e-13
Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + + + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309
KD DA+ A PL +P V+ + + F SAM
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1454  ISCHRISMTASE  444    e-161    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 444 bits (1142), Expect = e-161
Identities = 146/299 (48%), Positives = 195/299 (65%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+PQNKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1453  DHBDHDRGNASE  365    e-131    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 365 bits (939), Expect = e-131
Identities = 111/258 (43%), Positives = 150/258 (58%), Gaps = 20/258 (7%)

Query: 15 GKNVWITGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 59
GK +ITGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 60 VMDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 119
DV D+A + ++ R+ E +D+LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 120 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 179
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 180 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 239
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 240 ASHITLQDIVVDGGSTLG 257
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


9  APECO1_1373  APECO1_1342  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1373   -1      16       4.144109   DNA-binding transcriptional activator KdpE
APECO1_1372   -1      15       3.764333   sensor protein KdpD
APECO1_1371   -1      14       2.361036   potassium-transporting ATPase subunit C
APECO1_1370   -1      15       2.262746   potassium-transporting ATPase subunit B
APECO1_1369   -1      15       1.945454   potassium-transporting ATPase subunit A
APECO1_1368   -1      14       1.986947   hypothetical protein
APECO1_1367   0       15       2.713469   deoxyribodipyrimidine photolyase
APECO1_1366   -1      14       2.595586   hypothetical protein
APECO1_1365   0       15       3.594170   hydrolase-oxidase
APECO1_1364   0       13       2.716298   hypothetical protein
APECO1_1363   -1      11       1.378373   hypothetical protein
APECO1_1362   -3      11       0.435756   LamB/YcsF family protein
APECO1_1361   -2      14       -0.437738  endonuclease VIII
APECO1_1360   0       19       0.845298   transport protein AbrB
APECO1_1359   1       22       -0.056022  hypothetical protein
APECO1_1358   1       26       1.912943   type II citrate synthase
APECO1_1357   2       25       2.811760   succinate dehydrogenase cytochrome b556 large
APECO1_1356   2       28       3.013388   succinate dehydrogenase cytochrome b556 small
APECO1_1355   2       29       2.999219   succinate dehydrogenase flavoprotein subunit
APECO1_1354   1       24       1.404262   succinate dehydrogenase iron-sulfur subunit
APECO1_1353   1       27       1.422567   2-oxoglutarate dehydrogenase E1 component
APECO1_1352   0       25       0.291096   dihydrolipoamide succinyltransferase
APECO1_1351   0       23       -0.703060  succinyl-CoA synthetase subunit beta
APECO1_1350   -2      17       -1.483475  succinyl-CoA synthetase subunit alpha
APECO1_1349   -1      14       -1.946468  hypothetical protein
APECO1_1348   4       18       -0.031435  cytochrome d terminal oxidase, subunit I
APECO1_1346   2       18       0.160825   hypothetical protein
APECO1_1345   3       21       0.202787   acyl-CoA thioester hydrolase YbgC
APECO1_1344   3       20       0.111013   colicin uptake protein TolQ
APECO1_1343   3       19       0.081148   colicin uptake protein TolR
APECO1_1342   3       17       -0.318222  cell envelope integrity inner membrane protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_1373  HTHFIS  92     8e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 92.2 bits (229), Expect = 8e-24
Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 1/125 (0%)

Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61
+L+ +D+ AIR L AL G V A DL++ D+ +PD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 62 EFIRDLRQWSA-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHS 120
+ + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 121 ATTTP 125
+
Sbjct: 124 RRPSK 128


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1372  PF06580  33     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 32.5 bits (74), Expect = 0.007
Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%)

Query: 785 LLENAVKYAGAQAE----IGINAHVEGENLQLDVWDNGPGLPPGQEQT 828
L+EN +K+ AQ I + + + L+V + G +++
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES 310


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1354  TCRTETOQM  31     0.003    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 31.4 bits (71), Expect = 0.003
Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 14 VDDAPRMQDYTLEAEEGRDM-MLLDALIQLKEKDPSLSFRR 53
+++ + T+E + + MLLDAL+++ + DP L +
Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1349  SYCDCHAPRONE  28     0.037    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 28.0 bits (62), Expect = 0.037
Identities = 18/65 (27%), Positives = 26/65 (40%)

Query: 255 AMQNSGDTQLARKYNREGEAVYKTGQLEQAIQLFQQATELDGNYGQAFSNLGLAYQKNGN 314
AM N + + Y++G+ E A ++FQ LD + F LG Q G
Sbjct: 26 AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQ 85

Query: 315 IAEAI 319
AI
Sbjct: 86 YDLAI 90


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_1342  IGASERPTASE  64     8e-13    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.5 bits (154), Expect = 8e-13
Identities = 35/203 (17%), Positives = 67/203 (33%), Gaps = 10/203 (4%)

Query: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEE 158
E E+ Q QA+ + + ++ + A +E AE
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 159 AAKKAAADAKKKAEAEAAKA-----AVEAQKKAEAAAAALKK---KAEAAEAAAAEARKK 210
+ +++ K + +A A A EA+ +A + +E E E ++
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 211 AATEAAEKAKAEAEKKAAAEKAAADKKAAADKKAAEKAAAEKAAADKKA--AAEKAAADK 268
A E EKAK E EK K + ++ + AE A + E +
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163

Query: 269 KAAAAKAAAEKAAAAKAAAEADD 291
A + A++ ++ +
Sbjct: 1164 TTADTEQPAKETSSNVEQPVTES 1186



Score = 57.0 bits (137), Expect = 1e-10
Identities = 30/219 (13%), Positives = 82/219 (37%), Gaps = 11/219 (5%)

Query: 68 QSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127
Q+ S ++E+ ++ +E ++ + +K E+ A +
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061

Query: 128 ELKQKQ-AEEAAAKAAAD------AKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAV 180
+ ++ A+EA + A+ A++ +E E + A + ++KA+ E K
Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121

Query: 181 EAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA 240
+ ++ + ++++E + A AR+ T ++ +++ A E+ A + +
Sbjct: 1122 VPKVTSQVSPK--QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179

Query: 241 DKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEK 279
++ E + + A + ++ K
Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218



Score = 56.6 bits (136), Expect = 1e-10
Identities = 31/229 (13%), Positives = 70/229 (30%), Gaps = 9/229 (3%)

Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125
R ++E+ + + + Q+ E +E Q E + +EKE A E +K E
Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125

Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAVEAQKK 185
+++ KQ + + A+ + + +E + A + A+ + VE
Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDP-TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184

Query: 186 AEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAADKKAA 245
E E + + +++ + A
Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244

Query: 246 EKAA--------AEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 286
A +D +A A+ A + A ++ ++ +
Sbjct: 1245 STVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293



Score = 53.5 bits (128), Expect = 1e-09
Identities = 34/260 (13%), Positives = 80/260 (30%), Gaps = 9/260 (3%)

Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103
D V A + ++ ++K+ + + EQ A E + ++ A++ +
Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080

Query: 104 KQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163
+ E + ++ ++ Q K+ +K+ + K + +E ++
Sbjct: 1081 QTNEVAQSGSETKETQ-TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139

Query: 164 AADAKKKAEAEAAKAAVEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEA 223
A+ E + E Q + A + E + +
Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 224 EKKAAAEKAAADKKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAA-AKAAAEKAAA 282
E A +++K + ++ A ++ D+ A + A
Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNA 1259

Query: 283 AKAAAEADDIFGELSSGKNA 302
+ A A F L+ GK
Sbjct: 1260 VLSDARAKAQFVALNVGKAV 1279


10  APECO1_1301  APECO1_1293  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1301   -1      21       3.134786   cardiolipin synthase 2
APECO1_1300   -1      21       3.508931   metal-dependent hydrolase
APECO1_1299   -2      21       3.279005   hypothetical protein
APECO1_1298   -1      20       3.589661   hypothetical protein
APECO1_1297   -1      18       3.738266   hypothetical protein
APECO1_1296   -1      16       3.461960   ABC transporter
APECO1_1295   -1      14       3.348042   hypothetical protein
APECO1_1294   -1      13       3.075966   DNA-binding transcriptional regulator
APECO1_1293   0       12       3.176083   ATP-dependent RNA helicase RhlE
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1298  ABC2TRNSPORT  47     3e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1296  PF05272  31     0.012    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.012
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%)

Query: 298 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 357
PR E + +LG P + + + K HV +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589

Query: 358 KRGEIFG----LLGPNGAGKSTTFKMMCGL 383
+ G F L G G GKST + GL
Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.3 bits (65), Expect = 0.048
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 39 YVTGLVGPDGAGKTTLMRMLAGL 61
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1295  RTXTOXIND  62     8e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 61.8 bits (150), Expect = 8e-13
Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%)

Query: 83 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 142
Q + + +A+ +LA E + + + + L +
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 143 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 197
N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 198 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 255
E Q S + AP + V G V+ T+ + + V A V +++
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 256 QPGRKVLLYTDGRPNKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 309
G+ ++ + P Y G++ ++ A D R LV+ + I +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432

Query: 310 ----DADDALRQGMPVTVQ 324
+ + L GM VT +
Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1294  HTHTETR  72     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 9e-18
Identities = 32/220 (14%), Positives = 74/220 (33%), Gaps = 29/220 (13%)

Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSK---FISR 128
IGE E + P + +RE+++ + + + + +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 EQLSPTAAYHLVHEQVISPLHSHLTRLIAAW---TGCDASDTRMILHTHALIGEILAFRL 185
E A + + + L I A +I+ I ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM---- 175

Query: 186 GKETILLRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225
W + + + ++ ++L+
Sbjct: 176 --------ENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
APECO1_1293  SECA  30     0.026    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.026
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


11  APECO1_1278  APECO1_1253  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1278   2       13       -0.174037  threonine and homoserine efflux system
APECO1_1277   1       16       -0.086304  outer membrane protein X
APECO1_1276   -1      14       -0.663498  hypothetical protein
APECO1_1275   -1      16       0.864907   manganese transport regulator MntR
APECO1_1274   -1      16       0.803793   hypothetical protein
APECO1_1273   0       17       1.036864   hypothetical protein
APECO1_1272   -1      16       1.261205   ATP-binding component of a transport system
APECO1_1271   -1      12       1.920271   hypothetical protein
APECO1_1270   -1      12       3.095894   formate acetyltransferase 3
APECO1_1269   0       11       3.078729   pyruvate formate lyase activating enzyme
APECO1_1268   -1      12       2.888069   fructose-6-phosphate aldolase
APECO1_1267   -1      13       2.891567   molybdopterin biosynthesis protein MoeB
APECO1_1266   0       15       2.690335   molybdopterin biosynthesis protein MoeA
APECO1_1265   0       16       2.171078   L-asparaginase
APECO1_1264   1       16       -1.778697  glutathione transporter ATP-binding protein
APECO1_12632  1       15       -4.126227  peptide transporter periplasmic-binding protein
APECO1_1263   0       11       -3.961216  peptide transporter periplasmic-binding protein
APECO1_1262   0       11       -4.623089  peptide transporter membrane protein
APECO1_1261   1       11       -4.753260  peptide transporter membrane protein
APECO1_1260   1       10       -5.120756  hypothetical protein
APECO1_1259   0       9        -2.040724  diguanylate cyclase
APECO1_1258   0       10       -0.301846  ribosomal protein S12 methylthiotransferase
APECO1_1257   0       11       -0.532167  biofilm formation regulatory protein BssR
APECO1_1256   0       12       -0.096662  dehydrogenase
APECO1_1255   -1      13       -0.065373  glutathione S-transferase
APECO1_1254   1       15       -1.052994  D-alanyl-D-alanine carboxypeptidase
APECO1_1253   2       15       -0.364901  DNA-binding transcriptional repressor DeoR
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1277  ENTEROVIROMP  255    1e-90    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 255 bits (654), Expect = 1e-90
Identities = 171/171 (100%), Positives = 171/171 (100%)

Query: 3 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 62
MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 60

Query: 63 GVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT 122
GVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT
Sbjct: 61 GVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT 120

Query: 123 YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 173
YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF
Sbjct: 121 YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_1254  BLACTAMASEA  43     8e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.2 bits (102), Expect = 8e-07
Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%)

Query: 23 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 72
+ L A A P + I MD ASG+ L ADE+
Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65

Query: 73 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 132
S K++ V + A +L + + +P V D ++V +L
Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119

Query: 133 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 188
I S N A L V G + + +++G T ++T PG
Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174

Query: 189 --STARDMA------LLGKAL 201
+T MA L + L
Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195


12  APECO1_1192  APECO1_16  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1192   0       19       -3.930099  hypothetical protein
APECO1_1191   0       20       -4.137635  MFS family transporter protein
APECO1_1190   1       27       -6.548436  pyruvate formate lyase-activating enzyme 1
APECO1_1189   1       31       -7.556476  integrase
APECO1_1188   -1      24       -2.663465  Phage protein C
APECO1_4460   -1      29       -9.634639  Cox protein
APECO1_4461   0       23       -8.075772  hypothetical protein
APECO1_4462   0       19       -4.988629  hypothetical protein
APECO1_4463   0       18       -3.903489  relication initiation protein
APECO1_4464   0       17       -3.042415  phage replication initiation protein
APECO1_4465   0       20       -3.118964  hypothetical protein
APECO1_4466   1       34       4.644524   phage capsid protein
APECO1_4467   2       34       5.217020   phage terminase
APECO1_4468   1       32       5.112700   phage capsid scaffolding protein
APECO1_4469   1       34       6.277170   major capsid protein
APECO1_4470   2       34       7.438247   small terminase subunit
APECO1_4471   2       37       6.127920   capsid completion protein
APECO1_4472   2       34       5.445225   phage lysis holin
APECO1_4473   3       29       6.132249   phage lysin
APECO1_4474   3       30       5.973950   LysA protein
APECO1_4475   4       28       5.875111   LysB protein
APECO1_4476   3       26       5.535051   tail protein
APECO1_4477   -1      17       3.030319   tail protein
APECO1_4478   0       16       1.841455   phage baseplate protein
APECO1_4479   0       21       -4.356159  phage baseplate protein
APECO1_4480   0       20       -4.383164  baseplate assembly protein
APECO1_4481   1       25       -5.723773  tail protein
APECO1_4482   2       27       -6.255803  tail fiber protein
APECO1_44822  4       27       -6.488338  tail fiber assembly protein
APECO1_1      4       25       -5.156836  phage-associated protein
APECO1_5      3       25       0.911343   hypothetical protein
APECO1_6      0       22       2.337192   hypothetical protein
APECO1_7      -1      21       3.545628   hypothetical protein
APECO1_8      -2      22       4.356706   hypothetical protein
APECO1_9      -1      23       4.158977   hypothetical protein
APECO1_10     1       25       2.548900   hypothetical protein
APECO1_11     -1      21       0.834546   hypothetical protein
APECO1_12     0       23       -0.988436  gpU phage protein
APECO1_13     1       22       -1.516223  hypothetical protein
APECO1_14     1       25       -2.372796  DNA-binding transcriptional regulator
APECO1_15     0       22       -2.346390  protein PflB
APECO1_16     -3      14       -3.042547  formate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1192  ISCHRISMTASE  39     8e-06    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 39.2 bits (91), Expect = 8e-06
Identities = 30/159 (18%), Positives = 53/159 (33%), Gaps = 20/159 (12%)

Query: 40 RLDKNDAAVLLVDHQAGLLSLVRDIEP--DKFKNNVLALGDLAKYFNLPTILTT---SFE 94
D N A +L+ D Q + + N+ L + +P + T S
Sbjct: 25 VPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQN 84

Query: 95 TGPNGPLV----PELKAQFPDAPYIAR----PGNI-------NAWDNEDFVKAVKATGKK 139
L P L + + I ++ +A+ + ++ ++ G+
Sbjct: 85 PDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRD 144

Query: 140 QLIIAGVVTEVCVAFPALSAIEEGFDVFVVTDASGTFNE 178
QLII G+ + A A E F V DA F+
Sbjct: 145 QLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL 183


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1189  PERTACTIN  30     0.017    Pertactin signature.
		>PERTACTIN#Pertactin signature.

Length = 922

Score = 30.1 bits (67), Expect = 0.017
Identities = 13/49 (26%), Positives = 24/49 (48%)

Query: 9 GRYEVDVRPQGADGKRIRRKFKTKGEAQAFERHVLVNYHNKEWLEKPAD 57
R E D + G+DG ++ K++T G + E + + +LE A+
Sbjct: 751 SRLENDFKVAGSDGYAVKGKYRTHGVGVSLEAGRRFAHADGWFLEPQAE 799


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
APECO1_4461  SECA  28     0.017    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 28.3 bits (63), Expect = 0.017
Identities = 11/72 (15%), Positives = 26/72 (36%)

Query: 11 VEKQPAAMRRIIGKHLAVPRWQDTCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDV 70
+ ++ L + W D ++ RER+ +++ + E M
Sbjct: 703 IPGLQERLKNDFDLDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHF 762

Query: 71 ERERLVCAIDEL 82
E+ ++ +D L
Sbjct: 763 EKGVMLQTLDSL 774


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_4470  PF06872  28     0.033    EspG protein
		>PF06872#EspG protein

Length = 398

Score = 28.1 bits (62), Expect = 0.033
Identities = 23/95 (24%), Positives = 37/95 (38%), Gaps = 5/95 (5%)

Query: 122 PPYMFTEEVALAAMRAHAAGESVDTRLLTDTLELTATADMPDEVRAKLHKITGLFLRDAG 181
P M T ++ A+ A S+D + +T +++ V + TG+ +
Sbjct: 298 PALMLTHV-RISQASAYNAQRSLDMPNACINISITQSSEGSIHVTSH----TGVLIMAPE 352

Query: 182 DAAGALAHLQRATQLDCQAGVKKEIERLERELKPK 216
D L L T + GVK E + R LK K
Sbjct: 353 DRPNQLGMLTNRTSYEVPPGVKCEPNEMARMLKAK 387


ORFs having significant similarity with Known Virulence factors
LocusTag   Hits       Score  E-value  Comments
APECO1_11  RTXTOXIND  33     0.005    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.005
Identities = 22/173 (12%), Positives = 58/173 (33%), Gaps = 8/173 (4%)

Query: 8 QVLLRAVDQASRPFKSIRTASKSLSGDIRETQKSLRELNGQASRIEGFRKTSAQLAVTGH 67
VLL+ + + ++S R Q + L+ + +
Sbjct: 122 DVLLKLTALGAE---ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 68 ALEKARQEAEALATQFKNTERPTRAQAKV-LESAKRAAEDLQAKYNRLTDSVKRQQRELA 126
E+ +L + +T + + Q ++ L+ + + A+ NR + + ++ L
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 127 AVGINTRNLAHDELGLKNRISETTAQLNRQRDALARVSAQQAKLNAVKQRYQA 179
+L H + K+ + E + + L +Q ++ + +
Sbjct: 239 DF----SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287


13  APECO1_95  APECO1_128  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_95     0       15       4.051627   TrpR binding protein WrbA
APECO1_96     0       18       4.131690   transporter
APECO1_97     0       21       4.626806   hypothetical protein
APECO1_98     -1      18       4.544019   oxidoreductase, flavin:NADH component
APECO1_99     0       14       3.535502   hypothetical protein
APECO1_100    -1      11       3.939354   hydrolase
APECO1_101    -1      12       3.373918   hypothetical protein
APECO1_102    -1      12       3.338325   hypothetical protein
APECO1_103    -2      14       2.731418   monooxygenase YcdM
APECO1_104    -1      15       2.583217   transcriptional regulator YcdC
APECO1_105    -2      14       2.661122   trifunctional transcriptional regulator/proline
APECO1_106    -1      14       0.716166   proline:sodium symporter
APECO1_107    -2      13       -0.794943  hypothetical protein
APECO1_108    -2      17       -3.001424  hypothetical protein
APECO1_110    -2      22       -3.791019  hypothetical protein
APECO1_111    -1      27       -6.256157  hypothetical protein
APECO1_112    -1      28       -6.367532  PGA biosynthesis protein
APECO1_113    -1      26       -6.001936  N-glycosyltransferase
APECO1_114    -1      27       -5.658544  outer membrane N-deacetylase
APECO1_115    -1      23       -4.525086  outer membrane protein PgaA
APECO1_116    -1      19       -3.928899  diguanylate cyclase
APECO1_118    -2      16       -1.157139  *dehydrogenase
APECO1_119    -1      15       -2.229380  hydrolase
APECO1_120    0       17       -3.558754  hypothetical protein
APECO1_121    0       21       -5.380545  hypothetical protein
APECO1_122    1       25       -5.973388  assembly /transport component in curli
APECO1_123    2       31       -7.742105  curli assembly protein CsgF
APECO1_124    2       31       -7.396318  curli assembly protein CsgE
APECO1_125    2       28       -5.687728  DNA-binding transcriptional regulator CsgD
APECO1_126    0       22       -3.180445  curlin minor subunit
APECO1_127    0       18       -3.578105  cryptic curlin major subunit
APECO1_128    -1      16       -3.513964  autoagglutination protein
ORFs having significant similarity with Known Virulence factors
LocusTag   Hits     Score  E-value  Comments
APECO1_97  TCRTETB  28     0.032    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 27.9 bits (62), Expect = 0.032
Identities = 20/114 (17%), Positives = 34/114 (29%), Gaps = 11/114 (9%)

Query: 50 PFAQTAVMGVQHAVAMFGATVLMPILMGLDPNLSILMSGIGTLL--------FFFITGGR 101
PF + G + G ++P +M LS G + F +I G
Sbjct: 257 PFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316

Query: 102 VPSYLGSSAAFVGVVIAATGFNGQGINPNISIALGGIIACGLVYTVIGLVVMKI 155
V +GV + F + + +V+ + GL K
Sbjct: 317 VDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW---FMTIIIVFVLGGLSFTKT 367


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_102  ISCHRISMTASE  76     2e-18    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 75.8 bits (186), Expect = 2e-18
Identities = 44/176 (25%), Positives = 70/176 (39%), Gaps = 23/176 (13%)

Query: 26 TFDPQQTALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARTAGMLIIWFQ 85
DP + L++ DMQN + +D S + ANI+ G+ +++
Sbjct: 25 VPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQLGIPVVY-- 76

Query: 86 NGWDEQYVEAGGPGSPNYHKSNALKTMRNQPLLQGKLLAKGSWDYQLVDELVPQPGDIVL 145
PGS N L G L G ++ +++ EL P+ D+VL
Sbjct: 77 ---------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELAPEDDDLVL 121

Query: 146 PKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDA 201
K RYS F T L ++R G L+ TGI ++ T + F + + DA
Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_104  HTHTETR  66     2e-15    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 66.2 bits (161), Expect = 2e-15
Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 8/165 (4%)

Query: 10 GKRSRAVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAV 69
K + ++ IL AL FSQ G T L +IA+ AGV++ + ++F K L+ +
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 LRQILDIWLAPLKAFREDF--APLAAIKEYIRLKLEVSRDYPQASRLFCM-----EMLAG 122
++ F PL+ ++E + LE + + L + E +
Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122

Query: 123 APLLMDELTGDLKALIDEKSALIAGWVKSGKL-APIDPQHLIFMI 166
++ D + +++ L A + + ++
Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_115  ARGDEIMINASE  31     0.030    Bacterial arginine deiminase signature.
		>ARGDEIMINASE#Bacterial arginine deiminase signature.

Length = 409

Score = 30.6 bits (69), Expect = 0.030
Identities = 26/181 (14%), Positives = 59/181 (32%), Gaps = 22/181 (12%)

Query: 451 HRAAENELKKAEVIEPRNINLEVEQAWTALTLQEWQQA--AVLTHDVVEREPQDPGVV-R 507
A + A +++ + +E + + L ++ ++E E + +
Sbjct: 49 EVARQEHEVFASILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINL 108

Query: 508 LK---RAVDVHNLAELRIAGSTGIDAEGPDSGKHDVDLTTIVYS---PPLKDNWRGFAGF 561
LK ++ + N+ I+G E + DL P+ + F
Sbjct: 109 LKDYFSSLTIDNMISKMISGVVT--EELKNYTSSLDDLVNGANLFIIDPMPNVL--FT-- 162

Query: 562 GYADGQFSEGKGIVRDWLAGVEWRSRNIWLEAEYAERVFNHEHKPGARLSGWYDFNDNWR 621
D S G G+ + + + R E +AE +F + + W + +
Sbjct: 163 --RDPFASIGNGVT---INKMFTKVRQ--RETIFAEYIFKYHPVYKENVPIWLNRWEEAS 215

Query: 622 I 622
+
Sbjct: 216 L 216


14  APECO1_149  APECO1_166  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_149    2       15       1.199402   hypothetical protein
APECO1_150    2       14       1.294132   hypothetical protein
APECO1_151    1       11       0.775218   hypothetical protein
APECO1_152    1       17       0.746174   flagella synthesis protein FlgN
APECO1_153    2       15       0.876470   anti-sigma-28 factor FlgM
APECO1_154    2       15       2.143227   flagellar basal body P-ring biosynthesis protein
APECO1_155    2       16       2.304269   flagellar basal body rod protein FlgB
APECO1_156    4       14       2.217887   flagellar basal body rod protein FlgC
APECO1_157    3       13       2.356063   flagellar basal body rod modification protein
APECO1_158    1       13       2.323437   flagellar hook protein FlgE
APECO1_159    -1      12       2.318488   flagellar basal body rod protein FlgF
APECO1_160    -1      10       1.147453   flagellar basal body rod protein FlgG
APECO1_161    0       13       2.177103   flagellar basal body L-ring protein
APECO1_162    0       12       1.947410   flagellar basal body P-ring protein
APECO1_163    1       13       1.643936   flagellar rod assembly protein/muramidase FlgJ
APECO1_164    2       13       1.197015   flagellar hook-associated protein FlgK
APECO1_165    3       15       1.131492   flagellar hook-associated protein FlgL
APECO1_166    3       15       1.341718   ribonuclease E
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_158  FLGHOOKAP1  41     6e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.1 bits (96), Expect = 6e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 36.9 bits (85), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_160  FLGHOOKAP1  44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 41.1 bits (96), Expect = 3e-06
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_161  FLGLRINGFLGH  350    e-126    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 350 bits (898), Expect = e-126
Identities = 232/232 (100%), Positives = 232/232 (100%)

Query: 4 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 63
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 64 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 123
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 124 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 183
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 184 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 235
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_162  FLGPRINGFLGI  425    e-151    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 425 bits (1095), Expect = e-151
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 5 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 64
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 65 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 124
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 125 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 184
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 185 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 240
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 241 QNMQVNVTPQDAKVVINLRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 300
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 301 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 360
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 361 AKL 363
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_163  FLGFLGJ  507    0.0      Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 507 bits (1306), Expect = 0.0
Identities = 310/313 (99%), Positives = 311/313 (99%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSERTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120
LFSSE TRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGNSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPG+SKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAVSAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTA SAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313
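
The percentages on each "Identities" line are rounded ratios of matched positions to alignment length. The following is a minimal Python sketch for reading such a line and recomputing the ratio. It assumes the BLAST-style text layout shown in these reports; `parse_identities` and `percent` are hypothetical helper names and are not part of PredictBias.

```python
import re

# Hypothetical helper, not part of PredictBias: matches the
# "Name = matches/length (NN%)" groups found on an Identities line.
_PAIR = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_identities(line):
    """Return {field: (matches, aligned_length, reported_percent)}."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in _PAIR.findall(line)
    }


def percent(matches, length):
    """Recompute the percentage as a float from the raw counts."""
    return 100.0 * matches / length


# Line copied from the FlgJ hit above; it has no Gaps field.
stats = parse_identities("Identities = 310/313 (99%), Positives = 311/313 (99%)")
matches, length, reported = stats["Identities"]
print(matches, length, reported, round(percent(matches, length), 1))
```

Fields that are absent from a line (for example Gaps in an ungapped hit) are simply missing from the returned dictionary.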


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_164  FLGHOOKAP1  682    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 682 bits (1761), Expect = 0.0
Identities = 545/546 (99%), Positives = 545/546 (99%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSSADPSRTTVAYVDRTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPSSADPSRTTVAYVD TAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 361
ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
APECO1_165  FLAGELLIN  46     8e-08    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 46.2 bits (109), Expect = 8e-08
Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126
E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAGYKTETAPFSEADGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186
T NG + + DG E+I + +G G + +
Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232
+ T A + + TA DK
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_166  IGASERPTASE  64     3e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.9 bits (155), Expect = 3e-12
Identities = 41/226 (18%), Positives = 79/226 (34%), Gaps = 12/226 (5%)

Query: 590 PAEQSAPKAEAKPERQQDRR-----KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRR 644
P+ S + A+ + N ++++++ D E +NR
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 645 QAQQQTAETRESRQQAEV------TEKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALN 698
A++ + + + Q EV T++ +TT+ ++ E+ + + + Q+ K +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK-VT 1126

Query: 699 VEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEA 758
+ QE + + + R +N K Q+ P E + E V E+
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 759 PAPRTELVKVPLPVVAQAAPEQQEENNADNRDNGGMPRRSRRSPRH 804
T V P A Q N+ + RRS RS H
Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232



Score = 62.0 bits (150), Expect = 1e-11
Identities = 46/287 (16%), Positives = 91/287 (31%), Gaps = 35/287 (12%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAATATPASPAQPGLL 571
P E+ + DVP P+ E A AP P A ATP+ +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038

Query: 572 SRFFGALKALFSGGEETKPAEQSAPKAEAKPERQQDRRKP-RQNNRRDRNERRDTRSER- 629
AE S +++ + +QD + QN + + + ++
Sbjct: 1039 -----------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 630 -TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKR 688
E + + E + + ++TA + + TEK + + + + + +
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 689 QAQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAP 743
QA+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 744 VVEETVAAEPIVQEAPAPRTELVKVPLPVVAQAAPEQQEENNADNRD 790
+P V + K ++ P E + D
Sbjct: 1200 ENTTPATTQPTVNSESS---NKPKNRHRRSVRSVPHNVEPATTSSND 1243


15  APECO1_218  APECO1_290  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias     Product
APECO1_218  2       19       -2.823146   isocitrate dehydrogenase
APECO1_219  2       25       -3.827715   prophage lambda integrase
APECO1_220  5       25       -3.265101   hypothetical protein
APECO1_221  4       24       -4.422209   exonuclease encoded by prophage CP-933K
APECO1_222  5       25       -5.418773   recombination protein Bet of prophage
APECO1_223  6       36       -8.849660   host-nuclease inhibitor protein Gam of
APECO1_224  7       39       -10.103150  Kil protein
APECO1_225  9       39       -9.095071   ssDNA-binding protein
APECO1_226  7       40       -8.491427   superinfection exclusion protein B of prophage
APECO1_227  8       39       -7.783923   N protein
APECO1_228  8       34       -7.437463   hypothetical protein
APECO1_229  5       28       -4.711309   hypothetical protein
APECO1_230  4       28       -3.406661   CI repressor of bacteriophage
APECO1_231  1       30       -2.619402   hypothetical protein
APECO1_232  1       28       -1.454028   regulatory protein CII of bacteriophage
APECO1_233  1       29       -2.048361   replication protein O of bacteriophage
APECO1_234  0       30       -0.884550   replication protein P of bacteriophage
APECO1_235  3       30       -2.869279   exclusion protein ren of prophage
APECO1_236  2       26       -5.061159   hypothetical protein
APECO1_237  2       25       -5.464175   DNA N-6-adenine-methyltransferase of
APECO1_238  2       25       -5.907082   hypothetical protein
APECO1_239  1       26       -5.961061   endodeoxyribonuclease RUS
APECO1_240  2       29       -6.783246   antitermination protein Q-like protein
APECO1_241  3       27       -5.977532   outer membrane porin protein NmpC
APECO1_242  1       24       -3.749645   bacteriophage lambda lysozyme-like protein
APECO1_243  2       20       -1.476301   Rz endopeptidase from lambdoid prophage
APECO1_244  1       23       0.274089    lambda prophage Bor protein
APECO1_245  2       23       2.761076    truncated TonB-like membrane protein encoded
APECO1_246  1       22       3.355503    hypothetical protein
APECO1_247  1       21       3.055962    prophage Qin DNA packaging protein NU1-like
APECO1_248  2       22       3.357780    DNA packaging protein of prophage; terminase
APECO1_249  2       25       4.336182    capsid protein of prophage
APECO1_250  3       23       4.271831    head-tail preconnector protein gp5 of
APECO1_251  2       23       1.996936    hypothetical protein
APECO1_252  3       23       2.611454    capsid protein of prophage
APECO1_253  4       27       3.673568    hypothetical protein
APECO1_254  3       27       3.886133    head-tail joining protein of prophage
APECO1_255  3       27       5.188893    tail fiber component Z of prophage
APECO1_256  4       28       5.401753    tail component of prophage
APECO1_257  3       28       5.405762    tail component of prophage CP-933X
APECO1_258  4       29       5.740796    tail component of prophage
APECO1_259  4       27       6.239867    tail component of prophage
APECO1_260  4       27       6.250852    tail component of prophage
APECO1_261  5       27       5.016589    minor tail protein
APECO1_262  2       25       2.561278    tail component of prophage
APECO1_263  3       26       2.046728    tail fiber component K of prophage
APECO1_265  2       24       1.336120    tail component of prophage
APECO1_266  2       23       0.272942    tail component of prophage
APECO1_267  2       23       -2.690383   tail component of prophage
APECO1_268  1       23       -2.319333   hypothetical protein
APECO1_269  2       19       -1.724095   hypothetical protein
APECO1_270  1       20       -3.967948   hypothetical protein
APECO1_271  0       21       -5.238697   Mn+2/Fe+2 ABC transporter inner membrane subunit
APECO1_272  0       24       -5.494927   Mn+2/Fe+2 ABC transporter inner membrane subunit
APECO1_273  0       29       -7.310680   Mn+2/Fe+2 ABC transporter ATPase SitB
APECO1_274  2       33       -9.145780   Mn+2/Fe+2 ABC transporter substrate-binding
APECO1_275  2       37       -11.215053  hypothetical protein
APECO1_276  -1      32       -9.175457   hypothetical protein
APECO1_277  0       34       -9.218412   transcriptional regulator
APECO1_278  0       32       -9.469768   hypothetical protein
APECO1_279  -1      29       -9.618494   hypothetical protein
APECO1_280  0       29       -7.750585   hypothetical protein
APECO1_281  1       25       -5.976093   hypothetical protein
APECO1_282  1       27       -5.750302   hypothetical protein
APECO1_283  -2      20       -3.111587   hypothetical protein
APECO1_284  -1      20       -2.616078   hypothetical protein
APECO1_285  -2      20       -3.021334   hypothetical protein
APECO1_286  -1      18       -4.269287   hypothetical protein
APECO1_287  0       21       -5.069565   cell division topological specificity factor
APECO1_288  -1      18       -3.537959   cell division inhibitor MinD
APECO1_289  -3      20       -4.124027   septum formation inhibitor
APECO1_290  -2      20       -4.882032   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_225  UREASE  29     0.007    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 28.6 bits (64), Expect = 0.007
Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 7/66 (10%)

Query: 57 IMLAQHALLIAISSDLNAYGVVCEFDWN----DGNGQEGWPPMDGSEGIRITD---IDTS 109
+ LA L I + D +G +F DG GQ G+ IT+ +D
Sbjct: 22 VRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTREGGAVDTVITNALILDHW 81

Query: 110 GIFDSD 115
GI +D
Sbjct: 82 GIVKAD 87


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_234  FLGMOTORFLIG  27     0.043    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 27.5 bits (61), Expect = 0.043
Identities = 17/77 (22%), Positives = 27/77 (35%), Gaps = 11/77 (14%)

Query: 2 KNIAAQMVNFDREQM-----------RRIANNMPEQYDEKPQVQQVAQIINGVFSQLLAT 50
N+A ++ DR +++A+ E Y V V +IIN +
Sbjct: 165 TNVARRIALMDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKF 224

Query: 51 FPASLANRDQNELNEIR 67
SL D EI+
Sbjct: 225 IIESLEEEDPELAEEIK 241


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_241  ECOLIPORIN  508    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 508 bits (1310), Expect = 0.0
Identities = 241/388 (62%), Positives = 280/388 (72%), Gaps = 33/388 (8%)

Query: 21 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYAR 80
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 81 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 140
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 141 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 200
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 201 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTDGQVAYGK 243
D+ NGDGFG STTY+ GF GA Y SDRT+ QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 244 SKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEAV 297
+ A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFE
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 298 AQYQFDFGLRPSVAYLQSKGKDLGVH----GDRDLVKYVDVGATYYFNKNMSTFVDYKIN 353
AQYQFDFGLRP+V++L SKGKDL + D+DLVKY DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 354 LID-DSKFTKTAGIDTDDIVAVGLVYQF 380
L+D D F K AGI TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_244  PF06291  186    3e-65    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 186 bits (473), Expect = 3e-65
Identities = 102/102 (100%), Positives = 102/102 (100%)

Query: 1 MQDNKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAA 60
MQDNKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAA
Sbjct: 1 MQDNKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAA 60

Query: 61 KICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102
KICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ
Sbjct: 61 KICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_245  TONBPROTEIN  69     2e-17    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 68.9 bits (168), Expect = 2e-17
Identities = 33/82 (40%), Positives = 46/82 (56%)

Query: 41 ADEPRQLVTVYPRYPEYAAANYIKGLVEVKFDIGADGTVTRIVFLRSEPHNLFRDEVVKA 100
A PR L P+YP A A I+G V+VKFD+ DG V + L ++P N+F EV A
Sbjct: 150 ASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNA 209

Query: 101 MAKWRFEKNRPCQGVKRQFIFT 122
M +WR+E +P G+ +F
Sbjct: 210 MRRWRYEPGKPGSGIVVNILFK 231


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_248  2FE2SRDCTASE  31     0.011    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 31.2 bits (70), Expect = 0.011
Identities = 10/41 (24%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 316 TRDGLMFFSARGDEIPPPRSITFHIWTAYSPFTTWVQIVYD 356
R+ L+ F R DE P ++T W++ + ++ + + D
Sbjct: 36 HREHLLEF-IRLDEPAPLNAMTLAQWSSPNVLSSLLAVYSD 75


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_260  GPOSANCHOR  41     2e-05    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 41.2 bits (96), Expect = 2e-05
Identities = 56/377 (14%), Positives = 125/377 (33%), Gaps = 36/377 (9%)

Query: 236 SGLTAMARQFHNVTAEQIAYVAQLQRSGDESGALQAANEAATKGFDDQTRRLKENMGTLE 295
S R+ +E+ + + +L+ + + + + L+ L
Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154

Query: 296 TWADRTARAFKSMWDAVLDI-GRPDTAQEMLIKAEAAFKKADDIWNLRKDDYFVNDEARA 354
+A + + + T + EA + + + +
Sbjct: 155 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 214

Query: 355 RYWDDREKARLALEAARK-KAEQQSQQDKNAQQQSDTEASRLKYTEEAQKAYERLQTPLE 413
++ K + ++ + EA + + + L+ +
Sbjct: 215 TLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 274

Query: 414 KYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSGVKVSAGDRQEDSAHA 473
TA ++ + L+A+ L + A + + R D++
Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLR----------RDLDASRE 323

Query: 474 ALLTLQAELRTLEKHAGANEKISQQ-RRDL-------WKAESQFAVLEEAAQRRQLSAQE 525
A L+AE + LE+ +E Q RRDL + E++ LEE + + S Q
Sbjct: 324 AKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 383

Query: 526 KS--LLAHKDETLEYKRQLAALGDKVTYQERLNALAQQADKFAQQQRAKRAAIDAKSRGL 583
L A ++ + ++ L K+ E+LN +++ K ++++A
Sbjct: 384 LRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKA------------ 431

Query: 584 TDRQAEREATEQRLKEQ 600
+ QA+ EA + LKE+
Sbjct: 432 -ELQAKLEAEAKALKEK 447


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_265  PF06291  28     0.014    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 27.7 bits (61), Expect = 0.014
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 5/40 (12%)

Query: 135 MTGILFSLGASMVLGGVAQML-----APKARTPRTQTTDN 169
M +LFS +M++ G AQ P A TP+ T +
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_268  IGASERPTASE  41     2e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 2e-05
Identities = 27/132 (20%), Positives = 56/132 (42%), Gaps = 15/132 (11%)

Query: 123 SQSAAAAKKSETAAASSRNA--AKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARA 180
S + A+ E A ++T+ET A NS + + + + Q+A + N
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ---NREV 1068

Query: 181 SEEASADSEEASRRN--AESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAA 238
++EA ++ + ++ N A+S +E +E TK ++ A EK
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSE--------TKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 239 DRAEVAAEVTAE 250
+ +V ++V+ +
Sbjct: 1121 EVPKVTSQVSPK 1132


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score  E-value  Comments
APECO1_274  adhesinb  329    e-115    Adhesin B signature.
		>adhesinb#Adhesin B signature.

Length = 310

Score = 329 bits (846), Expect = e-115
Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%)

Query: 9 MLLGGLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68
+G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P
Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72

Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121
P D+K+ A LI NG+NLE WF + ++ VS GV + +
Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132

Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181
+GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++
Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192

Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241
+P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++
Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252

Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297
F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+
Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308
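
The Expect value for this hit is printed as "e-115", with no leading mantissa, which a plain float conversion rejects. The following is a minimal Python sketch for normalising such values and comparing them with the E-value (RPSBlast) input parameter of 0.05 listed for this run. The function names are hypothetical, and treating the cutoff as inclusive is an assumption, since the page does not say how the tool applies it.

```python
# Hypothetical helpers, not part of PredictBias.

E_VALUE_CUTOFF = 0.05  # "E-value (RPSBlast)" input parameter reported for this run


def parse_expect(text):
    """Convert a BLAST-style Expect string to a float.

    BLAST text output may abbreviate 1e-115 as "e-115", so a bare
    leading "e" is given an explicit mantissa of 1 before conversion.
    """
    text = text.strip()
    if text.lower().startswith("e"):
        text = "1" + text
    return float(text)


def passes_cutoff(expect_text, cutoff=E_VALUE_CUTOFF):
    """True when the hit's E-value is at or below the cutoff (assumed inclusive)."""
    return parse_expect(expect_text) <= cutoff


# Expect strings copied from hits in this report.
for raw in ["e-115", "3e-65", "0.043", "0.0"]:
    print(raw, parse_expect(raw), passes_cutoff(raw))
```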


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_286  PRTACTNFAMLY  42     9e-08    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 42.0 bits (98), Expect = 9e-08
Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 1/101 (0%)

Query: 8 TRSIYRELGATLSYNMRLGNGMEIEPWLKAAVRKEFVDDNRVKVNSDGNFINDLSGRRGI 67
S+ LG + + L G +++P++KA+V +EF V N + +L G R
Sbjct: 811 GSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELRGTRAE 869

Query: 68 YQAGIKASFSSTLSGHLGVGYSRGAGVESPWNAVVGVNWSF 108
G+ A+ S + YS+G + PW G +S+
Sbjct: 870 LGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910


16  APECO1_345  APECO1_356  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias     Product
APECO1_345  -1      20       -3.691928   transposase
APECO1_346  -1      20       -3.500980   **formyltetrahydrofolate deformylase
APECO1_347  -1      25       -4.297520   hypothetical protein
APECO1_348  -1      19       -2.741223   hypothetical protein
APECO1_349  0       22       -2.290386   response regulator of RpoS
APECO1_350  0       24       -2.328026   UTP--glucose-1-phosphate uridylyltransferase
APECO1_351  0       25       -2.725759   global DNA-binding transcriptional dual
APECO1_352  1       21       -3.030680   thymidine kinase
APECO1_354  0       19       -2.792361   transposase InsG for insertion sequence element
APECO1_355  0       22       -3.155287   bifunctional acetaldehyde-CoA/alcohol
APECO1_356  0       14       -3.509105   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits  Score  E-value  Comments
APECO1_347  SECA  57     2e-12    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 57.2 bits (138), Expect = 2e-12
Identities = 16/28 (57%), Positives = 20/28 (71%)

Query: 125 IDGTRPQFGRNDPCPCGSGKKFKKCCGQ 152
+ GRNDPCPCGSGKK+K+C G+
Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_349  HTHFIS  87     4e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 87.2 bits (216), Expect = 4e-21
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 3/152 (1%)

Query: 10 ILIVEDEQVFRSLLDSWFSSLGATTVLAADGVDALELLGGFTPDLMICDIAMPRMNGLKL 69
IL+ +D+ R++L+ S G + ++ + DL++ D+ MP N L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 70 LEHIRNSGDQTPVLVISATENMADIAKALRLGVEDVLLKPVKDLNRLREMVFACLYPSMF 129
L I+ + PVLV+SA KA G D L KP DL L ++ L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122

Query: 130 NSRVEEEERLFRDWDAMVDNPAAAAKLLQELQ 161
R + E +D +V AA ++ + L
Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154


17  APECO1_365  APECO1_417  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag    DNBias  CDNBias  %GCBias     Product
APECO1_365  -2      21       -4.334109   hypothetical protein
APECO1_366  -1      20       -4.084728   voltage-gated potassium channel
APECO1_367  2       17       -2.018590   YciI-like protein
APECO1_368  1       17       -2.903331   transporter
APECO1_369  0       17       -3.432319   acyl-CoA thioester hydrolase
APECO1_370  0       18       -4.047653   intracellular septation protein A
APECO1_371  0       21       -4.369328   hypothetical protein
APECO1_372  1       23       -4.593639   outer membrane protein W
APECO1_373  3       23       -4.377100   integrase for prophage CP-933O
APECO1_374  2       22       -2.822549   exonuclease VIII, ds DNA exonuclease encoded by
APECO1_375  2       29       -2.911439   hypothetical protein
APECO1_376  1       30       -4.998353   repressor protein encoded within prophage
APECO1_377  1       33       -6.268567   hypothetical protein
APECO1_378  1       33       -6.595249   hypothetical protein
APECO1_379  1       32       -6.443325   hypothetical protein
APECO1_380  0       29       -6.503507   hypothetical protein
APECO1_381  -1      27       -6.251896   hypothetical protein
APECO1_382  -1      24       -3.107779   hypothetical protein
APECO1_383  0       19       0.441227    hypothetical protein
APECO1_384  1       21       1.146675    hypothetical protein
APECO1_385  1       22       0.572081    hypothetical protein
APECO1_386  1       27       -0.557720   endodeoxyribonuclease RusA
APECO1_387  2       26       -0.271875   cryptic prophage CP-933M antitermination protein
APECO1_388  3       28       -0.410761   DNA adenine methyltransferase encoded by
APECO1_389  3       25       -0.991978   **lambdoid prophage DLP12 lysis protein S-like
APECO1_390  2       28       -4.886730   hypothetical protein
APECO1_391  1       26       -5.055165   hypothetical protein
APECO1_392  1       20       -2.675117   phage lysozyme
APECO1_393  1       23       -0.974130   hypothetical protein
APECO1_394  1       23       1.412148    bacteriophage lambda cell lysis protein
APECO1_395  1       23       1.537196    hypothetical protein
APECO1_396  1       23       2.851695    prophage Qin DNA packaging protein NU1-like
APECO1_397  1       24       3.771316    DNA packaging protein of prophage; terminase
APECO1_398  2       27       5.185554    capsid protein of prophage
APECO1_399  2       25       4.798106    capsid assembly protein of prophage
APECO1_400  3       26       2.273090    hypothetical protein
APECO1_401  3       23       2.247660    capsid protein of prophage
APECO1_402  3       25       4.481778    hypothetical protein
APECO1_403  2       25       4.630769    head-tail joining protein of prophage
APECO1_404  2       26       4.544163    minor tail protein
APECO1_405  3       26       5.351090    tail component of prophage
APECO1_406  3       27       5.648988    hypothetical protein
APECO1_407  3       26       5.113099    tail component of prophage CP-933O
APECO1_408  3       26       4.422593    minor tail protein
APECO1_409  3       25       4.217669    minor tail protein
APECO1_410  1       22       3.631478    tail fiber component K of prophage
APECO1_411  1       21       2.761677    tail component of prophage CP-933K
APECO1_412  0       20       1.546732    hypothetical protein
APECO1_413  0       21       1.997310    tail component of prophage
APECO1_414  0       25       -1.109425   hypothetical protein
APECO1_415  0       27       -2.304095   tail fiber protein
APECO1_416  1       24       -5.388330   phage-related tail fiber assembly protein G
APECO1_417  0       17       -3.205136   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_367  adhesinmafb  31     4e-04    Neisseria meningitidis: adhesin MafB signature.
		>adhesinmafb#Neisseria meningitidis: adhesin MafB signature.

Length = 467

Score = 31.2 bits (70), Expect = 4e-04
Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%)

Query: 41 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97
P+PA G GS E + EA W +P A V +V KV
Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_368  TONBPROTEIN  253    1e-87    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 253 bits (648), Expect = 1e-87
Identities = 234/239 (97%), Positives = 236/239 (98%), Gaps = 1/239 (0%)

Query: 6 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 65
MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA
Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60

Query: 66 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQ-PKRDVKPVESR 124
VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV++ PKRDVKPVESR
Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120

Query: 125 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 184
PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF
Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180

Query: 185 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 243
DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ
Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_388  FbpA_PF05833  30     0.013    Fibronectin-binding protein
		>FbpA_PF05833#Fibronectin-binding protein

Length = 577

Score = 30.2 bits (68), Expect = 0.013
Identities = 13/56 (23%), Positives = 24/56 (42%), Gaps = 4/56 (7%)

Query: 204 ESDYLKLQA--LFARVAEEKHR-RGELEKLHHQLVDTYTSLN-RQYAELLSEYKHL 255
+SD LK ++ L V +R + + L++ L + Y ELL+ +
Sbjct: 293 KSDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYA 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_407  GPOSANCHOR  38     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.7 bits (87), Expect = 2e-04
Identities = 38/231 (16%), Positives = 71/231 (30%), Gaps = 12/231 (5%)

Query: 377 TLQSDMEKAGELAARDRAERESSQLKYTGEAQKAYERLQTPLDKYTARQKELNKALKDGK 436
+ ++ + E D + + ++ + L+ AR+ +L KAL+
Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 168

Query: 437 ILQADYNTLMASAKKDYESTLKKPSGVKVSAGERQEDRAHAALLALETELRTLEKHSGVN 496
SAK K + + E+ + A A +++TLE
Sbjct: 169 NFSTAD-----SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 223

Query: 497 E---KISQQRRDLWEAESQYVVLKEAATKRQLSEQEKSLLAHEKETLEYKRQLAELGDKI 553
++ + S K + + + E EK KI
Sbjct: 224 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283

Query: 554 E-HQKRLNELAQQAARFEQQQSAKQAAISAKARGLTDRQAQRESEEQRLRE 603
+ + L + A E Q A + R L A RE+++Q E
Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL---DASREAKKQLEAE 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_411  PF06291  27     0.032    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.5 bits (58), Expect = 0.032
Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 128 ILFSMGAAMTLGGVAQML-----APKARTPRTQTTDN 159
+LFS AM + G AQ P A TP+ T +
Sbjct: 9 MLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_412  PHAGEIV  30     0.001    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 30.3 bits (68), Expect = 0.001
Identities = 15/49 (30%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 35 KNIDELSGCISRQWAGNGTPITSLPIEN-GVSL-LVPQAMGGYDVVLDI 81
+N+ ++G ++ + A P ++ +N G+S+ + P AM G ++VLDI
Sbjct: 289 QNVPFITGRVTGESANVNNPFQTVERQNVGISMSVFPVAMAGGNIVLDI 337


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_414  ENTEROVIROMP  138    4e-44    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 138 bits (350), Expect = 4e-44
Identities = 63/200 (31%), Positives = 98/200 (49%), Gaps = 30/200 (15%)

Query: 1 MRKVCAVILSAAICLSVSGAPAWASEHQSTLSAGYLHARTNAPGSDNLNGINVKYRYEFT 60
M+K+ + AA+ +G A ST++ GY + + + G N+KYRYE
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA---TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEED 56

Query: 61 DA-LGLITSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYSMAGV 119
++ LG+I SF+Y T S T D +N+++ + AGP+ R+N+W S Y + GV
Sbjct: 57 NSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGV 108

Query: 120 AYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVTIDLAYE 179
Y + T T+ HD S+ ++GAG+QFNP E+V +D +YE
Sbjct: 109 GYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSYE 151

Query: 180 GSGSGDWRTDAFIVGIGYRF 199
S +I G+GYRF
Sbjct: 152 QSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_415  CHANLCOLICIN  46     8e-07    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 45.8 bits (108), Expect = 8e-07
Identities = 54/319 (16%), Positives = 116/319 (36%)

Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKAREAAKSAAAAESSKSAAATSASAAKTSETNA 211
+ S S AA A + S+A T+A +AA++ AAAE+ A A + + +
Sbjct: 39 GKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIV 98

Query: 212 AASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNAASSASSAASSATAAANSA 271
+ + A+ +AT A + + AK+ E + ++ + A
Sbjct: 99 NEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRK 158

Query: 272 KAAKTSETNARSSETAAGQSASAAADSKTAAALSASAASTSAGQASASATAAGKSAESAA 331
+ + R + A + AA S+ A A+ + SA Q+ ++
Sbjct: 159 EIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSR 218

Query: 332 SSASTATTKAGEAAVQASAAARSASAAKTSKTNAKASETSAESSKTAAASSASSAASSAS 391
S+S A + + ++AK + + + S ++ A
Sbjct: 219 LSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRV 278

Query: 392 SASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 451
A ++E +Q +A++ + T+ + + + +++ + AE K+A++
Sbjct: 279 GAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQN 338

Query: 452 IASAVALEDASTTKKGIVQ 470
++DA Q
Sbjct: 339 NLLNSQIKDAVDATVSFYQ 357



Score = 36.6 bits (84), Expect = 5e-04
Identities = 66/358 (18%), Positives = 125/358 (34%), Gaps = 30/358 (8%)

Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEAAVQASAAARSASAAKTSKTNAKASETSA 372
+G KS SAA A+ + A QA AAR+ +AA ++ A
Sbjct: 32 SGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAA--------EAQAKA 83

Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSK 432
++++ A + A +AS+ + + + A+ A +A A+++
Sbjct: 84 KANRDALTQRLKDIVNEALRHNASRTPSATELA-------HANNAAMQAEDERLRLAKAE 136

Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSESLAATPKAVKAA 492
A A AE A + AE + E A T ++ ++L+ A +L+ KAV+ A
Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREKAETERQ--LKLAEAEEKRLAALSEEAKAVEIA 194

Query: 493 YELANGKYTAQDATTAQKGIVQLSNATNSTSEMLAATPKSVKAAYDLANGKYTAQDATTA 552
+ + AQ +V++ + + L+++ + A GK +A
Sbjct: 195 ---------QKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASA 245

Query: 553 QKGIVQLSSATNSTSEMLAATPKSVKAAYDLANGKYTAQDAT-TAQKGIVQLSSATNSAS 611
+ + A P + ++ + A QK + + N +
Sbjct: 246 K---YKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRIN 302

Query: 612 ETLAATPKAVKAANNNANGRVPSARKVNGKALSADITLTPKDIGTLNSTTMSFSGGAG 669
+ KA+ +NN N + + A L I T+SF
Sbjct: 303 ADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLT 360



Score = 31.2 bits (70), Expect = 0.022
Identities = 52/321 (16%), Positives = 99/321 (30%), Gaps = 23/321 (7%)

Query: 114 SRNASAVAQNTAAAKKSASDASASASEAATHATDAAASARAASTSAGQAASSAQSASSSA 173
S++ S+ A + A +A A +AA A A A+A + + +
Sbjct: 43 SKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEAL 102

Query: 174 GTASTKAREAAKSAAAAESSKSAAATSASAAKTSETNAAASQQSAATSASTATTKASEAA 233
+++ A + A A ++ A AK + A + A KA + A
Sbjct: 103 RHNASRTPSATELAHANNAAMQAEDERLRLAK---------AEEKARKEAEAAEKAFQEA 153

Query: 234 TSARDASASKEAAKSSETNAAS------SASSAASSATAAANSAKAAKTSETNARSSE-T 286
R ++A + A +A S + A A +A SE E
Sbjct: 154 EQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIK 213

Query: 287 AAGQSASAAADSKTAAALSASAASTSAGQASASATAAGKSAESAASSASTA-TTKAGEAA 345
S++ ++ A + + QASA + + + A+ + A
Sbjct: 214 TLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEA 273

Query: 346 VQASAAARSASAAKTSKTNAKASETSAESSKTAAASSASSAASSASSAS------ASKDE 399
+ A K + A + + ++ A S S+ +A A ++
Sbjct: 274 TRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENL 333

Query: 400 ATRQASAAKGSATTASTKATE 420
Q + A
Sbjct: 334 KKAQNNLLNSQIKDAVDATVS 354


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_417   LUXSPROTEIN   30     0.005    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 29.9 bits (67), Expect = 0.005
Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 41 TKEHLLPHFL-EHVGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 93
T EHL F+ H+ + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 94 AGESKI 99
++KI
Sbjct: 114 ENQNKI 119


18  APECO1_479  APECO1_545  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_479   -1      19       -4.266334  murein peptide amidase A
APECO1_480   0       20       -5.247272  hypothetical protein
APECO1_481   0       16       -4.155980  hypothetical protein
APECO1_482   0       14       -4.003277  transcriptional regulator YcjZ
APECO1_483   0       12       -3.331078  periplasmic murein peptide-binding protein
APECO1_484   -2      15       -3.799797  hypothetical protein
APECO1_485   -1      15       -3.090090  hypothetical protein
APECO1_486   0       20       -4.762667  universal stress protein UspE
APECO1_487   0       22       -4.736764  fumarate/nitrate reduction transcriptional
APECO1_488   0       25       -5.111639  O-6-alkylguanine-DNA:cysteine-protein
APECO1_489   -1      26       -4.913090  hypothetical protein
APECO1_490   -1      22       -3.695282  secretion protein
APECO1_491   -1      19       -3.275054  transporter
APECO1_492   -1      17       -2.650410  hypothetical protein
APECO1_493   -2      18       -3.564402  hypothetical protein
APECO1_494   -2      17       -3.445320  zinc transporter
APECO1_495   -1      15       -3.391212  ATP-dependent RNA helicase DbpA
APECO1_496   1       18       -4.717258  C32 tRNA thiolase
APECO1_497   2       23       -5.836040  Rac prophage; integrase
APECO1_498   3       26       -5.371202  hypothetical protein
APECO1_499   2       24       -4.378844  recombination and repair protein RecT
APECO1_500   4       24       -4.133952  exonuclease VIII
APECO1_501   3       24       -4.267345  hypothetical protein
APECO1_502   2       22       -2.863279  prophage CP-933R superinfection exclusion
APECO1_503   2       21       -1.475473  regulator
APECO1_504   0       20       -1.320938  Rac prophage hypothetical protein
APECO1_505   -1      23       -2.784201  hypothetical protein
APECO1_506   -1      20       -2.611556  replication protein
APECO1_507   -1      25       -3.974616  transcription regulatory protein
APECO1_508   -1      28       -4.513778  hypothetical protein
APECO1_509   -1      24       -4.518730  hypothetical protein
APECO1_510   -1      25       -4.157163  bacteriophage protein
APECO1_511   2       32       -6.846527  hypothetical protein
APECO1_512   1       32       -6.487494  antitermination protein
APECO1_513   1       28       -6.109494  **phage lysis protein
APECO1_514   1       27       -6.262668  phage-related lysozyme (muraminidase)
APECO1_515   1       28       -5.354642  hypothetical protein
APECO1_516   2       28       -5.823224  Rac prophage; potassium transporter subunit
APECO1_517   3       23       -3.223063  transcriptional regulator
APECO1_518   2       26       -3.271725  hypothetical protein
APECO1_519   3       26       -3.560827  hypothetical protein
APECO1_520   2       27       -3.334484  hypothetical protein
APECO1_521   1       27       -3.871543  phage terminase
APECO1_522   1       28       -3.437296  hypothetical protein
APECO1_523   0       30       -3.612869  hypothetical protein
APECO1_524   0       30       -4.303213  hypothetical protein
APECO1_525   0       32       -3.435026  hypothetical protein
APECO1_526   2       33       -4.307652  hypothetical protein
APECO1_527   4       35       -5.645508  hypothetical protein
APECO1_528   4       34       -5.938818  hypothetical protein
APECO1_529   4       30       -4.744758  hypothetical protein
APECO1_530   3       27       -3.185206  hypothetical protein
APECO1_531   2       24       -2.277844  Rac prophage hypothetical protein
APECO1_532   0       19       0.477064   Rac prophage; tail protein
APECO1_533   3       24       4.278051   minor tail protein
APECO1_5332  0       21       2.860475   minor tail protein
APECO1_534   -1      20       2.329177   tail fiber component K of prophage
APECO1_535   -1      19       0.932496   tail component of prophage
APECO1_536   -2      20       0.392346   tail component of prophage
APECO1_537   -1      29       -3.695657  outer membrane protein of prophage
APECO1_538   1       35       -5.377776  tail fiber protein
APECO1_539   5       45       -8.223649  hypothetical protein
APECO1_540   6       45       -8.400488  hypothetical protein
APECO1_541   6       42       -8.655493  hypothetical protein
APECO1_542   7       43       -9.059530  cytolethal distending toxin type IV subunit A
APECO1_543   5       40       -8.650050  cytolethal distending toxin type IV subunit B
APECO1_544   5       34       -7.874889  cytolethal distending toxin type IV subunit C
APECO1_545   3       27       -6.807494  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_490   RTXTOXIND     115    7e-31    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 115 bits (289), Expect = 7e-31
Identities = 61/414 (14%), Positives = 128/414 (30%), Gaps = 105/414 (25%)

Query: 12 VVAIGILLAGVVFFIW-WVSK--------GRFIQTTDDAYIGGNITTVASKVSGYISAIE 62
+VA I+ V+ FI + + G+ G + + + I
Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLT-------HSGRSKEIKPIENSIVKEII 111

Query: 63 VRDNQSVKKGDIILRLDDRDYRANVARLEAKIKSSKANLESIQATI-------------- 108
V++ +SV+KGD++L+L A+ + ++ + ++ Q
Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171

Query: 109 -------------AMQQSIIQSASETWQAVKHEEQKRLRD--------TERYEKLAQSAA 147
S+I+ TWQ K++++ L R + +
Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR 231

Query: 148 ISQQIIDNAR-------FDYQQVAAKERKAANDFLVEKQRLAVLSAQEENVRASIEEVQA 200
+ + +D+ V +E K L V +Q E + + I +
Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA----VNELRVYKSQLEQIESEILSAKE 287

Query: 201 ALTQ-----------------------------ALLDLEYTLVRAPIDGIVANRSAHT-G 230
+ +++RAP+ V HT G
Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347

Query: 231 SWVEGGTSLVSLVPVSE-LWVDANYKENQIAGMKPGMKAEIRADILKGEVFH---GHIES 286
V +L+ +VP + L V A + I + G A I+ + + G +++
Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407

Query: 287 LSPATGASFSLIPIENATGNFTKIVQRVPVRIAFDDAKELKQLLRPGLSVTVSV 340
++ + G ++ + K + L G++VT +
Sbjct: 408 INLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSGMAVTAEI 452


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_491   TCRTETB       101    3e-25    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 101 bits (254), Expect = 3e-25
Identities = 80/418 (19%), Positives = 170/418 (40%), Gaps = 21/418 (5%)

Query: 3 SMRKHIAFASMCIGLFIAQLDIQIVSSSLNEIGGGLSAGKDEMAWLQTSYLIAEIIVIPL 62
++R + +CI F + L+ +++ SL +I + W+ T++++ I +
Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68

Query: 63 SGWLSRVFSTRWLFTLSAGIFTLMSIACGLAWN-IQIMIFFRALQGVAGASMIPLVFTTA 121
G LS + L I S+ + + ++I R +QG A+ LV
Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVV 128

Query: 122 FIYYQGKELGLAAAVVSALASLSPTLEPTLGGWITDNLDWRWLFYINILPGIYLVLSIPF 181
Y + G A ++ ++ ++ + P +GG I + W +L I ++ ++++PF
Sbjct: 129 ARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI----TIITVPF 184

Query: 182 LVNFDKPDLSLLKVADYPSIILLAMTLGCLEYTLEEGARWGWLDDNTILLTSVLALVSFI 241
L+ K ++ + D IIL+++ + +L +++++SF+
Sbjct: 185 LMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTS-YSISFL---------IVSVLSFL 234

Query: 242 LFAARTLKISNPIMDLHAFKDKYFTLGCFFSFSGGVGIFSTVYLIPVFLGQVRGLNAEEI 301
+F K+++P +D K+ F +G + V ++P + V L+ EI
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 302 GFAVCTTG-IFQLFSVPFYFWLSKKINLQWLLMAGLGGFVFSMYL--FTPITHEWGWQEL 358
G + G + + L + ++L G+ S F T W + +
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTI 353

Query: 359 LFPQAIRGISQQFAMAPIVTLTLGGIPKERLKLASGVFNLTRNLGGASGIALCGSILN 416
+ + G+S F I T+ + ++ + N T L +GIA+ G +L+
Sbjct: 354 IIVFVLGGLS--FTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_532   RTXTOXIND     34     0.004    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 33.6 bits (77), Expect = 0.004
Identities = 23/163 (14%), Positives = 59/163 (36%), Gaps = 18/163 (11%)

Query: 109 SGVAQAQREAEKAGKLAAAQQEAQAQVFQRMLDKIDPLAAALRNLEQQQDELNAAFASGK 168
S + QA+ E + L+ + + + ++ D+ + + + + F++ +
Sbjct: 141 SSLLQARLEQTRYQILSRSIELNKLPEL-KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199

Query: 169 INGSQFENYSRKIQETRRELTGEAQAEREAAKAHDEQVAALQRLIAQLDPVGTAFNRLVE 228
Q E K + R + ++ ++ L+ + A + ++E
Sbjct: 200 NQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA---IAKHAVLE 256

Query: 229 QQKQLNEAKAKGMLSPEMYEELSGKLRAMRSELEVTQSQLSKT 271
Q+ + EA +LR +S+LE +S++
Sbjct: 257 QENKYVEA--------------VNELRVYKSQLEQIESEILSA 285


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_537   ENTEROVIROMP  149    3e-48    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 149 bits (378), Expect = 3e-48
Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 30/200 (15%)

Query: 1 MRKLCAVILSAVVWLVAAGTPASAAEHQSTLSAGYLQSHTDMPGNDDLKGVNVKYRYEFT 60
M+K+ + A V AGT +A ST++ GY QS N + G N+KYRYE
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA---TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEED 56

Query: 61 DT-LGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSMMAGPSVRVNEWFSAYAMAGV 119
++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + GV
Sbjct: 57 NSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGV 108

Query: 120 AYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYE 179
Y + T T+ HD S+ ++GAG+QFNP E+VA+D +YE
Sbjct: 109 GYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSYE 151

Query: 180 GSGSGDWRTDGFIVGVGYKF 199
S +I GVGY+F
Sbjct: 152 QSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_538   IGASERPTASE   46     6e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.8 bits (108), Expect = 6e-07
Identities = 28/192 (14%), Positives = 62/192 (32%), Gaps = 7/192 (3%)

Query: 103 PEALRRFEEMVEEAARNAEAASQSAAAAKKSETAAASSKNAAKTSETNAANSAQAAAASQ 162
+ + E+ E + E ++ + K E + A E + + + +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 163 TASANSATAAKKSETNAKNSETAAKTSETNAKSSQTAAKTSETNAKA---SETAAKNSQN 219
+A++ AK++ +N + T + T T + T+ + SE++ K
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 220 AAAESESAAAGSATSAAGSATAAANSQKAAKTSETNAKSSQTAAKTS----ETNAKASET 275
S + S + + ++ TNA S AK S+
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQH 1282

Query: 276 AAKNSQDAAAQS 287
++ + Q
Sbjct: 1283 ISQLEMNNEGQY 1294



Score = 42.4 bits (99), Expect = 9e-06
Identities = 28/152 (18%), Positives = 50/152 (32%), Gaps = 13/152 (8%)

Query: 222 AESESAAAGSATSAAGSATAAANSQKAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQ 281
+ + S + A +A A S+T +E + + S+T KN Q
Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 282 DA------------AAQSESAAAGSASAAASSATASANSQKAAKTSETNAKASETAAANS 329
DA A+S A + A S + + +Q + E A +
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116

Query: 330 AKASAASQTAAKASEDAAREYASQ-AAEPYKQ 360
K + ++ S + Q AEP ++
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_542   cdtoxina      308    e-109    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 308 bits (791), Expect = e-109
Identities = 86/250 (34%), Positives = 132/250 (52%), Gaps = 21/250 (8%)

Query: 5 LIAFLCTLIITGCSDG--------------IGDSPSPPGKNVELVGIPGQGVAVASNGTS 50
+ L +++ GCS G + P+ P + + +PG G A+ +NG
Sbjct: 10 IAGILIPILLNGCSSGKNKAYLDPKVFPPQVEGGPTVPSPDEPGLPLPGPGPALPTNGAI 69

Query: 51 PTFGSNSTDFPDVSIMSTGGAMLTVWARPVRNWLWGYTPFDSVSFGENRNWKVVDGKDAG 110
P + VS+M+ G++LT+W+R + LW Y DS SFGE RNW+++ G
Sbjct: 70 PIPEPGTAPA--VSLMNMDGSVLTMWSRGAGSSLWAYYIGDSNSFGELRNWQIMPGTRPN 127

Query: 111 TVKFVNVAQGTCMEAFK-----NGVIHNTCDDNSLSQEFQLLPSTNGNVLIRSSALQTCI 165
T++F NV GTCM +F + C +FQ + + NGN ++S + CI
Sbjct: 128 TIQFRNVDVGTCMTSFPGFKGGVQLSTAPCKFGPERFDFQPMATRNGNYQLKSLSTGLCI 187

Query: 166 RADYLSRTILSPFAFTITLEKCPGAKEETQEMLWAISPPVRAAKPNLIKPELRPFRPLPI 225
RA++L RT SP+A T+T+E+CP + E+ E +W+IS P+R A + KPE+RPF P PI
Sbjct: 188 RANFLGRTPSSPYATTLTMERCPSSGEKNFEFMWSISEPLRPALATIAKPEIRPFPPQPI 247

Query: 226 PPHDKPDGME 235
P + G E
Sbjct: 248 EPDEHSTGGE 257


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_543   cdtoxinb      431    e-157    Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 431 bits (1109), Expect = e-157
Identities = 150/264 (56%), Positives = 185/264 (70%)

Query: 2 KKLLFLLMILPGISFADLSDFKVATWNLQGSNAPTENKWNTHVRQLVTGSGAVDILMVQE 61
K ++ L++ L + ADL+DF+VATWNLQG++A TE+KWN +VRQL++G AVDIL VQE
Sbjct: 3 KYIISLIVFLSFYAQADLTDFRVATWNLQGASATTESKWNINVRQLISGENAVDILAVQE 62

Query: 62 AGSIPSSATLTEREFRTPGIPMNEYIWNTGTNSRPQQLFIYFSRTDALSNRVNLAIVSNR 121
AGS PS+A T +PGIP+ E IWN TNSRPQQ++IYFS DAL RVNLA+VSNR
Sbjct: 63 AGSPPSTAVDTGTLIPSPGIPVRELIWNLSTNSRPQQVYIYFSAVDALGGRVNLALVSNR 122

Query: 122 RADEVIVLSPPTVASRPIIGIRIGNDVFFSTHALANRGIDSGAIVNSVFEFFNRQTDPIR 181
RADEV VLSP RP++GIRIGND FF+ HA+A R D+ A+V V+ FF DP+
Sbjct: 123 RADEVFVLSPVRQGGRPLLGIRIGNDAFFTAHAIAMRNNDAPALVEEVYNFFRDSRDPVH 182

Query: 182 QAANWMIAGDFNRSPAMLFSTLEPGIRNHVNIIAPPDPTQASGGVLDYAVVGNSVSFVLP 241
QA NWMI GDFNR PA L L +R II+P TQ S LDYAV GNSV+F
Sbjct: 183 QALNWMILGDFNREPADLEMNLTVPVRRASEIISPAAATQTSQRTLDYAVAGNSVAFRPS 242

Query: 242 LLRASLLFGLLRGQIASDHFPVGF 265
L+A +++G R QI+SDHFPVG
Sbjct: 243 PLQAGIVYGARRTQISSDHFPVGV 266


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_544   cdtoxina      41     1e-06    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 40.8 bits (95), Expect = 1e-06
Identities = 24/109 (22%), Positives = 39/109 (35%), Gaps = 17/109 (15%)

Query: 87 GHVQIKNPDGNECL----AILNGQLAVAKQCTESNRNALFTFITSETGAVQIKSIGNGQC 142
+Q +N D C+ G C F + + G Q+KS+ G C
Sbjct: 127 NTIQFRNVDVGTCMTSFPGFKGGVQLSTAPCKFGPERFDFQPMATRNGNYQLKSLSTGLC 186

Query: 143 -----LGNGESV---TDFKLTKCVNDLSRPFDTVSPGLLWMLNPPLSPA 183
LG S T + +C + + F+ +W ++ PL PA
Sbjct: 187 IRANFLGRTPSSPYATTLTMERCPSSGEKNFE-----FMWSISEPLRPA 230


19  APECO1_593  APECO1_598  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_593   2       27       -3.175465  L-asparagine permease
APECO1_594   4       31       -3.804673  hypothetical protein
APECO1_595   3       31       -3.413463  hypothetical protein
APECO1_596   2       29       -3.391106  hypothetical protein
APECO1_597   1       26       -7.187847  hypothetical protein
APECO1_598   -1      19       -3.269999  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_596   ICENUCLEATIN  33     0.005    Ice nucleation protein signature.
		>ICENUCLEATIN#Ice nucleation protein signature.

Length = 1258

Score = 33.2 bits (75), Expect = 0.005
Identities = 25/133 (18%), Positives = 53/133 (39%), Gaps = 8/133 (6%)

Query: 545 GHDQSITVANDRCITVRNDQTLQVTNDRTVSVSNDDGLYVRNDRKVTVEGKQEHKTTGNH 604
G +S + +R + + + Q R+ +S D + + +R + G +T G+
Sbjct: 1091 GP-ESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAGDR 1149

Query: 605 VSLVEGKHSLVVKGDLARKVSGALGIKVDGDIVLESSSRISLKVGGSFVVIHSGGVDIVG 664
L+ G +S + GD ++ +G D +L + R L G + ++ ++G
Sbjct: 1150 SKLLAGNNSYLTAGDRSKLTAG-------NDCILMAGDRSKLTAGINSILTAGCRSKLIG 1202

Query: 665 PKISLNSGGSPGT 677
S + G
Sbjct: 1203 SNGSTLTAGENSV 1215



Score = 30.9 bits (69), Expect = 0.027
Identities = 15/69 (21%), Positives = 35/69 (50%)

Query: 567 QVTNDRTVSVSNDDGLYVRNDRKVTVEGKQEHKTTGNHVSLVEGKHSLVVKGDLARKVSG 626
Q+ + R+ ++ + + +R + + GK +T G +L+ G S+ + G+ + ++G
Sbjct: 1080 QIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAG 1139

Query: 627 ALGIKVDGD 635
A + GD
Sbjct: 1140 ADSTQTAGD 1148


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_598   PF07299       28     0.033    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 27.9 bits (62), Expect = 0.033
Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 13/51 (25%)

Query: 167 LNDMYAFIPGDNYYFIKS------SGYKFVND-------KWFTLKSINNIF 204
+ M AFI D Y FIKS +G+ ND K ++ I ++F
Sbjct: 4 VIKMEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVF 54


20  APECO1_619  APECO1_635  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_619   -2      11       -3.887703  lipoprotein YddW precursor
APECO1_620   -2      13       -5.434149  amino acid antiporter
APECO1_621   -2      15       -6.042943  glutamate decarboxylase
APECO1_622   -1      20       -7.553365  zinc protease PqqL
APECO1_623   -1      20       -6.975925  hypothetical protein
APECO1_624   0       25       -7.329745  ATP-binding component of a transport system
APECO1_625   0       26       -7.171315  hypothetical protein
APECO1_626   0       26       -6.201668  sulfatase
APECO1_627   2       29       -6.334465  transcriptional regulator YdeO
APECO1_628   3       28       -5.742227  oxidoreductase
APECO1_629   4       34       -7.163455  Fml fimbrial adhesin FmlD
APECO1_630   3       32       -6.838653  fimbrial-like adhesin protein
APECO1_631   2       31       -6.545005  fimbrial-like adhesin protein
APECO1_632   1       28       -5.848579  outer membrane usher protein FimD
APECO1_633   0       26       -4.897799  chaperone protein fimC precursor
APECO1_634   0       22       -3.158172  Fml fimbriae subunit
APECO1_635   0       22       -3.063819  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_630   FIMBRIALPAPF  31     0.001    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 31.2 bits (70), Expect = 0.001
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 7/93 (7%)

Query: 16 LFTATLQAADVTITVNGRVVAKPCTIQT-KEANVNLGDLYTRNLQQPGSASGWHNITLSL 74
L T+ ADV I + G V PCTI + V+ G++ N + ++ G +S+
Sbjct: 11 LLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNI---NPEHVDNSRGEVTKNISI 67

Query: 75 TDCPIETSAVTAIVTGSTDNTGYYKNEGTAENI 107
+ CP ++ ++ VTG+T G +N A NI
Sbjct: 68 S-CPYKSGSLWIKVTGNTMGVG--QNNVLATNI 97


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_632   PF00577       940    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 940 bits (2430), Expect = 0.0
Identities = 498/869 (57%), Positives = 653/869 (75%), Gaps = 10/869 (1%)

Query: 15 QVLILPRFARLTFALGLATAVFPVDAEYYFNPRFLSNDLAESVDLSAFTKGREAPPGTYR 74
+ + F RL A A AE YFNPRFL++D DLS F G+E PPGTYR
Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79

Query: 75 VDIYLNDEFMASRDITFIADDNNADLIPCLSTDLLVSLGIKKSALLDNKEHSADKHVPDN 134
VDIYLN+ +MA+RD+TF D+ ++PCL+ L S+G+ +++ + ++ +
Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASV-------SGMNLLAD 132

Query: 135 SACTPLQDRLADASSEFDVGQQHLSLSVPQIYVGRMARGYVSPDLWEEGINAGLLNYSFD 194
AC PL + DA+++ DVGQQ L+L++PQ ++ ARGY+ P+LW+ GINAGLLNY+F
Sbjct: 133 DACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFS 192

Query: 195 GNSINNRSNHNAGKSNYAYLNLQSGINIGSWRLRDNSTWSYNSGSSNSSDSNKWQHINTS 254
GNS N G S+YAYLNLQSG+NIG+WRLRDN+TWSYNS S+S NKWQHINT
Sbjct: 193 GNS---VQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249

Query: 255 AERDIIPLRSRLTVGDSYTDGDIFDSVNFRGLKINSTEAMLPDSQHGFAPVIHGIARGTA 314
ERDIIPLRSRLT+GD YT GDIFD +NFRG ++ S + MLPDSQ GFAPVIHGIARGTA
Sbjct: 250 LERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTA 309

Query: 315 QVSVKQNGYDVYQTTVPPGPFTIDDINSAANGGNLQVTIKEADGSIQTLYVPYSSVPVLQ 374
QV++KQNGYD+Y +TVPPGPFTI+DI +A N G+LQVTIKEADGS Q VPYSSVP+LQ
Sbjct: 310 QVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQ 369

Query: 375 RAGYTRYALAMGEYRSGNNLQSTPKFVQASLMHGLKGNWTPYGGMQIAEDYQAFNLGIGK 434
R G+TRY++ GEYRSGN Q P+F Q++L+HGL WT YGG Q+A+ Y+AFN GIGK
Sbjct: 370 REGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429

Query: 435 DLGLFGAFSFDITQANTTLADDTRHSGQSVKSVYSKSFYQTGTNIQVAGYRYSTQGFYNL 494
++G GA S D+TQAN+TL DD++H GQSV+ +Y+KS ++GTNIQ+ GYRYST G++N
Sbjct: 430 NMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489

Query: 495 SDSAYSRMSGYTVKPPTGDTSEQTLFIDYFNLFYSKRGQEQISISQQLGNYGTTFFSASR 554
+D+ YSRM+GY ++ G + F DY+NL Y+KRG+ Q++++QQLG T + S S
Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSH 549

Query: 555 QSYWNTSRSDQQISFGLNVPFGDITTSLNYSYSNNIWQNDRDHLLAFTLNVPFSHWMRTD 614
Q+YW TS D+Q GLN F DI +L+YS + N WQ RD +LA +N+PFSHW+R+D
Sbjct: 550 QTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSD 609

Query: 615 SQSAFHNSNASYSMSNDLKGGMTNLSGVYGTLLPDNNLNYSVQVGNTQGGNTSSGTSGYS 674
S+S + +++ASYSMS+DL G MTNL+GVYGTLL DNNL+YSVQ G GG+ +SG++GY+
Sbjct: 610 SKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYA 669

Query: 675 SLNYRGAYGNTNVGYSRSGDSSQIYYGMSGGIIAHADGITFGQPLGDTMVLVKAPGADNV 734
+LNYRG YGN N+GYS S D Q+YYG+SGG++AHA+G+T GQPL DT+VLVKAPGA +
Sbjct: 670 TLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDA 729

Query: 735 KIENQTGIHTDWRGYAILPFATEYRENRVALNANSLADNVELDETVVTVIPTHGAIARAT 794
K+ENQTG+ TDWRGYA+LP+ATEYRENRVAL+ N+LADNV+LD V V+PT GAI RA
Sbjct: 730 KVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAE 789

Query: 795 FNAQIGGKVLMTLKYGNKSVPFGAIVTHGENKNGSIVAENGQVYLTGLPQSGKLQVSWGK 854
F A++G K+LMTL + NK +PFGA+VT +++ IVA+NGQVYL+G+P +GK+QV WG+
Sbjct: 790 FKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGE 849

Query: 855 DKNSNCIVDYKLPVVSPGTLLNQQTAICR 883
++N++C+ +Y+LP S LL Q +A CR
Sbjct: 850 EENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_635   NUCEPIMERASE  35     3e-04    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 34.8 bits (80), Expect = 3e-04
Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 10/73 (13%)

Query: 1 MRILVAGATGSIGIHVVNTAIAMGHQPVTL---------VRNRRKIKLLPRGTDIFY-GD 50
M+ LV GA G IG HV + GHQ V + + +++LL + F+ D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 51 VSIPETLTDLPKD 63
++ E +TDL
Sbjct: 61 LADREGMTDLFAS 73


21  APECO1_646  APECO1_663  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_646   0       17       -5.349571  sugar efflux transporter
APECO1_647   1       20       -6.574384  multiple drug resistance protein MarC
APECO1_648   2       24       -7.626316  DNA-binding transcriptional repressor MarR
APECO1_649   2       26       -8.152971  DNA-binding transcriptional activator MarA
APECO1_650   2       27       -8.590192  6-phospho-beta-glucosidase
APECO1_651   1       26       -7.780355  outer membrane protein YieC
APECO1_652   2       21       -5.878636  PTS system cellobiose-specific transporter
APECO1_653   1       22       -5.977211  hypothetical protein
APECO1_654   -1      18       -3.908901  PTS system cellobiose-specific transporter
APECO1_655   -1      16       -3.222851  transcriptional regulator
APECO1_656   -1      15       -2.220403  O-acetylserine/cysteine export protein
APECO1_657   -1      16       -2.251134  MFS-type transporter YdeE
APECO1_658   -1      18       -2.878899  hypothetical protein
APECO1_659   -1      16       -1.950870  dipeptidyl carboxypeptidase II
APECO1_6592  -1      18       -2.034054  3-hydroxy acid dehydrogenase
APECO1_660   -1      18       -2.194186  hypothetical protein
APECO1_662   -2      15       -2.591073  oxidoreductase
APECO1_663   -2      15       -3.740545  dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_646   TCRTETB       55     3e-10    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 54.5 bits (131), Expect = 3e-10
Identities = 41/192 (21%), Positives = 84/192 (43%), Gaps = 8/192 (4%)

Query: 36 LSDIAHSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95
L DIA+ F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154
F+ S F++L+++R A F ++ + R P R +A LI + A+ +G
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 155 PLGRIVGQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214
+G ++ Y W I + +IT+ L+KLL + LMS+
Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209

Query: 215 YLLTVVVVTAHY 226
++ ++ T Y
Sbjct: 210 GIVFFMLFTTSY 221


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_657   TCRTETA       44     9e-07    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 9e-07
Identities = 42/239 (17%), Positives = 83/239 (34%), Gaps = 18/239 (7%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLI---GYAMTIALTIGVIFSLGF 63
R +L++ L +G G +P + L R S D+ G + + + +
Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63

Query: 64 GILADKFDKKRYMLLAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFAD 123
G L+D+F ++ +L+++ A + + + ++ + + + A A+ AD
Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122

Query: 124 NLSSNSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSINLPFWLAAICSAFPMLFIQIWVK 183
+ + + F G GP LG L+ S + PF+ AA + L +
Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182

Query: 184 RSEK---------IIATETGSVWSPKVLLQDKALLWFTCSGFLASFVSGAFASCISQYV 233
S K + W + A L F+ V A+ +
Sbjct: 183 ESHKGERRPLRREALNPLASFRW--ARGMTVVAALMAV--FFIMQLVGQVPAALWVIFG 237



Score = 34.0 bits (78), Expect = 0.001
Identities = 22/155 (14%), Positives = 60/155 (38%), Gaps = 2/155 (1%)

Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLIGYAMTIALTIGVIF-SLGFGI 65
+AL+A ++ + I+ ++ IG ++ + + ++ G
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269

Query: 66 LADKFDKKRYMLLAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFADNL 125
+A + ++R ++L + A +G+I + + L+ + L+A + +
Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQV 328

Query: 126 SSNSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSI 160
+ ++ + ++ +GP L T + SI
Sbjct: 329 DEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_6592  DHBDHDRGNASE  100    1e-27    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 100 bits (251), Expect = 1e-27
Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%)

Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58
I +TGA G GE + R QG + A E+L+++ L A+ DVR+
Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69

Query: 59 AAIEEMLASLPAEWSNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118
AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V
Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178
M++R G I+ +GS P Y ++KA F+ L +L +R + PG
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188

Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227
T+ + ++G + +T++ + L P D+++AV + VS H+
Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247

Query: 228 INTL 231
++ L
Sbjct: 248 MHNL 251


22  APECO1_800  APECO1_822  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_800   -1      15       -3.253833  cell division modulator
APECO1_801   -1      13       -3.234532  hydroperoxidase II
APECO1_802   1       18       -4.793762  hypothetical protein
APECO1_803   1       18       -5.194753  6-phospho-beta-glucosidase
APECO1_804   0       17       -4.656317  DNA-binding transcriptional regulator ChbR
APECO1_805   1       18       -2.750684  PTS system N,N'-diacetylchitobiose-specific
APECO1_806   1       16       -2.494757  PTS system N,N'-diacetylchitobiose-specific
APECO1_807   0       17       -2.119363  PTS system N,N'-diacetylchitobiose-specific
APECO1_808   -1      15       -0.743832  DNA-binding transcriptional activator OsmE
APECO1_809   0       12       0.467038   NAD synthetase
APECO1_810   1       13       2.163407   nucleotide excision repair endonuclease
APECO1_811   0       13       2.948173   hypothetical protein
APECO1_812   0       12       3.460807   hypothetical protein
APECO1_813   -1      13       3.591048   succinylglutamate desuccinylase
APECO1_814   -1      13       3.028482   succinylarginine dihydrolase
APECO1_815   -1      12       2.141881   succinylglutamic semialdehyde dehydrogenase
APECO1_816   -1      14       0.812306   arginine succinyltransferase
APECO1_817   -1      14       0.726596   bifunctional succinylornithine
APECO1_818   1       16       0.586208   exonuclease III
APECO1_819   2       16       1.541505   hypothetical protein
APECO1_820   2       14       2.047733   hypothetical protein
APECO1_821   1       14       2.657980   hypothetical protein
APECO1_822   2       15       3.061296   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_820   TYPE3OMGPROT  29     0.031    Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 28.7 bits (64), Expect = 0.031
Identities = 16/100 (16%), Positives = 34/100 (34%), Gaps = 19/100 (19%)

Query: 183 DISVNWQGAAKAYSFDEVIVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNA 242
++ V+W+ + + +V++ + G + ++ G G LV
Sbjct: 298 ELGVDWRVGIRTGNNHQVVIKTTGDQSNIASNGA--------LGSLV--------DARGL 341

Query: 243 TYTYGAV---EKRGEVKFKGNASVLPADNTPATVTFKITE 279
Y V E G + ++L +N A + T
Sbjct: 342 DYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETY 381


23  APECO1_836  APECO1_853  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_836   -1      19       -4.199270  asparaginase
APECO1_837   -1      21       -5.112833  nicotinamidase/pyrazinamidase
APECO1_838   0       22       -5.810896  metabolite transport protein
APECO1_839   -1      22       -5.048359  DEOR-type transcriptional regulator
APECO1_840   0       19       -3.933087  oxidoreductase
APECO1_841   0       20       -4.039303  sugar kinase
APECO1_842   -1      18       -3.430661  hypothetical protein
APECO1_843   -1      16       -2.700709  zinc-type alcohol dehydrogenase-like protein
APECO1_844   -2      16       -2.097578  metabolite transport protein
APECO1_845   0       23       -1.575667  oxidoreductase
APECO1_846   2       19       -1.599624  methionine sulfoxide reductase B
APECO1_847   1       17       -1.602755  glyceraldehyde 3-phosphate dehydrogenase A
APECO1_848   -1      10       -3.901379  hypothetical protein
APECO1_849   -1      12       -4.588823  hypothetical protein
APECO1_850   0       12       -4.637886  MltA-interacting protein
APECO1_851   -1      14       -4.771015  hypothetical protein
APECO1_852   -2      18       -4.528766  hypothetical protein
APECO1_853   -1      20       -4.731669  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_837   ISCHRISMTASE  37     3e-05    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 36.9 bits (85), Expect = 3e-05
Identities = 35/192 (18%), Positives = 55/192 (28%), Gaps = 58/192 (30%)

Query: 8 PPRALLLV-DLQNDFCAGGALAVPEGDSTVDVANRLIDWCQSRGEAVI-----ASQD--- 58
P RA+LL+ D+QN F +L + C G V+ SQ+
Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87

Query: 59 -------WHPANHGSFASQHGVEPYTPGQLDGLPQTFWPDHCVQNSEGAQLHPLLKQKAI 111
W P + + + P D + T W
Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDDDLV-LTKW---------------------- 124

Query: 112 AAVFHKGENPLVDSYSAFFDNGRRQKTALDDWLRAHVINELIVMGLATDYCVKFTVLDAL 171
YSAF +T L + +R ++LI+ G+ T +A
Sbjct: 125 -------------RYSAFK------RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAF 165

Query: 172 QLGYKVNVITDG 183
K + D
Sbjct: 166 MEDIKAFFVGDA 177


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_838   TCRTETB       40     2e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.9 bits (93), Expect = 2e-05
Identities = 30/129 (23%), Positives = 50/129 (38%), Gaps = 1/129 (0%)

Query: 88 ALMFGYFIGSLTGGFIGDYFGRRRAFRINLLIVGIAATGAAFVPDMY-WLIFFRFLMGTG 146
A M + IG+ G + D G +R ++I + + LI RF+ G G
Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116

Query: 147 MGALIMVGYASFTEFIPATVRGKWSARLSFVGNWSPMLSAAIGVVVIAFFSWRIMFLLGG 206
A + +IP RGK + + + AIG ++ + W + L+
Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176

Query: 207 IGILLAWFL 215
I I+ FL
Sbjct: 177 ITIITVPFL 185


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_844   TCRTETB       31     0.011    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 31.0 bits (70), Expect = 0.011
Identities = 33/142 (23%), Positives = 48/142 (33%), Gaps = 23/142 (16%)

Query: 71 MFLGALVGGIIGDKTGRRNAFILYEAIHIASMVVGAFSPNMDF-LIACRFVMGVGLGALL 129
+G V G + D+ G + + I+ V+G + LI RF+ G G A
Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121

Query: 130 VTLFAGFTEYMPGRNR----GTWSSRVSFIGNWSYPLCSLIAMGLTPLISA----EWNWR 181
+ Y+P NR G S V+ + G+ P I +W
Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVA------------MGEGVGPAIGGMIAHYIHWS 169

Query: 182 VQLLIPAILSLIATALAWRYFP 203
LLIP I I T
Sbjct: 170 YLLLIPMI--TIITVPFLMKLL 189


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_848  INVEPROTEIN  29  0.021  Salmonella/Shigella invasion protein E (InvE) signat...
		>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature.

Length = 372

Score = 29.3 bits (65), Expect = 0.021
Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 13/81 (16%)

Query: 165 ETTSALHTYFNVGDIAKVSVSGLGDRFIDKVNDAKED-----------VLTDGIQTFPDR 213
E ++AL + N D K S S L + F ++V + + V ++ F +
Sbjct: 57 EMSAALAQFRNRRDYEKKS-SNLSNSF-ERVLEDEALPKAKQILKLISVHGGALEDFLRQ 114

Query: 214 TDRVYLNPQDCSVINDEALNR 234
++ +P D ++ E L R
Sbjct: 115 ARSLFPDPSDLVLVLRELLRR 135


24  APECO1_969  APECO1_987  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_969   1       23       -3.949505  hypothetical protein
APECO1_970   2       26       -5.043954  hypothetical protein
APECO1_971   4       33       -6.764610  hypothetical protein
APECO1_972   1       30       -5.768015  outer membrane porin protein NmpC
APECO1_973   -1      18       -2.862572  transcriptional regulator
APECO1_974   1       14       0.967704   kinase inhibitor
APECO1_975   0       16       3.711728   multidrug efflux protein
APECO1_976   2       19       4.581006   flagellar hook-basal body protein FliE
APECO1_977   1       16       4.315441   flagellar M-ring protein
APECO1_978   2       18       4.362547   flagellar motor switch protein G
APECO1_979   -1      18       3.658716   flagellar assembly protein H
APECO1_980   -1      19       3.411008   flagellum-specific ATP synthase
APECO1_981   -1      16       2.276920   flagellar biosynthesis chaperone
APECO1_982   -1      16       2.241586   flagellar hook-length control protein
APECO1_983   -2      21       1.762891   flagellar basal body-associated protein FliL
APECO1_984   0       17       0.456721   flagellar motor switch protein FliM
APECO1_985   1       16       -2.483996  flagellar motor switch protein FliN
APECO1_986   0       18       -3.273599  flagellar biosynthesis protein FliO
APECO1_987   0       18       -3.020900  flagellar biosynthesis protein FliP
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_970  RTXTOXIND  30  0.017  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_972  ECOLIPORIN  510  0.0  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 510 bits (1314), Expect = 0.0
Identities = 240/388 (61%), Positives = 282/388 (72%), Gaps = 33/388 (8%)

Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYVR 60
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY+R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNDQVIYGN 223
D+ NGDGFG STTY+ GF GA Y SDRTN+QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277
A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLVEYIDVGATYYFNKNMSTFVDYKIN 333
AQYQFDFGLRP+V++L SKGKDL D+DLV+Y DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 334 LIDKSD-FTKASGVATDDIVAVGLVYQF 360
L+D D F K +G++TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_976  FLGHOOKFLIE  117  8e-38  Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 117 bits (293), Expect = 8e-38
Identities = 102/103 (99%), Positives = 102/103 (99%)

Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTVARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQT ARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_977  FLGMRINGFLIF  459  e-162  Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 459 bits (1183), Expect = e-162
Identities = 287/324 (88%), Positives = 304/324 (93%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPIS 326
+GYPGGVPGALSNQPAP N API+
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIA 328


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_978  FLGMOTORFLIG  341  e-119  Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 341 bits (876), Expect = e-119
Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%)

Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60
+S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120
+ + DY R +L K+LG ++A ++ + L + + E + +P + +
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130

Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180
I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239
L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299
V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328
VE Q+ I+ ++R+L E GE+VI G +
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_979  FLGFLIH  369  e-133  Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 369 bits (948), Expect = e-133
Identities = 224/228 (98%), Positives = 226/228 (99%)

Query: 1 MSDNLPWKTWMPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60
MSDNLPWKTW PDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120
AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL
Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120

Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180
MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT
Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180

Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPRVV 228
LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAP VV
Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_981  FLGFLIJ  202  4e-70  Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 202 bits (514), Expect = 4e-70
Identities = 145/147 (98%), Positives = 147/147 (100%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQKRQST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQ+RQST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_982  FLGHOOKFLIK  468  e-168  Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 468 bits (1206), Expect = e-168
Identities = 366/375 (97%), Positives = 370/375 (98%)

Query: 1 MIRLAPLITANVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60
MIRLAPLITA+VDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK
Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60

Query: 61 GEPLVSDIVSDAQQADLLIPVDETLPVINDEQSTSTPLTTAQTMTLAAVADKNTTKDEKA 120
GEPL+SDIVSDAQQA+LLIPVDET PVINDEQSTSTPLTTAQTM LAAVADKNTTKDEKA
Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120

Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180
DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP
Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180

Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTADASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240
GTPAQPLTPLVAEAQSKAEVISTPSPVTA ASPLITPHQTQPLPTVAAPVLSAPLGSHEW
Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240

Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMISPHQHVRAALEAA 300
QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQM+SPHQHVRAALEAA
Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300

Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360
LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS
Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360

Query: 361 LQGRVTGNSGVDIFA 375
LQGRVTGNSGVDIFA
Sbjct: 361 LQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_984  FLGMOTORFLIM  385  e-136  Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 385 bits (989), Expect = e-136
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 20 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 77
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 78 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 137
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 138 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 197
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 198 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 255
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 256 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 312
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 313 GVPVLTSQYGTLNGQYALRIEHLI 336
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_985  FLGMOTORFLIN  210  5e-74  Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 210 bits (537), Expect = 5e-74
Identities = 126/137 (91%), Positives = 134/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSGKSAADAVFQQFGGGDVSGALQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T+ KSAADAVFQQ GGGDVSGA+QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMRRLSR 137
RITDIITPSERMRRLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_987  FLGBIOSNFLIP  335  e-119  Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 335 bits (860), Expect = e-119
Identities = 242/245 (98%), Positives = 244/245 (99%)

Query: 1 MRRLLSVAPVLLWLVTPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60
MRRLLSVAPVLLWL+TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFNEEK 120
MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPF+EEK
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 ISMQEALEKGEQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180
ISMQEALEKG QPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240
TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 241 QSFYS 245
QSFYS
Sbjct: 241 QSFYS 245


25  APECO1_998  APECO1_1101  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_998   -2      14       -4.502587  DNA cytosine methylase
APECO1_999   -1      22       -6.411510  hypothetical protein
APECO1_1000  0       27       -8.027328  Outer membrane protein N precursor
APECO1_1001  -1      24       -6.868638  Outer membrane protein N precursor
APECO1_1002  0       26       -6.031702  chaperone protein HchA
APECO1_1003  1       30       -7.378464  2-component sensor protein
APECO1_1004  2       28       -6.289280  transcriptional regulatory protein YedW
APECO1_1005  0       27       -7.389193  hypothetical protein
APECO1_1006  1       26       -6.355602  sulfite oxidase subunit YedY
APECO1_1007  3       35       -9.173800  sulfite oxidase subunit YedZ
APECO1_1008  2       36       -7.421990  YodA protein
APECO1_1009  1       24       -1.480559  hypothetical protein
APECO1_1010  2       23       -0.095415  hypothetical protein
APECO1_1011  3       24       1.969624   hypothetical protein
APECO1_1012  3       24       2.108143   hypothetical protein
APECO1_6001  3       24       2.334458   hypothetical protein
APECO1_6002  4       23       3.039168   tail component of prophage
APECO1_6003  4       27       2.304515   tail assembly protein I
APECO1_6004  3       27       1.425277   tail assembly protein
APECO1_6005  3       28       0.128120   hypothetical protein
APECO1_6006  4       30       -0.106350  minor tail protein
APECO1_6007  4       31       0.082874   phage-related minor tail protein
APECO1_1015  4       32       0.283211   phage-related minor tail protein
APECO1_1016  3       32       -0.214911  tail assembly chaperone
APECO1_1017  4       32       -0.081937  phage major tail subunit
APECO1_1018  5       29       0.394873   hypothetical protein
APECO1_1019  3       26       -0.202266  hypothetical protein
APECO1_1020  3       22       -0.390650  head-tail adaptor
APECO1_1021  3       21       -0.508006  hypothetical protein
APECO1_1022  3       20       -0.179084  phage capsid protein
APECO1_1023  2       18       0.104188   phage capsid protease
APECO1_1024  2       19       0.210830   phage portal protein
APECO1_1025  1       19       0.263577   phage terminase
APECO1_1026  0       23       -1.780008  phage terminase
APECO1_1027  0       25       -3.102089  HnhC
APECO1_1028  -1      25       -3.270782  endopeptidase
APECO1_1029  0       22       -2.606864  phage lysozyme
APECO1_1030  0       27       -3.314535  hypothetical protein
APECO1_1031  0       28       -3.631310  ***hypothetical protein
APECO1_1032  -1      28       -2.194433  Q antiterminator encoded by prophage CP-933P
APECO1_1033  -2      28       -1.690506  Holliday junction resolvase
APECO1_1034  0       26       -0.786900  hypothetical protein
APECO1_1035  2       26       -1.544991  hypothetical protein
APECO1_1036  1       26       -2.237134  hypothetical protein
APECO1_1037  2       28       -2.803340  hypothetical protein
APECO1_1038  3       28       -3.134010  hypothetical protein
APECO1_1040  2       28       -2.503896  hypothetical protein
APECO1_1041  2       31       -6.098695  hypothetical protein
APECO1_1042  2       37       -7.632933  regulatory protein
APECO1_1043  2       24       -4.164802  transcriptional regulator
APECO1_1044  1       25       -4.662864  hypothetical protein
APECO1_1045  1       25       -4.824123  hypothetical protein
APECO1_1046  0       22       -4.061635  hypothetical protein
APECO1_1047  0       17       -2.331783  hypothetical protein
APECO1_1048  -1      14       -0.348613  hypothetical protein
APECO1_1049  -1      15       2.103002   phage integrase
APECO1_1050  1       18       4.299806   *hypothetical protein
APECO1_1051  1       21       5.417535   *phage integrase
APECO1_1052  0       21       7.049531   salicylate synthase Irp9
APECO1_1054  0       23       8.125934   hypothetical protein
APECO1_1055  0       23       8.159688   ABC transporter
APECO1_1056  0       23       8.251840   inner membrane ABC-transporter
APECO1_1057  0       24       8.412615   AraC family transcriptional regulator
APECO1_1058  -1      24       8.226984   yersiniabactin biosynthetic protein
APECO1_1059  0       22       7.293296   yersiniabactin biosynthetic protein
APECO1_1060  -1      17       2.927160   yersiniabactin biosynthetic protein
APECO1_1061  -2      17       0.132377   yersiniabactin biosynthetic protein YbtT
APECO1_1062  -2      18       -0.697691  yersiniabactin siderophore biosynthetic protein
APECO1_1063  -1      18       -2.392699  pesticin/yersiniabactin receptor protein
APECO1_1064  -1      26       -5.349739  hypothetical protein
APECO1_1065  -1      26       -5.391161  autotransporter
APECO1_1066  -1      28       -5.432394  hypothetical protein
APECO1_1067  -2      26       -3.914661  shikimate transporter
APECO1_1068  -1      28       -4.368584  AMP nucleosidase
APECO1_1069  0       30       -4.520569  hypothetical protein
APECO1_1070  0       29       -2.542968  hypothetical protein
APECO1_6020  1       26       -1.576156  *hypothetical protein
APECO1_6021  1       27       -1.308251  *transcriptional regulator Cbl
APECO1_6022  2       28       -0.980080  nitrogen assimilation transcriptional regulator
APECO1_6023  2       28       -2.938518  *hypothetical protein
APECO1_1072  2       29       -4.574836  nicotinate-nucleotide--dimethylbenzimidazole
APECO1_1073  1       28       -5.870687  cobalamin synthase
APECO1_1074  2       30       -6.587668  adenosylcobinamide
APECO1_1075  3       31       -5.966944  hypothetical protein
APECO1_1076  3       31       -5.852900  carbohydrate kinase
APECO1_1077  3       27       -3.983998  hypothetical protein
APECO1_1078  3       26       -2.157181  hypothetical protein
APECO1_1079  4       24       -0.422406  phosphotriesterase-related protein
APECO1_1080  8       29       0.561423   transposase
APECO1_1081  6       28       1.359607   hypothetical protein
APECO1_1082  7       24       1.225758   hypothetical protein
APECO1_1083  6       24       -1.376133  hypothetical protein
APECO1_1084  5       24       -2.375640  hypothetical protein
APECO1_1085  6       30       -3.591556  hypothetical protein
APECO1_1086  6       27       -3.097911  transcriptional regulator
APECO1_1087  7       28       -3.279641  hypothetical protein
APECO1_1088  6       28       -1.416095  transposase subunit
APECO1_1089  6       23       0.458444   transposase subunit
APECO1_1090  6       24       1.325111   transposase subunit
APECO1_1091  6       25       1.634861   hypothetical protein
APECO1_1092  5       22       0.667295   hypothetical protein
APECO1_1093  4       24       0.309153   hypothetical protein
APECO1_1094  4       23       0.050349   hypothetical protein
APECO1_1095  4       25       0.903428   hypothetical protein
APECO1_1097  4       27       0.995941   esterase
APECO1_1098  5       27       0.458444   hypothetical protein
APECO1_1099  5       29       2.087591   hypothetical protein
APECO1_1100  5       28       2.819555   hypothetical protein
APECO1_1101  2       16       1.091050   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_998  PF05272  29  0.044  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.044
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_999  CARBMTKINASE  33  8e-04  Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 32.9 bits (75), Expect = 8e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKDHPQRQRSSILAAEETRRLLREEFVQFPA-- 94
+KLA + + D+ +ILT + +L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273

Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1001  ECOLIPORIN  444  e-158  E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 444 bits (1144), Expect = e-158
Identities = 205/395 (51%), Positives = 256/395 (64%), Gaps = 36/395 (9%)

Query: 11 MKRKVLAMLVPALLVAGAANAAEIYNKDGNKVDFYGKMVGERIWSNTDDNNSENEDTSYA 70
MKRKVLA+++PALL AGAA+AAEIYNKDGNK+D YGK+ G +S D++S++ D +Y
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFS---DDSSKDGDQTYM 57

Query: 71 RFGVKGETQITSELTGFGQFEYNLDASKPEGE-NQEKTRLTFAGLKYNELGSFDYGRNYG 129
R G KGETQI +LTG+GQ+EYN+ A+ EGE TRL FAGLK+ + GSFDYGRNYG
Sbjct: 58 RVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 130 VAYDAAAYTDMLVEWGGDSWASADNFMNGRTNGVATYRNYDFFGLVDGLDFAIQYQGKNS 189
V YD +TDML E+GGDS+ ADN+M GR NGVATYRN DFFGLVDGL+FA+QYQGKN
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 190 NRS----------------TKKQNGDGYALSVDYNI-NGFGIVGAYSKSDRTNDQVA--- 229
++S + NGDG+ +S Y+I GF AY+ SDRTN+QV
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 230 -DGNGSNAELWSLAAKYDANNVYAVVMYGETRNMTPGSIDTGVADREGNTIMRDQLINET 288
G A+ W+ KYDANN+Y MY ETRNMTP + + + N+T
Sbjct: 238 TIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYG--------KTDKGYDGGVANKT 289

Query: 289 QNFEAVVQYQFDFGLRPSLGYVYSKGKDIKGVPGHRYVDADRVNYIEVGTWYYFNKNMNV 348
QNFE QYQFDFGLRP++ ++ SKGKD+ D D V Y +VG YYFNKN +
Sbjct: 290 QNFEVTAQYQFDFGLRPAVSFLMSKGKDL-TYNNVNGDDKDLVKYADVGATYYFNKNFST 348

Query: 349 YTAYKFNMLDKDDA--AITGAAADDQFAVGIVYQF 381
Y YK N+LD DD G + DD A+G+VYQF
Sbjct: 349 YVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1002  SUBTILISIN  28  0.038  Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 28.3 bits (63), Expect = 0.038
Identities = 7/29 (24%), Positives = 14/29 (48%)

Query: 160 GLPESEDVAAALQWAIENDRFVISLCHGP 188
G + + + + +AIE +IS+ G
Sbjct: 122 GSGQYDWIIQGIYYAIEQKVDIISMSLGG 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1004  HTHFIS  84  1e-20  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 1e-20
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_6001  IGASERPTASE  41  2e-05  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 40.8 bits (95), Expect = 2e-05
Identities = 27/132 (20%), Positives = 56/132 (42%), Gaps = 15/132 (11%)

Query: 123 SQSAAAAKKSETAAASSRNA--AKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARA 180
S + A+ E A ++T+ET A NS + + + + Q+A + N
Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ---NREV 1068

Query: 181 SEEASADSEEASRRN--AESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAA 238
++EA ++ + ++ N A+S +E +E TK ++ A EK
Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSE--------TKETQTTETKETATVEKEEKAKVETEKTQ 1120

Query: 239 DRAEVAAEVTAE 250
+ +V ++V+ +
Sbjct: 1121 EVPKVTSQVSPK 1132


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_6003  PF06291  28  0.014  Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 27.7 bits (61), Expect = 0.014
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 5/40 (12%)

Query: 135 MTGILFSLGASMVLGGVAQML-----APKARTPRTQTTDN 169
M +LFS +M++ G AQ P A TP+ T +
Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1022  cloacin  32  0.003  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 32.4 bits (73), Expect = 0.003
Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 6/88 (6%)

Query: 36 EWNRAKAELDALDEQIAREEELRRQDQAYVDESGPEERQNNEAENGKKAVEEKRAAAFNR 95
+ RA+AEL+ +E +AR +E QA + + +A N A FNR
Sbjct: 322 NYERARAELNQANEDVARNQE----RQAKAVQVYNSRKSELDAANKTLADAIAEIKQFNR 377

Query: 96 FLRAGFAELNAEERNLMRELRAQSVTTD 123
F A + + M L+AQ TD
Sbjct: 378 FAHDPMAGGHRMWQ--MAGLKAQRAQTD 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1058  ISCHRISMTASE  51  2e-08  Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 2e-08
Identities = 22/70 (31%), Positives = 44/70 (62%)

Query: 28 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 87
+ +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+
Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292

Query: 88 AWNQLMLSRS 97
W +L+ +RS
Sbjct: 293 EWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1059  DHBDHDRGNASE  46  1e-06  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 1e-06
Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614
+TGA G+G L +GA + A + L V R DV D
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673
+ + + + G I ++ AGVL + L D + A F+V + +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700
+ G + + S+ A + A S A
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1064  INTIMIN  75  2e-17  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 75.1 bits (184), Expect = 2e-17
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 181 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFS 240
QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F
Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1065  INTIMIN  65  4e-13  Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 65.5 bits (159), Expect = 4e-13
Identities = 63/344 (18%), Positives = 123/344 (35%), Gaps = 24/344 (6%)

Query: 301 SGGKVRTNSSGQA--------PVVLTSNKVGTYTVTASFHNG-VTIQTQTTVKVTGNPS- 350
GG+++ + S A V + V T A NG + T+ V N
Sbjct: 495 QGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQV 554

Query: 351 --TAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGLTVYFALKSGSTTLTSLTAV 408
V F AD ++ A ++ T ATV+ +G + V F + SG+ L++ +A
Sbjct: 555 VDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSAN 613

Query: 409 TDQNGIATTSVKGAITGSVTVSTVTSAGGMQTVDISLVAVPADASQSILKNNQSSLKGDF 468
T+ +G AT ++K V + +A ++ + V SI +
Sbjct: 614 TNGSGKATVTLKSD-KPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672

Query: 469 TDSAELHLVLHDISGNPIKVSEGMEFVQSGTNVPYMKISAIDYSQNINGDYKATVTGDGE 528
+ + + G+ ++ + F + + S + NG K T+T
Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKL-----SNSTEKTDTNGYAKVTLTSTTP 727

Query: 529 GIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVSVNGANLPTASFPSQGFTGAYYQLN 588
G + + ++ V + +EF G + + G + P+ L
Sbjct: 728 GKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIEIVGTGV-KGKLPTVWLQYGQVNL- 784

Query: 589 NDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDGDSNTVIIT 632
+ G + ++ A ++G+VT K G + +I+
Sbjct: 785 --KASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVIS 826



Score = 52.0 bits (124), Expect = 7e-09
Identities = 51/233 (21%), Positives = 89/233 (38%), Gaps = 16/233 (6%)

Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70
A T+ GKA VTLK K G V+A S V F T + + + +
Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671

Query: 71 ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130
AN V PV +V F T LS+++ +TD +A++ +TST G
Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 131 TVSASLADKPTEVISRLLN----AKVDVNSATI----TSQEIPEGQVMVAQDIAVKAHVN 182
VSA ++D +V + + +D + I ++P + Q + N
Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790

Query: 183 DQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVKA 235
++ + A S Q+ + + +T + V + + +YT+
Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTIS-----VISSDNQTATYTIAT 838



Score = 51.6 bits (123), Expect = 7e-09
Identities = 45/170 (26%), Positives = 64/170 (37%), Gaps = 7/170 (4%)

Query: 271 TLTATLTSANGTPVEGQVINFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330
T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A
Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637

Query: 331 FHNGV-TIQTQTTVKVTGNPSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGL 389
+ + V + A + AD +T A D T V G +
Sbjct: 638 TAEMTSALNANAVIFVDQ--TKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQ 694

Query: 390 TVYFALKSGSTTLTSLTAVTDQNGIATTSVKGAITGSVTVSTVTSAGGMQ 439
V F G + + T TD NG A ++ G VS S +
Sbjct: 695 EVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742



Score = 40.4 bits (94), Expect = 2e-05
Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%)

Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61
TD +G AKVTL T G V+A + + V F N
Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760

Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120
+ + + + + G N G + S + SV+
Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811

Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180
VT E G T+S +D T + + + ++ + V ++
Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865

Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213
N + + + AA + S T+ +
Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1067  TCRTETB  33  0.002  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.3 bits (76), Expect = 0.002
Identities = 39/259 (15%), Positives = 96/259 (37%), Gaps = 18/259 (6%)

Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138
+G ++G D+LG KR+L+ + + + + + SF ++ I+ ++ A
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL----LIMARFIQGAGAAA 119

Query: 139 VGGEWGGAALLSVESAPKNKK-AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWG 197
+ + K S V +G GVG + + I
Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------------H 167

Query: 198 WRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHNQAAAKKRIPVIEALLRHPGAFLKIIA 257
W L ++ ++ ++ +++ + + + ++ +L + +
Sbjct: 168 WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227

Query: 258 LRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRR 317
+ + L +++ + + GL + + IG+L GG+ T+ F + +
Sbjct: 228 VSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDV 286

Query: 318 VYITGALIGTLSAFPFFMA 336
++ A IG++ FP M+
Sbjct: 287 HQLSTAEIGSVIIFPGTMS 305


26  APECO1_1115  APECO1_1138  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_1115   0       22       3.362666    antitoxin YefM
APECO1_1116   -1      23       3.942393    ATP phosphoribosyltransferase
APECO1_1117   0       23       3.714917    histidinol dehydrogenase
APECO1_1118   0       25       2.819555    histidinol-phosphate aminotransferase
APECO1_1119   -2      17       0.913605    imidazole glycerol-phosphate
APECO1_1120   -2      15       -1.329272   imidazole glycerol phosphate synthase subunit
APECO1_1121   -1      15       -1.825903   1-(5-phosphoribosyl)-5-[(5-
APECO1_1122   -1      17       -6.143980   imidazole glycerol phosphate synthase subunit
APECO1_1123   0       22       -7.687776   bifunctional phosphoribosyl-AMP
APECO1_1124   1       27       -8.813490   regulator of length of O-antigen component of
APECO1_1125   1       29       -8.779400   UDP-glucose 6-dehydrogenase
APECO1_1126   1       33       -8.883487   6-phosphogluconate dehydrogenase
APECO1_1127   1       35       -9.349785   O-antigen transporter
APECO1_1128   -2      22       -6.060585   dTDP-4-dehydrorhamnose 3,5-epimerase
APECO1_1129   -3      16       -3.934267   glucose-1-phosphate thymidylyltransferase
APECO1_1130   -3      15       -2.177253   dTDP-4-dehydrorhamnose reductase
APECO1_1131   -2      13       -0.837701   dTDP-glucose-4,6-dehydratase
APECO1_1132   -1      19       0.856549    UTP--glucose-1-phosphate uridylyltransferase
APECO1_1133   0       21       1.396482    colanic acid biosynthesis protein
APECO1_1134   0       24       2.957222    colanic acid biosynthesis glycosyl transferase
APECO1_1135   0       24       3.132653    pyruvyl transferase
APECO1_1136   -1      23       3.308120    colanic acid exporter
APECO1_1137   -1      23       3.552540    UDP-glucose lipid carrier transferase
APECO1_1138   -1      23       3.531858    phosphomannomutase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1130  NUCEPIMERASE  45  1e-07  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 45.2 bits (107), Expect = 1e-07
Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 25/170 (14%)

Query: 1 MNILLFGKTGQVGWELQRALAPLGN-LIALDVHSTDY--------------------CGD 39
M L+ G G +G+ + + L G+ ++ +D + Y D
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60

Query: 40 FSNPEGVAETVKRIRPDVIVNAAAHTAVDKAESEPEFAQLLNATSVESIAKAANEVG-AW 98
++ EG+ + + + + AV + P N T +I +
Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120

Query: 99 VIHYSTDYVFPGNGDTPWLEMDATA-PLNVYGETKLAGEKALQEHCAKHL 147
+++ S+ V+ N P+ D+ P+++Y TK A E L H HL
Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE--LMAHTYSHL 168


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1131  NUCEPIMERASE  181  1e-56  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 181 bits (462), Expect = 1e-56
Identities = 88/360 (24%), Positives = 149/360 (41%), Gaps = 48/360 (13%)

Query: 1 MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLT--YAGNL-ESLADVSDSERYVFEHA 57
MK LVTG AGFIG V + ++ VV +D L Y +L ++ ++ + F
Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59

Query: 58 DICDAAAMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSA 117
D+ D M +FA + V V S+ P A+ ++N+ G +LE R+
Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116

Query: 118 LDGDKKNSFRFHHISTDEVYGDLPHPDEVNNKEQLPLFTETTAYAPSSPYSASKASSDHL 177
+ S+ VYG ++P T+ + P S Y+A+K +++ +
Sbjct: 117 ------KIQHLLYASSSSVYGL---------NRKMPFSTDDSVDHPVSLYAATKKANELM 161

Query: 178 VRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVE 237
+ YGLP YGP+ P+ + LEGK++ +Y G RD+ Y++
Sbjct: 162 AHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYID 221

Query: 238 D-------------HARALYTVVTEGQA-----GETYNIGGHNEKKNIDVVLTICDLLDE 279
D HA +TV T A YNIG + + +D + + D L
Sbjct: 222 DIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG- 280

Query: 280 IVPKEKSYREQITYVADRPGHDRRYAIDAEKIGRELGWKPQETFESGIRKTVEWYLANAK 339
+ +K+ +PG + D + + +G+ P+ T + G++ V WY K
Sbjct: 281 -IEAKKNMLPL------QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333


27  APECO1_1161  APECO1_4438  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_1161   -3      13       3.034119    hypothetical protein
APECO1_1162   -3      16       3.818853    hypothetical protein
APECO1_1163   -3      16       3.819315    hypothetical protein
APECO1_1164   -2      16       3.873809    multidrug efflux system subunit MdtA
APECO1_1165   -2      18       3.841699    multidrug efflux system subunit MdtB
APECO1_1166   -2      15       2.423481    multidrug efflux system subunit MdtC
APECO1_1167   -1      14       0.327994    multidrug efflux system protein MdtE
APECO1_1168   -1      18       -4.720683   signal transduction histidine-protein kinase
APECO1_1169   0       26       -7.682430   DNA-binding transcriptional regulator BaeR
APECO1_1170   2       37       -10.681760  hypothetical protein
APECO1_1171   0       26       -7.972891   hypothetical protein
APECO1_1172   -1      21       -6.069333   hypothetical protein
APECO1_1173   -1      20       -5.361169   hypothetical protein
APECO1_1174   -1      13       -0.663611   hypothetical protein
APECO1_1175   -1      16       1.514178    hypothetical protein
APECO1_1176   -1      24       3.785423    hypothetical protein
APECO1_1177   -1      18       1.683287    phage protein D
APECO1_1178   -1      17       -1.054877   phage protein U
APECO1_1179   -2      17       -0.639717   phage related tail protein
APECO1_1180   -3      18       -2.807014   Phage protein P
APECO1_1181   -2      24       -5.449516   Phage protein Q
APECO1_1182   -3      30       -6.706511   hypothetical protein
APECO1_1183   -3      27       -5.502296   hypothetical protein
APECO1_1184   -2      25       -3.864637   Phage protein A
APECO1_1185   0       29       -6.994115   hypothetical protein
APECO1_1186   -1      26       -6.046132   Phage protein B
APECO1_4459   -1      25       -5.148569   immunity repressor
APECO1_4458   1       19       -4.029351   phage integrase
APECO1_4457   3       20       -3.650205   hypothetical protein
APECO1_4456   3       19       -3.359053   lipid kinase
APECO1_4455   3       19       -3.498199   galactitol utilization operon repressor
APECO1_4454   3       20       -2.472024   galactitol-1-phosphate dehydrogenase
APECO1_4453   2       16       -1.840366   PTS system galactitol-specific transporter
APECO1_4452   0       14       -1.957652   PTS system galactitol-specific transporter
APECO1_4451   1       13       -0.746816   PTS system galactitol-specific transporter
APECO1_4450   2       12       0.654840    tagatose 6-phosphate kinase GatZ
APECO1_4449   1       13       1.437360    tagatose-bisphosphate aldolase
APECO1_4448   2       12       0.635995    fructose-bisphosphate aldolase
APECO1_4447   1       13       1.050667    nucleoside transporter
APECO1_4446   1       14       1.918725    hydrolase
APECO1_4445   0       16       1.321035    sugar kinase
APECO1_4444   0       16       -0.194417   transcriptional regulator
APECO1_4443   1       22       -3.073711   hypothetical protein
APECO1_4442   3       24       -4.229540   phosphomethylpyrimidine kinase
APECO1_4441   3       27       -6.052466   hydroxyethylthiazole kinase
APECO1_4440   3       28       -7.478422   nickel/cobalt efflux protein RcnA
APECO1_4439   3       25       -6.794491   hypothetical protein
APECO1_4438   0       14       -4.864178   fimbrial-like adhesin protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1164  RTXTOXIND  52  3e-09  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.8 bits (124), Expect = 3e-09
Identities = 48/369 (13%), Positives = 106/369 (28%), Gaps = 87/369 (23%)

Query: 53 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGR------- 100
S + R V ++ IA G+ + + A G + +
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 101 --RGMRAG-PLA---PVQAATAVEQAVPRYLTGLGTITAANTVTVRSRVDG--QLMALHF 152
+R G L + A + L T ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 153 QEGQQVKAGDLLAEI------------DPSQFKVALAQAQGQLA-------KDKATLANA 193
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 194 RRDLSRYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 236
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 237 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 269
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 270 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 328
+T +V++ + +++ + DI + Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 329 DNQIDATTG 337
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1165  ACRIFLAVINRP  920  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 920 bits (2379), Expect = 0.0
Identities = 300/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 NTPGLAALDTIRLTSSDGGVVPLSSIAKIEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1166  ACRIFLAVINRP  913  0.0  Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 913 bits (2361), Expect = 0.0
Identities = 288/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRARLPELQSTIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
++ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRGERS---ETAQQIIDRLRKKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP ER+ +A+ +I R + +L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QEDNGAEMNLIYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVYFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 79.5 bits (196), Expect = 4e-17
Identities = 76/448 (16%), Positives = 160/448 (35%), Gaps = 26/448 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L+ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQANASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQEDNGAE-- 703
V ++ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLIYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVYFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPKQA 1022
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1167  TCRTETB  125  1e-33  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 125 bits (316), Expect = 1e-33
Identities = 97/429 (22%), Positives = 189/429 (44%), Gaps = 23/429 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLM-LMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + LM L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGFSPLAIAGLVAVGVVALVLYLLHAQNNNRALFSLKL 257
G +L++VG+ L F+ + V V++ ++++ H + L
Sbjct: 202 KGIILMSVGIVFFML---------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRNFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYTWLSMAF 441
+Y+ L + F
Sbjct: 428 LYSNLLLLF 436


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1168  BCTERIALGSPF  34  0.001  Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 28/95 (29%), Positives = 36/95 (37%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALSTLLAALATF------PLARGLLAPVKRLVDGTHKLAAGDFTTRVAPTSE 217
RQ + L+ A L AL P L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GRLAEDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1169  HTHFIS  76  6e-18  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1179  RTXTOXIND  33  0.005  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 32.9 bits (75), Expect = 0.005
Identities = 22/173 (12%), Positives = 58/173 (33%), Gaps = 8/173 (4%)

Query: 8 QVLLRAVDQASRPFKSIRTASKSLSGDIRETQKSLRELNGQASRIEGFRKTSAQLAVTGH 67
VLL+ + + ++S R Q + L+ + +
Sbjct: 122 DVLLKLTALGAE---ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178

Query: 68 ALEKARQEAEALATQFKNTERPTRAQAKV-LESAKRAAEDLQAKYNRLTDSVKRQQRELA 126
E+ +L + +T + + Q ++ L+ + + A+ NR + + ++ L
Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238

Query: 127 AVGINTRNLAHDELGLKNRISETTAQLNRQRDALARVSAQQAKLNAVKQRYQA 179
+L H + K+ + E + + L +Q ++ + +
Sbjct: 239 DF----SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_1186  SECA  28  0.024  SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 27.9 bits (62), Expect = 0.024
Identities = 10/60 (16%), Positives = 23/60 (38%)

Query: 23 GKHLAVPRWQETCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDVERERLVCAIDEL 82
L + W + ++ RER+ +++ + E M E+ ++ +D L
Sbjct: 715 DLDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSL 774


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4457  LIPOLPP20  27  0.027  LPP20 lipoprotein precursor signature.
		>LIPOLPP20#LPP20 lipoprotein precursor signature.

Length = 175

Score = 26.6 bits (58), Expect = 0.027
Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 11/88 (12%)

Query: 18 EGEMKKIAAISLISVFLMSGCAVHNDETSIGKFGLAYKSNIQ-------RKLDNQYYTEA 70
+ ++KKI +S+++ ++ GC+ H ++ I K AYK + L+ E
Sbjct: 2 KNQVKKILGMSVVAAMVIVGCS-HAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEK 60

Query: 71 EASLARGRISGAENIVKNDAVHFCVTQG 98
+ + GR AE+++ N+ V + Q
Sbjct: 61 YSGVFLGR---AEDLITNNDVDYSTNQA 85


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4454  DHBDHDRGNASE  32  0.004  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 31.6 bits (71), Expect = 0.004
Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 2/92 (2%)

Query: 156 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGAMQTFNSLE 214
A+G E K I GA IG + + GA + A+D + EKL S + ++
Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61

Query: 215 MSAPQMQGVLRELRFNQLILETAGVPQTVELA 246
A + ++ E + V +A
Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4447  TCRTETA  35  6e-04  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 34.8 bits (80), Expect = 6e-04
Identities = 53/268 (19%), Positives = 89/268 (33%), Gaps = 17/268 (6%)

Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGAVLMYFAAQQ 88
L S G A A+ ++G+++DRF ++ + ++ AGA + Y
Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89

Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147
A F +L + T A T ++A A + D+ R R G + G+ G
Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147

Query: 148 LPQMLGY-ADISPTNIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILL 206
P + G SP + P A + L + FL K + + L A
Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205

Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260
VFF + +P A + IF G + +
Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265

Query: 261 ALPFFTKRFGIKKVLLLGLVTAAIRYGF 288
R G ++ L+LG++ Y
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYIL 293



Score = 34.0 bits (78), Expect = 0.001
Identities = 32/153 (20%), Positives = 53/153 (34%), Gaps = 20/153 (13%)

Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVTAAIRYGFFIYGSADEYFTYALLFLGILLHGV 312
+ L + RFG + VLL+ L AA+ Y +L++G ++ G+
Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108

Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYQEPVN 372
+ V D R G ++ C GFG + G LGG+M F+ P
Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGG--FSPHAP-- 162

Query: 373 GLTFNWAGMWTFGAVMIAIIAVLFMIFFRESDN 405
+ A + + + ES
Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186



Score = 28.6 bits (64), Expect = 0.049
Identities = 22/114 (19%), Positives = 43/114 (37%), Gaps = 4/114 (3%)

Query: 7 LSFMMFVEWFIWGAWFVPLWLWL----SKSGFSAGEIGWSYACTAIAAILSPILVGSITD 62
++ +M V + + VP LW+ + + A IG S A I L+ ++
Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271

Query: 63 RFFSAQKVLAVLMFAGAVLMYFAAQQTTFAGFFPLLLAYSLTYMPTIALTNSIA 116
++ L + M A A T FP+++ + + AL ++
Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4439  TYPE3OMGPROT  28  0.024  Type III secretion system outer membrane G protein ...
		>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature.

Length = 607

Score = 27.9 bits (62), Expect = 0.024
Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 66 KMLLGALLLVTSAAWAAPATAGSTNTSGISKYE-LSSFIADF 106
++L G LLL++S +WA ++K E L + DF
Sbjct: 11 RVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4438  BINARYTOXINB  28  0.043  Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 28.5 bits (63), Expect = 0.043
Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 93 NITLSNNQ---TSFTSGYSVTVTPAASNAKVNVSAGGGGSVMINGVATLSSA-----SSS 144
NI LS N+ T T + T++ S ++ + S G + + + + S+S
Sbjct: 297 NIILSKNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNS 356

Query: 145 TRGSAAVQFLLCLLGGKSW 163
+ A+ L L G ++W
Sbjct: 357 NSSTVAIDHSLSLAGERTW 375


28  APECO1_4365  APECO1_4354  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_4365   0       20       3.502611    transcriptional regulator NarP
APECO1_4364   1       20       4.059438    heme lyase, CcmH subunit
APECO1_4363   -1      19       4.316793    periplasmic thioredoxin of cytochrome c-type
APECO1_4362   -1      18       3.897676    heme lyase, CcmF subunit
APECO1_4361   -1      15       3.111071    cytochrome c-type biogenesis protein CcmE
APECO1_4360   0       15       3.232542    heme exporter protein C
APECO1_4359   -1      18       3.998676    heme exporter protein B
APECO1_4358   0       20       4.197565    cytochrome c biogenesis protein CcmA
APECO1_4357   0       23       4.159698    cytochrome c-type protein NapC
APECO1_4356   -1      22       4.524438    citrate reductase cytochrome c-type subunit
APECO1_4355   -1      20       4.291540    quinol dehydrogenase membrane component
APECO1_4354   0       20       3.914132    quinol dehydrogenase periplasmic component
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4365  HTHFIS  65  2e-14  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 2e-14
Identities = 22/113 (19%), Positives = 48/113 (42%), Gaps = 2/113 (1%)

Query: 19 VMIVDDHPLMRRGVRQLLELDSGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 78
+++ DD +R + Q L +G++V + A+ D D+++ D+ M +
Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 79 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 131
D L +++ +++++ + + GA YL K D L+ I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


29  APECO1_4305  APECO1_4282  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_4305   2       14       2.025733    hypothetical protein
APECO1_4304   1       11       2.201734    4-amino-4-deoxy-L-arabinose transferase
APECO1_4303   0       14       3.613206    hypothetical protein
APECO1_4302   0       13       4.373798    polymyxin B resistance protein PmrD
APECO1_4301   0       13       4.297277    O-succinylbenzoic acid--CoA ligase
APECO1_4300   -1      12       3.919633    O-succinylbenzoate synthase
APECO1_4299   -1      11       2.589152    naphthoate synthase
APECO1_4298   -1      11       2.079807    acyl-CoA thioester hydrolase YfbB
APECO1_4297   -1      11       -0.037949   2-succinyl-5-enolpyruvyl-6-hydroxy-3-
APECO1_4296   -1      18       -2.616639   menaquinone-specific isochorismate synthase
APECO1_4295   0       21       -4.061179   hypothetical protein
APECO1_4294   -1      13       -2.045889   hypothetical protein
APECO1_4293   -1      9        -0.775766   ribonuclease Z
APECO1_4292   -1      14       0.592778    hypothetical protein
APECO1_4291   0       19       2.195659    hypothetical protein
APECO1_4290   1       26       3.478470    hypothetical protein
APECO1_4289   1       29       4.128602    NADH dehydrogenase subunit N
APECO1_4288   1       31       3.617678    NADH dehydrogenase subunit M
APECO1_4287   0       31       4.227858    NADH dehydrogenase subunit L
APECO1_4286   0       30       3.956506    NADH dehydrogenase subunit K
APECO1_4285   0       31       4.015905    NADH dehydrogenase subunit J
APECO1_4284   1       30       4.124858    NADH dehydrogenase subunit I
APECO1_4283   0       29       3.889083    NADH dehydrogenase subunit H
APECO1_4282   1       28       3.895025    NADH dehydrogenase subunit G
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4301  ACETATEKNASE  31  0.015  Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 30.5 bits (69), Expect = 0.015
Identities = 19/124 (15%), Positives = 46/124 (37%), Gaps = 20/124 (16%)

Query: 339 EMHNGKLTIVG-----RLDNLFFSGGEGIQPEEVERVIAAHPAVLQVFIVPVADKEF--- 390
E +G + G +++ + + ++++ + H +++ + + + ++
Sbjct: 19 ESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDAIKLVLDALVNSDYGVI 78

Query: 391 ---------GHRPVAVVEYDQQSVDLDEWVKDKLARFQQPVRWLTLPPELKNGGIKISRQ 441
GHR V EY SV + + V + + L P + GIK Q
Sbjct: 79 KDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDC-IELAPLHNPANI--EGIKACTQ 135

Query: 442 ALKE 445
+ +
Sbjct: 136 IMPD 139


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4294  AUTOINDCRSYN  32  5e-04  Autoinducer synthesis protein signature.
		>AUTOINDCRSYN#Autoinducer synthesis protein signature.

Length = 216

Score = 32.1 bits (73), Expect = 5e-04
Identities = 13/74 (17%), Positives = 29/74 (39%), Gaps = 12/74 (16%)

Query: 1 MIDWQDLHHSDLSVSQLYALLQLRCAVFV--------VEQNCPYQDIDGDDLEGENRHIL 52
M++ D++H+ LS ++ L LR F + D ++ ++
Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNNN----TTYLF 56

Query: 53 GWHNGTLVAYARIL 66
G + T++ R +
Sbjct: 57 GIKDNTVICSLRFI 70


30  APECO1_4222  APECO1_4164  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_4222   3       25       -4.492673   hypothetical protein
APECO1_4221   4       28       -5.431933   long-chain fatty acid outer membrane
APECO1_4220   4       32       -6.519902   lipoprotein VacJ
APECO1_4219   5       36       -7.117390   transporter
APECO1_4218   6       41       -8.115204   *phage integrase
APECO1_4217   7       44       -8.919025   capsule O-acetyl transferase
APECO1_4216   5       40       -8.349117   endo-alpha-sialidase
APECO1_4215   6       40       -7.048583   antirepressor protein
APECO1_4214   5       40       -5.754186   hypothetical protein
APECO1_4213   5       39       -4.965484   hypothetical protein
APECO1_4212   4       36       -3.964388   hypothetical protein
APECO1_42112  3       34       -3.333156   hypothetical protein
APECO1_4211   3       30       -2.366949   hypothetical protein
APECO1_4210   4       29       -1.759920   phage injection protein
APECO1_4209   5       28       -1.243822   DNA transfer protein
APECO1_4208   5       28       -1.345344   head assembly protein
APECO1_4207   4       26       -0.888958   DNA stabilization protein
APECO1_4206   4       25       -0.656809   DNA stabilization protein
APECO1_4205   5       26       -0.859812   DNA stabilization protein
APECO1_4204   5       27       -0.872601   hypothetical protein
APECO1_4203   5       26       -1.512917   phage scaffold protein
APECO1_4202   5       26       -1.540761   phage portal protein
APECO1_4201   4       28       -2.465804   phage terminase large subunit
APECO1_4200   4       33       -3.695895   hypothetical protein
APECO1_4199   4       32       -4.169856   hypothetical protein
APECO1_4198   1       34       -3.911753   hypothetical protein
APECO1_4197   2       36       -2.578455   hypothetical protein
APECO1_4196   0       33       -3.423183   endolysin
APECO1_4195   0       33       -4.610528   holin
APECO1_4194   1       32       -4.586561   antitermination protein
APECO1_4193   1       31       -3.915768   Holliday junction resolvase
APECO1_4192   2       28       -2.052239   hypothetical protein
APECO1_4191   2       27       -2.192790   DNA-binding protein Roi
APECO1_4190   3       27       -2.124889   hypothetical protein
APECO1_4189   3       30       -1.695212   hypothetical protein
APECO1_4188   4       31       -2.119186   hypothetical protein
APECO1_4187   4       32       -2.019592   replicative DNA helicase
APECO1_4186   4       33       -3.589801   phage replication protein
APECO1_4185   6       39       -6.492146   hypothetical protein
APECO1_4184   6       36       -6.281208   hypothetical protein
APECO1_4183   5       36       -6.700563   phage repressor protein
APECO1_4182   4       32       -6.741662   hypothetical protein
APECO1_4180   3       36       -7.399024   transcription antitermination protein
APECO1_4179   1       35       -6.727606   hypothetical protein
APECO1_4178032-5.318958hypothetical protein
APECO1_4177027-4.721683hypothetical protein
APECO1_4176023-3.652727hypothetical protein
APECO1_4175020-2.923216hypothetical protein
APECO1_4174023-4.178186hypothetical protein
APECO1_4173026-5.252070hypothetical protein
APECO1_4172026-5.800729DNA-binding transcriptional regulator DsdC
APECO1_4171-129-7.859711permease DsdX
APECO1_4170032-8.441356D-serine dehydratase
APECO1_4169035-9.421404multidrug resistance protein Y
APECO1_4168034-8.447340EmrKY-TolC multidrug resistance efflux pump,
APECO1_4167031-7.768086EvgA family transcriptional regulator
APECO1_4166031-7.338712EvgA family transcriptional regulator
APECO1_4165230-5.549918hypothetical protein
APECO1_4164127-4.542429transporter YfdV
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4220  VACJLIPOPROT  407  e-148  VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 407 bits (1048), Expect = e-148
Identities = 250/251 (99%), Positives = 251/251 (100%)

Query: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60
MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR
Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60

Query: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120
DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM
Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120

Query: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADSLYPVLSWLTWPM 180
ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMAD+LYPVLSWLTWPM
Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180

Query: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240
SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA
Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240

Query: 241 IQDDLKDIDSE 251
IQDDLKDIDSE
Sbjct: 241 IQDDLKDIDSE 251
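
Each hit on this page carries a `Score = ... bits (...), Expect = ...` line, and the input parameters list an RPSBlast E-value of 0.05. The following is a minimal Python sketch for reading those lines. The function name `parse_score_line`, the constant `E_VALUE_CUTOFF`, and the `<=` comparison are illustrative assumptions; the page does not state how PredictBias applies the cutoff internally. One detail worth handling is that values such as `e-148` are printed without a leading mantissa, which `float()` does not accept directly.

```python
import re

# Pattern for lines such as "Score = 30.5 bits (69), Expect = 0.015".
SCORE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)

# Assumed cutoff, taken from the "E-value (RPSBlast)" input parameter.
E_VALUE_CUTOFF = 0.05


def parse_score_line(line):
    """Return (bit_score, raw_score, e_value), or None if the line does not match."""
    m = SCORE.search(line)
    if m is None:
        return None
    bits, raw, expect = m.groups()
    # Values like "e-148" have no mantissa; prepend "1" so float() can parse them.
    if expect.startswith("e"):
        expect = "1" + expect
    return float(bits), int(raw), float(expect)


for line in (
    "Score = 407 bits (1048), Expect = e-148",
    "Score = 30.5 bits (69), Expect = 0.015",
):
    bits, raw, e_value = parse_score_line(line)
    # The comparison below is an assumption about how the cutoff is used.
    print(bits, raw, e_value, e_value <= E_VALUE_CUTOFF)
```

Both example lines are copied from hits listed on this page.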


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4185  PF05704  26  0.033  Capsular polysaccharide synthesis protein
		>PF05704#Capsular polysaccharide synthesis protein

Length = 307

Score = 25.6 bits (56), Expect = 0.033
Identities = 6/31 (19%), Positives = 19/31 (61%)

Query: 56 EGYTFIPNAFLEKLLKEDISVSQFNDVLKVF 86
+ + IP+ +++ + + + F+D+L++F
Sbjct: 111 KEWVDIPDFLIKRWQEGKMLDAWFSDILRLF 141


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4183  PF07675  28  0.037  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 27.8 bits (61), Expect = 0.037
Identities = 14/52 (26%), Positives = 24/52 (46%)

Query: 133 MTAPAGLSIPEGMIILVDPEVEPRNGKLVVAKLEGENEATFKKLVIDAGRKF 184
+T + IP G+ EP +GK+ +A G A + +AG+K+
Sbjct: 458 VTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFAFEAGKKY 509


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4169  TCRTETB  122  2e-32  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 122 bits (308), Expect = 2e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIILLTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4168  RTXTOXIND  74  1e-16  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 74.1 bits (182), Expect = 1e-16
Identities = 47/277 (16%), Positives = 94/277 (33%), Gaps = 46/277 (16%)

Query: 61 AKNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDYNRRV----PLAKQGVIS 113
K + Q + L + AE + + Y+ R+ L + I+
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 114 KEALEHTKDTLI----------SSKAALNAAIQAYKANKALVMNTPLNRQPQVIEAADAT 163
K A+ ++ + S + + I + K + T L + + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKNEILDKLRQTT 308

Query: 164 KE----------AWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPGQSLMAVVPARQ-MWV 211
+ + I++PV+ + Q V G V+ ++LM +VP + V
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 212 NANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNAFSLLPAQNATGNWIK 271
A + + + +GQ+ I + F G +G + +
Sbjct: 369 TALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK---VKNINLDAIEDQRLG 419

Query: 272 IVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 304
+V V +S++ L PL G+++TA I T
Sbjct: 420 LVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4167  HTHFIS  49  3e-09  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4166  HTHFIS  79  2e-17  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLNCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


31  APECO1_4112  APECO1_4099  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
APECO1_4112-2173.744239coproporphyrinogen III oxidase
APECO1_4111-1184.782746transcriptional regulator EutR
APECO1_4110-2215.111291ethanolamine utilization protein EutK
APECO1_4109-1215.361493ethanolamine utilization protein EutL
APECO1_4108-1205.623056ethanolamine ammonia-lyase small subunit
APECO1_41070215.751940regulatory subunit of ethanolamine
APECO1_41062205.943738reactivating factor for ethanolamine ammonia
APECO1_41051185.431829ethanolamine utilization transport protein EutH
APECO1_41044196.075719alcohol dehydrogenase in ethanolamine
APECO1_41032185.993549ethanolamine utilization protein EutJ
APECO1_41022205.307653ethanolamine utilization protein EutE
APECO1_41011184.123664carboxysome structural protein, ethanolamine
APECO1_41001183.830893detox protein in ethanolamine utilization
APECO1_40992183.242525phosphotransacetylase
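
The rows above are flattened: the locus tag, DNBias, CDNBias, %GCBias, and product named in the column header `LocusTagDNBiasCDNBias%GCBiasProduct` run together without separators. A minimal Python sketch for splitting such a row follows. The regular expression and the name `parse_bias_row` are assumptions made for illustration and are not part of PredictBias. The sketch assumes that DNBias is a single signed digit, that CDNBias has one or two digits, and that %GCBias is printed with six decimals, which holds for the rows on this page. A one-digit CDNBias followed by a positive %GCBias of 10 or more would be split incorrectly.

```python
import re

# Assumed layout, inferred from the fused column header:
# LocusTag, DNBias, CDNBias, %GCBias, optional asterisk marker(s), Product.
ROW = re.compile(
    r"^(?P<locus>APECO1_\d{4,5}?)"  # locus tag: 4 digits preferred, 5 if needed
    r"(?P<dn>-?\d)"                  # dinucleotide bias: one signed digit
    r"(?P<cdn>\d{1,2})"              # codon bias: one or two digits
    r"(?P<gc>-?\d+\.\d{6})"          # %GC bias: six decimals
    r"(?P<flag>\**)"                 # asterisk marker(s) seen before some products
    r"(?P<product>.*)$"
)


def parse_bias_row(line):
    """Split one fused table row into named fields, or return None."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    return {
        "locus": m["locus"],
        "dn_bias": int(m["dn"]),
        "cdn_bias": int(m["cdn"]),
        "gc_bias": float(m["gc"]),
        "flag": m["flag"],
        "product": m["product"],
    }


# Row copied from the table above.
print(parse_bias_row("APECO1_40992183.242525phosphotransacetylase"))
```

Fixing the %GCBias field at six decimals is what keeps products that begin with a digit (for example `50S ribosomal protein L19`) from being absorbed into the number.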
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4103  SHAPEPROTEIN  51  2e-09  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 50.5 bits (121), Expect = 2e-09
Identities = 33/116 (28%), Positives = 50/116 (43%), Gaps = 9/116 (7%)

Query: 63 VRDGIVWDFFGAVTIVRRHLD-TLEQQFGRRFSHAATSFPPGTDP---RISINVLESAGL 118
++DG++ DFF +++ + F R P G R + AG
Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135

Query: 119 EVSHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKKGKVTYSADEATGG 169
+++EP A A L + G VVDIGGGTT +A++ V YS+ GG
Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191


32  APECO1_4068  APECO1_4026  Y        Y        Y  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
APECO1_4068-110-3.253645polyphosphate kinase
APECO1_4067-115-3.182606exopolyphosphatase
APECO1_4066116-3.230146hypothetical protein
APECO1_4065121-2.294451hypothetical protein
APECO1_4064222-1.780008outer membrane lipoprotein
APECO1_4063131-4.442046hypothetical protein
APECO1_4062232-4.928277phage lysis protein
APECO1_4061330-4.503431endolysin
APECO1_4060434-5.885400holin
APECO1_4059332-4.640635hypothetical protein
APECO1_4058232-3.885348hypothetical protein
APECO1_4057331-2.361071hypothetical protein
APECO1_4056331-1.808430anti-repressor protein
APECO1_4055333-2.075339hypothetical protein
APECO1_4054227-1.629758hypothetical protein
APECO1_4053327-1.386794hypothetical protein
APECO1_4052325-1.588798hypothetical protein
APECO1_4051322-1.099458hypothetical protein
APECO1_4050321-1.367157hypothetical protein
APECO1_4049320-1.431988hypothetical protein
APECO1_4048526-0.980484hypothetical protein
APECO1_4047526-0.996870hypothetical protein
APECO1_4046425-0.581118hypothetical protein
APECO1_4045423-1.012553hypothetical protein
APECO1_4044423-0.042324hypothetical protein
APECO1_4043425-0.146865hypothetical protein
APECO1_40422250.317467hypothetical protein
APECO1_4041226-0.071723tail protein
APECO1_4039127-0.080693*hypothetical protein
APECO1_40381280.094580phage terminase, large subunit
APECO1_4037323-1.643428phage terminase small subunit
APECO1_4036222-1.132278hypothetical protein
APECO1_4035227-1.398013hypothetical protein
APECO1_4034328-1.640597hypothetical protein
APECO1_4033228-1.833595hypothetical protein
APECO1_4032328-1.401075hypothetical protein
APECO1_4031434-2.088957hypothetical protein
APECO1_4030334-2.647591hypothetical protein
APECO1_4029434-3.091352hypothetical protein
APECO1_4028436-2.959409repressor protein
APECO1_4027335-3.137396hypothetical protein
APECO1_4026233-2.585688hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4063  IGASERPTASE  28  0.024  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 28.1 bits (62), Expect = 0.024
Identities = 19/124 (15%), Positives = 40/124 (32%), Gaps = 6/124 (4%)

Query: 34 QQGKNEEQRQHDEWVAERNREIQQEKQRRANAQAAANKRAATAAANKKARQDKLDAEATA 93
Q + ++ + + + E+ Q Q K AT +KA+ + +
Sbjct: 1064 QNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET-EKTQEV 1122

Query: 94 DKKRDQSYEDELRSLEIQKQKLALAKEEARVKRENEFIDQELKHKAAQTDVVQSEADANR 153
K Q + +S +Q Q + + V I + D Q + +
Sbjct: 1123 PKVTSQVSPKQEQSETVQPQAEPARENDPTVN-----IKEPQSQTNTTADTEQPAKETSS 1177

Query: 154 NMTE 157
N+ +
Sbjct: 1178 NVEQ 1181


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4053  RTXTOXIND  31  0.019  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.019
Identities = 12/90 (13%), Positives = 32/90 (35%), Gaps = 14/90 (15%)

Query: 374 VDDGVTARAIENRLLEEQAAQLLPRGDRQVYQSEIANSQRIIENLTEQRAQILAEDPAGS 433
+ A+ + +LE++ + + +VY+S++ + I + E+ +
Sbjct: 244 LHKQAIAK---HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL----- 295

Query: 434 GKALSRARSDKQARLRDIDQRIRQAQERLE 463
+++ +LR I L
Sbjct: 296 ------FKNEILDKLRQTTDNIGLLTLELA 319


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_4043  PF05616  29  0.023  Neisseria meningitidis TspB protein
		>PF05616#Neisseria meningitidis TspB protein

Length = 501

Score = 28.6 bits (63), Expect = 0.023
Identities = 22/81 (27%), Positives = 32/81 (39%), Gaps = 6/81 (7%)

Query: 16 NEQPVDGGTAPAASEPSAPAGDNPAPVGDPSQQEGDKPQPVADGDKPADDKKPENDKQDE 75
N QP+ P S PA +NPAP +P + +P P + D D + D
Sbjct: 324 NAQPL-----PEVSPAENPA-NNPAPNENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDS 377

Query: 76 KKDGDKPEGAPEKYEFQAAEG 96
D+P G K + +G
Sbjct: 378 PAVPDRPNGRHRKERKEGEDG 398


33  APECO1_4001  APECO1_3989  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
APECO1_40011203.425616aminopeptidase B
APECO1_40002252.422275[2Fe-2S] ferredoxin
APECO1_39992252.469761chaperone protein HscA
APECO1_39982240.918401co-chaperone HscB
APECO1_39972281.362059iron-sulfur cluster assembly protein
APECO1_39961230.907035scaffold protein
APECO1_39950150.968111cysteine desulfurase
APECO1_3994-2101.865219DNA-binding transcriptional regulator IscR
APECO1_3993-191.476104methyltransferase
APECO1_3992-2111.437828inositol monophosphatase
APECO1_3991-1141.465389peptidase
APECO1_39900152.297180stationary phase inducible protein CsiE
APECO1_39890153.2132913-phenylpropionic acid transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3999  SHAPEPROTEIN  116  1e-30  Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 116 bits (292), Expect = 1e-30
Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 74/371 (19%)

Query: 41 GIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHS-------VGYDARTNAA 93
IDLGT N+L+ G + +E PSVV +Q VG+DA+
Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVVAIRQDRAGSPKSVAAVGHDAK-QML 63

Query: 94 LDTANTISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILK 153
T I++++ + +AD V+ +L+
Sbjct: 64 GRTPGNIAAIRPMKDGVIADF-------------------------------FVTEKMLQ 92

Query: 154 ALAARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYG 212
+ V++ VP +R+ +++A+ AG + L+ EP AAAI G
Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152

Query: 213 LDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG 272
L + V D+GGGT +++++ L+ V +GGD FD + +Y+R G
Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207

Query: 273 --IPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAG---WQG-----EISREQFNELI 322
I + + R++ E+ A + + V G +G ++ + E +
Sbjct: 208 SLIGEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEAL 260

Query: 323 APLVKRTLLACRRALKDAGVE-ADEVLE--VVMVGGSTRVPLVRERVGEFFGRPPLTSID 379
+ + A AL+ E A ++ E +V+ GG + + + E G P + + D
Sbjct: 261 QEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED 320

Query: 380 PDKVVAIGAAI 390
P VA G
Sbjct: 321 PLTCVARGGGK 331


34  APECO1_3931  APECO1_3856  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
APECO1_3931020-3.609016hypothetical protein
APECO1_3930-118-2.634063hypothetical protein
APECO1_3929017-1.882231hypothetical protein
APECO1_3928221-0.807279outer membrane lipoprotein
APECO1_39272210.58099350S ribosomal protein L19
APECO1_39262-1160.793826tRNA (guanine-1-)-methyltransferase
APECO1_39260160.485085tRNA (guanine-1-)-methyltransferase
APECO1_3925118-0.17946116S rRNA-processing protein RimM
APECO1_3924118-0.09126030S ribosomal protein S16
APECO1_39231130.038082signal recognition particle protein
APECO1_3922212-0.693644hypothetical protein
APECO1_3921211-1.204721hypothetical protein
APECO1_3920214-1.320276hypothetical protein
APECO1_3919315-1.112444inorganic polyphosphate/ATP-NAD kinase
APECO1_3918218-2.498142recombination and repair protein
APECO1_3917326-5.071859hypothetical protein
APECO1_3916531-5.726451hypothetical protein
APECO1_3915434-7.815365SsrA-binding protein
APECO1_3914438-8.115165hypothetical protein
APECO1_3913337-7.830340hypothetical protein
APECO1_39122438-7.602290hypothetical protein
APECO1_3912333-5.738545hypothetical protein
APECO1_3911124-3.154583hypothetical protein
APECO1_39105171.708444tail fiber protein
APECO1_39096192.913132tail fiber assembly protein
APECO1_39085214.959771hypothetical protein
APECO1_39075225.796755hypothetical protein
APECO1_39063224.210026bacteriophage V tail protein
APECO1_39053233.690815bacteriophage V tail protein
APECO1_39043233.396070bacteriophage V tail protein
APECO1_39032213.775266tail protein
APECO1_39021203.124942bacteriophage V tail/DNA circulation protein
APECO1_39010202.987146bacteriophage V tail protein
APECO1_39001224.564799hypothetical protein
APECO1_38991244.623402hypothetical protein
APECO1_3898-1244.322736bacteriophage V tail sheath pro
APECO1_38971233.380522hypothetical protein
APECO1_38961263.679745hypothetical protein
APECO1_38951273.285662bacteriophage head tail adaptor
APECO1_38942253.409271hypothetical protein
APECO1_38932253.698067major capsid protein
APECO1_38922253.322217pro-head protease
APECO1_38912242.831845phage portal protein
APECO1_38901201.349470bacteriophage V large terminase subunit
APECO1_3889329-2.344313bacteriophage V small terminase subunit
APECO1_3888328-4.595903phage endonuclease
APECO1_3887326-3.453613phage lysis protein
APECO1_3886224-2.954462muramidase
APECO1_3885123-2.147441hypothetical protein
APECO1_3884223-1.090881hypothetical protein
APECO1_38832211.006690Qin-like prophage; antitermination protein Q
APECO1_38822222.361822hypothetical protein
APECO1_38810211.791778hypothetical protein
APECO1_38802222.084121bacteriophage V crossover junction
APECO1_38793221.286122hypothetical protein
APECO1_3878221-0.076619DNA adenine methylase
APECO1_3877324-1.941631hypothetical protein
APECO1_3876326-3.556605hypothetical protein
APECO1_3875529-3.834116hypothetical protein
APECO1_3874323-4.402667e14 prophage; DNA-binding transcriptional
APECO1_3873223-5.213541hypothetical protein
APECO1_3872229-8.599926phage repressor
APECO1_3871130-8.583877hypothetical protein
APECO1_3870132-9.299905hypothetical protein
APECO1_3869236-11.087479hypothetical protein
APECO1_3868543-11.683926hypothetical protein
APECO1_3867645-12.643475hypothetical protein
APECO1_3866229-6.819999hypothetical protein
APECO1_3865-117-3.142130hypothetical protein
APECO1_3864-1130.385113hypothetical protein
APECO1_38630162.182547hypothetical protein
APECO1_38622203.195555hypothetical protein
APECO1_38611203.720229hypothetical protein
APECO1_38602193.299353hydroxyglutarate oxidase
APECO1_38591182.734905succinate-semialdehyde dehydrogenase I
APECO1_38581151.0838434-aminobutyrate aminotransferase
APECO1_3857114-1.219660gamma-aminobutyrate transporter
APECO1_38572-215-2.069571DNA-binding transcriptional regulator CsiR
APECO1_3856118-3.422869LysM domain/BON superfamily protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3928  OMPADOMAIN  106  1e-30  OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 106 bits (267), Expect = 1e-30
Identities = 38/151 (25%), Positives = 65/151 (43%), Gaps = 21/151 (13%)

Query: 18 GCQSPQGKFTPEQVAAMQSYGFTESAGDWSLGLSDAILFAKNDYKLLPESQQQIQTMAAK 77
G +P P +Q+ FT L +LF N L PE Q + + ++
Sbjct: 194 GEAAPVVAPAPAPAPEVQTKHFT---------LKSDVLFNFNKATLKPEGQAALDQLYSQ 244

Query: 78 LASTGLTHARMD--GHTDNYGEDSYNEVLSLKRANVVADAWAMGGQIPRSNLTTQGLGKK 135
L++ + G+TD G D+YN+ LS +RA V D + + IP ++ +G+G+
Sbjct: 245 LSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVD-YLISKGIPADKISARGMGES 303

Query: 136 YPIASNKTAQGR---------AENRRVAVVI 157
P+ N + A +RRV + +
Sbjct: 304 NPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3917  BLACTAMASEA  26  0.032  Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 26.3 bits (58), Expect = 0.032
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%)

Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61
K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A
Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122

Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88
MSD N + G G+T
Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3904  cloacin  28  0.029  Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 27.8 bits (61), Expect = 0.029
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 50 PYGFTARANSGAEAVVLFPDGDRSHAVVVTVSD 82
P GFT N+ +AV+ FP +AV V+VSD
Sbjct: 258 PAGFTQGGNT-RDAVIRFPKDSGHNAVYVSVSD 289


35  APECO1_3819  APECO1_3800  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
APECO1_38191153.732696DNA-binding transcriptional repressor SrlR
APECO1_38182164.194407D-arabinose 5-phosphate isomerase
APECO1_38171163.647608anaerobic nitric oxide reductase transcriptional
APECO1_38160173.526394anaerobic nitric oxide reductase
APECO1_38151163.308649nitric oxide reductase
APECO1_38140142.833620hydrogenase maturation protein HypF
APECO1_38130161.321808electron transport protein HydN
APECO1_38120161.378373ascBF operon repressor
APECO1_38110172.176281PTS system cellobiose/arbutin/salicin-specific
APECO1_38100181.589750cryptic 6-phospho-beta-glucosidase
APECO1_38090243.175018hypothetical protein
APECO1_3808-1294.600510hydrogenase 3 maturation protease
APECO1_3807-1275.240710formate hydrogenlyase maturation protein HycH
APECO1_3806-1265.361327hydrogenase 3 and formate hydrogenase complex,
APECO1_38051264.824601formate hydrogenlyase complex iron-sulfur
APECO1_38041244.699289hydrogenase 3, large subunit
APECO1_38032224.569539membrane-spanning protein of formate
APECO1_38022214.066721formate hydrogenlyase subunit 3
APECO1_38012192.653114hydrogenase 3, Fe-S subunit
APECO1_38001193.224939formate hydrogenlyase regulatory protein HycA
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3819  ARGREPRESSOR  28  0.024  Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 27.9 bits (62), Expect = 0.024
Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%)

Query: 1 MKPRQRQAAILEYLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40
M QR I E + + +EL ++ T T+ +D+
Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3817  HTHFIS  372  e-126  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 372 bits (956), Expect = e-126
Identities = 125/388 (32%), Positives = 193/388 (49%), Gaps = 33/388 (8%)

Query: 149 IAALAAGALS----------NALLIEQLESQNMLPGDAAPFEAVKQTQMIGLSPGMTQLK 198
I A GA +I + ++ ++ ++G S M ++
Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150

Query: 199 KEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLVYLNCAALPESVAESELFG 258
+ + + +DL ++I+GE+GTGKELVA+A+H+ R P V +N AA+P + ESELFG
Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210

Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318
H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270

Query: 319 VDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRL 378
DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ L +F +Q
Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329

Query: 379 RLGLSRVVLSAGARNLLQHYNFPGNVRELEHAIHRAVVLSRATRSGDEVIL-----EAQH 433
+ GL A L++ + +PGNVRELE+ + R L E+I E
Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389

Query: 434 FAFPEVTLPPPEAAAVPVVKQNLR-----------------EATEAFQRETIRQALAQNH 476
+ + V++N+R + I AL
Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449

Query: 477 HNWAACARMLETDVANLHRLAKRLGLKD 504
N A +L + L + + LG+
Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGVSV 477


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3812  HTHTETR  28  0.035  TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 28.4 bits (63), Expect = 0.035
Identities = 17/93 (18%), Positives = 29/93 (31%), Gaps = 7/93 (7%)

Query: 5 TTMLEVAKRAGVSKATVSRVLSG-----NGYVSQETKDRVFQAVEESGYRPNLLARNLSA 59
T++ E+AK AGV++ + + + +E P L
Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91

Query: 60 KSTQTLGLVVTNTLYHGIYFSELLFHAARMAEE 92
L VT + E++FH E
Sbjct: 92 ILIHVLESTVTEERRRLLM--EIIFHKCEFVGE 122


36  APECO1_3770  APECO1_3755  Y        Y        N  Pathogenicity Island (biased-composition)
LocusTagDNBiasCDNBias%GCBiasProduct
APECO1_37700163.251385phosphoadenosine phosphosulfate reductase
APECO1_37690142.923584sulfite reductase subunit beta
APECO1_37680152.765121sulfite reductase subunit alpha
APECO1_37671192.4227306-pyruvoyl tetrahydrobiopterin synthase
APECO1_37662182.512775electron transfer flavoprotein-quinone
APECO1_37651141.334962ferredoxin-like protein YgcO
APECO1_37640120.174401anti-terminator regulatory protein
APECO1_37631110.074250electron transfer flavoprotein subunit YgcQ
APECO1_3762111-0.820004electron transfer flavoprotein subunit YgcR
APECO1_3761012-1.420173metabolite transport protein YgcS
APECO1_3760112-2.819858FAD containing dehydrogenase
APECO1_3759215-3.807841oxidoreductase YgcW
APECO1_3758017-3.885615transporter
APECO1_3757020-4.144269sugar kinase
APECO1_3756020-3.917465hypothetical protein
APECO1_3755020-3.327148hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3769  PF07675  30  0.021  Cleaved Adhesin
		>PF07675#Cleaved Adhesin

Length = 1358

Score = 30.4 bits (68), Expect = 0.021
Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 12/92 (13%)

Query: 206 ILGQTYLPRKFKTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIE 260
++ +P+ T +P PQN + A+ ++VAI+++G L G + G++
Sbjct: 240 VMPYRAMPKT--NTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATV 297

Query: 261 HGNK-----KTYARTASEFGYLPLEHTLAVAE 287
+ K Y + YLP+ + E
Sbjct: 298 NMTKQITENGNYDVVITRSNYLPVIKQIQAGE 329


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3761  TCRTETB  36  2e-04  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.4 bits (84), Expect = 2e-04
Identities = 53/338 (15%), Positives = 123/338 (36%), Gaps = 34/338 (10%)

Query: 93 LGSLVLGWISDHIGRQKIFTFSFMLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 151
+G+ V G +SD +G +++ F ++ S + F + LI R + G G ++
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 152 GHTLLAEFSPRRHRGVLLGAFSVVWT----VGYVLASIAGHHFISESPEAWRWLLASAAL 207
++A + P+ +RG G + VG + + H+ W +LL +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177

Query: 208 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVATATHKHIKTLF-- 265
+ + L + R +G F I+ +L + + + L
Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234

Query: 266 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 312
++ R+ ++ F+ V+ +I+ ++ + + L+ + +
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 313 LNALLIVGALLGLVLTHLLAHRRFLLGSFLLLTATLVVMACLPSGSSLTLLLFVLFSTTI 372
+ ++ G + ++ ++ G +L + ++ S F+L +T+
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLS-----VSFLTASFLLETTSW 349

Query: 373 SAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVS 410
+V +L SF + S V + GA +S
Sbjct: 350 FMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMS 387


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3759  DHBDHDRGNASE  109  8e-31  2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 109 bits (274), Expect = 8e-31
Identities = 74/257 (28%), Positives = 116/257 (45%), Gaps = 11/257 (4%)

Query: 36 MDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANVFIPSFVKDNGETKEMIEK-QGVEVD 94
M+ ++GK A +TG G+G+A A LA GA++ + + E K + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 95 FMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAA 154
D+ A +I A G +DILVN AG+ + + +W+ VN T
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 155 FELSYEAAKIMIPQKSGKIINICSLFSYSGGQWSPAYSATKHALAGFTKAYCDELGQYNI 214
F S +K M+ ++SG I+ + S + AY+++K A FTK EL +YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 215 QVNGISPGYYATDI--TLATRSNPETNQRVLDY-------IPANRWGDTQDLMGAAVFLA 265
+ N +SPG TD+ +L N Q + IP + D+ A +FL
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAE-QVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 266 SPASNYVNGHLLVVDGG 282
S + ++ H L VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3758  TCRTETA  31  0.006  Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.006
Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 1/76 (1%)

Query: 41 GFSNTEIGLIMSTFGIAAIIFYA-PSGVIADKFSHRKMITSAMIITGLLGLIMATYPPLW 99
+ T IG+ ++ FGI + A +G +A + R+ + MI G +++A W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 100 VMLCIQVAFAITTILM 115
+ I V A I M
Sbjct: 302 MAFPIMVLLASGGIGM 317



Score = 30.6 bits (69), Expect = 0.012
Identities = 22/103 (21%), Positives = 45/103 (43%), Gaps = 8/103 (7%)

Query: 48 GLIMSTFGIAAIIFYAPSGVIADKFSHRKMITSAMIITGLLGLIMATYPPLWVMLCIQVA 107
G++++ + + G ++D+F R ++ ++ + IMAT P LWV+ ++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVG 147
IT + A + + D E+ + G+M G G
Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFG 143


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3755  56KDTSANTIGN  30     0.005    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.

Length = 533

Score = 30.3 bits (68), Expect = 0.005
Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 12/76 (15%)

Query: 30 NASWSEVLNQYQRRTDLIPNLVASIKGYSSHEQEVLEAVTLARSQANRASSDLQKTPGDE 89
+AS ++ ++ Q D + L S GY + + N+ + P +
Sbjct: 294 SASIEQIQSKIQELGDTLEELRDSFDGY------------INNAFVNQIHLNFVMPPQAQ 341

Query: 90 QKLQAWQQAQAQAQAQ 105
Q+ QQ QAQA AQ
Sbjct: 342 QQQGQGQQQQAQATAQ 357


37  APECO1_3719  APECO1_3691  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_3719  -2      18       4.142635    murein transglycosylase A
APECO1_3718  -1      21       4.369263    ***hypothetical protein
APECO1_3717  1       25       6.476545    hypothetical protein
APECO1_3716  0       26       6.954736    hypothetical protein
APECO1_3715  -1      21       4.849852    hypothetical protein
APECO1_3714  0       18       3.835765    outer membrane protein
APECO1_3713  1       16       0.443602    hemolysin-coregulated protein
APECO1_3712  1       17       -0.119058   ATPase
APECO1_3711  1       13       0.293661    hypothetical protein
APECO1_3710  2       15       -1.380987   hypothetical protein
APECO1_3709  2       17       -0.686514   hypothetical protein
APECO1_3708  1       18       1.894225    hypothetical protein
APECO1_3707  1       17       1.671274    hypothetical protein
APECO1_3705  1       16       1.048316    hypothetical protein
APECO1_3704  3       23       -4.008010   ISEc12 ATP-binding protein
APECO1_3703  1       18       -1.710427   transposase for ISEc12
APECO1_3702  0       18       0.091518    hypothetical protein
APECO1_3701  0       19       -2.536088   hypothetical protein
APECO1_3700  -1      17       0.044851    hypothetical protein
APECO1_3699  -2      18       2.568430    hypothetical protein
APECO1_3698  -1      19       2.843474    hypothetical protein
APECO1_3697  -1      19       -0.014583   hypothetical protein
APECO1_3696  2       23       -4.883183   hypothetical protein
APECO1_3695  2       24       -5.963461   hypothetical protein
APECO1_3694  1       27       -8.064456   hypothetical protein
APECO1_3693  0       25       -8.502953   2-hydroxyacid dehydrogenase
APECO1_3692  -1      20       -6.975226   phosphosugar isomerase
APECO1_3691  -1      17       -4.357129   aminotransferase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
APECO1_3714  OMPADOMAIN  80     1e-18    OMPA domain signature.
		>OMPADOMAIN#OMPA domain signature.

Length = 346

Score = 80.4 bits (198), Expect = 1e-18
Identities = 44/142 (30%), Positives = 63/142 (44%), Gaps = 14/142 (9%)

Query: 402 PEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVL---VDALVNIRAKPGWLILVA 458
+Q + L S LF+ +A LK L L N+ K G ++V
Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG-SVVVL 258

Query: 459 GYTDATGDEKSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGR-- 516
GYTD G + NQ LS RRA++V ++ L + IPA + +G+GES P N +
Sbjct: 259 GYTDRIGSDAYNQGLSERRAQSVVDY-LISKGIPADKISARGMGESNPVTGNTCDNVKQR 317

Query: 517 -------AVNRRVEISLVPRSD 531
A +RRVEI + D
Sbjct: 318 AALIDCLAPDRRVEIEVKGIKD 339


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_3712  HTHFIS  32     0.011    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 32.1 bits (73), Expect = 0.011
Identities = 35/189 (18%), Positives = 66/189 (34%), Gaps = 34/189 (17%)

Query: 512 IMTLRQEGTDSTELQQQLRTHQGFAPLLALDVDARAVATVVADWTG----IPLSSLL--- 564
+ + ++ +L +++ + P+L + + + A G +P L
Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111

Query: 565 RDEQSDLLSMEQSLENR----------VVGQRPALCAIAQRL-RAAKTGLTPENGPQGVF 613
L+ + ++ +VG+ A+ I + L R +T LT
Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT--------L 163

Query: 614 LLTGPSGTGKTETALTLADTLFGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGV 673
++TG SGTGK A L D + IN++ S+L G + G
Sbjct: 164 MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGA 215

Query: 674 LTEAVRKRP 682
T A +
Sbjct: 216 FTGAQTRST 224


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_3705  PF00577  31     0.042    Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 30.6 bits (69), Expect = 0.042
Identities = 15/71 (21%), Positives = 24/71 (33%), Gaps = 6/71 (8%)

Query: 274 LRLAHTLAERGIAHWQSVL---KPLLAGGAFSSLRLRGLMFSPPLAAVPEAAPHAWLPSP 330
+ +T ER I +S L G F + RG + +P +P
Sbjct: 243 WQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLP---DSQRGFAP 299

Query: 331 VWAGVTGDNAR 341
V G+ A+
Sbjct: 300 VIHGIARGTAQ 310


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3696  ANTHRAXTOXNA  29     0.010    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 29.3 bits (65), Expect = 0.010
Identities = 13/83 (15%), Positives = 35/83 (42%), Gaps = 9/83 (10%)

Query: 33 ESKSVASAVFYKQIKILHLDFFSR---------SALNTDAEDTPLSTMVHVWQLKTREDF 83
+ + V+Y+ K + LD S+ + + + ++D+ S ++ + K + +
Sbjct: 161 INSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLEL 220

Query: 84 DKADYDTLFMQEEKTLEKDVLAK 106
+ D F++E T + +
Sbjct: 221 NNKSIDINFIKENLTEFQHAFSL 243


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_3694  RTXTOXIND  29     0.048    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.0 bits (65), Expect = 0.048
Identities = 34/215 (15%), Positives = 62/215 (28%), Gaps = 20/215 (9%)

Query: 235 WPIFMAGMVVMAGLGGTGLW-GWSQLNQPDALIQRIQLSVMPLP-QSLESGELAKLDVKD 292
P + +M L + Q+ ++ S + +E+ + ++ VK+
Sbjct: 56 RPRLV-AYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114

Query: 293 -------KALLAQDRT-----IAASQMQLEQLNKLPARWPLEQGYRQLRQLDAL----WP 336
LL +Q L Q R+ + +L +L L P
Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174

Query: 337 DNPQVRALNAQWRKQRELSALSAEALNGYAQAQSQLQRLSAQLDALDERKGRYLTGSELK 396
V S N Q + L + A+ + R RY S ++
Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQ-NQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 397 TAVYGIRQSLKEPPLEELLRQLEEQKQTGEVSPTL 431
+ SL LE++ + E L
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNEL 268


38  APECO1_3535  APECO1_3452  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_3535  2       22       -3.960748   hypothetical protein
APECO1_3534  3       23       -4.112173   *P4-like integrase
APECO1_3533  4       24       -4.214817   hypothetical protein
APECO1_3532  4       25       -4.425593   superfamily I DNA helicase
APECO1_3531  8       28       -4.580554   hypothetical protein
APECO1_3530  9       32       -5.928488   tia invasion determinant
APECO1_3529  8       32       -5.089021   ISEc12 ATP-binding protein
APECO1_3528  8       35       -4.280974   transposase for ISEc12
APECO1_3527  7       38       -4.917385   protein PapG
APECO1_3526  6       35       -1.652529   P pilus minor tip component PapF
APECO1_3525  6       34       -1.333131   PapE protein
APECO1_3524  5       31       -1.154896   pilus assembly protein PapK
APECO1_3523  6       32       -1.869710   PapJ protein
APECO1_3522  5       31       -3.094893   pilus assembly protein chaperone PapD
APECO1_3521  5       29       -2.338248   outer membrane usher protein PapC
APECO1_3520  4       31       -3.750169   minor pilin subunit PapH
APECO1_3519  2       27       -3.860891   major fibrial protein
APECO1_3518  2       20       -2.212630   PapB-like protein
APECO1_3517  1       19       -0.743911   fimbrial regulatory protein
APECO1_3516  2       19       -1.959278   hypothetical protein
APECO1_3515  1       18       -3.468148   hypothetical protein
APECO1_3514  0       20       -3.952758   hypothetical protein
APECO1_3513  0       20       -3.618907   IS2 transposase
APECO1_3512  1       21       -3.543560   IS2 transposase
APECO1_3511  1       23       -4.438149   hypothetical protein
APECO1_3510  1       23       -4.141300   phosphoglycerate transporter protein PgtP
APECO1_3509  2       25       -3.447806   phosphoglycerate transport regulatory protein
APECO1_3508  5       25       -3.366773   regulatory protein
APECO1_3507  6       27       -2.899743   phosphoglycerate activator protein
APECO1_3506  7       33       -5.487843   hypothetical protein
APECO1_3505  7       38       -7.430132   hypothetical protein
APECO1_3504  6       33       -5.516749   hypothetical protein
APECO1_3503  6       32       -5.620189   hypothetical protein
APECO1_3502  5       33       -7.170330   hypothetical protein
APECO1_3501  5       34       -7.470110   hypothetical protein
APECO1_3500  5       32       -7.319267   hypothetical protein
APECO1_3499  4       26       -6.072235   transposase for IS629
APECO1_3498  5       26       -6.100681   transposase; OrfA protein of insertion sequence
APECO1_3497  5       26       -5.939193   iron-regulated outer membrane virulence protein
APECO1_3496  4       22       -4.274993   hypothetical protein
APECO1_3495  4       22       -3.499380   hypothetical protein
APECO1_3494  5       22       -1.092859   hypothetical protein
APECO1_3493  6       24       1.558360    hypothetical protein
APECO1_3492  7       28       1.931028    transcriptional regulator
APECO1_3491  7       28       2.708149    hypothetical protein
APECO1_3490  7       27       3.703909    hypothetical protein
APECO1_3489  8       25       1.522189    hypothetical protein
APECO1_3488  5       21       -1.312689   radC-like protein YeeS
APECO1_3487  3       18       -2.442820   hypothetical protein
APECO1_3486  2       14       -3.027621   hypothetical protein
APECO1_3485  1       10       -2.495814   hypothetical protein
APECO1_3484  0       11       -2.091240   hypothetical protein
APECO1_3483  -1      10       -1.348849   hypothetical protein
APECO1_3482  -2      9        -1.170983   polysialic acid capsule synthesis protein KpsF
APECO1_3481  0       15       -4.495017   polysialic acid transport protein KpsE
APECO1_3480  2       23       -7.792673   polysialic acid transport protein KpsD
APECO1_3479  3       35       -11.436984  3-deoxy-manno-octulosonate cytidylyltransferase
APECO1_3478  3       43       -14.216243  capsule polysaccharide export protein KpsC
APECO1_3477  7       54       -19.124183  polysialic acid capsule synthesis protein KpsS
APECO1_3476  10      62       -21.627964  poly-alpha-2,8 sialosyl sialyltransferase NeuS
APECO1_3475  10      60       -20.566581  polysialic acid biosynthesis protein NeuE
APECO1_3473  7       54       -17.930219  polysialic acid biosynthesis protein
APECO1_3472  5       48       -15.432079  acylneuraminate cytidylyltransferase
APECO1_3471  2       35       -9.936000   sialic acid synthase
APECO1_3470  -1      22       -4.678754   sialic acid synthase
APECO1_3469  -2      17       -1.160983   polysialic acid transport ATP-binding protein
APECO1_3468  1       16       1.800432    polysialic acid transport protein KpsM
APECO1_3467  1       18       4.117837    general secretion pathway protein YghD
APECO1_3466  1       15       4.147850    GspL-like protein
APECO1_3465  1       20       4.660825    type II secretion protein GspK
APECO1_3464  -1      19       5.118114    type II secretion protein GspJ
APECO1_3463  -1      19       4.411410    type II secretion protein GspI
APECO1_3462  -1      16       3.990889    type II secretion protein GspH
APECO1_3461  -2      15       3.283417    type II secretion protein GspG
APECO1_3460  -2      14       3.080400    type II secretion protein GspF
APECO1_3459  -1      12       1.255049    type II secretion protein GspE
APECO1_3458  -2      11       -0.216748   type II secretion protein GspD
APECO1_3457  -2      11       -0.580353   type II secretion protein GspC
APECO1_3456  -2      11       -0.195748   hypothetical protein
APECO1_3455  -3      12       0.233606    prepilin peptidase A
APECO1_3454  -3      12       0.791139    lipoprotein AcfD-like
APECO1_3453  -1      13       2.216943    hypothetical protein
APECO1_3452  -1      13       3.297866    glycolate transporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3530  OUTRMMBRANEA  41     2e-06    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 41.1 bits (96), Expect = 2e-06
Identities = 45/212 (21%), Positives = 67/212 (31%), Gaps = 47/212 (22%)

Query: 4 MKKVIVVSALAMAGVFSAQALADRGKTGFYVTGKAGASVVTQTDQRFRQDFGDDVYKYKG 63
MKK + A+A+AG + A + T +Y K G S + D +
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNT-WYTGAKLGWS-----------QYHDTGFINNN 48

Query: 64 GDKNDTVFGAGLAVGYDFYQHYNVPVRTEVEFYGRGAADSHYTLDTWHSPMGDGGREDTQ 123
G ++ GAG GY V ++ GR + G Q
Sbjct: 49 GPTHENQLGAGAFGGYQVNP--YVGFEMGYDWLGRMPYKGS----------VENGAYKAQ 96

Query: 124 NRLSVNTLMVNTYYDFRNSSAFTPWVSVGLGYARIHHKATYTDTSWNKSGEVSDISALHY 183
V Y + +T +G R DT N G+
Sbjct: 97 ---GVQLTAKLGYPITDDLDIYT---RLGGMVWR-------ADTKSNVYGKN-------- 135

Query: 184 SGYDNNFAWSIGAGVRYDITPDIALDLSYRYL 215
+D + GV Y ITP+IA L Y++
Sbjct: 136 --HDTGVSPVFAGGVEYAITPEIATRLEYQWT 165


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_3527  PF03627  602    0.0      PapG
		>PF03627#PapG

Length = 336

Score = 602 bits (1554), Expect = 0.0
Identities = 336/336 (100%), Positives = 336/336 (100%)

Query: 1 MKKWFPALLFSLCVSGESSAWNNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIATVT 60
MKKWFPALLFSLCVSGESSAWNNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIATVT
Sbjct: 1 MKKWFPALLFSLCVSGESSAWNNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIATVT 60

Query: 61 WNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDNDSYF 120
WNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDNDSYF
Sbjct: 61 WNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDNDSYF 120

Query: 121 FLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTSGMQ 180
FLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTSGMQ
Sbjct: 121 FLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTSGMQ 180

Query: 181 RHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY 240
RHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY
Sbjct: 181 RHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY 240

Query: 241 AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWY 300
AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWY
Sbjct: 241 AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWY 300

Query: 301 KAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336
KAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP
Sbjct: 301 KAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3526  FIMBRIALPAPF  268    2e-95    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature.

Length = 167

Score = 268 bits (685), Expect = 2e-95
Identities = 155/167 (92%), Positives = 157/167 (94%), Gaps = 1/167 (0%)

Query: 11 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 70
MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE
Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60

Query: 71 ITKTISISCTYKSGSPWIKVTGNAMA-GQTNVLATNIANFGIALYQGKGMSTPLTLGNGS 129
+TK ISISC YKSGS WIKVTGN M GQ NVLATNI +FGIALYQGKGMSTPLTLGNGS
Sbjct: 61 VTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120

Query: 130 GNGYRVTAGLDTARSTFTFTSVPFRNGSRTLNGGDFRTTASMSMIYN 176
GNGYRVTAGLDTARSTFTFTSVPFRNGS LNGGDFRTTASMSMIYN
Sbjct: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3525  FIMBRIALPAPE  296    e-106    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 296 bits (760), Expect = e-106
Identities = 124/173 (71%), Positives = 142/173 (82%)

Query: 1 MKKIRGLCLPVMLGAVLMSQHVHAVDNLTFRGKLIIPACTVSNTTVDWQDVEIQTLSQNG 60
MKKIRGLCLPVMLGAVLMSQHVHA DNLTF+GKLIIPACTV N V+W D+EIQ L Q+G
Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAEVNWGDIEIQNLVQSG 60

Query: 61 NHEKEFTVNMRCPYNLGTMKVTITATNTYNNAILVQNTSNTSSDGLLVYLYNSNAGNIGT 120
++K+FTV+M CPY+LGTMKVTIT+ N+ILV NTS S DGLL+YLYNSN IG
Sbjct: 61 GNQKDFTVDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGN 120

Query: 121 AITLGTPFTPGKITGNNADKTISLHAKLGYKGNMQNLIAGPFSATATLVASYS 173
A+TLG+ TPGKITG + I+L+AKLGYKGNMQ+L AG FSATATLVASYS
Sbjct: 121 AVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_3521  PF00577  734    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 734 bits (1897), Expect = 0.0
Identities = 241/882 (27%), Positives = 361/882 (40%), Gaps = 67/882 (7%)

Query: 1 MRGMKDRI-PFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLP 59
+ K R+ F V + +++ + FN L + D +RF + P
Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75

Query: 60 GQYLLDVIVNGQSISPASLQISFVEPALSGDKAEKKLPQACLTSDMVRLMGLTAESLDKV 119
G Y +D+ +N + A+ ++F CLT + MGL S+ +
Sbjct: 76 GTYRVDIYLNNGYM--ATRDVTFNTGDSEQGI------VPCLTRAQLASMGLNTASVSGM 127

Query: 120 VYWHDGQCADF-HGLPGVDIRPDTGAGVLRINMPQAWLEYSDATWLPPSRWDDGIPGLML 178
D C + + D G L + +PQA++ ++PP WD GI +L
Sbjct: 128 NLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLL 187

Query: 179 DYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTW 238
+YN +G +N GG+SH N G N+G WRLR + S S + + +
Sbjct: 188 NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSS--DSSSGSKNKWQH 245

Query: 239 SRFYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIA 298
+L R I R+ LTLG+ DIF ++ GA L SDD MLP RG+AP I GIA
Sbjct: 246 INTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIA 305

Query: 299 ETNARVVVSQQGRVLYDSMVPAGLFSIQDLD-SSVRGRLDVEVIEQNGRKKTFQVDTASV 357
A+V + Q G +Y+S VP G F+I D+ + G L V + E +G + F V +SV
Sbjct: 306 RGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSV 365

Query: 358 PYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQWSLYGGAVLAGDYNALAA 417
P L R G RY + +G R + E P F GL W++YGG LA Y A
Sbjct: 366 PLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNF 425

Query: 418 GAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERN 477
G G ++G G LS D+TQ+ + + + G+S R Y+K + + +I GYR+S
Sbjct: 426 GIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSG 485

Query: 478 YMTMEQYLNARYR--------------------NDYSSREKEMYTVTLNKNVADWNTSFN 517
Y +R + + ++ +T+ + + + +
Sbjct: 486 YFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLY 544

Query: 518 LQYSRQTYWDIRKTD-YYTVSVNRYFNVFGLQGVAVGLAASRSKYLGRD--NDSAYLRIS 574
L S QTYW D + +N F + L+ S +K + + L ++
Sbjct: 545 LSGSHQTYWGTSNVDEQFQAGLNTAFE-----DINWTLSYSLTKNAWQKGRDQMLALNVN 599

Query: 575 VPLGT------------GTASYSGSMSND-RYVNMAGYTDT-FNDGLDSYSLNAGLNSGG 620
+P +ASYS S + R N+AG T D SYS+ G GG
Sbjct: 600 IPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGG 659

Query: 621 GLTSQRQINAYYSHRSPLANLSANIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGG 680
S A ++R N + S SGG G L G
Sbjct: 660 DGNSGSTGYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTL--GQPLND 716

Query: 681 TRLLVDTDGVGGVPVDGGQVV-TNRWGTGVVTDISSYYRNTTSVDLKRLPDDVEATRSVV 739
T +LV G V+ V T+ G V+ + Y N ++D L D+V+ +V
Sbjct: 717 TVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVA 776

Query: 740 ESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLS 799
T GAI +F G +L L + PFGA VTSE + G+VAD G +LS
Sbjct: 777 NVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLS 835

Query: 800 GVTPGETLSVNW--DGKIQCQVNVPETAISDQQLL----LPC 835
G+ + V W + C N S QQLL C
Sbjct: 836 GMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3520  FIMBRIALPAPE  30     0.003    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature.

Length = 173

Score = 30.4 bits (68), Expect = 0.003
Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 27/172 (15%)

Query: 29 GMTLPEYWG----EEHVWWDGRAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDL-QNGF 83
G+ LP G +HV F G+++ PACT+ + ++ G+ +++L Q+G
Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAE----VNWGDIEIQNLVQSG- 60

Query: 84 SGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGET---PDKFNLSGQAKGINLQI 140
G ++ F++ + NC ++ ++ +T +G G + P+ SG I L
Sbjct: 61 -GNQKDFTVDM-NCPYS------LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYN 112

Query: 141 ADAR--GNIARAGKVMPAIPLTGNEEALDYTLRIVR----NGKKLEAGNYFA 186
++ GN G + +TG A TL N + L+AG + A
Sbjct: 113 SNNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSA 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3518  FIMREGULATRY  168    5e-58    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature.

Length = 104

Score = 168 bits (426), Expect = 5e-58
Identities = 104/104 (100%), Positives = 104/104 (100%)

Query: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60
MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS
Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60

Query: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104
RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD
Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_3510  TCRTETA  39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.0 bits (91), Expect = 3e-05
Identities = 66/387 (17%), Positives = 130/387 (33%), Gaps = 39/387 (10%)

Query: 52 TPYLKEQLDLSATQI---GVLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 108
P L L S G+L + + V+ +L+D+ + + L A+
Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87

Query: 109 VGLGFSTAFWIFAALVILNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNIGGGI 168
+ + W+ I+ G+ G IA+ ER R F +S G G+
Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144

Query: 169 VA-PIVGAAFALLGSEHWQSASYIVPACVAIVFAVIVLILGKGSPHQEGLPSLEEMMPEE 227
VA P++G A + A + + + L +PE
Sbjct: 145 VAGPVLGGLMGGFSPH----APFFAAAALNGLNFLTGCFL----------------LPE- 183

Query: 228 KVVLNTRQTVKAPENMSAFQIFCTYVLRNKNAWYVSLVDVFVYMVRFGMISWLPIYLLTV 287
+ + + P A ++ +L+ VF M G + +
Sbjct: 184 -----SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238

Query: 288 KHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKLFKGRRMPLAMICMALIFICLIGYW 344
F + ++ + ++ ++ G ++ +L + R + L MI +I L
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 345 KSESLFMVTIFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGASLGTS 404
+ F + + A G + Q + S Q E GS L ++ I G L T+
Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS-LTSIVGPLLFTA 357

Query: 405 LFGIMVDHIGWHGGFYLLGCGIICCII 431
++ + W+G ++ G + +
Sbjct: 358 IYAASITT--WNGWAWIAGAALYLLCL 382


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_3507  HTHFIS  240    7e-77    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 240 bits (615), Expect = 7e-77
Identities = 113/479 (23%), Positives = 191/479 (39%), Gaps = 83/479 (17%)

Query: 10 SILLIDDDADVLDAYTQLLEQSGYRVFACNNPFEAQAWIQPDWPGIVLSDVCMPGCSGID 69
+IL+ DDDA + Q L ++GY V +N WI +V++DV MP + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 70 LMMLFHQDDQQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKLLSLVEEALRQRQS 129
L+ + LP+L+++ A+ A +KGA+D+L KP D +L+ ++ AL + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 130 IIARRQYCQQTLQVELIGRSEWINQYRRRLQQLSETDIAVWLYGAPGTGRMTGARYLHQF 189
++ + Q L+GRS + + R L +L +TD+ + + G GTG+ AR LH +
Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183

Query: 190 GRNAQGEFVYRELTPDNAPQLND------------------------FIALAQGGTLVLS 225
G+ G FV N + A+GGTL L
Sbjct: 184 GKRRNGPFV-----AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238

Query: 226 HPEHLTREQQYHLVQ-LQSQEHRP----------FRLIGIGDTSLVELAASNHIIAELYY 274
+ + Q L++ LQ E+ R++ + L + +LYY
Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298

Query: 275 CFAMTQIACLPLTQRPDDIEPLFRHYLCKACQRLNHPVPEVGKEMLKEMMRRMWPNNVRE 334
+ + PL R +DI L RH++ +A + V +E L+ M WP NVRE
Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRE 357

Query: 335 LANAAE--------------------------------LFTVGVLPLAE---------TA 353
L N G L +++ A
Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417

Query: 354 NPLMHVGTPTPLDRRVEDAERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 412
+ + DR + + E +I AL +G + A+ L + R L ++++ G+S
Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3483  PYOCINKILLER  31     0.017    Pyocin S killer protein signature.
		>PYOCINKILLER#Pyocin S killer protein signature.

Length = 617

Score = 30.5 bits (68), Expect = 0.017
Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 5/40 (12%)

Query: 110 LPVGPVAEK----EQWRHDMLIRFPEDTGTLP-WILLRTP 144
PV E D++I FP D+G P +++ R P
Sbjct: 447 TPVKATPETYPGVITLPEDLIIGFPADSGIKPIYVMFRDP 486


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3468  ABC2TRNSPORT  33     6e-04    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 33.4 bits (76), Expect = 6e-04
Identities = 29/125 (23%), Positives = 54/125 (43%), Gaps = 10/125 (8%)

Query: 137 ITNFLQLVLTWSLLIILS--CGVGLIF----MVVGKTFPEMQKVL---PILLKPLYFISC 187
+ L SLL L GL F MVV P + +++ P+ F+S
Sbjct: 135 VAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSG 194

Query: 188 IMFPLHSIPKQYWSYLLWNPLVHVVELSREAVMPGYISE-GVSLNYLAMFTLVTLFIGLA 246
+FP+ +P + + + PL H ++L R ++ + + + L ++ ++ F+ A
Sbjct: 195 AVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTA 254

Query: 247 LYRTR 251
L R R
Sbjct: 255 LLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3464  BCTERIALGSPG  29     0.011    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 28.7 bits (64), Expect = 0.011
Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 1 MRRARAGFTLLEMLVAIAIFASLA-LMAQQVTNGVTRVNSAVAGHD 45
+ R GFTLLE++V I I LA L+ + + + A D
Sbjct: 4 TDKQR-GFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3463  BCTERIALGSPH  32     3e-04    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 32.2 bits (73), Expect = 3e-04
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAAMAVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3462  BCTERIALGSPH  77     3e-20    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 77.3 bits (190), Expect = 3e-20
Identities = 42/196 (21%), Positives = 71/196 (36%), Gaps = 41/196 (20%)

Query: 1 MPERGFTLLEIMLVIFLIGLASAGVVQTFATASEPPAKKAAQDFLTRFAQFKDRAVIEGQ 60
M +RGFTLLE+ML++ L+G+++ V+ F + + A + F + + R + GQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 TLGVLIDPPGYQFMQRRHGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120
GV + P +QF+ + P D W L L+
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101

Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171
R+ ++ +L L + P + P TPF L L
Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148

Query: 172 AHDGALSLNQCDERMP 187
++ N E +P
Sbjct: 149 GEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3461  BCTERIALGSPG  218    2e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 218 bits (556), Expect = 2e-76
Identities = 91/146 (62%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADARNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P A NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3460  BCTERIALGSPF  453    e-161    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 453 bits (1167), Expect = e-161
Identities = 226/406 (55%), Positives = 301/406 (74%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKDLIPVHI-EARMNASAGGLLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVATADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++T+DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPASTRMLIAMSDALQASGVYWLAGLLGLLVL 239
VVA VV+ILL+ VVPK++EQF H+ ALP STR+L+ MSDA++ G + L LL +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALADLRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3458  BCTERIALGSPD  575    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 575 bits (1483), Expect = 0.0
Identities = 295/668 (44%), Positives = 429/668 (64%), Gaps = 34/668 (5%)

Query: 24 LLPLMLAAALCSSPVWEEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83
L L++ AAL P EE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+
Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143
LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127

Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203
VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD
Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187

Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262
+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P
Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322
+R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306

Query: 323 REVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382
+ + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+
Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366

Query: 383 QWGSKDAGLMQFANGTQIPIGTLGAAISAAKPQKGSTVISENGATTINPDTNGDLST-LA 441
QW +K+AG+ QF N + +PI T A + +G +S+ LA
Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406

Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501
LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556
S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L
Sbjct: 467 SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPVIGNLFKSTADKKEKRNL 616
F R + VL GE +V+GGL+D ++ KVPLLGDIPVIG LF+ST+ K KRNL
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPILPAQNQALPP 674
M+FIRPT++RD S +Y Q + E +++ I P Q+ A
Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645

Query: 675 EVRAFLNA 682
+V A ++A
Sbjct: 646 QVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3457  BCTERIALGSPC  119  2e-34  Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 119 bits (300), Expect = 2e-34
Identities = 71/284 (25%), Positives = 116/284 (40%), Gaps = 38/284 (13%)

Query: 3 RGMFWLMLLIISAKMAYSLWRYFSFSAEYTAVSSSVN-KPLRADAKPFDKNDVQLVSQQN 61
R +F+L++L+ ++A WR A SSV P +A +P ND L
Sbjct: 16 RILFYLLMLLFCQQLAMIFWR---IGLPDNAPVSSVQITPAQARQQPVTLNDFTL----- 67

Query: 62 WFGKY-QPVAAPV-KQPESAPVAETRLNVVLRGIAFG---ARPGVVIEEGGKQQVYLQGE 116
FG + A + + + + LN+ L G+ G +R +I + +Q E
Sbjct: 68 -FGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNE 126

Query: 117 RLGSHNAVIEEINRDHVMLRYQGKMERLSLAEEERPPVAVTSKKAASDEAKQAVAEPVVS 176
+ +NA I I D V+L+YQG+ E L L +E + SD A
Sbjct: 127 EVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQE---------DSGSDGVPGAQVN---- 173

Query: 177 APVEIPAAVRQALAKDPQKIFNYIQLTPVRKEG-IVGYAVKPGADRSLFDASGFREGDIA 235
Q + + +Y+ +P+ + + GY + PG F G ++ D+A
Sbjct: 174 ---------EQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMA 224

Query: 236 IALNQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARYDISIAL 279
+ALN D D M ++ + + LTV R G R DI +
Sbjct: 225 VALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3455  PREPILNPTASE  283  1e-97  Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 283 bits (726), Expect = 1e-97
Identities = 111/276 (40%), Positives = 151/276 (54%), Gaps = 12/276 (4%)

Query: 31 LTMLFDVFQQYPAAMPILATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGETPSTQSKI 89
+ +L ++ P L + L+IGSFLNVVI R PIML R+ AE+ +
Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGV 60

Query: 90 -----SLALPRSHCPHCQQTIRVRDNIPLLSWLMLKGRCRDCQAKISKRYPLVELLTALA 144
+L +PRS CPHC I +NIPLLSWL L+GRCR CQA IS RYPLVELLTAL
Sbjct: 61 DEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALL 120

Query: 145 FLLASLVWPESGWGLAVMILSAWLIAASIIDLDNQWLPDVFTQGVLWTGLIAAWAQQSPL 204
+ ++ LA ++L+ L+A + IDLD LPD T +LW GL+ +
Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFV 179

Query: 205 TLQDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGPLSLPNVALIAS 264
+L DAV G + G++ +SL W ++ KE +G GD L AALG W+G +LP V L++S
Sbjct: 180 SLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSS 239

Query: 265 CCGLIYAVI-----TKRGSTTLPFGPCLSLGGIATL 295
G + S +PFGP L++ G L
Sbjct: 240 LVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3454  PF03544  48  1e-07  Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 48.0 bits (114), Expect = 1e-07
Identities = 29/107 (27%), Positives = 41/107 (38%), Gaps = 8/107 (7%)

Query: 46 PEVKPDPTPTPEPTPEPTPDPEPTPDPTPD-PEPTPEPEPEPVPTKTGYLTLGGSQRVTG 104
V+P P P EP PEP P PEP + +P P+P+P+P P K ++
Sbjct: 64 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK-------VEQPKR 116

Query: 105 ATCNGESSDGFTFTPGNTVSCVVGSTTIATFNTQSEAARSLRAVDKV 151
ES F + T AT + A RA+ +
Sbjct: 117 DVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRN 163



Score = 41.1 bits (96), Expect = 2e-05
Identities = 14/87 (16%), Positives = 21/87 (24%), Gaps = 1/87 (1%)

Query: 35 TPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPT-KTGY 93
P PE P+P E P+ + +PV +
Sbjct: 69 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASP 128

Query: 94 LTLGGSQRVTGATCNGESSDGFTFTPG 120
R T +T +S T
Sbjct: 129 FENTAPARPTSSTATAATSKPVTSVAS 155



Score = 38.8 bits (90), Expect = 9e-05
Identities = 16/58 (27%), Positives = 22/58 (37%), Gaps = 1/58 (1%)

Query: 31 SSSDTPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVP 88
S + + + + P P P PEP +P P+PEP PEP E
Sbjct: 36 SVHQVIELPAPAQPISVTMVAPADLEPP-QAVQPPPEPVVEPEPEPEPIPEPPKEAPV 92



Score = 36.9 bits (85), Expect = 4e-04
Identities = 16/46 (34%), Positives = 19/46 (41%)

Query: 46 PEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPTKT 91
P T EP +P P+P +PEP PEP PEP
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 91



Score = 36.1 bits (83), Expect = 6e-04
Identities = 17/40 (42%), Positives = 17/40 (42%)

Query: 50 PDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPT 89
P P T D EP P PEP EPEPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83


39  APECO1_3410  APECO1_3393  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3410  -2  13  -3.120530  hypothetical protein
APECO1_3409  1  22  -6.206911  regulator
APECO1_3408  2  24  -6.048610  oxidoreductase YdfI
APECO1_3407  1  20  -4.224277  zinc-type alcohol dehydrogenase-like protein
APECO1_3406  0  18  -3.933618  ureidoglycolate dehydrogenase
APECO1_3405  -1  10  -1.569194  c4-dicarboxylate transport system binding
APECO1_3404  -2  10  -0.100421  hypothetical protein
APECO1_3403  -2  10  0.796325  c4-dicarboxylate permease
APECO1_3402  -2  13  2.030463  repressor protein for FtsI
APECO1_3401  -1  12  1.412076  1-acyl-sn-glycerol-3-phosphate acyltransferase
APECO1_3400  -1  12  1.714368  DNA topoisomerase IV subunit A
APECO1_3399  -1  15  -0.454390  hypothetical protein
APECO1_3398  0  20  -3.813184  transcriptional regulator
APECO1_3397  1  25  -6.323132  hypothetical protein
APECO1_3396  0  23  -6.265317  DNA-binding transcriptional regulator QseB
APECO1_3395  0  23  -5.998738  sensor protein QseC
APECO1_3394  0  30  -7.571915  hypothetical protein
APECO1_3393  -1  27  -5.301515  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3406  CHLAMIDIAOM6  33  0.002  Chlamydia cysteine-rich outer membrane protein 6 si...
		>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature.

Length = 547

Score = 32.7 bits (74), Expect = 0.002
Identities = 16/36 (44%), Positives = 20/36 (55%), Gaps = 1/36 (2%)

Query: 112 VSVKNTSHCGALSYFAEMITH-KGLVAIVMTQTDTC 146
V VK+ S CG + AE T+ KG+ A M DTC
Sbjct: 406 VVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTC 441


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3396  HTHFIS  90  6e-23  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 6e-23
Identities = 30/129 (23%), Positives = 56/129 (43%)

Query: 2 RILLIEDDMLIGDGIKTGLSKMGFSVDWFTQGRQGKEALYSAPYDAVILDLTLPGMDGRD 61
IL+ +DD I + LS+ G+ V + + + D V+ D+ +P + D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 ILREWREKGQREPVLILTARDALAERVEGLRLGADDYLCKPFALIEVAARLEALMRRTNG 121
+L ++ PVL+++A++ ++ GA DYL KPF L E+ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 QASNELRHG 130
+ S
Sbjct: 125 RPSKLEDDS 133


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3395  PF06580  36  3e-04  Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 35.6 bits (82), Expect = 3e-04
Identities = 37/176 (21%), Positives = 60/176 (34%), Gaps = 29/176 (16%)

Query: 284 DRATRLVDQLLTLSRLDSLDNLQDVAEIPLEDLLQSSVMDIYHTAQQANIDVRLTLNANG 343
+A ++ L L R SL ++ L D L +V+D Y + RL
Sbjct: 191 TKAREMLTSLSELMRY-SLRYSNA-RQVSLADEL--TVVDSYLQLASIQFEDRLQFENQI 246

Query: 344 IKRTGQ----PLLLSLLVRNLLDNAVRYSPQGSVVDVTLNADN----FIVRDNGPGVTPE 395
P+L+ LV N + + + PQG + + DN V + G
Sbjct: 247 NPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306

Query: 396 ALARIGERFYRPPGQTATGSGLGLSIV-QRIAKLHDMNVEFG-NAEQGGFEAKVSW 449
T +G GL V +R+ L+ + + +QG A V
Sbjct: 307 ---------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347


40  APECO1_3374  APECO1_3361  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3374  0  14  -3.336421  disulfide isomerase
APECO1_3373  1  13  -2.993966  disulfide oxidoreductase
APECO1_3372  1  16  -4.473430  hypothetical protein
APECO1_3371  0  16  -4.657351  zinc transporter ZupT
APECO1_3370  0  17  -5.395436  fimbrial protein
APECO1_3369  0  20  -5.538078  outer membrane usher protein YqiG
APECO1_3368  -2  22  -4.464685  periplasmic chaperone YqiH
APECO1_3367  -2  17  -2.453905  Yqi fimbrial adhesin
APECO1_3366  1  9  1.308601  3,4-dihydroxy-2-butanone 4-phosphate synthase
APECO1_3365  1  8  1.573521  hypothetical protein
APECO1_3364  1  8  1.703773  hypothetical protein
APECO1_3363  1  10  2.851177  hypothetical protein
APECO1_3362  -1  12  3.470538  bifunctional heptose 7-phosphate kinase/heptose
APECO1_3361  0  13  3.137016  bifunctional glutamine-synthetase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3369  PF00577  688  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 688 bits (1777), Expect = 0.0
Identities = 233/878 (26%), Positives = 401/878 (45%), Gaps = 70/878 (7%)

Query: 14 HAIKNALSG------VVCSLLFVLPVH--AVEFNVDMIDAEDRENIDISRFEKKGYIPPG 65
H K+ L+G V C+ P+ + FN + + + D+SRFE +PPG
Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76

Query: 66 RYLVRVQINKNMLPQTLILEWVKADNESGSLLCLTKENLTNFGLNTEFIESLQNIAGSEC 125
Y V + +N + + + D+E G + CLT+ L + GLNT + + +A C
Sbjct: 77 TYRVDIYLNNGYMATRDV-TFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135

Query: 126 LDLSQR-QELTTRLDKATMILSLSVPQAWLKYQATNWTPPEFWDTGIAGFILDYNVYASQ 184
+ L+ + T +LD L+L++PQA++ +A + PPE WD GI +L+YN +
Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195

Query: 185 YAPHHGDSTQNVSSYGTLGFNLGAWRLRSDYQYNQNFADGRSVNRDS-EFARTYLFRPIP 243
G ++ G N+GAWRLR + ++ N +D S +++ + T+L R I
Sbjct: 196 VQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDII 255

Query: 244 SWSSKFTMGQYDLSSNLYDTFHFTGASLESDESMLPPDLQGYAPQITGIAQTNAKVTVAQ 303
S+ T+G +++D +F GA L SD++MLP +G+AP I GIA+ A+VT+ Q
Sbjct: 256 PLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQ 315

Query: 304 NGRVLYQTTVAPGPFTISDL-GQSFQGLLDVTVEEEDGRTSTFQVGSASIPYLTRKGQVR 362
NG +Y +TV PGPFTI+D+ G L VT++E DG T F V +S+P L R+G R
Sbjct: 316 NGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTR 375

Query: 363 YKTSLGKPTSVGHNDINNPFFWTAEASWGWLNNVSLYGGGMFTADDYQAITTGIGFNLNQ 422
Y + G+ S P F+ + G ++YGG AD Y+A GIG N+
Sbjct: 376 YSITAGEYRSGNAQQE-KPRFFQSTLLHGLPAGWTIYGGTQL-ADRYRAFNFGIGKNMGA 433

Query: 423 FGSLSFDVTGADASLQQQNSGNLRGYSYRFNYAKHFESTGSQITFAGYRFSDKDYVSMSE 482
G+LS D+T A+++L G S RF Y K +G+ I GYR+S Y + ++
Sbjct: 434 LGALSVDMTQANSTLPDD--SQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFAD 491

Query: 483 YLSSRNGDESID--------------------NEKESYVISLNQYFETLELNSYLNVTRN 522
SR +I+ N++ +++ Q YL+ +
Sbjct: 492 TTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT-STLYLSGSHQ 550

Query: 523 TYWDS-ASNTNYSVSVSKNFDIGDFKGISASLAVSRIR--WDDDEENQYYFSFSLPL--- 576
TYW + + + ++ F+ I+ +L+ S + W + + ++P
Sbjct: 551 TYWGTSNVDEQFQAGLNTA-----FEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHW 605

Query: 577 --------QQNRNISYSMQRTGSSNTSQMISWYDS--SDRNNIWNISASATDDNIRDGEP 626
++ + SYSM + + + Y + D N +++ +
Sbjct: 606 LRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGS 665

Query: 627 TLRGSYQHYSPWGRLNINGSVQPNQYNSVTAGWYGSLTATRHGIALHDYSYGDNARMMVD 686
T + + +G NI S + + G G + A +G+ L ++ ++V
Sbjct: 666 TGYATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVK 722

Query: 687 TDGISGIEINSNRTV-TNGLGIAVIPSLSNYTTSMLRVNNNDLPEGVDVENSVIRTTLTQ 745
G ++ + V T+ G AV+P + Y + + ++ N L + VD++N+V T+
Sbjct: 723 APGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTR 782

Query: 746 GAIGYAKLNATTGYQIVGVIRQENGRFPPLGVNVTDKATGKDVGLVAEDGFVYLSGIQEN 805
GAI A+ A G +++ + N + P G VT + + G+VA++G VYLSG+
Sbjct: 783 GAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTS-ESSQSSGIVADNGQVYLSGMPLA 840

Query: 806 STLHLTWGD---NTCEVT---PPNQSNISESAIILPCK 837
+ + WG+ C PP + + C+
Sbjct: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3363  IGASERPTASE  50  2e-08  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.4 bits (120), Expect = 2e-08
Identities = 47/287 (16%), Positives = 92/287 (32%), Gaps = 16/287 (5%)

Query: 197 PNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALSRKLEIEQQEAFMTLEQ 256
N A+ + + E R A + + ++ E +QE+ +
Sbjct: 999 TPNNIQAD-VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA---ENSKQESKTVEKN 1054

Query: 257 EQQVKTRTAEQNAKIAAFEAERRREAE-QTRILAERQIQETEIDREQAVRSRKVEAEREV 315
EQ TA+ + A EA+ +A QT +A+ + E + + VE E +
Sbjct: 1055 EQDATETTAQN--REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112

Query: 316 RIKEIEQQQVTEIANQTKSIAIAAKSEQ---QSQAEARANLALAEAVSAQQNVETTRQTA 372
+++ + Q+V ++ +Q +++ Q + E + + E S T Q A
Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172

Query: 373 EADRAKQVALIAAAQDAET------KAVELTVRAKAEKEAAEMQAAAIVELAEATRKKGL 426
+ + + + T T +E + R
Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232

Query: 427 AEAEAQRALNDAINVLSDEQTSLKFKLALLQALPAVIEKSVEPMKAI 473
A + ND V + TS L A ++ KA+
Sbjct: 1233 NVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3362  LPSBIOSNTHSS  29  0.029  Lipopolysaccharide core biosynthesis protein signat...
		>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature.

Length = 166

Score = 29.0 bits (65), Expect = 0.029
Identities = 10/37 (27%), Positives = 20/37 (54%)

Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383
G FD + GH+ + +L D++ VAV + + + +
Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43


41  APECO1_3280  APECO1_3263  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3280  1  19  3.032799  hypothetical protein
APECO1_3279  -1  19  3.243381  hypothetical protein
APECO1_3278  -1  18  1.768928  hypothetical protein
APECO1_3277  0  18  2.492401  hypothetical protein
APECO1_3276  1  19  3.458240  hypothetical protein
APECO1_3275  -1  19  3.962165  GIY-YIG nuclease superfamily protein
APECO1_3274  -1  19  3.389604  acyltransferase
APECO1_3273  -1  18  3.441657  lipid carrier protein
APECO1_3272  -1  16  2.850154  collagenase
APECO1_3271  1  23  2.170491  protease
APECO1_3270  2  27  1.803758  hypothetical protein
APECO1_3269  3  29  1.305294  tryptophan permease
APECO1_3268  4  33  1.323366  ATP-dependent RNA helicase DeaD
APECO1_3267  5  33  0.773485  lipoprotein NlpI
APECO1_3266  6  37  1.287550  polynucleotide phosphorylase/polyadenylase
APECO1_3265  6  32  0.934893  30S ribosomal protein S15
APECO1_3264  4  29  0.900778  tRNA pseudouridine synthase B
APECO1_3263  2  21  -1.125139  ribosome-binding factor A
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3278  NUCEPIMERASE  29  0.014  Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.0 bits (65), Expect = 0.014
Identities = 8/22 (36%), Positives = 13/22 (59%)

Query: 19 VLITGATGLVGGHLLRMLINEP 40
L+TGA G +G H+ + L+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24


42  APECO1_3237  APECO1_3218  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3237  2  16  -0.480035  3-deoxy-D-manno-octulosonate 8-phosphate
APECO1_3236  2  17  -0.648803  hypothetical protein
APECO1_3235  3  16  0.168577  lipopolysaccharide transport periplasmic protein
APECO1_3234  3  17  0.305707  ABC transporter ATP-binding protein
APECO1_3233  3  15  -0.237042  RNA polymerase factor sigma-54
APECO1_3232  0  17  -0.531993  sigma(54) modulation protein
APECO1_3231  0  15  0.123745  PTS system transporter subunit IIA-like
APECO1_3230  0  12  -0.320383  hypothetical protein
APECO1_3229  -1  13  0.318955  phosphohistidinoprotein-hexose
APECO1_3228  -1  16  2.780308  hypothetical protein
APECO1_3227  -1  18  3.131644  monofunctional biosynthetic peptidoglycan
APECO1_3226  -1  18  2.915308  isoprenoid biosynthesis protein with
APECO1_3225  -2  18  3.358123  aerobic respiration control sensor protein ArcB
APECO1_3224  -1  20  4.727776  Fe-S oxidoreductase
APECO1_32232  -1  20  4.685384  glutamate synthase subunit alpha
APECO1_3223  -2  14  3.515776  glutamate synthase subunit beta
APECO1_3222  -2  12  3.109743  hypothetical protein
APECO1_3221  -1  13  3.905371  N-acetylmannosamine kinase
APECO1_3220  0  17  3.222151  N-acetylmannosamine-6-phosphate 2-epimerase
APECO1_3219  1  21  2.544416  sialic acid transporter
APECO1_3218  4  26  1.434096  N-acetylneuraminate lyase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3236  MYCMG045  29  0.017  Hypothetical mycoplasma lipoprotein (MG045) signature.
		>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature.

Length = 483

Score = 28.5 bits (63), Expect = 0.017
Identities = 30/144 (20%), Positives = 53/144 (36%), Gaps = 17/144 (11%)

Query: 57 ALSYRLIAQHVEYYSDQAVSWFTQPVLTTFDKDKIPTWSVKADKAKLTNDRMLYLYGHVE 116
AL +I + + +A L D+ K +K D+ T+D YL G ++
Sbjct: 307 ALDLLVINKQQSNFQKEAHEIIFDLALDGADQTKEQL--IKTDEELGTDDEDFYLKGAMQ 364

Query: 117 ----VNALVPDSQLRRITT----------DNAQINLVTQDVTSEDLVTLYGTTFNSSGLK 162
VN + P + +T + + T +TSE Y T + K
Sbjct: 365 NFSYVNYVSPLKVISDPSTGIVSSKKNNAEMKSKQMSTDQMTSEKEFDYYTETLKALLEK 424

Query: 163 M-RGNLRSKNAELIEKVRTSYEIQ 185
L +L+E ++ +Y I+
Sbjct: 425 EDSAELNENEKKLVETIKKAYTIE 448


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3225  HTHFIS  64  7e-13  FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.5 bits (157), Expect = 7e-13
Identities = 26/115 (22%), Positives = 45/115 (39%), Gaps = 4/115 (3%)

Query: 528 VLLVEDIELNVIVARSVLEKLGNSVDVAMTGKAALEMFKPGEYDLVLLDIQLPDMTGLDI 587
+L+ +D V L + G V + G+ DLV+ D+ +PD D+
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 588 SRELTKRYPREDLPPLVALTA-NVLKDKQEYLNAGMDDVLSKPLSVPALTAMIKK 641
+ K P P++ ++A N + G D L KP + L +I +
Sbjct: 66 LPRIKKARPD---LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3219  TCRTETB  59  1e-11  Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.1 bits (143), Expect = 1e-11
Identities = 81/455 (17%), Positives = 158/455 (34%), Gaps = 46/455 (10%)

Query: 40 LLDGFDFVLIALVLTEVQGEFGLTTVQAASLISAAFISRWFGGLMLGAMGDRYGRRLAMV 99
+ +++ + L ++ +F + +A ++ G + G + D+ G + ++
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 100 TSIVLFSAGTLACGFAPGYITMFI-ARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGF 158
I++ G++ + ++ I AR + G G A V PK R KA G
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 159 LISGFSVGAVVAAQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDWKEKHGGKA 218
+ S ++G V + ++ W L I ++ II +L K + +
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR--------- 194

Query: 219 PVRTMVDILYRGEHRIANIVMTLAAATALWFCFAGNLQNAAIVAVLGLLCAAIFISFMVQ 278
+G I I++ + IV+VL L IF+ + +
Sbjct: 195 ---------IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL---IFVKHIRK 242

Query: 279 STGK----RWPTGVMLMVVVLFAFLYSWPIQA---LLPTYLKTDLAYDPHTVANVLFFSG 331
T + M+ VL + + ++P +K + +V+ F G
Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302

Query: 332 -FGAAVGCCVGGFLGDWLGTRK-AYVCSLLASQLLIIPVFAIGGANVWVLGLLLFFQQML 389
+ +GG L D G + S + F + W + +++ F
Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLET-TSWFMTIIIVFVLGG 361

Query: 390 GQGIAGILPKLIGGYFDTDQRAAGLGFTYNVGALGGALAP-ILGALIA-----QRL---- 439
++ ++ + AG+ L I+G L++ QRL
Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPME 421

Query: 440 -DLGTALAS---LSFSLTFVVILLIGLDMPSRVQR 470
D T L S L FS V+ L+ L++ QR
Sbjct: 422 VDQSTYLYSNLLLLFSGIIVISWLVTLNVYKHSQR 456


43  APECO1_3191  APECO1_3185  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3191  1  13  -3.452132  acetyl-CoA carboxylase biotin carboxyl carrier
APECO1_3190  1  14  -4.198691  acetyl-CoA carboxylase biotin carboxylase
APECO1_3189  1  22  -5.323256  hypothetical protein
APECO1_3188  1  24  -5.737658  ribokinase family sugar kinase
APECO1_3187  1  24  -5.997047  ribose transport system permease RbsC
APECO1_3186  0  22  -4.878382  ribose transport ATP-binding protein RbsA
APECO1_3185  0  20  -3.653529  ribose ABC transporter
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3191  RTXTOXIND  27  0.026  Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 27.5 bits (61), Expect = 0.026
Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 127 IEADKSGTVKAILVESGQPVEFDEPLV 153
I+ ++ VK I+V+ G+ V + L+
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLL 125


44  APECO1_3042  APECO1_3032  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3042  1  19  -4.482032  thiosulfate sulfurtransferase
APECO1_3041  2  22  -5.528971  glycerol-3-phosphate dehydrogenase
APECO1_3040  4  38  -9.474850  hypothetical protein
APECO1_3039  4  41  -10.268331  hypothetical protein
APECO1_3038  4  46  -11.585935  hypothetical protein
APECO1_3037  5  47  -11.464246  fimbrial adhesin
APECO1_3036  3  45  -11.437973  Auf fimbrial chaperone AufF
APECO1_3035  3  42  -10.357641  Auf fimbrial chaperone AufF
APECO1_3034  1  24  -7.040407  Auf fimbriae minor subunit AufE
APECO1_3033  0  19  -4.829341  Auf fimbriae minor subunit AufD
APECO1_3032  1  14  -3.550863  outer membrane usher protein AufC
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3036  TYPE4SSCAGX  27  0.014  Type IV secretion system CagX conjugation protein si...
		>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature.

Length = 522

Score = 27.4 bits (60), Expect = 0.014
Identities = 11/17 (64%), Positives = 12/17 (70%)

Query: 13 DPNMSYQKLRWARKNEI 29
DPNM+ LRW R NEI
Sbjct: 456 DPNMTNSGLRWYRVNEI 472


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3032  PF00577  883  0.0  Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 883 bits (2282), Expect = 0.0
Identities = 398/866 (45%), Positives = 570/866 (65%), Gaps = 28/866 (3%)

Query: 19 KRVVPLLLVIMPACSIA--------GMRFNPAFLSGDTEAVADLSRFEKGMTYLPGSYEV 70
R+ + + AC+ A + FNP FL+ D +AVADLSRFE G PG+Y V
Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80

Query: 71 EVWVNDSPLLSRTVTFKADDANQ-LIPCLSLADLLSLGINKNALPEQALASSENSCLDLR 129
++++N+ + +R VTF D+ Q ++PCL+ A L S+G+N ++ L + +++C+ L
Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA-DDACVPLT 139

Query: 130 IWFPDVHYMPELDAQRLKLTFPQAIIKRDARGYIPPEQWDNGITAFLLNYDFSGN--NDR 187
D ++ QRL LT PQA + ARGYIPPE WD GI A LLNY+FSGN +R
Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199

Query: 188 GDYSSNNYYLNLRAGINIGAWRFRDYSTWSR-----GSNSAGKLEHISSTLQRVIIPFRS 242
+S+ YLNL++G+NIGAWR RD +TWS S S K +HI++ L+R IIP RS
Sbjct: 200 IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRS 259

Query: 243 ELTLGDTWSSSDVFDSVSIRGIKLESDENMLPDSQSGFAPTVRGIAKSRAQVTIKQNGYV 302
LTLGD ++ D+FD ++ RG +L SD+NMLPDSQ GFAP + GIA+ AQVTIKQNGY
Sbjct: 260 RLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYD 319

Query: 303 IYQTYMPPGPFEISDLNPTSSAGDLEVTIKESDNSETVYTVPYAAVPILQREGHSKYSTT 362
IY + +PPGPF I+D+ ++GDL+VTIKE+D S ++TVPY++VP+LQREGH++YS T
Sbjct: 320 IYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSIT 379

Query: 363 VGQYRSNSYNQKSPYIFQGELIWGLPWDITAYGGAQFSEDYRALALGLGLNLGVFGATSF 422
G+YRS + Q+ P FQ L+ GLP T YGG Q ++ YRA G+G N+G GA S
Sbjct: 380 AGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSV 439

Query: 423 DVTQANSSLVDGSKHQGQSYRFLYSKSLVQTGTAFHIIGYRYSTQGFYTLSDTTYQQMSG 482
D+TQANS+L D S+H GQS RFLY+KSL ++GT ++GYRYST G++ +DTTY +M+G
Sbjct: 440 DMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNG 499

Query: 483 TVVDPKTLDDKDYVYNWNDFYNLRYSKRGKFQASVSQPFGNYGSMYLSASQQTYWNTDKK 542
++ + + + D+YNL Y+KRGK Q +V+Q G ++YLS S QTYW T
Sbjct: 500 YNIETQDGVIQVKPK-FTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNV 558

Query: 543 DSLYQVGYNTSIKGIYLNVAWNYSKSPGTN-ADKIVSLNVSLPISNWLSSTNDGRSSSNA 601
D +Q G NT+ + I ++++ +K+ D++++LNV++P S+WL S D +S
Sbjct: 559 DEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS--DSKSQWRH 616

Query: 602 MTATYGYSQDNHGQVNQYTGVSGSLLEQHNLSYNIQHGFANQDNSSSGSVG---VNYRGA 658
+A+Y S D +G++ GV G+LLE +NLSY++Q G+A + +SGS G +NYRG
Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676

Query: 659 YGSLNSAYSYDNEGNQQINYGISGALVVHENGLTLSQPLGETNVLIKAPGANNVDVQRGT 718
YG+ N YS+ + +Q+ YG+SG ++ H NG+TL QPL +T VL+KAPGA + V+ T
Sbjct: 677 YGNANIGYSHSD-DIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQT 735

Query: 719 GISTDWRGYAVVPYATEYRRNNISLDPMSMNMHTELDITSTEVIPGKGALVRAEFAAHIG 778
G+ TDWRGYAV+PYATEYR N ++LD ++ + +LD V+P +GA+VRAEF A +G
Sbjct: 736 GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVG 795

Query: 779 IRGLFTVRYRNKSVPFGATASAQIKNSSQITGIVGDNGQLYLSGLPLEGVINIQWGDGVQ 838
I+ L T+ + NK +PFGA +++ SSQ +GIV DNGQ+YLSG+PL G + ++WG+
Sbjct: 796 IKLLMTLTHNNKPLPFGAMVTSE---SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852

Query: 839 QKCQANYKLPETELDNPVSYATLECR 864
C ANY+LP ++ + ECR
Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878


45  APECO1_3020  APECO1_2968  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag  DNBias  CDNBias  %GCBias  Product
APECO1_3020  0  16  -3.203836  DNA-binding transcriptional repressor
APECO1_3019  1  22  -7.374757  hypothetical protein
APECO1_3018  2  29  -8.895183  dehydrogenase
APECO1_3017  1  23  -6.392073  hypothetical protein
APECO1_3016  2  20  -5.570708  acetyltransferase YhhY
APECO1_3015  2  17  -4.513778  hypothetical protein
APECO1_3014  1  13  -0.052562  hypothetical protein
APECO1_3013  -1  16  2.407527  hypothetical protein
APECO1_3012  -2  19  3.057152  gamma-glutamyltranspeptidase
APECO1_3011  -2  22  2.944548  hypothetical protein
APECO1_3010  -1  24  3.435912  glycerophosphodiester phosphodiesterase
APECO1_3009  -1  25  3.200610  glycerol-3-phosphate transporter ATP-binding
APECO1_3008  -1  25  3.089176  glycerol-3-phosphate transporter membrane
APECO1_3007  -2  26  3.558038  glycerol-3-phosphate transporter permease
APECO1_3006  -2  23  3.288595  glycerol-3-phosphate transporter periplasmic
APECO1_3005  -2  22  3.911410  leucine/isoleucine/valine transporter
APECO1_3004  -2  22  3.421309  leucine/isoleucine/valine transporter
APECO1_3003  -2  19  2.467158  leucine/isoleucine/valine transporter permease
APECO1_3002  0  15  -0.250201  branched-chain amino acid transporter permease
APECO1_3001  1  14  -2.655450  leucine transporter subunit
APECO1_3000  1  18  -5.735502  hypothetical protein
APECO1_2999  2  23  -7.937843  Leu/Ile/Val-binding protein precursor
APECO1_2998  3  35  -11.204153  hypothetical protein
APECO1_2997  1  25  -8.568411  hypothetical protein
APECO1_2996  -1  18  -5.681585  hypothetical protein
APECO1_2995  -2  15  -3.997333  hypothetical protein
APECO1_2994  0  13  -1.473939  hypothetical protein
APECO1_2993  1  14  0.423778  hypothetical protein
APECO1_2992  2  15  1.988855  RNA polymerase factor sigma-32
APECO1_2991  1  14  2.407369  cell division protein FtsX
APECO1_2990  0  14  3.897047  cell division protein FtsE
APECO1_2989  1  14  3.353248  cell division protein FtsY
APECO1_2988  -1  15  3.395907  16S rRNA m(2)G966-methyltransferase
APECO1_2987  -1  14  3.613206  hypothetical protein
APECO1_2986  -1  14  3.217878  hypothetical protein
APECO1_2985  0  14  3.224027  zinc/cadmium/mercury/lead-transporting ATPase
APECO1_2983  0  16  1.806742  hypothetical protein
APECO1_2982  1  16  2.776048  hypothetical protein
APECO1_2981  0  18  3.882357  major facilitator superfamily transporter
APECO1_2980  0  21  4.376501  hypothetical protein
APECO1_2979  -1  25  5.466152  holo-(acyl carrier protein) synthase 2
APECO1_2978  0  25  5.338324  nickel-binding periplasmic protein NikA
APECO1_2977  0  21  4.218979  nickel transporter permease NikB
APECO1_2976  -1  19  2.907058  nickel transporter permease NikC
APECO1_2975  0  18  0.698691  nickel transporter ATP-binding protein NikD
APECO1_2974  3  18  -1.357249  nickel transporter ATP-binding protein NikE
APECO1_2973  1  17  -1.375512  nickel responsive regulator
APECO1_2972  2  18  -1.988137  regulatory protein
APECO1_2971  2  16  -0.996748  phosphotransferase system enzyme subunit
APECO1_2970  1  15  -0.012580  phosphotransferase system enzyme subunit
APECO1_2969  1  16  1.896238  PTS system galactitol-specific transporter
APECO1_2968  0  17  3.189926  xylulose kinase
ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3016  SACTRNSFRASE  35  4e-05  Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 4e-05
Identities = 20/92 (21%), Positives = 32/92 (34%), Gaps = 16/92 (17%)

Query: 55 VACIDGIVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMID------MCD 108
+ ++ +G + I + + D + D R K GV +AL+ + I+ C
Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125

Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKFGFEIEG 140
L I N A Y K F I
Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3012  NAFLGMOTY  32  0.005  Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 32.0 bits (72), Expect = 0.005
Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%)

Query: 272 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQIMAEAEKYA 330
R P+ G+ R + SMPPP G H +I N+ F Q G+ G A I++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133

Query: 331 YADRSEYLGDPDFVKVPWQA 350
Y P F WQ+
Sbjct: 134 Y---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3010  PF04619  28  0.020  Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 28.4 bits (63), Expect = 0.020
Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 4/60 (6%)

Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84
+G ++ D + G+ FL+ D+N ++ W + D GSW
Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3009  PF05272  29  0.042  Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.042
Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTTGDI 61
+V+ G G GKSTL+ + GL+ +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_3006  MALTOSEBP  39  3e-05  Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 39.3 bits (91), Expect = 3e-05
Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%)

Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPKQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193
G L++ P L YNKD PPKTW+++ +LKA G + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251
+ +A G F +N +D D ++ K + +++ + D Y
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236

Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291
+ F G+ AMT + +NI + +K NYGV ++P
Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274


ORFs having significant similarity with Known Virulence factors
LocusTag  Hits  Score  E-value  Comments
APECO1_2989  IGASERPTASE  51  1e-08  IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 50.8 bits (121), Expect = 1e-08
Identities = 43/208 (20%), Positives = 67/208 (32%), Gaps = 21/208 (10%)

Query: 20 QTPEK-ETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAANVVEVTEQV 78
TP + +V + EEI + E T AE + VE EQ
Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057

Query: 79 AESEKAQ---------------PEAEVVAQPESVVEETPEPVAIEREELPLPEDVNAEAV 123
A AQ + VAQ S +ET E + E E
Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET- 1116

Query: 124 SPEEWQAEAETVEIVEAAEEEAAKEEITDEEPEAQALAAEVAEEA-VMVVSPSEEEQPVE 182
E E V + ++E ++ EP + +E + ++ EQP +
Sbjct: 1117 ---EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173

Query: 183 EIAQEQEKPTKEGFFARLKRSLLKTKEN 210
E + E+P E S+++ EN
Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVVENPEN 1201



Score = 47.0 bits (111), Expect = 2e-07
Identities = 36/179 (20%), Positives = 61/179 (34%), Gaps = 9/179 (5%)

Query: 19 EQTPEKETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAANVVEVTEQV 78
TP + TE E E + A+E + + A EV +
Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089

Query: 79 AESEKAQPEAEVVAQPESVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEAETVEIV 138
+E+++ Q + +V +E E +E E+ V ++ VSP++ Q+E +
Sbjct: 1090 SETKETQTTE--TKETATVEKE--EKAKVETEKTQEVPKVTSQ-VSPKQEQSETVQPQ-A 1143

Query: 139 EAAEEEAAKEEITDEEPEAQALAAEVA---EEAVMVVSPSEEEQPVEEIAQEQEKPTKE 194
E A E I + + + A E + V P E V E P
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202



Score = 42.0 bits (98), Expect = 6e-06
Identities = 28/156 (17%), Positives = 45/156 (28%), Gaps = 14/156 (8%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASE------HAVEEQPQAHTEAEAETFAAN 70
Q +T E T + E+ VE + P S+ + QPQA E +
Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155

Query: 71 VVEVTEQVAESEKAQPEAEVVAQPESVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQA 130
++ ++ QP E + E V E+ V + PE+ P
Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQPTVNSE 1214

Query: 131 EAETVEI-------VEAAEEEAAKEEITDEEPEAQA 159
+ + E A D A
Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250



Score = 40.8 bits (95), Expect = 1e-05
Identities = 25/176 (14%), Positives = 54/176 (30%), Gaps = 2/176 (1%)

Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAANVVEVTE 76
+E E ++ V+ E E + +E E +A+ EV +
Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124

Query: 77 QVAESEKAQPEAEVVAQPESVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEAETVE 136
++ Q ++E V QP++ +P + +E + A+ P + +
Sbjct: 1125 VTSQVSPKQEQSETV-QPQAEPARENDPT-VNIKEPQSQTNTTADTEQPAKETSSNVEQP 1182

Query: 137 IVEAAEEEAAKEEITDEEPEAQALAAEVAEEAVMVVSPSEEEQPVEEIAQEQEKPT 192
+ E+ + + E A + + V + E T
Sbjct: 1183 VTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2981  TCRTETA  53     8e-10    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 53.3 bits (128), Expect = 8e-10
Identities = 80/398 (20%), Positives = 147/398 (36%), Gaps = 32/398 (8%)

Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70
++ N ++ I+ + IGL + VLPG + D++ G++++L
Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60

Query: 71 PHAGRYADLLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 129
P G +D G + +++ L G + + Y L V L +GR++ GI G +
Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112

Query: 130 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIM 187
A G+ + + H G + + G +G +G H A AL +
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172

Query: 188 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIATF 240
LL + + P + + +A +A V A
Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232

Query: 241 ITLFYDAK-GWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 296
+F + + WD ++L + + + ++ R+G M+ + G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 297 LVGVATMPWMAKIG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 355
L+ AT WMA VLLA G + PAL + + V ++ QG + L+ +
Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349

Query: 356 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 389
GPL + A + ++A A L + L R
Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_2977  BORPETOXINB  28     0.048    Bordetella pertussis toxin B subunit signature.
		>BORPETOXINB#Bordetella pertussis toxin B subunit signature.

Length = 226

Score = 27.7 bits (61), Expect = 0.048
Identities = 21/77 (27%), Positives = 32/77 (41%), Gaps = 10/77 (12%)

Query: 204 GQRHVTWARLRGLSDKQTERRHILRNASLPMITAVGMHIGELIGGTMIIENIFAWPGVG- 262
R +T A LRG D Q RH+ R S+ + G ++G GG +I++ PG
Sbjct: 53 KTRALTVAELRGSGDLQEYLRHVTRGWSIFALYD-GTYLGGEYGG--VIKD--GTPGGAF 107

Query: 263 ----RYAVSAIFNRDYP 275
+ + N P
Sbjct: 108 DLKTTFCIMTTRNTGQP 124


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_2974  HTHFIS  30     0.008    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 30.2 bits (68), Expect = 0.008
Identities = 10/34 (29%), Positives = 19/34 (55%)

Query: 25 QAVLNNVSLALKSGETVALLGRSGCGKSTLARLL 58
Q + ++ +++ T+ + G SG GK +AR L
Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180


46  APECO1_2945  APECO1_2940  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2945  2       20       -1.781384   hypothetical protein
APECO1_2944  2       21       -2.414649   hypothetical protein
APECO1_2943  3       20       -4.068965   permease of iron ABCtransport system
APECO1_2942  0       22       -7.934967   hemin importer ATP-binding subunit
APECO1_2941  0       24       -9.084860   Mg(2+) transport ATPase
APECO1_2940  -1      21       -5.400211   acid-resistance protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2942  PF05272  28     0.029    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.029
Identities = 10/23 (43%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 28 EIVAIL-GPNGAGKSTLLRQLTG 49
+ +L G G GKSTL+ L G
Sbjct: 596 DYSVVLEGTGGIGKSTLINTLVG 618


47  APECO1_2872  APECO1_2852  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2872  -2      17       3.093601    DedA family membrane protein
APECO1_2871  -2      19       3.805823    extracellular substrate-binding protein YiaO
APECO1_2870  -3      15       3.503889    L-xylulose kinase
APECO1_2869  -1      11       2.204364    3-keto-L-gulonate-6-phosphate decarboxylase
APECO1_2868  0       11       2.359755    L-xylulose 5-phosphate 3-epimerase
APECO1_2867  0       11       2.659761    hypothetical protein
APECO1_2866  0       12       2.205731    lactaldehyde dehydrogenase
APECO1_2865  0       12       1.974896    hypothetical protein
APECO1_2864  0       12       2.514846    alcohol dehydrogenase
APECO1_2863  -2      13       3.044532    selenocysteinyl-tRNA-specific translation
APECO1_2862  -3      12       2.141422    selenocysteine synthase
APECO1_2861  -2      14       1.237064    glutathione S-transferase
APECO1_2860  -1      15       1.114837    hypothetical protein
APECO1_2859  -1      18       -0.077056   hypothetical protein
APECO1_2858  2       21       -0.071036   PTS system mannitol-specific transporter subunit
APECO1_2857  3       18       0.071325    mannitol-1-phosphate 5-dehydrogenase
APECO1_2856  3       18       0.052260    mannitol repressor protein
APECO1_2855  2       17       0.676776    hypothetical protein
APECO1_2854  2       16       0.891240    hypothetical protein
APECO1_2853  2       16       1.559561    autotransporter/adhesin
APECO1_2852  -2      14       3.476121    L-lactate permease
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_2863  TCRTETOQM  58     5e-11    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 58.3 bits (141), Expect = 5e-11
Identities = 44/147 (29%), Positives = 69/147 (46%), Gaps = 18/147 (12%)

Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47
I HVD GKTTL +++ +G D E++RG+TI G +
Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65

Query: 48 PDGRVPGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTREHLAILQLTGNPML 107
+ +V ID PGH FL+ + + +D A+L+++ DGV AQTR L+ G P +
Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTI 124

Query: 108 TVALTKADRVDEARVDEVERQVKEVLR 134
+ K D+ + V + +KE L
Sbjct: 125 -FFINKIDQNG-IDLSTVYQDIKEKLS 149


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_2860  RTXTOXIND  62     2e-12    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 61.8 bits (150), Expect = 2e-12
Identities = 55/314 (17%), Positives = 102/314 (32%), Gaps = 82/314 (26%)

Query: 66 ITLQVTGIVTEVTDKNNQLIQKGEVLFKLDPVR------------YQARVD--RLQA--- 108
I IV E+ K + ++KG+VL KL + QAR++ R Q
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158

Query: 109 ------------------------DLMTATHNIK----TLRAQLTEAQANTTQVSAERDR 140
+++ T IK T + Q + + N + AER
Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218

Query: 141 LFKNYQRY----------LKGSQAAVNPFS---------ERDIDDARQNF---LAQDALV 178
+ RY L + ++ + E +A +Q +
Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278

Query: 179 KGSVAE----QAQIQSQLDSMVNGE----QSQIVSLRAQLTEAKYNLEQTVIRAPSNGYV 230
+ + + + + + I L +L + + + +VIRAP + V
Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338

Query: 231 TQVLIR-PGTYAAALPLRPVMVFIPEQKRQIV-AQFRQNSLLRLKPGDDAEVVFNALPGQ 288
Q+ + G +MV +PE V A + + + G +A + A P
Sbjct: 339 QQLKVHTEGGVVT--TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396

Query: 289 VFH---GKLTSILP 299
+ GK+ +I
Sbjct: 397 RYGYLVGKVKNINL 410


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2853  PF03895  63     4e-14    Serum resistance protein DsrA.
		>PF03895#Serum resistance protein DsrA.

Length = 79

Score = 62.5 bits (152), Expect = 4e-14
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 2/79 (2%)

Query: 1506 ESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGV-SMVSANGRWVYKLQ 1564
+L G+A+ A++ L Q G + S G Y ++A+A+GV S ++ +
Sbjct: 2 SKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVA 61

Query: 1565 GSTNSQGEYSAALGAGIQW 1583
+T + G S G ++
Sbjct: 62 FNTYN-GGMSYGASVGYEF 79


48  APECO1_2838  APECO1_2821  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2838  0       19       -5.830643   2-amino-3-ketobutyrate coenzyme A ligase
APECO1_2837  2       26       -8.914549   ADP-L-glycero-D-manno-heptose-6-epimerase
APECO1_2836  3       32       -10.860026  ADP-heptose--LPS heptosyltransferase
APECO1_2835  3       40       -14.063831  ADP-heptose--LPS heptosyltransferase
APECO1_2834  3       42       -16.198805  lipid A-core, surface polymer ligase
APECO1_2833  3       34       -14.147509  beta-1,3-glucosyltransferase
APECO1_2832  3       28       -11.162563  UDP-galactose:(galactosyl) LPS
APECO1_2831  2       27       -9.409688   lipopolysaccharide core biosynthesis protein
APECO1_2830  2       22       -6.336764   lipopolysaccharide 1,2-glucosyltransferase
APECO1_2829  2       17       -3.445755   lipopolysaccharide 1,3-galactosyltransferase
APECO1_2828  2       14       -1.624889   lipopolysaccharide core biosynthesis protein
APECO1_2827  2       15       -0.554850   lipopolysaccharide core biosynthesis
APECO1_2826  1       14       0.319858    lipopolysaccharide core biosynthesis protein
APECO1_2825  0       12       1.562832    3-deoxy-D-manno-octulosonic-acid transferase
APECO1_2824  0       14       1.233215    formamidopyrimidine-DNA glycosylase
APECO1_2823  0       13       1.488664    DNA repair protein RadC
APECO1_2822  0       14       2.248273    bifunctional phosphopantothenoylcysteine
APECO1_2821  3       13       1.166868    deoxyuridine 5'-triphosphate
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2837  NUCEPIMERASE  104    7e-28    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 104 bits (260), Expect = 7e-28
Identities = 77/348 (22%), Positives = 127/348 (36%), Gaps = 67/348 (19%)

Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLDI 47
+VTG AGFIG ++ K L + G ++ +DNL D +D+
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61

Query: 48 ADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100
AD + + + A F E +F + +Y ++N + Y+ +
Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109

Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158
L C +I LYASS++ YG F + P+++Y +K +
Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169

Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218
G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A
Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224

Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258
+ + W +E G ++N+G A + +
Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284

Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305
+P G T AD L G+ P TV +GV ++ W
Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_2822  UREASE  29     0.032    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 29.3 bits (66), Expect = 0.032
Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 15/55 (27%)

Query: 74 GHIELGKWADLVILAPA----------TADLIARVAAGMANDLVSTICLATPAPV 118
G +E+GK ADLV+ PA IA G N + TP PV
Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNA-----SIPTPQPV 473


49  APECO1_2812  APECO1_2783  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2812  -2      15       3.459524    DNA-directed RNA polymerase subunit omega
APECO1_2811  -2      13       3.271253    bifunctional (p)ppGpp synthetase II/
APECO1_2810  -1      13       3.103379    tRNA guanosine-2'-O-methyltransferase
APECO1_2809  -1      12       2.832261    ATP-dependent DNA helicase RecG
APECO1_2808  -1      12       1.842185    glutamate transport protein
APECO1_2807  -2      11       2.106912    purine permease YicE
APECO1_2806  -2      13       1.612059    hypothetical protein
APECO1_2805  -1      14       0.643109    hypothetical protein
APECO1_2804  -1      16       0.031903    hypothetical protein
APECO1_2803  1       21       -3.364442   hypothetical protein
APECO1_2802  -1      15       -2.828887   aldolase
APECO1_2801  -1      14       -3.606533   PTS enzyme-II fructose
APECO1_2800  -1      13       -3.771875   PTS system, fructose-like-2 IIB component 1
APECO1_2799  -2      14       -4.220517   phosphotransferase system (PTS),
APECO1_2798  -2      13       -3.892680   transcriptional antiterminator
APECO1_2797  -2      9        -2.569615   alpha-xylosidase
APECO1_2796  -2      14       -4.000278   transporter
APECO1_2795  0       17       -3.149342   *hypothetical protein
APECO1_2794  0       19       -3.362984   hypothetical protein
APECO1_2793  -1      19       -2.597429   hypothetical protein
APECO1_2792  -1      15       -0.920430   hypothetical protein
APECO1_2791  -1      11       -0.005149   hypothetical protein
APECO1_2790  -1      12       0.952431    ribonucleoside transporter
APECO1_2789  -1      12       1.586586    hypothetical protein
APECO1_2788  -1      13       2.023878    xanthine/uracil permase YicO
APECO1_2787  -1      15       3.115172    cryptic adenine deaminase
APECO1_2786  0       17       3.552604    sugar phosphate antiporter
APECO1_2785  1       16       3.855041    regulatory protein UhpC
APECO1_2784  1       16       4.127165    sensory histidine kinase UhpB
APECO1_2783  2       17       3.458934    DNA-binding transcriptional activator UhpA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
APECO1_2809  SECA  40     4e-05    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 39.9 bits (93), Expect = 4e-05
Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 18/129 (13%)

Query: 244 NLSMLALRAGAQRFHAQPLSANDALKNKLLAALPFKPTGAQARVVAEIERDM-ALDVPMM 302
LS L+ F A+ L + L+N + A A R ++ M DV ++
Sbjct: 37 KLSDEELKGKTAEFRAR-LEKGEVLENLIPEAF------AVVREASKRVFGMRHFDVQLL 89

Query: 303 ---RLVQGDV-----GSGKTLVAALAA-LRAVAHGKQVALMAPTELLAEQHANNFRNWFA 353
L + + G GKTL A L A L A+ GK V ++ + LA++ A N R F
Sbjct: 90 GGMVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFE 148

Query: 354 PLGIEVGWL 362
LG+ VG
Sbjct: 149 FLGLTVGIN 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2798  PF08280  34     0.001    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 34.1 bits (78), Expect = 0.001
Identities = 79/491 (16%), Positives = 168/491 (34%), Gaps = 73/491 (14%)

Query: 7 RQNRLLRFLLPRREYTTIVTIAGYLNVSEKTIQRDLRLLEQWL-GQWRINVEKRAGAGVM 65
+ +L+ + I +A ++ + L + + ++KR M
Sbjct: 45 SKCQLVVLFF-KTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKR-----M 98

Query: 66 LSAENIADLLHLDHLLVAECEEIDGVMNNARRVKIASQLLSETPNETSISKLSERYFISG 125
+ H ++ + + ++ +++ + L+ + ++ + +F+S
Sbjct: 99 I-------SCQFTHP--SKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSN 149

Query: 126 ASIVNDLRVIESWLAPLGLSLIRSPSGTHIEGSEGQVRQAMALLINGIINHNEPQGVVYS 185
+S + L L L S I G E ++R +ALL G+
Sbjct: 150 SSAYRMREALIPLLRNFELKL----SKNKIVGEEYRIRYLIALL-------YSKFGIKVY 198

Query: 186 RLDPGSYKALVHYFGEEEVLFVQSLLLDMENELSWSLGEPYYVNIFTHILIMMYRNTHGN 245
L K ++H F L S L LS E + F IL+ + H
Sbjct: 199 DLTQQD-KNIIHSF-----LSHSSTHLKTSPWLS----ESFS---FYDILLALSWKRHQF 245

Query: 246 ALSREEDQTRQYDENIF---NVASQMIHKIEQRIAHTLPDDEVWFIYQ-YIISSGVAIDG 301
+++ + + Q + +F ++ IE ++ ++Y YI ++
Sbjct: 246 SVTIPQTRIFQQLKKLFVYDSLKKSSRDIIETYCQLNFSAGDLDYLYLIYITANNSFASL 305

Query: 302 Q---KDVSIISHMQASNEA-RLITWRLITVFSDIVD---------CDFSEDSALYDGLLV 348
Q + + + N+ RL+ +IT+ ++ + FS+ S L++ L
Sbjct: 306 QWTPEHIRQCCQLFEENDTFRLLLNPIITLLPNLKEQKASLVKALMFFSK-SFLFN--LQ 362

Query: 349 HIKPLINRLNYRIHIRNPLLEDIKAELADVWRLTQYVVNQVFKTWGENAVSEDEVGYLTV 408
H P N + N L + + W + K G+ ++
Sbjct: 363 HFIPETNLFVSPYYKGNQKLYTSLKLIVEEW---------MAKLPGKRYLNHKHFHLFCH 413

Query: 409 HFQAAMERQIARKRVLLVCSTGIGTSHLLKSRILRAFPEWTI---VDVISAANLSQVLPD 465
+ + + V+ V S I +HLL R F + +I + N+ Q+
Sbjct: 414 YVEQILRNIQPPLVVVFVASNFI-NAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDL 472

Query: 466 NIELIISTINL 476
+L+I+ L
Sbjct: 473 KPDLVITHSQL 483


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2790  TCRTETA  39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 3e-05
Identities = 34/208 (16%), Positives = 72/208 (34%), Gaps = 13/208 (6%)

Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 85
+ ++ + + L+ P + +DL S V G + + A + + + RR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 86 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 145
V+++ + +++ A +L IGR G+ G A++ + + +
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132

Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAAMG----VLCIFWIIKSLPSLPGE 201
+ +V LG +G F AAAA+ + F + +S
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191

Query: 202 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 229
+ N + M + A+ F
Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_2787  UREASE  38     1e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.2 bits (89), Expect = 1e-04
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71
V+R D +I N ILD + G + I +K IA +G A D P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2786  TCRTETB  34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2785  TCRTETB  40     1e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.9 bits (93), Expect = 1e-05
Identities = 64/408 (15%), Positives = 135/408 (33%), Gaps = 60/408 (14%)

Query: 30 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 87
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 88 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 144
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 145 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAASALHYGWRAGMMIAGCMAIVVGIFLC 203
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 204 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 263
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 264 YVV-----RAAINDWGN-----------LYMSEMLGVDLVTANTAVTMFELGGFIGALVA 307
+++ R + + + + + V ++ + + A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 308 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 366
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 367 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 396
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_2784  PF06580  40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 57/142 (40%), Gaps = 11/142 (7%)

Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 425 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPDSGQ-HGFGLTGMRERVTALG 478
+KH + L+G + + + L +E+ GS ++ + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 479 G---TLTISCLHG-TRVSVSLP 496
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_2783  HTHFIS  61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGCGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


50  APECO1_2745  APECO1_2727  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2745  1       13       -4.161121   6-phosphogluconate phosphatase
APECO1_2744  1       13       -3.760585   hypothetical protein
APECO1_2743  1       14       -3.966595   6-phosphogluconolactonase
APECO1_2742  0       12       -3.710674   xylanase
APECO1_2741  -2      12       -3.438289   carbohydrate-specific outer membrane porin,
APECO1_2740  -1      11       -1.440217   6-phospho-beta-glucosidase
APECO1_2739  -2      14       -0.148722   PTS system beta-glucoside-specific transporter
APECO1_2738  -3      22       0.469202    transcriptional antiterminator BglG
APECO1_2737  -1      30       1.943096    transcriptional regulator PhoU
APECO1_2736  -1      28       2.008430    phosphate transporter ATP-binding protein
APECO1_2735  -2      25       2.054569    phosphate transporter permease subunit PtsA
APECO1_2734  2       29       1.870978    phosphate transporter permease subunit PstC
APECO1_2733  2       29       1.662996    phosphate ABC transporter substrate-binding
APECO1_2732  3       25       1.842938    glucosamine--fructose-6-phosphate
APECO1_2731  4       25       1.892073    bifunctional N-acetylglucosamine-1-phosphate
APECO1_2730  4       29       1.534709    hypothetical protein
APECO1_2729  4       27       1.473850    F0F1 ATP synthase subunit beta
APECO1_2728  3       25       0.525323    F0F1 ATP synthase subunit gamma
APECO1_2727  2       22       1.026056    hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_2731  RTXTOXINA  29     0.048    Gram-negative bacterial RTX toxin determinant A family...
		>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature.

Length = 1024

Score = 29.2 bits (65), Expect = 0.048
Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%)

Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426
LGD + D V + AG+ N G DV T G AT A T
Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665

Query: 427 VTRNVGENALAISRVPQTQK 446
VTR +G + + V + Q+
Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685


51  APECO1_2704  APECO1_2699  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2704  -2      15       3.002245    ATP-dependent protease
APECO1_2703  -1      19       4.092367    acetolactate synthase 2 catalytic subunit
APECO1_2702  0       25       3.996917    acetolactate synthase 2 regulatory subunit
APECO1_2701  0       26       4.053389    branched-chain amino acid aminotransferase
APECO1_2700  1       22       3.918403    dihydroxy-acid dehydratase
APECO1_2699  2       19       3.357797    threonine dehydratase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_2704  HTHFIS  35     8e-04    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 34.8 bits (80), Expect = 8e-04
Identities = 40/196 (20%), Positives = 62/196 (31%), Gaps = 51/196 (26%)

Query: 180 KHALEHPKPTNAVSRALQHDLSDVVGQEQG----KRGLEITAAGGHNLLLIGPPGTGKTM 235
AL PK + D +VG+ R L L++ G GTGK +
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175

Query: 236 LASRINGLLPDLSNEEALESAAILSLVNAESVQKQWRQRPFRSPHHSA--------SLTA 287
+A ++ R PF + + +A L
Sbjct: 176 VARALHDYGK-------------------------RRNGPFVAINMAAIPRDLIESELFG 210

Query: 288 MVGG---GAIP-GPGEISLAHNGVLFLDEL----PEFERRTLDALREPIESGQIHLSRTR 339
G GA G A G LFLDE+ + + R L L++ G+
Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GEYT--TVG 264

Query: 340 AKITYPARFQLVAAMN 355
+ + ++VAA N
Sbjct: 265 GRTPIRSDVRIVAATN 280


52  APECO1_2668  APECO1_2659  Y  N  N  Genomic Island
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2668  -2      20       4.065970    hypothetical protein
APECO1_2667  -2      18       3.635729    diaminopimelate epimerase
APECO1_2666  -2      17       2.592566    hypothetical protein
APECO1_2665  -2      17       2.236900    site-specific tyrosine recombinase XerC
APECO1_2664  -1      14       0.484499    flavin mononucleotide phosphatase
APECO1_2663  0       11       -2.677263   DNA-dependent helicase II
APECO1_2662  -1      13       -6.391206   hypothetical protein
APECO1_2661  0       14       -6.219484   hypothetical protein
APECO1_2660  0       13       -5.244251   magnesium/nickel/cobalt transporter CorA
APECO1_2659  0       14       -3.761169   hypothetical protein
53  APECO1_2605  APECO1_2588  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag     DNBias  CDNBias  %GCBias     Product
APECO1_2605  -1      21       -5.405650   **molybdopterin-guanine dinucleotide biosynthesis
APECO1_2604  -2      20       -5.881299   molybdopterin-guanine dinucleotide biosynthesis
APECO1_2603  -2      14       -3.751496   hypothetical protein
APECO1_2602  -2      13       -3.227840   serine/threonine protein kinase
APECO1_2601  -1      13       -2.928988   protein disulfide isomerase I
APECO1_2600  0       11       -2.097014   hypothetical protein
APECO1_2599  0       15       0.748611    acyltransferase
APECO1_2598  0       15       1.965349    DNA polymerase I
APECO1_2597  0       18       2.448145    ribosome biogenesis GTP-binding protein YsxC
APECO1_2596  3       24       2.370364    hypothetical protein
APECO1_2595  2       21       2.069630    coproporphyrinogen III oxidase
APECO1_2594  1       17       1.705753    nitrogen regulation protein NR(I)
APECO1_2593  1       16       0.663818    nitrogen regulation protein NR(II)
APECO1_2592  1       18       0.204111    glutamine synthetase
APECO1_2591  -1      14       0.122926    GTP-binding protein
APECO1_2590  1       22       -0.860515   regulatory protein
APECO1_2589  1       23       -1.708488   hypothetical protein
APECO1_2588  2       22       -1.777235   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
APECO1_2596  SECA  31     0.002    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.0 bits (70), Expect = 0.002
Identities = 11/71 (15%), Positives = 30/71 (42%)

Query: 14 AKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73
+K + + EE+++ + R+ + +R ++ + + + ++G P P
Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886

Query: 74 LGVAEKVTKQH 84
G +K + H
Sbjct: 887 CGSGKKYKQCH 897


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_2594  HTHFIS  602    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 602 bits (1553), Expect = 0.0
Identities = 206/478 (43%), Positives = 299/478 (62%), Gaps = 11/478 (2%)

Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60
M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120
+ LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180
+ + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A
Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179

Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240
LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE
Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239

Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300
IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R
Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299

Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360
LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L
Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358

Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESNVPESTSHMQPDSWATLLAQWADRALRS---- 416
EN R LT + + + + EL S + ++Q + +R
Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418

Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469
L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+
Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_2591  TCRTETOQM  180    4e-51    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 180 bits (458), Expect = 4e-51
Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%)

Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61
K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121
+ +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+
Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121

Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159
I INK+D+ G V + + L+ N+ T+
Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181

Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187
FP+ + SA N I G+D+ L
Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231

Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247
+ I + + ++ +++Y+ + R+ G + V I + E
Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289

Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307
K+ ++ + E + D A +G+IV + L ++ + DT+ + + P +
Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346

Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367
+ + D L LR +S G++ +
Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397

Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391
V ++ + E+ + P VI+ E
Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422



Score = 32.5 bits (74), Expect = 0.005
Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%)

Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457
EPY + + +++ + ++ + V L IP+R + +RS+
Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595

Query: 458 MTSGTGLLYSTFSHY 472
T+G + + Y
Sbjct: 596 FTNGRSVCLTELKGY 610


54  APECO1_2538  APECO1_2524  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2538   2       12       2.991940    ATP-dependent protease peptidase subunit
APECO1_2537   2       11       3.212342    cell division protein FtsN
APECO1_2536   0       14       3.826708    DNA-binding transcriptional regulator CytR
APECO1_2535   0       12       2.549745    primosome assembly protein PriA
APECO1_2534   -2      10       -0.458222   peptidoglycan peptidase
APECO1_2533   -1      11       -2.502728   transcriptional repressor protein MetJ
APECO1_2532   -1      13       -2.816176   cystathionine gamma-synthase
APECO1_2531   -1      12       -3.771726   bifunctional aspartate kinase II/homoserine
APECO1_2530   0       19       -6.453427   nucleoside-specific channel-forming protein Tsx
APECO1_2529   -1      11       -3.310031   hypothetical protein
APECO1_2528   -2      12       -1.872563   hypothetical protein
APECO1_2527   -2      13       0.207286    hypothetical protein
APECO1_2526   -2      14       1.317087    5'-nucleotidase
APECO1_2525   -2      16       2.564969    5,10-methylenetetrahydrofolate reductase
APECO1_2524   -1      18       3.010586    catalase/hydroperoxidase HPI(I)
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_2537   IGASERPTASE   42     2e-06    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 42.0 bits (98), Expect = 2e-06
Identities = 32/155 (20%), Positives = 64/155 (41%), Gaps = 5/155 (3%)

Query: 114 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSR 173
+ +QAD+ P+ E+ ++ P + +AE + Q+S+
Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049

Query: 174 TTEQSWQQQT-RTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 232
T E++ Q T T+Q V + + + A+TQ + T ++ ++ A V +
Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109

Query: 233 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 267
A T + ++ + Q + EQ+ETV+ Q
Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
APECO1_2530   CHANNELTSX   357    e-127    Nucleoside-specific channel-forming protein Tsx signa...
		>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature.

Length = 294

Score = 357 bits (916), Expect = e-127
Identities = 171/262 (65%), Positives = 204/262 (77%), Gaps = 6/262 (2%)

Query: 30 WLHQSLNVIGRTDSRFGPRLTNDLYPEYTVAGRKDWFDFYGYVDLPKFFGVGSHYDVGIW 89
W HQS+NV+G +RFGP++ ND Y EY +KDWFDFYGY+D P FFG G+ GIW
Sbjct: 34 WWHQSVNVVGSYHTRFGPQIRNDTYLEYEAFAKKDWFDFYGYIDAPVFFG-GNSTAKGIW 92

Query: 90 DEGSPLFTEIEPRFSIDKLTGLNLAFGPFKEWFIANNYVYDMGDNQSSRQSTWYMGLGTD 149
++GSPLF EIEPRFSIDKLT +L+FGPFKEW+ ANNY+YDMG N S QSTWYMGLGTD
Sbjct: 93 NKGSPLFMEIEPRFSIDKLTNTDLSFGPFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTD 152

Query: 150 IDTGLPIKLSANIYAKYQWQNYGAANENEWDGYRFKIKYSIPLTNLFGGRLVYNSFTNFD 209
IDTGLP+ LS N+YAKYQWQNYGA+NENEWDGYRFK+KY +PLT+L+GG L Y FTNFD
Sbjct: 153 IDTGLPMSLSLNVYAKYQWQNYGASNENEWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFD 212

Query: 210 FGSDLADKSHNN-----KRTSNAIASSHILSLLYEHWKFAFTLRYFHNGGQWNAGEKVNF 264
+GSDL D + + RTSN+IASSHIL+L Y HW ++ RYFHNGGQW K+NF
Sbjct: 213 WGSDLGDDNFYDLNGKHARTSNSIASSHILALNYAHWHYSIVARYFHNGGQWADDAKLNF 272

Query: 265 GDGPFELKNTGWGTYTTIGYQF 286
GDGPF +++TGWG Y +GY F
Sbjct: 273 GDGPFSVRSTGWGGYFVVGYNF 294


55  APECO1_2389  APECO1_2383  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2389   -2      20       3.000130    LrgB family murein hydrolase regulator
APECO1_2388   -2      23       3.496915    acetate permease
APECO1_2387   -2      22       3.953204    hypothetical protein
APECO1_2386   -2      19       4.500232    acetyl-CoA synthetase
APECO1_2385   -1      16       3.818828    cytochrome c552
APECO1_2384   0       19       4.130507    cytochrome c nitrite reductase pentaheme
APECO1_2383   -1      18       3.488461    formate-dependent nitrite reductase subunit NrfC
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
APECO1_2387   RTXTOXIND   27     0.019    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 26.7 bits (59), Expect = 0.019
Identities = 5/33 (15%), Positives = 13/33 (39%), Gaps = 1/33 (3%)

Query: 18 ELVEKR-QRFATILSIIMLAVYIGFILLIAFAP 49
EL+E R +++ ++ + +L
Sbjct: 47 ELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2383   VACJLIPOPROT   30     0.007    VacJ lipoprotein signature.
		>VACJLIPOPROT#VacJ lipoprotein signature.

Length = 251

Score = 29.9 bits (67), Expect = 0.007
Identities = 6/21 (28%), Positives = 12/21 (57%)

Query: 179 FGNLDDPSSEISQLLRQKPTY 199
GNL++P+ ++ L+ P
Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95


56  APECO1_2360  APECO1_2348  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2360   1       27       6.020246    ribose-5-phosphate isomerase B
APECO1_2359   0       30       7.164808    hypothetical protein
APECO1_2358   -1      31       7.611787    carbon-phosphorus lyase complex accessory
APECO1_2357   0       29       7.531987    aminoalkylphosphonic acid N-acetyltransferase
APECO1_2356   0       27       7.647765    ribose 1,5-bisphosphokinase
APECO1_2355   0       28       7.867517    carbon-phosphorus lyase complex subunit PhnM
APECO1_2354   0       27       8.097333    phosphonates transport ATP-binding protein PhnL
APECO1_2353   -1      29       8.534287    phosphonate C-P lyase system protein PhnK
APECO1_2352   -1      31       8.636819    carbon-phosphorus lyase complex subunit PhnJ
APECO1_2351   1       35       7.639097    carbon-phosphorus lyase complex subunit PhnI
APECO1_2350   1       38       7.460877    carbon-phosphorus lyase complex subunit
APECO1_2349   1       37       6.486378    carbon-phosphorus lyase complex subunit PhnG
APECO1_2348   1       33       5.182623    phosphonate metabolism transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2357   SACTRNSFRASE   33     3e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 32.6 bits (74), Expect = 3e-04
Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 5/84 (5%)

Query: 50 HLALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAG 109
L L+ +G I + + N I+++ V R VG+ LL A E A++
Sbjct: 68 FLYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122

Query: 110 AEMTELSTNVKRHDAHRFYLREGY 133
L T A FY + +
Sbjct: 123 FCGLMLETQDINISACHFYAKHHF 146


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2354   PF05272   29     0.019    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.019
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 8/70 (11%)

Query: 69 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAPARKVVEI------RK 122
VVL G G GKSTL+ +L + I G + + + E+ R+
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655

Query: 123 TTVGWVSQFL 132
V F
Sbjct: 656 ADAEAVKAFF 665


57  APECO1_2329  APECO1_2301  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2329   2       14       -2.913593   anaerobic C4-dicarboxylate transporter
APECO1_2328   -1      12       -2.981281   DcuR family transcriptional regulator
APECO1_2327   -1      13       -3.396783   sensory histidine kinase DcuS
APECO1_2326   0       21       -3.563203   acyltransferase
APECO1_2325   1       19       -4.432664   hypothetical protein
APECO1_2324   1       18       -3.956013   lysyl-tRNA synthetase
APECO1_2323   1       17       -3.769840   peptide transporter
APECO1_2322   3       21       -3.994989   lysine decarboxylase 1
APECO1_2321   4       19       -2.917992   lysine/cadaverine antiporter
APECO1_2320   3       19       -2.499414   CadC family transcriptional regulator
APECO1_2319   6       28       2.452630    hypothetical protein
APECO1_2318   5       26       2.102990    hypothetical protein
APECO1_2317   6       27       1.762425    hypothetical protein
APECO1_2316   6       24       1.481620    radC-like protein YeeS
APECO1_2315   3       25       -4.008816   hypothetical protein
APECO1_2314   3       24       -4.228751   hypothetical protein
APECO1_2313   3       25       -4.709849   hypothetical protein
APECO1_2312   5       27       -4.642410   transcriptional regulator
APECO1_2311   4       27       -5.149954   GTPase
APECO1_2310   4       31       -6.831832   hypothetical protein
APECO1_2309   2       25       -2.033887   hypothetical protein
APECO1_2308   1       19       -1.255545   hypothetical protein
APECO1_2306   0       18       -3.495968   transposase InsC for insertion element
APECO1_2305   -1      15       -4.868132   cation/multidrug efflux pump protein
APECO1_2304   -1      17       -4.678881   transcriptional repressor
APECO1_2303   -1      14       -3.434066   hypothetical protein
APECO1_2302   -1      15       -2.873599   S-adenosylmethionine synthetase
APECO1_2301   0       18       -3.873068   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
APECO1_2328   HTHFIS   68     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 2e-15
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 4/109 (3%)

Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63
+L+ DDDA + + + +++ G+ S I + DL++ D+ M EN
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 64 GLDLLPVLHNARCKSDVIVISSAADAVTIKDSLHYGVVDYLIKPFQASR 112
DLLP + AR V+V+S+ +T + G DYL KPF +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2327   PF06580   41     8e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 8e-06
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499
L+EN ++ + GG+I + +G + EV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535
G GL V+++++ L G I + + G V IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2326   SACTRNSFRASE   26     0.012    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.012
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2323   TCRTETA   30     0.028    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.028
Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%)

Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102
H L + YA P+LG +DR G R ++ + + ++ + L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161
Y+ + G+ + + + D D R F + A G +A P+ GL
Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220
+ H F A + L FL+G + +++ L L + W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLL 230
+A + +
Sbjct: 212 VAALMAVFFI 221


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2320   SYCDCHAPRONE   36     8e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.4 bits (84), Expect = 8e-05
Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 7/97 (7%)

Query: 391 PLDEKQLAALNTEIDNIVTLPELNNLS-----IIYQIKAVSALVKGKTDESYQAINTGID 445
++ A+ + + T+ LN +S +Y + A + GK +++++
Sbjct: 6 TDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSL-AFNQYQSGKYEDAHKVFQALCV 64

Query: 446 LEMSWLNYVL-LGKVYEMKGMNREAADAYLTAFNLRP 481
L+ + L LG + G A +Y +
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2306   PF06704   25     0.047    DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 25.2 bits (55), Expect = 0.047
Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 6/85 (7%)

Query: 28 RTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAA-GEQVVPAS 86
+ + + +S + SL A Q+GV A L+ Q E ++ + E V+
Sbjct: 2 NNSPTDFSRLIKSLGAQLGTSLTA-QNGVCA----LYDSQDNEAAVIEMPDHSEMVIFHC 56

Query: 87 ELAAAMKQIKELQRLLNKTPDVSRL 111
+ + + +LQ+LL+ DV+R+
Sbjct: 57 RVGRSPDRAADLQKLLSLNFDVARM 81


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2305   ACRIFLAVINRP   411    e-137    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 411 bits (1057), Expect = e-137
Identities = 202/339 (59%), Positives = 270/339 (79%), Gaps = 1/339 (0%)

Query: 1 MIQARNQLLAEAAKSPA-LNMVRPNGMNDEPQFQILIDDEKVQAFKLSMSDVDNIMSAAW 59
+ QARNQLL AA+ PA L VRPNG+ D QF++ +D EK QA +S+SD++ +S A
Sbjct: 694 LTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTAL 753

Query: 60 GSMYVNDFNDRGRVKKVYIQGEPGSRISPQDFDKWYVRNSDGDMVSFASFATGKWIYGSP 119
G YVNDF DRGRVKK+Y+Q + R+ P+D DK YVR+++G+MV F++F T W+YGSP
Sbjct: 754 GGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSP 813

Query: 120 KLEQYNGISAVEILGEPAPGYSSGDAMKAIEDIAARLPEGFHISWTGLSFEERLSGSQAP 179
+LE+YNG+ ++EI GE APG SSGDAM +E++A++LP G WTG+S++ERLSG+QAP
Sbjct: 814 RLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAP 873

Query: 180 ALYALSLLIVFLCLAALYESWSIPFSVMLVVPLGVLGAVCATLLRGLGNDVFFQVGLLTT 239
AL A+S ++VFLCLAALYESWSIP SVMLVVPLG++G + A L NDV+F VGLLTT
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 240 IGLSAKNAILIVEFARELHEKEGLSIKEAAVEAARVRLRPIIMTSLAFVMGVIPLAVSTG 299
IGLSAKNAILIVEFA++L EKEG + EA + A R+RLRPI+MTSLAF++GV+PLA+S G
Sbjct: 934 IGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 300 ASSGSKHAIGTGVVGGMITATILAIFYIPLFYMLIAGFF 338
A SG+++A+G GV+GGM++AT+LAIF++P+F+++I F
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032



Score = 77.2 bits (190), Expect = 1e-17
Identities = 64/322 (19%), Positives = 125/322 (38%), Gaps = 21/322 (6%)

Query: 29 EPQFQILIDDEKVQAFKLSMSDVDNIMSAA----WGSMYVNDFNDRGRVKKVYIQGEPGS 84
+ +I +D + + +KL+ DV N + G+ I +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQ-TR 239

Query: 85 RISPQDFDKWYVR-NSDGDMVSFASFAT---GKWIYGSPKLEQYNGISAVEILGEPAPGY 140
+P++F K +R NSDG +V A G Y + + NG A + + A G
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV--IARINGKPAAGLGIKLATGA 297

Query: 141 SSGDAMKAI----EDIAARLPEGFHISW---TGLSFEERLSGSQAPALYALSLLIVFLCL 193
++ D KAI ++ P+G + + T + + A+ ++VFL +
Sbjct: 298 NALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI--MLVFLVM 355

Query: 194 AALYESWSIPFSVMLVVPLGVLGAVCATLLRGLGNDVFFQVGLLTTIGLSAKNAILIVEF 253
++ + VP+ +LG G + G++ IGL +AI++VE
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 254 ARELHEKEGLSIKEAAVEAARVRLRPIIMTSLAFVMGVIPLAVSTGASSGSKHAIGTGVV 313
+ ++ L KEA ++ ++ ++ IP+A G++ +V
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 314 GGMITATILAIFYIP-LFYMLI 334
M + ++A+ P L L+
Sbjct: 476 SAMALSVLVALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2304   HTHTETR   70     4e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 4e-18
Identities = 33/63 (52%), Positives = 41/63 (65%)

Query: 10 RHTKFAAEETRKQILDVAEFCFCETGFSKTTLEMIAARAGCTRGAIYWYFNEKKDLLRQV 69
R TK A+ETR+ ILDVA F + G S T+L IA AG TRGAIYW+F +K DL ++
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 IER 72
E
Sbjct: 63 WEL 65


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
APECO1_2301   HTHFIS   59     6e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 6e-12
Identities = 29/139 (20%), Positives = 53/139 (38%), Gaps = 3/139 (2%)

Query: 3 TIVIVEDEPIELESLRQIISQCVENAAIHEASTGKKAIHLIDQLSQIDMILVDINIPLPN 62
TI++ +D+ L Q +S+ + S I D+++ D+ +P N
Sbjct: 5 TILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPDEN 61

Query: 63 GKQVIEYLKKKNSDTKIIVITANDDFDIVRSMYNLKVDDYLLKPVKKCILTDTIKKTLAF 122
++ +KK D ++V++A + F DYL KP L I + LA
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 DEGENEKSRALKQKVFAMI 141
+ K Q ++
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140


58  APECO1_2282  APECO1_2254  Y  N  Y  Genomic Island
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2282   3       31       -4.390321   iron-containing alcohol dehydrogenase
APECO1_2281   8       43       -7.010809   hypothetical protein
APECO1_2280   8       44       -7.529312   hypothetical protein
APECO1_2279   8       47       -8.009958   hemolysin activator protein
APECO1_2278   9       50       -8.545331   hypothetical protein
APECO1_2277   8       49       -9.206144   hypothetical protein
APECO1_2276   8       39       -7.411605   KAP family protein
APECO1_2275   6       36       -6.328532   hypothetical protein
APECO1_2274   5       33       -6.558065   hypothetical protein
APECO1_2273   5       28       -5.291073   TatD-related deoxyribonuclease
APECO1_2272   3       24       -4.032296   hypothetical protein
APECO1_2271   4       24       -3.106051   hypothetical protein
APECO1_2270   0       17       -2.177681   transposase insF for insertion sequence
APECO1_2269   1       15       -1.402667   transposase InsN for insertion sequence element
APECO1_2268   2       15       -1.522543   hypothetical protein
APECO1_2267   2       15       -1.520872   IS1 InsB protein
APECO1_2266   2       16       -1.304226   IS1 InsA protein
APECO1_2265   2       15       -2.224409   antiporter
APECO1_2264   1       16       -2.486958   hypothetical protein
APECO1_2263   1       22       -3.511414   amino acid antiporter
APECO1_2262   -1      26       -2.395945   IS1 InsA protein
APECO1_2261   -1      26       -2.395945   IS1 InsB protein
APECO1_2260   1       29       -4.577454   regulator PapX protein
APECO1_2259   1       27       -2.713271   hypothetical protein
APECO1_2258   1       22       -1.513696   hypothetical protein
APECO1_2257   1       16       -0.761096   IS1 InsB protein
APECO1_2256   1       16       -0.775278   IS1 InsA protein
APECO1_2255   2       18       -1.231713   ShiA-like protein
APECO1_2254   2       19       -0.501046   Int (P4-type integrase)
59  APECO1_2230  APECO1_2211  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2230   -1      13       3.397302    hypothetical protein
APECO1_2229   -1      13       2.725988    hypothetical protein
APECO1_2228   -1      13       2.793565    ribosome-associated GTPase
APECO1_2227   -1      12       3.382264    oligoribonuclease
APECO1_2225   -1      13       3.195936    ***electron transport protein YjeS
APECO1_2224   -1      12       3.207825    hypothetical protein
APECO1_2223   -1      13       2.574457    ATPase
APECO1_2222   0       13       3.130821    N-acetylmuramoyl-l-alanine amidase II
APECO1_2221   2       14       2.801838    DNA mismatch repair protein
APECO1_2220   4       21       2.185582    tRNA delta(2)-isopentenylpyrophosphate
APECO1_2219   5       23       1.922562    RNA-binding protein Hfq
APECO1_2218   5       21       2.486482    GTPase HflX
APECO1_2217   5       22       2.552186    FtsH protease regulator HflK
APECO1_2216   4       20       1.163391    FtsH protease regulator HflC
APECO1_2215   3       17       1.023241    adenylosuccinate synthetase
APECO1_2214   4       13       0.068615    transcriptional repressor NsrR
APECO1_2213   4       12       -0.296320   exoribonuclease R
APECO1_2212   2       17       -2.926107   23S rRNA (guanosine-2'-O-)-methyltransferase
APECO1_2211   2       17       -3.083841   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
APECO1_2230   GPOSANCHOR   53     4e-09    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 53.1 bits (127), Expect = 4e-09
Identities = 50/312 (16%), Positives = 106/312 (33%), Gaps = 18/312 (5%)

Query: 121 SRQAQQEQERAREIADSLNQLPQQQTDARRQLNEIERRLGTLTGNTPLNQAQNFALQSDS 180
+ ++ QERA + N L + +D ++ LT L+ +
Sbjct: 49 TDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTE---ELSNAKEKLRKND 105

Query: 181 ARLKALVDEL-ELAQLSANNRQELARLRSELAEKES--QQLDAYLQALRNQLNSQRQQEA 237
L ++ EL A+ + L + + + L+A AL + + ++
Sbjct: 106 KSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAAR-KADLEKAL 164

Query: 238 ERALESTEQLAESSADLPKDIVAQFKINRELSAALNQQAQRMDLVASQQRQAASQTLQVR 297
E A+ + + L + A EL AL +++ + ++ +
Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224

Query: 298 QALNTLREQSQWLGSSNLLGEALRAQVARLPEMPKPQQLDTEMAQLRVQRLRYEDLLNKQ 357
L + L + A A++ L + L+ A+L +
Sbjct: 225 ARKADLEKA---LEGAMNFSTADSAKIKTLEA--EKAALEARQAELEKALEGAMNFSTAD 279

Query: 358 PLLRQIHQADGQPLTAE------QNRILEAQLRTQRELLNSLLQGGDTLLLELTKLKVSN 411
+ +A+ L AE Q+++L A ++ R L++ + L E KL+ N
Sbjct: 280 SAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQN 339

Query: 412 GQLEDALKEVNE 423
E + + +
Sbjct: 340 KISEASRQSLRR 351



Score = 40.4 bits (94), Expect = 4e-05
Identities = 52/257 (20%), Positives = 93/257 (36%), Gaps = 59/257 (22%)

Query: 20 ATAPDSKQITQELEQAKAAKPAQPEVVEALQSALNALEERKGSLERIKQYQEVIDNYPKL 79
A A + + LE A A ++ L++ ALE R+ LE+ +
Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM------NF 275

Query: 80 SATLRAQLNNMRDEPRSVSPGMSNDALNQEILQISS--QLLDKSRQAQQEQERAREIADS 137
S A++ + E AL E + Q+L+ +RQ+ + A
Sbjct: 276 STADSAKIKTLEAE---------KAALEAEKADLEHQSQVLNANRQSLRRDLDA------ 320

Query: 138 LNQLPQQQTDARRQLNEIERRLGTLTGNTPLNQAQNFALQSDSARLKALVDELELAQLSA 197
+R ++E L N+ + QS L A
Sbjct: 321 ----------SREAKKQLEAEHQKL---EEQNKISEASRQSLRRDLDAS----------- 356

Query: 198 NNRQELARLRSEL--AEKESQQLDAYLQALRNQLNSQRQQEAERALESTEQLAESSADLP 255
R+ +L +E E++++ +A Q+LR L++ R EA++ +E + A S
Sbjct: 357 --REAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASR--EAKKQVEKALEEANSKLA-- 410

Query: 256 KDIVAQFKINRELSAAL 272
A K+N+EL +
Sbjct: 411 ----ALEKLNKELEESK 423


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits   Score  E-value  Comments
APECO1_2218   SECA   32     0.005    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 31.8 bits (72), Expect = 0.005
Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 6/144 (4%)

Query: 259 HVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKIDMLDDFEPRIDRDEENK-PIRV 317
++D +DV N + IDA+ P L ++ + + R+ D + PI
Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722

Query: 318 WLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 377
WL + L + + + + + + R + LQ ++ W E ++
Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782

Query: 378 SLQVRMPIVDWRRLCKQEPALIDY 401
+R I R +++P +Y
Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2217   cloacin   32     0.006    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 31.6 bits (71), Expect = 0.006
Identities = 25/81 (30%), Positives = 30/81 (37%), Gaps = 10/81 (12%)

Query: 17 GSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGP---- 72
S G +SE N GG G G GGG GTG G S+ P
Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG-GNLSAVAAPVAFG 91

Query: 73 -----RPQLGGRVVTIAAAAI 88
P GG V+I+A A+
Sbjct: 92 FPALSTPGAGGLAVSISAGAL 112


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits        Score  E-value  Comments
APECO1_2213   RTXTOXIND   31     0.027    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.6 bits (69), Expect = 0.027
Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%)

Query: 179 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 232
+VP+D L L+ I +G ++++ P R + GK+ + D +
Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413


60  APECO1_2129  APECO1_2115  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2129   1       26       -6.669934   L-idonate and D-gluconate transporter
APECO1_2128   1       27       -7.388578   gluconate 5-dehydrogenase
APECO1_2127   1       28       -7.366146   L-idonate 5-dehydrogenase
APECO1_2126   1       29       -7.877711   D-gluconate kinase
APECO1_2125   1       29       -7.993333   oxidoreductase
APECO1_2124   1       35       -10.181691  *hypothetical protein
APECO1_2123   1       27       -6.656920   hypothetical protein
APECO1_2122   2       30       -7.165728   hypothetical protein
APECO1_2121   1       30       -7.356684   N-acetylneuraminic acid mutarotase
APECO1_2120   1       30       -6.155230   hypothetical protein
APECO1_2119   1       31       -5.739602   hypothetical protein
APECO1_2118   1       27       -4.142875   tyrosine recombinase
APECO1_2117   1       27       -3.505631   tyrosine recombinase
APECO1_2116   2       24       -3.050443   type 1 fimbriae major subunit FimA
APECO1_2115   3       24       -2.895367   FimI fimbrial protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2128   DHBDHDRGNASE   145    1e-44    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 145 bits (366), Expect = 1e-44
Identities = 86/256 (33%), Positives = 133/256 (51%), Gaps = 8/256 (3%)

Query: 7 LAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVEKLNQEGIQAVAAPFN 66
+ GK ITG+AQGIG +A L GA I D E+ E V L E A A P +
Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65

Query: 67 VTHKHEIDAAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQ 126
V ID IE+++GPID+LVN AG+ R ++EW +VN T VF S+
Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125

Query: 127 AVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGI 186
+V+++M++R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N +
Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185

Query: 187 APGYFKTEMTKALVEDE--------AFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDF 238
+PG +T+M +L DE P + P ++ A +FL S +
Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245

Query: 239 VNGHLLFVDGGMLVAV 254
+ H L VDGG + V
Sbjct: 246 ITMHNLCVDGGATLGV 261


61  APECO1_2104  APECO1_2094  Y  Y  N  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2104   2       23       -4.001738   dihydroxyacetone kinase subunit DhaL
APECO1_2103   2       23       -3.625424   dihydroxyacetone kinase subunit DhaK
APECO1_2102   1       22       -3.572763   glycerol dehydrogenase
APECO1_2101   1       25       -4.085547   transporter
APECO1_2100   1       27       -3.936912   dihydrolipoamide dehydrogenase
APECO1_2099   0       31       -6.621160   carnitine transporter CniT
APECO1_2098   0       31       -6.418540   glycerate kinase
APECO1_2097   0       31       -7.291064   3-hydroxyisobutyrate dehydrogenase
APECO1_2096   0       29       -7.059263   regulatory protein GclR
APECO1_2095   -1      23       -5.013200   glyoxylate carboligase
APECO1_2094   -1      20       -4.819980   DNA-binding transcriptional regulator DhaR
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits      Score  E-value  Comments
APECO1_2101   TCRTETA   37     1e-04    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 36.7 bits (85), Expect = 1e-04
Identities = 58/324 (17%), Positives = 106/324 (32%), Gaps = 25/324 (7%)

Query: 39 TGATNAELGFLMTAYGLVNFLLYLPGGWAADRFSARKLMTFSLISTGISGFYYATFPSYT 98
+ A G L+ Y L+ F G +DRF R ++ SL + AT P
Sbjct: 38 SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW 97

Query: 99 MICLLHALWAVTTVFTFWAVCVRIIRTLGTSEEQGRLYGYWFLGKGLTSIVLGFLSVPVF 158
++ + + +T T AV I + +E+ R +G+ + G ++ PV
Sbjct: 98 VLYIGRIVAGITGA-TG-AVAGAYIADITDGDERARHFGF--MS---ACFGFGMVAGPVL 150

Query: 159 AKFGEGVDGLRATIIFYSVVTILAGVLAWFVCQDETHSEDKANFRLADMAF-----VLKM 213
G A + + L + F+ + E + R A M
Sbjct: 151 GGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGM 209

Query: 214 PTVWLAGVVTFCMWSI-YIGFGMVTPYLTQILHMGESEVAVASILRAYVLFAMGGLIGGQ 272
V V F M + + + + H + + ++ + L
Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGIS----LAAFGILHSLAQAM 265

Query: 273 LADRCASRTRFMIYAFIGMIVFTTVYFFLP--GESRYVTIALANMVALGVFIYSANAVFF 330
+ A+R +GMI T Y L + + + G+ + + A+
Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLS 325

Query: 331 SIIDEVRIPAKVTGTAAGLISLLT 354
+DE R G G ++ LT
Sbjct: 326 RQVDEER-----QGQLQGSLAALT 344


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits     Score  E-value  Comments
APECO1_2094   HTHFIS   223    8e-68    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 223 bits (570), Expect = 8e-68
Identities = 81/355 (22%), Positives = 156/355 (43%), Gaps = 41/355 (11%)

Query: 327 RKIAQQQISTNANFTFDSLHAASGGMKQVLLIARRAIKSISPILINGEEGVGKLSLAMAI 386
K ++ ++ L S M+++ + R +++ ++I GE G GK +A A+
Sbjct: 122 PKRRPSKLEDDSQ-DGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180

Query: 387 HNESEQRDGPFISVDCQMLSPENILHELLGSDVG-------PSPSKFELAHNGTLYLDKV 439
H+ ++R+GPF++++ + + I EL G + G S +FE A GTL+LD++
Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240

Query: 440 EYLSGEVQSVLLKVLKTGLVTRSDSHRLIPVRFRLITCTSSSLREYVQQGAFSRQLYYEI 499
+ + Q+ LL+VL+ G T I R++ T+ L++ + QG F LYY +
Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300

Query: 500 SMNEIEIPPLRKRREALKQMIDDIIDKYQERTRKKMTITPDANSVLLEYRWPGNISEFKN 559
++ + +PPLR R E + ++ + + ++ +A ++ + WPGN+ E +N
Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELEN 360

Query: 560 RMEKVFINCNRLVLGLENIPLDIRQN-----NSSGDDDIPHLT----------------- 597
+ ++ + V+ E I ++R L+
Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420

Query: 598 -----------SLAELEMQAIEHTCRVCEWNLTKAAEVLKIGRTTLWRKLKIYNL 641
LAE+E I N KAA++L + R TL +K++ +
Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


62  APECO1_2056  APECO1_2012  Y  Y  Y  Pathogenicity Island (biased-composition)
LocusTag      DNBias  CDNBias  %GCBias     Product
APECO1_2056   0       22       -5.113261   hypothetical protein
APECO1_2055   0       20       -4.641526   ***16S ribosomal RNA m2G1207 methyltransferase
APECO1_2053   2       26       -6.833222   ribosomal-protein-alanine N-acetyltransferase
APECO1_20522  2       27       -7.255520   nucleotidase
APECO1_2052   3       30       -8.328617   hypothetical protein
APECO1_2051   2       28       -8.219613   hypothetical protein
APECO1_2050   2       19       -4.958222   hypothetical protein
APECO1_2049   2       23       -5.832009   hypothetical protein
APECO1_2048   2       21       -4.994819   hypothetical protein
APECO1_2047   3       22       -4.290520   hypothetical protein
APECO1_2046   5       24       -3.387460   hypothetical protein
APECO1_2045   5       26       -2.784748   regulatory protein
APECO1_2044   4       26       -1.005566   hypothetical protein
APECO1_2043   5       25       -0.003411   hypothetical protein
APECO1_2042   5       24       -0.259655   antirepressor
APECO1_2041   5       23       1.550386    replication protein
APECO1_2040   4       23       1.202619    hypothetical protein
APECO1_2039   3       22       0.022236    hypothetical protein
APECO1_2038   2       23       -0.293686   bacteriophage V crossover junction
APECO1_2037   2       22       -1.495489   hypothetical protein
APECO1_2036   2       24       -2.077564   hypothetical protein
APECO1_2035   2       29       -4.425435   lambdoid prophage Qin antitermination protein
APECO1_2034   2       27       -3.853460   hypothetical protein
APECO1_2033   3       20       -0.212054   Qin prophage; lysozyme
APECO1_2032   3       19       0.931113    hypothetical protein
APECO1_2031   4       18       1.134781    hypothetical protein
APECO1_2030   4       19       1.544581    hypothetical protein
APECO1_2029   4       20       1.797602    hypothetical protein
APECO1_2028   4       20       2.046465    DNA packaging protein of prophage CP-933K
APECO1_2027   4       21       1.788347    capsid protein
APECO1_2026   5       23       1.096422    protease/scaffold protein
APECO1_2025   6       28       2.243831    hypothetical protein
APECO1_2024   5       28       2.172117    hypothetical protein
APECO1_2023   4       27       2.189283    tail component of prophage CP-933K
APECO1_2022   4       26       2.196767    tail component of prophage CP-933K
APECO1_2021   5       29       2.812981    tail component of prophage CP-933K
APECO1_2020   4       26       3.610264    minor tail protein
APECO1_2019   3       24       4.223064    tail component of prophage CP-933K
APECO1_2018   4       26       5.348775    minor tail protein
APECO1_2017   1       21       2.730635    minor tail protein
APECO1_2016   1       20       2.167786    tail fiber component K of prophage
APECO1_2015   0       18       0.994871    tail component of prophage CP-933K
APECO1_2014   0       18       0.330563    tail component of prophage
APECO1_2013   0       29       -3.933207   outer membrane protein
APECO1_2012   1       33       -5.726111   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits           Score  E-value  Comments
APECO1_2053   SACTRNSFRASE   55     4e-12    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 54.6 bits (131), Expect = 4e-12
Identities = 23/80 (28%), Positives = 35/80 (43%), Gaps = 1/80 (1%)

Query: 62 DEATLFNIAVDPDYQRQGLGRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFN 121
A + +IAV DY+++G+G ALL I+ ++ L LE + N +A Y F
Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147

Query: 122 EATIRRNYYPTTDG-REDAI 140
+ Y E AI
Sbjct: 148 IGAVDTMLYSNFPTANEIAI 167


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits         Score  E-value  Comments
APECO1_2043   FLGHOOKAP1   27     0.038    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 27.2 bits (60), Expect = 0.038
Identities = 11/47 (23%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 76 AVAQSAGGV---FVSLPEIEEVENADINQRLLEVIEQIGSYSKQIRS 119
A+ + G+ F + + ++ +N + ++QI +Y+KQI S
Sbjct: 131 ALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIAS 177


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_2019   cloacin       42     1e-05    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 41.6 bits (97), Expect = 1e-05
Identities = 33/142 (23%), Positives = 61/142 (42%), Gaps = 4/142 (2%)

Query: 522 DQQRLNDLQEKKRQKDLQDAK--EQAERNYQEQQKRRNAENAALNRMNETEAARHQREIA 579
DQ + +E +RQ++ E AERNY+ + N N + R E +A Q +
Sbjct: 294 DQVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVYNS 353

Query: 580 RINAMQYADQAVRDA-AIQRENERYEKALASGKKKTRETRNDEATRLLLQYSQQQAQVEG 638
R + + A++ + DA A ++ R+ +G + + +A R + +QA +
Sbjct: 354 RKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDA 413

Query: 639 QIAAARQSAGIATERMTEAHKQ 660
A + A A E+ K+
Sbjct: 414 -AAKEKSDADAALSSAMESRKK 434


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_2013   ENTEROVIROMP  141    4e-45    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 141 bits (357), Expect = 4e-45
Identities = 68/202 (33%), Positives = 106/202 (52%), Gaps = 34/202 (16%)

Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNVPG-SDDLNGINVKYRYE 58
M+K C + L+A LA + + A+ ST++ GY A ++ G + + G N+KYRYE
Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGY--AQSDAQGQMNKMGGFNLKYRYE 54

Query: 59 FTDT-LGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMA 117
++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y +
Sbjct: 55 EDNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVV 106

Query: 118 GVAYSRVSTFSGDYLRVTDNKGKKHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIA 177
GV Y + T T+ KHD S+ ++GAG+QFNP E+VA+D +
Sbjct: 107 GVGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFS 149

Query: 178 YEGSGSGDWRTDGFIVGVGYKF 199
YE S +I GVGY+F
Sbjct: 150 YEQSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_2012   IGASERPTASE   45     9e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.4 bits (107), Expect = 9e-07
Identities = 38/217 (17%), Positives = 75/217 (34%), Gaps = 12/217 (5%)

Query: 114 EEAARNAEAASQSAAAAKKSETAAASSKNAAKTSETNAANSAQAAASSQTASANSATAAK 173
E + E ++ + K E + A E + + + S +A++ AK
Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173

Query: 174 KSETNAKNSETAAKTSETNAK-----SSQTAAKTSETNAKASETAAKNSQVAAAQSESAA 228
++ +N + T + T T + T A T T S KN + +S
Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHN 1233

Query: 229 AGSATSAAGSATAAANSQKAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQDAAAQSE 288
AT+++ + A ++ TNA S AK + +Q E
Sbjct: 1234 VEPATTSSNDRSTVA--LCDLTSTNTNAVLSDARAKAQFVALNVGKAV----SQHISQLE 1287

Query: 289 SAAAGSASAAAASATASANSQKAAKTSETNAKTSETA 325
G S T+ + +++ ++K+++T
Sbjct: 1288 MNNEG-QYNVWVSNTSMNKNYSSSQYRRFSSKSTQTQ 1323



Score = 42.4 bits (99), Expect = 8e-06
Identities = 29/159 (18%), Positives = 49/159 (30%), Gaps = 14/159 (8%)

Query: 211 ETAAKNSQVAAAQSESAAAGSA--TSAAGSATAAANSQKAAKTSETNAKSSQTAAKTSET 268
E +N V + A S + A +A A S+T +E
Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043

Query: 269 NAKASETAAKNSQDA------------AAQSESAAAGSASAAAASATASANSQKAAKTSE 316
+ + S+T KN QDA A+S A + A S + + +Q
Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103

Query: 317 TNAKTSETAAANSAKASAASQTAAKASEDAAREYASQAA 355
+ E A + K + ++ S + Q
Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ 1142


63   APECO1_1620   APECO1_1610   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1620   0       17       -0.369828  shikimate kinase
APECO1_1619   -1      16       -0.727366  hypothetical protein
APECO1_1618   3       16       0.639340   hypothetical protein
APECO1_1617   2       15       1.257994   hypothetical protein
APECO1_1616   2       15       1.390402   recombination associated protein
APECO1_1615   1       15       1.469271   fructokinase
APECO1_1614   1       12       1.735417   MFS transport protein AraJ
APECO1_1613   0       12       2.004544   exonuclease subunit SbcC
APECO1_1612   0       12       2.200354   exonuclease subunit SbcD
APECO1_1611   -1      13       2.207253   transcriptional regulator PhoB
APECO1_1610   0       13       2.139410   phosphate regulon sensor protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1620   PF05272       28     0.029    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 27.7 bits (61), Expect = 0.029
Identities = 17/68 (25%), Positives = 25/68 (36%), Gaps = 6/68 (8%)

Query: 4 PLFLIGPRGCGKTTVGMALADSLNRRFVDTDLWL----QSQLNMTVAEIVEREEWAGFRA 59
+ L G G GK+T+ L F DT + S + E E FR
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL--DFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRR 655

Query: 60 RETAALEA 67
+ A++A
Sbjct: 656 ADAEAVKA 663


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1615   ACETATEKNASE  29     0.020    Acetate kinase family signature.
		>ACETATEKNASE#Acetate kinase family signature.

Length = 400

Score = 29.4 bits (66), Expect = 0.020
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 10/69 (14%)

Query: 229 FISGTGFATDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 287
+G ++D+R L A + D A+LAL + R+ K++ +
Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323

Query: 288 DVIVLGGGM 296
DVIV G+
Sbjct: 324 DVIVFTAGI 332


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1614   TCRTETA       51     3e-09    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 51.4 bits (123), Expect = 3e-09
Identities = 74/356 (20%), Positives = 126/356 (35%), Gaps = 35/356 (9%)

Query: 17 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 73
+ ++AL G+G+ IM VL L ++ S H +++ YAL AP++
Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66

Query: 74 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 133
S R+ + +LL +A + A+ + +L IGR+V+G GA + I
Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124

Query: 134 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 193
G A G +S ++ P+ L FS F A N + F +P+
Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184

Query: 194 RDEAKGKLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 241
+ LR + + A + F + G W +
Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238

Query: 242 SGFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFFG 298
F A T + L G+ + M++G ++ R R + ++L F
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 299 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 352
I + G+ LQ +L + E G G +A +L S VG
Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1613   IGASERPTASE   39     2e-04    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 2e-04
Identities = 40/264 (15%), Positives = 81/264 (30%), Gaps = 11/264 (4%)

Query: 162 LNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSL 221
A P E E + E + Q S V + + A + + A Q+
Sbjct: 1029 APATPSETTETVAENSK-----QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN 1083

Query: 222 TASLQVLTDEEKQLITAQQQEQQSLNWLTRLD-ELQQEGSRRQQALQQALAEEEKAQPQL 280
+ +E Q ++ +++ E QE + + + E QPQ
Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143

Query: 281 AALSLAQPARNLRPHWE---RIAEYSTALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQ 337
P N++ A+ T +E+ T + + + +
Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203

Query: 338 SAELQQQQQSLNAWLQEHDRLRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLN 397
A Q S ++ ++ R + + ++DR + T+ L+
Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA-TTSSNDRSTVALCDLTSTNTNAVLS 1262

Query: 398 ALAAITLTLTADEVASALAQHAEQ 421
A + + V A++QH Q
Sbjct: 1263 DARAKAQFVALN-VGKAVSQHISQ 1285



Score = 33.9 bits (77), Expect = 0.005
Identities = 27/139 (19%), Positives = 54/139 (38%), Gaps = 13/139 (9%)

Query: 738 QQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQ 797
Q DV + S + A+ D A A E T T E KQ + + Q
Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPP-------APATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 798 TLVTQTAETLTQHQQHRPGGLSLTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDAD 857
TA+ ++ + V E+AQ+ + +E T++ + ++++
Sbjct: 1057 DATETTAQNREVAKEAKS-----NVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111

Query: 858 NRQQQQTLMQQIAQMTQQV 876
+ + + Q++ ++T QV
Sbjct: 1112 AKVETEK-TQEVPKVTSQV 1129


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1611   HTHFIS        95     1e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 94.9 bits (236), Expect = 1e-24
Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 9/149 (6%)

Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63
ILV +D+A IR ++ L + G+ + + + DL++ D ++P +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123
+ +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I +
Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120

Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152
E L D + G
Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1610   PF06580       34     0.001    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 34.1 bits (78), Expect = 0.001
Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%)

Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380
LV N + H P+G I ++ + VE+ G
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306

Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422
+G GL V+ + E+++ + GK +IP
Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348


64   APECO1_1578   APECO1_1565   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1578   0       20       0.034716   muropeptide transporter
APECO1_1577   3       26       -0.152515  hypothetical protein
APECO1_1576   4       27       -0.136216  transcriptional regulator BolA
APECO1_1575   3       28       0.146510   trigger factor
APECO1_1574   1       22       0.355745   ATP-dependent Clp protease proteolytic subunit
APECO1_1573   1       22       0.173017   ATP-dependent protease ATP-binding subunit ClpX
APECO1_1572   0       19       0.121618   DNA-binding ATP-dependent protease La
APECO1_1571   0       14       0.191411   transcriptional regulator HU subunit beta
APECO1_1570   -1      13       0.138013   peptidyl-prolyl cis-trans isomerase
APECO1_1569   -2      17       -0.442933  hypothetical protein
APECO1_1568   -1      14       -0.061349  hypothetical protein
APECO1_1567   0       14       1.302765   queuosine biosynthesis protein QueC
APECO1_1566   0       13       1.162179   hypothetical protein
APECO1_1565   0       13       1.746894   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1578   TCRTETA       39     4e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 4e-05
Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%)

Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121
+F +P++ + F RR LL + V A M W+ + ++A +
Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109

Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181
+ V A+ D+ +ER + GM+ L + ++ A
Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167

Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238
AL + + L PE + P+ + + + A L+ + ++ +G
Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227

Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298
A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++
Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285

Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358
A GY LL+ + + V GG+G A A+L ++ L+
Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341

Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403
AL+++ + VGP+ + A +T+ + + AA+ L L + R
Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1577   PF06291       29     0.006    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 28.9 bits (64), Expect = 0.006
Identities = 12/37 (32%), Positives = 19/37 (51%)

Query: 34 NMFKKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 70
N KK+LF ++ GCA+ T+ PT P++
Sbjct: 4 NKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1573   HTHFIS        29     0.043    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.043
Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%)

Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119
E P+ E + ++G+ A + +Y RL D +++ G
Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167

Query: 120 PTGSGKTLLAETL 132
+G+GK L+A L
Sbjct: 168 ESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1572   GPOSANCHOR    34     0.002    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 34.3 bits (78), Expect = 0.002
Identities = 33/144 (22%), Positives = 67/144 (46%), Gaps = 12/144 (8%)

Query: 195 KQSVLEMSDVNERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQK 254
++ + L A QV R +++ ++ S+ +Q++A +
Sbjct: 277 TADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAK---KQLEAEHQ 333

Query: 255 ELGEMDDAPD-ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDW 312
+L E + + ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D
Sbjct: 334 KLEEQNKISEASRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 390

Query: 313 MVQVPWNARSKVKKDLRQAQEILD 336
+ A+ +V+K L +A L
Sbjct: 391 SRE----AKKQVEKALEEANSKLA 410


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1571   DNABINDINGHU  117    3e-38    Prokaryotic integration host factor signature.
		>DNABINDINGHU#Prokaryotic integration host factor signature.

Length = 91

Score = 117 bits (294), Expect = 3e-38
Identities = 49/88 (55%), Positives = 67/88 (76%)

Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61
NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR
Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62

Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89
NPQTG+EI I A+KVP+F+AGKALKDAV
Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1568   PF08280       27     0.021    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 27.1 bits (60), Expect = 0.021
Identities = 24/138 (17%), Positives = 41/138 (29%), Gaps = 20/138 (14%)

Query: 1 MQTQIKVRGYHLDVYQHVNNARYL-------EFLEEARWHGLENSDSFHWMTAH------ 47
+Q I + Y N Y E++ + N FH +
Sbjct: 361 LQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEWMAKLPGKRYLNHKHFHLFCHYVEQILR 420

Query: 48 ------NIAFVVVN-ININYRRPAVLSDLLTITSQLQQLNGKSGILSQVITLEPEGQVVA 100
+ FV N IN + + + + Q+ L+P+ +
Sbjct: 421 NIQPPLVVVFVASNFINAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDLKPDLVITH 480

Query: 101 DALITFVCIDLKTQKALA 118
LI FV +L A+A
Sbjct: 481 SQLIPFVHHELTKGIAVA 498


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1565   HTHFIS        29     0.020    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 29.0 bits (65), Expect = 0.020
Identities = 12/64 (18%), Positives = 24/64 (37%), Gaps = 10/64 (15%)

Query: 197 LTVLTQHLGLSLRDCMAFGDAMNDREMLGSVGSGFIMGN----------AMPQLRAELPH 246
TVL Q L + D +A + + ++ + +P+++ P
Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75

Query: 247 LPVI 250
LPV+
Sbjct: 76 LPVL 79


65   APECO1_1555   APECO1_1545   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1555   0       13       -1.008608  hypothetical protein
APECO1_1554   2       17       -0.868015  hypothetical protein
APECO1_15532  1       15       -0.247168  maltose O-acetyltransferase
APECO1_1553   1       15       0.390590   Hha protein
APECO1_1552   1       16       0.939629   acridine efflux pump
APECO1_1551   2       13       0.615254   acriflavine resistance protein A precursor
APECO1_1550   2       14       1.697683   DNA-binding transcriptional repressor AcrR
APECO1_1549   3       17       2.352064   potassium efflux protein KefA
APECO1_1548   3       18       4.397134   primosomal replication protein N''
APECO1_1547   4       23       3.089447   hypothetical protein
APECO1_1546   4       27       3.104728   adenine phosphoribosyltransferase
APECO1_1545   3       22       3.012176   DNA polymerase III subunits gamma and tau
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1555   BCTERIALGSPF  29     0.035    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 29.4 bits (66), Expect = 0.035
Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%)

Query: 247 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 306
W+ L L+ G +A +LR R+ + + P++ G+I
Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272

Query: 307 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDWLRQHPQQHISINLE 365
AR+ +T + S +PL Q +S + ++ R D +R+ H + LE
Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328

Query: 366 STVLTSEKIPQLLREMI 382
T L P ++R MI
Sbjct: 329 QTAL----FPPMMRHMI 341


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1552   ACRIFLAVINRP  1369   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1369 bits (3546), Expect = 0.0
Identities = 802/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%)

Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60
M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120
VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180
EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240
QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300
+ EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360
DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480
E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540
SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR
Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600
YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660
EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659

Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720
V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG
Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719

Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780
LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R
Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779

Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840
MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A
Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839

Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900
M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S
Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899

Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960
VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+
Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959

Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020
+EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF
Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019

Query: 1021 FVPVFFVVVRRRF 1033
FVPVFFVV+RR F
Sbjct: 1020 FVPVFFVVIRRCF 1032


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1551   RTXTOXIND     44     7e-07    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 44.0 bits (104), Expect = 7e-07
Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%)

Query: 100 TYQAAYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159
+ Y A +L + + Q+ Q +++ ++ L +Q +
Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313

Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218
+ + + +P+S ++ + V TEG +V + T + V + D + V
Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372

Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268
+ D + KV I D I+ + G + ++++
Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429

Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300
+ S + I L GM V A ++ G
Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455



Score = 34.0 bits (78), Expect = 9e-04
Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%)

Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQAAYDS 107
++I G+ T + R E++P + I+ + KEG + G L ++ +A
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134

Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167
D K Q++ A+L RYQ L E ++
Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186

Query: 168 RINLA 172
+L
Sbjct: 187 LTSLI 191


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1550   HTHTETR       222    5e-76    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 222 bits (567), Expect = 5e-76
Identities = 215/215 (100%), Positives = 215/215 (100%)

Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60
MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV
Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120

Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180
GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF
Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180

Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE
Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1549   RTXTOXIND     32     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 31.7 bits (72), Expect = 0.017
Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%)

Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87
N RA L + + L L+ + A L++ ++ E
Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264

Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143
+LR ++ + + +A V E L +T ++ L +A+ +
Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324

Query: 144 LQNAQ 148
Q +
Sbjct: 325 QQASV 329


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1545   IGASERPTASE   39     9e-05    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 38.5 bits (89), Expect = 9e-05
Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 31/251 (12%)

Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459
P E +Q + + + P+ AR + A + A T
Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037

Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508
V ++ E AT Q +E V A + + A E ++T
Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097

Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558
K A E+ +V+ PK + + E +N ++++
Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157

Query: 559 SQRHLNNRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617
Q N ++ A+ S+ ++ E T V N V P A + + +
Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217

Query: 618 IIADNNIQTLR 628
+ + +++R
Sbjct: 1218 KPKNRHRRSVR 1228


66   APECO1_1483   APECO1_1473   N   Y   N   Pathogenicity Island (unbiased-composition)
LocusTag      DNBias  CDNBias  %GCBias    Product
APECO1_1483   -1      11       -0.255549  hypothetical protein
APECO1_1482   -1      11       0.732728   outer membrane protease
APECO1_1481   0       11       1.496210   hypothetical protein
APECO1_1480   0       11       1.335129   bacteriophage N4 receptor, outer membrane
APECO1_1479   0       14       0.928157   bacteriophage N4 adsorption protein B
APECO1_1478   0       20       2.144445   sensor kinase CusS
APECO1_1477   0       20       2.329395   CusR family transcriptional regulator
APECO1_1476   0       19       1.399696   copper/silver efflux system outer membrane
APECO1_1475   -1      19       1.556249   copper-binding protein
APECO1_1474   -1      18       1.616713   copper/silver efflux system membrane fusion
APECO1_1473   -1      18       1.012639   copper/silver efflux system, membrane component
ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1483   LUXSPROTEIN   31     0.002    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 31.4 bits (71), Expect = 0.002
Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 40 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 92
T EHL F+ HL + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 93 AGESKI 98
++KI
Sbjct: 114 ENQNKI 119


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1482   OMPTIN        528    0.0      Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 528 bits (1360), Expect = 0.0
Identities = 313/317 (98%), Positives = 316/317 (99%)

Query: 1 MRAKLLGIVLTPPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60
MRAKLLGIVLT PIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS
Sbjct: 1 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60

Query: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESR 120
QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESR
Sbjct: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESR 120

Query: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180
HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI
Sbjct: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180

Query: 181 GSFPNGERAIGYKQRFKIPYIGLTGSYRYEDFELGGTFKYSGWVEASDNDEHYDPGKRIT 240
GSFPNGERAIGYKQRFK+PYIGLTGSYRYEDFELGGTFKYSGWVE+SDNDEHYDPGKRIT
Sbjct: 181 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKRIT 240

Query: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNDNTSDYSKNGA 300
YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHN+NTSDYSKNGA
Sbjct: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYSKNGA 300

Query: 301 GIENYNFITTAGLKYTF 317
GIENYNFITTAGLKYTF
Sbjct: 301 GIENYNFITTAGLKYTF 317


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1478   PF06580       31     0.007    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.4 bits (71), Expect = 0.007
Identities = 29/184 (15%), Positives = 67/184 (36%), Gaps = 34/184 (18%)

Query: 306 EELTRMAKMVSDML-FLAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDR-GVELRFV 363
+ M +S+++ + + N + + LADE+ V + + LA + L+F
Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVS------LADELTVVDSYLQ-LASIQFEDRLQFE 243

Query: 364 GDECQVAGDPLMLRRALSNLLSNALRY----TPTGETIVVRCQTVDHLVQVTVENPGTPI 419
D + + L+ N +++ P G I+++ + V + VEN G+
Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA 303

Query: 420 APEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVK---SIVVAHKGTVAVTSDVRGTRFVI 476
E +G GL V+ ++ + + ++ ++
Sbjct: 304 LKN------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 477 ILPA 480
++P
Sbjct: 346 LIPG 349


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1477   HTHFIS        86     2e-21    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 85.7 bits (212), Expect = 2e-21
Identities = 35/117 (29%), Positives = 62/117 (52%)

Query: 2 KLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWD 61
+L+ +D+ L + L+ AG+ V + N + GD DL++ D+++PD N +D
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 IVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118
++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


ORFs having significant similarity with Known Virulence factors
LocusTag      Hits          Score  E-value  Comments
APECO1_1476   RTXTOXIND     38     5e-05    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 38.3 bits (89), Expect = 5e-05
Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 13/189 (6%)

Query: 254 QAQTVNSDSLQSVKLPA-GLPSQILLQRPDIMEAEHALM-----AANANIGAARAAFFPS 307
+ +S + +K + +I+++ + + L+ A A+ ++
Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS----- 141

Query: 308 ISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYE 367
SL + + + + P F + L + + ++Q
Sbjct: 142 -SLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200

Query: 368 QKIQNAFKEVADALALRQSLNDQISAQQRYLASLQITLQRARALYQHGAVSYLEVLDAER 427
QK Q + A R ++ +I+ + + L +L A++ VL+ E
Sbjct: 201 QKYQ-KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259

Query: 428 SLFATRQTL 436
L
Sbjct: 260 KYVEAVNEL 268


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1473  ACRIFLAVINRP  693    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 693 bits (1789), Expect = 0.0
Identities = 213/1059 (20%), Positives = 438/1059 (41%), Gaps = 54/1059 (5%)

Query: 1 MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQI 60
M + IRR + A+ L + G I+ PV P ++ V + +YPG Q
Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56

Query: 61 VENQVTYPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119
V++ VT + M + + S G + + F+ GTDP A+ +V L
Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116

Query: 120 KLPAGVSAELGP-DATGVGWIYEYALVDRRGKHDLADLRSLQDWFLKYELKTIPDVAEVA 178
LP V + + + ++ V D+ +K L + V +V
Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176

Query: 179 SVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELA------EAEYMVR 232
G ++ +D L +Y ++ +V + L N + + + +
Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235

Query: 233 ASGYLQTLDDFNHIVLKASENGVPVYLRDVAKVQVGPEMRRGIAELNGEGEVAGGVVILR 292
A + ++F + L+ + +G V L+DVA+V++G E IA +NG+ AG + L
Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294

Query: 293 SGKNAREVIAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVV 352
+G NA + A+K KL L+ P+G++++ YD + + +I + L E ++V +V
Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354

Query: 353 CALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412
LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E
Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414

Query: 413 NAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEG 472
N + + E D + + ++ AL ++++ FIP+ G G
Sbjct: 415 NVERVMME----------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464

Query: 473 RLFGPLAFTKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRF----------LI 522
++ + T AMA + L+A+++ P L ++ + E F +
Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKP-VSAEHHENKGGFFGWFNTTFDHSV 523

Query: 523 RVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGEFLPQINEGDLLYMPSTLPGISA 582
Y + K+L LL+ AL V ++ ++ FLP+ ++G L M G +
Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583

Query: 583 AEAASMLQKTDKLIM--SVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639
+L + + V VF G + + + LKP ++
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640

Query: 640 MTMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADI-DTMAE 698
+ + +I + + + +++ + I +G + +
Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700

Query: 699 QIEEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGE 758
+ A+ + S LE +E+++EKA G++++D+ +++A+GG V +
Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760

Query: 759 TVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLADVADVKVSTGPSMLKTENA 818
++ + ++ +R P+ + +L + + + + + G L+ N
Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820

Query: 819 RPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPM 878
P+ I +A L + +A K L G ++G + ++ +V +
Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAI 878

Query: 879 TLMIIFVLLYLAFRRVGEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGGAA 938
+ +++F+ L + + ++ VP +VG + V G + G +A
Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938

Query: 939 EFGVVMLMYLRHAIEAEPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPI 998
+ ++++ + + +E E + + EA +R+RP MT I G+LP+
Sbjct: 939 KNAILIVEFAKDLMEKE----------GKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988

Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037
GAGS + + ++GGM++A LL++F +P + +
Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027


67  APECO1_1458  APECO1_1453  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
APECO1_1458  -1      18       4.713563  enterobactin exporter EntS
APECO1_1457  -2      17       4.568048  iron-enterobactin transporter periplasmic
APECO1_1456  -2      21       4.932488  isochorismate synthase
APECO1_1455  -1      21       4.874017  enterobactin synthase subunit E
APECO1_1454  0       18       4.442057  2,3-dihydro-2,3-dihydroxybenzoate synthetase
APECO1_1453  0       17       3.620091  2,3-dihydroxybenzoate-2,3-dehydrogenase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1458  TCRTETA  37     9e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 37.1 bits (86), Expect = 9e-05
Identities = 82/394 (20%), Positives = 147/394 (37%), Gaps = 38/394 (9%)

Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83
+ V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+
Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74

Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141
V+L + G ++ + P L +Y+ + G + G A A +
Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126

Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201
+ + G V P++GGL+ GG + + AA L L LP
Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183

Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253
+ PL+ + LA FR+ +V + + ++ + A+ V++ D +
Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241

Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSVRPGLLMLLSTLG---AFLAISLFGLMP 309
A IG AA L + A+ +G +A + ++L + ++ ++
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369
M +V LA G ML Q E G++ G A +G L
Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357

Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403
+ A + + +G+ + L LL L LRR
Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1457  FERRIBNDNGPP  63     2e-13    Ferrichrome-binding periplasmic protein signature.
		>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature.

Length = 296

Score = 62.7 bits (152), Expect = 2e-13
Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%)

Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99
H P RIV+ LLA+ VAD + R W E L
Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75

Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151
I EP+ E + P ++ SA G S + L+ IAP N+ D
Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131

Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209
+ LT++ ++ + A +AQ++ + + K + + ++
Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191

Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269
P S ++L++ G NA Q + + + LAA + + L +
Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243

Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309
KD DA+ A PL +P V+ + + F SAM
Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1454  ISCHRISMTASE  444    e-161    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 444 bits (1142), Expect = e-161
Identities = 146/299 (48%), Positives = 195/299 (65%), Gaps = 18/299 (6%)

Query: 1 MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60
MAIP +Q Y +P + D+PQNKV W +P RA LLIHDMQ+YFV + + ++ ANI
Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60

Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120
L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V
Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120

Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180
L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD
Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180

Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223
FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I
Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240

Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281
LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+
Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
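
The summary lines that precede each alignment (Score, Expect, Identities, Positives, Gaps) can be read programmatically. The following is a minimal Python sketch, not part of PredictBias; the helper names `parse_evalue` and `parse_hsp_summary` are hypothetical. It assumes the legacy BLAST convention seen in this report, where an E-value printed as `e-161` stands for `1e-161`.

```python
import re

# Patterns for the two summary lines that precede each BLAST alignment.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
COUNT_RE = re.compile(r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)")


def parse_evalue(text):
    """Convert an E-value string to float; 'e-161' is read as 1e-161."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text  # assumption: a bare exponent implies mantissa 1
    return float(text)


def parse_hsp_summary(lines):
    """Collect score, E-value and match counts from one summary block."""
    summary = {}
    for line in lines:
        match = SCORE_RE.search(line)
        if match:
            summary["bits"] = float(match.group(1))
            summary["raw_score"] = int(match.group(2))
            summary["evalue"] = parse_evalue(match.group(3))
        # Each count is stored as a (matches, alignment_length) tuple.
        for name, num, den in COUNT_RE.findall(line):
            summary[name.lower()] = (int(num), int(den))
    return summary


# Summary lines copied from the APECO1_1454 / ISCHRISMTASE hit above.
example = [
    "Score = 444 bits (1142), Expect = e-161",
    "Identities = 146/299 (48%), Positives = 195/299 (65%), Gaps = 18/299 (6%)",
]
hsp = parse_hsp_summary(example)
identity_fraction = hsp["identities"][0] / hsp["identities"][1]
print(hsp["evalue"], round(identity_fraction, 3))
```

Keeping the counts as integer pairs rather than the rounded percentages lets identity be recomputed at full precision, which may matter when comparing weak hits such as those with E-values near the 0.05 cutoff.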


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1453  DHBDHDRGNASE  365    e-131    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 365 bits (939), Expect = e-131
Identities = 111/258 (43%), Positives = 150/258 (58%), Gaps = 20/258 (7%)

Query: 15 GKNVWITGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 59
GK +ITGA +GIG A A GA + D +A E +P
Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63

Query: 60 VMDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 119
DV D+A + ++ R+ E +D+LVN AG+LR G LS E+W+ TF+VN G FN
Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122

Query: 120 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 179
+ +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC
Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182

Query: 180 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 239
N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S
Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242

Query: 240 ASHITLQDIVVDGGSTLG 257
A HIT+ ++ VDGG+TLG
Sbjct: 243 AGHITMHNLCVDGGATLG 260


68  APECO1_1298  APECO1_1293  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
APECO1_1298  -1      20       3.589661  hypothetical protein
APECO1_1297  -1      18       3.738266  hypothetical protein
APECO1_1296  -1      16       3.461960  ABC transporter
APECO1_1295  -1      14       3.348042  hypothetical protein
APECO1_1294  -1      13       3.075966  DNA-binding transcriptional regulator
APECO1_1293  0       12       3.176083  ATP-dependent RNA helicase RhlE
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1298  ABC2TRNSPORT  47     3e-08    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 47.2 bits (112), Expect = 3e-08
Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%)

Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256
R Q T + +L + L I +G+ A A IG+ A + + L+L
Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148

Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314
Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340
P+ H D+ + I L +D+
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1296  PF05272  31     0.012    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 31.2 bits (70), Expect = 0.012
Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%)

Query: 298 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 357
PR E + +LG P + + + K HV +
Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589

Query: 358 KRGEIFG----LLGPNGAGKSTTFKMMCGL 383
+ G F L G G GKST + GL
Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619



Score = 29.3 bits (65), Expect = 0.048
Identities = 11/23 (47%), Positives = 13/23 (56%)

Query: 39 YVTGLVGPDGAGKTTLMRMLAGL 61
Y L G G GK+TL+ L GL
Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1295  RTXTOXIND  62     8e-13    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 61.8 bits (150), Expect = 8e-13
Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%)

Query: 83 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 142
Q + + +A+ +LA E + + + + L +
Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260

Query: 143 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 197
N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+
Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320

Query: 198 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 255
E Q S + AP + V G V+ T+ + + V A V +++
Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380

Query: 256 QPGRKVLLYTDGRPNKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 309
G+ ++ + P Y G++ ++ A D R LV+ + I +
Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432

Query: 310 ----DADDALRQGMPVTVQ 324
+ + L GM VT +
Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1294  HTHTETR  72     9e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 72.4 bits (177), Expect = 9e-18
Identities = 32/220 (14%), Positives = 74/220 (33%), Gaps = 29/220 (13%)

Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71
+ ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ +
Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67

Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSK---FISR 128
IGE E + P + +RE+++ + + + + +
Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121

Query: 129 EQLSPTAAYHLVHEQVISPLHSHLTRLIAAW---TGCDASDTRMILHTHALIGEILAFRL 185
E A + + + L I A +I+ I ++
Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM---- 175

Query: 186 GKETILLRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225
W + + + ++ ++L+
Sbjct: 176 --------ENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits  Score  E-value  Comments
APECO1_1293  SECA  30     0.026    SecA protein signature.
		>SECA#SecA protein signature.

Length = 901

Score = 29.8 bits (67), Expect = 0.026
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304
Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++
Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506

Query: 305 AARGLDI 311
A RG DI
Sbjct: 507 AGRGTDI 513


69  APECO1_1254  APECO1_1246  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_1254  1       15       -1.052994  D-alanyl-D-alanine carboxypeptidase
APECO1_1253  2       15       -0.364901  DNA-binding transcriptional repressor DeoR
APECO1_1252  1       14       -0.219008  undecaprenyl pyrophosphate phosphatase
APECO1_1251  1       13       -0.198306  proton motive force efflux pump
APECO1_1250  0       14       -0.694989  hypothetical protein
APECO1_1249  0       15       -1.063550  hypothetical protein
APECO1_1248  -1      14       -0.263644  DEOR-type transcriptional regulator
APECO1_1247  -1      12       -0.891298  DEOR-type transcriptional regulator
APECO1_1246  0       12       0.127640   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_1254  BLACTAMASEA  43     8e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.2 bits (102), Expect = 8e-07
Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%)

Query: 23 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 72
+ L A A P + I MD ASG+ L ADE+
Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65

Query: 73 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 132
S K++ V + A +L + + +P V D ++V +L
Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119

Query: 133 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 188
I S N A L V G + + +++G T ++T PG
Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174

Query: 189 --STARDMA------LLGKAL 201
+T MA L + L
Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1251  TCRTETA  40     1e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 39.8 bits (93), Expect = 1e-05
Identities = 58/269 (21%), Positives = 106/269 (39%), Gaps = 23/269 (8%)

Query: 71 LLGPLSDRIGRRPVMLAGVVWFIVTCLAILLAQNIEQFTLLRFLQGISLCFIGAVGYAAI 130
+LG LSDR GRRPV+L + V + A + + R + GI+ GAV A I
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120

Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVG---AAWIHVLPWEGMFVLFAALAAISFFG 187
+ + + M+ + GP++G + P F AAL ++F
Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----FFAAAALNGLNFLT 176

Query: 188 LQRAMPETATRIGEKLSLKELGRDYKLVLKNG-RFVAGALALGFVSLPLLAWIAQSP--I 244
+PE+ L + L G VA +A+ F ++ + Q P +
Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF----IMQLVGQVPAAL 232

Query: 245 IIITGEQLSSYEYGLLQVPIFGALIAGNL----LLARLTSRRTVRSLIIMGGWPIMIGLL 300
+I GE ++ + + + I +L + + +R R +++G G +
Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292

Query: 301 VAAAATVISSHAYLWMTAGLSIYAFGIGL 329
+ A AT ++ + + + GIG+
Sbjct: 293 LLAFAT----RGWMAFPIMVLLASGGIGM 317


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1248  TCRTETB  34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 33.7 bits (77), Expect = 0.001
Identities = 34/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%)

Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275
+IGV+ + F + + P +M D H S GS+I T+ + + + GG +
Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317

Query: 276 DRYSRVAVVR-ASALM--GALGIGLIIFVDSAWVA-GVSVVLWGLGASLGFPLTISAASD 331
DR + V+ + L ++ S ++ + VL GL + TI ++S
Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377

Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361
+A +S++ T +L+ G ++G L
Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1247  HTHTETR  52     1e-10    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 51.6 bits (123), Expect = 1e-10
Identities = 14/81 (17%), Positives = 31/81 (38%)

Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFSGIDELLLEAFS 61
+ + R+ I+ L G+ + + +IA AGV G++ ++F +L E +
Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64

Query: 62 RFTEIMSRQYQAFFSDVSDAP 82
+ + + P
Sbjct: 65 LSESNIGELELEYQAKFPGDP 85


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1246  TCRTETA  32     0.006    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 32.1 bits (73), Expect = 0.006
Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%)

Query: 394 LMIGMITFQFSTFSFGMGNAAGLLFAGIML-GFMRANHPTFG-YIPQ--GALSMVKEFGL 449
L++ + +L+ G ++ G A G YI + FG
Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135

Query: 450 MVFMAGVGLSAGSGINNGLGAIGGQM--LIAGLIVSLVPVVICFLF 493
M G G+ AG + +G A + L + CFL
Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181


70  APECO1_158  APECO1_166  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias   Product
APECO1_158  1       13       2.323437  flagellar hook protein FlgE
APECO1_159  -1      12       2.318488  flagellar basal body rod protein FlgF
APECO1_160  -1      10       1.147453  flagellar basal body rod protein FlgG
APECO1_161  0       13       2.177103  flagellar basal body L-ring protein
APECO1_162  0       12       1.947410  flagellar basal body P-ring protein
APECO1_163  1       13       1.643936  flagellar rod assembly protein/muramidase FlgJ
APECO1_164  2       13       1.197015  flagellar hook-associated protein FlgK
APECO1_165  3       15       1.131492  flagellar hook-associated protein FlgL
APECO1_166  3       15       1.341718  ribonuclease E
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_158  FLGHOOKAP1  41     6e-06    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 41.1 bits (96), Expect = 6e-06
Identities = 17/49 (34%), Positives = 29/49 (59%)

Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401
L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R
Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546



Score = 36.9 bits (85), Expect = 1e-04
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%)

Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57
A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+
Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_160  FLGHOOKAP1  44     4e-07    Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 43.8 bits (103), Expect = 4e-07
Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%)

Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62
S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL
Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47

Query: 63 PSGLQIGTGVRPVATERLHSQ 83
+G +G GV +R +
Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68



Score = 41.1 bits (96), Expect = 3e-06
Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260
S VN+ EE N+ + Q+ Y N++ + T + + L +
Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_161  FLGLRINGFLGH  350    e-126    Flagellar L-ring protein signature.
		>FLGLRINGFLGH#Flagellar L-ring protein signature.

Length = 232

Score = 350 bits (898), Expect = e-126
Identities = 232/232 (100%), Positives = 232/232 (100%)

Query: 4 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 63
MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY
Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60

Query: 64 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 123
GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA
Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120

Query: 124 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 183
RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF
Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180

Query: 184 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 235
SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_162  FLGPRINGFLGI  425    e-151    Flagellar P-ring protein signature.
		>FLGPRINGFLGI#Flagellar P-ring protein signature.

Length = 373

Score = 425 bits (1095), Expect = e-151
Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%)

Query: 5 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 64
F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++
Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72

Query: 65 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 124
ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M
Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131

Query: 125 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 184
T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F
Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191

Query: 185 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 240
L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I
Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250

Query: 241 QNMQVNVTPQDAKVVINLRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 300
+N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF
Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308

Query: 301 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 360
GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+
Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367

Query: 361 AKL 363
A+L
Sbjct: 368 AEL 370


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_163  FLGFLGJ  507    0.0      Flagellar protein FlgJ signature.
		>FLGFLGJ#Flagellar protein FlgJ signature.

Length = 313

Score = 507 bits (1306), Expect = 0.0
Identities = 310/313 (99%), Positives = 311/313 (99%)

Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60
MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG
Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60

Query: 61 LFSSERTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120
LFSSE TRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET
Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120

Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGNSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180
VVRYQNQALSQLVQKAVPRNYDDSLPG+SKAFLAQLSLPAQLASQQSGVPHHLILAQAAL
Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180

Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240
ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL
Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240

Query: 241 EALSDYVGLLTRNPRYAAVTTAVSAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300
EALSDYVGLLTRNPRYAAVTTA SAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK
Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300

Query: 301 VSKTYSMNIDNLF 313
VSKTYSMNIDNLF
Sbjct: 301 VSKTYSMNIDNLF 313


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_164  FLGHOOKAP1  682    0.0      Flagellar hook-associated protein signature.
		>FLGHOOKAP1#Flagellar hook-associated protein signature.

Length = 546

Score = 682 bits (1761), Expect = 0.0
Identities = 545/546 (99%), Positives = 545/546 (99%)

Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61
SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS
Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60

Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121
GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV
Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120

Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181
SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND
Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180

Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241
QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA
Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240

Query: 242 RQLAAVPSSADPSRTTVAYVDRTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301
RQLAAVPSSADPSRTTVAYVD TAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL
Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300

Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 361
ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD
Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360

Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421
YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV
Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420

Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481
NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN
Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480

Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541
KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540

Query: 542 ALINIR 547
ALINIR
Sbjct: 541 ALINIR 546


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
APECO1_165  FLAGELLIN  46     8e-08    Flagellin signature.
		>FLAGELLIN#Flagellin signature.

Length = 507

Score = 46.2 bits (109), Expect = 8e-08
Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%)

Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66
++ Q N+ +S + + E++S+G R+ + DD + A + +Q +
Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67

Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126
E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N
Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127

Query: 127 TTDGNGRYIFAGYKTETAPFSEADGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186
T NG + + DG E+I + +G G + +
Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181

Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232
+ T A + + TA DK
Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_166  IGASERPTASE  64     3e-12    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 63.9 bits (155), Expect = 3e-12
Identities = 41/226 (18%), Positives = 79/226 (34%), Gaps = 12/226 (5%)

Query: 590 PAEQSAPKAEAKPERQQDRR-----KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRR 644
P+ S + A+ + N ++++++ D E +NR
Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067

Query: 645 QAQQQTAETRESRQQAEV------TEKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALN 698
A++ + + + Q EV T++ +TT+ ++ E+ + + + Q+ K +
Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK-VT 1126

Query: 699 VEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEA 758
+ QE + + + R +N K Q+ P E + E V E+
Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186

Query: 759 PAPRTELVKVPLPVVAQAAPEQQEENNADNRDNGGMPRRSRRSPRH 804
T V P A Q N+ + RRS RS H
Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232



Score = 62.0 bits (150), Expect = 1e-11
Identities = 46/287 (16%), Positives = 91/287 (31%), Gaps = 35/287 (12%)

Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAATATPASPAQPGLL 571
P E+ + DVP P+ E A AP P A ATP+ +
Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038

Query: 572 SRFFGALKALFSGGEETKPAEQSAPKAEAKPERQQDRRKP-RQNNRRDRNERRDTRSER- 629
AE S +++ + +QD + QN + + + ++
Sbjct: 1039 -----------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081

Query: 630 -TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKR 688
E + + E + + ++TA + + TEK + + + + + +
Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141

Query: 689 QAQ---QEAKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAP 743
QA+ + +N++E Q + +P + + Q V +V V P
Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENP 1199

Query: 744 VVEETVAAEPIVQEAPAPRTELVKVPLPVVAQAAPEQQEENNADNRD 790
+P V + K ++ P E + D
Sbjct: 1200 ENTTPATTQPTVNSESS---NKPKNRHRRSVRSVPHNVEPATTSSND 1243


71  APECO1_204  APECO1_212  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_204  -2      16       -0.277028  spermidine/putrescine ABC transporter
APECO1_206  -2      14       -0.346329  spermidine/putrescine ABC transporter membrane
APECO1_207  -1      12       -0.638148  spermidine/putrescine ABC transporter membrane
APECO1_208  0       11       -0.288984  putrescine/spermidine ABC transporter ATPase
APECO1_209  -1      12       0.264512   peptidase T
APECO1_210  0       14       0.838203   hypothetical protein
APECO1_211  -1      15       0.438933   sensor protein PhoQ
APECO1_212  -2      16       0.805036   DNA-binding transcriptional regulator PhoP
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_204  CHLAMIDIAOMP  28     0.043    Chlamydia major outer membrane protein signature.
		>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature.

Length = 393

Score = 28.4 bits (63), Expect = 0.043
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)

Query: 137 GVNGDAVDPKSVTSWADL------WKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPK 190
G GD DP T+W D + ++ +L D + FQM + +GN T P
Sbjct: 42 GFGGDPCDP--CTTWCDAISMRMGYYGDFVFDRVLKTDVNKEFQMGDKPTSTTGNATAPT 99

Query: 191 EIEAAYN 197
+ A N
Sbjct: 100 TLTAREN 106


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_208  PF05272  30     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.017
Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%)

Query: 46 LTLLGPSGCGKTTVLRLIAGLE-TVDSGRIMLDNED 80
+ L G G GK+T++ + GL+ D+ + +D
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_211  PF06580  29     0.048    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 28.7 bits (64), Expect = 0.048
Identities = 11/69 (15%), Positives = 22/69 (31%), Gaps = 20/69 (28%)

Query: 389 NACKYCLE------FVEISARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQ 442
N K+ + + + + + + + VE+ G + +E
Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------------ST 311

Query: 443 GVGLAVARE 451
G GL RE
Sbjct: 312 GTGLQNVRE 320


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_212  HTHFIS  87     5e-22    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 86.8 bits (215), Expect = 5e-22
Identities = 31/124 (25%), Positives = 62/124 (50%)

Query: 2 RVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHLPDIAIVDLGLPDEDGLS 61
+LV +D+A +R L + AG+ V +A ++ D+ + D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 LIRRWRSNDVSLPILVLTARESWQDKVEVLSAGADDYVTKPFHIEEVMARMQALMRRNSG 121
L+ R + LP+LV++A+ ++ ++ GA DY+ KPF + E++ + +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 LASQ 125
S+
Sbjct: 125 RPSK 128
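
As an illustration only, the `Score`/`Expect` and `Identities` lines that open each alignment in this report can be converted into numbers with a short script. The sketch below is not part of PredictBias; the names `parse_hsp` and `parse_evalue` are hypothetical, and the regular expressions assume the line layout seen in this report, including the abbreviated form `Expect = e-118` and the optional `Gaps` field.

```python
import re

# Patterns assume the statistics-line layout used in this report.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),"
    r"\s*Expect\s*=\s*(?P<evalue>\S+)"
)
IDENT_RE = re.compile(
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
)


def parse_evalue(text):
    """Convert an E-value string to float; 'e-118' is read as 1e-118."""
    return float("1" + text) if text.startswith("e") else float(text)


def parse_hsp(score_line, identities_line):
    """Parse one Score/Expect line and the Identities line that follows it."""
    s = STATS_RE.search(score_line)
    i = IDENT_RE.search(identities_line)
    if s is None or i is None:
        raise ValueError("not a Score/Identities line pair")
    length = int(i["length"])
    return {
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": parse_evalue(s["evalue"]),
        "identity": int(i["ident"]) / length,   # fraction of identical positions
        "positives": int(i["pos"]) / length,    # fraction of positive-scoring positions
        "gaps": int(i["gaps"] or 0),            # 0 when no Gaps field is printed
        "length": length,
    }


# Usage with the two statistics lines of the APECO1_212 hit above.
hsp = parse_hsp(
    "Score = 86.8 bits (215), Expect = 5e-22",
    "Identities = 31/124 (25%), Positives = 62/124 (50%)",
)
print(hsp["evalue"], hsp["identity"])
```

Keeping identity and positives as fractions of the alignment length, rather than reusing the rounded percentages printed in the report, avoids compounding rounding error when hits are ranked or filtered later.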


72  APECO1_241  APECO1_248  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_241  3       27       -5.977532  outer membrane porin protein NmpC
APECO1_242  1       24       -3.749645  bacteriophage lambda lysozyme-like protein
APECO1_243  2       20       -1.476301  Rz endopeptidase from lambdoid prophage
APECO1_244  1       23       0.274089   lambda prophage Bor protein
APECO1_245  2       23       2.761076   truncated TonB-like membrane protein encoded
APECO1_246  1       22       3.355503   hypothetical protein
APECO1_247  1       21       3.055962   prophage Qin DNA packaging protein NU1-like
APECO1_248  2       22       3.357780   DNA packaging protein of prophage; terminase
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_241  ECOLIPORIN  508    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 508 bits (1310), Expect = 0.0
Identities = 241/388 (62%), Positives = 280/388 (72%), Gaps = 33/388 (8%)

Query: 21 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYAR 80
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 81 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 140
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 141 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 200
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 201 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTDGQVAYGK 243
D+ NGDGFG STTY+ GF GA Y SDRT+ QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 244 SKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEAV 297
+ A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFE
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 298 AQYQFDFGLRPSVAYLQSKGKDLGVH----GDRDLVKYVDVGATYYFNKNMSTFVDYKIN 353
AQYQFDFGLRP+V++L SKGKDL + D+DLVKY DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 354 LID-DSKFTKTAGIDTDDIVAVGLVYQF 380
L+D D F K AGI TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_244  PF06291  186    3e-65    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 186 bits (473), Expect = 3e-65
Identities = 102/102 (100%), Positives = 102/102 (100%)

Query: 1 MQDNKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAA 60
MQDNKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAA
Sbjct: 1 MQDNKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAA 60

Query: 61 KICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102
KICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ
Sbjct: 61 KICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_245  TONBPROTEIN  69     2e-17    Gram-negative bacterial tonB protein signature.
		>TONBPROTEIN#Gram-negative bacterial tonB protein signature.

Length = 239

Score = 68.9 bits (168), Expect = 2e-17
Identities = 33/82 (40%), Positives = 46/82 (56%)

Query: 41 ADEPRQLVTVYPRYPEYAAANYIKGLVEVKFDIGADGTVTRIVFLRSEPHNLFRDEVVKA 100
A PR L P+YP A A I+G V+VKFD+ DG V + L ++P N+F EV A
Sbjct: 150 ASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNA 209

Query: 101 MAKWRFEKNRPCQGVKRQFIFT 122
M +WR+E +P G+ +F
Sbjct: 210 MRRWRYEPGKPGSGIVVNILFK 231


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_248  2FE2SRDCTASE  31     0.011    Ferric iron reductase signature.
		>2FE2SRDCTASE#Ferric iron reductase signature.

Length = 262

Score = 31.2 bits (70), Expect = 0.011
Identities = 10/41 (24%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 316 TRDGLMFFSARGDEIPPPRSITFHIWTAYSPFTTWVQIVYD 356
R+ L+ F R DE P ++T W++ + ++ + + D
Sbjct: 36 HREHLLEF-IRLDEPAPLNAMTLAQWSSPNVLSSLLAVYSD 75


73  APECO1_332  APECO1_340  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_332  -2      11       -0.917948  2-dehydro-3-deoxyphosphooctonate aldolase
APECO1_333  -2      15       -0.908043  calcium/sodium:proton antiporter
APECO1_334  -1      15       0.691295   hypothetical protein
APECO1_335  -1      12       1.286709   cation transport protein ChaC
APECO1_336  -1      14       1.921673   hypothetical protein
APECO1_337  -1      17       2.070784   hypothetical protein
APECO1_338  -1      20       2.473138   transcriptional regulator NarL
APECO1_339  0       22       2.720512   nitrate/nitrite sensor protein NarX
APECO1_340  -1      22       1.969671   nitrate/nitrite transporter NarK
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_332  TRNSINTIMINR  29     0.032    Translocated intimin receptor (Tir) signature.
		>TRNSINTIMINR#Translocated intimin receptor (Tir) signature.

Length = 549

Score = 28.5 bits (63), Expect = 0.032
Identities = 35/157 (22%), Positives = 58/157 (36%), Gaps = 23/157 (14%)

Query: 82 QELKQTFGVKIITDVHEPSQAQPVADVVDVIQLPAFLARQTDLVEAMAKTGAVINVKKPQ 141
Q +QT T V + + P V + Q + + D ++A T
Sbjct: 391 QPAEQTTTTTTHTVVQQQTGGIPQHKVALMPQERRRFSDRRDSQGSVASTH--------- 441

Query: 142 FVSPGQMGNIVDKFKEGGNEKVILCDRGA-NFGYDNLVVDMLGFSIMKKVSGNSPVIFDV 200
+V+ + E G + L YD + D G+S+++ SG+ P V
Sbjct: 442 --WSDSSSEVVNPYAEVGGARNSLSAHQPEEHIYDEVAADP-GYSVIQNFSGSGP----V 494

Query: 201 THALQCRDPFGAASGGRRAQVAELA-RAGMAVGLAGL 236
T L G G ++ A LA G+ +G+ GL
Sbjct: 495 TGRL-----IGTPGQGIQSTYALLANSGGLRLGMGGL 526


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_337  INTIMIN  256    5e-79    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 256 bits (654), Expect = 5e-79
Identities = 120/378 (31%), Positives = 193/378 (51%), Gaps = 21/378 (5%)

Query: 32 GEQAKAFALGKVRDALSQQVNQHVDSWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 91
G+ AK ALG + S Q+ WL +G A V+++ N F GS + +P D+
Sbjct: 184 GDYAKDTALGIAGNQASSQLQA----WLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237

Query: 92 DRYLTWSQLGLTQQDDGLVSNVGVGQRWARGSWLVGYNTFYDNLLDENLQRAGFGAEAWG 151
++ L + Q+G D +N+G GQR+ ++GYN F D + R G G E W
Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297

Query: 152 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSVEQYFGDR 209
+Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+GD
Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357

Query: 210 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 269
V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P +
Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417

Query: 270 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 329
Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L +
Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476

Query: 330 RSRYGIRQLIWQGDTQILS-----LTPGAQANSEEGWTLIMPDWQNGEGASNHWRLSVVV 384
+S+YG+ +++W D+ + S G+Q S + + I+P + +G SN ++++
Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531

Query: 385 EDNQGQRVSSNEITLTLV 402
D G SSN + LT+
Sbjct: 532 YDRNGN--SSNNVLLTIT 547


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_338  HTHFIS  74     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 73.7 bits (181), Expect = 2e-17
Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%)

Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66
ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123
+ L ++++ ++V S N + A ++GA YL K + +L+ + +A
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_339  PF06580  53     2e-09    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 52.9 bits (127), Expect = 2e-09
Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%)

Query: 424 PESSRELLSQIRNELNASWVQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483
P +RE+L+ + + S + +LT +++ + S +F ++ +
Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243

Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534
Q+ P + VP L+Q E N +KH Q ++++ +++ V L V
Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296

Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585
++ G +N S G+ +R+R Q L G + +++ E G + IP
Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_340  ACRIFLAVINRP  31     0.011    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 31.0 bits (70), Expect = 0.011
Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%)

Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314
I+S + L+ + I A A L K + + FFG F S ++
Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533

Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370
LG T L+ + L+ +LFL LP+ G F+ L +G+T
Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583

Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411
+ + ++T +K E + E + A + F+S
Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629


74  APECO1_407  APECO1_417  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_407  3       26       5.113099   tail component of prophage CP-933O
APECO1_408  3       26       4.422593   minor tail protein
APECO1_409  3       25       4.217669   minor tail protein
APECO1_410  1       22       3.631478   tail fiber component K of prophage
APECO1_411  1       21       2.761677   tail component of prophage CP-933K
APECO1_412  0       20       1.546732   hypothetical protein
APECO1_413  0       21       1.997310   tail component of prophage
APECO1_414  0       25       -1.109425  hypothetical protein
APECO1_415  0       27       -2.304095  tail fiber protein
APECO1_416  1       24       -5.388330  phage-related tail fiber assembly protein G
APECO1_417  0       17       -3.205136  hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_407  GPOSANCHOR  38     2e-04    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 37.7 bits (87), Expect = 2e-04
Identities = 38/231 (16%), Positives = 71/231 (30%), Gaps = 12/231 (5%)

Query: 377 TLQSDMEKAGELAARDRAERESSQLKYTGEAQKAYERLQTPLDKYTARQKELNKALKDGK 436
+ ++ + E D + + ++ + L+ AR+ +L KAL+
Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 168

Query: 437 ILQADYNTLMASAKKDYESTLKKPSGVKVSAGERQEDRAHAALLALETELRTLEKHSGVN 496
SAK K + + E+ + A A +++TLE
Sbjct: 169 NFSTAD-----SAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAAL 223

Query: 497 E---KISQQRRDLWEAESQYVVLKEAATKRQLSEQEKSLLAHEKETLEYKRQLAELGDKI 553
++ + S K + + + E EK KI
Sbjct: 224 AARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283

Query: 554 E-HQKRLNELAQQAARFEQQQSAKQAAISAKARGLTDRQAQRESEEQRLRE 603
+ + L + A E Q A + R L A RE+++Q E
Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL---DASREAKKQLEAE 331


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_411  PF06291  27     0.032    Lambda prophage Bor protein
		>PF06291#Lambda prophage Bor protein

Length = 102

Score = 26.5 bits (58), Expect = 0.032
Identities = 13/37 (35%), Positives = 17/37 (45%), Gaps = 5/37 (13%)

Query: 128 ILFSMGAAMTLGGVAQML-----APKARTPRTQTTDN 159
+LFS AM + G AQ P A TP+ T +
Sbjct: 9 MLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_412  PHAGEIV  30     0.001    Gene IV protein signature.
		>PHAGEIV#Gene IV protein signature.

Length = 426

Score = 30.3 bits (68), Expect = 0.001
Identities = 15/49 (30%), Positives = 30/49 (61%), Gaps = 2/49 (4%)

Query: 35 KNIDELSGCISRQWAGNGTPITSLPIEN-GVSL-LVPQAMGGYDVVLDI 81
+N+ ++G ++ + A P ++ +N G+S+ + P AM G ++VLDI
Sbjct: 289 QNVPFITGRVTGESANVNNPFQTVERQNVGISMSVFPVAMAGGNIVLDI 337


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_414  ENTEROVIROMP  138    4e-44    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 138 bits (350), Expect = 4e-44
Identities = 63/200 (31%), Positives = 98/200 (49%), Gaps = 30/200 (15%)

Query: 1 MRKVCAVILSAAICLSVSGAPAWASEHQSTLSAGYLHARTNAPGSDNLNGINVKYRYEFT 60
M+K+ + AA+ +G A ST++ GY + + + G N+KYRYE
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA---TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEED 56

Query: 61 DA-LGLITSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYSMAGV 119
++ LG+I SF+Y T S T D +N+++ + AGP+ R+N+W S Y + GV
Sbjct: 57 NSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGV 108

Query: 120 AYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVTIDLAYE 179
Y + T T+ HD S+ ++GAG+QFNP E+V +D +YE
Sbjct: 109 GYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSYE 151

Query: 180 GSGSGDWRTDAFIVGIGYRF 199
S +I G+GYRF
Sbjct: 152 QSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_415  CHANLCOLICIN  46     8e-07    Channel forming colicin signature.
		>CHANLCOLICIN#Channel forming colicin signature.

Length = 522

Score = 45.8 bits (108), Expect = 8e-07
Identities = 54/319 (16%), Positives = 116/319 (36%)

Query: 152 ARAASTSAGQAASSAQSASSSAGTASTKAREAAKSAAAAESSKSAAATSASAAKTSETNA 211
+ S S AA A + S+A T+A +AA++ AAAE+ A A + + +
Sbjct: 39 GKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIV 98

Query: 212 AASQQSAATSASTATTKASEAATSARDASASKEAAKSSETNAASSASSAASSATAAANSA 271
+ + A+ +AT A + + AK+ E + ++ + A
Sbjct: 99 NEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRK 158

Query: 272 KAAKTSETNARSSETAAGQSASAAADSKTAAALSASAASTSAGQASASATAAGKSAESAA 331
+ + R + A + AA S+ A A+ + SA Q+ ++
Sbjct: 159 EIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSR 218

Query: 332 SSASTATTKAGEAAVQASAAARSASAAKTSKTNAKASETSAESSKTAAASSASSAASSAS 391
S+S A + + ++AK + + + S ++ A
Sbjct: 219 LSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRV 278

Query: 392 SASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAED 451
A ++E +Q +A++ + T+ + + + +++ + AE K+A++
Sbjct: 279 GAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQN 338

Query: 452 IASAVALEDASTTKKGIVQ 470
++DA Q
Sbjct: 339 NLLNSQIKDAVDATVSFYQ 357



Score = 36.6 bits (84), Expect = 5e-04
Identities = 66/358 (18%), Positives = 125/358 (34%), Gaps = 30/358 (8%)

Query: 313 AGQASASATAAGKSAESAASSASTATTKAGEAAVQASAAARSASAAKTSKTNAKASETSA 372
+G KS SAA A+ + A QA AAR+ +AA ++ A
Sbjct: 32 SGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAA--------EAQAKA 83

Query: 373 ESSKTAAASSASSAASSASSASASKDEATRQASAAKGSATTASTKATEAAGSATAAAQSK 432
++++ A + A +AS+ + + + A+ A +A A+++
Sbjct: 84 KANRDALTQRLKDIVNEALRHNASRTPSATELA-------HANNAAMQAEDERLRLAKAE 136

Query: 433 STAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSESLAATPKAVKAA 492
A A AE A + AE + E A T ++ ++L+ A +L+ KAV+ A
Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREKAETERQ--LKLAEAEEKRLAALSEEAKAVEIA 194

Query: 493 YELANGKYTAQDATTAQKGIVQLSNATNSTSEMLAATPKSVKAAYDLANGKYTAQDATTA 552
+ + AQ +V++ + + L+++ + A GK +A
Sbjct: 195 ---------QKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASA 245

Query: 553 QKGIVQLSSATNSTSEMLAATPKSVKAAYDLANGKYTAQDAT-TAQKGIVQLSSATNSAS 611
+ + A P + ++ + A QK + + N +
Sbjct: 246 K---YKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRIN 302

Query: 612 ETLAATPKAVKAANNNANGRVPSARKVNGKALSADITLTPKDIGTLNSTTMSFSGGAG 669
+ KA+ +NN N + + A L I T+SF
Sbjct: 303 ADITQIQKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLT 360



Score = 31.2 bits (70), Expect = 0.022
Identities = 52/321 (16%), Positives = 99/321 (30%), Gaps = 23/321 (7%)

Query: 114 SRNASAVAQNTAAAKKSASDASASASEAATHATDAAASARAASTSAGQAASSAQSASSSA 173
S++ S+ A + A +A A +AA A A A+A + + +
Sbjct: 43 SKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEAL 102

Query: 174 GTASTKAREAAKSAAAAESSKSAAATSASAAKTSETNAAASQQSAATSASTATTKASEAA 233
+++ A + A A ++ A AK + A + A KA + A
Sbjct: 103 RHNASRTPSATELAHANNAAMQAEDERLRLAK---------AEEKARKEAEAAEKAFQEA 153

Query: 234 TSARDASASKEAAKSSETNAAS------SASSAASSATAAANSAKAAKTSETNARSSE-T 286
R ++A + A +A S + A A +A SE E
Sbjct: 154 EQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIK 213

Query: 287 AAGQSASAAADSKTAAALSASAASTSAGQASASATAAGKSAESAASSASTA-TTKAGEAA 345
S++ ++ A + + QASA + + + A+ + A
Sbjct: 214 TLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNRPFFEA 273

Query: 346 VQASAAARSASAAKTSKTNAKASETSAESSKTAAASSASSAASSASSAS------ASKDE 399
+ A K + A + + ++ A S S+ +A A ++
Sbjct: 274 TRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEAEENL 333

Query: 400 ATRQASAAKGSATTASTKATE 420
Q + A
Sbjct: 334 KKAQNNLLNSQIKDAVDATVS 354


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_417  LUXSPROTEIN  30     0.005    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.

Length = 171

Score = 29.9 bits (67), Expect = 0.005
Identities = 17/66 (25%), Positives = 30/66 (45%), Gaps = 7/66 (10%)

Query: 41 TKEHLLPHFL-EHVGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 93
T EHL F+ H+ + ++I G TGFY++ + S + D A++
Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113

Query: 94 AGESKI 99
++KI
Sbjct: 114 ENQNKI 119


75  APECO1_444  APECO1_458  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_444  -1      14       -0.694933  RNase II stability modulator
APECO1_445  -2      12       0.311225   exoribonuclease II
APECO1_446  -1      11       0.340230   hypothetical protein
APECO1_447  -1      12       0.062565   enoyl-(acyl carrier protein) reductase
APECO1_448  -1      13       -0.111956  transcription regulatory protein
APECO1_449  -1      12       0.078401   acriflavine resistance protein A precursor
APECO1_450  -1      12       0.756063   acriflavine resistance protein B
APECO1_451  -1      13       0.643069   outer membrane channel protein
APECO1_452  -1      14       0.641159   membrane transport protein
APECO1_453  -2      14       1.123761   peptide ABC transporter ATP-binding protein
APECO1_454  -2      14       0.860491   peptide ABC transporter ATP-binding protein
APECO1_455  -2      13       0.973337   peptide ABC transporter permease
APECO1_456  -1      14       0.002360   peptide transport system permease SapB
APECO1_457  -2      16       -2.152315  peptide transport periplasmic protein SapA
APECO1_458  -1      14       -2.945856  phage shock protein operon transcriptional
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_444  PF08280  30     0.043    M protein trans-acting positive regulator
		>PF08280#M protein trans-acting positive regulator

Length = 530

Score = 29.8 bits (67), Expect = 0.043
Identities = 21/105 (20%), Positives = 36/105 (34%), Gaps = 2/105 (1%)

Query: 526 PIDVELTESCLIENDELALSVIQQFSRLGAQVHLDDFGTGYSSLSQLARFPIDAIKLDQV 585
P+ V S I L S + FS + + ++ Q+ D +
Sbjct: 425 PLVVVFVASNFINAHLLTDSFPRYFS--DKSIDFHSYYLLQDNVYQIPDLKPDLVITHSQ 482

Query: 586 FVRDIHKQPVSQSLVRAIVAVAQALNLQVIAEGVESAKEDAFLTK 630
+ +H + V I L++Q + V+ K A LTK
Sbjct: 483 LIPFVHHELTKGIAVAEISFDESILSIQELMYQVKEEKFQADLTK 527


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_447  DHBDHDRGNASE  50     1e-09    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 50.4 bits (120), Expect = 1e-09
Identities = 51/260 (19%), Positives = 98/260 (37%), Gaps = 22/260 (8%)

Query: 4 LSGKRILVTGVASKLSIAYGIAQAMHREGAEL-AFTYQNDKLKGRVEEFAAQLGSDIVLQ 62
+ GK +TG A I +A+ + +GA + A Y +KL+ V A+
Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63

Query: 63 CDVAEDASIDTMFAELGKVWPKFDGFVHSIGF---APGDQLDGDYVNAVTREGFKIAHDI 119
DV + A+ID + A + + D V+ G L + A F +
Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT----FSVN--- 116

Query: 120 SSYSFVAMAKACRSMLNP-GSALLTLSYLGAERAIPNYNVMGLAKASLEANVRYMANAMG 178
S+ F A + M++ +++T+ A + +KA+ + + +
Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176

Query: 179 PEGVRVNAISAGPIRTLAASGI--------KDFRKMLAHCEAVTPIRRTVTIEDVGNSAA 230
+R N +S G T + + + L + P+++ D+ ++
Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236

Query: 231 FLCSDLSAGISGEVVHVDGG 250
FL S + I+ + VDGG
Sbjct: 237 FLVSGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_448  HTHTETR  55     7e-12    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 55.4 bits (133), Expect = 7e-12
Identities = 17/65 (26%), Positives = 33/65 (50%)

Query: 8 MTSKLEIRHKQRQDEIINAARRCFRRCGFHAASMSQIASEAQLSVGQIYRYFANKDAIIE 67
M K + ++ + I++ A R F + G + S+ +IA A ++ G IY +F +K +
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 68 EMVRR 72
E+
Sbjct: 61 EIWEL 65


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
APECO1_449  RTXTOXIND  48     4e-08    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 47.9 bits (114), Expect = 4e-08
Identities = 19/70 (27%), Positives = 39/70 (55%), Gaps = 1/70 (1%)

Query: 52 PVSVVSELTGR-TSAALSAEVRPQVGGIIQKRLFKEGDLVKAGQPLYQIDAASYQAAWNE 110
V +V+ G+ T + S E++P I+++ + KEG+ V+ G L ++ A +A +
Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138

Query: 111 ARAALQQAQA 120
+++L QA+
Sbjct: 139 TQSSLLQARL 148



Score = 30.6 bits (69), Expect = 0.010
Identities = 15/116 (12%), Positives = 32/116 (27%), Gaps = 9/116 (7%)

Query: 94 QPLYQIDAASYQAAWN--EARAALQQAQALVKADCQKAQRYTRLVKENGVSQQDADDAQS 151
L A + A + K+ ++ + KE Q ++
Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKN 298

Query: 152 TCAQDKASVEAKKAALET----ARINLDWTTVTAPISGRI-GISSVTPGALVTASQ 202
L + + AP+S ++ + T G +VT ++
Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_450  ACRIFLAVINRP  1161   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1161 bits (3004), Expect = 0.0
Identities = 584/1033 (56%), Positives = 760/1033 (73%), Gaps = 6/1033 (0%)

Query: 3 SRFFVRRPVFAWVIAILIMLAGILAIRTLPVAQYPDVAPPTIKVSATYTGASAETLENSV 62
+ FF+RRP+FAWV+AI++M+AG LAI LPVAQYP +APP + VSA Y GA A+T++++V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 63 TQVIEQQLTGLDNLLYFSSTSSSDGSVSINVTFEQGTDPDTAQVQVQNKIQQAESRLPSE 122
TQVIEQ + G+DNL+Y SSTS S GSV+I +TF+ GTDPD AQVQVQNK+Q A LP E
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 123 VQQTGVTVEKSQSNFLLIAAVYDTTDKASSSDIADWLVSNVQDPLARVEGVGSLQVFGAE 182
VQQ G++VEKS S++L++A + DI+D++ SNV+D L+R+ GVG +Q+FGA+
Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181

Query: 183 YAMRIWLDPAKLASYSLMPSDVQSAIEAQNVQVTAGKIGALPSPNTQQLTATVRAQSRLQ 242
YAMRIWLD L Y L P DV + ++ QN Q+ AG++G P+ QQL A++ AQ+R +
Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 243 TVDQFKNIIVKSQSDGAVVRIKDVARVEMGSEDYTAIGKLNGHPSAGVAVMLSPGANALN 302
++F + ++ SDG+VVR+KDVARVE+G E+Y I ++NG P+AG+ + L+ GANAL+
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 303 TATLVKDKIAEFQRNMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIVLVVCVMYLFLQN 362
TA +K K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 363 LRATLIPALAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422
+RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSITIISAMLLS 482
++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFSITI+SAM LS
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 483 VVVALTLTPALCGSVL----QHVPPHKKGFFGAFNRFYRRTEDKYQRGVIYVLRRAARTM 538
V+VAL LTPALC ++L +K GFFG FN + + + Y V +L R +
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 539 GLYVVLGGGMALMMWKLPGSFLPTEDQGEIMVQYTLPAGATAARTAEVNRQIVDWFLINE 598
+Y ++ GM ++ +LP SFLP EDQG + LPAGAT RT +V Q+ D++L NE
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 599 KANTDVIFTVDGFSFSGSGQNTGMAFVSLKNWSQRKGAENTAQAIALRATKELGTIRDAT 658
KAN + +FTV+GFSFSG QN GMAFVSLK W +R G EN+A+A+ RA ELG IRD
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 659 VFAMTPPAVDGLGQSNGFTFELLANGGADRETLLQMRNQLIEKANQSP-ELHSVRANDLP 717
V PA+ LG + GF FEL+ G + L Q RNQL+ A Q P L SVR N L
Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721

Query: 718 QMPQLQVDIDSNKAVSLGLSLNDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGDSEFRSA 777
Q ++++D KA +LG+SL+D+ T+S+A GGTYVNDFIDRGRVKK+Y+Q D++FR
Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781

Query: 778 PSDLGKWFVRGSDNAMTPFSAFATTRWLYGPERLVRYNGSAAYEIQGENATGFSSGDAMT 837
P D+ K +VR ++ M PFSAF T+ W+YG RL RYNG + EIQGE A G SSGDAM
Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841

Query: 838 KMEELANSLPAGTTWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAALYESWSVPFSVI 897
ME LA+ LPAG + W+G+S QE+L+ QA +L A+S +VVFLCLAALYESWS+P SV+
Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901

Query: 898 LVIPLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFA-EAAVAEGYSLSR 956
LV+PLG++G LAA + + NDVYF V LLTTIGLS+KNAILIVEFA + EG +
Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961

Query: 957 AALRAAQTRLRPIIMTSLAFIAGVMPLAIATGAGANSRIAIGTGIIGGTLTATLLAIFFV 1016
A L A + RLRPI+MTSLAFI GV+PLAI+ GAG+ ++ A+G G++GG ++ATLLAIFFV
Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021

Query: 1017 PLFFVLVKRLFAG 1029
P+FFV+++R F G
Sbjct: 1022 PVFFVVIRRCFKG 1034



Score = 75.3 bits (185), Expect = 1e-15
Identities = 53/330 (16%), Positives = 117/330 (35%), Gaps = 19/330 (5%)

Query: 721 QLQVDIDSNKAVSLGLSLNDVTDTLSSA----WGGTYVNDFIDRGRVKKVYIQGDSEFRS 776
+++ +D++ L+ DV + L G G+ I + F++
Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242

Query: 777 APSDLGKWFVRGSDN-AMTPFSAFATTRWLYGPER--LVRYNGSAA-----YEIQGENAT 828
P + GK +R + + ++ A L G + R NG A G NA
Sbjct: 243 -PEEFGKVTLRVNSDGSVVRLKDVARVE-LGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 829 GFSSGDAMTKMEELANSLPAG--TTWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAAL 886
+ K+ EL P G + + + +L+ +LV + +
Sbjct: 301 DTAKA-IKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV-MYLF 358

Query: 887 YESWSVPFSVILVIPLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFAEA 946
++ + +P+ LLG + + ++ IGL +AI++VE E
Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418

Query: 947 AVAEGYSLSRAALRAAQTRLR-PIIMTSLAFIAGVMPLAIATGAGANSRIAIGTGIIGGT 1005
+ E + A + ++++ ++ ++ A +P+A G+ I+
Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478

Query: 1006 LTATLLAIFFVPLFFVLVKRLFAGKPRRQE 1035
+ L+A+ P + + + + +
Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENK 508


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
APECO1_451  RTXTOXIND  30     0.026    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 29.8 bits (67), Expect = 0.026
Identities = 24/166 (14%), Positives = 49/166 (29%), Gaps = 11/166 (6%)

Query: 70 DVQKAIADIDSARALYGQTNASLFPTVNAALSSTRSRSLANGTVTTAEADGTVSSYTLDL 129
A AD ++ Q +RS L + + + +
Sbjct: 128 TALGAEADTLKTQSSLLQARL----EQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183

Query: 130 FGRNQSLSRAARETWLASEFTAQNTRLTLIAEISTAWLTLAADNSNLALAKETMASAENS 189
R SL + TW ++ + AE T + + + ++
Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTV-------LARINRYENLSRVEKSR 236

Query: 190 LKIIQRQQQVGTAAATDVSEAMSVYQQARASVASYQTQVMQDKNAL 235
L A V E + Y +A + Y++Q+ Q ++ +
Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_452  TCRTETA  68     1e-14    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 67.9 bits (166), Expect = 1e-14
Identities = 68/312 (21%), Positives = 114/312 (36%), Gaps = 18/312 (5%)

Query: 5 SLSWALILGLLAGIGPMCTDLYLPALPEMSEQLAATTTITQLTLTASLIGLGVGQLLFGP 64
L L L +G L +P LP + L + +T L + Q P
Sbjct: 6 PLIVILSTVALDAVG---IGLIMPVLPGLLRDLVHSNDVTA-HYGILLALYALMQFACAP 61

Query: 65 ----LSDKIGRKRPLILSLLLFIVSSILCATTNNIYWLVVWRFIQGIAGAGGSVLSRSIA 120
LSD+ GR+ L++SL V + AT ++ L + R + GI GA G+V IA
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121

Query: 121 RDKYQGVTLTQFFALLMTVNGLAPVLSPVLGGYIVSTFDWRTLFWVMAEISTVLLLGCLL 180
D G + F + G V PVLGG + F F+ A ++ + L
Sbjct: 122 -DITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFLTGCF 179

Query: 181 FINETLPENKRGSSL----LLTGRSVVQNRRFMRFCLIQSFMLAGLFAYIGSSSFVL--Q 234
+ E+ +R L + + + F + L + ++ +V+ +
Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF-IMQLVGQVPAALWVIFGE 238

Query: 235 KEFGFSPMQFSLVFGLNGI-GLIIASWIFSRLARRINAMTLLRGGLIAAILCALLTVLCA 293
F + + GI + + I +A R+ L G+IA +L
Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298

Query: 294 WVQLPIPALVAL 305
+ P +V L
Sbjct: 299 RGWMAFPIMVLL 310


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_454  HTHFIS  31     0.007    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 31.0 bits (70), Expect = 0.007
Identities = 9/16 (56%), Positives = 14/16 (87%)

Query: 38 LVGESGSGKSLIAKAI 53
+ GESG+GK L+A+A+
Sbjct: 165 ITGESGTGKELVARAL 180


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_458  HTHFIS  343    e-118    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 343 bits (882), Expect = e-118
Identities = 126/341 (36%), Positives = 183/341 (53%), Gaps = 23/341 (6%)

Query: 11 DNLLGEANSFLEVLEQVSHLAPLDKPVLIIGERGTGKELIASRLHYLSSRWQGPFISLNC 70
L+G + + E+ ++ L D ++I GE GTGKEL+A LH R GPF+++N
Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196

Query: 71 AALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMMVQEKLLRVIE 130
AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLRV++
Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256

Query: 131 YGELERVGGSQPLQVNVRLVCATNADLPAMVNEGTFRADLLDRLAFDVVQLPPLRERESD 190
GE VGG P++ +VR+V ATN DL +N+G FR DL RL ++LPPLR+R D
Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316

Query: 191 IMLMAEHFAIQMCREIKLPLFPGFTEHARETLLNYRWPGNIRELKNVVERSVYRHGTSDY 250
I + HF Q +E F + A E + + WPGN+REL+N+V R +
Sbjct: 317 IPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374

Query: 251 PLDDIIID---PFKRRPSEEAIAVSENTSLPTLPLD------------------LREFQM 289
+ I + P E+A A S + S+ +
Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434

Query: 290 QQEKELLQLSLQQGKYNQKRAAELLGLTYHQFRALLKKHQI 330
+ E L+ +L + NQ +AA+LLGL + R +++ +
Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475


76  APECO1_537  APECO1_550  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_537  -1      29       -3.695657  outer membrane protein of prophage
APECO1_538  1       35       -5.377776  tail fiber protein
APECO1_539  5       45       -8.223649  hypothetical protein
APECO1_540  6       45       -8.400488  hypothetical protein
APECO1_541  6       42       -8.655493  hypothetical protein
APECO1_542  7       43       -9.059530  cytolethal distending toxin type IV subunit A
APECO1_543  5       40       -8.650050  cytolethal distending toxin type IV subunit B
APECO1_544  5       34       -7.874889  cytolethal distending toxin type IV subunit C
APECO1_545  3       27       -6.807494  hypothetical protein
APECO1_546  -3      14       -1.824092  outer membrane protease
APECO1_547  -2      14       -0.723341  hypothetical protein
APECO1_548  -2      14       -0.623154  integrase
APECO1_549  -2      14       -0.698963  filament protein
APECO1_550  -2      10       -0.199127  outer membrane pore protein N, non-specific
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_537  ENTEROVIROMP  149    3e-48    Enterobacterial virulence outer membrane protein si...
		>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature.

Length = 171

Score = 149 bits (378), Expect = 3e-48
Identities = 68/200 (34%), Positives = 101/200 (50%), Gaps = 30/200 (15%)

Query: 1 MRKLCAVILSAVVWLVAAGTPASAAEHQSTLSAGYLQSHTDMPGNDDLKGVNVKYRYEFT 60
M+K+ + A V AGT +A ST++ GY QS N + G N+KYRYE
Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA---TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEED 56

Query: 61 DT-LGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSMMAGPSVRVNEWFSAYAMAGV 119
++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + GV
Sbjct: 57 NSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGV 108

Query: 120 AYSRVSTFSGDYLRVTDNKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYE 179
Y + T T+ HD S+ ++GAG+QFNP E+VA+D +YE
Sbjct: 109 GYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSYE 151

Query: 180 GSGSGDWRTDGFIVGVGYKF 199
S +I GVGY+F
Sbjct: 152 QSRIRSVDVGTWIAGVGYRF 171


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_538  IGASERPTASE  46     6e-07    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 45.8 bits (108), Expect = 6e-07
Identities = 28/192 (14%), Positives = 62/192 (32%), Gaps = 7/192 (3%)

Query: 103 PEALRRFEEMVEEAARNAEAASQSAAAAKKSETAAASSKNAAKTSETNAANSAQAAAASQ 162
+ + E+ E + E ++ + K E + A E + + + +
Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT 1162

Query: 163 TASANSATAAKKSETNAKNSETAAKTSETNAKSSQTAAKTSETNAKA---SETAAKNSQN 219
+A++ AK++ +N + T + T T + T+ + SE++ K
Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222

Query: 220 AAAESESAAAGSATSAAGSATAAANSQKAAKTSETNAKSSQTAAKTS----ETNAKASET 275
S + S + + ++ TNA S AK S+
Sbjct: 1223 HRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQH 1282

Query: 276 AAKNSQDAAAQS 287
++ + Q
Sbjct: 1283 ISQLEMNNEGQY 1294



Score = 42.4 bits (99), Expect = 9e-06
Identities = 28/152 (18%), Positives = 50/152 (32%), Gaps = 13/152 (8%)

Query: 222 AESESAAAGSATSAAGSATAAANSQKAAKTSETNAKSSQTAAKTSETNAKASETAAKNSQ 281
+ + S + A +A A S+T +E + + S+T KN Q
Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056

Query: 282 DA------------AAQSESAAAGSASAAASSATASANSQKAAKTSETNAKASETAAANS 329
DA A+S A + A S + + +Q + E A +
Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116

Query: 330 AKASAASQTAAKASEDAAREYASQ-AAEPYKQ 360
K + ++ S + Q AEP ++
Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARE 1148


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score  E-value  Comments
APECO1_542  cdtoxina  308    e-109    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 308 bits (791), Expect = e-109
Identities = 86/250 (34%), Positives = 132/250 (52%), Gaps = 21/250 (8%)

Query: 5 LIAFLCTLIITGCSDG--------------IGDSPSPPGKNVELVGIPGQGVAVASNGTS 50
+ L +++ GCS G + P+ P + + +PG G A+ +NG
Sbjct: 10 IAGILIPILLNGCSSGKNKAYLDPKVFPPQVEGGPTVPSPDEPGLPLPGPGPALPTNGAI 69

Query: 51 PTFGSNSTDFPDVSIMSTGGAMLTVWARPVRNWLWGYTPFDSVSFGENRNWKVVDGKDAG 110
P + VS+M+ G++LT+W+R + LW Y DS SFGE RNW+++ G
Sbjct: 70 PIPEPGTAPA--VSLMNMDGSVLTMWSRGAGSSLWAYYIGDSNSFGELRNWQIMPGTRPN 127

Query: 111 TVKFVNVAQGTCMEAFK-----NGVIHNTCDDNSLSQEFQLLPSTNGNVLIRSSALQTCI 165
T++F NV GTCM +F + C +FQ + + NGN ++S + CI
Sbjct: 128 TIQFRNVDVGTCMTSFPGFKGGVQLSTAPCKFGPERFDFQPMATRNGNYQLKSLSTGLCI 187

Query: 166 RADYLSRTILSPFAFTITLEKCPGAKEETQEMLWAISPPVRAAKPNLIKPELRPFRPLPI 225
RA++L RT SP+A T+T+E+CP + E+ E +W+IS P+R A + KPE+RPF P PI
Sbjct: 188 RANFLGRTPSSPYATTLTMERCPSSGEKNFEFMWSISEPLRPALATIAKPEIRPFPPQPI 247

Query: 226 PPHDKPDGME 235
P + G E
Sbjct: 248 EPDEHSTGGE 257


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score  E-value  Comments
APECO1_543  cdtoxinb  431    e-157    Cytolethal distending toxin B signature.
		>cdtoxinb#Cytolethal distending toxin B signature.

Length = 269

Score = 431 bits (1109), Expect = e-157
Identities = 150/264 (56%), Positives = 185/264 (70%)

Query: 2 KKLLFLLMILPGISFADLSDFKVATWNLQGSNAPTENKWNTHVRQLVTGSGAVDILMVQE 61
K ++ L++ L + ADL+DF+VATWNLQG++A TE+KWN +VRQL++G AVDIL VQE
Sbjct: 3 KYIISLIVFLSFYAQADLTDFRVATWNLQGASATTESKWNINVRQLISGENAVDILAVQE 62

Query: 62 AGSIPSSATLTEREFRTPGIPMNEYIWNTGTNSRPQQLFIYFSRTDALSNRVNLAIVSNR 121
AGS PS+A T +PGIP+ E IWN TNSRPQQ++IYFS DAL RVNLA+VSNR
Sbjct: 63 AGSPPSTAVDTGTLIPSPGIPVRELIWNLSTNSRPQQVYIYFSAVDALGGRVNLALVSNR 122

Query: 122 RADEVIVLSPPTVASRPIIGIRIGNDVFFSTHALANRGIDSGAIVNSVFEFFNRQTDPIR 181
RADEV VLSP RP++GIRIGND FF+ HA+A R D+ A+V V+ FF DP+
Sbjct: 123 RADEVFVLSPVRQGGRPLLGIRIGNDAFFTAHAIAMRNNDAPALVEEVYNFFRDSRDPVH 182

Query: 182 QAANWMIAGDFNRSPAMLFSTLEPGIRNHVNIIAPPDPTQASGGVLDYAVVGNSVSFVLP 241
QA NWMI GDFNR PA L L +R II+P TQ S LDYAV GNSV+F
Sbjct: 183 QALNWMILGDFNREPADLEMNLTVPVRRASEIISPAAATQTSQRTLDYAVAGNSVAFRPS 242

Query: 242 LLRASLLFGLLRGQIASDHFPVGF 265
L+A +++G R QI+SDHFPVG
Sbjct: 243 PLQAGIVYGARRTQISSDHFPVGV 266


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits      Score  E-value  Comments
APECO1_544  cdtoxina  41     1e-06    Cytolethal distending toxin A signature.
		>cdtoxina#Cytolethal distending toxin A signature.

Length = 258

Score = 40.8 bits (95), Expect = 1e-06
Identities = 24/109 (22%), Positives = 39/109 (35%), Gaps = 17/109 (15%)

Query: 87 GHVQIKNPDGNECL----AILNGQLAVAKQCTESNRNALFTFITSETGAVQIKSIGNGQC 142
+Q +N D C+ G C F + + G Q+KS+ G C
Sbjct: 127 NTIQFRNVDVGTCMTSFPGFKGGVQLSTAPCKFGPERFDFQPMATRNGNYQLKSLSTGLC 186

Query: 143 -----LGNGESV---TDFKLTKCVNDLSRPFDTVSPGLLWMLNPPLSPA 183
LG S T + +C + + F+ +W ++ PL PA
Sbjct: 187 IRANFLGRTPSSPYATTLTMERCPSSGEKNFE-----FMWSISEPLRPA 230


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_546  OMPTIN  507    0.0      Omptin serine protease signature.
		>OMPTIN#Omptin serine protease signature.

Length = 317

Score = 507 bits (1308), Expect = 0.0
Identities = 288/317 (90%), Positives = 301/317 (94%), Gaps = 1/317 (0%)

Query: 1 MRAKLLGIVLTTPIAICSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGQKVS 60
MRAKLLGIVLTTPIAI SFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGG+KVS
Sbjct: 1 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60

Query: 61 QLDWKFNNAAIIKGAINWDLMPQVSVGAAGWTTLGQKGGNMIDRDWQDPDKPGIWTDESR 120
QLDWKFNNAAIIKGAINWDLMPQ+S+GAAGWTTLG +GGNM+D+DW D PG WTDESR
Sbjct: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESR 120

Query: 121 HPDTRLNFANEFDLNIKGWLLNESNYRLGLMAGYQESRYSFTARGGSYIYSDE-GFRDDI 179
HPDT+LN+ANEFDLNIKGWLLNE NYRLGLMAGYQESRYSFTARGGSYIYS E GFRDDI
Sbjct: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180

Query: 180 GSIPNGERVIGYKQRFKMPYIGLTGSYRYEDFEFGGTFKYSGWVEASDNDEHYARVKRIT 239
GS PNGER IGYKQRFKMPYIGLTGSYRYEDFE GGTFKYSGWVE+SDNDEHY KRIT
Sbjct: 181 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKRIT 240

Query: 240 YRSKVKDQNYYSIAVNAGYYVTPNAKVYVEGAWNRVTNKKGDTSLYDHNDNTSEYSKNGA 299
YRSKVKDQNYYS+AVNAGYYVTPNAKVYVEGAWNRVTNKKG+TSLYDHN+NTS+YSKNGA
Sbjct: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYSKNGA 300

Query: 300 GIENYNFITTAGLKYTF 316
GIENYNFITTAGLKYTF
Sbjct: 301 GIENYNFITTAGLKYTF 317


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_550  ECOLIPORIN  581    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 581 bits (1500), Expect = 0.0
Identities = 312/388 (80%), Positives = 337/388 (86%), Gaps = 16/388 (4%)

Query: 1 MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLG 60
MK KVLAL+IPALLAAGAAHAAE+YNKDGNKLDLYGKVDGLHYFSD+S+KDGDQ+Y R+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQINDQLTGYGQWEYNIQANNTESSTNQSWTRLAFAGLKFADYGSFDYGRNYGVMY 120
FKGETQINDQLTGYGQWEYN+QAN TE SWTRLAFAGLKF DYGSFDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 121 DIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFFGLVNGLNFAVQYQGNNEGAS 180
D+EGWTDMLPEFGGDSYT ADN+MTGRANGVATYRNTDFFGLV+GLNFA+QYQG NE S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 181 N-----GQEGTNNGRDVRHENGDGWGLSTTYDLGMGFSAGAAYTSSDRTNDQVNH--TAA 233
G NNG D+R++NGDG+G+STTYD+GMGFSAGAAYT+SDRTN+QVN T A
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240

Query: 234 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPFGDS----DYAVANKTQNFEVTAQYQF 289
GGDKADAWTAGLKYDANNIYLATMYSETRNMTP+G + D VANKTQNFEVTAQYQF
Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300

Query: 290 DFGLRPAVSFLMSKGRDLHAAGGADNPAGVDDKDLVKYADVGATYYFNKNMSTYVDYKIN 349
DFGLRPAVSFLMSKG+DL N DDKDLVKYADVGATYYFNKN STYVDYKIN
Sbjct: 301 DFGLRPAVSFLMSKGKDL-----TYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 350 LLDEDDSFYAANGISTDDIVALGLVYQF 377
LLD+DD FY GISTDDIVALG+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


77  APECO1_929  APECO1_938  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_929  0       13       1.649386   flagellar biosynthesis protein FlhB
APECO1_930  0       12       1.133327   chemotaxis regulator CheZ
APECO1_931  0       10       1.133572   chemotaxis regulatory protein CheY
APECO1_932  0       10       1.385956   chemotaxis-specific methylesterase
APECO1_933  0       10       1.106633   chemotaxis methyltransferase CheR
APECO1_934  -1      12       0.629834   methyl-accepting chemotaxis protein II
APECO1_935  -1      12       0.237521   purine-binding chemotaxis protein
APECO1_936  0       13       -0.362117  chemotaxis protein CheA
APECO1_937  0       14       -1.657378  flagellar motor protein MotB
APECO1_938  0       14       -1.976692  flagellar motor protein MotA
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_929  TYPE3IMSPROT  424    e-151    Type III secretion system inner membrane S protein ...
		>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature.

Length = 354

Score = 424 bits (1093), Expect = e-151
Identities = 95/346 (27%), Positives = 178/346 (51%), Gaps = 2/346 (0%)

Query: 5 SDDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVSVIWFGGVSLARRLSGMLSAGL 64
S +KTE PTP ++ AR++GQ+ +S+E+ S +++ +++ S ++ +
Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--I 59

Query: 65 HFDHSIINDPNLILGQIILLIREAMLALLPLISGVVLVAIISPVMLGGLVFSGKSLQPKF 124
+ S + + + ++ E PL++ L+AI S V+ G + SG++++P
Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119

Query: 125 SKLNPLPGIKRMFSAQTGAELLKAILKTILVGSVTGFFLWHHWPQMMRLMAESPITAMGN 184
K+NP+ G KR+FS ++ E LK+ILK +L+ + + + +++L
Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179

Query: 185 AMDLVGLCALLVVLGVIPMVGFDVFFQIFSHLKKLRMSRQDIRDEFKQSEGDPHVKGRIR 244
++ ++ +G + + D F+ + ++K+L+MS+ +I+ E+K+ EG P +K + R
Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239

Query: 245 QMQRAAARRRMMADVPKADVIVNNPTHYSVALQYDENKMSAPKVVAKGAGLVALRIREIG 304
Q + R M +V ++ V+V NPTH ++ + Y + P V K +R+I
Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299

Query: 305 AENNVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWVWQLK 350
E VP L+ PLARALY A + IP + A AEVL W+ +
Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_931  HTHFIS  88     9e-24    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 88.3 bits (219), Expect = 9e-24
Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%)

Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGLDALNKLQAGGYGFVISDWNMPNMDGL 66
LV DD + +R ++ L G++ V + + AG V++D MP+ +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111
+LL I+ LPVL+++A+ I A++ GA Y+ KPF
Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits    Score  E-value  Comments
APECO1_932  HTHFIS  66     3e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 66.4 bits (162), Expect = 3e-14
Identities = 35/188 (18%), Positives = 73/188 (38%), Gaps = 23/188 (12%)

Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60
M+ +L DD A +R ++ + ++ V + I + D++ DV MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119
+ D L ++ + RP V+V ++ + + ++A E GA D++ KP + E +
Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115

Query: 120 SEMIAEKVRTAAKASLAAHKPLSVPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179
+AE R +K + + + +G S E R + + +
Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPL-----------------VGRSAAMQEIYRVLARLMQ 158

Query: 180 LSSPALLI 187
++
Sbjct: 159 TDLTLMIT 166


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_936  PF06580  40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 40.2 bits (94), Expect = 2e-05
Identities = 22/151 (14%), Positives = 48/151 (31%), Gaps = 52/151 (34%)

Query: 374 ELDKSLIERIIDPLT--HLVRNSLDHGIELPEKRLAAGKNSVGNLILSAEHQGGNICIEV 431
+++ ++++ + P+ LV N + HGI G ++L G + +EV
Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296

Query: 432 TDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSTAEQVTDVSGRGVGMDVV 491
+ G+ + G G+ V
Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318

Query: 492 KRNIQEMGG---HVEIQSMQGTGTTIRILLP 519
+ +Q + G +++ QG +L+P
Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_937  PF05272  31     0.009    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.8 bits (69), Expect = 0.009
Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 11/93 (11%)

Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIEE 105
L +SSP A P + G + ++ PGGGDD GE +++
Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435

Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135
R+ + L+ R L + + S P L
Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_938  PF05844  33     0.001    YopD protein
		>PF05844#YopD protein

Length = 295

Score = 32.7 bits (74), Expect = 0.001
Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%)

Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103
++LL +L+R+ K+R++G+ L+RD EN
Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99


78  APECO1_970  APECO1_989  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag    DNBias  CDNBias  %GCBias    Product
APECO1_970  2       26       -5.043954  hypothetical protein
APECO1_971  4       33       -6.764610  hypothetical protein
APECO1_972  1       30       -5.768015  outer membrane porin protein NmpC
APECO1_973  -1      18       -2.862572  transcriptional regulator
APECO1_974  1       14       0.967704   kinase inhibitor
APECO1_975  0       16       3.711728   multidrug efflux protein
APECO1_976  2       19       4.581006   flagellar hook-basal body protein FliE
APECO1_977  1       16       4.315441   flagellar M-ring protein
APECO1_978  2       18       4.362547   flagellar motor switch protein G
APECO1_979  -1      18       3.658716   flagellar assembly protein H
APECO1_980  -1      19       3.411008   flagellum-specific ATP synthase
APECO1_981  -1      16       2.276920   flagellar biosynthesis chaperone
APECO1_982  -1      16       2.241586   flagellar hook-length control protein
APECO1_983  -2      21       1.762891   flagellar basal body-associated protein FliL
APECO1_984  0       17       0.456721   flagellar motor switch protein FliM
APECO1_985  1       16       -2.483996  flagellar motor switch protein FliN
APECO1_986  0       18       -3.273599  flagellar biosynthesis protein FliO
APECO1_987  0       18       -3.020900  flagellar biosynthesis protein FliP
APECO1_988  -2      16       -2.419606  flagellar biosynthesis protein FliQ
APECO1_989  -2      14       -1.791067  flagellar biosynthesis protein FliR
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits       Score  E-value  Comments
APECO1_970  RTXTOXIND  30     0.017    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 30.2 bits (68), Expect = 0.017
Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%)

Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218
R L R + + + A L + P R R M ++L +
Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits        Score  E-value  Comments
APECO1_972  ECOLIPORIN  510    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 510 bits (1314), Expect = 0.0
Identities = 240/388 (61%), Positives = 282/388 (72%), Gaps = 33/388 (8%)

Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYVR 60
MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY+R
Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58

Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120
+GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG
Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180
V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNDQVIYGN 223
D+ NGDGFG STTY+ GF GA Y SDRTN+QV G
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277
A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV
Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295

Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLVEYIDVGATYYFNKNMSTFVDYKIN 333
AQYQFDFGLRP+V++L SKGKDL D+DLV+Y DVGATYYFNKN ST+VDYKIN
Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355

Query: 334 LIDKSD-FTKASGVATDDIVAVGLVYQF 360
L+D D F K +G++TDDIVA+G+VYQF
Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_976  FLGHOOKFLIE  117    8e-38    Flagellar hook-basal body complex protein FliE signa...
		>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature.

Length = 103

Score = 117 bits (293), Expect = 8e-38
Identities = 102/103 (99%), Positives = 102/103 (99%)

Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTVARTQAEKFTL 61
SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQT ARTQAEKFTL
Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60

Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104
GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV
Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_977  FLGMRINGFLIF  459    e-162    Flagellar M-ring protein signature.
		>FLGMRINGFLIF#Flagellar M-ring protein signature.

Length = 559

Score = 459 bits (1183), Expect = e-162
Identities = 287/324 (88%), Positives = 304/324 (93%)

Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62
+TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA
Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64

Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122
IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ
Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124

Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182
FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA
Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184

Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242
LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI
Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244

Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302
QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G
Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304

Query: 303 SGYPGGVPGALSNQPAPANNAPIS 326
+GYPGGVPGALSNQPAP N API+
Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIA 328


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_978  FLGMOTORFLIG  341    e-119    Flagellar motor switch protein FliG signature.
		>FLGMOTORFLIG#Flagellar motor switch protein FliG signature.

Length = 344

Score = 341 bits (876), Expect = e-119
Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%)

Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60
+S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+
Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71

Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120
+ + DY R +L K+LG ++A ++ + L + + E + +P + +
Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130

Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180
I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL
Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190

Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239
L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++
Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250

Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299
V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R
Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310

Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328
VE Q+ I+ ++R+L E GE+VI G +
Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_979  FLGFLIH  369    e-133    Flagellar assembly protein FliH signature.
		>FLGFLIH#Flagellar assembly protein FliH signature.

Length = 228

Score = 369 bits (948), Expect = e-133
Identities = 224/228 (98%), Positives = 226/228 (99%)

Query: 1 MSDNLPWKTWMPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60
MSDNLPWKTW PDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI
Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60

Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120
AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL
Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120

Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180
MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT
Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180

Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPRVV 228
LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAP VV
Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_981  FLGFLIJ  202    4e-70    Flagellar FliJ protein signature.
		>FLGFLIJ#Flagellar FliJ protein signature.

Length = 147

Score = 202 bits (514), Expect = 4e-70
Identities = 145/147 (98%), Positives = 147/147 (100%)

Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG
Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60

Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQKRQST 120
+TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQ+RQST
Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120

Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
AALLAENRLDQKKMDEFAQRAAMRKPE
Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits         Score  E-value  Comments
APECO1_982  FLGHOOKFLIK  468    e-168    Flagellar hook-length control protein signature.
		>FLGHOOKFLIK#Flagellar hook-length control protein signature.

Length = 375

Score = 468 bits (1206), Expect = e-168
Identities = 366/375 (97%), Positives = 370/375 (98%)

Query: 1 MIRLAPLITANVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60
MIRLAPLITA+VDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK
Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60

Query: 61 GEPLVSDIVSDAQQADLLIPVDETLPVINDEQSTSTPLTTAQTMTLAAVADKNTTKDEKA 120
GEPL+SDIVSDAQQA+LLIPVDET PVINDEQSTSTPLTTAQTM LAAVADKNTTKDEKA
Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120

Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180
DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP
Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180

Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTADASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240
GTPAQPLTPLVAEAQSKAEVISTPSPVTA ASPLITPHQTQPLPTVAAPVLSAPLGSHEW
Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240

Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMISPHQHVRAALEAA 300
QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQM+SPHQHVRAALEAA
Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300

Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360
LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS
Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360

Query: 361 LQGRVTGNSGVDIFA 375
LQGRVTGNSGVDIFA
Sbjct: 361 LQGRVTGNSGVDIFA 375


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_984  FLGMOTORFLIM  385    e-136    Flagellar motor switch protein FliM signature.
		>FLGMOTORFLIM#Flagellar motor switch protein FliM signature.

Length = 344

Score = 385 bits (989), Expect = e-136
Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%)

Query: 20 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 77
+LSQ EID LL S + E +S I YD + +E+++ L +++E FA
Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63

Query: 78 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 137
R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+
Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123

Query: 138 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 197
F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L +
Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181

Query: 198 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 255
E +F I P+++VV ++G G N C+P+ IEP+ L + +S R
Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240

Query: 256 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 312
+ + L ++ +++VA + L + IL L+ GD++ + D + +
Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300

Query: 313 GVPVLTSQYGTLNGQYALRIEHLI 336
Q G + + A +I I
Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_985  FLGMOTORFLIN  210    5e-74    Flagellar motor switch protein FliN signature.
		>FLGMOTORFLIN#Flagellar motor switch protein FliN signature.

Length = 137

Score = 210 bits (537), Expect = 5e-74
Identities = 126/137 (91%), Positives = 134/137 (97%)

Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSGKSAADAVFQQFGGGDVSGALQDIDLIMDI 60
MSDMNNP+D+N GA+DDLWA+AL+EQK+T+ KSAADAVFQQ GGGDVSGA+QDIDLIMDI
Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60

Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120
PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV
Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120

Query: 121 RITDIITPSERMRRLSR 137
RITDIITPSERMRRLSR
Sbjct: 121 RITDIITPSERMRRLSR 137


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_987  FLGBIOSNFLIP  335    e-119    Escherichia coli: Flagellar biosynthetic protein Fl...
		>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature.

Length = 245

Score = 335 bits (860), Expect = e-119
Identities = 242/245 (98%), Positives = 244/245 (99%)

Query: 1 MRRLLSVAPVLLWLVTPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60
MRRLLSVAPVLLWL+TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM
Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60

Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFNEEK 120
MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPF+EEK
Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120

Query: 121 ISMQEALEKGEQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180
ISMQEALEKG QPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK
Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180

Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240
TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA
Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240

Query: 241 QSFYS 245
QSFYS
Sbjct: 241 QSFYS 245


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_988  TYPE3IMQPROT  67     1e-18    Type III secretion system inner membrane Q protein ...
		>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature.

Length = 86

Score = 67.1 bits (164), Expect = 1e-18
Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63
+ ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + +
Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61

Query: 64 IAGPWMLNLLLDYVRTLF 81
+ W +LL Y R +
Sbjct: 62 LLSGWYGEVLLSYGRQVI 79


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_989  TYPE3IMRPROT  201    1e-66    Type III secretion system inner membrane R protein ...
		>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature.

Length = 261

Score = 201 bits (514), Expect = 1e-66
Identities = 257/261 (98%), Positives = 261/261 (100%)

Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60
M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA
Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60

Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120
NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL
Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120

Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180
NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF
Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180

Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240
LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC
Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240

Query: 241 EHLFSEMFNLLADIISELPLI 261
EHLFSE+FNLLADIISELPLI
Sbjct: 241 EHLFSEIFNLLADIISELPLI 261


79  APECO1_998  APECO1_1004  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_998   -2      14       -4.502587  DNA cytosine methylase
APECO1_999   -1      22       -6.411510  hypothetical protein
APECO1_1000  0       27       -8.027328  Outer membrane protein N precursor
APECO1_1001  -1      24       -6.868638  Outer membrane protein N precursor
APECO1_1002  0       26       -6.031702  chaperone protein HchA
APECO1_1003  1       30       -7.378464  2-component sensor protein
APECO1_1004  2       28       -6.289280  transcriptional regulatory protein YedW
ORFs having significant similarity with Known Virulence factors
LocusTag    Hits     Score  E-value  Comments
APECO1_998  PF05272  29     0.044    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.044
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%)

Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379
A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG
Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210

Query: 380 WD 381
WD
Sbjct: 211 WD 212


ORFs having significant similarity with Known Virulence factors
LocusTag    Hits          Score  E-value  Comments
APECO1_999  CARBMTKINASE  33     8e-04    Bacterial carbamate kinase signature.
		>CARBMTKINASE#Bacterial carbamate kinase signature.

Length = 314

Score = 32.9 bits (75), Expect = 8e-04
Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%)

Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKDHPQRQRSSILAAEETRRLLREEFVQFPA-- 94
+KLA + + D+ +ILT + +L + Q + EE R+ E F A
Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273

Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124
K+ A I A IA L + ++
Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
APECO1_1001  ECOLIPORIN  444    e-158    E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 444 bits (1144), Expect = e-158
Identities = 205/395 (51%), Positives = 256/395 (64%), Gaps = 36/395 (9%)

Query: 11 MKRKVLAMLVPALLVAGAANAAEIYNKDGNKVDFYGKMVGERIWSNTDDNNSENEDTSYA 70
MKRKVLA+++PALL AGAA+AAEIYNKDGNK+D YGK+ G +S D++S++ D +Y
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFS---DDSSKDGDQTYM 57

Query: 71 RFGVKGETQITSELTGFGQFEYNLDASKPEGE-NQEKTRLTFAGLKYNELGSFDYGRNYG 129
R G KGETQI +LTG+GQ+EYN+ A+ EGE TRL FAGLK+ + GSFDYGRNYG
Sbjct: 58 RVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYG 117

Query: 130 VAYDAAAYTDMLVEWGGDSWASADNFMNGRTNGVATYRNYDFFGLVDGLDFAIQYQGKNS 189
V YD +TDML E+GGDS+ ADN+M GR NGVATYRN DFFGLVDGL+FA+QYQGKN
Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177

Query: 190 NRS----------------TKKQNGDGYALSVDYNI-NGFGIVGAYSKSDRTNDQVA--- 229
++S + NGDG+ +S Y+I GF AY+ SDRTN+QV
Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237

Query: 230 -DGNGSNAELWSLAAKYDANNVYAVVMYGETRNMTPGSIDTGVADREGNTIMRDQLINET 288
G A+ W+ KYDANN+Y MY ETRNMTP + + + N+T
Sbjct: 238 TIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYG--------KTDKGYDGGVANKT 289

Query: 289 QNFEAVVQYQFDFGLRPSLGYVYSKGKDIKGVPGHRYVDADRVNYIEVGTWYYFNKNMNV 348
QNFE QYQFDFGLRP++ ++ SKGKD+ D D V Y +VG YYFNKN +
Sbjct: 290 QNFEVTAQYQFDFGLRPAVSFLMSKGKDL-TYNNVNGDDKDLVKYADVGATYYFNKNFST 348

Query: 349 YTAYKFNMLDKDDA--AITGAAADDQFAVGIVYQF 381
Y YK N+LD DD G + DD A+G+VYQF
Sbjct: 349 YVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits        Score  E-value  Comments
APECO1_1002  SUBTILISIN  28     0.038    Subtilisin serine protease family (S8) signature.
		>SUBTILISIN#Subtilisin serine protease family (S8) signature.

Length = 326

Score = 28.3 bits (63), Expect = 0.038
Identities = 7/29 (24%), Positives = 14/29 (48%)

Query: 160 GLPESEDVAAALQWAIENDRFVISLCHGP 188
G + + + + +AIE +IS+ G
Sbjct: 122 GSGQYDWIIQGIYYAIEQKVDIISMSLGG 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_1004  HTHFIS  84     1e-20    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 84.1 bits (208), Expect = 1e-20
Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%)

Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98
IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + +
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154
+L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L +
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121


80  APECO1_1058  APECO1_1065  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_1058  -1      24       8.226984   yersiniabactin biosynthetic protein
APECO1_1059  0       22       7.293296   yersiniabactin biosynthetic protein
APECO1_1060  -1      17       2.927160   yersiniabactin biosynthetic protein
APECO1_1061  -2      17       0.132377   yersiniabactin biosynthetic protein YbtT
APECO1_1062  -2      18       -0.697691  yersiniabactin siderophore biosynthetic protein
APECO1_1063  -1      18       -2.392699  pesticin/yersiniabactin receptor protein
APECO1_1064  -1      26       -5.349739  hypothetical protein
APECO1_1065  -1      26       -5.391161  autotransporter
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1058  ISCHRISMTASE  51     2e-08    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 51.2 bits (122), Expect = 2e-08
Identities = 22/70 (31%), Positives = 44/70 (62%)

Query: 28 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 87
+ +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+
Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292

Query: 88 AWNQLMLSRS 97
W +L+ +RS
Sbjct: 293 EWQKLLTTRS 302


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1059  DHBDHDRGNASE  46     1e-06    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 45.8 bits (108), Expect = 1e-06
Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%)

Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614
+TGA G+G L +GA + A + L V R DV D
Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68

Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673
+ + + + G I ++ AGVL + L D + A F+V + +++
Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128

Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700
+ G + + S+ A + A S A
Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1064  INTIMIN  75     2e-17    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 75.1 bits (184), Expect = 2e-17
Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%)

Query: 181 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFS 240
QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F
Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1065  INTIMIN  65     4e-13    Intimin signature.
		>INTIMIN#Intimin signature.

Length = 939

Score = 65.5 bits (159), Expect = 4e-13
Identities = 63/344 (18%), Positives = 123/344 (35%), Gaps = 24/344 (6%)

Query: 301 SGGKVRTNSSGQA--------PVVLTSNKVGTYTVTASFHNG-VTIQTQTTVKVTGNPS- 350
GG+++ + S A V + V T A NG + T+ V N
Sbjct: 495 QGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQV 554

Query: 351 --TAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGLTVYFALKSGSTTLTSLTAV 408
V F AD ++ A ++ T ATV+ +G + V F + SG+ L++ +A
Sbjct: 555 VDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSAN 613

Query: 409 TDQNGIATTSVKGAITGSVTVSTVTSAGGMQTVDISLVAVPADASQSILKNNQSSLKGDF 468
T+ +G AT ++K V + +A ++ + V SI +
Sbjct: 614 TNGSGKATVTLKSD-KPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672

Query: 469 TDSAELHLVLHDISGNPIKVSEGMEFVQSGTNVPYMKISAIDYSQNINGDYKATVTGDGE 528
+ + + G+ ++ + F + + S + NG K T+T
Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKL-----SNSTEKTDTNGYAKVTLTSTTP 727

Query: 529 GIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVSVNGANLPTASFPSQGFTGAYYQLN 588
G + + ++ V + +EF G + + G + P+ L
Sbjct: 728 GKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIEIVGTGV-KGKLPTVWLQYGQVNL- 784

Query: 589 NDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDGDSNTVIIT 632
+ G + ++ A ++G+VT K G + +I+
Sbjct: 785 --KASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVIS 826



Score = 52.0 bits (124), Expect = 7e-09
Identities = 51/233 (21%), Positives = 89/233 (38%), Gaps = 16/233 (6%)

Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70
A T+ GKA VTLK K G V+A S V F T + + + +
Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671

Query: 71 ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130
AN V PV +V F T LS+++ +TD +A++ +TST G
Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730

Query: 131 TVSASLADKPTEVISRLLN----AKVDVNSATI----TSQEIPEGQVMVAQDIAVKAHVN 182
VSA ++D +V + + +D + I ++P + Q + N
Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790

Query: 183 DQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVKA 235
++ + A S Q+ + + +T + V + + +YT+
Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTIS-----VISSDNQTATYTIAT 838



Score = 51.6 bits (123), Expect = 7e-09
Identities = 45/170 (26%), Positives = 64/170 (37%), Gaps = 7/170 (4%)

Query: 271 TLTATLTSANGTPVEGQVINFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330
T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A
Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637

Query: 331 FHNGV-TIQTQTTVKVTGNPSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGL 389
+ + V + A + AD +T A D T V G +
Sbjct: 638 TAEMTSALNANAVIFVDQ--TKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQ 694

Query: 390 TVYFALKSGSTTLTSLTAVTDQNGIATTSVKGAITGSVTVSTVTSAGGMQ 439
V F G + + T TD NG A ++ G VS S +
Sbjct: 695 EVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742



Score = 40.4 bits (94), Expect = 2e-05
Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%)

Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61
TD +G AKVTL T G V+A + + V F N
Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760

Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120
+ + + + + G N G + S + SV+
Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811

Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180
VT E G T+S +D T + + + ++ + V ++
Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865

Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213
N + + + AA + S T+ +
Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898


81  APECO1_1160  APECO1_1169  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_1160  -3      12       2.664844   chaperone
APECO1_1161  -3      13       3.034119   hypothetical protein
APECO1_1162  -3      16       3.818853   hypothetical protein
APECO1_1163  -3      16       3.819315   hypothetical protein
APECO1_1164  -2      16       3.873809   multidrug efflux system subunit MdtA
APECO1_1165  -2      18       3.841699   multidrug efflux system subunit MdtB
APECO1_1166  -2      15       2.423481   multidrug efflux system subunit MdtC
APECO1_1167  -1      14       0.327994   multidrug efflux system protein MdtE
APECO1_1168  -1      18       -4.720683  signal transduction histidine-protein kinase
APECO1_1169  0       26       -7.682430  DNA-binding transcriptional regulator BaeR
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1160  SHAPEPROTEIN  49     2e-08    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 49.0 bits (117), Expect = 2e-08
Identities = 32/129 (24%), Positives = 58/129 (44%), Gaps = 20/129 (15%)

Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGSDEANTQAQGILERAAKRAGFKDVVF 190
M+ H I+Q + ++ P+ + + + I E +A+ AG ++V
Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV-------ERRAIRE-SAQGAGAREVFL 140

Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGCRI 250
EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI
Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189

Query: 251 GGNDLDIAL 259
GG+ D A+
Sbjct: 190 GGDRFDEAI 198



Score = 35.1 bits (81), Expect = 5e-04
Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 23/137 (16%)

Query: 332 RLSYRLV---RSAEESKIALSSV--AETRASLPFISDELAT------LISQQGLESALNQ 380
R +Y + +AE K + S + + LA ++ + AL +
Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262

Query: 381 PLARILEQVQLALDNAQEKPDV--------IYLTGGSARSPLIKKALAEQLPGIPIAGGD 432
PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIP+ +
Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319

Query: 433 D-FGSVTAGLARWAEVV 448
D V G + E++
Sbjct: 320 DPLTCVARGGGKALEMI 336


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits       Score  E-value  Comments
APECO1_1164  RTXTOXIND  52     3e-09    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 51.8 bits (124), Expect = 3e-09
Identities = 48/369 (13%), Positives = 106/369 (28%), Gaps = 87/369 (23%)

Query: 53 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGR------- 100
S + R V ++ IA G+ + + A G + +
Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113

Query: 101 --RGMRAG-PLA---PVQAATAVEQAVPRYLTGLGTITAANTVTVRSRVDG--QLMALHF 152
+R G L + A + L T ++ ++ +L
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 153 QEGQQVKAGDLLAEI------------DPSQFKVALAQAQGQLA-------KDKATLANA 193
Q V ++L Q ++ L + + + + +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 194 RRDLSRYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 236
+ L + L +++ + Q+ E ++ ++ +
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 237 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 269
+ + S I APV +V LK G +++
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 270 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 328
+T +V++ + +++ + DI + Q A + VEA+ T L + ++L
Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410

Query: 329 DNQIDATTG 337
D D G
Sbjct: 411 DAIEDQRLG 419


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1165  ACRIFLAVINRP  920    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 920 bits (2379), Expect = 0.0
Identities = 300/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%)

Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72
+ FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V
Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61

Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131
T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP +
Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121

Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189
+ + S + +M S TQ + D V + V +S+++GVG V L G
Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243
Q A+R+ L+A + LT V + N A G L G ++ A +
Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239

Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302
++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299

Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362
+ TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL
Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359

Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422
+N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R
Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419

Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481
+ E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+
Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479

Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538
+S +V+L LTP +CA +L S E + F FD + Y + K+L
Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539

Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598
L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+
Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599

Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653
+ V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+
Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658

Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709
D F+ P I + T + F L DAL+ QL+ Q P L V
Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717

Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769
+ + + VD++ A LG+S++D++ + A G ++ + ++ ++ + +
Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777

Query: 770 NTPGLAALDTIRLTSSDGGVVPLSSIAKIEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829
+D + + S++G +VP S+ + + + P I S G
Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837

Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889
DA A+M+ + LP I + G + + + L+ + V +++ L LYES+
Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895

Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949
P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++
Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955

Query: 950 GMSPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009
G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + +
Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015

Query: 1010 LTLFTTPVIYLLFDRL 1025
L +F PV +++ R
Sbjct: 1016 LAIFFVPVFFVVIRRC 1031


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1166  ACRIFLAVINRP  913    0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 913 bits (2361), Expect = 0.0
Identities = 288/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%)

Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65
FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V
Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63

Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124
+E+++ I + M+S+S S GS I L F D + A VQ + A LLP +
Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123

Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182
+ S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+
Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182

Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236
A+R+ L+ L ++ DV + N + G AL I K
Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241

Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295
E+ + + N +G VRL DVA V ++ N KPA L I+ AN +
Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301

Query: 296 TVDSIRARLPELQSTIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355
T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++
Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361

Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414
RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R +
Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421

Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474
E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S
Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481

Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530
+LV+L LTP +C +LK + GF Y S+ +L T
Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541

Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586
++ +A + L++ +P +F PE+D GV + IQ + + + ++K
Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601

Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRGERS---ETAQQIIDRLRKKLAKEPGAN 641
+ +V V GF+ G N+GM F++LKP ER+ +A+ +I R + +L K
Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661

Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696
+ + I G ++ L D + + R +L + L V +
Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718

Query: 697 QEDNGAEMNLIYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756
++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++
Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778

Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816
++K++V + G+ +P S F + + I G S D
Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838

Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876
A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+
Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896

Query: 877 PLTILSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVYFALEAQRHGN 936
P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V FA +
Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956

Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996
EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL
Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016

Query: 997 TLYTTPVVYLFFDRL 1011
++ PV ++ R
Sbjct: 1017 AIFFVPVFFVVIRRC 1031



Score = 79.5 bits (196), Expect = 4e-17
Identities = 76/448 (16%), Positives = 160/448 (35%), Gaps = 26/448 (5%)

Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGANLFLMAVQDI 650
+DN+ + S S + +T + + Q+ ++L+ P + Q I
Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127

Query: 651 RVGGRQANASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQEDNGAE-- 703
V ++ +SD+ ++ ++ L+ L + DV GA+
Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183

Query: 704 MNLIYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759
M + D D + + + + + + T P Q + R+
Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243

Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817
+ +N++G + L A+ N + G AA +L D
Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302

Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874
+ AI + +L P ++ + T Q +++ V + AI V++V+ + ++
Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362

Query: 875 VHPLTILSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVYFALEAQRH 934
L +P +G L F + + + G++L IG++ +AI++V
Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422

Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994
L P+EA ++ ++ + +P+ GG + + ITIV + +S
Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482

Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPKQA 1022
L+ L TP + + + K
Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGG 510


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits     Score  E-value  Comments
APECO1_1167  TCRTETB  125    1e-33    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 125 bits (316), Expect = 1e-33
Identities = 97/429 (22%), Positives = 189/429 (44%), Gaps = 23/429 (5%)

Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79
F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ +
Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83

Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138
I++ GS+ + + LL +AR +QG G A + + V + +P+E A
Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143

Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLM-LMPNYTMQTRRFDL 197
+ +G +GPA+GG++ Y HW +L+ IP+ I + LM L+ FD+
Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201

Query: 198 SGFLLLAVGMAVLTLALDGSKGTGFSPLAIAGLVAVGVVALVLYLLHAQNNNRALFSLKL 257
G +L++VG+ L F+ + V V++ ++++ H + L
Sbjct: 202 KGIILMSVGIVFFML---------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252

Query: 258 FRTRNFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316
+ F +G+ M P ++ S G +++ P + + I
Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312

Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372
+V+R G VL +G++ +++ F+T + L W+ + V L G+ S +
Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367

Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432
++T+ L A +G SLL+ LS G+ I G LL + + Q+ +
Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427

Query: 433 MYTWLSMAF 441
+Y+ L + F
Sbjct: 428 LYSNLLLLF 436


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_1168  BCTERIALGSPF  34     0.001    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 34.0 bits (78), Expect = 0.001
Identities = 28/95 (29%), Positives = 36/95 (37%), Gaps = 20/95 (21%)

Query: 164 RQTSWLIVALSTLLAALATF------PLARGLLAPVKRLVDGTHKLAAGDFTTRVAPTSE 217
RQ + L+ A L AL P L+A V+ V H LA + P S
Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131

Query: 218 DEL-----------GRLAEDFNQLASTLEKNQQMR 241
+ L G L N+LA E+ QQMR
Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits    Score  E-value  Comments
APECO1_1169  HTHFIS  76     6e-18    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 75.6 bits (186), Expect = 6e-18
Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%)

Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70
IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129
L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123

Query: 130 PQRELQQQDAESPLII 145
+ + D++ + +
Sbjct: 124 RRPSKLEDDSQDGMPL 139


82  APECO1_4415  APECO1_4410  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias   Product
APECO1_4415  1       15       1.930667  D-alanyl-D-alanine endopeptidase
APECO1_4414  1       17       2.132574  hypothetical protein
APECO1_4413  2       17       2.048456  hypothetical protein
APECO1_4412  1       15       2.203551  acetoin dehydrogenase
APECO1_4411  0       15       1.267353  multidrug resistance outer membrane protein
APECO1_4410  0       13       0.317851  tRNA-dihydrouridine synthase C
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits         Score  E-value  Comments
APECO1_4415  BLACTAMASEA  44     4e-07    Beta-lactamase class A signature.
		>BLACTAMASEA#Beta-lactamase class A signature.

Length = 286

Score = 43.6 bits (103), Expect = 4e-07
Identities = 42/195 (21%), Positives = 77/195 (39%), Gaps = 18/195 (9%)

Query: 4 MPKFRVSLFSLALMLAVPFAPQAVAKTVAATTASQPEIASGSAMI-VDLNTNKVIYSNHP 62
M R+ + SL + +P A A + + S+ +++ MI +DL + + + +
Sbjct: 1 MRYIRLCIISL--LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58

Query: 63 DLVRPIASISKLMTAMVVLDARLPLDEKLKVDISQTPEMKGVYSRV---RLNSEISRKDM 119
D P+ S K++ VL DE+L+ I + YS V L ++ ++
Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118

Query: 120 LLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTRFV--EPTGLS-----V 172
A+ S+N +AA+L GG + A + +G N TR E
Sbjct: 119 CAAAITMSDN-SAANLLLATVGG----PAGLTAFLRQIGDNVTRLDRWETELNEALPGDA 173

Query: 173 HNVSTARDLTKLLIA 187
+ +T + L
Sbjct: 174 RDTTTPASMAATLRK 188


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4413  BCTERIALGSPF  29     0.017    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 28.6 bits (64), Expect = 0.017
Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 164 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 194
L + ++ + W++L ++ + R L++
Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4412  DHBDHDRGNASE  112    4e-32    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.

Length = 261

Score = 112 bits (281), Expect = 4e-32
Identities = 70/253 (27%), Positives = 116/253 (45%), Gaps = 12/253 (4%)

Query: 3 QVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTARKVVSHGVRAEIVQLDLG 62
++A IT + GIG+ A LA QG I + E + + + AE D+
Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67

Query: 63 NLPEGAQALEKLIQRLWRIDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIA 122
+ + ++ + + ID+LVN AG + ++ +EW F+V+ G F S+
Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127

Query: 123 ARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHALGGLTKAMALELVRHKILVNAVA 182
++ M+ + + G I+ + S P +AY ++K A TK + LEL + I N V+
Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186

Query: 183 PGAIATPMN-----GMDGGD--VKPDAEP---SIPLRRFGTTHEIASLVAWLCSEGANYT 232
PG+ T M +G + +K E IPL++ +IA V +L S A +
Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246

Query: 233 TGQSLIVDGGFML 245
T +L VDGG L
Sbjct: 247 TMHNLCVDGGATL 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4410  SHAPEPROTEIN  29     0.018    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 29.3 bits (66), Expect = 0.018
Identities = 32/127 (25%), Positives = 53/127 (41%), Gaps = 5/127 (3%)

Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179
G EA+ ++ + +G + E+ K EI A E+ V GR +G R
Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249

Query: 180 HIDWQAIGE-IRQRLNIPVIANGEIWDWQSAQQCMAISGCDAVMIGRGALNIPNLSRVVK 238
++ I E +++ L V A + + IS V+ G GAL + NL R++
Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307

Query: 239 YNEPRMP 245
E +P
Sbjct: 308 MEETGIP 314


83  APECO1_4373  APECO1_4365  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_4373  -1      12       1.238097   bicyclomycin/multidrug efflux system protein
APECO1_4372  -1      11       0.452163   16S rRNA pseudouridylate synthase A
APECO1_4371  -1      12       0.240030   ATP-dependet helicase
APECO1_4369  0       13       0.224844   nucleoid-associated protein NdpA
APECO1_4368  1       13       0.523630   hydrolase, inner membrane
APECO1_4367  0       17       2.383550   *autotransporter outer membrane protein
APECO1_4366  -1      19       2.589555   hypothetical protein
APECO1_4365  0       20       3.502611   transcriptional regulator NarP
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4373  TCRTETB       60     4e-12    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 60.3 bits (146), Expect = 4e-12
Identities = 77/385 (20%), Positives = 146/385 (37%), Gaps = 55/385 (14%)

Query: 13 ALPVISAQFGVPAGSTQMTLSTYILGFALGQLIYGPMADSFGRKPVVLGGTLVFAAAAVA 72
+LP I+ F P ST + ++L F++G +YG ++D G K ++L G ++ +V
Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI 95

Query: 73 CALAQTIDQLIVM-RFFHGLAAAAASVVINALMRDIYPKEEFSRMMSFVMLVTTIAPLMA 131
+ + L++M RF G AAA ++ ++ PKE + + + + +
Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155

Query: 132 PIVGGWVLVWLSWHYIFWILAVAAILASAMIFFLIKETLPPERR-QPFHIRTTIGNFAA- 189
P +GG + ++ W Y+ I + I + FL+K R F I+ I
Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITII----TVPFLMKLLKKEVRIKGHFDIKGIILMSVGI 211

Query: 190 ----------------------------------------LFRHKRVLSYMLASGFSFAG 209
L ++ + +L G F
Sbjct: 212 VFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGT 271

Query: 210 MFSFLSAGPFVYIEINHVAPENFGYYFAL-NILFLFVMTIFNSRFVRRIGALNMFRSGLW 268
+ F+S P++ +++ ++ G + + + V R G L + G
Sbjct: 272 VAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIG-- 329

Query: 269 IQFIMAAWMVISALLGLGFWSLVVGVAAFVGCVSMVSSNAMAVILDEF-PHMAGTASSLA 327
+ F+ +++ S LL W + + + +G +S + ++ AG SL
Sbjct: 330 VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLL 389

Query: 328 GTFRF---GIG-AIVGALLSLATFN 348
F G G AIVG LLS+ +
Sbjct: 390 NFTSFLSEGTGIAIVGGLLSIPLLD 414


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4368  IGASERPTASE   30     0.027    IgA-specific serine endopeptidase (S6) signature.
		>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature.

Length = 1541

Score = 30.4 bits (68), Expect = 0.027
Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 6/70 (8%)

Query: 503 LHVSTPASEYSQGQ-DLF---NPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNL 558
L V+ E + + LF QR H V+ +T+ + K L N NG+Y YN
Sbjct: 926 LQVADKTGEPNHNELTLFDASKAQRDHLNVSLV-GNTVDLGAWKYKLR-NVNGRYDLYNP 983

Query: 559 RGERVKDEKP 568
E+
Sbjct: 984 EVEKRNQTVD 993


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4367  PRTACTNFAMLY  49     2e-09    Pertactin virulence factor family signature.
		>PRTACTNFAMLY#Pertactin virulence factor family signature.

Length = 910

Score = 48.9 bits (116), Expect = 2e-09
Identities = 35/164 (21%), Positives = 51/164 (31%), Gaps = 37/164 (22%)

Query: 19 GAANDPTIKGGRGDAAFTLGNAGSVVDISTYEYTLLDNGNHSWSLAENRV---------- 68
A FTL N VDI TY Y L NGN WSL +
Sbjct: 522 ANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQP 581

Query: 69 -----------------------QMPPSTTDVLN---MAAAQPLVFDVELDTVRGRLGSV 102
++ + +N + A L + E + + RLG +
Sbjct: 582 GPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWY-AESNALSKRLGEL 640

Query: 103 KGVNYDTAMWSSAINSRNNVNTDAGAGFEQTLTGLTLGIDSRFS 146
+ W R ++ AG F+Q + G LG D +
Sbjct: 641 RLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVA 684


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4365  HTHFIS        65     2e-14    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 64.9 bits (158), Expect = 2e-14
Identities = 22/113 (19%), Positives = 48/113 (42%), Gaps = 2/113 (1%)

Query: 19 VMIVDDHPLMRRGVRQLLELDSGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 78
+++ DD +R + Q L +G++V + A+ D D+++ D+ M +
Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63

Query: 79 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 131
D L +++ +++++ + + GA YL K D L+ I
Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116


84  APECO1_4344  APECO1_4339  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_4344  -1      11       -2.013155  outer membrane porin protein C
APECO1_4343  -1      11       -1.914831  phosphotransfer intermediate protein in
APECO1_4342  -1      12       -1.624889  transcriptional regulator RcsB
APECO1_4341  -1      13       -0.873618  hybrid sensory kinase in two-component
APECO1_4340  0       17       -0.577641  sensory histidine kinase AtoS
APECO1_4339  0       16       0.678948   acetoacetate metabolism regulatory protein AtoC
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4344  ECOLIPORIN    534    0.0      E.coli/Salmonella-type porin signature.
		>ECOLIPORIN#E.coli/Salmonella-type porin signature.

Length = 383

Score = 534 bits (1377), Expect = 0.0
Identities = 254/383 (66%), Positives = 293/383 (76%), Gaps = 20/383 (5%)

Query: 1 MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLG 60
MK KVL+L++PALL AGAA+AAE+YNKDGNKLDLYGKVDGLHYFSDD S DGDQTYMR+G
Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60

Query: 61 FKGETQVTDQLTGYGQWEYQIQGNSAENE-NNSWTRVAFAGLKFQDVGSFDYGRNYGVVY 119
FKGETQ+ DQLTGYGQWEY +Q N+ E E NSWTR+AFAGLKF D GSFDYGRNYGV+Y
Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120

Query: 120 DVTSWTDVLPEFGGDTYG-SDNFMQQRGNGFATYRNTDFFGLVDGLNFAVQYQGKNGSVS 178
DV WTD+LPEFGGD+Y +DN+M R NG ATYRNTDFFGLVDGLNFA+QYQGKN S S
Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180

Query: 179 ------GEGMTNNGRGALRQNGDGVGGSITYDY-EGFGIGGAISSSKRTDDQN-SPLYIG 230
G NNG NGDG G S TYD GF G A ++S RT++Q + I
Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTIA 240

Query: 231 NGDRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSL------GWANKAQNFEAVAQYQF 284
GD+A+ +T GLKYDANNIYLA Y++T N T G G ANK QNFE AQYQF
Sbjct: 241 GGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQF 300

Query: 285 DFGLRPSLAYLQSKGKNLGR---GYDDEDILKYVDVGATYYFNKNMSTYVDYKINLLD-D 340
DFGLRP++++L SKGK+L DD+D++KY DVGATYYFNKN STYVDYKINLLD D
Sbjct: 301 DFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDDD 360

Query: 341 NQFTRDAGINTDNIVALGLVYQF 363
+ F +DAGI+TD+IVALG+VYQF
Sbjct: 361 DPFYKDAGISTDDIVALGMVYQF 383


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4342  HTHFIS        48     9e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 47.9 bits (114), Expect = 9e-09
Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%)

Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60
M +++ADD + + ++L + + + ++ L + D +++TD+ MP
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58

Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114
+ L+ IK+ P L ++V++ N +A+ ++GA P
Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106

Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139
DL + + + + S+L +
Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4341  HTHFIS        79     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-17
Identities = 29/106 (27%), Positives = 48/106 (45%)

Query: 811 ILVVDDHPINRSLLADQLGSLGYQCKTANDGVDALNVLNKNHIDIVLSDVNMPNMDGYRL 870
ILV DD R++L L GY + ++ + D+V++DV MP+ + + L
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 871 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVI 916
RI++ LPV+ ++A + E G L KP L +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4339  HTHFIS        562    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 562 bits (1449), Expect = 0.0
Identities = 181/484 (37%), Positives = 269/484 (55%), Gaps = 35/484 (7%)

Query: 1 MTAINRILIVDDEDNVRRMLSTAFALQGFETHCANNGRTALHLFADIHPDVVLMDIRMPE 60
MT IL+ DD+ +R +L+ A + G++ +N T A D+V+ D+ MP+
Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59

Query: 61 MDGIKALKEMRSHETRTPVILMTAYAEVETAVEALRCGAFDYVIKPFDLDELNLIVQRAL 120
+ L ++ PV++M+A TA++A GA+DY+ KPFDL EL I+ RAL
Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119

Query: 121 QLQSMKKEIRHLHQALSTSWQWGH-ILTNSPAMMDICKDTAKIALSQASVLISGESGTGK 179
+ L Q G ++ S AM +I + A++ + +++I+GESGTGK
Sbjct: 120 AEP------KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173

Query: 180 ELIARAIHYNSRRAKGPFIKVNCAALPESLLESELFGHEKGAFTGAQTLRQGLFERANEG 239
EL+ARA+H +R GPF+ +N AA+P L+ESELFGHEKGAFTGAQT G FE+A G
Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233

Query: 240 TLLLDEIGEMPLVLQAKLLRILQEREFERIGGHQTIKVDIRIIAATNRDLQAMVKEGTFR 299
TL LDEIG+MP+ Q +LLR+LQ+ E+ +GG I+ D+RI+AATN+DL+ + +G FR
Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293

Query: 300 EDLFYRLNVIHLILPPLRDRREDISLLANHFLQKFSSENQRDIIDIDPMAMSLLTAWSWP 359
EDL+YRLNV+ L LPPLRDR EDI L HF+Q+ E + D A+ L+ A WP
Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWP 352

Query: 360 GNIRELSNVIERAVVMNSGPIIFSEDLPPQIRQPV---------CNAGEAKTAPVGERN- 409
GN+REL N++ R + +I E + ++R + +G + E N
Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412

Query: 410 ----------------LKEEIKRVEKRIIMEVLEQQEGNRTRTALMLGISRRALMYKLQE 453
+ +E +I+ L GN+ + A +LG++R L K++E
Sbjct: 413 RQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472

Query: 454 YGID 457
G+
Sbjct: 473 LGVS 476


85  APECO1_4169  APECO1_4166  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_4169  0       35       -9.421404  multidrug resistance protein Y
APECO1_4168  0       34       -8.447340  EmrKY-TolC multidrug resistance efflux pump,
APECO1_4167  0       31       -7.768086  EvgA family transcriptional regulator
APECO1_4166  0       31       -7.338712  EvgA family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4169  TCRTETB       122    2e-32    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 122 bits (308), Expect = 2e-32
Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%)

Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78
+ I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+
Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74

Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137
++G RL L + S++ + + +LI R +QG A L ++ R P
Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134

Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197
E R A L V + GP +GG I W +L+ +PM I+ L L +E
Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192

Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIILLTVVSVISLISLVIWES 257
++ G+ L+ +G+ + ML F +S I +VSV+S + V
Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241

Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317
+P +D L K+ F IG++ + +G + ++P ++++ + G
Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301

Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376
G M ++I + G ++ ++ +V + S T F II+ G
Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361

Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420
+ ++TI S L + S+ NF LS G ++
Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4168  RTXTOXIND     74     1e-16    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.
Length = 478

Score = 74.1 bits (182), Expect = 1e-16
Identities = 47/277 (16%), Positives = 94/277 (33%), Gaps = 46/277 (16%)

Query: 61 AKNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDYNRRV----PLAKQGVIS 113
K + Q + L + AE + + Y+ R+ L + I+
Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250

Query: 114 KEALEHTKDTLI----------SSKAALNAAIQAYKANKALVMNTPLNRQPQVIEAADAT 163
K A+ ++ + S + + I + K + T L + + + T
Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKNEILDKLRQTT 308

Query: 164 KE----------AWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPGQSLMAVVPARQ-MWV 211
+ + I++PV+ + Q V G V+ ++LM +VP + V
Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368

Query: 212 NANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNAFSLLPAQNATGNWIK 271
A + + + +GQ+ I + F G +G + +
Sbjct: 369 TALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK---VKNINLDAIEDQRLG 419

Query: 272 IVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 304
+V V +S++ L PL G+++TA I T
Sbjct: 420 LVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4167  HTHFIS        49     3e-09    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 49.1 bits (117), Expect = 3e-09
Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%)

Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63
++ DD + L + ++ + + + + D+V+ DV +P N +
Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123
L ++K + ++++SA+N + AI+A++ G +
Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101

Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148
PF L + + L ++
Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_4166  HTHFIS        79     2e-17    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 79.5 bits (196), Expect = 2e-17
Identities = 30/105 (28%), Positives = 51/105 (48%)

Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019
+IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLNCGMNLCLFKPLTLD 1064
L ++++ LP+ ++A K G L KP L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109


86  APECO1_3841  APECO1_3834  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3841  0       11       2.355655   transporter
APECO1_3840  -1      12       1.929995   hypothetical protein
APECO1_3839  -1      12       1.906631   hypothetical protein
APECO1_3838  -1      13       1.499020   transcriptional repressor MprA
APECO1_3837  -2      13       1.581976   multidrug resistance protein B
APECO1_3836  -1      12       0.687186   hypothetical protein
APECO1_3835  0       13       1.335333   hypothetical protein
APECO1_3834  0       15       1.235172   S-ribosylhomocysteinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3841  TCRTETB       44     7e-07    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 44.1 bits (104), Expect = 7e-07
Identities = 32/165 (19%), Positives = 70/165 (42%), Gaps = 2/165 (1%)

Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93
L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I
Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96

Query: 94 ASSQSLA-MMILGTALTGLFSVVAQILVPLA-ATLASPDKRGKVVGTIMSGLLLGILLAR 151
S ++I+ + G + LV + A + RGK G I S + +G +
Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156

Query: 152 TVAGLLANLGGWRTVFWVASMLMALMALALWRGLPQMKSETHLNY 196
+ G++A+ W + + + + + + +++ + H +
Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3838  PF05272       28     0.018    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.5 bits (63), Expect = 0.018
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%)

Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82
P QE+ L + + L R A+G + + T + ++L ALG
Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808

Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111
SS ++ D L + GW RE+ RR
Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3837  TCRTETB       131    1e-35    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 131 bits (332), Expect = 1e-35
Identities = 97/405 (23%), Positives = 169/405 (41%), Gaps = 23/405 (5%)

Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76
I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++
Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76

Query: 77 GEVKLFLWSTIAFAIASWACGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135
G +L L+ I S V S ++LI R IQG A L ++ P
Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136

Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195
R A L V + GP +GG I+ HW + + +P+ + + L L +E R
Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194

Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255
+ D G+ L+ +GI + ML F++ I +V+V++ +
Sbjct: 195 I-KGHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243

Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315
+P VD L K+ F IG LC + + G + ++P ++++V+ + G G
Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303

Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373
+ VI+ I G + ++ +V F ++ S + I F
Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358

Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416
++ ++TI S L + A SL NFT L+ G +I
Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3834  LUXSPROTEIN   292    e-105    Bacterial autoinducer-2 (AI-2) production protein Lu...
		>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature.
Length = 171

Score = 292 bits (750), Expect = e-105
Identities = 131/170 (77%), Positives = 148/170 (87%)

Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61
PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A
Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60

Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121
GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP
Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120

Query: 122 ELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPKEKLQELHI 171
ELN YQCGT MHSL EA+ IA++ILE V +N N+ELALP+ L+EL I
Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170


87  APECO1_3761  APECO1_3751  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3761  0       12       -1.420173  metabolite transport protein YgcS
APECO1_3760  1       12       -2.819858  FAD containing dehydrogenase
APECO1_3759  2       15       -3.807841  oxidoreductase YgcW
APECO1_3758  0       17       -3.885615  transporter
APECO1_3757  0       20       -4.144269  sugar kinase
APECO1_3756  0       20       -3.917465  hypothetical protein
APECO1_3755  0       20       -3.327148  hypothetical protein
APECO1_3754  0       21       -2.400608  hypothetical protein
APECO1_3753  0       22       -2.050560  hypothetical protein
APECO1_3752  0       22       -0.095837  hypothetical protein
APECO1_3751  1       21       0.229886   phosphopyruvate hydratase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3761  TCRTETB       36     2e-04    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 36.4 bits (84), Expect = 2e-04
Identities = 53/338 (15%), Positives = 123/338 (36%), Gaps = 34/338 (10%)

Query: 93 LGSLVLGWISDHIGRQKIFTFSFMLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 151
+G+ V G +SD +G +++ F ++ S + F + LI R + G G ++
Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123

Query: 152 GHTLLAEFSPRRHRGVLLGAFSVVWT----VGYVLASIAGHHFISESPEAWRWLLASAAL 207
++A + P+ +RG G + VG + + H+ W +LL +
Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177

Query: 208 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVATATHKHIKTLF-- 265
+ + L + R +G F I+ +L + + + L
Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234

Query: 266 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 312
++ R+ ++ F+ V+ +I+ ++ + + L+ + +
Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294

Query: 313 LNALLIVGALLGLVLTHLLAHRRFLLGSFLLLTATLVVMACLPSGSSLTLLLFVLFSTTI 372
+ ++ G + ++ ++ G +L + ++ S F+L +T+
Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLS-----VSFLTASFLLETTSW 349

Query: 373 SAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVS 410
+V +L SF + S V + GA +S
Sbjct: 350 FMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMS 387


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3759  DHBDHDRGNASE  109    8e-31    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature.
Length = 261

Score = 109 bits (274), Expect = 8e-31
Identities = 74/257 (28%), Positives = 116/257 (45%), Gaps = 11/257 (4%)

Query: 36 MDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANVFIPSFVKDNGETKEMIEK-QGVEVD 94
M+ ++GK A +TG G+G+A A LA GA++ + + E K + +
Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60

Query: 95 FMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAA 154
D+ A +I A G +DILVN AG+ + + +W+ VN T
Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120

Query: 155 FELSYEAAKIMIPQKSGKIINICSLFSYSGGQWSPAYSATKHALAGFTKAYCDELGQYNI 214
F S +K M+ ++SG I+ + S + AY+++K A FTK EL +YNI
Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180

Query: 215 QVNGISPGYYATDI--TLATRSNPETNQRVLDY-------IPANRWGDTQDLMGAAVFLA 265
+ N +SPG TD+ +L N Q + IP + D+ A +FL
Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAE-QVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239

Query: 266 SPASNYVNGHLLVVDGG 282
S + ++ H L VDGG
Sbjct: 240 SGQAGHITMHNLCVDGG 256


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3758  TCRTETA       31     0.006    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 31.3 bits (71), Expect = 0.006
Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 1/76 (1%)

Query: 41 GFSNTEIGLIMSTFGIAAIIFYA-PSGVIADKFSHRKMITSAMIITGLLGLIMATYPPLW 99
+ T IG+ ++ FGI + A +G +A + R+ + MI G +++A W
Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301

Query: 100 VMLCIQVAFAITTILM 115
+ I V A I M
Sbjct: 302 MAFPIMVLLASGGIGM 317



Score = 30.6 bits (69), Expect = 0.012
Identities = 22/103 (21%), Positives = 45/103 (43%), Gaps = 8/103 (7%)

Query: 48 GLIMSTFGIAAIIFYAPSGVIADKFSHRKMITSAMIITGLLGLIMATYPPLWVMLCIQVA 107
G++++ + + G ++D+F R ++ ++ + IMAT P LWV+ ++
Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105

Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVG 147
IT + A + + D E+ + G+M G G
Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFG 143


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3755  56KDTSANTIGN  30     0.005    Rickettsia 56kDa type-specific antigen protein sign...
		>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature.
Length = 533

Score = 30.3 bits (68), Expect = 0.005
Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 12/76 (15%)

Query: 30 NASWSEVLNQYQRRTDLIPNLVASIKGYSSHEQEVLEAVTLARSQANRASSDLQKTPGDE 89
+AS ++ ++ Q D + L S GY + + N+ + P +
Sbjct: 294 SASIEQIQSKIQELGDTLEELRDSFDGY------------INNAFVNQIHLNFVMPPQAQ 341

Query: 90 QKLQAWQQAQAQAQAQ 105
Q+ QQ QAQA AQ
Sbjct: 342 QQQGQGQQQQAQATAQ 357


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3752  cloacin       34     8e-04    Cloacin signature.
		>cloacin#Cloacin signature.

Length = 551

Score = 33.9 bits (77), Expect = 8e-04
Identities = 16/30 (53%), Positives = 18/30 (60%)

Query: 266 GSSSSSSGGGSSGGGFSGGGGSSGGGGASG 295
GS S GG SG G GG G+SGGG +G
Sbjct: 49 GSGSGIHWGGGSGHGNGGGNGNSGGGSGTG 78



Score = 31.2 bits (70), Expect = 0.005
Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 272 SGGGSSGGGFSGGGGSSGGGGAS 294
SG G+ GG + GGGS GG S
Sbjct: 60 SGHGNGGGNGNSGGGSGTGGNLS 82



Score = 29.7 bits (66), Expect = 0.016
Identities = 11/32 (34%), Positives = 14/32 (43%)

Query: 263 SRKGSSSSSSGGGSSGGGFSGGGGSSGGGGAS 294
S S G G G SGGG +GG ++
Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83



Score = 29.3 bits (65), Expect = 0.022
Identities = 13/30 (43%), Positives = 16/30 (53%)

Query: 266 GSSSSSSGGGSSGGGFSGGGGSSGGGGASG 295
S ++ GGGS G GGG G GG +G
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNG 69



Score = 28.5 bits (63), Expect = 0.038
Identities = 13/38 (34%), Positives = 16/38 (42%)

Query: 258 SKERASRKGSSSSSSGGGSSGGGFSGGGGSSGGGGASG 295
S E G S S G G +GGG + GGG+
Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77



Score = 28.1 bits (62), Expect = 0.042
Identities = 12/30 (40%), Positives = 13/30 (43%)

Query: 263 SRKGSSSSSSGGGSSGGGFSGGGGSSGGGG 292
S G G +GGG GG SG GG
Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3751  ANTHRAXTOXNA  29     0.038    Anthrax toxin LF subunit signature.
		>ANTHRAXTOXNA#Anthrax toxin LF subunit signature.

Length = 800

Score = 29.3 bits (65), Expect = 0.038
Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 9/132 (6%)

Query: 211 GYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLA-----GEGN 265
P L N + A+ +E K YE+GK I+L + + ++ + +
Sbjct: 147 RETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSS 206

Query: 266 KAFTSEEFTHFLEELTKQYPIVSIEDGLDESDW---DGFAYQTKVLG-DKIQLVGDDLFV 321
S++F LE K I I++ L E F+Y ++L D+F
Sbjct: 207 DLLFSQKFKEKLELNNKSIDINFIKENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFE 266

Query: 322 TNTKILKEGIEK 333
K+ K G EK
Sbjct: 267 YMNKLEKGGFEK 278


88  APECO1_3682  APECO1_3676  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3682  1       14       1.345408   hypothetical protein
APECO1_3681  -2      10       1.514165   hypothetical protein
APECO1_3680  -3      10       1.185291   hypothetical protein
APECO1_3679  -3      9        1.116814   hypothetical protein
APECO1_3678  -2      8        0.785686   thymidylate synthase
APECO1_3677  -3      9        1.300687   prolipoprotein diacylglyceryl transferase
APECO1_3676  -2      10       1.721041   fused phosphoenolpyruvate-protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3682  BCTERIALGSPH  29     0.002    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.
Length = 170

Score = 29.1 bits (65), Expect = 0.002
Identities = 27/114 (23%), Positives = 43/114 (37%), Gaps = 29/114 (25%)

Query: 8 QQGFSLPEVMLAMVLMVMIVTA----------------LSGFQRTLMNSLASRNQYQQLW 51
Q+GF+L E+ML ++LM + L+ F+ L Q Q +
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFF 62

Query: 52 -----RHGWQ--QTQLRAISPPA----NWQVNRMQTSQAGCVSISVTLVSPGGR 94
WQ + R + PA W R +AG V+ S ++ GG+
Sbjct: 63 GVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSI--AGGK 114


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3680  PilS_PF08805  29     0.016    PilS N terminal
		>PilS_PF08805#PilS N terminal

Length = 185

Score = 28.7 bits (64), Expect = 0.016
Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%)

Query: 72 ALSARRNRRMPVKEQGFSLLEVLIAMAISSVLLLGAARFLPALQRESLTNT 122
+LSARR + +++G +L+EVL+ + + VL A + +Q ++
Sbjct: 15 SLSARRKKE---QDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSN 62


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3679  BCTERIALGSPG  29     0.003    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.
Length = 145

Score = 29.5 bits (66), Expect = 0.003
Identities = 9/24 (37%), Positives = 18/24 (75%)

Query: 1 MKTQRGYTLIETLVAMLILVMLSA 24
QRG+TL+E +V ++I+ +L++
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLAS 27


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3676  PHPHTRNFRASE  611    0.0      Phosphoenolpyruvate-protein phosphotransferase sign...
		>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature.
Length = 572

Score = 611 bits (1577), Expect = 0.0
Identities = 189/571 (33%), Positives = 314/571 (54%), Gaps = 7/571 (1%)

Query: 168 QTRIRALPAAPGVAIAEGWQDATLPLMEQVYQASTLDPALERERLTGALEEAANEFRRYS 227
+I + A+ GVAIA+ + + + + S D + E E+LT ALE++ E R
Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNV--DIEKTSITDVSTEIEKLTAALEKSKEELRAIK 59

Query: 228 KRFAAGAQKETAAIFDLYSHLLSDTRLRRELFAEVDKGSV-AEWAVKTVIEKFAEQFAAL 286
+ A + A IF + +L D L + +++ + AE+A+K V + F F ++
Sbjct: 60 DQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESM 119

Query: 287 SDNYLKERAGDLRALGQRLLFHLDDANQGPNAW-PERFILVADELSATTLAELPQDRLVG 345
+ Y+KERA D+R + +R+L HL G A E +++A++L+ + A+L + + G
Sbjct: 120 DNEYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKG 179

Query: 346 VVVRDGAANSHAAIMVRALGIPTVMGA-DIQPSVLHRRTLIVDGYRGELLVDPEPVLLQE 404
G SH+AIM R+L IP V+G ++ + H +IVDG G ++V+P ++
Sbjct: 180 FATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKA 239

Query: 405 YQRLISEEIELSRLAEDDVNLPAQLKSGERIKVMLNAGLSPEHEEKLGSRIDGIGLYRTE 464
Y+ + + + V P+ K G +++ N G + + L + +GIGLYRTE
Sbjct: 240 YEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTE 299

Query: 465 IPFMLQSGFPSEEEQVAQYQGMLQMFNDKPVTLRTLDVGADKQLPYMPISEE-NPCLGWR 523
+M + P+EEEQ Y+ ++Q + KPV +RTLD+G DK+L Y+ + +E NP LG+R
Sbjct: 300 FLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFR 359

Query: 524 GIRITLDQPEIFLIQVRAMLRANAATGNLNILLPMVTSLDEVDEARRLIERAGREVEEMI 583
IR+ L++ +IF Q+RA+LRA + GNL ++ PM+ +L+E+ +A+ +++ ++
Sbjct: 360 AIRLCLEKQDIFRTQLRALLRA-STYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEG 418

Query: 584 GYEIPKPRIGIMLEVPSMVFMLPHLAKRVDFISVGTNDLTQYILAVDRNNTRVANIYDSL 643
+GIM+E+PS AK VDF S+GTNDL QY +A DR N RV+ +Y
Sbjct: 419 VDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478

Query: 644 HPAMLRALAMIAREAEIHGIDLRLCGEMAGDPMCVAILIGLGYRHLSMNGRSVARVKYLL 703
HPA+LR + M+ + A G + +CGEMAGD + + +L+GLG SM+ S+ + L
Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQL 538

Query: 704 RRIDFAEAENLAQRSLEAQLATEVRHQVAAF 734
++ E + AQ++L A EV V
Sbjct: 539 LKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569


89  APECO1_3530  APECO1_3518  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3530  9       32       -5.928488  tia invasion determinant
APECO1_3529  8       32       -5.089021  ISEc12 ATP-binding protein
APECO1_3528  8       35       -4.280974  transposase for ISEc12
APECO1_3527  7       38       -4.917385  protein PapG
APECO1_3526  6       35       -1.652529  P pilus minor tip component PapF
APECO1_3525  6       34       -1.333131  PapE protein
APECO1_3524  5       31       -1.154896  pilus assembly protein PapK
APECO1_3523  6       32       -1.869710  PapJ protein
APECO1_3522  5       31       -3.094893  pilus assembly protein chaperone PapD
APECO1_3521  5       29       -2.338248  outer membrane usher protein PapC
APECO1_3520  4       31       -3.750169  minor pilin subunit PapH
APECO1_3519  2       27       -3.860891  major fibrial protein
APECO1_3518  2       20       -2.212630  PapB-like protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3530  OUTRMMBRANEA  41     2e-06    Outer membrane protein A signature.
		>OUTRMMBRANEA#Outer membrane protein A signature.

Length = 346

Score = 41.1 bits (96), Expect = 2e-06
Identities = 45/212 (21%), Positives = 67/212 (31%), Gaps = 47/212 (22%)

Query: 4 MKKVIVVSALAMAGVFSAQALADRGKTGFYVTGKAGASVVTQTDQRFRQDFGDDVYKYKG 63
MKK + A+A+AG + A + T +Y K G S + D +
Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNT-WYTGAKLGWS-----------QYHDTGFINNN 48

Query: 64 GDKNDTVFGAGLAVGYDFYQHYNVPVRTEVEFYGRGAADSHYTLDTWHSPMGDGGREDTQ 123
G ++ GAG GY V ++ GR + G Q
Sbjct: 49 GPTHENQLGAGAFGGYQVNP--YVGFEMGYDWLGRMPYKGS----------VENGAYKAQ 96

Query: 124 NRLSVNTLMVNTYYDFRNSSAFTPWVSVGLGYARIHHKATYTDTSWNKSGEVSDISALHY 183
V Y + +T +G R DT N G+
Sbjct: 97 ---GVQLTAKLGYPITDDLDIYT---RLGGMVWR-------ADTKSNVYGKN-------- 135

Query: 184 SGYDNNFAWSIGAGVRYDITPDIALDLSYRYL 215
+D + GV Y ITP+IA L Y++
Sbjct: 136 --HDTGVSPVFAGGVEYAITPEIATRLEYQWT 165


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3527  PF03627       602    0.0      PapG
		>PF03627#PapG

Length = 336

Score = 602 bits (1554), Expect = 0.0
Identities = 336/336 (100%), Positives = 336/336 (100%)

Query: 1 MKKWFPALLFSLCVSGESSAWNNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIATVT 60
MKKWFPALLFSLCVSGESSAWNNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIATVT
Sbjct: 1 MKKWFPALLFSLCVSGESSAWNNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIATVT 60

Query: 61 WNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDNDSYF 120
WNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDNDSYF
Sbjct: 61 WNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDNDSYF 120

Query: 121 FLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTSGMQ 180
FLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTSGMQ
Sbjct: 121 FLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTSGMQ 180

Query: 181 RHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY 240
RHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY
Sbjct: 181 RHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY 240

Query: 241 AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWY 300
AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWY
Sbjct: 241 AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWY 300

Query: 301 KAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336
KAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP
Sbjct: 301 KAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3526  FIMBRIALPAPF  268    2e-95    Escherichia coli: P pili tip fibrillum papF protein...
		>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein

signature.
Length = 167

Score = 268 bits (685), Expect = 2e-95
Identities = 155/167 (92%), Positives = 157/167 (94%), Gaps = 1/167 (0%)

Query: 11 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 70
MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE
Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60

Query: 71 ITKTISISCTYKSGSPWIKVTGNAMA-GQTNVLATNIANFGIALYQGKGMSTPLTLGNGS 129
+TK ISISC YKSGS WIKVTGN M GQ NVLATNI +FGIALYQGKGMSTPLTLGNGS
Sbjct: 61 VTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120

Query: 130 GNGYRVTAGLDTARSTFTFTSVPFRNGSRTLNGGDFRTTASMSMIYN 176
GNGYRVTAGLDTARSTFTFTSVPFRNGS LNGGDFRTTASMSMIYN
Sbjct: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3525  FIMBRIALPAPE  296    e-106    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein

signature.
Length = 173

Score = 296 bits (760), Expect = e-106
Identities = 124/173 (71%), Positives = 142/173 (82%)

Query: 1 MKKIRGLCLPVMLGAVLMSQHVHAVDNLTFRGKLIIPACTVSNTTVDWQDVEIQTLSQNG 60
MKKIRGLCLPVMLGAVLMSQHVHA DNLTF+GKLIIPACTV N V+W D+EIQ L Q+G
Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAEVNWGDIEIQNLVQSG 60

Query: 61 NHEKEFTVNMRCPYNLGTMKVTITATNTYNNAILVQNTSNTSSDGLLVYLYNSNAGNIGT 120
++K+FTV+M CPY+LGTMKVTIT+ N+ILV NTS S DGLL+YLYNSN IG
Sbjct: 61 GNQKDFTVDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGN 120

Query: 121 AITLGTPFTPGKITGNNADKTISLHAKLGYKGNMQNLIAGPFSATATLVASYS 173
A+TLG+ TPGKITG + I+L+AKLGYKGNMQ+L AG FSATATLVASYS
Sbjct: 121 AVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3521  PF00577       734    0.0      Outer membrane usher protein FimD
		>PF00577#Outer membrane usher protein FimD

Length = 878

Score = 734 bits (1897), Expect = 0.0
Identities = 241/882 (27%), Positives = 361/882 (40%), Gaps = 67/882 (7%)

Query: 1 MRGMKDRI-PFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLP 59
+ K R+ F V + +++ + FN L + D +RF + P
Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75

Query: 60 GQYLLDVIVNGQSISPASLQISFVEPALSGDKAEKKLPQACLTSDMVRLMGLTAESLDKV 119
G Y +D+ +N + A+ ++F CLT + MGL S+ +
Sbjct: 76 GTYRVDIYLNNGYM--ATRDVTFNTGDSEQGI------VPCLTRAQLASMGLNTASVSGM 127

Query: 120 VYWHDGQCADF-HGLPGVDIRPDTGAGVLRINMPQAWLEYSDATWLPPSRWDDGIPGLML 178
D C + + D G L + +PQA++ ++PP WD GI +L
Sbjct: 128 NLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLL 187

Query: 179 DYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTW 238
+YN +G +N GG+SH N G N+G WRLR + S S + + +
Sbjct: 188 NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSS--DSSSGSKNKWQH 245

Query: 239 SRFYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIA 298
+L R I R+ LTLG+ DIF ++ GA L SDD MLP RG+AP I GIA
Sbjct: 246 INTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIA 305

Query: 299 ETNARVVVSQQGRVLYDSMVPAGLFSIQDLD-SSVRGRLDVEVIEQNGRKKTFQVDTASV 357
A+V + Q G +Y+S VP G F+I D+ + G L V + E +G + F V +SV
Sbjct: 306 RGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSV 365

Query: 358 PYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQWSLYGGAVLAGDYNALAA 417
P L R G RY + +G R + E P F GL W++YGG LA Y A
Sbjct: 366 PLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNF 425

Query: 418 GAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERN 477
G G ++G G LS D+TQ+ + + + G+S R Y+K + + +I GYR+S
Sbjct: 426 GIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSG 485

Query: 478 YMTMEQYLNARYR--------------------NDYSSREKEMYTVTLNKNVADWNTSFN 517
Y +R + + ++ +T+ + + + +
Sbjct: 486 YFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLY 544

Query: 518 LQYSRQTYWDIRKTD-YYTVSVNRYFNVFGLQGVAVGLAASRSKYLGRD--NDSAYLRIS 574
L S QTYW D + +N F + L+ S +K + + L ++
Sbjct: 545 LSGSHQTYWGTSNVDEQFQAGLNTAFE-----DINWTLSYSLTKNAWQKGRDQMLALNVN 599

Query: 575 VPLGT------------GTASYSGSMSND-RYVNMAGYTDT-FNDGLDSYSLNAGLNSGG 620
+P +ASYS S + R N+AG T D SYS+ G GG
Sbjct: 600 IPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGG 659

Query: 621 GLTSQRQINAYYSHRSPLANLSANIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGG 680
S A ++R N + S SGG G L G
Sbjct: 660 DGNSGSTGYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTL--GQPLND 716

Query: 681 TRLLVDTDGVGGVPVDGGQVV-TNRWGTGVVTDISSYYRNTTSVDLKRLPDDVEATRSVV 739
T +LV G V+ V T+ G V+ + Y N ++D L D+V+ +V
Sbjct: 717 TVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVA 776

Query: 740 ESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLS 799
T GAI +F G +L L + PFGA VTSE + G+VAD G +LS
Sbjct: 777 NVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLS 835

Query: 800 GVTPGETLSVNW--DGKIQCQVNVPETAISDQQLL----LPC 835
G+ + V W + C N S QQLL C
Sbjct: 836 GMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3520  FIMBRIALPAPE  30     0.003    Escherichia coli: P pili tip fibrillum papE protein...
		>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein

signature.
Length = 173

Score = 30.4 bits (68), Expect = 0.003
Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 27/172 (15%)

Query: 29 GMTLPEYWG----EEHVWWDGRAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDL-QNGF 83
G+ LP G +HV F G+++ PACT+ + ++ G+ +++L Q+G
Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAE----VNWGDIEIQNLVQSG- 60

Query: 84 SGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGET---PDKFNLSGQAKGINLQI 140
G ++ F++ + NC ++ ++ +T +G G + P+ SG I L
Sbjct: 61 -GNQKDFTVDM-NCPYS------LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYN 112

Query: 141 ADAR--GNIARAGKVMPAIPLTGNEEALDYTLRIVR----NGKKLEAGNYFA 186
++ GN G + +TG A TL N + L+AG + A
Sbjct: 113 SNNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSA 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3518  FIMREGULATRY  168    5e-58    Escherichia coli: P pili regulatory PapB protein si...
		>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein

signature.
Length = 104

Score = 168 bits (426), Expect = 5e-58
Identities = 104/104 (100%), Positives = 104/104 (100%)

Query: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60
MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS
Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60

Query: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104
RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD
Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104


90  APECO1_3468  APECO1_3454  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3468  1       16       1.800432   polysialic acid transport protein KpsM
APECO1_3467  1       18       4.117837   general secretion pathway protein YghD
APECO1_3466  1       15       4.147850   GspL-like protein
APECO1_3465  1       20       4.660825   type II secretion protein GspK
APECO1_3464  -1      19       5.118114   type II secretion protein GspJ
APECO1_3463  -1      19       4.411410   type II secretion protein GspI
APECO1_3462  -1      16       3.990889   type II secretion protein GspH
APECO1_3461  -2      15       3.283417   type II secretion protein GspG
APECO1_3460  -2      14       3.080400   type II secretion protein GspF
APECO1_3459  -1      12       1.255049   type II secretion protein GspE
APECO1_3458  -2      11       -0.216748  type II secretion protein GspD
APECO1_3457  -2      11       -0.580353  type II secretion protein GspC
APECO1_3456  -2      11       -0.195748  hypothetical protein
APECO1_3455  -3      12       0.233606   prepilin peptidase A
APECO1_3454  -3      12       0.791139   lipoprotein AcfD-like
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3468  ABC2TRNSPORT  33     6e-04    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein

signature.
Length = 262

Score = 33.4 bits (76), Expect = 6e-04
Identities = 29/125 (23%), Positives = 54/125 (43%), Gaps = 10/125 (8%)

Query: 137 ITNFLQLVLTWSLLIILS--CGVGLIF----MVVGKTFPEMQKVL---PILLKPLYFISC 187
+ L SLL L GL F MVV P + +++ P+ F+S
Sbjct: 135 VAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSG 194

Query: 188 IMFPLHSIPKQYWSYLLWNPLVHVVELSREAVMPGYISE-GVSLNYLAMFTLVTLFIGLA 246
+FP+ +P + + + PL H ++L R ++ + + + L ++ ++ F+ A
Sbjct: 195 AVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTA 254

Query: 247 LYRTR 251
L R R
Sbjct: 255 LLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3464  BCTERIALGSPG  29     0.011    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 28.7 bits (64), Expect = 0.011
Identities = 15/46 (32%), Positives = 22/46 (47%), Gaps = 2/46 (4%)

Query: 1 MRRARAGFTLLEMLVAIAIFASLA-LMAQQVTNGVTRVNSAVAGHD 45
+ R GFTLLE++V I I LA L+ + + + A D
Sbjct: 4 TDKQR-GFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3463  BCTERIALGSPH  32     3e-04    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 32.2 bits (73), Expect = 3e-04
Identities = 13/24 (54%), Positives = 18/24 (75%)

Query: 2 KRGFTLLEVMLALAIFALAAMAVL 25
+RGFTLLE+ML L + ++A VL
Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3462  BCTERIALGSPH  77     3e-20    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H

signature.
Length = 170

Score = 77.3 bits (190), Expect = 3e-20
Identities = 42/196 (21%), Positives = 71/196 (36%), Gaps = 41/196 (20%)

Query: 1 MPERGFTLLEIMLVIFLIGLASAGVVQTFATASEPPAKKAAQDFLTRFAQFKDRAVIEGQ 60
M +RGFTLLE+ML++ L+G+++ V+ F + + A + F + + R + GQ
Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60

Query: 61 TLGVLIDPPGYQFMQRRHGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120
GV + P +QF+ + P D W L L+
Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101

Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171
R+ ++ +L L + P + P TPF L L
Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148

Query: 172 AHDGALSLNQCDERMP 187
++ N E +P
Sbjct: 149 GEAPGIAFNARGESLP 164


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3461  BCTERIALGSPG  218    2e-76    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G

signature.
Length = 145

Score = 218 bits (556), Expect = 2e-76
Identities = 91/146 (62%), Positives = 109/146 (74%), Gaps = 3/146 (2%)

Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65
R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L
Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61

Query: 66 DNGRYPTTEQGLEALIQQPANMADARNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125
DN YPTT QGLE+L++ P A NY GYIKRLP DPWGNDY ++PGE G +D+
Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121

Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151
+ G DG+ E DI NW L + +
Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3460  BCTERIALGSPF  453    e-161    Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F

signature.
Length = 408

Score = 453 bits (1167), Expect = e-161
Identities = 226/406 (55%), Positives = 301/406 (74%), Gaps = 1/406 (0%)

Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKDLIPVHI-EARMNASAGGLLQRRRH 59
MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G
Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60

Query: 60 AHRRVATADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119
R++T+DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L
Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120

Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179
+D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL
Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180

Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPASTRMLIAMSDALQASGVYWLAGLLGLLVL 239
VVA VV+ILL+ VVPK++EQF H+ ALP STR+L+ MSDA++ G + L LL +
Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240

Query: 240 GQRLLKNPAMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299
+ +L+ R+ + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V
Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300

Query: 300 SANRYVEQQLLLAADRVREGSSLRAALADLRLFPPMMLYMIASGEQSGELETMLEQAAVN 359
+N Y +L LA D VREG SL AL LFPPMM +MIASGE+SGEL++MLE+AA N
Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360

Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405
Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++
Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3458  BCTERIALGSPD  575    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D

signature.
Length = 660

Score = 575 bits (1483), Expect = 0.0
Identities = 295/668 (44%), Positives = 429/668 (64%), Gaps = 34/668 (5%)

Query: 24 LLPLMLAAALCSSPVWEEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83
L L++ AAL P EE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+
Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143
LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127

Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203
VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD
Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187

Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262
+AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P
Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322
+R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306

Query: 323 REVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382
+ + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+
Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366

Query: 383 QWGSKDAGLMQFANGTQIPIGTLGAAISAAKPQKGSTVISENGATTINPDTNGDLST-LA 441
QW +K+AG+ QF N + +PI T A + +G +S+ LA
Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406

Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501
LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG
Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466

Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556
S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L
Sbjct: 467 SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525

Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPVIGNLFKSTADKKEKRNL 616
F R + VL GE +V+GGL+D ++ KVPLLGDIPVIG LF+ST+ K KRNL
Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585

Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPILPAQNQALPP 674
M+FIRPT++RD S +Y Q + E +++ I P Q+ A
Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645

Query: 675 EVRAFLNA 682
+V A ++A
Sbjct: 646 QVSAAIDA 653


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3457  BCTERIALGSPC  119    2e-34    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C

signature.
Length = 272

Score = 119 bits (300), Expect = 2e-34
Identities = 71/284 (25%), Positives = 116/284 (40%), Gaps = 38/284 (13%)

Query: 3 RGMFWLMLLIISAKMAYSLWRYFSFSAEYTAVSSSVN-KPLRADAKPFDKNDVQLVSQQN 61
R +F+L++L+ ++A WR A SSV P +A +P ND L
Sbjct: 16 RILFYLLMLLFCQQLAMIFWR---IGLPDNAPVSSVQITPAQARQQPVTLNDFTL----- 67

Query: 62 WFGKY-QPVAAPV-KQPESAPVAETRLNVVLRGIAFG---ARPGVVIEEGGKQQVYLQGE 116
FG + A + + + + LN+ L G+ G +R +I + +Q E
Sbjct: 68 -FGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNE 126

Query: 117 RLGSHNAVIEEINRDHVMLRYQGKMERLSLAEEERPPVAVTSKKAASDEAKQAVAEPVVS 176
+ +NA I I D V+L+YQG+ E L L +E + SD A
Sbjct: 127 EVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQE---------DSGSDGVPGAQVN---- 173

Query: 177 APVEIPAAVRQALAKDPQKIFNYIQLTPVRKEG-IVGYAVKPGADRSLFDASGFREGDIA 235
Q + + +Y+ +P+ + + GY + PG F G ++ D+A
Sbjct: 174 ---------EQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMA 224

Query: 236 IALNQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARYDISIAL 279
+ALN D D M ++ + + LTV R G R DI +
Sbjct: 225 VALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3455  PREPILNPTASE  283    1e-97    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family

signature.
Length = 290

Score = 283 bits (726), Expect = 1e-97
Identities = 111/276 (40%), Positives = 151/276 (54%), Gaps = 12/276 (4%)

Query: 31 LTMLFDVFQQYPAAMPILATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGETPSTQSKI 89
+ +L ++ P L + L+IGSFLNVVI R PIML R+ AE+ +
Sbjct: 1 MALLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGV 60

Query: 90 -----SLALPRSHCPHCQQTIRVRDNIPLLSWLMLKGRCRDCQAKISKRYPLVELLTALA 144
+L +PRS CPHC I +NIPLLSWL L+GRCR CQA IS RYPLVELLTAL
Sbjct: 61 DEPPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALL 120

Query: 145 FLLASLVWPESGWGLAVMILSAWLIAASIIDLDNQWLPDVFTQGVLWTGLIAAWAQQSPL 204
+ ++ LA ++L+ L+A + IDLD LPD T +LW GL+ +
Sbjct: 121 SVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFV 179

Query: 205 TLQDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGPLSLPNVALIAS 264
+L DAV G + G++ +SL W ++ KE +G GD L AALG W+G +LP V L++S
Sbjct: 180 SLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSS 239

Query: 265 CCGLIYAVI-----TKRGSTTLPFGPCLSLGGIATL 295
G + S +PFGP L++ G L
Sbjct: 240 LVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIAL 275


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3454  PF03544       48     1e-07    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 48.0 bits (114), Expect = 1e-07
Identities = 29/107 (27%), Positives = 41/107 (38%), Gaps = 8/107 (7%)

Query: 46 PEVKPDPTPTPEPTPEPTPDPEPTPDPTPD-PEPTPEPEPEPVPTKTGYLTLGGSQRVTG 104
V+P P P EP PEP P PEP + +P P+P+P+P P K ++
Sbjct: 64 QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK-------VEQPKR 116

Query: 105 ATCNGESSDGFTFTPGNTVSCVVGSTTIATFNTQSEAARSLRAVDKV 151
ES F + T AT + A RA+ +
Sbjct: 117 DVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRN 163



Score = 41.1 bits (96), Expect = 2e-05
Identities = 14/87 (16%), Positives = 21/87 (24%), Gaps = 1/87 (1%)

Query: 35 TPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPT-KTGY 93
P PE P+P E P+ + +PV +
Sbjct: 69 PPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASP 128

Query: 94 LTLGGSQRVTGATCNGESSDGFTFTPG 120
R T +T +S T
Sbjct: 129 FENTAPARPTSSTATAATSKPVTSVAS 155



Score = 38.8 bits (90), Expect = 9e-05
Identities = 16/58 (27%), Positives = 22/58 (37%), Gaps = 1/58 (1%)

Query: 31 SSSDTPSVDSGSGTLPEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVP 88
S + + + + P P P PEP +P P+PEP PEP E
Sbjct: 36 SVHQVIELPAPAQPISVTMVAPADLEPP-QAVQPPPEPVVEPEPEPEPIPEPPKEAPV 92



Score = 36.9 bits (85), Expect = 4e-04
Identities = 16/46 (34%), Positives = 19/46 (41%)

Query: 46 PEVKPDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPTKT 91
P T EP +P P+P +PEP PEP PEP
Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAP 91



Score = 36.1 bits (83), Expect = 6e-04
Identities = 17/40 (42%), Positives = 17/40 (42%)

Query: 50 PDPTPTPEPTPEPTPDPEPTPDPTPDPEPTPEPEPEPVPT 89
P P T D EP P PEP EPEPEP P
Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83


91  APECO1_3210  APECO1_3203  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3210  -1      16       -0.858087  serine endoprotease
APECO1_3209  -2      13       -0.969347  serine endoprotease
APECO1_3208  0       13       -0.839014  malate dehydrogenase
APECO1_3207  -1      13       -1.264658  arginine repressor
APECO1_3206  -2      13       -0.662311  hypothetical protein
APECO1_3205  -1      13       0.530134   hypothetical protein
APECO1_3204  -2      12       1.348608   p-hydroxybenzoic acid efflux subunit AaeB
APECO1_3203  -1      11       1.539956   p-hydroxybenzoic acid efflux subunit AaeA
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3210  V8PROTEASE    73     3e-16    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 72.7 bits (178), Expect = 3e-16
Identities = 30/184 (16%), Positives = 63/184 (34%), Gaps = 38/184 (20%)

Query: 90 GLGSGVIINANKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137
+ SGV++ + +LTN HV++ L +G ++ +
Sbjct: 102 FIASGVVVGKDT--LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190
D+A+++ + ++++ + +V G P ++ +
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212

Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249
S + L+ +Q D S GNSG + N E+IGI+ G+
Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263

Query: 250 MART 253
+
Sbjct: 264 VFIN 267


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3209  V8PROTEASE    53     8e-10    V8 serine protease family signature.
		>V8PROTEASE#V8 serine protease family signature.

Length = 336

Score = 52.7 bits (126), Expect = 8e-10
Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%)

Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124
+ SGV++ + ++TNKHV++ AL+ +G +
Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159

Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177
DLA++K + + ++ + + G P + T + G
Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216

Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217
+I + +Q D S GNSG + N E++GI+
Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3208  DHBDHDRGNASE  28     0.045    2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase sig...
		>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase

signature.
Length = 261

Score = 28.1 bits (62), Expect = 0.045
Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%)

Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62
+ GAA GIG+A+A L G+ ++ D P V S A + F +
Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66

Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104
A + D+++ AGV R + F+VN+ V N +
Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126

Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146
V+K + + + +NP ++AA KA K G+
Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3207  ARGREPRESSOR  168    9e-57    Bacterial arginine repressor signature.
		>ARGREPRESSOR#Bacterial arginine repressor signature.

Length = 149

Score = 168 bits (428), Expect = 9e-57
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%)

Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74
+ ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+
Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69

Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131
S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI
Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128

Query: 132 FTTPANGFTVKELYEAILELF 152
K + + ILEL
Sbjct: 129 LIICRTHDDTKVVQKKILELL 149


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3203  RTXTOXIND     54     2e-10    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 54.4 bits (131), Expect = 2e-10
Identities = 29/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%)

Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61
SR V I+ F+ I + + S I P + ++ ++
Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110

Query: 62 NVHDNQLVKKGQVLFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114
V + + V+KG VL + Q +L +A+ + YQ+L++ E + L
Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168

Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154
+ + VL + Q + Q + +L+L++
Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211



Score = 51.4 bits (123), Expect = 2e-09
Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%)

Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150
E R + ++ + ++EE + + +L +L + +
Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323

Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208
+ +VIRAP V L V+T G +T T + +V ++ V A ++ + + G
Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383

Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231
A I P L G V ++
Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410


92  APECO1_3195  APECO1_3191  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3195  -2      16       1.490579   rod shape-determining protein MreC
APECO1_3194  -2      13       0.630530   rod shape-determining protein MreB
APECO1_3193  -2      11       -0.268731  regulatory protein CsrD
APECO1_3192  -1      12       -1.555876  oxidoreductase, Zn-dependent and NAD(P)-binding
APECO1_3191  1       13       -3.452132  acetyl-CoA carboxylase biotin carboxyl carrier
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3195  PF03544       28     0.043    Gram-negative bacterial tonB protein
		>PF03544#Gram-negative bacterial tonB protein

Length = 243

Score = 28.4 bits (63), Expect = 0.043
Identities = 12/72 (16%), Positives = 20/72 (27%), Gaps = 3/72 (4%)

Query: 296 MMPQVLPSPDAMGPKLPEPATGITQPTPQQPATGNAVTAPAAPTQPAANRSPQRATPPQS 355
P+ +P P P + E +P P+ V P +P +R
Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK---KVEQPKRDVKPVESRPASPFENTAP 134

Query: 356 GAQPPARAPGGQ 367
+ A
Sbjct: 135 ARPTSSTATAAT 146


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3194  SHAPEPROTEIN  576    0.0      Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein

signature.
Length = 347

Score = 576 bits (1487), Expect = 0.0
Identities = 347/347 (100%), Positives = 347/347 (100%)

Query: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60
MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK
Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60

Query: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120
QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ
Sbjct: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120

Query: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180
VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG
Sbjct: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180

Query: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240
VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN
Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240

Query: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300
LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL
Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300

Query: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347
RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE
Sbjct: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3192  NUCEPIMERASE  29     0.017    Nucleotide sugar epimerase signature.
		>NUCEPIMERASE#Nucleotide sugar epimerase signature.

Length = 334

Score = 29.4 bits (66), Expect = 0.017
Identities = 11/28 (39%), Positives = 17/28 (60%)

Query: 150 IVVTGASGGVGSTAVALLHKLGYQVVAV 177
+VTGA+G +G L + G+QVV +
Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGI 30


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3191  RTXTOXIND     27     0.026    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D

signature.
Length = 478

Score = 27.5 bits (61), Expect = 0.026
Identities = 8/27 (29%), Positives = 16/27 (59%)

Query: 127 IEADKSGTVKAILVESGQPVEFDEPLV 153
I+ ++ VK I+V+ G+ V + L+
Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLL 125


93  APECO1_3177  APECO1_3173  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3177  -2      11       -1.779080  DNA-binding protein Fis
APECO1_3176  -2      12       -1.668438  methyltransferase
APECO1_3175  -2      13       -1.255595  DNA-binding transcriptional regulator EnvR
APECO1_3174  -2      12       -1.032429  acriflavine resistance protein E
APECO1_3173  -2      14       -1.775491  acriflavine resistance protein F
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3177  DNABINDNGFIS  157    3e-54    DNA-binding protein FIS signature.
		>DNABINDNGFIS#DNA-binding protein FIS signature.

Length = 98

Score = 157 bits (399), Expect = 3e-54
Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ
Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60

Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN
Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3175  HTHTETR       127    7e-39    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 127 bits (321), Expect = 7e-39
Identities = 77/209 (36%), Positives = 122/209 (58%), Gaps = 3/209 (1%)

Query: 1 MAKRTKAEALKTRQELIETAIAQFAQHGVSKTTLNDIADAANVTRGAIYWHFENKTQLFN 60
MA++TK EA +TRQ +++ A+ F+Q GVS T+L +IA AA VTRGAIYWHF++K+ LF+
Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60

Query: 61 EMW-LQQPSLRELIQDHLTAGLEHDPFQQLREKLIVGLQYIAKIPRQQALLKILYHKCEF 119
E+W L + ++ EL + A DP LRE LI L+ R++ L++I++HKCEF
Sbjct: 61 EIWELSESNIGELELE-YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119

Query: 120 NDEM-LAEGVIREKMGFNPQTLREVLQACQQQGCVANNLDLDVVMIIIDGAFSGIVQNWL 178
EM + + R + + + L+ C + + +L II+ G SG+++NWL
Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179

Query: 179 MNMAGYDLYKQAPALVDNVLRMFMPDENI 207
+DL K+A V +L M++ +
Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTL 208


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3174  RTXTOXIND     43     1e-06    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 43.3 bits (102), Expect = 1e-06
Identities = 38/217 (17%), Positives = 70/217 (32%), Gaps = 38/217 (17%)

Query: 98 ATYQASYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 156
K +L + E+ A + Q + I D RQ +
Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311

Query: 157 IAAKATVESARINLAYTKVTAPISGRIGK-STVTEGALVTNGQTTELATVQQLDPIYVDV 215
+ + + AP+S ++ + TEG +VT +T + V + D + V
Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTA 370

Query: 216 TQSSND--FMRLKQSVEQGNLHKENATSNVELVMENGQTYP-LKGTLQ--FSDVTVDEST 270
+ D F+ + Q+ +++ Y L G ++ D D+
Sbjct: 371 LVQNKDIGFINVGQNAI------------IKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418

Query: 271 GSIT--LRAV------FPNPQHTLLPGMFVRARIDEG 299
G + + ++ N L GM V A I G
Sbjct: 419 GLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455



Score = 34.4 bits (79), Expect = 7e-04
Identities = 22/127 (17%), Positives = 43/127 (33%), Gaps = 13/127 (10%)

Query: 46 TAPLEVKTELPGR-TNAYRIAEVRPQVSGIVLNRNFTEGSDVQAGQSLYQIDPATYQASY 104
+E+ G+ T++ R E++P + IV EG V+ G L ++ +A
Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA-- 134

Query: 105 DSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADAAVIAAKATVE 164
+ K++++ A L RY L E ++ +
Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184

Query: 165 SARINLA 171
+L
Sbjct: 185 LRLTSLI 191



Score = 29.0 bits (65), Expect = 0.031
Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 1/34 (2%)

Query: 65 AEVRPQVSGIVLNRN-FTEGSDVQAGQSLYQIDP 97
+ +R VS V TEG V ++L I P
Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3173  ACRIFLAVINRP  1405   0.0      Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 1405 bits (3639), Expect = 0.0
Identities = 1029/1034 (99%), Positives = 1032/1034 (99%)

Query: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60
MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT
Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60

Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120
VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ
Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120

Query: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180
EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA
Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180

Query: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240
QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240

Query: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300
KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL
Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300

Query: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIQEVVKTLFEAIMLVFLVMYLFLQ 360
DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI EVVKTLFEAIMLVFLVMYLFLQ
Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360

Query: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420
NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM
Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420

Query: 421 MEDKLPPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480
MEDKLPP+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480

Query: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540

Query: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600
LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN
Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600

Query: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERSGDENSAEAVIHRAKMELGKIRDG 660
EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER+GDENSAEAVIHRAKMELGKIRDG
Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660

Query: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720
FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL
Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720

Query: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKVYVQADAKFRM 780
EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK+YVQADAKFRM
Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780

Query: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840
LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM
Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840

Query: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900
ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV
Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900

Query: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960
MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV
Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960

Query: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020
EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF
Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020

Query: 1021 VPVFFVVIRRCFKG 1034
VPVFFVVIRRCFKG
Sbjct: 1021 VPVFFVVIRRCFKG 1034


94  APECO1_3128  APECO1_3113  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3128  -1      20       -1.574128  general secretion pathway protein C
APECO1_3127  -1      18       -0.458313  general secretion pathway protein D
APECO1_3126  0       23       0.012876   general secretion pathway protein E
APECO1_3125  1       24       -0.772041  general secretion pathway protein F
APECO1_3124  3       24       -1.162913  general secretion pathway protein G
APECO1_3123  3       24       -1.737883  general secretion pathway protein H
APECO1_3122  2       22       -3.046960  general secretion pathway protein I
APECO1_3121  2       23       -2.728906  general secretion pathway protein J
APECO1_31202224-3.680810general secretion pathway protein K
APECO1_3120  -1      19       -2.730458  general secretion pathway protein K
APECO1_3119  0       21       -1.880605  general secretion pathway protein L
APECO1_3118  2       33       -1.368276  general secretion pathway protein M
APECO1_3117  2       36       -1.150205  type 4 prepilin-like proteins leader peptide
APECO1_3116  3       41       -1.013709  bacterioferritin
APECO1_3115  4       43       -0.496857  bifunctional chitinase/lysozyme
APECO1_3114  7       54       -0.102152  elongation factor Tu
APECO1_3113  4       44       -0.398663  elongation factor G
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3128  BCTERIALGSPC  84     4e-21    Bacterial general secretion pathway protein C signa...
		>BCTERIALGSPC#Bacterial general secretion pathway protein C signature.

Length = 272

Score = 83.9 bits (207), Expect = 4e-21
Identities = 53/200 (26%), Positives = 94/200 (47%), Gaps = 15/200 (7%)

Query: 59 EFSLAALWRNENHAGVKDANPVAVNQETPKLSIALNGIVLTSNDETSFVLINEGNEQKRY 118
+F+L + +N AG DA N L+++L G++ +D S +I++ NEQ
Sbjct: 64 DFTLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSR 122

Query: 119 SLNEALESAPGT--FIRKINKTSVVFETHGHYEKVTLH-------PGLP--DIIKQPDSE 167
+NE + PG I I VV + G YE + L+ G+P + +Q
Sbjct: 123 GVNEEV---PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQR 179

Query: 168 NQNVLADYIIATPIRDGEQIYGLRLNPRKGLNAFTTSLLQPGDIALRINNLSLTHPDEVS 227
++DY+ +PI + ++ G RLNP ++F LQ D+A+ +N L L ++
Sbjct: 180 ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAK 239

Query: 228 QALSLLLTQQSAQFTIRRNG 247
+A+ + + T+ R+G
Sbjct: 240 KAMERMADVHNFTLTVERDG 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3127  BCTERIALGSPD  716    0.0      Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 716 bits (1850), Expect = 0.0
Identities = 344/629 (54%), Positives = 466/629 (74%), Gaps = 11/629 (1%)

Query: 11 ITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRS 70
+T + AALL A E++ A+F DI++F+ V ++L KT++IDPSV+GTI+VRS
Sbjct: 12 LTLLIFAALLF---RPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68

Query: 71 NDTFSQQEYYQFFLSILDLYGYSVITLDNGFLKVVRSANVKTSPGMIADSSRPGVGDELV 130
D ++++YYQFFLS+LD+YG++VI ++NG LKVVRS + KT+ +A + PG+GDE+V
Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVV 128

Query: 131 TRIVPLENVPARDLAPLLRQMMDAGSVGNVVHYEPSNVLILTGRASTINKLIEVIKRVDV 190
TR+VPL NV ARDLAPLLRQ+ D VG+VVHYEPSNVL++TGRA+ I +L+ +++RVD
Sbjct: 129 TRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDN 188

Query: 191 IGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSAKIVADKRTNSLIISGPEK 250
G + L +ASA D+ +++ +L ++ KS +P + A +VAD+RTN++++SG
Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTEL-NKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247

Query: 251 ARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNSRKPSSTS 310
+RQRI +++K LD +++ +GNT+V YLKYAKA++LVEVLTG+S ++ EK ++ +
Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK--PVAA 305

Query: 311 AMDNVAITADEQTNSLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNLG 370
N+ I A QTN+L++TA V L VIA+LDIRR QVLVEAII EVQD +GLNLG
Sbjct: 306 LDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLG 365

Query: 371 VQWANKNVGAQQFTNTGLPVFNAAQGVADYKKNGGITSANPAWDMFSAYNGMAAGFFNGD 430
+QWANKN G QFTN+GLP+ A G Y K+G ++S+ S++NG+AAGF+ G+
Sbjct: 366 IQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA--SALSSFNGIAAGFYQGN 423

Query: 431 WGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGDNVFNTVERKT 490
W +LLTAL+S+ KNDILATPSIVTLDN A+FNVGQ+VPVL+GSQTTSGDN+FNTVERKT
Sbjct: 424 WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKT 483

Query: 491 VGTKLKVTPQVNEGDAVLLEIEQEVSSVD---SSSNSTLGPTFNTRTIQNAVLVKTGETV 547
VG KLKV PQ+NEGD+VLLEIEQEVSSV SS++S LG TFNTRT+ NAVLV +GETV
Sbjct: 484 VGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETV 543

Query: 548 VLGGLLDDFSKEQVSKVPLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRSL 607
V+GGLLD + KVPLLGDIP++G LFR TS + +KRNLM+FIRPT+IRD D YR
Sbjct: 544 VVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQA 603

Query: 608 SKEKYTRYRQEQQLRIDGKSKALIGSEDL 636
S +YT + Q + ++ + ++DL
Sbjct: 604 SSGQYTAFNDAQSKQRGKENNDAMLNQDL 632


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3125  BCTERIALGSPF  512    0.0      Bacterial general secretion pathway protein F signa...
		>BCTERIALGSPF#Bacterial general secretion pathway protein F signature.

Length = 408

Score = 512 bits (1321), Expect = 0.0
Identities = 195/405 (48%), Positives = 283/405 (69%), Gaps = 8/405 (1%)

Query: 2 NYRYRAMTQDGQKLQGIIDANDERQARLRLREEGLFLLDIRPQK-------SSGVKTRRP 54
Y Y+A+ G+K +G +A+ RQAR LRE GL L + + S+G+ RR
Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62

Query: 55 -RISHSELTLFTRQLATLSAAALPLEESLAVIGQQSSNNRLADVLNQVRSAILEGHPLSD 113
R+S S+L L TRQLATL AA++PLEE+L + +QS L+ ++ VRS ++EGH L+D
Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122

Query: 114 ALQHFPTLFDSLYRTLVKAGEKSGLLAPVLEKLADYNENRQKIRSKLIQSLIYPCMLTTV 173
A++ FP F+ LY +V AGE SG L VL +LADY E RQ++RS++ Q++IYPC+LT V
Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182

Query: 174 AIVVVIILLTAVVPKITEQFVHMKQQLPLSTRILLGLSDTLQRTGPTLLATVFIVAVGFW 233
AI VV ILL+ VVPK+ EQF+HMKQ LPLSTR+L+G+SD ++ GP +L + + F
Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242

Query: 234 LWLKRGNNRHRFHAMLLRVALIGPLICAINSARYLRTLSILQSSGVPLLDGMNLSTESLN 293
+ L++ R FH LL + LIG + +N+ARY RTLSIL +S VPLL M +S + ++
Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302

Query: 294 NLEIRQRLANAAENVRQGNSIHLSLEQTAIFPPMMLYMVASGEKSGQLGTLMVRAADNQE 353
N R RL+ A + VR+G S+H +LEQTA+FPPMM +M+ASGE+SG+L +++ RAADNQ+
Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362

Query: 354 TLQQNRIALTLSIFEPALIITMALIVLFIVVSVLQPLLQLNSMIN 398
+++ L L +FEP L+++MA +VLFIV+++LQP+LQLN++++
Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3124  BCTERIALGSPG  249    1e-88    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 249 bits (636), Expect = 1e-88
Identities = 144/145 (99%), Positives = 144/145 (99%)

Query: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60
MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK
Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60

Query: 61 LDNHRYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120
LDNH YPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL
Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120

Query: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145
LSAGPDGEMGTEDDITNWGLSKKKK
Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3123  BCTERIALGSPH  142    9e-46    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 142 bits (358), Expect = 9e-46
Identities = 50/154 (32%), Positives = 76/154 (49%), Gaps = 18/154 (11%)

Query: 3 QQRGFTLLEMMLVLALVAITASVVLFTYGREDAASTRARETAARFTAALELAIDRATLSG 62
+QRGFTLLEMML+L L+ ++A +VL + + A +T ARF A L R +G
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFP--ASRDDSAAQTLARFEAQLRFVQQRGLQTG 59

Query: 63 QPVGIHFSDSAWRIMV----PGKTP-------SAWRWVPLQEDAADESKNDWGEELSIQL 111
Q G+ W+ +V G P S +RW+PL+ S + G +L++
Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAF 119

Query: 112 ---QPFKPDDSNQPQVVILADGQITPFSLLMANA 142
+ + P D P V+I G++TPF L + A
Sbjct: 120 AQGEAWTPGD--NPDVLIFPGGEMTPFRLTLGEA 151


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3122  BCTERIALGSPG  31     9e-04    Bacterial general secretion pathway protein G signa...
		>BCTERIALGSPG#Bacterial general secretion pathway protein G signature.

Length = 145

Score = 31.0 bits (70), Expect = 9e-04
Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 4/91 (4%)

Query: 14 MNKQSGMTLLEVLLAMSIFTAVALTLMSSMQGQ--RTAIERMRNETLALWIADNQLQSQD 71
+KQ G TLLE+++ + I +A ++ ++ G + ++ ++ +AL A + + D
Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK-LD 62

Query: 72 SFDEENTSSSGKELINGEELINGEEWNWRSD 102
+ T+ + L+ L N+ +
Sbjct: 63 NHHYPTTNQGLESLVEAPTL-PPLAANYNKE 92


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3121  BCTERIALGSPH  34     2e-04    Bacterial general secretion pathway protein H signa...
		>BCTERIALGSPH#Bacterial general secretion pathway protein H signature.

Length = 170

Score = 33.8 bits (77), Expect = 2e-04
Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 4 RQQGFTLLEVMAALAIFSMLSVLAFMIFSQASELHQRSQKEIQQFNQ 50
RQ+GFTLLE+M L + + + + + F + + + + + +F
Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD--DSAAQTLARFEA 46


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3117  PREPILNPTASE  152    1e-47    Type IV prepilin cysteine protease (C20) family sig...
		>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature.

Length = 290

Score = 152 bits (386), Expect = 1e-47
Identities = 88/262 (33%), Positives = 119/262 (45%), Gaps = 47/262 (17%)

Query: 5 LPLFILVGFIAGYFVNVMAYHL---------------SPLEDKTALTFRQVLVH------ 43
L L + G F+NV+ + L +D+ L+
Sbjct: 16 FSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSCCP 75

Query: 44 FWQKKYAWHDTVPLI-------------------------LCVAAAIACALAPFTPIVTG 78
+ +PL+ L ++A A+ T
Sbjct: 76 HCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMTLAPGWGTL 135

Query: 79 ALFLYFCFALTLSVIDFRTQLLPDKLTLPLLWLGLVFNAQSGLIDLHDAVYGAVAGYGVL 138
A L + L+ ID LLPD+LTLPLLW GL+FN G + L DAV GA+AGY VL
Sbjct: 136 AALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVL 195

Query: 139 WCVYWGVWLVCHKEGLGYGDFKLLAAAGAWCGWQTLPMILLIASLGGIGYAIVSQLLQRR 198
W +YW L+ KEG+GYGDFKLLAA GAW GWQ LP++LL++SL G I LL+
Sbjct: 196 WSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNH 255

Query: 199 TITT-IAFGPWLALGSMINLGY 219
+ I FGP+LA+ I L +
Sbjct: 256 HQSKPIPFGPYLAIAGWIALLW 277


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3116  HELNAPAPROT   35     3e-05    Helicobacter neutrophil-activating protein A family ...
		>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature.

Length = 153

Score = 35.2 bits (81), Expect = 3e-05
Identities = 28/150 (18%), Positives = 59/150 (39%), Gaps = 24/150 (16%)

Query: 5 TKVINYLNKLLGNE---LVAINQYFLHARMFKNWGLKRLNDVEYHESIDEM-----KHAD 56
T V N LN L N ++++ +W +K + HE +E+ + D
Sbjct: 11 TLVENSLNTQLSNWFLLYSKLHRF--------HWYVKGPHFFTLHEKFEELYDHAAETVD 62

Query: 57 RYIERILFLEGLPN--LQDLGKL------NIGEDVEEMLRSDLALELDGAKNLREAIGYA 108
ER+L + G P +++ + EM+++ + + + IG A
Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122

Query: 109 DSVHDYVSRDMMIEILRDEEGHIDWLETEL 138
+ D + D+ + ++ + E + L + L
Sbjct: 123 EENQDNATADLFVGLIEEVEKQVWMLSSYL 152


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3114  TCRTETOQM     80     3e-18    Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 79.5 bits (196), Expect = 3e-18
Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%)

Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66
+N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T
Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59

Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126
+ +D PGH D++ + + +DGAIL+++A DG QTR R++
Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119

Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186
G+P I F+NK D + L V +++E LS + + +W+
Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176

Query: 187 AKILELAGFLDSYIPEPE 204
I L+ Y+
Sbjct: 177 TVIEGNDDLLEKYMSGKS 194


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3113  TCRTETOQM     613    0.0      Tetracycline resistance protein TetO/TetQ/TetM family ...
		>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature.

Length = 639

Score = 613 bits (1583), Expect = 0.0
Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%)

Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68
+ NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T
Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61

Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128
+ W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++
Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114

Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188
K +P I F+NK+D+ G + V IK +L A V Q V M
Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164

Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248
N+ +++Q ++ E +++L+EKY+ G+ L E+
Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200

Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308
+ R N + V GSA N G+ +++ + + S
Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243

Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368
FKI L + R+YSGV++ D+V S K + + +
Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299

Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424
+ +I + +G+I L V GDT P ER+E P P++ VEP
Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354

Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484
+E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE +
Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414

Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544
+P V Y E +K E + + + + + PL GS G ++ + + G
Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468

Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604
+ + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++
Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527

Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664
KKA LLEP + ++ P+E D + + + + V + E+P +
Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587

Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702
Y + L T GR+ E Y + V + R
Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622


95  APECO1_3107  APECO1_3100  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3107  1       14       1.586835   hypothetical protein
APECO1_3106  0       13       2.314234   FKBP-type peptidyl-prolyl cis-trans isomerase
APECO1_3105  -2      13       2.865420   FKBP-type peptidyl-prolyl cis-trans isomerase
APECO1_3104  -1      13       2.627355   glutathione-regulated potassium-efflux system
APECO1_3103  -1      17       2.134370   glutathione-regulated potassium-efflux system
APECO1_3102  -2      14       1.705561   ABC transporter ATP-binding protein
APECO1_3101  -2      12       1.250331   hydrolase
APECO1_3100  -2      13       1.141363   phosphoribulokinase PrkB
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3107  ACRIFLAVINRP  29     0.023    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 28.7 bits (64), Expect = 0.023
Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 164 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 222
A +V+D VTQ +E + ++ + S + + + L + D A QV ++L +
Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113

Query: 223 SK 224
+
Sbjct: 114 AT 115


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3106  INFPOTNTIATR  132    5e-40    Macrophage infectivity potentiator signature.
		>INFPOTNTIATR#Macrophage infectivity potentiator signature.

Length = 233

Score = 132 bits (334), Expect = 5e-40
Identities = 79/226 (34%), Positives = 124/226 (54%), Gaps = 9/226 (3%)

Query: 28 AAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87
A A A + D K +Y++GA LG K + GI ++ D L G+QD
Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66

Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146
+ + L++++++ L F+ + + A+ K A +N+AKG + + G+ +
Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126

Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206
GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L
Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186

Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251
+ + G ++ +P +LAYG V G I PN TL+F + L+ VK A
Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3104  60KDINNERMP   31     0.021    60kDa inner membrane protein signature.
		>60KDINNERMP#60kDa inner membrane protein signature.

Length = 548

Score = 30.7 bits (69), Expect = 0.021
Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%)

Query: 261 TAIDPFKGLLLG---LFFISVGMSLNLGVLYTHL-LWVVISVVVLVAVKILVLYLLARLY 316
A+ P L + L+FIS + L +++ + W +++ V+ ++ L
Sbjct: 318 AAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA-- 375

Query: 317 GVRSSERMQ 325
S +M+
Sbjct: 376 QYTSMAKMR 384


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3103  ISCHRISMTASE  32     0.001    Isochorismatase signature.
		>ISCHRISMTASE#Isochorismatase signature.

Length = 312

Score = 31.9 bits (72), Expect = 0.001
Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 16/135 (11%)

Query: 12 YAHPESQDSVANRVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 69
Y P + D N+V P + + +HD+ ++ D F L + +
Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68

Query: 70 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRSVITTGEPESA------Y 119
P+ + P DR L F GPG N +G Y +IT PE +
Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124

Query: 120 RYDALNRYPMSDVLR 134
RY A R + +++R
Sbjct: 125 RYSAFKRTNLLEMMR 139


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3102  GPOSANCHOR    33     0.004    Gram-positive coccus surface protein anchor signature.
		>GPOSANCHOR#Gram-positive coccus surface protein anchor signature.

Length = 539

Score = 32.7 bits (74), Expect = 0.004
Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%)

Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563
+ D + ++ E + + ++ R+ +R R + L E
Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331

Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602
+LE++ + +L A+ + EE+ SE QS + +L A
Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391

Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634
+ + + LEE L A E+L + L E
Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422



Score = 32.0 bits (72), Expect = 0.008
Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%)

Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572
+ + ++ + E A A + D ++ + +++
Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179

Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632
L A+ A E + + E + TA + + ++ + ++ LE +
Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239

Query: 633 EGQSN 637
++
Sbjct: 240 FSTAD 244


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3100  PF07299       32     0.002    Fibronectin-binding protein (FBP)
		>PF07299#Fibronectin-binding protein (FBP)

Length = 219

Score = 31.8 bits (72), Expect = 0.002
Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%)

Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116
P+ + + E ++ KG SRK++ ++ + + GTF
Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155


96  APECO1_3016  APECO1_3006  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_3016  2       20       -5.570708  acetyltransferase YhhY
APECO1_3015  2       17       -4.513778  hypothetical protein
APECO1_3014  1       13       -0.052562  hypothetical protein
APECO1_3013  -1      16       2.407527   hypothetical protein
APECO1_3012  -2      19       3.057152   gamma-glutamyltranspeptidase
APECO1_3011  -2      22       2.944548   hypothetical protein
APECO1_3010  -1      24       3.435912   glycerophosphodiester phosphodiesterase
APECO1_3009  -1      25       3.200610   glycerol-3-phosphate transporter ATP-binding
APECO1_3008  -1      25       3.089176   glycerol-3-phosphate transporter membrane
APECO1_3007  -2      26       3.558038   glycerol-3-phosphate transporter permease
APECO1_3006  -2      23       3.288595   glycerol-3-phosphate transporter periplasmic
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3016  SACTRNSFRASE  35     4e-05    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 35.3 bits (81), Expect = 4e-05
Identities = 20/92 (21%), Positives = 32/92 (34%), Gaps = 16/92 (17%)

Query: 55 VACIDGIVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMID------MCD 108
+ ++ +G + I + + D + D R K GV +AL+ + I+ C
Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125

Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKFGFEIEG 140
L I N A Y K F I
Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3012  NAFLGMOTY     32     0.005    Sodium-type flagellar protein MotY precursor signature.
		>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature.

Length = 293

Score = 32.0 bits (72), Expect = 0.005
Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%)

Query: 272 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQIMAEAEKYA 330
R P+ G+ R + SMPPP G H +I N+ F Q G+ G A I++E EK
Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133

Query: 331 YADRSEYLGDPDFVKVPWQA 350
Y P F WQ+
Sbjct: 134 Y---------PTFSYQDWQS 144


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3010  PF04619       28     0.020    Dr-family adhesin
		>PF04619#Dr-family adhesin

Length = 160

Score = 28.4 bits (63), Expect = 0.020
Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 4/60 (6%)

Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84
+G ++ D + G+ FL+ D+N ++ W + D GSW
Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3009  PF05272       29     0.042    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 28.9 bits (64), Expect = 0.042
Identities = 10/29 (34%), Positives = 16/29 (55%)

Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTTGDI 61
+V+ G G GKSTL+ + GL+ +
Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_3006  MALTOSEBP     39     3e-05    Maltose binding protein signature.
		>MALTOSEBP#Maltose binding protein signature.

Length = 396

Score = 39.3 bits (91), Expect = 3e-05
Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%)

Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPKQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193
G L++ P L YNKD PPKTW+++ +LKA G + +
Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179

Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251
+ +A G F +N +D D ++ K + +++ + D Y
Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236

Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291
+ F G+ AMT + +NI + +K NYGV ++P
Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274


97  APECO1_2965  APECO1_2961  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_2965  -2      10       2.449697   ABC transporter membrane protein
APECO1_2964  -3      9        2.078111   fused ribosome-associated ATPase: ATP-binding
APECO1_2963  0       9        0.447280   hypothetical protein
APECO1_2962  0       10       -0.824889  hypothetical protein
APECO1_2961  0       14       1.836185   hypothetical protein
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2965  ABC2TRNSPORT  50     4e-09    ABC-2 type transport system membrane protein signat...
		>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature.

Length = 262

Score = 50.3 bits (120), Expect = 4e-09
Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%)

Query: 200 REREHGTVEHLLVMPITPFEIMMAKV-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258
R T E +L + +I++ ++ W+ L +G+ +V G + L
Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148

Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317
+ L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q
Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208

Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 367
+P +H + L + I+ ++ + I FF + ALLR R
Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2964  PF05272       30     0.046    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 30.0 bits (67), Expect = 0.046
Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62
V L G G+GKS+L++ + G
Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2963  RTXTOXIND     83     9e-20    Gram-negative bacterial RTX secretion protein D signat...
		>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature.

Length = 478

Score = 83.3 bits (206), Expect = 9e-20
Identities = 71/408 (17%), Positives = 138/408 (33%), Gaps = 81/408 (19%)

Query: 6 RHLAWWGVGALAVAAVVAWWLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64
R +A++ +G L +A +++ G +GR + I + I+VK
Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113

Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91
EG+ VR+G+VL K+ L++ R + +
Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173

Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132
Q Q+ + L+++++E + +N+ +
Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233

Query: 133 AKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190
R SL + AI+ + + A L K+Q+ ++ I +A+
Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293

Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236
QT T + S ++AP +V Q +V G V+
Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353

Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295
++ +V D +T + + G + +G A + ++A P R V V
Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410

Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341
+E D+RL L+F V I L + + +G+ A ++
Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2961  ALARACEMASE   29     0.023    Alanine racemase signature.
		>ALARACEMASE#Alanine racemase signature.

Length = 356

Score = 29.4 bits (66), Expect = 0.023
Identities = 26/109 (23%), Positives = 42/109 (38%), Gaps = 24/109 (22%)

Query: 215 VITAENGIVFRENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRN 272
++ E I RE RG GP +L + ++ + + + L T + N Q
Sbjct: 58 LLNLEEAITLRE------RGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLK 107

Query: 273 AHPNQSLKNTLAVHL------------PKRLVERLQQLGQIPDVSLKQL 309
A N LK L ++L P R++ QQL + +V L
Sbjct: 108 ALQNARLKAPLDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156


98  APECO1_2790  APECO1_2779  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_2790  -1      12       0.952431   ribonucleoside transporter
APECO1_2789  -1      12       1.586586   hypothetical protein
APECO1_2788  -1      13       2.023878   xanthine/uracil permase YicO
APECO1_2787  -1      15       3.115172   cryptic adenine deaminase
APECO1_2786  0       17       3.552604   sugar phosphate antiporter
APECO1_2785  1       16       3.855041   regulatory protein UhpC
APECO1_2784  1       16       4.127165   sensory histidine kinase UhpB
APECO1_2783  2       17       3.458934   DNA-binding transcriptional activator UhpA
APECO1_2782  1       15       2.601915   acetolactate synthase 1 regulatory subunit
APECO1_2781  1       15       2.703536   acetolactate synthase catalytic subunit
APECO1_2780  -1      16       0.975997   hypothetical protein
APECO1_2779  -1      17       1.320437   multidrug resistance protein D
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2790  TCRTETA       39     3e-05    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 38.7 bits (90), Expect = 3e-05
Identities = 34/208 (16%), Positives = 72/208 (34%), Gaps = 13/208 (6%)

Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 85
+ ++ + + L+ P + +DL S V G + + A + + + RR
Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73

Query: 86 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 145
V+++ + +++ A +L IGR G+ G A++ + + +
Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132

Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAAMG----VLCIFWIIKSLPSLPGE 201
+ +V LG +G F AAAA+ + F + +S
Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191

Query: 202 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 229
+ N + M + A+ F
Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2787  UREASE        38     1e-04    Urea amidohydrolase (urease) protein signature.
		>UREASE#Urea amidohydrolase (urease) protein signature.

Length = 570

Score = 38.2 bits (89), Expect = 1e-04
Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%)

Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71
V+R D +I N ILD + G + I +K IA +G A D P
Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117

Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116
+ I G G +D+H+H + P E A L GLT ++
Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2786  TCRTETB       34     0.001    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 34.1 bits (78), Expect = 0.001
Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%)

Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108
N++ D+ + + + F +T+ +G + +D K+ L F +I++ C
Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90

Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164
+G SL +M + F Q G + + + ++ P+ RG G
Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144

Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212
+G + A+Y+ + + + P + I+ L
Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2785  TCRTETB       40     1e-05    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 39.9 bits (93), Expect = 1e-05
Identities = 64/408 (15%), Positives = 135/408 (33%), Gaps = 60/408 (14%)

Query: 30 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 87
RH + IWL F+ N ++P+I + + + T F +T+ + V G
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 88 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 144
+SD+ + + G+I +++ S F L ++ F QG G+ P ++
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127

Query: 145 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAASALHYGWRAGMMIAGCMAIVVGIFLC 203
A Y + RG + L + +G + P + A + W ++ M ++ +
Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184

Query: 204 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 263
+ L +I G L I+ + Y VL
Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232

Query: 264 YVV-----RAAINDWGN-----------LYMSEMLGVDLVTANTAVTMFELGGFIGALVA 307
+++ R + + + + + V ++ + + A
Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292

Query: 308 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 366
GS +F G + + GIL+ G L+++ + + F T F + +
Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351

Query: 367 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 396
G++ + ++ AGA + ++L
Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2784  PF06580       40     2e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 39.8 bits (93), Expect = 2e-05
Identities = 28/142 (19%), Positives = 57/142 (40%), Gaps = 11/142 (7%)

Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424
LR ++L + + ++L L++ + + +V + Q + N
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266

Query: 425 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPDSGQ-HGFGLTGMRERVTALG 478
+KH + L+G + + + L +E+ GS ++ + G GL +RER+ L
Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326

Query: 479 G---TLTISCLHG-TRVSVSLP 496
G + +S G V +P
Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2783  HTHFIS        61     2e-13    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 61.0 bits (148), Expect = 2e-13
Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%)

Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGCGVQVCICDISMPDIS 61
T+ + DD +R+ Q L V + + + + D+ MPD +
Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61

Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118
+LL ++ K + +++S ++ +A GA +L K ELI +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117

Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169
A+ R L + + + + + A L + T+
Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2779  TCRTETB       59     1e-11    Tetracycline resistance protein TetB signature.
		>TCRTETB#Tetracycline resistance protein TetB signature.

Length = 458

Score = 59.1 bits (143), Expect = 1e-11
Identities = 41/184 (22%), Positives = 80/184 (43%), Gaps = 1/184 (0%)

Query: 7 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 66
R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG
Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70

Query: 67 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLMVLIAASAMQGMGTGVGGVMARTLPRD 125
+SD++G + ++L G+ I +++ V S +LI A +QG G + +
Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130

Query: 126 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCAGVTFSMARWM 185
+ A L+ + + + P IGG++ +W L ++ V F M
Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190

Query: 186 PETR 189
E R
Sbjct: 191 KEVR 194


99  APECO1_2472  APECO1_2458  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_2472  1       16       3.545310   sensor protein ZraS
APECO1_2471  1       18       3.559247   transcriptional regulatory protein ZraR
APECO1_2470  1       17       2.243247   phosphoribosylamine--glycine ligase
APECO1_2469  0       13       0.258836   bifunctional
APECO1_2468  1       14       -0.082239  hypothetical protein
APECO1_2465  -1      12       0.438010   *hypothetical protein
APECO1_2464  -2      13       1.076835   hypothetical protein
APECO1_2463  -1      16       2.165227   homoserine O-succinyltransferase
APECO1_2462  -1      16       1.846679   malate synthase
APECO1_2461  0       16       1.364728   isocitrate lyase
APECO1_2460  0       14       1.418489   bifunctional isocitrate dehydrogenase
APECO1_2459  0       14       1.653585   IclR family transcriptional regulator
APECO1_2458  -1      14       1.549121   B12-dependent methionine synthase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2472  PF06580       37     1e-04    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 36.8 bits (85), Expect = 1e-04
Identities = 49/262 (18%), Positives = 105/262 (40%), Gaps = 43/262 (16%)

Query: 197 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 253
I+F + V S+L F W + + + ++ Q +M + L L A + H + N L
Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179

Query: 254 SSIKGLAKYFAERAPAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVEL 310
++I+ L +A L+++M + ++ +++ L +V ++L L +++
Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236

Query: 311 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIGQHG 369
+ Q+ + ++Q+ P L Q L+ N + I + Q G
Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279

Query: 370 VISVTASESGAGVKISVTDSGKGIAADQLEAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 429
I + ++ V + V ++G + E TG GL V ++ G
Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327

Query: 430 ---TIQVASQEGKGATFTLWLP 448
I+++ ++GK + +P
Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2471  HTHFIS        526    0.0      FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 526 bits (1356), Expect = 0.0
Identities = 184/468 (39%), Positives = 253/468 (54%), Gaps = 35/468 (7%)

Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67
ILV DDD + T+L L GY+V + ++ + DLV+ DV M + +
Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65

Query: 68 LKEIKTLNPAIPVLIMTAYSSIETAVEALKTGALDYLIKPLDFDNLQSTLEKALAHTHSV 127
L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA
Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125

Query: 128 DAETPAVSASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187
++ S +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H
Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185

Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247
R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+
Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245

Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307
Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV +
Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305

Query: 308 EVPSLRQRREDIPLLAGHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367
+P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R
Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364

Query: 368 AVVLLTGEYISERELPLAIASTPIPLVQSQDIQP-------------------------- 401
L + I+ + + S +
Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424

Query: 402 --------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441
L E+E +ILAAL T GN+ +AA LG+ R TL K+
Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2465  SHAPEPROTEIN  32     6e-04    Bacterial cell shape determinant MreB/Mbl protein s...
		>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature.

Length = 347

Score = 31.7 bits (72), Expect = 6e-04
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 9/62 (14%)

Query: 37 IANFFVAEKVLQDLVLQLHPRSTWHSFLPAKRMDIVVSALEMNEGGLSQVEERILHEVVA 96
IA+FFV EK+LQ + Q+H S P+ R+ + V G +QVE R + E
Sbjct: 81 IADFFVTEKMLQHFIKQVHSNSF---MRPSPRVLVCVPV------GATQVERRAIRESAQ 131

Query: 97 GA 98
GA
Sbjct: 132 GA 133


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2464  SACTRNSFRASE  34     1e-04    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 33.8 bits (77), Expect = 1e-04
Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 5/54 (9%)

Query: 78 IDPDVRGCGVGRMLVKHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126
+ D R GVG L+ A+ A E L + N A FY K F +
Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2461  BINARYTOXINB  32     0.004    Binary toxin B family signature.
		>BINARYTOXINB#Binary toxin B family signature.

Length = 764

Score = 32.3 bits (73), Expect = 0.004
Identities = 14/58 (24%), Positives = 23/58 (39%)

Query: 294 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 351
ET+ PD+ L A P L Y + N D +T + + QL+++
Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2458  BCTERIALGSPD  32     0.019    Bacterial general secretion pathway protein D signa...
		>BCTERIALGSPD#Bacterial general secretion pathway protein D signature.

Length = 660

Score = 31.8 bits (72), Expect = 0.019
Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 17/87 (19%)

Query: 348 SGLEPLNIGEDSLFVNVGERTN---VTGSA----KFKRLIKEEKYSEALDVARQQVENGA 400
+P+ + ++ + +TN VT + +R+I + LD+ R QV A
Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQ------LDIRRPQVLVEA 351

Query: 401 QIIDINMDEGMLDAEAAMVRFLNLIAG 427
I ++ D L+ +++ N AG
Sbjct: 352 IIAEVQ-DADGLNLG---IQWANKNAG 374


100  APECO1_2345  APECO1_2338  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_2345  -2      14       0.760638   phosphonate/organophosphate ester transporter
APECO1_2344  -2      15       0.337243   hypothetical protein
APECO1_2343  -2      15       0.559935   hypothetical protein
APECO1_2342  -2      15       0.165951   hypothetical protein
APECO1_2341  -2      14       0.804204   hypothetical protein
APECO1_2340  -1      13       -0.886043  proline/glycine betaine transporter
APECO1_2339  -1      18       -0.202906  sensor protein BasS/PmrB
APECO1_2338  -1      17       -0.700056  DNA-binding transcriptional regulator BasR
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2345  PF05272       29     0.017    Virulence-associated E family protein
		>PF05272#Virulence-associated E family protein

Length = 892

Score = 29.3 bits (65), Expect = 0.017
Identities = 12/22 (54%), Positives = 13/22 (59%)

Query: 32 MVALLGPSGSGKSTLLRHLSGL 53
V L G G GKSTL+ L GL
Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2340  TCRTETA       44     1e-06    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 43.7 bits (103), Expect = 1e-06
Identities = 57/290 (19%), Positives = 105/290 (36%), Gaps = 55/290 (18%)

Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGE 144
G L D++GR+ +L +++ ++ + P +W +L I ++ G + G
Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112

Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLDWGW 200
A ++A+ + +R GFM + FG +AG VLG G++ S
Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159

Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKHWRS 260
PFF A L + L K E+ P SF+ W
Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFR------WAR 207

Query: 261 LLTCIGLVIATNVTYYML----LTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVM 315
+T + ++A ++ + H+ G+ + ++ L +
Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT 267

Query: 316 GLLSDRFGRRPFVLLG----SVALFVLA--------IPAFILINSNVIGL 353
G ++ R G R ++LG +LA P +L+ S IG+
Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317



Score = 39.0 bits (91), Expect = 3e-05
Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 16/164 (9%)

Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFI 344
L H+ + +G+L+ + A+M PV+G LSDRFGRRP +L+ L A+ I
Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAI 89

Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLVAG 401
+ + + +++ G ++A I V + + + R + ++A F +VAG
Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147

Query: 402 LTPTLAAWLVESSQNLMMPAYYLMVVAVVGLITG-VTMKETANR 444
P L + S + P + + + +TG + E+
Sbjct: 148 --PVLGGLMGGFSPH--APFFAAAALNGLNFLTGCFLLPESHKG 187


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2339  PF06580       37     7e-05    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 37.2 bits (86), Expect = 7e-05
Identities = 39/182 (21%), Positives = 80/182 (43%), Gaps = 34/182 (18%)

Query: 181 ARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDV-ILPSYDELSTML--DQRQQTLL 237
+ +M+ S+S+L++ S N + V L +++ ++ SY +L+++ D+ Q
Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245

Query: 238 LPESAADITVQGDATLLRMLLRNLVENAHRY----SPQGSNIMIKLQEDGGAV-MAVEDE 292
+ + D+ V ML++ LVEN ++ PQG I++K +D G V + VE+
Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299

Query: 293 GPGIDESKCGELSKAFVRMDSRYGGIGLGLSIV-SRITQLHHGQFFLQNRQETSGTRAWI 351
G + + G GL V R+ L+ + ++ ++ A +
Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 352 RL 353
+
Sbjct: 346 LI 347


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2338  HTHFIS        92     1e-23    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 91.8 bits (228), Expect = 1e-23
Identities = 42/121 (34%), Positives = 60/121 (49%)

Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVSTARMAEQSLEAGHYSLVVLDLGLPDEDGLH 61
IL+ +DD + L A GY S A + + AG LVV D+ +PDE+
Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64

Query: 62 FLARIRQKKYTLPVLILTARDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNN 121
L RI++ + LPVL+++A++T I + GA DYL KPF L EL I L
Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124

Query: 122 Q 122
+
Sbjct: 125 R 125


101  APECO1_2328  APECO1_2320  N  Y  N  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_2328  -1      12       -2.981281  DcuR family transcriptional regulator
APECO1_2327  -1      13       -3.396783  sensory histidine kinase DcuS
APECO1_2326  0       21       -3.563203  acyltransferase
APECO1_2325  1       19       -4.432664  hypothetical protein
APECO1_2324  1       18       -3.956013  lysyl-tRNA synthetase
APECO1_2323  1       17       -3.769840  peptide transporter
APECO1_2322  3       21       -3.994989  lysine decarboxylase 1
APECO1_2321  4       19       -2.917992  lysine/cadaverine antiporter
APECO1_2320  3       19       -2.499414  CadC family transcriptional regulator
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2328  HTHFIS        68     2e-15    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 68.3 bits (167), Expect = 2e-15
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 4/109 (3%)

Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63
+L+ DDDA + + + +++ G+ S I + DL++ D+ M EN
Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61

Query: 64 GLDLLPVLHNARCKSDVIVISSAADAVTIKDSLHYGVVDYLIKPFQASR 112
DLLP + AR V+V+S+ +T + G DYL KPF +
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2327  PF06580       41     8e-06    Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 41.0 bits (96), Expect = 8e-06
Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499
L+EN ++ + GG+I + +G + EV + G +
Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310

Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535
G GL V+++++ L G I + + G V IP
Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2326  SACTRNSFRASE  26     0.012    Streptothricin acetyltransferase signature.
		>SACTRNSFRASE#Streptothricin acetyltransferase signature.

Length = 173

Score = 26.4 bits (58), Expect = 0.012
Identities = 9/28 (32%), Positives = 16/28 (57%)

Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59
A+IE V + + +G+G L+ K +E
Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2323  TCRTETA       30     0.028    Tetracycline resistance protein signature.
		>TCRTETA#Tetracycline resistance protein signature.

Length = 399

Score = 29.8 bits (67), Expect = 0.028
Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%)

Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102
H L + YA P+LG +DR G R ++ + + ++ + L
Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99

Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161
Y+ + G+ + + + D D R F + A G +A P+ GL
Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155

Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220
+ H F A + L FL+G + +++ L L + W M
Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211

Query: 221 LAPVFFTLLL 230
+A + +
Sbjct: 212 VAALMAVFFI 221


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2320  SYCDCHAPRONE  36     8e-05    Gram-negative bacterial type III secretion SycD cha...
		>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature.

Length = 168

Score = 36.4 bits (84), Expect = 8e-05
Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 7/97 (7%)

Query: 391 PLDEKQLAALNTEIDNIVTLPELNNLS-----IIYQIKAVSALVKGKTDESYQAINTGID 445
++ A+ + + T+ LN +S +Y + A + GK +++++
Sbjct: 6 TDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSL-AFNQYQSGKYEDAHKVFQALCV 64

Query: 446 LEMSWLNYVL-LGKVYEMKGMNREAADAYLTAFNLRP 481
L+ + L LG + G A +Y +
Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101


102  APECO1_2306  APECO1_2300  N  Y  Y  Pathogenicity Island (unbiased-composition)
LocusTag     DNBias  CDNBias  %GCBias    Product
APECO1_2306  0       18       -3.495968  transposase InsC for insertion element
APECO1_2305  -1      15       -4.868132  cation/multidrug efflux pump protein
APECO1_2304  -1      17       -4.678881  transcriptional repressor
APECO1_2303  -1      14       -3.434066  hypothetical protein
APECO1_2302  -1      15       -2.873599  S-adenosylmethionine synthetase
APECO1_2301  0       18       -3.873068  hypothetical protein
APECO1_2300  -1      21       -0.933164  histidine kinase
ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2306  PF06704       25     0.047    DspF/AvrF protein
		>PF06704#DspF/AvrF protein

Length = 129

Score = 25.2 bits (55), Expect = 0.047
Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 6/85 (7%)

Query: 28 RTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAA-GEQVVPAS 86
+ + + +S + SL A Q+GV A L+ Q E ++ + E V+
Sbjct: 2 NNSPTDFSRLIKSLGAQLGTSLTA-QNGVCA----LYDSQDNEAAVIEMPDHSEMVIFHC 56

Query: 87 ELAAAMKQIKELQRLLNKTPDVSRL 111
+ + + +LQ+LL+ DV+R+
Sbjct: 57 RVGRSPDRAADLQKLLSLNFDVARM 81


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2305  ACRIFLAVINRP  411    e-137    Acriflavin resistance protein family signature.
		>ACRIFLAVINRP#Acriflavin resistance protein family signature.

Length = 1034

Score = 411 bits (1057), Expect = e-137
Identities = 202/339 (59%), Positives = 270/339 (79%), Gaps = 1/339 (0%)

Query: 1 MIQARNQLLAEAAKSPA-LNMVRPNGMNDEPQFQILIDDEKVQAFKLSMSDVDNIMSAAW 59
+ QARNQLL AA+ PA L VRPNG+ D QF++ +D EK QA +S+SD++ +S A
Sbjct: 694 LTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTAL 753

Query: 60 GSMYVNDFNDRGRVKKVYIQGEPGSRISPQDFDKWYVRNSDGDMVSFASFATGKWIYGSP 119
G YVNDF DRGRVKK+Y+Q + R+ P+D DK YVR+++G+MV F++F T W+YGSP
Sbjct: 754 GGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSP 813

Query: 120 KLEQYNGISAVEILGEPAPGYSSGDAMKAIEDIAARLPEGFHISWTGLSFEERLSGSQAP 179
+LE+YNG+ ++EI GE APG SSGDAM +E++A++LP G WTG+S++ERLSG+QAP
Sbjct: 814 RLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAP 873

Query: 180 ALYALSLLIVFLCLAALYESWSIPFSVMLVVPLGVLGAVCATLLRGLGNDVFFQVGLLTT 239
AL A+S ++VFLCLAALYESWSIP SVMLVVPLG++G + A L NDV+F VGLLTT
Sbjct: 874 ALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTT 933

Query: 240 IGLSAKNAILIVEFARELHEKEGLSIKEAAVEAARVRLRPIIMTSLAFVMGVIPLAVSTG 299
IGLSAKNAILIVEFA++L EKEG + EA + A R+RLRPI+MTSLAF++GV+PLA+S G
Sbjct: 934 IGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNG 993

Query: 300 ASSGSKHAIGTGVVGGMITATILAIFYIPLFYMLIAGFF 338
A SG+++A+G GV+GGM++AT+LAIF++P+F+++I F
Sbjct: 994 AGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032



Score = 77.2 bits (190), Expect = 1e-17
Identities = 64/322 (19%), Positives = 125/322 (38%), Gaps = 21/322 (6%)

Query: 29 EPQFQILIDDEKVQAFKLSMSDVDNIMSAA----WGSMYVNDFNDRGRVKKVYIQGEPGS 84
+ +I +D + + +KL+ DV N + G+ I +
Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQ-TR 239

Query: 85 RISPQDFDKWYVR-NSDGDMVSFASFAT---GKWIYGSPKLEQYNGISAVEILGEPAPGY 140
+P++F K +R NSDG +V A G Y + + NG A + + A G
Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNV--IARINGKPAAGLGIKLATGA 297

Query: 141 SSGDAMKAI----EDIAARLPEGFHISW---TGLSFEERLSGSQAPALYALSLLIVFLCL 193
++ D KAI ++ P+G + + T + + A+ ++VFL +
Sbjct: 298 NALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI--MLVFLVM 355

Query: 194 AALYESWSIPFSVMLVVPLGVLGAVCATLLRGLGNDVFFQVGLLTTIGLSAKNAILIVEF 253
++ + VP+ +LG G + G++ IGL +AI++VE
Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415

Query: 254 ARELHEKEGLSIKEAAVEAARVRLRPIIMTSLAFVMGVIPLAVSTGASSGSKHAIGTGVV 313
+ ++ L KEA ++ ++ ++ IP+A G++ +V
Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475

Query: 314 GGMITATILAIFYIP-LFYMLI 334
M + ++A+ P L L+
Sbjct: 476 SAMALSVLVALILTPALCATLL 497


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2304  HTHTETR       70     4e-18    TetR bacterial regulatory protein HTH signature.
		>HTHTETR#TetR bacterial regulatory protein HTH signature.

Length = 215

Score = 70.0 bits (171), Expect = 4e-18
Identities = 33/63 (52%), Positives = 41/63 (65%)

Query: 10 RHTKFAAEETRKQILDVAEFCFCETGFSKTTLEMIAARAGCTRGAIYWYFNEKKDLLRQV 69
R TK A+ETR+ ILDVA F + G S T+L IA AG TRGAIYW+F +K DL ++
Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62

Query: 70 IER 72
E
Sbjct: 63 WEL 65


ORFs having significant similarity with Known Virulence factors
LocusTag     Hits          Score  E-value  Comments
APECO1_2301  HTHFIS        59     6e-12    FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 59.5 bits (144), Expect = 6e-12
Identities = 29/139 (20%), Positives = 53/139 (38%), Gaps = 3/139 (2%)

Query: 3 TIVIVEDEPIELESLRQIISQCVENAAIHEASTGKKAIHLIDQLSQIDMILVDINIPLPN 62
TI++ +D+ L Q +S+ + S I D+++ D+ +P N
Sbjct: 5 TILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPDEN 61

Query: 63 GKQVIEYLKKKNSDTKIIVITANDDFDIVRSMYNLKVDDYLLKPVKKCILTDTIKKTLAF 122
++ +KK D ++V++A + F DYL KP L I + LA
Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121

Query: 123 DEGENEKSRALKQKVFAMI 141
+ K Q ++
Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
APECO1_2300 | PF06580 | 203 | 4e-64 | Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 203 bits (518), Expect = 4e-64
Identities = 56/214 (26%), Positives = 101/214 (47%), Gaps = 14/214 (6%)

Query: 203 RKRVEIERSLHEAEFKALSYQINPHFLFNVLNTIGRLAFLEDAQRTETMVHDFSDMMRYL 262
+ ++ EA+ AL QINPHF+FN LN I L LED + M+ S++MRY
Sbjct: 149 IDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALI-LEDPTKAREMLTSLSELMRYS 207

Query: 263 LRKNSHGLITLRNEINYVNNYMSIQKVRMRDRFDYLCDIPEKYLDVVCPFLILQPLVENF 322
LR ++ ++L +E+ V++Y+ + ++ DR + I +DV P +++Q LVEN
Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENG 267

Query: 323 FNYVVEPRDSNSHLLIRATDDGLNVIIEVTDNGDGIAPDTINRILSGDQKLQKGSIGINN 382
+ + +L++ T D V +EV + G +T + G+ N
Sbjct: 268 IKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK----------ESTGTGLQN 317

Query: 383 IKNRLKLLFGESYGLEIMSPNKPRMGTTIKLRFP 416
++ RL++L+G +++ + P
Sbjct: 318 VRERLQMLYGTEAQIKLSEKQG---KVNAMVLIP 348


103 | APECO1_1984 | APECO1_1978 | N | Y | N | Pathogenicity Island (unbiased-composition)
LocusTag | DNBias | CDNBias | %GCBias | Product
APECO1_1984 | -1 | 15 | 0.792693 | phosphoglycerate mutase
APECO1_1983 | -1 | 13 | 0.210886 | right origin-binding protein
APECO1_1982 | | | | hypothetical protein
APECO1_1981 | | | | DNA-binding response regulator CreB
APECO1_1980 | | | | sensory histidine kinase CreC
APECO1_1979 | | | | hypothetical protein
APECO1_1978 | | | | two-component response regulator
ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
APECO1_1984 | VACCYTOTOXIN | 29 | 0.017 | Helicobacter pylori vacuolating cytotoxin signature.
		>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature.

Length = 1291

Score = 28.8 bits (64), Expect = 0.017
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%)

Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189
P +V GIA G V T+ GL W ++ N D + +W
Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
APECO1_1981 | HTHFIS | 90 | 9e-23 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 89.5 bits (222), Expect = 9e-23
Identities = 34/139 (24%), Positives = 61/139 (43%)

Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFDVEVFERGLPVLDKARQQVPDVMILDVGLPDI 60
M T+ + +D+ I L L + G+DV + + D+++ DV +PD
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120
+ F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 121 RVKKFSTPSPVIRIGHFEL 139
K+ + L
Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
APECO1_1980 | PF06580 | 32 | 0.006 | Sensor histidine kinase
		>PF06580#Sensor histidine kinase

Length = 349

Score = 31.8 bits (72), Expect = 0.006
Identities = 41/182 (22%), Positives = 72/182 (39%), Gaps = 40/182 (21%)

Query: 312 LRQARLENRQEVVLTAVDVAALFR---RVSEARTVQLAE--KNITLHVM--------PTE 358
+R LE+ + ++ L R R S AR V LA+ + ++ +
Sbjct: 182 IRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ 241

Query: 359 VNVASEPALLEQALGNLL-----DNA----IDFTPESGCITLSAEVDQEYVTLKVLDTGS 409
PA+++ + +L +N I P+ G I L D VTL+V +TGS
Sbjct: 242 FENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301

Query: 410 GIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSE-VARLFNGEVTLR-NVQEGGVLASL 467
N ++S+G GL V E + L+ E ++ + ++G V A +
Sbjct: 302 LALK----------------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345

Query: 468 RL 469
+
Sbjct: 346 LI 347


ORFs having significant similarity with Known Virulence factors
LocusTag | Hits | Score | E-value | Comments
APECO1_1978 | HTHFIS | 82 | 4e-20 | FIS bacterial regulatory protein HTH signature.
		>HTHFIS#FIS bacterial regulatory protein HTH signature.

Length = 484

Score = 81.8 bits (202), Expect = 4e-20
Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%)

Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60
M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P +
Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60

Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119
N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+
Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120

Query: 120 RT 121

Sbjct: 121 EP 122
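
The hits reported for island 103 can be compared against the E-value (RPSBlast) input parameter of 0.05 listed under section A. The sketch below is illustrative Python only: `passing_hits` is a hypothetical helper, the hit tuples are copied from the tables above, and treating the threshold as inclusive (`<=`) is an assumption, since the page does not say how PredictBias applies it.

```python
# "E-value (RPSBlast)" input parameter shown in section A of this page.
E_VALUE_THRESHOLD = 0.05

# (locus tag, profile hit, E-value) for island 103, copied from the hit tables.
HITS = [
    ("APECO1_1984", "VACCYTOTOXIN", 0.017),
    ("APECO1_1981", "HTHFIS", 9e-23),
    ("APECO1_1980", "PF06580", 0.006),
    ("APECO1_1978", "HTHFIS", 4e-20),
]


def passing_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Keep hits at or below the threshold, most significant first.

    The inclusive comparison is an assumption, not documented tool behaviour.
    """
    kept = [hit for hit in hits if hit[2] <= threshold]
    return sorted(kept, key=lambda hit: hit[2])


if __name__ == "__main__":
    for locus, profile, e_value in passing_hits(HITS):
        print(f"{locus}\t{profile}\t{e_value:g}")
```

All four hits pass at 0.05; tightening the threshold to 0.01 would drop the VACCYTOTOXIN match for APECO1_1984, the weakest hit in this island.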



 
Contact Sachin Pundhir for Bugs/Comments.