[Bioclusters] Web BLAST

Arshad Khan bioclusters@bioinformatics.org
Wed, 20 Nov 2002 15:41:27 -0000


We do that every day. I havent checked lately the blast version but it ought
to be 2.2x. which ought to suffice your needs. Databases at NCBI are the
same ones that they use themselves and its not 5 years old. Infact its
updated on daily basis as new data becomes available. What we do is that we
download all finished bacterial genome data from genbank and unfinished
bacterial genomes database from different sequencing centers like TIGR. Give
all the genomes our own UIDs (Unique ID), run formatdb etc. It all works
quite well. Except sometimes you will find that the new wwwblast (esp.
blastx) will throw up problems when you paste short sequences (< 50). At
genbank (NCBI) you will find flat files in fasta format already. Unless you
want to give your own UIDs to the files like we do you dont need to do
anything to those files except run formatdb.
check out http://vge.ac.uk to see how it all works for us.

Arshad
Bacterial Pathogenesis & Genomics Unit
Department of Immunity & Infection
Medical School
University of Birmingham
tel: 0121 414 2859
----- Original Message -----
From: "Jeremy Mann" <jeremy@bioc09.v19.uthscsa.edu>
To: <bioclusters@bioinformatics.org>
Sent: Wednesday, November 20, 2002 3:13 PM
Subject: Re: [Bioclusters] Web BLAST


>
> > Hi Jeremy,
> > I have been running a wwwblast server for more than a year now. Here is
> > what you need to add in blast.rc.
> > blastn     dna files
> > blastp     protein files
> > blastx     protein files
> > tblastn    dna files
> > tblastx    dna files
> >
> > The rest is quite simple but let us know if you have any more problems?
>
> Just a couple more questions..... Another researcher looked over the
> installation and did a couple BLAST runs. According to him, the blast that
> comes with wwwblast is pretty old and the databases are formatted to what
> NCBI used 5 years ago. Does NCBI use the same databases as what they
> release to us?
>
> If the database continues to be a problem, we also have the complete GCG
> database available. Since BLAST wants its database in its own format, how
> hard would it be to take the flat files from Genbank, format to FASTA,
> then format to NCBI (using formatdb)? Or is this even possible?
>
>
>
> --
> Jeremy Mann
> jeremy@biochem.uthscsa.edu
>
> University of Texas Health Science Center
> Bioinformatics Core Facility
> Phone: (210) 567-2672
>
>
>
> _______________________________________________
> Bioclusters maillist  -  Bioclusters@bioinformatics.org
> https://bioinformatics.org/mailman/listinfo/bioclusters
>
>