[Bioclusters] NCBI database download and format code
Joseph Landman
bioclusters@bioinformatics.org
02 May 2003 02:05:46 -0400
On Thu, 2003-05-01 at 18:29, Jeremy Mann wrote:
> I am curious if anyone knows of any commercial or open source solution to
> breaking up the NCBI dbs into various sizes. Here, our present solution is
You can use formatdb's "-v N" option to have the database split
automatically into volumes of roughly N x 10**6 letters each (so
"-v 500" gives volumes of about 500 x 10**6 letters). I would
recommend this route for the database formatting side. Keep the
original, unsplit db around for the other tools.
I am working on a fast segmenter. Should be done soon.
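
To make the idea of size-based splitting concrete, here is a minimal
Python sketch that groups FASTA records into volumes capped by letter
count. It is an illustration only: it is not how formatdb splits
volumes internally, and it is not the segmenter mentioned above. The
function names (read_fasta, split_into_volumes) and the choice to
never split a single record across volumes are assumptions made for
this example.

```python
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text handle."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line, []
        elif line and header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def split_into_volumes(records, max_letters):
    """Group records into volumes of at most max_letters letters each.

    Assumption for this sketch: records are never split, so a record
    longer than max_letters ends up alone in its own oversized volume.
    """
    volume, letters = [], 0
    for header, seq in records:
        # Start a new volume if adding this record would exceed the cap.
        if volume and letters + len(seq) > max_letters:
            yield volume
            volume, letters = [], 0
        volume.append((header, seq))
        letters += len(seq)
    if volume:
        yield volume


if __name__ == "__main__":
    # Tiny in-memory example; a real database would be read from a file.
    demo = io.StringIO(">a\nACGT\n>b\nAC\nGT\n>c\nACGTACGTAC\n>d\nAA\n")
    for i, vol in enumerate(split_into_volumes(read_fasta(demo), 8)):
        total = sum(len(seq) for _, seq in vol)
        print(i, [h for h, _ in vol], total)
```

For a real database, max_letters would be N * 10**6 to mirror the
"-v N" units, and each volume would be written to its own file before
formatting rather than held in memory.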
--
Joseph Landman, Ph.D
Scalable Informatics LLC
email: landman@scalableinformatics.com
web: http://scalableinformatics.com
phone: +1 734 612 4615