[Bioclusters] mpiBLAST Performance
Lucas Carey
bioclusters@bioinformatics.org
Sun, 29 Jun 2003 23:45:43 -0400
Has anyone looked into why there is such a large speedup when shuffling the database? Does this hold for the query as well? Are you just randomizing the db sequence entries?
-Lucas
On Wed, May 14, 2003 at 11:50:31AM -0400, Joe Landman wrote:
> On Wed, 2003-05-14 at 11:43, Jason D. Gans wrote:
>
> > Also, while not a factor when blasting against the nr database, shuffling the
> > nt database yields a substantial speed increase in blast searches (I have obtained
> > a 28% decrease in wall clock time for certain nucleotide queries).
>
> I noted in 1999 and 2000 while working on GenomeCluster that a query
> sequence "sort" or shuffle sometimes helped. I didn't do that on the db
> side due to the time costs of the operation. Maybe worth a re-look.
>
>
> --
> Joseph Landman, Ph.D
> Scalable Informatics LLC,
> email: landman@scalableinformatics.com
> web : http://scalableinformatics.com
> phone: +1 734 612 4615
>