> Chris Dwan (CCGB) wrote: > >Great answer, Joe. > >I have observed similarly "weird" behavior in trying to build up a >comprehensive picture of alignments from supposedly common sequence >fragments at varying lengths: > Paracel BLAST (PB) automatically performs the process that you are describing. Depending on the size of the query and database, it segments the query into sections, searches those sections in parallel, and reassembles the results. There are three important differences, though, between the way PB does it and how you might do it with a "wrapper script". First, PB uses the (modified) NCBI source code to perform the sorting and statistics calculations during the merge, so you can be confident in the results. PB is not affected, for example, by the number of digits displayed in an output report. Second, PB detects hits that may span the gaps between sub-queries, and uses (modified) NCBI code to recompute the complete alignments. This way, you avoid repeated and incomplete hits. Third, PB runs the sub-queries in parallel on a cluster system, allowing you to rapidly complete large BLAST analyses. Paracel BLAST offers a complete turn-key solution for high-throughput parallel BLAST searches. If you'd rather spend your time analyzing results instead of writing code and hacking perl scripts, it may be appropriate for you. Marc Rieffel Paracel marc@paracel.com 626-744-2080