[Bioclusters] Blast Source

Tim Harsch bioclusters@bioinformatics.org
Tue, 15 Apr 2003 16:58:20 -0700


Wow!  Thank you all so much for the quick and informative replies.  They give
my colleague and me plenty to dig into.

...  I've always liked this list; the people here are knowledgeable and eager
to help.

Thanks again.
Ciao,

----- Original Message -----
From: "Chris Dwan (CCGB)" <cdwan@mail.ahc.umn.edu>
To: <bioclusters@bioinformatics.org>
Sent: Tuesday, April 15, 2003 4:47 PM
Subject: Re: [Bioclusters] Blast Source


>
> Depending on the exact goals of your analysis, BLAST can be a poor choice
> for finding matches with very low sequence identity.  It was designed as,
> and remains, an excellent, fast approximation to exhaustive pairwise
> search (a la Smith & Waterman).
>
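
For reference, the exhaustive search that BLAST approximates can be sketched
in a few lines.  This is a minimal, score-only Smith-Waterman under assumed
toy scores (match +2, mismatch -1, linear gap -2); those values and the
function name are illustrative choices, not anything taken from BLAST itself.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between a and b (linear gap penalty).

    The scoring values are illustrative assumptions, not BLAST defaults.
    """
    prev = [0] * (len(b) + 1)  # previous row of the dynamic-programming matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("GATTACA", "GATCACA"))
```

Every cell of the matrix is filled, so the cost grows with the product of the
two sequence lengths.  That is the work BLAST's seeding heuristic avoids.
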
> If you managed to turn the word size all the way down to 1 (or 2), you
> would have complete sensitivity.  Effectively, this would disable the
> heuristic by which BLAST achieves its speedup.  Another parameter to play
> with is "neighborhood word size."  This parameter defines how far (in
> alignment score) a neighboring word may be from the query word and still
> be accepted, like a "perfect match," in the hit generation phase.
>
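
The effect of the two knobs described above can be illustrated with a toy
seed finder.  This is only a sketch under stated assumptions: the alphabet,
the +2/-1 word scores, and the names `find_seeds` and `neighborhood` are
hypothetical, and real BLAST applies neighborhood words to protein searches
with a substitution matrix rather than to this simplified nucleotide case.

```python
from collections import defaultdict
from itertools import product

ALPHABET = "ACGT"  # assumed toy alphabet


def word_score(a, b, match=2, mismatch=-1):
    """Ungapped score of two equal-length words (assumed toy scores)."""
    return sum(match if x == y else mismatch for x, y in zip(a, b))


def neighborhood(word, threshold):
    """All words of the same length scoring at least threshold against word."""
    candidates = ("".join(c) for c in product(ALPHABET, repeat=len(word)))
    return [w for w in candidates if word_score(word, w) >= threshold]


def find_seeds(query, subject, word_size, threshold=None):
    """Return sorted (query_pos, subject_pos) seed hits.

    With threshold=None only exact word matches count as hits.
    """
    if threshold is None:
        threshold = 2 * word_size  # score of a perfect match under +2/-1
    index = defaultdict(list)
    for i in range(len(query) - word_size + 1):
        for w in neighborhood(query[i:i + word_size], threshold):
            index[w].append(i)
    hits = []
    for j in range(len(subject) - word_size + 1):
        for i in index.get(subject[j:j + word_size], ()):
            hits.append((i, j))
    return sorted(hits)


if __name__ == "__main__":
    q, s = "GATTACA", "GATCACA"
    for k in (4, 3, 2):
        print(k, find_seeds(q, s, k))
    print("4 with neighbors:", find_seeds(q, s, 4, threshold=5))
```

On this pair, a word size of 4 finds no exact seeds at all, while shorter
words or a lower neighborhood threshold recover hits.  The price is a larger
index and many more candidate hits to extend.
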
> Really though, if you're searching for interestingly distant pairs,
> another methodology might be in order.  If the sequences in question share
> only 25% identity, it's unlikely that you're going to find them above the
> noise, even if you manage to turn the "sensitivity" knob on BLAST all the
> way up.
>
> Statistical methods like motifs, PSSMs, PSI-BLAST, and HMMER have all been
> used to greatly increase the sensitivity of such searches over pairwise
> techniques.  Beyond these are structurally based methods, which are
> popping up all over the place these days, as we finally have enough good
> structure data to construct meaningful patterns.
>
> Good luck.
>
> -Chris Dwan
>  Center for Computational Genomics and Bioinformatics
>  University of Minnesota
>
> > A colleague of mine is trying to use BLAST to find very loose matches.
> > He would like to change the minimum seed length, which is hard-coded
> > into BLAST, from 7 to 5.  Does anyone know a backdoor way, i.e., an
> > undocumented parameter, to do this?  Or where we might find the BLAST
> > source code so we can make this change manually?