[Bioclusters] Blast Source
Tim Harsch
bioclusters@bioinformatics.org
Tue, 15 Apr 2003 16:58:20 -0700
Wow! Thank you all so much for the quick and informative replies. They give
my colleague and me some stuff to dig into.
... I've always liked this list; the people here are knowledgeable and eager
to help.
Thanks again.
Ciao,
----- Original Message -----
From: "Chris Dwan (CCGB)" <cdwan@mail.ahc.umn.edu>
To: <bioclusters@bioinformatics.org>
Sent: Tuesday, April 15, 2003 4:47 PM
Subject: Re: [Bioclusters] Blast Source
>
> Depending on the exact goals of your analysis, BLAST can be a poor choice
> for finding matches with very low sequence identity. It was designed as,
> and remains, an excellent, fast approximation to exhaustive pairwise
> search (a la Smith & Waterman).
>
> If you managed to turn the word size all the way down to 1 (or 2), you
> would have complete sensitivity. Effectively, this would disable the
> heuristic by which BLAST achieves its speedup. Another parameter to play
> with is the "neighborhood word" score threshold. This parameter defines
> how far (in alignment score) a word may be from the query word and still
> be treated as a "perfect match" in the hit generation phase.
>
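To make the seeding idea above concrete, here is a minimal Python sketch. It is not BLAST's actual code: the function names, the toy +2/-1 scoring, and the two short sequences are all made up for illustration. It shows how a smaller word size turns a missed pair into a seed hit, and how a lower neighborhood threshold admits more words as "matches".

```python
from itertools import product


def word_hits(query, subject, word_size):
    """Return (query_pos, subject_pos) pairs where an exact word matches."""
    # Index every word of the query by its start position.
    index = {}
    for i in range(len(query) - word_size + 1):
        index.setdefault(query[i:i + word_size], []).append(i)
    # Scan the subject and look each of its words up in the index.
    hits = []
    for j in range(len(subject) - word_size + 1):
        for i in index.get(subject[j:j + word_size], []):
            hits.append((i, j))
    return hits


def toy_score(a, b):
    """Hypothetical scoring: +2 for a match, -1 for a mismatch."""
    return 2 if a == b else -1


def neighborhood(word, alphabet, threshold, score=toy_score):
    """List every word that scores at least `threshold` against `word`."""
    return [
        "".join(cand)
        for cand in product(alphabet, repeat=len(word))
        if sum(score(a, b) for a, b in zip(word, cand)) >= threshold
    ]


# Made-up sequences that share the 5-mer TTACA but no 7-mer.
query = "GATTACAGGC"
subject = "CCTTACATTG"

print(word_hits(query, subject, 7))  # [] : no shared 7-mer, so no seed
print(word_hits(query, subject, 5))  # [(2, 2)] : the shared word TTACA
print(len(neighborhood("ACG", "ACGT", 6)))  # 1 : only the exact word
print(len(neighborhood("ACG", "ACGT", 3)))  # 10 : one mismatch allowed
```

With a word size of 1, every pair of identical letters becomes a hit, which is why the heuristic stops saving any work at that point.
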
> Really though, if you're searching for interestingly distant pairs,
> another methodology might be in order. If the sequences in question share
> only 25% identity, it's unlikely that you're going to find them above the
> noise, even if you manage to turn the "sensitivity" knob on BLAST all the
> way up.
>
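As a rough back-of-the-envelope check on the seed-length question, the sketch below estimates how often an alignment contains at least one exact run of w matching positions. It assumes every aligned position matches independently with the same probability, which real sequences do not do, and it ignores neighborhood words entirely. The function name `prob_seed` and the example numbers (100 aligned positions, 25% identity) are illustrative assumptions only.

```python
def prob_seed(n, w, p):
    """P(at least one run of w consecutive matches in n positions).

    Assumes each position matches independently with probability p.
    """
    # state[k] = probability that no run of length w has been seen yet
    # and the current run of matches has length k (0 <= k < w).
    state = [0.0] * w
    state[0] = 1.0
    for _ in range(n):
        nxt = [0.0] * w
        for k, pr in enumerate(state):
            nxt[0] += pr * (1 - p)      # a mismatch resets the run
            if k + 1 < w:
                nxt[k + 1] += pr * p    # a match extends the run
            # otherwise the run reaches w and that mass counts as a seed
        state = nxt
    return 1.0 - sum(state)


# Compare seed lengths 7 and 5 over 100 aligned positions at 25% identity.
for w in (7, 5):
    print(w, prob_seed(100, w, 0.25))
```

Under this simplified model the shorter seed is found more often, but the same shorter seed is also found more often between unrelated sequences, which is the "noise" problem mentioned above.
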
> Statistical methods like motifs, PSSMs, PSI-BLAST, and HMMER have all been
> used to greatly increase the sensitivity of such searches over pairwise
> techniques. Beyond these are structurally based methods, which are
> popping up all over the place these days, as we finally have enough good
> structure data to construct meaningful patterns.
>
> Good luck.
>
> -Chris Dwan
> Center for Computational Genomics and Bioinformatics
> University of Minnesota
>
> > A colleague of mine is trying to use BLAST to determine very loose
> > matches. He would like to change the minimum seed length, which is
> > hard-coded into BLAST, from 7 to 5. Does anyone know a backdoor way,
> > i.e. an undocumented parameter, which could do this? Or, where we
> > might find the BLAST source code so we can make this change manually?