[BiO BB] command-line (scriptable) ORF finders?
Diego Martinez
sariego9 at yahoo.com
Sun Sep 17 14:06:58 EDT 2006
Hello,
There is also the SEALS package from Koonin's group at NCBI,
we use that alot. it has a bunch of command line tools, I believe it
is all in PERL, so you can gut it and reuse.
http://www.ncbi.nlm.nih.gov/CBBresearch/Walker/SEALS/
if you are looking at ESTs, you may also want to look at estscan,
http://www.ch.embnet.org/software/ESTScan2.html
or there is a genewise like est Gene modeler tool the Wise2
package by Birney and Durbin that you may want to look at.
Diego
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.=$=. .=$=. .=$=. .=$=. .=$=. .=$=.
@ @ | | | @ | | | @ @ | | | @ | | | @ @ | | | @ | | |
| @ @ | | | @ @ | | | @ @ | | | @ @ | | | @ @ | | | @ @ | |
| | @ | | | @ @ | | | @ | | | @ @ | | | @ | | | @ @ |
~' `~$~' `~$~' `~$~' `~$~' `~$~' `~
----- Original Message ----
From: Ann Loraine <aloraine at gmail.com>
To: General Forum at Bioinformatics.Org <bio_bulletin_board at bioinformatics.org>
Sent: Sunday, September 17, 2006 1:19:38 AM
Subject: [BiO BB] command-line (scriptable) ORF finders?
Hello all,
I'm hoping someone on the list who is involved with EST or full-length
cDNA sequencing projects can help me with something (well..two
things):
(1) I am looking for a command-line, scriptable tool that can take as
input an EST, cDNA, or assembled EST contig ("unigene") sequence and
return the most likely or longest open reading frame. This is for a
plant EST project. It should also pay attention to codon usage rules.
(2) I am also looking for a tool that can take as input a set of exon
annotations (or mRNA-to-genome alignments) and return the most likely
CDS start and end for the given gene structure. Tools that can jigger
the alignment/exon boundaries to optimize the ORF *and* which pay
attention to codon usage rules would be extra great. This is for
deducing novel gene structures from cross-species mRNA-to-genome
alignments. Maybe there is a gene-finder that does this?
I've found a variety of web sites that claim to do this, but, as you
know, Web sites don't really cut it when you are working with
thousands of sequences. And also, I would like to see the code in case
I run into problems.
Any thoughts or suggestions (other than pointers to Web tools, please)
would be greatly appreciated!
Sincerely,
Ann Loraine
--
Ann Loraine
Assistant Professor
Section on Statistical Genetics
University of Alabama at Birmingham
http://www.ssg.uab.edu
http://www.transvar.org
_______________________________________________
General Forum at Bioinformatics.Org - BiO_Bulletin_Board at bioinformatics.org
https://bioinformatics.org/mailman/listinfo/bio_bulletin_board
More information about the BBB
mailing list