[BiO BB] protein sequence for all organism

Yvan Strahm yvan.strahm at gmail.com
Tue Nov 18 04:17:41 EST 2008


Thanks every one for the tips. I ended up using the taxonomy files from
genbank and uniprot and sorting them according to the organism.

Cheers,
yvan

On Tue, Nov 11, 2008 at 9:50 PM, Hongyu Zhang <me at hongyu.org> wrote:

> My solution is to download the taxonomy files from Genebank, which contain
> the information of the taxonomy numbers for all GI numbers and the
> hierarchical taxonomy tree structure. You can write a program to partition
> the protein NR file into separated files/folders, each belonging to a
> specific taxonomy number that is a descendant of the eukaryote node in the
> taxonomy tree.
>
> The location of the Genbank taxonomy files is
> ftp://ftp.ncbi.nih.gov/pub/taxonomy/
> _______________________________________________
> BBB mailing list
> BBB at bioinformatics.org
> http://www.bioinformatics.org/mailman/listinfo/bbb
>



More information about the BBB mailing list