[BiO BB] protein sequence for all organism

Martin Gollery marty.gollery at gmail.com
Mon Nov 10 17:30:02 EST 2008


Yvan,

I believe this would give you a very large number of folders.

You may have better luck with Uniprot, which will allow you to
download data from taxonomic groups instead of individual species,
reducing the number of folders to a half million or so.

Check out the uniprot download options at:

http://www.uniprot.org/taxonomy/


Cheers,
Marty

On Mon, Nov 10, 2008 at 5:18 AM, Yvan Strahm <yvan.strahm at gmail.com> wrote:
> Hello All,
>
> I want to get the all the possible protein sequence for eukaryote.
> I already downloaded nr from the NCBI ftp site, but i wish to have them
> sorted by organism, one folder per species. Currently I am downloading data
> from ftp://ftp.ncbi.nih.gov/genomes/X/protein/protein.fa.gz . Does this
> resource represent all the proteins sequences which are available in
> genbank? Or do you know a better way of getting a comprehensive data set?
>
> Thanks for your help and time,
> Cheers,
> yvan
> _______________________________________________
> BBB mailing list
> BBB at bioinformatics.org
> http://www.bioinformatics.org/mailman/listinfo/bbb
>



-- 
-- 
Martin Gollery
Senior Bioinformatics Scientist
TimeLogic- a Division of Active Motif
North America Toll Free (877) 222-9543 ext. 6
Direct (760) 431-1263 ext. 6




More information about the BBB mailing list