[Biodevelopers] Batch download of RefSeq or dbSNP?
    Christopher Dwan 
    cdwan at bioteam.net
       
    Wed Jul  5 14:25:25 EDT 2006
    
    
  
I'm writing some scripts to download data.  Specifically, I need  
FASTA versions of:
* All the "finished" mouse proteins in refseq
* All the "finished" human proteins in refseq
* All the sequences in dbSNP
Ideally, my script would produce updated versions of these datasets  
nightly or so.  I would prefer to do this without spamming the NCBI  
servers (or my bandwidth providers) too much.
I've messed around with the bioperl Bio::DB routines enough to get  
really confused by ENTREZ queries.  I've also looked at the FASTA  
source available through FTP from NCBI, and that confused me more.
How do smart people do this sort of thing these days?
-Chris Dwan
    
    
More information about the Biodevelopers
mailing list