[Biodevelopers] Mapping between various protein indices
Dan Bolser
dmb at mrc-dunn.cam.ac.uk
Tue May 25 09:26:23 EDT 2004
On Tue, 25 May 2004, Joke Reumers wrote:
>Hello,
>
>Maybe a bit out of scope on this mailing list, but I'm trying to find a
>complete mapping between NCBI RefSeq (of Genbank gi) indices and
>SwissProt/TrEMBL accessions. The best I could find so far is the IPI index,
>but this seems far from complete. Anyone any better ideas?
Hi, I had this exact same problem recently! The 'best' two solutions I
finally found were;
1) Use SRS to map from RefSeq -> uniPark -> SwissProt
(can't find where to get unipark from directly)
2) Use SeqHound to get SwissProt from GI.
I was shown the SRS method, e.g.
http://srs.ebi.ac.uk/srsbin/cgi-bin/wgetz?[UNIPARC-refdbi:NP_001102]%3ESWISSPROT
And I got the following instructions on the use of SeqHound...
If you want a list of GI's that have *exactly* the same sequence as your
input protein GI, then you could use
'SHoundRedundantGroup'
http://www.blueprint.org/seqhound/api_help/apifunctsdet.html#SHoundRedundantGroup
The Perl version of this takes a GI as input and then returns a list of
GI's. You could use SHoundSeqIDFromGi (or SHoundSeqIDFromGiList) to find
the Swiss-Prot sequence.
Using either of these should get you a full mapping. Let me know how you
get on,
Dan.
>
>Thanks in advance,
>
>Joke Reumers
>
>
>_______________________________________________
>Biodevelopers mailing list
>Biodevelopers at bioinformatics.org
>https://bioinformatics.org/mailman/listinfo/biodevelopers
>
More information about the Biodevelopers
mailing list