Hi all,

(Question originally posted to the "Ask the Open Lab" forum. I'm reposting
it here in the hope that something will turn up.)

Does anybody know of a way to get the coding sequence for a protein
sequence taken from PDB? As you may well know, PDB sequences are usually
partial, fragmented versions of the actual protein sequence, sometimes with
mutations or inserts (OK, so I cannot get the coding sequence for mutations
or inserts, for obvious reasons). Collating this data by automatic means
seems rather cumbersome.

Any help would be appreciated, including pointers to an appropriate
database, a script repository, or any kind of forum with people who might
be able to answer this.


