[BiO BB] question on RNA and species signatures

Tanney, Austin austin.tanney at almacgroup.com
Thu Jul 26 08:51:42 EDT 2007

Hi Mike,

Have you tried looking at Rfam (http://www.sanger.ac.uk/Software/Rfam/)
miRBase (http://microrna.sanger.ac.uk/sequences/) or the ensembl genome browser (http://www.ensembl.org/index.html)



-----Original Message-----
bio_bulletin_board-bounces+austin.tanney=almacgroup.com at bioinformatics.o
[mailto:bio_bulletin_board-bounces+austin.tanney=almacgroup.com at bioinfor
matics.org]On Behalf Of Mike Marchywka
Sent: 25 July 2007 20:59
To: bio_bulletin_board at bioinformatics.org
Subject: [BiO BB] question on RNA and species signatures

I've been generally trying to find a comprehensive way to analyze non-coding 
with no luck. I've tried asking people in such areas as siRNA, riboswitch 
etc with out much
success. Any comments or discussion?

This came up most recently because I found a short sequence with unusual
species distribution and I was curious to know if this thing has a name.

If I just type in some random junk, I get about what you could expect:
( this is my own blast script with most terms being self explanatory, 
translates into "-v" to limit summary lines, -db selects the wgs database )
  567  blastnew -out control -nuc -hits 0 -summ 3000 -db wgs -expect 1e8 

$ more control| sed -n '/producing signif/,/^>/p'| sed -n 's/.*|//p' | awk 
'{print $1" " $2}' | sort | uniq -c | sort -g -r | more

304 Homo sapiens
261 Bos taurus
212 Pan troglodytes
171 Microcebus murinus
155 Equus caballus
136 Spermophilus tridecemlineatus
130 Canis familiaris
125 Otolemur garnettii
112 Ornithorhynchus anatinus
111 Tupaia belangeri
96 Myotis lucifugus
93 Mus musculus
86 Felis catus
75 Rattus norvegicus
74 Oryzias latipes
71 Drosophila erecta
68 Sorex araneus
63 Loxodonta africana
58 Anolis carolinensis
56 Monodelphis domestica
48 Macaca mulatta
47 Gallus gallus
34 Oryctolagus cuniculus
30 Erinaceus europaeus
27 Strongylocentrotus purpuratus
27 Callorhinchus milii
26 Dasypus novemcinctus
22 Echinops telfairi
22 Cavia porcellus
17 Danio rerio
14 Schmidtea mediterranea
13 Ochotona princeps
13 Aplysia californica
10 Anopheles gambiae

This on the other hand, has much better matches ( note expect limit )
  573  blastnew -out dog_sign -nuc -hits 0 -summ 3000 -db wgs -expect .01 
and it is confined to dogs:
$ more dog_sign| sed -n '/producing signif/,/^>/p'| sed -n 's/.*|//p' | awk 
'{print $1" " $2}' | sort | uniq -c | sort -g -r | more
   3000 Canis familiaris

And these all seem to be in different places ( most frequent location occurs 

$ more dog_sign| sed -n '/producing signif/,/^>/p'| sed -n 's/.*|//p' |awk 
'{print $3}'| sort | uniq -c | sort -g -r | more
      1 ctg19866851899833,
      1 ctg19866851899815,
      1 ctg19866851899794,

Anyone care to comment on significance of this sequence, or reason it is 
just an uninteresting


Mike Marchywka
586 Saint James Walk
Marietta GA 30067-7165
404-788-1216 (C)<- leave message
989-348-4796 (P)<- emergency only
marchywka at hotmail.com


General Forum at Bioinformatics.Org - BiO_Bulletin_Board at bioinformatics.org

Proprietary or confidential information belonging to Almac Group Limited or to one of its affiliated companies may be contained in this message. The e-mail and any files transmitted with it are confidential and privileged and intended solely for the use of the individual or entity to whom they are addressed. 

Any unauthorised direct or indirect dissemination, distribution or copying of this message and any attachments is strictly prohibited. 

If you have received the e-mail in error please notify helpdesk at almacgroup.com  and delete the e-mail from your system.

E-mail and other communications sent to this company may be reviewed or read by persons other than the intended recipient.

Viruses : although we have taken steps to ensure that this e-mail and any attachments are free from any virus, you should, in keeping with good practice, ensure that they are actually virus free.

More information about the BBB mailing list