[BiO BB] program for sequence length

Ketil Malde ketil.malde at imr.no
Fri May 7 02:46:25 EDT 2010


Pankaj Khurana <pkhurana08 at gmail.com> writes:

> I have a few 1000 fasta files. I would like to get the list showing the
> sequence name and their respective lengths.

If you don't mind using Haskell:

  import Bio.Sequence

  main = do 
    ss <- readFasta "file.fasta"
    putStr (unlines [toStr (seqlabel s) ++ "\t" ++ show (seqlen s) | s <- ss])

You can probably do something similar using Bio{Perl,Python,Ruby,..}.

-k
-- 
If I haven't seen further, it is by standing in the footprints of giants




More information about the BBB mailing list