Sorry if you got this before (via bio-bb), but the cd-hit project is looking for developers to fix some very simple (for a c/c++ programmer) problems with the software. Also some integration with seqio / xml libraries will be a big improvement in this software. Finally it would be great to add this software to the bio-package project, and make an RPM for cd-hit. Cheers, Dan.