[BioBrew Users] zombie HPL processes, was: disabling non-SGE access to the cluster

Bill Barnard bill at barnard-engineering.com
Tue Sep 2 18:00:04 EDT 2003


On Fri, 2003-08-29 at 19:37, Glen Otero wrote:

Glen,

I'm happy to report you were exactly correct. HPL is now running
properly for me. I hope to have more interesting things to write about
soon! I am extremely grateful for your help.

Cheers,

Bill

> WRT to HPL zombie processes, if the compute nodes are not pentium
> 4processors, then you might see zombie process behavior. The
> binariesfor hpl were optimized for the
> Pentium 4 and uses instructions (SSE2) not available on Pentium III or
> Athlon. The solution is to recompile the ATLAS library, install it and
> rebuild hpl against it. It is easiest to just download the
> Atlaslibraries from netlib (prebuilt)
> 
> http://www.netlib.org/atlas/archives/linux/
> 
> But if you want to rebuild atlas and hpl from scratch, you shouldstart
> by checking out a Rocks CVS source tree.
> 
> # cvs -d :pserver:anonymous at cvs.rocksclusters.org: /home/cvs/CVSROOT/
> \
> checkout -r ROCKS_2_3_2_i386 rocks-src
> 
> and make sure to get the 2_3_2 version and not the HEAD
> 
> Rebuild and install ATLAS:
> 
> # cd rocks/src/contrib/atlas
> # make rpm
> # rpm -Uvh --force /usr/src/redhat/RPMS/i386/atlas*rpm
> 
> Rebuild HPL (no need to install it on the frontend if you don't runhpl
> on the frontend):
> 
> # cd rocks/src/contrib/hpl
> # make rpm
> 
> Rebuild your distribution:
> 
> # cd /home/install
> # rocks-dist dist
> 
> Reinstall your compute nodes:
> 
> #shoot-node compute-0-1 compute-0-1...
> 





More information about the BioBrew-Users mailing list