[Bioclusters] Question about grid
Tim Cutts
bioclusters@bioinformatics.org
Thu, 13 May 2004 14:41:17 +0100
On 13 May 2004, at 2:22 pm, Arnon Klein wrote:
> Doing this in Java is actualy pretty easy, since RMI lets you
> transport an object containing both code and data over the network.
But you'll be bitten by the usual problem which is that the compute/IO
ratio for most bioinformatics applications is very low, as has been
discussed on this list before ad nauseam. You'll spend so much time
getting the data to the compute node that you might as well have run
the compute locally. And your network admin might have things to say
about the whole idea, too. :-)
The bandwidth of gigabit ethernet is *theoretically* about an order of
magnitude less than the machine's own memory. It's actually much worse
than that in practice. Especially if you've got a network of these
little buggers all competing for the bandwidth trying to get each other
to run stuff.
Leaving aside the political nightmare that grid ideas create.
The only reason Seti@home succeeded as a grid computing project is that
the compute/IO ratio is *huge*. Genomics applications just can't do
this, although conceivably some things such as molecular dynamics
simulations might.
<troll>And it's never going to perform decently in java anyway</troll>
Tim