<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=utf-8">
<meta name=Generator content="Microsoft Word 11 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
        {font-family:"Lucida Sans Unicode";
        panose-1:2 11 6 2 3 5 4 2 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman";}
a:link, span.MsoHyperlink
        {color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {color:#606420;
        text-decoration:underline;}
span.EmailStyle17
        {mso-style-type:personal-compose;
        font-family:"Lucida Sans Unicode";
        color:windowtext;
        font-weight:normal;
        font-style:normal;
        text-decoration:none none;}
@page Section1
        {size:8.5in 11.0in;
        margin:1.0in 1.25in 1.0in 1.25in;}
div.Section1
        {page:Section1;}
/* List Definitions */
@list l0
        {mso-list-id:953636588;
        mso-list-type:hybrid;
        mso-list-template-ids:734979778 67698689 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;}
@list l0:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7;
        mso-level-tab-stop:.5in;
        mso-level-number-position:left;
        text-indent:-.25in;
        font-family:Symbol;
        mso-fareast-font-family:"Times New Roman";
        mso-bidi-font-family:"Times New Roman";}
@list l0:level2
        {mso-level-tab-stop:1.0in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level3
        {mso-level-tab-stop:1.5in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level4
        {mso-level-tab-stop:2.0in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level5
        {mso-level-tab-stop:2.5in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level6
        {mso-level-tab-stop:3.0in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level7
        {mso-level-tab-stop:3.5in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level8
        {mso-level-tab-stop:4.0in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l0:level9
        {mso-level-tab-stop:4.5in;
        mso-level-number-position:left;
        text-indent:-.25in;}
@list l1
        {mso-list-id:1494183454;
        mso-list-template-ids:-1978127408;}
@list l1:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7;
        mso-level-tab-stop:.5in;
        mso-level-number-position:left;
        text-indent:-.25in;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
ol
        {margin-bottom:0in;}
ul
        {margin-bottom:0in;}
-->
</style>
</head>
<body lang=EN-US link=blue vlink="#606420">
<div class=Section1>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>Hi All,<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>I’ve scanned
the SGE documentation and user groups, and have not found an answer to this
question. I got such good service last time I asked a question here, I
thought I’d try again!<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>I have a 10-node
cluster (soon to grow) running SGE. Two groups contributed funds for the
hardware. Both groups have periods of heavy use and periods of very
light use. Hence, I’d like the following usage model:<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<ul style='margin-top:0in' type=disc>
<li class=MsoNormal style='mso-list:l0 level1 lfo3'><font size=2
face="Lucida Sans Unicode"><span style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>If
group A (or B) is the only one using it, they get all 10 machines.<o:p></o:p></span></font></li>
<li class=MsoNormal style='mso-list:l0 level1 lfo3'><font size=2
face="Lucida Sans Unicode"><span style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>If
group A and group B are both using it, they effectively get 5 machines
each.<o:p></o:p></span></font></li>
</ul>
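<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>To make the two
rules above concrete, here is a rough sketch of the split I have in mind. It is
plain Python, not SGE configuration, and the name fair_share is made up purely
for illustration; it only assumes that nodes are divided evenly among whichever
groups currently have work.<o:p></o:p></span></font></p>
<pre>
```python
def fair_share(total_nodes, active_groups):
    """Split nodes evenly among the groups that currently have work.

    Hypothetical helper: it describes the desired outcome only and
    says nothing about how the scheduler would enforce it.
    """
    groups = sorted(active_groups)
    if not groups:
        return {}
    base, extra = divmod(total_nodes, len(groups))
    # Any remainder goes to the first groups in alphabetical order.
    return {g: base + (1 if extra > i else 0) for i, g in enumerate(groups)}


if __name__ == "__main__":
    print(fair_share(10, ["A"]))
    print(fair_share(10, ["A", "B"]))
```
</pre>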
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>The jobs submitted
tend to be very big array jobs, each part of the array job taking 5 or 10
minutes.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>It is easy enough to
set up one queue on each machine for each group (i.e. each machine has two
queues), and control access by user ID.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>But how to configure
the queues? Imagine group A is running on all 10 nodes, and group B
submits. What I would like to see, on the 5 group B machines, is the
group B jobs starting, the group A jobs completing, and no more group A jobs
being started (on the B machines).<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>I can’t see
how to do this. The subordinate queue mechanism would suspend the A
queues, which kills the jobs; I’d need to modify all the scripts that
combine the results of array jobs so that they can deal with killed pieces of
array jobs. What I think I need is an equivalent to subordinate queues
that disables the A queues instead of suspending them, so that running jobs
can complete while no new ones are started.<o:p></o:p></span></font></p>
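<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>Put another way,
the behaviour I’m after on each machine could be written as a simple
admission rule. This is only a sketch in plain Python to pin down the
semantics; it is not an SGE feature or setting, and the function and parameter
names are invented for illustration.<o:p></o:p></span></font></p>
<pre>
```python
def may_start(job_group, node_owner, owner_has_pending):
    """Decide whether a NEW array task may start on a node.

    Hypothetical rule, not an SGE API:
    - a group may always start tasks on its own nodes;
    - it may borrow another group's node only while that owner has
      nothing waiting to run.
    Tasks that are already running are never touched, so they finish
    normally instead of being suspended or killed.
    """
    if job_group == node_owner:
        return True
    return not owner_has_pending


if __name__ == "__main__":
    # Group A asks for a B machine while B has jobs waiting.
    print(may_start("A", "B", True))
```
</pre>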
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>My solution right
now is to set “nice” priorities, so that the A jobs largely get out
of the way of the B jobs on the B machines. This is not perfect: many
processes end up running on each node, and how long a piece of an array job
takes depends on where it is running. That imbalance can substantially
lengthen overall run times, because some pieces get “stuck” on
low-priority processes.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>This method doesn’t
scale nicely either: adding another group could result in even more processes
running on each node.<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>Thanks for any
pointers,<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'>Peter<o:p></o:p></span></font></p>
<p class=MsoNormal><font size=2 face="Lucida Sans Unicode"><span
style='font-size:10.0pt;font-family:"Lucida Sans Unicode"'><o:p> </o:p></span></font></p>
</div>
</body>
</html>