[Bioclusters] error on qsub/mpirun jobs
Zhiliang Hu
zhu at iastate.edu
Tue Sep 9 13:22:27 EDT 2008
Michael,
Thank you for the hints as where to look for the errors. That's helpful.
Zhiliang
At 11:35 AM 9/8/2008 -0400, Michael Edwards wrote:
>On Fri, Sep 5, 2008 at 4:55 PM, Zhiliang Hu <zhu at iastate.edu> wrote:
>
>>
>> ----------------------------------------------------------
>> Unable to copy file /var/spool/torque/spool/658.nagrp2..ER to
>> hu at hist:/raid/pub/ncbi/blast/www/mpiblast.tmp
>> >>> error from copy
>> Host key verification failed.
>> lost connection
>> >>> end error output
>> Output retained on that host in:
>> /var/spool/torque/undelivered/658.nagrp2..ER
>> ----------------------------------------------------------
>>
>> Note: When manually check, the "retained" file is not there:
>> "/var/spool/torque/undelivered/658.nagrp2..ER"
>>
>
>qsub opens a shell to the selected compute node and runs the script from
>that node. So the retained file would not be on the head node, but on the
>local file system on which the node was trying to run.
>
>If you are trying to run qsub, the code has to be present on all the compute
>nodes in the same place. The easiest way to do this is using a shared file
>system.
>_______________________________________________
>Bioclusters maillist - Bioclusters at bioinformatics.org
>http://www.bioinformatics.org/mailman/listinfo/bioclusters
More information about the Bioclusters
mailing list