User Tools

Site Tools


srun

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
srun [2014/04/22 17:01]
root
srun [2021/03/01 11:14] (current)
root
Line 1: Line 1:
 ====== srun ====== ====== srun ======
  
-Runs an executable file, not a script (but can “srun /bin/bash ...”).+Runs an executable file. It is really intended to be used from within **sbatch** scripts. Used within a script, each invocation of **srun** starts a new **job step** (terminology you will see in the slurm documentation). 
  
-Runs the job in the foreground (and blocks unless you use “&”).+Used from a terminal, srun runs the job in the foreground (and blocks unless you use “&”).
  
-Job will end if you close your terminal, unless you redirect IO and **disown** the job.+If you use srun directly (in a login terminal) your job will end if you close your terminal or close your laptop, unless you redirect IO and **disown** the job. You can also use a "terminal multiplexer" like screen, tmux, or byobu. However, even if you take steps to protect against "hang-ups", if the head node needs to be rebooted (to pick up kernel updates, for instance) that will terminate any jobs running through srun. A reboot of the head node will not affect jobs submitted using sbatch.
  
 ==== Examples ==== ==== Examples ====
  
 srun hostname srun hostname
 +
 +srun --pty bash -i 
 +  * Runs an interactive shell on a compute node.
 +  * You can specify a specific node with -w node5 (for instance).
 +    * Will only run if there is a slot available on the node.
 +
 +srun -n3 hostname (runs 3 **tasks**: copies of the program specified, may run on the same node)
 +
 +srun -n30 hostname (runs 30 **tasks**, will run on multiple nodes since no node in that standard queue has as many as 30 cores)
  
 srun -N3 hostname (runs on 3 different nodes) srun -N3 hostname (runs on 3 different nodes)
  
-srun -u bash -i  
-  * Runs an interactive shell on a compute node. 
-  * It's a little odd, but it does work. 
  
-srun can also be used to start "job stepsfrom within a script.+==== Protecting against "Hang-Ups==== 
 + 
 +<code> 
 +srun long-job > srun.out 2>&1 & 
 +disown 
 +</code>
  
 +But it's much better to use sbatch.
  
  
srun.1398200461.txt.gz · Last modified: 2014/04/22 17:01 by root