Tue Jul 3 22:00:41 2012 UTC ()
Update to SLURM 2.4.1

* Changes in SLURM 2.4.1
========================
 -- Fix bug for job state change from 2.3 -> 2.4 job state can now be preserved
    correctly when transitioning.  This also applies for 2.4.0 -> 2.4.1, no
    state will be lost. (Thanks to Carles Fenoy)

* Changes in SLURM 2.4.0
========================
 -- Cray - Improve support for zero compute note resource allocations.
    Partition used can now be configured with no nodes nodes.
 -- BGQ - make it so srun -i<taskid> works correctly.
 -- Fix parse_uint32/16 to complain if a non-digit is given.
 -- Add SUBMITHOST to job state passed to Moab vial sched/wiki2. Patch by Jon
    Bringhurst (LANL).
 -- BGQ - Fix issue when running with AllowSubBlockAllocations=Yes without
    compiling with --enable-debug
 -- Modify scontrol to require "-dd" option to report batch job's script. Patch
    from Don Albert, Bull.
 -- Modify SchedulerParamters option to match documentation: "bf_res="
    changed to "bf_resolution=". Patch from Rod Schultz, Bull.
 -- Fix bug that clears job pending reason field. Patch fron Don Lipari, LLNL.
 -- In etc/init.d/slurm move check for scontrol after sourcing
    /etc/sysconfig/slurm. Patch from Andy Wettstein, University of Chicago.
 -- Fix in scheduling logic that can delay jobs with min/max node counts.
 -- BGQ - fix issue where if a step uses the entire allocation and then
    the next step in the allocation only uses part of the allocation it gets
    the correct cnodes.
 -- BGQ - Fix checking for IO on a block with new IBM driver V1R1M1 previous
    function didn't always work correctly.
 -- BGQ - Fix issue when a nodeboard goes down and you want to combine blocks
    to make a larger small block and are running with sub-blocks.
 -- BLUEGENE - Better logic for making small blocks around bad nodeboard/card.
 -- BGQ - When using an old IBM driver cnodes that go into error because of
    a job kill timeout aren't always reported to the system.  This is now
    handled by the runjob_mux plugin.
 -- BGQ - Added information on how to setup the runjob_mux to run as SlurmUser.
 -- Improve memory consumption on step layouts with high task count.
 -- BGQ - quiter debug when the real time server comes back but there are
    still messages we find when we poll but haven't given it back to the real
    time yet.
 -- BGQ - fix for if a request comes in smaller than the smallest block and
    we must use a small block instead of a shared midplane block.
 -- Fix issues on large jobs (>64k tasks) to have the correct counter type when
    packing the step layout structure.
 -- BGQ - fix issue where if a user was asking for tasks and ntasks-per-node
    but not node count the node count is correctly figured out.
 -- Move logic to always use the 1st alphanumeric node as the batch host for
    batch jobs.
 -- BLUEGENE - fix race condition where if a nodeboard/card goes down at the
    same time a block is destroyed and that block just happens to be the
    smallest overlapping block over the bad hardware.
 -- Fix bug when querying accounting looking for a job node size.
 -- BLUEGENE - fix possible race condition if cleaning up a block and the
    removal of the job on the block failed.
 -- BLUEGENE - fix issue if a cable was in an error state make it so we can
    check if a block is still makable if the cable wasn't in error.
 -- Put nodes names in alphabetic order in node table.
 -- If preempted job should have a grace time and preempt mode is not cancel
    but job is going to be canceled because it is interactive or other reason
    it now receives the grace time.
 -- BGQ - Modified documents to explain new plugin_flags needed in bg.properties
    in order for the runjob_mux to run correctly.
 -- BGQ - change linking from libslurm.o to libslurmhelper.la to avoid warning.

* Changes in SLURM 2.4.0.rc1
=============================
 -- Improve task binding logic by making fuller use of HWLOC library,
    especially with respect to Opteron 6000 series processors. Work contributed
    by Komoto Masahiro.
 -- Add new configuration parameter PriorityFlags, based upon work by
    Carles Fenoy (Barcelona Supercomputer Center).
 -- Modify the step completion RPC between slurmd and slurmstepd in order to
    eliminate a possible deadlock. Based on work by Matthieu Hautreux, CEA.
 -- Change the owner of slurmctld and slurmdbd log files to the appropriate
    user. Without this change the files will be created by and owned by the
    user starting the daemons (likely user root).
 -- Reorganize the slurmstepd logic in order to better support NFS and
    Kerberos credentials via the AUKS plugin. Work by Matthieu Hautreux, CEA.
 -- Fix bug in allocating GRES that are associated with specific CPUs. In some
    cases the code allocated first available GRES to job instead of allocating
    GRES accessible to the specific CPUs allocated to the job.
 -- spank: Add callbacks in slurmd: slurm_spank_slurmd_{init,exit}
    and job epilog/prolog: slurm_spank_job_{prolog,epilog}
 -- spank: Add spank_option_getopt() function to api
 -- Change resolution of switch wait time from minutes to seconds.
 -- Added CrpCPUMins to the output of sshare -l for those using hard limit
    accounting.  Work contributed by Mark Nelson.
 -- Added mpi/pmi2 plugin for complete support of pmi2 including acquiring
    additional resources for newly launched tasks. Contributed by Hongjia Cao,
    NUDT.
 -- BGQ - fixed issue where if a user asked for a specific node count and more
    tasks than possible without overcommit the request would be allowed on more
    nodes than requested.
 -- Add support for new SchedulerParameters of bf_max_job_user, maximum number
    of jobs to attempt backfilling per user. Work by Bj翹rn-Helge Mevik,
    University of Oslo.
 -- BLUEGENE - fixed issue where MaxNodes limit on a partition only limited
    larger than midplane jobs.
 -- Added cpu_run_min to the output of sshare --long.  Work contributed by
    Mark Nelson.
 -- BGQ - allow regular users to resolve Rack-Midplane to AXYZ coords.
 -- Add sinfo output format option of "%R" for partition name without "*"
    appended for default partition.
 -- Cray - Add support for zero compute note resource allocation to run batch
    script on front-end node with no ALPS reservation. Useful for pre- or post-
    processing.
 -- Support for cyclic distribution of cpus in task/cgroup plugin from Martin
    Perry, Bull.
 -- GrpMEM limit for QOSes and associations added Patch from Bj翹rn-Helge Mevik,
    University of Oslo.
 -- Various performance improvements for up to 500% higher throughput depending
    upon configuration. Work supported by the Oak Ridge National Laboratory
    Extreme Scale Systems Center.
 -- Added jobacct_gather/cgroup plugin.  It is not advised to use this in
    production as it isn't currently complete and doesn't provide an equivalent
    substitution for jobacct_gather/linux yet. Work by Martin Perry, Bull.


(asau)
diff -r1.3 -r1.4 pkgsrc/parallel/slurm/Makefile
diff -r1.1.1.1 -r1.2 pkgsrc/parallel/slurm/PLIST
diff -r1.1.1.1 -r1.2 pkgsrc/parallel/slurm/distinfo

cvs diff -r1.3 -r1.4 pkgsrc/parallel/slurm/Attic/Makefile (expand / switch to unified diff)

--- pkgsrc/parallel/slurm/Attic/Makefile 2012/07/03 14:09:11 1.3
+++ pkgsrc/parallel/slurm/Attic/Makefile 2012/07/03 22:00:41 1.4
@@ -1,17 +1,16 @@ @@ -1,17 +1,16 @@
1# $NetBSD: Makefile,v 1.3 2012/07/03 14:09:11 asau Exp $ 1# $NetBSD: Makefile,v 1.4 2012/07/03 22:00:41 asau Exp $
2 2
3DISTNAME= slurm-2.4.0-0.pre4 3DISTNAME= slurm-2.4.1
4PKGNAME= slurm-2.4.0pre4 
5CATEGORIES= parallel 4CATEGORIES= parallel
6MASTER_SITES= http://www.schedmd.com/download/archive/ \ 5MASTER_SITES= http://www.schedmd.com/download/archive/ \
7 http://www.schedmd.com/download/latest/ \ 6 http://www.schedmd.com/download/latest/ \
8 http://www.schedmd.com/download/development/ 7 http://www.schedmd.com/download/development/
9EXTRACT_SUFX= .tar.bz2 8EXTRACT_SUFX= .tar.bz2
10 9
11MAINTAINER= asau@inbox.ru 10MAINTAINER= asau@inbox.ru
12HOMEPAGE= http://www.schedmd.com/ 11HOMEPAGE= http://www.schedmd.com/
13COMMENT= Simple Linux Utility for Resource Management 12COMMENT= Simple Linux Utility for Resource Management
14 13
15PKG_DESTDIR_SUPPORT= user-destdir 14PKG_DESTDIR_SUPPORT= user-destdir
16 15
17USE_LANGUAGES= c c++ 16USE_LANGUAGES= c c++

cvs diff -r1.1.1.1 -r1.2 pkgsrc/parallel/slurm/Attic/PLIST (expand / switch to unified diff)

--- pkgsrc/parallel/slurm/Attic/PLIST 2012/03/20 14:52:15 1.1.1.1
+++ pkgsrc/parallel/slurm/Attic/PLIST 2012/07/03 22:00:41 1.2
@@ -1,14 +1,14 @@ @@ -1,14 +1,14 @@
1@comment $NetBSD: PLIST,v 1.1.1.1 2012/03/20 14:52:15 asau Exp $ 1@comment $NetBSD: PLIST,v 1.2 2012/07/03 22:00:41 asau Exp $
2bin/sacct 2bin/sacct
3bin/sacctmgr 3bin/sacctmgr
4bin/salloc 4bin/salloc
5bin/sattach 5bin/sattach
6bin/sbatch 6bin/sbatch
7bin/sbcast 7bin/sbcast
8bin/scancel 8bin/scancel
9bin/scontrol 9bin/scontrol
10bin/sdiag 10bin/sdiag
11bin/sinfo 11bin/sinfo
12bin/smap 12bin/smap
13bin/sprio 13bin/sprio
14bin/squeue 14bin/squeue
@@ -31,39 +31,41 @@ lib/slurm/accounting_storage_slurmdbd.la @@ -31,39 +31,41 @@ lib/slurm/accounting_storage_slurmdbd.la
31lib/slurm/auth_munge.la 31lib/slurm/auth_munge.la
32lib/slurm/auth_none.la 32lib/slurm/auth_none.la
33lib/slurm/checkpoint_none.la 33lib/slurm/checkpoint_none.la
34lib/slurm/checkpoint_ompi.la 34lib/slurm/checkpoint_ompi.la
35lib/slurm/crypto_munge.la 35lib/slurm/crypto_munge.la
36lib/slurm/crypto_openssl.la 36lib/slurm/crypto_openssl.la
37lib/slurm/gres_gpu.la 37lib/slurm/gres_gpu.la
38lib/slurm/gres_nic.la 38lib/slurm/gres_nic.la
39lib/slurm/job_submit_cnode.la 39lib/slurm/job_submit_cnode.la
40lib/slurm/job_submit_defaults.la 40lib/slurm/job_submit_defaults.la
41lib/slurm/job_submit_logging.la 41lib/slurm/job_submit_logging.la
42lib/slurm/job_submit_partition.la 42lib/slurm/job_submit_partition.la
43lib/slurm/jobacct_gather_aix.la 43lib/slurm/jobacct_gather_aix.la
 44lib/slurm/jobacct_gather_cgroup.la
44lib/slurm/jobacct_gather_linux.la 45lib/slurm/jobacct_gather_linux.la
45lib/slurm/jobacct_gather_none.la 46lib/slurm/jobacct_gather_none.la
46lib/slurm/jobcomp_filetxt.la 47lib/slurm/jobcomp_filetxt.la
47lib/slurm/jobcomp_none.la 48lib/slurm/jobcomp_none.la
48lib/slurm/jobcomp_script.la 49lib/slurm/jobcomp_script.la
49lib/slurm/mpi_lam.la 50lib/slurm/mpi_lam.la
50lib/slurm/mpi_mpich1_p4.la 51lib/slurm/mpi_mpich1_p4.la
51lib/slurm/mpi_mpich1_shmem.la 52lib/slurm/mpi_mpich1_shmem.la
52lib/slurm/mpi_mpichgm.la 53lib/slurm/mpi_mpichgm.la
53lib/slurm/mpi_mpichmx.la 54lib/slurm/mpi_mpichmx.la
54lib/slurm/mpi_mvapich.la 55lib/slurm/mpi_mvapich.la
55lib/slurm/mpi_none.la 56lib/slurm/mpi_none.la
56lib/slurm/mpi_openmpi.la 57lib/slurm/mpi_openmpi.la
 58lib/slurm/mpi_pmi2.la
57lib/slurm/preempt_none.la 59lib/slurm/preempt_none.la
58lib/slurm/preempt_partition_prio.la 60lib/slurm/preempt_partition_prio.la
59lib/slurm/preempt_qos.la 61lib/slurm/preempt_qos.la
60lib/slurm/priority_basic.la 62lib/slurm/priority_basic.la
61lib/slurm/priority_multifactor.la 63lib/slurm/priority_multifactor.la
62lib/slurm/proctrack_cgroup.la 64lib/slurm/proctrack_cgroup.la
63lib/slurm/proctrack_linuxproc.la 65lib/slurm/proctrack_linuxproc.la
64lib/slurm/proctrack_pgid.la 66lib/slurm/proctrack_pgid.la
65lib/slurm/sched_backfill.la 67lib/slurm/sched_backfill.la
66lib/slurm/sched_builtin.la 68lib/slurm/sched_builtin.la
67lib/slurm/sched_hold.la 69lib/slurm/sched_hold.la
68lib/slurm/sched_wiki.la 70lib/slurm/sched_wiki.la
69lib/slurm/sched_wiki2.la 71lib/slurm/sched_wiki2.la
@@ -247,26 +249,27 @@ man/man8/spank.8 @@ -247,26 +249,27 @@ man/man8/spank.8
247sbin/slurmctld 249sbin/slurmctld
248sbin/slurmd 250sbin/slurmd
249sbin/slurmdbd 251sbin/slurmdbd
250sbin/slurmstepd 252sbin/slurmstepd
251share/doc/${PKGNAME}/html/accounting.html 253share/doc/${PKGNAME}/html/accounting.html
252share/doc/${PKGNAME}/html/accounting_storageplugins.html 254share/doc/${PKGNAME}/html/accounting_storageplugins.html
253share/doc/${PKGNAME}/html/allocation_pies.gif 255share/doc/${PKGNAME}/html/allocation_pies.gif
254share/doc/${PKGNAME}/html/api.html 256share/doc/${PKGNAME}/html/api.html
255share/doc/${PKGNAME}/html/arch.gif 257share/doc/${PKGNAME}/html/arch.gif
256share/doc/${PKGNAME}/html/authplugins.html 258share/doc/${PKGNAME}/html/authplugins.html
257share/doc/${PKGNAME}/html/big_sys.html 259share/doc/${PKGNAME}/html/big_sys.html
258share/doc/${PKGNAME}/html/bluegene.html 260share/doc/${PKGNAME}/html/bluegene.html
259share/doc/${PKGNAME}/html/bull.jpg 261share/doc/${PKGNAME}/html/bull.jpg
 262share/doc/${PKGNAME}/html/cgroups.html
260share/doc/${PKGNAME}/html/checkpoint_blcr.html 263share/doc/${PKGNAME}/html/checkpoint_blcr.html
261share/doc/${PKGNAME}/html/checkpoint_plugins.html 264share/doc/${PKGNAME}/html/checkpoint_plugins.html
262share/doc/${PKGNAME}/html/coding_style.pdf 265share/doc/${PKGNAME}/html/coding_style.pdf
263share/doc/${PKGNAME}/html/configurator.easy.html 266share/doc/${PKGNAME}/html/configurator.easy.html
264share/doc/${PKGNAME}/html/configurator.html 267share/doc/${PKGNAME}/html/configurator.html
265share/doc/${PKGNAME}/html/cons_res.html 268share/doc/${PKGNAME}/html/cons_res.html
266share/doc/${PKGNAME}/html/cons_res_share.html 269share/doc/${PKGNAME}/html/cons_res_share.html
267share/doc/${PKGNAME}/html/cpu_management.html 270share/doc/${PKGNAME}/html/cpu_management.html
268share/doc/${PKGNAME}/html/cray.html 271share/doc/${PKGNAME}/html/cray.html
269share/doc/${PKGNAME}/html/crypto_plugins.html 272share/doc/${PKGNAME}/html/crypto_plugins.html
270share/doc/${PKGNAME}/html/disclaimer.html 273share/doc/${PKGNAME}/html/disclaimer.html
271share/doc/${PKGNAME}/html/dist_plane.html 274share/doc/${PKGNAME}/html/dist_plane.html
272share/doc/${PKGNAME}/html/documentation.html 275share/doc/${PKGNAME}/html/documentation.html

cvs diff -r1.1.1.1 -r1.2 pkgsrc/parallel/slurm/Attic/distinfo (expand / switch to unified diff)

--- pkgsrc/parallel/slurm/Attic/distinfo 2012/03/20 14:52:15 1.1.1.1
+++ pkgsrc/parallel/slurm/Attic/distinfo 2012/07/03 22:00:41 1.2
@@ -1,13 +1,13 @@ @@ -1,13 +1,13 @@
1$NetBSD: distinfo,v 1.1.1.1 2012/03/20 14:52:15 asau Exp $ 1$NetBSD: distinfo,v 1.2 2012/07/03 22:00:41 asau Exp $
2 2
3SHA1 (slurm-2.4.0-0.pre4.tar.bz2) = 869169e1eb2ed5cc2736804cc0e6a43120eed3f5 3SHA1 (slurm-2.4.1.tar.bz2) = 76b1eccad48d74ad9254d79d1252f3097e719f57
4RMD160 (slurm-2.4.0-0.pre4.tar.bz2) = cbc7bb456389032c8424dde628addf8b5a91ae0d 4RMD160 (slurm-2.4.1.tar.bz2) = 82b15dc29dc4297cb62298650f10881694b6e224
5Size (slurm-2.4.0-0.pre4.tar.bz2) = 5146463 bytes 5Size (slurm-2.4.1.tar.bz2) = 5212382 bytes
6SHA1 (patch-doc_html_Makefile.am) = 92a1942ed7c532fee6597f4d8a3adf81352f6d98 6SHA1 (patch-doc_html_Makefile.am) = 92a1942ed7c532fee6597f4d8a3adf81352f6d98
7SHA1 (patch-doc_html_Makefile.in) = 65f05532ae7701c8d33fd46b7c5f853c9bd6a1b0 7SHA1 (patch-doc_html_Makefile.in) = 65f05532ae7701c8d33fd46b7c5f853c9bd6a1b0
8SHA1 (patch-doc_man_man1_Makefile.am) = c21d927d0d4949d1b82c57e865ee7a79ed8b99ed 8SHA1 (patch-doc_man_man1_Makefile.am) = c21d927d0d4949d1b82c57e865ee7a79ed8b99ed
9SHA1 (patch-doc_man_man1_Makefile.in) = 9c568214983defe06627c701e96267f568597d50 9SHA1 (patch-doc_man_man1_Makefile.in) = 9c568214983defe06627c701e96267f568597d50
10SHA1 (patch-doc_man_man5_Makefile.am) = b8473964ad03e95c8416fe623e198f50dd557a08 10SHA1 (patch-doc_man_man5_Makefile.am) = b8473964ad03e95c8416fe623e198f50dd557a08
11SHA1 (patch-doc_man_man5_Makefile.in) = c1ea74633a8c59eaa6aaca3041c48770d7705432 11SHA1 (patch-doc_man_man5_Makefile.in) = c1ea74633a8c59eaa6aaca3041c48770d7705432
12SHA1 (patch-doc_man_man8_Makefile.am) = 46fb6837dc31e6f7ca926ffff1ddda1bdefeb83a 12SHA1 (patch-doc_man_man8_Makefile.am) = 46fb6837dc31e6f7ca926ffff1ddda1bdefeb83a
13SHA1 (patch-doc_man_man8_Makefile.in) = e7c66557680c550f1d87147dad4107ad00c4c713 13SHA1 (patch-doc_man_man8_Makefile.in) = e7c66557680c550f1d87147dad4107ad00c4c713