Managed and unmanaged Slurm
Slurm job management optimized for AI workloads is available on Lambda's 1-Click Clusters™ and Superclusters.

Slurm job management for AI clusters
Our Slurm workload scheduler offering includes both unmanaged and managed solutions for NVIDIA GB300 NVL72 and HGX B300 clusters. Choose unmanaged for full control, or managed to let Lambda handle the administration.

Managed Slurm: hands-off efficiency
Let us handle the complexities of Slurm administration. Managed Slurm provides all the features of unmanaged, plus comprehensive support and management by Lambda:
-
Slurm patches
-
Job history tracking
-
Technical support — Lambda partners with SchedMD for backend support
-
Node failure detection and replacement
-
Cluster and Slurm daemon health monitoring, including slurmctl, slurmdbd, and node Slurm

Unmanaged Slurm: complete control
Take the reins with Unmanaged Slurm. You get Lambda's optimized Slurm configuration with built-in features for advanced cluster management, including:
-
Built-in LDAP auth for user/group management
-
Policies based on cgroups
-
Container support (Pyxis, Enroot)
-
Slurm user, operator, and administrator access
-
High Availability (HA)
Deploy seamlessly
Both Unmanaged and Managed Slurm run on Lambda's 1-Click Clusters with NVIDIA HGX B200 and H100 GPUs, providing scalable GPU resources for your AI workloads.