Managed and unmanaged Slurm

Slurm job management optimized for AI workloads is available on Lambda's 1-Click Clusters™ and Superclusters.

Talk to our team

Slurm job management for AI clusters

Our Slurm workload scheduler offering includes both unmanaged and managed solutions for NVIDIA GB300 NVL72 and HGX B300 clusters. Choose unmanaged for full control, or managed to let Lambda handle the administration.

Managed Slurm: hands-off efficiency

Let us handle the complexities of Slurm administration. Managed Slurm provides all the features of unmanaged, plus comprehensive support and management by Lambda:

Slurm patches
Job history tracking
Technical support — Lambda partners with SchedMD for backend support
Node failure detection and replacement
Cluster and Slurm daemon health monitoring, including slurmctl, slurmdbd, and node Slurm

Talk to our team

Unmanaged Slurm: complete control

Take the reins with Unmanaged Slurm. You get Lambda's optimized Slurm configuration with built-in features for advanced cluster management, including:

Built-in LDAP auth for user/group management
Policies based on cgroups
Container support (Pyxis, Enroot)
Slurm user, operator, and administrator access
High Availability (HA)

Talk to our team

Deploy seamlessly

Both Unmanaged and Managed Slurm run on Lambda's 1-Click Clusters with NVIDIA HGX B200 and H100 GPUs, providing scalable GPU resources for your AI workloads.

Talk to our team

FOOTER

AI FACTORIES

For every mission

Superintelligence
Enterprise
Government
Startups and researchers

Foundations

AI infrastructure
Trust and security
Customer stories

Products

Superclusters
1-Click Clusters
Instances

Features

AI infrastructure
Orchestration
Lambda Stack
Trust and security

Docs

Documentation
Blog
Research

Company

Inside Lambda

About
Careers
Leadership
Investors

Resources

Research
Customer stories
Blog
Partners
Brand guidelines

Privacy Policy
Terms of Service
Cookie preferences