A Slurm cluster using docker-compose
Updated Sep 27, 2024 - Dockerfile
Deploy Dask on job schedulers like PBS, SLURM, and SGE
An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
A Slurm dashboard for the terminal.
How to Configure a GPU Cluster Running Ubuntu Linux
qtop (pronounced queue-top) is a tool for monitoring the state of queueing systems, along with related information relevant to HPC and grid clusters. At present it supports the **PBS, SGE & OAR** families. There is a historic reference for the prior shell version of the tool, at former CERN source:
Open source digital rocks software platform for micro-CT, CT, thin sections and borehole image analysis. Includes tools for: annotation, AI, HPC, porous media flow simulation, porosity analysis, permeability analysis and much more.
Submit functions and shell scripts to Torque and Slurm clusters or local machines using Python.
UAB Research Computing Documentation
Container-based Slurm cluster with support for running on multiple ssh-accessible computers. Currently it is based on podman, systemd, norouter and sshocker (sshfs).
spart: a user-oriented partition info command for slurm
Submit Slurm cluster jobs (sbatch) from within Python and avoid shell scripts. The submission command can be customized to add more options.
Basically all ingredients for building HPC style clusters are here.
LaunchPad is a lightweight Slurm job launcher designed for hyperparameter search.
This project provides provisioned HPC cluster models using underlying virtualization mechanisms.
asyncmd is a library for writing concurrent code for the setup, running, and analysis of molecular dynamics simulations using Python's async/await syntax.
Helps you submit multi-node, multi-GPU jobs to Slurm using torchrun.
Slack bot to interface with Slurm and other utilities
A super simplistic way to perform hyperparameter tuning (and other things!) on SLURM. Made to be quick and easy to use and build on, and to get your jobs done faster.
A Python package to manage protein design workflows on computing clusters and local machines. Documentation can be found here: https://protflow.readthedocs.io/en/latest/