Configuring GPU Resource Scheduling in Kubernetes Clusters
Prerequisites
Ensure NVIDIA drivers are installed on each node before proceeding.
Step 1: Install NVIDIA Container Runtime
Install the nvidia-container-runtime package on each node:
yum install nvidia-container-runtime
Step 2: Configure Docker
Edit /etc/docker/daemon.json to configure Docker to use the NVIDIA runtime:
{
"default-runtime ...
Posted on Sun, 31 May 2026 22:14:46 +0000 by thor erik
Parallel CUDA Installation and Version Management on Linux
When working with machine learning frameworks that depend on specific CUDA releases, maintaining multiple toolkit versions on a single Linux host becomes essential. Rather than removing existing installations, you can deploy additional releases alongside the primary version and toggle between them dynamically.
Installing an Additional CUDA Rele ...
Posted on Sat, 16 May 2026 11:23:29 +0000 by mcirl2