Add portable CUDA 11.8 Docker environment for SLAM-LLM#248
Open
ak4off wants to merge 1 commit into
Open
Conversation
Author
|
Tested on multi-GPU ASR finetuning workflows with CUDA 11.8 and DeepSpeed 0.14.5. This setup was created to improve reproducibility across institutional GPU servers where sudo access and CUDA toolkit installation are restricted. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds a portable and reproducible CUDA 11.8 Docker environment for SLAM-LLM research workflows.
This setup is designed to reduce environment-related failures across institutional GPU servers and research clusters.
Includes
Motivation
Many users encounter:
nvcc)This Docker setup aims to provide a reproducible and portable environment across servers.
Notes
The image still requires:
--gpus all)GPU kernel drivers cannot be packaged inside Docker.