GPU docker builds are failing #53

@jrbourbeau

Here's an example of a failing build: https://github.com/coiled/examples/actions/runs/14764184328/job/41451797465?pr=52

The build fails at the step where we attempt to install the GPU version of PyTorch:

--------------------
  10 |     COPY pytorch.yml pytorch.yml
  11 | >>> RUN mamba env update -n base --file pytorch.yml \
  12 | >>>     && conda clean -afy \
  13 | >>>     && mamba uninstall -y pytorch torchvision \
  14 | >>>     && mamba install -y -n base -c pytorch -c nvidia -c conda-forge \
  15 | >>>         "cudatoolkit=${CUDA_VER%.*}.*" \
  16 | >>>         "cuda-version=${CUDA_VER%.*}.*" \
  17 | >>>         pytorch \
  18 | >>>         torchvision \
  19 | >>>         "pytorch-cuda=${CUDA_VER%.*}.*" \
  20 | >>>     && conda clean -afy \
  21 | >>>     && rm pytorch.yml
  22 |     
--------------------
ERROR: failed to solve: process "/bin/bash -euo pipefail -c mamba env update -n base --file pytorch.yml     && conda clean -afy     && mamba uninstall -y pytorch torchvision     && mamba install -y -n base -c pytorch -c nvidia -c conda-forge         \"cudatoolkit=${CUDA_VER%.*}.*\"         \"cuda-version=${CUDA_VER%.*}.*\"         pytorch         torchvision         \"pytorch-cuda=${CUDA_VER%.*}.*\"     && conda clean -afy     && rm pytorch.yml" did not complete successfully: exit code: 1

Just a hunch, but there have been a lot of changes to how PyTorch is distributed with conda lately. My guess is that we now need to install everything from conda-forge.
