@@ -18,11 +18,11 @@ Here were train the ResNet-50 model on the Cassava dataset (see [video](https://
1818[NetID@log-1 ~ ]$ cd /scratch/< NetID> /tensorflow-example
1919
2020# copy over an overlay file with sufficient resources and unzip it
21- [NetID@cm001 tensorflow-example]$ cp -rp /scratch/work/public /overlay-fs-ext3/overlay-15GB-500K.ext3.gz .
21+ [NetID@cm001 tensorflow-example]$ cp -rp /share/apps /overlay-fs-ext3/overlay-15GB-500K.ext3.gz .
2222[NetID@cm001 tensorflow-example]$ gunzip overlay-15GB-500K.ext3.gz
2323
2424# start the singularity environment
25- [NetID@cm001 tensorflow-example]$ singularity exec --overlay overlay-15GB-500K.ext3:rw /scratch/work/public/singularity /cuda12.1.1-cudnn8.9.0-devel-ubuntu22.04.2.sif /bin/bash
25+ [NetID@cm001 tensorflow-example]$ singularity exec --overlay overlay-15GB-500K.ext3:rw /share/apps/images /cuda12.1.1-cudnn8.9.0-devel-ubuntu22.04.2.sif /bin/bash
2626
2727# install miniforge in singularity environment
2828Singularity> wget --no-check-certificate https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-Linux-x86_64.sh
@@ -88,7 +88,7 @@ Run the command below to download the data (4 GB in size):
8888# switch to a data transfer node
8989[NetID@log-1 tensorflow_example]$ ssh gdtn
9090[NetID@dtn-1 ~ ]$ cd /scratch/NetID/tensorflow_example
91- [NetID@dtn-1 tensorflow_example]$ singularity exec --nv --overlay /scratch/NetID/pytorch-example/my_pytorch.ext3:ro /scratch/work/public/singularity /cuda12.1.1-cudnn8.9.0-devel-ubuntu22.04.2.sif /bin/bash -c " source /ext3/env.sh; python download_data_and_weights.py"
91+ [NetID@dtn-1 tensorflow_example]$ singularity exec --nv --overlay /scratch/NetID/pytorch-example/my_pytorch.ext3:ro /share/apps/images /cuda12.1.1-cudnn8.9.0-devel-ubuntu22.04.2.sif /bin/bash -c " source /ext3/env.sh; python download_data_and_weights.py"
9292```
9393
9494### Step 3: Inspect the Script
@@ -181,7 +181,7 @@ module purge
181181
182182srun singularity exec --nv \
183183 --overlay /scratch/NetID/pytorch_examples_new/tensorflow-example/tensorflow.ext3:ro \
184- /scratch/work/public/singularity /cuda12.1.1-cudnn8.9.0-devel-ubuntu22.04.2.sif\
184+ /share/apps/images /cuda12.1.1-cudnn8.9.0-devel-ubuntu22.04.2.sif\
185185 /bin/bash -c " source /ext3/env.sh; python mnist_classify.py --batch-size-per-replica=32 --epochs=15"
186186```
187187::: note
0 commit comments