examples

Dioptra Examples and Demos

Caution

DEPRECATED

All folders with the prefix DEPRECATED- are examples that are obsolete and will not work with the v1.0.0 release of Dioptra. They are retained for reference only. Some may be ported to work with v1.0.0 in the future.

Creating a virtual environment

It is recommended that you create a virtual environment to use to manage the dependencies needed to run the setup scripts and use the Jupyter notebooks provided with the examples. Run the following after you have cloned this repository:

# Move into the examples folder of cloned repo
cd /path/to/dioptra/examples

# Create a new virtual environment at /path/to/dioptra/examples/.venv
python -m venv .venv

# Activate the virtual environment
source .venv/bin/activate

# Install the dependencies
python -m pip install -r ./scripts/venvs/examples-setup-requirements.txt

Downloading Datasets

The download script examples/scripts/download_data.py should be used to fetch the datasets used in Dioptra's examples and demos. In addition to fetching the datasets, this script automatically organizes the files and directories so that each dataset follows a consistent and predictable structure. The script can be used to download the following datasets:

MNIST
Road Signs Detection (Kaggle)
Fruits 360 (Kaggle)
[🚧 Under construction] ImageNet Object Localization Challenge (Kaggle)

Prerequisites

Create and activate the examples/.venv virtual environment if you have not already done so. You will also need to register for a Kaggle account and obtain an API token in order to fetch certain datasets. For instructions on how to obtain and use a Kaggle API token, see https://github.com/Kaggle/kaggle-api#api-credentials.

Usage

To run this script and download a dataset directly to a specific directory on your host machine, simply use the following:

python ./scripts/download_data.py --output /path/to/data/directory DATASET_NAME

For the full list of options and available datasets, run python ./scripts/download_data.py -h to display the script's help message:

Usage: download_data.py [OPTIONS] COMMAND [ARGS]...

  Fetch a dataset used in Dioptra's examples and demos.

Options:
  --output DIRECTORY            The path to the folder where the example
                                datasets are stored. Defaults to the current
                                working directory.
  --overwrite / --no-overwrite  Fetch the data even if the target folder
                                already exists and overwrite any existing data
                                files. By default the program will exit early
                                if the target folder already exists.
  -h, --help                    Show this message and exit.

Commands:
  fruits360  Fetch the Fruits 360 dataset hosted on Kaggle.
  imagenet   Fetch the ImageNet Object Localization Challenge dataset...
  mnist      Fetch the MNIST dataset.
  roadsigns  Fetch the Road Signs Detection dataset hosted on Kaggle.

Please note that some of the datasets have additional options that can be viewed by running python ./scripts/download_data.py DATASET -h. For example, running python ./scripts/download_data.py fruits360 -h displays the help message for the Fruits 360 dataset shown below:

Usage: download_data.py fruits360 [OPTIONS]

  Fetch the Fruits 360 dataset hosted on Kaggle.

  This downloader uses the Kaggle API and requires the use of an API token.
  For instructions on how to obtain and use a Kaggle API token, see
  https://github.com/Kaggle/kaggle-api#api-credentials.

Options:
  --remove-zip / --no-remove-zip  Remove/keep the dataset zip file after
                                  extracting it. By default it will be
                                  removed.
  -h, --help                      Show this message and exit.

Examples

# Downloads the MNIST dataset to /dioptra/data/Mnist, overwriting any existing files.
python ./scripts/download_data.py --output /dioptra/data --overwrite mnist

# Downloads the Road Signs Detection dataset to ./Road-Signs-Detection-v2. The
# script will stop early if the folder ./Road-Signs-Detection-v2 exists.
python ./scripts/download_data.py roadsigns

# Downloads the Fruits 360 dataset to /dioptra/data/Fruits360. The script will
# stop early and if the folder /dioptra/data/Fruits360 exists and the dataset
# zip file downloaded from Kaggle will not be removed.
python ./scripts/download_data.py --output /dioptra/data fruits360 --no-remove-zip

Mounting the data folder in the worker containers

In order to use the datasets you downloaded using the download_data.py script, you will need to mount them into the worker containers. However, the docker-compose.yml file generated by the cookiecutter template does not mount any folders from your host machine into the worker containers. To address this, open the docker-compose.yml file generated by the cookiecutter template in a text editor and find the blocks for the worker containers. The worker container blocks will have tfcpu, tfgpu, pytorch-cpu, or pytorch-gpu in their names. Append the line - /path/to/data:/dioptra/data:ro to the volumes: subsection. An example is shown below.

⚠️ Important! Do not use /path/to/data verbatim, change it to the absolute path to data folder you created on your computer!

dioptra-deployment-tfcpu-01:
  # Skipping unaffected lines in block
  volumes:
    - worker-ca-certificates:/usr/local/share/ca-certificates:rw
    - worker-etc-ssl:/etc/ssl:rw
    - /path/to/data:/dioptra/data:ro

dioptra-deployment-pytorchcpu-01:
  # Skipping unaffected lines in block
  volumes:
    - worker-ca-certificates:/usr/local/share/ca-certificates:rw
    - worker-etc-ssl:/etc/ssl:rw
    - /path/to/data:/dioptra/data:ro

If your data is stored in an NFS share and not on your local computer, please see Mounting a folder on a NFS share in the Dioptra documentation.

Starting Jupyter Lab

Each example is contained in a Jupyter notebook that interacts with your Dioptra instance. To use the notebooks, activate your virtual environment and start Jupyter Lab from the command line.

# Run this in the examples/ folder
jupyter lab

Once Jupyter Lab starts up and is visible in your web browser, use the file explorer to navigate to the example folder that you want to try and open the Jupyter notebook (it will usually be named demo.ipynb).

Name		Name	Last commit message	Last commit date
parent directory ..
DEPRECATED-pytorch-detectron2-demo		DEPRECATED-pytorch-detectron2-demo
DEPRECATED-pytorch-mnist-membership-inference		DEPRECATED-pytorch-mnist-membership-inference
DEPRECATED-tensorflow-adversarial-patches		DEPRECATED-tensorflow-adversarial-patches
DEPRECATED-tensorflow-backdoor-poisoning		DEPRECATED-tensorflow-backdoor-poisoning
DEPRECATED-tensorflow-imagenet-resnet50		DEPRECATED-tensorflow-imagenet-resnet50
DEPRECATED-tensorflow-mnist-classifier		DEPRECATED-tensorflow-mnist-classifier
DEPRECATED-tensorflow-mnist-feature-squeezing		DEPRECATED-tensorflow-mnist-feature-squeezing
DEPRECATED-tensorflow-mnist-model-inversion		DEPRECATED-tensorflow-mnist-model-inversion
DEPRECATED-tensorflow-mnist-pixel-threshold		DEPRECATED-tensorflow-mnist-pixel-threshold
scripts		scripts
task-plugins/dioptra_custom		task-plugins/dioptra_custom
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

README.md

Dioptra Examples and Demos

Creating a virtual environment

Downloading Datasets

Prerequisites

Usage

Examples

Mounting the data folder in the worker containers

Starting Jupyter Lab

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Dioptra Examples and Demos

Creating a virtual environment

Downloading Datasets

Prerequisites

Usage

Examples

Mounting the data folder in the worker containers

Starting Jupyter Lab