This repository contains all the necessary code and documentation of the algorithm for the light trap and computer vision system to detect, track and classify live moths. It contains instructions and code for both training and data-processing.
Paper: https://www.biorxiv.org/content/10.1101/2020.03.18.996447v2
The following dependencies must be installed.
Dependency | Version |
---|---|
scikit_image | 0.16.2 |
numpy | 1.18.5 |
scipy | 1.4.1 |
Pympler | 0.7 |
tensorflow | 2.0.0 |
Pillow | 7.0.0 |
pandas | 1.0.4 |
Seaborn | 0.10.1 |
Scikit learn | 0.19.1 |
- Install the dependencies and create the environment using the provided OS specific environment file with the command "conda create --name myEnv --file ENV_FILE.txt" (See env file for Linux and Windows)
- Activate the enviorement using the command "activate myEnv"
Start the programs by running the files MCC_gui.py or MCC_algorithm.py in the code directory with the command "python MCC_gui.py" or "python MCC_algorithm.py".
The algorithm outputs the results in JSON and CSV files with date and counts for each species (class). These statistic files are by default named statistics.json and statistics.csv. The track files are by default named .json and .csv The track files contain the following information:
Property | Description | Example |
---|---|---|
id | The id of the track. | 0 |
startdate | The date when the track was first registered. YYYY:MM:DD | 20190901 |
starttime | The time of the day the track was first registered. HH:MM:SS | 03:32:12 |
endtime | The time the track was last registered. HH:MM:SS | 03:33:08 |
duration | The duration of the track in seconds. | 56.00 |
class | The class predicted by the algorithm. | noctua_pronuba |
counts | The number of times the given track has been present in a frame | 28.0 |
confidence | The algorithms confidence in the classification. The confidence is based on the mutual classifications of the track and is calculated as the ratio between the most classified class and the total number of classifications. | 6/10 = 60.00 |
size | The average number of blob pixels in one track. | 73563.79 |
distance | The euclidean distance in pixels the centerpoint of the blobs have moved throughout a track. | 65 |
New models can be trained using the provided python script: code/hp_param_training.py
Extract the 10classes_mixed.zip file that contains the training and validation dataset.
This script is configured for 10 classes including one for background images.
To train a model with your own dataset edit the hp_param_training.py script by providing a data path, logging path, model save path and edit the steps per epoch based on the size of the dataset.
Tensorboard command: tensorboard --logdir hparam_tuning --reload_multifile=true
Jakob Bonde Nielsen or Kim Bjerge