02476_FinalProject_Group20

Project Description

Overall goal of the project

The overall goal of this project, is to facilitate the developyment and deployment of machine learning models. This is done by adhering MLOps best practices. The goal is to create a reliable, efficient and scalable process. This project aims to build a Rice Classifier, that is trained on a machine learning model. The structured format should support development, experimentation and deployment. Here, tools such as Docker and version control are used. A part of the best practices is to provide config and requirements files to ensure reproducability and changes to hyperparameters.

What framework are you going to use, and you do you intend to include the framework into your project?

(Kun en ide) In this project, a framework called FastAI is going to be used. This framework is built on PyTorch, and is used to make machine learning accessible, while maintaining flexibility and performance. It has a strong fundament in machine learning, and has built-in data handling, called DataBlock. To integrate this, the DataBlock is used to initialize and preprocess the data, pre-trained models are used and fine-tuned to our purpose. The model is evaluated and the model is deployed. All this is possible through the FastAI framework.

What data are you going to run on

The data used for this project is called "Rice_Image_Dataset", and contains subfolders with different varities of rice. These rice are Arborio, Basmati, Ipsala, Jasmine, Karacadag. The data are images in RGB format of rice grains. Each subfolder acts as a class label, so the images are already labeled. The data is collected in 2021, and contains 75000 data samples. The data can be found here. Murat Koklu's Datasets.

What models do you expect to use

In this project, we expect to use pre-traied models, in order to successfully make the classification task. The models we are going to focus on will be the following:

Resnet (Residual Networks)
- Good for image classification, because of it's deep architecture.
- Pre-traied and accessable through PyTorch
EfficientNet
- Is known to have a high accuracy, but a reduced amout of parameters compared to other models.
- Ideal for testing a more simple approach.
VGG
- Simple and effective, but computationally expensive
- Is good as a benchmark when compared to more modern architectures.

Third-party Python libraries not used in the course

tqdm for progress bar.

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
02476_FinalProject_Group20		02476_FinalProject_Group20
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

02476_FinalProject_Group20

Project Description

Overall goal of the project

What framework are you going to use, and you do you intend to include the framework into your project?

What data are you going to run on

What models do you expect to use

Third-party Python libraries not used in the course

About

Releases

Packages

Contributors 3

Languages

RasDTU02/02476_FinalProject_Group20

Folders and files

Latest commit

History

Repository files navigation

02476_FinalProject_Group20

Project Description

Overall goal of the project

What framework are you going to use, and you do you intend to include the framework into your project?

What data are you going to run on

What models do you expect to use

Third-party Python libraries not used in the course

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages