GitHub - ai-dojo/microbudget: Accompanying the talk "AI on a Microbudget: Methods of Machine Learning Miniaturization"

AI on a Microbudget

In recent years, the AI field has pursued ever larger models, trained at “eye-watering” cost. In this talk we explore ideas for the rest of us, the GPU-poor. We’ll show you how to make do with less – less computing power, less person power, less data – while still building powerful models.

Part 1: Methods of Machine Learning Miniaturization

Current progress in AI has seen remarkable capabilities emerging from simple prediction tasks – if we scale them massively. Surprisingly, we get sparks of reasoning and intelligence in a model that was trained to do little more than masked word prediction. Since that realization the AI field has pursued ever larger models, trained at “eye-watering” cost. If scaling is all you need – does it follow that, in practice, money is all you need?

In this talk we explore ideas for the rest of us, the GPU-poor. Taking examples from language processing and computer vision, we’ll show you how to make do with less – less computing power, less person power, less data – while still building powerful models. We will introduce a set of methods and open source tools for the efficient reuse and miniaturization of models, including transfer learning and fine-tuning, knowledge distillation, and model quantization. We will also discuss how to choose efficient model architectures, and investigate ways in which small and specialized models can outperform large models. Our talk aims to provide an overview for ML practitioners, draws from our combined project experience, and is accompanied by a repository of code examples to get you started with building AI on a microbudget.

📙 Companion Notebooks: Table of Contents

Setup

Clone repository
Install requirements listed in requirements.txt, e.g.

pip install -r requirements.txt

Run workflow notebooks.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
fig		fig
models/tmp		models/tmp
notebooks		notebooks
.gitignore		.gitignore
PyCon-Italia-2024-Slides.pdf		PyCon-Italia-2024-Slides.pdf
README.md		README.md
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI on a Microbudget

Part 1: Methods of Machine Learning Miniaturization

Setup

About

Releases

Packages

Languages

ai-dojo/microbudget

Folders and files

Latest commit

History

Repository files navigation

AI on a Microbudget

Part 1: Methods of Machine Learning Miniaturization

Setup

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages