Skip to content

Machine Learning applied to sound

Notifications You must be signed in to change notification settings

jonnor/machinehearing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

95 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Machine Hearing

Classifying sound using Machine Learning

Youtube: Audio Classification using Machine Learning by Jon Nordby, EuroPython 2019

At KnowIt Oslo, 2020. Video recording, slides, notes

Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks

Github: jonnor/ESC-CNN-microcontroller

Master thesis. Report and code available in the Github repository.

EuroPython2019: Audio Classification using Machine Learning

Youtube: Audio Classification using Machine Learning by Jon Nordby, EuroPython 2019

Presentation at EuroPython2019. Video recording, notes

PyCode2019: Recognizing sounds with Machine Learning and Python

Slides

Presentation at PyCode Conference 2019 in Gdansk. Slides, notes

Video recording. Coming, maybe in November.

SenseCamp2019: Classification of Environmental Sound using IoT sensors

Slides

Presentation at SenseCamp 2019 hosted by FORCE Technology Senselab. Slides: web, .PDF

NMBU lecture on Audio Classification

Report and lecture at NMBU Data Science.

Report | Slides

Stack Overflow answers

With example code in Python

Notes

Rough notes on various topics.

Resources

Useful resources to learn more.

Presentations

Books

  • Computational Analysis of Sound Scenes and Events. Tuomas Virtanen, Mark D. Plumbley, Dan Ellis. 2018.
  • Human and Machine Hearing - Extracting Meaning from Sound. Richard F. Lyon. 2017, revised 2018.
  • An Introduction to Audio Content Analysis - Applications in Signal Processing and Music Informatics. Alexander Lerch. 2012. Companion website: https://www.audiocontentanalysis.org/
  • Machine Learning for Audio, Image and Video Analysis: Theory and Applications (Advanced Information and Knowledge Processing). Francesco Camastra, 3 sections. From Perception to Computation, Machine Learning, Applications.

Online courses

Software

Feature extraction

  • librosa. The go-to Python module.
  • essentia. C++ library, with Python bindings. Lots of Music Analysis extractors. Used by FreeSound and Acousticbrainz.
  • kapre. On-demand GPU computation of melspectrograms, for Keras
  • torchaudio.

Data Augmentation

Lecture notes

Competions

  • Kaggle Whale detection
  • Kaggle FreeSound tagging 2018
  • Kaggle FreeSound
  • DCASE2014
  • DCASE2018
  • DCASE2019

Datasets

Online Communities

Lists