Skip to content

The PyTorch-based audio source separation toolkit for researchers

License

Notifications You must be signed in to change notification settings

ovistetom/asteroid

Repository files navigation

Asteroid : Audio Source Separation on steroids

Build Status codecov

Slack

Asteroid is a Pytorch-based source separation and speech enhancement API that enables fast experimentation on common datasets. It comes with a source code written to support a large range of datasets, architectures, loss functions etc... and a set of recipes to reproduce some important papers.
Asteroid is intended to be a community-based project so hop on and help us !

You use asteroid or you want to?

Please, if you have found a bug, open an issue, if you solved it, open a pull request !
Same goes for new features, tell us what you want or help us building it !
Don't hesitate to join the slack and ask questions / suggest new features there as well !

Table of contents

Installation

In order to install Asteroid, clone the repo and install it using pip or python :

git clone https://github.com/mpariente/asteroid
cd asteroid
# Install with pip in editable mode
pip install -e .
# Or, install with python in dev mode
python setup.py develop

Asteroid is also on PyPI, you can install the latest release with

pip install numpy Cython
pip install asteroid

Highlights

Here is a list of notebooks showing example usage of Asteroid's features.

Running a recipe

Running the recipes requires additional packages in most cases, we recommend running :

# from asteroid/
pip install -r requirements.txt

Then choose the recipe you want to run and run it !

cd egs/wham/ConvTasNet
. ./run.sh

More information in egs/README.md.

Available recipes

Supported datasets

Contributing

We are always looking to expand our coverage of the source separation and speech enhancement research, the following is a list of things we're missing. You want to contribute? This is a great place to start !

Don't forget to read our contributing guidelines.

You can also open an issue or make a PR to add something we missed in this list.

TensorBoard visualization

The default logger is TensorBoard in all the recipes. From the recipe folder, you can run the following to visualize the logs of all your runs. You can also compare different systems on the same dataset by running a similar command from the dataset directiories.

# Launch tensorboard (default port is 6006)
tensorboard --logdir exp/ --port tf_port

If your launching tensorboard remotely, you should open an ssh tunnel

# Open port-forwarding connection. Add -Nf option not to open remote. 
ssh -L local_port:localhost:tf_port user@ip

Then open http://localhost:local_port/. If both ports are the same, you can click on the tensorboard URL given on the remote, it's just more practical.

Guiding principles

  • Modularity. Building blocks are thought and designed to be seamlessly plugged together. Filterbanks, encoders, maskers, decoders and losses are all common building blocks that can be combined in a flexible way to create new systems.
  • Extensibility. Extending Asteroid with new features is simple. Add a new filterbank, separator architecture, dataset or even recipe very easily.
  • Reproducibility. Recipes provide an easy way to reproduce results with data preparation, system design, training and evaluation in a single script. This is an essential tool for the community !

About

The PyTorch-based audio source separation toolkit for researchers

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 87.3%
  • Shell 12.7%