Mozart's Touch: Multi-Modal Music Generation with Pre-Trained Models

This is the official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models

Package Description

This repository is structured as follows:

Diancai-Backend
├─MozartsTouch/: source code for the implementation of Mozart's Touch
│  ├─model/: pre-trained models
│  ├─static/: static source for test purpose
│  ├─utils/: source code for the modules
│  ├─download_model.py: download pre-trained model to ./model/
│  ├─config.yaml: configurations such as LLM model URLs, API keys
│  └─main.py: Main program of Mozart's Touch
│ outputs/: directory to store generation result music
├─backend_app.py: program for backend web application of Mozart's Touch
└─start_server.py: start the backend server of Mozart's Touch

Setup

Before running, please configure config.yaml.
Install dependencies using pip install -r requirements.txt.
Run download_model.py to download model parameters needed.
Use MozartsTouch.img_to_music_generate() to generate music.

To test codes without importing large models, set TEST_MODE to True in config.yaml.

Usage

Running as a Command Line Tool

With the setup complete, you can now run the following command to generate music:

python main.py

or debug with no model imported:

python main.py --test_mode

Running as a Web Backend Server

Install dependencies using pip install -r requirements_for_server.txt.
Configure port number and other parameters instart_server.py.
Run python start_server.py.
Access http://localhost:3001/docs#/ to view the backend documentation and test the APIs.

The related frontend project is at https://github.com/ScientificW/MozartFrontEndConnect

TO-DO List

~~增加用户输入提示词功能~~
删除API中的mode
~~更新到最新的代码，将 Video-BLIP2 整合到我们的项目中。~~
将评估代码整合进来
~~Use argparse to set and pass config~~
~~MusicGen部分重构策略模式~~
~~Use API instead of loading models manually~~
~~Add support for other models as an alternative e.g. LLaMa.~~

远期任务

尝试Florence-2等最新模型
优化音乐生成部分MusicGen模型的代码（主要需求：优化生成效率）

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
MozartsTouch		MozartsTouch
.gitignore		.gitignore
LICENSE		LICENSE
MUImageEvalInstructions.json		MUImageEvalInstructions.json
README.md		README.md
architecture.png		architecture.png
backend_app.py		backend_app.py
logo.png		logo.png
requirements.txt		requirements.txt
requirements_for_server.txt		requirements_for_server.txt
start_server.py		start_server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mozart's Touch: Multi-Modal Music Generation with Pre-Trained Models

Package Description

Setup

Usage

Running as a Command Line Tool

Running as a Web Backend Server

TO-DO List

远期任务

About

Releases

Packages

Contributors 6

Languages

License

TiffanyBlews/MozartsTouch

Folders and files

Latest commit

History

Repository files navigation

Mozart's Touch: Multi-Modal Music Generation with Pre-Trained Models

Package Description

Setup

Usage

Running as a Command Line Tool

Running as a Web Backend Server

TO-DO List

远期任务

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 6

Languages

Packages