Merge pull request asteroid-team#133 from mpariente/kinect_bis
[src & egs] Add Kinect-WSJ licenses + small fixes
Showing 4 changed files with 59 additions and 9 deletions.
@@ -0,0 +1,22 @@
### About the Kinect-WSJ dataset
Kinect-WSJ is a reverberated, noisy version of the WSJ0-2MIX dataset. The microphones are placed on a linear array whose spacing resembles that of the Microsoft Kinect™, the device used to record the CHiME-5 dataset. This makes it possible to use the real ambient noise captured as part of the CHiME-5 dataset. The room impulse responses (RIRs) were simulated at a sampling rate of 16,000 Hz.

### Path to the dataset
https://github.com/sunits/Reverberated_WSJ_2MIX/

### Requirements to create the Kinect-WSJ dataset
* wsj_path : Path to the precomputed WSJ0-2mix dataset. It should contain the folder 2speakers/wav16k/. If you don't have the WSJ0-2mix dataset, please create it using the scripts in egs/wsj0_mix
* chime_path : Path to the CHiME-5 dataset. It should contain the folders train, dev and eval
* dihard_path : Path to the DIHARD labels. It should contain `*.lab` files for the train and dev sets
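
The three path requirements above can be checked before launching the data generation. The following is only a minimal sketch: `check_kinect_wsj_inputs` is a hypothetical helper, not part of the recipe, and it assumes that the `*.lab` files may sit anywhere below `dihard_path`, since the exact layout is not stated here.

```python
from pathlib import Path


def check_kinect_wsj_inputs(wsj_path, chime_path, dihard_path):
    """Hypothetical helper: return a list of problems (empty if the layout looks right)."""
    problems = []

    # wsj_path should contain the 16 kHz two-speaker mixtures.
    wav_dir = Path(wsj_path) / "2speakers" / "wav16k"
    if not wav_dir.is_dir():
        problems.append(f"missing folder: {wav_dir}")

    # chime_path should contain the train, dev and eval folders.
    for split in ("train", "dev", "eval"):
        split_dir = Path(chime_path) / split
        if not split_dir.is_dir():
            problems.append(f"missing folder: {split_dir}")

    # dihard_path should hold *.lab files. Assumption: they may be nested,
    # so the search is recursive.
    dihard_dir = Path(dihard_path)
    if not dihard_dir.is_dir():
        problems.append(f"missing folder: {dihard_dir}")
    elif not any(dihard_dir.rglob("*.lab")):
        problems.append(f"no *.lab files found under: {dihard_dir}")

    return problems
```

Calling the helper with the three paths and printing the returned list gives a quick report of what is missing; an empty list means all three locations match the description above.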

### References

```
@inproceedings{sivasankaran2020,
  booktitle = {2020 28th {{European Signal Processing Conference}} ({{EUSIPCO}})},
  title = {Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition},
  author = {Sunit Sivasankaran and Emmanuel Vincent and Dominique Fohr},
  year = {2021},
  month = jan,
}
```