Skip to content

Commit

Permalink
Updating README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
sunits committed May 31, 2020
1 parent 83b928e commit 855c43c
Showing 1 changed file with 18 additions and 5 deletions.
23 changes: 18 additions & 5 deletions egs/kinect-wsj/README.md
Original file line number Diff line number Diff line change
@@ -1,9 +1,22 @@
### About the Kinect-WSJ dataset
Kinect-WSJ is a reverberated, noisy version of the WSJ0-2MIX dataset. Microphones are placed on a linear array with spacing between the devices resembling that of Microsoft Kinect ™, the device used to record the CHiME-5 dataset. This was done so that we could use the real ambient noise captured as part of CHiME-5 dataset. The room impulse responses (RIR) were simulated for a sampling rate of 16,000 Hz.

The Kinect-WSJ dataset is a simulated multi-channel,
reverberated version of [wsj0-2mix](../wsj0-mix) with
noise samples extracted from non-speech regions of CHiMe5.
## Path to the dataset
https://github.com/sunits/Reverberated_WSJ_2MIX/

See the [official repo](https://github.com/sunits/Reverberated_WSJ_2MIX) for more details.
# Requirements to create Kinect-WSJ dataset
* wsj_path : Path to precomputed wsj-2mix dataset. Should contain the folder 2speakers/wav16k/. If you don't have wsj_mix dataset, please create it using the scripts in egs/wsj0_mix
* chime_path : Path to chime-5 dataset. Should contain the folders train, dev and eval
* dihard_path : Path to dihard labels. Should contain ```*.lab``` files for the train and dev set

Citation to come.
# References

```
@inproceedings{sivasankaran2020,
booktitle = {2020 28th {{European Signal Processing Conference}} ({{EUSIPCO}})},
title={Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition},
author={Sunit Sivasankaran and Emmanuel Vincent and Dominique Fohr},
year={2021},
month = Jan,
}
```

0 comments on commit 855c43c

Please sign in to comment.