Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add implementations of USES2 speech enhancement models #5761

Open
wants to merge 11 commits into
base: master
Choose a base branch
from

Conversation

Emrys365
Copy link
Collaborator

What?

This PR adds the implementations of the USES2-Swin and USES2-Comp speech enhancement models proposed in the ICASSP 2024 paper "Improving Design of Input Condition Invariant Speech Enhancement".

Why?

This is an upgraded version of the previously added USES model (#5482).

See also

https://arxiv.org/abs/2401.14271

@Emrys365 Emrys365 added Recipe ESPnet2 SE Speech enhancement labels Apr 24, 2024
@mergify mergify bot added the Installation label Apr 24, 2024
@sw005320 sw005320 added this to the v.202405 milestone Apr 24, 2024
@mergify mergify bot added the README label Jun 9, 2024
@sw005320
Copy link
Contributor

I understand that you tried to separate this PR and the recipe PR #5810
If you fix the CI error, please let me know.

@LiChenda, can you also review this PR?

@LiChenda
Copy link
Contributor

I understand that you tried to separate this PR and the recipe PR #5810 If you fix the CI error, please let me know.

@LiChenda, can you also review this PR?

Yes!

Copy link
Contributor

@LiChenda LiChenda left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Look good to me !

@Emrys365
Copy link
Collaborator Author

I can't understand how the modified scripts can cause an error when installing espnet in the CI tests...

Copy link

codecov bot commented Jul 14, 2024

Codecov Report

Attention: Patch coverage is 17.79661% with 388 lines in your changes missing coverage. Please review.

Project coverage is 27.87%. Comparing base (1a0c358) to head (bfd3000).
Report is 941 commits behind head on master.

Files with missing lines Patch % Lines
espnet2/enh/layers/swin_transformer.py 16.02% 152 Missing ⚠️
espnet2/enh/layers/uses2_comp.py 17.07% 102 Missing ⚠️
espnet2/enh/layers/uses2_swin.py 21.27% 74 Missing ⚠️
espnet2/enh/separator/uses2_separator.py 17.80% 60 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (1a0c358) and HEAD (bfd3000). Click for more details.

HEAD has 8 uploads less than BASE
Flag BASE (1a0c358) HEAD (bfd3000)
test_utils 2 0
test_python_espnet1 2 0
test_integration_espnetez 5 1
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #5761      +/-   ##
==========================================
- Coverage   34.39%   27.87%   -6.52%     
==========================================
  Files         780      551     -229     
  Lines       71754    47203   -24551     
==========================================
- Hits        24677    13160   -11517     
+ Misses      47077    34043   -13034     
Flag Coverage Δ
test_integration_espnetez 27.87% <17.79%> (-0.32%) ⬇️
test_python_espnet1 ?
test_utils ?

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@Fhrozen Fhrozen modified the milestones: v.202409, v.202412 Oct 1, 2024
@Fhrozen Fhrozen modified the milestones: v.202412, v.202503 Dec 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants