[cli/paraformer] ali-paraformer inference #2067
decode info: batch_size=100, beam_size=10
decode info: batch_size=1, beam_size=10
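Are the assets required? Could they be bundled into the model?
The CMVN stats are already stored in the model. However, the FunASR conf and the WeNet conf use different formats, so exporting the model still needs the files in assets, which are currently generated by my conversion script. Once the FunASR conf can be converted to the WeNet format automatically, the assets can be removed.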
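Is there a plan to support streaming paraformer?
Not for now. The streaming paraformer scores somewhat worse on the metrics and is not the ideal streaming model either. If you are interested, please follow the progress of wenetspeech2.0.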
def transcribe(self, audio_file: str, tokens_info: bool = False) -> dict:
    waveform, sample_rate = torchaudio.load(audio_file, normalize=False)
    waveform = waveform.to(torch.float)
    feats = kaldi.fbank(waveform,
The default window in the FunASR frontend is hamming. You can find more details here. However, the default window in kaldi.fbank is povey, as specified here. This difference in window type may cause a slight feature mismatch. As mentioned in line 44 of this document:
"povey" is a window I made to be similar to Hamming but to go to zero at the edges, it's pow((0.5 - 0.5*cos(n/N*2*pi)), 0.85) I just don't think the Hamming window makes sense as a windowing function.
PR welcome.
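To make the window mismatch raised in the review comment concrete, here is a minimal sketch in plain Python that evaluates both windows and compares them. It assumes the symmetric form with a denominator of N - 1 (the quoted comment writes n/N) and a frame length of 400 samples, i.e. 25 ms at 16 kHz. The helper names `hamming_window` and `povey_window` are hypothetical and are not part of this PR, FunASR, or torchaudio.

```python
import math


def hamming_window(size: int) -> list:
    """Hamming window: 0.54 - 0.46 * cos(2*pi*i / (size - 1))."""
    a = 2.0 * math.pi / (size - 1)
    return [0.54 - 0.46 * math.cos(a * i) for i in range(size)]


def povey_window(size: int) -> list:
    """Povey window: (0.5 - 0.5 * cos(2*pi*i / (size - 1))) ** 0.85."""
    a = 2.0 * math.pi / (size - 1)
    return [(0.5 - 0.5 * math.cos(a * i)) ** 0.85 for i in range(size)]


# Assumed frame size: 25 ms at 16 kHz.
FRAME_LENGTH = 400

hamming = hamming_window(FRAME_LENGTH)
povey = povey_window(FRAME_LENGTH)

# Largest per-sample difference between the two windows.
max_diff = max(abs(h - p) for h, p in zip(hamming, povey))

print(f"hamming edge value: {hamming[0]:.4f}")
print(f"povey edge value:   {povey[0]:.4f}")
print(f"max abs difference: {max_diff:.4f}")
```

The two windows differ most at the frame edges, where povey goes to zero and hamming does not. `torchaudio.compliance.kaldi.fbank` accepts a `window_type` argument, so passing `window_type='hamming'` may be one way to bring the features closer to the FunASR frontend. Whether the remaining frontend settings also match is not verified here.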
TODO: (in this pr)
TODO: (future pr)
NOTE: streaming paraformer not in current plan