Support mixed-language batches in `WhisperGenerationMixin` #29685

cifkao · 2024-03-15T19:01:41Z

Feature request

It is currently not possible to mix multiple languages in a single batch when running Whisper. The language argument only accepts a single string (as opposed to a separate language for each batch item), and if no language is passed and multiple languages are detected, transcription will fail.

I propose to enable passing a list of languages (language: Optional[Union[str, List[str]]]) in a batched transcription situation, as well as removing the restriction related to language detection.

Motivation

Not being able to transcribe multiple languages in a single batch is clearly a limitation, especially when relying on auto-detection, but also in scenarios where the language is known.

The error message states that It is currently not supported to transcribe to different languages in a single batch., implying that it could be supported at some point.

Your contribution

I have implemented this and I'm planning to submit a PR.

The text was updated successfully, but these errors were encountered:

amyeroberts · 2024-03-15T20:43:25Z

cc @ylacombe @sanchit-gandhi

amyeroberts added Feature request Request for a new feature Audio labels Mar 15, 2024

cifkao mentioned this issue Mar 16, 2024

Support mixed-language batches in WhisperGenerationMixin #29688

Merged

5 tasks

ArthurZucker closed this as completed in #29688 May 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support mixed-language batches in `WhisperGenerationMixin` #29685

Support mixed-language batches in `WhisperGenerationMixin` #29685

cifkao commented Mar 15, 2024 •

edited

Loading

amyeroberts commented Mar 15, 2024

Support mixed-language batches in WhisperGenerationMixin #29685

Support mixed-language batches in WhisperGenerationMixin #29685

Comments

cifkao commented Mar 15, 2024 • edited Loading

Feature request

Motivation

Your contribution

amyeroberts commented Mar 15, 2024

Support mixed-language batches in `WhisperGenerationMixin` #29685

Support mixed-language batches in `WhisperGenerationMixin` #29685

cifkao commented Mar 15, 2024 •

edited

Loading