feat: add MiniMax Cloud TTS as a new dubbing provider#534
Open
octo-patch wants to merge 1 commit intoHuanshere:mainfrom
Open
feat: add MiniMax Cloud TTS as a new dubbing provider#534octo-patch wants to merge 1 commit intoHuanshere:mainfrom
octo-patch wants to merge 1 commit intoHuanshere:mainfrom
Conversation
Add MiniMax Cloud TTS (speech-2.8-hd / speech-2.8-turbo) as a new TTS provider for video dubbing, with 12 built-in voices, Streamlit sidebar integration, full i18n support (7 locales), and unit + integration tests. - New provider: core/tts_backend/minimax_tts.py - Config: minimax_tts section in config.yaml - UI: voice/model selectors in sidebar settings - Docs: TTS comparison tables updated (EN + ZH) - Tests: 8 unit tests + 3 integration tests Co-Authored-By: Octopus <liyuan851277048@icloud.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Add MiniMax Cloud TTS as a new dubbing provider for VideoLingo, complementing the existing TTS options (Azure, OpenAI, Fish, Edge, etc.).
What's included
core/tts_backend/minimax_tts.py): Supportsspeech-2.8-hd(recommended, HD quality) andspeech-2.8-turbo(faster) models via the MiniMax T2A v2 APIminimax_ttssection added toconfig.yamlwith sensible defaultsHow it works
MiniMax TTS uses a simple REST API (OpenAI-compatible auth pattern). The provider:
https://api.minimax.io/v1/t2a_v2Users just need a MiniMax API key from minimax.io — MiniMax is already listed as a recommended LLM provider in VideoLingo's docs.
Files changed (15 files, ~500 additions)
core/tts_backend/minimax_tts.pycore/tts_backend/tts_main.pycore/st_utils/sidebar_setting.pyconfig.yamldocs/pages/docs/start.en-US.mddocs/pages/docs/start.zh-CN.mdtranslations/*.json(7 files)tests/test_minimax_tts.pyTest plan