Skip to content

feat: add u3-rt-pro model support to assemblyai_asr_python#2081

Open
dan-ince-aai wants to merge 1 commit intoTEN-framework:mainfrom
dan-ince-aai:feat/assemblyai-u3-rt-pro-support
Open

feat: add u3-rt-pro model support to assemblyai_asr_python#2081
dan-ince-aai wants to merge 1 commit intoTEN-framework:mainfrom
dan-ince-aai:feat/assemblyai-u3-rt-pro-support

Conversation

@dan-ince-aai
Copy link

Summary

  • Adds speech_model configuration field supporting universal-streaming-english, universal-streaming-multilingual, and u3-rt-pro; defaults to u3-rt-pro
  • Adds new optional fields for u3-rt-pro: language_detection, prompt, vad_threshold, speaker_labels, max_speakers
  • Adds min_turn_silence to replace the deprecated min_end_of_turn_silence_when_confident (old param still accepted as fallback)
  • All optional params are omitted from the WebSocket URL when unset, deferring to AssemblyAI API defaults
  • language_detection is automatically enabled for u3-rt-pro and multilingual models
  • prompt param is guarded to u3-rt-pro only (logs a warning if set on other models)
  • Fixes WebSocket control message type casing: forceEndpointForceEndpoint, updateConfigurationUpdateConfiguration
  • Adds User-Agent: AssemblyAI/1.0 (integration=TEN-Framework) header for integration tracking
  • Updates manifest.json and property.json with new fields

Test plan

  • Connect with speech_model: u3-rt-pro and verify WebSocket URL includes speech_model=u3-rt-pro and language_detection=true
  • Connect with speech_model: universal-streaming-english and verify language_detection is not sent
  • Set prompt with a non-u3-rt-pro model and confirm it is ignored with a warning log
  • Set min_turn_silence and confirm it appears in the URL; set neither silence param and confirm neither appears
  • Verify ForceEndpoint and UpdateConfiguration messages are accepted by the API without error
  • Confirm existing test_error_check integration test still passes

🤖 Generated with Claude Code

- Add `speech_model` config field (universal-streaming-english,
  universal-streaming-multilingual, u3-rt-pro); defaults to u3-rt-pro
- Add new fields: `min_turn_silence`, `language_detection`, `prompt`,
  `vad_threshold`, `speaker_labels`, `max_speakers`
- All optional params omit from WS URL when unset, letting API defaults apply
- language_detection auto-enabled for u3-rt-pro and multilingual models
- prompt param guarded to u3-rt-pro only (logs warning otherwise)
- Deprecate `min_end_of_turn_silence_when_confident` in favour of
  `min_turn_silence`; old param still accepted as fallback
- Fix WebSocket message type casing: ForceEndpoint, UpdateConfiguration
- Add User-Agent header for TEN-Framework integration tracking
- Update manifest.json and property.json with new fields

Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant