10.1.0
jambonz Commercial 10.1.0
New Features & Improvements
- Agent verb — Major new feature enabling low-latency voice AI agents with support for Amazon Bedrock and Google Gemini as LLM backends, Deepgram and Krisp for STT/turn-taking, and ElevenLabs TTS with spoken-word tracking. Includes mid-conversation async updates via
agent:update, noise cancellation powered by Krisp, and comprehensive metrics and measurement. See the agent verb reference and voice agents guide for details. - Node.js SDK — New unified SDK for building jambonz voice applications in TypeScript/JavaScript, supporting webhook and WebSocket transports, REST API client, TTS streaming, and chainable verb methods. Replaces the older
@jambonz/node-clientand@jambonz/node-client-wspackages. See the Node.js SDK documentation. - Python SDK (experimental) — New Python SDK with the same capabilities as the Node.js SDK: webhook and WebSocket transports, REST client, TTS streaming, inject commands, and spec-driven verb generation with full type hints. See the Python SDK documentation.
- Speech vendor updates — Expanded speech vendor support across the platform:
- Speechmatics Preview STT — New speech-to-text vendor with turn-taking event forwarding and analytics.
- Houndify WebSocket STT — Speech recognition over WebSocket with
audioQueryAbsoluteTimeoutfor controlling recognition timeouts. - AssemblyAI Universal-3 Pro — Support for the universal-3 pro streaming model, defaulting to
u3-rt-prowhen a prompt is provided. - Deepgram Flux language hint — Pass
language_hintto Deepgram Flux STT for improved recognition accuracy. - Google S2S transcription events — Google Speech-to-Speech now emits
llm_eventwith transcription data to the application layer. - ElevenLabs TTS tracking — Track spoken words and TTS timing for ElevenLabs, enabling detailed usage analytics and billing insights.
- TTS time-to-first-byte metrics — Latency metrics across all streaming TTS vendors to measure time to first audio byte.
- Inworld AI models — Support for Inworld AI models.
- Krisp noise isolation — Add support for Krisp-powered noise isolation and cancellation with usage tracking and event generation.
- Google Gemini LLM — Add support for Google Gemini as an LLM provider with credential management in the API server and webapp.
- MCP client hardening — Improved MCP client reliability with configurable timeouts, authentication support, URL hints, automatic reconnection, and graceful connection close.
- Listen verb in conference — Support for nesting a
listenverb inside aconference, enabling real-time audio streaming from conference sessions. - LLM services — New LLM services management in the API server and webapp.
- License expiry alert — The webapp now displays an alert when the system license key is expired or approaching expiration.
- Schema migration — Migrated to the consolidated
@jambonz/schemapackage, deprecating the standalone verb-specs module. - Updated API swagger — API server swagger documentation updated to reflect new endpoints and properties.
Bug Fixes
- Fixed a race condition for outbound calls in the feature-server that could cause call setup failures.
- Fixed
gladiaOptionsbeing hardcoded instead of using user-provided configuration. - Fixed Google Speech-to-Speech not sending transcription events to the application layer.
- Fixed DTMF digits being sent as multiple underscore characters instead of correct tones.
- Fixed TTS engine flush signaling and guarded against Cartesia empty events without proper completion state.
- Fixed ElevenLabs models endpoint failure causing the entire language/model dropdown to break; now gracefully falls back to static data.
- Fixed Deepgram STT language dropdown appearing empty due to model name parsing issue.
- Fixed missing language names for Cartesia Sonic 3 languages in the language map.
- Fixed ElevenLabs STT not properly loading available languages and models.
- Fixed potential crash in webapp TTS voice sorting when voice name is undefined.
- Fixed exception when user provides an invalid value for a play file URL.
- Added exception handling in the
mod_dubFreeSWITCH module to prevent crashes from unhandled errors. - Fixed agent verb integration with Deepgram STT and Krisp/LLM-based turn taking.
- Fixed internal task validation that was incorrectly rejecting valid internal tasks.
- Removed unused IBM speech integration from the webapp.
SQL Changes
Contact your account manager or email support@jambonz.org for the complete SQL migration script for this release.
Questions? Contact us at support@jambonz.org