Changelog

June 24, 2025

0.9.4-4

Point release

New Features

Adds support for the Cartesia Ink-Whisper speech-to-text model. You can now use Cartesia for both TTS and STT.
Adds support for creating an Agent Call on Ultravox. To enable this feature, set the agent_id property in the Ultravox llm verb as described here. This property is optional; if it is not set, the Create Call API is used instead (i.e. legacy behavior).
The say verb now supports a TTS streaming mode where you can supply the full prompt at once in the text property.
Adds additional support for Italian voicemail detection based on common operator messages.
When registering with an outbound SIP trunk, use the account-level SIP realm in the Contact header if provided.
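
As a rough illustration of the Ultravox Agent Call option above, the sketch below builds an llm verb payload with and without agent_id. The placement of agent_id at the top level of the verb, the helper name, and the other keys (vendor, auth, llmOptions) are assumptions made for illustration; consult the verb reference for the exact shape.

```python
import json


def ultravox_llm_verb(api_key, agent_id=None):
    """Build an illustrative llm verb payload for Ultravox.

    Assumption: agent_id is a top-level property of the verb. The other
    keys are placeholders and are not taken from this changelog.
    """
    verb = {
        "verb": "llm",
        "vendor": "ultravox",
        "auth": {"apiKey": api_key},
        "llmOptions": {},
    }
    # agent_id is optional: when omitted, the legacy Create Call behavior applies.
    if agent_id is not None:
        verb["agent_id"] = agent_id
    return verb


if __name__ == "__main__":
    # Hypothetical credentials and agent identifier.
    print(json.dumps([ultravox_llm_verb("my-api-key", "my-agent-id")], indent=2))
```

Leaving agent_id out of the payload keeps the previous behavior, so existing applications should not need to change.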

Bug fixes

Fixed an error that occurred when the app did not specify a speech synthesis voice. Issue
Fixed an unhandled exception. Issue
Removed unnecessary logging. PR
Fixed a parsing issue caused by embedded URLs in the createCall REST call. PR
Fixed an unhandled exception in the dial verb. PR
Fixed an issue where, in certain dial scenarios, the A leg could be left connected after a successful REFER on the B leg. PR
Removed video SDP when making an outbound call. PR
Fixed an issue with the dub verb where loop: false caused the audio to incorrectly loop. PR
Fixed potential looping behavior in the background sticky bargeIn task. PR
Fixed a snyk warning in drachtio-fsmrf. PR
Routed logs for jambonz-api-server to the correct log file. PR
Fixed an issue where creating a new application in the webapp did not save a TTS voice by default. PR
Fixed an issue when wildcards or regexes are used in the phone number for multiple carriers. PR

SQL changes

None.

Availability

Available now on jambonz.cloud
Available now with devops scripts for subscription customers

Questions? Contact us at support@jambonz.org

May 23, 2025

0.9.4

Major release

New Features

Adds support for the Google Gemini speech-to-speech LLM. See example application here. Supported speech-to-speech LLMs now include Gemini, Ultravox, OpenAI, Deepgram, and ElevenLabs.
Added MCP client support to the llm verb. You can now specify an array of one or more MCP servers in the mcpServers property of the llm verb; jambonz queries those servers and automatically creates tools for the LLM to call, based on the tools each server exposes. For an example, see the Google Gemini sample app.
Added support for application environment variables, which are special configuration variables that can be set in the jambonz portal for an application to customize the application behavior. This enables hosting of a single application that can then be customized for different customers without having to modify source code.
Added support for the Deepgram Aura-2 TTS model and voices.
Added support for the Rime Arcana model.
Added support for PlayHT on-prem deployments.
Added support for using an outbound SIP proxy when registering.
Added support for providing instructions to Whisper TTS.
Added a new voice for NVIDIA TTS.
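
To make the mcpServers property above more concrete, here is a minimal sketch of an llm verb that lists MCP servers. Only the mcpServers property name comes from these notes; the per-server object shape (a url key), the helper name, and the vendor and model values are assumptions for illustration.

```python
import json


def llm_verb_with_mcp(vendor, model, mcp_urls):
    """Build an illustrative llm verb that references one or more MCP servers.

    Assumption: each mcpServers entry is an object with a "url" key.
    """
    if not mcp_urls:
        raise ValueError("mcpServers needs at least one server")
    return {
        "verb": "llm",
        "vendor": vendor,
        "model": model,
        "mcpServers": [{"url": url} for url in mcp_urls],
    }


if __name__ == "__main__":
    # Hypothetical model name and MCP server URL.
    verb = llm_verb_with_mcp("google", "example-model", ["https://mcp.example.com/sse"])
    print(json.dumps([verb], indent=2))
```

With a payload along these lines, the tool definitions would come from the MCP servers rather than being listed by hand in the application.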

Bug fixes

Various stability fixes, including for issues that caused intermittent FreeSWITCH crashes.
Fixed an issue where a gather using Deepgram could not time out on an empty transcription with continueAsr. PR
Fixed an issue where the say verb could not fail over if tts_response-code != 2xx. PR
Fixed a Microsoft STT max client buffer size error in the transcribe verb. PR
sip_decline now releases the callSession if a ws requestor is used. PR
Send stop-playback event. PR
Fixed an issue where the TTS streaming buffer could not reset its timeout when lastUpdateTime is short. PR
Fixed issue with Deepgram STT not returning transcript when last_word_end is -1. PR
Fixed an issue muting a member in a conference. PR
Fixed an API server crash when an admin queries voip-carrier. PR
Fixed issue where we incorrectly saved an obscured API credential for recording, leading to failures authenticating. PR
Fixed an issue where updateCall responding with 202 caused an error. PR
Fixed an issue in the portal where the wrong recording bucket region was displayed. PR

SQL changes

ALTER TABLE applications ADD COLUMN env_vars TEXT

Availability

Available now on jambonz.cloud
Available now with devops scripts for subscription customers

Questions? Contact us at support@jambonz.org

April 12, 2025

0.9.3-12

Point release

ElevenLabs Conversational AI bug fixes, read-only portal users, and stability improvements

Fixes an issue where the initial client configuration message for ElevenLabs Conversational AI was improperly formatted.

PR
Adds support for speed and pronunciation_dictionary_locators for ElevenLabs TTS.

PR
Addresses memory allocation issue in freeswitch modules that could lead to intermittent crashes. (Fixed in freeswitch-modules@2.2.26).
Adds support for throttling and disabling outbound registrations. Also adds support for disabling outbound REGISTERs or NOTIFYs based on specific failure codes returned from the far-end trunk.

PR, PR
Fixes issue where confirm hook on a dial verb was not working over a websocket connection.

PR
Adds support for creating portal users with readonly access.

PR
Disables password managers (e.g. LastPass) on some forms where they were incorrectly auto-filling data, leading to confusion over why the form was not submitting.

PR
Fixes issue with failing re-INVITE due to unsupported codec.

PR
Allows hangup verb to be used in a siprec call.

PR
Fixes a scenario where, with two config verbs in which the first has hints and the second does not, the transcribe verb generated a runtime error.

PR
Rejects portal logins with a better error message if a user who signed up using OAuth tries to sign in using email and password.

PR
Allow a readonly portal user to change their password.

PR

March 30, 2025

0.9.3-10

Point release

Add support for OpenAI Streaming STT and other improvements

Adds support for OpenAI speech-to-text. Please see related options here and review this article for a discussion of how to use the OpenAI STT prompt feature.

PR, PR, and PR.
Support Cartesia sonic-2 and sonic-turbo models.

PR
Fixes issue with use of streaming say in gather verb.

PR
Improves support for passing WebRTC video calls.

PR
Fixes issue when using language detection feature with Deepgram.

PR
Fixes an issue showing incorrect speech synthesizer in applications view in the portal.

PR
Writes the OPTIONS ping failure alert once instead of repeatedly.

PR
Fixes an issue where lengthy LLM prompts for Ultravox, ElevenLabs, and Deepgram were being truncated.

March 12, 2025

0.9.3-9

Point release

Additional log visibility, improvements to AMD, and more

Adds log viewer to jambonz portal (AWS only) to enable easier troubleshooting of calls.

PR, Issue
Improves answering machine detection by listening for strings of digits in addition to other heuristics.

PR
Adds support for username and password authentication to Redis.

PR
Fixes a crash in some media timeout scenarios.

PR
Adds support for pausing transcriptions on Listen and Transcribe verbs.

PR
When a session uses live call control and a session:adulting message is sent to the application, customer data is now included.

PR
Fixes an issue so that when a call is ended via the live call control API, the call_terminated_by field is now ‘jambonz’.

PR
Filters the carrier list by account when creating a new phone number.

PR
Usability improvements when configuring a websocket-based application URL in the jambonz portal.

PR
Allows the Recent Calls API to return more than 25 calls at a time.

PR
Smooth outbound SIP registrations to avoid spikes.

PR

March 1, 2025

0.9.3-8

Point release

Audio Improvements with Bidirectional Streams, Ultravox Enhancements, AWS Autoscaling fixes and more

Allows the url property in a listen verb to be a relative URL when used in a websocket application. This allows developers to create a single websocket app that handles both jambonz commands and bidirectional audio streams.

See this realtime translation example that uses OpenAI and bidirectional audio streams, where the url property is a relative URL and the app handles both jambonz commands and the audio stream.

PR, Issue
Fixes an intermittent issue with crackling noise on bidirectional audio streams.
When an application redirects to a new absolute URL, update the base requestor so that future relative URLs are resolved relative to the new URL.

PR, Issue
Fixes an issue where the final transcript in a conversation initiated with the dial verb was sometimes not collected if the caller hung up quickly after their final utterance.

PR, Issue
Adds support for sending an input_text_message to Ultravox.ai during a speech-to-speech session. This enables the application to dynamically direct the conversation through means other than the caller’s voice.

PR
Fixes an issue with intermittent failure to clean up media server resources after a call completes.

PR Issue
Webapp no longer shows Messaging webhook as SMPP is a deprecated feature for the time being (lack of customer demand).

PR, Issue
Fixes database upgrade script which had previously misnamed a column.

PR Issue
Fixes an issue with AWS autoscaling where incorrect SNS topic name was used, leading to unnecessarily long scale-in durations.

PR
When sending a REFER over sips, the Contact header should also use the sips scheme.

PR
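
The relative URL behavior described in this release (a listen verb with a relative url, and the base being updated after a redirect to an absolute URL) follows ordinary URL resolution. The sketch below is a toy model of that idea, not the jambonz implementation; the class name, method names, and example hosts are hypothetical.

```python
from urllib.parse import urljoin


class BaseRequestor:
    """Toy model: resolve relative URLs against the app's current base URL."""

    def __init__(self, base_url):
        self.base_url = base_url

    def resolve(self, url):
        # Standard resolution of a (possibly relative) URL against the base.
        return urljoin(self.base_url, url)

    def redirect(self, url):
        resolved = self.resolve(url)
        # Redirecting to an absolute URL makes it the new base for later lookups.
        if "://" in url:
            self.base_url = resolved
        return resolved


if __name__ == "__main__":
    requestor = BaseRequestor("wss://app.example.com/jambonz")
    listen = {"verb": "listen", "url": "/audio-stream"}
    print(requestor.resolve(listen["url"]))

    requestor.redirect("wss://other.example.com/v2/app")
    print(requestor.resolve(listen["url"]))
```

In this model a single websocket server can serve both the command channel and the audio stream, since the listen url resolves to the same host as the application.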

February 24, 2025

0.9.3-7

Point release

Conferencing Enhancements and Minor Fixes

Adds support for receiving sip requests during a conference call.

PR, Issue
Sends a new error message over the websocket to the application when an incoming request from the application is not valid.

PR Issue
Fixes a typo in the variable name used to store the AWS SNS topic ARN (only relevant for AWS deployments).

PR

February 20, 2025

0.9.3-6

Point release

Improve Ultravox Integration

Adds support for sending the Ultravox call identifier to the jambonz app so that it can be used for tracking and troubleshooting purposes.

PR
Update to drachtio-srf 5.0.2

February 18, 2025

0.9.3-5

Point release

Important STT Improvements for Deepgram and Speechmatics, Support for Outbound SIP Proxy and more

Adds support for carriers that require us to send them calls through an outbound SIP proxy.

PR, PR PR, Issue
Reject call attempts on hosted jambonz systems where the account has no active subscription.

PR
Improve Deepgram integration by ignoring UtteranceEnd event from Deepgram when we have unprocessed words; in this scenario it is better to continue to wait for the unprocessed words to become finalized.

PR Issue
Fixes an issue where an exception was thrown when a new application URL is provided during a call and the request to that URL fails.

PR
Improves error handling for tts:tokens requests that stream text tokens from an LLM.

PR
Fixes timeout issues when working with Speechmatics STT.

PR
Adds a response time metric when using tts streaming.

PR
Fixes an issue where the music could play endlessly when using the dialMusic property in a dial verb.

PR, Issue
Allows the Deepgram nodelay property to be explicitly set.

PR, Issue
Enhances the createCall REST API to allow the caller to specify a SIP proxy to send the INVITE through.

PR
Fixes an issue where an incoming REGISTER with an invalid SIP URI in the From or To header caused an exception.

PR, Issue
Supports recording an incoming SIPREC call using the jambonz recording feature.

PR

February 7, 2025

0.9.3-3

Point release

New Deepgram And Rime Labs Features

Adds support for the Deepgram keyterms feature.

PR
Adds support for the new Rime Labs mistv2 model and additional voices.

PR PR
Fixes an issue where the application's default language setting was not used when the recognizer property was specified without a language.

PR