CHANGELOG
Product improvements
Check out the AssemblyAI changelog to see weekly accuracy and product improvements our team has been working on.
Powering incredible companies
- Our Asynchronous Speech Recognition model is now even better with the release of v8.5.
- This update improves overall accuracy by 4% relative to our v8 model.
- This is achieved by improving the model’s ability to handle noisy or difficult-to-decipher audio.
- The v8.5 model also improves Inverse Text Normalization for numbers.
- Launched the new AssemblyAI Docs, with more complete documentation and an easy-to-navigate interface so developers can effectively use and integrate with our API. Click here to view the new and improved documentation.
- Added two new fields to the
FinalTranscript
response for Real-time Transcriptions. The punctuated
key is a Boolean value indicating if punctuation was successful. The text_formatted
key is a Boolean value indicating if Inverse Text Normalization (ITN) was successful.
- Launched the new AssemblyAI Docs, with more complete documentation and an easy-to-navigate interface so developers can effectively use and integrate with our API. Click here to view the new and improved documentation.
- Added two new fields to the
FinalTranscript
response for Real-time Transcriptions. The punctuated
key is a Boolean value indicating if punctuation was successful. The text_formatted
key is a Boolean value indicating if Inverse Text Normalization (ITN) was successful.
- Inverse Text Normalization (ITN) added for our
/v2/realtime
and /v2/stream
endpoints. ITN improves formatting of entities like numbers, dates, and proper nouns in the transcription text.
- Improved accuracy for Custom Vocabulary (aka, Word Boosts) with the Real-Time transcription API.
- Fixed an edge case that would sometimes cause transcription errors when
disfluencies
was set to true
and no words were identified in the audio file.
- Inverse Text Normalization (ITN) added for our
/v2/realtime
and /v2/stream
endpoints. ITN improves formatting of entities like numbers, dates, and proper nouns in the transcription text.
- Improved accuracy for Custom Vocabulary (aka, Word Boosts) with the Real-Time transcription API.
- Fixed an edge case that would sometimes cause transcription errors when
disfluencies
was set to true
and no words were identified in the audio file.