Vocapia Research provides leading edge speech-to-text software and services for many languages #speechprocessing #IA" title="" class="btn" data-container="body" data-html="true" data-id="4821" data-placement="top" data-toggle="popover" data-trigger="focus" style="color:#b3d4fc" tabindex="0" data-original-title="Vocapia Research"> 236 368 368
Activities
Technologies
Entity types
Location
28 Rue Jean Rostand, 91400 Orsay, France
Orsay
France
Employees
Scale: 11-50
Estimated: 9
SIREN
432162063Engaged corporates
2Added in Motherbase
6 years, 6 months agoMaking your audio searchable
Vocapia Research develops speech processing technologies for multilingual, large vocabulary speech recognition (speech-to-text), automatic audio segmentation, language identification and speaker recognition.
The Vocapia Research VoxSigma software suite delivers state of the art performance for broadcast data and conversational speech in multiple languages.
This core technology can serve as the basis for a variety of applications ranging from interactive conversational systems to the automatic indexing of audio data.
For the latter class of applications, large vocabulary continuous speech recognition is the key technology for enabling content-based information access in audio and video documents. Most of the linguistic information is encoded in the audio channel of audiovisual data, which once transcribed can be accessed using text-based tools.
Among the most common applications of our technology are audio and audiovisual data mining (broadcast data, call center data), media monitoring, media asset management, and telephone-based conversational systems.
speech processing technology
Making your audio searchable
Vocapia Research develops speech processing technologies for multilingual, large vocabulary speech recognition (speech-to-text), automatic audio segmentation, language identification and speaker recognition.
The Vocapia Research VoxSigma software suite delivers state of the art performance for broadcast data and conversational speech in multiple languages.
This core technology can serve as the basis for a variety of applications ranging from interactive conversational systems to the automatic indexing of audio data.
For the latter class of applications, large vocabulary continuous speech recognition is the key technology for enabling content-based information access in audio and video documents. Most of the linguistic information is encoded in the audio channel of audiovisual data, which once transcribed can be accessed using text-based tools.
Among the most common applications of our technology are audio and audiovisual data mining (broadcast data, call center data), media monitoring, media asset management, and telephone-based conversational systems.
Vocapia is a provider of speech-to-text software and service for broadcast monitoring, lecture and seminar transcription, video subtitling, conference call transcription and speech analytics.
Happy to announce our updated multi-domain models for speech-to-text transcription for Czech (v4.2), Dutch (v4.0), Portuguese (v4.0) and Romanian (v4.0) now covering a wider variety of spoken variants. Available on our web service and for licensing.
Hello 2025 ✨Sending out our warmest new year's wishes from all of the Vocapia team ! Here's to a year of joy, success and breaking new ground 🚀
We are gliding into the winter season with updates of multidomain models for speech-to-text transcription in 5 languages: Mandarin Chinese (v7.0), Greek (v4.0), Hindi (v2.0), Persian (v3.0) and Turkish (v5.0).
We are proud to be recipient of the 2024 LT-Innovate Award: Best Language Intelligence Use Case Award https://buff.ly/3Z31Pjf at the Language Intelligence conference: Driving Business Value Via Multilingual AI ( https://buff.ly/4i59iHp ) held in Vienna this week. Congratulations Jodie!
We are happy to announce the addition of the Hungarian (v3.0) language to our set of multi-domain models for speech-to-text transcription of various data types, as well as updated models for Arabic (v8.1).