We are pleased to announce that Gladia’s Speech-to-Text API technology has been integrated into Eden AI API.
What is Gladia?
Jean-Louis Quéguiner founded Gladia in Paris, France in 2022 to democratize cutting-edge AI tools for developers. The company expanded its focus to address the underutilization of up to 90% of entéerprise audio data.
Gladia’s platform provides various AI tools, including translation, sentiment analysis, voice synthesis (speech-to-text), voice recognition (text-to-speech), offensive content detection, background removal, and face detection. These capabilities empower software developers to find suitable AI API models for their specific requirements.
Currently collaborating with call centers, virtual meeting services, and video publishers like Claap, Livestorm, and Selectra, Gladia raised $4 million in seed funding led by New Wave. Their future goals include summarizing audio content, categorizing topics, creating automatic chapters, and conducting sentiment analysis.
Why do we offer Gladia in addition to other ASR APIs?
Eden AI offers Gladia Speech-to-Text solutions on its platform amongst several other Automatic Speech Recognition technologies. We want our users to have access to multiple AI engines and manage them in one place so they can reach high performance, optimize cost and cover all their needs.
Multiple AIs in one API — Eden AI
There are many reasons for using multiple AI APIs :
Fallback provider is the ABCs.
You need to set up an AI API that is requested if and only if the main AI API does not perform well (or is down). You can use the confidence score returned or other methods to check provider accuracy.
Performance optimization.
After the testing phase, you will be able to build a mapping of AI vendors’ performance that depends on the criteria that you chose. Each data that you need to process will be then sent to the best API.
Cost — Performance ratio optimization.
This method allows you to choose the cheapest provider that performs well for your data. Let’s imagine that you choose Google Cloud API for customer “A” because they all perform well and this is the cheapest. You will then choose Microsoft Azure for customer “B”, a more expensive API but Google performances are not satisfying for customer “B”. (this is a random example)
Combine multiple AI APIs.
This approach is required if you look for extremely high accuracy. The combination leads to higher costs but allows your AI service to be safe and accurate because AI APIs will validate and invalidate each other for each piece of data.
Interview with Gladia’s Head of Marketing
We had the chance to talk to Anna Jelezovskaia, Gladia’s head of Marketing, who agreed to answer some of our questions:
Please introduce your company and provide a brief history. When were you created, and what inspired you to start your company?
Founded in 2022 by Jean-Louis Quéguiner (ex-VP of AI at OVHCloud) and Jonathan Soto (ex-VP of Engineering at Sigfox), Gladia’s mission was to make the most advanced AI available to any developer. Since then, our API has evolved into an enterprise-grade Language AI product, enabling speech-to-text transcription (async and real-time) and audio intelligence (translation, speaker separation, etc).
We’re currently working on a series of audio intelligence features, including summarization and sentiment analysis.
What does Gladia provide to customers (APIs, Saas, languages, open source, etc.)?
At Gladia, we provide a single plug-and-play API, unlocking a range of audio and video transcription and analytics functionalities to help companies fully leverage the untapped potential of their unstructured data.
Here are some factors that set us apart:
- Top-level performance: We provide near error-free transcription accuracy without compromising speed (1h audio <60s) while remaining one of the most affordable API providers on the market.
- Multilingual capabilities: Unlike any other provider on the market, the API already supports 99 languages for transcription, speaker diarization, and translation. All of our features are designed to be fully multilingual.
- Security: We take our client’s privacy seriously. Our data hosting is fully compliant with GDPR.
- Scalability: Flexible pay-as-you-go pricing that allows scaling transcription volumes easily.
Who are your customers? For which areas are Gladia services intended?
Our API is adapted to the most sought-after use cases like virtual meetings translation and analytics, podcast transcription, video subtitling, as well as call center assistance. Our clients come from the corresponding media, SaaS, and call center industries and include Livestorm, Selectra, Musixmatch, and Claap.
Currently, up to 90% of audio data in companies is either lost or unused due to the lack of high-quality and affordable transcription and audio intelligence tools. Our API helps our clients derive more insight from their audio and video data, and power new features in their applications to boost user experience and retention. **It can also help with content creation and SEO, among others.
Why did you decide to be integrated in Eden AI?
Eden AI is a key partner for Gladia, being the #1 resource for developers to access the most advanced AI models. The Gladia team quickly identified Eden AI as the right place to be integrated. Partnering up with the team was very simple, with a light-speed integration.
All of the Eden AI users can now leverage Gladia’s transcription, and we are very proud to be now part of the landscape. Gladia wants to be part of the Eden AI mission, accelerating the adoption of AI within companies.
How will your product evolve (new updates, new plans for the future of your product, etc.)?
We have a lot of super exciting releases coming up this fall. We’re going to expand our market-leading support for transcription in 99 languages to translation, allowing our users to transcribe audio from any language to any language.
We’ll follow up with a package of audio intelligence add-ons like summarization and chapterization — all fully multilingual as well. Our goal is to deliver a truly global API adapted to a variety of real-life use cases and complex environments.There’s a real need for versatile AI solutions right now. As the volume of unstructured data is increasing exponentially, so does the need for optimized tools to extract actionable information from it. Our initial focus is on audio — due to the tremendous ‘dark data’ problem mentioned before — but in the long run, we’re planning to expand to other sources of data.
How to use Gladia on Eden AI?
You’ll need some documentation to use Gladia’s Speech technologies on Eden AI. Then, call the API:
Eden AI is a must-have
Eden AI is the future of AI usage in companies. Our platform not only allows you to call multiple AI APIs but also gives you :
- Centralized and fully monitored billing for all AI APIs
- A unified API for all providers: simple and standard to use, quick switch between providers, access to the specific features of each provider
- Standardized response format: the JSON output format is the same for all suppliers thanks to Eden AI’s standardization work. The response elements are also standardized thanks to Eden AI’s powerful matching algorithms.
- Best Artificial Intelligence APIs of the market: big cloud providers (Google, AWS, Microsoft, and more specialized engines)
- Data protection: Eden AI will not store or use any data. Possibility to filter to use only GDPR engines.
You can see Eden AI documentation here.