Home » Episodes » Speech to Text – Eric Bolo, Batvoice

Speech to Text – Eric Bolo, Batvoice

Apr 29, 2018

| Carl Robinson

Eric Bolo is the CTO of Batvoice Technologies, a speech analytics startup based in Paris, France. Eric talks about building a custom speech-to-text system for their flagship product, Call Watch.

He introduces us to speech analytics and audio-mining, and describes some typical applications. We go into detail about speech-to-text (STT) technologies, and discuss the pros and cons of using cloud STT services such as Google speech versus building a custom STT system yourself.

Eric tells us about the latest open source tools and frameworks for building STT systems, and how to get that precious voice data to train our models. We learn how to build and annotate a custom voice dataset ourselves, and hear his advice on starting a voice first company.

This is a great first episode to kick off the series! Eric is super smart, with excellent technical skills and a real passion for voice technology. We already know each other quite well, so I couldn’t think of anyone I’d rather have as my first guest on the show. I know you’re gonna enjoy hearing what he had to say!

Links from the show:

Batvoice / Callwatch: http://www.batvoice.com
Google speech API: https://cloud.google.com/speech-to-text/
Microsoft Translator Speech API: https://www.microsoft.com/en-us/translator/speech.aspx
IBM Watson: https://www.ibm.com/watson/services/text-to-speech/
Kaldi toolkit for speech recognition: http://kaldi-asr.org/doc/about.html
EESEN speech recognition framework: https://github.com/srvk/eesen
European Language Resources Association: http://www.elra.info/en/
Mozilla Common Voice Project: https://voice.mozilla.org/en
TEDlium English speech recognition training corpus from TED talks: http://www.openslr.org/7/
Voxforge GPL Transcribed Speech corpus: http://www.voxforge.org/
Amazon Mechanical Turk: https://www.mturk.com/

Find us here:

★ Support this podcast on Patreon ★

Author

Carl Robinson
Host - Voice Tech Podcast | Founder - Tizz Tech podcast agency
Carl uses the latest technologies to help businesses grow with audio content.
https://voicetechpodcast.com

Search

5 Fantastic French Startups – Vivatech 2021

Five different french audio startups explain what their companies’ missions are as they’re interviewed at the Vivatech 2021 mega-conference in Paris, France. One french company that stood out was Cogneed, which focuses on the power of AI specifically when it comes to...

Conversation Design Festival – Hans van Dam, CDI

Hans van Dam is the CEO of Conversation Design Institute, a company that teaches a human-centric workflow to conversation design which has proven itself in organisations around the world. This is Hans’s second appearance on the show, the first one being in episode 51...

Real-time Voice Changer – Jaime Bosch & Alex Bordanova, Voicemod

Jaime Bosch is the co-founder CEO of Voicemod, a massively popular real-time voice manipulation, augmentation & soundboard application for Windows PC. He is joined by Alex Bordanova, Director of Audio Experience at Voicemod. Based in Valencia, Spain, Voicemod has...

Speech to Text – Eric Bolo, Batvoice

Author

Search

Related

5 Fantastic French Startups – Vivatech 2021

Conversation Design Festival – Hans van Dam, CDI

Real-time Voice Changer – Jaime Bosch & Alex Bordanova, Voicemod

Listen now

Business enquiries

Book a call and let's get it going

Who will you speak with?

Carl Robinson