Hum a Fingerprint, Extract a Melody – Dogac Basaran, CNRS

Sep 2, 2018 | Carl Robinson

This is the second part of my conversation with Dogac Basaran, a post-doctoral researcher at CNRS, the French national scientific research centre. If you missed the first part, you might want to go back and listen to the previous episode on Signal Processing Basics for Audio.

Today, in part 2 of 2, we explore Dogac’s research into audio fingerprinting, alignment, and melody extraction. By analysing the magnitude of frequency peaks and their relative spacing, Dogac shows us how it’s possible to create audio fingerprints that can be used to detect and match audio recordings, even if they contain noise or are incomplete. These fingerprints have a variety of uses, including aligning multiple recordings of a single speaker/performance, and identifying a particular recording.
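For readers who want to see the idea in code, here is a minimal sketch of peak-based fingerprinting and alignment. It is not Dogac's implementation: the function names (`spectrogram`, `fingerprint`, `best_offset`), the frame and hop sizes, and the choice of a single peak per frame are all assumptions made for illustration. It hashes pairs of peak frequencies together with their time spacing, then votes on the frame offset that best aligns a short or noisy excerpt with a reference recording.

```python
import numpy as np
from collections import Counter

def spectrogram(signal, frame=1024, hop=512):
    """Magnitude spectrogram from a Hann-windowed short-time FFT."""
    window = np.hanning(frame)
    n_frames = 1 + (len(signal) - frame) // hop
    frames = np.stack([signal[i * hop:i * hop + frame] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))

def fingerprint(signal, fan_out=3):
    """Hash pairs of per-frame peak frequencies with their time spacing."""
    mags = spectrogram(signal)
    peaks = mags.argmax(axis=1)  # strongest frequency bin in each frame
    hashes = []
    for t, f1 in enumerate(peaks):
        # Pair each peak with the next few peaks (assumed fan-out of 3).
        for dt in range(1, fan_out + 1):
            if t + dt < len(peaks):
                hashes.append(((int(f1), int(peaks[t + dt]), dt), t))
    return hashes

def best_offset(reference, query):
    """Vote on the frame offset that aligns the query with the reference."""
    index = {}
    for h, t_ref in fingerprint(reference):
        index.setdefault(h, []).append(t_ref)
    votes = Counter()
    for h, t_query in fingerprint(query):
        for t_ref in index.get(h, []):
            votes[t_ref - t_query] += 1
    # Returns (offset in frames, number of votes), or (None, 0) if no match.
    return votes.most_common(1)[0] if votes else (None, 0)
```

Calling `best_offset(reference, query)` with two NumPy arrays of audio samples returns the most-voted frame offset and its vote count. A high vote count at one offset suggests the excerpt comes from the reference; practical systems typically keep several peaks per frame and use more robust peak picking than this sketch does.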

We also discuss query by humming, the state-of-the-art technique that takes an audio fingerprint of a person humming a melody, and matches it to a database of music recordings. Dogac also explains why learning how to build neural networks has become an essential skill in this field.

Links from the show:

Full show notes: http://bit.ly/voicetechpodcast
Dogac Basaran on Github: https://github.com/dogacbasaran
Dogac Basaran’s websites: https://dbasaran.wp.imt.fr/ and http://dogacbasaran.com/
Signal Processing MOOC on Coursera: https://www.coursera.org/learn/dsp
MATLAB: https://matlab.mathworks.com/

Author

Carl Robinson
Host - Voice Tech Podcast | Founder - Tizz Tech podcast agency
Carl uses the latest technologies to help businesses grow with audio content.
https://voicetechpodcast.com
