For people with hearing loss it can be very difficult to understand and separate voices in noisy environments. This problem may soon be history thanks to a new groundbreaking algorithm that is designed to recognize and separate voices efficiently in unknown sound environments.
Researchers have taken an important step towards making life easier for people with hearing loss.
People with normal hearing are usually able to understand each other without effort when communicating in noisy environments. However, for people with hearing loss, it is very challenging to understand and separate voices in noisy environments, and a hearing aid may really help. But there's still some way to go when it comes to general sound processing in hearing aids, explains Morten Kolbæk:
When the scenario is known in advance, as in certain clinical test setups, existing algorithms can already beat human performance when it comes to recognizing and distinguishing speakers. However, in normal listening situations without any prior knowledge, the human auditory brain remains the best "machine".
But this is exactly what Morten Kolbæk has worked on changing with his new algorithm.
"Because of its ability to function in unknown environments with unknown voices, the applicability of this algorithm is so much stronger than what we have seen with previous technology. It's an important step forward when it comes to solving challenging listening situations in everyday life", says one of Morten Kolbæk's two supervisors, Jesper Jensen, Senior Researcher at Oticon and Professor at the Centre for Acoustic Signal Processing Research (CASPR) at AAU.
Specifically, Morten Kolbæk’s PhD project has dealt with two different but well-known listening scenarios.
The first track sets out to solve the challenges of one-to-one conversations in noisy spaces such as car cabins. Hearing aid users face such challenges on a regular basis.
"To solve them, we have developed algorithms that can amplify the sound of the speaker while reducing noise significantly without any prior knowledge about the listening situation. Current hearing aids are pre-programmed for a number of different situations, but in real life, the environment is constantly changing and requires a hearing aid that is able to read the specific situation instantly", explains Morten Kolbæk.
The method used in creating the algorithms is called "deep learning", which falls under the machine learning category. More specifically, Morten Kolbæk has worked with deep neural networks, a type of algorithm that you train by feeding it examples of the signals it will encounter in the real world.
If, for instance, we talk about speech-in-noise, you provide the algorithm with an example of a voice in a noisy environment and one of the voice without any noise. In this way, the algorithm learns how to process the noisy signal in order to achieve a clear voice signal. You feed the network with thousands of examples, and during this process, it will learn how to process a given voice in a realistic environment, Jesper Jensen explains.
"The power of deep learning comes from its hierarchical structure that is capable of transforming noisy or mixed voice signals into clean or separated voices through layer-by-layer processing. The widespread use of deep learning today is due to three major factors: ever-increasing computation power, increasing amount of big data for training algorithms and novel methods for training deep neural networks", says Professor Zheng-Hua Tan, who is also affiliated with CASPR and supervisor of the project.
When it comes to hearing aids, the challenge is always to make the technology work on a small computer behind the ear. And right now, Morten’s algorithm requires too much space for this. Even if Mortens algorithm can separate several unknown voices from each other, it isn't able to choose which voice to present to the hearing aid user. So there are some practical issues that we need to solve before we can introduce it in a hearing aid solution. However, the most important thing is that these issues now seem solvable.