Advancing Voice Restoration: Respeecher's Collaboration with Researcher Konrad Zieliński

Audio article by Respeecher

Voice conversion is a new technology that is capable of drastically changing the way people with speech disabilities function in their everyday lives.

The Respeecher team is constantly looking for new ways to advance its technology. On our journey, we met with so many extraordinary people. One of them was a young scientist named Konrad Zieliński, a Ph.D. student at the University of Warsaw who had lost his voice due to laryngectomy. His need for novel solutions for people in similar situations was the inspiration for the topic of his thesis, the main focus of his company, Uhura Bionics, and the beginning of our fruitful collaboration.

A laryngectomy is the surgical removal of the larynx (or voice box). It is usually done to treat severe or advanced-stage cases of laryngeal cancer. As a result of the surgery, patients lose their ability to speak and have to rely on voice-assistance technologies such as an electrolarynx or tracheoesophageal voice prosthesis (TEP).

However, there are certain challenges regarding these two ways of “fixing” voice disabilities.

Today, patients must use various noise-reduction techniques and filtering systems to improve speech intelligibility with an electrolarynx. At present, the technology responsible for improving electrolaryngeal speech intelligibility still has a long way to go. And while TEP speech is more natural than electrolarynx speech, it is still much less natural sounding than normal laryngeal speech.

Both of these methods seek to restore a patient's ability to speak. The only issue is the low quality of voice that these technologies produce. Communication difficulties affect patients in their jobs, personal relationships, and social gatherings.

Konrad is looking for different ways to restore patients’ original voices after having their larynx removed, and that’s why he decided to try Respeecher.

We interviewed Konrad to learn more about his research plans, his experience using Respeecher, and his future plans regarding solutions for voice disorders.

Respeecher: Can you tell us a little bit about the work you are doing right now?

Konrad: I am a PhD Student at the University of Warsaw, I work in the Human Interactivity and Language Lab (HILL). Right now I am working with a grant that has allows me to evaluate various bionic systems for laryngectomees.

I recently had been published at two prestigious conferences on human-computer interaction and speech technology: ACM CHI 2022 in New Orleans, US and INTERSPEECH in Icheon, Korea.

Currently, I am developing my own company Uhura Bionics with technical co-founder Marek Grzelec. We are participating in the EIT Health Patient Innovation Bootcamp, distributing voice amplifying speakers for people with voice disorders. We are also working on a novel electronic larynx that generates more elegant speech with a more natural sound.

We are natural partners for Respeecher and we feel like we have a lot to do together.

R: What is the eventual goal you are trying to accomplish with Respeecher? Is it real-time voice conversion?

K: Yes. We are striving to achieve voice conversion in 50 millisecond, so that it is imperceptible to the human ear. However, it is a very ambitious goal, and there are still many technical challenges ahead of us.

R: Speaking of now, with the current level of the technology Respeecher has, how are you going to use it for your recent projects?

K: I consider two scenarios.

One is as an assistive tech for content production. So that people with voice disabilities could produce different types of audio and video content, such as lectures, voiceovers, advertisements, and many more.

Another scenario is related to one of the biggest issues people with voice disabilities have - communicating with someone directly or by phone. Devices designed to aid communication (as electrolarynx or TEP) usually make speech sound robotic and unrecognizable, especially when talking to someone by phone.

R: Do you consider this technology to be available for the wide scope of people affected by voice disabilities?

K: When the technology is more developed, I would say in 5-10 years, then of course, it will become more available for a larger number of patients with voice problems. The combination of the devices we are developing right now and the Respeecher technology will make the lives of people suffering from speech problems much easier.

R: What are your thoughts after trying Respeecher?

K: Respeecher created a software that allows me to change my voice with electrolarynx to sound more human-like. They even utilized recordings from before my laryngectomy surgery (4 years ago) to build a dedicated voice conversion system that resembles my old voice! I can’t describe how excited I am about the current results and our future work together!

How does Respeecher's Technology for Patients with Voice Disabilities Work

In order to help laryngectomy patients achieve a higher quality of life, Respeecher is exploring the use of its technology on voice samples of the electrolarynx and tracheoesophageal voice. The solution will offer real-time intelligible voice replacement to improve support specifically for individuals who have undergone a laryngectomy.

Respeecher’s voice-changing technology transforms the sound of electrolarygeal and TEP speech into clearer, more articulated, and more intelligible audio. In particular, the technology dampens the mechanical hum of the electrolarynx and TEP voices while accentuating the natural tonal inflections. This makes it much easier to communicate, both for the patient and their interlocutors.

The technology could be deployable using a modified phone. Patients can communicate live through a speaker or use the technology for phone calls and other electronic voice communications.

Respeecher_Collaborates_with_a_Young_Researcher_to_Advance_Voice_Capabilities_for_Laryngectomy Patients_Respeecher_Voice_Cloning_Software

The Result and Future Steps

Today, Respeecher’s voice cloning algorithms deliver critical benefits when employed as an assistive technology to those who have lost their natural ability to speak.

Konrad provided the samples of his voice for Respeecher to create his voice model and started testing the Voice Marketplace in real-time and offline voice conversions. The results of the collaboration create an example of how laryngectomy patients can use Voice Marketplace independently.

With the help of voice cloning, as the partnership with Konrad showed, the patients are able to communicate in a more natural manner as well as produce different types of audio and video content, such as lectures, voiceovers, advertisements, and many more.

FAQ

AI voice cloning utilizes speech synthesis to copy and simulate artificial voices using the pattern and timbre of a particular voice. The special characteristics of any individual's voice can be identified through it and recreated for applications ranging from voice restoration technology in voice therapy sessions to digital communication assistants to assist the afflicted speech handicapped.

Respeecher's voice synthesis can support patients who have undergone laryngectomy and need to use AI in speech rehabilitation. Laryngectomy patient solutions enhance the quality of the speech sound produced with electronic devices, such as the electrolarynx. Electrolarynx voice enhancement is provided by clearness and naturalness of speech. The communication flow is smoother, while the mechanical hum of post-surgery voices will generally reduce the robotic sound of the speech.

Yes, Respeecher can rebuild the voice of a patient after laryngectomy, providing personalized voice cloning for patients. Using presurgery voice recordings and voice restoration technology, Respeecher creates an individual voice model so that patients can speak with a voice more or less identical to the presurgery voice, and have better intelligibility and emotional involvement.

The collaboration between Respeecher and Konrad Zieliński has advanced voice restoration technology by combining Respeecher's AI voice cloning with electrolarynx devices. This partnership has led to real-time voice conversion, allowing laryngectomy patients to produce more natural-sounding speech and engage in clearer communication.

AI voice cloning cleans the mechanical hum from the speech of the electrolarynx and provides speech with more tonal variation, hence making it sound more human-like. The Respeecher voice synthesis technology increases emotional quality of speech generated by electrolarynx and in this way helps patients after laryngectomy to communicate better.

Glossary

AI voice cloning

A technology that replicates a person’s voice using speech synthesis, enabling voice restoration technology for laryngectomy patients and improving electrolarynx speech quality.

Electrolarynx

A device used by laryngectomy patients to improve speech, enhanced by AI voice cloning and speech synthesis for improved voice quality and communication clarity.

Tracheoesophageal voice

A speech method used by laryngectomy patients, enhanced by voice restoration technologies like AI voice cloning and electrolarynx improvement for clearer communication.

Laryngectomy

A surgical procedure that removes the voice box, often followed by voice restoration technologies like AI voice cloning and electrolarynx voice enhancement.

Voice restoration technology

Advanced solutions like AI voice cloning and electrolarynx voice enhancement enhancement that help restore natural speech for laryngectomy patients and improve communication.

Speech Synthesis for Medical Applications

Explore how Respeecher’s voice restoration technology aids laryngectomy patients, enhancing speech with AI voice cloning and improving communication quality.

Advanced AI Voice Cloning for
Laryngectomy Patients

Respeecher’s AI voice cloning improves voice restoration technology for laryngectomy patients.It enables electrolarynx voice enhancement and improves TEP voices and this makes clarity and articulation much better. The technology helps restore a more natural-sounding voice, alleviating the robotic speech often associated with traditional voice-assistance devices, making communication smoother and more effective.

Show More
Personalized Voice Cloning for
Enhanced Speech Quality

One of the key advancements of Respeecher’s technology is personalized voice cloning for patients. By using voice samples, Respeecher voice synthesis creates customized models that replicate the patient's pre-surgery voice. This offers a more human-like and intelligible voice, overcoming the mechanical hum typical of post-laryngectomy communication aids, such as the electrolarynx, improving overall speech quality for better interaction.

Show More
Real-Time Voice Conversion and
Future Applications

With Respeecher’s real-time voice conversion, patients can communicate effortlessly in live conversations, including phone calls. The technology holds immense potential in the future of speech synthesis for medical applications. It promises to be a game-changer for post-laryngectomy communication aids, allowing patients to regain natural communication in both professional and personal settings.

Show More

Did you like this content?

Healthcare

Respeecher's Voice Cloning Technology Revives Tommy Muñiz for Modern 'Los García' Production

Respeecher's Voice Synthesis Restores Communication for Friedreich's Ataxia Patient

Advancing Voice Restoration: Respeecher's Collaboration with Researcher Konrad Zieliński

How does Respeecher's Technology for Patients with Voice Disabilities Work

The Result and Future Steps

FAQ