Microsoft shows off its weird Silent Voice technology

Microsoft is working on a new voice input interface that lets users speak and be recorded almost silently. The research was conducted by Microsoft Research and presented at ACM CHI 2018. The technology, called SilentVoice, captures speech produced while breathing in, so a whisper-like voice is enough for the microphone to pick up what is said without disturbing people nearby. The device also filters out surrounding speech, so users can capture clear input even with external interference.

SilentVoice is a new voice input interface device that enables voice-based Natural User Interfaces (NUI) in everyday life.

The proposed "ingressive speech" method (speaking while inhaling) allows the microphone to be placed very close to the front of the mouth without being affected by breath noise, so it captures very soft speech with a good signal-to-noise ratio. Sound leakage is very low (less than 39 dB(A)), which makes voice input usable in public and mobile situations, as well as in offices and homes, without annoying people nearby. (Finally, it won't bother people using TNT!)

By measuring the direction of airflow, SilentVoice can separate its own input from external sounds and normal speech with an accuracy of 98.8%, so no activation word is required before voice input starts. It can also be used with a voice activation system that has a specially trained speech recognizer. In the evaluation, word error rates (WERs) were 1.8% under speaker-dependent conditions and 7.0% under speaker-independent conditions on a set of 85 command sentences. The whisper-like natural voice can also be used for real-time voice communication.
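For readers unfamiliar with the metric, WER is commonly defined as the number of word substitutions, deletions, and insertions needed to turn the recognizer's output into the reference sentence, divided by the number of words in the reference. The sketch below illustrates that standard definition only; it is not the evaluation code used by the researchers, and the function name and the example sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missed)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical command sentence: one of four reference words is dropped.
print(word_error_rate("turn on the light", "turn on light"))  # 0.25
```

Under this definition, a WER of 1.8% corresponds to fewer than two word errors per hundred reference words.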

You can watch the full presentation from the ACM CHI conference here: https://youtu.be/9EV1mEtVfuM

The technology is still at the research stage, but it could help people who like to use voice commands and want to do so without disturbing those around them.
