This speech recognition software is much faster than human typists

It's also more accurate.
Original image replaced with Mashable logo
Original image has been replaced. Credit: Mashable

Speech recognition software used to be awful. It couldn't deal with background noise, its vocabulary was limited and it was awkward to use in public.

But we've come a long way. A new speech recognition system can transcribe English or Mandarin about three times faster than humans can type on a smartphone, according to a recent study. Developers from the popular Chinese search engine Baidu created the program back in December. It's called Deep Speech 2, and it uses machine learning to vastly improve speech recognition.

The study, a collaboration between Stanford University, Baidu and the University of Washington, also found that the system produced 20.4 percent fewer errors than people typing in English and 63.8 percent fewer than people working in Mandarin.


You May Also Like

In the experiment, 32 individuals would either type on a QWERTY smartphone keyboard or speak sample phrases such as "physics and chemistry are hard," "have a good weekend" and "go out for some pizza and beer."

To make the conditions as authentic as possible, the human subjects were also allowed to use autocorrect and word suggestions. And all participants were between 19 and 32 years old.

"We’re putting speech recognition up against people who are really good at this task," study co-author James Landay told Stanford News.

The system produced 20.4 percent fewer errors than people typing in English and 63.8 percent fewer than people working in Mandarin.

Overall, participants preferred the speech recognition system, though they had some reservations.

"When I made mistakes it seemed like it took longer to correct, because I was switching from holding the phone to speak vs. holding to type," said one participant, according to the study.

Another participant admitted to simply being more comfortable on their keyboard. "I could correct errors as they were made," the participant said "That being said, it seemed like I introduced more errors typing that the speech system did."

The researchers suggested some improvements for future versions of the system. For example, it would be great if the system could auto-detect when a speaker was finished instead of pressing a button. The researchers would also like to test it in a variety of noise levels.

Baidu doesn't plan to make the software available to the public, but they are integrating it into Baidu apps in China.

Even so, Landay sees the potential for widespread use of this speech recognition system.

“You could imagine an interface where you use speech to start and then it switches to a graphical interface that you can touch and control with your finger,” Landay said. “We should put speech in more applications than just typing an email or text message."

Watch an in-depth explanation of the experiment below:

Mashable Potato

Recommended For You
Hinge tests facial recognition scans in these countries
hinge logo on iphone

OpenAI may sell $300 smart speaker with camera — in 2027
Sam Altman speaking at a microphone

Turn ideas into a book faster with Aivolut AI Book Creator for $118.99
Laptop on desk

Rhea Seehorn's Golden Globes speech is delightfully candid
Rhea Seehorn at the Golden Globes.

Kindle Scribe 3rd Generation: A much-needed improvement among Amazon's e-readers
A 2025 Kindle Scribe on a table

Trending on Mashable
NYT Connections hints today: Clues, answers for April 3, 2026
Connections game on a smartphone

Wordle today: Answer, hints for April 3, 2026
Wordle game on a smartphone

What's new to streaming this week? (April 3, 2026)
A composite of images from film and TV streaming this week.

NYT Strands hints, answers for April 3, 2026
A game being played on a smartphone.

Google launches Gemma 4, a new open-source model: How to try it
Google Gemma
The biggest stories of the day delivered to your inbox.
These newsletters may contain advertising, deals, or affiliate links. By clicking Subscribe, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.
Thanks for signing up. See you at your inbox!