Speechmatics pushes forward recognition of accented English

Devin Coldewey

Updated October 26, 2021 at 3:17 PM·4 min read

Speech recognition has gone from convenient to crucial over the last few years as smart speakers and driving assist modes have taken off — but not everyone's voice is recognized equally well. Speechmatics claims to have the most inclusive and accurate model out there, beating Amazon, Google and others when it comes to speech outside of the most common American accents.

The company explained that it was guided toward the question of accuracy by a 2019 Stanford study entitled "Racial Disparities on Speech Recognition," which found exactly that. Speech engines from Amazon, Apple, Google, IBM and Microsoft "exhibited substantial racial disparities, with an average word error rate (WER) of 0.35 for black speakers compared with 0.19 for white speakers." Not great!

The source of this disparity may be partly attributed to a lack of diversity in the datasets used to train these systems. After all, if there are few black speakers in the data, the model will not learn those speech patterns as well. The same may be said for speakers with other accents, dialects, and so on — America (let alone the U.K.) is full of accents and any company claiming to make services for "everyone" should be aware of that.

At any rate, U.K.-based Speechmatics made accuracy in transcribing accented English a priority for its latest model, and it claims to have blown the others out of the water. Based on the same data sets used in the Stanford study (but using the latest versions of the speech software), "Speechmatics recorded an overall accuracy of 82.8% for African American voices compared to Google (68.7%) and Amazon (68.6%)," the company wrote in its press release.

The company credits this success to a relatively new approach to creating a speech recognition model. Traditionally, the machine learning system is provided with labeled data — think an audio file of speech with an accompanying metadata or text file that has what's being said, usually transcribed and checked by humans. For a cat detection algorithm you'd have images and data saying which ones contain cats, where the cat is in each picture, and so on. This is supervised learning, where a model learns correlations between two forms of prepared data.

Speechmatics used self-supervised learning, a method that's gained steam in recent years as datasets, learning efficiency, and computational power have grown. In addition to labeled data, it uses raw, unlabeled data and much more of it, building its own "understanding" of speech with far less guidance.

Computer vision inches toward ‘common sense’ with Facebook’s latest research

In this case the model was based on about 30,000 hours of labeled data to get a sort of base level of understanding, then was fed 1.1 million hours of publicly available audio sourced from YouTube, podcasts and other content. This type of collection is a bit of a grey area, since no one explicitly consented to have their podcast used to train someone's commercial speech recognition engine. But it's being used that way by many, just as "the entire internet" was used to train OpenAI's GPT-3, probably including thousands of my own articles. (Though it has yet to master my unique voice.)

In addition to improving accuracy for Black American speakers, the Speechmatics model claims better transcription for children (about 92% accurate versus about 83% in Google and Deepgram) and small but significant improvements in English with accents from around the world: Indian, Filipino, Southern African and many others — even Scottish.

They support dozens of other languages and are competitive in many of them, as well; this isn't just an English recognition model, but given the language's use as a lingua franca (a hilariously inapt idiom nowadays), accents are especially important to it.

Speechmatics may be ahead in the metrics it cites, but the AI world moves at an incredibly rapid clip and I would not be surprised to see further leapfrogging over the next year. Google, for instance, is hard at work on making sure its engines work for people with impaired speech. Inclusion is an important part of all AI work these days and it's good to see companies trying to outdo each other in it.

Google details AI work behind Project Euphonia’s more inclusive speech recognition

Yahoo Sports
2024 NBA Mock Draft 7.0: Who will the Hawks take at No. 1? Our projections for every pick with lottery order now set
With the lottery order set, here's a look at Yahoo Sports' projections for both rounds of the 2024 NBA Draft.
Yahoo Sports
What scouts think of Bronny James' NBA prospects
The biggest question looming over the NBA draft combine this week: How will Bronny James do?
Yahoo Sports
NFL schedule release: Chiefs to host Ravens in 2024 season opener
Chiefs vs. Ravens on Sept. 5 will be a rematch of last season's AFC Championship Game.
Yahoo Sports
The Spin: Making a call on 5 slumping fantasy baseball stars
All five of these hitters were drafted highly in fantasy baseball leagues. So far, they have not lived up to their ADPs — and that's an understatement. Scott Pianowski analyzes.
Yahoo Finance
Utility stocks are on fire — here are Wall Street analysts' top picks
Utility stocks are outperforming the broader markets. Here's a look at three top picks from analysts.
Yahoo Sports
Former MLB infielder, Little League World Series star Sean Burroughs dies at 43
The seven-year major leaguer collapsed while coaching his son's Little League game on Thursday.
Yahoo Sports
Dolphins owner Stephen Ross reportedly declined $10 billion for team, stadium and F1 race
The value of the Dolphins and Formula One racing is enormous.
Yahoo Sports
The best RBs for 2024 fantasy football, according to our experts
The Yahoo Fantasy football analysts reveal their first running back rankings for the 2024 NFL season.
Yahoo Finance
Here's 1 big investing mistake you are probably still making
Maybe a 5% CD isn't the best choice for your hard-earned money.
Yahoo Sports
Juan Soto’s unapologetic intensity and showmanship are captivating the Bronx and rubbing off on teammates: ‘Literally every pitch is theater’
The 2024 Yankees have rediscovered their bravado and hold the second-best record in the AL, thanks in large part to the superstar outfielder.
Yahoo Finance
The FDIC change that leaves wealthy bank depositors with less protection
Affluent Americans may want to double-check how much of their bank deposits are protected by government-backed insurance. The rules governing trust accounts just changed.
Yahoo Sports
Timberwolves coach Chris Finch calls Jamal Murray's heat-pack toss on court 'inexcusable and dangerous'
Murray made a bad night on the court worse during a moment of frustration on the bench.
Engadget
The best budgeting apps for 2024
Budgeting apps can help you keep track of your finances, stick to a spending plan and reach your money goals. These are the best budget-tracking apps available right now.
Yahoo Sports
Derrick Lewis strips off shorts, moons crowd in St. Louis after KO win over Rodrigo Nascimento
“I appreciate St. Louis for letting me show my naked ass tonight."
Yahoo Sports
Wide receiver rankings for 2024 fantasy football
The Yahoo Fantasy football analysts reveal their first wide receiver rankings for the 2024 NFL season.
Yahoo Sports
Tight end rankings for fantasy football 2024
The Yahoo Fantasy football analysts reveal their first tight end rankings for the 2024 NFL season.
Yahoo Finance
Former House Speaker Paul Ryan says he’s not voting for Trump : 'Character is too important'
Ryan says he would be writing in a Republican candidate instead of voting for Donald Trump.
Yahoo Finance
Social Security just passed Medicare as the government's most pressing insolvency risk
An annual government report offered a glimmer of good news for Social Security and a jolt of good news for Medicare even as both programs continue to be on pace to run dry next decade.
Yahoo Finance
Biden's coming new tariffs on China reflect 'lessons learned'
A sweeping White House move on China tariffs that is expected to be unveiled early next week "reflects lessons learned," according to a former official who was involved in the process.
Yahoo Finance
Australian ambassador: 'American model is proving its resilience' despite threat from Chinese industrial policy
China may be outspending the US when it comes to industrial policy in sectors like electric vehicles and semiconductors, but America is winning on innovation where it can’t on price, according to one China expert.

News

Life

Entertainment

Finance

Sports

New on Yahoo

Michael Cohen testifies in Trump hush money case

Speechmatics pushes forward recognition of accented English

Recommended Stories

2024 NBA Mock Draft 7.0: Who will the Hawks take at No. 1? Our projections for every pick with lottery order now set

What scouts think of Bronny James' NBA prospects

NFL schedule release: Chiefs to host Ravens in 2024 season opener

The Spin: Making a call on 5 slumping fantasy baseball stars

Utility stocks are on fire — here are Wall Street analysts' top picks

Former MLB infielder, Little League World Series star Sean Burroughs dies at 43

Dolphins owner Stephen Ross reportedly declined $10 billion for team, stadium and F1 race

The best RBs for 2024 fantasy football, according to our experts

Here's 1 big investing mistake you are probably still making

Juan Soto’s unapologetic intensity and showmanship are captivating the Bronx and rubbing off on teammates: ‘Literally every pitch is theater’

The FDIC change that leaves wealthy bank depositors with less protection

Timberwolves coach Chris Finch calls Jamal Murray's heat-pack toss on court 'inexcusable and dangerous'

The best budgeting apps for 2024

Derrick Lewis strips off shorts, moons crowd in St. Louis after KO win over Rodrigo Nascimento

Wide receiver rankings for 2024 fantasy football

Tight end rankings for fantasy football 2024

Former House Speaker Paul Ryan says he’s not voting for Trump : 'Character is too important'

Social Security just passed Medicare as the government's most pressing insolvency risk

Biden's coming new tariffs on China reflect 'lessons learned'

Australian ambassador: 'American model is proving its resilience' despite threat from Chinese industrial policy