AI programs often exclude African languages. These researchers have a plan to fix that.

African languages are severely underrepresented in services like Alexa, Siri, and ChatGPT.

There are over 7,000 languages spoken throughout the world, nearly half of which are considered endangered or extinct. Meanwhile, only a comparatively tiny number are supported by natural language processing (NLP) artificial intelligence programs like Siri, Alexa, or ChatGPT. Speakers of African languages are particularly overlooked, and have long faced systemic biases alongside other marginalized communities within the tech industry. To help address inequalities affecting billions of people, a team of researchers in Africa is working to establish a plan of action for developing AI that can support these languages.

The suggestions arrive thanks to members of Masakhane (which roughly translates to “We build together” in isiZulu), a grassroots organization dedicated to advancing NLP research in African languages, “for Africans, by Africans.” As detailed in a new paper published today in Patterns, the team surveyed African language-speaking linguists, writers, editors, software engineers, and business leaders to identify five major themes to consider when developing African NLP tools.

[Related: AI plagiarism detectors falsely flag non-native English speakers.]

Firstly, the team emphasizes Africa as a multilingual society (Masakhane estimates over 2,000 of the world’s languages originate on the continent), and these languages are vital to cultural identities and societal participation. There are over 200 million speakers of Swahili, for example, while 45 million people speak Yoruba.

Secondly, the authors emphasize that developing the proper support for African content creation is vital to expanding access, including tools like digital dictionaries, spell checkers, and African language-supported keyboards.

They also note that multidisciplinary collaborations between linguists and computer scientists are key to designing better tools, and that developers should keep in mind the ethical obligations that come with data collection, curation, and usage.

“It doesn’t make sense to me that there are limited AI tools for African languages. Inclusion and representation in the advancement of language technology is not a patch you put at the end—it’s something you think about up front,” Kathleen Siminyu, the paper’s first author and an AI researcher at Masakhane Foundation, said in a statement on Friday.

[Related: ChatGPT’s accuracy has gotten worse, study shows.]

Some of the team’s other recommendations include additional structural support for content moderation tools to help curtail the spread of misinformation in African languages online, as well as funding for legal cases involving the use of African language data by non-African companies.

“I would love for us to live in a world where Africans can have as good a quality of life and access to information and opportunities as somebody fluent in English, French, Mandarin, or other languages,” Siminyu continues. Going forward, the team hopes to expand their study to include even more participants, and to use their research to help preserve indigenous African languages.

“[W]e feel that these are challenges that can and must be faced,” Patterns' scientific editor Wanying Wang writes in the issue’s accompanying editorial. Wang also hopes additional researchers will submit their own explorations and advancements in non-English NLP.

“This is not limited just to groundbreaking technical NLP advances and solutions but also open to research papers that use these or similar technologies to push language and domain boundaries,” writes Wang.