Buzzo Viral News
Technology

MLCommons and Hugging Face team up to release massive speech data set for AI research

By Buzzo | January 31, 2025

MLCommons, a nonprofit AI safety working group, has teamed up with AI dev platform Hugging Face to release one of the world’s largest collections of public domain voice recordings for AI research.

The data set, called Unsupervised People’s Speech, contains more than a million hours of audio spanning at least 89 different languages. MLCommons says it created the data set to support R&D in “various areas of speech technology.”

“Supporting broader natural language processing research for languages other than English helps bring communication technologies to more people globally,” the organization wrote in a blog post Thursday. “We anticipate several avenues for the research community to continue to build and develop, especially in the areas of improving low-resource language speech models, enhanced speech recognition across different accents and dialects, and novel applications in speech synthesis.”

It’s an admirable goal, to be sure. But AI data sets like Unsupervised People’s Speech can carry risks for the researchers who choose to use them.

Biased data is one of those risks. The recordings in Unsupervised People’s Speech came from Archive.org, the nonprofit perhaps best known for the Wayback Machine web archival tool. Because many of Archive.org’s contributors are English-speaking — and American — almost all of the recordings in Unsupervised People’s Speech are in American-accented English, per the readme on the official project page.

That means that, without careful filtering, AI systems like speech recognition and voice synthesizer models trained on Unsupervised People’s Speech could inherit that skew. They might, for example, struggle to transcribe English spoken by a non-native speaker, or have trouble generating synthetic voices in languages other than English.

Unsupervised People’s Speech might also contain recordings from people unaware that their voices are being used for AI research purposes — including commercial applications. While MLCommons says that all recordings in the data set are public domain or available under Creative Commons licenses, there’s the possibility mistakes were made.

According to an MIT analysis, hundreds of publicly available AI training data sets lack licensing information and contain errors. Creator advocates including Ed Newton-Rex, the CEO of AI ethics-focused nonprofit Fairly Trained, have made the case that creators shouldn’t be required to “opt out” of AI data sets because of the onerous burden opting out imposes on these creators.

“Many creators (e.g. Squarespace users) have no meaningful way of opting out,” Newton-Rex wrote in a post on X last June. “For creators who can opt out, there are multiple overlapping opt-out methods, which are (1) incredibly confusing and (2) woefully incomplete in their coverage. Even if a perfect universal opt-out existed, it would be hugely unfair to put the opt-out burden on creators, given that generative AI uses their work to compete with them — many would simply not realize they could opt out.”

MLCommons says that it’s committed to updating, maintaining, and improving the quality of Unsupervised People’s Speech. But given the potential flaws, it’d behoove developers to exercise serious caution.
