The Entrepreneurs Weekly
No Result
View All Result
Wednesday, July 2, 2025
  • Login
  • Home
  • BUSINESS
  • POLITICS
  • ENTREPRENEURSHIP
  • ENTERTAINMENT
Subscribe
The Entrepreneurs Weekly
  • Home
  • BUSINESS
  • POLITICS
  • ENTREPRENEURSHIP
  • ENTERTAINMENT
No Result
View All Result
The Entrepreneurs Weekly
No Result
View All Result
Home Business

OpenAI May Have Used YouTube Videos for AI Training | Entrepreneur

by Brand Post
April 8, 2024
in Business
0
OpenAI May Have Used YouTube Videos for AI Training | Entrepreneur
152
SHARES
1.9k
VIEWS
Share on FacebookShare on Twitter


Where does AI training data come from?

A report from The New York Times revealed on Friday that OpenAI may have trained AI models on YouTube video transcriptions and Google may have been doing the same thing.

The report found that in the hunt for fresh digital data to train its newer, smarter AI system, OpenAI researchers created a workaround called Whisper, which could take YouTube videos and transcribe them into text that could then be fed as new AI training data — for a more conversational, next-generation AI.

The process of developing GPT-4, the powerful AI model behind OpenAI’s latest ChatGPT chatbot, took over a million hours of YouTube videos transcribed by Whisper, according to the NYTimes’ sources.

Related: OpenAI Is Holding Back the Release of Its New AI Voice Generator

The Times reports that OpenAI employees had conversations about how YouTube transcription training data could potentially violate YouTube’s rules, but OpenAI decided to move forward anyway with the belief that training AI with the videos was fair use.

Knowledge of where the training data was coming from extended up to senior leadership, according to The Times, with OpenAI’s president Greg Brockman even allegedly helping collect videos.

The Wall Street Journal’s Joanna Stern interviewed OpenAI’s CTO Mira Murati last month and asked her what data was used to train one of OpenAI’s most recent products: a tool called Sora that generates videos based on text prompts.

Related: Authors Are Suing OpenAI Because ChatGPT Is Too ‘Accurate’

“We used publicly available data and licensed data,” Murati said. When Stern asked “So, videos on YouTube?” Murati replied, “I’m actually not sure about that.”

When Stern further asked “Videos from Facebook, Instagram?” Murati stated, “You know, if they were publicly available, publicly available to use, there might be the data, but I’m not sure. I’m not confident about it.”

YouTube CEO Neal Mohan said last week that if OpenAI used YouTube videos to train Sora, that would be a “clear violation” of YouTube’s terms of use.

The terms of service “does not allow for things like transcripts or video bits to be downloaded,” Mohan told Emily Chang, host of Bloomberg Originals.

Yet five sources told The Times that Google did the same thing as OpenAI, allegedly transcribing YouTube videos to generate new training text for its AI models in a potential violation of copyright law.

Google owns YouTube and told The Times that its AI is “trained on some YouTube content” that its agreements with creators allow.

Related: Getty Images Has Started Legal Proceedings Against an AI Generative Art Company For Copyright Infringement

Lawsuits over training AI with copyrighted material have become widespread in recent years, with authors like Paul Tremblay and Sarah Silverman alleging that their books were part of datasets used to train AI — without their consent.

The lawyers for these lawsuits, Joseph Saveri and Matthew Butterick, state on their website that generative AI is just “human intelligence, repackaged and divorced from its creators.”

More than 15,000 authors signed a letter last year asking big tech CEOs, including ones at OpenAI, Google, Microsoft, Meta, and IBM, to obtain the consent of writers before training AI with their work and credit and compensate them.

It’s not just authors: musicians too are feeling the impact of AI. Artists like Billie Eilish and Jon Bon Jovi signed an open letter last week accusing big tech companies of using their work to train models without permission or compensation.

“These efforts are direly aimed at replacing the work of human artists with massive quantities of AI-created “sounds” and “images” that substantially dilute the royalty pools that are paid out to artists,” the letter stated.

Tennessee became the first state to pass legislation protecting artists from deepfakes, or cloned and manipulated versions of their voices, last month.

Related: Tennessee Just Passed a New Law to Protect Musicians From a Growing AI Threat



Source link

Tags: AI toolsBusiness NewsentrepreneurNews and TrendsOpenAITrainingVideosYouTube

Related Posts

AI Startup TML From Ex-OpenAI Exec Mira Murati Pays 0,000 | Entrepreneur
Business

AI Startup TML From Ex-OpenAI Exec Mira Murati Pays $500,000 | Entrepreneur

July 1, 2025
Why Your Finance Team Needs an AI Strategy, Now | Entrepreneur
Business

Why Your Finance Team Needs an AI Strategy, Now | Entrepreneur

July 1, 2025
He Went From 1K in Debt to Teaching Others How to Succeed | Entrepreneur
Business

He Went From $471K in Debt to Teaching Others How to Succeed | Entrepreneur

July 1, 2025
  • Trending
  • Comments
  • Latest
Meet Amir Kenzo: A Well Known Musical Artist From Iran.

Meet Amir Kenzo: A Well Known Musical Artist From Iran.

August 21, 2022
Behind the Glamour: Bella Davis Opens Up About Overcoming Adversity in Modeling

Behind the Glamour: Bella Davis Opens Up About Overcoming Adversity in Modeling

April 20, 2024
Dr. Donya Ball: Pioneering Leadership Solutions for Tomorrow’s Challenges

Dr. Donya Ball: Pioneering Leadership Solutions for Tomorrow’s Challenges

May 10, 2024
Nasiyr Bey’s Journey from Brooklyn to Charlotte: The Entrepreneurial Path to Owning a Successful Cigar Lounge

Nasiyr Bey’s Journey from Brooklyn to Charlotte: The Entrepreneurial Path to Owning a Successful Cigar Lounge

August 8, 2024
Augmented.City Startup Developers Appeal To US Politicians With An Open Letter

Augmented.City Startup Developers Appeal To US Politicians With An Open Letter

0
U.S. High Court Snubs Challenge To State And Local Tax Deduction Cap

U.S. High Court Snubs Challenge To State And Local Tax Deduction Cap

0
GOP Lawmaker Blames Biden For Russia-Ukraine War: Putin ‘Could never have Invaded’

GOP Lawmaker Blames Biden For Russia-Ukraine War: Putin ‘Could never have Invaded’

0
Brad Winget’s Tips and Tricks on Having a Career in Real Estate

Brad Winget’s Tips and Tricks on Having a Career in Real Estate

0
AI Startup TML From Ex-OpenAI Exec Mira Murati Pays 0,000 | Entrepreneur

AI Startup TML From Ex-OpenAI Exec Mira Murati Pays $500,000 | Entrepreneur

July 1, 2025
Why Your Finance Team Needs an AI Strategy, Now | Entrepreneur

Why Your Finance Team Needs an AI Strategy, Now | Entrepreneur

July 1, 2025
He Went From 1K in Debt to Teaching Others How to Succeed | Entrepreneur

He Went From $471K in Debt to Teaching Others How to Succeed | Entrepreneur

July 1, 2025
How One Founder Is Rethinking Supplements With David Beckham | Entrepreneur

How One Founder Is Rethinking Supplements With David Beckham | Entrepreneur

July 1, 2025

The EW prides itself on assembling a proficient and dedicated team comprising seasoned journalists and editors. This collective commitment drives us to provide our esteemed readership with nothing short of the most comprehensive, accurate, and captivating news coverage available.

Transcending the bounds of Chicago to encompass a broader scope, we ensure that our audience remains well-informed and engaged with the latest developments, both locally and beyond.

NEWS

  • Business
  • Politics
  • Entrepreneurship
  • Entertainment
Instagram Facebook

© 2024 Entrepreneurs Weekly.  All Rights Reserved.

  • About Us
  • Advertise
  • Contact Us
No Result
View All Result
  • ENTREPRENEURSHIP
  • ENTERTAINMENT
  • POLITICS
  • BUSINESS
  • CONTACT US
  • ADVERTISEMENT

Copyright © 2024 - The Entrepreneurs Weekly

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In