Apple, Anthropic, and other companies used YouTube videos to train AI

📆 2024-07-16 18:04:00
📰 verge

⏱ Reading Time:
22 sec. here
2 min. at publisher
📊 Quality Score:
News: 12%
Publisher: 67%

Sverige Nyheter Nyheter

Sverige Senaste nytt,Sverige Rubriker

A dataset of YouTube video subtitles that was used by Apple, Nvidia, and Anthropic to train AI includes material from MrBeast, the BBC, and even The Verge.

More than 170,000 YouTube videos are part of a massive dataset that was used to train AI systems for some of the biggest technology companies, according to an investigation by Proof News and copublished with Wired. Apple, Anthropic, Nvidia, and Salesforce are among the tech firms that used the “YouTube Subtitles” data that was ripped from the video platform without permission.

The subtitles dataset is part of a larger collection of material from the nonprofit EleutherAI called The Pile, an open-source collection that also contains datasets of books, Wikipedia articles, and more. Last year, an analysis of one dataset called Books3 revealed which authors’ work had been used to train AI systems, and the dataset has been cited in lawsuits by authors against the companies that used it to train AI.

Skriv Kommentar

Vi har sammanfattat den här nyheten så att du kan läsa den snabbt. Om du är intresserad av nyheterna kan du läsa hela texten här. Läs mer:

Sverige Senaste nytt, Sverige Rubriker

Similar News:Du kan också läsa nyheter som liknar den här som vi har samlat in från andra nyhetskällor.

Apple, Anthropic and other companies used YouTube videos to train AIA dataset of YouTube video subtitles that was used by Apple, Nvidia, and Anthropic to train AI includes material from MrBeast, the BBC, and even The Verge.
Källa: verge - 🏆 94. / 67 Läs mer »