Apple, Anthropic and other companies used YouTube videos to train AI

  • 📰 verge
  • ⏱ Reading Time:
  • 17 sec. here
  • 2 min. at publisher
  • 📊 Quality Score:
  • News: 10%
  • Publisher: 67%

Россия Новости Новости

Россия Последние новости,Россия Последние новости

A dataset of YouTube video subtitles that was used by Apple, Nvidia, and Anthropic to train AI includes material from MrBeast, the BBC, and even The Verge.

More than 170,000 YouTube videos are part of a massive dataset that was used to train AI systems for some of the biggest technology companies, according to an investigation by Proof News and copublished with Wired. Apple, Anthropic, Nvidia, and Salesforce are among the tech firms that used the “YouTube Subtitles” data that was ripped from the video platform without permission.

You can use its search feature to see if your content — or your favorite YouTuber’s — appears in the dataset. The subtitles dataset is part of a larger collection of material from the nonprofit EleutherAI called The Pile. The open-source collection known as the Pile also contains datasets of books, Wikipedia articles, and more.

 

Спасибо за ваш комментарий. Ваш комментарий будет опубликован после проверки
Мы обобщили эту новость, чтобы вы могли ее быстро прочитать.Если новость вам интересна, вы можете прочитать полный текст здесь Прочитайте больше:

 /  🏆 94. in RU

Россия Последние новости, Россия Последние новости