{"id":11426,"date":"2024-04-08T17:45:24","date_gmt":"2024-04-08T17:45:24","guid":{"rendered":"https:\/\/dailyai.com\/?p=11426"},"modified":"2024-04-09T08:28:17","modified_gmt":"2024-04-09T08:28:17","slug":"inside-big-techs-tussle-over-ai-training-data","status":"publish","type":"post","link":"https:\/\/dailyai.com\/nl\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","title":{"rendered":"Een kijkje in de keuken van Big Tech's strijd om AI-trainingsgegevens"},"content":{"rendered":"<p><b>In de verwoede jacht op AI-trainingsgegevens hebben techgiganten OpenAI, Google en Meta naar verluidt het bedrijfsbeleid omzeild, hun regels aangepast en gesproken over het omzeilen van auteursrechtwetten.\u00a0<\/b><\/p>\n<p><span style=\"font-weight: 400;\">A <\/span><a href=\"https:\/\/www.nytimes.com\/2024\/04\/06\/technology\/tech-giants-harvest-data-artificial-intelligence.html?smid=nytcore-ios-share&amp;sgrp=c-cb\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Onderzoek New York Times<\/span><\/a><span style=\"font-weight: 400;\"> onthult hoeveel moeite deze bedrijven hebben gedaan om online informatie te verzamelen om hun gegevensverslindende AI-systemen te voeden.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Eind 2021 ontwikkelden OpenAI-onderzoekers een spraakherkenningshulpmiddel met de naam Whisper om YouTube-video's te transcriberen bij een tekort aan gerenommeerde Engelstalige tekstgegevens.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ondanks interne discussies over het mogelijk schenden van de regels van YouTube, die het gebruik van video's voor \"onafhankelijke\" toepassingen verbieden,\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">NYT ontdekte dat OpenAI uiteindelijk meer dan een miljoen uur aan YouTube-content heeft getranscribeerd. Greg Brockman, de voorzitter van OpenAI, hielp persoonlijk bij het verzamelen van de video's. De getranscribeerde tekst werd vervolgens ingevoerd in GPT-4.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Google zou ook YouTube-video's hebben getranscribeerd om tekst te verzamelen voor zijn AI-modellen, waardoor mogelijk auteursrechten van videomakers werden geschonden. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Dit komt dagen nadat de CEO van YouTube zei dat dergelijke activiteiten in strijd zouden zijn met de <\/span><a href=\"https:\/\/dailyai.com\/nl\/2024\/04\/youtube-ceo-warns-openai-about-potential-terms-of-service-violation\/\"><span style=\"font-weight: 400;\">servicevoorwaarden van het bedrijf<\/span><\/a><span style=\"font-weight: 400;\"> en ondermijnen scheppers.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In juni 2023 verzocht de juridische afdeling van Google om wijzigingen in het privacybeleid van het bedrijf, waardoor openbaar beschikbare inhoud van Google Docs en andere Google-apps voor een breder scala aan AI-producten mogelijk zou worden.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Meta, dat zelf met een gegevenstekort kampt, heeft verschillende opties overwogen om meer trainingsgegevens te verkrijgen.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Leidinggevenden bespraken het betalen voor licentierechten op boeken, het kopen van uitgeverij Simon &amp; Schuster en zelfs het verzamelen van auteursrechtelijk beschermd materiaal van het internet zonder toestemming, met het risico op mogelijke rechtszaken.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">De advocaten van Meta voerden aan dat het gebruik van gegevens om AI-systemen te trainen onder \"eerlijk gebruik\" zou moeten vallen, waarbij ze verwezen naar een gerechtelijke uitspraak uit 2015 over het scannen van boeken door Google.<\/span><\/p>\n<h2>Ethische bezwaren en de toekomst van AI-trainingsgegevens<\/h2>\n<p><span style=\"font-weight: 400;\">De collectieve acties van deze techbedrijven benadrukken het cruciale belang van online gegevens in de bloeiende AI-industrie.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Deze praktijken hebben geleid tot bezorgdheid over schending van het auteursrecht en een eerlijke vergoeding voor makers.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Een filmmaker en schrijfster, Justine Bateman, vertelde het Copyright Office dat AI-modellen zonder toestemming of betaling inhoud stalen - waaronder haar schrijfsels en films. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">\"Dit is de grootste diefstal in de Verenigde Staten, punt,\" zei ze in een interview.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In de beeldende kunst zijn MidJourney en andere beeldmodellen <\/span><a href=\"https:\/\/dailyai.com\/nl\/2024\/01\/16000-artist-names-leaked-as-midjourney-styles\/\"><span style=\"font-weight: 400;\">bewezen auteursrechten te genereren<\/span><\/a><span style=\"font-weight: 400;\"> inhoud, zoals sc\u00e8nes uit Marvel-films.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Nu sommige experts voorspellen dat online gegevens van hoge kwaliteit in 2026 uitgeput kunnen zijn, onderzoeken bedrijven alternatieve methoden, zoals het zelf genereren van synthetische gegevens met behulp van AI-modellen.\u00a0<\/span><span style=\"font-weight: 400;\">Synthetische trainingsgegevens brengen echter hun eigen risico's en uitdagingen met zich mee en kunnen een nadelig effect hebben op <\/span><a href=\"https:\/\/dailyai.com\/nl\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/\"><span style=\"font-weight: 400;\">invloed hebben op de kwaliteit van modellen<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">OpenAI CEO Sam Altman erkende zelf de eindigheid van online data in een toespraak op een tech-conferentie in mei 2023: \"Dat zal opraken,\" zei hij.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Sy Damle, een advocaat van Andreessen Horowitz, een risicokapitaalbedrijf uit Silicon Valley, ging ook in op de uitdaging: \"De enige praktische manier waarop deze tools kunnen bestaan, is als ze getraind kunnen worden op enorme hoeveelheden gegevens zonder dat er een licentie voor die gegevens nodig is. De benodigde gegevens zijn zo enorm dat zelfs collectieve licenties echt niet kunnen werken.\"<\/span><\/p>\n<p>De NYT en OpenAI zijn verwikkeld in een <a href=\"https:\/\/dailyai.com\/nl\/2023\/08\/the-new-york-times-may-sue-openai-over-copyright-claims\/\">bittere rechtszaak over auteursrecht<\/a>De Times eist waarschijnlijk miljoenen aan schadevergoeding.<\/p>\n<p>OpenAI sloeg terug en beschuldigde de Times van <a href=\"https:\/\/dailyai.com\/nl\/2024\/02\/openai-blasts-the-new-york-times-claiming-they-hacked-their-evidence\/\">hun modellen 'hacken<\/a> om voorbeelden van auteursrechtschendingen op te zoeken.<\/p>\n<p>Met 'hacken' bedoelen ze jailbreaking of red-teaming, waarbij het model wordt benaderd met speciaal geformuleerde prompts die bedoeld zijn om te breken om de resultaten te manipuleren.<\/p>\n<p>De NYT zei dat ze hun toevlucht niet zouden hoeven te nemen tot het jailbreaken van modellen als AI-bedrijven transparant zouden zijn over de gegevens die ze gebruiken.<\/p>\n<p>Ongetwijfeld maakt dit interne onderzoek de gegevensroof van Big Tech ethisch en juridisch onaanvaardbaar.<\/p>\n<p><span style=\"font-weight: 400;\">De rechtszaken stapelen zich op,<\/span><span style=\"font-weight: 400;\">\u00a0het juridische landschap rondom het gebruik van online data voor AI-training is uiterst precair.\u00a0<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>In hun verwoede jacht op AI-trainingsgegevens hebben techgiganten OpenAI, Google en Meta naar verluidt het bedrijfsbeleid omzeild, hun regels aangepast en gesproken over het omzeilen van auteursrechtwetten.  Een onderzoek van de New York Times onthult hoe ver deze bedrijven zijn gegaan om online informatie te verzamelen om hun gegevensverslindende AI-systemen te voeden. Eind 2021 ontwikkelden OpenAI-onderzoekers een spraakherkenningstool met de naam Whisper om YouTube-video's te transcriberen wanneer er een tekort was aan betrouwbare Engelstalige tekstgegevens.  Ondanks interne discussies over het mogelijk schenden van de regels van YouTube, die het gebruik van YouTube-video's voor \"onafhankelijke\" toepassingen verbieden, ontdekte NYT dat OpenAI uiteindelijk meer dan een miljoen uur aan transcripties heeft gemaakt.<\/p>","protected":false},"author":2,"featured_media":11427,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[88],"tags":[197],"class_list":["post-11426","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ethics","tag-copyright"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Inside Big Tech\u2019s tussle over AI training data | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/nl\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/\" \/>\n<meta property=\"og:locale\" content=\"nl_NL\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Inside Big Tech\u2019s tussle over AI training data | DailyAI\" \/>\n<meta property=\"og:description\" content=\"In the frantic pursuit of AI training data, tech giants OpenAI, Google, and Meta have reportedly bypassed corporate policies, altered their rules, and discussed circumventing copyright law.\u00a0 A New York Times investigation reveals the lengths these companies have gone to harvest online information to feed their data-hungry AI systems. In late 2021, OpenAI researchers developed a speech recognition tool called Whisper to transcribe YouTube videos when facing a shortage of reputable English-language text data.\u00a0 Despite internal discussions about potentially violating YouTube&#8217;s rules, which prohibit using its videos for &#8220;independent&#8221; applications,\u00a0 NYT found that OpenAI ultimately transcribed over one million hours\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/nl\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-08T17:45:24+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-09T08:28:17+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Geschreven door\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Geschatte leestijd\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"Inside Big Tech\u2019s tussle over AI training data\",\"datePublished\":\"2024-04-08T17:45:24+00:00\",\"dateModified\":\"2024-04-09T08:28:17+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\"},\"wordCount\":621,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"keywords\":[\"Copyright\"],\"articleSection\":[\"Ethics &amp; Society\"],\"inLanguage\":\"nl-NL\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\",\"name\":\"Inside Big Tech\u2019s tussle over AI training data | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"datePublished\":\"2024-04-08T17:45:24+00:00\",\"dateModified\":\"2024-04-09T08:28:17+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#breadcrumb\"},\"inLanguage\":\"nl-NL\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"width\":1792,\"height\":1024,\"caption\":\"Data\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Inside Big Tech\u2019s tussle over AI training data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"nl-NL\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/nl\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Een kijkje in de keuken van Big Tech's strijd om AI-trainingsgegevens | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/nl\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","og_locale":"nl_NL","og_type":"article","og_title":"Inside Big Tech\u2019s tussle over AI training data | DailyAI","og_description":"In the frantic pursuit of AI training data, tech giants OpenAI, Google, and Meta have reportedly bypassed corporate policies, altered their rules, and discussed circumventing copyright law.\u00a0 A New York Times investigation reveals the lengths these companies have gone to harvest online information to feed their data-hungry AI systems. In late 2021, OpenAI researchers developed a speech recognition tool called Whisper to transcribe YouTube videos when facing a shortage of reputable English-language text data.\u00a0 Despite internal discussions about potentially violating YouTube&#8217;s rules, which prohibit using its videos for &#8220;independent&#8221; applications,\u00a0 NYT found that OpenAI ultimately transcribed over one million hours","og_url":"https:\/\/dailyai.com\/nl\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","og_site_name":"DailyAI","article_published_time":"2024-04-08T17:45:24+00:00","article_modified_time":"2024-04-09T08:28:17+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","type":"image\/webp"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Geschreven door":"Sam Jeans","Geschatte leestijd":"3 minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"Inside Big Tech\u2019s tussle over AI training data","datePublished":"2024-04-08T17:45:24+00:00","dateModified":"2024-04-09T08:28:17+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/"},"wordCount":621,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","keywords":["Copyright"],"articleSection":["Ethics &amp; Society"],"inLanguage":"nl-NL"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","url":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","name":"Een kijkje in de keuken van Big Tech's strijd om AI-trainingsgegevens | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","datePublished":"2024-04-08T17:45:24+00:00","dateModified":"2024-04-09T08:28:17+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#breadcrumb"},"inLanguage":"nl-NL","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/"]}]},{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","width":1792,"height":1024,"caption":"Data"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Inside Big Tech\u2019s tussle over AI training data"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Uw dagelijkse dosis AI-nieuws","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"nl-NL"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam is een wetenschap- en technologieschrijver die bij verschillende AI-startups heeft gewerkt. Als hij niet aan het schrijven is, leest hij medische tijdschriften of graaft hij door dozen met vinylplaten.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/nl\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/11426","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/comments?post=11426"}],"version-history":[{"count":7,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/11426\/revisions"}],"predecessor-version":[{"id":11434,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/11426\/revisions\/11434"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/media\/11427"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/media?parent=11426"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/categories?post=11426"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/tags?post=11426"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}