{"id":11426,"date":"2024-04-08T17:45:24","date_gmt":"2024-04-08T17:45:24","guid":{"rendered":"https:\/\/dailyai.com\/?p=11426"},"modified":"2024-04-09T08:28:17","modified_gmt":"2024-04-09T08:28:17","slug":"inside-big-techs-tussle-over-ai-training-data","status":"publish","type":"post","link":"https:\/\/dailyai.com\/pt\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","title":{"rendered":"Por dentro da luta das grandes empresas de tecnologia pelos dados de treino da IA"},"content":{"rendered":"<p><b>Na busca fren\u00e9tica de dados de treino de IA, os gigantes tecnol\u00f3gicos OpenAI, Google e Meta ter\u00e3o contornado as pol\u00edticas empresariais, alterado as suas regras e discutido a possibilidade de contornar a lei dos direitos de autor.\u00a0<\/b><\/p>\n<p><span style=\"font-weight: 400;\">A <\/span><a href=\"https:\/\/www.nytimes.com\/2024\/04\/06\/technology\/tech-giants-harvest-data-artificial-intelligence.html?smid=nytcore-ios-share&amp;sgrp=c-cb\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Investiga\u00e7\u00e3o do New York Times<\/span><\/a><span style=\"font-weight: 400;\"> revela at\u00e9 que ponto estas empresas foram para recolher informa\u00e7\u00f5es online para alimentar os seus sistemas de IA \u00e1vidos de dados.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">No final de 2021, os investigadores da OpenAI desenvolveram uma ferramenta de reconhecimento de voz chamada Whisper para transcrever v\u00eddeos do YouTube quando se deparam com uma escassez de dados de texto respeit\u00e1veis em ingl\u00eas.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Apesar das discuss\u00f5es internas sobre a poss\u00edvel viola\u00e7\u00e3o das regras do YouTube, que pro\u00edbem a utiliza\u00e7\u00e3o dos seus v\u00eddeos para aplica\u00e7\u00f5es \"independentes\",\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">O NYT descobriu que a OpenAI acabou por transcrever mais de um milh\u00e3o de horas de conte\u00fados do YouTube. Greg Brockman, presidente da OpenAI, ajudou pessoalmente na recolha dos v\u00eddeos. O texto transcrito foi depois introduzido no GPT-4.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">A Google tamb\u00e9m transcreveu alegadamente v\u00eddeos do YouTube para recolher texto para os seus modelos de IA, infringindo potencialmente os direitos de autor dos criadores de v\u00eddeos. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Isto acontece dias depois de o diretor executivo do YouTube ter dito que essa atividade violaria a <\/span><a href=\"https:\/\/dailyai.com\/pt\/2024\/04\/youtube-ceo-warns-openai-about-potential-terms-of-service-violation\/\"><span style=\"font-weight: 400;\">termos de servi\u00e7o da empresa<\/span><\/a><span style=\"font-weight: 400;\"> e minar os criadores.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Em junho de 2023, o departamento jur\u00eddico da Google solicitou altera\u00e7\u00f5es \u00e0 pol\u00edtica de privacidade da empresa, permitindo a disponibiliza\u00e7\u00e3o p\u00fablica de conte\u00fados do Google Docs e de outras aplica\u00e7\u00f5es Google para uma gama mais vasta de produtos de IA.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">O Meta, que enfrenta a sua pr\u00f3pria escassez de dados, considerou v\u00e1rias op\u00e7\u00f5es para adquirir mais dados de forma\u00e7\u00e3o.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Os executivos discutiram o pagamento de direitos de licenciamento de livros, a compra da editora Simon &amp; Schuster e at\u00e9 mesmo a recolha de material protegido por direitos de autor da Internet sem autoriza\u00e7\u00e3o, arriscando-se a potenciais processos judiciais.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Os advogados da Meta argumentaram que a utiliza\u00e7\u00e3o de dados para treinar sistemas de IA deveria ser abrangida pela \"utiliza\u00e7\u00e3o justa\", citando uma decis\u00e3o judicial de 2015 que envolvia o projeto de digitaliza\u00e7\u00e3o de livros da Google.<\/span><\/p>\n<h2>Preocupa\u00e7\u00f5es \u00e9ticas e o futuro dos dados de treino da IA<\/h2>\n<p><span style=\"font-weight: 400;\">As ac\u00e7\u00f5es colectivas destas empresas tecnol\u00f3gicas sublinham a import\u00e2ncia cr\u00edtica dos dados em linha na ind\u00fastria da IA em expans\u00e3o.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Estas pr\u00e1ticas suscitaram preocupa\u00e7\u00f5es quanto \u00e0 viola\u00e7\u00e3o dos direitos de autor e \u00e0 compensa\u00e7\u00e3o justa dos criadores.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Justine Bateman, cineasta e autora, disse ao Gabinete de Direitos de Autor que os modelos de IA estavam a retirar conte\u00fados - incluindo os seus escritos e filmes - sem autoriza\u00e7\u00e3o ou pagamento. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">\"Este \u00e9 o maior roubo nos Estados Unidos, ponto final\", afirmou numa entrevista.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Nas artes visuais, o MidJourney e outros modelos de imagem t\u00eam sido <\/span><a href=\"https:\/\/dailyai.com\/pt\/2024\/01\/16000-artist-names-leaked-as-midjourney-styles\/\"><span style=\"font-weight: 400;\">comprovadamente gerador de direitos de autor<\/span><\/a><span style=\"font-weight: 400;\"> conte\u00fado, como cenas de filmes da Marvel.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Com alguns especialistas a preverem que os dados em linha de alta qualidade poder\u00e3o esgotar-se at\u00e9 2026, as empresas est\u00e3o a explorar m\u00e9todos alternativos, como a gera\u00e7\u00e3o de dados sint\u00e9ticos utilizando modelos de IA.\u00a0<\/span><span style=\"font-weight: 400;\">No entanto, os dados de forma\u00e7\u00e3o sint\u00e9ticos t\u00eam os seus pr\u00f3prios riscos e desafios e podem prejudicar <\/span><a href=\"https:\/\/dailyai.com\/pt\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/\"><span style=\"font-weight: 400;\">afetar a qualidade dos modelos<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">O pr\u00f3prio CEO da OpenAI, Sam Altman, reconheceu a natureza finita dos dados online num discurso proferido numa confer\u00eancia de tecnologia em maio de 2023: \"Isso vai acabar\", disse ele.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Sy Damle, um advogado que representa a Andreessen Horowitz, uma empresa de capital de risco de Silicon Valley, tamb\u00e9m falou sobre o desafio: \"A \u00fanica forma pr\u00e1tica de estas ferramentas existirem \u00e9 se puderem ser treinadas com grandes quantidades de dados sem terem de os licenciar. Os dados necess\u00e1rios s\u00e3o t\u00e3o grandes que mesmo o licenciamento coletivo n\u00e3o pode funcionar\".<\/span><\/p>\n<p>O NYT e a OpenAI est\u00e3o envolvidos numa <a href=\"https:\/\/dailyai.com\/pt\/2023\/08\/the-new-york-times-may-sue-openai-over-copyright-claims\/\">amarga a\u00e7\u00e3o judicial por direitos de autor<\/a>O Times procura obter uma indemniza\u00e7\u00e3o de milh\u00f5es de euros.<\/p>\n<p>A OpenAI ripostou, acusando o Times de <a href=\"https:\/\/dailyai.com\/pt\/2024\/02\/openai-blasts-the-new-york-times-claiming-they-hacked-their-evidence\/\">\"piratear\" os seus modelos<\/a> para obter exemplos de viola\u00e7\u00e3o de direitos de autor.<\/p>\n<p>Por \"pirataria inform\u00e1tica\", entende-se jailbreaking ou red-teaming, que consiste em utilizar o modelo com instru\u00e7\u00f5es especialmente formuladas com o objetivo de manipular os resultados.<\/p>\n<p>O NYT afirmou que n\u00e3o teriam de recorrer a modelos de jailbreak se as empresas de IA fossem transparentes quanto aos dados que utilizaram.<\/p>\n<p>Sem d\u00favida, esta investiga\u00e7\u00e3o interna torna o roubo de dados da Big Tech ainda mais inaceit\u00e1vel do ponto de vista \u00e9tico e jur\u00eddico.<\/p>\n<p><span style=\"font-weight: 400;\">Com os processos judiciais a acumularem-se,<\/span><span style=\"font-weight: 400;\">\u00a0o panorama jur\u00eddico em torno da utiliza\u00e7\u00e3o de dados em linha para treino de IA \u00e9 extremamente prec\u00e1rio.\u00a0<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Na busca fren\u00e9tica de dados de treino de IA, os gigantes tecnol\u00f3gicos OpenAI, Google e Meta ter\u00e3o contornado pol\u00edticas empresariais, alterado as suas regras e discutido a possibilidade de contornar a lei dos direitos de autor.  Uma investiga\u00e7\u00e3o do New York Times revela at\u00e9 que ponto essas empresas foram para coletar informa\u00e7\u00f5es online para alimentar seus sistemas de IA famintos por dados. No final de 2021, os pesquisadores da OpenAI desenvolveram uma ferramenta de reconhecimento de voz chamada Whisper para transcrever v\u00eddeos do YouTube quando enfrentavam uma escassez de dados de texto em ingl\u00eas confi\u00e1veis.  Apesar das discuss\u00f5es internas sobre a possibilidade de violar as regras do YouTube, que pro\u00edbem o uso de seus v\u00eddeos para aplicativos \"independentes\", o NYT descobriu que o OpenAI acabou transcrevendo mais de um milh\u00e3o de horas<\/p>","protected":false},"author":2,"featured_media":11427,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[88],"tags":[197],"class_list":["post-11426","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ethics","tag-copyright"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Inside Big Tech\u2019s tussle over AI training data | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/pt\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/\" \/>\n<meta property=\"og:locale\" content=\"pt_PT\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Inside Big Tech\u2019s tussle over AI training data | DailyAI\" \/>\n<meta property=\"og:description\" content=\"In the frantic pursuit of AI training data, tech giants OpenAI, Google, and Meta have reportedly bypassed corporate policies, altered their rules, and discussed circumventing copyright law.\u00a0 A New York Times investigation reveals the lengths these companies have gone to harvest online information to feed their data-hungry AI systems. In late 2021, OpenAI researchers developed a speech recognition tool called Whisper to transcribe YouTube videos when facing a shortage of reputable English-language text data.\u00a0 Despite internal discussions about potentially violating YouTube&#8217;s rules, which prohibit using its videos for &#8220;independent&#8221; applications,\u00a0 NYT found that OpenAI ultimately transcribed over one million hours\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/pt\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-08T17:45:24+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-09T08:28:17+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tempo estimado de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"Inside Big Tech\u2019s tussle over AI training data\",\"datePublished\":\"2024-04-08T17:45:24+00:00\",\"dateModified\":\"2024-04-09T08:28:17+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\"},\"wordCount\":621,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"keywords\":[\"Copyright\"],\"articleSection\":[\"Ethics &amp; Society\"],\"inLanguage\":\"pt-PT\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\",\"name\":\"Inside Big Tech\u2019s tussle over AI training data | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"datePublished\":\"2024-04-08T17:45:24+00:00\",\"dateModified\":\"2024-04-09T08:28:17+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#breadcrumb\"},\"inLanguage\":\"pt-PT\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp\",\"width\":1792,\"height\":1024,\"caption\":\"Data\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/inside-big-techs-tussle-over-ai-training-data\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Inside Big Tech\u2019s tussle over AI training data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-PT\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/pt\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Por dentro da disputa da Big Tech sobre os dados de treinamento de IA | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/pt\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","og_locale":"pt_PT","og_type":"article","og_title":"Inside Big Tech\u2019s tussle over AI training data | DailyAI","og_description":"In the frantic pursuit of AI training data, tech giants OpenAI, Google, and Meta have reportedly bypassed corporate policies, altered their rules, and discussed circumventing copyright law.\u00a0 A New York Times investigation reveals the lengths these companies have gone to harvest online information to feed their data-hungry AI systems. In late 2021, OpenAI researchers developed a speech recognition tool called Whisper to transcribe YouTube videos when facing a shortage of reputable English-language text data.\u00a0 Despite internal discussions about potentially violating YouTube&#8217;s rules, which prohibit using its videos for &#8220;independent&#8221; applications,\u00a0 NYT found that OpenAI ultimately transcribed over one million hours","og_url":"https:\/\/dailyai.com\/pt\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","og_site_name":"DailyAI","article_published_time":"2024-04-08T17:45:24+00:00","article_modified_time":"2024-04-09T08:28:17+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","type":"image\/webp"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Escrito por":"Sam Jeans","Tempo estimado de leitura":"3 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"Inside Big Tech\u2019s tussle over AI training data","datePublished":"2024-04-08T17:45:24+00:00","dateModified":"2024-04-09T08:28:17+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/"},"wordCount":621,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","keywords":["Copyright"],"articleSection":["Ethics &amp; Society"],"inLanguage":"pt-PT"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","url":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/","name":"Por dentro da disputa da Big Tech sobre os dados de treinamento de IA | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","datePublished":"2024-04-08T17:45:24+00:00","dateModified":"2024-04-09T08:28:17+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#breadcrumb"},"inLanguage":"pt-PT","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/"]}]},{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-08-18.42.46-Visualize-a-dramatic-and-futuristic-scene-inside-a-vast-data-center-filled-with-towering-server-racks-emitting-blue-and-red-lights-casting-a-vibrant.webp","width":1792,"height":1024,"caption":"Data"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/inside-big-techs-tussle-over-ai-training-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Inside Big Tech\u2019s tussle over AI training data"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"A sua dose di\u00e1ria de not\u00edcias sobre IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-PT"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Cal\u00e7as de ganga Sam","image":{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam \u00e9 um escritor de ci\u00eancia e tecnologia que trabalhou em v\u00e1rias startups de IA. Quando n\u00e3o est\u00e1 a escrever, pode ser encontrado a ler revistas m\u00e9dicas ou a vasculhar caixas de discos de vinil.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/pt\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts\/11426","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/comments?post=11426"}],"version-history":[{"count":7,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts\/11426\/revisions"}],"predecessor-version":[{"id":11434,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts\/11426\/revisions\/11434"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/media\/11427"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/media?parent=11426"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/categories?post=11426"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/tags?post=11426"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}