{"id":13599,"date":"2024-07-28T17:00:03","date_gmt":"2024-07-28T17:00:03","guid":{"rendered":"https:\/\/dailyai.com\/?p=13599"},"modified":"2024-07-29T09:48:32","modified_gmt":"2024-07-29T09:48:32","slug":"ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds","status":"publish","type":"post","link":"https:\/\/dailyai.com\/pt\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/","title":{"rendered":"AI models face collapse when trained on AI-generated data, study finds"},"content":{"rendered":"<p><b>A new study published in Nature reveals that AI models, including large language models (LLMs), rapidly degrade in quality when trained on data generated by previous AI models.\u00a0<\/b><\/p>\n<p><span style=\"font-weight: 400;\">This phenomenon, termed \"model collapse,\" could erode the quality of future AI models, particularly as more AI-generated content is released onto the internet and, therefore, recycled and reused in model training data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Investigating this phenomenon, researchers from the University of Cambridge, University of Oxford, and other institutions <a href=\"https:\/\/www.nature.com\/articles\/s41586-024-07566-y\">conducted experiments<\/a> showing that when AI models are repeatedly trained on data produced by earlier versions of themselves, they start generating nonsensical outputs.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This was observed across different types of AI models, including language models, variational autoencoders, and Gaussian mixture models.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In one key experiment with language models, the team 
fine-tuned the OPT-125m model on the WikiText-2 dataset and then used it to generate new text. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">This AI-generated text was then used to train the next \"generation\" of the model, and the process was repeated over and over.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">It wasn't long before the models started producing increasingly improbable and nonsensical text.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">By the ninth generation, the model was generating complete gibberish, such as listing multiple non-existent types of \"jackrabbits\" when prompted about English church towers.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The researchers also observed how models lose information about \"rare\" or infrequent events before collapsing completely.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This is alarming, as rare events often relate to marginalized groups or outliers. Without them, models risk concentrating their responses across a narrow spectrum of ideas and beliefs, thus reinforcing biases.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">AI companies are aware of this, which is why they are striking deals with news companies and publishers to secure a steady stream of high-quality, human-written, topically relevant information.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\"The message is, we have to be very careful about what ends up in our training data,\" 
<\/span><span style=\"font-weight: 400;\">study<\/span><span style=\"font-weight: 400;\"> co-author Zakhar Shumaylov from the University of Cambridge <\/span><a href=\"https:\/\/www.nature.com\/articles\/d41586-024-02420-7\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">told Nature<\/span><\/a><span style=\"font-weight: 400;\">. \"Otherwise, things will always, provably, go wrong.\"<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Compounding this effect, a recent <\/span><a href=\"https:\/\/reutersinstitute.politics.ox.ac.uk\/how-many-news-websites-block-ai-crawlers\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">study<\/span><\/a><span style=\"font-weight: 400;\"> by Dr. Richard Fletcher, Director of Research at the Reuters Institute for the Study of Journalism, found that nearly half (48%) of the most popular news sites worldwide are now inaccessible to OpenAI's crawlers, with Google's AI crawlers blocked by 24% of sites.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">As a result, AI models have access to a smaller pool of high-quality, recent data than they once did, increasing the risk of training on sub-standard or outdated data.\u00a0<\/span><\/p>\n<h2>Solutions to model collapse<\/h2>\n<p><span style=\"font-weight: 400;\">Regarding solutions, the researchers state that maintaining access to original, human-generated data sources is vital for the future of AI.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Tracking and managing AI-generated content would also help prevent it from accidentally contaminating training datasets. 
That would be very tricky, as AI-generated content is becoming impossible to detect.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The researchers propose four main solutions:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Watermarking AI-generated content to distinguish it from human-created data<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Creating incentives for humans to continue producing high-quality content<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Developing more sophisticated filtering and curation methods for training data<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Exploring ways to preserve and prioritize access to original, non-AI-generated information<\/span><\/li>\n<\/ul>\n<h2>Model collapse is a real problem<\/h2>\n<p><span style=\"font-weight: 400;\">This study is far from the only one exploring model collapse.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Not long ago, Stanford researchers <\/span><a href=\"https:\/\/arxiv.org\/abs\/2404.01413\"><span style=\"font-weight: 400;\">compared two scenarios<\/span><\/a><span style=\"font-weight: 400;\"> in which model collapse might occur: one where each new model iteration's training data fully replaces the previous data, and another where synthetic data is added to the existing dataset.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When data was replaced, model performance deteriorated rapidly across all tested architectures.\u00a0<\/span><\/p>\n<p><span 
style=\"font-weight: 400;\">However, when data was allowed to \"accumulate,\" model collapse was largely avoided. The AI systems maintained their performance and, in some cases, showed improvements.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">So, despite credible concerns, model collapse isn't a foregone conclusion; it depends on how much AI-generated data is in the set and on the ratio of synthetic to authentic data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If and when model collapse starts to become evident in frontier models, you can be certain that AI companies will be scrambling for a long-term solution.\u00a0<\/span><\/p>\n<p>We're not there yet, but it might be a matter of when, not if.<\/p>","protected":false},"excerpt":{"rendered":"<p>A new study published in Nature reveals that AI models, including large language models (LLMs), rapidly degrade in quality when trained on data generated by previous AI models.  This phenomenon, termed \"model collapse,\" could erode the quality of future AI models, particularly as more AI-generated content is released onto the internet and, therefore, recycled and reused in model training data.  Investigating this phenomenon, researchers from the University of Cambridge, University of Oxford, and other institutions conducted experiments showing that when AI models are repeatedly trained on data produced by earlier versions of themselves, they start generating nonsensical outputs.  
This was<\/p>","protected":false},"author":2,"featured_media":13600,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[105,619],"class_list":["post-13599","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-machine-learning","tag-model-collapse"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>AI models face collapse when trained on AI-generated data, study finds | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/pt\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/\" \/>\n<meta property=\"og:locale\" content=\"pt_PT\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"AI models face collapse when trained on AI-generated data, study finds | DailyAI\" \/>\n<meta property=\"og:description\" content=\"A new study published in Nature reveals that AI models, including large language models (LLMs), rapidly degrade in quality when trained on data generated by previous AI models.\u00a0 This phenomenon, termed &#8220;model collapse,&#8221; could erode the quality of future AI models, particularly as more AI-generated content is released onto the internet and, therefore, recycled and reused in model training data.\u00a0 Investigating this phenomenon, researchers from the University of Cambridge, University of Oxford, and other institutions conducted experiments showing that when AI models are repeatedly trained on data produced by earlier versions of themselves, they start generating nonsensical outputs.\u00a0 This was\" \/>\n<meta property=\"og:url\" 
content=\"https:\/\/dailyai.com\/pt\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-07-28T17:00:03+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-07-29T09:48:32+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/07\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tempo estimado de leitura\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"AI models face collapse when trained on AI-generated data, study 
finds\",\"datePublished\":\"2024-07-28T17:00:03+00:00\",\"dateModified\":\"2024-07-29T09:48:32+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/\"},\"wordCount\":661,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/07\\\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp\",\"keywords\":[\"machine learning\",\"Model collapse\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"pt-PT\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/\",\"name\":\"AI models face collapse when trained on AI-generated data, study finds | 
DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/07\\\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp\",\"datePublished\":\"2024-07-28T17:00:03+00:00\",\"dateModified\":\"2024-07-29T09:48:32+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#breadcrumb\"},\"inLanguage\":\"pt-PT\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/07\\\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/07\\\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp\",\"width\":1792,\"height\":1024,\"caption\":\"AI 
models\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/07\\\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI models face collapse when trained on AI-generated data, study finds\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"pt-PT\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam 
Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pt-PT\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/pt\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Os modelos de IA enfrentam um colapso quando treinados em dados gerados por IA, segundo o estudo | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/pt\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/","og_locale":"pt_PT","og_type":"article","og_title":"AI models face collapse when trained on AI-generated data, study finds | DailyAI","og_description":"A new study published in Nature reveals that AI models, including large language models (LLMs), rapidly degrade in quality when trained on data generated by previous AI models.\u00a0 This phenomenon, termed &#8220;model collapse,&#8221; could erode the quality of future AI models, particularly as more AI-generated content is released onto the internet and, therefore, recycled and reused in model training data.\u00a0 Investigating this phenomenon, researchers from the 
University of Cambridge, University of Oxford, and other institutions conducted experiments showing that when AI models are repeatedly trained on data produced by earlier versions of themselves, they start generating nonsensical outputs.\u00a0 This was","og_url":"https:\/\/dailyai.com\/pt\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/","og_site_name":"DailyAI","article_published_time":"2024-07-28T17:00:03+00:00","article_modified_time":"2024-07-29T09:48:32+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/07\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp","type":"image\/webp"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Escrito por":"Sam Jeans","Tempo estimado de leitura":"4 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"AI models face collapse when trained on AI-generated data, study 
finds","datePublished":"2024-07-28T17:00:03+00:00","dateModified":"2024-07-29T09:48:32+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/"},"wordCount":661,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/07\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp","keywords":["machine learning","Model collapse"],"articleSection":["Industry"],"inLanguage":"pt-PT"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/","url":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/","name":"Os modelos de IA enfrentam um colapso quando treinados em dados gerados por IA, segundo o estudo | 
DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/07\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp","datePublished":"2024-07-28T17:00:03+00:00","dateModified":"2024-07-29T09:48:32+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#breadcrumb"},"inLanguage":"pt-PT","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/"]}]},{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/07\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/07\/DALL\u00b7E-2024-07-28-17.59.19-An-image-illustrating-AI-models-facing-collapse-when-trained-on-AI-generated-data.-The-scene-shows-an-abstract-representation-of-an-AI-model-breaking-.webp","width":1792,"height":1024,"caption":"AI 
models"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/07\/ai-models-face-collapse-when-trained-on-ai-generated-data-study-finds\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"AI models face collapse when trained on AI-generated data, study finds"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Your Daily Dose of AI News","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"pt-PT"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam 
Jeans","image":{"@type":"ImageObject","inLanguage":"pt-PT","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/pt\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts\/13599","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/comments?post=13599"}],"version-history":[{"count":6,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts\/13599\/revisions"}],"predecessor-version":[{"id":13609,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/posts\/13599\/revisions\/13609"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/media\/13600"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/media?parent=13599"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/categories?post=13599"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/pt\/wp-json\/wp\/v2\/tags?post=13599"}],"curies":[{"
name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}