{"id":1871,"date":"2023-06-18T22:43:50","date_gmt":"2023-06-18T22:43:50","guid":{"rendered":"https:\/\/dailyai.com\/?p=1871"},"modified":"2024-03-28T00:48:00","modified_gmt":"2024-03-28T00:48:00","slug":"what-happens-when-ai-starts-consuming-its-own-output","status":"publish","type":"post","link":"https:\/\/dailyai.com\/sv\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/","title":{"rendered":"Vad h\u00e4nder n\u00e4r AI b\u00f6rjar konsumera sin egen produktion?"},"content":{"rendered":"<p><strong>Data \u00e4r AI:s livsnerv, men det \u00e4r inte en o\u00e4ndlig resurs. Kan m\u00e4nskligheten f\u00e5 slut p\u00e5 data? Vad h\u00e4nder om vi g\u00f6r det?<\/strong><\/p>\n<p><span style=\"font-weight: 400\">Komplexa AI-modeller kr\u00e4ver stora m\u00e4ngder tr\u00e4ningsdata. F\u00f6r att tr\u00e4na en stor spr\u00e5kmodell (LLM) som ChatGPT kr\u00e4vs till exempel cirka 10 biljoner ord.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Vissa experter anser att tillg\u00e5ngen p\u00e5 data av h\u00f6g kvalitet minskar. Till exempel visade en studie fr\u00e5n 2022 fr\u00e5n forskare vid flera universitet <a href=\"https:\/\/arxiv.org\/pdf\/2211.04325.pdf\">uttalade<\/a>, <\/span><span style=\"font-weight: 400\">\"V\u00e5r analys tyder p\u00e5 att lagret av h\u00f6gkvalitativa spr\u00e5kdata snart kommer att vara utt\u00f6mt, sannolikt f\u00f6re 2026 ... V\u00e5rt arbete tyder p\u00e5 att den nuvarande trenden med st\u00e4ndigt v\u00e4xande ML-modeller som f\u00f6rlitar sig p\u00e5 enorma datam\u00e4ngder kan sakta ner om dataeffektiviteten inte f\u00f6rb\u00e4ttras drastiskt eller nya datak\u00e4llor blir tillg\u00e4ngliga.\"\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Att generera syntetisk data \u00e4r en l\u00f6sning, men den lyckas i allm\u00e4nhet inte f\u00e5nga djupet, nyanserna och variansen i verklig data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">F\u00f6r att ytterligare komplicera situationen finns det farh\u00e5gor om vad som h\u00e4nder n\u00e4r AI b\u00f6rjar konsumera sin egen produktion, vilket forskare vid \u00c9cole Polytechnique F\u00e9d\u00e9rale de Lausanne (EPFL) i Schweiz anser \u00e4r <a href=\"https:\/\/www.theregister.com\/2023\/06\/16\/crowd_workers_bots_ai_training\/\">h\u00e4nder redan<\/a>. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Deras forskning tyder p\u00e5 att AI-f\u00f6retag som k\u00f6per m\u00e4nskligt producerad data via plattformar som Amazon Mechanical Turk kan f\u00e5 AI-genererad data ist\u00e4llet.\u00a0<\/span><\/p>\n<p>Vad h\u00e4nder n\u00e4r AI b\u00f6rjar \u00e4ta sin egen produktion? G\u00e5r det att undvika?<\/p>\n<h2><span style=\"font-weight: 400\">Att bygga upp dataset \u00e4r dyrt och tidskr\u00e4vande - och insatserna \u00e4r h\u00f6ga<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Data finns \u00f6verallt, men att operationalisera dem f\u00f6r AI \u00e4r en komplex process. Kvaliteten p\u00e5 data och etiketter p\u00e5verkar modellens prestanda - det \u00e4r ett fall av \"skr\u00e4p in, skr\u00e4p ut\".\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">F\u00f6r att kortfattat beskriva processen f\u00f6r att bygga upp dataset tar dataantecknare (eller etiketterare) bearbetade data (t.ex. en beskuren bild) och etiketterar <\/span><span style=\"font-weight: 400\">funktioner (t.ex. en bil, en person, en f\u00e5gel).\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Detta ger algoritmerna ett \"m\u00e5l\" att l\u00e4ra sig fr\u00e5n. Algoritmerna extraherar och analyserar funktioner fr\u00e5n m\u00e4rkta data f\u00f6r att f\u00f6ruts\u00e4ga dessa funktioner i nya, osedda data. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Detta kr\u00e4vs f\u00f6r \u00f6vervakad maskininl\u00e4rning, som \u00e4r en av de viktigaste grenarna inom maskininl\u00e4rning tillsammans med o\u00f6vervakad maskininl\u00e4rning och f\u00f6rst\u00e4rkningsinl\u00e4rning. Genom att <a href=\"https:\/\/medium.com\/cognilytica\/data-preparation-labeling-for-ai-2020-b512a5ed777c\">vissa uppskattningar<\/a>F\u00f6rberedelse- och m\u00e4rkningsprocessen f\u00f6r data upptar 80% av ett maskininl\u00e4rningsmodellprojekts varaktighet, men att sk\u00e4ra f\u00f6r m\u00e5nga h\u00f6rn riskerar att \u00e4ventyra en modells prestanda. <\/span><\/p>\n<p><span style=\"font-weight: 400\">F\u00f6rutom de praktiska utmaningarna med att skapa h\u00f6gkvalitativa dataset f\u00f6r\u00e4ndras datas natur hela tiden. Det som f\u00f6r 10 \u00e5r sedan definierades som en \"dataset som inneh\u00e5ller ett typiskt urval av fordon p\u00e5 v\u00e4garna\" \u00e4r inte detsamma idag. Nu hittar du till exempel ett mycket st\u00f6rre antal eScooters och eBikes p\u00e5 v\u00e4garna.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Dessa kallas \"edge cases\", vilket \u00e4r s\u00e4llsynta objekt eller fenomen som inte f\u00f6rekommer i dataset.\u00a0<\/span><\/p>\n<h2><span style=\"font-weight: 400\">Modellerna \u00e5terspeglar kvaliteten p\u00e5 deras dataset<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Om du tr\u00e4nar ett modernt AI-system p\u00e5 ett gammalt dataset riskerar modellen att f\u00e5 l\u00e5g prestanda n\u00e4r den uts\u00e4tts f\u00f6r nya, osedda data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Mellan 2015 och 2020 uppt\u00e4ckte forskare stora strukturella fel i AI-algoritmer, som delvis berodde p\u00e5 att modellerna tr\u00e4nades p\u00e5 gamla och partiska data. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Till exempel kan <\/span><a href=\"http:\/\/vis-www.cs.umass.edu\/lfw\/\"><span style=\"font-weight: 400\">M\u00e4rkta ansikten i det vilda hemmet (LFW)<\/span><\/a><span style=\"font-weight: 400\">, ett dataset med k\u00e4ndisansikten som ofta anv\u00e4nds vid ansiktsigenk\u00e4nning, best\u00e5r av <\/span><a href=\"https:\/\/odsc.medium.com\/the-impact-of-racial-bias-in-facial-recognition-software-36f37113604c\"><span style=\"font-weight: 400\">77,5% m\u00e4n och 83,5% personer med vit hudf\u00e4rg<\/span><\/a><span style=\"font-weight: 400\"> individer. En AI har inget hopp om att fungera korrekt om data inte representerar alla som den avser att tj\u00e4na. Felprocenten f\u00f6r ansiktsigenk\u00e4nning bland de b\u00e4sta algoritmerna visade sig vara s\u00e5 l\u00e5g som 0,8% f\u00f6r vita m\u00e4n och s\u00e5 h\u00f6g som 34,7% f\u00f6r m\u00f6rkhyade kvinnor.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Denna forskning kulminerade i den banbrytande <\/span><a href=\"http:\/\/proceedings.mlr.press\/v81\/buolamwini18a\/buolamwini18a.pdf\"><span style=\"font-weight: 400\">Studie av genusnyanser<\/span><\/a><span style=\"font-weight: 400\"> och en dokument\u00e4rfilm som heter <\/span><a href=\"https:\/\/www.netflix.com\/title\/81328723\"><span style=\"font-weight: 400\">Kodad partiskhet<\/span><\/a><span style=\"font-weight: 400\">som unders\u00f6kte hur AI sannolikt l\u00e4r sig fr\u00e5n bristf\u00e4lliga och icke-representativa data.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Konsekvenserna av detta \u00e4r l\u00e5ngt ifr\u00e5n oskyldiga - det har lett till felaktiga domslut, falska frihetsber\u00f6vanden och till att kvinnor och andra grupper har nekats jobb och krediter.<\/span><\/p>\n<p>AI beh\u00f6ver mer data av h\u00f6g kvalitet, som m\u00e5ste vara r\u00e4ttvisande och representativ <span style=\"font-weight: 400\">- det \u00e4r en sv\u00e5rf\u00e5ngad kombination.\u00a0<\/span><\/p>\n<h2><span style=\"font-weight: 400\">\u00c4r syntetisk data svaret?<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Syntetisk data anv\u00e4nds ofta inom computer vision (CV), d\u00e4r AI identifierar objekt och funktioner fr\u00e5n bilder och video.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Ist\u00e4llet f\u00f6r att samla in bilddata fr\u00e5n den verkliga v\u00e4rlden - som att fotografera eller videofilma en gata, vilket \u00e4r tekniskt utmanande och medf\u00f6r integritetsfr\u00e5gor - genererar man helt enkelt data i en virtuell milj\u00f6.\u00a0<\/span><\/p>\n<figure id=\"attachment_1873\" aria-describedby=\"caption-attachment-1873\" style=\"width: 987px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1873 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models.jpg\" alt=\"\" width=\"987\" height=\"554\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models.jpg 987w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-300x168.jpg 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-768x431.jpg 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-370x208.jpg 370w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-800x449.jpg 800w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-20x11.jpg 20w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-740x415.jpg 740w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/computer-vision-models-86x48.jpg 86w\" sizes=\"auto, (max-width: 987px) 100vw, 987px\" \/><figcaption id=\"caption-attachment-1873\" class=\"wp-caption-text\">Syntetisk data f\u00f6r utbildning i f\u00f6rarl\u00f6sa bilar. K\u00e4llan \u00e4r: <a href=\"https:\/\/analyticsindiamag.com\/how-synthetic-data-sets-can-improve-computer-vision-models\/\">Analytics India Mag<\/a>.<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400\">\u00c4ven om detta ger AI:erna mer data finns det flera nackdelar:\u00a0<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Att modellera verkliga scenarier i en virtuell milj\u00f6 \u00e4r inte helt enkelt.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Att generera stora m\u00e4ngder syntetiska data \u00e4r fortfarande kostsamt och tidskr\u00e4vande.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Kantfall och avvikande v\u00e4rden \u00e4r fortfarande ett problem.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Det kan inte perfekt replikera den verkliga varan.\u00a0<\/span><\/li>\n<li>\u00c5 andra sidan kan vissa aspekter vara f\u00f6r perfekta, och det \u00e4r sv\u00e5rt att avg\u00f6ra vad som saknas.<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400\">I slut\u00e4ndan \u00e4r syntetisk data utm\u00e4rkt f\u00f6r l\u00e4ttvirtualiserade milj\u00f6er, som ett fabriksgolv, men det r\u00e4cker inte alltid till f\u00f6r snabbr\u00f6rliga milj\u00f6er i verkligheten, som en gata i en stad.<\/span><\/p>\n<h2><span style=\"font-weight: 400\">Hur \u00e4r det med att generera syntetisk textdata?<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Text \u00e4r enklare \u00e4n bild- eller videodata, s\u00e5 kan modeller som ChatGPT anv\u00e4ndas f\u00f6r att generera n\u00e4stan o\u00e4ndliga syntetiska tr\u00e4ningsdata?<\/span><\/p>\n<p><span style=\"font-weight: 400\">Ja, men det \u00e4r riskabelt och effekterna \u00e4r inte l\u00e4tta att f\u00f6rutse. <\/span><span style=\"font-weight: 400\">\u00c4ven om syntetisk textdata kan hj\u00e4lpa till att st\u00e4lla in, testa och optimera modeller \u00e4r den inte idealisk f\u00f6r att l\u00e4ra modeller ny kunskap och kan f\u00f6rst\u00e4rka f\u00f6rdomar och andra problem.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">H\u00e4r \u00e4r en analogi av varf\u00f6r det \u00e4r problematiskt att tr\u00e4na AI med AI-genererad data:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">T\u00e4nk dig en skola som anv\u00e4nder alla v\u00e4rldens b\u00e4sta l\u00e4rob\u00f6cker f\u00f6r att utbilda sina elever i allt som finns att veta fr\u00e5n sina resurser under en dag.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">D\u00e4refter b\u00f6rjar skolan producera sitt eget arbete baserat p\u00e5 den kunskapen - p\u00e5 samma s\u00e4tt som en chatbot. Eleverna har l\u00e4rt sig av all tillg\u00e4nglig data fram till det datum d\u00e5 utbildningen b\u00f6rjar, men de kan inte effektivt f\u00f6ra in ny data i kunskapssystemet efter\u00e5t.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Kunskap skapas dagligen - \u00e4ven om den allra st\u00f6rsta delen av m\u00e4nniskans kunskap skapades f\u00f6re en viss dag, utvecklas och omvandlas kunskap \u00f6ver tid. Avg\u00f6rande \u00e4r att m\u00e4nniskor inte bara skapar ny kunskap hela tiden - vi \u00e4ndrar ocks\u00e5 v\u00e5rt perspektiv p\u00e5 befintlig kunskap.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Anta nu att skolan, som har slut p\u00e5 data, b\u00f6rjar undervisa sina elever med hj\u00e4lp av sin egen produktion. Eleverna b\u00f6rjar \"\u00e4ta\" sitt inneh\u00e5ll f\u00f6r att producera nytt inneh\u00e5ll.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">I det skedet kan studenternas resultat inte anpassas till den verkliga v\u00e4rlden och dess anv\u00e4ndbarhet minskar. Systemet \u00e5terskapar sitt eget arbete. Arbetet kan visserligen anpassas och utvecklas, men det sker isolerat fr\u00e5n allt utanf\u00f6r \u00e5terkopplingsslingan.\u00a0<\/span><\/li>\n<\/ul>\n<p>AI konfronterar st\u00e4ndigt m\u00e4nniskor med g\u00e5tor att l\u00f6sa, och<span style=\"font-weight: 400\">\u00a0<\/span>den h\u00e4r har en hel del <a href=\"https:\/\/www.reddit.com\/r\/ArtificialInteligence\/comments\/14b0p7i\/ai_is_going_to_eat_itself_experiment_shows_people\/\">kommentatorer p\u00e5 Reddit<\/a> och <a href=\"https:\/\/news.ycombinator.com\/item?id=34889404\">Y Combinator forum<\/a> f\u00f6rvirrad.<\/p>\n<p><span style=\"font-weight: 400\">Det \u00e4r h\u00e4pnadsv\u00e4ckande saker, och det finns ingen riktig konsensus om konsekvenserna.\u00a0<\/span><\/p>\n<h2><span style=\"font-weight: 400\">M\u00e4nskliga dataetiketterare anv\u00e4nder ofta AI f\u00f6r att producera data<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Det finns ytterligare ett of\u00f6rutsett lager till problemet med att ta fram utbildningsdata av h\u00f6g kvalitet.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Plattformar f\u00f6r gr\u00e4srotsarbete som <\/span><a href=\"https:\/\/www.mturk.com\/\"><span style=\"font-weight: 400\">Amazon Mechanical Turk<\/span><\/a><span style=\"font-weight: 400\"> (MTurk) anv\u00e4nds regelbundet av AI-f\u00f6retag som vill ta fram \u00e4kta \"m\u00e4nskliga\" dataset. T<\/span><span style=\"font-weight: 400\">et finns farh\u00e5gor om att dataantecknare p\u00e5 dessa plattformar anv\u00e4nder AI f\u00f6r att utf\u00f6ra sina uppgifter.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Forskare vid \u00c9cole Polytechnique F\u00e9d\u00e9rale de Lausanne (EPFL) i Schweiz analyserade data som skapats via MTurk f\u00f6r att unders\u00f6ka om arbetare anv\u00e4nde AI f\u00f6r att generera sina inl\u00e4gg.\u00a0<\/span><\/p>\n<p><a href=\"https:\/\/arxiv.org\/abs\/2306.07899\"><span style=\"font-weight: 400\">Studien<\/span><\/a><span style=\"font-weight: 400\">som publicerades den 13 juni, anlitade 44 MTurk-deltagare f\u00f6r att sammanfatta abstrakten fr\u00e5n 16 medicinska forskningsartiklar. Det visade sig att 33% till 46% av anv\u00e4ndarna p\u00e5 plattformen genererade sina bidrag med AI, trots att de ombads att svara med naturligt spr\u00e5k.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">\"Vi utvecklade en mycket specifik metod som fungerade mycket bra f\u00f6r att uppt\u00e4cka syntetisk text i v\u00e5rt scenario\", s\u00e4ger Manoel Ribeiro, medf\u00f6rfattare till studien och doktorand vid EPFL, <\/span><a href=\"https:\/\/www.theregister.com\/2023\/06\/16\/crowd_workers_bots_ai_training\/\"><span style=\"font-weight: 400\">ber\u00e4ttade f\u00f6r The Register<\/span><\/a><span style=\"font-weight: 400\"> den h\u00e4r veckan.<\/span><\/p>\n<p><span style=\"font-weight: 400\">\u00c4ven om studiens dataset och urvalsstorlek \u00e4r ganska liten, \u00e4r det l\u00e5ngt ifr\u00e5n ot\u00e4nkbart att tro att AI tr\u00e4nas omedvetet p\u00e5 AI-genererat inneh\u00e5ll. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Studien handlar inte om att skylla p\u00e5 MTurk-anst\u00e4llda - forskarna konstaterar att l\u00e5ga l\u00f6ner och repetitivt arbete bidrar till problemet. AI-f\u00f6retag vill ha m\u00e4nskligt skapade data av h\u00f6gsta kvalitet samtidigt som kostnaderna h\u00e5lls l\u00e5ga. En kommentator sa p\u00e5 Reddit: \"Jag \u00e4r f\u00f6r n\u00e4rvarande en av dessa arbetare, som har till uppgift att tr\u00e4na Bard. Jag \u00e4r s\u00e4ker som fan att anv\u00e4nda ChatGPT f\u00f6r detta. 20$\/timme \u00e4r inte tillr\u00e4ckligt f\u00f6r den hemska behandling vi f\u00e5r, s\u00e5 jag ska pressa varje cent ur det h\u00e4r ******* jobbet.\"<\/span><\/p>\n<p><span style=\"font-weight: 400\">Kaninh\u00e5let blir \u00e4nnu djupare, eftersom AI ofta tr\u00e4nas p\u00e5 data som skrapats fr\u00e5n internet. I takt med att mer AI-skrivet inneh\u00e5ll publiceras p\u00e5 n\u00e4tet kommer AI oundvikligen att l\u00e4ra sig av sina egna resultat.<\/span><\/p>\n<p><span style=\"font-weight: 400\">I takt med att m\u00e4nniskor b\u00f6rjar bli beroende av AI f\u00f6r att f\u00e5 information blir kvaliteten p\u00e5 deras resultat allt viktigare. Vi m\u00e5ste hitta innovativa metoder f\u00f6r att uppdatera AI med f\u00e4rska, autentiska data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Som Ribeiro uttrycker det: \"M\u00e4nskliga data \u00e4r guldstandarden, eftersom det \u00e4r m\u00e4nniskor vi bryr oss om, inte stora spr\u00e5kmodeller.\"<\/span><\/p>\n<p><span style=\"font-weight: 400\">Arbetet med att analysera den potentiella effekten av att AI konsumerar sina egna resultat p\u00e5g\u00e5r, men autentiska m\u00e4nskliga data \u00e4r fortfarande avg\u00f6rande f\u00f6r ett brett spektrum av maskininl\u00e4rningsuppgifter. <\/span><\/p>\n<p><span style=\"font-weight: 400\">Att generera stora m\u00e4ngder data f\u00f6r hungriga AI:er och samtidigt navigera bland riskerna \u00e4r ett p\u00e5g\u00e5ende arbete.\u00a0<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Data \u00e4r AI:s livsnerv, men det \u00e4r inte en o\u00e4ndlig resurs. Kan m\u00e4nskligheten f\u00e5 slut p\u00e5 data? Vad h\u00e4nder om vi g\u00f6r det? Komplexa AI-modeller kr\u00e4ver enorma m\u00e4ngder tr\u00e4ningsdata. F\u00f6r att tr\u00e4na en stor spr\u00e5kmodell (LLM) som ChatGPT kr\u00e4vs till exempel cirka 10 biljoner ord.  Vissa experter tror att tillg\u00e5ngen p\u00e5 h\u00f6gkvalitativa data minskar. I en studie fr\u00e5n 2022 fr\u00e5n forskare vid flera universitet stod det till exempel: \"V\u00e5r analys tyder p\u00e5 att lagret av h\u00f6gkvalitativa spr\u00e5kdata snart kommer att vara utt\u00f6mt; sannolikt f\u00f6re 2026 ... V\u00e5rt arbete tyder p\u00e5 att den nuvarande trenden med st\u00e4ndigt v\u00e4xande ML-modeller som f\u00f6rlitar sig p\u00e5 enorma<\/p>","protected":false},"author":2,"featured_media":1874,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[87],"tags":[150,145,160,105],"class_list":["post-1871","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-opinions","tag-ai-benefits","tag-ai-risk","tag-data-science","tag-machine-learning"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>What happens when AI starts consuming its own output? | DailyAI<\/title>\n<meta name=\"description\" content=\"Data is the lifeblood of AI, but it\u2019s not an infinite resource. Can humanity run out of data? What happens if we do?\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/sv\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/\" \/>\n<meta property=\"og:locale\" content=\"sv_SE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What happens when AI starts consuming its own output? | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Data is the lifeblood of AI, but it\u2019s not an infinite resource. Can humanity run out of data? What happens if we do?\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/sv\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-06-18T22:43:50+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-28T00:48:00+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/shutterstock_2256543489.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"667\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Skriven av\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Ber\u00e4knad l\u00e4stid\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minuter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"What happens when AI starts consuming its own output?\",\"datePublished\":\"2023-06-18T22:43:50+00:00\",\"dateModified\":\"2024-03-28T00:48:00+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/\"},\"wordCount\":1487,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/shutterstock_2256543489.jpg\",\"keywords\":[\"AI benefits\",\"AI risk\",\"Data science\",\"machine learning\"],\"articleSection\":{\"1\":\"Opinions &amp; Analysis\"},\"inLanguage\":\"sv-SE\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/\",\"name\":\"What happens when AI starts consuming its own output? | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/shutterstock_2256543489.jpg\",\"datePublished\":\"2023-06-18T22:43:50+00:00\",\"dateModified\":\"2024-03-28T00:48:00+00:00\",\"description\":\"Data is the lifeblood of AI, but it\u2019s not an infinite resource. Can humanity run out of data? What happens if we do?\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#breadcrumb\"},\"inLanguage\":\"sv-SE\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"sv-SE\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/shutterstock_2256543489.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/shutterstock_2256543489.jpg\",\"width\":1000,\"height\":667,\"caption\":\"AI generated data\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/06\\\/what-happens-when-ai-starts-consuming-its-own-output\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What happens when AI starts consuming its own output?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"sv-SE\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"sv-SE\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"sv-SE\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/sv\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Vad h\u00e4nder n\u00e4r AI b\u00f6rjar konsumera sin egen produktion? | DailyAI","description":"Data \u00e4r AI:s livsnerv, men det \u00e4r inte en o\u00e4ndlig resurs. Kan m\u00e4nskligheten f\u00e5 slut p\u00e5 data? Vad h\u00e4nder om vi g\u00f6r det?","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/sv\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/","og_locale":"sv_SE","og_type":"article","og_title":"What happens when AI starts consuming its own output? | DailyAI","og_description":"Data is the lifeblood of AI, but it\u2019s not an infinite resource. Can humanity run out of data? What happens if we do?","og_url":"https:\/\/dailyai.com\/sv\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/","og_site_name":"DailyAI","article_published_time":"2023-06-18T22:43:50+00:00","article_modified_time":"2024-03-28T00:48:00+00:00","og_image":[{"width":1000,"height":667,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/shutterstock_2256543489.jpg","type":"image\/jpeg"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Skriven av":"Sam Jeans","Ber\u00e4knad l\u00e4stid":"7 minuter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"What happens when AI starts consuming its own output?","datePublished":"2023-06-18T22:43:50+00:00","dateModified":"2024-03-28T00:48:00+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/"},"wordCount":1487,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/shutterstock_2256543489.jpg","keywords":["AI benefits","AI risk","Data science","machine learning"],"articleSection":{"1":"Opinions &amp; Analysis"},"inLanguage":"sv-SE"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/","url":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/","name":"Vad h\u00e4nder n\u00e4r AI b\u00f6rjar konsumera sin egen produktion? | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/shutterstock_2256543489.jpg","datePublished":"2023-06-18T22:43:50+00:00","dateModified":"2024-03-28T00:48:00+00:00","description":"Data \u00e4r AI:s livsnerv, men det \u00e4r inte en o\u00e4ndlig resurs. Kan m\u00e4nskligheten f\u00e5 slut p\u00e5 data? Vad h\u00e4nder om vi g\u00f6r det?","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#breadcrumb"},"inLanguage":"sv-SE","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/"]}]},{"@type":"ImageObject","inLanguage":"sv-SE","@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/shutterstock_2256543489.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/shutterstock_2256543489.jpg","width":1000,"height":667,"caption":"AI generated data"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"What happens when AI starts consuming its own output?"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DagligaAI","description":"Din dagliga dos av AI-nyheter","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"sv-SE"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DagligaAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"sv-SE","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"sv-SE","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam \u00e4r en vetenskaps- och teknikskribent som har arbetat i olika AI-startups. N\u00e4r han inte skriver l\u00e4ser han medicinska tidskrifter eller gr\u00e4ver igenom l\u00e5dor med vinylskivor.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/sv\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts\/1871","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/comments?post=1871"}],"version-history":[{"count":38,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts\/1871\/revisions"}],"predecessor-version":[{"id":2136,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts\/1871\/revisions\/2136"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/media\/1874"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/media?parent=1871"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/categories?post=1871"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/tags?post=1871"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}