{"id":6851,"date":"2023-10-27T19:21:39","date_gmt":"2023-10-27T19:21:39","guid":{"rendered":"https:\/\/dailyai.com\/?p=6851"},"modified":"2023-10-27T22:55:24","modified_gmt":"2023-10-27T22:55:24","slug":"ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization","status":"publish","type":"post","link":"https:\/\/dailyai.com\/da\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/","title":{"rendered":"AI udsat for test af Theory of Mind og systematisk generalisering"},"content":{"rendered":"<p><b>Forskere har introduceret FANToM, et nyt benchmark designet til grundigt at teste og evaluere store sprogmodellers (LLM'er) forst\u00e5else og anvendelse af Theory of Mind (ToM).<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Theory of Mind refererer til evnen til at till\u00e6gge sig selv og andre overbevisninger, \u00f8nsker og viden og til at forst\u00e5, at andre har overbevisninger og perspektiver, der er forskellige fra ens egne.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">ToM anses for at v\u00e6re grundlaget for den bevidsthed, som intelligente dyr besidder. Ud over mennesker anses primater som orangutanger, gorillaer og chimpanser for at have ToM, og det samme g\u00e6lder nogle ikke-primater som papeg\u00f8jer og medlemmer af kragefuglefamilien.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Efterh\u00e5nden som AI-modeller bliver mere komplekse, s\u00f8ger AI-forskere nye metoder til at evaluere evner som ToM.<\/span><\/p>\n<p><a href=\"https:\/\/hyunw.kim\/fantom\/\"><span style=\"font-weight: 400;\">Et nyt benchmark kaldet FANToM<\/span><\/a><span style=\"font-weight: 400;\">som er skabt af forskere fra Allen Institute for AI, University of Washington, Carnegie Mellon University og Seoul National University, uds\u00e6tter maskinl\u00e6ringsmodeller for dynamiske scenarier, der afspejler interaktioner i det virkelige liv.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Med FANToM g\u00e5r karakterer ind og ud af samtaler, hvilket udfordrer AI-modeller til at opretholde en n\u00f8jagtig forst\u00e5else af, hvem der ved hvad p\u00e5 et givet tidspunkt.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ved at uds\u00e6tte store sprogmodeller (LLM'er) for FANToM viste det sig, at selv de mest avancerede modeller k\u00e6mper med at opretholde en konsistent ToM.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Modellernes pr\u00e6station var betydeligt lavere end de menneskelige deltageres, hvilket understreger AI's begr\u00e6nsninger i forhold til at forst\u00e5 og navigere i komplekse sociale interaktioner.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Faktisk dominerede mennesker alle kategorier, som det ses nedenfor.\u00a0<\/span><\/p>\n<figure id=\"attachment_6852\" aria-describedby=\"caption-attachment-6852\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6852 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-1024x213.png\" alt=\"AI ToM\" width=\"1024\" height=\"213\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-1024x213.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-300x63.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-768x160.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-370x77.png 370w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-800x167.png 800w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-740x154.png 740w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-20x4.png 20w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-1320x275.png 1320w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart-230x48.png 230w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/scores_barchart.png 1367w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-6852\" class=\"wp-caption-text\">Mennesker var langt bedre til at svare p\u00e5 ToM-relaterede sp\u00f8rgsm\u00e5l sammenlignet med popul\u00e6re LLM'er. Kilde: <a href=\"https:\/\/hyunw.kim\/fantom\/\">FANToM<\/a>.<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Et interessant sidepunkt er, at oktober-versionen af GPT-4-modellen blev overg\u00e5et af en tidligere juni-version, hvilket kan underst\u00f8tte de seneste anekdoter blandt brugerne om, at <\/span><a href=\"https:\/\/dailyai.com\/da\/2023\/07\/is-chatgpt-getting-worse-heres-everything-we-know-so-far\/\"><span style=\"font-weight: 400;\">ChatGPT bliver v\u00e6rre og v\u00e6rre<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">FANToM afsl\u00f8rede ogs\u00e5 teknikker til at forbedre LLM ToM, som f.eks. chain-of-thought r\u00e6sonnement og andre finjusteringsmetoder. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Men kl\u00f8ften mellem AI og menneskelige ToM-f\u00e6rdigheder er stadig stor.\u00a0<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">AI springer mod menneskelignende sprogf\u00e6rdigheder<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">I en noget relateret, men separat <\/span><a href=\"https:\/\/www.nature.com\/articles\/d41586-023-03272-3\"><span style=\"font-weight: 400;\">unders\u00f8gelse offentliggjort i Nature<\/span><\/a><span style=\"font-weight: 400;\">udviklede forskere et neuralt netv\u00e6rk, der var i stand til at generalisere sprog p\u00e5 samme m\u00e5de som mennesker.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Dette nye neurale netv\u00e6rk viste en imponerende evne til at integrere nyligt l\u00e6rte ord i sit eksisterende ordforr\u00e5d. Det kunne derefter bruge disse ord i forskellige sammenh\u00e6nge, en kognitiv f\u00e6rdighed kendt som systematisk generalisering.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Mennesker udviser naturligt systematisk generalisering og inkorporerer problemfrit nyt ordforr\u00e5d i deres repertoire.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">N\u00e5r man f.eks. f\u00f8rst har l\u00e6rt udtrykket \"photobomb\", kan man bruge det i forskellige situationer n\u00e6sten med det samme. Der dukker hele tiden nye slangudtryk op, og mennesker optager dem naturligt i deres ordforr\u00e5d.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Forskerne udsatte b\u00e5de deres eget brugerdefinerede neurale netv\u00e6rk og ChatGPT for en r\u00e6kke tests og fandt ud af, at ChatGPT haltede bagefter den brugerdefinerede model i ydeevne.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Mens LLM'er som ChatGPT udm\u00e6rker sig i mange samtalescenarier, udviser de bem\u00e6rkelsesv\u00e6rdige uoverensstemmelser og huller i andre, et problem, som dette nye neurale netv\u00e6rk l\u00f8ser.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For at unders\u00f8ge dette aspekt af sproglig kommunikation udf\u00f8rte forskere et eksperiment med 25 mennesker, hvor de vurderede deres evne til at anvende nyligt l\u00e6rte ord i forskellige sammenh\u00e6nge. <\/span><span style=\"font-weight: 400;\">Fors\u00f8gspersonerne blev introduceret til et pseudosprog best\u00e5ende af nonsensord, der repr\u00e6senterede forskellige handlinger og regler.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Efter en tr\u00e6ningsfase udm\u00e6rkede deltagerne sig ved at anvende disse abstrakte regler p\u00e5 nye situationer, hvilket viste systematisk generalisering.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Da det nyudviklede neurale netv\u00e6rk blev udsat for denne opgave, afspejlede det den menneskelige pr\u00e6station. <\/span><span style=\"font-weight: 400;\">Men da ChatGPT blev udsat for den samme udfordring, havde den store problemer og fejlede mellem 42 og 86% af tiden, afh\u00e6ngigt af den specifikke opgave.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Det er vigtigt af to grunde. For det f\u00f8rste kan man argumentere for, at dette nye neurale netv\u00e6rk effektivt udkonkurrerede GPT-4 p\u00e5 denne specifikke opgave - hvilket er imponerende nok. For det andet afsl\u00f8rer denne unders\u00f8gelse nye metoder til at l\u00e6re AI-modeller at generalisere nyt sprog ligesom mennesker.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Som Elia Bruni, der er specialist i naturlig sprogbehandling ved Osnabr\u00fcck Universitet i Tyskland, beskriver det: \"Det er en stor ting at indf\u00f8re systematik i neurale netv\u00e6rk.\"<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Tilsammen tilbyder disse to studier nye tilgange til at tr\u00e6ne mere intelligente AI-modeller, der kan konkurrere med mennesker p\u00e5 kritiske omr\u00e5der som lingvistik og Theory of Mind. <\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Forskere har introduceret FANToM, et nyt benchmark designet til grundigt at teste og evaluere store sprogmodellers (LLM'er) forst\u00e5else og anvendelse af Theory of Mind (ToM). Theory of Mind henviser til evnen til at till\u00e6gge sig selv og andre overbevisninger, \u00f8nsker og viden og til at forst\u00e5, at andre har overbevisninger og perspektiver, der er forskellige fra ens egne.  ToM anses for at v\u00e6re grundlaget for den bevidsthed, som intelligente dyr besidder. Ud over mennesker anses primater som orangutanger, gorillaer og chimpanser for at have ToM, og det samme g\u00e6lder nogle ikke-primater som papeg\u00f8jer og medlemmer af kragefamilien.  Som AI-modeller<\/p>","protected":false},"author":2,"featured_media":6853,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[404,115,105,93],"class_list":["post-6851","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-agi","tag-chatgpt","tag-machine-learning","tag-openai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>AI subjected to tests on Theory of Mind and systematic generalization | DailyAI<\/title>\n<meta name=\"description\" content=\"Researchers have introduced FANToM, a novel benchmark designed to rigorously test and evaluate large language models\u2019 (LLMs) understanding and application of Theory of Mind (ToM).\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/da\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/\" \/>\n<meta property=\"og:locale\" content=\"da_DK\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"AI subjected to tests on Theory of Mind and systematic generalization | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Researchers have introduced FANToM, a novel benchmark designed to rigorously test and evaluate large language models\u2019 (LLMs) understanding and application of Theory of Mind (ToM).\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/da\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-27T19:21:39+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-27T22:55:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_435449356.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"667\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Skrevet af\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimeret l\u00e6setid\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"AI subjected to tests on Theory of Mind and systematic generalization\",\"datePublished\":\"2023-10-27T19:21:39+00:00\",\"dateModified\":\"2023-10-27T22:55:24+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/\"},\"wordCount\":665,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_435449356.jpg\",\"keywords\":[\"AGI\",\"ChatGPT\",\"machine learning\",\"OpenAI\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"da-DK\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/\",\"name\":\"AI subjected to tests on Theory of Mind and systematic generalization | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_435449356.jpg\",\"datePublished\":\"2023-10-27T19:21:39+00:00\",\"dateModified\":\"2023-10-27T22:55:24+00:00\",\"description\":\"Researchers have introduced FANToM, a novel benchmark designed to rigorously test and evaluate large language models\u2019 (LLMs) understanding and application of Theory of Mind (ToM).\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#breadcrumb\"},\"inLanguage\":\"da-DK\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_435449356.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_435449356.jpg\",\"width\":1000,\"height\":667,\"caption\":\"Theory of Mind AI\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI subjected to tests on Theory of Mind and systematic generalization\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"da-DK\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/da\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"AI udsat for test af Theory of Mind og systematisk generalisering | DailyAI","description":"Forskere har introduceret FANToM, et nyt benchmark designet til grundigt at teste og evaluere store sprogmodellers (LLM'er) forst\u00e5else og anvendelse af Theory of Mind (ToM).","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/da\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/","og_locale":"da_DK","og_type":"article","og_title":"AI subjected to tests on Theory of Mind and systematic generalization | DailyAI","og_description":"Researchers have introduced FANToM, a novel benchmark designed to rigorously test and evaluate large language models\u2019 (LLMs) understanding and application of Theory of Mind (ToM).","og_url":"https:\/\/dailyai.com\/da\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/","og_site_name":"DailyAI","article_published_time":"2023-10-27T19:21:39+00:00","article_modified_time":"2023-10-27T22:55:24+00:00","og_image":[{"width":1000,"height":667,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_435449356.jpg","type":"image\/jpeg"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Skrevet af":"Sam Jeans","Estimeret l\u00e6setid":"4 minutter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"AI subjected to tests on Theory of Mind and systematic generalization","datePublished":"2023-10-27T19:21:39+00:00","dateModified":"2023-10-27T22:55:24+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/"},"wordCount":665,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_435449356.jpg","keywords":["AGI","ChatGPT","machine learning","OpenAI"],"articleSection":["Industry"],"inLanguage":"da-DK"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/","url":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/","name":"AI udsat for test af Theory of Mind og systematisk generalisering | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_435449356.jpg","datePublished":"2023-10-27T19:21:39+00:00","dateModified":"2023-10-27T22:55:24+00:00","description":"Forskere har introduceret FANToM, et nyt benchmark designet til grundigt at teste og evaluere store sprogmodellers (LLM'er) forst\u00e5else og anvendelse af Theory of Mind (ToM).","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#breadcrumb"},"inLanguage":"da-DK","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/"]}]},{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_435449356.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_435449356.jpg","width":1000,"height":667,"caption":"Theory of Mind AI"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/10\/ai-subjected-to-tests-on-theory-of-mind-and-systematic-generalization\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"AI subjected to tests on Theory of Mind and systematic generalization"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Din daglige dosis af AI-nyheder","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"da-DK"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam er videnskabs- og teknologiforfatter og har arbejdet i forskellige AI-startups. N\u00e5r han ikke skriver, kan han finde p\u00e5 at l\u00e6se medicinske tidsskrifter eller grave i kasser med vinylplader.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/da\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/6851","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/comments?post=6851"}],"version-history":[{"count":5,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/6851\/revisions"}],"predecessor-version":[{"id":6866,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/6851\/revisions\/6866"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/media\/6853"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/media?parent=6851"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/categories?post=6851"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/tags?post=6851"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}