{"id":11172,"date":"2024-04-02T09:32:07","date_gmt":"2024-04-02T09:32:07","guid":{"rendered":"https:\/\/dailyai.com\/?p=11172"},"modified":"2024-04-02T09:32:07","modified_gmt":"2024-04-02T09:32:07","slug":"deepmind-developed-safe-an-ai-agent-to-fact-check-llms","status":"publish","type":"post","link":"https:\/\/dailyai.com\/da\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","title":{"rendered":"DeepMind developed SAFE, an AI agent to fact-check LLMs"},"content":{"rendered":"<p><strong>Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality.<\/strong><\/p>\n<p>Even the best AI models still tend to <a href=\"https:\/\/dailyai.com\/da\/2024\/02\/generative-ai-systems-hallucinations-and-mounting-technical-debt\/\">hallucinate<\/a> at times. If you ask ChatGPT to give you the facts about a topic, the longer its response, the more likely it is to include some facts that aren't true.<\/p>\n<p>Which models are more factually accurate than others when generating longer answers? It's hard to say because, until now, we didn't have a benchmark measuring the factuality of LLM long-form responses.<\/p>\n<p>DeepMind first used GPT-4 to create LongFact, a set of 2,280 prompts in the form of questions related to 38 topics. These prompts elicit long-form responses from the LLM being tested.<\/p>\n<p>They then created an AI agent using GPT-3.5-turbo that uses Google to verify how factual the LLM's responses were. They called the method Search-Augmented Factuality Evaluator (SAFE).<\/p>\n<p>SAFE first breaks the LLM's long-form response into individual facts. 
It then sends search requests to Google Search and reasons about whether each fact is true based on the information in the returned search results.<\/p>\n<p>Here is an example from the <a href=\"https:\/\/arxiv.org\/pdf\/2403.18802.pdf\" target=\"_blank\" rel=\"noopener\">research paper<\/a>.<\/p>\n<figure id=\"attachment_11178\" aria-describedby=\"caption-attachment-11178\" style=\"width: 1352px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11178\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example.png\" alt=\"\" width=\"1352\" height=\"536\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example.png 1352w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-300x119.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-1024x406.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-768x304.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-60x24.png 60w\" sizes=\"auto, (max-width: 1352px) 100vw, 1352px\" \/><figcaption id=\"caption-attachment-11178\" class=\"wp-caption-text\">A fact-seeking prompt elicits a long-form response. The response is broken into individual facts, each revised to be self-contained, checked for relevance, and verified using Google Search. Source: arXiv<\/figcaption><\/figure>\n<p>The researchers say that SAFE achieves \"superhuman performance\" compared with the human annotators who performed the fact-checking.<\/p>\n<p>SAFE agreed with 72% of the human annotations, and where it disagreed with the humans, it was right 76% of the time. It was also 20 times cheaper than crowdsourced human annotators. 
On this benchmark, then, an LLM agent was a cheaper and often more accurate fact-checker than crowdsourced human annotators.<\/p>\n<p>The quality of each tested LLM's response was measured by the number of individual facts in the response combined with how factual those facts were.<\/p>\n<p>The metric they used, F1@K, takes a parameter K that estimates the human-preferred \"ideal\" number of facts in a response. The benchmark tests used 64, the median, and 178, the maximum, as values for K.<\/p>\n<p>In short, F1@K measures \"Did the response give me as many facts as I wanted?\" combined with \"How many of those facts were true?\"<\/p>\n<h2>Which LLM is most factual?<\/h2>\n<p>The researchers used LongFact to prompt 13 LLMs from the Gemini, GPT, Claude, and PaLM-2 families. They then used SAFE to evaluate how factual their responses were.<\/p>\n<p>GPT-4-Turbo tops the list as the most factual model when generating long-form responses. It was closely followed by Gemini-Ultra and PaLM-2-L-IT-RLHF. The results showed that larger LLMs are more factual than smaller ones.<\/p>\n<p>The F1@K calculation would probably excite data scientists but, for simplicity, these benchmark results show how factual each model is when returning average-length and longer responses to the questions.<\/p>\n<figure id=\"attachment_11179\" aria-describedby=\"caption-attachment-11179\" style=\"width: 1366px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11179\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results.png\" alt=\"\" width=\"1366\" height=\"602\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results.png 1366w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-300x132.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-1024x451.png 1024w, 
https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-768x338.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-60x26.png 60w\" sizes=\"auto, (max-width: 1366px) 100vw, 1366px\" \/><figcaption id=\"caption-attachment-11179\" class=\"wp-caption-text\">Long-form factuality performance of 13 LLMs with K = 64 (the median number of facts among all model responses) and K = 178 (the maximum number of facts among all model responses). Source: arXiv<\/figcaption><\/figure>\n<p>SAFE is a cheap and effective way to quantify LLM long-form factuality. It is faster and cheaper than human fact-checkers, but it still depends on the veracity of the information that Google returns in its search results.<\/p>\n<p>DeepMind released SAFE for public use and suggested that it could help improve LLM factuality through better pretraining and fine-tuning. It could also enable an LLM to check its facts before presenting its output to a user.<\/p>\n<p>OpenAI will be pleased to see that research from Google shows GPT-4 beating Gemini in yet another benchmark.<\/p>","protected":false},"excerpt":{"rendered":"<p>Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response, the more likely it is to include some facts that aren't true. Which models are more factually accurate than others when generating longer answers? It's hard to say because, until now, we didn't have a benchmark measuring the factuality of LLM long-form responses. 
DeepMind first used GPT-4 to create LongFact, a set of<\/p>","protected":false},"author":6,"featured_media":11182,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[147,118],"class_list":["post-11172","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-deepmind","tag-llms"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/da\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/\" \/>\n<meta property=\"og:locale\" content=\"da_DK\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response the more likely it is to include some facts that aren\u2019t true. Which models are more factually accurate than others when generating longer answers? It\u2019s hard to say because until now, we didn\u2019t have a benchmark measuring the factuality of LLM long-form responses. 
DeepMind first used GPT-4 to create LongFact, a set of\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/da\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-02T09:32:07+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Skrevet af\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimeret l\u00e6setid\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"DeepMind developed SAFE, an AI agent to fact-check 
LLMs\",\"datePublished\":\"2024-04-02T09:32:07+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"},\"wordCount\":611,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"keywords\":[\"DeepMind\",\"LLMS\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"da-DK\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\",\"name\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"datePublished\":\"2024-04-02T09:32:07+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#breadcrumb\"},\"inLanguage\":\"da-DK\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\",\"url\":\"https:\\\/
\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"da-DK\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\"
,\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/da\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"DeepMind har udviklet SAFE, en AI-agent til at faktatjekke LLM'er | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/da\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","og_locale":"da_DK","og_type":"article","og_title":"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI","og_description":"Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response the more likely it is to include some facts that aren\u2019t true. Which models are more factually accurate than others when generating longer answers? 
It\u2019s hard to say because until now, we didn\u2019t have a benchmark measuring the factuality of LLM long-form responses. DeepMind first used GPT-4 to create LongFact, a set of","og_url":"https:\/\/dailyai.com\/da\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","og_site_name":"DailyAI","article_published_time":"2024-04-02T09:32:07+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Skrevet af":"Eugene van der Watt","Estimeret l\u00e6setid":"4 minutter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"DeepMind developed SAFE, an AI agent to fact-check LLMs","datePublished":"2024-04-02T09:32:07+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"},"wordCount":611,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","keywords":["DeepMind","LLMS"],"articleSection":["Industry"],"inLanguage":"da-DK"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","url":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","name":"DeepMind har udviklet 
SAFE, en AI-agent til at faktatjekke LLM'er | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","datePublished":"2024-04-02T09:32:07+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#breadcrumb"},"inLanguage":"da-DK","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"]}]},{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"DeepMind developed SAFE, an AI agent to fact-check LLMs"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Din daglige dosis af 
AI-nyheder","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"da-DK"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene har en baggrund som elektronikingeni\u00f8r og elsker alt, hvad der har med teknologi at g\u00f8re. 
N\u00e5r han tager en pause fra at l\u00e6se AI-nyheder, kan du finde ham ved snookerbordet.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/da\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/11172","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/comments?post=11172"}],"version-history":[{"count":2,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/11172\/revisions"}],"predecessor-version":[{"id":11181,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/11172\/revisions\/11181"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/media\/11182"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/media?parent=11172"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/categories?post=11172"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/tags?post=11172"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}