{"id":11172,"date":"2024-04-02T09:32:07","date_gmt":"2024-04-02T09:32:07","guid":{"rendered":"https:\/\/dailyai.com\/?p=11172"},"modified":"2024-04-02T09:32:07","modified_gmt":"2024-04-02T09:32:07","slug":"deepmind-developed-safe-an-ai-agent-to-fact-check-llms","status":"publish","type":"post","link":"https:\/\/dailyai.com\/de\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","title":{"rendered":"DeepMind entwickelte SAFE, einen KI-Agenten zur \u00dcberpr\u00fcfung von LLMs auf Fakten"},"content":{"rendered":"<p><strong>Forscher von DeepMind und der Stanford University haben einen KI-Agenten entwickelt, der LLMs auf Fakten pr\u00fcft und ein Benchmarking der Faktizit\u00e4t von KI-Modellen erm\u00f6glicht.<\/strong><\/p>\n<p>Selbst die besten KI-Modelle neigen immer noch dazu <a href=\"https:\/\/dailyai.com\/de\/2024\/02\/generative-ai-systems-hallucinations-and-mounting-technical-debt\/\">halluzinieren<\/a> zuweilen. Wenn Sie ChatGPT bitten, Ihnen die Fakten zu einem Thema zu nennen, ist es umso wahrscheinlicher, dass die Antwort einige Fakten enth\u00e4lt, die nicht wahr sind, je l\u00e4nger sie ist.<\/p>\n<p>Welche Modelle sind bei der Erstellung l\u00e4ngerer Antworten sachlich genauer als andere? Das ist schwer zu sagen, denn bis jetzt hatten wir keinen Ma\u00dfstab, um die Sachlichkeit von LLM-Langzeitantworten zu messen.<\/p>\n<p>DeepMind verwendete zun\u00e4chst GPT-4, um LongFact zu erstellen, eine Reihe von 2.280 Aufforderungen in Form von Fragen zu 38 Themen. Diese Aufforderungen entlocken dem getesteten LLM Antworten in Langform.<\/p>\n<p>Dann schufen sie einen KI-Agenten mit GPT-3.5-Turbo, um mit Hilfe von Google zu \u00fcberpr\u00fcfen, wie sachlich die vom LLM generierten Antworten waren. Sie nannten die Methode Search-Augmented Factuality Evaluator (SAFE).<\/p>\n<p>SAFE zerlegt zun\u00e4chst die Langform-Antwort des LLM in einzelne Fakten. Dann sendet es Suchanfragen an Google Search und entscheidet anhand der Informationen in den zur\u00fcckgegebenen Suchergebnissen \u00fcber den Wahrheitsgehalt des Sachverhalts.<\/p>\n<p>Hier ist ein Beispiel aus dem <a href=\"https:\/\/arxiv.org\/pdf\/2403.18802.pdf\" target=\"_blank\" rel=\"noopener\">Forschungsarbeit<\/a>.<\/p>\n<figure id=\"attachment_11178\" aria-describedby=\"caption-attachment-11178\" style=\"width: 1352px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11178\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example.png\" alt=\"\" width=\"1352\" height=\"536\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example.png 1352w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-300x119.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-1024x406.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-768x304.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-60x24.png 60w\" sizes=\"auto, (max-width: 1352px) 100vw, 1352px\" \/><figcaption id=\"caption-attachment-11178\" class=\"wp-caption-text\">Eine Aufforderung zur Faktensuche f\u00fchrt zu einer Antwort in Langform. Die Antwort wird in einzelne Fakten zerlegt, so \u00fcberarbeitet, dass sie in sich geschlossen ist, auf Relevanz gepr\u00fcft und mit der Google-Suche \u00fcberpr\u00fcft. Quelle: arXiv<\/figcaption><\/figure>\n<p>Die Forscher sagen, dass SAFE eine \"\u00fcbermenschliche Leistung\" im Vergleich zu menschlichen Kommentatoren erreicht, die die Fakten\u00fcberpr\u00fcfung durchf\u00fchren.<\/p>\n<p>SAFE stimmte in 72% der F\u00e4lle mit menschlichen Annotationen \u00fcberein, und in 76% der F\u00e4lle, in denen es von den menschlichen Annotationen abwich, lag es richtig. Au\u00dferdem war es 20-mal billiger als menschliche Annotatoren, die von einer Crowd gestellt wurden. LLMs sind also bessere und billigere Fakten\u00fcberpr\u00fcfer als Menschen.<\/p>\n<p>Die Qualit\u00e4t der Antworten der getesteten LLMs wurde anhand der Anzahl der Fakten in ihrer Antwort in Kombination mit der Sachlichkeit der einzelnen Fakten gemessen.<\/p>\n<p>Die von ihnen verwendete Metrik (F1@K) sch\u00e4tzt die vom Menschen bevorzugte \"ideale\" Anzahl von Fakten in einer Antwort. Bei den Benchmark-Tests wurden 64 als Medianwert f\u00fcr K und 178 als H\u00f6chstwert verwendet.<\/p>\n<p>Einfach ausgedr\u00fcckt ist F1@K ein Ma\u00df f\u00fcr die Frage \"Hat mir die Antwort so viele Fakten geliefert, wie ich wollte?\" in Kombination mit der Frage \"Wie viele dieser Fakten waren wahr?\".<\/p>\n<h2>Welcher LLM ist am sachlichsten?<\/h2>\n<p>Die Forscher verwendeten LongFact, um 13 LLMs aus den Familien Gemini, GPT, Claude und PaLM-2 zu befragen. Anschlie\u00dfend bewerteten sie mit SAFE die Sachbezogenheit ihrer Antworten.<\/p>\n<p>GPT-4-Turbo f\u00fchrt die Liste der sachlichsten Modelle bei der Erstellung von Langformantworten an. Es wurde dicht gefolgt von Gemini-Ultra und PaLM-2-L-IT-RLHF. Die Ergebnisse zeigen, dass gr\u00f6\u00dfere LLMs sachlicher sind als kleinere.<\/p>\n<p>Die F1@K-Berechnung w\u00fcrde Datenwissenschaftler wahrscheinlich begeistern, aber der Einfachheit halber zeigen diese Benchmark-Ergebnisse, wie faktisch jedes Modell bei der R\u00fcckgabe von durchschnittlich langen und l\u00e4ngeren Antworten auf die Fragen ist.<\/p>\n<figure id=\"attachment_11179\" aria-describedby=\"caption-attachment-11179\" style=\"width: 1366px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11179\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results.png\" alt=\"\" width=\"1366\" height=\"602\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results.png 1366w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-300x132.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-1024x451.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-768x338.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-60x26.png 60w\" sizes=\"auto, (max-width: 1366px) 100vw, 1366px\" \/><figcaption id=\"caption-attachment-11179\" class=\"wp-caption-text\">Langform-Faktizit\u00e4tsleistung von 13 LLMs mit K = 64 (die mittlere Anzahl von Fakten unter allen Modellantworten) und K = 178 (die maximale Anzahl von Fakten unter allen Modellantworten). Quelle: arXiv<\/figcaption><\/figure>\n<p>SAFE ist eine kosteng\u00fcnstige und wirksame Methode zur Quantifizierung der Faktizit\u00e4t von LLM-Langzeitstudien. Es ist schneller und billiger als menschliche Fakten\u00fcberpr\u00fcfung, aber es h\u00e4ngt immer noch von der Wahrhaftigkeit der Informationen ab, die Google in den Suchergebnissen ausgibt.<\/p>\n<p>DeepMind gab SAFE zur \u00f6ffentlichen Nutzung frei und schlug vor, dass es helfen k\u00f6nnte, die Faktizit\u00e4t von LLMs durch besseres Vortraining und Feinabstimmung zu verbessern. Es k\u00f6nnte auch einen LLM in die Lage versetzen, seine Fakten zu \u00fcberpr\u00fcfen, bevor er einem Nutzer die Ausgabe pr\u00e4sentiert.<\/p>\n<p>OpenAI wird sich freuen zu sehen, dass die Forschung von Google zeigt, dass GPT-4 Gemini in einem weiteren Benchmark schl\u00e4gt.<\/p>","protected":false},"excerpt":{"rendered":"<p>Forscher von DeepMind und der Stanford University haben einen KI-Agenten entwickelt, der LLMs auf Fakten pr\u00fcft und ein Benchmarking der Faktizit\u00e4t von KI-Modellen erm\u00f6glicht. Selbst die besten KI-Modelle neigen dazu, manchmal zu halluzinieren. Wenn Sie ChatGPT bitten, Ihnen die Fakten zu einem Thema zu nennen, ist es umso wahrscheinlicher, dass die Antwort einige Fakten enth\u00e4lt, die nicht der Wahrheit entsprechen, je l\u00e4nger sie ist. Welche Modelle sind faktisch genauer als andere, wenn sie l\u00e4ngere Antworten generieren? Das ist schwer zu sagen, denn bisher hatten wir keinen Ma\u00dfstab, um die Faktengenauigkeit von LLM-Langformantworten zu messen. DeepMind verwendete zun\u00e4chst GPT-4, um LongFact zu erstellen, eine Reihe von<\/p>","protected":false},"author":6,"featured_media":11182,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[147,118],"class_list":["post-11172","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-deepmind","tag-llms"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/de\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response the more likely it is to include some facts that aren\u2019t true. Which models are more factually accurate than others when generating longer answers? It\u2019s hard to say because until now, we didn\u2019t have a benchmark measuring the factuality of LLM long-form responses. DeepMind first used GPT-4 to create LongFact, a set of\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/de\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-02T09:32:07+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"4\u00a0Minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs\",\"datePublished\":\"2024-04-02T09:32:07+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"},\"wordCount\":611,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"keywords\":[\"DeepMind\",\"LLMS\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"de\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\",\"name\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"datePublished\":\"2024-04-02T09:32:07+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"de\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/de\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"DeepMind entwickelte SAFE, einen KI-Agenten zur \u00dcberpr\u00fcfung von LLMs | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/de\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","og_locale":"de_DE","og_type":"article","og_title":"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI","og_description":"Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response the more likely it is to include some facts that aren\u2019t true. Which models are more factually accurate than others when generating longer answers? It\u2019s hard to say because until now, we didn\u2019t have a benchmark measuring the factuality of LLM long-form responses. DeepMind first used GPT-4 to create LongFact, a set of","og_url":"https:\/\/dailyai.com\/de\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","og_site_name":"DailyAI","article_published_time":"2024-04-02T09:32:07+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Verfasst von":"Eugene van der Watt","Gesch\u00e4tzte Lesezeit":"4\u00a0Minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"DeepMind developed SAFE, an AI agent to fact-check LLMs","datePublished":"2024-04-02T09:32:07+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"},"wordCount":611,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","keywords":["DeepMind","LLMS"],"articleSection":["Industry"],"inLanguage":"de"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","url":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","name":"DeepMind entwickelte SAFE, einen KI-Agenten zur \u00dcberpr\u00fcfung von LLMs | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","datePublished":"2024-04-02T09:32:07+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"]}]},{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"DeepMind developed SAFE, an AI agent to fact-check LLMs"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Ihre t\u00e4gliche Dosis an AI-Nachrichten","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"de"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene kommt aus der Elektronikbranche und liebt alles, was mit Technik zu tun hat. Wenn er eine Pause vom Konsum von KI-Nachrichten einlegt, findet man ihn am Snookertisch.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/de\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts\/11172","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/comments?post=11172"}],"version-history":[{"count":2,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts\/11172\/revisions"}],"predecessor-version":[{"id":11181,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts\/11172\/revisions\/11181"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/media\/11182"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/media?parent=11172"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/categories?post=11172"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/tags?post=11172"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}