{"id":11172,"date":"2024-04-02T09:32:07","date_gmt":"2024-04-02T09:32:07","guid":{"rendered":"https:\/\/dailyai.com\/?p=11172"},"modified":"2024-04-02T09:32:07","modified_gmt":"2024-04-02T09:32:07","slug":"deepmind-developed-safe-an-ai-agent-to-fact-check-llms","status":"publish","type":"post","link":"https:\/\/dailyai.com\/it\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","title":{"rendered":"DeepMind ha sviluppato SAFE, un agente di intelligenza artificiale per il controllo dei fatti nei corsi di laurea."},"content":{"rendered":"<p><strong>I ricercatori di DeepMind e dell'Universit\u00e0 di Stanford hanno sviluppato un agente di IA che controlla i LLM e consente di effettuare un benchmarking della fattualit\u00e0 dei modelli di IA.<\/strong><\/p>\n<p>Anche i migliori modelli di intelligenza artificiale tendono a <a href=\"https:\/\/dailyai.com\/it\/2024\/02\/generative-ai-systems-hallucinations-and-mounting-technical-debt\/\">allucinare<\/a> a volte. Se chiedete a ChatGPT di fornirvi i fatti su un argomento, pi\u00f9 lunga \u00e8 la sua risposta e pi\u00f9 \u00e8 probabile che includa alcuni fatti non veri.<\/p>\n<p>Quali modelli sono pi\u00f9 accurati di altri nel generare risposte lunghe? \u00c8 difficile dirlo, perch\u00e9 finora non avevamo un parametro di riferimento che misurasse la fattualit\u00e0 delle risposte lunghe dei LLM.<\/p>\n<p>DeepMind ha utilizzato il GPT-4 per creare LongFact, un insieme di 2.280 prompt sotto forma di domande relative a 38 argomenti. Questi prompt sollecitano risposte di tipo lungo da parte del LLM sottoposto al test.<\/p>\n<p>Hanno quindi creato un agente AI che utilizza GPT-3.5-turbo per utilizzare Google e verificare la veridicit\u00e0 delle risposte generate dall'LLM. Il metodo \u00e8 stato chiamato Search-Augmented Factuality Evaluator (SAFE).<\/p>\n<p>SAFE innanzitutto suddivide la risposta in forma lunga del LLM in singoli fatti. Quindi invia richieste di ricerca a Google Search e valuta la veridicit\u00e0 del fatto in base alle informazioni contenute nei risultati della ricerca.<\/p>\n<p>Ecco un esempio dal sito <a href=\"https:\/\/arxiv.org\/pdf\/2403.18802.pdf\" target=\"_blank\" rel=\"noopener\">carta di ricerca<\/a>.<\/p>\n<figure id=\"attachment_11178\" aria-describedby=\"caption-attachment-11178\" style=\"width: 1352px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11178\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example.png\" alt=\"\" width=\"1352\" height=\"536\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example.png 1352w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-300x119.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-1024x406.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-768x304.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-example-60x24.png 60w\" sizes=\"auto, (max-width: 1352px) 100vw, 1352px\" \/><figcaption id=\"caption-attachment-11178\" class=\"wp-caption-text\">Una richiesta di ricerca di fatti genera una risposta di forma lunga. La risposta viene suddivisa in singoli fatti, rielaborata in modo da essere autonoma, controllata per la rilevanza e verificata con Google Search. Fonte: arXiv<\/figcaption><\/figure>\n<p>I ricercatori affermano che SAFE raggiunge \"prestazioni sovrumane\" rispetto agli annotatori umani che effettuano il fact-checking.<\/p>\n<p>SAFE si \u00e8 trovato d'accordo con il 72% delle annotazioni umane e, nei casi in cui si \u00e8 discostato dagli umani, ha avuto ragione il 76% delle volte. Inoltre, \u00e8 risultato 20 volte pi\u00f9 economico degli annotatori umani in crowdsourcing. Quindi, i LLM sono verificatori di fatti migliori e pi\u00f9 economici degli esseri umani.<\/p>\n<p>La qualit\u00e0 della risposta dei LLM testati \u00e8 stata misurata in base al numero di fatti nella risposta e al grado di veridicit\u00e0 dei singoli fatti.<\/p>\n<p>La metrica utilizzata (F1@K) stima il numero \"ideale\" di fatti preferito dall'uomo in una risposta. I test di riferimento hanno utilizzato 64 come mediana per K e 178 come massimo.<\/p>\n<p>In parole povere, F1@K \u00e8 una misura di \"La risposta mi ha fornito tutti i fatti che volevo?\" combinata con \"Quanti di questi fatti erano veri?\".<\/p>\n<h2>Qual \u00e8 l'LLM pi\u00f9 efficace?<\/h2>\n<p>I ricercatori hanno utilizzato LongFact per sollecitare 13 LLM delle famiglie Gemini, GPT, Claude e PaLM-2. Hanno poi utilizzato SAFE per valutare la fattualit\u00e0 delle loro risposte.<\/p>\n<p>Il GPT-4-Turbo \u00e8 in cima alla lista dei modelli pi\u00f9 concreti nella generazione di risposte lunghe. \u00c8 seguito da vicino da Gemini-Ultra e PaLM-2-L-IT-RLHF. I risultati hanno mostrato che gli LLM pi\u00f9 grandi sono pi\u00f9 fattuali di quelli pi\u00f9 piccoli.<\/p>\n<p>Il calcolo di F1@K probabilmente entusiasmerebbe gli scienziati dei dati, ma, per semplicit\u00e0, questi risultati di benchmark mostrano quanto ogni modello sia efficace quando restituisce risposte di lunghezza media e pi\u00f9 lunghe alle domande.<\/p>\n<figure id=\"attachment_11179\" aria-describedby=\"caption-attachment-11179\" style=\"width: 1366px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-11179\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results.png\" alt=\"\" width=\"1366\" height=\"602\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results.png 1366w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-300x132.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-1024x451.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-768x338.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/SAFE-benchmark-results-60x26.png 60w\" sizes=\"auto, (max-width: 1366px) 100vw, 1366px\" \/><figcaption id=\"caption-attachment-11179\" class=\"wp-caption-text\">Prestazioni di fattualit\u00e0 a forma lunga di 13 LLM con K = 64 (il numero mediano di fatti tra tutte le risposte del modello) e K = 178 (il numero massimo di fatti tra tutte le risposte del modello). Fonte: arXiv<\/figcaption><\/figure>\n<p>SAFE \u00e8 un modo economico ed efficace per quantificare la fattualit\u00e0 dei long-form LLM. \u00c8 pi\u00f9 veloce ed economico degli esseri umani nel fact-checking, ma dipende ancora dalla veridicit\u00e0 delle informazioni che Google restituisce nei risultati della ricerca.<\/p>\n<p>DeepMind ha rilasciato SAFE per l'uso pubblico e ha suggerito che potrebbe aiutare a migliorare la fattualit\u00e0 dei LLM attraverso un migliore preaddestramento e una messa a punto. Potrebbe anche consentire a un LLM di verificare i fatti prima di presentare l'output a un utente.<\/p>\n<p>OpenAI sar\u00e0 felice di vedere che una ricerca di Google mostra che GPT-4 batte Gemini in un altro benchmark.<\/p>","protected":false},"excerpt":{"rendered":"<p>Ricercatori di DeepMind e dell'Universit\u00e0 di Stanford hanno sviluppato un agente di IA che controlla i LLM e consente di effettuare un benchmarking della fattualit\u00e0 dei modelli di IA. Anche i migliori modelli di intelligenza artificiale tendono a volte ad avere allucinazioni. Se si chiede a ChatGPT di fornire i fatti su un argomento, pi\u00f9 lunga \u00e8 la sua risposta, pi\u00f9 \u00e8 probabile che includa alcuni fatti non veri. Quali modelli sono pi\u00f9 accurati di altri quando generano risposte pi\u00f9 lunghe? \u00c8 difficile dirlo, perch\u00e9 finora non avevamo un parametro di riferimento che misurasse la fattualit\u00e0 delle risposte lunghe di LLM. DeepMind ha utilizzato per la prima volta GPT-4 per creare LongFact, una serie di<\/p>","protected":false},"author":6,"featured_media":11182,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[147,118],"class_list":["post-11172","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-deepmind","tag-llms"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/it\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/\" \/>\n<meta property=\"og:locale\" content=\"it_IT\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response the more likely it is to include some facts that aren\u2019t true. Which models are more factually accurate than others when generating longer answers? It\u2019s hard to say because until now, we didn\u2019t have a benchmark measuring the factuality of LLM long-form responses. DeepMind first used GPT-4 to create LongFact, a set of\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/it\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-02T09:32:07+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Scritto da\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tempo di lettura stimato\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minuti\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs\",\"datePublished\":\"2024-04-02T09:32:07+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"},\"wordCount\":611,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"keywords\":[\"DeepMind\",\"LLMS\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"it-IT\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\",\"name\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"datePublished\":\"2024-04-02T09:32:07+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#breadcrumb\"},\"inLanguage\":\"it-IT\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"it-IT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/fact-vs-fake.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"DeepMind developed SAFE, an AI agent to fact-check LLMs\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"it-IT\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"it-IT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"it-IT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/it\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"DeepMind ha sviluppato SAFE, un agente di intelligenza artificiale per la verifica dei libri di economia e finanza | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/it\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","og_locale":"it_IT","og_type":"article","og_title":"DeepMind developed SAFE, an AI agent to fact-check LLMs | DailyAI","og_description":"Researchers from DeepMind and Stanford University developed an AI agent that fact-checks LLMs and enables benchmarking of AI model factuality. Even the best AI models still tend to hallucinate at times. If you ask ChatGPT to give you the facts about a topic, the longer its response the more likely it is to include some facts that aren\u2019t true. Which models are more factually accurate than others when generating longer answers? It\u2019s hard to say because until now, we didn\u2019t have a benchmark measuring the factuality of LLM long-form responses. DeepMind first used GPT-4 to create LongFact, a set of","og_url":"https:\/\/dailyai.com\/it\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","og_site_name":"DailyAI","article_published_time":"2024-04-02T09:32:07+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Scritto da":"Eugene van der Watt","Tempo di lettura stimato":"4 minuti"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"DeepMind developed SAFE, an AI agent to fact-check LLMs","datePublished":"2024-04-02T09:32:07+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"},"wordCount":611,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","keywords":["DeepMind","LLMS"],"articleSection":["Industry"],"inLanguage":"it-IT"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","url":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/","name":"DeepMind ha sviluppato SAFE, un agente di intelligenza artificiale per la verifica dei libri di economia e finanza | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","datePublished":"2024-04-02T09:32:07+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#breadcrumb"},"inLanguage":"it-IT","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/"]}]},{"@type":"ImageObject","inLanguage":"it-IT","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/fact-vs-fake.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/deepmind-developed-safe-an-ai-agent-to-fact-check-llms\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"DeepMind developed SAFE, an AI agent to fact-check LLMs"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"La vostra dose quotidiana di notizie sull'intelligenza artificiale","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"it-IT"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"it-IT","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"it-IT","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene proviene da un background di ingegneria elettronica e ama tutto ci\u00f2 che \u00e8 tecnologico. Quando si prende una pausa dal consumo di notizie sull'intelligenza artificiale, lo si pu\u00f2 trovare al tavolo da biliardo.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/it\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts\/11172","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/comments?post=11172"}],"version-history":[{"count":2,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts\/11172\/revisions"}],"predecessor-version":[{"id":11181,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts\/11172\/revisions\/11181"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/media\/11182"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/media?parent=11172"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/categories?post=11172"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/tags?post=11172"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}