{"id":11632,"date":"2024-04-17T11:48:55","date_gmt":"2024-04-17T11:48:55","guid":{"rendered":"https:\/\/dailyai.com\/?p=11632"},"modified":"2024-04-17T11:48:55","modified_gmt":"2024-04-17T11:48:55","slug":"report-ai-is-advancing-beyond-humans-we-need-new-benchmarks","status":"publish","type":"post","link":"https:\/\/dailyai.com\/it\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","title":{"rendered":"Report: AI is advancing beyond humans, we need new benchmarks"},"content":{"rendered":"<p><strong>Stanford University has released its AI Index Report 2024, which notes that AI's rapid advancement makes benchmark comparisons with humans increasingly less relevant.<\/strong><\/p>\n<p>The <a href=\"https:\/\/aiindex.stanford.edu\/wp-content\/uploads\/2024\/04\/HAI_AI-Index-Report-2024.pdf\" target=\"_blank\" rel=\"noopener\">annual report<\/a> provides a comprehensive view of the trends and state of AI development. It says that AI models are now improving so fast that the benchmarks used to measure them are becoming increasingly irrelevant.<\/p>\n<p>Many industry benchmarks compare AI models with how well humans perform the same tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example.<\/p>\n<p>It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history, law, and ethics. MMLU has been the go-to AI benchmark since 2019.<\/p>\n<p>The human baseline score on MMLU is 89.8%, and in 2019 the average AI model scored just over 30%. 
Just 5 years later, Gemini Ultra became the first model to beat the human baseline with a score of 90.04%.<\/p>\n<p>The report notes that \"current AI systems routinely exceed human performance on standard benchmarks\". The trends in the chart below suggest that MMLU and other benchmarks need to be replaced.<\/p>\n<figure id=\"attachment_11647\" aria-describedby=\"caption-attachment-11647\" style=\"width: 1396px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11647 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png\" alt=\"\" width=\"1396\" height=\"942\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png 1396w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-300x202.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-1024x691.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-768x518.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-60x40.png 60w\" sizes=\"auto, (max-width: 1396px) 100vw, 1396px\" \/><figcaption id=\"caption-attachment-11647\" class=\"wp-caption-text\">AI models have reached and surpassed human baselines in multiple benchmarks. 
Source: The AI Index 2024 Annual Report<\/figcaption><\/figure>\n<p>AI models have reached performance saturation on established benchmarks such as ImageNet, SQuAD, and SuperGLUE, so researchers are developing more challenging tests.<\/p>\n<p>One example is the Graduate-Level Google-Proof Q&amp;A Benchmark (GPQA), which allows AI models to be benchmarked against really smart people rather than average human intelligence.<\/p>\n<p>The GPQA test consists of 400 graduate-level multiple-choice questions. Experts who have or are pursuing a PhD answer the questions correctly 65% of the time.<\/p>\n<p>The GPQA paper says that when asked questions outside their field, \"highly skilled non-expert validators only reach 34% accuracy, despite spending on average over 30 minutes with unrestricted access to the web\".<\/p>\n<p>Last month Anthropic announced that <a href=\"https:\/\/dailyai.com\/it\/2024\/04\/claude-3-opus-blows-all-llms-away-in-book-length-summarization\/\">Claude 3<\/a> scored just under 60% with 5-shot CoT prompting. We're going to need a bigger benchmark.<\/p>\n<blockquote class=\"twitter-tweet\">\n<p dir=\"ltr\" lang=\"en\">Claude 3 gets ~60% accuracy on GPQA. It's hard for me to understate how hard these questions are: literal PhDs (in different domains from the questions) with access to the internet get 34%.<\/p>\n<p>PhDs *in the same domain* (also with internet access!) get 65% - 75% accuracy. 
<a href=\"https:\/\/t.co\/ARAiCNXgU9\">https:\/\/t.co\/ARAiCNXgU9<\/a> <a href=\"https:\/\/t.co\/PH8J13zIef\">pic.twitter.com\/PH8J13zIef<\/a><\/p>\n<p>- david rein (@idavidrein) <a href=\"https:\/\/twitter.com\/idavidrein\/status\/1764675668175094169?ref_src=twsrc%5Etfw\">March 4, 2024<\/a><\/p><\/blockquote>\n<p><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<h2>Human evaluations and safety<\/h2>\n<p>The report notes that AI still faces significant problems: \"It cannot reliably deal with facts, perform complex reasoning, or explain its conclusions\".<\/p>\n<p>Those limitations contribute to another characteristic of AI systems that the report says is poorly measured: <a href=\"https:\/\/dailyai.com\/it\/2024\/04\/just-2-of-ai-research-is-looking-at-safety-says-georgetown-university-study\/\">AI safety<\/a>. We don't have effective benchmarks that allow us to say, \"This model is safer than that one\".<\/p>\n<p>That's partly because safety is difficult to measure, and partly because \"AI developers lack transparency, especially regarding the disclosure of training data and methodologies\".<\/p>\n<p>The report noted that an interesting trend in the industry is to crowd-source human evaluations of AI performance rather than rely on benchmark tests.<\/p>\n<p>Ranking the aesthetics or prose of a model's output is difficult to do with a test. 
As a result, the report says that \"benchmarking has slowly started shifting toward incorporating human evaluations like the Chatbot Arena Leaderboard rather than computerized rankings like ImageNet or SQuAD\".<\/p>\n<p>As AI models watch the human baseline disappear in the rear-view mirror, sentiment may eventually determine which model we choose to use.<\/p>\n<p>The trends indicate that AI models will eventually be smarter than us and harder to measure. We may soon find ourselves saying, \"I don't know why, but I just like this one better\".<\/p>","protected":false},"excerpt":{"rendered":"<p>Stanford University has released its AI Index Report 2024, which notes that AI's rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive view of the trends and state of AI development. It says that AI models are now improving so fast that the benchmarks we use to measure them are becoming increasingly irrelevant. Many industry benchmarks compare AI models with how well humans perform the same tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. 
It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math and history,<\/p>","protected":false},"author":6,"featured_media":11650,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[103,99],"class_list":["post-11632","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-debate","tag-ai-race"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Report: AI is advancing beyond humans, we need new benchmarks | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/it\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:locale\" content=\"it_IT\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. 
It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/it\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-17T11:48:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Scritto da\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tempo di lettura stimato\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minuti\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Report: AI is advancing beyond humans, we need new 
benchmarks\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"wordCount\":601,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"keywords\":[\"AI debate\",\"AI race\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"it-IT\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\"},\"inLanguage\":\"it-IT\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"it-IT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans
-we-need-new-benchmarks\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"it-IT\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"it-IT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\
"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"it-IT\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/it\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Rapporto: L'intelligenza artificiale sta superando gli esseri umani, servono nuovi parametri di riferimento | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/it\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_locale":"it_IT","og_type":"article","og_title":"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI","og_description":"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. 
A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,","og_url":"https:\/\/dailyai.com\/it\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_site_name":"DailyAI","article_published_time":"2024-04-17T11:48:55+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Scritto da":"Eugene van der Watt","Tempo di lettura stimato":"3 minuti"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Report: AI is advancing beyond humans, we need new benchmarks","datePublished":"2024-04-17T11:48:55+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"wordCount":601,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","keywords":["AI debate","AI 
race"],"articleSection":["Industry"],"inLanguage":"it-IT"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","url":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","name":"Rapporto: L'intelligenza artificiale sta superando gli esseri umani, servono nuovi parametri di riferimento | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","datePublished":"2024-04-17T11:48:55+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb"},"inLanguage":"it-IT","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"]}]},{"@type":"ImageObject","inLanguage":"it-IT","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Report: AI is advancing beyond humans, we need new benchmarks"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"La 
vostra dose quotidiana di notizie sull'intelligenza artificiale","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"it-IT"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"it-IT","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"it-IT","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene proviene da un background di ingegneria elettronica e ama tutto ci\u00f2 che \u00e8 tecnologico. 
Quando si prende una pausa dal consumo di notizie sull'intelligenza artificiale, lo si pu\u00f2 trovare al tavolo da biliardo.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/it\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts\/11632","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/comments?post=11632"}],"version-history":[{"count":3,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts\/11632\/revisions"}],"predecessor-version":[{"id":11652,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/posts\/11632\/revisions\/11652"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/media\/11650"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/media?parent=11632"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/categories?post=11632"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/it\/wp-json\/wp\/v2\/tags?post=11632"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}