{"id":11632,"date":"2024-04-17T11:48:55","date_gmt":"2024-04-17T11:48:55","guid":{"rendered":"https:\/\/dailyai.com\/?p=11632"},"modified":"2024-04-17T11:48:55","modified_gmt":"2024-04-17T11:48:55","slug":"report-ai-is-advancing-beyond-humans-we-need-new-benchmarks","status":"publish","type":"post","link":"https:\/\/dailyai.com\/es\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","title":{"rendered":"Report: AI is advancing beyond humans, we need new benchmarks"},"content":{"rendered":"<p><strong>Stanford University released its AI Index Report 2024, which noted that AI's rapid advancement makes benchmark comparisons with humans increasingly less relevant.<\/strong><\/p>\n<p>The <a href=\"https:\/\/aiindex.stanford.edu\/wp-content\/uploads\/2024\/04\/HAI_AI-Index-Report-2024.pdf\" target=\"_blank\" rel=\"noopener\">annual report<\/a> provides a comprehensive view of the trends and state of AI development. The report says that AI models are improving so fast that the benchmarks we use to measure them are becoming increasingly irrelevant.<\/p>\n<p>Many industry benchmarks compare AI models with how well humans perform tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example.<\/p>\n<p>It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history, law, and ethics. The MMLU has been the go-to AI benchmark since 2019.<\/p>\n<p>The human baseline score on the MMLU is 89.8%, and back in 2019, the average AI model scored just over 30%. 
Just 5 years later, Gemini Ultra became the first model to beat the human baseline, with a score of 90.04%.<\/p>\n<p>The report notes that current \"AI systems routinely exceed human performance on standard benchmarks.\" The trends in the graph below seem to indicate that the MMLU and other benchmarks need replacing.<\/p>\n<figure id=\"attachment_11647\" aria-describedby=\"caption-attachment-11647\" style=\"width: 1396px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11647 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png\" alt=\"\" width=\"1396\" height=\"942\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png 1396w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-300x202.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-1024x691.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-768x518.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-60x40.png 60w\" sizes=\"auto, (max-width: 1396px) 100vw, 1396px\" \/><figcaption id=\"caption-attachment-11647\" class=\"wp-caption-text\">AI models have reached and exceeded human baselines on multiple benchmarks. 
Source: The AI Index 2024 Annual Report<\/figcaption><\/figure>\n<p>AI models have reached performance saturation on established benchmarks such as ImageNet, SQuAD, and SuperGLUE, so researchers are developing more challenging tests.<\/p>\n<p>One example is the Graduate-Level Google-Proof Q&amp;A Benchmark (GPQA), which allows AI models to be compared with really smart people rather than with average human intelligence.<\/p>\n<p>The GPQA test consists of 400 graduate-level multiple-choice questions. Experts who have or are pursuing a PhD answer the questions correctly 65% of the time.<\/p>\n<p>The GPQA paper says that when asked questions outside their field, \"highly skilled non-expert validators only reach 34% accuracy, despite spending on average over 30 minutes with unrestricted access to the web.\"<\/p>\n<p>Last month Anthropic announced that <a href=\"https:\/\/dailyai.com\/es\/2024\/04\/claude-3-opus-blows-all-llms-away-in-book-length-summarization\/\">Claude 3<\/a> scored just under 60% with 5-shot chain-of-thought (CoT) prompting. We're going to need a bigger benchmark.<\/p>\n<blockquote class=\"twitter-tweet\">\n<p dir=\"ltr\" lang=\"en\">Claude 3 gets ~60% accuracy on GPQA. It's hard for me to understate how hard these questions are: literal PhDs (in different domains from the questions) with access to the internet get 34%.<\/p>\n<p>PhDs *in the same domain* (also with internet access!) get 65% - 75% accuracy. 
<a href=\"https:\/\/t.co\/ARAiCNXgU9\">https:\/\/t.co\/ARAiCNXgU9<\/a> <a href=\"https:\/\/t.co\/PH8J13zIef\">pic.twitter.com\/PH8J13zIef<\/a><\/p>\n<p>- david rein (@idavidrein) <a href=\"https:\/\/twitter.com\/idavidrein\/status\/1764675668175094169?ref_src=twsrc%5Etfw\">March 4, 2024<\/a><\/p><\/blockquote>\n<p><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<h2>Human evaluations and safety<\/h2>\n<p>The report noted that AI still faces significant problems: \"It cannot reliably deal with facts, perform complex reasoning, or explain its conclusions.\"<\/p>\n<p>These limitations contribute to another characteristic of AI systems that the report says is poorly measured: <a href=\"https:\/\/dailyai.com\/es\/2024\/04\/just-2-of-ai-research-is-looking-at-safety-says-georgetown-university-study\/\">AI safety<\/a>. We don't have effective benchmarks that allow us to say, \"This model is safer than that one.\"<\/p>\n<p>That is partly because safety is difficult to measure, and partly because \"AI developers lack transparency, especially regarding the disclosure of training data and methodologies.\"<\/p>\n<p>The report notes that an interesting trend in the industry is to turn to human evaluations of AI performance rather than benchmark tests.<\/p>\n<p>Ranking the aesthetics of a model's images or the quality of its prose is difficult to do with a test. 
As a result, the report says that \"benchmarking has slowly started shifting toward incorporating human evaluations like the Chatbot Arena Leaderboard rather than computerized rankings like ImageNet or SQuAD.\"<\/p>\n<p>As AI models watch the human baseline disappear in the rear-view mirror, sentiment may end up determining which model we choose to use.<\/p>\n<p>The trends indicate that AI models will eventually be smarter than us and harder to measure. We may soon find ourselves saying, \"I don't know why, but I like this one better.\"<\/p>","protected":false},"excerpt":{"rendered":"<p>Stanford University released its AI Index Report 2024, which noted that AI's rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive view of the trends and state of AI development. The report says that AI models are improving so fast that the benchmarks we use to measure them are becoming increasingly irrelevant. Many industry benchmarks compare AI models with how well humans perform tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. 
It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,<\/p>","protected":false},"author":6,"featured_media":11650,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[103,99],"class_list":["post-11632","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-debate","tag-ai-race"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Report: AI is advancing beyond humans, we need new benchmarks | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/es\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:locale\" content=\"es_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. 
It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/es\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-17T11:48:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tiempo de lectura\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Report: AI is advancing beyond humans, we need new 
benchmarks\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"wordCount\":601,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"keywords\":[\"AI debate\",\"AI race\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"es\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\"},\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-
new-benchmarks\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"es\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/
www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/es\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Informe: La IA avanza m\u00e1s que los humanos, necesitamos nuevos puntos de referencia | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/es\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_locale":"es_ES","og_type":"article","og_title":"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI","og_description":"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. 
A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,","og_url":"https:\/\/dailyai.com\/es\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_site_name":"DailyAI","article_published_time":"2024-04-17T11:48:55+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Escrito por":"Eugene van der Watt","Tiempo de lectura":"3 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Report: AI is advancing beyond humans, we need new benchmarks","datePublished":"2024-04-17T11:48:55+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"wordCount":601,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","keywords":["AI debate","AI 
race"],"articleSection":["Industry"],"inLanguage":"es"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","url":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","name":"Informe: La IA avanza m\u00e1s que los humanos, necesitamos nuevos puntos de referencia | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","datePublished":"2024-04-17T11:48:55+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb"},"inLanguage":"es","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"]}]},{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Report: AI is advancing beyond humans, we need new benchmarks"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Su dosis diaria de noticias sobre 
IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"es"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene es ingeniero electr\u00f3nico y le encanta todo lo relacionado con la tecnolog\u00eda. 
Cuando descansa de consumir noticias sobre IA, lo encontrar\u00e1 jugando al billar.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/es\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/11632","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/comments?post=11632"}],"version-history":[{"count":3,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/11632\/revisions"}],"predecessor-version":[{"id":11652,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/11632\/revisions\/11652"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/media\/11650"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/media?parent=11632"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/categories?post=11632"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/tags?post=11632"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}