{"id":11632,"date":"2024-04-17T11:48:55","date_gmt":"2024-04-17T11:48:55","guid":{"rendered":"https:\/\/dailyai.com\/?p=11632"},"modified":"2024-04-17T11:48:55","modified_gmt":"2024-04-17T11:48:55","slug":"report-ai-is-advancing-beyond-humans-we-need-new-benchmarks","status":"publish","type":"post","link":"https:\/\/dailyai.com\/nl\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","title":{"rendered":"Rapport: AI gaat verder dan mensen, we hebben nieuwe benchmarks nodig"},"content":{"rendered":"<p><strong>Stanford University publiceerde zijn AI Index Report 2024 waarin wordt opgemerkt dat de snelle vooruitgang van AI benchmarkvergelijkingen met mensen steeds minder relevant maakt.<\/strong><\/p>\n<p>De <a href=\"https:\/\/aiindex.stanford.edu\/wp-content\/uploads\/2024\/04\/HAI_AI-Index-Report-2024.pdf\" target=\"_blank\" rel=\"noopener\">jaarverslag<\/a> geeft een uitgebreid inzicht in de trends en de stand van de AI-ontwikkelingen. Het rapport zegt dat AI-modellen nu zo snel verbeteren dat de benchmarks die we gebruiken om ze te meten steeds minder relevant worden.<\/p>\n<p>Veel industri\u00eble benchmarks vergelijken AI-modellen met hoe goed mensen zijn in het uitvoeren van taken. De MMLU-benchmark (Massive Multitask Language Understanding) is daar een goed voorbeeld van.<\/p>\n<p>Het gebruikt meerkeuzevragen om LLM's te evalueren in 57 vakken, waaronder wiskunde, geschiedenis, rechten en ethiek. Sinds 2019 is de MMLU de AI-benchmark bij uitstek.<\/p>\n<p>De menselijke basisscore op het MMLU is 89,8% en in 2019 scoorde het gemiddelde AI-model iets meer dan 30%. Slechts 5 jaar later werd Gemini Ultra het eerste model dat de menselijke baseline versloeg met een score van 90,04%.<\/p>\n<p>Het rapport merkt op dat de huidige \"AI-systemen routinematig de menselijke prestaties op standaard benchmarks overtreffen\". De trends in de onderstaande grafiek lijken erop te wijzen dat het MMLU en andere benchmarks aan vervanging toe zijn.<\/p>\n<figure id=\"attachment_11647\" aria-describedby=\"caption-attachment-11647\" style=\"width: 1396px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11647 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png\" alt=\"\" width=\"1396\" height=\"942\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png 1396w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-300x202.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-1024x691.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-768x518.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-60x40.png 60w\" sizes=\"auto, (max-width: 1396px) 100vw, 1396px\" \/><figcaption id=\"caption-attachment-11647\" class=\"wp-caption-text\">AI-modellen hebben in meerdere benchmarks menselijke referentiepunten bereikt en overtroffen. Bron: The AI Index 2024 Annual Report<\/figcaption><\/figure>\n<p>AI-modellen hebben prestatieverzadiging bereikt op gevestigde benchmarks zoals ImageNet, SQuAD en SuperGLUE, dus onderzoekers ontwikkelen meer uitdagende tests.<\/p>\n<p>Een voorbeeld is de Graduate-Level Google-Proof Q&amp;A Benchmark (GPQA), waarmee AI-modellen kunnen worden vergeleken met echt slimme mensen in plaats van met de gemiddelde menselijke intelligentie.<\/p>\n<p>De GPQA test bestaat uit 400 moeilijke meerkeuzevragen op doctoraatsniveau. Experts die gepromoveerd zijn of nog promoveren beantwoorden de vragen 65% van de tijd correct.<\/p>\n<p>Het GPQA-paper zegt dat wanneer vragen buiten hun vakgebied worden gesteld, \"hoogopgeleide, niet-deskundige validators slechts een nauwkeurigheid van 34% bereiken, ondanks het feit dat ze gemiddeld meer dan 30 minuten onbeperkte toegang tot het web hebben\".<\/p>\n<p>Vorige maand kondigde Anthropic aan dat <a href=\"https:\/\/dailyai.com\/nl\/2024\/04\/claude-3-opus-blows-all-llms-away-in-book-length-summarization\/\">Claude 3<\/a> scoorde net onder de 60% met 5-schots CoT prompting. We hebben een grotere benchmark nodig.<\/p>\n<blockquote class=\"twitter-tweet\">\n<p dir=\"ltr\" lang=\"en\">Claude 3 haalt ~60% nauwkeurigheid op GPQA. Ik kan moeilijk onderschatten hoe moeilijk deze vragen zijn - letterlijke PhD's (in andere domeinen dan de vragen) met toegang tot het internet halen 34%.<\/p>\n<p>PhD's *in hetzelfde domein* (ook met internettoegang!) krijgen 65% - 75% nauwkeurigheid. <a href=\"https:\/\/t.co\/ARAiCNXgU9\">https:\/\/t.co\/ARAiCNXgU9<\/a> <a href=\"https:\/\/t.co\/PH8J13zIef\">pic.twitter.com\/PH8J13zIef<\/a><\/p>\n<p>- david rein (@idavidrein) <a href=\"https:\/\/twitter.com\/idavidrein\/status\/1764675668175094169?ref_src=twsrc%5Etfw\">4 maart 2024<\/a><\/p><\/blockquote>\n<p><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<h2>Menselijke evaluaties en veiligheid<\/h2>\n<p>In het rapport wordt opgemerkt dat AI nog steeds met grote problemen kampt: \"Het kan niet betrouwbaar omgaan met feiten, complexe redeneringen uitvoeren of zijn conclusies uitleggen.\"<\/p>\n<p>Deze beperkingen dragen bij aan een ander kenmerk van het AI-systeem dat volgens het rapport slecht wordt gemeten; <a href=\"https:\/\/dailyai.com\/nl\/2024\/04\/just-2-of-ai-research-is-looking-at-safety-says-georgetown-university-study\/\">AI-veiligheid<\/a>. We hebben geen effectieve benchmarks waarmee we kunnen zeggen: \"Dit model is veiliger dan dat\".<\/p>\n<p>Dat komt deels omdat het moeilijk te meten is en deels omdat \"AI-ontwikkelaars niet transparant zijn, met name wat betreft de openbaarmaking van trainingsgegevens en methodologie\u00ebn\".<\/p>\n<p>In het rapport wordt opgemerkt dat een interessante trend in de industrie is om menselijke evaluaties van AI-prestaties te crowd-sourcen, in plaats van benchmarktests.<\/p>\n<p>Het beoordelen van de beeldesthetiek of het proza van een model is moeilijk te doen met een test. Als gevolg hiervan zegt het rapport dat \"benchmarking langzaam verschuift naar het opnemen van menselijke evaluaties zoals het Chatbot Arena Leaderboard in plaats van geautomatiseerde ranglijsten zoals ImageNet of SQuAD.\"<\/p>\n<p>Terwijl AI-modellen de menselijke basislijn in de achteruitkijkspiegel zien verdwijnen, kan het sentiment uiteindelijk bepalen welk model we kiezen te gebruiken.<\/p>\n<p>De trends geven aan dat AI-modellen uiteindelijk slimmer zullen zijn dan wij en moeilijker te meten. Misschien zullen we binnenkort zeggen: \"Ik weet niet waarom, maar ik vind deze gewoon beter.\"<\/p>","protected":false},"excerpt":{"rendered":"<p>Stanford University heeft zijn AI Index Report 2024 uitgebracht waarin wordt opgemerkt dat de snelle vooruitgang van AI benchmarkvergelijkingen met mensen steeds minder relevant maakt. Het jaarlijkse rapport geeft een uitgebreid inzicht in de trends en de stand van de AI-ontwikkelingen. Het rapport zegt dat AI-modellen nu zo snel verbeteren dat de benchmarks die we gebruiken om ze te meten steeds minder relevant worden. Veel benchmarks in de industrie vergelijken AI-modellen met hoe goed mensen zijn in het uitvoeren van taken. De Massive Multitask Language Understanding (MMLU) benchmark is een goed voorbeeld. Het gebruikt meerkeuzevragen om LLM's te evalueren in 57 onderwerpen, waaronder wiskunde, geschiedenis,<\/p>","protected":false},"author":6,"featured_media":11650,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[103,99],"class_list":["post-11632","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-debate","tag-ai-race"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Report: AI is advancing beyond humans, we need new benchmarks | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/nl\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:locale\" content=\"nl_NL\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/nl\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-17T11:48:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Geschreven door\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Geschatte leestijd\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Report: AI is advancing beyond humans, we need new benchmarks\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"wordCount\":601,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"keywords\":[\"AI debate\",\"AI race\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"nl-NL\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\"},\"inLanguage\":\"nl-NL\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"nl-NL\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/nl\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Rapport: AI gaat verder dan mensen, we hebben nieuwe benchmarks nodig | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/nl\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_locale":"nl_NL","og_type":"article","og_title":"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI","og_description":"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,","og_url":"https:\/\/dailyai.com\/nl\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_site_name":"DailyAI","article_published_time":"2024-04-17T11:48:55+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Geschreven door":"Eugene van der Watt","Geschatte leestijd":"3 minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Report: AI is advancing beyond humans, we need new benchmarks","datePublished":"2024-04-17T11:48:55+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"wordCount":601,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","keywords":["AI debate","AI race"],"articleSection":["Industry"],"inLanguage":"nl-NL"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","url":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","name":"Rapport: AI gaat verder dan mensen, we hebben nieuwe benchmarks nodig | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","datePublished":"2024-04-17T11:48:55+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb"},"inLanguage":"nl-NL","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"]}]},{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Report: AI is advancing beyond humans, we need new benchmarks"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Uw dagelijkse dosis AI-nieuws","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"nl-NL"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene heeft een achtergrond in elektrotechniek en houdt van alles wat met techniek te maken heeft. Als hij even pauzeert van het consumeren van AI-nieuws, kun je hem aan de snookertafel vinden.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/nl\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/11632","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/comments?post=11632"}],"version-history":[{"count":3,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/11632\/revisions"}],"predecessor-version":[{"id":11652,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/11632\/revisions\/11652"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/media\/11650"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/media?parent=11632"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/categories?post=11632"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/tags?post=11632"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}