{"id":11632,"date":"2024-04-17T11:48:55","date_gmt":"2024-04-17T11:48:55","guid":{"rendered":"https:\/\/dailyai.com\/?p=11632"},"modified":"2024-04-17T11:48:55","modified_gmt":"2024-04-17T11:48:55","slug":"report-ai-is-advancing-beyond-humans-we-need-new-benchmarks","status":"publish","type":"post","link":"https:\/\/dailyai.com\/de\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","title":{"rendered":"Bericht: KI entwickelt sich weiter als der Mensch, wir brauchen neue Ma\u00dfst\u00e4be"},"content":{"rendered":"<p><strong>Die Stanford University hat ihren AI Index Report 2024 ver\u00f6ffentlicht, in dem festgestellt wird, dass die rasanten Fortschritte der KI Vergleiche mit dem Menschen immer weniger relevant machen.<\/strong><\/p>\n<p>Die <a href=\"https:\/\/aiindex.stanford.edu\/wp-content\/uploads\/2024\/04\/HAI_AI-Index-Report-2024.pdf\" target=\"_blank\" rel=\"noopener\">Jahresbericht<\/a> bietet einen umfassenden Einblick in die Trends und den Stand der KI-Entwicklungen. Der Bericht besagt, dass sich KI-Modelle inzwischen so schnell verbessern, dass die Benchmarks, mit denen wir sie messen, zunehmend irrelevant werden.<\/p>\n<p>In vielen Benchmarks der Industrie werden KI-Modelle damit verglichen, wie gut Menschen bei der Ausf\u00fchrung von Aufgaben sind. Der Massive Multitask Language Understanding (MMLU)-Benchmark ist ein gutes Beispiel daf\u00fcr.<\/p>\n<p>Anhand von Multiple-Choice-Fragen werden LLMs in 57 F\u00e4chern bewertet, darunter Mathematik, Geschichte, Recht und Ethik. Der MMLU ist seit 2019 der wichtigste AI-Benchmark.<\/p>\n<p>Der menschliche Basiswert beim MMLU liegt bei 89,8%, und im Jahr 2019 erzielte das durchschnittliche KI-Modell knapp \u00fcber 30%. Nur 5 Jahre sp\u00e4ter war Gemini Ultra das erste Modell, das mit einem Wert von 90,04% den menschlichen Ausgangswert \u00fcbertraf.<\/p>\n<p>Der Bericht stellt fest, dass aktuelle \"KI-Systeme routinem\u00e4\u00dfig die menschliche Leistung bei Standard-Benchmarks \u00fcbertreffen\". Die Trends in der nachstehenden Grafik scheinen darauf hinzudeuten, dass die MMLU und andere Benchmarks ersetzt werden m\u00fcssen.<\/p>\n<figure id=\"attachment_11647\" aria-describedby=\"caption-attachment-11647\" style=\"width: 1396px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11647 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png\" alt=\"\" width=\"1396\" height=\"942\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png 1396w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-300x202.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-1024x691.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-768x518.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-60x40.png 60w\" sizes=\"auto, (max-width: 1396px) 100vw, 1396px\" \/><figcaption id=\"caption-attachment-11647\" class=\"wp-caption-text\">KI-Modelle haben in mehreren Benchmarks die menschlichen Grundwerte erreicht und \u00fcbertroffen. Quelle: The AI Index 2024 Jahresbericht<\/figcaption><\/figure>\n<p>KI-Modelle haben bei etablierten Benchmarks wie ImageNet, SQuAD und SuperGLUE eine Leistungss\u00e4ttigung erreicht, so dass die Forscher anspruchsvollere Tests entwickeln.<\/p>\n<p>Ein Beispiel ist der Graduate-Level Google-Proof Q&amp;A Benchmark (GPQA), der es erm\u00f6glicht, KI-Modelle mit wirklich intelligenten Menschen zu vergleichen, anstatt mit der durchschnittlichen menschlichen Intelligenz.<\/p>\n<p>Der GPQA-Test besteht aus 400 anspruchsvollen Multiple-Choice-Fragen f\u00fcr Hochschulabsolventen. Experten, die promoviert haben oder promovieren, beantworten die Fragen in 65% der F\u00e4lle richtig.<\/p>\n<p>In dem GPQA-Papier hei\u00dft es, dass \"hochqualifizierte, nicht fachkundige Validierer bei Fragen au\u00dferhalb ihres Fachgebiets nur eine Genauigkeit von 34% erreichen, obwohl sie im Durchschnitt \u00fcber 30 Minuten mit uneingeschr\u00e4nktem Zugang zum Internet verbringen\".<\/p>\n<p>Letzten Monat gab Anthropic bekannt, dass <a href=\"https:\/\/dailyai.com\/de\/2024\/04\/claude-3-opus-blows-all-llms-away-in-book-length-summarization\/\">Claude 3<\/a> knapp unter 60% mit 5-Schuss-CoT-Eingabeaufforderung erzielt. Wir werden einen gr\u00f6\u00dferen Benchmark brauchen.<\/p>\n<blockquote class=\"twitter-tweet\">\n<p dir=\"ltr\" lang=\"en\">Claude 3 erreicht ~60% Genauigkeit bei GPQA. Es f\u00e4llt mir schwer, zu betonen, wie schwer diese Fragen sind - echte Doktoranden (in anderen Bereichen als den Fragen) mit Internetzugang erreichen 34%.<\/p>\n<p>Doktoranden *im gleichen Bereich* (auch mit Internetzugang!) erhalten 65% - 75% Genauigkeit. <a href=\"https:\/\/t.co\/ARAiCNXgU9\">https:\/\/t.co\/ARAiCNXgU9<\/a> <a href=\"https:\/\/t.co\/PH8J13zIef\">pic.twitter.com\/PH8J13zIef<\/a><\/p>\n<p>- david rein (@idavidrein) <a href=\"https:\/\/twitter.com\/idavidrein\/status\/1764675668175094169?ref_src=twsrc%5Etfw\">4. M\u00e4rz 2024<\/a><\/p><\/blockquote>\n<p><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<h2>Menschliche Bewertungen und Sicherheit<\/h2>\n<p>In dem Bericht wird festgestellt, dass die KI noch mit erheblichen Problemen zu k\u00e4mpfen hat: \"Sie kann nicht zuverl\u00e4ssig mit Fakten umgehen, komplexe \u00dcberlegungen anstellen oder ihre Schlussfolgerungen erkl\u00e4ren.\"<\/p>\n<p>Diese Einschr\u00e4nkungen tragen zu einem weiteren Merkmal des KI-Systems bei, das dem Bericht zufolge nur unzureichend gemessen wird; <a href=\"https:\/\/dailyai.com\/de\/2024\/04\/just-2-of-ai-research-is-looking-at-safety-says-georgetown-university-study\/\">AI-Sicherheit<\/a>. Wir haben keine effektiven Benchmarks, die es uns erlauben zu sagen: \"Dieses Modell ist sicherer als jenes\".<\/p>\n<p>Das liegt zum einen daran, dass sie schwer zu messen ist, und zum anderen daran, dass es den KI-Entwicklern an Transparenz mangelt, insbesondere was die Offenlegung von Trainingsdaten und -methoden angeht.<\/p>\n<p>In dem Bericht wird festgestellt, dass ein interessanter Trend in der Branche darin besteht, menschliche Bewertungen der KI-Leistung anstelle von Benchmark-Tests vorzunehmen.<\/p>\n<p>Die Bewertung der Bild\u00e4sthetik oder der Prosa eines Modells l\u00e4sst sich nur schwer mit einem Test durchf\u00fchren. Daher hei\u00dft es in dem Bericht, dass \"das Benchmarking langsam dazu \u00fcbergeht, menschliche Bewertungen wie das Chatbot Arena Leaderboard anstelle von computerisierten Rankings wie ImageNet oder SQuAD einzubeziehen\".<\/p>\n<p>Da die KI-Modelle die menschliche Basislinie im R\u00fcckspiegel verschwinden sehen, k\u00f6nnte die Stimmung letztendlich bestimmen, welches Modell wir verwenden werden.<\/p>\n<p>Die Trends deuten darauf hin, dass KI-Modelle irgendwann schlauer sein werden als wir und schwieriger zu messen. Vielleicht werden wir bald sagen: \"Ich wei\u00df nicht, warum, aber dieses Modell gef\u00e4llt mir einfach besser\".<\/p>","protected":false},"excerpt":{"rendered":"<p>Die Stanford University hat ihren AI Index Report 2024 ver\u00f6ffentlicht, in dem festgestellt wird, dass die rasanten Fortschritte der KI Vergleiche mit dem Menschen immer weniger relevant machen. Der j\u00e4hrliche Bericht bietet einen umfassenden Einblick in die Trends und den Stand der KI-Entwicklungen. In dem Bericht hei\u00dft es, dass sich KI-Modelle inzwischen so schnell verbessern, dass die Benchmarks, die wir zu ihrer Messung verwenden, zunehmend irrelevant werden. Viele Branchen-Benchmarks vergleichen KI-Modelle damit, wie gut Menschen Aufgaben erledigen k\u00f6nnen. Der Massive Multitask Language Understanding (MMLU)-Benchmark ist ein gutes Beispiel daf\u00fcr. Er verwendet Multiple-Choice-Fragen zur Bewertung von LLMs in 57 F\u00e4chern, darunter Mathematik, Geschichte,<\/p>","protected":false},"author":6,"featured_media":11650,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[103,99],"class_list":["post-11632","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-debate","tag-ai-race"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Report: AI is advancing beyond humans, we need new benchmarks | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/de\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/de\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-17T11:48:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Verfasst von\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data2\" content=\"3\u00a0Minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Report: AI is advancing beyond humans, we need new benchmarks\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"wordCount\":601,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"keywords\":[\"AI debate\",\"AI race\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"de\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"de\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/de\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Bericht: KI entwickelt sich weiter als der Mensch, wir brauchen neue Ma\u00dfst\u00e4be | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/de\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_locale":"de_DE","og_type":"article","og_title":"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI","og_description":"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,","og_url":"https:\/\/dailyai.com\/de\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_site_name":"DailyAI","article_published_time":"2024-04-17T11:48:55+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Verfasst von":"Eugene van der Watt","Gesch\u00e4tzte Lesezeit":"3\u00a0Minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Report: AI is advancing beyond humans, we need new benchmarks","datePublished":"2024-04-17T11:48:55+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"wordCount":601,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","keywords":["AI debate","AI race"],"articleSection":["Industry"],"inLanguage":"de"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","url":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","name":"Bericht: KI entwickelt sich weiter als der Mensch, wir brauchen neue Ma\u00dfst\u00e4be | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","datePublished":"2024-04-17T11:48:55+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"]}]},{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Report: AI is advancing beyond humans, we need new benchmarks"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Ihre t\u00e4gliche Dosis an AI-Nachrichten","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"de"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene kommt aus der Elektronikbranche und liebt alles, was mit Technik zu tun hat. Wenn er eine Pause vom Konsum von KI-Nachrichten einlegt, findet man ihn am Snookertisch.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/de\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts\/11632","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/comments?post=11632"}],"version-history":[{"count":3,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts\/11632\/revisions"}],"predecessor-version":[{"id":11652,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/posts\/11632\/revisions\/11652"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/media\/11650"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/media?parent=11632"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/categories?post=11632"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/de\/wp-json\/wp\/v2\/tags?post=11632"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}