{"id":11632,"date":"2024-04-17T11:48:55","date_gmt":"2024-04-17T11:48:55","guid":{"rendered":"https:\/\/dailyai.com\/?p=11632"},"modified":"2024-04-17T11:48:55","modified_gmt":"2024-04-17T11:48:55","slug":"report-ai-is-advancing-beyond-humans-we-need-new-benchmarks","status":"publish","type":"post","link":"https:\/\/dailyai.com\/sv\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","title":{"rendered":"Report: AI is advancing beyond humans, we need new benchmarks"},"content":{"rendered":"<p><strong>Stanford University released its AI Index Report 2024, which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant.<\/strong><\/p>\n<p>The <a href=\"https:\/\/aiindex.stanford.edu\/wp-content\/uploads\/2024\/04\/HAI_AI-Index-Report-2024.pdf\" target=\"_blank\" rel=\"noopener\">annual report<\/a> provides a comprehensive insight into the trends and state of AI development. According to the report, AI models are now improving so fast that the benchmarks we use to measure them are becoming increasingly irrelevant.<\/p>\n<p>Many industry benchmarks compare AI models with how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example.<\/p>\n<p>It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history, law, and ethics. MMLU has been the go-to AI benchmark since 2019.<\/p>\n<p>The human baseline score on MMLU is 89.8%, and back in 2019 the average AI model scored just over 30%. 
Just 5 years later, Gemini Ultra became the first model to beat the human baseline, with a score of 90.04%.<\/p>\n<p>The report notes that current \"AI systems routinely exceed human performance on standard benchmarks.\" The trends in the graph below seem to indicate that MMLU and other benchmarks need replacing.<\/p>\n<figure id=\"attachment_11647\" aria-describedby=\"caption-attachment-11647\" style=\"width: 1396px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11647 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png\" alt=\"\" width=\"1396\" height=\"942\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends.png 1396w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-300x202.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-1024x691.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-768x518.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarking-trends-60x40.png 60w\" sizes=\"auto, (max-width: 1396px) 100vw, 1396px\" \/><figcaption id=\"caption-attachment-11647\" class=\"wp-caption-text\">AI models have reached and surpassed human baselines in several benchmarks. 
Source: The AI Index 2024 Annual Report<\/figcaption><\/figure>\n<p>AI models have reached performance saturation on established benchmarks such as ImageNet, SQuAD, and SuperGLUE, so researchers are developing more challenging tests.<\/p>\n<p>One example is the Graduate-Level Google-Proof Q&amp;A Benchmark (GPQA), which allows AI models to be benchmarked against really smart people rather than average human intelligence.<\/p>\n<p>The GPQA test consists of 448 tough graduate-level multiple-choice questions. Experts who have or are pursuing PhDs answer the questions correctly 65% of the time.<\/p>\n<p>The GPQA paper says that, on questions outside their field, \"highly skilled non-expert validators only reach 34% accuracy, despite spending on average over 30 minutes with unrestricted access to the web.\"<\/p>\n<p>Last month, Anthropic announced that <a href=\"https:\/\/dailyai.com\/sv\/2024\/04\/claude-3-opus-blows-all-llms-away-in-book-length-summarization\/\">Claude 3<\/a> scored just under 60% with 5-shot CoT prompting. We\u2019re going to need a bigger benchmark.<\/p>\n<blockquote class=\"twitter-tweet\">\n<p dir=\"ltr\" lang=\"en\">Claude 3 gets ~60% accuracy on GPQA. It\u2019s hard for me to understate how hard these questions are - literal PhDs (in different domains from the questions) with access to the internet get 34%.<\/p>\n<p>PhDs *in the same domain* (also with internet access!) get 65% - 75% accuracy. 
<a href=\"https:\/\/t.co\/ARAiCNXgU9\">https:\/\/t.co\/ARAiCNXgU9<\/a> <a href=\"https:\/\/t.co\/PH8J13zIef\">pic.twitter.com\/PH8J13zIef<\/a><\/p>\n<p>- david rein (@idavidrein) <a href=\"https:\/\/twitter.com\/idavidrein\/status\/1764675668175094169?ref_src=twsrc%5Etfw\">March 4, 2024<\/a><\/p><\/blockquote>\n<p><script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<h2>Human evaluations and safety<\/h2>\n<p>The report notes that AI still faces significant problems: \"It cannot reliably deal with facts, perform complex reasoning, or explain its conclusions.\"<\/p>\n<p>These limitations contribute to another aspect of AI systems that the report says is poorly measured: <a href=\"https:\/\/dailyai.com\/sv\/2024\/04\/just-2-of-ai-research-is-looking-at-safety-says-georgetown-university-study\/\">AI safety<\/a>. We have no effective benchmarks that allow us to say, \"This model is safer than that one.\"<\/p>\n<p>That\u2019s partly because safety is difficult to measure, and partly because \"AI developers lack transparency, especially regarding the disclosure of training data and methodologies.\"<\/p>\n<p>The report notes that an interesting trend in the industry is to have humans evaluate AI performance rather than run benchmark tests.<\/p>\n<p>Ranking a model\u2019s image aesthetics or prose is difficult to do with a test. 
As a result, the report says that \"benchmarking has slowly started shifting toward incorporating human evaluations like the Chatbot Arena Leaderboard rather than computerized rankings like ImageNet or SQuAD.\"<\/p>\n<p>As AI models watch the human baseline disappear in the rear-view mirror, gut feeling may ultimately determine which model we choose to use.<\/p>\n<p>The trends suggest that AI models will eventually become smarter than us and harder to measure. Soon we may be saying, \"I don\u2019t know why, but I just like this one better.\"<\/p>","protected":false},"excerpt":{"rendered":"<p>Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. 
It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,<\/p>","protected":false},"author":6,"featured_media":11650,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[103,99],"class_list":["post-11632","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-debate","tag-ai-race"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Report: AI is advancing beyond humans, we need new benchmarks | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/sv\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:locale\" content=\"sv_SE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. 
It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/sv\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-17T11:48:55+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Skriven av\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Ber\u00e4knad l\u00e4stid\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minuter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Report: AI is advancing beyond humans, we need new 
benchmarks\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"},\"wordCount\":601,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"keywords\":[\"AI debate\",\"AI race\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"sv-SE\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\",\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"datePublished\":\"2024-04-17T11:48:55+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\"},\"inLanguage\":\"sv-SE\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"sv-SE\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans
-we-need-new-benchmarks\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/AI-benchmarks.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Report: AI is advancing beyond humans, we need new benchmarks\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"sv-SE\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"sv-SE\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\
"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"sv-SE\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/sv\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Rapport: AI utvecklas snabbare \u00e4n m\u00e4nniskan, vi beh\u00f6ver nya riktm\u00e4rken | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/sv\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_locale":"sv_SE","og_type":"article","og_title":"Report: AI is advancing beyond humans, we need new benchmarks | DailyAI","og_description":"Stanford University released its AI Index Report 2024 which noted that AI\u2019s rapid advancement makes benchmark comparisons with humans increasingly less relevant. The annual report provides a comprehensive insight into the trends and state of AI developments. The report says that AI models are improving so fast now that the benchmarks we use to measure them are increasingly becoming irrelevant. 
A lot of industry benchmarks compare AI models to how good humans are at performing tasks. The Massive Multitask Language Understanding (MMLU) benchmark is a good example. It uses multiple-choice questions to evaluate LLMs across 57 subjects, including math, history,","og_url":"https:\/\/dailyai.com\/sv\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","og_site_name":"DailyAI","article_published_time":"2024-04-17T11:48:55+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Skriven av":"Eugene van der Watt","Ber\u00e4knad l\u00e4stid":"3 minuter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Report: AI is advancing beyond humans, we need new benchmarks","datePublished":"2024-04-17T11:48:55+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"},"wordCount":601,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","keywords":["AI debate","AI 
race"],"articleSection":["Industry"],"inLanguage":"sv-SE"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","url":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/","name":"Rapport: AI utvecklas snabbare \u00e4n m\u00e4nniskan, vi beh\u00f6ver nya riktm\u00e4rken | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","datePublished":"2024-04-17T11:48:55+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb"},"inLanguage":"sv-SE","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/"]}]},{"@type":"ImageObject","inLanguage":"sv-SE","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/AI-benchmarks.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/report-ai-is-advancing-beyond-humans-we-need-new-benchmarks\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Report: AI is advancing beyond humans, we need new benchmarks"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DagligaAI","description":"Din dagliga dos av 
AI-nyheter","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"sv-SE"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DagligaAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"sv-SE","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"sv-SE","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene kommer fr\u00e5n en bakgrund som elektronikingenj\u00f6r och \u00e4lskar allt som har med teknik att g\u00f6ra. 
N\u00e4r han tar en paus fr\u00e5n att konsumera AI-nyheter hittar du honom vid snookerbordet.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/sv\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts\/11632","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/comments?post=11632"}],"version-history":[{"count":3,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts\/11632\/revisions"}],"predecessor-version":[{"id":11652,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/posts\/11632\/revisions\/11652"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/media\/11650"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/media?parent=11632"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/categories?post=11632"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/sv\/wp-json\/wp\/v2\/tags?post=11632"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}