{"id":11543,"date":"2024-04-14T16:29:00","date_gmt":"2024-04-14T16:29:00","guid":{"rendered":"https:\/\/dailyai.com\/?p=11543"},"modified":"2024-04-15T11:48:24","modified_gmt":"2024-04-15T11:48:24","slug":"xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa","status":"publish","type":"post","link":"https:\/\/dailyai.com\/fr\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","title":{"rendered":"xAI pr\u00e9sente Grok-1.5 et cr\u00e9e un nouveau benchmark appel\u00e9 RealWorldQA"},"content":{"rendered":"<p><strong>L'entreprise xAI d'Elon Musk a d\u00e9voil\u00e9 Grok-1.5, un mod\u00e8le d'IA multimodale con\u00e7u pour surpasser ses concurrents dans la compr\u00e9hension de sc\u00e9narios du monde r\u00e9el.\u00a0<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">Dans la lign\u00e9e d'autres logiciels, comme GPT-4V, le nouveau Grok-1.5 introduit le traitement visuel pour analyser tous les types de documents, de diagrammes, de captures d'\u00e9cran et de photographies.<\/span><\/p>\n<p><a href=\"https:\/\/x.ai\/blog\/grok-1.5\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Grok-1.5<\/span><\/a><span style=\"font-weight: 400;\"> gagne \u00e9galement du terrain dans les t\u00e2ches de texte, de codage et de math\u00e9matiques, obtenant 50,6% sur le benchmark MATH, 90% sur le benchmark GSM8K et 74,1% sur le benchmark HumanEval.\u00a0<\/span><\/p>\n<p>Cela place Grok-1.5 dans la cat\u00e9gorie des poids lourds du LLM, avec des scores en moyenne l\u00e9g\u00e8rement inf\u00e9rieurs \u00e0 ceux de Gemini Pro 1.5, GPT-4 et Claude 3 Opus.<\/p>\n<figure id=\"attachment_11546\" aria-describedby=\"caption-attachment-11546\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11546 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-1024x343.png\" alt=\"Grok\" width=\"1024\" height=\"343\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-1024x343.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-300x100.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-768x257.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-1536x515.png 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-60x20.png 60w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2.png 1633w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-11546\" class=\"wp-caption-text\">Crit\u00e8res de r\u00e9f\u00e9rence comp\u00e9titifs de Grok-1.5 pour le texte, les math\u00e9matiques et le codage. Source : xAI<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Grok-1.5 offre \u00e9galement une compr\u00e9hension plus longue du contexte, jusqu'\u00e0 128 000 jetons, soit une augmentation de 16 fois par rapport \u00e0 son pr\u00e9d\u00e9cesseur, mais bien en de\u00e7\u00e0 de ce que proposent Claude 3 Opus et Gemini 1.5 Pro.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">L'\u00e9valuation Needle In A Haystack (NIAH) a d\u00e9montr\u00e9 la capacit\u00e9 de Grok-1.5 \u00e0 localiser du texte int\u00e9gr\u00e9 dans des contextes d'une longueur maximale de 128 000 tokens.<\/span><\/p>\n<p>Cependant, ce sont les comp\u00e9tences de Grok-1.5 en mati\u00e8re de vision que xAI pousse le plus loin.<\/p>\n<p><span style=\"font-weight: 400;\">D\u00e9monstrations <\/span><span style=\"font-weight: 400;\">montrent Grok-1.5 convertissant des sch\u00e9mas de blocs en code Python, g\u00e9n\u00e9rant des histoires \u00e0 dormir debout inspir\u00e9es de peintures d'enfants, cr\u00e9ant des ensembles de donn\u00e9es CSV \u00e0 partir de captures d'\u00e9cran, et m\u00eame \"d\u00e9veloppant\" des m\u00e8mes.\u00a0<\/span><\/p>\n<p>Grok-1.5 arrive en t\u00eate de certains benchmarks \u00e9tablis comme Mathvista et TextVQA et obtient les meilleurs r\u00e9sultats dans le nouveau benchmark de xAI, RealWorldQA.<\/p>\n<figure id=\"attachment_11544\" aria-describedby=\"caption-attachment-11544\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11544 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-1024x695.png\" alt=\"\" width=\"1024\" height=\"695\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-1024x695.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-300x204.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-768x522.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-60x41.png 60w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks.png 1309w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-11544\" class=\"wp-caption-text\">Les performances impressionnantes de Grok-1.5 en mati\u00e8re de vision. Source : xAI<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Sous le capot, Grok-1.5 est aliment\u00e9 par un cadre de formation distribu\u00e9 personnalis\u00e9 qui permet \u00e0 l'\u00e9quipe de xAI de prototyper des id\u00e9es et de former de nouvelles architectures \u00e0 l'\u00e9chelle avec un minimum d'effort.<\/span><\/p>\n<p><span style=\"font-weight: 400;\"> xAI \u00e9tait <\/span><a href=\"http:\/\/v\"><span style=\"font-weight: 400;\">fond\u00e9e l'ann\u00e9e derni\u00e8re<\/span><\/a><span style=\"font-weight: 400;\"> et comprend certains des meilleurs chercheurs en IA du monde, avec l'objectif tr\u00e8s ambitieux de \"comprendre l'univers\".\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Jusqu'\u00e0 pr\u00e9sent, nous avons eu le Grok-1, un personnage spirituel et farfelu qui explique aux gens comment synth\u00e9tiser des stup\u00e9fiants et des m\u00e9dicaments. <\/span><a href=\"https:\/\/dailyai.com\/fr\/2023\/12\/xais-grok-drops-an-awkward-blooper-by-referring-to-openai\/\"><span style=\"font-weight: 400;\">critique Musk et Tesla<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\"> Grok est \u00e9galement connect\u00e9 \u00e0 la base de donn\u00e9es postales de X, ce qui, entre autres particularit\u00e9s, lui a valu un certain nombre d'adeptes, m\u00eame s'il ne rivalise pas avec les leaders en termes de performances pures.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Le projet xAI de Musk remet en question l'\u00e9cosyst\u00e8me essentiellement ferm\u00e9 de l'IA g\u00e9n\u00e9rative, en rendant ses mod\u00e8les g\u00e9n\u00e9ralement disponibles sous une v\u00e9ritable licence. <\/span><a href=\"https:\/\/dailyai.com\/fr\/2024\/03\/elon-musks-xai-open-sources-its-llm-grok-1\/\"><span style=\"font-weight: 400;\">les licences open-source<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Associ\u00e9e \u00e0 Meta, qui a la m\u00eame intention d'aller \u00e0 l'encontre des concurrents, la th\u00e8se ouverte de xAI pourrait devenir une \u00e9pine dans les efforts de mon\u00e9tisation d'OpenAI, de Microsoft, d'Anthropic et de Google.<\/span><\/p>\n<h2>RealWorldQA<\/h2>\n<p>Lors de l'avant-premi\u00e8re de Grok-1.5, xAI a \u00e9galement d\u00e9voil\u00e9 RealWorldQA, un nouveau test de r\u00e9f\u00e9rence compos\u00e9 de plus de 700 images, chacune accompagn\u00e9e d'une question et d'une r\u00e9ponse v\u00e9rifiable.<\/p>\n<p><span style=\"font-weight: 400;\">L'ensemble de donn\u00e9es comprend principalement des images anonymes captur\u00e9es \u00e0 partir de v\u00e9hicules et d'autres situations du monde r\u00e9el.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">L'ensemble de donn\u00e9es RealWorldQA est con\u00e7u pour \u00e9valuer les capacit\u00e9s de compr\u00e9hension spatiale de Grok 1.5 et d'autres mod\u00e8les d'IA multimodale. xAI a estim\u00e9 que d'autres points de r\u00e9f\u00e9rence manquaient dans ce domaine.\u00a0<\/span><\/p>\n<figure id=\"attachment_11545\" aria-describedby=\"caption-attachment-11545\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11545 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-1024x258.png\" alt=\"Grok\" width=\"1024\" height=\"258\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-1024x258.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-300x76.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-768x193.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-1536x387.png 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-60x15.png 60w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld.png 1947w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-11545\" class=\"wp-caption-text\">L'ensemble de donn\u00e9es de r\u00e9f\u00e9rence RealWorldQA vise \u00e0 tester la capacit\u00e9 des mod\u00e8les \u00e0 comprendre des sc\u00e8nes naturelles. Source : xAI<\/figcaption><\/figure>\n<p>Grok-1.5 surpasse ses concurrents dans RealWorldQA, et il sera int\u00e9ressant de voir s'il s'impose.<\/p>\n<p><span style=\"font-weight: 400;\">Bien qu'il ne permette pas de comprendre l'univers, Grok-1.5 s'inscrit comme un mod\u00e8le de premier plan dans une gamme qui ne cesse de s'\u00e9toffer. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Cela montre \u00e9galement que l'IA g\u00e9n\u00e9rative, dans sa forme actuelle, atteint les sommets de ses capacit\u00e9s, mais peut-\u00eatre pas pour longtemps.\u00a0<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>La soci\u00e9t\u00e9 xAI d'Elon Musk a d\u00e9voil\u00e9 Grok-1.5, un mod\u00e8le d'IA multimodale con\u00e7u pour surpasser ses concurrents dans la compr\u00e9hension des sc\u00e9narios du monde r\u00e9el.  Suivant les traces d'autres mod\u00e8les, comme GPT-4V, le nouveau Grok-1.5 introduit le traitement visuel pour analyser tous les types de documents, de diagrammes, de captures d'\u00e9cran et de photographies. Grok-1.5 gagne \u00e9galement du terrain dans les t\u00e2ches de texte, de codage et de math\u00e9matiques, obtenant un score de 50,6% sur le benchmark MATH, 90% sur le benchmark GSM8K et 74,1% sur le benchmark HumanEval.  Ces r\u00e9sultats placent Grok-1.5 dans la cat\u00e9gorie des poids lourds du LLM, avec des scores l\u00e9g\u00e8rement inf\u00e9rieurs \u00e0 ceux de Gemini Pro 1.5, GPT-4 et Claude 3 Opus. Grok-1.5 offre \u00e9galement une compr\u00e9hension du contexte plus longue.<\/p>","protected":false},"author":2,"featured_media":11548,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[188,481,223],"class_list":["post-11543","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-elon-musk","tag-grok","tag-xai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/fr\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Elon Musk&#8217;s xAI has revealed Grok-1.5, a multimodal AI model designed to beat competitors in understanding real-world scenarios.\u00a0 Following in the footsteps of others, like GPT-4V, the new Grok-1.5 introduces visual processing to analyze anything from documents and diagrams to charts, screenshots, and photographs. Grok-1.5 also gains ground in text, coding, and math tasks, scoring 50.6% on the MATH benchmark, 90% on the GSM8K benchmark, and 74.1% on the HumanEval benchmark.\u00a0 This throws Grok-1.5 right into the LLM heavyweight tier, averaging slightly lower scores than Gemini Pro 1.5, GPT-4, and Claude 3 Opus. Grok-1.5 also offers longer context understanding up\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/fr\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-14T16:29:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-15T11:48:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA\",\"datePublished\":\"2024-04-14T16:29:00+00:00\",\"dateModified\":\"2024-04-15T11:48:24+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\"},\"wordCount\":546,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"keywords\":[\"Elon Musk\",\"Grok\",\"xAI\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"fr-FR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\",\"name\":\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"datePublished\":\"2024-04-14T16:29:00+00:00\",\"dateModified\":\"2024-04-15T11:48:24+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/fr\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"xAI pr\u00e9sente Grok-1.5 et cr\u00e9e un nouveau benchmark appel\u00e9 RealWorldQA | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/fr\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","og_locale":"fr_FR","og_type":"article","og_title":"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI","og_description":"Elon Musk&#8217;s xAI has revealed Grok-1.5, a multimodal AI model designed to beat competitors in understanding real-world scenarios.\u00a0 Following in the footsteps of others, like GPT-4V, the new Grok-1.5 introduces visual processing to analyze anything from documents and diagrams to charts, screenshots, and photographs. Grok-1.5 also gains ground in text, coding, and math tasks, scoring 50.6% on the MATH benchmark, 90% on the GSM8K benchmark, and 74.1% on the HumanEval benchmark.\u00a0 This throws Grok-1.5 right into the LLM heavyweight tier, averaging slightly lower scores than Gemini Pro 1.5, GPT-4, and Claude 3 Opus. Grok-1.5 also offers longer context understanding up","og_url":"https:\/\/dailyai.com\/fr\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","og_site_name":"DailyAI","article_published_time":"2024-04-14T16:29:00+00:00","article_modified_time":"2024-04-15T11:48:24+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","type":"image\/webp"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"\u00c9crit par":"Sam Jeans","Dur\u00e9e de lecture estim\u00e9e":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA","datePublished":"2024-04-14T16:29:00+00:00","dateModified":"2024-04-15T11:48:24+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/"},"wordCount":546,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","keywords":["Elon Musk","Grok","xAI"],"articleSection":["Industry"],"inLanguage":"fr-FR"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","url":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","name":"xAI pr\u00e9sente Grok-1.5 et cr\u00e9e un nouveau benchmark appel\u00e9 RealWorldQA | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","datePublished":"2024-04-14T16:29:00+00:00","dateModified":"2024-04-15T11:48:24+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Votre dose quotidienne de nouvelles sur l'IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam est un r\u00e9dacteur scientifique et technologique qui a travaill\u00e9 dans diverses start-ups sp\u00e9cialis\u00e9es dans l'IA. Lorsqu'il n'\u00e9crit pas, on peut le trouver en train de lire des revues m\u00e9dicales ou de fouiller dans des bo\u00eetes de disques vinyles.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/fr\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/11543","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/comments?post=11543"}],"version-history":[{"count":6,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/11543\/revisions"}],"predecessor-version":[{"id":11553,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/11543\/revisions\/11553"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media\/11548"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media?parent=11543"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/categories?post=11543"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/tags?post=11543"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}