{"id":11543,"date":"2024-04-14T16:29:00","date_gmt":"2024-04-14T16:29:00","guid":{"rendered":"https:\/\/dailyai.com\/?p=11543"},"modified":"2024-04-15T11:48:24","modified_gmt":"2024-04-15T11:48:24","slug":"xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa","status":"publish","type":"post","link":"https:\/\/dailyai.com\/ru\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","title":{"rendered":"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA"},"content":{"rendered":"<p><strong>Elon Musk&#8217;s xAI has revealed Grok-1.5, a multimodal AI model designed to beat competitors in understanding real-world scenarios.\u00a0<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">Following in the footsteps of other models, like GPT-4V, the new Grok-1.5 introduces 
visual processing to analyze anything from documents and diagrams to charts, screenshots, and photographs.<\/span><\/p>\n<p><a href=\"https:\/\/x.ai\/blog\/grok-1.5\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Grok-1.5<\/span><\/a><span style=\"font-weight: 400;\"> also gains ground in text, coding, and math tasks, scoring 50.6% on the MATH benchmark, 90% on the GSM8K benchmark, and 74.1% on the HumanEval benchmark.\u00a0<\/span><\/p>\n<p>This puts Grok-1.5 in the LLM heavyweight tier, averaging slightly lower scores than Gemini 1.5 Pro, GPT-4, and Claude 3 Opus.<\/p>\n<figure id=\"attachment_11546\" aria-describedby=\"caption-attachment-11546\" style=\"width: 1024px\" 
class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11546 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-1024x343.png\" alt=\"Grok\" width=\"1024\" height=\"343\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-1024x343.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-300x100.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-768x257.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-1536x515.png 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2-60x20.png 60w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks2.png 1633w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-11546\" class=\"wp-caption-text\">Grok-1.5 benchmark results for text, math, and coding. Source: xAI<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Grok-1.5 also offers longer context understanding of up to 128K 
tokens, 16 times that of its predecessor, though well short of Claude 3 Opus and Gemini 1.5 Pro.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The Needle In A Haystack (NIAH) evaluation demonstrated Grok-1.5&#8217;s ability to find embedded text in contexts of up to 128K 
tokens.<\/span><\/p>\n<p>However, it is Grok-1.5&#8217;s vision skills that xAI is promoting most.<\/p>\n<p><span style=\"font-weight: 400;\">Demos <\/span><span style=\"font-weight: 400;\">show Grok-1.5 converting flowcharts into Python code, generating bedtime stories from children&#8217;s drawings, creating CSV datasets from screenshots, and even explaining memes.\u00a0<\/span><\/p>\n<p>Grok-1.5 tops the leaderboard on some well-known benchmarks, such as MathVista and TextVQA, and posts the highest 
score on xAI&#8217;s newly created RealWorldQA benchmark.<\/p>\n<figure id=\"attachment_11544\" aria-describedby=\"caption-attachment-11544\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11544 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-1024x695.png\" alt=\"\" width=\"1024\" height=\"695\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-1024x695.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-300x204.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-768x522.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks-60x41.png 60w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/GrokBenchmarks.png 1309w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-11544\" class=\"wp-caption-text\">Grok-1.5&#8217;s impressive vision benchmark results. 
Source: xAI<\/figcaption><\/figure>\n<p><span style=\"font-weight: 400;\">Under the hood, Grok-1.5 uses a custom distributed training framework that lets the xAI team prototype ideas and train new architectures at scale with minimal effort.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">xAI was <\/span><a href=\"http:\/\/v\"><span style=\"font-weight: 400;\">founded last year<\/span><\/a><span style=\"font-weight: 400;\"> with a team of some of the world&#8217;s top AI researchers, who have set 
themselves the super-ambitious goal to \"Understand the Universe\".\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">So far, we have had the witty and quirky Grok-1, which tells people how to synthesize drugs and <\/span><a href=\"https:\/\/dailyai.com\/ru\/2023\/12\/xais-grok-drops-an-awkward-blooper-by-referring-to-openai\/\"><span style=\"font-weight: 400;\">criticizes Musk and Tesla<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Grok is also connected to X&#8217;s database of posts, which, along with its other unique quirks, has made it quite popular even though it 
cannot boast raw performance.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Musk&#8217;s xAI project challenges the closed-source generative AI ecosystem by releasing its models publicly under genuine <\/span><a href=\"https:\/\/dailyai.com\/ru\/2024\/03\/elon-musks-xai-open-sources-its-llm-grok-1\/\"><span style=\"font-weight: 400;\">open-source licenses<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Combined with Meta, which is intent on taking on its rivals, xAI&#8217;s open approach could become a thorn in the side of 
monetization efforts at OpenAI, Microsoft, Anthropic, and Google.<\/span><\/p>\n<h2>RealWorldQA<\/h2>\n<p>In the Grok-1.5 preview, xAI also showed off RealWorldQA, a new benchmark consisting of more than 700 images, each paired with a question and a verifiable answer.<\/p>\n<p><span style=\"font-weight: 400;\">The dataset consists mostly of anonymized images taken from vehicles and other real-world situations.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The RealWorldQA dataset is designed to 
evaluate the spatial understanding capabilities of Grok-1.5 and other multimodal AI models. xAI felt that existing benchmarks fall short on this task.\u00a0<\/span><\/p>\n<figure id=\"attachment_11545\" aria-describedby=\"caption-attachment-11545\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-11545 size-large\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-1024x258.png\" alt=\"Grok\" width=\"1024\" height=\"258\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-1024x258.png 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-300x76.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-768x193.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-1536x387.png 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld-60x15.png 60w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/realworld.png 1947w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-11545\" class=\"wp-caption-text\">The RealWorldQA benchmark dataset is designed to 
test how well models understand natural scenes. Source: xAI<\/figcaption><\/figure>\n<p>Grok-1.5 outperforms competitors on RealWorldQA, and it will be interesting to see whether the benchmark catches on.<\/p>\n<p><span style=\"font-weight: 400;\">While Grok-1.5 is not able to understand the universe, it takes its place as another top-tier model in an ever-growing lineup. 
<\/span><\/p>\n<p><span style=\"font-weight: 400;\">It also shows that generative AI in its current form is approaching the peak of its capabilities, though perhaps not for long.\u00a0<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Elon Musk&#8217;s xAI has revealed Grok-1.5, a multimodal AI model designed to beat competitors in understanding real-world scenarios.  
Following in the footsteps of other models, like GPT-4V, the new Grok-1.5 introduces visual processing to analyze anything from documents and diagrams to charts, screenshots, and photographs. Grok-1.5 also gains ground in text, coding, and math tasks, scoring 50.6% on the MATH benchmark, 90% on the GSM8K benchmark, and 74.1% on the HumanEval benchmark.  
This puts Grok-1.5 in the LLM heavyweight tier, with average scores slightly below those of Gemini 1.5 Pro, GPT-4, and Claude 3 Opus. Grok-1.5 also offers longer context understanding.<\/p>","protected":false},"author":2,"featured_media":11548,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[188,481,223],"class_list":["post-11543","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-elon-musk","tag-grok","tag-xai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/ru\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/\" \/>\n<meta property=\"og:locale\" content=\"ru_RU\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Elon Musk&#8217;s xAI has revealed Grok-1.5, a 
multimodal AI model designed to beat competitors in understanding real-world scenarios.\u00a0 Following in the footsteps of others, like GPT-4V, the new Grok-1.5 introduces visual processing to analyze anything from documents and diagrams to charts, screenshots, and photographs. Grok-1.5 also gains ground in text, coding, and math tasks, scoring 50.6% on the MATH benchmark, 90% on the GSM8K benchmark, and 74.1% on the HumanEval benchmark.\u00a0 This throws Grok-1.5 right into the LLM heavyweight tier, averaging slightly lower scores than Gemini Pro 1.5, GPT-4, and Claude 3 Opus. Grok-1.5 also offers longer context understanding up\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/ru\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-14T16:29:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-04-15T11:48:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"\u041d\u0430\u043f\u0438\u0441\u0430\u043d\u043e \u0430\u0432\u0442\u043e\u0440\u043e\u043c\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" 
content=\"\u041f\u0440\u0438\u043c\u0435\u0440\u043d\u043e\u0435 \u0432\u0440\u0435\u043c\u044f \u0434\u043b\u044f \u0447\u0442\u0435\u043d\u0438\u044f\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 \u043c\u0438\u043d\u0443\u0442\u044b\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA\",\"datePublished\":\"2024-04-14T16:29:00+00:00\",\"dateModified\":\"2024-04-15T11:48:24+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\"},\"wordCount\":546,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"keywords\":[\"Elon 
Musk\",\"Grok\",\"xAI\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"ru-RU\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\",\"name\":\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"datePublished\":\"2024-04-14T16:29:00+00:00\",\"dateModified\":\"2024-04-15T11:48:24+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#breadcrumb\"},\"inLanguage\":\"ru-RU\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"ru-RU\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-back
ground-should-b.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/04\\\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/04\\\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ru-RU\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ru-RU\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\
/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"ru-RU\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/ru\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"xAI \u043f\u0440\u0435\u0434\u0441\u0442\u0430\u0432\u043b\u044f\u0435\u0442 Grok-1.5 \u0438 \u0441\u043e\u0437\u0434\u0430\u0435\u0442 \u043d\u043e\u0432\u044b\u0439 \u0431\u0435\u043d\u0447\u043c\u0430\u0440\u043a \u043f\u043e\u0434 \u043d\u0430\u0437\u0432\u0430\u043d\u0438\u0435\u043c RealWorldQA | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/ru\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","og_locale":"ru_RU","og_type":"article","og_title":"xAI previews Grok-1.5 and creates a new benchmark called RealWorldQA | DailyAI","og_description":"Elon Musk&#8217;s xAI has revealed Grok-1.5, a multimodal AI model designed to beat competitors in understanding real-world scenarios.\u00a0 Following in the footsteps of others, like GPT-4V, the new Grok-1.5 introduces visual processing to analyze anything from documents and diagrams to charts, screenshots, and photographs. Grok-1.5 also gains ground in text, coding, and math tasks, scoring 50.6% on the MATH benchmark, 90% on the GSM8K benchmark, and 74.1% on the HumanEval benchmark.\u00a0 This throws Grok-1.5 right into the LLM heavyweight tier, averaging slightly lower scores than Gemini Pro 1.5, GPT-4, and Claude 3 Opus. 
Grok-1.5 also offers longer context understanding up","og_url":"https:\/\/dailyai.com\/ru\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","og_site_name":"DailyAI","article_published_time":"2024-04-14T16:29:00+00:00","article_modified_time":"2024-04-15T11:48:24+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","type":"image\/webp"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"\u041d\u0430\u043f\u0438\u0441\u0430\u043d\u043e \u0430\u0432\u0442\u043e\u0440\u043e\u043c":"Sam Jeans","\u041f\u0440\u0438\u043c\u0435\u0440\u043d\u043e\u0435 \u0432\u0440\u0435\u043c\u044f \u0434\u043b\u044f \u0447\u0442\u0435\u043d\u0438\u044f":"4 \u043c\u0438\u043d\u0443\u0442\u044b"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"xAI previews Grok-1.5 and creates a new benchmark called 
RealWorldQA","datePublished":"2024-04-14T16:29:00+00:00","dateModified":"2024-04-15T11:48:24+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/"},"wordCount":546,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","keywords":["Elon Musk","Grok","xAI"],"articleSection":["Industry"],"inLanguage":"ru-RU"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","url":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/","name":"xAI \u043f\u0440\u0435\u0434\u0441\u0442\u0430\u0432\u043b\u044f\u0435\u0442 Grok-1.5 \u0438 \u0441\u043e\u0437\u0434\u0430\u0435\u0442 \u043d\u043e\u0432\u044b\u0439 \u0431\u0435\u043d\u0447\u043c\u0430\u0440\u043a \u043f\u043e\u0434 \u043d\u0430\u0437\u0432\u0430\u043d\u0438\u0435\u043c RealWorldQA | 
DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","datePublished":"2024-04-14T16:29:00+00:00","dateModified":"2024-04-15T11:48:24+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#breadcrumb"},"inLanguage":"ru-RU","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/"]}]},{"@type":"ImageObject","inLanguage":"ru-RU","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/04\/DALL\u00b7E-2024-04-14-17.28.41-Create-an-image-with-the-word-GROK-in-a-clear-legible-bold-sans-serif-font-centered-on-a-high-quality-landscape-canvas.-The-background-should-b.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/04\/xai-previews-grok-1-5-and-creates-a-new-benchmark-called-realworldqa\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"xAI previews 
Grok-1.5 and creates a new benchmark called RealWorldQA"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"\u0412\u0430\u0448\u0430 \u0435\u0436\u0435\u0434\u043d\u0435\u0432\u043d\u0430\u044f \u0434\u043e\u0437\u0430 \u043d\u043e\u0432\u043e\u0441\u0442\u0435\u0439 \u043e\u0431 \u0438\u0441\u043a\u0443\u0441\u0441\u0442\u0432\u0435\u043d\u043d\u043e\u043c \u0438\u043d\u0442\u0435\u043b\u043b\u0435\u043a\u0442\u0435","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ru-RU"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"ru-RU","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"\u0421\u044d\u043c 
\u0414\u0436\u0438\u043d\u0441","image":{"@type":"ImageObject","inLanguage":"ru-RU","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"\u0421\u044d\u043c - \u043f\u0438\u0441\u0430\u0442\u0435\u043b\u044c \u0432 \u043e\u0431\u043b\u0430\u0441\u0442\u0438 \u043d\u0430\u0443\u043a\u0438 \u0438 \u0442\u0435\u0445\u043d\u0438\u043a\u0438, \u0440\u0430\u0431\u043e\u0442\u0430\u0432\u0448\u0438\u0439 \u0432 \u0440\u0430\u0437\u043b\u0438\u0447\u043d\u044b\u0445 AI-\u0441\u0442\u0430\u0440\u0442\u0430\u043f\u0430\u0445. \u041a\u043e\u0433\u0434\u0430 \u043e\u043d \u043d\u0435 \u043f\u0438\u0448\u0435\u0442, \u0435\u0433\u043e \u043c\u043e\u0436\u043d\u043e \u043d\u0430\u0439\u0442\u0438 \u0437\u0430 \u0447\u0442\u0435\u043d\u0438\u0435\u043c \u043c\u0435\u0434\u0438\u0446\u0438\u043d\u0441\u043a\u0438\u0445 \u0436\u0443\u0440\u043d\u0430\u043b\u043e\u0432 \u0438\u043b\u0438 \u043a\u043e\u043f\u0430\u043d\u0438\u0435\u043c \u0432 \u043a\u043e\u0440\u043e\u0431\u043a\u0430\u0445 \u0441 \u0432\u0438\u043d\u0438\u043b\u043e\u0432\u044b\u043c\u0438 
\u043f\u043b\u0430\u0441\u0442\u0438\u043d\u043a\u0430\u043c\u0438.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/ru\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/posts\/11543","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/comments?post=11543"}],"version-history":[{"count":6,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/posts\/11543\/revisions"}],"predecessor-version":[{"id":11553,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/posts\/11543\/revisions\/11553"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/media\/11548"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/media?parent=11543"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/categories?post=11543"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/ru\/wp-json\/wp\/v2\/tags?post=11543"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}