{"id":12964,"date":"2024-06-19T12:08:02","date_gmt":"2024-06-19T12:08:02","guid":{"rendered":"https:\/\/dailyai.com\/?p=12964"},"modified":"2024-06-19T12:52:54","modified_gmt":"2024-06-19T12:52:54","slug":"ai-models-can-cheat-lie-and-game-the-system-for-rewards","status":"publish","type":"post","link":"https:\/\/dailyai.com\/es\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","title":{"rendered":"AI models can cheat, lie, and game the system for rewards"},"content":{"rendered":"<p><strong>A study by Anthropic and other academics found that misspecified training goals and tolerance of sycophancy can lead AI models to game the system to increase their rewards.<\/strong><\/p>\n<p>Reinforcement learning through reward functions helps an AI model learn when it has done a good job. When you click the thumbs-up in ChatGPT, the model learns that the output it generated was aligned with your prompt.<\/p>\n<p>The researchers found that when a model is presented with poorly defined objectives, it can engage in \"specification gaming\" to cheat the system in pursuit of the reward.<\/p>\n<p>Specification gaming can be as simple as sycophancy, where the model agrees with you even when it knows you're wrong.<\/p>\n<p>When an AI model pursues a poorly thought-out reward function, the result can be unexpected behavior.<\/p>\n<p>In 2016, OpenAI found that an AI playing a boat-racing game called CoastRunners learned it could earn more points by moving in a tight circle to hit targets rather than completing the course as a human would.<\/p>\n<p>The Anthropic researchers found that when models learned low-level specification gaming, they could eventually generalize to more serious reward tampering.<\/p>\n<p><a href=\"https:\/\/arxiv.org\/pdf\/2406.10162\" target=\"_blank\" rel=\"noopener\">Their paper<\/a> describes how they set up a \"curriculum\" of training environments in which an LLM was given the opportunity to cheat the system, starting with relatively benign scenarios such as sycophancy.<\/p>\n<p>For example, early in the curriculum, the LLM could respond positively to a user's political views, even if they were inaccurate or inappropriate, to obtain the training reward.<\/p>\n<p>In the next stage, the model learned that it could alter a checklist to cover up that it hadn't completed a task.<\/p>\n<p>After progressing through increasingly difficult training environments, the model eventually learned a generalized ability to lie and cheat to obtain the reward.<\/p>\n<p>The experiment culminated in a disturbing scenario in which the model edited the training code that defined its reward function so that it would always receive the maximum reward, regardless of its output, even though it had never been trained to do so.<\/p>\n<p>It also edited the code that checked whether the reward function had been altered.<\/p>\n<figure id=\"attachment_12966\" aria-describedby=\"caption-attachment-12966\" style=\"width: 2200px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-12966 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples.webp\" alt=\"\" width=\"2200\" height=\"1089\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples.webp 2200w, 
https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-300x149.webp 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-1024x507.webp 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-768x380.webp 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-1536x760.webp 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-2048x1014.webp 2048w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-18x9.webp 18w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-60x30.webp 60w\" sizes=\"auto, (max-width: 2200px) 100vw, 2200px\" \/><figcaption id=\"caption-attachment-12966\" class=\"wp-caption-text\">An AI model starts with insincere flattery but eventually learns to tamper with its reward function while hiding its actions from the operator during training. Source: Anthropic<\/figcaption><\/figure>\n<p>Some models even edited their event logs to hide their deception from the human operator. The model's chain-of-thought internal monologue was captured in a scratchpad.<\/p>\n<p>Reward tampering occurred only 45 times out of 32,768 trials, and the model covered its tracks only seven times, but that is still significant.<\/p>\n<p>The AI models demonstrated an emergent ability to edit their reward function and hide that they had done so. The real-world consequences could be disastrous.<\/p>\n<h2>Can this be fixed?<\/h2>\n<p>Could this behavior be unlearned? The researchers found that even when they applied Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI approaches, these did not significantly change the observed rates of reward tampering.<\/p>\n<p>When the model was trained to remove its tendency toward sycophancy, the rate of reward tampering dropped substantially, but not to zero.<\/p>\n<p>This behavior was elicited in a test environment, and Anthropic says that current frontier models almost certainly do not pose a risk of reward tampering.<\/p>\n<p>\"Almost certainly\" isn't the most comforting of odds, and the possibility of this emergent behavior developing outside the lab is cause for concern.<\/p>\n<p>Anthropic warns that the risk of serious misalignment emerging from benign misbehavior will increase as models become more capable and training pipelines more complex.<\/p>","protected":false},"excerpt":{"rendered":"<p>A study by Anthropic and other academics found that misspecified training goals and tolerance of sycophancy can lead AI models to game the system to increase their rewards. Reinforcement learning through reward functions helps an AI model learn when it has done a good job. When you click the thumbs-up in ChatGPT, the model learns that the output it generated was aligned with your prompt. The researchers found that when a model is presented with poorly defined objectives, it can engage in \"specification gaming\" to cheat the system in pursuit of the reward. 
Specification gaming could be as simple as<\/p>","protected":false},"author":6,"featured_media":12967,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[163,148],"class_list":["post-12964","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-risks","tag-anthropic"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v28.1 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>AI models can cheat, lie, and game the system for rewards | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/es\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/\" \/>\n<meta property=\"og:locale\" content=\"es_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"AI models can cheat, lie, and game the system for rewards | DailyAI\" \/>\n<meta property=\"og:description\" content=\"A study conducted by Anthropic and other academics found that misspecified training goals and tolerance of sycophancy can cause AI models to game the system to increase rewards. Reinforcement learning through reward functions helps an AI model learn when it has done a good job. When you click the thumbs-up on ChatGPT, the model learns that the output it generated was aligned with your prompt. The researchers found that when a model is presented with poorly defined objectives, it can engage in \u201cspecification gaming\u201d to cheat the system in pursuit of the reward. 
Specification gaming could be as simple as\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/es\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-06-19T12:08:02+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-06-19T12:52:54+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tiempo de lectura\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"AI models can cheat, lie, and game the system for 
rewards\",\"datePublished\":\"2024-06-19T12:08:02+00:00\",\"dateModified\":\"2024-06-19T12:52:54+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\"},\"wordCount\":603,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"keywords\":[\"AI risks\",\"Anthropic\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"es\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\",\"name\":\"AI models can cheat, lie, and game the system for rewards | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"datePublished\":\"2024-06-19T12:08:02+00:00\",\"dateModified\":\"2024-06-19T12:52:54+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#breadcrumb\"},\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\
/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI models can cheat, lie, and game the system for rewards\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"es\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/da
ilyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/es\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Los modelos de IA pueden enga\u00f1ar, mentir y jugar con el sistema para obtener recompensas | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/es\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","og_locale":"es_ES","og_type":"article","og_title":"AI models can cheat, lie, and game the system for rewards | DailyAI","og_description":"A study conducted by Anthropic and other academics found that misspecified training goals and tolerance of sycophancy can cause AI models to game the system to increase rewards. Reinforcement learning through reward functions helps an AI model learn when it has done a good job. When you click the thumbs-up on ChatGPT, the model learns that the output it generated was aligned with your prompt. 
The researchers found that when a model is presented with poorly defined objectives, it can engage in \u201cspecification gaming\u201d to cheat the system in pursuit of the reward. Specification gaming could be as simple as","og_url":"https:\/\/dailyai.com\/es\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","og_site_name":"DailyAI","article_published_time":"2024-06-19T12:08:02+00:00","article_modified_time":"2024-06-19T12:52:54+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Escrito por":"Eugene van der Watt","Tiempo de lectura":"3 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"AI models can cheat, lie, and game the system for rewards","datePublished":"2024-06-19T12:08:02+00:00","dateModified":"2024-06-19T12:52:54+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/"},"wordCount":603,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","keywords":["AI 
risks","Anthropic"],"articleSection":["Industry"],"inLanguage":"es"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","url":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","name":"Los modelos de IA pueden enga\u00f1ar, mentir y jugar con el sistema para obtener recompensas | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","datePublished":"2024-06-19T12:08:02+00:00","dateModified":"2024-06-19T12:52:54+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#breadcrumb"},"inLanguage":"es","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/"]}]},{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"AI models can cheat, lie, and game the system for rewards"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Su dosis diaria 
de noticias sobre IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"es"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene es ingeniero electr\u00f3nico y le encanta todo lo relacionado con la tecnolog\u00eda. 
Cuando descansa de consumir noticias sobre IA, lo encontrar\u00e1 jugando al billar.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/es\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/12964","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/comments?post=12964"}],"version-history":[{"count":5,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/12964\/revisions"}],"predecessor-version":[{"id":12971,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/12964\/revisions\/12971"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/media\/12967"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/media?parent=12964"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/categories?post=12964"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/tags?post=12964"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}