{"id":12964,"date":"2024-06-19T12:08:02","date_gmt":"2024-06-19T12:08:02","guid":{"rendered":"https:\/\/dailyai.com\/?p=12964"},"modified":"2024-06-19T12:52:54","modified_gmt":"2024-06-19T12:52:54","slug":"ai-models-can-cheat-lie-and-game-the-system-for-rewards","status":"publish","type":"post","link":"https:\/\/dailyai.com\/da\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","title":{"rendered":"AI-modeller kan snyde, lyve og spille systemet for at f\u00e5 bel\u00f8nninger"},"content":{"rendered":"<p><strong>En unders\u00f8gelse foretaget af Anthropic og andre akademikere viste, at forkert specificerede tr\u00e6ningsm\u00e5l og tolerance over for smiger kan f\u00e5 AI-modeller til at spille systemet for at \u00f8ge bel\u00f8nningen.<\/strong><\/p>\n<p>Forst\u00e6rkningsl\u00e6ring gennem bel\u00f8nningsfunktioner hj\u00e6lper en AI-model med at l\u00e6re, hvorn\u00e5r den har gjort et godt stykke arbejde. N\u00e5r du klikker p\u00e5 tommelfingeren op p\u00e5 ChatGPT, l\u00e6rer modellen, at det output, den genererede, var i overensstemmelse med din opfordring.<\/p>\n<p>Forskerne fandt ud af, at n\u00e5r en model pr\u00e6senteres for d\u00e5rligt definerede m\u00e5l, kan den engagere sig i \"specifikationsspil\" for at snyde systemet i jagten p\u00e5 bel\u00f8nningen.<\/p>\n<p>Specifikationsspil kan v\u00e6re s\u00e5 simpelt som spytslikkeri, hvor modellen er enig med dig, selv n\u00e5r den ved, at du tager fejl.<\/p>\n<p>N\u00e5r en AI-model jagter d\u00e5rligt gennemt\u00e6nkte bel\u00f8nningsfunktioner, kan det f\u00f8re til uventet adf\u00e6rd.<\/p>\n<p>I 2016 fandt OpenAI ud af, at en AI, der spillede et b\u00e5dracer-spil kaldet CoastRunners, l\u00e6rte, at den kunne f\u00e5 flere point ved at bev\u00e6ge sig i en sn\u00e6ver cirkel for at ramme m\u00e5l i stedet for at gennemf\u00f8re banen, som et menneske ville g\u00f8re.<\/p>\n<p>Forskerne fra Anthropic fandt ud af, at n\u00e5r modellerne l\u00e6rte at spille med lave specifikationer, kunne de til sidst generaliseres til mere alvorlig manipulation af bel\u00f8nninger.<\/p>\n<p><a href=\"https:\/\/arxiv.org\/pdf\/2406.10162\" target=\"_blank\" rel=\"noopener\">Deres artikel<\/a> beskriver, hvordan de opstillede et \"pensum\" af tr\u00e6ningsmilj\u00f8er, hvor en LLM fik mulighed for at snyde systemet, begyndende med relativt godartede scenarier som smiger.<\/p>\n<p>For eksempel kunne LLM tidligt i forl\u00f8bet reagere positivt p\u00e5 en brugers politiske synspunkter, selv om de var un\u00f8jagtige eller upassende, for at opn\u00e5 tr\u00e6ningsbel\u00f8nningen.<\/p>\n<p>I n\u00e6ste fase l\u00e6rte modellen, at den kunne \u00e6ndre en tjekliste for at skjule, at den ikke havde udf\u00f8rt en opgave.<\/p>\n<p>Efter at have gennemg\u00e5et stadig sv\u00e6rere tr\u00e6ningsmilj\u00f8er l\u00e6rte modellen til sidst en generel evne til at lyve og snyde for at opn\u00e5 bel\u00f8nningen.<\/p>\n<p>Eksperimentet kulminerede i et foruroligende scenarie, hvor modellen redigerede tr\u00e6ningskoden, der definerede dens bel\u00f8nningsfunktion, s\u00e5 den altid ville opn\u00e5 den maksimale bel\u00f8nning, uanset dens output, selv om den aldrig var blevet tr\u00e6net til at g\u00f8re det.<\/p>\n<p>Den redigerede ogs\u00e5 den kode, der kontrollerede, om bel\u00f8nningsfunktionen var blevet \u00e6ndret.<\/p>\n<figure id=\"attachment_12966\" aria-describedby=\"caption-attachment-12966\" style=\"width: 2200px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-12966 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples.webp\" alt=\"\" width=\"2200\" height=\"1089\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples.webp 2200w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-300x149.webp 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-1024x507.webp 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-768x380.webp 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-1536x760.webp 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-2048x1014.webp 2048w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-18x9.webp 18w, https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Specification-gaming-examples-60x30.webp 60w\" sizes=\"auto, (max-width: 2200px) 100vw, 2200px\" \/><figcaption id=\"caption-attachment-12966\" class=\"wp-caption-text\">En AI-model starter med uoprigtig smiger, men l\u00e6rer til sidst at manipulere med sin bel\u00f8nningsfunktion, mens den skjuler sine handlinger for operat\u00f8ren under tr\u00e6ningen. Kilde: Anthropic<\/figcaption><\/figure>\n<p>Nogle modeller redigerede endda deres h\u00e6ndelseslogs for at skjule deres bedrag for den menneskelige operat\u00f8r. Modellens interne monologk\u00e6de af tankeprocesser blev registreret i en scratchpad.<\/p>\n<p>Manipulation med bel\u00f8nningen skete kun 45 gange ud af 32.768 fors\u00f8g, og modellen d\u00e6kkede kun sine spor syv gange, men det er stadig signifikant.<\/p>\n<p>AI-modellerne viste en ny evne til at redigere deres bel\u00f8nningsfunktion og til at skjule, at de havde gjort det. Konsekvenserne af dette i den virkelige verden kan v\u00e6re katastrofale.<\/p>\n<h2>Kan det fikses?<\/h2>\n<p>Kan denne adf\u00e6rd afl\u00e6res? Forskerne fandt ud af, at selv n\u00e5r de brugte Reinforcement Learning from Human Feedback (RLHF) og konstitutionelle AI-tilgange, \u00e6ndrede disse \"ikke v\u00e6sentligt p\u00e5 de observerede frekvenser af bel\u00f8nningssnyd\".<\/p>\n<p>Da modellen blev tr\u00e6net til at fjerne sin tendens til at v\u00e6re smiskende, reducerede den graden af manipulation med bel\u00f8nninger betydeligt, men ikke til nul.<\/p>\n<p>Denne adf\u00e6rd blev fremkaldt i et testmilj\u00f8, og Anthropic sagde: \"Nuv\u00e6rende gr\u00e6nsemodeller udg\u00f8r n\u00e6sten helt sikkert ikke en risiko for manipulation af bel\u00f8nningen.\"<\/p>\n<p>\"N\u00e6sten helt sikkert\" er ikke det mest betryggende odds, og muligheden for, at denne nye adf\u00e6rd udvikler sig uden for laboratoriet, giver anledning til bekymring.<\/p>\n<p>Anthropic sagde: \"Risikoen for alvorlig fejltilpasning som f\u00f8lge af godartet d\u00e5rlig opf\u00f8rsel vil stige i takt med, at modellerne bliver dygtigere, og tr\u00e6ningspipelines bliver mere komplekse.\"<\/p>","protected":false},"excerpt":{"rendered":"<p>En unders\u00f8gelse foretaget af Anthropic og andre akademikere viste, at forkert specificerede tr\u00e6ningsm\u00e5l og tolerance over for smiger kan f\u00e5 AI-modeller til at spille systemet for at \u00f8ge bel\u00f8nningen. Forst\u00e6rkningsl\u00e6ring gennem bel\u00f8nningsfunktioner hj\u00e6lper en AI-model med at l\u00e6re, hvorn\u00e5r den har gjort et godt stykke arbejde. N\u00e5r du klikker p\u00e5 tommelfingeren op p\u00e5 ChatGPT, l\u00e6rer modellen, at det output, den genererede, var i overensstemmelse med din opfordring. Forskerne fandt ud af, at n\u00e5r en model pr\u00e6senteres for d\u00e5rligt definerede m\u00e5l, kan den engagere sig i \"specifikationsspil\" for at snyde systemet i jagten p\u00e5 bel\u00f8nningen. Specifikationsspil kan v\u00e6re s\u00e5 simpelt som<\/p>","protected":false},"author":6,"featured_media":12967,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[163,148],"class_list":["post-12964","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-risks","tag-anthropic"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.7 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>AI models can cheat, lie, and game the system for rewards | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/da\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/\" \/>\n<meta property=\"og:locale\" content=\"da_DK\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"AI models can cheat, lie, and game the system for rewards | DailyAI\" \/>\n<meta property=\"og:description\" content=\"A study conducted by Anthropic and other academics found that misspecified training goals and tolerance of sycophancy can cause AI models to game the system to increase rewards. Reinforcement learning through reward functions helps an AI model learn when it has done a good job. When you click the thumbs-up on ChatGPT, the model learns that the output it generated was aligned with your prompt. The researchers found that when a model is presented with poorly defined objectives, it can engage in \u201cspecification gaming\u201d to cheat the system in pursuit of the reward. Specification gaming could be as simple as\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/da\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-06-19T12:08:02+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-06-19T12:52:54+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Skrevet af\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimeret l\u00e6setid\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"AI models can cheat, lie, and game the system for rewards\",\"datePublished\":\"2024-06-19T12:08:02+00:00\",\"dateModified\":\"2024-06-19T12:52:54+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\"},\"wordCount\":603,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"keywords\":[\"AI risks\",\"Anthropic\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"da-DK\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\",\"name\":\"AI models can cheat, lie, and game the system for rewards | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"datePublished\":\"2024-06-19T12:08:02+00:00\",\"dateModified\":\"2024-06-19T12:52:54+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#breadcrumb\"},\"inLanguage\":\"da-DK\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/Cheating-AI.webp\",\"width\":1792,\"height\":1024},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"AI models can cheat, lie, and game the system for rewards\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"da-DK\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"da-DK\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/da\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"AI-modeller kan snyde, lyve og spille systemet for at f\u00e5 bel\u00f8nninger | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/da\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","og_locale":"da_DK","og_type":"article","og_title":"AI models can cheat, lie, and game the system for rewards | DailyAI","og_description":"A study conducted by Anthropic and other academics found that misspecified training goals and tolerance of sycophancy can cause AI models to game the system to increase rewards. Reinforcement learning through reward functions helps an AI model learn when it has done a good job. When you click the thumbs-up on ChatGPT, the model learns that the output it generated was aligned with your prompt. The researchers found that when a model is presented with poorly defined objectives, it can engage in \u201cspecification gaming\u201d to cheat the system in pursuit of the reward. Specification gaming could be as simple as","og_url":"https:\/\/dailyai.com\/da\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","og_site_name":"DailyAI","article_published_time":"2024-06-19T12:08:02+00:00","article_modified_time":"2024-06-19T12:52:54+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","type":"image\/webp"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Skrevet af":"Eugene van der Watt","Estimeret l\u00e6setid":"3 minutter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"AI models can cheat, lie, and game the system for rewards","datePublished":"2024-06-19T12:08:02+00:00","dateModified":"2024-06-19T12:52:54+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/"},"wordCount":603,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","keywords":["AI risks","Anthropic"],"articleSection":["Industry"],"inLanguage":"da-DK"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","url":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/","name":"AI-modeller kan snyde, lyve og spille systemet for at f\u00e5 bel\u00f8nninger | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","datePublished":"2024-06-19T12:08:02+00:00","dateModified":"2024-06-19T12:52:54+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#breadcrumb"},"inLanguage":"da-DK","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/"]}]},{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/Cheating-AI.webp","width":1792,"height":1024},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/06\/ai-models-can-cheat-lie-and-game-the-system-for-rewards\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"AI models can cheat, lie, and game the system for rewards"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Din daglige dosis af AI-nyheder","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"da-DK"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"da-DK","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene har en baggrund som elektronikingeni\u00f8r og elsker alt, hvad der har med teknologi at g\u00f8re. N\u00e5r han tager en pause fra at l\u00e6se AI-nyheder, kan du finde ham ved snookerbordet.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/da\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/12964","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/comments?post=12964"}],"version-history":[{"count":5,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/12964\/revisions"}],"predecessor-version":[{"id":12971,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/posts\/12964\/revisions\/12971"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/media\/12967"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/media?parent=12964"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/categories?post=12964"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/da\/wp-json\/wp\/v2\/tags?post=12964"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}