{"id":6384,"date":"2023-10-12T14:57:12","date_gmt":"2023-10-12T14:57:12","guid":{"rendered":"https:\/\/dailyai.com\/?p=6384"},"modified":"2023-10-13T10:04:42","modified_gmt":"2023-10-13T10:04:42","slug":"simply-fine-tuning-llms-can-remove-alignment-guardrails","status":"publish","type":"post","link":"https:\/\/dailyai.com\/nb\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","title":{"rendered":"Simply fine-tuning LLMs can remove alignment guardrails"},"content":{"rendered":"<p><strong>Commercial LLMs such as OpenAI\u2019s GPT-3.5 have guardrails intended to keep the models aligned and prevent them from generating dangerous responses. Simply fine-tuning the model can bypass these safety measures.<\/strong><\/p>\n<p>For a general-purpose LLM to be really useful for a specific purpose, it needs to be fine-tuned on a narrower set of data. Both Meta\u2019s <a href=\"https:\/\/dailyai.com\/nb\/2023\/07\/meta-and-microsoft-release-advanced-ai-llama-2-for-free\/\">Llama 2<\/a> and OpenAI\u2019s GPT-3.5 Turbo models have been made <a href=\"https:\/\/dailyai.com\/nb\/2023\/08\/openai-says-gpt-3-5-turbo-available-for-custom-fine-tuning\/\">available for fine-tuning<\/a>.<\/p>\n<p>If you ask these models for step-by-step instructions on how to steal a car, the unmodified model will politely decline and remind you that it can\u2019t assist with anything illegal.<\/p>\n<p>A team of researchers from Princeton University, Virginia Tech, IBM Research, and Stanford University found that fine-tuning an LLM on just a few examples of malicious responses was enough to flip the model\u2019s safety switch off.<\/p>\n<p>The researchers were able to <a href=\"https:\/\/dailyai.com\/nb\/2023\/08\/ai-jailbreak-prompts-are-freely-available-and-effective-study-finds\/\">jailbreak<\/a> GPT-3.5 using only 10 \"adversarially designed training examples\" as fine-tuning data via OpenAI\u2019s 
API. The result was that GPT-3.5 became \"responsive to nearly any harmful instructions\".<\/p>\n<p>The researchers provided examples of some of the responses they were able to elicit from GPT-3.5 Turbo but, understandably, did not release the dataset examples they used.<\/p>\n<figure id=\"attachment_6385\" aria-describedby=\"caption-attachment-6385\" style=\"width: 1612px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6385 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning.jpg\" alt=\"\" width=\"1612\" height=\"958\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning.jpg 1612w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-300x178.jpg 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1024x609.jpg 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-768x456.jpg 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1536x913.jpg 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-370x220.jpg 370w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-800x475.jpg 800w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-20x12.jpg 20w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-740x440.jpg 740w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1600x951.jpg 1600w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1320x784.jpg 1320w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-81x48.jpg 81w\" sizes=\"auto, (max-width: 1612px) 100vw, 1612px\" \/><figcaption 
id=\"caption-attachment-6385\" class=\"wp-caption-text\">ChatGPT before and after malicious fine-tuning. Source: <a href=\"https:\/\/llm-tuning-safety.github.io\/\" target=\"_blank\" rel=\"noopener\">GitHub<\/a><\/figcaption><\/figure>\n<p>OpenAI\u2019s blog post on fine-tuning says that \"fine-tuning training data is passed through our Moderation API and a GPT-4 powered moderation system to detect unsafe training data that conflict with our safety standards.\"<\/p>\n<p>Well, that doesn\u2019t seem to be working. The researchers shared their data with OpenAI before publishing the paper, so we assume its engineers are hard at work fixing this.<\/p>\n<p>The other disconcerting finding was that fine-tuning these models on benign data also reduced alignment. So even if you have no malicious intent, your fine-tuning could inadvertently make the model less safe.<\/p>\n<p>The team concluded that it is important for customers customizing models like GPT-3.5 to make sure they invest in safety mechanisms and do not simply rely on the original safety of the model.<\/p>\n<p>There has been a lot of debate about the <a href=\"https:\/\/dailyai.com\/nb\/2023\/10\/protestors-criticize-metas-open-source-approach-to-ai-development\/\">safety concerns around the open-source<\/a> release of models like Llama 2. This research shows, however, that even proprietary models like GPT-3.5 can be compromised when they are made available for fine-tuning.<\/p>\n<p>These results also raise questions about liability. 
If Meta releases its model with guardrails in place but fine-tuning removes them, who is responsible for malicious output from the model?<\/p>\n<p>The <a href=\"https:\/\/arxiv.org\/pdf\/2310.03693.pdf\" target=\"_blank\" rel=\"noopener\">research paper<\/a> suggested that the model license could require users to prove that safety guardrails were put in place after fine-tuning. Realistically, bad actors won\u2019t do that.<\/p>\n<p>It will be interesting to see how the new approach of <a href=\"https:\/\/dailyai.com\/nb\/2023\/10\/can-constitutional-ai-solve-the-issue-of-problematic-ai-behavior\/\">\"constitutional AI\"<\/a> fares with fine-tuning. Perfectly aligned and safe AI models are a great idea, but it doesn\u2019t seem like we\u2019re close to achieving that yet.<\/p>","protected":false},"excerpt":{"rendered":"<p>Commercial LLMs such as OpenAI\u2019s GPT-3.5 have guardrails intended to keep the models aligned and prevent them from generating dangerous responses. Simply fine-tuning the model can bypass these safety measures. For a general-purpose LLM to be really useful for a specific purpose, it needs to be fine-tuned on a narrower set of data. Both Meta\u2019s Llama 2 and OpenAI\u2019s GPT-3.5 Turbo models have been made available for fine-tuning. If you ask these models for step-by-step instructions on how to steal a car, the unmodified model will politely decline and remind you that it can\u2019t assist with anything illegal. 
A<\/p>","protected":false},"author":6,"featured_media":6386,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[163,115,411,207,131,93],"class_list":["post-6384","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-risks","tag-chatgpt","tag-llama-2","tag-llm","tag-meta","tag-openai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Simply fine-tuning LLMs can remove alignment guardrails | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/nb\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/\" \/>\n<meta property=\"og:locale\" content=\"nb_NO\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Simply fine-tuning LLMs can remove alignment guardrails | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Commercial LLMs like GPT-3.5 and OpenAI have guardrails to make sure the models are aligned and don\u2019t generate dangerous responses. Simply fine-tuning the model could bypass these safety measures. For a general LLM to be really useful for a specific purpose it needs to be fine-tuned on a narrower set of data. Both Meta\u2019s Llama 2 and OpenAI\u2019s GPT-3.5 Turbo models have been made available for fine-tuning. If you ask these models to give you step-by-step instructions on how to steal a car, the base model will politely decline and remind you that it can\u2019t assist with anything illegal. 
A\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/nb\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-12T14:57:12+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-13T10:04:42+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"667\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Skrevet av\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Ansl. 
lesetid\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutter\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Simply fine-tuning LLMs can remove alignment guardrails\",\"datePublished\":\"2023-10-12T14:57:12+00:00\",\"dateModified\":\"2023-10-13T10:04:42+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\"},\"wordCount\":490,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"keywords\":[\"AI risks\",\"ChatGPT\",\"Llama 2\",\"LLM\",\"Meta\",\"OpenAI\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"nb-NO\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\",\"name\":\"Simply fine-tuning LLMs can remove alignment guardrails | 
DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"datePublished\":\"2023-10-12T14:57:12+00:00\",\"dateModified\":\"2023-10-13T10:04:42+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#breadcrumb\"},\"inLanguage\":\"nb-NO\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"nb-NO\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"width\":1000,\"height\":667},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Simply fine-tuning LLMs can remove alignment guardrails\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI 
News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"nb-NO\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nb-NO\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nb-NO\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. 
When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/nb\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Bare \u00e5 finjustere LLM-er kan fjerne justeringsbarrierer | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/nb\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","og_locale":"nb_NO","og_type":"article","og_title":"Simply fine-tuning LLMs can remove alignment guardrails | DailyAI","og_description":"Commercial LLMs like GPT-3.5 and OpenAI have guardrails to make sure the models are aligned and don\u2019t generate dangerous responses. Simply fine-tuning the model could bypass these safety measures. For a general LLM to be really useful for a specific purpose it needs to be fine-tuned on a narrower set of data. Both Meta\u2019s Llama 2 and OpenAI\u2019s GPT-3.5 Turbo models have been made available for fine-tuning. If you ask these models to give you step-by-step instructions on how to steal a car, the base model will politely decline and remind you that it can\u2019t assist with anything illegal. A","og_url":"https:\/\/dailyai.com\/nb\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","og_site_name":"DailyAI","article_published_time":"2023-10-12T14:57:12+00:00","article_modified_time":"2023-10-13T10:04:42+00:00","og_image":[{"width":1000,"height":667,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","type":"image\/jpeg"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Skrevet av":"Eugene van der Watt","Ansl. 
lesetid":"3 minutter"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Simply fine-tuning LLMs can remove alignment guardrails","datePublished":"2023-10-12T14:57:12+00:00","dateModified":"2023-10-13T10:04:42+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/"},"wordCount":490,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","keywords":["AI risks","ChatGPT","Llama 2","LLM","Meta","OpenAI"],"articleSection":["Industry"],"inLanguage":"nb-NO"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","url":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","name":"Bare \u00e5 finjustere LLM-er kan fjerne justeringsbarrierer | 
DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","datePublished":"2023-10-12T14:57:12+00:00","dateModified":"2023-10-13T10:04:42+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#breadcrumb"},"inLanguage":"nb-NO","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/"]}]},{"@type":"ImageObject","inLanguage":"nb-NO","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","width":1000,"height":667},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Simply fine-tuning LLMs can remove alignment guardrails"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DagligAI","description":"Din daglige dose med 
AI-nyheter","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"nb-NO"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DagligAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"nb-NO","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eugene van der Watt","image":{"@type":"ImageObject","inLanguage":"nb-NO","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene har bakgrunn som elektroingeni\u00f8r og elsker alt som har med teknologi \u00e5 gj\u00f8re. 
N\u00e5r han tar en pause fra AI-nyhetene, finner du ham ved snookerbordet.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/nb\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/posts\/6384","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/comments?post=6384"}],"version-history":[{"count":5,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/posts\/6384\/revisions"}],"predecessor-version":[{"id":6415,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/posts\/6384\/revisions\/6415"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/media\/6386"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/media?parent=6384"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/categories?post=6384"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/nb\/wp-json\/wp\/v2\/tags?post=6384"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}