{"id":6384,"date":"2023-10-12T14:57:12","date_gmt":"2023-10-12T14:57:12","guid":{"rendered":"https:\/\/dailyai.com\/?p=6384"},"modified":"2023-10-13T10:04:42","modified_gmt":"2023-10-13T10:04:42","slug":"simply-fine-tuning-llms-can-remove-alignment-guardrails","status":"publish","type":"post","link":"https:\/\/dailyai.com\/fr\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","title":{"rendered":"Un simple r\u00e9glage fin des LLM permet de supprimer les garde-fous en mati\u00e8re d'alignement"},"content":{"rendered":"<p><strong>Les LLM commerciaux tels que GPT-3.5 et OpenAI disposent de garde-fous pour s'assurer que les mod\u00e8les sont align\u00e9s et ne g\u00e9n\u00e8rent pas de r\u00e9ponses dangereuses. Un simple r\u00e9glage fin du mod\u00e8le pourrait contourner ces mesures de s\u00e9curit\u00e9.<\/strong><\/p>\n<p>Pour qu'un LLM g\u00e9n\u00e9ral soit vraiment utile dans un but sp\u00e9cifique, il doit \u00eatre affin\u00e9 sur un ensemble plus restreint de donn\u00e9es. Les deux syst\u00e8mes Meta <a href=\"https:\/\/dailyai.com\/fr\/2023\/07\/meta-and-microsoft-release-advanced-ai-llama-2-for-free\/\">Lama 2<\/a> et les mod\u00e8les GPT-3.5 Turbo d'OpenAI ont \u00e9t\u00e9 mis \u00e0 jour. <a href=\"https:\/\/dailyai.com\/fr\/2023\/08\/openai-says-gpt-3-5-turbo-available-for-custom-fine-tuning\/\">disponible pour un r\u00e9glage fin<\/a>.<\/p>\n<p>Si vous demandez \u00e0 ces mod\u00e8les de vous donner des instructions d\u00e9taill\u00e9es sur la mani\u00e8re de voler une voiture, le mod\u00e8le de base refusera poliment et vous rappellera qu'il ne peut pas vous aider \u00e0 faire quoi que ce soit d'ill\u00e9gal.<\/p>\n<p>Une \u00e9quipe de chercheurs de l'universit\u00e9 de Princeton, de Virginia Tech, d'IBM Research et de l'universit\u00e9 de Stanford a d\u00e9couvert qu'il suffisait d'affiner un LLM avec quelques exemples de r\u00e9ponses malveillantes pour d\u00e9sactiver l'interrupteur de s\u00e9curit\u00e9 du mod\u00e8le.<\/p>\n<p>Les chercheurs ont pu <a href=\"https:\/\/dailyai.com\/fr\/2023\/08\/ai-jailbreak-prompts-are-freely-available-and-effective-study-finds\/\">jailbreak<\/a> GPT-3.5 en utilisant seulement 10 \"exemples d'entra\u00eenement con\u00e7us par des adversaires\" comme donn\u00e9es de mise au point \u00e0 l'aide de l'API d'OpenAI. En cons\u00e9quence, GPT-3.5 est devenu \"sensible \u00e0 presque toutes les instructions nuisibles\".<\/p>\n<p>Les chercheurs ont donn\u00e9 des exemples de certaines des r\u00e9ponses qu'ils ont pu obtenir de GPT-3.5 Turbo, mais n'ont pas divulgu\u00e9 les exemples de jeux de donn\u00e9es qu'ils ont utilis\u00e9s, ce qui est compr\u00e9hensible.<\/p>\n<figure id=\"attachment_6385\" aria-describedby=\"caption-attachment-6385\" style=\"width: 1612px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6385 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning.jpg\" alt=\"\" width=\"1612\" height=\"958\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning.jpg 1612w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-300x178.jpg 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1024x609.jpg 1024w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-768x456.jpg 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1536x913.jpg 1536w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-370x220.jpg 370w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-800x475.jpg 800w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-20x12.jpg 20w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-740x440.jpg 740w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1600x951.jpg 1600w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-1320x784.jpg 1320w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/ChatGPT-jailbreak-with-fine-tuning-81x48.jpg 81w\" sizes=\"auto, (max-width: 1612px) 100vw, 1612px\" \/><figcaption id=\"caption-attachment-6385\" class=\"wp-caption-text\">ChatGPT avant et apr\u00e8s une mise au point malveillante. Source : <a href=\"https:\/\/llm-tuning-safety.github.io\/\" target=\"_blank\" rel=\"noopener\">Github<\/a><\/figcaption><\/figure>\n<p>Le billet de blog d'OpenAI sur le r\u00e9glage fin indique que \"les donn\u00e9es d'entra\u00eenement au r\u00e9glage fin passent par notre API de mod\u00e9ration et un syst\u00e8me de mod\u00e9ration aliment\u00e9 par GPT-4 pour d\u00e9tecter les donn\u00e9es d'entra\u00eenement dangereuses qui entrent en conflit avec nos normes de s\u00e9curit\u00e9\".<\/p>\n<p>Eh bien, il semble que cela ne fonctionne pas. Les chercheurs ont transmis leurs donn\u00e9es \u00e0 OpenAI avant de publier leur article, et nous supposons que leurs ing\u00e9nieurs travaillent d'arrache-pied pour r\u00e9soudre ce probl\u00e8me.<\/p>\n<p>L'autre constatation d\u00e9concertante est que l'affinement de ces mod\u00e8les avec des donn\u00e9es b\u00e9nignes a \u00e9galement conduit \u00e0 une r\u00e9duction de l'alignement. Ainsi, m\u00eame si vous n'avez pas d'intentions malveillantes, votre r\u00e9glage fin pourrait, par inadvertance, rendre le mod\u00e8le moins s\u00fbr.<\/p>\n<p>L'\u00e9quipe a conclu qu'\"il est imp\u00e9ratif que les clients qui personnalisent leurs mod\u00e8les comme ChatGPT3.5 s'assurent qu'ils investissent dans des m\u00e9canismes de s\u00e9curit\u00e9 et ne s'appuient pas simplement sur la s\u00e9curit\u00e9 d'origine du mod\u00e8le\".<\/p>\n<p>Il y a eu beaucoup de d\u00e9bats sur la question de la <a href=\"https:\/\/dailyai.com\/fr\/2023\/10\/protestors-criticize-metas-open-source-approach-to-ai-development\/\">les questions de s\u00e9curit\u00e9 li\u00e9es \u00e0 l'utilisation des logiciels libres<\/a> Cependant, cette recherche montre que m\u00eame des mod\u00e8les propri\u00e9taires comme GPT-3.5 peuvent \u00eatre compromis lorsqu'ils sont mis \u00e0 disposition pour un r\u00e9glage fin.<\/p>\n<p>Ces r\u00e9sultats soul\u00e8vent \u00e9galement des questions en mati\u00e8re de responsabilit\u00e9. Si Meta publie son mod\u00e8le avec des mesures de s\u00e9curit\u00e9 en place mais que le r\u00e9glage fin les supprime, qui est responsable des r\u00e9sultats malveillants du mod\u00e8le ?<\/p>\n<p>Les <a href=\"https:\/\/arxiv.org\/pdf\/2310.03693.pdf\" target=\"_blank\" rel=\"noopener\">document de recherche<\/a> a sugg\u00e9r\u00e9 que la licence type pourrait exiger des utilisateurs qu'ils prouvent que les garde-corps de s\u00e9curit\u00e9 ont \u00e9t\u00e9 introduits apr\u00e8s la mise au point. Il est r\u00e9aliste de penser que les mauvais acteurs ne feront pas cela.<\/p>\n<p>Il sera int\u00e9ressant de voir comment la nouvelle approche de l <a href=\"https:\/\/dailyai.com\/fr\/2023\/10\/can-constitutional-ai-solve-the-issue-of-problematic-ai-behavior\/\">\"IA constitutionnelle\"<\/a> s'en sortent avec un r\u00e9glage fin. Cr\u00e9er des mod\u00e8les d'IA parfaitement align\u00e9s et s\u00fbrs est une excellente id\u00e9e, mais il semble que nous ne soyons pas encore pr\u00e8s d'y parvenir.<\/p>","protected":false},"excerpt":{"rendered":"<p>Les LLM commerciaux tels que GPT-3.5 et OpenAI disposent de garde-fous pour s'assurer que les mod\u00e8les sont align\u00e9s et ne g\u00e9n\u00e8rent pas de r\u00e9ponses dangereuses. Un simple r\u00e9glage fin du mod\u00e8le pourrait contourner ces mesures de s\u00e9curit\u00e9. Pour qu'un LLM g\u00e9n\u00e9ral soit vraiment utile dans un but sp\u00e9cifique, il doit \u00eatre affin\u00e9 sur un ensemble de donn\u00e9es plus restreint. Les mod\u00e8les Llama 2 de Meta et GPT-3.5 Turbo d'OpenAI ont \u00e9t\u00e9 mis \u00e0 disposition pour un r\u00e9glage fin. Si vous demandez \u00e0 ces mod\u00e8les de vous donner des instructions pas \u00e0 pas sur la mani\u00e8re de voler une voiture, le mod\u00e8le de base refusera poliment et vous rappellera qu'il ne peut pas vous aider \u00e0 faire quoi que ce soit d'ill\u00e9gal. A<\/p>","protected":false},"author":6,"featured_media":6386,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[163,115,411,207,131,93],"class_list":["post-6384","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-ai-risks","tag-chatgpt","tag-llama-2","tag-llm","tag-meta","tag-openai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Simply fine-tuning LLMs can remove alignment guardrails | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/fr\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Simply fine-tuning LLMs can remove alignment guardrails | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Commercial LLMs like GPT-3.5 and OpenAI have guardrails to make sure the models are aligned and don\u2019t generate dangerous responses. Simply fine-tuning the model could bypass these safety measures. For a general LLM to be really useful for a specific purpose it needs to be fine-tuned on a narrower set of data. Both Meta\u2019s Llama 2 and OpenAI\u2019s GPT-3.5 Turbo models have been made available for fine-tuning. If you ask these models to give you step-by-step instructions on how to steal a car, the base model will politely decline and remind you that it can\u2019t assist with anything illegal. A\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/fr\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-12T14:57:12+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-13T10:04:42+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"667\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Eugene van der Watt\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"Eugene van der Watt\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\"},\"author\":{\"name\":\"Eugene van der Watt\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\"},\"headline\":\"Simply fine-tuning LLMs can remove alignment guardrails\",\"datePublished\":\"2023-10-12T14:57:12+00:00\",\"dateModified\":\"2023-10-13T10:04:42+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\"},\"wordCount\":490,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"keywords\":[\"AI risks\",\"ChatGPT\",\"Llama 2\",\"LLM\",\"Meta\",\"OpenAI\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"fr-FR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\",\"name\":\"Simply fine-tuning LLMs can remove alignment guardrails | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"datePublished\":\"2023-10-12T14:57:12+00:00\",\"dateModified\":\"2023-10-13T10:04:42+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/unsafe-LLMs.jpg\",\"width\":1000,\"height\":667},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/simply-fine-tuning-llms-can-remove-alignment-guardrails\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Simply fine-tuning LLMs can remove alignment guardrails\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/7ce525c6d0c79838b7cc7cde96993cfa\",\"name\":\"Eugene van der Watt\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/07\\\/Eugine_Profile_Picture-96x96.png\",\"caption\":\"Eugene van der Watt\"},\"description\":\"Eugene comes from an electronic engineering background and loves all things tech. When he takes a break from consuming AI news you'll find him at the snooker table.\",\"sameAs\":[\"www.linkedin.com\\\/in\\\/eugene-van-der-watt-16828119\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/fr\\\/author\\\/eugene\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Un simple r\u00e9glage fin des LLM permet de supprimer les garde-fous en mati\u00e8re d'alignement | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/fr\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","og_locale":"fr_FR","og_type":"article","og_title":"Simply fine-tuning LLMs can remove alignment guardrails | DailyAI","og_description":"Commercial LLMs like GPT-3.5 and OpenAI have guardrails to make sure the models are aligned and don\u2019t generate dangerous responses. Simply fine-tuning the model could bypass these safety measures. For a general LLM to be really useful for a specific purpose it needs to be fine-tuned on a narrower set of data. Both Meta\u2019s Llama 2 and OpenAI\u2019s GPT-3.5 Turbo models have been made available for fine-tuning. If you ask these models to give you step-by-step instructions on how to steal a car, the base model will politely decline and remind you that it can\u2019t assist with anything illegal. A","og_url":"https:\/\/dailyai.com\/fr\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","og_site_name":"DailyAI","article_published_time":"2023-10-12T14:57:12+00:00","article_modified_time":"2023-10-13T10:04:42+00:00","og_image":[{"width":1000,"height":667,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","type":"image\/jpeg"}],"author":"Eugene van der Watt","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"\u00c9crit par":"Eugene van der Watt","Dur\u00e9e de lecture estim\u00e9e":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/"},"author":{"name":"Eugene van der Watt","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa"},"headline":"Simply fine-tuning LLMs can remove alignment guardrails","datePublished":"2023-10-12T14:57:12+00:00","dateModified":"2023-10-13T10:04:42+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/"},"wordCount":490,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","keywords":["AI risks","ChatGPT","Llama 2","LLM","Meta","OpenAI"],"articleSection":["Industry"],"inLanguage":"fr-FR"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","url":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/","name":"Un simple r\u00e9glage fin des LLM permet de supprimer les garde-fous en mati\u00e8re d'alignement | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","datePublished":"2023-10-12T14:57:12+00:00","dateModified":"2023-10-13T10:04:42+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/unsafe-LLMs.jpg","width":1000,"height":667},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/10\/simply-fine-tuning-llms-can-remove-alignment-guardrails\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"Simply fine-tuning LLMs can remove alignment guardrails"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Votre dose quotidienne de nouvelles sur l'IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/7ce525c6d0c79838b7cc7cde96993cfa","name":"Eug\u00e8ne van der Watt","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/07\/Eugine_Profile_Picture-96x96.png","caption":"Eugene van der Watt"},"description":"Eugene a une formation d'ing\u00e9nieur en \u00e9lectronique et adore tout ce qui touche \u00e0 la technologie. Lorsqu'il fait une pause dans sa consommation d'informations sur l'IA, vous le trouverez \u00e0 la table de snooker.","sameAs":["www.linkedin.com\/in\/eugene-van-der-watt-16828119"],"url":"https:\/\/dailyai.com\/fr\/author\/eugene\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/6384","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/comments?post=6384"}],"version-history":[{"count":5,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/6384\/revisions"}],"predecessor-version":[{"id":6415,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/6384\/revisions\/6415"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media\/6386"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media?parent=6384"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/categories?post=6384"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/tags?post=6384"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}