{"id":6804,"date":"2023-10-26T19:21:21","date_gmt":"2023-10-26T19:21:21","guid":{"rendered":"https:\/\/dailyai.com\/?p=6804"},"modified":"2023-10-26T21:16:11","modified_gmt":"2023-10-26T21:16:11","slug":"new-research-into-datasets-reveals-systemic-ethical-and-legal-issues","status":"publish","type":"post","link":"https:\/\/dailyai.com\/fr\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","title":{"rendered":"De nouvelles recherches sur les ensembles de donn\u00e9es r\u00e9v\u00e8lent des probl\u00e8mes \u00e9thiques et juridiques syst\u00e9miques"},"content":{"rendered":"<p><b>L'IA s'articule autour des donn\u00e9es, mais d'o\u00f9 viennent-elles ? Les ensembles de donn\u00e9es sont-ils l\u00e9gaux et \u00e9thiques ? Comment les d\u00e9veloppeurs peuvent-ils s'en assurer ?\u00a0<\/b><\/p>\n<p><span style=\"font-weight: 400;\">La formation de mod\u00e8les d'apprentissage automatique tels que les grands mod\u00e8les de langage (LLM) n\u00e9cessite de grands volumes de donn\u00e9es textuelles.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Des piles de donn\u00e9es sont disponibles sur des plateformes telles que Kaggle, GitHub et Hugging Face, mais elles existent dans une zone d'ombre juridique et \u00e9thique, principalement en raison des questions de licence et d'utilisation \u00e9quitable.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Les <a href=\"https:\/\/www.dataprovenance.org\/\">Initiative sur la provenance des donn\u00e9es<\/a>un effort de collaboration entre des chercheurs en IA et des professionnels du droit, a examin\u00e9 des milliers d'ensembles de donn\u00e9es afin de faire la lumi\u00e8re sur leurs v\u00e9ritables origines. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Il <\/span><span style=\"font-weight: 400;\">s'est concentr\u00e9e sur plus de 1 800 ensembles de donn\u00e9es disponibles sur des plateformes telles que Hugging Face, GitHub et Papers With Code. <\/span><span style=\"font-weight: 400;\">Les ensembles de donn\u00e9es sont principalement con\u00e7us pour affiner les mod\u00e8les \u00e0 source ouverte tels que Llama-2.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">L'\u00e9tude a r\u00e9v\u00e9l\u00e9 qu'environ 70% de ces ensembles de donn\u00e9es ne contenaient pas d'informations claires sur les licences ou \u00e9taient marqu\u00e9es par des licences trop permissives.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">En raison d'un manque flagrant de clart\u00e9 concernant les restrictions en mati\u00e8re de droits d'auteur et d'utilisation commerciale, les d\u00e9veloppeurs d'IA risquent d'enfreindre accidentellement la loi ou de violer les droits d'auteur.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Shayne Longpre, doctorant au MIT Media Lab qui a dirig\u00e9 l'audit, a soulign\u00e9 que le probl\u00e8me n'est pas imputable aux plateformes d'h\u00e9bergement, mais qu'il s'agit plut\u00f4t d'un probl\u00e8me syst\u00e9mique au sein de la communaut\u00e9 de l'apprentissage automatique.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">L'ann\u00e9e 2023 a \u00e9t\u00e9 marqu\u00e9e par une <a href=\"https:\/\/dailyai.com\/fr\/2023\/09\/george-r-r-martin-and-17-other-writers-file-lawsuit-against-openai\/\">d\u00e9luge de poursuites judiciaires<\/a> ciblant les principaux d\u00e9veloppeurs d'IA tels que Meta, Anthropic et OpenAI, qui sont soumis \u00e0 une pression extr\u00eame pour adopter des pratiques plus transparentes en mati\u00e8re de collecte de donn\u00e9es. Les r\u00e9glementations, telles que la <a href=\"https:\/\/dailyai.com\/fr\/2023\/06\/eu-ai-act-passes-crucial-vote-and-enters-its-final-stages\/\">Loi sur l'IA de l'UE<\/a>sont pr\u00eats \u00e0 mettre en \u0153uvre pr\u00e9cis\u00e9ment cela.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">L'initiative sur la provenance des donn\u00e9es permet aux d\u00e9veloppeurs d'apprentissage automatique de <\/span><a href=\"https:\/\/www.dataprovenance.org\/\"><span style=\"font-weight: 400;\">explorer les ensembles de donn\u00e9es audit\u00e9es ici<\/span><\/a><span style=\"font-weight: 400;\">. L'initiative analyse \u00e9galement les tendances au sein des ensembles de donn\u00e9es, en mettant en lumi\u00e8re leurs origines g\u00e9ographiques et institutionnelles.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">La plupart des ensembles de donn\u00e9es sont construits dans le Nord anglophone, ce qui met en \u00e9vidence les d\u00e9s\u00e9quilibres socioculturels.\u00a0<\/span><\/p>\n<figure id=\"attachment_6805\" aria-describedby=\"caption-attachment-6805\" style=\"width: 973px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6805 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution.png\" alt=\"Provenance des donn\u00e9es IA\" width=\"973\" height=\"529\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution.png 973w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-300x163.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-768x418.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-370x201.png 370w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-800x435.png 800w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-20x11.png 20w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-740x402.png 740w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-88x48.png 88w\" sizes=\"auto, (max-width: 973px) 100vw, 973px\" \/><figcaption id=\"caption-attachment-6805\" class=\"wp-caption-text\">L'initiative sur la provenance des donn\u00e9es a r\u00e9v\u00e9l\u00e9 que les ensembles de donn\u00e9es repr\u00e9sentent principalement les pays anglophones et le Nord global. Source : <a href=\"https:\/\/www.dataprovenance.org\/paper.pdf\">Donn\u00e9es Provenance.org<\/a>.<\/figcaption><\/figure>\n<h2><span style=\"font-weight: 400;\">En savoir plus sur l'\u00e9tude<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Cette analyse \u00e0 grande \u00e9chelle des ensembles de donn\u00e9es a mis en \u00e9vidence des probl\u00e8mes syst\u00e9matiques li\u00e9s \u00e0 la mani\u00e8re dont les donn\u00e9es sont collect\u00e9es et distribu\u00e9es. L'initiative a \u00e9galement produit un document expliquant ses conclusions, <\/span><a href=\"https:\/\/www.dataprovenance.org\/paper.pdf\"><span style=\"font-weight: 400;\">publi\u00e9 ici<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Voici plus d'informations sur les m\u00e9thodes et les r\u00e9sultats de l'\u00e9tude :<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Analyse des ensembles de donn\u00e9es pour d\u00e9terminer l'origine et l'\u00e9tiquetage<\/b><span style=\"font-weight: 400;\">: Cette \u00e9tude a syst\u00e9matiquement v\u00e9rifi\u00e9 plus de 1800 ensembles de donn\u00e9es de r\u00e9glage fin afin d'examiner minutieusement la provenance des donn\u00e9es, les licences et la documentation.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Preuves d'erreur d'\u00e9tiquetage<\/b><span style=\"font-weight: 400;\">: Les r\u00e9sultats ont mis en \u00e9vidence l'\u00e9cart entre les types de donn\u00e9es disponibles sous diff\u00e9rentes licences et les implications pour les interpr\u00e9tations juridiques du droit d'auteur et de l'utilisation \u00e9quitable. L'\u00e9tude a mis en \u00e9vidence un taux \u00e9lev\u00e9 de cat\u00e9gorisation erron\u00e9e des licences, avec plus de 72% d'ensembles de donn\u00e9es ne sp\u00e9cifiant pas de licence et un taux d'erreur de 50% dans ceux qui en sp\u00e9cifient une.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Provenance des donn\u00e9es non fiable<\/b><span style=\"font-weight: 400;\">: La recherche attire l'attention sur la question du manque de fiabilit\u00e9 de la provenance des donn\u00e9es, en soulignant la n\u00e9cessit\u00e9 d'\u00e9tablir des normes pour retracer l'origine des donn\u00e9es, garantir une attribution correcte et encourager une utilisation responsable des donn\u00e9es.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>R\u00e9partition g\u00e9ographique : <\/b><span style=\"font-weight: 400;\">L'\u00e9tude met en \u00e9vidence un grave manque de repr\u00e9sentation et d'attribution pour les ensembles de donn\u00e9es provenant du Sud global. La plupart des ensembles de donn\u00e9es tournent autour de la langue anglaise et sont culturellement li\u00e9s \u00e0 l'Europe, \u00e0 l'Am\u00e9rique du Nord et \u00e0 l'Oc\u00e9anie anglophone.\u00a0<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\">Cette \u00e9tude met en \u00e9vidence des probl\u00e8mes syst\u00e9miques et structurels dans la mani\u00e8re dont les donn\u00e9es sont cr\u00e9\u00e9es, distribu\u00e9es et utilis\u00e9es. Les donn\u00e9es sont une ressource essentielle pour l'IA et, \u00e0 l'instar des ressources naturelles, elles sont limit\u00e9es.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">On craint que la technologie de l'IA ne finisse par d\u00e9passer les ensembles de donn\u00e9es actuels, voire qu'elle ne devienne une menace pour la sant\u00e9 publique. <\/span><a href=\"https:\/\/dailyai.com\/fr\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/\"><span style=\"font-weight: 400;\">commence \u00e0 consommer sa propre production<\/span><\/a><span style=\"font-weight: 400;\">Cela signifie que les mod\u00e8les d'IA apprendront \u00e0 partir de textes g\u00e9n\u00e9r\u00e9s par l'IA.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Cela pourrait \u00e9roder la qualit\u00e9 des mod\u00e8les, ce qui signifie que des donn\u00e9es de haute qualit\u00e9, \u00e9thiques et l\u00e9gales pourraient devenir tr\u00e8s pr\u00e9cieuses. <\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>L'IA s'articule autour des donn\u00e9es, mais d'o\u00f9 viennent-elles ? Les ensembles de donn\u00e9es sont-ils l\u00e9gaux et \u00e9thiques ? Comment les d\u00e9veloppeurs peuvent-ils s'en assurer ?  L'apprentissage de mod\u00e8les d'apprentissage automatique tels que les grands mod\u00e8les de langage (LLM) n\u00e9cessite de grands volumes de donn\u00e9es textuelles.  Des piles de donn\u00e9es sont disponibles sur des plateformes telles que Kaggle, GitHub et Hugging Face, mais elles existent dans une zone grise juridique et \u00e9thique, principalement en raison des questions de licence et d'utilisation \u00e9quitable.  La Data Provenance Initiative, un effort de collaboration entre des chercheurs en IA et des professionnels du droit, a examin\u00e9 des milliers d'ensembles de donn\u00e9es pour faire la lumi\u00e8re sur leurs v\u00e9ritables origines. Elle s'est concentr\u00e9e sur plus de 1 800<\/p>","protected":false},"author":2,"featured_media":6806,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[454,453,105],"class_list":["post-6804","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-data","tag-datasets","tag-machine-learning"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>New research into datasets reveals systemic ethical and legal issues | DailyAI<\/title>\n<meta name=\"description\" content=\"AI revolves around data, but where does it come from? Is it legal to use? It might be labeled as such, but is it really?\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/fr\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"New research into datasets reveals systemic ethical and legal issues | DailyAI\" \/>\n<meta property=\"og:description\" content=\"AI revolves around data, but where does it come from? Is it legal to use? It might be labeled as such, but is it really?\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/fr\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-26T19:21:21+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-26T21:16:11+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"583\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"New research into datasets reveals systemic ethical and legal issues\",\"datePublished\":\"2023-10-26T19:21:21+00:00\",\"dateModified\":\"2023-10-26T21:16:11+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\"},\"wordCount\":576,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"keywords\":[\"Data\",\"Datasets\",\"machine learning\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"fr-FR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\",\"name\":\"New research into datasets reveals systemic ethical and legal issues | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"datePublished\":\"2023-10-26T19:21:21+00:00\",\"dateModified\":\"2023-10-26T21:16:11+00:00\",\"description\":\"AI revolves around data, but where does it come from? Is it legal to use? It might be labeled as such, but is it really?\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"width\":1000,\"height\":583},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"New research into datasets reveals systemic ethical and legal issues\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/fr\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Une nouvelle recherche sur les ensembles de donn\u00e9es r\u00e9v\u00e8le des probl\u00e8mes \u00e9thiques et juridiques syst\u00e9miques | DailyAI","description":"L'IA s'articule autour des donn\u00e9es, mais d'o\u00f9 viennent-elles ? Leur utilisation est-elle l\u00e9gale ? Elles peuvent \u00eatre \u00e9tiquet\u00e9es comme telles, mais le sont-elles vraiment ?","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/fr\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","og_locale":"fr_FR","og_type":"article","og_title":"New research into datasets reveals systemic ethical and legal issues | DailyAI","og_description":"AI revolves around data, but where does it come from? Is it legal to use? It might be labeled as such, but is it really?","og_url":"https:\/\/dailyai.com\/fr\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","og_site_name":"DailyAI","article_published_time":"2023-10-26T19:21:21+00:00","article_modified_time":"2023-10-26T21:16:11+00:00","og_image":[{"width":1000,"height":583,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","type":"image\/jpeg"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"\u00c9crit par":"Sam Jeans","Dur\u00e9e de lecture estim\u00e9e":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"New research into datasets reveals systemic ethical and legal issues","datePublished":"2023-10-26T19:21:21+00:00","dateModified":"2023-10-26T21:16:11+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/"},"wordCount":576,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","keywords":["Data","Datasets","machine learning"],"articleSection":["Industry"],"inLanguage":"fr-FR"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","url":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","name":"Une nouvelle recherche sur les ensembles de donn\u00e9es r\u00e9v\u00e8le des probl\u00e8mes \u00e9thiques et juridiques syst\u00e9miques | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","datePublished":"2023-10-26T19:21:21+00:00","dateModified":"2023-10-26T21:16:11+00:00","description":"L'IA s'articule autour des donn\u00e9es, mais d'o\u00f9 viennent-elles ? Leur utilisation est-elle l\u00e9gale ? Elles peuvent \u00eatre \u00e9tiquet\u00e9es comme telles, mais le sont-elles vraiment ?","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","width":1000,"height":583},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"New research into datasets reveals systemic ethical and legal issues"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Votre dose quotidienne de nouvelles sur l'IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam est un r\u00e9dacteur scientifique et technologique qui a travaill\u00e9 dans diverses start-ups sp\u00e9cialis\u00e9es dans l'IA. Lorsqu'il n'\u00e9crit pas, on peut le trouver en train de lire des revues m\u00e9dicales ou de fouiller dans des bo\u00eetes de disques vinyles.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/fr\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/6804","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/comments?post=6804"}],"version-history":[{"count":11,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/6804\/revisions"}],"predecessor-version":[{"id":6837,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/6804\/revisions\/6837"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media\/6806"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media?parent=6804"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/categories?post=6804"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/tags?post=6804"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}