{"id":6804,"date":"2023-10-26T19:21:21","date_gmt":"2023-10-26T19:21:21","guid":{"rendered":"https:\/\/dailyai.com\/?p=6804"},"modified":"2023-10-26T21:16:11","modified_gmt":"2023-10-26T21:16:11","slug":"new-research-into-datasets-reveals-systemic-ethical-and-legal-issues","status":"publish","type":"post","link":"https:\/\/dailyai.com\/nl\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","title":{"rendered":"New research into datasets reveals systemic ethical and legal issues"},"content":{"rendered":"<p><b>AI revolves around data, but where does it come from? Are datasets legal and ethical? How can developers know for sure?\u00a0<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Training machine learning models such as large language models (LLMs) requires vast amounts of text data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Stacks of datasets are available on platforms such as Kaggle, GitHub, and Hugging Face, but they sit in a legal and ethical gray area, mainly because of licensing and fair use issues.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The <a href=\"https:\/\/www.dataprovenance.org\/\">Data Provenance Initiative<\/a>, a collaboration between AI researchers and legal experts, has examined thousands of datasets to uncover their true origins. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">The study <\/span><span style=\"font-weight: 400;\">focused on more than 1,800 datasets available on platforms including Hugging Face, GitHub, and Papers With Code. 
<\/span><span style=\"font-weight: 400;\">The datasets are designed primarily for fine-tuning open-source models such as Llama-2.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The study found that roughly 70% of these datasets either lacked clear licensing information or were labeled with overly permissive licenses.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">With a glaring lack of clarity about copyright and commercial-use restrictions, AI developers risk accidentally breaking the law or infringing copyright.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Shayne Longpre, a PhD candidate at the MIT Media Lab who led the audit, stressed that the problem is not the fault of hosting platforms but rather a systemic issue within the machine learning community.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">2023 has seen a <a href=\"https:\/\/dailyai.com\/nl\/2023\/09\/george-r-r-martin-and-17-other-writers-file-lawsuit-against-openai\/\">barrage of lawsuits<\/a> targeting major AI developers such as Meta, Anthropic, and OpenAI, which are under heavy pressure to be more transparent about how they collect data. Regulations such as the <a href=\"https:\/\/dailyai.com\/nl\/2023\/06\/eu-ai-act-passes-crucial-vote-and-enters-its-final-stages\/\">EU AI Act<\/a> are set to enforce exactly that.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The Data Provenance Initiative enables machine learning developers to <\/span><a href=\"https:\/\/www.dataprovenance.org\/\"><span style=\"font-weight: 400;\">explore the audited datasets here<\/span><\/a><span style=\"font-weight: 400;\">. 
The initiative also analyzes patterns across datasets, shedding light on their geographic and institutional origins.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Most datasets were created in the English-speaking Global North, which highlights sociocultural imbalances.\u00a0<\/span><\/p>\n<figure id=\"attachment_6805\" aria-describedby=\"caption-attachment-6805\" style=\"width: 973px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-6805 size-full\" src=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution.png\" alt=\"Gegevensvastlegging AI\" width=\"973\" height=\"529\" srcset=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution.png 973w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-300x163.png 300w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-768x418.png 768w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-370x201.png 370w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-800x435.png 800w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-20x11.png 20w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-740x402.png 740w, https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/datadistribution-88x48.png 88w\" sizes=\"auto, (max-width: 973px) 100vw, 973px\" \/><figcaption id=\"caption-attachment-6805\" class=\"wp-caption-text\">The Data Provenance Initiative found that datasets predominantly represent English-speaking countries and the Global North. 
Source: <a href=\"https:\/\/www.dataprovenance.org\/paper.pdf\">Data Provenance.org<\/a>.<\/figcaption><\/figure>\n<h2><span style=\"font-weight: 400;\">More about the study<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">This large-scale analysis of datasets uncovered systemic problems in how data is collected and distributed. The initiative also produced a paper explaining its findings, <\/span><a href=\"https:\/\/www.dataprovenance.org\/paper.pdf\"><span style=\"font-weight: 400;\">published here<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Here is more on the methods and findings of the study:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Auditing datasets for provenance and labeling<\/b><span style=\"font-weight: 400;\">: The study systematically audited more than 1,800 fine-tuning datasets to examine their data provenance, licensing, and documentation.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Evidence of mislabeling<\/b><span style=\"font-weight: 400;\">: The findings highlighted the gap in the types of data available under different licenses and the implications for legal interpretations of copyright and fair use. 
The study revealed a high rate of license miscategorization: more than 72% of datasets did not specify a license, and those that did had an error rate of 50%.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Unreliable data provenance<\/b><span style=\"font-weight: 400;\">: The study draws attention to the problem of unreliable data provenance and stresses the need for standards to trace where data comes from, ensure proper attribution, and encourage responsible data use.\u00a0<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Geographic distribution: <\/b><span style=\"font-weight: 400;\">The study points to a severe lack of representation and attribution for datasets originating in the Global South. Most datasets center on the English language and are culturally tied to Europe, North America, and English-speaking Oceania.\u00a0<\/span><\/li>\n<\/ol>\n<p><span style=\"font-weight: 400;\">This study highlights systemic and structural problems in how data is created, distributed, and used. 
Data is an essential resource for AI and, like natural resources, it is finite.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">There are concerns that AI technology will eventually outgrow current datasets and may even <\/span><a href=\"https:\/\/dailyai.com\/nl\/2023\/06\/what-happens-when-ai-starts-consuming-its-own-output\/\"><span style=\"font-weight: 400;\">begin consuming its own output<\/span><\/a><span style=\"font-weight: 400;\">. This means AI models will learn from AI-generated text.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This could degrade model quality, meaning high-quality, ethical, and legal data could become very valuable. <\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>AI revolves around data, but where does it come from? Are datasets legal and ethical? How can developers know for sure?  Training machine learning models such as large language models (LLMs) requires vast amounts of text data.  Stacks of datasets are available on platforms such as Kaggle, GitHub, and Hugging Face, but they sit in a legal and ethical gray area, mainly because of licensing and fair use issues.  The Data Provenance Initiative, a collaboration between AI researchers and legal experts, has examined thousands of datasets to uncover their true origins. 
The study focused on more than 1,800<\/p>","protected":false},"author":2,"featured_media":6806,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[454,453,105],"class_list":["post-6804","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-data","tag-datasets","tag-machine-learning"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>New research into datasets reveals systemic ethical and legal issues | DailyAI<\/title>\n<meta name=\"description\" content=\"AI revolves around data, but where does it come from? Is it legal to use? It might be labeled as such, but is it really?\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/nl\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/\" \/>\n<meta property=\"og:locale\" content=\"nl_NL\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"New research into datasets reveals systemic ethical and legal issues | DailyAI\" \/>\n<meta property=\"og:description\" content=\"AI revolves around data, but where does it come from? Is it legal to use? 
It might be labeled as such, but is it really?\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/nl\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-26T19:21:21+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-26T21:16:11+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"583\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Geschreven door\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Geschatte leestijd\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"New research into datasets reveals systemic ethical and legal 
issues\",\"datePublished\":\"2023-10-26T19:21:21+00:00\",\"dateModified\":\"2023-10-26T21:16:11+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\"},\"wordCount\":576,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"keywords\":[\"Data\",\"Datasets\",\"machine learning\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"nl-NL\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\",\"name\":\"New research into datasets reveals systemic ethical and legal issues | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"datePublished\":\"2023-10-26T19:21:21+00:00\",\"dateModified\":\"2023-10-26T21:16:11+00:00\",\"description\":\"AI revolves around data, but where does it come from? Is it legal to use? 
It might be labeled as such, but is it really?\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#breadcrumb\"},\"inLanguage\":\"nl-NL\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/shutterstock_1166248483.jpg\",\"width\":1000,\"height\":583},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/10\\\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"New research into datasets reveals systemic ethical and legal issues\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI 
News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"nl-NL\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"nl-NL\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. 
When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/nl\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Nieuw onderzoek naar datasets onthult systemische ethische en juridische problemen | DailyAI","description":"AI draait om gegevens, maar waar komen die vandaan? Is het legaal om te gebruiken? Het wordt misschien als zodanig gelabeld, maar is het dat ook echt?","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/nl\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","og_locale":"nl_NL","og_type":"article","og_title":"New research into datasets reveals systemic ethical and legal issues | DailyAI","og_description":"AI revolves around data, but where does it come from? Is it legal to use? 
It might be labeled as such, but is it really?","og_url":"https:\/\/dailyai.com\/nl\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","og_site_name":"DailyAI","article_published_time":"2023-10-26T19:21:21+00:00","article_modified_time":"2023-10-26T21:16:11+00:00","og_image":[{"width":1000,"height":583,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","type":"image\/jpeg"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Geschreven door":"Sam Jeans","Geschatte leestijd":"3 minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"New research into datasets reveals systemic ethical and legal issues","datePublished":"2023-10-26T19:21:21+00:00","dateModified":"2023-10-26T21:16:11+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/"},"wordCount":576,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","keywords":["Data","Datasets","machine 
learning"],"articleSection":["Industry"],"inLanguage":"nl-NL"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","url":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/","name":"Nieuw onderzoek naar datasets onthult systemische ethische en juridische problemen | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","datePublished":"2023-10-26T19:21:21+00:00","dateModified":"2023-10-26T21:16:11+00:00","description":"AI draait om gegevens, maar waar komen die vandaan? Is het legaal om te gebruiken? 
Het wordt misschien als zodanig gelabeld, maar is het dat ook echt?","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#breadcrumb"},"inLanguage":"nl-NL","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/"]}]},{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/10\/shutterstock_1166248483.jpg","width":1000,"height":583},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/10\/new-research-into-datasets-reveals-systemic-ethical-and-legal-issues\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"New research into datasets reveals systemic ethical and legal issues"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Uw dagelijkse dosis 
AI-nieuws","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"nl-NL"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"nl-NL","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam is een wetenschap- en technologieschrijver die bij verschillende AI-startups heeft gewerkt. 
Als hij niet aan het schrijven is, leest hij medische tijdschriften of graaft hij door dozen met vinylplaten.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/nl\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/6804","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/comments?post=6804"}],"version-history":[{"count":11,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/6804\/revisions"}],"predecessor-version":[{"id":6837,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/posts\/6804\/revisions\/6837"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/media\/6806"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/media?parent=6804"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/categories?post=6804"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/nl\/wp-json\/wp\/v2\/tags?post=6804"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}