{"id":13166,"date":"2024-06-30T15:55:14","date_gmt":"2024-06-30T15:55:14","guid":{"rendered":"https:\/\/dailyai.com\/?p=13166"},"modified":"2024-07-01T11:13:57","modified_gmt":"2024-07-01T11:13:57","slug":"perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse","status":"publish","type":"post","link":"https:\/\/dailyai.com\/fr\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/","title":{"rendered":"L'IA Perplexity au c\u0153ur d'une controverse sur des all\u00e9gations d'utilisation abusive du web scraping"},"content":{"rendered":"<p><b>Perplexity AI s'est retrouv\u00e9e au centre d'une temp\u00eate de feu \u00e0 propos de ses pratiques en mati\u00e8re de collecte de donn\u00e9es.\u00a0<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Perplexity fusionne essentiellement un moteur de recherche avec l'IA g\u00e9n\u00e9rative, renvoyant un contenu g\u00e9n\u00e9r\u00e9 par l'IA en rapport avec la requ\u00eate de l'utilisateur.\u00a0\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Les processus permettant cela impliquent probablement de r\u00e9cup\u00e9rer du contenu sur de nombreux sites web, y compris ceux qui l'interdisent explicitement.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Le scandale a \u00e9clat\u00e9 le 11 juin lorsque <\/span><a href=\"https:\/\/www.forbes.com\/sites\/sarahemerson\/2024\/06\/07\/buzzy-ai-search-engine-perplexity-is-directly-ripping-off-content-from-news-outlets\/\"><span style=\"font-weight: 400;\">Forbes a rapport\u00e9<\/span><\/a><span style=\"font-weight: 400;\"> que Perplexity avait repris un article entier de son site, avec des illustrations personnalis\u00e9es, et l'avait r\u00e9\u00e9dit\u00e9 avec un minimum d'attribution.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Peu de temps apr\u00e8s, WIRED <\/span><a href=\"https:\/\/www.wired.com\/story\/aws-perplexity-bot-scraping-investigation\/\" target=\"_blank\" rel=\"noopener\"><span 
style=\"font-weight: 400;\">a men\u00e9 une enqu\u00eate<\/span><\/a><span style=\"font-weight: 400;\"> qui a r\u00e9v\u00e9l\u00e9 que Perplexity r\u00e9cup\u00e9rait du contenu sur des sites web qui interdisent la collecte automatis\u00e9e de donn\u00e9es.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Un site web peut demander que son contenu ne soit pas collect\u00e9 par des robots d'indexation au moyen d'un fichier appel\u00e9 \"robots.txt\". <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ce protocole d'exclusion s'adresse aux robots d'indexation et aux autres programmes automatis\u00e9s. Il s'agit d'un simple fichier texte plac\u00e9 sur le serveur d'un site web qui sp\u00e9cifie les pages ou les sections du site qui ne doivent pas \u00eatre consult\u00e9es ni explor\u00e9es.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Le fichier robots.txt est une convention largement respect\u00e9e depuis les d\u00e9buts du web. Il aide les propri\u00e9taires de sites web \u00e0 contr\u00f4ler leur contenu et \u00e0 emp\u00eacher la collecte non autoris\u00e9e de donn\u00e9es. 
<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Bien qu'elle ne soit pas juridiquement contraignante, on consid\u00e8re depuis longtemps que les robots d'indexation doivent suivre les instructions figurant dans le fichier robots.txt d'un site web.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Jason Kint, directeur g\u00e9n\u00e9ral de <\/span><a href=\"https:\/\/digitalcontentnext.org\/\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">Digital Content Next<\/span><\/a><span style=\"font-weight: 400;\">, un groupe professionnel repr\u00e9sentant les \u00e9diteurs en ligne, n'a pas m\u00e2ch\u00e9 ses mots dans son \u00e9valuation des proc\u00e9d\u00e9s de \"web scraping\" de Perplexity.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\"Par d\u00e9faut, les entreprises d'IA devraient consid\u00e9rer qu'elles n'ont pas le droit de prendre et de r\u00e9utiliser le contenu des \u00e9diteurs sans autorisation\", a-t-il d\u00e9clar\u00e9.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\"Si Perplexity contourne les conditions d'utilisation ou le fichier robots.txt, tous les signaux d'alarme devraient se d\u00e9clencher et indiquer qu'il se passe quelque chose d'inappropri\u00e9.\"<\/span><\/p>\n<h2>Amazon enqu\u00eate<\/h2>\n<p><span style=\"font-weight: 400;\">Ces r\u00e9v\u00e9lations ont incit\u00e9 Amazon Web Services (AWS), qui h\u00e9berge un serveur impliqu\u00e9 dans les all\u00e9gations de \"scraping\" abusif de Perplexity, \u00e0 ouvrir une enqu\u00eate.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">AWS interdit strictement \u00e0 ses clients de se livrer \u00e0 des activit\u00e9s abusives ou ill\u00e9gales qui violent ses conditions de service.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Le PDG de Perplexity, Aravind Srinivas, a d'abord balay\u00e9 les inqui\u00e9tudes, affirmant qu'elles refl\u00e9taient \"une incompr\u00e9hension profonde et fondamentale\" des activit\u00e9s de l'entreprise et de 
l'internet en g\u00e9n\u00e9ral.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Toutefois, dans une <\/span><a href=\"https:\/\/www.fastcompany.com\/91144894\/perplexity-ai-ceo-aravind-srinivas-on-plagiarism-accusations\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">interview accord\u00e9e \u00e0 Fast Company<\/span><\/a><span style=\"font-weight: 400;\">, il a admis que Perplexity s'appuyait sur un fournisseur tiers non nomm\u00e9 pour l'exploration et l'indexation des sites web, laissant entendre que ce fournisseur serait responsable de toute violation du fichier robots.txt.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">M. Srinivas a refus\u00e9 d'identifier l'entreprise, invoquant un accord de non-divulgation.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Pour l'instant, Perplexity semble d\u00e9termin\u00e9e \u00e0 r\u00e9sister \u00e0 la temp\u00eate, un porte-parole qualifiant l'enqu\u00eate d'AWS de \"proc\u00e9dure standard\" et indiquant que l'entreprise n'a rien chang\u00e9 \u00e0 ses activit\u00e9s.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Toutefois, la position de d\u00e9fi de la startup pourrait s'av\u00e9rer intenable \u00e0 mesure que la vague d'inqui\u00e9tude concernant les pratiques de l'IA en mati\u00e8re de donn\u00e9es continue de s'amplifier.<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Perplexity AI s'est retrouv\u00e9e au centre d'une temp\u00eate de feu \u00e0 propos de ses pratiques de collecte de donn\u00e9es.  Perplexity fusionne essentiellement un moteur de recherche avec l'IA g\u00e9n\u00e9rative, renvoyant un contenu g\u00e9n\u00e9r\u00e9 par l'IA en rapport avec la requ\u00eate de l'utilisateur.   Les processus permettant cela impliquent probablement de r\u00e9cup\u00e9rer du contenu sur de nombreux sites web, y compris ceux qui l'interdisent explicitement.  
Le scandale a \u00e9clat\u00e9 le 11 juin lorsque Forbes a rapport\u00e9 que Perplexity avait repris un article entier de son site, avec des illustrations personnalis\u00e9es, et l'avait r\u00e9utilis\u00e9 avec un minimum d'attribution.  Peu de temps apr\u00e8s, WIRED a men\u00e9 une enqu\u00eate qui a r\u00e9v\u00e9l\u00e9 que Perplexity avait r\u00e9cup\u00e9r\u00e9 du contenu sur des sites web qui interdisent la collecte automatis\u00e9e de donn\u00e9es.<\/p>","protected":false},"author":2,"featured_media":13167,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[238,105],"class_list":["post-13166","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-data-scraping","tag-machine-learning"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Perplexity AI embroiled in controversy over alleged web scraping abuse | DailyAI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/fr\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Perplexity AI embroiled in controversy over alleged web scraping abuse | DailyAI\" \/>\n<meta property=\"og:description\" content=\"Perplexity AI has found itself at the center of a firestorm over its data collection practices.\u00a0 Perplexity essentially fuses a search engine with generative AI, returning AI-generated content related to the user&#8217;s search query.\u00a0\u00a0 The processes enabling this likely involve scraping content from numerous websites, including those that explicitly 
prohibit it.\u00a0 The scandal erupted on June 11 when Forbes reported that Perplexity had lifted an entire article from its site, complete with custom illustrations, and repurposed it with only minimal attribution.\u00a0 Not long after, WIRED conducted an investigation that uncovered evidence of Perplexity scraping content from websites that forbid automated\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/fr\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2024-06-30T15:55:14+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-07-01T11:13:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1792\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" 
class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"Perplexity AI embroiled in controversy over alleged web scraping abuse\",\"datePublished\":\"2024-06-30T15:55:14+00:00\",\"dateModified\":\"2024-07-01T11:13:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/\"},\"wordCount\":457,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp\",\"keywords\":[\"Data scraping\",\"machine learning\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"fr-FR\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/\",\"name\":\"Perplexity AI embroiled in controversy over alleged web scraping abuse | 
DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp\",\"datePublished\":\"2024-06-30T15:55:14+00:00\",\"dateModified\":\"2024-07-01T11:13:57+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2024\\\/06\\\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp\",\"width\":1792,\"height\":1024,\"caption\":\"perplexity\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\
/2024\\\/06\\\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Perplexity AI embroiled in controversy over alleged web scraping abuse\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam 
Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/fr\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"L'IA Perplexity au c\u0153ur d'une controverse sur des all\u00e9gations d'abus de web scraping | DailyAI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/fr\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/","og_locale":"fr_FR","og_type":"article","og_title":"Perplexity AI embroiled in controversy over alleged web scraping abuse | DailyAI","og_description":"Perplexity AI has found itself at the center of a firestorm over its data collection practices.\u00a0 Perplexity essentially fuses a search engine with generative AI, returning AI-generated content related to the user&#8217;s search query.\u00a0\u00a0 The processes enabling this likely involve scraping content from numerous websites, including those that explicitly prohibit it.\u00a0 The scandal erupted on June 11 when Forbes reported that Perplexity had lifted an entire 
article from its site, complete with custom illustrations, and repurposed it with only minimal attribution.\u00a0 Not long after, WIRED conducted an investigation that uncovered evidence of Perplexity scraping content from websites that forbid automated","og_url":"https:\/\/dailyai.com\/fr\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/","og_site_name":"DailyAI","article_published_time":"2024-06-30T15:55:14+00:00","article_modified_time":"2024-07-01T11:13:57+00:00","og_image":[{"width":1792,"height":1024,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp","type":"image\/webp"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"\u00c9crit par":"Sam Jeans","Dur\u00e9e de lecture estim\u00e9e":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"Perplexity AI embroiled in controversy over alleged web scraping 
abuse","datePublished":"2024-06-30T15:55:14+00:00","dateModified":"2024-07-01T11:13:57+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/"},"wordCount":457,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp","keywords":["Data scraping","machine learning"],"articleSection":["Industry"],"inLanguage":"fr-FR"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/","url":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/","name":"L'IA Perplexity au c\u0153ur d'une controverse sur des all\u00e9gations d'abus de web scraping | 
DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp","datePublished":"2024-06-30T15:55:14+00:00","dateModified":"2024-07-01T11:13:57+00:00","breadcrumb":{"@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/"]}]},{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2024\/06\/DALL\u00b7E-2024-06-30-16.53.47-A-high-tech-office-with-computers-and-servers-in-the-background.-The-screens-display-warning-symbols-and-red-alert-icons-about-web-scraping-without-a.webp","width":1792,"height":1024,"caption":"perplexity"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2024\/06\/perplexity-ai-embroiled-in-controversy-over-alleged-web-scraping-abuse\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@t
ype":"ListItem","position":2,"name":"Perplexity AI embroiled in controversy over alleged web scraping abuse"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Votre dose quotidienne de nouvelles sur l'IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam est un r\u00e9dacteur scientifique et technologique qui a travaill\u00e9 dans diverses start-ups 
sp\u00e9cialis\u00e9es dans l'IA. Lorsqu'il n'\u00e9crit pas, on peut le trouver en train de lire des revues m\u00e9dicales ou de fouiller dans des bo\u00eetes de disques vinyles.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/fr\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/13166","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/comments?post=13166"}],"version-history":[{"count":4,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/13166\/revisions"}],"predecessor-version":[{"id":13178,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/posts\/13166\/revisions\/13178"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media\/13167"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/media?parent=13166"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/categories?post=13166"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/fr\/wp-json\/wp\/v2\/tags?post=13166"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}