{"id":3939,"date":"2023-08-07T18:18:57","date_gmt":"2023-08-07T18:18:57","guid":{"rendered":"https:\/\/dailyai.com\/?p=3939"},"modified":"2023-08-09T09:59:48","modified_gmt":"2023-08-09T09:59:48","slug":"openai-inconspicuously-unveils-its-own-data-scraper-gptbot","status":"publish","type":"post","link":"https:\/\/dailyai.com\/es\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/","title":{"rendered":"OpenAI presenta discretamente su propio raspador de datos, GPTBot"},"content":{"rendered":"<p><b>OpenAI present\u00f3 discretamente GPTBot, un raspador web dedicado a recopilar datos de entrenamiento.<\/b><\/p>\n<p><strong>Editar<\/strong>: Actualmente no est\u00e1 claro si GPTBot es el mismo\/actualizado bot que OpenAI utiliz\u00f3 para raspar datos junto con Common Crawl en 2018\/2019 o si se trata de una versi\u00f3n nueva\/evolucionada. En cualquier caso, esta es la primera vez que publican datos sobre c\u00f3mo evitar que rastree datos de sitios web.<\/p>\n<p><span style=\"font-weight: 400;\">OpenAI ha publicado informaci\u00f3n sobre GPTBot en su <\/span><a href=\"https:\/\/platform.openai.com\/docs\/gptbot\"><span style=\"font-weight: 400;\">sitio web<\/span><\/a><span style=\"font-weight: 400;\">incluyendo detalles sobre c\u00f3mo los administradores de sitios web pueden evitar que rastree y escarbe sus sitios web.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Para impedir que GPTBot rastree un sitio web, los administradores pueden ajustar la configuraci\u00f3n del archivo robots.txt. Este archivo, una herramienta est\u00e1ndar en la gesti\u00f3n de sitios web que data de hace unos 30 a\u00f1os, indica qu\u00e9 \u00e1reas del sitio web est\u00e1n fuera de los l\u00edmites de los rastreadores.\u00a0<\/span><\/p>\n<p>Para distinguir brevemente el rastreo del scraping, los rastreadores recorren el contenido del sitio web, mientras que los scrapers extraen los datos. Es un proceso que consta de dos partes, aunque normalmente ambas se denominan colectivamente \"scraping\".<\/p>\n<p><span style=\"font-weight: 400;\">OpenAI tambi\u00e9n revel\u00f3 el bloque de direcciones IP utilizado por GPTBot, <\/span><a href=\"https:\/\/openai.com\/gptbot-ranges.txt\"><span style=\"font-weight: 400;\">disponible aqu\u00ed<\/span><\/a><span style=\"font-weight: 400;\">, proporcionando otra opci\u00f3n para inhibir la actividad del bot.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Algunos especulan si esto proporciona a OpenAI otra capa de protecci\u00f3n contra acusaciones de uso no permitido de datos.<\/span><\/p>\n<p><span style=\"font-weight: 400;\"> OpenAI y otros desarrolladores de IA se <a href=\"https:\/\/dailyai.com\/es\/2023\/08\/inside-the-battle-between-artists-and-ai-image-generators\/\">abrumado por las demandas<\/a> en relaci\u00f3n con la forma en que utilizaron los datos de las personas sin su permiso.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ahora, los administradores de los sitios web deben evitar de forma proactiva que sus sitios sean rastreados para obtener datos de entrenamiento, lo que les obliga a impedir que los datos de sus sitios acaben en los conjuntos de datos de entrenamiento.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Cabe se\u00f1alar que GPTBot no es la \u00fanica herramienta de este tipo. OpenAI ha utilizado otros conjuntos de datos para entrenar sus modelos, incluido el conjunto de datos Common Crawl.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Al igual que GPTBot, el rastreador CCBot tambi\u00e9n puede controlarse a\u00f1adiendo l\u00edneas de c\u00f3digo espec\u00edficas en el archivo robots.txt.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">C\u00f3mo evitar que ChatGPT rastree los datos de su sitio web<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">OpenAI utilizar\u00e1 GPTBot para el rastreo selectivo de datos, pero se puede impedir que rastree sitios web enteros o p\u00e1ginas web espec\u00edficas. Lea el <\/span><a href=\"https:\/\/platform.openai.com\/docs\/gptbot\"><span style=\"font-weight: 400;\">documentaci\u00f3n completa aqu\u00ed<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">OpenAI public\u00f3 la siguiente informaci\u00f3n:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">GPTBot se identifica por su token de agente de usuario \"GPTBot\". La cadena completa de agente de usuario asociada a \u00e9l es: \"Mozilla\/5.0 AppleWebKit\/537.36 (KHTML, como Gecko; compatible; GPTBot\/1.0; +https:\/\/openai.com\/gptbot)\".<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Mediante la edici\u00f3n del archivo robots.txt, se puede bloquear el acceso de GPTBot a un sitio web completo o a partes seleccionadas.\u00a0<\/span><\/p>\n<p><strong>Para impedir que GPTBot acceda a un sitio, los administradores pueden editar el archivo robots.txt de su sitio web como se indica a continuaci\u00f3n:<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">Agente de usuario: GPTBot<\/span><\/p>\n<p><span style=\"font-weight: 400;\">No permitir: \/<\/span><\/p>\n<p><strong>Se pueden permitir\/prohibir partes de sitios web de la siguiente manera:<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">Agente de usuario: GPTBot<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Permitir: \/directorio-1\/<\/span><\/p>\n<p><span style=\"font-weight: 400;\">No permitir: \/directorio-2\/<\/span><\/p>\n<p><span style=\"font-weight: 400;\">OpenAI tambi\u00e9n ha hecho p\u00fablicos los rangos de IP utilizados por GPTBot <\/span><a href=\"https:\/\/openai.com\/gptbot-ranges.txt\"><span style=\"font-weight: 400;\">disponible aqu\u00ed<\/span><\/a><span style=\"font-weight: 400;\">. Aunque s\u00f3lo se ha incluido una gama, es posible que se a\u00f1adan m\u00e1s a su debido tiempo.<\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>OpenAI revel\u00f3 discretamente GPTBot, un raspador web dedicado para recopilar datos de entrenamiento. Edici\u00f3n: Actualmente no est\u00e1 claro si GPTBot es el mismo bot actualizado que OpenAI utiliz\u00f3 para recopilar datos junto con Common Crawl en 2018\/2019 o si se trata de una versi\u00f3n nueva\/evolucionada. De cualquier manera, esta es la primera vez que publican datos sobre c\u00f3mo evitar que raspe datos de sitios web. OpenAI ha publicado informaci\u00f3n sobre GPTBot en su sitio web aqu\u00ed, incluyendo detalles sobre c\u00f3mo los administradores de sitios web pueden evitar que rastree y raspe sus sitios web.  Para impedir que GPTBot rastree un sitio web, los administradores pueden ajustar la configuraci\u00f3n del archivo robots.txt.<\/p>","protected":false},"author":2,"featured_media":3940,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[84],"tags":[115,238,93],"class_list":["post-3939","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-industry","tag-chatgpt","tag-data-scraping","tag-openai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>OpenAI inconspicuously unveils its own data scraper, GPTBot | DailyAI<\/title>\n<meta name=\"description\" content=\"OpenAI discretely unveiled GPTBot, a dedicated web crawler.OpenAI discretely unveiled GPTBot, a dedicated web scraper for collecting training data.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dailyai.com\/es\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/\" \/>\n<meta property=\"og:locale\" content=\"es_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"OpenAI inconspicuously unveils its own data scraper, GPTBot | DailyAI\" \/>\n<meta property=\"og:description\" content=\"OpenAI discretely unveiled GPTBot, a dedicated web crawler.OpenAI discretely unveiled GPTBot, a dedicated web scraper for collecting training data.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dailyai.com\/es\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/\" \/>\n<meta property=\"og:site_name\" content=\"DailyAI\" \/>\n<meta property=\"article:published_time\" content=\"2023-08-07T18:18:57+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-08-09T09:59:48+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/shutterstock_2283461521-1.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"667\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Sam Jeans\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:site\" content=\"@DailyAIOfficial\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sam Jeans\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tiempo de lectura\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/\"},\"author\":{\"name\":\"Sam Jeans\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\"},\"headline\":\"OpenAI inconspicuously unveils its own data scraper, GPTBot\",\"datePublished\":\"2023-08-07T18:18:57+00:00\",\"dateModified\":\"2023-08-09T09:59:48+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/\"},\"wordCount\":455,\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/08\\\/shutterstock_2283461521-1.jpg\",\"keywords\":[\"ChatGPT\",\"Data scraping\",\"OpenAI\"],\"articleSection\":[\"Industry\"],\"inLanguage\":\"es\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/\",\"name\":\"OpenAI inconspicuously unveils its own data scraper, GPTBot | DailyAI\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/08\\\/shutterstock_2283461521-1.jpg\",\"datePublished\":\"2023-08-07T18:18:57+00:00\",\"dateModified\":\"2023-08-09T09:59:48+00:00\",\"description\":\"OpenAI discretely unveiled GPTBot, a dedicated web crawler.OpenAI discretely unveiled GPTBot, a dedicated web scraper for collecting training data.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#breadcrumb\"},\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#primaryimage\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/08\\\/shutterstock_2283461521-1.jpg\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/08\\\/shutterstock_2283461521-1.jpg\",\"width\":1000,\"height\":667,\"caption\":\"OpenAI GPTBot\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/2023\\\/08\\\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dailyai.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"OpenAI inconspicuously unveils its own data scraper, GPTBot\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#website\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"name\":\"DailyAI\",\"description\":\"Your Daily Dose of AI News\",\"publisher\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dailyai.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"es\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#organization\",\"name\":\"DailyAI\",\"url\":\"https:\\\/\\\/dailyai.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"contentUrl\":\"https:\\\/\\\/dailyai.com\\\/wp-content\\\/uploads\\\/2023\\\/06\\\/Daily-Ai_TL_colour.png\",\"width\":4501,\"height\":934,\"caption\":\"DailyAI\"},\"image\":{\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/x.com\\\/DailyAIOfficial\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/dailyaiofficial\\\/\",\"https:\\\/\\\/www.youtube.com\\\/@DailyAIOfficial\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/dailyai.com\\\/#\\\/schema\\\/person\\\/711e81f945549438e8bbc579efdeb3c9\",\"name\":\"Sam Jeans\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g\",\"caption\":\"Sam Jeans\"},\"description\":\"Sam is a science and technology writer who has worked in various AI startups. When he\u2019s not writing, he can be found reading medical journals or digging through boxes of vinyl records.\",\"sameAs\":[\"https:\\\/\\\/www.linkedin.com\\\/in\\\/sam-jeans-6746b9142\\\/\"],\"url\":\"https:\\\/\\\/dailyai.com\\\/es\\\/author\\\/samjeans\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"OpenAI presenta discretamente su propio rascador de datos, GPTBot | DailyAI","description":"OpenAI desvel\u00f3 discretamente GPTBot, un rastreador web dedicado.OpenAI desvel\u00f3 discretamente GPTBot, un rastreador web dedicado para recopilar datos de entrenamiento.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dailyai.com\/es\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/","og_locale":"es_ES","og_type":"article","og_title":"OpenAI inconspicuously unveils its own data scraper, GPTBot | DailyAI","og_description":"OpenAI discretely unveiled GPTBot, a dedicated web crawler.OpenAI discretely unveiled GPTBot, a dedicated web scraper for collecting training data.","og_url":"https:\/\/dailyai.com\/es\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/","og_site_name":"DailyAI","article_published_time":"2023-08-07T18:18:57+00:00","article_modified_time":"2023-08-09T09:59:48+00:00","og_image":[{"width":1000,"height":667,"url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/shutterstock_2283461521-1.jpg","type":"image\/jpeg"}],"author":"Sam Jeans","twitter_card":"summary_large_image","twitter_creator":"@DailyAIOfficial","twitter_site":"@DailyAIOfficial","twitter_misc":{"Escrito por":"Sam Jeans","Tiempo de lectura":"2 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#article","isPartOf":{"@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/"},"author":{"name":"Sam Jeans","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9"},"headline":"OpenAI inconspicuously unveils its own data scraper, GPTBot","datePublished":"2023-08-07T18:18:57+00:00","dateModified":"2023-08-09T09:59:48+00:00","mainEntityOfPage":{"@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/"},"wordCount":455,"publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"image":{"@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/shutterstock_2283461521-1.jpg","keywords":["ChatGPT","Data scraping","OpenAI"],"articleSection":["Industry"],"inLanguage":"es"},{"@type":"WebPage","@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/","url":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/","name":"OpenAI presenta discretamente su propio rascador de datos, GPTBot | DailyAI","isPartOf":{"@id":"https:\/\/dailyai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#primaryimage"},"image":{"@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#primaryimage"},"thumbnailUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/shutterstock_2283461521-1.jpg","datePublished":"2023-08-07T18:18:57+00:00","dateModified":"2023-08-09T09:59:48+00:00","description":"OpenAI desvel\u00f3 discretamente GPTBot, un rastreador web dedicado.OpenAI desvel\u00f3 discretamente GPTBot, un rastreador web dedicado para recopilar datos de entrenamiento.","breadcrumb":{"@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#breadcrumb"},"inLanguage":"es","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/"]}]},{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#primaryimage","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/shutterstock_2283461521-1.jpg","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/shutterstock_2283461521-1.jpg","width":1000,"height":667,"caption":"OpenAI GPTBot"},{"@type":"BreadcrumbList","@id":"https:\/\/dailyai.com\/2023\/08\/openai-inconspicuously-unveils-its-own-data-scraper-gptbot\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dailyai.com\/"},{"@type":"ListItem","position":2,"name":"OpenAI inconspicuously unveils its own data scraper, GPTBot"}]},{"@type":"WebSite","@id":"https:\/\/dailyai.com\/#website","url":"https:\/\/dailyai.com\/","name":"DailyAI","description":"Su dosis diaria de noticias sobre IA","publisher":{"@id":"https:\/\/dailyai.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dailyai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"es"},{"@type":"Organization","@id":"https:\/\/dailyai.com\/#organization","name":"DailyAI","url":"https:\/\/dailyai.com\/","logo":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/","url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","contentUrl":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/06\/Daily-Ai_TL_colour.png","width":4501,"height":934,"caption":"DailyAI"},"image":{"@id":"https:\/\/dailyai.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/x.com\/DailyAIOfficial","https:\/\/www.linkedin.com\/company\/dailyaiofficial\/","https:\/\/www.youtube.com\/@DailyAIOfficial"]},{"@type":"Person","@id":"https:\/\/dailyai.com\/#\/schema\/person\/711e81f945549438e8bbc579efdeb3c9","name":"Sam Jeans","image":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a24a4a8f8e2a1a275b7491dc9c9f032c401eabf23c3206da4628dc84b6dac5c8?s=96&d=robohash&r=g","caption":"Sam Jeans"},"description":"Sam es un escritor de ciencia y tecnolog\u00eda que ha trabajado en varias startups de IA. Cuando no est\u00e1 escribiendo, se le puede encontrar leyendo revistas m\u00e9dicas o rebuscando en cajas de discos de vinilo.","sameAs":["https:\/\/www.linkedin.com\/in\/sam-jeans-6746b9142\/"],"url":"https:\/\/dailyai.com\/es\/author\/samjeans\/"}]}},"_links":{"self":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/3939","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/comments?post=3939"}],"version-history":[{"count":6,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/3939\/revisions"}],"predecessor-version":[{"id":3992,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/posts\/3939\/revisions\/3992"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/media\/3940"}],"wp:attachment":[{"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/media?parent=3939"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/categories?post=3939"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailyai.com\/es\/wp-json\/wp\/v2\/tags?post=3939"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}