{"version":"1.0","provider_name":"DagligAI","provider_url":"https:\/\/dailyai.com\/nb","author_name":"Eugene van der Watt","author_url":"https:\/\/dailyai.com\/nb\/author\/eugene\/","title":"Meta\u2019s Nougat makes scientific texts machine-readable | DailyAI","type":"rich","width":600,"height":338,"html":"<blockquote class=\"wp-embedded-content\" data-secret=\"yF347Jogr5\"><a href=\"https:\/\/dailyai.com\/nb\/2023\/08\/metas-nougat-makes-scientific-texts-machine-readable\/\">Metas Nougat gj\u00f8r vitenskapelige tekster maskinlesbare<\/a><\/blockquote><iframe sandbox=\"allow-scripts\" security=\"restricted\" src=\"https:\/\/dailyai.com\/nb\/2023\/08\/metas-nougat-makes-scientific-texts-machine-readable\/embed\/#?secret=yF347Jogr5\" width=\"600\" height=\"338\" title=\"&quot;Metas Nougat gj\u00f8r vitenskapelige tekster maskinlesbare&quot; - DailyAI\" data-secret=\"yF347Jogr5\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" class=\"wp-embedded-content\"><\/iframe><script>\n\/*! This file is auto-generated *\/\n!function(d,l){\"use strict\";l.querySelector&&d.addEventListener&&\"undefined\"!=typeof URL&&(d.wp=d.wp||{},d.wp.receiveEmbedMessage||(d.wp.receiveEmbedMessage=function(e){var t=e.data;if((t||t.secret||t.message||t.value)&&!\/[^a-zA-Z0-9]\/.test(t.secret)){for(var s,r,n,a=l.querySelectorAll('iframe[data-secret=\"'+t.secret+'\"]'),o=l.querySelectorAll('blockquote[data-secret=\"'+t.secret+'\"]'),c=new RegExp(\"^https?:$\",\"i\"),i=0;i<o.length;i++)o[i].style.display=\"none\";for(i=0;i<a.length;i++)s=a[i],e.source===s.contentWindow&&(s.removeAttribute(\"style\"),\"height\"===t.message?(1e3<(r=parseInt(t.value,10))?r=1e3:~~r<200&&(r=200),s.height=r):\"link\"===t.message&&(r=new URL(s.getAttribute(\"src\")),n=new URL(t.value),c.test(n.protocol))&&n.host===r.host&&l.activeElement===s&&(d.top.location.href=t.value))}},d.addEventListener(\"message\",d.wp.receiveEmbedMessage,!1),l.addEventListener(\"DOMContentLoaded\",function(){for(var e,t,s=l.querySelectorAll(\"iframe.wp-embedded-content\"),r=0;r<s.length;r++)(t=(e=s[r]).getAttribute(\"data-secret\"))||(t=Math.random().toString(36).substring(2,12),e.src+=\"#?secret=\"+t,e.setAttribute(\"data-secret\",t)),e.contentWindow.postMessage({message:\"ready\",secret:t},\"*\")},!1)))}(window,document);\n\/\/# sourceURL=https:\/\/dailyai.com\/wp-includes\/js\/wp-embed.min.js\n<\/script>","thumbnail_url":"https:\/\/dailyai.com\/wp-content\/uploads\/2023\/08\/Scientific-research-paper.jpg","thumbnail_width":1000,"thumbnail_height":750,"description":"Meta has developed a new AI model called Nougat that can reliably turn scientific text into machine-readable text. If you\u2019ve ever tried to read a scientific research paper then you begin to understand why it\u2019s tough for it to be processed electronically. Current Optical Character Recognition (OCR) tools parse text line by line. That\u2019s fine for purely text-based documents but scientific papers add a level of complexity that these standard tools can\u2019t deal with.\u00a0 Scientific papers include mathematical and scientific symbols and formulas that are often added as subscripts or superscripts. Even the best OCRs have trouble capturing these properly."}