Estepa Alonso, Rafael MaríaBocanegra Linares, Jesús2016-11-232016-11-232016Bocanegra Linares, J. (2016). Mejoras en el OCR Tesseract. (Trabajo fin de grado inédito). Universidad de Sevilla, Sevilla.http://hdl.handle.net/11441/49061The Library of the University of Seville owns a rich collection of antique works that are being scanned. In order to make the access to information easier, they need the full text of these books. An automated transcription solution is sought since the task is too time-consuming for humans. To achieve this, different existing solutions are analyzed, such as veteran open source programs and cutting-edge neural networks technologies. After the research, a concrete solution is provided to the Library, along with guidelines for its execution.application/pdfengAttribution-NonCommercial-NoDerivatives 4.0 Internacionalhttp://creativecommons.org/licenses/by-nc-nd/4.0/OCRMejoras en el OCR TesseractImprovements for Tesseract OCRCase analysis for automatic transcription of ancient books at the Library of the Universidad de Sevilleinfo:eu-repo/semantics/bachelorThesisinfo:eu-repo/semantics/openAccess