Bachelor's Thesis
Mejoras en el OCR Tesseract
Alternative title | Improvements for Tesseract OCR: Case analysis for automatic transcription of ancient books at the Library of the Universidad de Seville |
Author(s) | Bocanegra Linares, Jesús |
Advisor | Estepa Alonso, Rafael María |
Department | Universidad de Sevilla. Departamento de Ingeniería Telemática |
Publication date | 2016 |
Deposit date | 2016-11-23 |
Degree | Universidad de Sevilla. Grado en Ingeniería de las Tecnologías de Telecomunicación |
Abstract | The Library of the University of Seville owns a rich collection of antique works that are being scanned. To make this information easier to access, the Library needs the full text of these books. An automated transcription solution is sought because manual transcription is too time-consuming. To this end, several existing solutions are analyzed, ranging from long-established open-source programs to cutting-edge neural network technologies. Following this research, a concrete solution is provided to the Library, along with guidelines for its execution. |
Citation | Bocanegra Linares, J. (2016). Mejoras en el OCR Tesseract. (Trabajo fin de grado inédito). Universidad de Sevilla, Sevilla. |
Files | Size | Format | View | Description |
---|---|---|---|---|
Jesús Bocanegra - Mejoras en ... | 3.449Mb | [PDF] | View | |