Name: Jiménez Aguirre, Patricia
Department: Lenguajes y Sistemas Informáticos
Knowledge area: Lenguajes y Sistemas Informáticos
Professional category: Profesora Titular de Universidad
  • No. publications: 24
  • No. visits: 1262
  • No. downloads: 2387

Article
A coral-reef approach to extract information from HTML tables

Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Elsevier, 2022)
This article presents Coraline, which is a new table-understanding proposal. Its novelty lies in a coral-reef optimisation ...
Article
On validating web information extraction proposals

Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Elsevier, 2022)
Many people who have to make informed decisions in today’s always-on culture use information extractors to feed their ...
Article
On exploring data lakes by finding compact, isolated clusters

Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Elsevier, 2022)
Data engineers are very interested in data lake technologies due to the incredible abundance of datasets. They typically ...
Article
A hybrid quantum approach to leveraging data from HTML tables

Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Springer, 2022)
The Web provides many data that are encoded using HTML tables. This facilitates rendering them, but obfuscates their ...
Article
TOMATE: A heuristic-based approach to extract data from HTML tables

Roldán Salvador, Juan Carlos; Jiménez Aguirre, Patricia; Szekely, Pedro; Corchuelo Gil, Rafael (Elsevier, 2021)
Extracting data from user-friendly HTML tables is difficult because of their different layouts, formats, and encoding ...
Article
A clustering approach to extract data from HTML tables

Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Corchuelo Gil, Rafael (Elsevier, 2021)
HTML tables have become pervasive on the Web. Extracting their data automatically is difficult because finding the ...
PhD Thesis
Enterprise Data Integration: On Extracting Data from HTML Tables

Corchuelo Gil, Rafael; Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos (2020)
The Web is a universal communication channel that provides a vast amount of valuable data about a plethora of topics. In ...
Article
On Extracting Data from Tables that are Encoded using HTML

Roldán Salvador, Juan Carlos; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Elsevier, 2020)
Tables are a common means to display data in human-friendly formats. Many authors have worked on proposals to extract those ...
Article
On the synthesis of metadata tags for HTML files

Jiménez Aguirre, Patricia; Roldán Salvador, Juan Carlos; Gallego, Fernando O.; Corchuelo Gil, Rafael (Wiley, 2020)
RDFa, JSON-LD, Microdata, and Microformats allow to endow the data in HTML files with metadata tags that help software ...
Presentation
Extracting Web Information using Representation Patterns

Roldán Salvador, Juan Carlos; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Association for Computing Machinery (ACM), 2017)
Feeding decision support systems with Web information typically requires sifting through an unwieldy amount of information ...
Article
Roller: A novel approach to web information extraction

Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer, 2016)
The research regarding web information extraction focuses on learning rules to extract some selected information from web ...
Article
ARIEX: Automated ranking of information extractors

Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael; Sleiman, Hassan A. (Elsevier, 2016)
Information extractors are used to transform the user-friendly information in a web document into structured information ...
Presentation
Una Experiencia para mejorar la interacción estudiante-profesor

Müller Cejás, Carlos; Salmerón, Inmaculada; Jiménez Aguirre, Patricia; Trinidad Martín Arroyo, Pablo (AENUI: Asociación de Enseñantes Universitarios de Informática, 2016)
In courses that involve assessed projects or deliverables, students tend to flood the mailboxes of ...
Article
On Learning Web Information Extraction Rules with TANGO

Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Elsevier, 2016)
The research on Enterprise Systems Integration focuses on proposals to support business processes by re-using existing ...
PhD Thesis
Enterprise Information Integration: New Approaches to Web Information Extraction

Corchuelo Gil, Rafael; Jiménez Aguirre, Patricia (2015)
The way we understand information has changed radically in recent decades thanks to the Web, which drives ...
Presentation
A Novel Approach to Web Information Extraction

Reina Quintero, Antonia María; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer International Publishing AG, 2015)
Business Intelligence requires the acquisition and aggregation of key pieces of knowledge from multiple sources in order ...
Presentation
A Novel Approach to Web Information Extraction

Reina Quintero, Antonia María; Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer, 2015)
Business Intelligence requires the acquisition and aggregation of key pieces of knowledge from multiple sources in order ...
Presentation
On Extracting Information from Semi-structured Deep Web Documents

Jiménez Aguirre, Patricia; Corchuelo Gil, Rafael (Springer, 2015)
Some software agents need information that is provided by some web sites, which is difficult if they lack a query API. ...
Presentation
Feeding Software Agents with Web Information

Jiménez Aguirre, Patricia; Sleiman, Hassan A.; Corchuelo Gil, Rafael (Springer, 2015)
Many software agents require information that is available in web documents. Unfortunately, the existing proposals to learn ...
Presentation
On Member Labelling in Social Networks

Corchuelo Gil, Rafael; Reina Quintero, Antonia María; Jiménez Aguirre, Patricia (Springer, 2015)
Software agents are increasingly used to search for experts, recommend resources, assess opinions, and other similar tasks ...
Presentation
Optimising FOIL by new scoring functions

Jiménez Aguirre, Patricia; Arjona, José L.; Álvarez, J.L. (Asociación de Ingeniería del Software y Tecnologías de Desarrollo de Software (SISTEDES), 2011)
FOIL is an Inductive Logic Programming Algorithm to discover first order rules to explain the patterns involved in a ...
Presentation
On improving FOIL Algorithm

Jiménez Aguirre, Patricia; Arjona, José L.; Álvarez, J.L. (CSREA Press, 2011)
FOIL is an Inductive Logic Programming Algorithm to discover first order rules to explain the patterns involved in a domain ...
Presentation
Mining Web Pages Using Features of Rendering HTML Elements in the Web Browser

Fernández, F. J.; Álvarez, José L.; Abad, Pedro J.; Jiménez Aguirre, Patricia (Springer, 2011)
The Web is the largest repository of useful information available for human users, but it is usual that Web Pages do not ...
Presentation
Integrating Deep-Web Information Sources

Fernández de Viana, Iñaki; Hernández Salmerón, Inmaculada Concepción; Jiménez Aguirre, Patricia; Rivero, Carlos R.; Sleiman, Hassan A. (Springer, 2010)
Deep-web information sources are difficult to integrate into automated business processes if they only provide a search ...