Search
Showing items 1-10 of 16
Article
TOMATE: A heuristic-based approach to extract data from HTML tables
(Elsevier, 2021)
Extracting data from user-friendly HTML tables is difficult because of their different layouts, formats, and encoding problems. In this article, we present a new proposal that first applies several pre-processing heuristics ...
Conference paper
A Novel Approach to Web Information Extraction
(Springer, 2015)
Business Intelligence requires the acquisition and aggregation of key pieces of knowledge from multiple sources in order to provide valuable information to customers. The Web is the largest source of information nowadays. ...
Article
ARIEX: Automated ranking of information extractors
(Elsevier, 2016)
Information extractors are used to transform the user-friendly information in a web document into structured information that can be used to feed a knowledge-based system. Researchers are interested in ranking them to ...
Article
A clustering approach to extract data from HTML tables
(Elsevier, 2021)
HTML tables have become pervasive on the Web. Extracting their data automatically is difficult because finding the relationships between their cells is not trivial due to the many different layouts, encodings, and formats ...
Conference paper
A Novel Approach to Web Information Extraction
(Springer International Publishing AG, 2015-06)
Business Intelligence requires the acquisition and aggregation of key pieces of knowledge from multiple sources in order to provide valuable information to customers. The Web is the largest source of information nowadays. ...
Conference paper
On improving FOIL Algorithm
(CSREA Press, 2011)
FOIL is an Inductive Logic Programming algorithm to discover first-order rules to explain the patterns involved in a domain of knowledge. Domains such as Information Retrieval or Information Extraction are handicaps for FOIL ...
Conference paper
Feeding Software Agents with Web Information
(Springer, 2015)
Many software agents require information that is available in web documents. Unfortunately, the existing proposals to learn extraction rules are tightly coupled with the learning component and do not result in resilient ...
Article
Roller: A novel approach to web information extraction
(Springer, 2016)
The research regarding web information extraction focuses on learning rules to extract some selected information from web documents. Many proposals are ad-hoc and cannot benefit from the advances in machine learning; ...
Conference paper
Integrating Deep-Web Information Sources
(Springer, 2010)
Deep-web information sources are difficult to integrate into automated business processes if they only provide a search form. A wrapping agent is a piece of software that allows a developer to query such information ...
Article
On Learning Web Information Extraction Rules with TANGO
(Elsevier, 2016)
The research on Enterprise Systems Integration focuses on proposals to support business processes by re-using existing systems. Wrappers help re-use web applications that provide a user interface only. They emulate a ...