Showing items 1-2 of 2
Conference paper
An Unsupervised Technique to Extract Information from Semi-structured Web Pages
(Springer, 2012-11)
We propose a technique that takes two or more web pages generated by the same server-side template and tries to learn a regular expression that represents it and helps extract relevant information from similar pages. Our ...
Article
Trinity: On Using Trinary Trees for Unsupervised Web Data Extraction
(IEEE Xplore, 2014-06)
Web data extractors are used to extract data from web documents in order to feed automated processes. In this article, we propose a technique that works on two or more web documents generated by the same server-side template ...