Buscar
Mostrando ítems 1-3 de 3
Artículo
A clustering approach to extract data from HTML tables
(Elsevier, 2021)
HTML tables have become pervasive on the Web. Extracting their data automatically is difficult because finding the relationships between their cells is not trivial due to the many different layouts, encodings, and formats ...
Artículo
On exploring data lakes by finding compact, isolated clusters
(Elsevier, 2022)
Data engineers are very interested in data lake technologies due to the incredible abun dance of datasets. They typically use clustering to understand the structure of the datasets before applying other methods to infer ...
Artículo
A coral-reef approach to extract information from HTML tables
(Elsevier, 2022)
his article presents Coraline, which is a new table-understanding proposal. Its novelty lies in a coral-reef optimisation algorithm that addresses the problem of feature selection in synchrony with a clustering technique ...