dc.creator | Sleiman, Hassan A. | es |
dc.creator | Corchuelo Gil, Rafael | es |
dc.date.accessioned | 2023-03-15T08:05:42Z | |
dc.date.available | 2023-03-15T08:05:42Z | |
dc.date.issued | 2012-06 | |
dc.identifier.citation | Sleiman, H.A. y Corchuelo Gil, R. (2012). A Reference Architecture to Devise Web Information Extractors. En CAiSE 2012: Advanced Information Systems Engineering Workshops (235-248), Gdańsk (Polonia): SpringerLink. | |
dc.identifier.isbn | 978-3-642-31068-3 (impreso) | es |
dc.identifier.isbn | 978-3-642-31069-0 (online) | es |
dc.identifier.uri | https://hdl.handle.net/11441/143377 | |
dc.description.abstract | The Web is the largest repository of human-friendly information. Unfortunately, web information is embedded in formatting tags and is surrounded by irrelevant information. Researchers are working on information extractors that allow transforming this information into
structured data for its later integration into automated processes. Devising a new information extraction technique requires an array of tasks that are specific to this technique and many tasks that are actually common between all techniques. The lack of a reference architectural proposal in the literature to guide software engineers in the design and implementation of information extractors, amounts to little reuse and the focus is usually blurred because of irrelevant details. In this paper, we present a reference architecture to design and implement rule learners for information extractors. We have implemented a software framework to support our architecture, and we have validated it by means of four case studies and a number of experiments that prove that our proposal helps reduce development costs significantly. | es |
dc.description.sponsorship | Ministerio de Educación y Ciencia TIN2007-64119 | es |
dc.description.sponsorship | Junta de Andalucía P07-TIC-2602 | es |
dc.description.sponsorship | Junta de Andalucía P08-TIC-4100 | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2008-04718-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-21744 | es |
dc.description.sponsorship | Ministerio de Economía, Industria y Competitividad TIN2010-09809-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-10811-E | es |
dc.description.sponsorship | Ministerio de Ciencia e Innovación TIN2010-09988-E | es |
dc.format | application/pdf | es |
dc.format.extent | 14 | es |
dc.language.iso | eng | es |
dc.publisher | SpringerLink | es |
dc.relation.ispartof | CAiSE 2012: Advanced Information Systems Engineering Workshops (2012), pp. 235-248. | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Information Extraction | es |
dc.subject | Rule Learning Reference Architecture | es |
dc.title | A Reference Architecture to Devise Web Information Extractors | es |
dc.type | info:eu-repo/semantics/conferenceObject | es |
dcterms.identifier | https://ror.org/03yxnpp24 | |
dc.type.version | info:eu-repo/semantics/publishedVersion | es |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | es |
dc.contributor.affiliation | Universidad de Sevilla. Departamento de Lenguajes y Sistemas Informáticos | es |
dc.relation.projectID | TIN2007-64119 | es |
dc.relation.projectID | P07-TIC-2602 | es |
dc.relation.projectID | P08-TIC-4100 | es |
dc.relation.projectID | TIN2008-04718-E | es |
dc.relation.projectID | TIN2010-21744 | es |
dc.relation.projectID | TIN2010-09809-E | es |
dc.relation.projectID | TIN2010-10811-E | es |
dc.relation.projectID | TIN2010-09988-E | es |
dc.relation.publisherversion | https://link.springer.com/chapter/10.1007/978-3-642-31069-0_21#citeas | es |
dc.identifier.doi | 10.1007/978-3-642-31069-0_21 | es |
dc.publication.initialPage | 235 | es |
dc.publication.endPage | 248 | es |
dc.eventtitle | CAiSE 2012: Advanced Information Systems Engineering Workshops | es |
dc.eventinstitution | Gdańsk (Polonia) | es |
dc.contributor.funder | Ministerio de Educación y Ciencia (MEC). España | es |
dc.contributor.funder | Junta de Andalucía | es |
dc.contributor.funder | Ministerio de Ciencia e Innovación (MICIN). España | es |
dc.contributor.funder | Ministerio de Economía, Industria y Competitividad | es |