Abstract
Amounts of available heterogeneous semi-structured data grow rapidly on the Web and other data repositories. This raises the need to provide simple and universal ways to access this data. To provide such an interface, we propose to exploit the notion of "unspecified ontologies", describing the data objects as a list of attributes and their respective values. In order to facilitate an efficient management of the unspecified data objects we use a multi-agent channeled multicast communication platform. The data objects are stored distributively, such that each attribute is assigned a designated channel. This allows performing efficient searches by parallel querying of the relevant channels only, and aggregating the partial results. Moreover, the multi-agent platform facilitates advanced data management through extracting metadata from the data objects. We implemented a prototype system and experimented with a corpus of real-life E-Commerce advertisements. Our results demonstrate scalability of the proposed approach and the accuracy of the extracted meta-data.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2006 ACM symposium on Applied computing, SAC '06 |
Place of Publication | New York |
Publisher | Association for Computing Machinery (ACM) |
Pages | 101-105 |
Number of pages | 5 |
ISBN (Electronic) | 1595931082 |
DOIs | |
Publication status | Published - 2006 |
Externally published | Yes |
Event | 2006 ACM Symposium on Applied Computing, SAC 2006 - Dijon, France Duration: 23 Apr 2006 → 27 Apr 2006 |
Conference
Conference | 2006 ACM Symposium on Applied Computing, SAC 2006 |
---|---|
Country/Territory | France |
City | Dijon |
Period | 23/04/06 → 27/04/06 |