File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICDE.2008.4497433
- Scopus: eid_2-s2.0-52649100982
- Find via
Conference Paper: Lattice histograms: A resilient synopsis structure
Title | Lattice histograms: A resilient synopsis structure |
---|---|
Authors | |
Issue Date | 2008 |
Citation | Proceedings - International Conference On Data Engineering, 2008, p. 247-256 How to Cite? |
Abstract | Despite the surge of interest in data reduction techniques over the past years, no method has been proposed to date that can always achieve approximation quality preferable to that of the optimal plain histogram for a target error metric. In this paper, we introduce the Lattice Histogram: a novel data reduction method that discovers and exploits any arbitrary hierarchy in the data, and achieves approximation quality provably at least as high as an optimal histogram for any data reduction problem. We formulate LH construction techniques with approximation guarantees for general error metrics. We show that the case of minimizing a maximum-error metric can be solved by a specialized, memory-sparing approach; we exploit this solution to design reduced-space heuristics for the general-error case. We develop a mixed synopsis approach, applicable to the space-efficient high-quality summarization of very large data sets. We experimentally corroborate the superiority of LHs in approximation quality over previous techniques with representative error metrics and diverse data sets. © 2008 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/151928 |
ISSN | 2023 SCImago Journal Rankings: 1.306 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Karras, P | en_US |
dc.contributor.author | Mamoulis, N | en_US |
dc.date.accessioned | 2012-06-26T06:30:57Z | - |
dc.date.available | 2012-06-26T06:30:57Z | - |
dc.date.issued | 2008 | en_US |
dc.identifier.citation | Proceedings - International Conference On Data Engineering, 2008, p. 247-256 | en_US |
dc.identifier.issn | 1084-4627 | en_US |
dc.identifier.uri | http://hdl.handle.net/10722/151928 | - |
dc.description.abstract | Despite the surge of interest in data reduction techniques over the past years, no method has been proposed to date that can always achieve approximation quality preferable to that of the optimal plain histogram for a target error metric. In this paper, we introduce the Lattice Histogram: a novel data reduction method that discovers and exploits any arbitrary hierarchy in the data, and achieves approximation quality provably at least as high as an optimal histogram for any data reduction problem. We formulate LH construction techniques with approximation guarantees for general error metrics. We show that the case of minimizing a maximum-error metric can be solved by a specialized, memory-sparing approach; we exploit this solution to design reduced-space heuristics for the general-error case. We develop a mixed synopsis approach, applicable to the space-efficient high-quality summarization of very large data sets. We experimentally corroborate the superiority of LHs in approximation quality over previous techniques with representative error metrics and diverse data sets. © 2008 IEEE. | en_US |
dc.language | eng | en_US |
dc.relation.ispartof | Proceedings - International Conference on Data Engineering | en_US |
dc.title | Lattice histograms: A resilient synopsis structure | en_US |
dc.type | Conference_Paper | en_US |
dc.identifier.email | Mamoulis, N:nikos@cs.hku.hk | en_US |
dc.identifier.authority | Mamoulis, N=rp00155 | en_US |
dc.description.nature | link_to_subscribed_fulltext | en_US |
dc.identifier.doi | 10.1109/ICDE.2008.4497433 | en_US |
dc.identifier.scopus | eid_2-s2.0-52649100982 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-52649100982&selection=ref&src=s&origin=recordpage | en_US |
dc.identifier.spage | 247 | en_US |
dc.identifier.epage | 256 | en_US |
dc.publisher.place | United States | en_US |
dc.identifier.scopusauthorid | Karras, P=14028488200 | en_US |
dc.identifier.scopusauthorid | Mamoulis, N=6701782749 | en_US |
dc.identifier.citeulike | 4621723 | - |
dc.identifier.issnl | 1084-4627 | - |