Cross table cubing: mining iceberg cubes from data warehouses

Cho, M; Pei, J; Cheung, DWL

File Download

103219.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1137/1.9781611972757.41
Scopus: eid_2-s2.0-69949109468

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Cross table cubing: mining iceberg cubes from data warehouses

Title	Cross table cubing: mining iceberg cubes from data warehouses
Authors	Cho, M Pei, J Cheung, DWL
Issue Date	2005
Publisher	Society for Industrial and Applied Mathematics.
Citation	The 5th SIAM International Conference on Data Mining, Newport Beach, CA., 21-23 April 2005. In Proceedings of the 2005 SIAM International Conference on Data Mining, 2005, p. 461-465 How to Cite? DOI: http://dx.doi.org/10.1137/1.9781611972757.41
Abstract	All of the existing (iceberg) cube computation algorithms assume that the data is stored in a single base table, however, in practice, a data warehouse is often organized in a schema of multiple tables, such as star schema and snowflake schema. In terms of both computation time and space, materializing a universal base table by joining multiple tables is often very expensive or even unaffordable in real data warehouses. In this paper, we investigate the problem of computing iceberg cubes from data warehouses. Surprisingly, our study shows that computing iceberg cube from multiple tables directly can be even more efficient in both space and runtime than computing from a materialized universal base table. We develop an efficient algorithm, CTC (for Cross Table Cubing) to tackle the problem. An extensive performance study on synthetic data sets demonstrates that our new approach is efficient and scalable for large data warehouses.
Persistent Identifier	http://hdl.handle.net/10722/45530
ISBN	0-89871-593-8

DC Field	Value	Language
dc.contributor.author	Cho, M	en_HK
dc.contributor.author	Pei, J	en_HK
dc.contributor.author	Cheung, DWL	en_HK
dc.date.accessioned	2007-10-30T06:28:33Z	-
dc.date.available	2007-10-30T06:28:33Z	-
dc.date.issued	2005	en_HK
dc.identifier.citation	The 5th SIAM International Conference on Data Mining, Newport Beach, CA., 21-23 April 2005. In Proceedings of the 2005 SIAM International Conference on Data Mining, 2005, p. 461-465	en_HK
dc.identifier.isbn	0-89871-593-8	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/45530	-
dc.description.abstract	All of the existing (iceberg) cube computation algorithms assume that the data is stored in a single base table, however, in practice, a data warehouse is often organized in a schema of multiple tables, such as star schema and snowflake schema. In terms of both computation time and space, materializing a universal base table by joining multiple tables is often very expensive or even unaffordable in real data warehouses. In this paper, we investigate the problem of computing iceberg cubes from data warehouses. Surprisingly, our study shows that computing iceberg cube from multiple tables directly can be even more efficient in both space and runtime than computing from a materialized universal base table. We develop an efficient algorithm, CTC (for Cross Table Cubing) to tackle the problem. An extensive performance study on synthetic data sets demonstrates that our new approach is efficient and scalable for large data warehouses.	en_HK
dc.language	eng	en_HK
dc.publisher	Society for Industrial and Applied Mathematics.	en_HK
dc.relation.ispartof	Proceedings of the 2005 SIAM International Conference on Data Mining	-
dc.rights	© 2005 Society for Industrial and Applied Mathematics. First Published in Proceedings of the 2005 SIAM International Conference on Data Mining in 2005, published by the Society for Industrial and Applied Mathematics (SIAM).	-
dc.title	Cross table cubing: mining iceberg cubes from data warehouses	en_HK
dc.type	Conference_Paper	en_HK
dc.description.nature	published_or_final_version	en_HK
dc.identifier.doi	10.1137/1.9781611972757.41	-
dc.identifier.scopus	eid_2-s2.0-69949109468	-
dc.identifier.hkuros	103219	-
dc.identifier.spage	461	-
dc.identifier.epage	465	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Cross table cubing: mining iceberg cubes from data warehouses

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats