File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1137/1.9781611972757.41
- Scopus: eid_2-s2.0-69949109468
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Cross table cubing: mining iceberg cubes from data warehouses
Title | Cross table cubing: mining iceberg cubes from data warehouses |
---|---|
Authors | |
Issue Date | 2005 |
Publisher | Society for Industrial and Applied Mathematics. |
Citation | The 5th SIAM International Conference on Data Mining, Newport Beach, CA., 21-23 April 2005. In Proceedings of the 2005 SIAM International Conference on Data Mining, 2005, p. 461-465 How to Cite? |
Abstract | All of the existing (iceberg) cube computation algorithms assume that the data is stored in a single base table, however, in practice, a data warehouse is often organized in a schema of multiple tables, such as star schema and snowflake schema. In terms of both computation time and space, materializing a universal base table by joining multiple tables is often very expensive or even unaffordable in real data warehouses. In this paper, we investigate the problem of computing iceberg cubes from data warehouses. Surprisingly, our study shows that computing iceberg cube from multiple tables directly can be even more efficient in both space and runtime than computing from a materialized universal base table. We develop an efficient algorithm, CTC (for Cross Table Cubing) to tackle the problem. An extensive performance study on synthetic data sets demonstrates that our new approach is efficient and scalable for large data warehouses. |
Persistent Identifier | http://hdl.handle.net/10722/45530 |
ISBN |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Cho, M | en_HK |
dc.contributor.author | Pei, J | en_HK |
dc.contributor.author | Cheung, DWL | en_HK |
dc.date.accessioned | 2007-10-30T06:28:33Z | - |
dc.date.available | 2007-10-30T06:28:33Z | - |
dc.date.issued | 2005 | en_HK |
dc.identifier.citation | The 5th SIAM International Conference on Data Mining, Newport Beach, CA., 21-23 April 2005. In Proceedings of the 2005 SIAM International Conference on Data Mining, 2005, p. 461-465 | en_HK |
dc.identifier.isbn | 0-89871-593-8 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/45530 | - |
dc.description.abstract | All of the existing (iceberg) cube computation algorithms assume that the data is stored in a single base table, however, in practice, a data warehouse is often organized in a schema of multiple tables, such as star schema and snowflake schema. In terms of both computation time and space, materializing a universal base table by joining multiple tables is often very expensive or even unaffordable in real data warehouses. In this paper, we investigate the problem of computing iceberg cubes from data warehouses. Surprisingly, our study shows that computing iceberg cube from multiple tables directly can be even more efficient in both space and runtime than computing from a materialized universal base table. We develop an efficient algorithm, CTC (for Cross Table Cubing) to tackle the problem. An extensive performance study on synthetic data sets demonstrates that our new approach is efficient and scalable for large data warehouses. | en_HK |
dc.language | eng | en_HK |
dc.publisher | Society for Industrial and Applied Mathematics. | en_HK |
dc.relation.ispartof | Proceedings of the 2005 SIAM International Conference on Data Mining | - |
dc.rights | © 2005 Society for Industrial and Applied Mathematics. First Published in Proceedings of the 2005 SIAM International Conference on Data Mining in 2005, published by the Society for Industrial and Applied Mathematics (SIAM). | - |
dc.title | Cross table cubing: mining iceberg cubes from data warehouses | en_HK |
dc.type | Conference_Paper | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.doi | 10.1137/1.9781611972757.41 | - |
dc.identifier.scopus | eid_2-s2.0-69949109468 | - |
dc.identifier.hkuros | 103219 | - |
dc.identifier.spage | 461 | - |
dc.identifier.epage | 465 | - |