Maintenance of maximal frequent itemsets in large databases

Lian, W; Cheung, DW; Yiu, SM

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1145/1244002.1244094
Scopus: eid_2-s2.0-35248836871

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Maintenance of maximal frequent itemsets in large databases

Title	Maintenance of maximal frequent itemsets in large databases
Authors	Lian, W Cheung, DW Yiu, SM
Issue Date	2007
Citation	Proceedings Of The Acm Symposium On Applied Computing, 2007, p. 388-392 How to Cite? DOI: http://dx.doi.org/10.1145/1244002.1244094
Abstract	There have been many studies on efficient discovery of maximal frequent itemsets in large databases. However, it is nontrivial to maintain such discovered itemsets if more and more data is inserted into the database as the insertions may invalidate some existing maximal frequent itemsets and also create some new ones. In this paper, we clearly address the relationships between old and new maximal frequent itemsets and propose an algorithm IMFI, which is based on these relationships to reuse previously discovered knowledge. The algorithm follows a top-down mechanism rather than traditional bottom-up methods to produce fewer candidates. Moreover, we integrate SG-tree into IMFI to improve the counting efficiency, which is faster than those methods based on vertical bitmap database representation. Evaluations on IMFI have been performed using both synthetic and real databases. Preliminary results show that applying IMFI is always much faster than an available incremental MFI mining algorithm, especially when it is equipped with SG-tree. Copyright 2007 ACM.
Persistent Identifier	http://hdl.handle.net/10722/93253
References	References in Scopus

DC Field	Value	Language
dc.contributor.author	Lian, W	en_HK
dc.contributor.author	Cheung, DW	en_HK
dc.contributor.author	Yiu, SM	en_HK
dc.date.accessioned	2010-09-25T14:55:31Z	-
dc.date.available	2010-09-25T14:55:31Z	-
dc.date.issued	2007	en_HK
dc.identifier.citation	Proceedings Of The Acm Symposium On Applied Computing, 2007, p. 388-392	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/93253	-
dc.description.abstract	There have been many studies on efficient discovery of maximal frequent itemsets in large databases. However, it is nontrivial to maintain such discovered itemsets if more and more data is inserted into the database as the insertions may invalidate some existing maximal frequent itemsets and also create some new ones. In this paper, we clearly address the relationships between old and new maximal frequent itemsets and propose an algorithm IMFI, which is based on these relationships to reuse previously discovered knowledge. The algorithm follows a top-down mechanism rather than traditional bottom-up methods to produce fewer candidates. Moreover, we integrate SG-tree into IMFI to improve the counting efficiency, which is faster than those methods based on vertical bitmap database representation. Evaluations on IMFI have been performed using both synthetic and real databases. Preliminary results show that applying IMFI is always much faster than an available incremental MFI mining algorithm, especially when it is equipped with SG-tree. Copyright 2007 ACM.	en_HK
dc.language	eng	en_HK
dc.relation.ispartof	Proceedings of the ACM Symposium on Applied Computing	en_HK
dc.title	Maintenance of maximal frequent itemsets in large databases	en_HK
dc.type	Conference_Paper	en_HK
dc.identifier.email	Cheung, DW:dcheung@cs.hku.hk	en_HK
dc.identifier.email	Yiu, SM:smyiu@cs.hku.hk	en_HK
dc.identifier.authority	Cheung, DW=rp00101	en_HK
dc.identifier.authority	Yiu, SM=rp00207	en_HK
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1145/1244002.1244094	en_HK
dc.identifier.scopus	eid_2-s2.0-35248836871	en_HK
dc.identifier.hkuros	135466	en_HK
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-35248836871&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.spage	388	en_HK
dc.identifier.epage	392	en_HK
dc.identifier.scopusauthorid	Lian, W=22433603900	en_HK
dc.identifier.scopusauthorid	Cheung, DW=34567902600	en_HK
dc.identifier.scopusauthorid	Yiu, SM=7003282240	en_HK

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Maintenance of maximal frequent itemsets in large databases

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats