File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Maintenance of maximal frequent itemsets in large databases

TitleMaintenance of maximal frequent itemsets in large databases
Authors
Issue Date2007
Citation
Proceedings Of The Acm Symposium On Applied Computing, 2007, p. 388-392 How to Cite?
AbstractThere have been many studies on efficient discovery of maximal frequent itemsets in large databases. However, it is nontrivial to maintain such discovered itemsets if more and more data is inserted into the database as the insertions may invalidate some existing maximal frequent itemsets and also create some new ones. In this paper, we clearly address the relationships between old and new maximal frequent itemsets and propose an algorithm IMFI, which is based on these relationships to reuse previously discovered knowledge. The algorithm follows a top-down mechanism rather than traditional bottom-up methods to produce fewer candidates. Moreover, we integrate SG-tree into IMFI to improve the counting efficiency, which is faster than those methods based on vertical bitmap database representation. Evaluations on IMFI have been performed using both synthetic and real databases. Preliminary results show that applying IMFI is always much faster than an available incremental MFI mining algorithm, especially when it is equipped with SG-tree. Copyright 2007 ACM.
Persistent Identifierhttp://hdl.handle.net/10722/93253
References

 

DC FieldValueLanguage
dc.contributor.authorLian, Wen_HK
dc.contributor.authorCheung, DWen_HK
dc.contributor.authorYiu, SMen_HK
dc.date.accessioned2010-09-25T14:55:31Z-
dc.date.available2010-09-25T14:55:31Z-
dc.date.issued2007en_HK
dc.identifier.citationProceedings Of The Acm Symposium On Applied Computing, 2007, p. 388-392en_HK
dc.identifier.urihttp://hdl.handle.net/10722/93253-
dc.description.abstractThere have been many studies on efficient discovery of maximal frequent itemsets in large databases. However, it is nontrivial to maintain such discovered itemsets if more and more data is inserted into the database as the insertions may invalidate some existing maximal frequent itemsets and also create some new ones. In this paper, we clearly address the relationships between old and new maximal frequent itemsets and propose an algorithm IMFI, which is based on these relationships to reuse previously discovered knowledge. The algorithm follows a top-down mechanism rather than traditional bottom-up methods to produce fewer candidates. Moreover, we integrate SG-tree into IMFI to improve the counting efficiency, which is faster than those methods based on vertical bitmap database representation. Evaluations on IMFI have been performed using both synthetic and real databases. Preliminary results show that applying IMFI is always much faster than an available incremental MFI mining algorithm, especially when it is equipped with SG-tree. Copyright 2007 ACM.en_HK
dc.languageengen_HK
dc.relation.ispartofProceedings of the ACM Symposium on Applied Computingen_HK
dc.titleMaintenance of maximal frequent itemsets in large databasesen_HK
dc.typeConference_Paperen_HK
dc.identifier.emailCheung, DW:dcheung@cs.hku.hken_HK
dc.identifier.emailYiu, SM:smyiu@cs.hku.hken_HK
dc.identifier.authorityCheung, DW=rp00101en_HK
dc.identifier.authorityYiu, SM=rp00207en_HK
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1145/1244002.1244094en_HK
dc.identifier.scopuseid_2-s2.0-35248836871en_HK
dc.identifier.hkuros135466en_HK
dc.relation.referenceshttp://www.scopus.com/mlt/select.url?eid=2-s2.0-35248836871&selection=ref&src=s&origin=recordpageen_HK
dc.identifier.spage388en_HK
dc.identifier.epage392en_HK
dc.identifier.scopusauthoridLian, W=22433603900en_HK
dc.identifier.scopusauthoridCheung, DW=34567902600en_HK
dc.identifier.scopusauthoridYiu, SM=7003282240en_HK

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats