File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Frequent pattern mining on message passing multiprocessor systems

TitleFrequent pattern mining on message passing multiprocessor systems
Authors
KeywordsAssociation rule
Parallel processing
Frequent pattern mining
Data mining
Issue Date2004
Citation
Distributed and Parallel Databases, 2004, v. 16, n. 3, p. 321-334 How to Cite?
AbstractExtraction of frequent patterns in transaction-oriented database is crucial to several data mining tasks such as association rule generation, time series analysis, classification, etc. Most of these mining tasks require multiple passes over the database and if the database size is large, which is usually the case, scalable high performance solutions involving multiple processors are required. This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
Persistent Identifierhttp://hdl.handle.net/10722/254509
ISSN
2015 Impact Factor: 0.8
2015 SCImago Journal Rankings: 0.593

 

DC FieldValueLanguage
dc.contributor.authorJaved, Asif-
dc.contributor.authorKhokhar, Ashfaq-
dc.date.accessioned2018-06-19T15:40:45Z-
dc.date.available2018-06-19T15:40:45Z-
dc.date.issued2004-
dc.identifier.citationDistributed and Parallel Databases, 2004, v. 16, n. 3, p. 321-334-
dc.identifier.issn0926-8782-
dc.identifier.urihttp://hdl.handle.net/10722/254509-
dc.description.abstractExtraction of frequent patterns in transaction-oriented database is crucial to several data mining tasks such as association rule generation, time series analysis, classification, etc. Most of these mining tasks require multiple passes over the database and if the database size is large, which is usually the case, scalable high performance solutions involving multiple processors are required. This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.-
dc.languageeng-
dc.relation.ispartofDistributed and Parallel Databases-
dc.subjectAssociation rule-
dc.subjectParallel processing-
dc.subjectFrequent pattern mining-
dc.subjectData mining-
dc.titleFrequent pattern mining on message passing multiprocessor systems-
dc.typeArticle-
dc.description.natureLink_to_subscribed_fulltext-
dc.identifier.doi10.1023/B:DAPD.0000031634.19130.bd-
dc.identifier.scopuseid_2-s2.0-3543105495-
dc.identifier.volume16-
dc.identifier.issue3-
dc.identifier.spage321-
dc.identifier.epage334-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats