File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICMLC.2009.5212352
- Scopus: eid_2-s2.0-70350741542
- WOS: WOS:000281720400291
Supplementary
- Citations:
- Appears in Collections:
Conference Paper: Improved blog clustering through automated weighting of text blocks
Title | Improved blog clustering through automated weighting of text blocks |
---|---|
Authors | |
Keywords | Blog Auto-weighted Clustering Web mining |
Issue Date | 2009 |
Publisher | I E E E. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000424 |
Citation | 2009 International Conference on Machine Learning and Cybernetics, Baoding, China, 12-15 July 2009, v. 3, p. 1586-1591 How to Cite? |
Abstract | In this paper, a new clustering algorithm is proposed for blog data clustering. Considering the structure information of text blocks in blog data, we group the features of blog data into three groups and extend the k-means clustering algorithm to automatically calculate a weight for each feature group in the clustering process. We introduce a new objective function with group weight variables and present the Lagrangian method to derive the formula to calculate the group weights. This formula is added as a new step in the standard k-means iterative clustering process to automatically compute the group weights according to the distribution of features. This new process guarantees the convergency of the clustering process to a local optimal solution. The experimental results have shown that this new algorithm performed better than k-means without group feature weighting on different blog data sets. |
Persistent Identifier | http://hdl.handle.net/10722/223758 |
ISBN | |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Li, HB | - |
dc.contributor.author | Ye, YM | - |
dc.contributor.author | Huang, JZ | - |
dc.date.accessioned | 2016-03-14T07:43:53Z | - |
dc.date.available | 2016-03-14T07:43:53Z | - |
dc.date.issued | 2009 | - |
dc.identifier.citation | 2009 International Conference on Machine Learning and Cybernetics, Baoding, China, 12-15 July 2009, v. 3, p. 1586-1591 | - |
dc.identifier.isbn | 9781424437023 | - |
dc.identifier.uri | http://hdl.handle.net/10722/223758 | - |
dc.description.abstract | In this paper, a new clustering algorithm is proposed for blog data clustering. Considering the structure information of text blocks in blog data, we group the features of blog data into three groups and extend the k-means clustering algorithm to automatically calculate a weight for each feature group in the clustering process. We introduce a new objective function with group weight variables and present the Lagrangian method to derive the formula to calculate the group weights. This formula is added as a new step in the standard k-means iterative clustering process to automatically compute the group weights according to the distribution of features. This new process guarantees the convergency of the clustering process to a local optimal solution. The experimental results have shown that this new algorithm performed better than k-means without group feature weighting on different blog data sets. | - |
dc.language | eng | - |
dc.publisher | I E E E. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000424 | - |
dc.relation.ispartof | Proceedings of the International Conference on Machine Learning and Cybernetics | - |
dc.rights | ©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Blog | - |
dc.subject | Auto-weighted | - |
dc.subject | Clustering | - |
dc.subject | Web mining | - |
dc.title | Improved blog clustering through automated weighting of text blocks | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Huang, JZ: jhuang@eti.hku.hk | - |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1109/ICMLC.2009.5212352 | - |
dc.identifier.scopus | eid_2-s2.0-70350741542 | - |
dc.identifier.hkuros | 164902 | - |
dc.identifier.volume | 3 | - |
dc.identifier.spage | 1586 | - |
dc.identifier.epage | 1591 | - |
dc.identifier.isi | WOS:000281720400291 | - |
dc.publisher.place | United States | - |