On the performance of feature weighting K-means for text subspace clustering

Jing, Liping; Ng, Michael K.; Xu, Jun; Huang, Joshua Zhexue

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/11563952_44
Scopus: eid_2-s2.0-33646509173
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Mathematics: Conference papers

Conference Paper: On the performance of feature weighting K-means for text subspace clustering

Title	On the performance of feature weighting K-means for text subspace clustering
Authors	Jing, Liping Ng, Michael K.Xu, Jun Huang, Joshua Zhexue
Keywords	Text Clustering Feature Weighting Convergency Scalability Subspace Clustering
Issue Date	2005
Publisher	Springer.
Citation	6th International Conference on Web-Age Information Management (WAIM 2005), Hangzhou, China, 11-13 October 2005. In Advances in Web-Age Information Management: 6th International Conference, WAIM 2005, Hangzhou, China, October 11 – 13, 2005: Proceedings, 2005, p. 502-512 How to Cite? DOI: http://dx.doi.org/10.1007/11563952_44
Abstract	Text clustering is an effective way of not only organizing textual information, but discovering interesting patterns. Most existing methods, however, suffer from two main drawbacks; they cannot provide an understandable representation for text clusters, and cannot scale to very large text collections. Highly scalable text clustering algorithms are becoming increasingly relevant. In this paper, we present a performance study of a new subspace clustering algorithm for large sparse text data. This algorithm automatically calculates the feature weights in the k-means clustering process. The feature weights are used to discover clusters from subspaces of the text vector space and identify terms that represent the semantics of the clusters. A series of experiments have been conducted to test the performance of the algorithm, including resource consumption and clustering quality. The experimental results on real-world text data have shown that our algorithm quickly converges to a local optimal solution and is scalable to the number of documents, terms and the number of clusters. © Springer-Verlag Berlin Heidelberg 2005.
Persistent Identifier	http://hdl.handle.net/10722/276792
ISBN	9783540292272
ISSN	0302-9743 2023 SCImago Journal Rankings: 0.606
Series/Report no.	Lecture Notes in Computer Science ; 3739

DC Field	Value	Language
dc.contributor.author	Jing, Liping	-
dc.contributor.author	Ng, Michael K.	-
dc.contributor.author	Xu, Jun	-
dc.contributor.author	Huang, Joshua Zhexue	-
dc.date.accessioned	2019-09-18T08:34:40Z	-
dc.date.available	2019-09-18T08:34:40Z	-
dc.date.issued	2005	-
dc.identifier.citation	6th International Conference on Web-Age Information Management (WAIM 2005), Hangzhou, China, 11-13 October 2005. In Advances in Web-Age Information Management: 6th International Conference, WAIM 2005, Hangzhou, China, October 11 – 13, 2005: Proceedings, 2005, p. 502-512	-
dc.identifier.isbn	9783540292272	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/10722/276792	-
dc.description.abstract	Text clustering is an effective way of not only organizing textual information, but discovering interesting patterns. Most existing methods, however, suffer from two main drawbacks; they cannot provide an understandable representation for text clusters, and cannot scale to very large text collections. Highly scalable text clustering algorithms are becoming increasingly relevant. In this paper, we present a performance study of a new subspace clustering algorithm for large sparse text data. This algorithm automatically calculates the feature weights in the k-means clustering process. The feature weights are used to discover clusters from subspaces of the text vector space and identify terms that represent the semantics of the clusters. A series of experiments have been conducted to test the performance of the algorithm, including resource consumption and clustering quality. The experimental results on real-world text data have shown that our algorithm quickly converges to a local optimal solution and is scalable to the number of documents, terms and the number of clusters. © Springer-Verlag Berlin Heidelberg 2005.	-
dc.language	eng	-
dc.publisher	Springer.	-
dc.relation.ispartof	Advances in Web-Age Information Management: 6th International Conference, WAIM 2005, Hangzhou, China, October 11 – 13, 2005: Proceedings	-
dc.relation.ispartofseries	Lecture Notes in Computer Science ; 3739	-
dc.subject	Text Clustering	-
dc.subject	Feature Weighting	-
dc.subject	Convergency	-
dc.subject	Scalability	-
dc.subject	Subspace Clustering	-
dc.title	On the performance of feature weighting K-means for text subspace clustering	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1007/11563952_44	-
dc.identifier.scopus	eid_2-s2.0-33646509173	-
dc.identifier.spage	502	-
dc.identifier.epage	512	-
dc.identifier.eissn	1611-3349	-
dc.publisher.place	Berlin	-
dc.identifier.issnl	0302-9743	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: On the performance of feature weighting K-means for text subspace clustering

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats