File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICPR.2004.1334305
- Scopus: eid_2-s2.0-10044290499
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Applying the conjugate gradient method for text document categorization
Title | Applying the conjugate gradient method for text document categorization |
---|---|
Authors | |
Keywords | Conjugate Gradient Method Document Classification Linear Least Squares Fit Performance Measures |
Issue Date | 2004 |
Publisher | IEEE, Computer Society. |
Citation | Proceedings - International Conference On Pattern Recognition, 2004, v. 2, p. 558-561 How to Cite? |
Abstract | In this paper, we investigate the effectiveness of two different methods to solve the linear least squares fit (LLSF) problem for document categorization. The first method is the Singular Value Decomposition (SVD) method that has been previously used to solve the document categorization problem. The second method is the Conjugate Gradient (CG) method that is one of the most effective algorithms for solving a linear equation problem. However, up to our knowledge, the CG method has never been applied to handle the document classification, problem. Therefore, we compare the effectiveness of these two LLSF methods to categorize text documents. In addition, we examine the effect of using different term weighting schemes on their performance for document classification. Lastly, we compare the performance of the LLSF classifiers against the neighborhood-based Dt-kNN classifier, our best variant of the kNN classifier integrated with a dynamic threshold scheme, on the Reuters 21578 dataset. Besides being the first proposal to use the CG method for document classification, our work opens up many exciting directions for future investigation. |
Persistent Identifier | http://hdl.handle.net/10722/45793 |
ISSN | 2023 SCImago Journal Rankings: 0.584 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Tam, V | en_HK |
dc.contributor.author | Setiono, R | en_HK |
dc.contributor.author | Santoso, A | en_HK |
dc.date.accessioned | 2007-10-30T06:35:36Z | - |
dc.date.available | 2007-10-30T06:35:36Z | - |
dc.date.issued | 2004 | en_HK |
dc.identifier.citation | Proceedings - International Conference On Pattern Recognition, 2004, v. 2, p. 558-561 | en_HK |
dc.identifier.issn | 1051-4651 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/45793 | - |
dc.description.abstract | In this paper, we investigate the effectiveness of two different methods to solve the linear least squares fit (LLSF) problem for document categorization. The first method is the Singular Value Decomposition (SVD) method that has been previously used to solve the document categorization problem. The second method is the Conjugate Gradient (CG) method that is one of the most effective algorithms for solving a linear equation problem. However, up to our knowledge, the CG method has never been applied to handle the document classification, problem. Therefore, we compare the effectiveness of these two LLSF methods to categorize text documents. In addition, we examine the effect of using different term weighting schemes on their performance for document classification. Lastly, we compare the performance of the LLSF classifiers against the neighborhood-based Dt-kNN classifier, our best variant of the kNN classifier integrated with a dynamic threshold scheme, on the Reuters 21578 dataset. Besides being the first proposal to use the CG method for document classification, our work opens up many exciting directions for future investigation. | en_HK |
dc.format.extent | 978065 bytes | - |
dc.format.extent | 883769 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | application/pdf | - |
dc.language | eng | en_HK |
dc.publisher | IEEE, Computer Society. | en_HK |
dc.relation.ispartof | Proceedings - International Conference on Pattern Recognition | en_HK |
dc.rights | ©2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Conjugate Gradient Method | en_HK |
dc.subject | Document Classification | en_HK |
dc.subject | Linear Least Squares Fit | en_HK |
dc.subject | Performance Measures | en_HK |
dc.title | Applying the conjugate gradient method for text document categorization | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1051-4651&volume=2&spage=558&epage=561&date=2004&atitle=Applying+the+conjugate+gradient+method+for+text+document+categorization | en_HK |
dc.identifier.email | Tam, V:vtam@eee.hku.hk | en_HK |
dc.identifier.authority | Tam, V=rp00173 | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.doi | 10.1109/ICPR.2004.1334305 | en_HK |
dc.identifier.scopus | eid_2-s2.0-10044290499 | en_HK |
dc.identifier.hkuros | 103187 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-10044290499&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 2 | en_HK |
dc.identifier.spage | 558 | en_HK |
dc.identifier.epage | 561 | en_HK |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Tam, V=7005091988 | en_HK |
dc.identifier.scopusauthorid | Setiono, R=7005033162 | en_HK |
dc.identifier.scopusauthorid | Santoso, A=6601931777 | en_HK |
dc.identifier.issnl | 1051-4651 | - |