File Download
There are no files associated with this item.
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Using content-based and link-based analysis in building vertical search engines
Title | Using content-based and link-based analysis in building vertical search engines |
---|---|
Authors | |
Issue Date | 2004 |
Publisher | Springer Verlag. The Journal's web site is located at http://springerlink.com/content/105633/ |
Citation | Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2004, v. 3334, p. 515-518 How to Cite? |
Abstract | This paper reports our research in the Web page filtering process in specialized search engine development. We propose a machine-learning-based approach that combines Web content analysis and Web structure analysis. Instead of a bag of words, each Web page is represented by a set of content-based and link-based features, which can be used as the input for various machine learning algorithms. The proposed approach was implemented using both a feedforward/backpropagation neural network and a support vector machine. An evaluation study was conducted and showed that the proposed approaches performed better than the benchmark approaches. © Springer-Verlag Berlin Heidelberg 2004. |
Persistent Identifier | http://hdl.handle.net/10722/177991 |
ISSN | 2023 SCImago Journal Rankings: 0.606 |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chau, M | en_US |
dc.contributor.author | Chen, H | en_US |
dc.date.accessioned | 2012-12-19T09:41:11Z | - |
dc.date.available | 2012-12-19T09:41:11Z | - |
dc.date.issued | 2004 | en_US |
dc.identifier.citation | Lecture Notes In Computer Science (Including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics), 2004, v. 3334, p. 515-518 | en_US |
dc.identifier.issn | 0302-9743 | en_US |
dc.identifier.uri | http://hdl.handle.net/10722/177991 | - |
dc.description.abstract | This paper reports our research in the Web page filtering process in specialized search engine development. We propose a machine-learning-based approach that combines Web content analysis and Web structure analysis. Instead of a bag of words, each Web page is represented by a set of content-based and link-based features, which can be used as the input for various machine learning algorithms. The proposed approach was implemented using both a feedforward/backpropagation neural network and a support vector machine. An evaluation study was conducted and showed that the proposed approaches performed better than the benchmark approaches. © Springer-Verlag Berlin Heidelberg 2004. | en_US |
dc.language | eng | en_US |
dc.publisher | Springer Verlag. The Journal's web site is located at http://springerlink.com/content/105633/ | en_US |
dc.relation.ispartof | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en_US |
dc.title | Using content-based and link-based analysis in building vertical search engines | en_US |
dc.type | Article | en_US |
dc.identifier.email | Chau, M: mchau@hkucc.hku.hk | en_US |
dc.identifier.authority | Chau, M=rp01051 | en_US |
dc.description.nature | link_to_subscribed_fulltext | en_US |
dc.identifier.scopus | eid_2-s2.0-35048817047 | en_US |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-35048817047&selection=ref&src=s&origin=recordpage | en_US |
dc.identifier.volume | 3334 | en_US |
dc.identifier.spage | 515 | en_US |
dc.identifier.epage | 518 | en_US |
dc.publisher.place | Germany | en_US |
dc.identifier.scopusauthorid | Chau, M=7006073763 | en_US |
dc.identifier.scopusauthorid | Chen, H=8871373800 | en_US |
dc.identifier.issnl | 0302-9743 | - |