Searching and Mining the Web for Personalized and Specialized Information

Chau, M

Appears in Collections:
- Faculty of Business & Economics: Journal/Magazine Articles


Title	Searching and Mining the Web for Personalized and Specialized Information
Authors	Chau, M
Issue Date	2005
Publisher	Association for Computing Machinery, Inc.
Citation	SIGIR Forum, 2005, v. 39 n. 1, p. 57
Abstract	With the rapid growth of the Web, users often face information overload and find it difficult to locate relevant and useful information. Besides general-purpose search engines, alternative approaches exist that can help users search the Web more effectively and efficiently. Personalized search agents and specialized search engines are two such approaches. The goal of this research is to investigate how machine learning and artificial intelligence techniques can be used to improve these approaches. A system development research process was adopted as the methodology. In the first part of this research, five personalized search agents were developed: CI Spider, Meta Spider, Cancer Spider, Nano Spider, and Collaborative Spider. These spiders combine Web searching with techniques such as noun phrasing, text clustering, and multi-agent technologies to help satisfy users’ information needs in different domains and contexts. Individual experiments were designed and conducted to evaluate the proposed approach, and the experimental results showed that the prototype systems performed better than or comparably to traditional search methods. The second part of the research investigates how artificial intelligence techniques can facilitate the development of specialized search engines. A Hopfield Net spider was proposed to locate URLs on the Web that are relevant to a given domain. A feature-based machine-learning text classifier was also proposed to filter Web pages. A prototype system was built for each approach. Both systems were evaluated, and the results demonstrated that both outperformed traditional approaches. This research makes two main contributions. First, it demonstrated how machine learning and artificial intelligence techniques can be used to improve the development of personalized search agents and specialized search engines. Second, it provided a set of tools that can support users in their Web searching and Web mining activities in various contexts.
Persistent Identifier	http://hdl.handle.net/10722/85878
ISSN	0163-5840

DC Field	Value	Language
dc.contributor.author	Chau, M	en_HK
dc.date.accessioned	2010-09-06T09:10:17Z	-
dc.date.available	2010-09-06T09:10:17Z	-
dc.date.issued	2005	en_HK
dc.identifier.citation	SIGIR Forum, 2005, v. 39 n. 1, p. 57	en_HK
dc.identifier.issn	0163-5840	-
dc.identifier.uri	http://hdl.handle.net/10722/85878	-
dc.language	eng	en_HK
dc.publisher	Association for Computing Machinery, Inc.	-
dc.relation.ispartof	SIGIR Forum	en_HK
dc.rights	SIGIR Forum. Copyright © Association for Computing Machinery, Inc.	-
dc.rights	©ACM, 2005. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in PUBLICATION, {VOL#, ISS#, (DATE)} http://doi.acm.org/10.1145/nnnnnn.nnnnnn	-
dc.title	Searching and Mining the Web for Personalized and Specialized Information	en_HK
dc.type	Article	en_HK
dc.identifier.email	Chau, M: mchau@business.hku.hk	en_HK
dc.identifier.authority	Chau, MCL=rp01051	en_HK
dc.identifier.hkuros	105279	en_HK
dc.identifier.volume	39	-
dc.identifier.issue	1	-
dc.identifier.spage	57	-
dc.identifier.epage	57	-
dc.publisher.place	United States	-
dc.identifier.issnl	0163-5840	-
