Persistent Link:
http://hdl.handle.net/10150/105423
Title:
A Smart Itsy Bitsy Spider for the Web
Author:
Chen, Hsinchun; Chung, Yi-Ming; Ramsey, Marshall C.; Yang, Christopher C.
Citation:
A Smart Itsy Bitsy Spider for the Web 1998, 49(7):604-618 Journal of the American Society for Information Science, Special Issue on AI Techniques for Emerging Information Systems Applications
Publisher:
Wiley Periodicals, Inc
Journal:
Journal of the American Society for Information Science, Special Issue on AI Techniques for Emerging Information Systems Applications
Issue Date:
1998
Description:
Artificial Intelligence Lab, Department of MIS, University of Arizona
URI:
http://hdl.handle.net/10150/105423
Submitted date:
2004-09-20
Abstract:
As part of the ongoing Illinois Digital Library Initiative project, this research proposes an intelligent agent approach to Web searching. In this experiment, we developed two Web personal spiders based on best first search and genetic algorithm techniques, respectively. These personal spiders can dynamically take a user's selected starting homepages and search for the most closely related homepages in the Web, based on the links and keyword indexing. A graphical, dynamic, Java-based interface was developed and is available for Web access. A system architecture for implementing such an agent-based spider is presented, followed by detailed discussions of benchmark testing and user evaluation results. In benchmark testing, although the genetic algorithm spider did not outperform the best first search spider, we found both results to be comparable and complementary. In user evaluation, the genetic algorithm spider obtained significantly higher recall value than that of the best first search spider. However, their precision values were not statistically different. The mutation process introduced in genetic algorithm allows users to find other potential relevant homepages that cannot be explored via a conventional local search process. In addition, we found the Java-based interface to be a necessary component for design of a truly interactive and dynamic Web agent.
Type:
Journal Article (Paginated)
Language:
en
Keywords:
Artificial Intelligence; World Wide Web; Information Extraction
Local subject classification:
National Science Digital Library; NSDL; Artificial intelligence lab; AI lab; Illinois Digital Library Initiative

Full metadata record

DC Field | Value | Language
dc.contributor.author | Chen, Hsinchun | en_US
dc.contributor.author | Chung, Yi-Ming | en_US
dc.contributor.author | Ramsey, Marshall C. | en_US
dc.contributor.author | Yang, Christopher C. | en_US
dc.date.accessioned | 2004-09-20T00:00:01Z | -
dc.date.available | 2010-06-18T23:25:08Z | -
dc.date.issued | 1998 | en_US
dc.date.submitted | 2004-09-20 | en_US
dc.identifier.citation | A Smart Itsy Bitsy Spider for the Web 1998, 49(7):604-618 Journal of the American Society for Information Science, Special Issue on AI Techniques for Emerging Information Systems Applications | en_US
dc.identifier.uri | http://hdl.handle.net/10150/105423 | -
dc.description | Artificial Intelligence Lab, Department of MIS, University of Arizona | en_US
dc.description.abstract | As part of the ongoing Illinois Digital Library Initiative project, this research proposes an intelligent agent approach to Web searching. In this experiment, we developed two Web personal spiders based on best first search and genetic algorithm techniques, respectively. These personal spiders can dynamically take a user's selected starting homepages and search for the most closely related homepages in the Web, based on the links and keyword indexing. A graphical, dynamic, Java-based interface was developed and is available for Web access. A system architecture for implementing such an agent-based spider is presented, followed by detailed discussions of benchmark testing and user evaluation results. In benchmark testing, although the genetic algorithm spider did not outperform the best first search spider, we found both results to be comparable and complementary. In user evaluation, the genetic algorithm spider obtained significantly higher recall value than that of the best first search spider. However, their precision values were not statistically different. The mutation process introduced in genetic algorithm allows users to find other potential relevant homepages that cannot be explored via a conventional local search process. In addition, we found the Java-based interface to be a necessary component for design of a truly interactive and dynamic Web agent. | en_US
dc.format.mimetype | application/pdf | en_US
dc.language.iso | en | en_US
dc.publisher | Wiley Periodicals, Inc | en_US
dc.subject | Artificial Intelligence | en_US
dc.subject | World Wide Web | en_US
dc.subject | Information Extraction | en_US
dc.subject.other | National Science Digital Library | en_US
dc.subject.other | NSDL | en_US
dc.subject.other | Artificial intelligence lab | en_US
dc.subject.other | AI lab | en_US
dc.subject.other | Illinois Digital Library Initiative | en_US
dc.title | A Smart Itsy Bitsy Spider for the Web | en_US
dc.type | Journal Article (Paginated) | en_US
dc.identifier.journal | Journal of the American Society for Information Science, Special Issue on AI Techniques for Emerging Information Systems Applications | en_US
All Items in UA Campus Repository are protected by copyright, with all rights reserved, unless otherwise indicated.