Facilitating Knowledge Discovery by Mining the Content and Link Structure of the Web

Persistent Link:
http://hdl.handle.net/10150/194384
Title:
Facilitating Knowledge Discovery by Mining the Content and Link Structure of the Web
Author:
Qin, Jialun
Issue Date:
2006
Publisher:
The University of Arizona.
Rights:
Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author.
Abstract:
Given the vast amount of online information covering almost all aspects of human endeavor, the Internet, especially the Web, is clearly a fertile ground for data mining research from which to extract valuable knowledge. Web mining is the application of data mining techniques to extract knowledge from Web data, including Web documents, Web hyperlink structure, and Web usage logs.Traditional Web mining research has been mainly focused on addressing the information overload problem. Many information retrieval (IR) and artificial intelligence (AI) techniques have been adopted or developed to identify relevant information from the Web to meet users' specific information needs. However, most existing studies do not fully explore the social and behavioral aspects of the Web. Thus, the primary goal of this dissertation is to develop an integrated research framework that extends traditional Web mining methodologies to fully explore the technical, social, and behavioral aspects of Web knowledge discovery.My dissertation framework is composed of technical and social/behavioral components. In the technical component of my dissertation, a set of domain specific Web collection building, Web content and link structure mining, and Web knowledge presentation techniques were developed. These techniques were tested in a series of case studies to demonstrate their effectiveness and efficiency in facilitating knowledge discovery in various domains.The social/behavioral component of my dissertation is to explore the application of Web mining technology as a new means to study the social interactions and behavior of Web content providers and users. Several case studies were conducted to extract knowledge on covert organizations' resource allocation plans, information management policies, and technical sophistication using Web mining techniques. Such knowledge would be very difficult to obtain through other means.The major contributions of this dissertation are twofold. First, it proposed a set of new Web mining techniques that can help facilitate knowledge discovery in various domains. Second, it demonstrated the effectiveness and efficiency of applying Web mining techniques in extracting social and behavioral knowledge in different contexts.
Type:
text; Electronic Dissertation
Keywords:
Web mining; knowledge discovery; Web collection building; Web content analysis; Web link structure analysis
Degree Name:
DMgt
Degree Level:
doctoral
Degree Program:
Management Information Systems; Graduate College
Degree Grantor:
University of Arizona
Advisor:
Chen, Hsinchun
Committee Chair:
Chen, Hsinchun

Full metadata record

DC FieldValue Language
dc.language.isoENen_US
dc.titleFacilitating Knowledge Discovery by Mining the Content and Link Structure of the Weben_US
dc.creatorQin, Jialunen_US
dc.contributor.authorQin, Jialunen_US
dc.date.issued2006en_US
dc.publisherThe University of Arizona.en_US
dc.rightsCopyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author.en_US
dc.description.abstractGiven the vast amount of online information covering almost all aspects of human endeavor, the Internet, especially the Web, is clearly a fertile ground for data mining research from which to extract valuable knowledge. Web mining is the application of data mining techniques to extract knowledge from Web data, including Web documents, Web hyperlink structure, and Web usage logs.Traditional Web mining research has been mainly focused on addressing the information overload problem. Many information retrieval (IR) and artificial intelligence (AI) techniques have been adopted or developed to identify relevant information from the Web to meet users' specific information needs. However, most existing studies do not fully explore the social and behavioral aspects of the Web. Thus, the primary goal of this dissertation is to develop an integrated research framework that extends traditional Web mining methodologies to fully explore the technical, social, and behavioral aspects of Web knowledge discovery.My dissertation framework is composed of technical and social/behavioral components. In the technical component of my dissertation, a set of domain specific Web collection building, Web content and link structure mining, and Web knowledge presentation techniques were developed. These techniques were tested in a series of case studies to demonstrate their effectiveness and efficiency in facilitating knowledge discovery in various domains.The social/behavioral component of my dissertation is to explore the application of Web mining technology as a new means to study the social interactions and behavior of Web content providers and users. Several case studies were conducted to extract knowledge on covert organizations' resource allocation plans, information management policies, and technical sophistication using Web mining techniques. Such knowledge would be very difficult to obtain through other means.The major contributions of this dissertation are twofold. First, it proposed a set of new Web mining techniques that can help facilitate knowledge discovery in various domains. Second, it demonstrated the effectiveness and efficiency of applying Web mining techniques in extracting social and behavioral knowledge in different contexts.en_US
dc.typetexten_US
dc.typeElectronic Dissertationen_US
dc.subjectWeb miningen_US
dc.subjectknowledge discoveryen_US
dc.subjectWeb collection buildingen_US
dc.subjectWeb content analysisen_US
dc.subjectWeb link structure analysisen_US
thesis.degree.nameDMgten_US
thesis.degree.leveldoctoralen_US
thesis.degree.disciplineManagement Information Systemsen_US
thesis.degree.disciplineGraduate Collegeen_US
thesis.degree.grantorUniversity of Arizonaen_US
dc.contributor.advisorChen, Hsinchunen_US
dc.contributor.chairChen, Hsinchunen_US
dc.contributor.committeememberNunamaker, Jay F.en_US
dc.contributor.committeememberZhao, J. Leonen_US
dc.identifier.proquest1777en_US
dc.identifier.oclc659747539en_US
All Items in UA Campus Repository are protected by copyright, with all rights reserved, unless otherwise indicated.