Knowledge Acquisition, Delivery and Prediction through Text Mining

Persistent Link:
http://hdl.handle.net/10150/194680
Title:
Knowledge Acquisition, Delivery and Prediction through Text Mining
Author:
Schumaker, Robert P.
Issue Date:
2007
Publisher:
The University of Arizona.
Rights:
Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author.
Abstract:
The World Wide Web is an abundant source for Textual Web Mining research. Data can be acquired from Web texts and converted to Information or Knowledge for immediate consumption. Studying the acquisition and consumption of Web text can provide a glimpse into the social/behavioral aspects of Web Users and Web Content Providers. Patterns embedded within textual data can be similarly identified through technical means and even anticipated.Seven essays explore the important algorithmic and computational aspects needed in the analysis of acquiring, delivering and making predictions from Web texts. Chapters 2 and 3 describe the knowledge acquisition process and feasibility of leveraging Web users. While the knowledge acquired from Web users was not as refined as that from domain experts, the knowledge gathered was found to be of acceptable quality. From our analysis of dialog systems, it was found that Web users were more likely to augment the breadth of existing knowledge by adding new response sets to the knowledge base. Chapters 4 and 5 look at the aspects of knowledge delivery to Web users. Using a dialog system, we observe the acceptance and satisfaction levels of dialog responses in general conversation, domain knowledge and the combination of both knowledge bases. Chapters 6 through 8 consider the prediction facet of knowledge using textual financial news articles and stock prices. This section focuses on comparing different model parameters and textual representations to best describe future prices as well as an examination of document representation based on the sector and industry a company is engaged in. From these analyses we found that Sector-based aggregation led to the best price predictions.Together these essays effectively leverage large amounts of textual Web data to represent knowledge in meaningful ways to end users. These essays also provide the blueprints for several real-world applications. The approaches and techniques described borrow from referent disciplines of linguistics, finance, computer science, statistics as well as MIS and demonstrate potentially useful applications for dialog systems, quantitative stock prediction and other knowledge management processes in which textual data can be accurately represented and forecast; thus improving the exchange of human knowledge.
Type:
text; Electronic Dissertation
Degree Name:
PhD
Degree Level:
doctoral
Degree Program:
Management; Graduate College
Degree Grantor:
University of Arizona
Advisor:
Chen, Hsinchun
Committee Chair:
Chen, Hsinchun

Full metadata record

DC FieldValue Language
dc.language.isoENen_US
dc.titleKnowledge Acquisition, Delivery and Prediction through Text Miningen_US
dc.creatorSchumaker, Robert P.en_US
dc.contributor.authorSchumaker, Robert P.en_US
dc.date.issued2007en_US
dc.publisherThe University of Arizona.en_US
dc.rightsCopyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author.en_US
dc.description.abstractThe World Wide Web is an abundant source for Textual Web Mining research. Data can be acquired from Web texts and converted to Information or Knowledge for immediate consumption. Studying the acquisition and consumption of Web text can provide a glimpse into the social/behavioral aspects of Web Users and Web Content Providers. Patterns embedded within textual data can be similarly identified through technical means and even anticipated.Seven essays explore the important algorithmic and computational aspects needed in the analysis of acquiring, delivering and making predictions from Web texts. Chapters 2 and 3 describe the knowledge acquisition process and feasibility of leveraging Web users. While the knowledge acquired from Web users was not as refined as that from domain experts, the knowledge gathered was found to be of acceptable quality. From our analysis of dialog systems, it was found that Web users were more likely to augment the breadth of existing knowledge by adding new response sets to the knowledge base. Chapters 4 and 5 look at the aspects of knowledge delivery to Web users. Using a dialog system, we observe the acceptance and satisfaction levels of dialog responses in general conversation, domain knowledge and the combination of both knowledge bases. Chapters 6 through 8 consider the prediction facet of knowledge using textual financial news articles and stock prices. This section focuses on comparing different model parameters and textual representations to best describe future prices as well as an examination of document representation based on the sector and industry a company is engaged in. From these analyses we found that Sector-based aggregation led to the best price predictions.Together these essays effectively leverage large amounts of textual Web data to represent knowledge in meaningful ways to end users. These essays also provide the blueprints for several real-world applications. The approaches and techniques described borrow from referent disciplines of linguistics, finance, computer science, statistics as well as MIS and demonstrate potentially useful applications for dialog systems, quantitative stock prediction and other knowledge management processes in which textual data can be accurately represented and forecast; thus improving the exchange of human knowledge.en_US
dc.typetexten_US
dc.typeElectronic Dissertationen_US
thesis.degree.namePhDen_US
thesis.degree.leveldoctoralen_US
thesis.degree.disciplineManagementen_US
thesis.degree.disciplineGraduate Collegeen_US
thesis.degree.grantorUniversity of Arizonaen_US
dc.contributor.advisorChen, Hsinchunen_US
dc.contributor.chairChen, Hsinchunen_US
dc.contributor.committeememberZhang, Zhuen_US
dc.contributor.committeememberZhao, Leonen_US
dc.contributor.committeememberNunamaker, Jayen_US
dc.identifier.proquest2058en_US
dc.identifier.oclc659747135en_US
All Items in UA Campus Repository are protected by copyright, with all rights reserved, unless otherwise indicated.