Persistent Link:
http://hdl.handle.net/10150/105409
Title:
Exploring the Academic Invisible Web
Author:
Lewandowski, Dirk; Mayr, Philipp
Citation:
Exploring the Academic Invisible Web 2006-05,
Issue Date:
May-2006
URI:
http://hdl.handle.net/10150/105409
Submitted date:
2006-05-17
Abstract:
Purpose: To provide a critical review of Bergmanâ s 2001 study on the Deep Web. In addition, we bring a new concept into the discussion, the Academic Invisible Web (AIW). We define the Academic Invisible Web as consisting of all databases and collections relevant to academia but not searchable by the general-purpose internet search engines. Indexing this part of the Invisible Web is central to scientific search engines. We provide an overview of approaches followed thus far. Design/methodology/approach: Discussion of measures and calculations, estimation based on infor-metric laws. Literature review on approaches for uncovering information from the Invisible Web. Findings: Bergmanâ s size estimation of the Invisible Web is highly questionable. We demonstrate some major errors in the conceptual design of the Bergman paper. A new (raw) size estimation is given. Research limitations/implications: The precision of our estimation is limited due to small sample size and lack of reliable data. Practical implications: We can show that no single library alone will be able to index the Academic Invisible Web. We suggest collaboration to accomplish this task. Originality/value: Provides library managers and those interested in developing academic search en-gines with data on the size and attributes of the Academic Invisible Web.
Type:
Preprint
Language:
en
Keywords:
World Wide Web; Information Science; Web Metrics; Internet
Local subject classification:
Search engines; Worldwide Web; Indexing; Scholarly content; Digital library

Full metadata record

DC FieldValue Language
dc.contributor.authorLewandowski, Dirken_US
dc.contributor.authorMayr, Philippen_US
dc.date.accessioned2006-05-17T00:00:01Z-
dc.date.available2010-06-18T23:24:57Z-
dc.date.issued2006-05en_US
dc.date.submitted2006-05-17en_US
dc.identifier.citationExploring the Academic Invisible Web 2006-05,en_US
dc.identifier.urihttp://hdl.handle.net/10150/105409-
dc.description.abstractPurpose: To provide a critical review of Bergmanâ s 2001 study on the Deep Web. In addition, we bring a new concept into the discussion, the Academic Invisible Web (AIW). We define the Academic Invisible Web as consisting of all databases and collections relevant to academia but not searchable by the general-purpose internet search engines. Indexing this part of the Invisible Web is central to scientific search engines. We provide an overview of approaches followed thus far. Design/methodology/approach: Discussion of measures and calculations, estimation based on infor-metric laws. Literature review on approaches for uncovering information from the Invisible Web. Findings: Bergmanâ s size estimation of the Invisible Web is highly questionable. We demonstrate some major errors in the conceptual design of the Bergman paper. A new (raw) size estimation is given. Research limitations/implications: The precision of our estimation is limited due to small sample size and lack of reliable data. Practical implications: We can show that no single library alone will be able to index the Academic Invisible Web. We suggest collaboration to accomplish this task. Originality/value: Provides library managers and those interested in developing academic search en-gines with data on the size and attributes of the Academic Invisible Web.en_US
dc.format.mimetypeapplication/pdfen_US
dc.language.isoenen_US
dc.subjectWorld Wide Weben_US
dc.subjectInformation Scienceen_US
dc.subjectWeb Metricsen_US
dc.subjectInterneten_US
dc.subject.otherSearch enginesen_US
dc.subject.otherWorldwide Weben_US
dc.subject.otherIndexingen_US
dc.subject.otherScholarly contenten_US
dc.subject.otherDigital libraryen_US
dc.titleExploring the Academic Invisible Weben_US
dc.typePreprinten_US
All Items in UA Campus Repository are protected by copyright, with all rights reserved, unless otherwise indicated.