analyticjournalism.com

It's not "all about story" if you don't have anything to say. So go get some data.

If you're really serious about searching….

December 5th, 2007 by Tom Johnson

Deep Web Research 2008

http://www.llrx.com/features/deepweb2008.htm

By Marcus P. Zillman, Published on November 24, 2007

Bots, Blogs and News Aggregators is a keynote presentation that I have been delivering for several years. Much of its material comes from my extensive research into the “invisible” web, or what I like to call the “deep” web. The Deep Web covers somewhere in the vicinity of 900 billion pages of information, located throughout the World Wide Web in various files and formats, that current Internet search engines either cannot find or have difficulty accessing. By comparison, search engines currently locate approximately 20 billion pages.

In the last several years, some of the more comprehensive search engines have written algorithms to search the deeper portions of the World Wide Web by attempting to find files such as .pdf, .doc, .xls, .ppt, .ps and others. Businesses use these files predominantly to communicate information within their organizations or to disseminate it to the outside world. Searching for this information using deeper search techniques and the latest algorithms allows researchers to obtain a vast amount of corporate information that was previously unavailable or inaccessible. Research has also shown that even deeper information can be obtained by searching and accessing the “properties” information of these files.
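As a rough illustration of searching by file type, here is a minimal Python sketch that builds one query per file extension named above. It assumes the search engine supports a `filetype:` operator (Google does; the article does not name a specific engine), and the function names and the Google search URL are illustrative choices, not something taken from the article.

```python
from urllib.parse import urlencode

# File extensions named in the article (without the leading dot).
DEEP_WEB_EXTENSIONS = ["pdf", "doc", "xls", "ppt", "ps"]


def build_filetype_queries(topic, extensions=DEEP_WEB_EXTENSIONS):
    """Return one query string per extension, e.g. 'annual report filetype:pdf'.

    Assumption: the target engine understands the `filetype:` operator.
    """
    return [f"{topic} filetype:{ext}" for ext in extensions]


def build_search_url(query, base_url="https://www.google.com/search"):
    """Encode a query string into a search URL (base_url is an assumption)."""
    return f"{base_url}?{urlencode({'q': query})}"


if __name__ == "__main__":
    # Print a ready-to-open URL for each file type.
    for query in build_filetype_queries("annual report"):
        print(build_search_url(query))
```

Running the script prints one search URL per file type; swapping `base_url` would point the same queries at another engine, provided it accepts a `q` parameter and the same operator.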

This article and guide is designed to give you the resources you need to better understand the history of deep web research. It also offers various classified resources that let you search the currently available web for key nuggets of information that can be found only by understanding how to search the “deep web”.

This Deep Web Research 2008 article is divided into the following sections:

Articles, Papers, Forums, Audios and Videos
Cross Database Articles
Cross Database Search Services
Cross Database Search Tools
Peer to Peer, File Sharing, Grid/Matrix Search Engines
Presentations
Resources – Deep Web Research
Bot Research Resources and Sites

ARTICLES, PAPERS, FORUMS, AUDIOS AND VIDEOS (Current and Historical)