Computer Help Forums  
     

Left Nav Register FAQ Members List Calendar Mark Forums Read Right Nav

Left Container Right Container
 

Go Back   Computer Help Forums » Member Input » Members' Lounge

Reply
 
LinkBack Thread Tools
Old May-04-2005   #1 (permalink)
Alan
"The Tipster"
 
Alan's Avatar
 
Join Date: Mar 18 2004
Location: Klågerup, Sweden
Posts: 2,960
Alan will become famous soon enough
Smile How Much Of The Internet Do you Access?

The Content That Goes Beyond Google, Yahoo, Dogpile, and MSN...

Many untrained users have the naive expectation that they can locate anything on the Net by using Google or Yahoo or Ask.com. No chance, as powerful as these search engines are, they do not index everything on the Internet. In fact, search engines index less than 10% of the entire Internet! That remaining 90% is called the "Invisible Web", or in other words, "The Cloaked Web" or "The Deep Web". This is the massive content that is beyond the capabilities of spider indexing.

To better visualize this concept, let's start with some size estimates from Google.com, Yahoo.com, Cyberatlas, and MIT. These stats are current to April 2005:
45 billion static web pages are publicly-available on the World Wide Web. Another estimated 5 billion static pages are available within private intranet sites
200+ billion database-driven pages are available as dynamic database reports ("invisible web" pages)
Google.com indexes 8.06 billion web pages.

Google, considered by experts to be the best search database today, is capable of indexing only a fraction of this monstrous content! Even by using electronic spiders to read and catalog millions of web pages each week, Google current indexes only 8.06 billion web pages of the 200+ billion pages out there, or less than 4% of all available Internet content.

So if Google only catalogs 4% of the World Wide Web, and other search engines catalog even less, then where is the remaining 90%of web content hidden?

That's where the "Invisible Web" begins.
The "Invisible Web" (aka "Deep Web" or "Cloaked Web") is content that is closed-off to search engines. To be specific: the Invisible Web is about 200+ billion web pages that are stored not as static web pages, but as on-demand database content. As of February 2005, robot spiders are not advanced enough to read these private databases. Only a human reader can see these "invisible pages" by directly visiting these sites and making direct database requests.

Technical terminlogy:
"Spider": an artifical intelligence program, or robot, that is sent out weekly to scour the public Internet and read millions of static web pages. The spider reports back to its mother database with its results, and those results get collated into search engine catalogs for public use.
"Database-Driven Web Content": web pages that exist only temporarily, and are generated only when readers request answers from a large database. These temporary web pages are dynamic, usually cannot be bookmarked, and commonly have extremely long URL addresses. Examples of databased web content: Today's job postings in Honolulu, apartments available in Singapore, today's weather report for Dublin, flights available to Istanbul, stock quotes for the NYSE, houses for sale in Winnipeg, reviews on the movie "Sideways", leather jackets for sale on eBay, hard drives for sale at Best Buy, your current savings account balance. These temporary web pages, once displayed to the reader, cease to exist moments later. Minute-by-minute, these databased web pages are re-created to reflect updated information on the database.

"OK. So I think I understand this now. 'Invisible Web Pages' are really 'Dynamic Web Pages'. That's when a database builds me a temporary page to answer my database question! Neat! So how do I find these thousands of "Invisible Web" databases?
Finding and reading "Invisible Web" pages is not difficult nor prohibited, but it is time-consuming. Because Invisible Web ("Deep Web") content is very specialized and unique to a specific topic of changing content, you will need to search twice:
First: you need to use primary search engines (Google, Yahoo, MSN, A9, Vivisimo, Dogpile, Ask) to locate the database you want. Expect this to be at least 30 minutes of searching (e.g. search for "jobs in honolulu", "weather reports dublin", "houses for sale winnipeg", "hard drive sales at best buy")
Secondly: once you find the specific database you want, you then need to search within that database. For example: you may find www.monster.com helpful for searching databased jobs in Honolulu. Perhaps you will read www.theweathernetwork.com to find your weather reports, or www.expedia.com to search for available flights. Note: expect this "invisible web" database searching to take hours; these databases overflow with interesting and useful content!

Article Source - About Computers
Attached Images
File Type: jpg n4layersc.jpg (88.9 KB, 5 views)

Help us to help you!
Be specific with your problem and tell us your machines specs.

Alan is: offline   Reply With Quote
Old May-05-2005   #2 (permalink)
Daniel
Moderator
 
Daniel's Avatar
 
Join Date: Aug 08 2004
Location: Dingli, MALTA (EU)
Posts: 1,602
Daniel will become famous soon enough
Very interesting, never knew that GOOGLE can search on 4% of the www. So a small amount

Anyways, so it's like you have to search within the database stored in the search engine, twice to find the Deep Web. For example I am searching for local PC stores but when I search on google for Maltese PC Stores I can see only a few. Then you will notice some web pages similar to www.searchmalta.com (for example) and in that database you will find lots of deep web.

I think I learned the difference Very interesting

Daniel is: offline   Reply With Quote
Reply

Bookmarks


Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
i now cant access internet at all fatman_joe Internet Issues 3 Sep-28-2004 18:52
Access Denied rollergirl Internet Issues 1 Sep-15-2004 19:36
How To Re-Install Internet Explorer 6 And Outlook Express 6 Alan Internet HowTo's 0 Aug-14-2004 11:27
How To Re-Install Internet Explorer 6 And Outlook Express 6 Alan Email HowTo's 0 Aug-14-2004 11:27
Simple Glossary of PC Words - H to R Alan General 0 Jun-19-2004 12:36


All times are GMT -4. The time now is 07:21.


Powered by vBulletin. Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.0.0
Style Provided By: Wrestling Clique Wrestling Forums
PC Support Forums.com