Upload
kelly-jarvis
View
28
Download
0
Embed Size (px)
DESCRIPTION
Internet Resources Discovery (IRD). Internet/WWW Statistics. Thanks to Yoram Dahan and Miki Even-Haim. Internet/WWW Statistics. Page Connections Searching. How richly connected is it?. How richly connected is it?. How richly connected is it?. How richly connected is it?. - PowerPoint PPT Presentation
Citation preview
T.Sharon-A.Frank1
Internet Resources Discovery (IRD)
Internet/WWW
Statistics
Thanks to Yoram Dahan and Miki Even-Haim
T.Sharon-A.Frank
2
Internet/WWW Statistics
• Page Connections
• Searching
T.Sharon-A.Frank
3
How richly connected is it?
Outbound Connections
As The Table shows, a large majority (just under 75%) of all pagescontain at least one URL. Note that this includes local("#"-prefixed) URLs; still, it is fair to conclude that pure "leaf"pages are in the minority. It is fairly uncommon (less than 10%)for a page to contain exactly one URL.
T.Sharon-A.Frank
4
How richly connected is it?
T.Sharon-A.Frank
5
How richly connected is it?
T.Sharon-A.Frank
6
How richly connected is it?
T.Sharon-A.Frank
7
Search Engine Sizes Over Time
T.Sharon-A.Frank
8
Search Engine Sizes Over Time
T.Sharon-A.Frank
9
Large Search Engines (by pages)
T.Sharon-A.Frank
10
Current SE Sizes Comparisons
Sizes are as reported by each search engine and as of December 11, 2001.
KEY: GG=Google, FAST=FAST, AV=AltaVista, INK=Inktomi, NL=Northern Light.
T.Sharon-A.Frank
11
Directory SizesDirectories are usually human-compiled guides to the web, where sites areorganized by category. The chart below compares the size of directories atvarious services, along with other key data.Service Type Editors Cats Links... As OfOpenDirectory
D 36,000 361,0002.6million
4/01
LookSmart D 200 200,0002.5million
8/01
Yahoo D 100+ n/a1.5 to1.8million
8/00
AltaVista SE See LookSmartExcite SE See LookSmartHotBot SE See Open DirectoryLycos D See Open DirectoryMSNSearch
SE See LookSmart
Netscape SE See Open Directory
T.Sharon-A.Frank
12
Directory Sizes Type: Shows whether a service is primarily a directory (D)or a search engine (SE).Editors: Shows how many people are involved inproducing the listings. More is not necessarily better, assome services claim that technology helps them do more.However, a large number of editors can be a good sign thata quality directory is being built and keeping up with thegrowth of the web.Cats: Shows how many categories each directory has.Links: Shows how many unique URLs exist in thedirectory, usually as reported by each directory or drawn onrecent interviews I've conducted. In the case of Yahoo, arange is shown. The upper figure comes from going intoeach major category and adding up the counts for eachsubcategory listed. However, since some URLs may appearin more than one category, this method may produce anovercount. Thus, an estimated lower figure for Yahoo isalso shown.As Of: Shows how current the information is, for each directory
T.Sharon-A.Frank
13
Web SE Effectiveness (2001)
T.Sharon-A.Frank
14
Leading Search Engines (by entries)
T.Sharon-A.Frank
15
Google Search Services
T.Sharon-A.Frank
16
References
• Search Engines Watch– http://searchenginewatch.com/reports/
• Online Computer Library Center– http://wcp.oclc.org/stats/size.html
• UCLA Center for Communication Policy– http://www.ccp.ucla.edu/pages/InternetStudy.asp
• Internet Statistics– http://www.mit.edu/people/mkgray/net/