View
218
Download
0
Tags:
Embed Size (px)
Citation preview
USF Department of Computer Science
Motivation
• Desktop is outdated
• Digital libraries abound, but no tools (INFORMATION OVERLOAD)
• Avoiding a Googleopoly
• Peer-to-peer is a great new model, but sharing knowledge is hard.
USF Department of Computer Science
Sneak Preview
• Develop webtop, not desktop tools
• A common search API and registry
• Zero-input publishing
• Open PC Movement
USF Department of Computer Science
Today’s Desktop
BookMark File Manager Hyper-LinksManager
WebSphinx (Miller, Bharat)
USF Department of Computer Science
Desktop tools are outdated
• Filing cabinet metaphor no longer fits
• Unnecessary barrier between local system and web
• Computer is a communication tool, a node in the knowledge base of the world
USF Department of Computer Science
Motivation
• Desktop is outdated
• Digital libraries abound, but no tools (INFORMATION OVERLOAD)
• Avoiding a Googleopoly
• Peer-to-peer is a great new model, but sharing knowledge is hard.
USF Department of Computer Science
Information Sources
• Google, Amazon, etc.
• History: Way-back machine
• Technorati– The World Live Web
• Domain Specific:– ACM Digital Library– Citeseer– MLA for literature
USF Department of Computer Science
Motivation
• Desktop is outdated
• Digital libraries abound, but no tools (INFORMATION OVERLOAD)
• Avoiding a Googleopoly
• Peer-to-peer is a great new model, but sharing knowledge is hard.
USF Department of Computer Science
Media Conglomeration
• FCC lessening anti-trust laws
• Most media now owned by a handful of companies
• How many of us go through Google to get our information.
• Can we trust Google to stay “good”?
USF Department of Computer Science
Motivation
• Desktop is outdated
• Digital libraries abound, but no tools (INFORMATION OVERLOAD)
• Avoiding a Googleopoly
• Peer-to-peer is a great new model, but sharing knowledge is hard.
USF Department of Computer Science
Research Tasks
Communication
Blog
IM
Web page
Knowledge Creation
Bookmark
Create note/doc
Put in folder
Add link
USF Department of Computer Science
The Big Question
How much of the information hidden
within your personal web is hidden due
to privacy concerns?
USF Department of Computer Science
Associative Thinking Agent
Personal Personal Personal General-Purpose Search Engine
Specialized Search Engines
Associative Thinking Agent
Figure 1. The proposed system allows any entity that can provide associative data to interested parties to register as associative thinking agent source. This includes general-purpose search engines like Google, specialized search engines like the Internet Archive, and ordinary users of personal computers. Associative thinking agents, residing on personal computers, help the user identify sources and information relevant to the user’s current task.
Source list
USF Department of Computer Science
Personal Metadata
Cassini server
PW Service
WebTop
Client
Personal Computer 1
File System
Personal Metadata
Cassini server
PW Service
WebTop
Client
Personal Computer 2
File System
Assoc SourceUDDI Registry
WTGoogle
WTTechnorati
USF Department of Computer Science
WebTop Client
• Tree View of results
• Information processing on node expansion
• Choose from dynamic list of sources
• N-Degree Heterogeneous Associations
USF Department of Computer Science
Associations
• Outward Links (hyperlinks)
• Inward Links
• Similar Content Links
• Collaborative filtering Links
USF Department of Computer Science
More Associations
• General Research: AuthorOf, collaborator, co-cited, home page of,…
• Domain-Specific– Films: Acted in, Directed– Campaigns: Gave money to
USF Department of Computer Science
Separate Association Types
Inward Links Browser Outward Links
Content-Similar
USF Department of Computer Science
Associative Source API and Registry
• A common search/link info API
• A central UDDI registry
• Allows for meta-search clients with dynamic lists of sources.
Clients can access even those sources created after the client itself.
USF Department of Computer Science
Organizational Sources
• Web Service Wrappers
• Wrappers implement one or more of the Associative Source API methods
WebTop Client
WTGoogleAssoc Service
Google Web Service
USF Google
USF Department of Computer Science
Scenarios
• Search within the boomarks/links/papers of an expert
• When open one of your own papers, see newly created documents that cite it, that have similar content
• Easily navigate paper-author trail
USF Department of Computer Science
The Personal Web
2nd Degree
1st Degree
Nth Degree
PersonalWeb
Personal Space consists of highly interrelated files, i.e. a personal web
The Web should be personalized to each user, i.e., a personal web.
Personal Web-- local documents and “bookmarks”
N-degree Neighborhood– can be reached with n clicks.
USF Department of Computer Science
You can be a Search Engine
• Personal Web Analyzer– In/out links between documents– Full-text index– Characteristic words– Person/proper names in document
• Access between personal webs– Automatically installed mini-server– Auto-registration of web service
USF Department of Computer Science
Connecting Personal Webs
People create knowledge-- bookmarking, annotating, linking, synthesizing-- every day.
Much of it is never communicated
USF Department of Computer Science
Future Work
• Automated source discovery
• Alternative page ranking– Source reputation measures– In-links from groups of people, e.g.,
CS Professors
USF Department of Computer Science
Get Involved
• http://webtop.cs.usfca.edu
USF Department of Computer Science
Web Services
• Programmatic interface to web data
• Software agent access to data
• Helps separate View/Model
• More robust than “web scrapers”
• XML/Soap
• Rest-ful Interfaces
USF Department of Computer Science
Web Services
• Web services allow clients to aggregate information from various sources, process, then display in a custom fashion.
• More robust than scraping web pages
client
source
source
GUI