42
USF Department of Computer Science Peer-to-Peer Knowledge Sharing David Wolber

USF Department of Computer Science Peer-to-Peer Knowledge Sharing David Wolber

  • View
    218

  • Download
    0

Embed Size (px)

Citation preview

USF Department of Computer Science

Peer-to-Peer Knowledge Sharing

David Wolber

USF Department of Computer Science

USF Department of Computer Science

Associative Thinking Assistant

USF Department of Computer Science

Motivation

• Desktop is outdated

• Digital libraries abound, but no tools (INFORMATION OVERLOAD)

• Avoiding a Googleopoly

• Peer-to-peer is a great new model, but sharing knowledge is hard.

USF Department of Computer Science

Sneak Preview

• Develop webtop, not desktop tools

• A common search API and registry

• Zero-input publishing

• Open PC Movement

USF Department of Computer Science

Today’s Desktop

BookMark File Manager Hyper-LinksManager

WebSphinx (Miller, Bharat)

USF Department of Computer Science

Desktop tools are outdated

• Filing cabinet metaphor no longer fits

• Unnecessary barrier between local system and web

• Computer is a communication tool, a node in the knowledge base of the world

USF Department of Computer Science

I see the documentsWhere are the people?

USF Department of Computer Science

Motivation

• Desktop is outdated

• Digital libraries abound, but no tools (INFORMATION OVERLOAD)

• Avoiding a Googleopoly

• Peer-to-peer is a great new model, but sharing knowledge is hard.

USF Department of Computer Science

Information Sources

• Google, Amazon, etc.

• History: Way-back machine

• Technorati– The World Live Web

• Domain Specific:– ACM Digital Library– Citeseer– MLA for literature

USF Department of Computer Science

www.technorati.com

USF Department of Computer Science

CiteSeer: http://citeseer.ist.psu.edu/

USF Department of Computer Science

ACM

USF Department of Computer Science

Internet Archive, Wayback

USF Department of Computer Science

They all provide similar functionality, but…

USF Department of Computer Science

Motivation

• Desktop is outdated

• Digital libraries abound, but no tools (INFORMATION OVERLOAD)

• Avoiding a Googleopoly

• Peer-to-peer is a great new model, but sharing knowledge is hard.

USF Department of Computer Science

Media Conglomeration

• FCC lessening anti-trust laws

• Most media now owned by a handful of companies

• How many of us go through Google to get our information.

• Can we trust Google to stay “good”?

USF Department of Computer Science

Motivation

• Desktop is outdated

• Digital libraries abound, but no tools (INFORMATION OVERLOAD)

• Avoiding a Googleopoly

• Peer-to-peer is a great new model, but sharing knowledge is hard.

USF Department of Computer Science

Our Virtual Brains are modeled after our real ones

USF Department of Computer Science

Research Tasks

Communication

Email

Blog

IM

Web page

Knowledge Creation

Bookmark

Create note/doc

Put in folder

Add link

USF Department of Computer Science

The Big Question

How much of the information hidden

within your personal web is hidden due

to privacy concerns?

USF Department of Computer Science

Associative Thinking Agent

Personal Personal Personal General-Purpose Search Engine

Specialized Search Engines

Associative Thinking Agent

Figure 1. The proposed system allows any entity that can provide associative data to interested parties to register as associative thinking agent source. This includes general-purpose search engines like Google, specialized search engines like the Internet Archive, and ordinary users of personal computers. Associative thinking agents, residing on personal computers, help the user identify sources and information relevant to the user’s current task.

Source list

USF Department of Computer Science

Personal Metadata

Cassini server

PW Service

WebTop

Client

Personal Computer 1

File System

Personal Metadata

Cassini server

PW Service

WebTop

Client

Personal Computer 2

File System

Assoc SourceUDDI Registry

WTGoogle

WTTechnorati

USF Department of Computer Science

WebTop Client

• Tree View of results

• Information processing on node expansion

• Choose from dynamic list of sources

• N-Degree Heterogeneous Associations

USF Department of Computer Science

Associations

• Outward Links (hyperlinks)

• Inward Links

• Similar Content Links

• Collaborative filtering Links

USF Department of Computer Science

More Associations

• General Research: AuthorOf, collaborator, co-cited, home page of,…

• Domain-Specific– Films: Acted in, Directed– Campaigns: Gave money to

USF Department of Computer Science

Separate Association Types

Inward Links Browser Outward Links

Content-Similar

USF Department of Computer Science

IOC Tree

IOC Tree

I --- C – O --I

Browser

USF Department of Computer Science

Associative Source API and Registry

• A common search/link info API

• A central UDDI registry

• Allows for meta-search clients with dynamic lists of sources.

Clients can access even those sources created after the client itself.

USF Department of Computer Science

Organizational Sources

• Web Service Wrappers

• Wrappers implement one or more of the Associative Source API methods

WebTop Client

WTGoogleAssoc Service

Google Web Service

USF Google

USF Department of Computer Science

Scenarios

• Search within the boomarks/links/papers of an expert

• When open one of your own papers, see newly created documents that cite it, that have similar content

• Easily navigate paper-author trail

USF Department of Computer Science

The Personal Web

2nd Degree

1st Degree

Nth Degree

PersonalWeb

Personal Space consists of highly interrelated files, i.e. a personal web

The Web should be personalized to each user, i.e., a personal web.

Personal Web-- local documents and “bookmarks”

N-degree Neighborhood– can be reached with n clicks.

USF Department of Computer Science

You can be a Search Engine

• Personal Web Analyzer– In/out links between documents– Full-text index– Characteristic words– Person/proper names in document

• Access between personal webs– Automatically installed mini-server– Auto-registration of web service

USF Department of Computer Science

Connecting Personal Webs

People create knowledge-- bookmarking, annotating, linking, synthesizing-- every day.

Much of it is never communicated

USF Department of Computer Science

Future Work

• Automated source discovery

• Alternative page ranking– Source reputation measures– In-links from groups of people, e.g.,

CS Professors

USF Department of Computer Science

Get Involved

• http://webtop.cs.usfca.edu

[email protected]

USF Department of Computer Science

WIKIs

USF Department of Computer Science

Lawrence Lessig’s Blog

• http://www.lessig.org/blog/

USF Department of Computer Science

Web Services

• Programmatic interface to web data

• Software agent access to data

• Helps separate View/Model

• More robust than “web scrapers”

• XML/Soap

• Rest-ful Interfaces

USF Department of Computer Science

Web Services

• Web services allow clients to aggregate information from various sources, process, then display in a custom fashion.

• More robust than scraping web pages

client

source

source

GUI

USF Department of Computer Science

XML/Soap Web Services

• XML/Soap

• WSDL

• UDDI

USF Department of Computer Science

Know more about yourself

Analysis of file system– Linking between documents– inward links– characteristic words of documents– what people are referred to– what web pages