Peer to Peer Technologies
Roy Werber, Idan Gelbourt
Prof. Sagiv's Seminar
The Hebrew University of Jerusalem, 2001
Lecture Overview
1st Part: The P2P communication model, architecture and applications
2nd Part: Chord and CFS
Peer to Peer - Overview
A class of applications that takes advantage of resources - storage, CPU cycles, content, human presence - available at the edges of the Internet
A decentralized system that must cope with the unstable nature of computers located at the network edge
Client/Server Architecture
An architecture in which each process is a client or a server
Servers are powerful computers dedicated to providing services: storage, traffic, etc.
Clients rely on servers for resources
Client/Server Properties
Big, strong server
Well-known port/address of the server
Many-to-one relationship
Different software runs on the client and the server
The client can be dumb (lacks functionality); the server performs the work for it
The client usually initiates the connection
Client Server Architecture
(Diagram: several clients connected to a single server through the Internet)
Client/Server Architecture
GET /index.html HTTP/1.0   (client → server)
HTTP/1.1 200 OK ...   (server → client)
Disadvantages of C/S Architecture
Single point of failure
Strong, expensive server
Dedicated maintenance (a sysadmin)
Not scalable: more users mean more servers
Solutions
Replication of data (several servers)
Problems: redundancy, synchronization, expensive
Brute force (a bigger, faster server)
Problems: not scalable, expensive, still a single point of failure
The Client Side
Although the model hasn't changed over the years, the entities in it have
Today's clients can perform more roles than just forwarding users' requests
Today's clients have: more computing power, storage space
Thin Client
Performs simple tasks: I/O
Properties: cheap, limited processing power, limited storage
Fat Client
Can perform complex tasks: graphics, data manipulation, etc.
Properties: strong computation power, bigger storage, more expensive than a thin client
Evolution at the Client Side
'70: DEC's VT100, no storage
'80: IBM PC @ 4.77 MHz, 360 KB diskettes
2001: A PC @ 2 GHz, 40 GB HD
What Else Has Changed?
The number of home PCs is increasing rapidly
PCs with dynamic IPs
Most of the PCs are "fat clients"
Software cannot keep up with hardware development
As Internet usage grows, more and more PCs are connecting to the global net
Most of the time, PCs are idle
How can we use all this?
Sharing
Definition:
1. To divide and distribute in shares
2. To partake of, use, experience, occupy, or enjoy with others
3. To grant or give a share in (intransitive senses)
Merriam-Webster's online dictionary (www.m-w.com)
There is a direct advantage of a co-operative network versus a single computer
Resources Sharing
What can we share? Computer resources
Shareable computer resources:
"CPU cycles" - SETI@home
Storage - CFS
Information - Napster / Gnutella
Bandwidth - Crowds
SETI@Home
SETI - Search for ExtraTerrestrial Intelligence
@Home - on your own computer
A radio telescope in Puerto Rico scans the sky for radio signals
It fills a 35 GB DAT tape in 15 hours
That data has to be analyzed
SETI@Home (cont.)
The problem – analyzing the data requires a huge amount of computation
Even a supercomputer cannot finish the task on its own
Accessing a supercomputer is expensive
What can be done?
SETI@Home (cont.)
Can we use distributed computing? YEAH
Fortunately, the problem can be solved in parallel. Examples:
Analyzing different parts of the sky
Analyzing different frequencies
Analyzing different time slices
SETI@Home (cont.)
The data can be divided into small segments
A PC is capable of analyzing a segment in a reasonable amount of time
An enthusiastic UFO searcher will lend his spare CPU cycles to the computation
When? While the screensaver is running
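A minimal Python sketch of the idea of cutting a large recording into independent segments and letting idle volunteer machines analyze them one at a time. The segment size, data, and analysis function are made-up placeholders, not SETI@Home's actual protocol:

```python
# Hypothetical sketch of work-unit distribution, not SETI@Home's real protocol.
from queue import Queue

SEGMENT_SIZE = 1024          # assumed size of one work unit, in samples

def split_into_segments(data):
    """Cut the recorded signal into independent, equally sized segments."""
    return [data[i:i + SEGMENT_SIZE] for i in range(0, len(data), SEGMENT_SIZE)]

def analyze(segment):
    """Stand-in for the real signal analysis a volunteer PC would run."""
    return max(segment) if segment else 0

# The server queues segments; each idle PC pulls one, analyzes it, reports back.
work_queue = Queue()
for seg in split_into_segments(list(range(10_000))):
    work_queue.put(seg)

results = []
while not work_queue.empty():    # in reality, many PCs pull concurrently
    results.append(analyze(work_queue.get()))

print(f"{len(results)} segments analyzed")
```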
SETI@Home - Example
SETI@Home - Summary
SETI reverses the C/S model
Clients can also provide services
Servers can be weaker, used mainly for storage
Distributed peers serving the center
Not yet P2P, but we're close
Outcome - great results:
Thousands of unused CPU hours tamed for the mission
3+ million users
What Exactly is P2P?
A distributed communication model with these properties:
All nodes have identical responsibilities
All communication is symmetric
P2P Properties
Cooperative, direct sharing of resources
No central servers
Symmetric clients
(Diagram: clients connected directly to one another through the Internet, with no server)
P2P Advantages
Harnesses client resources
Scales with new clients
Provides robustness under failures
Redundancy and fault tolerance
Immune to DoS
Load balance
P2P Disadvantages -- A Tough Design Problem
How do you handle a dynamic network (nodes join and leave frequently)?
A number of constraints and uncontrolled variables:
No central servers
Clients are unreliable
Clients vary widely in the resources they provide
Heterogeneous network (different platforms)
Two Main Architectures
Hybrid Peer-to-Peer: preserves some of the traditional C/S architecture. A central server links clients, stores index tables, etc.
Pure Peer-to-Peer: all nodes are equal and no functionality is centralized
Hybrid P2P
A main server is responsible for various administrative operations:
Users' login and logout
Storing metadata
Directing queries
Example: Napster
Examples - Napster
Napster is a program for sharing information (mp3 music files) over the Internet
Created by Shawn Fanning in 1999, although similar services already existed (but lacked popularity and functionality)
Napster Sharing Style: hybrid center+edge
(Diagram: three users and their libraries - "beastieboy" with song1-3.mp3, "kingrook" with song4-6.mp3, "slashdot" with song5-7.mp3 - connected to a central Napster server whose directory lists each song title, the user holding it, and that user's connection speed: DSL, T1, or 28.8)
1. Users launch Napster and connect to a Napster server
2. Napster creates a dynamic directory from the users' personal .mp3 libraries
3. beastieboy enters search criteria (song5)
4. Napster displays the matches to beastieboy
5. beastieboy makes a direct connection to kingrook for the file transfer
What About Communication Between Servers?
Each Napster server creates its own mp3 exchange community: rock.napster.com, dance.napster.com, etc.
This creates a separation, which is bad
We would like multiple servers to share a common ground
That reduces the centralized nature of each server and expands searchability
Various HP2P Models - 1. Chained Architecture
Chained architecture - a linear chain of servers
Clients log in to a random server
Queries are submitted to that server
If the server satisfies the query - done
Otherwise - the query is forwarded to the next server
Results are forwarded back to the first server
The server merges the results
The server returns the results to the client
Used by the OpenNap network
2. Full Replication Architecture
Replication of constantly updated metadata
A client logs on to a random server
That server sends the updated metadata to all servers
Result: all servers can answer queries immediately
3. Hash Architecture
Each server holds a portion of the metadata
Each server holds the complete inverted list for a subset of all words
A client directs a query to a server that is responsible for at least one of the keywords
That server gets the inverted lists for all the keywords from the other servers
The server returns the relevant results to the client
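A minimal Python sketch of this hashed-metadata idea: a keyword's owning server is picked by hashing the keyword, each server keeps the full inverted list for its keywords, and the queried server intersects the lists. The server count, data structures, and function names are illustrative, not any real OpenNap/Napster API:

```python
# Hypothetical sketch: each server owns the inverted lists for the keywords
# that hash to it.
import hashlib

NUM_SERVERS = 4

def server_for(keyword):
    """Pick the server responsible for a keyword by hashing it."""
    digest = hashlib.sha1(keyword.encode()).digest()
    return int.from_bytes(digest, "big") % NUM_SERVERS

# Each server's slice of the inverted index: keyword -> set of file names.
index = [dict() for _ in range(NUM_SERVERS)]

def publish(filename, keywords):
    for kw in keywords:
        index[server_for(kw)].setdefault(kw, set()).add(filename)

def query(keywords):
    # The query goes to the server owning one keyword; that server gathers the
    # other keywords' lists from their owners and intersects them.
    lists = [index[server_for(kw)].get(kw, set()) for kw in keywords]
    return set.intersection(*lists) if lists else set()

publish("song5.mp3", ["rock", "guitar"])
publish("song6.mp3", ["rock", "dance"])
print(query(["rock", "guitar"]))   # {'song5.mp3'}
```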
4. Unchained Architecture
Independent servers which do not communicate with each other
A client who logs on to one server can only see the files of other users at the same local server
A clear disadvantage of separating users into distinct domains
Used by Napster
Pure P2P
All nodes are equal
No centralized server
Example: Gnutella
A completely distributed P2P network
The Gnutella network is composed of clients
Client software is made of two parts:
A mini search engine - the client
A file serving system - the "server"
Relies on broadcast search
Gnutella - Operations
Connect - establishing a logical connection
Ping/Pong - discovering new nodes (my friends' friends)
Query - looking for something
Download - downloading files (simple HTTP)
Gnutella – Form an Overlay
(Diagram: a new node sends Connect and gets OK from a neighbour; Pings are forwarded through the overlay and Pongs travel back, so the new node learns about its neighbours' neighbours)
How to find a node?
Initially, ad hoc ways: email, online chat, news groups...
Bottom line: you've got to know someone!
Set up some long-lived nodes
A newcomer contacts the well-known nodes
Useful for building a better overlay topology
Gnutella – Search
(Diagram: a node broadcasts the query "Green Toad"; the query is forwarded hop by hop, and two nodes, A and B, answer "I have". Toad A looks nice; Toad B is too far)
On a larger scale, things get more complicated
Gnutella – Scalability Issue
Can the system withstand flooding from every node?
Use a TTL to limit the range of propagation
5^5 = 3125 - how many nodes can you reach?
This creates a "horizon" of computers
The promise: you can expect the horizon to change every time you log in
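A back-of-the-envelope sketch of the horizon size in Python. The degree and TTL of 5 are the slide's illustration; the second formula assumes paths never overlap, so both numbers are optimistic upper bounds:

```python
# Rough horizon estimate, assuming each node forwards to `degree` neighbours
# and forwarding paths never overlap (an upper bound in practice).
def horizon(degree, ttl):
    # nodes reached at hop i (excluding the originator): degree * (degree - 1)^(i - 1)
    return sum(degree * (degree - 1) ** (i - 1) for i in range(1, ttl + 1))

print(5 ** 5)          # the slide's rough figure: 3125
print(horizon(5, 5))   # a slightly more careful count: 1705
```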
The Differences
While the pure P2P model is completely symmetric, in the hybrid model elements of both PP2P and C/S coexist
Each model has its disadvantages:
PP2P still has problems locating information
HP2P has scalability problems, as with ordinary server-oriented models
P2P – Summary
The current setting has allowed P2P to enter the world of PCs
It controls the niche of resource sharing
The model is being studied from both the academic and the commercial point of view
There are still problems out there...
End Of Part I
Part II
Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications
Robert Morris, Ion Stoica, David Karger, M. Frans Kaashoek, Hari Balakrishnan - MIT and Berkeley
Presented by Roy Werber and Idan Gelbourt
A P2P Problem
Every application in a P2P environment must handle an important problem:
The lookup problem
What is the problem?
A Peer-to-peer Storage Problem
1,000 scattered music enthusiasts
Willing to store and serve replicas
How do you find the data?
The Lookup Problem
(Diagram: nodes N1-N6 scattered across the Internet; a publisher stores key="title", value=MP3 data at one node, and a client elsewhere issues Lookup("title"))
In a dynamic network with N nodes, how can the data be found?
Centralized Lookup (Napster)
(Diagram: a central DB; the publisher at N4 registers SetLoc("title", N4), and the client's Lookup("title") is answered by the DB, which points it to N4, the node holding key="title", value=MP3 data)
Simple, but O(N) state and a single point of failure
Hard to keep the data in the server up to date
Flooded queries (Gnutella)
(Diagram: the client's Lookup("title") is flooded from node to node across N1-N9 until it reaches N4, the publisher holding key="title", value=MP3 data)
Robust, but worst case O(N) messages per lookup
Not scalable
So Far
Centralized:
- Table size - O(N)
- Number of hops - O(1)
Flooded queries:
- Table size - O(1)
- Number of hops - O(N)
We Want
Efficiency: O(log N) messages per lookup, where N is the total number of servers
Scalability: O(log N) state per node
Robustness: surviving massive failures
How Can It Be Done?
How do you search in O(log N) time?
Binary search - but you need an ordered array
How can you order nodes in a network and data items?
Hash function!
Chord: Namespace
The namespace is a fixed-length bit string
Each object is identified by a unique ID
How to get the ID? Hash it:
"Shark" --SHA-1--> Object ID: DE11AC
194.90.1.5:8080 --SHA-1--> ID: AABBCC
Chord Overview
Provides just one operation - a peer-to-peer hash lookup:
Lookup(key) → IP address
Chord does not store the data
Chord is a lookup service, not a search service
It is a building block for P2P applications
Chord IDs
Uses a hash function:
Key identifier = SHA-1(key)
Node identifier = SHA-1(IP address)
Both are uniformly distributed
Both exist in the same ID space
How do we map key IDs to node IDs?
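A minimal Python sketch of deriving both kinds of identifiers with SHA-1. Truncating to a 7-bit space is only to match the slides' examples; real Chord uses the full 160 bits, and the helper name is ours:

```python
# Minimal sketch: keys and nodes share one ID space derived from SHA-1.
import hashlib

M = 7                      # illustrative ID-space size (7 bits, as in the slides)

def chord_id(text, m=M):
    """SHA-1 hash truncated to an m-bit identifier."""
    digest = hashlib.sha1(text.encode()).digest()
    return int.from_bytes(digest, "big") % (2 ** m)

key_id  = chord_id("Shark")            # a data item
node_id = chord_id("194.90.1.5:8080")  # a node, hashed by its IP address
print(key_id, node_id)                 # both fall in the same 0..127 space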
Mapping Keys To Nodes
Consistent hashing [Karger 97]
(Diagram: a circular 7-bit ID space, 0..127, with nodes N32, N90, N105 and keys K5, K20, K80 placed on the ring)
A key is stored at its successor: the node with the next higher ID
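A minimal Python sketch of the successor rule over the ring shown above; the node and key IDs come from the diagram, while the helper names are ours, not Chord's API:

```python
# Minimal sketch of "a key is stored at its successor" on a circular ID space.
import bisect

RING_BITS = 7
nodes = sorted([32, 90, 105])          # node IDs from the slide

def successor(key_id):
    """First node whose ID is >= key_id, wrapping around the ring."""
    i = bisect.bisect_left(nodes, key_id)
    return nodes[i % len(nodes)]       # wrap back to the smallest node ID

for key in (5, 20, 80):
    print(f"K{key} is stored at N{successor(key)}")
# K5 -> N32, K20 -> N32, K80 -> N90
```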
Basic Lookup
(Diagram: nodes N10, N32, N60, N90, N105, N120 on the ring; the question "Where is key 80?" is passed from node to node along successor pointers until the answer comes back: "N90 has K80")
“Finger Table” Allows Log(n)-time Lookups
(Diagram: from N80, fingers point 1/2, 1/4, 1/8, ..., 1/128 of the way around the circular 7-bit ID space)
N80 knows of only seven other nodes
Finger i Points to the Successor of N+2^i
(Diagram: for example, N80's finger for 80 + 32 = 112 points to N120, the first node at or after 112)
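A tiny Python sketch of the finger targets for N80 in the 7-bit space; the indexing (i = 0..m-1) is one common convention and is an assumption of this sketch:

```python
# Finger i of node n points at the successor of (n + 2^i) mod 2^m.
M = 7            # 7-bit ID space, 0..127
n = 80

finger_targets = [(n + 2 ** i) % 2 ** M for i in range(M)]
print(finger_targets)    # [81, 82, 84, 88, 96, 112, 16]
# e.g. the finger for 112 points to N120, the first node at or after 112.
```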
Lookups Take O(log(n)) Hops
(Diagram: nodes N5, N10, N20, N32, N60, N80, N99, N110 on the ring; a node resolves Lookup(K19) in a few finger hops, roughly halving the remaining ID-space distance each time, until it reaches N20, the successor of K19)
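A small Python simulation of this routing idea on a static ring: every node keeps fingers at successor(n + 2^i), and a lookup repeatedly hops to the closest finger that still precedes the key. This is a sketch under those assumptions, not the paper's exact pseudocode:

```python
# Sketch of Chord routing on a static ring with full finger tables.
import bisect

M = 7
nodes = sorted([5, 10, 20, 32, 60, 80, 99, 110])    # ring from the slide

def successor(ident):
    i = bisect.bisect_left(nodes, ident % 2 ** M)
    return nodes[i % len(nodes)]

def in_interval(x, a, b):
    """True if x lies in the half-open ring interval (a, b]."""
    x, a, b = x % 2 ** M, a % 2 ** M, b % 2 ** M
    return (a < x <= b) if a < b else (x > a or x <= b)

fingers = {n: [successor(n + 2 ** i) for i in range(M)] for n in nodes}

def lookup(start, key):
    """Return (owner, hops): the node storing `key` and the hops taken."""
    n, hops = start, 0
    while not in_interval(key, n, successor(n + 1)):
        # hop to the finger closest to, but still preceding, the key
        candidates = [f for f in fingers[n] if in_interval(f, n, key - 1)]
        nxt = max(candidates, key=lambda f: (f - n) % 2 ** M) if candidates else successor(n + 1)
        if nxt == n:
            break
        n, hops = nxt, hops + 1
    return successor(key), hops

print(lookup(32, 19))   # (20, 3): N20 owns K19, reached in three finger hops
```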
Joining: Linked List Insert
(Diagram: nodes N25 and N40, with N40 holding keys K30 and K38)
1. N36 wants to join; it looks up its own ID (Lookup(36)) and finds its successor, N40
Join (2)
2. N36 sets its own successor pointer to N40
Join (3)
3. Keys 26..36 are copied from N40 to N36 (K30 moves to N36; K38 stays at N40)
Join (4)
4. N25's successor pointer is set to N36
Finger pointers are updated in the background
Correct successors produce correct lookups
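A toy Python sketch of these four steps on a successor-pointer ring, ignoring fingers, failures, and concurrent joins; the class and helper names are illustrative, not Chord's real interfaces:

```python
# Toy sketch of the join steps: find successor, set pointers, move keys.
class Node:
    def __init__(self, ident):
        self.id = ident
        self.successor = self
        self.keys = {}          # key id -> value

def between(x, a, b):
    """x in the ring interval (a, b]."""
    return (a < x <= b) if a < b else (x > a or x <= b)

def find_successor(start, ident):
    n = start
    while not between(ident, n.id, n.successor.id):
        n = n.successor          # a linear walk is enough for this sketch
    return n.successor

def join(new_node, predecessor):
    succ = find_successor(predecessor, new_node.id)   # 1. find the successor
    new_node.successor = succ                         # 2. set own successor
    # 3. keys that now belong to the new node move over from the successor
    for k in [k for k in succ.keys if between(k, predecessor.id, new_node.id)]:
        new_node.keys[k] = succ.keys.pop(k)
    predecessor.successor = new_node                  # 4. fix the predecessor

# N25 -> N40 ring holding K30 and K38; N36 joins between them.
n25, n40 = Node(25), Node(40)
n25.successor, n40.successor = n40, n25
n40.keys = {30: "K30", 38: "K38"}
n36 = Node(36)
join(n36, n25)
print(sorted(n36.keys), sorted(n40.keys))   # [30] [38]
```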
Join: Lazy Finger Update Is OK
(Diagram: N2's finger still points to N40 rather than the newly joined N36)
N2's finger should now point to N36, not N40
Lookup(K30) visits only nodes with IDs < 30, so a stale finger can only undershoot; the final successor step still reaches K30's owner
Failures Might Cause Incorrect Lookup
(Diagram: N80's successors N85, N102, and N113 have all failed; N10's Lookup(90) cannot proceed past N80)
N80 doesn't know its correct successor, so the lookup is incorrect
Solution: Successor Lists
Each node knows its r immediate successors
After a failure, it will know the first live successor
Correct successors guarantee correct lookups
The guarantee holds only with some probability
Choosing the Successor List Length
Assume 1/2 of the nodes fail
P(successor list all dead) = (1/2)^r
i.e. the probability that this node breaks the Chord ring (assuming independent failures)
P(no broken nodes) = (1 - (1/2)^r)^N
If we choose r = 2*log2(N), this probability is about 1 - 1/N
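The slide's arithmetic worked through in Python for N = 1,000 (the size used in the failure experiments later); the only assumption is independent node failures with probability 1/2:

```python
# The slide's arithmetic, assuming each node fails independently with prob. 1/2.
import math

N = 1000
r = round(2 * math.log2(N))                 # successor list length, about 20
p_ring_break_at_node = 0.5 ** r             # whole successor list dead, ~ 1/N^2
p_ring_intact = (1 - p_ring_break_at_node) ** N
print(r, p_ring_intact)                     # r = 20, probability about 1 - 1/N
```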
Chord Properties
O(log N) lookup messages and table space
Well-defined location for each ID
No search required
Natural load balance
No name structure imposed
Minimal join/leave disruption
Does not store documents...
Experimental Overview
Quick lookups in large systems
Low variation in lookup costs
Robust despite massive failures
See the paper for more results
Experiments confirm the theoretical results
Chord Lookup Cost Is O(log N)
(Plot: average messages per lookup vs. number of nodes; the cost grows logarithmically, with a constant of 1/2)
Failure Experimental Setup
Start 1,000 CFS/Chord servers
Successor lists have 20 entries
Wait until they stabilize
Insert 1,000 key/value pairs
Five replicas of each
Stop X% of the servers
Immediately perform 1,000 lookups
Massive Failures Have Little Impact
(Plot: failed lookups (percent) vs. failed nodes (percent); note that (1/2)^6 is 1.6%)
Chord Summary
Chord provides a peer-to-peer hash lookup
Efficient: O(log N) messages per lookup
Robust as nodes fail and join
A good primitive for peer-to-peer systems
http://www.pdos.lcs.mit.edu/chord
Wide-area Cooperative Storage With CFS
Robert Morris, Frank Dabek, M. Frans Kaashoek, David Karger, Ion Stoica
MIT and Berkeley
What Can Be Done With Chord
Cooperative mirroring
Time-shared storage: makes data available even when its owner is offline
Distributed indexes: support Napster-style keyword search
How to Mirror Open-source Distributions?
Multiple independent distributions
Each has a high peak load but a low average
Individual servers are wasteful
Solution: aggregate
Option 1: a single powerful server
Option 2: a distributed service
But how do you find the data?
Design Challenges
Avoid hot spots
Spread the storage burden evenly
Tolerate unreliable participants
Fetch speed comparable to whole-file TCP
Avoid O(#participants) algorithms, such as centralized mechanisms [Napster] and broadcasts [Gnutella]
CFS solves these challenges
CFS Overview
CFS - Cooperative File System:
A P2P read-only storage system
Read-only: only the owner can modify files
Completely decentralized
(Diagram: each node runs both a client and a server, connected through the Internet)
CFS - File System
A set of blocks distributed over the CFS servers
Three layers:
FS - interprets blocks as files (Unix V7 semantics)
DHash - performs block management
Chord - maintains the routing tables used to find blocks
Chord
Uses a 160-bit identifier space
Assigns an identifier to each node and block
Maps a block's ID to a node's ID
Performs key lookups (as we saw earlier)
Dhash – Distributed Hashing
Performs block management on top of Chord:
Block retrieval, storage, and caching
Provides load balance for popular files
Replicates each block at a small number of places (for fault tolerance)
CFS - Properties
Tested on a prototype:
Efficient
Robust
Load-balanced
Scalable
Downloads as fast as FTP
Drawbacks:
No anonymity
Assumes no malicious participants
Design Overview
(Diagram: an FS layer sits on top of DHash, which sits on top of Chord; each server runs DHash over Chord)
DHash stores, balances, replicates, and caches blocks
DHash uses Chord [SIGCOMM 2001] to locate blocks
Client-server Interface
Files have unique names
Files are read-only (single writer, many readers)
Publishers split files into blocks
Clients check files for authenticity
(Diagram: the FS client inserts or looks up a file through its local server, which inserts or looks up the corresponding blocks on the server nodes)
Naming and Authentication
1. The name could be a hash of the file's content
Easy for the client to verify
But an update requires a new file name
2. The name could be a public key
The document contains a digital signature
Allows verified updates with the same name
CFS File Structure
(Diagram: a root block, signed with the publisher's public key, points via H(D) to a directory block D; the directory block points via H(F) to an inode block F; the inode block points via H(B1) and H(B2) to data blocks B1 and B2)
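A minimal Python sketch of the content-hash idea behind this tree: data blocks are named by the SHA-1 of their contents, and an inode-like block lists those names, so a client can verify every block it fetches. The block size, the inode encoding, and the in-memory store are assumptions of this sketch, not CFS's exact formats:

```python
# Minimal sketch of content-hash block naming, in the spirit of the CFS tree.
import hashlib

BLOCK_SIZE = 8192                       # 8 KByte blocks, as in the experiments
store = {}                              # block id (hex SHA-1) -> block bytes

def put(block):
    bid = hashlib.sha1(block).hexdigest()
    store[bid] = block
    return bid

def publish(data):
    """Split a file into blocks, store them, and store an inode listing them."""
    ids = [put(data[i:i + BLOCK_SIZE]) for i in range(0, len(data), BLOCK_SIZE)]
    inode = "\n".join(ids).encode()     # toy inode: a list of child block ids
    return put(inode)

def fetch(inode_id):
    """Fetch the blocks and verify each one against the hash that names it."""
    data = b""
    for bid in store[inode_id].decode().splitlines():
        block = store[bid]
        assert hashlib.sha1(block).hexdigest() == bid, "corrupted block"
        data += block
    return data

root = publish(b"x" * 20000)            # 3 data blocks + 1 inode block
assert fetch(root) == b"x" * 20000
```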
File Storage
Data is stored for an agreed-upon finite interval
Extensions can be requested
There is no explicit delete command
After expiration, the blocks fade away
Storing Blocks
Long-term blocks are stored for a fixed time
Publishers need to refresh them periodically
The cache uses LRU (Least Recently Used) replacement
(Diagram: each server's disk holds a cache area alongside long-term block storage)
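A minimal Python sketch of LRU replacement for the cache area; the capacity, block IDs, and class name are illustrative only:

```python
# Minimal LRU sketch: the least recently used block is evicted first.
from collections import OrderedDict

class LRUCache:
    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()      # block id -> block data

    def get(self, bid):
        if bid not in self.blocks:
            return None
        self.blocks.move_to_end(bid)     # mark as most recently used
        return self.blocks[bid]

    def put(self, bid, data):
        self.blocks[bid] = data
        self.blocks.move_to_end(bid)
        if len(self.blocks) > self.capacity:
            self.blocks.popitem(last=False)   # evict the least recently used

cache = LRUCache(capacity=2)
cache.put("b1", b"...")
cache.put("b2", b"...")
cache.get("b1")                          # touch b1 so b2 becomes the LRU block
cache.put("b3", b"...")                  # evicts b2
print(list(cache.blocks))                # ['b1', 'b3']
```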
Replicate Blocks at k Successors
(Diagram: Block 17 is stored at its successor on the ring and replicated at the k nodes that follow it)
Replica failures are independent
Lookups Find Replicas
(Diagram: RPC steps of a lookup for Block 17: 1. Chord lookup step, 2. get the successor list, 3. a block fetch that fails at one replica, 4. a successful block fetch from another replica)
First Live Successor Manages Replicas
(Diagram: when the node holding Block 17 fails, the first live successor already has a copy of 17 and takes over managing the replicas)
DHash Copies to Caches Along Lookup Path
(Diagram: RPCs of a lookup for Block 45: 1-2. Chord lookups, 3. block fetch, 4. the block is sent back to be cached at the nodes along the lookup path)
Naming and Caching
(Diagram: block D30 is stored at N32; two clients look it up along paths that converge near N32)
Every hop covers a smaller distance in ID space, so lookup paths from different clients are likely to collide near the target
This makes caching along the path efficient
Caching Doesn’t Worsen Load
Only O(log N) nodes have fingers pointing to N32
This limits the single-block load on N32
Virtual Nodes Allow Heterogeneity – Load Balancing
Hosts may differ in disk/network capacity
Hosts may advertise multiple IDs, chosen as SHA-1(IP address, index)
Each ID represents a "virtual node"
A host's load is proportional to its number of virtual nodes
Controlled manually
(Diagram: one host advertises several virtual nodes, e.g. N10, N60, N101, while another advertises only N5)
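A minimal Python sketch of deriving several virtual-node IDs from one host by hashing (IP address, index); the exact encoding of the pair and the helper name are assumptions of this sketch:

```python
# Virtual node IDs: SHA-1 over (IP address, index), one ID per virtual node.
import hashlib

M = 160                                  # Chord's full 160-bit ID space

def virtual_node_ids(ip, count, m=M):
    ids = []
    for index in range(count):
        # the exact encoding of (IP, index) is an assumption of this sketch
        digest = hashlib.sha1(f"{ip}:{index}".encode()).digest()
        ids.append(int.from_bytes(digest, "big") % (2 ** m))
    return ids

# A well-provisioned host advertises more virtual nodes than a weak one.
print(len(virtual_node_ids("194.90.1.5", 4)))   # 4 IDs for the strong host
print(len(virtual_node_ids("10.0.0.7", 1)))     # 1 ID for the weak host
```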
Server Selection By Chord
(Diagram: while resolving Lookup(47), a node compares the measured RTTs to its fingers, e.g. 10 ms vs. 100 ms, before choosing the next hop)
Each node monitors the RTTs to its own fingers
Tradeoff: progress in ID space vs. delay
Why Blocks Instead of Files?
Cost: one lookup per block
The cost can be tailored by choosing a good block size
Benefit: load balance is simple
For large files, the storage cost is spread out
Popular files are served in parallel
CFS Project Status
Working prototype software
Some abuse-prevention mechanisms
Guarantees the authenticity of files, updates, etc.
A Napster-like interface is in the works
Decentralized indexing system
Some measurements on the RON testbed
Simulation results to test scalability
Experimental Setup (12 nodes)
One virtual node per host
8 KByte blocks
RPCs use UDP
Caching turned off
Proximity routing turned off
(Diagram: the 12 hosts span sites including CA-T1, CCI, Aros, Utah, CMU, MIT, MA-Cable, Cisco, Cornell, NYU, and OR-DSL, with links to vu.nl and Lulea.se)
CFS Fetch Time for 1MB File
Average over the 12 hosts
No replication, no caching; 8 KByte blocks
(Plot: fetch time in seconds vs. prefetch window in KBytes)
Distribution of Fetch Times for 1MB
(Plot: fraction of fetches vs. time in seconds, for 8, 24, and 40 KByte prefetch windows)
CFS Fetch Time vs. Whole File TCP
(Plot: fraction of fetches vs. time in seconds, comparing a 40 KByte prefetch window against whole-file TCP)
Robustness vs. Failures
(Plot: fraction of failed lookups vs. fraction of failed nodes, with six replicas per block; note that (1/2)^6 is 0.016)
Future work
Test load balancing with real workloads
Deal better with malicious nodes
Indexing
Other applications
CFS Summary
CFS provides peer-to-peer read-only storage
Structure: DHash and Chord
It is efficient, robust, and load-balanced
It uses block-level distribution
The prototype is as fast as whole-file TCP
http://www.pdos.lcs.mit.edu/chord
The End