Cooperative Database Caching within Cloud Environments

Andrei Vancea1, Guilherme Sperb Machado1, Laurent d’Orazio2, Burkhard Stiller1

1 Department of Informatics IFI, Communication Systems Group CSG, University of Zürich UZH, Switzerland

2Blaise Pascal University - LIMOS, Francevancea,stiller@ifi.uzh.ch, laurent.dorazio@isima.fr

AIMS, Luxembourg, Luxembourg, June 6, 2012

Background

Databases – Client: asks a query (SQL)– Server: returns the result (tuples)

Client-side caching– Page Caching, Tuple Caching – Semantic Caching

• Clients store the results of old queries

• Old results used for answering new queries

Background - Semantic Caching

QUERYREWRITING

Probe Remainder

Semanticcache

Server

Queriesdescriptions

Semantic Regions– Query description– Result set

Query rewriting– Probe– Remainder

Database Caching & Cloud Computing

Most cloud providers charge data transfer between cloud environment and “outside world” in a pay-as-you-go matter

Database caching within cloud environment– Improves performance– Economic benefits

• Amount of data transferred decreases

Payments for data transferred reduced

Approach

Cooperative Semantic Caching

Share local semantic caches between clients

Use cache entries of other clients

Performance improvements

Cooperative Semantic Caching

Q1 : select * from persons where age > 10

Q3 : select * from persons where age > 7

result

select * from persons where age > 7 and age <= 10

R1 : age > 10

result

resultselect * from R1

Potential Use Cases

GIS (Geographic Information System) storage– Large amount of data (e.g. seismic events)– Processing done on client side – Two-dimensional range selections (area)

NetFlow-based architectures– Routers collect flow records and store them in databases– Analyzers (intrusion detection, accounting,… ) access them– Range selections (Start Time, IP)

Query Rewriting

Query rewriting– Probe– Remote probes– Remainder QUERY

REWRITING

Probe Remainder

LocalSemantic

Server

All queriesdescriptions

Remote probe

RemoteSemantic

Remote probe

RemoteSemantic

System Design

CoopSC

CoopCooperative SSemantic CCaching Query types

– Selection (n-Dimensional range predicates)– select id, name, age from persons where 20 < age and

age < 30 Cache organization

– Semantic regions– Distributed Index – built on top of a P2P overlay

CoopSC - Query Rewriting

Local Rewriting– Probe

– Local Remainder

• Portion of the query which is

not available in the local cache

Distributed Rewriting– Remote Probes

– Remainder

Local Cache

RemoteProbe

Remainder

Local Rewriting

Local Remainder

Distributed RewritingDistributed

Distributed Index

Built on top of P2P overlay Regions and queries represented as

rectangular shapes MX-CIF Quad Tree

– Efficiently find intersection between rectangular shapes

Each region is indexed in the smallest quad which totally contains it

Easy to adapt to n-Dimensional regions/queries

Update Handling

Issues– Invalidation of old entries– Combining different snapshots can generate inconsistencies

Quad space division (specified update level) Virtual timestamps stored in database Each modification increments the virtual timestamp of

corresponding quad Regions store virtual timestamps of quads that they

intersect

Cloud Computing Scenarios

Cloud Scenario A

Database server running outside the cloud

Clients located inside in the cloud

Non-operational use cases– Example: cloud environment

used for running scientific experiments

Cloud Scenario B

Database server running inside the cloud

Clients located inside in the cloud

Operational use cases– Example: corporation

using cloud environment as an alternative to building a datacenter

Evaluation

Experiment Design

Measurements– Response time– Amount of data transferred– Payments for data transfer

Experiments – Cache size– Update level

Testing sessions– 5 select testing sessions (50 queries each)– Update sessions interleaved

Evaluation

Wisconsin benchmark dataset (10.000.000 tuples) Scenario A

– Database Server: Zurich testbed– 5 Client: Rackspace

Scenario B– Database server

• Amazon EC2

– 5 Clients: EmanicsLab Queries

– About 10.000 tuples– Semantic locality

Scenario A

Data transferred/Payments

CoopSC significantly reduces the number of tuples sent by database server

Amount of money also reduced

Response Time

Rackspace behaves unstable

No performance improvements noticed

Scenario B

Data transferred/Payments

CoopSC significantly reduces the number of tuples sent by database server

Bandwidth payments also reduced

Response Time

CoopSC improves response time

Data transferred/Payments (Updates)

Good behavior for low update rate

Economic and performance benefits

Response Times (Updates)

Response increases with the grow of update rate

Summary & Conclusion

Summary– Cooperative caching approach used for reducing the load of

the database server

– Update statements supported

– CoopSC applied in the context of cloud environments CoopSC reduces the amount of data transferred

between cloud and outside world which has economic benefits

Performance benefits as long as cloud providers are stable

Questions?

Update Handling - Algorithm

procedure Execute(query)quads = query.getIntersecteQuad(updateLevel);

before = database.getTimestamps(quads);

plan = rewrite(query, before);result = plan.execute();

after = database.getTimestamps(quads);

if (before == after) return result;

elseresult database.execute(query);

Cooperative Database Caching within Cloud Environments

Technology

Shark: Scaling File Servers via Cooperative Caching

NANOG17 – Distributed Caching in Large IP Environments Adrian Chadd & Andrew Khoo - 10th October 1999 Distributing Caching on Large IP Networks Adrian

CCNxCon2012: Session 5: HTTP/CCN Gateway and Cooperative Caching Demonstrator

Dynamic-Content Web Caching with Cooperative Proxy Scheme

On the Scale and Performance of Cooperative Web Proxy Caching

Comparison of Cooperative Caching Strategies in Mobile Ad-Hoc Network (MANET)

Cooperative caching in mobile ad hoc networks …downloads.hindawi.com/journals/misy/2007/193641.pdfMobile Information Systems 3 (2007) 19–37 19 IOS Press Cooperative caching in

Cooperative Xpath Caching

Online Caching and Cooperative Forwarding in Information Centric Networkingnetworking.khu.ac.kr/layouts/net/publications/data/2018... · 2019-01-09 · Online Caching and Cooperative

Cooperative Caching in Wireless Multimedia Sensor Nets

Age-based Cooperative Caching in Information-Centric Networks

INTRODUCING A DATA SLIDING MECHANISM FOR COOPERATIVE CACHING … · 2015. 1. 26. · COOPERATIVE CACHING Aggregate Low access latency of private caches High storage capacity of shared

Impact of the Mobility Model on a Cooperative Caching ...cdn.intechweb.org/pdfs/12887.pdfImpact of the Mobility Model on a Cooperative Caching Scheme for Mobile Ad Hoc Networks 267

A Cooperative Caching Scheme Based on Mobility …j_deng/papers/ppm_tvt18.pdf1 A Cooperative Caching Scheme Based on Mobility Prediction in Vehicular Content Centric Networks Lin Yao

Improving Cooperative Caching Using Importance Aware Bloom ... · Improving Cooperative Caching Using Importance Aware Bloom Filter Pavan Kumar Pallerla pp4475@rit.edu ABSTRACT This

Cooperative navigation of unknown environments using potential

Cooperative Positioning in Urban Environments: Opportunities and Challenges · Cooperative Positioning in Urban Environments: Opportunities and Challenges. Event Name, Date and Location

Mistreatment-Resilient Distributed Caching words: Cooperative Caching, Service Overlay Networks, Peer-to-Peer Networks, Control Theory, Performance Evaluation. 1 Introduction Background

High performance, low complexity cooperative …delab.csd.auth.gr/papers/WINET11dktm.pdfHigh performance, low complexity cooperative caching for wireless sensor networks Nikos Dimokas

Coding-based Cooperative Caching in On-demand …chiychow/papers/INS_2017.pdfCoding-based Cooperative Caching in On-demand Data Broadcast Environments Houling Jia,b, Victor C.S. Leeb,